>Q92AT0 2.4.1.333~~~~~~1,2-beta-oligoglucan phosphorylase~~~COG3459
MTMLKEIKKADLSAAFYPSGELAWLKLKDIMLNQVIQNPLENRLSQIYVRAHVGDKIEIYPLLSRDAEVGFNENGVEYRG
VVGPFRYSVQMHFHTRGWFYDVTVDGDLEFDLVYLQDLGLAEQAAVRTNEAYMSQYIDYHVTEGATGFTVQARQNQPQNE
RFPAVQIGALTKIVGYATDGFDIYGTNYKLTSELANLKEKSLPSRVYQYEFAQISLQTELFTNHGETIFYGYATENQPKA
SGAPFENLAELKSNISEQPYQPSTKAILNKHIGTPITGETISDSWLQENFPDRIQEEQQNGALLSFFTPNYAHVVMREKE
AELERPHGSILLDKVDVLNPEATLSATTYMYGAFLSQLVAGNTNMNKWNSHARNPLNILQTSGLRIYIELDSELRLLGVP
SVWETSTNYSTWYYQWNGDLITVQTTLTADSKEAFVTVHSEKGHSYKLVLTNQVTMGTNEYDTTVKKEIKDGIVTYFPAE
DSPILETYPALQFRVDGTYNELTDERYFAKDYVGTAGLDVFVFEPSDKATFHVQAKLSDEFSKPTEDLEANNKVIRASYD
ELTAQFHLNHQSTTAEKLNLTVYWYAHQMLVHYASPHGLEQYSGAAWGTRDVSQGPFEFFLATGNKAVLRKLVLTIFSHQ
YQDTGDWPQWFMFDKYTSIQQEESHGDVIVWPLKIIGDYLEMSGDAGILEEAIPFVDRESKTFTKEQGTLLEHIELAVKT
IEARFMKGTALSNYGDGDWDDTLQPANAQLKKNMVSSWTVALTYQTFKRLAAFLPVGEKYETLAKNVQADFAKYMTNDTD
VIPGFLYLEEGKAPVWMIHPEDKETNIKYRLIPLTRSVISELVDKKQASRNFEIIGEHLLHPDGVRLMSEPAHYAGGVST
HFKRAEQAANFGREVGLQYVHAHIRYIEALAKIGDKSAWHMLDVINPINIKEVVPNAALRQSNTYFSSSDAAFLDRYQAQ
NEFGRVKEGSIPVKGGWRIYSSGPGIYLHQLISSVLGIRQTEDALIFDPILPEELDGLECHIELDNYPLDLTFESADEGS
IVVNGEKQPVENGANLYRTGALILPKKNLTTKCSQITIKFQKNNRL
>Q8GBW6 2.1.3.1~~~~~~Methylmalonyl-CoA carboxyltransferase 12S subunit~~~
MAENNNLKLASTMEGRVEQLAEQRQVIEAGGGERRVEKQHSQGKQTARERLNNLLDPHSFDEVGAFRKHRTTLFGMDKAV
VPADGVVTGRGTILGRPVHAASQDFTVMGGSAGETQSTKVVETMEQALLTGTPFLFFYDSGGARIQEGIDSLSGYGKMFF
ANVKLSGVVPQIAIIAGPCAGGASYSPALTDFIIMTKKAHMFITGPQVIKSVTGEDVTADELGGAEAHMAISGNIHFVAE
DDDAAELIAKKLLSFLPQNNTEEASFVNPNNDVSPNTELRDIVPIDGKKGYDVRDVIAKIVDWGDYLEVKAGYATNLVTA
FARVNGRSVGIVANQPSVMSGCLDINASDKAAEFVNFCDSFNIPLVQLVDVPGFLPGVQQEYGGIIRHGAKMLYAYSEAT
VPKITVVLRKAYGGSYLAMCNRDLGADAVYAWPSAEIAVMGAEGAANVIFRKEIKAADDPDAMRAEKIEEYQNAFNTPYV
AAARGQVDDVIDPADTRRKIASALEMYATKRQTRPAKKHGNFPC
>P16536 ~~~~~~14 kDa peptide of ubiquinol-cytochrome c2 oxidoreductase complex~~~
MFSFIDDIPSFEQIKARVRDDLRKHGWEKRWNDSRLVQKSRELLNDEELKIDPATWIWKRMPSREEVAARRQRDFETVWK
YRYRLGGFASGALLALALAGIFSTGNFGGSSDAGNRPSVVYPIE
>Q2YKY9 ~~~~~~Lectin-like protein BA14k~~~
MNSFRKTCAGALALIFGATSIVPTVAAPMNMDRPAINQNVIQARAHYRPQNYNRGHRPGYWHGHRGYRHYRHGYRRHNDG
WWYPLAAFGAGAIIGGAISQPRPVYRAPAGSPHVQWCYSRYKSYRASDNTFQPYNGPRKQCRSPYSR
>E7CY70 3.2.1.-~~~afuB~~~Exo-alpha-(1->6)-L-arabinofuranosidase~~~COG3534
MSEHSHDNKPGELEESRLVVDNDFEVAPVNDRLFGSFVEHLGRCVYDGIYEPGHPEADEDGFRKDVIELVKELGATTIRY
PGGNFVSGYRWEDGVGPRDERPRRLDLAWHSTETNQFGLHEMAKWLEKTGGNELMEAVNLGTRGLEEALDLLEYANIPGG
TKLSEERRANGADQPFGIKMWCLGNEMDGPWQTGHKSAEDYGTLAASVAAGMRAIDPNVELVVCGSSSHVMDTFGKWEET
VLEKTFDNVNFVSCHAYYHPELQPDGTRDMKSFLASGVDMDGFINDVAAAIDATKARLKSKHDVFISFDEWNVWYLNEEP
SKNPEGIGNWPVAPRLLEDVYSAADAVVFGDLMITLLKNADRVHAASLAQLVNVIAPIMTEPGGPAWRQTTFYPFSLTAK
LAKGGTVLEPKLASGTYETDKYGEVPTINSVAVRGEDGTISVFVVNRSMEAANDFAIKLPEGFALSAVEAQTLHEDDLLA
KNTLEDQNRVVLHPNTTITSDADTGTVRVTLPPVSWTAVHVK
>Q5PWZ8 3.5.99.7~~~acdS~~~1-aminocyclopropane-1-carboxylate deaminase~~~
MNLNRFERYPLTFGPSPITPLKRLSEHLGGKVELYAKREDCNSGLAFGGNKTRKLEYLIPEAIEQGCDTLVSIGGIQSNQ
TRQVAAVAAHLGMKCVLVQENWVNYSDAVYDRVGNIEMSRIMGADVRLDAAGFDIGIRPSWEKAMSDVVERGGKPFPIPA
GCSEHPYGGLGFVGFAEEVRQQEKELGFKFDYIVVCSVTGSTQAGMVVGFAADGRSKNVIGVDASAKPEQTKAQILRIAR
HTAELVELGREITEEDVVLDTRFAYPEYGLPNEGTLEAIRLCGSLEGVLTDPVYEGKSMHGMIEMVRRGEFPDGSKVLYA
HLGGAPALNAYSFLFRNG
>P30297 3.5.99.7~~~acdS~~~1-aminocyclopropane-1-carboxylate deaminase~~~
MNLNRFERYPLTFGPSPITPLKRLSQHLGGKVELYAKREDCNSGLAFGGNKTRKLEYLIPEAIEQGCDTLVSIGGIQSNQ
TRQVAAVAAHLGMKCVLVQENWVNYSDAVYDRVGNIEMSRIMGADVRLDAAGFDIGIRPSWEKAMSDVVEQGGKPFPIPA
GCSEHPYGGLGFVGFAEEVRQQEKELGFKFDYIVVCSVTGSTQAGMVVGFAADGRSKNVIGIDASAKPEQTKAQILRIAR
HTAELVELGREITEEDVVLDTRFAYPEYGLPNEGTLEAIRLCGSLEGVLTDPVYEGKSMHGMIEMVRRGEFPEGSKVLYA
HLGGAPALNAYSFLFRNG
>Q00740 3.5.99.7~~~acdS~~~1-aminocyclopropane-1-carboxylate deaminase~~~
MNLQRFPRYPLTFGPTPIQPLARLSKHLGGKVHLYAKREDCNSGLAFGGNKTRKLEYLIPEALAQGCDTLVSIGGIQSNQ
TRQVAAVAAHLGMKCVLVQENWVNYSDAVYDRVGNIQMSRILGADVRLVPDGFDIGFRRSWEDALESVRAAGGKPYAIPA
GCSDHPLGGLGFVGFAEEVRAQEAELGFKFDYVVVCSVTGSTQAGMVVGFAADGRADRVIGVDASAKPAQTREQITRIAR
QTAEKVGLERDIMRADVVLDERFAGPEYGLPNEGTLEAIRLCARTEGMLTDPVYEGKSMHGMIEMVRNGEFPEGSRVLYA
HLGGVPALNGYSFIFRDG
>P0DMQ7 ~~~~~~Uncharacterized protein Rv2003A~~~
MPTITVSSTSSLCGQALSGNPTFAEHLVRMGITSVSVHSGAIAATPGSVAAAERRLLLESARGDA
>L0TBY6 ~~~~~~Protein Rv2250A~~~COG0277
MTHREELLPPMKWDAWGDPAAAKPLSDGVRSLLKQVVGLADSEQPELDPAQVQLRPSALSGADHDALARIVGTEYFRTAD
RDRLLHAGGKSTPDLLRRKDTGVQDAPDAVLLPGGPNGGGRRRRHLALLLRPRHCRGPVWWRHQRRWWA
>Q6STM1 1.14.14.108~~~camP~~~2,5-diketocamphane 1,2-monooxygenase 1~~~
MKCGFFHTPYNLPTRTARQMFDWSLKLAQVCDEAGFADFMIGEHSTLAWENIPCPEIIIGAAAPLTKNIRFAPMAHLLPY
HNPATLAIQIGWLSQILEGRYFLGVAPGGHHTDAILHGFEGIGPLQEQMFESLELMEKIWAREPFMEKGKFFQAGFPGPD
TMPEYDVEIADNSPWGGRESMEVAVTGLTKNSSSLKWAGERNYSPISFFGGHEVMRSHYDTWAAAMQSKGFTPERSRFRV
TRDIFIADTDAEAKKRAKASGLGKSWEHYLFPIYKKFNLFPGIIADAGLDIDPSQVDMDFLAEHVWLCGSPETVKGKIER
MMERSGGCGQIVVCSHDNIDNPEPYFESLQRLASEVLPKVRMG
>M5AWY0 1.14.14.108~~~~~~2,5-diketocamphane 1,2-monooxygenase 2~~~
MQAGFFHTPYNLPTRTARQMFDWSLKLAQVCDEAGFADFMIGEHSTLAWENIPCPEIIIGAAAPLTKNIRFAPMAHLLPY
HNPASLAIQVGWLSQILEGRYFLGVAPGGHHTDAILHGFEGIGPLQEQMFEALELMEKVWARKPFMEKGKFFQAGFPGPD
TMPEYDVEIADNSPWGGREALEIAVTGLTKNSSSLKWAGERNYSPISFFGGHEVMRSHYDTWAAAMQSKGFTPDTSRFRV
TREIFIADTDAEARKRAKASGMAKTWEHYLFPIYKKFNLFPGIIADAGLDIDPSQIDMDFLADHVWLCGSPETVKGKIEN
MIERSGGCGQIIVNSHDNIDNPEPYFESLQRLAQEVLPNVKTS
>Q88FI7 2.6.1.39~~~~~~2-aminoadipate transaminase~~~COG0160
MNQESISQSIAIVHPITLSHGRNAEVWDTDGKRYIDFVGGIGVLNLGHCNPAVVEAIQAQATRLTHYAFNAAPHGPYLAL
MEQLSQFVPVSYPLAGMLTNSGAEAAENALKVARGATGKRAIIAFDGGFHGRTLATLNLNGKVAPYKQRVGELPGPVYHL
PYPSADTGVTCEQALKAMDRLFSVELAVEDVAAFIFEPVQGEGGFLALDPAFAQALRRFCDERGILIIIDEIQSGFGRTG
QRFAFPRLGIEPDLLLLAKSIAGGMPLGAVVGRKELMAALPKGGLGGTYSGNPISCAAALASLAQMTDENLATWGERQEQ
AIVSRYERWKASGLSPYIGRLTGVGAMRGIEFANADGSPAPAQLAKVMEAARARGLLLMPSGKARHIIRLLAPLTIEAEV
LEEGLDILEQCLAELN
>Q5FTU6 1.1.1.215~~~~~~2-ketogluconate reductase~~~COG1052
MSSKPDILTIDPLVPVMKERLEKSFTLHPYTSLENLKNIAPAIRGITTGGGSGVPSEIMDALPNLEVISVNGVGTDRINL
DEARRRNIGVAITQNTLTDDVADMAVALMMAVMRSIVTNDAFVRAGKWPSATAPLGRSLTRKKVGIAGFGHIGQAIAKRV
SAFGMEVAYFNSHARPESTCHFEPDLKALATWCDVLILAVSGGPRSANMIDRDTLDALGKDGFLVNIARGTVVDEAALLS
ALQEKRIAGAGLDVFQNEPNINPAFLSLPNTVLQAHQASATVETRTTMANLVVDNLIAYFTDKTLLTPVI
>D3DJ41 6.4.1.7~~~cfiA~~~2-oxoglutarate carboxylase large subunit~~~COG0511
MQAVEIMEEIREKFKEFEKGGFRKKILITDLTPRDGQQCKLATRVRTDDLLPLCEAMDKVGFYAVEVWGGATYDVCLRYL
KEDPWERLRRIKEVMPNTKLQMLFRGQNIVGYRPKSDKLVYKFVERAIKNGITVFRVFDALNDNRNIKTAVKAIKELGGE
AHAEISYTRSPIHTYQKWIEYALEIAEMGADWLSFKDATGIIMPFETYAIIKGIKEATGGKLPVLLHNHDMSGTAIVNHM
MAVLAGVDMLDTVLSPLAFGSSHPATESVVAMLEGTPFDTGIDMKKLDELAEIVKQIRKKYKKYETEYAGVNAKVLIHKI
PGGMISNMVAQLIEANALDKIEEALEEVPNVERDLGHPPLLTPSSQIVGVQAVLNVISGERYKVITKEVRDYVEGKYGKP
PGPISKELAEKILGPGKEPDFSIRAADLADPNDWDKAYEETKAILGREPTDEEVLLYALFPMQAKDFFVAREKGELHPEP
VDELVETTEVKAGVVPGAAPVEFEIVYHGEKFKVKVEGVSAHQEPGKPRKYYIRVDGRLEEVQITPHVEAIPKGGPTPTA
VQAEEKGIPKATQPGDATAPMPGRVVRVLVKEGDKVKEGQTVAIVEAMKMENEIHAPISGVVEKVFVKPGDNVTPDDALL
RIKHIEEEVSYG
>D3DJ42 6.4.1.7~~~cfiB~~~2-oxoglutarate carboxylase small subunit~~~COG0439
MFKKVLVANRGEIACRVIRACKELGIQTVAIYNEIESTARHVKMADEAYMIGVNPLDTYLNAERIVDLALEVGAEAIHPG
YGFLAENEHFARLCEEKGITFIGPHWKVIELMGDKARSKEVMKRAGVPTVPGSDGILKDVEEAKRIAKEIGYPVLLKASA
GGGGRGIRICRNEEELVRNYENAYNEAVKAFGRGDLLLEKYIENPKHIEFQVLGDKYGNVIHLGERDCSIQRRNQKLVEI
APSLLLTPEQREYYGSLVVKAAKEIGYYSAGTMEFIADEKGNLYFIEMNTRIQVEHPVTEMITGVDIVKWQIRIAAGERL
RYSQEDIRFNGYSIECRINAEDPKKGFAPSIGTIERYYVPGGFGIRVEHASSKGYEITPYYDSLIAKLIVWAPLWEVAVD
RMRSALETYEISGVKTTIPLLINIMKDKDFRDGKFTTRYLEEHPHVFDYAEHRDKEDFVAFISAVIASYHGL
>E0SKP1 4.3.2.-~~~~~~N-acetyl-S-(2-succino)cysteine lyase~~~COG0015
MASHLIDFLLIGNNFGTPEMRAVWSEQNRLTRQVDVEIALALAEGDLGVIPQDAASTIASHANASALNIEEIAQDAVRMK
HSLMPTIAAIQRQCGEAGEYIHYGVTTQDVVDTATVLQLRQAFDIVVRDTRLVAIELKRLAKKHQHTLMTGRTHGMQALP
TTFGFKLAVWLDEFVRHLQRLNEIRERVLVGNINGAIGTYASFGELGPEIERHTLTRLGLNTPNIGWQSARDRFSEYASV
TVLISGTLGKIGNELYNLMRTEINEIEEPFSEGKIGSTTMPHKRNPAALEGLASLTAPLFKSAALIHESMKVEHERDAMS
WRAEWIALPEINIYLSAQLQNALGILRGMSVNEKQMRANLDLQNGLLLSEKVMFEIGKLLGKQTAHHLVYECSMAAFEQN
REFKALLLEHPVLSQHLTADTLDTWLDPANYVGSAPQKVDEVIRYADGTGLLAE
>E6LDH5 4.3.2.-~~~purB~~~N-acetyl-S-(2-succino)cysteine lyase~~~COG0015
MGSHVIDLVMLRNNFSTAEMRAIWNDEARITKQLAVEAALAQAEGELGLIPKEAAKKIAKVAKETTFDIAAIAEQVAVLK
HSLMPTINALQAAAGDEGEFVHYGATTQDIVDTGTVLQLKDAYNIVLRDTQVVFEKLAKLAKHYQNVPMVGRTHGMQALP
ITFGYKLAIWVDEFGRHLERLHEIKERVFTGNINGAVGSYASFGPKGSEVERQTLAILDLNAPTIGWQSSRDRFSEYASV
IGLISATLGKIGNEFYNLMRTEINEIEEPFSKGKIGSSTMPHKRNPAAFEGLASLTPPVLKSVALIHESMHVEHERDAMS
WRQEWVALPEMNAYVSAQLAILANVLDGLQVKEAVMARNLEKQHGLLLSEKVMFEVGQKLGKQTAHHLVYECAMTSFEEE
RPFIDTLFEQAAIADTYARAEVEQWLDPTQYTGLCADKVDEVLAAWQTKGFLKEG
>V5QRX7 ~~~~~~Putative toxin Rv3098A/RVBD_3098A~~~COG2337
MVIRGAVYRVDFGDAKRGHEQRGRRYAVVISPGSMPWSVVTVVPTSTSAQPAVFRPELEVMGTKTRFLVDQIRTIGIVYV
HGDPVDYLDRDQMAKVEHAVARYLGL
>P9WIR9 ~~~~~~34 kDa antigenic protein homolog~~~
MTYSPGNPGYPQAQPAGSYGGVTPSFAHADEGASKLPMYLNIAVAVLGLAAYFASFGPMFTLSTELGGGDGAVSGDTGLP
VGVALLAALLAGVALVPKAKSHVTVVAVLGVLGVFLMVSATFNKPSAYSTGWALWVVLAFIVFQAVAAVLALLVETGAIT
APAPRPKFDPYGQYGRYGQYGQYGVQPGGYYGQQGAQQAAGLQSPGPQQSPQPPGYGSQYGGYSSSPSQSGSGYTAQPPA
QPPAQSGSQQSHQGPSTPPTGFPSFSPPPPVSAGTGSQAGSAPVNYSNPSGGEQSSSPGGAPV
>Q7NXD4 3.1.3.97~~~~~~3',5'-nucleoside bisphosphate phosphatase~~~COG0613
MANIDLHFHSRTSDGALTPTEVIDRAAARAPALLALTDHDCTGGLAEAAAAAARRGIPFLNGVEVSVSWGRHTVHIVGLG
IDPAEPALAAGLKSIREGRLERARQMGASLEAAGIAGCFDGAMRWCDNPEMISRTHFARHLVDSGAVKDMRTVFRKYLTP
GKPGYVSHQWASLEDAVGWIVGAGGMAVIAHPGRYDMGRTLIERLILDFQAAGGQGIEVASGSHSLDDMHKFALHADRHG
LYASSGSDFHAPGEGGRDVGHTEDLPPICRPIWRELEARILRPAD
>D7UER1 1.14.14.155~~~~~~3,6-diketocamphane 1,6-monooxygenase~~~
MAMETGLIFHPYMRPGRSARQTFDWGIKSAVQADSVGIDSMMISEHASQIWENIPNPELLIAAAALQTKNIKFAPMAHLL
PHQHPAKLATMIGWLSQILEGRYFLGIGAGAYPQASYMHGIRNAGQSNTATGGEETKNLNDMVRESLFIMEKIWKREPFF
HEGKYWDAGYPEELEGEEGDEQHKLADFSPWGGKAPEIAVTGFSYNSPSMRLAGERNFKPVSIFSGLDALKRHWEVYSEA
AIEAGHTPDRSRHAVSHTVFCADTDKEAKRLVMEGPIGYCFERYLIPIWRRFGMMDGYAKDAGIDPVDADLEFLVDNVFL
VGSPDTVTEKINALFEATGGWGTLQVEAHDYYDDPAPWFQSLELISKEVAPKILLPKR
>A7B3K3 1.1.1.-~~~baiA~~~3alpha-hydroxysteroid dehydrogenase~~~COG1028
MFMMLKNKVAIVTGGTRGIGFAVVKKFIENGAAVSLWGSRQETVDQALEQLKELYPDAKISGKYPSLKDTAQVTAMINQV
KEEFGAVDILVNNAGISQSTSFYNYQPEEFQKIVDLNVTAVFNCSQAAAKIMKEQGGGVILNTSSMVSIYGQPSGCGYPA
SKFAVNGLTKSLARELGCDNIRVNAVAPGITRTDMVAALPEAVIKPLIATIPLGRVGEPEDIANAFLFLASDMASYVTGE
ILSVDGAARS
>C8WMP0 1.1.1.-~~~~~~3alpha-hydroxysteroid dehydrogenase~~~COG1028
MGIYVITGATSGIGAKTAEILRERGHEVVNIDLNGGDINANLATKEGRAGAIAELHERFPEGIDAMICNAGVSGGKVPIS
LIISLNYFGATEMARGVFDLLEKKGGSCVVTSSNSIAQGAARMDVAGMLNNHADEDRILELVKDVDPAIGHVYYASTKYA
LARWVRRMSPDWGSRGVRLNAIAPGNVRTAMTANMLPEQRAAMEAIPVPTHFGEEPLMDPVEIANAMAFIASPEASGING
VVLFVDGGTDALLNSEKVY
>C8WJW0 1.1.1.-~~~~~~3beta-hydroxysteroid dehydrogenase 1~~~COG1028
MYDDLKGKTVVVTGSSKGLGAAMARRFGAEGMNVVANYRSDEEGARETVRAIEEAGGAAAAVQADVSKNECVDALFDAAM
FSFGGVDIWVNNAGIEVASPSDRKSIEEWQRVIDVNLTGVFAGCRRAIDHFLDRKMPGVIINLSSVHEIIPWPHFADYAA
SKAGVGMLTKTLALEYADRGIRVNAIAPGAMNTPINAEKFADPEARAATERLIPMGYVGAPEDVAAAAAWLASDQASYVT
GTTLFVDGGMTLYPGFQFGQG
>C8WGQ3 1.1.1.-~~~~~~3beta-hydroxysteroid dehydrogenase 2~~~COG1028
MSEARHNPVLAGQTAVITGGASGIGKSIVQRFLEAGASCLAADLNEEALAALKQELAEYGDKLDVVKVDVSNRDDVEGMV
DRAVQTFGQMDIIVNNAGIMDNLLPIAEMDDDVWERLMKVNLNSVMYGTRKAVRYFMERGEGGVIINTASLSGLCAGRGG
CAYTASKFAVVGLTKNVAFMYADTGIRCNAICPGNTQTNIGVGMRQPSERGMAKATTGYAGATRSGTPEEISAAAAFLAS
DQAGFINGETLTIDGGWSAY
>A7AZH2 1.1.1.-~~~~~~3beta-hydroxysteroid dehydrogenase~~~COG1028
MNFGGFIMGRFDEKIMLVTGATSGIGRAVAIRAAKEGATVVAVGRNEERGAAVVAAMEEAGGKGEFMKCDVSNKDAVKAL
FAEIQEKYGKLDVAVNNAGIVGASKTVEELEDDDWFQVIDANLNSCFFCCREEVKLMQPSGGAIVNVSSVAGMRGFPSAA
AYVASKHAVSGLTKAVAVDYATKGITCNAICPAGTDTPLTERSSADIKTRMAEIAAQGKDPMEWLKNSMLSGKTETLQKK
NATPEEQAATILYFASDEARHITGSIVASDGGFTTY
>P19871 1.1.1.51~~~~~~3-beta-hydroxysteroid dehydrogenase~~~
MTNRLQGKVALVTGGASGVGLEVVKLLLGEGAKVAFSDINEAAGQQLAAELGERSMFVRHDVSSEADWTLVMAAVQRRLG
TLNVLVNNAGILLPGDMETGRLEDFSRLLKINTESVFIGCQQGIAAMKETGGSIINMASVSSWLPIEQYAGYSASKAAVS
ALTRAAALSCRKQGYAIRVNSIHPDGIYTPMMQASLPKGVSKEMVLHDPKLNRAGRAYMPERIAQLVLFLASDESSVMSG
SELHADNSILGMGL
>P0DX23 1.1.1.51~~~~~~3-beta-hydroxysteroid dehydrogenase~~~
MKVLVTGATSGLGRNAVEYLRNKGISVRATGRNEAMGKLLSKMGAEFIPADLTELVSSQAKVMLAGIDTLWHCSSFTSPW
GTQQAFDLANVRATRRLGEWSVAWGVRNFVHISSPSLYFDYHHHRDIQEDFRPHRFANEFARSKAASEEVINLLAQANPH
TRFTILRPQSLFGPHDKVFIPRMVQMMRHYGSVLLPRGGNALVDMTYYENAVHAMWLASQPQCDHLPSARAWNISNGEPR
TLRSIVQKLIDELGIKCRIRSVPYPMLDIIARSMEHFGNKSAKEPAFTHYGVSKLNFDFTLDISRAQQELGYQPIVTLDE
GVTRTAAWLKDHGKLHD
>P0DX24 1.1.1.51~~~~~~3-beta-hydroxysteroid dehydrogenase~~~
MGDPTLRTDLGRVLVTGGSGFVGANLVTTLLERGHEVRSFDRVPSPLPAHPKLTTVVGDITNGEDVATAVAGIDTIFHTA
AIIDLMGGATVTEEYRQRSFSVNVEGTKNLVHAGQQAGVQRFVYTASNSVVMGGQDIVNGDETLPYTTRFNDLYTETKVV
AEKFVLGQNGEQGMLTCSIRPSGIWGRGDQTMFRKVFENVLAGHVKVLVGSKNIKLDNSYVHNLIHGFILAAEHLVPGGT
APGQAYFINDGEPLNMFEFARPVVVACGRKLPNIRVSGRLVHKAMMGWQWLHFKYGIREPLVEPLAVERLYLNNYFSIAK
ARRDLGYEPLFTTEQAMAECLPYYTDLFDTMVAQGAQPAVAAAPKS
>P9WQP7 ~~~~~~3 beta-hydroxysteroid dehydrogenase/Delta 5-->4-isomerase~~~COG0451
MLRRMGDASLTTELGRVLVTGGAGFVGANLVTTLLDRGHWVRSFDRAPSLLPAHPQLEVLQGDITDADVCAAAVDGIDTI
FHTAAIIELMGGASVTDEYRQRSFAVNVGGTENLLHAGQRAGVQRFVYTSSNSVVMGGQNIAGGDETLPYTDRFNDLYTE
TKVVAERFVLAQNGVDGMLTCAIRPSGIWGNGDQTMFRKLFESVLKGHVKVLVGRKSARLDNSYVHNLIHGFILAAAHLV
PDGTAPGQAYFINDAEPINMFEFARPVLEACGQRWPKMRISGPAVRWVMTGWQRLHFRFGFPAPLLEPLAVERLYLDNYF
SIAKARRDLGYEPLFTTQQALTECLPYYVSLFEQMKNEARAEKTAATVKP
>P37046 ~~~~~~Tricyclic peptide RP 71955~~~
CLGIGSCNDFAGCGYAVVCFW
>P85078 ~~~~~~Tricyclic peptide MS-271~~~
CLGVGSCNDFAGCGYAIVCFW
>Q59087 4.2.1.10~~~quiB~~~Catabolic 3-dehydroquinate dehydratase~~~COG0710
MSLPILSTTYAAENTVPASKSTYVVKNLNIGDLPVKTLVPITAKTREQALAQAKVIAENKDADIAEFRIDLLEFASDTKK
VIALGQELNQILKDKPLLATIRTSNEGGKLKVTDQEYEKIYSEYLKKPFMQLLDIEMFRDQAAVAKLTKLAHQKKVLVVM
SNHDFDKTPSEQEIVSRLLKQDQMGADILKIAVMPKSKQDVFTLMNATLKVSEQSTKPLLTMSMGRLGTISRIATANMGG
SLSFGMIGEASAPGQIDVTALKQFLKTVQPTP
>Q1LCS4 1.13.11.6~~~nbaC~~~3-hydroxyanthranilate 3,4-dioxygenase~~~COG1917
MLTYGAPFNFPRWIDEHAHLLKPPVGNRQVWQDSDFIVTVVGGPNHRTDYHDDPLEEFFYQLRGNAYLNLWVDGRRERAD
LKEGDIFLLPPHVRHSPQRPEAGSACLVIERQRPAGMLDGFEWYCDACGHLVHRVEVQLKSIVTDLPPLFESFYASEDKR
RCPHCGQVHPGRAA
>Q83V26 1.13.11.6~~~nbaC~~~3-hydroxyanthranilate 3,4-dioxygenase~~~
MMFTFGKPLNFQRWLDDHSDLLRPPVGNQQVWQDSDFIVTVVGGPNFRTDFHDDPMEEFFYQFKGNAYLNIMDRGQMDRV
ELKEGDIFLLPPHLRHSPQRPEAGSRCLVIERQRPKGMLDGFEWYCLSCNGLVYRVDVQLNSIVTDLPPLFDIFYGNVGL
RKCPQCGQVHPGKAAIEAVARGDQP
>Q46ZL2 5.4.4.3~~~~~~3-hydroxylaminophenol mutase~~~COG0174
MAQSVADVMKLVKENDVKFVDFRFTDTKGKEQHVSVPTSHFDDDKFESGHAFDGSSIAGWKGIEASDMLLMPDSNTAFID
PFYEEPTLVLTCDVVEPSDGKGYDRDPRSIAKRAEAYLKSTGLGDTAFFGPEPEFFIFDGVTWNVDMQGCFVKVHSEEAP
WSSGKEFEHGNSGHRPGKKGGYFPVAPIDTFQDMRSEMCLILESLGIPVEVHHHEVAGQGQNEIGTRFSTLVQRADWTQL
QKYVIQNVAHTYGKTATFMPKPIVGDNGSGMHVHQSVWKDGQNLFAGNGYAGLSEFALYYIGGIIKHARALNAITNPGTN
SYKRLVPGFEAPVKLAYSARNRSASIRIPYVANPKGRRIETRFPDPLMNPYLGFSALLMAGLDGVMNKIHPGEAADKNLY
DLPPEEDAKIPTVCNSLDQSLEYLDNDREFLTRGGVFSNSMLDAYIELKMEEVTRFRMTTHPVEFEMYYSL
>Q9AJS8 6.2.1.27~~~hcl~~~3-hydroxybenzoate--CoA/4-hydroxybenzoate--CoA ligase~~~
MSEQLQPQQSMNAADEIIGRPLAQGLGEQTAMLCAERSITYRELDAATNRHGNALRAHGVGKGDRVLFLMDDSPELVAAY
LGTLRIGAVAVALNVRLAPRDVLYVIQDSACRLLYIDAEFLHLYQQIAGELEQPPQVVVRGDEAPAPAIIAFKHFLDGQA
ATLESVQVAPDDVAYWLYSSGTTGRPKAVMHAHRSVLIADRLEREYFGIKPGDRVFTTSKMFFGWSLGHSLMGGLQCGAT
VIVAPGWPDAERVMATAARHRPTILFSTPVMYRNLLREGAGESAAMRDIRHFVSAGEKLPENIGQQWLDTFGIPITEGIG
ASETVFLFLCARPDAYRIGSCGKRVPWAEVRLLDELGNEITTPDTPGLIAIRMASQFVGYWKLPETTEKALRDGWYYPGD
MFSFDADGFWYHNGRADDMLKISGQWVSPGEIESCASAVPGIAEAVVVAVPNDDGLTRLTLFIVPEDPSASQQKLSEAWM
TTLRGTLSIYKCPRTIQFLEELPRTATGKVQKYRLRDMLQATL
>Q9F131 1.14.13.24~~~xlnD~~~3-hydroxybenzoate 6-hydroxylase 1~~~
MHNNILIAGAGIGGLSAALGLARKGMRSIVLEKAPELGEIGAGIQLAPNAYHALDALGIGEVARQTGVHVDKLLWMDGMT
DKEIASVPLANRFREFFGNPYAVIHRADFHGLLVEACHKTGLVEVRTNAEVVDYENFPDRVEAILHDGSCINGAVLVGAD
GLWSNVRQKVIGDGDPRVSGHTTYRSVIPAEDMPEELRWNMSTAWAGEGCHMVHYPLKGGKVFNLVLTSNSGASEPEAGV
PVTTDEVFEKFKTMKRRPTSLIHKGNNWKRWVLCDRDPLPNWVDGRVTLLGDAAHPMMQYMAQGASMAIEDAVCLAFELG
REMDPVSALKKYNRARFARTARVQTYSRYASDFIYHAKGGAAAMRNELMGGMTPTDFFQWINWLYGKETVEKYK
>Q0QFQ1 1.14.13.24~~~hbzD~~~3-hydroxybenzoate 6-hydroxylase 2~~~
MRSTNTRSARSRPTKRSVNASATPTQSFHRVDAHLSLLEGAEETGWVEFKTNTRVERIEQDADSVTVYDQNGNAYRGVAL
IGADGVRSVVRQTYVNDQPRVTGHVVYRAVVDKDEFPQDLRWNASSLWVGPKCHLVHYPLRGGEQYNIVVTFQSRQQEEW
GVTEGSKEEVESYFQDICPKARQLIGLPKSWKRWATADREPIPQWTFGRTTLLGDAAHPTTQYMAQGACMALEDAVTLGE
ALRVHGNDWGKALDLYQRSRITRTARIVLSGREMGRLYHAQGVERLVRNSLWKGRTTEQFYDAIQWLYGWNVDNCLSESI
>Q8NLB6 1.14.13.24~~~~~~3-hydroxybenzoate 6-hydroxylase~~~COG0654
MSLPHSDELRGQKIIISGGGIGGAAGALALALRGADVTLYERAAEFKEVGAGLQIGPHGWRMLESWGLLDQIVVAGYLPE
DMQFRDAVNRETILTMRFDEEFQQHYGGRYLVIHRSDLLNILVTNAEAAGAKLHNGVLVTDSRTVDGGIEVDIESSINKG
EDNKTLLVDAFLAFDGIHSVMRKKLVDDAPVASSYVAYRGTSKLAEDAEMKDLKSVIGYIGPHVHFIQYPLRGGELLNQV
AVFESQRYLDGRTAGDIPEDWGNPEELDRAYNHCDPFIQDRLDTLWRNNWWQMSDREPLENWRIGRMLLLGDAAHAPLQY
LASGAVMAMEDAEAVALFAADAARAGNLDWEEVLAEVEAERRPRCSRIQTVGRFWGELWHVEGTARLIRNEVFRQADRNG
WFIYADWLWGYDASKRAHIANPELGEMPQALKEWRYALLEQK
>Q5EXK1 1.14.13.24~~~mhbM~~~3-hydroxybenzoate 6-hydroxylase~~~COG0654
MAKVMRAIIVGGGIGGAATALSLARQGIKVMLLEKAHEIGEIGAGIQLGPNAFSALDSLGVGEVARQRAVFTDHITMMDA
VNGEEVVHIETGQAFRDHFGGPYAVIHRVDIHATVWEAALTHPAVEYRTSTQVVDIRQTADDVTVFDDKGNSWTADILIG
CDGGKSVVRQSLLGDSPRVTGHVVYRAVVDAADMPDDLRINAPVLWAGPHCHLVHYPLRGGKQYNLVVTFHSRQQEEWGV
RDGSKEEVLSYFKGIHPRPRQMLDKPTSWRRWSTADREPVEKWGNDRITLVGDAAHPVAQYMAQGACMALEDAVTLGKAL
AQCDGDAARAFALYESVRIPRTARIVWSTREMGRVYHAAGVERQVRNLLWKGKTQSEFYRGIEWLYGWKEDNCLEAR
>Q3S4B7 1.14.13.24~~~nagX~~~3-hydroxybenzoate 6-hydroxylase~~~COG0654
MSDNPADLPVLVAGGGIGGLAAALALVRRGFSVKVLEQAPEIGEIGAGIQLGPNAFHAFDALGIGEKARGRAVYTDEMVM
HDAIDGSLVGRIPTGEAFRQRFGNPYAVIHRVDVHLSLLEGAQETGKVEFLTSTRALRIEQDEGSVTVYDQHGNAHKGIA
LIGADGVKSVVREQFVGDAARVTGHVVYRAVVDKKDFPESLQWNAASIWVGPNCHLVHYPLRGGEQYNVVVTFHSRQPEQ
WGVTEGSKEEVQSYFQGICPQARQLIDLPKTWKRWATADREPIGQWSFGRVTLLGDAAHPTTQYMAQGACMAMEDGVTLG
EALRVNNNDFPKAFELYQRSRVARTARIVLSSREMGRIYHAQGVERLVRNDLWKGRTPERFYDAMEWLYGWNVGNCLAKD
>Q4W8C9 3.1.1.-~~~phaZc~~~3-hydroxybutyrate-oligomer hydrolase~~~
MSASPRLGFVQCISPAGLHRMAYHEWGDPANPRVLVCAHGLTRTGRDFDTVASALCGDYRVVCPDVAGRGRSEWLADANG
YVVPQYVSDMVTLIARLNVEKVDWFGTSMGGLIGMGLAGLPKSPVRKLLLNDVGPKLAPSAVERIGAYLGLPVRFKTFEE
GLAYLQTISASFGRHTPEQWRELNAAILKPVQGTDGLEWGLHYDPQLAVPFRKSTPEAIAAGEAALWRSFEAIEGPVLVV
RGAQSDLLLRETVAEMVARGKHVSSVEVPDVGHAPTFVDPAQIAIAPQFFTGA
>A9CH01 4.2.1.77~~~~~~Trans-3-hydroxy-L-proline dehydratase~~~COG3938
MRSIKTVHVISAHAEGEVGDVIVGGVKPPPGETIWEQSRFIARDETLRNFVLNEPRGGVFRHVNLLVPPKHPDADAAFII
MEPEDTPPMSGSNSICVSTVLLDGGIVPMQEPETHMLLEAPGGLVKVRAECRNGKAERIFVQNLPSFAAKLDAELEVEGL
GKLKVDTAYGGDSFVIVDAEAMGFSLKPEEAHEIARLGVRITNAANKALGFDHPENPDWRHFSFCLFAGKVERTAEGLRA
GAAVAIQPGKVDRSPTGTALSARMAVLHARGEMKEGETLTAVSLIGSTFTGRILGTTTVGDRPAILPEISGRGWITGIHQ
HMLDPSDPWPEGYRLTDTWGAR
>A9AKG8 5.1.1.-~~~prdF~~~3-hydroxyproline 2-epimerase~~~COG3938
MKISRSLSTVEVHTGGEAFRIVTSGLPRLPGDTIVRRRAWLKEHADEIRRALMFEPRGHADMYGGYLTEPVSPNADFGVI
FVHNEGYSDHCGHGVIALSTAAVELGWVQRTVPETRVGIDAPCGFIEAFVQWDGEHAGPVRFVNVPSFIWQRDVAVDTPS
FGTVTGDIAYGGAFYFYVDGAPFDLPVRESAVERLIRFGAEVKAAANAKYPVEHPEIPEINHIYGTIIANAPRDPRSTQA
NCCVFADREVDRSPTGSGTGGRVAQLYQRGLLAAGDTLVNESIVGTVFKGRVLRETTVGGMPAVIPEVEGSAHICGFANW
IVDERDPLTYGFLVR
>Q9I0N5 1.13.11.91~~~~~~3-mercaptopropionate dioxygenase~~~
MSSILRLDRLRQFIGELATLLDSRPDESTLLAQAHPLLAELVHQDDWLPEDCARPDPQRYQQYLLHVDSRQRFSVVSFVW
GPGQITPVHDHRVWGLIGMLRGAEYSQPYAFDAGGRPHPSGARRRLEPGEVEALSPRIGDVHQVSNAFSDRTSISIHVYG
ANIGAVRRAVFSAEGEEKPFISGYSNSRLPNIWDLSKENPA
>P05100 3.2.2.20~~~tag~~~DNA-3-methyladenine glycosylase 1~~~COG2818
MERCGWVSQDPLYIAYHDNEWGVPETDSKKLFEMICLEGQQAGLSWITVLKKRENYRACFHQFDPVKVAAMQEEDVERLV
QDAGIIRHRGKIQAIIGNARAYLQMEQNGEPFVDFVWSFVNHQPQVTQATTLSEIPTSTSASDALSKALKKRGFKFVGTT
ICYSFMQACGLVNDHVVGCCCYPGNKP
>P04395 3.2.2.21~~~alkA~~~DNA-3-methyladenine glycosylase 2~~~COG0122
MYTLNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTLHINLSAGLEPVAAECLAKMSR
LFDLQCNPQIVNGALGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQR
LAAADPQALKALGMPLKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYL
IKQRFPGMTPAQIRRYAERWKPWRSYALLHIWYTEGWQPDEA
>P37878 3.2.2.21~~~alkA~~~DNA-3-methyladenine glycosylase~~~COG0122
MTWHEVNDVIVITLPEIFDMNANLGYLTREKNECMYEIENNIITKVIAIGEIRSLVQVSVINNKQMIVQFLNDSRPVEQW
KREEIVKYIHEWFDLDNDLTPFYEMAKADPLLKMPARKFYGLRVIGIPDLFEALCWGVLGQQINLAFAYSLKKQFVEAFG
DSIEWNGKKYWVFPPYERIARLTPTDLADIKMTVKKSEYIIGIARLMASGELSREKLMKMNFKDAEKNLIKIRGIGPWTA
NYVLMRCLRFPTAFPIDDVGLIHSIKILRNMNRKPTKDEILEISVPWKEWQSYATFYLWRVLY
>P9WJP7 3.2.2.-~~~~~~Putative 3-methyladenine DNA glycosylase~~~COG2094
MNAEELAIDPVAAAHRLLGATIAGRGVRAMVVEVEAYGGVPDGPWPDAAAHSYRGRNGRNDVMFGPPGRLYTYRSHGIHV
CANVACGPDGTAAAVLLRAAAIEDGAELATSRRGQTVRAVALARGPGNLCAALGITMADNGIDLFDPSSPVRLRLNDTHR
ARSGPRVGVSQAADRPWRLWLTGRPEVSAYRRSSRAPARGASD
>Q06401 1.3.99.4~~~~~~3-oxosteroid 1-dehydrogenase~~~
MAEQEYDLIVVGSGAGACWAPIRAQEQGLKTLVVEKTELFGGTSALSGGGIWIPLNYDQKTAGIKDDLETAFGYMKRCVR
GMATDDRVLAYVETASKMAEYLRQIGIPYRAMAKYADYYPHIEGSRPGGRTMDPVDFNAARLRVTALETMRPGPPGNQLF
GRMSISAFEAHSMLSRELKSRFTILGIMLKYFLDYPWRNKTRRDRRMTGGQALVAGLLTAANKARVEMWCNSPLKELVQD
ASGRVTGVIVERNGQRQQINARRGVLLGAGGFERNQEMRDQYLNKPTRLVDGNPCGRQYGDAHRAGQAWAHTGADGLVLG
RAHHGCSQGAGLSRHFRGTLAAGVHGGQRQGAALPQRVRPVSGIPAAMLAENAKGNGGVPAWIVFDASFRAQNPMGPLMP
GSAVPDSKVRKSWLNNVYWKGRRWKIWRADRRGRAGLQVSARRMTEYARAGKDLDFDRGGNVFDRYYGDPRLKNPNLGPI
EKGPFYAMRLWPGEIGTKGGLLTDREGRVLDTQGRIIEGLYCVGNNSASVMAPAYAGAGSTLGPAMTFAFRAVADMVGKP
LPLENPHLLGKTV
>A0R4S9 1.3.99.4~~~ksdD~~~3-oxosteroid 1-dehydrogenase~~~COG1053
MFYMTGQEYDVVVVGSGAAGMVAALTAAHQGLSTVVVEKAPHYGGSTARSGGGVWIPNNEILKRDGVKDTPDEARKYLHA
IIGDVVPAEKIDTYLDRGPEMLSFVLKHSPLKLCWVPGYSDYYPETPGGKPTGRSVEPKPFDANKLGPDLKGLEPPYGKV
PMNMVVMQQDYVRLNQLKRHPRGVLRSLKVGIRATWGKVSGKNLVGMGRALIAPLRIGLREAGVPVLLNTALTDLYVEDG
RVLGIYVRDTTAGDDAEPRLIRARHGVILGSGGFEHNEQMRVKYQRAPITTEWTVGAVANTGDGIVAAEKLGAALELMED
AWWGPTVPLVGAPWFALSERNSPGSIIVNMSGKRFMNESMPYVEACHHMYGGQYGQGPGPGENIPAWLIFDQQYRDRYIF
AGLQPGQRIPSKWLESGVIVKADTLVELAEKTGLPADQFTSTIERFNGFARSGVDEDFHRGESAYDRYYGDPTNKPNPNL
GEIKHGPFYAAKMVPGDLGTKGGIRTDVRGRALRDDDSVIEGLYAAGNVSSPVMGHTYPGPGGTIGPAMTFGYLAALDIA
ATAAASRKG
>P71864 1.3.99.4~~~kstD~~~3-oxosteroid 1-dehydrogenase~~~COG1053
MTVQEFDVVVVGSGAAGMVAALVAAHRGLSTVVVEKAPHYGGSTARSGGGVWIPNNEVLKRRGVRDTPEAARTYLHGIVG
EIVEPERIDAYLDRGPEMLSFVLKHTPLKMCWVPGYSDYYPEAPGGRPGGRSIEPKPFNARKLGADMAGLEPAYGKVPLN
VVVMQQDYVRLNQLKRHPRGVLRSMKVGARTMWAKATGKNLVGMGRALIGPLRIGLQRAGVPVELNTAFTDLFVENGVVS
GVYVRDSHEAESAEPQLIRARRGVILACGGFEHNEQMRIKYQRAPITTEWTVGASANTGDGILAAEKLGAALDLMDDAWW
GPTVPLVGKPWFALSERNSPGSIIVNMSGKRFMNESMPYVEACHHMYGGEHGQGPGPGENIPAWLVFDQRYRDRYIFAGL
QPGQRIPSRWLDSGVIVQADTLAELAGKAGLPADELTATVQRFNAFARSGVDEDYHRGESAYDRYYGDPSNKPNPNLGEV
GHPPYYGAKMVPGDLGTKGGIRTDVNGRALRDDGSIIDGLYAAGNVSAPVMGHTYPGPGGTIGPAMTFGYLAALHIADQA
GKR
>Q04616 1.3.99.4~~~~~~3-oxosteroid 1-dehydrogenase~~~
MQDWTSECDLLVVGSGGGALTGAYTAAAQGLTTIVLEKTDRFGGTSAYSGASIWLPGTQVQERAGLPDSTENARSYLRAL
LGDAESERQDAYVETAPAVVALLEQNPNIEFEFRAFPDYYKAEGRMDTGRSINPLDLDPADIGDLAGRCVRNCTKTDRMD
HAPGRMIGGRALIAVSAAVQSTARQNFAPESVLTSLIVEDGRVVGGLRSNPRYRQRIKANRGVLMHAGGGFEGNAEMREQ
AGTPGKAIWSMGPSGPTPATRSPPELAGRRRNSLARSGVVLPRGRAARRRRLHGRVRGGLVVDSPGSVPQRVASVRPVRT
SHGCSPDDNGSAVPSFMIFDSREVTDCPPSASRTRPPPSTSKPEPGSVPTLSKNSLPRPDYRPERIAQHCRKVQRCRKLG
VDEEFHRGEDPYDAFFCPPNGGANAALTAIENGPFYAARDRLSDLGTKGGLVTDVNGRVLRADGSAIDGLYAAGNTSASV
APFYPGPGVPLGTAMVFSYRAAQDMAK
>A9KM56 2.4.1.282~~~~~~3-O-alpha-D-glucosyl-L-rhamnose phosphorylase~~~COG1554
MLIHEDNRYIVEKEYNLVTEPQNASLFTTGNGYMGVRGSLEEFGSTRIQGSFIRGFVDEIIEVIEPFCDNEYMKKYYFDE
EKLKKFDKQISCINLVDFLLIRFRIGDEIFYPWEGEILSWERRLDTSQSIFQRSVTWKDKMGNITVFEFERFASYDEEHR
YCMRAMAKPQNHFLPVEIISGIDTDVRTGGQRVLQFINNQILNNGLISCFQSGKRYGITCKIAVKNSFFMDGKLQHSIGE
QQENLLLNKALMPGGGREYCVEKTIYLTTDRDCDPLFDTIDTVLLDVGTYDAYKEAHIREWSQFFSNFDIKILGDDRKDA
QLRFATYHAVITGDRNNSIHSLSAKGLTGERYNQFVWWDCEIYQLPIFIHAFPEVAKHALIYRYDRLEEARENAKLEGCK
GARYPFVSSLEGKEHVWIYARHPFLQVHITADIGFGIINYFINTLDYEFMELYGFEMLYEICRYWVSKVILKDGTYQLLG
VTGTDEHHPYVDNDAYTNYIVQYVLQETILLDSQYSSTKVRDKIGITVNELKDIEQVSRLLYLPLEKSGLIPQFDGYFDL
SRDLEVDGSGTGKNFQMKQAGLYHKSQVIKQPDVMLLFSYLNFEIKNSRYEENWDYYEKMCESSSSLTFPVHAICSADAN
RMLSFLNYFNETVNIDLLDIHHCAWQGVHAGCLSGAWYAIFRGLMGIVTRIDCIQINPKLIPFWQGVELSFIYQTKKIKA
TLNGNVFTLGSEDKKEISVYFQGKRYAFVDRLEVSF
>A3DIJ8 3.6.1.25~~~~~~Inorganic triphosphatase~~~COG2954
MGKEIEKKFIVSGDAYKSLAKGVLYRQGYIFFDKDKSVRVRVFNDKGYLTVKGTSTGISRLEYEYEIPVGEANEILEYLC
EKPVIEKLRYKFQFEGFTWEVDEFLGENEGLVIAEIELPDENAVFKKPDWIGREVTGDPRYLNSNLVKNPYKNFKE
>P30871 3.6.1.25~~~ygiF~~~Inorganic triphosphatase~~~COG3025
MAQEIELKFIVNHSAVEALRDHLNTLGGEHHDPVQLLNIYYETPDNWLRGHDMGLRIRGENGRYEMTMKVAGRVTGGLHQ
RPEYNVALSEPTLDLAQLPTEVWPNGELPADLASRVQPLFSTDFYREKWLVAVDGSQIEIALDQGEVKAGEFAEPICELE
LELLSGDTRAVLKLANQLVSQTGLRQGSLSKAARGYHLAQGNPAREIKPTTILHVAAKADVEQGLEAALELALAQWQYHE
ELWVRGNDAAKEQVLAAISLVRHTLMLFGGIVPRKASTHLRDLLTQCEATIASAVSAVTAVYSTETAMAKLALTEWLVSK
AWQPFLDAKAQGKISDSFKRFADIHLSRHAAELKSVFCQPLGDRYRDQLPRLTRDIDSILLLAGYYDPVVAQAWLENWQG
LHHAIATGQRIEIEHFRNEANNQEPFWLHSGKR
>Q82UI9 3.6.1.25~~~~~~Inorganic triphosphatase~~~COG2954
MTEIERKFLVATFPDGELHAVPLRQGYLTTPTDSIELRLRQQGTEYFMTLKSEGGLSRQEYEIQIDVTQFEMLWPATEGR
RVEKTRYSGKLPDGQLFELDVFAGHLSPLMLVEVEFLSEDAAQAFIPPPWFGEEVTEDKRYKNKALALSIP
>D5MP61 3.2.1.32~~~xyl4~~~Beta-1,3-xylanase XYL4~~~
MKRTYLSLIAAGVMSLSVSAWSLDGVLVPESGILVSVGQDVDSVNDYASALGTIPAGVTNYVGIVNLDGLNSDADAGAGR
NNIAELANAYPTSALVVGVSMNGEVDAVASGRYNANIDTLLNTLAGYDRPVYLRWAYEVDGPWNGHSPSGIVTSFQYVHD
RIIALGHQAKISLVWQVASYCPTPGGQLDQWWPGSEYVDWVGLSYFAPQDCNWDRVNEAAQFARSKGKPLFLNESTPQRY
QVADLTYSADPAKGTNRQSKTSQQLWDEWFAPYFQFMSDNSDIVKGFTYINADWDSQWRWAAPYNEGYWGDSRVQANALI
KSNWQQEIAKGQYINHSETLFETLGYGSTGGGDNGGGDNGGTNPPEPCNEEFGYRYVSDSTIEVFHKNNGWSAEWNYVCL
NGLCLQGEIKNGEYVKQFDAQLGSTYGIEFKVADGESQFITDKSVTFENKQCGSTGTPGGGDNGSGGDNGGDNGSGGDNG
SGGGTDPSQCSADFGYNYRSDTEIEVFHKDLGWSASWNYICLDDYCVPGDKSGDSYNRSFNATLGSDYKITFKVEDSASQ
FITEKNITFVNTSCAQ
>Q9LCB9 3.2.1.32~~~txyA~~~Beta-1,3-xylanase TXYA~~~
MKKLAKMISVATLGACAFQAHALDGKLVPDQGILVSVGQDVDSVNDYSSAMGTTPAGVTNYVGIVNLDGLSTDADAGAGR
NNIVELANQYPTSALIVGVSMNGEVQNVANGQYNANIDTLIRTLGEFDRPVYLRWAYEVDGPWNGHNTEDLKQSFRHVYQ
RIRELGYADNISMVWQVASYCPTAPGQLGTWWPGDDVVDWVGLSYFAPQDCNWDRVNEAAQWARSHNKPLFINESSPQRY
QLADLTYSTDPAKGTNRQAKTDQQIWSEWFEPFFQFMVDNQDILKGFTYINADWDSQWRWAAPYNEGYWGDSRVQVIPYI
KQKWQETLSDPKFIRHSDELFAQLGYGNSDGGNGGDNGGDNGGDNGGETPENCTDDFNFNYVSDNEIEVYHVDKGWSAGW
NYLCLDDYCLSGTKSNGAFSRSFSAQLGQTYKMTFKVEDITGQGQQIIDKTVTFTNQVCN
>Q8RS40 3.2.1.32~~~txyA~~~Beta-1,3-xylanase~~~
MKKLAKMISIATLGACAFSAHALDGKLVPNEGVLVSVGQDVDSVNDYSSAMSTTPAGVTNYVGIVNLDGLASNADAGAGR
NNVVELANLYPTSALIVGVSMNGQIQNVAQGQYNANIDTLIQTLGELDRPVYLRWAYEVDGPWNGHNTEDLKQSFRNVYQ
RIRELGYGDNISMIWQVASYCPTAPGQLSSWWPGDDVVDWVGLSYFAPQDCNWDRVNEAAQWARSHNKPLFINESSPQRY
QLADRTYSSDPAKGTNRQSKTEQQIWSEWFAPYFQFMEDNKDILKGFTYINADWDSQWRWAAPYNEGYWGDSRVQVLPYI
KQQWQDTLENPKFINHSSDLFAKLGYVADGGDNGGDNGGDNGGDNGGDNGGDNGGTEPPENCQDDFNFNYVSDQEIEVYH
VDKGWSAGWNYVCLNDYCLPGNKSNGAFRKTFNAVLGQDYKLTFKVEDRYGQGQQILDRNITFTTQVCN
>B6VP39 2.2.1.8~~~~~~Fluorothreonine transaldolase~~~
MPSSVNRTSRTEPAGHHREFPLSLAAIDELVAEEEAEDARVLHLTANETVLSPRARAVLASPLTSRYLLEHLDMRGPSPA
RLGNLLLRGLDRIGTIEESATEVCRRLFGARYAEFRCLSGLHAMQTTFAALSRPGDTVMRVATKDGGHFLTELICRSFGR
RSCTYVFDDTMTIDLERTREVVEKERPSLLFVDAMNYLFPFPIAELKAIAGDVPLVFDASHTLGLIAGGRFQDPLREGAD
LLQANTHKTFFGPQKGIILGNDRSLMEELGYTLSTGMVSSQHTASTVALLIALHEMWYDGREYAAQVIDNARRLAGALRD
RGVPVVAEERGFTANHMFFVDTRPLGSGPAVIQRLVRAGVSANRAVAFNHLDTIRFGVQEITRRGYDHDDLDEAADLVAA
VLLERQEPERIRPRVAELVGRRRTVRYTGDPASAAGPPARERYAPPTAPAGHPARPRWIGVRLTPLPEPVTEAECAGAQR
LGRLAGAFPHQIDSSGNVSFTSTDGRLFVTGSGTYIKDLAPGDFVELTGAEGWTLHCRGDGPPSAEAYLHHLLRERVGAR
YVVHNHCIPGRALETSGALVIPPKEYGSVALAEAVADACQDSQVMYVRRHGLVFWAHSYDECLALIEDVRRITG
>Q53005 6.2.1.25~~~hbaA~~~4-hydroxybenzoate--CoA/benzoate--CoA ligase~~~COG0365
MPLRDYNAAVDFVDRNVAEGRGGKIAFIDPQRSLSYGELRDAVARVGPMLARLGVEQENRIALVLKDTVDFPILFWGAIR
AGIVPVLLNTRLTADQYRYLLEDSRSRVVFASSEFLPVIEEAAADLPHLRTIIAVGDAPAPTLQLANLLATEQEGGAPAA
TCADDIAYWQYSSGTTGMPKGVMHVHSSPRVMAENAGRRIGYREDDVVFSAAKLFFAYGLGNAMFCPMGIGATSVLYPER
PTADSVFDTLRLHQPTLLFAVPTLYAAMLADPRSRTETLPDRLRLCVSAGEPLPAQVGLNWRNRFGHDIVNGVGSTEMGH
LFLTNLPHAVEYGTSGVPVDGYRLRLVGDRGQDVADDEIGELLVSGGSSAAGYWNQRDKTRTTFVGEWTRTGDKYHRRAD
GVYTYCGRTDDIFKVSGIWVSPFEIEQALMSHAKVLEAAVIPAEDTDGLIKPKAFIVLASRGDIDPGALFDELKEHVKSA
IGPWKYPRWIQIMDDLPKTSSGKLQRYLLREMTLGGIEATESAPSEPALYGRVVAGNGR
>P38945 1.1.1.61~~~4hbD~~~4-hydroxybutyrate dehydrogenase~~~COG1454
MKLLKLAPDVYKFDTAEEFMKYFKVGKGDFILTNEFLYKPFLEKFNDGADAVFQEKYGLGEPSDEMINNIIKDIGDKQYN
RIIAVGGGSVIDIAKILSLKYTDDSLDLFEGKVPLVKNKELIIVPTTCGTGSEVTNVSVAELKRRHTKKGIASDELYATY
AVLVPEFIKGLPYKFFVTSSVDALIHATEAYVSPNANPYTDMFSVKAMELILNGYMQMVEKGNDYRVEIIEDFVIGSNYA
GIAFGNAGVGAVHALSYPIGGNYHVPHGEANYLFFTEIFKTYYEKNPNGKIKDVNKLLAGILKCDESEAYDSLSQLLDKL
LSRKPLREYGMKEEEIETFADSVIEGQQRLLVNNYEPFSREDIVNTYKKLY
>Q59104 1.1.1.61~~~gbd~~~4-hydroxybutyrate dehydrogenase~~~
MAFIYYLTHIHLDFGAVSLLKSECERIGIRRPLLVTDKGVVAAGVAQRAIDAMQGLQVAVFDETPSNPTEAMVRKAAAQY
REAGCDGLVAVGGGSSIDLAKGIAILATHEGELTTYATIEGGSARITDKAAPLIAVPTTSGTGSEVARGAIIILDDGRKL
GFHSWHLLPKSAVCDPELTLGLPAGLTAATGMDAIAHCIETFLAPAFNPPADGIALDGLERGWGHIERATRDGQDRDARL
NMMSASMQGAMAFQKGLGCVHSLSHPLGGLKIDGRTGLHHGTLNAVVMPAVLRFNADAPTVVRDDRYARLRRAMHLPDGA
DIAQAVHDMTVRLGLPTGLRQMGVTEDMFDKVIAGALVDHCHKTNPKEASAADYRRMLEQSM
>Q04416 3.1.2.23~~~fcbC~~~4-hydroxybenzoyl-CoA thioesterase~~~
MHRTSNGSHATGGNLPDVASHYPVAYEQTLDGTVGFVIDEMTPERATASVEVTDTLRQRWGLVHGGAYCALAEMLATEAT
VAVVHEKGMMAVGQSNHTSFFRPVKEGHVRAEAVRIHAGSTTWFWDVSLRDDAGRLCAVSSMSIAVRPRRD
>P56653 3.1.2.23~~~~~~4-hydroxybenzoyl-CoA thioesterase~~~
MARSITMQQRIEFGDCDPAGIVWFPNYHRWLDAASRNYFIKCGLPPWRQTVVERGIVGTPIVSCNASFVCTASYDDVLTI
ETCIKEWRRKSFVQRHSVSRTTPGGDVQLVMRADEIRVFAMNDGERLRAIEVPADYIELCS
>B9J8G8 5.1.1.8~~~~~~4-hydroxyproline 2-epimerase 1~~~COG3938
MRWKRTIQLLDVHCEGEIGRVAIGGVPKIPGNTVAEQLHWLNTDPKGEELRRFLVLEPRGAPIGSVNLLLPARHPDADAA
FIILQPDQAHASSGSNSICVTTALLESGIVEMKEPETVVTLETAAGLVRATATCRDGRCEKVRLTMVPSFVHELDVGIDT
PQWGRIKLDLCYGGIFYALVDVGQIGLTIGKANAASLVQAGMVLKELINRTVPVVHPEIPAISGVAYVMFRDIDADGAIR
TCTTMWPGRADRSPCGTGNSANLATLHARGKARVGDVFKSRSIIGSEFEVGLQAETEVAGKPAIIPTITGRGFTFGLSQV
ALDPFDPMANGFALTDVWGPLAGDI
>A6WW16 5.1.1.8~~~~~~4-hydroxyproline 2-epimerase 1~~~COG3938
MRSTKVIHIVGCHAEGEVGDVIVGGVAPPPGKTVWEQSRFIASDETLRNFVLNEPRGGVFRHVNLLVPPKDPRAQMGFII
MEPADTPPMSGSNSICVSTVLLDSGIIPMQEPVTRMVLEAPGGLIEVEAECRNGKAERISVRNVPSFADRLNASLEVEGL
GTITVDTAYGGDSFVIVDAASIGMKIEPGQARELAEIGVKITKAANEQLGFRHPEKDWNHISFCQITEPVTRDGDILTGV
NTVAIRPAKLDRSPTGTGCSARMAVLHAKGQMKVGERFIGKSVLGTEFHCRLDKTLELGGKPAISPIISGRAWVTGTSQL
MLDPSDPFPSGYRLSDTWPNMPE
>B3D6W2 5.1.1.8~~~prdF~~~4-hydroxyproline 2-epimerase 1~~~COG3938
MMKRIQIIDSHTGGEPTRLVVSGFPSLGSGTMAERRDVLAREYDRYRTACILEPRGSDVLVGALLCEPVSPDAAAGVIFF
NNSGYLGMCGHGTIGVVRTLHHMGRIGPGVHRIETPVGTVEATLHDDLSVSVRNVLAYRHAKDVALDVPGYGPVRGDIAW
GGNWFFLISDHGQRVAGDNVAALTAYASAVREGLERAGITGANGGEIDHIELFADDPEHDSRSFVLCPGLAYDRSPCGTG
TSAKLACLAADGKLAPGAVWRQASVIGSVFHASYVQAEGGIVPTIRGSAHLSAEATLLIEDDDPFRWGIVS
>B9JHU6 5.1.1.8~~~~~~4-hydroxyproline 2-epimerase 2~~~COG3938
MRRSFFCIDSHTCGNPVRVVAGGGPLLPHVSMAERREIFVRDHDWVRKALMFEPRGHDIMSGAIIYPSVREDCDFAALFI
EVSGCLPMCGAGTIGLATVAIEEGLITPRVPGRLSIETPAGKVDVDYQLKDGFVEAVRLFNVASYLHSRDVVVDVSGLGS
LSVDIAYGGNFYAVIEPQENWSGLDGMSASDIVTLSQRLRDALSVVCDPVHPDDERIRGVHHAIWCDAAGSENADGRGAV
FYGDKAIDRSPGGTGTSARMAQLYGRGRLGIGDSFRNESLIGTVFEGRIEGAALVGGIPGILPSIGGWARVIGHNTIFVD
ERDPLAHGFQIR
>A6WXX7 5.1.1.8~~~~~~4-hydroxyproline 2-epimerase 2~~~COG3938
MARHSFFCVDGHTCGNPVRLVAGGGPNLEGSTMMEKRAHFLREYDWIRTGLMFEPRGHDMMSGSILYPPTRPDCDVAVLF
IETSGCLPMCGHGTIGTVTMAIEQGLVTPKTPGKLNLDTPAGLVAIEYEQNGQYVERVRLTNVPAFLYAEGLEVECPDLG
NLKVDVAYGGNFYAIVEPQENYTDMEDYSALQLIAWSPILRERLNEKYKFQHPLLPDINRLSHILWTGKPKHPEAHARNA
VFYGDKAIDRSPCGTGTSARMAQLAAKGKLKPGDEFVHESIIGSLFHGRVERATEVVGQDRTLPAIIPSIAGWARMTGYN
TIFIDDRDPFAHGFTVA
>A9AQW9 5.1.1.8~~~prdF~~~4-hydroxyproline 2-epimerase 2~~~COG3938
MTIHGFTCIEGHTEGMPVRMVIDGAPTLQGATMNARREDFVAHHDWVRRTLMLEPRGHAHMSGTIFYPPVSDNADFSLLF
IETSGCLPMCGHATIGSIAFAIEERLVVPKRPGTVTVDVPAGQIVARYQTDGERVTSVRFTNVPSFLLKRDVEIAIPALG
TLAVDIAYGGNFYPIVEVQPNFPGCEHFTPDELLAWGRDVQRAVGDALEVVHPDNPAIRGVRHCMWTGKPIADDAHGRAV
VIAGDSLVDRSPCGTGSSARVAQRFARGWLKEGEAYCHESLIGSRFIGRVESTVRLESGVDAVLSSIEGRAWVTGRAQYR
VDPSQPYALGFSLQEFVN
>A3M4A9 5.1.1.8~~~~~~4-hydroxyproline 2-epimerase~~~
MKTVKILDSHTGGEPTRLVLEGFPDLGTGDMESRRKILSEQYDHFRRATMLEPRGNDVLVGALLCKPVNPKASAGVIFFN
NTGYLGMCGHGTIGLVASLAHLGKIQVGTHLIETPVGDVEATLHEDHSVSVRNVPAYRYKKAVEVDVEKYGKVTGDIAWG
GNWFFLINDHGQRVASDNLDQLTEYAWTVRQALTAQGITGKDGQEIDHIELFASDTEADSKNFVLCPGKAYDRSPCGTGT
SAKIACLAADGKLEPGKLWKQASIIGSQFIASYEQAGEYVIPTIRGEAYMSAEATLFMDENDPFAWGIQL
>B0VB44 5.1.1.8~~~~~~4-hydroxyproline 2-epimerase~~~
MKTVKILDSHTGGEPTRLVLEGFPDLGTGDIESRRKILSEQYDHFRRATMLEPRGNDVLVGALLCKPVNPKASAGVIFFN
NTGYLGMCGHGTIGLVASLAHLGKIQVGTHLIETPVGDVEATLHEDHSVSVRNVPAYRYKKAVEVNVEKYGKVTGDIAWG
GNWFFLINDHGQRVASDNLDQLTEYAWTVRQALTAQGITGKDGQEIDHIELFASDTEADSKNFVLCPGKAYDRSPCGTGT
SAKIACLAADGKLEPGKLWKQASIIGSQFIASYEQAGEYVIPTIRGEAYMSAEATLFMDENDPFAWGIQL
>A9CKB4 5.1.1.8~~~~~~4-hydroxyproline 2-epimerase~~~COG3938
MRWKRTLQLLDVHCEGEIGRVVTGGAPKIPGNTVAEQLHWMNTDPQGEALRRFLTLEPRGTPMGSVDLLLPPKHPDAHAA
FVILQPDQAHASSGSNSICATTALLESGMVEMQEPETVIILETAAGLVKATATCRDGRCEKVKLTMVPSFVHELDVSIDT
PEWGRVTMDISYGGIFYALVDVRQIGLTIEKANAAKLVAAGMTLKDLVNREMTVVHPEIPAISGVAYVMFRDVDADGSIR
TCTTMWPGRADRSPCGTGNSANLATLYARGKVKVGDEYKSRSIIGSEFDVGLSAVTEVAGRPAVIPTIAGRGFTFGLHQV
GLDPFDPLGDGFAMTDVWGPEAGNI
>B9JQV3 5.1.1.8~~~~~~4-hydroxyproline 2-epimerase~~~COG3938
MRWKRMMQLLDVHCEGEIGKVAIGGVPKIPGDTVADQLHWLNTDPKGRELRHFLVLEPRGAPIGSVNLLLPAKDSRADAA
FIILQPDQAHASSGSNSICVTTALLESGMIEMQEPETVVMLETAAGLVKAVAQCRDGHCDSVTLTMVPSFVHELDAQIAT
ESWGEIRFDLAYGGVFYALVDVRQLGLTIEPGNARRLVEAGMLLKGEINQRIQVVHPDIPAISGVAYVMFRDEDPDGAVR
TCTTMWPGRVDRSPCGTGNSANLATLHARGRVKPGDSFLSRSIIGSQFTVGLQGLTTVAGRSAVIPTITGRGFTYGIHQV
ALDAFDPLGGGFVLTDVWGAAAETIKI
>Q2YLF3 5.1.1.8~~~prpA~~~4-hydroxyproline epimerase~~~
MARHSFFCVDGHTCGNPVRLVAGGGPNLNGSTMMEKRAHFLAEYDWIRTGLMFEPRGHDMMSGSILYPPTRPDCDVAVLF
IETSGCLPMCGHGTIGTVTMAIEQGLVTPKTPGKLNLDTPAGLVAIEYEQDGQYVERVRLTNVPAFLYAEGLEVECPDLG
PIKVDVAYGGNFYAIVEPQENYTDMDDYSALQLIAWSPVLRQRLNEKYKFQHPELPDINRLSHILWTGKPKHPQAHARNA
VFYGDKAIDRSPCGTGTSARMAQLAAKGKLKPCDEFIHESIIGSLFHGRVERAAEVAGRPAIVPSIAGWARMTGYNTIFI
DDRDPFAHGFSMA
>Q57B94 5.1.1.8~~~~~~4-hydroxyproline epimerase~~~
MARHSFFCVDGHTCGNPVRLVAGGGPNLNGSTMMEKRAHFLAEYDWIRTGLMFEPRGHDMMSGSILYPPTRPDCDVAVLF
IETSGCLPMCGHGTIGTVTMAIEQGLVTPKTPGKLNLDTPAGLVAIEYEQDGQYVERVRLTNVPAFLYAEGLEVECPDLG
PIKVDVAYGGNFYAIVEPQENYTDMDDYSALQLIAWSPVLRQRLNEKYKFQHPELPDINRLSHILWTGKPKHPQAHARNA
VFYGDKAIDRSPCGTGTSARMAQLAAKGKLKPCDEFIHESIIGSLFHGRVERAAEVAGRPAIVPSIAGWARMTGYNTIFI
DDRDPFAHGFSMA
>Q8YJ29 5.1.1.8~~~~~~4-hydroxyproline epimerase~~~COG3938
MARHSFFCVDGHTCGNPVRLVAGGGPNLNGSTMMEKCAHFLAEYDWIRTGLMFEPRGHDMMSGSILYPPTRPDCDVAVLF
IETSGCLPMCGHGTIGTVTMAIEQGLVTPKTPGKLNLDTPAGLVAIEYEQDGQYVERVRLTNVPAFLYAEGLEVECPDLG
PIKVDVAYGGNFYAIVEPQENYTDMDDYSALQLIAWSPVLRQRLNEKYKFQHPELPDINRLSHILWTGKPKHPQAHARNA
VFYGDKAIDRSPCGTGTSARMAQLAAKGKLKPGDEFIHESIIGSLFHGRVERAAEVAGRPAIVPSIAGWARMTGYNTIFI
DDRDPFAHGFSAA
>Q8FYS0 5.1.1.8~~~~~~4-hydroxyproline 2-epimerase~~~
MARHSFFCVDGHTCGNPVRLVAGGGPNLNGSTMMEKRAHFLAEYDWIRTGLMFEPRGHDMMSGSILYPPTRPDCDVAVLF
IETSGCLPMCGHGTIGTVTMAIEQGLVTPKTPGKLNLDTPAGLVAIEYEQDGQYVERVRLTNVPAFLYAEGLEVECPDLG
PIKVDVAYGGNFYAIVEPQENYTDMDDYSALQLIAWSPVLRQRLNEKYKFQHPELPDINRLSHILWTGKPKHPQAHARNA
VFYGDKAIDRSPCGTGTSARMAQLAAKGKLKPGDEFIHESIIGSLFHGRVERAAEVAGRPAIVPSIAGWARMTGYNTIFI
DDRDPFAHGFSVA
>B4EHE6 5.1.1.8~~~~~~4-hydroxyproline 2-epimerase~~~COG3938
MKRIQIIDSHTGGEPTRLVVSGFPSLGDGTMAERRDVLAREHDRYRTACILEPRGSDVLVGALLCDPVAPDAAAGVIFFN
NSGYLGMCGHGTIGVVRTLHHMGRIAPGVHRIETPVGTVEATLHDDLSVSVRNVPAYRHAQGVALDVPGYGPVKGDIAWG
GNWFFLISDHGQRVAGDNVAALTAYASAVREGLERAGITGANGGEIDHIELFADDPEHDSRSFVLCPGLAYDRSPCGTGT
SAKLACLAADGKLAPGAVWRQASVIGSVFHASYERADGGIVPTIRGSAHLSAEATLLIEEDDPFGWGIGS
>Q0B9R9 5.1.1.8~~~~~~4-hydroxyproline 2-epimerase~~~COG3938
MMKRIQIIDSHTGGEPTRLVVSGFPSLGNGTMAERRDVLAREHNRYRTACILEPRGSDVMVGALLCEPVSPEAAAGVIFF
NNSGYLGMCGHGTIGVVRTLHHMGRIEPGVHRIETPVGTVEATLHDDLSVSVRNVLAYRHAKAVAVDVPGYGPVKGDIAW
GGNWFFLISDHGQRVAGDNVAALTAYSSAVREGLERAGITGANGGEIDHIELFADDAEHDSRSFVLCPGHAYDRSPCGTG
TSAKLACLAADGKLEPGVVWRQASVIGSVFQASYAQADGGIVPTIRGSAHLSAEATLLIEEDDPFGWGIVS
>Q3JHA9 5.1.1.8~~~~~~4-hydroxyproline 2-epimerase~~~
MRISTLDRRDMKHIHIIDSHTGGEPTRVVVSGFPALGGGTMAERLAVLAREHDRYRAACILEPRGSDVLVGALLCEPVSA
GAAAGVIFFNNAGYLGMCGHGTIGLVRTLHHMGRIGPGVHRIETPVGDVEATLHDDLSVSVRNVLAYRHAKDVVVDVPGH
GAVTGDVAWGGNWFFLVSDHGQRVAGENVAALAAYASAVRAALERAGVTGRDGAPIDHIELFADDPEYDSRSFVLCPGHA
YDRSPCGTGTSAKLACLAADGKLAAGVTWRQASVIGSVFSASYAAAEGGVVPTIRGSAHLSAEATLVIEDDDPFGWGIAS
>C5ZMD2 5.1.1.8~~~~~~4-hydroxyproline 2-epimerase~~~
MRISTLDRRDMKHIHIIDSHTGGEPTRVVVSGFPALGGGTMAERLAVLAREHDRYRAACILEPRGSDVLVGALLCEPVSA
GAAAGVIFFNNAGYLGMCGHGTIGLVRTLHHMGRIGPGVHRIETPVGDVEATLHDDLSVSVRNVLAYRHAKDVVVDVPGH
GAVTGDVAWGGNWFFLVSDHGQRVAGENVAALAAYASAVRAALERAGVTGRDGAPIDHIELFADDPEYDSRSFVLCPGHA
YDRSPCGTGTSAKLACLAADGKLAAGVTWRQASVIGSVFSASYAAAEGGVVPTIRGSAHLSAEATLVIEDDDPFGWGIAS
>Q63NG7 5.1.1.8~~~~~~4-hydroxyproline epimerase~~~COG3938
MKHIHIIDSHTGGEPTRVVVSGFPALGGGTMAERLAVLAREHDRYRAACILEPRGSDVLVGALLCEPVSAGAAAGVIFFN
NAGYLGMCGHGTIGLVRTLHHMGRIGPGVHRIETPVGDVEATLHDDLSVSVRNVLAYRHAKDVVVDVPGHGAVTGDVAWG
GNWFFLVSDHGQRVAGENVAALAAYASAVRAALERAGVTGRDGAPIDHIELFADDPEYDSRSFVLCPGHAYDRSPCGTGT
SAKLACLAADGKLAAGVTWRQASVIGSVFSASYAAAEGGVVPTIRGSAHLSAEATLVIEDDDPFGWGIAS
>Q2T3J4 5.1.1.8~~~~~~4-hydroxyproline 2-epimerase~~~
MKHLHIIDSHTGGEPTRVVVSGFPPLGDGPMAERLAALARDHDRYRTACILEPRGSDVLVGALLCEPVSAGAAAGVIFFN
NAGYLGMCGHGTIGLVRTLHHMGRIAPGVHRIETPVGDVEATLHDDLSVSVRNVLAYRHAKDVVVEVPGHGSVTGDVAWG
GNWFFLVSDHGQRIAGENVAALAAYASAVRAGLERAGVTGRDGAPIDHIELFADDPEHDSRSFVLCPGHAYDRSPCGTGT
SAKLACLAADGKLAPGAAWRQASVIGSVFSASYERAESGVVPTIRGSAHLSAEATLLIEDDDPFGWGIVS
>A3PPJ8 5.1.1.8~~~~~~4-hydroxyproline 2-epimerase~~~
MRVQDVYNVIYTHTEGEPLCIIYSGVPYPAGSTILEKRAFLEENYDWLRKALMREPRGHADMFGVFLTPPSSRDYDAGLI
YIDGKEYSHMCGHGTIAVAMAMVANGLVARDPSGLTRIRFETTAGLVVAEVAHEDDRVLWTRFENVPAYVAAQDIAFELP
GYGPLKADLVWGGNYFGIIDLRGTSLRIAPENGSELSRMGLIAREEIRKKVTVQHPTEAHINNLNFVTFWHEPTIEGCLY
KNVHVFSAGQLDRSPGGTGTSAMMAYFEARGVIGLNQPITSEGLLGSGTFEGCLIGETTLGTVRAVRPTVKGTAGMLGTA
SWTINREDPVDAGFLVL
>Q3IWG2 5.1.1.8~~~~~~4-hydroxyproline 2-epimerase~~~COG3938
MRVQDVYNVIYTHTEGEPLCIIYSGVPYPAGSTILEKRAFLEENYDWLRKALMREPRGHADMFGVFLTPPSSRDYDAGLI
YIDGKEYSHMCGHGTIAVAMAMVANGLVARDPSGLTRIRFETTAGLVVAEVAHEGDRVLWTRFENVPAYVAAQDIAFELP
GYGPLKADLVWGGNYFGIIDLRGTSLRIAPENGSELSRMGLIAREEIRKKVKVQHPTEAHINNLNFVTFWHEPTIEGCLY
KNVHVFSAGQLDRSPGGTGTSAMMAYFEARGVIGLNQPITSEGLLGSGTFEGCLIGETTLGTVRAVRPTVKGTAGMLGTA
SWTINREDPVDAGFLVL
>Q1QU06 5.1.1.8~~~~~~4-hydroxyproline 2-epimerase~~~COG3938
MKRVHVIDSHTAGEPTRLVMEGMPALSGRTIAEKCDDFRDNHDAWRRAIMLEPRGHDVLVGALYCAPESSDASCGVIFFN
NSGYLGMCGHGTIGLVASLHHLGQLTPGCHKIDTPAGPVSATLHDDGAVTVRNVLSYRHRRRVPVEVPGYGTVHGDIAWG
GNWFFLVSDHDMTLELDNVEALTDYTWAIRQALEAQSITGENGGVIDHIELFCDDREADSRNFVLCPGKAYDRSPCGTGT
SAKLACLAADGKLAPGQVWTQASICGSRFEAFYEREGDGIRPSIKGRAYLSADATLLIDERDPFAWGIASP
>Q7NU77 5.1.1.8~~~~~~4-hydroxyproline 2-epimerase~~~COG3938
MKQVEIIDSHTGGEPTRLVLSGFPALAGATMADKRDALRERHDQWRRACLLEPRGSDVLVGALYCEPVSPDAACGVIFFN
NTGYIGMCGHGTIGLIASLHCLGRIAPGAHKIDTPVGPVDAVLHEDGSVTLRNVPAYRYRRQAAVEVPGHGTVIGDIAWG
GNWFFLVAEHGLSVRLDNVAALSAFSCATMQALEEQGITGADGARIDHVELFADDEQADSRNFVMCPGKAYDRSPCGTGT
SAKLACLAADGKLAEGEQWVQAGITGSRFVGHYQREGDFIRPYITGRAHITARAMLLIDEQDPFAWGI
>A1BBM5 5.1.1.8~~~hypF~~~4-hydroxyproline 2-epimerase~~~COG3938
MTQYIFPCIDGHTCGNPVRLVAGGAPRLEGATMLEKRAHFLREFDWIRTGLMFEPRGHDMMSGAILYPPTRGDCDVAVLY
IETSGCLPMCGHGTIGTITMGIENGLIVPRTPGRLSIETPAGKVDIEYRQEGRHVEEVRLTNVPGFLYAEGLTAEVEGLG
EIVVDVAYGGNFYAIVEPQKNFRDMADHTAGELIGWSLTLRAALNQKYEFTHPEHPQINGLSHIQWTGAPTVPGAHARNA
VFYGDKAIDRSPCGTGTSARMAQLAARGRLGVGDEFWHESIIGSIFKGRIEAAATVAGRDAIIPSIAGWARQTGLNTIFI
DAERDPFAHGFVVK
>D5SQS4 5.1.1.8~~~~~~4-hydroxyproline 2-epimerase~~~COG3938
MTYIPRQWIQVVDSHTGGEPTRLIYDGQHWPFAGSREGALTSQESPSPSVLSGLRKAIDRSSLILPKTMSERRQFLETEA
DWLRTASLLEPRGSDVLVGAILTPPEHASSQAGVVFCNNTGYLGMCGHGMIGVIVSLGQMGLIAPGPVTIDTPVGSIAAT
WSGSASVTLTNVWSYRYRHAVSLSVPGLGVVTGDIAWGGNWFFLIGEEVHQKSLDLGNLSDLLAYTSQIRSELGRQGIAG
AQGAEIDHVELFASCDSSIADSQNFVLCPGGAYDRSPCGTGTSAKLACLVADGKVAEGGLWRQKSIVGSCFQAKALSIRE
GERGLEVLPQLTGEAYVTGVSTLQIDEADPFRWGILPPQ
>Q9I476 5.1.1.8~~~~~~4-hydroxyproline 2-epimerase~~~
MQRIRIIDSHTGGEPTRLVIGGFPDLGQGDMAERRRLLGERHDAWRAACILEPRGSDVLVGALLCAPVDPEACAGVIFFN
NSGYLGMCGHGTIGLVASLAHLGRIGPGVHRIETPVGEVEATLHEDGSVSVRNVPAYRYRRQVSVEVPGIGRVSGDIAWG
GNWFFLVAGHGQRLAGDNLDALTAYTVAVQQALDDQDIRGEDGGAIDHIELFADDPHADSRNFVLCPGKAYDRSPCGTGT
SAKLACLAADGKLLPGQPWRQASVIGSQFEGRYEWLDGQPGGPIVPTIRGRAHVSAEATLLLADDDPFAWGIRR
>Q4KGU2 5.1.1.8~~~~~~4-hydroxyproline 2-epimerase~~~COG3938
MKKITVIDSHTGGEPTRLVIDGFPDLGRGSMAERLQILEREHDQWRRACVLEPRGSDVLVGALLCQPQAGDACAGVIFFN
NSGYLGMCGHGTIGLVRSLYHLGRIDQGVHRIETPVGTVEATLHEDLSVSVRNVPAYRYRTQVMLQLPGHGKVHGDIAWG
GNWFFLISDHGQRIALDNVEALTHYTRDVRQALEAAGITGAEGGVIDHIELFADDPQADSRNFVLCPGKAYDRSPCGTGT
SAKLACLAADGKLAPGQAWRQASVIGSQFSAHYEKVGEQLIPILRGSAHISAEATLLLDDSDPFVWGIGS
>A5VZY6 5.1.1.8~~~~~~4-hydroxyproline 2-epimerase~~~COG3938
MKQIHVIDSHTGGEPTRLVMKGFPQLHGRSMAEQRDELRELHDRWRRACLLEPRGNDVLVGALYCPPVSADATCGVIFFN
NAGYLNMCGHGTIGLVASLQHLGLIAPGVHKIDTPVGQVSATLHEDGAITVANVPSYRYRQHVAVNVPGHGVVHGDIAWG
GNWFFLVAEHGQRIELDNREVLTEYTWAMLKALEAQGITGENGAPIDHVELFADDPNADSRNFVMCPGKAYDRSPCGTGT
SAKLACLAADGTLAEGQTWVQASITGSQFHGRYERDGERIRPFITGRAHMTADSTLLIDEQDPFAWGI
>Q88NF3 5.1.1.8~~~proR~~~4-hydroxyproline 2-epimerase~~~COG3938
MKQIHVIDSHTGGEPTRLVMKGFPQLRGRSMAEQRDELRELHDRWRRACLLEPRGNDVLVGALYCPPVSADATCGVIFFN
NAGYLNMCGHGTIGLVASLQHMGLITPGVHKIDTPVGQVSATLHEDGAITVANVPSYRYRQQVAVDVPGHGVVRGDIAWG
GNWFFLVSEHGQRIELDNREALTEYTWAMLKALETQGVTGENGAPIDHIELFADDPNADSRNFVMCPGKAYDRSPCGTGT
SAKLACLAADGKLAEGQTWVQASITGSQFHGRYARDGERIRPFITGRAYMTADSTLLIDEQDPFAWGI
>Q1QBF3 5.1.1.8~~~~~~4-hydroxyproline 2-epimerase~~~COG3938
MASSLFNFKIIDSHTGGEPTRMVYDGFPDLVGDTIQDKLQSFKQNFDHLRQSIILEPRGNDVLVGALLLPASHPKATAGV
IFFNNAGYLGMCGHGTIGVIVSLAYQQKISAGVHWLETPVGLVKATLHDDGSCSVQNVPSYRYKKQVEVHVPELGLIRGD
IAWGGNWFFLVSEHGQDIQASNVKQLTQVTMQIKQALVAANITGENSSEIDHIELFADSDDTQVDSKNFVLCPGSAYDRS
PCGTGTSAKLACLAADNKLAPEQLWQQQGVVGSVFTGSYQYASELNTTLKNPAGAAYPEQTIIPTICGHAYVCAETTLIM
QEDDPFKWGIPS
>Q2KD13 5.1.1.8~~~~~~4-hydroxyproline 2-epimerase~~~COG3938
MRWKRTIQLLDVHAEGEIGRVAIGGVPKIPGETIAAQLHWLNTDPKGDELRRFLCLEPRGAPIGSVNLLLPARHPDADAA
FIILQPDQAHASSGSNSICVTTALLESGIVEMQEPETIVTLETAAGLVKATATCRDGRCEKVKLTMVPSFVHELDVEIDT
PHWGKIKADLCYGGIFYALVDVGQINLTIEKANAAGLVQAGMILKELINRDIKVVHPEIPAISGVAYVMFRDTEADGTVR
TCTTMWPGRADRSPCGTGNSANLATLHARGKAKVGDVFTSKSIIGSEFEVGLQAVTEVAGRPAVIPTITGRGFTFGLTQV
ALDPFDPHPGGFALTDVWGPSAGEI
>Q92WS1 5.1.1.8~~~~~~Probable 4-hydroxyproline 2-epimerase~~~COG3938
MATHTFSCIDGHTCGNPVRLVSGGGPRLEGANMLEKRAHFLKEFDWIRTGLMFEPRGHDMMSGSILYPPTRPDCDVAVLF
IETSGCLPMCGHGTIGTITMGIENGLITPREPGKLSIDAPAGKVDITYRQEGRFVEEVRLTNVPSFLYAEGLAAEVEGLG
EIVVDVAYGGNFYAIVEPQKNFRDMADHTAGELVGWSPKLRAALNAKYEFVHPEHPEIRGLSHIQWTGKPTQPEAHARNA
VFYGEKAIDRSPCGTGTSARIAQLAAKGKLKVGDEFVHESIIGSLFKGRVEAAAKVADRDAIIPSIAGWARMTGINTIFI
DDRDPFAHGFVVR
>B9R4E3 5.1.1.8~~~~~~4-hydroxyproline 2-epimerase~~~
MRVIDSHTAGEPTRLVVEGGPDLGPGSLIEKAACLEAEHMDFCASVVLEPRGHDAIIGALLLPPSQPDCAAAVIYFNNLQ
NLGMCGHATIGLAVTLAHMGRIDPGRHKFETPVGIVEVDLQDANTVSVVNVESYRLHKDVTVEVPGHGKVTGDVAWGGNW
FFLVKESPFDLTLENVPALTAYTKTIRQALENAGVTGTDCAWIDHIELFGPPKDPFAQSRNFVLCPGGAYDRSPCGTGCS
AKLACLAEDGVLAPGEDWIQESVIGSTYRISYQPGTTKGVIPTITGQAFVTSDAHLIFNPADPYRFGIRPQNWTTSWITR
TLSVEHG
>A0NXQ7 5.1.1.8~~~~~~4-hydroxyproline 2-epimerase~~~COG3938
MRVIDSHTAGEPTRVVLDGGPDLGSGTLAERAARLEAEHLDFCASVVLEPRGHDAIIGALLVPPSDPACAAGVIYFNNLQ
NLGMCGHATIGLGVTLAHLGRIRPGRHRFETPVGVVEIDLIDANTVSVVNIESYRLAKDVTVEVEGVGPVTGDVAWGGNW
FFLVKNSPIALTGANIRPLTDLTLKIRTALEKAGVTGKDGAWIDHIELFGPAEDPAAQSRNFVLCPGGAYDRSPCGTGCS
AKLACLAADGALAPGQDYLQESVIGSTYKISYQPGPGGGVIPTITGQAFVTSDATLIFNPADPYRSGIRL
>Q5LKW3 5.1.1.8~~~~~~4-hydroxyproline 2-epimerase~~~COG3938
MHVIDSHTGGEPTRVILSGGPHLGSGPLSERAARLARESRAFYRSVMLEPRGQPAMVGALLVEPVDPDCITGVIFFDAEA
VLGMCGHGTIGLTVTLAHMGRIRAGTHKIETPVGIVEVCLSDANTVTITNIESRRVHRARQVDVDGFGPVTGDVAYGGNW
FFIVDPSPIPIERTNIRALSDAALAIRTAVIANGIGGEEGQPIDHVIFYEMSPRSAVHSRSFVFCPDGTYDRSPCGTGSS
ARLACLAAEGLLNAGEEIIQESVIGSTYRLSYQPGPNGGVIPKITGQAHVMAESTLHFHTDDPYRNGICHAPQ
>A3QFI1 5.1.1.8~~~~~~4-hydroxyproline 2-epimerase~~~COG3938
MLKGTFFCVDAHTCGNPVRLVTSGHPDLKGRTMSEKRQDFLAQYDWIRKALMFEPRGHDMMSGAFLYPPCSDNADAAILF
IETSGCLPMCGHGTIGTITAALESGLLTPKMPGQLTIDVPAGQIKVQYQQTGAKVDWVKIFNVPAYLAHKDVVLDIPGLG
PLKIDVSYGGNYYAIVDPQANFPGLRHWSAGDILRWSPIVREVAHRELNCVHPDDPTVNGVSHVLWTGDTISEGSNGANA
VFYGDKAIDRSPCGTGTSARLAQLYSRGELKVGDEYTHESIIGSQFVGRIEAATKVGAFDAIMPSIKGWARITGHNAITV
DDNDPYAFGFQVV
>D2QN44 5.1.1.8~~~~~~4-hydroxyproline 2-epimerase~~~COG3938
MAEFHFFCIDAHTCGNPVRVVTGGSIPFLQGNSMSEKRQHFLREFDWIRKGLMFEPRGHDMMSGSILYPPTDPANDAGVL
FIETSGCLPMCGHGTIGTVTVAIEQNLIRPKTPGVLNLEVPAGLVRAEYQQEGKKVTSVKITNIKSYLAAEKLTVDCPDL
GLLTVDVAYGGNFYAIVDPQPNFPGLEHYKAEQLIGWARVMRERMNEQYTFVHPENPTINGLSHILWTGKPIAETSTARN
AVFYGDKAIDRSPCGTGTSARMAQWYAQGRLKPGETFVHESIIGSIFNGRIEAETELANQPAIVPSIEGWARIHGYNHLI
LDEEDPYVFGFQVI
>D7A0Y3 5.1.1.8~~~~~~4-hydroxyproline 2-epimerase~~~COG3938
MARHSFFCIDGHTCGNPVRLVAGGGPNLQGANMIEKRAHFLAEYDWIRTGLMFEPRGHDMMSGSILYPPTRPDCDVAILF
IETSGCLPMCGHGTIGTVTMAIEHGLVTPKIPGVLMLDTPAGVVKAEYRQEGQYVEEVRITNVPAFLYARGLTAECPGLG
EVMVDVAYGGNFYAIVEPQEHFRDMADFTAGELIGMSGALRKALNAKYEFVHPEKPEIRGLSHILWTGAPKHAEAHARNA
VFYGDKAIDRSPCGTGTSARIAHWAANGKLKVGDDFVHESIIGSLFKGRVEATARVGNVDAIIPSIGGWARMTGYNTIFI
DDRDPFAHGFVVV
>D2AV87 5.1.1.8~~~~~~4-hydroxyproline 2-epimerase~~~COG3938
MRTKRVFHAVDSHTEGMPTRVITGGVGVIPGSTMAERREHFLAEMDHVRTLLMYEPRGHSAMSGAILQPPTRPDADYGVL
YIEVSGCLPMCGHGTIGVATVLVETGMVEVVEPVTTIRLDTPAGLVVAEVRVEDGAATAVTITNVPSFSAGLDRTVKVPG
IGEVTYDLAYGGNFYAILPIESVGLPFDRAHKQQILDAGLAIMDAINEQDEPVHPLDAGIRGCHHVQFTAPGSDARHSRH
AMAIHPGWFDRSPCGTGTSARMAQLHARGELPLDTDFVNESFIGTRFVGRLVEETEVTDLPAVVPTITGRAWVTGTAQYF
LDPRDPFPEGFLL
>Q8P833 5.1.1.8~~~~~~4-hydroxyproline 2-epimerase~~~COG3938
MHTIDVIDSHTAGEPTRVVLAGFPDLGDGDLAQCRERFRSDFDHWRSAIACEPRGSDTMVGALLLPPRDPSACTGVIFFN
NVGYLGMCGHGTIGVVRTLAELGRIAPGQHRIETPVGTVGVALADDGTVSIDNVESYRHAAGVEVDVPGHGRVRGDVAWG
GNWFFITEQAPCALGLAQQRELTAYTEAIRLALEAAGITGEAGGEIDHIEISGVAPDGSGAARNFVLCPGLAYDRSPCGT
GTSAKLACLAADGKLAEGERWLQQGILGSAFEGSYRHSGRGIAPRISGHAFITARSQLLIDPADPFAWGIVA
>Q01468 5.3.2.6~~~xylH~~~2-hydroxymuconate tautomerase~~~
MPIAQIHILEGRSDEQKETLIREVSEAISRSLDAPLTSVRVIITEMAKGHFGIGGELASKVRR
>P70994 5.3.2.6~~~ywhB~~~2-hydroxymuconate tautomerase~~~COG1942
MPYVTVKMLEGRTDEQKRNLVEKVTEAVKETTGASEEKIVVFIEEMRKDHYAVAGKRLSDME
>P49172 5.3.2.6~~~dmpI~~~2-hydroxymuconate tautomerase~~~
MPIAQLYIIEGRTDEQKETLIRQVSEAMANSLDAPLERVRVLITEMPKNHFGIGGEPASKVRR
>Q988C9 1.1.99.42~~~padh1~~~4-pyridoxate dehydrogenase~~~COG2303
MPHAESYDYIIVGAGSAGCVLANRLSADPRCSVLLLEAGGWDRDPMIHIPLGWGKILTERRHDWMYFCEPEDNVGGRRVE
CARGKVIGGSSSTNAMAYVRGNRGDYDRWAATGLSEWSYDKVLPYFRKQESWEGGANQYRGGNGPVSTQFCRYKDTLIDA
FAQASVQAGYAQTKDYNGERQEGFGRLQMTISKGRRASTASAYLRPVLKRPNLTVLTEASATRIVLEGARATGVTINHRG
GERTVLARKEVLLAGGVINTPQLMMLSGIGAQDELAAHGVQTRVNLPAVGKNLQDHVSVILMYRRRAPGGPFLRNMRADR
IGFDFVKTYLTGRGFSGDVPGGVVAFLKSGPARPLPDVQLLFTAAPLAAWPYFKPFKAPFADGFATRIVATQPESRGAVK
LASADPSAAPLIHQNFLASPKDWESLRAGFRVARDLAAQPSMQPFIEAEFFPGPKCQSDDEIDEHIRKTSITVHHPAGTC
RMGADAASVVDPQLRVRGVDRLRVVDASVMPDLVCGNINAAVIMIAEKAADLIASSKEGRAVQ
>Q8CTG7 3.1.3.-~~~~~~Putative 5'(3')-deoxyribonucleotidase~~~COG4502
MTRQRIAIDMDEVLADTLGAVVKAVNERADLNIKMESLNGKKLKHMIPEHEGLVMDILKEPGFFRNLDVMPHAQEVVKQL
NEHYDIYIATAAMDVPTSFHDKYEWLLEYFPFLDPQHFVFCGRKNIILADYLIDDNPKQLEIFEGKSIMFTASHNVYEHR
FERVSGWRDVKNYFNSIEK
>P76491 3.1.3.89~~~yfbR~~~5'-deoxynucleotidase YfbR~~~COG1896
MKQSHFFAHLSRLKLINRWPLMRNVRTENVSEHSLQVAMVAHALAAIKNRKFGGNVNAERIALLAMYHDASEVLTGDLPT
PVKYFNSQIAQEYKAIEKIAQQKLVDMVPEELRDIFAPLIDEHAYSDEEKSLVKQADALCAYLKCLEELAAGNNEFLLAK
TRLEATLEARRSQEMDYFMEIFVPSFHLSLDEISQDSPL
>P0AC28 6.3.3.2~~~ygfA~~~5-formyltetrahydrofolate cyclo-ligase~~~COG0212
MIRQRRRALTPEQQQEMGQQAATRMMTYPPVVMAHTVAVFLSFDGELDTQPLIEQLWRAGKRVYLPVLHPFSAGNLLFLN
YHPQSELVMNRLKIHEPKLDVRDVLPLSRLDVLITPLVAFDEYGQRLGMGGGFYDRTLQNWQHYKTQPVGYAHDCQLVEK
LPVEEWDIPLPAVVTPSKVWEW
>P44905 6.3.3.2~~~~~~5-formyltetrahydrofolate cyclo-ligase~~~COG0212
MNTQKRQQIRTEIRKIRANLTALQQHQAEQSVTQHALNLIEQRQAKNIALYFSFDGEISTKALIQSLWMQNKNVYLPVLH
PFTKHYLLFLRYLPDTPMKQNQFGIWEPKLNVQNVLPLNELDILFTPLVAFDKKGNRLGMGGGFYDRTLQNWQNKSFIPV
GLAYQCQQVENLPTEHWDVPLFDILVG
>A0R3H2 6.3.3.2~~~~~~5-formyltetrahydrofolate cyclo-ligase~~~COG0212
MSPRSKSQLRTALLQNRRSVPEAVREGEAEALRGWLSGLKISGRTVCAYVPVGSEPGSIALLDTLLELGARVLLPVARND
AAGIPLPLQWGKYRPGTLVAAEFGLREPPPPWLPAETIGEADVILVPALAVDRSGARLGRGAGFYDRTLHHAAATAQVIA
VVRDDELLDEIPAEPHDVAMTHVLTPKRGIVALR
>P44569 3.1.3.5~~~~~~NAD 5'-nucleotidase~~~COG0737
MLLSKKSASFALSAFAMLFTSVALAKEAPQAHKAVELSILHINDHHSYLEPHETRINLNGQQTKVDIGGFSAVNAKLNKL
RKKYKNPLVLHAGDAITGTLYFTLFGGSADAAVMNAGNFHYFTLGNHEFDAGNEGLLKLLEPLKIPVLSANVIPDKNSIL
YNKWKPYDIFTVDGEKIAIIGLDTVNKTVNSSSPGKDVKFYDEIATAQIMANALKQQGINKIILLSHAGSEKNIEIAQKV
NDIDVIVTGDSHYLYGNDELRSLKLPVIYEYPLEFKNPNGDPVFVMEGWAYSAVVGDLGVKFSPEGIASITRKIPHVLMS
SHKLQVKNAEGKWTELTGDERKKALDTLKSMKSISLDDHDAKTDMLISKYKSEKDRLAQEIVGVITGSAMPGGSANRIPN
KAGSNPEGSIATRFIAETMYNELKTVDLTIQNAGGVRADILPGNVTFNDAYTFLPFGNTLYTYKMEGSLVKQVLEDAMQF
ALVDGSTGAFPYGAGIRYEANETPNAEGKRLVSVEVLNKQTQQWEPIDDNKRYLVGTNAYVAGGKDGYKTFGKLFNDPKY
EGVDTYLPDAESFIKFMKKHPHFEAYTSSNVKFNASTDALPKK
>Q9I767 3.1.3.5~~~~~~5'-nucleotidase~~~
MRDAALRYPNILFDLDGTLTDPREGITRSVQFALARLGIDEPDLARLEHFIGPPLLQCFMQTYGFDEARAWEAVNHYRER
FRVTGLYENRVFDGIPELLEALVGRGHTLYVATSKPGVFAREIARHFAFDRHFKAIYGSELDGTRTHKEELIRHLLDSEG
LAAEHCLMIGDRMHDLLGASRNGVACIGVGYGFGSEDELRAHQPTHYCADLAALRQVLESH
>Q70AC7 2.1.3.1~~~~~~Methylmalonyl-CoA carboxyltransferase 5S subunit~~~
MSPREIEVSEPREVGITELVLRDAHQSLMATRMAMEDMVGACADIDAAGYWSVECWGGATYDSCIRFLNEDPWERLRTFR
KLMPNSRLQMLLRGQNLLGYRHYNDEVVDRFVDKSAENGMDVFRVFDAMNDPRNMAHAMAAVKKAGKHAQGTICYTISPV
HTVEGYVKLAGQLLDMGADSIALKDMAALLKPQPAYDIIKAIKDTYGQKTQINLHCHSTTGVTEVSLMKAIEAGVDVVDT
AISSMSLGPGHNPTESVAEMLEGTGYTTNLDYDRLHKIRDHFKAIRPKYKKFESKTLVDTSIFKSQIPGGMLSNMESQLR
AQGAEDKMDEVMAEVPRVRKAAGFPPLVTPSSQIVGTQAVFNVMMGEYKRMTGEFADIMLGYYGASPADRDPKVVKLAEE
QSGKKPITQRPADLLPPEWEKQSKEAATLKGFNGTDEDVLTYALFPQVAPVFFEHRAEGPHSVALTDAQLKAEAEGDEKS
LAVAGPVTYNVNVGGTVREVTVQQA
>A0A0H3LKL4 1.14.13.114~~~nicC~~~6-hydroxynicotinate 3-monooxygenase~~~COG0654
MQGKPRIAVIGAGLGGTAGAALMARAGFNVRLYEQAPAFSRLGAGIHLGPNVMKIMRRIGIEDELNRQGSHPDYWYSRDW
QSGAELARIPLGDYAVSHYGATYLTVHRGDFHALMTAALPAGLLQFNKRLTRVDEDDDVVRLHFADGSVEEAEIVIGADG
VNSRLREHLLGAELPKYTGYVAHRAVFPTPLDSGSLPFDMCVKWWSDDRHMMVYFVTGKRDEIYYVTGVPEQQWDMGKSW
VPSSKAEMRAAFAGWHPTVQALIEATPEVSKWPLLERDPLPLWSRGRIVLLGDACHPMKPHMAQGAAMAIEDAAMLTRIF
EQTGLQDHAAAFRLYEDNRAERASRVQRVSHDNTWLRTNENPDWCFGYDVYAEPLVEGRRAAA
>P86491 1.14.13.114~~~nicC~~~6-hydroxynicotinate 3-monooxygenase~~~
MSQSPRIAVVGAGLGGAAAAKLLLQEGFNVRVYEQAPSFSRLGAGIHVGPNVMKILRRIGIEDALNEQGSHPDYWYSRHW
QTGDVLAQIPLGDYAVKEYGASYLTVHRGDFHALLVEALPDSVMAYGKFLTKVEDRGNVVVMHFADGTTEEADIVIGPDG
VNSRIREELLGPELPKYAGYLAHRAVFPTPEVKAGMLPFDACVKWWSDDRHMMTYFVTGKADELYYVTGVPVEKWDLNDR
WLESSKEEMREAFSGWHPTVQALIDATVEVTKWSLLERDPLPLWSRGRLVLLGDACHPMKPHMAQGAAMAIEDGAMLARC
LKEVGAHNHELAFALYEANRAERASKVQRISHDNTWLRTNEDPSWCFGYDVFNVPLVEPKVKAAA
>Q88FY2 1.14.13.114~~~nicC~~~6-hydroxynicotinate 3-monooxygenase~~~COG0654
MRGRQKIAIVGAGLGGAAAATLLQQAGFDVEVFEQAPAFTRLGAGIHIGPNVMKIFRRMGLEQKLELMGSHPDFWFSRDG
NTGDYLSRIPLGEFARREYGAAYITIHRGDLHALQIEAIQPGTVHFGKRLEKIVDEGDQVRLDFADGTHTVADIVIGADG
IHSKIREELLGAEAPIYSGWVAHRALIRGVNLAQHADVFEPCVKWWSEDRHMMVYYTTGKRDEYYFVTGVPHEAWDFQGA
FVDSSQEEMRAAFEGYHPTVQKLIDATESITKWPLRNRNPLPLWSRGRLVLLGDACHPMKPHMAQGACMAIEDAAMLTRC
LQETGLSDHRTAFALYEANRKERASQVQSVSNANTWLYSQEDPAWVYGYDLYGQQLESGEAA
>P12013 1.1.1.343~~~gntZ~~~6-phosphogluconate dehydrogenase, NAD(+)-dependent, decarboxylating~~~COG0362
MFNSIGVIGLGVMGSNIALNMANKGENVAVYNYTRDLTDQLIQKLDGQSLSPYYELEDFVQSLEKPRKIFLMVTAGKPVD
SVIQSLKPLLEEGDVIMDGGNSHYEDTERRYDELKEKGIGYLGVGISGGEVGALTGPSIMPGGDRDVYEKAAPILTKIAA
QVGDDPCCVYIGPKGAGHFTKMVHNGIEYADMQLIAEAYTFLRETLRLPLDEIASIFETWNQGELKSYLIEITAEILRKK
DEKTGQPLIDVILDKTGQKGTGKWTSMQAIDNGIPSTIITESLFARYLSSLKEERMAAQDVLAGPEAEEKHLDKDTWIEY
VRQALYMGKVCAYAQGFAQYKMSSELYGWNLPLKDIALIFRGGCIIRADFLNVISEAFSEQPNLANLLIAPYFTDKLHAY
QTGLRKVVCEGISTGISFPCLTTALSYYDGYRTGRSNANLLQAQRDYFGAHTYERTDMDGVFHTNWSE
>G5EBD7 1.1.1.343~~~~~~6-phosphogluconate dehydrogenase, NAD(+)-dependent, decarboxylating~~~COG1023
MRIGIIGLGRMGGNIAVRLTRHGHDVVVHDRTSEVTTSVVGRCEAGRATPADTLADMAKLLEGDEHRVVWVMLPAGAITE
DCVQQLGGLLGRGDIIIDGGNTYYKDDVRRSAELAEKGISYVDVGTSGGVWGLERGYCMMFGGTKETAEYIDPILSALAP
GIGDVPRTPGRDEAGHDPRAEQGYLHCGPAGSGHFVKMVHNGIEYGMMQAFAEGFDIMKSKNSPILAEKDRFELNMGDIA
EVWRRGSVVSSWLLDLTAEALTRSETLNEFSGEVADSGEGRWTIEAAIEEDVPAPVMTAALFTRFRSRSGNNFAEKILSA
QRFGFGGHVEKK
>P70718 1.1.1.44~~~gnd~~~6-phosphogluconate dehydrogenase, decarboxylating~~~
MSVKGDIGVIGLAVMGQNLILNMNDHGFKVVAYNRTTSKVDEFLEGAAKGTNIIGAYSLEDLANKLEKPRKVMLMVRAGE
VVDHFIDALLPHLEAGDIIIDGGNSNYPDTNRRVAALREKGIRFIGTGVSGGEEGARHGPSIMPGGNEEAWQFVKPVLQA
ISAKTEQGEPCCDWVGKDGAGHFVKMVHNGIEYGDMQLICEAYQFLKEGVGLSDDELQATFNEWRNTELDSYLIDITADI
LGYKDADGSRLVDKVLDTAGQKGTGKWTGINALDFGIPLTLITESVFARCVSAFKDQRVAASKLFHKTIGKVEGDKKVWI
EAVRKALLASKIISYAQGFMLIREASEHFNWNINYGNTALLWREGCIIRSRFLGNIRDAYEANPDLIFLGSDSYFKGILE
NAMSDWRKVVAKSIEVGIPMPCMASAITFLDGYTSARLPANLLQAQRDYFGAHTYERTDKPRGEFFHTNWTGRGGNTAST
TYDV
>P80859 1.1.1.44~~~gndA~~~6-phosphogluconate dehydrogenase, NADP(+)-dependent, decarboxylating~~~COG0362
MSKQQIGVIGLAVMGKNLALNIESRGFSVSVYNRSSSKTEEFLQEAKGKNVVGTYSIEEFVQSLETPRKILLMVKAGTAT
DATIQSLLPHLEKDDILIDGGNTYYKDTQRRNKELAESGIHFIGTGVSGGEEGALKGPSIMPGGQKEAHELVKPILEAIS
AKVDGEPCTTYIGPDGAGHYVKMVHNGIEYGDMQLISESYFILKQVLGLSADELHEVFAEWNKGELDSYLIEITADIFTK
KDEETGKPLVDVILDKAGQKGTGKWTSQSALDLGVPLPIITESVFARFISAMKEERVKASGLLSGPEVKPVTENKEELIE
AVRKALFMSKICSYAQGFAQMKAASEEYNWDLKYGEIAMIFRGGCIIRAAFLQKIKEAYDREPELDNLLLDSYFKNIVES
YQGALRQVISLAVAQGVPVPSFSSALAYYDSYRTAVLPANLIQAQRDYFGAHTYERTDKEGIFHTEWMK
>P00350 1.1.1.44~~~gnd~~~6-phosphogluconate dehydrogenase, decarboxylating~~~COG0362
MSKQQIGVVGMAVMGRNLALNIESRGYTVSIFNRSREKTEEVIAENPGKKLVPYYTVKEFVESLETPRRILLMVKAGAGT
DAAIDSLKPYLDKGDIIIDGGNTFFQDTIRRNRELSAEGFNFIGTGVSGGEEGALKGPSIMPGGQKEAYELVAPILTKIA
AVAEDGEPCVTYIGADGAGHYVKMVHNGIEYGDMQLIAEAYSLLKGGLNLTNEELAQTFTEWNNGELSSYLIDITKDIFT
KKDEDGNYLVDVILDEAANKGTGKWTSQSALDLGEPLSLITESVFARYISSLKDQRVAASKVLSGPQAQPAGDKAEFIEK
VRRALYLGKIVSYAQGFSQLRAASEEYNWDLNYGEIAKIFRAGCIIRAQFLQKITDAYAENPQIANLLLAPYFKQIADDY
QQALRDVVAYAVQNGIPVPTFSAAVAYYDSYRAAVLPANLIQAQRDYFGAHTYKRIDKEGVFHTEWLD
>P96789 1.1.1.44~~~gnd~~~6-phosphogluconate dehydrogenase, decarboxylating~~~COG0362
MAQANFGVVGMAVMGKNLALNVESRGYTVAIYNRTTSKTEEVYKEHQDKNLVFTKTLEEFVGSLEKPRRIMLMVQAGAAT
DATIKSLLPLLDIGDILIDGGNTHFPDTMRRNAELADSGINFIGTGVSGGEKGALLGPSMMPGGQKEAYDLVAPIFEQIA
AKAPQDGKPCVAYMGANGAGHYVKMVHNGIEYGDMQLIAESYDLLKRILGLSNAEIQAIFEEWNEGELDSYLIEITKEVL
KRKDDEGEGYIVDKILDKAGNKGTGKWTSESALDLGVPLPLITESVFARYISTYKDERVKASKVLSGPALDFSGDKKEVI
EKIRKALYFSKIMSYAQGFAQLRKASEEFDWDLPYGTIAQIWRAGCIIRAEFLQNITDAFDKDSELENLLLDDYFVDITK
RYQEAVRDVVSLAVQAGTPIPTFTSAISYYDSYRSENLPANLIQAQRDYFGAHTYERTDKAGIFHYDWYTED
>P63334 1.1.1.44~~~gnd~~~6-phosphogluconate dehydrogenase, decarboxylating~~~
MTQQIGVIGLAVMGKNLAWNIESRGYSVSVFNRSSEKTDLMVEESKGKNIHPTYSLEEFVNSLEKPRKILLMVQAGKATD
ATIDSLLPLLDDGDILIDGGNTNYQDTIRRNKALAQSAINFIGMGVSGGEIGALTGPSLMPGGQEEAYNKVADILDAIAA
KAKDGASCVTYIGPNGAGHYVKMVHNGIEYADMQLIAESYAMMKELLGMSHEDIAQTFKDWNAGELESYLIEITGDIFMK
LDENKEALVEKILDTAGQKGTGKWTSINALELGIPLTIITESVFARFISSIKEERVNASKELNGPKASFDGDKKDFLEKI
RKALYMSKICSYAQGFAQMRKASEDNEWNLKLGDLAMIWREGCIIRAQFLQKIKDAYDNNPGLQNLLLDPYFKNIVTEYQ
DALRDVVATGVQNGVPTPGFSSSINYYDSYRAADLPANLIQAQRDYFGAHTYERKDKEGVFHTQWIEE
>P21577 1.1.1.44~~~gnd~~~6-phosphogluconate dehydrogenase, decarboxylating~~~COG0362
MALQQFGLIGLAVMGENLALNIERNGFSLTVYNRTAEKTEAFMADRAQGKNIVPAYSLEDFVASLERPRRILVMVKAGGP
VDAVVEQLKPLLDPGDLIIDGGNSLFTDTERRVKDLEALGLGFMGMGVSGGEEGALNGPSLMPGGTQAAYEAVEPIVRSI
AAQVDDGPCVTYIGPGGSGHYVKMVHNGIEYGDMQLIAEAYDLLKSVAGLNASELHDVFAAWNKTPELDSFLIEITADIF
TKVDDLGTGQPLVELILDAAGQKGTGRWTVETALEIGVAIPTIIAAVNARILSSIKAERQAASEILSGPITEPFSGDRQA
FIDSVRDALYCSKICSYAQGMALLAKASQVYNYGLNLGELARIWKGGCIIRAGFLNKIKQAYDADPTLANLLLAPEFRQT
ILDRQLAWRRVIAIAAERGIPVPAFSASLDYFDSYRRDRLPQNLTQAQRDYFGAHTYERTDRSGSFHAQWF
>O34499 3.1.1.31~~~pgl~~~6-phosphogluconolactonase~~~COG2706
MTKYIGYVGTYTKGGSEGIYSFELDTEKKALSEPKLAAKLGNPTYVATNKNNTILYSIEKADGQGGVAAYQIDKNSGELT
FLNHQLIDGPSPCHVSVDDQNQFVLTANYHSGKVHVFPVQEDGSLQSPVSEAAHTGKGPHERQEKPHTHYAGFTPEHNYV
VAVDLGIDKLYTYKLKDGVLTESGSHSFAPGAGPRHIAFHPKEKYAYVMTELSNEVIALEYNPTAGEFREIQVVSAIPDD
FTDNSQGSAIHVTQDGRFVYVANRGHDSIAVFEVNQYSGELAFVERVSTEGNWPRDFVFDPTEGFLVASNEETGNLVLFE
RDKETGRLTLLPSTVSVPYPVCVKFLHQV
>P52697 3.1.1.31~~~pgl~~~6-phosphogluconolactonase~~~COG2706
MKQTVYIASPESQQIHVWNLNHEGALTLTQVVDVPGQVQPMVVSPDKRYLYVGVRPEFRVLAYRIAPDDGALTFAAESAL
PGSPTHISTDHQGQFVFVGSYNAGNVSVTRLEDGLPVGVVDVVEGLDGCHSANISPDNRTLWVPALKQDRICLFTVSDDG
HLVAQDPAEVTTVEGAGPRHMVFHPNEQYAYCVNELNSSVDVWELKDPHGNIECVQTLDMMPENFSDTRWAADIHITPDG
RHLYACDRTASLITVFSVSEDGSVLSKEGFQPTETQPRGFNVDHSGKYLIAAGQKSHHISVYEIVGEQGLLHEKGRYAVG
QGPMWVVVNAH
>A6T6J6 3.1.1.31~~~pgl~~~6-phosphogluconolactonase~~~
MKQTVYTASPESQQIHVWSLEADGKLTLVQVVDAPGQVQPMVVSPNKEFLYVGVRPEFRVLAYRITPDNGALTFAGEAAL
PGSPTHISTDHHGRFVFSASYNQGCVSVTPLHDGLPGETITVVEGLEGCHSANISPDNRTLWVPALKQDRICLFTLSDDG
FLSAQEPAEVTTVEGAGPRHMVFHPNQQYGYCVNELNSSIDVWELKDPKGNIECVQTLDMMPPDFSGVRWAADIHITPDG
GHLYACDRTASIITVFSVSEDGSVLAVEGYQPTETQPRGFNLDHSGKYLIAAGQKSHHIAVYDIVGEQGLLQEKGRYAVG
QGPMWVVVNAH
>P9WQP5 3.1.1.31~~~pgl~~~6-phosphogluconolactonase~~~COG0363
MSSSIEIFPDSDILVAAAGKRLVGAIGAAVAARGQALIVLTGGGNGIALLRYLSAQAQQIEWSKVHLFWGDERYVPEDDD
ERNLKQARRALLNHVDIPSNQVHPMAASDGDFGGDLDAAALAYEQVLAASAAPGDPAPNFDVHLLGMGPEGHINSLFPHS
PAVLESTRMVVAVDDSPKPPPRRITLTLPAIQRSREVWLLVSGPGKADAVAAAIGGADPVSVPAAGAVGRQNTLWLLDRD
AAAKLPS
>P46016 3.1.1.31~~~pgl~~~6-phosphogluconolactonase~~~COG0363
MKKTVEVLPDQTALIARSLDLILTKLDTAIKQQGRFTIALSGGSTPKPLYEAIAAQKLPWDKIHVFWGDERYVSPDHPDS
NELMARTAWLDRVDIPAENIHAVPTLDNNPAVSAAKYEQHLQTFFNSAPGEFPALDVVLLGMGDDAHTASLFPHTEALQV
RDRLITVGNKDGNPRITFTYPFINAASSVIFVVAGANKRPALAQVFAPSADDLAYPSRFIQPQGELLWLLDAAAGAELSV
>P74618 3.1.1.31~~~pgl~~~6-phosphogluconolactonase~~~COG0363
MAPQVDVLINKQILIERALVCVTTRITKAIAERGQGTIALSGGNTPKPLYEALARQALPWEKIHVFWGDERYVSVDHPDS
NQRMARLAWLDQVDIPEANIHPMPTAAADPEQDAQTYENELATFFQVEAGHFPAFDLILLGLGDDGHTASLFPHTPALTV
GDRLITVGNKDGQPRLTFTIPLINRARSVVFLVAGASKQHALGEIFAPEADPQQYPARFIQPQGELIWLLDQQAGENLRP
>Q9X0N8 3.1.1.31~~~pgl~~~6-phosphogluconolactonase~~~COG0363
MEKTVIYLLEDGYVDFVVEKIRTKMEKLLEEKDKIFVVLAGGRTPLPVYEKLAEQKFPWNRIHFFLSDERYVPLDSDQSN
FRNINEVLFSRAKIPSGNVHYVDTSLPIEKACEKYEREIRSATDQFDLAILGMGPDGHVASIFDLETGNKDNLVTFTDPS
GDPKVPRVTLTFRALNTSLYVLFLIRGKEKINRLTEILKDTPLPAYFVRGKEKTVWFVGK
>P76578 ~~~yfhM~~~Alpha-2-macroglobulin~~~COG2373
MKKLRVAACMLMLALAGCDNNDNAPTAVKKDAPSEVTKAASSENASSAKLSVPERQKLAQQSAGKVLTLLDLSEVQLDGA
ATLVLTFSIPLDPDQDFSRVIHVVDKKSGKVDGAWELSDNLKELRLRHLEPKRDLIVTIGKEVKALNNATFSKDYEKTIT
TRDIQPSVGFASRGSLLPGKVVEGLPVMALNVNNVDVNFFRVKPESLPAFISQWEYRNSLANWQSDKLLQMADLVYTGRF
DLNPARNTREKLLLPLGDIKPLQQAGVYLAVMNQAGRYDYSNPATLFTLSDIGVSAHRYHNRLDIFTQSLENGAAQQGIE
VSLLNEKGQTLTQATSDAQGHVQLENDKNAALLLARKDGQTTLLDLKLPALDLAEFNIAGAPGYSKQFFMFGPRDLYRPG
ETVILNGLLRDADGKALPNQPIKLDVIKPDGQVLRSVVSQPENGLYHFTWPLDSNAATGMWHIRANTGDNQYRMWDFHVE
DFMPERMALNLTGEKTPLTPKDEVKFSVVGYYLYGAPANGNTLQGQLFLRPLREAVSALPGFEFGDIAAENLSRTLDEVQ
LTLDDKGRGEVSTESQWKETHSPLQVIFQGSLLESGGRPVTRRAEQAIWPADALPGIRPQFASKSVYDYRTDSTVKQPIV
DEGSNAAFDIVYSDAQGVKKAVSGLQVRLIRERRDYYWNWSEDEGWQSQFDQKDLIENEQTLDLKADETGKVSFPVEWGA
YRLEVKAPNEAVSSVRFWAGYSWQDNSDGSGAVRPDRVTLKLDKASYRPGDTIKLHIAAPTAGKGYAMVESSEGPLWWQE
IDVRAQGLDLTIPVDKTWNRHDLYLSTLVVRPGDKSRSATPKRAVGVLHLPLGDENRRLDLALETPAKMRPNQPLTVKIK
ASTKNGEKPKQVNVLVSAVDSGVLNITDYVTPDPWQAFFGQKRYGADIYDIYGQVIEGQGRLAALRFGGDGDELKRGGKP
PVNHVNIVVQQALPVTLNEQGEGSVTLPIGDFNGELRVMAQAWTADDFGSNESKVIVAAPVIAELNMPRFMASGDTSRLT
LDITNLTDKPQKLNVALTASGLLELVSDSPAAVELAPGVRTTLFIPVRALPGYGDGEIQATISGLALPGETVADQHKQWK
IGVRPAFPAQTVNYGTALQPGETWAIPADGLQNFSPVTLEGQLLLSGKPPLNIARYIKELKAYPYGCLEQTASGLFPSLY
TNAAQLQALGIKGDSDEKRRASVDIGISRLLQMQRDNGGFALWDKNGDEEYWLTAYVMDFLVRAGEQGYSVPTDAINRGN
ERLLRYLQDPGMMSIPYADNLKASKFAVQSYAALVLARQQKAPLGALREIWEHRADAASGLPLLQLGVALKTMGDATRGE
EAIALALKTPRNSDERIWLGDYGSSLRDNALMLSLLEENKLLPDEQYTLLNTLSQQAFGERWLSTQESNALFLAARTIQD
LPGKWQAQTSFSAEQLTGEKAQNSNLNSDQLVTLQVSNSGDQPLWLRMDASGYPQSAPLPANNVLQIERHILGTDGKSKS
LDSLRSGDLVLVWLQVKASNSVPDALVVDLLPAGLELENQNLANGSASLEQSGGEVQNLLNQMQQASIKHIEFRDDRFVA
AVAVDEYQPVTLVYLARAVTPGTYQVPQPMVESMYVPQWRATGAAEDLLIVRP
>P9WQP3 2.3.1.122~~~fbpA~~~Diacylglycerol acyltransferase/mycolyltransferase Ag85A~~~COG0627
MQLVDRVRGAVTGMSRRLVVGAVGAALVSGLVGAVGGTATAGAFSRPGLPVEYLQVPSPSMGRDIKVQFQSGGANSPALY
LLDGLRAQDDFSGWDINTPAFEWYDQSGLSVVMPVGGQSSFYSDWYQPACGKAGCQTYKWETFLTSELPGWLQANRHVKP
TGSAVVGLSMAASSALTLAIYHPQQFVYAGAMSGLLDPSQAMGPTLIGLAMGDAGGYKASDMWGPKEDPAWQRNDPLLNV
GKLIANNTRVWVYCGNGKPSDLGGNNLPAKFLEGFVRTSNIKFQDAYNAGGGHNGVFDFPDSGTHSWEYWGAQLNAMKPD
LQRALGATPNTGPAPQGA
>P0C2T2 2.3.1.122~~~fbpB~~~Diacylglycerol acyltransferase/mycolyltransferase Ag85B~~~
MTDVSRKIRAWGRRLMIGTAAAVVLPGLVGLAGGAATAGAFSRPGLPVEYLQVPSPSMGRDIKVQFQSGGNNSPAVYLLD
GLRAQDDYNGWDINTPAFEWYYQSGLSIVMPVGGQSSFYSDWYSPACGKAGCQTYKWETFLTSELPQWLSANRAVKPTGS
AAIGLSMAGSSAMILAAYHPQQFIYAGSLSALLDPSQGMGPSLIGLAMGDAGGYKAADMWGPSSDPAWERNDPTQQIPKL
VANNTRLWVYCGNGTPNELGGANIPAEFLENFVRSSNLKFQDAYNAAGGHNAVFNFPPNGTHSWEYWGAQLNAMKGDLQS
SLGAG
>P21160 2.3.1.122~~~fbpB~~~Diacylglycerol acyltransferase/mycolyltransferase Ag85B~~~
MTDVSGKIRAWGRRLLVGAAAAAALPGLVGLAGGAATAGAFSRPGLPVEYLQVPSAAMGRSIKVQFQSGGDNSPAVYLLD
GLRAQDDYNGWDINTPAFEWYYQSGLSVIMPVGGQSSFYSDWYSPACGKAGCTTYKWETFLTSELPQWLSANRSVKPTGS
AAVGISMAGSSALILSVYHPQQFIYAGSLSALMDPSQGMGPSLIGLAMGDAGGYKASDMWGPSSDPAWQRNDPSLHIPEL
VANNTRLWIYCGNGTPSELGGANVPAEFLENFVRSSNLKFQDAYNAAGGHNAVFNLDANGTHSWEYWGAQLNAMKGDLQA
SLGAR
>A0QU51 2.3.1.122~~~fbpB~~~Diacylglycerol acyltransferase/mycolyltransferase Ag85B~~~COG0627
MTFIDKIRGHWARRMTVAAVAALLLPGLVGVVGGSATAGAFSRPGLPVEYLMVPSPSMGRDIKVQFQSGGPGSHAVYLLD
GLRAQDDFNGWDINTNAFEMFLDSGLSVVMPVGGQSSFYSDWYQPACGNNGCVTYKWETFLTSELPEWLAANRDVAATGN
AAIGLSMAGSAALILAAYHPDRFIYAGSMSGFLNPSEGWWPFLINISMGDAGGYKANDMWGPTEDPNSAWKRNDPMVQIP
RLVANNTRIWVYCGNGQPNELGGGDLPATFLEGLTIRTNETFRDNYIAAGGNNGVFNFPNNGTHNWAYWGRELQAMVPDL
QRVLG
>P9WQP1 2.3.1.122~~~fbpB~~~Diacylglycerol acyltransferase/mycolyltransferase Ag85B~~~COG0627
MTDVSRKIRAWGRRLMIGTAAAVVLPGLVGLAGGAATAGAFSRPGLPVEYLQVPSPSMGRDIKVQFQSGGNNSPAVYLLD
GLRAQDDYNGWDINTPAFEWYYQSGLSIVMPVGGQSSFYSDWYSPACGKAGCQTYKWETFLTSELPQWLSANRAVKPTGS
AAIGLSMAGSSAMILAAYHPQQFIYAGSLSALLDPSQGMGPSLIGLAMGDAGGYKAADMWGPSSDPAWERNDPTQQIPKL
VANNTRLWVYCGNGTPNELGGANIPAEFLENFVRSSNLKFQDAYNAAGGHNAVFNFPPNGTHSWEYWGAQLNAMKGDLQS
SLGAG
>P9WQN9 2.3.1.122~~~fbpC~~~Diacylglycerol acyltransferase/mycolyltransferase Ag85C~~~COG0627
MTFFEQVRRLRSAATTLPRRLAIAAMGAVLVYGLVGTFGGPATAGAFSRPGLPVEYLQVPSASMGRDIKVQFQGGGPHAV
YLLDGLRAQDDYNGWDINTPAFEEYYQSGLSVIMPVGGQSSFYTDWYQPSQSNGQNYTYKWETFLTREMPAWLQANKGVS
PTGNAAVGLSMSGGSALILAAYYPQQFPYAASLSGFLNPSEGWWPTLIGLAMNDSGGYNANSMWGPSSDPAWKRNDPMVQ
IPRLVANNTRIWVYCGNGTPSDLGGDNIPAKFLEGLTLRTNQTFRDTYAADGGRNGVFNFPPNGTHSWPYWNEQLVAMKA
DIQHVLNGATPPAAPAAPAA
>Q5SF96 ~~~aaaP~~~Appendage-associated protein~~~
MIVTYGTVGCPVSRGGSPGCGRRIAEELRLAEDARLRLALLGRCIVKGSPAQARGELRAELKAIDATIELRKELDAIDAE
WAPKIELSAELRAIDAEWRPAIRLRSAYRAIIGRIELRKELDAIDAEWAPKIELSAELKAIDAEWRPAIRLRSAYRAIIG
RWELSKELKAIDAEWRPAIARESLRKELDAIDAEWQHAITFWHISRAIIGSIELSKELKAIDAKWKYVAIYERQKAQRRR
EERAAKAREELRKELNDIDAKWKSASAIKLRKDLRSTSEGVDHTEFALELRATDKSGNMELVLKLKATDTKNQHDAIVKA
IEDGFVGYAAECGAATRELNACGGMSTTSAPSTDLISTVVSAVTTGTGQQQSAGSESQRPTECGGGGTLLSSLFALILPS
SNWYCNAVLYGGFLGCLGS
>P46854 2.3.1.-~~~aaaT~~~L-amino acid N-acetyltransferase AaaT~~~COG1247
MSEIVIRHAETRDYEAIRQIHAQPEVYCNTLQVPHPSDHMWQERLADRPGIKQLVACIDGDVVGHLTIDVQQRPRRSHVA
DFGICVDSRWKNRGVASALMREMIEMCDNWLRVDRIELTVFVDNAPAIKVYKKYGFEIEGTGKKYALRNGEYVDAYYMAR
VK
>Q9HUQ0 ~~~~~~Amino acid binding protein~~~
MSKKLFRKGILALAVSSVMGLSTHALADVVIGVAGPHTGANASFGEQYWRGASQAAEDINAAGGINGEKIKLVKADDACE
PKQAVAVANRLVDQDKAIAVVGHFCSSSTIPASEVYDEAGIIAITPGSTNPQVTERGLSGMFRMCGRDDQQGVVAGDYIV
NVLKAKKVAVIHDKDTYGQGLADATRAQLNKLGVKEVLYEGLTRGEKDFNALVTKIRASGAEVVYFGGLHPEAGPLVRQM
REQGLTARFMSDDGVVTDELATTAGGPQYVKGVLMTFGADPRLIPDGKAVVEKFRAGGFEPEGYTLYSYASIQSLAAAFN
GAGANDPAKAAEWLKSHPVQTVMGKKEWDKKGDLKVSDYVVYEWDDKGKYHQLP
>P94968 2.3.1.-~~~aac~~~Aminoglycoside 2'-N-acetyltransferase~~~COG0456
MLTQHVSEARTRGAIHTARLIHTSDLDQETRDGARRMVIEAFRDPSGDSDFTDDFTDDDWDHALGGMHALISHHGALIAH
GAVVQRRLMYRGPDGRGHALRCGYVEAVAVREDRRGDGLGTAVLDALEQVIRGAYQIGALSASDIARPMYIARGWLSWEG
PTSVLTPTEGIVRTPEDDRSLFVLPVDLPDGLELDTAREITCDWRSGDPW
>P9WQG9 2.3.1.-~~~aac~~~Aminoglycoside 2'-N-acetyltransferase~~~COG0456
MHTQVHTARLVHTADLDSETRQDIRQMVTGAFAGDFTETDWEHTLGGMHALIWHHGAIIAHAAVIQRRLIYRGNALRCGY
VEGVAVRADWRGQRLVSALLDAVEQVMRGAYQLGALSSSARARRLYASRGWLPWHGPTSVLAPTGPVRTPDDDGTVFVLP
IDISLDTSAELMCDWRAGDVW
>Q52424 2.3.1.59~~~aac~~~Aminoglycoside 2'-N-acetyltransferase~~~
MGIEYRSLHTSQLTLSEKEALYDLLIEGFEGDFSHDDFAHTLGGMHVMAFDQQKLVGHVAIIQRHMALDNTPISVGYVEA
MVVEQSYRRQGIGRQLMLQTNKIIASCYQLGLLSASDDGQKLYHSVGWQIWKGKLFELKQGSYIRSIEEEGGVMGWKADG
EVDFTASLYCDFRGGDQW
>Q01515 2.3.1.81~~~aac3-Vb~~~Aminoglycoside N(3)-acetyltransferase III~~~
MNTIESITADLHGLGVRPGDLIMVHASLKAVGPVEGGAASVVSALRAAVGSAGTLMGYASWDRSPYEETLNGARMDEELR
RRWPPFDLATSGTYPGFGLLNRFLLEAPDARRSAHPDASMVAVGPLAATLTEPHRLGQALGEGSPLERFVGHGGKVLLLG
APLDSVTVLHYAEAIAPIPNKRRVTYEMPMLGPDGRVRWELAEDFDSNGILDCFAVDGKPDAVETIAKAYVELGRHREGI
VGRAPSYLFEAQDIVSFGVTYLEQHFGAP
>Q54441 2.3.1.82~~~~~~Aminoglycoside N(6')-acetyltransferase type 1~~~
MIVICDHDNLDAWLALRTALWPSGSPEDHRAEMREILASPHHTAFMARGLDGAFVAFAEVALRYDYVNGCESSPVAFLEG
IYTAERARRQGWAARLIAQVQEWAKQQGCSELASDTDIANLDSQRLHAALGFAETERVVFYRKTLG
>Q43899 2.3.1.82~~~~~~Aminoglycoside N(6')-acetyltransferase type 1~~~
MNIMPISESQLSDWLALRCLLWPDHEDVHLQEMRQLITQAHRLQLLAYTDTQQAIAMLEASIRYEYVNGTQTSPVAFLEG
IFVLPEYRRSGIATGLVQQVEIWAKQFACTEFASDAALDNQISHAMHQALGFHETERVVYFKKNIG
>Q44245 2.3.1.82~~~~~~Aminoglycoside N(6')-acetyltransferase type 1~~~
MNIMPVSESLMADWLGLRKLLWPDHDEAHLQEMQRLLQQTQSLQLLAYSDTQQAIAMLEASIRYEYVNGTQTSPVAFLEG
IYVLPDYRRSGIATHLVQQVEAWAKPFGCIEFASDAALDNRISHAMHQALGFHETERVVYFKKHIG
>Q44057 2.3.1.82~~~~~~Aminoglycoside N(6')-acetyltransferase type 1~~~COG0456
MNIKPASEASLKDWLELRNKLWSDSEASHLQEMHQLLAEKYALQLLAYSDHQAIAMLEASIRFEYVNGTETSPVGFLEGI
YVLPAHRRSGVATMLIRQAEVWAKQFSCTEFASDAALDNVISHAMHRSLGFQETEKVVYFSKKID
>Q9R381 2.3.1.82~~~~~~Aminoglycoside N(6')-acetyltransferase type 1~~~
MDIRQMNKTHLEHWRGLRKQLWPGHPDDAHLADGEEILQADHLASFIAMADGVAIGFADASIRHDYVNGCDSSPVVFLEG
IFVLPSFRQRGVAKQLIAAVQRWGTNKGCREMASDTSPENTISQKVHQALGFEETERVIFYRKRC
>P20092 2.3.1.82~~~aacA4~~~Aminoglycoside N(6')-acetyltransferase type 1~~~
MSIQHFQRKLGITKYSIVTNSNDSVTLRLMTEHDLAMLYEWLNRSHIVEWWGGEEARPTLADVQEQYLPSVLAQESVTPY
IAMLNGEPIGYAQSYVALGSGDGWWEEETDPGVRGIDQLLANASQLGKGLGTKLVRALVELLFNDPEVTKIQTDPSPSNL
RAIRCYEKAGFERQGTVTTPDGPAVYMVQTRQAFERTRRFA
>Q9RBW7 2.3.1.82~~~~~~Aminoglycoside N(6')-acetyltransferase type 1~~~
MIASAPTIRQATPADAAAWAQLRLGLWPDADDPLEELTQSLADAEGAVFLACAADGETVGFAEVRLRHDYVNGTESSPVG
FLEGWYVQPQWQGSGVGRALLAAVQAWTRDAGCRELASDSRVEDVQAHAAHRACGFEETERVVYFRMPLEPSA
>P0A0C2 ~~~aacA-aphD~~~Bifunctional AAC/APH~~~
MNIVENEICIRTLIDDDFPLMLKWLTDERVLEFYGGRDKKYTLESLKKHYTEPWEDEVFRVIIEYNNVPIGYGQIYKMYD
ELYTDYHYPKTDEIVYGMDQFIGEPNYWSKGIGTRYIKLIFEFLKKERNANAVILDPHKNNPRAIRAYQKSGFRIIEDLP
EHELHEGKKEDCYLMEYRYDDNATNVKAMKYLIEHYFDNFKVDSIEIIGSGYDSVAYLVNNEYIFKTKFSTNKKKGYAKE
KAIYNFLNTNLETNVKIPNIEYSYISDELSILGYKEIKGTFLTPEIYSTMSEEEQNLLKRDIASFLRQMHGLDYTDISEC
TIDNKQNVLEEYILLRETIYNDLTDIEKDYIESFMERLNATTVFEGKKCLCHNDFSCNHLLLDGNNRLTGIIDFGDSGII
DEYCDFIYLLEDSEEEIGTNFGEDILRMYGNIDIEKAKEYQDIVEEYYPIETIVYGIKNIKQEFIENGRKEIYKRTYKD
>P0A0C1 ~~~aacA-aphD~~~Bifunctional AAC/APH~~~
MNIVENEICIRTLIDDDFPLMLKWLTDERVLEFYGGRDKKYTLESLKKHYTEPWEDEVFRVIIEYNNVPIGYGQIYKMYD
ELYTDYHYPKTDEIVYGMDQFIGEPNYWSKGIGTRYIKLIFEFLKKERNANAVILDPHKNNPRAIRAYQKSGFRIIEDLP
EHELHEGKKEDCYLMEYRYDDNATNVKAMKYLIEHYFDNFKVDSIEIIGSGYDSVAYLVNNEYIFKTKFSTNKKKGYAKE
KAIYNFLNTNLETNVKIPNIEYSYISDELSILGYKEIKGTFLTPEIYSTMSEEEQNLLKRDIASFLRQMHGLDYTDISEC
TIDNKQNVLEEYILLRETIYNDLTDIEKDYIESFMERLNATTVFEGKKCLCHNDFSCNHLLLDGNNRLTGIIDFGDSGII
DEYCDFIYLLEDSEEEIGTNFGEDILRMYGNIDIEKAKEYQDIVEEYYPIETIVYGIKNIKQEFIENGRKEIYKRTYKD
>Q7ATH7 ~~~aacA-aphD~~~Bifunctional AAC/APH~~~
MNIVENEICIRTLIDDDFPLMLKWLTDERVLEFYGGRDKKYTLESLKKHYTEPWEDEVFRVIIEYNNVPIGYGQIYKMYD
ELYTDYHYPKTDEIVYGMDQFIGEPNYWSKGIGTRYIKLIFEFLKKERNANAVILDPHKNNPRAIRAYQKSGFRIIEDLP
EHELHEGKKEDCYLMEYRYDDNATNVKAMKYLIEHYFDNFKVDSIEIIGSGYDSVAYLVNNEYIFKTKFSTNKKKGYAKE
KAIYNFLNTNLETNVKIPNIEYSYISDELSILGYKEIKGTFLTPEIYSTMSEEEQNLLKRDIASFLRQMHGLDYTDISEC
TIDNKQNVLEEYILLRETIYNDLTDIEKDYIESFMERLNATTVFEGKKCLCHNDFSCNHLLLDGNNRLTGIIDFGDSGII
DEYCDFIYLLEDSEEEIGTNFGEDILRMYGNIDIEKAKEYQDIVEEYYPIETIVYGIKNIKQEFIENGRKEIYKRTYKD
>P23181 2.3.1.60~~~aacC1~~~Gentamicin 3-N-acetyltransferase~~~
MLRSSNDVTQQGSRPKTKLGGSSMGIIRTCRLGPDQVKSMRAALDLFGREFGDVATYSQHQPDSDYLGNLLRSKTFIALA
AFDQEAVVGALAAYVLPRFEQPRSEIYIYDLAVSGEHRRQGIATALINLLKHEANALGAYVIYVQADYGDDPAVALYTKL
GIREEVMHFDIDPSTAT
>P29808 2.3.1.81~~~aacC3~~~Aminoglycoside N(3)-acetyltransferase III~~~
MTDLNIPHTHAHLVDAFQALGIRAGQALMLHASVKAVGAVMGGPNVILQALMDALTPDGTLMMYAGWQDIPDFIDSLPDA
LKAVYLEQHPPFDPATARAVRENSVLAEFLRTWPCVHRSANPEASMVAVGRQAALLTANHALDYGYGVESPLAKLVAIEG
YVLMLGAPLDTITLLHHAEYLAKMRHKNVVRYPCPILRDGRKVWVTVEDYDTGDPHDDYSFEQIARDYVAQGGGTRGKVG
DADAYLFAAQDLTRFAVQWLESRFGDSASYG
>Q89VT8 6.2.1.n2~~~~~~Amino acid--[acyl-carrier-protein] ligase 1~~~COG0172
MNIAVLPNSPDTAPQIADPLDHLADKLFHSMGSDGVYARTALYESIVERLAALITSHREAGTEALRFPPVMSRAQLEKSG
YLKSFPNLLGCVCGLHGTEREINAAVSRFDAGGDWTTSLSPADLVLSPAACYPVYPIAASRGPLPKGGLRFDVAADCFRR
EPSKHLDRLQSFRMREYVCIGTPDDVSDFRERWMVRAQAIARDLGLTFRVDYASDPFFGRVGQMKAVSQKQQQLKFELLI
PLRSEEQPTACMSFNYHREHFGTTWGIQDANGEPAHTGCVAFGMDRLAVAMFHTHGTDLSAWPAKVRDILGLQPHVAAGA
HGEGWR
>Q89GR3 6.2.1.n2~~~~~~Amino acid--[acyl-carrier-protein] ligase 2~~~COG0172
MNLAIVEAPADSTPPPADPLDHLADALFHEMGSPGVYGRTALYEDVVERIAAVISRNREPNTEVMRFPPVMNRAQLERSG
YLKSFPNLLGCVCGLHGIESEIDAAISRFDAGGDWTESLSPADLVLSPAACYPLYPIAASRGPVPAAGWSFDVAADCFRR
EPSRHLDRLQSFRMREFVCIGSADHVSAFRERWIIRAQKIARDLGLTFRIDHANDPFFGRVGQMMAVSQKQLSLKFELLV
PLRSEERPTACMSFNYHRDHFGTTWGIVDAAGEPAHTACVAFGMDRLAVAMFHTHGKDVALWPIAVRDLLGLAQTDRGAP
SAFEEYRCAKEAGS
>Q7CWR3 6.2.1.n2~~~~~~Amino acid--[acyl-carrier-protein] ligase~~~COG0172
MTVFSAIPPISCWFTGRTPASWDKTMDMQTSFLDRLFEEGLLIETGVDGLYGRSGQFEDVIAAFERLIDRTGGADGAEAI
RFPPGINRAYFEKSGYMKSFPQLAGTVHSFCGCELDHVSLLKSMDEGGDWTKDQKATDIVLTPAACYPLYPTIAKRGALP
AGGGLYDIQSYCFRHEPSKDPARQQLFRMREYVCMGTESDVTEFRQTWMDRGVEMMKAVGLDVTIDIANDPFFGRAGKML
ANNQRDQNLKFELLIPVTSATNPTACMSFNYHQDAFGQKWGLNLENGDVAHTACVGFGLERIALALFAHHGLDVKKWPAK
VVETLWG
>Q89VT6 ~~~~~~Aminoacyl carrier protein 1~~~COG0236
MQAFNTDVRNRIIKLVKGILEQNALAADVTPQAKLVDVGLTSMDMVNLMLGVEAEFDFTIPQSEITPENFQSVETLERMV
MTQLQPATAA
>Q89GR1 ~~~~~~Aminoacyl carrier protein 2~~~COG0236
MHNTAINVQNRVLSVVRSVLQQNAISADVHPESRLVDIGLSSMGMVELMLKVEAEFDLILPQFEITPENFRSVKAMERMI
LNQLGSGSG
>A9CHM9 ~~~~~~Aminoacyl carrier protein~~~COG0236
MNATIREILAKFGQLPTPVDTIADEADLYAAGLSSFASVQLMLGIEEAFDIEFPDNLLNRKSFASIKAIEDTVKLILDGK
EAA
>P29958 3.5.1.-~~~aac~~~Aculeacin-A acylase~~~COG2366
MTSSYMRLKAAAIAFGVIVATAAVPSPASGREHDGGYAALIRRASYGVPHITADDFGSLGFGVGYVQAEDNICVIAESVV
TANGERSRWFGATGPDDADVRTTSSTQAIDDRVAERLLEGPRDGVRAPCDDVRDQMRGFVAGYNHFLRRTGVHRLTDPAC
RGKAWVRPLSEIDLWRTSWDSMVRAGSGALLDGIVAATPPTAAGPASAPEAPDAAAIAAALDGTSAGIGSNAYGLGAQAT
VNGSGMVLANPHFPWQGAERFYRMHLKVPGRYDVEGAALIGDPIIEIGHNRTVAWSHTVSTARRFVWHRLSLVPGDPTSY
YVDGRPERMRARTVTVQTGSGPVSRTFHDTRYGPVAVVPGTFDWTPATAYAITDVNAGNNRAFDGWLRMGQAKDVRALKA
VLDRHQFLPWVNVIAADARGEALYGDHSVVPRVTGALAAACIPAPFQPLYASSGQAVLDGSRSDCALGADPDAAVPGILG
PASLPVRFRDDYVTNSNDSHWLASPAAPLEGFPRILGNERTPRSLRTRLGLDQIQQRLAGTDGLPGKGFTTARLWQVMFG
NRMHGAELVRDDLVALCRRQPTATASNGAIVDLTAACTALSRFDERADLDSRGAHLFTEFLAGGIRFADTFEVTDPVRTP
APFWNTTDPRVRTALADACNGSPASPSTRSVGDIHTDSRGERRIPIHGGRGEAGTFNVITNPLVPGVGYPQVVHGTSFVM
AVELGPHGPSGRQILTYAQSTNPNSPWYADQTVLYSRKGWDTIKYTEAQIAADPNLRVYRVAQRGR
>P0AE05 2.7.7.46~~~aadB~~~2''-aminoglycoside nucleotidyltransferase~~~
MDTTQVTLIHKILAAADERNLPLWIGGGWAIDARLGRVTRKHDDIDLTFPGERRGELEAIVEMLGGRVMEELDYGFLAEI
GDELLDCEPAWWADEAYEIAEAPQGSCPEAAEGVIAGRPVRCNSWEAIIWDYFYYADEVPPVDWPTKHIESYRLACTSLG
AEKVEVLRAAFRSRYAA
>I0DFJ0 4.1.1.28~~~~~~Aromatic-L-amino-acid decarboxylase~~~
MSENLQLSAEEMRQLGYQAVDLIIDHMNHLKSKPVSETIDSDILRNKLTESIPENGSDPKELLHFLNRNVFNQITHVDHP
HFLAFVPGPNNYVGVVADFLASGFNVFPTAWIAGAGAEQIELTTINWLKSMLGFPDSAEGLFVSGGSMANLTALTVARQA
KLNNDIENAVVYFSDQTHFSVDRALKVLGFKHHQICRIETDEHLRISVSALKKQIKEDRTKGKKPFCVIANAGTTNCGAV
DSLNELADLCNDEDVWLHADGSYGAPAILSEKGSAMLQGIHRADSLTLDPHKWLFQPYDVGCVLIRNSQYLSKTFRMMPE
YIKDSETNVEGEINFGECGIELSRRFRALKVWLSFKVFGVAAFRQAIDHGIMLAEQVEAFLGKAKDWEVVTPAQLGIVTF
RYIPSELASTDTINEINKKLVKEITHRGFAMLSTTELKEKVVIRLCSINPRTTTEEMLQIMMKIKALAEEVSISYPCVAE
>P17585 2.7.7.-~~~aadK~~~Aminoglycoside 6-adenylyltransferase~~~
MRSEQEMMDIFLDFALNDERIRLVTLEGSRTNRNIPPDNFQDYDISYFVTDVESFKENDQWLEIFGKRIMMQKPEDMELF
PPELGNWFSYIILFEDGNKLDLTLIPIREAEDYFANNDGLVKVLLDKDSFINYKVTPNDRQYWIKRPTAREFDDCCNEFW
MVSTYVVKGLARNEILFAIDHLNEIVRPNLLRMMAWHIASQKGYSFSMGKNYKFMKRYLSNKEWEELMSTYSVNGYQEMW
KSLFTCYALFRKYSKAVSEGLAYKYPDYDEGITKYTEGIYCSVK
>Q01980 ~~~aadR~~~Transcriptional activatory protein AadR~~~COG0664
MPHLAYPTTTCEGFRCETHCAVRGLAICGELGPADHEEFERLAQHVRYGPKEALFSEDEVADSVYSLIEGIARLYKLLPD
GRRQIIGFALPGDFLGMAPGNRYSFSADSIGGVTVCKFFRGPFLRFIENRPQMLLRMNDFATRELSLAQDQMLLLGRRSA
EEKVAAFLVGWRDRLARLEGVTKTVSLPMGRQDIADFLGLTIETVSRTFTKLEREKLIVIVPDGVRVLDPKRFDALAAA
>P46482 ~~~aaeA~~~p-hydroxybenzoic acid efflux pump subunit AaeA~~~COG1566
MKTLIRKFSRTAITVVLVILAFIAIFNAWVYYTESPWTRDARFSADVVAIAPDVSGLITQVNVHDNQLVKKGQILFTIDQ
PRYQKALEEAQADVAYYQVLAQEKRQEAGRRNRLGVQAMSREEIDQANNVLQTVLHQLAKAQATRDLAKLDLERTVIRAP
ADGWVTNLNVYTGEFITRGSTAVALVKQNSFYVLAYMEETKLEGVRPGYRAEITPLGSNKVLKGTVDSVAAGVTNASSTR
DDKGMATIDSNLEWVRLAQRVPVRIRLDNQQENIWPAGTTATVVVTGKQDRDESQDSFFRKMAHRLREFG
>P46481 ~~~aaeB~~~p-hydroxybenzoic acid efflux pump subunit AaeB~~~COG1289
MGIFSIANQHIRFAVKLATAIVLALFVGFHFQLETPRWAVLTAAIVAAGTAFAAGGEPYSGAIRYRGFLRIIGTFIGCIA
GLVIIIAMIRAPLLMILVCCIWAGFCTWISSLVRIENSYAWGLAGYTALIIVITIQPEPLLTPQFAVERCSEIVIGIVCA
IMADLLFSPRSIKQEVDRELESLLVAQYQLMQLCIKHGDGEVVDKAWGDLVRRTTALQGMRSNLNMESSRWARANRRLKA
INTLSLTLITQSCETYLIQNTRPELITDTFREFFDTPVETAQDVHKQLKRLRRVIAWTGERETPVTIYSWVAAATRYQLL
KRGVISNTKINATEEEILQGEPEVKVESAERHHAMVNFWRTTLSCILGTLFWLWTGWTSGSGAMVMIAVVTSLAMRLPNP
RMVAIDFIYGTLAALPLGLLYFLVIIPNTQQSMLLLCISLAVLGFFLGIEVQKRRLGSMGALASTINIIVLDNPMTFHFS
QFLDSALGQIVGCVLAFTVILLVRDKSRDRTGRVLLNQFVSAAVSAMTTNVARRKENHLPALYQQLFLLMNKFPGDLPKF
RLALTMIIAHQRLRDAPIPVNEDLSAFHRQMRRTADHVISARSDDKRRRYFGQLLEELEIYQEKLRIWQAPPQVTEPVNR
LAGMLHKYQHALTDS
>Q11T61 5.1.1.-~~~tfdD~~~D-Ala-D/L-Ala epimerase~~~COG4948
MIITQVELYKSPVKLKEPFKISLGILTHANNVIVRIHTASGHIGYGECSPFMTIHGESMDTAFIVGQYLAKGLIGTSCLD
IVSNSLLMDAIIYGNSCIKSAFNIALYDLAAQHAGLPLYAFLGGKKDKIIQTDYTVSIDEPHKMAADAVQIKKNGFEIIK
VKVGGSKELDVERIRMIREAAGDSITLRIDANQGWSVETAIETLTLLEPYNIQHCEEPVSRNLYTALPKIRQACRIPIMA
DESCCNSFDAERLIQIQACDSFNLKLSKSAGITNALNIIRLAEQAHMPVQVGGFLESRLGFTAAAHVALVSKTICYYDFD
TPLMFEADPVRGGIVYQQRGIIEVPETAGLGAGYQKDYLSGLEKICIN
>P46478 ~~~aaeX~~~Protein AaeX~~~
MSLFPVIVVFGLSFPPIFFELLLSLAIFWLVRRVLVPTGIYDFVWHPALFNTALYCCLFYLISRLFV
>Q9LAP7 3.2.1.158~~~agaA~~~Alpha-agarase~~~
MFKTKRSLLNSSIAISFAVLGVQAQAETLELQAESFANSGGTYSDGQPNPVTIYNVNGQGAINFVNAGDYVDYNINALGG
EYDIEYFVGTGVTSGPNIEVLVDVNGTWQSQGSVAVPYGSWDDFQSLTPSHTVTLPVGTSTVRLLAVGSTWQWNLESFRL
TQVSPVEPVGDADNDGVNDNQDLCPNSPSGVTVDNNGCQITGGTDPGGESFVIQMEAFDSTGSDDSRAKGVIIGERGYPQ
DKHTVVDSVQTTDWVDYSINFPSSANYSVSMLASGQTDHATAVLYLDGTEINEVPVHTGSQADFANFQLAGSVYIASGTH
TIRVQAQSSTGEFSWLWFGDALTFTNLDSDGGNGGEATQDADNDGVLDSSDSCPNTPTGEPADVTGCSASQLDDDNDGVS
NNVDQCPNTVAGTEVDADGCEVIFADADNDGIEDSQDFCPNTPAGEAVNNSGCGASQLDADNDGVTNNIDQCPNTPAGTQ
VDASGCETDNGGEPGDSYYHNGQGLLFGRVDGATNFLGEEGYVANPDNYDVTTDLLETDDAIRANSTEVFRGEIYDADGH
IAFYEHIDDSVRLYIDGQLVLSNDSWENSSQTTDLNLTPGWHNFELRLGNADGGSGAVSGIGFGIDVDGGTNFVHPSNLS
PSMFRASGQVVVDPILPPSGGIYIQLEDFDETGTVGRVASDPNDGFVKGDSNVGWVTNGDWGKYHNVFLEAGTYRAFITV
STPAGGSYGARVDIDGEPFAWGYFDSTGGWDIAAEYELYGGHLVVESTGNHTLHVEAVGGSDWQWSGDLVRLAKVSDSAV
KQPRVYNPNEHIVAEIQGPATGLQYLKTPVEIPLANKVLKSDVWYTYPQNRNLVVDGDTPYADFGATGAFWGHPPEHDFY
DDTVIMDWAVNVVDDFQSEGFEYTARGEFDWGYGWFTEFTTNPQPHYVQTLDGRNVRMTFMGYLSHDGYNNNWLSNHSPA
FVPFMKSQVDQILKANPDKLMFDTQTNSTRSTDMRTFGGDFSPYAMENFRVWLLKKYSNAQLVSMGINDITSFDYGAYLR
AQGITHTDWSNAGDTISGNIPMMEDFIYFNRDVWNQKFAEVLEYIRQQRPNIEIGASTHLFESRGYIFNENITFLSGELN
LGARTSISELPTNILVHLKGAQAVDKTLAYFPYPWEFDELRIQNAPRFGRGWVAQAYAYGGLFSIPANVWVGGEVFTWSP
GADNYRDIYQFVRAQANLLDGYTSYAKAGYVHAMFSSMKAGFIDGGNQVQSSVKILTEDNINFDMLVFGDAGYPVVPRQA
DFDKFEYIFYDGDLNYLTAEQQAVLDAQGSKVKHIGQRGTIAGLQINVSINGSVSNETVSAVSRIHETDSTAPYVVHLIN
RPFAGGVTPILNNVEVAIPASYFPQGVTSAKLHLPDGSSSTVAVSTNANGDTVVSVSNLEVWGILELAH
>A1IGV8 3.2.1.158~~~~~~Alpha-agarase~~~
MITSSKKIVSAMLSTSLWIGVASAAYAETTNVEAEGYSTIGGTYQDGNPQPINIYSVNGVQAINFVNRGDFAEYDVSVST
AGEYSIEYLIGTSIASGSAVEISVLVDGNWQSAGSTNVPLGQWDNFQALAANNNISLAQGTNRIKITGAGTHDWQWNLDA
FSLTLVTPENPDNPDNPDNPDDGNTGQPGTPFTIEMEAFDATGSDDPRAQGMVIGERGYPEDKHTVVDSNQTTDWVDYNI
NFPVSGNYRIEMLASGQTSHATAILFVDNVQINEVAVDTGNQAVFLDFELTDSTYISAGAHTIRVQSGSQINEFSWMWFG
DALTFTPLDGGSTDGDADNDGVLDSVDTCPNTPAGAQVDANGCEIIVDNDTDNDGVDNSIDQCPNTPAGAQVDANGCEIV
AVVDADNDGVEDSLDMCPNTPAGAPVNGQGCADSQLDADNDGVSDDIDQCPSTPAGSVVDGTGCIVVTPPADSDNDGVVD
TLDMCPNTAAGLTVDSQGCALSQLDSDNDGVTDDIDQCANTPSGETANATGCSSSQEGGGTDPDTPQPGLLYGELAGAMN
VSDTNPNWERTTDLLQTEDSVKGNTTEVYTGFIYDADGHISFYEHIDDSVRLYIDGVLVLSNDSWEASSQTTDLNLTPGT
HEIELRIGNADGGSGAVDGIGFGIDVDGGTNFVHPSTLSESIFTSVGEETGNPDLEQEGDIIVELESFVFTSTNGRVGSD
SVEGFSPTATGVNWVTNGDYGDYMVTFEEPGTYGAYITISAANDGSYGARVDVDGWPVAWGYFGGTGSWDVSSENLLYGG
TFVVEQAGEKVVRVEAIGGSDWQWSGDRVRFTRLGDVTAIPSPIYNPDDHFVAEIQGPQTDVTYLKKPVEIPANKKVLKS
DVWYTYPQNRELEGYDNFGATGAFWGHPPEHDFYDDTVIMDWAVDAVYAFQAEGYEYTARGEFDWGYGWFTEYTTNPQPH
YVRTLDDRNVRMTFMGYLSHDGYNNNWLSNHSPAFVPFMKSQVDQILKANPDKLMFDTQTNSTRSTDMRDFGGDFSPYAM
ENFRVWLSKKYSTGELAALGINDINSFDYGDFLRAQGVTHTSWSNAGDTLSGNIPLQEDYIYFNRDVWNQKFAEVLDYIR
QQQPDIEIGASTHLFESRGYVFNENLTFLSGELNLGARTTISELPTNILVHLKGAQAVDKTLVYFPYPWEFDELRLQDAP
RFGRGWVAQAYAYGGLFSIPANVWVGGEVWTWSPGADNYRDIYLFVRAQADLLDDYTSYSKVGLVHAMYSSMKAGFIDGG
NQIQSSTKLLTEGNINFDLLVFGDEGYPVVPRPEDFDKFDHIFFDGDEQYLTAEQQALLDQQGDKVRHIGQRGTVSGIEI
TVSISGTESNETVSAVSRIHETDAAAPYVVHLVNRPFAGGVTPTLNNVEVAIPQSYFPEVVTGATLHLPDGTSTSLTLST
NADGDVVLPVNNLEVWGILELAH
>Q93K96 2.4.99.-~~~aah~~~Autotransporter heptosyltransferase Aah~~~
MTFLSPPEIPTIKADNGTYYDFNNGARILFPKGEWHVNIIDEESGNILFSCDTKAGWVTSTKKYYVKFRIQAFKKGDEKP
FLDTVMELKDKPVLISFPTGTLGDIIAWFHYAEKFRIKHQCKLECSVSEEFITLLSDNYPDIKFTSAQDKYEGKPYATYR
IGLFFNGDTDNQPVDFRLVGFHRNAGYILGVSPQEDPPRLNLSAERKIQEPYVCIAVQSTAQAKHWNNGLGWAEVVRYLK
ELGYRVLCIDRNAHAGNGFVWNHIPWGAEDFTGALPLQERVDLLRHASFFVGLSSGLSWLAWASRIPVVLISGFSRPDSE
FYTPWRVFNSHGCNGCWDNTNYNFDHTDFLWCPVHKGTDRQFECTRLITGKQVCGVIRTLHSYLTNHDRII
>Q988D4 3.5.1.29~~~~~~2-(acetamidomethylene)succinate hydrolase~~~COG0596
MDMAADIASDHFISRRVDIGRITLNVREKGSGPLMLFFHGITSNSAVFEPLMIRLSDRFTTIAVDQRGHGLSDKPETGYE
ANDYADDIAGLIRTLARGHAILVGHSLGARNSVTAAAKYPDLVRSVVAIDFTPYIETEALDALEARVNAGSQLFEDIKAV
EAYLAGRYPNIPADAIRIRAESGYQPVDGGLRPLASSAAMAQTARGLRSDLVPAYRDVTKPVLIVRGESSKLVSAAALAK
TSRLRPDLPVVVVPGADHYVNEVSPEITLKAITNFIDA
>D9XDR8 4.2.3.162~~~~~~(-)-alpha-amorphene synthase ((2E,6E)-farnesyl diphosphate cyclizing)~~~
MAKMSTTHEEIALAGPDGIPAVDLRDLIDAQLYMPFPFERNPHASEAAAGVDHWLSTWGLTDDPAVAAMISCTRPAELAA
FNGPDMDSGLLQIAANQIAYQFVFDDRAEDIGRHSPGRLLPMLSESVAILRDGQPPTTPLGAALADLHRQVQERCTPAQA
ARWAWNSREYVHGLLYEAVAQAHPAPVESGLCRSIRSLIAGVEPFYPLCEAAQRCELAPEELHHPAMRRLSRLSADAAVW
IPDLFSAVKEQRAGGMINLALAYRRTHRCSLPAAVTLAVRHINSTIREFEDLYGEVRPELSPSGIGYVEGMAGWIRGCYF
WSRTVPRYADTLTAPAGL
>A0P8X0 3.2.1.1~~~igtZ~~~Alpha-amylase~~~
MTRKTRYLHQITTLILGGLLIVPAAAPPVSADIAATHVYHNHMPNFWAYYDLNTYNSTPVGSPIRYTYDGEVIQLKQNPP
AGYPYYLPNGSPMPHDDLVSYYSHHAKTGAYLTWPWSVANTLHSSHPQAQMHVTMSGSVVNNVNSIIQQGNVSGYNNPAW
GTPWKNAVTQLKTAGGDNRLDLIHFSGHHSMGPLVGNDYLLKDMIYHGATMAQPYFLGSSYKSSKGFFPTELGFSERIIP
VLNKLGIQWSVIGNNHFSRTLKDYPLLDSPGTDTMISPPNRSDLQNVSTAGAWVNEPMFNEQQVVYNKYPFASTAHWVRY
VDPATGAESRVVGVPVAQAQSWEEGYLGQVKADALKPYENLVAQKQIFVVAHDGDNSSGRAGSEETWRNAGNVTYADSGV
TGMGIDEYLRSNTPAAADVVHVQDGSWIDTRDSSSDPAWYHWHLPFGIWKGQFAAFNQVNGTAYAPKKNLAGVEEGMTVS
FEKGYHYLERNFALLQASLNYAKTAEQIWLEEHPNYWKPANPLDREVTYEGNQLNPWMLSYPVKGNPANDYAGGANPAEL
AWYFLLPAMDSGFGYYDENVDDSVKPALSFNQSLYFSKPYVSQKLAKDKTGPSVWWPQRYPYNPGSANVSKAEGWTLQHY
NNAFAIYTYAFDTSGISEIKVKVRAHRDKTADAADNTFKVYDPAGLAAAGIANIDPAKVGAWTEYPMNVRDLSADINGVD
WQPSSMTIMQKVPATDIGNLYFSYISDYRDQLLDYFIEAKDAKGNVTQSDIQQVYVGAGKYKLANGKYTESMQGTIEGTH
PFITDVPAVPDTEAPAVPANLQATVMNASSVGLSWNAATDNIRVTGYEIYRNGVRIGTTPSTSYTDSGLSASTAYEYRVK
AYDASGNLSGFSAAATATTPAGNHVTVYYKQGYSTPYIHYRPAGGTWTTAPGVAIPAAEVAGYNKITINIGAATQLEACF
NNGSGTWDSNGGSNYLFGTGTWTYTPTGKIQAGAPVAPSATPTVAPTATPTPKPSVTPTVTPITTPTVAPTLSPTPTVAP
TVKPSATPIATPTVTPTVSPTATPTVVPTIAPTATPTTSPSATPVPTATPAGNSATIYYKNTAFSNSYIHYKLDGATAWT
TSPGVQMQASTFSGYKAITIPLGSATGLTAAFNNGSGIWDNNGGSNYHFGTGSSSLTGGNLITGEPQADSVTFRVSVPGS
TPANAPVYLTGSFNSWNAADPAYLLTRGSDGIYSITLNLPAGSAVTYKLTRGSWATVETASSGADITNRTLTPAGGAQTV
TLTVQRWKDQ
>K9NBS6 3.5.1.13~~~aam~~~Acylamidase~~~
MTEQNLHWLSATEMAASVASNNLSPNEIAEAMIQRVDAVNPSINAIVQFDREQVTRDAAELSRQQEAGEKLGPLHGVPFT
IKDLTAVDGLPTTFGMKPMADNIATGNAVVVDRLRGAGGLFLGKTNTPESGYYGGTDNHLYGPTHNPWKLGNSAGGSSGG
ASAAVAAGLGPLAEGSDGAGSVRIPSALCGVVGLKPTTGVIPQTILAGRFYNWAYHGPITRTVADNALMLDIMAGPDNAD
PLSIERAETSYVEASKGDVKGLRVAWSPNLGLGHVDPEVLAVCLDALAAFEELGAQITEATPQWGNPSESMWSGIWVPGF
ASEYDLLDWENQRGEVDDYLIEIMHEAERLTGVDVGRADAFRGDMWDTWTTFMNDYDVLVSPTLASATFPLRQFAPSWLE
GASLREQLLDWLFTYPYNMLNNPAITVPAGFTADGRPVGLQIAARHRRDALVLRTAANFEAVRPWADKKPADSLVVA
>P0DUM4 ~~~aapA1~~~Toxic protein AapA1~~~
MATKHGKNSWKTLYLKISFLGCKVVVLLKR
>P0DUM5 ~~~aapA3~~~Toxic protein AapA3~~~
MKHKSGKRSWKTLYFEFAFLGLKVIVSVKR
>A3PMF8 2.6.1.1~~~~~~Aspartate/prephenate aminotransferase~~~
MAFLSDTLARVKPSQTIAVTNKARELAAAGRDVIGLGAGEPDFDTPDNIKAAAKRAIDAGRTKYTAVDGIPELKRAICEK
FERENGLKYTPAQVTVGTGGKQILYNALVATLNPGDEVIIPAPYWVSYPDMVLLAGGTPVSVAAGMETGFKLTPEQLEAA
ITPRTKWFIFNSPSNPTGAAYTRAELAALCEVLMRHPQVWIMSDDMYEHLVFDDFDFTTPAQIEPGLYDRTLTCNGVSKA
YCMTGWRIGYAAGPVELIRAMGTIQSQSTSNPCSIAQYAALEALSGPQEFLATNREAFQRRRDLVVSMLNEAKGVTCPNP
EGAFYVYPDISGCIGKTSAGGAKITDDEAFASALLEETGVAVVFGAAFGLSPNFRISYATADEVLREACARIQAFCAGLS
>Q8KDS8 2.6.1.1~~~~~~Aspartate/prephenate aminotransferase~~~COG0436
MSVESFERFLSRRVLSMQESQTMKITGLAKKMQAEGKDVVSLSAGEPDFPTPENVCEAGIEAIRKGFTRYTANSGIPELK
KAIIRKLQRDNGLEYAEDEIIVSNGGKQALANTFLALCDEGDEVIVPAPYWVSFPEMARLAEATPVIVETSIETGYKMTP
EQLAAAITPKTRILVLNSPSNPSGAVYNEAEVRALMQVIEGKEIFVLSDEMYDMICYGGVRPFSPARIPEMKPWVIVSNG
TSKSYSMTGWRIGYLAAPKWIINACDKIQSQTTSNANSIAQKAAVAALDGDQSIVEQRRAEFEKRRDFMFRELNTISGIE
CTLPEGAFYIFPSIKGLLGKTFGGKVMKDSTDVAEYLLTEHYVATVPGDAFGAPENLRLSYAASIEELAEAVNRIRKAFS
>Q82WA8 2.6.1.1~~~aatA~~~Aspartate/prephenate aminotransferase~~~COG0436
MKLSQRVQAIKPSPTLAVTAKAARLKAEGKNIIGLGAGEPDFDTPLHIKDAAITAIRNGFTKYTAVGGTASLKQAIISKF
KRENSLEFMPGEILVSSGGKQSFFNLVLATIDPGDEVIIPAPYWVSYPDIVLIAEGKPVFIDTGIEEKFKISPDQLEKAI
TPRTRMFVVNSPSNPSGSVYSLEELQALGAVLRKYPDILIATDDMYEHILLSGDGFVNILNACPDLKARTVVLNGVSKAY
AMTGWRIGYCGGPAAIITAMENIQSQSTSNPNSIAQVAAEAALNGDQSCMVPMIEAFRERNQFLTNALNSIAGIHCLLSE
GAFYAFVDVRQAISRLNTQQILQNSSDIAFCNYVLEKAEVAAVPGSAFGCEGYMRLSFATSMDNLQEAVKRIASLLS
>Q02635 2.6.1.1~~~aatA~~~Aspartate/prephenate aminotransferase~~~COG0436
MAFLADALSRVKPSATIAVSQKARELKAKGRDVIGLGAGEPDFDTPDNIKKAAIDAIDRGETKYTPVSGIPELREAIAKK
FKRENNLDYTAAQTIVGTGGKQILFNAFMATLNPGDEVVIPAPYWVSYPEMVALCGGTPVFVPTRQENNFKLKAEDLDRA
ITPKTKWFVFNSPSNPSGAAYSHEELKALTDVLMKHPHVWVLTDDMYEHLTYGDFRFATPVEVEPGLYERTLTMNGVSKA
YAMTGWRIGYAAGPLHLIKAMDMIQGQQTSGAASIAQWAAVEALNGPQDFIGRNKEIFQGRRDLVVSMLNQAKGISCPTP
EGAFYVYPSCAGLIGKTAPSGKVIETDEDFVSELLETEGVAVVHGSAFGLGPNFRISYATSEALLEEACRRIQRFCAACR
>Q56232 2.6.1.1~~~aspC~~~Aspartate/prephenate aminotransferase~~~COG0436
MRGLSRRVQAMKPSATVAVNAKALELRRQGVDLVALTAGEPDFDTPEHVKEAARRALAQGKTKYAPPAGIPELREALAEK
FRRENGLSVTPEETIVTVGGKQALFNLFQAILDPGDEVIVLSPYWVSYPEMVRFAGGVVVEVETLPEEGFVPDPERVRRA
ITPRTKALVVNSPNNPTGAVYPKEVLEALARLAVEHDFYLVSDEIYEHLLYEGEHFSPGRVAPEHTLTVNGAAKAFAMTG
WRIGYACGPKEVIKAMASVSSQSTTSPDTIAQWATLEALTNQEASRAFVEMAREAYRRRRDLLLEGLTALGLKAVRPSGA
FYVLMDTSPIAPDEVRAAERLLEAGVAVVPGTDFAAFGHVRLSYATSEENLRKALERFARVLGRA
>Q52812 ~~~aapJ~~~General L-amino acid-binding periplasmic protein AapJ~~~COG0834
MKNKLLSAAIGAAVLAVGASAASATTLSDVKAKGFVQCGVNTGLTGFAAPDASGNWAGFDVDFCKAVASAVFGDPTKVKY
TPTNAKERFTALQSGEIDVLSRNTTWTINRDTALGFNFRPVTYYDGQGFMVRKGLNVKSALELSGAAICVQSGTTTELNL
ADYFKTNNLQYNPVVFENLPEVNAAYDAGRCDVYTTDQSGLYSLRLTLKNPDEHIILPEIISKEPLGPAVRQGDDQWFDI
VSWTAYALINAEEFGITQANVDEMKNSPNPDIKRFLGSETDTKIGTDLGLTNDWAANVIKGVGNYGEIFERNIGQGSPLK
IARGLNALWNKGGIQYAPPVR
>P46116 3.4.21.105~~~aarA~~~Rhomboid protease AarA~~~
MAEQQNPFSIKSKARFSLGAIALTLTLVLLNIAVYFYQIVFASPLDSRESNLILFGANIYQLSLTGDWWRYPISMMLHSN
GTHLAFNCLALFVIGIGCERAYGKFKLLAIYIISGIGAALFSAYWQYYEISNSDLWTDSTVYITIGVGASGAIMGIAAAS
VIYLIKVVINKPNPHPVIQRRQKYQLYNLIAMIALTLINGLQSGVDNAAHIGGAIIGALISIAYILVPHKLRVANLCITV
IAASLLTMMIYLYSFSTNKHLLEEREFIYQEVYTELADANQ
>Q54765 1.2.1.80~~~~~~Long-chain acyl-[acyl-carrier-protein] reductase~~~COG5322
MFGLIGHLTSLEQARDVSRRMGYDEYADQGLEFWSSAPPQIVDEITVTSATGKVIHGRYIESCFLPEMLAARRFKTATRK
VLNAMSHAQKHGIDISALGGFTSIIFENFDLASLRQVRDTTLEFERFTTGNTHTAYVICRQVEAAAKTLGIDITQATVAV
VGATGDIGSAVCRWLDLKLGVGDLILTARNQERLDNLQAELGRGKILPLEAALPEADFIVWVASMPQGVVIDPATLKQPC
VLIDGGYPKNLGSKVQGEGIYVLNGGVVEHCFDIDWQIMSAAEMARPERQMFACFAEAMLLEFEGWHTNFSWGRNQITIE
KMEAIGEASVRHGFQPLALAI
>P31119 ~~~aas~~~Bifunctional protein Aas~~~COG0204
MLFSFFRNLCRVLYRVRVTGDTQALKGERVLITPNHVSFIDGILLGLFLPVRPVFAVYTSISQQWYMRWLKSFIDFVPLD
PTQPMAIKHLVRLVEQGRPVVIFPEGRITTTGSLMKIYDGAGFVAAKSGATVIPVRIEGAELTHFSRLKGLVKRRLFPQI
TLHILPPTQVAMPDAPRARDRRKIAGEMLHQIMMEARMAVRPRETLYESLLSAMYRFGAGKKCVEDVNFTPDSYRKLLTK
TLFVGRILEKYSVEGERIGLMLPNAGISAAVIFGAIARRRMPAMMNYTAGVKGLTSAITAAEIKTIFTSRQFLDKGKLWH
LPEQLTQVRWVYLEDLKADVTTADKVWIFAHLLMPRLAQVKQQPEEEALILFTSGSEGHPKGVVHSHKSILANVEQIKTI
ADFTTNDRFMSALPLFHSFGLTVGLFTPLLTGAEVFLYPSPLHYRIVPELVYDRSCTVLFGTSTFLGHYARFANPYDFYR
LRYVVAGAEKLQESTKQLWQDKFGLRILEGYGVTECAPVVSINVPMAAKPGTVGRILPGMDARLLSVPGIEEGGRLQLKG
PNIMNGYLRVEKPGVLEVPTAENVRGEMERGWYDTGDIVRFDEQGFVQIQGRAKRFAKIAGEMVSLEMVEQLALGVSPDK
VHATAIKSDASKGEALVLFTTDNELTRDKLQQYAREHGVPELAVPRDIRYLKQMPLLGSGKPDFVTLKSWVDEAEQHDE
>A0A1H7VGH3 3.2.2.-~~~~~~3' cyclic ADP-D-ribose synthase AaTIR~~~
MKNRSYEYDVALSFAGENRAYVERVANSLKTKGVKVFYDLFEEANLWGKNLYEYLSEIYQNKARYTVLFVSSFYNKKLWT
NHERVSMQARAFQESREYILPARFDDTEIPGILKTIGYINLENRTPEELAVLIENKLKKDQTFFKNRWSKLSTMISPKPF
IFTIKVVDEKSQLVKHAKVVLVANNSTYLEGYTDENGLAHFVIRTRKLYTVLIAHSEYPAVVFKSMNPKEDIEVTIEKTN
NSGSVIINKSGQIPGISGKIEPVLKSDKNLSVYADNIAIEGGKDQPYDFELNKSIVLEDNKGNIVHLTFRFYQARIALID
FYRGRSM
>O66728 2.7.7.-~~~~~~A-adding tRNA nucleotidyltransferase~~~COG0517
MVCPKVVILSEGADLDSLSAAYGVLKLYPDAYLLKPKHLSKKAGEVFKKYRDKFRVIEDLPDCFELVLVDTHFLPEGLPR
ERIKRIIVYDHHPIGDVKEFEGKIEKVGAATTLVVEEIKEKGIDINPRDATLLAFGIYEDTGNFTYEGTTPRDALALAFL
LEKGANLREIREVVMETYTPEQIEAVGKIVQSIEKVFINGRQISFATAVLERYQPDINTLLYEIKDLKESDAFFVIIEAE
GKTYVFGRSQSEDVDVGEILSHFGGGGHREAGAVKLENVSAERIKELIKAFLKRKYVKLKVRDIMNTPPFVLEEHVSVKD
ALTELSERGIANAPVINREGKLVGIISKKALLKLVKLYPDEPIELFVNRDFYTLSPDAPVWEAEEILTKFGQKLIPVVED
GTVVGVVTRLDILQAVKEDLEKLKEKRRKIKVPENIEEIAREVGQIAKEMGLRAYIVGGVVRDILLGKEVWDVDFVVEGN
AIELAKELARRHGVNVHPFPEFGTAHLKIGKLKLEFATARRETYPRPGAYPKVEPASLKEDLIRRDFTINAMAISVNLED
YGTLIDYFGGLRDLKDKVIRVLHPVSFIEDPVRILRALRFAGRLNFKLSRSTEKLLKQAVNLGLLKEAPRGRLINEIKLA
LREDRFLEILELYRKYRVLEEIIEGFQWNEKVLQKLYALRKVVDWHALEFSEERIDYGWLYLLILISNLDYERGKHFLEE
MSAPSWVRETYKFMKFKLGSLKEELKKAKENYEVYRLLKPLHTSVLLLLMLEEELKEKIKLYLEKLRKVKLPKEKIEELK
KQGLKGKELGERIEELKREIMNKI
>Q9RV39 2.7.7.-~~~~~~A-adding tRNA nucleotidyltransferase~~~COG0617
MFRRRPPLPPFPPGAALVGGAVRDWLRGVRSADYDWAHPDPAAGARALAALVGGAAFPLDEERGYWRVTAGEVQHDFVPL
PPNLEDDLRRRDFTVNAIALREGRRLVDPLGGQQDLKRRVLRMVSEDNLRADPLRAWRAARFVTTLSFTLEPQTEQAVRQ
VAADLKAGRLPFPAWERVRDELHALLRSPDAARGILTLEALGLLDLTLPELREGQGLTQGGFHHLDVFEHGVEALHQLLT
RRPDADLLLRWATLLHDVGKPRTFARDPDTGRRSFHGHDRVGAELTTQILTRLKLPGADVKRAAALVKAHMVQLPADDAQ
ARRFVHRRRELLPDLLSLMLADREAARGPSSSELGRFAYMLAMERVLAALEEQPAAPPPLLSGKEVMALLGLTPGPRVGE
VLRALAEARALGEVGTPQEARAFVQRWAEETPGS
>Q74CU0 2.7.7.-~~~~~~A-adding tRNA nucleotidyltransferase~~~COG0617
MDVITTHVNADFDCLGAMVAASKLYPDALMVFSGSQEKSMRDLFLKTTGYALPFTRLRDVDFSDITRLVLVDCQHTSRIG
RFAEVARRPGVEVHIYDHHPGSSGDIRPSGGEIRDCGSSTTILTRKLMEQGIEVTAVEATLMMLGIYEDTGNLTFPSTTP
EDYAAASWLLERGANLNIVSDFVSQELTAEQVALLNDLLKSLRSTPVNGVDIAVAHATLDHYVGDIAVLAHMMRDMQNLD
AIFLVVGMGERVYLVARSRIAEVDAGAVMRVFGGGGHATAAAATVRDQTVIQVLGRLNRLLPELVNPVRTAADLMSSPVI
TLPLATTITEAREILTRYNVNAMPVMDGERMAGIISRRIVEKALYHGLGNLPVDEYMHTEFLRAAPDTPINAIQDYIVGQ
HRRLVPVFSGERLVGVITRTDLLRYMYTGTQRNAEPVYDLGSENLPVRRREVVHLMNRHLPRPTVAMLRDLGKVGDELEL
PVYAVGGFVRDLLLGAENDDIDVSVEGDGILFAETVANRVGCRVKSHAKFGTAVIVFPDGLKVDVASTRLEYYETPGALP
TVERSSLKMDLYRRDFTINTLAVKLNAEGFGTLIDYFGAYRDLQEKTIRVLHNLSFVEDPTRVFRAIRFEQRLGFPISRH
TENLIKNAVKMGFLDKLGGRRLLNELVLILREREPVKAILRMSGLGLLRFIHPDLVLAPNTLQVLDEVKKVITWFDLLYL
GEKVETWVVYFLALTSSLPDEGFWGTCTRLSVSEHYREKLIDMRVHGEQVLEVMTRKAARREDVRRSDIYFWLRGLSPEV
LLYIMAKTRSDEVRRYVSLYVTQLRGIVTHITGDDLKTLGIPSGPRYREILDRVLTARLNGEAATRDDEMRIAVRLADSA
>Q9K8X1 2.7.7.-~~~~~~A-adding tRNA nucleotidyltransferase~~~COG0617
MSEEHQESYHSDNLIDLMNHTLTNDHLQLLKKLGEMAAKLRMNLFLVGGTVRDMLRGVPGGDLDLVIEGDALAFSQNVAN
VLGGKVKHHEPFATATWVGAENLKLDIVSARAESYAKPGALPTIRHSHITDDLARRDFSINAMAIHLHPASYGQLVDPFH
GRHDLTNGLIRILHSQSFIDDPTRLLRGVRFVSRFNYRFEQKTANLALATQPALTNALANVSPERIVHELKLLCHETDPV
SSFSKLEDLHVWQALLGLTFSSSSATHLSRLQEEQNGEPLHWFQAIATVGFLEDNWKASLVPFAITAMEQRFLQNIEDIQ
KRLTNMTRFSTDYLHKQLYQVPEEPLRFYALSSGEEMQKVLDLYLHQRKQLQPLLTGHDLMELGMKPSPLFKECLLLHEC
EQLKGTIENKQDALQFAREFFNHKQPL
>P74081 2.7.7.-~~~~~~A-adding tRNA nucleotidyltransferase~~~COG0517
MDLILCHQTADFDVLGAAVGLAKLHPGSRIVLTGGSHPTVRQFLALHRNEFPLIELRSVNPDKIRSLYIVDNQQGDRLGK
AADWLTLPHLRQVAIYDHHLNSPRDIEADIWELEAVGASTTLIVEKLQRADISLSMVEASVMALGIHVDTGSLTFTQTTV
RDVKALAWLMEQGANLRLIAEYADPGFPPPLQFLFAEAMQNLHKEMVRGYWLGSVLLTTENFVPGLSHLTERLLSLTECD
ALLLGHVYDKGKDKTKNDEQREKTGVISNQRFSLIGRTRIPDTDLTQLLEPYGGGGHAQAAAVNLRDVEPTTVMAEIYQA
LQRQIPKPLLARDFMSSPVRTIRPHTTIEQAQRVLFRYGHSGLTVVNQEEKLVGIISRRDLDLALHHGFSHAPVKGYMTR
NVKTIAPDTPLPRIEAIMVADDVGRLPVMDQEKLVGIVTRTDVLRQLLQDKQEQSGRFGAPRSGSLRDRPTTPNQNFGQG
SLLQLLKTHLPPATWQLLTTAAQQAQTRGWHLYLVGGAVRDLWLRHSQGAEKHQNITFQDIDLVVDGFHATADVGAGVEL
AQALQGDYPGARLSVHGEFQTAALLWHKDPQLDTLWVDIATARTEFYPYPAANPEVEASSIRQDLYRRDFTINALSIRLT
NPNPGKLLDFFGGMNDLQSQQVRVLHANSFIEDPTRIYRAVRFVVRLGFELEPQTETYIRYAIASGVYERWRLTEHPTPA
LTTRLKAELSIILKAPYWKGALTLLADLDALKCLHAELKLTEQLWWQVRYLSRWLRWFDPERNLEHWLLRLGILLGALPP
QEREKIAAGLQLPKATIDGLTNLETIETAIAKGLNSPASPKSSPKSSVIYQTLKDYDRFSLFLVAARGNKLLRKQIWFYF
SQLCQVSPFLTGHDLKALGYKPGPQFKQILTDLLNACLDGELGDRQQEEAWLRTHYPLPNKV
>P23034 2.6.1.1~~~~~~Aspartate aminotransferase~~~
MKELLANRVKTLTPSTTLAITAKAKEMKAQGIDVIGLGAGEPDFNTPQNIMDAAIDSMQQGYTKYTPSGGLPALKQAIIE
KFKRDNQLEYKPNEIIVGVGAKHVLYTLFQVILNEGDEVIIPIPYWVSYPEQVKLAGGVPVYIEATSEQNYKITAEQLKN
AITDKTKAVIINSPSNPTGMVYTREELEDIAKIALENNILIVSDEIYEKLLYNGAEHFSIAQISEEVKAQTIVINGVSKS
HSMTGWRIGYAAGNADIINAMTDLASHSTSNPTTASQYAAIEAYNGPQDSVEEMRKAFESRLETIYPKLSAIPGFKVVKP
QGAFYLLPDVSEAAQKTGFASVDEFASALLTEANVAVIPGSGFGAPSTIRISYATSLNLIEEAIERIDRFVK
>P00509 2.6.1.1~~~aspC~~~Aspartate aminotransferase~~~COG1448
MFENITAAPADPILGLADLFRADERPGKINLGIGVYKDETGKTPVLTSVKKAEQYLLENETTKNYLGIDGIPEFGRCTQE
LLFGKGSALINDKRARTAQTPGGTGALRVAADFLAKNTSVKRVWVSNPSWPNHKSVFNSAGLEVREYAYYDAENHTLDFD
ALINSLNEAQAGDVVLFHGCCHNPTGIDPTLEQWQTLAQLSVEKGWLPLFDFAYQGFARGLEEDAEGLRAFAAMHKELIV
ASSYSKNFGLYNERVGACTLVAADSETVDRAFSQMKAAIRANYSNPPAHGASVVATILSNDALRAIWEQELTDMRQRIQR
MRQLFVNTLQEKGANRDFSFIIKQNGMFSFSGLTKEQVLRLREEFGVYAVASGRVNVAGMTPDNMAPLCEAIVAVL
>C6C2Z3 2.6.1.1~~~~~~Aspartate aminotransferase~~~COG0436
MRSVADRVKRIGLSETYAILDKVKKMKAEGHVVYDLGGGEPDFSTPEHIINFTVSAMKNGMTHYTASKGSPGLLKAIANR
LFEENHISACWDKNIIVTPSAKHALFITLMTLLNPGDEIVIPSPCWVSYIAMAEMAGAKAVDLPLTRENKYQITRKALAA
CITDKTRVLLLNNPNNPTGHILTEEEIQVICQVALEHDLFVVMDEIYEHIRYITAPHRSIAAEPGMFERTITVSGFSKAW
AMTGWRLGYLCAPEYVLNEILKVQQHSVGCAGAFIQQGGLAALIGDRQPMEDMVKAYRKRRDYMVDSLNRIPGIECYVPE
GGLYVYADIRGLGMGDAQTFTLWLLAHAHVAVTPGTAFGKEETMMIRLSFAGAMETIVAAMDSIAEAITEYDASLQQEAS
>P58350 2.6.1.1~~~aatB~~~Aspartate aminotransferase~~~COG0436
MTINATVKEAGFQPASRISSIGVSEILKIGARAAAMKREGKPVIILGAGEPDFDTPEHVKQAASDAIHRGETKYTALDGT
PELKKAIREKFQRENGLAYELDEITVATGAKQILFNAMMASLDPGDEVIIPTPYWTSYSDIVHICEGKPVLIACDASSGF
RLTAEKLEAAITPRTRWVLLNSPSNPSGAAYSAADYRPLLEVLLRHPHVWLLVDDMYEHIVYDGFRFVTPAQLEPGLKNR
TLTVNGVSKAYAMTGWRIGYAGGPRELIKAMAVVQSQATSCPSSISQAASVAALNGPQDFLKERTESFQRRRDLVVNGLN
AIDGLDCRVPEGAFYTFSGCAGVLGKVTPSGKRIKTDTDFCAYLLEDAHVAVVPGSAFGLSPFFRISYATSEAELKEALE
RIAAACDRLS
>Q06191 2.6.1.1~~~aatB~~~Aspartate aminotransferase~~~
MTINATVKEAGFRPASRISSIGVSEILKIGARAAAMKREGKPVIILGAGEPDFDTPDHVKQAASDAIHRGETKYTALDGT
PELKKAIREKFQRENGLAYELDEITVATGAKQILFNAMMASLDPGDEVVIPTPYWTSYSDIVQICEGKPILIACDASSGF
RLTAQKLEAAITPRTRWVLLNSPSNPSGAAYSAADYRPLLDVLLKHPHVWLLVDDMYEHIVYDAFRFVTPARLEPGLKDR
TLTVNGVSKAYAMTGWRIGYAGGPRALIKAMAVVQSQATSCPSSVSQAASVAALNGPQDFLKERTESFQRRRNLVVNGLN
AIEGLDCRVPEGAFYTFSGCAGVARRVTPSGKRIESDTDFCAYLLEDSHVAVVPGSAFGLSPYFRISYATSEAELKEALE
RISAACKRLS
>Q82DR2 2.6.1.1~~~aspC1~~~Aspartate aminotransferase~~~COG0436
MSAATPPTERRVSARVGAISESATLAVDAKAKALKAAGRPVIGFGAGEPDFPTPDYIVQAAIEACSNPKYHRYTPAGGLP
ELKAAIAAKTLRDSGYEVDASQVLVTNGGKQAIYEAFAAILDPGDEVIVPAPYWTTYPESIRLAGGVPVDVVADETTGYR
VSVEQLEAARTENTKVLLFVSPSNPTGAVYTREQIEEIGRWAAEKGLWVLTDEIYEHLVYGDAEFHSLPVVVPELADKCI
VVNGVAKTYAMTGWRVGWVIGPKDVIKAATNLQSHATSNVSNVAQVAALAAVSGDLTAVAEMREAFDRRRKTIVRMLNEI
GGVLCPEPEGAFYAYPSVKALLGKEIRGKRPQDTVELAALILEEAEVAVVPGEAFGTPGYLRLSYALGDEDLVEGVSRIQ
KLLSEAKD
>Q55128 2.6.1.1~~~aspC~~~Aspartate aminotransferase~~~COG0436
MRLTQRVSQVVPSITLEITAKAKAMRTEGIDVLSFTAGEPDFTTPPHIVEAAKLALDEGKTRYGPAAGEPALRQAIAKKL
REKNNLPYEAANILVTNGGKHSLFNLMLAMIEQGDEVIIPAPYWLSYPEMVRLAEGTPVIVNTTAATDYKITPEQLRQAI
TSKSKLFVLNSPSNPTGAVYTPAEIRALAAVILEYEDLYVVSDEIYERILYDGTEHLSIGAVNDEIFQRTIISNGFAKSY
SMTGWRVGYLAGELPLIQACSTIQGHSTSNVCTFAQYGAIAALENPQTCVETMVKAFTERRQVIVEGINQIAGLSCPNPK
GAFYVFVDIAKTGLNSLEFSARLLESHQVAVIPGAAFGADDCVRFSYATDMDTIKQGLAELERFVSTLA
>Q9X0Y2 2.6.1.1~~~aspC~~~Aspartate aminotransferase~~~COG0436
MVSRRISEIPISKTMELDAKAKALIKKGEDVINLTAGEPDFPTPEPVVEEAVRFLQKGEVKYTDPRGIYELREGIAKRIG
ERYKKDISPDQVVVTNGAKQALFNAFMALLDPGDEVIVFSPVWVSYIPQIILAGGTVNVVETFMSKNFQPSLEEVEGLLV
GKTKAVLINSPNNPTGVVYRREFLEGLVRLAKKRNFYIISDEVYDSLVYTDEFTSILDVSEGFDRIVYINGFSKSHSMTG
WRVGYLISSEKVATAVSKIQSHTTSCINTVAQYAALKALEVDNSYMVQTFKERKNFVVERLKKMGVKFVEPEGAFYLFFK
VRGDDVKFCERLLEEKKVALVPGSAFLKPGFVRLSFATSIERLTEALDRIEDFLNSR
>P84887 1.4.9.2~~~aauA~~~Aralkylamine dehydrogenase light chain~~~
MRWLDKFGESLSRSVAHKTSRRSVLRSVGKLMVGSAFVLPVLPVARAAGGGGSSSGADHISLNPDLANEDEVNSCDYWRH
CAVDGFLCSCCGGTTTTCPPGSTPSPISWIGTCHNPHDGKDYLISYHDCCGKTACGRCQCNTQTRERPGYEFFLHNDVNW
CMANENSTFHCTTSVLVGLAKN
>P84888 1.4.9.2~~~aauB~~~Aralkylamine dehydrogenase heavy chain~~~COG3391
MKSKFKLTTAAAMLGLMVLAGGAQAQDKPREVLTGGHSVSAPQENRIYVMDSVFMHLTESRVHVYDYTNGKFLGMVPTAF
NGHVQVSNDGKKIYTMTTYHERITRGKRSDVVEVWDADKLTFEKEISLPPKRVQGLNYDGLFRQTTDGKFIVLQNASPAT
SIGIVDVAKGDYVEDVTAAAGCWSVIPQPNRPRSFMTICGDGGLLTINLGEDGKVASQSRSKQMFSVKDDPIFIAPALDK
DKAHFVSYYGNVYSADFSGDEVKVDGPWSLLNDEDKAKNWVPGGYNLVGLHRASGRMYVFMHPDGKEGTHKFPAAEIWVM
DTKTKQRVARIPGRDALSMTIDQQRNLMLTLDGGNVNVYDISQPEPKLLRTIEGAAEASLQVQFHPVGGV
>Q9Z6M6 ~~~aaxA~~~Porin AaxA~~~COG3659
MISFRFLLLSGLCALGISSYAETPKETTGHYHRYKARIQKKHPESIKESAPSETPHHNSLLSPVTNIFCSHPWKDGISVS
NLLTSVEKATNTQISLDFSILPQWFYPHKALGQTQALEIPSWQFYFSPSTTWTLYDSPTAGQGIVDFSYTLIHYWQTNGV
DANQAAGTASSMNDYSNRENNLAQLTFSQTFPGDFLTLAIGQYSLYAIDGTLYDNDQYSGFISYALSQNASATYSLGSTG
AYLQFTPNSEIKVQLGFQDSYNIDGTNFSIYNLTKSKYNFYGYASWTPKPSCGDGQYSVLLYSTRKVPEQNSQVTGWSLN
AAQHIHEKLYLFGRINGATGTALPINRSYVLGLVSENPLNRHSQDLLGIGFATNKVNAKAISNVNKLRRYESVMEAFATI
GFGPYISLTPDFQLYIHPALRPERRTSQVYGLRANLSL
>Q9Z6M7 4.1.1.19~~~aaxB~~~Pyruvoyl-dependent arginine decarboxylase AaxB~~~COG1945
MAYGTRYPTLAFHTGGIGESDDGMPPQPFETFCYDSALLQAKIENFNIVPYTSVLPKELFGNIVPVDTCVKSFKHGAVLE
VIMAGRGAALSDGTHAIATGIGICWGKDKNGELIGGWAAEYVEFFPTWINDEIAETHAKMWLKKSLQHELDLRSIAKHSE
FQFFHNYINIKQKFGFCLTALGFLNFENAEPAKVN
>Q9Z6M8 ~~~aaxC~~~Arginine/agmatine antiporter~~~COG0531
MTSRTKSSKNLGTIALAGMVVSSIIGGGIFSLPQNMAATAGAGAVILSWILTGFGMFFIANTFRILSTIRPDLKEGIYMY
SREGFGPYIGFTIGWGYWLCQIFGNVGYAVITMDALNYFFPPYFQGGNTLPAILGGSILIWVFNFIVLKGIRQASIINVI
GTIFKIIPLIIFIILTAFFFKLAVFKTDFWGHAVTKAQPSLGSVSSQLKGTMLVTLWAFIGIEGAVVMSGRAKNPLSVGQ
ATVLGFLGCLTIYILFSLLPFGSLFQHQLANIPNPSTAGVLDILVGKWGEVLMNVGLIIAVLSSWLSWTIIVAEIPFSAA
KNGTFPEIFTIENKEKSPSVSLYITSSVMQLAMLLVYFSSNAWNTMLSITGVMVLPAYLASAAFLFKLSKSKTYPKKGSI
KAPLAMITGILGVVYSLWLIYAGGLKYLFMALVLLALGIPFYIDAGKKKKNAKTFFAKKEIVGMTFIGLLALTAIFLFST
GRIKI
>P0DPR5 ~~~abaF~~~Fosfomycin resistance protein AbaF~~~
MTTTLKKVVAASMVGSVAEWYEFFLYGTASALVFGELFFQQTGNAIDGILAAFALYAVGFLARPLGGLVFGHYGDKIGRK
KLLQISLIIVGITTFLMGCIPTFHQIGYWAPTLLVILRLIQGFAFGGEWGGAVILVSEHSPDDRRGYWASWPQTGVPLGN
LVATLVLLLLSKNLSPEQFLDWGWRCAFWFSAVVVLIGLWIRKNVDDAEVFKEAQAKQQLLEKQQLGIIEVLKYHKKSVI
AGIGARFAENILYYMVVTFSISYLKLVVHKDTSQILLLMFGAHLIHFFIIPFMGHLSDIFGRKPIYLIGAVLTAFWGFVG
FPLMDTGNDWLIMLAIVLGLFIESMTYSPYSALMTELFPTHIRYTALSFCYQVAPIMAGSLAPLIALTLLKEFNSSIPIS
LYLVAASLISIVSILLVKETKGRSLAFKD
>B0FLN1 2.3.1.184~~~abaI~~~Acyl-homoserine-lactone synthase~~~COG3916
MNIIAGFQNNFSEGLYTKFKSYRYRVFVEYLGWELNCPNNEETRIQFDKVDTAYVVAQDRESNIIGCARLLPTTQPYLLG
EIFPQLLNGMPIPCSPEIWELSRFSAVDFSKPPSSSSQAVSSPISIAILQEAINFAREQGAKQLITTSPLGVERLLRAAG
FRAHRAGPPMMIDGYSMFACLIDV
>P0DPR4 ~~~abaQ~~~Quinolone resistance transporter~~~
MDFEKDVIRTVTFKLIPALVILYLVAYIDRAAVGFAHLHMGADVGIGDAAYGLGAGLFFIGYFLFEVPSNLLLDKFGARK
WFTRILLTWGLITMAMALIQGPKSFYLLRFLLGVAEAGFFPGVLYLITQWYPVRHRGKIMGMFVLSQPIAMMIAGPLAGL
LLGMDGIANLHGWQWLFVAVGLPAVLLALPTFLWLPDNIDKVKWLSIEQKQWLKNELVKDEAEYDQTRHANPLHALKDKR
VLLLALYYLPVTLSIYGLNLWLPTIIKQFGGGSDIQIGFLSSIPYIFGIIGLLIIPRSTDRLNDRYGHLSFLYALGACAM
FLSGWLNSPVMQLAALAVVAFCLFSSTAVFWTLPGRFLTGASAAAGIALINSVGNLGGYVGPFGIGLLKEYTGNMAAGLY
FLSIVMLFGLILTYIVYAKLERQKTQTVNIQKPL
>O65934 7.-.-.-~~~~~~ABC transporter ATP-binding/permease protein Rv1747~~~COG0842
MPMSQPAAPPVLTVRYEGSERTFAAGHDVVVGRDLRADVRVAHPLISRAHLLLRFDQGRWVAIDNGSLNGLYLNNRRVPV
VDIYDAQRVHIGNPDGPALDFEVGRHRGSAGRPPQTTSIRLPNLSAGAWPTDGPPQTGTLGSGQLQQLPPATTRIPAAPP
SGPQPRYPTGGQQLWPPSGPQRAPQIYRPPTAAPPPAGARGGTEAGNLATSMMKILRPGRLTGELPPGAVRIGRANDNDI
VIPEVLASRHHATLVPTPGGTEIRDNRSINGTFVNGARVDAALLHDGDVVTIGNIDLVFADGTLARREENLLETRVGGLD
VRGVTWTIDGDKTLLDGISLTARPGMLTAVIGPSGAGKSTLARLVAGYTHPTDGTVTFEGHNVHAEYASLRSRIGMVPQD
DVVHGQLTVKHALMYAAELRLPPDTTKDDRTQVVARVLEELEMSKHIDTRVDKLSGGQRKRASVALELLTGPSLLILDEP
TSGLDPALDRQVMTMLRQLADAGRVVLVVTHSLTYLDVCDQVLLLAPGGKTAFCGPPTQIGPVMGTTNWADIFSTVADDP
DAAKARYLARTGPTPPPPPVEQPAELGDPAHTSLFRQFSTIARRQLRLIVSDRGYFVFLALLPFIMGALSMSVPGDVGFG
FPNPMGDAPNEPGQILVLLNVGAVFMGTALTIRDLIGERAIFRREQAVGLSTTAYLIAKVCVYTVLAVVQSAIVTVIVLV
GKGGPTQGAVALSKPDLELFVDVAVTCVASAMLGLALSAIAKSNEQIMPLLVVAVMSQLVFSGGMIPVTGRVPLDQMSWV
TPARWGFAASAATVDLIKLVPGPLTPKDSHWHHTASAWWFDMAMLVALSVIYVGFVRWKIRLKAC
>P77674 1.2.1.19~~~patD~~~Gamma-aminobutyraldehyde dehydrogenase~~~COG1012
MQHKLLINGELVSGEGEKQPVYNPATGDVLLEIAEASAEQVDAAVRAADAAFAEWGQTTPKVRAECLLKLADVIEENGQV
FAELESRNCGKPLHSAFNDEIPAIVDVFRFFAGAARCLNGLAAGEYLEGHTSMIRRDPLGVVASIAPWNYPLMMAAWKLA
PALAAGNCVVLKPSEITPLTALKLAELAKDIFPAGVINILFGRGKTVGDPLTGHPKVRMVSLTGSIATGEHIISHTASSI
KRTHMELGGKAPVIVFDDADIEAVVEGVRTFGYYNAGQDCTAACRIYAQKGIYDTLVEKLGAAVATLKSGAPDDESTELG
PLSSLAHLERVGKAVEEAKATGHIKVITGGEKRKGNGYYYAPTLLAGALQDDAIVQKEVFGPVVSVTPFDNEEQVVNWAN
DSQYGLASSVWTKDVGRAHRVSARLQYGCTWVNTHFMLVSEMPHGGQKLSGYGKDMSLYGLEDYTVVRHVMVKH
>Q8ZPC9 1.2.1.19~~~patD~~~Gamma-aminobutyraldehyde dehydrogenase~~~
MQYQLLINGVLVDGEGERQSVYNPATGEVILEIAEASPAQVDAAVQAADNAFAEWGQTTPKARAECLLKLADSIEQNALE
FARLESQNCGKPLHCVINDEIPAIVDVFRFFAGAARCLSGLAAGEYLEGHTSMIRRDPIGVVASIAPWNYPLMMAAWKLA
PALAAGNCVVIKPSEITPLTALKLAVLAKDIFPPGVLNVLFGRGQTVGDVLTGHEKVRMVSLTGSIATGEHILRHTAPAI
KRTHMELGGKAPVIVFDDADLDAVAQGVRTFGFYNAGQDCTAACRIYAQRGIYDALVEKLGNAVSSLKMGAPEDESTELG
PLSSLAHLKRVTAAVEEAKALSHIRVITGGSQTEGKGYYFAPTLLADAKQEDAIVQREVFGPVVSITVFDDEDQVLRWAN
DSRYGLASSVWTQDVGRAHRLSARLQYGCTWINTHFMLVSEMPHGGQKQSGYGKDMSLYGLEDYTLVRHIMVKH
>P77357 3.5.1.-~~~abgA~~~p-aminobenzoyl-glutamate hydrolase subunit A~~~COG1473
MESLNQFVNSLAPKLSHWRRDFHHYAESGWVEFRTATLVAEELHQLGYSLALGREVVNESSRMGLPDEFTLQREFERARQ
QGALAQWIAAFEGGFTGIVATLDTGRPGPVMAFRVDMDALDLSEEQDVSHRPYRDGFASCNAGMMHACGHDGHTAIGLGL
AHTLKQFESGLHGVIKLIFQPAEEGTRGARAMVDAGVVDDVDYFTAVHIGTGVPAGTVVCGSDNFMATTKFDAHFTGTAA
HAGAKPEDGHNALLAAAQATLALHAIAPHSEGASRVNVGVMQAGSGRNVVPASALLKVETRGASDVINQYVFDRAQQAIQ
GAATMYGVGVETRLMGAATASSPSPQWVAWLQSQAAQVAGVNQAIERVEAPAGSEDATLMMARVQQHQGQASYVVFGTQL
AAGHHNEKFDFDEQVLAIAVETLARTALNFPWTRGI
>P76052 3.5.1.-~~~abgB~~~p-aminobenzoyl-glutamate hydrolase subunit B~~~COG1473
MQEIYRFIDDAIEADRQRYTDIADQIWDHPETRFEEFWSAEHLASALESAGFTVTRNVGNIPNAFIASFGQGKPVIALLG
EYDALAGLSQQAGCAQPTSVTPGENGHGCGHNLLGTAAFAAAIAVKKWLEQYGQGGTVRFYGCPGEEGGSGKTFMVREGV
FDDVDAALTWHPEAFAGMFNTRTLANIQASWRFKGIAAHAANSPHLGRSALDAVTLMTTGTNFLNEHIIEKARVHYAITN
SGGISPNVVQAQAEVLYLIRAPEMTDVQHIYDRVAKIAEGAALMTETTVECRFDKACSSYLPNRTLENAMYQALSHFGTP
EWNSEELAFAKQIQATLTSNDRQNSLNNIAATGGENGKVFALRHRETVLANEVAPYAATDNVLAASTDVGDVSWKLPVAQ
CFSPCFAVGTPLHTWQLVSQGRTSIAHKGMLLAAKTMAATTVNLFLDSGLLQECQQEHQQVTDTQPYHCPIPKNVTPSPL
K
>P46133 ~~~abgT~~~p-aminobenzoyl-glutamate transport protein~~~COG2978
MSMSSIPSSSQSGKLYGWVERIGNKVPHPFLLFIYLIIVLMVTTAILSAFGVSAKNPTDGTPVVVKNLLSVEGLHWFLPN
VIKNFSGFAPLGAILALVLGAGLAERVGLLPALMVKMASHVNARYASYMVLFIAFFSHISSDAALVIMPPMGALIFLAVG
RHPVAGLLAAIAGVGCGFTANLLIVTTDVLLSGISTEAAAAFNPQMHVSVIDNWYFMASSVVVLTIVGGLITDKIIEPRL
GQWQGNSDEKLQTLTESQRFGLRIAGVVSLLFIAAIALMVIPQNGILRDPINHTVMPSPFIKGIVPLIILFFFVVSLAYG
IATRTIRRQADLPHLMIEPMKEMAGFIVMVFPLAQFVAMFNWSNMGKFIAVGLTDILESSGLSGIPAFVGLALLSSFLCM
FIASGSAIWSILAPIFVPMFMLLGFHPAFAQILFRIADSSVLPLAPVSPFVPLFLGFLQRYKPDAKLGTYYSLVLPYPLI
FLVVWLLMLLAWYLVGLPIGPGIYPRLS
>P39758 ~~~abh~~~Putative transition state regulator Abh~~~COG2002
MKSIGVVRKVDELGRIVMPIELRRALDIAIKDSIEFFVDGDKIILKKYKPHGVCLMTGEITSENKEYGNGKITLSPEGAQ
LLLEEIQAALKE
>Q9ZJ19 3.1.-.-~~~abiQ~~~Endoribonuclease AbiQ~~~
MSSFFYKEILRMTLRFFTVTDEYIAYLRKFESKVHYQYENNASTYVGVVLKKNDFNYFIPLSSYKKGNPEKDKAMKKRSR
IVTRLFEIGNINNPLGYLLHHNMIPVPDSELIPLPLDLKKPKHKMMQKQLIYMKSISEKIENKSEVVYRKAAHEKDGYYL
KFSCDFKLLEAKATLYSKKSTFQ
>P52127 ~~~abpA~~~Anti-bacteriophage protein A~~~
MESNDSGGVAAKHGFLFQDCVAAYHVTRMLRDKTIRSVRCEVTDDIDIVSDGYIDFVQVKSTGKTRWNISDIVQNSKGAD
KKTIPCSSILHKSMQCESDLSLGRRYSIVTEEKVNKTLEYLTISPNARLDKPGRQELIDDLNKRTDNFLTDSGISVSDWI
DAATWEVFSSLRELELLGIKNIRLASQDLHGVILSSETVAEDIWCRILDTVTRKGEHSRRIHSADDKSYLRPDLLEWFKQ
RVEDDQSRSGRKIYVKRDLPHILTPFRAPMASVCAKRKGQVLHQQYSLKKYRYKHIADNVCQWLDEVFLRPKEMSDIHKL
TFIEKRERLKNSVFKSLHDVSEFLGRVLLHATIRQHHESQPIPCMLYVEKAGAEKILENVHIVRRDPEGDQLWIGFSELV
TDINIAVRLPEIRDQLYEDISDCIDTARKKILDIKDDNYLLRHDIDEILDGSQPFDAHLDRFTFVLFVGYDSNLLTEPET
PGFEDDLEKETAVLFEKFAADLIEDSPFANLCIHVFIYPAPSLERLTQLVDEKVREVV
>P52126 ~~~abpB~~~Anti-bacteriophage protein B~~~COG1204
MTEIYEQAKHSLQGEDFSSFNYLFAVNKLLSNPVSYDLGRDLIVRALDSRERFSEHTTILKNMVRKSGLFPYLKKEFTSL
TPDDLRVLELYRTPFSDGYVFHSMQFHIFDLLKSGQNVVLSAPTSMGKSAIVDSLLGMGTLKRLVLVVPTVALADETRRR
LQERFGDRYQIIHHSSQVCHSDQAVYVLTQERVNERDDIVDIDLFVIDEFYKLAFRQLKSGDIDHQDERVIELNIALSKL
LKVSRQFYLTGPFVNSIRGLEKLGYPHTFVSTDFNTVALDVKTFGIKANDDKAKLKALGEIAHACVDATIIYCKSPTVAG
LVARELIRLGHGTPTENPHVDWVSEEFDADWDYTVALRNGIGLHFGALPRALQQYTADQFNAGKLRFLLCTSTIIEGVNT
IAKNVVIYDNRDGTRSIDKFTHGNIKGRAGRMGVHFVGKIFCLEEIPEDNLNQEVDIPLGIQGIDTPINLLASVQPDHLS
EFSQDRFDEVFINDRVSIDLVKKHSYFRVEQFEMLQSMFEMMDDNEFSSLVFHWTPATNFLKTFAKIIARLVPHTFSRNG
VPVKPTDVMIAKLAGYLSAESYSEYLKNQIDYARQWISEGEKRTLSIALNNDLKLITNTFGYTLPKVLSLMEDVVKHHAV
KRGIRSKVDYTHVKLAFESFHLPPGVNALEEIGIPIQTLHRLVDLLEFSDEADVDELSQYLRDTQDIWSRSIGYVDQMFI
RRALGIRRH
>B2FKA7 ~~~~~~Actin-binding protein Smlt3054~~~COG0666
MEMDIQESLLRLLRPLGLQRAEALAGALAREAGASKGLHDSQVLARAHALSVAPVEGRLGDLVWQVRQREHDGAPQVDLR
WGLHRLGLDAPSRASTRDLVRAYERRLADRNEPMVYSTLAERVAGSMAEHTSLFQGMAMAVEEARARRSDANRLRENAPW
QGWLVGASRAGHEAALLACIGMGADARLPDASGNTPLHHAARFGHFSLVTPLVEAGADVAALNAHGWAPLHLAALHKHAR
ACLHLMAHGANPEQPGWRGRTPTRMHRHEQTQAL
>P08874 ~~~abrB~~~Transition state regulatory protein AbrB~~~COG2002
MFMKSTGIVRKVDELGRVVIPIELRRTLGIAEKDALEIYVDDEKIILKKYKPNMTCQVTGEVSDDNLKLAGGKLVLSKEG
AEQIISEIQNQLQNLK
>Q9RBG5 1.14.12.14~~~absAa~~~2-aminobenzenesulfonate 2,3-dioxygenase subunit alpha~~~
MSRSAAEFLKPQNVASTHYLDNRVYWDHEIFEEEKKRIFSKVWKFVCHVSEIPSTFDYRTIKVADTPLVVIRGKDEKVRT
FVNACSHRGIQIVRRPRGNAKTMECIFHRWNYDSTNGELTGAPRKEAYGPSNFDLKQCGLREVRTETYLGLVFVNLDDSA
VSLSEFIGDALEMEKDILGAEELEVFDYYEQVLDTNWKNWQETNLDLYHEFMHFANRKTGLTVKEYYQRAWKLYPNGHAA
IERYRAQYSNYAGWQDRDDGIRLPGLHPNEFQLVNLFPDLAINARGTVIRIDSQTPISPGKTLVQYRGLGLKRDSERERV
QRVRDYTSIWGPFGTNLAEDTLATSLHAKTIQTGSVPFTYLTRDEGGMTQDDLGLRTFYREWERLMSRQANQIR
>Q9RBG4 1.14.12.14~~~absAb~~~2-aminobenzenesulfonate 2,3-dioxygenase subunit beta~~~
MDTVSIAEFLYTNADLLNQEQFDSWLEQCSNDFSYRITTFSEELGRPMDWMDKDKSGLAHYLQNANNHERYTGRLRRHLA
MPRVTKQADSSFEVRTAVAIYVIEMNGETALYGIGSYVDNVMSESSGLRLTSRVVTLDTRRLQFGPHVPI
>A0A009IHW8 3.2.2.-~~~~~~2' cyclic ADP-D-ribose synthase AbTIR~~~
MSLEQKKGADIISKILQIQNSIGKTTSPSTLKTKLSEISRKEQENARIQSKLSDLQKKKIDIDNKLLKEKQNLIKEEILE
RKKLEVLTKKQQKDEIEHQKKLKREIDAIKASTQYITDVSISSYNNTIPETEPEYDLFISHASEDKEDFVRPLAETLQQL
GVNVWYDEFTLKVGDSLRQKIDSGLRNSKYGTVVLSTDFIKKDWTNYELDGLVAREMNGHKMILPIWHKITKNDVLDYSP
NLADKVALNTSVNSIEEIAHQLADVILNR
>P67603 3.5.1.135~~~yqfB~~~N(4)-acetylcytidine amidohydrolase~~~COG3097
MQPNDITFFQRFQDDILAGRKTITIRDESESHFKTGDVLRVGRFEDDGYFCTIEVTATSTVTLDTLTEKHAEQENMTLTE
LKKVIADIYPGQTQFYVIEFKCL
>P44172 3.5.1.135~~~~~~N(4)-acetylcytidine amidohydrolase~~~COG3097
MQPNDITFYQRFEADILAGHKTISIRDDSESHFKAGDILRVGRFEDNQYFCNIEVLSVSPITLDELTQPHAKQENMGLDE
LKEVIRGIYPNEIIFWVIQFSLKEYFNEEKCVREIIITDRKFN
>P0DTK9 ~~~~~~Anti-CBASS protein Acb1~~~
MISLDSLRSLVTGLGTSKDKGASASYYFQPLGPDELNAMHRGDWLAQKVIDIIPNDMTREWRNWQAKQPQIEKIEAVEKA
PLINLQVKVNLALKMARLHGGSVIYIGIKGTVDLSEPLDPRSIGRGDLAYLHVLSRYEVTCGETVTDVTSEFYGQPSYYE
VAGANGAPVQIHPSRVVRFVGAPVLDRRSVGNDPWGDSVLQAVYDAVRNAGSAQGHIAALIPEAKVDIIHMPGLGEFTKT
EAGRAKLTARFTYANTMKSMLNAVLLDGTGGAGKDAGGEKWEQKQISFAQLPELLQSYLSIASAAADIPATRMLSQSPKG
LNATGDSDMRNHYDNCTARQGTELSPALNRLDEVILRSALGSRPAAIYYEWAPLWGLTEKEQAEVFKMKADGARALAGAK
GGPLLPVNALSDALVNTFTEDGSLPGLEAAIEEHGTLADQPETESANENTEQPESGEEGEEGQPTRRAANDAKPRPLYVS
RKLINGREFLKWAKEQGFAKTLKADDLHVTITYSREAVDWMAMGQSWGGEDGKLTVPAGGARLVEPLGDEGAVVLLFNSS
ELAWRHMGMREAGASWDHPEYQPHVTITYEVGDVDLSQVEPYRGKLVFGPEVFEEIDECWAENLNET
>A0A494TJG7 ~~~~~~Anti-CBASS protein Acb1~~~
MSMVRSFFDGLTNVLSGAGTSVDKRVHARYGLNIVDQHQVEASYRTSWLARKIVDMPAHDMTREWRDWKADGELIGKIEA
EEKRLCLRERVTQAIVLGRLGGGAIYLGIKGDDPSQPLAVEHIRPGQLSYIAVFSRWQLTIGQEVSDPEDALFGGPDYFQ
ITSIANKVGVRIHPSRMVIFKGAHVMRGIGSQWEDAFWGDPIYQAVGDAIRNADSAQNSFASLIDEATYDVIGIPGLMER
LSQPGGDAQLSKRLDAARQGKSNHRAIILDSGEGGKDAETWVTRQVTWAGMPELMAAFLQTVAGASDIPYTRLLGTSATG
MSATGEGDKNDYLSSIATKQETMLRPNMVRIDAVMLRSAGIKDELWFDWSPLFEMGEKERAALDKLKADTAKVWGDSGLV
PIDALAKGAQNLLTEDGTYPGLDEELKKAEAIVAPDADVAPVVANPDDLGLTSEAKPKPLLKIVGDAAPRPLYVNRPLLN
AAEFIAWAKAQGFKTTTPADDLHVTILYSKTPVDWMKMGTDGWGSDAKGNLDVAPGGARIVEPLGDKGAVVLLFTSSSLS
WRHEEMVRNGASHDFDEYQSHVTISYDASDVDLSKVEPYRGVLKFGPEVFTEIVEDWKPTGGEA
>Q9ZAE9 4.2.3.152~~~acbC~~~2-epi-5-epi-valiolone synthase~~~COG0337
MSGVETVGVHADAHRDSWQVRAQKQITYEVRFRDDVFGLDSTDLLEAGADGAGSRRRFVVVDSAVDALYGSRIREYFTHH
GIDHSILVMRVGETVKDFDTAGRIVAAMDAFGLARRREPMIVVGGGVLMDVAGLVASLYRRGTPFLRVPTTLVGLIDAGV
GAKTGVNFNGHKNRLGTYAPADLTLLDRRFLATLDRRHLSNGLAEMLKIALIKDAELFQLLERHGRVLIEERFQGRTGTG
DRAAVRALRAATHGMLEELGPNLWESRLERSVDYGHTFSPTIEMRALPALLHGEAVCVDMALTTVLAYRRGLLDVAQRDR
IFAVMTALGLPTWHPLLTPEVLEAALQDTVRHRDGWQRLPLPVGIGGVTFVNDVTAAELQAAALMQHRLAEDALLLRA
>Q8RMD4 2.7.1.187~~~acbK~~~Acarbose 7(IV)-phosphotransferase~~~COG0524
MSEHTDVLVLGGAGVDTIAYVPELPLPFQDSYVVAAIEPRAGQTGDNVALGLHTLGLRTMHVDVLGDDPEGDLVRAFHTR
HGLPFAALPTAAGTKRAVNLVGPDGRRLSLWDGSREAEEDRYPAALIAAHTAHARHVHVCITPPGQHVFGQLNDLPVTVS
TDLHNWDGAYEGFEVYAFNADLVFLSATALTDVAATMRRVIDRGRARLVVATDGAHGGSVLVRGETEVRRYAAVAPEAPV
VDSNGAGDAFVSGFLFGHLAGEPLETCLRYGAIAGAYACTIPATRAGAIDRAALLRPAA
>Q8RIS8 2.7.1.188~~~acbM~~~2-epi-5-epi-valiolone 7-kinase~~~COG1940
MKRPPHHPVTVADVGGTHLRWARWSPDGGLGEVHTTPSPGHARRPGAGAADLQAELIRELASRVEPGARAGVSLGAAMDH
HSGTAYASAPLWGPQVSPFDVPAALRAARPDVHWTVVNDVTAGLLHLAEMVRDAGVRKACLVTISTGIACRTMDLRTGGI
PVDAAGLQGEIGHLPATVLADGVPVVTRCDCGEPGHVAASSSGPGIRRVAAVLARRDPATWAGSGPTTRMMAGSGFEDAF
RAALDDGDPVAADLLTAVTAPIADLLRTALCLDPELDLIALTGGVAHGLEPHYSAAVHDHLRRRGLYLTSEREPDWLTGR
IRVVPPATADPLVGAGLAALAAGPVPAYSGGGREALVGR
>Q8RMD1 5.1.3.35~~~acbO~~~2-epi-5-epi-valiolone 7-phosphate 2-epimerase~~~COG1082
MTCRVGLTEWRLAPSGAAAIRLAAAVGADGIQLDFGGPGRGVLVDGPGRAGQLRAVADEAGVDLLALAGNLLNDIGLTSQ
PAVVQPVLARLADTATELGVPLLIVPSFRRSAITDAMSFTRTAAALRWAVSLAEARGIVLASENVLPPARARQLVEEVGS
PAFRLLLDTFNPVRYGLDPAWLATELRPWWADQIHLKDGPPDTGPSPLLGAGQGGVRRTLTALRGSPAPVRALVLENDYR
DGHGARLRADLEWARRAAVNARESEKGKLT
>P9WPQ3 ~~~accA1~~~Biotin-dependent 3-methylcrotonyl-coenzyme A carboxylase alpha1 subunit~~~COG4770
MFDTVLVANRGEIAVRVIRTLRRLGIRSVAVYSDPDVDARHVLEADAAVRLGPAPARESYLDIGKVLDAAARTGAQAIHP
GYGFLAENADFAAACERARVVFLGPPARAIEVMGDKIAAKNAVAAFDVPVVPGVARAGLTDDALVTAAAEVGYPVLIKPS
AGGGGKGMRLVQDPARLPEALVSARREAMSSFGDDTLFLERFVLRPRHIEVQVLADAHGNVVHLGERECSLQRRHQKVIE
EAPSPLLDPQTRERIGVAACNTARCVDYVGAGTVEFIVSAQRPDEFFFMEMNTRLQVEHPVTEAITGLDLVEWQLRVGAG
EKLGFAQNDIELRGHAIEARVYAEDPAREFLPTGGRVLAVFEPAGPGVRVDSSLLGGTVVGSDYDPLLTKVIAHGADREE
ALDRLDQALARTAVLGVQTNVEFLRFLLADERVRVGDLDTAVLDERSADFTARPAPDDVLAAGGLYRQWALARRAQGDLW
AAPSGWRGGGHMAPVRTAMRTPLRSETVSVWGPPESAQVQVGDGEIDCASVQVTREQMSVTISGLRRDYRWAEADRHLWI
ADERGTWHLREAEEHKIHRAVGARPAEVVSPMPGSVIAVQVESGSQISAGDVVVVVEAMKMEHSLEAPVSGRVQVLVSVG
DQVKVEQVLARIKD
>P96890 ~~~accA3~~~Biotin-dependent acyl-coenzyme A carboxylase alpha3 subunit~~~COG4770
MASHAGSRIARISKVLVANRGEIAVRVIRAARDAGLPSVAVYAEPDAESPHVRLADEAFALGGQTSAESYLDFAKILDAA
AKSGANAIHPGYGFLAENADFAQAVIDAGLIWIGPSPQSIRDLGDKVTARHIAARAQAPLVPGTPDPVKGADEVVAFAEE
YGLPIAIKAAHGGGGKGMKVARTIDEIPELYESAVREATAAFGRGECYVERYLDKPRHVEAQVIADQHGNVVVAGTRDCS
LQRRYQKLVEEAPAPFLTDFQRKEIHDSAKRICKEAHYHGAGTVEYLVGQDGLISFLEVNTRLQVEHPVTEETAGIDLVL
QQFRIANGEKLDITEDPTPRGHAIEFRINGEDAGRNFLPAPGPVTKFHPPSGPGVRVDSGVETGSVIGGQFDSMLAKLIV
HGADRAEALARARRALNEFGVEGLATVIPFHRAVVSDPAFIGDANGFSVHTRWIETEWNNTIEPFTDGEPLDEDARPRQK
VVVEIDGRRVEVSLPADLALSNGGGCDPVGVIRRKPKPRKRGAHTGAAASGDAVTAPMQGTVVKFAVEEGQEVVAGDLVV
VLEAMKMENPVTAHKDGTITGLAVEAGAAITQGTVLAEIK
>O34847 2.1.3.15~~~accA~~~Acetyl-coenzyme A carboxylase carboxyl transferase subunit alpha~~~COG0825
MAPRLEFEKPVIELQTKIAELKKFTQDSDMDLSAEIERLEDRLAKLQDDIYKNLKPWDRVQIARLADRPTTLDYIEHLFT
DFFECHGDRAYGDDEAIVGGIAKFHGLPVTVIGHQRGKDTKENLVRNFGMPHPEGYRKALRLMKQADKFNRPIICFIDTK
GAYPGRAAEERGQSEAIAKNLFEMAGLRVPVICIVIGEGGSGGALGLGVGNHLHMLENSTYSVISPEGAAALLWKDSSLA
KKAAETMKITAPDLKELGIIDHMIKEVKGGAHHDVKLQASYMDETLKQSLKTLLKLSEEELVQQRYEKYKAIGKVSVEDQ
YIGVN
>P0ABD5 2.1.3.15~~~accA~~~Acetyl-coenzyme A carboxylase carboxyl transferase subunit alpha~~~COG0825
MSLNFLDFEQPIAELEAKIDSLTAVSRQDEKLDINIDEEVHRLREKSVELTRKIFADLGAWQIAQLARHPQRPYTLDYVR
LAFDEFDELAGDRAYADDKAIVGGIARLDGRPVMIIGHQKGRETKEKIRRNFGMPAPEGYRKALRLMQMAERFKMPIITF
IDTPGAYPGVGAEERGQSEAIARNLREMSRLGVPVVCTVIGEGGSGGALAIGVGDKVNMLQYSTYSVISPEGCASILWKS
ADKAPLAAEAMGIIAPRLKELKLIDSIIPEPLGGAHRNPEAMAASLKAQLLADLADLDVLSTEDLKNRRYQRLMSYGYA
>Q9HXZ2 2.1.3.15~~~accA~~~Acetyl-coenzyme A carboxylase carboxyl transferase subunit alpha~~~
MNPNFLDFEQPIADLQAKIEELRLVGNDNALNISDEISRLQDKSKALTENIFGNLSSWQIAQLARHPKRPYTLDYIGYLF
SDFEELHGDRHFADDPAIVGGVARLDGSPVMVIGHQKGREVREKVRRNFGMPRPEGYRKACRLMEMAERFKMPILTFIDT
PGAYPGIDAEERGQSEAIAWNLRVMARLKTPIIATVIGEGGSGGALAIGVCDQLNMLQYSTYSVISPEGCASILWKTAEK
APEAAEAMGITAERLKGLGIVDKVIDEPLGGAHRDPASMAESIRGELLAQLKMLQGLEMGELLERRYDRLMSYGAP
>P0A1C3 2.1.3.15~~~accA~~~Acetyl-coenzyme A carboxylase carboxyl transferase subunit alpha~~~
MSLNFLDFEQPIAELEAKIDSLTAVSRQDEKLDINIDEEVHRLREKSVELTRKIFADLGAWQVAQLARHPQRPYTLDYVR
LAFDEFDELAGDRAYADDKAIVGGIARLEGRPVMIIGHQKGRETKEKIRRNFGMPAPEGYRKALRLMEMAERFNMPIITF
IDTPGAYPGVGAEERGQSEAIARNLREMSRLNVPVICTVIGEGGSGGALAIGVGDKVNMLQYSTYSVISPEGCASILWKS
ADKAPLAAEAMGIIAPRLKELKLIDSIIPEPLGGAHRNPEAMAASLKAQLLEDLADLDVLSTDDLKNRRYQRLMSYGYA
>Q2FG38 2.1.3.15~~~accA~~~Acetyl-coenzyme A carboxylase carboxyl transferase subunit alpha~~~
MLDFEKPLFEIRNKIESLKESQDKNDVDLQEEIDMLEASLERETKKIYTNLKPWDRVQIARLQERPTTLDYIPYIFDSFM
ELHGDRNFRDDPAMIGGIGFLNGRAVTVIGQQRGKDTKDNIYRNFGMAHPEGYRKALRLMKQAEKFNRPIFTFIDTKGAY
PGKAAEERGQSESIATNLIEMASLKVPVIAIVIGEGGSGGALGIGIANKVLMLENSTYSVISPEGAAALLWKDSNLAKIA
AETMKITAHDIKQLGIIDDVISEPLGGAHKDIEQQALAIKSAFVAQLDSLESLSRDEIANDRFEKFRNIGSYIE
>Q2FXM7 2.1.3.15~~~accA~~~Acetyl-coenzyme A carboxylase carboxyl transferase subunit alpha~~~COG0825
MLDFEKPLFEIRNKIESLKESQDKNDVDLQEEIDMLEASLERETKKIYTNLKPWDRVQIARLQERPTTLDYIPYIFDSFM
ELHGDRNFRDDPAMIGGIGFLNGRAVTVIGQQRGKDTKDNIYRNFGMAHPEGYRKALRLMKQAEKFNRPIFTFIDTKGAY
PGKAAEERGQSESIATNLIEMASLKVPVIAIVIGEGGSGGALGIGIANKVLMLENSTYSVISPEGAAALLWKDSNLAKIA
AETMKITAHDIKQLGIIDDVISEPLGGAHKDIEQQALAIKSAFVAQLDSLESLSRDEIANDRFEKFRNIGSYIE
>Q7A558 2.1.3.15~~~accA~~~Acetyl-coenzyme A carboxylase carboxyl transferase subunit alpha~~~
MLDFEKPLFEIRNKIESLKESQDKNDVDLQEEIDMLEASLERETKKIYTNLKPWDRVQIARLQERPTTLDYIPYIFDSFM
ELHGDRNFRDDPAMIGGIGFLNGRAVTVIGQQRGKDTKDNIYRNFGMAHPEGYRKALRLMKQAEKFNRPIFTFIDTKGAY
PGKAAEERGQSESIATNLIEMASLKVPVIAIVIGEGGSGGALGIGIANKVLMLENSTYSVISPEGAAALLWKDSNLAKIA
AETMKITAHDIKQLGIIDDVISEPLGGAHKDVEQQALAIKSAFVAQLDSLESLSRDEIANDRFEKFRNIGSYIE
>Q6GG07 2.1.3.15~~~accA~~~Acetyl-coenzyme A carboxylase carboxyl transferase subunit alpha~~~
MLDFEKPLFEIRNKIESLKESQDKNDVDLQEEIDMLEASLERETKKIYTNLKPWDRVQIARLQERPTTLDYIPYIFDSFM
ELHGDRNFRDDPAMIGGIGFLNGRAVTVIGQQRGKDTKDNIYRNFGMAHPEGYRKALRLMKQAEKFNRPIFTFIDTKGAY
PGKAAEERGQSESIATNLIEMASLKVPVIAIVIGEGGSGGALGIGIANKVLMLENSTYSVISPEGAAALLWKDSNLAKIA
AETMKITAHDIKQLGIIDDVISEPLGGAHKDIEQQALAIKSAFVEQLDSLESLSRDEIANDRFEKFRNIGSYIE
>Q9FBB7 2.1.3.15~~~accA~~~Acetyl-coenzyme A carboxylase carboxyl transferase subunit alpha~~~COG0825
MNIAKIVREAREQSRLTTLDFATGIFDEFIQLHGDRSFRDDGAVVGGIGWLGDQAVTVVGIQKGKSLQDNLKRNFGQPHP
EGYRKALRLMKQAEKFGRPVVTFINTAGAYPGVGAEERGQGEAIARNLMEMSDLKVPIIAIIIGEGGSGGALALAVADRV
WMLENSIYAILSPEGFASILWKDGTRAMEAAELMKITSHELLEMDVVDKVISEIGLSSKELIKSVKKELQTELARLSQKP
LEELLEERYQRFRKY
>P24182 6.3.4.14~~~accC~~~Biotin carboxylase~~~COG0439
MLDKIVIANRGEIALRILRACKELGIKTVAVHSSADRDLKHVLLADETVCIGPAPSVKSYLNIPAIISAAEITGAVAIHP
GYGFLSENANFAEQVERSGFIFIGPKAETIRLMGDKVSAIAAMKKAGVPCVPGSDGPLGDDMDKNRAIAKRIGYPVIIKA
SGGGGGRGMRVVRGDAELAQSISMTRAEAKAAFSNDMVYMEKYLENPRHVEIQVLADGQGNAIYLAERDCSMQRRHQKVV
EEAPAPGITPELRRYIGERCAKACVDIGYRGAGTFEFLFENGEFYFIEMNTRIQVEHPVTEMITGVDLIKEQLRIAAGQP
LSIKQEEVHVRGHAVECRINAEDPNTFLPSPGKITRFHAPGGFGVRWESHIYAGYTVPPYYDSMIGKLICYGENRDVAIA
RMKNALQELIIDGIKTNVDLQIRIMNDENFQHGGTNIHYLEKKLGLQEK
>P43873 6.3.4.14~~~accC~~~Biotin carboxylase~~~COG0439
MLEKVVIANRGEIALRILRACKELGIKTVAVHSTADRDLKHVLLADETICIGPAPSAKSYLNIPAIIAAAEVTGADAIHP
GYGFLSENADFAEQVERSGFTFIGPTADVIRLMGDKVSAIKAMKKAGVPCVPGSDGPVSNDIAKNKEIAKRIGYPIIIKA
SGGGGGRGMRVVRSEDALEESIAMTKAEAKAAFNNDMVYMEKYLENPRHVEIQVLADTHGNAVYLAERDCSMQRRHQKVV
EEAPAPGITEEVRRDIGSRCANACVEIGYRGAGTFEFLYENGEFYFIEMNTRIQVEHPVTEMITGVDLVKEQLRIAAGLP
ISFKQEDIKVKGHAMECRINAEDPKTFLPSPGKVNHLHSPGGLGVRWDSHVYGGYTVPPHYDSMIAKLITYGDTREVAIR
RMQNALSETIIDGIKTNIPLHELILEDENFQKGGTNIHYLEKKLGMNE
>P37798 6.3.4.14~~~accC~~~Biotin carboxylase~~~
MLEKVLIANRGEIALRILRACKELGIKTVAVHSTADRELMHLSLADESVCIGPAPATQSYLQIPAIIAAAEVTGATAIHP
GYGFLAENADFAEQIERSGFTFVGPTAEVIRLMGDKVSAKDAMKRAGVPTVPGSDGPLPEDEETALAIAREVGYPVIIKA
AGGGGGRGMRVVYDESELIKSAKLTRTEAGAAFGNPMVYLEKFLTNPRHVEVQVLSDGQGNAIHLGDRDCSLQRRHQKVI
EEAPAPGIDEKARQEVFARCVQACIEIGYRGAGTFEFLYENGRFYFIEMNTRVQVEHPVSEMVTGVDIVKEMLRIASGEK
LSIRQEDVVIRGHALECRINAEDPKTFMPSPGKVKHFHAPGGNGVRVDSHLYSGYSVPPNYDSLVGKVITYGADRDEALA
RMRNALDELIVDGIKTNTELHKDLVRDAAFCKGGVNIHYLEKKLGMDKH
>I6YDK7 2.1.3.-~~~accD1~~~Biotin-dependent 3-methylcrotonyl-coenzyme A carboxylase beta1 subunit~~~COG4799
MTTPSIAIAPSFADEHRRLVAELNNKLAAAALGGNERARKRHVSRGKLLPRERVDRLLDPGSPFLELAPLAAGGMYGDES
PGAGIITGIGRVSGRQCVIVANDATVKGGTYYPMTVKKHLRAQEVALQNMLPCIYLVDSGGAFLPRQDEVFPDREHFGRI
FYNQATMSAKGIPQVAAVLGSCTAGGAYVPAMSDEAVIVREQGTIFLGGPPLVKAATGEIVSAEELGGGDLHSRTSGVTD
HLADDDEDALRIVRAIADTFGPCEPAQWDVRRSVEPKYPQAELYDVVPPDPRVPYDVHEVVVRIVDGSEFSEFKAKYGKT
LVTAFARVHGHPVGIVANNGVLFSESALKGAHFIELCDKRKIPLLFLQNIAGFMVGRDYEAGGIAKHGAKMVTAVACARV
PKLTVVIGGSYGAGNYSMCGRAYSPRFLWMWPNARISVMGGEQAASVLATVRGEQLSAAGTPWSPDEEEAFKAPIRAQYE
DQGNPYYSTARLWDDGIIDPADTRTVVGLALSLCAHAPLDQVGYGVFRM
>O86318 2.1.3.-~~~accD2~~~Probable biotin-dependent acyl-coenzyme A carboxylase beta2 subunit~~~COG4799
MLQSTLDPNASAYDEAAATMSGKLDEINAELAKALAGGGPKYVDRHHARGNLTPRERIELLVDPDSPFLELSPLAAYGSN
FQIGASLVTGIGAVCGVECMIVANDPTVKGGTSNPWTLRKILRANQIAFENRLPVISLVESGGADLPTQKEIFIPGGQMF
RDLTRLSAAGIPTIALVFGNSTAGGAYVPGMSDHVVMIKERSKVFLAGPPLVKMATGEESDDESLGGAEMHARISGLADY
FALDELDAIRIGRRIVARLNWIKQGPAPAPVTEPLFDAEELIGIVPPDLRIPFDPREVIARIVDGSEFDEFKPLYGSSLV
TGWARLHGYPLGILANARGVLFSEESQKATQFIQLANRADTPLLFLHNTTGYMVGKDYEEGGMIKHGSMMINAVSNSTVP
HISLLIGASYGAGHYGMCGRAYDPRFLFAWPSAKSAVMGGAQLSGVLSIVARAAAEARGQQVDEAADAAMRAAVEGQIEA
ESLPLVLSGMLYDDGVIDPRDTRTVLGMCLSAIANGPIKGTSNFGVFRM
>P9WQH9 2.1.3.-~~~accD3~~~Probable biotin-dependent acyl-coenzyme A carboxylase beta3 subunit~~~COG0777
MSRITTDQLRHAVLDRGSFVSWDSEPLAVPVADSYARELAAARAATGADESVQTGEGRVFGRRVAVVACEFDFLGGSIGV
AAAERITAAVERATAERLPLLASPSSGGTRMQEGTVAFLQMVKIAAAIQLHNQARLPYLVYLRHPTTGGVFASWGSLGHL
TVAEPGALIGFLGPRVYELLYGDPFPSGVQTAENLRRHGIIDGVVALDRLRPMLDRALTVLIDAPEPLPAPQTPAPVPDV
PTWDSVVASRRPDRPGVRQLLRHGATDRVLLSGTDQGEAATTLLALARFGGQPTVVLGQQRAVGGGGSTVGPAALREARR
GMALAAELCLPLVLVIDAAGPALSAAAEQGGLAGQIAHCLAELVTLDTPTVSILLGQGSGGPALAMLPADRVLAALHGWL
APLPPEGASAIVFRDTAHAAELAAAQGIRSADLLKSGIVDTIVPEYPDAADEPIEFALRLSNAIAAEVHALRKIPAPERL
ATRLQRYRRIGLPRD
>O53578 2.1.3.-~~~accD4~~~Biotin-dependent long chain acyl-coenzyme A carboxylase beta4 subunit~~~COG4799
MTVTEPVLHTTAEKLAELRERLELAKEPGGEKAAAKRDKKGIPSARARIYELVDPGSFMEIGALCRTPGDPNALYGDGVV
TGHGLINGRPVGVFSHDQTVFGGTVGEMFGRKVARLMEWCAMVGCPIVGINDSGGARIQDAVTSLAWYAELGRRHELLSG
LVPQISIILGKCAGGAVYSPIQTDLVVAVRDQGYMFVTGPDVIKDVTGEDVSLDELGGADHQASYGNIHQVVESEAAAYQ
YVRDFLSFLPSNCFDKPPVVNPGLEPEITGHDLELDSIVPDSDNMAYDMHEVLLRIFDDGDFLDVAAQAGQAIITGYARV
DGRTVGVVANQPMHMSGAIDNEASDKAARFIRFSDAFDIPLVFVVDTPGFLPGVEQEKNGIIKRGGRFLYAVVEADVPKV
TITIRKSYGGAYAVMGSKQLTADLNFAWPTARIAVIGADGAAQLLMKRFPDPNAPEAQAIRKSFVENYNLNMAIPWIAAE
RGFIDAVIDPHETRLLLRKSMHLLRDKQLWWRVGRKHGLIPV
>P9WQH7 2.1.3.15~~~accD5~~~Biotin-dependent acetyl-/propionyl-coenzyme A carboxylase beta5 subunit~~~COG4799
MTSVTDRSAHSAERSTEHTIDIHTTAGKLAELHKRREESLHPVGEDAVEKVHAKGKLTARERIYALLDEDSFVELDALAK
HRSTNFNLGEKRPLGDGVVTGYGTIDGRDVCIFSQDATVFGGSLGEVYGEKIVKVQELAIKTGRPLIGINDGAGARIQEG
VVSLGLYSRIFRNNILASGVIPQISLIMGAAAGGHVYSPALTDFVIMVDQTSQMFITGPDVIKTVTGEEVTMEELGGAHT
HMAKSGTAHYAASGEQDAFDYVRELLSYLPPNNSTDAPRYQAAAPTGPIEENLTDEDLELDTLIPDSPNQPYDMHEVITR
LLDDEFLEIQAGYAQNIVVGFGRIDGRPVGIVANQPTHFAGCLDINASEKAARFVRTCDCFNIPIVMLVDVPGFLPGTDQ
EYNGIIRRGAKLLYAYGEATVPKITVITRKAYGGAYCVMGSKDMGCDVNLAWPTAQIAVMGASGAVGFVYRQQLAEAAAN
GEDIDKLRLRLQQEYEDTLVNPYVAAERGYVDAVIPPSHTRGYIGTALRLLERKIAQLPPKKHGNVPL
>P9WQH5 2.1.3.15~~~accD6~~~Biotin-dependent acetyl-/propionyl-coenzyme A carboxylase beta6 subunit~~~COG4799
MTIMAPEAVGESLDPRDPLLRLSNFFDDGSVELLHERDRSGVLAAAGTVNGVRTIAFCTDGTVMGGAMGVEGCTHIVNAY
DTAIEDQSPIVGIWHSGGARLAEGVRALHAVGQVFEAMIRASGYIPQISVVVGFAAGGAAYGPALTDVVVMAPESRVFVT
GPDVVRSVTGEDVDMASLGGPETHHKKSGVCHIVADDELDAYDRGRRLVGLFCQQGHFDRSKAEAGDTDIHALLPESSRR
AYDVRPIVTAILDADTPFDEFQANWAPSMVVGLGRLSGRTVGVLANNPLRLGGCLNSESAEKAARFVRLCDAFGIPLVVV
VDVPGYLPGVDQEWGGVVRRGAKLLHAFGECTVPRVTLVTRKTYGGAYIAMNSRSLNATKVFAWPDAEVAVMGAKAAVGI
LHKKKLAAAPEHEREALHDQLAAEHERIAGGVDSALDIGVVDEKIDPAHTRSKLTEALAQAPARRGRHKNIPL
>C0SP93 2.1.3.15~~~accD~~~Acetyl-coenzyme A carboxylase carboxyl transferase subunit beta~~~COG0777
MLKDIFTKKKKYASVPSDQAKHDVPEGIMTKCPKCKKIMLTKELDKNMRVCMNCDYHFPMNAKQRIESLMDEQSFEEFNQ
GMLSENPLGFPGYLEKLEKDREKTSLNEAVVTGKGTIGGHPAVVAVMDSSFRMGSMGSVVGEKITLAIEKAKADKVPFII
FTASGGARMQEGVLSLMQMAKTSSALKLFSEEQGLIISVMTHPTTGGVSASFASLGDYNFAEPGALIGFAGRRIIEQTIG
EKLPEDFQTAEFLLKHGQLDAVIHRDDMKKTLENLLDMHQTGGDIEWLQD
>P0A9Q5 2.1.3.15~~~accD~~~Acetyl-coenzyme A carboxylase carboxyl transferase subunit beta~~~COG0777
MSWIERIKSNITPTRKASIPEGVWTKCDSCGQVLYRAELERNLEVCPKCDHHMRMTARNRLHSLLDEGSLVELGSELEPK
DVLKFRDSKKYKDRLASAQKETGEKDALVVMKGTLYGMPVVAAAFEFAFMGGSMGSVVGARFVRAVEQALEDNCPLICFS
ASGGARMQEALMSLMQMAKTSAALAKMQERGLPYISVLTDPTMGGVSASFAMLGDLNIAEPKALIGFAGPRVIEQTVREK
LPPGFQRSEFLIEKGAIDMIVRRPEMRLKLASILAKLMNLPAPNPEAPREGVVVPPVPDQEPEA
>Q9HZA7 2.1.3.15~~~accD~~~Acetyl-coenzyme A carboxylase carboxyl transferase subunit beta~~~
MSNWLVDKLIPSIMRSESQKSSVPEGLWHKCPSCEAVLYRPELEKTLDVCPKCDHHMRINARTRLDIFLDEDGREELGAD
LEPVDRLKFRDSKKYKDRLAAAQKDTGEKDALIAMSGKLQGMPVVACAFEFSFMGGSMGAIVGERFVRAANVALEKRCPL
ICFSASGGARMQEALISLMQMAKTSAVLARLREEGIPFVSVLTDPVYGGVSASLAMLGDVIVGEPKALIGFAGPRVIEQT
VREKLPEGFQRSEFLLEHGAIDMIVHRAELRPRLANLLSAFTHSPSPVSA
>Q2FXM6 2.1.3.15~~~accD~~~Acetyl-coenzyme A carboxylase carboxyl transferase subunit beta~~~COG0777
MFKDFFNRTKKKKYLTVQDSKNNDVPAGIMTKCPKCKKIMYTKELAENLNVCFNCDHHIALTAYKRIEAISDEGSFTEFD
KGMTSANPLDFPSYLEKIEKDQQKTGLKEAVVTGTAQLDGMKFGVAVMDSRFRMGSMGSVIGEKICRIIDYCTENRLPFI
LFSASGGARMQEGIISLMQMGKTSVSLKRHSDAGLLYISYLTHPTTGGVSASFASVGDINLSEPKALIGFAGRRVIEQTI
NEKLPDDFQTAEFLLEHGQLDKVVHRNDMRQTLSEILKIHQEVTK
>Q5HF73 2.1.3.15~~~accD~~~Acetyl-coenzyme A carboxylase carboxyl transferase subunit beta~~~
MFKDFFNRTKKKKYLTVQDSKNNDVPAGIMTKCPKCKKIMYTKELAENLNVCFNCDHHIALTAYKRIEAISDEGSFTEFD
KGMTSANPLDFPSYLEKIEKDQQKTGLKEAVVTGTAQLDGMKFGVAVMDSRFRMGSMGSVIGEKICRIIDYCTENRLPFI
LFSASGGARMQEGIISLMQMGKTSVSLKRHSDAGLLYISYLTHPTTGGVSASFASVGDINLSEPKALIGFAGRRVIEQTI
NEKLPDDFQTAEFLLEHGQLDKVVHRNDMRQTLSEILKIHQEVTK
>Q7A557 2.1.3.15~~~accD~~~Acetyl-coenzyme A carboxylase carboxyl transferase subunit beta~~~
MFKDFFNRTKKKKYLTVQDSKNNDVPAGIMTKCPKCKKIMYTKELAENLNVCFNCDHHIALTAYKRIEAISDEGSFTEFD
KGMTSANPLDFPSYLEKIEKDQQKTGLKEAVVTGTAQLDGMKFGVAVMDSRFRMGSMGSVIGEKICRIIDYCTENRLPFI
LFSASGGARMQEGIISLMQMGKTSVSLKRHSDAGLLYISYLTHPTTGGVSASFASVGDINLSEPKALIGFAGRRVIEQTI
NEKLPDDFQTAEFLLEHGQLDKVVHRNDMRQTLSEILKIHQEVTK
>P0CC08 2.1.3.15~~~accD~~~Acetyl-coenzyme A carboxylase carboxyl transferase subunit beta~~~COG0777
MALFSKKDKYIRINPNRSVREKPQAKPEVPDELFSQCPGCKHTIYQKDLGSERICPHCSYTFRISAQERLALTIDMGTFK
ELFTGIESKDPLHFPGYQKKLASMREKTGLHEAVVTGTALIKGQTVALGIMDSNFIMASMGTVVGEKITRLFEYATVEKL
PVVLFTASGGARMQEGIMSLMQMAKISAAVKRHSNAGLFYLTILTDPTTGGVTASFAMEGDIILAEPQSLVGFAGRRVIE
NTVRESLPEDFQKAEFLLEHGFVDAIVKRRDLPDTIASLVRLHGGSPR
>P96886 ~~~accE5~~~Biotin-dependent acetyl-/propionyl-coenzyme A carboxylase epsilon subunit~~~
MGTCPCESSERNEPVSRVSGTNEVSDGNETNNPAEVSDGNETNNPAEVSDGNETNNPAPVSRVSGTNEVSDGNETNNPAP
VSRVSGTNEVSDGNETNNPAPVTEKPLHPHEPHIEILRGQPTDQELAALIAVLGSISGSTPPAQPEPTRWGLPVDQLRYP
VFSWQRITLQEMTHMRR
>P95280 1.3.99.-~~~~~~Putative acyl-CoA dehydrogenase FadE17~~~COG1960
MDVSYPPEAEAFRDRIREFVAEHLPPGWPGPGALPPHEREEFARHWRRALAGAGLVAVSWPTEYGGGGLSPMEQVVLAEE
FARAGAPERAENDLLGIDLLGNTLIALGSEAQKRHFLPRILSGEHRWCQGFSEPEAGSDLASVRTRGVLDGDEWVINGHK
IWTSAGTTANWIFLLARTDPSAAKHRGLSFLLVPMDQPGVVVRPIVNAAGHSSFSEVFLTDARTSAGNVVGRVGDGWSTA
MTLLGFERGSHIATAAIDFERDLQRLCELARDRGLHTDPRVRDGLAWCYARVQIMRYRGYRDLTLALTGRPPGAEAAITK
VIWSEYFRRYTDLAVEILGLEALGPRGPGNGGARLVPEAGTPNSPACWMDELLYARAATIYAGSSQIQRNVIGERLLGLP
KEPRPEVLC
>P96831 1.3.99.-~~~fadE2~~~Probable acyl-CoA dehydrogenase FadE2~~~COG1960
MDFAMSAKAIDYRTRLSDFMTEHVFGAEADYDDYRRAAGPADHTAPPIIEELKTKAKDRGLWNLFLSAESGLTNLEYAPL
AEMTGWSMEIAPEALNCAAPDTGNMEILHMFGTEQQRAQWLRPLLDGKIRSAFSMTEPAVASSDARNIETTISRDGADYV
INGRKWWTSGAADPRCKILIVMGRTNPDAAAHQQQSMVLVPIDTPGVTIVRSTPVFGWQDRHGHCEIDYHNVRVPATNLL
GEEGSGFAIAQARLGPGRIHHCMRALGAAERALALMVNRVRNRVAFGRPLAEQGVVQQAIAQSRNEIDQARLLCEKAAWT
IDQHGNKEARHLVAMIKAVAPRVACDVIDRAIQVHGAAGVSDDTPLARLYGWHRAMRIFDGPDEVHLRSIARAELSREKS
TFAAAVT
>Q8P8J5 3.5.1.-~~~argE'~~~N-acetyl-L-citrulline deacetylase~~~COG0624
MTDLLASTLEHLETLVSFDTRNPPRAIAAEGGIFDYLRAQLPGFQVEVIDHGDGAVSLYAVRGTPKYLFNVHLDTVPDSP
HWSADPHVMRRTEDRVIGLGVCDIKGAAAALVAAANAGDGDAAFLFSSDEEANDPRCIAAFLARGLPYDAVLVAEPTMSE
AVLAHRGISSVLMRFAGRAGHASGKQDPAASALHQAMRWGGKALDHVESLAHARFGGLTGLRFNIGRVDGGIKANMIAPA
AELRFGFRPLPSMDVDGLLATFAGFADPAAAHFEETFRGPSLPSGDIARAEERRLAARDVADALDLPIGNAVDFWTEASL
FSAGGYTALVYGPGDIAQAHTADEFVTLAQLQRYVESVNRIINGSH
>P45867 1.3.99.-~~~acdA~~~Acyl-CoA dehydrogenase~~~COG1960
MNFSLSEEHEMIRKLVRDFAKHEVAPTAAERDEQERFDRELFREMANLGLTGIPWPEDYGGIGSDYLAYVIAVEELSKVC
ASTGVTLSAHISLCSWPLFAFGTEEQKTEYLTQLALGEKIGAFALTEAGSGSDAGSMKTTAERIGDDYVLNGSKVFITNG
GVADIYIVFAVTDPEKKKKGVTAFIVEKDFEGFFTGKKEKKLGIRSSPTTEIMFEDCVVPASKRLGEEGEGFKIAMKTLD
GGRNGIAAQAVGIAQGALDAALQYAKERKQFGKSIAEQQGIAFKLADMATMIEASRLLTYQAAWLESSGLPYGKASAMSK
LMAGDTAMKVTTEAVQIFGGYGYTKDYPVERYMRDAKITQIYEGTQEIQRLVISRMLAD
>J7TF92 1.3.8.15~~~acdA~~~3-(aryl)acrylate reductase~~~
MFFTEQHELIRKLARDFAEQEIEPIADEVDKTAEFPKEIVKKMAQNGFFGIKMPKEYGGAGADNRAYVTIMEEISRASGV
AGIYLSSPNSLLGTPFLLVGTDEQKEKYLKPMIRGEKTLAFALTEPGAGSDAGAVATTAREEGDYYILNGRKTFITGAPI
SDNIIVFAKTDMSKGTKGITTFIVDSKQEGVSFGKPEDKMGMIGCPTSDIILENVKVHKSDILGELNKGFITAMKTLSVG
RIGVAAQALGIAQAAVDEAVKYAKQRKQFNRPIAKFQAIQFKLANMETKLNAAKLLVYNAAYKMDCGEKADKEASMAKYF
AAESAIQIVNDALQIHGGYGYIKDYKIERLYRDVRVIAIYEGTSEVQQMVIASNLLK
>P45857 1.3.99.-~~~mmgC~~~Acyl-CoA dehydrogenase~~~COG1960
MHVTQEQVMMRKMVRDFARKEIAPAAEIMEKTDEFPFQLIKKMGKHGLMGIPVPEQYGGAGADVVSYILAIHEISRISAA
VGVILSVHTSVGTNPILYFGNEEQKMKYIPNLASGDHLGAFALTEPHSGSDAGSLRTTAIKKNGKYLLNGSKIFITNGGA
ADIYITFALTAPDQGRHGISAFIVEKNTPGFTVGKKERKLGLYGSNTTELIFDNAEVPEANLLGKEGDGFHIAMANLNVG
RIGIAAQALGIAEAALEHAVDYAKQRVQFGRPIAANQGISFKLADMATRAEAARHLVYHAADLHNRGLNCGKEASMAKQF
ASDAAVKAALDAVQIYGGYGYMKDYPVERLLRDAKVTQIYEGTNEIQRLIISKYLLGGT
>P9WQG3 1.3.99.-~~~~~~Acyl-CoA dehydrogenase FadE12~~~COG1960
MTDTSFIESEERQALRKAVASWVANYGHEYYLDKARKHEHTSELWAEAGKLGFLGVNLPEEYGGGGAGMYELSLVMEEMA
AAGSALLLMVVSPAINGTIIAKFGTDDQKKRWLPGIADGSLTMAFAITEPDAGSNSHKITTTARRDGSDWIIKGQKVFIS
GIDQAQAVLVVGRSEEAKTGKLRPALFVVPTDAPGFSYTPIEMELVSPERQFQVFLDDVRLPADALVGAEDAAIAQLFAG
LNPERIMGAASAVGMGRFALGRAVDYVKTRKVWSTPIGAHQGLAHPLAQCHIEVELAKLMTQKAATLYDHGDDFGAAEAA
NMAKYAAAEASSRAVDQAVQSMGGNGLTKEYGVAAMMTSARLARIAPISREMVLNFVAQTSLGLPRSY
>Q79AF6 1.2.1.10~~~bphJ~~~Acetaldehyde dehydrogenase 4~~~COG4569
MTKKIKCALIGPGNIGTDLLAKLQRSPVLEPIWMVGIDPESDGLKRAREMGIKTTADGVDGLIPHMQADGVQIVFDATSA
YVHADNSRKVNALGALMIDLTPAAIGPFCVPTVNLKEHVGKGEMNVNMVTCGGQATIPMVAAVSRVQPVAYGEIVATVSS
KSAGPGTRKNIDEFTRTTAGAVEKVGGAKKGKAIIILNPAEPPLIMRDTVHCLLESEPDQAKITESIHAMIKEVQKYVPG
YKLVNGPVFDGLRVSVYLEVEGLGDYLPKYAGNLDIMTAAAARTAEMFAEEILAGQLTLQPVHA
>P77580 1.2.1.10~~~mhpF~~~Acetaldehyde dehydrogenase~~~COG4569
MSKRKVAIIGSGNIGTDLMIKILRHGQHLEMAVMVGIDPQSDGLARARRMGVATTHEGVIGLMNMPEFADIDIVFDATSA
GAHVKNDAALREAKPDIRLIDLTPAAIGPYCVPVVNLEANVDQLNVNMVTCGGQATIPMVAAVSRVARVHYAEIIASIAS
KSAGPGTRANIDEFTETTSRAIEVVGGAAKGKAIIVLNPAEPPLMMRDTVYVLSDEASQDDIEASINEMAEAVQAYVPGY
RLKQRVQFEVIPQDKPVNLPGVGQFSGLKTAVWLEVEGAAHYLPAYAGNLDIMTSSALATAEKMAQSLARKAGEAA
>B0VXM6 1.2.1.10~~~pheF~~~Acetaldehyde dehydrogenase~~~
MSKVKVAILGSGNIGTDLMMKLERSNILQLTAMIGIDPESDGLRRAKEKGYTVISTGIKGFLEQPELADIVFDATSAKAH
IRHAKLLKEAGKTVLDLTPAAVGALVVPPVNLHKHLDEWNVNLITCGGQATIPIVHAINRVHPVGYAEIVATIASKSAGP
GTRANIDEFTQTTARGIEKIGGAKKGKAIIILNPAEPPIMMRNTVYALVEEGKIDENAIVQSILEMVKTVQSYVPGYRIR
TEPIMDGNKITVFLEVEGAGDYLPKYSGNLDIMTAAAVKVAEELAKHKLAAQTA
>P9WQH3 1.2.1.87~~~hsaG~~~Propanal dehydrogenase (CoA-propanoylating)~~~COG4569
MPSKAKVAIVGSGNISTDLLYKLLRSEWLEPRWMVGIDPESDGLARAAKLGLETTHEGVDWLLAQPDKPDLVFEATSAYV
HRDAAPKYAEAGIRAIDLTPAAVGPAVIPPANLREHLDAPNVNMITCGGQATIPIVYAVSRIVEVPYAEIVASVASVSAG
PGTRANIDEFTKTTARGVQTIGGAARGKAIIILNPADPPMIMRDTIFCAIPTDADREAIAASIHDVVKEVQTYVPGYRLL
NEPQFDEPSINSGGQALVTTFVEVEGAGDYLPPYAGNLDIMTAAATKVGEEIAKETLVVGGAR
>Q52060 1.2.1.10~~~dmpF~~~Acetaldehyde dehydrogenase~~~
MNQKLKVAIIGSGNIGTDLMIKVLRNAKYLEMGAMVGIDAASDGLARAQRMGVTTTYAGVEGLIKLPEFADIDFVFDATS
ASAHVQNEALLRQAKPGIRLIDLTPAAIGPYCVPVVNLEEHLGKLNVNMVTCGGQATIPMVAAVSRVAKVHYAEIVASIS
SKSAGPGTRANIDEFTETTSKAIEVIGGAAKGKAIIIMNPAEPPLIMRDTVYVLSAAADQAAVAASVAEMVQAVQAYVPG
YRLKQQVQFDVIPESAPLNIPGLGRFSGLKTSVFLEVEGAAHYLPAYAGNLDIMTSAALATAERMAQSMLNA
>Q53WH9 1.2.1.10~~~~~~Acetaldehyde dehydrogenase~~~
MSERVKVAILGSGNIGTDLMYKLLKNPGHMELVAVVGIDPKSEGLARARALGLEASHEGIAYILERPEIKIVFDATSAKA
HVRHAKLLREAGKIAIDLTPAARGPYVVPPVNLKEHLDKDNVNLITCGGQATIPLVYAVHRVAPVLYAEMVSTVASRSAG
PGTRQNIDEFTFTTARGLEAIGGAKKGKAIIILNPAEPPILMTNTVRCIPEDEGFDREAVVASVRAMEREVQAYVPGYRL
KADPVFERLPTPWGERTVVSMLLEVEGAGDYLPKYAGNLDIMTASARRVGEVFAQHLLGKPVEEVVA
>P9WQG1 1.3.99.-~~~~~~Probable acyl-CoA dehydrogenase FadE25~~~COG1960
MVGWAGNPSFDLFKLPEEHDEMRSAIRALAEKEIAPHAAEVDEKARFPEEALVALNSSGFNAVHIPEEYGGQGADSVATC
IVIEEVARVDASASLIPAVNKLGTMGLILRGSEELKKQVLPALAAEGAMASYALSEREAGSDAASMRTRAKADGDHWILN
GAKCWITNGGKSTWYTVMAVTDPDRGANGISAFMVHKDDEGFTVGPKERKLGIKGSPTTELYFENCRIPGDRIIGEPGTG
FKTALATLDHTRPTIGAQAVGIAQGALDAAIAYTKDRKQFGESISTFQAVQFMLADMAMKVEAARLMVYSAAARAERGEP
DLGFISAASKCFASDVAMEVTTDAVQLFGGAGYTTDFPVERFMRDAKITQIYEGTNQIQRVVMSRALLR
>P52042 1.3.8.1~~~bcd~~~Acyl-CoA dehydrogenase, short-chain specific~~~COG1960
MDFNLTREQELVRQMVREFAENEVKPIAAEIDETERFPMENVKKMGQYGMMGIPFSKEYGGAGGDVLSYIIAVEELSKVC
GTTGVILSAHTSLCASLINEHGTEEQKQKYLVPLAKGEKIGAYGLTEPNAGTDSGAQQTVAVLEGDHYVINGSKIFITNG
GVADTFVIFAMTDRTKGTKGISAFIIEKGFKGFSIGKVEQKLGIRASSTTELVFEDMIVPVENMIGKEGKGFPIAMKTLD
GGRIGIAAQALGIAEGAFNEARAYMKERKQFGRSLDKFQGLAWMMADMDVAIESARYLVYKAAYLKQAGLPYTVDAARAK
LHAANVAMDVTTKAVQLFGGYGYTKDYPVERMMRDAKITEIYEGTSEVQKLVISGKIFR
>Q06319 1.3.8.1~~~~~~Acyl-CoA dehydrogenase, short-chain specific~~~
MDFNLTDIQQDFLKLAHDFGEKKLAPTVTERDHKGIYDKELIDELLSLGITGAYFEEKYGGSGDDGGDVLSYILAVEELA
KYDAGVAITLSATVSLCANPIWQFGTEAQKEKFLVPLVEGTKLGAFGLTEPNAGTDASGQQTIATKNDDGTYTLNGSKIF
ITNGGAADIYIVFAMTDKSKGNHGITAFILEDGTPGFTYGKKEDKMGIHTSQTMELVFQDVKVPAENMLGEEGKGFKIAM
MTLDGGRIGVAAQALGIAEAALADAVEYSKQRVQFGKPLCKFQSISFKLADMKMQIEAARNLVYKAACKKQEGKPFTVDA
AIAKRVASDVAMRVTTEAVQIFGGYGYSEEYPVARHMRDAKITQIYEGTNEVQLMVTGGALLR
>C3UVB0 1.3.99.32~~~Acd~~~Glutaryl-CoA dehydrogenase~~~
MDFNLSKELQMLQKEVRNFVNKKIVPFADQWDNENHFPYEEAVRPMGELGFFGTVIPEEYGGEGMDQGWLAAMIVTEEIA
RGSSALRVQLNMEVLGCAYTILTYGSEALKKKYVPKLSSAEFLGGFGITEPDAGSDVMAMSSTAEDKGDHWLLNGSKTWI
SNAAQADVLIYYAYTDKAAGSRGLSAFVIEPRNFPGIKTSNLEKLGSHASPTGELFLDNVKVPKENILGKPGDGARIVFG
SLNHTRLSAAAGGVGLAQACLDAAIKYCNERRQFGKPIGDFQMNQDMIAQMAVEVEAARLLAYKAAAAKDEGRLNNGLDV
AMAKYAAGEAVSKCANYAMRILGAYGYSTEYPVARFYRDAPTYYMVEGSANICKMIIALDQLGVRKANR
>H8EVV4 4.1.3.1~~~icl1~~~Isocitrate lyase 1~~~
MSVVGTPKSAEQIQQEWDTNPRWKDVTRTYSAEDVVALQGSVVEEHTLARRGAEVLWEQLHDLEWVNALGALTGNMAVQQ
VRAGLKAIYLSGWQVAGDANLSGHTYPDQSLYPANSVPQVVRRINNALQRADQIAKIEGDTSVENWLAPIVADGEAGFGG
ALNVYELQKALIAAGVAGSHWEDQLASEKKCGHLGGKVLIPTQQHIRTLTSARLAADVADVPTVVIARTDAEAATLITSD
VDERDQPFITGERTREGFYRTKNGIEPCIARAKAYAPFADLIWMETGTPDLEAARQFSEAVKAEYPDQMLAYNCSPSFNW
KKHLDDATIAKFQKELAAMGFKFQFITLAGFHALNYSMFDLAYGYAQNQMSAYVELQEREFAAEERGYTATKHQREVGAG
YFDRIATTVDPNSSTTALTGSTEEGQFH
>P9WKK6 4.1.3.1~~~icl1~~~Isocitrate lyase 1~~~
MSVVGTPKSAEQIQQEWDTNPRWKDVTRTYSAEDVVALQGSVVEEHTLARRGAEVLWEQLHDLEWVNALGALTGNMAVQQ
VRAGLKAIYLSGWQVAGDANLSGHTYPDQSLYPANSVPQVVRRINNALQRADQIAKIEGDTSVENWLAPIVADGEAGFGG
ALNVYELQKALIAAGVAGSHWEDQLASEKKCGHLGGKVLIPTQQHIRTLTSARLAADVADVPTVVIARTDAEAATLITSD
VDERDQPFITGERTREGFYRTKNGIEPCIARAKAYAPFADLIWMETGTPDLEAARQFSEAVKAEYPDQMLAYNCSPSFNW
KKHLDDATIAKFQKELAAMGFKFQFITLAGFHALNYSMFDLAYGYAQNQMSAYVELQEREFAAEERGYTATKHQREVGAG
YFDRIATTVDPNSSTTALTGSTEEGQFH
>H8F3R6 4.1.3.1~~~aceAb~~~Isocitrate lyase 2~~~
MAIAETDTEVHTPFEQDFEKDVAATQRYFDSSRFAGIIRLYTARQVVEQRGTIPVDHIVAREAAGAFYERLRELFAARKS
ITTFGPYSPGQAVSMKRMGIEAIYLGGWATSAKGSSTEDPGPDLASYPLSQVPDDAAVLVRALLTADRNQHYLRLQMSER
QRAATPAYDFRPFIIADADTGHGGDPHVRNLIRRFVEVGVPGYHIEDQRPGTKKCGHQGGKVLVPSDEQIKRLNAARFQL
DIMRVPGIIVARTDAEAANLIDSRADERDQPFLLGATKLDVPSYKSCFLAMVRRFYELGVKELNGHLLYALGDSEYAAAG
GWLERQGIFGLVSDAVNAWREDGQQSIDGIFDQVESRFVAAWEDDAGLMTYGEAVADVLEFGQSEGEPIGMAPEEWRAFA
ARASLHAARAKAKELGADPPWDCELAKTPEGYYQIRGGIPYAIAKSLAAAPFADILWMETKTADLADARQFAEAIHAEFP
DQMLAYNLSPSFNWDTTGMTDEEMRRFPEELGKMGFVFNFITYGGHQIDGVAAEEFATALRQDGMLALARLQRKMRLVES
PYRTPQTLVGGPRSDAALAASSGRTATTKAMGKGSTQHQHLVQTEVPRKLLEEWLAMWSGHYQLKDKLRVQLRPQRAGSE
VLELGIHGESDDKLANVIFQPIQDRRGRTILLVRDQNTFGAELRQKRLMTLIHLWLVHRFKAQAVHYVTPTDDNLYQTSK
MKSHGIFTEVNQEVGEIIVAEVNHPRIAELLTPDRVALRKLITKEA
>Q8VJU4 4.1.3.1~~~icl2~~~Isocitrate lyase 2~~~
MAIAETDTEVHTPFEQDFEKDVAATQRYFDSSRFAGIIRLYTARQVVEQRGTIPVDHIVAREAAGAFYERLRELFAARKS
ITTFGPYSPGQAVSMKRMGIEAIYLGGWATSAKGSSTEDPGPDLASYPLSQVPDDAAVLVRALLTADRNQHYLRLQMSER
QRAATPAYDFRPFIIADADTGHGGDPHVRNLIRRFVEVGVPGYHIEDQRPGTKKCGHQGGKVLVPSDEQIKRLNAARFQL
DIMRVPGIIVARTDAEAANLIDSRADERDQPFLLGATKLDVPSYKSCFLAMVRRFYELGVKELNGHLLYALGDSEYAAAG
GWLERQGIFGLVSDAVNAWREDGQQSIDGIFDQVESRFVAAWEDDAGLMTYGEAVADVLEFGQSEGEPIGMAPEEWRAFA
ARASLHAARAKAKELGADPPWDCELAKTPEGYYQIRGGIPYAIAKSLAAAPFADILWMETKTADLADARQFAEAIHAEFP
DQMLAYNLSPSFNWDTTGMTDEEMRRFPEELGKMGFVFNFITYGGHQIDGVAAEEFATALRQDGMLALARLQRKMRLVES
PYRTPQTLVGGPRSDAALAASSGRTATTKAMGKGSTQHQHLVQTEVPRKLLEEWLAMWSGHYQLKDKLRVQLRPQRAGSE
VLELGIHGESDDKLANVIFQPIQDRRGRTILLVRDQNTFGAELRQKRLMTLIHLWLVHRFKAQAVHYVTPTDDNLYQTSK
MKSHGIFTEVNQEVGEIIVAEVNHPRIAELLTPDRVALRKLITKEA
>O07718 4.1.3.1~~~aceAa~~~Putative isocitrate lyase subunit A~~~COG2224
MAIAETDTEVHTPFEQDFEKDVAATQRYFDSSRFAGIIRLYTARQVVEQRGTIPVDHIVAREAAGAFYERLRELFAARKS
ITTFGPYSPGQAVSMKRMGIEAIYLGGWATSAKGSSTEDPGPDLASYPLSQVPDDAAVLVRALLTADRNQHYLRLQMSER
QRAATPAYDFRPFIIADAGTGHGGDPHVRNLIRRFVEVGVPGYHIEDQRPGTKKCGHQGGKVLVPSDEQIKRLNAARFQL
DIMRVPGIIVARTDAEAANLIDSRADERDQPFLLGATKLDVPSYKSCFLAMVRRFTNWASRSSMVIFSMRLATASTRRPA
VGLSAKAFSAWSPTRSTRGGRTASSRSTAFSTRSSRGSWRPGRTTRA
>P42449 4.1.3.1~~~aceA~~~Isocitrate lyase~~~COG2224
MSNVGKPRTAQEIQQDWDTNPRWNGITRDYTADQVADLQGSVIEEHTLARRGSEILWDAVTQEGDGYINALGALTGNQAV
QQVRAGLKAVYLSGWQVAGDANLSGHTYPDQSLYPANSVPSVVRRINNALLRSDEIARTEGDTSVDNWVVPIVADGEAGF
GGALNVYELQKAMIAAGAAGTHWEDQLASEKKCGHLGGKVLIPTQQHIRTLNSARLAADVANTPTVVIARTDAEAATLIT
SDVDERDQPFITGERTAEGYYHVKNGLEPCIARAKSYAPYADMIWMETGTPDLELAKKFAEGVRSEFPDQLLSYNCSPSF
NWSAHLEADEIAKFQKELGAMGFKFQFITLAGFHSLNYGMFDLAYGYAREGMTSFVDLQNREFKAAEERGFTAVKHQREV
GAGYFDQIATTVDPNSSTTALKGSTEEGQFHN
>P0A9G6 4.1.3.1~~~aceA~~~Isocitrate lyase~~~COG2224
MKTRTQQIEELQKEWTQPRWEGITRPYSAEDVVKLRGSVNPECTLAQLGAAKMWRLLHGESKKGYINSLGALTGGQALQQ
AKAGIEAVYLSGWQVAADANLAASMYPDQSLYPANSVPAVVERINNTFRRADQIQWSAGIEPGDPRYVDYFLPIVADAEA
GFGGVLNAFELMKAMIEAGAAAVHFEDQLASVKKCGHMGGKVLVPTQEAIQKLVAARLAADVTGVPTLLVARTDADAADL
ITSDCDPYDSEFITGERTSEGFFRTHAGIEQAISRGLAYAPYADLVWCETSTPDLELARRFAQAIHAKYPGKLLAYNCSP
SFNWQKNLDDKTIASFQQQLSDMGYKFQFITLAGIHSMWFNMFDLANAYAQGEGMKHYVEKVQQPEFAAAKDGYTFVSHQ
QEVGTGYFDKVTTIIQGGTSSVTALTGSTEESQF
>O50078 4.1.3.1~~~aceA~~~Isocitrate lyase~~~
MAHKKTYSQLRSELLARYPVGLTKGGVSIDDIVQLRLQSPYESHLDVARAMASVMRADMAAYDRDTGKFTQSLGCWSGFH
AQQMIKAVKRLRGTTKGAYVYLSGWMVAGLRNRWGHLPDQSMHEKTSVVDLIEEIYVSLRQADEVALNDLFNELKDARAK
GATNKACEEIISRIDGFESHVVPIIADIDAGFGNEHATYLLAKEMIKAGACCLQIENQVSDAKQCGHQDGKVTVPREDFI
EKLRACRLAFEELGVDDGVIVARTDSLGASLTQKIPVSQQAGDFASSYIKWLKTEPITDANPLSEGELAIWQSGNFARPI
RMPNGLFSFREGTGRARVIEDCIASLKDGDADLIWIETDTPNVDEIASMVAEIRKQVPDAKLVYNNSPSFNWTLNLRKQV
RAQWISEGKIAEADYPDGTALMSAQYDTSELGREADDRLRQFQVDISARAGVFHNLITLPTFHLTAKSTDELSHGYFGED
RMLAYVATVQREEIRRSISAVRHQHEVGSDLGDTFKEMVSGDRALKAGGAHNTMNQFAAE
>P9WKK7 4.1.3.1~~~icl~~~Isocitrate lyase~~~COG2224
MSVVGTPKSAEQIQQEWDTNPRWKDVTRTYSAEDVVALQGSVVEEHTLARRGAEVLWEQLHDLEWVNALGALTGNMAVQQ
VRAGLKAIYLSGWQVAGDANLSGHTYPDQSLYPANSVPQVVRRINNALQRADQIAKIEGDTSVENWLAPIVADGEAGFGG
ALNVYELQKALIAAGVAGSHWEDQLASEKKCGHLGGKVLIPTQQHIRTLTSARLAADVADVPTVVIARTDAEAATLITSD
VDERDQPFITGERTREGFYRTKNGIEPCIARAKAYAPFADLIWMETGTPDLEAARQFSEAVKAEYPDQMLAYNCSPSFNW
KKHLDDATIAKFQKELAAMGFKFQFITLAGFHALNYSMFDLAYGYAQNQMSAYVELQEREFAAEERGYTATKHQREVGAG
YFDRIATTVDPNSSTTALTGSTEEGQFH
>Q9I0K4 4.1.3.1~~~~~~Isocitrate lyase~~~
MSAYQNEIKAVAALKEKNGSSWSAINPEYAARMRIQNRFKTGLDIAKYTAAIMRKDMAEYDADSSVYTQSLGCWHGFIGQ
QKLISIKKHLKTTNKRYLYLSGWMVAALRSDFGPLPDQSMHEKTAVSGLIEELYTFLRQADARELDLLFTGLDAARAAGD
KAKEAELLAQIDNFETHVVPIIADIDAGFGNAEATYLLAKKMIEAGACCIQIENQVSDEKQCGHQDGKVTVPHIDFLAKI
NAVRYAFLELGVDDGVIVARTDSLGAGLTKQIAVTNEPGDLGDLYNSFLDCEEISESELGNGDVVIKREGKLLRPKRLAS
NLFQFRKGTGEDRCVLDCITSLQNGADLLWIETEKPHVGQIKAMVDRIREVIPNAKLVYNNSPSFNWTLNFRQQVFDAFV
AEGKDVSAYDRNKLMSVEYDDTELAKVADEKIRTFQRDGSAHAGIFHHLITLPTYHTAALSTDNLAKGYFADEGMLAYVK
GVQRQELRQGIACVKHQNMAGSDIGDNHKEYFAGEAALKASGKDNTMNQFH
>Q8X607 2.7.11.5~~~aceK~~~Isocitrate dehydrogenase kinase/phosphatase~~~COG4579
MPRGLELLIAQTILQGFDAQYGRFLEVTSGAQQRFEQADWHAVQQAMKNRIHLYDHHVGLVVEQLRCITNGQSTDAEFLL
RVKEHYTRLLPDYPRFEIAESFFNSVYCRLFDHRSLTPERLFIFSSQPERRFRTIPRPLAKDFHPDHGWESLLMRVISDL
PLRLHWQNKSRDIHYIIRHLTETLGPENLSKSHLQVANELFYRNKAAWLVGKLITPSGTLPFLLPIHQTDDGELFIDTCL
TTTAEASIVFGFARSYFMVYAPLPAALVEWLREILPGKTTAELYMAIGCQKHAKTESYREYLVYLQGCNEQFIEAPGIRG
MVMLVFTLPGFDRVFKVIKDKFAPQKEMSAAHVRACYQLVKEHDRVGRMADTQEFENFVLEKRHISPALMELLLQEAAEK
ITDLGEQIVIRHLYIERRMVPLNIWLEQVEGQQLRDAIEEYGNAIRQLAAANIFPGDMLFKNFGVTRHGRVVFYDYDEIC
YMTEVNFRDIPPPRYPEDELASEPWYSVSPGDVFPEEFRHWLCADPRIGPLFEEMHADLFRADYWRALQNRIREGHVEDV
YAYRRRQRFSVRYGEMLF
>P11071 2.7.11.5~~~aceK~~~Isocitrate dehydrogenase kinase/phosphatase~~~COG4579
MPRGLELLIAQTILQGFDAQYGRFLEVTSGAQQRFEQADWHAVQQAMKNRIHLYDHHVGLVVEQLRCITNGQSTDAAFLL
RVKEHYTRLLPDYPRFEIAESFFNSVYCRLFDHRSLTPERLFIFSSQPERRFRTIPRPLAKDFHPDHGWESLLMRVISDL
PLRLRWQNKSRDIHYIIRHLTETLGTDNLAESHLQVANELFYRNKAAWLVGKLITPSGTLPFLLPIHQTDDGELFIDTCL
TTTAEASIVFGFARSYFMVYAPLPAALVEWLREILPGKTTAELYMAIGCQKHAKTESYREYLVYLQGCNEQFIEAPGIRG
MVMLVFTLPGFDRVFKVIKDRFAPQKEMSAAHVRACYQLVKEHDRVGRMADTQEFENFVLEKRHISPALMELLLQEAAEK
ITDLGEQIVIRHLYIERRMVPLNIWLEQVEGQQLRDAIEEYGNAIRQLAAANIFPGDMLFKNFGVTRHGRVVFYDYDEIC
YMTEVNFRDIPPPRYPEDELASEPWYSVSPGDVFPEEFRHWLCADPRIGPLFEEMHADLFRADYWRALQNRIREGHVEDV
YAYRRRQRFSVRYGEMLF
>P0DUU5 ~~~aceR~~~HTH-type transcriptional regulator AceR~~~
MNINQEQLLMFQAVMETGSFSAAARKLGKVPSAVSMSIANLEIDLNLTLFERKGREPTPTAEARVLYEKTAQLLIEMNQW
KQHAHALSTGLEPNLTIVVVSELLHTNWTDYVCLLESRFPDLQINIVSAPQEDALQMLLDGSAQLALMFEREHLDNREQF
VELKREALIPVISKTHPLASQEHVSYEQILGTRQIVVASRDETLKPELLFSKHYWRTDNHHSACLMILRNLGWGVLPQEM
FKENPELNNKLKALDVFDFTPRFEYYVDLVWSRESELGAAARFLIDYIRNKRMQPAP
>P9WIZ9 1.-.-.-~~~acg~~~Putative NAD(P)H nitroreductase acg~~~COG0778
MPDTMVTTDVIKSAVQLACRAPSLHNSQPWRWIAEDHTVALFLDKDRVLYATDHSGREALLGCGAVLDHFRVAMAAAGTT
ANVERFPNPNDPLHLASIDFSPADFVTEGHRLRADAILLRRTDRLPFAEPPDWDLVESQLRTTVTADTVRIDVIADDMRP
ELAAASKLTESLRLYDSSYHAELFWWTGAFETSEGIPHSSLVSAAESDRVTFGRDFPVVANTDRRPEFGHDRSKVLVLST
YDNERASLLRCGEMLSAVLLDATMAGLATCTLTHITELHASRDLVAALIGQPATPQALVRVGLAPEMEEPPPATPRRPID
EVFHVRAKDHR
>Q9RKF7 5.5.1.25~~~~~~3,6-anhydro-alpha-L-galactonate cycloisomerase~~~COG4948
MIERVRTDLYRIPLPTRLTDSTHGAMMDFELITVRIEDSDGATGLGYTYTVNHGGAAVATMVDKDLRGCLLGADAEQIEK
IWQSMWWRLHYAGRGGHATSAISAVDIALWDLKGIRARTPLWKLFGGYDPVVPVYAGGIDLELPVADLKTQADRFLAGGF
RAIKMKVGRPDLKEDVDRVSALREHLGDSFPLMVDANMKWTVDGAIRAARALAPFDLHWIEEPTIPDDLVGNARIVRESG
HTIAGGENLHTLYDFHNAVRAGSLTLPEPDVSNIGGYTTFRKVAALAEANNMLLTSHGVHDLTVHALASVPHRTYMEAHG
FGLHAYMAEPMAVTDGCVSAPDRPGHGVVLDFERLGRLAVG
>H2IFX0 5.5.1.25~~~Vejaci~~~3,6-anhydro-alpha-L-galactonate cycloisomerase~~~COG4948
MKTTIKDIKTRLFKIPLKEILSDAKHGDHDHFELITTTVTLEDGSQGTGYTYTGGKGGYSIKAMLEYDIQPALIGKDATQ
IEEIYDFMEWHIHYVGRGGISTFAMSAVDIALWDLKGKREGLPLWKMAGGKNNTCKAYCGGIDLQFPLEKLLNNICGYLE
SGFNAVKIKIGRENMQEDIDRIKAVRELIGPDITFMIDANYSLTVEQAIKLSKAVEQYDITWFEEPTLPDDYKGFAEIAD
NTAIPLAMGENLHTIHEFGYAMDQAKLGYCQPDASNCGGITGWLKAADLITEHNIPVCTHGMQELHVSLVSAFDTGWLEV
HSFPIDEYTKRPLVVENFRAVASNEPGIGVEFDWDKIAQYEV
>P37877 2.7.2.1~~~ackA~~~Acetate kinase~~~COG0282
MSKIIAINAGSSSLKFQLFEMPSETVLTKGLVERIGIADSVFTISVNGEKNTEVTDIPDHAVAVKMLLNKLTEFGIIKDL
NEIDGIGHRVVHGGEKFSDSVLLTDETIKEIEDISELAPLHNPANIVGIKAFKEVLPNVPAVAVFDTAFHQTMPEQSYLY
SLPYEYYEKFGIRKYGFHGTSHKYVTERAAELLGRPLKDLRLISCHLGNGASIAAVEGGKSIDTSMGFTPLAGVAMGTRS
GNIDPALIPYIMEKTGQTADEVLNTLNKKSGLLGISGFSSDLRDIVEATKEGNERAETALEVFASRIHKYIGSYAARMSG
VDAIIFTAGIGENSVEVRERVLRGLEFMGVYWDPALNNVRGEEAFISYPHSPVKVMIIPTDEEVMIARDVVRLAK
>P77845 2.7.2.1~~~ackA~~~Acetate kinase~~~COG0282
MALALVLNSGSSSIKFQLVNPENSAIDEPYVSGLVEQIGEPNGRIVLKIEGEKYTLETPIADHSEGLNLAFDLMDQHNCG
PSQLEITAVGHRVVHGGILFSAPELITDEIVEMIRDLIPLAPLHNPANVDGIDVARKILPDVPHVAVFDTGFFHSLPPAA
ALYAINKDVAAEHGIRRYGFHGTSHEFVSKRVVEILEKPTEDINTITFHLGNGASMAAVQGGRAVDTSMGMTPLAGLVMG
TRSGDIDPGIVFHLSRTAGMSIDEIDNLLNKKSGVKGLSGVNDFRELREMIDNNDQDAWSAYNIYIHQLRRYLGSYMVAL
GRVDTIVFTAGVGENAQFVREDALAGLEMYGIEIDPERNALPNDGPRLISTDASKVKVFVIPTNEELAIARYAVKFA
>P0A6A3 2.7.2.1~~~ackA~~~Acetate kinase~~~COG0282
MSSKLVLVLNCGSSSLKFAIIDAVNGEEYLSGLAECFHLPEARIKWKMDGNKQEAALGAGAAHSEALNFIVNTILAQKPE
LSAQLTAIGHRIVHGGEKYTSSVVIDESVIQGIKDAASFAPLHNPAHLIGIEEALKSFPQLKDKNVAVFDTAFHQTMPEE
SYLYALPYNLYKEHGIRRYGAHGTSHFYVTQEAAKMLNKPVEELNIITCHLGNGGSVSAIRNGKCVDTSMGLTPLEGLVM
GTRSGDIDPAIIFHLHDTLGMSVDAINKLLTKESGLLGLTEVTSDCRYVEDNYATKEDAKRAMDVYCHRLAKYIGAYTAL
MDGRLDAVVFTGGIGENAAMVRELSLGKLGVLGFEVDHERNLAARFGKSGFINKEGTRPAVVIPTNEELVIAQDASRLTA
>A0QLU8 2.7.2.1~~~ackA~~~Acetate kinase~~~
MDGSDGARRVLVINSGSSSLKFQLVDPESGVAASTGIVERIGEESSPVPDHDAALRRAFDMLAGDGVDLNTAGLVAVGHR
VVHGGNTFYRPTVLDDAVIARLHELSELAPLHNPPALLGIEVARRLLPGIAHVAVFDTGFFHDLPPAAATYAIDRELADR
WQIRRYGFHGTSHRYVSEQAAAFLDRPLRGLKQIVLHLGNGCSASAIAGTRPLDTSMGLTPLEGLVMGTRSGDIDPSVVS
YLCHTAGMGVDDVESMLNHRSGVVGLSGVRDFRRLRELIESGDGAAQLAYSVFTHRLRKYIGAYLAVLGHTDVISFTAGI
GENDAAVRRDAVSGMEELGIVLDERRNLPGAKGARQISADDSPITVLVVPTNEELAIARDCVRVLGG
>B2HPZ3 2.7.2.1~~~ackA~~~Acetate kinase~~~COG0282
MSASRPNRVVLVLNSGSSSLKFQLVEPDSGMSRATGNIERIGEESSSVPDHDAALRRVFEILAEDDIDLQSCGLVAVGHR
VVHGGKDFYEPTLLNDAVIGKLDELSPLAPLHNPPAVLCIRVARALLPDVPHIAVFDTAFFHQLPPAAATYAIDRELADV
WKIRRYGFHGTSHEYVSQQAAEFLGKPIGDLNQIVLHLGNGASASAVAGGRPVETSMGLTPLEGLVMGTRSGDLDPGVIG
YLWRTAKLGVDEIESMLNHRSGMLGLAGERDFRRLRAMIDDGDPAAELAYDVFIHRLRKYVGAYLAVLGHTDVVSFTAGI
GEHDAAVRRDTLAGMAELGISLDERRNACPSGGARRISADDSPVTVLVIPTNEELAIARHCCSVLVAV
>Q73T33 2.7.2.1~~~ackA~~~Acetate kinase~~~COG0282
MDGSDGARRVLVINSGSSSLKFQLVDPEFGVAASTGIVERIGEESSPVPDHDAALRRAFDMLAGDGVDLNTAGLVAVGHR
VVHGGNTFYRPTVLDDAVIARLHELSELAPLHNPPALQGIEVARRLLPDIAHVAVFDTGFFHDLPPAAATYAIDRELADR
WQIRRYGFHGTSHRYVSEQAAAFLDRPLRGLKQIVLHLGNGCSASAIAGTRPLDTSMGLTPLEGLVMGTRSGDIDPSIVS
YLCHTAGMGVDDVESMLNHRSGVVGLSGVRDFRRLRELIESGDGAAQLAYSVFTHRLRKYIGAYLAVLGHTDVISFTAGI
GENDAAVRRDAVSGMEELGIVLDERRNLAGGKGARQISADDSPITVLVVPTNEELAIARDCVRVLGG
>A0QQK1 2.7.2.1~~~ackA~~~Acetate kinase~~~COG0282
MTVLVVNSGSSSLKYAVVRPASGEFLADGIIEEIGSGAVPDHDAALRAAFDELAAAGLHLEDLDLKAVGHRMVHGGKTFY
KPSVVDDELIAKARELSPLAPLHNPPAIKGIEVARKLLPDLPHIAVFDTAFFHDLPAPASTYAIDRELAETWHIKRYGFH
GTSHEYVSQQAAIFLDRPLESLNQIVLHLGNGASASAVAGGKAVDTSMGLTPMEGLVMGTRSGDIDPGVIMYLWRTAGMS
VDDIESMLNRRSGVLGLGGASDFRKLRELIESGDEHAKLAYDVYIHRLRKYIGAYMAVLGRTDVISFTAGVGENVPPVRR
DALAGLGGLGIEIDDALNSAKSDEPRLISTPDSRVTVLVVPTNEELAIARACVGVV
>P9WQH1 2.7.2.1~~~ackA~~~Acetate kinase~~~COG0282
MSSTVLVINSGSSSLKFQLVEPVAGMSRAAGIVERIGERSSPVADHAQALHRAFKMLAEDGIDLQTCGLVAVGHRVVHGG
TEFHQPTLLDDTVIGKLEELSALAPLHNPPAVLGIKVARRLLANVAHVAVFDTAFFHDLPPAAATYAIDRDVADRWHIRR
YGFHGTSHQYVSERAAAFLGRPLDGLNQIVLHLGNGASASAIARGRPVETSMGLTPLEGLVMGTRSGDLDPGVISYLWRT
ARMGVEDIESMLNHRSGMLGLAGERDFRRLRLVIETGDRSAQLAYEVFIHRLRKYLGAYLAVLGHTDVVSFTAGIGENDA
AVRRDALAGLQGLGIALDQDRNLGPGHGARRISSDDSPIAVLVVPTNEELAIARDCLRVLGGRRA
>B2RK02 2.7.2.1~~~ackA~~~Acetate kinase~~~COG0282
MKVLVLNCGSSSVKYKLLEMPKGDVLAQGGVEKLGLPGSFLKLTMPNGEKVVLEKDMPEHTIAVEFILSVLKDDKYGCIK
SYEEIDAVGHRLVHGGEKFSNSVEITPEVIAKVEECIPLAPLHNPANLKGVVAIEKLLPGIRQVGVFDTAFFQTMPEHVY
RYALPYDMCNKHGVRRYGFHGTSHRYVSARACEILGLDYDKTRIITAHIGNGASIAAIKNGKALDVSLGMTPVEGLMMGT
RSGDVDPGVLTFLMEAEGLQAAGISELINKKSGVLGVSGVSSDLREIEDAIKNGNERATLAMTMYDYRIKKYVGAYAAAM
GGVDVLVFTGGVGENQYTTREKVCTDMEFMGIVFDSKVNEGMRGKEMVISKPESKVTVIVVPTDEEYMIASDTMTILK
>P63411 2.7.2.1~~~ackA~~~Acetate kinase AckA~~~
MSSKLVLVLNCGSSSLKFAIIDAVNGDEYLSGLAECFHLPEARIKWKMDGSKQEAALGAGAAHSEALNFIVNTILAQKPE
LSAQLTAIGHRIVHGGEKYTSSVVIDESVIQGIKDSASFAPLHNPAHLIGIAEALKSFPQLKDKNVAVFDTAFHQTMPEE
SYLYALPYSLYKEHGVRRYGAHGTSHFYVTQEAAKMLNKPVEELNIITCHLGNGGSVSAIRNGKCVDTSMGLTPLEGLVM
GTRSGDIDPAIIFHLHDTLGMSVDQINKMLTKESGLLGLTEVTSDCRYVEDNYATKEDAKRAMDVYCHRLAKYIGSYTAL
MDGRLDAVVFTGGIGENAAMVRELSLGKLGVLGFEVDHERNLAARFGKSGFINKEGTRPAVVIPTNEELVIAQDASRLTA
>Q99TF2 2.7.2.1~~~ackA~~~Acetate kinase~~~
MSKLILAINAGSSSLKFQLIRMPEEELVTKGLIERIGLKDSIFTIEVNGEKVKTVQDIKDHVEAVDIMLDAFKAHNIIND
INDIDGTGHRVVHGGEKFPESVAITDEVEKEIEELSELAPLHNPANLMGIRAFRKLLPNIPHVAIFDTAFHQTMPEKAYL
YSLPYHYYKDYGIRKYGFHGTSHKFVSQRAAEMLDKPIEDLRIISCHIGNGASIAAIDGGKSIDTSMGFTPLAGVTMGTR
SGNIDPALIPFIMEKTGKTAEQVLEILNKESGLLGLSGTSSDLRDLSEEAESGKARSQMALDVFASKIHKYIGSYAARMH
GVDVIVFTAGIGENSVEIRAKVLEGLEFMGVYWDPKKNENLLRGKEGFINYPHSPVKVVVIPTDEESMIARDVMTFGGLK
>Q9WYB1 2.7.2.1~~~ackA~~~Acetate kinase~~~COG0282
MRVLVINSGSSSIKYQLIEMEGEKVLCKGIAERIGIEGSRLVHRVGDEKHVIERELPDHEEALKLILNTLVDEKLGVIKD
LKEIDAVGHRVVHGGERFKESVLVDEEVLKAIEEVSPLAPLHNPANLMGIKAAMKLLPGVPNVAVFDTAFHQTIPQKAYL
YAIPYEYYEKYKIRRYGFHGTSHRYVSKRAAEILGKKLEELKIITCHIGNGASVAAVKYGKCVDTSMGFTPLEGLVMGTR
SGDLDPAIPFFIMEKEGISPQEMYDILNKKSGVYGLSKGFSSDMRDIEEAALKGDEWCKLVLEIYDYRIAKYIGAYAAAM
NGVDAIVFTAGVGENSPITREDVCSYLEFLGVKLDKQKNEETIRGKEGIISTPDSRVKVLVVPTNEELMIARDTKEIVEK
IGR
>Q9ZAA1 1.2.1.-~~~exaC~~~Acetaldehyde dehydrogenase~~~
MIYAAPGTPGAVVTFKPRYGNYIGGEFVPPVKGQYFTNTSPVNGQPIAEFPRSTAEDIDKALDAAHAAADAWGRTSVQER
SNILLKIADRIEQNLELLAVTETWDNGKAVRETLNADIPLAADHFRYFAGCIRAQEGSAAEINDSTVAYHIHEPLGVVGQ
IIPWNFPLLMAAWKLAPALAAGNCVVLKPAEQTPLGICVLLELIGDLLPPGVLNVVQGFGREAGEALATSKRIAKIAFTG
STPVGSHILKCAAESIIPSTVELGGKSPNIYFEDIMQAEPAFIEKAAEGLVLAFFNQGEVCTCPSRALVQESIYPAFMEE
VLKKVRAIKRGDPLDTETMVGAQASQQQYEKILSYLDIAQQEGAELLAGGSVEKLEGNLASGYYIQPTLLKGHNGMRVFQ
EEIFGPVVGVTTFKDEAEALAIANDTEYGLGAGLWTRDINRAYRMGRGIKAGRVWTNCYHLYPAHAAFGGYKKSGVGRET
HKMMLDHYQQTKNLLVSYDIDPLGFF
>Q7M181 5.1.1.15~~~~~~2-aminohexano-6-lactam racemase~~~
MTKALYDRDGAAIGNLQKLRFFPLAISGGRGARLIEENGRELIDLSGAWGAASLGYGHPAIVAAVSAAAANPAGATILSA
SNAPAVTLAERLLASFPGEGTHKIWFGHSGSDANEAAYRAIVKATGRSGVIAFAGAYHGCTVGSMAFSGHSVQADAAKAD
GLILLPYPDPYRPYRNDPTGDAILTLLTEKLAAVPAGSIGAAFIEPIQSDGGLIVPPDGFLRKFADICRAHGILVVCDEV
KVGLARSGRLHCFEHEGFVPDILVLGKGLGGGLPLSAVIAPAEILDCASAFAMQTLHGNPISAAAGLAVLETIDRDDLPA
MAERKGRLLRDGLSELAKRHPLIGDIRGRGLACGMELVCDRQSREPARAETAKLIYRAYQLGLVVYYVGMNGNVLEFTPP
LTITETDIHKALDLLDRAFSELSAVSNEEIAQFAGW
>A1IHE6 1.14.13.226~~~acmA~~~Acetone monooxygenase (methyl acetate-forming)~~~
MSTTTLDAAVIGTGVAGLYELHMLREQGLEVRAYDKASGVGGTWYWNRYPGARFDSEAYIYQYLFDEDLYKGWSWSQRFP
GQEEIERWLNYVADSLDLRRDISLETEITSAVFDEDRNRWTLTTADGDTIDAQFLITCCGMLSAPMKDLFPGQSDFGGQL
VHTARWPKEGIDFAGKRVGVIGNGATGIQVIQSIAADVDELKVFIRTPQYALPMKNPSYGPDEVAWYKSRFGELKDTLPH
TFTGFEYDFTDAWEDLTPEQRRARLEDDYENGSLKLWLASFAEIFSDEQVSEEVSEFVREKMRARLVDPELCDLLIPSDY
GFGTHRVPLETNYLEVYHRDNVTAVLVRDNPITRIRENGIELADGTVHELDVIIMATGFDAGTGALTRIDIRGRDGRTLA
DDWSRDIRTTMGLMVHGYPNMLTTAVPLAPSAALCNMTTCLQQQTEWISEAIRHLRATGKTVIEPTAEGEEAWVAHHDEL
ADANLISKTNSWYVGSNVPGKPRRVLSYVGGVGAYRDATLEAAAAGYKGFALS
>A1IHE7 3.1.1.114~~~acmB~~~Methyl acetate hydrolase~~~
MTSTFSSLDVSAFTSAADRILAEAVTGDARVPGVVAMVTDRDRTVYSGAAGQRSLGGSAPMTTDDVFAIFSTTKAITATA
ALQLVEEGLLDLDAPASTYAPAIGTLQVIEGFDDAGEPILRAPKSVPTTRQLLTHTGGFGYDFFDEIYNRLAEEKGQPSV
TTASRAALMTPLLFDPGERWQYGTNIDWVGQVVEGLRGKRLGEVFAERIFAPLGIENMSFILREDFRSHLTEIHARNADG
SLTPMGLELPSPPEVDFGGHGLYGTVGEYMKFIRMWLNDGVGEGGRVLKAETVEMALRNHLGDLPVTMLPGVIPSLSNDA
EFFPGQSKSWSLPFMINNETAPTGRPAGAQGWAGLANLFYWIDRQNGYGGYWATQILPFGDPTSFTKYMEFETAFYDALK
S
>P09339 4.2.1.3~~~citB~~~Aconitate hydratase A~~~COG1048
MANEQKTAAKDVFQARKTFTTNGKTYHYYSLKALEDSGIGKVSKLPYSIKVLLESVLRQVDGFVIKKEHVENLAKWGTAE
LKDIDVPFKPSRVILQDFTGVPAVVDLASLRKAMAAVGGDPDKINPEIPVDLVIDHSVQVDKAGTEDALAVNMDLEFERN
AERYKFLSWAKKAFNNYQAVPPATGIVHQVNLEFLASVVHAIEEDGELVTYPDTLVGTDSHTTMINGIGVLGWGVGGIEA
EAGMLGQPSYFPVPEVIGAKLVGKLPNGTTATDLALKVTQVLREKGVVGKFVEFFGPGIAELPLADRATIANMAPEYGAT
CGFFPVDEEALNYLRLTGRDPEHIDVVEAYCRSNGLFYTPDAEDPQFTDVVEIDLSQIEANLSGPKRPQDLIPLSAMQET
FKKQLVSPAGNQGFGLNAEEENKEIKFKLLNGEETVMKTGAIAIAAITSCTNTSNPYVLIGAGLVAKKAVELGLKVPNYV
KTSLAPGSKVVTGYLVNSGLLPYMKELGFNLVGYGCTTCIGNSGPLSPEIEEAVAKNDLLITSVLSGNRNFEGRIHPLVK
GNYLASPPLVVAYALAGTVNINLKTDPIGVGKDGQNVYFNDIWPSMDEINALVKQTVTPELFRKEYETVFDDNKRWNEIE
TTDEALYKWDNDSTYIQNPPFFEEMSVEPGKVEPLKGLRVVGKFGDSVTTDHISPAGAIGKDTPAGKYLQEKGVSPRDFN
SYGSRRGNHEVMMRGTFANIRIKNQIAPGTEGGFTTYWPTGEVTSIYDACMKYKEDKTGLVVLAGKDYGMGSSRDWAAKG
TNLLGIRTVIAESFERIHRSNLVFMGVLPLQFKQGENADTLGLTGKEVIEVDVDETVRPRDLVTVRAINEDGNVTTFEAV
VRFDSEVEIDYYRHGGILQMVLREKMKQS
>P70920 4.2.1.3~~~acnA~~~Aconitate hydratase A~~~COG1048
MTSLDSFKCKKTLKVGAKTYVYYSLPTAEKNGLKGISKLPYSMKVLLENLLRNEDGRSVKKADIVAVSKWLRKKSLEHEI
AFRPARVLMQDFTGVPAVVDLAAMRNAMQKLGGDAEKINPLVPVDLVIDHSVIVNFFGDNKAFAKNVTEEYKQNQERYEF
LKWGQAAFSNFSVVPPGTGICHQVNLEYLSQTVWTKKEKMTVGKKTGTFEVAYPDSLVGTDSHTTMVNGLAVLGWGVGGI
EAEACMLGQPLSMLLPNVVGFKLKGAMKEGVTATDLVLTVTQMLRKLGVVGKFVEFFGPGLDHLSVADKATIANMAPEYG
ATCGFFPVDAAAIDYLKTSGRAAPRVALVQAYAKAQGLFRTAKSADPVFTETLTLDLADVVPSMAGPKRPEGRIALPSVA
EGFSVALANEYKKTEEPAKRFAVEGKKYEIGHGDVVIAAITSCTNTSNPSVLIGAGLLARNAAAKGLKAKPWVKTSLAPG
SQVVAAYLADSGLQAHLDKVGFNLVGFGCTTCIGNSGPLPEEISKSINDNGIVAAAVLSGNRNFEGRVSPDVQANYLASP
PLVVAHALAGSVTKNLAVEPLGEGKDGKPVYLKDIWPTSKEINAFMKKFVTASIFKKKYADVFKGDTNWRKIKTVESETY
RWNMSSTYVQNPPYFEGMKKEPEPVTDIVEARILAMFGDKITTDHISPAGSIKLTSPAGKYLSEHQVRPADFNQYGTRRG
NHEVMMRGTFANIRIKNFMLKGADGNIPEGGLTKHWPDGEQMSIYDAAMKYQQEQVPLVVFAGAEYGNGSSRDWAAKGTR
LLGVRAVICQSFERIHRSNLVGMGVLPLTFEEGTSWSSLGLKGDEKVTLRGLVGDLKPRQKLTAEIVSGDGSLQRVSLLC
RIDTLDELDYYRNGGILHYVLRKLAA
>Q8NQ98 4.2.1.3~~~acn~~~Aconitate hydratase A~~~COG1048
MTESKNSFNAKSTLEVGDKSYDYFALSAVPGMEKLPYSLKVLGENLLRTEDGANITNEHIEAIANWDASSDPSIEIQFTP
ARVLMQDFTGVPCVVDLATMREAVAALGGDPNDVNPLNPAEMVIDHSVIVEAFGRPDALAKNVEIEYERNEERYQFLRWG
SESFSNFRVVPPGTGIVHQVNIEYLARVVFDNEGLAYPDTCIGTDSHTTMENGLGILGWGVGGIEAEAAMLGQPVSMLIP
RVVGFKLTGEIPVGVTATDVVLTITEMLRDHGVVQKFVEFYGSGVKAVPLANRATIGNMSPEFGSTCAMFPIDEETTKYL
RLTGRPEEQVALVEAYAKAQGMWLDEDTVEAEYSEYLELDLSTVVPSIAGPKRPQDRILLSEAKEQFRKDLPTYTDDAVS
VDTSIPATRMVNEGGGQPEGGVEADNYNASWAGSGESLATGAEGRPSKPVTVASPQGGEYTIDHGMVAIASITSCTNTSN
PSVMIGAGLIARKAAEKGLKSKPWVKTICAPGSQVVDGYYQRADLWKDLEAMGFYLSGFGCTTCIGNSGPLPEEISAAIN
EHDLTATAVLSGNRNFEGRISPDVKMNYLASPIMVIAYAIAGTMDFDFENEALGQDQDGNDVFLKDIWPSTEEIEDTIQQ
AISRELYEADYADVFKGDKQWQELDVPTGDTFEWDENSTYIRKAPYFDGMPVEPVAVTDIQGARVLAKLGDSVTTDHISP
ASSIKPGTPAAQYLDEHGVERHDYNSLGSRRGNHEVMMRGTFANIRLQNQLVDIAGGYTRDFTQEGAPQAFIYDASVNYK
AAGIPLVVLGGKEYGTGSSRDWAAKGTNLLGIRAVITESFERIHRSNLIGMGVVPLQFPAGESHESLGLDGTETFDITGL
TALNEGETPKTVKVTATKENGDVVEFDAVVRIDTPGEADYYRHGGILQYVLRQMAASSK
>Q937N8 4.2.1.3~~~acnM~~~Aconitate hydratase A~~~
MNSANRKPLPGTKLDYFDARAAVEAIQPGAYDKLPYTSRVLAENLVRRCDPATLTDSLLQLVGRKRDLDFPWFPARVVCH
DILGQTALVDLAGLRDAIADQGGDPAKVNPVVPVQLIVDHSLAVECGGFDPDAFAKNRAIEDRRNEDRFHFIDWTKQAFK
NVDVIPPGNGIMHQINLEKMSPVIHADNGVAYPDTCVGTDSHTPHVDALGVIAIGVGGLEAENVMLGRASWMRLPDIVGV
ELTGKRQPGITATDIVLALTEFLRKEKVVGAYLEFRGEGASSLTLGDRATISNMAPEYGATAAMFFIDEQTIDYLRLTGR
TDEQLKLVETYARTAGLWADSLKNAEYERVLKFDLSSVVRNMAGPSNPHKRLPTSALAERGIAVDLDKASAQEAEGLMPD
GAVIIAAITSCTNTSNPRNVIAAALLARNANARGLARKPWVKSSLAPGSKAVELYLEEANLLPDLEKLGFGIVAFACTTC
NGMSGALDPKIQQEIIDRDLYATAVLSGNRNFDGRIHPYAKQAFLASPPLVVAYAIAGTIRFDIEKDVLGTDQDGKPVYL
KDIWPSDEEIDAIVAKSVKPEQFRKVYEPMFAITAASGESVSPLYDWRPQSTYIRRPPYWEGALAGERTLKALRPLAVLG
DNITTDHLSPSNAIMLNSAAGEYLARMGLPEEDFNSYATHRGDHLTAQRATFANPTLINEMAVVDGQVKKGSLARIEPEG
KVVRMWEAIETYMDRKQPLIIIAGADYGQGSSRDWAAKGVRLAGVEVIVAEGFERIHRTNLIGMGVLPLEFKPGVNRLTL
GLDGTETYDVIGERQPRATLTLVVNRKNGERVEVPVTCRLDSDEEVSIYEAGGVLHFAQDFLESSRATA
>Q9RTN7 4.2.1.3~~~acn~~~Aconitate hydratase A~~~COG1048
MSDKAMNLFGARDTLQVPGSDKKLYFYNLNKLQGHDVSRLPVSIKVLLESVLREANDYDVRREDVETVAGWSATNPEVEI
PFKPARVILQDFTGVPAVVDLAAMRSAMVKLGGDPSKINPLIPVDLVIDHSVQVDEFGTEFALANNMALEFERNRERYEF
LRWGQQAFDNFGVVPPASGIVHQVNLEYLAKGVQSRAEDDGEVVYPDSLVGTDSHTTMINGLGIVGWGVGGIEAEAVMLG
QPIYMLMPEVIGFKITGAMPEGATATDLALRVTQMLREKGVVGKFVEFYGAGLSNMTLPDRATIANMAPEYGATMGFFPV
DDEALRYLRRTGRLEDEIGLVEAYYKAQGMFRTDETPDPVFTDTIELDLATIVPSLAGPKRPQDRVNLSDMHSVFNEALT
APVKNRGFELGSDKLDAQGTIGGTDIKIGHGAVTLASITSCTNTSNPSVLIAAGLVAKKAVEKGLKTKPWVKTSLAPGSR
VVTEYLETAGLQQYLDQIGFNTVGYGCMTCIGNSGPLPEPVVEAIQEGDLVVASVLSGNRNFEGRVNPHIKANYLASPPL
VVAYALAGTVVNDIVNDAIGQDSNGQDVFLKDIWPTNAEIQEAMDRSINAEMFKKVYDGIEKSNADWNAIPVAEGALFDW
KEDSTYIQNPPFFDTLAGGAHEIESIKGARALVKVGDSVTTDHISPAGSFKADTPAGRYLTERGIAPKDFNSYGSRRGND
RIMTRGTFANIRLKNQLAPGTEGGFTTNFLNGEVTSIFDASTAYKEAGVPLVVLAGKDYGMGSSRDWAAKGTFLLGVKAV
IAESFERIHRSNLVGMGVLPLQYKNGETADSLGINGDETFEFVLPGDLKPRQDVTVKVTGKDGNTRDITVMCRIDTPVEI
DYYKNGGILQTVLRGILSKSQGEVKA
>P25516 4.2.1.3~~~acnA~~~Aconitate hydratase A~~~COG1048
MSSTLREASKDTLQAKDKTYHYYSLPLAAKSLGDITRLPKSLKVLLENLLRWQDGNSVTEEDIHALAGWLKNAHADREIA
YRPARVLMQDFTGVPAVVDLAAMREAVKRLGGDTAKVNPLSPVDLVIDHSVTVDRFGDDEAFEENVRLEMERNHERYVFL
KWGKQAFSRFSVVPPGTGICHQVNLEYLGKAVWSELQDGEWIAYPDTLVGTDSHTTMINGLGVLGWGVGGIEAEAAMLGQ
PVSMLIPDVVGFKLTGKLREGITATDLVLTVTQMLRKHGVVGKFVEFYGDGLDSLPLADRATIANMSPEYGATCGFFPID
AVTLDYMRLSGRSEDQVELVEKYAKAQGMWRNPGDEPIFTSTLELDMNDVEASLAGPKRPQDRVALPDVPKAFAASNELE
VNATHKDRQPVDYVMNGHQYQLPDGAVVIAAITSCTNTSNPSVLMAAGLLAKKAVTLGLKRQPWVKASLAPGSKVVSDYL
AKAKLTPYLDELGFNLVGYGCTTCIGNSGPLPDPIETAIKKSDLTVGAVLSGNRNFEGRIHPLVKTNWLASPPLVVAYAL
AGNMNINLASEPIGHDRKGDPVYLKDIWPSAQEIARAVEQVSTEMFRKEYAEVFEGTAEWKGINVTRSDTYGWQEDSTYI
RLSPFFDEMQATPAPVEDIHGARILAMLGDSVTTDHISPAGSIKPDSPAGRYLQGRGVERKDFNSYGSRRGNHEVMMRGT
FANIRIRNEMVPGVEGGMTRHLPDSDVVSIYDAAMRYKQEQTPLAVIAGKEYGSGSSRDWAAKGPRLLGIRVVIAESFER
IHRSNLIGMGILPLEFPQGVTRKTLGLTGEEKIDIGDLQNLQPGATVPVTLTRADGSQEVVPCRCRIDTATELTYYQNDG
ILHYVIRNMLK
>A0QX20 4.2.1.3~~~acnA~~~Aconitate hydratase A~~~COG1048
MSSENTGKSSLNSFGARDTLTVGDQSYEIYRLNAVPGTEKLPYSLKVLAENLLRTEDGANITKDHIEAIANWDPNAEPSI
EIQFTPARVIMQDFTGVPCIVDLATMREAVAALGGDPNKVNPLAPAELVIDHSVILDVFGNASAFERNVELEYERNAERY
QFLRWGQGAFDDFKVVPPGTGIVHQVNIEYLARTVMVRDGVAYPDTCVGTDSHTTMVNGLGVLGWGVGGIEAEAAMLGQP
VSMLIPRVVGFKLSGEIKPGVTATDVVLTVTDMLRRHGVVGKFVEFYGKGVAEVPLANRATLGNMSPEFGSTAAIFPIDE
ETINYLRLTGRTDEQLALVEAYAKAQGMWHDPEREPVFSEYLELDLSTVVPSISGPKRPQDRIELTDAKNAFRKDIHNYV
EQNHPTPETKLDEAVEESFPASDPVSLSFADDGAPDMRPSAANGATGRPTNPVLVHSEERGDFVLDHGAVVVAGITSCTN
TSNPSVMLGAALLAKKAVEKGLTTKPWVKTNMAPGSQVVTDYYNKAGLWPYLEKLGYYLGGYGCTTCIGNTGPLPEEISK
AINDNDLAVTAVLSGNRNFEGRISPDVKMNYLASPPLVIAYGIAGTMDFDFESDPLGQDSEGNDVFLRDIWPSAAEIEET
IASSINREMFTESYADVFKGDDRWRSLPTPEGDTFEWDPASTYVRKAPYFDGMPAEPEPVSDIKGARVLALLGDSVTTDH
ISPAGAIKPGTPAAQYLDANGVERKDYNSLGSRRGNHEVMIRGTFANIRLRNQLLDDVSGGYTRDFTQPGGPQAFIYDAS
ENYKKAGIPLVVLGGKEYGSGSSRDWAAKGTVLLGVKAVITESFERIHRSNLIGMGVIPLQFPAGESAASLKLDGTETYD
IEGIEELNSGKTPKTVHVTATKEDGSKVEFDAVVRIDTPGEADYYRNGGILQYVLRNMLKSSK
>O53166 4.2.1.3~~~acn~~~Aconitate hydratase A~~~COG1048
MTSKSVNSFGAHDTLKVGEKSYQIYRLDAVPNTAKLPYSLKVLAENLLRNEDGSNITKDHIEAIANWDPKAEPSIEIQYT
PARVVMQDFTGVPCIVDLATMREAIADLGGNPDKVNPLAPADLVIDHSVIADLFGRADAFERNVEIEYQRNGERYQFLRW
GQGAFDDFKVVPPGTGIVHQVNIEYLASVVMTRDGVAYPDTCVGTDSHTTMVNGLGVLGWGVGGIEAEAAMLGQPVSMLI
PRVVGFRLTGEIQPGVTATDVVLTVTEMLRQHGVVGKFVEFYGEGVAEVPLANRATLGNMSPEFGSTAAIFPIDEETIKY
LRFTGRTPEQVALVEAYAKAQGMWHDPKHEPEFSEYLELNLSDVVPSIAGPKRPQDRIALAQAKSTFREQIYHYVGNGSP
DSPHDPHSKLDEVVEETFPASDPGQLTFANDDVATDETVHSAAAHADGRVSNPVRVKSDELGEFVLDHGAVVIAAITSCT
NTSNPEVMLGAALLARNAVEKGLTSKPWVKTTIAPGSQVVNDYYDRSGLWPYLEKLGFYLVGYGCTTCIGNSGPLPEEIS
KAVNDNDLSVTAVLSGNRNFEGRINPDVKMNYLASPPLVIAYALAGTMDFDFQTQPLGQDKDGKNVFLRDIWPSQQDVSD
TIAAAINQEMFTRNYADVFKGDDRWRNLPTPSGNTFEWDPNSTYVRKPPYFEGMTAKPEPVGNISGARVLALLGDSVTTD
HISPAGAIKPGTPAARYLDEHGVDRKDYNSFGSRRGNHEVMIRGTFANIRLRNQLLDDVSGGYTRDFTQPGGPQAFIYDA
AQNYAAQHIPLVVFGGKEYGSGSSRDWAAKGTLLLGVRAVIAESFERIHRSNLIGMGVIPLQFPEGKSASSLGLDGTEVF
DITGIDVLNDGKTPKTVCVQATKGDGATIEFDAVVRIDTPGEADYYRNGGILQYVLRNILKSG
>Q8ZP52 4.2.1.3~~~acnA~~~Aconitate hydratase A~~~
MSSTLREASKDTLQAKDKTYHYYSLPLAAKSLGDIARLPKSLKVLLENLLRWQDGESVTDEDIQALAGWLKNAHADREIA
WRPARVLMQDFTGVPAVVDLAAMREAVKRLGGDTSKVNPLSPVDLVIDHSVTVDHFGDDDAFEENVRLEMERNHERYMFL
KWGKQAFSRFSVVPPGTGICHQVNLEYLGKAVWSELQDGEWIAYPDSLVGTDSHTTMINGLGVLGWGVGGIEAEAAMLGQ
PVSMLIPDVVGFKLTGKLREGITATDLVLTVTQMLRKHGVVGKFVEFYGDGLDSLPLADRATIANMSPEYGATCGFFPID
AITLEYMRLSGRSDDLVELVETYAKAQGMWRNPGDEPVFTSTLELDMGDVEASLAGPKRPQDRVALGDVPKAFAASAELE
LNTAQRDRQPVDYTMNGQPYQLPDGAVVIAAITSCTNTSNPSVLMAAGLLAKKAVTLGLKRQPWVKASLAPGSKVVSDYL
AQAKLTPYLDELGFNLVGYGCTTCIGNSGPLPEPIETAIKKGDLTVGAVLSGNRNFEGRIHPLVKTNWLASPPLVVAYAL
AGNMNINLATDPLGYDRKGDPVYLKDIWPSAQEIARAVELVSSDMFRKEYAEVFEGTEEWKSIQVESSDTYGWQSDSTYI
RLSPFFDEMQAQPAPVKDIHGARILAMLGDSVTTDHISPAGSIKPDSPAGRYLQNHGVERKDFNSYGSRRGNHEVMMRGT
FANIRIRNEMLPGVEGGMTRHLPGTEAMSIYDAAMLYQQEKTPLAVIAGKEYGSGSSRDWAAKGPRLLGIRVVIAESFER
IHRSNLIGMGILPLEFPQGVTRKTLGLTGEEVIDIADLQNLRPGATIPVTLTRSDGSKETVPCRCRIDTATELTYYQNDG
ILHYVIRNMLN
>P99148 4.2.1.3~~~acnA~~~Aconitate hydratase A~~~
MAANFKEQSKKHFDLNGQSYTYYDLKAVEEQGITKVSNLPYSIRVLLESLLRQEDDFVITDDHIKALSQFGKDGNEGEVP
FKPSRVILQDFTGVPAVVDLASLRKAMDDVGGDITKINPEVPVDLVIDHSVQVDSYANPEALERNMKLEFERNYERYQFL
NWATKAFDNYNAVPPATGIVHQVNLEYLASVVHVRDVDGEKTAFPDTLVGTDSHTTMINGIGVLGWGVGGIEAEAGMLGQ
PSYFPIPEVIGVRLVNSLPQGATATDLALRVTQELRKKGVVGKFVEFFGPGVQHLPLADRATIANMAPEYGATCGFFPVD
DESLKYMKLTGRSDEHIALVKEYLKQNHMFFDVEKEDPNYTDVIELDLSTVEASLSGPKRPQDLIFLSDMKSSFENSVTA
PAGNQGHGLDKSEFDKKAEINFKDGSKATMKTGDIAIAAITSCTNTSNPYVMLGAGLVAKKAVEKGLKVPEYVKTSLAPG
SKVVTGYLRDAGLQPYLDDLGFNLVGYGCTTCIGNSGPLLPEIEKAIADEDLLVTSVLSGNRNFEGRIHPLVKANYLASP
QLVVAYALAGTVDIDLQNEPIGKGNDGEDVYLKDIWPSIKEVSDTVDSVVTPELFIEEYNNVYNNNELWNEIDVTDQPLY
DFDPNSTYIQNPSFFQGLSKEPGTIVPLNGLRVMGKFGDSVTTDHISPAGAIGKDTPAGKYLQDHQVPIREFNSYGSRRG
NHEVMVRGTFANIRIKNQLAPGTEGGFTTYWPTNEVMPIFDAAMKYKEDGTGLVVLAGNDYGMGSSRDWAAKGTNLLGVK
TVIAQSYERIHRSNLVMMGVLPLEFKKGESADSLGLDGTEEISVNIDENVQPHDYVKVTAKKQDGDLVEFDAMVRFDSLV
EMDYYRHGGILQMVLRNKLAQ
>P36683 4.2.1.3~~~acnB~~~Aconitate hydratase B~~~COG1049
MLEEYRKHVAERAAEGIAPKPLDANQMAALVELLKNPPAGEEEFLLDLLTNRVPPGVDEAAYVKAGFLAAIAKGEAKSPL
LTPEKAIELLGTMQGGYNIHPLIDALDDAKLAPIAAKALSHTLLMFDNFYDVEEKAKAGNEYAKQVMQSWADAEWFLNRP
ALAEKLTVTVFKVTGETNTDDLSPAPDAWSRPDIPLHALAMLKNAREGIEPDQPGVVGPIKQIEALQQKGFPLAYVGDVV
GTGSSRKSATNSVLWFMGDDIPHVPNKRGGGLCLGGKIAPIFFNTMEDAGALPIEVDVSNLNMGDVIDVYPYKGEVRNHE
TGELLATFELKTDVLIDEVRAGGRIPLIIGRGLTTKAREALGLPHSDVFRQAKDVAESDRGFSLAQKMVGRACGVKGIRP
GAYCEPKMTSVGSQDTTGPMTRDELKDLACLGFSADLVMQSFCHTAAYPKPVDVNTHHTLPDFIMNRGGVSLRPGDGVIH
SWLNRMLLPDTVGTGGDSHTRFPIGISFPAGSGLVAFAAATGVMPLDMPESVLVRFKGKMQPGITLRDLVHAIPLYAIKQ
GLLTVEKKGKKNIFSGRILEIEGLPDLKVEQAFELTDASAERSAAGCTIKLNKEPIIEYLNSNIVLLKWMIAEGYGDRRT
LERRIQGMEKWLANPELLEADADAEYAAVIDIDLADIKEPILCAPNDPDDARPLSAVQGEKIDEVFIGSCMTNIGHFRAA
GKLLDAHKGQLPTRLWVAPPTRMDAAQLTEEGYYSVFGKSGARIEIPGCSLCMGNQARVADGATVVSTSTRNFPNRLGTG
ANVFLASAELAAVAALIGKLPTPEEYQTYVAQVDKTAVDTYRYLNFNQLSQYTEKADGVIFQTAV
>Q8ZRS8 4.2.1.3~~~acnB~~~Aconitate hydratase B~~~
MLEEYRKHVAERAAQGIVPKPLDATQMAALVELLKTPPVGEEEFLLDLLINRVPPGVDEAAYVKAGFLAAVAKGDTTSPL
VSPEKAIELLGTMQGGYNIHPLIDALDDAKLAPIAAKALSHTLLMFDNFYDVEEKAKAGNEYAKQVMQSWADAEWFLSRP
PLAEKITVTVFKVTGETNTDDLSPAPDAWSRPDIPLHAQAMLKNAREGIEPDQPGVVGPIKQIEALQKKGYPLAYVGDVV
GTGSSRKSATNSVLWFMGDDIPNVPNKRGGGLCLGGKIAPIFFNTMEDAGALPIEVDVSNLNMGDVIDVYPYKGEVRNHE
TGELLATFELKTDVLIDEVRAGGRIPLIIGRGLTTKAREALGLPHSDVFRQAKDVAESSRGFSLAQKMVGRACGVKGIRP
GAYCEPKMTSVGSQDTTGPMTRDELKDLACLGFSADLVMQSFCHTAAYPKPVDVTTHHTLPDFIMNRGGVSLRPGDGVIH
SWLNRMLLPDTVGTGGDSHTRFPIGISFPAGSGLVAFAAATGVMPLDMPESVLVRFKGKMQPGITLRDLVHAIPLYAIKQ
GLLTVEKKGKKNIFSGRILEIEGLPDLKVEQAFELTDASAERSAAGCTIKLNKEPIVEYLTSNIVLLKWMIAEGYGDRRT
LERRIQGMEKWLADPQLLEADADAEYAAVIDIDLADIKEPILCAPNDPDDARLLSDVQGEKIDEVFIGSCMTNIGHFRAA
GKLLDNHKGQLPTRLWVAPPTRMDAAQLTEEGYYSVFGKSGARIEIPGCSLCMGNQARVADGATVVSTSTRNFPNRLGTG
ANVFLASAELAAVAALIGKLPTPEEYQTYVAQVDKTAVDTYRYLNFDQLSQYTEKADGVIFQTAV
>Q8EJW3 4.2.1.117~~~acnD~~~2-methylcitrate dehydratase (2-methyl-trans-aconitate forming)~~~COG1048
MSTVMNTQYRKPLPGTALDYFDTREAIEAIAPGAYAKLPYTSRVLAENLVRRCEPEMLTASLKQIIESKQELDFPWFPAR
VVCHDILGQTALVDLAGLRDAIAAKGGDPAQVNPVVPTQLIVDHSLAVEYGGFDKDAFAKNRAIEDRRNEDRFHFINWTQ
KAFKNIDVIPQGNGIMHQINLERMSPVIHARNGVAFPDTLVGTDSHTPHVDALGVIAIGVGGLEAESVMLGRASYMRLPD
IIGVELTGKPQPGITATDIVLALTEFLRAQKVVSSYLEFFGEGAEALTLGDRATISNMTPEFGATAAMFYIDQQTLDYLT
LTGREAEQVKLVETYAKTAGLWSDDLKQAVYPRTLHFDLSSVVRTIAGPSNPHARVPTSELAARGISGEVENEPGLMPDG
AVIIAAITSCTNTSNPRNVIAAGLLARNANAKGLTRKPWVKTSLAPGSKAVQLYLEEANLLPELESLGFGIVGFACTTCN
GMSGALDPVIQQEVIDRDLYATAVLSGNRNFDGRIHPYAKQAFLASPPLVVAYAIAGTIRFDIEKDVLGLDKDGKPVRLI
NIWPSDAEIDAVIAASVKPEQFRKVYEPMFDLSVDYGDKVSPLYDWRPQSTYIRRPPYWEGALAGERTLKGMRPLAVLGD
NITTDHLSPSNAIMMDSAAGEYLHKMGLPEEDFNSYATHRGDHLTAQRATFANPKLKNEMAIVDGKVKQGSLARIEPEGI
VTRMWEAIETYMDRKQPLIIIAGADYGQGSSRDWAAKGVRLAGVEAIVAEGFERIHRTNLVGMGVLPLEFKAGENRATYG
IDGTEVFDVIGSIAPRADLTVIITRKNGERVEVPVTCRLDTAEEVSIYEAGGVLQRFAQDFLESNLK
>Q8NQ97 ~~~acnR~~~HTH-type transcriptional repressor AcnR~~~COG1309
MSVAAGDKPTNSRQEILEGARRCFAEHGYEGATVRRLEEATGKSRGAIFHHFGDKENLFLALAREDAARMAEVVSENGLV
EVMRGMLEDPERYDWMSVRLEISKQLRTDPVFRAKWIDHQSVLDEAVRVRLSRNVDKGQMRTDVPIEVLHTFLETVLDGF
ISRLATGASTEGLSEVLDLVEGTVRKRD
>O53165 ~~~~~~HTH-type transcriptional repressor Rv1474c~~~COG1309
MPKVSEDHLAARRRQILDGARRCFAEYGYDKATVRRLEQAIGMSRGAIFHHFRDKDALFFALAREDTERMAAVASREGLI
GVMRDMLAAPDQFDWLATRLEIARKLRNDPDFSRGWAERSAELAAATTDRLRRQKQANRVRDDVPSDVLRCYLDLVLDGL
LARLASGEDPQRLAAVLDLVENSVRRS
>O31404 1.1.1.-~~~acoA~~~Acetoin:2,6-dichlorophenolindophenol oxidoreductase subunit alpha~~~COG1071
MKLLKREGLSLTEEKALWMYQKMLEIRGFEDKVHELFAQGVLPGFVHLYAGEEAVAVGVCAHLHDGDSITSTHRGHGHCI
AKGCDLDGMMAEIFGKATGLCKGKGGSMHIADLDKGMLGANGIVGGGFTLACGSALTAKYKQTKNVSVCFFGDGANNQGT
FHEGLNLAAVWNLPVVFVAENNGYGEATPFEYASACDSIADRAAAYNMPGVTVDGKDILAVYQAAEEAIERARNGGGPSL
IECMTYRNYGHFEGDAQTYKTKDERVEHLEEKDAIQGFKNYLLKETDANKLSDIEQRVSESIEKAVSFSEDSPYPKDSEL
LTDVYVSYEKGGM
>P27745 1.1.1.-~~~acoA~~~Acetoin:2,6-dichlorophenolindophenol oxidoreductase subunit alpha~~~COG1071
MTARASQDSAALPLDKETLLTVYRKMRTIRDFEERLHVDFGRGDIPGFVHLYAGEEAAGVGILHHLNDGDRIASTHRGHG
HCIAKGVDPVAMMKEIYGKKGGSCNGKGGSMHIADLSKGMMGANGILGAGAPLICGAALAAKFRGKGEVGITFCGDGASN
QGTFLESLNLAAVWNLPVIFVIENNGYAESTSRDYGTAVDSYVDRAAGFGIPGVTVDGTDFFAVHEAAGEVIRRAREGGG
PSLLECKMVRFYGHFEGDAQTYRAAGELDDIRANKDCLKLFGRAVTQAGVVAREELDTIDREVAALIEHAVQEAKAAPQP
GPEDLLTDVYVSY
>P27746 1.1.1.-~~~acoB~~~Acetoin:2,6-dichlorophenolindophenol oxidoreductase subunit beta~~~COG0022
MARKLSIKLAINEAIDQEMTRDPSVIMLGEDIVGGAGADGEKDAWGGVLGVTKGLYAKHGDRLLDTPLSESAYVGAAIGA
AACGMRPIAELMFIDFMGVCFDQIFNQAAKFRYMFGGKAETPVVIRAMVGAGFRAAAQHSQMLTPLFTHIPGLKVVCPST
PYDTKGLLIQAIRDNDPVIFCEHKNLYGLEGEVPEGAYAIPFGEANIVRDGKDVSIVTYGLMVHRALEAAATLAKEGIEA
EIVDLRTLSPLDMDTVLESVENTGRLVVVDEASPRCNIATDISAQVAQQAFGALKAGIEMVCPPHTPVPFSPTLEDLYIP
SAAQIAAAARKTMKGGKH
>A9X6P9 2.8.3.19~~~uctC~~~Acetyl-CoA:oxalate CoA-transferase~~~
MTETTPAKPKGPFDGLLVIDLTHVLNGPFGTTILTDLGARTIKIEPPGHGDDTRTYGPYVGDQSLYFSFVNRGKESIVLN
LKDEGDRAIFLEMVRKADVLAENFRPGVMDRLGFNYEELAKINPRLIYASSSGFGQTGPLAHYPAYDTIVQAMSGIMMAT
GFPDGPPTRVGGTSLSDLCGGVFMFCGIASALYARERTGKGAHIDVSMFDGTLAFLQHALMCWSATGKAPARIGNRHPYM
APFDVFQAQDKPFVICCGNDHLFKALCDVIGAPELATDPRFVENHDRMANNDALKAALEKALSKQPAAHWLDVIHKAGVP
VGPLLDVAEAANLPQTAARNMLIKSGGVMMPGNPVKISGYDDPHERPGAPKLDEQGAALRKEFAAPEAK
>P76518 2.8.3.19~~~yfdE~~~Acetyl-CoA:oxalate CoA-transferase~~~COG1804
MTNNESKGPFEGLLVIDMTHVLNGPFGTQLLCNMGARVIKVEPPGHGDDTRTFGPYVDGQSLYYSFINHGKESVVLDLKN
DHDKSIFINMLKQADVLAENFRPGTMEKLGFSWETLQEINPRLIYASSSGFGHTGPLKDAPAYDTIIQAMSGIMMETGYP
DAPPVRVGTSLADLCGGVYLFSGIVSALYGREKSQRGAHVDIAMFDATLSFLEHGLMAYIATGKSPQRLGNRHPYMAPFD
VFNTQDKPITICCGNDKLFSALCQALELTELVNDPRFSSNILRVQNQAILKQYIERTLKTQAAEVWLARIHEVGVPVAPL
LSVAEAIKLPQTQARNMLIEAGGIMMPGNPIKISGCADPHVMPGAATLDQHGEQIRQEFSS
>P27747 2.3.1.12~~~acoC~~~Dihydrolipoyllysine-residue acetyltransferase component of acetoin cleaving system~~~COG0508
MATEISPTIIPIVMPKWGLSMKEGTVNAWLVDEGTEITVGLPILDVETDKIANAVEAPDAGTLRRKVAQAGDVLPVKALL
GVLAPAEVSDAQIDDYVAAYETPADDAGEEDAAAAYQFADVDGIRVRYARKGGGAETVLFIHGFGGDLDNWLFNLDPLAD
AYTVVALDLPGHGQSSPRLAGTTLAQMAGFVARFMDETGIEAAHVVGHSMGGGVAAQLAVDAPQRVLSVALVSPVGFGDA
VNSGYTEGFVSAQSRRELKPVVELLFADAGLVSRQMLDDLLRYKRLDGVTEALTALGQGLFGGGRQSEQPGQRLANSGKR
VLVVWGGQDQIIPAAHAEAAPPGATVKVFADAGHMSQMEKANDFNALLKKHLGG
>P37032 4.2.1.3~~~acn~~~Aconitate hydratase A~~~COG1048
MKVGQDSLSTKSQLTVDGKTYNYYSLKEAENKHFKGINRLPYSLKVLLENLLRFEDGNTVTTKDIKAIADWLHNKTSQHE
IAFRPTRVLMQDFTGVPAVVDLAAMRTAIVKMGGNADKISPLSPVDLVIDHSVMVDKFASADALEVNTKIEIERNKERYE
FLRWGQKAFSNFQVVPPGTGICHQVNLEYLGKTVWNSENDGQLYAYPDTLVGTDSHTTMINGLGVLGWGVGGIEAEAAML
GQPVSMLIPEVIGFKLSGKLKEGITATDLVLTVTQMLRKKGVVGKFVEFYGPGLNDLPLADRATISNMAPEYGATCGFFP
VDKETIKYLELTGRDKHTIALVEAYAKAQGMWYDKDNEEPVFTDSLHLDLGSVEPSLAGPKRPQDKVNLSSLPVEFNNFL
IEVGKEKEKEKTFAVKNKDFQMKHGHVVIAAITSCTNTSNPSVLMAAGLVAKKAIEKGLQRKPWVKSSLAPGSKVVTDYL
RHAGLQTYLDQLGFNLVGYGCTTCIGNSGPLPDDISHCVAEHDLVVSSVLSGNRNFEGRVHPQVRANWLASPPLVVAYAL
CGTTCSDLSREPIGQDKEGNDVYLKDIWPSNEEIAAEVAKVSGTMFRKEYAEVFKGDAHWQAIQTSSGQTYEWNPDSTYI
QHPPFFENLSLKPEPLKPIKQAYVLALFGDSITTDHISPAGSIKASSPAGLYLKSKGVDEKDFNSYGSRRGNHEVMMRGT
FANIRIRNEMTPGQEGGVTRYVPTGETMSIYDAAMRYQENQQDLVIIAGKEYGTGSSRDWAAKGTNLLGVKAVITESFER
IHRSNLIGMGILPLQFKEGTTRKTLKLDGSERISIEISDKLTPGAMVPVTIERQDGDIEKIETLCRIDTADELEYYKNGG
ILQYVLRKISS
>O31551 ~~~acoR~~~Acetoin dehydrogenase operon transcriptional activator AcoR~~~COG3284
MNSVPNDLQTWKRFVKDGVLDEARLRKRIAESWHRCKKAEVNPYLEKGPKVLQQTELDQQSKKHSFFLTTAKPYLEKLLP
AIKEMEMMALLIDSDGVVLALDGHPRALYEAKRINFVEGACWTETAVGTNAIGTALHISEPVAIQGSEHYSIASHLWNCS
AAPIHHEDGSLAGVIDISCPAAGAHPHMLGIATAIAYAAERELAAKSREKELELISRFGERAASSVPMVLCNTKQHIISA
SMPIRTSMPDWQGRHLYELKERGYSIENAVTIGDGGTCFYLSEQKKKKAFRFNGVIGQSGRSQAMLMHLERAAATDASVC
LSGETGTGKEVAARALHENSERRHGPFVAVNCGAIPSDLIESELFGYAEGAFTGAKRNGYKGAFQKANQGTLFLDEIGEI
SHSMQVALLRVLQERKITPIGGTKEIPVDIRVIAATHCDLRELAENGKIREDLFYRLHVYPIELPPLRDRTEDIPDLFEY
YKQKNHWPGDLPSDFCNVLKQWKWPGNIRELFNVFERLSIRFPDGRLRDESLPALLEAAGLPASSAEKKPAAAGVLTFRE
QIQKDMMIKALESAKGNVSQAAKISGIPRSTFYKRLKKFNLSAES
>P28614 ~~~acoR~~~Acetoin catabolism regulatory protein~~~COG3284
MDLRQREHIETVVQATTYLAPPAVLADRIAHDAIIQNSWRRCVHQYGLDPSRMQEARILPQPRLREHQERIDDFARIARH
GLQSLYGQVAGLGYVVLLTDAQGVTVDYIGEARSDAALRHAGLYLGAEWSESGAGTCAVGTALATGQALTVHQADHFDAT
HIPLTCTAAPLFDTHGNLHAILDISALTSPQAKDSQGLALQMVRIYAAHIENANFLRAHRRDWILKLNVAPEFVDVNPEY
LLALDEAGRIVGHNHRARLMLEGELGGAPGATVLGQRFETLFDARLEDLGHYVYSRPSEQRLVALTRSGGLLYLSVLPPA
LRWQAPPAETQVAMPDALAALTGGDAALQLQLQRAARLVDSPINLLIHGETGSGKEFLAKALHLASARRGGPFVAVNCAA
IPETLIESELFGHLPNSFSGAGPRGKRGLIQEADGGTLFLDEIGDMPRELQSRLLRVLAEGEVLPVGAARPVPVRLRVIS
ATHHSLEQLVADGRFREDLYYRLNGARFTLPPLRARTDLDWLVRKLLQEGSAEGSEITLSPAARERLHRHRWPGNLRELR
NVLEYARAVCADGYIDVPDLPDSLAGPAPSAALPQPGPAQSPAAAPFDPHQLPPEGMLLMQYLRASGWNLSAVARQIGVS
RMTLYRRMERYGIQSPNRRDGGPEPTDA
>P74334 1.13.11.75~~~~~~Apocarotenoid-15,15'-oxygenase~~~COG3670
MVTSPPTSSPSQRSYSPQDWLRGYQSQPQEWDYWVEDVEGSIPPDLQGTLYRNGPGLLEIGDRPLKHPFDGDGMVTAFKF
PGDGRVHFQSKFVRTQGYVEEQKAGKMIYRGVFGSQPAGGWLKTIFDLRLKNIANTNITYWGDRLLALWEGGQPHRLEPS
NLATIGLDDLGGILAEGQPLSAHPRIDPASTFDGGQPCYVTFSIKSSLSSTLTLLELDPQGKLLRQKTETFPGFAFIHDF
AITPHYAIFLQNNVTLNGLPYLFGLRGAGECVQFHPDKPAQIILVPRDGGEIKRIPVQAGFVFHHANAFEENGKIILDSI
CYNSLPQVDTDGDFRSTNFDNLDPGQLWRFTIDPAAATVEKQLMVSRCCEFPVVHPQQVGRPYRYVYMGAAHHSTGNAPL
QAILKVDLESGTETLRSFAPHGFAGEPIFVPRPGGVAEDDGWLLCLIYKADLHRSELVILDAQDITAPAIATLKLKHHIP
YPLHGSWAQT
>Q44643 ~~~acpA~~~Capsule synthesis positive regulator AcpA~~~
MEKDISRKIDLLNILIEEKRWFTLFELEKNLNCSSKTIRKDISIINDLLPKTIFIHSKKGKGVKLSLPQNQSISEAISNL
LKKSLTFLAIQQLLEERSNTVTSLADKLYLPISSTNIVLKRVSKYIKKFGLSLEKKPLRIVGDEFQIILMFSERYLESYP
DTEWPFTEYKEEMLIDYINYIEEKLEIVFYSNDKRRMAFIMTILFKRIKQGHKVKFSEWIIKETMESIYYKKIFEGKNVI
KVNKNRSLNIEEQVLLVIMVKLSRYVSKDENNLKQEELVLYKEGESTTYTYVKNFISILEQELKIDLNNNEEFVYGMIEY
CREAFHILKFIPILKAPEKDTCKYIKKHYEETFYLVKRAYNKWGAEMKLTDIPDEEIAKVTMRIVAIGKQHNINRKKVLL
ITGEGKSWEEYMKSRINKRYGDQLKFVGGHAKILNGNTDNIDNIDIDFIITTVPLNFSWKSIVYVSPILQERDFYEIGIF
ASK
>Q9RMX9 ~~~acpB~~~Capsule synthesis positive regulator AcpB~~~
MEKDIKRQIQILEIITSEEKWFTTIEISKILRCCNKTIMKDISFIKDFLPEDWHIKIKKGKGVRIYLPYNKHRNEITFLL
FRESLTFRILQHLFERETKTIATLAERLYIQVPSILPALKRVENYLKKFGLKLRKKPLRLEGDEVRIMIMYLDLYLKSYN
DTEWPFEKLKKEVIFQYLGTLEESLGISLHVVSKRHLSFFIAILLKRKQQGYKVQLNRKFLYFNTETPDYVKIGRIFEKL
EREFGVSLTVQDKILLTISIKSSKYVYKDINKEKEESVQYFKEGNLSIYELVKDFINSLEEKLKVDLISDEEFIFALVDY
FKRTIYHLQYLCMFERPQKQTIQYMQTEHSETFSAVKEVYTEFVKKNEIADYVSVEEIAKVTMYIEASRLRYTSNYKKVL
LVTGESESWAEYLAATLAKRFGDKIQISTVFFAKKSDHDVNADFIISTIPLDLGSTPIICINSIPTERDYTNIQYYLDLQ
DG
>P21515 3.1.4.14~~~acpH~~~Acyl carrier protein phosphodiesterase~~~COG3124
MNFLAHLHLAHLAESSLSGNLLADFVRGNPEESFPPDVVAGIHMHRRIDVLTDNLPEVREAREWFRSETRRVAPITLDVM
WDHFLSRHWSQLSPDFPLQEFVCYAREQVMTILPDSPPRFINLNNYLWSEQWLVRYRDMDFIQNVLNGMASRRPRLDALR
DSWYDLDAHYDALETRFWQFYPRMMAQASRKAL
>Q7PC63 ~~~acpK~~~Polyketide biosynthesis acyl-carrier-protein AcpK~~~COG0236
MDKQRIFEVLITNICEVLPELDGHRFEPEDQLVELGADSVDRAEIITMVLEDLSLKIPRIELSGVKNIGELAEVLYDKVQ
SA
>A0R0B3 ~~~acpM~~~Meromycolate extension acyl carrier protein~~~COG0236
MAATQEEIIAGLAEIIEEVTGIEPSEVTPEKSFVDDLDIDSLSMVEIAVQTEDKYGVKIPDEDLAGLRTVGDVVAYIQKL
EEENPEAAAALREKFAADQ
>P9WQF3 ~~~acpM~~~Meromycolate extension acyl carrier protein~~~COG0236
MPVTQEEIIAGIAEIIEEVTGIEPSEITPEKSFVDDLDIDSLSMVEIAVQTEDKYGVKIPDEDLAGLRTVGDVVAYIQKL
EEENPEAAQALRAKIESENPDAVANVQARLEAESK
>Q81JG3 2.7.8.7~~~acpS~~~Holo-[acyl-carrier-protein] synthase~~~COG0736
MIVGIGIDIIELNRIEKMLDGKLKFMERILTENERNVAKGLKGSRLTEFVAGRFAAKEAYSKAVGTGIGKEVSFLDIEVR
NDDRGKPILITSTEHIVHLSISHSKEFAVAQVVLESSSS
>P96618 2.7.8.7~~~acpS~~~Holo-[acyl-carrier-protein] synthase~~~COG0736
MIYGIGLDITELKRIASMAGRQKRFAERILTRSELDQYYELSEKRKNEFLAGRFAAKEAFSKAFGTGIGRQLSFQDIEIR
KDQNGKPYIICTKLSQAAVHVSITHTKEYAAAQVVIERLSS
>P24224 2.7.8.7~~~acpS~~~Holo-[acyl-carrier-protein] synthase~~~COG0736
MAILGLGTDIVEIARIEAVIARSGDRLARRVLSDNEWAIWKTHHQPVRFLAKRFAVKEAAAKAFGTGIRNGLAFNQFEVF
NDELGKPRLRLWGEALKLAEKLGVANMHVTLADERHYACATVIIES
>O25488 2.7.8.7~~~acpS~~~Holo-[acyl-carrier-protein] synthase~~~COG0736
MIGIDIVSIARIEKCVKRFKMKFLERFLSPSEIVLCKDKSSSIAGFFALKEACSKALQVGIGKELSFLDIKISKSPKNAP
LITLSKEKMDYFNIQSLSASISHDAGFAIAVVVVSSSNE
>Q88Z44 2.7.8.7~~~acpS~~~Holo-[acyl-carrier-protein] synthase~~~COG0736
MIYGTGIDLTELSRIEAILAKGLRLPEKILTPAELAVFSRYPVKRQIEFMAGRFSAKEAYSKAYGTGIGAAVGFQDIEIL
DNAQGKPEVTRHPFDGPAWISISHTDTLVMTQVILERGNL
>A0R1H6 2.7.8.7~~~acpS~~~Holo-[acyl-carrier-protein] synthase~~~COG0736
MAIVGVGIDLVSIPDFAEQVDRPGTVFAETFTPGERRDAADKSSSAARHLAARWAAKEAVIKAWSSSRFSKRPALPEGIH
RDIEVVTDMWGRPKVRLSGEIAKHLEDVTIHVSLTHEDQTAAAVAIIEEP
>P9WQD3 2.7.8.7~~~acpS~~~Holo-[acyl-carrier-protein] synthase~~~COG0736
MGIVGVGIDLVSIPDFAEQVDQPGTVFAETFTPGERRDASDKSSSAARHLAARWAAKEAVIKAWSGSRFAQRPVLPEDIH
RDIEVVTDMWGRPRVRLTGAIAEYLADVTIHVSLTHEGDTAAAVAILEAP
>A1KVH5 2.7.8.7~~~acpS~~~Holo-[acyl-carrier-protein] synthase~~~
MIYGIGTDIVSLKRIIRLNKKFGQAFAGRILTPEELLEFPQAGKPVNYLAKRFAAKEAFAKAVGTGIRGAVSFRNIGIGH
DALGKPEFFYGPALSKWLEEQGISRVSLSMSDEEDTVLAFVVAEK
>Q5HED0 2.7.8.7~~~acpS~~~Holo-[acyl-carrier-protein] synthase~~~
MIHGIGVDLIEIDRIQALYSKQPKLVERILTKNEQHKFNNFTHEQRKIEFLAGRFATKEAFSKALGTGLGKHVAFNDIDC
YNDELGKPKIDYEGFIVHVSISHTEHYAMSQVVLEKSAF
>O86785 2.7.8.7~~~acpS~~~Holo-[acyl-carrier-protein] synthase~~~COG0736
MSIIGVGIDVAEVERFGAALERTPALAGRLFLESELLLPGGERRGVASLAARFAAKEALAKALGAPAGLLWTDAEVWVEA
GGRPRLRVTGTVAARAAELGVASWHVSLSHDAGIASAVVIAEG
>P0A2W6 2.7.8.7~~~acpS~~~Holo-[acyl-carrier-protein] synthase~~~COG0736
MIVGHGIDIEELASIESAVTRHEGFAKRVLTAQEMERFTSLKGRRQIEYLAGRWSAKEAFSKAMGTGISKLGFQDLEVLN
NERGAPYFSQAPFSGKIWLSISHTDQFVTASVILEENHES
>P0A2W7 2.7.8.7~~~acpS~~~Holo-[acyl-carrier-protein] synthase~~~COG0736
MIVGHGIDIEELASIESAVTRHEGFAKRVLTAQEMERFTSLKGRRQIEYLAGRWSAKEAFSKAMGTGISKLGFQDLEVLN
NERGAPYFSQAPFSGKIWLSISHTDQFVTASVILEENHES
>Q9WZF6 2.7.8.7~~~acpS~~~Holo-[acyl-carrier-protein] synthase~~~COG0736
MIVGVGIDVLEVERVPEKFAERILGESEKRLFLTRKRRREFIAGRFALKEAFFKALGTGLNGHSFTDVEFLESNGKPVLC
VHKDFGFFNYAHVSLSHDRFAVALVVLEKRKGDIIVEGDESFLRKRFEVLERSVEGWEIETSLPPFTLKKLLESSGCRLV
RYGNILIGE
>Q9KPB6 2.7.8.7~~~acpS~~~Holo-[acyl-carrier-protein] synthase~~~COG0736
MIVGLGTDIAEIERVEKALARSGENFARRILTDSELEQFHASKQQGRFLAKRFAAKEAASKALGTGIAQGVTFHDFTISH
DKLGKPLLILSGQAAELASQLQVENIHLSISDERHYAMATVILERR
>P0A2W5 ~~~acpXL~~~Acyl carrier protein AcpXL~~~
MTATFDKVADIIAETSEIDRATITPESHTIDDLGIDSLDFLDIVFAIDKEFGIKIPLEKWTQEVNEGKVSTEEYFVLKNL
CAKIDELKAAKA
>Q02054 ~~~~~~Actinorhodin polyketide synthase acyl carrier protein~~~COG0236
MATLLTTDDLRRALVECAGETDGTDLSGDFLDLRFEDIGYDSLALMETAARLESRYGVSIPDDVAGRVDTPRELLDLING
ALAEAA
>P43677 ~~~~~~Oxytetracycline polyketide synthase acyl carrier protein~~~
MTLLTLSDLLTLLRECAGEEESIDLGGDVEDVAFDALGYDSLALLNTVGRIERDYGVQLGDDAVEKATTPRALIEMTNAS
LTGASPSAGGAARDK
>P0DTL2 ~~~~~~Anti-Pycsar protein Apyc1~~~
MHIRMVGTGSAFAKKFDNNNALLEQDGCCLLIDCGITLPKALYQMGLAFPEIDAVLISHIHADHVGGLEEFAFQMMFKYN
RKPVLFIADTLIEPLWEHTLRGGLTQDPLNKLEHFFDVRPIIANTETELFPGLRVKLLPTKHIPNKPSYSFLFNDRFFYS
ADMRFDKELLLRLADGGVQTIFHDCQLEEPGVVHASLNELLTLPESVQKKTWLMHYGDTIDQYQGRTGHMRIVEPQRRYE
V
>A0A848M4Z0 ~~~~~~Anti-Pycsar protein Apyc1~~~
MTLKLQMLGTGGAFSRNYFNNNALIFDEDFTLLVDCGVTAPMALHQIDKSWESIDAVLITHTHADHVGGLEELAFQMKLK
HNRKMPLYLAESLVEPLWENTLKGGLYQEGTIMSLDDVFTVIPLPVGKAADISPGISLELTHTRHIPGRDSYSLYLNGRI
FYSADMTFDPDLIHQLVRDRRCDVILHECQLEGAGHVHTTLDELLTLPEEIQEMIYLMHYADNKNDFEGRIGKMRFLEQQ
QVYDL
>A0A4Y6UQ63 ~~~~~~Anti-Pycsar protein Apyc1~~~
MNLQMIGTGNAFAKKYFNNNALIEQDGFKLLIDCGITAPLALYELGIGMEELDAVLVTHTHGDHVGGLEEYGFQMKFKHG
RRPVLLLPEALVDPLWQNTLSGGMTQEGLEKLEDAFDVRALRVGDVQELAPNLCVELVPTSHIAGKKSYSLILNRDVFYS
ADMTFEPELLTTLVRDRGIRRILHEVQLEGPGAVHTTLDELLSLPEEMQSIIKLMHYADNKEQFVGRTGKMEFLEQGLVY
PI
>O67611 ~~~acpP~~~Acyl carrier protein~~~COG0236
MSLEERVKEIIAEQLGVEKEKITPEAKFVEDLGADSLDVVELIMAFEEEFGIEIPDEDAEKIQTVGDVINYLKEKVGG
>P94123 ~~~acpP~~~Acyl carrier protein~~~
MSDVAERVKKIVVDHLGVEESKVTENASFIDDLGADSLDTVELVMAFEEEFGCEIPDDAAEKILTVKDAIDFIKANAAA
>P80643 ~~~acpA~~~Acyl carrier protein~~~COG0236
MADTLERVTKIIVDRLGVDEADVKLEASFKEDLGADSLDVVELVMELEDEFDMEISDEDAEKIATVGDAVNYIQNQQ
>O51647 ~~~acpP~~~Acyl carrier protein~~~
MDNDEIFSKVRSIISEQLDKKEDEITTDSRFVEDLNADSLDIYELLYLLEEAFDDKIPENEANEFETVGDVVNFIKKRKG
>O34163 ~~~acpP~~~Acyl carrier protein~~~
MALIDEIKDVVANQLNISDKSKITDTASFVDDLNADSLDLVELIMELEKRYEIKIPQEDQEKIKNVADAAKYIEEHKK
>P12784 ~~~acpP~~~Acyl carrier protein~~~
MSDIADRVKKIVVEHLGVEEEKVTETTSFIDDLGADSLDTVELVMAFEEEFGIEIPDDAAETIQTFGDAP
>P80918 ~~~acpP~~~Acyl carrier protein~~~
MSDIEARVKKIIAEQLGVEESQVTNEKAFVADLGADSLDTVELVMALEDEFGIEIPDEDAEKITTVQNAIDYANTHQA
>Q72CS8 ~~~acpP~~~Acyl carrier protein~~~COG0236
MSVEEKVKKIIMDQLGVSAEEVKPEASFVEDLGADSLDLTELIMAMEEEFGVEIDDDDAQKILKVKDAIDYVSNKQ
>P0A6A8 ~~~acpP~~~Acyl carrier protein~~~COG0236
MSTIEERVKKIIGEQLGVKQEEVTNNASFVEDLGADSLDTVELVMALEEEFDTEIPDEEAEKITTVQAAIDYINGHQA
>B6JLE2 ~~~acpP~~~Acyl carrier protein~~~
MALFEDIQAVIAEQLNVDAAQVTPEAEFVKDLGADSLDVVELIMALEEKFGIEIPDEQAEKIVNVGDVVKYIEDNKLA
>B5Z6T1 ~~~acpP~~~Acyl carrier protein~~~
MALFEDIQAVIAEQLNVDAAQVTPEAEFVKDLGADSLDVVELIMALEEKFGVEIPDEQAEKIVNVGDVVKYIEDNKLA
>P80920 ~~~acpP~~~Acyl carrier protein~~~
MSDIEQRVKNVVVEQLGVDEAEVTNAASFVDDLGADSLDTVELVMALEEEFGTEIPDEEAEKITTVQLAIDYVKSHQ
>P80921 ~~~acpP~~~Acyl carrier protein~~~
MASKEEILAGLAEIVNEETGLDTAEVQPEKSFTDDLDIDSISMMTIVVNAEDKFGVKIPDEEVKNLKTVQDAVDFIXGA
>P80922 ~~~acpP~~~Acyl carrier protein~~~
MSTIEERVKKIVSEQLGVKEEEITNASSFVDDLGADSLDTVELVMALEEEFETEIPDEEAEKITTVQEAIDYVVSHQ
>Q88LL5 ~~~acpP~~~Acyl carrier protein~~~COG0236
MSTIEERVKKIVAEQLGVKEEEVTVEKSFVDDLGADSLDTVELVMALEEEFETEIPDEEAEKITTVQAAIDYVKAHQA
>P80923 ~~~acpP~~~Acyl carrier protein~~~COG0236
MSTIEERVKKIVAEQLGVKSEEVVNTASFVEDLGADSLDTVELVMALEEEFETEIPDEEAEKITTVQAAIDYVNSHQA
>P19372 ~~~acpP~~~Acyl carrier protein AcpP~~~COG0236
MSDIAERVKKIVIDHLGVDAEKVSEGASFIDDLGADSLDTVELVMAFEEEFGVEIPDDAADSILTVGDAVKFIEKAQA
>Q9ZCH9 ~~~acpP~~~Acyl carrier protein~~~COG0236
MEFKIMSTTDKIEQKVIEMVAEKLNKDKAIITTDSRFIEDLKADSLDTVELMMAIEVEYGIDIPDDEATKIKTVSDVIKY
IKERQS
>P11830 ~~~acpP~~~Acyl carrier protein~~~COG0236
MDRKEIFERIEQVLAEQLGIPAEQITEEADLREDLGMDSLDLVELVSALEDEVGMRVEQSQLEGIETVGHVMELTLDLVA
RLATASAADKPEAAS
>Q0T5U2 ~~~acpP~~~Acyl carrier protein~~~
MSTIEERVKKIIGEQLGVKQEEVTNNASFVEDLGADSLDTVELVMALEEEFDTEIPDEEAEKITTVQAAIDYINGHQA
>Q2FZ51 ~~~acpP~~~Acyl carrier protein~~~COG0236
MENFDKVKDIIVDRLGVDADKVTEDASFKDDLGADSLDIAELVMELEDEFGTEIPDEEAEKINTVGDAVKFINSLEK
>Q5HGK0 ~~~acpP~~~Acyl carrier protein~~~
MENFDKVKDIIVDRLGVDADKVTEDASFKDDLGADSLDIAELVMELEDEFGTEIPDEEAEKINTVGDAVKFINSLEK
>P20804 ~~~acpP~~~Acyl carrier protein~~~COG0236
MNQEIFEKVKKIVVEQLEVDPDKVTPDATFAEDLGADSLDTVELVMALEEEFDIEIPDEVAETIDTVGKAVEHIESK
>Q9WZD0 ~~~acpP~~~Acyl carrier protein~~~COG0236
MASREEIFSKVKSIISEKLGVDESQVTEEAKLIDDLGADSLDLVDLVMDFESEFGVKVDDADLEKISTVGDIVSYIEKKL
G
>Q5SL79 ~~~acpP~~~Acyl carrier protein~~~COG0236
MTEQEIFEKVKAVIADKLQVEPEKVTLEARFIEDLGADSLDTVELIMGLEDEFGLEISDEEAEKIRTVKDAVEYIKAKLG
>P0A2W3 ~~~acpP~~~Acyl carrier protein~~~
MSNIEERVKKIIVEQLGVDEAEVKNEASFVDDLGADSLDTVELVMALEEEFDTEIPDEEAEKITTVQAAIDYVNSAQ
>Q6F7B8 1.2.1.n2~~~acr1~~~Fatty acyl-CoA reductase~~~COG4221
MNKKLEALFRENVKGKVALITGASSGIGLTIAKRIAAAGAHVLLVARTQETLEEVKAAIEQQGGQASIFPCDLTDMNAID
QLSQQIMASVDHVDFLINNAGRSIRRAVHESFDRFHDFERTMQLNYFGAVRLVLNLLPHMIKRKNGQIINISSIGVLANA
TRFSAYVASKAALDAFSRCLSAEVLKHKISITSIYMPLVRTPMIAPTKIYKYVPTLSPEEAADLIVYAIVKRPKRIATHL
GRLASITYAIAPDINNILMSIGFNLFPSSTAALGEQEKLNLLQRAYARLFPGEHW
>A6TP80 ~~~acr3~~~Arsenical-resistance protein Acr3~~~COG0798
MGNENVVHEGKGIGFFERYLTVWVAACIIVGVAIGQLLPAVPETLSRWEYAQVSIPVAILIWLMIYPMMLKIDFTSIVEA
TKKPKGIIVTCVTNWLIKPFTMYLIAAFFFKIVFQNLIPESLANDYLAGAVLLGAAPCTAMVFVWSHLTKGDPAYTLVQV
AVNNIILLFAFTPIVAILLGITDVIVPYDTLFLSVVLFIVIPLVGGYLSRKYIVQSKGIEYFENVFLKKFDNVTIVGLLL
TLIIIFTFQAEVILSNPLHVLLIAVPLTIQTFFIFFLAYGWSKAWKLPHNVASPAGMIGASNFFELAVAVAITLFGLNSG
ATLATVVGVLVEVPVMLTLVKISNRTRHWFPEVAREN
>Q8NQC8 ~~~acr3~~~Arsenical-resistance protein Acr3~~~COG0798
MTNSTQTRAKPARISFLDKYIPLWIILAMAFGLFLGRSVSGLSGFLGAMEVGGISLPIALGLLVMMYPPLAKVRYDKTKQ
IATDKHLMGVSLILNWVVGPALMFALAWLFLPDQPELRTGLIIVGLARCIAMVLVWSDMSCGDREATAVLVAINSVFQVA
MFGALGWFYLQVLPSWLGLPTTTAQFSFWSIVTSVLVFLGIPLLAGVFSRIIGEKIKGREWYEQKFLPAISPFALIGLLY
TIVLLFSLQGDQIVSQPWAVVRLAIPLVIYFVGMFFISLIASKLSGMNYAKSASVSFTAAGNNFELAIAVSIGTFGATSA
QAMAGTIGPLIEIPVLVGLVYAMLWLGPKLFPNDPTLPSSARSTSQIINS
>D2TV88 ~~~CRAC~~~Autotransporter CRAC~~~
MNKVYNTVWNESTGMWVVTSELTRKGGRRPRQIRRTALAGLIAGLLLPSAPALAVDYNNETLGSGATSSSMSLNAGDTAT
DTTINSGGSQRVSSGGSATSTTINSGGFQYVSSGGSATDTTINSGGYQHVSSGGSATDTTINSGGYLSVSGGGTAVDITQ
NSGGVIDTNTYATLSGTNINGSFSIVNGSASNMLLENGGFLSVLNGHQATNTTINSGGYLSVYGDGSAVDITQNSGGAIS
TDTSATLSGTNINGSFSIAGGSASNMLLENRGQLNVNSGHQATNTIINSGGNQHISGGGSATDTTINSAGFQYVYSGGSA
TSTTINRGGNQYVSGGSATNTTINSGGYLSVYGGGTAVDITQNSGGAIDTNTYATLSGTNINGSFSIAGGSASNMLLENG
GYLNVNSGHQATNTTINSGGGLRVSGGGTAVDITQNSGGAISADTSATLSGTNINGSFSIANGSASNMLLENGGSLYVNS
GHQATNTTINSGGGLRVSGGGTAVDITQNSGGAIDTNTYATLSGTNINGSFSIANGSASNMLLENGGYLYVDGGHQATNT
TINSGGILSVSGGGTAVDITQNSGGVIDTNTYATLSGTNINGSFSIANGSASNMLLENGGFLYVNSGHQAMNTTINNSRS
TMNVLGGGSATSTTINSGGYQYVSSGGSATSTTINSGGNQYVSSGGSATDTTINSGGSLVVFDGTAVDITQNSGGAITAD
TSATLSGTNINGSFSIANGSASNMLLERGSLYVEGGHQATNTTINGGGSMDVSTDGSATNTTINDGGQMYVSTDGSVTST
IVNIGGFVNLLGGSATDTTLNEGGRMLVNPQGSATGTIINRGGYQEILRSAGAANTIINGGQQSVLSGGSATDTTLNSGG
AQYINNGGSATDTTLNSGGAQYINNGGSVTNTTINSGGGQYVYINGNVTKTTITDGGILQVDAGGSASQVTQNSGGAIVT
NTSAVLSGTNDKGTFSIAGGSASNMSLENGGLLTVLVGHDASDTTVGSDGTLSVQSGGVLRGTTTLTDNGTLVGNMVTNE
GNLYFLNNSAATFAGTLTGTGTLTQEGGNTRFSGLLSQDGGITLHSGAAMTMVTLQANANVTTQSGTSLTLDNGSILTGN
VTGDNTGAGDMTVKGASVWHLDGDATVGALTLDNGTVDFRPSATTRLTQAFRPVSLVSESLSGNGTFRMNTDIASHTGDM
LNVTGNANGNFVLDIRNTGLEPVSAGTPLQVVHTGSGDAAFSLNGGKVDAGTWEYYLNKENTDWYLKADSSQPGTDNPGT
DNPVPPVRHTTKSADAVLDMATAPVYVFNSELQSLRFRHGDVMQNTRSPGGVWGRYTGSDTRISGGAGSGYSLTQSGMET
GGDTVFDLNESRLAVGAFVSYSDNSISHNRGGSSTVGSTGGGLYATWFNNDGYYVDGVIKVNRFRNELRTWMSDGTAVKG
DYHQNGFGGSLEAGRTFSLNENTWIQPYLRSTAFRAESKDISLDNGMKAKAGTTKSLQGEVGVNLGMNLDVAGTVVRPYL
TTAVSHEFSDNNRVRINDSYNFTNDISGTTGKYGAGVSAQLTANAGVWAEASYQNGENTESPVTGSVGFRINF
>P0AE06 ~~~acrA~~~Multidrug efflux pump subunit AcrA~~~COG0845
MNKNRGFTPLAVVLMLSGSLALTGCDDKQAQQGGQQMPAVGVVTVKTEPLQITTELPGRTSAYRIAEVRPQVSGIILKRN
FKEGSDIEAGVSLYQIDPATYQATYDSAKGDLAKAQAAANIAQLTVNRYQKLLGTQYISKQEYDQALADAQQANAAVTAA
KAAVETARINLAYTKVTSPISGRIGKSNVTEGALVQNGQATALATVQQLDPIYVDVTQSSNDFLRLKQELANGTLKQENG
KAKVSLITSDGIKFPQDGTLEFSDVTVDQTTGSITLRAIFPNPDHTLLPGMFVRARLEEGLNPNAILVPQQGVTRTPRGD
ATVLVVGADDKVETRPIVASQAIGDKWLVTEGLKAGDRVVISGLQKVRPGVQVKAQEVTADNNQQAASGAQPEQSKS
>P31224 ~~~acrB~~~Multidrug efflux pump subunit AcrB~~~COG0841
MPNFFIDRPIFAWVIAIIIMLAGGLAILKLPVAQYPTIAPPAVTISASYPGADAKTVQDTVTQVIEQNMNGIDNLMYMSS
NSDSTGTVQITLTFESGTDADIAQVQVQNKLQLAMPLLPQEVQQQGVSVEKSSSSFLMVVGVINTDGTMTQEDISDYVAA
NMKDAISRTSGVGDVQLFGSQYAMRIWMNPNELNKFQLTPVDVITAIKAQNAQVAAGQLGGTPPVKGQQLNASIIAQTRL
TSTEEFGKILLKVNQDGSRVLLRDVAKIELGGENYDIIAEFNGQPASGLGIKLATGANALDTAAAIRAELAKMEPFFPSG
LKIVYPYDTTPFVKISIHEVVKTLVEAIILVFLVMYLFLQNFRATLIPTIAVPVVLLGTFAVLAAFGFSINTLTMFGMVL
AIGLLVDDAIVVVENVERVMAEEGLPPKEATRKSMGQIQGALVGIAMVLSAVFVPMAFFGGSTGAIYRQFSITIVSAMAL
SVLVALILTPALCATMLKPIAKGDHGEGKKGFFGWFNRMFEKSTHHYTDSVGGILRSTGRYLVLYLIIVVGMAYLFVRLP
SSFLPDEDQGVFMTMVQLPAGATQERTQKVLNEVTHYYLTKEKNNVESVFAVNGFGFAGRGQNTGIAFVSLKDWADRPGE
ENKVEAITMRATRAFSQIKDAMVFAFNLPAIVELGTATGFDFELIDQAGLGHEKLTQARNQLLAEAAKHPDMLTSVRPNG
LEDTPQFKIDIDQEKAQALGVSINDINTTLGAAWGGSYVNDFIDRGRVKKVYVMSEAKYRMLPDDIGDWYVRAADGQMVP
FSAFSSSRWEYGSPRLERYNGLPSMEILGQAAPGKSTGEAMELMEQLASKLPTGVGYDWTGMSYQERLSGNQAPSLYAIS
LIVVFLCLAALYESWSIPFSVMLVVPLGVIGALLAATFRGLTNDVYFQVGLLTTIGLSAKNAILIVEFAKDLMDKEGKGL
IEATLDAVRMRLRPILMTSLAFILGVMPLVISTGAGSGAQNAVGTGVMGGMVTATVLAIFFVPVFFVVVRRRFSRKNEDI
EHSHTVDHH
>G3KIM8 1.3.1.95~~~acrC~~~Acryloyl-CoA reductase (NADH)~~~
MFLLKIKKERMKRMDFSLTREQEMLKKLARQFAEIELEPVAEEIDREHVFPAENFKKMAEIGLTGIGIPKEFGGSGGGTL
EKVIAVSEFGKKCMASASILSIHLIAPQAIYKYGTKEQKETYLPRLTKGGELGAFALTEPNAGSDAGAVKTTAILDSQTN
EYVLNGTKCFISGGGRAGVLVIFALTEPKKGLKGMSAIIVEKGTPGFSIGKVESKMGIAGSETAELIFEDCRVPAANLLG
KEGKGFKIAMEALDGARIGVGAQAIGIAEGAIDLSVKYVHERIQFGKPIANLQGIQWYIADMATKTAAARALVEFAAYLE
DAGKPFTKESAMCKLNASENARFVTNLALQIHGGYGYMKDYPLERMYRDAKITEIYEGTSEIHKVVIAREVMKR
>P24180 ~~~acrE~~~Multidrug export protein AcrE~~~COG0845
MTKHARFFLLPSFILISAALIAGCNDKGEEKAHVGEPQVTVHIVKTAPLEVKTELPGRTNAYRIAEVRPQVSGIVLNRNF
TEGSDVQAGQSLYQIDPATYQANYDSAKGELAKSEAAAAIAHLTVKRYVPLVGTKYISQQEYDQAIADARQADAAVIAAK
ATVESARINLAYTKVTAPISGRIGKSTVTEGALVTNGQTTELATVQQLDPIYVDVTQSSNDFMRLKQSVEQGNLHKENAT
SNVELVMENGQTYPLKGTLQFSDVTVDESTGSITLRAVFPNPQHTLLPGMFVRARIDEGVQPDAILIPQQGVSRTPRGDA
TVLIVNDKSQVEARPVVASQAIGDKWLISEGLKSGDQVIVSGLQKARPGEQVKATTDTPADTASK
>P24181 ~~~acrF~~~Multidrug export protein AcrF~~~COG0841
MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDTVTQVIEQNMNGIDNLMYMSS
TSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQEVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVAS
NVKDTLSRLNGVGDVQLFGAQYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRF
KNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANALDTAKAIKAKLAELQPFFPQG
MKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVL
AIGLLVDDAIVVVENVERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMAL
SVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRYLLIYALIVAGMVVLFLRLPS
SFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLKNEKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDE
NSAEAVIHRAKMELGKIRDGFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPNGL
EDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKFRMLPEDVDKLYVRSANGEMVPF
SAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGDAMALMENLASKLPAGIGYDWTGMSYQERLSGNQAPALVAISF
VVVFLCLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGKGVV
EATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIFFVPVFFVVIRRCFKG
>P0ACS9 ~~~acrR~~~HTH-type transcriptional regulator AcrR~~~COG1309
MARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNIGELELEYQAK
FPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMT
RRAAIIMRGYISGLMENWLFAPQSFDLKKEARDYVAILLEMYLLCPTLRNPATNE
>P0AAW9 ~~~acrZ~~~Multidrug efflux pump accessory protein AcrZ~~~
MLELLKSLVFAVIMVPVVMAIILGLIYGLGEVFNIFSGVGKKDQPGQNH
>P0A5B8 ~~~hspX~~~Alpha-crystallin~~~
MATTLPVQRHPRSLFPEFSELFAAFPSFAGLRPTFDTRLMRLEDEMKEGRYEVRAELPGVDPDKDVDIMVRDGQLTIKAE
RTEQKDFDGRSEFAYGSFVRTVSLPVGADEDDIKATYDKGILTVSVAVSEGKPTEKHIQIRSTN
>P9WMK0 ~~~hspX~~~Alpha-crystallin~~~
MATTLPVQRHPRSLFPEFSELFAAFPSFAGLRPTFDTRLMRLEDEMKEGRYEVRAELPGVDPDKDVDIMVRDGQLTIKAE
RTEQKDFDGRSEFAYGSFVRTVSLPVGADEDDIKATYDKGILTVSVAVSEGKPTEKHIQIRSTN
>P9WMK1 ~~~hspX~~~Alpha-crystallin~~~COG0071
MATTLPVQRHPRSLFPEFSELFAAFPSFAGLRPTFDTRLMRLEDEMKEGRYEVRAELPGVDPDKDVDIMVRDGQLTIKAE
RTEQKDFDGRSEFAYGSFVRTVSLPVGADEDDIKATYDKGILTVSVAVSEGKPTEKHIQIRSTN
>Q48154 ~~~acs1~~~Bifunctional ribulose 5-phosphate reductase/CDP-ribitol pyrophosphorylase Acs1~~~
MLKNKNIGIILAGGIGSRMGLGYPKQFSKIAGKTALEHTIFIFQEHKEIDEIIIVSERTSYRRIEDIVSKAGFSKVNRII
FGGKERSDSTLSAITALQDEPRNTKLIIHDAVRPLLATEIISECIAKLDKYNAVDVAIPAVDTIVHVNNDTQEIIKIPKR
AEYYQGQTPQAFKLGTLKKAYDIYTQGGIEGTCDCSIVLKTLPEERVGIVSGFETNIKLTRPVDLFIADKLFQSRSHFSL
RNITSIDRLYDMKDQVLVVIGGSYGIGAHIIDVAKKFGIKTYSLSRSNGVDVGDVKSIEKAFAGIYGKEHKIDHIVNTAA
VLNHKTLASMSYEEIVTSINVNYTGMINAVITAYPYLKQTHGSFLGFTSSSYTRGRPFYAIYSSAKAAVVNLTQAISEEW
LPDNIKINCVNPERTKTPMRTKAFGIEPEGTLLDPKTVAFASLTVLASRETGNIIDVVLKDEEYISHILADLYK
>P0CW87 ~~~acsAB~~~Cellulose synthase 1~~~
MPEVRSSTQSESGMSQWMGKILSIRGAGLTIGVFGLCALIAATSVTLPPEQQLIVAFVCVVIFFIVGHKPSRRSQIFLEV
LSGLVSLRYLTWRLTETLSFDTWLQGLLGTMLLVAELYALMMLFLSYFQTIAPLHRAPLPLPPNPDEWPTVDIFVPTYNE
ELSIVRLTVLGSLGIDWPPEKVRVHILDDGRRPEFAAFAAECGANYIARPTNEHAKAGNLNYAIGHTDGDYILIFDCDHV
PTRAFLQLTMGWMVEDPKIALMQTPHHFYSPDPFQRNLSAGYRTPPEGNLFYGVVQDGNDFWDATFFCGSCAILRRTAIE
QIGGFATQTVTEDAHTALKMQRLGWSTAYLRIPLAGGLATERLILHIGQRVRWARGMLQIFRIDNPLFGRGLSWGQRLCY
LSAMTSFLFAVPRVIFLSSPLAFLFFGQNIIAASPLALLAYAIPHMFHAVGTASKINKGWRYSFWSEVYETTMALFLVRV
TIVTLLSPSRGKFNVTDKGGLLEKGYFDLGAVYPNIILGLIMFGGLARGVYELSFGHLDQIAERAYLLNSAWAMLSLIII
LAAIAVGRETQQKRNSHRIPATIPVEVANADGSIIVTGVTEDLSMGGAAVKMSWPAKLSGPTPVYIRTVLDGEELILPAR
IIRAGNGRGIFIWTIDNLQQEFSVIRLVFGRADAWVDWGNYKADRPLLSLMDMVLSVKGLFRSSGDIVHRSSPTKPLAGN
ALSDDTNNPSRKERVLKGTVKMVSLLALLTFASSAQAASAPRAVAAKAPAHQPEASDLPPLPALLPATSGAAQAGAGDAG
ANGPGSPTGQPLAADSADALVENAENTSDTATVHNYTLKDLGAAGSITMRGLAPLQGIEFGIPSDQLVTSARLVLSGSMS
PNLRPETNSVTMTLNEQYIGTLRPDPAHPTFGPMSFEINPIFFVSGNRLNFNFASGSKGCSDITNDTLWATISQNSQLQI
TTIALPPRRLLSRLPQPFYDKNVRQHVTVPMVLAQTYDPQILKSAGILASWFGKQTDFLGVTFPVSSTIPQSGNAILIGV
ADELPTSLGRPQVNGPAVLELPNPSDANATILVVTGRDRDEVITASKGIAFASAPLPTDSHMDVAPVDIAPRKPNDAPSF
IAMDHPVRFGDLVTASKLQGTGFTSGVLSVPFRIPPDLYTWRNRPYKMQVRFRSPAGEAKDVEKSRLDVGINEVYLHSYP
LRETHGLVGAVLQGVGLARPASGMQVHDLDVPPWTVFGQDQLNFYFDAMPLARGICQSGAANNAFHLGLDPDSTIDFSRA
HHIAQMPNLAYMATVGFPFTTYADLSQTAVVLPEHPNAATVGAYLDLMGFMGAATWYPVAGVDIVSADHVSDVADRNLLV
ISTLATSGEIAPLLSRSSYEVADGHLRTVSHASALDNAIKAVDDPLTAFRDRDSKPQDVDTPLTGGVGAMIEAESPLTAG
RTVLALLSSDGAGLNNLLQMLGERKKQANIQGDLVVAHGEDLSSYRTSPVYTIGTLPLWLWPDWYMHNRPVRVLLVGLLG
CILIVSVLARALARHATRRFKQLEDERRKS
>Q76KJ8 ~~~acsAB~~~Cellulose synthase 1~~~
MPEVRSSTQSESGMSQWMGKILSIRGAGLIIGVFGLCALIAATSVTLPPEQQLIVAFVCVVIFFIVGHKPSRRSQIFLEV
LSGLVSLRYLTWRLTETLSFDTWLQGLLGTMLLVAELYALMMLFLSYFQTIAPLHRAPLPLPPNPDEWPTVDIFVPTYNE
ELSIVRLTVLGSLGIDWPPEKVRVHILDDGRRPEFAAFAAECGANYIARPTNEHAKAGNLNYAIGHTDGDYILIFDCDHV
PTRAFLQLTMGWMVEDPKIALMQTPHHFYSPDPFQRNLSAGYRTPPEGNLFYGVVQDGNDFWDATFFCGSCAILRRTAIE
QIGGFATQTVTEDAHTALKMQRLGWSTAYLRIPLAGGLATERLILHIGQRVRWARGMLQIFRIDNPLFGRGLSWGQRLCY
LSAMTSFLFAVPRVIFLSSPLAFLFFGQNIIAASPLALLAYAIPHMFHAVGTASKINKGWRYSFWSEVYETTMALFLVRV
TIVTLLSPSRGKFNVTDKGGLLEKGYFDLGAVYPNIILGLIMFGGLARGVYELSFGHLDQIAERAYLLNSAWAMLSLIII
LAAIAVGRETQQKRNSHRIPATIPVEVANADGSIIVTGVTEDLSMGGAAVKMSWPAKLSGPTPVYIRTVLDGEELILPAR
IIRAGNGRGIFIWTIDNLQQEFSVIRLVFGRADAWVDWGNYKADRPLLSLMDMVLSVKGLFRSSGDIVHRSSPTKPSAGN
ALSDDTNNPSRKERVLKGTVKMVSLLALLTFASSAQAASAPRAVAAKAPAHQPEASDLPPLPALLPATSGAAQAGSGDAG
ADGPGSPTGQPLAADSADALVENAENTSDTATVHNYTLKDLGAAGSITMRGLAPLQGIEFGIPSDQLVTSARLVLSGSMS
PNLRPETNSVTMTLNEQYIGTLRPDPAHPTFGPMSFEINPIFFVSGNRLNFNFASGSKGCSDITNDTLWATISQNSQLQI
TTIALPPRRLLSRLPQPFYDKNVRQHVTVPMVLAQTYDPQILKSAGILASWFGKQTDFLGVTFPVSSTIPQSGNAILIGV
ADELPTSFGRPQVNGPAVLELPNPSDANATILVVTGRDRDEVITASKGIAFASAPLPTDSHMDVAPVDIAPRKPNDAPSF
IAMDHPVRFGDLVTASKLQGTGFTSGVLSVPFRIPPDLYTWRNRPYKMQVRFRSPAGEAKDVEKSRLDVGINEVYLHSYP
LRETHGLIGAVLQGVGLARPASGMQVHDLDVPPWTVFGQDQLNFYFDAMPLARGICQSGAANNAFHLGLDPDSTIDFSRA
HHIAQMPNLAYMATVGFPFTTYADLSQTAVVLPEHPNAATVGAYLDLMGFMGAATWYPVAGVDIVSADHVSDVADRNLLV
ISTLATSGEIAPLLSRSSYEVADGHLRTVSHASALDNAIKAVDDPLTAFRDRDSKPQDVDTPLTGGVGAMIEAESPLTAG
RTVLALLSSDGAGLNNLLQMLGERKKQANIQGDLVVAHGEDLSSYRTSPVYTIGTLPLWLWPDWYMHNRPVRVLLVGLLG
CILIVSVLARALARHAARRFKQLEDERRKS
>P39062 6.2.1.1~~~acsA~~~Acetyl-coenzyme A synthetase~~~COG0365
MNLKALPAIEGDHNLKNYEETYRHFDWAEAEKHFSWHETGKLNAAYEAIDRHAESFRKNKVALYYKDAKRDEKYTFKEMK
EESNRAGNVLRRYGNVEKGDRVFIFMPRSPELYFIMLGAIKIGAIAGPLFEAFMEGAVKDRLENSEAKVVVTTPELLERI
PVDKLPHLQHVFVVGGEAESGTNIINYDEAAKQESTRLDIEWMDKKDGFLLHYTSGSTGTPKGVLHVHEAMIQQYQTGKW
VLDLKEEDIYWCTADPGWVTGTVYGIFAPWLNGATNVIVGGRFSPESWYGTIEQLGVNVWYSAPTAFRMLMGAGDEMAAK
YDLTSLRHVLSVGEPLNPEVIRWGHKVFNKRIHDTWWMTETGSQLICNYPCMDIKPGSMGKPIPGVEAAIVDNQGNELPP
YRMGNLAIKKGWPSMMHTIWNNPEKYESYFMPGGWYVSGDSAYMDEEGYFWFQGRVDDVIMTSGERVGPFEVESKLVEHP
AIAEAGVIGKPDPVRGEIIKAFIALREGFEPSDKLKEEIRLFVKQGLAAHAAPREIEFKDKLPKTRSGKIMRRVLKAWEL
NLPAGDLSTMED
>Q89WV5 6.2.1.1~~~acsA~~~Acetyl-coenzyme A synthetase~~~COG0365
MSEKIYDVPAEWAKRAWVDQAKYKEMYARSISDPNGFWAEQAKRIDWMKAPTKIENVSFAPGNVSIKWFEDGVLNVAHNC
IDRHLHKRANQTAIIWEGDDPSQSRHITYKELHDEVCRMANILRTRNVKKGDRVTIYLPMIPEAAYAMLACARIGAIHSV
VFAGFSPDSLAQRINDCQSKVIITADEGLRGGKKVPLKANVDAALAKADGVDWVVVVKRTGGKIDMNPTRDLWYHEAAAM
VTTECPVEHMHAEDPLFILYTSGSTGQPKGVLHTSAGYLVYAAMTHQYVFDYHDGDIYWCTADVGWVTGHSYILYGPLAN
GATTLMFEGVPNYPDNSRFWNVIDKHKVNTFYTAPTAIRALMQGGDEPVKKTSRASLRLLGSVGEPINPEAWEWYHRVVG
EDRCPIVDTWWQTETGGILITPLPGATKLKPGSATQPFFGVVPEIVDADGKVLEGETTGNLCLTRAWPGMMRTVYGDHAR
FEQTYFSTYKGKYFTGDGCRRDADGYYWITGRVDDVINVSGHRMGTAEVESALVAHEKVSEAAVVGFPHDIKGQGIYAYV
TLMAGVQPTEDLRKELVTWVRKEIGPIASPDQIQFAPGLPKTRSGKIMRRILRKIAEDEPGSLGDTSTLADPAVVDDLVK
NRQNKKSA
>P27550 6.2.1.1~~~acs~~~Acetyl-coenzyme A synthetase~~~COG0365
MSQIHKHTIPANIADRCLINPQQYEAMYQQSINVPDTFWGEQGKILDWIKPYQKVKNTSFAPGNVSIKWYEDGTLNLAAN
CLDRHLQENGDRTAIIWEGDDASQSKHISYKELHRDVCRFANTLLELGIKKGDVVAIYMPMVPEAAVAMLACARIGAVHS
VIFGGFSPEAVAGRIIDSNSRLVITSDEGVRAGRSIPLKKNVDDALKNPNVTSVEHVVVLKRTGGKIDWQEGRDLWWHDL
VEQASDQHQAEEMNAEDPLFILYTSGSTGKPKGVLHTTGGYLVYAALTFKYVFDYHPGDIYWCTADVGWVTGHSYLLYGP
LACGATTLMFEGVPNWPTPARMAQVVDKHQVNILYTAPTAIRALMAEGDKAIEGTDRSSLRILGSVGEPINPEAWEWYWK
KIGNEKCPVVDTWWQTETGGFMITPLPGATELKAGSATRPFFGVQPALVDNEGNPLEGATEGSLVITDSWPGQARTLFGD
HERFEQTYFSTFKNMYFSGDGARRDEDGYYWITGRVDDVLNVSGHRLGTAEIESALVAHPKIAEAAVVGIPHNIKGQAIY
AYVTLNHGEEPSPELYAEVRNWVRKEIGPLATPDVLHWTDSLPKTRSGKIMRRILRKIAAGDTSNLGDTSTLADPGVVEK
LLEEKQAIAMPS
>P9WQD1 6.2.1.1~~~acsA~~~Acetyl-coenzyme A synthetase~~~COG0365
MSESTPEVSSSYPPPAHFAEHANARAELYREAEEDRLAFWAKQANRLSWTTPFTEVLDWSGAPFAKWFVGGELNVAYNCV
DRHVEAGHGDRVAIHWEGEPVGDRRTLTYSDLLAEVSKAANALTDLGLVAGDRVAIYLPLIPEAVIAMLACARLGIMHSV
VFGGFTAAALQARIVDAQAKLLITADGQFRRGKPSPLKAAADEALAAIPDCSVEHVLVVRRTGIEMAWSEGRDLWWHHVV
GSASPAHTPEPFDSEHPLFLLYTSGTTGKPKGIMHTSGGYLTQCCYTMRTIFDVKPDSDVFWCTADIGWVTGHTYGVYGP
LCNGVTEVLYEGTPDTPDRHRHFQIIEKYGVTIYYTAPTLIRMFMKWGREIPDSHDLSSLRLLGSVGEPINPEAWRWYRD
VIGGGRTPLVDTWWQTETGSAMISPLPGIAAAKPGSAMTPLPGISAKIVDDHGDPLPPHTEGAQHVTGYLVLDQPWPSML
RGIWGDPARYWHSYWSKFSDKGYYFAGDGARIDPDGAIWVLGRIDDVMNVSGHRISTAEVESALVAHSGVAEAAVVGVTD
ETTTQAICAFVVLRANYAPHDRTAEELRTEVARVISPIARPRDVHVVPELPKTRSGKIMRRLLRDVAENRELGDTSTLLD
PTVFDAIRAAK
>Q8ZKF6 6.2.1.1~~~acs~~~Acetyl-coenzyme A synthetase~~~
MSQTHKHAIPANIADRCLINPEQYETKYKQSINDPDTFWGEQGKILDWITPYQKVKNTSFAPGNVSIKWYEDGTLNLAAN
CLDRHLQENGDRTAIIWEGDDTSQSKHISYRELHRDVCRFANTLLDLGIKKGDVVAIYMPMVPEAAVAMLACARIGAVHS
VIFGGFSPEAVAGRIIDSSSRLVITADEGVRAGRSIPLKKNVDDALKNPNVTSVEHVIVLKRTGSDIDWQEGRDLWWRDL
IEKASPEHQPEAMNAEDPLFILYTSGSTGKPKGVLHTTGGYLVYAATTFKYVFDYHPGDIYWCTADVGWVTGHSYLLYGP
LACGATTLMFEGVPNWPTPARMCQVVDKHQVNILYTAPTAIRALMAEGDKAIEGTDRSSLRILGSVGEPINPEAWEWYWK
KIGKEKCPVVDTWWQTETGGFMITPLPGAIELKAGSATRPFFGVQPALVDNEGHPQEGATEGNLVITDSWPGQARTLFGD
HERFEQTYFSTFKNMYFSGDGARRDEDGYYWITGRVDDVLNVSGHRLGTAEIESALVAHPKIAEAAVVGIPHAIKGQAIY
AYVTLNHGEEPSPELYAEVRNWVRKEIGPLATPDVLHWTDSLPKTRSGKIMRRILRKIAAGDTSNLGDTSTLADPGVVEK
LLEEKQAIAMPS
>P37718 ~~~acsC~~~Cellulose synthase operon protein C~~~
MTHKRYASSLSAGLLATTCVAGLLLQANGARAQQAAEAQAPASSTTMMQAATVAPAQSGQAAVVQRLVQQARFWMQQHQY
ENARQSLQSAARLAPDSVDLLEAEGEYQSHIGNRDAALDTQRRLHQAAPGSTYESQLNDLLHEQAISQPDLAHARSLAAS
GHSDQAVEAYQHLFNGSTPTPSLAVEYYQTLAGVSGQAGTAQDGLIRLVKANPSDFRAQLALAQVLTYQPGTRMEGLQRL
QALQKYQSSAPVEAATAEKSYRQTLSWLPVTPETLPLMQKWLDAHPSDSALRTHMAEPAGGPPDKGALARQDGFKALNAG
RLSAAQAAFQSALNLNAKDGDALGGLGLVAMRAGHNEEAHRYLEDAIAADPKNAAHWRPALAGMAVGEEYGSVRRLIASG
QTQEAEQRLMTLARQPGQSEGATLMLADLQRSTGQTGEAERNYRAILARNGDNPIALMGLARVLMGEGQESEANALLSRL
GGRYSDQVQQIEVSGIMAEAARTSDSAQKVSLLRQAMTKAPDDPWLRINLANALQQQGDSAEAANVMRPLLTSPRTPADY
QAAILYASGNGNDTLARRLLAGLSPDDYSPAIRTIADEMAIKADLASRLSMVSNPTPLVREALAAPDPTGARGVAVADLF
RQRGDMLHAHMALRIASTRNIDLTTEQRLAYATEYMKISNPVAAARLLAPLGDGSGTATGSAMSPDQRQTLMQLRMGISV
AQSDLLNQRGDQAAAYDHLAPALQADPEATSPKLALARLYNGRGKYGHALDIDLAVLRHNPQDLDARQAAVQAAANDGKD
NLAMQLAQDGVQQSPMDARSWLGMAVADRAVGHGDRTLADLRRAYELRLQQLKISRGDAIGGDETQATAPPTANPFRRDA
YGHALSLGAPPGENGYSTAGSVPEISDQMLSSINGQIHTLSEDMAPSVDAGLGFRVRSGTPGMGALTEASVPIVGRIPLQ
AGTSALTFTATPTFLTSGHLPQTGYDIPRFGTNLFALERNLQNQNNSAEHRINTDTIGREAGVAPDVRFANNWVSADVGA
SPLGFTLPNVIGGVEFAPRVGPVTFRVSGERRSITNSVLSYGGMTDALTGKKWGGVVTNHFHGQVEATLGNTIVYGGGGY
AIQTGHHVQSNTEVEGGLGANTLVYRNRKHEVRVGVNLTYFGYKHNEDFYTYGQGGYFSPQSYFAATVPVRYSGHSGLFD
WDVTGSIGYQLFHEHSSAFFPTNPVYQALANGLAGVSTAELSLESARYPGDDVGSLVGGFDGRVGYRVSHSLRLDLSGRF
QKAGNWDEGGAMISAHYLIMDQ
>Q07340 ~~~acsC~~~Corrinoid/iron-sulfur protein large subunit~~~
MPLTGLEIYKQLPKKNCGECGTPTCLAFAMNLASGKASLDSCPYVSDAAREALDAAAAPPIAKVVLGAGPTAVEMGDETE
LFRHDKRFYHETAIAIQVSDNLSSEELKAKVEAINGLNFDRVGQHYTIQAIAIRHDADDPAAFKAAVASVAAATQLNLVL
MADDPDVLKEALAGVADRKPLLYAATGANYEAMTALAKENNCPLAVYGNGLEELAELVDKIVALGHKQLVLDPGARETSR
AIADFTQIRRLAIKKRFRSFGYPIIALTTAANPLDEVLQAVNYVTKYASLVVLRTDAKEHLLPLLSWRQNLYTDPQVPIR
VEEKLNEIGAVNENSPVYVTTNFSLTYYSVEGEIESTKIPSYLLSVDTDGLSVLTAYADGKFEAEKIAAVMKKVDLDNKV
KRHRIIIPGAVAVLKGKLEDLTGWEVIVGPREASGIVAFARANLAS
>P37719 ~~~acsD~~~Cellulose synthase operon protein D~~~
MTIFEKKPDFTLFLQTLSWEIDDQVGIEVRNELLREVGRGMGTRIMPPPCQTVDKLQIELNALLALIGWGTVTLELLSED
QSLRIVHENLPQVGSAGEPSGTWLAPVLEGLYGRWVTSQAGAFGDYVVTRDVDAEDLNAVPRQTIIMYMRVRSSAT
>Q07341 ~~~acsD~~~Corrinoid/iron-sulfur protein small subunit~~~
MAVQILRDRSRAAVQKVVLGATKDQGGTRSHTIVVGGDAALPFHHFEGEIVNRPVIGMEVQDIVPDWPDVLKDPFTDVIN
EPGRWAQKCVAEYGADLIYLKLDGADPEGANHSVDQCVATVKEVLQAVGVPLVVVGCGDVEKDHEVLEAVAEAAAGENLL
LGNAEQENYKSLTAACMVHKHNIIARSPLDINICKQLNILINEMNLPLDHIVIDPSIGGLGYGIEYSFSIMERIRLGALQ
GDKMLSMPVICTVGYEAWRAKEASAPVSEYPGWGKETERGILWEAVTATALLQAGAHILLMRHPEAVARVKENIDQLMVS
NAY
>Q46389 2.1.1.258~~~acsE~~~5-methyltetrahydrofolate:corrinoid/iron-sulfur protein co-methyltransferase~~~
MLIIGERINGMFGDIKRAIQERDPAPVQEWARRQEEGGARALDLNVGPAVQDKVSAMEWLVEVTQEVSNLTLCLDSTNIK
AIEAGLKKCKNRAMINSTNAEREKVEKLFPLAVEHGAALIGLTMNKTGIPKDSDTRLAFAMELVAAADEFGLPMEDLYID
PLILPANVAQDHAPEVLKTLQQIKMLADPAPKTVLGLSNVSQNCQNRPLINRTFLAMAMACGLDAAIADACDEALIETAA
TAEILLNQTVYCDSFVKMFKTR
>P16544 1.3.1.-~~~actIII~~~Putative ketoacyl reductase~~~COG1028
MATQDSEVALVTGATSGIGLEIARRLGKEGLRVFVCARGEEGLRTTLKELREAGVEADGRTCDVRSVPEIEALVAAVVER
YGPVDVLVNNAGRPGGGATAELADELWLDVVETNLTGVFRVTKQVLKAGGMLERGTGRIVNIASTGGKQGVVHAAPYSAS
KHGVVGFTKALGLELARTGITVNAVCPGFVETPMAASVREHYSDIWEVSTEEAFDRITARVPIGRYVQPSEVAEMVAYLI
GPGAAAVTAQALNVCGGLGNY
>P33379 ~~~actA~~~Actin assembly-inducing protein~~~
MGLNRFMRAMMVVFITANCITINPDIIFAATDSEDSSLNTDEWEEEKTEEQPSEVNTGPRYETAREVSSRDIKELEKSNK
VRNTNKADLIAMLKEKAEKGPNINNNNSEQTENAAINEEASGADRPAIQVERRHPGLPSDSAAEIKKRRKAIASSDSELE
SLTYPDKPTKVNKKKVAKESVADASESDLDSSMQSADESSPQPLKANQQPFFPKVFKKIKDAGKWVRDKIDENPEVKKAI
VDKSAGLIDQLLTKKKSEEVNASDFPPPPTDEELRLALPETPMLLGFNAPATSEPSSFEFPPPPTDEELRLALPETPMLL
GFNAPATSEPSSFEFPPPPTEDELEIIRETASSLDSSFTRGDLASLRNAINRHSQNFSDFPPIPTEEELNGRGGRPTSEE
FSSLNSGDFTDDENSETTEEEIDRLADLRDRGTGKHSRNAGFLPLNPFASSPVPSLSPKVSKISAPALISDITKKTPFKN
PSQPLNVFNKKTTTKTVTKKPTPVKTAPKLAELPATKPQETVLRENKTPFIEKQAETNKQSINMPSLPVIQKEATESDKE
EMKPQTEEKMVEESESANNANGKNRSAGIEEGKLIAKSAEDEKAKEEPGNHTTLILAMLAIGVFSLGAFIKIIQLRKNN
>Q02059 2.3.1.-~~~~~~Actinorhodin polyketide putative beta-ketoacyl synthase 1~~~COG0304
MPLDAAPVDPASRGPVSAFEPPSSHGADDDDDHRTNASKELFGLKRRVVITGVGVRAPGGNGTRQFWELLTSGRTATRRI
SFFDPSPYRSQVAAEADFDPVAEGFGPRELDRMDRASQFAVACAREAFAASGLDPDTLDPARVGVSLGSAVAAATSLERE
YLLLSDSGRDWEVDAAWLSRHMFDYLVPSVMPAEVAWAVGAEGPVTMVSTGCTSGLDSVGNAVRAIEEGSADVMFAGAAD
TPITPIVVACFDAIRATTARNDDPEHASRPFDGTRDGFVLAEGAAMFVLEDYDSALARGARIHAEISGYATRCNAYHMTG
LKADGREMAETIRVALDESRTDATDIDYINAHGSGTRQNDRHETAAYKRALGEHARRTPVSSIKSMVGHSLGAIGSLEIA
ACVLALEHGVVPPTANLRTSDPECDLDYVPLEARERKLRSVLTVGSGFGGFQSAMVLRDAETAGAAA
>Q02062 2.3.1.-~~~~~~Actinorhodin polyketide putative beta-ketoacyl synthase 2~~~COG0304
MSVLITGVGVVAPNGLGLAPYWSAVLDGRHGLGPVTRFDVSRYPATLAGQIDDFHAPDHIPGRLLPQTDPSTRLALTAAD
WALQDAKADPESLTDYDMGVVTANACGGFDFTHREFRKLWSEGPKSVSVYESFAWFYAVNTGQISIRHGMRGPSSALVAE
QAGGLDALGHARRTIRRGTPLVVSGGVDSALDPWGWVSQIASGRISTATDPDRAYLPFDERAAGYVPGEGGAILVLEDSA
AAEARGRHDAYGELAGCASTFDPAPGSGRPAGLERAIRLALNDAGTGPEDVDVVFADGAGVPELDAAEARAIGRVFGREG
VPVTVPKTTTGRLYSGGGPLDVVTALMSLREGVIAPTAGVTSVPREYGIDLVLGEPRSTAPRTALVLARGRWGFNSAAVL
RRFAPTP
>P32705 ~~~actP~~~Cation/acetate symporter ActP~~~COG4147
MKRVLTALAATLPFAANAADAISGAVERQPTNWQAIIMFLIFVVFTLGITYWASKRVRSRSDYYTAGGNITGFQNGLAIA
GDYMSAASFLGISALVFTSGYDGLIYSLGFLVGWPIILFLIAERLRNLGRYTFADVASYRLKQGPIRILSACGSLVVVAL
YLIAQMVGAGKLIELLFGLNYHIAVVLVGVLMMMYVLFGGMLATTWVQIIKAVLLLFGASFMAFMVMKHVGFSFNNLFSE
AMAVHPKGVDIMKPGGLVKDPISALSLGLGLMFGTAGLPHILMRFFTVSDAREARKSVFYATGFMGYFYILTFIIGFGAI
MLVGANPEYKDAAGHLIGGNNMAAVHLANAVGGNLFLGFISAVAFATILAVVAGLTLAGASAVSHDLYANVFKKGATERE
ELRVSKITVLILGVIAIILGVLFENQNIAFMVGLAFAIAASCNFPIILLSMYWSKLTTRGAMMGGWLGLITAVVLMILGP
TIWVQILGHEKAIFPYEYPALFSITVAFLGIWFFSATDNSAEGARERELFRAQFIRSQTGFGVEQGRAH
>Q8KP10 3.-.-.-~~~act~~~Methanol dehydrogenase activator~~~
MGKLFEEKTIKTEQIFSGRVVKLQVDDVELPNGQTSKREIVRHPGAVAVIAITNENKIVMVEQYRKPLEKSIVEIPAGKL
EKGEDPRVTALRELEEETGYECEQMEWLISFATSPGFADEIIHLYVAKGLSKKENAAGLDEDEFVDLIELTLDEALQYIK
EKRIYDSKTVIAVQYLQLQEALKHK
>Q65G33 2.3.1.-~~~acuA~~~Acetoin utilization protein AcuA~~~COG0454
MEHHKTYHAKELQTEKGSVLIEGPISPEKLAEYEFHDELTAFRPSQKQHEALIEIAGLPEGRIIIARFRQTIVGYVTYVY
PDPLERWSEGNMENLIELGAIEVIPAFRGHSVGKTLLAVSMMDPQMEKYIIITTEYYWHWDLKGTNKDVWEYRKMMEKMM
NAGGLVWFATDDPEISSHPANCLMARIGKEVSQESIERFDRLRFHNRFMY
>P39065 2.3.1.-~~~acuA~~~Acetoin utilization protein AcuA~~~COG0454
MEHHKTYHSANIKTATGSLLIEGPVSPEDLAGYEFHKDLTAFRPPREQHEALVDIAGLPEGRIIIARDGRTIVGYVTYLY
PDPLERWSEGNMEDLIELGAIEVAPDYRGCAVGKTLLTVSMMDEQMENYIVMTTEYYWHWDLKGMKKDVWEYRKIMEKMM
NAGGLVWFATDEPEISSHPANCLMARIGKNVSQESIEQFDRLRFYHRYMY
>P64376 ~~~acuC~~~Acetoin utilization protein AcuC~~~
MQQHSSKTAYVYSDKLLQYRFHDQHPFNQMRLKLTTELLLNANLLSPEQIVQPRIATGDELMLIHKYDYVEAIKHASHGI
ISEDEAKKYGLNDEENGQFKHMHRHSATIVGGALTLADLIMSGKVLNGCHLGGGLHHAQPGRASGFCIYNDIAITAQYIA
KEYNQRVLIIDTDAHHGDGTQWSFYADNHVTTYSIHETGKFLFPGSGHYTERGEDIGYGHTVNVPLEPYTEDASFLECFK
LTVEPVVKSFKPDIILSVNGVDIHYRDPLTHLNCTLHSLYEIPYFVKYLADSYTNGKVIMFGGGGYNIWRVVPRAWSHVF
LSLIDQPIQSGYLPLEWINKWKHYSSELLPKRWEDRLNDYTYVPRTKEISEKNKKLALHIASWYESTRQ
>Q3J6K9 1.3.1.84~~~acuI~~~Acrylyl-CoA reductase AcuI~~~COG0604
MRAVLIEKSDDTQSVSVTELAEDQLPEGDVLVDVAYSTLNYKDALAITGKAPVVRRFPMVPGIDFTGTVAQSSHADFKPG
DRVILNGWGVGEKHWGGLAERARVRGDWLVPLPAPLDLRQAAMIGTAGYTAMLCVLALERHGVVPGNGEIVVSGAAGGVG
SVATTLLAAKGYEVAAVTGRASEAEYLRGLGAASVIDRNELTGKVRPLGQERWAGGIDVAGSTVLANMLSMMKYRGVVAA
CGLAAGMDLPASVAPFILRGMTLAGVDSVMCPKTDRLAAWARLASDLDPAKLEEMTTELPFSEVIETAPKFLDGTVRGRI
VIPVTP
>P26646 1.3.1.84~~~acuI~~~Probable acrylyl-CoA reductase AcuI~~~COG0604
MQALLLEQQDGKTLASVQTLDESRLPEGDVTVDVHWSSLNYKDALAITGKGKIIRNFPMIPGIDFAGTVRTSEDPRFHAG
QEVLLTGWGVGENHWGGLAEQARVKGDWLVAMPQGLDARKAMIIGTAGFTAMLCVMALEDAGVRPQDGEIVVTGASGGVG
STAVALLHKLGYQVVAVSGRESTHEYLKSLGASRVLPRDEFAESRPLEKQVWAGAIDTVGDKVLAKVLAQMNYGGCVAAC
GLAGGFTLPTTVMPFILRNVRLQGVDSVMTPPERRAQAWQRLVADLPESFYTQAAKEISLSEAPNFAEAIINNQIQGRTL
VKVN
>Q5LS56 1.3.1.84~~~acuI~~~Acrylyl-CoA reductase AcuI~~~COG0604
MFNALVVDKDEESGKTQAAVKQLSLTDLPVGEVTVAVEYSTVNYKDGLCIGPGGGLVRKYPHVPGIDFAGTVENSSDERY
KPGDKVVLTGWRVGEAHWGGYSQKANVRADWLVPLPEGLDTRQAMAVGTAGFTAMLAVMALEDHGLTPGHGPVLVTGAAG
GVGSVATAILAHLGYEVAAVTGRPETADYLTSLGATQIVARDEINETVKRPLESEIWAGCVDAVGGAMLARVLGQMKYGA
SVAAVGLAGGAGLPATVIPFLLRGVNLLGIDSVMQPYANRLRAWERIARDLPMDKLEAMIRPATLSDLPGLGADILKGQV
QGRVVVDVNA
>Q3J6K8 ~~~acuR~~~Transcriptional regulator AcuR~~~COG1309
MPLTDTPPSVPQKPRRGRPRGAPDASLAHQSLIRAGLEHLTEKGYSSVGVDEILKAARVPKGSFYHYFRNKADFGLALIE
AYDTYFARLLDQAFLDGSLAPLARLRLFTRMAEEGMARHGFRRGCLVGNLGQEMGALPDDFRAALIGVLETWQRRTAQLF
REAQACGELSADHDPDALAEAFWIGWEGAILRAKLELRPDPLHSFTRTFGRHFVTRTQE
>Q8RM04 6.4.1.6~~~acxA~~~Acetone carboxylase beta subunit~~~COG0145
MNVPVGHLRNVQVLGIDAGGTMTDTFFVDQDGDFVVGKAQSTPQNEALGLIASSEDGLANWGMSLHEALAQLQTGVYSGT
AMLNRVVQRKGLKCGLIVNRGMEDFHRMGRAVQSHLGYAYEDRIHLNTHRYDPPLVPRHLTRGVVERTDMIGTQVIPLRE
DTARDAARDLIAADAEGIVISLLHSYKNPENERRVRDIVLEEVEKSGKKIPVFASADYYPVRKETHRTNTTILEGYAAEP
SRQTLSKISNAFKERGTKFDFRVMATHGGTISWKAKELARTIVSGPIGGVIGAKYLGEVLGYKNIACSDIGGTSFDVALI
TQGEMTIKNDPDMARLVLSLPLVAMDSVGAGAGSFIRLDPYTRAIKLGPDSAGYRVGVCWKESGIETVTISDCHMVLGYL
NPDNFLGGAVKLDRQRSVDAIKAQIADPLGLSVEDAAAGVIELLDSDLRDYLRSMISGKGYSPASFVCFSYGGAGPVHTY
GYTEGLGFEDVIVPAWAAGFSAFGCAAADFEYRYDKSLDINMPTETPDTDKEKAAATLQAAWEELTKNVLEEFKLNGYSA
DQVTLQPGYRMQYRGQLNDLEIESPLAQAHTAADWDQLTDAFNATYGRVYAASARSPELGYSVTGAIMRGMVPIPKPKIP
KEPEEGETPPESAKIGTRKFYRKKRWVDAQLYHMESLRPGNRVMGPAVIESDATTFVVPDGFETWLDGHRLFHLREV
>Q8RM03 6.4.1.6~~~acxB~~~Acetone carboxylase alpha subunit~~~COG0146
MNVTVDQSTLAGATRGIVRGGETLKEHRDRLMAATKATGRYAGLKTLELREREPILYNKLFSRLRAGVVDARETAKKIAA
SPIVEQEGELCFTLYNAAGDSLLTSTGIIIHVGTMGAAIKYMIENNWEANPGVHDKDIFCNNDSLIGNVHPCDIHTIVPI
FWEGELIGWVGGVTHVIDTGAVGPGSMATGQVQRFGDGYSITCRKVGANDTLFRDWLHESQRMVRTTRYWMLDERTRIAG
CHMIRKLVEEVVAEEGIEAYWKFAYEAVEHGRLGLQARIKAMTIPGTYRQVGFVDVPYAHEDVRVPSDFAKLDTIMHAPC
EMTIRRDGTWRLDFEGSSRWGWHTYNAHQVSFTSGIWVMMTQTLIPSEMINDGAAYGTEFRLPKGTWMNPDDRRVAFSYS
WHFLVSAWTALWRGLSRSYFGRGYLEEVNAGNANTSNWLQGGGFNQYDEIHAVNSFECAANGTGATAVQDGLSHAAAIWN
PEGDMGDMEIWELAEPLVYLGRQIKASSGGSGKYRGGCGFESLRMVWNAKDWTMFFMGNGHISSDWGLMGGYPAASGYRF
AAHKTNLKELIASGAEIPLGGDTDPENPTWDAMLPDAQIKRDKQAITTEEMFSDYDLYLNYMRGGPGFGDPLDREPQAVA
DDINGGYVLERFAGEVYGVVVRKGADGQYGVDEAGTAAARAQIRKDRLAKSVPVSEWMKGEREKILAKDAGTQVRQMFAA
SFKLGPRFEKDFRTFWSLPDSWTLPEEEIGVPTYGSRYSMDISELPDVHTVQFVEE
>Q8RM02 6.4.1.6~~~acxC~~~Acetone carboxylase gamma subunit~~~COG4647
MAYTRSKIVDLVDGKIDPDTLHQMLSTPKDPERFVTYVEILQERMPWDDKIILPLGPKLFIVQQKVSKKWTVRCECGHDF
CDWKDNWKLSARVHVRDTPQKMEEIYPRLMAPTPSWQVIREYFCPECGTLHDVEAPTPWYPVIHDFSPDIEGFYQEWLGL
PVPERADA
>A0QWG5 2.3.1.265~~~patA~~~Phosphatidylinositol mannoside acyltransferase~~~COG1560
MTLSGRIPLGGQVTDLGYAAGWRLVRAMPEAMAQGVFGAGARYAARNGGPEQLRRNLARVVGKPPADVPDDLIRASLASY
ARYWREAFRLPAMDHGRLGEQLDVIDIDHLWSALDAGRGAVLALPHSGNWDMAGVWLVQNYGPFTTVAERLKPESLYRRF
VEYRESLGFEVLPLTGGERPPFEVLAERLTDNRPICLMAERDLTRSGVQVDFFGEATRMPAGPAKLAIETGAALFPVHCW
FEGDGWGMRVYPELDTSSGDVTAITQALADRFAANIATYPADWHMLQPQWIADLSDERRARLGT
>P9WMB5 2.3.1.265~~~patA~~~Phosphatidylinositol mannoside acyltransferase~~~COG1560
MIAGLKGLKLPKDPRSSVTRTATDWAYAAGWMAVRALPEFAVRNAFDTGARYFARHGGPEQLRKNLARVLGVPPAAVPDP
LMCASLESYGRYWREVFRLPTINHRKLARQLDRVIGGLDHLDAALAAGLGAVLALPHSGNWDMAGMWLVQRHGTFTTVAE
RLKPESLYQRFIDYRESLGFEVLPLSGGERPPFEVLSERLRNNRVVCLMAERDLTRTGVEVDFFGEPTRMPVGPAKLAVE
TGAALLPTHCWFEGRGWGFQVYPALDCTSGDVAAITQALADRFAQNIAAHPADWHMLQPQWLADLSESRRAQLRSR
>O35031 3.6.1.7~~~acyP~~~Acylphosphatase~~~COG1254
MLQYRIIVDGRVQGVGFRYFVQMEADKRKLAGWVKNRDDGRVEILAEGPENALQSFVEAVKNGSPFSKVTDISVTESRSL
EGHHRFSIVYS
>Q83AB0 3.6.1.7~~~acyP~~~Acylphosphatase~~~COG1254
MTQKEKNETCIHVTVSGKVQGVFFRESVRKKAEELQLTGWVKNLSHGDVELVACGERDSIMILTEWLWEGPPQAAVSNVN
WEEIVVEDYSDFRVR
>P0AB65 3.6.1.7~~~yccX~~~Acylphosphatase~~~COG1254
MSKVCIIAWVYGRVQGVGFRYTTQYEAKRLGLTGYAKNLDDGSVEVVACGEEGQVEKLMQWLKSGGPRSARVERVLSEPH
HPSGELTDFRIR
>P9WQC9 3.6.1.7~~~acyP~~~Acylphosphatase~~~COG1254
MSAPDVRLTAWVHGWVQGVGFRWWTRCRALELGLTGYAANHADGRVLVVAQGPRAACQKLLQLLQGDTTPGRVAKVVADW
SQSTEQITGFSER
>Q2FYM9 3.6.1.7~~~acyP~~~Acylphosphatase~~~COG1254
MRHIHLQVFGRVQGVGFRYFTQRIAMNYNIVGTVQNVDDYVEIYAQGDDADIERFIQGVIEGASPASNVTSHQLEELELN
QKLSDFRSI
>Q5SKS6 3.6.1.7~~~acyP~~~Acylphosphatase~~~COG1254
MPRLVALVKGRVQGVGYRAFAQKKALELGLSGYAENLPDGRVEVVAEGPKEALELFLHHLKQGPRLARVEAVEVQWGEEA
GLKGFHVY
>A5F8G9 3.6.1.7~~~acyP~~~Acylphosphatase~~~COG1254
MEKQCSKFIVSGHVQGVGFCYHTSHQGLKLGLTGYAKNLNNGDVEVVACGTPERLEELYLWLQEGPKTASVRQVRRLSSE
LEHDYQGFEIL
>P19219 2.1.1.n11~~~adaA~~~Bifunctional transcriptional activator/DNA repair enzyme AdaA~~~COG2169
MPDSINNGHKESHDHRISNDAEMITDEKWQAIINNDAAYNNQFFYAVKSTGIFCKPSCKSRVPKKENVCIFPNTEQALRA
NFRPCKRCKPTNEKMPDSEWVDLITEYIDKNFTEKLTLESLADICHGSPYHMHRTFKKIKGITLVEYIQQVRVHAAKKYL
IQTNKAIGDIAICVGIANAPYFITLFKKKTGQTPARFRQMSKMEETYNGNK
>P19220 2.1.1.63~~~adaB~~~Methylated-DNA--protein-cysteine methyltransferase, inducible~~~COG0350
METNKPTLYWSLLMFKDWNFYIASTLKGLVFVGSQNKPIEELFEWARKRFPGSLLVEDDDKLEPYAVEITQYLEGKRKNF
TVPVEYAGTQFQLAVWNALCEIPYGQTKSYSDIANDINKPAAVRAVGAAIGANPVLITVPCHRVIGKNGSLTGYRGGFEM
KTLLLDLEKRASSEMDVPH
>P0DTR4 3.5.1.-~~~~~~A type blood N-acetyl-alpha-D-galactosamine deacetylase~~~
MRNRRKAVSLLTGLLVTAQLFPTAALAADSSESALNKAPGYQDFPAYYSDSAHADDQVTHPDVVVLEEPWNGYRYWAVYT
PNVMRISIYENPSIVASSDGVHWVEPEGLSNPIEPQPPSTRYHNCDADMVYNAEYDAMMAYWNWADDQGGGVGAEVRLRI
SYDGVHWGVPVTYDEMTRVWSKPTSDAERQVADGEDDFITAIASPDRYDMLSPTIVYDDFRDVFILWANNTGDVGYQNGQ
ANFVEMRYSDDGITWGEPVRVNGFLGLDENGQQLAPWHQDVQYVPDLKEFVCISQCFAGRNPDGSVLHLTTSKDGVNWEQ
VGTKPLLSPGPDGSWDDFQIYRSSFYYEPGSSAGDGTMRVWYSALQKDTNNKMVADSSGNLTIQAKSEDDRIWRIGYAEN
SFVEMMRVLLDDPGYTTPALVSGNSLMLSAETTSLPTGDVMKLETSFAPVDTSDQVVKYTSSDPDVATVDEFGTITGVSV
GSARIMAETREGLSDDLEIAVVENPYTLIPQSNMTATATSVYGGTTEGPASNVLDGNVRTIWHTNYAPKDELPQSITVSF
DQPYTVGRFVYTPRQNGTNGIISEYELYAIHQDGSKDLVASGSDWALDAKDKTVSFAPVEAVGLELKAIAGAGGFGTAAE
LNVYAYGPIEPAPVYVPVDDRDASLVFTGAWNSDSNGSFYEGTARYTNEIGASVEFTFVGTAIRWYGQNDVNFGAAEVYV
DGVLAGEVNVYGPAAAQQLLFEADGLAYGKHTIRIVCVSPVVDFDYFSYVGE
>P06134 ~~~ada~~~Bifunctional transcriptional activator/DNA repair enzyme Ada~~~COG0350
MKKATCLTDDQRWQSVLARDPNADGEFVFAVRTTGIFCRPSCRARHALRENVSFYANASEALAAGFRPCKRCQPEKANAQ
QHRLDKITHACRLLEQETPVTLEALADQVAMSPFHLHRLFKATTGMTPKAWQQAWRARRLRESLAKGESVTTSILNAGFP
DSSSYYRKADETLGMTAKQFRHGGENLAVRYALADCELGRCLVAESERGICAILLGDDDATLISELQQMFPAADNAPADL
MFQQHVREVIASLNQRDTPLTLPLDIRGTAFQQQVWQALRTIPCGETVSYQQLANAIGKPKAVRAVASACAANKLAIIIP
CHRVVRGDGTLSGYRWGVSRKAQLLRREAENEER
>Q8CWN2 ~~~adcA~~~Zinc-binding lipoprotein AdcA~~~COG0803
MKKISLLLASLCALFLVACSNQKQADGKLNIVTTFYPVYEFTKQVAGDTANVELLIGAGTEPHEYEPSAKAVAKIQDADT
FVYENENMETWVPKLLDTLDKKKVKTIKATGDMLLLPGGEEEEGDHDHGEEGHHHEFDPHVWLSPVRAIKLVEHIRDSLS
ADYPDKKETFEKNAAAYIEKLQSLDKAYAEGLSQAKQKSFVTQHAAFNYLALDYGLKQVAISGLSPDAEPSAARLAELTE
YVKKNKIAYIYFEENASQALANTLSKEAGVKTDVLNPLESLTEEDTKAGENYISVMEKNLKALKQTTDQEGPAIEPEKAE
DTKTVQNGYFEDAAVKDRTLSDYAGNWQSVYPFLEDGTFDQVFDYKAKLTGKMTQAEYKAYYTKGYQTDVTKINITDNTM
EFVQGGQSKKYTYKYVGKKILTYKKGNRGVRFLFEATDADAGQFKYVQFSDHNIAPVKAEHFHIFFGGTSQETLFEEMDN
WPTYYPDNLSGQEIAQEMLAH
>Q04I02 ~~~adcR~~~Transcriptional regulator AdcR~~~COG1846
MRQLAKDINAFLNEVILQAENQHEILIGHCTSEVALTNTQEHILMLLSEESLTNSELARRLNVSQAAVTKAIKSLVKEGM
LETSKDSKDARVIFYQLTDLARPIAEEHHHHHEHTLLTYEQVATQFTPNEQKVIQRFLTALVGEIK
>Q5XEA3 ~~~adcR~~~Transcriptional regulator AdcR~~~
MGTLEKKLDNLVNTILLKAENQHELLFGACQSDVKLTNTQEHILMLLSQQRLTNTDLAKALNISQAAVTKAIKSLVKQDM
LAGTKDTVDARVTYFELTELAKPIASEHTHHHDETLNVYNRLLQKFSAKELEIVDKFVTVFAEELEG
>Q7NSA6 4.1.1.4~~~adc~~~Acetoacetate decarboxylase~~~COG4689
MKQQEVRQRAFAMPLTSPAFPPGPYRFVNREYMIITYRTDPAAIEAVLPEPLQMAEPVVRYEFIRMPDSTGFGDYSESGQ
VIPVTFRGERGSYTLAMFLDDQPPLAGGRELWGFPKKAGKPRLEVHQDTLVGSLDFGPVRIATGTMGYKYEALDRSALLA
SLAEPNFLLKIIPHVDGSPRICELVRYHTTDVAIKGAWSAPGSLELHPHALAPVAALPVLEVLSARHFVCDLTLDLGTVV
FDYLRQ
>P23670 4.1.1.4~~~adc~~~Acetoacetate decarboxylase~~~
MLKDEVIKQISTPLTSPAFPRGPYKFHNREYFNIVYRTDMDALRKVVPEPLEIDEPLVRFEIMAMHDTSGLGCYTESGQA
IPVSFNGVKGDYLHMMYLDNEPAIAVGRELSAYPKKLGYPKLFVDSDTLVGTLDYGKLRVATATMGYKHKALDANEAKDQ
ICRPNYMLKIIPNYDGSPRICELINAKITDVTVHEAWTGPTRLQLFDHAMAPLNDLPVKEIVSSSHILADIILPRAEVIY
DYLK
>Q9AK25 3.5.4.4~~~add1~~~Adenosine deaminase 1~~~COG1816
MTSRSTEKSAAANPAAVSKTPSPDRIRRAPKVLLHDHLDGGLRPGTIVELARETGYGDLPETDADLLGTWFRQAADSGSL
ERYLETFSHTVGVMQTRDALVRVAAECAEDLAEDGVVYAEVRYAPEQHLEKGLTLEEVVEAVNEGFREGERRARDNGHRI
RVGALLTAMRHAARSLEIAELANRYRDLGVVGFDIAGAEAGYPPTRHLDAFEYLKRENNHFTIHAGEAFGLPSIWQALQW
CGADRLGHGVRIIDDIQVHEDGSVKLGRLASYVRDKRIPLELCPSSNLQTGAADSYAEHPIGLLRRLHFRATVNTDNRLM
SHTSMSREFEHLVEAFGYTLDDMQWFSVNAMKSAFIPFDERLAMINDVIKPGYAELKSEWLFQQTASTSGSSESDG
>P23478 3.1.-.-~~~addA~~~ATP-dependent helicase/nuclease subunit A~~~COG1074
MNIPKPADSTWTDDQWNAIVSTGQDILVAAAAGSGKTAVLVERMIRKITAEENPIDVDRLLVVTFTNASAAEMKHRIAEA
LEKELVQRPGSLHIRRQLSLLNRASISTLHSFCLQVLKKYYYLIDLDPGFRIADQTEGELIGDEVLDELFEDEYAKGEKA
FFELVDRYTTDRHDLDLQFLVKQVYEYSRSHPNPEAWLESFVHLYDVSEKSAIEELPFYQYVKEDIAMVLNGAKEKLLRA
LELTKAPGGPAPRADNFLDDLAQIDELIQHQDDFSELYKRVPAVSFKRAKAVKGDEFDPALLDEATDLRNGAKKLLEKLK
TDYFTRSPEQHLKSLAEMKPVIETLVQLVISYGKRFEAAKQEKSIIDFSDLEHYCLAILTAENDKGEREPSEAARFYQEQ
FHEVLVDEYQDTNLVQESILQLVTSGPEETGNLFMVGDVKQSIYRFRLAEPLLFLSKYKRFTESGEGTGRKIDLNKNFRS
RADILDSTNFLFKQLMGGKIGEVDYDEQAELKLGAAYPDNDETETELLLIDNAEDTDASEEAEELETVQFEAKAIAKEIR
KLISSPFKVYDGKKKTHRNIQYRDIVILLRSMPWAPQIMEELRAQGIPVYANLTSGYFEAVEVAVALSVLKVIDNPYQDI
PLASVLRSPIVGADENELSLIRLENKKAPYYEAMKDYLAAGDRSDELYQKLNTFYGHLQKWRAFSKNHSVSELIWEVYRD
TKYMDYVGGMPGGKQRQANLRVLYDRARQYESTAFRGLFRFLRFIERMQERGDDLGTARALSEQEDVVRLMTIHSSKGLE
FPVVFVAGLGRNFNMMDLNKSYLLDKELGFGTKYIHPQLRISYPTLPLIAMKKKMRRELLSEELRVLYVALTRAKEKLFL
IGSCKDHQKQLAKWQASASQTDWLLPEFDRYQARTYLDFIGPALARHRDLGDLAGVPAHADISGHPARFAVQMIHSYDLL
DDDLEERMEEKSERLEAIRRGEPVPGSFAFDEKAREQLSWTYPHQEVTQIRTKQSVSEIKRKREYEDEYSGRAPVKPADG
SILYRRPAFMMKKGLTAAEKGTAMHTVMQHIPLSHVPSIEEAEQTVHRLYEKELLTEEQKDAIDIEEIVQFFHTEIGGQL
IGAKWKDREIPFSLALPAKEIYPDAHEADEPLLVQGIIDCLYETEDGLYLLDYKSDRIEGKFQHGFEGAAPILKKRYETQ
IQLYTKAVEQIAKTKVKGCALYFFDGGHILTL
>A2RH77 3.1.-.-~~~addA~~~ATP-dependent helicase/nuclease subunit A~~~COG1074
MSEVKLTPEQNEAIHSSGKNILVSASAGSGKTFVMAQRIVEKVKQGIEIDRLFISTFTKKAASELRMRLERDLKKARQES
SDDEEAHRLTLALQNLSNADIGTMDSFTQKLTKANFNRVNIDPNFRILADQTESDLIRQEVFEQLVESYLSADESLNISK
DKFEKLIKNFSKDRNILGFQKVVYTIYRFASATENPISWLENQFLKGFETYKSLTDLSEDFTVNVKENLLTFFELLENSL
TNGVIAKKGAGRDKANLILDNKNELLEAISKKDFVTCTALFLSIDTDIRVGSSKDEALSALKKDFSAQKQDLVGSKSKPG
ELRKFVDKIKHGQLIEKYQNQAFEIASDLQKFIIDFYKTYLERKKNENAFEYSDIAHFAIEILEENPDIRENLREHYDEI
MIDEYQDTSHTQERMLELLSNGHNLFMVGDIKQSIYGFRLADPGLFLEKYKSYDQAENPNQLIRLKENFRSRGEVLNFTN
DIFKHLMDEKLGEMTYGKEEALVQGNISDYPVEAEKDFYPELLLYKENTSEEEIEDSEVKISDGEIKGAAQEIKKLIEYG
VEPKDIAILVRSKSNNNKIEDILLSYDIPVVLDEGRVDFLKSMEVLIMLDVLRAIDNPLYDLSLVAMLRSPLFGFNEDEL
TRISVQGSRDLRFWDKILLSLKKEGKNPELINLSLEQKLKAFNQKFTEWRKLVNKIPIHRLLWKIYTETYYFDYVGALKN
GEMRQANLQALSVRAESYESSGYKGLFKFVRLINKFMEQNNDLASVNIKLPQNAVRVMTFHKSKGLEFDYVFLMNLQSRF
NDRDLKEDVILSREHGLGMKYIADLKAEPDVITDFPYALVKMETFPYMVNKDLKQRAALSEEMRVLYVAFTRAKKKLYLV
GKIKDTDKKAGLELYDAATLEGKILSDKFRNSSRGFQHWILALQNATKLPMKLNVYTKDELETEKLEFTSQPDFKKLVEE
SEKFDNIMSFSDEIKEAQKIMNYQYPHQAATELSSIQTPSQVKKRSYEKQLQVGEVQPVSEFVRVKNLDFSDFGPKKITA
AEMGSATHSFMQYADFSQADLFSFQATLDEMGFDEKIKNQIDITKILTLFDTEFGQFLSENVDKTVKEAPFSMLRTDEFA
KEQYIVRGICDGFVKLADKIILFDYKTDRFTNVSAISEIKERYKDQMNLYSEALQKAYHVNQIDKYLILLGGPRKVFVEK
IDD
>Q7A6H4 3.1.-.-~~~addA~~~ATP-dependent helicase/nuclease subunit A~~~
MTIPEKPQGVIWTDAQWQSIYATGQDVLVAAAAGSGKTAVLVERIIQKILRDGIDVDRLLVVTFTNLSAREMKHRVDQRI
QEASIADPANAHLKNQRIKIHQAQISTLHSFCLKLIQQHYDVLNIDPNFRTSSEAENILLLEQTIDEVIEQHYDILDPAF
IELTEQLSSDRSDDQFRMIIKQLYFFSVANPNPTNWLDQLVTPYEEEAQQAQLIQLLTDLSKVFITAAYDALNKAYDLFS
MMDGVDKHLAVIEDERRLMGRVLEGGFIDIPYLTDHEFGARLPNVTAKIKEANEMMVDALEDAKLQYKKYKSLIDKVKND
YFSREADDLKADMQQLAPRVKYLARIVKDVMSEFNRKKRSKNILDFSDYEHFALQILTNEDGSPSEIAESYRQHFQEILV
DEYQDTNRVQEKILSCIKTGDEHNGNLFMVGDVKQSIYKFRQADPSLFIEKYQRFTIDGDGTGRRIDLSQNFRSRKEVLS
TTNYIFKHMMDEQVGEVKYDEAAQLYYGAPYDESDHPVNLKVLVEADQEHSDLTGSEQEAHFIVEQVKDILEHQKVYDMK
TGSYRSATYKDIVILERSFGQARNLQQAFKNEDIPFHVNSREGYFEQTEVRLVLSFLRAIDNPLQDIYLVGLMRSVIYQF
KEDELAQIRILSPNDDYFYQSIVNYINDEAADAILVDKLKMFLSDIQSYQQYSKDHPVYQLIDKFYNDHYVIQYFSGLIG
GRGRRANLYGLFNKAIEFENSSFRGLYQFIRFIDELIERGKDFGEENVVGPNDNVVRMMTIHSSKGLEFPFVIYSGLSKD
FNKRDLKQPVILNQQFGLGMDYFDVDKEMAFPSLASVAYKAVAEKELVSEEMRLVYVALTRAKEQLYLIGRVKNDKSLLE
LEQLSISGEHIAVNERLTSPNPFHLIYSILSKHQSASIPDDLKFEKDIAQVEDSSRPNVNISIIYFEDVSTETILDNNEY
RSVNQLETMQNGNEDVKAQIKHQLDYQYPYVNDTKKPSKQSVSELKRQYETEESGTSYERVRQYRIGFSTYERPKFLSEQ
GKRKANEIGTLMHTVMQHLPFKKERISEVELHQYIDGLIDKHIIEADAKKDIRMDEIMTFINSELYSIIAEAEQVYRELP
FVVNQALVDQLPQGDEDVSIIQGMIDLIFVKDGVHYFVDYKTDAFNRRRGMTDEEIGTQLKNKYKIQMKYYQNTLQTILN
KEVKGYLYFFKFGTLQL
>P23477 3.1.-.-~~~addB~~~ATP-dependent helicase/deoxyribonuclease subunit B~~~COG3857
MGAEFLVGRSGSGKTKLIINSIQDELRRAPFGKPIIFLVPDQMTFLMEYELAKTPDMGGMIRAQVFSFSRLAWRVLQHTG
GMSRPFLTSTGVQMLLRKLIEEHKQEFKVYQKASDKSGFTAQVERMLTEFKRYCLEPEDIRRMAESGTASEYRGERVLSE
KLHDLSILYQQMEKSLADQYLHSEDYLTLLAEHIPLAEDIKGAHIYVDGFYQFTPQEFRVLEQLMVHAEHITFSLTADKP
SYEREPHELELFRMTGKTYYRLHQKAKELNLDITYKELSGTERHTKTPELAHLEAQYEARPAIPYAEKQEALTVMQAANR
RAELEGIAREIHALVREKGYRYKDVAILARQPEDYKDMVKEVFADYEIPYFIDGKASMLNHPLIEFIRSSLDVLKGNWRY
EAVFRCVKTELLFPLNEPKAKVREQVDQLENYCIAYGIKGDRWTKGDRFQYRRFVSLDDDFAQTDQEIEMENMLNDTRDW
IVPPLFQLQKRMKKAKTVQEKAEALYRYLEETDVPLKLDQERQRAEDDGRIIEAQQHQQAWDAVIQLLEEFVEMMGDDEI
SLDLFQQMIEAGAESLTFSLIPPALDQVFVGNMDLSRMYGTSCTFVLGANDGVLPARPDENGVLSDDDREWLKTIGVELS
SGGRERLLDEHFLIYMAFSSPSDRLYVSYPIADAEGKTLLPSMIVKRLEELFPHHKERLLTNEPEQVSDEEQLMYVVNKS
VAQSFTASQLRLWTREYDISDVWWSTYNVLMSEQDRLQSKKLFSSLFFRNEVKQLERSVSRQLYGERIQGSVSRMETFNA
CPFSHFASHGLHLKERQFFKLEAPDIGQLFHSSLKLISDRLREQKLDWRDLTKEQCELFSYDAVERLAPKLQKEILLSSN
RHYYVKEKLQKIVTRVSGILSEHAKASGFVPIGLELGFGGKGPLPPLTFQLKNGCTMELVGRIDRVDKAESSKGLLLRIV
DYKSSDKGLDLAEVYYGLALQMLTYLDLSITHSADWLGMRATPAGVLYFHIHDPMIQSNLPLGLDEIEQEIFKKFKMKGL
LLGDQEVVRLMDTTLQEGRSNIINAGLKKDGSLRSDSAAVGEKEFDLLTKHVRRTFQEAGEQITDGRVSIEPYKMKNKTP
CTYCAFKSVCQFDESLEENEYRPLKAEKDKTILEWIKKEADGNEHS
>A2RH76 3.1.-.-~~~rexB~~~ATP-dependent helicase/deoxyribonuclease subunit B~~~COG3857
MEILYTEITQDLTEGLLEIALEELEKNRKVYYIVPSSMSFEKEKEILERLAKGSDTAVFDLLVTRFKQLPYYFDKREKAT
MKTELGTVGLSMLFRRVLRSFKKDEIPLYFSLQDSAGFLEMLIQLRAELLTANLSVENLPDNPKNQELKKILAKFEAELS
VEYANYSEFGDFTNRLVDGEFDQQLKDVTIIIDGYTRFSAEEELFIESIQEKVARFVVGTYSDENSLTAGSETIYVGTSQ
MITRFRNKFPVELRKIASSAVNEVYSKLTRILDLDSRFVITDEKIELKAEDEKYFRIWEAENQKVEIERVAKEIRQKIIQ
GAFFKDFTVLVGDPAAYEITLKEVFDLYEIPFFYAQEESMSQHPLVIFFESLFAIKKNNYRTDDVVNLLKSKVYTDANLD
EEVIDYFEYYVQKYKISGRKKFTEEFIESEFSQIELVNEMREKLLGSESPLQVFLGNNRKKTGKKWVSDLQGLLENGNVM
TNMNAYFSAAELQNEHQMADKHEQVWQMLISTLNEFLAVFSDEKLKSVEFLDILLAGLKNAKYRQIPANVDVVNVKDYEL
VEPKTNKYIYAIGLSQTNFPRIKKNSTLLSDEERLEINQTTDENQFIEQLNVANYQKNQFTVLSLINSAKESLVLSMPQI
MANEQGEFSPVFQLFLKDADEKILQKIQGVNLFESLEHIGNSRSVIAMIGQIERELVESEETSEDKRVFWSSIFRILVKS
NADFQKILLDLAKDIDTVNLAPDTLEQIYGDKIYASVSSFERFYNCEYQYFLENTLSLETFENIDINSKIVGNFFHEVFE
KVMKETDLSAENFDEKLTLVLQEVDKNYSRYFTQDATARFTWSNLEEIVRQTATVLKATVSTDELKTLLTESSFGLPKSE
LGNFSVDDIYLRGRIDRLDQLSTDYLGAIDYKSSAHSFKLQEAYDGLSLQFMTYLDVIKQAFPNQKIWGALYLQFKNQPI
NLSEINQLSEIANILKESMRYEGLVLEDAAEQIKGIENIALKKTNIYNEEEFEQLLKLNEEHYRAAGQRLKKGKIAINPI
MKRSEGIDQSGNVRGCRYCPLKSICRFEANIHMNEHSREIGQKSQAEILAELKGEERDE
>O25046 3.5.4.4~~~~~~Adenosine deaminase~~~COG0402
MQEIIGASLVFLCNEKCEVLEDYGVVFDEKIVEIGDYQSLTLKYPHLKAQFFENSVLLPAFINAHTHFEFSNNKASFDYG
SFSGWLGSVLNNGGAILENCQGAIQNAISTQLKSGVGSVGAISNHLIEVNLLKESPLNAVVFLEFLGSSYSLEKLKAFEA
KFKELKDLEDKKLKAALAVHAPYSVQKDMALSVIQLAKDSQSLLSTHFLESLEELEWVENSKGWFENFYQHFLKESHFKS
LYKGANDYIDMFKDTHTLFVHNQFASLEALKRIKSQVKNAFLITCPFSNRLLSGQALDLERTKEAGLSVSVATDGLSSNI
SLSLLDELRAFLLTHNMPLLELAKIALLGATRHGAKALALNNGEIEANKRADLSVFGFNEKFTKEQAILQFLLHAKEVEC
LFLGGKRVI
>P22333 3.5.4.4~~~add~~~Adenosine deaminase~~~COG1816
MIDTTLPLTDIHRHLDGNIRPQTILELGRQYNISLPAQSLETLIPHVQVIANEPDLVSFLTKLDWGVKVLASLDACRRVA
FENIEDAARHGLHYVELRFSPGYMAMAHQLPVAGVVEAVIDGVREGCRTFGVQAKLIGIMSRTFGEAACQQELEAFLAHR
DQITALDLAGDELGFPGSLFLSHFNRARDAGWHITVHAGEAAGPESIWQAIRELGAERIGHGVKAIEDRALMDFLAEQQI
GIESCLTSNIQTSTVAELAAHPLKTFLEHGIRASINTDDPGVQGVDIIHEYTVAAPAAGLSREQIRQAQINGLEMAFLSA
EEKRALREKVAAK
>P63907 3.5.4.4~~~add~~~Adenosine deaminase~~~COG1816
MTAAPTLQTIRLAPKALLHDHLDGGLRPATVLDIAGQVGYDDLPATDVDALASWFRTQSHSGSLERYLEPFSHTVAVMQT
PEALYRVAFECAQDLAADSVVYAEVRFAPELHISCGLSFDDVVDTVLTGFAAGEKACAADGQPITVRCLVTAMRHAAMSR
EIAELAIRFRDKGVVGFDIAGAEAGHPPTRHLDAFEYMRDHNARFTIHAGEAFGLPSIHEAIAFCGADRLGHGVRIVDDI
DVDADGGFQLGRLAAILRDKRIPLELCPSSNVQTGAVASIAEHPFDLLARARFRVTVNTDNRLMSDTSMSLEMHRLVEAF
GYGWSDLARFTVNAMKSAFIPFDQRLAIIDEVIKPRFAALMGHSE
>Q8ZPL9 3.5.4.4~~~add~~~Adenosine deaminase~~~
MIDITLPLTDIHRHLDGNIRAQTILDLGRQFNIALPAQTLETLIPHVQVTSTEPDLVSFLTKLDWGVKVLASLDACRRVA
FENIEDAARNGLHYVELRFSPGYMAMAHQLPIAGVVEAVIDGVRDGCNTFGVEARLIGIMSRTFGEAACLQELDALLAHR
ENITALDLAGDELGFPGSLFLSHFNRARDAGWHITVHAGEAAGPESIWQAIRELGAERIGHGVKAVEDRALMDFLAQQRI
GIESCLTSNIQTSTVASLADHPLKTFLEHGVLASLNTDDPAVQGVDIIHEYHVAAPAAGLSREQIRQAQINGLEIAFLSD
SEKRALREKVAEA
>Q9KNI7 3.5.4.4~~~add~~~Adenosine deaminase~~~COG1816
MITSSLPLTDLHRHLDGNIRTQTILELGQKFGVKLPANTLQTLTPYVQIVEAEPSLVAFLSKLDWGVAVLGDLDACRRVA
YENVEDALNARIDYAELRFSPYYMAMKHSLPVTGVVEAVVDGVRAGVRDFGIQANLIGIMSRTFGTDACQQELDAILSQK
NHIVAVDLAGDELGQPGDRFIQHFKQVRDAGLHVTVHAGEAAGPESMWQAIRDLGATRIGHGVKAIHDPKLMDYLAQHRI
GIESCLTSNLQTSTVDSLATHPLKRFLEHGILACINTDDPAVEGIELPYEYEVAAPQAGLSQEQIRQAQLNGLELAFLSD
SEKKALLAKAALRG
>Q7CUX4 3.5.4.2~~~ade2~~~Adenine deaminase 2~~~COG1001
MTAQIRLAEPADLNDDTLRARAVAAARGDQRFDVLITGGTLVDVVTGELRPADIGIVGALIASVHEPASRRDAAQVIDAG
GAYVSPGLIDTHMHIESSMITPAAYAAAVVARGVTTIVWDPHEFGNVHGVDGVRWAAKAIENLPLRAILLAPSCVPSAPG
LERGGADFDAAILADLLSWPEIGGIAEIMNMRGVIERDPRMSGIVQAGLAAEKLVCGHARGLKNADLNAFMAAGVSSDHE
LVSGEDLMAKLRAGLTIELRGSHDHLLPEFVAALNTLGHLPQTVTLCTDDVFPDDLLQGGGLDDVVRRLVRYGLKPEWAL
RAATLNAAQRLGRSDLGLIAAGRRADIVVFEDLNGFSARHVLASGRAVAEGGRMLVDIPTCDTTVLKGSMKLPLRMANDF
LVKSQGAKVRLATIDRPRFTQWGETEADVKDGFVVPPEGATMISVTHRHGMAEPTTKTGFLTGWGRWNGAFATTVSHDSH
NLTVFGGNAGDMALAANAVIGTGGGMAVASEGKVTAILPLPLSGLVSDAPLEEVARAFEDLREAVGKVVEWQPPYLVFKA
CFGATLACNIGPHQTDMGIADVLTGKVMESPVIEVLG
>P39761 3.5.4.2~~~adeC~~~Adenine deaminase~~~COG1001
MNKEALVNRLNASAKRQKADIVIKNGKIMDVYNQEWIYEDIAITDGVIVGLGEYEGENIIDAEGQMIVPGFIDGHVHIES
SMVTPIEFAKAVLPHGVTTVVTDPHEIANVSGEKGIEFMLEQARHTPLNIHFMLPSSVPAASFERSGAILKAADLKPFYE
EEEVLGLAEVMDYVSVQQAEKDMVQKLLDARVAGKRIDGHLAGLSTDLINIYRTAFVLNDHEVTSKEEALDRIRRGMYVM
MREGSVAKNTLNVLPAVNEKNARRFFFCTDDKHVDDLLSEGSVNHQVKMAIQAGLNPFLAYQLGSLNAAECYGLDTKGAI
APGFDADLLFVSDLENVTVTMTMVKGQTVAEDSKAVYQDHASTAAPDQALLDSVKLAAPLNKQDFHMPIDSEQQINVIQI
IPNQLETRLVQVPAPVAREFEPDTELDLLKIAVVERHKGLKETGLGVVKGFGFKSGAIATTISHDSHNIIAVGTNDEDIA
AAVNKLQEIGGGLTIIKNGEELHSVPLPIAGLLSDQSAEQVNQSLLTLHDKLSLIGFTGGFNPFLTLSFLALPVIPDIKM
TTTGLFDVKSFQHISLQ
>Q72EX7 3.5.4.2~~~ade~~~Adenine deaminase~~~COG1001
MTYRPLLNDLVDMAAGRAPVDLVVRNARIVDVFSQRIVEAPLAIGGGRFLGFFEAEAHATLDAEGRYLLPGLIDGHVHIE
SSLVSPAQFARLVLARGTTAVIADPHEIANVCGLAGLRYMLDATRDLPLDVRLALPSCVPATPFENAGAVLDAAALATLM
DDPRVAGLGEMMNFPGVLAGDADVLDKIALALDRGKTVDGHSPGLAGRDLATYAAARIATDHECTTVDEMHERIALGMYV
LLREGSAARDMARLAPGITPGNARRCVFCTDDRQPADILRDGHIDNHLRIAVSHGVDPVTAVTIATLNAAECFGLRDRGA
VAPGRVADFVLVDDLTGFAVRKVYAAGRLVARDGAVVVDLPDHADPAVRDTVNIRPLDDTAFRLPLPTGLARVIGLQPHS
LLTDALERDVPRDASGCFTPGDGLVKLAVVERHKATGNVGVGIIEGYGLRGGAVATTVAHDSHNIVVAGDNDADMLVAVR
ELERTGGGITLCAGGRVLASLPLPVAGLMSDRPATEVSATFAQMLSIAHETLHISRDIEPFMTLSFLTLPVIPALKLTDR
GLFDVRTFSFTTVGV
>P31441 3.5.4.2~~~ade~~~Adenine deaminase~~~COG1001
MNNSINHKFHHISRAEYQELLAVSRGDAVADYIIDNVSILDLINGGEISGPIVIKGRYIAGVGAEYTDAPALQRIDARGA
TAVPGFIDAHLHIESSMMTPVTFETATLPRGLTTVICDPHEIVNVMGEAGFAWFARCAEQARQNQYLQVSSCVPALEGCD
VNGASFTLEQMLAWRDHPQVTGLAEMMDYPGVISGQNALLDKLDAFRHLTLDGHCPGLGGKELNAYITAGIENCHESYQL
EEGRRKLQLGMSLMIREGSAARNLNALAPLINEFNSPQCMLCTDDRNPWEIAHEGHIDALIRRLIEQHNVPLHVAYRVAS
WSTARHFGLNHLGLLAPGKQADIVLLSDARKVTVQQVLVKGEPIDAQTLQAEESARLAQSAPPYGNTIARQPVSASDFAL
QFTPGKRYRVIDVIHNELITHSHSSVYSENGFDRDDVSFIAVLERYGQRLAPACGLLGGFGLNEGALAATVSHDSHNIVV
IGRSAEEMALAVNQVIQDGGGLCVVRNGQVQSHLPLPIAGLMSTDTAQSLAEQIDALKAAARECGPLPDEPFIQMAFLSL
PVIPALKLTSQGLFDGEKFAFTTLEVTE
>P31466 ~~~adeP~~~Adenine permease AdeP~~~COG2252
MSHQHTTQTSGQGMLERVFKLREHGTTARTEVIAGFTTFLTMVYIVFVNPQILGVAGMDTSAVFVTTCLIAAFGSIMMGL
FANLPVALAPAMGLNAFFAFVVVQAMGLPWQVGMGAIFWGAIGLLLLTIFRVRYWMIANIPVSLRVGITSGIGLFIGMMG
LKNAGVIVANPETLVSIGNLTSHSVLLGILGFFIIAILASRNIHAAVLVSIVVTTLLGWMLGDVHYNGIVSAPPSVMTVV
GHVDLAGSFNLGLAGVIFSFMLVNLFDSSGTLIGVTDKAGLADEKGKFPRMKQALYVDSISSVTGSFIGTSSVTAYIESS
SGVSVGGRTGLTAVVVGLLFLLVIFLSPLAGMVPGYAAAGALIYVGVLMTSSLARVNWQDLTESVPAFITAVMMPFSFSI
TEGIALGFISYCVMKIGTGRLRDLSPCVIIVALLFILKIVFIDAH
>P31440 ~~~adeQ~~~Adenine permease AdeQ~~~COG2252
MNNDNTDYVSNESGTLSRLFKLPQHGTTVRTELIAGMTTFLTMVYIVFVNPQILGAAQMDPKVVFVTTCLIAGIGSIAMG
IFANLPVALAPAMGLNAFFAFVVVGAMGISWQTGMGAIFWGAVGLFLLTLFRIRYWMISNIPLSLRIGITSGIGLFIALM
GLKNTGVIVANKDTLVMIGDLSSHGVLLGILGFFIITVLSSRHFHAAVLVSIVVTSCCGLFFGDVHFSGVYSIPPDISGV
IGEVDLSGALTLELAGIIFSFMLINLFDSSGTLIGVTDKAGLIDGNGKFPNMNKALYVDSVSSVAGAFIGTSSVTAYIES
TSGVAVGGRTGLTAVVVGVMFLLVMFFSPLVAIVPPYATAGALIFVGVLMTSSLARVNWDDFTESVPAFITTVMMPFTFS
ITEGIALGFMSYCIMKVCTGRWRDLNLCVVVVAALFALKIILVD
>P71073 ~~~adeR~~~DNA-binding transcriptional activator AdeR~~~COG2508
MSKRNQARKVGRFMTMPNDPFKYSFDRLEDVADHISDVLRCPITIEDVNHKLLAYSTHSDCTDPARTSTIIGRRVPEKVI
NKLWKDGTIPALLKTDQPIRVKQIDEVGLSNRVAISIWKNKQVLGFIWALEIQKTLSDEDLLTLQMAAKAVKNKLLKLQI
RKTKNEERSQEFFWKMLTGHIHQEDDMADGFHKLGMAAPSEFSVMIIRINGELTEKIEQQLQYLQETTQQVYVLLATVDS
NELIILTSPKTDHPFQDLKQFALSTQKQLKERYKIEDVSIAFGGIYNSISFVSRSYQEALSVLKTKERFAEETKHLFSFS
ELGIYQYLDVLNEKRKQAGHYNYSLSKLEQYDRDHQSNMVETLERFIEADSNVNTASKLLNIHVNTLNYRLKRISQIAEI
DLKNVNQKFTIYLDIKLRHMDL
>Q9I6Y4 3.5.4.2~~~~~~Adenine deaminase~~~
MYEWLNALPKAELHLHLEGTLEPELLFALAERNRIALPWNDVETLRKAYAFNNLQEFLDLYYAGADVLRTEQDFYDLTWA
YLQKCKAQNVVHVEPFFDPQTHTDRGIPFEVVLAGIRAALRDGEKLLGIRHGLILSFLRHLSEEQAQKTLDQALPFRDAF
IAVGLDSSEVGHPPSKFQRVFDRARSEGFLTVAHAGEEGPPEYIWEALDLLKVERIDHGVRAFEDERLMRRLIDEQIPLT
VCPLSNTKLCVFDDMSQHTILDMLERGVKVTVNSDDPAYFGGYVTENFHALQQSLGMTEEQARRLAQNSLDARLVK
>P12311 1.1.1.1~~~adhT~~~Alcohol dehydrogenase~~~
MKAAVVEQFKKPLQVKEVEKPKISYGEVLVRIKACGVCHTDLHAAHGDWPVKPKLPLIPGHEGVGVIEEVGPGVTHLKVG
DRVGIPWLYSACGHCDYCLSGQETLCERQQNAGYSVDGGYAEYCRAAADYVVKIPDNLSFEEAAPIFCAGVTTYKALKVT
GAKPGEWVAIYGIGGLGHVAVQYAKAMGLNVVAVDLGDEKLELAKQLGADLVVNPKHDDAAQWIKEKVGGVHATVVTAVS
KAAFESAYKSIRRGGACVLVGLPPEEIPIPIFDTVLNGVKIIGSIVGTRKDLQEALQFAAEGKVKTIVEVQPLENINDVF
DRMLKGQINGRVVLKVD
>A4IP64 1.1.1.192~~~adh1~~~Long-chain-alcohol dehydrogenase 1~~~COG1454
MSVARIVFPPLSHVGWGALDQLVPEVKRLGAKHILVITDPMLVKIGLVDQVTSPLRQEGYSVHVYTDVVPEPPLETGEKA
VAFARDGKFDLVIGVGGGSALDLAKLAAVLAVHDGSVADYLNLTGTRTLEKKGLPKILIPTTSGTGSEVTNISVLSLETT
KDVVTHDYLLADVAIVDPQLTVSVPPRVTAATGIDALTHAVEAYVSVNASPTSDGLAVAAIRLISRSLRKAVANGSDKQA
RIDMANGSYLAGLAFFNAGVAGVHALAYPLGGQFHIAHGESNAVLLPYVMGYIRQSCTKRMADIFNALGGNSSFLSEVEA
SYRCVEELERFVADVGIPKTLGGFGIPESALESLTKDAVQQKRLLARSPLPLLEADIRAIYEAAFAGTIVEPHKA
>P20368 1.1.1.1~~~adhA~~~Alcohol dehydrogenase 1~~~COG1064
MKAAVITKDHTIEVKDTKLRPLKYGEALLEMEYCGVCHTDLHVKNGDFGDETGRITGHEGIGIVKQVGEGVTSLKVGDRA
SVAWFFKGCGHCEYCVSGNETLCRNVENAGYTVDGAMAEECIVVADYSVKVPDGLDPAVASSITCAGVTTYKAVKVSQIQ
PGQWLAIYGLGGLGNLALQYAKNVFNAKVIAIDVNDEQLAFAKELGADMVINPKNEDAAKIIQEKVGGAHATVVTAVAKS
AFNSAVEAIRAGGRVVAVGLPPEKMDLSIPRLVLDGIEVLGSLVGTREDLKEAFQFAAEGKVKPKVTKRKVEEINQIFDE
MEHGKFTGRMVVDFTHH
>P42327 1.1.1.1~~~adh~~~Alcohol dehydrogenase~~~
MKAAVVNEFKKALEIKEVERPKLEEGEVLVKIEACGVCHTDLHAAHGDWPIKPKLPLIPGHEGVGIVVEVAKGVKSIKVG
DRVGIPWLYSACGECEYCLTGQETLCPHQLNGGYSVDGGYAEYCKAPADYVAKIPDNLDPVEVAPILCAGVTTYKALKVS
GARPGEWVAIYGIGGLGHIALQYAKAMGLNVVAVDISDEKSKLAKDLGADIAINGLKEDPVKAIHDQVGGVHAAISVAVN
KKAFEQAYQSVKRGGTLVVVGLPNADLPIPIFDTVLNGVSVKGSIVGTRKDMQEALDFAARGKVRPIVETAELEEINEVF
ERMEKGKINGRIVLKLKED
>A4ISB9 1.1.1.192~~~adh2~~~Long-chain-alcohol dehydrogenase 2~~~COG1979
MQNFTFRNPTKLIFGRGQIEQLKEEVPKYGKKVLLVYGGGSIKRNGLYDEVMSLLTDIGAEVVELPGVEPNPRLSTVKKG
VDICRREGIEFLLAVGGGSVIDCTKAIAAGAKFDGDPWEFITKKATVTEALPFGTVLTLAATGSEMNAGSVITNWETKEK
YGWGSPVTFPQFSILDPTYTMTVPKDHTVYGIVDMMSHVFEQYFHHTPNTPLQDRMCEAVLKTVIEAAPKLVDDLENYEL
RETIMYSGTIALNGFLQMGVRGDWATHDIEHAVSAVYDIPHAGGLAILFPNWMKHVLDENVSRFAQLAVRVFDVDPTGKT
ERDVALEGIERLRAFWSSLGAPSRLADYGIGEENLELMADKAMAFGEFGRFKTLNRDDVLAILRASL
>P0DJA2 1.1.1.1~~~adhB~~~Alcohol dehydrogenase 2~~~COG1454
MASSTFYIPFVNEMGEGSLEKAIKDLNGSGFKNALIVSDAFMNKSGVVKQVADLLKAQGINSAVYDGVMPNPTVTAVLEG
LKILKDNNSDFVISLGGGSPHDCAKAIALVATNGGEVKDYEGIDKSKKPALPLMSINTTAGTASEMTRFCIITDEVRHVK
MAIVDRHVTPMVSVNDPLLMVGMPKGLTAATGMDALTHAFEAYSSTAATPITDACALKAASMIAKNLKTACDNGKDMPAR
EAMAYAQFLAGMAFNNASLGYVHAMAHQLGGYYNLPHGVCNAVLLPHVLAYNASVVAGRLKDVGVAMGLDIANLGDKEGA
EATIQAVRDLAASIGIPANLTELGAKKEDVPLLADHALKDACALTNPRQGDQKEVEELFLSAF
>P42328 1.1.1.1~~~~~~Alcohol dehydrogenase~~~
MKAAVVEQFKEPLKIKEVEKPTISYGEVLVRIKACGVCHTDLHAAHGDWPVKPKLPLIPGHEGVGIVEEVGPGVTHLKVG
DRVGIPWLYSACGHCDYCLSGQETLCEHQKNAGYSVDGGYAEYCRAAADYVVKIPDNLSFEEAAPIFCAGVTTYKALKVT
GAKPGEWVAIYGIGGLGHVAVQYAKAMGLNVVAVDIGDEKLELAKELGADLVVNPLKEDAAKFMKEKVGGVHAAVVTAVS
KPAFQSAYNSIRRGGACVLVGLPPEEMPIPIFDTVLNGIKIIGSIVGTRKDLQEALQFAAEGKVKTIIEVQPLEKINEVF
DRMLKGQINGRVVLTLEDK
>P18278 1.1.5.5~~~adhA~~~Alcohol dehydrogenase (quinone), dehydrogenase subunit~~~
MTRPASAKRRSLLGILAAGTICAAALPYAAVPARADGQGNTGEAIIHADDHPENWLSYGRTYSEQRYSPLDQINRSNVGD
LKLLGYYTLDTNRGQEATPLVVDGIMYATTNWSKMEALDAATGKLLWQYDPKVPGNIADKGCCDTVNRGAGYWNGKVFWG
TFDGRLVAADAKTGKKVWAVNTIPADASLGKQRSYTVDGAVRVAKGLVLIGNGGAEFGARGFVSAFDAETGKLKWRFYTV
PNNKNEPDHAASDNILMNKAYKTWGPKGAWVRQGGGGTVWDSLVYDPVSDLIYLAVGNGSPWNYKYRSEGIGSNLFLGSI
VALKPETGEYVWHFQATPMDQWDYTSVQQIMTLDMPVKGEMRHVIVHAPKNGFFYVLDAKTGEFLSGKNYVYQNWANGLD
PLTGRPMYNPDGLYTLNGKFWYGIPGPLGAHNFMAMAYSPKTHLVYIPAHQIPFGYKNQVGGFKPHADSWNVGLDMTKNG
LPDTPEARTAYIKDLHGWLLAWDPVKMETVWKIDHKGPWNGGILATGGDLLFQGLANGEFHAYDATNGSDLYKFDAQSGI
IAPPMTYSVNGKQYVAVEVGWGGIYPISMGGVGRTSGWTVNHSYIAAFSLDGKAKLPALNNRGFLPVKPPAQYDQKVVDN
GYFQYQTYCQTCHGDNGEGAGMLPDLRWAGAIRHQDAFYNVVGRGALTAYGMDRFDTSMTPDEIEAIRQYLIKRANDTYQ
REVDARKNDKNIPENPTLGINP
>C0SPA5 1.1.1.-~~~adhA~~~Probable formaldehyde dehydrogenase AdhA~~~COG1064
MCNQHQTRVLSVSHAKAKFEQTTIERRGLRPHDVLIDIKFSGICHSDIHSAFDEWGGGIFPMVPGHEIAGVVTAVGTKVT
KLAVGDRVGVGCFVDSCGECEYCLNAEEQFCTKGVVQTYNSVDYDGNPTYGGYSQKIVVTDRFVVRIPDRLEMDVASPLL
CAGITTYSPLKHWNVGPGKKVAIVGVGGLGHLAIQFAHAMGAEVTVLSRSMNKKEEALELGANHYFATSDPATFTALAGR
FDVILNTVSANLDVDAYLSMLRIDGTLVSVGAPAKPDTYSVFSLIMGRRSIAGSLVGGIQETQEMLDFAAEHGIEPKIEV
IGADQVDEAYERILRSDVRYRFVIDISTL
>Q04944 1.1.1.-~~~bdhA~~~NADH-dependent butanol dehydrogenase A~~~COG1979
MLSFDYSIPTKVFFGKGKIDVIGEEIKKYGSRVLIVYGGGSIKRNGIYDRATAILKENNIAFYELSGVEPNPRITTVKKG
IEICRENNVDLVLAIGGGSAIDCSKVIAAGVYYDGDTWDMVKDPSKITKVLPIASILTLSATGSEMDQIAVISNMETNEK
LGVGHDDMRPKFSVLDPTYTFTVPKNQTAAGTADIMSHTFESYFSGVEGAYVQDGIAEAILRTCIKYGKIAMEKTDDYEA
RANLMWASSLAINGLLSLGKDRKWSCHPMEHELSAYYDITHGVGLAILTPNWMEYILNDDTLHKFVSYGINVWGIDKNKD
NYEIAREAIKNTREYFNSLGIPSKLREVGIGKDKLELMAKQAVRNSGGTIGSLRPINAEDVLEIFKKSY
>O05542 1.1.5.5~~~adhA~~~Alcohol dehydrogenase (quinone), dehydrogenase subunit~~~COG2010
MTSGLLTPIKVTKKRLLSCAAALAFSAAVPVAFAQEDTGTAITSSDNGGHPGDWLSYGRSYSEQRYSPLDQINTENVGKL
KLAWHYDLDTNRGQEGTPLIVNGVMYATTNWSKMKALDAATGKLLWSYDPKVPGNIADRGCCDTVSRGAAYWNGKVYFGT
FDGRLIALDAKTGKLVWSVYTIPKEAQLGHQRSYTVDGAPRIAKGKVLIGNGGAEFGARGFVSAFDAETGKLDWRFFTVP
NPENKPDGAASDDILMSKAYPTWGKNGAWKQQGGGGTVWDSLVYDPVTDLVYLGVGNGSPWNYKFRSEGKGDNLFLGSIV
AINPDTGKYVWHFQETPMDEWDYTSVQQIMTLDMPVNGEMRHVIVHAPKNGFFYIIDAKTGKFITGKPYTYENWANGLDP
VTGRPNYVPDALWTLTGKPWLGIPGELGGHNFAAMAYSPKTKLVYIPAQQIPLLYDGQKGGFKAYHDAWNLGLDMNKIGL
FDDNDPEHVAAKKDFLKVLKGWTVAWDPEKMAPAFTINHKGPWNGGLLATAGNVIFQGLANGEFHAYDATNGNDLYSFPA
QSAIIAPPVTYTANGKQYVAVEVGWGGIYPFLYGGVARTSGWTVNHSRVIAFSLDGKDSLPPKNELGFTPVKPVPTYDEA
RQKDGYFMYQTFCSACHGDNAISGGVLPDLRWSGAPRGRESFYKLVGRGALTAYGMDRFDTSMTPEQIEDIRNFIVKRAN
ESYDDEVKARENSTGVPNDQFLNVPQSTADVPTADHP
>P28036 1.1.5.5~~~adhA~~~Alcohol dehydrogenase (quinone), dehydrogenase subunit~~~
MISAVFGKRRSLSRTLTAGTICAALISGYATMASADDGQGATGEAIIHADDHPGNWMTYGRTYSDQRYSPLDQINRSNVG
NLKLAWYLDLDTNRGQEGTPLVIDGVMYATTNWSMMKAVDAATGKLLWSYDPRVPGNIADKGCCDTVNRGAAYWNGKVYF
GTFDGRLIALDAKTGKLVWSVNTIPPEAELGKQRSYTVDGAPRIAKGRVIIGNGGSEFGARGFVSAFDAETGKVDWRFFT
VPNPKNEPDAASDSVLMNKAYQTWSPTGAWTRQGGGGTVWDSIVYDPVADLVYLGVGNGSPWNYKYRSEGKGDNLFLGSI
VALKPETGEYVWHFQETPMDQWDFTSDQQIMTLDLPINGETRHVIVHARKNGFFYIIDAKTGEFISGKNYVYVNWASGLD
PKTGRPIYNPDALYTLTGKEWYGIPGDLGGHNFAAMAFSPKTGLVYIPAQQVPFLYTNQVGGFTPHPDSWNLGLDMNKVG
IPDSPEAKQAFVKDLKGWIVAWDPQKQAEAWRVDHKGPWNGGILATGGDLLFQGLANGEFHAYDATNGSDLFHFAADSGI
IAPPVTYLANGKQYVAVEVGWGGIYPFFLGGLARTSGWTVNHSRIIAFSLDGKSGPLPKQNDQGFLPVKPPAQFDSKRTD
NGYFQFQTYCAACHGDNAEGAGVLPDLRWSGSIRHEDAFYNVVGRGALTAYGMDRLHGNMNPTEIEDIRQFLIKRANETY
QREVDARKNADGIPEQLP
>P9WQC1 1.1.1.1~~~adhA~~~Probable alcohol dehydrogenase AdhA~~~COG1064
MVSPATTATMSAWQVRRPGPMDTGPLERVTTRVPRPAPSELLVAVHACGVCRTDLHVTEGDLPVHRERVIPGHEVVGEVI
EVGSAVGAAAGGEFDRGDRVGIAWLRHTCGVCKYCRRGSENLCPQSRYTGWDADGGYAEFTTVPAAFAHHLPSGYSDSEL
APLLCAGIIGYRSLLRTELPPGGRLGLYGFGGSAHITAQVALAQGAEIHVMTRGARARKLALQLGAASAQDAADRPPVPL
DAAILFAPVGDLVLPALEALDRGGILAIAGIHLTDIPDLNYQQHLFQERQIRSVTSNTRADARAFFDFAAQHHIEVTTPE
YPLGQADRALGDLSAGRIAGAAVLLI
>P74721 1.1.1.2~~~adhA~~~Aldehyde reductase AdhA~~~COG1064
MIKAYAALEANGKLQPFEYDPGALGANEVEIEVQYCGVCHSDLSMINNEWGISNYPLVPGHEVVGTVAAMGEGVNHVEVG
DLVGLGWHSGYCMTCHSCLSGYHNLCATAESTIVGHYGGFGDRVRAKGVSVVKLPKGIDLASAGPLFCGGITVFSPMVEL
SLKPTAKVAVIGIGGLGHLAVQFLRAWGCEVTAFTSSARKQTEVLELGAHHILDSTNPEAIASAEGKFDYIISTVNLKLD
WNLYISTLAPQGHFHFVGVVLEPLDLNLFPLLMGQRSVSASPVGSPATIATMLDFAVRHDIKPVVEQFSFDQINEAIAHL
ESGKAHYRVVLSHSKN
>Q9F282 1.1.1.2~~~adhA~~~Long-chain primary alcohol dehydrogenase AdhA~~~
MWETKINPNKVFELRCKNTTYFGIGSIKKIKDILEVLKNKGINNVILVTGKGSYKASGAWDVVKPALETLGFKYSLYDKV
GPNPTVDMIDEAAKIGRETGAKAVIGIGGGSPIDTAKSVAVLLEYTDKNARELYEQKFIPEKAAPIIAINLTHGTGTEVD
RFAVATIPEKNYKPAIAYDCLYPMYAIDDPSLMTKLDKKQTIAVTIDALNHVTEAATTLVASPYSVLMAKETVRLIVRYL
PAAVNDPENLVARYYLLYASALAGISFDNGLLHLTHALEHPLSAVKPEIAHGLGLGAILPAVVKAIYPSVAEVLAEVYSP
IVPGLKGLPAEAEYVAKKVEEWLFKVGCTQKLSDFGFTKEDIPTLVRLAKTTPSLDGLLSNAPVEATEAVIAKIYEESF
>Q04945 1.1.1.-~~~bdhB~~~NADH-dependent butanol dehydrogenase B~~~COG1979
MVDFEYSIPTRIFFGKDKINVLGRELKKYGSKVLIVYGGGSIKRNGIYDKAVSILEKNSIKFYELAGVEPNPRVTTVEKG
VKICRENGVEVVLAIGGGSAIDCAKVIAAACEYDGNPWDIVLDGSKIKRVLPIASILTIAATGSEMDTWAVINNMDTNEK
LIAAHPDMAPKFSILDPTYTYTVPTNQTAAGTADIMSHIFEVYFSNTKTAYLQDRMAEALLRTCIKYGGIALEKPDDYEA
RANLMWASSLAINGLLTYGKDTNWSVHLMEHELSAYYDITHGVGLAILTPNWMEYILNNDTVYKFVEYGVNVWGIDKEKN
HYDIAHQAIQKTRDYFVNVLGLPSRLRDVGIEEEKLDIMAKESVKLTGGTIGNLRPVNASEVLQIFKKSV
>Q47945 1.1.5.5~~~adhB~~~Alcohol dehydrogenase (quinone), cytochrome c subunit~~~COG2010
MLNALTRDRLVSEMKQGWKLAAAIGLMAVSFGAAHAQDADEALIKRGEYVARLSDCIACHTALHGQPYAGGLEIKSPIGT
IYSTNITPDPEHGIGNYTLEDFTKALRKGIRKDGATVYPAMPYPEFARLSDDDIRAMYAFFMHGVKPVALQNKAPDISWP
LSMRWPLGMWRAMFVPSMTPGVDKSISDPEVARGEYLVNGPGHCGECHTPRGFGMQVKAYGTAGGNAYLAGGAPIDNWIA
PSLRSNSDTGLGRWSEDDIVTFLKSGRIDHSAVFGGMADVVAYSTQHWSDDDLRATAKYLKSMPAVPEGKNLGQDDGQTT
ALLNKGGQGNAGAEVYLHNCAICHMNDGTGVNRMFPPLAGNPVVITDDPTSLANVVAFGGILPPTNSAPSAVAMPGFKNH
LSDQEMADVVNFMRKGWGNNAPGTVSASDIQKLRTTGAPVSTAGWNVSSKGWMAYMPQPYGEDWTFSPQTHTGVDDAQ
>P0A388 1.1.5.5~~~adhB~~~Alcohol dehydrogenase (quinone), cytochrome c subunit~~~
MINRLKVTFSAAAFSLLAGTALAQTPDADSALVQKGAYVARLGDCVACHTALHGQSYAGGLEIKSPIGTIYSTNITPDPT
YGIGRYTFAEFDEAVRHGIRKDGSTLYPAMPYPSFSRMTKEDMQALYAYFMHGVKPVAQPDKQPDISWPLSMRWPLGIWR
MMFSPSPKDFTPAPGTDPEIARGDYLVTGPGHCGACHTPRGFAMQEKALDAAGGPDFLSGGAPIDNWVAPSLRNDPVVGL
GRWSEDDIYTFLKSGRIDHSAVFGGMGDVVAWSTQYFTDDDLHAIAKYLKSLPPVPPSQGNYTYDPSTANMLASGNTASV
PGADTYVKECAICHRNDGGGVARMFPPLAGNPVVVTENPTSLVNVIAHGGVLPPSNWAPSAVAMPGYSKSLSAQQIADVV
NFIRTSWGNKAPGTVTAADVTKLRDTGAPVSSSGWNSVSSGWSVFLPQPYGSGWTFAPQTHTGQDAAQ
>P9WQC7 1.1.1.1~~~adhB~~~Alcohol dehydrogenase B~~~COG1062
MKTKGALIWEFNQPWSVEEIEIGDPRKDEVKIQMEAAGMCRSDHHLVTGDIPMAGFPVLGGHEGAGIVTEVGPGVDDFAP
GDHVVLAFIPSCGKCPSCQAGMRNLCDLGAGLLAGESVTDGSFRIQARGQNVYPMTLLGTFSPYMVVHRSSVVKIDPSVP
FEVACLVGCGVTTGYGSAVRTADVRPGDDVAIVGLGGVGMAALQGAVSAGARYVFAVEPVEWKRDQALKFGATHVYPDIN
AALMGIAEVTYGLMAQKVIITVGKLDGADVDSYLTITAKGGTCVLTAIGSLVDTQVTLNLAMLTLLQKNIQGTIFGGGNP
HYDIPKLLSMYKAGKLNLDDMVTTAYKLEQINDGYQDMLNGKNIRGVIRYTDDDR
>P0CH36 1.1.1.2~~~adhc1~~~NADP-dependent alcohol dehydrogenase C 1~~~COG1064
MSTVSAYAATSATEPLTKTTITRRAVGPHDVAFDIHFAGICHSDIHTVKAEWGVPNYPVVPGHEIAGVVTEVGSEVTKYK
VGDRVGVGCFVDSCRECDNCKAGLEQYCTGTGMVGTYNAIDRDGTPTHGGYSGAIVVDENYVLRIPDSLPLDAAAPLLCA
GITTYSPLRHWNAGPGKKVAVIGLGGLGHVAVKLAKAMGADVTVLSQSLKKMEDGLRLGASAYYATSDPETFDKLAGSFD
LILNTVSANLDLGAYLGLLKLDGALVELGLPEHPMEVPAFPLLAQRRNLTGSMIGGIPETQEMLDFCAEHDVRPEIEIIT
PDYINEAYERVLASDVRYRFVIDTASLRS
>P0CH37 1.1.1.2~~~adhC2~~~NADP-dependent alcohol dehydrogenase C 2~~~
MSTVSAYAATSATEPLTKTTITRRAVGPHDVAFDIHFAGICHSDIHTVKAEWGVPNYPVVPGHEIAGVVTEVGSEVTKYK
VGDRVGVGCFVDSCRECDNCKAGLEQYCTGTGMVGTYNAIDRDGTPTHGGYSGAIVVDENYVLRIPDSLPLDAAAPLLCA
GITTYSPLRHWNAGPGKKVAVIGLGGLGHVAVKLAKAMGADVTVLSQSLKKMEDGLRLGASAYYATSDPETFDKLAGSFD
LILNTVSANLDLGAYLGLLKLDGALVELGLPEHPMEVPAFPLLAQRRNLTGSMIGGIPETQEMLDFCAEHDVRPEIEIIT
PDYINEAYERVLASDVRYRFVIDTASLRS
>P9WQC5 1.1.1.2~~~adhC~~~NADP-dependent alcohol dehydrogenase C~~~COG1064
MSTVAAYAAMSATEPLTKTTITRRDPGPHDVAIDIKFAGICHSDIHTVKAEWGQPNYPVVPGHEIAGVVTAVGSEVTKYR
QGDRVGVGCFVDSCRECNSCTRGIEQYCKPGANFTYNSIGKDGQPTQGGYSEAIVVDENYVLRIPDVLPLDVAAPLLCAG
ITLYSPLRHWNAGANTRVAIIGLGGLGHMGVKLGAAMGADVTVLSQSLKKMEDGLRLGAKSYYATADPDTFRKLRGGFDL
ILNTVSANLDLGQYLNLLDVDGTLVELGIPEHPMAVPAFALALMRRSLAGSNIGGIAETQEMLNFCAEHGVTPEIELIEP
DYINDAYERVLASDVRYRFVIDISAL
>P9WQB9 1.1.1.1~~~adhD~~~Putative alcohol dehydrogenase D~~~COG1062
MKTTAAVLFEAGKPFELMELDLDGPGPGEVLVKYTAAGLCHSDLHLTDGDLPPRFPIVGGHEGSGVIEEVGAGVTRVKPG
DHVVCSFIPNCGTCRYCCTGRQNLCDMGATILEGCMPDGSFRFHSQGTDFGAMCMLGTFAERATVSQHSVVKVDDWLPLE
TAVLVGCGVPSGWGTAVNAGNLRAGDTAVIYGVGGLGINAVQGATAAGCKYVVVVDPVAFKRETALKFGATHAFADAASA
AAKVDELTWGQGADAALILVGTVDDEVVSAATAVIGKGGTVVITGLADPAKLTVHVSGTDLTLHEKTIKGSLFGSCNPQY
DIVRLLRLYDAGQLMLDELVTTTYNLEQVNQGYQDLRDGKNIRGVIVH
>E5Y379 1.2.1.10~~~adhE~~~Acetaldehyde dehydrogenase (acetylating)~~~COG1012
MDVRQQDVERIVVEVLKKMMSDQPTAAATTVVAASGCDCGDFGLFDRLEDAVQAAEAAQKKISTVAMRDKIIAAIRKAGL
ENAKAFAEIAHNETGMGRVSDKIAKNILVCERTPGTECLSPMAISGDMGLTLIENAPWGVIASVTPSTNPTATVINNAIS
MIAGGNSVIFAPHPNAKRASQTAIQVLNKAIIEATGVANLLVAVKEPTIEVAQELFSHPRIKLLVVTGGEAVVAQARKVA
TMRLIAAGAGNPPVVVDETANIARAARSIYDGASFDNNIICADEKEIIAVDSIADQLKAEMKAIGAVEISLEQADAVARV
VLRNYPQVEGGKAPNPNPKWVGRDAALIAKAAGIDVPDSCRLLIVDVKRDINHVFARVEQLMPVIPLLRAANVDEAIEWA
LILERGLSHTAGMHSRNIDNMDKMARAMNTSLFVKNGPHLAALGAGGEGWTTMTISTPTGEGVTCARSFVRLRRCCVVDN
FRIV
>P33744 ~~~adhE~~~Aldehyde-alcohol dehydrogenase~~~
MKVTTVKELDEKLKVIKEAQKKFSCYSQEMVDEIFRNAAMAAIDARIELAKAAVLETGMGLVEDKVIKNHFAGEYIYNKY
KDEKTCGIIERNEPYGITKIAEPIGVVAAIIPVTNPTSTTIFKSLISLKTRNGIFFSPHPRAKKSTILAAKTILDAAVKS
GAPENIIGWIDEPSIELTQYLMQKADITLATGGPSLVKSAYSSGKPAIGVGPGNTPVIIDESAHIKMAVSSIILSKTYDN
GVICASEQSVIVLKSIYNKVKDEFQERGAYIIKKNELDKVREVIFKDGSVNPKIVGQSAYTIAAMAGIKVPKTTRILIGE
VTSLGEEEPFAHEKLSPVLAMYEADNFDDALKKAVTLINLGGLGHTSGIYADEIKARDKIDRFSSAMKTVRTFVNIPTSQ
GASGDLYNFRIPPSFTLGCGFWGGNSVSENVGPKHLLNIKTVAERRENMLWFRVPHKVYFKFGCLQFALKDLKDLKKKRA
FIVTDSDPYNLNYVDSIIKILEHLDIDFKVFNKVGREADLKTIKKATEEMSSFMPDTIIALGGTPEMSSAKLMWVLYEHP
EVKFEDLAIKFMDIRKRIYTFPKLGKKAMLVAITTSAGSGSEVTPFALVTDNNTGNKYMLADYEMTPNMAIVDAELMMKM
PKGLTAYSGIDALVNSIEAYTSVYASEYTNGLALEAIRLIFKYLPEAYKNGRTNEKAREKMAHASTMAGMASANAFLGLC
HSMAIKLSSEHNIPSGIANALLIEEVIKFNAVDNPVKQAPCPQYKYPNTIFRYARIADYIKLGGNTDEEKVDLLINKIHE
LKKALNIPTSIKDAGVLEENFYSSLDRISELALDDQCTGANPRFPLTSEIKEMYINCFKKQP
>P0A9Q8 ~~~adhE~~~Bifunctional aldehyde-alcohol dehydrogenase AdhE~~~COG1012
MAVTNVAELNALVERVKKAQREYASFTQEQVDKIFRAAALAAADARIPLAKMAVAESGMGIVEDKVIKNHFASEYIYNAY
KDEKTCGVLSEDDTFGTITIAEPIGIICGIVPTTNPTSTAIFKSLISLKTRNAIIFSPHPRAKDATNKAADIVLQAAIAA
GAPKDLIGWIDQPSVELSNALMHHPDINLILATGGPGMVKAAYSSGKPAIGVGAGNTPVVIDETADIKRAVASVLMSKTF
DNGVICASEQSVVVVDSVYDAVRERFATHGGYLLQGKELKAVQDVILKNGALNAAIVGQPAYKIAELAGFSVPENTKILI
GEVTVVDESEPFAHEKLSPTLAMYRAKDFEDAVEKAEKLVAMGGIGHTSCLYTDQDNQPARVSYFGQKMKTARILINTPA
SQGGIGDLYNFKLAPSLTLGCGSWGGNSISENVGPKHLINKKTVAKRAENMLWHKLPKSIYFRRGSLPIALDEVITDGHK
RALIVTDRFLFNNGYADQITSVLKAAGVETEVFFEVEADPTLSIVRKGAELANSFKPDVIIALGGGSPMDAAKIMWVMYE
HPETHFEELALRFMDIRKRIYKFPKMGVKAKMIAVTTTSGTGSEVTPFAVVTDDATGQKYPLADYALTPDMAIVDANLVM
DMPKSLCAFGGLDAVTHAMEAYVSVLASEFSDGQALQALKLLKEYLPASYHEGSKNPVARERVHSAATIAGIAFANAFLG
VCHSMAHKLGSQFHIPHGLANALLICNVIRYNANDNPTKQTAFSQYDRPQARRRYAEIADHLGLSAPGDRTAAKIEKLLA
WLETLKAELGIPKSIREAGVQEADFLANVDKLSEDAFDDQCTGANPRYPLISELKQILLDTYYGRDYVEGETAAKKEAAP
AKAEKKAKKSA
>P0A9Q7 ~~~adhE~~~Bifunctional aldehyde-alcohol dehydrogenase AdhE~~~COG1012
MAVTNVAELNALVERVKKAQREYASFTQEQVDKIFRAAALAAADARIPLAKMAVAESGMGIVEDKVIKNHFASEYIYNAY
KDEKTCGVLSEDDTFGTITIAEPIGIICGIVPTTNPTSTAIFKSLISLKTRNAIIFSPHPRAKDATNKAADIVLQAAIAA
GAPKDLIGWIDQPSVELSNALMHHPDINLILATGGPGMVKAAYSSGKPAIGVGAGNTPVVIDETADIKRAVASVLMSKTF
DNGVICASEQSVVVVDSVYDAVRERFATHGGYLLQGKELKAVQDVILKNGALNAAIVGQPAYKIAELAGFSVPENTKILI
GEVTVVDESEPFAHEKLSPTLAMYRAKDFEDAVEKAEKLVAMGGIGHTSCLYTDQDNQPARVSYFGQKMKTARILINTPA
SQGGIGDLYNFKLAPSLTLGCGSWGGNSISENVGPKHLINKKTVAKRAENMLWHKLPKSIYFRRGSLPIALDEVITDGHK
RALIVTDRFLFNNGYADQITSVLKAAGVETEVFFEVEADPTLSIVRKGAELANSFKPDVIIALGGGSPMDAAKIMWVMYE
HPETHFEELALRFMDIRKRIYKFPKMGVKAKMIAVTTTSGTGSEVTPFAVVTDDATGQKYPLADYALTPDMAIVDANLVM
DMPKSLCAFGGLDAVTHAMEAYVSVLASEFSDGQALQALKLLKEYLPASYHEGSKNPVARERVHSAATIAGIAFANAFLG
VCHSMAHKLGSQFHIPHGLANALLICNVIRYNANDNPTKQTAFSQYDRPQARRRYAEIADHLGLSAPGDRTAAKIEKLLA
WLETLKAELGIPKSIREAGVQEADFLANVDKLSEDAFDDQCTGANPRYPLISELKQILLDTYYGRDYVEGETAAKKEAAP
AKAEKKAKKSA
>A0A0H2ZM56 ~~~adhE~~~Aldehyde-alcohol dehydrogenase~~~COG1012
MADKKTVTPEEKKLVAEKHVDELVQKALVALEEMRKLNQEQVDYIVAKASVAALDAHGELALHAFEETGRGVFEDKATKN
LFACEHVVNNMRHTKTVGVIEEDDVTGLTLIAEPVGVVCGITPTTNPTSTAIFKSLISLKTRNPIVFAFHPSAQESSAHA
ARIVRDAAIAAGAPENCVQWITQPSMEATSALMNHEGVATILATGGNAMVKAAYSCGKPALGVGAGNVPAYVEKSANIRQ
AAHDIVMSKSFDNGMVCASEQAVIIDKEIYDEFVAEFKSYHTYFVNKKEKALLEEFCFGVKANSKNCAGAKLNADIVGKP
ATWIAEQAGFTVPEGTNILAAECKEVGENEPLTREKLSPVIAVLKSESREDGITKARQMVEFNGLGHSAAIHTADEELTK
EFGKAVKAIRVICNSPSTFGGIGDVYNAFLPSLTLGCGSYGRNSVGDNVSAINLLNIKKVGRRRNNMQWMKLPSKTYFER
DSIQYLQKCRDVERVMIVTDHAMVELGFLDRIIEQLDLRRNKVVYQIFADVEPDPDITTVNRGTEIMRAFKPDTIIALGG
GSPMDAAKVMWLFYEQPEVDFRDLVQKFMDIRKRAFKFPLLGKKTKFIAIPTTSGTGSEVTPFAVISDKANNRKYPIADY
SLTPTVAIVDPALVLTVPGFVAADTGMDVLTHATEAYVSQMASDYTDGLALQAIKLVFENLESSVKNADFHSREKMHNAS
TIAGMAFANAFLGISHSMAHKIGAQFHTIHGRTNAILLPYVIRYNGTRPAKTATWPKYNYYRADEKYQDIARMLGLPAST
PEEGVESYAKAVYELGERIGIQMNFRDQGIDEKEWKEHSRELAFLAYEDQCSPANPRLPMVDHMQEIIEDAYYGYKERPG
RRK
>P80175 1.1.99.36~~~~~~NDMA-dependent alcohol dehydrogenase~~~
MKTKAAVLHSAGKPFEIEELELDGPREGEVLIKYTAAGLCHSDLHLIDNDLVPRFPIVGGHEGAGVIEDVGPGVTKVKPG
DHVVCSFIPNCGTCRYCATGRSNLCDMGATILDGGMPDGSFRFHRGGTDYGAMCMLGTFSERATISQHSVVKVDDWLPLE
TAVLVGCGVPTGWASANYAGGVRAGDTCVVYGIGGIGINAVQGAAHAGAANVIAVDPVAFKREKALELGATHAFASADEA
AAKVAELTWGQMADQALITVGTVVEQVVTDAFNVIGKGGTVVITGLANPEKLTVHLSGGVMTLFEKTVKGTLFGSANPQY
DIVRLLRLYQAGHVKLDELVTKRYSLEEVNEGYQDLRDGKNIRGVIMHSAD
>P39451 1.1.1.1~~~adhP~~~Alcohol dehydrogenase, propanol-preferring~~~COG1064
MKAAVVTKDHHVDVTYKTLRSLKHGEALLKMECCGVCHTDLHVKNGDFGDKTGVILGHEGIGVVAEVGPGVTSLKPGDRA
SVAWFYEGCGHCEYCNSGNETLCRSVKNAGYSVDGGMAEECIVVADYAVKVPDGLDSAAASSITCAGVTTYKAVKLSKIR
PGQWIAIYGLGGLGNLALQYAKNVFNAKVIAIDVNDEQLKLATEMGADLAINSHTEDAAKIVQEKTGGAHAAVVTAVAKA
AFNSAVDAVRAGGRVVAVGLPPESMSLDIPRLVLDGIEVVGSLVGTRQDLTEAFQFAAEGKVVPKVALRPLADINTIFTE
MEEGKIRGRMVIDFRH
>O06008 ~~~adhR~~~HTH-type transcriptional regulator AdhR~~~COG0789
MNIAQVAKQFGLTAATLRYYERVGLIPPVKRKDSGIRDYDEEDIKWIEFIKCMRNAGLSIEALIEYTTLFTEGDRTVEAR
KNILADERQRLIEKRKEIDETIKRLDTKIKDYDGKLRENEAKLKSRPKTESLHGSVEQRR
>O05544 ~~~adhS~~~Alcohol dehydrogenase, 15 kDa subunit~~~
MFRRIVPVLGLALGLGLASQAAMAQEQSPPPPPAVQGTPGKDFTGVSPANLAGIMNYCVEQQYVSYDEGNPVLYGLSEKY
KATEQTVGNFDYALGTAGYFDSNGKRFYLVAYTNEDDRRAACHAAVKAAQPML
>P25984 1.1.1.80~~~adh~~~NADP-dependent isopropanol dehydrogenase~~~
MKGFAMLGINKLGWIEKERPVAGSYDAIVRPLAVSPCTSDIHTVFEGALGDRKNMILGHEAVGEVVEVGSEVKDFKPGDR
VIVPCTTPDWRSLEVQAGFQQHSNGMLAGWKFSNFKDGVFGEYFHVNDADMNLAILPKDMPLENAVMITDMMTTGFHGAE
LADIQMGSSVVVIGIGAVGLMGIAGAKLRGAGRIIGVGSRPICVEAAKFYGATDILNYKNGHIVDQVMKLTNGKGVDRVI
MAGGGSETLSQAVSMVKPGGIISNINYHGSGDALLIPRVEWGCGMAHKTIKGGLCPGGRLRAEMLRDMVVYNRVDLSKLV
THVYHGFDHIEEALLLMKDKPKDLIKAVVIL
>Q0KDL6 1.1.1.1~~~adh~~~Alcohol dehydrogenase~~~COG1063
MTAMMKAAVFVEPGRIELADKPIPDIGPNDALVRITTTTICGTDVHILKGEYPVAKGLTVGHEPVGIIEKLGSAVTGYRE
GQRVIAGAICPNFNSYAAQDGVASQDGSYLMASGQCGCHGYKATAGWRFGNMIDGTQAEYVLVPDAQANLTPIPDGLTDE
QVLMCPDIMSTGFKGAENANIRIGDTVAVFAQGPIGLCATAGARLCGATTIIAIDGNDHRLEIARKMGADVVLNFRNCDV
VDEVMKLTGGRGVDASIEALGTQATFEQSLRVLKPGGTLSSLGVYSSDLTIPLSAFAAGLGDHKINTALCPGGKERMRRL
INVIESGRVDLGALVTHQYRLDDIVAAYDLFANQRDGVLKIAIKPH
>Q8GIX7 1.1.1.1~~~adh~~~Alcohol dehydrogenase~~~
MKAAVLHEFGQSLQIEEVDIPTPGAGEIVVKMQASGVCHTDLHAVEGDWPVKPSPPFIPGHEGVGLITAVGEGVTHVKEG
DRVGVAWLYSACGHCTHCLGGWETLCESQQNSGYSVNGSFAEYVLANANYVGIIPESVDSIEIAPVLCAGVTVYKGLKMT
DTKPGDWVVISGIGGLGHMAVQYAIAMGLNVAAVDIDDDKLAFAKKLGAKVTVNAKNTDPAEYLQKEIGGAHGALVTAVS
AKAFDQALSMLRRGGTLVCNGLPPGDFPVSIFDTVLNGITIRGSIVGTRLDLQESLDMAAAGKVKATVTAEPLENINDIF
ERMRQGKIEGRIVIDYTM
>P9WQC3 1.1.1.1~~~adh~~~Probable alcohol dehydrogenase adh~~~COG1063
MSDGAVVRALVLEAPRRLVVRQYRLPRIGDDDALVRVEACGLCGTDHEQYTGELAGGFAFVPGHETVGTIAAIGPRAEQR
WGVSAGDRVAVEVFQSCRQCANCRGGEYRRCVRHGLADMYGFIPVDREPGLWGGYAEYQYLAPDSMVLRVAGDLSPEVAT
LFNPLGAGIRWGVTIPETKPGDVVAVLGPGIRGLCAAAAAKGAGAGFVMVTGLGPRDADRLALAAQFGADLAVDVAIDDP
VAALTEQTGGLADVVVDVTAKAPAAFAQAIALARPAGTVVVAGTRGVGSGAPGFSPDVVVFKELRVLGALGVDATAYRAA
LDLLVSGRYPFASLPRRCVRLEGAEDLLATMAGERDGVPPIHGVLTP
>Q7A742 1.1.1.1~~~adh~~~Alcohol dehydrogenase~~~
MRAAVVTKDHKVSIEDKKLRALKPGEALVQTEYCGVCHTDLHVKNADFGDVTGVTLGHEGIGKVIEVAEDVESLKIGDRV
SIAWMFESCGRCEYCTTGRETLCRSVKNAGYTVDGAMAEQVIVTADYAVKVPEKLDPAAASSITCAGVTTYKAVKVSNVK
PGQWLGVFGIGGLGNLALQYAKNVMGAKIVAFDINDDKLAFAKELGADAIINSKDVDPVAEVMKLTDNKGLDATVVTSVA
KTPFNQAVDVVKAGARVVAVGLPVDKMNLDIPRLVLDGIEVVGSLVGTRQDLREAFEFAAENKVTPKVQLRKLEEINDIF
EEMEKGTITGRMVIKF
>P14941 1.1.1.80~~~adh~~~NADP-dependent isopropanol dehydrogenase~~~
MKGFAMLSIGKVGWIEKEKPAPGPFDAIVRPLAVAPCTSDIHTVFEGAIGERHNMILGHEAVGEVVEVGSEVKDFKPGDR
VVVPAITPDWRTSEVQRGYHQHSGGMLAGWKFSNVKDGVFGEFFHVNDADMNLAHLPKEIPLEAAVMIPDMMTTGFHGAE
LADIELGATVAVLGIGPVGLMAVAGAKLRGAGRIIAVGSRPVCVDAAKYYGATDIVNYKDGPIESQIMNLTEGKGVDAAI
IAGGNADIMATAVKIVKPGGTIANVNYFGEGEVLPVPRLEWGCGMAHKTIKGGLCPGGRLRMERLIDLVFYKRVDPSKLV
THVFRGFDNIEKAFMLMKDKPKDLIKPVVILA
>P28629 4.1.1.19~~~adiA~~~Biodegradative arginine decarboxylase~~~COG1982
MKVLIVESEFLHQDTWVGNAVERLADALSQQNVTVIKSTSFDDGFAILSSNEAIDCLMFSYQMEHPDEHQNVRQLIGKLH
ERQQNVPVFLLGDREKALAAMDRDLLELVDEFAWILEDTADFIAGRAVAAMTRYRQQLLPPLFSALMKYSDIHEYSWAAP
GHQGGVGFTKTPAGRFYHDYYGENLFRTDMGIERTSLGSLLDHTGAFGESEKYAARVFGADRSWSVVVGTSGSNRTIMQA
CMTDNDVVVVDRNCHKSIEQGLMLTGAKPVYMVPSRNRYGIIGPIYPQEMQPETLQKKISESPLTKDKAGQKPSYCVVTN
CTYDGVCYNAKEAQDLLEKTSDRLHFDEAWYGYARFNPIYADHYAMRGEPGDHNGPTVFATHSTHKLLNALSQASYIHVR
EGRGAINFSRFNQAYMMHATTSPLYAICASNDVAVSMMDGNSGLSLTQEVIDEAVDFRQAMARLYKEFTADGSWFFKPWN
KEVVTDPQTGKTYDFADAPTKLLTTVQDCWVMHPGESWHGFKDIPDNWSMLDPIKVSILAPGMGEDGELEETGVPAALVT
AWLGRHGIVPTRTTDFQIMFLFSMGVTRGKWGTLVNTLCSFKRHYDANTPLAQVMPELVEQYPDTYANMGIHDLGDTMFA
WLKENNPGARLNEAYSGLPVAEVTPREAYNAIVDNNVELVSIENLPGRIAANSVIPYPPGIPMLLSGENFGDKNSPQVSY
LRSLQSWDHHFPGFEHETEGTEIIDGIYHVMCVKA
>P60063 ~~~adiC~~~Arginine/agmatine antiporter~~~COG0531
MSSDADAHKVGLIPVTLMVSGNIMGSGVFLLPANLASTGGIAIYGWLVTIIGALGLSMVYAKMSFLDPSPGGSYAYARRC
FGPFLGYQTNVLYWLACWIGNIAMVVIGVGYLSYFFPILKDPLVLTITCVVVLWIFVLLNIVGPKMITRVQAVATVLALI
PIVGIAVFGWFWFRGETYMAAWNVSGLGTFGAIQSTLNVTLWSFIGVESASVAAGVVKNPKRNVPIATIGGVLIAAVCYV
LSTTAIMGMIPNAALRVSASPFGDAARMALGDTAGAIVSFCAAAGCLGSLGGWTLLAGQTAKAAADDGLFPPIFARVNKA
GTPVAGLIIVGILMTIFQLSSISPNATKEFGLVSSVSVIFTLVPYLYTCAALLLLGHGHFGKARPAYLAVTTIAFLYCIW
AVVGSGAKEVMWSFVTLMVITAMYALNYNRLHKNPYPLDAPISKD
>P60061 ~~~adiC~~~Arginine/agmatine antiporter~~~COG0531
MSSDADAHKVGLIPVTLMVSGNIMGSGVFLLPANLASTGGIAIYGWLVTIIGALGLSMVYAKMSFLDPSPGGSYAYARRC
FGPFLGYQTNVLYWLACWIGNIAMVVIGVGYLSYFFPILKDPLVLTITCVVVLWIFVLLNIVGPKMITRVQAVATVLALI
PIVGIAVFGWFWFRGETYMAAWNVSGLGTFGAIQSTLNVTLWSFIGVESASVAAGVVKNPKRNVPIATIGGVLIAAVCYV
LSTTAIMGMIPNAALRVSASPFGDAARMALGDTAGAIVSFCAAAGCLGSLGGWTLLAGQTAKAAADDGLFPPIFARVNKA
GTPVAGLIIVGILMTIFQLSSISPNATKEFGLVSSVSVIFTLVPYLYTCAALLLLGHGHFGKARPAYLAVTTIAFLYCIW
AVVGSGAKEVMWSFVTLMVITAMYALNYNRLHKNPYPLDAPISKD
>P60066 ~~~adiC~~~Arginine/agmatine antiporter~~~
MSSDADAHKVGLIPVTLMVSGNIMGSGVFLLPANLAATGGIAIYGWLVTIIGALALSMVYAKMSSLDPSPGGSYAYARRC
FGPFLGYQTNVLYWLACWIGNIAMVVIGVGYLSYFFPILKDPLVLTLTCVAVLWIFVLLNIVGPKMITRVQAVATVLALV
PIVGIAVFGWFWFKGETYMAAWNVSGMNTFGAIQSTLNVTLWSFIGVESASVAAGVVKNPKRNVPIATIGGVLIAAVCYV
LSTTAIMGMIPNAALRVSASPFGDAARMALGDTAGAIVSFCAAAGCLGSLGGWTLLAGQTAKAAADDGLFPPIFARVNKA
GTPVAGLLIVGVLMTIFQFSSMSPNAAKEFGLVSSVSVIFTLVPYLYTCAALLLLGHGHFGKARPLYLLITFVAFVYCIW
AVIGSGAKEVMWSFVTLMVITALYALNYNRIHKNPYPLDAPVKQD
>P33234 ~~~adiY~~~HTH-type transcriptional regulator AdiY~~~COG2207
MRICSDQPCIVLLTEKDVWIRVNGKEPISLKANHMALLNCENNIIDVSSLNNTLVAHISHDIIKDYLRFLNKDLSQIPVW
QRSATPILTLPCLTPDVFRVAAQHSMMPAETESEKERTRALLFTVLSRFLDSKKFVSLMMYMLRNCVSDSVYQIIESDIH
KDWNLSMVASCLCLSPSLLKKKLKSENTSYSQIITTCRMRYAVNELMMDGKNISQVSQSCGYNSTSYFISVFKDFYGMTP
LHYVSQHRERTVA
>P0DV33 ~~~admX~~~HTH-type transcriptional regulator AdmX~~~
MKLRHLEIFYTVMTCGSLSRAAESLNISQPAASKSLKNAELKLGFKLFQRVRGKLLPSREALELFEKAQGIYQDLSNLRL
LADNLARDPRAKFTLGCLPCLGLSLVPEIATDFYQQNSNLVMTLTAEHTETLVKKLDLREIDLALTMQPVQQGDIMATLI
AEVPLVYVDKDYRQGAVEIDSIDQQRWISPGLDSLSTAIAAHRVFPATGLNVETCYMAMEFVKRGVGCCITDIFSARHSL
TPEMIHQISPPMKIDLYLLRRADASLSPVTQKFVDFLCKRLRNELREINLELYPGNKKSIVSPV
>P83736 2.7.1.20~~~adoK~~~Adenosine kinase~~~
MTIAVTGSIATDHLMRFPGRFSEQLLPEHLHKVSLSFLVDDLVMHRGGVAGNMAFAIGVLGGEVALVGAAGADFADYRDW
LKARGVNCDHVLISETAHTARFTCTTDVDMAQIASFYPGAMSEARNIKLADVVSAIGKPELVIIGANDPEAMFLHTEECR
KLGLAFAADPSQQLARLSGEEIRRLVNGAAYLFTNDYEWDLLLSKTGWSEADVMAQIDLRVTTLGPKGVDLVEPDGTTIH
VGVVPETSQTDPTGVGDAFRAGFLTGRSAGLGLERSAQLGSLVAVLVLESTGTQEWQWDYEAAASRLAGAYGEHAAAEIV
AVLA
>A5U4N0 2.7.1.20~~~adoK~~~Adenosine kinase~~~COG0524
MTIAVTGSIATDHLMRFPGRFSEQLLPEHLHKVSLSFLVDDLVMHRGGVAGNMAFAIGVLGGEVALVGAAGADFADYRDW
LKARGVNCDHVLISETAHTARFTCTTDVDMAQIASFYPGAMSEARNIKLADVVSAIGKPELVIIGANDPEAMFLHTEECR
KLGLAFAADPSQQLARLSGEEIRRLVNGAAYLFTNDYEWDLLLSKTGWSEADVMAQIDLRVTTLGPKGVDLVEPDGTTIH
VGVVPETSQTDPTGVGDAFRAGFLTGRSAGLGLERSAQLGSLVAVLVLESTGTQEWQWDYEAAASRLAGAYGEHAAAEIV
AVLA
>P9WID5 2.7.1.20~~~adoK~~~Adenosine kinase~~~COG0524
MTIAVTGSIATDHLMRFPGRFSEQLLPEHLHKVSLSFLVDDLVMHRGGVAGNMAFAIGVLGGEVALVGAAGADFADYRDW
LKARGVNCDHVLISETAHTARFTCTTDVDMAQIASFYPGAMSEARNIKLADVVSAIGKPELVIIGANDPEAMFLHTEECR
KLGLAFAADPSQQLARLSGEEIRRLVNGAAYLFTNDYEWDLLLSKTGWSEADVMAQIDLRVTTLGPKGVDLVEPDGTTIH
VGVVPETSQTDPTGVGDAFRAGFLTGRSAGLGLERSAQLGSLVAVLVLESTGTQEWQWDYEAAASRLAGAYGEHAAAEIV
AVLA
>P20796 ~~~mgpA~~~Adhesin P1~~~
MHQPKKRLAKKSWAFLTAALTLGVITGVGGYFLFNQNKQRSSVSNFAYQPKQLSVKHQQAVDETLTPWTWNNNNFSSLKI
TGENPGSFGLVRSQNDNLNISSVTKNSSDDNLKYLNAVEKYLDGQQNFAIRRYDNNGRALYDINLAKMENPSTVQRGLNG
EPIFDPFKGFGLTGNAPTDWNEIKGKVPVEVVQSPHSPNLYFVLLVPKVALEYHNLNNQVVKESLEVKATQSSFNPTQRL
QKDSPVKDSSKQGEKLSETTASSMSSGMATSTRAKALKVEVERGSQSDSLLKNDFAKKPLKHKNSSGEVKLEAEKEFTEA
WKPLLTTDQIAREKGMGATVVSFYDAPYSENHTAFGLVDHIDPKKMVENYPPSWKTPKWNHHGIWDYNARNLLLQTTGFF
NPRRHPEWFDEGQAKADNTSPGFKVGDTDHKKDGFKKNSSSPIALPFEAYFANIGNMVAIGNSVFIFGGNGHATKMFTTN
PLSIGVFRIKYTDNFSKSSVTGWPYAVLFGGLINPQTNGLKDLPLGTNRWFEYVPRMAVSGVKWVGNQLVLAGTLTMGDT
ATVPRLKYDQLEKHLNLVAQGQGLLREDLQIFTPYGWANRPDIPVGAWLQDEMGSKFGPHYFLNNPDIQDNVNNDTVEAL
ISSYKNTDKLKHVYPYRYSGLYAWQLFNWSNKLTNTPLSANFVNENSYAPNSLFAAILNEDLLTGLSDKIFYGKENEFAE
NEADRFNQLLSLNPNPNTNWARYLNVVQRFTTGPNLDSSTFDQFLDFLPWIGNGKPFSNSPSPSTSASSSTPLPTFSNIN
VGVKSMITQHLNKENTRWVFIPNFSPDIWTGAGYRVQSANQKNGIPFEQVKPSNNSTPFDPNSDDNKVTPSGGSSKPTTY
PALPNSISPTSDWINALTFTNKNNPQRNQLLLRSLLGTIPVLINKSGDSNDQFNKDSEQKWDKTETNEGNLPGFGEVNGL
YNAALLHTYGFFGTNTNSTDPKIGFKADSSSSSSSTLVGSGLNWTSQDVGNLVVINDTSFGFQLGGWFITFTDFIRPRTG
YLGITLSSLQDQTIIWADQPWTSFKGSYLDSDGTPKSLWDPTALKSLPNSSTTYDTNPTLSPSFQLYQPNKVKAYQTTNT
YNKLIEPVDATSAATNMTSLLKLLTTKNIKAKLGKGTASSQGNNNGGGVSQTINTITTTGNISEGLKEETSIQAETLKKF
FDSKQNNKSEIGIGDSTFTKMDGKLTGVVSTPLVNLINGQGATSDSDTEKISFKPGNQIDFNRLFTLPVTELFDPNTMFV
YDQYVPLLVNLPSGFDQASIRLKVISYSVENQTLGVRLEFKDPQTQQFIPVLNASSTGPQTVFQPFNQWADYVLPLIVTV
PIVVIILSVTLGLTIGIPMHRNKKALQAGFDLSNKKVDVLTKAVGSVFKEIINRTGISNAPKKLKQATPTKPTPKTPPKP
PVKQ
>P11311 ~~~mgpA~~~Adhesin P1~~~
MHQTKKTALSKSTWILILTATASLATGLTVVGHFTSTTTTLKRQQFSYTRPDEVALRHTNAINPRLTPWTYRNTSFSSLP
LTGENPGAWALVRDNSAKGITAGSGSQQTTYDPTRTEAALTASTTFALRRYDLAGRALYDLDFSKLNPQTPTRDQTGQIT
FNPFGGFGLSGAAPQQWNEVKNKVPVEVAQDPSNPYRFAVLLVPRSVVYYEQLQRGLGLPQQRTESGQNTSTTGAMFGLK
VKNAEADTAKSNEKLQGAEATGSSTTSGSGQSTQRGGSSGDTKVKALKIEVKKKSDSEDNGQLQLEKNDLANAPIKRSEE
SGQSVQLKADDFGTALSSSGSGGNSNPGSPTPWRPWLATEQIHKDLPKWSASILILYDAPYARNRTAIDRVDHLDPKAMT
ANYPPSWRTPKWNHHGLWDWKARDVLLQTTGFFNPRRHPEWFDGGQTVADNEKTGFDVDNSENTKQGFQKEADSDKSAPI
ALPFEAYFANIGNLTWFGQALLVFGGNGHVTKSAHTAPLSIGVFRVRYNATGTSATVTGWPYALLFSGMVNKQTDGLKDL
PFNNNRWFEYVPRMAVAGAKFVGRELVLAGTITMGDTATVPRLLYDELESNLNLVAQGQGLLREDLQLFTPYGWANRPDL
PIGAWSSSSSSSHNAPYYFHNNPDWQDRPIQNVVDAFIKPWEDKNGKDDAKYIYPYRYSGMWAWQVYNWSNKLTDQPLSA
DFVNENAYQPNSLFAAILNPELLAALPDKVKYGKENEFAANEYERFNQKLTVAPTQGTNWSHFSPTLSRFSTGFNLVGSV
LDQVLDYVPWIGNGYRYGNNHRGVDDITAPQTSAGSSSGISTNTSGSRSFLPTFSNIGVGLKANVQATLGGSQTMITGGS
PRRTLDQANLQLWTGAGWRNDKASSGQSDENHTKFTSATGMDQQGQSGTSAGNPDSLKQDNISKSGDSLTTQDGNAIDQQ
EATNYTNLPPNLTPTADWPNALSFTNKNNAQRAQLFLRGLLGSIPVLVNRSGSDSNKFQATDQKWSYTDLHSDQTKLNLP
AYGEVNGLLNPALVETYFGNTRAGGSGSNTTSSPGIGFKIPEQNNDSKATLITPGLAWTPQDVGNLVVSGTTVSFQLGGW
LVTFTDFVKPRAGYLGLQLTGLDASDATQRALIWAPRPWAAFRGSWVNRLGRVESVWDLKGVWADQAQSDSQGSTTTATR
NALPEHPNALAFQVSVVEASAYKPNTSSGQTQSTNSSPYLHLVKPKKVTQSDKLDDDLKNLLDPNQVRTKLRQSFGTDHS
TQPQPQSLKTTTPVFGTSSGNLSSVLSGGGAGGGSSGSGQSGVDLSPVEKVSGWLVGQLPSTSDGNTSSTNNLAPNTNTG
NDVVGVGRLSESNAAKMNDDVDGIVRTPLAELLDGEGQTADTGPQSVKFKSPDQIDFNRLFTHPVTDLFDPVTMLVYDQY
IPLFIDIPASVNPKMVRLKVLSFDTNEQSLGLRLEFFKPDQDTQPNNNVQVNPNNGDFLPLLTASSQGPQTLFSPFNQWP
DYVLPLAITVPIVVIVLSVTLGLAIGIPMHKNKQALKAGFALSNQKVDVLTKAVGSVFKEIINRTGISQAPKRLKQTSAA
KPGAPRPPVPPKPGAPKPPVQPPKKPA
>P54570 3.6.1.13~~~nudF~~~ADP-ribose pyrophosphatase~~~COG0494
MKSLEEKTIAKEQIFSGKVIDLYVEDVELPNGKASKREIVKHPGAVAVLAVTDEGKIIMVKQFRKPLERTIVEIPAGKLE
KGEEPEYTALRELEEETGYTAKKLTKITAFYTSPGFADEIVHVFLAEELSVLEEKRELDEDEFVEVMEVTLEDALKLVES
REVYDAKTAYAIQYLQLKEALQAQK
>Q93K97 3.6.1.13~~~nudF~~~ADP-ribose pyrophosphatase~~~COG0494
MLKPDNLPVTFGKNDVEIIARETLYRGFFSLDLYRFRHRLFNGQMSHEVRREIFERGHAAVLLPFDPVRDEVVLIEQIRI
AAYDTSETPWLLEMVAGMIEEGESVEDVARREAIEEAGLIVKRTKPVLSFLASPGGTSERSSIMVGEVDATTASGIHGLA
DENEDIRVHVVSREQAYQWVEEGKIDNAASVIALQWLQLHHQALKNEWA
>P44684 3.6.1.13~~~nudF~~~ADP-ribose pyrophosphatase~~~COG0494
MQFWRKQMSEIQHFSQQDIEILGEQTLYEGFFTLKRIQFKHKLFAGGQSGVVTRELLIKGAASAVIAYDPKEDSVILVEQ
VRIGAAYHPESHRSPWLLELIAGMVEKGEKPEDVALRESEEEAGIQVKNLTHCLSVWDSPGGIVERIHLFAGEVDSAQAK
GIHGLAEENEDIKVHVVKREQAYQWMCEGKIDNGIAVIGLQWLQLNYAQLQQSWKRS
>I6X235 3.6.1.13~~~~~~ADP-ribose pyrophosphatase~~~COG0494
MAEHDFETISSETLHTGAIFALRRDQVRMPGGGIVTREVVEHFGAVAIVAMDDNGNIPMVYQYRHTYGRRLWELPAGLLD
VAGEPPHLTAARELREEVGLQASTWQVLVDLDTAPGFSDESVRVYLATGLREVGRPEAHHEEADMTMGWYPIAEAARRVL
RGEIVNSIAIAGVLAVHAVTTGFAQPRPLDTEWIDRPTAFAARRAER
>P67343 3.2.1.-~~~~~~Protein-ADP-ribose hydrolase~~~
METLKSNKARLEYLINDMRRERNDNDVLVMPSSFEDLWELYRGLANVRPALPVSDEYLAVQDAMLSDLNHQHVTDLKDLK
PIKGDNIFVWQGDITTLKIDAIVNAANSRFLGCMQANHDCIDNIIHTKAGVQVRLDCAEIIRQQGRNEGVGKAKKTRGYN
LPAKYIIHTVGPQIRRLPVSKMNQDLLAKCYLSCLKLADQHSLNHVAFCCISTGVFAFPQDEAAEIAVRTVESYLKETNS
TLKVVFNVFTDKDLQLYKEALNRDAE
>P67344 3.2.1.-~~~~~~Protein-ADP-ribose hydrolase~~~
METLKSNKARLEYLINDMRRERNDNDVLVMPSSFEDLWELYRGLANVRPALPVSDEYLAVQDAMLSDLNHQHVTDLKDLK
PIKGDNIFVWQGDITTLKIDAIVNAANSRFLGCMQANHDCIDNIIHTKAGVQVRLDCAEIIRQQGRNEGVGKAKKTRGYN
LPAKYIIHTVGPQIRRLPVSKMNQDLLAKCYLSCLKLADQHSLNHVAFCCISTGVFAFPQDEAAEIAVRTVESYLKETNS
TLKVVFNVFTDKDLQLYKEALNRDAE
>P0DN70 3.2.1.-~~~~~~Protein-ADP-ribose hydrolase~~~
MPSSFDLLGEMIDLLQTEQLTSYWACPLPNALTKRQDLWRALINQRPALPLSKDYLNLEDTYLDDWRASFVPVSVKDCQK
TNYTSLFLYHGDIRYLAVDAIVNAANSELLGCFIPNHGCIDNAIHTFAGSRLRLACQAIMTEQGRKEAIGQAKLTSAYHL
PASYIIHTVGPRITKGRHVSPIRADLLARCYRSSLDLAVKAGLTSLAFCSISTGEFGFPKKEAAQIAIKTVLKWQAEHPE
SKTLTIIFNTFTSEDKALYDTYLQKENNCE
>O34508 5.1.1.20~~~ykfB~~~L-Ala-D/L-Glu epimerase~~~COG4948
MKIIRIETSRIAVPLTKPFKTALRTVYTAESVIVRITYDSGAVGWGEAPPTLVITGDSMDSIESAIHHVLKPALLGKSLA
GYEAILHDIQHLLTGNMSAKAAVEMALYDGWAQMCGLPLYQMLGGYRDTLETDYTVSVNSPEEMAADAENYLKQGFQTLK
IKVGKDDIATDIARIQEIRKRVGSAVKLRLDANQGWRPKEAVTAIRKMEDAGLGIELVEQPVHKDDLAGLKKVTDATDTP
IMADESVFTPRQAFEVLQTRSADLINIKLMKAGGISGAEKINAMAEACGVECMVGSMIETKLGITAAAHFAASKRNITRF
DFDAPLMLKTDVFNGGITYSGSTISMPGKPGLGIIGAALLKGEKEQ
>Q8A861 5.1.1.20~~~~~~L-Ala-D/L-Glu epimerase~~~COG4948
MPNRRDFLKTAAFATLGSGIAVSQVLAGECMPSAIHINKYGIGGKMKMTFFPYELKLRHVFTVATYSRTTTPDVQVEIEY
EGVTGYGEASMPPYLGETVESVMNFLKKVNLEQFSDPFQLEDILSYVDSLSPKDTAAKAAVDIALHDLVGKLLGAPWYKI
WGLNKEKTPSTTFTIGIDTPDVVRAKTKECAGLFNILKVKLGRDNDKEMIETIRSVTDLPIAVDANQGWKDRQYALDMIH
WLKEKGIVMIEQPMPKEQLDDIAWVTQQSPLPVFADESLQRLGDVAALKGAFTGINIKLMKCTGMREAWKMVTLAHALGM
RVMVGCMTETSCAISAASQFSPAVDFADLDGNLLISNDRFKGVEVVNGKITLNDLPGIGVMKI
>Q97MK4 5.1.1.20~~~~~~L-Ala-D/L-Glu epimerase~~~COG4948
MIIKDIVIGHLSVPLKKPFKTAVRSVNSVNDVVVKIITDTGNVGFGSAASTGLVTGDITESIEGAINNYIKRSIVGMDIE
DFEAILIKLDNCIVGNTSAKAAVDIALYDLYGQRYGAPLYKLLGGFRNKLETDITISVNSPEEMSRDSVDAVKLGYKTLK
IKVGKNPKLDIKRMREIRKAIGYEVNLRIDANQGWQPKEAIRALNEIENEGLKIELVEQPVKAWNLEGLKMVTDNVNIPV
MADESVFSPKDAARVMEMRACDLINIKLMKTGGIHNALKICALAEVYGMECMLGCMLEGKVSVTAAVHLAAAKRIITKID
LDGPVLCSRDDVVGGAMYDNSNIVLVDEPGLGIEGINN
>P51981 5.1.1.20~~~ycjG~~~L-Ala-D/L-Glu epimerase~~~COG4948
MRTVKVFEEAWPLHTPFVIARGSRSEARVVVVELEEEGIKGTGECTPYPRYGESDASVMAQIMSVVPQLEKGLTREELQK
ILPAGAARNALDCALWDLAARRQQQSLADLIGITLPETVITAQTVVIGTPDQMANSASTLWQAGAKLLKVKLDNHLISER
MVAIRTAVPDATLIVDANESWRAEGLAARCQLLADLGVAMLEQPLPAQDDAALENFIHPLPICADESCHTRSNLKALKGR
YEMVNIKLDKTGGLTEALALATEARAQGFSLMLGCMLCTSRAISAALPLVPQVSFADLDGPTWLAVDVEPALQFTTGELH
L
>Q9WXM1 5.1.1.20~~~~~~L-Ala-D/L-Glu epimerase~~~COG4948
MSRIVNVKLSLKRYEYEKPFHITGSVSSESRNVEVEIVLESGVKGYGEASPSFRVNGERVEALLAIENAVREMITGIDVR
NYARIFEITDRLFGFPSLKAAVQFATLDALSQELGTQVCYLLGGKRDEIETDKTVGIDTVENRVKEAKKIFEEGFRVIKI
KVGENLKEDIEAVEEIAKVTRGAKYIVDANMGYTQKEAVEFARAVYQKGIDIAVYEQPVRREDIEGLKFVRFHSPFPVAA
DESARTKFDVMRLVKEEAVDYVNIKLMKSGISDALAIVEIAESSGLKLMIGCMGESSLGINQSVHFALGTGAFEFHDLDS
HLMLKEEVFRGKFIQDGPRMRVKDQ
>P37127 ~~~aegA~~~Putative oxidoreductase AegA~~~COG0493
MNRFIMANSQQCLGCHACEIACVMAHNDEQHVLSQHHFHPRITVIKHQQQRSAVTCHHCEDAPCARSCPNGAISHVDDSI
QVNQQKCIGCKSCVVACPFGTMQIVLTPVAAGKVKATAHKCDLCAGRENGPACVENCPADALQLVTDVALSGMAKSRRLR
TARQEHQPWHASTAAQEMPVMSKVEQMQATPARGEPDKLAIEARKTGFDEIYLPFRADQAQREASRCLKCGEHSVCEWTC
PLHNHIPQWIELVKAGNIDAAVELSHQTNTLPEITGRVCPQDRLCEGACTIRDEHGAVTIGNIERYISDQALAKGWRPDL
SHVTKVDKRVAIIGAGPAGLACADVLTRNGVGVTVYDRHPEIGGLLTFGIPSFKLDKSLLARRREIFSAMGIHFELNCEV
GKDVSLDSLLEQYDAVFVGVGTYRSMKAGLPNEDAPGVYDALPFLIANTKQVMGLEELPEEPFINTAGLNVVVLGGGDTA
MDCVRTALRHGASNVTCAYRRDEANMPGSKKEVKNAREEGANFEFNVQPVALELNEQGHVCGIRFLRTRLGEPDAQGRRR
PVPVEGSEFVMPADAVIMAFGFNPHGMPWLESHGVTVDKWGRIIADVESQYRYQTTNPKIFAGGDAVRGADLVVTAMAEG
RHAAQGIIDWLGVKSVKSH
>P09167 ~~~aerA~~~Aerolysin~~~
MQKIKLTGLSLIISGLLMAQAQAAEPVYPDQLRLFSLGQGVCGDKYRPVNREEAQSVKSNIVGMMGQWQISGLANGWVIM
GPGYNGEIKPGTASNTWCYPTNPVTGEIPTLSALDIPDGDEVDVQWRLVHDSANFIKPTSYLAHYLGYAWVGGNHSQYVG
EDMDVTRDGDGWVIRGNNDGGCDGYRCGDKTAIKVSNFAYNLDPDSFKHGDVTQSDRQLVKTVVGWAVNDSDTPQSGYDV
TLRYDTATNWSKTNTYGLSEKVTTKNKFKWPLVGETELSIEIAANQSWASQNGGSTTTSLSQSVRPTVPARSKIPVKIEL
YKADISYPYEFKADVSYDLTLSGFLRWGGNAWYTHPDNRPNWNHTFVIGPYKDKASSIRYQWDKRYIPGEVKWWDWNWTI
QQNGLSTMQNNLARVLRPVRAGITGDFSAESQFAGNIEIGAPVPLAADSKVRRARSVDGAGQGLRLEIPLDAQELSGLGF
NNVSLSVTPAANQ
>A0A073CEA3 4.1.1.100~~~aerD~~~Prephenate decarboxylase~~~
MLKFSMEFCYPQPDVKTLIVGTLGPKETSSEQTLNYLITQWQAEQISVTSHLFDTFTELKEALLQDRVDLALVPHAYERV
NDFYMEPSLKLGFVFTYPTPIYGLAKRKNEELVWENCTLVTHPAPFPLLPYLLPGYPHQKNIKVEFVNSTSAAAIQVKQG
LADLAITNENALKENDLEFIAEYGKIEMSWSIFHKKGTVHRE
>P50466 ~~~aer~~~Aerotaxis receptor~~~COG0840
MSSHPYVTQQNTPLADDTTLMSTTDLQSYITHANDTFVQVSGYTLQELQGQPHNMVRHPDMPKAAFADMWFTLKKGEPWS
GIVKNRRKNGDHYWVRANAVPMVREGKISGYMSIRTRATDEEIAAVEPLYKALNAGRTSKRIHKGLVVRKGWLGKLPSLP
LRWRARGVMTLMFILLAAMLWFVAAPVVTYILCALVVLLASACFEWQIVRPIENVAHQALKVATGERNSVEHLNRSDELG
LTLRAVGQLGLMCRWLINDVSSQVSSVRNGSETLAKGTDELNEHTQQTVDNVQQTVATMNQMAASVKQNSATASAADKLS
ITASNAAVQGGEAMTTVIKTMDDIADSTQRIGTITSLINDIAFQTNILALNAAVEAARAGEQGKGFAVVAGEVRHLASRS
ANAANDIRKLIDASADKVQSGSQQVHAAGRTMEDIVAQVKNVTQLIAQISHSTLEQADGLSSLTRAVDELNLITQKNAEL
VEESAQVSAMVKHRASRLEDAVTVLH
>Q9I3F6 ~~~aer~~~Methyl-accepting chemotaxis protein Aer~~~
MRNNQPITQHERVYPAEQRLITTTNLKGIITYCNEAFIDISGFSREELMSAPHNLIRHPDVPPAVFAHMWTTLKAGRPWM
GIVKNRCKNGDHYWVSAYVTPIYDQGAVVGYESVRVKPTAEQIQRAEALYRRLGAGKPAIPRRDRWLPVLLDWLPFILIS
QIGFLIGIWLNSWWGFILAGLLAVPLGLAGLRWQKRGLKRLMRLAEQTTSDPLIAQMYTDSRGDQARLEMAILSQDARLK
TCLTRLQDTAEYLTEQARQADTLAHHSSAGLEQQRAETEQVATAVNEMAATTQEVANNVQLTADATQKANELTSRGRDIA
AETRNAIQRLSESVGETGAAVSRLAQDSNEIGGVVDVIKGIADQTNLLALNAAIEAARAGDQGRGFAVVADEVRSLAQRT
AASTEQIHHLIAKLQNTANDAVHTMESGLQQAEAGVQRVLEADSALVGISEAVSNITEMTTQIAAAAEEQSAVAEEINRN
ISTIAALAEQTSDEALRTAKLSEELTTTAQSQYSLVERFNR
>P23872 3.1.1.-~~~aes~~~Acetyl esterase~~~COG0657
MKPENKLPVLDLISAEMKTVVNTLQPDLPPWPATGTIAEQRQYYTLERRFWNAGAPEMATRAYMVPTKYGQVETRLFCPQ
PDSPATLFYLHGGGFILGNLDTHDRIMRLLASYSQCTVIGIDYTLSPEARFPQAIEEIVAACCYFHQQAEDYQINMSRIG
FAGDSAGAMLALASALWLRDKQIDCGKVAGVLLWYGLYGLRDSVTRRLLGGVWDGLTQQDLQMYEEAYLSNDADRESPYY
CLFNNDLTREVPPCFIAGAEFDPLLDDSRLLYQTLAAHQQPCEFKLYPGTLHAFLHYSRMMKTADEALRDGAQFFTAQL
>Q8ZRA1 3.1.1.-~~~aes~~~Acetyl esterase~~~
MKPENKIPVLTRLSDEMTAVVNFQQPGLPPWPADGDIETQRQYYLLERRFWNADAPSMTTRTCAVPTPYGDVTTRLYSPQ
PTSQATLYYLHGGGFILGNLDTHDRIMRLLARYTGCTVIGIDYSLSPQARYPQAIEETVAVCSYFSQHADEYSLNVEKIG
FAGDSAGAMLALASALWLRDKHIRCGNVIAILLWYGLYGLQDSVSRRLFGGAWDGLTREDLDMYEKAYLRNDEDRESPWY
CLFNNDLTRDVPPCFIASAEFDPLIDDSRLLHQTLQAHQQPCEYKMYPGTLHAFLHYSRMMTIADDALQDGARFFMARMK
TPR
>Q47038 ~~~afaD~~~Protein AfaD~~~
MNGSIRKMMRVTCGMLLMVMSGVSQAAELHLESRGGSGTQLRDGAKVATGRIICREAHTGFHVWMNERQVDGRAERYVVQ
SKDGRHELRVRTGGDGWSPVKGEGGKGVSRPGQEEQVFFDVMADGNQDIAPGEYRFSVGGACVVPQE
>Q57254 ~~~afaE3~~~Afimbrial adhesin AFA-III~~~
MKKLAIMAAASMVFAVSSAHAGFTPSGTTGTTKLTVTEECQVRVGDLTVAKTRGQLTDAAPIGPVTVQALGCNARQVALK
ADTDNFEQGKFFLISDNNRDKLYVNIRPMDNSAWTTDNGVFYKNDVGSWGGTIGIYVDGQQTNTPPGNYTLTLTGGYWAK
>A0A493R6X0 ~~~~~~Afifavidin~~~
MRRLASLAVALPLLAVVASPALAQDMSPRQSAEAFGVPAVSSSWVNQDGSTMTLVFGAGNSVSGFYVNNAPGFGCQGTPY
PLVGLTWGNFIGFTVAWDNATANCNSVTSWTGFAEAAGSDVTIVTDWNLAYQGSSSGEIQQGSDTFTLVNKAMKETPKM
>C4ULG3 ~~~~~~Toxin Afp18~~~
MPYFNKSKKNEIRPEKSKEEVGGVLFDDSAIHENIDHNMEPQTGDSVATFPDNSDEVVGGDLAALRARLQATIGDYLPEF
YEFQQTGRNILYPKPVDIKALQARLNSMPITAWIQPLQQQLEQAGNAFLDRFRLFRKEKDWHNNALEFVKFRLAVKETGW
DEQQTRDYNEALKHANRLSGYLSGTLDLIQHLNDNPERQTSWEAVRKTSYDMLLNELLPVDAENNGNQYTAKIRGNLTKN
HPDNIKFNTIVNSVTRDERMASEALSAETKGLSARLLSGIALTRNSAEKIDKKSETSGITRDIVALCQVLQGVIKTIKRR
GEYIQPLTDWQPSEHKLPGQKEELTKAETVKRELVKSRKVMMQKVQGAKTILGVLSDKTNKNRHRITQTLPYSANPDTNT
VSRKTNEAVFRAGIMLLDKIQQTTSGIHKASLASRPLQHAVTQYSALEQMSSGMPSNRILDAKLRSESGRWQEKAEKSKK
QLQHILREITALAEEHLKHKFMSALRDELKRAPETLASNNIISNFDTRAKIVVEGLASIEIGMNQSVVRLAGHGEAGRKD
LDEQVVAWLQRLERIKGELKTDITQATGQSINNFSRQGMLARRMGEWNEAEKQRYLAALSTEDRAVAEMQYNTLFFEVIQ
HYLPLLSKETDPQGERLLQRLRLEVSNAAEGTTVYPATMAEILAGMKSTEQAIRDWSGRKLLRVVFLAACLEGVKLMPKL
AALPLRVAIKFVITGAKVAWATHKGQQGIRGGEGDVDDEIGEYAKRSFKTASVKVVLSLPPGLATMLGVASIALDVYEGG
LKGAGGKIAKNIVGDAPWRALNQGSKIAAEAYTTASMNAALKEGGANPVSHSSTLQQQMDAKPFFDNSDQDADQPRVRRK
REMTDEIMLSDGSSRSEAKALPDENELDTDQSKSRPESALAVETLQSEHFDFDRGIRYQDFSDEQKKQTYLHGIKFVLLQ
IENDGHFAQNIRNNAYLARIGAKLAVPVDIYRYKLNNTFLLPDEIDSKSGVLIRLDSEIPYYYVSEGKDLLENIAWAMPY
NAANRGPLKFSLDPGEVTSSHSGVDILNNIRSERFKFETYFNYNAPEAMSIESLSAQLANTIEADYKFKNTSPTNKILIS
RAIVGAHIPDPGVRATQGEYHIEFDSDELAPAKYLRSFARPFSTLSGEMQLISSSIKGETIQETELHVHQAEYIGSWVDA
TAGAIISFTPEGWFLNTAQSAAEITADLTEGKDPDPLAVAGLLVGIIPGGKIAAKVGKFTRIGGKTVKYGLILGNKSVDL
AIVGKSIKTAVDTGEPLAIYQAFLASGMSVKNSYDIAKNMSSELKISKKIEESARLRKLKALQKNTYKYSMNSKMPVRKF
RVGQTDLLGKIHKGEIKISRNNGTTWEKGNQLHLLAYRLQNAGGGRLLPDVFRDKIVIGEYSFKRVKYNQKKLNEMMRIA
KMYTPTSNSTERIAKLQQNYKTGKEMSHAPQYDTYNDLSLGEKLDLFINSNTDATTRGVLAGKINESITNINLYETAKGV
DAWKTSANKATDVVLAPQNIFLKGRAGECLPESILMGWALQSGQDTKLAKKLMNIHSSSNIAANPLYKSLVELHSDGNAS
RFGESVISDINMKMLSGAESKLFPTENSSVRVDIPEHTMLISKVNKDGKIKYVFYDPNYGMAYFNKYKEMISFFKKKIKG
YDTPKRSTSFRQLDYSHLSDIKIKGKNLNEIIDGEIPQIFRQEDVNLEGITPQDGIYRMLGTHQQENNTYIKSQNNIYQV
EWDQTTNTWRVFDSANTNRSRITVPIKRDTDGEWFKYTETGLKGGGLFDEIKNNWLQRKRFKNLQDFNDIVDFEENKWPS
EPINKDIHMIWVGTRNISEKNIGLSLETARKNSDYNTTIIYDSGIAGYESAKDFMTEKFKDSKVTLVDFRKKSYFHQLQQ
EPSFPYYEQAIRDKKYAQASDILRLLVLKYEGGIYKDIDDVQVKAFGSLAFPKGIGVMREYAPEAGKTTAFPNTPIAATK
NNPVVNKTLELAVENYHRGETNVLKLAGPDVFTESLYQEIPGMRPQVLGAQLDQFELAKRQALGMRLEKPKGFADEKLTL
QEKAKIRQPYEAIRGLSGYVDNGADHSWVTDMPGNSTQSSGLS
>Q9ZN78 ~~~arpA~~~A-factor receptor protein~~~
MAKQARAVQTWRSIVDAAASVFDDYGYERAAISEILRRAKVTKGALYFHFASKEAIAQAIMDEQTSTVEFEQEGSPLQSL
VDGGQQFAFALRHNSMARAGTRLSIEGVFLGGPHPWGDWIDATARMLELGQERGEVFPQIDPMVSAKIIVASFTGIQLVS
EADSGRADLRGQVAEMWRHILPSIAHPGVIAHIKPEGRVDLAAQAREKAEREEQEARIAAEAKGAGSDAATDSGSRSGGS
GLRGGGSGRGPRAGGAGDEGDEEPAGAGVAAGGVVA
>Q2I8V6 1.1.1.292~~~afr~~~1,5-anhydro-D-fructose reductase~~~
MNRWGLIGASTIAREWVIGAIRATGGEVVSMMSTSAERGAAYATENGIGKSVTSVEELVGDPDVDAVYVSTTNELHREQT
LAAIRAGKHVLCEKPLAMTLEDAREMVVAAREAGVVLGTNHHLRNAAAHRAMRDAIAEGRIGRPIAARVFHAVYLPPHLQ
GWRLERPEAGGGVILDITVHDADTLRFVLNDDPAEAVAISHSAGMGKEGVEDGVMGGVRFQSGVIAQFHDAFTTKFAETG
FEVHGTEGSLIGRNVMTQKPVGTVTLRNAEGESQLPLDPANLYETALAAFHSAIEGHGQPSATGEDGVWSLATGLAVVKA
AATGQAAEIETGL
>Q92KZ3 1.1.1.292~~~afr~~~1,5-anhydro-D-fructose reductase~~~COG0673
MIRWGLIGASTIAREWVIGAIRAAGGEVVSVMSSSAERGEAYAAENGIAKAVTSVDDLVGDPDVDAVYISTTNELHHGQA
LAAIRAGKHVLCEKPLAMNLNDGCEMVLKACEAGVVLGTNHHLRNAATHRAMREAIAAGRIGRPIAARVFHAVYLPPHLQ
GWRLDKPEAGGGVILDITVHDADTLRFVLNDDPIEAVAISHSAGMGKEGLEDGVMGVLRFRSGVIAQFHDAFTTKFAETG
LEVHGTAGSLIGRNVMTQRPVGTVVLRNEEGESELPLDHRNLYETAIAAFHSAIGGNGRPSASGEDGVWSLATGLAVVKA
AATGGAVEIETGL
>B1VN93 2.3.1.277~~~afsA~~~2-oxo-3-(phosphooxy)propyl 3-oxoalkanoate synthase~~~
MPEAAVLIDPVPTMDAEAEVVHPVGIEMVHRTRPEDAFPRNWVRLGRDRFAVEAVLPHDHPFFAPVGDDLHDPLLVAEAM
RQAAMLAFHAGYGIPLGYHFLLTELDYVCHPEHLGVGGEPTEIGLEVFCSDLKWRAGLPAQGRVGWAVHRGDRLAATGVA
ATRFSTPKAYRRMRGDVPVEGISLPETAPVPASPAGRARVEDVVLSGTGREGVWELRVDTRHPTLFQRPNDHVPGMLLLE
AARQAACLVAGPAGIVPVEARTRFHRYSEFGSPCWIGAVVQPGADEDTVTVRVTGHQDGETVFSTVLSGPRAHG
>P54741 2.7.11.1~~~afsK~~~Serine/threonine-protein kinase AfsK~~~COG0515
MVDQLTQHDPRRIGPFEVLGRLGAGGMGLVYLARSASGRRVAIKTVRTELAEDQLFRVRFTREVEAARAVSGFYTAAVVD
ADPRAAVPWLATAYVPAPSLEEIVNECGPMPAQAVRWLAAGVAEALQSIHGAGLVHRDLKPSNVLVVEDGPRVIDFGIAS
GVSNTRLTMTNVAVGTPAYMSPEQAKDSRSVTGASDVFSLGSMLVFAATGHPPFHGANPVETVFMLLREGPDLEGLPDEL
RPLIESCMQMEATGRPNPADLQAQLAPHLFGSGSDDSGTASAWLPERAVGLIEGRRNGRPAVKPATTAGGRGHGHGPSGA
RAPVHAPPLPPPPAHDPVVPAPPAHVPAVPAPVGAPDGGPVRLPGAAVPIGPGPRVADMRAAAVAAPPPESALAASWSRP
RPGVNGADPAVPAPAPAPPEASPAGWRPWRFRMSNDVWGTPRVAEDLVYVTSFEVHALDVATGRRRFKTRDVAWSMAVAD
GRIHASDGPTLFALDAREGADLWRVQTDAWVYSLQADRGTVLTATRGGGVQAWEASAGQKLWEVTGAQTDFESPEAGAAL
HDGTAYVWQDARLRALDARTGDERWSYPIGDAASCGGVPVRLTQAPDGYVYVAAGTRVLALEVASGHVRWHFEAPAVFLA
PPTFVPGPAVTGGGVYLADYLGTVYALDATDGRDRWRIATEARSSTDPVLVAAGHVHVGSGKGLYTLDAVTGTPKWRFQA
GGDIVGAPAVAEGRIHFGSSDHLLYTLKADDGRLRWKLATGGEITGSPVVRDGIVYACSKDRCVYALDAEKGTGTARTT
>Q04942 ~~~afsQ1~~~Transcriptional regulatory protein AfsQ1~~~COG0745
MPSLLLIEDDDAIRTALELSLTRQGHRVATAASGEDGLKLLREQRPDLIVLDVMLPGIDGFEVCRRIRRTDQLPIILLTA
RNDDIDVVVGLESGADDYVVKPVQGRVLDARIRAVLRRGERESTDSASFGSLVIDRSAMTVTKNGEDLQLTPTELRLLLE
LSRRPGQALSRQQLLRLVWEHDYLGDSRLVDACVQRLRAKVEDVPSSPTLIRTVRGVGYRLDPPQ
>Q9S0Y6 ~~~afsR~~~Regulatory protein AfsR~~~
MDRDNGPRVRVPEQRTPSIPATADALRFTVLGPVRAWRGSELLSSGSPQQRALLTALLLREGRTATAGELIDAFWGEDPP
SQALATIRTYASRLRKILGQDTLVSESGGYAIRTERSALDLTLAQDLAAEAEKARAAGDRCQARTLINKVLGLWDGEALA
SVPGPYADNQRTRLEEWRLQLTETRLDLDLEVGCHAEAVSELTALTAAHPLRERLRELLMVALYRSGRQAEALAVYADTR
RLLAEELGVDPRPELAELQQRILRADEELARPADEPAPAPAPLKPAQLPATVPDFTGRSAFVTELGSRLATAEGSVMAVS
AVAGIGGVGKTTLAVHVAHQARRHFPDGQLYVDLQGAGARAAEPETVLGSFLRALGTADSAIPDTLDERAALYRSTLDGR
RILILLDNAHDAAQIRPLLPGTPGCAALVTSRVRMVDLAGAHLVDLDVMSPEEALQLFTRIVGAERVGAEREAALDVVAA
CGFLPLAIRIAASRLAARRTWTVSVLAAKLADERRRLDELQAGDLTVKATFELGYGQLEPAQAHAFRLLGLADGPDISLA
AAAALLDLDPHVAEDLLEALVDTSLVESAAPGRYRYHDLVRLYARACAERDEQPPVRRELALSRLLDFYLATAAGVYALE
RPGERVLDHFTPTEYPGLTFPAREAALDWLFTESSGLLACARQSAAIGMPQRAADLLMAVVDLGESGANSHQFATAAKAV
SEAAKAAGVPRAEARARTMLSHVHSVSGRFAEAEAEAMRALDLGRLAQDAVSQGQAPNQRGIIALYENRHDDAEAHLTQA
LTAFRADGNKPGEAAALCNLSRVHLATGRTATAVRLAEEGVAIYDSDASGLALIGSGRTDPARHVLLEALQIFRESRQQL
WHGMTLFRLSELHLTEQEGAQAAAHAEQSLVVLRGIGGDWRRANVLTVLGRALTVIGQTDRAQVCWGEALTVFEELGSPE
AEAVRQLLDPAGVG
>Q8NTW4 2.4.2.46~~~aftA~~~Galactan 5-O-arabinofuranosyltransferase~~~
MNPYGPSRVPEGEVYRPDRLNRKATLVAIVGAAILAFAFALVLWMGLKQTNLPAFGPSNVTRAVASATIAAVLIVTGFLT
WLWLRDEHQSNPRWELEDVKPRPKWRTALTYLASYLSPAALVVAVLAIPLSATRLYLDGISVDQGFRTQFLTRMADDIGL
SDMNYIDMPTFYPAGWFWLGGRLANLLGLPGWEAFQPWAIVSMAVAASVLVPVWQRITGSLPVATGIALVTTCIILAMNS
EEPYAAIVAMGIPAMLVLASRIAKGDKFALAGGIIYLGVSATFYTLFTGAIALSAVAVCIVVAAIVQRSIKPLLWLAVLG
GGSIVIALISWGPYLLASINGAERSGDSATHYLPLEGTQFPVPFLASSVVGLLCLVGLIYLVVRFHNNEVRAMWVGIAVF
YAWMGMSMAITLLGNTLLGFRLDTVLVLIFATAGVLGIADFRLASVYQLYPTQITERTATHLTNLIVVLVLLGGLYYAQD
LPQKNARAIDLAYTDTDGYGERADLYPAGAARYYKDINDHLLDQGFEPSETVVLTDELDFMSYYPYRGYQAFTSHYANPL
GEFGNRNAFIEDLAIRSWDELADPQQFSDALNTSPWTIPEVFIFRGSIDDPDAGWKYDVAEDLYPNNPNVRFRGVYFNPE
SFDQMWQTKQVGPFVVVTHNE
>P9WN03 2.4.2.46~~~aftA~~~Galactan 5-O-arabinofuranosyltransferase~~~
MPSRRKSPQFGHEMGAFTSARAREVLVALGQLAAAVVVAVGVAVVSLLAIARVEWPAFPSSNQLHALTTVGQVGCLAGLV
GIGWLWRHGRFRRLARLGGLVLVSAFTVVTLGMPLGATKLYLFGISVDQQFRTEYLTRLTDTAALRDMTYIGLPPFYPPG
WFWIGGRAAALTGTPAWEMFKPWAITSMAIAVAVALVLWWRMIRFEYALLVTVATAAVMLAYSSPEPYAAMITVLLPPML
VLTWSGLGARDRQGWAAVVGAGVFLGFAATWYTLLVAYGAFTVVLMALLLAGSRLQSGIKAAVDPLCRLAVVGAIAAAIG
STTWLPYLLRAARDPVSDTGSAQHYLPADGAALTFPMLQFSLLGAICLLGTLWLVMRARSSAPAGALAIGVLAVYLWSLL
SMLATLARTTLLSFRLQPTLSVLLVAAGAFGFVEAVQALGKRGRGVIPMAAAIGLAGAIAFSQDIPDVLRPDLTIAYTDT
DGYGQRGDRRPPGSEKYYPAIDAAIRRVTGKRRDRTVVLTADYSFLSYYPYWGFQGLTPHYANPLAQFDKRATQIDSWSG
LSTADEFIAALDKLPWQPPTVFLMRHGAHNSYTLRLAQDVYPNQPNVRRYTVDLRTALFADPRFVVEDIGPFVLAIRKPQ
ESA
>O53582 2.4.2.-~~~aftB~~~Terminal beta-(1->2)-arabinofuranosyltransferase~~~COG1807
MVRVSLWLSVTAVAVLFGWGSWQRRWIADDGLIVLRTVRNLLAGNGPVFNQGERVEANTSTAWTYLLYVGGWVGGPMRLE
YVALALAMVLSLLGMVLLMLGTGRLYAPSLRGRRAIMLPAGALVYIAVPPARDFATSGLESGLVLAYLGLLWWMMVCWSQ
PLRARPDSQMFLGALAFVAGCSVLVRPEFALIGGLALIMMLIAARTWRRRVLIVLAGGFLPVAYQIFRMGYYGLLVPSTA
LAKDAAGDKWSQGMIYVSNFNRPYALWVPLVLSVPLGLLLMTARRRPSFLRPVLAPDYGRVARAVQSPPAVVAFIVGSGV
LQALYWIRQGGDFMHGRVLLAPLFCLLAPVGVIPILLPDGKDFSRETGRWLVGALSGLWLGIAGWSLWAANSPGMGDDAT
RVTYSGIVDERRFYAQATGHAHPLTAADYLDYPRMAAVLTALNNTPEGALLLPSGNYNQWDLVPMIRPSSGTAPGGKPAP
KPQHAVFFTNMGMLGMNVGLDVRVIDQIGLVNPLAAHTERLKHARIGHDKNLFPDWVIADGPWVKWYPGIPGYIDQQWVT
QAEAALQCPATRAVLNSVRAPITLHRFLSNVLHSYEFTRYRIDRVPRYELVRCGLDVPDGPGPPPRE
>A0QW28 2.4.2.47~~~aftC~~~Alpha-(1->3)-arabinofuranosyltransferase~~~
MYCALVTATDSITTKLLNAFRPRTSAPSTATVLRSVLWPIAILSVIHRSYVLGTNGYITDDFGPVYRAVINFKLGLDIYN
EQFDHVDPHYLYPPGGTLLLAPFGYLPVDASRYWYISFNVLAFLIAAYLMLRIFDYTLSSVAAPALVLAMFCTESVTNTL
VFTNINGCMLLGAVLFFRWLLKGGRNAELLAGAAIGLTLVVKPSLAPLLLLPVLNRQFYTLITAFGVPLVFNIAAWPLVP
DPMNFVRHTVPYIMSTRDYFNSSIVGNGIYYGLPMWLILLLRVVFLLLAVGSLWLLYRYYRERDPRFWLLTSSGVLLTAS
FLLLSLGQGYYSTMLFPFLMTVVLPNSVLRNWPAWLAIYGFMTMDRWLLGHWPTTGRFLEYMKITYGWSLMLVVVFCVLY
FRYLDAKQDGRLDQGIDPPWMARERTPAPV
>P9WMZ7 2.4.2.47~~~aftC~~~Alpha-(1->3)-arabinofuranosyltransferase~~~
MYGALVTAADSIRTGLGASLLAGFRPRTGAPSTATILRSALWPAAVLSVLHRSIVLTTNGNITDDFKPVYRAVLNFRRGW
DIYNEHFDYVDPHYLYPPGGTLLMAPFGYLPFAPSRYLFISINTAAILVAAYLLLRMFNFTLTSVAAPALILAMFATETV
TNTLVFTNINGCILLLEVLFLRWLLDGRASRQWCGGLAIGLTLVLKPLLGPLLLLPLLNRQWRALVAAVVVPVVVNVAAL
PLVSDPMSFFTRTLPYILGTRDYFNSSILGNGVYFGLPTWLILFLRILFTAITFGALWLLYRYYRTGDPLFWFTTSSGVL
LLWSWLVMSLAQGYYSMMLFPFLMTVVLPNSVIRNWPAWLGVYGFMTLDRWLLFNWMRWGRALEYLKITYGWSLLLIVTF
TVLYFRYLDAKADNRLDGGIDPAWLTPEREGQR
>A0QPD4 2.4.2.47~~~aftD~~~Alpha-(1->3)-arabinofuranosyltransferase~~~COG4981
MVAAATLVLTFAQSPGQISPDTKLDLTANPLRFLARAFNLWNSDLPFGQAQNQAYGYLFPHGTFFLLGDVLGVPGWVTQR
LWWALLLTVGFWGVLRVAEALGIGSTPSRLIGAAAFALSPRVLTTLGAISSETLPMMLAPWVLLPVILALRGQHSVRLMA
ARSAGAVALMGAVNAVATLTGCLAAVIWWACHRPNRLWWRFTAWWLLCGALAVTWWVVALLMLGRISPPFLDFIESSGVT
TQWMSLTEMLRGTMSWTPFVAPSATAGASLVTSTTAVLATTVVAAAGLAGLALRTMPARGRLITMLLIGVVLLGLGYSGG
LGSPVALQVQAFLDGSGTPLRNLAKLEPVIRLPLALGLVHLLGRIPLPGSAPRAVWVSAFAHPERDKRVAVAIVVLSALA
AGTSLAWTARLTPPGSFTAIPQHWHDAAAWLDEHNTDRGRVLVAPGAPFATQVWGNSHDEPLQVLGDNPWGVRDSIPLTP
PETIRALDSVQRLFASGRPSPGLADTLARQGISYVVVRNDLDPDTSRSARPILVHRAVEGSPGLTKVAEFGDPVGPGTLE
GFVADSGLRPRYPAVEIFRVEPADAGSSQQRSPMHPYLVDSDAMTRVAGAPEALLRLDERRRLNGEPPLGPMLLAADARR
AGLPVDGVIVTDTPTAREIDYGRVDDHASAIRTPDDARHTYNRVPDYPSDGADLVYGKWTGGRLSVSSSAADSTALPYVA
PATGPAAAIDSDSSTAWVSNALQAAVGQWLQVDFDHPVTNATLTITPSATAVGAQVRRIEIATATGTSSLRFDTAGKPLT
IPLPVGETPWVRVTAVATDDGSPGVQFGVTDLAITQYDASGFAHPVTLRHTVEVPGPPAGSVVQQWDLGTELLGRPGCAD
SPVGVRCAAAMALASEEPVNLSRTLTVPQDTEVQPTVWIRGRQGPNLADLVAQPDTTRAFGDSDPIDVLGSAYAATDGDP
RTSWTAPQRVVQFQTPPTLTLKLPRPTEVSGMRIVPGDTEPPAHPTLVAIDLGDGPQMHRLPADGEPRTVTLKPRVTDTV
TVSLLAWNDIIDRTSLGFDQLKPPGLAELTVLDGRGAPVGAADAAKNRSRAVALPCGQGPIIAVAGQFIQTSVHTTVGAL
LDGEPIPARPCRSEPVKLPAGQQELVVSPGAAFIVDGVELPTPAADEIRSAPTTSAETGTWTADRREVRVSAAAQQRVLV
VPESVNRGWSAHDPAGAELQSVTVNGWQQGWVVPAGTEGTVTLTFASNMPYRVGLIGGLALLPLLALLALIPVRRPVRAA
APARPWNPGPVLTGAAALVAGTAISGVAGLLVVGAAMGVRILLNRRGAAGEKVWDNVTVVVAAGGLILAGSVLSQYPWRS
VDGYVGHTPGVQFLALLSVAFLAASAVRLVNRPEPSEDGRSAKPEHTGASAHAG
>P96419 2.4.2.47~~~aftD~~~Alpha-(1->3)-arabinofuranosyltransferase~~~COG4981
MAPLSRKWLPVVGAVALALTFAQSPGQVSPDTKLDLTANPLRFLARATNLWNSDLPFGQAQNQAYGYLFPHGTFFVIGHL
LGVPGWVTQRLWWAVLLTVGFWGLLRVAEALGVGGPSSRVVGAVAFALSPRVLTTLGSISSETLPMMLAPWVLLPTILAL
RGTSGRSVRALAAQAGLAVALMGAVNAIATLAGCLPAVIWWACHRPNRLWWRYTAWWLLAMALATLWWVMALTQLHGVSP
PFLDFIESSGVTTQWSSLVEVLRGTDSWTPFVAPNATAGAPLVTGSAAILGTCLVAAAGLAGLTSPAMPARGRLVTMLLV
GVVLLAVGHRGGLASPVAHPVQAFLDAAGTPLRNVHKVGPVIRLPLVLGLAQLLSRVPLPGSAPRPAWLRAFAHPERDKR
VAVAVVALTALMVSTSLAWTGRVAPPGTFGALPQYWQEAADWLRTHHAATPTPGRVLVVPGAPFATQVWGTSHDEPLQVL
GDGPWGVRDSIPLTPPQTIRALDSVQRLFAAGRPSAGLADTLARQGISYVLVRNDLDPETSRSARPILLHRSIAGSPGLA
KLAEFGAPVGPDPLAGFVNDSGLRPRYPAIEIYRVSAPANPGAPYFAATDQLARVDGGPEVLLRLDERRRLQGQPPLGPV
LMTADARAAGLPVPQVAVTDTPVARETDYGRVDHHSSAIRAPGDARHTYNRVPDYPVPGAEPVVGGWTGGRITVSSSSAD
ATAMPDVAPASAPAAAVDGDPATAWVSNALQAAVGQWLQVDFDRPVTNAVVTLTPSATAVGAQVRRILIETVNGSTTLRF
DEAGKPLTAALPYGETPWVRFTAAATDDGSAGVQFGITDLAITQYDASGFAHPVQLRHTVLVPGPPPGSAIAGWDLGSEL
LGRPGCAPGPDGVRCAASMALAPEEPANLSRTLTVPRPVSVTPMVWVRPRQGPKLADLIAAPSTTRASGDSDLVDILGSA
YAAADGDPATAWTAPQRVVQHKTPPTLTLTLPRPTVVTGLRLAASRSMLPAHPTVVAINLGDGPQVRQLQVGELTTLWLH
PRVTDTVSVSLLDWDDVIDRNALGFDQLKPPGLAEVVVLSAGGAPIAPADAARNRARALTVDCDHGPVVAVAGRFVHTSI
RTTVGALLDGEPVAALPCEREPIALPAGQQELLISPGAAFVVDGAQLSTPGAGLSSATVTSAETGAWGPTHREVRVPESA
TSRVLVVPESINSGWVARTSTGARLTPIAVNGWQQAWVVPAGNPGTITLTFAPNSLYRASLAIGLALLPLLALLAFWRTG
RRQLADRPTPPWRPGAWAAAGVLAAGAVIASIAGVMVMGTALGVRYALRRRERLRDRVTVGLAAGGLILAGAALSRHPWR
SVDGYAGNWASVQLLALISVSVVAASVVATSESRGQDRMQ
>P39180 ~~~flu~~~Antigen 43~~~COG3468
MKRHLNTCYRLVWNHMTGAFVVASELARARGKRGGVAVALSLAAVTSLPVLAADIVVHPGETVNGGTLANHDNQIVFGTT
NGMTISTGLEYGPDNEANTGGQWVQDGGTANKTTVTSGGLQRVNPGGSVSDTVISAGGGQSLQGRAVNTTLNGGEQWMHE
GAIATGTVINDKGWQVVKPGTVATDTVVNTGAEGGPDAENGDTGQFVRGDAVRTTINKNGRQIVRAEGTANTTVVYAGGD
QTVHGHALDTTLNGGYQYVHNGGTASDTVVNSDGWQIVKNGGVAGNTTVNQKGRLQVDAGGTATNVTLKQGGALVTSTAA
TVTGINRLGAFSVVEGKADNVVLENGGRLDVLTGHTATNTRVDDGGTLDVRNGGTATTVSMGNGGVLLADSGAAVSGTRS
DGKAFSIGGGQADALMLEKGSSFTLNAGDTATDTTVNGGLFTARGGTLAGTTTLNNGAILTLSGKTVNNDTLTIREGDAL
LQGGSLTGNGSVEKSGSGTLTVSNTTLTQKAVNLNEGTLTLNDSTVTTDVIAQRGTALKLTGSTVLNGAIDPTNVTLASG
ATWNIPDNATVQSVVDDLSHAGQIHFTSTRTGKFVPATLKVKNLNGQNGTISLRVRPDMAQNNADRLVIDGGRATGKTIL
NLVNAGNSASGLATSGKGIQVVEAINGATTEEGAFVQGNRLQAGAFNYSLNRDSDESWYLRSENAYRAEVPLYASMLTQA
MDYDRIVAGSRSHQTGVNGENNSVRLSIQGGHLGHDNNGGIARGATPESSGSYGFVRLEGDLMRTEVAGMSVTAGVYGAA
GHSSVDVKDDDGSRAGTVRDDAGSLGGYLNLVHTSSGLWADIVAQGTRHSMKASSDNNDFRARGWGWLGSLETGLPFSIT
DNLMLEPQLQYTWQGLSLDDGKDNAGYVKFGHGSAQHVRAGFRLGSHNDMTFGEGTSSRAPLRDSAKHSVSELPVNWWVQ
PSVIRTFSSRGDMRVGTSTAGSGMTFSPSQNGTSLDLQAGLEARVRENITLGVQAGYAHSVSGSSAEGYNGQATLNVTF
>A0KYQ5 3.5.1.-~~~agaAII~~~N-acetylgalactosamine-6-phosphate deacetylase~~~COG1820
MKPNTDFMLIADGAKVLTQGNLTEHCAIEVSDGIICGLKSTISAEWTADKPHYRLTSGTLVAGFIDTQVNGGGGLMFNHV
PTLETLRLMMQAHRQFGTTAMLPTVITDDIEVMQAAADAVAEAIDCQVPGIIGIHFEGPHLSVAKRGCHPPAHLRGITER
EWLLYLRQDLGVRLITLAPESVTPEQIKRLVASGAIISLGHSNADGETVLKAIEAGASGFTHLYNGMSALTSREPGMVGA
AFASENTYCGIILDGQHVHPISALAAWRAKGTEHLMLVTDAMSPLGSDQTEFQFFDGKVVREGMTLRDQHGSLAGSVLDM
ASAVRYAATELNLGLSNAVQMATRTPAEFIQRPQLGDIAEGKQADWVWLDDDQRVLAVWIAGELLYQAEQARFA
>Q8GGD4 3.5.1.133~~~agaA~~~N(alpha)-acyl-glutamine aminoacylase~~~
MAQENLQKIVDSLESSRAEREELYKWFHQHPEMSMQEHETSKRIAEELEKLGLEPQNIGVTGQVAVIKNGEGPSVAFRAD
FDALPITENTGLDYSADPELGMMHACGHDLHTTALLGAVRALVENKDLWSGTFIAVHQPGEEGGGGARHMVDDGLAEKIA
APDVCFAQHVFNEDPAFGYVFTPGRFLTAASNWRIHIHGEGGHGSRPHLTKDPIVVAASIITKLQTIVSREVDPNEVAVV
TVGSIEGGKSTNSIPYTVTLGVNTRASNDELSEYVQNAIKRIVIAECQAAGIEQEPEFEYLDSVPAVINDEDLTEQLMAQ
FREFFGEDQAVEIPPLSGSEDYPFIPNAWGVPSVMWGWSGFAAGSDAPGNHTDKFAPELPDALERGTQAILVAAAPWLMK
>Q8XAC3 3.5.1.-~~~agaA~~~N-acetylgalactosamine-6-phosphate deacetylase~~~COG1820
MTHVLRARRLLTEEGWLDDHQLRIADGVIAAIEPIPVSVTERDAELLCPAYIDTHVHGGAGVDVMDDAPDVLDKLAMHKA
REGVGSWLPTTVTAPLSTIHAALKRIAQRCQRGGPGAQVLGSYLEGPYFTPQNKGAHPPELFRELEIAELDQLIAVSQHT
LRVVALAPEKEGALQAIRHLKQQNVRVMLGHSAATWQQTRAAFDAGADGLVHCYNGMTGLHHREPGMVGAGLTDKRAWLE
LIADGHHVHPAAMSLCCCCAKERIVLITDAMQAAGMPDGRYTLCGEEVQMHGGVVRTASGGLAGSTLSVDAAVRNMVELT
GVTPAEAIHMASLHPARMLGVDGVLGSLKPGKRASVVALDSGLHVQQIWIQGQLASF
>Q9ALJ4 3.2.1.22~~~agaA~~~Alpha-galactosidase AgaA~~~
MSVAYNPQTKQFHLRAGKASYVMQLFRSGYLAHVYWGKAVRDVRGARAFPRLDRAFSPNPDPSDRTFSLDTLLQEYPAYG
NTDFRAPAYQVQLENGSTVTDLRYKTHRIYKGKPRLNGLPATYVEHEQEAETLEIVLGDALIGLEVTLQYTAYEKWNVIT
RSARFENKGGERLKLLRALSMSVDFPTADYDWIHLPGAWGRERWIERRPLVTGVQAAESRRGASSHQQNPFIALVAKNAD
EHQGEVYGFSFVYSGNFLAQIEVDQFGTARVSMGINPFDFTWLLQPGESFQTPEVVMVYSDQGLNGMSQTYHELYRTRLA
RGAFRDRERPILINNWEATYFDFNEEKIVNIARTAAELGIELVVLDDGWFGERDDDRRSLGDWIVNRRKLPNGLDGLAKQ
VNELGLQFGLWVEPEMVSPNSELYRKHPDWCLHVPNRPRSEGRNQLVLDYSREDVCDYIIETISNVLASAPITYVKWDMN
RHMTEIGSSALPPERQRETAHRYMLGLYRVMDEITSRFPHILFESCSGGGGRFDPGMLYYMPQTWTSDNTDAVSRLKIQY
GTSLVYPISAMGAHVSAVPNHQVGRVASLKTRGHVAMSGNFGYELDITKLTETEKQMMKQQVAFYKDVRRLVQFGTFYRL
LSPFEGNEAAWMFVSADRSEALVAYFRVLAEANAPLSYLRLKGLDSNQDYEIEGLGVYGGDELMYAGVALPYRSSDFISM
MWRLKAVQQ
>G0L322 3.2.1.81~~~agaA~~~Beta-agarase A~~~
MKKNYLLLYFIFLLCGSIAAQDWNGIPVPANPGNGMTWQLQDNVSDSFNYTSSEGNRPTAFTSKWKPSYINGWTGPGSTI
FNAPQAWTNGSQLAIQAQPAGNGKSYNGIITSKNKIQYPVYMEIKAKIMDQVLANAFWTLTDDETQEIDIMEGYGSDRGG
TWFAQRMHLSHHTFIRNPFTDYQPMGDATWYYNGGTPWRSAYHRYGCYWKDPFTLEYYIDGVKVRTVTRAEIDPNNHLGG
TGLNQATNIIIDCENQTDWRPAATQEELADDSKNIFWVDWIRVYKPVAVSGGGNNGNDGATEFQYDLGTDTSAVWPGYTR
VSNTTRAGNFGWANTNDIGSRDRGASNGRNNINRDINFSSQTRFFTQDLSNGTYNVLITFGDTYARKNMNVAAEGQNKLT
NINTNAGQYVSRSFDVNVNDGKLDLRFSVGNGGDVWSITRIWIRKVTSNSANLLAAKGLTLEDPVETTEFLYPNPAKTDD
FVTVPNSEIGSSIIIYNSAGQVVKKVSVVSENQKISLEGFAKGMYFINLNGQSTKLIVQ
>Q9RGX8 3.2.1.81~~~agaB~~~Beta-agarase B~~~
MYLIYLRLVFCCALLLGCGDNSKFDSATDLPVEQEQEQETEQEGEPEESSEQDLVEEVDWKDIPVPADAGPNMKWEFQEI
SDNFEYEAPADNKGSEFLEKWDDFYHNAWAGPGLTEWKRDRSYVADGELKMWATRKPGSDKINMGCITSKTRVVYPVYIE
ARAKVMNSTLASDVWLLSADDTQEIDILEAYGADYSESAGKDHSYFSKKVHISHHVFIRDPFQDYQPKDAGSWFEDGTVW
NKEFHRFGVYWRDPWHLEYYIDGVLVRTVSGKDIIDPKHFTNTTDPGNTEIDTRTGLNKEMDIIINTEDQTWRSSPASGL
QSNTYTPTDNELSNIENNTFGVDWIRIYKPVEK
>D7GXG5 3.2.1.81~~~agaC~~~Beta-agarase C~~~
MNLTKMAVFAASLFCLACKNDIDTELEKKSIPESEIQKSEEKLPNEEELTPTDPDEETNKEETVTANATYDFTGNTPPPA
PQGMKWVKISQLSDEFNNGFNTDKWTKSLWNYGVPVQMKAENSGVSDGKLWIKATLGNDPERWFETSRVMSKAQVNYPMY
TVSRIKGAHISAYNTFWLNNGNISNRNEIDVIENNSNPSCNCQPDFPWQMNSQYFHVVNDDTKRNKGNFDNRELSDANPL
KGVAWNEEYHTFGVWWKDATHIQFYLDGEPAGSVVSARDFTRELNIIWDLWTVDADWLGGLAKKEHLSNNNINTMKIDWI
HTYQLVEE
>D7GXG4 3.2.1.81~~~agaD~~~Beta-agarase D~~~
MKRSILLAIIAFLQFFTSYGQYDWDNVPIPANAGAGKTWKLQTAASDDFNYTFNPTNNVVDFGPNGNMKWYNKYHNRPNG
QPNNFEGPGPTKWMQNHVAVSGGNLNIWASRIPGATKSFTGSNNTPISRPETRAGCITNKTRVKYPVFVEARVKVMNSTL
ASDIWLLSPDDTQEIDIMECYGGPGNDNRNSYFASKIHLSHHVFIRPPNFKDYQPADLNSWWGKNGVTQWGGKTIRIGVN
WVSPTRLEYFVDGQMVRILDNDAVQTRLADGTWQYTYPAGVTSTGVNGQLIKENGYQKMNIASSLSDAKNKSNISVIDPF
NYLNNGRKFSKEMDIIINVEDQSWQAEAYRSPNAAEMANFYDNNLLVDWIRVYKPVNASAANSAETTSTVEKPASFEPQG
QPTEKLQVYPVPATDVLNISQSDYVEARVYNLKGWVMLRKDVIDQKIDVSSLKKGIYILEITKATGETVKQKIVISE
>A0KYQ6 2.7.1.-~~~agaK~~~N-acetylgalactosamine kinase AgaK~~~COG1940
MYYGLDIGGTKIELAIFDTQLALQDKWRLSTPGQDYSAFMATLAEQIEKADQQCGERGTVGIALPGVVKADGTVISSNVP
CLNQRRVAHDLAQLLNRTVAIGNDCRCFALSEAVLGVGRGYSRVLGMILGTGTGGGLCIDGKLYLGANRLAGEFGHQGVS
ANVACRHQLPLYVCGCGLEGCAETYVSGTGLGRLYQDIAGQTADTFAWLNALRCNDPLAIKTFDTYMDILGSLMASLVLA
MDPDIIVLGGGLSEVEEILAALPQATKAHLFDGVTLPQFKLADFGSASGVRGAALLGHGLDAGISYEA
>O34645 3.2.1.22~~~melA~~~Alpha-galactosidase~~~COG1486
MKKITFIGAGSTIFAKNVLGDCLLTEALNGFEFALYDIDPKRLQESQLMLENLRDRYNPSVAINSYDDRKLALQNAGYVI
NAIQVGGYKPSTVIDFEIPKRYGLRQTIADTVGIGGIFRSLRTIPVLFDIAKDMEEMCPDAWFLNYTNPMATLTGAMLRY
TNIKTIGLCHSVQVCTKDLFKALGMEHDGIEERIAGINHMAWLLEVKKDGTDLYPEIKRRAKEKQKTKHHDMVRFELMDK
FGYYVTESSEHNAEYHPYFIKRNYPELISELQIPLDEYPRRCVKQIENWEKMRDDIVNNKNLTHERSKEYGSRIIEAMET
NEPFTFGGNVLNTGLITNLPSKAVVEVTCVADRKKITPCFAGELPEQLAALNRTNINTQLMTIEAAVTRKKEAVYQAAML
DPHTSAELSMKDIISMCDDLFAAHGDWLPEYK
>Q8A6L0 3.2.1.22~~~~~~Retaining alpha-galactosidase~~~COG4948
MKKLTFLLLCVLCTLSLQAQKQFTLASPDGNLKTTITIGDRLTYDITCNGRQILTPSPISMTLDNGTVWGENAKLSGTSR
KSVDEMIPSPFYRASELRNHYNGLTLRFKKDWNVEFRAYNDGIAYRFVNQGKKPFRVVTEVSDYCFPSDMTASVPYVKSG
KDGDYNSQFFNSFENTYTTDKLSKLNKQRLMFLPLVVDAGDGVKVCITESDLENYPGLYLSASEGANRLSSMHAPYPKRT
VQGGHNQLQMLVKEHEDYIAKVDKPRNFPWRIAVVTTTDKDLAATNLSYLLGAPSRMSDLSWIKPGKVAWDWWNDWNLDG
VDFVTGVNNPTYKAYIDFASANGIEYVILDEGWAVNLQADLMQVVKEIDLKELVDYAASKNVGIILWAGYHAFERDMENV
CRHYAEMGVKGFKVDFMDRDDQEMTAFNYRAAEMCAKYKLILDLHGTHKPAGLNRTYPNVLNFEGVNGLEQMKWSSPSVD
QVKYDVMIPFIRQVSGPMDYTQGAMRNASKGNYYPCYSEPMSQGTRCRQLALYVVFESPFNMLCDTPSNYMREPESTAFI
AEIPTVWDESIVLDGKMGEYIVTARRKGDVWYVGGITDWSARDIEVDCSFLGDKSYHATLFKDGVNAHRAGRDYKCESFP
IKKDGKLKVHLAPGGGFALKIK
>B3PGJ1 3.2.1.22~~~agaA~~~Alpha-galactosidase A~~~COG3345
MRKQLLLGLGLVSALLVSVQASAQKFEQLAKTPQMGWNSWNTFGCNVDEKMIRAMADAMVTSGMKAAGYEYINIDDCWHG
ERDKNGFIQADKKHFPSGMKALADYVHAKGLKLGIYSDAGNTTCAGRPGSRGHEYQDALTYASWGIDYVKYDWCDTQDIN
PKSAYATMRDAIHKAGRPMLFSICEWGDNQPWEWAQDVGHSWRTTGDIYPCWNCEHNHGSWSSFGVLPILDKQAGLRKYA
GPGHWNDMDMMEVGNGMTEEEDRAHFSLWAFMASPLIAGNDLRNMSDTTRAILTHKETIAINQDKLGIQAMKWIDEGDLE
IYIKPLEKGHYAVLFLNRADDAMDYRFDWSFHYMKDDISKHEIFFDKQAFNWRNIWNGETGSTKEVLNIKVPAHGVVVLR
LSPR
>P06720 3.2.1.22~~~melA~~~Alpha-galactosidase~~~COG1486
MMSAPKITFIGAGSTIFVKNILGDVFHREALKTAHIALMDIDPTRLEESHIVVRKLMDSAGASGKITCHTQQKEALEDAD
FVVVAFQIGGYEPCTVTDFEVCKRHGLEQTIADTLGPGGIMRALRTIPHLWQICEDMTEVCPDATMLNYVNPMAMNTWAM
YARYPHIKQVGLCHSVQGTAEELARDLNIDPATLRYRCAGINHMAFYLELERKTADGSYVNLYPELLAAYEAGQAPKPNI
HGNTRCQNIVRYEMFKKLGYFVTESSEHFAEYTPWFIKPGREDLIERYKVPLDEYPKRCVEQLANWHKELEEYKKASRID
IKPSREYASTIMNAIWTGEPSVIYGNVRNDGLIDNLPQGCCVEVACLVDANGIQPTKVGTLPSHLAALMQTNINVQTLLT
EAILTENRDRVYHAAMMDPHTAAVLGIDEIYALVDDLIAAHGDWLPGWLHR
>P0DTR5 3.2.1.-~~~~~~A type blood alpha-D-galactosamine galactosaminidase~~~
MRGKKFISLTLSTMLCLQLLPTASFAAAPATDTGNAGLIAEGDYAIAGNGVRVTYDADGQTITLYRTEGSGLIQMSKPSP
LGGPVIGGQEVQDFSHISCDVEQSTSGVMGSGQRMTITSQSMSTGLIRTYVLETSDIEEGVVYTATSYEAGASDVEVSWF
IGSVYELYGAEDRIWSYNGGGEGPMHYYDTLQKIDLTDSGKFSRENKQDDTAASIPVSDIYIADGGITVGDASATRREVH
TPVQETSDSAQVSIGWPGKVIAAGSVIEIGESFAVVHPGDYYNGLRGYKNAMDHLGVIMPAPGDIPDSSYDLRWESWGWG
FNWTIDLIIGKLDELQAAGVKQITLDDGWYTNAGDWALNPEKFPNGASDALRLTDAIHEHGMTALLWWRPCDGGIDSILY
QQHPEYFVMDADGRPARLPTPGGGTNPSLGYALCPMADGAIASQVDFVNRAMNDWGFDGFKGDYVWSMPECYNPAHNHAS
PEESTEKQSEIYRVSYEAMVANDPNVFNLLCNCGTPQDYYSLPYMTQIATADPTSVDQTRRRVKAYKALMGDYFPVTADH
NNIWYPSAVGTGSVLIEKRDLSGTAKEEYEKWLGIADTVQLQKGRFIGDLYSYGFDPYETYVVEKDGVMYYAFYKDGSKY
SPTGYPDIELKGLDPNKMYRIVDYVNDRVVATNLMGDNAVFNTRFSDYLLVKAVEISEPDPEPVDPDYGFTSVDDRDEAL
IYTGTWHDDNNASFSEGTARYTNSTDASVVFSFTGTSIRWYGQRDTNFGTAEVYLDDELKTTVDANGAAEAGVCLFEALD
LPAAEHTIKIVCKSGVIDIDRFAYEAATLEPIYEKVDALSDRITYVGNWEEYHNSEFYMGNAMRTDEAGAYAELTFRGTA
VRLYAEMSFNFGTADVYLDGELVENIILYGQEATGQLMFERTGLEEGEHTIRLVQNAWNINLDYISYLPEQDQPTPPETT
VTVDAMDAQLVYTGVWNDDYHDVFQEGTARYASSAGASVEFEFTGSEIRWYGQNDSNFGVASVYIDNEFVQQVNVNGAAA
VGKLLFQKADLPAGSHTIRIVCDTPVIDLDYLTYTTNA
>G4FEF4 3.2.1.22~~~galA~~~Alpha-galactosidase~~~COG3345
MEIFGKTFREGRFVLKEKNFTVEFAVEKIHLGWKISGRVKGSPGRLEVLRTKAPEKVLVNNWQSWGPCRVVDAFSFKPPE
IDPNWRYTASVVPDVLERNLQSDYFVAEEGKVYGFLSSKIAHPFFAVEDGELVAYLEYFDVEFDDFVPLEPLVVLEDPNT
PLLLEKYAELVGMENNARVPKHTPTGWCSWYHYFLDLTWEETLKNLKLAKNFPFEVFQIDDAYEKDIGDWLVTRGDFPSV
EEMAKVIAENGFIPGIWTAPFSVSETSDVFNEHPDWVVKENGEPKMAYRNWNKKIYALDLSKDEVLNWLFDLFSSLRKMG
YRYFKIDFLFAGAVPGERKKNITPIQAFRKGIETIRKAVGEDSFILGCGSPLLPAVGCVDGMRIGPDTAPFWGEHIEDNG
APAARWALRNAITRYFMHDRFWLNDPDCLILREEKTDLTQKEKELYSYTCGVLDNMIIESDDLSLVRDHGKKVLKETLEL
LGGRPRVQNIMSEDLRYEIVSSGTLSGNVKIVVDLNSREYHLEKEGKSSLKKRVVKREDGRNFYFYEEGERE
>P07883 3.2.1.81~~~dagA~~~Extracellular agarase~~~COG2273
MVNRRDLIKWSAVALGAGAGLAGPAPAAHAADLEWEQYPVPAAPGGNRSWQLLPSHSDDFNYTGKPQTFRGRWLDQHKDG
WSGPANSLYSARHSWVADGNLIVEGRRAPDGRVYCGYVTSRTPVEYPLYTEVLMRVSGLKLSSNFWLLSRDDVNEIDVIE
CYGNESLHGKHMNTAYHIFQRNPFTELARSQKGYFADGSYGYNGETGQVFGDGAGQPLLRNGFHRYGVHWISATEFDFYF
NGRLVRRLNRSNDLRDPRSRFFDQPMHLILNTESHQWRVDRGIEPTDAELADPSINNIYYRWVRTYQAV
>G4T4R7 ~~~agaSK~~~Bifunctional alpha-galactosidase/sucrose kinase AgaSK~~~
MAIIYNPNKKIFTLHTAHTTYQMQVDPLGYLLHLYYGEKTNSSMDYVLTYADRGFSGNPYAAGMDRTYSLDALPQEYPSL
GTGDYRNIALNIKNEKGVESADLLFKSYEIRNGKYRLQGLPAVWADEKEAQTLEIVLADENAQVEVHLLYGVLEENDVIT
RSVRIKNTGTGQITIEKAAAACLDFVQGEFDVLRFYGKHAMERNLERTPLGHGTIAFGSRRGTSSHQYNPAVILAEKGTT
ETAGSCYGMLFVYSGNFSCEAEKDQFNQTRLLLGLNEELFSYPLASGETFTVPEVILSYSAEGLSALSQQYHNCIRNHVC
RSKYVHMQRPVLINSWEAAYFDFTGDTIVDLAKEAASLGIDMVVMDDGWFGKRNDDNSSLGDWQVNETKLGGSLAELITR
VHEQGMKFGIWIEPEMINEDSDLYRAHPDWAIRIQGKKPVRSRNQLLLDFSRKEVRDCVFDQICVVLDQGKIDYVKWDMN
RSMADVYAGNLSYDYVLGVYDFMERLCSRYPDLLLEGCSGGGGRFDAGMLYYSPQIWCSDNTDAINRTRIQYGTSFFYPV
SAMGAHVSAVPNHQTGRVTSFHTRGVTAMAGTFGYELNPALLSDEEKQQIREQIKTYKKYETLINEGTYWRLSDPFTDEI
AAWMSVSEEQDHALVSVVRLMAEANQATVYVRLRGLKPDAVYLEEQSGRQYSGAALMHAGIPLPPFTEEYEAYQFAFTEL
KEAGRLYEKVQKWCDGNAENRVVISIYGGSGSGKTTLATALQQYFLNDGTECYLLSGDDYPHRIPKRNDEERMRVYKEAG
EDGLRGYLGTKKEIDFDRINEVLAAFHEGKDSITLRHMGREDGEISLEETDFSGISVLLLEWTHGGSDDLHGVDLPVFLE
SSPGETRERRIRRNRDENAASPFICRVVELEQEKLEVQRKNAGLIVGKDGSVYEQ
>Q8XAC2 3.5.99.-~~~agaS~~~D-galactosamine-6-phosphate deaminase AgaS~~~COG2222
MPENYTPAAAATGTWTEEEIRHQPRAWIRSLTNIDALHSALNNFLEPLLRKENLRIILTGAGTSAFIGDIIAPWLASHTG
KNFSAVPTTDLVTNPMDYLNPAHPLLLISFGRSGNSPESVAAVELANQFVPECYHLPITCNEAGALYQNAINSDNAFAVL
MPAETHDRGFAMTSSITTMMASCLAVFAPETINSQTFRDVADRCQAILTSLGDFSEGVFGYAPWKRIVYLGSGGLQGAAR
ESALKVLELTAGKLAAFYDSPTGFRHGPKSLVDNETLVVVFVSSHPYTRQYDLDLLAELHRDNQAMRVIAIAAESSDIVA
AGPHIILPPSRHFIDVEQAFCFLMYAQTFALMQSLHMGNTPDTPSASGTVNRVVQGVIIHPWQA
>P42907 3.5.99.-~~~agaS~~~Putative D-galactosamine-6-phosphate deaminase AgaS~~~COG2222
MPENYTPAAAATGTWTEEEIRHQPRAWIRSLTNIDALRSALNNFLEPLLRKENLRIILTGAGTSAFIGDIIAPWLASHTG
KNFSAVPTTDLVTNPMDYLNPAHPLLLISFGRSGNSPESVAAVELANQFVPECYHLPITCNEAGALYQNAINSDNAFALL
MPAETHDRGFAMTSSITTMMASCLAVFAPETINSQTFRDVADRCQAILTSLGDFSEGVFGYAPWKRIVYLGSGGLQGAAR
ESALKVLELTAGKLAAFYDSPTGFRHGPKSLVDDETLVVVFVSSHPYTRQYDLDLLAELRRDNQAMRVIAIAAESSDIVA
AGPHIILPPSRHFIDVEQAFCFLMYAQTFALMQSLHMGNTPDTPSASGTVNRVVQGVIIHPWQA
>Q9KIP9 3.5.99.-~~~agaS~~~D-galactosamine-6-phosphate deaminase AgaS~~~COG2222
MPENYTPAAAATGTWTEEEIRHQPRAWIRSLTNIDALRSALNNFLEPLLRKENLRIILTGAGTSAFIGDIIAPWLASHTG
KNFSAVPTTDLVTNPMDYLNPAHPLLLISFGRSGNSPESVAAVELANQFVPECYHLPITCNEAGALYQNAINSDNAFALL
MPAETHDRGFAMTSSITTMMASCLAVFAPETINSQTFRDVADRCQAILTTLGDFSEGVFGYAPWKRIVYLGSGGLQGAAR
ESALKVLELTAGKLAAFYDSPTGFRHGPKSLVDDETLVVVFVSSHPYTRQYDLDLLAELRRDNQAMRVIAIAAESSDIVA
AGPHIILPPSRHFIDVEQAFCFLMYAQTFALMQSLHMGNTPDTPSASGTVNRVVQGVIIHPWQA
>A0KYQ7 3.5.99.-~~~agaS~~~D-galactosamine-6-phosphate deaminase AgaS~~~COG2222
MLTSPLSPFEHEDSNLLLSAEQLTQYGAFWTAKEISQQPKMWRKVSEQHSDNRTIAAWLTPILAKPQLRIILTGAGTSAY
IGDVLAAHIQQHLPLATQQVEAISTTDIVSHPELYLRGNIPTLLISYGRSGNSPESMAAVELAEQLVDDCYHLAITCNGQ
GKLANYCADKSHCYLYKLPDETHDVSFAMTSSFTCMYLATLLIFAPNSQALMQCIEMAEHILTERLADIRLQSEQPSKRV
VFLGGGPLKAIAQEAALKYLELTAGQVVSAFESPLGFRHGPKSLVDSHTQVLVMMSSDPYTRQYDNDLIQELKRDNQALS
VLTLSEELLTGSSGLNEVWLGLPFILWCQILAIYKAIQLKVSPDNPCPTGQVNRVVQGVNVYPFVK
>P9WMS9 ~~~~~~Arabinogalactan biosynthesis recruiting protein Rv3789~~~COG2246
MRFVVTGGLAGIVDFGLYVVLYKVAGLQVDLSKAISFIVGTITAYLINRRWTFQAEPSTARFVAVMLLYGITFAVQVGLN
HLCLALLHYRAWAIPVAFVIAQGTATVINFIVQRAVIFRIR
>P46006 ~~~aggB~~~Protein AggB~~~
MLKKSILPMSCGVLVMVMSGLLDAAEITLISHKTLGSQLRDGMKLATGRIACREPHDGFHIWINASQNGKVGHYIVQNNR
ETKHELKVKIGGGGWSSSLIEGQRGVYRQGEEKQAIFDIMSDGNQYSAPGEYIFSVSGECLISRG
>Q8DPV9 2.4.1.-~~~cpoA~~~Alpha-galactosylglucosyldiacylglycerol synthase~~~COG0438
MRKFPLFSSSLSFLLLILFKENDIIVVMEKKKLRINMLSSSEKVAGQGVSGAYRELVRLLHRAAKDQLIVTENLPIEADV
THFHTIDFPYYLSTFQKKRSGRKIGYVHFLPATLEGSLKIPFFLKGIVKRYVFSFYNRMEHLVVVNPMFIEDLVAAGIPR
EKVTYIPNFVNKEKWHPLPQEEVVRLRTDLGLSDNQFIVVGAGQVQKRKGIDDFIRLAEELPQITFIWAGGFSFGGMTDG
YEHYKTIMENPPKNLIFPGIVSPERMRELYALADLFLLPSYNELFPMTILEAASCEAPIMLRDLDLYKVILEGNYRATAG
REEMKEAILEYQANPAVLKDLKEKAKNISREYSEEHLLQIWLDFYEKQAALGRK
>O33830 3.2.1.20~~~aglA~~~Alpha-glucosidase~~~COG1486
MPSVKIGIIGAGSAVFSLRLVSDLCKTPGLSGSTVTLMDIDEERLDAILTIAKKYVEEVGADLKFEKTMNLDDVIIDADF
VINTAMVGGHTYLEKVRQIGEKYGYYRGIDAQEFNMVSDYYTFSNYNQLKYFVDIARKIEKLSPKAWYLQAANPIFEGTT
LVTRTVPIKAVGFCHGHYGVMEIVEKLGLEEEKVDWQVAGVNHGIWLNRFRYNGGNAYPLLDKWIEEKSKDWKPENPFND
QLSPAAIDMYRFYGVMPIGDTVRNSSWRYHRDLETKKKWYGEPWGGADSEIGWKWYQDTLGKVTEITKKVAKFIKENPSV
RLSDLGSVLGKDLSEKQFVLEVEKILDPERKSGEQHIPFIDALLNDNKARFVVNIPNKGIIHGIDDDVVVEVPALVDKNG
IHPEKIEPPLPDRVVKYYLRPRIMRMEMALEAFLTGDIRIIKELLYRDPRTKSDEQVEKVIEEILALPENEEMRKHYLKR
>O86960 3.2.1.20~~~aglA~~~Alpha-glucosidase~~~
MPAVKIGIIGAGSAVFSLRLVSDLCKTPGLSGSTVTLMDIDEERLDAVLTIAKKYVEEVGADLKFEKTTSVDEAIADADF
VINTAMVGGHTYLEKVRRISEKYGYYRGIDAQEFNMVSDYYTFSNYNQLKYFVDIARKIERLSPKAWYSAAANPVFEGTT
LVTRTVPIKAVGFCHGHYGVMEIIEKLGLERKQVDWQVAGVNHGIWLNRFRYNGEDAYPLLPRWISEKSKDWKPENPFND
QLSPAAIDMYKFYGVMPIGDTVRNASWRYHRDLETKKRWYGEPWGGADSEIGWKWYQDTLGKVTDITKKVAKFIKENPAL
KLSDLGSVLGKDLSEKQFVLEVEKILDPEKKSGEQHISFHDALLNDNRSRFVINIPNKGIIQGIDDDVVVEVPAVVDRDG
IHPEKIDPPLPERVVKYYLRPRIMRMEMALEAFLTGDIRIIKEVLYRDPRTKSDEQVEKVIEEILSLPENEEMRKNYLKK
>Q9AGA6 3.2.1.122~~~aglB~~~6-phospho-alpha-glucosidase~~~
MKKFSVVIAGGGSTFTPGIVLMLLANQDRFPLRSLKFYDNDGARQETIAEACKVILKEQAPEIEFSYTTDPQAAFTDVDF
VMAHIRVGKYPMREQDEKIPLRHGVLGQETCGPGGIAYGMRSIGGVLELVDYMEKYSPNAWMLNYSNPAAIVAEATRRLR
PNAKILNICDMPIGIEGRMAQIVGLKDRKQMRVRYYGLNHFGWWTSIEDLDGNDLMPKLREYVAKYGYVPPSNDPHTEAS
WNDTFAKAKDVQALDPQTMPNTYLKYYLFPDYVVAHSNPERTRANEVMDHREKNVFSACRAIIAAGKSTAGDLEIDEHAS
YIVDLATAIAFNTQERMLLIVPNNGAIHNFDADAMVEIPCLVGHNGPEPLTVGDIPHFQKGLMSQQVAVEKLVVDAWEQR
SYHKLWQAITLSKTVPSASVAKAILDDLIAANKDYWPELH
>Q9X2F4 3.2.1.54~~~aglB~~~Cyclomaltodextrinase~~~COG0366
MMYPMPSWVYDSVVYQIFPDRFFIGKGKTVEDKKDLYLKRGGVIEKWGVPPRKLPGAQHVKIFYGGDLWGIAEKVDYFEE
LGINVLYLTPIFLSDTNHKYDTIDYFRVDPQFGGKRAFLHLLRVLHERSMKLILDGVFNHVGSQHPWFKKAKKNDPEYVN
RFFLYKDRHRSWFDVGSLPELNVEVEEVKEYILKVVEHYLKLGIDGWRLDCGHDLGPTVNLWINMKVKEFSAEKYLVSEI
WTYPAGWDMVDGLMNYNFRNLVLSYVNGETDSIGFHLERAYRETKNIFGCWNMLDSHDTPRLATMVPDRDLRKLAVVLQF
TYPGVPLVYYGTEIGLTGGEDPECRATMEWNREKWDVDLFEFYKKMIRLRRTDPGLRFGEFVLLNDSPLAFLRKAPHPLQ
NTIVVVNPGEEKVLVLSIPDGKIMNTTPLVDVFSGERFHVDGGVVKLPLLARSFRILKPEDLRVGKYRLYKRV
>O86959 3.2.1.54~~~aglB~~~Cyclomaltodextrinase~~~
MYPIPSWVYDSVVYQIFPDRFFIGKGKTVEDKKDLYLKRGGTIEKWGVPPRKLPGAQHVKVFYGGDLWGIAEKIDYLEEL
GVNAVYLTPIFLSDTNHKYDTIDYFKIDPQFGGKRAFVHLLKVLHSRNIKLILDGVFNHVGSQHPWFKKARKKDPEYVNR
FFLYRDRHRSWFDVGSLPELNVEVEEVREYILKVVQHYLEVGVDGWRLDCGHDLGPLVNLWINMKVKEFSSEKYLVSEIW
TYPAGWEMVDGLMNYNFRSLVLSYVNGETDSIGTELERAYRETKNIFGCWNMLDSHDTPRLATTVPVKDLRKLAIVLQFT
YPGVPLVYYGTEIGLTGGEDPECRATMEWNREKWDMELFEFYKKMIRFRRTDPGLRFGEFILLKEKPLAFMRKAPHPLQD
TIVVVNPEEEKNVVLSLPDGKIMNATPLFDIFTGEKFHVDGGVVKVPVGRRSFRILKPIDLRVGRYRLYKRV
>Q1D823 ~~~aglZ~~~Adventurous-gliding motility protein Z~~~COG0745
MERRVLIVESEHDFALSMATVLKGAGYQTALAETAADAQRELEKRRPDLVVLRAELKDQSGFVLCGNIKKGKWGQNLKVL
LLSSESGVDGLAQHRQTPQAADGYLAIPFEMGELAALSHGIVPPGTDDTGASLDAALNGTREAPPPMPPSLKAAAGGPPK
LPKRERRSAMTEEDRAFLDRTFQSIADRKAELLAESRQLKRPPPRRELMGTPEGKIQILRDELKTREAQLARLSEIWNVR
ERELLSGEDRIHEKDVELQGLKMQVDDLLRRFNEAQQATIQKEREHGATVDDLLLQKFSAEKDLIEVVASKEKDINLLRR
EVSRAEEELSRRAGELEHGRNEYDKLEKHLGVVTLEFEVKEQKLQDTVLANEGEIARLTKRGDDFEAELNRTISERDQRF
AELDGEIQALQERLQQTEQERDTTVRGLEARAARAEEHGTQADAEIHRLNAERDALEAKLSQQVADLEADLARTMGERDQ
LRLDKDAQEAELTQRIEERDAKLGTLERELSETIARNEHTEAELNANIQQQLERIGELEGEVEAVKTHLEDRENELTAEL
QALGQAKDELETDLNDRLQALSQAKDALEADLSRQLEELRSAKAELEADLTGQIQALTSQLEETQRQLDDSQRTGEQLSA
RVAQLEDTVSQRESTIESLQGDVAARDQRISELSGDLEATSQTLAQTQQTLAQTEQQLADTQNTLASTEGALAETRGELD
ATSQTLQQTQQTLAQTEGALAETRGELDATSQTLAQTQQTLAQTEQQLADTQNTLASTEGTLAETRGELEATSQTLQQTH
AALEDTRGALQETSDTLAHTTRERDQRIAELADLGAAKDALEQELTGQIGHLRSELSETQGNYEAERAAHEKLAAESSAH
IGDLTSERDGLRSELEATSQTLEQTHGQLAATRDALAREQHAHQESRKAAASTQTTLEGQLAEARAHGEDLGEHLTLTKH
ELGTRVAELTQLTATLAQTENTRAHLEERLHTLTEESQRREELLQNDLTQKGTELSDTLRKLTHVTQEKMRQAEVLNREV
ATRTEQLKAMEAKLQTQATEARRQAEGLGQQITGLNEQLEQGRKALAGREDQLRAAGAAQQKLTAERDGLAGQLQQAEAR
LQQQAQQANQERADAKRAADELAAKLAKTEQRITQFAQDAQTQATEADARAKDLQGQLSARAKKIQDLELAVENAQGAKS
RAEKELNAKVAAAESKAHEASTRLAAAQKERKDLEARHAKEQEDLAAKQKAELERRDAIKAQEVARLQQSVQEKSKALKV
AELELARYKSKSATTATPAKAAAKPAAAEDDELAVRTQLNQVIAPAAAAQAPAPAKKPAAKPAAQAPAKKAPAPAPAPPA
ALSDESEPTDRTLVIQLPTAKEDDDWTALVDELDK
>O67434 3.1.26.-~~~ago~~~Protein argonaute~~~COG1431
MGKEALLNLYRIEYRPKDTTFTVFKPTHEIQKEKLNKVRWRVFLQTGLPTFRREDEFWCAGKVEKDTLYLTLSNGEIVEL
KRVGEEEFRGFQNERECQELFRDFLTKTKVKDKFISDFYKKFRDKITVQGKNRKIALIPEVNEKVLKSEEGYFLLHLDLK
FRIQPFETLQTLLERNDFNPKRIRVKPIGIDFVGRVQDVFKAKEKGEEFFRLCMERSTHKSSKKAWEELLKNRELREKAF
LVVLEKGYTYPATILKPVLTYENLEDEERNEVADIVRMEPGKRLNLIRYILRRYVKALRDYGWYISPEEERAKGKLNFKD
TVLDAKGKNTKVITNLRKFLELCRPFVKKDVLSVEIISVSVYKKLEWRKEEFLKELINFLKNKGIKLKIKGKSLILAQTR
EEAKEKLIPVINKIKDVDLVIVFLEEYPKVDPYKSFLLYDFVKRELLKKMIPSQVILNRTLKNENLKFVLLNVAEQVLAK
TGNIPYKLKEIEGKVDAFVGIDISRITRDGKTVNAVAFTKIFNSKGELVRYYLTSYPAFGEKLTEKAIGDVFSLLEKLGF
KKGSKIVVHRDGRLYRDEVAAFKKYGELYGYSLELLEIIKRNNPRFFSNEKFIKGYFYKLSEDSVILATYNQVYEGTHQP
IKVRKVYGELPVEVLCSQILSLTLMNYSSFQPIKLPATVHYSDKITKLMLRGIEPIKKEGDIMYWL
>A4WYU7 ~~~ago~~~Protein argonaute~~~
MAPVQAADEMYDSNPHPDRRQLVSNGFEVNLPDQVEVIVRDLPDPSKVKEERTRLMGYWFVHWFDGKLFHLRIKAGGPNV
DGEHRAIRTAEHPWLLRARLDDALEEALPKYAAVKKRPFTFLAQKDELIDAAATAAGLSHRLLNSFKVIPRFALSPKIYE
PVDGTTRVGVFVTIGMRYDIEASLRDLLEAGIDLRGMYVVRRKRQPGERGLLGRVRAISDDMVQLFEETDLASVNVNDAK
LEGSKENFTRCLSALLGHNYKKLLNALDDQEAGYRTGPRFDDAVRRMGEFLAKKPIRLADNINAQVGDRIVFSNEGQARN
VRLAPKVEYVFDRTGAKSAEYAWRGLSQFGPFDRPSFANRSPRILVVYPSSTQGKVENFLSAFRDGMGSNYSGFSKGFVD
LMGLTKVEFVMCPVEVSSADRNGAHTKYNSAIEDKLAGAGEVHAGIVVLFEDHARLPDDRNPYIHTKSLLLTLGVPTQQV
RMPTVLLEPKSLQYTLQNFSIATYAKLNGTPWTVNHDKAINDELVVGMGLAELSGSRTEKRQRFVGITTVFAGDGSYLLG
NVSKECEYEGYSDAIRESMTGILRELKKRNNWRPGDTVRVVFHAHRPLKRVDVASIVFECTREIGSDQNIQMAFVTVSHD
HPFVLIDRSERGLEAYKGSTARKGVFAPPRGAISRVGRLTRLLAVNSPQLIKRANTPLPTPLLVSLHPDSTFKDVDYLAE
QALKFTSLSWRSTLPAATPVTIFYSERIAELLGRLKSIPNWSSANLNIKLKWSRWFL
>A0A1M5A5Z8 3.1.24.-~~~ago~~~Protein argonaute~~~
MYLNLYEIKIPYRVKRLYYFNKENDPKEFARNLSRVNNIRFNDSKDLVWLEIPDIDFKITPQQAEKYKIEKNEIIGEKED
SDLFVKTIYRYIKKKFIDNNFYYKRGNNYISINDKFPLDSNTNVNAHLTYKIKLYKINERYYISVLPKFTFLSDKPALES
PIKSTYLFNIKSGKTFPYISGLNGVLKIDLGENGIKEVLFPENYYFNFTSKEAEKFGFSKEIHNIYKEKIFSGYKKIKQS
LYFLEDIININNYNLTMDKKIYVNIEYEFKKGISRNIKDVFKYSFYKNDQKIKIAFFFSSKKQIYEIQRSLKMLFQNKNS
IFYQTIYEMGFSKVIFLREPKTNSSAFMYNPETFEISNKDFFENLEGNIMAIIILDKFLGNIDSLIQKFPENLILQPILK
EKLEKIQPYIIKSYVYKMGNFIPECQPYVIRNLKDKNKTLYIGIDLSHDNYLKKSNLAISAVNNFGDIIYLNKYKNLELN
EKMNLDIVEKEYIQILNEYYERNKNYPENIIVLRDGRYLEDIEIIKNILNIENIKYSLIEVNKSVNINSCEDLKEWIIKL
SDNNFIYYPKTYFNQKGVEIKIIENNTDYNNEKILEQVYSLTRVVHPTPYVNYRLPYPLQVVNKVALTELEWKLYIPYMK
>H2J4R4 3.1.24.-~~~ago~~~Protein argonaute~~~COG1431
MYLNLYKIDIPKKIKRLYFYNPDMEPKLFARNLSRVNNFKFQDSNDLVWIEIPDIDFQITPKNVFQYKVEKEEIIKEEED
KKLFVKTLYKYIKKLFLDNDFYFKKGNNFISNSEVFSLDSNENVNAHLTYKIKIHNISNEYYLSILPKFTFLSKEPALES
AIKSGYLYNIKSGKSFPYISGLDGILKIDIGNNQIVEVAYPENYLFNFTTRDAEKYGFSKEVHEIYKNKVFEGFKKIPKT
LGFLNKITNLNENYQLKDGYKIFINVIYKFKNGESRYAKDVFKYSFYKNEQPLKAIFFFSSKKQFFEVQKSLKELFHNKH
SVFYRAAAELGFSKVEFLRDSKTKSSAFLYNPEEFTVKNTEFINQIEDNVMAIVLLDKYIGNIDPLVRNFPDNLILQPIL
KEKLEDIKPFIIKSYVYKMGNFIPECKPFILKKMEDKEKNLYIGIDLSHDTYARKTNLCIAAVDNTGDILYIGKHKNLEL
NEKMNLDILEKEYIKAFEKYIEKFNVSPENVFILRDGRFIEDIEIIKNFISYNDTKYTLVEVNKNTNINSYDDLKEWIIK
LDENTYIYYPKTFLNQKGVEVKILENNTDYTIEEIIEQIYLLTRVAHSTPYTNYKLPYPLHIANKVALTDYEWKLYIPY
>Q31N05 3.1.24.-~~~ago~~~Protein argonaute~~~COG1431
MDLLSNLRRSSIVLNRFYVKSLSQSDLTAYEYRCIFKKTPELGDEKRLLASICYKLGAIAVRIGSNIITKEAVRPEKLQG
HDWQLVQMGTKQLDCRNDAHRCALETFERKFLERDLSASSQTEVRKAAEGGLIWWVVGAKGIEKSGNGWEVHRGRRIDVS
LDAEGNLYLEIDIHHRFYTPWTVHQWLEQYPEIPLSYVRNNYLDERHGFINWQYGRFTQERPQDILLDCLGMSLAEYHLN
KGATEEEVQQSYVVYVKPISWRKGKLTAHLSRRLSPSLTMEMLAKVAEDSTVCDREKREIRAVFKSIKQSINQRLQEAQK
TASWILTKTYGISSPAIALSCDGYLLPAAKLLAANKQPVSKTADIRNKGCAKIGETSFGYLNLYNNQLQYPLEVHKCLLE
IANKNNLQLSLDQRRVLSDYPQDDLDQQMFWQTWSSQGIKTVLVVMPWDSHHDKQKIRIQAIQAGIATQFMVPLPKADKY
KALNVTLGLLCKAGWQPIQLESVDHPEVADLIIGFDTGTNRELYYGTSAFAVLADGQSLGWELPAVQRGETFSGQAIWQT
VSKLIIKFYQICQRYPQKLLLMRDGLVQEGEFQQTIELLKERKIAVDVISVRKSGAGRMGQEIYENGQLVYRDAAIGSVI
LQPAERSFIMVTSQPVSKTIGSIRPLRIVHEYGSTDLELLALQTYHLTQLHPASGFRSCRLPWVLHLADRSSKEFQRIGQ
ISVLQNISRDKLIAV
>Q746M7 3.1.24.-~~~ago~~~Protein argonaute~~~COG1431
MNHLGKTEVFLNRFALRPLNPEELRPWRLEVVLDPPPGREEVYPLLAQVARRAGGVTVRMGDGLASWSPPEVLVLEGTLA
RMGQTYAYRLYPKGRRPLDPKDPGERSVLSALARRLLQERLRRLEGVWVEGLAVYRREHARGPGWRVLGGAVLDLWVSDS
GAFLLEVDPAYRILCEMSLEAWLAQGHPLPKRVRNAYDRRTWELLRLGEEDPKELPLPGGLSLLDYHASKGRLQGREGGR
VAWVADPKDPRKPIPHLTGLLVPVLTLEDLHEEEGSLALSLPWEERRRRTREIASWIGRRLGLGTPEAVRAQAYRLSIPK
LMGRRAVSKPADALRVGFYRAQETALALLRLDGAQGWPEFLRRALLRAFGASGASLRLHTLHAHPSQGLAFREALRKAKE
EGVQAVLVLTPPMAWEDRNRLKALLLREGLPSQILNVPLREEERHRWENALLGLLAKAGLQVVALSGAYPAELAVGFDAG
GRESFRFGGAACAVGGDGGHLLWTLPEAQAGERIPQEVVWDLLEETLWAFRRKAGRLPSRVLLLRDGRVPQDEFALALEA
LAREGIAYDLVSVRKSGGGRVYPVQGRLADGLYVPLEDKTFLLLTVHRDFRGTPRPLKLVHEAGDTPLEALAHQIFHLTR
LYPASGFAFPRLPAPLHLADRLVKEVGRLGIRHLKEVDREKLFFV
>Q9X4Y1 ~~~agpA~~~Periplasmic alpha-galactoside-binding protein~~~COG0747
MKTHRLNMTASLLIGISAFAVQAFASEPTVVPEQPPFPAQGKITYVSRDSILEFKALREYREPEWVTEKFVKAGKLPPVA
ERLPKEPMVFKAGNMPDGMGVYGDVMRHVIGGRPEGWNYSAGQTQGWGGIDIGMFECLTRTAPLFQVEADDMEPLPNLAK
SWDWSEDGRKLTMHLIEGAKWSDGDPFDADDVMFYWEDNVLDSSVSPLNGATPETFGEGTTLKKIDQYTVEWTFKEAFPR
QHLFAMAYGTFCPGPSHILKTKHPKYAGTTYNEYKNGFPAEYMNLPVMGAWVPVAYRPDDIIVLRRNPYYWKVDEAGNQL
PYLNELHYKLSTWADRDVQAIAGSGDISNLEQPENFVESLKRAANESAPARLAFGPRVIGYNMHMNFSGNGWGDPDERAK
AVRELNRNLDFRKAVTMAVDRKKLGEALVKGPFTAIYPGGLSSGTSFYDRNSTIYYPHDLEGAKVLLEKVGLKDTDGNGF
VNFPAGKLGGRDVEIVLLVNSDYSTDRNLAEGMVGQMEKLGLRVVLNALDGKQRDAANYAGRFDWMIHRNTAEFASVVQN
TPQLAPTGPRTSWHHRAPEGGEVDVMPHEQELVDIVNKFIASNDNDERTELMKQYQKVATTNVDTVGLTEYPGALIINKR
FSNIPPGAPIFMFNWAEDTIIRERVFVAADKQGDYELYPEQLPGKPGESGPIN
>P19926 3.1.3.10~~~agp~~~Glucose-1-phosphatase~~~
MNKTLIAAAVAGIVLLASNAQAQTVPEGYQLQQVLMMSRHNLRAPLANNGSVLEQSTPNKWPEWDVPGGQLTTKGGVLEV
YMGHYMREWLAEQGMVKSGECPPPYTVYAYANSLQRTVATAQFFITGAFPGCDIPVHHQEKMGTMDPTFNPVITDDSAAF
SEQAVAAMEKELSKLQLTDSYQLLEKIVNYKDSPACKEKQQCSLVDGKNTFSAKYQQEPGVSGPLKVGNSLVDAFTLQYY
EGFPMDQVAWGEIKSDQQWKVLSKLKNGYQDSLFTSPEVARNVAKPLVSYIDKALVTDRTSAPKITVLVGHDSNIASLLT
ALDFKPYQLHDQNERTPIGGKIVFQRWHDSKANRDLMKIEYVYQSAEQLRNADALTLQAPAQRVTLELSGCPIDADGFCP
MDKFDSVLNEAVK
>Q5HEG2 ~~~agrA~~~Accessory gene regulator A~~~
MKIFICEDDPKQRENMVTIIKNYIMIEEKPMEIALATDNPYEVLEQAKNMNDIGCYFLDIQLSTDINGIKLGSEIRKHDP
VGNIIFVTSHSELTYLTFVYKVAAMDFIFKDDPAELRTRIIDCLETAHTRLQLLSKDNSVETIELKRGSNSVYVQYDDIM
FFESSTKSHRLIAHLDNRQIEFYGNLKELSQLDDRFFRCHNSFVVNRHNIESIDSKERIVYFKNKEHCYASVRNVKKI
>P0A0I5 ~~~agrA~~~Accessory gene regulator protein A~~~
MKIFICEDDPKQRENMVTIIKNYIMIEEKPMEIALATDNPYEVLEQAKNMNDIGCYFLDIQLSTDINGIKLGSEIRKHDP
VGNIIFVTSHSELTYLTFVYKVAAMDFIFKDDPAELRTRIIDCLETAHTRLQLLSKDNSVETIELKRGSNSVYVQYDDIM
FFESSTKSHRLIAHLDNRQIEFYGNLKELSQLDDRFFRCHNSFVVNRHNIESIDSKERIVYFKNKEHCYASVRNVKKI
>P0A0I7 ~~~agrA~~~Accessory gene regulator protein A~~~
MKIFICEDDPKQRENMVTIIKNYIMIEEKPMEIALATDNPYEVLEQAKNMNDIGCYFLDIQLSTDINGIKLGSEIRKHDP
VGNIIFVTSHSELTYLTFVYKVAAMDFIFKDDPAELRTRIIDCLETAHTRLQLLSKDNSVETIELKRGSNSVYVQYDDIM
FFESSTKSHRLIAHLDNRQIEFYGNLKELSQLDDRFFRCHNSFVVNRHNIESIDSKERIVYFKNKEHCYASVRNVKKI
>Q7WYU3 3.4.-.-~~~~~~Putative AgrB-like protein~~~
MIKYLSTNISLYFQENNSCLSKKDVLKIQYTLEAILSDLSKFIIIFLVFLFIKEIPLFLFSFIILNSTRPLLGGIHCKTY
YGCLTCSILYFMIILLFTRLFPELNTNFYIVFFILSLAITFIFAPCPNEKRPVKNKATLKILSLISLTFWIILFYLSPLQ
TRNCILISIFLQIIQVIIINTKGVIFNAKNNKTFFNRTT
>P0C1P7 3.4.-.-~~~agrB~~~Accessory gene regulator protein B~~~
MNYFDNKIDQFATYLQKRNNLDHIQFLQVRLGMQVLAKNIGKLIVMYTIAYILNIFLFTLITNLTFYLIRRHAHGAHAPS
SFWCYVESIILFILLPLVIVNFHINFLIMIILTVISLGVISVYAPAATKKKPIPVRLIKRKKYYAIIVSLTLFIITLIIK
EPFAQFIQLGIIIEAITLLPIFFIKEDLK
>B3PC73 3.2.1.131~~~~~~Extracellular xylan exo-alpha-(1->2)-glucuronosidase~~~COG3661
MSTFARLFLCLVFFASLQPAMAQTEDGYDMWLRYQPIADQTLLKTYQKQIRHLHVAGDSPTINAAAAELQRGLSGLLNKP
IVARDEKLKDYSLVIGTPDNSPLIASLNLGERLQALGAEGYLLEQTRINKRHVVIVAANSDVGVLYGSFHLLRLIQTQHA
LEKLSLSSAPRLQHRVVNHWDNLNRVVERGYAGLSLWDWGSLPNYLAPRYTDYARINASLGINGTVINNVNADPRVLSDQ
FLQKIAALADAFRPYGIKMYLSINFNSPRAFGDVDTADPLDPRVQQWWKTRAQKIYSYIPDFGGFLVKADSEGQPGPQGY
GRDHAEGANMLAAALKPFGGVVFWRAFVYHPDIEDRFRGAYDEFMPLDGKFADNVILQIKNGPIDFQPREPFSALFAGMS
RTNMMMEFQITQEYFGFATHLAYQGPLFEESLKTETHARGEGSTIGNILEGKVFKTRHTGMAGVINPGTDRNWTGHPFVQ
SSWYAFGRMAWDHQISAATAADEWLRMTFSNQPAFIEPVKQMMLVSREAGVNYRSPLGLTHLYSQGDHYGPAPWTDDLPR
ADWTAVYYHRASKTGIGFNRTKTGSNALAQYPEPIAKAWGDLNSVPEDLILWFHHLSWDHRMQSGRNLWQELVHKYYQGV
EQVRAMQRTWDQQEAYVDAARFAQVKALLQVQEREAVRWRNSCVLYFQSVAGRPIPANYEQPEHDLEYYKMLARTTYVPE
PWHPASSSRVLK
>Q837U5 3.5.3.12~~~aguA~~~Putative agmatine deiminase~~~COG2957
MAKRIVGSTPKQDGFRMPGEFEPQEKVWMIWPERPDNWRDGGKPVQEAFTNVAKAISQFTPMNVVVSQQQFQNCRRQLPP
EITVYEMSNNDAWVRDCGPSFVINDHGEIRGVDWTFNAWGGLVDGLYFPWDQDDLVAQKICEIEHVDSYRTDDFVLEGGS
FHVDGQGTVLTTEMCLLSEGRNPQLSKEAIEQKLCDYLNVEKVLWLGDGIDPEETNGHVDDVACFIAPGEVACIYTEDQN
SPFYEAAQDAYQRLLKMTDAKGRQLKVHKLCCPVKNVTIKGSFKIDFVEGTMPREDGDICIASYMNFLITNDGVIVPQYG
DENDHLALEQVQTMFPDKKIVGVNTVEVVYGGGNIHCITQQEPKR
>Q09LY5 3.2.1.131~~~aguA~~~Xylan alpha-(1->2)-glucuronosidase~~~
MTAGYEPCWLRYERKDQYSRLRFEEIVAKRTSPIFQAVVEELQKGLRSMMEIEPQVVQEVNETANSIWLGTLEDEEFERP
LEGTLVHPEGYVIRSDVDDGPFRIYIIGKTDAGVLYGVFHFLRLLQMGENIAQLSIIEQPKNRLRMINHWDNMDGSIERG
YAGRSIFFVDDQFVKQNQRIKDYARLLASVGINAISINNVNVHKTETKLITDHFLPDVAEVADIFRTYGIKTFLSINYAS
PIEIGGLPTADPLDPEVRRWWKETAKRIYQYIPDFGGFVVKADSEFRPGPFTYGRDHAEGANMLAEALAPFGGLVIWRCF
VYNCQQDWRDRTTDRAKAAYDHFKPLDGQFRENVILQIKNGPMDFQVREPVSPLFGAMPKTNQMMEVQITQEYTGQQKHL
CFLIPQWKEVLDFDTYAKGKGSEVKKVIDGSLFDYRYSGIAGVSNIGSDPNWTGHTLAQANLYGFGRLAWNPDLSAEEIA
NEWVVQTFGDDSQVVETISWMLLSSWRIYENYTSPLGVGWMVNPGHHYGPNVDGYEYSHWGTYHYADRDGIGVDRTVATG
TGYTAQYFPENAAMYESLDTCPDELLLFFHHVPYTHRLHSGETVIQHIYNTHFEGVEQAKQLRKRWEQLKGKIDEKRYHD
VLERLTIQVEHAKEWRDVINTYFYRKSGIDDQYGRKIYR
>Q9I6J9 3.5.3.12~~~aguA~~~Agmatine deiminase~~~
MSNPTSTPRADGFRMPAEWEPHEQTWMVWPERPDNWRNGGKPAQAAFAAVAKAIARFEPVTVCASAGQYENARARLDDGN
IRVVEISSDDAWVRDTGPTFVIDDKGDVRGVDWGFNAWGGFEGGLYFPWQRDDQVARKILEIERRARYRTDDFVLEGGSI
HVDGEGTLITTEECLLNHNRNPHLSQAEIERTLRDYLAVESIIWLPNGLYNDETDGHVDNFCCYARPGEVLLAWTDDQDD
PNYLRCQAALRVLEESRDAKGRKLVVHKMPIPGPLYATQEECDGVDIVEGSQPRDPSIRLAGSYVNFLIVNGGIIAPSFD
DPKDAEARAILQRVFPEHEVVMVPGREILLGGGNIHCITQQQPAPRKA
>Q8DW17 3.5.3.12~~~aguA~~~Putative agmatine deiminase~~~COG2957
MAKRIKNTTPKQDGFRMPGEFEKQKQIWMLWPWRNDNWRLGAKPAQKAFLEVAEAISEFEPVSLCVPPLQYENALARVSE
LGSHNIRIIEMTNDDAWIRDCGPTFLVNDKGDLRAVDWEFNAWGGLVDGLYFPWDQDALVARKVCEIEGVDSYKTKDFVL
EGGSIHVDGEGTVLVTEMCLLHPSRNPHLTKEDIEDKLKDYLNCVKVLWVKDGIDPYETNGHIDDVACFIRPGEVACIYT
DDKEHPFYQEAKAAYDFLSQQTDAKGRPLKVHKMCVTKEPCYLQEAATIDYVEGSIPREEGEMAIASYLNFLIVNGGIIL
PQYGDENDQLAKQQVQEMFPDRKVVGVRTEEIAYGGGNIHCITQQQPAT
>P96105 3.2.1.131~~~aguA~~~Xylan alpha-(1->2)-glucuronosidase~~~COG3661
MDYRMCWLEYRGLPADVAGKLKDWFSSVSILEPGSSVLKDEIRRFSERSIGITPRFYSRPLKKEKYIMVGRLESLPIKLD
VNLGEEGFMLRTIEWNGSKILLVTGETKKALVYGIFDLMKRIRLGEDIEKMNVLAKPKAKFRMLNHWDNLDGTIERGYAG
NSIFFKDNRIIINQRTKDYARLLASIGINGVVINNVNVKKREVYLIDSIYLKKLKKLADIFREYGIKIYLSINFASPVYL
GGLDTADPLDERVARWWREKARGIYDYIPDFGGFLVKADSEFNPGPHMFGRTHAEGANMLARALAPFGGVVIWRAFVYNC
LQDWRDYKTDRAKAAYDNFKPLDGQFDDNVIIQIKYGPMDFQVREPVNPLFGGMEKTNQILELQITQEYTGQQIHLCFLG
TLWKEILEFDTFAKGEGSYVKRIVDGTLFDRENNGFAGVSNVGDSVNWTGHDLAQANLYAFGRLAWNPDEEIERIVEEWI
KLTFGDDEKVLENVSYMLMKSHRTYEKYTTPFGLGWMVNPGHHYGPNPEGYEYSKWGTYHRANWEAIGVDRTSRGTGYTL
QYHSPWKEIYDDINTCPEDLLLFFHRVRYDHRLKSGKTLLQTMYDLHFEGVEEVEEFIKKWEELKDRVSPDIFERVKERL
HMQLEHAKEWRDVINTYFYRRTGIPDEKGRKIYP
>B8J364 4.1.1.111~~~~~~Siroheme decarboxylase alpha subunit~~~COG1522
MTTQTSAATGSPTQQNNAALADMDSMDRQLLDIIQTGFPLSPRPYAELGQRLGLDEQEVLDRVRGLKARKIIRRLGANFQ
SAKLGFVSTLCAAKVPQDKMDAFVAEVNAKPGVTHNYLREHDYNIWFTLISPSREETQAILDGITQATGVPILNLPATKL
FKIRVDFRMDNDS
>Q72DS5 4.1.1.111~~~ahbA~~~Siroheme decarboxylase alpha subunit~~~COG1522
MTEAHNACCHPSGTAAGHHGAGKASTDMMDAVDRRLLDIIQTGFPIEPRPYAVLGETLGITECEALARVRALRERKVIRR
LGANFDSWKLGFRSTLCAAKVPEDRIDAFVAEVNRHVNVTHNYLRNHEYNIWFTCICPSWEQVCSLLDGITERTGIPILN
LPATKLYKIRVDFRMD
>Q30Y72 4.1.1.111~~~ahbA~~~Siroheme decarboxylase alpha subunit~~~COG1522
MTGPAVQDQELDQFDKKILDIIQTGFPLEPRPYAVIGDAVGLTEAEALARVRALKERKIIRRLGANFNSWKLGFRSTLCA
AKVPEDKFDEFVAEVNSHVGVTHNYLRAHAYNVWFTFIGPSWEEVCSTLDSITQKTGIPILNLPAEELYKIRVDFKMDED
PAAD
>B8J3A4 4.1.1.111~~~~~~Siroheme decarboxylase beta subunit~~~COG1522
MSHQFSPEEQAVLRIVQANLPDSLTPYADLAEQAGMTEAQVLELLGRLKASGAIRRFGASIKHQKTGWTHNAMVAWKVTP
DQVDDCGRKAAEHSHISHVYYRPSSAPDWPYEMYTMIHGRSEAECLGVVEDVKRTTSLKEHAILRSLKELKKTSMTYFT
>Q725I2 4.1.1.111~~~ahbB~~~Siroheme decarboxylase beta subunit~~~COG1522
MSRYDDFTEVERAILRIVQSNLPDSLTPYADIAREVGTDEETVLALLRSLKEEGPIRRFGASIKHQRAGWNHNAMVAWKV
DPAIVEEAGTKAAEHPHISHVYYRPSSAPDWPYELYTMIHGRHATAHMDVIEQLRRETPLEEFAVLESLRELKKTSMTYF
>Q30WH3 4.1.1.111~~~ahbB~~~Siroheme decarboxylase beta subunit~~~COG1522
MAQTFTDTERAILRIVQKNLPDSATPYADIAEQTGTDEQTVLALLRRMKEEGSIRRFGASLKHQKAGYTHNAMVAWIVDK
DTVDEVGRQAAEHRLISHVYYRPSTAPDWPYTLYTMIHGRHENEYLEVIDTLRKETALEEYAVLNSLKELKKTSMTYF
>Q72DS4 1.3.98.6~~~ahbD~~~AdoMet-dependent heme synthase~~~COG0535
MGAHPTAHGPRTLEDGSPTCKLIAWEVTRSCNLACKHCRAEAHMEPYPGEFSTDEAKALIDTFPDVGNPIIIFTGGDPMM
RGDVYELIAYATDKGLRCVMSPNGTLITPEHAQRMKASGVQRCSISIDGPDAASHDAFRGVPGAFEQSMRGIGYLRDAGI
EFQINTTVTRDNLHSFKDIFKLCERIGAVAWHIFLLVPTGRAAGLSDQVISAAEYEEVLNWFYDFRKTTSMHLKATCAPH
YYRIMRQRAKEEGVSVTPDNFGMDAMTRGCLGGTGFCFISHTGQVQPCGYLELDCGNVRNTPFPEIWRKSEHFRQFRTQE
EYTGKCGPCEYHKVCGGCRARAYNMSGDHMAEEPLCSYKPRRMTPCR
>Q30Y73 1.3.98.6~~~ahbD~~~AdoMet-dependent heme synthase~~~COG0535
MHNANHPHGNGHPAEKKGMGAHSGAMNMPRTLEDGSPACRLIAWEVTRSCNLACKHCRAEAHTEPYPGELSTQEAKALID
TFPEVGNPIIIFTGGDPMMRADLYELIRYATGLGLRCVLSPNGTLITGQNAVQIREAGVQRCSISIDGPSAELHDEFRGV
PGAFEQSMRGIEFLKQAGVEFQINTTVTRDNLPYFKDIFKLCENLGAAAWHIFLLVPTGRAAQLGAQVITAEEYEEVLNW
FYDFRKTTSMHLKATCAPHYYRIMRQRAKEEGLPVTPDNFGMDAMTRGCLGGIGFCFISHTGQVQPCGYLELDCGNVRDT
RFPEIWRKSEYFRQFRTPEEYDGKCGHCEYHNVCGGCRARGFTMSGSHMAEEPLCTYQPRKKPAADRK
>A0A0K2JKU1 1.14.13.249~~~creL~~~3-amino-4-hydroxybenzoate 2-monooxygenase~~~
MVDRDIRIAVVGAGVAGLTVAALLRDRGVDCRVFERAPRLVAVGAGIQLSPNGVRVLHGLGLRDSLAATGVRARAIETRS
WADGAPIARTPLGDRCEELYGAPYYLIHRADLHRCLTSLLPASAVELGRACARVEERPDAVRLHFTDGTMADADLVIGAD
GVHSVVRRSVVRDAPRYAGYAVHRGLVPASVVPSFRDDPRVMFWLGPGRHVTYYPVAGGRTVHFSAVGVSPEESPGGGPE
DLGAAFGHWHEEVRRVVTSASSVTRWGLYDRDIPDRYATGRVVLLGDAAHPTLPYLSQGANQALEDVTTLVGCLDARPGA
PQEAVRQYESLRLPRTAEVHRRARRLAEEFHLPDGPECTDRDQRMRATQDPTHLAWLYGRTAGLPDASDLAPRP
>Q9RKF1 1.2.1.92~~~~~~3,6-anhydro-alpha-L-galactose dehydrogenase~~~COG1012
MTHELFDSKGLLGSAPAAFVAGEYELDSSHGTLPVINPANGQLVAEVPSSSSSTVDRAVTAAVAAQREWGRRSHVARAAV
LEAVRDAIAVHADELARIVSVEQGKPLSDARGETEGACAFFDFAISQKYRAVGSMMASEPGRSLGVREEPIGVVAAILPW
NFPVAIFARKVAPALMAGNAVVLKPSELTPLSALALARLCRLAGVPDGLLSVVCGEGKDTGRALVTHPGVGMVTMTGSTR
GGREILAQVADQIIPVSLELGGKAPFIVFEDADLDAAVEAAADARLWNTGQVCTCNEVTYVHADLHDEFVRRVVDRFASV
TPLDPFAAGSRLGPLVAERERTRVQGMVDAAVAAGARVRTGGGRPDGEQYQSGAWFAPTVLTNVRPEMDIARREVFGPVL
PIIPFDAEAEVVSAANSTAYGLTAYVYTRDLSRAMRMIDALEFGEVYVNQAGPEQVQGFHTGWKSSGLGGDDGPHGYEKY
LRRKTVYVRHAV
>H2IFE7 1.2.1.92~~~Vejahgd~~~3,6-anhydro-alpha-L-galactose dehydrogenase~~~COG1012
MKRYQMYVDGQWIDAENGKVDQVINPSTEEVLAEIQDGDQDDAERVLSVAKRAQSDWKRVPARQRAELLRKFAQEIRNNR
EHLAELLVSEQGKLYRVALGEVDVAASFIEYACDWARQMDGDIVQSDNVNEHIWIQKIPRGVVVAITAWNFPFALAGRKI
GPALVAGNTIVVKPTSETPLATLELGYIAEKVGIPAGVLNIVTGGGASLGGALTSHRYTNMVTMTGSTPVGQQIIKASAN
NMAHVQLELGGKAPFIVMEDADLEQAAAAALHSRFDNCGQVCTCNERMYVHSSVYDEFMAIFMEKVQNIKVGNPMDPESD
MGPKVNKRELDHMEALVAQALKEGAQLLHGGKRLTEGEFGKGFWFEPTILGNVQQSMTIVHEEAFGPILPVIKFDTFEEV
IDYANDSEYGLATMICTRNMKYVHRLTHELECGEIYVNRGHGEQHQGFHNGYKLSGTGGEDGKYGFEQYLEKKTFYVNFD
>P0CJ63 3.1.1.81~~~aiiA~~~N-acyl homoserine lactonase AiiA~~~
MTVKKLYFIPAGRCMLDHSSVNSALTPGKLLNLPVWCYLLETEEGPILVDTGMPESAVNNEGLFNGTFVEGQILPKMTEE
DRIVNILKRVGYEPDDLLYIISSHLHFDHAGGNGAFTNTPIIVQRTEYEAALHREEYMKECILPHLNYKIIEGDYEVVPG
VQLLYTPGHSPGHQSLFIETEQSGSVLLTIDASYTKENFEDEVPFAGFDPELALSSIKRLKEVVKKEKPIIFFGHDIEQE
KSCRVFPEYI
>A9CKY2 3.1.1.81~~~aiiB~~~N-acyl homoserine lactonase AiiB~~~
MGNKLFVLDLGEIRVDENFIIANSTFVTPQKPTVSSRLIDIPVSAYLIQCTDATVLYDTGCHPECMGTNGRWPAQSQLNA
PYIGASECNLPERLRQLGLSPDDISTVVLSHLHNDHAGCVEYFGKSRLIAHEDEFATAVRYFATGDHSSPYIVKDIEAWL
ATPRNWDLVGRDERERELAPGVNLLNFGTGHASGMLGLAVRLEKQPGFLLVSDACYTATNYGPPARRAGVLHDTIGYDRT
VSHIRQYAESRSLTVLFGHDREQFASLIKSTDGFYE
>Q8VPD5 3.1.1.81~~~attM~~~N-acyl homoserine lactonase AttM~~~
MTDIRLYMLQSGTLKCKVHNIKMNQGNGADYEIPVPFFLITHPGGHTVIDGGNAIEVATDPRGHWGGICDVYWPVLDKDQ
GCVDQIKALGFDPADVKYVVQSHLHLDHTGAIGRFPNATHIVQRSEYEYAFTPDWFAGGGYIRKDFDKPGLKWQFLNGTQ
DDYYDVYGDGTLTTIFTPGHAPGHQSLLVRLPNSKPLLLTIDAAYTLDHWEEKALPGFLASTVDTVRSVQKLRTYAEKHD
ATVVTGHDPDAWANFKKAPEFYA
>Q7X3T2 3.1.1.81~~~ahlD~~~N-acyl homoserine lactonase~~~
MEKDQLKVRVLETGVMEADMAWLLLKPGRIIADRNNKERQREWGEIPTHAVLIEHPEGRILWDTGVPRDWSSRWQESGMD
NYFPVKTESSSESGFLDSSLAQVGLEPADIDLLILSHLHLDHAGNARLFDNGKTKIVANRKELEGVQEIMGSHLGGHLKA
DFEGLKIDAIEGDTEIVPGVSVIDTPGHTWGTMSLQVDLPDDGTKIFTSDAVYLRDSFGPPAIGAAVVWNNLLWLESVEK
LRRIQERTNAEMIFGHESEQTSQIRWAHQGHYQ
>Q08GP4 3.1.1.81~~~Y2-aiiA~~~N-acyl homoserine lactonase~~~COG0491
MTVKKLYFVPAGRCMLDRSSVNSTLTPGNLLNLPVWCYLLETEEGPILVDTGMPESAVHNENLFEGTFAEGQILPKMTEE
DRIVTILKRVGYKPEDLLYIISSHLHFDHAGGNGAFSNTPIIIQRAEYEAAQYREEYLKECILPNLNYKIIEGDYEVVPG
VQLLYTPGHSPGHQSLLIETEKSGLVLLTIDASYTKENFEDEVPFAGFDSELALSSIKRLKEVVMKEKPIVFFGHDIEQE
KGCKVFPEYI
>Q9L8R8 3.1.1.81~~~aiiA~~~N-acyl homoserine lactonase~~~
MTVKKLYFVPAGRCMLDHSSVNSTLTPGELLDLPVWCYLLETEEGPILVDTGMPESAVNNEGLFNGTFVEGQVLPKMTEE
DRIVNILKRVGYEPEDLLYIISSHLHFDHAGGNGAFINTPIIVQRAEYEAAQHSEEYLKECILPNLNYKIIEGDYEVVPG
VQLLHTPGHTPGHQSLLIETEKSGPVLLTIDASYTKENFENEVPFAGFDSELALSSIKRLKEVVMKEKPIVFFGHDIEQE
RGCKVFPEYI
>A3FJ64 3.1.1.81~~~aiiA~~~N-acyl homoserine lactonase~~~
MTVKKLYFIPAGRCMLDHSSVNSALTPGKLLNLPVWCYLLETEEGPILVDTGMPESAVNNEGLFNGTFVEGQILPKMTEE
DRIVNILKRVGYEPDDLLYIISSHLHFDHAGGNGAFTNTPIIVQRTEYEAALHREEYMKECILPHLNYKIIEGDYEVVPG
VQLLYTPGHSPGHQSLFIETEQSGSVLLTIDASYTKENFEDEVPFAGFDPELALSSIKRLKEVVKKEKPIIFFGHDIEQE
KSCRVFPEYI
>C6L862 3.1.1.81~~~aiiM~~~N-acyl homoserine lactonase~~~COG2267
MILAHDVSGSGPLLVLLHGITEDRRSWDPVDFTDGFTVVRVDLRGHGASAAEEPYDIPTLATDVHDTLAQLAENDVIPGE
LPVIVGHSMGGIVATAYGALFPARAIVNVDQPLQLAGMQGQVQQAEGMLRGADFPLFIHGMFAQMAGGLDAEELARVNGI
RSPRQDVVLGMWRPLLEDSPEELAALVSGLTRIPEDVPYLVITGLDAGPEYAAWLQREIPQAVQEVWQPPTHYPHLVDPA
RFVERVEAFVR
>K0J4Q8 1.11.1.26~~~ahpC~~~Alkyl hydroperoxide reductase C~~~COG0450
MSLIGTEVQPFRAQAFQSGKDFFEVTEADLKGKWSIVVFYPADFSFVCPTELEDVQKEYAELKKLGVEVYSVSTDTHFVH
KAWHENSPAVGSIEYIMIGDPSQTISRQFDVLNEETGLADRGTFIIDPDGVIQAIEINADGIGRDASTLINKVKAAQYVR
ENPGEVCPAKWEEGGETLKPSLDIVGKI
>P80239 1.11.1.26~~~ahpC~~~Alkyl hydroperoxide reductase C~~~COG0450
MSLIGKEVLPFEAKAFKNGEFIDVTNEDLKGQWSVFCFYPADFSFVCPTELEDLQEQYAALKELGVEVYSVSTDTHFVHK
GWHDSSEKISKITYAMIGDPSQTISRNFDVLDEETGLADRGTFIIDPDGVIQTVEINAGGIGRDASNLVNKVKAAQYVRQ
NPGEVCPAKWEEGGETLTPSLDLVGKI
>P0AE08 1.11.1.26~~~ahpC~~~Alkyl hydroperoxide reductase C~~~COG0450
MSLINTKIKPFKNQAFKNGEFIEITEKDTEGRWSVFFFYPADFTFVCPTELGDVADHYEELQKLGVDVYAVSTDTHFTHK
AWHSSSETIAKIKYAMIGDPTGALTRNFDNMREDEGLADRATFVVDPQGIIQAIEVTAEGIGRDASDLLRKIKAAQYVAS
HPGEVCPAKWKEGEATLAPSLDLVGKI
>P56876 1.11.1.26~~~ahpC~~~Alkyl hydroperoxide reductase C~~~COG0450
MLVTKLAPDFKAPAVLGNNEVDEHFELSKNLGKNGVILFFWPKDFTFVCPTEIIAFDKRVKDFHEKGFNVIGVSIDSEQV
HFAWKNTPVEKGGIGQVSFPMVADITKSISRDYDVLFEEAIALRGAFLIDKNMKVRHAVINDLPLGRNADEMLRMVDALL
HFEEHGEVCPAGWRKGDKGMKATHQGVAEYLKENSIKL
>P21762 1.11.1.26~~~ahpC~~~Alkyl hydroperoxide reductase C~~~COG0450
MLVTKLAPDFKAPAVLGNNEVDEHFELSKNLGKNGAILFFWPKDFTFVCPTEIIAFDKRVKDFQEKGFNVIGVSIDSEQV
HFAWKNTPVEKGGIGQVTFPMVADITKSISRDYDVLFEEAIALRGAFLIDKNMKVRHAVINDLPLGRNADEMLRMVDALL
HFEEHGEVCPAGWRKGDKGMKATHQGVAEYLKENSIKL
>A0R1V9 1.11.1.28~~~~~~Alkyl hydroperoxide reductase C~~~COG0450
MALLTIGDQFPEYDLTAVVGGDLSKVDAKQPDDYFTRVTSKDYEGKWRIIFFWPKDFTFVCPTEIAAFGKLNEDFEDRDA
KVLGVSVDNEFVHFQWRAQHEDLKTLPFPMVSDLKRELTAACGVLNADGVADRATFIVDPNNEVQFVSVTAGSVGRNVDE
VLRVLDALQSDELCACNWKKGDPTINAGELLAGAV
>P9WQB7 1.11.1.28~~~ahpC~~~Alkyl hydroperoxide reductase C~~~COG0450
MPLLTIGDQFPAYQLTALIGGDLSKVDAKQPGDYFTTITSDEHPGKWRVVFFWPKDFTFVCPTEIAAFSKLNDEFEDRDA
QILGVSIDSEFAHFQWRAQHNDLKTLPFPMLSDIKRELSQAAGVLNADGVADRVTFIVDPNNEIQFVSATAGSVGRNVDE
VLRVLDALQSDELCACNWRKGDPTLDAGELLKASA
>Q02UU0 1.11.1.26~~~ahpC~~~Alkyl hydroperoxide reductase C~~~
MSLINTQVQPFKVNAFHNGKFIEVTEESLKGKWSVLIFMPAAFTFNCPTEIEDAANNYGEFQKAGAEVYIVTTDTHFSHK
VWHETSPAVGKAQFPLIGDPTHQLTNAFGVHIPEEGLALRGTFVINPEGVIKTVEIHSNEIARDVGETVRKLKAAQYTAA
HPGEVCPAKWKEGEKTLAPSLDLVGKI
>P0A251 1.11.1.26~~~ahpC~~~Alkyl hydroperoxide reductase C~~~
MSLINTKIKPFKNQAFKNGEFIEVTEKDTEGRWSVFFFYPADFTFVCPTELGDVADHYEELQKLGVDVYSVSTDTHFTHK
AWHSSSETIAKIKYAMIGDPTGALTRNFDNMREDEGLADRATFVVDPQGIIQAIEVTAEGIGRDASDLLRKIKAAQYVAA
HPGEVCPAKWKEGEATLAPSLDLVGKI
>P0A0B7 1.11.1.26~~~ahpC~~~Alkyl hydroperoxide reductase C~~~COG0450
MSLINKEILPFTAQAFDPKKDQFKEVTQEDLKGSWSVVCFYPADFSFVCPTELEDLQNQYEELQKLGVNVFSVSTDTHFV
HKAWHDHSDAISKITYTMIGDPSQTITRNFDVLDEATGLAQRGTFIIDPDGVVQASEINADGIGRDASTLAHKIKAAQYV
RKNPGEVCPAKWEEGAKTLQPGLDLVGKI
>P99074 1.11.1.26~~~ahpC~~~Alkyl hydroperoxide reductase C~~~
MSLINKEILPFTAQAFDPKKDQFKEVTQEDLKGSWSVVCFYPADFSFVCPTELEDLQNQYEELQKLGVNVFSVSTDTHFV
HKAWHDHSDAISKITYTMIGDPSQTITRNFDVLDEATGLAQRGTFIIDPDGVVQASEINADGIGRDASTLAHKIKAAQYV
RKNPGEVCPAKWEEGAKTLQPGLDLVGKI
>P9WQB5 1.11.1.28~~~ahpD~~~Alkyl hydroperoxide reductase AhpD~~~COG0599
MSIEKLKAALPEYAKDIKLNLSSITRSSVLDQEQLWGTLLASAAATRNPQVLADIGAEATDHLSAAARHAALGAAAIMGM
NNVFYRGRGFLEGRYDDLRPGLRMNIIANPGIPKANFELWSFAVSAINGCSHCLVAHEHTLRTVGVDREAIFEALKAAAI
VSGVAQALATIEALSPS
>P9WIE3 1.11.1.29~~~ahpE~~~Alkyl hydroperoxide reductase E~~~COG1225
MLNVGATAPDFTLRDQNQQLVTLRGYRGAKNVLLVFFPLAFTGICQGELDQLRDHLPEFENDDSAALAISVGPPPTHKIW
ATQSGFTFPLLSDFWPHGAVSQAYGVFNEQAGIANRGTFVVDRSGIIRFAEMKQPGEVRDQRLWTDALAALTA
>P35340 1.8.1.-~~~ahpF~~~Alkyl hydroperoxide reductase subunit F~~~COG3634
MLDTNMKTQLKAYLEKLTKPVELIATLDDSAKSAEIKELLAEIAELSDKVTFKEDNSLPVRKPSFLITNPGSNQGPRFAG
SPLGHEFTSLVLALLWTGGHPSKEAQSLLEQIRHIDGDFEFETYYSLSCHNCPDVVQALNLMSVLNPRIKHTAIDGGTFQ
NEITDRNVMGVPAVFVNGKEFGQGRMTLTEIVAKIDTGAEKRAAEELNKRDAYDVLIVGSGPAGAAAAIYSARKGIRTGL
MGERFGGQILDTVDIENYISVPKTEGQKLAGALKVHVDEYDVDVIDSQSASKLIPAAVEGGLHQIETASGAVLKARSIIV
ATGAKWRNMNVPGEDQYRTKGVTYCPHCDGPLFKGKRVAVIGGGNSGVEAAIDLAGIVEHVTLLEFAPEMKADQVLQDKL
RSLKNVDIILNAQTTEVKGDGSKVVGLEYRDRVSGDIHNIELAGIFVQIGLLPNTNWLEGAVERNRMGEIIIDAKCETNV
KGVFAAGDCTTVPYKQIIIATGEGAKASLSAFDYLIRTKTA
>P19480 1.8.1.-~~~ahpF~~~Alkyl hydroperoxide reductase subunit F~~~
MLDTNMKTQLRAYLEKLTKPVELIATLDDSAKSAEIKELLAEIAELSDKVTFKEDNTLPVRKPSFLITNPGSQQGPRFAG
SPLGHEFTSLVLALLWTGGHPSKEAQSLLEQIRDIDGDFEFETYYSLSCHNCPDVVQALNLMAVLNPRIKHTAIDGGTFQ
NEITERNVMGVPAVFVNGKEFGQGRMTLTEIVAKVDTGAEKRAAEALNKRDAYDVLIVGSGPAGAAAAVYSARKGIRTGL
MGERFGGQVLDTVDIENYISVPKTEGQKLAGALKAHVSDYDVDVIDSQSASKLVPAATEGGLHQIETASGAVLKARSIII
ATGAKWRNMNVPGEDQYRTKGVTYCPHCDGPLFKGKRVAVIGGGNSGVEAAIDLAGIVEHVTLLEFAPEMKADQVLQDKV
RSLKNVDIILNAQTTEVKGDGSKVVGLEYRDRVSGDIHSVALAGIFVQIGLLPNTHWLEGALERNRMGEIIIDAKCETSV
KGVFAAGDCTTVPYKQIIIATGEGAKASLSAFDYLIRTKIA
>P99118 1.8.1.-~~~ahpF~~~Alkyl hydroperoxide reductase subunit F~~~
MLNADLKQQLKQLLELMEGNVEFVASLGSDEKSKELKELLTEISDMSPRLSLSEKSLKRTPSFSVNRPGEETGVTFAGIP
LGHEFNSLVLAILQVSGRAPKEKQSIIDQIKNLEGSFHFETFISLTCQKCPDVVQALNLMSVINPNITHSMIDGAVFREE
SENIMAVPAVFLNGEEFGNGRMTIQDILSKLGSTADASEFENKEPYDVLIVGGGPASGSAAIYTARKGLRTGIVADRIGG
QVNDTAGIENFITVKETTGSEFSSNLAAHIDQYDIDAMTGIRATDIEKTDEAIKVTLENGAVLESKTVIIATGAGWRKLN
IPGEEQLINKGVAFCPHCDGPLFENKDVAVIGGGNSGVEAAIDLAGIVNHVTLFEFASELKADNVLQDRLRSLSNVDIKT
NAKTTEVVGEDHVTGIRYEDMSTGEEHLLNLDGIFVQIGLLPNTSWLKDAVELNERGEIVIDRNNNTNVPGIFAAGDVTD
QKNKQIIISMGAGANAALNAFDYIIRN
>O06218 1.11.1.-~~~~~~Alkyl hydroperoxide reductase Rv2159c~~~COG2128
MKFVNHIEPVAPRRAGGAVAEVYAEARREFGRLPEPLAMLSPDEGLLTAGWATLRETLLVGQVPRGRKEAVAAAVAASLR
CPWCVDAHTTMLYAAGQTDTAAAILAGTAPAAGDPNAPYVAWAAGTGTPAGPPAPFGPDVAAEYLGTAVQFHFIARLVLV
LLDETFLPGGPRAQQLMRRAGGLVFARKVRAEHRPGRSTRRLEPRTLPDDLAWATPSEPIATAFAALSHHLDTAPHLPPP
TRQVVRRVVGSWHGEPMPMSSRWTNEHTAELPADLHAPTRLALLTGLAPHQVTDDDVAAARSLLDTDAALVGALAWAAFT
AARRIGTWIGAAAEGQVSRQNPTG
>P27250 1.1.1.2~~~ahr~~~Aldehyde reductase Ahr~~~COG1064
MSMIKSYAAKEAGGELEVYEYDPGELRPQDVEVQVDYCGICHSDLSMIDNEWGFSQYPLVAGHEVIGRVVALGSAAQDKG
LQVGQRVGIGWTARSCGHCDACISGNQINCEQGAVPTIMNRGGFAEKLRADWQWVIPLPENIDIESAGPLLCGGITVFKP
LLMHHITATSRVGVIGIGGLGHIAIKLLHAMGCEVTAFSSNPAKEQEVLAMGADKVVNSRDPQALKALAGQFDLIINTVN
VSLDWQPYFEALTYGGNFHTVGAVLTPLSVPAFTLIAGDRSVSGSATGTPYELRKLMRFAARSKVAPTTELFPMSKINDA
IQHVRDGKARYRVVLKADF
>Q9KWN0 ~~~ath~~~Actinohivin~~~
MNTLTKLTIGAVALTGSFLAAAPASAAPAADTTASPALGSQVSAQFASVTIRNAQTGRLLDSNYNGNVYTLPANGGNYQR
WTGPGDGTVRNAQTGRCLDSNYDGAVYTLPCNGGSYQKWLFYSNGYIQNVETGRVLDSNYNGNVYTLPANGGNYQKWYTG
>Q71EW5 4.2.1.112~~~~~~Acetylene hydratase~~~
MASKKHVVCQSCDINCVVEAEVKADGKIQTKSISEPHPTTPPNSICMKSVNADTIRTHKDRVLYPLKNVGSKRGEQRWER
ISWDQALDEIAEKLKKIIAKYGPESLGVSQTEINQQSEYGTLRRFMNLLGSPNWTSAMYMCIGNTAGVHRVTHGSYSFAS
FADSNCLLFIGKNLSNHNWVSQFNDLKAALKRGCKLIVLDPRRTKVAEMADIWLPLRYGTDAALFLGMINVIINEQLYDK
EFVENWCVGFEELKERVQEYPLDKVAEITGCDAGEIRKAAVMFATESPASIPWAVSTDMQKNSCSAIRAQCILRAIVGSF
VNGAEILGAPHSDLVPISKIQMHEALPEEKKKLQLGTETYPFLTYTGMSALEEPSERVYGVKYFHNMGAFMANPTALFTA
MATEKPYPVKAFFALASNALMGYANQQNALKGLMNQDLVVCYDQFMTPTAQLADYVLPGDHWLERPVVQPNWEGIPFGNT
SQQVVEPAGEAKDEYYFIRELAVRMGLEEHFPWKDRLELINYRISPTGMEWEEYQKQYTYMSKLPDYFGPEGVGVATPSG
KVELYSSVFEKLGYDPLPYYHEPLQTEISDPELAKEYPLILFAGLREDSNFQSCYHQPGILRDAEPDPVALLHPKTAQSL
GLPSGEWIWVETTHGRLKLLLKHDGAQPEGTIRIPHGRWCPEQEGGPETGFSGAMLHNDAMVLSDDDWNLDPEQGLPNLR
GGILAKAYKC
>G2JHL6 ~~~aidA~~~Quorum-quenching protein AidA~~~
MGKSLNNVPQAPLDVQFDSNDVKCSAYLYRPTTEVATPMIVMAHGLGGTRRMRLTAFAERFVAEGYACLVFDYRYFGDSE
GQPRQLLDIKSQLEDWKAAIAYARSLDKIDPNRVVIWGTSFGGGHVLATAANDNRLAAVISQCPFTDGFSSSMAMNPITT
LKLMGLALKDKIGSILGAKPVMVPLAAPSGHTALMNAPDAYSGYLALMPSGSNIPNYVAARFVLDIIRYYPGRKTSRIQA
PVLFCVCDTDSVAPSKTTLRHASHTPNHEIKHYADGHFEIYVGEAFERVVRDQIDFLKRIVPVK
>Q03155 ~~~aidA~~~Autotransporter adhesin AIDA-I~~~
MNKAYSIIWSHSRQAWIVASELARGHGFVLAKNTLLVLAVVSTIGNAFAVNISGTVSSGGTVSSGETQIVYSGRGNSNAT
VNSGGTQIVNNGGKTTATTVNSSGSQNVGTSGATISTIVNSGGIQRVSSGGVASATNLSGGAQNIYNLGHASNTVIFSGG
NQTIFSGGITDSTNISSGGQQRVSSGGVASNTTINSSGAQNILSEEGAISTHISSGGNQYISAGANATETIVNSGGFQRV
NSGAVATGTVLSGGTQNVSSGGSAISTSVYNSGVQTVFAGATVTDTTVNSGGNQNISSGGIVSETTVNVSGTQNIYSGGS
ALSANIKGSQIVNSEGTAINTLVSDGGYQHIRNGGIASGTIVNQSGYVNISSGGYAESTIINSGGTLRVLSDGYARGTIL
NNSGRENVSNGGVSYNAMINTGGNQYIYSDGEATAAIVNTSGFQRINSGGTAPVQNSVVVTRTVSSAAKPFDAEVYSGGK
QTVYLWRGIWYSNFLTAVWSMFPGTASGANVNLSGRLNAFAGNVVGTILNQEGRQYVYSGATATSTVGNNEGREYVLSGG
ITDGTVLNSGGLQAVSSGGKASATVINEGGAQFVYDGGQVTGTNIKNGGTIRVDSGASALNIALSSGGNLFTSTGATLPE
LTTMAALSVSQNHASNIVLENGGLLRVTSGGTATDTTVNSAGRLRIDDGGTINGTTTINADGIVAGTNIQNDGNFILNLA
ENYDFETELSGSGVLVKDNTGIMTYAGTLTQAQGVNVKNGGIIFDSAVVNADMAVNQNAYINISDQATINGSVNNNGSIV
INNSIINGNITNDADLSFGTAKLLSATVNGSLVNNKNIILNPTKESAGNTLTVSNYTGTPGSVISLGGVLEGDNSLTDRL
VVKGNTSGQSDIVYVNEDGSGGQTRDGINIISVEGNSDAEFSLKNRVVAGAYDYTLQKGNESGTDNKGWYLTSHLPTSDT
RQYRPENGSYATNMALANSLFLMDLNERKQFRAMSDNTQPESASVWMKITGGISSGKLNDGQNKTTTNQFINQLGGDIYK
FHAEQLGDFTLGIMGGYANAKGKTINYTSNKAARNTLDGYSVGVYGTWYQNGENATGLFAETWMQYNWFNASVKGDGLEE
EKYNLNGLTASAGGGYNLNVHTWTSPEGITGEFWLQPHLQAVWMGVTPDTHQEDNGTVVQGAGKNNIQTKAGIRASWKVK
STLDKDTGRRFRPYIEANWIHNTHEFGVKMSDDSQLLSGSRNQGEIKTGIEGVITQNLSVNGGVAYQAGGHGSNAISGAL
GIKYSF
>P33224 1.3.99.-~~~aidB~~~Putative acyl-CoA dehydrogenase AidB~~~COG1960
MHWQTHTVFNQPIPLNNSNLYLSDGALCEAVTREGAGWDSDFLASIGQQLGTAESLELGRLANVNPPELLRYDAQGRRLD
DVRFHPAWHLLMQALCTNRVHNLAWEEDARSGAFVARAARFMLHAQVEAGSLCPITMTFAATPLLLQMLPAPFQDWTTPL
LSDRYDSHLLPGGQKRGLLIGMGMTEKQGGSDVMSNTTRAERLEDGSYRLVGHKWFFSVPQSDAHLVLAQTAGGLSCFFV
PRFLPDGQRNAIRLERLKDKLGNRSNASCEVEFQDAIGWLLGLEGEGIRLILKMGGMTRFDCALGSHAMMRRAFSLAIYH
AHQRHVFGNPLIQQPLMRHVLSRMALQLEGQTALLFRLARAWDRRADAKEALWARLFTPAAKFVICKRGMPFVAEAMEVL
GGIGYCEESELPRLYREMPVNSIWEGSGNIMCLDVLRVLNKQAGVYDLLSEAFVEVKGQDRYFDRAVRRLQQQLRKPAEE
LGREITHQLFLLGCGAQMLKYASPPMAQAWCQVMLDTRGGVRLSEQIQNDLLLRATGGVCV
>P16454 ~~~ail~~~Attachment invasion locus protein~~~
MKKTLLASSLIACLSIASVNVYAASESSISIGYAQSHVKENGYTLDNDPKGFNLKYRYELDDNWGVIGSFAYTHQGYDFF
YGSNKFGHGDVDYYSVTMGPSFRINEYVSLYGLLGAAHGKVKASVFDESISASKTSMAYGAGVQFNPLPNFVIDASYEYS
KLDSIKVGTWMLGAGYRF
>Q45577 ~~~aimA~~~Glutamate/serine transporter AimA~~~COG0531
MNQLHRRMGTFSLMMVGLGSMIGSGWLFGAWRAAQIAGPAAIISWVIGMVVILFIALSYSELGSMFPEAGGMVKYTQYSH
GSFIGFIAGWANWIAIVSVIPVEAVASVQYMSSWPWEWAKWTSGLVKNGTLTGEGLAFASVLLLIYFLLNYWTVNLFSKA
NSLITIFKIIIPGLTIGALLFVGFHGENFTGGQSIAPNGWASVLTAVATSGIVFAFNGFQSPINMAGEAKNPGKSIPIAV
VGSLFVATVIYVLLQIAFIGAVNPSDIAHGWSHLNFNSPFADLAIALNINWLVIVLYADAFVSPSGTGITYTATTSRMIY
GMEKNKYMPSIFGKLHPIYGVPRQAMFFNLIVSFIFLFLFRGWGVLAEIISVATLISYITGPITVMTLRRTGKDLYRPLR
LKGLNVIAPLGFIFASLVLYWARWPLTGQVLFIILIGLPIYFYYQAKAKWKGFGRNFKAGVWMVFYLLAMMVISYLGSDK
FGGLNVIHYGWDMVLIAMVSLVFYVWALKSGYQTEYLKDAKEINSQLLNGQSEAAAGKE
>D2PPM7 3.2.1.204~~~~~~1,3-alpha-isomaltosidase~~~COG1501
MIKHRPHGIEHPYAVSPDQRVPVLPLAGEPVLLGVVAPEADRVVCEWGTLELPLSATSAAAADAAALAGGEGHLSEAQAK
SLGADGAWSVQTPPLAEPVKYRFHAHRGGAAESTEWFEVSPAVWTADGVGEVRGGGERVRGVEWLVSSQGVHRGRFRLQL
QDGDRLVGFGERYDALDQRGRELDAVVFEQYKAQGVHGRTYLPMPFAHVVGADGNGWGFHVRTSRRTWYSSAGNELTVEV
ALGDEPVVDLAIYEGDPATVLTGFLDEVGRAEELPGWVFRLWASGNEWNTQQLVTARMDTHRDLAIPVGAVVIEAWSDEQ
GITIWRDAVYAVTEDGSAHRAEDFSYRPDGAWPDPKAMIDELHARGIKVILWQIPLQKTEFSTGQVAADAAAMVRDGHAV
LEADGTAYRNRGWWFPQALMPDLSVQRTRDWWTEKRRYLVEHFDVDGFKTDGGEHAWGHDLVYADGRKGDEGNNLYPVHY
ARAFGDLLRSAGKAPVTFSRAGFTGSQAHGIFWAGDEDSTWQAFRSSVTAGLTAASCGIVYWGWDLAGFSGPVPDAELYL
RAAAASAFMPIMQYHSEFNHHQLPLRDRTPWHVAETTGDDRVVPLFRRFATLRESLVPYLTEQAARTIATDRPLMRPLFF
DHENDPEIWNHPYQYLLGDELLINPVLEPGATTWTTYLPAGEWIDVWTGDRVPSGLVTRDVPLEVVPVYCRASRWSELQP
VFS
>Q7SIF4 1.20.9.1~~~aioA~~~Arsenite oxidase subunit AioA~~~
MSRPNDRITLPPANAQRTNMTCHFCIVGCGYHVYKWPELQEGGRAPEQNALGLDFRKQLPPLAVTLTPAMTNVVTEHNGR
RYNIMVVPDKACVVNSGLSSTRGGKMASYMYTPTGDGKQRLKAPRLYAADQWVDTTWDHAMALYAGLIKKTLDKDGPQGV
FFSCFDHGGAGGGFENTWGTGKLMFSAIQTPMVRIHNRPAYNSECHATREMGIGELNNAYEDAQLADVIWSIGNNPYESQ
TNYFLNHWLPNLQGATTSKKKERFPNENFPQARIIFVDPRETPSVAIARHVAGNDRVLHLAIEPGTDTALFNGLFTYVVE
QGWIDKPFIEAHTKGFDDAVKTNRLSLDECSNITGVPVDMLKRAAEWSYKPKASGQAPRTMHAYEKGIIWGNDNYVIQSA
LLDLVIATHNVGRRGTGCVRMGGHQEGYTRPPYPGDKKIYIDQELIKGKGRIMTWWGCNNFQTSNNAQALREAILQRSAI
VKQAMQKARGATTEEMVDVIYEATQNGGLFVTSINLYPTKLAEAAHLMLPAAHPGEMNLTSMNGERRIRLSEKFMDPPGT
AMADCLIAARIANALRDMYQKDGKAEMAAQFEGFDWKTEEDAFNDGFRRAGQPGAPAIDSQGGSTGHLVTYDRLRKSGNN
GVQLPVVSWDESKGLVGTEMLYTEGKFDTDDGKAHFKPAPWNGLPATVQQQKDKYRFWLNNGRNNEVWQTAYHDQYNSLM
QERYPMAYIEMNPDDCKQLDVTGGDIVEVYNDFGSTFAMVYPVAEIKRGQTFMLFGYVNGIQGDVTTDWTDRNIIPYYKG
TWGDIRKVGSMEEFKRTVSFKSRRFA
>Q8GGJ6 1.20.9.1~~~aioA~~~Arsenite oxidase subunit AioA~~~COG0243
MSKNRDRVALPPVNAQKTNMTCHFCIVGCGYHVYKWDENKEGGRAANQNALGLDFTKQLPPFATTLTPAMTNVITAKNGK
RSNIMIIPDKECVVNQGLSSTRGGKMAGYMYAADGMTADRLKYPRFYAGDQWLDTSWDHAMAIYAGLTKKILDQGNVRDI
MFATFDHGGAGGGFENTWGSGKLMFSAIQTPTVRIHNRPAYNSECHATREMGIGELNNSYEDAQVADVIWSIGNNPYETQ
TNYFLNHWLPNLNGSTEEKKKQWFAGEPVGPGLMIFVDPRRTTSIAIAEQTAKDRVLHLDINPGTDVALFNGLLTYVVQQ
GWIAKEFIAQHTVGFEDAVKTNQMSLADCSRITGVSEDKLRQAAEWSYKPKAAGKMPRTMHAYEKGIIWGNDNYNIQSSL
LDLVIATQNVGRRGTGCVRMGGHQEGYVRPPHPTGEKIYVDQEIIQGKGRMMTWWGCNNFQTSNNAQALREVSLRRSQIV
KDAMSKARGASAAEMVDIIYDATSKGGLFVTSINLYPTKLSEAAHLMLPAAHPGEMNLTSMNGERRMRLSEKFMDAPGDA
LPDCLIAAKAANTLKAMYEAEGKPEMVKRFSGFDWKTEEDAFNDGFRSAGQPGAEPIDSQGGSTGVLATYTLLRAAGTNG
VQLPIKRVENGKMIGTAIHYDDNKFDTKDGKAHFKPAPWNGLPKPVEEQKAKHKFWLNNGRANEVWQSAYHDQYNDFVKS
RYPLAYIELNPGDAQSLGVAAGDVVEVFNDYGSTFAMAYPVKDMKPSHTFMLFGYVNGIQGDVTTDWVDRNIIPYYKGTW
GSVRRIGSIEQYKKTVSTKRRAFDNV
>Q7SIF3 1.20.9.1~~~aioB~~~Arsenite oxidase subunit AioB~~~
MSDTINLTRRGFLKVSGSGVAVAATLSPIASANAQKAPADAGRTTLQYPATQVSVAKNLKANEPVSFTYPDTSSPCVAVK
LGSPVPGGVGPNNDIVAYSVLCTHMGCPTSYDKSSKTFKCPCHFTEFDAEKAGQMICGQATENLPRVLLRYDEASDALTA
VGVDGLIYGRQANVI
>Q8GGJ7 1.20.9.1~~~aioB~~~Arsenite oxidase subunit AioB~~~COG0723
MEHQTSRRNFLKIAGSSAAVAGAGLVSGNANAAPAKVNVGASTLPYPITAVGKAKGLKVDAPVSFNYPDASSPCVAIKMG
QPTPGGVGPNNDIVAHSILCTHMGCPVSYDASAKTFKCPCHFSVFDPDNHGQMVCGQATENLPQIQLSYNAANDTFTAIG
VTGLIYGRQSNIL
>Q8ZKR2 2.7.1.223~~~~~~Aminoimidazole riboside kinase~~~
MKAMNKVWVIGDASVDLVPEKQNSYLKCPGGASANVGVCVARLGGECGFIGCLGDDDAGRFLRQVFQDNGVDVTFLRLDA
DLTSAVLIVNLTADGERSFTYLVHPGADTYVSPQDLPPFRQYEWFYFSSIGLTDRPAREACLEGARRMREAGGYVLFDVN
LRSKMWGNTDEIPELIARSAALASICKVSADELCQLSGASHWQDARYYLRDLGCDTTIISLGADGALLITAEGEFHFPAP
RVDVVDTTGAGDAFVGGLLFTLSRANCWDHALLAEAISNANACGAMAVTAKGAMTALPFPDQLNTFLSSHSLAQAMTVK
>P45565 3.1.3.-~~~ais~~~Lipopolysaccharide core heptose(II)-phosphate phosphatase~~~COG0406
MLAFCRSSLKSKKYIIILLALAAIAGLGTHAAWSSNGLPRIDNKTLARLAQQHPVVVLFRHAERCDRSTNQCLSDKTGIT
VKGTQDARELGNAFSADIPDFDLYSSNTVRTIQSATWFSAGKKLTVDKRLLQCGNEIYSAIKDLQSKAPDKNIVIFTHNH
CLTYIAKDKRDATFKPDYLDGLVMHVEKGKVYLDGEFVNH
>A0A0A1H8I4 5.3.3.7~~~ais~~~Aconitate isomerase~~~
MFPRLPTLALGALLLASTPLLAAQPVTTLTVLSSGGIMGTIREVAPAYEKATGVKLDIAAAPSMGDTPQAIPNRLARNEP
ADVVLMVGSALDKLVASGQVAKDSRVDLGQSFIAMAVRQGAPKPDISNMDAFKQTLEKAQSVAYSDSASGVYLSRILFPR
MQLDKSFMAKARMIPAEPVGAVVARGEAQLGFQQLSELKAVPGIDIVGLIPDQAQKMTLYSGAMVSKSQHPEAARALLQY
LASKDAAKAIEDSGLKPVPAQP
>Q8ZNF4 3.1.3.-~~~ais~~~Lipopolysaccharide core heptose(II)-phosphate phosphatase~~~
MLAFTLRFIKNKRYFAILAGALVIIAGLTSQHAWSGNGLPQINGKALAALAKQHPVVVLFRHAERCDRSDNTCLSDSTGI
TVKGAQDARALGKAFSADIQNYNLYSSNTVRTIQSATWFSAGRSLTVDKKMMDCGSGIYASINTLLKKSQNKNIVIFTHN
HCLTYIAKNKRGVKFDPDYLNALVMHAENGKLFLDGEFVPG
>P00561 ~~~thrA~~~Bifunctional aspartokinase/homoserine dehydrogenase 1~~~COG0460
MRVLKFGGTSVANAERFLRVADILESNARQGQVATVLSAPAKITNHLVAMIEKTISGQDALPNISDAERIFAELLTGLAA
AQPGFPLAQLKTFVDQEFAQIKHVLHGISLLGQCPDSINAALICRGEKMSIAIMAGVLEARGHNVTVIDPVEKLLAVGHY
LESTVDIAESTRRIAASRIPADHMVLMAGFTAGNEKGELVVLGRNGSDYSAAVLAACLRADCCEIWTDVDGVYTCDPRQV
PDARLLKSMSYQEAMELSYFGAKVLHPRTITPIAQFQIPCLIKNTGNPQAPGTLIGASRDEDELPVKGISNLNNMAMFSV
SGPGMKGMVGMAARVFAAMSRARISVVLITQSSSEYSISFCVPQSDCVRAERAMQEEFYLELKEGLLEPLAVTERLAIIS
VVGDGMRTLRGISAKFFAALARANINIVAIAQGSSERSISVVVNNDDATTGVRVTHQMLFNTDQVIEVFVIGVGGVGGAL
LEQLKRQQSWLKNKHIDLRVCGVANSKALLTNVHGLNLENWQEELAQAKEPFNLGRLIRLVKEYHLLNPVIVDCTSSQAV
ADQYADFLREGFHVVTPNKKANTSSMDYYHQLRYAAEKSRRKFLYDTNVGAGLPVIENLQNLLNAGDELMKFSGILSGSL
SYIFGKLDEGMSFSEATTLAREMGYTEPDPRDDLSGMDVARKLLILARETGRELELADIEIEPVLPAEFNAEGDVAAFMA
NLSQLDDLFAARVAKARDEGKVLRYVGNIDEDGVCRVKIAEVDGNDPLFKVKNGENALAFYSHYYQPLPLVLRGYGAGND
VTAAGVFADLLRTLSWKLGV
>P00562 ~~~metL~~~Bifunctional aspartokinase/homoserine dehydrogenase 2~~~COG0460
MSVIAQAGAKGRQLHKFGGSSLADVKCYLRVAGIMAEYSQPDDMMVVSAAGSTTNQLINWLKLSQTDRLSAHQVQQTLRR
YQCDLISGLLPAEEADSLISAFVSDLERLAALLDSGINDAVYAEVVGHGEVWSARLMSAVLNQQGLPAAWLDAREFLRAE
RAAQPQVDEGLSYPLLQQLLVQHPGKRLVVTGFISRNNAGETVLLGRNGSDYSATQIGALAGVSRVTIWSDVAGVYSADP
RKVKDACLLPLLRLDEASELARLAAPVLHARTLQPVSGSEIDLQLRCSYTPDQGSTRIERVLASGTGARIVTSHDDVCLI
EFQVPASQDFKLAHKEIDQILKRAQVRPLAVGVHNDRQLLQFCYTSEVADSALKILDEAGLPGELRLRQGLALVAMVGAG
VTRNPLHCHRFWQQLKGQPVEFTWQSDDGISLVAVLRTGPTESLIQGLHQSVFRAEKRIGLVLFGKGNIGSRWLELFARE
QSTLSARTGFEFVLAGVVDSRRSLLSYDGLDASRALAFFNDEAVEQDEESLFLWMRAHPYDDLVVLDVTASQQLADQYLD
FASHGFHVISANKLAGASDSNKYRQIHDAFEKTGRHWLYNATVGAGLPINHTVRDLIDSGDTILSISGIFSGTLSWLFLQ
FDGSVPFTELVDQAWQQGLTEPDPRDDLSGKDVMRKLVILAREAGYNIEPDQVRVESLVPAHCEGGSIDHFFENGDELNE
QMVQRLEAAREMGLVLRYVARFDANGKARVGVEAVREDHPLASLLPCDNVFAIESRWYRDNPLVIRGPGAGRDVTAGAIQ
SDINRLAQLL
>P08495 2.7.2.4~~~lysC~~~Aspartokinase 2~~~COG0527
MGLIVQKFGGTSVGSVEKIQNAANRAIAEKQKGHQVVVVVSAMGKSTDELVSLAKAISDQPSKREMDMLLATGEQVTISL
LSMALQEKGYDAVSYTGWQAGIRTEAIHGNARITDIDTSVLADQLEKGKIVIVAGFQGMTEDCEITTLGRGGSDTTAVAL
AAALKADKCDIYTDVPGVFTTDPRYVKSARKLEGISYDEMLELANLGAGVLHPRAVEFAKNYQVPLEVRSSTETEAGTLI
EEESSMEQNLIVRGIAFEDQITRVTIYGLTSGLTTLSTIFTTLAKRNINVDIIIQTQAEDKTGISFSVKTEDADQTVAVL
EEYKDALEFEKIETESKLAKVSIVGSGMVSNPGVAAEMFAVLAQKNILIKMVSTSEIKVSTVVSENDMVKAVESLHDAFE
LSKHPSAV
>P94417 2.7.2.4~~~yclM~~~Aspartokinase 3~~~COG0527
MKVVKFGGSSLASGAQLDKVFHIVTSDPARKAVVVSAPGKHYAEDTKVTDLLIACAEQYLATGSAPELAEAVVERYALIA
NELQLGQSIIEKIRDDLFTLLEGDKSNPEQYLDAVKASGEDNNAKLIAAYFRYKGVKAEYVNPKDAGLFVTNEPGNAQVL
PESYQNLYRLRERDGLIIFPGFFGFSKDGDVITFSRSGSDITGSILANGLQADLYENFTDVDAVYSVNPSFVENPKEISE
LTYREMRELSYAGFSVFHDEALIPAFRAGIPVQIKNTNNPSAEGTRVVSKRDNTNGPVVGIASDTGFCSIYISKYLMNRE
IGFGRRALQILEEHGLTYEHVPSGIDDMTIILRQGQMDAATERSVIKRIEEDLHADEVIVEHHLALIMVVGEAMRHNVGT
TARAAKALSEAQVNIEMINQGSSEVSMMFGVKEAEERKAVQALYQEFFAGVLIS
>P08660 2.7.2.4~~~lysC~~~Lysine-sensitive aspartokinase 3~~~COG0527
MSEIVVSKFGGTSVADFDAMNRSADIVLSDANVRLVVLSASAGITNLLVALAEGLEPGERFEKLDAIRNIQFAILERLRY
PNVIREEIERLLENITVLAEAAALATSPALTDELVSHGELMSTLLFVEILRERDVQAQWFDVRKVMRTNDRFGRAEPDIA
ALAELAALQLLPRLNEGLVITQGFIGSENKGRTTTLGRGGSDYTAALLAEALHASRVDIWTDVPGIYTTDPRVVSAAKRI
DEIAFAEAAEMATFGAKVLHPATLLPAVRSDIPVFVGSSKDPRAGGTLVCNKTENPPLFRALALRRNQTLLTLHSLNMLH
SRGFLAEVFGILARHNISVDLITTSEVSVALTLDTTGSTSTGDTLLTQSLLMELSALCRVEVEEGLALVALIGNDLSKAC
GVGKEVFGVLEPFNIRMICYGASSHNLCFLVPGEDAEQVVQKLHSNLFE
>A4VFY3 2.7.2.4~~~ask~~~Aspartate kinase Ask_Ect~~~COG0527
MHTVEKIGGTSMSRFEEVLDNIFIGRREGAALYQRIFVVSAYSGMTNLLLEHKKTGEPGVYQRFADAQSEGAWREALEGV
RQRMLAKNAELFSSEYELHAANQFINSRIDDASECMHSLQKLCAYGHFQLSEHLMKVREMLASLGEAHSAFNSVLALKQR
GVNARLADLTGWQQEAPLPFEEMISSHFAGFDFSRELVVATGYTHCAEGLMNTFDRGYSEITFAQIAAATGAREAIIHKE
FHLSSADPNLVGADKVVTIGRTNYDVADQLSNLGMEAIHPRAAKTLRRAGVELRIKNAFEPEHGGTLISQDYKSEKPCVE
IIAGRKDVFGIEVFDQDMLGDIGYDMEISKLLKQLKLYVVNKDSDANSITYYASGSRKLINRAARLIEEQYPAAEVTVHN
LAIVSAIGSDLKVKGILAKTVAALAEAGISIQAIHQSIRQVEMQCVVNEEDYDAAIAALHRALIEPENHGDVIAAA
>A4VJB4 2.7.2.4~~~lysC~~~Aspartate kinase Ask_LysC~~~COG0527
MALIVQKFGGTSVGTVERIEQVAEKVKKFRDGGDDIVVVVSAMSGETNRLIDLAKQISEQPVPRELDVMVSTGEQVTIAL
LAMALIKRGVPAVSYTGNQVRILTDSAHTKARILQIDAQRIQRDIKAGRVVVVAGFQGVDEKGNITTLGRGGSDTTGVAL
AAALKADECQIYTDVDGVYTTDPRVVAKAQRLDKITFEEMLEMASLGSKVLQIRAVEFAGKYSVPLRVLHSFQEGPGTLI
TLDEEESMEQPIISGIAFNRDEAKLTIRGVPDTPGVAFKILGPISAANVEVDMIVQNVAHDNTTDFTFTVHRNDYNNALQ
VLQGIAAEMGAREAIGDTNIAKVSIVGVGMRSHAGVASRMFEALAKENINIQMISTSEIKVSVVIEEKYLELAVRALHTA
FELDAPAGNTAE
>Q9L555 2.4.1.327~~~aknK~~~Aclacinomycin-T 2-deoxy-L-fucose transferase~~~
MKVLFTTFAAKSHMHAQVPLAWALQTAGHEVRIASQPDLAEDITRTGLTAVCVGEPLLLEEQMQRVNEGLGDDAEIMESQ
AEAGMDMTETRPEMLTWDHVLGVFTSMTAMAFQNSCPERMIDDVVAFAREWQPDLIVWDTLSFAGPVAAQVTGAAHARLL
FGLDLLGRMRETFLDLQEERLPEQRDDPLREWLTWTLGRYGAEFEEEVAVGQWTVDPVPPSMRFPVKQPFVPLRYIPYNG
QAVIPDWLHEPPKKRRVCLTLGVAHREVLDGDRASIGELVEALAELDVEVVATLNEKQLAGMELPDNVRAVDFVPLNALL
PTCSAVIHHGGSGTFQTALAHGVPQLIVPDMVWDTIHKAKQLERFGAGLYLHDVDNYTAQDLRDHLLRLLEEPSFAENCA
RIRREMVGTPSPNDIVPLLEKLTAEHRRDRGARGTVRGEQ
>Q0PCD7 1.1.3.45~~~aknOx~~~Aclacinomycin-N/aclacinomycin-A oxidase~~~
MFVLNEFTRRGFLGTAAAVGGTTVVTTALGGAPAAQAAVPEAADGGACGARTALVKVDRVDRRYQDLVTRGFNGRFRGRP
DVVYVVHTADQVVDAVNQAMAAGQRIAVRSGGHCFEGFVDDPAVRAVIDMSQMRQVFYDSGKRAFAVEPGATLGETYRAL
YLDWGVTIPAGVCPQVGVGGHVLGGGYGPLSRRDGVVADHLYAVEVVVVDASGRARKVVATSAADDPNRELWWAHTGGGG
GNFGIVTRYWFRTPGATGTDPSQLLPKAPTSTLRHIVTWDWSALTEEAFTRIIDNHGAWHQSNSAAGTPYASMHSVFYLN
SRAAGQILLDIQIDGGLDGAEALLNDFVAAVNEGTGVEPAVQRSTEPWLRATLANKFDTGGFDRTKSKGAYLRKPWTAAQ
AATLYRHLSADSQVWGEVSLYSYGGKVNSVPETATATAQRDSIIKVWMSATWMDPAHDDANLAWIREIYREIFATTGGVP
VPDDRTEGTFINYPDVDLVDERWNTSGVPWYTLYYKGNYPRLQKVKARWDPRDVFRHALSVRPPG
>Q9L4U6 2.4.1.326~~~aknS~~~Aklavinone 7-beta-L-rhodosaminyltransferase~~~
MRVLLTSFALDAHFNGSVPLAWALRAAGHEVRVASQPALTASITAAGLTAVPVGADPRLDEMVKGVGDAVLSHHADQSLD
ADTPGQLTPAFLQGWDTMMTATFYTLINDDPMVDDLVAFARGWEPDLILWEPFTFAGAVAAKVTGAAHARLLSFPDLFMS
MRRAYLAQLGAAPAGPAGGNGTTHPDDSLGQWLEWTLGRYGVPFDEEAVTGQWSVDQVPRSFRPPSDRPVVGMRYVPYNG
PGPAVVPDWLRVPPTRPRVCVTLGMTARTSEFPNAVPVDLVLKAVEGLDIEVVATLDAEERALLTHVPDNVRLVDHVPLH
ALLPTCAAIVHHGGAGTWSTALVEGVPQIAMGWIWDAIDRAQRQQALGAGLHLPSHEVTVEGLRGRLVRLLDEPSFTAAA
ARLRAEAESEPTPAQVVPVLERLTAQHRAREPRRPGGTSPCVS
>P77256 1.1.1.-~~~ydjG~~~NADH-specific methylglyoxal reductase~~~COG0667
MKKIPLGTTDITLSRMGLGTWAIGGGPAWNGDLDRQICIDTILEAHRCGINLIDTAPGYNFGNSEVIVGQALKKLPREQV
VVETKCGIVWERKGSLFNKVGDRQLYKNLSPESIREEVAASLQRLGIDYIDIYMTHWQSVPPFFTPIAETVAVLNELKSE
GKIRAIGAANVDADHIREYLQYGELDIIQAKYSILDRAMENELLPLCRDNGIVVQVYSPLEQGLLTGTITRDYVPGGARA
NKVWFQRENMLKVIDMLEQWQPLCARYQCTIPTLALAWILKQSDLISILSGATAPEQVRENVAALNINLSDADATLMREM
AEALER
>P74308 1.1.1.184~~~~~~Aldo/keto reductase slr0942~~~COG0656
MQSFNRINSMKYFPLSNGEQIPALGLGTWKSSPQVVGQAVEQALDLGYRHLDCAAIYGNEAEIGATLANAFTKGVVKREE
LWITSKLWSNAHHPDAVLPALEKTLQDLGLDYLDLYLIHWPVVIQPDVGFPESGDQLLPFTPASLEGTWQALEKAVDLGL
CHHIGVSNFSLKKLEMVLSMARIPPAVNQVELHPYLQQSDLLTFANSQNILLTAYSPLGSGDRPAAFQQAAEPKLLTDPV
INGIAAEQGCSAAQVLLAWAIQRGTVTIPKSVNPERLEQNLRAADITLTDSEMAKIALLDRHYRYVSGDFWTMPGSPYTL
QNLWDEI
>Q59229 2.7.2.4~~~lysC~~~Aspartokinase~~~
MGLIVQKFGGTSVGSVERILNVANRVIEEKKNGNDVVVVVSAMGKTTDELVDLAKQISAHPPKREMDMLLTTGEQVTISL
LAMALNEKGYEAISYTGWQAGITTEPVFGNARILNIETEKIQKQLNEGKIVVVAGFQGIDEHGEITTLGRGGSDTTAVAL
AAALKAEKCDIYTDVTGVFTTDPRYVKSARKLASISYDEMLELANLGAGVLHPRAVEFAKNYGITLEVRSSMEREEGTII
EEEVTMEQNLVVRGVAFEDEITRVTVFGLPNSLTSLSTIFTTLAQNRINVDIIIQSATDAETTNLSFSIKSDDLEETMAV
LENNKNLLNYQGIESETGLAKVSIVGSGMISNPGVAAKMFEVLALNGIQVKMVSTSEIKVSTVVEESQMIKAVEALHQAF
ELSGSAVKSER
>P41398 2.7.2.4~~~lysC~~~Aspartokinase~~~
MALVVQKYGGSSLESAERIRNVAERIVATKKAGNDVVVVCSAMGDTTDELLELAAAVNPVPPAREMDMLLTAGERISNAL
VAMAIESLGAEAQSFTGSQAGVLTTERHGNARIVDVTPGRVREALDEGKICIVAGFQGVNKETRDVTTLGRGGSDTTAVA
LAAALNADVCEIYSDVDGVYTADPRIVPNAQKLEKLSFEEMLELAAVGSKILVLRSVEYARAFNVPLRVRSSYSNDPGTL
IAGSMEDIPVEEAVLTGVATDKSEAKVTVLGISDKPGEAAKVFRALADAEINIDMVLQNVSSVEDGTTDITFTCPRADGR
RAMEILKKLQVQGNWTNVLYDDQVDKVSLVGAGMKSHPGVTAEFMEALRDVNVNIELISTSEIRISVLIREDDLDAAARA
LHEQFQLGGEDEAVVYAGTGR
>P26512 2.7.2.4~~~lysC~~~Aspartokinase~~~COG0527
MALVVQKYGGSSLESAERIRNVAERIVATKKAGNDVVVVCSAMGDTTDELLELAAAVNPVPPAREMDMLLTAGERISNAL
VAMAIESLGAEAQSFTGSQAGVLTTERHGNARIVDVTPGRVREALDEGKICIVAGFQGVNKETRDVTTLGRGGSDTTAVA
LAAALNADVCEIYSDVDGVYTADPRIVPNAQKLEKLSFEEMLELAAVGSKILVLRSVEYARAFNVPLRVRSSYSNDPGTL
IAGSMEDIPVEEAVLTGVATDKSEAKVTVLGISDKPGEAAKVFRALADAEINIDMVLQNVSSVEDGTTDITFTCPRSDGR
RAMEILKKLQVQGNWTNVLYDDQVGKVSLVGAGMKSHPGVTAEFMEALRDVNVNIELISTSEIRISVLIREDDLDAAARA
LHEQFQLGGEDEAVVYAGTGR
>P9WPX3 2.7.2.4~~~ask~~~Aspartokinase~~~COG0527
MALVVQKYGGSSVADAERIRRVAERIVATKKQGNDVVVVVSAMGDTTDDLLDLAQQVCPAPPPRELDMLLTAGERISNAL
VAMAIESLGAHARSFTGSQAGVITTGTHGNAKIIDVTPGRLQTALEEGRVVLVAGFQGVSQDTKDVTTLGRGGSDTTAVA
MAAALGADVCEIYTDVDGIFSADPRIVRNARKLDTVTFEEMLEMAACGAKVLMLRCVEYARRHNIPVHVRSSYSDRPGTV
VVGSIKDVPMEDPILTGVAHDRSEAKVTIVGLPDIPGYAAKVFRAVADADVNIDMVLQNVSKVEDGKTDITFTCSRDVGP
AAVEKLDSLRNEIGFSQLLYDDHIGKVSLIGAGMRSHPGVTATFCEALAAVGVNIELISTSEIRISVLCRDTELDKAVVA
LHEAFGLGGDEEATVYAGTGR
>O69077 2.7.2.4~~~lysC~~~Aspartokinase~~~
MALIVQKFGGTSVGTVERIEQVAEKVKKFREAGDDVVVVVSAMSGETNRLIGLANQIMEQPVPRELDVMVSTGEQVTIAL
LSMALIKRGVPAVSYTGNQVRILTDSAHTKARILHIDDTHIRADLKAGRVVVVAGFQGVDGNGNITTLGRGGSDTTGVAL
AAALKADECQIYTDVDGVYTTDPRVVPQARRLDKITFEEMLEMASLGSKVLQIRAVEFAGKYNVPLRVLHSFQEGPGTLI
TIDDEEESMEQPIISGIAFNRDEAKLTIRGVPDTPGVAFKILGPISAANVEVDMIVQNVAHDNTTDFTFTVHRNDYLNAL
EILKQTAANIGAREAIGDTNIAKVSIVGVGMRSHAGVASRMFEALAKESINIQMISTSEIKVSVVIEEKYLELAVRALHT
AFELDAPARQGE
>C3JXY0 2.7.2.4~~~~~~Aspartate kinase~~~COG0527
MALIVQKFGGTSVGSVERIEQVADKVKKFRDAGDDLVVVLSAMSGETNRLIDLAKAISGDQQPLPRELDVIVSTGEQVTI
ALLAMALNKRGVPAVSYTGSQVRILTDSAHTKARILQIDDQKIRTDLKAGRVVVVAGFQGVDEQGNITTLGRGGSDTTGV
ALAAALKADECQIYTDVDGVYTTDPRVVSVAQRLDKITFEEMLEMASLGSKVLQIRAVEFAGKYNVPLRVLHSFKEGPGT
LITIDEEESMEQPIISGIAFNRDEAKLTIRGVPDTPGVAFKILGPISGANIEVDMIVQNVSHDNTTDFTFTVHRNEYDAA
ERILQNTAKEIGAREVVGDTKIAKVSIVGVGMRSHAGVASRMFEALAKESINIQMISTSEIKVSVVIEEKYLELAVRALH
TAFELDAPARQGE
>Q88EI9 2.7.2.4~~~~~~Aspartate kinase~~~COG0527
MALIVQKFGGTSVGSIERIEQVAEKVKKHREAGDDLVVVLSAMSGETNRLIDLAKQITDQPVPRELDVIVSTGEQVTIAL
LTMALIKRGVPAVSYTGNQVRILTDSSHNKARILQIDDQKIRADLKEGRVVVVAGFQGVDEHGSITTLGRGGSDTTGVAL
AAALKADECQIYTDVDGVYTTDPRVVPQARRLEKITFEEMLEMASLGSKVLQIRSVEFAGKYNVPLRVLHSFKEGPGTLI
TIDEEESMEQPIISGIAFNRDEAKLTIRGVPDTPGVAFKILGPISASNIEVDMIVQNVAHDNTTDFTFTVHRNEYEKAQS
VLENTAREIGAREVIGDTKIAKVSIVGVGMRSHAGVASCMFEALAKESINIQMISTSEIKVSVVLEEKYLELAVRALHTA
FDLDAPARQGE
>P61489 2.7.2.4~~~ask~~~Aspartokinase~~~
MALVVQKYGGTSVGDLERIHKVAQRIAHYREKGHRLAVVVSAMGHTTDELIALAKRVNPRPPFRELDLLTTTGEQVSVAL
LSMQLWAMGIPAKGFVQHQIGITTDGRYGDARILEVNPARIREALDQGFVAVIAGFMGTTPEGEITTLGRGGSDTTAVAI
AAALGAKECEIYTDTEGVYTTDPHLIPEARKLSVIGYDQMLEMAALGARVLHPRAVYYAKRYGVVLHVRSSFSYNPGTLV
KEVAMEMDKAVTGVALDLDHAQIGLIGIPDQPGIAAKVFQALAERGIAVDMIIQGVPGHDPSRQQMAFTVKKDFAQEALE
ALEPVLAEIGGEAILRPDIAKVSIVGVGLASTPEVPAKMFQAVASTGANIEMIATSEVRISVIIPAEYAEAALRAVHQAF
ELDKA
>P0A959 2.6.1.2~~~alaA~~~Glutamate-pyruvate aminotransferase AlaA~~~COG0436
MSPIEKSSKLENVCYDIRGPVLKEAKRLEEEGNKVLKLNIGNPAPFGFDAPDEILVDVIRNLPTAQGYCDSKGLYSARKA
IMQHYQARGMRDVTVEDIYIGNGVSELIVQAMQALLNSGDEMLVPAPDYPLWTAAVSLSSGKAVHYLCDESSDWFPDLDD
IRAKITPRTRGIVIINPNNPTGAVYSKELLMEIVEIARQHNLIIFADEIYDKILYDDAEHHSIAPLAPDLLTITFNGLSK
TYRVAGFRQGWMVLNGPKKHAKGYIEGLEMLASMRLCANVPAQHAIQTALGGYQSISEFITPGGRLYEQRNRAWELINDI
PGVSCVKPRGALYMFPKIDAKRFNIHDDQKMVLDFLLQEKVLLVQGTAFNWPWPDHFRIVTLPRVDDIELSLSKFARFLS
GYHQL
>P9WQ91 2.6.1.2~~~aspC~~~Alanine aminotransferase~~~COG0436
MDNDGTIVDVTTHQLPWHTASHQRQRAFAQSAKLQDVLYEIRGPVHQHAARLEAEGHRILKLNIGNPAPFGFEAPDVIMR
DIIQALPYAQGYSDSQGILSARRAVVTRYELVPGFPRFDVDDVYLGNGVSELITMTLQALLDNGDQVLIPSPDYPLWTAS
TSLAGGTPVHYLCDETQGWQPDIADLESKITERTKALVVINPNNPTGAVYSCEILTQMVDLARKHQLLLLADEIYDKILY
DDAKHISLASIAPDMLCLTFNGLSKAYRVAGYRAGWLAITGPKEHASSFIEGIGLLANMRLCPNVPAQHAIQVALGGHQS
IEDLVLPGGRLLEQRDIAWTKLNEIPGVSCVKPAGALYAFPRLDPEVYDIDDDEQLVLDLLLSEKILVTQGTGFNWPAPD
HLRLVTLPWSRDLAAAIERLGNFLVSYRQ
>P77434 2.6.1.2~~~alaC~~~Glutamate-pyruvate aminotransferase AlaC~~~COG0436
MADTRPERRFTRIDRLPPYVFNITAELKMAARRRGEDIIDFSMGNPDGATPPHIVEKLCTVAQRPDTHGYSTSRGIPRLR
RAISRWYQDRYDVEIDPESEAIVTIGSKEGLAHLMLATLDHGDTVLVPNPSYPIHIYGAVIAGAQVRSVPLVEGVDFFNE
LERAIRESYPKPKMMILGFPSNPTAQCVELEFFEKVVALAKRYDVLVVHDLAYADIVYDGWKAPSIMQVPGARDVAVEFF
TLSKSYNMAGWRIGFMVGNKTLVSALARIKSYHDYGTFTPLQVAAIAALEGDQQCVRDIAEQYKRRRDVLVKGLHEAGWM
VEMPKASMYVWAKIPEPYAAMGSLEFAKKLLNEAKVCVSPGIGFGDYGDTHVRFALIENRDRIRQAIRGIKAMFRADGLL
PASSKHIHENAE
>P64550 ~~~alaE~~~L-alanine exporter AlaE~~~
MFSPQSRLRHAVADTFAMVVYCSVVNMCIEVFLSGMSFEQSFYSRLVAIPVNILIAWPYGMYRDLFMRAARKVSPSGWIK
NLADILAYVTFQSPVYVAILLVVGADWHQIMAAVSSNIVVSMLMGAVYGYFLDYCRRLFKVSRYQQVKA
>P71011 1.21.98.-~~~albA~~~Antilisterial bacteriocin subtilosin biosynthesis protein AlbA~~~COG0535
MFIEQMFPFINESVRVHQLPEGGVLEIDYLRDNVSISDFEYLDLNKTAYELCMRMDGQKTAEQILAEQCAVYDESPEDHK
DWYYDMLNMLQNKQVIQLGNRASRHTITTSGSNEFPMPLHATFELTHRCNLKCAHCYLESSPEALGTVSIEQFKKTADML
FDNGVLTCEITGGEIFVHPNANEILDYVCKKFKKVAVLTNGTLMRKESLELLKTYKQKIIVGISLDSVNSEVHDSFRGRK
GSFAQTCKTIKLLSDHGIFVRVAMSVFEKNMWEIHDMAQKVRDLGAKAFSYNWVDDFGRGRDIVHPTKDAEQHRKFMEYE
QHVIDEFKDLIPIIPYERKRAANCGAGWKSIVISPFGEVRPCALFPKEFSLGNIFHDSYESIFNSPLVHKLWQAQAPRFS
EHCMKDKCPFSGYCGGCYLKGLNSNKYHRKNICSWAKNEQLEDVVQLI
>Q8GED9 1.3.3.13~~~albA~~~Albonoursin synthase~~~
MRRHPSHSPYRGGCEVRPKRRGLMLAHSSSESPPESLPDAWTVLKTRTAVRNYAKEPVDDALIEQLLEAMLAAPTASNRQ
AWSFMVVRRPAAVRRLRAFSPGVLGTPAFFVVACVDRSLTDNLSPKLSQKIYDTSKLCVAMAVENLLLAAHAAGLGGCPV
GSFRSDIVTSMLGIPEHIEPMLVVPIGRPATALVPSQRRAKNEVVNYESWGNRAAAPTA
>P71010 ~~~albB~~~Antilisterial bacteriocin subtilosin biosynthesis protein AlbB~~~
MSPAQRRILLYILSFIFVIGAVVYFVKSDYLFTLIFIAIAILFGMRARKADSR
>P71009 ~~~albC~~~Putative ABC transporter ATP-binding protein AlbC~~~COG1131
MSILDIHDVSVWYERDNVILEQVDLHLEKGAVYGLLGVNGAGKTTLINTLTGVNRNFSGRFTLCGIEAEAGMPQKTSDQL
KTHRYFAADYPLLFTEITAKDYVSFVHSLYQKDFSEQQFASLAEAFHFSKYINRRISELSLGNRQKVVLMTGLLLRAPLF
ILDEPLVGLDVESIEVFYQKMREYCEAGGTILFSSHLLDVVQRFCDYAAILHNKQIQKVIPIGEETDLRREFFEVIGHE
>P71008 ~~~albD~~~Antilisterial bacteriocin subtilosin biosynthesis protein AlbD~~~COG0474
MNNIIPIMSLLFKQLYSRQGKKDAIRIAAGLVILAVFEIGLIRQAGIDESVLRKTYIILALLLMNTYMVFLSVTSQWKES
YMKLSCLLPISSRSFWLAQSVVLFVDTCLRRTLFFFILPLFLFGNGTLSGAQTLFWLGRFSFFTVYSIIFGVVLSNHFVK
KKNLMFLLHAAIFACVCISAALMPAATIPLCAVHILWAVVIDFPVFLQAPPQQGKMHSFMRRSEFSFYKREWNRFISSKA
MLLNYAVMAVFSGFFSFQMMNTGIFNQQVIYIVISALLLICSPIALLYSIEKNDRMLLITLPIKRKTMFWAKYRFYSGLL
AGGFLLVVMIVGFISGRSISVLTFLQCIELLLAGAYIRLTADEKRPSFSWQTEQQLWSGFSKYRSYLFCLPLFLAILAGT
AVSLAVIPIAGLVIVYYLQKQDGGFFDTSKRERLGS
>P71007 ~~~albE~~~Antilisterial bacteriocin subtilosin biosynthesis protein AlbE~~~COG0612
MEVNLLKTHQFSTISIAASFLKPIESAAEPEEETIYFYGAAAYLKEQIIDAFGYAAGSRFMYSANLFFDQQLKTCGTRLI
HPLYNGNLHVDALMKTFADLSFPSSLSFEAFEKARNELLLKIEKKFTDPFSYSAARLAEEVFGNPMYGTGMFGRRDRIKA
IHPKRFLDATDFIVDLVSQQKQLNILGQVQACDVRGHAPQTSAVTSGRIPVNRHVFETETRSAAGPSVLTLGFDCGEMKD
ASDYIKIQLIDGLLGKYGHSALFKHFREKDLAVYHVITRYDVMNNLLLVSICTDQLHEKDIPPRVLEAVSAFHTDERELE
QAKQFLRNELLLQFDSPEGLLAYMGVLRRFSCTKEALLDGISAVTCRDVLQFIATINYIGAHVVRG
>P71006 3.4.24.-~~~albF~~~Putative zinc protease AlbF~~~COG0612
MEKKAFFQQLDERTDIRYTDSGLKIFRLKFPRAHLRLCNVKIDFGSRDVCIRAESGDTLLPYGTAHFLEHLLFWHNGRNL
YSDFFAHGALLNAFTTYTDTNFMFTSLPDRLRQTIPILLDALWNHSFDKKIVAQEKAVITSEIQTAHLNHQLSYHYQLIS
MLSPSSPAAVFPAGRIEDIEALDISDLQKAYKAAYQAHRMTLFLIGGSENTETLLPPHLQLEKRPDYHAERKIIPACPPV
LSQKMMLGDEERMEDTWTGLQIGALPGQNDLLSIKLYWDIAARILFQLDSPFFQEIQQTYRLEIDRLSAETYIYEDGGFL
ILHSQGTHSSAYIDVASYYVTQKKEQVAAWLQYGKDSLTDAIIYDSDYVRKCFEWAAECDRCDCSFLDMYHIIQDMDAQV
FLSLIDAMASSNKAIIHVSQKEAIRQ
>P71005 ~~~albG~~~Antilisterial bacteriocin subtilosin biosynthesis protein AlbG~~~
MKQSTVFTLLLLLIGMAAYSFGWVQAVAEAAAQYVQMINNDAVRLGLLACTAALLMLPAFLYLHYVTQSVKNMTAAFQKL
TQSHQSCCDFQQHNLCSRYAEDVKSLRDSYKNVRQTYVMAAVLCQVIIFGCMFEIVKAVPFRLHTPPAFSMGLAMLLILY
LLFCMRTYLRQLFRHGSLFRKVFAGALAAAGIWWMLSFSISELLFLIILAAIQQIGSFIYKRFSYHSTASLDL
>P30145 ~~~acp~~~Sodium/proton-dependent alanine carrier protein~~~
MIRLVTMGKSSEAGVSSFQALTMSLSGRIGVGNVAGTATGIAYGGPGAVFWMWVITFIGAATAYVESTWRKFIKRNKTDN
TVAVRRSTLKKALAGNGLRCSRAAIILSMAVLMPGIQANSIADSFSNAFGIPKLVTGIFVIAVLGFTIFGGVKRIAKTAE
IVVPFMAVGYLFVAIAIIAANIEKVPDVFGLIFKSAFGADQVFGGILGSAVMWGVKRGLYANEAGQGTGAHPAAAAEVSH
PAKQGLVQAFSIYLDVFLVVTATALMILFTGQYNVINEKTGETIVEHLKGVEPGAGYTQAAVDTLFPGFGSAFIAIALFF
FAFTTMYAYYYIAETNLAYLVRSEKRGTAFFALKLVFLAATFYGTVKTATTAWAMGDIGLGIMVWLNLIAILLLFKPAYM
ALKDYEEQLKQGKDPEFNASKYGIKNAKFWENGYKRWEEKKGKAL
>Q9FDS1 1.2.1.48~~~ald1~~~Long-chain-aldehyde dehydrogenase~~~
MHYVDPNQSGSKIHFKDQYENFIGGQWVAPVKGVYFDNISPVDGKSFTRIPRSSAEDIELALDAAHKAKKEWNKSSPTTR
SNLLLKIADRMEANLEMLAVAETWDNGKPVRETLAADIPLAIDHFRYFAGCIRAQEGGISEIDEDTIAYHFHEPLGVVGQ
IIPWNFPILMAAWKLAPALAAGNCVVIKPAEQTPVGILLVAELIQDLLPAGVLNIVNGYGAEVGRPLATSPRIAKIAFTG
STQVGQLIMQYATENIIPVTLELGGKSPNVFFADVMDHDDDFLDKTLEGFAMFALNQGEVCTCPSRALIQESIADQFMEK
AIERVKRIKLGHPLDTDTMVGAQASLEQQEKILRCIDTGRQEGAEVLLGGHGRQEVGNGYYIEPTIFKGHNNMQVFQEEI
FGPVLSVTTFKDFDEAIQIANDTMYGLGAGVWSRSTHTAYRAGRAIEAGRVWTNCYHIYPAHAAFGGYKKSGVGRENHKM
MLDHYQQTKNLLVSYSTKAMGFF
>B7MMH7 4.1.2.62~~~ald2~~~5-methylthioribulose-1-phosphate/5-deoxyribulose-1-phosphate aldolase~~~
MERIKLAEKIISTCREMNASGLNQGTSGNVSARYTGGMLITPSGIAYSKMTPDMIVFVDDKGKPEAGKIPSSEWLIHLAC
YKARPELNAVIHTHAVNSTAVAIHNHSIPAIHYMVAVSGTDHIPCIPYYTFGSPELADGVSKGIRESKSLLMQHHGMLAM
DVTLEKTLWLAGETETLADLYIKCGGLHHDVPVLSEAEMTIVLEKFKTYGLKA
>Q2RXI1 4.1.2.62~~~ald2~~~5-methylthioribulose-1-phosphate/5-deoxyribulose-1-phosphate aldolase~~~COG0235
MPGSRIALRHGLIDAARQVTTLGLNKGTAGNLSVRAGDGLLITPSGLQAADLRPNDIVFIDSEGEWRGPRKPSSEWRFHH
DILAERPDVGAVVHTHAPFSTVLACLGRPIPAFHYMVAMAGGNDIRIGAYATFGTAELSRHALAAMEGRKACLLAHHGMI
ATGRTLKAAIKLAVEVEELAEQYWRCLQIAEPEILPADEMERVLEKFKTYGDNAQLPSPPA
>P25553 1.2.1.22~~~aldA~~~Lactaldehyde dehydrogenase~~~COG1012
MSVPVQHPMYIDGQFVTWRGDAWIDVVNPATEAVISRIPDGQAEDARKAIDAAERAQPEWEALPAIERASWLRKISAGIR
ERASEISALIVEEGGKIQQLAEVEVAFTADYIDYMAEWARRYEGEIIQSDRPGENILLFKRALGVTTGILPWNFPFFLIA
RKMAPALLTGNTIVIKPSEFTPNNAIAFAKIVDEIGLPRGVFNLVLGRGETVGQELAGNPKVAMVSMTGSVSAGEKIMAT
AAKNITKVCLELGGKAPAIVMDDADLELAVKAIVDSRVINSGQVCNCAERVYVQKGIYDQFVNRLGEAMQAVQFGNPAER
NDIAMGPLINAAALERVEQKVARAVEEGARVAFGGKAVEGKGYYYPPTLLLDVRQEMSIMHEETFGPVLPVVAFDTLEDA
ISMANDSDYGLTSSIYTQNLNVAMKAIKGLKFGETYINRENFEAMQGFHAGWRKSGIGGADGKHGLHEYLQTQVVYLQS
>Q7A825 1.2.1.3~~~aldA~~~Putative aldehyde dehydrogenase AldA~~~
MAVNVRDYIAENYGLFINGEFVKGSSDETIEVTNPATGETLSHITRAKDKDVDHAVKVAQEAFESWSLTSKSERAQMLRD
IGDKLMAQKDKIAMIETLNNGKPIRETTAIDIPFAARHFHYFASVIETEEGTVNDIDKDTMSIVRHEPIGVVGAVVAWNF
PMLLAAWKIAPAIAAGNTIVIQPSSSTPLSLLEVAKIFQEVLPKGVVNILTGKGSESGNAIFNHDGVDKLSFTGSTDVGY
QVAEAAAKHLVPATLELGGKSANIILDDANLDLAVEGIQLGILFNQGEVCSAGSRLLVHEKIYDQLVPRLQEAFSNIKVG
DPQDEATQMGSQTGKDQLDKIQSYIDAAKESDAQILAGGHRLTENGLDKGFFFEPTLIAVPDNHHKLAQEEIFGPVLTVI
KVKDDQEAIDIANDSEYGLAGGVFSQNITRALNIAKAVRTGRIWINTYNQVPEGAPFGGYKKSGIGRETYKGALSNYQQV
KNIYIDTSNALKGLY
>P37685 1.2.1.4~~~aldB~~~Aldehyde dehydrogenase B~~~COG1012
MTNNPPSAQIKPGEYGFPLKLKARYDNFIGGEWVAPADGEYYQNLTPVTGQLLCEVASSGKRDIDLALDAAHKVKDKWAH
TSVQDRAAILFKIADRMEQNLELLATAETWDNGKPIRETSAADVPLAIDHFRYFASCIRAQEGGISEVDSETVAYHFHEP
LGVVGQIIPWNFPLLMASWKMAPALAAGNCVVLKPARLTPLSVLLLMEIVGDLLPPGVVNVVNGAGGVIGEYLATSKRIA
KVAFTGSTEVGQQIMQYATQNIIPVTLELGGKSPNIFFADVMDEEDAFFDKALEGFALFAFNQGEVCTCPSRALVQESIY
ERFMERAIRRVESIRSGNPLDSVTQMGAQVSHGQLETILNYIDIGKKEGADVLTGGRRKLLEGELKDGYYLEPTILFGQN
NMRVFQEEIFGPVLAVTTFKTMEEALELANDTQYGLGAGVWSRNGNLAYKMGRGIQAGRVWTNCYHAYPAHAAFGGYKQS
GIGRETHKMMLEHYQQTKCLLVSYSDKPLGLF
>Q04777 4.1.1.5~~~alsD~~~Alpha-acetolactate decarboxylase~~~COG3527
MKRESNIQVLSRGQKDQPVSQIYQVSTMTSLLDGVYDGDFELSEIPKYGDFGIGTFNKLDGELIGFDGEFYRLRSDGTAT
PVQNGDRSPFCSFTFFTPDMTHKIDAKMTREDFEKEINSMLPSRNLFYAIRIDGLFKKVQTRTVELQEKPYVPMVEAVKT
QPIFNFDNVRGTIVGFLTPAYANGIAVSGYHLHFIDEGRNSGGHVFDYVLEDCTVTISQKMNMNLRLPNTADFFNANLDN
PDFAKDIETTEGSPE
>P23616 4.1.1.5~~~aldB~~~Alpha-acetolactate decarboxylase~~~
MKKNIITSITSLALVAGLSLTAFAATTATVPAPPAKQESKPAVAANPAPKNVLFQYSTINALMLGQFEGDLTLKDLKLRG
DMGLGTINDLDGEMIQMGTKFYQIDSTGKLSELPESVKTPFAVTTHFEPKEKTTLTNVQDYNQLTKMLEEKFENKNVFYA
VKLTGTFKMVKARTVPKQTRPYPQLTEVTKKQSEFEFKNVKGTLIGFYTPNYAAALNVPGFHLHFITEDKTSGGHVLNLQ
FDNANLEISPIHEFDVQLPHTDDFAHSDLTQVTTSQVHQAESERK
>P05361 4.1.1.5~~~budA~~~Alpha-acetolactate decarboxylase~~~
MMMHSSACDCEASLCETLRGFSAKHPDSVIYQTSLMSALLSGVYEGDTTIADLLAHGDFGLGTFNELDGEMIAFSSQVYQ
LRADGSARAAKPEQKTPFAVMTWFQPQYRKTFDAPVSRQQIHDVIDQQIPSDNLFCALRIDGNFRHAHTRTVPRQTPPYR
AMTDVLDDQPVFRFNQREGVLVGFRTPQHMQGINVAGYHEHFITDDRQGGGHLLDYQLESGVLTFGEIHKLMIDLPADSA
FLQANLHPSNLDAAIRSVEN
>Q8L208 4.1.1.5~~~aldC~~~Alpha-acetolactate decarboxylase~~~COG3527
MSEAIKLFQYNTLGALMAGLYGGTLTVGELLEHGDLGLGTLDSIDGELIVLDGKAYQAKGSEGKVEVVEVSPDEKVPYAA
VVPHQAEVIFRQRYEMTDKELEDRIESYYDGVNLFRSIKIKGHFKHMHVRMIPKSNADIKFADVATRQPEYEVDDISGTI
VGIWTPEMFHGVSVAGYHLHFISDDLTFGGHVMDFVIENGIIEVGPVDQLDQRFPVQDRQYLFAKFNVDEMRKDITKAE
>B2J1M1 4.1.99.5~~~~~~Aldehyde decarbonylase~~~COG1633
MQQLTDQSKELDFKSETYKDAYSRINAIVIEGEQEAHENYITLAQLLPESHDELIRLSKMESRHKKGFEACGRNLAVTPD
LQFAKEFFSGLHQNFQTAAAEGKVVTCLLIQSLIIECFAIAAYNIYIPVADDFARKITEGVVKEEYSHLNFGEVWLKEHF
AESKAELELANRQNLPIVWKMLNQVEGDAHTMAMEKDALVEDFMIQYGEALSNIGFSTRDIMRLSAYGLIGA
>Q7V6D4 4.1.99.5~~~~~~Aldehyde decarbonylase~~~COG3396
MPTLEMPVAAVLDSTVGSSEALPDFTSDRYKDAYSRINAIVIEGEQEAHDNYIAIGTLLPDHVEELKRLAKMEMRHKKGF
TACGKNLGVEADMDFAREFFAPLRDNFQTALGQGKTPTCLLIQALLIEAFAISAYHTYIPVSDPFARKITEGVVKDEYTH
LNYGEAWLKANLESCREELLEANRENLPLIRRMLDQVAGDAAVLQMDKEDLIEDFLIAYQESLTEIGFNTREITRMAAAA
LVS
>Q54764 4.1.99.5~~~~~~Aldehyde decarbonylase~~~COG1633
MPQLEASLELDFQSESYKDAYSRINAIVIEGEQEAFDNYNRLAEMLPDQRDELHKLAKMEQRHMKGFMACGKNLSVTPDM
GFAQKFFERLHENFKAAAAEGKVVTCLLIQSLIIECFAIAAYNIYIPVADAFARKITEGVVRDEYLHRNFGEEWLKANFD
ASKAELEEANRQNLPLVWLMLNEVADDARELGMERESLVEDFMIAYGEALENIGFTTREIMRMSAYGLAAV
>Q55688 4.1.99.5~~~~~~Aldehyde decarbonylase~~~COG1633
MPELAVRTEFDYSSEIYKDAYSRINAIVIEGEQEAYSNYLQMAELLPEDKEELTRLAKMENRHKKGFQACGNNLQVNPDM
PYAQEFFAGLHGNFQHAFSEGKVVTCLLIQALIIEAFAIAAYNIYIPVADDFARKITEGVVKDEYTHLNYGEEWLKANFA
TAKEELEQANKENLPLVWKMLNQVQGDAKVLGMEKEALVEDFMISYGEALSNIGFSTREIMRMSSYGLAGV
>O06478 1.2.1.28~~~yfmT~~~Benzaldehyde dehydrogenase YfmT~~~COG1012
MFQYEELNKQFIGGKWQEGSSPNVLENKNPYTQKTFTTFRKATADDVDEAYRAAALAKKKWDAVNPFEKRTILEKAVTYI
EENEEAIIYLIMEELGGTRLKAAFEIGLVKNIIKEAATFPIRMEGKILPSTIDGKENRLYRVPAGVVGVISPFNFPFFLS
MKSVAPALGAGNGVVLKPHEETPICGGTLIAKIFENAGIPAGLLNVVVTDIAEIGDSFVEHPVPRIISFTGSTKVGSYIG
QLAMKHFKKPLLELGGNSAFIVLEDADIEYAVNAAVFSRFTHQGQICMSANRVLVHSSIYDKFLELYQAKVESLKVGDPM
DPDTIIGPLINSRQTDGLMKTVEQAIEEGAVPVKLGGFNGTIVEPTILKDVKPFMSIAKEELFGPVVSFMKFDSEDEAVD
IANETPFGLSGAVHTSNLERGVAFAKRIETGMIHVNDTTINDEPNVAFGGEKQSGLGRLNGEWSLEEFTTLKWISVQHEK
RSFPY
>P42329 1.2.1.5~~~aldHT~~~Aldehyde dehydrogenase, thermostable~~~
MKVQTEIKTYFNYINGNWVSSVSNNVEPSINPANRHDIVGYVQRSTLEDVNEAVTAANEAQTSWWKRSGVERGEYLYKAA
HILEQCLQDIAETMTREMGKTLAEAKAETMRGVHILRYYAGEGARKIGDVIPSSDSEGLLFTTRVPLGVVGVISPWNFPV
AIPIWKMAPALVYGNTVVLKPASETAVTAAKVIECFHEAGFPKGVVNMVCGSGSVVGQGIANHPDIDGVTFTGSNTVGKQ
VGRAAFERGAKYQLEMGGKNPVIVAKDADLDLAVEGTISGGLRSTGQKCTATSRVFIEREVYEPFKAKLLERVKQLKIGN
GLDAETWMGPCASESQFHTVLSYIEKGKSEGAKLIYGGNRCLEGELANGFFVEPTIFEDVDLQMTIAREEIFGPVLALIQ
VDSIEEAIKLANDTEYGLSASIYTKNIGNALEFIKDIEAGLIKVNAETAGVEFQAPFGGMKQSSSHSREQGQAAIEFFTS
IKTVFVKA
>P9WNY1 1.2.1.3~~~~~~Probable aldehyde dehydrogenase~~~COG1012
MTVFSRPGSAGALMSYESRYQNFIGGQWVAPVHGRYFENPTPVTGQPFCEVPRSDAADIDKALDAAHAAAPGWGKTAPAE
RAAILNMIADRIDKNAAALAVAEVWDNGKPVREALAADIPLAVDHFRYFAAAIRAQEGALSQIDEDTVAYHFHEPLGVVG
QIIPWNFPILMAAWKLAPALAAGNTAVLKPAEQTPASVLYLMSLIGDLLPPGVVNVVNGFGAEAGKPLASSDRIAKVAFT
GETTTGRLIMQYASHNLIPVTLELGGKSPNIFFADVLAAHDDFCDKALEGFTMFALNQGEVCTCPSRSLIQADIYDEFLE
LAAIRTKAVRQGDPLDTETMLGSQASNDQLEKVLSYIEIGKQEGAVIIAGGERAELGGDLSGGYYMQPTIFTGTNNMRIF
KEEIFGPVVAVTSFTDYDDAIGIANDTLYGLGAGVWSRDGNTAYRAGRDIQAGRVWVNCYHLYPAHAAFGGYKQSGIGRE
GHQMMLQHYQHTKNLLVSYSDKALGFF
>Q8GAK7 1.2.1.3~~~aldh~~~Aldehyde dehydrogenase~~~
MAIATIDPTTGITLKTFDAHTPEEVENRIARAEAAFRSLQNTSFEERARWMHKAADILESEADEVARLIATEMGKTLTTA
KYEALKSATGMRHFADHAQRYLSPETPVPASEVNASNLHVQFDPLGVVLAVMPWNYPLWQAVRFAAPALMAGNTGLLKHA
SNVPQCALYLGDLFARGGFPEGAFQTLLVEGKDVIPLVDDARIRAVTLTGSVAAGSAIAEAAGRNIKRSVLELGGMDVFI
VMPSADIEKAAAQAVIARLQNSGQSCIAAKRFYVHEDVYDRFEHLFVTGMAEAVAGDPLDESTSFGPLATERGRQDVHEL
VRDAREKGAAVQCGGEIPEGEGWYYPATVLTGVTEDMRIYREECFGPVACLYKVSSLQEAIALSNDSDFGLSSSVWTNDE
TEATEAARSIEAGGVFINGLTASFPAVPFGGLKDSGYGRELSAYGIREFVNIKTVWTS
>A1B4L2 1.2.1.3~~~adh~~~Aldehyde dehydrogenase~~~COG1012
MPNDQTHPFRGVNALPFEERYDNFIGGEWVAPVSGRYFTNTTPITGAEIGQIARSEAGDIELALDAAHAAKEKWGATSPA
ERANIMLKIADRMERNLELLATAETWDNGKPIRETMAADLPLAIDHFRYFAGVLRAQEGSISQIDDDTVAYHFHEPLGVV
GQIIPWNFPLLMACWKLAPAIAAGNCVVLKPAEQTPAGIMVWANLIGDLLPPGVLNIVNGFGLEAGKPLASSNRIAKIAF
TGETTTGRLIMQYASENLIPVTLELGGKSPNIFFADVAREDDDFFDKALEGFTMFALNQGEVCTCPSRVLIQESIYDKFM
ERAVQRVQAIKQGDPRESDTMIGAQASSEQKEKILSYLDIGKKEGAEVLTGGKAADLGGELSGGYYIEPTIFRGNNKMRI
FQEEIFGPVVSVTTFKDQAEALEIANDTLYGLGAGVWSRDANTCYRMGRGIKAGRVWTNCYHAYPAHAAFGGYKQSGIGR
ETHKMMLDHYQQTKNMLVSYSPKKLGFF
>P12693 1.2.1.3~~~alkH~~~Aldehyde dehydrogenase~~~
MTIPISLAKLNSSADTHSALEVFNLQKVASSARRGKFGIAERIAALNLLKETIQRREPEIIAALAADFRKPASEVKLTEI
FPVLQEINHAKRNLKDWMKPRRVRAALSVAGTRAGLRYEPKGVCLIIAPWNYPFNLSFGPLVSALAAGNSVVIKPSELTP
HTATLIGSIVREAFSVDLVAVVEGDAAVSQELLALPFDHIFFTGSPRVGKLVMEAASKTLASVTLELGGKSPTIIGPTAN
LPKAARNIVWGKFSNNGQTCIAPDHVFVHRCIAQKFNEILVKEIVRVYGKDFAAQRRSADYCRIVNDQHFNRINKLLTDA
KAKGAKILQGGQVDATERLVVPTVLSNVTAAMDINHEEIFGPLLPIIEYDDIDSVIKRVNDGDKPLALYVFSEDKQFVNN
IVARTSSGSVGVNLSVVHFLHPNLPFGGVNNSGIGSAHGVYGFRAFSHEKPVLIDKFSITHWLFPPYTKKVKQLIGITVK
YLS
>Q99SD6 1.2.1.3~~~~~~Putative aldehyde dehydrogenase~~~
MRDYTKQYINGEWVESNSNETIEVINPATEEVIGKVAKGNKADVDKAVEAADDVYLEFRHTSVKERQALLDKIVKEYENR
KDDIVQAITDELGAPLSLSERVHYQMGLNHFVAARDALDNYEFEERRGDDLVVKEAIGVSGLITPWNFPTNQTSLKLAAA
FAAGSPVVLKPSEETPFAAVILAEIFDKVGVPKGVFNLVNGDGAGVGNPLSEHPKVRMMSFTGSGPTGSKIMEKAAKDFK
KVSLELGGKSPYIVLDDVDIKEAAKATTGKVVNNTGQVCTAGTRVLVPNKIKDAFLAELKEQFSQVRVGNPREDGTQVGP
IISKKQFDQVQNYINKGIEEGAELFYGGPGKPEGLEKGYFARPTIFINVDNQMTIAQEEIFGPVMSVITYNDLDEAIQIA
NDTKYGLAGYVIGKDKETLHKVARSIEAGTVEINEAGRKPDLPFGGYKQSGLGREWGDYGIEEFLEVKSIAGYFK
>Q7A4D8 1.2.1.3~~~~~~Putative aldehyde dehydrogenase~~~
MRDYTKQYINGEWVESNSNETIEVINPATEEVIGKVAKGNKADVDKAVEAADDVYLEFRHTSVKERQALLDKIVKEYENR
KDDIVQAITDELGAPLSLSERVHYQMGLNHFVAARDALDNYEFEERRGDDLVVKEAIGVSGLITPWNFPTNQTSLKLAAA
FAAGSPVVLKPSEETPFAAVILAEIFDKVGVPKGVFNLVNGDGAGVGNPLSEHPKVRMMSFTGSGPTGSKIMEKAAKDFK
KVSLELGGKSPYIVLDDVDIKEAAKATTGKVVNNTGQVCTAGTRVLVPNKIKDAFLAELKEQFSQVRVGNPREDGTQVGP
IISKKQFDQVQNYINKGIEEGAELFYGGPGKPEGLEKGYFARPTIFINVDNQMTIAQEEIFGPVMSVITYNDLDEAIQIA
NDTKYGLAGYVIGKDKETLHKVARSIEAGTVEINEAGRKPDLPFGGYKQSGLGREWGDYGIEEFLEVKSIAGYFK
>Q56694 1.2.1.4~~~aldH~~~NADP-dependent fatty aldehyde dehydrogenase~~~
MNPQTDNVFYATNAFTGEALPLAFPVHTEVEVNQAATAAAKVARDFRRLNNSKRASLLRTIASELEARSDDIIARAHLET
ALPEVRLTGEIARTANQLRLFADVVNSGSYHQAILDTPNPTRAPLPKPDIRRQQIALGPVAVFGASNFPLAFSAAGGDTA
SALAAGCPVIVKGHTAHPGTSQIVAECIEQALKQEQLPQAIFTLLQGNQRALGQALVSHPEIKAVGFTGSVGGGRALFNL
AHERPEPIPFYGELGAINPTFIFPSAMRAKADLADQFVASMTMGCGQFCTKPGVVFALNTPETQAFIETAQSLIRQQSPS
TLLTPGIRDSYQSQVVSRGSDDGIDVTFSQAESPCVASALFVTSSENWRKHPAWEEEIFGPQSLIVVCENVADMLSLSEM
LAGSLTATIHATEEDYPQVSQLIPRLEEIAGRLVFNGWPTGVEVGYAMVHGGPYPASTHSASTSVGAEAIHRWLRPVAYQ
ALPESLLPDSLKAENPLEIARAVDGKAAHS
>P94358 1.2.1.3~~~aldY~~~Putative aldehyde dehydrogenase AldY~~~COG1012
MSFETLNKSFINGKWTGGESGRTEDILNPYDQSVITTASLATGKQLEDAFDIAQKAQKEWAKSTTEDRKAVLQKARGYLH
ENRDDIIMMIARETGGTIIKSTIELEQTIAILDEAMTYTGELGGVKEVPSDIEGKTNKIYRLPLGVISSISPFNFPMNLS
MRSIAPAIALGNSVVHKPDIQTAISGGTIIAKAFEHAGLPAGVLNVMLTDVKEIGDGMLTNPIPRLISFTGSTAVGRHIG
EIAGRAFKRMALELGGNNPFAVLSDADVDRAVDAAIFGKFIHQGQICMIINRIIVHQDVYDEFVEKFTARVKQLPYGDQT
DPKTVVGPLINERQIEKALEIIEQAKTDGIELAVEGKRVGNVLTPYVFVGADNNSKIAQTELFAPIATIIKAGSDQEAID
MANDTEYGLSSAVFTSDLEKGEKFALQIDSGMTHVNDQSVNDSPNIAFGGNKASGVGRFGNPWVVEEFTVTKWISIQKQY
RKYPF
>Q4VKV0 1.2.99.10~~~ald~~~4,4'-diapolycopene aldehyde oxidase~~~
MTTIAAVSPLDGRLLGHFPVSKPALIQQQLTKSRRAALLWRELPVTERVKRLSPLKKQLLDNLDRLCETIRLSTGKVRTE
ALLGEIYPVLDLLAYYQKRAPRILRTRAVSTSPFAFPAATARIERRPYGVVAVISPWNYPFHLSVAPLLTALLAGNAVIL
KPSELCLPVGQLIVDLFATLDLPDGLVQWVIGDGQTGAELIDARPDLVFFTGGLQTGRAVMQRAARHPIPVMLELGGKDT
MLVLADADLKRASAAALYGAFCNSGQVCVSVERLYVQQACFAEFLAMLLKGLSKLKVGHDPHGDVGVMTSARQIDIVQAH
YEDAIAQGAKASGPLLRDGNVVQPVVLWDVHHGMKVMREETFGPLLPVMPFSDEAEAIKLANDSDLGLNASIWSQDIIKA
ERLAGQLDVGNWAINDVLKNVGHSGLPFGGVKQSGFGRYHGAEGLLNFSYPVSGLTNRSRLPKEPNWFPYSASGYENFKG
FLDFIYGEDSMLQRGRRNQQALQAFREFSIFDWTQRWQNLKLLFSWTRDD
>O05156 3.4.24.75~~~~~~Glycyl-glycine endopeptidase ALE-1~~~
MDTNRKFTLVKSLSIGLGTFLVGSVFLTVNDEASASTKVDAPKVEQEAPAKADAPKVEQEAPAKADAPKVEQEAPAKVDA
PKVEQEAPAKVDAPKVEQEAPAKADAPKVEQKRTFVREAAQSNHSASWLNNYKKGYGYGPYPLGINGGNHYGVDFFMNVG
TPVRAISDGKIVEAGWTNYGGGNEIGLVENDGVHRQWYMHLSKFNVKVGDRVKAGQIIGWSGSTGYSTAPHLHFQRMTNS
FSNNTAQDPMPFLKSAGYGSNSTSSSNNNGYKTNKYGTLYKSESASFTANTDIITRLTGPFRSMPQSGVLRKGLTIKYDE
VMKQDGHVWVGYNTNSGKRVYLPVRTWNESTGELGPLWGTIK
>P0A991 4.1.2.13~~~fbaB~~~Fructose-bisphosphate aldolase class 1~~~COG1830
MTDIAQLLGKDADNLLQHRCMTIPSDQLYLPGHDYVDRVMIDNNRPPAVLRNMQTLYNTGRLAGTGYLSILPVDQGVEHS
AGASFAANPLYFDPKNIVELAIEAGCNCVASTYGVLASVSRRYAHRIPFLVKLNHNETLSYPNTYDQTLYASVEQAFNMG
AVAVGATIYFGSEESRRQIEEISAAFERAHELGMVTVLWAYLRNSAFKKDGVDYHVSADLTGQANHLAATIGADIVKQKM
AENNGGYKAINYGYTDDRVYSKLTSENPIDLVRYQLANCYMGRAGLINSGGAAGGETDLSDAVRTAVINKRAGGMGLILG
RKAFKKSMADGVKLINAVQDVYLDSKITIA
>P60053 4.1.2.13~~~fda~~~Fructose-bisphosphate aldolase class 1~~~COG3588
MNKEQLQQMRQAPGFVGALDQSGGSTPKALKAYGIQPDAYQSEEEMFDLIHQMRTRMITSPAFATGKIIGVILFERTMRG
KIEGMPTADFLWEKRHIVPFLKVDKGLQDEANGVQLMKPFPELGKLCEEAVGYHVFGTKMRSVIKQANEQGIRDIVEQQF
QWGKEILSHGLVPILEPEVDIHCPEKAKAEEILKRELLAQLDKMTEPVMLKITIPTVDNFYKEIIEHPMMLRVVALSGGY
SREQANELLSRNHGVIASFSRALVEGLSARQTDAEFNAMLEASIEDVYQASIK
>P99117 4.1.2.13~~~fda~~~Fructose-bisphosphate aldolase class 1~~~
MNKEQLEKMKNGKGFIAALDQSGGSTPKALKEYGVNEDQYSNEDEMFQLVHDMRTRVVTSPSFSPDKILGAILFEQTMDR
EVEGKYTADYLADKGVVPFLKVDKGLAEEQNGVQLMKPIDNLDSLLDRANERHIFGTKMRSNILELNEQGIKDVVEQQFE
VAKQIIAKGLVPIIEPEVNINAKDKAEIEKVLKAELKKGLDSLNADQLVMLKLTIPTEPNLYKELAEHPNVVRVVVLSGG
YSREKANELLKDNDELIASFSRALASDLRADQSKEEFDKALGDAVESIYDASVNKN
>Q07159 4.1.2.13~~~fda~~~Fructose-bisphosphate aldolase class 1~~~COG3588
MNQEQFDKIKNGKGFIAALDQSGGSTPKALKDYGVEENEYSNDEEMFNLVHDMRTRIITSPAFNGEKILGAILFEQTMDR
EVEGKYTGSYLADKGIVPFLKVDKGLAEEADGVQLMKPIPDLDKLLDRANERGIFGTKMRSNILENNKEAIEKVVKQQFE
VAKEIIAAGLVPIIEPEVNINAKDKEAIEANLAEAIKAELDNLKKDQYVMLKLTIPTKVNAYSELIEHPQVIRVVALSGG
YSRDEANKILKQNDGLIASFSRALVSDLNAQQSDAEFNEKLQEAIDTIFDASVNKA
>P99075 4.1.2.13~~~fba~~~Fructose-bisphosphate aldolase~~~
MPLVSMKEMLIDAKENGYAVGQYNINNLEFTQAILEASQEENAPVILGVSEGAARYMSGFYTIVKMVEGLMHDLNITIPV
AIHLDHGSSFEKCKEAIDAGFTSVMIDASHSPFEENVATTKKVVEYAHEKGVSVEAELGTVGGQEDDVVADGIIYADPKE
CQELVEKTGIDALAPALGSVHGPYKGEPKLGFKEMEEIGLSTGLPLVLHGGTGIPTKDIQKAIPFGTAKINVNTENQIAS
AKAVRDVLNNDKEVYDPRKYLGPAREAIKETVKGKIKEFGTSNRAK
>Q55664 4.1.2.13~~~fbaA~~~Fructose-bisphosphate aldolase class 2~~~COG0191
MALVPMRLLLDHAAENGYGIPAFNVNNMEQIISIMQAADETDSPVILQASRGARSYAGENFLRHLVLGAVETYPHIPIAM
HQDHGNSPATCYSAIRNGFTSVMMDGSLEADAKTPASFEYNVNVTAEVVKVAHSVGASVEGELGCLGSLETGQGEAEDGH
GFEGKLDHSQLLTDPEEAVEFVNKTQVDALAVAIGTSHGAYKFTRKPTGEVLAISRIEEIHRLLPNTHLVMHGSSSVPQE
WIDMINEFGGAIPETYGVPVEEIQKGIKSGVRKVNIDTDNRLAITAAFREAAAKDPKNFDPRHFLKPSIKYMKQVCADRY
QQFWTAGNASKIKQLTLDDYAAKYAKGELTATSRTSVAV
>P13243 4.1.2.13~~~fbaA~~~Probable fructose-bisphosphate aldolase~~~COG0191
MPLVSMTEMLNTAKEKGYAVGQFNLNNLEFTQAILQAAEEEKSPVILGVSEGAGRYMGGFKTVVAMVKALMEEYKVTVPV
AIHLDHGSSFESCAKAIHAGFTSVMIDASHHPFEENVATTAKVVELAHFHGVSVEAELGTVGGQEDDVIAEGVIYADPKE
CQELVERTGIDCLAPALGSVHGPYKGEPNLGFKEMEEIGKSTGLPLVLHGGTGIPTADIKKSISLGTAKINVNTENQISS
AKAVRETLAAKPDEYDPRKYLGPAREAIKETVIGKMREFGSSNQA
>O51401 4.1.2.13~~~fba~~~Fructose-bisphosphate aldolase~~~
MGVLDKIKPGVVYGKELHFLYEICKKEGFAIPSINCIGTNSINAVLEAAKEINSPIMIQFSNSGSAFISGKGLKMEKPQG
VSIVGAISGAMHVHLMAEHYGVPVVLHTDHCAKNLLPWVEGLLEYGEKYYSQHKKPLFSSHMLDLSEEPIKENIEISKKF
LERMAKIEMFLEIELGITGGEEDGVDNSDRALHELFSTPEDIYYGYSELLKVSPNFQIAAAFGNVHGVYKPGNVKLTPKV
LKDGQDYVISKTGVNMAKPVSYVFHGGSGSTIDEINEALSYGVVKMNIDTDTQWAAWEGVLNYYKKNESRLQGQLGDGKD
IDIPNKKFYDPRVWLREAEVSMKDRVKIACKNLNNINRN
>Q0PAS0 4.1.2.13~~~fba~~~Fructose-bisphosphate aldolase~~~COG0191
MGVLDIVKAGVISGDELNKIYDYAKAEGFAIPAVNVVGTDSINAVLEAAKKVNSPVIIQFSNGGAKFYAGKNCPNGEVLG
AISGAKHVHLLAKAYGVPVILHTDHAARKLLPWIDGLIEANAQYKKTHGQALFSSHMLDLSEESLEENLSTCEVYLQKLD
ALGVALEIELGCTGGEEDGVDNTGIDNSKLYTQPEDVALAYERLGKISDKFSIAASFGNVHGVYKPGNVSLQPEILKNSQ
KFVKDKFALNSDKPINFVFHGGSGSELKDIKNAVSYGVIKMNIDTDTQWAFWDGVREYELKNRAYLQGQIGNPEGDDKPN
KKYYDPRVWLRSGEESMIKRLEIAFEDLNCINKN
>P19537 4.1.2.13~~~fba~~~Fructose-bisphosphate aldolase~~~COG0191
MPIATPEVYNEMLDRAKEGGFAFPAINCTSSETINAALKGFAEAESDGIIQFSTGGAEFGSGLAVKNKVKGAVALAAFAH
EAAKSYGINVALHTDHCQKEVLDEYVRPLLAISQERVDRGELPLFQSHMWDGSAVPIDENLEIAQELLAKAKAANIILEV
EIGVVGGEEDGVEAKAGANLYTSPEDFEKTIDAIGTGEKGRYLLAATFGNVHGVYKPGNVKLRPEVLLEGQQVARKKLGL
ADDALPFDFVFHGGSGSEKEKIEEALTYGVIKMNVDTDTQYAFTRPIVSHMFENYNGVLKIDGEVGNKKAYDPRSYMKKA
EQSMSERIIESCQDLKSVGKTTSK
>P0AB71 4.1.2.13~~~fbaA~~~Fructose-bisphosphate aldolase class 2~~~COG0191
MSKIFDFVKPGVITGDDVQKVFQVAKENNFALPAVNCVGTDSINAVLETAAKVKAPVIVQFSNGGASFIAGKGVKSDVPQ
GAAILGAISGAHHVHQMAEHYGVPVILHTDHCAKKLLPWIDGLLDAGEKHFAATGKPLFSSHMIDLSEESLQENIEICSK
YLERMSKIGMTLEIELGCTGGEEDGVDNSHMDASALYTQPEDVDYAYTELSKISPRFTIAASFGNVHGVYKPGNVVLTPT
ILRDSQEYVSKKHNLPHNSLNFVFHGGSGSTAQEIKDSVSYGVVKMNIDTDTQWATWEGVLNYYKANEAYLQGQLGNPKG
EDQPNKKYYDPRVWLRAGQTSMIARLEKAFQELNAIDVL
>P56109 4.1.2.13~~~fba~~~Fructose-bisphosphate aldolase~~~COG0191
MLVKGNEILLKAHKEGYGVGAFNFVNFEMLNAIFEAGNEENSPLFIQTSEGAIKYMGIDMAVGMVKTMCERYPHIPVALH
LDHGTTFESCEKAVKAGFTSVMIDASHHAFEENLELTSKVVKMAHNAGVSVEAELGRLMGIEDNISVDEKDAVLVNPKEA
EQFVKESQVDYLAPAIGTSHGAFKFKGEPKLDFERLQEVKRLTNIPLVLHGASAIPDNVRKSYLDAGGDLKGSKGVPFEF
LQESVKGGINKVNTDTDLRIAFIAEVRKVANEDKSQFDLRKFFSPAQLALKNVVKERMKLLGSANKI
>P9WQA3 4.1.2.13~~~fba~~~Fructose-bisphosphate aldolase~~~COG0191
MPIATPEVYAEMLGQAKQNSYAFPAINCTSSETVNAAIKGFADAGSDGIIQFSTGGAEFGSGLGVKDMVTGAVALAEFTH
VIAAKYPVNVALHTDHCPKDKLDSYVRPLLAISAQRVSKGGNPLFQSHMWDGSAVPIDENLAIAQELLKAAAAAKIILEI
EIGVVGGEEDGVANEINEKLYTSPEDFEKTIEALGAGEHGKYLLAATFGNVHGVYKPGNVKLRPDILAQGQQVAAAKLGL
PADAKPFDFVFHGGSGSLKSEIEEALRYGVVKMNVDTDTQYAFTRPIAGHMFTNYDGVLKVDGEVGVKKVYDPRSYLKKA
EASMSQRVVQACNDLHCAGKSLTH
>Q8YNK2 4.1.2.13~~~fda~~~Fructose-bisphosphate aldolase~~~COG0191
MALVPLRLLLDHAAENGYGIPAFNVNNLEQIQAILKAAAETDSPVILQASRGARNYAGENFLRHLILAAVETYPEIPIVM
HQDHGNAPSTCYSAIKNNFTSVMMDGSLEADAKTPASFEYNVNVTREVVNVAHALGVSVEGELGCLGSLETGAGEAEDGH
GFEGTLDHSQLLTDPDEAVNFVEATQVDALAVAIGTSHGAYKFTRKPTGEILAISRIEEIHRRLPNTHLVMHGSSSVPED
LIALINEYGGAIPETYGVPVEEIQKGIKSGVRKVNIDTDNRLAITAAVREALAKNPKEFDPRHFLKPSITYMQKVCAERY
VQFGTAGNASKIKQVSLETFAAKYAKGELNAISKAAAKV
>Q9ZEM7 4.1.2.13~~~fba~~~Fructose-bisphosphate aldolase~~~
MPIATPEVYNEMLDRAKAGKFAYPAINVTSSQTLNAALRGFAEAESDGIVQISTGGAEFLGGQYSKDMVTGAVALAEFAH
IIAEKYPVNIALHTDHCPKDKLDGYVRPLLALSKKRVEAGLGPLFQSHMWDGSAEPLADNLAIAQELLETARAAQIILEV
EITPTGGEEDGVSHEINDSLYTTVDDAIRTAEALGLGEKGRYLLAASFGNVHGVYKPGNVVLRPELLKELNEGVAARFGK
ESPFDFVFHGGSGSSEEEIRTALENGVVKMNLDTDTQYAFTRPVAGHMFANYDGVLKVDGEVGNKKAYDPRTWGKLAEAS
MAARVVEATQHLRSAGNKIK
>Q5XA12 4.1.2.13~~~fba~~~Fructose-bisphosphate aldolase~~~
MAIVSAEKFVQAARENGYAVGGFNTNNLEWTQAILRAAEAKQAPVLIQTSMGAAKYMGGYKVCQSLITNLVESMGITVPV
AIHLDHGHYGDALECIEVGYTSIMFDGSHLPVEENLAKTAEVVKIAHAKGVSVEAEVGTIGGEEDGIIGKGELAPIEDAK
AMVETGIDFLAAGIGNIHGPYPENWEGLALDHLEKLTAAVPGFPIVLHGGSGIPDDQIKEAIRLGVAKVNVNTESQIAFS
NATREFARNYEANEAEYDGKKLFDPRKFLAPGMKAVQGAVEERIDVFGSANKA
>P0A4S2 4.1.2.13~~~fba~~~Fructose-bisphosphate aldolase~~~COG0191
MAIVSAEKFVQAARDNGYAVGGFNTNNLEWTQAILRAAEAKKAPVLIQTSMGAAKYMGGYKVARNLIANLVESMGITVPV
AIHLDHGHYEDALECIEVGYTSIMFDGSHLPVEENLKLAKEVVEKAHAKGISVEAEVGTIGGEEDGIIGKGELAPIEDAK
AMVETGIDFLAAGIGNIHGPYPVNWEGLDLDHLQKLTEALPGFPIVLHGGSGIPDEQIQAAIKLGVAKVNVNTECQIAFA
NATRKFARDYEANEAEYDKKKLFDPRKFLADGVKAIQASVEERIDVFGSEGKA
>Q703I2 4.1.2.13~~~fba~~~Fructose-bisphosphate aldolase~~~
MLVTGLEILRKARAEGYGVGAFNTNNMEFTQAILEAAEEMKSPVILALSEGAMKYGGRALTRMVVALAQEARVPVAVHLD
HGSSYESVLKALREGFTSVMIDKSHEDFETNVRETKRVVEAAHAVGVTVEAELGRLAGIEEHVAVDEKDALLTNPEEARI
FMERTGADYLAVAIGTSHGAYKGKGRPFIDHPRLARIAKLVPAPLVLHGASAVPQELVERFRAAGGEIGEASGIHPEDIK
KAISLGIAKINTDTDLRLAFTALVRETLGKNPKEFDPRKYLGPAREAVKEVVKSRMELFGSVGRA
>Q56815 4.1.2.13~~~cbbA~~~Fructose-bisphosphate aldolase~~~
MALVSMRQLLDHAADDSYGLPAFNVNNMEQVKAIMDAARATSSPVILQGSAGARKYAGEPFLRHLIAAAVEAYPEIPVVM
HQDHGASPAVCMGAIKSGFSSVMMDGSLKEDGKTPADYDYNVSVTAKVVELAHAVGVSVEGELGCLGSLETGKGEAEDGH
GAEEALDHSKLLTDPDEAAQFVKATQCDALAIAIGTSHGAYKFTRKPTGDILAIDRIKAIHQRIPTTHLVMHGSSSVPQE
LLEEIRTYGGDIKETYGVPVEEIQEGIRYGVRKVNIDTDIRLAMTAAIRRVGAKNKSEFDPRKFMAAAMEEAKKVCIARF
EAFGSAGKAEKIRAIELDEMAKRYASGELAQVVH
>Q9HY69 2.4.1.33~~~~~~Mannuronan synthase~~~
MNTAVNVNVVHESEAQRQFARVKLPARIRYIGANREGVDARLLDLSAGGFAFTASGAPIQPGDLYKGKLLFQVDSISFSL
EVEFQVRSVDPASRRVGCEFQNLKPREVAALRYLITSYLAGEVIGVGDMLNTLQRENFTKARKQGGGNGGMGFFGRVRAV
TLSTAIFVVGVGAFAFILNQMYNLYFVTHADSGVVSVPNQQITMPREGTVQSLLGPNAEVAKGAPIATFSANLLDMLKGN
LTEEQLNPGNIEKLFGHQMKGTLTSPCDCRVVQQLVADGQYANKGQVIFTLAPRDSVASIEARFPYRNAAELAPGTRVNF
QVAGDGVNRSGRIVNTAPVDGDLSSEIRVQIQPDQPLDAQYAGRPAEVSIGGLPGRTLLNKAVTLATAR
>Q52463 2.4.1.33~~~alg8~~~Mannuronan synthase~~~
MMETYKRGLAEATGWLVFLSLLMVLALAVPKTVFDADSKDFILLIGAVGIWRYSMGGVHFLRGMLFLHVVYPYYRRRVRQ
LGSAADPSHVFLMVTSFRIDALTTAMVYRSVIREAIDSGYPTTVVCSIVEMSDEVLVRSLWEKMNPPDRVSLDFVRIPGT
GKRDGLAYGFRAISRHLPDDDAVVAVIDGDTVLDHGVVKKTVPWFKLFPNVGGLTTNEFCEVQGGYVMSEWHKLRFAQRH
INMCSMALSKRVLTMTGRMSVFRARVVTNPEFITDVENDHLEHWRLGRFKFLTGDDKSSWFSLMRLGYDTFYVPDAAINT
VEHPPEKSFIKASRKLMYRWYGNNLRQNSRALKLGARRLGWFTMLVLFDQRVSMWTSLLGLVVAILASLKYSIAFLLVYL
LWIGLTRLVLTLLLSLSGHRIGPAYPLILYYNQIVGALVKIYVFFRLDRQSWTRQPTKLERGLASFQRWFNAWSSRAMTF
SAASIFVAVLLTIV
>P07874 ~~~algA~~~Alginate biosynthesis protein AlgA~~~
MIPVILSGGSGSRLWPLSRKQYPKQFLALTGDDTLFQQTIKRLAFDGMQAPLLVCNKEHRFIVQEQLEAQNLASQAILLE
PFGRNTAPAVAIAAMKLVAEGRDELLLILPADHVIEDQRAFQQALALATNAAEKGEMVLFGIPASRPETGYGYIRASADA
QLPEGVSRVQSFVEKPDEARAREFVAAGGYYWNSGMFLFRASRYLEELKKHDADIYDTCLLALERSQHDGDLVNIDAATF
ECCPDNSIDYAVMEKTSRACVVPLSAGWNDVGSWSSIWDVHAKDANGNVTKGDVLVHDSHNCLVHGNGKLVSVIGLEDIV
VVETKDAMMIAHKDRVQDVKHVVKDLDAQGRSETQNHCEVYRPWGSYDSVDMGGRFQVKHITVKPGARLSLQMHHHRAEH
WIVVSGTAQVTCDDKTFLLTENQSTYIPIASVHRLANPGKIPLEIIEVQSGSYLGEDDIERLEDVYGRTAEPALQVVAGS
R
>P23747 ~~~algB~~~Alginate biosynthesis transcriptional regulatory protein AlgB~~~
METTSEKQGRILLVDDESAILRTFRYCLEDEGYSVATASSAPQAEALLQRQVFDLCFLDLRLGEDNGLDVLAQMRVQAPW
MRVVIVTAHSAVDTAVDAMQAGAVDYLVKPCSPDQLRLAAAKQLEVRQLTARLEALEDEVRRQGDGLESHSPAMAAVLET
ARQVAATDANILILGESGSGKGELARAIHTWSKRAKKPQVTINCPSLTAELMESELFGHSRGAFTGATESTLGRVSQADG
GTLFLDEIGDFPLTLQPKLLRFIQDKEYERVGDPVTRRADVRILAATNRDLGAMVAQGQFREDLLYRLNVIVLNLPPLRE
RAEDILGLAERFLARFVKDYGRPARGFSEAAREAMRQYPWPGNVRELRNVIERASIICNQELVDVDHLGFSAAQSASSAP
RIGESLSLEDLEKAHITAVMASSATLDQAAKTLGIDASTLYRKRKQYGL
>Q02E40 5.4.2.2~~~algC~~~Phosphomannomutase/phosphoglucomutase~~~
MSTAKAPTLPASIFRAYDIRGVVGDTLTAETAYWIGRAIGSESLARGEPCVAVGRDGRLSGPELVKQLIQGLVDCGCQVS
DVGMVPTPVLYYAANVLEGKSGVMLTGSHNPPDYNGFKIVVAGETLANEQIQALRERIEKNDLASGVGSVEQVDILPRYF
KQIRDDIAMAKPMKVVVDCGNGVAGVIAPQLIEALGCSVIPLYCEVDGNFPNHHPDPGKPENLKDLIAKVKAENADLGLA
FDGDGDRVGVVTNTGTIIYPDRLLMLFAKDVVSRNPGADIIFDVKCTRRLIALISGYGGRPVMWKTGHSLIKKKMKETGA
LLAGEMSGHVFFKERWFGFDDGIYSAARLLEILSQDQRDSEHVFSAFPSDISTPEINITVTEDSKFAIIEALQRDAQWGE
GNITTLDGVRVDYPKGWGLVRASNTTPVLVLRFEADTEEELERIKTVFRNQLKAVDSSLPVPF
>P26276 5.4.2.2~~~algC~~~Phosphomannomutase/phosphoglucomutase~~~
MSTAKAPTLPASIFRAYDIRGVVGDTLTAETAYWIGRAIGSESLARGEPCVAVGRDGRLSGPELVKQLIQGLVDCGCQVS
DVGMVPTPVLYYAANVLEGKSGVMLTGSHNPPDYNGFKIVVAGETLANEQIQALRERIEKNDLASGVGSVEQVDILPRYF
KQIRDDIAMAKPMKVVVDCGNGVAGVIAPQLIEALGCSVIPLYCEVDGNFPNHHPDPGKPENLKDLIAKVKAENADLGLA
FDGDGDRVGVVTNTGTIIYPDRLLMLFAKDVVSRNPGADIIFDVKCTRRLIALISGYGGRPVMWKTGHSLIKKKMKETGA
LLAGEMSGHVFFKERWFGFDDGIYSAARLLEILSQDQRDSEHVFSAFPSDISTPEINITVTEDSKFAIIEALQRDAQWGE
GNITTLDGVRVDYPKGWGLVRASNTTPVLVLRFEADTEEELERIKTVFRNQLKAVDSSLPVPF
>P11759 1.1.1.132~~~algD~~~GDP-mannose 6-dehydrogenase~~~
MRISIFGLGYVGAVCAGCLSARGHEVIGVDVSSTKIDLINQGKSPIVEPGLEALLQQGRQTGRLSGTTDFKKAVLDSDVS
FICVGTPSKKNGDLDLGYIETVCREIGFAIREKSERHTVVVRSTVLPGTVNNVVIPLIEDCSGKKAGVDFGVGTNPEFLR
ESTAIKDYDFPPMTVIGELDKQTGDLLEEIYRELDAPIIRKTVEVAEMIKYTCNVWHAAKVTFANEIGNIAKAVGVDGRE
VMDVICQDHKLNLSRYYMRPGFAFGGSCLPKDVRALTYRASQLDVEHPMLGSLMRSNSNQVQKAFDLITSHDTRKVGLLG
LSFKAGTDDLRESPLVELAEMLIGKGYELRIFDRNVEYARVHGANKEYIESKIPHVSSLLVSDLDEVVASSDVLVLGNGD
ELFVDLVNKTPSGKKLVDLVGFMPHTTTAQAEGICW
>Q887P8 1.1.1.132~~~algD~~~GDP-mannose 6-dehydrogenase~~~COG1004
MRISIFGLGYVGAVCAGCLSARGHDVVGVDISSTKIDLINNGKSPIVEPGLEELLQKGLATGKLRGTTDFAEAIRATDLS
MICVGTPSKKNGDLELDYIESVCREIGYVLRDKNTRHTIVVRSTVLPGTVANVVIPILEDCSGKKAGVDFGVAVNPEFLR
ESTAIKDYDLPPMTVIGEFDKASGDVLQSLYEELDAPIIRKDIAVAEMIKYTCNVWHATKVTFANEIGNIAKAVGVDGRE
VMDVVCQDKALNLSQYYMRPGFAFGGSCLPKDVRALTYRAGSLDVDAPLLNSLMRSNTSQVQNAFDMVASYDTRKVALLG
LSFKAGTDDLRESPLVELAEMLIGKGFDLSIFDSNVEYARVHGANKDYIESKIPHVSSLLNSDFDQVINDSDVIILGNRD
ERFRSLANKTPEGKRVIDLVGFMTNATTEDGRAEGICW
>Q44494 5.1.3.37~~~algE1~~~Mannuronan C5-epimerase AlgE1~~~
MDYNVKDFGALGDGVSDDTAAIQAAIDAAHAAGGGTVYLPAGEYRVSGGEEPSDGCLTIKSNVHIVGAGMGETVIKMVDG
WTQNVTGMVRSAYGEETSNFGMSDLTLDGNRDNLSAKVDGWFNGYIPGQDGADRDVTLERVEIREMSGYGFDPHEQTINL
TIRDSVAHDNSLDGFVADYQVGGVFENNVSYNNDRHGFNIVTSTNDFVLSNNVAYGNGGAGLVVQRGSYDLPHPYDILID
GGAYYDNALEGVQLKMAHDVTLQNAEIYGNGLYGVRVYGAQDVQILDNQIHDNSQNGAYAEVLLQSYDDTAGVSGNFYVT
TGTWLEGNVISGSANSTYGIQERADGTDYSSLYANSIDGVQTGAVRLYGANSTVSSQSGSGQQATLEGSAGNDALSGTEA
HETLLGQAGDDRLNGDAGNDILDGGAGRDNLTGGAGADTFRFSARTDSYRTDSASFNDLITDFDADEDSIDLSALGFTGL
GDGYNGTLLLKTNAEGTRTYLKSYEADAQGRRFEIALDGNFTGLFNDNNLLFDAAPATGTEGSDNLLGTDAGETLLGYGG
NDTLNGGAGDDILVGGAGRDSLTGGAGADVFRFDALSDSQRNYTTGDNQADRILDFDPTLDRIDVSALGFTGLGNGRNGT
LAVVLNSAGDRTDLKSYDTDANGYSFELSLAGNYQGQLSAEQFVFATSQGGQMTIIEGTDGNDTLQGTEANERLLGLDGR
DNLNGGAGDDILDGGAGRDTLTGGTGADTFLFSTRTDSYRTDSASFNDLITDFDPTQDRIDLSGLGFSGFGNGYDGTLLL
QVNAAGTRTYLKSFEADANGQRFEIALDGDFSGQLDSGNVIFEPAVFNAKDFGALGDGASDDRPAIQAAIDAAYAAGGGT
VYLPAGEYRVSPTGEPGDGCLMLKDGVYLAGDGIGETVIKLIDGSDQKITGMVRSAYGEETSNFGMSDLTLDGNRDNTSG
KVDGWFNGYIPGQDGADRNVTIERVEIREMSGYGFDPHEQTINLTIRDSVAHDNGLDGFVADYLVDSVFENNVAYNNDRH
GFNIVTSTYDFVMTNNVAYGNGGAGLTIQRGSEDLAQPTDILIDGGAYYDNALEGVLFKMTNNVTLQNAEIYGNGSSGVR
LYGTEDVQILDNQIHDNSQNGTYPEVLLQAFDDSQVTGELYETLNTRIEGNLIDASDNANYAVRERDDGSDYTTLVDNDI
SGGQVASVQLSGAHSSLSGGTVEVPQGTDGNDVLVGSDANDQLYGGAGDDRLDGGAGDDLLDGGAGRDDLTGGTGADTFV
FAARTDSYRTDAGVFNDLILDFDASEDRIDLSALGFSGFGDGYNGTLLVQLSSAGTRTYLKSYEEDLEGRRFEVALDGDH
TGDLSAANVVFADDGSAAVASSDPAATQLEVVGSSGTQTDQLA
>Q44495 5.1.3.37~~~algE2~~~Mannuronan C5-epimerase AlgE2~~~
MDYNVKDFGALGDGVSDDTAAIQAAIDAAYAAGGGTVYLPAGEYRVSGGEEPSDGCLTIKSNVHIVGAGMGETVIKLVDG
WDQDVTGIVRSAYGEETSNFGMSDLTLDGNRDNTSGKVDGWFNGYIPGEDGADRDVTLERVEIREMSGYGFDPHEQTINL
TIRDSVAHDNGLDGFVADFQIGGVFENNVSYNNDRHGFNIVTSTNDFVLSNNVAYGNGGAGLVVQRGSSDVAHPYDILID
GGAYYDNGLEGVQIKMAHDVTLQNAEIYGNGLYGVRVYGAEDVQILDNYIHDNSQNGSYAEILLQSYDDTAGVSGNFYTT
TGTWIEGNTIVGSANSTYGIQERDDGTDYSSLYANSVSNVQNGSVRLYGANSVVSDLPGTGQQATLEGTAGNDTLGGSDA
HETLLGLDGNDRLNGGAGNDILDGGAGRDNLTGGAGADLFRVSARTDSYRTDSASFNDLITDFDASQDRIDLSALGFTGL
GDGYNGTLLLQVSADGSRTYLKSLEADAEGRRFEIALDGNFAGLLGAGNLLFERTAIEGDAGDNALLGTSAAETLLGHAG
NDTLDGGAGDDILVGGAGRDSLTGGAGADVFRFDALSDSQRNYDIGDNQGDRIADFAVGEDKLDVSALGFTGLGDGYNGT
LALVLNSAGDRTYVKSYENGADGYRFEFSLDGNYLELLGNEDFIFATPSGQQLLEGSAGNDSLQGTAADEVIHGGGGRDT
LAGGAGADVFRFSELTDSYRDSASYADLITDFDASEDRIDLSGLGFSGLGNGYGGTLALQVNSAGTRTYLKSFETNAAGE
RFEIALDGDLSALGGANLILDARTVLAGGDGNDTLSGSSAAEELLGGVGNDSLDGGAGNDILDGGAGRDTLSGGSGSDIF
RFGGALDSFRNYASGTNGTDSITDFTPGEDLIDLSVLGYTGLGDGYNGTLAIVLNDAGTKTYLKNRESDAEGNQFEIALE
GNHADQLDASDFIFATAAATTGIEVVGGSGTQTDQLA
>Q44493 5.1.3.37~~~algE4~~~Mannuronan C5-epimerase AlgE4~~~
MDYNVKDFGALGDGVSDDRASIQAAIDAAYAAGGGTVYLPAGEYRVSAAGEPGDGCLMLKDGVYLAGAGMGETVIKLIDG
SDQKITGMVRSAYGEETSNFGMRDLTLDGNRDNTSGKVDGWFNGYIPGGDGADRDVTIERVEVREMSGYGFDPHEQTINL
TIRDSVAHDNGLDGFVADYLVDSVFENNVAYANDRHGFNVVTSTHDFVMTNNVAYGNGSSGLVVQRGLEDLALPSNILID
GGAYYDNAREGVLLKMTSDITLQNADIHGNGSSGVRVYGAQDVQILDNQIHDNAQAAAVPEVLLQSFDDTAGASGTYYTT
LNTRIEGNTISGSANSTYGIQERNDGTDYSSLIDNDIAGVQQPIQLYGPHSTVSGEPGATPQQPSTGSDGEPLVGGDTDD
QLQGGSGADRLDGGAGDDILDGGAGRDRLSGGAGADTFVFSAREDSYRTDTAVFNDLILDFEASEDRIDLSALGFSGLGD
GYGGTLLLKTNAEGTRTYLKSFEADAEGRRFEVALDGDHTGDLSAANVVFAATGTTTELEVLGDSGTQAGAIV
>Q44492 5.1.3.37~~~algE5~~~Mannuronan C5-epimerase AlgE5~~~
MDYNVKDFGALGDGVSDDTAAIQAAIDAAYAAGGGTVYLPAGEYRVSGGEEPSDGCLTIKSNVYIVGAGMGETVIKLVDG
WDQDVTGIVRSAYGEETSNFGMSDLTLDGNRDNTSGKVDGWFNGYIPGEDGADRDVTLERVEIREMSGYGFDPHEQTINL
TIRDSVAHDNGLDGFVADFQIGGVFENNVSYNNDRHGFNIVTSTNDFVLSNNVAYGNGGAGLVIQRGSYDVAHPYGILID
GGAYYDNGLEGVQIKMAHDVTLQNAEIYGNGLYGVRVYGAEDVQILDNYIHDNSQSGSYAEILLQSYDDTAGVSGNFYTT
TGTWIEGNTIVGSANSTYGIQERADGTDYSSLYANSVSNVQSGSVRLYGTNSVVSDLPGTGQQATLEGTTGNDTLTGSEA
HETLLGLDGNDRLNGGAGNDILDGGAGRDNLTGGAGADLFRVSARTDSYRTDSASFNDLITDFDPAQDRIDLSALGFTGL
GDGYNGTLAVVLNSAGTRTYLKSYEADAEGRRFEIALDGNFAGLLDDGNLIFERPVIEGDAGNNALLGTSAAETLLGHAG
NDTLDGAGGDDILVGGAGRDTLTGGAGADLFRFDALSDSQRNYTTGDNQGDRIVDFSVGEDKLDVSALGFTGLGDGYNGT
LAVVVNSAGDRTYVKSYETDADGYRFEFSLEGNYQDLGSESFVFATPSGQQLLEGSAGNDSLQGTAADEIVHGGAGRDTL
SGGAGADVFRFSELTDSYRTASTSFADLITDFDLADDRIDLSGLGFSGLGDGYDGTLAVVVNSTGTRTYLKSYEANAAGE
RFEIALDGDLSAFTGANLILDERVVLEGSDGNDTLDGGSAAEELLGGAGNDSLDGGAGNDILDGGAGRDTLSGGSGSDIF
RYDDALDSFRNYGTGVTGTDTITDFTPGEDLIDLSALGYTGLGDGYNGTLAVVLNGDGTRTYLKDRESDAEGNQFEIALD
GDLVDRLDAGDFIFAEAAATTAIEVVGGTPTEEQLVA
>Q9ZFH0 5.1.3.37~~~algE6~~~Mannuronan C5-epimerase AlgE6~~~
MDYNVKDFGALGDGVSDDRVAIQAAIDAAHAAGGGTVYLPPGEYRVSAAGEPSDGCLTLRDNVYLAGAGMGQTVIKLVDG
SAQKITGIVRSPFGEETSNFGMRDLTLDGNRANTVDKVDGWFNGYAPGQPGADRNVTIERVEVREMSGYGFDPHEQTINL
VLRDSVAHHNGLDGFVADYQIGGTFENNVAYANDRHGFNIVTSTNDFVMRNNVAYGNGGNGLVVQRGSENLAHPENILID
GGSYYDNGLEGVLVKMSNNVTVQNADIHGNGSSGVRVYGAQGVQILGNQIHDNAKTAVAPEVLLQSYDDTLGVSGNYYTT
LNTRVEGNTITGSANSTYGVQERNDGTDFSSLVGNTINGVQEAAHLYGPNSTVSGTVSAPPQGTDGNDVLIGSDVGEQIS
GGAGDDRLDGGAGDDLLDGGAGRDRLTGGLGADTFRFALREDSHRSPLGTFSDLILDFDPSQDKIDVSALGFIGLGNGYA
GTLAVSLSADGLRTYLKSYDADAQGRSFELALDGNHAATLSAGNIVFAAATPVDPSAEAQPIVGSDLDDQLHGTLLGEEI
SGGGGADQLYGYGGGDLLDGGAGRDRLTGGEGADTFRFALREDSHRSAAGTFSDLILDFDPTQDKLDVSALGFTGLGNGY
AGTLAVSVSDDGTRTYLKSYETDAEGRSFEVSLQGNHAAALSADNILFATPVPVDPGVEGTPVVGSDLDDELHGTLGSEQ
ILGGGGADQLYGYAGNDLLDGGAGRDKLSGGEGADTFRFALREDSHRSPLGTFGDRILDFDPSQDRIDVSALGFSGLGNG
YAGSLAVSVSDDGTRTYLKSYEADAQGLSFEVALEGDHAAALSADNIVFAATDAAAAGELGVIGASGQPDDPAV
>Q9ZFG9 4.2.2.3~~~algE7~~~Alginate lyase 7~~~
MEYNVKDFGAKGDGKTDDTDAIQAAIDAAHKAGGGTVYLPSGEYRVSGGDEASDGALIIKSNVYIVGAGMGETVIKLVDG
WDEKLTGIIRSANGEKTHDYGISDLTIDGNQDNTEGEVDGFYTGYIPGKNGADYNVTVERVEIREVSRYAFDPHEQTINL
TIRDSVAHDNGKDGFVADFQIGAVFENNVSYNNGRHGFNIVTSSHDIVFTNNVAYGNGANGLVVQRGSEDRDFVYNVEIE
GGSFHDNGQEGVLIKMSTDVTLQGAEIYGNGYAGVRVQGVEDVRILDNYIHDNAQSKANAEVIVESYDDRDGPSDDYYET
QNVTVKGNTIVGSANSTYGIQERADGTDYTSIGNNSVSGTQRGIVQLSGTNSTFSGRSGDAYQFIDGSTGNDLLTGTPIA
DLIVGGSGNDTLSGDAGNDVLEGGAGSDRLTGGEGADIFRFTAVSDSYYTASSSVADQILDFDASNDRIDLTGLGFTGLG
DGYGGTLAVLANSDGSRTYLRSYEKDADGRYFSLTLDGNFVGRLDDSNLVFRHKTIAGTEGDDSLTGNAMAEILDGGSGN
DSLAGGLGNDVLRGGAGDDILNGGLGRDQLSGGEGADIFRFTSVADSYQNSGDNFSDLILDFDPGEDRIDLSGLGFSGLG
DGHNGTLLLWTSSETNRTYLKNFDTDADGRRFEIALEGVFSDLSEKQLVFERLVLEGTRLGDQLSGTELNEELLGGAGRD
ILNGGAGDDILDGGSERDTLTGGSGADVFRFNATLDSFRNYDNGTSRVDDITDFTVGEDLIDLSALGYSGLGNGYDGTLA
VLLNADGTKTYLKDRESDADGNHFEIALDGNYADQLSNGDFIFTNLEVIGSSSQAA
>P18895 ~~~algE~~~Alginate production protein AlgE~~~
MNSSRSVNPRPSFAPRALSLAIALLLGAPAFAANSGEAPKNFGLDVKITGESENDRDLGTAPGGTLNDIGIDLRPWAFGQ
WGDWSAYFMGQAVAATDTIETDTLQSDTDDGNNSRNDGREPDKSYLAAREFWVDYAGLTAYPGEHLRFGRQRLREDSGQW
QDTNIEALNWSFETTLLNAHAGVAQRFSEYRTDLDELAPEDKDRTHVFGDISTQWAPHHRIGVRIHHADDSGHLRRPGEE
VDNLDKTYTGQLTWLGIEATGDAYNYRSSMPLNYWASATWLTGDRDNLTTTTVDDRRIATGKQSGDVNAFGVDLGLRWNI
DEQWKAGVGYARGSGGGKDGEEQFQQTGLESNRSNFTGTRSRVHRFGEAFRGELSNLQAATLFGSWQLREDYDASLVYHK
FWRVDDDSDIGTSGINAALQPGEKDIGQELDLVVTKYFKQGLLPASMSQYVDEPSALIRFRGGLFKPGDAYGPGTDSTMH
RAFVDFIWRF
>Q06062 ~~~algF~~~Alginate biosynthesis protein AlgF~~~
MNPMTRRHTWTRLACALSLGVAAFAAQADEGALYGPQAPKGSAFVRAYNAGNSELDVSVGSTSLNDVAPLGSSDFKFLPP
GSYTAQVGQQSLPVKLDPDSYYTLVSQPGGKPQLVAEPPFKNKQKALVRVQNLSGSKLTLKTADGKTDVVKDVGPQSHGD
REINPVKVNLALFDGSKKVSDLKPVTLARGEVVCLYVTGSGGKLAPVWVKRPVKAD
>P70805 5.1.3.37~~~algG~~~Mannuronan C5-epimerase AlgG~~~
MNVQRKLASTQLKPVLLGVLLATSAWSQAAPPEQARQSAPPTLSSKQYSVTSASIEALKLDPPKLPDLSGYTHAAVEAKI
RRKPGGRIAAAMLQQTALKDFTGGSGRLREWIVRQGGMPHAIFIEGGYVELGQLARQLPANQFAETTPGVYVARVPIVVA
PGATLHIGKNVKELRLSEERGAFLVNDGKLFITDTKLVGWSEKNNAPSAYRGPESFWAFLVSWGGTETYISRRPVASLGY
NTSKAYGVSITQYTPEMHKRLKRPRPTGWLIDSVFEDIYYGFYCYEADDVVLKGNTYRDNIIYGIDPHDRSERLVIAENH
VYGTKKKHGIIVSREVNNSWIINNRTHDNKLSGIVLDRNSEHNLVAYNEVYQNHSDGITLYESSNNLIWGNRLINNARHG
IRMRNSVNIRIYENLSVVNQLTGIYGHIKDLSSTDRDFKLDPFDTKVSMIVVGGQLTGNGSSPISVDSPLSLELYRVEML
APTKSSGLTFTGILEDKQEEILDLLVRRQKAVLIDPVVDLAQAEL
>Q51371 5.1.3.37~~~algG~~~Mannuronan C5-epimerase~~~
MPDISLSIPRRRLPRLRPLAAAVLGAVLLHGQAWAAQPVEKPQPVPAQAGNEPGLTQGLKETGNYTVTTAPAEPLHLDPP
KLPDLSGYTAAAVEAKIVRKPGGRASVQRMVQQQPLKEFTGGSNRLAEWVKRQRQMPQAIFIEGGYVNLAQLAGKLPASA
LEQVEPGVFVARLPIVVSQGATLDIDKQVKELRLSQERGAFLVNDGMLFVRDSKVTGWSESKKEPAWFKTPNEFRPFLIS
WGGAEVYLSNSTFTSFGYNASKAYGISISQYSPGMDKQMKRPRPKGWVIDSTIVDSWYGFYCYEADDLVVKGNTYRDNIV
YGIDPHDRSHRLIIADNTVHGTRKKHGIIVSREVNDSFIFNNRSYENKLSGIVLDRNSEGNLVAYNEVYRNHSDGITLYE
SGDNLLWGNQVLANRRHGIRVRNSVNIRLYENLAAGNQLIGVYGHIKDLTNTDRNIALDPFDTKVSLIVVGGKLAGNGSG
PLSVDSPLSLELYRVAMLAPTKSSGISLPGVLGEKQDQILDLLVRQDKAVLIDPVESQAELQD
>P59828 5.1.3.37~~~algG~~~Mannuronan C5-epimerase~~~COG3420
MGACAMNPQALKGSAMLAAAMLLASGAAMADVAPQAKAPTIAKELQQAKTYTISSPPTAPLEMAKPALPALSGYTDAAME
KKIVRAKPGKISIRRMMQEDALKDFIGGDNKMAEWVVRQHGIPQAIFIDDGYMNLKDLLGKVPKQYLSETSPGVFLAKLP
IVVGRKGILEIDKKTQELRLSQEAGSFLINDGQLFVRDTKVTGWSEKANGPALYKSPKEFRPFLLAWGGTETYISNTKMA
SFGYANSKSYGVSISQYTPNMAKVLKRPEPTGWIIDSEFSDMWYGFYCYETTGFVIKGNTYKDNIVYGIDPHDRSHGLII
ADNTVYGTKKKHGIIISREVNDSFIFNNRSYDNKLSGLVLDRNSVNNFVADNEFYRNHTDGITLYESGDNLLWGNKVIAN
RRHGIRVRNSVNIKLYENTSMANGLTGLYGHIKDLTDTDRDIALDPFDAKVSLIVVGGELAGNGSGPLSIDSPLSVELYR
VSMLAPTKSSGISFNGVLGDRQEEILDLLVRQQKAVLIDPVERQTELQD
>Q887Q3 5.1.3.37~~~algG~~~Mannuronan C5-epimerase~~~COG3420
MNSHASNGRSRNWPHALLESALLTSALLMASSVALANAPAVPEAPKALVKELHQAKTYTITSPPTGPLEMAKPVLPDLSG
YTTEAALKKIARNKPGKITVARMMEETGLKEFIGGDNKMAEWVVRQKGIPQAIMISDGYVNLQDLVKKVPKQFLSEVSPG
VYVARLPILVKETGIFEIDSKTKELRLSQEKGSFIVSEGKMLITNTSVNAWSETRNGLAAYRTPDEFRPFVLTWGGSQTW
IAKTKMASMGYNQSKSYGVSISQYTPNTAKVLKRGEPTGWIIDSEFADMWYGFYCYETRDFVVKGNTYRDNIVYGIDPHD
RSHGLIIAENDVYGTKKKHGIIISREVDNSFIFRNKSHNNKLSGVVLDRNSVGNIVAYNEIYQNHTDGITLYESGNNLLW
GNRVIANRRHGIRVRNSVNIKLYENVAMANGLMGVYGHIKDLNDTDRDIELDPFDAQVSLIMVGGELSSNGSGPLSIDSP
LSVELYRVSMLMPTKEVGISLNGILGERQDEILDLLVRQKKAVLIDPVESQTELRE
>Q9RQ16 ~~~algH~~~UPF0301 protein AlgH~~~
MKQSSPTYLKHHFLIAMPHMADPNFAQTVTYLVEHNEQGAMGLVINRPSGLNLAEVLEQLKPDALPPARCQHIDIYNGGP
VQTDRGFVLHPSGLSYQSTLELGELAMSTSQDVLFAIAAGTGPEKSLISLGYAGWEAGQLEAELSDNAWLTCPADPAILF
DLPPEERLSAAAARLGVNLSLLTAQAGHA
>Q88ND3 2.3.1.-~~~algJ~~~Probable alginate O-acetylase AlgJ~~~
MTRTLRITYSLSFLGLLVGMGAWSTGGLQSFQRTEQMTLLNGKLAKAAETHYDAEFPIKRLGTNVWAAMDFKLFNEGRPG
VVLGRDQWLFSDEEFKPTAGAEQLMQENLALIRGVRDTLQQHGSQLVLAIVPAKARVYTEYLGKERPASLHDDLYNQFHA
QARQANVFAPDLMAPMEQAKARGQVFLRTDTHWTPMGAEVAAQALAEAVSRQSLLNGDPQAFITEAGNTAPYKGDLTNFL
PLDPLFSNLLPAPDNLQKRTTRPVDAEGDAGDALFADKQIPVALVGTSYSANPHWNFLGALQQALRSDVANYAEDGHGPL
LPMLKYLQSDAFKNAAPQVVVWEFPERYLPMKNDLSSFDPQWIAQLKNSRKSEENLALSSTRTDH
>P96956 ~~~algK~~~Alginate biosynthesis protein AlgK~~~
MKMPILPPLPLASRHLLLASAIALAAGCAGLPDQRLAQEALERGDLATAQSNYQALAAMGYADAQVGLADMQVASGDSAQ
QAKAEKLYREAAQTSPRARARLGKWLAAKPGASDAEHREAERLLSQAFEQGEDSALVPLIVLYLQYPQSWPEIDPQQRID
QWRARGLPQADLAQIILYRTQGTYAQHLGEIEQVCQRWLRRMDVCWYELATVYQMQGNAEKQKVLLEQLRAAYKAGRVPG
ERVDSVAGVLADGELGQPDPQTAQALLEEIAPSYPAAWVSLAKLLYDYPDQGDLEKMLGYLKNAQDAAQPRAELLLGRLY
YDGKWAPQDPRKAERHLLKAAASEPQANYYLGQIYRRGFLGKVYPQKAVDHLILAARAGQASADMALAQLWSQGRGIQPN
RVNAYVFGQLAVQQQVPQASDLLGQIEAQLPPAERSQAQQLLKREQQSRGNNWQATVSLLQSQDSPINEEEPESL
>Q88NC7 ~~~algK~~~Alginate biosynthesis protein AlgK~~~COG0790
MACEGRTMISDTYKVRVNLGLCALAAAITLAGCAGLPDQRLANEALKRGDTALAERNYKALADLGYSEAQVGLADIKVAT
RDPSQIKEAEATYRAAAATSPRAQARLGRLLVAKPDSTQAEREEAETLLKQAAKQGQSNTLIPLAMLYLSYPQSFPKVNA
QQQIDQWRAAGNPEAGLAQVLLYRTQGTYDQHLGEVEKICKAALNTTDICYVELATVYQKRGQADQQAALLGQLKSAYAR
GAVPATRVDSVARVLADRSLGQTDEKTAKELLEQVAPANPASWVSLAQLVYDFPELGDTDQLMAYIDKGREAEQPRAELL
LGRLYYEGKTLPADAQKAEQHLQAAAEAGEISAHYYLGQLYRRGYLGNVEPQKAVDHLLAAARGGQNSADYALAQLFSEG
HGIRPQPGNAWVFAQLSQANPTPQSAELLQQLDQQLTPDQRNQAQQLLDQEKRARGSLAQGANSTLALEALQDDEKEVDG
EDSL
>Q9X0Z7 5.1.1.1~~~aar~~~L-alanine/L-glutamate racemase~~~COG0626
MNTDDILFSYGEEDIPLKALSFPIFETTNFYFDSFDEMSKALRNGDYEFVYKRGSNPTTRLVEKKLAALEECEDARLVAS
GMSAISLSILHFLSSGDHVVCVDEAYSWAKKFFNYLSKKFDIEVSYVPPDAERIVEAITKKTKLIYLESPTSMRMKVIDI
RKVTEAAGELKIKTVIDNTWASPIFQKPKLLGVDVVVHSATKYISGHGDVMAGVIAGDVEDMKNIFVDEYKNIGPVLSPI
EAWLILRGLRTLELRMKKHYENALVVSDFLMDHPKVLEVNYPMNPRSPQYELASSQMSGGSGLMSFRLKTDSAEKVKEFV
ESLRVFRMAVSWGSHENLVVPRVAYGDCPKKDVNLIRIHVGLGDPEKLVEDLDQALKKI
>Q73GL9 5.1.1.1~~~aar~~~L-alanine/L-glutamate racemase~~~COG0626
MKEESILVKAGRKFNDYKGSMNPPVYHSSTILFPTYKDYLNAANGESIYDVINDGVARDYSYSNVGTPTVHYLSNALAEI
EGSGQALIYPSGLFALTFAILTFAKAGSHVLIQDNSYYRLRRFAENELPKRGTEVTFYDPTQDITDLIQSNTSLIMIETP
GSVTFEISNIEHIVKVAKEHKIVTVCDNSWATPLLFKPLDYGVDVALYAVTKYLAGHSDLVMGAIIAEGEIFKLLYESYK
NYGVTIQSHDCYLAHRGLRTLYTRMKRHQNTAMEVAKWLEKHSKIKKVLYPALPFHPQHELWKSYFKGASGTFSIALDRE
YSCEELSCMVDHMKIFGIGASWGGCDSLILPIDRRSMSRSVMNSDYGGSFIRIFCGLEDPEDLISDLNAALARLPCLNTK
TGR
>O50660 4.2.2.3~~~algL~~~Alginate lyase~~~
MKTRLALPCLLGSLLLSSAVHAASALVPPKGYYAALEIRKGEAQACQAVPEPYTGELVFRSKYEGSDSARSTLNKKAEKA
FRAKTKPITEIERGVSRMVMRYMEKGRLRRAGMRPGLLDAWAEDDALLSTEYNHTGKSMRKWALGSLAGAYLRLKFSTSQ
PLAAYPEQAKRIEAWFAKVGDQVIKDWSDLPLKQINNHSYWAAWSVMAAGVATNRRPLFDWAVEQFHIAAKQVDPRGFLA
NELKRRQRALAYHNYSLPPLMMIAAFAQANGVDLRGDNDGALGRLAGNVLAGVEDPEPFAERAGEDQDMEDLETDAKFSW
LEPYCALYACSPALRERKAEMGPFKNFRLGGDVTRIFDPQEKPSKSTVGNAD
>O52195 4.2.2.3~~~algL~~~Alginate lyase~~~
MHKTRLALSCLLGSLLLSGAVHAAEALVPPKGYYAPVDIRKGEAPACPVVPEPFTGELVFRSKYEGSDAARSTLNEEAEK
AFRTKTAPITQIERGVSRMVMRYMEKGRAGDLECTLAWLDAWAEDGALLTTEYNHTGKSMRKWALGSLAGAYLRLKFSSS
QPLAAYPEQARRIESWFAKVGDQVIKDWSDLPLKRINNHSYWAAWAVMAAGVATNRRPLFDWAVEQFHIAAGQVDSNGFL
PNELKRRQRALAYHNYSLPPLMMVAAFALANGVDLRGDNDGALGRLAGNVLAGVEKPEPFAERAGDEDQDMEDLETDAKF
SWLEPYCALYSCSPALRERKAEMGPFKNFRLGGDVTRIFDPAEKSPRSTVGKRD
>Q06749 4.2.2.3~~~algL~~~Alginate lyase~~~
MKTSHLIRIALPGALAAALLASQVSQAADLVPPPGYYAAVGERKGSAGSCPAVPPPYTGSLVFTSKYEGSDSARATLNVK
AEKTFRSQIKDITDMERGATKLVTQYMRSGRDGDLACALNWMSAWARAGALQSDDFNHTGKSMRKWALGSLSGAYMRLKF
SSSRPLAAHAEQSREIEDWFARLGTQVVRDWSGLPLKKINNHSYWAAWSVMSTAVVTNRRDLFDWAVSEFKVAANQVDEQ
GFLPNELKRRQRALAYHNYALPPLAMIAAFAQVNGVDLRQENHGALQRLAERVMKGVDDEETFEEKTGEDQDMTDLKVDN
KYAWLEPYCALYRCEPKMLEAKKDREPFNSFRLGGEVTRVFSREGGS
>Q9L7P2 4.2.2.3~~~algL~~~Alginate lyase~~~
MQTPKLIRPTLLSMAILSSMAWATGASAALVPPKGYDAPIEKMKTGDHNFSCEAIPKPYTDKLVFRSKYEGSDKARATLN
AVSEEAFRDATKDITTLERGVSKVVMQYMRDGRPEQLDCALNMMTTWAKADALESREFNHTGKSMRKWALGSMSSAYLRL
KFSESHPLANRQQDAKIIETWFSKLADQVVSDWSNLPLEKINNHSYWAAWSVMATAVATNRQDLFDWAVKEYKVAANQVD
KDGFLPNEMKRRQRALSYHNYALPPLAMIASFAQANGVDLRPENNGALKRLGDRVLAGVKDPSIFAEHNGEKQDMTDLKK
DPKFAWLEPYCSLYTCSPDVLEEKHEKQPFKTFRLGGDLTKVYDPTHEKGDKGDNDGS
>Q51372 2.3.1.-~~~algX~~~Alginate biosynthesis protein AlgX~~~
MKTRTSRLFRLSALAAGLCLAQAALAADPGAAPSYQALPAGNLCPAAAYDSRYNTKYLGFFTHLVQAQDDWLFRTTYDLR
TDFGTSAEGWRELRALRDELKRKGIELVVVYQPTRGLVNREKLSPAEKAGFDYELAKKNYLATIARFRQAGIWTPDFSPL
FDEKEEHAYYFKGDHHWTPHGARRSAKIVAETLKQVPGFEEIPKKQFESKRVGLLSKLGTFHKAAAQLCGNSYATQYVDR
FETEPVGASDSGDLFGDGGNPQIALVGTSNSGPAYNFAGFLEEFSGADILNNAVSGGGFDSSLLAYMTSEEFHKNPPKIL
IWEFATHYDMAQKSFYRQAMPLVDNGCSGRKTVLSRKVKLRQGRNEVLLNSAALPIRSGSYVADVTYSDPSVHELKNTIW
YMNGRREQLKIEQSKAVDTGGRYVFQLRNDSDWADQQFLSLEIEAPEDMPQGLEVQASICQAAPAKASQSVAGR
>Q88ND0 2.3.1.-~~~algX~~~Alginate biosynthesis protein AlgX~~~
MTPHLMKLLGLSAALLAISQGVRAEDVKAPTFSAEPCCQLCPEAHDASRYTTRYQQNFTTLVQAQGDWLFRTREDLRTEF
NTTPAGYKRLQQVHDAFKKRGVELVVVYQPTRGLVNRNMLNPAEKAAFDYQKALGNYQAMLKRFASMGYNVPDLSPLTNE
QLAAADQGKDFYFRGDQHWTPYGAERAAKIVADTVHKMPAFEGIPRKEFETRKSGRMGKTGTLHNVAGQLCGTSYAVQYM
DQFATEPKGASGGDDLFGDSGNAQITLVGTSHSGKNYNFSGFLEQYIGADVLNVAFPGGGLEGSMIQYLGSEEFQKNPPK
ILIWEFSPLYRLDQETIWRQILGLLDDGCDDRPALMSASTTLKPGKNELMVNGKGGVIKDLINRNLQMDVKFEDPSVKVL
QATLWYLNGRHEDIKLEKPETSDTDGRFVFQMREDEDWASQRLLAFEVQGPESGTQKVEAKLCKRNNFAVPAQTAQAGQ
>P9WJW3 ~~~alkA~~~Probable bifunctional transcriptional activator/DNA repair enzyme AlkA~~~COG0122
MHDDFERCYRAIQSKDARFDGWFVVAVLTTGVYCRPSCPVRPPFARNVRFLPTAAAAQGEGFRACKRCRPDASPGSPEWN
VRSDVVARAMRLIADGTVDRDGVSGLAAQLGYTIRQLERLLQAVVGAGPLALARAQRMQTARVLIETTNLPFGDVAFAAG
FSSIRQFNDTVRLACDGTPTALRARAAARFESATASAGTVSLRLPVRAPFAFEGVFGHLAATAVPGCEEVRDGAYRRTLR
LPWGNGIVSLTPAPDHVRCLLVLDDFRDLMTATARCRRLLDLDADPEAIVEALGADPDLRAVVGKAPGQRIPRTVDEAEF
AVRAVLAQQVSTKAASTHAGRLVAAYGRPVHDRHGALTHTFPSIEQLAEIDPGHLAVPKARQRTINALVASLADKSLVLD
AGCDWQRARGQLLALPGVGPWTAEVIAMRGLGDPDAFPASDLGLRLAAKKLGLPAQRRALTVHSARWRPWRSYATQHLWT
TLEHPVNQWPPQEKIA
>Q0VKZ3 1.14.15.3~~~alkB1~~~Alkane 1-monooxygenase 1~~~COG3239
MSENILTEPPRSDADNEGYVDRKRHLWILSVLWPATPIIGLYLVSQTGWSIWYGLVLILWYGLVPLIDTMLGEDYSNPPE
SVVPKLEQDRYYKVLTYLTVPIHYAALIISAWWVSTQPIGVFEFLALALSLGIVNGLALNTGHELGHKKETFDRWMAKLV
LAVVGYGHFFIEHNKGHHRDVATPMDPATSRMGESIYTFSLREIPGAFKRAWGLEEQRLSRCGKSVWSLDNEVLQPMILT
VVLYAALLAFFGPLMLIFLPIQMAFGWWQLTSANYIEHYGLLREKLPNGRYEHQKPHHSWNSNHVMSNLILFHLQRHSDH
HAHPTRSYQSLRDFSDLPTLPTGYPGMFFVAFFPSWFRSLMDDRVMEWAHGDINKIQIQPGMREFYEQKFGVKGSESPDT
TVAK
>Q9I0R2 1.14.15.3~~~alkB1~~~Alkane 1-monooxygenase 1~~~
MFENFSPSTMLAIKKYAYWLWLLLALSMPFNYWMAQDSAHPAFWAFSLVIAVFGIGPLLDMLFGRDPANPDEETQTPQLL
GQGYYVLLTLATVPVLIGTLVWAAGVFVAFQEWGWLGRLGWILSMGTVMGAVGIVVAHELIHKDSALEQAAGGILLAAVC
YAGFKVEHVRGHHVHVSTPEDASSARFGQSVYQFLPHAYKYNFLNAWRLEAVRLRKKGLPVFGWQNELIWWYLLSLALLV
GFGWAFGWLGMVFFLGQAFVAVTLLEIINYVEHYGLHRRKGEDGRYERTNHTHSWNSNFVFTNLVLFHLQRHSDHHAFAK
RPYQVLRHYDDSPQMPSGYAGMVVLALIPPLWRAVMDPKVRAYYAGEEFQLTAEQSERPAAS
>Q0VTH3 1.14.15.3~~~alkB2~~~Alkane 1-monooxygenase 2~~~COG3239
MFENTNPDVMLKMKKYGYLAFWAIMVPLVPFSAFVGVESGTQDYWAWFMYAFIFGIIPVLDYLVGKDPTNPSEDVQVPTM
SEEVFYRVSAIAMGFVWIAVLFYAGHIFMNNGYGLLGKIGWIVSIGTVGGIIAINLGHELIHKDPKVENWMGGLLLSSVT
YAGFKVEHVRGHHVHVSTPDDASSSRYNQSLYNFLPKAFVHNFINAWSLEKKYLERKGKKNISVHNELIWWYSISALFAA
TFGLLWGWQGVVFFLGQSFFAALALEIINYIEHYGLHRRVNDKGRFERVTPAHSWNSNFLLTNLALFQLQRHSDHHAYAK
RRYQVLRHYEESPQLPAGYATMYVLALIPPLWRKVMNPRVEAYYEGELDQLFRDGKRVNNIA
>Q6H941 1.14.15.3~~~alkB2~~~Alkane 1-monooxygenase 2~~~
MFASLSSAWMLRLKKYGYWIWLIAVLGIPLSYWWSLGSDYPNAWPWLVISVVFGLIPILDAIVGRDPANPEEASEVPEME
AQGYYRVLSLATVPLLLGMLVWSGWILAHETRWDWVGQLGWILSVGTVMGAIGITVSHELIHKDPQLEQNAGGLLLAAVC
YAGFKVEHVRGHHVHVSTPEDASSSRYGQSLYSFLPHAYKHNFLNAWRLEAERLKRKGLPALHWRNELIWWYAISALFLL
GFSLAFGWLGAIFFLGQSVMAFTLLEIVNYVEHYGLHRRRLDNGRYERTTPEHSWNSNFLLTNLFLFHLQRHSDHHAYAK
RRYQVLRHYDSSPQLPNGYAGMIVLALFPPLWRAVMDPKVRAYYAGEEYQLTDTQRI
>O31250 1.14.15.3~~~alkB~~~Alkane 1-monooxygenase~~~COG3696
MNAPVHVDQNFEEVINAARSMREIDRKRYLWMISPALPVIGIGILAGYQFSPRPIKKIFALGGPIVLHIIIPVIDTIIGK
DASNPTSEEIKQLENDPYYARLVKSFIPLQYIANVYACYLVSRKKTSFIDKILLGISMGAINGIAVNTAHELSHKADRLD
HILSHLALVPTGYNHFRIEHPYGHHKRAATPEDPASSQMGETFYEFWPRTVFGSLKSAIEIETHRLKRKGKKFWSKDNEL
LQGWGMSAAFHSSIIAIFGKGTIPYLVTQAFYGISLFEIINYIEHYGLKRQKRADGNYERTMPEHSWNNNNIVTNLFLYQ
LQRHSDHHAYPTRPFQALRHFDEAPELPSGYASMLLPAMIPPLWFKMMDKRVFEHYKEDLTKANIYPKRRAKILAKFGLT
DPNIENGK
>P05050 1.14.11.33~~~alkB~~~Alpha-ketoglutarate-dependent dioxygenase AlkB~~~COG3145
MLDLFADAEPWQEPLAAGAVILRRFAFNAAEQLIRDINDVASQSPFRQMVTPGGYTMSVAMTNCGHLGWTTHRQGYLYSP
IDPQTNKPWPAMPQSFHNLCQRAATAAGYPDFQPDACLINRYAPGAKLSLHQDKDEPDLRAPIVSVSLGLPAIFQFGGLK
RNDPLKRLLLEHGDVVVWGGESRLFYHGIQPLKAGFHPLTIDCRYNLTFRQAGKKE
>P12691 1.14.15.3~~~alkB~~~Alkane 1-monooxygenase~~~
MLEKHRVLDSAPEYVDKKKYLWILSTLWPATPMIGIWLANETGWGIFYGLVLLVWYGALPLLDAMFGEDFNNPPEEVVPK
LEKERYYRVLTYLTVPMHYAALIVSAWWVGTQPMSWLEIGALALSLGIVNGLALNTGHELGHKKETFDRWMAKIVLAVVG
YGHFFIEHNKGHHRDVATPMDPATSRMGESIYKFSIREIPGAFIRAWGLEEQRLSRRGQSVWSFDNEILQPMIITVILYA
VLLALFGPKMLVFLPIQMAFGWWQLTSANYIEHYGLLRQKMEDGRYEHQKPHHSWNSNHIVSNLVLFHLQRHSDHHAHPT
RSYQSLRDFPGLPALPTGYPGAFLMAMIPQWFRSVMDPKVVDWAGGDLNKIQIDDSMRETYLKKFGTSSAGHSSSTSAVA
S
>Q15X88 4.1.2.14~~~~~~2-dehydro-3-deoxy-phosphogluconate aldolase~~~COG0800
MTKNWKVSSQDVFSQGPVVPVLVIKDVKHAVPLAKALIAGGIRVLEVTLRTEAALDVIKAIATEVPDAIIGAGTVTNAKQ
LAEVEAAGAMFAISPGMTSDLLDAGNKGGIALIPGISSISELMRGIDFGYTHFKFFPAEASGGVKAIKAIGGPFPDIAFC
PTGGISPTNYLEYLSLPNVRCAGGSWLAPDDAVEAGDWDRITELAKQAVAGAAGI
>P00885 4.1.2.14~~~eda~~~2-dehydro-3-deoxy-phosphogluconate aldolase~~~COG0800
MTTLERPQPKLSMADKAARIDAICEKARILPVITIAREEDILPLADALAAGGIRTLEVTLRSQHGLKAIQVLREQRPELC
VGAGTVLDRSMFAAVEAAGAQFVVTPGITEDILEAGVDSEIPLLPGISTPSEIMMGYALGYRRFKLFPAEISGGVAAIKA
FGGPFGDIRFCPTGGVNPANVRNYMALPNVMCVGTGWMLDSSWIKNGDWARIEACSAEAIALLDAN
>P50846 ~~~kdgA~~~KHG/KDPG aldolase~~~COG0800
MESKVVENRLKEAKLIAVIRSKDKQEACQQIESLLDKGIRAVEVTYTTPGASDIIESFRNREDILIGAGTVISAQQAGEA
AKAGAQFIVSPGFSADLAEHLSFVKTHYIPGVLTPSEIMEALTFGFTTLKLFPSGVFGIPFMKNLAGPFPQVTFIPTGGI
HPSEVPDWLRAGAGAVGVGSQLGSCSKEDLQAVFQV
>P0A955 ~~~eda~~~KHG/KDPG aldolase~~~COG0800
MKNWKTSAESILTTGPVVPVIVVKKLEHAVPMAKALVAGGVRVLEVTLRTECAVDAIRAIAKEVPEAIVGAGTVLNPQQL
AEVTEAGAQFAISPGLTEPLLKAATEGTIPLIPGISTVSELMLGMDYGLKEFKFFPAEANGGVKALQAIAGPFSQVRFCP
TGGISPANYRDYLALKSVLCIGGSWLVPADALEAGDYDRITKLAREAVEGAKL
>P44480 ~~~eda~~~Putative KHG/KDPG aldolase~~~COG0800
MSYTTQQIIEKLRELKIVPVIALDNADDILPLADTLAKNGLSVAEITFRSEAAADAIRLLRANRPDFLIAAGTVLTAEQV
VLAKSSGADFVVTPGLNPKIVKLCQDLNFPITPGVNNPMAIEIALEMGISAVKFFPAEASGGVKMIKALLGPYAQLQIMP
TGGIGLHNIRDYLAIPNIVACGGSWFVEKKLIQSNNWDEIGRLVREVIDIIK
>Q00384 ~~~eda~~~KHG/KDPG aldolase~~~COG0800
MRDIDSVMRLAPVMPVLVIEDIADAKPIAEALVAGGLNVLEVTLRTPCALEAIKIMKEVPGAVVGAGTVLNAKMLDQAQE
AGCEFFVSPGLTADLGKHAVAQKAALLPGVANAADVMLGLDLGLDRFKFFPAENIGGLPALKSMASVFRQVRFCPTGGIT
PTSAPKYLENPSILCVGGSWVVPAGKPDVAKITALAKEASAFKRAAVA
>Q00593 1.1.99.-~~~alkJ~~~Alcohol dehydrogenase [acceptor]~~~
MYDYIIVGAGSAGCVLANRLSADPSKRVCLLEAGPRDTNPLIHMPLGIALLSNSKKLNWAFQTAPQQNLNGRSLFWPRGK
TLGGSSSINAMVYIRGHEDDYHAWEQAAGRYWGWYRALELFKRLECNQRFDKSEHHGVDGELAVSDLKYINPLSKAFVQA
GMEANINFNGDFNGEYQDGVGFYQVTQKNGQRWSSARAFLHGVLSRPNLDIITDAHASKILFEDRKAVGVSYIKKNMHHQ
VKTTSGGEVLLSLGAVGTPHLLMLSGVGAAAELKEHGVSLVHDLPEVGKNLQDHLDITLMCAANSREPIGVALSFIPRGV
SGLFSYVFKREGFLTSNVAESGGFVKSSPDRDRPNLQFHFLPTYLKDHGRKIAGGYGYTLHICDLLPKSRGRIGLKSANP
LQPPLIDPNYLSDHEDIKTMIAGIKIGRAILQAPSMAKHFKHEVVPGQAVKTDDEIIEDIRRRAETIYHPVGTCRMGKDP
ASVVDPCLKIRGLANIRVVDASIMPHLVAGNTNAPTIMIAENAAEIIMRNLDVEALEASAEFAREGAELELAMIAVCM
>Q00594 6.2.1.2~~~alkK~~~Medium-chain-fatty-acid--CoA ligase~~~
MLGQMMRNQLVIGSLVEHAARYHGAREVVSVETSGEVTRSCWKEVELRARKLASALGKMGLTPSDRCATIAWNNIRHLEV
YYAVSGAGMVCHTINPRLFIEQITYVINHAEDKVVLLDDTFLPIIAEIHGSLPKVKAFVLMAHNNSNASAQMPGLIAYED
LIGQGDDNYIWPDVDENEASSLCYTSGTTGNPKGVLYSHRSTVLHSMTTAMPDTLNLSARDTILPVVPMFHVNAWGTPYS
AAMVGAKLVLPGPALDGASLSKLIASEGVSIALGVPVVWQGLLAAQAGNGSKSQSLTRVVVGGSACPASMIREFNDIYGV
EVIHAWGMTELSPFGTANTPLAHHVDLSPDEKLSLRKSQGRPPYGVELKIVNDEGIRLPEDGRSKGNLMARGHWVIKDYF
HSDPGSTLSDGWFSTGDVATIDSDGFMTICDRAKDIIKSGGEWISTVELESIAIAHPHIVDAAVIAARHEKWDERPLLIA
VKSPNSELTSGEVCNYFADKVARWQIPDAAIFVEELPRNGTGKILKNRLREKYGDILLRSSSSVCE
>Q00595 ~~~alkL~~~Outer membrane protein AlkL~~~
MSFSNYKVIAMPVLVANFVLGAATAWANENYPAKSAGYNQGDWVASFNFSKVYVGEELGDLNVGGGALPNADVSIGNDTT
LTFDIAYFVSSNIAVDFFVGVPARAKFQGEKSISSLGRVSEVDYGPAILSLQYHYDSFERLYPYVGVGVGRVLFFDKTDG
ALSSFDIKDKWAPAFQVGLRYDLGNSWMLNSDVRYIPFKTDVTGTLGPVPVSTKIEVDPFILSLGASYVF
>Q0VKZ4 ~~~alkS~~~HTH-type transcriptional regulator AlkS~~~COG2909
MLCDNLENWCLRMSDRMWARQRFITQGCIERGRLKASAESESKVILYRAPLGYGKSVQVAFEAGTQGREEGGVAYINTRS
YPGSGEITDSLLAALILYQIKGREPWSVIQSNDDIIESLRDTLCNAKNPIKICIDGIGEAEHGVGLVENLICETPNNVKF
YIAPSNAGALARLSMMAGVVTYGAHDLVFTEEEVCELPGMHSAKAKNVIEATGGWPALVGLMCHTSNPNLPAATWPETRS
YFRNNLLNALPNNTREFICKAAMLEEISVACYDYVYKTEEAHKEIPFINENYALFTPTESSRECMVMHPVLREYLRGLFD
SIQRERRSYVLKRVAFWHWRRGEYLHSINAAQEASDHSWARAVSDSIILDVALRQGEIEVLRTWFEKVPVRTIKKIASLS
ISYAWILYFSQQARQAEKILASSTESCSRGLDNLDEKGWRKLVDAVGKATQDQFAQSQTLCQQWIDTFGERNMVGKGAAL
TCQAYIASSDRRFEDLEQLLHRGAVANQSSNQHYAFVWLKTAELQAEIFKGDIAHAMSILLEANKTAEKMGVSKTFLNKM
LGSLELQILHEKSPSLISYESAEESFNFALNYGVTDILWGCTQTFSSFLYQQGLRDRAMAILEQTRIAACERDLPRLNML
AKIQLAEFTLINDEELEPPILPDESELTFLPNQNQAIRARIALVNSMYRLRLGKQFGVAEKYAKKALQSASAISDARTKI
AAQYCQALAVFGLGSSKLAKRTIIDADQLTEHLSCYFTRDWIKEALMSFSPIARDLFDTLPESAETNATKDLEVRKEEAD
ARPKLTNTQSTITIKQISLLKCVSSGMTNKEIAERLLITEDTVKWHLKKIFSELKVTNRVRAVSEARLRGLL
>P17051 ~~~alkS~~~HTH-type transcriptional regulator AlkS~~~
MKIIINNDFPVAKVGADQITTLVSAKVHSCIYRPRLSIADGAAPRVCLYRAPPGYGKTVALAFEWLRHRTAGRPAVWLSL
RASSYSEFDICAEIIEQLETFEMVKFSRVREGVSKPALLRDLASSLWQSTSNNEIETLVCLDNINHDLDLPLLHALMEFM
LNTPKNIRFAVAGNTIKGFSQLKLAGAMREYTEKDLAFSAEEAVALAEAESVLGVPEEQIETLVQEVEGWPALVVFLLKR
ELPAKHISAVVEVDNYFRDEIFEAIPERYRVFLANSSLLDFVTPDQYNYVFKCVNGVSCIKYLSTNYMLLRHVSGEPAQF
TLHPVLRNFLREITWTENPAKRSYLLKRAAFWHWRRGEYQYAIRISLRANDCRWAVSMSERIILDLSFRQGEIDALRQWL
LELPKQAWHQKPIVLISYAWVLYFSQQGARAEKLIKDLSSQSDKKNKWQEKEWLQLVLAIGKATKDEMLSSEELCNKWIS
LFGDSNAVGKGAALTCLAFIFASEYRFAELEKVLAQAQAVNKFAKQNFAFGWLYVARFQQALASGKMGWARQIITQARTD
SRAQMMESEFTSKMFDALELELHYELRCLDTSEEKLSKILEFISNHGVTDVFFSVCRAVSAWRLGRSDLNGSIEILEWAK
AHAVEKNLPRLEVMSQIEIYQRLVCQGITGINNLKTLEDHKIFSGQHSAPLKARLLLVQSLVLSRDRNFHSAAHRALLAI
QQARKINAGQLEVRGLLCLAGAQAGAGDLKKAQLNIVYAVEIAKQLQCFQTVLDEVCLIERIIPASCEAFTAVNLDQAIG
AFSLPRIVEIGKSAENKADALLTRKQIAVLRLVKEGCSNKQIATNMHVTEDAIKWHMRKIFATLNVVNRTQATIEAERQG
II
>O05892 ~~~alkX~~~HTH-type transcriptional regulator AlkX~~~COG1309
MSTPSATVAPVKRIPYAEASRALLRDSVLDAMRDLLLTRDWSAITLSDVARAAGISRQTIYNEFGSRQGLAQGYALRLAD
RLVDNVHASLDANVGNFYEAFLQGFRSFFAESAADPLVISLLTGVAKPDLLQLITTDSAPIITRASARLAPAFTDTWVAT
TDNDANVLSRAIVRLCLSYVSMPPEADHDVAADLARLITPFAERHGVINVP
>P63486 4.3.2.3~~~allA~~~Ureidoglycolate lyase~~~COG3194
MKLQVLPLSQEAFSAYGDVIETQQRDFFHINNGLVERYHDLALVEILEQDRTLISINRAQPANLPLTIHELERHPLGTQA
FIPMKGEVFVVVVALGDDKPDLSTLRAFITNGEQGVNYHRNVWHHPLFAWQRVTDFLTIDRGGSDNCDVESIPEQELCFA
>B1XFU0 4.3.2.3~~~allA~~~Ureidoglycolate lyase~~~
MKLQVLPLSQEAFSAYGDVIETQQRDFFHINNGLVERYHDLALVEILEQDCTLISINRAQPANLPLTIHELERHPLGTQA
FIPMKGEVFVVVVALGDDKPDLSTLRAFITNGEQGVNYHRNVWHHPLFAWQRVTDFLTIDRGGSDNCDVESIPEQELCFA
>P77731 4.3.2.3~~~allA~~~Ureidoglycolate lyase~~~COG3194
MKLQVLPLSQEAFSAYGDVIETQQRDFFHINNGLVERYHDLALVEILEQDCTLISINRAQPANLPLTIHELERHPLGTQA
FIPMKGEVFVVVVALGDDKPDLSTLRAFITNGEQGVNYHRNVWHHPLFAWQRVTDFLTIDRGGSDNCDVESIPEQELCFA
>P59285 4.3.2.3~~~allA~~~Ureidoglycolate lyase~~~COG3194
MRTLMIEPLTKEAFAQFGDVIETDGSDHFMINNGSTMRFHKLATVETAEPEDKAIISIFRADAQDMPLTVRMLERHPLGS
QAFIPLLGNPFLIVVAPVGDAPVSGLVRAFRSNGRQGVNYHRGVWHHPVLTIEKRDDFLVVDRSGSGNNCDEHYFTEEQM
LILNPHQ
>P63487 4.3.2.3~~~allA~~~Ureidoglycolate lyase~~~
MKLQVLPLSQEAFSAYGDVIETQQRDFFHINNGLVERYHDLALVEILEQDRTLISINRAQPANLPLTIHELERHPLGTQA
FIPMKGEVFVVVVALGDDKPDLSTLRAFITNGEQGVNYHRNVWHHPLFAWQRVTDFLTIDRGGSDNCDVESIPEQELCFA
>O32137 3.5.2.5~~~allB~~~Allantoinase~~~COG0044
MAYDMVIKGAKAVTPDGVIEADIVVQNGVIAEIGSDIEASGTEIIQADGKYVFPGVIDCHVHFNEPGREDWEGFETGSQM
MAAGGCTTYFDMPLNCIPSTVTAEHLLAKAELGRQKSAVDFALWGGLVPGHIEDIRPMAEAGAIGFKAFLSKSGTDEFRS
VDERTLLKGMAEIAAAGKILALHAESDAITSYLQMVLANKGKVDADAYAASRPEEAEVEAVYRTIQYAKVTGCPVHFVHI
STAKAVRLIREAKQEGLDVSVETCPHYVLFSHDDLRQRGSVAKCAPPLRSRQSKETLIETLIAGDIDMVSSDHSPCRPSL
KREDNMFLSWGGISGGQFTLLGMLELALEHQIPFETIAEWTAAAPAKRFGLQKKGRLEAGCDADFVLVSMEPYTVTRESM
FAKHKKSIYEGHTFPCSISATYSKGRCVYNDGEKVTEIDGALVVPS
>P77671 3.5.2.5~~~allB~~~Allantoinase~~~COG0044
MSFDLIIKNGTVILENEARVVDIAVKGGKIAAIGQDLGDAKEVMDASGLVVSPGMVDAHTHISEPGRSHWEGYETGTRAA
AKGGITTMIEMPLNQLPATVDRASIELKFDAAKGKLTIDAAQLGGLVSYNIDRLHELDEVGVVGFKCFVATCGDRGIDND
FRDVNDWQFFKGAQKLGELGQPVLVHCENALICDELGEEAKREGRVTAHDYVASRPVFTEVEAIRRVLYLAKVAGCRLHV
CHVSSPEGVEEVTRARQEGQDVTCESCPHYFVLDTDQFEEIGTLAKCSPPIRDLENQKGMWEKLFNGEIDCLVSDHSPCP
PEMKAGNIMKAWGGIAGLQSCMDVMFDEAVQKRGMSLPMFGKLMATNAADIFGLQQKGRIAPGKDADFVFIQPNSSYVLT
NDDLEYRHKVSPYVGRTIGARITKTILRGDVIYDIEQGFPVAPKGQFILKHQQ
>Q9KAH8 3.5.2.5~~~allB~~~Allantoinase~~~COG0044
MKRFDLIIRSSTVVTETTTYRADVAIRNGIVSAITEPGSISSDDGPAIDGTGLHLFPGMVDVHVHFNEPGRTEWEGFASG
SKSLAAGGVTTYFDMPLNSNPPTITREELDKKRQLANEKSLVDYRFWGGLVPGNIDHLQDLHDGGVIGFKAFMSECGTDD
FQFSHDETLLKGMKKIAALGSILAVHAESNEMVNALTTIAIEEQRLTVKDYSEARPIVSELEAVERILRFAQLTCCPIHI
CHVSSRKVLKRIKQAKGEGVNVSVETCPHYLLFSLDEFAEIGYLAKCAPPLRERQEVEDLWDGLMAGEIDLISSDHSPSL
PQMKTGKTIFEVWGGIAGCQNTLAVMLTEGYHKRKMPLTQIVQLLSTEPAKRFGLYPQKGTIQVGAEASFTLIDLNESYT
LNASDLYYRHPISPYVGQRFRGKVKHTICQGKHVYQDH
>O32149 3.5.3.9~~~pucF~~~Allantoate amidohydrolase~~~COG0624
MEKQKLSVKNSITDYIEWLAQYGASADGGVTRLLYTKEWMDAQLAVKTEMSSFGLETRFDDVGNVFGRLSGTQSPDEVIV
TGSHIDTVINGGKYDGAYGVLAAMLALKQLKETYGAPKKTLEAVSLCEEEGSRFPMTYWGSGNMTGVFSEQDAKEPRDES
GVSLQTAMHESGFGKGVFQSAYRTDISAFVELHIEQGKTLEMSGRDLGIVTSIAGQRRYLVTLEGECNHAGTTSMKWRKD
PLAASSRIIHELLLRSDELPDELRLTCGKITAEPNVANVIPGRVQFSIDIRHQHQHVLEQFHQDMVALINGICLQKGIRA
VIDEYMRIEPVPMDERLKAAAFETALENGFSCEEMVSGAGHDAQMIGRRYPACMLFVPSRGGVSHSPKEYTSARQLEIGV
RALTDLLYKLAY
>P77425 3.5.3.9~~~allC~~~Allantoate amidohydrolase~~~COG0624
MITHFRQAIEETLPWLSSFGADPAGGMTRLLYSPEWLETQQQFKKRMAASGLETRFDEVGNLYGRLNGTEYPQEVVLSGS
HIDTVVNGGNLDGQFGALAAWLAIDWLKTQYGAPLRTVEVVAMAEEEGSRFPYVFWGSKNIFGLANPDDVRNICDAKGNS
FVDAMKACGFTLPNAPLTPRQDIKAFVELHIEQGCVLESNGQSIGVVNAIVGQRRYTVTLNGESNHAGTTPMGYRRDTVY
AFSRICHQSVEKAKRMGDPLVLTFGKVEPRPNTVNVVPGKTTFTIDCRHTDAAVLRDFTQQLENDMRAICDEMDIGIDID
LWMDEEPVPMNKELVATLTELCEREKLNYRVMHSGAGHDAQIFAPRVPTCMIFIPSINGISHNPAERTNITDLAEGVKTL
ALMLYQLAWQK
>P77555 1.1.1.350~~~allD~~~Ureidoglycolate dehydrogenase (NAD(+))~~~COG2055
MKISRETLHQLIENKLCQAGLKREHAATVAEVLVYADARGIHSHGAVRVEYYAERISKGGTNREPEFRLEETGPCSAILH
ADNAAGQVAAKMGMEHAIKTAQQNGVAVVGISRMGHSGAISYFVQQAARAGFIGISMCQSDPMVVPFGGAEIYYGTNPLA
FAAPGEGDEILTFDMATTVQAWGKVLDARSRNMSIPDTWAVDKNGVPTTDPFAVHALLPAAGPKGYGLMMMIDVLSGVLL
GLPFGRQVSSMYDDLHAGRNLGQLHIVINPNFFSSSELFRQHLSQTMRELNAITPAPGFNQVYYPGQDQDIKQRKAAVEG
IEIVDDIYQYLISDALYNTSYETKNPFAQ
>P75713 3.5.3.26~~~allE~~~(S)-ureidoglycine aminohydrolase~~~COG3257
MGYLNNVTGYREDLLANRAIVKHGNFALLTPDGLVKNIIPGFENCDATILSTPKLGASFVDYLVTLHQNGGNQQGFGGEG
IETFLYVISGNITAKAEGKTFALSEGGYLYCPPGSLMTFVNAQAEDSQIFLYKRRYVPVEGYAPWLVSGNASELERIHYE
GMDDVILLDFLPKELGFDMNMHILSFAPGASHGYIETHVQEHGAYILSGQGVYNLDNNWIPVKKGDYIFMGAYSLQAGYG
VGRGEAFSYIYSKDCNRDVEI
>P94575 ~~~pucI~~~Allantoin permease~~~COG1953
MKLKESQQQSNRLSNEDLVPLGQEKRTWKAMNFASIWMGCIHNIPTYATVGGLIAIGLSPWQVLAIIITASLILFGALAL
NGHAGTKYGLPFPVIIRASYGIYGANIPALLRAFTAIMWLGIQTFAGSTALNILLLNMWPGWGEIGGEWNILGIHLSGLL
SFVFFWAIHLLVLHHGMESIKRFEVWAGPLVYLVFGGMVWWAVDIAGGLGPIYSQPGKFHTFSETFWPFAAGVTGIIGIW
ATLILNIPDFTRFAETQKEQIKGQFYGLPGTFALFAFASITVTSGSQVAFGEPIWDVVDILARFDNPYVIVLSVITLCIA
TISVNVAANIVSPAYDIANALPKYINFKRGSFITALLALFTVPWKLMESATSVYAFLGLIGGMLGPVAGVMMADYFIIRK
RELSVDDLYSETGRYVYWKGYNYRAFAATMLGALISLIGMYVPVLKSLYDISWFVGVLISFLFYIVLMRVHPPASLAIET
VEHAQVRQAE
>P75712 ~~~ybbW~~~Putative allantoin permease~~~COG1953
MEHQRKLFQQRGYSEDLLPKTQSQRTWKTFNYFTLWMGSVHNVPNYVMVGGFFILGLSTFSIMLAIILSAFFIAAVMVLN
GAAGSKYGVPFAMILRASYGVRGALFPGLLRGGIAAIMWFGLQCYAGSLACLILIGKIWPGFLTLGGDFTLLGLSLPGLI
TFLIFWLVNVGIGFGGGKVLNKFTAILNPCIYIVFGGMAIWAISLVGIGPIFDYIPSGIQKAENGGFLFLVVINAVVAVW
AAPAVSASDFTQNAHSFREQALGQTLGLVVAYILFAVAGVCIIAGASIHYGADTWNVLDIVQRWDSLFASFFAVLVILMT
TISTNATGNIIPAGYQIAAIAPTKLTYKNGVLIASIISLLICPWKLMENQDSIYLFLDIIGGMLGPVIGVMMAHYFVVMR
GQINLDELYTAPGDYKYYDNGFNLTAFSVTLVAVILSLGGKFIHFMEPLSRVSWFVGVIVAFAAYALLKKRTTAEKTGEQ
KTIG
>P0ACN4 ~~~allR~~~HTH-type transcriptional repressor AllR~~~COG1414
MTEVRRRGRPGQAEPVAQKGAQALERGIAILQYLEKSGGSSSVSDISLNLDLPLSTTFRLLKVLQAADFVYQDSQLGWWH
IGLGVFNVGAAYIHNRDVLSVAGPFMRRLMLLSGETVNVAIRNGNEAVLIGQLECKSMVRMCAPLGSRLPLHASGAGKAL
LYPLAEEELMSIILQTGLQQFTPTTLVDMPTLLKDLEQARELGYTVDKEEHVVGLNCIASAIYDDVGSVVAAISISGPSS
RLTEDRFVSQGELVRDTARDISTALGLKAHP
>P0ACR0 ~~~allS~~~HTH-type transcriptional activator AllS~~~COG0583
MFDPETLRTFIAVAETGSFSKAAERLCKTTATISYRIKLLEENTGVALFFRTTRSVTLTAAGEHLLSQARDWLSWLESMP
SELQQVNDGVERQVNIVINNLLYNPQAVAQLLAWLNERYPFTQFHISRQIYMGVWDSLLYEGFSLAIGVTGTEALANTFS
LDPLGSVQWRFVMAADHPLANVEEPLTEAQLRRFPAVNIEDSARTLTKRVAWRLPGQKEIIVPDMETKIAAHLAGVGIGF
LPKSLCQSMIDNQQLVSRVIPTMRPPSPLSLAWRKFGSGKAVEDIVTLFTQRRPEISGFLEIFGNPRS
>Q9KJX5 3.1.3.1~~~pafA~~~Alkaline phosphatase PafA~~~COG1524
MLTPKKWLLGVLVVSGMLGAQKTNAVPRPKLVVGLVVDQMRWDYLYRYYSKYGEGGFKRMLNTGYSLNNVHIDYVPTVTA
IGHTSIFTGSVPSIHGIAGNDWYDKELGKSVYCTSDETVQPVGTTSNSVGQHSPRNLWSTTVTDQLGLATNFTSKVVGVS
LKDRASILPAGHNPTGAFWFDDTTGKFITSTYYTKELPKWVNDFNNKNVPAQLVANGWNTLLPINQYTESSEDNVEWEGL
LGSKKTPTFPYTDLAKDYEAKKGLIRTTPFGNTLTLQMADAAIDGNQMGVDDITDFLTVNLASTDYVGHNFGPNSIEVED
TYLRLDRDLADFFNNLDKKVGKGNYLVFLSADHGAAHSVGFMQAHKMPTGFFVEDMKKEMNAKLKQKFGADNIIAAAMNY
QVYFDRKVLADSKLELDDVRDYVMTELKKEPSVLYVLSTDEIWESSIPEPIKSRVINGYNWKRSGDIQIISKDGYLSAYS
KKGTTHSVWNSYDSHIPLLFMGWGIKQGESNQPYHMTDIAPTVSSLLKIQFPSGAVGKPITEVIGR
>A1YYW7 3.1.3.1~~~phoK~~~Alkaline phosphatase PhoK~~~
MLKHVAAALLLATAMPVVAQSPAPAAAPAPAARSIAATPPKLIVAISVDQFSADLFSEYRQYYTGGLKRLTSEGAVFPRG
YQSHAATETCPGHSTILTGSRPSRTGIIANNWFDLDAKREDKNLYCAEDESQPGSSSDKYEASPLHLKVPTLGGRMKAAN
PATRVVSVAGKDRAAIMMGGATADQVWWLGGPQGYVSYKGVAPTPLVTQVNQAFAQRLAQPNPGFELPAQCVSKDFPVQA
GNRTVGTGRFARDAGDYKGFRISPEQDAMTLAFAAAAIENMQLGKQAQTDIISIGLSATDYVGHTFGTEGTESCIQVDRL
DTELGAFFDKLDKDGIDYVVVLTADHGGHDLPERHRMNAMPMEQRVDMALTPKALNATIAEKAGLPGKKVIWSDGPSGDI
YYDKGLTAAQRARVETEALKYLRAHPQVQTVFTKAEIAATPSPSGPPESWSLIQEARASFYPSRSGDLLLLLKPRVMSIP
EQAVMGSVATHGSPWDTDRRVPILFWRKGMQHFEQPLGVETVDILPSLAALIKLPVPKDQIDGRCLDLVAGKDDSCAGQ
>Q55320 3.1.3.1~~~phoV~~~Alkaline phosphatase PhoV~~~
MKIKLLCISLAVLFCSSANAQKKQAKVQPSVFPQTVARPKLVVGMVIDQMRWDYLYRFYARYGNGGFKRLINEGFSAENT
LIPYTPTLTACGHSSIYTGSVPAINGIIGNNWFDPQLGRDVYCVEDKSVKTVGSSSNEGLMSPKNLLVTTVTDELRMATN
FRSKVISVSIKDRGAILPGGHTANGAYWYDDMTGSFISSTHYMQQLPTWVNDFNAQRLPNKYFEQDWNTLYPIETYTEST
ADAKPYERTFKGAKTSSFPHLFKQYANKNYSMMASMPQGNSFTLEFAKAAIPAEKLGQTGNTDFLAVSLSSTDYVGHQFG
PNSIELEDTYLRLDKDLEDFFNYLDKTIGKGNYLLFLTADHGATHVPGFLRNKMPGGRLLLKVQTDLDSLIFNEFKVRCN
FTIINNQVIFDTDAIKEAKADYAKIKQSTIDYLVKQDGVLNAVDIKNMGAVTIPQEIKNKIINGYNARRSGDVYIILDAG
WYPTLTPGTGHAAWNPYDSHIPALFMGWGVKPGKTNKEYYMSDIAPTVSALLHIQQPSGSIGKVITDLLK
>Q5NNZ8 3.1.3.1~~~phoD~~~Alkaline phosphatase PhoD~~~COG1524
MNSLLHHSFLKTVFSSLAIAIVTSSLSSVTIAATHPLDNHPKGEIAASSETAHNPWSGTRLIVAISVDQFSSDLFSEYRG
RFRSGMKQLQNGVVYPMAYHSHAATETCPGHSVLLTGDHPARTGIIANNWYDFSVKRADKKVYCSEDPSLSADPQNYQPS
VHYLKVPTLGDRMKKANPHSRVISVAGKDRAAIMMGGHMTDQIWFWSDNAYKTLADHKGEMPVTVKTVNEQVTRFMQQDE
APVMPSVCADHASALKIGNNRIIGLAPASRKAGDFKTFRVTPDYDRTTTDIAIGLIDELKLGHGNAPDLLTVSLSATDAV
GHAYGTEGAEMCSQMAGLDDNIARIIAALDSNGVPYVLVLTADHGGQDVPERAKLRGVETAQRVDPALSPDQLSLRLAER
FQLSHNQPLFFANEPQGDWYINRNLPEQTKAQLIQAAKSELSNHPQVAAVFTASELTHIPYPTRSPELWNLAERAKASFD
PLRSGDLIVLLKPRVTPIAKPVSYVATHGSAWDYDRRVPIIFYTPHASGFEQPMPVETVDIMPSLAALLQIPLRKGEVDG
RCLDLDPTEATTCPVK
>Q8RAK6 5.1.1.1~~~alr1~~~Alanine racemase 1~~~COG0787
MKFDGVRPTRVEVYLDAITHNFREIKKIVGKNVKIMAVIKGDAYGHGASYVAKFLEKEGVDYFGVATTEEALELREKGIK
TPILIFGYTPPTQLRQIVKHDLTQTVYDIKYAKELEKESLKQNKRAKVHIKIDTGLGRIGYIDFDLAQKEILEMANMRGL
ILEGIYSHFAAASEDDRDYCKEQFDKFMNLISSLEKKRLKIPLKHIANAAAILNLNYSHLDMVRPGIILFGAYPSKRVER
KVELRETLRFTTRVVHLKDVPAGFFIGYGKSFVTKRKSVIATIPVGYADGLDRRLSNNYKLLLKGKYVPIVGRVCMDQCM
IDVTDVEGVEIGDEVVIIGTQNNETVSVESMADKIETIPQEVFSRISRRVPRVYFYDGIKIGEVNYLK
>P0A6B4 5.1.1.1~~~alr~~~Alanine racemase, biosynthetic~~~COG0787
MQAATVVINRRALRHNLQRLRELAPASKMVAVVKANAYGHGLLETARTLPDADAFGVARLEEALRLRAGGITKPVLLLEG
FFDARDLPTISAQHFHTAVHNEEQLAALEEASLDEPVTVWMKLDTGMHRLGVRPEQAEAFYHRLTQCKNVRQPVNIVSHF
ARADEPKCGATEKQLAIFNTFCEGKPGQRSIAASGGILLWPQSHFDWVRPGIILYGVSPLEDRSTGADFGCQPVMSLTSS
LIAVREHKAGEPVGYGGTWVSERDTRLGVVAMGYGDGYPRAAPSGTPVLVNGREVPIVGRVAMDMICVDLGPQAQDKAGD
PVILWGEGLPVERIAEMTKVSAYELITRLTSRVAMKYVD
>Q9HUN4 5.1.1.1~~~alr~~~Alanine racemase, biosynthetic~~~
MRPLVATVDLSAIRHNYALAKRCAPQRQAFAVVKANAYGHGAREVVTALHDDADGFAVACLEEAAEVRALHASARILLLE
GCFEASEYALAGQLRLDLVIQGAEQGEAFLAAGLDIPLNVWLKLDSGMHRLGFDPAALRAWHARLRSHPGVRELNLISHF
ACADERNHPLTEQQLESFLGLLDLDFDQRSLANSAAVLTIPAAHMDWLRPGIMLYGSTPLADLSAAELGLKPAMSLGAQL
ISLREVAVGESVGYGATWIAERPARIGTVSCGYADGYPRTAPAGTPVLVGGRRAILAGRVSMDMLAVDLSDLPEARVGDP
VELWGAGLSVDEVARACGTLGYELLSKVTARVPRRYSH
>P0A1A4 5.1.1.1~~~alr~~~Alanine racemase, biosynthetic~~~COG0787
MQAATVVINRRALRHNLQRLRELAPASKLVAVVKANAYGHGLLETARTLPDADAFGVARLEEALRLRAGGITQPILLLEG
FFDAADLPTISAQCLHTAVHNQEQLAALEAVELAEPVTVWMKLDTGMHRLGVRPEEAEAFYQRLTHCKNVRQPVNIVSHF
ARADEPECGATEHQLDIFNAFCQGKPGQRSIAASGGILLWPQSHFDWARPGIILYGVSPLEHKPWGPDFGFQPVMSLTSS
LIAVRDHKAGEPVGYGGTWVSERDTRLGVVAMGYGDGYPRAAPSGTPVLVNGREVPIVGRVAMDMICVDLGPNAQDNAGD
PVVLWGEGLPVERIAEMTKVSAYELITRLTSRVAMKYID
>P0A1A3 5.1.1.1~~~alr~~~Alanine racemase, biosynthetic~~~
MQAATVVINRRALRHNLQRLRELAPASKLVAVVKANAYGHGLLETARTLPDADAFGVARLEEALRLRAGGITQPILLLEG
FFDAADLPTISAQCLHTAVHNQEQLAALEAVELAEPVTVWMKLDTGMHRLGVRPEEAEAFYQRLTHCKNVRQPVNIVSHF
ARADEPECGATEHQLDIFNAFCQGKPGQRSIAASGGILLWPQSHFDWARPGIILYGVSPLEHKPWGPDFGFQPVMSLTSS
LIAVRDHKAGEPVGYGGTWVSERDTRLGVVAMGYGDGYPRAAPSGTPVLVNGREVPIVGRVAMDMICVDLGPNAQDNAGD
PVVLWGEGLPVERIAEMTKVSAYELITRLTSRVAMKYID
>P0A6B6 5.1.1.1~~~alr~~~Alanine racemase, biosynthetic~~~
MQAATVVINRRALRHNLQRLRELAPASKMVAVVKANAYGHGLLETARTLPDADAFGVARLEEALRLRAGGITKPVLLLEG
FFDARDLPTISAQHFHTAVHNEEQLAALEEASLDEPVTVWMKLDTGMHRLGVRPEQAEAFYHRLTQCKNVRQPVNIVSHF
ARADEPKCGATEKQLAIFNTFCEGKPGQRSIAASGGILLWPQSHFDWVRPGIILYGVSPLEDRSTGADFGCQPVMSLTSS
LIAVREHKAGEPVGYGGTWVSERDTRLGVVAMGYGDGYPRAAPSGTPVLVNGREVPIVGRVAMDMICVDLGPQAQDKAGD
PVILWGEGLPVERIAEMTKVSAYELITRLTSRVAMKYVD
>Q932V0 5.1.1.1~~~alr~~~Alanine racemase, biosynthetic~~~
MQAATVVINRRALRHNLQRLRELAPASKMVAVVKANAYGHGLLETARTLPDADAFGVARLEEALRLRAGGITKPVLLLEG
FFDARDLPTISAQHFHTAVHNEEQLAALEEASLDEPVTVWMKLDTGMHRLGVRPEQAGAFYHRLTQCKNVRQPVNIVSHF
ARADEPKCGATEKQLAIFNTFCEGKPGQRSIAASGGILLWPQSHFDWVRPGIILYGVSPLEDRSTGADFGCQPVMSLTSS
LIAVREHKAGEPVGYGGTWVSERDTRLGVVAMGYGDGYPRAAPSGTPVLVNGREVPIVGRVAMDMICVDLGPQAQDKAGD
PVILWGEGLPVERIAEMTKVSAYELITRLTSRVAMKYVD
>P0A6B5 5.1.1.1~~~alr~~~Alanine racemase, biosynthetic~~~
MQAATVVINRRALRHNLQRLRELAPASKMVAVVKANAYGHGLLETARTLPDADAFGVARLEEALRLRAGGITKPVLLLEG
FFDARDLPTISAQHFHTAVHNEEQLAALEEASLDEPVTVWMKLDTGMHRLGVRPEQAEAFYHRLTQCKNVRQPVNIVSHF
ARADEPKCGATEKQLAIFNTFCEGKPGQRSIAASGGILLWPQSHFDWVRPGIILYGVSPLEDRSTGADFGCQPVMSLTSS
LIAVREHKAGEPVGYGGTWVSERDTRLGVVAMGYGDGYPRAAPSGTPVLVNGREVPIVGRVAMDMICVDLGPQAQDKAGD
PVILWGEGLPVERIAEMTKVSAYELITRLTSRVAMKYVD
>Q93HP9 5.1.1.1~~~alr~~~Alanine racemase, biosynthetic~~~
MQAATVVINRRALRHNLQRLRELAPASKMVAVVKANAYGHGLLETARTLPDADAFGVARLEEALRLRAGGITKPVLLLEG
FFDARDLPTISAQHFHTAVHNEEQLAALEEASLDEPVTVWMKLDTGMHRLGVRPEQAEAFYHRLTQCKNVRQPVNIVSHF
ARADEPKCGATEKQLAIFNTFCEGKPGQRSIAASGGILLWPQSHFDWVRPGIILYGVSPLEDRSIGADFGCQPVMSLTSS
LIAVREHKAGEPVGYGGTWVSERDTRLGVVAMGYGDGYPRAAPSGTPVLVNGREVPIVGRVAMDMICVDLGPQAQDKAGD
PVILWGEGLPVERIAEMTKVSAYELITRLTSRVAMKYVD
>Q5HED1 5.1.1.1~~~alr1~~~Alanine racemase 1~~~
MSDKYYRSAYMNVDLNAVASNFKVFSTLHPNKTVMAVVKANAYGLGSVKVARHLMENGATFFAVATLDEAIELRMHGITA
KILVLGVLPAKDIDKAIQHRVALTVPSKQWLKEAIKNISGEQEKKLWLHIKLDTGMGRLGIKDTKTYQEVIEIIQQYEQL
VFEGVFTHFACADEPGDMTTEQYQRFKDMVNEAIKPEYIHCQNSAGSLLMDCQFCNAIRPGISLYGYYPSEYVQQKVKVH
LKPSVQLIANVVQTKTLQAGESVSYGATYTATDPTTIALLPIGYADGYLRIMQGSFVNVNGHQCEVIGRVCMDQTIVKVP
DQVKAGDSVILIDNHRESPQSVEVVAEKQHTINYEVLCNLSRRLPRIYHDGDQRFVTNELLK
>P63479 5.1.1.1~~~alr1~~~Alanine racemase 1~~~
MSDKYYRSAYMNVDLNAVASNFKVFSTLHPNKTVMAVVKANAYGLGSVKVARHLMENGATFFAVATLDEAIELRMHGITA
KILVLGVLPAKDIDKAIQHRVALTVPSKQWLKEAIKNISGEQEKKLWLHIKLDTGMGRLGIKDTNTYQEVIEIIQQYEQL
VFEGVFTHFACADEPGDMTTEQYQRFKDMVNEAIKPEYIHCQNSAGSLLMDCQFCNAIRPGISLYGYYPSEYVQQKVKVH
LKPSVQLIANVVQTKTLQAGESVSYGATYTATDPTTIALLPIGYADGYLRIMQGSFVNVNGHQCEVIGRVCMDQTIVKVP
DQVKAGDSVILIDNHRESPQSVEVVAEKQHTINYEVLCNLSRRLPRIYHDGDQRFVTNELLK
>P63480 5.1.1.1~~~alr1~~~Alanine racemase 1~~~
MSDKYYRSAYMNVDLNAVASNFKVFSTLHPNKTVMAVVKANAYGLGSVKVARHLMENGATFFAVATLDEAIELRMHGITA
KILVLGVLPAKDIDKAIQHRVALTVPSKQWLKEAIKNISGEQEKKLWLHIKLDTGMGRLGIKDTNTYQEVIEIIQQYEQL
VFEGVFTHFACADEPGDMTTEQYQRFKDMVNEAIKPEYIHCQNSAGSLLMDCQFCNAIRPGISLYGYYPSEYVQQKVKVH
LKPSVQLIANVVQTKTLQAGESVSYGATYTATDPTTIALLPIGYADGYLRIMQGSFVNVNGHQCEVIGRVCMDQTIVKVP
DQVKAGDSVILIDNHRESPQSVEVVAEKQHTINYEVLCNLSRRLPRIYHDGDQRFVTNELLK
>P94494 5.1.1.1~~~alr2~~~Alanine racemase 2~~~COG0787
MIKLCREVWIEVNLDAVKKNLRAIRRHIPHKSKIMAVVKANGYGHGSIEVARHALEHGASELAVASVEEGIVLRKAGITA
PILVLGFTSLSCVKKSAAWNITLSAFQVDWMKEANEILEKEASANRLAIHINVDTGMGRLGVRTKEELLEVVKALKASKF
LRWTGIFTHFSTADEPDTTLTKLQHEKFISFLSFLKKQGIELPTVHMCNTAAAIAFPEFSADMIRLGIGLYGLYPSAYIK
QLNLVKLEPALSLKARIAYVKTMRTEPRTVSYGATYIAEPNEVIATLPIGYADGYSRALSNRGFVLHRGKRVPVAGRVTM
DMIMVSLGENGEGKQGDEVVIYGKQKGAEISVDEVAEMLNTINYEVVSTLSRRIPRFYIRDGEIFKVSTPVLYV
>P29012 5.1.1.1~~~dadX~~~Alanine racemase, catabolic~~~COG0787
MTRPIQASLDLQALKQNLSIVRQAATHARVWSVVKANAYGHGIERIWSAIGATDGFALLNLEEAITLRERGWKGPILMLE
GFFHAQDLEIYDQHRLTTCVHSNWQLKALQNARLKAPLDIYLKVNSGMNRLGFQPDRVLTVWQQLRAMANVGEMTLMSHF
AEAEHPDGISGAMARIEQAAEGLECRRSLSNSAATLWHPEAHFDWVRPGIILYGASPSGQWRDIANTGLRPVMTLSSEII
GVQTLKAGERVGYGGRYTARDEQRIGIVAAGYADGYPRHAPTGTPVLVDGVRTMTVGTVSMDMLAVDLTPCPQAGIGTPV
ELWGKEIKIDDVAAAAGTVGYELMCALALRVPVVTV
>Q9HTQ2 5.1.1.1~~~dadX~~~Alanine racemase, catabolic~~~
MRPARALIDLQALRHNYRLAREATGARALAVIKADAYGHGAVRCAEALAAEADGFAVACIEEGLELREAGIRQPILLLEG
FFEASELELIVAHDFWCVVHCAWQLEAIERASLARPLNVWLKMDSGMHRVGFFPEDFSAAHERLRASGKVAKIVMMSHFS
RADELDCPRTEEQLAAFAAASQGLEGEISLRNSPAVLGWPKVPSDWVRPGILLYGATPFERAHPLADRLRPVMTLESKVI
SVRDLPAGEPVGYGARYSTERSQRIGVVAMGYADGYPRHAADGTLVFIDGKPGRLVGRVSMDMLTVDLTDHPQAGLGSRV
ELWGPNVPVGALAAQFGSIPYQLLCNLKRVPRVYSGA
>P06191 5.1.1.1~~~dadX~~~Alanine racemase, catabolic~~~
MTRPIQASLDLQVMKQNLAIVRRAAPEARVWSVVKANAYGHGIERVWSALGATDGFAMLNLEEAITLRERGWKGPILMLE
GFFHAQDLEAYDTYRLTTCIHSNWQLKALQNARLNAPLDIYVKVNSGMNRLGFQPERAQTVWQQLRAMRNVGEMTLMSHF
AQADHPEGIGEAMRRIALATEGLQCAYSLSNSAATLWHPQAHYDWVRPGIILYGASPSGQWRDIADTGLKPVMTLSSEII
GVQTLSAGERVGYGGGYSVTQEQRIGIVAAGYADGYPRHAPTGTPVLVDGIRTRTVGTVSMDMLAVDLTPCPQAGIGTPV
ELWGKEIKVDDVASAAGTLGYELLCAVAPRVPFVTT
>A0KH11 5.1.1.1~~~alr-1~~~Alanine racemase~~~COG0787
MKAAIAQINTAALRHNLAVVKRHAPQCKIIAVVKANAYGHGLLPVARTLVDADAYAVARIEEALMLRSCAVVKPIVLLEG
FFSAADLPVLAANNLQTAVHTWEQLEALEQADLPAPVVAWLKLDTGMHRLGVRADEMPAFIERLAKCKNVVQPFNIMTHF
SRSDELEQPTTREQIDLFSQLTAPLLGERAMANSAGILAWPDSHCDWVRPGVILYGVSPFPNTVAADYDLQPVMTLKTQL
IAVRDHKAGEPVGYGANWVSDRDTRLGVIAIGYGDGYPRMAPNGTPVLVNGRIVPLVGRVSMDMTTVDLGPGATDKAGDE
AVLWGEGLPVERVADQIGTIPYELITKLTSRVFMEYV
>Q9RER4 5.1.1.1~~~alr~~~Alanine racemase~~~
MRRAVLEILEERIIHNVKEIHRFSGKRIIAVVKANAYGIGVREVSRILEGLEEVDAFAVACTQEGVELRECGIKKKILIL
GGILEEDVKLLEEYDLTPVISDPEHLKVLKDRNIKFHVKYDTGMGRLGFTNEIIKDPRVEGVMSHFSSPADRNFSKLQIK
RFEEILKNYEKVKYIHLESSAGLIYRVPFTTHVRVGLAIYGEKPLKDYPLEVKPALRLRARLISVKELPENYPVSYGRTY
ITKRKTKLGVVAFGYADGLMKTLSNRSFLIFEGRKVPIIGNITMDMTMVDLSGTEARTGDWVYIVNEERSFTPLARDAGT
IPYEIMCNLSRRVERLVIKKR
>Q8RSU9 5.1.1.1~~~alr~~~Alanine racemase~~~COG0787
MNLLTTKIDLDAIAHNTRVLKQMAGPAKLMAVVKANAYNHGVEKVAPVIAAHGADAFGVATLAEAMQLRDIGISQEVLCW
IWTPEQDFRAAIDRNIDLAVISPAHAKALIETDAEHIRVSIKIDSGLHRSGVDEQEWEGVFSALAAAPHIEVTGMFTHLA
CADEPENPETDRQIIAFRRALALARKHGLECPVNHVCNSPAFLTRSDLHMEMVRPGLAFYGLEPVAGLEHGLKPAMTWEA
KVSVVKQIEAGQGTSYGLTWRAEDRGFVAVVPAGYADGMPRHAQGKFSVTIDGLDYPQVGRVCMDQFVISLGDNPHGVEA
GAKAVIFGENGHDATDFAERLDTINYEVVCRPTGRTVRAYV
>P10724 5.1.1.1~~~alr~~~Alanine racemase~~~
MNDFHRDTWAEVDLDAIYDNVENLRRLLPDDTHIMAVVKANAYGHGDVQVARTALEAGASRLAVAFLDEALALREKGIEA
PILVLGASRPADAALAAQQRIALTVFRSDWLEEASALYSGPFPIHFHLKMDTGMGRLGVKDEEETKRIVALIERHPHFVL
EGLYTHFATADEVNTDYFSYQYTRFLHMLEWLPSRPPLVHCANSAASLRFPDRTFNMVRFGIAMYGLAPSPGIKPLLPYP
LKEAFSLHSRLVHVKKLQPGEKVSYGATYTAQTEEWIGTIPIGYADGWLRRLQHFHVLVDGQKAPIVGRICMDQCMIRLP
GPLPVGTKVTLIGRQGDEVISIDDVARHLETINYEVPCTISYRVPRIFFRHKRIMEVRNAIGRGESSA
>Q9RLU5 5.1.1.1~~~alr~~~Alanine racemase~~~COG0787
MKTSPHRNTSAIVDLKAIRNNIEKFKKHINPNAEIWPAVKADAYGHGSIEVSKAVSDLVGGFCVSNLDEAIELRNHLVTK
PILVLSGIVPEDVDIAAALNISLTAPSLEWLKLVVQEEAELSDLKIHIGVDSGMGRIGIRDVEEANQMIELADKYAINFE
GIFTHFATADMADETKFKNQQARFNKIMAGLSRQPKFIHSTNTAAALWHKEQVQAIERLGISMYGLNPSGKTLELPFEIE
PALSLVSELTHIKKIAAGETVGYGATYETSEETWIGTVPIGYADGWTRQMQGFKVLVDGKFCEIVGRVCMDQMMIKLDKS
YPLGTKVTLIGRDKANEITTTDVADWRGTINYEVLCLLSDRIKRIYK
>Q9L888 5.1.1.1~~~alr~~~Alanine racemase~~~
MAVTPISLTPGVLAEALVDLGAIEHNVRLLCEQARGAQVMAVVKADGYGHGAVQTARAALAAGAAELGVATVDEALALRA
AGISAPVLAWLHPPGIDFRPALLAGVQIGLSSQRQLDELLTAVRDTGRTATVTVKVDTGLNRNGVPPAQYPSMLTALRRA
VAEQAIVPRGLMSHMVYADQPANPVNDVQAQRFTDMLAQAREQGVRFEVAHLSNSSATMSRPDLAFDMVRPGIAVYGLSP
VPELGDMGLVPAMTVKCTVALVKSIRAGESVSYGHTWTAQRDTNLALLPVGYADGIFRSLGGRLQVSINGRRRPGVGRIC
MDQFVVDLGPGRPDVAEGDEAILFGPGSNGEPTAQDWADLLGTIHYEVVTSPRGRITRTYREAHTVES
>P9WQA9 5.1.1.1~~~alr~~~Alanine racemase~~~COG0787
MAMTPISQTPGLLAEAMVDLGAIEHNVRVLREHAGHAQLMAVVKADGYGHGATRVAQTALGAGAAELGVATVDEALALRA
DGITAPVLAWLHPPGIDFGPALLADVQVAVSSLRQLDELLHAVRRTGRTATVTVKVDTGLNRNGVGPAQFPAMLTALRQA
MAEDAVRLRGLMSHMVYADKPDDSINDVQAQRFTAFLAQAREQGVRFEVAHLSNSSATMARPDLTFDLVRPGIAVYGLSP
VPALGDMGLVPAMTVKCAVALVKSIRAGEGVSYGHTWIAPRDTNLALLPIGYADGVFRSLGGRLEVLINGRRCPGVGRIC
MDQFMVDLGPGPLDVAEGDEAILFGPGIRGEPTAQDWADLVGTIHYEVVTSPRGRITRTYREAENR
>Q9S5V6 5.1.1.1~~~alr~~~Alanine racemase~~~
MTLQNFYRDTWAEINLDAIFENAANMKKHLPPEITLFAVVKANAYGHGDVEVAETAIQAGAGYLAVAFLDEALALRKKGI
TAPILVLGASRPEDAQIAARESITLTVFQAGWLEAAQSFLEGTVLTIHLKCDSGMGRIGIRKQEEMNEIERFLQKTKCFV
LEGIFTHFATADQLDTEYFSKQLARFEEMLTWLKEKPKYVHAANSAALLRFPNAVFNSVRMGISLYGLSPSMEMKEVLPF
ALRPAFSLKTKLVHVKNISKGQSVSYGATYTAEEDTWIGTLPIGYADGWIRMLQGQEVLLEGGRSPLVGRICMDQCMVKL
SREFPVGTEVTLIGKNGTECITVDDIAEKLNTINYEVTCMISSRVPRMYLKDGRITGVVNQLI
>B9WZ64 5.1.1.1~~~alr~~~Alanine racemase~~~
MRPARALIDLQALRHNYRLARELTGAKALAVIKADAYGHGAVRCALALEAEADGFAVACIEEALELRAAGIKAPVLLLEG
FFEASELALIAEHDLWCVVHSLWQLEAIERTQLHKPLTVWLKLDSGMHRVGLHPKDYHEAYQRLLASGKVARIVLMSHFA
RADELDADATAQQIAVFEAARQGLAAECSLRNSPGVLGWPQAPSDWVRPGLMLYGATPFEVAQAEAERLQPVMTLQSRVI
SVRELPAGEPVGYGAKFVSPRPTRVGVVAMGYADGYPRQAPNGTPVLVAGKRTQLIGRVSMDMLSIDLTDVPEATVGSPV
ELWGKHVLASEVAAHAGTIPYQIFCNLKRVPRDYIGE
>O86786 5.1.1.1~~~alr~~~Alanine racemase~~~COG0787
MSETTARRDADAVLRARAEIDLAALRANVRALRERAPGAALMAVVKADAYGHGAIPCARAAVAAGATWLGTATPQEALAL
RAAEPGLPDDVRIMCWLWTPGGPWREAVEARLDVSVSAMWAMEEVTGAARAAGVPARVQLKADTGLGRGGCQPGADWERL
VGAALRAEEEGLLRVTGLWSHFACADEPGHPSIAAQLTRFREMTAYAEQRGLRPEVRHIANSPATLTLPDAHFDLVRPGI
AMYGVSPSPEIGTPADFGLRPVMTLAASLALVKQVPGGHGVSYGHHYTTPGETTLGLVPLGYADGIPRHASSSGPVLVDG
KWRTVAGRIAMDQFVVDLGGDRPEPGAEAVLFGPGDRGEPTAEDWAQAAGTIAYEIVTRIGSRVPRVYVNE
>P0A2W8 5.1.1.1~~~alr~~~Alanine racemase~~~COG0787
MKASPHRPTKALIHLGAIRQNIQQMGAHIPQGTLKLAVVKANAYGHGAVAVAKAIQDDVDGFCVSNIDEAIELRQAGLSK
PILILGVSEIEAVALAKEYDFTLTVAGLEWIQALLDKEVDLTGLTVHLKIDSGMGRIGFREASEVEQAQDLLQQHGVCVE
GIFTHFATADEESDDYFNAQLERFKTILASMKEVPELVHASNSATTLWHVETIFNAVRMGDAMYGLNPSGAVLDLPYDLI
PALTLESALVHVKTVPAGACMGYGATYQADSEQVIATVPIGYADGWTRDMQNFSVLVDGQACPIVGRVSMDQITIRLPKL
YPLGTKVTLIGSNGDKEITATQVATYRVTINYEVVCLLSDRIPREYY
>Q9KUY6 5.1.1.1~~~alr1~~~Alanine racemase~~~COG0787
MKAATAYINLEALQHNLQRVKQQAPESKIMAVVKANGYGHGLRHIARHALGADAFGVARIEEALQLRASGVVKPILLLEG
FYSPGDLPVLVTNNIQTVVHCEEQLQALEQAQLETPVMVWLKVDSGMHRLGVRPEQYQDFVARLHQCENVAKPLRYMSHF
GCADELDKSTTVEQTELFLSLTQGCQGERSLAASAGLLAWPQSQLEWVRPGIIMYGVSPFVEKSAVQLGYQPVMTLKSHL
IAVREVKAGESVGYGGTWTSQRDTKIGVIAIGYGDGYPRTAPNGTPVVVNGRRVPIAGRVSMDMLTVDLGPDACDRVGDE
AMLWGNELPVEEVAAHIGTIGYELVTKLTSRVEMSYYGAGV
>P39265 ~~~alsB~~~D-allose-binding periplasmic protein~~~COG1879
MNKYLKYFSGTLVGLMLSTSAFAAAEYAVVLKTLSNPFWVDMKKGIEDEAKTLGVSVDIFASPSEGDFQSQLQLFEDLSN
KNYKGIAFAPLSSVNLVMPVARAWKKGIYLVNLDEKIDMDNLKKAGGNVEAFVTTDNVAVGAKGASFIIDKLGAEGGEVA
IIEGKAGNASGEARRNGATEAFKKASQIKLVASQPADWDRIKALDVATNVLQRNPNIKAIYCANDTMAMGVAQAVANAGK
TGKVLVVGTDGIPEARKMVEAGQMTATVAQNPADIGATGLKLMVDAEKSGKVIPLDKAPEFKLVDSILVTQ
>P32720 ~~~alsC~~~D-allose transport system permease protein AlsC~~~COG1172
MGFTTRVKSEASEKKPFNFALFWDKYGTFFILAIIVAIFGSLSPEYFLTTNNITQIFVQSSVTVLIGMGEFFAILVAGID
LSVGAILALSGMVTAKLMLAGVDPFLAAMIGGVLVGGALGAINGCLVNWTGLHPFIITLGTNAIFRGITLVISDANSVYG
FSFDFVNFFAASVIGIPVPVIFSLIVALILWFLTTRMRLGRNIYALGGNKNSAFYSGIDVKFHILVVFIISGVCAGLAGV
VSTARLGAAEPLAGMGFETYAIASAIIGGTSFFGGKGRIFSVVIGGLIIGTINNGLNILQVQTYYQLVVMGGLIIAAVAL
DRLISK
>P32719 5.1.3.-~~~alsE~~~D-allulose-6-phosphate 3-epimerase~~~COG0036
MKISPSLMCMDLLKFKEQIEFIDSHADYFHIDIMDGHFVPNLTLSPFFVSQVKKLATKPLDCHLMVTRPQDYIAQLARAG
ADFITLHPETINGQAFRLIDEIRRHDMKVGLILNPETPVEAMKYYIHKADKITVMTVDPGFAGQPFIPEMLDKLAELKAW
REREGLEYEIEVDGSCNQATYEKLMAAGADVFIVGTSGLFNHAENIDEAWRIMTAQILAAKSEVQPHAKTA
>P32718 2.7.1.55~~~alsK~~~D-allose kinase~~~COG1940
MQKQHNVVAGVDMGATHIRFCLRTAEGETLHCEKKRTAEVIAPGLVSGIGEMIDEQLRRFNARCHGLVMGFPALVSKDKR
TIISTPNLPLTAADLYDLADKLENTLNCPVEFSRDVNLQLSWDVVENRLTQQLVLAAYLGTGMGFAVWMNGAPWTGAHGV
AGELGHIPLGDMTQHCACGNPGCLETNCSGMALRRWYEQQPRNYPLRDLFVHAENAPFVQSLLENAARAIATSINLFDPD
AVILGGGVMDMPAFPRETLVAMTQKYLRRPLPHQVVRFIAASSSDFNGAQGAAILAHQRFLPQFCAKAP
>Q45068 ~~~alsT~~~Amino-acid carrier protein AlsT~~~COG1115
MESFFNSLINIPSDFIWKYLFYILIGLGLFFTIRFGFIQFRYFIEMFRIVGEKPEGNKGVSSMQAFFISAASRVGTGNLT
GVALAIATGGPGAVFWMWVVAAVGMASSFVESTLAQLYKVRDGEDFRGGPAYYIQKGLGARWLGIVFAILITVSFGLIFN
AVQTNTIAGALDGAFHVNKIVVAIVLAVLTAFIIFGGLKRVVAVSQLIVPVMAGIYILIALFVVITNITAFPGVIATIVK
NALGFEQVVGGGIGGIIVIGAQRGLFSNEAGMGSAPNAAATAHVSHPAKQGFIQTLGVFFDTFIICTSTAFIILLYSVTP
KGDGIQVTQAALNHHIGGWAPTFIAVAMFLFAFSSVVGNYYYGETNIEFIKTSKTWLNIYRIAVIAMVVYGSLSGFQIVW
DMADLFMGIMALINLIVIALLSNVAYKVYKDYAKQRKQGLDPVFKAKNIPGLKNAETWEDEKQEA
>P39049 4.2.2.3~~~alxM~~~Alginate lyase~~~
MIKSNLVISSLAIVSSMSYAGVEFSNPSGQLGEPANYTQFANILSASELQISDPNGKKGNKEYFALDNDFTGIVNDNFYV
DKQSQALVFKMANDHLRNELRVQKNFRTDLPDHFYTLYANVEILHPLQSMANSTSKQNEITFLQVHNKGLDDQGTHNVPH
PLLRVVWKENNQGVKGHFWAITKNNAVICKGSFGKKNKDKEMCRADVAYSKIDLGPAPTDKGTDFTITVGNKTLAIDVNG
QRKVEKNIDYWRHLLSYFKAGVYNQFTQGESEAHFNQLRYQVNTP
>P42601 ~~~alx~~~Putative membrane-bound redox modulator Alx~~~COG0861
MNTVGTPLLWGGFAVVVAIMLAIDLLLQGRRGAHAMTMKQAAAWSLVWVTLSLLFNAAFWWYLVQTEGRAVADPQALAFL
TGYLIEKSLAVDNVFVWLMLFSYFSVPAALQRRVLVYGVLGAIVLRTIMIFTGSWLISQFDWILYIFGAFLLFTGVKMAL
AHEDESGIGDKPLVRWLRGHLRMTDTIDNEHFFVRKNGLLYATPLMLVLILVELSDVIFAVDSIPAIFAVTTDPFIVLTS
NLFAILGLRAMYFLLAGVAERFSMLKYGLAVILVFIGIKMLIVDFYHIPIAVSLGVVFGILVMTFIINAWVNYRHDKQRG
G
>Q59478 4.2.2.3~~~alyA~~~Alginate lyase~~~
MLKSGVMVASLCLFSVPSRAAVPAPGDKFELSGWSLSVPVDSDNDGKADQIKEKTLAAGYRNSDFFTLSDAGGMVFKAPI
SGAKTSKNTTYTRSELREMLRKGDTSIATQGVSRNNWVLSSAPLSEQKKAGGVDGTLEATLSVDHVTTTGVNWQVGRVII
GQIHANNDEPIRLYYRKLPHHQKGSVYFAHEPRKGFGDEQWYEMIGTLQPSHGNQTAAPTEPEAGIALGETFSYRIDATG
NKLTVTLMREGRPDVVKTVDMSKSGYSEAGQYLYFKAGVYNQNKTGKPDDYVQATFYRLKATHGAQR
>Q59639 4.2.2.3~~~aly~~~Alginate lyase~~~
MKIISCKSIIVSSLLALSATATAGSFNDISWTLENEDNLPETDASGCALKPSTSTSTSKTFEFGLTDDSNCLDGKQRDEF
KYQRRTGYNRLTGYFTIDGNYSDFNKMGVAQTHDHSTSDTGVFSIYQVRKENGSYIFGVQGDSNYSNNGWSDHPQVKISL
DTRYELIIKTNGLPNGNSYEDANLYLDDVKIWSSSIEVGGEEKQYKKIGAYQLTGGEGEFHVKWDSVKLYTGK
>P37710 3.2.1.-~~~~~~Autolysin~~~COG1388
MKKESMSRIERRKAQQRKKTPVQWKKSTTLFSSALIVSSVGTPVALLPVTAEATEEQPTNAEVAQAPTTETGLVETPTTE
TTPGTTEQPTTDSSTTTESTTESSKETPTTPSTEQPTADSTTPVESGTTDSSVAEITPVAPSATESEAAPAVTPDDEVKV
PEARVASAQTFSALSPTQSPSEFIAELARCAQPIAQANDLYASVMMAQAIVESGWGASTLSKAPNYNLFGIKGSYNGQSV
YMDTWEYLNGKWLVKKEPFRKYPSYMESFQDNAHVLKTTSFQAGVYYYAGAWKSNTSSYRDATAWLTGRYATDPSYNAKL
NNVITAYNLTQYDTPSSGGNTGGGTVNPGTGGSNNQSGTNTYYTVKSGDTLNKIAAQYGVSVANLRSWNGISGDLIFVGQ
KLIVKKGASGNTGGSGSGGSNNNQSGTNTYYTVKSGDTLNKIAAQYGVSVANLRSWNGISGDLIFVGQKLIVKKGASGNT
GGSNNGGSNNNQSGTNTYYTIKSGDTLNKIAAQYGVSVANLRSWNGISGDLIFAGQKIIVKKGTSGNTGGSSNGGSNNNQ
SGTNTYYTIKSGDTLNKISAQFGVSVANLQAWNNISGSLIFAGQKIIVKKGANSGSTNTNKPTNNGGGATTSYTIKSGDT
LNKISAQFGVSVANLRSWNGIKGDLIFAGQTIIVKKGASAGGNASSTNSASGKRHTVKSGDSLWGLSMQYGISIQKIKQL
NGLSGDTIYIGQTLKVG
>P06653 3.5.1.28~~~lytA~~~Autolysin~~~COG5263
MEINVSKLRTDLPQVGVQPYRQVHAHSTGNPHSTVQNEADYHWRKDPELGFFSHIVGNGCIMQVGPVDNGAWDVGGGWNA
ETYAAVELIESHSTKEEFMTDYRLYIELLRNLADEAGLPKTLDTGSLAGIKTHEYCTNNQPNNHSDHVDPYPYLAKWGIS
REQFKHDIENGLTIETGWQKNDTGYWYVHSDGSYPKDKFEKINGTWYYFDSSGYMLADRWRKHTDGNWYWFDNSGEMATG
WKKIADKWYYFNEEGAMKTGWVKYKDTWYYLDAKEGAMVSNAFIQSADGTGWYYLKPDGTLADKPEFTVEPDGLITVK
>P37112 3.5.1.14~~~amaA~~~N-acyl-L-amino acid amidohydrolase~~~
MTKEEIKRLVDEVKTDVIAWRRHLHAHPELSFQEEKTAQFVYETLQSFGHLELSRPTKTSVMARLIGQQPGRVVAIRADM
DALPIQEENTFEFASKNPGVMHACGHDGHTAMLLGTAKIFSQLRDDIRGEIRFLFQHAEELFPGGAEEMVQAGVMDGVDV
VIGTHLWSPLERGKIGIVYGPMMAAPDRFFIRIIGKGGHGAMPHQTIDAIAIGAQVVTNLQHIVSRYVDPLEPLVLSVTQ
FVAGTAHNVLPGEVEIQGTVRTFDETLRRTVPQWMERIVKGITEAHGASYEFRFDYGYRPVINYDEGDPRHGGNGVRAVR
RRGSGPLETEHGRRRFLRLFAKSARQLFLRRRGQCRKRHRLPAPPPALYD
>P37113 3.5.1.87~~~amaB~~~N-carbamoyl-L-amino acid hydrolase~~~
MIQGERLWQRLMELGEVGKQPSGGVTRLSFTAEERRAKDLVASYMREAGLFVYEDAAGNLIGRKEGTNPDATVVLVGSHL
DSVYNGGCFDGPLGVLAGVEVVQTMNEHGVVTHHPIEVVAFTDEEGARFRFGMIGSRAMAGTLPPEALECRDAEGISLAE
AMKQAGLDPDRLPQAARKPGTVKAYVELHIEQGRVLEEAGLPVGIVTGIAGLIWVKFTIAGPAEHAGATPMSLRRDPMAA
AAQIIIVIEEEARRTGTTVGTVGQLHVYPGGINVIPERVEFVLDLRDLKAEVRDQVWKAIAVRAETIAKERNVRLTTERL
QEMAPVLCSEVVKQAAERACKQLGYPPFWLPSGAAHDGVQLAPICPIGMIFVRSQDGVSHSPAEWSTKEDCAVGAEVLYH
TVWQLAQGE
>Q53389 3.5.1.87~~~amaB~~~N-carbamoyl-L-amino acid hydrolase~~~
MIQGERLWQRLMELGEVGKQPSGGVTRLSFTAEERRAKDLVASYMREAGLFVYEDAAGNLIGRKEGTNPDATVVLVGSHL
DSVYNGGCFDGPLGVLAGVEVVQTMNEHGVVTHHPIEVVAFTDEEGARFRFGMIGSRAMAGTLPPEALECRDAEGISLAE
AMKQAGLDPDRLPQAARKPGTVKAYVELHIEQGRVLEETGLPVGIVTGIAGLIWVKFTIEGKAEHAGATPMSLRRDPMAA
AAQIIIVIEEEARRTGTTVGTVGQLHVYPGGINVIPERVEFVLDLRDLKAEVRDQVWKAIAVRAETIAKERNVRVTTERL
QEMPPVLCSDEVKRAAEAACQKLGYPSFWLPSGAAHDSVQLAPICPIGMIFVRSQDGVSHSPAEWSTKEDCAAGAEVLYH
TVWQLAQGE
>O06543 5.1.99.4~~~mcr~~~Alpha-methylacyl-CoA racemase~~~COG1804
MAGPLSGLRVVELAGIGPGPHAAMILGDLGADVVRIDRPSSVDGISRDAMLRNRRIVTADLKSDQGLELALKLIAKADVL
IEGYRPGVTERLGLGPEECAKVNDRLIYARMTGWGQTGPRSQQAGHDINYISLNGILHAIGRGDERPVPPLNLVGDFGGG
SMFLLVGILAALWERQSSGKGQVVDAAMVDGSSVLIQMMWAMRATGMWTDTRGANMLDGGAPYYDTYECADGRYVAVGAI
EPQFYAAMLAGLGLDAAELPPQNDRARWPELRALLTEAFASHDRDHWGAVFANSDACVTPVLAFGEVHNEPHIIERNTFY
EANGGWQPMPAPRFSRTASSQPRPPAATIDIEAVLTDWDG
>Q9I1H0 6.2.1.67~~~ambB~~~AMB antimetabolite synthase AmbB~~~
MQERHGLPLRSFSSGKALTAGRAVRPEVAQERRYLQGAPLGLELPGRIALRDPHCAWQWFEPEAAAEAFPAAHWLAAFLV
LLGRYGNEEITLGFPEPITVRGRQAPALLRSSYRAMESSAERSARLAEELDDARRQLSADGQERAALAGRCAVQVLAARP
TASSPGWLALVLAADGSVGLALRDPQYDELRRIAGHLARLARGLVDAQACVGRLPWLDADEERRLQALRSEPQAAPSRGV
LHHLFEAQARRTPQRIAVHAADRSLSYAELERESAALAVRLRAAGVAPEQRVGVCLRRDSGLLVGLLGVLRAGGCYVPLD
PAYPEERVAYMLDDADCLLVLVDASTRERVAALGRPCLTLEEGGDQANDLALPASEVGADHLAYIIYTSGSTGRPKGVAI
EHGSAHAFLRWAGQHYAAEEWSGVLAATSVCFDLSVYELFGTLAEGGTLHLVENLFSLPDYPRRDEISLLNTVPSVCAAL
LALGDLPGGVRTLNLAGEPLRGHLVRQIRGQPQVRRLVNLYGPTEDTTYSTVHELDLHAEALDEPPIGRPLPGTTVEVLD
GFEAPLPLGVAGELYLGGIGLARGYFGKPEQTAERFRVDPGSGERRYRTGDRVRMREDGVLEHLGRLDDQVKFNGFRIEL
GEIASCLASFPGVSEACAMLTEDSAGLRRLVGYLAAPFAPPLQALNEHLGQSLPHYMLPSAFVVLAELPKTLNGKIDRKA
LPRPQATGAEPQALPSDPLEQALHQAWQAQLGAPPRAGQGFYAAGGDSLRAVHLLATLRQRLSRRVPLQAFAGGPATPEA
LLELLRQAAPEGDEPEPSAGAAGLSLAERRLWVAQQLAPEDTSYNLLAHLRIVGATADAIEQALRQLLERHVALRRRVET
GVDGPQPHALAAHAVPLQRLLASDAVHAERLLEDGVRREGARVFDLAHEAPARLLLVVTRDSARADLLLSVHHYAFDDVS
LAVFAAELKTLLDGGRLGVLASTPEQVAARERAALASGRLDRVAERWAERLLPLAKAPGAAPARPEESGGRAGQRLALPV
SAAVHAACRALAERTSVSPFSAALQAFAEVLGAELGVDDLLVGVALAGRSRLEMQGLVGCFVNLLPLAVGLRPEQSVEWR
LRQVGHDLLELLEHQDVPLECVTQALRQRGASGLPIRIACGAHNGRAAPAVDAGVRVEADFIPVPGARLDLTLWLEDQPQ
GWLAVWTGVSAIFDLHRIERLHQAWERRLLANAGEPISKRMSPEGCNAS
>Q9I1H3 6.2.1.68~~~ambE~~~AMB antimetabolite synthase AmbE~~~
MSASEDLQSAVQPAASEALEGFPLSPLQTRAWRRHAERPENTVVGVRLHAPADPVATLERLRRALDGEAQLRVAYRTMPG
MSLPVQVLDGRAADLLVERLPGDGDWAGRFARESARLAASPLGGEGQPVLALGLLLDAAGETLQGLLLAAPAFVVDAASL
VALLRRGLGPAGQASADEGDEALLFQHFSEWANEALAGEDGESASGYWREQAAVAAESPLALADDLGEGEWTARRLLPRA
LLERLAANGLPEAAALLAWTQVAGQFQGDEGLPLEMARLVSGRLFNEFAELAGPFAGVAPLCLENVRAGSVGERLDALQA
AILAQEEAAALRDPFAPDWPLAELGFAWLAGELDGAGVAELDCRQPPLGGFLELQVLPHGEGRLASLRVRRDHDGTLAGR
LLDAWVECLESIAADRQLPLAGLPLIGAAERERYQAWQGERVEPAPVESLVAAFDLRAALQPQAPALLDAHGSLDFATLR
ARSEAVAEALLAAGVRPGQAVAVMTGRNREAIVALLGVMRAAAVYTPVNPEFPAARVERMREAGGIVFALADAECAGRAR
EAFAGACLDLSTLPLAGSGMSLPAPGGRDAAYMIFTSGTSGQPKGVVVEHASALNLSQALARTVYANVVGEGLRVTVNAP
FSFDSSIKQILQLLSGHCLVLVPQEVRSDPQRMLGFLEERRIDVLDCTPSLFRLLLQAGLDDAHPALPGRILVGGERFDE
ASWEVAAGWRRCQVFNLYGPTEATVNASLARVAEHARPTIGRALANVDLHVVDGLGRRKTRGASGELWIGGAGVARGYAG
DAGEAAGRFVEEGWPGSGRLYRSGDLVRWRADGCLEFLGRIDEQVKINGYRIELGEIRSALLEHPAVGEAAVLTDEADAA
EPGADRRIVAFVTAAEETADESWLEVDLPSGHRVAGLNLNETEYVYQEIFVDEVYSRDGIVLPPDAVVLDVGANIGLFSL
YIASRAPRARVVAFEPLAPIRRRLEANLGRYAPQVEVFGIGLSDAEREETFTYYPGYSTFSGIAEYADASGERDVIRRYL
SNQGEEGGANLLLDNIDEILDDRLRAEAHRCRLRRLDQVIGELGLERIDLLKIDVQRAEMDVLLGLDDAALAKVRQIVLE
VHDKRDGATAGRADALSDLLRRHGFEVSIRQDALLEGTDRYNCYAVRPGYAESLAERIDWRALAPRPAAALGGELSEQAL
RGFLEARLPAYMLPSRIARVERLPLTAEGKLDRRALLAALAAEAAAQTLEAPANATEAALLEIWKSVLKRPAIGVSDNFF
QVGGDSIRLIQMQVMAREAGLAFTLRDVFNHQSIRELARLLAAPASPADALGTSAPQSLEPFALLSAAERKRLPEGLDDA
YPMTSLQQGMLLQSEASGDPRLLHNVVLHEVHGRLDGELLARAWAILIGRHAILRTGFDLHGGQVPLQWVHPATAVAAEV
PVHDLCGLDGETRRLRLRAWIEEEQATPFDWSRPPLVRLAALALDERRFALGVAEHHSVLDGWSLQSLVDELLAVYADLL
AGVVAREAEAPAVGFRDYVALEREAEANAASALFWLDYLAGARYRPLPGLAEEGPRRMAAVRVDVPADSLSRLRALAERS
GLPLRSLLLAAHGRALCRFSDADEVVTGFVSHGRPEEPGADRLLGLFLNTLPCRLSASVDLLDSARRAFDYERASLEHRR
HPLAAIRRRNRELRLDSLFNFVDFHQDDAAPAGVRHGGILDQVVVDVDVPLAVDFEVAGERLEVGFQYAAGRFPAERAEA
LAGAYREALLALLGDPVQPPAAAQAEDSVELRRVLKVLSRVLGRPLAADQGFASAGGHSLLGVQAIAELRRLTGRQLSLG
LLQGDPDAREVVRRCHAADAPPLPPATERARALWLQRSGSAQPRLRLIALPPAGGNAGTFRGWDARLPADVELLAIQYPG
RQERQDEPFVTDVEAMLCAIDDALLPLLDRPFALIGASLGGMLAYELAARLESLHGLRARQLFVISSRAPGPDLEYPRFH
AMGDAELLRTLREYDVLPLEVLDDPELREISLATLRADSRLAADYRYRPREPLAIPITAILGEQDPGVSRVAIDGWRRHA
SRYELETLAGGHGLVVTAAEEVCAILRQRLAPDVPGGVPANLAT
>V5TF65 4.1.99.25~~~ambI1~~~L-tryptophan isonitrile synthase AmbI1~~~
MISEKILRHIFQYRRLLSDTEPCAKEPCSICLAPHLPKIQSFIENNEPIHFILPAFPAKSPNPQKVLGPMPDMGERVALQ
FLQNLCNQISEIYASGAKITICSDGRVFTDLVAITDENVSLYRQGIQRLLNEINADAIDTFCLENVFTGMSFDQMRKTLV
KQYAQPIESIQERVNSEDKHRQFFKGIYHLLFDDYLVLYPDKSREQIEVECNLRAYEVIQRSNAWTTLVGQHFPQSLRLS
IHPQDYHSNKIGIHMIKTSDQWGTPWHNAPMFNGKEFLLMKRKHIEDIGASLVWHNDHPSHYILSEQVSQALVTLDNKS
>V5TES5 4.1.99.25~~~ambI2~~~L-tryptophan isonitrile synthase AmbI2~~~
MTQIINITQSKVISEQILRHVFRHRRLISDTEPCVHQPCSLCLAPHLEKVQYFVEHNEPIHFILPAFPAKSPNTQKVLGT
MPDMGEQVSLKFLQSLCDQISEIYAPGAKLTICSDGRVFSDLVGVTDENVTLYGQIIQALLKEMKADAIDVFNLEDMYTD
LSFDEMRQKLVKLYGQTIEAIKDAVKNNDHQCQMFNGIHRFLVEDYQVLEAHKSRNKIRLECKTRAYEVIQRSNAWSVLI
SELYPHSVRLSIHPQHYHSEKIGIHMIKTLDQWGTPWHNATVFDGKEFMLMKRSHLESMGATLVCQNGHPSYFAWTEQPL
ETRITVQEVI
>V5TD18 1.14.20.11~~~ambI3~~~3-((Z)-2-isocyanoethenyl)-1H-indole synthase~~~
MIVSTSVEQSAQFSVKSLTPFGALLEATEDHSDIQQLSIEQLCQLTWEHRLIVLRGFSLLEREELSTYCQRWGELLVWNF
GTVLDLIVHQNPENYLFTNGNVPFHWDGAFAEAVPRFLFFQCLKAPEAGSGGESLFCDTVRILQNVSPQQREIWQKTEIS
YKTQKVAHYGGEITKSLVIKHPITGLSTLRFAEPLNDASVHLNPLYVEVCNLPAEEQNPFINELIENLYLPQNCFAHEWQ
EGDFLIADNHALLHGRNPFLSNSQRHLQRVHIL
>P04172 ~~~mauC~~~Amicyanin-alpha~~~COG3794
MRALAFAAALAAFSATAALAAGALEAVQEAPAGSTEVKIAKMKFQTPEVRIKAGSAVTWTNTEALPHNVHFKSGPGVEKD
VEGPMLRSNQTYSVKFNAPGTYDYICTPHPFMKGKVVVE
>P22364 ~~~mauC~~~Amicyanin~~~
MISATKIRSCLAACVLAAFGATGALADKATIPSESPFAAAEVADGAIVVDIAKMKYETPELHVKVGDTVTWINREAMPHN
VHFVAGVLGEAALKGPMMKKEQAYSLTFTEAGTYDYHCTPHPFMRGKVVVE
>P22365 ~~~mauC~~~Amicyanin~~~COG3794
MISAKTLRPAIAAIALFAIGATGAWAQDKITVTSEKPVAAADVPADAVVVGIEKMKYLTPEVTIKAGETVYWVNGEVMPH
NVAFKKGIVGEDAFRGEMMTKDQAYAITFNEAGSYDYFCTPHPFMRGKVIVE
>Q05115 4.1.1.76~~~~~~Arylmalonate decarboxylase~~~
MQQASTPTIGMIVPPAAGLVPADGARLYPDLPFIASGLGLGSVTPEGYDAVIESVVDHARRLQKQGAAVVSLMGTSLSFY
RGAAFNAALTVAMREATGLPCTTMSTAVLNGLRALGVRRVALATAYIDDVNERLAAFLAEESLVPTGCRSLGITGVEAMA
RVDTATLVDLCVRAFEAAPDSDGILLSCGGLLTLDAIPEVERRLGVPVVSSSPAGFWDAVRLAGGGAKARPGYGRLFDES
>Q07838 3.5.1.4~~~amdA~~~Acetamidase~~~
MPEVVFSVDHSKSMRDQAVPGHNRWHPDIPAAATVKPGSEFRIECKEWTDGQIGNNDSANDVRDVDLAPCHMLSGPIKVE
GAEPGDLLIVDILDIGPVPQTNGPNCGEGWGYSGIFAKVNGGGFLTDYYPDAYKAIWDFHGQQCTSRHVPGVRYTGITHP
GLFGTAPSPDLLAKWNERERALIATDPDRVPPLALPPLVDGTLGGTASGDLLQAIANDGARTVPPRENGGNHDIKNFTRG
SRIFYPVFVEGAMLSGGDLHFSQGDGEINFCGAIEMGGFIDMHVDLIKGGMETYGVTTNPIFMPGRVEPLYSEWLTFIGI
SVDHAENRNAYMDATMAYRNACLNAIEYLKKWGYTGEQAYLILGTSPIEGASAASWTSRTHVVRCSCRPRSSTSTSPRRQ
GPAEGR
>Q93P60 2.4.1.337~~~mgs~~~Alpha-monoglucosyldiacylglycerol synthase~~~
MRIGIFSEAYLPLISGVVTSVVNLKEGLEALGHEVYVITPIPSKDKFENDPSVIRIPGWVIPRKSLKGFRLVLFVKRYVR
KMRKLKLDVVHIHTEFSMGKLGLAVAKKERIPSVYTLHTSYQDYTHYVSKLLTRFAPNAAKKLAGKINNQYTKNCHMTIV
PTKKIYDKMIRLKHDGEFTIIPSGINLKPFYKSSYTSEQVQALKDKLGIRNDEFVAILVARIAKEKSIGDLVEAFVEFYK
SYPNSRFIIIGDGPDKPVLDKLIDSKKASKYINTLGFVKNAEVGLYYQIADVFLNASTTETQGLTYVEALAASLPIIVRY
DDVFDAFVEDGKNGIFFNKNEELVKHLIHIRQNPEILGTLSKNAEISTKPYAKEVYAKSCETLYLDLIDKNNKKLNKK
>Q8CWR6 2.4.1.337~~~~~~Alpha-monoglucosyldiacylglycerol synthase~~~COG0438
MRIGLFTDTYFPQVSGVATSIRTLKTELEKQGHAVFIFTTTDKDVNRYEDWQIIRIPSVPFFAFKDRRFAYRGFSKALEI
AKQYQLDIIHTQTEFSLGLLGIWIARELKIPVIHTYHTQYEDYVHYIAKGMLIRPSMVKYLVRGFLHDVDGVICPSEIVR
DLLSDYKVKVEKRVIPTGIELAKFERPEIKQENLKELRSKLGIQDGEKTLLSLSRISYEKNIQAVLVAFADVLKEEDKVK
LVVAGDGPYLNDLKEQAQNLEIQDSVIFTGMIAPSETALYYKAADFFISASTSETQGLTYLESLASGTPVIAHGNPYLNN
LISDKMFGALYYGEHDLAGAILEALIATPDMNEHTLSEKLYEISAENFGKRVHEFYLDAIISNNFQKDLAKDDTVSQRIF
KTVLYLPQQVVAVPVKGSRRMLKASKTQLISMRDYWKDHEE
>Q88QT3 2.7.1.221~~~amgK~~~N-acetylmuramate/N-acetylglucosamine kinase~~~COG3178
MPEHDVRLQQLTVWLDEQLNDLFRDNAWGEVPAGSLTAASSDASFRRYFRWQGAGHSFVIMDAPPPQENCRPFVAIDHLL
ASADVHVPLIHAQDLERGFLLLGDLGTQTYLDIINADNADGLFADAIDALLKFQRLPMDAPLPSYDDALLRREVELFPEW
YVGRELGLTFTDAQKATWQRVSQLLIDSALAQPKVLVHRDYMPRNLMQSTPNPGVLDFQDAVYGPVTYDITCLFKDAFVS
WPQARVEGWLGDYWQQAQAAGIPVHAEFEAFHRASDLMGVQRHLKVIGIFARICHRDGKPRYLGDVPRFFAYINEVIGRR
PELAELGELIAELQAGARA
>D3FSJ2 ~~~amhM~~~Ammonium/H(+) antiporter subunit AmhM~~~COG0490
MKITSGDLPGVGKKISFITSEGSMVVLVIHHTGKREMYFFDDADDDEVSFSLTLSAEETKQMGAQLLGAILNPADTDKID
RIKLIRKQVVVEWIDITKHSPIISKSIAQIEKMKPKGISIVGVFKNDEMMVDPEPTLVLEKGDTLMAVGKRDAIQKFEEL
CACKENN
>D3FSJ3 ~~~amhT~~~Ammonium/H(+) antiporter subunit AmhT~~~COG0475
MVIPELFSAGLILLLLFITGFVGMKMKIPDVVIFILLGIAVGGLLSGSHLLHFAGEVGIVLLFFMLGMEFPLKQLMSIAK
KVLRAGILDVALSFGVTMAICMMMGLDVITSLIIGGVAYATSSSITAKMLESSKRMANPESEFMLGLLIFEDLVAPILVA
VLVGLTAGMALTAGSMSLLVVKVVALVAGAVILGVFLFRKLGSFFDRHMKHDLFILFVIGLALMYGGLALYLDLSEVLGA
FLAGIMLAEVKRTHELELMVVRFRDLLLPLFFLYFGTTISFSEGIPMIPLLILVLVWSVIAKVIVGVLGGRWYGLTKKVS
LRAGLSLTQRGEFSIIIASLAAGSIKAFSSVFILASAMIGILLFQFAPSIANKFYGKKAKTSVKQHVGSA
>P9WQ95 3.5.1.4~~~amiC~~~Putative amidase AmiC~~~COG0154
MSRVHAFVDDALGDLDAVALADAIRSGRVGRADVVEAAIARAEAVNPALNALAYAAFDVARDAAAMGTGQEAFFSGVPTF
IKDNVDVAGQPSMHGTDAWEPYAAVADSEITRVVLGTGLVSLGKTQLSEFGFSAVAEHPRLGPVRNPWNTDYTAGASSSG
SGALVAAGVVPIAHANDGGGSIRIPAACNGLVGLKPSRGRLPLEPEYRRLPVGIVANGVLTRTVRDTAAFYREAERLWRN
HQLPPVGDVTSPVKQRLRIAVVTRSVLREASPEVRQLTLKLAGLLEELGHRVEHVDHPPAPASFVDDFVLYWGFLALAQV
RSGRRTFGRTFDPTRLDELTLGLARHTGRNLHRLPLAIMRLRMLRRRSVRFFGTYDVLLTPTVAEATPQVGYLAPTDYQT
VLDRLSSWVVFTPVQNVTGVPAISLPLAQSADGMPVGMMLSADTGREALLLELAYELEEARPWARIHAPNIAE
>P9WQ93 3.5.1.4~~~amiD~~~Putative amidase AmiD~~~COG0154
MTDADSAVPPRLDEDAISKLELTEVADLIRTRQLTSAEVTESTLRRIERLDPQLKSYAFVMPETALAAARAADADIARGH
YEGVLHGVPIGVKDLCYTVDAPTAAGTTIFRDFRPAYDATVVARLRAAGAVIIGKLAMTEGAYLGYHPSLPTPVNPWDPT
AWAGVSSSGCGVATAAGLCFGSIGSDTGGSIRFPTSMCGVTGIKPTWGRVSRHGVVELAASYDHVGPITRSAHDAAVLLS
VIAGSDIHDPSCSAEPVPDYAADLALTRIPRVGVDWSQTTSFDEDTTAMLADVVKTLDDIGWPVIDVKLPALAPMVAAFG
KMRAVETAIAHADTYPARADEYGPIMRAMIDAGHRLAAVEYQTLTERRLEFTRSLRRVFHDVDILLMPSAGIASPTLETM
RGLGQDPELTARLAMPTAPFNVSGNPAICLPAGTTARGTPLGVQFIGREFDEHLLVRAGHAFQQVTGYHRRRPPV
>P9WQ99 3.5.1.4~~~amiA2~~~Putative amidase AmiA2~~~COG0154
MVGASGSDAGAISGSGNQRLPTLTDLLYQLATRAVTSEELVRRSLRAIDVSQPTLNAFRVVLTESALADAAAADKRRAAG
DTAPLLGIPIAVKDDVDVAGVPTAFGTQGYVAPATDDCEVVRRLKAAGAVIVGKTNTCELGQWPFTSGPGFGHTRNPWSR
RHTPGGSSGGSAAAVAAGLVTAAIGSDGAGSIRIPAAWTHLVGIKPQRGRISTWPLPEAFNGVTVNGVLARTVEDAALVL
DAASGNVEGDRHQPPPVTVSDFVGIAPGPLKIALSTHFPYTGFRAKLHPEILAATQRVGDQLELLGHTVVKGNPDYGLRL
SWNFLARSTAGLWEWAERLGDEVTLDRRTVSNLRMGHVLSQAILRSARRHEAADQRRVGSIFDIVDVVLAPTTAQPPPMA
RAFDRLGSFGTDRAIIAACPSTWPWNLLGWPSINVPAGFTSDGLPIGVQLMGPANSEGMLISLAAELEAVSGWATKQPQV
WWTS
>P36548 3.5.1.28~~~amiA~~~N-acetylmuramoyl-L-alanine amidase AmiA~~~COG0860
MSTFKPLKTLTSRRQVLKAGLAALTLSGMSQAIAKDELLKTSNGHSKPKAKKSGGKRVVVLDPGHGGIDTGAIGRNGSKE
KHVVLAIAKNVRSILRNHGIDARLTRSGDTFIPLYDRVEIAHKHGADLFMSIHADGFTNPKAAGASVFALSNRGASSAMA
KYLSERENRADEVAGKKATDKDHLLQQVLFDLVQTDTIKNSLTLGSHILKKIKPVHKLHSRNTEQAAFVVLKSPSVPSVL
VETSFITNPEEERLLGTAAFRQKIATAIAEGVISYFHWFDNQKAHSKKR
>P18791 ~~~amiA~~~Oligopeptide-binding protein AmiA~~~COG4166
MKKNRVFATAGLVLLAAGVLAACSSSKSSDSSAPKAYGYVYTADPETLDYLISSKNSTTVVTSNGIDGLFTNDNYGNLAP
AVAEDWEVSKDGLTYTYKIRKGVKWFTSDGEEYAEVTAKDFVNGLKHAADKKSEAMYLAENSVKGLADYLSGTSTDFSTV
GVKAVDDYTLQYTLNQPEPFWNSKLTYSIFWPLNEEFETSKGSDFAKPTDPTSLLYNGPFLLKGLTAKSSVEFVKNEQYW
DKENVHLDTINLAYYDGSDQESLERNFTSGAYSYARLYPTSSNYSKVAEEYKDNIYYTQSGSGIAGLGVNIDRQSYNYTS
KTTDSEKVATKKALLNKDFRQALNFALDRSAYSAQINGKDGAALAVRNLFVKPDFVSAGEKTFGDLVAAQLPAYGDEWKG
VNLADGQDGLFNADKAKAEFAKAKKALEADGVQFPIHLDVPVDQASKNYISRIQSFKQSVETVLGVENVVVDIQQMTSDE
FLNITYYAANASSEDWDVSGGVSWGPDYQDPSTYLDILKTTSSETTKTYLGFDNPNSPSVVQVGLKEYDKLVDEAARETS
DLNVRYEKYAAAQAWLTDSSLFIPAMASSGAAPVLSRIVPFTGASAQTGSKGSDVYFKYLKSQDKVVTKEEYEKAREKWL
KEKAESNEKAQKELASHVK
>P9WQ97 3.5.1.4~~~amiB2~~~Putative amidase AmiB2~~~COG0154
MDPTDLAFAGAAAQARMLADGALTAPMLLEVYLQRIERLDSHLRAYRVVQFDRARAEAEAAQQRLDAGERLPLLGVPIAI
KDDVDIAGEVTTYGSAGHGPAATSDAEVVRRLRAAGAVIIGKTNVPELMIMPFTESLAFGATRNPWCLNRTPGGSSGGSA
AAVAAGLAPVALGSDGGGSIRIPCTWCGLFGLKPQRDRISLEPHDGAWQGLSVNGPIARSVMDAALLLDATTTVPGPEGE
FVAAAARQPGRLRIALSTRVPTPLPVRCGKQELAAVHQAGALLRDLGHDVVVRDPDYPASTYANYLPRFFRGISDDADAQ
AHPDRLEARTRAIARLGSFFSDRRMAALRAAEVVLSSRIQSIFDDVDVVVTPGAATGPSRIGAYQRRGAVSTLLLVVQRV
PYFQVWNLTGQPAAVVPWDFDGDGLPMSVQLVGRPYDEATLLALAAQIESARPWAHRRPSVS
>P26365 3.5.1.28~~~amiB~~~N-acetylmuramoyl-L-alanine amidase AmiB~~~COG0860
MMYRIRNWLVATLLLLCTPVGAATLSDIQVSNGNQQARITLSFIGDPDYAFSHQSKRTVALDIKQTGVIQGLPLLFSGNN
LVKAIRSGTPKDAQTLRLVVDLTENGKTEAVKRQNGSNYTVVFTINADVPPPPPPPPVVAKRVETPAVVAPRVSEPARNP
FKTESNRTTGVISSNTVTRPAARATANTGDKIIIAIDAGHGGQDPGAIGPGGTREKNVTIAIARKLRTLLNDDPMFKGVL
TRDGDYFISVMGRSDVARKQNANFLVSIHADAAPNRSATGASVWVLSNRRANSEMASWLEQHEKQSELLGGAGDVLANSQ
SDPYLSQAVLDLQFGHSQRVGYDVATSMISQLQRIGEIHKRRPEHASLGVLRSPDIPSVLVETGFISNNSEERLLASDDY
QQQLAEAIYKGLRNYFLAHPMQSAPQGATAQTASTVTTPDRTLPN
>P63883 3.5.1.28~~~amiC~~~N-acetylmuramoyl-L-alanine amidase AmiC~~~COG0860
MSGSNTAISRRRLLQGAGAMWLLSVSQVSLAAVSQVVAVRVWPASSYTRVTVESNRQLKYKQFALSNPERVVVDIEDVNL
NSVLKGMAAQIRADDPFIKSARVGQFDPQTVRMVFELKQNVKPQLFALAPVAGFKERLVMDLYPANAQDMQDPLLALLED
YNKGDLEKQVPPAQSGPQPGKAGRDRPIVIMLDPGHGGEDSGAVGKYKTREKDVVLQIARRLRSLIEKEGNMKVYMTRNE
DIFIPLQVRVAKAQKQRADLFVSIHADAFTSRQPSGSSVFALSTKGATSTAAKYLAQTQNASDLIGGVSKSGDRYVDHTM
FDMVQSLTIADSLKFGKAVLNKLGKINKLHKNQVEQAGFAVLKAPDIPSILVETAFISNVEEERKLKTATFQQEVAESIL
AGIKAYFADGATLARRG
>Q9K0V3 3.5.1.28~~~amiC~~~N-acetylmuramoyl-L-alanine amidase AmiC~~~
MIKLTRRQIIRRTAGTLFALSPIASAVAKTVRAPQFTAARIWPSHTYTRLTLESTAALKYQHFTLDNPGRLVVDIQNANI
NTVLHGLSQKVMADDPFIRSIRAGQNTPTTVRLVIDLKQPTHAQVFALPPVGGFKNRLVVDLYPHGMDADDPMMALLNGS
LNKTLRGSPEADLAQNTTPQPGRGRNGRRPVIMLDPGHGGEDPGAISPGGLQEKHVVLSIARETKNQLEALGYNVFMTRN
EDVFIPLGVRVAKGRARRADVFVSIHADAFTSPSARGTGVYMLNTKGATSSAAKFLEQTQNNADAVGGVPTSGNRNVDTA
LLDMTQTATLRDSRKLGKLVLEELGRLNHLHKGRVDEANFAVLRAPDMPSILVETAFLSNPAEEKLLGSESFRRQCAQSI
ASGVQRYINTSVLKRG
>P27017 ~~~amiC~~~Aliphatic amidase expression-regulating protein~~~
MGSHQERPLIGLLFSETGVTADIERSQRYGALLAVEQLNREGGVGGRPIETLSQDPGGDPDRYRLCAEDFIRNRGVRFLV
GCYMSHTRKAVMPVVERADALLCYPTPYEGFEYSPNIVYGGPAPNQNSAPLAAYLIRHYGERVVFIGSDYIYPRESNHVM
RHLYRQHGGTVLEEIYIPLYPSDDDVQRAVERIYQARADVVFSTVVGTGTAELYRAIARRYGDGRRPPIASLTTSEAEVA
KMESDVAEGQVVVAPYFSSIDTAASRAFVQACHGFFPENATITAWAEAAYWQTLLLGRAAQAAGSWRVEDVQRHLYDICI
DAPQGPVRVERQNNHSRLSSRIAEIDARGVFQVRWQSPEPIRPDPYVVVHNLDDWSASMGGGALP
>P75820 3.5.1.28~~~amiD~~~N-acetylmuramoyl-L-alanine amidase AmiD~~~COG3023
MRRFFWLVAAALLLAGCAGEKGIVEKEGYQLDTRRQAQAAYPRIKVLVIHYTADDFDSSLATLTDKQVSSHYLVPAVPPR
YNGKPRIWQLVPEQELAWHAGISAWRGATRLNDTSIGIELENRGWQKSAGVKYFAPFEPAQIQALIPLAKDIIARYHIKP
ENVVAHADIAPQRKDDPGPLFPWQQLAQQGIGAWPDAQRVNFYLAGRAPHTPVDTASLLELLARYGYDVKPDMTPREQRR
VIMAFQMHFRPTLYNGEADAETQAIAEALLEKYGQD
>P27765 3.5.1.4~~~~~~Amidase~~~
MAITRPTLDQVLDIRTQLHMQLTHEQAASYLELMQPSFDAYDLVDELADFVPPIRYDRSSGYRHRPSAKENPLNAWYYRT
EVNGAREGLLAGKTVALKDNISLAGVPMMNGAAPLEGFVPGFDATVVTRLLDAGATILGKATCEHYCLSGGSHTSDPAPV
HNPHRHGYASGGSSSGSAALVASGEVDIAVGGDQGGSIRIPSAFCGTYGMKPTHGLVPYTGVMAIEATIDHVGPITGNVR
DNALMLQAMAGADGLDPRQAAPQVDDYCSYLEKGVSGLRIGVLQEGFALANQDPRVADKVRDAIARLEALGAHVEPVSIP
EHNLAGLLWHPIGCEGLTMQMMHGNGAGFNWKGLYDVGLLDKQASWRDDADQLSASLKLCMFVGQYGLSRYNGRYYAKAQ
NLARFARQGYDKALQTYDLLVMPTTPITAQPHPPANCSITEYVARALEMIGNTAPQDITGHPAMSIPCGLLDGLPVGLML
VAKHYAEGTIYQAAAAFEASVDWRTL
>P22984 3.5.1.4~~~amdA~~~Amidase~~~
MATIRPDDKAIDAAARHYGITLDKTARLEWPALIDGALGSYDVVDQLYADEATPPTTSREHAVPSASENPLSAWYVTTSI
PPTSDGVLTGRRVAIKDNVTVAGVPMMNGSRTVEGFTPSRDATVVTRLLAAGATVAGKAVCEDLCFSGSSFTPASGPVRN
PWDRQREAGGSSGGSAALVANGDVDFAIGGDQGGSIRIPAAFCGVVGHKPTFGLVPYTGAFPIERTIDHLGPITRTVHDA
ALMLSVIAGRDGNDPRQADSVEAGDYLSTLDSDVDGLRIGIVREGFGHAVSQPEVDDAVRAAAHSLTEIGCTVEEVNIPW
HLHAFHIWNVIATDGGAYQMLDGNGYGMNAEGLYDPELMAHFASRRIQHADALSETVKLVALTGHHGITTLGGASYGKAR
NLVPLARAAYDTALRQFDVLVMPTLPYVASELPAKDVDRATFITKALGMIANTAPFDVTGHPSLSVPAGLVNGLPVGMMI
TGRHFDDATVLRVGRAFEKLRGAFPTPAERASNSAPQLSPA
>P84650 3.5.1.4~~~amdA~~~Enantioselective amidase~~~
MSSLTPPNSNQMSALNNHFFFGLTTPKLEEFAPALEATLAYAVTDERVYERTAPEPPDRSWTTPTAAENPLSAWYVTTSI
SYTDGGPLAGRTVAIKDNVTVAGVPMMNGSRVVEGFTPRYDATVLRRLLDAGATKAGKAVCEDLCFSGSSVTSHPQPVRN
PWDESHGYKAGGSSSGSEALVASGHVDCAVGGDGGGSIRIPLACCGIVGCKPTHGLKPYTFPIERTIDHLGPMTRTVGDA
AMMLTVLAGTDGLDPRQADHRIEPVDYLAALAEPAGLRVVVVTEGFDTPVQDAAVDDAVRAAILVLRSGCLTVEIVSIPI
HLDAFAVWNVIATEGAAYQMLDGNYYGMNTGGFYDPELIIHFSRRRLEHGHQLSKTVKLVGMGGRYTSETGGGKYAAAAR
QLVREVRAAYDLALARYDVLVMPTLPYTATKIPITDIPLADYLDTALSMIINTAPFDVTGHPALCPVAGAVHGLPVGMMI
IGKAHDDDATVLRVAAFEHAVGNYPVPPEAASTLATL
>Q9L543 3.5.1.4~~~amiE~~~Aliphatic amidase~~~
MRHGDISSSHDTVGIAVVNYKMPRLHTKAEVIENAKKIADMVVGMKQGLPGMDLVVFPEYSTMGIMYDQDEMFATAASIP
GEETAIFAEACKKADTWGVFSLTGEKHEDHPNKAPYNTLVLINNKGEIVQKYRKIIPWCPIEGWYPGDTTYVTEGPKGLK
ISLIVCDDGNYPEIWRDCAMKGAELIVRCQGYMYPAKEQQIMMAKAMAWANNTYVAVANATGFDGVYSYFGHSAIIGFDG
RTLGECGTEENGIQYAEVSISQIRDFRKNAQSQNHLFKLLHRGYTGLINSGEGDRGVAECPFDFYRTWVLDAEKARENVE
KITRSTVGTAECPIQGIPNEGKTKEIGV
>O05213 3.5.1.28~~~amiE~~~N-acetylmuramyl-L-alanine amidase~~~COG1680
MKTKTLFIFSAILTLSIFAPNETFAQTAGNLIEPKIINAETAQFSTKKLRKVDQMIERDIAAGFPGAVLVVVKDGRIIKK
AAYGYSKKYEGSELLRRPAKMKTRTMFDLASNTKMYATNFALQRLVSQGKLDVYEKVSAYLPGFKDQPGDLIKGKDKIRV
IDVLQHQSGLPSSFYFYTPEKAGKYYSQERDKTIEYLTKIPLDYQTGTKHVYSDIGYMLLGCIVEKLTGKPLDVYTEQEL
YKPLRLKHTLYNPLQKGFKPKQFAATERMGNTRDGVIQFPNIRTNTLQGEVHDEKAFYSMDGVSGHAGLFSNADDMAILL
QVMLNKGSYRNISLFDQKTADLFTAPSATDPTFALGWRRNGSKSMEWMFGPHASENAYGHTGWTGTVTIIDPAYNLGIAL
LTNKKHTPVIDPEENPNVFEGDQFPTGSYGSVITAIYEAME
>Q9RQ17 3.5.1.4~~~~~~Aliphatic amidase~~~
MRHGDISSSHDTVGVAVVNYKMPRLHTKKEVIENAKNIANMIVGMKQGLPGMDLVIFPEYSTMGIMYDRKEMFETATTIP
GPETEIFAEACRKANTWGVFSLTGEQHEEHPHKNPYNTLVLINNKGEIVQKYRKIIPWCPIEGWYPGDTTYVTEGPKGIK
ISLIICDDGNYPEIWRDCAMKGAELIVRCQGYMYPAKEQQIMMAKTMAWANNVYVAVANATGFDGVYSYFGHSAIIGFDG
RTLGECGEEENGIQYAEISLSQIRDFRQNAQSQNHLFKLLHRGYTGIIQSGEGDKGVAECPFDFYRTWVMDAEKARENVE
KITRTTIGTAECPIEGIPHEGKEKEASV
>O25067 3.5.1.4~~~amiE~~~Aliphatic amidase~~~COG0388
MRHGDISSSPDTVGVAVVNYKMPRLHTKNEVLENCRNIAKVIGGVKQGLPGLDLIIFPEYSTHGIMYDRQEMFDTAASVP
GEETAIFAEACKKNKVWGVFSLTGEKHEQAKKNPYNTLILVNDKGEIVQKYRKILPWCPIECWYPGDKTYVVDGPKGLKV
SLIICDDGNYPEIWRDCAMRGAELIVRCQGYMYPAKEQQIAIVKAMAWANQCYVAVANATGFDGVYSYFGHSSIIGFDGH
TLGECGEEENGLQYAQLSVQQIRDARKYDQSQNQLFKLLHRGYSGVFASGDGDKGVAECPFEFYKTWVNDPKKAQENVEK
ITRPSVGVAACPVGDLPTK
>P11436 3.5.1.4~~~amiE~~~Aliphatic amidase~~~
MRHGDISSSNDTVGVAVVNYKMPRLHTAAEVLDNARKIAEMIVGMKQGLPGMDLVVFPEYSLQGIMYDPAEMMETAVAIP
GEETEIFSRACRKANVWGVFSLTGERHEEHPRKAPYNTLVLIDNNGEIVQKYRKIIPWCPIEGWYPGGQTYVSEGPKGMK
ISLIICDDGNYPEIWRDCAMKGAELIVRCQGYMYPAKDQQVMMAKAMAWANNCYVAVANAAGFDGVYSYFGHSAIIGFDG
RTLGECGEEEMGIQYAQLSLSQIRDARANDQSQNHLFKILHRGYSGLQASGDGDRGLAECPFEFYRTWVTDAEKARENVE
RLTRSTTGVAQCPVGRLPYEGLEKEA
>Q01360 3.5.1.4~~~amiE~~~Aliphatic amidase~~~
MRHGDISSSNDTVGVAVVNYKMPRLHDRAGVLENARKIADMMIGMKTGLPGMDLVVFPEYSTQGIMYNEEEMYATAATIP
GDETAIFSAACREADTWGIFSITGEQHEDHPNKPPYNTLILIDNKGEIVQRYRKILPWCPIEGWYPGDTTYVTEGPKGLK
ISLIICDDGNYPEIWRDCAMKGAELIVRCQGYMYPAKDQQVMMSKAMAWANNCYVAVANAAGFDGVYSYFGHSAIIGFDG
RTLGETGEEEYGIQYAQLSVSAIRDARENDQSQNHIFKLLHRGYSGVHAAGDGDKGVADCPFEFYKLWVTDAQKAQERVE
AITRDTVGVADCRVGNLPVEKTVEA
>P59701 3.5.1.49~~~amiF~~~Formamidase~~~
MGSSGSMVKPISGFLTALIQYPVPVVESRADIDKQIQQIIKTIHSTKSGYPGLELIVFPEYSTQGLNTKKWTTEEFLCTV
PGPETDLFAEACKESKVYGVFSIMEKNPDGGEPYNTAVIIDPQGEMILKYRKLNPWVPVEPWKAGDLGLPVCDGPGGSKL
AVCICHDGMFPEVAREAAYKGANVLIRISGYSTQVSEQWMLTNRSNAWQNLMYTLSVNLAGYDGVFYYFGEGQVCNFDGT
TLVQGHRNPWEIVTAEVYPELADQARLGWGLENNIYNLGSRGYVATPGGVKENPYTFVKDLAEGKYKVPWEDEIKVKDGS
IYGYPVKKTIHS
>O25836 3.5.1.49~~~amiF~~~Formamidase~~~COG0388
MGSIGSMGKPIEGFLVAAIQFPVPIVNSRKDIDHNIESIIRTLHATKAGYPGVELIIFPEYSTQGLNTAKWLSEEFLLDV
PGKETELYAKACKEAKVYGVFSIMERNPDSNKNPYNTAIIIDPQGEIILKYRKLFPWNPIEPWYPGDLGMPVCEGPGGSK
LAVCICHDGMIPELAREAAYKGCNVYIRISGYSTQVNDQWILTNRSNAWHNLMYTVSVNLAGYDNVFYYFGEGQICNFDG
TTLVQGHRNPWEIVTGEIYPKMADNARLSWGLENNIYNLGHRGYVAKPGGEHDAGLTYIKDLAAGKYKLPWEDHMKIKDG
SIYGYPTTGGRFGK
>O34640 2.7.1.230~~~amiN~~~Amicoumacin kinase~~~COG2334
MLDVHKDIKKIFHEEQVLAEAAARYGFSKDQVRFLADAENYVYECMKDNQPYILKITHTIRRSSDYMMGEMEWLRHLAIG
GISVAKPLPSLNGKDVEAVPDGNGGSFLLRVYEKAPGQKVDESDWNETLFYELGRYTGSMHSLTKSYKLSNPAFKRQEWD
EEEQLKLRKYVPEDQIKVFQQADSLMNELRRLPKSQDNYGLVHADLHHGNFNWDHGKITAFDFDDIGYNWFVNDISILLY
NVLWYPVVPYDDKAAFTEEFMTHFMKGYWEENELDPAWLMIIPDFLRLRHMLIYGLLHQMFDLNTIGEEEKEMLAGFRRD
IENGTPITAFDFSALV
>P10932 ~~~amiR~~~Aliphatic amidase regulator~~~
MSANSLLGSLRELQVLVLNPPGEVSDALVLQLIRIGCSVRQCWPPPESFDVPVDVVFTSIFQNRHHDEIAALLAAGTPRT
TLVALVEYESPAVLSQIIELECHGVITQPLDAHRVLPVLVSARRISEEMAKLKQKTEQLQERIAGQARINQAKALLMQRH
GWDEREAHQYLSREAMKRREPILKIAQELLGNEPSA
>P96581 ~~~amj~~~Lipid II flippase Amj~~~
MHVITTQVLFIFCFLLLIHSIETLAYATRLSGARVGFIASALSLFNVMVIVSRMSNMVQQPFTGHLIDDAGKNALAIVGE
QFRFLIFGSTVGTILGIILLPSFVALFSRAIIHLAGGGGSVFQVFRKGFSKQGFKNALSYLRLPSISYVKGFHMRLIPKR
LFVINMLITSIYTIGVLSALYAGLLAPERSTTAVMASGLINGIATMLLAIFVDPKVSVLADDVAKGKRSYIYLKWTSVTM
VTSRVAGTLLAQLMFIPGAYYIAWLTKWF
>Q6J1Z5 ~~~cnbCa~~~2-aminophenol 1,6-dioxygenase subunit alpha~~~
MTVVSAFLVPGTPLPQLKPEVPSWGQLAAATERAGKALAASRPDVVLVYSTQWLAVLDQQWLTRPRSEGVHVDENWYEFG
DLAYDIRADTALAEACVTSSPLHGVHARGVNYDGFPIDTGTITACTLMGIGTDAFPLVVGSNNLYHSGEITEKLAALAVD
CAKDQNKRVAVVGVGGLSGSLFREEIDPREDRIANEEDDKWNRRVLKLIEAGDVSALREAMPVYAKEARVDMGFKHLHWI
LGALKGKFSGANVLGYGPSYGSGAAVIEFRL
>O33478 ~~~amnA~~~2-aminophenol 1,6-dioxygenase alpha subunit~~~
MTIVSAFLVPGSPLPHLRPDVKSWESFKVAMQNVGEKLRASKPDVVLIYSTQWFAVLDEIWLTRQRSLDIHVDENWHEFG
ELPYDIYSDVDLANACIESCRAAGVNARGADYESFPIDTGTIVACNALKVGTSDLPVVVASNNLYDDQAATERLAALAVA
CISEKGKRIAVIGVGGLSGSVFTTAIDPAEDRVVKAVEDDCNKNILSLMESGNIQALREALKSYSKEARAEMGFKHFHWL
LGALDGHFKGATVHHYGALYGSGAAVVEFSI
>Q6J1Z6 1.13.11.74~~~cnbCb~~~2-aminophenol 1,6-dioxygenase subunit beta~~~
MQGEIIAGFLAPHPPHLVYGENPPQNEPRSQGGWEVLRWAYERARERLDAMKPDVLLVHSPHWITSVGHHFLGVPELSGK
SVDPIFPNVFRYDFSLNVDVELAEACAEEGRKAGLVTKMMRNPKFRVDYGTITTLHLIRPQWDIPVVGISANNSPYYLNT
KEGMSEMDVLGKATREAIRKTGRKAVLLASNTLSHWHFHEEPTIPEDMSKEYPATMAGYQWDIRMIELMRQGKTSEVFKL
LPQFIDEAFAEVKSGAFTWMHAAMQYPELAAELFGYGTVIGTGNAVMEWDLRKAGLSMLGAADQKQRSAAVA
>O33477 1.13.11.74~~~amnB~~~2-aminophenol 1,6-dioxygenase beta subunit~~~
MANGEIISGFIAPHPPHLVYGENPPQNEPKSTGGWEQLRWAYERARASIEELKPDVLLVHSPHWITSVGHHFIGVDHLQG
RSVDPIFPNLFRFDYSINFDVELSEACCEEGRKAGLVTKMMRNPRFRPDYGTITTLHMIRPQWDIPVVSISANNTPYYLS
MEEGLGEMDVLGKATREAILKSGKRAVLLASNTLSHWHFHEEPVPPEDMSKEHPQTKIGYEWDMRMIELMRQGRMEEVFQ
LLPQFIEEAFAEVKSGAFTWMHAAMQYPNLPAELHGYGTVIGTGNAVVEWNLVKAGLARVAGKAA
>Q9KWS5 1.2.1.32~~~amnC~~~2-aminomuconic 6-semialdehyde dehydrogenase~~~
MKQYRNFVDGKWVESSKTFQDVTPIDGSVVAVVHEADRDLVDAAVKAGHRALEGEWGRTTAAQRVDWLRRIANEMERRQQ
DFLDAEMADTGKPLSMAATIDIPRGIANFRNFADILATAPVDSHRLDLPDGAYALNYAARKPLGVVGVISPWNLPLLLLT
WKVAPALACGNAVVVKPSEDTPGTATLLAEVMEAVGIPPGVFNLVHGFGPNSAGEFISQHPDISAITFTGESKTGSTIMR
AAAEGVKPVSFELGGKNAAVIFADCDFEKMLDGMMRALFLNSGQVCLCSERVYVERPIFDRFCVALAERIKALKVDWPHE
TDTQMGPLISSKHRDKVLSYFELARQEGATFLAGGGVPRFGDERDNGAWVEPTVIAGLSDDARVVREEIFGPICHVTPFD
SESEVIRRANDTRYGLAATIWTTNLSRAHRVSELMRVGISWVNTWFLRDLRTPFGGAGLSGIGREGGMHSLNFYSELTNV
CVRIDKESPDV
>Q9KWS2 3.5.99.5~~~amnD~~~2-aminomuconate deaminase~~~
MVSKADNSAKLVEGKAKPMGSFPHVKRAGDFLFVSGTSSRRPDNTFVGAEPDDTGRPRPNIELQTREVISNIRDILQSVG
ADLGDVVEVCSYLVNMNDFAAYNKVYAEFFDATGPARTTVAVHQLPHPQLVIEIKVVAYKPL
>Q9KWS3 4.1.1.77~~~amnE~~~4-oxalocrotonate decarboxylase~~~
MKISRIAQRLDEAAVSGKATPQLTGDDAVTVREAAEIQRLLIAHRIERGARQVGLKMGFTSRAKMAQMGVSDLIWGRLTS
DMWVEEGGEIDLAHYVHPRVEPEICYLLGKRLEGNVTPLEALAAVEAVAPAMEIIDSRYRDFKFSLPDVIADNASSSGFV
VGAWHKPETDVSNLGMVMSFDGRAVELGTSAAILGSPIRALVAAARLAAQQGEALEAGSLILAGAATAAVALRPGISVRC
EVQNLGSLSFSTTGER
>Q9KWS4 4.2.1.80~~~amnF~~~2-oxopent-4-enoate hydratase~~~
MSEQNAKLAALLNEAELSEKPIEPVRGHIEGGIAQAYAIQQINVQRQLAAGRRVTGRKIGLTSAAVQKQLGVDQPDFGTL
FDSMAVNDGEEIAWSRTLQPKCEAEVALVIERDLDHENITLIDLIGATAYALPAIEVVGSRIANWDINILDTVADNASAG
LYVLGHTPVKLEGLDLRLAGMVMERAGQQVSLGVGAACLGHPLNAALWLARTLVKQGTPLKSGDVVLSGALGPLVAANPG
DVFEARIQGLGSVRACFSPAS
>P0AE12 3.2.2.4~~~amn~~~AMP nucleosidase~~~COG0775
MNNKGSGLTPAQALDKLDALYEQSVVALRNAIGNYITSGELPDENARKQGLFVYPSLTVTWDGSTTNPPKTRAFGRFTHA
GSYTTTITRPTLFRSYLNEQLTLLYQDYGAHISVQPSQHEIPYPYVIDGSELTLDRSMSAGLTRYFPTTELAQIGDETAD
GIYHPTEFSPLSHFDARRVDFSLARLRHYTGTPVEHFQPFVLFTNYTRYVDEFVRWGCSQILDPDSPYIALSCAGGNWIT
AETEAPEEAISDLAWKKHQMPAWHLITADGQGITLVNIGVGPSNAKTICDHLAVLRPDVWLMIGHCGGLRESQAIGDYVL
AHAYLRDDHVLDAVLPPDIPIPSIAEVQRALYDATKLVSGRPGEEVKQRLRTGTVVTTDDRNWELRYSASALRFNLSRAV
AIDMESATIAAQGYRFRVPYGTLLCVSDKPLHGEIKLPGQANRFYEGAISEHLQIGIRAIDLLRAEGDRLHSRKLRTFNE
PPFR
>Q07121 1.4.3.21~~~maoI~~~Primary amine oxidase~~~
MTLNAESEALVGVSHPLDPLSRVEIARAVAILKEGPAAAESFRFISVELREPSKDDLRAGVAVAREADAVLVDRAQARSF
EAVVDLEAGTVDSWKLLAENIQPPFMLDEFAECEDACRKDPEVIAALAKRGLTNLDLVCFEPWSVGYFGEDNEGRRLMRA
LVFVRDEADDSPYAHPIENFIVFYDLNAGKVVRLEDDQAIPVPSARGNYLPKYVGEARTDLKPLNITQPEGASFTVTGNH
VTWADWSFRVGFTPREGLVLHQLKFKDQGVDRPVINRASLSEMVVPYGDTAPVQAKKNAFDSGEYNIGNMANSLTLGCDC
LGEIKYFDGHSVDSHGNPWTIENAICMHEEDDSILWKHFDFREGTAETRRSRKLVISFIATVANYEYAFYWHLFLDGSIE
FLVKATGILSTAGQLPGEKNPYGQSLNNDGLYAPIHQHMFNVRMDFELDGVKNAVYEVDMEYPEHNPTGTAFMAVDRLLE
TEQKAIRKTNEAKHRFWKIANHESKNLVNEPVAYRLIPTNGIQLAARDDAYVSKRAQFARNNLWVTAYDRTERFAAGEYP
NQATGADDGLHIWTQKDRNIVDTDLVVWYTFGMHHVVRLEDWPVMPRQNIGFMLEPHGFFNQNPTLNLPTSTSTTQTGEA
DTCCHNGK
>Q07123 1.4.3.21~~~maoII~~~Copper methylamine oxidase~~~
MTLNAESEALVGVSHPLDPLSRVEIARAVAILKEGPAAAESFRFISVELREPSKDDLRAGVAVAREADAVLVDRAQARSF
EAVVDLEAGTVDSWKLLAENIQPPFMLDEFAECEDACRKDPEVIAALAKRGLTNLDLVCFEPWSVGYFGEDNEGRRLMRA
LVFVRDEADDSPYAHPIENFIVFYDLNAGKVVRLEDDQAIPVPSARGNYLPKYVGEARTDLKPLNITQPEGASFTVTGNH
VTWADWSFRVGFTPREGLVLHQLKFKDQGVDRPVINRASLSEMVVPYGDTAPVQAKKNAFDSGEYNIGNMANSLTLGCDC
LGEIKYFDGHSVDSHGNPWTIENAICMHEEDDSILWKHFDFREGTAETRRSRKLVISFIATVANYEYAFYWHLFLDGSIE
FLVKATGILSTAGQLPGEKNPYGQSLNNDGLYAPIHQHMFNVRMDFELDGVKNAVYEVDMEYPEHNPTGTAFMAVDRLLE
TEQKAIRKTNEAKHRFWKIANHESKNLVNEPVAYRLIPTNGIQLAARDDAYVSKRAQFARNNLWVTAYDRTERFAAGEYP
NQATGADDGLHIWTQKDRNIVDTDLVVWYTFGMHHVVRLEDWPVMPRQNIGFMLEPHGFFNQNPTLNLPTSTSTTQTGEA
DTCCHTDK
>Q04507 1.14.99.39~~~amoA1~~~Ammonia monooxygenase alpha subunit~~~
MSIFRTEEILKAAKMPPEAVHMSRLIDAVYFPILIILLVGTYHMHFMLLAGDWDFWMDWKDRQWWPVVTPIVGITYCSAI
MYYLWVNYRQPFGATLCVVCLLIGEWLTRYWGFYWWSHYPINFVTPGIMLPGALMLDFTLYLTRNWLVTALVGGGFFGLL
FYPGNWPIFGPTHLPIVVEGTLLSMADYMGHLYVRTGTPEYVRHIEQGSLRTFGGHTTVIAAFFSAFVSMLMFTVWWYLG
KVYCTAFFYVKGKRGRIVHRNDVTAFGEEGFPEGIK
>Q04508 1.14.99.39~~~amoB1~~~Ammonia monooxygenase beta subunit~~~
MGIKNLYKRGVMGLYGVAYAVAALAMTVTLDVSTVAAHGERSQEPFLRMRTVQWYDIKWGPEVTKVNENAKITGKFHLAE
DWPRAAAQPDFSFFNVGSPSPVFVRLSTKINGHPWFISGPLQIGRDYEFEVNLRARIPGRHHMHAMLNVKDAGPIAGPGA
WMNITGSWDDFTNPLKLLTGETIDSETFNLSNGIFWHVVWMSIGIFWIGVFTARPMFLPRSRVLLAYGDDLLMDPMDKKI
TWVLAILTLALVWGGYRYTENKHPYTVPIQAGQSKVAALPVAPNPVSIVITDANYDVPGRALRVTMEVTNNGDIPVTFGE
FTTAGIRFINSTGRKYLDPQYPRELIAVGLNFDDESAIQPGQTKELKMEAKDALWEIQRLMALLGDPESRFGGLLMSWDA
EGNRHINSIAGPVIPVFTKL
>Q59118 1.4.3.22~~~~~~Histamine oxidase~~~
MTLQTTPSTPLVQDPPVPATLVHAAAQHPLEQLSAEEIHEARRILAEAGLVGESTRFAYLGLIEPPKTTRQGDVTGAARL
VRAMLWDAAQSRSLDVRLSLATGLVVDRRELNPEADGQLPVLLEEFGIIEDILSEDPQWNAALTARGLTPAQVRVAPLSA
GVFEYGNEEGKRLLRGLGFRQDHPADHPWAHPIDGLVAFVDVENRRVNHLIDDGPVPVPEVNGNYTDPAIRGELRTDLLP
IEIMQPEGPSFTLEGNHLSWAGWDLRVGFDAREGLVLHQLHHSHKGRRRPVIHRASISEMVVPYGDPSPYRSWQNYFDSG
EYLVGRDANSLRLGCDCLGDITYMSPVVADDFGNPRTIENGICIHEEDAGILWKHTDEWAGSDEVRRNRRLVVSFFTTVG
NYDYGFYWYLYLDGTIEFEAKATGIVFTAALPDKDYAYASEIAPGLGAPYHQHLFSARLDMMIDGDANRVEELDLVRLPK
GPGNPHGNAFTQKRTLLARESEAVRDADGAKGRVWHISNPDSLNHLGHPVGYTLYPEGNPTLAMADDSSIASRAAFARHH
LWVTRHAEEELYAAGDFVNQHPGGAVLPAYVAQDRDIDGQDLVVWHSFGLTHFPRPEDWPIMPVDTTGFTLKPHGFFDEN
PTLNVPSSAAGHCGTGSEREHAAPGGTAVGHSGPDTGGQGHCGH
>P46883 1.4.3.21~~~tynA~~~Primary amine oxidase~~~COG3733
MGSPSLYSARKTTLALAVALSFAWQAPVFAHGGEAHMVPMDKTLKEFGADVQWDDYAQLFTLIKDGAYVKVKPGAQTAIV
NGQPLALQVPVVMKDNKAWVSDTFINDVFQSGLDQTFQVEKRPHPLNALTADEIKQAVEIVKASADFKPNTRFTEISLLP
PDKEAVWAFALENKPVDQPRKADVIMLDGKHIIEAVVDLQNNKLLSWQPIKDAHGMVLLDDFASVQNIINNSEEFAAAVK
KRGITDAKKVITTPLTVGYFDGKDGLKQDARLLKVISYLDVGDGNYWAHPIENLVAVVDLEQKKIVKIEEGPVVPVPMTA
RPFDGRDRVAPAVKPMQIIEPEGKNYTITGDMIHWRNWDFHLSMNSRVGPMISTVTYNDNGTKRKVMYEGSLGGMIVPYG
DPDIGWYFKAYLDSGDYGMGTLTSPIARGKDAPSNAVLLNETIADYTGVPMEIPRAIAVFERYAGPEYKHQEMGQPNVST
ERRELVVRWISTVGNYDYIFDWIFHENGTIGIDAGATGIEAVKGVKAKTMHDETAKDDTRYGTLIDHNIVGTTHQHIYNF
RLDLDVDGENNSLVAMDPVVKPNTAGGPRTSTMQVNQYNIGNEQDAAQKFDPGTIRLLSNPNKENRMGNPVSYQIIPYAG
GTHPVAKGAQFAPDEWIYHRLSFMDKQLWVTRYHPGERFPEGKYPNRSTHDTGLGQYSKDNESLDNTDAVVWMTTGTTHV
ARAEEWPIMPTEWVHTLLKPWNFFDETPTLGALKKDK
>P49250 1.4.3.21~~~maoA~~~Primary amine oxidase~~~
MANGLKFSPRKTALALAVAVVCAWQSPVFAHGSEAHMVPLDKTLQEFGADVQWDDYAQMFTLIKDGAYVKVKPGAKTAIV
NGKSLDLPVPVVMKEGKAWVSDTFINDVFQSGLDQTFQVEKRPHPLNSLSAAEISKAVTIVKAAPEFQPNTRFTEISLHE
PDKAAVWAFALQGTPVDAPRTADVVMLDGKHVIEAVVDLQNKKILSWTPIKGAHGMVLLDDFVSVQNIINTSSEFAEVLK
KHGITDPGKVVTTPLTVGFFDGKDGLQQDARLLKVVSYLDTGDGNYWAHPIENLVAVVDLEAKKIIKIEEGPVIPVPMEP
RPYDGRDRNAPAVKPLEITEPEGKNYTITGDTIHWQNWDFHLRLNSRVGPILSTVTYNDNGTKRQVMYEGSLGGMIVPYG
DPDVGWYFKAYLDSGDYGMGTLTSPIVRGKDAPSNAVLLDETIADYTGKPTTIPGAVAIFERYAGPEYKHLEMGKPNVST
ERRELVVRWISTVGNYDYIFDWVFHDNGTIGIDAGATGIEAVKGVLAKTMHDPSAKEDTRYGTLIDHNIVGTTHQHIYNF
RLDLDVDGENNTLVAMDPEVKPNTAGGPRTSTMQVNQYTIDSEQKAAQKFDPGTIRLLSNTSKENRMGNPVSYQIIPYAG
GTHPAATGAKFAPDEWIYHRLSFMDKQLWVTRYHPTERYPEGKYPNRSAHDTGLGQYAKDDESLTNHDDVVWITTGTTHV
ARAEEWPIMPTEWALALLKPWNFFDETPTLGEKKK
>P80695 1.4.3.21~~~maoA~~~Primary amine oxidase~~~
MAILSPRKTALALAVALSCAWQSPAFAHGGEAHMVPMDKTLQDFGADVQWDDYAQMFTLIKDGAYVKVKPGAKTAIVNGK
TLELQVPVVMKDGKAWVSDTFINDVFQSGLDQTFQVEKRPHPLNSLSAAEISAAVAIVKAAADFKPNTRFTEISLREPDK
KAVWDFALNGTPVNAPRAADVIMLDGKHVIEAVVDLQNKKVLSWTPIKDAHGMVLLDDFASVQNIINASSEFAEVLKKHG
IDDPSKVITTPLTVGYFDGKDGLKQDARLLKVVSYLDVGDGNYWAHPIENLVAVVDLEQKKIIKIEEGPTIPVPMAARPY
DGRDRVAPKIKPLDIIEPEGKNYTITGDMIHWQNWDFHLRMNSRVGPILSTVTYNDNGKKRQVMYEGSLGGMIVPYGDPD
VGWYFKAYLDSGDYGMGTLTSPIVRGKDAPSNAVLLDETIADYTGTPTTIPRAIAIFERYAGPEYKHQEMGKPNVSTERR
ELVVRWISTVGNYDYIFDWVFHENGTIGIDAGATGIEAVKGVQAKTMHDPSAKEDTRYGTLIDHNIVGTTHQHIYNFRLD
LDVDGENNTLVAMDPEVKPNTAGGPRTSTMQINQYTIDSEQKAAQKFDPGTIRLLSNITKENRMGNPVSYQIIPYAGGTH
PVATGAKFAPDEWIYHRLSFMDKQLWVTRYHPTERFPEGKYPNRSIHDTGLGQYAKDDESLDNHDDVVWITTGTTHVARA
EEWPIMPTEWAHALLKPWNFFDETPTLGEKKE
>B3A0L3 ~~~~~~Antimicrobial peptide EP-20~~~
EGPVGLADPDGPASAPLGAP
>P24828 3.4.11.-~~~~~~Aminopeptidase 2~~~
MNRWEKELDKYAELAVKVGVNIQPGQTLFVNAPLEAAPLVRKIAKTAYETGAKHVYFEWNDEALTYIKFHHAPEEAFSEY
PMLRARAMEELAEQGAAFLSIHAPNPDLLKDVDPKRIATANKTAAQALANYRSAIMADRNCWSLISVPTPAWAQKVFGDL
RDEEAIDKLWEAIFRITRIDQDDPIAAWREHNDRLARIVDYLNNKQYKQLVYEAPGPIFTVELVDGHVWHGGAATSQSGV
RFNPNIPTEEVFTMPHKDGVNGTVRNTKPLNYNGNVIDGFTLTFKDGQVVDFSAEQGYETLKHLLDTDDGARRLGEVALV
PHQSPVSLSNLIFYNTLFDENAACHLALGKAYPTNIENGASLSKEELDRRGVNDSLVHVDFMIGSADLNIDGVTKDGKRE
PIFRSGNWAFELA
>Q2GIB5 ~~~ampA~~~SUMOylated effector protein AmpA~~~
MYGIDIELSDYRIGSETISSGDDGYYEGCACDKDASTNAYSYDKCRVVRGTWRPSELVLYVGDEHVACRDVASGMHHGNL
PGKVYFIEAEAGRAATAEGGVYTTVVEALSLVQEEEGTGMYLINAPEKAVVRFFKIEKSAAEEPQTVDPSVVESATGSGV
DTQEEQEIDQEAPAIEEVETEEQEVILEEGTLIDLEQPVAQVPVVAEAELPGVEAAEAIVPSLEENKLQEVVVAPEAQQL
ESAPEVSAPAQPESTVLGVAEGDLKSEVSVEANADVAQKEVISGQQEQEIAEALEGTEAPVEVKEETEVLLKEDTLIDLE
QPVAQVPVVAEAELPGVEAAEAIVPSLEENKLQEVVVAPEAQQLESAPEVSAPAQPESTVLGVTEGDLKSEVSVEADAGM
QQEAGISDQETQATEEVEKVEVSVETKTEEPEVILEEGTLIDLEQPVAQVPVVAEAELPGVEAAEAIVPSLEENKLQEVV
VAPEAQQLESAPEVSAPVQPESTVLGVTEGDLKSEVSVEADAGMQQEAGISDQETQATEEVEKVEVSVEADAGMQQELVD
VPTALPLKDPDDEDVLSY
>O84049 3.4.11.1~~~pepA~~~Probable cytosol aminopeptidase~~~
MVLLYSQASWDKRSKADALVLPFWMKNSKAQEAAVVDEDYKLVYQNALSNFSGKKGETAFLFGNDHTKEQKIVLLGLGKS
EEVSGTTVLEAYAQATTVLRKAKCKTVNILLPTISQLRFSVEEFLTNLAAGVLSLNYNYPTYHKVDTSLPFLEKVTVMGI
VSKVGDKIFRKEESLFEGVYLTRDLVNTNADEVTPEKLAAVAKDLAGEFASLDVKILDRKAILKEKMGLLAAVAKGAAVE
PRFIVLDYQGKPKSKDRTVLIGKGVTFDSGGLDLKPGKAMITMKEDMAGAATVLGIFSALASLELPINVTGIIPATENAI
GSAAYKMGDVYVGMTGLSVEIGSTDAEGRLILADAISYALKYCNPTRIIDFATLTGAMVVSLGESVAGFFANNDVLARDL
AEASSETGEALWRMPLVEKYDQALHSDIADMKNIGSNRAGSITAALFLQRFLEDNPVAWAHLDIAGTAYHEKEELPYPKY
ATGFGVRCLIHYMEKFLSK
>P68767 3.4.11.1~~~pepA~~~Cytosol aminopeptidase~~~COG0260
MEFSVKSGSPEKQRSACIVVGVFEPRRLSPIAEQLDKISDGYISALLRRGELEGKPGQTLLLHHVPNVLSERILLIGCGK
ERELDERQYKQVIQKTINTLNDTGSMEAVCFLTELHVKGRNNYWKVRQAVETAKETLYSFDQLKTNKSEPRRPLRKMVFN
VPTRRELTSGERAIQHGLAIAAGIKAAKDLGNMPPNICNAAYLASQARQLADSYSKNVITRVIGEQQMKELGMHSYLAVG
QGSQNESLMSVIEYKGNASEDARPIVLVGKGLTFDSGGISIKPSEGMDEMKYDMCGAAAVYGVMRMVAELQLPINVIGVL
AGCENMPGGRAYRPGDVLTTMSGQTVEVLNTDAEGRLVLCDVLTYVERFEPEAVIDVATLTGACVIALGHHITGLMANHN
PLAHELIAASEQSGDRAWRLPLGDEYQEQLESNFADMANIGGRPGGAITAGCFLSRFTRKYNWAHLDIAGTAWRSGKAKG
ATGRPVALLAQFLLNRAGFNGEE
>O25294 3.4.11.1~~~pepA~~~Cytosol aminopeptidase~~~COG0260
MLKIKLEKTTFENAKAECSLVFIINKDFSHAWVKNKELLETFKYEGEGVFLDQENKILYAGVKEDDVHLLRESACLAVRT
LKKLAFKSVKVGVYTCGAHSKDNALLENLKALFLGLKLGLYEYDTFKSNKKESVLKEAIVALELHKPCEKTCANSLEKSA
KEALKYAEIMTESLNIVKDLVNTPPMIGTPVYMAEVAQKVAKENHLEIHVHDEKFLEEKKMNAFLAVNKASLSVNPPRLI
HLVYKPKKAKKKIALVGKGLTYDCGGLSLKPADYMVTMKADKGGGSAVIGLLNALAKLGVEAEVHGIIGATENMIGPAAY
KPDDILISKEGKSIEVRNTDAEGRLVLADCLSYAQDLNPDVIVDFATLTGACVVGLGEFTSAIMGHNEELKNLFETSGLE
SGELLAKLPFNRHLKKLIESKIADVCNISSSRYGGAITAGLFLNEFIRDEFKDKWLHIDIAGPAYVEKEWDVNSFGASGA
GVRACTAFVEELLKKA
>P9WHT3 3.4.11.1~~~pepA~~~Probable cytosol aminopeptidase~~~COG0260
MTTEPGYLSPSVAVATSMPKRGVGAAVLIVPVVSTGEEDRPGAVVASAEPFLRADTVAEIEAGLRALDATGASDQVHRLA
VPSLPVGSVLTVGLGKPRREWPADTIRCAAGVAARALNSSEAVITTLAELPGDGICSATVEGLILGSYRFSAFRSDKTAP
KDAGLRKITVLCCAKDAKKRALHGAAVATAVATARDLVNTPPSHLFPAEFAKRAKTLSESVGLDVEVIDEKALKKAGYGG
VIGVGQGSSRPPRLVRLIHRGSRLAKNPQKAKKVALVGKGITFDTGGISIKPAASMHHMTSDMGGAAAVIATVTLAARLR
LPIDVIATVPMAENMPSATAQRPGDVLTQYGGTTVEVLNTDAEGRLILADAIVRACEDKPDYLIETSTLTGAQTVALGTR
IPGVMGSDEFRDRVAAISQRVGENGWPMPLPDDLKDDLKSTVADLANVSGQRFAGMLVAGVFLREFVAESVDWAHIDVAG
PAYNTGSAWGYTPKGATGVPTRTMFAVLEDIAKNG
>O86436 3.4.11.1~~~pepA~~~Cytosol aminopeptidase~~~COG0260
MELVVKSVAAASVKTATLVIPVGENRKLGAVAKAVDLASEGAISAVLKRGDLAGKPGQTLLLQNLQGLKAERVLLVGSGK
DEALGDRTWRKLVASVAGVLKGLNGADAVLALDDVAVNNRDAHYGKYRLLAETLLDGEYVFDRFKSQKVEPRALKKVTLL
ADKAGQAEVERAVKHASAIATGMAFTRDLGNLPPNLCHPSFLAEQAKELGKAHKALKVEVLDEKKIKDLGMGAFYAVGQG
SDQPPRLIVLNYQGGKKADKPFVLVGKGITFDTGGISLKPGAGMDEMKYDMCGAASVFGTLRAVLELQLPVNLVCLLACA
ENMPSGGATRPGDIVTTMSGQTVEILNTDAEGRLVLCDTLTYAERFKPQAVIDIATLTGACIVALGSHTTGLMGNNDDLV
GQLLDAGKRADDRAWQLPLFDEYQEQLDSPFADMGNIGGPKAGTITAGCFLSRFAKAYNWAHMDIAGTAWISGGKDKGAT
GRPVPLLTQYLLDRAGA
>Q5H4N2 3.4.11.1~~~pepA~~~Probable cytosol aminopeptidase~~~
MALQFTLNQDAPASAAVDCIVVGAFADKTLSPAAQALDSASQGRLTALLARGDVAGKTGSTTLLHDLPGVAAPRVLVVGL
GDAGKFGVAPYLKAIGDATRALKTGAVGTALLTLTELTVKARDAAWNIRQAVTVSDHAAYRYTATLGKKKVDETGLTTLA
IAGDDARALAVGVATAEGVEFARELGNLPPNYCTPAYLADTAAAFAGKFPGAEAEILDEAQMEALGMGSLLSVARGSANR
PRLIVLKWNGGGDARPYVLVGKGITFDTGGVNLKTQGGIEEMKYDMCGGATVIGTFVATVKAELPINLVVVVPAVENAID
GNAYRPSDVITSMSGKTIEVGNTDAEGRLILCDALTYAERFNPEALVDVATLTGACMVALGHQTAGLMSKHDDLANELLA
AGEHVFDRAWRLPLWDEYQGLLDSTFADVYNIGGRWGGAITAGCFLSRFTENQRWAHLDIAGVASDEGKRGMATGRPVGL
LTQWLLDRAA
>P05193 3.5.2.6~~~ampC~~~Beta-lactamase~~~
MMKKSICCALLLTASFSTFAAAKTEQQIADIVNRTITPLMQEQAIPGMAVAIIYEGKPYYFTWGKADIANNHPVTQQTLF
ELGSVSKTFNGVLGGDRIARGEIKLSDPVTKYWPELTGKQWRGISLLHLATYTAGGLPLQIPGDVTDKAELLRFYQNWQP
QWTPGAKRLYANSSIGLFGALAVKSSGMSYEEAMTRRVLQPLKLAHTWITVPQSEQKNYAWGYLEGKPVHVSPGQLDAEA
YGVKSSVIDMARWVQANMDASHVQEKTLQQGIELAQSRYWRIGDMYQGLGWEMLNWPLKADSIINGSDSKVALAALPAVE
VNPPAPAVKASWVHKTGSTGGFGSYVAFVPEKNLGIVMLANKSYPNPARVEAAWRILEKLQ
>P00811 3.5.2.6~~~ampC~~~Beta-lactamase~~~COG1680
MFKTTLCALLITASCSTFAAPQQINDIVHRTITPLIEQQKIPGMAVAVIYQGKPYYFTWGYADIAKKQPVTQQTLFELGS
VSKTFTGVLGGDAIARGEIKLSDPTTKYWPELTAKQWNGITLLHLATYTAGGLPLQVPDEVKSSSDLLRFYQNWQPAWAP
GTQRLYANSSIGLFGALAVKPSGLSFEQAMQTRVFQPLKLNHTWINVPPAEEKNYAWGYREGKAVHVSPGALDAEAYGVK
STIEDMARWVQSNLKPLDINEKTLQQGIQLAQSRYWQTGDMYQGLGWEMLDWPVNPDSIINGSDNKIALAARPVKAITPP
TPAVRASWVHKTGATGGFGSYVAFIPEKELGIVMLANKNYPNPARVDAAWQILNALQ
>P05364 3.5.2.6~~~ampC~~~Beta-lactamase~~~
MMRKSLCCALLLGISCSALATPVSEKQLAEVVANTITPLMKAQSVPGMAVAVIYQGKPHYYTFGKADIAANKPVTPQTLF
ELGSISKTFTGVLGGDAIARGEISLDDAVTRYWPQLTGKQWQGIRMLDLATYTAGGLPLQVPDEVTDNASLLRFYQNWQP
QWKPGTTRLYANASIGLFGALAVKPSGMPYEQAMTTRVLKPLKLDHTWINVPKAEEAHYAWGYRDGKAVRVSPGMLDAQA
YGVKTNVQDMANWVMANMAPENVADASLKQGIALAQSRYWRIGSMYQGLGWEMLNWPVEANTVVEGSDSKVALAPLPVAE
VNPPAPPVKASWVHKTGSTGGFGSYVAFIPEKQIGIVMLANTSYPNPARVEAAYHILEALQ
>P24735 3.5.2.6~~~ampC~~~Beta-lactamase~~~
MRDTRFPCLCGIAASTLLFATTPAIAGEAPADRLKALVDAAVQPVMKANDIPGLAVAISLKGEPHYFSYGLASKEDGRRV
TPETLFEIGSVSKTFTATLAGYALTQDKMRLDDRASQHWPALQGSRFDGISLLDLATYTAGGLPLQFPDSVQKDQAQIRD
YYRQWQPTYAPGSQRLYSNPSIGLFGYLAARSLGQPFERLMEQQVFPALGLEQTHLDVPEAALAQYAQGYGKDDRPLRVG
PGPLDAEGYGVKTSAADLLRFVDANLHPERLDRPWAQALDATHRGYYKVGDMTQGLGWEAYDWPISLKRLQAGNSTPMAL
QPHRIARLPAPQALEGQRLLNKTGSTNGFGAYVAFVPGRDLGLVILANRNYPNAERVKIAYAILSGLEQQGKVPLKR
>P85302 3.5.2.6~~~ampC~~~Beta-lactamase~~~
ATDIRQVVDSTVEPLMQQQDIAGLSVAVIQNGKAQYFNYGVANKDSKQPITENTLFEIGSVSKTFTATLAGYALANGKLK
LSDPASQYLPALRGDKFDHISLLNLGTYTAGGLPLQFPEESDNTGKMISYYQHWKPAFAPGTQRLYSNPSIGLFGHLAAQ
SLGQPFEKLMEQTVLPKLGLKHTFISVPETQMSLYAQGYDKAGKPVRVSPGALDAEAYGIKTSTSDLIHYVEVNMHPAKL
EKPLQQAIAATHTGYYTVDGMTQGLGWEMYPYPIKVDALVEGNSTQMAMEPHKVNWLTPPQAAPLDTLVNKTGSTGGFGA
YVAYVPSKGLGVVILANKNYPNAERVKAAHAILSAMDQ
>O05465 3.5.2.6~~~ampC~~~Beta-lactamase~~~
MKLFTSTLTAKKSSTHKPLISLALSVLISTLLISETAQAADANDRLEQEVDKQAKQLMAQYQIPGMAFGIIVDGKSHFYN
YGLADKQRNQPVSEDTIFELGSVSKTFAATLASYSELNGTLSLDDTADKYIPYLKNSAIGNTKLISLVTYSAGGYHYRCL
KTLENNKELLQYYKSWHPDFPVNSKRLYSNASIGLFGYISALSMHSDYTKLIENTVLPSLKMTNTFVDVPANKMEDYAFG
YNAAGEPIRVNPGMLDAEAYGIKSTSADMTRFMAANMGLVTVDSQMQQALDNNRKGYYRTKSFTQGLAWEMYPLPTTLQQ
LVEGNSTETILQPQPIQLNEPPTPVLNDVWVNKTGATNGFGAYIAYMPAKKTGMFILANKNYPNTERVKAAYTILDSVMN
N
>P82974 3.5.1.28~~~ampD~~~1,6-anhydro-N-acetylmuramyl-L-alanine amidase AmpD~~~
MLLDEGWLAEARRVPSPHYDCRPDDENPSLLVVHNISLPPGEFGGPWIDALFTGTIDPNAHPYFAGIAHLRVSAHCLIRR
DGEIVQYVPFDKRAWHAGVSSYQGRERCNDFSIGIELEGTDTLAYTDAQYQQLAAVTNALITRYPAIANNMTGHCNIAPE
RKTDPGPSFDWARFRALVTPSSHKEMT
>P13016 3.5.1.28~~~ampD~~~1,6-anhydro-N-acetylmuramyl-L-alanine amidase AmpD~~~COG3023
MLLEQGWLVGARRVPSPHYDCRPDDETPTLLVVHNISLPPGEFGGPWIDALFTGTIDPQAHPFFAEIAHLRVSAHCLIRR
DGEIVQYVPFDKRAWHAGVSQYQGRERCNDFSIGIELEGTDTLAYTDAQYQQLAAVTRALIDCYPDIAKNMTGHCDIAPD
RKTDPGPAFDWARFRVLVSKETT
>O25681 3.4.11.-~~~~~~Aminopeptidase HP_1037~~~COG0006
MKGLERESHFTLNENAMFFECAYSCDNALFLQLDDRSFFITDSRYTQEAKESVQPKNGVLAEVVESSDLVQSAIDLIVKS
SVKKLFFDPNQVNLQTYKRLNSALGDKVALEGVPSYHRQKRIIKNEHEIQLLKKSQALNVEAFENFAEYVKKIFDEKESL
SERYLQHKVKDFLTREGVYDLSFEPILALNANASKPHALPSAKDFLKAEHSILLDMGIKYERYCSDRTRTAFFDPKDFVF
KREQSFKDKERQKIYDIVKEAQEKAISGIRAGMTGKEADSLARGVISDYGYGQYFTHSTGHGIGLDIHELPYISSRSETI
LEEGMVFSVEPGIYIPGFFGVRIEDLVVIKNSRSELL
>P0AE14 ~~~ampE~~~Protein AmpE~~~COG3725
MTLFTTLLVLIFERLFKLGEHWQLDHRLEAFFRRVKHFSLGRTLGMTIIAMGVTFLLLRALQGVLFNVPTLLVWLLIGLL
CIGAGKVRLHYHAYLTAASRNDSHARATMAGELTMIHGVPAGCDEREYLRELQNALLWINFRFYLAPLFWLIVGGTWGPV
TLMGYAFLRAWQYWLARYQTPHHRLQSGIDAVLHVLDWVPVRLAGVVYALIGHGEKALPAWFASLGDFHTSQYQVLTRLA
QFSLAREPHVDKVETPKAAVSMAKKTSFVVVVVIALLTIYGALV
>P0AE16 ~~~ampG~~~Anhydromuropeptide permease~~~COG2223
MSSQYLRIFQQPRSAILLILGFASGLPLALTSGTLQAWMTVENIDLKTIGFFSLVGQAYVFKFLWSPLMDRYTPPFFGRR
RGWLLATQILLLVAIAAMGFLEPGTQLRWMAALAVVIAFCSASQDIVFDAWKTDVLPAEERGAGAAISVLGYRLGMLVSG
GLALWLADKWLGWQGMYWLMAALLIPCIIATLLAPEPTDTIPVPKTLEQAVVAPLRDFFGRNNAWLILLLIVLYKLGDAF
AMSLTTTFLIRGVGFDAGEVGVVNKTLGLLATIVGALYGGILMQRLSLFRALLIFGILQGASNAGYWLLSITDKHLYSMG
AAVFFENLCGGMGTSAFVALLMTLCNKSFSATQFALLSALSAVGRVYVGPVAGWFVEAHGWSTFYLFSVAAAVPGLILLL
VCRQTLEYTRVNDNFISRTAYPAGYAFAMWTLAAGVSLLAVWLLLLTMDALDLTHFSFLPALLEVGVLVALSGVVLGGLL
DYLALRKTHLT
>P0AD70 3.4.-.-~~~ampH~~~D-alanyl-D-alanine-carboxypeptidase/endopeptidase AmpH~~~COG1680
MKRSLLFSAVLCAASLTSVHAAQPITEPEFASDIVDRYADHIFYGSGATGMALVVIDGNQRVFRSYGETRPGNNVRPQLD
SVVRIASLTKLMTSEMLVKLLDQGTVKLNDPLSKYAPPGARVPTYNGTPITLVNLATHTSALPREQPGGAAHRPVFVWPT
REQRWKYLSTAKLKAAPGSQAAYSNLAFDLLADALANASGKPYTQLFEEQITRPLGMKDTTYTPSPDQCRRLMVAERGAS
PCNNTLAAIGSGGVYSTPGDMMRWMQQYLSSDFYQRSNQADRMQTLIYQRAQFTKVIGMDVPGKADALGLGWVYMAPKEG
RPGIIQKTGGGGGFITYMAMIPQKNIGAFVVVTRSPLTRFKNMSDGINDLVTELSGNKPLVIPAS
>P04825 3.4.11.2~~~pepN~~~Aminopeptidase N~~~COG0308
MTQQPQAKYRHDYRAPDYQITDIDLTFDLDAQKTVVTAVSQAVRHGASDAPLRLNGEDLKLVSVHINDEPWTAWKEEEGA
LVISNLPERFTLKIINEISPAANTALEGLYQSGDALCTQCEAEGFRHITYYLDRPDVLARFTTKIIADKIKYPFLLSNGN
RVAQGELENGRHWVQWQDPFPKPCYLFALVAGDFDVLRDTFTTRSGREVALELYVDRGNLDRAPWAMTSLKNSMKWDEER
FGLEYDLDIYMIVAVDFFNMGAMENKGLNIFNSKYVLARTDTATDKDYLDIERVIGHEYFHNWTGNRVTCRDWFQLSLKE
GLTVFRDQEFSSDLGSRAVNRINNVRTMRGLQFAEDASPMAHPIRPDMVIEMNNFYTLTVYEKGAEVIRMIHTLLGEENF
QKGMQLYFERHDGSAATCDDFVQAMEDASNVDLSHFRRWYSQSGTPIVTVKDDYNPETEQYTLTISQRTPATPDQAEKQP
LHIPFAIELYDNEGKVIPLQKGGHPVNSVLNVTQAEQTFVFDNVYFQPVPALLCEFSAPVKLEYKWSDQQLTFLMRHARN
DFSRWDAAQSLLATYIKLNVARHQQGQPLSLPVHVADAFRAVLLDEKIDPALAAEILTLPSVNEMAELFDIIDPIAIAEV
REALTRTLATELADELLAIYNANYQSEYRVEHEDIAKRTLRNACLRFLAFGETHLADVLVSKQFHEANNMTDALAALSAA
VAAQLPCRDALMQEYDDKWHQNGLVMDKWFILQATSPAANVLETVRGLLQHRSFTMSNPNRIRSLIGAFAGSNPAAFHAE
DGSGYLFLVEMLTDLNSRNPQVASRLIEPLIRLKRYDAKRQEKMRAALEQLKGLENLSGDLYEKITKALA
>P37896 3.4.11.2~~~pepN~~~Aminopeptidase N~~~
MAVKRFYETFHPDHYDLYIDVDRAARSFSGTSTIHGEIQEETVLVHQKYMTISKVTVDGKEVPFTFGDDFEGIKIEAGKT
GEAVIAIDYSAPLTDTMMGIYPSYYQVDGVKKELIGTQFETTFAREAFPCVDEPEAKATFSLALKFDEHEGETVLANMPE
DRVENGVHYFKETVRMSSYLVAFAFGEMRSLTTHTKSGVLIGVYSTQAHTEKELTFSLDIAKRAIEFYEDFYQTPYPLPQ
SLQLALPDFSAGAMENWGLVTYREAYLLLDPDNTTLEMKKLVATVVTHELAHQWFGDLVTMEWWDNLWLNESFANMMEYL
SVDHLEPNWHIWEMFQTSEAAAALTRDATDGVQSVHVEVNDPAEIDALFDGAIVYAKGSRMLVMVRSLLGDEALRKGLKR
YFDKHKFGNAAGDDLWDALSTATDLNIGEIMHTWLDQPGYPVVNAFVEDGHLKLTQKQFFIGEGKEVGRKWEIPLNANFK
APKIMSDVELDLGDYQALRAEAGHALRLNVGNNSHFIVKYDQTLMDDIMKEAKDLDPVSQLQLLQDLRLLAEGKQASYAD
VVPVLELFKNSESHIVNDALYTTADKLRQFAPAGSEADKNLRALYNDLSKDQVARLGWLPKAGESDEDIQTRPYVLSASL
YGRNADSEKQAHEIYVEYADKLAELSADIRPYVLINEVENYGSSELTDKLIGLYQATSDPSFKMDLEAAIVKSKDEGELK
KIVSWFKNAEIVKPQDLRGWFSGVLSNPAGEQLAWDWIRDEWAWLEKTVGGDMEFATFITVISRVFKTKERYDEYNAFFT
DKESNMLLNREIKMDRKVIANRVDLIASEQADVNAAVAAALQK
>P0C2T8 3.4.11.2~~~pepN~~~Aminopeptidase N~~~
MAVKRLIETFVPENYKIFLDIDRKTKKIKGQVAITGEAKDTVVAFHAKGLHFNKVRAFSVDTNFIENEEDEEIVVKIGET
GRVTVSFEYEAELTDNMMGIYPSYYEVNGEKKMLIGTQFESHFARQAFPSIDEPEAKATFDLSVKFDEEEGDIIVSNMPE
LLNINGIHVFERTVKMSSYLLAFVFGELQYKKGKTKSGVEVGAFATKAHSQAALDFPLDIAIRSIEFYEDYYQTPYPLPH
SWHIALPDFSSGAMENWGCITYREVCMLVDPENATIQSKQYVATVIAHELAHQWFGDLVTMQWWDDLWLNESFANNMEYV
CMDALEPSWNVWESFSISEANMALNRDATDGVQSVHVEVTHPDEIGTLFDPAIVYAKGSRLMVMLRKWLGDEDFAAGLAL
YFKRHQYGNTVGDNLWDALAEVSGKDVAAFMHSWVNQPGYPVVTAEVVDDTLILSQKQFFVGEGVDKGRLWNVPLNTNWT
GLPDLLSSEKVEIPGFAALKTKNNGKALFLNDANMAHYIIDYKGALLTDLLSEVESLENVTKFQILQDRKLLAKAGVISY
ADVVNILPSFTNEESYLVNTGLSQLISELELFVDEDSETEKAFQSLVGKLFAKNYARLGWDKVAGESAGDESLRGIVLSK
TLYSENADAKTKASQIFATHKENLASIPADIRPIVLNNEIKTTNSAELVKTYRETYIKTSLQEFKRELEGAVALIKDEKV
IAELLESFKNADIVKPQDIAFSWFYLLRNDFSQDAAWAWEKANWASLEEKLGGDMSYDKFVIYPGNTFKTADKLAEYKAF
FEPKLENQGLKRSIEMAIKQITARVALIDSQKAAVDKAITDIAEKL
>Q48656 3.4.11.2~~~pepN~~~Aminopeptidase N~~~
MTASVARFIESFIPENYNLFLDINRSEKTFTGNVAITGEAIDNHISLHQKDLTINSVLLDNESLNFQMDDANEAFHIELP
ETGVLTIFIEFSGRITDNMTGIYPSYYTYNGEKKEIISTQFEISHFAREAFPCVDEPEAKATFDLSLKFDAEEGDTALSN
MPEINSHLREETGVWTFETTPRMSTYLLAFGFGALHGKTAKTKNGTEVGVFATVAQAENSFDFALDIAVRVIEFYEDYFQ
VKYPIPLSYHLALPDFSAGAMENWGLVTYREVYLLVDENSSAASRQQVALVVAHELAHQWFGNLVTMKWWDDLWLNESFA
NMMEYVSVNAIEPSWNIFEGFPNKLGVPNALQRDATDGVQSVHMEVSHPDEINTLFDSAIVYAKGSRLMHMLRRWLGDEA
FAKGLKAYFEKHQYNNTVGRDLWNALSEASGKDVSSFMDTWLEQPGYPVVSAEVVDDTLILSQKQFFIGEHEDKGRLWEI
PLNTNWNGLPDTLSGERIEIPNYSQLATENNGVLRLNTANTAHYITDYQGQLLDNILEDFANLDTVSKLQILQERRLLAE
SGRISYASLVGLLDLVEKEESFFLISQAKSQILAGLKRFIDEDTEAEVHYKALVRRQFQNDFERLGFDAKEGESDEDEMV
RQTALSYLIEADYQPTVLAAANVFQAHKENIESIPASIRGLVLINQMKQENSLSLVEEYINAYVATNDSNFRRQLTQALS
YLKNQEGLDYVLGQLKDKNVVKPQDLYLWYMNFLSKSFAQETVWDWAKENWEWIKAALGGDMSFDSFVNIPAGIFKNQER
LDQYIAFFEPQTSDKALERNILMGIKTIAARVDLIEKEKAAVESALKDY
>A2RI32 3.4.11.2~~~pepN~~~Aminopeptidase N~~~COG0308
MAVKRLIETFVPENYKIFLDIDRKTKKIKGQVAITGEAKDTVVSFHTKGLHFNKVRAFSVDTNFIENEEDEEIVVKIGET
GRVTVSFEYEAELTDNMMGIYPSYYEVNGEKKMLIGTQFESHFARQAFPSIDEPEAKATFDLSVKFDEEEGDIIVSNMPE
LLNINGIHVFERTVKMSSYLLAFVFGELQYKKGKTKSGVEVGAFATKAHSQAALDFPLDIAIRSIEFYEDYYQTPYPLPH
SWHIALPDFSAGAMENWGCITYREVCMLVDPENATIQSKQYVATVIAHELAHQWFGDLVTMQWWDDLWLNESFANNMEYV
CMDALEPSWNVWESFSISEANMALNRDATDGVQSVHVEVTHPDEIGTLFDPAIVYAKGSRLMVMLRKWLGDEDFAAGLAL
YFKRHQYGNTVGDNLWDALAEVSGKDVAAFMHSWVNQPGYPVVTAEVVDDTLILSQKQFFVGEGVDKGRLWNVPLNTNWT
GLPDLLSSEKVEIPGFAALKTKNNGKALFLNDANMAHYIIDYKGALLTDLLSEVESLENVTKFQILQDRKLLAKAGVISY
ADVVNILPSFTNEESYLVNTGLSQLISELELFVDEDSETEKAFQSLVGKLFAKNYARLGWDKVAGESAGDESLRGIVLSK
TLYSENADAKTKASQIFAAHKENLASIPADIRPIVLNNEIKTTNSAELVKTYRETYIKTSLQEFKRELEGAVALIKDEKV
IAELLESFKNADIVKPQDIAFSWFYLLRNDFSQDAAWAWEKANWAFLEEKLGGDMSYDKFVIYPGNTFKTADKLAEYKAF
FEPKLENQGLKRSIEMAIKQITARVALIDSQKAAVDKAITDIAEKL
>Q11010 3.4.11.2~~~pepN~~~Aminopeptidase N~~~
MPGTNLTREEARQRATLLTVDSYEIDLDLTGAQEGGTYRSVTTVRFDVAEGGGESFIDLVAPTVHEVTLNGDALDTAEVF
QDSRIALPGLLPGRNILRVVADCAYTNTGEGLHRFVDPVDDQAYLYTQFEVPDARRVFASFEQPDLKATFQFTVKAPEGW
TVISNSPTPEPKDNVWEFEPTPRISSYVTALIVGPYHSVHSVYEKDGQSVPLGIYCRPSLAEHLDADAIFEVTRQGFDWF
QEKFDYAYPFKKYDQLFVPEFNAGAMENAGAVTIRDQYVFRSKVTDAAYEVRAATILHELAHMWFGDLVTMEWWNDLWLN
ESFATYAEAACQAAAPGSKWPHSWTTFANQMKTWAYRQDQLPSTHPIMADISDLDDVLVNFDGITYAKGASVLKQLVAYV
GEEAFFKGVQAYFKRHAFGNTRLSDLLGALEETSGRDLKTWSKAWLETAGINVLRPEIETDADGVITSFAIRQEAPALPA
GAKGEPTLRPHRIAIGAYDLDGAGKLVRGDRVELDVDGELTAVPQLVGKARPAVLLLNDDDLSYAKVRLDEQSLAVVTEH
LGDFTESLPRALCWASAWDMTRDAELATRDYLALVLSGIGKESDIGVVQSLHRQVKLAIDQYAAPTAREALLTRWTEATL
AHLRAAEAGSDHQLAWARAFAATARTPEQLDLLDALLDGTQTIEGLAVDTELRWAFVQRLAAVGRFGGSEIAAEYERDKT
AAGERHAATARAARPTEAAKAEAWESVVESDKLPNAVQEAVIAGFVQTDQRELLAAYTERYFEALKDVWASRSHEMAQQI
AVGLYPAVQVSQDTLDRTDAWLASAEPNAALRRLVSESRSGIERALRAQAADAAAAE
>P0A3Z2 3.4.11.9~~~pepPI~~~Xaa-Pro aminopeptidase 1~~~
MAEELTPENPAIPETPEETEEPIKQRKNGLYPGVSDELAENMQSGWADTELHDLEPIAQAAETAARRAALSARFPGERLV
IPAGNLKTRSNDTEYSFRASVEYAYLTGNQTEDGVLVMEPEGDGHAATIYLLPRSDRENGEFWLDGQGELWVGRRHSLAE
AGELYGIPASDVRELAGSLREATGPVRVVRGFDAGIEAALTDKVTAERDEELRVFLSEARLVKDEFEIGELQKAVDSTVR
GFEDVVKVLDRAEATSERYIEGTFFLRARVEGNDVGYGSICAAGPHACTLHWVRNDGPVRSGDLLLLDAGVETHTYYTAD
VTRTLPISGTYSELQKKIYDAVYDAQEAGIAAVRPGAKYRDFHDASQRVLAERLVEWGLVEGPVERVLELGLQRRWTLHG
TGHMLGMDVHDCAAARVESYVDGTLEPGMVLTVEPGLYFQADDLTVPEEYRGIGVRIEDDILVTADGNRNLSAGLPRRSD
EVEEWMAALKG
>P15034 3.4.11.9~~~pepP~~~Xaa-Pro aminopeptidase~~~COG0006
MSEISRQEFQRRRQALVEQMQPGSAALIFAAPEVTRSADSEYPYRQNSDFWYFTGFNEPEAVLVLIKSDDTHNHSVLFNR
VRDLTAEIWFGRRLGQDAAPEKLGVDRALAFSEINQQLYQLLNGLDVVYHAQGEYAYADVIVNSALEKLRKGSRQNLTAP
ATMIDWRPVVHEMRLFKSPEEIAVLRRAGEITAMAHTRAMEKCRPGMFEYHLEGEIHHEFNRHGARYPSYNTIVGSGENG
CILHYTENECEMRDGDLVLIDAGCEYKGYAGDITRTFPVNGKFTQAQREIYDIVLESLETSLRLYRPGTSILEVTGEVVR
IMVSGLVKLGILKGDVDELIAQNAHRPFFMHGLSHWLGLDVHDVGVYGQDRSRILEPGMVLTVEPGLYIAPDAEVPEQYR
GIGIRIEDDIVITETGNENLTASVVKKPEEIEALMVAARKQ
>P12529 ~~~ampR~~~HTH-type transcriptional activator AmpR~~~
MTRSYIPLNSLRAFEAAARHLSFTRAAIELNVTHSAISQHVKSLEQQLNCQLFVRGSRGLMLTTEGESLLPVLNDSFDRM
AGMLDRFATKQTQEKLKIGVVGTFAIGCLFPLLSDFKRSYPHIDLHISTHNNRVDPAAEGLDYTIRYGGGAWHDTDAQYL
CSALMSPLCSPTLASQIQTPADILKFPLLRSYRRDEWALWMQAAGEAPPSPTHNVMVFDSSVTMLEAAQGGMGVAIAPVR
MFTHLLSSERIVQPFLTQIDLGSYWITRLQSRPETPAMREFSRWLTGVLHK
>P24734 ~~~ampR~~~HTH-type transcriptional activator AmpR~~~
MVRPHLPLNALRAFEASARHLSFTRAAIELCVTQAAVSHQVKSLEERLGVALFKRLPRGLMLTHEGESLLPVLCDSFDRI
AGLLERFEGGHYRDVLTVGAVGTFTVGWLLPRLEDFQARHPFIDLRLSTHNNRVDIAAEGLDYAIRFGGGAWHGTEALAL
FEAPLTVLCCPEVAAQLHSPADLLQHTLLRSYRADEWPLWFQAAGLPAHAPLTRSIVFDTSLAMLEAARQGVGVALAPAA
MFARQLASESIRRPFATEVSTGSYWLTRLQSRGETSAMLAFRGWLLEMAAVEARGR
>P23341 3.4.11.-~~~~~~Aminopeptidase T~~~
MDAFTENLNKLAELAIRVGLNLEEGQEIVATAPIEAVDFVRLLAEKAYENGASLFTVLYGDNLIARKRLALVPEAHLDRA
PAWLYEGMAKAFHEGAARLAVSGNDPKALEGLPPERVGRAQQAQSRAYRPTLSAITEFVTNWTIVPFAHPGWAKAVFPGL
PEEEAVQRLWQAIFQATRVDQEDPVAAWEAHNRVLHAKVAFLNEKRFHALHFQGPGTDLTVGLAEGHLWQGGATPTKKGR
LCNPNLPTEEVFTAPHRERVEGVVRASRPLALSGQLVEGLWARFEGGVAVEVGAEKGEEVLKKLLDTDEGARRLGEVALV
PADNPIAKTGLVFFDTLFDENAASHIAFGQAYAENLEGRPSGEEFRRRGGNESMVHVDWMIGSEEVDVDGLLEDGTRVPL
MRRGRWVI
>P42778 3.4.11.-~~~~~~Aminopeptidase T~~~COG2309
MDAFKRNLEKLAELAIRVGLNLEKGQEVIATAPIEAVDFVRLLAEKAYREGASLFTVIYGDQELARKRLALAPEEGLDKA
PAWLYEGMARAFREGAARLAVSGSDPKALEGLPPEKVGRAQKANARAYKPALEAITEFVTNWTIVPFAHPGWARAVFPGL
PEEEAVRRLWEAIFQATRADQEDPIAAWEAHNRALHEKVAYLNARRFHALHFKGPGTDLVVGLAEGHLWQGGATATKGGR
LCNPNLPTEEVFTAPHRERVEGVVRASRPLALGGTLVEGIFARFERGFAVEVRAEKGEEVLRRLLDTDEGARRLGEVALV
PADNPIAKTGLVFFDTLFDENAASHIAFGQAYQENLEGRPSGEAFRKRGGNESLVHVDWMIGSEEMDVDGLYEDGTRTPL
MRRGRWVV
>Q01693 3.4.11.10~~~~~~Bacterial leucyl aminopeptidase~~~
MKYTKTLLAMVLSATFCQAYAEDKVWISIGADANQTVMKSGAESILPNSVASSGQVWVGQVDVAQLAELSHNMHEEHNRC
GGYMVHPSAQSAMAASAMPTTLASFVMPPITQQATVTAWLPQVDASQITGTISSLESFTNRFYTTTSGAQASDWIASEWQ
ALSASLPNASVKQVSHSGYNQKSVVMTITGSEAPDEWIVIGGHLDSTIGSHTNEQSVAPGADDDASGIAAVTEVIRVLSE
NNFQPKRSIAFMAYAAEEVGLRGSQDLANQYKSEGKNVVSALQLDMTNYKGSAQDVVFITDYTDSNFTQYLTQLMDEYLP
SLTYGFDTCGYACSDHASWHNAGYPAAMPFESKFNDYNPRIHTTQDTLANSDPTGSHAKKFTQLGLAYAIEMGSATGDTP
TPGNQLEDGVPVTDLSGSRGSNVWYTFELETQKNLQITTSGGYGDLDLYVKFGSKASKQNWDCRPYLSGNNEVCTFNNAS
PGTYSVMLTGYSNYSGASLKASTF
>Q7M1T6 ~~~amp~~~Antigenic membrane protein~~~
MQNQKNQKSLVAKVLVLFAAVALMFVGVQVFADDKLDLNTLECKDALELTAADAADAEKVVKQWKVQNTSLNAKVTKDSV
KVAVADNKVTVTPADGDAGKALSGSKILNLVGVCELNKLTLGTEKKLTLTVKDGKVDAEAGLKALKEAGAKVPATVNKDD
VTFTVGKDDNANKVTVKAVDGKTTVSGQVVFEFTVAKTPWYKTVWFLTLVAVVVVAAVAGGVFFFVKKNKKNK
>G3XCY4 ~~~amrZ~~~Transcription factor AmrZ~~~
MRPLKQATPTYSSRTADKFVVRLPEGMREQIAEVARSHHRSMNSEIIARLEQSLLQEGALQDNLGVRLDSPELSLHEREL
LQRFRQLTHRQQNALVALIAHDAELAQA
>Q46630 3.1.3.48~~~amsI~~~Probable low molecular weight protein-tyrosine-phosphatase AmsI~~~
MINSILVVCIGNICRSPTGERLLKAALPERKIASAGLKAMVGGSADETASIVANEHGVSLQDHVAQQLTADMCRDSDLIL
VMEKKHIDLVCRINPSVRGKTMLFGHWINQQEIADPYKKSRDAFEAVYGVLENAAQKWVNALSR
>P22963 3.2.1.60~~~mta~~~Glucan 1,4-alpha-maltotetraohydrolase~~~
MSHILRAAVLAAVLLPFPALADQAGKSPAGVRYHGGDEIILQGFHWNVVREAPNDWYNILRQQASTIAADGFSAIWMPVP
WRDFSSWTDGGKSGGGEGYFWHDFNKNGRYGSDAQLRQAAGALGGAGVKVLYDVVPNHMNRGYPDKEINLPAGQGFWRND
CADPGNYPNDCDDGDRFIGGESDLNTGHPQIYGMFRDELANLRSGYGAGGFRFDFVRGYAPERVDSWMSDSADSSFCVGE
LWKGPSEYPSWDWRNTASWQQIIKDWSDRAKCPVFDFALKERMQNGSVADWKHGLNGNPDPRWREVAVTFVDNHDTGYSP
GQNGGQHHWALQDGLIRQAYAYILTSPGTPVVYWSHMYDWGYGDFIRQLIQVRRTAGVRADSAISFHSGYSGLVATVSGS
QQTLVVALNSDLANPGQVASGSFSEAVNASNGQVRVWRSGSGDGGGNDGGEGGLVNVNFRCDNGVTQMGDSVYAVGNVSQ
LGNWSPASAVRLTDTSSYPTWKGSIALPDGQNVEWKCLIRNEADATLVRQWQSGGNNQVQAAAGASTSGSF
>P13507 3.2.1.60~~~amyP~~~Glucan 1,4-alpha-maltotetraohydrolase~~~
MSHILRAAVLAAMLLPLPSMADQAGKSPNAVRYHGGDEIILQGFHWNVVREAPNDWYNILRQQAATIAADGFSAIWMPVP
WRDFSSWSDGSKSGGGEGYFWHDFNKNGRYGSDAQLRQAASALGGAGVKVLYDVVPNHMNRGYPDKEINLPAGQGFWRND
CADPGNYPNDCDDGDRFIGGDADLNTGHPQVYGMFRDEFTNLRSQYGAGGFRFDFVRGYAPERVNSWMTDSADNSFCVGE
LWKGPSEYPNWDWRNTASWQQIIKDWSDRAKCPVFDFALKERMQNGSIADWKHGLNGNPDPRWREVAVTFVDNHDTGYSP
GQNGGQHHWALQDGLIRQAYAYILTSPGTPVVYWSHMYDWGYGDFIRQLIQVRRAAGVRADSAISFHSGYSGLVATVSGS
QQTLVVALNSDLGNPGQVASGSFSEAVNASNGQVRVWRSGTGSGGGEPGALVSVSFRCDNGATQMGDSVYAVGNVSQLGN
WSPAAALRLTDTSGYPTWKGSIALPAGQNEEWKCLIRNEANATQVRQWQGGANNSLTPSEGATTVGRL
>P19571 3.2.1.98~~~~~~Glucan 1,4-alpha-maltohexaosidase~~~
MKMRTGKKGFLSILLAFLLVITSIPFTLVDVEAHHNGTNGTMMQYFEWYLPNDGNHWNRLNSDASNLKSKGITAVWIPPA
WKGASQNDVGYGAYDLYDLGEFNQKGTVRTKYGTRSQLQAAVTSLKNNGIQVYGDVVMNHKGGADATEMVRAVEVNPNNR
NQEVTGEYTIEAWTRFDFPGRGNTHSSFKWRWYHFDGVDWDQSRRLNNRIYKFRGHGKAWDWEVDTENGNYDYLMYADID
MDHPEVVNELRNWGVWYTNTLGLDGFRIDAVKHIKYSFTRDWINHVRSATGKNMFAVAEFWKNDLGAIENYLQKTNWNHS
VFDVPLHYNLYNASKSGGNYDMRNIFNGTVVQRHPSHAVTFVDNHDSQPEEALESFVEEWFKPLAYALTLTREQGYPSVF
YGDYYGIPTHGVPAMRSKIDPILEARQKYAYGKQNDYLDHHNIIGWTREGNTAHPNSGLATIMSDGAGGSKWMFVGRNKA
GQVWSDITGNRTGTVTINADGWGNFSVNGGSVSIWVNK
>Q9RBU1 2.1.4.3~~~amtA~~~L-arginine:L-lysine amidinotransferase~~~
MKKIQTFIQTSPVCSYTEWDLLEEIIVGVVDGACIPPWHAAMEPCLPTQQHQFFRDNAGKPFPQERIDLARKELDEFARI
LECEGVKVRRPEPKNQSLVYGAPGWSSTGMYAAMPRDVLLVVGTDIIECPLAWRSRYFETAAYKKLLKEYFHGGAKWSSG
PKPELSDEQYVDGWVEDEAATSANLVITEFEPTFDAADFTRLGKDIIAQKSNVTNEFGINWLQRHLGDDYKIHVLEFNDM
HPMHIDATLVPLAPGKLLINPERVQKMPEIFRGWDAIHAPKPIMPDSHPLYMTSKWINMNILMLDERRVVVERQDEPMIK
AMKGAGFEPILCDFRNFNSFGGSFHCATVDIRRRGKLESYLV
>Q07429 ~~~nrgA~~~Ammonium transporter~~~COG0004
MQMGDTVFMFFCALLVWLMTPGLALFYGGMVKSKNVLSTAMHSFSSIAIVSIVWVLFGYTLAFAPGNSIIGGLEWAGLKG
VGFDPGDYSDTIPHSLFMMFQMTFAVLTTAIISGAFAERMRFGAFLLFSVLWASLVYTPVAHWVWGGGWIGQLGALDFAG
GNVVHISSGVAGLVLAIVLGKRKDGTASSPHNLIYTFLGGALIWFGWFGFNVGSALTLDGVAMYAFINTNTAAAAGIAGW
ILVEWIINKKPTMLGAVSGAIAGLVAITPAAGFVTPFASIIIGIIGGAVCFWGVFSLKKKFGYDDALDAFGLHGIGGTWG
GIATGLFATTSVNSAGADGLFYGDASLIWKQIVAIAATYVFVFIVTFVIIKIVSLFLPLRATEEEESLGLDLTMHGEKAY
QDSM
>P69680 ~~~amtB~~~Ammonium transporter AmtB~~~COG0004
MKIATIKTGLASLAMLPGLVMAAPAVADKADNAFMMICTALVLFMTIPGIALFYGGLIRGKNVLSMLTQVTVTFALVCIL
WVVYGYSLAFGEGNNFFGNINWLMLKNIELTAVMGSIYQYIHVAFQGSFACITVGLIVGALAERIRFSAVLIFVVVWLTL
SYIPIAHMVWGGGLLASHGALDFAGGTVVHINAAIAGLVGAYLIGKRVGFGKEAFKPHNLPMVFTGTAILYIGWFGFNAG
SAGTANEIAALAFVNTVVATAAAILGWIFGEWALRGKPSLLGACSGAIAGLVGVTPACGYIGVGGALIIGVVAGLAGLWG
VTMLKRLLRVDDPCDVFGVHGVCGIVGCIMTGIFAASSLGGVGFAEGVTMGHQLLVQLESIAITIVWSGVVAFIGYKLAD
LTVGLRVPEEQEREGLDVNSHGENAYNA
>P69681 ~~~amtB~~~Ammonium transporter AmtB~~~COG0004
MKIATIKTGLASLAMLPGLVMAAPAVADKADNAFMMICTALVLFMTIPGIALFYGGLIRGKNVLSMLTQVTVTFALVCIL
WVVYGYSLAFGEGNNFFGNINWLMLKNIELTAVMGSIYQYIHVAFQGSFACITVGLIVGALAERIRFSAVLIFVVVWLTL
SYIPIAHMVWGGGLLASHGALDFAGGTVVHINAAIAGLVGAYLIGKRVGFGKEAFKPHNLPMVFTGTAILYIGWFGFNAG
SAGTANEIAALAFVNTVVATAAAILGWIFGEWALRGKPSLLGACSGAIAGLVGVTPACGYIGVGGALIIGVVAGLAGLWG
VTMLKRLLRVDDPCDVFGVHGVCGIVGCIMTGIFAASSLGGVGFAEGVTMGHQLLVQLESIAITIVWSGVVAFIGYKLAD
LTVGLRVPEEQEREGLDVNSHGENAYNA
>O66515 ~~~amt~~~Ammonium transporter~~~COG0004
MRALGFIGIILSIFSSFAYASEAKLDTGNTAWMLVASALVVFMTVPGLALFYGGLDKSKSILNTIAMSFSAFAVVTLTWI
FVGYSVAYGDDIFGFIGNPFQYVLGKGISGINSDTGYPALLDLMFQLTFATITTALISGSFVGRMKFSAWILFAILWSVF
VYPPVAHWVWGGGFLANDGALDFAGGTVVHINAGIAGLVGALILGRRKDTSLIPNNVPLVALGAGILWFGWFGFNAGSAL
GANESAAWAMINTTVATSTAALAWMFTEWLHVGKPTVVGISSGIVAGLVAITPAAGFVNLIGSIFIGAIASVCAYFMVAL
VKPKFGYDDALDVFGIHGVRGIVGAVLTGVFADPNVGGTPGLLYGNPKQVLIQIEGVIATILYSAILTAVILLVLKAVVG
LRVSEEEELELDSSLHGEKAYNL
>P54146 ~~~amt~~~Ammonium transporter~~~COG0004
MDPSDLAWILAAFALVSLMFPGLSLLYGGMLGGQHVLNTFMMVMSSLGIISLVYIIYGHGLVLGNSIGGWGIIGNPLEYF
GFRNIMEDDGTGDLMWAGFYILFAAISLALVSSGAAGRMRFGAWLVFGVLWFTFVYAPLAHWVFAIDDPESGYVGGWMKN
VLEFHDFAGGTAVHMNAGASGLALAIVLGRRHSMAVRPHNLPLILIGAGLIVAGWFGFNGGTAGGANFLASYVVVTSLIA
AAGGMMGFMLVERVFSGKPTFFGSATGTIAGLVAITPAADAVSPLGAFAVGALGAVVSFWAISWKKGHRVDDSFDVFAVH
GMAGIAGALFVMLFGDPLAPAGVSGVFFGGELSLLWREPLAIIVTLTYAFGVTWLIATILNKFMTLRITSEAEYEGIDRA
EHAESAYHLNSNGIGMATRTNFGPEIPEETVPDAVQVGVDKQKIADTRKASK
>P09961 3.2.1.1~~~amyA~~~Alpha-amylase 1~~~COG1449
MTKSIYFSLGIHNHQPVGNFDFVIERAYEMSYKPLINFFFKHPDFPINVHFSGFLLLWLEKNHPEYFEKLKIMAERGQIE
FVSGGFYEPILPIIPDKDKVQQIKKLNKYIYDKFGQTPKGMWLAERVWEPHLVKYIAEAGIEYVVVDDAHFFSVGLKEED
LFGYYLMEEQGYKLAVFPISMKLRYLIPFADPEETITYLDKFASEDKSKIALLFDDGEKFGLWPDTYRTVYEEGWLETFV
SKIKENFLLVTPVNLYTYMQRVKPKGRIYLPTASYREMMEWVLFPEAQKELEELVEKLKTENLWDKFSPYVKGGFWRNFL
AKYDESNHMQKKMLYVWKKVQDSPNEEVKEKAMEEVFQGQANDAYWHGIFGGLYLPHLRTAIYEHLIKAENYLENSEIRF
NIFDFDCDGNDEIIVESPFFNLYLSPNHGGSVLEWDFKTKAFNLTNVLTRRKEAYHSKLSYVTSEAQGKSIHERWTAKEE
GLENILFYDNHRRVSFTEKIFESEPVLEDLWKDSSRLEVDSFYENYDYEINKDENKIRVLFSGVFRGFELCKSYILYKDK
SFVDVVYEIKNVSETPISLNFGWEINLNFLAPNHPDYYFLIGDQKYPLSSFGIEKVNNWKIFSGIGIELECVLDVEASLY
RYPIETVSLSEEGFERVYQGSALIHFYKVDLPVGSTWRTTIRFWVK
>P25718 3.2.1.1~~~malS~~~Periplasmic alpha-amylase~~~COG0366
MKLAACFLTLLPGFAVAASWTSPGFPAFSEQGTGTFVSHAQLPKGTRPLTLNFDQQCWQPADAIKLNQMLSLQPCSNTPP
QWRLFRDGEYTLQIDTRSGTPTLMISIQNAAEPVASLVRECPKWDGLPLTVDVSATFPEGAAVRDYYSQQIAIVKNGQIM
LQPAATSNGLLLLERAETDTSAPFDWHNATVYFVLTDRFENGDPSNDQSYGRHKDGMAEIGTFHGGDLRGLTNKLDYLQQ
LGVNALWISAPFEQIHGWVGGGTKGDFPHYAYHGYYTQDWTNLDANMGNEADLRTLVDSAHQRGIRILFDVVMNHTGYAT
LADMQEYQFGALYLSGDEVKKSLGERWSDWKPAAGQTWHSFNDYINFSDKTGWDKWWGKNWIRTDIGDYDNPGFDDLTMS
LAFLPDIKTESTTASGLPVFYKNKMDTHAKAIDGYTPRDYLTHWLSQWVRDYGIDGFRVDTAKHVELPAWQQLKTEASAA
LREWKKANPDKALDDKPFWMTGEAWGHGVMQSDYYRHGFDAMINFDYQEQAAKAVDCLAQMDTTWQQMAEKLQGFNVLSY
LSSHDTRLFREGGDKAAELLLLAPGAVQIFYGDESSRPFGPTGSDPLQGTRSDMNWQDVSGKSAASVAHWQKISQFRARH
PAIGAGKQTTLLLKQGYGFVREHGDDKVLVVWAGQQ
>P14898 3.2.1.1~~~amyB~~~Alpha-amylase 2~~~
MIYDDKIFGDLCHKEFLVEREVKKLEEIYLEEVLPEDPKPEDEIEFTFNCPLKFHITSGKIVKDNREIYTFNIQERKTQW
NDSIFNFSEIIKIKIPPLKENGLYQIHLYEMNEKIYEQYLSIDNFEAPLWSEESIIYHIFIDRFAKDEKEVEYSENLKEK
LGGNLKGILSRLDYIENLGINTIWISPIFKSTSYHGYDIEDYFEIDPIWGTKEDLKKLVREAFNRGIRIILDFVPNHMSY
KNPIFQKALKDKNSNLRSWFIFKGEDYETFFGVKSMPKINLKNKEAIDYIINAAKYWIREFGISGYRMDHATGPDINFWS
IFYYNLKSEFPETFYFGEIVETPKETKKYVGKFDGTLDFYLFKIIRDFFIGKRWSTKEFVKMIDLEEKFYGNKFKRISFL
ENHDSNRFLWVAKDKKLLRLASIFQFSINAIPIIYNGQEMGCSQYRDILEGNRTLHEHARLPIPWSDDKQDKELIDFYRQ
LVKIRKSHPALYKGTFIPIFSDMISFIKETQEESILVLINIEDKEEIFNLNGTYRDLFSGNIYTNSLKLGPMSAHLLLRI
DH
>P26612 3.2.1.1~~~amyA~~~Cytoplasmic alpha-amylase~~~COG0366
MRNPTLLQCFHWYYPEGGKLWPELAERADGFNDIGINMVWLPPAYKGASGGYSVGYDSYDLFDLGEFDQKGSIPTKYGDK
AQLLAAIDALKRNDIAVLLDVVVNHKMGADEKEAIRVQRVNADDRTQIDEEIIECEGWTRYTFPARAGQYSQFIWDFKCF
SGIDHIENPDEDGIFKIVNDYTGEGWNDQVDDELGNFDYLMGENIDFRNHAVTEEIKYWARWVMEQTQCDGFRLDAVKHI
PAWFYKEWIEHVQEVAPKPLFIVAEYWSHEVDKLQTYIDQVEGKTMLFDAPLQMKFHEASRMGRDYDMTQIFTGTLVEAD
PFHAVTLVANHDTQPLQALEAPVEPWFKPLAYALILLRENGVPSVFYPDLYGAHYEDVGGDGQTYPIDMPIIEQLDELIL
ARQRFAHGVQTLFFDHPNCIAFSRSGTDEFPGCVVVMSNGDDGEKTIHLGENYGNKTWRDFLGNRQERVVTDENGEATFF
CNGGSVSVWVIEEVI
>P36924 3.2.1.2~~~spoII~~~Beta-amylase~~~
MKNQFQYCCIVILSVVMLFVSLLIPQASSAAVNGKGMNPDYKAYLMAPLKKIPEVTNWETFENDLRWAKQNGFYAITVDF
WWGDMEKNGDQQFDFSYAQRFAQSVKNAGMKMIPIISTHQCGGNVGDDCNVPIPSWVWNQKSDDSLYFKSETGTVNKETL
NPLASDVIRKEYGELYTAFAAAMKPYKDVIAKIYLSGGPAGELRYPSYTTSDGTGYPSRGKFQAYTEFAKSKFRLWVLNK
YGSLNEVNKAWGTKLISELAILPPSDGEQFLMNGYLSMYGKDYLEWYQGILENHTKLIGELAHNAFDTTFQVPIGAKIAG
VHWQYNNPTIPHGAEKPAGYNDYSHLLDAFKSAKLDVTFTCLEMTDKGSYPEYSMPKTLVQNIATLANEKGIVLNGENAL
SIGNEEEYKRVAEMAFNYNFAGFTLLRYQDVMYNNSLMGKFKDLLGVTPVMQTIVVKNVPTTIGDTVYITGNRAELGSWD
TKQYPIQLYYDSHSNDWRGNVVLPAERNIEFKAFIKSKDGTVKSWQTIQQSWNPVPLKTTSHTSSW
>P21543 ~~~~~~Beta/alpha-amylase~~~COG0366
MTLYRSLWKKGCMLLLSLVLSLTAFIGSPSNTASAAVADDFQASVMGPLAKINDWGSFKKQLQTLKNNGVYAITTDVWWG
YVESAGDNQFDWSYYKTYANAVKEAGLKWVPIISTHKCGGNVGDDCNIPLPSWLSSKGSADEMQFKDESGYANSEALSPL
WSGTGKQYDELYASFAENFAGYKSIIPKIYLSGGPSGELRYPSYYPAAGWSYPGRGKFQAYTETAKNAFRTAMNDKYGSL
DKINAAWGTKLTSLSQINPPTDGDGFYTNGGYNSAYGKDFLSWYQSVLEKHLGVIGAAAHKNFDSVFGVRIGAKISGLHW
QMNNPAMPHGTEQAGGYYDYNRLIQKFKDADLDLTFTCLEMSDSGTAPNYSLPSTLVDTVSSIANAKGVRLNGENALPTG
GSGFQKIEEKITKFGYHGFTLLRINNLVNNDGSPTGELSGFKQYIISKAKPDNNGGTGNKVTIYYKKGFNSPYIHYRPAG
GSWTAAPGVKMQDAEISGYAKITVDIGSASQLEAAFNDGNNNWDSNNTKNYSFSTGTSTYTPGNSGNAGTITSGAPAGAN
PGDGGGTTNKVTVYYKKGFNSPYIHYRPAGGSWTAAPGVKMQDAEISGYAKITVDIGSASQLEAAFNDGNNNWDSNNTKN
YLFSTGTSTYTPGSNGAAGTIRTGAPSGSVLSVVTSTYATDLNEVTGPIQTEKLSGVSLNVSTSTYAPNSNGVEVTAQTE
APSGAFTSMDLGTLSNPTSLNTDWSKQSIYFIMTDRFSNGDPSNDNYGGFNSNNSDQRKWHGGDFQGIINKLDYIKNMGF
TAIWITPVTMQKSEYAYHGYHTYDFYAVDGHLGTMDKLQELVRKAHDKNIAVMVDVVVNHTGDFQPGNGFAKAPFDKADW
YHHNGDITDGDYNSNNQWKIENGDVAGLDDLNHENPATANELKNWIKWLLNETGIDGLRLDTVKHVPKGFLKDFDQAANT
FTMGEIFHGDPAYVGDYTRYLDAALDFPMYYTIKDVFGHDQSMRKIKDRYSDDRYYRDAQTNGVFIDNHDVKRFLNDASG
KPGANYDKWPQLKAALGFTLTSRGIPIIYQGTEQGYSGGDDPANRENMNFNANHDLYQYIAKLNYVRNNHPALQNGSQRE
KWVDDSFYSFQRSKNGDEAIVFINNSWNSQTRTIGNFDNLSNGTRLTNQLSNDSVQINNGSITVTLAPKEVKVFTK
>P19584 3.2.1.2~~~~~~Thermophilic beta-amylase~~~
MIGAFKRLGQKLFLTLLTASLIFASSIVTANASIAPNFKVFVMGPLEKVTDFNAFKDQLITLKNNGVYGITTDIWWGYVE
NAGENQFDWSYYKTYADTVRAAGLKWVPIMSTHACGGNVGDTVNIPIPSWVWTKDTQDNMQYKDEAGNWDNEAVSPWYSG
LTQLYNEFYSSFASNFSSYKDIITKIYISGGPSGELRYPSYNPSHGWTYPGRGSLQCYSKAAITSFQNAMKSKYGTIAAV
NSAWGTSLTDFSQISPPTDGDNFFTNGYKTTYGNDFLTWYQSVLTNELANIASVAHSCFDPVFNVPIGAKIAGVHWLYNS
PTMPHAAEYCAGYYNYSTLLDQFKASNLAMTFTCLEMDDSNAYVSPYYSAPMTLVHYVANLANNKGIVHNGENALAISNN
NQAYVNCANELTGYNFSGFTLLRLSNIVNSDGSVTSEMAPFVINIVTLTPNGTIPVTFTINNATTYYGQNVYIVGSTSDL
GNWNTTYARGPASCPNYPTWTITLNLLPGEQIQFKAVKIDSSGNVTWEGGSNHTYTVPTSGTGSVTITWQN
>P29761 3.2.1.3~~~cga~~~Glucoamylase~~~
MSRKLIKYLPLLVLASSVLSGCSNNVSSIKIDRFNNISAVNGPGEEDTWASAQKQGVGTANNYVSKVWFTLANGAISEVY
YPTIDTADVKEIKFIVTDGKSFVSDETKDTISKVEKFTDKSLGYKLVNTDKKGRYRITKEIFTDVKRNSLIMKAKFEALE
GSIHDYKLYLAYDPHIKNQGSYNEGYVIKANNNEMLMAKRDNVYTALSSNIGWKGYSIGYYKVNDIMTDLDENKQMTKHY
DSARGNIIEGAEIDLKKNSQFEIVLSFGNSEDEAVKASIETLSENYDSLKSAYIDEWEKYCNSLNNFNGKANSLYYNSMM
ILKASEDKTNKGAYIASLSIPWGDGQGDDNTGGYHLVWSRDLYHVANAFIAAGDVDSANRSLDYLAKVVKDNGMIPQNTW
ISGKPYWTGIQLDEQADPIILSYRLRRYDLYDSLVKPLADFIIKMGPKTGQERWEEIGGYSPATMAAEVAGLTCAAYIAE
QNKDYESAQKYQEKADNWQKLIDNLTYTEHGPLENGQYYIRIAGLPDPNADFTISIANGGGVYDQKEIVDPSFLELVRLG
VKSPDDPKILNTLRVVDSTIKVDTPKGPSWYRYNHDGYGEPSKTELYHGAGKGRLWPLLTGERGMYEIAAGKDATPYLKA
MENFANEGGIISEQVWEDTGLPTDSASPLNWAHAEYVVLFPSNIEHKVLDMPDIVYKRYVAK
>P80696 ~~~amyL~~~Bacteriocin amylovorin-L~~~
MKQLNSEQLQNIIGGNRWTNAYSAALGCAVPGVKYGKKLGGVWGAVIGGVGGAAVCGLAGYVRKG
>P19531 3.2.1.133~~~amyM~~~Maltogenic alpha-amylase~~~
MKKKTLSLFVGLMLLIGLLFSGSLPYNPNAAEASSSASVKGDVIYQIIIDRFYDGDTTNNNPAKSYGLYDPTKSKWKMYW
GGDLEGVRQKLPYLKQLGVTTIWLSPVLDNLDTLAGTDNTGYHGYWTRDFKQIEEHFGNWTTFDTLVNDAHQNGIKVIVD
FVPNHSTPFKANDSTFAEGGALYNNGTYMGNYFDDATKGYFHHNGDISNWDDRYEAQWKNFTDPAGFSLADLSQENGTIA
QYLTDAAVQLVAHGADGLRIDAVKHFNSGFSKSLADKLYQKKDIFLVGEWYGDDPGTANHLEKVRYANNSGVNVLDFDLN
TVIRNVFGTFTQTMYDLNNMVNQTGNEYKYKENLITFIDNHDMSRFLSVNSNKANLHQALAFILTSRGTPSIYYGTEQYM
AGGNDPYNRGMMPAFDTTTTAFKEVSTLAGLRRNNAAIQYGTTTQRWINNDVYIYERKFFNDVVLVAINRNTQSSYSISG
LQTALPNGSYADYLSGLLGGNGISVSNGSVASFTLAPGAVSVWQYSTSASAPQIGSVAPNMGIPGNVVTIDGKGFGTTQG
TVTFGGVTATVKSWTSNRIEVYVPNMAAGLTDVKVTAGGVSSNLYSYNILSGTQTSVVFTVKSAPPTNLGDKIYLTGNIP
ELGNWSTDTSGAVNNAQGPLLAPNYPDWFYVFSVPAGKTIQFKFFIKRADGTIQWENGSNHVATTPTGATGNITVTWQN
>Q9ZEU2 2.4.1.4~~~ams~~~Amylosucrase~~~
MLTPTQQVGLILQYLKTRILDIYTPEQRAGIEKSEDWRQFSRRMDTHFPKLMNELDSVYGNNEALLPMLEMLLAQAWQSY
SQRNSSLKDIDIARENNPDWILSNKQVGGVCYVDLFAGDLKGLKDKIPYFQELGLTYLHLMPLFKCPEGKSDGGYAVSSY
RDVNPALGTIGDLREVIAALHEAGISAVVDFIFNHTSNEHEWAQRCAAGDPLFDNFYYIFPDRRMPDQYDRTLREIFPDQ
HPGGFSQLEDGRWVWTTFNSFQWDLNYSNPWVFRAMAGEMLFLANLGVDILRMDAVAFIWKQMGTSCENLPQAHALIRAF
NAVMRIAAPAVFFKSEAIVHPDQVVQYIGQDECQIGYNPLQMALLWNTLATREVNLLHQALTYRHNLPEHTAWVNYVRSH
DDIGWTFADEDAAYLGISGYDHRQFLNRFFVNRFDGSFARGVPFQYNPSTGDCRVSGTAAALVGLAQDDPHAVDRIKLLY
SIALSTGGLPLIYLGDEVGTLNDDDWSQDSNKSDDSRWAHRPRYNEALYAQRNDPSTAAGQIYQGLRHMIAVRQSNPRFD
GGRLVTFNTNNKHIIGYIRNNALLAFGNFSEYPQTVTAHTLQAMPFKAHDLIGGKTVSLNQDLTLQPYQVMWLEIA
>P00692 3.2.1.1~~~~~~Alpha-amylase~~~
MIQKRKRTVSFRLVLMCTLLFVSLPITKTSAVNGTLMQYFEWYTPNDGQHWKRLQNDAEHLSDIGITAVWIPPAYKGLSQ
SDNGYGPYDLYDLGEFQQKGTVRTKYGTKSELQDAIGSLHSRNVQVYGDVVLNHKAGADATEDVTAVEVNPANRNQETSE
EYQIKAWTDFRFPGRGNTYSDFKWHWYHFDGADWDESRKISRIFKFRGEGKAWDWEVSSENGNYDYLMYADVDYDHPDVV
AETKKWGIWYANELSLDGFRIDAAKHIKFSFLRDWVQAVRQATGKEMFTVAEYWQNNAGKLENYLNKTSFNQSVFDVPLH
FNLQAASSQGGGYDMRRLLDGTVVSRHPEKAVTFVENHDTQPGQSLESTVQTWFKPLAYAFILTRESGYPQVFYGDMYGT
KGTSPKEIPSLKDNIEPILKARKEYAYGPQHDYIDHPDVIGWTREGDSSAAKSGLAALITDGPGGSKRMYAGLKNAGETW
YDITGNRSDTVKIGSDGWGEFHVNDGSVSIYVQK
>P06278 3.2.1.1~~~amyS~~~Alpha-amylase~~~
MKQQKRLYARLLTLLFALIFLLPHSAAAAANLNGTLMQYFEWYMPNDGQHWKRLQNDSAYLAEHGITAVWIPPAYKGTSQ
ADVGYGAYDLYDLGEFHQKGTVRTKYGTKGELQSAIKSLHSRDINVYGDVVINHKGGADATEDVTAVEVDPADRNRVISG
EHRIKAWTHFHFPGRGSTYSDFKWHWYHFDGTDWDESRKLNRIYKFQGKAWDWEVSNENGNYDYLMYADIDYDHPDVAAE
IKRWGTWYANELQLDGFRLDAVKHIKFSFLRDWVNHVREKTGKEMFTVAEYWQNDLGALENYLNKTNFNHSVFDVPLHYQ
FHAASTQGGGYDMRKLLNSTVVSKHPLKAVTFVDNHDTQPGQSLESTVQTWFKPLAYAFILTRESGYPQVFYGDMYGTKG
DSQREIPALKHKIEPILKARKQYAYGAQHDYFDHHDIVGWTREGDSSVANSGLAALITDGPGGAKRMYVGRQNAGETWHD
ITGNRSEPVVINSEGWGEFHVNGGSVSIYVQR
>P00691 3.2.1.1~~~amyE~~~Alpha-amylase~~~COG0366
MFAKRFKTSLLPLFAGFLLLFHLVLAGPAAASAETANKSNELTAPSIKSGTILHAWNWSFNTLKHNMKDIHDAGYTAIQT
SPINQVKEGNQGDKSMSNWYWLYQPTSYQIGNRYLGTEQEFKEMCAAAEEYGIKVIVDAVINHTTSDYAAISNEVKSIPN
WTHGNTQIKNWSDRWDVTQNSLLGLYDWNTQNTQVQSYLKRFLDRALNDGADGFRFDAAKHIELPDDGSYGSQFWPNITN
TSAEFQYGEILQDSASRDAAYANYMDVTASNYGHSIRSALKNRNLGVSNISHYASDVSADKLVTWVESHDTYANDDEEST
WMSDDDIRLGWAVIASRSGSTPLFFSRPEGGGNGVRFPGKSQIGDRGSALFEDQAITAVNRFHNVMAGQPEELSNPNGNN
QIFMNQRGSHGVVLANAGSSSVSINTATKLPDGRYDNKAGAGSFQVNDGKLTGTINARSVAVLYPDDIAKAPHVFLENYK
TGVTHSFNDQLTITLRADANTTKAVYQINNGPETAFKDGDQFTIGKGDPFGKTYTIMLKGTNSDGVTRTEKYSFVKRDPA
SAKTIGYQNPNHWSQVNAYIYKHDGSRVIELTGSWPGKPMTKNADGIYTLTLPADTDTTNAKVIFNNGSAQVPGQNQPGF
DYVLNGLYNDSGLSGSLPH
>P06279 3.2.1.1~~~amyS~~~Alpha-amylase~~~
MLTFHRIIRKGWMFLLAFLLTALLFCPTGQPAKAAAPFNGTMMQYFEWYLPDDGTLWTKVANEANNLSSLGITALWLPPA
YKGTSRSDVGYGVYDLYDLGEFNQKGAVRTKYGTKAQYLQAIQAAHAAGMQVYADVVFDHKGGADGTEWVDAVEVNPSDR
NQEISGTYQIQAWTKFDFPGRGNTYSSFKWRWYHFDGVDWDESRKLSRIYKFRGIGKAWDWEVDTENGNYDYLMYADLDM
DHPEVVTELKSWGKWYVNTTNIDGFRLDAVKHIKFSFFPDWLSDVRSQTGKPLFTVGEYWSYDINKLHNYIMKTNGTMSL
FDAPLHNKFYTASKSGGTFDMRTLMTNTLMKDQPTLAVTFVDNHDTEPGQALQSWVDPWFKPLAYAFILTRQEGYPCVFY
GDYYGIPQYNIPSLKSKIDPLLIARRDYAYGTQHDYLDHSDIIGWTREGVTEKPGSGLAALITDGPGGSKWMYVGKQHAG
KVFYDLTGNRSDTVTINSDGWGEFKVNGGSVSVWVPRKTTVSTIAWSITTRPWTDEFVRWTEPRLVAWP
>P20845 3.2.1.1~~~~~~Alpha-amylase~~~
MKGKKWTALALTLPLAASLSTGVDAETVHKGKAPTADKNGVFYEVYVNSFYDANKDGHGDLKGLTQKLDYLNDGNSHTKN
DLQVNGIWMMPVNPSPSYHKYDVTDYYNIDPQYGNLQDFRKLMKEADKRDVKVIMDLVVNHTSSEHPWFQAALKDKNSKY
RDYYIWADKNTDLNEKGSWGQQVWHKAPNGEYFYGTFWEGMPDLNYDNPEVRKEMINVGKFWLKQGVDGFRLDAALHIFK
GQTPEGAKKNILWWNEFRDAMKKENPNVYLTGEVWDQPEVVAPYYQSLDSLFNFDLAGKIVSSVKAGNDQGIATAAAATD
ELFKSYNPNKIDGIFLTNHDQNRVMSELSGDVNKAKSAASILLTLPGNPYIYYGEEIGMTGEKPDELIREPFRWYEGNGI
GQTSWETPVYNKGGNGVSVEAQTKQKDSLLNHYREMIRVRQQHEELVKGTLQSISVDSKEVVAYSRTYKGKSISVYHNIS
NQPVKVSVAAKGNLIFASEKGAKKVKNQLVIPANRTVLIK
>P29957 3.2.1.1~~~amy~~~Alpha-amylase~~~
MKLNKIITTAGLSLGLLLPSIATATPTTFVHLFEWNWQDVAQECEQYLGPKGYAAVQVSPPNEHITGSQWWTRYQPVSYE
LQSRGGNRAQFIDMVNRCSAAGVDIYVDTLINHMAAGSGTGTAGNSFGNKSFPIYSPQDFHESCTINNSDYGNDRYRVQN
CELVGLADLDTASNYVQNTIAAYINDLQAIGVKGFRFDASKHVAASDIQSLMAKVNGSPVVFQEVIDQGGEAVGASEYLS
TGLVTEFKYSTELGNTFRNGSLAWLSNFGEGWGFMPSSSAVVFVDNHDNQRGHGGAGNVITFEDGRLYDLANVFMLAYPY
GYPKVMSSYDFHGDTDAGGPNVPVHNNGNLECFASNWKCEHRWSYIAGGVDFRNNTADNWAVTNWWDNTNNQISFGRGSS
GHMAINKEDSTLTATVQTDMASGQYCNVLKGELSADAKSCSGEVITVNSDGTINLNIGAWDAMAIHKNAKLNTSSASSTE
SDWQRTVIFINAQTQSGQDMFIRGGIDHAYANANLGRNCQTSNFECAMPIRHNNLKNVTTSPWKANDNYLDWYGIENGQS
SEAEGSATDWTTNVWPAGWGAEKTVNTDGFGVTPLNIWGEHYWMLDVDMDCSKAVNGWFELKAFIKNGQGWETAIAQDNA
PYTSTNHMAQCGKINKFEFNNSGVVIRSF
>Q05884 3.2.1.1~~~amy~~~Alpha-amylase~~~
MPATRRTARVRRVAAVTVTALAAALLPPLAARADTPPAPPSDAKLAKTAARHDLTREQFYFSCRTLRQRGRRERPRRLTG
TRLTTGYDPTDKGFYQGGDLKGLTEKLDYIKGLGTTSIWMAPIFKNQPVQGTGKDASAGYHGYWITDFTQVDPHFGTNKD
LKNLISKAHAKGMKVFFDVITNHTADVVDYEEKSYDYLSKGAFPYLTKDGQPFDDADYADGERRFPRVDSGSFPRTPTVP
TAKKNLKVPSWLNDPAMYHNRGDSTWAGESATYGDFNGLDDLWTERPEVVGGMEKIYQRWVEDFAIDGFRIDTVKHVDME
FWTQWATALDAYAAKKGRDDFFMFGEVYSADTSVTAPYVTQGRLDSTLDFPFQDAARAYASQGGSARKLAAVFGDDYKYT
TDKANAYEQVTFLGNHDMGRIGTFLKQDAPEAGDAELLKKDRLANELMFLSRGNPVIYYGDEQGFTGAGGDKDARQPMFA
SRTADYLDDDQLGTDRTHAEAAYDTSAPLYRQISALAELRKANPALADGVQTERYAADGAGIYAFSRTDAKTGTEYVVAF
NNAGTEPSAAFATGSAGMTFRGLYGTDATVKSGADSKVTVTVPARSAVVLKAAGRLAAPAAEPTISLHAPDPGATGTVEL
SADVAGGQLNRVVFAAQTGDGKWRTLGTADHAPYKVTHTVDADTPAGTALRYKAVVVDSAGRTASGRLHHRHPARRGGAH
RRLPGPRGRPLQARRRELRRLGPVRLGRPRRREAHHLARHPPLHRPGRLRAFAYVKLKPGASTVGFLVIDKDGNKDVAAD
RTIDVTETGEVWIEQGEEQLVTERPEYPAQDTTKAVLHYKRADGNYDGWGLHVWGDAANPTDWAKPLQPVRTDPYGAVFE
VPLTDGASSLSYMVHKGDEKDLPTDQAWTSRPTATRCGC
>P22998 3.2.1.1~~~aml~~~Alpha-amylase~~~
MARKTVAAALALVAGAAVAVTGNAPAQAVPPGEKDVTAVMFEWNFASVARECTDRLGPAGYGYVQVSPPQEHLQGGQWWT
SYQPVSYKIAGRLGDRTAFKNMIDTCHAAGVKVVADSVINHMANGSGTGTGGTSFSKYDYPGLYSGSDMDDCRATISNYQ
DRANVQNCELVQLPDLDTGEDHVRGKIAGYLNDLASLGVDGFRIDAAKHMPAADLANIKSRLTNPNVFWKLEAIHGAGEA
VSPSEYLGSGDVQEFRYARDLKRVLQGEKLSYLKNFGEAWGHMPSGQSGVFVDNHDTERGGDTLSYKDGANYTLASVFML
AWPYGSPDVHSGYEWTDKDAGPPNNGQVNACYTDGWKCQHAWREISSMVAFRNTARGQAVTNWWDNGNNAIAFGRGSKAY
VAINHETSALTRTFQTSLPAGSYCDVQSNTPVTVNSSGQFTATLAANTAVALHVNATGCGSTPTTPPTTPPATSGASFNV
TATTVVGQNIYVTGNRAELGNWAPASALKLDPATYPVWKLTVGLPAGTSFEYKYIRKDAAGNVTWESGANRTATVPASGQ
LVLNDTFRS
>P29750 3.2.1.1~~~tam~~~Alpha-amylase~~~
MGVRRSLAALLAALLGCATSLVALTVAASPAHAAPSGNRDVIVHLFQWRWKSIADECRTTLGPHGFGAVQVSPPQEHVVL
PAEDYPWWQDYQPVSYKLDQTRRGSRADFIDMVNTCREAGVKIYVDAVINHMTGTGSAGAGPGSAGSSYSKYDYPGIYQS
QDFNDCRRDITNWNDKWEVQHCELVGLADLKTSSPYVQDRIAAYLNELIDLGVAGFRIDAAKHIPEGDLQAILSRLKNVH
PAWGGGKPYIFQEVIADSTISTGSYTHLGSVTEFQYHRDISHAFANGNIAHLTGLGSGLTPSDKAVVFVVNHDTQRYEPI
LTHTDRARYDLAQKFMLAHPYGTPKVMSSYTWSGDDKAGPPMHSDGTTRPTDCSADRWLCEHRAVAGMVGFHNAVAGQGI
GSAVTDGNGRLAFARGSAGYAAFNATNTAWTRTFTTSLPDGVYCDVANGTFVDGVCDGPSYQVSGGKFTATVPANGAVAL
HVEAPGSCGPDGCGTPPGGGDDCTTVTARFHATVTTWYGQEVAVVGSIPELGSWQPAQGVRLRTDSGTYPVWSGAVDLPA
GVGFEYKYVKLNRTAPWSGSRAATASPPWMTSGGGCSQNFYDSWR
>Q06848 ~~~ancA~~~Cellulosome-anchoring protein~~~COG1361
MKRIKRILAVLTIFALLATINAFTFVSLAQTNTIEIIIGNVKARPGDRIEVPVSLKNVPDKGIVSSDFVIEYDSKLFKVI
ELKAGDIVENPSESFSYNVVEKDEIIAVLYLEETGLGIEAIRTDGVFFTIVMEVSKDVKPGISPIKFESFGATADNDMNE
MTPKLVEGKVEIIEASAPEATPTPGSTAGSGAGGGTGSSGSGQPSATPTPTATEKPSTTPKTTEQPHEDIPQSGGTGEHA
PFLKGYPGGLFKPENNITRAEAAVIFAKLLGADENSAGKNSSITFKDLKDSHWAAWAIKYVTEQNLFGGYPDGTFMPDKS
ITRAEFATVTYKFLEKLGKIEQGTDVKTQLKDIEGHWAQKYIETLVAKGYIKGYPDETFRPQASIKRAESVALINRSLER
GPLNGAVLEFTDVPVNYWAYKDIAEGVIYHSYKIDENGQEVMVEKLD
>Q84BZ0 1.18.1.3~~~andAa~~~Anthranilate 1,2-dioxygenase system ferredoxin--NAD(+) reductase component~~~
MSADPFVIVGAGHAARRTAEALRARDADAPIVMIGAERELPYDRPALSKDALLNDDGEQRAFVRDAAWYDAQRIALRLGT
RVDAIEREAQRVRLDDGTTLPYAKLVLATGSRVRTFGGPIDAGVVAHYVRTVADARALRAQLVRGRRVAVLGGGFIGLEV
AAAARQLGCNVTVIDPAARLLQRALPEVVGAYAHRLHDERGVGFQMATLPRAIRAAAGGGAIVETDRGDVHADVVVVGIG
VLPNVELAQAAGLDVDNGIRVDAGCRTADRAIFAAGEVTMHFNPLLGRHVRIESWQVAENQPAVAAANLLGADDAYAELP
WLWSDQYDCNLQMLGLFGAGQTTVVRGDPARGPFTVFGLGGDGRIVAAAAVNLGRDIGAARRLIAAGAMPDPQQLADPTV
GLKTFL
>Q84BZ1 ~~~andAb~~~Anthranilate 1,2-dioxygenase ferredoxin subunit~~~COG2146
MTEATLAEWHPLGAIDEFTEDEPAARVAGQKPIAVFRIGDELFAMHDLCSHGHARLSEGYVEDGCVECPLHQGLIDIRTG
APKCAPITEPVRVYPIRIVDGQVEVNVG
>Q84BZ3 1.14.12.1~~~andAc~~~Anthranilate 1,2-dioxygenase large subunit~~~COG4638
MEQTASPVVFAARDDASDVHFPHDDGSRVPYKVFSSRAVYDREQERIFRGPTWNFVALEAEIPNAGDFKSTFVGDTPVVV
TRTEDGALSAWVNRCAHRGAQVCRKSRGNASSHTCVYHQWSFDNEGNLLGVPFRRGQKGMTGMPADFDPKQHGLRKLRVD
SYRGLVFATFSDDVAPLPDYLGAQMRPWIDRIFHKPIEYLGCTRQYSKSNWKLYMENVKDPYHASMLHLFHTTFNIFRVG
MKARSIPDANHGLHSIITVTKTGDDTSAAYKQQNIRSFDEGFHLEDESILDLVSEYDEDCTNHIQPIFPQLVIQQIHNTL
VARQILPKGPDNFELIFHFFGYADDTPELRALRIKQANLVGPAGYISMEDTEATELVQRGTVRDADATSVIEMSRGNPEQ
QDTVITESLIRKFWVGYQKLMGY
>Q84BZ2 1.14.12.1~~~andAd~~~Anthranilate 1,2-dioxygenase small subunit~~~COG5517
MENLTEDMKTWFEIYMLQNRYIGHLDNDRLERWPEMFTEDCTYEIVPKENADLGLPVGIVHCTNQRMLRDRVVSLRHANI
YEEHTYRHMTSGLAIVAQRDGEIDTESNYVVVQTRSNGESNVYQAGKYYDTVVRTPDGLRYKAKRVIYDTSRVQTLLATP
I
>P16266 1.18.6.1~~~anfD~~~Nitrogenase iron-iron protein alpha chain~~~
MPHHEFECSKVIPERKKHAVIKGKGETLADALPQGYLNTIPGSISERGCAYCGAKHVIGTPMKDVIHISHGPVGCTYDTW
QTKRYISDNDNFQLKYTYATDVKEKHIVFGAEKLLKQNIIEAFKAFPQIKRMTIYQTCATALIGDDINAIAEEVMEEMPE
VDIFVCNSPGFAGPSQSGGHHKINIAWINQKVGTVEPEITGDHVINYVGEYNIQGDQEVMVDYFKRMGIQVLSTFTGNGS
YDGLRAMHRAHLNVLECARSAEYICNELRVRYGIPRLDIDGFGFKPLADSLRKIGMFFGIEDRAKAIIDEEVARWKPELD
WYKERLMGKKVCLWPGGSKLWHWAHVIEEEMGLKVVSVYIKFGHQGDMEKGIARCGEGTLAIDDPNELEGLEALEMLKPD
IILTGKRPGEVAKKVRVPYLNAHAYHNGPYKGFEGWVRFARDIYNAIYSPIHQLSGIDITKDNAPEWGNGFRTRQMLSDG
NLSDAVRNSETLRQYTGGYDSVSKLREREYPAFERKVG
>P16268 1.18.6.1~~~anfG~~~Nitrogenase iron-iron protein delta chain~~~
MSTASAAAVVKQKVEAPVHPMDARIDELTDYIMKNCLWQFHSRSWDRERQNAEILKKTKELLCGEPVDLSTSHDRCYWVD
AVCLADDYREHYPWINSMSKEEIGSLMQGLKDRMDYLTITGSLNEELSDKHY
>P16267 1.18.6.1~~~anfK~~~Nitrogenase iron-iron protein beta chain~~~
MTCEVKEKGRVGTINPIFTCQPAGAQFVSIGIKDCIGIVHGGQGCVMFVRLIFSQHYKESFELASSSLHEDGAVFGACGR
VEEAVDVLLSRYPDVKVVPIITTCSTEIIGDDVDGVIKKLNEGLLKEKFPDREVHLIAMHTPSFVGSMISGYDVAVRDVV
RHFAKREAPNDKINLLTGWVNPGDVKELKHLLGEMDIEANVLFEIESFDSPILPDGSAVSHGNTTIEDLIDTGNARATFA
LNRYEGTKAAEYLQKKFEIPAIIGPTPIGIRNTDIFLQNLKKATGKPIPQSLAHERGVAIDALADLTHMFLAEKRVAIYG
APDLVIGLAEFCLDLEMKPVLLLLGDDNSKYVDDPRIKALQENVDYGMEIVTNADFWELENRIKNEGLELDLILGHSKGR
FISIDYNIPMLRVGFPTYDRAGLFRYPTVGYGGAIWLAEQMANTLFADMEHKKNKEWVLNVW
>Q6W4T3 ~~~angR~~~Anguibactin system regulator~~~COG1020
MNQNEHPFAFPETKLPLTSNQNWQLSTQRQRTEKKSITNFTYQEFDYENISRDTLERCLTTIIKHHPIFGAKLSDDFYLH
FPSKTHIETFAVNDLSNALKQDIDKQLADTRSAVTKSRSQAIISIMFSILPKNIIRLHVRFNSVVVDNPSVTLFFEQLTQ
LLSGSPLSFLNQEQTISAYNHKVNNELLSVDLESARWNEYILTLPSSANLPTICEPEKLDETDITRRCITLSQRKWQQLV
TVSKKHNVTPEITLASIFSTVLSLWGHQKYLMMRFDITKINDYTGIIGQFTEPLLVGMSGFEQSFLSLVKNNQKKFEEAY
HYDVKVPVFQCVNKLSNISDSHRYPANITFSSELLNTNHSKKAVWGCRQSANTWLSLHAVIEQEQLVLQWDSQDAIFPKD
MIKDMLHSYTDLLDLLSQKDVNWAQPLPTLLPKHQESIRNKINQQGDLELTKELLHQRFFKNVESTPNALAIIHGQESLD
YITLASYAKSCAGALTEAGVKSGDRVAVTMNKGIGQIVAVLGILYAGAIYVPVSLDQPQERRESIYQGAGINVILINESD
SKNSPSNDLFFFLDWQTAIKSEPMRSPQDVAPSQPAYIIYTSGSTGTPKGVVISHQGALNTCIAINRRYQIGKNDRVLAL
SALHFDLSVYDIFGLLSAGGTIVLVSELERRDPIAWCQAIEEHNVTMWNSVPALFDMLLTYATCFNSIAPSKLRLTMLSG
DWIGLDLPQRYRNYRVDGQFIAMGGATEASIWSNVFDVEKVPMEWRSIPYGYPLPRQQYRVVDDLGRDCPDWVAGELWIG
GDGIALGYFDDELKTQAQFLHIDGHAWYRTGDMGCYWPDGTLEFLGRRDKQVKVGGYRIELGEIEVALNNIPGVQRAVAI
AVGNKDKTLAAFIVMDSEQAPIVTAPLDAEEVQLLLNKQLPNYMVPKRIIFLETFPLTANGKVDHKALTRMTNREKKTSQ
SINKPIITASEDRVAKIWNDVLGPTELYKSSDFFLSGGDAYNAIEVVKRCHKAGYLIKLSMLYRYSTIEAFAIIMDRCRL
APQEEAEL
>Q02219 1.7.2.1~~~aniA~~~Copper-containing nitrite reductase~~~
MKRQALAAMIASLFALAACGGEQAAQAPAETPAASAEAASSAAQATAETPAGELPVIDAVTTHAPEVPPAIDRDYPAKVR
VKMETVEKTMKMDDGVEYRYWTFDGDVPGRMIRVREGDTVEVEFSNNPSSTVPHNVDFHAATGQGGGAAATFTAPGRTST
FSFKALQPGLYIYHCAVAPVGMHIANGMYGLILVEPKEGLPKVDKEFYIVQGDFYTKGKKGAQGLQPFDMDKAVAEQPEY
VVFNGHVGSIAGDNALKAKAGETVRMYVGNGGPNLVSSFHVIGEIFDKVYVEGGKLINENVQSTIVPAGGSAIVEFKVDI
PGSYTLVDHSIFRAFNKGALGQLKVEGAENPEIMTQKLSDTAYAGSGAASAPAASAPAASAPAASASEKSVY
>Q5ZXN6 2.7.1.-~~~ankX~~~Phosphocholine transferase AnkX~~~COG0666
MVKIMPNLPGLYFLQAYPSEEIWRLFVDGRFWSKENGWRGYESREPGCLNAALESLCSIALQVEKSGEEFELSVDLIKRI
HKKCGKKVEELQEKNPGELRTDEPVSFGIPAGRASIKGIEEFLSLVFLTEGGAEFGPGKAGPFGPRFDKNYFKNLNPEQI
PDLAKQIYFDMCKYGHSNTNHFYLAVMKNVDVYLEKITQSYNKEIKTAETLDEKLKIIVKHIRMYEVLHPFRDANGRTFV
NNLLNILLMQQGLPPATFYEPNVFDLYSAEELVVVVKEAIFNTVEIIEQSKRKTPITLYGYHSSLEEQTKFRDMLDSPSY
EKIKHMDFSDLNPEKLHLKTQKCLSSLNEQYPLHRGAIYLSDPGEIKLLLSNRNESQINQQIEQGAPPIYVGKTPAHLAV
ISGNMAMLDELIAKKADLSLQDYDGKTALHYAAECGNMQIMGKILKVVLSQEDAIKVLNIKDNHGKTAFHYAAEFGTPEL
ISALTTTEVIQINEPDNSGSSAITLAYKNHKLKIFDELLNSGADISDELLDAIWARKDKETLGKIIAKNEKILLNKEAFR
IAISLGSVSLVKKFLRAGVDIDIPLTKDKATPLMLSINSGNPKLVSYLLKKGANTRLTDTSGNSVLHYVFYSKAENREAL
ANIITEKDKKLINQPNANGNPPLYNAVVVNDLKMATILLEMGARVDFEDRLGNNILHSAMRRCDLPIILDIVKKDSTLLH
KRNSERRNPFHQALHEMHTFPSSKETEEIHFMNLSDLLLKEGVDLNKKDIKGKTILDIALSKQYFHLCVKLMKAGAHTNI
SSPSKFLKNSDANSILERPFKFKNDLKKELDNNPLIAMAQINDLYVQIKNNRIRTPTGYAPKEGVSFFKGKSNDAKAHDE
VLSVLKELYDSKLTEMLGNLPGEGLEEIKRSQKFFDGELKLLIKNQDISRKVDKKSIQEAVGTSLKLKW
>P77570 2.7.1.170~~~anmK~~~Anhydro-N-acetylmuramic acid kinase~~~COG2377
MKSGRFIGVMSGTSLDGVDVVLATIDEHRVAQLASLSWPIPVSLKQAVLDICQGQQLTLSQFGQLDTQLGQLFADAVNAL
LKEQNLQARDIVAIGCHGQTVWHEPTGVAPHTLQIGDNNQIVARTGITVVGDFRRRDIALGGQGAPLVPAFHHALLAHPT
ERRMVLNIGGIANLSLLIPGQPVGGYDTGPGNMLMDAWIWRQAGKPYDKDAEWARAGKVILPLLQNMLSDPYFSQPAPKS
TGREYFNYGWLERHLRHFPGVDPRDVQATLAELTAVTISEQVLLSGGCERLMVCGGGSRNPLLMARLAALLPGTEVTTTD
AVGISGDDMEALAFAWLAWRTLAGLPGNLPSVTGASQETVLGAIFPANP
>Q9I5Q5 2.7.1.170~~~anmK~~~Anhydro-N-acetylmuramic acid kinase~~~
MPRYLGLMSGTSLDGMDIVLIEQGDRTTLLASHYLPMPAGLREDILALCVPGPDEIARAAEVEQRWVALAAQGVRELLLQ
QQMSPDEVRAIGSHGQTIRHEPARHFTVQIGNPALLAELTGIDVVADFRRRDVAAGGQGAPLVPAFHQALFGDDDTSRAV
LNIGGFSNVSLLSPGKPVRGFDCGPGNVLMDAWIHHQRGEHFDRDGAWAASGQVNHALLASLLADEFFAARGPKSTGRER
FNLPWLQEHLARHPALPAADIQATLLELSARSISESLLDAQPDCEEVLVCGGGAFNTALMKRLAMLMPEARVASTDEYGI
PPAWMEGMAFAWLAHRFLERLPGNCPDVTGALGPRTLGALYPA
>Q8EHB5 2.7.1.170~~~anmK~~~Anhydro-N-acetylmuramic acid kinase~~~COG2377
MNKAYYIGLMSGTSMDGVDAVLVDFAGEQPQLIGTHTETIPTHLLKGLQRLCLPGTDEINRLGRLDRSVGKLFALAVNNL
LAKTKIAKDEIIAIGSHGQTVRHMPNLEVGFTLQIGDPNTIATETGIDVIADFRRKDIALGGQGAPLVPAFHQQTFAQVG
KKRVILNIGGIANITYLPGNSEEVLGFDTGPGNTLIDAWVQQVKNESYDKNGAWAASGKTDPQLLAQLLSHPYFSLAYPK
STGRELFNQAWLEQQLSAFNQLNEEDIQSTLLDLTCHSIAQDILKLAQEGELFVCGGGAFNAELMQRLAALLPGYRIDTT
SALGVDPKWAEGIAFAWLAMRYQLGLPANLPAVTGASREAILGGRFSAK
>A0A0A7XNA3 2.3.1.184~~~anoI~~~Acyl-homoserine-lactone synthase~~~COG3916
MNIIAGFQNNFSEGLYSKFKSYRYKVFVEHLGWELNCPHNEELDQFDKVDTAYVVAQDRDSNIIGCARLLPTTQPYLLGE
IFPQLLNGIPLPCSPEIWELSRFSAVDFSNPPTTASQAVSSPVSIAILQEAINFARAQGAKQLITTSPLGVERLLRAAGF
RAHRAGPPMTIDGYSMFACLIDV
>A0A0N8YGA2 ~~~anoR~~~Transcriptional activator protein AnoR~~~COG2197
MESWQEDLLSAFLVVKNEDELFEVVKSTASKLGFDYCAYGMQSPLSIAEPKTIMLNNYPQAWQKRYIEQRYVKVDPTVQH
CMVSLQPLVWSSQNAKTQEEKDFWEEARSYGLNVGWAQSSRDFIGTRGMLTLARSTDQLSEKEQKAQYTNMYWLTQTVHS
SIAKIVNDVEFSQFNLYLTNREKEALRWTAEGKTSAEIAQIIGVTERTVNFHLCNSMQKLNVNNKISAAIRAVMLGLL
>O85222 ~~~anr~~~Transcriptional activator protein Anr~~~COG0664
MSEPVKLRAHNQAHCKDCSLAPLCLPLSLNLEDMDALDEIVKRGRPLKKGEFLFRQGDGFDSVYAVRSGALKTFSLSDSG
EEQITGFHLPSELVGLSGMDTESHPVSAQALETTSVCEIPFERLDELALQLPQLRRQLMRVMSREIRDDQQMMLLLSKKT
ADERIATFLVNLSARFRARGFSANQFRLSMSRNEIGNYLGLAVETVSRVFTRFQQNELIAAEGKEVHILDPIQLCALAGG
SVEG
>P01548 ~~~~~~Antibacterial substance A~~~
AAGNPSETGGAVATYSTAVGSFLDGTVKVVATGGASRVPGNCGTAAVLECDNPESFDGTRAWGDLSADQGTGEDAPPETA
SLIFAVN
>Q02550 1.1.98.7~~~chuR~~~Serine-type anaerobic sulfatase-maturating enzyme~~~COG0641
MKATTYAPFAKPLYVMVKPVGAVCNLACEYCYYLEKANLYKENPKHVMSDELLEKFIDEYINSQTMPQVLFTWHGGETLM
RPLSFYKKAMELQKKYARGRTIDNCIQTNGTLLTDEWCEFFRENNWLVGVSIDGPQEFHDEYRKNKMGKPSFVKVMQGIN
LLKKHGVEWNAMAVVNDFNAEYPLDFYNFFKEIDCHYIQFAPIVERIVSHQDGRHLASLAEGKEGALADFSVSPEQWGNF
LCTIFDEWVKEDVGKFFIQIFDSTLANWMGEQPGVCTMAKHCGHAGVMEFNGDVYSCDHFVFPEYKLGNIYSQTLVEMMH
SERQHNFGTMKYQSLPTQCKECDFLFACNGECPKNRFSRTADGEPGLNYLCKGYYQYFQHVAPYMDFMKKELMNQQAPAN
IMKALKDGSLKIEY
>Q0TTH1 1.8.98.7~~~~~~Cysteine-type anaerobic sulfatase-maturating enzyme~~~COG0641
MPPLSLLIKPASSGCNLKCTYCFYHSLSDNRNVKSYGIMRDEVLESMVKRVLNEANGHCSFAFQGGEPTLAGLEFFEKLM
ELQRKHNYKNLKIYNSLQTNGTLIDESWAKFLSENKFLVGLSMDGPKEIHNLNRKDCCGLDTFSKVERAAELFKKYKVEF
NILCVVTSNTARHVNKVYKYFKEKDFKFLQFINCLDPLYEEKGKYNYSLKPKDYTKFLKNLFDFWYEDFLNGNRVSIRYF
DGLLETILLGKSSSCGMNGTCTCQFVVESDGSVYPCDFYVLDKWRLGNIQDMTMKELFETNKNHEFIKLSFKVHEECKKC
KWFRLCKGGCRRCRDSKEDSALELNYYCQSYKEFFEYAFPRLINVANNIK
>Q9X758 1.1.98.7~~~atsB~~~Serine-type anaerobic sulfatase-maturating enzyme~~~
MLNIAALRQQQIPLAAEPRSPVPFHILMKPIGPACNLACRYCYYPQDETPVNKMDDARLEQFIRRYIAAQPAGAREINFV
WQGGEPLLAGLSFYKKALALQARYAPDGVTISNSLQTNGTLINDAWCRLFREHGFIIGLGLEGNEALQDYHRPDKRGRST
WSAALRGIDLLHQHQVDFNLLVVVHNEMAAHAAAIYVRLVSLGARYLQFQPLMSEGAALREGYQLSADNWGRFMVGIWRQ
WRKRCDRGRVFVINIEQAWAQYFTHTSGSCVHSARCGSNLVMESDGQLYACDHLINTEHRLGRLDEQTLAAAVDASVQLP
FGQQKSLRRECQTCSVKMVCQGGCPAHLNAAGNNRLCGGYYRFFSDILAPLRPFSRDLNGLKAWRAAFVGTAHTA
>P9WQM9 ~~~ansP1~~~L-asparagine permease 1~~~COG1113
MSAASQRVGAFGEEAGYHKGLKPRQLQMIGIGGAIGTGLFLGAGGRLAKAGPGLFLVYGVCGVFVFLILRALGELVLHRP
SSGSFVSYAREFFGEKAAYAVGWMYFLHWAMTSIVDTTAIATYLQRWTIFTVVPQWILALIALTVVLSMNLISVEWFGEL
EFWAALIKVLALMAFLVVGTVFLAGRYPVDGHSTGLSLWNNHGGLFPTSWLPLLIVTSGVVFAYSAVELVGTAAGETAEP
EKIMPRAINSVVARIAIFYVGSVALLALLLPYTAYKAGESPFVTFFSKIGFHGAGDLMNIVVLTAALSSLNAGLYSTGRV
MHSIAMSGSAPRFTARMSKSGVPYGGIVLTAVITLFGVALNAFKPGEAFEIVLNMSALGIIAGWATIVLCQLRLHKLANA
GIMQRPRFRMPFSPYSGYLTLLFLLVVLVTMASDKPIGTWTVATLIIVIPALTAGWYLVRKRVMAVARERLGHTGPFPAV
ANPPVRSRD
>P9WQM7 ~~~ansP2~~~L-asparagine permease 2~~~COG1113
MPPLDITDERLTREDTGYHKGLHSRQLQMIALGGAIGTGLFLGAGGRLASAGPGLFLVYGICGIFVFLILRALGELVLHR
PSSGSFVSYAREFYGEKVAFVAGWMYFLNWAMTGIVDTTAIAHYCHYWRAFQPIPQWTLALIALLVVLSMNLISVRLFGE
LEFWASLIKVIALVTFLIVGTVFLAGRYKIDGQETGVSLWSSHGGIVPTGLLPIVLVTSGVVFAYAAIELVGIAAGETAE
PAKIMPRAINSVVLRIACFYVGSTVLLALLLPYTAYKEHVSPFVTFFSKIGIDAAGSVMNLVVLTAALSSLNAGLYSTGR
ILRSMAINGSGPRFTAPMSKTGVPYGGILLTAGIGLLGIILNAIKPSQAFEIVLHIAATGVIAAWATIVACQLRLHRMAN
AGQLQRPKFRMPLSPFSGYLTLAFLAGVLILMYFDEQHGPWMIAATVIGVPALIGGWYLVRNRVTAVAHHAIDHTKSVAV
VHSADPI
>O85673 1.14.12.1~~~antA~~~Anthranilate 1,2-dioxygenase large subunit~~~COG4638
MTARNLAEWQNFVQGCIDFRPNDGVYRIARDMFTEPELFELEMELIFEKVWIYACHESEIPNNNDFVTVQIGRQPMIVSR
DGKGELHAMVNACEHRGATLTRVAKGNQSVFTCPFHAWCYKSDGRLVKVKAPGEYCEDFDKSSRGLKQGRIASYRGFVFV
SLDTQATDSLEDFLGDAKVFLDLMVDQSPTGELEVLQGKSAYTFAGNWKLQNENGLDGYHVSTVHYNYVSTVQHRQQVNA
AKGDELDTLDYSKLGAGDSETDDGWFSFKNGHSVLFSDMPNPTVRPGYNTVMPYLVEKFGEKRAEWAMHRLRNLNLYPSL
FFMDQISSQLRIIRPVAWNKTEVISQCIGVKGESSEARRNRIRQFEDFFNVSGLGTPDDLVEFREQQKGFQGRIERWSDI
SRGYHQWTYGPTQNSQDLGIEPVITGREFTHEGLYVNQHGQWQRLILDGLNKKALKMHDVTFDNQSVMDEV
>O85674 1.14.12.1~~~antB~~~Anthranilate 1,2-dioxygenase small subunit~~~COG5517
MSLELHFAVSQFLYKKAELCDNYDWDAYIDLYDEDSEYHIPQWIDDHNYVQDPNQGLSYIYYEDRSGLEDRVFRIRTGKA
ASATPLPRTQHNIHNVQVKTLEDGLIEAKVSWRTLYNRQGLEGCFYGRATYVLRPTEDSFRIRRQHSVLLNDKIDSVLDF
YHV
>O85675 ~~~antC~~~Anthranilate 1,2-dioxygenase electron transfer component~~~COG0543
MNHSVALNFADGKTFFIAVQEDELLLDAAVRQGINLPLDCREGVCGTCQGTCETGIYEQEYVDEDALSERDLAKRKMLAC
QTRVKSNAAFYFDHHSSICNAGETLKIATVVTGVELVSETTAILHLDASQHVKQLDFLPGQYARLQIPDTDDWRSYSFAN
RPNASNQLQFLIRLLPNGVMSNYLRERCQVGQTLIMEAPLGSFYLREVERPLVFIAGGTGLSAFLGMLDNIAEQPNQPSV
HLYYGVNTEADLCEQKRLTTYAERIKNFSYHPIISKASEQWQGKSGFIHEHLDKNQLSEQSFDMYLCGPPPMIEAVKTWL
DEQAIADCHIYSEKFLQSNTAKT
>Q7WY63 ~~~antE~~~Protein AntE~~~
MPQPFFVLQKTLAGLRNGLLYHLENQFGICQQGEKPTRQKNSLPSALKKRKHCRPLRSQSNEKSVCRAIRKTKVCFLHEK
DRVQSSFGLASLIEHNHL
>P9WQ15 1.4.3.-~~~aofH~~~Putative flavin-containing monoamine oxidase AofH~~~COG1231
MRLTRAVTNPPWTVDVVVVGAGFAGLAAARELTRQGHEVLVFEGRDRVGGRSLTGRVAGVPADMGGSFIGPTQDAVLALA
TELGIPTTPTHRDGRNVIQWRGSARSYRGTIPKLSLTGLIDIGRLRWQFERIARGVPVAAPWDARRARELDDVSLGEWLR
LVRATSSSRNLMAIMTRVTWGCEPDDVSMLHAARYVRAAGGLDRLLDVKNGAQQDRVPGGTQQIAQAAAAQLGARVLLNA
AVRRIDRHGAGVTVTSDQGQAEAGFVIVAIPPAHRVAIEFDPPLPPEYQQLAHHWPQGRLSKAYAAYSTPFWRASGYSGQ
ALSDEAPVFITFDVSPHADGPGILMGFVDARGFDSLPIEERRRDALRCFASLFGDEALDPLDYVDYRWGTEEFAPGGPTA
AVPPGSWTKYGHWLREPVGPIHWASTETADEWTGYFDGAVRSGQRAAAEVAALL
>P9WM25 ~~~aosR~~~Oxidative stress regulator AosR~~~
MPPVCGRRCSRTGEIRGYSGSIVRRWKRVETRDGPRFRSSLAPHEAALLKNLAGAMIGLLDDRDSSSPSDELEEITGIKT
GHAQRPGDPTLRRLLPDFYRPDDLDDDDPTAVDGSESFNAALRSLHEPEIIDAKRVAAQQLLDTVPDNGGRLELTESDAN
AWIAAVNDLRLALGVMLEIGPRGPERLPGNHPLAAHFNVYQWLTVLQEYLVLVLMGSR
>Q8P8J2 2.1.3.9~~~argF'~~~N-acetylornithine carbamoyltransferase~~~COG0078
MSLKHFLNTQDWSRAELDALLTQAALFKRNKLGSELKGKSIALVFFNPSMRTRTSFELGAFQLGGHAVVLQPGKDAWPIE
FNLGTVMDGDTEEHIAEVARVLGRYVDLIGVRAFPKFVDWSKDREDQVLKSFAKYSPVPVINMETITHPCQELAHALALQ
EHFGTPDLRGKKYVLTWTYHPKPLNTAVANSALTIATRMGMDVTLLCPTPDYILDERYMDWAAQNVAESGGSLQVSHDID
SAYAGADVVYAKSWGALPFFGNWEPEKPIRDQYQHFIVDERKMALTNNGVFSHCLPLRRNVKATDAVMDSPNCIAIDEAE
NRLHVQKAIMAALVGQSRP
>Q9KD90 3.6.1.41~~~~~~Bis(5'-nucleosyl)-tetraphosphatase, symmetrical~~~COG1713
MNRGKALQLVKPHLTEHRYQHTIGVMETAIDLAKLYGADQQKAELAAIFHDYAKFRDKNEMRTLIREKLSQQDILFYGDE
LLHAPCGAYYVREEVGIEDEDVLQAIRFHTTGRPNMSLLEKIIFLADYIEPNRQFPGVEKVRTQAKTDLNGAIISSLVNT
ITFLLKKNQPIYPDTLATYNQLLLEQK
>Q2G297 3.6.1.41~~~yqeK~~~Bis(5'-nucleosyl)-tetraphosphatase, symmetrical~~~COG1713
MNIEKAKRLAKEKLPEKRYNHSLRVAETAIKLAEIYDGDTSKVELAGVLHDFCKYDDLGKMYQIVRQYELGNDLLSYGSE
ILHGPVCAAIMEHEYGINDEEVLMAIKYHTTGRQQMTKTEKLIFIADYIEPGRTIPGVDDIRDMAYNQGSLDKTIYEISK
RTVLFLIQKDITVYNKTIDCLNYYNYSDERIKDD
>Q8DY32 3.6.1.41~~~~~~Bis(5'-nucleosyl)-tetraphosphatase, symmetrical~~~
MTYKDYTGLDRTELLSKVRHMMSDKRFNHVLGVERAAIELAERYGYDKEKAGLAALLHDYAKELSDDEFLRLIDKYQPDP
DLKKWGNNIWHGLVGIYKIQEDLAIKDQDILAAIAKHTVGSAQMSTLDKIVYVADYIEHNRDFPGVEEARELAKVDLNKA
VAYETARTVAFLASKAQPIYPKTIETYNAYIPYLD
>P9WMK9 2.7.7.53~~~~~~AP-4-A phosphorylase~~~COG0537
MSDEDRTDRATEDHTIFDRGVGQRDQLQRLWTPYRMNYLAEAPVKRDPNSSASPAQPFTEIPQLSDEEGLVVARGKLVYA
VLNLYPYNPGHLMVVPYRRVSELEDLTDLESAELMAFTQKAIRVIKNVSRPHGFNVGLNLGTSAGGSLAEHLHVHVVPRW
GGDANFITIIGGSKVIPQLLRDTRRLLATEWARQP
>Q7VU61 ~~~apaG~~~Protein ApaG~~~COG2967
MSNRERPVKPYDLTVSVTPRYVPEQSDPSQQQYVFAYTVRITNTGSHPAQVISRHWIITDGEERVQEVRGLGVVGQQPLL
APGETFEYTSGCPLPTPIGTMRGTYHCVGENGIPFEVPIAEFLLAMPRTLH
>Q8EB92 ~~~apaG~~~Protein ApaG~~~COG2967
MSALDNSIRVEVKTEYIEQQSSPEDEKYLFSYTITIINLGEQAAKLETRHWIITDANGKTSEVQGAGVVGETPTIPPNTA
YQYTSGTVLDTPFGIMYGTYGMVSESGEHFNAIIKPFRLATPGLLH
>Q9KUS3 ~~~apaG~~~Protein ApaG~~~COG2967
MDVSLPCIKIQVQTRYIEEQSNPEYQRFVFAYLITIKNLSSQTVQLMSRRWLITDADGKQTVVEGDGVVGEQPRIKANDE
YTYSSGTALDTPVGVMQGQYLMIDEQGESFTVEIEPFRLAVPHVLN
>Q8PP26 ~~~apaG~~~Protein ApaG~~~COG2967
MQDDPRYRVEVEVSPRFLAHQSTPDEGRYAFAYSIRIQNAGAVPARLVARHWQITDGNGRTEQVDGEGVVGEQPWLRPGE
AFHYTSGVLLETEQGQMQGHYDMVADDGTEFIAPIAAFVLSVPRTLH
>Q9I3T5 3.5.1.-~~~aphA~~~Acetylpolyamine amidohydrolase 1~~~
MLSVYSDDHRLHFGQSELVDGKLQPCFEMPSRADTVLARVKSQNLGEVIAPKDFGREPLLRLHDAAYLDFLQGAWARWTA
EGHSGDLVSTTFPGRRLRRDGPIPTALMGELGYYSFDTEAPITAGTWQAIYSSAQVALTAQEHMRQGARSAFALCRPPGH
HAGGDFMGGYCFLNNAAIATQAFLDQGARRVAILDVDYHHGNGTQDIFYRRDDVLFASIHGDPRVEYPYFLGYADERGEG
AGEGCNHNYPLAHGSGWDLWSAALDDACVRIAGYAPDALVISLGVDTYKEDPISQFRLDSPDYLRMGERIARLGLPTLFI
MEGGYAVEAIGINAVNVLQGYEGAAR
>Q9I6H0 3.5.1.-~~~aphB~~~Acetylpolyamine amidohydrolase 2~~~
MLTIYSDDHRLHHGRHELIGGQFTPCFEKPSRADMVLDRVKAVGLGEVRAPRDFGLEPIRRVHSEGFVRFLQNAWQDWLA
TGRSHDMLPIAWPTRRLRQTEPDNIDGRLGYYSFDAGAPITAGTWQAITSSANVALSGQSELANGARSVFSLCRPPGHHA
AADYMGGYCFFNNAAIAAQAFLDRGAGRVAILDVDYHHGNGTQDIFYDRADVLFTSIHGDPRFEYPYFLGYADEKGNGVG
TGYNFNYPLAAGSDWATWSQALQAAIRQIQAYAADALIVSLGVDTFKEDPISQFRLDSPDYLRMGEAIGKLGLATLFVME
GGYAVEEIGINAVNVLQGFEGVHR
>Q3JUN4 3.5.1.-~~~aphA~~~Acetylpolyamine amidohydrolase~~~
MLTYFHPDQSLHHPRTYFSRGRMRMPQEVPERAARLVAAAFAMGFPVREPDDFGIAPIAAVHDTHYLRFLETVHREWKAM
PEDWGDEAMSNIFVREPNALRGVLAQAARHLADGSCPVGEHTWRAAYWSAQSALAAAAAVRDGAPAAYALCRPPGHHARV
DAAGGFCYLNNAAIAAQALRARHARVAVLDTDMHHGQGIQEIFYARRDVLYVSIHGDPTNFYPAVAGFDDERGAGEGLGY
NVNLPMPHGSSEAAFFERVDDALRELRRFAPDALVLSLGFDVYRDDPQSQVAVTTDGFGRLGHLIGALRLPTVIVQEGGY
HIESLEANARSFFGGFGALRG
>P05637 3.6.1.41~~~apaH~~~Bis(5'-nucleosyl)-tetraphosphatase [symmetrical]~~~COG0639
MATYLIGDVHGCYDELIALLHKVEFTPGKDTLWLTGDLVARGPGSLDVLRYVKSLGDSVRLVLGNHDLHLLAVFAGISRN
KPKDRLTPLLEAPDADELLNWLRRQPLLQIDEEKKLVMAHAGITPQWDLQTAKECARDVEAVLSSDSYPFFLDAMYGDMP
NNWSPELRGLGRLRFITNAFTRMRFCFPNGQLDMYSKESPEEAPAPLKPWFAIPGPVAEEYSIAFGHWASLEGKGTPEGI
YALDTGCCWGGTLTCLRWEDKQYFVQPSNRHKDLGEAAAS
>Q48935 3.5.1.-~~~aphA~~~Acetylpolyamine amidohydrolase~~~
MRVIFSEDHKLRNAKTELYGGELVPPFEAPFRAEWILAAVKEAGFDDVVAPARHGLETVLKVHDAGYLNFLETAWDRWKA
AGYKGEAIATSFPVRRTSPRIPTDIEGQIGYYCNAAETAISPGTWEAALSSMASAIDGADLIAAGHKAAFSLCRPPGHHA
GIDMFGGYCFINNAAVAAQRLLDKGAKKIAILDVDFHHGNGTQDIFYERGDVFFASLHGDPAEAFPHFLGYAEETGKGAG
AGTTANYPMGRGTPYSVWGEALTDSLKRIAAFGAEAIVVSLGVDTFEQDPISFFKLTSPDYITMGRTIAASGVPLLVVME
GGYGVPEIGLNVANVLKGVAG
>Q83SQ2 3.6.1.41~~~apaH~~~Bis(5'-nucleosyl)-tetraphosphatase, symmetrical~~~
MATYLIGDVHGCYDELIALLHKVEFTPGKDTLWLTGDLVARGPGSLDVLRYVKSLGDSVRLVLGNHDLHLLAVFAGISRN
KPKDRLTPLLEAPDADELLNWLRRQPLLQIDEEKKLVMAHAGITPQWDLQTAKECARDVEAVLSSDSYPFFLDAMYGDMP
NNWSPELRGLGRLRFITNAFTRMRFCFPNGQLDMYSKESPEEAPAPLKPWFAIPGPVAEEYSIAFGHWASLEGKGTPEGI
YALDTGCCWGGSLTCLRWEDKQYFVQPSNRHKDLGEAAAS
>P80069 ~~~apa~~~Alanine and proline-rich secreted protein Apa~~~
MHQVDPNLTRRKGRLAALAIAAMASASLVTVAVPATANADPEPAPPVPTTAASPPSTAAAPPAPATPVAPPPPAAANTPN
AQPGDPNAAPPPADPNAPPPPVIAPNAPQPVRIDNPVGGFSFALPAGWVESDAAHLDYGSALLSKTTGDPPFPGQPPPVA
NDTRIVLGRLDQKLYASAEATDSKAAARLGSDMGEFYMPYPGTRINQETVSLDANGVSGSASYYEVKFSDPSKPNGQIWT
GVIGSPAANAPDAGPPQRWFVVWLGTANNPVDKGAAKALAESIRPLVAPPPAPAPAPAEPAPAPAPAGEVAPTPTTPTPQ
RTLPA
>P9WIR7 ~~~apa~~~Alanine and proline-rich secreted protein Apa~~~
MHQVDPNLTRRKGRLAALAIAAMASASLVTVAVPATANADPEPAPPVPTTAASPPSTAAAPPAPATPVAPPPPAAANTPN
AQPGDPNAAPPPADPNAPPPPVIAPNAPQPVRIDNPVGGFSFALPAGWVESDAAHFDYGSALLSKTTGDPPFPGQPPPVA
NDTRIVLGRLDQKLYASAEATDSKAAARLGSDMGEFYMPYPGTRINQETVSLDANGVSGSASYYEVKFSDPSKPNGQIWT
GVIGSPAANAPDAGPPQRWFVVWLGTANNPVDKGAAKALAESIRPLVAPPPAPAPAPAEPAPAPAPAGEVAPTPTTPTPQ
RTLPA
>P50863 ~~~salA~~~Iron-sulfur cluster carrier protein~~~COG0489
MIREDEVRKLVGEMREPFLQRPLGELDAVKEIKIKPEKRHISVKVALAKTGTAEQMQIQQEIVNVLKGAGAETVGLRFEE
LPEETVAKFRAPSAEKKTLLNMDNPPVFLAVASGKGGVGKSTVSVNLAISLARLGKKVGLIDADIYGFSVPDMMGITVRP
TIEGEKLLPVERFGVKVMSMGFFVEENAPVVWRGPMLGKMLNNFFHEVEWGEVDYIVLDLPPGTGDVALDVHTMLPSCKE
IIVSTPHPTAAFVAARAGSMAIKTDHEVVGVIENMAYYESAKTGEREYVFGKGGGDKLAEELNVPLLGRIPLKQPDWDKD
QFAPSVYDENHPIGEIYQDIAKKIDAKMSVQV
>P9WJN7 ~~~mrp~~~Iron-sulfur cluster carrier protein~~~COG0489
MSGTRDGDLNAAIRTALGKVIDPELRRPITELGMVKSIDTGPDGSVHVEIYLTIAGCPKKSEITERVTRAVADVPGTSAV
RVSLDVMSDEQRTELRKQLRGDTREPVIPFAQPDSLTRVYAVASGKGGVGKSTVTVNLAAAMAVRGLSIGVLDADIHGHS
IPRMMGTTDRPTQVESMILPPIAHQVKVISIAQFTQGNTPVVWRGPMLHRALQQFLADVYWGDLDVLLLDLPPGTGDVAI
SVAQLIPNAELLVVTTPQLAAAEVAERAGSIALQTRQRIVGVVENMSGLTLPDGTTMQVFGEGGGRLVAERLSRAVGADV
PLLGQIPLDPALVAAGDSGVPLVLSSPDSAIGKELHSIADGLSTRRRGLAGMSLGLDPTRR
>Q8ZNN5 ~~~apbC~~~Iron-sulfur cluster carrier protein~~~
MNEQSQAKSPDTLRAMVAGTLANFQHPTLKHNLTTLKALHHVAWMDDTLHVELVMPFVWNSAFEVLKEQCSADLLRITGA
KAIDWKLSYNIATLKRVKNQPGINGVKNIIAVSSGKGGVGKSSTAVNLALALAAEGAKVGVLDADIYGPSIPTMLGAEDQ
RPTSPDGTHMAPIMSHGLATNSIGYLVTDDNAMVWRGPMASKALMQMLQETLWPDLDYLVLDMPPGTGDIQLTLAQNIPV
TGAVVVTTPQDIALIDAKKGIVMFEKVEVPVLGIVENMSMHICSNCGHHEPIFGTGGAQKLAEKYHTQLLGQMPLHISLR
EDLDRGTPTVVSRPESEFTAIYRELADRVAAQLYWQGEVIPGEIAFRAV
>P0AB85 2.7.1.180~~~apbE~~~FAD:protein FMN transferase~~~COG1477
MEISFTRVALLAAALFFVGCDQKPQPAKTHATEVTVLEGKTMGTFWRASIPGIDAKRSAELKEKIQTQLDADDQLLSTYK
KDSALMRFNDSQSLSPWPVSEAMADIVTTSLRIGAKTDGAMDITVGPLVNLWGFGPEQQPVQIPSQEQIDAMKAKTGLQH
LTVINQSHQQYLQKDLPDLYVDLSTVGEGYAADHLARLMEQEGISRYLVSVGGALNSRGMNGEGLPWRVAIQKPTDKENA
VQAVVDINGHGISTSGSYRNYYELDGKRLSHVIDPQTGRPIEHNLVSVTVIAPTALEADAWDTGLMVLGPEKAKEVVRRE
GLAVYMITKEGDSFKTWMSPQFKSFLVSEKN
>P41780 2.7.1.180~~~apbE~~~FAD:protein FMN transferase~~~
MKMTFCRAVCLAAAFLLMGCDEAPETTTASPAAQVLEGKTMGTLWRVSVVGIDAKRAAELQTKIQTQLDADDWLLSTYKN
DSALMRFNHSRSLAPWPVSEAMADIVTSALRIGAKTDGAMDITVGPLVNLWGFGPDRQPLHIPTPAQIDAAKAKTGLQHL
QVIDRAGHQFLQKDLPDLYVDLSTVGEGYAADHLARLMEQEGIARYLVSVGGALSSRGMNAQGQPWRVAIQKPTDRENAV
QAIVDINGHGISTSGSYRNYYELDGKRVSHVIDPQTGRPIEHNLVSVTVIAPTALEADGWDTGLMVLGTQKAQEVVRREG
LAVFMIMKEGEGFKTWMSPQFKTFLVSDKN
>O83774 2.7.1.180~~~apbE~~~FAD:protein FMN transferase~~~COG1477
MKSSCVYWRIGVLVCILCGVGSCGGRARVREYSRAELVIGTLCRVRVYSKRPAAEVHAALEEVFTLLQQQEMVLSANRDD
SALAALNAQAGSAPVVVDRSLYALLERALFFAEKSGGAFNPALGAXVKLWNIGFDRAAVPDPDALKEALTRCDFRQVHLR
AGVSVGAPHTVQLAQAGMQLDLGAIAKGFLADKIVQLLTAHALDSALVDLGGNIFALGLKYGDVRSAAAQRLEWNVGIRD
PHGTGQKPALVVSVRDCSVVTSGAYERFFERDGVRYHHIIDPVTGFPAHTDVDSVSIFAPRSTDADALATACFVLGYEKS
CALLREFPGVDALFIFPDKRVRASAGIVDRVRVLDARFVLER
>A5F5Y3 2.7.1.180~~~apbE~~~FAD:protein FMN transferase~~~COG1477
MRNWLVALASLLLLAGCEKPAEQVHLSGPTMGTTYNIKYIQQPGIADSKTLQTEIDRLLEEVNDQMSTYRKDSELSRFNQ
HTSSEPFAVSTQTLTVVKEAIRLNGLTEGALDVTVGPLVNLWGFGPEARPDVVPTDEELNARRAITGIEHLTIEGNTLSK
DIPELYVDLSTIAKGWGVDVVADYLQSQGIENYMVEIGGEIRLKGLNRDGVPWRIAIEKPSVDQRSVQEIIEPGDYAIAT
SGDYRNYFEQDGVRYSHIIDPTTGRPINNRVVSVTVLDKSCMTADGLATGLMVMGEERGMAVAEANQIPVLMIVKTDDGF
KEYASSSFKPFLSK
>Q5P5G2 6.4.1.8~~~apc1~~~Acetophenone carboxylase alpha subunit~~~COG0145
MYTVDIDTGGTMTDALVSDGEQRHAIKVDTTPHDYTVSFNGCLSEAAKRLGYPSTEAFLAKVGMIRWSSTITTNVLGERR
GSKVGLLVTEGNEENLYGTVQSPVVGELVDERNIIGLPSNPTAVDILSGVKQLLEGGVRRICVCLANAFPDNGAEREIKA
VIEDQYPDHIIGAVPVLLGSEMAPLRHDQTRVHYSLMNAYTHTQLATSLFKAEDLLRDDHNWTGPLLIGNTNGGVARIGK
TKSVDTIESGPVFGTFGGAYMARLYGLKDVVCFDVGGTTTKASIIRDGQPMFQRGGELMEVPVQSSFAMLRSAVVGGGSI
ARVRDKSVTLGPESMGAAPGPACYGLGGNEATLTDALLALGYLDPNNFLGGRRQLKVDLARAAIERNVAKPLGVSLEVAA
LSIRDEAVAMMTELLQATLAEAKLTAQDAALFAFGGNGPMFAAFVAERLGVQAAYAFNLGPVFSAFGSAISDVVHVYERG
VDLRWNATVKGQLLPTLDALQTQAERDLKGESFDPAKAAYVWELDFGTTEAEVSTVRAELAQSAASTVLDALTQAVTAAG
VASLPLLGARLSSRFVVGAHGMKKRADRVPAEAPASREMRFNGASEAASPVYRWETMNVGDIAVGPAVVNGSTLTCPIPP
RWQLRVDDYGNAELSRAQ
>P07326 ~~~apcB~~~Allophycocyanin beta chain~~~
MQDAITSVINSSDVQGKYLDTAALEKLKGYFATGELRVRAATTISANAAAIVKEAVAKSLLYSDITRPGGNMYTTRRYAA
CIRDLDYYLRYSTYAMLAGDPSILDERVLNGLKETYNSLGVPVGATVQAIQAMKEVTASLVGPDAGKEMGVYFDYISSGL
S
>P00317 ~~~apcB~~~Allophycocyanin beta chain~~~
AQDAITAVINSADVQGKYLDTAALEKLKAYFSTGELRVRAATTISANAAAIVKEAVAKSLLYSDITRPGGNMYTTRRYAA
CIRDLDYYLRYATYAMLAGDPSILDERVLNGLKETYNSLGVPVGATVQAIQAIKEVTASLVGAKAGKEMGIYLDYISSGL
S
>Q5P5G3 6.4.1.8~~~apc2~~~Acetophenone carboxylase beta subunit~~~COG4647
MYERIRFTEYLDLDLNDEHWYCHDCGTKLISARESYKKGCLVAERRPHEIHNPVIEGEYSFAPDENWVRILEFYCPGCTR
QIETEYLPPGHPITVDIEVDIDSLKARLKKGVIVIKDGKLTKPEAEVLA
>P72505 ~~~apcB~~~Allophycocyanin beta chain~~~
MQDAITSVINSSDVQGKYLDRSAIQKLKAYFATGELRVRAATTISANAANIVKEAVAKSLLYSDITRPGGNMYTTRRYAA
CIRDLDYYLRYATYAMLAGDPSILDERVLNGLKETYNSLGVPIGATVQAIQAMKEVTAGLVGADAGKEMGIYFDYICSGL
S
>P00318 ~~~apcB~~~Allophycocyanin beta chain~~~
MQDAITAVINSSDVQGKYLDTAALEKLKSYFSTGELRVRAATTIAANAAAIVKEAVAKSLLYSDITRPGGNMYTTRRYAA
CIRDLDYYLRYATYAMLAGDPSILDERVLNGLKETYNSLGVPISATVQAIQAMKEVTASLVGPDAGKEMGVYFDYICSGL
S
>P16571 ~~~apcB1~~~Allophycocyanin beta chain~~~
MAQDAITSVINSADVQGKYLDSAALDKLKGYFGTGELRVRAASTISANAAAIVKEAVAKSLLYSDVTRPGGNMYTTRRYA
ACIRDLDYYLRYATYAMLAGDPSILDERVLNGLKETYNSLGVPVSSTVQAIQAIKEVTASLVGSDAGKEMGVYLDYISSG
LS
>P80557 ~~~apcB~~~Allophycocyanin subunit beta~~~
MAQDAITAVINSADVQGKYLDTAALEKLKAYFSTGELRVRAATTISANAAAIVKEAVAKSLLYSDITRPGGNMYTTRRYA
ACIRDLDYYLRYATYAMLAGDPSILDERVLNGLKETYNSLGVPVGATVQAIQAIKEVTASLVGADAGKEMGIYLDYISSG
LS
>O68970 ~~~apcB~~~Allophycocyanin beta subunit~~~
MQDAITSVINSADVQGKYLDGSAMDKLKAYFTTGALRVRAASTISANAAAIVKEAVAKSLLYSDVTRPGGNMYTTRRYAA
CIRDLDYYLRYATYAMLAGDPSILDERVLNGLKETYNSLGVPVGSTVQAIQAMKEVTAGLVGADAGREMGVYFDYICSGL
S
>P06113 ~~~apcB~~~Allophycocyanin beta chain~~~
MQDAITAVINASDVQGKYLDSSALDRLKSYFQSGELRVRAAATISANSALIVKEAVAKSLLYSDITRPGGNMYTTRRYAA
CIRDLEYYLRYATYAMLAGDTSILDERVLNGLKETYNSLGVPIGATVQAIQAIKEVTASLVGPDAGREMGVYLDYISSGL
S
>Q01952 ~~~apcB~~~Allophycocyanin beta chain~~~
MQDAITAVINSADVQGKYLDGAAMDKLKSYFASGELRVRAASVISANAATIVKEAVAKSLLYSDVTRPGGNMYTTRRYAA
CIRDLDYYLRYATYAMLAGDASILDERVLNGLKETYNSLGVPISSTVQAIQAIKEVTASLVGADAGKEMGVYLDYICSGL
S
>P50031 ~~~apcB~~~Allophycocyanin beta chain~~~
MQDAITAVINASDVQGKYLDTAAMEKLKAYFATGELRVRAASVISANAANIVKEAVAKSLLYSDITRPGGNMYTTRRYAA
CIRDLDYYLRYATYAMLAGDPSILDERVLNGLKETYNSLGVPIAATVQAIQAMKEVTASLVGADAGKEMGIYFDYICSGL
S
>Q5P5G4 6.4.1.8~~~apc3~~~Acetophenone carboxylase gamma subunit~~~COG0145
MSSLTNQDAINSIDIDVGGTFTDFVLTLDGERHIAKCPTTPHDLSIGFLNAVEAGGDKVGLSVEELLPRIDIIRYSTTVA
LNRLLQRQGPRIGLLTTEGHEDAILIGRGAQWTDGQRVAERRNIAVQNKPLPLIERDLILGVRERIDSSGSVVRPLDEED
VRTKLRMLMDRGARAIVVSLLWSFMNPAHEKRVREIIREEYKEYHIGFVPVVMSHSVVSKIGEYERTMTAVLDAYLQRSM
QNDIGATWDKLRAKGYHGAFLMIHNSGGSADIFKTPASRTFNGGPVAGLMGSAYFANKLGYKNVVAGDVGGTSFDVALVV
ESSVRNYTFRPVIDKWMVNVTMMQTISVGSGGGSIAKVDRSGTRLEVGPRSAGSMPGPVCYDLGGTEPTVTDADVVLGYI
NPDTYYGGRMPLNKAKAEKAIREKIAQPLGIETIEAAALIRYIVDENMASAIKREVHMRGYHPEDFVLFAFGGAGPTHMA
GLKGDIPKAVVFPAAPVFCAMGSSIMDIVHMYEQSRRMVFMEPGTEKFVVDYEHFNQTVDTMIERARQELRSEGLEVDDA
SFGLELDMLYGGQVNLKRMSSPLLHIRTAEDALKVYQAFETEFSEAFSPLVVNKPGGVFLDNFVLRVTVPTWKPPIPEYP
LQGTDPSAAFLGKRKAYWPETKHWADTPTYQFELLQAGNVIDGPAIVEAELTTIVVPPRQRLSIDTHGLAILEAIDPAPP
TKRVSAAAAAIV
>Q5P5G5 6.4.1.8~~~apc4~~~Acetophenone carboxylase delta subunit~~~COG0146
MAIPTLEQKLTWLKPAPASSRELDLAAQIDPAQFEIGFQRTNDILDEGMDVFVRSCRCAMGVAGDSLVAIMTADGDIVNG
SCGTYLHAVIPPLIIKYILETYGDEIRDGDLWFANDAVYGGVHNPDQMVCMPVYYEGKLVAWTAALVHTTETGAIEPGGM
PVSATTRFEEGMNLPPMRIGENFKLREDVVSMFVAFGLRAPSMIAVDLKARCTTADRVRTRIIELCEREGADYVTGLFRK
MLQVAEAGARELIEQWPDGKYRCVTFSDAVGLKQGLVRSCYMTLEKKGDRMLVDLSETGPETPSPYNAHPQAAIAHFSNY
IYEYLFHSLPISNGTFANIDFKFGKNTCLSPDPRAATSCSVMISTGVMSAVHNACAKAMFSTSLWKQSGASMGNGGNALV
LAGQNQWGSSFADMLAYSINTEGQGARPTEDGMDAFGFPWCVFGRAPNTESVENEFPLLVPLSNHWKDSCGHGKYRGGVG
TAQVWVAHHVPELYMMAIADNTKLQTPQPLFGGYAPCTVPGIGIRNANIKELMAEGSDKIKLDVETLLAERTIDGKYEIE
FQGRSVRPYSNGEVVTFAFSCGGTGYGDPLDRDPKSVEVDLLKGVLTEQTAQNIYKVKWDANLRRVDLDETSRLRAAEHD
ARRKRGVPYEQFEREWLKQRPDDEILKYYGTWPDAKVAQPLLRA
>P80556 ~~~apcD~~~Allophycocyanin subunit alpha-B~~~
MTVISQVILQADDELRYPSSGELKSISDFLQTGVQRTRIVATLAENEKKIVQEATKQLWQKRPDFIAPGGNAYGERQRAL
CIRDFGWYLRLITYGVLAGDIEPIEKIGIIGVREMYNSLGVPVPGMVEAINSLKKASLDLLSSEDAAAAAPYFDYIIQAM
S
>O68966 ~~~apcD~~~Allophycocyanin subunit alpha-B~~~
MSVVSQVILRADDELRYPSSGELSGIKNFLATGAVRIRIAEALADNEKKIVDQAQKQLFSIHPEYRTSGGNAATTKQYNQ
CLRDYGWYLRLVTYGILAGDKDPIERIGLIGVKEMYNALGVPVPGMVDAIRCLKDAALGVLDSEEARIAAPYFDFITQAM
S
>P11390 ~~~~~~Allophycocyanin alpha-B chain~~~
MTIVSQVILKADDELRYPSGGELKNITDFFKTGEQRLRIAQVLSDSEKKIVDQASRKLWQRRPDFIAPGGNAYGQRQRAQ
CLRDYGWYLRLITYGVLAGDKEPIESIGLLGAREMYNSLGVPLPGMAEAIRTLKEASLALLSSADATVAAPYFDFLIQGM
ETI
>P72870 ~~~apcD~~~Allophycocyanin subunit alpha-B~~~
MSVVSQVILQADDQLRYPTSGELKGIQAFLTTGAQRIRIAETLAENEKKIVDQAQKQLFKKHPEYRAPGGNAYGQRQYNQ
CLRDYGWYLRLVTYGVLAGNKEPIETTGLIGVKEMYNSLNVPVPGMVDAVTVLKDAALGLLSAEDANETAPYFDYIIQFM
S
>Q5P5G6 6.4.1.8~~~apc5~~~Acetophenone carboxylase epsilon subunit~~~
MEAAGALWRRRMQELARGAGKPHAPLFVPLIMGCAAQIEAIPAIDMVRDGTRLRKNLSELRRMLKLDALTCAVPSCMEAE
AVGVEVSQDQWPPRIGTTAQVDVTAEIDADRLAASPRIAAALDAVRQIAVDPGEPVIAAALTGPAALVAQLRAAGVEAGD
EAIYDFAGRILATLARLYAEAGVNLLSWHEAARPAEEQDDFWKGALGTAGNVARFHRVPPVLVLPASLAAGPWPAQAVPC
PALNHPPLPPVRTHARAWAADPAGWPCLPVEGVAERLILTDAEVPPETEIATLKAQVERVRGE
>P16566 4.-.-.-~~~apcE~~~Phycobiliprotein ApcE~~~
MSVKASGGSSVARPQLYQTLAVATITQAEQQDRFLGTGELNELATYFASGAKRLEIAQTLTENSEIIVSRAANRIFVGGS
PMSFLEKPREAELAMATVAPGNVQEGMKLGTVTYVESRGGFLENLRSIFNSSPSGPTPPGFRPINVARYGPSNMAKSLRD
LSWFLRYATYAIVAGDPNIIAVNTRGLREIIENACSGEPTIVALQEIKAASLSFFRQDAKATEIVSQYMDVLLTEFKAAT
PSNKLRQRPSGDQQGLQLPQIYFEAAERRPKFVMKPGLSASEKNEVIRAAYRQIFERDITRAYSLSVSDLESKVKNGDIS
MKEFVRRLAKSPLYQKQFYQPFINSRVIELAFRHILGRGPSSREEVQKYFSIISNGGLPALVDALVDSPEYSDYFGEETV
PYLRGLGQEAQECRNWGPQQDLFNYSAPFRKVPQFITTFACLYDRPLPDQHPYGSGNDPLEIQFGAIFPKETRNPNTSPA
PFSKDTRRILINQGPGINSQVSNPGARGEFPGSLGPKVFRLDQLPGTIGKKAAKGASIKFSESSTQAVIKAAYLQVFGRD
VYEGQRLKVQEIKLENGQLSVREFIRALAKSDVFRKTYWTSLYVCKAIEYIHRRLLGRPTYGRQEINKYFDIAAKQGFYA
VVDAIINSVEYSEAFGEDTVPYERYLTPSGVALRQLRVGSIREDVGGKVQKQETPLFVTLGTVTDTRTEPDIQFRINQGV
SKQREQTKVFKQVANISDKAAVQTLISAAYRQIFERDVAPYIAKNEFSALESKLSNGEITVKEFIEGLGYSNLYIKEFYT
PYPNTKVIELGTKHFLGRAPLDQVEIRKYNQILATQGIRAFIGALVSSAEYAEVFGEDTVPYRRYPTLPAANFPNTEKLY
NQLTKQNDDLVVPSFKTVQPRLTLAGTSSSGRNGFTDLGRSSTSAQGQLGETANRCKPARIYRLSGTNQAETQLVINAIY
SQVLDLFSSDIPANYRLNALEGKLQTGEISVREFVRELASSDIYCDRFYTPYPSAKVIEFLYRHLLGRAPATQEEISEYN
KLMASRGLRAVVEAIVDSQEYARYFGEDVVPYPRSSSLGN
>P80559 4.-.-.-~~~apcE~~~Phycobiliprotein ApcE~~~COG0237
MSVKASGGSSVARPQLYQTLAVATITQAEQQDRFLGRGELDELASYFASGAKRLEIAQLLTENSEIIVSRAANRIFVGGS
PMAFLEKPREPELAMAAVGGGGDVRESMKLGTVTYVETRGGFLENLRSIFNTSPSGPTPPGFRPINIARYGPSNMAKSLR
DLSWFLRYATYAIVAGDPNIIVVNTRGLREIIENACSGEATIVALQEIKAASLSYFRKDPEAAEIVSQYMDVLITEFKAP
TPSNKLRQRPSGDQQGLQLPQIYFSAAERRPKFVMKTGLSATEKNEVIKAAYRQIFERDITRAYSLSISDLESKVKNGDI
SMKEFVRRLAKSPLYQKQFYQPFINSRVIELAFRHILGRGPSSREEVQKYFSIISNGGLPALVDALVDSAEYSDYFGEET
VPYLRGLGQEAQECRNWGPQQDLFNYSAPFRKVPQFITTFAAYDRPLPDQHPYGSGNDPLEIQFGAIFPKETRNPSTSPA
PFGKDTRRILIHQGPGINNQVSNPSARGLAPGSLGPKVFKLDQLPGTIGKKAAKGASVKFSESSTQAVIKATYLQVFGRD
VYEGQRLKVQEIKLENGEISVRDFVRALAKSDLFRKLYWTPFYVCKAIEYIHRRLLGRPTYGRQENNKYFDIASKKGLYA
VVDAILDSLEYTETFGEDTVPYERYLTPAGVALRQLRVGTIREDVANVEKQETPRFVELGTVKENRTQPDIDFRINQGVT
KQREQTKVFKRVAGIKDKAAIKTLISAAYRQIFERDIAPYIAQNEFSGWESKLGNGEITVKEFIEGLGYSNLYLKEFYTP
YPNTKVIELGTKHFLGRAPIDQAEIRKYNQILATQGIRAFINALVNSQEYNEVFGEDTVPYRRFPTLPAANFPNTQKLYN
QLTKQNNDVVIPSFKPVQARIQSDKTPILAKAIADLAAQAKQMDKSKPLFIELGRSYNDGRGQSVEVGVGTTRRKPARIY
RLTNGIGQAEKQLVINAIYRQVLDVFSGQVPDYYRRTELDSKLRNGEISVREFVREIASSEIYRKRFYTPYPNTKVIEFL
FRHLLGRAPATQGEIRQYNKLLADNGLRAAVEAIVDSPEYSRYFGEDVVPYPRFPSLPAGNYLGSVQAAADLVKQSWSSL
SPSTLTGRPGDR
>O68973 4.-.-.-~~~apcE~~~Phycobiliprotein ApcE~~~COG0237
MTIKASGGSSLARPQLYQTVPLSNISQAEQQDRYLESGELTALKTFYDSGLKRLAIAQAIKLSSQLIVSRAANRIFAGGS
PLAYLDQPETDTDDSDLGVSMAVGDASGATGIFGGVKNLFLGSGGGKIPAGFRPISVSRYGPRNMTKSLRDMAWFLRYTT
YAIVAGDPSILVVNTRGLKEVIENACSIPATIVAIQEMKAASLDLFRGDREAQETVVQYFDVLITEMQTQVPNDKLRQRP
SIDAQGLQLPQSYFNAAEKRQKFVMKPGLSALEKNSVVKAAYRQIFERDITRAYSQSISYLESQVKSGDISMKEFVRRLA
KSPLYRKQFFEPFINSRALELAFRHILGRGPSSREEVQEYFAIVSSGGLAALVDALVDSQEYADYFGEETVPYLRGLGQE
AQECRNWGMQQDLFKYSAPFRKVPQFITTFASYNQPLPDQHVYGSGNDALEIQFGAIFPKATRSPSASPAPFNKDTRRIL
IHRGPGINNQLGNPRARATQPGSLGAKVFRLNNELPSGKTTNVSFSESATQKVIEAAYRQVFGRMVYAGQRQKVAEIKLE
NGEITLREFIRALAKSDVFRNTYWSSLYVTKAVEYIHRRLLGRPTYGRQEINSYFDTCAKKGFYALVDAIIDSKEYEEAF
GEDTVPYERYLTPGGYSLRQTRPGALREDVGVKVKVEKTARFIELGTSSTKNLPVTDVDARLKQGVNIQRQQTKAFKLTD
TFNKVELKTAIAAAYRQIFERDIEPYIVDAQFTALESKLGNREINMKEFIEGLGCSELYQKEFYTPYPNTKVIEMGTKHF
LGRAPLDQQEIRKYNQILASQGLKAFIGAMVNSMEYLDNFGEDTVPFRRFPTLPAANFPNTERLYNQLTKQNRDLVVPSF
EPAVKR
>Q55544 4.-.-.-~~~apcE~~~Phycobiliprotein ApcE~~~COG0237
MSVKASGGSSLARPQLYQTVPVSAISQAEQQDRFLEGSELNELTAYFQSGALRLEIAETLTQNADLIVSRAANRIFTGGS
PLSYLEKPVERQPALVGASSDSRNGSVTYAESNGSGGLFGGLRSVFSSTGPIPPGFRPINIARYGPSNMQKSLRDMSWFL
RYTTYAIVAGDPNIIVVNTRGLKEVIENACSIDATIVAIQEMRAASADYFRNNAQAKEIVLQYFDILLSEFKAPTPANKV
RQGPSNDIQGLELPQSYFNAAAKRQKYAMKPGLSALEKNAVIKAAYRQIFERDITKAYSQSISYLESQVRNGDISMKEFV
RRLAKSPLYRKQFFEPFINSRALELAFRHILGRGPSSREEVQKYFSIVSSGGLPALVDALVDSQEYADYFGEETVPYLRG
LGVEAQECRNWGMQQDLFSYSAPFRKVPQFITTFAQYDRPLPDQHVYGSGNDPLEIQFGAIFPKETRNPSKRPAPFNKDT
KRILIHRGPAVNNQVGNPSAVGEFPGSLGAKVFRLNGGLPGAKVGKNTGTSVKFGESSTQALIRAAYRQVFGRDLYEGQR
LSVAEIQLENGDISVREFIKRLAKSELFLKLYWAPHYVCKAIEYMHRRLLGRPTYGRQEMNQYFDIASKQGFYAVVEAMI
DSKEYSDAFGEDTVPYERYLTPGGLQMRSARVGSLREDIGQRVDKEVTPRFVELGQVSAIRTEPEIAYRSNQGVTRQRQQ
TKVFKLVSTYDKVAVKNAIRAAYRQVFERDLEPYIINSEFTALESKLSNNEINVKEFIEGLGTSELYMKEFYAPYPNTKV
IEMGTKHFLGRAPLNQKEIQQYNQILASQGLKAFIGAMVNGMEYLQTFGEDTVPYRRFPTLPAANFPNTERLYNKLTKQD
KELVVPSFTPVVKVGG
>Q7A2D6 ~~~apcF~~~Allophycocyanin subunit beta-18~~~
MRDAVTSLIKNYDVAGRYFDRNAIDTLKDYFDSGTARVQAAAAINSNAAALVKQAGSKLFEELPELIRPGGNAYTTRRLA
ACLRDMDYYLRYATYALVAGNTNVLDERVLQGLRETYNSLGVPIGPTVRGVQILKDLVKEQVAGAGIANTTFVEEPFDHI
TRELSERDV
>O68967 ~~~apcF~~~Allophycocyanin subunit beta-18~~~
MRDAVTSLIRNYDTTGRYFDRDAIESLKDYFASGNDRITVAAMINSQSAEIVKAAANSLFEAVPELLLAGGNAYTTRRFS
ACLRDMDYYLRYGTYALIAGDMDVLNERVLQGLRETYNSLGVPIAPTVRGIQFLKDAIKEMAAAAGIANTAFIDEPFDHM
TRELSEVDL
>P74551 ~~~apcF~~~Allophycocyanin subunit beta-18~~~
MRDAVTTLIKNYDLTGRYLDRNAMDELKAYFESGSARIAAAAMINANSATIVKRAAAQLFEEIPELIRPSGNAYTTRRFS
ACLRDMDYYLRYASYALIAADNNVLDERVLQGLRETYNSLGVPIGPTVRGIQIMKEMIEAMAEDSSLNSTDFIASPFDHM
TRELSELSV
>P0C925 3.4.11.-~~~apeA~~~Probable M18 family aminopeptidase 1~~~
MKKQNPWIYLNEEEKNQILNFSESYKKFISKFKTEREVTAYALDKAKKLGFINAEEKKNLMPGDKIFYTCREKSVAFAII
GKNPIEDGMNFIVSHTDSPRLDAKPSPISEENELTFIKTNYYGGIKKYQWLSTPLSIRGVVFLKNGEKVEINIGDNENDP
VFVIPDILPHLDRKIQRNKKSDEIVEGENLKILIGSLPIETKEKNKVKLATLQLIKEKYKIEEEDFVSSEIEIVPAGTAK
DVGFDKALIGAYGQDDKICVFTSLESIFDLEETPNKTAICFLVDKEEIGSTGSTGLDSRYLEYFVSDMIFKIKKSEYNNL
HVQKALWNSKSISADVCAAINPLFSSVHDEQNAPQLGYGIPIMKYTGHGGKSMASDADAELVSYIRQLLNKNNIAWQVAT
LGKVEEGGGGTVAKFLAGYGIRTIDMGPAVISMHSPMEITSKFDLYNAYLAYKAFYRE
>Q97K30 3.4.11.-~~~apeA~~~Probable M18 family aminopeptidase 1~~~COG1362
MPNDLLKEYKNAWDKYDDKQLKEVFALGDRFKNFISNCKTERECVTELIKTAEKSGYRNIEDILAKGETLKEGDKVYANN
RGKGLIMFLIGKEPLYTGFKILGAHIDSPRLDLKQNPLYEDTDLAMLETHYYGGIKKYQWVTLPLAIHGVIVKKDGTIVN
VCVGEDDNDPVFGVSDILVHLASEQLEKKASKVIEGEDLNILIGSIPLKDGEEKQKVKHNIMKILNEKYDISEEDFVSAE
LEIVPAGKARDYGFDRSMVMGYGQDDRICAYTSFEAMLEMKNAKKTCITILVDKEEVGSIGATGMQSKFFENTVADIMSL
CGDYDELKLRKALYNSEMLSSDVSAAFDPNYPNVMEKRNSAYLGKGIVFNKYTGSRGKSGCNDANPEYIAELRRILSKES
VNWQTAELGKVDQGGGGTIAYILAEYGMQVIDCGVALLNMHAPWEISSKADIYETKNGYSAFLNN
>Q9WYJ9 3.4.11.-~~~apeA~~~Probable M18 family aminopeptidase 1~~~COG1362
MKMERKNVWHHRKKEEIEAFSKEYMEFMSKAKTERMTVKEIKRILDESGFVPLEDFAGDPMNMTVYAVNRGKAIAAFRVV
DDLKRGLNLVVAHIDSPRLDFKPNPLIEDEQIALFKTHYYGGIKKYHWLSIPLEIHGVLFKNDGTEIEIHIGDKPEDPVF
TIPDLLPHLDKEDAKISEKFKGENLMLIAGTIPLSGEEKEAVKTNVLKILNEMYGITEEDFVSGEIEVVPAFSPREVGMD
RSLIGAYGQDDRICAYTALRALLSANPEKSIGVIFFDKEEIGSDGNTGAKARFYLKALRQILKMQGAKDSEFVLDEVLEN
TSVISGDVCAAVNPPYKDVHDLHNAPKLGYGVALVKYTGARGKYSTNDAHAEFVARVRKVLNEQGVIWQVATLGKVDQGG
GGTIAKFFAERGSDVIDMGPALLGMHSPFEISSKADLFETYVAYRSLMEKL
>P9WHT1 3.4.11.-~~~apeB~~~Probable M18 family aminopeptidase 2~~~COG1362
MAATAHGLCEFIDASPSPFHVCATVAGRLLGAGYRELREADRWPDKPGRYFTVRAGSLVAWNAEQSGHTQVPFRIVGAHT
DSPNLRVKQHPDRLVAGWHVVALQPYGGVWLHSWLDRDLGISGRLSVRDGTGVSHRLVLIDDPILRVPQLAIHLAEDRKS
LTLDPQRHINAVWGVGERVESFVGYVAQRAGVAAADVLAADLMTHDLTPSALIGASVNGTASLLSAPRLDNQASCYAGME
ALLAVDVDSASSGFVPVLAIFDHEEVGSASGHGAQSDLLSSVLERIVLAAGGTREDFLRRLTTSMLASADMAHATHPNYP
DRHEPSHPIEVNAGPVLKVHPNLRYATDGRTAAAFALACQRAGVPMQRYEHRADLPCGSTIGPLAAARTGIPTVDVGAAQ
LAMHSARELMGAHDVAAYSAALQAFLSAELSEA
>Q9HYZ3 3.4.11.-~~~apeB~~~Probable M18 family aminopeptidase 2~~~
MRAELNQGLIDFLKASPTPFHATASLARRLEAAGYRRLDERDAWHTETGGRYYVTRNDSSLIAIRLGRRSPLESGFRLVG
AHTDSPCLRVKPNPEIARNGFLQLGVEVYGGALFAPWFDRDLSLAGRVTFRANGKLESRLVDFRKAIAVIPNLAIHLNRA
ANEGWPINAQNELPPIIAQLAPGEAADFRLLLDEQLLREHGITADVVLDYELSFYDTQSAAVVGLNDEFIAGARLDNLLS
CHAGLEALLNAEGDENCILVCTDHEEVGSCSHCGADGPFLEQVLRRLLPEGDAFSRAIQRSLLVSADNAHGVHPNYADRH
DANHGPALNGGPVIKINSNQRYATNSETAGFFRHLCQDSEVPVQSFVTRSDMGCGSTIGPITASQVGVRTVDIGLPTFAM
HSIRELAGSHDLAHLVKVLGAFYASSELP
>Q81QL7 ~~~apeX~~~Apo-petrobactin exporter~~~
MKKHPLHMLGRLVAGKNTQWITLSVWILITLLLSFTLPQVNSTKEPNPKNLPETAMSQQAEALMKKEFPNNAGNPLLVVW
YRDGGLQSQDYKLIQDVYKELKASPLKEQSTLPPFDTIPEQVLSKSASKDGTSFVTPVFFNKSAGTDILKENLDDLRNIV
NSKVDEDPFKRKINDAGLHVRLSGPVGIQTDAVSLFSQADVKLLVATVLLVLVLLILLYRSPILAILPLLVVGFAYGIIS
PTLGFLADHGWIKVDAQAISIMTVLLFGAGTDYCLFLISRYREYLLEEESKYKALQLAIKASGGAIIMSALTVVLGLGTL
LLAHYGAFHRFAVPFSVAVFIMGIAALTILPAFLLIFGRTAFFPFIPRTTSMNEELARRKKKVVKVKKSKGAFSKKLGDV
VVRRPWTIIMLTVFVLGGLASFVPRIQYTYDLLESFPKDMPSREGFTLISDHFSAGELAPVKVIVDTKGKELPIKEELEK
FSFVNTVKDPKEGKENKQIQMYEVSLAENPYSIEALDQIPKLKNSVEKVFKDAGISNAEDQLWIGGETASLYDTKQITER
DEAVIIPVMISIIALLLLVYLRSIVAMIYLIVTVVLSFFSALGAGWLLLHYGMGAPAIQGAIPLYAFVFLVALGEDYNIF
MVSEIWKNRKTQNHLDAVKNGVIQTGSVITSAGLILAGTFAVLGTLPIQVLVQFGIVTAIGVLLDTFIVRPLLVPAITVV
LGRFAFWPGKLSRKSEEVQKVDA
>P0AE22 3.1.3.2~~~aphA~~~Class B acid phosphatase~~~COG3700
MRKITQAISAVCLLFALNSSAVALASSPSPLNPGTNVARLAEQAPIHWVSVAQIENSLAGRPPMAVGFDIDDTVLFSSPG
FWRGKKTFSPESEDYLKNPVFWEKMNNGWDEFSIPKEVARQLIDMHVRRGDAIFFVTGRSPTKTETVSKTLADNFHIPAT
NMNPVIFAGDKPGQNTKSQWLQDKNIRIFYGDSDNDITAARDVGARGIRILRASNSTYKPLPQAGAFGEEVIVNSEY
>P44009 3.1.3.2~~~aphA~~~Class B acid phosphatase~~~COG3700
MKNVMKLSVIALLTAAAVPAMAGKTEPYTQSGTNAREMLQEQAIHWISVDQIKQSLEGKAPINVSFDIDDTVMLFSSPCF
YHGQQKFSPGKHDYLKNQDFWNEVNAGCDKYSIPKQIAIDLINMHQARGDQVYFFTGRTAGKVDGVTPILEKTFNIKNMH
PVEFMGSRERTTKYNKTPAIISHKVSIHYGDSDDDVLAAKEAGVRGIRLMRAANSTYQPMPTLGGYGEEVLINSSY
>Q59544 3.1.3.2~~~aphA~~~Class B acid phosphatase~~~
MRKLTLTLSALALALSLNSVADAKVYMPEKVSDGVTVAQLAEQHAIHWISVEQIEESLKGQPMAVGFDIDDTVLFSSPGF
YRGKLEYSPNDYSYLKNPEFWEKMNNEWDKFSMPKKSGMELVQMHLKRGDTVYFITGRSKTKTETVTKYVQEGLRIPADK
MNPVIFAGDEEGQNNKVSWMRDHKLKIYYGDADADIAAARELNIRGIRVLRASNSSYQPLPKAGQFGEEVVINSEY
>O08430 3.1.3.2~~~aphA~~~Class B acid phosphatase~~~COG3700
MKKITLALSAVCLLFTLNHSANALVSSPSTLNPGTNVAKLAEQAPVHWVSVAQIENSLTGRPPMAVGFDIDDTVLFSSPG
FWRGKKTYSPDSDDYLKNPAFWEKMNNGWDEFSIPKEVARQLIDMHVRRGDSIYFVTGRSQTKTETVSKTLADNFHIPAA
NMNPVIFAGDKPGQNTKVQWLQEKNMRIFYGDSDNDITAARDCGIRGIRILRAANSTYKPLPQAGAFGEEVIVNSEY
>Q540U1 3.1.3.2~~~aphA~~~Class B acid phosphatase~~~COG3700
MKKITLALSAVCLLFTLNHSANALVSSPSTLNPGTNVAKLAEQAPVHWVSVAQIENSLTGRPPMAVGFDIDDTVLFSSPG
FWRGKKTYSPDSDDYLKNPAFWEKMNNGWDEFSIPKEVARQLIDMHVRRGDSIYFVTGRSQTKTETVSKTLADNFHIPAA
NMNPVIFAGDKPEQNTKVQWLQEKNMRIFYGDSDNDITAARDCGIRGIRILRAANSTYKPLPQAGAFGEEVIVNSEY
>P58683 3.1.3.2~~~aphA~~~Class B acid phosphatase~~~
MKKITLALSAVCLLFTLNHSANALVSSPSTLNPGTNVAKLAEQAPVHWVSVAQIENSLTGRPPMAVGFDIDDTVLFSSPG
FWRGKKTYSPDSDDYLKNPAFWEKMNNGWDEFSIPKEVARQLIDMHVRRGDSIYFVTGRSQTKTETVSKTLADNFHIPAA
NMNPVIFAGDKPEQNTKVQWLQEKNMRIFYGDSDNDITAARDCGIRGIRILRAANSTYKPLPQAGAFGEEVIVNSEY
>O69622 ~~~~~~Apoptosis inhibitor Rv3654c~~~
MVARHRAQAAADLASLAAAARLPSGLAAACARATLVARAMRVEHAQCRVVDLDVVVTVEVAVAFAGVATATARAGPAKVP
TTPG
>O69623 ~~~~~~Apoptosis inhibitor Rv3655c~~~
MEAALAIATLVLVLVLCLAGVTAVSMQVRCIDAAREAARLAARGDVRSATDVARSIAPRAALVQVHRDGEFVVATVTAHS
NLLPTLDIAARAISVAEPGSTAARPPCLPSRWSRCCCASPVRVHI
>A6VKQ8 ~~~~~~D-apiose import binding protein~~~COG1879
MKLLKASLVALSLAASTFVYADNGLIAIITPSHDNPFFKAEADGAKQKAEELGYTTLVASHDDDANKQDQLISTAVSRKA
KAIILDNAGSDVTVGALEKAKAAGVPAFLIDREINKTGVAVSQIVSNNYQGAQLSAEKFVELMGEKGQYVELLGRESDTN
ASVRSQGFHEIIDEYPEMKMVAQQTANWSQTEGFSRMESILQANPNIKGVISGNDTMALGAEAALKAAGRTDVIVVGFDG
SDYVRDSILAGGNIKATALQPAWDQAQEAVVQADKYIRTGSTGKEEKQLMDCILIDSSNAKKLNKFSLSK
>B1G898 ~~~~~~D-apiose import binding protein~~~
MKASKRWVALAAATLTLFTATGTAQAANLIAIITPSHDNPFFKAEADTANARAKALGYDTIVLVHDDDANKQSNLVDTAI
ARGAKAIILDNAGSEASISAVRKAKAAGIPSFLIDREINATGIAVSQIVSNNYQGAQLGGRAFVKALGEKGNYVELVGRE
ADINAGIRSKGYHDVIDQFPNMKMVERQSANWSQTEAYRVMETILQSHPDVKGVIAGNDTMAMGASAALKAAKRSDVIVV
GFDGSNDVRDAIMRNDIRATVLQPAALAATEAVEQADKYMKTGSTGKPEKQLINCSLITKANAGKLDMFALR
>Q2JZQ5 ~~~~~~D-apiose import binding protein~~~
MKLTRRLTLAAFASALALGTAMPAFAADLIAIITPAHDNPFFKAEAVGAEAKAKELGYETLVMTHDDDANKQSEMIDTAI
GRGAKAIILDNAGADASVAAVKKAKDAGIPSFLIDREINATGVAVAQIVSNNYQGAQLGAQEFVKLMGEKGNYVELVGKE
SDTNAGIRSQGYHDVIDDYPEMKSVAKQSANWSQTEAYSKMETILQANPDIKGVISGNDTMAMGAIAALQAAGRKDVIVV
GFDGSNDVRDSIKSGGIKATVLQPAYAQAQLAVEQADAYIKNKTTPKEEKQLMDCVLINADNAGKLETFALTN
>P15636 3.4.21.50~~~~~~Protease 1~~~
MKRICGSLLLLGLSISAALAAPASRPAAFDYANLSSVDKVALRTMPAVDVAKAKAEDLQRDKRGDIPRFALAIDVDMTPQ
NSGAWEYTADGQFAVWRQRVRSEKALSLNFGFTDYYMPAGGRLLVYPATQAPAGDRGLISQYDASNNNSARQLWTAVVPG
AEAVIEAVIPRDKVGEFKLRLTKVNHDYVGFGPLARRLAAASGEKGVSGSCNIDVVCPEGDGRRDIIRAVGAYSKSGTLA
CTGSLVNNTANDRKMYFLTAHHCGMGTASTAASIVVYWNYQNSTCRAPNTPASGANGDGSMSQTQSGSTVKATYATSDFT
LLELNNAANPAFNLFWAGWDRRDQNYPGAIAIHHPNVAEKRISNSTSPTSFVAWGGGAGTTHLNVQWQPSGGVTEPGSSG
SPIYSPEKRVLGQLHGGPSSCSATGTNRSDQYGRVFTSWTGGGAAASRLSDWLDPASTGAQFIDGLDSGGGTPNTPPVAN
FTSTTSGLTATFTDSSTDSDGSIASRSWNFGDGSTSTATNPSKTYAAAGTYTVTLTVTDNGGATNTKTGSVTVSGGPGAQ
TYTNDTDVAIPDNATVESPITVSGRTGNGSATTPIQVTIYHTYKSDLKVDLVAPDGTVYNLHNRTGGSAHNIIQTFTKDL
SSEAAQRAPGSCG
>Q6D5T8 2.7.1.233~~~aplK~~~Apulose kinase~~~COG0554
MYTPVILAIDEGTTNAKAIAVDERGRILAKAAVALQVTHPQPGRSEQDAMAIWRAVCQAAEVCLSSLHRAQVVGVAISNQ
RESVLIWDRQTGKPMTPLVSWQDRRAEKFCQALQGSAEARLIESRTGLQVDPLFPAAKLHAMLAELPNGVARAMQGELCI
GTVDCWLNWQFSGGRAFSTDYSNAARTQLFNIHRGCWDEDLLALFGIPSVCLPAVTPSSALHGHTGVTGISGLAACVPIV
ALIGDSHAALYGQGITQSGEIKATYGTGSSLMTTINTPHLHATGLSTTIAWHDGELRYALEGNITHTGSGFAWIGQMLGV
PSVTQLTELALSAESNQGVFFVPALSGLGAPYWDVQARGLLCGLCDATTPAIIARAGLEAIAYQVADVFFAMEQVSQAAL
PALRVDGGATQNRWLMQFQADLLQRPLIRNHNAEVSALGAAYLGGKMLGWWEHNEQIAALPREVEVIEPSATNHAILESY
QQWRTAVARARLRPEA
>A6X3G3 3.1.1.115~~~apnL~~~D-apionate lactonase~~~
MSNVDPFLLYGTRETEAKPTHLKAGLLSLDLNDGNLRTITYDGVEVLRAVSYLVRDRDWGTYNPQIHDLNVEQSDSGFIV
TYQARCEGPDATKLTIDVCIQAKSGGTLTFDAVANTATGFETNRCGFCILHPIVGVAGSPVRVEHVDGTLDRTQLPYLIE
PWQPFKDMRAITHEAMPGVTAECRMEGDTFEMEDQRNWSDASYKTYVRPLALPWPYQIEASKPQHQRIVLNISDTRKAAL
QAGMNAEPVSITLGNTSGKLPNIGIIITPEEAKASLAAIDLLQEIDPQDLLFQYDPMAGHNGSAFADFATLAAKHSARVS
LEIALPCEKSLIEETTAIASDMKAAGFNPDAVIVSPAIDRQSTPPGSEWPTCPPLEDVYAAARAAFPNARLGGGMLSYFT
ELNRKCVPGELVDFVTHCTNPIVHAADDLSVMQTLEALPFITRSVRAVYGDKPYRIGPSTIPMRQNPYGSRTMENPNGKR
IAMANRDPRHNGKFAESFALAYAISVLNAGLDSLTLSALTGPFGLTAGEGEPTLAGGKRPLFTTIKTLSGLAGKDWWQLI
SSRPDHVLAFATEHEFWLVNITSQPQTVSIAQFAKITLDPYAVQNLKRS
>B9JK75 1.1.1.421~~~apnO~~~D-apionate oxidoisomerase~~~COG0287
MTVIALFGAGGKMGYRLAKNLKGSRFDVRHVEVSDAGKARLKNDLDLSCVPVDEALNGAEVVILAVPDTAIGKVAAGIVD
KLKPGTMVVALDAAAPFAGHLPKRDELTYFVTHPCHPPIFNDETDMQAKKDHFGGLFAKQHIVSALMQGPESAYALGEEI
AKVIWAPVMRSHRVTVEQMAMLEPGLSETVCASLLVVMRQAMDECVARGVPEDAARDFLLGHMNVLGAVIFKEVDGVFSD
ACNKAIEFGIPALMRDDWKNVFEPKEIAASIQRIT
>C0CMQ7 1.1.1.421~~~apnO~~~D-apionate oxidoisomerase~~~COG0287
MGKIVVSVIGAGGKMGTRTSNNLAKKPEEFDLLLVEASEAGIQSIKDRGFEPTPVEEALEKSDVVVFAVPDTLIGKLSAI
YVPQLKPGTGFIILDPAAAVARELTLRDDCTFGVAHPCHPSYFLDQDTYEARQDRFGGCGGKQDIVMSKIQGNDDRFAQC
VEVAKQMYAPVEHAYVMSSEQIAFLEPTLVELLGATCLYAMAETVDEAVKRGIPKEAAVSFLTGHIYNLSANFLGYIPGN
PPVSDACKVAIGLGNRLVMREDWKKIWDDEVLNKVIATMLHPDKPQI
>F8GV06 1.1.1.421~~~apnO~~~D-apionate oxidoisomerase~~~
MKEKIALFGAGGKMGVRLAKNLLKSDYRVSHVEVSEVGKKRLKDELGLECVSTEAALDNVDVVILAVPDTIIGKIAAQIA
PQLRPGTMVMTLDAAAPFAGHLPDRPDLTYFVAHPCHPLIFNDETDPEARRDYFGGGAAKQSITSALMQGPEEAFDLGEA
VAKVIYAPILRSYRLTVDQMALLEPGLSETICATLLQVMREAMDETVRRGVPKEAARDFLLGHMNILGAVIFNEIPGAFS
DACNKAIEFGKPRLMRDDWIKVFDREEIAESIRRIT
>Q6D8V3 1.1.1.421~~~apnO~~~D-apionate oxidoisomerase~~~COG2084
MAAELKTITVLGAGGKMGMRISANFQKSDYQVFYCENSPRAQEQVVAAGRELSIAEQVIPESDVVILAVPDIALKAVSGI
VVPQMKSNAVLLTLDPAAAYANLIAKRDDIDYAVAHPCHPSVFLDRFTPEEHADAFGGVAAPQHVAASYETGSDEQKATL
ARVVKVMYGPVIDVHWVTVKQLAYLEPTLVETVACMVGTLMKEALDETINTIGVPEAAAKAMLYGHIQIALAVAFRSTNP
FSDACMIAIEYGKENIIKPDWKKIFDEKELDLVIAKMLKIDAIER
>P42061 ~~~appA~~~Oligopeptide-binding protein AppA~~~
MKRRKTALMMLSVLMVLAIFLSACSGSKSSNSSAKKSAGKPQQGGDLVVGSIGEPTLFNSLYSTDDASTDIENMLYSFLT
KTDEKLNVKLSLAESIKELDGGLAYDVKIKKGVKFHDGKELTADDVVFTYSVPLSKDYKGERGSTYEMLKSVEKKGDYEV
LFKLKYKDGNFYNNALDSTAILPKHILGNVPIADLEENEFNRKKPIGSGPFKFKEWKQGQYIKLEANDDYFEGRPYLDTV
TYKVIPDANAAVAQLQAGDINFFNVPATDYKTAEKFNNLKIVTDLALSYVYIGWNEKNELFKDKKVRQALTTALDRESIV
SQVLDGDGEVAYIPESPLSWNYPKDIDVPKFEYNEKKAKQMLAEAGWKDTNGDGILDKDGKKFSFTLKTNQGNKVREDIA
VVVQEQLKKIGIEVKTQIVEWSALVEQMNPPNWDFDAMVMGWSLSTFPDQYDIFHSSQIKKGLNYVWYKNAEADKLMKDA
KSISDRKQYSKEYEQIYQKIAEDQPYTFLYYPNNHMAMPENLEGYKYHPKRDLYNIEKWWLAK
>P26458 7.1.1.3~~~appB~~~Cytochrome bd-II ubiquinol oxidase subunit 2~~~COG1294
MFDYETLRFIWWLLIGVILVVFMISDGFDMGIGCLLPLVARNDDERRIVINSVGAHWEGNQVWLILAGGALFAAWPRVYA
AAFSGFYVAMILVLCSLFFRPLAFDYRGKIADARWRKMWDAGLVIGSLVPPVVFGIAFGNLLLGVPFAFTPQLRVEYLGS
FWQLLTPFPLLCGLLSLGMVILQGGVWLQLKTVGVIHLRSQLATKRAALLVMLCFLLAGYWLWVGIDGFVLLAQDANGPS
NPLMKLVAVLPGAWMNNFVESPVLWIFPLLGFFCPLLTVMAIYRGRPGWGFLMASLMQFGVIFTAGITLFPFVMPSSVSP
ISSLTLWDSTSSQLTLSIMLVIVLIFLPIVLLYTLWSYYKMWGRMTTETLRRNENELY
>P26459 7.1.1.3~~~appC~~~Cytochrome bd-II ubiquinol oxidase subunit 1~~~COG1271
MWDVIDLSRWQFALTALYHFLFVPLTLGLIFLLAIMETIYVVTGKTIYRDMTRFWGKLFGINFALGVATGLTMEFQFGTN
WSFYSNYVGDIFGAPLAMEALMAFFLESTFVGLFFFGWQRLNKYQHLLVTWLVAFGSNLSALWILNANGWMQYPTGAHFD
IDTLRMEMTSFSELVFNPVSQVKFVHTVMAGYVTGAMFIMAISAWYLLRGRERNVALRSFAIGSVFGTLAIIGTLQLGDS
SAYEVAQVQPVKLAAMEGEWQTEPAPAPFHVVAWPEQDQERNAFALKIPALLGILATHSLDKPVPGLKNLMAETYPRLQR
GRMAWLLMQEISQGNREPHVLQAFRGLEGDLGYGMLLSRYAPDMNHVTAAQYQAAMRGAIPQVAPVFWSFRIMVGCGSLL
LLVMLIALVQTLRGKIDQHRWVLKMALWSLPLPWIAIEAGWFMTEFGRQPWAIQDILPTYSAHSALTTGQLAFSLIMIVG
LYTLFLIAEVYLMQKYARLGPSAMQSEQPTQQQG
>P24244 ~~~appX~~~Putative cytochrome bd-II ubiquinol oxidase subunit AppX~~~
MWYLLWFVGILLMCSLSTLVLVWLDPRLKS
>P05052 ~~~appY~~~HTH-type transcriptional regulator AppY~~~COG2207
MDYVCSVVFICQSFDLIINRRVISFKKNSLFIVSDKIRRELPVCPSKLRIVDIDKKTCLSFFIDVNNELPGKFTLDKNGY
IAEEEPPLSLVFSLFEGIKIADSHSLWLKERLCISLLAMFKKRESVNSFILTNINTFTCKITGIISFNIERQWHLKDIAE
LIYTSESLIKKRLRDEGTSFTEILRDTRMRYAKKLITSNSYSINVVAQKCGYNSTSYFICAFKDYYGVTPSHYFEKIIGV
TDGINKTID
>T2G6Z9 1.8.99.2~~~aprA~~~Adenylylsulfate reductase subunit alpha~~~
MPKIPSKETPRGVAIAEPIIVEHSVDLLMVGGGMGNCGAAFEAVRWADKYAPEAKILLVDKASLERSGAVAQGLSAINTY
LGDNNADDYVRMVRTDLMGLVREDLIYDLGRHVDDSVHLFEEWGLPVWIKDEHGHNLDGAQAKAAGKSLRNGDKPVRSGR
WQIMINGESYKVIVAEAAKNALGQDRIIERIFIVKLLLDKNTPNRIAGAVGFNLRANEVHIFKANAMVVACGGAVNVYRP
RSVGEGMGRAWYPVWNAGSTYTMCAQVGAEMTMMENRFVPARFKDGYGPVGAWFLLFKAKATNCKGEDYCATNRAMLKPY
EERGYAKGHVIPTCLRNHMMLREMREGRGPIYMDTKTALQTSFATMSPAQQKHLEAEAWEDFLDMCVGQANLWAATNCAP
EERGSEIMPTEPYLLGSHSGCCGIWASGPDEAWVPEDYKVRAANGKVYNRMTTVEGLWTCADGVGASGHKFSSGSHAEGR
IVGKQMVRWYLDHKDFKPEFVETAEELKTLIYRPYYNYEKGKGASTCPVVNPEYISPKNFMMRLIKCTDEYGGGVGTYYN
TSKALLDTGFWLMEMLEEDSLKLAARDLHELLRCWENYHRLWTVRLHMQHIAFREESRYPGFYYRADFLGLDDSKWKCFV
NSKYDPAKKETKIFKKPYYQIIPTDA
>Q03023 3.4.24.40~~~aprA~~~Serralysin~~~
MSSNSLALKGRSDAYTQVDNFLHAYARGGDELVNGHPSYTVDQAAEQILREQASWQKAPGDSVLTLSYSFLTKPNDFFNT
PWKYVSDIYSLGKFSAFSAQQQAQAKLSLQSWSDVTNIHFVDAGQGDQGDLTFGNFSSSVGGAAFAFLPDVPDALKGQSW
YLINSSYSANVNPANGNYGRQTLTHEIGHTLGLSHPGDYNAGEGDPTYADATYAEDTRAYSVMSYWEEQNTGQDFKGAYS
SAPLLDDIAAIQKLYGANLTTRTGDTVYGFNSNTERDFYSATSSSSKLVFSVWDAGGNDTLDFSGFSQNQKINLNEKALS
DVGGLKGNVSIAAGVTVENAIGGSGSDLLIGNDVANVLKGGAGNDILYGGLGADQLWGGAGADTFVYGDIAESSAAAPDT
LRDFVSGQDKIDLSGLDAFVNGGLVLQYVDAFAGKAGQAILSYDAASKAGSLAIDFSGDAHADFAINLIGQATQADIVV
>Q1ID47 3.4.24.-~~~aprA~~~Metalloprotease AprA~~~COG2931
MSKVKESAIVSATSALQPQGPSSSYGLINSFAHQYDRGGANVNGKPSYTVDQAANYLLRDGAAWKDLNKDGTISLSYTFL
TKAPSDFYSRGLGTFSQFSDLQKGQAKLAMQSWADVAKVTFTEAASGGDGHMTFGNFSASNGGAAFAYLPFDMPGSHKGE
SWYLINSSYQVNTTPGTGNYGRQTLTHEIGHVLGLSHPGDYNAGEGNPTYRDATYAQDTRGYSVMSYWSESNTGQNFVKA
GGQYYASAPLMDDIAAIQKLYGANYATRSGDTVYGFNSNADRDFYSATSSSSKLVFSVWDGGGNDTFDFSGFTQNQKINL
NETSFSDVGGMIGNVSIAKGVTIENAFGGSGNDLLIGNALANVLKGGAGNDIIYGGGGADQLWGGTGADTFVFGAISDST
KAAPDRIMDFTSGQDKIDLSAISAFAVNKLPLQFVNAFTGHAGEAVLSYDQGTNLGSLSIDFTGNSSADFLVTTVGQAAV
TDIVV
>A0A0C5CJR8 3.4.24.-~~~aprA~~~Metallopeptidase AprA~~~
MSKAKDKAIVSAAQASTAYSQIDSFSHLYDRGGNLTINGKPSYTVDQAATQLLRDGAAYRDFDGNGKIDLTYTFLTSASS
STMNKHGISGFSQFNAQQKAQAALAMQSWSDVANVTFTEKASGGDGHMTFGNYSSGQDGAAAFAYLPGTGAGYDGTSWYL
TNNSYTPNKTPDLNNYGRQTLTHEIGHTLGLAHPGDYNAGEGAPTYNDATYGQDTRGYSLMSYWSESNTNQNFSKGGVEA
YASGPLIDDIAAIQKLYGANYNTRAGDTTYGFNSNTGRDFLSATSNADKLVFSVWDGGGNDTLDFSGFTQNQKINLNEAS
FSDVGGLVGNVSIAKGVTIENAFGGAGNDLIIGNNAANVIKGGAGNDLIYGAGGADQLWGGAGNDTFVFGASSDSKPGAA
DKIFDFTSGSDKIDLSGITKGAGLTFVNAFTGHAGDAVLTYAAGTNLGTLAVDFSGHGVADFLVTTVGQAAVSDIVA
>T2G899 ~~~aprB~~~Adenylylsulfate reductase subunit beta~~~
MPTFVDPSKCDGCKGGEKTACMYICPNDLMILDPEEMKAFNQEPEACWECYSCIKICPQGAITARPYADFAPMGGTCIPL
RGSEDIMWTIKFRNGSVKRFKFPIRTTPEGSIKPFEGKPEAGDLENELLFTETALTVPQVALGQKAQIADAETSQCWFDL
PCEGGNR
>O31788 3.4.21.-~~~aprX~~~Serine protease AprX~~~COG1404
MFGYSMVQMVRANAHKLDWPLRETVLQLYKPFKWTPCFLHKFFETKLQNRKKMSVIIEFEEGCHETGFQMAGEVLQKEKR
SKLKSRFNKINCCSAEVTPSALHSLLSECSNIRKVYLNREVKALLDTATEASHAKEVVRNGQTLTGKGVTVAVVDTGIYP
HPDLEGRIIGFADMVNQKTEPYDDNGHGTHCAGDVASSGASSSGQYRGPAPEANLIGVKVLNKQGSGTLADIIEGVEWCI
QYNEDNPDEPIDIMSMSLGGDALRYDHEQEDPLVRAVEEAWSAGIVVCVAAGNSGPDSQTIASPGVSEKVITVGALDDNN
TASSDDDTVASFSSRGPTVYGKEKPDILAPGVNIISLRSPNSYIDKLQKSSRVGSQYFTMSGTSMATPICAGIAALILQQ
NPDLTPDEVKELLKNGTDKWKDEDPNIYGAGAVNAENSVPGQ
>B9JK80 1.1.1.420~~~apsD~~~D-apiose dehydrogenase~~~COG0673
MNMTELKGALIGCGFFAVNQMHAWKDVKGAGIAAICDRDPKRLKLVGDQFGIERRYGDAAALFADGGFDFVDIATTVQSH
RALVEMAAAHKVPAICQKPFAKSLSDAKAMVRTCENADIPLMVHENFRWQTPIQAVKAVLESGAIGEPFWGRFSFRSGFD
VFSGQPYLAEGERFIIEDLGIHTLDIARFILGDVATLTARTKRVNPKIKGEDVATILLDHQNGATSIVDVSYATKLGTEP
FPETLIDIDGTQGTIRLSQGYRLEVTGPNGMTISDASPQLLSWASRPWHNIQESVLAIQQHWTDRLSSGGETSTSGADNL
KTFALVEAAYESAANGRTVDIGAML
>B1G894 1.1.1.420~~~apsD~~~D-apiose dehydrogenase~~~
MSATDATQVISATTTTVKRFKGALIGCGFFSRNHLHAWRDIDGAEIVALCDADSERLRAAGQAFGIERLYRDAAAMLSAE
QLDFVDIATTAPSHRSLVELAAQAGVAAICQKPFALTLADARAMVAACERAGVPLMVHENFRWQPAIQAVGRALRDGAIG
TPFWGRVSFRSAFDVFSGQPYLARNERFIVEDLGIHILDIARFLFGDVTRLAAATSRVNSAIAGEDVATILLTHESGVTS
VVDCSYASRQARELFPQTLVEVDGAEGTLRLSADYRLEIHNRDGTRTATAAPPPLAWASVPWEAIQASVVNIQRHWIACL
RNGGEPQTSGRDNLKTLALVEATYLSAREGRTVELKSLEAEAPAVTSTSSGAVRR
>Q6D5T7 5.3.1.36~~~apsI~~~D-apiose isomerase~~~COG1082
MATYNYPEFGAGLWHFANYIDRYAVDGYGPALSTIDQINAAKEVGELSYVDLPYPFTPGVTLSEVKDALKDAGLKAIGIT
PEIYLQKWSRGAFTNPDPAARAAAFELMHESAGIVRELGANYVKVWPGQDGWDYPFQVSHKNLWKLAVDGMRDLAGANPD
VKFAIEYKPREPRVKMTWDSAARTLLGIEDIGLDNVGVLLDFGHALYGGESPADSAQLIIDRGRLFGMDVNDNLRGWDDD
LVVGTVHMTEIFEFFYVLKINNWQGVWQLDQFPFRENHVEAAQLSIRFLKHIYRALDKLDIPALQAAQEAQNPLQAQRIV
QDALLSSITVSE
>A6VKQ4 2.2.1.13~~~aptA~~~Apulose-4-phosphate transketolase subunit A~~~COG3959
MNPYNLSYDELEKKAKAIRRKIVVLNANSPAGGHTGADLSQVEILTSLYFRVLNNDPKDLINPERDIYIQSKGHGAGGYY
CCLAEAGYIPEDWLPTYQHSDSKLPGHPVKHKTPGVELNTGALGHGLPVAVGLAIAAKKSGSKRKIYVLTGDGELGEGSN
WEAALTAAQYKLDNLIIINDKNKLQLAGFTKDILCTDPLDKKWEAFGMEVHECQGNDIRSVVDTLESIQPNGKPHVVIAN
TTKGAGISFIEGRPEWHHKVPKGDEVELALEELKDE
>Q9A3Q9 2.6.1.-~~~aptA~~~Omega-aminotransferase~~~COG0161
MPDFGANDLDAFWMPFTPNRRFKRHPRMLSSASGMWYRTPESREVLDATSGLWCVNAGHDRPKIREAIQKQAAEMDYAPC
FNMGHPLAFQFASRLAQITPKGLDRIFFTNSGSESVDTALKIALAYHRARGKGTKTRLIGRERGYHGVGFGGISVGGIPK
NRMYFGSLLTGVDHLPHTHGLPGNTCAKGQPENGAHLADDLERIVALHDASNIAAVIVEPVAGSTGVLIPPKGYLERLRA
ICDKHDILLIFDEVITGFGRVGAPFAAERFGVTPDLICMAKGLTNAAVPCGAVAASGKIYDAMMDGADAPIELFHGYTYS
AHPLACAAGLATLETYREDDLFARAAGLEGYWQDAMHSLADARHVVDVRNLGLVAGIELEPRPGAPTARAMEVFETCFDE
GLLIRVTGDIIALSPPLILEKDHIDRMVETIRRVLGQVD
>A6KXB4 2.2.1.13~~~aptA~~~Apulose-4-phosphate transketolase subunit A~~~COG3959
MQVETLELQSEKNRKRLVEIVYKAKAGHIGGDLSCLNVLTALYFDIMRVWPDKPKETKRDRFVMSKGHCVEALYVTLEAK
GFISREVTDTLGEFGSILSGHPTIEVPGIEVNTGALGHGLSVGVGMAMAAKMDKADYKTYVLMGDGEQGEGSIYEAAMAG
NQYKLDNLVAIIDRNRLQISGTTEEVMSLESMRDRWTAFGWDVLEMNGDEMEDIIRTFRSIDYTNKKPHLLISHTTKGKG
VSYMEGIAKWHHGVPTAEQYEEAVREVSERIEKLEKENNGK
>A6VKQ3 2.2.1.13~~~aptB~~~Apulose-4-phosphate transketolase subunit B~~~COG3958
MSNAEHLANIMVERFISAVKNGVDLVPVVADSTSTAKIAPFIKEFPDRLVNVGIAEQSLVGCAAGLALGGKVAVTCNAAP
FLISRANEQVKVDVCYNNTNVKLFGLNSGASYGPLASTHHSIDDIGVMRGFGNIQIFAPSSPNECRQIIDYAINYVGPVY
IRLDGKELPEIHNDDYEFVPGQIDVLRKGGKIALVAMGSTVYEIVDAAVKLAEKGIEVTVVNVPSIRPCDTEALFNAIKD
CKYVISVEEHNINGGVGSLVAEVIAEHGAPITLKRRGIADGGYALAGDRKSMRKHHGIDADSIVDLALSLN
>A6KXB3 2.2.1.13~~~aptB~~~Apulose-4-phosphate transketolase subunit B~~~COG3958
MANNMIACRKSFTDTLLELARQDKDIVAVTTDARGSVTLGDFAKELPAQFVECGIAEQDAVGISAGLAHSGKKVFVCGPA
CFYVARSLEQVKVDLAYSQNNVKILGVSGGVAYGALGATHHSLHDIAVLRTFPGMNIVLPCDARQTRKLVKLLVDYPEPV
YVRVGRAAVPDVYENDDFEFVLGKANTLLDGTDLTIIAAGETVYHAYQAGLMLREKGIQARVLDMSSIKPVDVEAIRKAA
EETGRIITVEEHSRFGGLGAIVVETLSENPVPVRIIGIPDENVVHGNSHEIFAHYGLDKEGICKTALEFVKK
>G8SD34 3.2.2.6~~~~~~NAD(+) hydrolase ApTIR~~~COG2319
MRYDAFISYSHAADGALAPAVQRGLQRLARRWHRPRALEVFRDQTGLAVSHALWSSIKVALDQSEFFVLLASPEAAASPW
VNQEIEHWLSRHSVDRLLPVVTSGEWVWDADAGDVDLERSTAVPPALRGVFGEEPRHLDLRWARAEHELDLRHGRFRDAI
AELAAGMHGMSKEDLDGEDVIRHRQMLRMRRGALAVVCALLLLVAGTAVAWRNARGEVTATNVALQRQRAATAAEQHRTE
EAADQARSQQQIVEAEQQRAQKAAEEARGQQAVAEAEQQRALRAAGEARRQEGIAAAEQRRAQKAAAEARRQRGVADAEK
AKANRAAAEAERQRKIAADEQRKAHEAAAEAERQREEAVKQQRIAIGRRLLGQAGEARDRDPRTAIQLGIAARHIYPGPQ
SQAGLVETLVRTHYAGTVTGHTAVVSAVALSGDGRTLVTDGLDGTVMVWDPTDRAAPRRLAQLTSSTAPVYTVALSGDGR
TLVTGSEDGTAMVWDLTDRAAPRRLAQLTGHTDVVDAVALSGDGRTLATGSFDGTAMVWDVTDRAAPRRLAQLTDHTAPV
TAVALSGDGRTLATGSDDHTAMVWDLTDRAAPRRLAQLTGHTAGVDAVALSGDGRTLATGSYDGTAMLWDLTDRAAPRRL
AQLTGHTAQVYTVALSRDGRTLATGSEDHTAMVWDLTDRAAPRRLAQLTGHTDAVDAVALSGDGRTLATAASITRRCCGM
>P69503 2.4.2.7~~~apt~~~Adenine phosphoribosyltransferase~~~COG0503
MTATAQQLEYLKNSIKSIQDYPKPGILFRDVTSLLEDPKAYALSIDLLVERYKNAGITKVVGTEARGFLFGAPVALGLGV
GFVPVRKPGKLPRETISETYDLEYGTDQLEIHVDAIKPGDKVLVVDDLLATGGTIEATVKLIRRLGGEVADAAFIINLFD
LGGEQRLEKQGITSYSLVPFPGH
>Q5NII9 2.4.2.7~~~apt~~~Adenine phosphoribosyltransferase~~~COG0503
MNLDFIKSKIAAVPDFPKPGIMFRDITPLLADPQGLRKTAEAMAQELKNKGIQPTIVAGTESRGFIFGVALAEVLGLGFV
PVRKPGKLPRATYSVKYDLEYGSDSLEIHQDAFKVTDEVLVVDDLLATGGTAKATVDLIEKTQAKVAGLIFVMELDGLGG
REVLAGYNVSALIKF
>A0QWJ5 2.4.2.7~~~apt~~~Adenine phosphoribosyltransferase~~~COG0503
MTQHHDTAEVSRVIATLTREVADFPEPGIQFKDLTPLLADARGLRVVTDALADIASGADLVAGLDARGFLLGAAVATRLG
TGVLAVRKGGKLPPPVHGATYQLEYGTATLEIPAEGIDIAGRNVVIIDDVLATGGTLAAAARLLGDCGANVTGAGVVLEL
EALRGREAVAPLGVRSLHII
>P9WQ07 2.4.2.7~~~apt~~~Adenine phosphoribosyltransferase~~~COG0503
MLNVIATGLSLKARGKRRRQRWVDDGRVLALGESRRSSAISVADVVASLTRDVADFPVPGVEFKDLTPLFADRRGLAAVT
EALADRASGADLVAGVDARGFLVAAAVATRLEVGVLAVRKGGKLPRPVLSEEYYRAYGAATLEILAEGIEVAGRRVVIID
DVLATGGTIGATRRLLERGGANVAGAAVVVELAGLSGRAALAPLPVHSLSRL
>P68779 2.4.2.7~~~apt~~~Adenine phosphoribosyltransferase~~~
MDLKQYVSEVQDWPKPGVSFKDITTIMDNGEAYGYATDKIVEYAKDRDVDIVVGPEARGFIIGCPVAYSMGIGFAPVRKE
GKLPREVIRYEYDLEYGTNVLTMHKDAIKPGQRVLITDDLLATGGTIEAAIKLVEKLGGIVVGIAFIIELKYLNGIEKIK
DYDVMSLISYDE
>P63544 2.4.2.7~~~apt~~~Adenine phosphoribosyltransferase~~~COG0503
MNLKDYIATIENYPKEGITFRDISPLMADGNAYSYAVREIVQYATDKKVDMIVGPEARGFIVGCPVAFELGIGFAPVRKP
GKLPREVISADYEKEYGVDTLTMHADAIKPGQRVLIVDDLLATGGTVKATIEMIEKLGGVMAGCAFLVELDELNGREKIG
DYDYKVLMHY
>B0K969 2.4.2.7~~~apt~~~Adenine phosphoribosyltransferase~~~COG0503
MTLEEIKMMIREIPDFPKKGIKFKDITPVLKDAKAFNYSIEMLAKALEGRKFDLIAAPEARGFLFGAPLAYRLGVGFVPV
RKPGKLPAETLSYEYELEYGTDSLEIHKDAVLEGQRVVIVDDLLATGGTIYASAKLVESLGGIVDSIIFLTELTFLDGRK
KLDGYDIISLIKF
>Q66DQ2 2.4.2.7~~~apt~~~Adenine phosphoribosyltransferase~~~
MTVSASKTAQQLKYIKDSIKTIPDYPKAGILFRDVTSLLENPKAYSASIELLSEHYSESGVTKVVGTEARGFLFGAPVAL
ALGVGFVPVRKPGKLPRETISESYELEYGTDTLEIHTDSIQPGDKVLVVDDLLATGGTIEATVKLIRRLGGEVVHAAFII
NLPELGGEARLTQQGIHCYSLVSFDGH
>P38939 ~~~apu~~~Amylopullulanase~~~COG0366
MFKRRTLGFLLSFLLIYTAVFGSMPVQFAKAETDTAPAIANVVGDFQSKIGDSDWNINSDKTVMTYKGNGFYEFTTPVAL
PAGDYEYKVALNHSWEGGGVPSQGNLSLHLDSDSVVTFYYNYNTSSVTDSTKYTPIPEEKLPRIVGTIQSAIGAGDDWKP
ETSTAIMRDYKFNNVYEYTANVPKGYYEFKVTLGPSWDINYGLNGEQNGPNIPLNVAYDTKITFYYDSVSHNIWTDYNPP
LTGPDNNIYYDDLKHDTHDPFFRSPFGAIKTGDTVTLRIQAKNHDLESAKISYWDDIKKTRTEVPMYKIGQSPDGQYEYW
EVKLSFDYPTRIWYYFILKDGTKTAYYGDNDEQLGGVGKATDTVNKDFELTVYDKNLDTPDWMKGAVMYQIFPDRFYNGD
PLNDRLKEYSRGFDPVEYHDDWYDLPDNPNDKDKPGYTGDGIWNNDFFGGDLQGINDKLDYLKNLGISVIYLNPIFQSPS
NHRYDTTDYTKIDELLGDLDTFKTLMKEAHARGIKVILDGVFNHTSDDSIYFDRYGKYLDNELGAYQAWKQGDQSKSPYG
DWYEIKPDGTYEGWWGFDSLPVIRQINGSEYNVKSWADFIINNPNAISKYWLNPDGDKDAGADGWRLDVANEIAHDFWVH
FRAAINTVKPNAPMIAELWGDASLDLLGDSFNSVMNYLFRNAVIDFILDKQFDDGNVVHNPIDAAKLDQRLMSIYERYPL
PVFYSTMNLLGSHDTMRILTVFGYNSANENQNSQEAKDLAVKRLKLAAILQMGYPGMPSIYYGDEAGQSGGKDPDNRRTF
SWGREDKDLQDFFKKVVNIRNENQVLKTGDLETLYANGDVYAFGRRIINGKDVFGNSYPDSVAIVVINKGEAKSVQIDTT
KFVRDGVAFTDALSGKTYTVRDGQIVVEVVALDGAILISDPGQNLTAPQPITDLKAVSGNGQVDLSWSAVDRAVSYNIYR
STVKGGLYEKIASNVTQITYIDTDVTNGLKYVYSVTAVDSDGNESALSNEVEAYPAFSIGWAGNMNQVDTHVIGVNNPVE
VYAEIWAEGLTDKPGQGENMIAQLGYRYIGDGGQDATRNKVEGVEINKDWTWVDARYVGDSGNNDKYMAKFVPDMVGTWE
YIMRFSSNQGQDWTYTKGPDGKTDEAKQFIVVPSNDVEPPTALGLQQPGIESSRVTLNWSLSTDNVAIYGYEIYKSLSET
GPFVKIATVADTVYNYVDTDVVNGKVYYYKVVAVDTSFNRTASNIVKATPDIIPIKVIFNVTVPDYTPDDGANIAGNFHD
AFWNPSAHQMTKTGPNTYSITLTLNEGTQLEYKYARGSWDKVEKGEYGEEIANRKITVVNQGSNTMVVNDTVQRWRDLPI
YIYSPKDNTTVDANTNEIEIKGNTYKGAKVTINDESFVQQENGVFTKVVPLEYGVNTTKIHVEPSGDKNNELTKDITITV
IREEPVQEKEPTPTPESEPAPMPEPQPTPTPEPQPSAIMAL
>P16950 ~~~apu~~~Amylopullulanase~~~
MFKRRALGFLLAFLLVFTAVFGSMPMEFAKAETDTAPAIANVVGNFQSKLGDSDWNINSDKTIMTYKGNGFYEFTTPVAL
PAGDYEYKVALNHSWEGGGVPSQGNLSFHLDSDSVVTFYYNYNTSSITDSTKYTPIPEDKLPRLVGTIQPAIGAGDDWKP
ETSTAIMRDYKFNNVYEYTANVPKGNYEFKVTLGPSWDINYGLNGEQNGPNIPLNVAYDTKITFYYDSVSHNIWTDYNPP
LTGPDNNIYYDDLRHDTHDPFFRSPFGAIKTGDTVTLRIQAKNHDIESAKISYWDDIKKTRIEVPMYRIGQSPDGKYEYW
EVKLSFDHPTRIWYYFILKDGTKTAYYGDNDEQLGGVGKATDTENKDFELTVYDKNLDTPDWMKGSVMYQIFPDRFFNGD
SSNDHLKKYSRGFDPVEYHSNWYELPDNPNDKNKLGYTGDGIWSNDFFGGDLKGIDDKLDYLKSLGISVIYLNPIFQSPS
NHRYDTTDYTKIDELLGDLSTFKKLMEDAHAKGIKVILDGVFNHTSDDSIYFDRYGKYLNTGVLGAYQAWKQGDQSKSPY
GDWYEIKPDGTYEGWWGFDSLPVIRQINGSEYNVKSWADFIINNPNAISKYWLNPDGDKNVGADGWRLDVANEVAHDFWV
HFRGAINTVKPNAPMVAENWNDASLDLLGDSFNSVMNYLFRNAVIDFILDKSFDDGNVVHNPIDAAKLDQRLMSIYERYP
LPVFYSTMNLLGSHDTMRILTVFGYNSADENQNSQAAKDLAVKRLKLAAILQMGYPGMPSIYYGDEAGQSGGKDPDNRRT
FPWGREDTDLQTFFKKVVNIRNENQVLKTGDLETLYANGDVYAFGRRIINGKDTFGKSYPDSVAIVVINKGDAKQVSIDT
TKFIRDGVAFTDALSGKTYTVQDGKIVVEVGSMDGAILISDTGQNLTAPQPITDLKAVSGNGKVDLSWSVVDKAVSYNIY
RSTVKGGLYEKIASNVTQITYTDTEVTNGLKYVYAVTAVDNDGNESALSNEVEAYPAFPIGWAGNMNQVNTHVIGVNNPV
EVYAEVWAQGLTDKPGQGENMIAQLGYRYIGDTVGDAVYNAVYNKVEGVEISKDWTWVDAQYVGDSGNNDKYMAKFVPDM
VGTWEYIMRFSSNQGHDWTYTKGPDGKTDEAKQFTVVPSNDVETPTAPVLQQPGIESSRVTLNWSPSADDVAIFGYEIYK
SSSETGPFIKIATVSDSVYNYVDTDVVNGNVYYYKVVAVDTSYNRTASNTVKATPDIIPIKVTFNVTIPDYTPDDGVNIA
GNFPDAFWNPNANQMTKAGSNTYSITLTLNEGTQIEYKYARGSWDKVEKGEYGNEIDNRKITVVNQGSNTMVVNDTVQRW
RDVPIYIYSPKDKTIVDANTSEIEIKGNTYKGAKVTINDESFVQQENGVFTKVVPLEYGVNTIKIHVEPSGDKNNELTKD
ITITVTREKPARRQNLLLLHQQKQQNHLKKYHKAK
>P80561 3.4.11.24~~~~~~Aminopeptidase S~~~COG2234
MRPNRFSLRRSPTAVAAVALAAVLAAGAPAAQAAGAAAPTAAAAAAPDIPLANVKAHLTQLSTIAANNGGNRAHGRPGYK
ASVDYVKAKLDAAGYTTTLQQFTSGGATGYNLIADWPGGDPNKVLMAGAHLDSVSSGAGINDNGSGSAAVLETALAVSRA
GYQPDKHLRFAWWGAEELGLIGSKYYVNNLPSADRSKLAGYLNFDMIGSPNPGYFVYDDDPVIEKTFKDYFAGLNVPTEI
ETEGDGRSDHAPFKNVGVPVGGLFTGAGYTKSAAQAQKWGGTAGQAFDRCYHSSCDSLSNINDTALDRNSDAAAHAIWTL
SSGTGEPPTGEGVFSNTTDVAIPDAGAAVTSSVAVTGRTGNAPAALQVGVDIKHTYRGDLVVDLLAPDGTAYRLKNSSSG
DSADNVIATYTVNASSEVANGSWKLRVQDIARQDTGYIDSWKLTF
>P0DTL1 ~~~~~~Anti-Pycsar protein Apyc1~~~
MALRLQMLGTGGAFAKKYFNNNALLYAGDFTLLIDCGITAPLALHTIGKSVEEIDAVLITHIHGDHVGGLEELAFRRKFG
SGRKPILYIAENLVEPLWENTLKGGLSQDGVIHSLNDVFDVRLLKESEPAQLAPELKVELIRTPHIPGKPSYSLYINDEI
FYSADMTFEPELLMRLVRERGCRRIFHEVQLTGKGEVHTTLQELLSLPTEIQSQILLKHYSDDMESFRGATGNMDFLRQH
EVYTL
>A0A2W1NDJ7 ~~~~~~Anti-Pycsar protein Apyc1~~~
MSLQIQMIGTGSAFAKKFYNNNALVKCNGFQLLIDCGVTAPRALHELGVPITGIDGILITHIHADHVGGIEEFAFRLKYK
YGMTIKLFVPAALVNPLWDHSLRGGLENKAEGLEQLADYFDVVALEEAVVHEIHPGLTVELVRSQHIAGKASYSLLLNNL
LFYSSDARFNYAQLVELSTSGRCKYILHDCQLAEPAAVHATLNELLTLPEAVQEMIMLMHYDDEMEQFIGKSGKMSFMQQ
HKTYSFTEAT
>F6C6C1 3.2.1.-~~~~~~Exo-alpha-(1->6)-L-arabinopyranosidase~~~
MSESTYPSVKDLTLEEKASLTSGGDAWHLQGVESKGIPGYMITDGPHGLRKSLASSAGETDLDDSVPATCFPPAAGLSSS
WNPELIHKVGEAMAEECIQEKVAVILGPGVNIKRNPLGGRCFEYWSEDPYLAGHEAIGIVEGVQSKGVGTSLKHFAANNQ
ESDRLRVDARISPRALREIYFPAFEHIVKKAQPWTIMCSYNRINGVHSAQNHWLLTDVLRDEWGFEGIVMSDWGADHDRG
ASLNAGLNLEMPPSYTDDQIVYAVRDGRITPAQLDRMAQGMIDLVNKTRAAMSIDNYRFDVDAHDEVAHQAAIESIVMLK
NDDAILPLNAGPVANPSAMPQKIAVIGEFARTPRYQGGGSSHITPTKMTSFLDTLAERGIKADFAPGFTLDLEPADPALE
SEAVETAKNADVVLMFLGLPEAAESEGFDRDTLDMPAKQITLLEQVAAANQNVVVVLSNGSVITVAPWAKNAKGILESWL
LGQSGGPALADVIFGQVSPSGKLAQSIPLDINDDPSMTNWPGEEGHVDYGEGVFVGYRYYDTYGKAVDCPFGYGLSYATF
EITGVAVAKTGANTATVNATVTNTSDVDAAETVQVYVAPGKADVARPKHELKGFTKVFLKSGESKTVTIDLDERAFAYWS
EKYNDWHVESGEYAIEVGTSSRDIAETVTVALEGDGKTQPLTEWSTYGEWEADPFGAKIVAAVAAAGEAGELPKLPDNAM
MRMFLNSMPINSLPTLLGEGGKKIAQFMVDEYAKLSK
>E7CY69 3.2.1.-~~~apy~~~Exo-alpha-(1->6)-L-arabinopyranosidase~~~COG1472
MSESTYPSVKDLTLEEKASLTSGGDAWHLQGVESKGIPSYMITDGPHGLRKSLASSAGETDLDDSVPATCFPPAAGLSSS
WNPELIHKVGEAMAEECIQEKVAVILGPGVNIKRNPLGGRCFEYWSEDPYLAGHEAIGIVEGVQSKGVGTSLKHFAANNQ
ETDRLRVDARISPRALREIYFPAFEHIVKKAQPWTIMCSYNRINGVHSAQNHWLLTDVLRDEWGFDGIVMSDWGADHDRG
ASLNAGLNLEMPPSYTDDQIVYAVRDGLITPAQLDRMAQGMIDLVNKTRAAMSIDNYRFDVDAHDEVAHQAAIESIVMLK
NDDAILPLNAGPVANPSATPQKIAVIGEFARTPRYQGGGSSHITPTKMTSFLDTLAERGIKADFAPGFTLDLEPADPALE
SEAVETAKNADVVLMFLGLPEAVESEGFDRDTLDMPAKQIALLEQVAAANQNVVVVLSNGSVITVAPWAKNAKGILESWL
LGQSGGPALADVIFGQVSPSGKLAQSIPLDINDDPSMLNWPGEEGHVDYGEGVFAGYRYYDTYGKAVDYPFGYGLSYATF
EITGVAVAKTGANTATVTATVTNTSDVDAAETVQVYVVPGKADVARPKHELKGFTKAFLKAGESKTVAIDLDERAFAYWS
EKYNDWHVEAGEYAIEVGVSSRDIADTVAVALDGDGKTQPLTEWSTYGEWEADPFGAKIVAAVAAAGEAGELTKLPDNAM
MRMFLNPMPINSLPTLLGEGGKKIAQFMLDEYAKLSK
>A0A0G3FWY4 3.5.1.-~~~aqdA1~~~Probable N-octanoylanthranilate hydrolase AqdA1~~~
MTANGDVRQPDARTYFTHQHPADYHADWKGYYERALVSRARSMERFAHELDIRYGTDPHQILNVFRAADTRSAPVIIYFH
GGRWREGHPAFYDHLADTWAADGAVFVSAGYRLTPEHSIADSVADAWAVTDWVVRNIAAYGGDPSRITVAGHSSGGHLAS
MVALTDNCAVSIVGLVCMSAPVDLRTLGFWDDDTLSPHLQISRVPRRVVVSFGDPEPNRKGDDALRLTREGQMLADSLVA
YGASLRTVVLPNADHVRTATAFADRQSPLFGAAHSVIFGDSTEDRSAPRSPHFQEEKQSCPE
>A0A0E4AET8 3.5.1.-~~~aqdA2~~~Probable N-octanoylanthranilate hydrolase AqdA2~~~
MFQTVTAPTGVWRGRVTGDVTVFHGIQYARADRFAPPQRCEPQLQHLVEVPEPGPIAPQSPSRLEGVMGAPSSLKQSEAC
LTVTVTTPHLAQPGSLPVLVWLHGGAFLSGSGAWEQYGAEQLVRETGIVVVSVNYRLGVLGYLCAPGISSGNLGLLDQIT
ALEWVRDNIEAFGGDNGRVTLDGQSAGAHSIVAMLGIDRARSLFSRAIIQSAPLGLGFHSVEQARRAAEIFEEELGSDPR
RAVVTDILAAQARTAHRLAGRGAMNSAPPFLPVHGMAPLPFVGEWNGKVAANAARRKILIGNTRDEMAAFFGPHPVFSAM
RRVPLAGPQLAGAIQRRVQKVVFDNPVQEFADRFASAGASVWRYGIGPLHPDNPFGACHCIDIPLLFGDGDTWRDAPMLR
PLSPKEIGESGTRTRRYWGEFVHTGRISDPAWPMHRPKSRYAHLLTDETIGGSA
>A0A0E4AFG7 1.14.13.182~~~aqdB1~~~Putative 2-heptyl-3-hydroxy-4(1H)-quinolone synthase AqdB1~~~
MSGVAGHAEVVGGGIGGLSAAIALGKRGWTVRLHERNDEIRASGSGIYLWDNGLAALDYLGALDSTLVGAHFGARMQTRD
AHNALVASSEVNRAGGPRVVTVARERLINALLASADAVGVEVVTGSTVTRVDAAGRIEFDNGHADADLIVVADGIGSRSR
DQLGVKTRRRQLNQKCARVLLPREPGMVPSEWVDEYVTFYSGQRFLLYTPCSADLLYLALVCPSDDAPATGDPLPREAWI
ASFPQLAPLIDRIGPTPRWDEFEMLTLDSWSSGRVAILGDAAHAQPPSLGQGGGCAMLSALGLAHSLSKNYDLTTALGEW
ESSERSVIQRTQWFSYWLARANKLPDRPRSLLLSAAGHSSLYRNNRMRAALTTPTGITSSK
>A0A0E4AFH6 1.14.13.182~~~aqdB2~~~Probable 2-heptyl-3-hydroxy-4(1H)-quinolone synthase AqdB2~~~
MTQRNAIVVGGGIGGLTAASALARQGWRVQLHERQPEIRAVGAGIYIWDNGLFALDAVHAYSEAIEGAHEPPSIDMRGQS
GKTLMRIKINGESQPRCLTLLRDQLIKALVNAAKDAGVELVTNSSVVAVRPEGEVHFEHGDHSTTDLVVVADGVHSRLRD
SVDLSYSRIRMSQGAARIMIPQSSHELPAEDRGRILESFHGSRRLLYTPCTPELVYLAFTCDSDDPAISGAYINTSEWSR
SFPTLSDALRATEGVPATRWDTFEYVRLASWSRGKVAFLGDAAHAQPPYLGQGGGTAMTNAIALANAVSSDMELSEALAT
WERITRPGIESTQRTSYQQRLLNYVPDRVRNPLVRIAGLTSNVAKSQLKATEIRPTLGSTGGSR
>B1MFK1 1.14.13.182~~~aqdB~~~2-heptyl-3-hydroxy-4(1H)-quinolone synthase~~~
MSSGHAEVVGGGIGGLTAATALALRGWTVRLHERDTRIRTVGAGIYVWDNGLEALDTIGAAAEGLDDAYEAPAITVRASD
GRPLYRIDVNQPGGARCVTLLRDRLIGALHVAAEHAGVEVCTGSAAVSATADGTVEFSTGTSTRADLVVAADGVHSLLRD
RLGISYRRIRMRQGAARVMVSGERPFIPGMDVDQHHEFLGGRRRLLYTPCTATQTYLAFVADNDDTATVGPELDLAAWAR
AFPLLVPVFDAARGRALIRWDNFELIRLSTWSHGRVAVLGDAAHAQPPYVGQGGGTAMNSAVGLAAAVSESADVEDGLNR
WEQALRPPIEKAQTTSYRMRLIGSVPEVLRGPLLGALGRSRSSATSQLIKKRSAA
>A0A0E4AE72 1.13.11.-~~~aqdC1~~~2-heptyl-3-hydroxy-4-quinolone dioxygenase AqdC1~~~
MNMPQLSTIQIGDHELAYLDNKLTSAVTPTIVMLPGWCGDHHSFSELIPQLNDTHRVVAVNWRGHAPVPHDVSDFGYAEQ
AQDALAILDAIGVDEFLPVSASHGGWALVQLLVDAGPARARAGVVLDWLMRRPTPEFTAALLSLQDPEGWVDSCRALFHT
WRPNDSDWVESRVERAKEFGFDMWARSGRVISGAYGEHGTPLEFMKTITPERHIRHLFSTPSDSDYVAPQEAFASENEWF
SYALLGGTSHFPHLEMPDRVAAHIVELAKNTYQAGAMR
>A0A0E4AE82 1.13.11.-~~~aqdC2~~~2-heptyl-3-hydroxy-4-quinolone dioxygenase AqdC2~~~
MTALMTLNGVRIEYQDIGTSSAGKPALVLLTGWGHDLRYYRRLIPHLAPEFRVVALSWRGHDADRTLVGDYGVHEQTADT
IALLDAIGVDVFVPVAHAHGGWVALQLADELGVQRVPRVLIADLIMTTIPSDFAAAVRDLQKPDRWKSARAGLAKSWLSG
GVTLPLLKHLLIESRGFGFDTWARSGRVIEDAYNRWGSPMGRMEQLNEPRPIRHVFSHPKTSSYDELHVAFRGRHPWFSH
RRLAGRTHFPAHELPREIATEIRAFVNEST
>B1MFK2 1.13.11.-~~~aqdC~~~2-heptyl-3-hydroxy-4(1H)-quinolone dioxygenase~~~
MITTKTVNGVQIAFDDQGHEPGPVFVTLSGWAHDLRAYDGMLPYLRAAQRTVRVCWRGHGPDRNLVGDFGIDEMAADTIG
LLDALEVDSFVPIAHAHGGWAALEIADRLGAQRVPAVMILDLIMTPAPREFVAALHGIQDPERWKEGRDGLVQSWLAGTT
NQAVLDHVRYDSGGHGFDMWARAGRVIDEAYRTWGSPMRRMEALAEPCAIRHVFSHPKIGEYDALHDDFAARHPWFSYRR
LGGETHFPGIELPQQVAAEAIDLLAGARI
>A0A0E4AFE3 ~~~aqdR~~~HTH-type transcriptional regulator AqdR~~~
MTDPEHSGQRAQVRGARFRERVLDATIACITEAGVDNVGFADVARKAGVNGVSLYRRWKTVPRLLIDALLTRTQAEVPIP
DTGSVHRDLEIFATELTKFAQTPIGTALIRFTVVSADSPEVDVSRREFWMQRLTAAEEIIERGKNRGEVDSSTDSRLVVL
TLGGLVHIYVTHLGTDIPTSLPHQAVSLILSGVTSGVTQARTNAQMG
>P08594 3.4.21.111~~~pstI~~~Aqualysin-1~~~
MRKTYWLMALFAVLVLGGCQMASRSDPTPTLAEAFWPKEAPVYGLDDPEAIPGRYIVVFKKGKGQSLLQGGITTLQARLA
PQGVVVTQAYTGALQGFAAEMAPQALEAFRQSPDVEFIEADKVVRAWATQSPAPWGLDRIDQRDLPLSNSYTYTATGRGV
NVYVIDTGIRTTHREFGGRARVGYDALGGNGQDCNGHGTHVAGTIGGVTYGVAKAVNLYAVRVLDCNGSGSTSGVIAGVD
WVTRNHRRPAVANMSLGGGVSTALDNAVKNSIAAGVVYAVAAGNDNANACNYSPARVAEALTVGATTSSDARASFSNYGS
CVDLFAPGASIPSAWYTSDTATQTLNGTSMATPHVAGVAALYLEQNPSATPASVASAILNGATTGRLSGIGSGSPNRLLY
SLLSSGSGSTAPCTSCSYYTGSLSGPGDYNFQPNGTYYYSPAGTHRAWLRGPAGTDFDLYLWRWDGSRWLTVGSSTGPTS
EESLSYSGTAGYYLWRIYAYSGSGMYEFWLQRP
>Q92R43 ~~~aqpS~~~Aquaglyceroporin AqpS~~~COG0580
MQEFDLTRRCVAEALGTGLLVAAVVGSGIMADALTADDALALVANTIATGAILVVLVTILGPLSGAHFNPAVSLVFALSG
RLTRRDCAAYVIAQVAGAIAGTALAHLMFDLPPLDMSMKVRTGPAQWLSEGVAAFGLVATILAGIRFHREAVPWLVGLYI
TAAYWFTASTSFANPAVALARSFTNTFSGIRPGDLPGFVIAELLGAVCALALMRWLLQPARPIIRQTSPETAP
>Q8UJW4 ~~~aqpZ2~~~Aquaporin Z 2~~~COG0580
MGRKLLAEFFGTFWLVFGGCGSAVFAAAFPELGIGFTGVALAFGLTVLTMAYAVGGISGGHFNPAVSVGLTVAGRFPASS
LVPYVIAQVAGAIVAAAALYVIATGKAGIDLGGFASNGYGEHSPGGYSLVSALLIEIILTAFFLIVILGSTHGRVPAGFA
PIAIGLALTLIHLISIPVTNTSVNPARSTGQALFVGGWALQQLWLFWLAPIVGGAAGAVIWKLFGEKD
>P60844 ~~~aqpZ~~~Aquaporin Z~~~COG0580
MFRKLAAECFGTFWLVFGGCGSAVLAAGFPELGIGFAGVALAFGLTVLTMAFAVGHISGGHFNPAVTIGLWAGGRFPAKE
VVGYVIAQVVGGIVAAALLYLIASGKTGFDAAASGFASNGYGEHSPGGYSMLSALVVELVLSAGFLLVIHGATDKFAPAG
FAPIAIGLALTLIHLISIPVTNTSVNPARSTAVAIFQGGWALEQLWFFWVVPIVGGIIGGLIYRTLLEKRD
>Q53TZ2 1.1.1.376~~~araA~~~L-arabinose 1-dehydrogenase (NAD(P)(+))~~~
MSDQVSLGVVGIGKIARDQHLPAIDAEPGFKLTACASRHAEVTGVRNYRDLRALLAAERELDAVSLCAPPQVRYAQARAA
LEAGKHVMLEKPPGATLGEVAVLEALARERGLTLFATWHSRCASAVEPAREWLATRAIRAVQVRWKEDVRRWHPGQQWIW
EPGGLGVFDPGINALSIVTRILPRELVLREATLIVPSDVQTPIAAELDCADTDGVPVRAEFDWRHGPVEQWEIAVDTADG
VLAISRGGAQLSIAGEPVELGPEREYPALYAHFHALIARGESDVDVRPLRLVADAFLFGRRVQTDAFGR
>P94523 5.3.1.4~~~araA~~~L-arabinose isomerase~~~COG2160
MLQTKDYEFWFVTGSQHLYGEETLELVDQHAKSICEGLSGISSRYKITHKPVVTSPETIRELLREAEYSETCAGIITWMH
TFSPAKMWIEGLSSYQKPLMHLHTQYNRDIPWGTIDMDFMNSNQSAHGDREYGYINSRMGLSRKVIAGYWDDEEVKKEMS
QWMDTAAALNESRHIKVARFGDNMRHVAVTDGDKVGAHIQFGWQVDGYGIGDLVEVMDRITDDEVDTLYAEYDRLYVISE
ETKRDEAKVASIKEQAKIELGLTAFLEQGGYTAFTTSFEVLHGMKQLPGLAVQRLMEKGYGFAGEGDWKTAALVRMMKIM
AKGKRTSFMEDYTYHFEPGNEMILGSHMLEVCPTVALDQPKIEVHSLSIGGKEDPARLVFNGISGSAIQASIVDIGGRFR
LVLNEVNGQEIEKDMPNLPVARVLWKPEPSLKTAAEAWILAGGAHHTCLSYELTAEQMLDWAEMAGIESVLISRDTTIHK
LKHELKWNEALYRLQK
>P08202 5.3.1.4~~~araA~~~L-arabinose isomerase~~~COG2160
MTIFDNYEVWFVIGSQHLYGPETLRQVTQHAEHVVNALNTEAKLPCKLVLKPLGTTPDEITAICRDANYDDRCAGLVVWL
HTFSPAKMWINGLTMLNKPLLQFHTQFNAALPWDSIDMDFMNLNQTAHGGREFGFIGARMRQQHAVVTGHWQDKQAHERI
GSWMRQAVSKQDTRHLKVCRFGDNMREVAVTDGDKVAAQIKFGFSVNTWAVGDLVQVVNSISDGDVNALVDEYESCYTMT
PATQIHGKKRQNVLEAARIELGMKRFLEQGGFHAFTTTFEDLHGLKQLPGLAVQRLMQQGYGFAGEGDWKTAALLRIMKV
MSTGLQGGTSFMEDYTYHFEKGNDLVLGSHMLEVCPSIAAEEKPILDVQHLGIGGKDDPARLIFNTQTGPAIVASLIDLG
DRYRLLVNCIDTVKTPHSLPKLPVANALWKAQPDLPTASEAWILAGGAHHTVFSHALNLNDMRQFAEMHDIEITVIDNDT
RLPAFKDALRWNEVYYGFRR
>Q5KYP7 5.3.1.4~~~araA~~~L-arabinose isomerase~~~COG2160
MLSLRPYEFWFVTGSQHLYGEEALKQVEEHSRIMVNEWNRDSVFPFPFVFKSVVTTPEEIRRVCLEANASEQCAGVVTWM
HTFSPAKMWIGGLLELRKPLLHLHTQFNRDIPWDSIDMDFMNLNQSAHGDREYGFIGARMGVARKVVVGHWEDPEVRERL
AKWMRTAVAFAESRNLKVARFGDNMREVAVTEGDKVGAQIQFGWSVNGYGIGDLVQYIRDVSEQKVNELLDEYEELYDIV
PAGRQEGPVRESIREQARIELGLKAFLQDGNFTAFTTTFEDLHGMKQLPGLAVQRLMAEGYGFGGEGDWKTAALVRLMKV
MADGKGTSFMEDYTYHFEPGNELILGAHMLEVCPTIAATRPRVEVHPLSIGGKEDPARLVFDGGEGAAVNASLIDLGHRF
RLIVNEVDAVKPEHDMPKLPVARILWKPRPSLRDSAEAWILAGGAHHTCFSFAVTTEQLQDFAEMAGIECVVINEHTSVS
SFKNELKWNEVFWRGR
>Q9S467 5.3.1.4~~~araA~~~L-arabinose isomerase~~~
MLSLRPYEFWFVTGSQHLYGEEALKQVEEHSMMIVNELNQDSVFPFPLVFKSVVTTPEEIRRVCLEANASEQCAGVITWM
HTFSPAKMWIGGLLELRKPLLHLHTQFNRDIPWDSIDMDFMNLNQSAHGDREYGFIGARMGVARKVVVGHWEDPEVRERL
AKWMRTAVAFAESRNLKVARFGDNMREVAVTEGDKVGAQIQFGWSVNGYGIGDLVQYIRDVSEQKVNELLDEYEELYDIV
PAGRQEGPVRESIREQARIELGLKAFLQDGNFTAFTTTFEDLHGMKQLPGLAVQRLMAEGYGFGGEGDWKTAALVRLMKV
MADGKGTSFMEDYTYHLEPGNEMILGAHMLEVCPTIAATRPRIEVHPLSIGGKEDPARLVFDGGEGAAVNASLIDLGHRF
RLIVNEVDAVKPEHDMPKLPVARILWKPRPSLRDSAEAWILAGGAHHTCFSFAVTTEQLQDFAEMAGIECVVINEHTSVS
SFKNELKWNEVFWRGR
>Q9WYB3 5.3.1.4~~~araA~~~L-arabinose isomerase~~~COG2160
MIDLKQYEFWFLVGSQYLYGLETLKKVEQQASKIVDSLNDDPIFPSKIVLKPVLKSSSEITEIFEKANADPKCAGVIVWM
HTFSPSKMWIRGLSINKKPLLHLHTQYNREIPWDTIDMDYMNLNQSAHGDREHGFIHARMRLPRKVVVGHWEEKEVREKI
AKWMRVACAIQDGRMGQIVRFGDNMREVASTEGDKVEAQIKLGWSINTWGVGELAERVKAVPEREVEELLKEYREKYIMP
EDEYSLKAIREQAKIEIALREFLKEKNAVGFTTTFEDLHDLPQLPGLAVQRLMEEGYGFGAEGDWKAAGLVRAIKVMGTS
LPGGTSFMEDYTYHLTPGNELVLGAHMLEVCPTIAKEKPRIEVHPLSIGGKADPARLVFDGQEGPAVNASIVDMGNRFRL
VVNKVLSVPIERKMPKLPTARVLWKPLPDFKRATTAWILAGGSHHTAFSTAIDVEYLIDWAEALEIEYVVIDENLDLEDF
KKELRWNELYWGLLKR
>P94524 2.7.1.16~~~araB~~~Ribulokinase~~~COG1069
MAYTIGVDFGTLSGRAVLVHVQTGEELAAAVKEYRHAVIDTVLPKTGQKLPRDWALQHPADYLEVLETTIPSLLEQTGVD
PKDIIGIGIDFTACTILPIDSSGQPLCMLPEYEEEPHSYVKLWKHHAAQKHADRLNQIAEEEGEAFLQRYGGKISSEWMI
PKVMQIAEEAPHIYEAADRIIEAADWIVYQLCGSLKRSNCTAGYKAMWSEKAGYPSDDFFEKLNPSMKTITKDKLSGSIH
SVGEKAGSLTEKMAKLTGLLPGTAVAVANVDAHVSVPAVGITEPGKMLMIMGTSTCHVLLGEEVHIVPGMCGVVDNGILP
GYAGYEAGQSCVGDHFDWFVKTCVPPAYQEEAKEKNIGVHELLSEKANHQAPGESGLLALDWWNGNRSTLVDADLTGMLL
GMTLLTKPEEIYRALVEATAYGTRMIIETFKESGVPIEELFAAGGIAEKNPFVMQIYADVTNMDIKISGSPQAPALGSAI
FGALAAGKEKGGYDDIKKAAANMGKLKDITYTPNAENAAVYEKLYAEYKELVHYFGKENHVMKRLKTIKNLQFSSAAKKN
>P08204 2.7.1.16~~~araB~~~Ribulokinase~~~COG1069
MAIAIGLDFGSDSVRALAVDCATGEEIATSVEWYPRWQKGQFCDAPNNQFRHHPRDYIESMEAALKTVLAELSVEQRAAV
VGIGVDSTGSTPAPIDADGNVLALRPEFAENPNAMFVLWKDHTAVEEAEEITRLCHAPGNVDYSRYIGGIYSSEWFWAKI
LHVTRQDSAVAQSAASWIELCDWVPALLSGTTRPQDIRRGRCSAGHKSLWHESWGGLPPASFFDELDPILNRHLPSPLFT
DTWTADIPVGTLCPEWAQRLGLPESVVISGGAFDCHMGAVGAGAQPNALVKVIGTSTCDILIADKQSVGERAVKGICGQV
DGSVVPGFIGLEAGQSAFGDIYAWFGRVLGWPLEQLAAQHPELKTQINASQKQLLPALTEAWAKNPSLDHLPVVLDWFNG
RRTPNANQRLKGVITDLNLATDAPLLFGGLIAATAFGARAIMECFTDQGIAVNNVMALGGIARKNQVIMQACCDVLNRPL
QIVASDQCCALGAAIFAAVAAKVHADIPSAQQKMASAVEKTLQPCSEQAQRFEQLYRRYQQWAMSAEQHYLPTSAPAQAA
QAVATL
>Q9KBQ3 2.7.1.16~~~araB~~~Ribulokinase~~~COG1069
MTTKYTIGVDYGTESGRAVLIDLSNGQELADHVTPYRHGVIDQYLPNTNIKLGHEWALQHPLDYVEVLTTSVPAVMKESG
VDADDVIGIGVDFTACTMLPVDEEGQPLCLLAQYKDNPHSWVKLWKHHAAQDKANAINEMAEKRGEAFLPRYGGKISSEW
MIAKVWQILDEAEDVYNRTDQFLEATDWIVSQMTGKIVKNSCTAGYKAIWHKREGYPSNEFFKALDPRLEHLTTTKLRGD
IVPLGERAGGLLPEMAEKMGLNPGIAVAVGNVDAHAAVPAVGVTTPGKLVMAMGTSICHMLLGEKEQEVEGMCGVVEDGI
IPGYLGYEAGQSAVGDIFAWFVKHGVSAATFDEAQEKGVNVHALLEEKASQLRPGESGLLALDWWNGNRSILVDTELSGM
LLGYTLQTKPEEIYRALLEATAFGTRAIVDAFHGRGVEVHELYACGGLPQKNHLLMQIFADVTNREIKVAASKQTPALGA
AMFASVAAGSEVGGYDSIEEAAKKMGRVKDETFKPIPEHVAIYEKLYQEYVTLHDYFGRGANDVMKRLKALKSIQHRPSS
LLT
>P0A9E0 ~~~araC~~~Arabinose operon regulatory protein~~~COG4977
MAEAQNDPLLPGYSFNAHLVAGLTPIEANGYLDFFIDRPLGMKGYILNLTIRGQGVVKNQGREFVCRPGDILLFPPGEIH
HYGRHPEAREWYHQWVYFRPRAYWHEWLNWPSIFANTGFFRPDEAHQPHFSDLFGQIINAGQGEGRYSELLAINLLEQLL
LRRMEAINESLHPPMDNRVREACQYISDHLADSNFDIASVAQHVCLSPSRLSHLFRQQLGISVLSWREDQRISQAKLLLS
TTRMPIATVGRNVGFDDQLYFSRVFKKCTGASPSEFRAGCEEKVNDVAVKLS
>Q1JUQ1 4.2.1.25~~~araC~~~L-arabonate dehydratase~~~
MSATKPRLRSTQWFGTNDKNGFMYRSWMKNQGIPDHEFDGRPIIGICNTWSELTPCNAHFRKLAEHVKRGISEAGGFPVE
FPVFSNGESNLRPSAMLTRNLASMDVEEAIRGNPIDAVVLLAGCDKTTPALLMGAASCDVPAIVVSGGPMLNGKLEGKNI
GSGTAVWQLHEALKAGEIDVHHFLSAEAGMSRSAGTCNTMGTASTMACMAEALGVALPHNAAIPAVDSRRYVLAHMSGIR
IVEMALEGLVLSKILTRAAFENAIRANAAIGGSTNAVIHLKAIAGRIGVPLELEDWMRIGRDTPTIVDLMPSGRFPMEEF
YYAGGLPAVLRRLGEGGLLPNPDALTVNGKSLWDNVREAPNYDEEVIRPLDRPLIADGGIRILRGNLAPRGAVLKPSAAS
PELLKHRGRAVVFENLDHYKATINDEALDIDASSVMVLKNCGPRGYPGMAEVGNMGLPPKLLRQGVKDMVRISDARMSGT
AYGTVVLHVAPEAAAGGPLAAVRNGDWIELDCEAGTLHLDITDDELHRRLSDVDPTAAPGVAGQLGKGGYARLYIDHVLQ
ADEGCDLDFLVGTRGAEVPSHSH
>P94525 5.1.3.4~~~araD~~~L-ribulose-5-phosphate 4-epimerase~~~COG0235
MLETLKKEVLAANLKLQEHQLVTFTWGNVSGIDREKERIVIKPSGVEYSDLTADDLVVLNLDGEVVEGSLKPSSDTPTHV
YLYKAFPNIGGIVHTHSQWATSWAQSGRDIPPLGTTHADYFDSAIPCTREMYDEEIIHDYELNTGKVIAETFQHHNYEQV
PGVLVNNHGPFCWGTDALNAIHNAVVLETVAEMAYHSIMLNKDVTPINTVLHEKHFYRKHGANAYYGQS
>P08203 5.1.3.4~~~araD~~~L-ribulose-5-phosphate 4-epimerase AraD~~~COG0235
MLEDLKRQVLEANLALPKHNLVTLTWGNVSAVDRERGVFVIKPSGVDYSVMTADDMVVVSIETGEVVEGTKKPSSDTPTH
RLLYQAFPSIGGIVHTHSRHATIWAQAGQSIPATGTTHADYFYGTIPCTRKMTDAEINGEYEWETGNVIVETFEKQGIDA
AQMPGVLVHSHGPFAWGKNAEDAVHNAIVLEEVAYMGIFCRQLAPQLPDMQQTLLDKHYLRKHGAKAYYGQ
>B5ZZ34 4.2.1.25~~~araD~~~L-arabinonate dehydratase~~~COG0129
MKKKAEWPRKLRSQEWYGGTSRDVIYHRGWLKNQGYPHDLFDGRPVIGILNTWSDMTPCNGHLRELAEKVKAGVWEAGGF
PLEVPVFSASENTFRPTAMMYRNLAALAVEEAIRGQPMDGCVLLVGCDKTTPSLLMGAASCDLPSIVVTGGPMLNGYFRG
ERVGSGTHLWKFSEMVKAGEMTQAEFLEAEASMSRSSGTCNTMGTASTMASMAEALGMALSGNAAIPGVDSRRKVMAQLT
GRRIVQMVKDDLKPSEIMTKQAFENAIRTNAAIGGSTNAVIHLLAIAGRVGIDLSLDDWDRCGRDVPTIVNLMPSGKYLM
EEFFYAGGLPVVLKRLGEAGLLHKDALTVSGETVWDEVKDVVNWNEDVILPAEKALTSSGGIVVLRGNLAPKGAVLKPSA
ASPHLLVHKGRAVVFEDIDDYKAKINDDNLDIDENCIMVMKNCGPKGYPGMAEVGNMGLPPKVLKKGILDMVRISDARMS
GTAYGTVVLHTSPEAAVGGPLAVVKNGDMIELDVPNRRLHLDISDEELARRLAEWQPNHDLPTSGYAFLHQQHVEGADTG
ADLDFLKGCRGNAVGKDSH
>P96710 ~~~araE~~~Arabinose-proton symporter~~~COG2814
MKNTPTQLEPNVPVTRSHSMGFVILISCAAGLGGLLYGYDTAVISGAIGFLKDLYSLSPFMEGLVISSIMIGGVVGVGIS
GFLSDRFGRRKILMTAALLFAISAIVSALSQDVSTLIIARIIGGLGIGMGSSLSVTYITEAAPPAIRGSLSSLYQLFTIL
GISATYFINLAVQRSGTYEWGVHTGWRWMLAYGMVPSVIFFLVLLVVPESPRWLAKAGKTNEALKILTRINGETVAKEEL
KNIENSLKIEQMGSLSQLFKPGLRKALVIGILLALFNQVIGMNAITYYGPEIFKMMGFGQNAGFVTTCIVGVVEVIFTVI
AVLLIDKVGRKKLMSIGSAFMAIFMILIGTSFYFELTSGIMMIVLILGFVAAFCVSVGPITWIMISEIFPNHLRARAAGI
ATIFLWGANWAIGQFVPMMIDSFGLAYTFWIFAVINILCFLFVVTICPETKNKSLEEIEKLWIK
>P0AE24 ~~~araE~~~Arabinose-proton symporter~~~COG2814
MVTINTESALTPRSLRDTRRMNMFVSVAAAVAGLLFGLDIGVIAGALPFITDHFVLTSRLQEWVVSSMMLGAAIGALFNG
WLSFRLGRKYSLMAGAILFVLGSIGSAFATSVEMLIAARVVLGIAVGIASYTAPLYLSEMASENVRGKMISMYQLMVTLG
IVLAFLSDTAFSYSGNWRAMLGVLALPAVLLIILVVFLPNSPRWLAEKGRHIEAEEVLRMLRDTSEKAREELNEIRESLK
LKQGGWALFKINRNVRRAVFLGMLLQAMQQFTGMNIIMYYAPRIFKMAGFTTTEQQMIATLVVGLTFMFATFIAVFTVDK
AGRKPALKIGFSVMALGTLVLGYCLMQFDNGTASSGLSWLSVGMTMMCIAGYAMSAAPVVWILCSEIQPLKCRDFGITCS
TTTNWVSNMIIGATFLTLLDSIGAAGTFWLYTALNIAFVGITFWLIPETKNVTLEHIERKLMAGEKLRNIGV
>A0A3R0A696 3.2.1.55~~~blArafA~~~Alpha-L-arabinofuranosidase~~~
MKHWKKMAASLIAISTMVAVVPTTYAMESEDSQPQTTDTATVQTTKAAEPTLLASWDFTGKNGTTNSAIADSTGKYNLTL
KDGAKIEQYGDRSNNEALSLRGDGQYAQIDDQLFKDAGDSFTLEFASKTRHDDSGKFFSFIVGKDGSNDANTTDQANANK
YLMFYNSKTAIKGVISNNNWGNEQGSKVTVSGNDNSWADYKIVVDGTNLAVFRNNALIIFKANTGIKMSDLGATTAYIGK
SFYSVDEYWNGAMDDIKVYRGADLTMPTAVAISGTGVVNNKLTLIEKDSTKLTATVTPDDAVSKNVTWSSSDESVAKVAA
DGTVTGVKAGTATITATTELGGVKAELPVTVEPMNAQNAAAADLDAAIAALKVPAAENLPLVAKGTKNGSAITWKSSDEK
LITSTNEKYENRTTGADDPYRGAGIINRPAYGDGDSKPVTLTATASYNGGEKVTKTIEVTVKEKTRIAPDTGYAAVTFES
DSNGGEKAWVASTEKNDFFTFKTRNNGQAVLTNDADTGGLRDMFVLRSHEGDKYYLIATDLKVSSMGWSQNQVNGSRKVE
VYESTDMMNWTRTNGDGNGGITINTPNAGMTWAPEAYWDDDLNAYVVFFSSRMFTDDTRTTPVKNDKTGNSSYAQVRYAI
TRDFVNFTEPQMWQDTGYSRIDSTVRKIGGYYYRFTKNEQGGAAGDYITTGKSIFLERSKVLTAPTTEASPGQDPNTGWQ
LLEQALLPFEGPETIKLNKDDELNTKDDDGYILLSDNFAYRAFMTTGAELSKTTWDNPMTKRYPDFNNEKKPVKAEPGAQ
GYITQGANGGLPDKVRHGAFVNVPESVLKVTKSWTAANPTHIEAVDSTTKAVYNAGTRELTATVTSADKGTLAGSVKFSA
GDWSKTVKLDAEGKATVTLPASVSGTVAVAYDGYTDGLVNPSDTTVDGIEQGKVDLAELNKQIAAAEALKESDYTADSWA
KLAAALKTAKAALAAENQGEVDTAAADLKTAIEALQKAPTNPGEGDGDKGDGNKPTTPTTGDKTNVNKPGSALSNTGTAV
LGLGGAVVALAIAGISLTLWRKRRA
>P02924 ~~~araF~~~L-arabinose-binding periplasmic protein~~~COG1879
MHKFTKALAAIGLAAVMSQSAMAENLKLGFLVKQPEEPWFQTEWKFADKAGKDLGFEVIKIAVPDGEKTLNAIDSLAASG
AKGFVICTPDPKLGSAIVAKARGYDMKVIAVDDQFVNAKGKPMDTVPLVMMAATKIGERQGQELYKEMQKRGWDVKESAV
MAITANELDTARRRTTGSMDALKAAGFPEKQIYQVPTKSNDIPGAFDAANSMLVQHPEVKHWLIVGMNDSTVLGGVRATE
GQGFKAADIIGIGINGVDAVSELSKAQATGFYGSLLPSPDVHGYKSSEMLYNWVAKDVEPPKFTEVTDVVLITRDNFKEE
LEKKGLGGK
>P0AAF3 7.5.2.12~~~araG~~~Arabinose import ATP-binding protein AraG~~~COG1129
MQQSTPYLSFRGIGKTFPGVKALTDISFDCYAGQVHALMGENGAGKSTLLKILSGNYAPTTGSVVINGQEMSFSDTTAAL
NAGVAIIYQELHLVPEMTVAENIYLGQLPHKGGIVNRSLLNYEAGLQLKHLGMDIDPDTPLKYLSIGQWQMVEIAKALAR
NAKIIAFDEPTSSLSAREIDNLFRVIRELRKEGRVILYVSHRMEEIFALSDAITVFKDGRYVKTFTDMQQVDHDALVQAM
VGRDIGDIYGWQPRSYGEERLRLDAVKAPGVRTPISLAVRSGEIVGLFGLVGAGRSELMKGMFGGTQITAGQVYIDQQPI
DIRKPSHAIAAGMMLCPEDRKAEGIIPVHSVRDNINISARRKHVLGGCVINNGWEENNADHHIRSLNIKTPGAEQLIMNL
SGGNQQKAILGRWLSEEMKVILLDEPTRGIDVGAKHEIYNVIYALAAQGVAVLFASSDLPEVLGVADRIVVMREGEIAGE
LLHEQADERQALSLAMPKVSQAVA
>P0AE26 ~~~araH~~~L-arabinose transport system permease protein AraH~~~COG1172
MSSVSTSGSGAPKSSFSFGRIWDQYGMLVVFAVLFIACAIFVPNFATFINMKGLGLAISMSGMVACGMLFCLASGDFDLS
VASVIACAGVTTAVVINLTESLWIGVAAGLLLGVLCGLVNGFVIAKLKINALITTLATMQIVRGLAYIISDGKAVGIEDE
SFFALGYANWFGLPAPIWLTVACLIIFGLLLNKTTFGRNTLAIGGNEEAARLAGVPVVRTKIIIFVLSGLVSAIAGIILA
SRMTSGQPMTSIGYELIVISACVLGGVSLKGGIGKISYVVAGILILGTVENAMNLLNISPFAQYVVRGLILLAAVIFDRY
KQKAKRTV
>P23910 ~~~araJ~~~Putative transporter AraJ~~~COG2814
MKKVILSLALGTFGLGMAEFGIMGVLTELAHNVGISIPAAGHMISYYALGVVVGAPIIALFSSRYSLKHILLFLVALCVI
GNAMFTLSSSYLMLAIGRLVSGFPHGAFFGVGAIVLSKIIKPGKVTAAVAGMVSGMTVANLLGIPLGTYLSQEFSWRYTF
LLIAVFNIAVMASVYFWVPDIRDEAKGNLREQFHFLRSPAPWLIFAATMFGNAGVFAWFSYVKPYMMFISGFSETAMTFI
MMLVGLGMVLGNMLSGRISGRYSPLRIAAVTDFIIVLALLMLFFCGGMKTTSLIFAFICCAGLFALSAPLQILLLQNAKG
GELLGAAGGQIAFNLGSAVGAYCGGMMLTLGLAYNYVALPAALLSFAAMSSLLLYGRYKRQQAADTPVLAKPLG
>Q1JUP5 3.1.1.15~~~araB~~~L-arabinolactonase~~~
MQQIHPAGQATLLADTRNTLGEGATWCDRTRALYWVDIEGAQLWRCRADGSDLTPWPMPERLACFALTDDPDVLLVGLAT
HLAFFDLRSGAFTRIVEVEPELPTRLNDGRCDGSGAFVFGMKDEGAEPPRAVGGFYRLNADLTLERLALPPAAIANSIGF
SPDGSKMYFCDSLVREIFVCDYRPGGEVANVRPFARLTDPDGDPDGSIVDRDGGLWNAQWGGRRVVRYGPDGVETDRVAV
PTAQPSCTALDGEGRLYVTSARVGLSDDALADDPHAGGVFVAQTRHAGMATARFAGTPRG
>P94526 3.1.3.23~~~araL~~~Sugar-phosphatase AraL~~~COG0647
MRIMASHDTPVSPAGILIDLDGTVFRGNELIEGAREAIKTLRRMGKKIVFLSNRGNISRAMCRKKLLGAGIETDVNDIVL
SSSVTAAFLKKHYRFSKVWVLGEQGLVDELRLAGVQNASEPKEADWLVISLHETLTYDDLNQAFQAAAGGARIIATNKDR
SFPNEDGNAIDVAGMIGAIETSAQAKTELVVGKPSWLMAEAACTAMGLSAHECMIIGDSIESDIAMGKLYGMKSALVLTG
SAKQGEQRLYTPDYVLDSIKDVTKLAEEGILI
>P94528 ~~~araN~~~Arabinooligosaccharide-binding protein~~~COG1653
MKKMTVCFLVLMMLLTLVIAGCSAEKSSGKSGETELTFWTFNGLHEQFYVEMVKEWNKKYPDRKIKLNTVVYPYGQMHDN
LSISLIAGEGVPDIADVELARFSNFLKGSDIPLADLTPLIEKDRDKFVEARLTLYSKNGKLYGLDTHVGTTVMFYNMDVM
KKAGVNPDDIKTWDDYHKAGQKVRKVTGKPMGTVETNDSATFLSMISQQNSGYFDKNGKLILNNDTNVKTLQYLKDMIND
KTMIPAPGGGHHSEEYYGFMNQGGAASVLMPIWYMGRFIDYMPDLKGKIAIRPLPAWKEGGDRSAGLGGTATVVPKQSKH
VELAKEFLAFAKGSEEGNKKLWSVLGFDPLRWDVWSSKELKEKNKYTDYFQNGTGIFSVLLDIKDEINPIYLHEDFAKAS
DLVNRSVLFDALKSQQKTPKQALDRAAGELKQK
>P94529 ~~~araP~~~Arabinooligosaccharides transport system permease protein AraP~~~COG1175
MKPVKTGTVHPVPSAAKQSGWRDLFYSKKAAPYLFTAPFVLSFLVFFLYPIISVFIMSFQRILPGEVSFVGLSNYTALNN
PTFYTALWNTLEYTFWTLIVLIPVPLLLAIFLNSKLVKFRNIFKSALFIPALTSTIVAGIIFRLIFGEMETSLANSILLK
LGFSPQNWMNNEHTGMFLMVLLASWRWMGINILYFLAGLQNVPKELYEAADIDGANTMKKFLHITLPFLKPVTVYVLTIS
IIGGFRMFEESYVLWQNNSPGNIGLTLVGYLYQQGLAYNEMGYGAAIGIVLLIVILVVSLISLKLSGSFKGEG
>P94530 ~~~araQ~~~Arabinooligosaccharides transport system permease protein AraQ~~~COG0395
MLRHSPQFSVYRIALTLFFMMLSLLYLFPIFCLLLGSLKPSSELLRVGLNLDIDPKVMSFDNYTFLFNGGSIYFKWFFNS
LVLGLFTTVLTLFFSSMIGYGLAVYDFKGRNIIFVLVLIIMMVPLEVMMLPLFKLTVGLHLIDSYTGVILPFIVSPVAVF
FFRQYALGLPRDLLDSARMDGCTEFGIFFRIMAPLMKPAFGAMIILQSLNSWNNFLWPLIVLRSKEMFTLPIGLSSLLSP
YGNNYDMLISGSVFAILPVIIIFLFFQKYFISGLTVGGVKG
>P96711 ~~~araR~~~Arabinose metabolism transcriptional repressor~~~COG1609
MLPKYAQVKEEISSWINQGKILPDQKIPTENELMQQFGVSRHTIRKAIGDLVSQGLLYSVQGGGTFVASRSAKSALHSNK
TIGVLTTYISDYIFPSIIRGIESYLSEQGYSMLLTSTNNNPDNERRGLENLLSQHIDGLIVEPTKSALQTPNIGYYLNLE
KNGIPFAMINASYAELAAPSFTLDDVKGGMMAAEHLLSLGHTHMMGIFKADDTQGVKRMNGFIQAHRERELFPSPDMIVT
FTTEEKESKLLEKVKATLEKNSKHMPTAILCYNDEIALKVIDMLREMDLKVPEDMSIVGYDDSHFAQISEVKLTSVKHPK
SVLGKAAAKYVIDCLEHKKPKQEDVIFEPELIIRQSARKLNE
>P95470 3.2.1.55~~~arbA~~~Extracellular exo-alpha-(1->5)-L-arabinofuranosidase ArbA~~~COG3507
MPTHHPITRQHWHHSWLSALALLCASLACGAKQVDVHDPVMTREGDTWYLFSTGPGITIYSSKDRVNWRYSDRAFGTEPT
WAKRVSPSFDGHLWAPDIYQHKGLFYLYYSVSAFGKNTSAIGVTVNKTLNPASPDYRWEDKGIVIESVPQRDLWNAIDPA
IIADDHGQVWMSFGSFWGGLKLFKLNDDLTRPAEPQEWHSIAKLERSVLMDDSQAGSAQIEAPFILRKGDYYYLFASWGL
CCRKGDSTYHLVVGRSKQVTGPYLDKTGRDMNQGGGSLLIKGNKRWVGLGHNSAYTWDGKDYLVLHAYEAADNYLQKLKI
LNLHWDGEGWPQVDEKELDSYISQRLK
>Q46134 2.4.2.-~~~c3~~~Mono-ADP-ribosyltransferase C3~~~
MNKLTERVLCVGVSGLILFSVAALVQGTKKCYANPVRNRAASRVKPYADSFKEFTNIDEARAWGDKQFAKYKLSSSEKNA
LTIYTRNAARINGPLRANQGNTNGLPADIRKEVEQIDKSFTKMQTPENIILFRGDDPGYLGPDFENTILNRDGTINKAVF
EQVKLRFKGKDRKEYGYISTSLVNGSAFAGRPIITKFKVLDGSKAGYIEPISTFKGQLEVLLPRSSTYTISDMQIAPNNK
QIIITALLKR
>P0A9Q1 ~~~arcA~~~Aerobic respiration control protein ArcA~~~COG0745
MQTPHILIVEDELVTRNTLKSIFEAEGYDVFEATDGAEMHQILSEYDINLVIMDINLPGKNGLLLARELREQANVALMFL
TGRDNEVDKILGLEIGADDYITKPFNPRELTIRARNLLSRTMNLGTVSEERRSVESYKFNGWELDINSRSLIGPDGEQYK
LPRSEFRAMLHFCENPGKIQSRAELLKKMTGRELKPHDRTVDVTIRRIRKHFESTPDTPEIIATIHGEGYRFCGDLED
>P23793 3.5.3.6~~~arcA~~~Arginine deiminase~~~COG2235
MSVFDSKFKGIHVYSEIGELESVLVHEPGREIDYITPARLDELLFSAILESHDARKEHKQFVAELKANDINVVELIDLVA
ETYDLASQEAKDKLIEEFLEDSEPVLSEEHKVVVRNFLKAKKTSRELVEIMMAGITKYDLGIEADHELIVDPMPNLYFTR
DPFASVGNGVTIHYMRYKVRQRETLFSRFVFSNHPKLINTPWYYDPSLKLSIEGGDVFIYNNDTLVVGVSERTDLQTVTL
LAKNIVANKECEFKRIVAINVPKWTNLMHLDTWLTMLDKDKFLYSPIANDVFKFWDYDLVNGGAEPQPVENGLPLEGLLQ
SIINKKPVLIPIAGEGASQMEIERETHFDGTNYLAIRPGVVIGYSRNEKTNAALEAAGIKVLPFHGNQLSLGMGNARCMS
MPLSRKDVKW
>P9WQ05 3.5.3.6~~~arcA~~~Arginine deiminase~~~COG2235
MGVELGSNSEVGALRVVILHRPGAELRRLTPRNTDQLLFDGLPWVSRAQDEHDEFAELLASRGAEVLLLSDLLTEALHHS
GAARMQGIAAAVDAPRLGLPLAQELSAYLRSLDPGRLAHVLTAGMTFNELPSDTRTDVSLVLRMHHGGDFVIEPLPNLVF
TRDSSIWIGPRVVIPSLALRARVREASLTDLIYAHHPRFTGVRRAYESRTAPVEGGDVLLLAPGVVAVGVGERTTPAGAE
ALARSLFDDDLAHTVLAVPIAQQRAQMHLDTVCTMVDTDTMVMYANVVDTLEAFTIQRTPDGVTIGDAAPFAEAAAKAMG
IDKLRVIHTGMDPVVAEREQWDDGNNTLALAPGVVVAYERNVQTNARLQDAGIEVLTIAGSELGTGRGGPRCMSCPAARD
PL
>P13981 3.5.3.6~~~arcA~~~Arginine deiminase~~~
MSTEKTKLGVHSEAGKLRKVMVCSPGLAHQRLTPSNCDELLFDDVIWVNQAKRDHFDFVTKMRERGIDVLEMHNLLTETI
QNPEALKWILDRKITADSVGLGLTSELRSWLESLEPRKLAEYLIGGVAADDLPASEGANILKMYREYLGHSSFLLPPLPN
TQFTRDTTCWIYGGVTLNPMYWPARRQETLLTTAIYKFHPEFANAEFEIWYGDPDKDHGSSTLEGGDVMPIGNGVVLIGM
GERSSRQAIGQVAQSLFAKGAAERVIVAGLPKSRAAMHLDTVFSFCDRDLVTVFPEVVKEIVPFSLRPDPSSPYGMNIRR
EEKTFLEVVAESLGLKKLRVVETGGNSFAAEREQWDDGNNVVCLEPGVVVGYDRNTYTNTLLRKAGVEVITISASELGRG
RGGGHCMTCPIVRDPIDY
>P41142 3.5.3.6~~~arcA~~~Arginine deiminase~~~COG2235
MSAEKQKYGVHSEAGKLRKVMVCSPGLAHKRLTPSNCDELLFDDVIWVDQAKRDHFDFVTKMRERGVDVLEMHNLLTDIV
QQPEALKWILDRKITSDTVGVGLTNEVRSWLEGLEPRHLAEFLIGGVAGQDLPVSEGAEVIKMYNKYLGHSSFILPPLPN
TQFTRDTTCWIYGGVTLNPMYWPARRQETLLTTAIYKFHKEFTGADFQVWYGDPDKDHGNATLEGGDVMPVGKGIVLIGM
GERTSRHAIGQLAQNLFEKGAAEKIIVAGLPKSRAAMHLDTVFSFCDRDLVTVFPEVVKEIKPFIITPDSSKPYGMNIAP
QDASFLEVVSEQLLGKKDKLRVVETGGNSFAAEREQWDDGNNVVALEPGVVIGYDRNTYTNTLLRKAGIEVITISAGELG
RGRGGGHCMTCPIVRDPIDY
>P63554 3.5.3.6~~~arcA~~~Arginine deiminase~~~
MTDGPIKVNSEIGALKTVLLKRPGKELENLVPDYLDGLLFDDIPYLEVAQKEHDHFAQVLREEGVEVLYLEKLAAESIEN
PQVRSEFIDDVLAESKKTILGHEEEIKTLFATLSNQELVDKIMSGVRKEEINPKCTHLVEYMDDKYPFYLDPMPNLYFTR
DPQASIGHGITINRMFWRARRRESIFIQYIVKHHPRFKDANIPIWLDRDCPFNIEGGDELVLSKDVLAIGVSERTSAQAI
EKLARRIFENPQATFKKVVAIEIPTSRTFMHLDTVFTMIDYDKFTMHSAILKAEGNMNIFIIEYDDVNKDIAIKQSSHLK
DTLEDVLGIDDIQFIPTGNGDVIDGAREQWNDGSNTLCIRPGVVVTYDRNYVSNDLLRQKGIKVIEISGSELVRGRGGPR
CMSQPLFREDI
>A8AYL1 3.5.3.6~~~arcA~~~Arginine deiminase~~~COG2235
MSTHPIHVFSEIGKLKKVMLHRPGKELENLMPDYLERLLFDDIPFLEDAQKEHDNFAQALRNEGIEVLYLEKLAAESLTS
PEIRDQFIEEYLDEANIRGRQTKVAIRELLQGIKDNQELVEKTMAGVQKAELPEIPEAAKGLTDLVESDYPFAIDPMPNL
YFTRDPFATIGNAVSLNHMYADTRNRETLYGKYIFKYHPVYGGNVELVYNREEDTRIEGGDELVLSKDVLAVGISQRTDA
ASIEKLLVNIFKKNVGFKKVLAFEFANNRKFMHLDTVFTMVDYDKFTIHPEIQGNLRVFSVTYENEQLKIVEEKGDLAEL
LAENLGVEKVTLIPCGDGNAVAAAREQWNDGSNTLTIAPGVVVVYDRNTVTNKKLEEYGLRLIKIRGSELVRGRGGPRCM
SMPFEREEI
>Q5XAY2 3.5.3.6~~~arcA~~~Arginine deiminase~~~
MTAQTPIHVYSEIGKLKKVLLHRPGKEIENLMPDYLERLLFDDIPFLEDAQKEHDAFAQALRDEGIEVLYLETLAAESLV
TPEIREAFIDEYLSEANIRGRATKKAIRELLMAIEDNQELIEKTMAGVQKSELPEIPASEKGLTDLVESNYPFAIDPMPN
LYFTRDPFATIGTGVSLNHMFSETRNRETLYGKYIFTHHPIYGGGKVPMVYDRNETTRIEGGDELVLSKDVLAVGISQRT
DAASIEKLLVNIFKQNLGFKKVLAFEFANNRKFMHLDTVFTMVDYDKFTIHPEIEGDLRVYSVTYDNEELHIVEEKGDLA
ELLAANLGVEKVDLIRCGGDNLVAAGREQWNDGSNTLTIAPGVVVVYNRNTITNAILESKGLKLIKIHGSELVRGRGGPR
CMSMPFEREDI
>P58827 3.5.3.6~~~arcA~~~Arginine deiminase~~~
MTAQTPIHVYSEIGKLKKVLLHRPGKEIENLMPDYLERLLFDDIPFLEDAQKEHDAFAQALRDEGIEVLYLETLAAESLV
TPEIREAFIDEYLSEANIRGRATKKAIRELLMAIEDNQELIEKTMAGVQKSELPEIPASEKGLTDLVESNYPFAIDPMPN
LYFTRDPFATIGTGVSLNHMFSETRNRETLYGKYIFTHHPIYGGGKVPMVYDRNETTRIEGGDELVLSKDVLAVGISQRT
DAASIEKLLVNIFKQNLGFKKVLAFEFANNRKFMHLDTVFTMVDYDKFTIHPEIEGDLRVYSVTYDNEELHIIEEKGDLA
ELLAANLGVEKVDLIRCGGDNLVAAGREQWNDGSNTLTIAPGVVVVYNRNTITNAILESKGLKLIKIHGSELVRGRGGPR
CMSMPFEREDI
>P0C0B3 3.5.3.6~~~arcA~~~Arginine deiminase~~~COG2235
MTAQTPIHVYSEIGKLKKVLLHRPGKEIENLMPDYLERLLFDDIPFLEDAQKEHDAFAQALRDEGIEVLYLETLAAESLV
TPEIREAFIDEYLSEANIRGRATKKAIRELLMAIEDNQELIEKTMAGVQKSELPEIPASEKGLTDLVESNYPFAIDPMPN
LYFTRDPFATIGTGVSLNHMFSETRNRETLYGKYIFTHHPIYGGGKVPMVYDRNETTRIEGGDELVLSKDVLAVGISQRT
DAASIEKLLVNIFKQNLGFKKVLAFEFANNRKFMHLDTVFTMVDYDKFTIHPEIEGDLRVYSVTYDNEELHIVEEKGDLA
ELLAANLGVEKVDLIRCGGDNLVAAGREQWNDGSNTLTIAPGVVVVYNRNTITNAILESKGLKLIKIHGSELVRGRGGPR
CMSMPFEREDI
>P0AEC3 2.7.13.3~~~arcB~~~Aerobic respiration control sensor protein ArcB~~~COG0784
MKQIRLLAQYYVDLMMKLGLVRFSMLLALALVVLAIVVQMAVTMVLHGQVESIDVIRSIFFGLLITPWAVYFLSVVVEQL
EESRQRLSRLVQKLEEMRERDLSLNVQLKDNIAQLNQEIAVREKAEAELQETFGQLKIEIKEREETQIQLEQQSSFLRSF
LDASPDLVFYRNEDKEFSGCNRAMELLTGKSEKQLVHLKPADVYSPEAAAKVIETDEKVFRHNVSLTYEQWLDYPDGRKA
CFEIRKVPYYDRVGKRHGLMGFGRDITERKRYQDALERASRDKTTFISTISHELRTPLNGIVGLSRILLDTELTAEQEKY
LKTIHVSAVTLGNIFNDIIDMDKMERRKVQLDNQPVDFTSFLADLENLSALQAQQKGLRFNLEPTLPLPHQVITDGTRLR
QILWNLISNAVKFTQQGQVTVRVRYDEGDMLHFEVEDSGIGIPQDELDKIFAMYYQVKDSHGGKPATGTGIGLAVSRRLA
KNMGGDITVTSEQGKGSTFTLTIHAPSVAEEVDDAFDEDDMPLPALNVLLVEDIELNVIVARSVLEKLGNSVDVAMTGKA
ALEMFKPGEYDLVLLDIQLPDMTGLDISRELTKRYPREDLPPLVALTANVLKDKQEYLNAGMDDVLSKPLSVPALTAMIK
KFWDTQDDEESTVTTEENSKSEALLDIPMLEQYLELVGPKLITDGLAVFEKMMPGYVSVLESNLTAQDKKGIVEEGHKIK
GAAGSVGLRHLQQLGQQIQSPDLPAWEDNVGEWIEEMKEEWRHDVEVLKAWVAKATKK
>P0A2X7 2.7.2.2~~~arcC1~~~Carbamate kinase 1~~~COG0549
MGKKMVVALGGNAILSNDASAHAQQQALVQTSAYLVHLIKQGHRLIVSHGNGPQVGNLLLQQQAADSEKNPAMPLDTCVA
MTQGSIGYWLSNALNQELNKAGIKKQVATVLTQVVVDPADEAFKNPTKPIGPFLTEAEAKEAMQAGAIFKEDAGRGWRKV
VPSPKPIDIHEAETINTLIKNDIITISCGGGGIPVVGQELKGVEAVIDKDFASEKLAELVDADALVILTGVDYVCINYGK
PDEKQLTNVTVAELEEYKQAGHFAPGSMLPKIEAAIQFVESQPNKQAIITSLENLGSMSGDEIVGTVVTK
>P0A2X8 2.7.2.2~~~arcC1~~~Carbamate kinase 1~~~
MGKKMVVALGGNAILSNDASAHAQQQALVQTSAYLVHLIKQGHRLIVSHGNGPQVGNLLLQQQAADSEKNPAMPLDTCVA
MTQGSIGYWLSNALNQELNKAGIKKQVATVLTQVVVDPADEAFKNPTKPIGPFLTEAEAKEAMQAGAIFKEDAGRGWRKV
VPSPKPIDIHEAETINTLIKNDIITISCGGGGIPVVGQELKGVEAVIDKDFASEKLAELVDADALVILTGVDYVCINYGK
PDEKQLTNVTVAELEEYKQAGHFAPGSMLPKIEAAIQFVESQPNKQAIITSLENLGSMSGDEIVGTVVTK
>Q7A627 2.7.2.2~~~arcC1~~~Carbamate kinase 1~~~
MAKIVVALGGNALGKSPQEQLELVKNTAKSLVGLITKGHEIVISHGNGPQVGSINLGLNYAAEHNQGPAFPFAECGAMSQ
AYIGYQLQESLQNELHSIGMDKQVVTLVTQVEVDENDPAFNNPSKPIGLFYNKEEAEQIQKEKGFIFVEDAGRGYRRVVP
SPQPISIIELESIKTLIKNDTLVIAAGGGGIPVIREQHDGFKGIDAVIDKDKTSALLGANIQCDQLIILTAIDYVYINFN
TENQQPLKTTNVDELKRYIDENQFAKGSMLPKIEAAISFIENNPKGSVLITSLNELDAALEGKVGTVIKK
>P99069 2.7.2.2~~~arcC2~~~Carbamate kinase 2~~~
MKEKIVIALGGNAIQTKEATAEAQQTAIRRAMQNLKPLFDSPARIVISHGNGPQIGSLLIQQAKSNSDTTPAMPLDTCGA
MSQGMIGYWLETEINRILTEMNSDRTVGTIVTRVEVDKDDPRFNNPTKPIGPFYTKEEVEELQKEQPDSVFKEDAGRGYR
KVVASPLPQSILEHQLIRTLADGKNIVIACGGGGIPVIKKENTYEGVEAVIDKDFASEKLATLIEADTLMILTNVENVFI
NFNEPNQQQIDDIDVATLKKYAAQGKFAEGSMLPKIEAAIRFVESGENKKVIITNLEQAYEALIGNKGTHIHM
>P13982 2.7.2.2~~~arcC~~~Carbamate kinase~~~
MRIVVALGGNALLRRGEPMTADNQRENVRIAAEQIAKVAPGNELVIAHGNGPQVGLLALQGAAYDKVSPYPLDVLGAETE
GMIGYMIEQEMGNLLPFEVPFATILTQVEVDGKDPAFQNPTKPIGPVYSREEAERLAAEKGWSIAPDGDKFRRVVPSPRP
KRIFEIRPVKWLLEKGTIVICAGGGGIPTMYDEAGKKLSGVEAVIDKDLCSSLLAQELVADILIIATDVDAAYVDWGKPT
QKAIAQAHPDELERLGFAAGSMGPKVQAAIEFARATGKDAVIGSLADIVAITEGKAGTRVSTRKAGIEYR
>A2RNI5 ~~~arcD1~~~Arginine/ornithine antiporter ArcD1~~~COG0531
MDAENKKGIGLAALVAIIVSGAIGGGVFNLSNDLATNASPGGVVISWIVIGFGILMLVLSLNHLVVNKPELSGVSDYARA
GFGNMVGFISGWGYWLSAWAGNIAFAVLMMTSVDYFFPGVFQAKNGSLTILSVIVVSIVSWGLTLLVMRGVEGAAAINAI
VLVAKLIPLFVFVIAGIVTFKAGVFSAHFWQNFVANTNADGVIKSLTWSNMTGGDLFSQVKGSLMVMIWVFVGIEGAAMM
GDRAKRKSDAGKASIFGLIALLVIYILLSLLPFGFMSQQELANTGQPGLVHILNAMVGGWGGSLMAIGLVISLLGAWLSW
TMLPVEATQQLSEQKLLPSWFGKLNDKGAPKNSLLLTQLIVQIFLIVTYFVADAYNVFVYLCTAVIMICYALVGLYLFKL
GIQEKKTSNIIIGFIAAAFQILALYYSGWQFVWLSLILYAVGFILYALGKKEYGTKMSTTEVIATFILTVLGILAVFGVY
GNWLGLQDALGIDGNTLLVAVVPLIVVTFIVYFVVRSDINKKGIKN
>A2RNI1 ~~~arcD2~~~Arginine/ornithine antiporter ArcD2~~~COG0531
MENKKTKGISLFALLAIIISGAIGGGVFNLANDLARGSTPGGVVISWLFIGFGILMLVLSFNRLITIKPDLSGVSDYARA
GFGDFVGFLSGWGYWISAWTGTIGFAVLMMTSADYFFPSKFANSNGSLTILSVIIVSIISWILMLLVDRGVETAAAVNAI
VMIAKLIPLVVFSITGIILFKANVFTQHFWQTFTTNFAADGSVKDFVWHAMTVSGLLSQIKGSLMVMVWVFVGIEGATMM
GNRAKKKSDTAKATVIGLAVLLVIYVLLSLLPYGYMDQASLANVKAPGLVYILNEMVGGWGGSLMAVGLMISLLGAWLSW
TMLPVEATQQLAEQKLLPSWFGKLNKYHAPSNSLLITQLMIQIFIIITYFVANAYNVFIYMATAVIMICYALVGAYLFKI
GLKEASVKNILIGFFTFAFQALALYLSGWQYVWLAMILYTIGFLLFIGAKKESHQSISVKEWLGMLVVTVLGVLAIVVLI
CGAKAGTAFDLRGLLGF
>P0AAE5 ~~~ydgI~~~Putative arginine/ornithine antiporter~~~COG0531
MEKKLGLSALTALVLSSMLGAGVFSLPQNMAAVASPAALLIGWGITGAGILLLAFAMLILTRIRPELDGGIFTYAREGFG
ELIGFCSAWGYWLCAVIANVSYLVIVFSALSFFTDTPELRLFGDGNTWQSIVGASALLWIVHFLILRGVQTAASINLVAT
LAKLLPLGLFVVLAMMMFKLDTFKLDFTGLALGVPVWEQVKNTMLITLWVFIGVEGAVVVSARARNKRDVGKATLLAVLS
ALGVYLLVTLLSLGVVARPELAEIRNPSMAGLMVEMMGPWGEIIIAAGLIVSVCGAYLSWTIMAAEVPFLAATHKAFPRI
FARQNAQAAPSASLWLTNICVQICLVLIWLTGSDYNTLLTIASEMILVPYFLVGAFLLKIATRPLHKAVGVGACIYGLWL
LYASGPMHLLLSVVLYAPGLLVFLYARKTHTHDNVLNRQEMVLIGMLLIASVPATWMLVG
>P18275 ~~~arcD~~~Arginine/ornithine antiporter~~~
MSQESSQKLRLGALTALVVGSMIGGGIFSLPQNMAASADVGAVLIGWAITAVGMLTLAFVFQTLANRKPELDGGVYAYAK
AGFGDYMGFSSAWGYWISAWLGNVGYFVLLFSTLGYFFPIFGKGDTVAAIVCASVLLWALHFLVLRGIKEAAFINTVTTV
AKVVPLFLFILICLFAFKLDIFTADIWGKSNPDLGSVMNQVRNMMLVTVWVFIGIEGASIFSSRAEKRSDVGKATVIGFI
TVLLLLVLVNVLSMGVMTQPELAKLQNPSMALVLEHVVGHWGAVLISVGLLISLLGALLSWVLLCAEIMFAAAKDHTMPE
FLRRENANQVPANALWLTNICVQVFLVVVFFTSGDPDGMDPYTKMLLLATSMILIPYFWSAAYGLLLTLKGETYENDARE
RSKDLVIAGIAVAYAVWLLYAGGLKYLLLSALLYAPGAILFAKAKHEVGQPIFTGIEKLIFAAVVIGALVAAYGLYDGFL
TL
>P9WQ02 ~~~~~~Probable protein archease~~~
MLHRDDHINPPRPRGLDVPCARLRATNPLRALARCVQAGKPGTSSGHRSVPHTADLRIEAWAPTRDGCIRQAVLGTVESF
LDLESAHAVHTRLRRLTADRDDDLLVAVLEEVIYLLDTVGETPVDLRLRDVDGGVDVTFATTDASTLVQVGAVPKAVSLN
ELRFSQGRHGWRCAVTLDV
>P9WQ03 ~~~~~~Probable protein archease~~~COG1371
MLHRDDHINPPRPRGLDVPCARLRATNPLRALARCVQAGKPGTSSGHRSVPHTADLRIEAWAPTRDGCIRQAVLGTVESF
LDLESAHAVHTRLRRLTADRDDDLLVAVLEEVIYLLDTVGETPVDLRLRDVDGGVDVTFATTDASTLVQVGAVPKAVSLN
ELRFSQGRHGWRCAVTLDV
>Q9X0H1 ~~~~~~Protein archease~~~COG1371
MRKPIEHTADIAYEISGNSYEELLEEARNILLEEEGIVLDTEEKEKMYPLEETEDAFFDTVNDWILEISKGWAPWRIKRE
GNELKVTFRKIRKKEGTEIKALTYHLLKFERDGDVLKTKVVFDT
>Q2FUY1 ~~~arcR~~~HTH-type transcriptional regulator ArcR~~~COG0664
MTENFILGRNNKLEHELKALADYINIPYSILQPYQSECFVRHYTKGQVIYFSPQESSNIYFLIEGNIIREHYNQNGDVYR
YFNKEQVLFPISNLFHPKEVNELCTALTDCTVLGLPRELMAFLCKANDDIFLTLFALINDNEQQHMNYNMALTSKFAKDR
IIKLICHLCQTVGYDQDEFYEIKQFLTIQLMSDMAGISRETAGHIIHELKDEKLVVKDHKNWLVSKHLFNDVCV
>Q7A381 ~~~arcR~~~HTH-type transcriptional regulator ArcR~~~
MTENFILGRNNKLEHELKALADYINIPYSILQPYQSECFVRHYTKGQVIYFSPQESSNIYFLIEGNIIREHYNQNGDVYR
YFNKEQVLFPISNLFHPKEVNELCTALTDCTVLGLPRELMAFLCKANDDIFLTLFALINDNEQQHMNYNMALTSKFAKDR
IIKLLCHLCQTVGYDQDEFYEIKQFLTIQLMSDMAGISRETAGHIIHELKDEKLVVKDHKNWLVSKHLFNDVCV
>A0QZ54 ~~~mpa~~~Proteasome-associated ATPase~~~COG1222
MSESERSEGFPEGFAGAGSGSLSSEDAAELEALRREAAMLREQLENAVGPQSGLRSARDVHQLEARIDSLAARNAKLMDT
LKEARQQLLALREEVDRLGQPPSGYGVLLATHDDDTVDVFTSGRKMRLTCSPNIEVKELKQGQTVRLNEALTVVEAGNFE
AVGEISTLREILADGHRALVVGHADEERIVWLAEPLVAAKDLPDEPTDYFDDSRPRKLRPGDSLLVDTKAGYAFERIPKA
EVEDLVLEEVPDVSYNDIGGLGRQIEQIRDAVELPFLHKDLYKEYSLRPPKGVLLYGPPGCGKTLIAKAVANSLAKKMAE
VRGDDAREAKSYFLNIKGPELLNKFVGETERHIRLIFQRAREKASEGTPVIVFFDEMDSIFRTRGTGVSSDVETTVVPQL
LSEIDGVEGLENVIVIGASNREDMIDPAILRPGRLDVKIKIERPDAEAAQDIFSKYLTEDLPVHADDLTEFNGDRALCIK
AMIEKVVDRMYAEIDDNRFLEVTYANGDKEVMYFKDFNSGAMIQNVVDRAKKYAIKSVLETGQKGLRIQHLLDSIVDEFA
ENEDLPNTTNPDDWARISGKKGERIVYIRTLVTGKSSSASRAIDTESNLGQYL
>P9WQN4 ~~~mpa~~~Proteasome-associated ATPase~~~
MGESERSEAFGIPRDSPLSSGDAAELEQLRREAAVLREQLENAVGSHAPTRSARDIHQLEARIDSLAARNSKLMETLKEA
RQQLLALREEVDRLGQPPSGYGVLLATHDDDTVDVFTSGRKMRLTCSPNIDAASLKKGQTVRLNEALTVVEAGTFEAVGE
ISTLREILADGHRALVVGHADEERVVWLADPLIAEDLPDGLPEALNDDTRPRKLRPGDSLLVDTKAGYAFERIPKAEVED
LVLEEVPDVSYADIGGLSRQIEQIRDAVELPFLHKELYREYSLRPPKGVLLYGPPGCGKTLIAKAVANSLAKKMAEVRGD
DAHEAKSYFLNIKGPELLNKFVGETERHIRLIFQRAREKASEGTPVIVFFDEMDSIFRTRGTGVSSDVETTVVPQLLSEI
DGVEGLENVIVIGASNREDMIDPAILRPGRLDVKIKIERPDAEAAQDIYSKYLTEFLPVHADDLAEFDGDRSACIKAMIE
KVVDRMYAEIDDNRFLEVTYANGDKEVMYFKDFNSGAMIQNVVDRAKKNAIKSVLETGQPGLRIQHLLDSIVDEFAENED
LPNTTNPDDWARISGKKGERIVYIRTLVTGKSSSASRAIDTESNLGQYL
>P9WQN5 ~~~mpa~~~Proteasome-associated ATPase~~~COG1222
MGESERSEAFGIPRDSPLSSGDAAELEQLRREAAVLREQLENAVGSHAPTRSARDIHQLEARIDSLAARNSKLMETLKEA
RQQLLALREEVDRLGQPPSGYGVLLATHDDDTVDVFTSGRKMRLTCSPNIDAASLKKGQTVRLNEALTVVEAGTFEAVGE
ISTLREILADGHRALVVGHADEERVVWLADPLIAEDLPDGLPEALNDDTRPRKLRPGDSLLVDTKAGYAFERIPKAEVED
LVLEEVPDVSYADIGGLSRQIEQIRDAVELPFLHKELYREYSLRPPKGVLLYGPPGCGKTLIAKAVANSLAKKMAEVRGD
DAHEAKSYFLNIKGPELLNKFVGETERHIRLIFQRAREKASEGTPVIVFFDEMDSIFRTRGTGVSSDVETTVVPQLLSEI
DGVEGLENVIVIGASNREDMIDPAILRPGRLDVKIKIERPDAEAAQDIYSKYLTEFLPVHADDLAEFDGDRSACIKAMIE
KVVDRMYAEIDDNRFLEVTYANGDKEVMYFKDFNSGAMIQNVVDRAKKNAIKSVLETGQPGLRIQHLLDSIVDEFAENED
LPNTTNPDDWARISGKKGERIVYIRTLVTGKSSSASRAIDTESNLGQYL
>O50202 ~~~arc~~~Proteasome-associated ATPase~~~
MSSTENPDSVAAAEELHALRVEAQVLRRQLAQSPEQVRELESKVDSLSIRNSKLMDTLKEARQQLIALREEVDRLGQPPS
GYGVLLSVHEDKTVDVFTSGRKMRLTCSPNIDTDTLALGQTVRLNEALTIVEAGTYEQVGEISTLREVLDDGLRALVVGH
ADEERIVWLAAPLAAVFADPEADIIAYDADSPTRKLRPGDSLLVDTKAGYAFERIPKAEVEDLVLEEVPDVHYDDIGGLG
RQIEQIRDAVELPFLHKDLFHEYSLRPPKGVLLYGPPGCGKTLIAKAVANSLAKKIAEARGQDSKDAKSYFLNIKGPELL
NKFVGETERHIRMIFQRAREKASEGTPVIVFFDEMDSIFRTRGSGVSSDVETTVVPQLLSEIDGVEGLENVIVIGASNRE
DMIDPAILRPGRLDVKIKIERPDAESAQDIFSKYLVDGLPINADDLAEFGGDRTACLKAMIVRVVDRMYAESEENRFLEV
TYANGDKEVLFFKDFNSGAMIQNIVDRAKKYAIKSVLDTGAPGLRVQHLFDSIVDEFAENEDLPNTTNPDDWARISGKKG
ERIVYIRTLVTGKNASASRAIDTESNTGQYL
>P0DW92 1.3.1.-~~~ard~~~NADH:acrylate oxidoreductase~~~
MAQLVDEIVFQSGVKLHNRIVMAPMTIQSAFFDGGVTQEMINYYAARSGGAGAIIVESAFVENYGRAFPGALGIDTDSKI
AGLTKLADAIKAKGSKAILQIYHAGRMANPEFNGGHQPISASPVAALRDNAETPLEMTKEQIEEMIERFGDAVNRAILAG
FDGVEIHGANTYLIQQFFSPHSNRRNDKWGGNIERRTSFPLAVLAKTKQVAEQHNKSDFIIGYRFSPEEIEQPGIRFDDT
MFLLDKLATHGLDYFHFSMGSWLRNSIVTPEDQEPLIDKYRKLQSESVAKVPVIGVGGIAQRKDAENALEQGYDMVSVGK
GYLVEPTWANKALNDETCAEFADIAQQEALQIPTPLWEIMDYMIVDSAAEALKHQRIKELQNVPIKFNSGEYTAYGRGHN
GDLPVTVTFSEDKILDIVVDSSKESDGIANPAFERIPQQILDGQTLNIDVISGATVSSQAVLDGVSNAVDLAGGNSEALR
CKAKEAVAWSSKTIEETVDIVVVGGGGAGLSATLTALDKGKSVVLLEKFPAIGGNTVRTGGWVNAAEPKWQGDFPALPGE
KETLMLLAKTAESEFSGEYLEDFKVLKAQLDGYFTDLENGKQYLFDSVELHRIQTYLGGKRTDLNGESIYGQYDLVETLT
SRSMESIDWLSEKGIDFDRSVVEIPVGALWRRAHKPKRPKGVEFIDKLSKRIQEQNGRIITDTRATDLMVDNGKVVGIKA
VQADGTELILHVNHGVVLASGGFGANTQMIKKYNTYWKEIADDIKTTNSPALVGDGIEIGEKAGAELVGMGFVQLMPVGD
PKSGALLTGLIVPPENFVFVNKQGKRFVDECGSRDVLSEAFFDNGGLIYMIADENIRQTAANTSDETIEREIKEGIIIQA
DTLEELAEKIGVPTQELTNTIAQYNACVDAGQDPEFHKSAFGLKVEKAPFYATPRQPSVHHTMGGLKIDTKARVIGKDGE
VIQGLYAAGEVTGGIHAGNRLGGNALIDIFTYGRIAGESASDLV
>A9B055 5.1.1.-~~~~~~Aromatic dipeptide epimerase~~~COG4948
MPTTIQAISAEAINLPLTEPFAIASGAQAVAANVLVKVQLADGTLGLGEAAPFPAVSGETQTGTSAAIERLQSHLLGADV
RGWRKLAAMLDHAEHEAAAARCGLEMAMLDALTRHYHMPLHVFFGGVSKQLETDMTITAGDEVHAAASAKAILARGIKSI
KVKTAGVDVAYDLARLRAIHQAAPTAPLIVDGNCGYDVERALAFCAACKAESIPMVLFEQPLPREDWAGMAQVTAQSGFA
VAADESARSAHDVLRIAREGTASVINIKLMKAGVAEGLKMIAIAQAAGLGLMIGGMVESILAMSFSANLAAGNGGFDFID
LDTPLFIAEHPFIGGFAQTGGTLQLADVAGHGVNLA
>P36675 ~~~arfA~~~Alternative ribosome-rescue factor A~~~COG3036
MSRYQHTKGQIKDNAIEALLHDPLFRQRVEKNKKGKGSYMRKGKHGNRGNWEASGKKVNHFFTTGLLLSGAC
>A1KH31 ~~~arfA~~~Peptidoglycan-binding protein ArfA~~~
MASKAGLGQTPATTDARRTQKFYRGSPGRPWLIGAVVIPLLIAAIGYGAFERPQSVTGPTGVLPTLTPTSTRGASALSLS
LLSISRSGNTVTLIGDFPDEAAKAALMTALNGLLAPGVNVIDQIHVDPVVRSLDFSSAEPVFTASVPIPDFGLKVERDTV
TLTGTAPSSEHKDAVKRAATSTWPDMKIVNNIEVTGQAPPGPPASGPCADLQSAINAVTGGPIAFGNDGASLIPADYEIL
NRVADKLKACPDARVTINGYTDNTGSEGINIPLSAQRAKIVADYLVARGVAGDHIATVGLGSVNPIASNATPEGRAKNRR
VEIVVN
>P9WIU5 ~~~arfA~~~Peptidoglycan-binding protein ArfA~~~COG2885
MASKAGLGQTPATTDARRTQKFYRGSPGRPWLIGAVVIPLLIAAIGYGAFERPQSVTGPTGVLPTLTPTSTRGASALSLS
LLSISRSGNTVTLIGDFPDEAAKAALMTALNGLLAPGVNVIDQIHVDPVVRSLDFSSAEPVFTASVPIPDFGLKVERDTV
TLTGTAPSSEHKDAVKRAATSTWPDMKIVNNIEVTGQAPPGPPASGPCADLQSAINAVTGGPIAFGNDGASLIPADYEIL
NRVADKLKACPDARVTINGYTDNTGSEGINIPLSAQRAKIVADYLVARGVAGDHIATVGLGSVNPIASNATPEGRAKNRR
VEIVVN
>P40711 3.1.1.29~~~arfB~~~Peptidyl-tRNA hydrolase ArfB~~~COG1186
MIVISRHVAIPDGELEITAIRAQGAGGQHVNKTSTAIHLRFDIRASSLPEYYKERLLAASHHLISSDGVIVIKAQEYRSQ
ELNREAALARLVAMIKELTTEKKARRPTRPTRASKERRLASKAQKSSVKAMRGKVRSGRE
>A1KH32 ~~~arfB~~~Uncharacterized membrane protein ArfB~~~
MDFVIQWSCYLLAFLGGSAVAWVVVTLSIKRASRDEGAAEAPSAAETGAQ
>P9WJG7 ~~~arfB~~~Uncharacterized membrane protein ArfB~~~
MDFVIQWSCYLLAFLGGSAVAWVVVTLSIKRASRDEGAAEAPSAAETGAQ
>A1KH33 ~~~arfC~~~Uncharacterized membrane protein ArfC~~~
MEHVHWWLAGLAFTLGMVLTSTLMVRPVEHQVLVKKSVRGSSAKSKPPTARKPAVKSGTKREESPTAKTKVATESAAEQI
PVAGEPAAEPIPVAGEPAARIPVVPYAPYGPGSARAGADGSGPQGWLVKGRSDTRLYYTPEDPTYDPTVAQVWFQDEESA
ARAFFTPWRKSTRRT
>P9WJG5 ~~~arfC~~~Uncharacterized membrane protein ArfC~~~COG0088
MEHVHWWLAGLAFTLGMVLTSTLMVRPVEHQVLVKKSVRGSSAKSKPPTARKPAVKSGTKREESPTAKTKVATESAAEQI
PVAGEPAAEPIPVAGEPAARIPVVPYAPYGPGSARAGADGSGPQGWLVKGRSDTRLYYTPEDPTYDPTVAQVWFQDEESA
ARAFFTPWRKSTRRT
>P46910 ~~~arfM~~~Probable transcription regulator ArfM~~~COG0664
MNQCDYLLFLKQLPMFNEVPLSIVETLLKNGTFIRGSCDQSPSFLHSQSVYIVLKGSVRFMDTRLPEGSKTVALWEKGDV
FPIDEKGGLYLSPFISVNATSDILILNIPYYIFKKMMSYHPQLQMNFLAMLQQNVFCSYQLFLRYLHTSQDENAEPGS
>O66143 2.3.1.1~~~argA~~~Amino-acid acetyltransferase~~~COG0548
MKERNTELVQGFRHSVPYINAHRGKTFVIMLGGEAIKYGNFYSIINDIGLLHSLGIRLVVVYGACPQINTSLKEKNIKII
YHKSIRITDLASLEQVKQAAGKLQLDITARLSMSLTNTPLQGANISVVSGNFIISQPLGVDDGVDYCHSGRVRRIDKNAI
NCQLNNGAIVLIGPVAVSVTGESFNLTSEEIATQVSIELKAEKMIGFCGNQGVINDEGKIISELLSNDIKNIIKKLEKKG
DYISSTVRFLKGSIKACKSGVNRSHLISYHKSGALLQELFSRDGIGTQMVMESAEKIRGASINDIGGILELIRPLEHKGI
LVRRSREQLEIEVDKFTIIEHDNLTIACAALYPFFKEKIGEMACLAVHPDYRNSSRGDALLKKIKMNAKDMHLKRIFVLT
TQSIHWFQERGFILVDIEVLPESKKKMYNYQRGSKILMIDVI
>P0A6C5 2.3.1.1~~~argA~~~Amino-acid acetyltransferase~~~COG0548
MVKERKTELVEGFRHSVPYINTHRGKTFVIMLGGEAIEHENFSSIVNDIGLLHSLGIRLVVVYGARPQIDANLAAHHHEP
LYHKNIRVTDAKTLELVKQAAGTLQLDITARLSMSLNNTPLQGAHINVVSGNFIIAQPLGVDDGVDYCHSGRIRRIDEDA
IHRQLDSGAIVLMGPVAVSVTGESFNLTSEEIATQLAIKLKAEKMIGFCSSQGVTNDDGDIVSELFPNEAQARVEAQEEK
GDYNSGTVRFLRGAVKACRSGVRRCHLISYQEDGALLQELFSRDGIGTQIVMESAEQIRRATINDIGGILELIRPLEQQG
ILVRRSREQLEMEIDKFTIIQRDNTTIACAALYPFPEEKIGEMACVAVHPDYRSSSRGEVLLERIAAQAKQSGLSKLFVL
TTRSIHWFQERGFTPVDIDLLPESKKQLYNYQRKSKVLMADLG
>O33289 2.3.1.1~~~argA~~~Amino-acid acetyltransferase~~~COG1246
MTERPRDCRPVVRRARTSDVPAIKQLVDTYAGKILLEKNLVTLYEAVQEFWVAEHPDLYGKVVGCGALHVLWSDLGEIRT
VAVDPAMTGHGIGHAIVDRLLQVARDLQLQRVFVLTFETEFFARHGFTEIEGTPVTAEVFDEMCRSYDIGVAEFLDLSYV
KPNILGNSRMLLVL
>P22567 2.3.1.1~~~argA~~~Amino-acid acetyltransferase~~~
MPDYVNWLRHASPYINSHRDRTFVVMLPGEGVEHPNFGNIVHDLVLLHSLGARLVLVHGSRPQIEARLAARGLAPRYHRD
LRVTDAPTLECVIDAVGSLRIAIEARLSMDMAASPMQGARLRVAGGNLVTARPIGVVEGVDYHHTGEVRRIDRKGIGRLL
DERSIVLLSPLGYSPTGEIFNLACEDVAMRAAIDLEAEKLILYGAEQGLLDASGKLVRELRPQQVPAHLQRLGNSYQAEL
LDAAAQACRAGVKRSHIVSYTEDGALLSELFTRTGNGTLVAQEQFEQLREAGIEDVGGLIELIRPLEEQGILVRRSREVL
EREIEQFSIVEREGLIIACAALYPIADSEAGELACLAVNPEYRHGGRGDELLERIEERARGLGLKTLFVLTTRTAHWFRE
RGFQPSSVERLPAARASLYNFQRNSQVFEKSL
>Q87M87 2.3.1.1~~~argA~~~Amino-acid acetyltransferase~~~COG0548
MKIRSTALVKGFRQSTPYVNAHRGKTMVIMLGGEAVAHNNFGNIINDIALMHSLGIKVVVVYGARPQINQLLEKQDLTTP
YHKNIRITDEAALSVVMQAAGQLQLAITARLSMSLNNTPMAGTQLNVVSGNFVIAQPLGVDDGVDYCHSGRIRRIDTDAI
NRTLDQGSIVLLGPIASSVTGECFNLLSEEVATQLAIKLGADKLIGFCSEQGVIDDNGNAVAELLPIEAEHVIKTLSENH
ASDSDYNTGTLRFLKGSIAACRAGVPRSHLISYKVDGALIQELFSFDGIGTQVVMASAEQVRQAGIDDIGGILELIHPLE
EQGILVRRSREQLEQEIGKFTIIEKDGLIIGCAALYPYSEERKAEMACVAIHPDYRDGNRGLLLLNYMKHRSKSENINQI
FVLTTHSLHWFREQGFYEVGVDYLPGAKQGLYNFQRKSKILALDL
>G3XD47 ~~~aotJ~~~L-arginine-binding protein~~~
MKKLALLGALALSVLSLPTFAADKPVRIGIEAAYPPFSLKTPDGQLAGFDVDIGNALCEEMKVQCKWVEQEFDGLIPALK
VRKIDAILSSMTITDERKRSVDFTNKYYNTPARFVMKEGASLNDPKADLKGKKAGVLRGSTADRYASAELTPAGVEVVRY
NSQQEANMDLVAGRLDAVVADSVNLEDGFLKTDAGKGYAFVGPQLTDAKYFGEGVGIAVRKGDSELAGKFNAAIDALRAN
GKYKQIQDKYFSFDVYGSN
>Q72C18 2.7.2.8~~~argB~~~Acetylglutamate kinase~~~COG0548
MDCVENARLQSKVLIESLPYLRQFHGETVVIKYGGHAMKDEALKKAFALNVALLKLVGINPVIVHGGGPQIGKMLEQLNI
QSHFREGLRVTDDATMDVVEMVLVGKVNKEIVNQMNLAGAKAVGLSGKDGMLIRARKMEMVISKEAQAPEIIDLGKVGEV
MGVNTTLLRSLERDGFVPVIAPVGVDDNGETYNINADAVAGAVAAALKAKRLLLLTDVAGILDHDKKLIRSVNMREAVNL
FSDGTLTGGMIPKVKCCLEALEEGVEKAMIIDGRTENCILLELLTDKGVGTEIVSDRAAQAACNCVLR
>P0A6C8 2.7.2.8~~~argB~~~Acetylglutamate kinase~~~COG0548
MMNPLIIKLGGVLLDSEEALERLFSALVNYRESHQRPLVIVHGGGCVVDELMKGLNLPVKKKNGLRVTPADQIDIITGAL
AGTANKTLLAWAKKHQIAAVGLFLGDGDSVKVTQLDEELGHVGLAQPGSPKLINSLLENGYLPVVSSIGVTDEGQLMNVN
ADQAATALAATLGADLILLSDVSGILDGKGQRIAEMTAAKAEQLIEQGIITDGMIVKVNAALDAARTLGRPVDIASWRHA
EQLPALFNGMPMGTRILA
>A0QYT0 2.7.2.8~~~argB~~~Acetylglutamate kinase~~~COG0548
MSTPSIDRGFKADVLASALPWLKQLHGKIVVVKYGGNAMTDDVLKAAFAADMVFLRNCGIHPVVVHGGGPQISAMLKRLG
IEGDFKGGFRVTTPEVLDVARMVLFGQVGRELVNLINAHGPYAVGVTGEDAQLFTAVRRNVTVDGVATDIGLVGDVEHVN
AGSLLDLIAAGRIPVVSTIAPDADGVVHNINADTAAAALAEALGAEKLVMLTDVEGLYTDWPDRTSLVSEIDTGALTQLL
PKLESGMVPKIEACLRAVNGGVPSAHVIDGRVEHCVLVELFTDEGTGTKVVAQ
>P9WQ01 2.7.2.8~~~argB~~~Acetylglutamate kinase~~~COG0548
MSRIEALPTHIKAQVLAEALPWLKQLHGKVVVVKYGGNAMTDDTLRRAFAADMAFLRNCGIHPVVVHGGGPQITAMLRRL
GIEGDFKGGFRVTTPEVLDVARMVLFGQVGRELVNLINAHGPYAVGITGEDAQLFTAVRRSVTVDGVATDIGLVGDVDQV
NTAAMLDLVAAGRIPVVSTLAPDADGVVHNINADTAAAAVAEALGAEKLLMLTDIDGLYTRWPDRDSLVSEIDTGTLAQL
LPTLESGMVPKVEACLRAVIGGVPSAHIIDGRVTHCVLVELFTDAGTGTKVVRG
>Q9HTN2 2.7.2.8~~~argB~~~Acetylglutamate kinase~~~
MTLSRDDAAQVAKVLSEALPYIRRFVGKTLVIKYGGNAMESEELKAGFARDVVLMKAVGINPVVVHGGGPQIGDLLKRLS
IESHFIDGMRVTDAATMDVVEMVLGGQVNKDIVNLINRHGGSAIGLTGKDAELIRAKKLTVTRQTPEMTKPEIIDIGHVG
EVTGVNVGLLNMLVKGDFIPVIAPIGVGSNGESYNINADLVAGKVAEALKAEKLMLLTNIAGLMDKQGQVLTGLSTEQVN
ELIADGTIYGGMLPKIRCALEAVQGGVTSAHIIDGRVPNAVLLEIFTDSGVGTLISNRKRH
>Q8DV44 2.7.2.8~~~argB~~~Acetylglutamate kinase~~~COG0548
MKDIIVIKIGGVASQQLSGDFLSQIKNWQDAGKQLVIVHGGGFAINKLMEENQVPVKKINGLRVTSKDDMVLVSHALLDL
VGKNLQEKLRQAGVSCQQLKSDIKHVVAADYLDKDTYGYVGDVTHINKRVIEEFLENRQIPILASLGYSKEGDMLNINAD
YLATAVAVALAADKLILMTNVKGVLENGAVLEKITSHQVQEKIDTAVITAGMIPKIESAAKTVAAGVGQVLIGDNLLTGT
LITAD
>Q6V1L5 2.7.2.8~~~argB~~~Acetylglutamate kinase~~~COG0548
MSSEFIEAGAADRVRILSEALPYLQQFAGRTVVVKYGGAAMKQEELKEAVMRDIVFLACVGMRPVVVHGGGPEINAWLGR
VGIEPQFHNGLRVTDADTMEVVEMVLVGRVNKDIVSRINTTGGRAVGFCGTDGRLVLARPHDQEGIGFVGEVNSVNSEVI
EPLLERGYIPVISSVAADENGQSFNINADTVAGEIAAALNAEKLILLTDTRGILEDPKRPESLIPRLNIPQSRELIAQGI
VGGGMIPKVDCCIRSLAQGVRAAHIIDGRIPHALLLEIFTDAGIGTMIVGSGYHEAHQPWQ
>Q9X2A4 2.7.2.8~~~argB~~~Acetylglutamate kinase~~~COG0548
MRIDTVNVLLEALPYIKEFYGKTFVIKFGGSAMKQENAKKAFIQDIILLKYTGIKPIIVHGGGPAISQMMKDLGIEPVFK
NGHRVTDEKTMEIVEMVLVGKINKEIVMNLNLHGGRAVGICGKDSKLIVAEKETKHGDIGYVGKVKKVNPEILHALIEND
YIPVIAPVGIGEDGHSYNINADTAAAEIAKSLMAEKLILLTDVDGVLKDGKLISTLTPDEAEELIRDGTVTGGMIPKVEC
AVSAVRGGVGAVHIINGGLEHAILLEIFSRKGIGTMIKELEG
>Q72HA9 2.7.2.8~~~argB~~~Acetylglutamate kinase~~~COG0548
MSEALLVKVGGSLRGAEALLDELAAYPGPLVLVHGGGPEIGAWLGRLGYESRFVGGLRVTPPEQLEVVEMALYLTGKRLA
EGLSRRGRKALALSGRDALCLKGRALPELGRVGEVVEVEVGLLQDLLAKGYTPLLAPIALDAEGPLNVNADTAAGAVAGA
LGWPAVFLTDVEGVYRDPKDPRTRFPRLTPKEVEALKGEGVIQGGMIPKVEAALSALRAGAPWAAIAKGERGVLEAVLRG
EAGTRFTL
>Q87EL2 2.7.2.8~~~argB~~~Acetylglutamate kinase~~~
MASAKEISQYLKRFSQLDAKRFAVVKVGGAVLRDDVDALTSSLSFLQEVGLTPIVLHGAGPQLDEELTAVGIQKKTVNGF
RVTLPETMAIVRKVFHATNLQLIEALQRNGARATSITGGVFEAHYLDQETYGLVGGISAVNIAPIEASLRAASIPVIASL
GETPSGQILNINADVAANELVHVLQPYKIIFLTGTGGLLDADGKIINSINLSTEYEQLIQQPWVYGGMKLKIEQIKHLLD
RLPLESSVSITRPADLAKELFTHKGSGTLIRRGERVIRATTWKDLDLPRLQHLIQSSFRRTLIPHYFETTPLLRAYVSEN
YRAAVILTKLGNVPYLDKFAVLDDAQGEGLGRAVWSIMREETPQLFWRSRHNNQANAFYYAESDGYYKQDHWKIFWNGLH
HFQQIQQCVAHCTQHPPTLID
>Q8ZA87 2.7.2.8~~~argB~~~Acetylglutamate kinase~~~COG0548
MMNPLVIKLGGVLLDSEEALERLFTALVTYREKHERPLVIMHGGGCLVDELMKRLALPVVKKNGLRVTPADQIDIITGAL
AGTANKTLLAWAVKHQINAVGLCLADGNTVTVTLLDAELGHVGKAQPGSAALVQTLLAAGYMPIISSIGITVEGQLMNVN
ADQAATALAATLGADLILLSDVSGILDGKGQRIAEMTAQKAEQLIAQGIITDGMVVKVNAALDAARSLGRPVDIASWRHS
EQLPALFNGVPIGTRISV
>P11446 1.2.1.38~~~argC~~~N-acetyl-gamma-glutamyl-phosphate reductase~~~COG0002
MLNTLIVGASGYAGAELVTYVNRHPHMNITALTVSAQSNDAGKLISDLHPQLKGIVDLPLQPMSDISEFSPGVDVVFLAT
AHEVSHDLAPQFLEAGCVVFDLSGAFRVNDATFYEKYYGFTHQYPELLEQAAYGLAEWCGNKLKEANLIAVPGCYPTAAQ
LALKPLIDADLLDLNQWPVINATSGVSGAGRKAAISNSFCEVSLQPYGVFTHRHQPEIATHLGADVIFTPHLGNFPRGIL
ETITCRLKSGVTQAQVAQVLQQAYAHKPLVRLYDKGVPALKNVVGLPFCDIGFAVQGEHLIIVATEDNLLKGAAAQAVQC
ANIRFGYAETQSLI
>Q07906 1.2.1.38~~~argC~~~N-acetyl-gamma-glutamyl-phosphate reductase~~~
MMNVAIIGATGYSGAELFRLLYGHPHVSQCDVFSSSQDGIHLSESFPHVGAVDGAVLHKLEIEALAKYDAVFFATPPGVS
GEWAPALVDRGVKVIDLSGDFRLKDGAVYAQWYGREAAPSAYLERAVYGLTEWNREAVRGAVLLSNPGCYPTATLLGLAP
LVKEGLIKEDSIIVDAKSGVSGAGRKAGLGTHFSEVNENVKIYKVNAHQHIPEIEQALQTWNEAVAPITFSTHLIPMTRG
IMATIYAKAKQSISPNDLVDLYKTSYEGSPFVRIRQLGQFPATKDVYGSNYCDIGLAYDERTERVTVVSVIDNLMKGAAG
QAVQNFNLMMGWDEAEGLRSLPIYP
>P9WPZ9 1.2.1.38~~~argC~~~N-acetyl-gamma-glutamyl-phosphate reductase~~~COG0002
MQNRQVANATKVAVAGASGYAGGEILRLLLGHPAYADGRLRIGALTAATSAGSTLGEHHPHLTPLAHRVVEPTEAAVLGG
HDAVFLALPHGHSAVLAQQLSPETLIIDCGADFRLTDAAVWERFYGSSHAGSWPYGLPELPGARDQLRGTRRIAVPGCYP
TAALLALFPALAADLIEPAVTVVAVSGTSGAGRAATTDLLGAEVIGSARAYNIAGVHRHTPEIAQGLRAVTDRDVSVSFT
PVLIPASRGILATCTARTRSPLSQLRAAYEKAYHAEPFIYLMPEGQLPRTGAVIGSNAAHIAVAVDEDAQTFVAIAAIDN
LVKGTAGAAVQSMNLALGWPETDGLSVVGVAP
>Q8ZKL8 1.2.1.38~~~argC~~~N-acetyl-gamma-glutamyl-phosphate reductase~~~
MLNTLIVGASGYAGAELVSYVNRHPHMTITALTVSAQSNDAGKLISDLHPQLKGIVDLPLQPMSDVRDFSADVDVVFLAT
AHEVSHDLAPQFLQAGCVVFDLSGAFRVNDRAFYEKYYGFTHQYPELLEQAVYGLAEWNVDKLNTANLIAVPGCYPTAAQ
LSLKPLIDGGLLDLTQWPVINATSGVSGAGRKAAISNSFCEVSLQPYGVFTHRHQPEIAVHLGAEVIFTPHLGNFPRGIL
ETITCRLKAGVTHAQVADVLQKAYGDKPLVRLYDKGVPALKNVVGLPFCDIGFAVQGEHLIVVATEDNLLKGAAAQAVQC
ANIRFGFAETQSLI
>P59310 1.2.1.38~~~argC~~~N-acetyl-gamma-glutamyl-phosphate reductase~~~
MLNTLIVGASGYAGAELVTYVNRHPHMNITALTVSAQSNDAGKLISDLHPQLKGIVELPLQPMSDISEFSPGVDVVFLAT
AHEVSHDLAPQFLEAGCVVFDLSGAFRVNDATFYEKYYGFTHQYPELLEQAAYGLAEWCGNKLKEANLIAVPGCYPTAAQ
LALKPLIDADLLDLNQWPVINATSGVSGAGRKAAISNSFCEVSLQPYGVFTHRHQPEIATHLGADVIFTPHLGNFPRGIL
ETITCRLKSGVTQAQVAQALQQAYAHKPLVRLYDKGVPALKNVVGLPFCDIGFAVQGEHLIIVATEDNLLKGAAAQAVQC
ANIRFGYAETQSLI
>Q9X2A2 1.2.1.38~~~argC~~~N-acetyl-gamma-glutamyl-phosphate reductase~~~COG0002
MIRAGIIGATGYTGLELVRLLKNHPEAKITYLSSRTYAGKKLEEIFPSTLENSILSEFDPEKVSKNCDVLFTALPAGASY
DLVRELKGVKIIDLGADFRFDDPGVYREWYGKELSGYENIKRVYGLPELHREEIKNAQVVGNPGCYPTSVILALAPALKH
NLVDPETILVDAKSGVSGAGRKEKVDYLFSEVNESLRPYNVAKHRHVPEMEQELGKISGKKVNVVFTPHLVPMTRGILST
IYVKTDKSLEEIHEAYLEFYKNEPFVHVLPMGIYPSTKWCYGSNHVFIGMQMEERTNTLILMSAIDNLVKGASGQAVQNM
NIMFGLDETKGLEFTPIYP
>O66442 2.6.1.11~~~argD~~~Acetylornithine aminotransferase~~~COG4992
MTYLMNNYARLPVKFVRGKGVYLYDEEGKEYLDFVSGIGVNSLGHAYPKLTEALKEQVEKLLHVSNLYENPWQEELAHKL
VKHFWTEGKVFFANSGTESVEAAIKLARKYWRDKGKNKWKFISFENSFHGRTYGSLSATGQPKFHKGFEPLVPGFSYAKL
NDIDSVYKLLDEETAGIIIEVIQGEGGVNEASEDFLSKLQEICKEKDVLLIIDEVQTGIGRTGEFYAYQHFNLKPDVIAL
AKGLGGGVPIGAILAREEVAQSFTPGSHGSTFGGNPLACRAGTVVVDEVEKLLPHVREVGNYFKEKLKELGKGKVKGRGL
MLGLELERECKDYVLKALEKGLLINCTAGKVLRFLPPLIIQKEHIDRAISVLREIL
>Q9PIR7 2.6.1.11~~~argD~~~Acetylornithine aminotransferase~~~COG4992
MDYKEQSHIIPTYKRFDIVLEKGQGVYLFDDKAKKYLDFSSGIGVCALGYNHAKFNAKIKAQVDKLLHTSNLYYNENIAA
AAKNLAKASALERVFFTNSGTESIEGAMKTARKYAFNKGVKGGQFIAFKHSFHGRTLGALSLTANEKYQKPFKPLISGVK
FAKYNDISSVEKLVNEKTCAIILESVQGEGGINPANKDFYKALRKLCDEKDILLIADEIQCGMGRSGKFFAYEHAQILPD
IMTSAKALGCGLSVGAFVINQKVASNSLEAGDHGSTYGGNPLVCAGVNAVFEIFKEEKILENVNKLTPYLEQSLDELINE
FDFCKKRKGLGFMQGLSLDKSVKVAKVIQKCQENALLLISCGENDLRFLPPLILQKEHIDEMSEKLRKALKSF
>P18335 2.6.1.11~~~argD~~~Acetylornithine/succinyldiaminopimelate aminotransferase~~~COG4992
MAIEQTAITRATFDEVILPIYAPAEFIPVKGQGSRIWDQQGKEYVDFAGGIAVTALGHCHPALVNALKTQGETLWHISNV
FTNEPALRLGRKLIEATFAERVVFMNSGTEANETAFKLARHYACVRHSPFKTKIIAFHNAFHGRSLFTVSVGGQPKYSDG
FGPKPADIIHVPFNDLHAVKAVMDDHTCAVVVEPIQGEGGVTAATPEFLQGLRELCDQHQALLVFDEVQCGMGRTGDLFA
YMHYGVTPDILTSAKALGGGFPISAMLTTAEIASAFHPGSHGSTYGGNPLACAVAGAAFDIINTPEVLEGIQAKRQRFVD
HLQKIDQQYDVFSDIRGMGLLIGAELKPQYKGRARDFLYAGAEAGVMVLNAGPDVMRFAPSLVVEDADIDEGMQRFAHAV
AKVVGA
>A0QYS9 2.6.1.11~~~argD~~~Acetylornithine aminotransferase~~~COG4992
MTLQSRWEAVMMNNYGTPPLSLVSGEGAVVTDADGREYLDLLGGIAVNLLGHRHPAVIEAVTTQLDTLGHTSNLYATEPG
IALAEALVGQLGTQARVFFCNSGTEANEVAFKITRLTGKTKIVAAEGAFHGRTMGSLALTGQPSKQAPFEPLPGNVMHVP
YGDVAALEAAVDDQTAAVFLEPIMGEGGVVVPPAGYLVAAREITSKHGALLVLDEVQTGVGRTGAFFAHQHDGIVPDVVT
MAKGLGGGLPIGACLAVGATGDLLTPGLHGSTFGGNPVCTAAGLAVLKTLAAEDLVARAGVLGKTLSHGIEELGHPLVDK
VRGKGLLQGIVLTVPSAKAVETAARDAGFLVNAAAPEVVRLAPPLIITEGQIEAFITALPAVLDTAAEDS
>P9WPZ7 2.6.1.11~~~argD~~~Acetylornithine aminotransferase~~~COG4992
MTGASTTTATMRQRWQAVMMNNYGTPPIALASGDGAVVTDVDGRTYIDLLGGIAVNVLGHRHPAVIEAVTRQMSTLGHTS
NLYATEPGIALAEELVALLGADQRTRVFFCNSGAEANEAAFKLSRLTGRTKLVAAHDAFHGRTMGSLALTGQPAKQTPFA
PLPGDVTHVGYGDVDALAAAVDDHTAAVFLEPIMGESGVVVPPAGYLAAARDITARRGALLVLDEVQTGMGRTGAFFAHQ
HDGITPDVVTLAKGLGGGLPIGACLAVGPAAELLTPGLHGSTFGGNPVCAAAALAVLRVLASDGLVRRAEVLGKSLRHGI
EALGHPLIDHVRGRGLLLGIALTAPHAKDAEATARDAGYLVNAAAPDVIRLAPPLIIAEAQLDGFVAALPAILDRAVGAP
>P40732 2.6.1.11~~~argD~~~Acetylornithine/succinyldiaminopimelate aminotransferase~~~
MATEQTAITRATFDEVILPVYAPADFIPVKGKGSRVWDQQGKEYIDFAGGIAVTALGHCHPALVEALKSQGETLWHTSNV
FTNEPALRLGRKLIDATFAERVLFMNSGTEANETAFKLARHYACVRHSPFKTKIIAFHNAFHGRSLFTVSVGGQPKYSDG
FGPKPADIIHVPFNDLHAVKAVMDDHTCAVVVEPIQGEGGVQAATPEFLKGLRDLCDEHQALLVFDEVQCGMGRTGDLFA
YMHYGVTPDILTSAKALGGGFPVSAMLTTQEIASAFHVGSHGSTYGGNPLACAVAGAAFDIINTPEVLQGIHTKRQQFVQ
HLQAIDEQFDIFSDIRGMGLLIGAELKPKYKGRARDFLYAGAEAGVMVLNAGADVMRFAPSLVVEEADIHEGMQRFAQAV
GKVVA
>Q9X2A5 2.6.1.11~~~argD~~~Acetylornithine aminotransferase~~~COG4992
MYLMNTYSRFPATFVYGKGSWIYDEKGNAYLDFTSGIAVNVLGHSHPRLVEAIKDQAEKLIHCSNLFWNRPQMELAELLS
KNTFGGKVFFANTGTEANEAAIKIARKYGKKKSEKKYRILSAHNSFHGRTLGSLTATGQPKYQKPFEPLVPGFEYFEFNN
VEDLRRKMSEDVCAVFLEPIQGESGIVPATKEFLEEARKLCDEYDALLVFDEVQCGMGRTGKLFAYQKYGVVPDVLTTAK
GLGGGVPIGAVIVNERANVLEPGDHGTTFGGNPLACRAGVTVIKELTKEGFLEEVEEKGNYLMKKLQEMKEEYDVVADVR
GMGLMIGIQFREEVSNREVATKCFENKLLVVPAGNNTIRFLPPLTVEYGEIDLAVETLKKVLQGI
>P23908 3.5.1.16~~~argE~~~Acetylornithine deacetylase~~~COG0624
MKNKLPPFIEIYRALIATPSISATEEALDQSNADLITLLADWFKDLGFNVEVQPVPGTRNKFNMLASIGQGAGGLLLAGH
TDTVPFDDGRWTRDPFTLTEHDGKLYGLGTADMKGFFAFILDALRDVDVTKLKKPLYILATADEETSMAGARYFAETTAL
RPDCAIIGEPTSLQPVRAHKGHISNAIRIQGQSGHSSDPARGVNAIELMHDAIGHILQLRDNLKERYHYEAFTVPYPTLN
LGHIHGGDASNRICACCELHMDIRPLPGMTLNELNGLLNDALAPVSERWPGRLTVDELHPPIPGYECPPNHQLVEVVEKL
LGAKTEVVNYCTEAPFIQTLCPTLVLGPGSINQAHQPDEYLETRFIKPTRELITQVIHHFCWH
>Q9K4Z2 3.5.1.16~~~argE~~~Acetylornithine deacetylase~~~
MQLPQFSELYKSLILIPSISSLEKELDISNKPVIDLLSGWFSELGFSINITSVPETNGKFNLVATYGQGDGGLLLAGHTD
TVPFDDDLWTKDPFKLTEKDDKWYGLGTIDMKGFFAFVLEACKNIDLTKLDKPLRILATADEETTMAGARAIAAAQSFRP
DYAVIGEPTGMVPVFMHKGHMSEAIRITGRSGHSSDPANGINAIEIMHQVTGQLLQLQRKLKEQYACDHFVIPQPTLNFG
HVHGGDSPNRICGSCELHIDMRPIPGVNPDELFMLLNQALLPIIKQWPGAVDVYHLHEPIPAYACNTDSALIKLAEKLTG
EAVIPVNYCTEAPFIQQLGCDTIVMGPGSINQAHQPDEYLDLSAIKPTQAIIQKLIEETCKN
>Q9K4Z7 3.5.1.16~~~argE~~~Acetylornithine deacetylase~~~
MQLPKFNELYKSLILTPSISSLEKELDISNKPVIDLLAAWFSELGFSINITSVPETNGKFNLVATYGQGDGGLLLAGHTD
TVPFDDGLWTKDPFQLTEKDDKWYGLGTIDMKGFFAFVLEACKNIDLTKLDKPLRILATADEETTMAGARAIAAAKSFRP
DYAVIGEPTSMVPVFMHKGHMSEAIRITGRSGHSSDPANGINAIEIMHQVTGQLLQLQRKLKEQYACDHFVIPQPTLNFG
HVHGGDSPNRICGSCELHIDMRPIPGVNPDELFMLLNQALLPIMKQWPGAVDVYHLHEPIPAYACDTDSALIKLAEKLTG
ETVIPVNYCTEAPFIHTGCDTIVMGPGSINQAHQPDEYLDLSAIKPTQAIIQKLIEQSCKN
>O68873 3.5.1.16~~~argE~~~Acetylornithine deacetylase~~~
MSDTLPALRATLTELVAMDTTSFRPNVPLIDYAQARLEAAGFSAERQKFLDDAGVEKVNLVAVKGGSGSGRAALALVGHS
DCVPYDAAWTDALRLTEKDGRLYARGACDTKGFIACALHAALNAEQLKAPLMVVLTADEEVGLTGAKKLVEAGLGRARHA
IVGEPTRLIPVRANKGYCLAEVEVRGKEGHSAYPDSGASAIFRAGRFLQRLEHLALTVLREDLDEGFQPPFTTVNVGVIQ
GGKAKNVIPGACRFVVEWRPIPGQPPERVSQLLETIRQELVRDEPAFEAQIRVVRTDRGVNTRADAEVVRFLAEASGNAP
ETVSFGTEAPQMTELGAEAVVFGPGDIRVAHQTGEYVPVEDLVRCEAVLARAVAHFCGGR
>Q6WZB0 1.14.11.41~~~vioC~~~Alpha-ketoglutarate-dependent L-arginine hydroxylase~~~
MTESPTTHHGAAPPDSVATPVRPWSEFRLTPAEAAAAAALAARCAQRYDETDGPEFLLDAPVIAHELPRRLRTFMARARL
DAWPHALVVRGNPVDDAALGSTPVHWRTARTPGSRPLSFLLMLYAGLLGDVFGWATQQDGRVVTDVLPIKGGEHTLVSSS
SRQELGWHTEDAFSPYRADYVGLLSLRNPDGVATTLAGVPLDDLDERTLDVLFQERFLIRPDDSHLQVNNSTAQQGRVEF
EGIAQAADRPEPVAILTGHRAAPHLRVDGDFSAPAEGDEEAAAALGTLRKLIDASLYELVLDQGDVAFIDNRRAVHGRRA
FQPRYDGRDRWLKRINITRDLHRSRKAWAGDSRVLGQR
>P14012 3.5.3.1~~~arcA~~~Arginase~~~
MNGAGEINASRHRKENELKTCQILGAPVQSGASQPGCLMGPDAFRTAGLTQVLTELGWAVTDLGDATPTVEPELSHPNSA
VKNLDALVGWTRSLSQKALEMARSCDLPVFLGGDHSMSAGTVSGVAQRTAELGKEQFVLWLDAHTDLHTLHTTASGNLHG
TPVAYYTGQSGFEGLPPLAAPVNPRNVSMMGIRSVDPEERRRVAEIGVQVADMRVLDEQGVVRPLEAFLDRVSKVSGRLH
VSLDVDFLDPAIAPAVGTTVPGGATFREAHLIMEMLHDSGLVTSLDLAELNPFLDERGRTARLITDLASSLFGRRVFDRV
TTAF
>P53608 3.5.3.1~~~rocF~~~Arginase~~~
MKPISIIGVPMDLGQTRRGVDMGPSAMRYAGVIERLERLHYDIEDLGDIPIGKAERLHEQGDSRLRNLKAVAEANEKLAA
AVDQVVQRGRFPLVLGGDHSIAIGTLAGVAKHYERLGVIWYDAHGDVNTAETSPSGNIHGMPLAASLGFGHPALTQIGGY
SPKIKPEHVVLIGVRSLDEGEKKFIREKGIKIYTMHEVDRLGMTRVMEETIAYLKERTDGVHLSLDLDGLDPSDAPGVGT
PVIGGLTYRESHLAMEMLAEAQIITSAEFVEVNPILDERNKTASVAVALMGSLFGEKLM
>P39138 3.5.3.1~~~rocF~~~Arginase~~~COG0010
MDKTISVIGMPMDLGQARRGVDMGPSAIRYAHLIERLSDMGYTVEDLGDIPINREKIKNDEELKNLNSVLAGNEKLAQKV
NKVIEEKKFPLVLGGDHSIAIGTLAGTAKHYDNLGVIWYDAHGDLNTLETSPSGNIHGMPLAVSLGIGHESLVNLEGYAP
KIKPENVVIIGARSLDEGERKYIKESGMKVYTMHEIDRLGMTKVIEETLDYLSACDGVHLSLDLDGLDPNDAPGVGTPVV
GGISYRESHLAMEMLYDAGIITSAEFVEVNPILDHKNKTGKTAVELVESLLGKKLL
>Q7M0Z3 3.5.3.1~~~rocF~~~Arginase~~~COG0010
MNKNMSIVGVPMDLGADRRGVDMGPSAIRYAGVVARLEKMGFNIEDRGDIFVTLPHHFTETENHKYLDEVVEANEKLANV
VSDIMTAGRFPLVLGGDHSIALGTIAGVAKHVKNLGVICLDAHGDLNTGATSPSGNIHGMPLAASLGYGHERLTNIGGYT
PKVKAENVVIIGARDLDQGERELIKRIGMKVFTMHEIDKLGMARVMDEAIAHVSKNTDGVHLSLDLDGLDPHDAPGVGTP
VIGGISYREGHVSLEMLADADILCSAEFVEVNPILDRENMTARVAVALMSSVFGDKLL
>P60088 3.5.3.1~~~arg~~~Arginase~~~
MTKTKAIDIIGAPSTFGQRKLGVDLGPTAIRYAGLISRLKQLDLDVYDKGDIKVPAVNIEKFHSEQKGLRNYDEIIDVNQ
KLNKEVSASIENNRFPLVLGGDHSIAVGSVSAISKHYNNLGVIWYDAHGDLNIPEESPSGNIHGMPLRILTGEGPKELLE
LNSNVIKPENIVLIGMRDLDKGERQFIKDHNIKTFTMSDIDKLGIKEVIENTIEYLKSRNVDGVHLSLDVDALDPLETPG
TGTRVLGGLSYRESHFALELLHQSHLISSMDLVEVNPLIDSNNHTAEQAVSLVGTFFGETLL
>Q07908 ~~~argJ~~~Arginine biosynthesis bifunctional protein ArgJ~~~
MTITKQTGQVTAVADGTVVTPEGFQAAGVNAGLRYSKNDLGVILCDVPASAAAVYTQSHFQAAPLKVTQASLAVEQKLQA
VIVNRPCANACTGAQGLKDAYEMRELCAKQFGLALHHVAVASTGVIGEYLPMEKIRAGIKQLVPGVTMADAEAFQTAILT
TDTVMKRACYQTTIDGKTVTVGGAAKGSGMIHPNMATMLAFITTDANVSSPVLHAALRSITDVSFNQITVDGDTSTNDMV
VVMASGLAGNDELTPDHPDWENFYEALRKTCEDLAKQIAKDGEGATKLIEVRVRGAKTDEEAKKIAKQIVGSNLVKTAVY
GADANWGRIIGAIGYSDAEVNPDNVDVAIGPMVMLKGSEPQPFSEEEAAAYLQQETVVIEVDLHIGDGVGVAWGCDLTYD
YVKINASYRT
>Q9K8V3 ~~~argJ~~~Arginine biosynthesis bifunctional protein ArgJ~~~COG1364
MNVINETANVLKLETGSVTSAKGFSAVGIHTGVKRKRKDLGAIVCEVPASSAAVYTLNKVQAAPLKVTQESIAVEGKLQA
MIVNSGIANACTGKRGLDDAYTMRAVGAETFHIPEHYVAVTSTGVIGEFLPMDVITNGIRQLKPEATIEGAHAFNEAILT
TDTVEKHTCYQTIVNGKTVTVGGVAKGSGMIHPNMATMLSFVTTDANIDHGHLQGALSAITNETFNRITVDGDTSTNDMV
VVMASGLAENEALTPEHPDWANFYKALQLACEDLAKQIARDGEGATKLIEVEVTGAANDQEAGMVAKQIVGSDLVKTAIY
GADANWGRIICAIGYSGCEVNQETIDIAIGPIVTLKQSEPTGFSEEEATAYLKEADPVKISVNLHIGNGTGKAWGCDLTY
DYVRINAGYRT
>P9WPZ3 ~~~argJ~~~Arginine biosynthesis bifunctional protein ArgJ~~~COG1364
MTDLAGTTRLLRAQGVTAPAGFRAAGVAAGIKASGALDLALVFNEGPDYAAAGVFTRNQVKAAPVLWTQQVLTTGRLRAV
ILNSGGANACTGPAGFADTHATAEAVAAALSDWGTETGAIEVAVCSTGLIGDRLPMDKLLAGVAHVVHEMHGGLVGGDEA
AHAIMTTDNVPKQVALHHHDNWTVGGMAKGAGMLAPSLATMLCVLTTDAAAEPAALERALRRAAAATFDRLDIDGSCSTN
DTVLLLSSGASEIPPAQADLDEAVLRVCDDLCAQLQADAEGVTKRVTVTVTGAATEDDALVAARQIARDSLVKTALFGSD
PNWGRVLAAVGMAPITLDPDRISVSFNGAAVCVHGVGAPGAREVDLSDADIDITVDLGVGDGQARIRTTDLSHAYVEENS
AYSS
>Q8CK24 ~~~argJ~~~Arginine biosynthesis bifunctional protein ArgJ~~~COG1364
MSVTAAKGFTAAGITAGIKESGSPDLALVVNTGPRRSAAGVFTSNRVKAAPVLWSEQVLKSGEVTAVVLNSGGANACTGP
KGFQDTHATAEKAADVLGTGAGEVAVCSTGLIGVLLPMDKLLPGVEAAAGQLSEHGGEKAAIAIKTTDTVHKTSVVTRDG
WTVGGMAKGAGMLAPGLATMLVVITTDADLETEALDRALRAATRVTFDRVDSDGCMSTNDTVLLLSSGSSGVTPEYDAFA
EAVRTVCDDLGQQLIRDAEGASKDIKVEVVNAATEDEAVQVGRTIARNNLLKCAIHGEDPNWGRVLSAIGTTDAAFEPDR
LNVAINGVWVCKNGGVGEDRELVDMRYREVHIVADLAAGDATATIWTNDLTADYVHENSAYSS
>Q9Z4S1 ~~~argJ~~~Arginine biosynthesis bifunctional protein ArgJ~~~COG1364
MFVPRGFSYAGVHCRIKRKRKDLGIIFSEVPCTAAGVFTTNVVKAAPVIYDMEILGKNPSGIRAITVNSGVANACTGEQG
MINARRMAEKTAKELNIPVESVLVSSTGVIGVQLPMEKVESGIEEAVKNLSKDPVPFAEAIMTTDTKIKIHSKKVTIEGK
EITVLGIAKGSGMIHPNMATMLSFITTDANVSEDALKKLLKISVDDSYNMIDVDGDTSTNDMVIILANGLAGNAPIQEET
DGFWKLYEAVHEVNQVLAEKIVEDGEGATKVIEVEVRNAPDRNSARLIARAIVSSNLVKTAIYGEDANWGRVIAAAGYSG
AQFDPDRLDLFFESAAGRIKVAENGQGVDFDEDTAKKILSEKKVKIILDMKQGKELARAWGCDLTEKYVEINGRYRT
>P27254 3.6.5.-~~~argK~~~GTPase ArgK~~~COG1703
MINEATLAESIRRLRQGERATLAQAMTLVESRHPRHQALSTQLLDAIMPYCGNTLRLGVTGTPGAGKSTFLEAFGMLLIR
EGLKVAVIAVDPSSPVTGGSILGDKTRMNDLARAEAAFIRPVPSSGHLGGASQRARELMLLCEAAGYDVVIVETVGVGQS
ETEVARMVDCFISLQIAGGGDDLQGIKKGLMEVADLIVINKDDGDNHTNVAIARHMYESALHILRRKYDEWQPRVLTCSA
LEKRGIDEIWHAIIDFKTALTASGRLQQVRQQQSVEWLRKQTEEEVLNHLFANEDFDRYYRQTLLAVKNNTLSPRTGLRQ
LSEFIQTQYFD
>P0DSE4 ~~~argL~~~Putative translational regulatory protein ArgL~~~
MNNYTYKVNFNSISGVRHARIKCPIYTKNTF
>P11667 ~~~argO~~~Arginine exporter protein ArgO~~~COG1279
MFSYYFQGLALGAAMILPLGPQNAFVMNQGIRRQYHIMIALLCAISDLVLICAGIFGGSALLMQSPWLLALVTWGGVAFL
LWYGFGAFKTAMSSNIELASAEVMKQGRWKIIATMLAVTWLNPHVYLDTFVVLGSLGGQLDVEPKRWFALGTISASFLWF
FGLALLAAWLAPRLRTAKAQRIINLVVGCVMWFIALQLARDGIAHAQALFS
>P0A8S1 ~~~argP~~~HTH-type transcriptional regulator ArgP~~~COG0583
MKRPDYRTLQALDAVIRERGFERAAQKLCITQSAVSQRIKQLENMFGQPLLVRTVPPRPTEQGQKLLALLRQVELLEEEW
LGDEQTGSTPLLLSLAVNADSLATWLLPALAPVLADSPIRLNLQVEDETRTQERLRRGEVVGAVSIQHQALPSCLVDKLG
ALDYLFVSSKPFAEKYFPNGVTRSALLKAPVVAFDHLDDMHQAFLQQNFDLPPGSVPCHIVNSSEAFVQLARQGTTCCMI
PHLQIEKELASGELIDLTPGLFQRRMLYWHRFAPESRMMRKVTDALLDYGHKVLRQD
>P17893 ~~~argR~~~Arginine repressor~~~COG1438
MNKGQRHIKIREIITSNEIETQDELVDMLKQDGYKVTQATVSRDIKELHLVKVPTNNGSYKYSLPADQRFNPLSKLKRAL
MDAFVKIDSASHMIVLKTMPGNAQAIGALMDNLDWDEMMGTICGDDTILIICRTPEDTEGVKNRLLELL
>P0A6D0 ~~~argR~~~Arginine repressor~~~COG1438
MRSSAKQEELVKAFKALLKEEKFSSQGEIVAALQEQGFDNINQSKVSRMLTKFGAVRTRNAKMEMVYCLPAELGVPTTSS
PLKNLVLDIDYNDAVVVIHTSPGAAQLIARLLDSLGKAEGILGTIAGDDTIFTTPANGFTVKDLYEAILELFDQEL
>O31408 ~~~argR~~~Arginine repressor~~~
MNKGQRHIKIREIIMSNDIETQDELVDRLREAGFNVTQATVSRDIKEMQLVKVPMANGRYKYSLPSDQRFNPLQKLKRAL
VDVFIKLDGTGNLLVLRTLPGNAHAIGVLLDNLDWDEIVGTICGDDTCLIICRTPKDAKKVSNQLLSML
>Q9K973 ~~~argR~~~Arginine repressor~~~COG1438
MNKGQRHIKIREIIANNDVETQDELVEQLKAAGYNVTQATVSRDIKELHLVKVPMMDGRYKYSLPADQRFNPLQKLKRGL
VDSFVSIDRTDNLIVMKTLPGNAHAIGALIDNLDWTEIMGTICGDDTILIICKDKQDGPVVTERFLNML
>P9WPY9 ~~~argR~~~Arginine repressor~~~COG1438
MSRAKAAPVAGPEVAANRAGRQARIVAILSSAQVRSQNELAALLAAEGIEVTQATLSRDLEELGAVKLRGADGGTGIYVV
PEDGSPVRGVSGGTDRMARLLGELLVSTDDSGNLAVLRTPPGAAHYLASAIDRAALPQVVGTIAGDDTILVVAREPTTGA
QLAGMFENLR
>P0A1B3 ~~~argR~~~Arginine repressor~~~
MRSSAKQEELVRAFKALLKEEKFSSQGEIVLALQDQGFENINQSKVSRMLTKFGAVRTRNAKMEMVYCLPAELGVPTTSS
PLKNLVLDIDYNDAVVVIHTSPGAAQLIARLLDSLGKAEGILGTIAGDDTIFTTPASGFSVRDLYEAILELFEQEL
>P63580 ~~~argR~~~Arginine repressor~~~
MPKKSVRHIKIREIISNEQIETQDELVKRLNDYDLNVTQATVSRDIKELQLIKVPIPSGQYVYSLPNDRKFHPLEKLGRY
LMDSFVNIDGTDNLLVLKTLPGNAQSIGAILDQINWEEVLGTICGDDTCLIICRSKEASDEIKSRIFNLL
>Q9L1A5 ~~~argR~~~Arginine repressor~~~COG1438
MSHAQEHEQPAGPALPQTRTARHRRIVDILNRQAVRSQSQLAKLLADDGLTVTQATLSRDLDELNAVKIRNTDGDLIYAV
PSEGGFRTPRAPLGESAKEERMRRLSAELLISAEASANLVVLRTPPGAAQFLASAIDQAELHDILGTIAGDDTLMLISRE
PTGGQALADHLLRLAQNGG
>Q8GND0 ~~~argR~~~Arginine regulator~~~COG1438
MNKIESRHRLIRSLIMEKKVHTQQELQELLEANGVIVTQSTLSRDMKALNLVKVTENNISYYVINSIAPSRWEKRLRFYM
EDALIMLRPVQNQVVMKTLPGLAQSFGAILDALELPQIVATVCGDDVCLIICEDNPSAIECFDKLKEFAPPFFFSK
>Q7MP98 ~~~argR~~~Arginine repressor~~~COG1438
MRPSEKQDNLVRAFKALLKEERFGSQGEIVEALKQEGFENINQSKVSRMLTKFGAVRTRNAKMEMVYCLPTELGVPTVSS
SLRELVLDVDHNQALVVIHTGPGAAQLIARMLDSLGKSEGILGVVAGDDTIFITPTLTITTEQLFKSVCELFEYAG
>P09551 ~~~argT~~~Lysine/arginine/ornithine-binding periplasmic protein~~~COG0834
MKKSILALSLLVGLSTAASSYAALPETVRIGTDTTYAPFSSKDAKGDFVGFDIDLGNEMCKRMQVKCTWVASDFDALIPS
LKAKKIDAIISSLSITDKRQQEIAFSDKLYAADSRLIAAKGSPIQPTLDSLKGKHVGVLQGSTQEAYANETWRSKGVDVV
AYANQDLVYSDLAAGRLDAALQDEVAASEGFLKQPAGKDFAFAGSSVKDKKYFGDGTGVGLRKDDAELTAAFNKALGELR
QDGTYDKMAKKYFDFNVYGD
>P02911 ~~~argT~~~Lysine/arginine/ornithine-binding periplasmic protein~~~
MKKTVLALSLLIGLGATAASYAALPQTVRIGTDTTYAPFSSKDAKGEFIGFDIDLGNEMCKRMQVKCTWVASDFDALIPS
LKAKKIDAIISSLSITDKRQQEIAFSDKLYAADSRLIAAKGSPIQPTLESLKGKHVGVLQGSTQEAYANDNWRTKGVDVV
AYANQDLIYSDLTAGRLDAALQDEVAASEGFLKQPAGKEYAFAGPSVKDKKYFGDGTGVGLRKDDTELKAAFDKALTELR
QDGTYDKMAKKYFDFNVYGD
>P75993 ~~~ariR~~~Probable two-component-system connector protein AriR~~~
MLEDTTIHNAITDKALASYFRSSGNLLEEESAVLGQAVTNLMLSGDNVNNKNIILSLIHSLETTSDILKADVIRKTLEIV
LRYTADDM
>Q9KJN4 ~~~arlR~~~Response regulator ArlR~~~COG0745
MTQILIVEDEQNLARFLELELTHENYNVDTEYDGQDGLDKALSHYYDLIILDLMLPSINGLEICRKIRQQQSTPIIIITA
KSDTYDKVAGLDYGADDYIVKPFDIEELLARIRAILRRQPQKDIIDVNGITIDKNAFKVTVNGAEIELTKTEYDLLYLLA
ENKNHVMQREQILNHVWGYNSEVETNVVDVYIRYLRNKLKPYDRDKMIETVRGVGYVIR
>Q9KJN3 2.7.13.3~~~arlS~~~Signal transduction histidine-protein kinase ArlS~~~COG5002
MTKRKLRNNWIIVTTMITFVTIFLFCLIIIFFLKDTLHNSELDDAERSSSDINNLFHSKPVKDISALDLNASLGNFQEII
IYDEHNNKLFETSNDNTVRVEPGYEHRYFDRVIKKRYKGIEYLIIKEPITTQDFKGYSLLIHSLENYDNIVKSLYIIALA
FGVIATIITATISYVFSTQITKPLVSLSNKMIEIRRDGFQNKLQLNTNYEEIDNLANTFNEMMSQIEESFNQQRQFVEDA
SHELRTPLQIIQGHLNLIQRWGKKDPAVLEESLNISIEEMNRIIKLVEELLELTKGDVNDISSEAQTVHINDEIRSRIHS
LKQLHPDYQFDTDLTSKNLEIKMKPHQFEQLFLIFIDNAIKYDVKNKKIKVKTRLKNKQKIIEITDHGIGIPEEDQDFIF
DRFYRVDKSRSRSQGGNGLGLSIAQKIIQLNGGSIKIKSEINKGTTFKIIF
>Q7A5N3 2.7.13.3~~~arlS~~~Signal transduction histidine-protein kinase ArlS~~~
MTKRKLRNNWIIVTTMITFVTIFLFCLIIIFFLKDTLHNSELDDAERSSSDINNLFHSKPVKDISALDLNASLGNFQEII
IYDEHNNKLFETSNDNTVRVEPGYEHRYFDRVIKKRYKGIEYLIIKEPITTQDFKGYSLLIHSLENYDNIVKSLYIIALA
FGVIATIITATISYVFSTQITKPLVSLSNKMIEIRRDGFQNKLQLNTNYEEIDNLANTFNEMMSQIEESFNQQRQFVEDA
SHELRTPLQIIQGHLNLIQRWGKKDPAVLEESLNISIEEMNRIIKLVEELLELTKGDVNDISSEAQTVHINDEIRSRIHS
LKQLHPDYQFDTDLTSKNLEIKMKPHQFEQLFLIFIDNAIKYDVKNKKIKVKTRLKNKQKIIEITDHGIGIPEEDQDFIF
DRFYRVDKSRSRSQGGNGLGLSIAQKIIQLNGGSIKIKSEINKGTTFKIIF
>Q72D36 4.3.2.1~~~argH~~~Argininosuccinate lyase~~~COG0165
MAEKKMWGGRFKQGTATLVEEYTESVSYDRALYAQDIAGSMAHARMLARQGVLTAEEAAIIVDGLATVRSEIEAGSFVWR
REFEDVHMNIENRLTELVGDVGKKLHTGRSRNDQVALDFRLFVSDRVRVWRELGRDLVGVIVDQARQHTATLLPGCTHMQ
PAQPVSLAQHLLAYAWMLRRDIDRLEDCDKRARVCPLGAAALAGTTYPLDPASVADELGMYGTFRNSMDAVSDRDFVLEA
LFDGSVIMAHLSRLCEEFILWANPAFGYIFLPDAYATGSSIMPQKKNPDVAELMRGKTGRVYGALTTMLTTVKGLPMTYN
RDLQEDKEPFIDADRTVSASLEIMAGMLREVRFNTARMRTALRSGFLNATELADYLVGKGIPFREAHHLTGAAVALAEEK
GVTLEELPLEDYRGICDRIDEDVYPILEPEAAVSRRETPGGTGPRSVAAQIAELDSWLGR
>P11447 4.3.2.1~~~argH~~~Argininosuccinate lyase~~~COG0165
MALWGGRFTQAADQRFKQFNDSLRFDYRLAEQDIVGSVAWSKALVTVGVLTAEEQAQLEEALNVLLEDVRARPQQILESD
AEDIHSWVEGKLIDKVGQLGKKLHTGRSRNDQVATDLKLWCKDTVSELLTANRQLQSALVETAQNNQDAVMPGYTHLQRA
QPVTFAHWCLAYVEMLARDESRLQDALKRLDVSPLGCGALAGTAYEIDREQLAGWLGFASATRNSLDSVSDRDHVLELLS
AAAIGMVHLSRFAEDLIFFNTGEAGFVELSDRVTSGSSLMPQKKNPDALELIRGKCGRVQGALTGMMMTLKGLPLAYNKD
MQEDKEGLFDALDTWLDCLHMAALVLDGIQVKRPRCQEAAQQGYANATELADYLVAKGVPFREAHHIVGEAVVEAIRQGK
PLEDLPLSELQKFSQVIDEDVYPILSLQSCLDKRAAKGGVSPQQVAQAIAFAQARLG
>A0QYS5 4.3.2.1~~~argH~~~Argininosuccinate lyase~~~COG0165
MSTNEGSLWGGRFADGPSDALAALSKSTHFDWALAPYDIKASKAHARVLHRAGLLTDEQRDGLLAGLDSLGSDVADGSFE
PLPTDEDVHGALERGLIDRVGPDLGGRLRAGRSRNDQVATLFRMWLRDAVRRVADGCLEVVNALAVQAAAHPTAIMPGKT
HLQAAQPILLAHHLLAHAHPLLRDVDRLADFDDRTAVSPYGSGALAGSSLGLDPDAIAEDLGFASAADNSVDATASRDFA
AEAAFVFAQIGVDLSRLAEDIILWSSTEFGYVTLHDAWSTGSSIMPQKKNPDIAELARGKSGRLIGNLTGLLATLKAQPL
AYNRDLQEDKEPVFDSVAQLELLLPAMAGLVGTLTFDEERMAELAPAGYTLATDIAEWLVRQGVPFRIAHEAAGAAVKVA
EGRGVGLDALTDDEFASINPALTPDVREVLTVEGSVNARNARGGTAPTQVAKQLGVVRKAMEELRIRLS
>P9WPY7 4.3.2.1~~~argH~~~Argininosuccinate lyase~~~COG0165
MSTNEGSLWGGRFAGGPSDALAALSKSTHFDWVLAPYDLTASRAHTMVLFRAGLLTEEQRDGLLAGLDSLAQDVADGSFG
PLVTDEDVHAALERGLIDRVGPDLGGRLRAGRSRNDQVAALFRMWLRDAVRRVATGVLDVVGALAEQAAAHPSAIMPGKT
HLQSAQPILLAHHLLAHAHPLLRDLDRIVDFDKRAAVSPYGSGALAGSSLGLDPDAIAADLGFSAAADNSVDATAARDFA
AEAAFVFAMIAVDLSRLAEDIIVWSSTEFGYVTLHDSWSTGSSIMPQKKNPDIAELARGKSGRLIGNLAGLLATLKAQPL
AYNRDLQEDKEPVFDSVAQLELLLPAMAGLVASLTFNVQRMAELAPAGYTLATDLAEWLVRQGVPFRSAHEAAGAAVRAA
EQRGVGLQELTDDELAAISPELTPQVREVLTIEGSVSARDCRGGTAPGRVAEQLNAIGEAAERLRRQLVR
>Q9LAE5 4.3.2.1~~~argH~~~Argininosuccinate lyase~~~COG0165
MTKEQTWSQRFESALHPAIARFNASIGFDIELIEYDLTGSQAHAKMLAHTGIISSEEGEQLVAGLEQIRQEHRQGKFHPG
VDAEDVHFAVEKRLTEIVGDVGKKLHTARSRNDQVGTDTRLYLRDQIQQIKSELREFQGVLLDIAEKHVETLIPGYTHLQ
RAQPVSLAHHLLAYFQMAQRDWERLGDVSRRVNISPLGCGALAGTTFPIDRHYTAKLLDFDNIYANSLDGVSDRDFAIEF
LCAASLIMVHLSRLAEEVILWSSEEFRFVILKDSCATGSSIMPQKKNPDVPELVRGKTGRVFGHLQAMLVIMKGLPLAYN
KDLQEDKEGLFDSVNTVKASLEAMTILLREGLEFRTQRLAQAVTEDFSNATDVADYLAARGVPFREAYNLVGKVVKTSIA
AGKLLKDLELEEWQQLHPAFAADIYEAISPRQVVAARNSHGGTGFVQVSKALIAARAQIDQ
>Q5SLL0 4.3.2.1~~~argH~~~Argininosuccinate lyase~~~COG0165
MAHRTWGGRFGEGPDALAARFNASLAFDRALWREDLWQNRVHARMLHAVGLLSAEELEAILKGLDRIEEEIEAGTFPWRE
ELEDVHMNLEARLTELVGPPGGKLHTARSRNDQVATDLRLYLRGAIDELLALLLALRRVLVREAEKHLDPLYVLPGYTHL
QRAQPVLLAHWFLAYYEMLKRDAGRLEDAKERLNESPLGAAALAGTGFPIDRHFTARELGFKAPMRNSLDAVASRDFALE
VLSALNIGMLHLSRMAEELILYSTEEFGFVEVPDAFATGSSIMPQKKNPDILELIRAKAGRVLGAFVGLSAVVKGLPLAY
NKDLQEDKEPLLDALATYRDSLRLLAALLPGLKWRRERMWRAAEGGYTLATELADYLAEKGLPFREAHHVVGRLVRRLVE
EGRALKDLTLEELQAHHPLFAEDALPLLRLETAIHRRRSYGGTAPEAVRERLEEAKKEVGLD
>P77398 ~~~arnA~~~Bifunctional polymyxin resistance protein ArnA~~~COG0223
MKTVVFAYHDMGCLGIEALLAAGYEISAIFTHTDNPGEKAFYGSVARLAAERGIPVYAPDNVNHPLWVERIAQLSPDVIF
SFYYRHLIYDEILQLAPAGAFNLHGSLLPKYRGRAPLNWVLVNGETETGVTLHRMVKRADAGAIVAQLRIAIAPDDIAIT
LHHKLCHAARQLLEQTLPAIKHGNILEIAQRENEATCFGRRTPDDSFLEWHKPASVLHNMVRAVADPWPGAFSYVGNQKF
TVWSSRVHPHASKAQPGSVISVAPLLIACGDGALEIVTGQAGDGITMQGSQLAQTLGLVQGSRLNSQPACTARRRTRVLI
LGVNGFIGNHLTERLLREDHYEVYGLDIGSDAISRFLNHPHFHFVEGDISIHSEWIEYHVKKCDVVLPLVAIATPIEYTR
NPLRVFELDFEENLRIIRYCVKYRKRIIFPSTSEVYGMCSDKYFDEDHSNLIVGPVNKPRWIYSVSKQLLDRVIWAYGEK
EGLQFTLFRPFNWMGPRLDNLNAARIGSSRAITQLILNLVEGSPIKLIDGGKQKRCFTDIRDGIEALYRIIENAGNRCDG
EIINIGNPENEASIEELGEMLLASFEKHPLRHHFPPFAGFRVVESSSYYGKGYQDVEHRKPSIRNAHRCLDWEPKIDMQE
TIDETLDFFLRTVDLTDKPS
>O52325 ~~~arnA~~~Bifunctional polymyxin resistance protein ArnA~~~
MKAVIFAYHDMGCQGVQAVLDAGYEIAAIFTHADNPAENTFFGSVSRQAAELGIPVYAPDNVNHPIWVDRIAELAPDIIF
SFYYRNLLSEEILHLAPAGAFNLHGSLLPAYRGRAPLNWVLVNGESETGVTLHRMVKRADAGEIVASQRVAIAQDDVALT
LHHKLCQAARQLLNSILPTMKCGDIPSVPQRESDSTYYGRRRPEDGLIDWHKPVSTVHNLVRAVAAPWPGAFSYNGSQKF
TIWSSRMCPDAQGALPGSVISVSPLRVACADGALEIITGQAGDDITVQGSQLAQTLGLVAGARLNRPPATSGKRRIRVLI
LGVNGFIGNHLTERLLNEENYEVYGMDIGSNAISRFLLHPRFHFVEGDISIHSEWIEYHVKKCDVVLPLVAIATPIEYTR
NPLRVFELDFEENLRIIRYCVKYRKRVVFPSTSEVYGMCTDASFDEDKSNLIVGPVNKPRWIYSVSKQLLDRVIWAYGEK
EGLRFTLFRPFNWMGPRLDSLNAARIGSSRAITQLILNLVEGTPIKLIDGGQQKRCFTDIRDGIEALFRIIVNDGDRCDG
KIINIGNPDNEASIQELATLLLDSFDKHPLRCHFPPFAGFQVVESRSYYGKGYQDVAHRKPSIDNARRCLGWEPSIAMRD
TVEETLDFFLRSVDIAERAS
>Q93PD8 ~~~arnA~~~Bifunctional polymyxin resistance protein ArnA~~~
MKAIVFAYHDIGCVGLNALAEAGYDIQAVFTHTDNPGENRFFSSVARVAADLALPVFAPEDVNHPLWVERIRELQPDIIF
SFYYRNMLSDEILSLAPQGGFNLHGSLLPQYRGRAPINWVLVNGETETGVTLHQMVKKADAGPIAGQYKVAISDVDTALT
LHAKMRDAAQELLRNLLPRMKEGPLPLTPQKEADASYFGRRTAADGEIHWQKSAFTINNLVRAVTEPYPGAFSYLGQRKL
TIWRSRPLDLVHNKLPGTVLSTAPLTVACGEGALEIITGQGEAGLYVQGDRLAQEMGIVTDVRLGNKPSNTLKRRTRVLI
LGVNGFIGNHLTERLLQDDRYEVYGLDIGSDAISRFLGNPAFHFVEGDISIHSEWIEYHIKKCDVILPLVAIATPIEYTR
NPLRVFELDFEENLKIVRDCVKYNKRIVFPSTSEVYGMCDDKEFDEDTSRLIVGPINKQRWIYSVSKQLLDRVIWAYGVK
EGLKFTLFRPFNWMGPRLDNLDAARIGSSRAITQLILNLVEGSPIKLVDGGAQKRCFTDIHDGIEALFRIIENRDGCCDG
QIINIGNPTNEASIRELAEMLLTSFENHELRDHFPPFAGFKDIESSAYYGKGYQDVEYRTPSIKNARRILHWQPEIAMQQ
TVTETLDFFLRAAVIEKTAAPKDELNA
>P77690 2.6.1.87~~~arnB~~~UDP-4-amino-4-deoxy-L-arabinose--oxoglutarate aminotransferase~~~COG0399
MAEGKAMSEFLPFSRPAMGVEELAAVKEVLESGWITTGPKNQALEQAFCQLTGNQHAIAVSSATAGMHITLMALKIGKGD
EVITPSLTWVSTLNMISLLGATPVMVDVDRDTLMVTPEAIESAITPRTKAIIPVHYAGAPADIDAIRAIGERYGIAVIED
AAHAVGTYYKGRHIGAKGTAIFSFHAIKNITCAEGGLIVTDNENLARQLRMLKFHGLGVDAYDRQTWGRAPQAEVLTPGY
KYNLTDINAAIALTQLVKLEHLNTRRREIAQQYQQALAALPFQPLSLPAWPHVHAWHLFIIRVDEQRCGISRDALMEALK
ERGIGTGLHFRAAHTQKYYRERFPTLSLPNTEWNSERICSLPLFPDMTTADADHVITALQQLAGQ
>Q8ZNF3 2.6.1.87~~~arnB~~~UDP-4-amino-4-deoxy-L-arabinose--oxoglutarate aminotransferase~~~
MAEGKMMSDFLPFSRPAMGAEELAAVKTVLDSGWITTGPKNQELEAAFCRLTGNQYAVAVSSATAGMHIALMALGIGEGD
EVITPSMTWVSTLNMIVLLGANPVMVDVDRDTLMVTPEHIEAAITPQTKAIIPVHYAGAPADLDAIYALGERYGIPVIED
AAHATGTSYKGRHIGARGTAIFSFHAIKNITCAEGGIVVTDNPQFADKLRSLKFHGLGVDAWDRQSGGRAPQAEVLAPGY
KYNLPDLNAAIALAQLQKLDALNARRAAIAAQYHQAMADLPFQPLSLPSWEHIHAWHLFIIRVDEARCGITRDALMASLK
TKGIGTGLHFRAAHTQKYYRERFPTLTLPDTEWNSERICSLPLFPDMTESDFDRVITALHQIAGQ
>Q7BF87 2.6.1.87~~~arnB~~~UDP-4-amino-4-deoxy-L-arabinose--oxoglutarate aminotransferase~~~
MQSFLPFSRPAIGSEEINAVANVLGSGWITTGPQNHQLETDFCQIFGCKHAIAVCSATAGMHITLLALGIGPGDEVITPS
QTWVSTINMIVLLGAEPVMVDVDRDTLMVNAAAIEAAITPNTKAIIPVHYAGAPCDLDALRQISQRHGIPLIEDAAHAVG
TRYRDQWIGEQGTAIFSFHAIKNITCAEGGLVATDDDELAARVRRLKFHGLGVDAFDRQIQGRSPQAEVVEPGYKYNLSD
IHAAIAVVQLRRLPEINARRQALVASYHKALAHLPLQPLALPHYSHQHAWHLFMVRVDEERCGISRDQLMACLKDMGIGS
GLHFRAVHSQKYYRERYPHLCLPNTEWNSARLCTLPLFPDMLDSDIERVANALTTIIGSHRVTK
>P77757 2.4.2.53~~~arnC~~~Undecaprenyl-phosphate 4-deoxy-4-formamido-L-arabinose transferase~~~COG0463
MFEIHPVKKVSVVIPVYNEQESLPELIRRTTTACESLGKEYEILLIDDGSSDNSAHMLVEASQAENSHIVSILLNRNYGQ
HSAIMAGFSHVTGDLIITLDADLQNPPEEIPRLVAKADEGYDVVGTVRQNRQDSWFRKTASKMINRLIQRTTGKAMGDYG
CMLRAYRRHIVDAMLHCHERSTFIPILANIFARRAIEIPVHHAEREFGESKYSFMRLINLMYDLVTCLTTTPLRMLSLLG
SIIAIGGFSIAVLLVILRLTFGPQWAAEGVFMLFAVLFTFIGAQFIGMGLLGEYIGRIYTDVRARPRYFVQQVIRPSSKE
NE
>Q7N3Q6 2.4.2.53~~~arnC~~~Undecaprenyl-phosphate 4-deoxy-4-formamido-L-arabinose transferase~~~COG0463
MSFEQIKKVSVVIPIYNEEESLPLLLERTLAACKQLTQEYELILVDDGSSDKSAEILIQAAEQPENHIIAILLNRNYGQH
SAIMAGFNQVNGDLIITLDADLQNPPEEIPRLVKTAEQGYDVVGTRRANRQDSLFRKTASKIINAMITKATGRSMGDYGC
MLRAYRRHIVEAMLQCHERSTFIPILANTFARKTIEIDVAHAEREFGDSKYSFMKLINLMYDLLTCLTTAPLRLLSVVGS
VIAVSGFLLAVLLMVLRLIFGAIWAAEGVFTLFALLFIFIGAQFVAMGLLGEYIGRIYNDVRARPRYFIQKVVGDNKTND
NQEEY
>O52324 2.4.2.53~~~arnC~~~Undecaprenyl-phosphate 4-deoxy-4-formamido-L-arabinose transferase~~~
MFDAAPIKKVSVVIPVYNEQESLPELIRRTTTACESLGKAWEILLIDDGSSDSSAELMVKASQEADSHIISILLNRNYGQ
HAAIMAGFSHVSGDLIITLDADLQNPPEEIPRLVAKADEGFDVVGTVRQNRQDSLFRKSASKIINLLIQRTTGKAMGDYG
CMLRAYRRPIIDTMLRCHERSTFIPILANIFARRATEIPVHHAEREFGDSKYSFMRLINLMYDLVTCLTTTPLRLLSLLG
SVIAIGGFSLSVLLIVLRLALGPQWAAEGVFMLFAVLFTFIGAQFIGMGLLGEYIGRIYNDVRARPRYFVQQVIYPESTP
FTEESHQ
>Q93PD9 2.4.2.53~~~arnC~~~Undecaprenyl-phosphate 4-deoxy-4-formamido-L-arabinose transferase~~~
MSLNEPIKKVSIVIPVYNEQESLPALIDRTTAACKLLTQAYEIILVDDGSSDNSTELLTAAANDPDSHIIAILLNRNYGQ
HSAIMAGFNQVSGDLIITLDADLQNPPEEIPRLVHVAEEGYDVVGTVRANRQDSLFRKTASRMINMMIQRATGKSMGDYG
CMLRAYRRHIVEAMLHCHERSTFIPILANTFARRTTEITVHHAEREFGNSKYSLMRLINLMYDLITCLTTTPLRLLSLVG
SAIALLGFTFSVLLVALRLIFGPEWAGGGVFTLFAVLFMFIGAQFVGMGLLGEYIGRIYNDVRARPRYFVQKVVGAEQTE
NNQDVEK
>P76472 3.5.1.n3~~~arnD~~~Probable 4-deoxy-4-formamido-L-arabinose-phosphoundecaprenol deformylase ArnD~~~COG0726
MTKVGLRIDVDTFRGTREGVPRLLEILSKHNIQASIFFSVGPDNMGRHLWRLVKPQFLWKMLRSNAASLYGWDILLAGTA
WPGKEIGHANADIIREAAKHHEVGLHAWDHHAWQARSGNWDRQTMIDDIARGLRTLEEIIGQPVTCSAAAGWRADQKVIE
AKEAFHLRYNSDCRGAMPFRPLLESGNPGTAQIPVTLPTWDEVIGRDVKAEDFNGWLLNRILRDKGTPVYTIHAEVEGCA
YQHNFVDLLKRAAQEGVTFCPLSELLSETLPLGQVVRGNIAGREGWLGCQQIAGSR
>O52326 3.5.1.n3~~~arnD~~~Probable 4-deoxy-4-formamido-L-arabinose-phosphoundecaprenol deformylase ArnD~~~
MTKVGLRIDVDTLRGTREGVPRLLATLHRHGVQASFFFSVGPDNMGRHLWRLIKPRFLWKMLRSNAASLYGWDILLAGTA
WPGKNIGNANAGIIRETATYHETGLHAWDHHAWQTHSGHWSIRQLEEDIARGITALEAIIGKPVTCSAAAGWRADGRVVR
AKESFNLRYNSDCRGTTLFRPLLMPGQTGTPQIPVTLPTWDEVIGPAVQAQSFNTWIISRMLQDKGTPVYTIHAEVEGIV
HQPLFEDLLVRARDAGITFCPLGELLPASPESLPLGQIVRGHIPGREGWLGCQQAASAS
>Q7BF86 3.5.1.n3~~~arnD~~~Probable 4-deoxy-4-formamido-L-arabinose-phosphoundecaprenol deformylase ArnD~~~
MKQVGLRIDVDTYRGTQYGVPSLLTVLEKHDIRASFFFSVGPDNMGRHLWRLFRPRFLWKMLRSNAASLYGWDILLAGTA
WPGKKIAKDFGPLMKAAAMAGHEVGLHAWDHQGWQANVASWSQQQLTEQVQRGVDTLQQSIGQPISCSAAAGWRADERVL
AVKQQFDFSYNSDCRGTHPFRPLLPNGSLGSVQIPVTLPTYDEVVGGEVQAENFNDFIIDAILRDSGVSVYTIHAEVEGM
SQAAMFEQLLMRAKQQDIEFCPLSKLLPSDLQLLPVGKVIRAAFPGREGWLGCQSDIKDAE
>Q47377 ~~~arnE~~~Probable 4-amino-4-deoxy-L-arabinose-phosphoundecaprenol flippase subunit ArnE~~~COG2076
MIWLTLVFASLLSVAGQLCQKQATCFVAINKRRKHIVLWLGLALACLGLAMVLWLLVLQNVPVGIAYPMLSLNFVWVTLA
AVKLWHEPVSPRHWCGVAFIIGGIVILGSTV
>Q66A07 ~~~arnE~~~Probable 4-amino-4-deoxy-L-arabinose-phosphoundecaprenol flippase subunit ArnE~~~
MNSYLLLLMVSLLTCIGQLCQKQAAQCWEQPQARRLNLTLRWLAIAVVSLGLGMLLWLRLLQQLPLSVAYPMLSFNFVLV
TLAAQLFYGEKATLRHWLGVAAIIFGILLMSWHL
>P76474 ~~~arnF~~~Probable 4-amino-4-deoxy-L-arabinose-phosphoundecaprenol flippase subunit ArnF~~~COG2076
MGLMWGLFSVIIASVAQLSLGFAASHLPPMTHLWDFIAALLAFGLDARILLLGLLGYLLSVFCWYKTLHKLALSKAYALL
SMSYVLVWIASMVLPGWEGTFSLKALLGVACIMSGLMLIFLPTTKQRY
>Q93PD4 ~~~arnF~~~Probable 4-amino-4-deoxy-L-arabinose-phosphoundecaprenol flippase subunit ArnF~~~
MKGYLWGGASVVLVTVAQLVLKWGMMNIPLLSLADINVQFLTMYFVQLASVMCGLMGYALSMLCWFFALRYLPLNRAYPL
LSLSYALVYLGAVLLPWFNEPATLLKTLGAGFILLGIWLINIKPIKAS
>P76473 2.4.2.43~~~arnT~~~Undecaprenyl phosphate-alpha-4-amino-4-deoxy-L-arabinose arabinosyl transferase~~~COG1807
MKSVRYLIGLFAFIACYYLLPISTRLLWQPDETRYAEISREMLASGDWIVPHLLGLRYFEKPIAGYWINSIGQWLFGANN
FGVRAGVIFATLLTAALVTWFTLRLWRDKRLALLATVIYLSLFIVYAIGTYAVLDPFIAFWLVAGMCSFWLAMQAQTWKG
KSAGFLLLGITCGMGVMTKGFLALAVPVLSVLPWVATQKRWKDLFIYGWLAVISCVLTVLPWGLAIAQREPNFWHYFFWV
EHIQRFALDDAQHRAPFWYYVPVIIAGSLPWLGLLPGALYTGWKNRKHSATVYLLSWTIMPLLFFSVAKGKLPTYILSCF
ASLAMLMAHYALLAAKNNPLALRINGWINIAFGVTGIIATFVVSPWGPMNTPVWQTFESYKVFCAWSIFSLWAFFGWYTL
TNVEKTWPFAALCPLGLALLVGFSIPDRVMEGKHPQFFVEMTQESLQPSRYILTDSVGVAAGLAWSLQRDDIIMYRQTGE
LKYGLNYPDAKGRFVSGDEFANWLNQHRQEGIITLVLSVDRDEDINSLAIPPADAIDRQERLVLIQYRPK
>Q7N3Q9 2.4.2.43~~~arnT~~~Undecaprenyl phosphate-alpha-4-amino-4-deoxy-L-arabinose arabinosyl transferase~~~COG1807
MLNNRACKVGAFLMALFFVITYLLPLNGRLLWQPDETRYAEISREMLQRGDWIVPYLLDIRYFEKPVAGYWINNISQWIF
GDNNFAVRFGSVFCIFISAILLYRLAMMMWHNRHIAFATSLIYISMFLVFAIGTYSVLDPMFSLWVTAAMMCSFWGLKTD
CTRRRIMAYLVLGLCCGMGFMTKGFLALAVPVIVMLPIVIYQKRVLQIVCFGPLAIISAIAISLPWVIAIALREPDYWHY
FFWVEHIKRFSSDDAQHIAPFWYYIPILILGVIPWLGLLPGAVMKSWKERKSNPEMFFLLCWFVVPLLFFSIAKGKLPTY
ILPCMAPLAMMMAKFGVDCVKNGKMELLKINGMVNVFLGLLAVIVLFAMEVVTKHALYQPSEWLKWVLAIVAFGIWGIIG
YLCFALNGKYWLLAAFCSIVVSLVIGHALPENTVNSKLPQNFIKLHHQELAGSRYILSESVGLATSVAWEMKRSDIYMFE
RWGELEYGLNYPDSRYRYISYKDFPQWLAKARKEGRVSVLFHLYKDEKLPDLPKADQISRNYRFAILVYEKQP
>O52327 2.4.2.43~~~arnT~~~Undecaprenyl phosphate-alpha-4-amino-4-deoxy-L-arabinose arabinosyl transferase~~~
MKSIRYYLAFAAFIALYYVIPVNSRLLWQPDETRYAEISREMLASGDWIVPHFLGLRYFEKPIAGYWINSLGQWLFGATN
FGVRAGAILTTLLAAALVAWLTFRLWRDKRTALLASVIFLSLFAVYSIGTYAVLDPMIALWLTAGMCCFWQGMQATTRMG
KIGMFLLLGATCGLGVLTKGFLALAVPVVSVLPWVIVQKRWKDFLLYGWLAVLSCFVVVLPWAIAIARREADFWHYFFWV
EHIQRFAMSDAQHKAPFWYYLPVLLAGSLPWLGLLPGALKLGWRERNGAFYLLGWTIMPLLFFSIAKGKLPTYVLSCFAP
IAILMARFVLHNVKEGVAALRVNGGINLVFGLVGIVAAFVVSSWGPLKSPVWTHIETYKVFCVWGVFTVWAFVGWYSLCH
SQKYLLPAFCPLGLALLFGFSIPDRVMESKQPQFFVEMTQAPLASSRYILADNVGVAAGLAWSLKRDDIMLYGHAGELRY
GLSYPDVQDKFVKADDFNAWLNQHRQEGIITLVLSIAKDEDISALSLPPADNIDYQGRLVLIQYRPK
>Q66A06 2.4.2.43~~~arnT~~~Undecaprenyl phosphate-alpha-4-amino-4-deoxy-L-arabinose arabinosyl transferase~~~
MKLLKDSGAALLALFFVLVYLLPVNSRLLWQPDETRYAEISREMLQRGDWVVPYFMDIRYFEKPVAGYWFNNISQWIFGD
SNFAVRFGSIFSTALSAVLVYWLATLLWRNRSTSVLATLIYLSFLLVFGIGTYAVLDPMISLWLTAAMVSFYLTLKAENW
QQKVGAYALLGVACGMGFMTKGFLALAVPVIAVLPIVIQQKRIKDLVVFGPIAIVCAVLLSLPWALAIAQREPDFWNYFF
WVEHIQRFAEASAQHKSPIWYYLPILCIGVLPWLGLLPGALFKGWRERATKPELFFLLSWVVMPLLFFSVAKGKLPTYIL
PCMAPLSLLMAAYATDCANNIRMRALKINGVINLLFGVACALVIVVIGLGLVKDIVAYGPQENQKVWLGVLAFAGWGVTG
FITLRNNARNWRWAAACPLLFILLVGYLIPQQVVDSKQPQNFIKNNFSELSSSRYVLTDSVGVAAGLAWELKRSDILMFS
EKGELTYGLAYPDSQDNYISNDDFPTWLAQARKEGDVSLVVQLAKNEALPAHLPPADKVNLMNRLALLWYQKTP
>Q9KCA6 2.5.1.19~~~aroA1~~~3-phosphoshikimate 1-carboxyvinyltransferase 1~~~COG0128
MENKTVIPHAKGLKGTIKVPGDKSISHRAVMFGALAKGTTTVEGFLPGADCLSTISCFQKLGVSIEQAEERVTVKGKGWD
GLREPSDILDVGNSGTTTRLILGILSTLPFHSVIIGDESIGKRPMKRVTEPLKSMGAQIDGRDHGNLTPLSIRGGQLKGI
DFHSPVASAQMKSAILLAGLRAEGKTSVTEPAKTRDHTERMLEAFGVNIEKDGLTVSIEGGQMLTGQHVVVPGDISSAAF
FLVAGAMVPHSRITLTNVGINPTRAGILEVLKQMGATLAMENERVQGGEPVADLTIETSVLQGVEIGGDIIPRLIDEIPI
IAVLATQASGRTVIKDAEELKVKETNRIDTVVSELTKLGASIHATDDGMIIEGPTPLKGGVTVSSHGDHRIGMAMAIAAL
LAEKPVTVEGTEAIAVSYPSFFDHLDRLKSE
>P0A2Y5 2.5.1.19~~~aroA~~~3-phosphoshikimate 1-carboxyvinyltransferase~~~
MSHSASPKPATARRSEALTGEIRIPGDKSISHRSFMFGGLASGETRITGLLEGEDVINTGRAMQAMGAKIRKEGDVWIIN
GVGNGCLLQPEAALDFGNAGTGARLTMGLVGTYDMKTSFIGDASLSKRPMGRVLNPLREMGVQVEAADGDRMPLTLIGPK
TANPITYRVPMASAQVKSAVLLAGLNTPGVTTVIEPVMTRDHTEKMLQGFGADLTVETDKDGVRHIRITGQGKLVGQTID
VPGDPSSTAFPLVAALLVEGSDVTIRNVLMNPTRTGLILTLQEMGADIEVLNARLAGGEDVADLRVRASKLKGVVVPPER
APSMIDEYPVLAIAASFAEGETVMDGLDELRVKESDRLAAVARGLEANGVDCTEGEMSLTVRGRPDGKGLGGGTVATHLD
HRIAMSFLVMGLAAEKPVTVDDSNMIATSFPEFMDMMPGLGAKIELSIL
>Q9R4E4 2.5.1.19~~~aroA~~~3-phosphoshikimate 1-carboxyvinyltransferase~~~
MSHGASSRPATARKSSGLSGTVRIPGDKSISHRSFMFGGLASGETRITGLLEGEDVINTGKAMQAMGARIRKEGDTWIID
GVGNGGLLAPEAPLDFGNAATGCRLTMGLVGVYDFDSTFIGDASLTKRPMGRVLNPLREMGVQVKSEDGDRLPVTLRGPK
TPTPITYRVPMASAQVKSAVLLAGLNTPGITTVIEPIMTRDHTEKMLQGFGANLTVETDADGVRTIRLEGRGKLTGQVID
VPGDPSSTAFPLVAALLVPGSDVTILNVLMNPTRTGLILTLQEMGADIEVINPRLAGGEDVADLRVRSSTLKGVTVPEDR
APSMIDEYPILAVAAAFAEGATVMNGLEELRVKESDRLSAVANGLKLNGVDCDEGETSLVVRGRPDGKGLGNASGAAVAT
HLDHRIAMSFLVMGLVSENPVTVDDATMIATSFPEFMDLMAGLGAKIELSDTKAA
>Q482G5 2.5.1.19~~~aroA~~~3-phosphoshikimate 1-carboxyvinyltransferase~~~COG0128
MEQLTLNPIGKINGEIFLPGSKSLSNRALLIAALANGVTKITNLLVSDDINHMLNALKSLGIEYTLSDCGTECTVIGNGG
FFNAKKPLELYLGNAGTAMRPLCAALAASEGEFILTGEPRMKERPIGHLVDALAQLDADIEYLENKDYPPVKIKGKALTG
NTVTIDGSISSQFLTAILMIAPLLETNTTIEIDGELVSKPYIDITLDIMRRFNVSVQNNDYKSFIVNGKQSYQALDKYMV
EGDASSASYFLAAGAIKGGEVTVHGIGKLSVQGDKHFADVLEKMGAEIHWKDESITVIGKPLTAVDMDMNHIPDAAMTIA
TTALFATGTTTIRNIYNWRVKETDRLNAMATELRKVGAEVVEGKDYISITPPKSLKHAEIDTYNDHRVAMCFSLVALSDT
PVTINDPKCTAKTFPDYFDKLAQVSC
>Q83E11 2.5.1.19~~~aroA~~~3-phosphoshikimate 1-carboxyvinyltransferase~~~COG0128
MDYQTIPSQGLSGEICVPGDKSISHRAVLLAAIAEGQTQVDGFLMGADNLAMVSALQQMGASIQVIEDENILVVEGVGMT
GLQAPPEALDCGNSGTAIRLLSGLLAGQPFNTVLTGDSSLQRRPMKRIIDPLTLMGAKIDSTGNVPPLKIYGNPRLTGIH
YQLPMASAQVKSCLLLAGLYARGKTCITEPAPSRDHTERLLKHFHYTLQKDKQSICVSGGGKLKANDISIPGDISSAAFF
IVAATITPGSAIRLCRVGVNPTRLGVINLLKMMGADIEVTHYTEKNEEPTADITVRHARLKGIDIPPDQVPLTIDEFPVL
LIAAAVAQGKTVLRDAAELRVKETDRIAAMVDGLQKLGIAAESLPDGVIIQGGTLEGGEVNSYDDHRIAMAFAVAGTLAK
GPVRIRNCDNVKTSFPNFVELANEVGMNVKGVRGRGGF
>P0A6D3 2.5.1.19~~~aroA~~~3-phosphoshikimate 1-carboxyvinyltransferase~~~COG0128
MESLTLQPIARVDGTINLPGSKSVSNRALLLAALAHGKTVLTNLLDSDDVRHMLNALTALGVSYTLSADRTRCEIIGNGG
PLHAEGALELFLGNAGTAMRPLAAALCLGSNDIVLTGEPRMKERPIGHLVDALRLGGAKITYLEQENYPPLRLQGGFTGG
NVDVDGSVSSQFLTALLMTAPLAPEDTVIRIKGDLVSKPYIDITLNLMKTFGVEIENQHYQQFVVKGGQSYQSPGTYLVE
GDASSASYFLAAAAIKGGTVKVTGIGRNSMQGDIRFADVLEKMGATICWGDDYISCTRGELNAIDMDMNHIPDAAMTIAT
AALFAKGTTTLRNIYNWRVKETDRLFAMATELRKVGAEVEEGHDYIRITPPEKLNFAEIATYNDHRMAMCFSLVALSDTP
VTILDPKCTAKTFPDYFEQLARISQAA
>P9WPY5 2.5.1.19~~~aroA~~~3-phosphoshikimate 1-carboxyvinyltransferase~~~COG0128
MKTWPAPTAPTPVRATVTVPGSKSQTNRALVLAALAAAQGRGASTISGALRSRDTELMLDALQTLGLRVDGVGSELTVSG
RIEPGPGARVDCGLAGTVLRFVPPLAALGSVPVTFDGDQQARGRPIAPLLDALRELGVAVDGTGLPFRVRGNGSLAGGTV
AIDASASSQFVSGLLLSAASFTDGLTVQHTGSSLPSAPHIAMTAAMLRQAGVDIDDSTPNRWQVRPGPVAARRWDIEPDL
TNAVAFLSAAVVSGGTVRITGWPRVSVQPADHILAILRQLNAVVIHADSSLEVRGPTGYDGFDVDLRAVGELTPSVAALA
ALASPGSVSRLSGIAHLRGHETDRLAALSTEINRLGGTCRETPDGLVITATPLRPGIWRAYADHRMAMAGAIIGLRVAGV
EVDDIAATTKTLPEFPRLWAEMVGPGQGWGYPQPRSGQRARRATGQGSGG
>P0A2Y4 2.5.1.19~~~aroA~~~3-phosphoshikimate 1-carboxyvinyltransferase~~~
MSHSASPKPATARRSEALTGEIRIPGDKSISHRSFMFGGLASGETRITGLLEGEDVINTGRAMQAMGAKIRKEGDVWIIN
GVGNGCLLQPEAALDFGNAGTGARLTMGLVGTYDMKTSFIGDASLSKRPMGRVLNPLREMGVQVEAADGDRMPLTLIGPK
TANPITYRVPMASAQVKSAVLLAGLNTPGVTTVIEPVMTRDHTEKMLQGFGADLTVETDKDGVRHIRITGQGKLVGQTID
VPGDPSSTAFPLVAALLVEGSDVTIRNVLMNPTRTGLILTLQEMGADIEVLNARLAGGEDVADLRVRASKLKGVVVPPER
APSMIDEYPVLAIAASFAEGETVMDGLDELRVKESDRLAAVARGLEANGVDCTEGEMSLTVRGRPDGKGLGGGTVATHLD
HRIAMSFLVMGLAAEKPVTVDDSNMIATSFPEFMDMMPGLGAKIELSIL
>P63585 2.5.1.19~~~aroA~~~3-phosphoshikimate 1-carboxyvinyltransferase~~~
MVSEQIIDISGPLKGEIEVPGDKSMTHRAIMLASLAEGTSNIYKPLLGEDCRRTMDIFRLLGVDIKEDEDKLVVNSPGYK
AFKTPHQVLYTGNSGTTTRLLAGLLSGLGIESVLSGDVSIGKRPMDRVLRPLKLMDANIEGIEDNYTPLIIKPSVIKGIN
YQMEVASAQVKSAILFASLFSNDTTVIKELDVSRNHTETMFRHFNIPIEAERLSITTTPDAIQHIKPADFHVPGDISSAA
FFIVAALITPESDVTIHNVGINPTRSGIIDIVEKMGGNIQLFNQTTGAEPTASIRIQYTPMLQPITIEGELVPKAIDELP
VIALLCTQAVGTSTIKDAEELKVKETNRIDTTADMLNLLGFELQPTNDGLIIHPSEFKTNATVDSLTDHRIGMMLAVASL
LSSEPVKIKQFDAVNVSFPGFLPKLKLLENEG
>Q9S400 2.5.1.19~~~aroA~~~3-phosphoshikimate 1-carboxyvinyltransferase~~~COG0128
MKLKTNIRHLHGSIRVPGDKSISHRSIIFGSLAEGETKVYDILRGEDVLSTMQVFRDLGVEIEDKDGVITIQGVGMAGLK
APQNALNMGNSGTSIRLISGVLAGADFEVEMFGDDSLSKRPMDRVTLPLKKMGVSISGQTERDLPPLRLKGTKNLRPIHY
ELPIASAQVKSALMFAALQAKGESVIIEKEYTRNHTEDMLKQFGGHLSVDGKKITVQGPQKLTGQKVVVPGDISSAAFWL
VAGLIAPNSRLVLQNVGINETRTGIIDVIRAMGGKLEITEIDPVAKSATLIVESSDLKGTEIGGALIPRLIDELPIIALL
ATQAQGVTVIKDAEELKVKETDRIQVVADALNSMGADITPTADGMIIKGKSALHGARVNTFGDHRIGMMTAIAALLVADG
EVELDRAEAINTSYPSFFDDLESLIHG
>Q9KRB0 2.5.1.19~~~aroA~~~3-phosphoshikimate 1-carboxyvinyltransferase~~~COG0128
MESLTLQPIELISGEVNLPGSKSVSNRALLLAALASGTTRLTNLLDSDDIRHMLNALTKLGVNYRLSADKTTCEVEGLGQ
AFHTTQPLELFLGNAGTAMRPLAAALCLGQGDYVLTGEPRMKERPIGHLVDALRQAGAQIEYLEQENFPPLRIQGTGLQA
GTVTIDGSISSQFLTAFLMSAPLAQGKVTIKIVGELVSKPYIDITLHIMEQFGVQVINHDYQEFVIPAGQSYVSPGQFLV
EGDASSASYFLAAAAIKGGEVKVTGIGKNSIQGDIQFADALEKMGAQIEWGDDYVIARRGELNAVDLDFNHIPDAAMTIA
TTALFAKGTTAIRNVYNWRVKETDRLAAMATELRKVGATVEEGEDFIVITPPTKLIHAAIDTYDDHRMAMCFSLVALSDT
PVTINDPKCTSKTFPDYFDKFAQLSR
>P07639 4.2.3.4~~~aroB~~~3-dehydroquinate synthase~~~COG0337
MERIVVTLGERSYPITIASGLFNEPASFLPLKSGEQVMLVTNETLAPLYLDKVRGVLEQAGVNVDSVILPDGEQYKSLAV
LDTVFTALLQKPHGRDTTLVALGGGVVGDLTGFAAASYQRGVRFIQVPTTLLSQVDSSVGGKTAVNHPLGKNMIGAFYQP
ASVVVDLDCLKTLPPRELASGLAEVIKYGIILDGAFFNWLEENLDALLRLDGPAMAYCIRRCCELKAEVVAADERETGLR
ALLNLGHTFGHAIEAEMGYGNWLHGEAVAAGMVMAARTSERLGQFSSAETQRIITLLKRAGLPVNGPREMSAQAYLPHML
RDKKVLAGEMRLILPLAIGKSEVRSGVSHELVLNAIADCQSA
>Q5NFS1 4.2.3.4~~~aroB~~~3-dehydroquinate synthase~~~COG0337
MISKLSVNPTFSPSYNIIVDSVLDFSHILEYVTNKQVLVVTNTTVAKLYLTKFLAALVDDLDVRTCILEDGEQYKSQQSL
DKILSTLLENHFTRNSTVLVALGGGVIGDITGFAAAIYQRGIDFIQIPTTLLSQVDSSVGGKTAINHQLGKNMIGAFYQP
KVVYTSIEFYKTLPQREYIAGMAEVVKYAFISKDFYLWLDSNRDKILAKDSVTLIEMVKRSCQIKAQVVAMDEKELTGAR
AILNFGHTFGHAIEKCQNYRGLKHGEAVGVGMAQAIDFSHYLGLISQQQAKDFNDFIVSFGISIDFPNDICQKEFLEAML
LDKKNSNKELKFILIENIGSLSLQKQSKNELEQFLDISR
>P56081 4.2.3.4~~~aroB~~~3-dehydroquinate synthase~~~COG0337
MQEILIPLKEKNYKVFLGELPEIKLKQKALIISDSIVAGLHLPYLLERLKALEVRVCVIESGEKYKNFHSLERILNNAFE
MQLNRHSLMIALGGGVISDMVGFASSIYFRGIDFINIPTTLLAQVDASVGGKTGINTPYGKNLIGSFHQPKAVYMDLAFL
KTLEKREFQAGVAEIIKMAVCFDKNLVERLETKDLKDCLEEVIFQSVNIKAQVVVQDEKEQNIRAGLNYGHTFGHAIEKE
TDYERFLHGEAIAIGMRMANDLALSLGMLTLKEYERIENLLKKFDLIFHYKILDLQKFYERLFLDKKSENKTIKFILPKG
VGAFEVASHIPKETIIKVLEKWH
>P9WPX9 4.2.3.4~~~aroB~~~3-dehydroquinate synthase~~~COG0337
MTDIGAPVTVQVAVDPPYPVVIGTGLLDELEDLLADRHKVAVVHQPGLAETAEEIRKRLAGKGVDAHRIEIPDAEAGKDL
PVVGFIWEVLGRIGIGRKDALVSLGGGAATDVAGFAAATWLRGVSIVHLPTTLLGMVDAAVGGKTGINTDAGKNLVGAFH
QPLAVLVDLATLQTLPRDEMICGMAEVVKAGFIADPVILDLIEADPQAALDPAGDVLPELIRRAITVKAEVVAADEKESE
LREILNYGHTLGHAIERRERYRWRHGAAVSVGLVFAAELARLAGRLDDATAQRHRTILSSLGLPVSYDPDALPQLLEIMA
GDKKTRAGVLRFVVLDGLAKPGRMVGPDPGLLVTAYAGVCAP
>Q6GGU4 4.2.3.4~~~aroB~~~3-dehydroquinate synthase~~~
MKLQTTYPSNNYPIFVEHGAIDHISTYIDQFDQSFILIDEHVNQYFADKFNDILSYENVHKVIIPAGEKTKTFEQYQETL
EYILSHHVTRNTAIIAVGGGATGDFAGFVAATLLRGVHFIQVPTTILAHDSSVGGKVGINSKQGKNLIGAFYRPTAVIYD
LDFLKTLPFEQILSGYAEVYKHALLNGESATQDIEQHFKDREILQSLKGMDKYIAKGIETKLDIVVADEKEQGVRKFLNL
GHTFGHAVEYYHKIPHGHAVMVGIIYQFIVANALFDSKHDINHYIQYLIQLGYPLDMITDLDFETLYQYMLSDKKNDKQG
VQMVLIRQFGDIVVQHVDQLTLQHACEQLKTYFK
>Q9KNV2 4.2.3.4~~~aroB~~~3-dehydroquinate synthase~~~COG0337
MERITVNLGERSYPISIGAGLFANPALLSLSAKQKVVIVTNHTVAPLYAPAIISLLDHIGCQHALLELPDGEQYKTLETF
NTVMSFLLEHNYSRDVVVIALGGGVIGDLVGFAAACYQRGVDFIQIPTTLLSQVDSSVGGKTAVNHPLGKNMIGAFYQPK
AVVIDTDCLTTLPAREFAAGMAEVIKYGIIYDSAFFDWLEAQMEALYALDEQALTYAIARCCQIKAEVVAQDEKESGIRA
LLNLGHTFGHAIEAHMGYGNWLHGEAVSAGTVMAAKTAQLQGLIDASQFERILAILKKAHLPVRTPENMTFADFMQHMMR
DKKVLAGELRLVLPTSIGTSAVVKGVPEAVIAQAIEYCRTV
>B0VDX7 4.2.3.5~~~aroC~~~Chorismate synthase~~~
MAGNSIGQLFRVTTCGESHGVGLMAIVDGVPPGLALTEEDLQKDLDRRKPGTSKFATQRKEPDQVEIISGVFEGKTTGTP
IGLLIRNTDQKSKDYGNIAQTFRPGHADYTYTQKYGFRDYRGGGRSSARETAMRVAAGAIAKKYLAEKFGVLIRGHVTQI
GNEVAEKLDWNEVPNNPFFCGDVDAVPRFEALVTSLREQGTSCGAKLEILAEKVPVGWGEPVFDRLDADIAHAMMSINAV
KGVEIGDGFAVAGQFGHETRDELTSHGFLANHAGGILGGISSGQTIRVAIALKPTASITTPGKTINLNREDTDVLTKGRH
DPCVGVRATPIAEAMLAIVLMDHFLRHRAQNADVVPPFAPIEP
>O66493 4.2.3.5~~~aroC~~~Chorismate synthase~~~COG0082
MSLRYLRFLTAGESHGKGLTAILEGIPANLPLSEEEINHELRRRQRGYGRGGRMKIEKDTAEILSGVRFGKTLGSPIALF
IRNRDWENWKEKMAIEGEPSPSVVPFTRPRPGHADLSGGIKYNQRDLRNILERASARETAARVAVGAVCKKFLSEFGIKI
GSFVVSIGQKEVEELKDKSYFANPEKLLSYHEKAEDSELRIPFPEKDEEFKTYIDEVKEKGESLGGVFEVFALNVPPGLG
SHIQWDRRIDGRIAQAMMSIQAIKGVEIGLGFEAARRFGSQVHDEIGWSEGKGYFRHSNNLGGTEGGITNGMPIVVRVAM
KPIPTLKNPLRSVDIETKEEMKAGKERTDIVAVPAASVVGEAMLAIVLADALLEKLGGDFMEEVKKRFEDYVNHVKSF
>P31104 4.2.3.5~~~aroC~~~Chorismate synthase~~~COG0082
MRYLTAGESHGPQLTTIIEGVPAGLYITEEDINFELARRQKGHGRGRRMQIEKDQAKIMSGVRHARTLGSPIALVVENND
WKHWTKIMGAAPITEDEEKEMKRQISRPRPGHADLNGAIKYNHRDMRNVLERSSARETTVRVAAGAVAKKILSELGIKVA
GHVLQIGAVKAEKTEYTSIEDLQRVTEESPVRCYDEEAGKKMMAAIDEAKANGDSIGGIVEVIVEGMPVGVGSYVHYDRK
LDSKLAAAVLSINAFKGVEFGIGFEAAGRNGSEVHDEIIWDEEKGYTRATNRLGGLEGGMTTGMPIVVRGVMKPIPTLYK
PLKSVDIETKEPFSASIERSDSCAVPAASVVAEAAVAWEIANAVVEQFGLDQIDRIRENVENMRKLSREF
>B7GS97 4.2.3.5~~~aroC~~~Chorismate synthase~~~
MLRWQTAGESHGEALVAMIEGLPAGVRISTDDIVSALARRRLGYGRGARMKFEQDKVRLLTGVRHGLTLGSPVAIEIANT
EWPKWTEVMSADALDHDLPREGRNAPLSRPRPGHADLTGMRKYGFDDARPVLERSSARETASRVALGEVAKQFLDQAFGI
RTVAHVVALGGVQTNPDLPLPTPDDLEALDASPVRTLDKEAEVRIIERINEAKKAADTLGGVIEVLAYGVPAGIGTYVES
DRRLDAALASAIMGIQAFKGVEIGDGFLAASRPGSQAHDEIVVNADGRIDRLSNRAGGIEGGMSNGQVIRVRGAMKPIPS
IPKALRTVDVLTGESAQAINQRSDSTAVPAASVVAEAMVRLTLAKYALDKFGGDSVAETRRNLESYLASWPEHMR
>Q9PM41 4.2.3.5~~~aroC~~~Chorismate synthase~~~COG0082
MNTFGTRLKFTSFGESHGVAVGCIIDGMPAGVKFDEEFLQNELDKRKGGSKFATPRKESDKAQVLSGVFEGYTTGHPIAI
VVFNENAHSKDYDNLKDLFRPAHADFTYFYKYGIRDHRGGGRSSARESVARVAGGAVAAMLLREFDICVQSGVFGVGTFV
SNLKEEEFDFEFAKKSEIFCLDPKLESDFKNEILNARNSKDSVGAAVFTKVSGMLIGLGEVLYDKLDSKLAHALMGINAV
KAVEIGEGINASKMRGSCNNDALKDGKFLSNHSGGILGGISNGENLILKTYFKPTPSIFAKQESIDKFGNNLKFELKGRH
DPCVGVRGSVVASAMVRLVLADCLLLNASANLNNLKNAYGLK
>P12008 4.2.3.5~~~aroC~~~Chorismate synthase~~~COG0082
MAGNTIGQLFRVTTFGESHGLALGCIVDGVPPGIPLTEADLQHDLDRRRPGTSRYTTQRREPDQVKILSGVFEGVTTGTS
IGLLIENTDQRSQDYSAIKDVFRPGHADYTYEQKYGLRDYRGGGRSSARETAMRVAAGAIAKKYLAEKFGIEIRGCLTQM
GDIPLDIKDWSQVEQNPFFCPDPDKIDALDELMRALKKEGDSIGAKVTVVASGVPAGLGEPVFDRLDADIAHALMSINAV
KGVEIGDGFDVVALRGSQNRDEITKDGFQSNHAGGILGGISSGQQIIAHMALKPTSSITVPGRTINRFGEEVEMITKGRH
DPCVGIRAVPIAEAMLAIVLMDHLLRQRAQNADVKTDIPRW
>P56122 4.2.3.5~~~aroC~~~Chorismate synthase~~~COG0082
MNTLGRFLRLTTFGESHGDVIGGVLDGMPSGIKIDYALLENEMKRRQGGRNVFITPRKEDDKVEITSGVFEDFSTGTPIG
FLIHNQRARSKDYDNIKNLFRPSHADFTYFHKYGIRDFRGGGRSSARESAIRVAAGAFAKMLLREIGIVCESGIIEIGGI
KAKNYDFNHALKSEIFALDEEQEEAQKTAIQNAIKNHDSIGGVALIRARSIKTNQKLPIGLGQGLYAKLDAKIAEAMMGL
NGVKAVEIGKGVESSLLKGSEYNDLMDQKGFLSNRSGGVLGGMSNGEEIIVRVHFKPTPSIFQPQRTIDINGNECECLLK
GRHDPCIAIRGSVVCESLLALVLADMVLLNLTSKIEYLKTIYNEN
>P9WPY1 4.2.3.5~~~aroC~~~Chorismate synthase~~~COG0082
MLRWITAGESHGRALVAVVEGMVAGVHVTSADIADQLARRRLGYGRGARMTFERDAVTVLSGIRHGSTLGGPIAIEIGNT
EWPKWETVMAADPVDPAELADVARNAPLTRPRPGHADYAGMLKYGFDDARPVLERASARETAARVAAGTVARAFLRQALG
VEVLSHVISIGASAPYEGPPPRAEDLPAIDASPVRAYDKAAEADMIAQIEAAKKDGDTLGGVVEAVALGLPVGLGSFTSG
DHRLDSQLAAAVMGIQAIKGVEIGDGFQTARRRGSRAHDEMYPGPDGVVRSTNRAGGLEGGMTNGQPLRVRAAMKPISTV
PRALATVDLATGDEAVAIHQRSDVCAVPAAGVVVETMVALVLARAALEKFGGDSLAETQRNIAAYQRSVADREAPAARVS
G
>B7UV40 4.2.3.5~~~aroC~~~Chorismate synthase~~~
MSGNTYGKLFTVTTAGESHGPALVAIVDGCPPGLELSARDLQRDLDRRKPGTSRHTTQRQEADEVEILSGVFEGKTTGTP
IGLLIRNTDQKSKDYSAIKDLFRPAHADYTYHHKYGVRDYRGGGRSSARETAMRVAAGAIAKKYLAGLGIQVRGYMSQLG
PIEIPFRSWDSVEQNAFFSPDPDKVPELEAYMDQLRRDQDSVGAKITVVAEGVPPGLGEPIFDRLDAELAHALMSINAVK
GVEIGAGFASIAQRGTEHRDELTPQGFLSNNAGGILGGISSGQPIVAHLALKPTSSITTPGRSIDTAGEPVDMITKGRHD
PCVGIRATPIAEAMMAIVLLDQLLRQRGQNADVRVDTPVLPQL
>Q59803 4.2.3.5~~~aroC~~~Chorismate synthase~~~
MRYLTSGESHGPQLTVIVEGVPANLEVKVEDINKEMFKRQGGYGRGRRMQIEKDTVEIVSGVRNGYTLGSPITMVVTNDD
FTHWRKIMGRAPISDEERENMKRTITKPRPGHADLLGGMKYNHRDLRNVLERSSARETAARVAVGALCKVLLEQLDIEIY
SRVVEIGGIKDKDFYDSETFKANLDRNDVRVIDDGIAQAMRDKIDEAKTDGDSIGGVVQVVVENMPVGVGSYVHYDRKLD
GRIAQGVVSINAFKGVSFGEGFKAAEKPGSEIQDEILYNTELGYYRGSNHLGGLEGGMSNGMPIIVNGVMKPIPTLYKPL
NSVDINTKEDFKATIERSDSCAVPAASIVCEHVVAFAIAKALLEEFQSNHIEQLKQQIIERRQLNIEF
>P0A2Y6 4.2.3.5~~~aroC~~~Chorismate synthase~~~COG0082
MRYLTAGESHGPRLTAIIEGIPAGLPLTAEDINEDLRRRQGGYGRGGRMKIENDQVVFTSGVRHGKTTGAPITMDVINKD
HQKWLDIMSAEDIEDRLKSKRKITHPRPGHADLVGGIKYRFDDLRNSLERSSARETTMRVAVGAVAKRLLAELDMEIANH
VVVFGGKEIDVPENLTVAEIKQRAAQSEVSIVNQEREQEIKDYIDQIKRDGDTIGGVVETVVGGVPVGLGSYVQWDRKLD
ARLAQAVVSINAFKGVEFGLGFEAGYRKGSQVMDEILWSKEDGYTRRTNNLGGFEGGMTNGQPIVVRGVMKPIPTLYKPL
MSVDIETHEPYKATVERSDPTALPAAGMVMEAVVATVLAQEILEKFSSDNLEELKEAVAKHRDYTKNY
>O66440 4.2.1.10~~~aroD~~~3-dehydroquinate dehydratase~~~COG0710
MLIAVPLDDTNFSENLKKAKEKGADIVELRVDQFSDTSLNYVKEKLEEVHSQGLKTILTIRSPEEGGREVKNREELFEEL
SPLSDYTDIELSSRGLLVKLYNITKEAGKKLIISYHNFELTPPNWIIREVLREGYRYGGIPKIAVKANSYEDVARLLCIS
RQVEGEKILISMGDYGKISRLAGYVFGSVITYCSLEKAFAPGQIPLEEMVELRKKFYRL
>Q186A6 4.2.1.10~~~aroD~~~3-dehydroquinate dehydratase~~~COG0710
MKRKVQVKNITIGEGRPKICVPIIGKNKKDIIKEAKELKDACLDIIEWRVDFFENVENIKEVKEVLYELRSYIHDIPLLF
TFRSVVEGGEKLISRDYYTTLNKEISNTGLVDLIDVELFMGDEVIDEVVNFAHKKEVKVIISNHDFNKTPKKEEIVSRLC
RMQELGADLPKIAVMPQNEKDVLVLLEATNEMFKIYADRPIITMSMSGMGVISRLCGEIFGSALTFGAAKSVSAPGQISF
KELNSVLNLLHKSIN
>P05194 4.2.1.10~~~aroD~~~3-dehydroquinate dehydratase~~~COG0710
MKTVTVKDLVIGTGAPKIIVSLMAKDIASVKSEALAYREADFDILEWRVDHYADLSNVESVMAAAKILRETMPEKPLLFT
FRSAKEGGEQAISTEAYIALNRAAIDSGLVDMIDLELFTGDDQVKETVAYAHAHDVKVVMSNHDFHKTPEAEEIIARLRK
MQSFDADIPKIALMPQSTSDVLTLLAATLEMQEQYADRPIITMSMAKTGVISRLAGEVFGSAATFGAVKKASAPGQISVN
DLRTVLTILHQA
>P36923 4.2.1.10~~~aroD~~~3-dehydroquinate dehydratase~~~COG0710
MKPVIVKNVRIGEGNPKIVVPIVAPTAEDILAEATASQTLDCDLVEWRLDYYENVADFSDVCNLSQQVMERLGQKPLLLT
FRTQKEGGEMAFSEENYFALYHELVKKGALDLLDIELFANPLAADTLIHEAKKAGIKIVLCNHDFQKTPSQEEIVARLRQ
MQMRQADICKIAVMPQDATDVLTLLSATNEMYTHYASVPIVTMSMGQLGMISRVTGQLFGSALTFGSAQQASAPGQLSVQ
VLRNYLKTFEQNK
>Q5KY94 4.2.1.10~~~aroD~~~3-dehydroquinate dehydratase~~~COG0710
MNISPKAIKVRNIWIGGTEPCICAPVVGEDDRKVLREAEEVCRKQPDLLEWRADFFRAIDDQERVLATANGLRNIAGEIP
ILFTIRSEREGGQPIPLNEAEVRRLIEAICRSGAIDLVDYELAYGERIADVRRMTEECSVWLVVSRHYFDGTPRKETLLA
DMRQAERYGADIAKVAVMPKSPEDVLVLLQATEEARRELAIPLITMAMGGLGAITRLAGWLFGSAVTFAVGNQSSAPGQI
PIDDVRTVLSILQTYSR
>P24670 4.2.1.10~~~aroD~~~3-dehydroquinate dehydratase~~~COG0710
MKTVTVKNLIIGEGMPKIIVSLMGRDINSVKAEALAYREATFDILEWRVDHFMDIASTQSVLTAARVIRDAMPDIPLLFT
FRSAKEGGEQTITTQHYLTLNRAAIDSGLVDMIDLELFTGDADVKATVDYAHAHNVYVVMSNHDFHQTPSAEEMVLRLRK
MQALGADIPKIAVMPQSKHDVLTLLTATLEMQQHYADRPVITMSMAKEGVISRLAGEVFGSAATFGAVKQASAPGQIAVN
DLRSVLMILHNA
>P58687 4.2.1.10~~~aroD~~~3-dehydroquinate dehydratase~~~
MKTVTVRDLVVGEGAPKIIVSLMGKTITDVKSEALAYREADFDILEWRVDHFANVTTAESVLEAAGAIREIITDKPLLFT
FRSAKEGGEQALTTGQYIDLNRAAVDSGLVDMIDLELFTGDDEVKATVGYAHQHNVAVIMSNHDFHKTPAAEEIVQRLRK
MQELGADIPKIAVMPQTKADVLTLLTATVEMQERYADRPIITMSMSKTGVISRLAGEVFGSAATFGAVKKASAPGQISVA
DLRTVLTILHQA
>O87007 4.2.1.10~~~aroD~~~3-dehydroquinate dehydratase~~~
MKTVTVKDLVIGAGAPKIIVSLMAKDIARVKSEALAYRETDFDILEWRVDHFADLSNVESVMAAAKILRETMPEKPLLFT
FRSAKEGGEQAISTEAYIALNRAAIDSGLVDMIDLELFTGDDQVKETVAYAHAHDVKVVMSNHDFHKTPEAEEIIARLRK
MQSFDADIPKIALMPQSTSDVLTLLAATLEMQEQYADRPIITMSMAKTGVISRLAGEVFGSAATFGAVKKASAPGQISVN
DLRILLTILHQA
>Q2G002 4.2.1.10~~~aroD~~~3-dehydroquinate dehydratase~~~COG0710
MTHVEVVATIAPQLSIEETLIQKINHRIDAIDVLELRIDQIENVTVDQVAEMITKLKVMQDSFKLLVTYRTKLQGGYGQF
TNDSYLNLISDLANINGIDMIDIEWQADIDIEKHQRIITHLQQYNKEVVISHHNFESTPPLDELQFIFFKMQKFNPEYVK
LAVMPHNKNDVLNLLQAMSTFSDTMDCKVVGISMSKLGLISRTAQGVFGGALTYGCIGVPQAPGQIDVTDLKAQVTLY
>Q2YWJ9 4.2.1.10~~~aroD~~~3-dehydroquinate dehydratase~~~
MTHVEVVATIAPQLYIEETLIQKINHRIDAIDVLELRIDQIENVTVNQVAEMITKLKVMQDSFKLLVTYRTKLQGGYGQF
TNDLYLNLISDLANINGIDMIDIEWQADIDIEKHQRIITHLQQYNKEVVISHHNFESTPPLDELQFIFFKMQKFNPEYVK
LAVMPHNKNDVLNLLQAMSTFSDTMDCKVVGISMSKLGLISRTAQGVFGGALTYGCIGEPQAPGQIDVTDLKAQVTLY
>Q6GII7 4.2.1.10~~~aroD~~~3-dehydroquinate dehydratase~~~
MTHVEVVATITPQLYIEETLIQKINHRIDAIDVLELRIDQFENVTVDQVAEMITKLKVMQDSFKLLVTYRTKLQGGYGQF
TNDSYLNLISDLANINGIDMIDIEWQADIDIEKHQRIITHLQQYNKEVIISHHNFESTPPLDELQFIFFKMQKFNPEYVK
LAVMPHNKNDVLNLLQAMSTFSDTMDCKVVGISMSKLGLISRTAQGVFGGALTYGCIGEPQAPGQIDVTDLKAQVTLY
>Q8DUW4 4.2.1.10~~~aroD~~~3-dehydroquinate dehydratase~~~COG0710
MKIVVPVMPQNIEEANQLDLTRIDSTDIIEWRADYLVKDDILTVAPAIFEKFSGHEVIFTLRTEKEGGNISLSNEDYLAI
IRDIAALYQPDYIDFEYFSYRDVLEEMYDFSNLILSYHNFEETPENLMEVFSELTALAPRVVKIAVMPKNEQDVLDLMNY
TRGFKTLNPNQEYVTMSMSKLGRISRLAADLIGSSWTFASLEQESAPGQISLADMRKIKEVLDAN
>P63590 4.2.1.10~~~aroD~~~3-dehydroquinate dehydratase~~~
MRIVAPVMPRHFDEAQAIDISKYEDVNLIEWRADFLPKDEIVAVAPAIFEKFAGKEIIFTLRTVQEGGNITLSSQEYVDI
IKEINAIYNPDYIDFEYFTHKSVFQEMLDFPNLILSYHNFEETPENLMEAFSEMTKLAPRVVKIAVMPQSEQDVLDLMNY
TRGFKTLNPEQEFATISMGKLGRLSRFAGDVIGSSWTYVSLDHVSGPGQVTLNDMKRIIEVLEMDISN
>P63588 4.2.1.10~~~aroD~~~3-dehydroquinate dehydratase~~~COG0710
MKLIVSVMPRSLEEAQALDATRYLDADIIEWRADYLPKEAILQVAPAIFEKFAGRELVFTLRTRSEGGEIDLSPEEYIHL
IKEVAQLYQPDYIDFEYYSYKDVFEEMLDFPNLVLSYHNFQETPENMMEILSELTILNPKLVKVAVMAHTEQDVLDLMNY
TRGFKTLNPEQEYVTISMGKVGKVSRITADVTGSSWSFASLDEVSAPGQISLASMKKIREILDEA
>O67049 1.1.1.25~~~aroE~~~Shikimate dehydrogenase (NADP(+))~~~COG0169
MINAQTQLYGVIGFPVKHSLSPVFQNALIRYAGLNAVYLAFEINPEELKKAFEGFKALKVKGINVTVPFKEEIIPLLDYV
EDTAKEIGAVNTVKFENGKAYGYNTDWIGFLKSLKSLIPEVKEKSILVLGAGGASRAVIYALVKEGAKVFLWNRTKEKAI
KLAQKFPLEVVNSPEEVIDKVQVIVNTTSVGLKDKDPEIFNYDLIKKDHVVVDIIYKETKLLKKAKEKGAKLFDGLPMLL
WQGIEAFKIWNGCEVPYSVAERSVRDLRG
>A4QB65 1.1.1.-~~~aroE~~~Quinate/shikimate dehydrogenase (NAD(+))~~~
MNDSILLGLIGQGLDLSRTPAMHEAEGLAQGRATVYRRIDTLGSRASGQDLKTLLDAALYLGFNGLNITHPYKQAVLPLL
DEVSEQATQLGAVNTVVIDANGHTTGHNTDVSGFGRGMEEGLPNAKLDSVVQVGAGGVGNAVAYALVTHGVQKLQVADLD
TSRAQALADVINNAVGREAVVGVDARGIEDVIAAADGVVNATPMGMPAHPGTAFDVSCLTKDHWVGDVVYMPIETELLKA
ARALGCETLDGTRMAIHQAVDAFRLFTGLEPDVSRMRETFLSL
>Q9X5C9 1.1.1.-~~~aroE~~~Quinate/shikimate dehydrogenase (NAD(+))~~~COG0169
MNDSILLGLIGQGLDLSRTPAMHEAEGLAQGRATVYRRIDTLGSRASGQDLKTLLDAALYLGFNGLNITHPYKQAVLPLL
DEVSEQATQLGAVNTVVIDATGHTTGHNTDVSGFGRGMEEGLPNAKLDSVVQVGAGGVGNAVAYALVTHGVQKLQVADLD
TSRAQALADVINNAVGREAVVGVDARGIEDVIAAADGVVNATPMGMPAHPGTAFDVSCLTKDHWVGDVVYMPIETELLKA
ARALGCETLDGTRMAIHQAVDAFRLFTGLEPDVSRMRETFLSL
>P15770 1.1.1.25~~~aroE~~~Shikimate dehydrogenase (NADP(+))~~~COG0169
METYAVFGNPIAHSKSPFIHQQFAQQLNIEHPYGRVLAPINDFINTLNAFFSAGGKGANVTVPFKEEAFARADELTERAA
LAGAVNTLMRLEDGRLLGDNTDGVGLLSDLERLSFIRPGLRILLIGAGGASRGVLLPLLSLDCAVTITNRTVSRAEELAK
LFAHTGSIQALSMDELEGHEFDLIINATSSGISGDIPAIPSSLIHPGIYCYDMFYQKGKTPFLAWCEQRGSKRNADGLGM
LVAQAAHAFLLWHGVLPDVEPVIKQLQEELSA
>Q5KWX7 1.1.1.25~~~aroE~~~Shikimate dehydrogenase (NADP(+))~~~COG0169
MEKVYGLIGFPVEHSLSPLMHNDAFARLGIPARYHLFSVEPGQVGAAIAGVRALGIAGVNVTIPHKLAVIPFLDEVDEHA
RRIGAVNTIINNDGRLVGYNTDGLGYVQALEEEMNITLDGKRILVIGAGGGARGIYFSLLSTAAERIDMANRTVEKAERL
VREGDERRSAYFSLAEAETRLAEYDIIINTTSVGMHPRVEVQPLSLERLRPGVIVSDIIYNPLETKWLKEAKARGARVQN
GVGMLVYQGALAFEKWTGQWPDVNRMKQLVIEALRR
>P43876 1.1.1.25~~~aroE~~~Shikimate dehydrogenase (NADP(+))~~~COG0169
MDLYAVWGNPIAQSKSPLIQNKLAAQTHQTMEYIAKLGDLDAFEQQLLAFFEEGAKGCNITSPFKERAYQLADEYSQRAK
LAEACNTLKKLDDGKLYADNTDGIGLVTDLQRLNWLRPNQHVLILGAGGATKGVLLPLLQAQQNIVLANRTFSKTKELAE
RFQPYGNIQAVSMDSIPLQTYDLVINATSAGLSGGTASVDAEILKLGSAFYDMQYAKGTDTPFIALCKSLGLTNVSDGFG
MLVAQAAHSFHLWRGVMPDFVSVYEQLKKAML
>Q56S04 1.1.1.25~~~aroE~~~Shikimate dehydrogenase (NADP(+))~~~COG0169
MKLKSFGVFGNPIKHSKSPLIHNACFLTFQKELGFLGHYHPILLPLESHIKSEFLHLGLSGANVTLPFKERAFQICDKIK
GIALECGAVNTLVVENDELVGYNTDALGFWLSLGGEGYQSALILGSGGSAKALACELQKQGLKVSVLNRSARGLDFFQRL
GCDCFMDPPKSTFDLIINATSASLNNELPLNKEVLKGYFKEGKLAYDLAYGFLTPFLSLAKELETPFQDGKDMLIYQAAL
SFEKFSASQIPYPKAFEVMRSVF
>P56119 1.1.1.25~~~aroE~~~Shikimate dehydrogenase (NADP(+))~~~COG0169
MKLKSFGVFGNPIKHSKSPLIHNACFLTFQKELRFLGHYHPILLPLESHIKSEFLHLGLSGANVTLPFKERAFQVCDKIK
GIALECGAVNTLVLENDELVGYNTDALGFYLSLKQKNYQNALILGAGGSAKALACELKKQGLQVSVLNRSSRGLDFFQRL
GCDCFMEPPKSAFDLIINATSASLHNELPLNKEVLKGYFKEGKLAYDLAYGFLTPFLSLAKELKTPFQDGKDMLIYQAAL
SFEKFSASQIPYSKAFEVMRSVF
>Q8Y9N5 1.1.1.25~~~aroE~~~Shikimate dehydrogenase (NADP(+))~~~COG0169
MTNKITERITGHTELIGLIATPIRHSLSPTMHNEAFAKLGLDYVYLAFEVGDKELKDVVQGFRAMNLRGWNVSMPNKTNI
HKYLDKLSPAAELVGAVNTVVNDDGVLTGHITDGTGYMRALKEAGHDIIGKKMTICGAGGAATAICIQAALDGVKEISIF
NRKDDFYANAEKTVEKINSKTDCKAQLFDIEDHEQLRKEIAESVIFTNATGVGMKPFEGETLLPSADMLRPELIVSDVVY
KPTKTRLLEIAEEQGCQTLNGLGMMLWQGAKAFEIWTHKEMPVDYIKEILF
>Q5HNV1 1.1.1.25~~~aroE~~~Shikimate dehydrogenase (NADP(+))~~~COG0169
MKFAVIGNPISHSLSPLMHHANFQSLNLENTYEAINVPVNQFQDIKKIISEKSIDGFNVTIPHKERIIPYLDDINEQAKS
VGAVNTVLVKDGKWIGYNTDGIGYVNGLKQIYEGIEDAYILILGAGGASKGIANELYKIVRPTLTVANRTMSRFNNWSLN
INKINLSHAESHLDEFDIIINTTPAGMNGNTDSVISLNRLASHTLVSDIVYNPYKTPILIEAEQRGNPIYNGLDMFVHQG
AESFKIWTNLEPDIKAMKNIVIQKLKGEL
>Q9WYI1 1.1.1.25~~~aroE~~~Shikimate dehydrogenase (NADP(+))~~~COG0169
MKFCIIGYPVRHSISPRLYNEYFKRAGMNHSYGMEEIPPESFDTEIRRILEEYDGFNATIPHKERVMRYVEPSEDAQRIK
AVNCVFRGKGYNTDWVGVVKSLEGVEVKEPVVVVGAGGAARAVIYALLQMGVKDIWVVNRTIERAKALDFPVKIFSLDQL
DEVVKKAKSLFNTTSVGMKGEELPVSDDSLKNLSLVYDVIYFDTPLVVKARKLGVKHIIKGNLMFYYQAMENLKIWGIYD
EEVFKEVFGEVLK
>Q5SJF8 1.1.1.25~~~aroE~~~Shikimate dehydrogenase (NADP(+))~~~COG0169
MLRFAVLGHPVAHSLSPAMHAFALESLGLEGSYEAWDTPLEALPGRLKEVRRAFRGVNLTLPLKEAALAHLDWVSPEAQR
IGAVNTVLQVEGRLFGFNTDAPGFLEALKAGGIPLKGPALVLGAGGAGRAVAFALREAGLEVWVWNRTPQRALALAEEFG
LRAVPLEKAREARLLVNATRVGLEDPSASPLPAELFPEEGAAVDLVYRPLWTRFLREAKAKGLKVQTGLPMLAWQGALAF
RLWTGLLPDPSGMEEAARRALGV
>Q9KVT3 1.1.1.25~~~aroE~~~Shikimate dehydrogenase (NADP(+))~~~COG0169
MASQIDQYAVFGNPINHSKSPFIHTLFARQTQQSMIYTAQCVPVDGFTEAAKHFFAQGGRGCNVTVPFKEEAYRFADRLT
ERARLAGAVNTLKKLDDGEILGDNTDGEGLVQDLLAQQVLLKGATILLIGAGGAARGVLKPLLDQQPASITVTNRTFAKA
EQLAELVAAYGEVKAQAFEQLKQSYDVIINSTSASLDGELPAIDPVIFSSRSVCYDMMYGKGYTVFNQWARQHGCAQAID
GLGMLVGQAAESFMLWRGLRPGTKQILRELRKNLEGAL
>P00888 2.5.1.54~~~aroF~~~Phospho-2-dehydro-3-deoxyheptonate aldolase, Tyr-sensitive~~~COG0722
MQKDALNNVHITDEQVLMTPEQLKAAFPLSLQQEAQIADSRKSISDIIAGRDPRLLVVCGPCSIHDPETALEYARRFKAL
AAEVSDSLYLVMRVYFEKPRTTVGWKGLINDPHMDGSFDVEAGLQIARKLLLELVNMGLPLATEALDPNSPQYLGDLFSW
SAIGARTTESQTHREMASGLSMPVGFKNGTDGSLATAINAMRAAAQPHRFVGINQAGQVALLQTQGNPDGHVILRGGKAP
NYSPADVAQCEKEMEQAGLRPSLMVDCSHGNSNKDYRRQPAVAESVVAQIKDGNRSIIGLMIESNIHEGNQSSEQPRSEM
KYGVSVTDACISWEMTDALLREIHQDLNGQLTARVA
>P80574 2.5.1.54~~~aroH~~~Phospho-2-dehydro-3-deoxyheptonate aldolase~~~COG3200
MTVNAKTSPSAGNTWRDLPAAQQPEYPDTEALRAVIADLESYPPLVFAGECDQLRARMAAVAKGEAFLLQGGDCAEAFDA
VSADHIRNKLKTLLQMGAVLTYAASVPVVKVGRIAGQYSKPRSKPTETRDGVTLPTYRGDSVNGFDFTEAARIPDPERLK
RMYHASASTLNLVRAFTTGGYADLRQVHAWNQDFVKSSPSGQRYEQLAREIDNALNFMRACGTDPAEFQTVEFFSSHEAL
LLDYESALTRVDSRTGQLYDVSGHMVWIGERTRQLDHAHIEFASRIRNPIGIKLGPSTTAEEALQYIERLDPEREPGRLT
FIVRMGADKIRDKLPELVEKVTASGATVAWITDPMHGNTYEAASGHKTRRFDDVLDEVKGFFEVHKSLGTHPGGIHVELT
GDDVTECVGGGDEIFVDDLHQRYETACDPRLNRSQSLDLAFLVAEMYRDQ
>Q9WYH8 2.5.1.54~~~aroF~~~Phospho-2-dehydro-3-deoxyheptonate aldolase~~~COG2876
MIVVLKPGSTEEDIRKVVKLAESYNLKCHISKGQERTVIGIIGDDRYVVADKFESLDCVESVVRVLKPYKLVSREFHPED
TVIDLGDVKIGNGYFTIIAGPCSVEGREMLMETAHFLSELGVKVLRGGAYKPRTSPYSFQGLGEKGLEYLREAADKYGMY
VVTEALGEDDLPKVAEYADIIQIGARNAQNFRLLSKAGSYNKPVLLKRGFMNTIEEFLLSAEYIANSGNTKIILCERGIR
TFEKATRNTLDISAVPIIRKESHLPILVDPSHSGGRRDLVIPLSRAAIAVGAHGIIVEVHPEPEKALSDGKQSLDFELFK
ELVQEMKKLADALGVKVN
>P39912 ~~~aroA~~~Protein AroA(G)~~~COG1605
MSNTELELLRQKADELNLQILKLINERGNVVKEIGKAKEAQGVNRFDPVRERTMLNNIIENNDGPFENSTIQHIFKEIFK
AGLELQEEDHSKALLVSRKKKPEDTIVDIKGEKIGDGQQRFIVGPCAVESYEQVAEVAAAAKKQGIKILRGGAFKPRTSP
YDFQGLGVEGLQILKRVADEFDLAVISEIVTPAHIEEALDYIDVIQIGARNMQNFELLKAAGAVKKPVLLKRGLAATISE
FINAAEYIMSQGNDQIILCERGIRTYETATRNTLDISAVPILKQETHLPVFVDVTHSTGRRDLLLPTAKAALAIGADGVM
AEVHPDPSVALSDSAQQMAIPEFEKWLNELKPMVKVNA
>P0AB91 2.5.1.54~~~aroG~~~Phospho-2-dehydro-3-deoxyheptonate aldolase, Phe-sensitive~~~COG0722
MNYQNDDLRIKEIKELLPPVALLEKFPATENAANTVAHARKAIHKILKGNDDRLLVVIGPCSIHDPVAAKEYATRLLALR
EELKDELEIVMRVYFEKPRTTVGWKGLINDPHMDNSFQINDGLRIARKLLLDINDSGLPAAGEFLDMITPQYLADLMSWG
AIGARTTESQVHRELASGLSCPVGFKNGTDGTIKVAIDAINAAGAPHCFLSVTKWGHSAIVNTSGNGDCHIILRGGKEPN
YSAKHVAEVKEGLNKAGLPAQVMIDFSHANSSKQFKKQMDVCADVCQQIAGGEKAIIGVMVESHLVEGNQSLESGEPLAY
GKSITDACIGWEDTDALLRQLANAVKARRG
>O53512 2.5.1.54~~~aroG~~~Phospho-2-dehydro-3-deoxyheptonate aldolase AroG~~~COG3200
MNWTVDIPIDQLPSLPPLPTDLRTRLDAALAKPAAQQPTWPADQALAMRTVLESVPPVTVPSEIVRLQEQLAQVAKGEAF
LLQGGDCAETFMDNTEPHIRGNVRALLQMAVVLTYGASMPVVKVARIAGQYAKPRSADIDALGLRSYRGDMINGFAPDAA
AREHDPSRLVRAYANASAAMNLVRALTSSGLASLHLVHDWNREFVRTSPAGARYEALATEIDRGLRFMSACGVADRNLQT
AEIYASHEALVLDYERAMLRLSDGDDGEPQLFDLSAHTVWIGERTRQIDGAHIAFAQVIANPVGVKLGPNMTPELAVEYV
ERLDPHNKPGRLTLVSRMGNHKVRDLLPPIVEKVQATGHQVIWQCDPMHGNTHESSTGFKTRHFDRIVDEVQGFFEVHRA
LGTHPGGIHVEITGENVTECLGGAQDISETDLAGRYETACDPRLNTQQSLELAFLVAEMLRD
>P19080 5.4.99.5~~~aroH~~~Chorismate mutase AroH~~~COG4401
MMIRGIRGATTVERDTEEEILQKTKQLLEKIIEENHTKPEDVVQMLLSATPDLHAVFPAKAVRELSGWQYVPVTCMQEMD
VTGGLKKCIRVMMTVQTDVPQDQIRHVYLEKVVVLRPDLSLTKNTEL
>P00887 2.5.1.54~~~aroH~~~Phospho-2-dehydro-3-deoxyheptonate aldolase, Trp-sensitive~~~COG0722
MNRTDELRTARIESLVTPAELALRYPVTPGVATHVTDSRRRIEKILNGEDKRLLVIIGPCSIHDLTAAMEYATRLQSLRN
QYQSRLEIVMRTYFEKPRTVVGWKGLISDPDLNGSYRVNHGLELARKLLLQVNELGVPTATEFLDMVTGQFIADLISWGA
IGARTTESQIHREMASALSCPVGFKNGTDGNTRIAVDAIRAARASHMFLSPDKNGQMTIYQTSGNPYGHIIMRGGKKPNY
HADDIAAACDTLHEFDLPEHLVVDFSHGNCQKQHRRQLEVCEDICQQIRNGSTAIAGIMAESFLREGTQKIVGSQPLTYG
QSITDPCLGWEDTERLVEKLASAVDTRF
>Q84FH6 5.4.99.5~~~aroH~~~Chorismate mutase AroH~~~
MVRGIRGAITVEEDTPEAIHQATRELLLKMLEANGIQSYEELAAVIFTVTEDLTSAFPAEAARQIGMHRVPLLSAREVPV
PGSLPRVIRVLALWNTDTPQDRVRHVYLREAVRLRPDLESAQ
>O67925 2.7.1.71~~~aroK~~~Shikimate kinase~~~COG0703
MRIYLIGFMCSGKSTVGSLLSRSLNIPFYDVDEEVQKREGLSIPQIFEKKGEAYFRKLEFEVLKDLSEKENVVISTGGGL
GANEEALNFMKSRGTTVFIDIPFEVFLERCKDSKERPLLKRPLDEIKNLFEERRKIYSKADIKVKGEKPPEEVVKEILLS
LEGNALGG
>Q8A2B2 2.7.1.71~~~aroK~~~Shikimate kinase~~~COG0703
MVRIFLTGYMGAGKTTLGKAFARKLNVPFIDLDWYIEERFHKTVGELFTERGEAGFRELERNMLHEVAEFENVVISTGGG
APCFYDNMEFMNRTGKTVFLNVHPDVLFRRLRIAKQQRPILQGKEDDELMDFIIQALEKRAPFYTQAQYIFNADELEDRW
QIESSVQRLQELLEL
>Q83AJ3 2.7.1.71~~~aroK~~~Shikimate kinase~~~COG0703
MKKNLTNIYLIGLMGAGKTSVGSQLAKLTKRILYDSDKEIEKRTGADIAWIFEMEGEAGFRRREREMIEALCKLDNIILA
TGGGVVLDEKNRQQISETGVVIYLTASIDTQLKRIGQKGEMRRPLFIKNNSKEKLQQLNEIRKPLYQAMADLVYPTDDLN
PRQLATQILVDIKQTYSDL
>P0A6D7 2.7.1.71~~~aroK~~~Shikimate kinase 1~~~COG0703
MAEKRNIFLVGPMGAGKSTIGRQLAQQLNMEFYDSDQEIEKRTGADVGWVFDLEGEEGFRDREEKVINELTEKQGIVLAT
GGGSVKSRETRNRLSARGVVVYLETTIEKQLARTQRDKKRPLLHVETPPREVLEALANERNPLYEEIADVTIRTDDQSAK
VVANQIIHMLESN
>P56073 2.7.1.71~~~aroK~~~Shikimate kinase~~~COG0703
MQHLVLIGFMGSGKSSLAQELGLALKLEVLDTDMIISERVGLSVREIFEELGEDNFRMFEKNLIDELKTLKTPHVISTGG
GIVMHENLKGLGTTFYLKMDFETLIKRLNQKEREKRPLLNNLTQAKELFEKRQALYEKNASFIIDARGGLNNSLKQVLQF
IA
>P9WPY3 2.7.1.71~~~aroK~~~Shikimate kinase~~~COG0703
MAPKAVLVGLPGSGKSTIGRRLAKALGVGLLDTDVAIEQRTGRSIADIFATDGEQEFRRIEEDVVRAALADHDGVLSLGG
GAVTSPGVRAALAGHTVVYLEISAAEGVRRTGGNTVRPLLAGPDRAEKYRALMAKRAPLYRRVATMRVDTNRRNPGAVVR
HILSRLQVPSPSEAAT
>P10880 2.7.1.71~~~aroL~~~Shikimate kinase 2~~~
MTEPIFMVGARGCGKTTVGRELARALGYEFVDTDIFMQHTSGMTVADVVAAEGWPGFRRRESEALQAVATPNRVVATGGG
MVLLEQNRQFMRAHGTVVYLFAPAEELALRLQASPQAHQRPTLTGRPIAEEMEAVLREREALYQDVAHYVVDATQPPAAI
VCELMQTMRLPAA
>P0A6E1 2.7.1.71~~~aroL~~~Shikimate kinase 2~~~COG0703
MTQPLFLIGPRGCGKTTVGMALADSLNRRFVDTDQWLQSQLNMTVAEIVEREEWAGFRARETAALEAVTAPSTVIATGGG
IILTEFNRHFMQNNGIVVYLCAPVSVLVNRLQAAPEEDLRPTLTGKPLSEEVQEVLEERDALYREVAHIIIDATNEPSQV
ISEIRSALAQTINC
>Q46065 ~~~aroP~~~Aromatic amino acid transport protein AroP~~~COG1113
MAKSNEGLGTGLRTRHLTMMGLGSAIGAGLFLGTGVGIRAAGPAVLLAYIIAGAIVVLVMQMLGEMAAARPASGSFSRYG
EDAFGHWAGFSLGWLYWFMLIMVMGAEMTGAAAIMGAWFGVEPWIPSLVCVVFFAVVNLVAVRGFGEFEYWFAFIKVAVI
IAFLIIGIALIFGWLPGSTFVGTSNFIGDHGFMPNGISGVAAGLLAVAFAFGGIEIVTIAAAESDKPREAISLAVRAVIW
RISVFYLGSVLVITFLMPYESINGADTAAESPFTQILAMANIPGTVGFMEAIIVLALLSAFNAQIYATSRLVFSMANRQD
APRVFSKLSTSHVPTNAVLLSMFFAFVSVGLQYWNPAGLLDFLLNAVGGCLIVVWAMITLSQLKLRKELQANDEISTVRM
WAHPWLGILTLVLLAGLVALMLGDAASRSQVYSVAIVYGFLVLLSFVTVNSPLRGGRTPSDLN
>P15993 ~~~aroP~~~Aromatic amino acid transport protein AroP~~~COG1113
MMEGQQHGEQLKRGLKNRHIQLIALGGAIGTGLFLGSASVIQSAGPGIILGYAIAGFIAFLIMRQLGEMVVEEPVAGSFS
HFAYKYWGSFAGFASGWNYWVLYVLVAMAELTAVGKYIQFWYPEIPTWVSAAVFFVVINAINLTNVKVFGEMEFWFAIIK
VIAVVAMIIFGGWLLFSGNGGPQATVSNLWDQGGFLPHGFTGLVMMMAIIMFSFGGLELVGITAAEADNPEQSIPKATNQ
VIYRILIFYIGSLAVLLSLMPWTRVTADTSPFVLIFHELGDTFVANALNIVVLTAALSVYNSCVYCNSRMLFGLAQQGNA
PKALASVDKRGVPVNTILVSALVTALCVLINYLAPESAFGLLMALVVSALVINWAMISLAHMKFRRAKQEQGVVTRFPAL
LYPLGNWICLLFMAAVLVIMLMTPGMAISVYLIPVWLIVLGIGYLFKEKTAKAVKAH
>O30557 4.2.1.10~~~aroQ1~~~3-dehydroquinate dehydratase 1~~~
MATLLVLHGPNLNLLGTREPGTYGSTTLGQINQDLERRAREAGHHLLHLQSNAEYELIDRIHAARDEGVDFIIINPAAFT
HTSVALRDALLAVSIPFIEVHLSNVHKREPFRHHSYFSDVAVGVICGLGATGYRLALESALEQLQRP
>A3M692 4.2.1.10~~~aroQ~~~3-dehydroquinate dehydratase~~~
MSSTILVIHGPNLNLLGKREPEVYGHLTLDNINRQLIAQAEQASITLDTFQSNWEGAIVDRIHQAQTEGVKLIIINPAAL
THTSVALRDALLGVAIPFIEVHLSNVHAREAFRHHSYLSDKAIGVICGLGAKGYSFALDYAIEKIQPSNPN
>P43877 4.2.1.10~~~aroQ~~~3-dehydroquinate dehydratase~~~
MKKILLLNGPNLNMLGKREPHIYGSQTLSDIEQHLQQSAQAQGYELDYFQANGEESLINRIHQAFQNTDFIIINPGAFTH
TSVAIRDALLAVSIPFIEVHLSNVHAREPFRHHSYLSDVAKGVICGLGAKGYDYALDFAISELQKIQLGEMMNG
>P54517 4.2.1.10~~~yqhS~~~3-dehydroquinate dehydratase~~~COG0757
MPHFLILNGPNVNRLGSREPEVFGRQTLTDIETDLFQFAEALHIQLTFFQSNHEGDLIDAIHEAEEQYSGIVLNPGALSH
YSYAIRDAVSSISLPVVEVHLSNLYAREEFRHQSVIAPVAKGQIVGLGAEGYKLAVRYLLSQQGGESR
>Q48255 4.2.1.10~~~aroQ~~~3-dehydroquinate dehydratase~~~COG0757
MKILVIQGPNLNMLGHRDPRLYGMVTLDQIHEIMQTFVKQGNLDVELEFFQTNFEGEIIDKIQESVGSDYEGIIINPGAF
SHTSIAIADAIMLAGKPVIEVHLTNIQAREEFRKNSYTGAACGGVIMGFGPLGYNMALMAMVNILAEMKAFQEAQKNNPN
NPINNQK
>P9WPX7 4.2.1.10~~~aroQ~~~3-dehydroquinate dehydratase~~~COG0757
MSELIVNVINGPNLGRLGRREPAVYGGTTHDELVALIEREAAELGLKAVVRQSDSEAQLLDWIHQAADAAEPVILNAGGL
THTSVALRDACAELSAPLIEVHISNVHAREEFRRHSYLSPIATGVIVGLGIQGYLLALRYLAEHVGT
>A1SZA3 4.2.1.10~~~aroQ~~~3-dehydroquinate dehydratase~~~COG0757
MTQQIKLLVLNGPNLNLLGQREPEVYGSKTLDDIIKALTDEAALQNVALSHLQSNREYELIEKIHDAFEKIDFIIINPAA
FTHTSVALRDALLGVNIPFIEVHLSNVHARESFRHHSYLSDIAQGVICGLGAKGYSFALQSAIGKLRNI
>P15474 4.2.1.10~~~aroQ~~~3-dehydroquinate dehydratase~~~COG0757
MPRSLANAPIMILNGPNLNLLGQRQPEIYGSDTLADVEALCVKAAAAHGGTVDFRQSNHEGELVDWIHEARLNHCGIVIN
PAAYSHTSVAILDALNTCDGLPVVEVHISNIHQREPFRHHSYVSQRADGVVAGCGVQGYVFGVERIAALAGAGSARA
>Q5SIL5 4.2.1.10~~~aroQ~~~3-dehydroquinate dehydratase~~~COG0757
MVLILNGPNLNLLGRREPEVYGRTTLEELEALCEAWGAELGLGVVFRQTNYEGQLIEWVQQAHQEGFLAIVLNPGALTHY
SYALLDAIRAQPLPVVEVHLTNLHAREEFRRHSVTAPACRGIVSGFGPLSYKLALVYLAETLEVGGEGF
>Q8ZAX1 4.2.1.10~~~aroQ~~~3-dehydroquinate dehydratase~~~COG0757
MSDKFHILLLNGPNLNLLGTREPEKYGYTTLAEIVSQLEIQAQGMDVALSHLQSNAEHALIDSIHQARGNTDFILINPAA
FTHTSVALRDALLGVQIPFIEIHLSNVHAREPFRHHSYLSDIAVGVICGLGADGYNFALQAAVNRLSKSN
>P13050 ~~~arp4~~~IgA receptor~~~
MARKDTNKQYSLRKLKTGTASVAVAVAVLGAGFANQTEVKAAEIKKPQADSAWNWPKEYNALLKENEELKVEREKYLSYA
DDKEKDPQYRALMGENQDLRKREGQYQDKIEELEKERKEKQERQEQLERQYQIEADKHYQEQQKKHQQEQQQLEAEKQKL
AKDKQISDASRQGLSRDLEASRAAKKELEAEHQKLKEEKQISDASRQGLSRDLEASREAKKKVEADLAALTAEHQKLKED
KQISDASRQGLSRDLEASREAKKKVEADLAEANSKLQALEKLNKELEEGKKLSEKEKAELQARLEAEAKALKEQLAKQAE
ELAKLKGNQTPNAKVAPQANRSRSAMTQQKRTLPSTGETANPFFTAAAATVMVSAGMLALKRKEEN
>Q9KJC3 ~~~arpA~~~Antibiotic efflux pump periplasmic linker protein ArpA~~~
MQFKPAVTALVSAVALATLLSGCKKEEAAPAAQAPQVGVVTIQPQAFTLTSELPGRTSAYRVAEVRPQVNGIILKRLFKE
GSEVKEGQQLYQIDPAVYEATLANAKANLLATRSLAERYKQLIDEQAVSKQEYDDANAKRLQAEASLKSAQIDLRYTKVL
APISGRIGRSSFTEGALVSNGQTDAMATIQQLDPIYVDVTQSTAELLKLRRDLESGQLQKAGNNAASVQLVLEDGSLFKQ
EGRLEFSEVAVDETTGSVTLRALFPNPDHTLLPGMFVHARLKAGVNANAILAPQQGVTRDLKGAPTALVVNQENKVELRQ
LKASRTLGSDWLIEEGLNPGDRLITEGLQYVSPRRRGEGQRCHQRQEAGRP
>Q9KJC2 ~~~arpB~~~Antibiotic efflux pump membrane transporter ArpB~~~COG0841
MSKFFIDRPIFAWVIALVIMLVGALSILKLPINQYPSIAPPAIAIAVTYPGASAQTVQDTVVQVIEQQLNGIDNLRYVSS
ESNSDGSMTITATFEQGTNPDTAQVQVQNKLNLATPLLPQEVQQQGIRVTKAVKNFLLVIGLVSEDGSMTKDDLANYIVS
NMQDPISRTAGVGDFQVFGAQYAMRIWLDPAKLNKFQLTPVDVKTAVAAQNVQVSSGQLGGLPALPGTQLNATIIGKTRL
QTAEQFESILLKVNKDGSQVRLGDVAQVGLGGENYAVSAQFNGKPASGLAVKLATGANALDTAKALRETIKGLEPFFPPG
VKAVFPYDTTPVVTESISGVIHTLIEAVVLVFLVMYLFLQNFRATIITTMTVPVVLLGTFGILAAAGFSINTLTMFAMVL
AIGLLVDDAIVVVENVERVMSEEGLPPKEATKRSMEQIQGALVGIALVLSAVLLPMAFFGGSTGVIYRQFSITIVSAMGL
SVLVALIFTPALCATMLKPLKKGEHHTAKGGFFGWFNRNFDRSVNGYERSVGTILRNKVPFLLGYALIVVGMIWLFARIP
TAFLPEEDQGVLFAQVQTPAGSSAERTQVVVDQMREYLLKDEADTVSSVFTVNGFNFAGRGQSSGMAFIMLKPWDERSKE
NSVFALAQRAQQHFFTFRDAMVFAFAPPAVLELGNATGFDVFLQDRGGVGHAKLMEARNQFLAKAAQSKILSAVRPNGLN
DEPQYQLTIDDERASALGVTIADINNTLSIALGASYVNDFIDRGRVKKVYIQGEPSARMSPEDLQKWYVRNGAGEMVPFS
SFAKGEWTYGSPKLSRYNGVEAMEILGAPAPGYSTGEAMAEVERIAGELPSGIGFSWTGMSYEEKLSGSQMPALFALSVL
FVFLCLAALYESWSIPIAVVLVVPLGIIGALIATSLRGLSNDVYFLVGLLTTIGLAAKNAILIVEFAKELHEQGRSLYDA
AIEACRMRLRPIIMTSLAFILGVVPLTIASGAGAGSQHAIGTGVIGGMISATVLAIFWVPLFFVAVSSLFGSKEPEKDVT
PENPRYEAGQ
>Q9KJC1 ~~~arpC~~~Antibiotic efflux pump outer membrane protein ArpC~~~COG1538
MTKSLLSLAVTAFILGGCSLIPDYQAPEAPVAAQWPQGPAYSPTQSADVAAAEQGWRQFFHDPALQQLIQTSLVNNRDLR
VAALNLDAYRAQYRIQRADLFPAVSATGSGSRQRVPANMSQTGESGITSQYSATLGVSAYELDLFGRVRSLTEQALETYL
SSEQARRSTQIALVASVANAYYTWQADQALFKLTEETLKTYEESYNLTRRSNEVGVASALDVSQARTAVEGARVKYSQYQ
RLVAQDVNSLTVLLGTGIPADLAKPLELDADQLAEVPAGLPSDILQRRPDIQEAEHLLKAANANIGAARAAFFPSISLTA
NAGSLSPDMGHLFSGGQGTWLFQPQINLPIFNAGSLKASLDYSKIQKDINVAKYEKTIQTAFQEVSDGLAARKTFEEQLQ
AQRDLVQANQDYYRLAERRYRIGIDSNLTFLDAQRNLFSAQQALIGDRLSQLTSEVNLYKALGGGWYEQTGQANQQASVE
TPKG
>Q8KQL2 1.1.1.301~~~~~~D-arabitol-phosphate dehydrogenase~~~
MSKTMKGVSKQAPGYDQMAFIDLSVPEATDDKVLIKVAYTGICGSDIHTFKGEYKNPTTPVVLGHEFSGQVVEVGANVPK
VKVGDRVTSETTFYVCGECDYCKEKQYNLCPHRKGIGTQQNGSMANYVLAREESIHLLPDHLSYEGAAMSEPLACCVHAM
YQKSHLELKDTIIIMGPGPIGLYLLQIAKEIGAFVIMTGITKDAHRLALAKKLGADVIVDTMKEDLAKVVNEITDGYGVD
KVYDASGAVPAVNASLPLIRKQGQFIQVGLFANKMVDLDTESIIQREIEYIGSRSQNPYDWPIAIHLLAKGAINIDEMIT
KKYPLTEWREAFDKVMEGNEIKVMIESNPEEF
>Q6LAD6 3.4.22.-~~~avrRpt2~~~Cysteine protease avirulence protein AvrRpt2~~~
MKIAPVAINHSPLSREVPSHAAPTQAKQTNLQSEAGDLDARKSSASSPETRALLATKTVLGRHKIEVPAFGGWFKKKSSK
HETGGSSANADSSSVASDSTEKPLFRLTHVPYVSQGNERMGCWYACARMVGHSVEAGPRLGLPELYEGREGPAGLQDFSD
VERFIHNEGLTRVDLPDNERFTHEELGALLYKHGPIIFGWKTPNDSWHMSVLTGVDKETSSITFHDPRQGPDLAMPLDYF
NQRLAWQVPHAMLYR
>Q5Y818 1.20.99.1~~~arrA~~~Arsenate respiratory reductase molybdopterin-containing subunit ArrA~~~
MRIKRREFLKASAAVGAVAVASPTLNAFAQTGTGASAMGEAEGKWIPSTCQGCTTWCPVEFLFRMAVRSKYAATQLSKAN
NGYCCVRGHLMLQQLYDPDRIKTPMKRTNPVKGRKEDPKICPYHMGMKQWDTIADKIMELRKNNETHKYLLMRGRYSDHN
SIFYGDLTKMIGSPNNISHSAICAEVEKMGSMATEGFWGYRDYDLDNMKYLIAWACDPLSSNRQIPNAIRKIQGVMDRGK
VVAVDPRMNNTASKAQEWLPIKPSEDGALALAMAHVIITKGLWSKEFVGDFKDGKNKFVAGKTVKEEDFEEKLTNGIVKW
WNLEVKDRTPKWAAKVTGIDEATIIRVATEFAQAAPACAIWYGPNMQPRGSYAVMCIHALNGLVGASDSEGGLCTGMGSP
SSSYPKIDAYQDDVAKAGAKNKKIDQRGTLKFPAMGSAKPGTGVVTNNVADALLAADPYDIKVAIGYFCNFNFSGTDGAR
WDKALAKVPFFVHCVPMFSEMTYFADIVLPAALHHTEDWAVIRSKANLHGHTSIQQPVVERMFDVKGVETEITWLLAEKL
KAKGFENMYNWLYNEYKDPETGKNPTNSLEFALYATKIRSKKCWDPKENAEYKGDKLNGWADFMEKGIVNSPKFKFRQKW
EKGFPTETKKFEFYSETLKKGLLAHAEKNKVTVDQVMEATNYEARGELAFIPHYESPKRHGDVKEFPFSLIDMKSRLNRE
GRSTNATWYHAFKKCDPGDVNQEDVLQINPADAKKLGINEGDMVKVTSVIGSLTVKARLWEGVRPGCVAKCYGQGHFAMG
RVSAKDFGKAVARGANFNDIMPADYDRITGATARNGGFTGVKIEKA
>Q7WTU0 1.20.99.1~~~arrA~~~Arsenate respiratory reductase molybdopterin-containing subunit ArrA~~~COG0243
MKKENQVNLGRRQLLKSTAAGTVLTGIGGTLSFTPIVEGIAAELPAPLRRTGVGEWLATTCQGCTSWCAKQIYVMDGRAL
KVRGNPNSGVHGMSSCPRQHLSLQQVYDPDRLRTPMMRTNPKKGRDQDPKFVPISWDKALDMLADKIIALRVANEPHKYA
LLRGRYSHINDLLYKKMTNLIGSPNNISHSSVCAEAHKMGPYYLDGNWGYNQYDVKNAKFILSFGADPIASNRQVSFYSQ
TWGDSLDHAKVVVVDPRLSASAAKAHKWIPIEPGQDSVLALAIAHVALVEGVWHKPFVGDFIEGKNLFKAGKTVSVESFK
ETHTYGLVEWWNQALKDYTPEWASKITGIDPKTIIAIAKDMGAAAPAVQVWTSRGAVMQARGTYTSISCHALNGLFGGID
SKGGLFPGNKTPLLKEYPEAKAYMDEIAAKGVKKEKIDQRGRLEFPALAKGKSGGGVITANAANGIRNQDPYEIKVMLAY
FNNFNFSNPEGQRWDEALSKVDFMAHITTNVSEFSWFADVLLPSSHHMFEKWGVLDSIGNGVAQISIQQPSIKRLWDTRI
DESEIPYMLAKKLADKGFDAPWRYINEQIVDPETGKPAADEAEFAKLMVRYLTAPLWKEDASKYGDKLSSWDEFVQKGVW
NSSPYKLEARWGKFKTETTKFEFYSKTLEKALQSHADKHKVSIDEVMKACDYQARGHLAFIPHYEEPYRFGDESEFPLLL
VDQKSRLNKEGRTANSPWYYEFKDVDPGDVANEDVAKFNPIDGKKFGLKDGDEIRITSPVGMLTCKAKLWEGVRPGTVAK
CFGQGHWAYGRYASAKFGVTPRGGSNNDLIADRYDRLSGASAFYGHIRVRVEKV
>Q7WTT9 ~~~arrB~~~Arsenate respiratory reductase iron-sulfur subunit ArrB~~~COG0437
MRLGMVIDLQKCVGCGGCSLACKTENNTNDGIHWSHHIATTEGTFPDVKYTYIPTLCNHCDDAPCVKVCPTGAMHKDKRG
LTLQNNDECIGCKKCMNACPYGVISFNAATPHRRWQDDSEVVANGTVSPLMLLKRTGATATPNENPERGDTYPMIRPKRT
TEKCTFCDHRLDKGLNPACVDACPSEARVIGDLDDPQSKVSQLIKLHKPMQLKPEAGTGPRVFYIRSFGVKTAY
>B7J950 1.-.-.-~~~arsH~~~NADPH-dependent FMN reductase ArsH~~~COG0431
MSGNLPNTDDVLLQVPDVRCLRSAAETDHPPRILLLYGSNRECSYSRLLTLEAERLLRYFGAETRVFHPTGLPLPDDAPV
THPKVVELQELVEWSEGQVWCSPERHGAMTGVFKSQVDWIPLNSGAIRPTQGKTLALMQVCGGSQSFNAVNQMRILGRWL
RMLTIPNQSSVPKAFLEFDDGGRMKPSAYYDRVVDVMEELMKFTLLTRGNSDYLVDRYSERKESAEELSRRVNLQNL
>Q92R45 1.-.-.-~~~arsH~~~NADPH-dependent FMN reductase ArsH~~~COG0431
MSDDDSSHDLPAANLQQLRLPDSASLRPAFSTHRPRILILYGSLRTVSYSRLLAEEARRLLEFFGAEVKVFDPSGLPLPD
AAPVSHPKVQELRELSIWSEGQVWVSPERHGAMTGIMKAQIDWIPLSTGSIRPTQGKTLAVMQVSGGSQSFNAVNQMRIL
GRWMRMITIPNQSSVAKAFQEFDANGRMKPSSYYDRVVDVMEELVKFTLLTRDCSAYLTDRYSERKESAAELEHRVTLKS
V
>Q7UC03 1.-.-.-~~~arsH~~~NADPH-dependent FMN reductase ArsH~~~
MRLRHLSDPDSLPALDKSFAIERPALGLAPDAPPVRILLLYGSLRARSFSRLAVEEAARLLQFFGAETRIFDPSDLPLPD
QVQSDDHPAVKELRALSEWSEGQVWCSPERHGQITSVMKAQIDHLPLEMAGIRPTQGRTLAVMQVSGGSQSFNAVNTLRL
LGRWMRMFTIPNQSSIAKAFQEFDAAGRMKPSPYYDRIADVMEELVRFTALVRPHREALTDRYSERKAAGHVIDEATDLS
SIAIAPQPLPESETS
>P74312 1.6.5.2~~~arsH~~~NADPH-dependent quinone reductase ArsH~~~COG0431
MTDFNHPPRILFLYGSLRERSYSRLLAEEAGRIITTMGAETKFFDPRELPLRGQVLDSHPKVQELLELSQWSEGQVWSSP
EMHGNITGILKNQIDWIPLEIGSIRPTQGRTLAVMQVSGGSQSFNAVNTMRILGRWMRMFTIPNQSSVAKAYQEFHEDGT
MKDSPYRDRVVDVMEELYKFTLLLRDKVDYLTDRHSERKAKVNTDG
>P08690 7.3.2.7~~~arsA~~~Arsenical pump-driving ATPase~~~
MQFLQNIPPYLFFTGKGGVGKTSISCATAIRLAEQGKRVLLVSTDPASNVGQVFSQTIGITIQAIASVPGLSALEIDPQA
AAQQYRARIVDPIKGVLPDDVVSSINEQLSGACTTEIAAFDEFTGLLTDASLLTRFDHIIFDTAPTGHTIRLLQLPGAWS
SFIDSNPEGASCLGPMAGLEKQREQYAYAVEALSDPKRTRLVLVARLQKSTLQEVARTHLELAAIGLKNQYLVINGVLPK
TEAANDTLAAAIWEREQEALANLPADLAGLPTDTLFLQPVNMVGVSALSRLLSTQPVASPSSDEYLQQRPDIPSLSALVD
DIARNEHGLIMLMGKGGVGKTTMAAAIAVRLADMGFDVHLTTSDPAAHLSMTLNGSLNNLQVSRIDPHEETERYRQHVLE
TKGKELDEAGKRLLEEDLRSPCTEEIAVFQAFSRVIREAGKRFVVMDTAPTGHTLLLLDATGAYHREIAKKMGEKGHFTT
PMMLLQDPERTKVLLVTLPETTPVLEAANLQADLERAGIHPWGWIINNSLSIADTRSPLLRMRAQQELPQIESVKRQHAS
RVALVPVLASEPTGIDKLKQLAG
>P0AB93 ~~~arsB~~~Arsenical pump membrane protein~~~COG1055
MLLAGAIFVLTIVLVIWQPKGLGIGWSATLGAVLALVTGVVHPGDIPVVWNIVWNATAAFIAVIIISLLLDESGFFEWAA
LHVSRWGNGRGRLLFTWIVLLGAAVAALFANDGAALILTPIVIAMLLALGFSKGTTLAFVMAAGFIADTASLPLIVSNLV
NIVSADFFGLGFREYASVMVPVDIAAIVATLVMLHLYFRKDIPQNYDMALLKSPAEAIKDPATFKTGWVVLLLLLVGFFV
LEPLGIPVSAIAAVGALILFVVAKRGHAINTGKVLRGAPWQIVIFSLGMYLVVYGLRNAGLTEYLSGVLNVLADNGLWAA
TLGTGFLTAFLSSIMNNMPTVLVGALSIDGSTASGVIKEAMVYANVIGCDLGPKITPIGSLATLLWLHVLSQKNMTISWG
YYFRTGIIMTLPVLFVTLAALALRLSFTL
>P0DKS5 2.8.4.2~~~arsC1~~~Arsenate-mycothiol transferase ArsC1~~~COG0394
MNNQPSVLFVCVGNGGKSQMAAALAKKHAGDALKVYSAGTKPGTKLNQQSLDSIAEVGADMSQGFPKGIDQELIKRVDRV
VILGAEAQLEMPIDANGILQRWVTDEPSERGIEGMERMRLVRDDIDARVQNLVAELTQNA
>P08692 1.20.4.1~~~arsC~~~Arsenate reductase~~~COG1393
MSNITIYHNPACGTSRNTLEMIRNSGTEPTIILYLENPPSRDELVKLIADMGISVRALLRKNVEPYEQLGLAEDKFTDDQ
LIDFMLQHPILINRPIVVTPLGTRLCRPSEVVLDILQDAQKGAFTKEDGEKVVDEAGKRLK
>P0DKS7 2.8.4.2~~~arsC2~~~Arsenate-mycothiol transferase ArsC2~~~COG0394
MKSVLFVCVGNGGKSQMAAALAQKYASDSVEIHSAGTKPAQGLNQLSVESIAEVGADMSQGIPKAIDPELLRTVDRVVIL
GDDAQVDMPESAQGALERWSIEEPDAQGMERMRIVRDQIDNRVQALLAG
>P45947 1.20.4.4~~~arsC~~~Arsenate reductase~~~COG0394
MENKIIYFLCTGNSCRSQMAEGWAKQYLGDEWKVYSAGIEAHGLNPNAVKAMKEVGIDISNQTSDIIDSDILNNADLVVT
LCGDAADKCPMTPPHVKREHWGFDDPARAQGTEEEKWAFFQRVRDEIGNRLKEFAETGK
>P0AB96 1.20.4.1~~~arsC~~~Arsenate reductase~~~COG1393
MSNITIYHNPACGTSRNTLEMIRNSGTEPTIIHYLETPPTRDELVKLIADMGISVRALLRKNVEPYEELGLAEDKFTDDR
LIDFMLQHPILINRPIVVTPLGTRLCRPSEVVLEILPDAQKGAFSKEDGEKVVDEAGKRLK
>Q92R44 1.20.4.1~~~arsC~~~Arsenate reductase~~~COG1393
MTVTIYHNPACGTSRNTLAMIRNAGIEPTVVEYLKNPPSRAELEAMIAAAGLTVRQAIREKGTPFAELGLGDPSRSDEEL
LDAMLEHPILINRPFVVAPLGTRLCRPSEVVLDILPDTHKGPFSKEDGEAVLDAGGKRIV
>P0A005 1.20.4.4~~~arsC~~~Arsenate reductase~~~
MDKKTIYFICTGNSCRSQMAEGWGKEILGEGWNVYSAGIETHGVNPKAIEAMKEVDIDISNHTSDLIDNDILKQSDLVVT
LCSDADNNCPILPPNVKKEHWGFDDPAGKEWSEFQRVRDEIKLAIEKFKLR
>P0A006 1.20.4.4~~~arsC~~~Arsenate reductase~~~
MDKKTIYFICTGNSCRSQMAEGWGKEILGEGWNVYSAGIETHGVNPKAIEAMKEVDIDISNHTSDLIDNDILKQSDLVVT
LCSDADNNCPILPPNVKKEHWGFDDPAGKEWSEFQRVRDEIKLAIEKFKLR
>P74313 1.20.4.1~~~arsC~~~Glutaredoxin arsenate reductase~~~COG0394
MKKVMFVCKRNSCRSQMAEGFAKTLGAGKIAVTSCGLESSRVHPTAIAMMEEVGIDISGQTSDPIENFNADDYDVVISLC
GCGVNLPPEWVTQEIFEDWQLEDPDGQSLEVFRTVRGQVKERVENLIAKIS
>P46003 ~~~arsD~~~Arsenical resistance operon trans-acting repressor ArsD~~~
MKTLMVFDPAMCCSTGVCGTDVDQALVDFSTDVQWLKQCGVQIERFNLAQQPMSFVQNEKVKAFIEASGAEGLPLLLLDG
ETVMAGRYPKRAELARWFGIPLDKVGLAPSGCCGGNTSCC
>Q6ZEM6 1.20.4.1~~~arsI1~~~Arsenate reductase ArsI1~~~
MTENMIVIYHNPDCGTSRNVLQLIEAAGYLPQVIEYVKEGWTKPQLLGLFAAADLTPRSALRTTKSPAAELNLLEETVTD
AQILDAMVEYPILVNRPIVCTPKGVRLCRPSEVVLDLLDHWPSGPFAKEDGELIIDERGNRVYT
>Q6YRW7 1.20.4.1~~~arsI2~~~Arsenate reductase ArsI2~~~
MIVIYHNPDCGTSRNVLQLIEAAGYLPQVIEYVKEGWTKPQLLGLFAAADLTPRSALRTTKSPAAELNLLEETVTDAQIL
DAMVEYPILVNRPIVCTPKGVRLCRPSEVVLDLLDHWPSGPFAKEDGELIIDERGNRVYT
>A0A0D3MJQ5 2.1.1.137~~~arsM~~~Arsenite methyltransferase~~~
MDNIREGVRQKYAFAIANRGQGCCGSPGCCSDGLSDAADPITGNLYDESDLQGLDPELIANSFGCGNPTALMNLNLGEVV
LDLGSGSGLDVLLSAKRVGPTGKAYGLDMTDEMLAVAKENQRKSGIENAEFLKGHIEEIPLAAKSIDVIISNCVINLSGD
KDKVLKEAYRVLKPQGRFAVSDIVIKRPLPEKIRDNILAWAGCIAGAMTEEEYRGKLSRAGFENISLQVTREYNLEDPSL
RGMLEDLTDGEIKEFQGAMVSCFIRAAKPA
>U2ZU49 2.1.1.137~~~arsM~~~Arsenite methyltransferase~~~COG2226
MHESVQNYYGKVLQNSSDLKTSACCDASSMPAWLKPLLSQVHPEVSARYYGCGLVAPALLDGCQVLDLGSGSGRDCYVLA
QLVGASGSVLGVDMTAEQLAVANAHLDYHAERFGFANVSFRHGYIEDLASLELADGSFDVIVSNCVINLSPDKDSVLREA
YRLLKPGGELYFSDVYADRRLADELRQDEVLYGECLGGALYWNDFEHLARRHGFTDPRLVEDQPISITDSALAEKLGDAR
FYSATYRLFKLDGLEPACEDYGQAVIYRGSIPGAAHAFVLDKHHRIETGRVFPVCGNTWRMLQDTRFAPHFQFIGDFSRH
FGLFEGCGGGLPYDRQAAVTAATSCC
>Q6N3Y0 2.1.1.137~~~arsM~~~Arsenite methyltransferase~~~COG2226
MPTDMQDVKDIVREKYASAALKVATGGASCCGSSALPGASPITSNLYDAAQEQGLPAEAMLASLGCGNPTALAQLSPGET
VLDLGSGGGIDVLLSARRVGPTGKAYGLDMTDEMLALARDNQRKAGLDNVEFLKGEIEAIPLPDHSVDVIISNCVINLSG
DKDRVLREAFRVLKPGGRFAVSDVVTRGEIPEALRRDVLLWVGCLAGALDEADYVAKLAAAGFAQISIEPTRVYDIEDAR
EFLTGKGIDVDALAPQMQDKFFSGFVRATKPGADGEVPARCCG
>Q88LK1 ~~~arsR1~~~Arsenic resistance transcriptional regulator ArsR1~~~COG0640
MAVRAFPGGHMREILTPPIVFKCLADDTRARMTLLIAREGELCVCELTHALELSQPKISRHLAQLREAGILMDRRKGQWV
YYRLHPEVPQWVDAMLKGVVDANQEWLSPDALRLAEMGERPQSPVACA
>Q88JD1 ~~~arsR2~~~Arsenic resistance transcriptional regulator ArsR2~~~COG0640
MITPPDVFKSLSDETRARATLLIASLGELCVCELMCALNDSQPKISRHLAQLRSNGMLLDRRQGQWVYYRLNPELPSWVH
EMLQVTLQANSQWLADNALRLKNMDGRPVRDSVCC
>P45949 ~~~arsR~~~Arsenical resistance operon repressor~~~COG0640
MDETKSELLRKYEQKFKALADQKRLEIMYELCQRGKTCVCDLTEIFEVTQSKLSYHLKILLDANLITKETKGTWSYYDLN
DEEVNGLLSEELCCIFRKKGEGDCC
>O24973 ~~~arsR~~~Transcriptional regulatory protein ArsR~~~COG0745
MIEVLMIEDDIELAEFLSEFLLQHGIHVTNYDEPYTGISAANTQNYDLLLLDLTLPNLDGLEVCRRISKQKHIPIIISSA
RSDVEDKIKALDYGADDYLPKPYDPKELLARIQSLLRRSHKKEEVSEPGDANIFRVDKDSREVYMHEKKLDLTRAEYEIL
SLLISKKGYVFSRESIAIESESINPESSNKSIDVIIGRLRSKIEKNPKQPQYIISVRGIGYKLEY
>O24972 2.7.13.3~~~arsS~~~Sensor histidine kinase ArsS~~~COG0642
MRFSIFFKVVALFMITLFSFGAFAYYFVSSQISHENYQNEMRHYQFVTTINEILNNYSDYRAIEDYLYKIGFRETTIENL
EKVLAKRRHQLHHRNIGYAEVFKFSDMVFILLKKDEHFVLYKDLHSVSYRNYFLAITVGLLLILFLFLFVLQSLLPLREL
RSQVKPFAQGDKSVSCKSKQKDEIGDLANEFDNCILKINAMNESRVLFLRSIMHELRTPITKGKILSSMLKEELSCKRFS
SIFDHLNMLIEQFARIEQLASKNYGSNKEKFLMSDLIDKIEKMLLIDEDKESPIHVSSSNYIIEADFELFSIALKNMVDN
AIKYSDDKQVFLDFIGNNLVVSNKSKPLKEDFEKYLQPYFKSSNPSQAHGFGLGMYIIKNALEAMGLNLSYHYSNGRICF
TIHDCVFNSFYDLEEDNEELPPPPPKI
>P51691 3.1.6.1~~~atsA~~~Arylsulfatase~~~
MSKRPNFLVIVADDLGFSDIGAFGGEIATPNLDALAIAGLRLTDFHTASTCSPTRSMLLTGTDHHIAGIGTMAEALTPEL
EGKPGYEGHLNERVVALPELLREAGYQTLMAGKWHLGLKPEQTPHARGFERSFSLLPGAANHYGFEPPYDESTPRILKGT
PALYVEDERYLDTLPEGFYSSDAFGDKLLQYLKERDQSRPFFAYLPFSAPHWPLQAPREIVEKYRGRYDAGPEALRQERL
ARLKELGLVEADVEAHPVLALTREWEALEDEERAKSARAMEVYAAMVERMDWNIGRVVDYLRRQGELDNTFVLFMSDNGA
EGALLEAFPKFGPDLLGFLDRHYDNSLENIGRANSYVWYGPRWAQAATAPSRLYKAFTTQGGIRVPALVRYPRLSRQGAI
SHAFATVMDVTPTLLDLAGVRHPGKRWRGREIAEPRGRSWLGWLSGETEAAHDENTVTGWELFGMRAIRQGDWKAVYLPA
PVGPATWQLYDLARDPGEIHDLADSQPGKLAELIEHWKRYVSETGVVEGASPFLVR
>P30859 ~~~artI~~~Putative ABC transporter arginine-binding protein 2~~~COG0834
MKKVLIAALIAGFSLSATAAETIRFATEASYPPFESIDANNQIVGFDVDLAQALCKEIDATCTFSNQAFDSLIPSLKFRR
VEAVMAGMDITPEREKQVLFTTPYYDNSALFVGQQGKYTSVDQLKGKKVGVQNGTTHQKFIMDKHPEITTVPYDSYQNAK
LDLQNGRIDGVFGDTAVVTEWLKDNPKLAAVGDKVTDKDYFGTGLGIAVRQGNTELQQKLNTALEKVKKDGTYETIYNKW
FQK
>P45091 ~~~artI~~~ABC transporter arginine-binding protein~~~COG0834
MKKTLLTAILLGASVAASAQELTFAMQPSYPPFETTNAKGEIIGFDVDVTNAICQEIQATCKFKSETFDALIPNLKAKRF
DAAISAIDITDARAKQVLFSDAYYDSSASYVALKGKATLESAKNIGVQNGTTFQQYTVAETKQYSPKSYASLQNAILDLK
SGRIDIIFGDTAVLADMISKEPEIQFIGEKVTNKKYFGNGLGIAMHKSNKDLAAQLNKGLAAIKANGEYQKIYDKWITK
>Q9Z869 ~~~artJ~~~Probable ABC transporter arginine-binding protein ArtJ~~~COG0834
MIKQIGRFFRAFIFIMPLSLTSCESKIDRNRIWIVGTNATYPPFEYVDAQGEVVGFDIDLAKAISEKLGKQLEVREFAFD
ALILNLKKHRIDAILAGMSITPSRQKEIALLPYYGDEVQELMVVSKRSLETPVLPLTQYSSVAVQTGTFQEHYLLSQPGI
CVRSFDSTLEVIMEVRYGKSPVAVLEPSVGRVVLKDFPNLVATRLELPPECWVLGCGLGVAKDRPEEIQTIQQAITDLKS
EGVIQSLTKKWQLSEVAYE
>O84385 ~~~artJ~~~Probable ABC transporter arginine-binding protein ArtJ~~~
MCIKRKKTWIAFLAVVCSFCLTGCLKEGGDSNSEKFIVGTNATYPPFEFVDKRGEVVGFDIDLAREISNKLGKTLDVREF
SFDALILNLKQHRIDAVITGMSITPSRLKEILMIPYYGEEIKHLVLVFKGENKHPLPLTQYRSVAVQTGTYQEAYLQSLS
EVHIRSFDSTLEVLMEVMHGKSPVAVLEPSIAQVVLKDFPALSTATIDLPEDQWVLGYGIGVASDRPALALKIEAAVQEI
RKEGVLAELEQKWGLNN
>P30860 ~~~artJ~~~ABC transporter arginine-binding protein 1~~~COG0834
MKKLVLAALLASFTFGASAAEKINFGVSATYPPFESIGANNEIVGFDIDLAKALCKQMQAECTFTNHAFDSLIPSLKFRK
YDAVISGMDITPERSKQVSFTTPYYENSAVVIAKKDTYKTFADLKGKRIGMENGTTHQKYIQDQHPEVKTVSYDSYQNAF
IDLKNGRIDGVFGDTAVVNEWLKTNPQLGVATEKVTDPQYFGTGLGIAVRPDNKALLEKLNNALAAIKADGTYQKISDQW
FPQ
>P54537 ~~~artM~~~Arginine transport ATP-binding protein ArtM~~~COG1126
MIKVEKLSKSFGKHEVLKNISTTIAEGEVVAVIGPSGSGKSTFLRCLNLLEKPNGGTITIKDTEITKPKTNTLKVRENIG
MVFQHFHLFPHKTVLENIMYAPVNVKKESKQAAQEKAEDLLRKVGLFEKRNDYPNRLSGGQKQRVAIARALAMNPDIMLF
DEPTSALDPEMVKEVLQVMKELVETGMTMVIVTHEMGFAKEVADRVLFMDQGMIVEDGNPKEFFMSPKSKRAQDFLEKIL
>P0AE30 ~~~artM~~~Arginine ABC transporter permease protein ArtM~~~COG4160
MFEYLPELMKGLHTSLTLTVASLIVALILALIFTIILTLKTPVLVWLVRGYITLFTGTPLLVQIFLIYYGPGQFPTLQEY
PALWHLLSEPWLCALIALSLNSAAYTTQLFYGAIRAIPEGQWQSCSALGMSKKDTLAILLPYAFKRSLSSYSNEVVLVFK
STSLAYTITLMEVMGYSQLLYGRTYDVMVFGAAGIIYLVVNGLLTLMMRLIERKALAFERRN
>P54535 ~~~artP~~~Arginine-binding extracellular protein ArtP~~~COG0834
MKKWLLLLVAACITFALTACGSSNSGSESGKKKLIMGTSADYKPFEYKEGDNIVGFDVELAKALAKKAGYEIEVQDMDFN
SLITALKSKQVDLVLSGMTPTPERKKQVDFSDVYYTANHMIVSKKDSGIQSLKDLKGKTVGVQLGSIQEEKGKELSPEYG
FKTEDRNRISDLVQEIKSDRFDAAIIEDIVAEGYFKSNDDLQGFVIPDAKAEEAGSAIAFRKDSELTDKFNKALKEMEDN
GELEKLKKKWFTGEK
>P0AAF6 7.4.2.-~~~artP~~~Arginine transport ATP-binding protein ArtP~~~COG1126
MSIQLNGINCFYGAHQALFDITLDCPQGETLVLLGPSGAGKSSLLRVLNLLEMPRSGTLNIAGNHFDFTKTPSDKAIRDL
RRNVGMVFQQYNLWPHLTVQQNLIEAPCRVLGLSKDQALARAEKLLERLRLKPYSDRYPLHLSGGQQQRVAIARALMMEP
QVLLFDEPTAALDPEITAQIVSIIRELAETNITQVIVTHEVEVARKTASRVVYMENGHIVEQGDASCFTEPQTEAFKNYL
SH
>P54536 ~~~artQ~~~Arginine transport system permease protein ArtQ~~~COG0765
MNLDFSATIPQIPFILEGLAITLKIVVVSAIIGLILGIVLSLCKISTFRPFIWIADFYTSVFRGTPLVLQLMIVYFGLPQ
LLGFQIDQFWAAVVALSLNSAAYVSEIIRAGINAIDKGQKEAAVALGVPYGKMMKDLLLPQAFKNISPAIVNELITLTKE
SAIVTVIGLGDVMRRAYQAGAATYNYLEPLIIAGLIYYVLVLILTFIGKAVERKLKSND
>P0AE34 ~~~artQ~~~Arginine ABC transporter permease protein ArtQ~~~COG4215
MNEFFPLASAAGMTVGLAVCALIVGLALAMFFAVWESAKWRPVAWAGSALVTILRGLPEILVVLFIYFGSSQLLLTLSDG
FTINLGFVQIPVQMDIENFDVSPFLCGVIALSLLYAAYASQTLRGALKAVPVGQWESGQALGLSKSAIFFRLVMPQMWRH
ALPGLGNQWLVLLKDTALVSLISVNDLMLQTKSIATRTQEPFTWYIVAAAIYLVITLLSQYILKRIDLRATRFERRPS
>O30508 2.6.1.11~~~aruC~~~Succinylornithine transaminase/acetylornithine aminotransferase~~~
MSAPHAQVERADFDRYMVPNYAPAAFIPVRGEGSRVWDQSGRELIDFAGGIAVTSLGHAHPALVKALTEQAQRIWHVSNV
FTNEPALRLARKLVDATFAERVFLANSGAEANEAAFKLARRYANDVYGPQKYEIIAASNSFHGRTLFTVNVGGQPKYSDG
FGPKFEGITHVPYNDLEALKAAISDKTCAVVLEPIQGEGGVLPAQQAYLEGARKLCDEHNALLVFDEVQSGMGRVGELFA
YMHYGVVPDILSSAKSLGGGFPIGAMLTTGEIAKHLSVGTHGTTYGGNPLASAVAEAALDVINTPEVLDGVKAKHERFKS
RLQKIGQEYGIFDEIRGMGLLIGAALTDEWKGKARDVLNAAEKEAVMVLQASPDVVRFAPSLVIDDAEIDEGLERFERAV
AKLVRG
>Q9HUI9 2.6.1.84~~~aruH~~~Arginine--pyruvate transaminase AruH~~~
MRYSDFTQRIAGDGAAAWDIHYRALARVEQGEEILLLSVGDPDFDTPAPIVQAAIDSLLAGNTHYADVRGKRALRQRIAE
RHRRRSGQAVDAEQVVVLAGAQCALYAVVQCLLNPGDEVIVAEPMYVTYEAVFGACGARVVPVPVRSENGFRVQAEEVAA
LITPRTRAMALNSPHNPSGASLPRATWEALAELCMAHDLWMISDEVYSELLFDGEHVSPASLPGMADRTATLNSLSKSHA
MTGWRVGWVVGPAALCAHLENLALCMLYGSPEFIQDAACTALEAPLPELEAMREAYRRRRDLVIECLADSPGLRPLRPDG
GMFVMVDIRPTGLSAQAFADRLLDRHGVSVLAGEAFGPSAAGHIRLGLVLGAEPLREACRRIALCAAELLGQA
>Q9HUI8 4.1.1.75~~~aruI~~~Probable 2-ketoarginine decarboxylase AruI~~~
MGARALRRERRLRWSPNWTRILPMQPQKTLTAGQALVRLLANYGVDTVFGIPGVHTLELYRGLPGSGIRHVLTRHEQGAG
FMADGYARVSGKPGVCFVITGPGVTNVATAIGQAYADSVPLLVISSVNHSASLGKGWGCLHETQDQRAMTAPITAFSALA
LSPEQLPELIARAYAVFDSERPRPVHISIPLDVLAAPVAHDWSAAVARRPGRGVPCSEALRAAAERLAAARRPMLIAGGG
ALAAGEALAALSERLAAPLFTSVAGKGLLPPDAPLNAGASLCVAPGWEMIAEADLVLAVGTEMADTDFWRERLPLSGELI
RVDIDPRKFNDFYPSAVALRGDARQTLEALLVRLPQEARDSAPAAARVARLRAEIRAAHAPLQALHQAILDRIAAALPAD
AFVSTDMTQLAYTGNYAFASRAPRSWLHPTGYGTLGYGLPAGIGAKLGAPQRPGLVLVGDGGFLYTAQELATASEELDSP
LVVLLWNNDALGQIRDDMLGLDIEPVGVLPRNPDFALLGRAYGCAVRQPQDLDELERDLRAGFGQSGVTLIELRHACAR
>P9WL89 ~~~~~~Putative arylamide transporter~~~COG4129
MSASLLVRTACGGRAVAQRLRTVLWPITQTSVVAGLAWYLTHDVFNHPQAFFAPISAVVCMSATNVLRARRAQQMIVGVA
LGIVLGAGVHALLGSGPIAMGVVVFIALSVAVLCARGLVAQGLMFINQAAVSAVLVLVFASNGSVVFERLFDALVGGGLA
IVFSILLFPPDPVVMLCSARADVLAAVRDILAELVNTVSDPTSAPPDWPMAAADRLHQQLNGLIEVRANAAMVARRAPRR
WGVRSTVRDLDQQAVYLALLVSSVLHLARTIAGPGGDKLPTPVHAVLTDLAAGTGLADADPTAANEHAAAARATASTLQS
AACGSNEVVRADIVQACVTDLQRVIERPGPSGMSA
>P0DTW2 ~~~~~~Pumilarin~~~
MTETKNEIKLHVLFGALAVGFLMLALFSFSLQMLPVADLAKEFGIPGSVAAVVLNVVEAGGAVTTIVSILTAVGSGGLSL
IAAAGKETIRQYLKNEIKKKGRKAVIAW
>P17953 ~~~asa1~~~Aggregation substance~~~
MKQQTEVKKRFKMYKAKKHWVVAPILFIGVLGVVGLATDDVQAAELDTQPGTTTVQPDNPDPQVGSTTPKTAVTEEATVQ
KDTTSQPTKVEEVASEKNGAEQSSATPNDTTNAQQPTVGAEKSAQEQPVVSPETTNEPLGQPTEVAPAENEANKSTSIPK
EFETPDVDKAVDEAKKDPNITVVEKPAEDLGNVSSKDLAAKEKEVDQLQKEQAKKIAQQAAELKAKNEKIAKENAEIAAK
NKAEKERYEKEVAEYNKHKNENGYVAKPVNKTLIFDREATKNSKVVSVKAAEYIDAKKLTDKHKDKKLLISMLSVDSSGL
TTKDSKKAHFYYNNGAGGTLVVLHKNQPVTITYGNLNASYLGKKIASAEFQYTVKATPDSKGRLNAFLHDDPVATIVYGI
NIDPRTKKAGAEIEMLVRFFGEDGKEILPTKENPFVFSGASLNSRGENITYEFVKVGNTDTVHEINGSKVARHGNKVYSK
TDIDVGTNGISISDWEAVQGKEYIGATVISTPNRIKFTFGNEIVNNPGYDGNSMWFAFNTDLKAKSITPYQEKGRPKQPE
KATIEFNRYKANVVPVLVPNKEVTDGQKNINDLNVKRGDSLQYIVTGDTTELAKVDPKTVTKQGIRDTFDAEKVTIDLSK
VKVYQADASLNEKDLKAVAAAINSGKAKDVTASYDLHLDQNTVTAMMKTNADDSVVLAMGYKYLLVLPFVVKNVEGDFEN
TAVQLTNDGETVTNTVINHVPGSNPSKDVKADKNGTVGSVSLHDKDIPLQTKIYYEVKSSERPANYGGITEEWGMNDVLD
TTHDRFTGKWHAITNYDLKVGDKTLKAGTDISAYILLENKDNKDLTFTMNQALLAALNEGSNKVGKQAWSVYLEVERIKT
GDVENTQTENYNKELVRSNTVVTHTPDDPKPTKAVHNKKGEDINHGKVARGDVLSYEMTWDLKGYDKDFAFDTVDLATGV
SFFDDYDETKVTPIKDLLRVKDSKGEDITNQFTISWDDAKGTVTISAKDPQAFILAHGGQELRVTLPTKVKANVSGDVYN
LAEQNTFGQRIKTNTVVNHIPKVNPKKDVVIKVGDKQSQNGATIKLGEKFFYEFTSSDIPAEYAGIVEEWSISDKLDVKH
DKFSGQWSVFANSTFVLADGTKVNKGDDISKLFTMTFEQGVVKITASQAFLDAMNLKENKNVAHSWKAFIGVERIAAGDV
YNTIEESFNNEKIKTNTVVTHTPEKPQTPPEKTVIVPPTPKTPQAPVEPLVVEKASVVPELPQTGEKQNVLLTVAGSLAA
MLGLAGLGFKRRKETK
>A0A0F7RJ52 6.3.2.-~~~asbA~~~Spermidine-citrate ligase~~~
MKHAKQIAEHATIQSFLNCYLRETGSGEWITEDKRIEDIFYHLFQRDTCSTYLCCRLSAQNITLYGEVIYKSPTDRHLFG
EQFYYQMGDSNSVMKADYVTVITFLIKEMSINYGEGTNPAELMLRVIRSCQNIEEFTKERKEDTSALYGFHTSFIEAEQS
LLFGHLTHPTPKSRQGILEWKSAMYSPELKGECQLHYFRAHKSIVNEKSLLLDSTTVILKEELRNDEMVSKEFISKYCNE
DEYSLLPIHPLQAEWLLHQPYVQDWIEQGVLEYIGPTGKCYMATSSLRTLYHPDAKYMLKFSFPVKVTNSMRINKLKELE
SGLEGKAMLNTAIGEVLEKFPGFDFICDPAFITLNYGTQESGFEVIIRENPFYSEHADDATLIAGLVQDAIPGERTRLSN
IIHRLADLESRSCEEVSLEWFRRYMNISLKPMVWMYLQYGVALEAHQQNSVVQLKDGYPVKYYFRDNQGFYFCNSMKEML
NNELAGIGERTGNLYDDYIVDERFRYYLIFNHMFGLINGFGTAGLIREEILLTELRTVLESFLPYNREPSTFLRELLEED
KLACKANLLTRFFDVDELSNPLEQAIYVQVQNPLVREVAVRS
>Q81RQ8 6.3.2.-~~~asbB~~~Citryl-spermidine/3,4-dihydroxybenzoyl-citryl-spermidine:spermidine ligase~~~
MRMDMYHTKILKAIESEDYISVRRRVLRQLVESLIYEGIITPARIEKEEQILFLIQGLDEDNKSVTYECYGRERITFGRI
SIDSLIVRVQDGKQEIQSVAQFLEEVFRVVNVEQTKLDSFIHELEQTIFKDTIAQYERCNKLKYTQKSYDELENHLIDGH
PYHPSYKARIGFQYRDNFRYGYEFMRPIKLIWIAAHKKNATVGYENEVIYDKILKSEVGERKLEAYKERIHSMGCDPKQY
LFIPVHPWQWENFIISNYAEDIQDKGIIYLGESADDYCAQQSMRTLRNVTNPKRPYVKVSLNILNTSTLRTLKPYSVASA
PAISNWLSNVVSQDSYLRDESRVILLKEFSSVMYDTNKKATYGSLGCIWRESVHHYLGEQEDAVPFNGLYAKEKDGTPII
DAWLNKYGIENWLRLLIQKAIIPVIHLVVEHGIALESHGQNMILVHKEGLPVRIALKDFHEGLEFYRPFLKEMNKCPDFT
KMHKTYANGKMNDFFEMDRIECLQEMVLDALFLFNVGELAFVLADKYEWKEESFWMIVVEEIENHFRKYPHLKDRFESIQ
LYTPTFYAEQLTKRRLYIDVESLVHEVPNPLYRARQLNIQKSVATGGNYANC
>Q81RQ7 6.2.1.62~~~asbC~~~3,4-dihydroxybenzoate--[aryl-carrier protein] ligase~~~
MLIVNREEYSKSDFDLRLQAYEEMEQFQEAAGNRFALCLKDPFDIITLVFFLKEKKSSVLLIHEDTPKETAIEMAKRANC
IGILYGENSDFTKLEAVNYLAEEPSLLQYSSGTTGEPKLIRRAWTEVDTEIKVYNEALNCDIDEVPIVMAPVSHSYGLIC
GTLSAITRGSKPIIITNKNPKFALNIVRNTEKHIVYAVPLMLHIMGSFPQGTFQFHKIMTSGAPLPEALFYKLKETTTYM
MQQYGCSEAGCISICHDMKSHLDLGNPLPHASISIGSDENAPEEIIVKMNDKEIFTKDLGYKSERGLHFMGRMDDVINVS
GLKVFPIEVEETMLRLEGVQEAIVYRGKHPVMGEIVKAKVISHIDPVQIREWCMQHLPSYKVPHEIESVTEIPKNKTGKV
SRKLLEMGEVTT
>A0A0J1I1I3 ~~~asbD~~~Acyl carrier protein AsbD~~~COG0236
MRREALKNAVLKIMTEKMELKNVTHLEETMRLNQDLYIDSVMMLQLIVYIEMDVKLCVPEDEVDPKAFLTVGSLLDFMEE
LQPLQDVNVNN
>Q81RQ5 2.3.2.-~~~asbE~~~Petrobactin synthase~~~
MTSIKVHCLVSCFCEIIKRRSDIDFRPFYFGLWDGDFDITEGGIISYHSENINHDHYLLWYEKLYGMKVNEWYDHAKDKD
SNVETFLQLVENKPENRYVIVMVDMSLLPERENKFHQKPFPHYLMISETEKEEEWFMLDPDFRWEGNMEREKVLYSVQDN
PFGGGYFIDVEEIQEPTAEMVASYFIETFKRNDNELTMELKNLIIKMANEEEGYLLSGLVAAVKQIPVLAIRKYSYEHAF
AYFRETLQYSEQEFDYWCDRVEDIVQGFTNVQYRAIKMAMTNNKGMLLSIVEKLDEMNAIELQIKTELERQFLSWKEMKS
NESVLVF
>Q81RQ4 4.2.1.118~~~asbF~~~3-dehydroshikimate dehydratase~~~COG1082
MKYSLCTISFRHQLISFTDIVQFAYENGFEGIELWGTHAQNLYMQEYETTERELNCLKDKTLEITMISDYLDISLSADFE
KTIEKCEQLAILANWFKTNKIRTFAGQKGSADFSQQERQEYVNRIRMICELFAQHNMYVLLETHPNTLTDTLPSTLELLG
EVDHPNLKINLDFLHIWESGADPVDSFQQLRPWIQHYHFKNISSADYLHVFEPNNVYAAAGNRTGMVPLFEGIVNYDEII
QEVRDTDHFASLEWFGHNAKDILKAEMKVLTNRNLEVVTS
>Q66DP5 1.17.1.-~~~ascD~~~CDP-6-deoxy-L-threo-D-glycero-4-hexulose-3-dehydrase reductase~~~
MSLNVKLHPSGIIFTSDGTSTILDAALDSNIHIEYSCKDGTCGSCKAILISGEVDSAENTFLTEEDVAKGAILTCCSKAK
SDIELDVNYYPELSHIQKKTYPCKLDSIEFVGEDIAILSLRLPPTAKIQYLAGQYIDLIINGQRRSYSIANAPGGNGNIE
LHVRKVVNGVFSNIIFNELKLQQLLRIEGPQGTFFVREDNLPIVFLAGGTGFAPVKSMVEALINKNDQRQVHIYWGMPAG
HNFYSDIANEWAIKHPNIHYVPVVSGDDSTWTGATGFVHQAVLEDIPDLSLFNVYACGSLAMITAARNDFINHGLAENKF
FSDAFVPSK
>P24242 ~~~ascG~~~HTH-type transcriptional regulator AscG~~~COG1609
MTTMLEVAKRAGVSKATVSRVLSGNGYVSQETKDRVFQAVEESGYRPNLLARNLSAKSTQTLGLVVTNTLYHGIYFSELL
FHAARMAEEKGRQLLLADGKHSAEEERQAIQYLLDLRCDAIMIYPRFLSVDEIDDIIDAHSQPIMVLNRRLRKNSSHSVW
CDHKQTSFNAVAELINAGHQEIAFLTGSMDSPTSIERLAGYKDALAQHGIALNEKLIANGKWTPASGAEGVEMLLERGAK
FSALVASNDDMAIGAMKALHERGVAVPEQVSVIGFDDIAIAPYTVPALSSVKIPVTEMIQEIIGRLIFMLDGGDFSPPKT
FSGKLIRRDSLIAPSR
>A0A0H3G0N3 3.1.-.-~~~~~~ASCH domain-containing ribonuclease~~~COG4933
MTDIPDRKEAVISLWPEFAKAIVSGKKTVEFRRRIPLPALSARIWIYATRPVKSVIGFAYLEAIVQGDVNTLWSRYGREA
FLSEQQYRDYFEGTEKATAFLLRDHQPIRPINLDQLKEIRANFQPPQSLTWLRKEETQKLVSLTSQVE
>Q4W470 ~~~ascH~~~Probable type 3 secretion system regulator AscH~~~
MKIEGSDQLGGEQPQRQPLPPESMAQRQLERLLAKAPESDLFERWQQGAPLEGLLAVVAPAAKRELLWQIYQQGNKPAPE
IGKQLFAPVTDKLIARFGERQSPVLDAIDLPELRATMREFDPLASRREKVLLNLLSELRDGQGAVPAEHLFLDALARREL
MTLIPLNGMVDNLMRNSHKLDLEA
>Q93QX0 2.6.1.1~~~asD~~~Bifunctional aspartate aminotransferase and L-aspartate beta-decarboxylase~~~
MSKDYQSLAKLSPFELKDELIKIASSDGNRLMLNAGRGNPNFLATTPRRAFFRLGLFAAAESELSYSYMTTVGVGGLAKI
DGIEGRFERYIAENRDQEGVRFLGKSLSYVRDQLGLDPAAFLHEMVDGILGCNYPVPPRMLNISEKIVRQYIIREMGADA
IPSESVNLFAVEGGTAAMAYIFESLKLNGLLKAGDKVAIGMPVFTPYIEIPELAQYALEEVAINADPSLNWQYPDSELDK
LKDPAIKIFFCVNPSNPPSVKMDQRSLERVRNIVAEHRPDLMILTDDVYGTFADDFQSLFAICPENTLLVYSFSKYFGAT
GWRLGVVAAHQQNVFDLALDKLQESEKVALDHRYRSLLPDVRSLKFIDRLVADSRAVALNHTAGLSTPQQVQMALFSLFA
LMDEADEYKHTLKQLIRRRETTLYRELGMPPLRDENAVDYYTLIDLQDVTAKLYGEAFSEWAVKQSSTGDMLFRIADETG
IVLLPGRGFGSNRPSGRASLANLNEYEYAAIGRALRKMADELYAEYSGQAQNL
>Q53IZ1 2.6.1.1~~~asD~~~Bifunctional aspartate aminotransferase and L-aspartate beta-decarboxylase~~~
MSKDYRSLANLSPFELKDELIKVASGKANRLMLNAGRGNPNFLATTPRRAFFRLGLFAAAESELSYSYMTVGVGGLAKLD
GIEGRFERFIAEHRDQEGVKFLGKSLSYVRDQLGLDPAAFLHEMVDGILGCNYPVPPRMLTVSEQIVRQYIVREMAGGAV
PPESVDLFAVEGGTAAMAYIFESLRISGLLKAGDKVAIGMPVFTPYIEIPELAQYDLKEVPIHADPDNGWQYSDAELDKL
KDPDVKIFFCVNPSNPPSVKMDQRSLDRVRAIVAEQRPDLLILTDDVYGTFADEFQSLFSVCPRNTLLVYSFSKYFGATG
WRLGVIAAHKDNVFDHALSQLPESAKKALDHRYRSLLPDVRSLKFIDRLVADSRVVALNHTAGLSTPQQVQMVLFSLFAL
MDEADAYKQALKQLIRRREATLYRELGMPPLENPNSVNYYTLIDLQNVTCRLYGEAFSQWAVQQSSTGDMLFRVADETGI
VLLPGRGFGSDRPSGRASLANLNEYEYAAIGRALRRLADELYEQYKALGKE
>P96677 ~~~aseR~~~HTH-type transcriptional repressor AseR~~~COG0640
MTIDVAAMTRCLKTLSDQTRLIMMRLFLEQEYCVCQLVDMFEMSQPAISQHLRKLKNAGFVNEDRRGQWRYYSINGSCPE
FDTLQLILHQIDQEDELLNHIKQKKTQACCQ
>Q8YQB1 3.4.19.5~~~~~~Isoaspartyl peptidase/L-asparaginase~~~COG1446
MKSQVQPKLIIHGGAGSSLHGKGGLEAVRQTLHAVVEEVYALLLSGVNASVAVVRGCQLLEDEPRFNAGTGSVLQSDGQI
RMSASIMDGALGRFSGVINVSRVKNPIELAQFLQNSPDRVLSDYGSAELAREMQIPSYNALTELRLQEWIQERQDNFKRT
MAGVIAEPELLETSNAGRGTIGVVALDTYGKLAVGTSTGGKGFERIGRVSDSAMPAGNYATSYAAVSCTGIGEDIIDECL
APKIVIRVTDGLSLQDSMQRSFAEAHDNKRDFGAIALDANGAIAWGKTCDIILAAFHDGEKIGDTLELAVGTQVGSIS
>P74383 3.4.19.5~~~~~~Isoaspartyl peptidase/L-asparaginase~~~COG1446
MTPKLIIHGGASSLDDKGGLATVRQSLHQIVAAVYETLTAGGSAMDAVVQGCELLENEPRFNAGTGSVLQSDGQVRMSAS
LMDGDRQNFSGVINVSRIKNPIQMAQFLQGQTDRILSDYGAADLAREMQLPIYDPATDFRIQEWMEERGEDVRKKMARLI
ADPTVGIEARKGTIGVVALDANGKIAAGTSTGGKGLERIGRVSDSAMPAGNYATRFAGVSCTGVGEDIINECLAAKVVIR
VKDGQNLAQAMAKSITEALENNTDLGAIALDHQGHIAWGKTCPVLLAAYHTGTAIGDTLELTDGDHYGNASILKLQKTVK
KIQTKTRGK
>H8L902 6.3.1.12~~~~~~D-aspartate ligase~~~
MMNSIENEEFIPILLGSDMNVYGMARSFNEAYGKICQAYASDQLAPTRYSKIVNVEVIPGFDKDPVFIETMLRLAKERYS
DKSKKYLLIACGDGYAELISQHKQELSEYFICPYIDYSLFERLINKVSFYEVCEEYDLPYPKTLIVREEMLVNGHLEQEL
PFEFPVALKPANSVEYLSVQFEGRKKAFILETREEFDLILGRIYEAGYKSEMIVQDFIPGDDSNMRVLNAYVDEDHQVRM
MCLGHPLLEDPTPASIGNYVVIMPDYNEKIYQTIKAFLEKIEYTGFANFDMKYDPRDGEYKLFEINLRQGRSSFFVTLNG
LNLARFVTEDRVFNKPFVETTYGTNQSDKARLWMGVPKKIFLEYARENEDKKLAEQMIKENRYGTTVFYEKDRSIKRWLL
MKYMFHNYIPRFKKYFHVKEG
>P00963 6.3.1.1~~~asnA~~~Aspartate--ammonia ligase~~~COG2502
MKTAYIAKQRQISFVKSHFSRQLEERLGLIEVQAPILSRVGDGTQDNLSGCEKAVQVKVKALPDAQFEVVHSLAKWKRQT
LGQHDFSAGEGLYTHMKALRPDEDRLSPLHSVYVDQWDWERVMGDGERQFSTLKSTVEAIWAGIKATEAAVSEEFGLAPF
LPDQIHFVHSQELLSRYPDLDAKGRERAIAKDLGAVFLVGIGGKLSDGHRHDVRAPDYDDWSTPSELGHAGLNGDILVWN
PVLEDAFELSSMGIRVDADTLKHQLALTGDEDRLELEWHQALLRGEMPQTIGGGIGQSRLTMLLLQLPHIGQVQCGVWPA
AVRESVPSLL
>P54420 6.3.5.4~~~asnB~~~Asparagine synthetase [glutamine-hydrolyzing] 1~~~COG0367
MCGFVGVFNKHPLAQTADQEELIKQMNQMIVHRGPDSDGYFHDEHVGFGFRRLSIIDVENGGQPLSYEDETYWIIFNGEI
YNYIELREELEAKGYTFNTDSDTEVLLATYRHYKEEAASKLRGMFAFLIWNKNDHVLYGARDPFGIKPLYYTTINDQVYF
ASERKSLMVAQNDIEIDKEALQQYMSFQFVPEPSTLDAHVKKVEPGSQFTIRPDGDITFKTYFKANFKPVQTEEDKLVKE
VRDAIYDSVNVHMRSDVPVGSFLSGGIDSSFIVSVAKEFHPSLKTFSVGFEQQGFSEVDVAKETAAALGIENISKVISPE
EYMNELPKIVWHFDDPLADPAAIPLYFVAKEAKKHVTVALSGEGADELFGGYNIYREPLSLKPFERIPSGLKKMLLHVAA
VMPEGMRGKSLLERGCTPLQDRYIGNAKIFEESVKKQLLKHYNPNLSYRDVTKTYFTESSSYSDINKMQYVDIHTWMRGD
ILLKADKMTMANSLELRVPFLDKVVFDVASKIPDELKTKNGTTKYLLRKAAEGIVPEHVLNRKKLGFPVPIRHWLKNEMN
EWVRNIIQESQTDAYIHKDYVLQLLEDHCADKADNSRKIWTVLIFMIWHSINIEKRYMPEELSHQPKEVIFV
>P22106 6.3.5.4~~~asnB~~~Asparagine synthetase B [glutamine-hydrolyzing]~~~COG0367
MCSIFGVFDIKTDAVELRKKALELSRLMRHRGPDWSGIYASDNAILAHERLSIVDVNAGAQPLYNQQKTHVLAVNGEIYN
HQALRAEYGDRYQFQTGSDCEVILALYQEKGPEFLDDLQGMFAFALYDSEKDAYLIGRDHLGIIPLYMGYDEHGQLYVAS
EMKALVPVCRTIKEFPAGSYLWSQDGEIRSYYHRDWFDYDAVKDNVTDKNELRQALEDSVKSHLMSDVPYGVLLSGGLDS
SIISAITKKYAARRVEDQERSEAWWPQLHSFAVGLPGSPDLKAAQEVANHLGTVHHEIHFTVQEGLDAIRDVIYHIETYD
VTTIRASTPMYLMSRKIKAMGIKMVLSGEGSDEVFGGYLYFHKAPNAKELHEETVRKLLALHMYDCARANKAMSAWGVEA
RVPFLDKKFLDVAMRINPQDKMCGNGKMEKHILRECFEAYLPASVAWRQKEQFSDGVGYSWIDTLKEVAAQQVSDQQLET
ARFRFPYNTPTSKEAYLYREIFEELFPLPSAAECVPGGPSVACSSAKAIEWDEAFKKMDDPSGRAVGVHQSAYK
>P0ACI6 ~~~asnC~~~Regulatory protein AsnC~~~COG1522
MENYLIDNLDRGILEALMGNARTAYAELAKQFGVSPGTIHVRVEKMKQAGIITGARIDVSPKQLGYDVGCFIGIILKSAK
DYPSALAKLESLDEVTEAYYTTGHYSIFIKVMCRSIDALQHVLINKIQTIDEIQSTETLIVLQNPIMRTIKP
>P42113 6.3.5.4~~~asnH~~~Asparagine synthetase [glutamine-hydrolyzing] 2~~~COG0367
MCGLAGIINLAAPRSQECTFHILKGMADAISYRGPDDEQYHIDSKVGFAFRRLSILDLVNGQQPFLNEDGSIVVMVNGEI
YNYKELKASLHNHMFKTTSDCEVIVHLYEEKGIGFVDDIIGMFSIAIWDKNKNKVFLVRDRFGIKPLFYTELKHELIFAS
EIKSLFSHPHCPRQFNWKEALSDIWLSGEAASNHKETTSFFVNIQNLDAGHYLEINLTTNERKTASYWSLQDILLRQGYR
ENLHPDDLIEGYRELLADSVHRCLQSDVEVGLFLSGGIDSAAVAHFAAEKQDLHTFTVLSQSTFTNEDAKYAHWLAKDLH
LPNHQVLYQLGNDELLQPESYKHLLWICETPFCGPEQLYKFHLHKYAKAIRPNLKVMLTGQGSDEFNGGYSTTLSPAENP
SWEGFIESVNTMEMNRLHRLQGNIFRVWEEHFGLSPINLSYLKSNDSSQADPWQSYVLTKYRDLQMYNCWHEDRIAAANH
IENRVPFLDHRLVEWVCGIPDGLRKDLLWDKSVLRKSLTNELHTSYTHRPKVPFFYGKDVRYTHKMMFHLLKKNNYQLIE
EAFSHSDASSIIQVEHIHAIMTYLEDDPEFTNFEFLLRLVNMGLLSKMTKETPSVQLDITSHLESITIKDWHSQEGDIAS
RLNISANKCEGQDILALNPGVTLLRPESDSEHCIYIAEEGFIQFIVSEEDVGAWLHILCDINGKDTLHTILDRHGVSLEE
VAKYIQEAIEHNIILIKQKNLPEGAYR
>P64248 6.3.5.4~~~asnB~~~Putative asparagine synthetase [glutamine-hydrolyzing]~~~
MCGLLAFVAAPAGAAGPEGADAASAIARASHLMRHRGPDESGTWHAVDGASGGVVFGFNRLSIIDIAHSHQPLRWGPPEA
PDRYVLVFNGEIYNYLELRDELRTQHGAVFATDGDGEAILAGYHHWGTEVLQRLRGMFAFALWDTVTRELFCARDPFGIK
PLFIATGAGGTAVASEKKCLLDLVELVGFDTEIDHRALQHYTVLQYVPEPETLHRGVRRLESGCFARIRADQLAPVITRY
FVPRFAASPITNDNDQARYDEITAVLEDSVAKHMRADVTVGAFLSGGIDSTAIAALAIRHNPRLITFTTGFEREGFSEID
VAVASAEAIGARHIAKVVSADEFVAALPEIVWYLDEPVADPALVPLFFVAREARKHVKVVLSGEGADELFGGYTIYREPL
SLRPFDYLPKPLRRSMGKVSKPLPEGMRGKSLLHRGSLTLEERYYGNARSFSGAQLREVLPGFRPDWTHTDVTAPVYAES
AGWDPVARMQHIDLFTWLRGDILVKADKITMANSLELRVPFLDPEVFAVASRLPAGAKITRTTTKYALRRALEPIVPAHV
LHRPKLGFPVPIRHWLRAGELLEWAYATVGSSQAGHLVDIAAVYRMLDEHRCGSSDHSRRLWTMLIFMLWHAIFVEHSVV
PQISEPQYPVQL
>P9WN33 6.3.5.4~~~asnB~~~Putative asparagine synthetase [glutamine-hydrolyzing]~~~COG0367
MCGLLAFVAAPAGAAGPEGADAASAIARASHLMRHRGPDESGTWHAVDGASGGVVFGFNRLSIIDIAHSHQPLRWGPPEA
PDRYVLVFNGEIYNYLELRDELRTQHGAVFATDGDGEAILAGYHHWGTEVLQRLRGMFAFALWDTVTRELFCARDPFGIK
PLFIATGAGGTAVASEKKCLLDLVELVGFDTEIDHRALQHYTVLQYVPEPETLHRGVRRLESGCFARIRADQLAPVITRY
FVPRFAASPITNDNDQARYDEITAVLEDSVAKHMRADVTVGAFLSGGIDSTAIAALAIRHNPRLITFTTGFEREGFSEID
VAVASAEAIGARHIAKVVSADEFVAALPEIVWYLDEPVADPALVPLFFVAREARKHVKVVLSGEGADELFGGYTIYREPL
SLRPFDYLPKPLRRSMGKVSKPLPEGMRGKSLLHRGSLTLEERYYGNARSFSGAQLREVLPGFRPDWTHTDVTAPVYAES
AGWDPVARMQHIDLFTWLRGDILVKADKITMANSLELRVPFLDPEVFAVASRLPAGAKITRTTTKYALRRALEPIVPAHV
LHRPKLGFPVPIRHWLRAGELLEWAYATVGSSQAGHLVDIAAVYRMLDEHRCGSSDHSRRLWTMLIFMLWHAIFVEHSVV
PQISEPQYPVQL
>O05272 6.3.5.4~~~asnO~~~Asparagine synthetase [glutamine-hydrolyzing] 3~~~COG0367
MCGITGWVDFKKQLVQEKQTMDRMTDTLSKRGPDDSNVWGEHHVLFGHKRLAVVDIEGGRQPMACTYKGDTYTIIYNGEL
YNTEDLRKELRARGHQFERTSDTEVLLHSYIEWQEDCVDHLNGIFAFAVWDEKRNLLFAARDRLGVKPFFYTKEGSSFLF
GSEIKAILAHPDIKARVDRTGLSEIFGLGPSRTPGTGIFKGIKEIRPAHALTFSKDGLNIWRYWNVESEKHTDSFDDTVA
NVRSLFQDAVTRQLVSDVPVCTFLSGGLDSSAITAIAAGHFEKEGKAPLHTYSIDYEENDKYFQASAFQPNDDGPWIEKM
TEAFGTTHHKCVISQKDLVDHLEEAVLVKDLPGMADVDSSLLWFCREIKKDFVVSLSGECADEIFGGYPWFHTADVESGF
PWMRSTEERIKLLSDSWQKKLNLKEYVNAKYEETLAETPLLDGETGVDKARRQLFYLNMLWFMTNLLDRKDRMSMGASLE
VRVPFADHRLVEYVWNIPWEMKMHDNREKGILRKALEGILPDDILYRKKSPYPKTHHPEYTKGVSEWLKTIRSQKDSVLH
TLLDRKQLDQLLETEGSSFKVPWFGQLMKGPQLIAHLAQIHTWFEAYRIDIDER
>Q9Z4Z5 1.14.11.39~~~asnO~~~L-asparagine oxygenase~~~COG2175
MAANAAGPASRYDVTLDQSDAELVEEIAWKLATQATGRPDDAEWVEAARNAWHAWPATLRRDLAGFRRDSGPDGAIVLRG
LPVDSMGLPPTPRVNGSVQREASLGAAVLLMTACGLGDPGAFLPEKNGALVQDVVPVPGMEEFQGNAGSTLLTFHNENAF
HEHRPDFVMLLCLRADPTGRAGLRTACVRRVLPLLSDSTVDALWAPEFRTAPPPSFQLSGPEEAPAPVLLGDRSDPDLRV
DLAATEPVTERAAEALRELQAHFDATAVTHRLLPGELAIVDNRVTVHGRTEFTPRYDGTDRWLQRTFVLTDLRRSRAMRP
ADGYVLGAAPQPA
>Q9AET9 ~~~asp1~~~Accessory Sec system protein Asp1~~~
MYYFIPSWSGSGKRVWHRDIIPWYRSMQRLEFDDTIHQIRIFHSENLPVKLLLQAYMPHARYFLHRQDIFETEYYSVFDE
IQAVESNDMQVLQIKDLEWEDDCEFIYTPFLIIVRRQGQLYAHVEFGVEGFISFIKFFKDDQLEKLNIFDDRGFVSSIVY
YEDGQEVCQDYLNPNGDWRIREYLKFENSHVVVNPVFSRDFDKLEYECMPDLILEKLGYYISHNVEEDSRFVVAAQPFTN
QGVLDLLPQHSHSILSFFHERNQASNIENLKADLEYADLVLTDRMDFKETLQNYFPLQAEKIHYLSPFDTRLQLGKSQQR
HESKIFYQIDLSELLNDYAIFKVLFYVAQHPDTELVIGVYNAWQEGIKQVENKVEELISDYLDLKDFIKKSFKNNQAENP
LPENQELEYRFRIRNITDELSLIQELDDTRLIIDLSQQPNLYTQIAGISAGIPQINLVASDYVTHLQNGYILDSISQLAV
AADYYLQGLKNWNQALIYSIEKIKLNTGHQVIKRWEKWLKEAIDEK
>P80485 ~~~~~~Acid shock protein~~~
MLNKIQHRNLNTYSVTPFDFFEEFSRNLFNDFKPNFIKTDIHETDNEYLVEAELPGIPKENIQVTYENGVLTISGQQQID
AVNEDKKGKLIRSERSLTSVQRQYLLENVKEDEIKASYSDGVLKVTLPKDSNKEIKKSISIE
>P99157 ~~~~~~Alkaline shock protein 23~~~
MTVDNNKAKQAYDNQTGVNEKEREERQKQQEQNQEPQFKNKLTFSDEVVEKIAGIAAREVKGILDMKGGLTDTFTNAFSS
GNNVTQGVSVEVGEKQAAVDLKVILEYGESAPKIFRKVTELVKEQVKYITGLDVVEVNMQVDDVMTQKEWKQKHEKNNEN
NNQERQGLQ
>P0A0P8 ~~~~~~Alkaline shock protein 23~~~
MTVDNNKAKQAYDNQTGVNEKEREERQKQQEQNQEPQFKNKLTFSDEVVEKIAGIAAREVKGILDMKGGLTDTFTNAFSS
GNNVTQGVSVEVGEKQAAVDLKVILEYGESAPKIFRKVTELVKEQVKYITGLDVVEVNMQVDDVMTQKEWKQKHEKNNEN
NNQERQGLQ
>Q9AET8 ~~~asp2~~~Accessory Sec system protein Asp2~~~
MKNKLKILQIGSIDWSKEVVIPDNMDWYYFFSLTLRLAIKKVMEMEKINHFSAIIVDDLDLIPDLFLIESRIIPYTIFYS
KKQQAIQEPIAFFLKRYCAQQIDLSDRPNLLRKLSKALFRGQYGDKMTPLDMVVSPGFKGRICHNGYENLELEGNFGSDF
RPIVSWKYNIVASKKNPVEIWLEYEKDLSCELRLRIYNIQEGSAADLVRESVFSETDMEETIVLDNDFTSFLGITLEARG
FGTLKIGAFHQRLTRYQFGKFVLGGKILKDSHRQEINYFFYPGDFKPPLVVYFSGYRRAEGFEGFGMMRGLGCPFLLISD
QRLDGGVFYLGSDELEEGIRRIIQEHMELLGFSERELILSGISMGTYGAAYYGADFSPRAIILCKPLANLGTIAQRGRLR
LPEVFPMALDILHRHTGGKDRENVMELDNRYWKKFKKADFSRTIFGLAYMKEEDYDPTAYEDLVQYLYPTETQLMSNGLS
GRHNDDSTMVINWFMNYHRIILEKEFGRKK
>Q9AET7 ~~~asp3~~~Accessory Sec system protein Asp3~~~
MKIQKHKEIYWGELRGASISKTRKDFTYLYGSTIIFHSPDQVYFENKLIASGQTIHEWSSSWNYQGDRQVPSLPLLKRGR
SYSLTRDMTSYPSESVFLKLIFFDRYNREVSNHVERSDKMTFTYPEEAYSYKVQLLSAGVESFEFHCLRIEEILEESNG
>Q8NTR2 2.6.1.1~~~~~~Aspartate aminotransferase~~~COG1167
MRRYAVMSSVSLQDFDAERIGLFHEDIKRKFDELKSKNLKLDLTRGKPSSEQLDFADELLALPGKGDFKAADGTDVRNYG
GLDGIVDIRQIWADLLGVPVEQVLAGDASSLNIMFDVISWSYIFGNNDSVQPWSKEETVKWICPVPGYDRHFSITERFGF
EMISVPMNEDGPDMDAVEELVKNPQVKGMWVVPVFSNPTGFTVTEDVAKRLSAMETAAPDFRVVWDNAYAVHTLTDEFPE
VIDIVGLGEAAGNPNRFWAFTSTSKITLAGAGVSFFLTSAENRKWYTGHAGIRGIGPNKVNQLAHARYFGDAEGVRAVMR
KHAASLAPKFNKVLEILDSRLAEYGVAQWTVPAGGYFISLDVVPGTASRVAELAKEAGIALTGAGSSYPLRQDPENKNLR
LAPSLPPVEELEVAMDGVATCVLLAAAEHYAN
>O69689 2.6.1.1~~~~~~Aspartate aminotransferase~~~COG1167
MSFDSLSPQELAALHARHQQDYAALQGMKLALDLTRGKPSAEQLDLSNQLLSLPGDDYRDPEGTDTRNYGGQHGLPGLRA
IFAELLGIAVPNLIAGNNSSLELMHDIVAFSMLYGGVDSPRPWIQEQDGIKFLCPVPGYDRHFAITETMGIEMIPIPMLQ
DGPDVDLIEELVAVDPAIKGMWTVPVFGNPSGVTYSWETVRRLVQMRTAAPDFRLFWDNAYAVHTLTLDFPRQVDVLGLA
AKAGNPNRPYVFASTSKITFAGGGVSFFGGSLGNIAWYLQYAGKKSIGPDKVNQLRHLRFFGDADGVRLHMLRHQQILAP
KFALVAEVLDQRLSESKIASWTEPKGGYFISLDVLPGTARRTVALAKDVGIAVTEAGASFPYRKDPDDKNIRIAPSFPSV
PDLRNAVDGLATCALLAATETLLNQGLASSAPNVR
>P31339 3.4.21.-~~~aspA~~~Microbial serine proteinase~~~
MRKTSLALAISALLSALPIASVQANESCTPLTGKEAGLDTGRSSAVRCLPGINPLQDLLNSGQNAFSPRGGMAGNDLNLW
WAHRTEVLGQGINVAVVDDGLAIAHPDLADNVRPGSKNVVTGGSDPTPTDPDRCPRHSVSGIIAAVDNSIGTLGVAPRVQ
LQGFNLLDDNIQQLQKDWLYALGQRRHRRQPGLQPELRMSLVDPEGANGLDQVQLDRLFEQRTQHASAAYIKAAGTAFSR
IAAGNYVAQPHRNLPKLPFENSNIDPSNSNFWNLVVRAINADGVRSSYSSVGSNVFLSAPGGEYGTDAPAMVTTDLPGCD
MGYNRVDDPSTNRLHNNPQLDASCDYNGVMNGTSSATPNTTGAMVLMAPYPDLSVRDLRDLLARNATRLDANQGPVQINY
TAANGERRQVTGLEGWERNAAGLWYSPSYGFGLVDVNKTQPCSRQPRTAATTGAVALAKGKGNGRSPSAPSRYVGSSPTR
SSTQVDQPLTVEAVQVMVSLDHQRLPDLLIELVSPSGTRSVLLNPNNSLVGQSLDRQQLGYVRTKGLRDMRMLSHKFYGE
PAHGEWRLEVTDVANAAAQVSLLDRRTNTRSTLTEGNNSQPGQLLDWSRGYSVLGHDAARS
>P0AC38 4.3.1.1~~~aspA~~~Aspartate ammonia-lyase~~~COG1027
MSNNIRIEEDLLGTREVPADAYYGVHTLRAIENFYISNNKISDIPEFVRGMVMVKKAAAMANKELQTIPKSVANAIIAAC
DEVLNNGKCMDQFPVDVYQGGAGTSVNMNTNEVLANIGLELMGHQKGEYQYLNPNDHVNKCQSTNDAYPTGFRIAVYSSL
IKLVDAINQLREGFERKAVEFQDILKMGRTQLQDAVPMTLGQEFRAFSILLKEEVKNIQRTAELLLEVNLGATAIGTGLN
TPKEYSPLAVKKLAEVTGFPCVPAEDLIEATSDCGAYVMVHGALKRLAVKMSKICNDLRLLSSGPRAGLNEINLPELQAG
SSIMPAKVNPVVPEVVNQVCFKVIGNDTTVTMAAEAGQLQLNVMEPVIGQAMFESVHILTNACYNLLEKCINGITANKEV
CEGYVYNSIGIVTYLNPFIGHHNGDIVGKICAETGKSVREVVLERGLLTEAELDDIFSVQNLMHPAYKAKRYTDESEQ
>W5JXD7 ~~~~~~ASP external chaperone~~~
MNKPVTLLLATLLAPLSGQLCAQESVTMDGKQYSTIEVNGQTYLIPDNGSKKRVARSLDSKVPQQTLRRGDVLMQGAASP
ELTVSGTLLVEADDASAKALATRHGLNFKQSSGGIALLEAKPGTDLNAIATKLKSEGVNVQIELSGAEQQPK
>Q9X1X6 1.4.1.21~~~nadX~~~L-aspartate dehydrogenase~~~COG1712
MTVLIIGMGNIGKKLVELGNFEKIYAYDRISKDIPGVVRLDEFQVPSDVSTVVECASPEAVKEYSLQILKNPVNYIIIST
SAFADEVFRERFFSELKNSPARVFFPSGAIGGLDVLSSIKDFVKNVRIETIKPPKSLGLDLKGKTVVFEGSVEEASKLFP
RNINVASTIGLIVGFEKVKVTIVADPAMDHNIHIVRISSAIGNYEFKIENIPSPENPKTSMLTVYSILRTLRNLESKIIF
G
>P26900 3.5.1.1~~~ansA~~~L-asparaginase 1~~~COG0252
MKKLLMLTTGGTIASVEGENGLAPGVKADELLSYVSKLDNDYTMETQSLMNIDSTNMQPEYWVEIAEAVKENYDAYDGFV
ITHGTDTMAYTSAALSYMLQHAKKPIVITGSQIPITFQKTDAKKNITDAIRFACEGVGGVYVVFDGRVIQGTRAIKLRTK
SYDAFESINYPYIAFINEDGIEYNKQVTEPENDTFTVDTSLCTDVCLLKLHPGLKPEMFDALKSMYKGIVIESYGSGGVP
FEGRDILSKVNELIESGIVVVITTQCLEEGEDMSIYEVGRRVNQDLIIRSRNMNTEAIVPKLMWALGQSSDLPVVKRIME
TPIADDVVL
>P0A962 3.5.1.1~~~ansA~~~L-asparaginase 1~~~COG0252
MQKKSIYVAYTGGTIGMQRSEQGYIPVSGHLQRQLALMPEFHRPEMPDFTIHEYTPLMDSSDMTPEDWQHIAEDIKAHYD
DYDGFVILHGTDTMAYTASALSFMLENLGKPVIVTGSQIPLAELRSDGQINLLNALYVAANYPINEVTLFFNNRLYRGNR
TTKAHADGFDAFASPNLPPLLEAGIHIRRLNTPPAPHGEGELIVHPITPQPIGVVTIYPGISADVVRNFLRQPVKALILR
SYGVGNAPQNKAFLQELQEASDRGIVVVNLTQCMSGKVNMGGYATGNALAHAGVIGGADMTVEATLTKLHYLLSQELDTE
TIRKAMSQNLRGELTPDD
>O34482 3.5.1.1~~~ansZ~~~L-asparaginase 2~~~COG0252
MKKQRMLVLFTALLFVFTGCSHSPETKESPKEKAQTQKVSSASASEKKDLPNIRILATGGTIAGADQSKTSTTEYKAGVV
GVESLIEAVPEMKDIANVSGEQIVNVGSTNIDNKILLKLAKRINHLLASDDVDGIVVTHGTDTLEETAYFLNLTVKSDKP
VVIVGSMRPSTAISADGPSNLYNAVKVAGAPEAKGKGTLVVLNDRIASARYVTKTNTTTTDTFKSEEMGFVGTIADDIYF
NNEITRKHTKDTDFSVSNLDELPQVDIIYGYQNDGSYLFDAAVKAGAKGIVFAGSGNGSLSDAAEKGADSAVKKGVTVVR
STRTGNGVVTPNQDYAEKDLLASNSLNPQKARMLLMLALTKTNDPQKIQAYFNEY
>P00805 3.5.1.1~~~ansB~~~L-asparaginase 2~~~COG0252
MEFFKKTALAALVMGFSGAALALPNITILATGGTIAGGGDSATKSNYTVGKVGVENLVNAVPQLKDIANVKGEQVVNIGS
QDMNDNVWLTLAKKINTDCDKTDGFVITHGTDTMEETAYFLDLTVKCDKPVVMVGAMRPSTSMSADGPFNLYNAVVTAAD
KASANRGVLVVMNDTVLDGRDVTKTNTTDVATFKSVNYGPLGYIHNGKIDYQRTPARKHTSDTPFDVSKLNELPKVGIVY
NYANASDLPAKALVDAGYDGIVSAGVGNGNLYKSVFDTLATAAKTGTAVVRSSRVPTGATTQDAEVDDAKYGFVASGTLN
PQKARVLLQLALTQTKDPQQIQQIFNQY
>P43843 3.5.1.1~~~ansB~~~Probable L-asparaginase periplasmic~~~COG0252
MKLTKLALCTLFGLGVSIANAADLPNITILATGGTIAGSGQSSVNSAYKAGQLSIDTLIEAVPEMKNIANIKGEQIVKIG
SQDMNDEVWLKLAKAINAQCKSTDGFVITHGTDTMEETAYFLDLTVKCEKPVVLVGAMRPATEKSADGPLNLYNAVVVAA
DKKSSGRGVLVAMNNEVLGARDVTKTSTTAVQTFHSPNYGSLGYIHNSKVDYERSPESKHTINTPFNVEKLDSLPKVGII
YAYSNAPVEPLNALLNAGYQGIVSAGVGNGNVNAAHLDRLEKAAKDSVVVVRSSRVPTGYTTRDAEVDDSKYGFVASGTL
NPQKARVLLQLALTQTKDPKVIQQYFEDF
>P06608 3.5.1.1~~~ansB~~~L-asparaginase~~~
MERWFKSLFVLVLFFVFTASAADKLPNIVILATGGTIAGSAATGTQTTGYKAGALGVDTLINAVPEVKKLANVKGEQFSN
MASENMTGDVVLKLSQRVNELLARDDVDGVVITHGTDTVEESAYFLHLTVKSDKPVVFVAAMRPATAISADGPMNLLEAV
RVAGDKQSRGRGVMVVLNDRIGSARYITKTNASTLDTFKANEEGYLGVIIGNRIYYQNRIDKLHTTRSVFDVRGLTSLPK
VDILYGYQDDPEYLYDAAIQHGVKGIVYAGMGAGSVSVRGIAGMRKAMEKGVVVIRSTRTGNGIVPPDEELPGLVSDSLN
PAHARILLMLALTRTSDPKVIQEYFHTY
>Q47898 3.5.1.26~~~~~~N(4)-(Beta-N-acetylglucosaminyl)-L-asparaginase~~~COG1446
MRIIYKQQTMNNNRRDFIKKLGIATAAIAINPLEAKNLLDTSEPKTTNKPIVLSTWNFGLHANVEAWKVLSKGGKALDAV
EKGVRLVEDDPTERSVGYGGRPDRDGRVTLDACIMDENYNIGSVACMEHIKNPISVARAVMEKTPHVMLVGDGALEFALS
QGFKKENLLTAESEKEWKEWLKTSQYKPIVNIENHDTIGMIALDAQGNLSGACTTSGMAYKMHGRVGDSPIIGAGLFVDN
EIGAATATGHGEEVIRTVGTHLVVELMNQGRTPQQACKEAVERIVKIVNRRGKNLKDIQVGFIALNKKGEYGAYCIQDGF
NFAVHDQKGNRLETPGFALK
>Q9ZLB9 3.5.1.1~~~ansA~~~Probable L-asparaginase~~~COG0252
MAQNLPTIALLATGGTIAGSGVDASLGSYKSGELGVKELLKAIPSLNKIARIQGEQVSNIGSQDMNEEIWFKLAQRAQEL
LDDSRIQGVVITHGTDTLEESAYFLNLVLHSTKPVVLVGAMRNASSLSADGALNLYYAVSVAVNEKSANKGVLVVMDDTI
FRVREVVKTHTTHISTFKALNSGAIGSVYYGKTRYYMQPLRKHTTESEFSLSQLKTPLPKVDIIYTHAGMTPDLFQASLN
SHAKGVVIAGVGNGNVSAGFLKAMQEASQMGVVIVRSSRVGSGGVTSGEIDDKAYGFITSDNLNPQKARVLLQLALTKTN
DKAKIQEMFEEY
>P9WPX5 3.5.1.1~~~ansA~~~L-asparaginase~~~COG0252
MARLTVITTGGTISTTAGPDGVLRPTHCGATLIAGLDMDSDIEVVDLMALDSSKLTPADWDRIGAAVQEAFRGGADGVVI
THGTDTLEETALWLDLTYAGSRPVVLTGAMLSADAPGADGPANLRDALAVAADPAARDLGVLVSFGGRVLQPLGLHKVAN
PDLCGFAGESLGFTSGGVRLTRTKTRPYLGDLGAAVAPRVDIVAVYPGSDAVAMDACVAAGARAVVLEALGSGNAGAAVI
EGVRRHCRDGSDPVVIAVSTRVAGARVGAGYGPGHDLVEAGAVMVPRLPPSQARVLLMAALAANSPVADVIDRWG
>P50286 3.5.1.1~~~ansA~~~L-asparaginase~~~COG0252
MAKPQVTILATGGTIAGSGESSVKSSYSAGAVTVDKLLAAVPAINDLATIKGEQISSIGSQEMTGKVWLKLAKRVNELLA
QKETEAVIITHGTDTMEETAFFLNLTVKSQKPVVLVGAMRSGSSMSADGPMNLYNAVNVAINKASTNKGVVIVMNDEIHA
AREATKLNTTAVNAFASPNTGKIGTVYYGKVEYFTQSVRPHTLASEFDISKIEELPRVDILYAHPDDTDVLVNAALQAGA
KGIIHAGMGNGNPFPLTQNALEKAAKSGVVVARSSRVGSGSTTQEAEVDDKKLGFVATESLNPQKARVLLMLALTKTSDR
EAIQKIFSTY
>O07002 ~~~yveA~~~Aspartate-proton symporter~~~COG0531
MSKQGNFQKSMSLFDLILIGMGAIFGSAWLFAVSNVASKAGPSGAFSWILGGAIILLIGLVYAELGAALPRTGGIIRYPV
YSHGHLVGYLISFVTIVAYTSLISIEVTAVRQYVAYWFPGLTIKGSDSPTISGWILQFALLCLFFLLNYWSVKTFAKANF
IISIFKYIVPITIIIVLIFHFQPENLSVQGFAPFGFTGIQAAISTGGVMFAYLGLHPIVSVAGEVQNPKRNIPIALIICI
IVSTIIYTVLQVTFIGAIPTETLKHGWPAIGREFSLPFKDIAVMLGLGWLATLVILDAILSPGGNGNIFMNTTSRLVYAW
ARNGTLFGIFSKVNKDTGTPRASLWLSFALSIFWTLPFPSWNALVNVCSVALILSYAIAPISSAALRVNAKDLNRPFYLK
GMSIIGPLSFIFTAFIVYWSGWKTVSWLLGSQLVMFLIYLCFSKYTPKEDVSLAQQLKSAWWLIGFYIMMLIFSYIGSFG
HGLGIISNPVDLILVAIGSLAIYYWAKYTGLPKAAIDYDK
>P10172 3.5.1.38~~~ansB~~~Glutaminase-asparaginase~~~
KNNVVIVATGGTIAGAGASSTNSATYSAAKVPVDALIKAVPQVNDLANITGIQALQVASESITDKELLSLARQVNDLVKK
PSVNGVVITHGTDTMEETAFFLNLVVHTDKPIVLVGSMRPSTALSADGPLNLYSAVALASSNEAKNKGVMVLMNDSIFAA
RDVTKGINIHTHAFVSQWGALGTLVEGKPYWFRSSVKKHTNNSEFNIEKIQGDALPGVQIVYGSDNMMPDAYQAFAKAGV
KAIIHAGTGNGSMANYLVPEVRKLHDEQGLQIVRSSRVAQGFVLRNAEQPDDKYGWIAAHDLNPQKARLLMALALTKTND
AKEIQNMFWNY
>Q9I407 3.5.1.38~~~ansB~~~Glutaminase-asparaginase~~~
MKPLLHAFAPGVMALMLLLPQAAQAKEVAPQQKLSNVVILATGGTIAGAGASAANSATYTAAKVPVDQLLASVPQLKDIA
NVRGEQVFQIASESFTNENLLELGKTVAKLADSDDVDGIVITHGTDTLEETAYFLTLVEHTEKPIVVVGSMRPGTAMSAD
GMLNLYNAVAVAGDKSARGKGVLITMNDEILSGRDASKMVNIKTEAFKSPWGPLGMVVEGKSYWFRAPVKRHTVNSEFDI
KQISALAPVEIAYSYGNVSDTAYKALAQAGAKAIIHAGTGNGSVPARVVPTLQELRKQGVQIIRSSHVNAGGFVLRNAEQ
PDDKNDWIVAHDLNPQKARILAAVAMTKTQDSKELQRIFWEY
>O68897 3.5.1.38~~~ansB~~~Glutaminase-asparaginase~~~
MKSALKTFVPGALALLLLFPVAAQAKEVETKTKLANVVILATGGTIAGAGASAANSATYQAAKVGIEQLIAGVPELSQIA
NVRGEQVMQIASESINNENLLQLGRRVAELADSKDVDGIVITHGTDTLEETAYFLNLVEKTDKPIIVVGSMRPGTAMSAD
GMLNLYNAVAVAGSKDARGKGVLVTMNDEIQSGRDVSKMINIKTEAFKSPWGPLGMVVEGKSYWFRLPAKRHTMDSEFDI
KTIKSLPDVEIAYGYGNVSDTAVKALAQAGAKAIIHAGTGNGSVSSKVVPALQELRKQGVQIIRSSHVNAGGFVLRNAEQ
PDDKYDWVVAHDLNPQKARILAMVALTKTQDSKELQRMFWEY
>Q88K39 3.5.1.38~~~ansB~~~Glutaminase-asparaginase~~~COG0252
MNAALKTFAPSALALLLILPSSASAKEAETQQKLANVVILATGGTIAGAGASAANSATYQAAKLGVDKLIAGVPELADIA
NVRGEQVMQIASESISNDDLLKLGKRVAELAESKDVDGIVITHGTDTLEETAFFLNLVEKTDKPIVVVGSMRPGTAMSAD
GMLNLYNAVAVASDKQSRGKGVLVTMNDEIQSGRDVSKAVNIKTEAFKSAWGPMGMVVEGKSYWFRLPAKRHTVNSEFDI
KQISSLPQVDIAYGYGNVTDTAYKALAQNGAKALIHAGTGNGSVSSRVVPALQELRKNGVQIIRSSHVNQGGFVLRNAEQ
PDDKNDWVVAHDLNPQKARILAMVAMTKTQDSKELQRIFWEY
>P10182 3.5.1.38~~~ansB~~~Glutaminase-asparaginase~~~
KEVENQQKLANVVILATGGTIAGAGASAANSATYQAAKVGVDKLIAGVPELADLANVRGEQVMQIASESITNDDLLKLGK
RVAELADSNDVDGIVITHGTDTLEETAYFLDLTLNTDKPIVVVGSMRPGTAMSADGMLNLYNAVAVASNKDSRGKGVLVT
MNDEIQSGRDVSKSINIKTEAFKSAWGPLGMVVEGKSYWFRLPAKRHTVNSEFDIKQISSLPQVDIAYSYGNVTDTAYKA
LAQNGAKALIHAGTGNGSVSSRLTPALQTLRKTGTQIIRSSHVNQGGFVLRNAEQPDDKNDWVVAHDLNPEKARILVELA
MVKTQDSKELQRIFWEY
>E3PJ88 ~~~gspS2~~~Pilotin AspS 2~~~
MSIKQMPGRVLISLLLSVTGLLSGCASHNENASLLAKKQAQNISQNLPIKSAGYTLVLAQSSGTTVKMTIISEAGTQTTQ
TPDAFLTSYQRQMCADPTVKLMLTEGINYSITINDTRTGNQYQRKLDRTTCGIVKA
>Q8L3K8 ~~~aspT~~~Aspartate/alanine antiporter~~~
MNAIGNFLVGTPVFTIFICLALGYLLGKLKIGSFTLGATVGVLIVALLIGQLGVFPRDTLLGDIFFDFFMFAIGYRVGPS
FISSMKKFGAKIVYATLIFLVSAFIVAYACFKMFHIGPGIAAGIIAGGLTQSAVIGSSLETISKLPISDHLKTLYSNQIP
IVYTLTYVFGTIGVLIFLRDIMPKLMHIDLKKQAVKTAKELDMIPVPVIVASTHFYTINDGSSLIGQTLGTVNTKFAKGL
VAAGLNDSADMASVINAGDVLAISGGIDEIGRAVQEFNLLEVTGKTKAYVSKQVVLKKNFSADVLKNAQDKGVLVATLAG
DVMDPAQFSTLKPAESVTLVGQKDAVSEVQSQLGRLRAAENIINYSWFALGIALSAALGIVGTKVSGVPIALGGGTASLI
VGLVQSIYRDKHAHMDTIPDSLLEFFQSIGLNLFIATVGLSAAKTFISAIQSMGISVLLIGAVISILPHIITFVICYYLM
KMEPISIIGAQTGADTLSAALNDVSERVGSDASPFFAAAVAPAYAIGNIFLTLMGPIFIVLLS
>Q9L5A4 3.4.21.121~~~asp~~~Aeromonas extracellular serine protease~~~
MKQTSLALAITALLSTLPSALVQANEGCAPLTGKESGMDIGRSSTERCLPGANPLQDQQWYLLNSGQDGFSARGGIAGND
LNLWWAHRTGVLGQGVNVAVVDDGLAIAHPDLADNVRPGSKNVVTGSDDPTPTDPDTAHGTSVSGIIAAVDNAIGTKGIA
PRAQLQGFNLLDDNSQQLQKDWLYALGDSNASRDNRVFNQSYGMSVVDPRSANSLDQSQLDRLFEQQTLKAQGAAYIKAA
GNGFNKIAAGGYVLNRTGNGPKLPFENSNLDPSNSNFWNLVVSALNADGVRSSYSSVGSNIFLSATGGEYGTDTPAMVTT
DLPGCDMGYNRTDDPSTNRLHGNSQLDASCDYNGVMNGTSSATPSTSGAMALLMSAYPDLSVRDLRDLLARSATRVDAKH
QPVMVSYTSSTGKVRDVKGLEGWERNAAGMWFSPTYGFGLIDVNKALELAANHQPLPPLVQLPWQKINVTGSAAAIADVG
NSPTSSTTRIATPLTVEAVQVMVSLDHQRLPDLLIELVSPAGTRSILLSPFNSLVGQSLDQQQLGFVRTKGLRDMRMLSN
KFYGESAQGTWRLEVTDVANGTRQVSLLNRETRERTTLTERNNRQPGKLISWSLRVLGHDANRS
>P26474 ~~~asrA~~~Anaerobic sulfite reductase subunit A~~~
MAIKITPDEFSLLIQRLNKKWRVFAPSAEFRGGRFSDTDNIIYQRISGWRDLIWHEKSHMSPNTIIAPITETLFYFDKDT
IQIAETDTSPIIIFARACDINAMSRLDYMYLSNGNNSDYSYQLLREHIRFVLIECEESFENCFCVSMGTNKTDCYSAAMR
FSDEGALVSIRDPFIEAAIQGLGQEADYTPSFVSENRETVVTPDSVCHDPQKIRDILTHHPLWDAYDSRCISCGRCTTGC
PTCTCYSVFDVAYDENPQRGERRRQWASCMVPGFSDMAGGHGFREKPGERLRYRALHKVNDYKARNGIEHMCVGCGRCDD
RCPQYIKFSLIINKMTAAVRQALAEEA
>P26475 ~~~asrB~~~Anaerobic sulfite reductase subunit B~~~
MSHCSCHDKPQHSLLPAAYRILSITRHTPLEWNFRVAVDFPAHWGQFVEVSLPRVGEAPISVSDYGDGWIDLLIRNVGKV
TSALFTLKEGDNVWLRGCYGNGYPVDTLRHKPLLVVAGGTGVAPVKGLMRYFVENPQEIGQLDMILGYKNRDCVLYKEEM
ATWRGKHNLVLTLDEGEADDRYQIGRVTDRLADMTLSDIDTMQAIVVGPPIMITFTVKMLLQKGLKPEQIWVDYERRMAC
SVGKCGHCRMGEVYVCTDGPIFNYAVAQRFAD
>P0A1Y2 1.8.1.-~~~asrC~~~Anaerobic sulfite reductase subunit C~~~
MSIDIDIIKARAKNEYRLSKVRGEAMISVRIPGGILPAHLLTVARDIAETWGNGQIHLTTRQKLAMPGIRYEDIDNVNAA
LEPFLREIEIELCDVQVEDTKAGYLAIGGRNIVACQGNRICQKANTDTTGLSRRLEKLVYPSPYHLKTVIVGCPNDCAKA
SMADLGIIGVAKMRFTADRCIGCGACVKACSHHAVGCLALKNGKAVKEESACIGCGECVLACPTLAWQRKPDQLWQVRLG
GRTSKKTPRVGKLFLNWVTEDVIKQVIVNLYEFEKEMLGGKPIYLHMGHLIDKGGYLRFKERVLRGVQLNPEAMVAERIY
WAEDESVARMHLKPAGH
>P36560 ~~~asr~~~Acid shock protein~~~
MKKVLALVVAAAMGLSSAAFAAETTTTPAPTATTTKAAPAKTTHHKKQHKAAPAQKAQAAKKHHKNTKAEQKAPEQKAQA
AKKHAKKHSHQQPAKPAAQPAA
>Q8FDI4 2.8.2.22~~~assT~~~Arylsulfate sulfotransferase AssT~~~
MFDKYRKTLVAGTVAITLGLSASGVMAAGFKPAPPAGQLGAVIVDPYGNAPLTALVDLDSHVISDVKVTVHGKGEKGVEI
SYPVGQESLKTYDGVPIFGLYQKFANKVTVEWKENGKVMKDDYVVHTSAIVNNYMDNRSISDLQQTKVIKVAPGFEDRLY
LVNTHTFTAQGXDLHWHGEKDKNAGILDAGPATGALPFDIAPFTFIVDTEGEYRWWLDQDTFYDGRDRDINKRGYLMGIR
ETPRGTFTAVQGQHWYEFDMMGQVLEDHKLPRGFADATHESIETPNGTVLLRVGKSNYRRDDGVHVTTIRDHILEVDKSG
RVVDVWDLTKILDPKRDALLGALDAGAVCVNVDLAHAGQQAKLEPDTPFGDALGVGPGRNWAHVNSIAYDAKDDSIILSS
RHQGVVKIGRDKQVKWILAPSKGWEKPLASKLLKPVDANGKPITCNENGLCENSDFDFTYTQHTAWISSKGTLTIFDNGD
GRHLEQPALPTMKYSRFVEYKIDEKKGTVQQVWEYGKERGYDFYSPITSIIEYQADRNTMFGFGGSIHLFDVGQPTVGKL
NEIDYKTKEVKVEIDVLSDKPNQTHYRALLVRPQQMFK
>Q7VTJ9 6.3.4.5~~~argG~~~Argininosuccinate synthase~~~COG0137
MTTILPNLPTGQKVGIAFSGGLDTSAALLWMRQKGAVPYAYTANLGQPDEPDYDEIPRRAMQYGAEAARLVDCRAQLVAE
GIAALQAGAFHISTAGLTYFNTTPIGRAVTGTMLVAAMKEDGVNIWGDGSTFKGNDIERFYRYGLLTNPDLKIYKPWLDQ
TFIDELGGRAEMSEYMRQAGFDYKMSAEKAYSTDSNMLGATHEAKDLELLSAGIRIVQPIMGVAFWQDSVQIKAEEVTVR
FEEGQPVALNGVEYADPVELLLEANRIGGRHGLGMSDQIENRIIEAKSRGIYEAPGLALLFIAYERLVTGIHNEDTIEQY
RENGRKLGRLLYQGRWFDPQAIMLRETAQRWVARAITGEVTLELRRGNDYSLLNTESANLTYAPERLSMEKVENAPFTPA
DRIGQLTMRNLDIVDTREKLFTYVKTGLLAPSAGSALPQIKDGKK
>Q9PHK7 6.3.4.5~~~argG~~~Argininosuccinate synthase~~~COG0137
MKNEVKKVVLAYSGGLDTSIILKWLQDEYNCEVVTFTADIGQGEELEPARKKALSLGIKEENIFIKDLRDEFVKDYVFPM
FRANAIYEGEYLLGTSIARPLIAKTQAQIALQTGADAVSHGATGKGNDQVRFELGYLAFSPDLKIIAPWREWDLNSREKL
LAYAQKHGIDISKKKGKSPYSMDANLLHISYEGLVLEDPAHAPEEDMWRWSKSPKDAPNESEIIELDFQKGDLVAINGEK
LSPAGLLTKLNELGCKHGIGRLDIVENRYVGMKSRGCYETPGGTILLKAHRALESITLDREAAHLKDELMPKYASLIYNG
YWFSPERMMLQALIDESQIHANGRVKLELYKGNVMVIGRESANDSLFNAAYCTFEEDEVYNQKDAAGFIKLNALRFIIAG
KNGRKF
>P0A6E4 6.3.4.5~~~argG~~~Argininosuccinate synthase~~~COG0137
MTTILKHLPVGQRIGIAFSGGLDTSAALLWMRQKGAVPYAYTANLGQPDEEDYDAIPRRAMEYGAENARLIDCRKQLVAE
GIAAIQCGAFHNTTGGLTYFNTTPLGRAVTGTMLVAAMKEDGVNIWGDGSTYKGNDIERFYRYGLLTNAELQIYKPWLDT
DFIDELGGRHEMSEFMIACGFDYKMSVEKAYSTDSNMLGATHEAKDLEYLNSSVKIVNPIMGVKFWDESVKIPAEEVTVR
FEQGHPVALNGKTFSDDVEMMLEANRIGGRHGLGMSDQIENRIIEAKSRGIYEAPGMALLHIAYERLLTGIHNEDTIEQY
HAHGRQLGRLLYQGRWFDSQALMLRDSLQRWVASQITGEVTLELRRGNDYSILNTVSENLTYKPERLTMEKGDSVFSPDD
RIGQLTMRNLDITDTREKLFGYAKTGLLSSSAASGVPQVENLENKGQ
>Q5ZY78 6.3.4.5~~~argG~~~Argininosuccinate synthase~~~COG0137
MKKVIKKIALAYSGGLDTSIMIPWLKEHYEHAEVIAVICDLGQQEDLDAIKNKALKSGASKAYVVDVKNEFATQYLWPLV
KSGALYEDQYILGTISRPLIAQKLVEIALTEQVNAVAHGATGKGNDQVRFEYSIKALAPQLEIIAPWRTWDIKSRQEAIV
YAKAHGIEVPVTPKAPYSRDHNIWYISHEGGVLEDPSQEMPNDVLLMTAPVSQTPDEEEVVVLDFKKGVPVALNGQELSP
VDLLNSLNQKAGQHGIGVADIVENRLVGMKIRGIYEAPAAAVLYKAHKLLESLCLTRSTLHLKQSLQQTYANLVYEGRWF
SQTKQALDAFIDVTQQHVTGCVKLKLFKGNIIPAGMHSPYSLHHPELATFEEDNVYNQKDAEGFINLFSLSAKIYSQVHQ
GGNYD
>P9WPW7 6.3.4.5~~~argG~~~Argininosuccinate synthase~~~COG0137
MSERVILAYSGGLDTSVAISWIGKETGREVVAVAIDLGQGGEHMDVIRQRALDCGAVEAVVVDARDEFAEGYCLPTVLNN
ALYMDRYPLVSAISRPLIVKHLVAAAREHGGGIVAHGCTGKGNDQVRFEVGFASLAPDLEVLAPVRDYAWTREKAIAFAE
ENAIPINVTKRSPFSIDQNVWGRAVETGFLEHLWNAPTKDIYAYTEDPTINWGVPDEVIVGFERGVPVSVDGKPVSMLAA
IEELNRRAGAQGVGRLDVVEDRLVGIKSREIYEAPGAMVLITAHTELEHVTLERELGRFKRQTDQRWAELVYDGLWYSPL
KAALEAFVAKTQEHVSGEVRLVLHGGHIAVNGRRSAESLYDFNLATYDEGDSFDQSAARGFVYVHGLSSKLAARRDLR
>Q06734 6.3.4.5~~~argG~~~Argininosuccinate synthase~~~
MSKVLTSLPAGERVGIAFSGGLDTSVAVAWMRDKGAVPCTYTADIGQYDEPDIASVPSRASAYGAEITRLVDCRAALVEE
GLAALACGAFHIRSGGRPYFNTTPLGRAVTGTLLVRAMLEDGVQIWGDGSTFKGNDIERFYRYGLLANPHLRIYKPWLDA
DFVTELGGRKEMSEWLLAHGLPYRDSTEKAYSTDANIWGATHEAKTLEHLDTGIETVDPIMGVRFWDPSVEIATEDVTVG
FEQGRPVSINGKEFASAVDLVMEANAIGGRHGLGMSDQIENRIIEAKSRGIYEAPGMALLHIVYERLVNAIHNEDTLAAY
HNEGRRLGRLMYEGRWLDPQALMIRESLQRWVGTAVTGEVTLRLRRGEDYSILDTTGPAFSYHPDKLSMERTEDSAFGPV
DRIGQLTMRNLDIADSRARLEQYVGLGLVGTPHPTPIGAAQAAATGLIGAMDEGGAEAIASRGEATDEETMLDRAAMESG
TD
>P77973 6.3.4.5~~~argG~~~Argininosuccinate synthase~~~COG0137
MGRAKKVVLAYSGGVDTSVCIPYLMHEWGVEEVITLAADLGQGDELGPIQEKALRCGAVESLVIDGKEEFVKEYAFRSIQ
ANALYENRYPLSTALARPLIAKMLVEAAEKYGADAVAHGCTGKGNDQVRFDISIMALNPNLKVLAPAREWKMSREETIAY
GERYGVESPVKKSSPYSIDRNILGRSIEAGPLEDPMTEPTEEIYLMTKAIADTPDEPEYVDIGFEKGIPVSLNGVMLDPV
TLVERLNEIAGNHGVGRLDMVENRVVGIKSREIYEAPALLVLIDAHRDLESLTQTADVTHYKNTVEEIYSQLIYRGLWYS
PLKEALDAFIVKTQERVTGMVRVKFFKGNANVAGRKSDYSIYDAELATYGMEDQFDHKAAEGFIYIWGLPTKVWAQKMRG
>Q9X2A1 6.3.4.5~~~argG~~~Argininosuccinate synthase~~~COG0137
MKEKVVLAYSGGLDTSVILKWLCEKGFDVIAYVANVGQKDDFVAIKEKALKTGASKVYVEDLRREFVTDYIFTALLGNAM
YEGRYLLGTAIARPLIAKRQVEIAEKEGAQYVAHGATGKGNDQVRFELTYAALNPNLKVISPWKDPEFLAKFKGRTDLIN
YAMEKGIPIKVSKKRPYSEDENLMHISHEAGKLEDPAHIPDEDVFTWTVSPKDAPDEETLLEIHFENGIPVKVVNLKDGT
EKTDPLELFEYLNEVGAKNGVGRLDMVENRFIGIKSRGVYETPGATILWIAHRDLEGITMDKEVMHLRDMLAPKFAELIY
NGFWFSPEMEFLLAAFRKAQENVTGKVTVSIYKGNVMPVARYSPYSLYNPELSSMDVEGGFDATDSKGFINIHALRLKVH
QLVKKGYQR
>P59846 6.3.4.5~~~argG~~~Argininosuccinate synthase~~~COG0137
MKIVLAYSGGLDTSIILKWLKETYRAEVIAFTADIGQGEEVEEAREKALRTGASKAIALDLKEEFVRDFVFPMMRAGAVY
EGYYLLGTSIARPLIAKHLVRIAEEEGAEAIAHGATGKGNDQVRFELTAYALKPDIKVIAPWREWSFQGRKEMIAYAEAH
GIPVPVTQEKPYSMDANLLHISYEGGVLEDPWAEPPKGMFRMTQDPEEAPDAPEYVEVEFFEGDPVAVNGERLSPAALLQ
RLNEIGGRHGVGRVDIVENRFVGMKSRGVYETPGGTILYHARRAVESLTLDREVLHQRDMLSPKYAELVYYGFWYAPERE
ALQAYFDHVARSVTGVARLKLYKGNVYVVGRKAPKSLYRQDLVSFDEAGGYDQKDAEGFIKIQALRLRVRALVEREGHGA
>P0AE37 2.3.1.109~~~astA~~~Arginine N-succinyltransferase~~~COG3138
MMVIRPVERSDVSALMQLASKTGGGLTSLPANEATLSARIERAIKTWQGELPKSEQGYVFVLEDSETGTVAGICAIEVAV
GLNDPWYNYRVGTLVHASKELNVYNALPTLFLSNDHTGSSELCTLFLDPDWRKEGNGYLLSKSRFMFMAAFRDKFNDKVV
AEMRGVIDEHGYSPFWQSLGKRFFSMDFSRADFLCGTGQKAFIAELMPKHPIYTHFLSQEAQDVIGQVHPQTAPARAVLE
KEGFRYRNYIDIFDGGPTLECDIDRVRAIRKSRLVEVAEGQPAQGDFPACLVANENYHHFRVVLVRTDPATERLILTAAQ
LDALKCHAGDRVRLVRLCAEEKTA
>P80357 2.3.1.109~~~astA~~~Arginine N-succinyltransferase subunit alpha~~~
MLVMRPAQAADLPQVQRLAADSPVGVTSLPDDAERLRDKILASEASFAAEVSYNGEESYFFVLEDSASGELVGCSAIVAS
AGFSEPFYSFRNETFVHASRSLSIHNKIHVLSLCHDLTGNSLLTSFYVQRDLVQSVYAELNSRGRLLFMASHPERFADAV
VVEIVGYSDEQGESPFWNAVGRNFFDLNYIEAEKLSGLKSRTFLAELMPHYPIYVPLLPDAAQESMGQVHPRAQITFDIL
MREGFETDNYIDIFDGGPTLHARTSGIRSIAQSRVVPVKIGEAPKSGRPYLVTNGQLQDFRAVVLDLDWAPGKPVALSVE
AAEALGVGEGASVRLVAV
>Q8ZPV1 2.3.1.109~~~astA~~~Arginine N-succinyltransferase~~~
MRVIRPVEHADIAALMQLAGKTGGGLTSLPANEATLAARIERALKTWSGELPKGEQGYVFVLEDSETGEVGGICAIEVAV
GLNDPWYNYRVGTLVHASKELNVYNALPTLFLSNDHTGSSELCTLFLDPEWRKEGNGYLLSKSRFMFMAAFRDKFNEKVV
AEMRGVIDEHGYSPFWQSLGKRFFSMDFSRADFLCGTGQKAFIAELMPKHPIYTHFLSEEAQAVIGEVHPQTAPARAVLE
KEGFRYRHYIDIFDGGPTLECDIDRVRAIRKSRLVEVAEGQPAPGDYPACLVANENYHHFRAALVRADPQTSRLVLTAAQ
LDALKCRAGDHVRLVRLCAEEKTV
>P76216 3.5.3.23~~~astB~~~N-succinylarginine dihydrolase~~~COG3724
MNAWEVNFDGLVGLTHHYAGLSFGNEASTRHRFQVSNPRLAAKQGLLKMKALADAGFPQAVIPPHERPFIPVLRQLGFSG
SDEQVLEKVARQAPHWLSSVSSASPMWVANAATIAPSADTLDGKVHLTVANLNNKFHRSLEAPVTESLLKAIFNDEEKFS
VHSALPQVALLGDEGAANHNRLGGHYGEPGMQLFVYGREEGNDTRPSRYPARQTREASEAVARLNQVNPQQVIFAQQNPD
VIDQGVFHNDVIAVSNRQVLFCHQQAFARQSQLLANLRARVNGFMAIEVPATQVSVSDTVSTYLFNSQLLSRDDGSMMLV
LPQECREHAGVWGYLNELLAADNPISELKVFDLRESMANGGGPACLRLRVVLTEEERRAVNPAVMMNDTLFNALNDWVDR
YYRDRLTAADLADPQLLREGREALDVLSQLLNLGSVYPFQREGGGNG
>Q8ZPU9 3.5.3.23~~~astB~~~N-succinylarginine dihydrolase~~~
MTAHEVNFDGLVGLTHHYAGLSFGNEASTRHRFQVSNPRLAVKQGLLKMKALADAGFPQAVIPPHERPFIPALRQLGFTG
SDEQILDKVARQAPRWLSSVSSASPMWVANAATVCPSADALDGKVHLTVANLNNKFHRALEAPVTEALLRAIFRDESQFS
VHSALPQVALLGDEGAANHNRLGGEYGSAGVQLFVYGREEENEIRPARYPARQSREASEAVARLNQVNPQQVIFAQQNPE
VIDQGVFHNDVIAVSNRQVLFCHEAAFARQKVLINQLRTRVDGFMAIEVPAGEVSVSDAVATYLFNSQLLSRNDGLMLLV
LPRECQDHVGVWRYLNKLVAEDNPISAMQVFDLRESMANGGGPACLRLRVVLTEEERRAVNPAVMMNDALFTALNAWADR
YYRDRLTAADLADPLLLREGREALDVLTRLLDLGSVYPFQQTGAADG
>P77581 2.6.1.81~~~astC~~~Succinylornithine transaminase~~~COG4992
MSQPITRENFDEWMIPVYAPAPFIPVRGEGSRLWDQQGKEYIDFAGGIAVNALGHAHPELREALNEQASKFWHTGNGYTN
EPVLRLAKKLIDATFADRVFFCNSGAEANEAALKLARKFAHDRYGSHKSGIVAFKNAFHGRTLFTVSAGGQPAYSQDFAP
LPADIRHAAYNDINSASALIDDSTCAVIVEPIQGEGGVVPASNAFLQGLRELCNRHNALLIFDEVQTGVGRTGELYAYMH
YGVTPDLLTTAKALGGGFPVGALLATEECARVMTVGTHGTTYGGNPLASAVAGKVLELINTPEMLNGVKQRHDWFVERLN
TINHRYGLFSEVRGLGLLIGCVLNADYAGQAKQISQEAAKAGVMVLIAGGNVVRFAPALNVSEEEVTTGLDRFAAACEHF
VSRGSS
>Q8ZPV2 2.6.1.81~~~astC~~~Succinylornithine transaminase~~~
MSLSVTRENFDEWMVPVYVPAPFIPVRGEGSRLWDQQGKEYIDFAGGIAVNALGHAHPALREALNEQANRFWHIGNGYTN
EPALRLAKKLIDATFAERVFFCNSGAEANEAALKLARKYAHDRVGNHKSGIVAFKNAFHGRTLFTVSAGGQPTYSQDFAP
LPPDIRHAAYNDLNSASALIDDNTCAVIVEPVQGEGGVIPATKAFLQGLRELCDRHQALLIFDEVQTGVGRTGELYAYMH
YGVTPDILTTAKALGGGFPIGAMLTTQDYASVMTPGTHGTTYGGNPLATAVAGKVLDIINTPEMQNGVRQRHDAFIERLN
TLNVRFGMFSEIRGLGLLLGCVLQTEFAGKAKLIAQEAAKAGVMVLIAGGDVVRFAPALNVSDEEIATGLDRFALACERL
QTGGVPCG
>Q2SXN9 1.2.1.71~~~astD~~~N-succinylglutamate 5-semialdehyde dehydrogenase~~~
MTELFIDGAWVDGAGPVFASRNPGTNERVWEGASASADDVERAVASARRAFAAWSALDLDARCTIVKRFAALLVERKEAL
ATMIGRETGKPLWEARTEVASMAAKVDISITAYHERTGEKRAPMADGVAVLRHRPHGVVAVFGPYNFPGHLPNGHIVPAL
IAGNTVVFKPSELAPGVARATVEIWRDAGLPAGVLNLVQGEKDTGVALANHRQIDGLFFTGSSDTGTLLHKQFGGRPEIV
LALEMGGNNPLVVAEVEDIDAAVHHAIQSAFLSAGQRCTCARRILVPRGAFGDRFVARLADVASKITASVFDADPQPFMG
AVISARAASRLVAAQARLVGLGASPIIEMKQRDPALGFVNAAILDVTNVRELPDEEHFGPLAQIVRYTDLDDAIARANDT
AFGLSAGLLADDEQAWHTFRRAIRAGIVNWNRPTNGASSAAPFGGAGRSGNHRPSAYYAADYCAYPMASVESAQLQMPAS
LSPGLHF
>P76217 1.2.1.71~~~astD~~~N-succinylglutamate 5-semialdehyde dehydrogenase~~~COG1012
MTLWINGDWITGQGASRVKRNPVSGEVLWQGNDADAAQVEQACRAARAAFPRWARLSFAERHAVVERFAALLESNKAELT
AIIARETGKPRWEAATEVTAMINKIAISIKAYHVRTGEQRSEMPDGAASLRHRPHGVLAVFGPYNFPGHLPNGHIVPALL
AGNTIIFKPSELTPWSGEAVMRLWQQAGLPPGVLNLVQGGRETGQALSALEDLDGLLFTGSANTGYQLHRQLSGQPEKIL
ALEMGGNNPLIIDEVADIDAAVHLTIQSAFVTAGQRCTCARRLLLKSGAQGDAFLARLVAVSQRLTPGNWDDEPQPFIGG
LISEQAAQQVVTAWQQLEAMGGRPLLAPRLLQAGTSLLTPGIIEMTGVAGVPDEEVFGPLLRVWRYDTFDEAIRMANNTR
FGLSCGLVSPEREKFDQLLLEARAGIVNWNKPLTGAASTAPFGGIGASGNHRPSAWYAADYCAWPMASLESDSLTLPATL
NPGLDFSDEVVR
>A1U5W8 1.2.1.71~~~astD~~~N-succinylglutamate 5-semialdehyde dehydrogenase~~~COG1012
MANLTGNVYIDGLWLPGHGAPFESVQPVTGETVWDGNAASLEDVDAAVREARKAFLAWRRKSLAERQAVIEAFGELLEAN
KEELAHQIGLETGKPLWESRTEVAAMMGKIPISVKAYNERTGHTESDVAGGHAVLRHRPHGVVAVFGPYNFPGHLPNGHI
VPALLAGNTVVFKPSELTPGVAELTVRLWEKAGLPDGVINLVQGGSDTGKCLARHSLIDGLFFTGSSTVGHLLHEQFGGQ
PEKILALEMGGNNPLIVQNVSDLDGAVHHALQSAFLSAGQRCTCARRLLVPKGKKGDEFLARLVEVAARITVAEFDADPQ
PFMGSVISAEAANQLLKAQAAMLEKGATSLLEMKQLKPDTGLLSPGIVDATGIELEDQEFFGPLLTVYRYKGFDEALELA
NNTRYGLSAGILSDDRKLYNRLVEEVRAGIVNWNRPLTGASSAAPFGGVGASGNHRPSAYYAADYCAWPMASLEAGKSEL
PDSLAPGLNFD
>O50174 1.2.1.71~~~astD~~~N-succinylglutamate 5-semialdehyde dehydrogenase~~~
MSTHYIAGQWLAGQGETLESLDPVGQGVVWSGRGADATQVDAAVCAAREAFPAWARRPLEQRIELLERFAATLKSRADEL
ARVIGEETGKPLWESATEVTSMVNKVAISVQAFRERTGEKSGPLADATAVLRHKPHGVVAVFGPYNFPGHLPNGHIVPAL
LAGNCVVFKPSELTPKVAELTLKAWIQAGLPAGVLNLVQGGRETGVALAAHRGLDGLFFTGSSRTGNLLHSQFGGQPQKI
LALEMGGNNPLVVEEVADLDAAVYTIIQSAFISAGQRCTCARRLLVPQGAWGDALLARLVAVSATLRVGRFDEQPAPFMG
AVISLSAAEHLLKAQEHLIGKGAQPLLAMTQPIDGAALLTPGILDVSAVAERPDEEFFGPLLQVIRYSDFAAAIREANAT
QYGLAAGLLSDSRERFEQFLVESRAGIVNWNKQLTGAASSAPFGGIGASGNHRPSAYYAADYCAYPVASLESPSVSLPAT
LTPGISL
>Q8ZPV0 1.2.1.71~~~astD~~~N-succinylglutamate 5-semialdehyde dehydrogenase~~~
MTLWINGDWITGQGERRRKTNPVSAEILWQGNDANAAQVAEACQAARAAFPRWARQPFAARQAIVEKFAALLEAHKAELT
EVIARETGKPRWEAATEVTAMINKIAISIKAYHARTGAQKSELVDGAATLRHRPHGVLAVFGPYNFPGHLPNGHIVPALL
AGNTLIFKPSELTPWTGETVIKLWERAGLPAGVLNLVQGGRETGQALSSLDDLDGLLFTGSASTGYQLHRQLSGQPEKIL
ALEMGGNNPLIIEDAANIDAAVHLTLQSAFITAGQRCTCARRLLVKQGAQGDAFLARLVDVAGRLQPGRWDDDPQPFIGG
LISAQAAQHVMEAWRQREALGGRTLLAPRKVKEGTSLLTPGIIELTGVADVPDEEVFGPLLNVWRYAHFDEAIRLANNTR
FGLSCGLVSTDRAQFEQLLLEARAGIVNWNKPLTGAASTAPFGGVGASGNHRPSAWYAADYCAWPMASLESPELTLPATL
SPGLDFSRREAV
>Q7NU26 3.5.1.96~~~astE~~~Succinylglutamate desuccinylase~~~COG2988
MTHSPSFLQHALSSSDTRAEWPLPGGLAARWLAPGCVELNGDARGADSVLLSCGVHGNETAPIEVVDGMLTDIAAGQLAL
NCRLLVMFANLDAIRQGVRYGNYDMNRLFNGAHARHPELPESVRAAELETLAAEFFAGARARKLHYDLHTAIRGSVFEKF
AIYPFLHDGRTHKREQLAWLQRCGIEAVLLHTQPANTFSYFTSQYCEADAFTLELGKARPFGQNDLSRFSGIDGALRGLL
SNPQANVPDLDEDKLPLFRAKYDLVKHSEAFKLNLADSVENFTLLPDGMLIAEDGAVRYQATGGEERILFPNPAVKPGLR
AGIVVEPARLPSR
>P76215 3.5.1.96~~~astE~~~Succinylglutamate desuccinylase~~~COG2988
MDNFLALTLTGKKPVITEREINGVRWRWLGDGVLELTPLTPPQGALVISAGIHGNETAPVEMLDALLGAISHGEIPLRWR
LLVILGNPPALKQGKRYCHSDMNRMFGGRWQLFAESGETCRARELEQCLEDFYDQGKESVRWHLDLHTAIRGSLHPQFGV
LPQRDIPWDEKFLTWLGAAGLEALVFHQEPGGTFTHFSARHFGALACTLELGKALPFGQNDLRQFAVTASAIAALLSGES
VGIVRTPPLRYRVVSQITRHSPSFEMHMASDTLNFMPFEKGTLLAQDGEERFTVTHDVEYVLFPNPLVALGLRAGLMLEK
IS
>Q8ZPU8 3.5.1.96~~~astE~~~Succinylglutamate desuccinylase~~~
MDNFLALTLSGTTPRVTQGKGAGFRWRWLGHGLLELTPDAPVDRALILSAGIHGNETAPVEMLDKLLSALYSGSLTLTWR
VLVVLGNPQALAAGIRYCHSDMNRMFGGRWQSFAESDETRRARELELSLETFFSSGQARVRWHLDLHTAIRGSHHLRFGV
LPQRDRPWETDFLAWLGAAGLEALVFHQAPGGTFTHFSSEHFGALSCTLELGKALPFRQNDLTQFNVTSQALSALLSGVE
TSTSFSPPLRYRVVSQITRHSDKFALYMDAQTLNFTAFAKGTLLAEEGDKRVTVTHDVEYVLFPNPSVACGLRAGLMLER
LP
>Q9KSL4 3.5.1.96~~~astE~~~Succinylglutamate desuccinylase~~~COG2988
MTKSLFRQSFLFDSLDLDHPMVAQTVRTEQGVTLKLHQRGVLEVIPAQTDAATKNMVISCGIHGDETAPMELLDKWIDDI
VSGFQPVAERCLFIMAHPQATVRHVRFIEQNLNRLFDDKPHTPSTELAIADNLKVLLRQFFANTDEHSRWHLDLHCAIRG
SKHYSFAVSPKARHPVRSRSLMQFIEQAHIEAVMLSNAPSSTFSWYSAEHYAAQALTLELGQVARLGENLLDRLLAFDLA
MRDLISRHKPEHLPRKSVMYRVSRTIVRLHDDFDFRFSDDVENFTAFMHGEVFGHDGDKPLMAKNEGEAIVFPNRKVAIG
QRAALMVCKVNTRYEDDQLVYD
>Q87Q40 3.5.1.96~~~astE~~~Succinylglutamate desuccinylase~~~COG2988
MTKSLFRQSFLTDTLDVHIDVAPAEQVLSNGVQLKLYQRGVLEVIPENPTQETKNIIISCGIHGDETAPMELVDSIIKDI
ESGFQKVDARCLFIIAHPESTLAHTRFLEENLNRLFDEKEHEPTKELAIADTLKLLVRDFYQDTEPKTRWHLDLHCAIRG
SKHYTFAVSPKTRHPVRSKALVDFLDSAHIEAVLLSNSPSSTFSWYSAENYSAQALTMELGRVARIGENALDRLTAFDLA
LRNLIAEAQPEHLSKPCIKYRVSRTIVRLHDDFDFMFDDNVENFTSFVHGEVFGHDGDKPLMAKNDNEAIVFPNRHVAIG
QRAALMVCEVKTRFEEGELVYD
>P80358 2.3.1.109~~~aruG~~~Arginine N-succinyltransferase subunit beta~~~
MIVRPVTSADLPALIELARSTGTGLTTLPANEQRLQHRVSWAEKAFRGEAERGDADYLFVLEDDAGKVVGISAIAGAVGL
REPWYNYRVGLTVSASQELNIHREIPTLFLANDLTGNSELCSLFLHADHRSGLNGKLLSRARFLFIAEFRHLFGDKLIAE
MRGMSDEEGRSPFWESLGRHFFKMEFSQADYLTGVGNKAFIAELMPKFPLYTCFLSEEARGVIGRVHPNTEPALAMLKAE
GFSYQGYVDIFDAGPAIEAETDKIRAIAESQNLVLAVGTPGDDAEPYLIHNRKREDCRITAAPARAAAGTLVVDPLTAKR
LRLSAGASVRAVPLSAQKRG
>E8RMD3 ~~~AtxA1~~~Astexin-1~~~
MHTPIISETVQPKTAGLIVLGKASAETRGLSQGVEPDIGQTYFEESRINQD
>E8RUP9 ~~~AtxA2~~~Astexin-2~~~
MTKRTTIAARRVGLIDLGKATRQTKGLTQIQALDSVSGQFRDQLGLSAD
>E8RUP8 ~~~AtxA3~~~Astexin-3~~~
MRTYNRSLPARAGLTDLGKVTTHTKGPTPMVGLDSVSGQYWDQHAPLAD
>D7P5V0 1.14.13.215~~~asuE1~~~Protoasukamycin 4-monooxygenase~~~
MTTSDPDLSGLETDVCVVGGGATALYGALLCARAGQSVVLVFSQPEFEAAGAGISPLLAPPTLGLLAASGIDEQLTAAGR
KVLGVDDHGSTGMLSSWRYADHAGIARPYGLTVPTGTTVQALLAELRAQPRATVLTGEGVVSVEQDDERVVLGFERAAGQ
DGGVPARRRVAARYAVAADGRQSALRDLVGIRLEVSAFDRPAWLLVAPDVPGRESVLLVRHRAPRALFTIPTPGPSSAVV
WAPDRDQEKQLEQGGPAELAEQIKEVDPELSEWLGTVGGRTSPVMRLGFSLWRAPSWRVGRVLLVGESVHGLHTLGGQGL
NQSLQGAASAARAIGEALASGDPAAIEDYERVRRPHVERLQDLQWNLQALGYGTAPAVKGAHEDFIDVMTALPPELVAQL
DGSDTRA
>D7P5W0 1.5.1.42~~~asuE2~~~NADH-dependent FMN reductase AsuE2~~~
MSTHTARRAGATAGHDRDRGTEPGRTEFRRAMGLLPTGVAVVTVGSGEQTEAVTVGSVVSVSLDPALVLVSLGSTGRLVE
AIDRAGGFAVNVLTTEQSDLSACFASHDRPHGRGAEERLGGVAGASGHVLLRDAVLSMECRTEHRYPGGDHVLFLGRVDE
LHTAEAAGSPLVHHRGAYTTLKDPC
>D7P5X0 1.14.13.-~~~asuE3~~~4-hydroxyprotoasukamycin monooxygenase~~~
MERISFGAFLSPLHPLGEDPGLSLWRDLELAEWLDQFGYEELWVGEHHSAGWGTISSPELFIATAAERTRRIRLGTGVVS
LPYHHPFMVASRAVQLDHLTGGRFTLGVGAGSIPSDMHFLGIDPADTRRRTAESLDVIHRLLTGDEPVTRVTDWFELHDA
RLQLRPRRASGLPLAVSSAVSPFGMRLAGQYGAAPLSFGVPPRPGSSVDDLAAQWQHAVDSAAEHGRTIDRADWRVALSV
HVADSREQALDDLVDGWMRYRNEYWALLGTPPVHSRTEARKALAELIDRRSTIVGSVDECIDAVRDVQEATGGFGRLLVN
VLDWADREAMKRSFELFARFVAPRFNGSLDGVTASYGWVAEQARAQRAAAR
>K7ZP88 ~~~ataA~~~Trimeric autotransporter adhesin AtaA~~~
MNKIYKVIWNATLLAWVAVSELAKGKTKSTTSKSKAKSLSSSVIVGGIILTTPLSLIAATVQVGGGTNSGTTATASTNCA
DLYNYQNPENSGSGAAGNYNAGNPSVCSIAIGENAQGGTSGTGGSPGIAIGGNSKATGGLSVAIGGYAQATNVGSIALGT
AALSSGFNSLAISRQAAATNNYSIAIGTTSVSKGVGSIAMGHSTNASGDQSIAIGSSDAVNSATATTTYDGTTNTQASGS
KSIAIGASAKASTNNSIALGAGSVTSAQSGNSYLTGVGASATNGVVSVGTSTATRRIQNVADGSAASDAVTVAQLDKAYD
DTNGRLAAALGTGSGAAYNAANNTYTAPTNIGGTGKNTIDDAIKATQRSVVAGSNIVVTPTTASDGSISYSVATSATPTF
TSITVNNAPTAGTDATNKTYVDSKAAASRTEVAAGSNVSGVVKTTGANGQDVYTVNANGTTASAGSSAVTVTPGTKDANN
VTDYKVDLSATTKTDIQKGVDAKNAVDTAGLKFKGDTATTSNTKKLGDTVSITGDTNISTVATTDGVQVKLNPNLDLGAT
GSVKTGNTTINNAGVTADQVTVGGVVINNTSGINAGGKAITNVAAPTNNTDAANKKYVDDAGTALTNLGFGLKAQDGTTV
NKKLGEAVDIVGSNSNISTKVNAGKVEVALSNTLDLGTTGSVTTGSTVINNAGVTATQVTANKVTINNAPTAGTDATNKT
YVDSKAAASRTEVAAGSNVSGVVKTTGANGQDIYAVNANGTTASAGSSAVTVTPGTKDANNVTDYKVDLSATTKTDIQKG
VDAKNAVDTAGLKFKGDTATTSNTKKLGDTVSITGDTNISTVATTDGVQVKLNPNLDLGATGSVKTGNTTINNAGVTADQ
VTVGGVVINNTSGINAGGKAITNVAAPTNNTDAANKKYVDDAGTALTNLGFGLKAQDGTTVNKKLGEAVDIVGSNSNIST
KVNAGKVEVALSNTLDLGTTGSVTTGSTVINNAGVTATQVTANKVTVNNAPTAGTDATNKTYVDSKAAASRTEVAAGSNV
SGVVKTTGANGQDVYTVNANGTTASAGSSAVTVTPGTKDANNVTDYKVDLSATTKTDIQKGVDAKNAVDTAGLKFKGDTA
TTSNTKKLGDTVSITGDTNISTVATTDGVQVKLNPNLDLGATGSVKTGNTTINNAGVTADQVTVGGVVINNTSGINAGGK
AITNVAAPTNNTDAANKKYVDDAGTALTNLGFGLKAQDGTTVNKKLGEAVEVVGADSNITTKVAGGQVAIELNKNLNNLT
GITVNDGTNGTNGSTVIGKDGISVKDGSGNTIAGVDNTALTVKDGSGNTETSINQAINTLNAAQGETDKFAVKYDKNADG
SVNYNNITLAGTTASSTQDATTGKITTTGGTSLNNVASAGDYKDVANASKGVNAGDLNNAVVDATNAATSKGFALQAADG
AKVQKNLGEAVEVVGADSNITTKVAGGQVAIELNKNLNNLTGITVNDGTNGTNGSTVIGKDGISVKDGSGNTIAGVDNTA
LTVKDGSGNTETSINQAINTLNAAQGETDKFAVKYDKNTDGSTNYNSITAGNGNGTAATIGTDTAGNSVVTSGGTKISNV
ANGVNASDAVNKGQLDSLSTGLTNTGFGLKAADGNTVNKKLGEAVDVVGADSNITTKVAGGQVAIELNKNLNNLTGITVN
DGTNGTNGSTVIGKDGISIKDGSGNTIAGVDNTALTVKDGSGNTETSINQAINTLNAAQGETDKFAVKYDKNADGSANYN
NITLAGTTASSTQDATTGKITTTGGTSLNNVASAGDYKDVANASKGVNAGDLNNAVVDATNAATSKGFALQAADGAKVQK
NLGEAVEVVGADSNITTKVVGGQVAIELNKNLNNLTGITVNDGTNGTNGSTVIGKDGISVKDGSGNTIAGVDNTALTVKD
GSGNTETSINQAINTLNAAQGETDKFAVKYDKNADGSVNYNNITLAGTTASSTQDATTGKITTTGGTSLNNVASAGDYKD
VANASKGVNAGDLNNAVVDATNAATSKGFALQAADGAKVQKNLGEAVEVVGADSNITTKVAGGQVAIELNKNLNNLTGIT
VNDGTNGTNGSTVIGKDGISVKDGSGNTIAGVDNTALTVKDGSGNTETSINQAINTLNAAQGETDKFAVKYDKNADGSVN
YNNITLAGTTASSTQDATTGKITTTGGTSLNNVASAGDYKDVANASKGVNAGDLNNAVVDATNAATSKGFALQAADGAKV
QKNLGEAVEVVGADSNITTKVAGGQVAIELNKNLNNLTGITVNDGTNGTNGSTVIGKDGISVKDGSGNTIAGVDNTALTV
KDGSGNTETSINQAINTLNAAQGETDKFAVKYDKNADGSANYNNVTLAGTNGTIISNVKAGAVTSTSTDAINGSQLYGVA
NSVKNAIGGSTTIDATTGAITTTNIGGTGSNTIDGAISSIKDSATKAKTTVSAGDNVVVTSGTNADGSTNYEVATAKDVN
FDKVTVGSVVVDKSSNTIKGLSNTTWNGTAVSGQAATEDQLKTVSDAQGETDKFAVKYDKNADGSANYNSITAGNGNGTA
ATIGTDTAGNSVVTSGGTKISNVANGVNASDAVNKGQLDSLSTGLTNTGFGLKAADGNTVNKKLGEAVDVVGADSNITTK
VAGGQVAIELNKNLNNLTGITVNDGTNGTNGSTVIGKDGISIKDGSGNTIAGVDNTALTVKDSSGNTETSINQAINTLNA
AQGETDKFAVKYDKNADGSVNYNNVTLAGTNGTIIRNVKAGAVTSTSTDAINGSQLYDIANSVKNAIGGSTTRDVTTGAI
TTTNIGGTGSNTIDGAISSIKDSATKAKTTISAGDNVVVTSGTNADGSTNYEVATAKDVNFDKVTVGNVVVDKANDTIQG
LSNKDLNSTDFATKGRAATEEQLKAVITSNITEVVDGNGNKVNIIDQVVNTKPDNKNQDSLFLTYDKQGQETTDRLTIGQ
TVQKMNTDGIKFFHTNADTSKGDLGTTNDSSAGGLNSTAIGVNAIVANGADSSVALGHNTKVNGKQSIAIGSGAEALGNQ
SISIGTGNKVTGDHSGAIGDPTIVNGANSYSVGNNNQVLTDDTFVLGNNVTKTIAGSVVLGNGSAATTGAGEAGYALSVA
TNADKAAITKTTSSTGAVAVGDASSGIYRQITGVAAGSVDSDAVNVAQLKAVGNQVVTTQTTLVNSLGGNAKVNADGTIT
GPTYNVAQGNQTNVGDALTALDNAINTAATTSKSTVSNGQNIVVSKSKNADGSDNYEVSTAKDLTVDSVKAGDTVLNNAG
ITIGNNAVVLNNTGLTISGGPSVTLAGIDAGNKTIQNVANAVNATDAVNKGQLDSAINNVNNNVNELANNAVKYDDASKD
KITLGGGATGTTITNVKDGTVAQGSKDAVNGGQLWNVQQQVDQNTTDISNIKNDINNGTVGLVQQAGKDAPVTVAKDTGG
TTVNVAGTDGNRVVTGVKEGAVNATSKDAVNGSQLNTTNQAVVNYLGGGAGYDNITGSFTAPSYTVGDSKYNNVGGAIDA
LNQADQALNSKIDNVSNKLDNAFRITNNRIDDVEKKANAGIAAAMALESAPYVPGKYTYAAGAAYHGGENAVGVTLRKTA
DNGRWSITGGVAAASQGDASVRIGISGVID
>A3M3H0 ~~~ata~~~Adhesin Ata autotransporter~~~
MNKVYKVIWNASIGAWVATSEIAKSKTKTKSKTLNLSAAVLSGVICFAPNAFAGTNTEGGIGQGTSISGTTSCREGSANT
ANQKDIAIGCGAQTQDRTGSNIANRNNPYNNSTGAYAGAMKQGGAISVGTGAVVEKGLGTAIGSYATTQGISGVAIGTGA
LSSGNTALAVGRQSAATADFSQAIGNVAAATGKGSLAIGHSATAEGYRSIAIGSPDIENADPVAGQAGAAYQPKMATKAT
GKDSIAFGGGAVATEENALAIGAFSESKGKKSVAIGTGAKAQKDNAVVIGDQAEASFEGGVAIGKGARSEAENSIALGKD
SKASQATGESFLTKQSAPTGVLSIGDIGTERRIQNVADGAADSDAATVRQLKAARTHYVSINDNGQPGGNFENDGATGRN
AIAVGVNASAAGREAMAIGGNAQAIGSGAIAMGSSSQTVGRGDVAIGRNASTQGAEGVNSNQSVAIGDQTKAIGDQSVAI
GADVIAKGNSSVAIGGDDVDKIARDTELSNTYTEITGGTLQAGKYPTTEANHGSTAVGVQAVGTGAFSSAFGMTSKATGD
ASSAFGVMSNASGKGAAAFGAVAQATGDGASAMGINSLASGTNSTAIGSGNKPGEGANATGNSSAAIGSGAQATGDNSAA
IGKGAEATNENAAAVGGGAKATGKNAAAIGGGAIADQENAVAVGQGAQSLVEGGVALGARSKVEAKNSVALGQDAVATEA
TGTSFLTNRDASQSNGVISVGSAGKERRITNVEDGSADSDAVTVRQLKNVDSRVNQNTSNIGKNTQNITNLNQKLDDTKT
NLGNQIADTNKNLNDAKKDLGNQITDTNTKLNTTKDQLTTQINDTNTELNNTIGNTKTELNTKIDNTKTELENKGLNFAG
NSGADVHRKLGDKLNIVGGAAASTPAAKTSGENVITRTTQDGIQIELLKDSKFDSVTTGNTTLNTNGLTIKEGPSITKQG
INAGSKQISNVADGINAKDAVNVDQLTKVKDNLNGRITDTNNQLNDAKKDLGNQIADTNKNLNDAKKDLGNQITDTNTKL
NNTKDQLTTQINDTKTELNNTIGNTKTELNSKIDSTKTELENKGLNFAGNSGADVHRKLGEKLNIIGGAAASTPAAKTSG
ENVITRTTKDGIQIELLKDSKFDSVTTGNTTLNTNGLTIKEGPSITKDGINASGKQITNVADGVNAKDAVNKGQLDNLAA
KQNATDDAAVKYDDAKTKDKVTLKGKDGTVLDNVKAGHISSTSKEAVNGSQIHNISNSIKNSIGGNTVVNPDGSLTTNNI
GGTGKNNINDAISEVKNTATKAKTTVTEGDNIVVKETVNKDGSTNYEVSTKKDLTLNSVTTGDTVLNNNGLTIKDGPSIT
KDGVNAGGKKITDVANGVIAQNSKDAVNGAQVHHISNSIKNSIGGNTVVNPDGSLTTNNIGGTGKNNINDAIKSVDEKVT
NGVNDLTQKGLNFGANDQKTTQGKAVHRKLGDTINIVGGADAKTAEDKTSGENIITRTTEDGVKIEMLKDVKFDSVNVGG
HVLNQQGLIIKGGPSITVNGINAGGKQITNVADGINAKDAVNKGQLDKQINEVKDQIGKDIGKLSDHAVQYDKDKNGNVD
KSSVTLGGGEKGTNLKNVADGKVAEGSKDAVNGGQLWNVQNQVDKNSNDIKNIQNNIDNISNGKAGLVQQQKPNGEITVG
RDTGGTSINMAGKEGDRVVQGVKDGEIKAGSNQAVNGGQIHKISESIKNSIGGNTTIDPKDGSITTNNIGGTGKNNINDA
IGTLNQSNQELGNKITNLGDQLQQVFYDTNKRIDDVEKKANAGIAAAMALENAPFVAGKYTYAVGAAYHGGENAVGVTLR
KTSDNGRWSITGGVAAASQGEPSVRVGISGVIN
>Q8EFW6 ~~~atcA~~~Adaptation to cold protein A~~~
MPMAKKLNISRINELKTNAYDNIESYDDPDTPKALEQFTSQIKKVLQADPKMLESVPEYLPVALYGRVKFPSDAKLKWAH
WINTATQPDWDEFKVTIGFNNADLPLVLAVRAYSEDLLIESCAVLYLLENQGKATPAPKRSADDDFEDEDSDYADYSDDD
DDEGEEEDGYYDHYDDEDR
>Q8EFW7 ~~~atcB~~~Adaptation to cold protein B~~~
MNTSLIEITVAEIKELADVEPKQASKRFELIATTMNDEQLVEVIEKMDIVTLTQINSHHDISCPSIMSELMTPEQIRDIV
CQQPLYWEEKIKNNAEELIQHTFDFLTYLIRIQDSEEKQTAILECIAEDPAGLFYLSIPFIEMMLGEGHDDEDHINDYYD
DEEDTDTIGYDSRVASEEAHSFSLDDPRSLMALIHELAPDVEKAIKNLLRNESSGWETIINKFVNELVIQAKEKNQVTDE
YAEVDDMFSFLD
>Q8EFW9 ~~~atcC~~~Adaptation to cold protein C~~~
MQLVLRDIDQGPFLSKVLAKGQADDTLSGEQLAQIKSKAILMSLKLADKFYNKYKMHLLEQAAHDVIGVVSLGLMELSNQ
DQQQALRLLITADGVVKCFQKGWSMLSVVSKHKLVNSKSLYGDVDKFLLEQVSTPPDADEWLGYEAYQDALVEHQRQQSI
AALMAQFYAQTSYDPLDFLNLESVLAEAVLYRMLFDNAKVRQDLKKRIAKISLQDEWFSLEYIEQQTQQALAELPAELAD
TIGKDLGKNFAPALLRTLHFAKSYRELLLNDASPERLERFEHKEGLVGLLGWPLYIVL
>Q8EFW5 ~~~atcJ~~~Adaptation to cold protein J~~~COG0484
MINHFSVLGIKPSAKEDDIKKAYRRLSNKYHPDKLLGASDEEKEQASQQLERVKKAYEVLSDPKLRNAFIRDFNNVIVTD
PNSAMRELWDQFYP
>O34431 7.2.2.10~~~yloB~~~Calcium-transporting ATPase~~~COG0474
MKFHEMGQTDLLEATNTSMKQGLTEKEVKKRLDKHGPNELQEGKKTSALLLFFAQFKDFMVLVLLAATLISGFLGEYVDA
VAIIAIVFVNGILGFFQERRAEQSLQALKELSTPHVMALREGSWTKIPSKELVPGDIVKFTSGDRIGADVRIVEARSLEI
EESALTGESIPVVKHADKLKKPDVSLGDITNMAFMGTIVTRGSGVGVVVGTGMNTAMGKIADMLESAGTLSTPLQRRLEQ
LGKILIVVALLLTVLVVAVGVIQGHDLYSMFLAGVSLAVAAIPEGLPAIVTVALSLGVQRMIKQKSIVRKLPAVETLGCA
SIICSDKTGTMTQNKMTVTHVWSGGKTWRVAGAGYEPKGSFTLNEKEISVNEHKPLQQMLLFGALCNNSNIEKRDGEYVL
DGDPTEGALLTAARKGGFSKEFVESNYRVIEEFPFDSARKMMTVIVENQDRKRYIITKGAPDVLMQRSSRIYYDGSAALF
SNERKAETEAVLRHLASQALRTIAVAYRPIKAGETPSMEQAEKDLTMLGLSGIIDPPRPEVRQAIKECREAGIKTVMITG
DHVETAKAIAKDLRLLPKSGKIMDGKMLNELSQEELSHVVEDVYVFARVSPEHKLKIVKAYQENGHIVAMTGDGVNDAPA
IKQADIGVSMGITGTDVAKEASSLVLVDDNFATIKSAIKEGRNIYENIRKFIRYLLASNVGEILVMLFAMLLALPLPLVP
IQILWVNLVTDGLPAMALGMDQPEGDVMKRKPRHPKEGVFARKLGWKVVSRGFLIGVATILAFIIVYHRNPENLAYAQTI
AFATLVLAQLIHVFDCRSETSVFSRNPFQNLYLIGAVLSSILLMLVVIYYPPLQPIFHTVAITPGDWMLVIGMSAIPTFL
LAGSLLTRKK
>P37278 7.2.2.10~~~pacL~~~Calcium-transporting ATPase~~~COG0474
MKGAIVSASLTDVRQPIAHWHSLTVEECHQQLDAHRNGLTAEVAADRLALYGPNELVEQAGRSPLQILWDQFANIMLLML
LAVAVVSGALDLRDGQFPKDAIAILVIVVLNAVLGYLQESRAEKALAALKGMAAPLVRVRRDNRDQEIPVAGLVPGDLIL
LEAGDQVPADARLVESANLQVKESALTGEAEAVQKLADQQLPTDVVIGDRTNCLFQGTEVLQGRGQALVYATGMNTELGR
IATLLQSVESEKTPLQQRLDKLGNVLVSGALILVAIVVGLGVLNGQSWEDLLSVGLSMAVAIVPEGLPAVITVALAIGTQ
RMVQRESLIRRLPAVETLGSVTTICSDKTGTLTQNKMVVQQIHTLDHDFTVTGEGYVPAGHFLIGGEIIVPNDYRDLMLL
LAAGAVCNDAALVASGEHWSIVGDPTEGSLLTVAAKAGIDPEGLQRVLPRQDEIPFTSERKRMSVVVADLGETTLTIREG
QPYVLFVKGSAELILERCQHCFGNAQLESLTAATRQQILAAGEAMASAGMRVLGFAYRPSAIADVDEDAETDLTWLGLMG
QIDAPRPEVREAVQRCRQAGIRTLMITGDHPLTAQAIARDLGITEVGHPVLTGQQLSAMNGAELDAAVRSVEVYARVAPE
HKLRIVESLQRQGEFVAMTGDGVNDAPALKQANIGVAMGITGTDVSKEASDMVLLDDNFATIVAAVEEGRIVYGNIRKFI
KYILGSNIGELLTIASAPLLGLGAVPLTPLQILWMNLVTDGIPALALAVEPGDPTIMQRRPHNPQESIFARGLGTYMLRV
GVVFSAFTIVLMVIAYQYTQVPLPGLDPKRWQTMVFTTLCLAQMGHAIAVRSDLLTIQTPMRTNPWLWLSVIVTALLQLA
LVYVSPLQKFFGTHSLSQLDLAICLGFSLLLFVYLEAEKWVRQRRY
>P73241 7.2.2.8~~~pacS~~~Probable copper-transporting ATPase PacS~~~COG2217
MAQTINLQLEGMRCAACASSIERAIAKVPGVQSCQVNFALEQAVVSYHGETTPQILTDAVERAGYHARVLKQQVLSSQQT
EDRKPVFSAKLVTGLVISAVLFFGSLPMMLGVNIPHFPHIFHDPWLQWLLATPVQFWSGAEFYRGAWKSVRTRSATMDTL
VALGTSAAYFYSVAITLFPQWLTSQGLAAHVYFEAAAVVITLILLGRSLEQRARRETSAAIRKLMGLQPQTALVKRGEHW
ETVAIAELAINDVVRVRPGEKIPVDGVVVAGNSTVDESLVTGESFPVDKTVGTEVIGATLNKSGSLDIQVSKLGQDSVLA
QIIQLVQQAQASKAPIQHFVDRITHWFVPTVIVVAIAAFCIWWLTTGNITLAVLTLVEVLIIACPCALGLATPTSVMVGT
GKGAEYGVLIKEASSLEMAEKLTAIVLDKTGTLTQGKPSVTNFFTLSPTSTEESLQLIQWAASVEQYSEHPLAEAVVNYG
QSQQVSLLEIDNFQAIAGCGVAGQWQGQWIRLGTSNWLTDLGVTGTEHQPWQSQAQQWEKEQKTVIWLAVDTEVKALLAI
ADAIKPSSPQVVQALKKLGLSVYMLTGDNQATAQAIADTVGIRHVLAQVRPGDKAQQVEQLQQKGNIVAMVGDGINDAPA
LAQADVGIAIGTGTDVAIAASDITLIAGDLQGILTAIKLSRATMGNIRQNLFFAFIYNVIGIPVAAGLFYPLFGLLLNPI
LAGAAMAFSSVSVVTNALRLKKFCP
>Q9X5V3 7.2.2.9~~~actP~~~Copper-transporting P-type ATPase~~~
MNIKQEDDHHHSHAHGDNHCHCGHDQEKAADAIVRDPICGMTVDPQAGKPSLGHGGRIYHFCSEHCRTKFAAAPEDYLTA
KDPVCGMSVDRSTARYFLKAEGEKFYFCSAACQAKFEADPAAYRDGQRPTAKPAPKGTLYTCPMHPEVVSDRPGDCPKCG
MALEPMGIPPTDEGPNPELVDFVRRLWVSAILALPLLALGMGPMLGLPLREAIGEPQATFIELLLATPVVLWAALPFFRR
AWASVVNRSPNMWTLIGLGVGTAYLYSVVATLAPGIFPMSFRGHGAAVPVYFEAAAVIVALVFVGQVLELKARERTGSAI
RALLDLAPKTARRIDAEGNESDVPVDDINVADRLRVRPGERVPVDGSVLEGQSTVDESMISGEPLPVEKSKGDPLTGGTI
NKNGTFVMSAEKVGADTVLSRIVDMVAKAQRSRAPIQGAVDRVSAVFVPAVVAVALLAFLAWAAIGPEPRMANGLLAAVA
VLIIACPCALGLATPMSIMIATGRGAGEGVLIKDAEALERFSKGDTLIVDKTGTLTEGKPKLTDIAAFGRVGEDRLLSLA
ASLERGSEHPLAEAIVSGAEERGVPFVEVTGFEAKTGKGVQGIADGTMVALGNSAMLADLGIDPAALSEKTEALRGDGKT
VMFVVFDGALAGLVAVADRIKPTTAAAIQALHDSGLKIIMATGDNERTARAVAKSLGIDEVRADVLPEGKKALIDELRSK
GAIIAMAGDGVNDAPALAAADVGIAMGTGADVAMESAGITLVKGDLTGIVRARRLAEATMRNIRQNLGFAFGYNALGVPV
AAGVLYPILGLLLSPMIAAAAMSLSSVSVISNALRLRFAKL
>Q9X5X3 7.2.2.9~~~actP~~~Copper-transporting P-type ATPase~~~
MTALKQTEKSTSLPMSFDFDIEGMTCASCVRRVEKAIAAVPGVASANVNLATERATVQFNGVPETTSVLRAVEKAGYAPR
IVTEEIQIEGMTCASCVSRVEKALKAVPGVADASVNLATEKATVRLVSGSAEISALAAAVKGAGYGIRKATPAEAMKEDV
DHRTAELRSLKSAVTISSLMTLPLFLLEMGSHFIPGVHDFIMGTIGMRNNLYLQFALATLVLFGPGLRFFRKGVPNLLRW
TPDMNSLVVLGTTAAWGYSVVTTFVPAILPSGTANVYYEAAAVIVTLILVGRYLESRAKGRTSQAIKRLVGLQPKTAFVL
HSGEFVETEITEVVTGDVIRIRPGEKIPVDGTVTDGSSYVDESMITGEPVPVQKATDSAVIGGTINKTGSITFKATKVGS
DTLLAQIIRLVEAAQGSKLPIQALVDRVTAWFVPVVILAALLTFAAWYVLGPSPALSFALVNAVAVLIIACPCAMGLATP
TSIMVGTGRAAELGILFRKGEALQSLRDADVVAVDKTGTLTKGRPELTDLVAAEGFEPDEVLCLVASLETLSEHPIAEAI
VSAAKSRGIATVAVSAFEATPGFGVSGTVSGRRVLVGADRALVKNGIDITGFADEAERLGSGGKSPLYAAIDGRLAAIVA
VSDPVKESTPQAIKSLHALGLKVAMVTGDNRRTAEAIAKKLGIDEVVAEVLPEGKVDAVRKLRQGGRSVAFIGDGINDAP
ALAEADVGIAVGTGTDIAIESADVVLMSGDLNGVAKALALSKATIRNIKQNLFWAFVYNISLVPVAAGVLYPVNGTLLSP
IFAAAAMAMSSVFVLGNALRLKSFDPA
>Q44249 6.3.1.18~~~atdA1~~~Gamma-glutamylanilide synthase~~~
MSEKLDFITKNNLWTDKQRDAADKVLAEIDSLGLEMIRLSWADQYGLLRGKALSVAALKAAFSEGSEVTMAPFSFNLVSE
WVFNPFTAGGGFGIDEFDELGGVPSVVMVPDPTTFKVLPWADKTGWMLADLHWKSGEPFPLCPRGIMKKAVKSLSDEGYL
FKCGIELEWYLTKIVDRSLSPESLGAPGVQPDAIQVQPVAQGYSYLLEYHLDQVDDIMSKVRKGLLELNLPLRSIEDELA
PSQMETTFDVMEGLEAADAALLIKSAIKQICSRHGYHATFMCKPAINGFSVASGWHMHQSLVDKDTRKNLFIPSEGEVVS
PLGRAYAGGLLANGSAASSFTTPTVNGYRRRQPHSLAPDRRAWAKENKAAMVRVISATGDPASRIENRIGEPGANPYLYM
ASQIVSGLDGIKIKRDPGGLQGAPYGAQVPMLPTALAEALDALEHDSELFRSCFGDTFIKYWLQLRRSEWARFLDAEGAE
AAEPTGAVTQWEQKEYFNLL
>P0A951 2.3.1.57~~~speG~~~Spermidine N(1)-acetyltransferase~~~COG1670
MPSAHSVKLRPLEREDLRYVHQLDNNASVMRYWFEEPYEAFVELSDLYDKHIHDQSERRFVVECDGEKAGLVELVEINHV
HRRAEFQIIISPEYQGKGLATRAAKLAMDYGFTVLNLYKLYLIVDKENEKAIHIYRKLGFSVEGELMHEFFINGQYRNAI
RMCIFQHQYLAEHKTPGQTLLKPTAQ
>Q9KL03 2.3.1.57~~~speG~~~Spermidine N(1)-acetyltransferase~~~COG1670
MNSQLTLRALERGDLRFIHNLNNNRNIMSYWFEEPYESFDELEELYNKHIHDNAERRFVVEDAQKNLIGLVELIEINYIH
RSAEFQIIIAPEHQGKGFARTLINRALDYSFTILNLHKIYLHVAVENPKAVHLYEECGFVEEGHLVEEFFINGRYQDVKR
MYILQSKYLNRSE
>O25711 3.6.1.-~~~~~~ATP/GTP phosphatase~~~COG1106
MIQSVRIKNFKNFKNTKIDGFTKLNIITGQNNAGKSNLLEALYYLVGKSMHPCTNVLEIYDNIRKEPLTSESKSLMFYGL
DTKEEIQIVTTLDNNQTLDLQIKFIASENQKVIESQIIPTAEQTQMSSQLNFTLKKNNEEIYNDHLNIAKVPNFPPIPNQ
SGYNRQFKNFDSNQLQKLLPFESAVIIPSDVVYRQAHMIQAVSKICSNNQLEEELNKHLNQFDNNIQAISFNTNNQLKLK
VKDIKEKVPLSVFGDGLKKYLHIVSAFMADNAKTIYIDEVENGLHFSRMRLLLKNTIDFINNNKDGNLQVFMTTHSQEFI
EILDQVIREKDFAHQTKLFCLKQDDQYVIPRTYYGENLEYYFENEENLFG
>P0AFP2 ~~~atl~~~DNA base-flipping protein~~~COG3695
MLVSCAMRLHSGVFPDYAEKLPQEEKMEKEDSFPQRVWQIVAAIPEGYVTTYGDVAKLAGSPRAARQVGGVLKRLPEGST
LPWHRVVNRHGTISLTGPDLQRQRQALLAEGVMVSGSGQIDLQRYRWNY
>Q2FZK7 ~~~atl~~~Bifunctional autolysin~~~COG3266
MAKKFNYKLPSMVALTLVGSAVTAHQVQAAETTQDQTTNKNVLDSNKVKATTEQAKAEVKNPTQNISGTQVYQDPAIVQP
KTANNKTGNAQVSQKVDTAQVNGDTRANQSATTNNTQPVAKSTSTTAPKTNTNVTNAGYSLVDDEDDNSENQINPELIKS
AAKPAALETQYKTAAPKAATTSAPKAKTEATPKVTTFSASAQPRSVAATPKTSLPKYKPQVNSSINDYICKNNLKAPKIE
EDYTSYFPKYAYRNGVGRPEGIVVHDTANDRSTINGEISYMKNNYQNAFVHAFVDGDRIIETAPTDYLSWGVGAVGNPRF
INVEIVHTHDYASFARSMNNYADYAATQLQYYGLKPDSAEYDGNGTVWTHYAVSKYLGGTDHADPHGYLRSHNYSYDQLY
DLINEKYLIKMGKVAPWGTQSTTTPTTPSKPTTPSKPSTGKLTVAANNGVAQIKPTNSGLYTTVYDKTGKATNEVQKTFA
VSKTATLGNQKFYLVQDYNSGNKFGWVKEGDVVYNTAKSPVNVNQSYSIKPGTKLYTVPWGTSKQVAGSVSGSGNQTFKA
SKQQQIDKSIYLYGSVNGKSGWVSKAYLVDTAKPTPTPTPKPSTPTTNNKLTVSSLNGVAQINAKNNGLFTTVYDKTGKP
TKEVQKTFAVTKEASLGGNKFYLVKDYNSPTLIGWVKQGDVIYNNAKSPVNVMQTYTVKPGTKLYSVPWGTYKQEAGAVS
GTGNQTFKATKQQQIDKSIYLFGTVNGKSGWVSKAYLAVPAAPKKAVAQPKTAVKAYTVTKPQTTQTVSKIAQVKPNNTG
IRASVYEKTAKNGAKYADRTFYVTKERAHGNETYVLLNNTSHNIPLGWFNVKDLNVQNLGKEVKTTQKYTVNKSNNGLSM
VPWGTKNQVILTGNNIAQGTFNATKQVSVGKDVYLYGTINNRTGWVNAKDLTAPTAVKPTTSAAKDYNYTYVIKNGNGYY
YVTPNSDTAKYSLKAFNEQPFAVVKEQVINGQTWYYGKLSNGKLAWIKSTDLAKELIKYNQTGMTLNQVAQIQAGLQYKP
QVQRVPGKWTDAKFNDVKHAMDTKRLAQDPALKYQFLRLDQPQNISIDKINQFLKGKGVLENQGAAFNKAAQMYGINEVY
LISHALLETGNGTSQLAKGADVVNNKVVTNSNTKYHNVFGIAAYDNDPLREGIKYAKQAGWDTVSKAIVGGAKFIGNSYV
KAGQNTLYKMRWNPAHPGTHQYATDVDWANINAKIIKGYYDKIGEVGKYFDIPQYK
>Q931U5 ~~~atl~~~Bifunctional autolysin~~~
MAKKFNYKLPSMVALTLVGSAVTAHQVQAAETTQDQTTNKNVLDSNKVKATTEQAKAEVKNPTQNISGTQVYQDPAIVQP
KTANNKTGNAQVSQKVDTAQVNGDTRANQSATTNNTQPVAKSTSTTAPKTNTNVTNAGYSLVDDEDDNSEHQINPELIKS
AAKPAALETQYKAAAPKAKTEATPKVTTFSASAQPRSVAATPKTSLPKYKPQVNSSINDYIRKNNLKAPKIEEDYTSYFP
KYAYRNGVGRPEGIVVHDTANDRSTINGEISYMKNNYQNAFVHAFVDGDRIIETAPTDYLSWGVGAVGNPRFINVEIVHT
HDYASFARSMNNYADYAATQLQYYGLKPDSAEYDGNGTVWTHYAVSKYLGGTDHADPHGYLRSHNYSYDQLYDLINEKYL
IKMGKVAPWGTQFTTTPTTPSKPTTPSKPSTGKLTVAANNGVAQIKPTNSGLYTTVYDKTGKATNEVQKTFAVSKTATLG
NQKFYLVQDYNSGNKFGWVKEGDVVYNTAKSPVNVNQSYSIKSGTKLYTVPWGTSKQVAGSVSGSGNQTFKASKQQQIDK
SIYLYGSVNGKSGWVSKAYLVDTAKPTPTPIPKPSTPTTNNKLTVSSLNGVAQINAKNNGLFTTVYDKTGKPTKEVQKTF
AVTKEASLGGNKFYLVKDYNSPTLIGWVKQGDVIYNNAKSPVNVMQTYTVKPGTKLYSVPWGTYKQEAGAVSGTGNQTFK
ATKQQQIDKSIYLFGTVNGKSGWVSKAYLAVPAAPKKAVAQPKTAVKAYTVTKPQTTQTVSKIAQVKPNNTGIRASVYEK
TAKNGAKYADRTFYVTKERAHGNETYVLLNNTSHNIPLGWFNVKDLNVQNLGKEVKTTQKYTVNKSNNGLSMVPWGTKNQ
VILTGNNIAQGTFNATKQVSVGKDVYLYGTINNRTGWVNAKDLTAPTAVKPTTSAAKDYNYTYVIKNGNGYYYVTPNSDT
AKYSLKAFNEQPFAVVKEQVINGQTWYYGKLSNGKLAWIKSTDLAKELIKYNQTGMTLNQVAQIQAGLQYKPQVQRVPGK
WTDANFNDVKHAMDTKRLAQDPALKYQFLRLDQPQNISIDKINQFLKGKGVLENQGAAFNKAAQMYGINEVYLISHALLE
TGNGTSQLAKGADVVNNKVVTNSNTKYHNVFGIAAYDNDPLREGIKYAKQAGWDTVSKAIVGGAKFIGNSYVKAGQNTLY
KMRWNPAHPGTHQYATDVDWANINAKIIKGYYDKIGEVGKYFDIPQYK
>Q99V41 ~~~atl~~~Bifunctional autolysin~~~
MAKKFNYKLPSMVALTLVGSAVTAHQVQAAETTQDQTTNKNVLDSNKVKATTEQAKAEVKNPTQNISGTQVYQDPAIVQP
KTANNKTGNAQVSQKVDTAQVNGDTRANQSATTNNTQPVAKSTSTTAPKTNTNVTNAGYSLVDDEDDNSEHQINPELIKS
AAKPAALETQYKAAAPKAKTEATPKVTTFSASAQPRSVAATPKTSLPKYKPQVNSSINDYIRKNNLKAPKIEEDYTSYFP
KYAYRNGVGRPEGIVVHDTANDRSTINGEISYMKNNYQNAFVHAFVDGDRIIETAPTDYLSWGVGAVGNPRFINVEIVHT
HDYASFARSMNNYADYAATQLQYYGLKPDSAEYDGNGTVWTHYAVSKYLGGTDHADPHGYLRSHNYSYDQLYDLINEKYL
IKMGKVAPWGTQFTTTPTTPSKPTTPSKPSTGKLTVAANNGVAQIKPTNSGLYTTVYDKTGKATNEVQKTFAVSKTATLG
NQKFYLVQDYNSGNKFGWVKEGDVVYNTAKSPVNVNQSYSIKSGTKLYTVPWGTSKQVAGSVSGSGNQTFKASKQQQIDK
SIYLYGSVNGKSGWVSKAYLVDTAKPTPTPIPKPSTPTTNNKLTVSSLNGVAQINAKNNGLFTTVYDKTGKPTKEVQKTF
AVTKEASLGGNKFYLVKDYNSPTLIGWVKQGDVIYNNAKSPVNVMQTYTVKPGTKLYSVPWGTYKQEAGAVSGTGNQTFK
ATKQQQIDKSIYLFGTVNGKSGWVSKAYLAVPAAPKKAVAQPKTAVKAYTVTKPQTTQTVSKIAQVKPNNTGIRASVYEK
TAKNGAKYADRTFYVTKERAHGNETYVLLNNTSHNIPLGWFNVKDLNVQNLGKEVKTTQKYTVNKSNNGLSMVPWGTKNQ
VILTGNNIAQGTFNATKQVSVGKDVYLYGTINNRTGWVNAKDLTAPTAVKPTTSAAKDYNYTYVIKNGNGYYYVTPNSDT
AKYSLKAFNEQPFAVVKEQVINGQTWYYGKLSNGKLAWIKSTDLAKELIKYNQTGMTLNQVAQIQAGLQYKPQVQRVPGK
WTDANFNDVKHAMDTKRLAQDPALKYQFLRLDQPQNISIDKINQFLKGKGVLENQGAAFNKAAQMYGINEVYLISHALLE
TGNGTSQLAKGADVVNNKVVTNSNTKYHNVFGIAAYDNDPLREGIKYAKQAGWDTVSKAIVGGAKFIGNSYVKAGQNTLY
KMRWNPAHPGTHQYATDVDWANINAKIIKGYYDKIGEVGKYFDIPQYK
>P0C5Z8 ~~~atl~~~Bifunctional autolysin~~~
MLGVINRMAKKFNYKLPSMVALTLVGSAVTAHQVQAAETTQDQTTNKNVLDSNKVKATTEQAKAEVKNPTQNISGTQVYQ
DPAIVQPKTANNKTGNAQVSQKVDTAQVNGDTRANQSATTNNTQPVAKSTSTTAPKTNTNVTNAGYSLVDDEDDNSEHQI
NPELIKSAAKPAALETQYKAAAPKAKTEATPKVTTFSASAQPRSVAATPKTSLPKYKPQVNSSINDYIRKNNLKAPKIEE
DYTSYFPKYAYRNGVGRPEGIVVHDTANDRSTINGEISYMKNNYQNAFVHAFVDGDRIIETAPTDYLSWGVGAVGNPRFI
NVEIVHTHDYASFARSMNNYADYAATQLQYYGLKPDSAEYDGNGTVWTHYAVSKYLGGTDHADPHGYLRSHNYSYDQLYD
LINEKYLIKMGKVAPWGTQFTTTPTTPSKPTTPSKPSTGKLTVAANNGVAQIKPTNSGLYTTVYDKTGKATNEVQKTFAV
SKTATLGNQKFYLVQDYNSGNKFGWVKEGDVVYNTAKSPVNVNQSYSIKSGTKLYTVPWGTSKQVAGSVSGSGNQTFKAS
KQQQIDKSIYLYGSVNGKSGWVSKAYLVDTAKPTPTPIPKPSTPTTNNKLTVSSLNGVAQINAKNNGLFTTVYDKTGKPT
KEVQKTFAVTKEASLGGNKFYLVKDYNSPTLIGWVKQGDVIYNNAKSPVNVMQTYTVKPGTKLYSVPWGTYKQEAGAVSG
TGNQTFKATKQQQIDKSIYLFGTVNGKSGWVSKAYLAVPAAPKKAVAQPKTAVKAYTVTKPQTTQTVSKIAQVKPNNTGI
RASVYEKTAKNGAKYADRTFYVTKERAHGNETYVLLNNTSHNIPLGWFNVKDLNVQNLGKEVKTTQKYTVNKSNNGLSMV
PWGTKNQVILTGNNIAQGTFNATKQVSVGKDVYLYGTINNRTGWVNAKDLTAPTAVKPTTSAAKDYNYTYVIKNGNGYYY
VTPNSDTAKYSLKAFNEQPFAVVKEQVINGQTWYYGKLSNGKLAWIKSTDLAKELIKYNQTGMTLNQVAQIQAGLQYKPQ
VQRVPGKWTDANFNDVKHAMDTKRLAQDPALKYQFLRLDQPQNISIDKINQFLKGKGVLENQGAAFNKAAQMYGINEVYL
ISHALLETGNGTSQLAKGADVVNNKVVTNSNTKYHNVFGIAAYDNDPLREGIKYAKQAGWDTVSKAIVGGAKFIGNSYVK
AGQNTLYKMRWNPAHPGTHQYATDVDWANINAKIIKGYYDKIGEVGKYFDIPQYK
>O33635 ~~~atl~~~Bifunctional autolysin~~~
MAKKFNYKLPSMVALTLFGTAFTAHQANAAEQPQNQSNHKNVLDDQTALKQAEKAKSEVTQSTTNVSGTQTYQDPTQVQP
KQDTQSTTYDASLDEMSTYNEISSNQKQQSLSTDDANQNQTNSVTKNQQEETNDLTQEDKTSTDTNQLQETQSVAKENEK
DLGANANNEQQDKKMTASQPSENQAIETQTASNDNESQQKSQQVTSEQNETATPKVSNTNASGYNFDYDDEDDDSSTDHL
EPISLNNVNATSKQTTSYKYKEPAQRVTTNTVKKETASNQATIDTKQFTPFSATAQPRTVYSVSSQKTSSLPKYTPKVNS
SINNYIRKKNMKAPRIEEDYTSYFPKYGYRNGVGRPEGIVVHDTANDNSTIDGEIAFMKRNYTNAFVHAFVDGNRIIETA
PTDYLSWGAGPYGNQRFINVEIVHTHDYDSFARSMNNYADYAATQLQYYNLKPDSAENDGRGTVWTHAAISNFLGGTDHA
DPHQYLRSHNYSYAELYDLIYEKYLIKTKQVAPWGTTSTKPSQPSKPSGGTNNKLTVSANRGVAQIKPTNNGLYTTVYDS
KGHKTDQVQKTLSVTKTATLGNNKFYLVEDYNSGKKYGWVKQGDVVYNTAKAPVKVNQTYNVKAGSTLYTVPWGTPKQVA
SKVSGTGNQTFKATKQQQIDKATYLYGTVNGKSGWISKYYLTTASKPSNPTKPSTNNQLTVTNNSGVAQINAKNSGLYTT
VYDTKGKTTNQIQRTLSVTKAATLGDKKFYLVGDYNTGTNYGWVKQDEVIYNTAKSPVKINQTYNVKPGVKLHTVPWGTY
NQVAGTVSGKGDQTFKATKQQQIDKATYLYGTVNGKSGWISKYYLTAPSKVQALSTQSTPAPKQVKPSTQTVNQIAQVKA
NNSGIRASVYDKTAKSGTKYANRTFLINKQRTQGNNTYVLLQDGTSNTPLGWVNINDVTTQNIGKQTQSIGKYSVKPTNN
GLYSIAWGTKNQQLLAPNTLANQAFNASKAVYVGKDLYLYGTVNNRTGWIAAKDLIQNSTDAQSTPYNYTFVINNSKSYF
YMDPTKANRYSLKPYYEQTFTVIKQKNINGVKWYYGQLLDGKYVWIKSTDLVKEKIKYAYTGMTLNNAINIQSRLKYKPQ
VQNEPLKWSNANYSQIKNAMDTKRLANDSSLKYQFLRLDQPQYLSAQALNKLLKGKGVLENQGAAFSQAARKYGLNEIYL
ISHALVETGNGTSQLAKGGDVSKGKFTTKTGHKYHNVFGIGAFDNNALVDGIKYAKNAGWTSVSKAIIGGAKFIGNSYVK
AGQNTLYKMRWNPANPGTHQYATDINWANVNAQVLKQFYDKIGEVGKYFEIPTYK
>Q5SI16 ~~~~~~DNA base-flipping protein~~~COG0350
MWLPTPLGPLWLEVSPLGVRRLEPALYPRGPEAEGALALRVREAVQAYFAGERPDFLDLPLDYTGLSPARLRLYERVRLV
PYGRTVSYGALGRELGLSPRAVGAALRACPFFLLVPAHRVIHADGRLGGFQGQEGLKLWLLRFEGAL
>A6B4U8 ~~~~~~DNA base-flipping protein~~~
MDQFLVQIFAVIHQIPKGKVSTYGEIAKMAGYPGYARHVGKALGNLPEGSKLPWFRVINSQGKISLKGRDLDRQKQKLEA
EGIEVSEIGKIALRKYKWQP
>Q2G506 7.-.-.-~~~atm1~~~ATM1-type heavy metal exporter~~~COG5265
MPPETATNPKDARHDGWQTLKRFLPYLWPADNAVLRRRVVGAILMVLLGKATTLALPFAYKKAVDAMTLGGGAQPALTVA
LAFVLAYALGRFSGVLFDNLRNIVFERVGQDATRHLAENVFARLHKLSLRFHLARRTGEVTKVIERGTKSIDTMLYFLLF
NIAPTVIELTAVIVIFWLNFGLGLVTATILAVIAYVWTTRTITEWRTHLREKMNRLDGQALARAVDSLLNYETVKYFGAE
SREEARYASAARAYADAAVKSENSLGLLNIAQALIVNLLMAGAMAWTVYGWSQGKLTVGDLVFVNTYLTQLFRPLDMLGM
VYRTIRQGLIDMAEMFRLIDTHIEVADVPNAPALVVNRPSVTFDNVVFGYDRDREILHGLSFEVAAGSRVAIVGPSGAGK
STIARLLFRFYDPWEGRILIDGQDIAHVTQTSLRAALGIVPQDSVLFNDTIGYNIAYGRDGASRAEVDAAAKGAAIADFI
ARLPQGYDTEVGERGLKLSGGEKQRVAIARTLVKNPPILLFDEATSALDTRTEQDILSTMRAVASHRTTISIAHRLSTIA
DSDTILVLDQGRLAEQGSHLDLLRRDGLYAEMWARQAAESAEVSEAAE
>P0ABB8 7.2.2.14~~~mgtA~~~Magnesium-transporting ATPase, P-type 1~~~COG0474
MFKEIFTRLIRHLPSRLVHRDPLPGAQQTVNTVVPPSLSAHCLKMAVMPEEELWKTFDTHPEGLNQAEVESAREQHGENK
LPAQQPSPWWVHLWVCYRNPFNILLTILGAISYATEDLFAAGVIALMVAISTLLNFIQEARSTKAADALKAMVSNTATVL
RVINDKGENGWLEIPIDQLVPGDIIKLAAGDMIPADLRILQARDLFVAQASLTGESLPVEKAATTRQPEHSNPLECDTLC
FMGTTVVSGTAQAMVIATGANTWFGQLAGRVSEQESEPNAFQQGISRVSMLLIRFMLVMAPVVLLINGYTKGDWWEAALF
ALSVAVGLTPEMLPMIVTSTLARGAVKLSKQKVIVKHLDAIQNFGAMDILCTDKTGTLTQDKIVLENHTDISGKTSERVL
HSAWLNSHYQTGLKNLLDTAVLEGTDEESARSLASRWQKIDEIPFDFERRRMSVVVAENTEHHQLVCKGALQEILNVCSQ
VRHNGEIVPLDDIMLRKIKRVTDTLNRQGLRVVAVATKYLPAREGDYQRADESDLILEGYIAFLDPPKETTAPALKALKA
SGITVKILTGDSELVAAKVCHEVGLDAGEVVIGSDIETLSDDELANLAQRTTLFARLTPMHKERIVTLLKREGHVVGFMG
DGINDAPALRAADIGISVDGAVDIAREAADIILLEKSLMVLEEGVIEGRRTFANMLKYIKMTASSNFGNVFSVLVASAFL
PFLPMLPLHLLIQNLLYDVSQVAIPFDNVDDEQIQKPQRWNPADLGRFMIFFGPISSIFDILTFCLMWWVFHANTPETQT
LFQSGWFVVGLLSQTLIVHMIRTRRVPFIQSCASWPLMIMTVIVMIVGIALPFSPLASYLQLQALPLSYFPWLVAILAGY
MTLTQLVKGFYSRRYGWQ
>D0ZTB2 7.2.2.14~~~mgtA~~~Magnesium-transporting ATPase, P-type 1~~~
MLKIITRQLFARLNRHLPYRLVHRDPLPGAQTAVNATIPPSLSERCLKVAAMEQETLWRVFDTHPEGLNAAEVTRAREKH
GENRLPAQKPSPWWVHLWVCYRNPFNILLTILGGISYATEDLFAAGVIALMVGISTLLNFVQEARSTKAADALKAMVSNT
ATVLRVINENGENAWLELPIDQLVPGDIIKLAAGDMIPADLRIIQARDLFVAQASLTGESLPVEKVAATREPRQNNPLEC
DTLCFMGTNVVSGTAQAVVMATGAGTWFGQLAGRVSEQDNEQNAFQKGISRVSMLLIRFMLVMAPVVLIINGYTKGDWWE
AALFALSVAVGLTPEMLPMIVTSTLARGAVKLSKQKVIVKHLDAIQNFGAMDILCTDKTGTLTQDKIVLENHTDISGKPS
EHVLHCAWLNSHYQTGLKNLLDTAVLEGVDETAARQLSGRWQKIDEIPFDFERRRMSVVVAEDSNVHQLVCKGALQEILN
VCTQVRHNGDIVPLDDNMLRRVKRVTDTLNRQGLRVVAVATKYLPAREGDYQRIDESDLILEGYIAFLDPPKETTAPALK
ALKASGITVKILTGDSELVAAKVCHEVGLDAGDVIIGSDIEGLSDDALAALAARTTLFARLTPMHKERIVTLLKREGHVV
GFMGDGINDAPALRAADIGISVDGAVDIAREAADIILLEKSLMVLEEGVIEGRRTFSNMLKYIKMTASSNFGNVFSVLVA
SAFLPFLPMLPLHLLIQNLLYDVSQVAIPFDNVDEEQIQKPQRWNPADLGRFMVFFGPISSIFDILTFCLMWWVFHANTP
ETQTLFQSGWFVVGLLSQTLIVHMIRTRRLPFIQSRAAWPLMAMTLLVMVVGVSLPFSPLASYLQLQALPLSYFPWLIAI
LVGYMTLTQLVKGFYSRRYGWQ
>P36640 7.2.2.14~~~mgtA~~~Magnesium-transporting ATPase, P-type 1~~~
MLKIITRQLFARLNRHLPYRLVHRDPLPGAQTAVNATIPPSLSERCLKVAAMEQETLWRVFDTHPEGLNAAEVTRAREKH
GENRLPAQKPSPWWVHLWVCYRNPFNILLTILGGISYATEDLFAAGVIALMVGISTLLNFVQEARSTKAADALKAMVSNT
ATVLRVINENGENAWLELPIDQLVPGDIIKLAAGDMIPADLRIIQARDLFVAQASLTGESLPVEKVAATREPRQNNPLEC
DTLCFMGTNVVSGTAQAVVMATGAGTWFGQLAGRVSEQDNEQNAFQKGISRVSMLLIRFMLVMAPVVLIINGYTKGDWWE
AALFALSVAVGLTPEMLPMIVTSTLARGAVKLSKQKVIVKHLDAIQNFGAMDILCTDKTGTLTQDKIVLENHTDISGKPS
EHVLHCAWLNSHYQTGLKNLLDTAVLEGVDETAARQLSGRWQKIDEIPFDFERRRMSVVVAEDSNVHQLVCKGALQEILN
VCTQVRHNGDIVPLDDNMLRRVKRVTDTLNRQGLRVVAVATKYLPAREGDYQRIDESDLILEGYIAFLDPPKETTAPALK
ALKASGITVKILTGDSELVAAKVCHEVGLDAGDVIIGSDIEGLSDDALAALAARTTLFARLTPMHKERIVTLLKREGHVV
GFMGDGINDAPALRAADIGISVDGAVDIAREAADIILLEKSLMVLEEGVIEGRRTFSNMLKYIKMTASSNFGNVFSVLVA
SAFLPFLPMLPLHLLIQNLLYDVSQVAIPFDNVDEEQIQKPQRWNPADLGRFMVFFGPISSIFDILTFCLMWWVFHANTP
ETQTLFQSGWFVVGLLSQTLIVHMIRTRRLPFIQSRAAWPLMAMTLLVMVVGVSLPFSPLASYLQLQALPLSYFPWLIAI
LVGYMTLTQLVKGFYSRRYGWQ
>P22036 7.2.2.14~~~mgtB~~~Magnesium-transporting ATPase, P-type 1~~~
MTDMNIENRKLNRPASENDKQHKKVFPIEAEAFHSPEETLARLNSHRQGLTIEEASERLKVYGRNEVAHEQVPPALIQLL
QAFNNPFIYVLMALAGVSFITDYWLPLRRGEETDLTGVLIILTMVSLSGLLRFWQEFRTNRAAQALKKMVRTTATVLRRG
PGNIGAVQEEIPIEELVPGDVVFLAAGDLVPADVRLLASRDLFISQSILSGESLPVEKYDVMADVAGKDSEQLPDKDKSL
LDLGNICLMGTNVTSGRAQAVVVATGSRTWFGSLAKSIVGTRTQTAFDRGVNSVSWLLIRFMLIMVPVVLLINGFSKGDW
VEASLFALAVAVGLTPEMLPMIVSSNLAKGAIAMSRRKVIVKRLNAIQNFGAMDVLCTDKTGTLTQDNIFLEHHLDVSGV
KSSRVLMLAWLNSSSQSGARNVMDRAILRFGEGRIAPSTKARFIKRDELPFDFVRRRVSVLVEDAQHGDRCLICKGAVEE
MMMVATHLREGDRVVALTETRRELLLAKTEDYNAQGFRVLLIATRKLDGSGNNPTLSVEDETELTIEGMLTFLDPPKESA
GKAIAALRDNGVAVKVLTGDNPVVTARICLEVGIDTHDILTGTQVEAMSDAELASEVEKRAVFARLTPLQKTRILQALQK
NGHTVGFLGDGINDAPALRDADVGISVDSAADIAKESSDIILLEKDLMVLEEGVIKGRETFGNIIKYLNMTASSNFGNVF
SVLVASAFIPFLPMLAIHLLIQNLMYDISQLSLPWDKMDKEFLRKPRKWDAKNIGRFMLWIGPTSSIFDITTFALMWYVF
AANNVEAQALFQSGWFIEGLLSQTLVVHMLRTQKIPFIQSRATLPVLLTTGLIMAIGIYIPFSPLGAMVGLEPLPLSYFP
WLVATLLSYCLVAQGMKRFYIKRFGQWF
>P76459 2.8.3.8~~~atoA~~~Acetate CoA-transferase subunit beta~~~COG2057
MDAKQRIARRVAQELRDGDIVNLGIGLPTMVANYLPEGIHITLQSENGFLGLGPVTTAHPDLVNAGGQPCGVLPGAAMFD
SAMSFALIRGGHIDACVLGGLQVDEEANLANWVVPGKMVPGMGGAMDLVTGSRKVIIAMEHCAKDGSAKILRRCTMPLTA
QHAVHMLVTELAVFRFIDGKMWLTEIADGCDLATVRAKTEARFEVAADLNTQRGDL
>P76461 2.3.1.9~~~atoB~~~Acetyl-CoA acetyltransferase~~~COG0183
MKNCVIVSAVRTAIGSFNGSLASTSAIDLGATVIKAAIERAKIDSQHVDEVIMGNVLQAGLGQNPARQALLKSGLAETVC
GFTVNKVCGSGLKSVALAAQAIQAGQAQSIVAGGMENMSLAPYLLDAKARSGYRLGDGQVYDVILRDGLMCATHGYHMGI
TAENVAKEYGITREMQDELALHSQRKAAAAIESGAFTAEIVPVNVVTRKKTFVFSQDEFPKANSTAEALGALRPAFDKAG
TVTAGNASGINDGAAALVIMEESAALAAGLTPLARIKSYASGGVPPALMGMGPVPATQKALQLAGLQLADIDLIEANEAF
AAQFLAVGKNLGFDSEKVNVNGGAIALGHPIGASGARILVTLLHAMQARDKTLGLATLCIGGGQGIAMVIERLN
>Q06065 ~~~atoC~~~Regulatory protein AtoC~~~COG2204
MTAINRILIVDDEDNVRRMLSTAFALQGFETHCANNGRTALHLFADIHPDVVLMDIRMPEMDGIKALKEMRSHETRTPVI
LMTAYAEVETAVEALRCGAFDYVIKPFDLDELNLIVQRALQLQSMKKEIRHLHQALSTSWQWGHILTNSPAMMDICKDTA
KIALSQASVLISGESGTGKELIARAIHYNSRRAKGPFIKVNCAALPESLLESELFGHEKGAFTGAQTLRQGLFERANEGT
LLLDEIGEMPLVLQAKLLRILQEREFERIGGHQTIKVDIRIIAATNRDLQAMVKEGTFREDLFYRLNVIHLILPPLRDRR
EDISLLANHFLQKFSSENQRDIIDIDPMAMSLLTAWSWPGNIRELSNVIERAVVMNSGPIIFSEDLPPQIRQPVCNAGEV
KTAPVGERNLKEEIKRVEKRIIMEVLEQQEGNRTRTALMLGISRRALMYKLQEYGIDPADV
>P76458 2.8.3.8~~~atoD~~~Acetate CoA-transferase subunit alpha~~~COG1788
MKTKLMTLQDATGFFRDGMTIMVGGFMGIGTPSRLVEALLESGVRDLTLIANDTAFVDTGIGPLIVNGRVRKVIASHIGT
NPETGRRMISGEMDVVLVPQGTLIEQIRCGGAGLGGFLTPTGVGTVVEEGKQTLTLDGKTWLLERPLRADLALIRAHRCD
TLGNLTYQLSARNFNPLIALAADITLVEPDELVETGELQPDHIVTPGAVIDHIIVSQESK
>P76460 ~~~atoE~~~Putative short-chain fatty acid transporter~~~COG2031
MIGRISRFMTRFVSRWLPDPLIFAMLLTLLTFVIALWLTPQTPISMVKMWGDGFWNLLAFGMQMALIIVTGHALASSAPV
KSLLRTAASAAKTPVQGVMLVTFFGSVACVINWGFGLVVGAMFAREVARRVPGSDYPLLIACAYIGFLTWGGGFSGSMPL
LAATPGNPVEHIAGLIPVGDTLFSGFNIFITVALIVVMPFITRMMMPKPSDVVSIDPKLLMEEADFQKQLPKDAPPSERL
EESRILTLIIGALGIAYLAMYFSEHGFNITINTVNLMFMIAGLLLHKTPMAYMRAISAAARSTAGILVQFPFYAGIQLMM
EHSGLGGLITEFFINVANKDTFPVMTFFSSALINFAVPSGGGHWVIQGPFVIPAAQALGADLGKSVMAIAYGEQWMNMAQ
PFWALPALAIAGLGVRDIMGYCITALLFSGVIFVIGLTLF
>Q06067 2.7.13.3~~~atoS~~~Signal transduction histidine-protein kinase AtoS~~~COG3852
MHYMKWIYPRRLRNQMILMAILMVIVPTLTIGYIVETEGRSAVLSEKEKKLSAVVNLLNQALGDRYDLYIDLPREERIRA
LNAELAPITENITHAFPGIGAGYYNKMLDAIITYAPSALYQNNVGVTIAADHPGREVMRTNTPLVYSGRQVRGDILNSML
PIERNGEILGYIWANELTEDIRRQAWKMDVRIIIVLTAGLLISLLLIVLFSRRLSANIDIITDGLSTLAQNIPTRLPQLP
GEMGQISQSVNNLAQALRETRTLNDLIIENAADGVIAIDRQGDVTTMNPAAEVITGYQRHELVGQPYSMLFDNTQFYSPV
LDTLEHGTEHVALEISFPGRDRTIELSVTTSRIHNTHGEMIGALVIFSDLTARKETQRRMAQAERLATLGELMAGVAHEV
RNPLTAIRGYVQILRQQTSDPIHQEYLSVVLKEIDSINKVIQQLLEFSRPRHSQWQQVSLNALVEETLVLVQTAGVQARV
DFISELDNELSPINADRELLKQVLLNILINAVQAISARGKIRIQTWQYSDSQQAISIEDNGCGIDLSLQKKIFDPFFTTK
ASGTGLGLALSQRIINAHQGDIRVASLPGYGATFTLILPINPQGNQTV
>Q81H98 ~~~~~~Immunity protein BC_0921~~~
MKYPYSFEVLTNGKLVMRLPQEIKLMETFLGVEVSAFGDWILEEIHSVLNGKENYVVVNGNICGLEIRKDTTTVLDNLAE
DGKGDFCEIETIELVDLIHIWQDKQKEFKKGKNKELK
>A3M137 ~~~atpB~~~ATP synthase subunit a~~~
MAAEEHALTSTEYIKHHLTNMTYGKMPDGTWKLAETAEEAHSMGFTAIHLDSMGWSIGLGVIFCLLFWIVARAANAGVPT
KFQSAIEMIIEFVDSSVRDTFHGKSRLIAPLALTIFVWIFLMNLMDLIPVDWIPQVAAFVGANVFGMDPHHVYFKIVPST
DPNITLGMSLSVFVLILFYSIREKGVGGFVGELALNPFNPSNPVAKALLIPVNLILELVTFLARPISLALRLFGNMYAGE
LIFILIALLPFWIQWALSVPWAIFHILVITLQAFIFMMLTIVYLSMASEKH
>P09218 ~~~atpB~~~ATP synthase subunit a~~~
MEHKAPLVEFLGLTFNLSDMLMITITCLIVFIIAVAATRSLQLRPTGMQNFMEWVFDFVRGIINSTMDWQTGGRFLTLGV
TLIMYVFVANMLGLPFSVHVNGELWWKSPTADATVTLTLAVMVVALTHYYGVKMKGASDYLRDYTRPVAWLFPLKIIEEF
ANTLTLGLRLFGNIYAGEILLGLLASLGTHYGVLGAVGASQFPIMVWQAFSIFVGTIQAFIFTMLTMVYMAHKVSHDH
>P37813 ~~~atpB~~~ATP synthase subunit a~~~COG0356
MNHGYRTIEFLGLTFNLTNILMITVASVIVLLIAILTTRTLSIRPGKAQNFMEWIVDFVRNIIGSTMDLKTGANFLALGV
TLLMYIFVSNMLGLPFSITIGHELWWKSPTADPAITLTLAVMVVALTHYYGVKMKGLKEYSKDYLRPVPFMLPMKIIEEF
ANTLTLGLRLYGNIFAGEILLGLLAGLATSHYSQSVALGLVGTIGAILPMLAWQAFSLFIGAIQAFIFTMLTMVYMSHKI
SHDH
>P0AB98 ~~~atpB~~~ATP synthase subunit a~~~COG0356
MASENMTPQDYIGHHLNNLQLDLRTFSLVDPQNPPATFWTINIDSMFFSVVLGLLFLVLFRSVAKKATSGVPGKFQTAIE
LVIGFVNGSVKDMYHGKSKLIAPLALTIFVWVFLMNLMDLLPIDLLPYIAEHVLGLPALRVVPSADVNVTLSMALGVFIL
ILFYSIKMKGIGGFTKELTLQPFNHWAFIPVNLILEGVSLLSKPVSLGLRLFGNMYAGELIFILIAGLLPWWSQWILNVP
WAIFHILIITLQAFIFMVLTIVYLSMASEEH
>Q2RFX3 ~~~atpB~~~ATP synthase subunit a~~~COG0356
MTHVRPVEIFHLGPIPIYSTVVNTWIIMILLLAGIFLATRKLSFIPRGAQHVLEMFLEFFYGLLEEIIGKEGRRYLPLVA
TLFIFILSLNLSWFIPGMKPPTMDLSTTAAFAVTTIILVQIFGIRKLGLRGYIRHFFQPAPFLFPLNVIEELVKPVSLSL
RLFGNLFGEEMVVTILFLMIPFLLPTPIMLLGVLMGTIQAFVFTLLTITYIANFVHGH
>P9WPV7 ~~~atpB~~~ATP synthase subunit a~~~COG0356
MTETILAAQIEVGEHHTATWLGMTVNTDTVLSTAIAGLIVIALAFYLRAKVTSTDVPGGVQLFFEAITIQMRNQVESAIG
MRIAPFVLPLAVTIFVFILISNWLAVLPVQYTDKHGHTTELLKSAAADINYVLALALFVFVCYHTAGIWRRGIVGHPIKL
LKGHVTLLAPINLVEEVAKPISLSLRLFGNIFAGGILVALIALFPPYIMWAPNAIWKAFDLFVGAIQAFIFALLTILYFS
QAMELEEEHH
>A1B619 ~~~atpB~~~ATP synthase subunit a~~~COG0356
MAEEEAGGLVFHPMDQFVIKPLFGEGPVNWYTPTNATLWMALAALAITALLVFGTRGRAIVPNRVQSIAELLYGMVHKMV
EDVTGKDGLKYFPYVMTLFCFILFANFLGLLPKSFSPTSHIAVTAVLAVLVFAGVTVLGFVKNGAHFLGLFWVSSAPLAL
RPVLAVIELISYFVRPVSHSIRLAGNIMAGHAVIKVFAAFAAVAAIAPVSVVAITAMYGLEVLVCLIQAYVFTILTCVYL
KDALHPAH
>O05330 ~~~atpB~~~ATP synthase subunit a~~~
MFDGEAIRWFEFFVPTNSTLWMAIGVLMIALLMVVGTLRRAIVPGRIQSLAELTYGFIHKMVEDVAGKDGLVYFPYIFTL
FLFILFSNFLGLIPMAFTPTSHIAVTGVMAMGVFIGVTALGFMKHGSHFLNLFWVSAAPLPLRPILAVIEVISYFVRPVS
HSIRLAGNMMAGHAVMEVFAAFAPLILFSFVGVIVTPLSVLAIVAMYALEILVAFVQAYVFTILTCVYLKDALHPGH
>Q7A4E5 ~~~atpB~~~ATP synthase subunit a~~~
MDHKSPLVSWNLFGFDIVFNLSSILMILVTAFLVFLLAIICTRNLKKRPTGKQNFVEWIFDFVRGIIEGNMAWKKGGQFH
FLAVTLILYIFIANMLGLPFSIVTKDHTLWWKSPTADATVTLTLSTTIILLTHFYGIKMRGTKQYLKGYVQPFWPLAIIN
VFEEFTSTLTLGLRLYGNIFAGEILLTLLAGLFFNEPAWGWIISIPGLIVWQAFSIFVGTIQAYIFIMLSMVYMSHKVAD
EH
>P0A2Y9 ~~~atpB~~~ATP synthase subunit a~~~COG0356
MEESINPIISIGPVIFNLTMLAMTLLIVGVIFVFIYWASRNMTLKPKGKQNVLEYVYDFVIGFTEPNIGSRYMKDYSLFF
LCLFLFMVIANNLGLMTKLQTIDGTNWWSSPTANLQYDLTLSFLVILLTHIESVRRRGFKKSIKSFMSPVFVIPMNILEE
FTNFLSLALRIFGNIFAGEVMTSLLLLLSHQAIYWYPVAFGANLAWTAFSVFISCIQAYVFTLLTSVYLGNKINIEEE
>P0DV91 ~~~~~~Retron Ec78 probable ATPase~~~
MTKQYERKAKGGNLLSAFELYQRNTDNMPGLGEMLVDEWFETCRDYIQDGHVDESGTFRPDNAFYLRRLTLKDFRRFSLL
EIKFEEDLTVIIGNNGKGKTSILYAIAKTLSWFVANILKEGGSGQRLSELTDIKNDAENRYADVSSTFFFGKGLKSVPIR
LSRSALGTAERRDSEVKPARDLADIWRVINEAKTINLPTFALYNVERSQPFNRNTKDNAGRREERFDAYSQALGGAGRFD
HFVEWYIYLHKRTISDISSSIKELEQQVNDLQRSVDGGMVSVKSLLEQMKLKLSEASERNDAAVSSKMVTESVQKSIVEK
SICSVVPSISKIWVEMTTGSDLVKVTNDGHDVTIDQLSDGQRVFLSLVADLARRMVMLNPLLENPLEGRGIVLIDEIELH
LHPKWQQEVILNLRSVFPNIQFIITTHSPIVLSTIEKRCIREFDPNDDGNQSFLDSPDMQTKGSENAQILEQVMNVHPTP
PGIAESHWLGDFELLLLDNSGELDNQSQELYDKIKTHFGIDSAELKKADSLIRINKMKNKINKIRAEKGK
>Q47527 ~~~~~~Retron Ec83 probable ATPase~~~
MEQNLPSRITKLIKKSESGDFASSYQLYKVFGSKEYGVEPDEKMSDYFKELSAKQLEGGQLRVADIHLENYKGFESLIMD
FSMKKNSTILVGNNGCGKSTILDAIQKGLTHLSSRLSTRSHNGDGIEKHELRKGQNYASIAINYDYMGIRFPMIIATTEP
GYEDRAKSNYSGINELGSIFKTAHSINPNVSFPLIAMYTVERANDVSTRDIENSEEIKEAQIWDKFKAYNKSLTGKADFK
LFFRWFKELIEIENSDNADITALRAEIRAKEKDLDNPLLKALLAENKNSETTKKLLEDHQNSLKVLKEKLNSYYSVNSKT
LHTVEDAMYSFLPGFSNLKLQRAPLDLIVDKNNVSLSVLQLSQGEKTILALIADIARRLTLLNPNSVNPLDGTGIVLIDE
IDLHLHPSWQQNIIPRLEKTFKNIQFIVTTHSPQVCHTIDSQNIWLLKNGQKFKAPKGVRGAISSWVLENLFEVAQRPPE
DKYTKLLQEYKNLVFSEKYASEDARKLGATLSQHFGPDDETLVELKLEIEKRIWEDDFEKDQ
>P0DV98 ~~~~~~Retron Vc95 probable ATPase~~~
MNEPTRIPKYVKDKIRQAKQGDLYSSFTVSEYYYDGEVLERDIEQSELYLRNVSRIINNAKIRLRNISLYDYKKFSKLKF
TSSEKNTTIIIGNNGSGKSTILESISKCLQFLSDNIRIQNNNNYKFQDSEINIHSISGQTIVRCILEIENDFSFSCSLTK
NRENISRKVSSELEEFKALARMYQRSNELDNNTLSYPLLAYYPVERSVTLKRDDAVKYYERKKAKYSDKSEGLKNAFDGT
SNFNDFFSWYKEIDDIINEFKANDSITKEEIEYLLSKTDNKEKIGSLISQLLEKKNNYNNNEDREFLIRQQKVIQESIKT
FVSDIDQVKISRTPHLDMTVIKNGSEISIFNLSQGEKTLIALVSDIARRLVILNPSLENPLNGYGIVLIDEIDLHLHPKW
QQTIVQKLENTFPNIQFILSTHSPLVLTTVTSEQIKIINELDYRFKLLSPTSNPFGKNASDALAIMETSESPLVHSEEIL
ALIKKYESLVKRGQEDCRKTKEIKKTHRKHWIYI
>A3M142 7.1.2.2~~~atpA~~~ATP synthase subunit alpha~~~
MQQLNPSEISALIKQRIGDLDTSATAKNEGTIVMVSDGIVRIHGLADAMYGEMIEFDGGLFGMALNLEQDSVGAVVLGNY
LSLQEGQKARCTGRVLEVPVGPELLGRVVDALGNPIDGKGPIDAKLTDAVEKVAPGVIWRQSVDQPVQTGYKSVDTMIPV
GRGQRELIIGDRQTGKTAMAIDAIIAQKNSGIKCVYVAIGQKQSTIANVVRKLEETGAMAYTTVVAAAAADPAAMQYLAP
YSGCTMGEYFRDRGEDALIIYDDLSKQAVAYRQISLLLRRPPGREAYPGDVFYLHSRLLERASRVSAEYVEKFTNGAVTG
KTGSLTALPIIETQAGDVSAFVPTNVISITDGQIFLETSLFNAGIRPAVNAGISVSRVGGSAQTKIIKKLSGGIRTALAQ
YRELAAFAQFASDLDEATRKQLEHGQRVTELMKQKQYAPYSIADQAVSVYASNEGYMADVEVKKIVDFDAALIAYFRSEY
APLMKQIDETGDYNKDIEAAIKAGIESFKATQTY
>P09219 7.1.2.2~~~atpA~~~ATP synthase subunit alpha~~~
MSIRAEEISALIKQQIENYESQIQVSDVGTVIQVGDGIARAHGLDNVMSGEAVEFANAVMGMALNLEENNVGIVILGPYT
GIKEGDEVRRTGRIMEVPVGETLIGRVVNPLGQPVDGLGPVETTETRPIESRAPGVMDRRSVHEPLQTGIKAIDALVPIG
RGQRELIIGDRQTGKTSVAIDTIINQKDQNMICIYVAIGQKESTVATVVETLAKHGAPDYTIVVTASASQPAPLLFLAPY
AGVAMGEYFMIMGKHVLVVIDDLSKQAAAYRQLSLLLRRPPGREAYPGDIFYLHSRLLERAAKLSDAKGGGSLTALPFVE
TQAGDISAYIPTNVISITDGQIFLQSDLFFSGVRPAINAGLSVSRVGGAAQIKAMKKVAGTLRLDLAAYRELEAFAQFGS
DLDKATQANVARGARTVEVLKQDLHQPIPVEKQVLIIYALTRGFLDDIPVEDVRRFEKEFYLWLDQNGQHLLEHIRTTKD
LPNEDDLNQAIEAFKKTFVVSQ
>P37808 7.1.2.2~~~atpA~~~ATP synthase subunit alpha~~~COG0056
MSIKAEEISTLIKQQIQNYQSDIEVQDVGTVIQVGDGIARVHGLDNCMAGELVEFSNGVLGMAQNLEESNVGIVILGPFS
EIREGDEVKRTGRIMEVPVGEELIGRIVNPLGQPVDGLGPILTSKTRPIESPAPGVMDRKSVHEPLQTGIKAIDALIPIG
RGQRELIIGDRQTGKTSVAIDAILNQKDQDMICVYVAIGQKESTVRGVVETLRKHGALDYTIVVTASASQPAPLLYLAPY
AGVTMAEEFMYNGKHVLVVYDDLSKQAAAYRELSLLLRRPPGREAFPGDVFYLHSRLLERAAKLSDAKGAGSITALPFVE
TQAGDISAYIPTNVISITDGQIFLQSDLFFSGVRPAINAGLSVSRVGGSAQIKAMKKVSGTLRLDLASYRELEAFAQFGS
DLDQATQAKLNRGARTVEVLKQDLNKPLPVEKQVAILYALTKGYLDDIPVADIRRFEEEYYMYLDQNHKDLLDGIAKTGN
LPADEDFKAAIEGFKRTFAPSN
>Q83AF7 7.1.2.2~~~atpA~~~ATP synthase subunit alpha~~~COG0056
MSTQLRAAEISDIIESRIEKFGIKAEERTEGTILNIKDGIVRVYGLRDVMFGEMVEFPENTYGLAFNLERDSVGAVVMGP
YEHLEEGMTARCTGRILEVPVGEALLGRVVDGLGKPIDGKGPIDTSETSPIEKVAPGVITRKSVDTSLPTGLKSIDAMVP
IGRGQRELIIGDRQTGKTAIAIDTIINQKHTGVKCIYVAIGQKQSSVAAVVRKLEEHGAMEHTIVVNASASEAAALQYLA
PYAGCTMGEYFRDRGQDALIVYDDLTKQAWAYRQISLLLRRPPGREAYPGDIFYLHSRLLERAAHVNEAYVKEFTKGKVT
GKTGSLTALPIIETQAGDVSAFIPTNVISITDGQIYLDVNLFNAGIRPAINAGLSVSRVGGAAQTKIIKKLIGGLRIALA
QYRELEAFSQFASDLDEATRKQLEHGQRVMEILKQPQYQPLSVGEMAIIWYVVNNNYLDQVELKKVVDFERSLLSFLRDQ
HQDLLDEINKNPNYSEKIIEKIKAVVEEFVKTQSY
>Q72E02 7.1.2.2~~~atpA~~~ATP synthase subunit alpha~~~COG0056
MQIKAEEISKIIEEQIQSYEQRVEMSETGTVLYVGDGIARVHGVQNAMAMELLEFPGGLMGMVLNLEEDNVGVALLGDDT
QIKEGDPVKRTGKIFSVPVGDAVMGRVLNPLGQPIDGLGPLDAKEFRPVELKAPGIIARKSVHEPMPTGIKAIDAMTPIG
RGQRELVIGDRQTGKTAVCIDAILAQKNTDIHCFYVAIGQKKATVALVADTLRKYGAMEYTTIISATASEPAPLQFISAY
SGCTMAEFYRNNGKHALIIYDDLSKQAVAYRQMSLLLRRPPGREAYPGDVFYLHSRLLERAAKVNDSLGAGSLTALPIIE
TQAGDVSAYIPTNVISITDGQVYLEPNLFNAGIRPAINVGLSVSRVGGAAQIKAMKQVAGTMRLDLAQYRELAAFAQFGS
DLDKATKAKLDRGARLVELLKQPQYEPMPTEEQVASMYAATRGLMDDVAVADIRKFETAMLDYLRSGKADILNDIKTKKA
LDQDIENRLKAAIAEFKKGYQA
>P0ABB0 7.1.2.2~~~atpA~~~ATP synthase subunit alpha~~~COG0056
MQLNSTEISELIKQRIAQFNVVSEAHNEGTIVSVSDGVIRIHGLADCMQGEMISLPGNRYAIALNLERDSVGAVVMGPYA
DLAEGMKVKCTGRILEVPVGRGLLGRVVNTLGAPIDGKGPLDHDGFSAVEAIAPGVIERQSVDQPVQTGYKAVDSMIPIG
RGQRELIIGDRQTGKTALAIDAIINQRDSGIKCIYVAIGQKASTISNVVRKLEEHGALANTIVVVATASESAALQYLAPY
AGCAMGEYFRDRGEDALIIYDDLSKQAVAYRQISLLLRRPPGREAFPGDVFYLHSRLLERAARVNAEYVEAFTKGEVKGK
TGSLTALPIIETQAGDVSAFVPTNVISITDGQIFLETNLFNAGIRPAVNPGISVSRVGGAAQTKIMKKLSGGIRTALAQY
RELAAFSQFASDLDDATRKQLDHGQKVTELLKQKQYAPMSVAQQSLVLFAAERGYLADVELSKIGSFEAALLAYVDRDHA
PLMQEINQTGGYNDEIEGKLKGILDSFKATQSW
>Q8RGE0 7.1.2.2~~~atpA~~~ATP synthase subunit alpha~~~COG0056
MNIRPEEVSSIIKKEIDNYKKSLEIKTSGTVLEVGDGIARIFGLSNVMSGELLEFPHGVMGMALNLEEDNVGAVILGNAS
LIKEGDEVRATGKVVSVPAGEDLLGRVINALGDPIDGKGEIHVDKYMPIERKASGIIARQPVSEPLQTGIKSIDGMVPIG
RGQRELIIGDRQTGKTAIAIDTIINQKGQDVKCIYVAIGQKRSTVAQIYKKLSDLGCMDYTIIVAATASEAAPLQYMAPY
SGVAIGEYFMEKGEHVLIIYDDLSKHAVAYREMSLLLRRPPGREAYPGDVFYLHSRLLERAAKLSDELGGGSITALPIIE
TQAGDVSAYIPTNVISITDGQIFLESQLFNSGFRPAINAGISVSRVGGAAQIKAMKQVASKVKLELAQYTELLTFAQFGS
DLDKATKAQLERGHRIMEILKQPQYHPFAVERQVVSFYIVINGHLDDIEVSKVRRFEKELLDYLKANTNILTEIADKKAL
DKDLEEKLKESIANFKKSFN
>Q5KUJ1 7.1.2.2~~~atpA~~~ATP synthase subunit alpha~~~COG0056
MSIRAEEISALIKQQIENYESQIQVSDVGTVIQVGDGIARAHGLDNVMSGELVEFANGVMGMALNLEENNVGIVILGPYT
GIKEGDEVRRTGRIMEVPVGEALIGRVVNPLGQPVDGLGPVETTETRPIESPAPGVMDRRSVHEPLQTGIKAIDALVPIG
RGQRELIIGDRQTGKTSVAIDTIINQKDQNMICIYVAIGQKESTVRTVVETLRKHGALDYTIVVTASASQPAPLLFLAPY
AGVAMGEYFMYKGQHVLVVYDDLSKQAAAYRELSLLLRRPPGREAYPGDIFYLHSRLLERAAKLSDAKGGGSLTALPFVE
TQAGDISAYIPTNVISITDGQIFLQSDLFFSGVRPAINAGLSVSRVGGAAQIKAMKKVAGTLRLDLAAYRELEAFAQFGS
DLDKATQAKLARGARTVEVLKQDLHQPIPVEKQVLIIYALTRGFLDDIPVEDVRRFEKEFYLWLDQNGQHLLEHIRTTKD
LPNEDDLNKAIEAFKKTFVVSQ
>Q5FKY2 7.1.2.2~~~atpA~~~ATP synthase subunit alpha~~~COG0056
MSIKAEEISSLIKQQLEHYDDKLDINEVGVVTYVGDGIARAHGLDDVLSGELLKFDNGSFGIAQNLESNDVGIIILGQFD
NIREGDRVQRTGRIMEVPVGDALIGRVVNPLGQPVDGLGEIKSDKTRPIEAKAPGVMDRQSVNQPLQTGIKAIDALVPIG
RGQRELIIGDRKTGKTSLAIDTILNQKGQDVICIYVAIGQKESTVRTQVETLKRFGAMDYTIVVEAGPSEPAPMLYIAPY
AGTAMGEEFMYNGKDVLIVFDDLSKQAVAYRELSLLLRRPPGREAYPGDVFYLHSRLLERSAKLSDKLGGGSLTALPIIQ
TEAGDISAYIPTNVISITDGQIFLQSDLFFAGTRPAIDAGNSVSRVGGNAQIKAMKKVAGTLRTDLTAYRELESFAQFGS
DLDQATQAKLNRGQRTVEVLKQPLHDPIPVEKQVLILYALTHGYLDAIPVEDISRFQNELFDNFDSSHADLLKTIRETGK
LPDDKELSAAIEEFSESFTPSEK
>Q2RFX7 7.1.2.2~~~atpA~~~ATP synthase subunit alpha~~~COG0056
MSIRPDEITSILKNQIEQYQLEVEMAEVGTVTQVGDGIARIYGLDRAMAGELLEFPGDIYGMVLNLEEDNVGAVILGPYT
HIKEGDQVKRTGRIVEVPVGEALIGRVVNAMGQPIDGKGPIQTDKFRPVESPAPGVVYRQPVNTPLQTGLKAIDSMVPIG
RGQRELIIGDRQTGKTAIAVDTIINQKGQNVICIYVAIGQKASTVAGVVQRLEEAGAMEYTIVVMATASEPAPMLYIAPY
AGCTMGEYFMYEQHRDVLCVYDDLSKHAAAYRELSLLLRRPPGREAYPGDVFYLHSRLLERAARLNDSLGGGSLTALPVI
ETQAGDVSAYIPTNVISITDGQIFLESDLFYAGQRPAINVGLSVSRVGGAAQIKAMKQVAGRLRLDLAQYRELAAFAQFG
SDLDKATQARLARGERMMEILKQDQYQPMPVEEQVVVLYAAVNGFLDDLPVARVRAFEKDFLRFLRNERPEVLAGIREKR
QLDDNLQEQLKKSIEDFKGSFTAAGES
>A0R202 7.1.2.2~~~atpA~~~ATP synthase subunit alpha~~~COG0056
MAELTISAADIEGAIEDYVSSFSADTEREEIGTVIDAGDGIAHVEGLPSVMTQELLEFPGGVLGVALNLDEHSVGAVILG
EFEKIEEGQQVKRTGEVLSVPVGDAFLGRVVNPLGQPIDGQGDIAAETRRALELQAPSVVQRQSVSEPLQTGIKAIDAMT
PIGRGQRQLIIGDRKTGKTAVCVDTILNQREAWLTGDPKQQVRCVYVAIGQKGTTIASVKRALEEGGAMEYTTIVAAPAS
DAAGFKWLAPYTGSAIGQHWMYNGKHVLIVFDDLSKQADAYRAISLLLRRPPGREAFPGDVFYLHSRLLERCAKLSDELG
GGSMTGLPIIETKANDISAFIPTNVISITDGQCFLESDLFNQGVRPAINVGVSVSRVGGAAQIKAMKEVAGSLRLDLSQY
RELEAFAAFASDLDAASKAQLDRGARLVELLKQPQYSPLAVEEQVVAIFLGTQGHLDSVPVEDVQRFESELLEHVKASHS
DIFDGIRETKKLSEEAEEKLVSVINEFKKGFQASDGSSVVVSENAEALDPEDLEKESVKVRKPAPKKA
>P9WPU7 7.1.2.2~~~atpA~~~ATP synthase subunit alpha~~~COG0056
MAELTIPADDIQSAIEEYVSSFTADTSREEVGTVVDAGDGIAHVEGLPSVMTQELLEFPGGILGVALNLDEHSVGAVILG
DFENIEEGQQVKRTGEVLSVPVGDGFLGRVVNPLGQPIDGRGDVDSDTRRALELQAPSVVHRQGVKEPLQTGIKAIDAMT
PIGRGQRQLIIGDRKTGKTAVCVDTILNQRQNWESGDPKKQVRCVYVAIGQKGTTIAAVRRTLEEGGAMDYTTIVAAAAS
ESAGFKWLAPYTGSAIAQHWMYEGKHVLIIFDDLTKQAEAYRAISLLLRRPPGREAYPGDVFYLHSRLLERCAKLSDDLG
GGSLTGLPIIETKANDISAYIPTNVISITDGQCFLETDLFNQGVRPAINVGVSVSRVGGAAQIKAMKEVAGSLRLDLSQY
RELEAFAAFASDLDAASKAQLERGARLVELLKQPQSQPMPVEEQVVSIFLGTGGHLDSVPVEDVRRFETELLDHMRASEE
EILTEIRDSQKLTEEAADKLTEVIKNFKKGFAATGGGSVVPDEHVEALDEDKLAKEAVKVKKPAPKKKK
>Q07405 7.1.2.2~~~atpA~~~ATP synthase subunit alpha~~~
MEIRADEISRIIREQIKDYGKKVTVAETGTVLSVGDGIARIYGLEGALAGELVEFANGVQGLVLNLEEDNVGVAIMGDFQ
AIREGDTVKRTQQIASVPVGKELLGRVVDPLGKPLDGKGPIAATETRRLEVKAPGIVSRKSVHEPLQTGIKALDALVPVG
RGQRELIIGDRQTGKTAVAIDTIINQKGLNVYCIYVAIGQKQSTVAQVVEKLNRYGAMEYTTVVASNASDPAPMQFFAPY
AGVAMGEYFRDNKMHALIVYDDLSKQAVAYRQLSLLLRRPPGREAYPGDVFYVHSRLLERAAKLSDEEGAGSLTALPIIE
TQAGDVSAYIPTNVISITDGQIFLETDLFFAGVRPAINVGLSVSRVGSAAQIKAMKQVAGTMKLELAQYRELAAFAQFGS
DLDKATQETLARGARMVELLKQGQYEPMPVEKQVMQIYAATNRDDPKKRGWIRDIPTADVVRWMREFLEFADGKHPNVAK
DLASKRELTADIKTALSKAITEFNEVFQPTPGAKV
>A1B8N8 7.1.2.2~~~atpA~~~ATP synthase subunit alpha~~~COG0056
MGIQAAEISAILKDQIKNFGQDAEVAEVGQVLSVGDGIARVYGLDKVQAGEMVEFPGGIRGMVLNLETDNVGVVIFGDDR
DIKEGDTVKRTGAIVEVPAGKELLGRVVDALGNPIDGKGPLNASERRIADVKAPGIMPRKSVHEPMATGLKSVDAMIPVG
RGQRELIIGDRQTGKTAIALDTILNQANYNGREADGMKTLHCIYVAVGQKRSTVAQLVKKLEETGAMAYTTVVAATASDP
APMQYLAPYSATAMGEYFRDNGMDALIIYDDLSKQAVAYRQMSLLLRRPPGREAYPGDVFYLHSRLLERSAKLNEANGAG
SLTALPIIETQAGDVSAYIPTNVISITDGQIFLETELFFQGIRPAVNTGLSVSRVGSAAQTKAMKSVAGPVKLELAQYRE
MAAFAQFGSDLDAATQKLLNRGARLTELMKQPQYSPLTNAEIVIVIYAGTKGYLDGIPVRDVTKWEHGLLQYLRNQKADL
LEDMTKNDRKVAGELEDAIKAALDGYAKTYA
>P29706 7.2.2.1~~~atpA~~~ATP synthase subunit alpha, sodium ion specific~~~
MKIRPEEISGIIKTEIENYKKSLDVKTSGSVVQVGDGIARIYGLSNAKAGELLEFPNGITGMALNLEENNVGAVILGDPT
GVKEGDEVRATGQIAAVGAGEALLGRVVNSLGEPIDGKGELKTEKMMPLDRKAYGIISRKPVHEPLQTGIKSIDGMVPIG
RGQRELIIGDRQTGKTAVALDAIINQKDTGVKCIYVAIGQKRSTVAQIVKRLEDAGALEYTIVVAATASESAPLQYMAPY
TGVSMGEYFMDKGEHVLIVYDDLSKHAVAYREMSLLLKRPPGREAFPGDVFYLHSRLLERAAKLSDEIGAGSITALPIIE
TQAGDVSAYIPTNVISITDGQIFLDSQLFNSGFRPAINAGISVSRVGGAAQIKAMKQVAAQVKLELAQYNELLTFAQFGS
DLDKATLAQLERGHRIMEILKQEQYKPFVVEEQVVSFFTVINGYLDDIAIDQVRRFEKELLEELKDNTTILAEIVEKKAI
KEDLDAKLRKAIEDFKKKFS
>P72245 7.1.2.2~~~atpA~~~ATP synthase subunit alpha~~~
MGIQAAEISAILKEQIKNFGQDAQVAEVGRVLSVGDGIARVHGLDNVQAGEMVEFPGGIRGMALNLEVDNVGIVIFGSDR
DIKEGDTVKRTNAIVDVPAGEGLLGRVVDGLGNPIDGKGPIVAKERRIADVKAPGIIPRKSVHEPMATGLKSVDAMIPIG
RGQRELIIGDRQTGKTAIALDTILNQKSYNDANPGNKLHCFYVAIGQKRSTVAQLVKKLEEAGAMEYTTVVAATASDPAP
MQFLAPYSATAMAEYFRDNGMHALIIYDDLSKQAVAYRQMSLLLRRPPGREAYPGDVFYLHSRLLERSAKLNEDFGSGSL
TALPVIETQGGDVSAFIPTNVISITDGQIFLETELFYQGIRPAVNTGLSVSRVGSSAQTNSMKSVAGPVKLELAQYREMA
AFAQFGSDLDAATQKLLNRGARLTELMKQPQYSPLTNAEIVAVIFAGTNGFLDAVPVKEVGRFEKGLLAYLRSTRKDVLE
WLTKEDPKIKGDAEKKLKDAIAEFAKTFA
>P05036 7.1.2.2~~~atpA~~~ATP synthase subunit alpha~~~
MEIRAAEISAILKEQIANFGTEAESAEVGQVLSVGDGIARVYGLDNVQAGEMVEFANGVKGMALNLESDNVGIVIFGEDR
GIKEGDVVKRTQTIVDVPVGKGLLGRVVDGLGNPIDGKGDLVDVERKRAEVKAPGIIPRKSVHEPVQTGIKAIDSLIPIG
RGQRELIIGDRQTGKTAVILDTILNQKAVNDKAKDDSEKLFCVYVAVGQKRSTVAQVVKVLADHGALDYTIVVAATASEP
APLQFLAPYTGCTMGEFFRDNGMHAVIFYDDLTKQAVAYRQMSLLLRRPPGREAFPGDVFYLHSRLLERAAKLNDDNGAG
SLTALPVIETQANDVSAYIPTNVISITDGQIFLETDLFFKGIRPAVNVGLSVSRVGSSAQIKAMKQVAGSIKLELAQYRE
MAAFAQFASDLDPATQKLLARGARLTELLKQAQYSPLAVEEQVCVIYAGTRGYLDKLKTTDVVRYEASLLGALRTSGADL
LESIRTGKALSKEIEQKLVKFLDDFGKKFA
>Q0SYU2 7.1.2.2~~~atpA~~~ATP synthase subunit alpha~~~
MQLNSTEISELIKQRIAQFNVVSEAHNEGTIVSVSDGVIRIHGLADCMQGEMISLPGNRYAIALNLERDSVGAVVMGPYA
DLAEGMKVKCTGRILEVPVGRGLLGRVVNTLGAPIDGKGPLDHDGFSAVEAIAPGVIERQSVDQPVQTGYKAVDSMIPIG
RGQRELIIGDRQTGKTALAIDAIINQRDSGIKCIYVAIGQKASTISNVVRKLEEHGALANTIVVVATASESAALQYLAPY
AGCAMGEYFRDRGEDALIIYDDLSKQAVAYRQISLLLRRPPGREAFPGDVFYLHSRLLERAARVNAEYVEAFTKGEVKGK
TGSLTALPIIETQAGDVSAFVPTNVISITDGQIFLETNLFNAGIRPAVNPGISVSRVGGAAQTKIMKKLSGGIRTALAQY
RELAAFSQFASDLDDATRKQLDHGQKVTELLKQKQYAPMSVAQQSLVLFAAERGYLADVELSKIGSFEAALLAYVDRDHA
PLMQEINQTGGYNDEIEGKLKGILDSFKATQSW
>P99111 7.1.2.2~~~atpA~~~ATP synthase subunit alpha~~~
MAIKAEEISALLRSQIENYESEMSVTDVGTVLQIGDGIALIHGLNDVMAGELVEFHNGVLGLAQNLEESNVGVVILGPYT
GITEGDEVKRTGRIMEVPVGEELIGRVVNPLGQPIDGQGPINTTKTRPVEKKATGVMDRKSVDEPLQTGIKAIDALVPIG
RGQRELIIGDRQTGKTTIAIDTILNQKDQGTICIYVAIGQKDSTVRANVEKLRQAGALDYTIVVAASASEPSPLLYIAPY
SGVTMGEEFMFNGKHVLIVYDDLTKQAAAYRELSLLLRRPPGREAYPGDVFYLHSRLLERAAKLNDDLGGGSITALPIIE
TQAGDISAYVPTNVISITDGQIFLQSDLFFSGVRPAINAGQSVSRVGGSAQIKAMKKVAGTLRLDLASYRELESFAQFGS
DLDEFTASKLERGKRTVEVLKQDQNKPLPVEHQVLIIYALTKGYLDDIPVVDITRFEDELNHWAESNATELLNEIRETGG
LPDAEKFDTAINEFKKSFSKSE
>P50001 7.1.2.2~~~atpA~~~ATP synthase subunit alpha~~~
MAELTIRPEEIRDALENFVQSYKPDAASREEVGTVTLAGDGIAKVEGLPSAMANELLKFEDGTLGLALNLEEREIGCVVL
GEFSGIEEGQPVSRTGEVLSVAVAEGYLGRVVDPLGNPIDGLGEIETSGRRALELQAPTVMQRKSVHEPMETGYKAVDAM
TPIGRGQRQLIIGDRQTGKTALAVDTIINQRDNWRTGDPNKQVRCIYVAIGQKGSTIASVRGALEENGALEYTTIVAAPA
SDPAGFKYLAPYTGSAIGQQWMYEGKHVLIIFDDLSKQADAYRAVSLLLRRPPGREAYPGDVFYLHSRLLERCAKLSDAE
GAGSMTGLPIVETKANDVSAFIPTNVISITDGQCFLESDLFNAGQRPALNVGISVSRVGGSAQHKAMKQVSGRLRVDLAQ
FRELEAFAAFGSDLDAASKSQLERGQRMVELLKQNQYQPMSTEDQVVSVWAGTTGKMDEVPVADIRRFEKELLEYLHRQE
QGLMTSIREGGKMSDDTLQAVAEAIAAFKKQFETSDGKLLGEDAPSAAK
>Q7CRB1 7.1.2.2~~~atpA~~~ATP synthase subunit alpha~~~COG0056
MAINAQEISALIKQQIENFKPNFDVTETGVVTYIGDGIARAHGLENVMSGELLNFENGSYGMAQNLESTDVGIIILGDFT
DIREGDTIRRTGKIMEVPVGESLIGRVVDPLGRPVDGLGEIHTDKTRPVEAPAPGVMQRKSVSEPLQTGLKAIDALVPIG
RGQRELIIGDRQTGKTTIAIDTILNQKDQDMICIYVAIGQKESTVRTQVETLRQYGALDYTIVVTASASQPSPLLFLAPY
AGVAMAEEFMYQGKHVLIVYDDLSKQAVAYRELSLLLRRPPGREAFPGDVFYLHSRLLERSAKVSDELGGGSITALPFIE
TQAGDISAYIATNVISITDGQIFLGDGLFNAGIRPAIDAGSSVSRVGGSAQIKAMKKVAGTLRIDLASYRELEAFTKFGS
DLDAATQAKLNRGRRTVEVLKQPVHKPLPVEKQVTILYALTHGFLDTVPVDDIVRFEEEFHAFFDAQHPEILETIRDTKD
LPEEAVLDAAITEFLNQSSFQ
>Q05372 7.1.2.2~~~atpA~~~ATP synthase subunit alpha~~~
MVSIRPDEISSIIRQQIEQYEQSINVDNVGTVLQVGDGIARVYGLDKVMASELVEFEDGTVGIALNLEEDNVGVVLMGAG
LGIEEGSTVRATGKIASVPVGEAVIGRVVDALMRPIDGKGEIHATATRLLESPAPGIVQRKSVCEPLQTGITAIDAMIPI
GRGQRELIIGDRQTGKTAVAIDTILNQKGQDVICVYVAIGQKASSVAQVVNVLRERGALDYTIVVAANASDPAALQYLAP
YTGASIAEYFMYQGKHTLVIYDDLSKQAQAYRQMSLLLRRPPGREAYPGDVFYLHSRLLERAAKLNDALGGGSMTALPIV
ETQAGDVSAYIPTNVISITDGQIFLSSDLFNAGLRPAINAGISVSRVGSAAQIKAMKQVAGKLKLELAQFDELQAFAQFA
SDLDKATQNQLARGQRLREILKQPQYSPIPVEYQVATIYAGTNGYLDDIPVEAVAKFVAGLRDYLATSKPQYGEAVRSSQ
KLDETAEALLKEAIAEYKAGFTA
>Q9X1U7 7.1.2.2~~~atpA~~~ATP synthase subunit alpha~~~COG0056
MRINPGEITKVLEEKIKSFEEKIDLEDTGKVIQVGDGIARAYGLNKVMVSELVEFVETGVKGVAFNLEEDNVGIIILGEY
KDIKEGHTVRRLKRIIEVPVGEELLGRVVNPLGEPLDGKGPINAKNFRPIEIKAPGVIYRKPVDTPLQTGIKAIDSMIPI
GRGQRELIIGDRQTGKTAIAIDTIINQKGQGVYCIYVAIGQKKSAIARIIDKLRQYGAMEYTTVVVASASDPASLQYIAP
YAGCAMGEYFAYSGRDALVVYDDLSKHAVAYRQLSLLMRRPPGREAYPGDIFYLHSRLLERAVRLNDKLGGGSLTALPIV
ETQANDISAYIPTNVISITDGQIYLEPGLFYAGQRPAINVGLSVSRVGGSAQIKAMKQVAGMLRIDLAQYRELETFAQFA
TELDPATRAQIIRGQRLMELLKQEQYSPMPVEEQVVVLFAGVRGYLDDLPVEEVRRFEKEFLRFMHEKHQDILDDIKTKK
ELTSETEEKLKKAIEEFKTTFRV
>Q8DLP3 7.1.2.2~~~atpA~~~ATP synthase subunit alpha~~~COG0056
MVSIRPDEISSIIRQQIEQYEQSIKVDNVGTVLQVGDGIARVYGLDKVMASELVEFEDGTVGIALNLEEDNVGVVLMGDG
LSIEEGSTVRATGKIASIPVGEAAIGRVVDALMRPIDGKGEIHTTQSRLIESPAPGIVQRKSVCEPLQTGITAIDAMIPI
GRGQRELIIGDRQTGKTAVAIDTILNQKGQDVICVYVAIGQKASSVAQVVNVLRERGALDYTIVIAANASDPAALQYLAP
YTGATVAEYFMYQGKHTLVVYDDLSKQAQAYRQMSLLLRRPPGREAYPGDVFYLHSRLLERAAKLNDALGGGSMTALPVV
ETQAGDVSAYIPTNVISITDGQIFLSSDLFNAGLRPAINAGISVSRVGSAAQIKAMKQVAGKLKLELAQFDELQAFAQFA
SDLDKATQNQLARGQRLREILKQPQYSPIPVEYQVATIYAGTNGYLDDIPVEAVAKFVAGLRDYLRTNKPEYGEIIRTTQ
KLDEKAEALLKEAIAEYKATFTA
>P12985 7.1.2.2~~~atpA~~~ATP synthase subunit alpha~~~COG0056
MQLNSTEISDLIKQRIESFEVVSEARNEGTIVSVSDGIIRIHGLADVMQGEMIELPGGRYALALNLERDSVGAVVMGPYA
DLKEGMKVTGTGRILEVPVGPELLGRVVNTLGEPIDGKGPIEAKMTSPVEVIAPGVIDRKSVDQPVQTGYKSVDSMIPIG
RGQRELIIGDRQIGKTALAIDAIINQKDSGIFSIYVAIGQKASTIANVVRKLEEHGALQNTIVVVASASESAALQYLAPY
AGCAMGEYFRDRGEDALIVYDDLSKQAVAYRQISLLLKRPPGREAFPGDVFYLHSRLLERAARVNEEYVERFTNGEVKGK
TGSLTALPIIETQAGDVSAFVPTNVISITDGQIFLQTELFNAGVRPAVDPGISVSRVGGSAQTKIIKKLSGGIRTALAAY
RELAAFAQFSSDLDEATKKQLDHGQKVTELMKQKQYAPMSVFDQALTIFAAERGYLDDVELNKVLDFEAALLSYARGQYA
ELAAEIDKSGAYNDEIEAQLKKLTDDFKATQTW
>Q2STE9 7.1.2.2~~~atpD1~~~ATP synthase subunit beta 1~~~
MSTAALVEGKIVQCIGAVIDVEFPRESMPKIYDALILEGSELTLEVQQQLGDGVVRTICLGASDGLRRGVVVKNTGNPIS
VPVGKPTLGRIMDVLGRPIDEAGPIESENKRSIHQKAPAFDELSPSTELLETGIKVIDLICPFAKGGKVGLFGGAGVGKT
VNMMELINNIAKEHGGYSVFAGVGERTREGNDFYHEMKDSNVLDKVALVYGQMNEPPGNRLRVALTGLTMAEHFRDEGLD
VLFFVDNIYRFTLAGTEVSALLGRMPSAVGYQPTLAEEMGKLQERITSTKKGSITSVQAVYVPADDLTDPSPATTFGHLD
ATVVLSRDIASLGIYPAVDPLDSTSRQIDPNVIGEEHYSITRRVQQTLQRYKELRDIIAILGMDELSPEDKLSVARARKI
QRFLSQPFHVAEVFTGSPGKYVPLKETIRGFKMIVDGECDHLPEQAFYMVGTIDEAFEKAKKIQ
>P50002 7.2.2.1~~~atpD~~~ATP synthase subunit beta, sodium ion specific~~~COG0055
MAQNIGKVVQVIGPVVDVKFQKDKLPKLNNAVNIELNGHTLVIEVAQQLGDDIVRCIAMDSTDGLMRNQEAVDTGSAIQV
PVGKATLGRMFNVLGEPIDGKPFDTKDVVMHPIHRHPPSFEEQQTQPEMFETGIKVVDLICPYVRGGKIGLFGGAGVGKT
VLIQELINNIATQHGGLSVFAGVGERTREGNDLYYEMMESGVINKTALCFGQMNEPPGARMRIALAGLTMAEYFRDDEGQ
DVLLFIDNIFRFTQAGSEVSALLGRMPSAVGYQPTLATEMGALQERITSTSKGSITSVQAVYVPADDLTDPAPATTFAHL
DATTVLSRAITEKGIYPAVDPLDSTSRILDPKIVGQEHYETAREVQEILQRYKELQDIIAILGMDELSDADKITVSRARK
VERFLSQPFNVAEQFTGTAGVYVTIGDTIKGFKEILEGQHDDLPESAFLLVGTIEDAVVKAKKIKG
>A3M144 7.1.2.2~~~atpD~~~ATP synthase subunit beta~~~
MSSGRIIQIIGAVIDVEFERTSVPKIYDALQVDGTETTLEVQQQLGDGVVRTIAMGSTEGLKRGLTVTSTNAPISVPVGT
ATLGRIMDVLGRPIDEAGPVATEERLPIHRQAPSYAEQAASTDLLETGIKVIDLLCPFAKGGKVGLFGGAGVGKTVNMME
LINNIAKAHSGLSVFAGVGERTREGNDFYHEMKDSNVLDKVAMVYGQMNEPPGNRLRVALTGLTMAEYFRDEKDENGKGR
DVLLFVDNIYRYTLAGTEVSALLGRMPSAVGYQPTLAEEMGVLQERITSTKSGSITSIQAVYVPADDLTDPSPATTFAHL
DATVVLSRDIASSGIYPAIDPLDSTSRQLDPLVVGQEHYEIARAVQNVLQRYKELKDIIAILGMDELAEEDKLVVYRARK
IQRFFSQPFHVAEVFTGAPGKLVPLKETIRGFKGLLAGEYDHIPEQAFYMVGGIDEVIAKAEKL
>P07677 7.1.2.2~~~atpD~~~ATP synthase subunit beta~~~
MTRGRVIQVMGPVVDVKFENGHLPAIYNALKIQHKARNENEVDIDLTLEVALHLGDDTVRTIAMASTDGLIRGMEVIDTG
APISVPVGQVTLGRVFNVLGEPIDLEGDIPADARRDPIHRPAPKFEELATEVEILETGIKVVDLLAPYIKGGKIGLFGGA
GVGKTVLIQELIHNIAQEHGGISVFAGVGERTREGNDLYHEMKDSGVISKTAMVFGQMNEPPGARMRVALTGLTMAEYFR
DEQGQDGLLFIDNIFRFTQAGSEVSALLGRMPSAIGYQPTLATEMGQLQERITSTAKGSITSIQAIYVPADDYTDPAPAT
TFSHLDATTNLERKLAEMGIYPAVDPLVSTSRALAPEIVGEEHYQVARKVQQTLERYKELQDIIAILGMDELSDEDKLVV
HRARRIQFFLSQNFHVAEQFTGQPGSYVPVKETVRGFKEILEGKYDHLPEDRFRLVGRIEEVVEKAKAMGVEV
>P37809 7.1.2.2~~~atpD~~~ATP synthase subunit beta~~~COG0055
MKKGRVSQVLGPVVDVRFEDGHLPEIYNAIKISQPAASENEVGIDLTLEVALHLGDDTVRTIAMASTDGVQRGMEAVDTG
APISVPVGDVTLGRVFNVLGENIDLNEPVPADAKKDPIHRQAPSFDQLSTEVEILETGIKVVDLLAPYIKGGKIGLFGGA
GVGKTVLIQELINNIAQEHGGISVFAGVGERTREGNDLFYEMSDSGVINKTAMVFGQMNEPPGARMRVALTGLTMAEHFR
DVQGQDVLFFIDNIFRFTQAGSEVSALLGRMPSAVGYQPTLATEMGQLQERITSTNVGSVTSIQAIYVPADDYTDPAPAT
TFAHLDATTNLERKLTEMGIYPAVDPLASTSRALAPEIVGEEHYAVAREVQSTLQRYKELQDIIAILGMDELGEEDKLVV
HRARRIQFFLSQNFHVAEQFTGQKGSYVPVKETVQGFKEILAGKYDHLPEDAFRLVGRIEEVVEKAKEMGVEV
>P0ABB4 7.1.2.2~~~atpD~~~ATP synthase subunit beta~~~COG0055
MATGKIVQVIGAVVDVEFPQDAVPRVYDALEVQNGNERLVLEVQQQLGGGIVRTIAMGSSDGLRRGLDVKDLEHPIEVPV
GKATLGRIMNVLGEPVDMKGEIGEEERWAIHRAAPSYEELSNSQELLETGIKVIDLMCPFAKGGKVGLFGGAGVGKTVNM
MELIRNIAIEHSGYSVFAGVGERTREGNDFYHEMTDSNVIDKVSLVYGQMNEPPGNRLRVALTGLTMAEKFRDEGRDVLL
FVDNIYRYTLAGTEVSALLGRMPSAVGYQPTLAEEMGVLQERITSTKTGSITSVQAVYVPADDLTDPSPATTFAHLDATV
VLSRQIASLGIYPAVDPLDSTSRQLDPLVVGQEHYDTARGVQSILQRYQELKDIIAILGMDELSEEDKLVVARARKIQRF
LSQPFFVAEVFTGSPGKYVSLKDTIRGFKGIMEGEYDHLPEQAFYMVGSIEEAVEKAKKL
>Q8RGE2 7.1.2.2~~~atpD~~~ATP synthase subunit beta~~~COG0055
MNKGTITQIISAVVDIAFKDELPAIYNALKVKLEDKELVLEVEQHLGNNVVRTVAMDSTDGLKRGMEVIDTGKPITIPVG
KAVLGRILNVLGEPVDNQGPLNAETFLPIHREAPEFDDLETETEIFETGIKVIDLLAPYIKGGKIGLFGGAGVGKTVLIM
ELINNIAKGHGGISVFAGVGERTREGRDLYGEMTESGVITKTALVYGQMNEPPGARLRVALTGLTVAENFRDKDGQDVLL
FIDNIFRFTQAGSEVSALLGRIPSAVGYQPNLATEMGALQERITSTKSGSITSVQAVYVPADDLTDPAPATTFSHLDATT
VLSRNIASLGIYPAVDPLDSTSKALSEDVVGKEHYEVARKVQEVLQRYKELQDIIAILGMDELSDEDKLTVSRARKIERF
FSQPFSVAEQFTGMEGKYVPVKETIRGFREILEGKHDDIPEQAFLYVGTIEEAVAKSKDLAK
>Q5KUJ3 7.1.2.2~~~atpD~~~ATP synthase subunit beta~~~COG0055
MTRGRVIQVMGPVVDVKFENGHLPAIYNALKIQHKARNENEVDIDLTLEVALHLGDDTVRTIAMASTDGLIRGMEVIDTG
APISVPVGEVTLGRVFNVLGEPIDLEGDIPADARRDPIHRPAPKFEELATEVEILETGIKVVDLLAPYIKGGKIGLFGGA
GVGKTVLIQELIHNIAQEHGGISVFAGVGERTREGNDLYHEMKDSGVISKTAMVFGQMNEPPGARMRVALTGLTMAEYFR
DEQGQDVLLFIDNIFRFTQAGSEVSALLGRMPSAVGYQPTLATEMGQLQERITSTAKGSITSIQAIYVPADDYTDPAPAT
TFSHLDATTNLERKLAEMGIYPAVDPLASTSRALAPEIVGEEHYQVARKVQQTLQRYKELQDIIAILGMDELSDEDKLVV
HRARRIQFFLSQNFHVAEQFTGQPGSYVPVKETVRGFKEILEGKYDHLPEDAFRLVGRIEEVVEKAKAMGVEV
>Q5FKY0 7.1.2.2~~~atpD~~~ATP synthase subunit beta~~~COG0055
MSEGEIVQVIGPVVDVKFPIDKNLPDINNALRVIKSEDESIVLEVTLELGDGVLRTIAMESTDGLRRGMKVEDTGAPISV
PVGEDTLGRVFNVLGQPIDGGPAFPKDHPREGIHKEAPKYEDLTTSREILETGIKVIDLLEPYVRGGKVGLFGGAGVGKT
TIIQELIHNIAQEHGGISVFTGVGERTREGNDLYFEMKASGVLSKTAMVFGQMNEPPGARMRVALTGLTLAEYFRDVEGQ
DVLLFIDNIFRFTQAGSEVSALLGRMPSAVGYQPTLATEMGQLQERITSTKKGSITSIQAVYVPADDYTDPAPSTTFAYL
DATTNLERSLVEQGIYPAVDPLESSSSALDPEVVGQEHYEVATRVQHVLQRYHELQDIISVLGMDELSDEEKLIVARARK
VQFFLSQNFFVAEQFTGVPGSYVPIKETIKGFKLILDGHLDDLPEDAFRGVGPIEDVLKKAQEMGVTPSDPEAKALLEK
>Q2RFX9 7.1.2.2~~~atpD~~~ATP synthase subunit beta~~~COG0055
MNEGQVVQVIGPVVDVEFASDRLPDLYNAITIKTDKINITMEAMQHLGNNTVRCVALSSTDGLQRGMKAVDTGQPITVPV
GRATLGRLFNVLGEPIDNQGPVETTERLPIHRPAPSFEEQQPSTEVLETGIKVVDLLAPYAKGGKIGLFGGAGVGKTVLI
MELIRNIAYEHGGFSVFSGVGERTREGNDLYLEMKESGVLEKTALVFGQMNEPPGARLRVGLTGLTMAEYFRDAEGQDVL
LFIDNIFRFVQAGSEVSALLGRMPSAVGYQPTLATEMGALQERITSTKKGSITSVQAIYVPADDLTDPAPATTFAHLDAT
TVLSRQIAELGIYPAVDPLDSTSRILDPRVLGEEHYQVARGVQQVLQRYKELQDIIAILGMDELSEEDKLIVARARKIQR
FLSQPFHVAEAFTGQPGVYVPLKETIRGFKEILEGRHDNLPEQAFYMVGTIDEAVKKGQELM
>A0R200 7.1.2.2~~~atpD~~~ATP synthase subunit beta~~~COG0055
MTATAEKTAGRVVRITGPVVDVEFPRGSVPELFNALHAEITFGALAKTLTLEVAQHLGDSLVRCISMQPTDGLVRGVEVT
DTGASISVPVGDGVKGHVFNALGDCLDDPGYGKDFEHWSIHRKPPAFSDLEPRTEMLETGLKVVDLLTPYVRGGKIALFG
GAGVGKTVLIQEMINRIARNFGGTSVFAGVGERTREGNDLWVELADANVLKDTALVFGQMDEPPGTRMRVALSALTMAEF
FRDEQGQDVLLFIDNIFRFTQAGSEVSTLLGRMPSAVGYQPTLADEMGELQERITSTRGRSITSMQAVYVPADDYTDPAP
ATTFAHLDATTELSRAVFSKGIFPAVDPLASSSTILDPAIVGDEHYRVAQEVIRILQRYKDLQDIIAILGIDELSEEDKQ
LVNRARRIERFLSQNMMAAEQFTGQPGSTVPLKETIEAFDKLTKGEFDHLPEQAFFLIGGLDDLAKKAESLGAKL
>P9WPU5 7.1.2.2~~~atpD~~~ATP synthase subunit beta~~~COG0055
MTTTAEKTDRPGKPGSSDTSGRVVRVTGPVVDVEFPRGSIPELFNALHAEITFESLAKTLTLEVAQHLGDNLVRTISLQP
TDGLVRGVEVIDTGRSISVPVGEGVKGHVFNALGDCLDEPGYGEKFEHWSIHRKPPAFEELEPRTEMLETGLKVVDLLTP
YVRGGKIALFGGAGVGKTVLIQEMINRIARNFGGTSVFAGVGERTREGNDLWVELAEANVLKDTALVFGQMDEPPGTRMR
VALSALTMAEWFRDEQGQDVLLFIDNIFRFTQAGSEVSTLLGRMPSAVGYQPTLADEMGELQERITSTRGRSITSMQAVY
VPADDYTDPAPATTFAHLDATTELSRAVFSKGIFPAVDPLASSSTILDPSVVGDEHYRVAQEVIRILQRYKDLQDIIAIL
GIDELSEEDKQLVNRARRIERFLSQNMMAAEQFTGQPGSTVPVKETIEAFDRLCKGDFDHVPEQAFFLIGGLDDLAKKAE
SLGAKL
>A1B8P0 7.1.2.2~~~atpD~~~ATP synthase subunit beta~~~COG0055
MAEANGKITQVIGAVVDVQFDGQLPAILNALETENNGKRLVLEVAQHLGENTVRTIAMDATEGLVRGLPVKDTGGPIMVP
VGDATLGRILNVVGEPVDEGGPVEATQTRAIHQQAPDFAAQATASEILVTGIKVIDLLAPYSKGGKIGLFGGAGVGKTVL
IMELINNIAKVHSGYSVFAGVGERTREGNDLYHEMVESGVIKPDDLSKSQVALVYGQMNEPPGARMRVALTGLTVAEQFR
DATGTDVLFFVDNIFRFTQAGSEVSALLGRIPSAVGYQPTLATDMGAMQERITSTKNGSITSIQAVYVPADDLTDPAPAT
TFAHLDATTVLSRAISELGIYPAVDPLDSNSRILDPAVVGEEHYQVARDVQGILQKYKSLQDIIAILGMDELSEEDKLTV
ARARKIQRFLSQPFDVAKVFTGSDGVQVPLEDTIKSFKAVVAGEYDHLPEAAFYMVGGIEDVKAKAQRLAADAA
>P29707 7.2.2.1~~~atpD~~~ATP synthase subunit beta, sodium ion specific~~~
MENKGVITQIIGPVVDVTFENELPRIYNALKIDRGNGEYLVAEVQQHLGNSVVRAVAMDATDGLQRGMEVVDTGPAITVP
VGKAVLGRILNVLGEPVDEAGEVKAEEYAPIHREAPAFEDQGTEKEVFETGIKVVDLLAPYVKGGKIGLFGGAGVGKTVL
IMELINNIAQGHGGLSVFAGVGERTREGRDLYDEMLESGVLDKTSLVYGQMNEPPGARLRVGLTGLTMAENFRDKEGQDV
LFFVDNIFRFTQAPSEVSALLGRMPSAVGYQPNLATDMGALQERITSTKTGSITSVQAVYVPADDLTDPAPATTFTHLDA
TTVLSRRIASLGIYPAVDPLDSTSTALEPQIIGHEHYNTAREVQQILQRYKELQDIIAILGMDELSDEDKVTVNRARKIE
RFFSQPFHVAEQFTGMDGKYVTVKETIRGFKEIIEGKHDDLPEQAFLYVGTIDEAIAKARELMKGAE
>P72247 7.1.2.2~~~atpD~~~ATP synthase subunit beta~~~
MASKGKVTQVIGAVVDVQFEDGLPAILNALETTNNGKRLVLEVAQHLGENTVRTIAMDATEGLVRGAAVSDTGGPITVPV
GNATLGRILNVIGEPVDERGDVSKAEARAIHQPAPDFAAQSTESQILVTGIKVIDLLAPYSKGGKIGLFGGAGVGKTVLI
MELINNIAKVHSGFSVFAGVGERTREGNDLYHEMIESGVINLEKLEESKVALVYGQMNEPPGARARVALTGLTLAEQFRD
QSGTDVLFFVDNIFRFTQAGSEVSALLGRIPSAVGYQPTLATDMGALQERITSTKAGSITSVQAIYVPADDLTDPAPATS
FAHLDATTVLSRAISELGIYPAVDPLDSTSRILDPQVVGEEHYQVARDVQGMLQRYKSLQDIIAILGMDELSEEDKLTVA
RARKIQRFLSQPFDVAKVFTGSDGVQVPLEDTIKSFKAVVAGEYDHLPEAAFYMVGGIDDVIAKAQRLAAAA
>P05038 7.1.2.2~~~atpD~~~ATP synthase subunit beta~~~
MAKNNLGTITQVTGAVVDVKFEGELPSILSALETDNHGNRLVLEVAQHLGESVVRTIAMDSTEGLVRGQQVTSTGGPITV
PVGPQVLGRIMNVIGEPVDERGPVVTAQRYPIHRQAPTFAEQATETEILVTGIKVIDLIAPYTKGGKVGLFGGAGVGKTV
LIQELINNVAKGHGGYSVFAGVGERTREGNDLYHEMIDAGIIDLEGDKSKVALVYGQMNEPPGARARVALAGLTQAEYFR
DEEGQDVLFFVDNIFRFTQAGSEVSALLGRIPSAVGYQPTLATDMGALQERITSTKKGSITSVQAIYVPADDLTDPAPAA
SFAHLDATTTLNRSIAELGIYPAVDPLDSTSRALDPLVVGEEHYKVAREVQRVLQTYKSLQDIIAILGMDELSEEDRLVV
ARARKIQRFLSQPFHVAEVFTGSPGKLVSLEDTIKGFKGLVEGEYDHLPEQAFYMVGNMAEAIEKAKKMAAEAA
>P99112 7.1.2.2~~~atpD~~~ATP synthase subunit beta~~~
MGIGRVTQVMGPVIDVRFEHNEVPKINNALVIDVPKEEGTIQLTLEVALQLGDDVVRTIAMDSTDGVQRGMDVKDTGKEI
SVPVGDETLGRVFNVLGETIDLKEEISDSVRRDPIHRQAPAFDELSTEVQILETGIKVVDLLAPYIKGGKIGLFGGAGVG
KTVLIQELINNIAQEHGGISVFAGVGERTREGNDLYFEMSDSGVIKKTAMVFGQMNEPPGARMRVALSGLTMAEYFRDEQ
GQDVLLFIDNIFRFTQAGSEVSALLGRMPSAVGYQPTLATEMGQLQERITSTTKGSVTSIQAVFVPADDYTDPAPATAFA
HLDATTNLERKLTEMGIYPAVDPLASTSRALEPSIVGQEHYEVARDVQSTLQKYRELQDIIAILGMDELSDEDKQTVERA
RRIQFFLSQNFHVAEQFTGQKGSYVPVKTTVANFKDILDGKYDHIPEDAFRLVGSMDDVIAKAKDMGVEV
>P0A301 7.1.2.2~~~atpD~~~ATP synthase subunit beta~~~
MTTTVETATATGRVARVIGPVVDVEFPVDAMPEIYNALHVEVADPAKEGELKTLTLEVAQHLGDGLVRTISMQPTDGLIR
QAPVTDTGAAISVPVGDFTKGKVFNTLGEVLNVDEQYTGERWPIHRKAPNFDELESKTEMFETGVKVIDLLTPYVKGGKI
GLFGGAGVGKTVLIQEMIYRVANNHDGVSVFAGVGERTREGNDLIDEMSESGVIDKTALVFGQMDEPPGTRLRVALAGLT
MAEYFRDVQKQDVLFFIDNIFRFTQAGSEVSTLLGRMPSAVGYQPNLADEMGLLQERITSTRGHSITSMQAIYVPADDLT
DPAPATTFAHLDATTVLSRPISEKGIYPAVDPLDSTSRILDPRYIAAEHYNAAMRVKNILQKYKDLQDIIAILGIDELGE
EDKLVVHRARRVERFLSQNTHVAKQFTGVDGSDVPLDESIAAFNAICDGEYDHFPEQAFFMCGGIEDLKNNAKELGVS
>Q8DP44 7.1.2.2~~~atpD~~~ATP synthase subunit beta~~~COG0055
MSSGKIAQVIGPVVDVLFAAGEKLPEINNALVVYKNDERKTKIVLEVALELGDGMVRTIAMESTDGLTRGMEVLDTGRPI
SVPVGKETLGRVFNVLGDTIDLEAPFTEDAERQPIHKKAPTFDELSTSSEILETGIKVIDLLAPYLKGGKVGLFGGAGVG
KTVLIQELIHNIAQEHGGISVFAGVGERTREGNDLYWEMKESGVIEKTAMVFGQMNEPPGARMRVALTGLTIAEYFRDVE
GQDVLLFIDNIFRFTQAGSEVSALLGRMPSAVGYQPTLATEMGQLQERITSTKKGSVTSIQAIYVPADDYTDPAPATAFA
HLDSTTNLERKLVQLGIYPAVDPLASSSRALAPEIVGEEHYAVAAEVKRVLQRYHELQDIIAILGMDELSDEEKTLVARA
RRIQFFLSQNFNVAEQFTGQPGSYVPVAETVRGFKEILDGKYDHLPEDAFRGVGSIEDVIAKAEKMGF
>Q05373 7.1.2.2~~~atpD~~~ATP synthase subunit beta~~~
MVTTAERTNVGFITQVIGPVIDIEFPSGKMPAIYNALRIQGKNAAGLDVAVTCEVQQLLGDNRVRAVAMSSTDGLVRGME
AVDTGAPISVPVGTATLGRIFNVLGEPVDEKGEVNISETLPIHRPAPSFTELETKPSVFETGIKVIDLLTPYRRGGKIGL
FGGAGVGKTVIMMELINNIATQHGGVSVFAGVGERTREGNDLYNEMIESGVIDKDDPSKSKIALVYGQMNEPPGARMRVG
LSGLTMAEYFRDVNKQDVLLFVDNIFRFVQAGSEVSALLGRMPSAVGYQPTLGTDVGALQERITSTMEGSITSIQAVYVP
ADDLTDPAPATTFAHLDGTTVLSRGLAAKGIYPAVDPLGSTSNMLQPDIVGSEHYQTARAVQATLQRYKELQDIIAILGL
DELSEEDRLTVARARKVERFLSQPFFVAEVFTGAPGKYVTLEETIKGFQMILSGELDDLPEQAFYMVGNIEEAKAKAEKL
KA
>Q8DLG8 7.1.2.2~~~atpD~~~ATP synthase subunit beta~~~COG0055
MVISAERTNVGFITQVIGPVVDIEFPSGKMPAIYNALRIQGKNAAGLDVAVTCEVQQLLGDNRVRAVAMSSTDGLVRGME
VVDTGAPISVPVGTATLGRIFNVLGEPVDEKGAVNATETLPIHRPAPSFTQLETKPSVFETGIKVIDLLTPYRRGGKIGL
FGGAGVGKTVIMMELINNIATQHGGVSVFAGVGERTREGNDLYNEMIESGVIDKDDPSKSKIALVYGQMNEPPGARMRVG
LSGLTMAEYFRDVNKQDVLLFIDNIFRFVQAGSEVSALLGRMPSAVGYQPTLGTDVGALQERITSTTEGSITSIQAVYVP
ADDLTDPAPATTFAHLDGTTVLSRSLAAKGIYPAVDPLGSTSNMLQPDIVGEEHYQTARAVQATLQRYKELQDIIAILGL
DELSEEDRLTVARARKIERFLSQPFFVAEVFTGAPGKYVTLEETIKGFQMILSGELDDLPEQAFYMVGNIEEAKAKAEKL
KA
>P12986 7.1.2.2~~~atpD~~~ATP synthase subunit beta~~~COG0055
MATGKIVQIIGAVVDVEFPQSNVPSVYDALNVTDSKERLVLEVQQQLGGGVVRCIVMGSSDGLRRGVEVVNTGAPISVPV
GTKTLGRIMNVLGDAIDERGEVGAEEVYSIHRSAPSYEEQSNEIALLETGVKVIDLICPFAKGGKIGLFGGAGVGKTVNM
MELINNIALQHSGLSVFAGVGERTREGNDFYYEMQEAGVVNVEKPEESKVAMVYGQMNEPPGNRLRVALTGLTMAERFRD
EGRDVLLFIDNIYRYTLAGTEVSALLGRMPSAVGYQPTLAEEMGVLQERITSTKSGSITSVQAVYVPADDLTDPSPATTF
AHLDATVVLNRNIAAMGLYPAIDPLDSTSRMLDPLVVGQDHYEVARGVQQTLQRYKELKDIIAILGMDELSEEDKQVVSR
ARKIERFLTQPYHVAEVFTGDPGIYVPLKETLRGFKGLLAGEYDDIPEQAFMYCGSIDDAIENAKKL
>A3M141 ~~~atpH~~~ATP synthase subunit delta~~~
MAELLTLARPYAKAAFAYASEQGATDNWSNALQVLSAAVQDEAFSAYLNRPELTPAEQVKLFAKVLGEDQSQAVSNFLTL
LADNDRLVLLPEIAAEYEQLKSQNNNNVDVVIESAFPLTAEQEQLLKSALEKRFNSTVTVSVEVKPELIAGVVIRAGDQV
IDDSALNKLEKMRTRLLA
>P09220 ~~~atpH~~~ATP synthase subunit delta~~~
MNQEVIAKRYASALFQIALEQGQLDRIEEDVRAVRQALAENGEFLSLLSYPKLSLDQKKALIREAFAGVSTPVQNTLLLL
LERHRFGLVPELAGTVSRPRSTTARGIAKAVAYSGAASTDEELRALSDVFAQKVGKQTLEIENIIDPELIGGVNVRIGNR
IYDGSVSGQLERIRRQLIG
>P37811 ~~~atpH~~~ATP synthase subunit delta~~~COG0712
MSGSAVSKRYASALFDIANESAQLNQVEEELIVVKQVFQNEKALNDVLNHPKVPAAKKKELIQNAFGSLSQSVLNTIFLL
IDRHRAAIVPELTDEFIKLANVARQTEDAIVYSVKPLTDAEMLPLSQVFAKKAGVASLRIRNEVQTDLIGGIKVRIGNRI
YDGSVSGKLQRIERQLAGENR
>Q0ZS22 ~~~atpH~~~ATP synthase subunit delta, sodium ion specific~~~
MAKLVATRYASAIFEVGVELNKEEMFYEELKIISSNFEENEKFFKMLKTPILSKQEKKELIENIYKDKASLEIVNFLKVL
IDKDRISIIREIVDIYKQLLNENKNEIQAIAITAVPMSEESLKELTYKLCEKTKKNVKVKNQVDPTVLGGVLVKMGNEEI
DGTVKTKLEKLKKQLHQIIA
>P0ABA4 ~~~atpH~~~ATP synthase subunit delta~~~COG0712
MSEFITVARPYAKAAFDFAVEHQSVERWQDMLAFAAEVTKNEQMAELLSGALAPETLAESFIAVCGEQLDENGQNLIRVM
AENGRLNALPDVLEQFIHLRAVSEATAEVDVISAAALSEQQLAKISAAMEKRLSRKVKLNCKIDKSVMAGVIIRAGDMVI
DGSVRGRLERLADVLQS
>Q8KRV1 ~~~atpH~~~ATP synthase subunit delta, sodium ion specific~~~
MIEAQVGKRYAEAIYGIAEANNKVKELYDSLNIVMELYKGDKEFKNLVDHPLVKKEEKKEFINKVFSEFEKFSLDILCYL
VEKNRLSYIRGVVAEYLKIYYTKNRIVDVEATFAIEPSEKQKAKLIEKLEKKTGKKVNLVIKINKAIIAGGIIKIGDEII
DGSVRRQLDTVARG
>Q9RGY4 ~~~atpH~~~ATP synthase subunit delta~~~COG0712
MALSREEVAARYGTALFGYAQDNKVLDTVYDEMMALKKAAIANPKFISVLSDPILSSKDKKSILTAVEKDFSDEVQGFLN
LLLEYNRFADLIDIIDQFSLLYDNENKIASGTATTAVKLDDDQLERLSESFAKKYDLNAVRLENKVDPSILGGVILQVKD
RVIDGSVKNKLKKIRAQIIDEN
>Q2RFX6 ~~~atpH~~~ATP synthase subunit delta~~~COG0712
MSEQNVARRYARALFNIAREQGTAGEFANGLEEVSRTLAENSDFRRVLYHQLIPVREKQKLIDTIFPDINPLLKNFLHLV
LAKGRERALPEMAAQFRRLVDQAENILPVEVTSAITLREDILAGLKERLAGITRRNIRLSSRVNPELIGGVVIRLGDRVL
DASVKKKLELLGEHLKRA
>A1B8N7 ~~~atpH~~~ATP synthase subunit delta~~~COG0712
MTVANSASISADIAGRYAQALFDLVRDSGGIDALSSQIDDLASAYDASQDLRDLTLSPLYDRQQQEAAVGALSERMGLSA
ELANTLRLLARNRRLFTLPQFVAKLRNLIADAKGEVTADVVSAQALTDEQKARLADTLAAKSGKTVKLNARVDESLIGGM
IVKLGSQMIDSSIRSKLASLQNAMKEVG
>P29708 ~~~atpH~~~ATP synthase subunit delta, sodium ion specific~~~
MIEAQVGRRYAEAIYEIAESNDNVKELYETLNGVMELYNTDKEFKTLVDHPLIKREDKKEFAKKIFGELEESSLNIIFYL
IEKDRLSSIRGIVAEYLKIYYAKNQILDVEAIFAIEPTKDQKAKLIEQLEKKTGKKVNLEVSIDKSIIAGGIIKIGDEII
DGSVRRQLDTIARS
>P72244 ~~~atpH~~~ATP synthase subunit delta~~~
MSEPASISAAIAGRYATAIFDLAQEAKGIDALSADVDALTAALAGSAELRDLISSPVYTREEQGDAIAAVAAKMGLSAPL
ANGLKLMATKRRLFALPQLLKGLAAAIAEAKGEMTADVTSATALSAAQAEKLAATLAKQTGKTVKLNVAVDESLIGGMIV
KLGSRMIDTTVKAKLASLQNAMKEVG
>P99109 ~~~atpH~~~ATP synthase subunit delta~~~
MVKVANKYAKALFDVSLDTNNLETINEELTVINEAVKDKIEQLKMVDSNPTQTAEQRRELINGVFTDINPYIKNMMYVLA
DNRHISLIADVFKAFQSLYNGHYNQDFATIESTYELSQEELDKIVKLVTQQTKLSKVIVDTKINPDLIGGFRVKVGTTVL
DGSVRNDLVQLQRKFRRVN
>P50008 ~~~atpH~~~ATP synthase subunit delta~~~
MSGMHGASREALAAARERLDALTDSTSVDAGSLADELAAVTALLHREVSLRRVLTDPAQSARPRPSSPSVSSAPRSAAPV
DLVAGTVRSRWSQSRDLVDALEQLANIADLTAAQKRGRLDNVEDELFRFGRIISSNTELRAALTSRSATTAAKSELLAGL
LGSRAERTTERLVTRLVTAPRGRSLESGLESLSKLAADRRDRMVAVVTSAVPLSDTQKQRLGAALAKVYGRPMHLNLDVD
PEVLGGIRVQVGDEVINGSIADRLEDAGRRLAS
>P0A2Z5 ~~~atpH~~~ATP synthase subunit delta~~~COG0712
MDKKTVKVIEKYSMPFVQLVLEKGEEDRIFSDLTQIKQVVEKTGLPSFLKQVAVDESDKEKTIAFFQDSVSPLLQNFIQV
LAYNHRANLFYDVLVDCLNRLEKETNRFEVTITSAHPLTDEQKTRLLPLIEKKMSLKVRSVKEQIDESLIGGFVIFANHK
TIDVSIKQQLKVVKENLK
>Q05374 ~~~atpH~~~ATP synthase subunit delta~~~
MQTTIRGELVEPYAEALLSLAQSHNLADQFQQDSGLILDLLAASAELQEFLANPLINPDAKKNVLRQLTVDKVHGYFLNF
LMLLVDRRRINLLAAICQQYRALLRKLRNIVLAEVISAVELTEQQRHAVVEKVKTMTGAADVELAIAIDPELLGGVVIKV
GSQIFDASLRGQLRRLSVSLAQPV
>Q8DLP4 ~~~atpH~~~ATP synthase subunit delta~~~COG0712
MMQTTVRGEVVEPYAEALLSLAQTHNLIDQFQQDTQLMVELVASSGELQQFLANPLIKPEAKKNVLRQLTVDKVHGYFLN
FLMLLVDRRRINFLSSICEHYRALVRKLRNVALAEVTSAVELNDDQRRAVVEKVKTMTGAADVELVTACDPELIGGVVIK
VGSQIFDASLRGQLRRLSVTLAQAT
>P50009 ~~~atpC~~~ATP synthase epsilon chain, sodium ion specific~~~COG0355
MAETFRLKIIAPTGVFFDDDIERVVIRGIEGELAILAEHTPLTTNVAIGTFNIIFADKKKKNGTLLGGIATINPRETIIL
TDAAEWPEEIDIKRAQEAKERALKRIHDDKFDTARARAALERAIARINSKENV
>A3M145 ~~~atpC~~~ATP synthase epsilon chain~~~
MATMQCDVVSVKESIYSGAVTMLIAKGAGGELGILPGHAPLVTLLQPGPIRVLLENGTEEIVYVSGGVLEVQPHVVTVLA
DTAIRADNLDEAAILEARKNAEQLLANQKSDLDSAAALAALAETAAQLETIRKIKNRAQ
>P07678 ~~~atpC~~~ATP synthase epsilon chain~~~
MKTIHVSVVTPDGPVYEDDVEMVSVKAKSGELGILPGHIPLVAPLEISAARLKKGGKTQYIAVSGGFLEVRPDNVTILAQ
AAERAEDIDVLRAKARKSGRTPLQSQQDDIDFKRAELALKRAMNRLSVAEMK
>P37812 ~~~atpC~~~ATP synthase epsilon chain~~~COG0355
MKTVKVNIVTPDGPVYDADIEMVSVRAESGDLGILPGHIPTVAPLKIGAVRLKKDGQTEMVAVSGGFVEVRPDHVTILAQ
AAETAEGIDKERAEAARQRAQERLNSQSDDTDIRRAELALQRALNRLDVAGK
>P0A6E6 ~~~atpC~~~ATP synthase epsilon chain~~~COG0355
MAMTYHLDVVSAEQQMFSGLVEKIQVTGSEGELGIYPGHAPLLTAIKPGMIRIVKQHGHEEFIYLSGGILEVQPGNVTVL
ADTAIRGQDLDEARAMEAKRKAEEHISSSHGDVDYAQASAELAKAIAQLRVIELTKKAM
>Q8RGE3 ~~~atpC~~~ATP synthase epsilon chain~~~COG0355
MPSFDVSVVTQVKKILEQEAGYLRLRTSEGDIGILPNHAPFVAELSMGKMEIESPNKDRRDIYFLSGGFLEISDNQATVI
ADEVFPIEKIDVESEQALVENLKKELEKVSTEEEKRKLQKKIKISLAKIDAKNN
>Q5KUJ4 ~~~atpC~~~ATP synthase epsilon chain~~~COG0355
MKTIHVSVVTPDGPVYEDDVEMVSVKAKSGELGILPGHIPLVAPLEISAARLKKGGKTQYIAVSGGFLEVRPDKVTILAQ
AAERAEDIDVLRAKAAKERAERRLQSQQDDIDFKRAELALKRAMNRLSVAEMK
>Q9RGY0 ~~~atpC~~~ATP synthase epsilon chain~~~COG0355
MADPEKLFKVIVVTPNGMIYSHRGSIVDVRAIDGERSILYNHIPILTPLAISEVKVKRSREMGSRIDHIAISGGYIEFSN
NVATIVADSAERARNIDVSRAQAAKERAEKRLREAREKHDERNLERAQVALKRAMNRISVYNARGH
>P80286 ~~~atpC~~~ATP synthase epsilon chain~~~COG0355
MAELNVEIVSEERSIWSGAASAVSARTVNGEIGILPGHTPMLAVLGDGEVVVRTTDGGTVTAQAHGGFFSVDHDRVVIAA
TSARLGDAAAA
>O05434 ~~~atpC~~~ATP synthase epsilon chain~~~COG0355
MASLNLEIITPERVVLQAEAASVIAPGIQGYLGVLPEHAPLITPLQAGVVTCRRRERAEERVAVSGGFLEAGPDQVIILA
DTAERSEEIDVEWARQARERAERRLRERPPGLDVARAEAALRRAVARLKAAGAI
>A0R1Z9 ~~~atpC~~~ATP synthase epsilon chain~~~COG0355
MADLNVEIVAVERELWSGPATFVFTRTTAGEIGILPRHIPLVAQLVDDAMVRVEREGEDDLRIAVDGGFLSVTEETVRIL
VENAQFESEIDADAAKEDAASDDERTAAWGRARLRALGQID
>P9WPV1 ~~~atpC~~~ATP synthase epsilon chain~~~COG0355
MAELNVEIVAVDRNIWSGTAKFLFTRTTVGEIGILPRHIPLVAQLVDDAMVRVEREGEKDLRIAVDGGFLSVTEEGVSIL
AESAEFESEIDEAAAKQDSESDDPRIAARGRARLRAVGAID
>A1B8P1 ~~~atpC~~~ATP synthase epsilon chain~~~COG0355
MADTMQFDLVSPERNLVSVPVREVRLPGADGDLTAMPGHAPAIVNLRPGLVTVVAGDGSETEFAVTGGFAEINNESVTLL
AERGHPRAEMTQEVFNEMMAQARRRVEAAKERESAGEELVAAAVKLLADMEALGTHIGLDPNHANFPH
>P29709 ~~~atpC~~~ATP synthase epsilon chain, sodium ion specific~~~
MATFKLEVVTPLKKVLDRDAEMVIMRTIEGDMGVMADHAPFVAELAVGEMKIKSANGEEAYFVSGGFLEISKEKTMILAD
EAIDVKEIDVERAKREAEIAKETLVKLKEDKDIAVTQKSLQEALTKVRIAEQYMHHL
>P72248 ~~~atpC~~~ATP synthase epsilon chain~~~
MADTMQFDLVSPERRLASVAASEVRLPGVEGDLTAMPGHAPVILSLRPGILTVVSAAGTAEYAVTGGFAEVSGEKVTVLA
ERGLTRAELTAAVHAEMLAEAKKVADAAHPSVADAAAKMLADMEALGSHINL
>P0A1B7 ~~~atpC~~~ATP synthase epsilon chain~~~
MAMTYHLDVVSAEQQMFSGLVEKIQVTGSEGELGIYPGHAPLLTAIKPGMIRIVKQHGHEEFIYLSGGILEVQPGSVTVL
ADTAIRGQDLDEARALEAKRKAEEHIKSSHGDVDYAQASAELAKAIAKLRVIELTKKAM
>P63665 ~~~atpC~~~ATP synthase epsilon chain~~~
MNTLNLDIVTPNGSVYNRDNVELVVMQTTAGEIGVMSGHIPTVAALKTGFVKVKFHDGTEYIAVSDGFVEVRKDKVSIIV
QTAETAREIDVERAKLAKARAESHLENDDDNTDIHRAERALERANNRLRVAELK
>P0A2Z7 ~~~atpC~~~ATP synthase epsilon chain~~~
MAAELHVALVAADREVWSGEATLVVARTTSGDIGVMPGHQPLLGVLESGPVTIRTSDGGTVVAAVHGGFISFADNKLSLL
AEVAELSDEIDVHRAERKLEQAKTEGDAHAERRADVRLRAAAGR
>P63668 ~~~atpC~~~ATP synthase epsilon chain~~~COG0355
MAQLTVQIVTPDGLVYDHHASYVSVRTLDGEMGILPRHENMIAVLAVDEVKVKRIDDKDHVNWIAVNGGVIEIANDMITI
VADSAERARDIDISRAERAKLRAERAIEEAQDKHLIDQERRAKIALQRAINRINVGNRL
>Q05375 ~~~atpC~~~ATP synthase epsilon chain~~~
MVMTVRVIAPDKTVWDAPAEEVILPSTTGQLGILSNHAPLLTALETGVMRVRQEREWVAIALMGGFAEVENNEVTVLVNA
AERGDTIDLETAKREFSEAQAAVAKAAQSGSKQAQIQAAQAFRRARARLQAAGGVVEI
>P26533 ~~~atpC~~~ATP synthase epsilon chain~~~COG0355
MTLTVRVITPDKVVWDEEVQELILPSTTGQLGILSNHAPLLTALEIGVMRVRPGKDWQNIAVMGGFAEVENNEVKVLVNG
AELGTTIDAESARQAYTAAQGALEEANRGEDKPNQLKASNNYKKARARLQAAGGAV
>Q8DLG7 ~~~atpC~~~ATP synthase epsilon chain~~~COG0355
MVMTVRVIAPDKTVWDAPAEEVILPSTTGQLGILSNHAPLLTALETGVMRVRQDREWVAIALMGGFAEVENNEVTILVNG
AERGDTIDLEKAKAEFAAAQAALAQAEQGESKQAKIQATQAFRRARARLQAAGGVVEI
>O05332 ~~~atpF2~~~ATP synthase subunit b'~~~
MANETNAVEAAAAVAGHAAEAAEKGGMPQLDFSTFPNQIFWLLLALGAIYWLLKNIAIPRIAAILADRAGTISGDLAAAE
QYKLKAKDAEAAYAKALADARAQAQKIIAETRAVIQKDLDAATAKADADIAARVAQSEVKIAEIRAGALEAVQIVATDTA
TAIVTALGGKADMGALNAAVGQRVKG
>Q8DLP6 ~~~atpF2~~~ATP synthase subunit b'~~~COG0711
MFDFDATLPLMAVQFLILTVILNALLYKPLGQALDNRDEYIRTNLQQAKERLQQATELAQQYEQELASTRRQAQALIEEA
RVEAQKIATAEIAEAQQAVQAELLKIQAEIDQQKQATLQALEGQVASLSEQLLAKLMA
>A0R203 ~~~atpFH~~~ATP synthase subunit b-delta~~~COG0711
MSIFIGQLIGFAVIAFIIVKWVVPPVRTLMRNQQEAVRAALAESAEAAKKLADADAMHAKALADAKAESEKVTEEAKQDS
ERIAAQLSEQAGSEAERIKAQGAQQIQLMRQQLIRQLRTGLGAEAVNKAAEIVRAHVADPQAQSATVDRFLSELEQMAPS
SVVIDTAATSRLRAASRQSLAALVEKFDSVAGGLDADGLTNLADELASVAKLLLSETALNKHLAEPTDDSAPKVRLLERL
LSDKVSATTLDLLRTAVSNRWSTESNLIDAVEHTARLALLKRAEIAGEVDEVEEQLFRFGRVLDAEPRLSALLSDYTTPA
EGRVALLDKALTGRPGVNQTAAALLSQTVGLLRGERADEAVIDLAELAVSRRGEVVAHVSAAAELSDAQRTRLTEVLSRI
YGRPVSVQLHVDPELLGGLSITVGDEVIDGSIASRLAAAQTGLPD
>P9WPV3 ~~~atpFH~~~ATP synthase subunit b-delta~~~COG0711
MSTFIGQLFGFAVIVYLVWRFIVPLVGRLMSARQDTVRQQLADAAAAADRLAEASQAHTKALEDAKSEAHRVVEEARTDA
ERIAEQLEAQADVEAERIKMQGARQVDLIRAQLTRQLRLELGHESVRQARELVRNHVADQAQQSATVDRFLDQLDAMAPA
TADVDYPLLAKMRSASRRALTSLVDWFGTMAQDLDHQGLTTLAGELVSVARLLDREAVVTRYLTVPAEDATPRIRLIERL
VSGKVGAPTLEVLRTAVSKRWSANSDLIDAIEHVSRQALLELAERAGQVDEVEDQLFRFSRILDVQPRLAILLGDCAVPA
EGRVRLLRKVLERADSTVNPVVVALLSHTVELLRGQAVEEAVLFLAEVAVARRGEIVAQVGAAAELSDAQRTRLTEVLSR
IYGHPVTVQLHIDAALLGGLSIAVGDEVIDGTLSSRLAAAEARLPD
>A3M140 ~~~atpF~~~ATP synthase subunit b~~~
MNINLTLIGQAIAFAFFVAFCMKFVWPPLINAISERQRKIADGLNAAEKAKADLADAQAQVKQELDAAKAQAAQLIEQAN
RRAAQLIEEARTQAAAEGERIRQQAKEAVDQEINSAREELRQQVAALAVTGAEKILNQQVDAEAHNAMLSQLAAKL
>P09221 ~~~atpF~~~ATP synthase subunit b~~~
MLWKANVWVLGEAAHGISGGTIIYQLLMFIILLALLRKFAWQPLMNIMKQREEHIATKSTRRKNDRQEAEKLLEEQRELM
KQSRQEAQALIENAASLAEEQKEQIVASARAEAERVKEAAKKEIEREKEQAMAALREQVASLSVLIASKVIEKELTEQDQ
AAS
>P37814 ~~~atpF~~~ATP synthase subunit b~~~COG0711
MSQLPLELGLSFNGGDILFQLLAMLILLALLKKYALGPLLNIMKQREDHIAGEITSAEEKNKEAQQLIEEQRVLLKEARQ
ESQTLIENAKKLGEKQKEEIIQAARAESERLKEAARTEIVKEKEQAVSALREQVASLSVMIASKVIEKELDEQAQEKLIQ
DYLKEVGESR
>Q0ZS23 ~~~atpF~~~ATP synthase subunit b, sodium ion specific~~~
MQFASFISLDWGVVFQIVNTIVMYLILKKLLFKPVTKFMNDRQESIANSIKEAEETKKEAYALKAEYEAKINASKEEGQE
IIKEASRKAEMRADEIIKNAQNEANRLMEKAHIEIEREKQKVVNELKDEISNIAILAASKVIEADIDKNKHEKLISDFIK
EVGEATWQN
>P0ABA0 ~~~atpF~~~ATP synthase subunit b~~~COG0711
MNLNATILGQAIAFVLFVLFCMKYVWPPLMAAIEKRQKEIADGLASAERAHKDLDLAKASATDQLKKAKAEAQVIIEQAN
KRRSQILDEAKAEAEQERTKIVAQAQAEIEAERKRAREELRKQVAILAVAGAEKIIERSVDEAANSDIVDKLVAEL
>Q8KRV2 ~~~atpF~~~ATP synthase subunit b, sodium ion specific~~~
MAPQNMPAVSIDINMFWQIINFLILMFFFKKYFQKPISKVLDARKEKIANELKQAEIDREMAAKANEETQGILKAARTEA
NEILLRAEKKADDRKEAILKEANSQREKTIKSAELEVEKMKKQARKELQSEVTALAVNLAEKMINEKLDSKLGANLLNVL
LKR
>Q9RGY5 ~~~atpF~~~ATP synthase subunit b~~~COG0711
MTIQTLFAASHHIYLGNAIWYLLCFAILMLLIKHYAWGPVSDMMEKRRQKIISDLDSAASDRKKAETLANEREAALKNSR
QEATQILSDAKTNAQNTSKEIVASANEDAAAIRKKANEEAAKAKSDALDAARDQVADISVAIAEKVIAKNLSAEDQKDLV
DQFIKGLDD
>P80285 ~~~atpF~~~ATP synthase subunit b~~~COG0711
MISNGLILAAAEGANPLIPNPWEILVVVVGFALLMFIVIKFIVPTLEKSYQDRVEAIEGGLAKAEKAQAEANAMMADYES
QLADARTEANRIREDARTEAAEIVAEARERATAEATRVFEQAQAQIAAERQQAAAQLKREVGSLATTLAGKIVGESLEDD
ARSQRVVDRFLADLDRHQSAGVAE
>Q2RFX5 ~~~atpF~~~ATP synthase subunit b~~~COG0711
MQAIFQALNFNPWTFLFQTLNLLVVMGLLYVFLYKPLGKVLADREARIEGNLNDAAAAREKAENILAEYRQQLQGARQEA
QAILDRATKMAEETRAEIINRAREEAERTLAQARREIEGEKSKALAAIRSEAASLAILAAGKVLERSLTPDDQERLAREA
IAEVERLQ
>Q50327 ~~~atpF~~~ATP synthase subunit b~~~
MKLRATFVFKTTLVALSFALFALFLVSCTENVKEIKSESVINELFPNLWVFLAHLLAFVILLFLLLFLFWKPTQKFLNQR
KALLEEQVNQANSLEQQAQALLQQANQRHENSLVVAKEIVDQANYEALQLKSEIEKKANRQANLMIFQARQEIEKEKRLI
QEQSLKESVELAMLAAKELIIKKVDVKADKAFIEEFIRELEAEDDHD
>A0R204 ~~~atpF~~~ATP synthase subunit b~~~COG0711
MGEFSATILAASQAAEEGGGGSNFLIPNGTFFAVLIIFLIVLGVISKWVVPPISKVLAEREAMLAKTAADNRKSAEQVAA
AQADYEKEMAEARAQASALRDEARAAGRSVVDEKRAQASGEVAQTLTQADQQLSAQGDQVRSGLESSVDGLSAKLASRIL
GVDVNSGGTQ
>P9WPV5 ~~~atpF~~~ATP synthase subunit b~~~COG0711
MGEVSAIVLAASQAAEEGGESSNFLIPNGTFFVVLAIFLVVLAVIGTFVVPPILKVLRERDAMVAKTLADNKKSDEQFAA
AQADYDEAMTEARVQASSLRDNARADGRKVIEDARVRAEQQVASTLQTAHEQLKRERDAVELDLRAHVGTMSATLASRIL
GVDLTASAATR
>P21904 ~~~atpF~~~ATP synthase subunit b, sodium ion specific~~~
MAPQNMPAVSIDINMFWQIINFLILMFFFKKYFQKPIAKVLDARKEKIANDLKQAEIDKEMAAKANGEAQGIVKSAKTEA
NEMLLRAEKKADERKETILKEANTQREKMLKSAEVEIEKMKEQARKELQLEVTDLAVKLAEKMINEKVDAKIGANLLDQF
IGEVGEEK
>O05333 ~~~atpF~~~ATP synthase subunit b~~~
MKKLTFLLVALAANPAFASEGPFVSLRNAHFVILVAFLIFVGVLIKFKVPSMLLGMLDKRAEGIKADLDEAKALRDEAQK
ILASYERKAREVQGQADEIVAAAKRDAQLAAEQAKADLKEAIARRLKGAEDRIASAEAAALKDVKDRAVQVAVAAAAEVL
ANQMSASDKSGMIDAAITEVETRLN
>P15013 ~~~atpF~~~ATP synthase subunit b~~~
MISLALAAETAEHGGEAASHGGLFADPAFWVSIAFLMVVGFVYIKAKNKILGALDGRGAAVKAKLDEARKLRDDAQALLA
EYQRRQRDAMKEADEIIRHAKDEAARLRAKAEADLEASIRRREQQAVDRIAQAEAQALAQVRNEAVDVAVSAARSLMAGS
LAKADQNRLIDAAIADLPGKLH
>Q7A4E7 ~~~atpF~~~ATP synthase subunit b~~~
MTETANLFVLGAAGGVEWGTVIVQVLTFIVLLALLKKFAWGPLKDVMDKRERDINRDIDDAEQAKLNAQKLEEENKQKLK
ETQEEVQKILEDAKVQARQQQEQIIHEANVRANGMIETAQSEINSQKERAIADINNQVSELSVLIASKVLRKEISEQDQK
ALVDKYLKEAGDK
>P0A2Z3 ~~~atpF~~~ATP synthase subunit b~~~COG0711
MHVTVGELIGNFILITGSFILLLVLIKKFAWSNITGIFEERAEKIASDIDRAEEARQKAEVLAQKREDELAGSRKEAKTI
IENAKETAEQSKANILADAKLEAGHLKEKANQEIAQNKVEALQSVKGEVADLTISLAGKIISQNLDSHAHKALIDQYIDQ
LGEA
>Q8DLP5 ~~~atpF~~~ATP synthase subunit b~~~COG0711
MSVMDALFLLATEEVGHFGINTNLLETNVINLAILIGVLVYFGRGVLGKTLGDRQKQIATAIAEAEERQKVAAARLAEAQ
QKLTQAKQEAQRIREDALTRAKAVKEEIIAQAKREIERLQETASQDTSAATERAIAEIRERIAAMALAEAENQLKARLSQ
NPDLQRTLIDRSIALLGGK
>P50005 ~~~atpG~~~ATP synthase gamma chain, sodium ion specific~~~COG0224
MAENVQDIKRRIKSVNSTMQITHAMELVASAKLRKSRELAEGRRPYFEAMIESIGRIVEKSGNARNIFMDQREVKKTAYI
IITGDKGLAGGYNVNVAKLVEEHITDKENAVLFTVGSRGRDHFRNREYHIQGEYLGISERPNFFNAKEVTAIVMEGFKNG
EYDEVYIAYTKFVSTITQHAQMMKLLPLSAEELITSGKVKTTEETKEEKSKMSDRELTIMTYEPEPEELLKYLIPNFVSS
TVYGSMIESAASEQGARRTAMESATTNANEMIDGLTLQYNRVRQAAITQEISEIVGGAEALN
>A3M143 ~~~atpG~~~ATP synthase gamma chain~~~
MANLKEIRAKVASIKSTQKITRAMQMVAASKMRRAQERMAQGRPYADNMRRVIAHLVQANPEYKHRYMVDRPVKRVGYII
VSSDRGLAGGLNINLFKKVVQHVKAQQEQSIEVQFALIGQKAVSFFKNYGGKVLGATTQIGDAPSLEQLTGSVQVMLDAF
DKGELDRIYLVSNGFVNAMTQKPKVEQLVPLAPAEEGDDLNRTYGWDYIYEPEAEELLNGLLVRYIESMVYQGVIENVAC
EQSARMVAMKAATDNAGQLIKDLQLIYNKLRQAAITQEISEIVGGAAAV
>P50006 ~~~atpG~~~ATP synthase gamma chain~~~
MSNLKAIRDRIQSVKNTKKITEAMRLVASAKVRRAQEQVLATRPFADRLAGVLYGLQGRLQFEDVECPLLQQREVKKVGL
VVLAGNRGLCGAYNSNIIKRAEARAAELKAEGLEYSYLLVGRKAIQHFTRRDAPISQCRDNPEKTPDPQEVSSATDEILA
WFESGAVDRVELIYTKFVSLISSRPVTQTLLPLDLQGLEAQDDEVFRLTSKGGKFDVTREKVSVEPEALAQDMIFEQDPV
EILNALLPLFLTNQLLRAWQESTASELAARMTAMSNASDNASDLVKTLTLSYNKARQASITQELLEVVAGA
>P09222 ~~~atpG~~~ATP synthase gamma chain~~~
MKPLASLRDIKTRINATKKTSQITKAMEMVLTSKLNRAEKREIVRPYMEKIQEVVANVALAARASHPMLVSRPVKKTGYL
VITSDRGLAGAYNSNVLRLVYQTIQKRHASPDEYAIIVIGRVGLSFFRKRNMPVILDITRLPDQPSFADIKEIARKTVGL
FADGTFDELYMYYNHYVSAIQQEVTERKLLPLTDLAENKQRTVYEFEPSQEEILDVLLPQYAESLIYGALLDAKASEHAA
RMTAMKNATDNANELIRTLTLSYNRARQAAITQEITEIVAGANALQ
>P37810 ~~~atpG~~~ATP synthase gamma chain~~~COG0224
MASLRDIKSRITSTKKTSQITKAMQMVSAAKLNRAENNAKSFVPYMDKIQEVVSNVGRVSGNVKHPMLLSREVKKTAYLV
ITSDRGLAGAFNSSVLRSAYQAMQERHQSKDEYAVIAIGRVGRDFFKKREIPIISELTGLGDEVTFTEIKDLARQTIQMF
IDGAFDELHLVYNHFVSAITQEVTEKKLLPLSDLGSGGGKRTASYEFEPSEEEVLEVLLPQYAESLIFGALLDSKASEHA
ARMTAMKNATDNAKELIDSLSLSYNRARQAAITQEITEIVGGAAALE
>Q72E03 ~~~atpG~~~ATP synthase gamma chain~~~COG0224
MPSLKDVKVKIAGVKKTKQITKAMNMVASAKLRGAQQRIERFRPYAEKFYGMLGDLASKADGSAHPLLEVRDEIKTCGIV
LATSDRGLCGSFNANLISTALKLAKQKAAEGKTVKFYCVGKKGRDTIRKADFEVVTAIADQMGSFDFQLANKLGLEVINH
YLTGELDEVVLVYGEFVSTAKQLPITLPILPIASEKKDEAEAAPSKEYIYEPAVEGLLAELLPRFIKVQIYRGLLDTSAS
EHAARMAAMDNATRSCDDMIGALTLLFNKTRQASITRDLMDIVGGAEALKG
>P0ABA6 ~~~atpG~~~ATP synthase gamma chain~~~COG0224
MAGAKEIRSKIASVQNTQKITKAMEMVAASKMRKSQDRMAASRPYAETMRKVIGHLAHGNLEYKHPYLEDRDVKRVGYLV
VSTDRGLCGGLNINLFKKLLAEMKTWTDKGVQCDLAMIGSKGVSFFNSVGGNVVAQVTGMGDNPSLSELIGPVKVMLQAY
DEGRLDKLYIVSNKFINTMSQVPTISQLLPLPASDDDDLKHKSWDYLYEPDPKALLDTLLRRYVESQVYQGVVENLASEQ
AARMVAMKAATDNGGSLIKELQLVYNKARQASITQELTEIVSGAAAV
>Q8RGE1 ~~~atpG~~~ATP synthase gamma chain~~~COG0224
MPGMKEIKSRIKSVQSTRQITNAMEIVSTTKFKRYSKLVTESRPYEESMRKILGNIASGVKNEGHPLFDGRKEVKSIAII
VITSDRGLCGSFNSSTLKELEKLVEKNKNKNITIIPFGRKAIDFITKRNYEFSESFSKISPDEMNKIAGEISEEVVEKYN
NHIYDEVYVIYNKFISALRYDLTCERIIPITRPEVELNSEYIFEPSTEYILSALLPRFINLQIYQAILNNTASEHSARKN
SMSSATDNADEMIKTLNIKYNRNRQSAITQEITEIVGGASAL
>Q5KUJ2 ~~~atpG~~~ATP synthase gamma chain~~~COG0224
MASLRDIKTRINATKKTSQITKAMEMVSTSKLNRAEQNAKSFVPYMEKIQEVVANVALGAGGASHPMLVSRPVKKTGYLV
ITSDRGLAGAYNSNVLRLVYQTIQKRHASPDEYAIIVIGRVGLSFFRKRNMPVILDITRLPDQPSFADIKEIARKTVGLF
ADGTFDELYMYYNHYVSAIQQEVTERKLLPLTDLAENKQRTVYEFEPSQEEILDVLLPQYAESLIYGALLDAKASEHAAR
MTAMKNATDNANELIRTLTLSYNRARQAAITQEITEIVAGANALQ
>Q9RGY2 ~~~atpG~~~ATP synthase gamma chain~~~COG0224
MPASLLELKRKIASVKQTGKITEAMRMVSASKLNQTENRDKDYTVYNDHVRKTISHLISSQVVDSLRERDISIDKNNISK
IDYTDVFGLGITADMIQPRKNIKTTGFLVVTGDRGLVGSYNSSVIKNMMSIFDDERAQGREVKVLAVGSVGAQFFKKNNV
NVVYEKDGVSDVPTFDEVLPIVSTAIKMFLNGVYDQLYVCYTHHVNSLSSAFRVEKMLPIVDLDIGVKEAEAHKELEYDI
EPDVNSVLMKLLPQYARSTIYGAILDAKTAEHASSMTAMQSATDNANDLVSNLTTKLNRARQAQITTEITEIISGANALE
>O05432 ~~~atpG~~~ATP synthase gamma chain~~~COG0224
MAHMRDLKRRIRSVQSTQHITRAMKMVAAAKLRKAQAQVTAARPYAAKLEEVVGRLMAAVDPETQPLAATREVKKAGYVL
ITADRGLAGGYNANLIRLTEERLREEGRPAALVAVGRKGRDFFRRRPVEIVKSFTDIGDNPELIQARELARQLVTMYLEG
TLDEVNLIYTRFYSAIRQVPMVERLLPIATPREKKDTGDYIYEPSPEAVLRVLLPRYCEIKVYRALLEAKASEHGARMTA
MDNATKNAAEMIDKFTLSFNRARQAAITNEIVEIVAGADALK
>A0R201 ~~~atpG~~~ATP synthase gamma chain~~~COG0224
MAATLRELRGRIRSAGSIKKITKAQELIATSRIAKAQARVEAARPYAAEITNMLTELAGASALDHPLLVERKQPKRAGVL
VVSSDRGLCGAYNANVLRRAEELFSLLRDEGKDPVLYVVGRKALGYFSFRQRTVVESWTGFSERPTYENAREIADTLVNA
FMAGADDEGDDAGADGILGVDELHIVFTEFRSMLSQTAVARRAAPMEVEYVGEVETGPRTLYSFEPDPETLFDALLPRYI
ATRVYAALLEAAASESASRRRAMKSATDNADDLIKALTLAANRERQAQITQEISEIVGGANALAGSK
>P9WPU9 ~~~atpG~~~ATP synthase gamma chain~~~COG0224
MAATLRELRGRIRSAGSIKKITKAQELIATSRIARAQARLESARPYAFEITRMLTTLAAEAALDHPLLVERPEPKRAGVL
VVSSDRGLCGAYNANIFRRSEELFSLLREAGKQPVLYVVGRKAQNYYSFRNWNITESWMGFSEQPTYENAAEIASTLVDA
FLLGTDNGEDQRSDSGEGVDELHIVYTEFKSMLSQSAEAHRIAPMVVEYVEEDIGPRTLYSFEPDATMLFESLLPRYLTT
RVYAALLESAASELASRQRAMKSATDNADDLIKALTLMANRERQAQITQEISEIVGGANALAEAR
>A1B8N9 ~~~atpG~~~ATP synthase gamma chain~~~COG0224
MPSLKDLKNRIGSVKNTRKITKAMQMVAAAKLRRAQEAAEAARPYADRMAAVMAGLTAAAAGSDMAPRLLAGTGEDRRHL
LVVMTSERGLAGGFNSSIVKLARLRLQELQAQGKQVSILTVGKKGREQLKREYGDLFVNHVDLSEVKRIGYDNARAIADE
ILDRFDNGEFDVATLFYNRFESVISQVPTARQVIPAVIEEGEAGASSLYDYEPDENAILNDLLPRSVATQVFAALLENAA
SEQGARMTAMDNATRNAGDMIDRLTTVYNRSRQAAITKELIEIISGAEAL
>P29710 ~~~atpG~~~ATP synthase gamma chain, sodium ion specific~~~
MAAGKEIKSRISSVQSTRQITKAMEIVSSTKFKKFQALVNQSKPYSGSMDKVLANLAAGIKNERHPLFDGKTEVKRIGII
VMTSDRGLCGGFNSSTLKEMEKLIVANPDKEVSVIAIGKKGRDYCKKKDRDLKAEYIQLIPETMFDKAKEISENIVEYFY
EDIFDEVYLIYNEFISALSTELIVKKLLPIERIEVQDNTTYIFEPSVEDILSSLLPKYLNIQLYQAILENTASEHSARKN
AMKNATDNAEDMIKDLTLQYNRERQAAITQEISEIVSGASAL
>P72246 ~~~atpG~~~ATP synthase gamma chain~~~
MPSLKDLKNRIVSVKNTRKITKAMQMVAAANIRRAQESAEAARPYAERMNAVMSSLAGAVGSTDGAPRLLAGTGSDKVHL
LVIMTGERGLCGGFNANIAKLAKAKAMELLAQGKTVKILTVGKKGRDALRRDLGQYYIDHIDLSDVKKLSYPVAQKISQN
IIDRFEAGEYDVATIFFSVFQSVISQVPTAKQVIPAQFETDAASASAVYDYEPGDQEILTALLPRAVATAIFAALLENNA
SFNGAQMSAMDNATRNAGDMIDRLTIEYNRSRQAAITKELIEIISGAEAL
>Q8ZKW8 ~~~atpG~~~ATP synthase gamma chain~~~
MAGAKEIRSKIASVQNTQKITKAMEMVAASKMRKSQDRMAASRPYAETMRKVIGHLANGNLEYKHPYLEERDVKRVGYLV
VSTDRGLCGGLNINLFKKLLADMKAWSDKGVQCELAMIGSKGVSFFNSVGGNVVAQVTGMGDNPSLSELIGPVKVMLQAY
DEGRLDKLYIVSNKFINTMSQVPTITQLLPLPASEDDDLKRKAWDYLYEPDPKALLDTLLRRYVESQVYQGVVENLASEQ
AARMVAMKAATDNGGSLIKELQLVYNKARQASITQELTEIVSGAAAV
>Q7A4E8 ~~~atpG~~~ATP synthase gamma chain~~~
MASLKEIDTRIKSTKKMKQITKAMNMVSSSKLRRAEKNTKQFTPYMDKMQDAITAVAGASSNTNHPMLRPRKITRSGYLV
ITSDKGLAGAYSANVLKKLITDIEAKHQDSSEYSIVVLGQQGVDFLKNRGYDIEYSQVDVPDQPSFKSVQALANHAIDLY
SEEEIDELNIYYSHYVSVLENKPTSRQVLPLSQEDSSKGHGHLSSYEFEPDKESILSVILPQYVESLIYGTILDAKASEH
ATRMTAMKNATDNATELIDDLSLEYNRARQAEITQQITEIVGGSAALE
>P50007 ~~~atpG~~~ATP synthase gamma chain~~~
MGAQLRVYKRRIRSVTATKKITKAMEMIAASRVVKAQRKVAASTPYARELTLPRLGTGSNTKHPLTTEADSPSRAAVLLL
TSDRGLAGAFNSNSIKAAEQLTERLEREGRQVDTYIVGRRGLAHYNFRERKVVESFAGFTDEPTYADAKKVAAPLIEAIE
KDTAEGGVDELHIVYTEFVSMMTQTAVDSRLLPLSLDEVAEESGAKDEILPLYDFEPSAEDVLDALLPRYVESRIYNALL
QSAASKHAATRRAMKSATDNAGELINTLSRLANAARQAEITQEISEIVGGASALADANAGSDN
>Q7CRB2 ~~~atpG~~~ATP synthase gamma chain~~~COG0224
MAVSLNDIKTKIASTKNTSQITNAMQMVSAAKLGRSEEAARNFQVYAQKVRKLLTDILHGNGAGASTNPMLISRSVKKTG
YIVITSDRGLVGGYNSSILKAVMELKEEYHPDGKGFEMICIGGMGADFFKARGIQPLYELRGLSDQPSFDQVRKIISKTV
EMYQNELFDELYVCYNHHVNTLTSQMRVEQMLPIVDLDPNEADEEYSLTFELETSREEILEQLLPQFAESMIYGAIIDAK
TAENAAGMTAMQTATDNAKKVINDLTIQYNRARQAAITQEITEIVAGASALE
>Q8DLU1 ~~~atpG~~~ATP synthase gamma chain~~~COG0224
MANLKAIRDRIKTIKDTRKITEAMRLVAAAKVRRAQEQVMASRPFADRLAQVLYSLQTRLRFEDVDLPLLAKRPVKTVAL
LVVTGDRGLCGGYNTNVIRRAKERLQELEAEGLKYTLVIVGRKAAQYFQRRDYPIDAVYSGLEQIPSASEAGQIASELLS
LFLSETVDRVELIYTKFVSLISSKPVVQTLLPLDPQGLETADDEIFRLTTRGSHLEVNREKVTSTLPALPSDMIFEQDPL
QILDALLPLYLNNQLLRALQEAAASELAARMTAMNNASDNAQALIGTLTLSYNKARQAAITQEILEVVAGAEALR
>A3M139 ~~~atpE~~~ATP synthase subunit c~~~
MELTLGLVAIASAILIAFGALGTAIGFGLLGGRFLEAVARQPELAPQLQTRMFLIAGLLDAVPMIGVGIGLFFIFANPFV
G
>P22483 ~~~atpE~~~ATP synthase subunit c~~~COG0636
MAFLGAAIAAGLAAVAGAIAVAIIVKATIEGTTRQPELRGTLQTLMFIGVPLAEAVPIIAIVISLLILF
>P00845 ~~~atpE~~~ATP synthase subunit c~~~
MSLGVLAAAIAVGLGALGAGIGNGLIVSRTIEGIARQPELRPVLQTTMFIGVALVEALPIIGVVFSFIYLGR
>P37815 ~~~atpE~~~ATP synthase subunit c~~~COG0636
MNLIAAAIAIGLGALGAGIGNGLIVSRTVEGIARQPEAGKELRTLMFMGIALVEALPIIAVVIAFLAFFG
>Q0ZS24 ~~~atpE~~~ATP synthase subunit c, sodium ion specific~~~
MERALILAASAIGAGLAMIAGIGPGIGQGFAAGKGAEAVGKQPEAQGDILRTMLLGAAVAESTGIYALVVALILLFANPL
LNLL
>P68699 ~~~atpE~~~ATP synthase subunit c~~~
MENLNMDLLYMAAAVMMGLAAIGAAIGIGILGGKFLEGAARQPDLIPLLRTQFFIVMGLVDAIPMIAVGLGLYVMFAVA
>Q8RGD7 ~~~atpE~~~ATP synthase subunit c~~~COG0636
MDLLTAKTIVLGCSAVGAGLAMIAGLGPGIGEGYAAGKAVESVARQPEARGSIISTMILGQAVAESTGIYSLVIALILLY
ANPFLSKLG
>Q8KRV3 ~~~atpE~~~ATP synthase subunit c, sodium ion specific~~~
MDMLFAKTVVLAASAVGAGTAMIAGIGPGVGQGYAAGKAVESVARQPEAKGDIISTMVLGQAVAESTGIYSLVIALILLY
ANPFVGLLG
>Q2RFX4 ~~~atpE~~~ATP synthase subunit c~~~COG0636
MATIGFIGVGLAIGLAALGSGLGQGIASRGALEGMARQPEASGDIRTTLLLALAFMEALTLFSFVIAILMWTKL
>P21905 ~~~atpE~~~ATP synthase subunit c, sodium ion specific~~~
MDMVLAKTVVLAASAVGAGAAMIAGIGPGVGQGYAAGKAVESVARQPEAKGDIISTMVLGQAIAESTGIYSLVIALILLY
ANPFVGLLG
>O05331 ~~~atpE~~~ATP synthase subunit c~~~
MEGDIVQMGAYIGAGLACTGMGGAAVGVGHVVGNFISGALRNPSAAASQTATMFIGIAFAEALGIFSFLVALLLMFAV
>P0A307 ~~~atpE~~~ATP synthase subunit c~~~
MNLTFLGLCIACMGVSVGEGLLMNGLFKSVARQPDMLSEFRSLMFLGVAFIEGTFFVTLVFSFIIK
>Q8DLP7 ~~~atpE~~~ATP synthase subunit c~~~COG0636
MNPLIASASVLAAALAIGLASLGPGLAQGNASGQALEGIARQPEAEGKIRGTLLLSLAFMESLTIYGLVIALVLLFANPF
AS
>P0ABC0 ~~~atpI~~~ATP synthase protein I~~~COG3312
MSVSLVSRNVARKLLLVQLLVVIASGLLFSLKDPFWGVSAISGGLAVFLPNVLFMIFAWRHQAHTPAKGRVAWTFAFGEA
FKVLAMLVLLVVALAVLKAVFLPLIVTWVLVLVVQILAPAVINNKG
>O05329 ~~~atpI~~~ATP synthase protein I~~~
MSEEVGGEPDPERLAALEKRLSQLKKTEEAPKRAADGDLRMADMAWRMVIELVSGLGIGFGIGYGLDAVFGTQPFLMLIF
VFLGLAAGVKVMLRSAADLTKAQARAAAAGKEGK
>A9CES3 1.1.1.407~~~~~~D-altritol 5-dehydrogenase~~~COG1063
MHAIQFVEKGRAVLAELPVADLPPGHALVRVKASGLCHTDIDVLHARYGDGAFPVIPGHEYAGEVAAVASDVTVFKAGDR
VVVDPNLPCGTCASCRKGLTNLCSTLKAYGVSHNGGFAEFSVVRADHLHGIGSMPYHVAALAEPLACVVNGMQSAGIGES
GVVPENALVFGAGPIGLLLALSLKSRGIATVTMADINESRLAFAQDLGLQTAVSGSEALSRQRKEFDFVADATGIAPVAE
AMIPLVADGGTALFFGVCAPDARISVAPFEIFRRQLKLVGSHSLNRNIPQALAILETDGEVMARLVSHRLPLSEMLPFFT
KKPSDPATMKVQFAAE
>Q5HCZ5 2.3.1.-~~~~~~Putative acetyltransferase SACOL2570~~~
MTEKEKMLAEKWYDANFDQYLINERARAKDICFELNHTRPSATNKRKELIDQLFQTTTDNVSISIPFDTDYGWNVKLGKN
VYVNTNCYFMDGGQITIGDNVFIGPNCGFYTATHPLNFHHRNEGFEKAGPIHIGSNTWFGGHVAVLPGVTIGEGSVIGAG
SVVTKDIPPHSLAVGNPCKVVRKIDNDLPSETLNDETIK
>Q7A3E8 2.3.1.-~~~~~~Putative acetyltransferase SA2342~~~
MTEKEKMLAEKWYDANFDQDLINERARAKDICFELNHTKPSDTNKRKELIDQLFQTTTDNVSISIPFDTDYGWNVKLGKN
VYVNTNCYFMDGGQITIGDNVFIGPNCGFYTATHPLNFYHRNEGYEKAGPIHIGSNTWFGGHVAVLPGVTIGEGSVIGAG
SVVTKDIPPHSLAVGNPCKVVRKIDNDLPSETLNDETIK
>Q1LJ80 2.5.1.-~~~cobO~~~Cobalamin adenosyltransferase~~~COG2096
MGNRLSKIATRTGDAGTTGLGDGSRVGKNSLRIVAIGDVDELNSHIGLLLTEPDLPEDVRAALLHIQHDLFDLGGELSIP
GYTLLKAPQVAQLDDWLAHYNAALPRLAEFILPGGSRPAAQAHICRTVCRRAERALVELGAAEALNEAPRQYLNRLSDLL
FVLARVLNRAGGGSDVLWQRERES
>P20713 3.1.6.1~~~atsA~~~Arylsulfatase~~~
MNKKAMAAAVSMILAGGAHAAQQERPNVIVIIADDMGYSDISPFGGEIPTPNLQAMAEQGMRMSQYYTSPMSAPARSMLL
TGNSNQQAGMGGMWWYDSTIGKEGYELRLTDRVTTMAERFKDAGYNTLMAGKWHLGFVPGATPKDRGFNHAFAFMGGGTS
HFNDAIPLGTVEAFHTYYTRDGERVSLPDDFYSSEAYARQMNSWIKATPKEQPVFAWLAFTAPHDPLQAPDEWIKRFKGQ
YEQGYAEVYRQRIARLKALGIIHDDTPLPHLELDKEWEALTPEQQKYTAKVMQVYAAMIANMDAQIGTLMETLKQTGRDK
NTLLVFLTDNGANPAQGFYYESTPEFWKQFDNSYDNVGRKGSFVSYGPHWANVSNAPYANYHKTTSAQGGINTDFMISGP
GITRHGKIDASTMAVYDVAPTLYEFAGIDPNKSLAKKPVLPMIGVSLSAISPAKYRSRRAELRG
>Q9X759 3.1.6.1~~~atsA~~~Arylsulfatase~~~
MNKKAMAAAVSMILAGGAHAAQQERPNVIVIIADDMGYSDISPFGGEIPTPNLQAMAEQGMRMSQYYTSPMSAPARSMLL
TGNSNQQAGMGGMWWYDSTIGKEGYELRLTDRVTTMAERFKDAGYNTLMAGKWHLGFVPGATPKERGFNHAFAFMGGGTS
HFNDAIPLGTVEAFHTYYTRDGERVSLPDDFYSSEAYARQMNSWIKATPKEQPVFAWLAFTAPHDPLQAPDEWIKRFKGQ
YEQGYAEVYRQRIARLKALGIIHDDTPLPHLELDKEWEALTPEQQKYTAKVMQVYAAMIANMDAQIGTLMETLKQTGRDK
NTLLVFLTDNGANPAQGFYYESTPEFWKQFDNSYDNVGRKGSFVSYGPHWANVSNAPYANYHKTTSAQGGINTDFMISGP
GITRHGKIDASTMAVYDVAPTLYEFAGIDPNKSLAKKPVLPMIGVSFKRYLTGEVQEPPRGNYGVELHHQAAWVDGEWKL
RRLVPRGLTAGDAPWQLFNLHDDPLETHDVAAEHPDRVKAMSEAYEAFAKRTMVTKAQGKMIDYVGIDSKTGRYLAVDPA
TMKPVPAPQAIPVSEIH
>Q9I0Q8 2.3.1.-~~~~~~Acetyltransferase PA2578~~~
MPTPGTGSVPELQLVPFQLGHFPILQRWFATEKELVQWAGPALRHPLSLEQMHEDLAESRRRPPLRLLWSACRDDQVIGH
CQLLFDRRNGVVRLARIVLAPSARGQGLGLPMLEALLAEAFADADIERVELNVYDWNAAARHLYRRAGFREEGLRRSATR
VGRERWNVVLMGLLRQEWAAGGAGND
>Q9HX72 2.3.1.-~~~~~~Acetyltransferase PA3944~~~
MNANLPPSAISELHGPRLLLRAWRDSDREAFAEMCADPQVMEFFPSVLDRAQSDALVDRVQAHFAERGYGPWALELPGEA
AFIGFTGLFDVTMDVHFAPTVEIGWRLAPAYWGRGLAREAAETALDFAFERLRLPEVVAFTTPPNRRSWGLMERLGMRRD
PAEDFDHPLLAADHPMRRHILYRVDAARWAER
>Q7CXI0 2.3.1.-~~~~~~Acetyltransferase Atu2258~~~COG0454
MNFVLSDVADAEAEKAIRDPLVAYNLARFGESDKRDLNITIRNDDNSVTGGLVGHTARGWLYVQLLFVPEAMRGQGIAPK
LLAMAEEEARKRGCMGAYIDTMNPDALRTYERYGFTKIGSLGPLSSGQSITWLEKRF
>Q5HH30 2.3.1.-~~~~~~Acetyltransferase SACOL1063~~~
MFSKVNNQKMLEDCFYIRKKVFVEEQGVPEESEIDEYESESIHLIGYDNGQPVATARIRPINETTVKIERVAVMKSHRGQ
GMGRMLMQAVESLAKDEGFYVATMNAQCHAIPFYESLNFKMRGNIFLEEGIEHIEMTKKLTSLN
>Q6FBW1 1.14.11.-~~~atsK~~~Alkylsulfatase~~~COG2175
MTTFIQNPTQQLQLRKLTGRIGAEISGIHLSSELDSSTVQFIHDALLEHKVLFFRGQQHLGDTEQEKFAELFGSPVKHPT
VPAADGTDFIFELDSQKGARANSWHTDVTFVDAYPKISILRGLIIPETGGDTTWANTETAYEDLPELLKQFAEQLVAVHS
NEYDYGGPKQNVEPEQLERLKKVFVSTKYETEHPVVIVHPETGKKSLLLGHFFKRLVGFSQSDSQLLFNILQEKVTRPEN
TVRWQWQEGDVVIWDNRSTQHYAVNDYGDQHRVVRRITLAGEVTTGAHGLKGKTTLPKDLSAEQLEKAKLHAVLNAN
>P9WKZ1 1.14.11.77~~~~~~Alpha-ketoglutarate-dependent sulfate ester dioxygenase~~~COG2175
MTDLITVKKLGSRIGAQIDGVRLGGDLDPAAVNEIRAALLAHKVVFFRGQHQLDDAEQLAFAGLLGTPIGHPAAIALADD
APIITPINSEFGKANRWHTDVTFAANYPAASVLRAVSLPSYGGSTLWANTAAAYAELPEPLKCLTENLWALHTNRYDYVT
TKPLTAAQRAFRQVFEKPDFRTEHPVVRVHPETGERTLLAGDFVRSFVGLDSHESRVLFEVLQRRITMPENTIRWNWAPG
DVAIWDNRATQHRAIDDYDDQHRLMHRVTLMGDVPVDVYGQASRVISGAPMEIAG
>Q9WWU5 1.14.11.77~~~atsK~~~Alpha-ketoglutarate-dependent sulfate ester dioxygenase~~~
MSNAALATAPHALELDVHPVAGRIGAEIRGVKLSPDLDAATVEAIQAALVRHKVIFFRGQTHLDDQSQEGFAKLLGEPVA
HPTVPVVDGTRYLLQLDGAQGQRANSWHTDVTFVEAYPKASILRSVVAPASGGDTVWANTAAAYQELPEPLRELADKLWA
VHSNEYDYASLKPDIDPAKLERHRKVFTSTVYETEHPVVRVHPISGERALQLGHFVKRIKGYSLADSQHLFAVLQGHVTR
LENTVRWRWEAGDVAIWDNRATQHYAVDDYGTQPRIVRRVTLAGEVPVGVDGQLSRTTRKG
>Q44636 ~~~atxA~~~Anthrax toxin expression trans-acting positive regulator~~~
MLTPISIEKEHIRLINLLHFINEQNRWFTIKELSDYLQVADKTVRKYLKLLEDEIPPSWNLLVQKGKGIYLKKPLNESLS
FVESKILRKSLNLQICEELVFKKNSMQSLAQKLHLQVGALYPIINQINYDIQSSHLNIKKKPLEISGREQDVRVFMLRLY
CNIPNDYWPFPYINKQNITDLINKMEKILNVQMYTYSKHKLCVLFAITISRLLSGNTIDNVSGLILVNKNDDHYKTVASI
TSELQNSFGVTLHETEISFLALALLLSLGNSITTDSNKTLTSYKKTIMPLAKEITKGIEHKLQLGINYDESFLTYVVLII
KKALDKNFIQYYNYNIKFIRHIKQRHPNTFNTIQECISNLNYTVYSHFDCYEISLLTMHFETQRMLFKNNPKKIYVYTSQ
GCIHREYISALLEKRYNGLIKIVRNTIINLTNESLQDMEIDIIISNVNLPIKNIPIVQISEFPTERDFHEIKKII
>P01551 ~~~axnA~~~Actinoxanthin~~~
MSLRHMSRRASRFGVVAVASIGLAAAAQSVAFAAPAFSVSPASGLSDGQSVSVSVSGAAAGETYYIAQCAPVGGQDACNP
ATATSFTTDASGAASFSFVVRKSYAGSTPEGTPVGSVDCATDACNLGAGNSGLDLGHVALTFG
>E8RUP5 3.4.-.-~~~atxE2~~~Lasso peptide isopeptidase AtxE2~~~COG0823
MRSSKIRCPGAIRVGTLVTAFGCLPHVAFAAAREAPPVTPEVLVRLADIGTMSASETTPLLSLSPDGRYVAFQVRQADPV
TNLNVFRMVVKATDGATDAIDVDVGGEYLFWTIPSWGYARNAPSGANLTIQPRWSPSGTHLAYLRQDQGRVRVWRASVKG
EGASPVIEDAYDIEDVQWLDDNTLIYSGRPGFVEAEAEIEREGRRGWVYDERFHPLTGARPRVLEPISIVYQVLDLKTGT
RRAATPTEVARLREKPDPLRAMVGRTTFSVSRTDPQNINAPTTLVARRGEGEPVRCDEEACQNITRMWGDETANVLYFLR
REGWASNEMALYRMPADALKPVRIWHATGLLQGCERQAKRLICAQESALQPRRLVTLNLTSGQMSPLYDPNPDLSRYRLP
KVERLTLRNRNGIEVFSDLVLPPDYQLGTRLPLVIVQYSSRGFLRGGTGDENPILPLATAGFAVLSFHSPRSEASYQRFT
SPIAQSKAEYSNWRNRWNILHTLEDLIDDLDRRGVIDPARVGLTGLSDGATTVHFGLINSHRFAAAVTSSCCTDSFTASV
MNGPRISGALKAYGIETDQADDGPFWAATSFVVNASRLDTPLLIQSADEEYLGALPGFTALQQARKPVELIIYPNEHHVK
WQPAHRLAVYNRTIDWFRFWLMDQSDPAPDKAAQYDRWRALRALRQKSPSPTPAP
>P72156 3.8.1.8~~~atzA~~~Atrazine chlorohydrolase~~~
MQTLSIQHGTLVTMDQYRRVLGDSWVHVQDGRIVALGVHAESVPPPADRVIDARGKVVLPGFINAHTHVNQILLRGGPSH
GRQFYDWLFNVVYPGQKAMRPEDVAVAVRLYCAEAVRSGITTINENADSAIYPGNIEAAMAVYGEVGVRVVYARMFFDRM
DGRIQGYVDALKARSPQVELCSIMEETAVAKDRITALSDQYHGTAGGRISVWPAPATTTAVTVEGMRWAQAFARDRAVMW
TLHMAESDHDERIHGMSPAEYMECYGLLDERLQVAHCVYFDRKDVRLLHRHNVKVASQVVSNAYLGSGVAPVPEMVERGM
AVGIGTDNGNSNDSVNMIGDMKFMAHIHRAVHRDADVLTPEKILEMATIDGARSLGMDHEIGSIETGKRADLILLDLRHP
QTTPHHHLAATIVFQAYGNEVDTVLIDGNVVMENRRLSFLPPERELAFLEEAQSRATAILQRANMVANPAWRSL
>P95442 3.5.4.43~~~atzB~~~Hydroxydechloroatrazine ethylaminohydrolase~~~
MTTTLYTGFHQLVTGDVAGTVLNGVDILVRDGEIIGLGPDLPRTLAPIGVGQEQGVEVVNCRGLTAYPGLINTHHHFFQA
FVRNLAPLDWTQLDVLAWLRKIYPVFALVDEDCIYHSTVVSMAELIKHGCTTAFDHQYNYSRRGGPFLVDRQFDAANLLG
LRFHAGRGCITLPMAEGSTIPDAMRESTDTFLADCERLVSRFHDPRPFAMQRVVVAPSSPVIAYPETFVESARLARHLGV
SLHTHLGEGETPAMVARFGERSLDWCENRGFVGPDVWLAHGWEFTAADIARLAATGTGVAHCPAPVFLVGAEVTDIPAMA
AAGVRVGFGVDGHASNDSSNLAECIRLAYLLQCLKASERQHPVPAPYDFLRMATQGGADCLNRPDLGALAVGRAADFFAV
DLNRIEYIGANHDPRSLPAKVGFSGPVDMTVINGKVVWRNGEFPGLDEMELARAADGVFRRVIYGDPLVAALRRGTGVTP
C
>O52063 3.5.4.42~~~atzC~~~N-isopropylammelide isopropyl amidohydrolase~~~
MSKDFDLIIRNAYLSEKDSVYDIGIVGDRIIKIEAKIEGTVKDEIDAKGNLVSPGFVDAHTHMDKSFTSTGERLPKFWSR
PYTRDAAIEDGLKYYKNATHEEIKRHVIEHAHMQVLHGTLYTRTHVDVDSVAKTKAVEAVLEAKEELKDLIDIQVVAFAQ
SGFFVDLESESLIRKSLDMGCDLVGGVDPATRENNVEGSLDLCFKLAKEYDVDIDYHIHDIGTVGVYSINRLAQKTIENG
YKGRVTTSHAWCFADAPSEWLDEAIPLYKDSGMKFVTCFSSTPPTMPVIKLLEAGINLGCASDNIRDFWVPFGNGDMVQG
ALIETQRLELKTNRDLGLIWKMITSEGARVLGIEKNYGIEVGKKADLVVLNSLSPQWAIIDQAKRLCVIKNGRIIVKDEV
IVA
>Q936X3 3.5.1.131~~~atzE~~~1-carboxybiuret hydrolase subunit AtzE~~~
MKTVEIIEGIASGRTSARDVCEEALATIGATDGLINAFTCRTVERARAEADAIDVRRARGEVLPPLAGLPYAVKNLFDIE
GVTTLAGSKINRTLPPARADAVLVQRLKAAGAVLLGGLNMDEFAYGFTTENTHYGPTRNPHDTGRIAGGSSGGSGAAIAA
GQVPLSLGSDTNGSIRVPASLCGVWGLKPTFGRLSRRGTYPFVHSIDHLGPLADSVEGLALAYDAMQGPDPLDPGCSASR
IQPSVPVLSQGIAGLRIGVLGGWFRDNAGPAARAAVDVAALTLGASEVVMWPDAEIGRAAAFVITASEGGCLHLDDLRIR
PQDFEPLSVDRFISGVLQPVAWYLRAQRFRRVYRDKVNALFRDWDILIAPATPISAPAIGTEWIEVNGTRHPCRPAMGLL
TQPVSFAGCPVVAAPTWPGENDGMPIGVQLIAAPWNESLCLRAGKVLQDTGIARLKC
>Q936X2 3.5.1.54~~~atzF~~~Allophanate hydrolase~~~
MNDRAPHPERSGRVTPDHLTDLASYQAAYAAGTDAADVISDLYARIKEDGENPIWISLLPLESALAMLADAQQRKDKGEA
LPLFGIPFGVKDNIDVAGLPTTAGCTGFARTPRQHAFVVQRLVDAGAIPIGKTNLDQFATGLNGTRTPFGIPRCVFNENY
VSGGSSSGSAVAVANGTVPFSLGTDTAGSGRIPAAFNNLVGLKPTKGLFSGSGLVPAARSLDCISVLAHTVDDALAVARV
AAGYDADDAFSRKAGAAALTEKSWPRRFNFGVPAAEHRQFFGDAEAEALFNKAVRKLEEMGGTCISFDYTPFRQAAELLY
AGPWVAERLAAIESLADEHPEVLHPVVRDIILSAKRMSAVDTFNGIYRLADLVRAAESTWEKIDVMLLPTAPTIYTVEDM
LADPVRLNSNLGFYTNFVNLMDLSAIAVPAGFRTNGLPFGVTFIGRAFEDGAIASLGKAFVEHDLAKGNAATAAPPKDTV
AIAVVGAHLSDQPLNHQLTESGGKLRATTRTAPGYALYALRDATPAKPGMLRDQNAVGSIEVEIWDLPVAGFGAFVSEIP
APLGIGTITLEDGSHVKGFLCEPHAIETALDITHYGGWRAYLAAQ
>A0A384E126 ~~~atzG~~~1-carboxybiuret hydrolase subunit AtzG~~~
MTETEIFAYIEAASIAIGIPLEPARARAVAHHFSRTALLAEMLESVPLSPESELAEIYRPAPFPAEDI
>Q59998 7.2.2.12~~~ziaA~~~Zinc-transporting ATPase~~~COG2217
MTQSSPLKTQQMQVGGMDCTSCKLKIEGSLERLKGVAEASVTVATGRLTVTYDPKQVSEITIQERIAALGYTLAEPKSSV
TLNGHKHPHSHREEGHSHSHGAGEFNLKQELLPVLTAIALFTIAILFEQPLHNTPGQIAEFAVIIPAYLLSGWTVLKTAG
RNILRGQIFDENFLMTIATLGALAIHQLPEAVAVMLFFRVGELFQEYSVGRSRRSIKALLEARPDTANLKRNGTVQQVSP
ETVQVDDLILVKPGEKVPLDGEILGGTSQVDTSALTGESVPGTVKPGDTILAGMINQSGVLTIRVTKLFSESSIAKVLDL
VENASSKKASTEKFITQFARYYTPVIVFLSLAVALLPPLFIPGADRADWVYRALVLLVISCPCGLVISIPLGYFGGIGGA
AKHGILIKGSTFLDSLTAVKTVVFDKTGTLTKGTFKVTQVVTKNGFSESELLTLAAKAESHSTHPIALSIREAYAQSIAD
SEVADYEEIAGHGIRAVVQNQVVIAGNDRLLHREKIDHDTCDVAGTVVHLAVDGRYGGYILIADEIKEDAVQAIRDLKRM
GVEKTVMLTGDSEIVAQSVAQQIGLDAFVAELLPEEKVDEIEQLLDPSGKAKLAFVGDGINDAPVIARADVGIAMGGLGS
DAAIETADVVLMTDAPSKVAEAIHVARKTRQIVVQNIVLALGIKALFIALGTIGLATLWEAVFADVGVALLAILNATRIA
K
>Q2CEE2 2.3.-.-~~~~~~Putative acetyltransferase OgpAT~~~COG0456
MASEVVIRRATAADHGDLCRVCLLTGDSGRDASSREDDPTLLGMIYAVPYQVGAPDFAFVLEDAEGVCGYLLGAPDTLSF
QHFLEKEWLPPLRAGLTDPGPDPAAWQGSDWARDAIHRPPALPPIDLAAYPAHGHIDLLPRAQGRGVGSRAMDHLEAALA
AAGAPGMHLQVSPENPRALGFYEHRGFRELCRSEDEVVVGRRLLDE
>H1ZZA4 1.14.13.222~~~auaG~~~Aurachin C monooxygenase/isomerase~~~
MKTGLTVLIAGGGIGGLTLGVALRRAGIAFKIFERAPALLRVGAGISMQSNAMLAFRTLGVDTAVAAAGQEIQGGAILNP
RGEEISSMPVSKASAEVGAPMITIHRGRLQDVLHQIVGDDNLVLGAKVEGFRDGPDGLFVRLADGREFQGDLLVGADGLR
SAVRAQLLKEPSPRYSGYTSWRGVCDVSEGVRRDYTSESWGPGMRFGVVPIGEGQTYWFATATAPEGGVDHPDARTELLQ
RFSGWHAPIPQLIENTPSSAIMRTDIHDRVPIRQWVQGRAVLLGDAAHPMTPNMGQGGCQAVEDAVVLARCLSLEAELPA
ALARYQAVRVERANDFVAGSYRIGQIGQWENAFACWVREKLMRMMSSDRVDARTRRNLQFTPL
>H1ZZB0 1.1.1.394~~~auaH~~~Aurachin B dehydrogenase~~~
MRTFVTGGSGYLGRNLLSALVARGISVRALVRSEEAAQKVQALGAQPILGTLEHRETLKEGMAGCDVLFHAAALTSARAT
DAEFHRANVLGTETVLAAARDARIQRMVHVSTEAVLADGRPLLQVDESHPLPKRPFAGYPATKAQAEQLVLQANGPGFTT
VVVRPRFIWGADDTAFLPQLIDAIRTKRFRWVDGGRYLTSTCHVANVCEGMLLAAERGPGGEVYFLTDGAPVELRSFLTL
LLETQGIKAEVGNIPFQAARAAAHLGESLWRALVPQARAPALRLAVYLLGREVTLNDDKARRELGYAGRVTHQQGLDALR
QAGPAGQGAMPHRA
>H8WEC1 ~~~aupA~~~Alkane uptake protein A~~~
MSERSVYMVLSPRFSVRAVSLAVAAVSASLSMPTSASMGNLGTSYGVMPVDVATAQSLSMFNEQVSATYYNPAALTKDPR
GELTAGILHSEQELRSDNPNASGDIVSDSPSQHVLIGMKTNLGSLTRFGHPIYLGFIAGVEKYGKEMLAFSSETSESGQF
LQYGKEPLFLNIGGATPIWRGISAGASVRVTLEATANLDAVSTLGGETSRERLAVNAEPSLKTILGTNIDLGSTFCPESD
CFLNGWETALTYRTKSSASTTVDSNIIVTQTIPDPGLSLAVTTIDSFQPETIAIGTQYSGDGWRIGGSIEQQNWSELEDE
FSGDSIKDQGSVASGNRIGFDDILIPRLGAEYQLNKNFAVRGGVAYEESPLKTTRNPELNYLDTDKLVVGLGISATYDRT
RLLAYPVRLDLGYQYQQLQERDFTVVDYDGDETSVTADGDIHVFSGSITLKF
>H8WEC0 ~~~aupB~~~Alkane uptake protein B~~~
MKYNKTLALIPAILLAACGGGDKQSIDEKPRPGSMVYSFPMDGQADVSPKADIVLRFSHAITDDEATLQNKIILQSSGQS
QPFTVTLIDSGKSLKLEPANPLATGEEFTVTFQEPLAAEGGRQITTPNATGDDGIQFATRGAYAGLAELANTADTFDIAW
QVPASDSPFQAMNFSTFRFAMTQPVHPEWQSLGGSIQLRDASGQEVPATVLVKGNRITVDPCVTPEPSQCGSKQDILNTG
ETYTLQLTNLASLTDSSEENRFTGEFTFTPRDTGPTVVLQQTAVDSGNGELTSVLNGQALNAVTLNSVLQGEAGPSQQTG
DLFAELAYAPAFDADEALPLRIPKGSVLKSTSLNVLVGGKVQVLDAATGEDQQTGTIKVTMLSDASGYMSPNQYTDDINA
PRHITLFMDVSMNTEAAQPNAALSQDLMGVELRGIALVRDGVLTIDAIGMVEPNLLGQEFTDSTIAFHLQAATDADSVLD
AENLRELDATPPSLVSWMPGPANAVPATRQSMQRPGDPIILFFDEPLDPASVENGVELIENGAPVTNLQARLDGTALVLN
PQGGLKHGVPYSVSVDGLTDLAGNRAMVSPLSFELDSIEENGSPGNKGFPLALTTYPGYPCETDYENTLDLENGVFGKCW
TADANNAGDVLPVSKMPADRPITVVFSKSLDLDSVIVGETFTVQEIAQEANGTVTVLNQQVAGRLEKNNQRIRFYPEQPW
QVGAHYRYILASSEQSGTCTPGQYTSICDEDGIPLKTDLLEGLNDGDGNNGPDNLEIVFTGSEARSTVFTPLRNLPIRDT
NSNLAIDCDDLGSESCLEPFNHQGSDSEGFLPSANAAKLLVTQDHAEDARVGCAVDGADCPRKKFIYQTYALNTEVIGPT
VDPDTGRDAVKVLLYPTQLATTSLDVHLSLLSTVSSTGPQVLRMRYAKDDPECTGASCARNSLIPGYITENDQGETIFKT
KAELFLDAPGLKATASILGIPTDLPLTHNLFGYPFTLELEGRVVFFDDGRMQIEQRNFNVPHIDVEAVALGIINTEIPLI
IPEQGVYLNFISNPVKEIPAQYE
>Q8RMH6 ~~~~~~Auracyanin-A~~~COG3241
MKITLRMMVLAVLTAMAMVLAACGGGGSSGGSTGGGSGSGPVTIEIGSKGEELAFDKTELTVSAGQTVTIRFKNNSAVQQ
HNWILVKGGEAEAANIANAGLSAGPAANYLPADKSNIIAESPLANGNETVEVTFTAPAAGTYLYICTVPGHYPLMQGKLV
VN
>P27197 ~~~~~~Auracyanin-B~~~COG3241
MSWRGSGRSNFRSRSSSNGGSTFSGGSAGGPPLIVMMGLAFGAGLIMLIVMIASNATAGGFVAATPRPTATPRPTAAPAP
TQPPAAQPTTAPATQAANAPGGSNVVNETPAQTVEVRAAPDALAFAQTSLSLPANTVVRLDFVNQNNLGVQHNWVLVNGG
DDVAAAVNTAAQNNADALFVPPPDTPNALAWTAMLNAGESGSVTFRTPAPGTYLYICTFPGHYLAGMKGTLTVTP
>Q8GPI4 ~~~aucA~~~Bacteriocin aureocin A53~~~
MSWLNFLKYIAKYGKKAVSAAWKYKGKVLEWLNVGPTLEWVWQKLKKIAGL
>P81177 3.4.24.29~~~aur~~~Zinc metalloproteinase aureolysin~~~
MRKFSRYAFTSMATVTLLSSLTPAALASDTNHKPATSDINFEITQKSDAVKALKELPKSENVKNHYQDYSVTDVKTDKKG
FTHYTLQPSVDGVHAPDKEVKVHADKSGKVVLINGDTDAKKVKPTNKVTLSKDEAADKAFNAVKIDKNKAKNLQDDVIKE
NKVEIDGDSNKYIYNIELITVTPEISHWKVKIDADTGAVVEKTNLVKEAAATGTGKGVLGDTKDININSIDGGFSLEDLT
HQGKLSAYNFNDQTGQATLITNEDENFVKDDQRAGVDANYYAKQTYDYYKNTFGRESYDNHGSPIVSLTHVNHYGGQDNR
NNAAWIGDKMIYGDGDGRTFTNLSGANDVVAHELTHGVTQETANLEYKDQSGALNESFSDVFGYFVDDEDFLMGEDVYTP
GKEGDALRSMSNPEQFGQPSHMKDYVYTEKDNGGVHTNSGIPNKAAYNVIQAIGKSKSEQIYYRALTEYLTSNSNFKDCK
DALYQAAKDLYDEQTAEQVYEAWNEVGVE
>Q70KH9 1.14.99.68~~~aurF~~~4-aminobenzoate N-oxygenase~~~
MREEQPHLATTWAARGWVEEEGIGSATLGRLVRAWPRRAAVVNKADILDEWADYDTLVPDYPLEIVPFAEHPLFLAAEPH
QRQRVLTGMWIGYNERVIATEQLIAEPAFDLVMHGVFPGSDDPLIRKSVQQAIVDESFHTYMHMLAIDRTRELRKISERP
PQPELVTYRRLRRVLADMPEQWERDIAVLVWGAVAETCINALLALLARDATIQPMHSLITTLHLRDETAHGSIVVEVVRE
LYARMNEQQRRALVRCLPIALEAFAEQDLSALLLELNAAGIRGAEEIVGDLRSTAGGTRLVRDFSGARKMVEQLGLDDAV
DFDFPERPDWSPHTPR
>Q9F5K5 2.1.1.209~~~aviRa~~~23S rRNA (guanine(2535)-N(1))-methyltransferase~~~
MSAYRHAVERIDSSDLACGVVLHSAPGYPAFPVRLATEIFQRALARLPGDGPVTLWDPCCGSGYLLTVLGLLHRRSLRQV
IASDVDPAPLELAAKNLALLSPAGLTARELERREQSERFGKPSYLEAAQAARRLRERLTAEGGALPCAIRTADVFDPRAL
SAVLAGSAPDVVLTDLPYGERTHWEGQVPAQPVAGLLRSLASALPAHAVIAVTDRSRKIPVAPVKALERLKIGTRSAVLV
RAADVLEAGP
>P14727 ~~~avrBs3~~~Avirulence protein AvrBs3~~~
MDPIRSRTPSPARELLPGPQPDGVQPTADRGVSPPAGGPLDGLPARRTMSRTRLPSPPAPSPAFSAGSFSDLLRQFDPSL
FNTSLFDSLPPFGAHHTEAATGEWDEVQSGLRAADAPPPTMRVAVTAARPPRAKPAPRRRAAQPSDASPAAQVDLRTLGY
SQQQQEKIKPKVRSTVAQHHEALVGHGFTHAHIVALSQHPAALGTVAVKYQDMIAALPEATHEAIVGVGKQWSGARALEA
LLTVAGELRGPPLQLDTGQLLKIAKRGGVTAVEAVHAWRNALTGAPLNLTPEQVVAIASHDGGKQALETVQRLLPVLCQA
HGLTPQQVVAIASNGGGKQALETVQRLLPVLCQAHGLTPQQVVAIASNSGGKQALETVQRLLPVLCQAHGLTPEQVVAIA
SNGGGKQALETVQRLLPVLCQAHGLTPEQVVAIASNIGGKQALETVQALLPVLCQAHGLTPEQVVAIASNIGGKQALETV
QALLPVLCQAHGLTPEQVVAIASNIGGKQALETVQALLPVLCQAHGLTPEQVVAIASHDGGKQALETVQRLLPVLCQAHG
LTPEQVVAIASHDGGKQALETVQRLLPVLCQAHGLTPQQVVAIASNGGGKQALETVQRLLPVLCQAHGLTPEQVVAIASN
SGGKQALETVQALLPVLCQAHGLTPEQVVAIASNSGGKQALETVQRLLPVLCQAHGLTPEQVVAIASHDGGKQALETVQR
LLPVLCQAHGLTPEQVVAIASHDGGKQALETVQRLLPVLCQAHGLTPEQVVAIASHDGGKQALETVQRLLPVLCQAHGLT
PQQVVAIASNGGGRPALETVQRLLPVLCQAHGLTPEQVVAIASHDGGKQALETVQRLLPVLCQAHGLTPQQVVAIASNGG
GRPALESIVAQLSRPDPALAALTNDHLVALACLGGRPALDAVKKGLPHAPALIKRTNRRIPERTSHRVADHAQVVRVLGF
FQCHSHPAQAFDDAMTQFGMSRHGLLQLFRRVGVTELEARSGTLPPASQRWDRILQASGMKRAKPSPTSTQTPDQASLHA
FADSLERDLDAPSPMHEGDQTRASSRKRSRSDRAVTGPSAQQSFEVRVPEQRDALHLPLSWRVKRPRTSIGGGLPDPGTP
TAADLAASSTVMREQDEDPFAGAADDFPAFNEEELAWLMELLPQ
>P13835 ~~~avrB~~~Avirulence protein B~~~
MGCVSSKSTTVLSPQTSFNEASRTSFRALPGPSQRQLEVYDQCLIGAARWPDDSSKSNTPENRAYCQSMYNSIRSAGDEI
SRGGITSFEELWGRATEWRLSKLQRGEPLYSAFASERTSDTDAVTPLVKPYKSVLARVVDHEDAHDEIMQDNLFGDLNVK
VYRQTAYLHGNVIPLNTFRVATDTEYLRDRVAHLRTELGAKALKQHLQRYNPDRIDHTNASYLPIIKDHLNDLYRQAISS
DLSQAELISLIARTHWWAASAMPDQRGSAAKAEFAARAIASAHGIELPPFRNGNVSDIEAMLSGEEEFVEKYRSLLDSDC
F
>Q9F5K6 2.1.1.208~~~aviRb~~~23S rRNA (uridine(2479)-2'-O)-methyltransferase~~~
MARSRGERTPAARRITSRNARFQQWQALLGNRNKRTRAGEFLVMGVRPISLAVEHGWPVRTLLYDGQRELSKWARELLRT
VRTEQIAMAPDLLMELGEKNEAPPEVVAVVEMPADDLDRIPVREDFLGVLFDRPTSPGNIGSIIRSADALGAHGLIVAGH
AADVYDPKSVRSSTGSLFSLPAVRVPSPGEVMDWVEARRAAGTPIVLVGTDEHGDCDVFDFDFTQPTLLLIGNETAGLSN
AWRTLCDYTVSIPMAGSASSLNAANAATAILYEAVRQRISGRTATTP
>Q52430 3.4.22.-~~~avrPph3~~~Cysteine protease avirulence protein AvrPphB~~~
MKIGTQATSLAVLHNQESHAPQAPIAVRPEPAHAIPEIPLDLAIRPRTRGIHPFLAMTLGDKGCASSSGVSLEDDSHTQV
SLSDFSVASRDVNHNNICAGLSTEWLVMSSDGDAESRMDHLDYNGEGQSRGSERHQVYNDALRAALSNDDEAPFFTASTA
VIEDAGFSLRREPKTVHASGGSAQLGQTVAHDVAQSGRKHLLSLRFANVQGHAIACSCEGSQFKLFDPNLGEFQSSRSAA
PQLIKGLIDHYNSLNYDVACVNEFRVS
>P09053 2.6.1.66~~~avtA~~~Valine--pyruvate aminotransferase~~~COG3977
MTFSLFGDKFTRHSGITLLMEDLNDGLRTPGAIMLGGGNPAQIPEMQDYFQTLLTDMLESGKATDALCNYDGPQGKTELL
TLLAGMLREKLGWDIEPQNIALTNGSQSAFFYLFNLFAGRRADGRVKKVLFPLAPEYIGYADAGLEEDLFVSARPNIELL
PEGQFKYHVDFEHLHIGEETGMICVSRPTNPTGNVITDEELLKLDALANQHGIPLVIDNAYGVPFPGIIFSEARPLWNPN
IVLCMSLSKLGLPGSRCGIIIANEKIITAITNMNGIISLAPGGIGPAMMCEMIKRNDLLRLSETVIKPFYYQRVQETIAI
IRRYLPENRCLIHKPEGAIFLWLWFKDLPITTKQLYQRLKARGVLMVPGHNFFPGLDKPWPHTHQCMRMNYVPEPEKIEA
GVKILAEEIERAWAESH
>P96847 2.6.1.66~~~aspB~~~Valine--pyruvate aminotransferase~~~COG0436
MTDRVALRAGVPPFYVMDVWLAAAERQRTHGDLVNLSAGQPSAGAPEPVRAAAAAALHLNQLGYSVALGIPELRDAIAAD
YQRRHGITVEPDAVVITTGSSGGFLLAFLACFDAGDRVAMASPGYPCYRNILSALGCEVVEIPCGPQTRFQPTAQMLAEI
DPPLRGVVVASPANPTGTVIPPEELAAIASWCDASDVRLISDEVYHGLVYQGAPQTSCAWQTSRNAVVVNSFSKYYAMTG
WRLGWLLVPTVLRRAVDCLTGNFTICPPVLSQIAAVSAFTPEATAEADGNLASYAINRSLLLDGLRRIGIDRLAPTDGAF
YVYADVSDFTSDSLAFCSKLLADTGVAIAPGIDFDTARGGSFVRISFAGPSGDIEEALRRIGSWLPSQ
>Q56830 ~~~~~~Avirulence protein AvrXa10~~~
MDPIRSRTPSPARELLPGPQPDRVQPTADRGGAPPAGGPLDGLPARRTMSRTRLPSPPAPSPAFSAGSFSDLLRQFDPSL
LDTSLLDSMPAVGTPHTAAAPAECDEVQSGLRAADDPPPTVRVAVTARPPRAKPAPRRRAAQPSDASPAAQVDLRTLGYS
QQQQEKIKPKVRSTVAQHHEALVGHGFTHAHIVALSQHPAALGTVAVTYQDIIRALPEATHEDIVGVGKQWSGARALEAL
LTEAGELRGPPLQLDTGQLLKIAKRGGVTAVEAVHAWRNALTGAPLNLTPDQVVAIASNIGGNQALETVQRLLPVLCQAH
GLTPDQVVAIASHGGGKQALETVQRLLPVLCQAHGLTPDQVVAIASNIGGKQALATVQRLLPVLCQDHGLTPDQVVAIAS
HGGGKQALETVQRLLPVLCQDHGLTPDQVVAIASNIGGKQALETVQRLLPVLCQDHGLTPDQVVAIASNIGGKQALETVQ
RLLPVLCQDHGLTPDQVVAIASNNGGKQALETVQRLLPVLCQTHGLTPDQVVAIANHDGGKQALETVQRLLPVLCQDHGL
TPDQVVAIASNIGGKQALATVQRLLPVLCQAHGLTPDQVVAIASHDGGKQALETVQRLLPVLCQDHGLTPDQVVAIASNN
GGKQALETVQRLLPVLCQDHGLTPAQVVAIANHGGGKQALETVQRLLPVLCQDHGLTPVQVVAIASNSGGKQALETVQRL
LPVLCQDHGLTPVQVVAIASNGGGKQALATVQRLLPVLCQDHGLTPVQVVAIASHDGGKQALETVQRLLPVLCQDHGLTP
DQVVAIASNGGKQALESIVAQLSRPDPALAALTNDHLVALACLGGRPALDAVKKGLPHAPELIRRINRRIPERTSHRVAD
LAHVVRVLGFFQSHSHPAQAFDDAMTQFGMSRHGLAQLFRRVGVTELEARYGTLPPASQRWDRILQASGMKRVKPSPTSA
QTPDQASLHAFADSLERDLDAPSPMHEGDQTRASSRKRSRSDRAVTGPSTQQSFEVRVPEQQDALHLPLSWRVKRPRTRI
GGGLPDPGTPIAADLAASSTVMWEQDAAPFAGAADDFPAFNEEELAWLMELLPQSGSVGGTI
>Q09LX1 3.1.1.72~~~axe2~~~Acetylxylan esterase~~~
MKIGSGEKLLFIGDSITDCGRARPEGEGSFGALGTGYVAYVVGLLQAVYPELGIRVVNKGISGNTVRDLKARWEEDVIAQ
KPDWVSIMIGINDVWRQYDLPFMKEKHVYLDEYEATLRSLVLETKPLVKGIILMTPFYIEGNEQDPMRRTMDQYGRVVKQ
IAEETNSLFVDTQAAFNEVLKTLYPAALAWDRVHPSVAGHMILARAFLREIGFEWVRSR
>D5EXI2 3.1.1.-~~~axe7A~~~Acetyl esterase Axe7A~~~COG3458
MFNFAPKQTTEMKKLLFTLVFVLGSMATALAENYPYRADYLWLTVPNHADWLYKTGERAKVEVSFCLYGMPQNVEVAYEI
GPDMMPATSSGKVTLKNGRAVIDMGTMKKPGFLDMRLSVDGKYQHHVKVGFSPELLKPYTKNPQDFDAFWKANLDEARKT
PVSVSCNKVDKYTTDAFDCYLLKIKTDRRHSIYGYLTKPKKAGKYPVVLCPPGAGIKTIKEPMRSTFYAKNGFIRLEMEI
HGLNPEMTDEQFKEITTAFDYENGYLTNGLDDRDNYYMKHVYVACVRAIDYLTSLPDWDGKNVFVQGGSQGGALSLVTAG
LDPRVTACVANHPALSDMAGYLDNRAGGYPHFNRLKNMFTPEKVNTMAYYDVVNFARRITCPVYITWGYNDNVCPPTTSY
IVWNLITAPKESLITPINEHWTTSETNYTQMLWLKKQVK
>D5EV35 3.1.1.72~~~axeA1~~~Acetylxylan esterase~~~COG0657
MNRKLFMTGLLMLAMTMQAQTAKKFTLNLSDDGKAQMVCFLPENPSGRAIVGVPGGGYSMLSNTHEGYQASDWLNKQGIA
YFVVNYRLPHGDRTIPVGDVEQGFRIVRDSAKVWNINPNDVGIMGFSAGGHLSSVISTMSPYEVRPNFSILFYPVISMDE
RVSHKWSCINFLGKEGYKDPKLIGQYSTQNAVRSHLTPPACIISANDDRLVPVVTNGIQYYSAMRNAGNECSLFIYPSGD
HGFGFGTWFKYHDQLLQDLGNWLKSIPAPKEDAIRVACIGNSITDGFGIDMRAKYGYPAQLQGILGDGYWVKNFGVSART
MLNKGDFPYMNEMAWKDALAFKPDVVVIKLGTNDSKPENWQYGSEFRQDLEQMIKALRPDLAQPAKKGKKKAKAAAQPAG
PKILLCTPIPAFKPSWNINDKVITDEIIPIQQEVAKQYGLQIIDLHALMLNDGDKVVDDGIHPNEKGAKKMAEIIAAAIK
>D5EXZ4 ~~~axe1-6A~~~Carbohydrate acetyl esterase/feruloyl esterase~~~COG2382
MYQSTLKTILLASALLILPASMSAQKRKAAPKKAATEQVGKPDPNFYIFLCFGQSNMEGNARPEAQDLTSPGPRFLLMPA
VDFPEKGRKMGEWCEASAPLCRPNTGLTPADWFGRTLVASLPENIKIGVIHVAIGGIDIKGFLPDSIQNYLKVAPNWMKG
MLAAYDNNPYERLVTLAKKAQKDGVIKGILMHQGETNTGDPKWAGMVKQVYDNLCGDLNLKPEEVNLYAGNIVQADGKGV
CIGCKKQIDELPLTLHTSQVISSDGCTNGPDRLHFDAAGYRELGCRYGEAVARHLGYEPKRPYIEMPKQIEVPADAFIAE
TTVPGNEFPKVDKEGRAYFRIAAPEARKVVLDICNKKYDMQRDGKGNFMAVTDPLPVGFHYYFLNINGVNFIDPSTETFF
GCNRESGGIEIPEGSEGDYYRPQQGVPAGQVRSIYYYSNEQQTWRHAMVYTPAEYELAKNAKKRYPVLYLQHGMGEDETG
WSKQGHMQHIMDNAIAKGEAVPMIVVMESGDIKAPFGGGNNQAGRSAYGASFYPVLLNDLIPYIDSNYRTKSDRENRAMA
GLSWGGHQTFDVVLTNLDKFAWLGTFSGAIFGLDVKTAYDGVFANADEFNKKIHYMYMNWGEEDFIKSGDIVKQLRELGI
KVDSNESKGTAHEWLTWRRGLNEFIPHLFKK
>A0A2A5K5H4 ~~~~~~Nucleobase transporter PlAzg1~~~
MKGTFDVSRWETWFQLKERGTTWTTEILAGCTTFMTMVFILVVNPAILSDAGMDFNGVYVATVLVTLISTLIIALFGNFP
FVIAPGMGINAFFAYSVVKAQGIPWQTALGSVFLAGTVFLVLALTRYRRFLLDAIPQSLKYAITAGTGLFICFVGLQNAK
LVISSPDTLVTLGNLREPGTLLSIIGLAVTLLLMTYRIRGALFLGMIVTSVLAWIMGLMQLPSHFLSIPSGLAHTALQLD
IDGVFNYDMLAVTFTFLLISVFETTGTMVSLAEQAGLMKEGRFPHSRSALLANAVGVTSGALLGTSPITTLVESGSGIAA
GGRTGLTPIVTCILLVITMFFAPVAETLASTPFVTAPALVIVGFSMLEEMVHVEWKSFEEAFPSFLVMVTMPLTYSVSSA
IGIGFIVYVLLKLFRGKAKEVHPALYFFALFFFIQLGFIHS
>A0A2A5K485 ~~~~~~Nucleobase transporter PlAzg2~~~
MYVSSLLFLYASESYESFIFHFQMCYNKAHNGFMPSIFQQLKGLGEGLQWKVGISLDHQEKQTSSHEKEMTSPNSATRLI
PSDWRRELLAGTVSFFAAVYIIIVNSSILADAGIPQEAGIIATILASAIGCFIMGLWGNAPLVIVPGMGINAMFTYTLVQ
GMGLTWQQALAAVMMSGICFFAISMTSLVEKLRTAIPASLQEAISVGIGLMLVLIGLHKGGVIASDRSSVIAVQSFADPG
VLVTLATLALTCILYIRKVPGNLLLAIIGGSALAYLFKAVPSKAATGVGGSSWSSYGDLFGQLSVKGASITTLVIAVFSL
TLVIVFENVGLINAQLKMSGRTERFKRVTQATSLTVILSGIFGTSPTVSTVEAAAGISAGGRTGWASIATGTLFLLSFIA
MPVITLVPDQAVAPILIFIGGLMMPAVRHISFERMEEGLPAFFIIAFIPLMHSIVDGIAIGFISYALFHIAVGKWREVKP
LFYIISLLFVMHFVLQTM
>B4XY99 1.14.13.189~~~aziB1~~~5-methyl-1-naphthoate 3-hydroxylase~~~
MTTEAADATDRLVTAFDHHDPGYTPRTAERINTEIRERGVTWSPAYGGIWILSRYADVRAALTDWRTYSSARGVHFPRAE
GMPMFSPIDYDPPAQRGIRERMAAPMTGDAVSAMVPELRRMVARLLAPLAGRGHGDLMAEFAEPFAIEVLGVAFGLSESC
RARIREATRTMWTYISADRDASKFWPAFHALLAEEVERVRDEPDGSYLARLAAMRRDGSPLPDEELYSIIVSFCVAGHDN
TMNSITRLVHTLAQDPALQLRLRREPELRPAVAEEALRRWCPTDRFTRVTTREVTVAGTVIPAGARVVLLFDAANRDPEK
FPDPDTFDPDRGNSHQHLSFGHGIHHCMGVHLARAEFAAVLDELSRLPLFDLEQPSDLHFENGRHIMFDRVSVRFRTGEE
H
>B4XY98 2.1.1.302~~~aziB2~~~3-hydroxy-5-methyl-1-naphthoate 3-O-methyltransferase~~~
MAAGSGSGTPPGTPPPTLLTDLATGLWKTQTLTAAIETGLFEALAAGDADAPETAQRLGIGKRPAEILLTACTALGLLEQ
RDGRYRNTAVAAHYLVPGLPDYFGGYVQMVARYTAPGWLRATEAVRTDAPTKPVPDPDRNMFEEGNRPESFWEGLFTFST
LTARQLAASVDLSGVRRIMDVGGGAGATLIELCRQHPHLSGTVVDLPHVCALAGERIAAAGMTGRIDTAAADFFADPLPS
GHDAVLLSMILHDWDESQNRKILASCLDALPSGGTVLISELLVDDDKSGPVDAALMSMNMLVGTWGRNYTGAEYTDWLRD
AGCSEVRTVRFASPGANGVVAGVKA
>B4XYB8 2.3.1.236~~~aziB~~~5-methyl-1-naphthoate synthase~~~
MAENVQNPPVEPLAVIGMSCRFAPDLDTPGRLWEFLRAGGSAVGEMPDRRWDPYVTDSRTRDILRTTTRKGSFMRDIEGF
DAEFFQITPREAEYIDPQQRIMLELAWEALCDAGLPPTSLAGTDASVYVAANSNDYGRRLLEDLDRTGAWAVNGTTFYGI
ANRISYFLDAHGPSMAVDTACAGSLTALHVAGQALHRGETSVAIVGGINIMASPALVVALDAASATSPDGRSKSFDKAAD
GYGRGEGGGVVVLKRLSDAVRDGDPVHGLVLASGVFQDGRSDGMMAPNGSAQQRMLEEIYRRSGIDPGTVQYVEAHGTGT
QLGDAAEAQAIGNVFGPGRDGDNPLLIGTLKPNVGHVEAASGIAGVIKVLLGMRHGELPPSPHEEPDPGLGLEARGLRLV
AEPTPWPRGEHGMRAGVSSYGVGGSIAHAVLQQAPPRPDRTERPAAAATGRPQVFPLSAASEQGVRGLAGSVAAWLRAHP
ETALDDLAHTFTARRSHLSRRAAVVAGTTEELLGGLDALAGGEKSPAVALASASGFGDGGAAGPAWVFSGHGAQWSGMGR
ELLTTEPVFAQVIDELAPVFSEELGWTPREAIEAGGPWTVVRTQAMTFAMQVALAEVWSDLGLRPGAIIGHSVGEIAAAA
VAGSLDRAEAARFACRRARALGKIAGRGAMAMVPMAFADVEQRVAGRDAVVAAIAASPLSTVVSGDTAAVEALLADLEAD
GIQARRVNTDVAFHSPHVQEILDEVRQAAAALRAGTPRVTLYSTALADPRSDAPREGEYWATNLADPVRFHQAVRAALDD
GTRVFLEVSSHPVVAHSITETALDAGVPDAHVAITLRREQPEQRTVLANLARLHSLGTPVTWSYDGDLVDVPAVRWQHKP
YWIFPDTAPEQGAGLGHDPQTHTLIGARTTVASAPVQRVWQTELHMENRPYAQSHKVVGVETVPASVVLNSFITAATNEG
ERACGLRDIVFRIPLAAHPTRVVQVVLEQDKVRIASRIKRDQESGGVRDDEWLTHTTATVVHEPEVGARPMEDPDVIRAR
CPVSWTWAKVDGIFRTMGVDGYTFPWVVEELLRGEDEQFSTITVDHTPKLHPSSWTAVVDAALTASGVLVMDENSNVLRT
CSHLESLSFVGPPPPRIHVHTVRDPRTPDTISMTVADESGAVVCEARGLRYVKVQDIGSGAVGPRDLVHELAWEPVEVPA
DAPVPSQALVVGGAAGGPALVEALTARGVRARAVPDATAIGDASLTCADVVVVAPEALLPGEAPEQAARRCAQLLVDAVQ
QVAAVPDERRRPRVWALTREVRAGATEAALAHAPLWGAGRIVAGERPDLWGGVIDVAENAVPQQVASLIGALPHTEDVLS
LDSEGVTAARLRQVARPAEREPVDCRPDGTYLVTGGLGALGLEAARHLVEQGARRLVLIGRRGLPSRSRWDQVDDPAVAA
QIAEVVALEAAGATVRVLSLDISDAEATARALDPGALDMPPVRGIVHCAGVVSDALVEKTGAANLDTTMGPKADGAMVLH
RLFPAGTLDFFTMFSSCGQLARLTGQVSYASANSFLDALAALRRSRGETGTTSFAWAQWIGRGMGETTGRATILEAESRG
LGGITVSEALRSWAYADRFALPYAAVMRVMPDHTLPVFSHLSVTDAGAQSADAGGVDWATVPAGELPELVLKVTHEQVAA
ELNLAVDDIAIDQPLLELGVDSVLTVALRVRLHRCFAVDLPPTILWSNPTVRALAEFLAAEVGGATADAEETDPVAGLPA
PQQGSGTAEQLDAVAAAAG
>Q7A782 1.7.-.-~~~azo1~~~FMN-dependent NADPH-azoreductase~~~
MKGLIIIGSAQVNSHTSALARYLTEHFKTHDIEAEIFDLAEKPLNQLDFSGTTPSIDEIKQNMKDLKEKAMAADFLILGT
PNYHGSYSGILKNALDHLNMDYFKMKPVGLIGNSGGIVSSEPLSHLRVIVRSLLGIAVPTQIATHDSDFAKNEDGSYYLN
DSEFQLRARLFVDQIVSFVNNSPYEHLK
>Q50H63 1.7.1.17~~~azo1~~~FMN-dependent NADPH-azoreductase~~~
MKGLIIIGSAQVNSHTSALARYLTEHFKTHDIEAEIFDLAEKPLNQLDFSGTTPSIDEIKQNMKDLKEKAMAADFLILGT
PNYHGSYSGILKNALDHLNMDYFKMKPVGLIGNSGGIVSSEPLSHLRVIVRSLLGIAVPTQIATHDSDFAKNEDGSYYLN
DSEFQLRARLFVDQIVSFVNNSPYEHLK
>Q8KU07 1.7.-.-~~~azoB~~~NAD(P)H azoreductase~~~
MILVVGGTGTIGSEVVRLLQEAKLPFKALVRDAAKARELNARGVQTAAGDLREPRTLPAALGGVDKVFVVTPLVPDQVQM
RAALITAAKTAGVKHFVMSTGIGAAPDSPVQIGRWLGENQQQVQESGMAWTFVQPGFFMQNLLMYAQAIREKGEFYMPLG
EGKVSWIDARDIAAVAVQALTKPGHENQAYPVTGPQALSGAEVAAALSAAAGRPVRYVAITLEQAKQAMTGMGMPESLAD
AMNELYALAPPDYLAGVLDTVPKVTGRPARTFAEFAKAHAAAFGAA
>Q81UB2 1.6.5.-~~~azoR1~~~FMN-dependent NADH:quinone oxidoreductase 1~~~COG1182
MNKTLIINAHPKVDDTSSVSIKVFKHFLESYKELISNNETIEQINLYDDVVPMIDKTVLSAWEKQGNGQELTREEQKVTE
RMSEILQQFKSANTYVIVLPLHNFNIPSKLKDYMDNIMIARETFKYTETGSVGLLKDGRRMLVIQASGGIYTNDDWYTDV
EYSHKYLKAMFNFLGIEDYQIVRAQGTAVLDPTEVLQNAYKEVEEAASRLANKYIFSLEE
>O35022 1.6.5.-~~~azoR1~~~FMN-dependent NADH:quinone oxidoreductase 1~~~COG1182
MSTVLFVKSSDRTAEEGVSTKLYEAFLAAYKENNPNDEVVELDLHKENLPYLGRDMINGTFKAGQGMEMTEDEKKQAAIA
DKYLNQFVKADKVVFAFPLWNFTVPAVLHTYVDYLSRAGVTFKYTQEGPVGLMGGKKVALLNARGGVYSEGPMAALEMSL
NFMKTVLGFWGVQDLHTVVIEGHNAAPDQAQEIVEKGLQEAKDLAAKF
>Q9I5F3 1.6.5.-~~~azoR1~~~FMN-dependent NAD(P)H:quinone oxidoreductase 1~~~
MSRILAVHASPRGERSQSRRLAEVFLAAYREAHPQARVARREVGRVPLPAVTEAFVAAAFHPQPEQRSLAMQADLALSDQ
LVGELFDSDLLVISTPMYNFSVPSGLKAWIDQIVRLGVTFDFVLDNGVAQYRPLLRGKRALIVTSRGGHGFGPGGENQAM
NHADPWLRTALGFIGIDEVTVVAAEGEESGGRSFEDSCDEAEQRLLALARSA
>Q88IY3 1.6.5.-~~~azoR1~~~FMN-dependent NADH:quinone oxidoreductase 1~~~COG1182
MKLLHIDSSILGDNSASRQLSREVVEAWKAADPSVEVVYRDLAADAIAHFSAATLVAAGTPEDVRDAAQAFEAKLSAETL
EEFLAADAVVIGAPMYNFTVPTQLKAWIDRVAVAGKTFRYTEAGPQGLCGNKKVVLVSTAGGLHAGQPTGAGHEDFLKVF
LGFIGITDLEIVRAHGLAYGPEQRSQAIDAAQAQIASELFAAA
>O32224 1.6.5.-~~~azoR2~~~FMN-dependent NADH:quinone oxidoreductase 2~~~COG1182
MAKVLYITAHPHDEATSYSMATGKAFIESYKEANPNDEVVHIDLYKENIPHIDADVFSGWGKLQSGTGFEELSESEKAKV
GRLGELSDQFASADKYVFVTPLWNFSFPPVMKAYLDSVAVAGKSFKYTEQGPVGLLTDKKAIHIQARGGYYSEGPAAEME
MGHRYIGIMMNFFGVPSFDGIFVEGHNAEPDKAQQIKEDAIARAKEAGKTF
>Q9I2E2 1.6.5.-~~~azoR2~~~FMN-dependent NADH:quinone oxidoreductase 2~~~
MKLLHIDSSILGDASASRQLSAELVQAWRQNEDGLDVTYRDLAADAVAHFSALTLAAGSTPAELRDAALKHEVAVGEEVL
EEFLAADVVVIGAPMYNFTISSQLKAWIDRIAVAGKTFRYTENGPVGLAGDKKVVIVSTAGGVHAGQPTGAAHEGYLRTV
LGFFGITDIEVVRAEGLAYGEEPRTQAIAAARRQIAGQFAAA
>Q9HZ17 1.6.5.-~~~azoR3~~~FMN-dependent NADH:quinone oxidoreductase 3~~~
MSRVLVIESSARQRGSVSRLLTAEFISHWKIAHPADRFQVRDLAREPLPHLDELLLGAWTTPCDGHSAAERRALERSNRL
TEELRMADVLVLAAPMYNFAIPSSLKSWFDHVLRAGLTFRYAEQGPEGLLQGKRAFVLTARGGIYAGGGLDHQEPYLRQV
LGFVGIHDVTFIHAEGMNMGPEFREKGLARARERMRQALETDTSLCVPLPTLR
>Q81JP2 1.6.5.-~~~azoR4~~~FMN-dependent NADH:quinone oxidoreductase 4~~~COG1182
MTKVLFVKANNRPAEQAVSVKLYEAFLASYKEAHPNDTVVELDLYKEELPYVGVDMINGTFKAGKGFDLTEEEAKAVAVA
DKYLNQFLEADKVVFGFPLWNLTIPAVLHTYIDYLNRAGKTFKYTPEGPVGLIGDKKIALLNARGGVYSEGPAAEVEMAV
KYVASMMGFFGATNMETVVIEGHNQFPDKAEEIITAGLEEAAKVANKF
>P41407 1.6.5.-~~~azoR~~~FMN-dependent NADH:quinone oxidoreductase~~~COG1182
MSKVLVLKSSILAGYSQSNQLSDYFVEQWREKHSADEITVRDLAANPIPVLDGELVGALRPSDAPLTPRQQEALALSDEL
IAELKAHDVIVIAAPMYNFNISTQLKNYFDLVARAGVTFRYTENGPEGLVTGKKAIVITSRGGIHKDGPTDLVTPYLSTF
LGFIGITDVKFVFAEGIAYGPEMAAKAQSDAKAAIDSIVSA
>Q831B2 1.6.5.-~~~azoR~~~FMN-dependent NADH:quinone oxidoreductase~~~COG1182
MSKLLVVKAHPLTKEESRSVRALETFLASYRETNPSDEIEILDVYAPETNMPEIDEELLSAWGALRAGAAFETLSENQQQ
KVARFNELTDQFLSADKVVIANPMWNLNVPTRLKAWVDTINVAGKTFQYTAEGPKPLTSGKKALHIQSNGGFYEGKDFAS
QYIKAILNFIGVDQVDGLFIEGIDHFPDRAEELLNTAMTKATEYGKTF
>P63462 1.6.5.-~~~azoR~~~FMN-dependent NADH:quinone oxidoreductase~~~
MSKVLVLKSSILAGYSQSGQLTDYFIEQWREKHVADEITVRDLAANPVPVLDGELVGAMRPGDAPLTPRQQDALALSDEL
IAELKAHDVIVIAAPMYNFNIPTQLKNYFDLIARAGITFRYTEKGPEGLVTGKRAVVLSSRGGIHKDTPTDLIAPYLKVF
LGFIGITDVNFVFAEGIAYGPEVAAKAQADAKAAIDSVVAA
>Q99X11 1.6.5.-~~~azoR~~~FMN-dependent NADH:quinone oxidoreductase~~~
MAKVLYITAHPFNELVSNSMAAGKAFIETYQQQHPDDEVKHIDLFETYIPVIDKDVLTGWGKMSNGETLTDDEQMKVSRL
SDILEEFLSADKYVFVTPMWNLSFPPVVKAYIDAISIAGKTFKYSAEGPQGLLTDKKVLHIQSRGGYYTEGPAADFEMGD
RYLRTIMTFLGVPSYETIIIEGHNAEPHKTEEIKATSINNAEKLATTF
>Q8ZE60 1.6.5.-~~~azoR~~~FMN-dependent NADH:quinone oxidoreductase~~~COG1182
MSKVLVLKSSILATSSQSNQLADFFVEQWQAAHAGDQITVRDLAAQPIPVLDGELVGALRPSGTALTPRQQEALALSDEL
IAELQANDVIVIAAPMYNFNIPTQLKNYFDMIARAGVTFRYTEKGPEGLVTGKRAIILTSRGGIHKDTPTDLVVPYLRLF
LGFIGITDVEFVFAEGIAYGPEVATKAQADAKTLLAQVVAA
>Q9FAW5 1.7.1.6~~~azr~~~NADPH azoreductase~~~
MKLVVINGTPRKFGRTRVVAKYIADQFEGELYDLAIEELPLYNGEESQRDLEAVKKLKTLVKAADGVVLCTPEYHNAMSG
ALKNSLDYLSSSEFIHKPVALLAVAGGGKGGINALNSMHASLAGVYANAIPKQVVLDGLHVQDGELGEDAKPLIHDVVKE
LKAYMSVYKEVKKQLGVE
>O07529 1.7.-.-~~~azr~~~FMN-dependent NADPH-azoreductase~~~COG0431
MNMLVINGTPRKHGRTRIAASYIAALYHTDLIDLSEFVLPVFNGEAEQSELLKVQELKQRVTKADAIVLLSPEYHSGMSG
ALKNALDFLSSEQFKYKPVALLAVAGGGKGGINALNNMRTVMRGVYANVIPKQLVLDPVHIDVENATVAENIKESIKELV
EELSMFAKAGNPGV
>A1B2F3 ~~~aztC~~~High-affinity zinc uptake system protein AztC~~~COG0803
MKDWLFRIATCSIMTFSSLAAAQAEPLDVVATFSIIGDFAAKVGGDRIRLNVLVGPDSDTHVYEPRPADAIALAGADVVL
TNGLEFEGFLTRLIAASGTDAAVATLTDGVETMEEPGGGHYHYIDGKAVFHAGAHDPHAWQAVPNAKVYVQNIAAAFCAA
DAEGCAAYQANAARYIGELDALDTEIRAAIAALPQDRRTVVVAHNAFRYFEAAYGVHFLSPQGVSTESEAAAADVAGLIR
EIRARNASAIFAENISDTRLLEQIAREAGLPLAGTLYSDALSGPDGPASNYIAMMRHNAGAIAAALAAR
>A8AF35 ~~~aztD~~~Zinc chaperone AztD~~~
MMENIMKKRLLSTSISTLLLGLSVMPAFADEDVTAWRLFIADHDKPVVNVIDALDGDKLATFNVKGPANLSRSESGATIF
AIQGSAGVVSTIASGIAFHDHGDHADIDIDAPKLLPLELTGKKPGHFVERQGKIAQWFDGEDSAQILGESAVLKGQKNIT
KVNVVAPHHGVAVPYDNYAVVSIPNPDDASKRPVGARVVDLQGKKVGDDALCPGLHGSAGSGDTFALSCETGLLLITQKN
AAPVIRHLPYAKTLPEGSTSTLIGGKGMQYFIGNYGPDRIILVDPTESDSFRLIQLPTRRVHFVVDPVRAKFAYVFTEDG
KLNQIDVLKGEISQSVRVTDPYSMDGHWNDPRPRIAVADNKIYVTDPLKSKIIVLDATSFKKTSEISVEGQPFNIVAVGG
SGKVHGEHHDHEAHHHDDHAH
>A1B2F4 ~~~aztD~~~Zinc chaperone AztD~~~COG3391
MLRHLAGASALALTLAGAGFAQDHDHDHEDVTLYRVFVGDHEKGQVTAFDLAEPDHRWTFPTTGQVKLYSVAGGAVVAAV
QSDADTVQFIRSGISFHDHGDHRDIEVGDPAAIDASLTGPRPFHLVEHDGKVVLNYDQGGYAEILDGHALAEGKAEPGRF
PQARAHHGFVAPLGGNWLSTVASDEKVEGDASVPRLGLQAFDAEGNPAGNLATCTGIHGEAFSGAYLAAGCKEGVLTVKA
GANGSEYKLLPYPADLPQGVTTGTLLGSTGIQVFLGNYGPDGLVVIDPVDEPHYRYIKLPFRRVDFALDPAKPSTGYVLT
EDGSLHRIDLLKAEIVASAKVTEPYSMDGHWNDPRPRIAMAGDEIVVTDPNAGLVRRIATEDLSERGTVPVEGKPYNIAV
TGGSGVTH
>C1P605 ~~~azuC~~~Uncharacterized protein AzuC~~~
MKLRKILKSMFNNYCKTFKDVPPGNMFR
>P19567 ~~~bcp~~~Pseudoazurin~~~
MEKTMLNAIKSGFGIAIAAMLVAAPAAAADFEVHMLNKGKDGAMVFEPASLKVAPGDTVTFIPTDKGHNVETIKGMIPDG
AEAFKSKINENYKVTFTAPGVYGVKCTPHYGMGMVGVVQVGDAPANLEAVKGAKNPKKAQERLDAALAALGN
>P04377 ~~~~~~Pseudoazurin~~~
MRNIAIKFAAAGILAMLAAPALAENIEVHMLNKGAEGAMVFEPAYIKANPGDTVTFIPVDKGHNVESIKDMIPEGAEKFK
SKINENYVLTVTQPGAYLVKCTPHYAMGMIALIAVGDSPANLDQIVSAKKPKIVQERLEKVIASAK
>P04171 ~~~~~~Pseudoazurin~~~COG3794
MMIFRALIAAATLAIAIATTLPAAADEVAVKMLNSGPGGMMVFDPALVRLKPGDSIKFLPTDKGHNVETIKGMAPDGADY
VKTTVGQEAVVKFDKEGVYGFKCAPHYMMGMVALVVVGDKRDNLEAAKSVQHNKLTQKRLDPLFAQIQ
>P80649 ~~~~~~Pseudoazurin~~~
ATHEVHMLNKGESGAMVFEPAFIRAEPGDVINFIPTDKSHNVEAIKEILPEGVETFKSKINEAYALTVTEPGLYGVKCTP
HFGMGMVGLVQVGDAPENLDAAQTAKMPKKARERMDAELAQVN
>P80401 ~~~pazS~~~Pseudoazurin~~~COG3794
MFHHSLAAAAAALLALAAPGFAATHEVHMLNKGESGAMVFEPAFVRAEPGDVINFVPTDKSHNVEAIKEILPEGVESFKS
KINESYTLTVTEPGLYGVKCTPHFGMGMVGLVQVGDAPENLDAAKTAKMPKKARERMDAELAQVN
>P56547 ~~~~~~Azurin-1~~~COG3241
AECSVDIAGNDGMQFDKKEITVSKSCKQFTVNLKHPGKLAKNVMGHNWVLTKQADMQGAVNDGMAAGLDNNYVKKDDARV
IAHTKVIGGGETDSVTFDVSKLAAGEDYAYFCSFPGHFALMKGVLKLVD
>P12334 ~~~~~~Azurin iso-1~~~
MKSSKIVAILLASLFSGSVLAAGCSVDVEANDAMQYNTKNIDVEKSCKEFTVNLKHTGSLPKNVMGHNLVITKTADFKAV
MNDGVAAGEAGNFVKAGDARVVAHTKLVGGGEKDSVKVDVSKLAAGEKYTFFCSFPGHATMMRGTVTVK
>P56275 ~~~~~~Azurin-2~~~COG3241
AQCEATVESNDAMQYNVKEIVVDKSCKQFTMHLKHVGKMAKVAMGHNLVLTKDADKQAVATDGMGAGLAQDYVKAGDTRV
IAHTKVIGGGESDSVTFDVSKIAAGENYAYFCSFPGHWAMMKGTLKLGS
>P12335 ~~~~~~Azurin iso-2~~~
ASCETTVTSGDTMTYSTRSISVPASCAEFTVNFEHKGHMPKTGMGHNWVLAKSADVGDVAKEGAHAGADNNFVTPGDKRV
IAFTPIIGGGEKTSVKFKVSALSKDEAYTYFCSYPGHFSMMRGTLKLEE
>Q9F646 ~~~~~~Azurin~~~
MFAKAVAVSLLTLASAQVFAADCKVTVDSTDQMSFNTKAIEIDKSCKTFTVELTHSGNLPKNVMGHNLVISKEADMQPIA
TDGLSAGIDKDYLKEGDDRVIAHTKVIGAGEKDSVTFDVSKLKADEKYGFFCSFPGHISMMKGTVTLK
>P00280 ~~~azu~~~Azurin~~~
MLAKATLAIVLSAASLPVLAAQCEATIESNDAMQYNLKEMVVDKSCKQFTVHLKHVGKMAKVAMGHNWVLTKEADKQGVA
TDGMNAGLAQDYVKAGDTRVIAHTKVIGGGESDSVTFDVSKLTPGEAYAYFCSFPGHWAMMKGTLKLSN
>P00281 ~~~~~~Azurin~~~COG3241
ACDVSIEGNDSMQFNTKSIVVDKTCKEFTINLKHTGKLPKAAMGHNVVVSKKSDESAVATDGMKAGLNNDYVKAGDERVI
AHTSVIGGGETDSVTFDVSKLKEGEDYAFFCSFPGHWSIMKGTIELGS
>P00279 ~~~~~~Azurin~~~
AECSVDIAGNDQMQFDKKEITVSKSCKQFTVNLKHPGKLAKNVMGHNWVLTKQADMQGAVNDGMAAGLDNNYVKKDDARV
IAHTKVIGGGETDSVTFDVSKLAAGEDYAYFCSFPGHFALMKGVLKLVD
>P0A321 ~~~~~~Azurin~~~COG3241
MFKQVLGGMALMAAFSAPVLAAECSVDIAGTDQMQFDKKAIEVSKSCKQFTVNLKHTGKLPRNVMGHNWVLTKTADMQAV
EKDGIAAGLDNQYLKAGDTRVLAHTKVLGGGESDSVTFDVAKLAAGDDYTFFCSFPGHGALMKGTLKLVD
>P00282 ~~~azu~~~Azurin~~~
MLRKLAAVSLLSLLSAPLLAAECSVDIQGNDQMQFNTNAITVDKSCKQFTVNLSHPGNLPKNVMGHNWVLSTAADMQGVV
TDGMASGLDKDYLKPDDSRVIAHTKLIGSGEKDSVTFDVSKLKEGEQYMFFCTFPGHSALMKGTLTLK
>B3EWN9 ~~~~~~Azurin~~~COG3241
AECSVDIQGNDQMQFNTNAITVDKSCKQFTVNLSHPGNLPKNVMGHNWVLSTAADMQGVVTDGMASGLDKDYLKPDDSRV
IAHTKLIGSGEKDSVTFDVSKLKEGEQYMSFCTFPGHSALMKGTLTLK
>P00286 ~~~~~~Azurin~~~
AECKVDVDSTDQMSFNTKEITIDKSCKTFTVNLTHSGSLPKNVMGHNWVLSKSADMAGIATDGMAAGIDKDYLKPGDSRV
IAHTKIIGSGEKDSVTFDVSKLTAGESYEFFCSFPGHNSMMKGAVVLK
>P00283 ~~~~~~Azurin~~~
AECSVDIQGNDQMQFSTNAITVDKACKTFTVNLSHPGSLPKNVMGHNWVLTTAADMQGVVTDGMAAGLDKNYVKDGDTRV
IAHTKIIGSGEKDSVTFDVSKLKAGDAYAFFCSFPGHSAMMKGTLTLK
>P80546 ~~~~~~Azurin~~~
AECKVTVDSTDQMSFNTKAIEIDKSCKTFTVELTHSGSLPKNVMGHNWVLSSAADMPGIASDGMAAGIDKNYLKEGDIRV
IAHTKIIGAGEKDSVTFDVSKLAAGTDYAFFCSFPGHISMMKGTVTVK
>P00284 ~~~~~~Azurin~~~
AECKTTIDSTDQMSFNTKAIEIDKACKTFTVELTHSGSLPKNVMGHNLVISKQADMQPIATDGLSAGIDKNYLKEGDTRV
IAHTKVIGAGEKDSLTIDVSKLNAAEKYGFFCSFPGHISMMKGTVTLK
>P00285 ~~~~~~Azurin~~~
AECKVTVDSTDQMSFDTKAIEIDKSCKTFTVDLKHSGNLPKNVMGHNWVLTTQADMQPVATDGMAAGIDKNYLKEGDTRI
IAHTKIIGAGETDSVTFDVSKLKADGKYMFFCSFPGHIAMMKGTVTLK
>P34097 ~~~~~~Azurin~~~
AECKVTVDSTDQMSFNTKDIAIDKSCKTFTVELTHSGSLPKNVMGHNLVISKEADMQPIATDGLSAGIDKQYLKDGDARV
IAHTKVIGAGEKDSVTFDVSKLAAGEKYGFFCSFPGHISMMKGTVTLK
>Q73E41 7.2.2.10~~~~~~Calcium-transporting ATPase 1~~~
MSNWYSKTKDQTLIDLETNEQHGLTEEIASERLKQYGSNELATKQKRTLWQRIFAQINDVLVYVLIIAALISAFVGEWAD
ASIIALVVVLNAVIGVVQESKAEQALEALKKMATPKAIVKRDGELKEIPSEHVVPGDIVMLDAGRYIPCDLRLIETANLK
VEESALTGESVPVDKDAIYHPSMQSDEQVPLGDQKNMAFMSTLVTYGRGVGVAVETGMNSQIGKIATLLHEADDDMTPLQ
KSLAQVGKYLGFVAVAICIVMFLIGFLQRRDTLEMFMTAISLAVAAIPEGLPAIVSIVLAIGVQRMIKQNVIIRKLPAVE
ALGSVTIICSDKTGTLTQNKMTVTHFYSDNTYDQLENLNVNNDVQRLLLENMVLCNDASYNNESQTGDPTEIALLVAGST
FNMQKDYLEKIHERINELPFDSDRKMMSTVHTYDESYYSMTKGAIDKLLPRCTHILKNGKIEVLTEADKNQILEAARAMS
REALRVLSFAFKQYNSSNVDIDHLEENLIFIGLVGMIDPPRTEVKDSITECKKAGIRTVMITGDHKDTAFAIAKELGIAE
DICEIMIGTELDNISDTELASKIDHLHVFARVSPEHKVKIVKALRAKGNIVSMTGDGVNDAPSLKQADVGVAMGITGTDV
AKGAADVVLTDDNFSSIVKAVEEGRNIYRNIKKSILFLLSCNFGEIITLFLAILLGWATPLRPIHILWVNLITDTLPALS
LGVDPEDPDVMKEKPRHAKESLFSGSVPFLIFNGVVIGLLTLIAFIAGAKFYTGDTNLFPLFPERIDEDALLHAQTMAFV
VLSFSQLVHSFNLRSRTKSIFSIGIFTNKYLVFSLLIGVLMQVCIISIPPLANIFGVHALTLRDWGFVLLLSIIPLVVNE
IIKLAKKN
>P39638 4.1.1.100~~~bacA~~~Prephenate decarboxylase~~~COG0077
MIILDNSIQTKKRTDSLSKLITVNTLGPEGTSSEYAAKHFISNFTLLQGLNSKLSLHDTFESCIERTLQSPLEYTIVPHA
YDGIKHFYMRPDLQLLQIFRCDTPMYGLAVRPDFEFRDDMLDTSVIVSHPSPINLIKYFTRKDVRFKLVNSTSQAARKVK
EGLYDIALTNELARQKYGLTFVKTFKSIPMSWSLFGKGDVDDEN
>P9WQI9 7.6.2.-~~~bacA~~~Hydrophilic compounds import ATP-binding/permease protein BacA~~~COG4178
MGPKLFKPSIDWSRAFPDSVYWVGKAWTISAICVLAILVLLRYLTPWGRQFWRITRAYFVGPNSVRVWLMLGVLLLSVVL
AVRLNVLFSYQGNDMYTALQKAFEGIASGDGTVKRSGVRGFWMSIGVFSVMAVLHVTRVMADIYLTQRFIIAWRVWLTHH
LTQDWLDGRAYYRDLFIDETIDNPDQRIQQDVDIFTAGAGGTPNAPSNGTASTLLFGAVQSIISVISFTAILWNLSGTLN
IFGVSIPRAMFWTVLVYVFVATVISFIIGRPLIWLSFRNEKLNAAFRYALVRLRDAAEAVGFYRGERVEGTQLQRRFTPV
IDNYRRYVRRSIAFNGWNLSVSQTIVPLPWVIQAPRLFAGQIDFGDVGQTATSFGNIHDSLSFFRNNYDAFASFRAAIIR
LHGLVDANEKGRALPAVLTRPSDDESVELNDIEVRTPAGDRLIDPLDVRLDRGGSLVITGRSGAGKTTLLRSLAELWPYA
SGTLHRPGGENETMFLSQLPYVPLGTLRDVVCYPNSAAAIPDATLRDTLTKVALAPLCDRLDEERDWAKVLSPGEQQRVA
FARILLTKPKAVFLDESTSALDTGLEFALYQLLRSELPDCIVISVSHRPALERLHENQLELLGGGQWRLAPVEAAPAEV
>P39639 5.3.3.19~~~bacB~~~H2HPP isomerase~~~COG1917
MKTKEDMQELYFPTPKLIEWENGVRQYSTVRGDTEVLMSYVPPHTNVEPHQHKEVQIGMVVSGELMMTVGDVTRKMTALE
SAYIAPPHVPHGARNDTDQEVIAIDIKRLKADETYTSPEDYFLDIFKTRDLLPGMEVTFFVEDWVEIMLAKIPGNGGEMP
FHKHRNEQIGICIGGGYDMTVEGCTVEMKFGTAYFCEPREDHGAINRSEKESKSINIFFPPRYNRAKAKKMKADE
>Q8KWS9 1.1.1.385~~~bacC~~~Dihydroanticapsin 7-dehydrogenase~~~COG1028
MNLTDKTVLITGGASGIGYAAVQAFLNQQANVVVADIDEAQGEAMIRKENNDRLHFVQTDITNEPACQNAILSAVDKFGG
LDVLINNAGIEIVAPIHEMELSDWNKVLNVNLTGMFLMSKHALKYMLKSGKGNIINTCSVGGVVAWPDIPAYNASKGGVL
QTDAFYRPSIIAKHNIRVNCVCPGIIDTPLNEKSFLENNEGTLEEIKKEKAKVNPLLRLGKPEEIANVMLFLASDLSSYM
TGSAITADGGYTAQ
>Q8KWT4 1.1.1.385~~~bacC~~~Dihydroanticapsin 7-dehydrogenase~~~
MNLTDKTVLITGGASGIGYAAVQAFLNQQANVVVADIDEAQGEAMIRKENNDRLHFVHTDITDEPACQNAIRSAVDKFGG
LDVLINNAGIEIVAPIHEMELSNWNKVLNVNLTGMFLMSKHALKYMLKSGKGNIINTCSVGGVVAWPDIPAYNASKGGVL
QLTRSMAVDYAKHNIRVNCVCPGIIDTPLNEKSFLENNEGTLEEIKKEKAKVNPLLRLGKPEEIANVMLFLASDLSSYMT
GSAITADGGYTAQ
>P39640 1.1.1.385~~~bacC~~~Dihydroanticapsin 7-dehydrogenase~~~COG1028
MNLTDKTVLITGGASGIGYAAVQAFLGQQANVVVADIDEAQGEAMVRKENNDRLHFVQTDITDEAACQHAVESAVHTFGG
LDVLINNAGIEIVAPIHEMELSDWNKVLQVNLTGMFLMSKHALKHMLAAGKGNIINTCSVGGLVAWPDIPAYNASKGGVL
QLTKSMAVDYAKHQIRVNCVCPGIIDTPLNEKSFLENNEGTLEEIKKEKAKVNPLLRLGKPEEIANVMLFLASDLSSYMT
GSAITADGGYTAQ
>Q8KWS8 6.3.2.49~~~bacD~~~Alanine--anticapsin ligase~~~COG0151
MERKTVLVIADLGGCPPHMFYESAAEKYNLVSFIPRPFAITASHAALIEKYSIAVIKDKDYFKSLADFEHPDSIYWAHED
HDKPEEEVVEEIVKVADMFAVDAITTNNELFIAPMAKACKRLGLRGAGVQAAENARDKNKMRAAFNRAGVKSIKNKRVTT
LEDFRAALQEIGTPLILKPTYLASSIGVTLIKEMETAEAEFNRVNEYLKSINVPKAVTFEAPFIAEEFLQGEYDDWYETS
GYSDYISIEGIMADGEYFPVAIHDKTPQIGFTETAHITPSILDDDAKRKIVEAAKKANEGLGLENCATHTEIKLMKNREA
GLIESAPRFAGWNMIPNIKKVFGVDMAQLLLDVLCFGKEADLPKGLLEQEPCYVADCHLYPQHFKENGQLPETVVDFVIE
SIEIPDGVLKGDTELVSFSAAEAGTSVDLRLFEAFNSIAAFELKGSNSNDVAESIKQIQQQAKLTAKYALSV
>Q8KWT3 6.3.2.49~~~bacD~~~Alanine--anticapsin ligase~~~
MERKTVLVIADLGGCPPHMFYKSAAEKYNLVSFIPRPFAITASHAALIEKYSVAVIKDKDYFKSLADFEHPDSIYWAHED
HDKPEEEVVEEIVKVAGMFAVDAITTNNELFIAPMAKACERLGLRGAGVQAAENARDKNKMRAAFNRAGVKSIKNKRVTT
LEDFRAALQEIGTPLILKPTYLASSIGVTLIKEMETAEAEFNRVNEYLKSINVPKAVTFEAPFIAEEFLQGEYDDWYETS
GYSDYISIEGIMADGEYFPVAIHDKTPQIGFTETSHITPSILDDDAKRKIVEAAKKANEGLGLENCATHTEIKLMKNREA
GLIESAARFAGWNMIPNIKKVFGVDMAQLLLDVLCFGKEADLPKGLLEQEPCYVADCHLYPQHFKENGQLPETAVDFVIE
SIDIPDGVLKGDTEIVSFSAAEAGTSVDLRLFEAFNSIAAFELKGSNSGDVAESIKQIQQQAKLTAKYALPV
>P39641 6.3.2.49~~~bacD~~~Alanine--anticapsin ligase~~~COG0151
MERKTVLVIADLGGCPPHMFYKSAAEKYNLVSFIPRPFAITASHAALIEKYSVAVIKDKDYFKSLADFEHPDSIYWAHED
HNKPEEEVVEQIVKVAEMFGADAITTNNELFIAPMAKACERLGLRGAGVQAAENARDKNKMRDAFNKAGVKSIKNKRVTT
LEDFRAALEEIGTPLILKPTYLASSIGVTLITDTETAEDEFNRVNDYLKSINVPKAVTFEAPFIAEEFLQGEYGDWYQTE
GYSDYISIEGIMADGEYFPIAIHDKTPQIGFTETSHITPSILDEEAKKKIVEAAKKANEGLGLQNCATHTEIKLMKNREP
GLIESAARFAGWNMIPNIKKVFGLDMAQLLLDVLCFGKDADLPDGLLDQEPYYVADCHLYPQHFKQNGQIPETAEDLVIE
AIDIPDGLLKGDTEIVSFSAAAPGTSVDLTLFEAFNSIAAFELKGSNSQDVAESIRQIQQHAKLTAKYVLPV
>P39642 ~~~bacE~~~Putative bacilysin exporter BacE~~~COG2814
MKQLKPNSKYLLYGQALSFMGDYCVLPALLILSTYYHDYWVTSGVIVVRSIPMVFQPFLGVLVDRLDRIKIMLWTDIIRG
IIFLGLTFLPKGEYPLIFLALLFITYGSGVFFNPARLAVMSSLESDIKSINTLFAKATTISIIVGAAAGGLFLLGGSVEL
AVAFNGVTYLVSAFFISRIKLQFVPIQSENIKEAFQSFKEGLKEIKTNSFVLNAMFTMITMALLWGVVYSYFPIVSRFLG
DGEIGNFILTFCIGFGGFIGAALVSKWGFNNNRGLTYFTVLSIVSLALFLFTPIFAVSVIAAILFFIAMEYGEVLAKVKV
QENAANQIQGRIFSVAEASIGLCISIGSMFINILSAPVIMGLIVVIVCGLFLHTKLVNKSFLERDNKTEQKGVF
>P39643 2.6.1.-~~~bacF~~~Transaminase BacF~~~COG0436
MEITPSDVIKTLPRQEFSLVFQKVKEMEKTGAHIINLGQGNPDLPTPPHIVEALREASLNPSFHGYGPFRGYPFLKEAIA
AFYKREYGVTINPETEVALFGGGKAGLYVLTQCLLNPGDIALVPNPGYPEYLSGITMARAELYEMPLYEENGYLPDFEKI
DPAVLEKAKLMFLNYPNNPTGAVADAAFYAKAAAFAKEHNIHLIHDFAYGAFEFDQKPASFLEAEDAKTVGAELYSFSKT
FNMAGWRMAFAVGNEKIIQAVNEFQDHVFVGMFGGLQQAASAALSGDPEHTESLKRIYKERIDFFTALCEKELGWKMEKP
KGTFYVWAEIPNTFETSHQFSDYLLEHAHVVVTPGEIFGSNGKRHVRISMVSKQEDLREFVTRIQKLNLPFGSLQETSR
>P39644 1.3.1.-~~~bacG~~~NADPH-dependent reductase BacG~~~COG1028
MSKRTAFVMGASQGIGKAIALKLADQHFSLVINSRNLDNIESVKEDILAKHPEASVIVLAGDMSDQHTRAGIFQKIESQC
GRLDVLINNIPGGAPDTFDNCNIEDMTATFTQKTVAYIDAIKRASSLMKQNEFGRIINIVGNLWKEPGANMFTNSMMNAA
LINASKNISIQLAPHNITVNCLNPGFIATDRYHQFVENVMKKNSISKQKAEEQIASGIPMKRVGSAEETAALAAFLASEE
ASYITGQQISADGGSMKSI
>C0HL87 ~~~~~~Bacteriocin~~~
KKNMLLVNPIVGIGGLFVGAPMLTANLGISSYAAKKVIDDINTGSAVATIIALVTAVVGGGLITAGIVATTKSLIKKYGA
KYSAAW
>P86291 ~~~~~~Bacteriocin~~~
TSYGNGVHCNKSKCWIDVSELETYKAGTVSNPKDILWSLKE
>P86386 ~~~~~~Bacteriocin mutacin F-59.1~~~
KYYGNGVTCGKHSXSVDWSKATTNI
>Q5MWV9 ~~~badA~~~Autotransporter adhesin BadA~~~
MKKLSVTSKRQYNLYASPISRRLSLLMKLSLETVTVMFLLGASPVLASNLALTGAKNLSQNSPGVNYSKGSHGSIVLSGD
DDFCGADYVLGRGGNSTVRNGIPISVEEEYERFVKQKLMNNATSPYSQSSEQQVWTGDGLTSKGSGYMGGKSTDGDKNIL
PEAYGIYSFATGCGSSAQGNYSVAFGANATALTGGSQAFGVAALASGRVSVAIGVGSEATGEAGVSLGGLSKAAGARSVA
IGTRAKAQGEESIAIGSSVKNGDKDGSAVAQGAKAIAIGSNSISFQHYAVAVGAKAHALLSKTVALGYDSVADVDAGIRG
YDPVEDEPSKDVSFVWKSSLGAVSVGNRKEGLTRQIIGVAAGTEDTDAVNVAQLKALRGMISEKGGWNLTVNNDNNTVVS
SGGALDLSSGSKNLKIVKDGKKNNVTFDVARDLTLKSIKLDGVTLNETGLFIANGPQITASGINAGSQKITGVAEGTDAN
DAVNFGQLKKIETEVKEQVAASGFVKQDSDTKYLTIGKDTDGDTINIANNKSDKRTLTGIKEGDISKDSSEAITGSQLFT
TNQNVKTVSDNLQTAATNIAKTFGGGAKYEDGEWIAPAFKVKTVTGEGKEEEKRYQNVADALAGVGSSITNVQNKVTEQV
NNAITKVEGDALLWSDEANAFVARHEKSKLGKGASKATQENSKITYLLDGDVSKDSTDAITGKQLYSLGDKIASYLGGNA
KYEDGEWTAPTFKVKTVKEDGKEEEKTYQNVAEALTGVGTSFTNVKNEITKQINHLQSDDSAVVHYDKNKDETGGINYAS
VTLGKGKDSAAVTLHNVADGSISKDSRDAINGSQIYSLNEQLATYFGGGAKYENGQWTAPIFKVKTVKEDGEEEEKTYQN
VAEALTGVGTSFTNIKSEITKQIANEISSVTGDSLVKKDLATNLITIGKEVAGTEINIASVSKADRTLSGVKEAVKDNEA
VNKGQLDKGLKHLSDSLQSDDSAVVHYDKKTDETGGINYTSVTLGGKDKTPVALHNVADGSISKDSHDAINGGQIHTIGE
DVAKFLGGAASFNNGAFTGPTYKLSNIDAKGDVQQSEFKDIGSAFAGLDTNIKNVNNNVTNKFNELTQNITNVTQQVKGD
ALLWSDEANAFVARHEKSKLGKGASKATQENSKITYLLDGDVSKDSTDAITGKQLYSLGDKIASYLGGNAKYENGEWTAP
TFKVKTVKEDGKEEEKTYQNVAEALTGVGASFTNVKNEITKQINHLQSDDSAVVHYDKNKDETGGINYASVTLGKGKDSA
AVTLHNVADGSISKDSRDAINGSQIYSLNEQLATYFGGGAKYENGQWTAPIFKVKTVKEDGEEEEKTYQNVAEALTGVGT
SFTNIKSEITKQIANEISSVTGDSLVKKDLATNLITIGKEVAGTEINIASVSKADRTLSGVKEAVKDNEAVNKGQLDTNI
KKVEDKLTEAVGKVTQQVKGDALLWSNEDNAFVADHGKDSAKTKSKITHLLDGNIASGSTDAVTGGQLYSLNEQLATYFG
GGAKYENGQWTAPTFKVKTVNGEGKEEEQTYQNVAEALTGVGASFMNVQNKITNEITNQVNNAITKVEGDSLVKQDNLGI
ITLGKERGGLKVDFANRDGLDRTLSGVKEAVNDNEAVNKGQLDADISKVNNNVTNKFNELTQNITNVTQQVKGDALLWSD
EANAFVARHEKSKLEKGVSKATQENSKITYLLDGDISKGSTDAVTGGQLYSLNEQLATYFGGGAKYENGQWTAPTFKVKT
VNGEGKEEEQTYQNVAAAFEGVGTSFTNIKSEITKQINNEIINVKGDSLVKRDLATNLITIGKEIEGSVINIANKSGEAR
TISGVKEAVKDNEAVNKGQLDTNIKKVEDKLTEAVGKVTQQVKGDALLWSNEDNAFVADHGKDSAKTKSKITHLLDGNIA
SGSTDAVTGGQLYSLNEQLATYFGGGAKYENGQWTAPTFKVKTVNGEGKEEEKTYQNVAAAFEGVGTSFTNIKSEITKQI
ANEISNVTGDSLVKKDLDTNLITIGKEIAGTEINIASVSKADRTLSGVKEAVNDNEAVNKGQLDANISKVNNNVTNKFNE
LTQSITNVTQQVKGDALLWSDEANAFVARHEKSKLEKGVSKATQENSKITYLLDGDISKGSTDAVTGGQLYSLNEQLATY
FGGGAKYENGQWTAPTFKVKTVNGEGKEEEQTYQNVAAAFEGVGTSFTNIKSEITKQINNEIINVKGDSLVKRDLATNLI
TIGKEIEGSVINIANKSGEARTISGVKEAVKDNEAVNKGQLDTNIKKVEDKLTEAVGKVTQQVKGDALLWSNEDNAFVAD
HGKDSAKTKSKITHLLDGNIASGSTDAVTGGQLYSLNEQLATYFGGGAKYENGQWTAPTFKVKTVNGEGKEEEKTYQNVA
AAFEGVGTSFTHVKNEITKQINHLQSDDSAVVHYDKDDKNGSINYASVTLGKGKDSAAVALHNVADGSISKDSHDAINGG
QIHTIGEDVAKFLGGDAAFKDGAFTGPTYKLSNIDAKGDVQQSEFKDIGSAFAGLDTNIKNVNNNVTNKLSELTQNITTV
TQQVKGNALLWSDEANAFVARHEKSKLEKGASKAIQENSKITYLLDGDVSKGSTDAVTGGQLYSMSNMLATYLGGNAKYE
NGEWTAPTFKVKTVNGEGKEEEQTYQNVAEALTGVGTSFTNIKSEIAKQINHLQSDDSAVIHYDKNKDETGTINYASVTL
GKGEDSAAVALHNVAAGNIAKDSRDAINGSQLYSLNEQLLTYFGGDAGYKDGQWIAPKFHVLQFKSDGSSGEKESYDNVA
AAFEGVNKSLAGMNERINNVTAGQNVSSSSLNWNETEGGYDARHNGVDSKLTHVENGDVSEKSKEAVNGSQLWNTNEKVE
AVEKDVKNIEKKVQDIATVADSAVKYEKDSTGKKTNVIKLVGGSESEPVLIDNVADGKIEADSKQAVNGGQLRDYTEKQM
KIVLDDAKKYTDERFNDVVNNGINEAKAYTDVKFEALSYTVEEVRKEARQAAAIGLAVSNLRYYDIPGSLSLSFGTGIWR
SQSAFAIGAGYTSEDGNIRSNLSITSSGGQWGVGAGITLRLK
>O07458 ~~~badR~~~Transcriptional activatory protein BadR~~~COG1846
MMAKKRVATDNAADAKMELANRLFFRLYQCANMLHKTGTRAVEAEGLTTQQWAVLGALSRPTVANGMSVGDLARYLMVSR
QNLTGLIGRMERDGHVAVVPDERDRRSRLVTMTKSGRHVWEVLAQPKIRAYYGEVLGDFSINDVTHTLHYLLKILDNMKR
LDDGAAGETAATDLE
>A7Z4X7 3.-.-.-~~~baeB~~~Probable polyketide biosynthesis zinc-dependent hydrolase BaeB~~~
MDHTYEVHQIKTYHQMWSNYCYIIADRSKKSAIAVDPSWEIDKITDKLHELDVDLSAILLTHSHYDHVNLAEPLQQIYHS
DIYMSSAEIDFYQFRCRNLIALEDGQTFAAGGFIIRSILTPGHTAGGMCYLLSDHLFTGDTVFTEGCGICGDRGSSAEDM
FHSIQRIKASIPPHVRVYPGHSFGEKPGQKMESLLKNNIYFQIEKKEHFVNFRNRKNQKGLFHFK
>A7Z4X8 2.3.1.39~~~baeC~~~Polyketide biosynthesis malonyl CoA-acyl carrier protein transacylase BaeC~~~
MITYLFPGQGSQKQGMGSSLFDEFKDLTEQADETLGYSMKRLCLENPYSNLHKTQFTQPALYVVNVLSYLKKIQDNDIKP
DYVAGHSLGEYNALFAAGAFDFITGLQLVRKRGELMSMATDGKMAAVMGLTAAQVSDALQTHGLHTIDIANMNSPHQVVI
SGRKEDIERAKSVFEGLKDVTMFHPLNVSGAFHSRYMSEAKQEFEKFLQSFHFSAISIPVISNVHARPYEQDGIHSVLAD
QIDHSVRWNDSIRYLLDKGRMEFEEVGPGHVLTGLIHRIKNETEASPAM
>A7Z4X9 2.3.1.-~~~baeD~~~Polyketide biosynthesis acyltransferase homolog BaeD~~~
MNQPIVFMFSGQGSQYYQMGKELFAHNAAFRQKMLDLDDFAVSRFGYSVLKEMYHTGNRLSDPFDRLLFSHPAIFMAEYA
LAYALEQRGIRPDYVIGASLGEYAAAAVSGVLSAEDALDCVLEQARIVTETCRNGSMLAILGDPALYQDDPLLGEHSELA
SVNYHSHFVISGEREHIKKIMDDLREKQIPHQLLPVSYGFHSALVDQAEQPYKRFLAQKSIRTPFIPYISSATGEAETDI
QADFFWDIVRKPIRFREALQFADSRQKGLYIDAGPSGTLAAFAKQILPAGSAERIRAIMTPFHKEQTHLQQIEDSILSPP
GRRL
>A7Z4Y0 ~~~baeE~~~Polyketide biosynthesis protein BaeE~~~
MISFVFPGQGSQRIGMGEDLFGRYPELTAKADHILGYSIQELCRDGERLNQTQFTQPALYVVNALSYLKKTEETGLKPDF
TAGHSLGEYNALYASGAFDFEEGLQLVKKRGELMSRAKGGGMAAVIGLTHEQVTDVLRENHLDMIDIANMNTPQQIVISG
YKEDIEKAASVFEAVNGVKMVHRLNVSGAFHSRYMLEAKEEFSRFIESFHFKPLSIPVISNVTARPYEQRELKETLTGQI
TGSVNWTDSIRFLMGRKNMSFEEIGPGKVLTGLIQRITAEAEPITDEIKVPAEAGKSSITAASLGNEEFKRDYQLKYAYL
AGGMYRGISSKEMVVKLAEKGMMGFFGTGGLNIAHVEDAILSIQHELRDGGSFGINVVHNMKHTDSEEKMIDLLLKHGIQ
NLEASAFLTVTPALVRFRAKGLKRGAGGQIIARQRIIAKLSRPEVAEAFLSPAPDHILQKLAAENKITAEEASLMREIPV
AHDICVEADSGGHTDGGVAYSLMPAIIRLRDDMMKKYRYGKTVRIGAAGGIGTPEAAMAAFMLGADFIVTGSINQCTVEA
ATSGLVKDLLQQMNVQDTAYAPAGDMFESGSKVQVLKKGLFFPTRASKLHELYQRHRSIEEIDEKTLRQIEEKYFKASVS
SIYDKVKAHYSNEDISKAERNPKEKMALIFKWYFRQSSASAIKGDPDAKVDFQIHCGPALGAFNQWVKGTELESWKNRHA
DGIGMRLMEETASLLNQKLGSFLQTC
>P69229 ~~~baeR~~~Transcriptional regulatory protein BaeR~~~COG0745
MTELPIDENTPRILIVEDEPKLGQLLIDYLRAASYAPTLISHGDQVLPYVRQTPPDLILLDLMLPGTDGLTLCREIRRFS
DIPIVMVTAKIEEIDRLLGLEIGADDYICKPYSPREVVARVKTILRRCKPQRELQQQDAESPLIIDEGRFQASWRGKMLD
LTPAEFRLLKTLSHEPGKVFSREQLLNHLYDDYRVVTDRTIDSHIKNLRRKLESLDAEQSFIRAVYGVGYRWEADACRIV
>P69228 ~~~baeR~~~Transcriptional regulatory protein BaeR~~~COG0745
MTELPIDENTPRILIVEDEPKLGQLLIDYLRAASYAPTLISHGDQVLPYVRQTPPDLILLDLMLPGTDGLTLCREIRRFS
DIPIVMVTAKIEEIDRLLGLEIGADDYICKPYSPREVVARVKTILRRCKPQRELQQQDAESPLIIDEGRFQASWRGKMLD
LTPAEFRLLKTLSHEPGKVFSREQLLNHLYDDYRVVTDRTIDSHIKNLRRKLESLDAEQSFIRAVYGVGYRWEADACRIV
>P30847 2.7.13.3~~~baeS~~~Signal transduction histidine-protein kinase BaeS~~~COG2205
MKFWRPGITGKLFLAIFATCIVLLISMHWAVRISFERGFIDYIKHGNEQRLQLLSDALGEQYAQHGNWRFLRNNDRFVFQ
ILRSFEHDNSEDKPGPGMPPHGWRTQFWVVDQNNKVLVGPRAPIPPDGTRRPILVNGAEVGAVIASPVERLTRNTDINFD
KQQRQTSWLIVALATLLAALATFLLARGLLAPVKRLVDGTHKLAAGDFTTRVTPTSEDELGKLAQDFNQLASTLEKNQQM
RRDFMADISHELRTPLAVLRGELEAIQDGVRKFTPETVASLQAEVGTLTKLVDDLHQLSMSDEGALAYQKAPVDLIPLLE
VAGGAFRERFASRGLKLQFSLPDSITVFGDRDRLMQLFNNLLENSLRYTDSGGSLQISAGQRDKTVRLTFADSAPGVSDD
QLQKLFERFYRTEGSRNRASGGSGLGLAICLNIVEAHNGRIIAAHSPFGGVSITVELPLERDLQREV
>B5CY73 3.2.1.81~~~~~~Beta-agarase~~~COG2273
MKRKLFTICLASLQFACAAENLNNKSYEWDIYPVPANAGDGMVWKLHPQSDDFNYIADEKDKGKEFYAKWTDFYHNHWTG
PAPTIWQRDHVSVSDGFLKIRASRPEDVPLKKVVSGPNTKELPGTYTGCITSKTRVKYPVYVEAYAKLSNSTMASDVWML
SPDDTQEIDIIEAYGGDRDGGGYGADRLHLSHHIFIRQPFKDYQPKDSGSWYKDDKGTLWRDDFHRVGVFWKDPFTLEYY
VDGELVRTISGKDIIDPNNYTGGTGLVKDMDIIINMEDQSWRAVKGLSPTDEELKNVEDHTFLVDWIRVYTLVPEE
>P27951 ~~~bag~~~IgA FC receptor~~~
MFKSNYERKMRYSIRKFSVGVASVAVASLFMGSVAHASELVKDDSVKTTEVAAKPYPSMAQTDQGNNSSSSELETTKMEI
PTTDIKKAVEPVEKTAGETSATDTGKREKQLQQWKNNLKNDVDNTILSHEQKNEFKTKIDETNDSDALLELENQFNETNR
LLHIKQHEEVEKDKKAKQQKTLKQSDTKVDLSNIDKELNHQKSQVEKMAEQKGITNEDKDSMLKKIEDIRKQAQQADKKE
DAEVKVREELGKLFSSTKAGLDQEIQEHVKKETSSEENTQKVDEHYANSLQNLAQKSLEELDKATTNEQATQVKNQFLEN
AQKLKEIQPLIKETNVKLYKAMSESLEQVEKELKHNSEANLEDLVAKSKEIVREYEGKLNQSKNLPELKQLEEEAHSKLK
QVVEDFRKKFKTSEQVTPKKRVKRDLAANENNQQKIELTVSPENITVYEGEDVKFTVTAKSDSKTTLDFSDLLTKYNPSV
SDRISTNYKTNTDNHKIAEITIKNLKLNESQTVTLKAKDDSGNVVEKTFTITVQKKEEKQVPKTPEQKDSKTEEKVPQEP
KSNDKNQLQELIKSAQQELEKLEKAIKELMEQPEIPSNPEYGIQKSIWESQKEPIQEAITSFKKIIGDSSSKYYTEHYFN
KYKSDFMNYQLHAQMEMLTRKVVQYMNKYPDNAEIKKIFESDMKRTKEDNYGSLENDALKGYFEKYFLTPFNKIKQIVDD
LDKKVEQDQPAPIPENSEMDQAKEKAKIAVSKYMSKVLDGVHQHLQKKNNSKIVDLFKELEAIKQQTIFDIDNAKTEVEI
DNLVHDAFSKMNATVAKFQKGLETNTPETPDTPKIPELPQAPDTPQAPDTPHVPESPKAPEAPRVPESPKTPEAPHVPES
PKAPEAPRVPESPKTPEAPHVPESPKTPEAPKIPEPPKTPDVPKLPDVPKLPDVPKLPDAPKLPDGLNKVGQAVFTSTDG
NTKVTVVFDKPTDADKLHLKEVTTKELADKIAHKTGGGTVRVFDLSLSKGGKETHVNGERTVRLALGQTGSDVHVYHVKE
NGDLERIPSKVENGQVVFKTNHFSLFAIKTLSKDQNVTPPKQTKPSTQGSQVEIAESQTGKFQSKAANHKALATGNETVA
KGNPTSTTEKKLPYTGVASNLVLEIMGLLGLIGTSFIAMKRRKS
>A1SGT4 3.5.2.1~~~~~~Barbiturase 1~~~
MPDAIEVRKVPIHSVADASELAKLIDDGVMQAERVIAIIGKTEGNGGVNDYTRIIADRAFREVLVEKGAPAEQVKQVPIV
WSGGTDGVISPHATIFATVPPEDLTGALAPSDEQRLTVGFAMSERLAPEDIGRTAMITKVADAVKVAMERAGISDPADVH
YVQTKTPLLTIHTIRDAKSRGKTVWTEHTHESMDLSNGCTALGIAVALGEIEMPSDEDVMHDRSLYSSVASCSSGVELDQ
AQVVVVGNAPGVGGRYRIGHSVMKDALDQDGIWEAIKDAGLDLPERPRTSDLDGRLVNVFLKCEASQDGLVRGRRNAMLD
DSDVHWHRQIKSCVGGVTAAVTGDPAVFVSVSAAHQGPDGGGPVAAIVDLG
>A1SPN2 3.5.2.1~~~~~~Barbiturase 2~~~
MTRPIEVRKVPIEHVSDAAGLADLIDAGVFSADDVIAVVGKTEGNGGVNDYTRIISTHAYRAVLEEKGTRSKEEVAQVPL
VWSGGTDGVISPHATIFAYAPEGRYLPTDEPRVTVGYAMSEVLLPEDIGRPAMVEKVAAGVRVAMERAGITDPADVHYVQ
TKTPLLVQDTINDAERRGETVYTHNTLESMDVSNATTALGIAVALGEIEMPTAEQIFHDLSLYSSVASCSSGVELDQAQI
VVVGNARGVGGRFRVGHSIMKDALDMDGVWAAIRDAGLDDMPVDCIHPRHIKGRLVNLFLKCEADPTGRVRGRRNIMLDD
SDVAWHRQIKACVGGVVAAVSGDPMNFVSVAAVHQGPSGGGPVIAIVDLEA
>P04252 ~~~vhb~~~Bacterial hemoglobin~~~
MLDQQTINIIKATVPVLKEHGVTITTTFYKNLFAKHPEVRPLFDMGRQESLEQPKALAMTVLAAAQNIENLPAILPAVKK
IAVKHCQAGVAAAHYPIVGQELLGAIKEVLGDAATDDILDAWGKAYGVIADVFIQVEADLYAQAVE
>D2TV87 2.4.99.-~~~~~~Autotransproter heptosyltransferase BAHTCr~~~
MQKSLATFFISPPEIPTQHGPDNILYDFNDGARVLLPEGKWHVRLLDADSGNILFCCDINNGWVTSSKKYFVRFRIQVFR
QGEDSPLLDETLNLTDRDVLISFPTGTLGDLLGWFPYAERFQSLHQCRLECTMAQDIIDLLAPQYPQICFSTPEKPRTTE
PYATYRVGLYFGGDTNNQPVDFRQVGFHRSAGYILGVDPREAPVRLNLSAPCTIREPYVCIATQSTCQAKYWNNGTGWSE
VVAHLKSLGYRVLCIDREAHYGQGFVWNHIPWGAEDFTGSFPLQERVNLLRHASFFIGLASGLSWLAWATGIPVVLISGF
SLPDSEFYTPWRVFNSHGCNGCWDDTSLNFDHKDFLWCPRHKNTDRQFECTRLITGTQVNGVISRLHASLMKQGDKACLT
KGTNNEQGL
>Q8RSQ2 3.5.2.1~~~bar~~~Barbiturase~~~
MPEAIEVRKVPLHSVSDASELAKLIDDGVLEADRVIAVIGKTEGNGGVNDYTRIIADRAFREVLSAKGNRSPEEVAEVPI
VWSGGTDGVISPHATIFATVPADKVTKTDEPRLTVGVAMSEQLLPEDIGRTAMITKVAAAVKDAMADAGITDPADVHYVQ
TKTPLLTIHTIRDAKSRGKTVWTEQTHESMDLSNGGTALGIAVALGEIDMPTDEDVMHSRELFSSVASCSSGVELDRAQI
VVVGNARGVGGRYRIGHSVMKDPLDQDGIWAAIRDAGLELPERPHSNDLDGQLVNVFLKCEASQDGTVRGRRNAMLDDSD
VHWHRQIKSCVGGVTAAVTGDPAVFVSVSAAHQGPEGGGPVAAIVDLGQ
>P07914 1.1.1.395~~~baiA1~~~3alpha-hydroxy bile acid-CoA-ester 3-dehydrogenase 1/3~~~
MKLVQDKITIITGGTRGIGFAAAKLFIENGAKVSIFGETQEEVDTALAQLKELYPEEEVLGFAPDLTSRDAVMAAVGTVA
QKYGRLDVMINNAGITMNSVFSRVSEEDFKNIMDINVNGVFNGAWSAYQCMKDAKQGVIINTASVTGIYGSLSGIGYPTS
KAGVIGLTHGLGREIIRKNIRVVGVAPGVVDTDMTKGLPPEILEDYLKTLPMKRMLKPEEIANVYLFLASDLASGITATT
ISVDGAYRP
>P19337 1.1.1.395~~~baiA2~~~3alpha-hydroxy bile acid-CoA-ester 3-dehydrogenase 2~~~
MNLVQDKVTIITGGTRGIGFAAAKIFIDNGAKVSIFGETQEEVDTALAQLKELYPEEEVLGFAPDLTSRDAVMAAVGQVA
QKYGRLDVMINNAGITSNNVFSRVSEEEFKHIMDINVTGVFNGAWCAYQCMKDAKKGVIINTASVTGIFGSLSGVGYPAS
KASVIGLTHGLGREIIRKNIRVVGVAPGVVNTDMTNGNPPEIMEGYLKALPMKRMLEPEEIANVYLFLASDLASGITATT
VSVDGAYRP
>P19409 6.2.1.7~~~baiB~~~Bile acid--coenzyme A ligase~~~
MHKKSACEREGKELKRDFFNKFNLGTSNFVTPGKQLEYVSECKPDSTAVICLDKEQNCSVITWHQLHVYSSQLAWYLIEN
EIGPGSIVLTMFPNSIEHIIAVFAIWKAGACYMPMSYKAAESEIREACDTIHPNAAFAECKIPGLKFCLSADEIYEAMEG
RSKEMPSDRLANPNMISLSGGTSGKMKFIRQNLPCGLDDETIRSWSLMSGMGFEQRQLLVGPLFHGAPHSAAFNGLFMGN
TLVLTRNLCPGNILNMIKKYKIEFIQMVPTLMNRLAKLEGVGKEDFASLKALCHTGGVCSPWLKQIWIDLLGPEKIYEMY
SMTECIGLTCIRGDEWVKHPGSIGRPVGDSKVSIRDENGKEVAPFEIGEIYMTAPASYLVTEYINWEPLEVKEGGFRSVG
DIGYVDEQGYLYFSDRRSDMLVSGGENVFATEVETALLRYKDILDAVVVGIPDEDLGRRLHAVIETGKEIPAEELKTFLR
KYLTPYKIPKTFEFVRSIRRGDNGKADRKRILEDCIARGG
>P19410 1.3.1.115~~~baiCD~~~3-oxocholoyl-CoA 4-desaturase~~~
MSYEALFSPFKVRGLELKNRIVLPGMNTKMAKNKHDIGEDMIAYHVARAKAGCALNIFECVALCPAPHAYMYMGLYTDHH
VEQLKKLTDAVHEAGGKMGIQLWHGGFSPQMFFDETNTLETPDTLTVERIHEIVEEFGRGARMAVQAGFDAVEFHAAHSY
LPHEFLSPGMNKRTDEYGGSFENRCRFCYEVVQAIRSNIPDDMPFFMRADCIDELMEQTMTEEEIVTFINKCAELGVDVA
DLSRGNATSFATVYEVPPFNLAHGFNIENIYNIKKQINIPVMGVGRINTGEMANKVIEEGKFDLVGIGRAQLADPNWITK
VREGKEDLIRHCIGCDQGCYDAVINPKMKHITCTHNPGLCLEYQGMPKTDAPKKVMIVGGGMAGMIAAEVLKTRGHNPVI
FEASDKLAGQFRLAGVAPMKQDWADVAEWEAKEVERLGIEVRLNTEVTAETIKEFNPDNVIIAVGSTYALPEIPGIDSPS
VYSQYQVLKGEVNPTGRVAVIGCGLVGTEVAELLASRGAQVIAIERKGVGTGLSMLRRMFMNPEFKYYKIAKMSGTNVTA
LEQGKVHYIMTDKKTKEVTQGVLECDATVICTGITARPSDGLKARCEELGIPVEVIGDAAGARDCTIATREGYDAGMAI
>P19412 4.2.1.106~~~baiE~~~Bile acid 7alpha-dehydratase~~~
MTLEERVEALEKELQEMKDIEAIKELKGKYFRCLDGKMWDELETTLSPNIVTSYSNGKLVFHSPKEVTDYLKSSMPKEEI
SMHMGHTPEITIDSETTATGRWYLEDRLIFTDGKYKDVGINGGAFYTDKYEKIDGQWYILETGYVRIYEEHFMRDPKIHI
TMNMHK
>P19413 2.8.3.25~~~baiF~~~Bile acid CoA-transferase BaiF~~~
MAGIKDFPKFGALAGLKILDSGSNIAGPLGGGLLAECGATVIHFEGPKKPDNQRGWYGYPQNHRNQLSMVADIKSEEGRK
IFLDLIKWADIWVESSKGGQYDRLGLSDEVIWEVNPKIAIVHVSGYGQTGDPSYVTRASYDAVGQAFSGYMSLNGTTEAL
KINPYLSDFVCGLTTCWAMLACYVSTILTGKGESVDVAQYEALARIMDGRMIQYATDGVKMPRTGNKDAQAALFSFYTCK
DGRTIFIGMTGAEVCKRGFPIIGLPVPGTGDPDFPEGFTGWMIYTPVGQRMEKAMEKYVSEHTMEEVEAEMQAHQIPCQR
VYELEDCLNDPHWKARGTITEWDDPMMGHITGLGLINKFKRNPSEIWRGAPLFGMDNRDILKDLGYDDAKIDELYEQGIV
NEFDLDTTIKRYRLDEVIPHMRKKEE
>P32370 1.3.1.116~~~baiH~~~7-beta-hydroxy-3-oxochol-24-oyl-CoA 4-desaturase~~~
MDMKHSRLFSPLQIGSLTLSNRVGMAPMSMDYEAADGTVPKRLADVFVRRAEGGTGYVMIDAVTIDSKYPYMGNTTALDR
DELVPQFKEFADRVKEAGSTLVPQIIHPGPESVCGYRHIAPLGPSANTNANCHVSRSISIDEIHDIIKQFGQAARRAEEA
GCGAISLHCAHAYMLPGSFLSPLRNKRMDEYGGSLDNRARFVIEMIEEARRNVSPDFPIFLRISGDERMVGGNSLEDMLY
LAPKFEAAGVSMLEVSGGTQYEGLEHIIPCQNKSRGVNVYEASEIKKVVGIPVYAVGKINDIRYAAEIVERGLVDGVAMG
RPLLADPDLCKKAVEGQFDEITPCASCGGSCISRSEAAPECHCHINPRLGREYEFPDVPAEKSKKVLVIGAGPGGMMAAV
TAAERGHDVTVWEADDKIGGQLNLAVVAPGKQEMTQWMVHLNYRAKKAGVKFEFNKEATAEDVKALAPEAVIVATGAKPL
VPPIKGTQDYPVLTAHDFLRGKFVIPKGRVCVLGGGAVACETAETALENARPNSYTRGYDASIGDIDVTLVEMLPQLLTG
VCAPNREPLIRKLKSKGVHINVNTKIMEVTDHEVKVQRQDGTQEWLEGFDYVLFGLGSRNYDPLSETLKEFVPEVHVIGD
AVRARQASYAMWEGFEKAYSL
>B4YST4 2.8.3.25~~~baiK~~~Bile acid CoA-transferase BaiK~~~
MKGTGLNNFPQFGVMEGVKILVCGGAIAGPFGATLLGEIGAEVVHFESPKNPDSVRGHYGYSQNHRNQLSMVADMKTPEG
LEIFKKLIKWTDIFIESSKGGTYEKMGLTDEVLWEINPRLAIVHVSGFGQTGVPEYIDRASYDAVGQAFSGYMSFNGTPK
EAMKVSPYLSDYVTALNTCWTALAAYVHVLRTGKGESVDVAQYESLARILDTRPMEYFTDGKEFPRTGNKDTQAALFSFY
TCKDGGEIFIGMNGYGPVRRGYPLIGLPKPGDGDPEIDEILSGWMADTDLGRRLEAAMEKFVSEHTVDEVEKIMLENQIP
CLKVYTLKDCAKDPHWKARDIFVEWDDPMMGRVKGLGIINKWKNNPGEIKWGAPLFGENNEEVLKDLGYTEEEIEDFAKR
GITASFDFDQTYEIYKLEELFPHYREGFTERWKKEEE
>B0NAQ4 1.3.1.114~~~baiN~~~3-dehydro-bile acid delta(4,6)-reductase~~~COG2081
MNRIGIIGGGASGIVAAIAAARSDGDAQVFILEQKENIGKKILATGNGRCNLTNEAMDASCYHGEDPEFARNVLKQFGYG
ETLEFFASLGLFTKSRGGYIYPRSDQAASVLELLEMELRRQKVKIYTGVRVEALKLSAKGFVIRADGQRFPADRVILACG
GKASKSLGSDGSGYALARSMGHTLSPVVPALVQLKVKKHPFAKAAGVRTDAKVAALLGRQVLAEDTGEMQITAYGISGIP
VFQISRHIAKGLYEGKEMKVRVDFLPEMEASQVRKAFNTHLDKCPYATCQEFLTGIFPKKLIPRLLELSHIRQNFPASEL
KPAQWEDLIRACKQTLLTIEDTNGFDNAQVCAGGVRTGEVYPDTLESRYADGLYLTGELLDVEGICGGYNLQWAWATGYL
AGRAAAERP
>Q79FW0 2.6.1.21~~~~~~Bifunctional aminodeoxychorismate lyase / D-amino acid transaminase~~~COG0115
MVVTLDGEILQPGMPLLHADDLAAVRGDGVFETLLVRDGRACLVEAHLQRLTQSARLMDLPEPDLPRWRRAVEVATQRWV
ASTADEGALRLIYSRGREGGSAPTAYVMVSPVPARVIGARRDGVSAITLDRGLPADGGDAMPWLIASAKTLSYAVNMAVL
RHAARQGAGDVIFVSTDGYVLEGPRSTVVIATDGDQGGGNPCLLTPPPWYPILRGTTQQALFEVARAKGYDCDYRALRVA
DLFDSQGIWLVSSMTLAARVHTLDGRRLPRTPIAEVFAELVDAAIVSDR
>P46024 ~~~bamA~~~Outer membrane protein assembly factor BamA~~~
MKKLLIASLLFGTTTTVFAAPFVAKDIRVDGVQGDLEQQIRASLPVRAGQRVTDNDVANIVRSLFVSGRFDDVKAHQEGD
VLVVSVVAKSIISDVKIKGNSVIPTEALKQNLDANGFKVGDVLIREKLNEFAKSVKEHYASVGRYNATVEPIVNTLPNNR
AEILIQINEDDKAKLASLTFKGNESVSSSTLQEQMELQPDSWWKLWGNKFEGAQFEKDLQSIRDYYLNNGYAKAQITKTD
VQLNDEKTKVNVTIDVNEGLQYDLRSARIIGNLGGMSAELEPLLSALHLNDTFRRSDIADVENAIKAKLGERGYGSATVN
SVPDFDDANKTLAITLVVDAGRRLTVRQLRFEGNTVSADSTLRQEMRQQEGTWYNSQLVELGKIRLDRTGFFETVENRID
PINGSNDEVDVVYKVKERNTGSINFGIGYGTESGISYQASVKQDNFLGTGAAVSIAGTKNDYGTSVNLGYTEPYFTKDGV
SLGGNVFFENYDNSKSDTSSNYKRTTYGSNVTLGFPVNENNSYYVGLGHTYNKISNFALEYNRNLYIQSMKFKGNGIKTN
DFDFSFGWNYNSLNRGYFPTKGVKASLGGRVTIPGSDNKYYKLSADVQGFYPLDRDHLWVVSAKASAGYANGFGNKRLPF
YQTYTAGGIGSLRGFAYGSIGPNAIYAEYGNGSGTGTFKKISSDVIGGNAIATASAELIVPTPFVSDKSQNTVRTSLFVD
AASVWNTKWKSDKNGLESDVLKRLPDYGKSSRIRASTGVGFQWQSPIGPLVFSYAKPIKKYENDDVEQFQFSIGGSF
>P0A940 ~~~bamA~~~Outer membrane protein assembly factor BamA~~~COG4775
MAMKKLLIASLLFSSATVYGAEGFVVKDIHFEGLQRVAVGAALLSMPVRTGDTVNDEDISNTIRALFATGNFEDVRVLRD
GDTLLVQVKERPTIASITFSGNKSVKDDMLKQNLEASGVRVGESLDRTTIADIEKGLEDFYYSVGKYSASVKAVVTPLPR
NRVDLKLVFQEGVSAEIQQINIVGNHAFTTDELISHFQLRDEVPWWNVVGDRKYQKQKLAGDLETLRSYYLDRGYARFNI
DSTQVSLTPDKKGIYVTVNITEGDQYKLSGVEVSGNLAGHSAEIEQLTKIEPGELYNGTKVTKMEDDIKKLLGRYGYAYP
RVQSMPEINDADKTVKLRVNVDAGNRFYVRKIRFEGNDTSKDAVLRREMRQMEGAWLGSDLVDQGKERLNRLGFFETVDT
DTQRVPGSPDQVDVVYKVKERNTGSFNFGIGYGTESGVSFQAGVQQDNWLGTGYAVGINGTKNDYQTYAELSVTNPYFTV
DGVSLGGRLFYNDFQADDADLSDYTNKSYGTDVTLGFPINEYNSLRAGLGYVHNSLSNMQPQVAMWRYLYSMGEHPSTSD
QDNSFKTDDFTFNYGWTYNKLDRGYFPTDGSRVNLTGKVTIPGSDNEYYKVTLDTATYVPIDDDHKWVVLGRTRWGYGDG
LGGKEMPFYENFYAGGSSTVRGFQSNTIGPKAVYFPHQASNYDPDYDYECATQDGAKDLCKSDDAVGGNAMAVASLEFIT
PTPFISDKYANSVRTSFFWDMGTVWDTNWDSSQYSGYPDYSDPSNIRMSAGIALQWMSPLGPLVFSYAQPFKKYDGDKAE
QFQFNIGKTW
>D5CHY0 ~~~bamA~~~Outer membrane protein assembly factor BamA~~~COG4775
MAMKKLLIASLLFSSATVYGADGFVVKDIHFEGLQRVAVGAALLSMPVRPGDTVNDDDISNTIRALFATGNFEDVRVLRD
GDTLLVQVKERPTIASITFSGNKSVKDDMLKQNLEASGVRVGESLDRTTLSDIEKGLEDFYYSVGKYSASVKAVVTPLPR
NRVDLKLVFQEGVSAKIQQINIVGNHAFTTDELISTFQLRDEVPWWNVVGDRKYQKQKLAGDLETLRSYYLDRGYARFNI
DSTQVSLTPDKKGIYITINITEGDQYKLSGVEVSGNLAGHSAEIESLTKIQPGDLYSGSKVTKMEDGIKKLLGRYGYAYP
RVQTQPEINDADKTVKLHVNVDAGNRFYVRKIRFEGNDTSKDSVLRREMRQMEGAWLGSDLVDQGKERLNRLGYFETVDT
DTQRVPGSPDQVDVVYKVKERNTGSFNFGVGYGTESGVSFQVGVQQDNWLGTGYSVGINGTKNDYQTYSEFSVTNPYFTV
DGVSLGGRIFYNDFKADDADLSSYTNKSYGVDGTLGFPVNEYNTLRAGLGYVHNDLSNMQPQVAMWRYLDSIGQSASTSS
DNNGFAADDFTFNYGWTYNRLDRGYFPTEGSRVNLNGKVTIPGSDNEFYKLTLDTASYFPIDDDHKWVVLGRTRWGYGDG
LGGKEMPFYENFYAGGSSTVRGFQSNNIGPKAVYYGGNDEDNCASRDPKQVCSSDDAVGGNAMAVASLEFITPTPFISDK
YANSVRTSFFWDAGTVWDTNWENTAQMRAAGVPDYSDPGNIRMSAGIALQWMSPLGPLVFSYAQPFKKYDGDKSEQFQFN
IGKTW
>Q39TV7 3.7.1.21~~~bamA~~~6-oxocyclohex-1-ene-1-carbonyl-CoA hydrolase~~~COG1024
MARTDEIIARTAPGHLNDHNLIDREVESLCDGMVKYEKRPAKRHDGSVAEGIYNAWIILDNPKQYNSYTTDMVKAIILAF
RRASVDRSVNAVVFTGVGDKAFCTGGNTKEYAEYYAGNPQEYRQYMRLFNDMVSAILGCDKAVISRVNGMRIGGGQEIGM
ACDFSIAQDLANFGQAGPKHGSAAIGGATDFLPLMVGCEQAMVSGTLCEPFSAHKAARLGIICDVVPALKVGGKFVANPT
VVTDRYLDEYGRVVHGEFKAGAAFKEGQGQIKEGEIDLSLLDEKVESLCTKLLETFPECMTKSLEELRKPKLHAWNLNKE
NSRAWLALNMMNEARTGFRAFNEGTKETGREIDFVKLRQGLAKGTPWTEELIESLMPTAQK
>P44935 ~~~bamA~~~Outer membrane protein assembly factor BamA~~~COG4775
MKKLLIASLLFGTTTTVFAAPFVAKDIRVDGVQGDLEQQIRASLPVRAGQRVTDNDVANIVRSLFVSGRFDDVKAHQEGD
VLVVSVVAKSIISDVKIKGNSIIPTEALKQNLDANGFKVGDVLIREKLNEFAKSVKEHYASVGRYNATVEPIVNTLPNNR
AEILIQINEDDKAKLASLTFKGNESVSSSTLQEQMELQPDSWWKLWGNKFEGAQFEKDLQSIRDYYLNNGYAKAQITKTD
VQLNDEKTKVNVTIDVNEGLQYDLRSARIIGNLGGMSAELEPLLSALHLNDTFRRSDIADVENAIKAKLGERGYGSATVN
SVPDFDDANKTLAITLVVDAGRRLTVRQLRFEGNTVSADSTLRQEMRQQEGTWYNSQLVELGKIRLDRTGFFETVENRID
PINGSNDEVDVVYKVKERNTGSINFGIGYGTESGISYQASVKQDNFLGTGAAVSIAGTKNDYGTSVNLGYTEPYFTKDGV
SLGGNVFFENYDNSKSDTSSNYKRTTYGSNVTLGFPVNENNSYYVGLGHTYNKISNFALEYNRNLYIQSMKFKGNGIKTN
DFDFSFGWNYNSLNRGYFPTKGVKASLGGRVTIPGSDNKYYKLSADVQGFYPLDRDHLWVVSAKASAGYANGFGNKRLPF
YQTYTAGGIGSLRGFAYGSIGPNAIYAEHGNGNGTFKKISSDVIGGNAITTASAELIVPTPFVSDKSQNTVRTSLFVDAA
SVWNTKWKSDKSGLDNNVLKSLPDYGKSSRIRASTGVGFQWQSPIGPLVFSYAKPIKKYENDDVEQFQFSIGGSF
>Q9K1H0 ~~~bamA~~~Outer membrane protein assembly factor BamA~~~
MKLKQIASALMMLGISPLALADFTIQDIRVEGLQRTEPSTVFNYLPVKVGDTYNDTHGSAIIKSLYATGFFDDVRVETAD
GQLLLTVIERPTIGSLNITGAKMLQNDAIKKNLESFGLAQSQYFNQATLNQAVAGLKEEYLGRGKLNIQITPKVTKLARN
RVDIDITIDEGKSAKITDIEFEGNQVYSDRKLMRQMSLTEGGIWTWLTRSNQFNEQKFAQDMEKVTDFYQNNGYFDFRIL
DTDIQTNEDKTKQTIKITVHEGGRFRWGKVSIEGDTNEVPKAELEKLLTMKPGKWYERQQMTAVLGEIQNRMGSAGYAYS
EISVQPLPNAETKTVDFVLHIEPGRKIYVNEIHITGNNKTRDEVVRRELRQMESAPYDTSKLQRSKERVELLGYFDNVQF
DAVPLAGTPDKVDLNMSLTERSTGSLDLSAGWVQDTGLVMSAGVSQDNLFGTGKSAALRASRSKTTLNGSLSFTDPYFTA
DGVSLGYDVYGKAFDPRKASTSIKQYKTTTAGAGIRMSVPVTEYDRVNFGLVAEHLTVNTYNKAPKHYADFIKKYGKTDG
TDGSFKGWLYKGTVGWGRNKTDSALWPTRGYLTGVNAEIALPGSKLQYYSATHNQTWFFPLSKTFTLMLGGEVGIAGGYG
RTKEIPFFENFYGGGLGSVRGYESGTLGPKVYDEYGEKISYGGNKKANVSAELLFPMPGAKDARTVRLSLFADAGSVWDG
KTYDDNSSSATGGRVQNIYGAGNTHKSTFTNELRYSAGGAVTWLSPLGPMKFSYAYPLKKKPEDEIQRFQFQLGTTF
>Q8ZRP0 ~~~bamA~~~Outer membrane protein assembly factor BamA~~~
MAMKKLLIASLLFSSATVYGAEGFVVKDIHFEGLQRVAVGAALLSMPVRTGDTVNDEDISNTIRALFATGNFEDVRVLRD
GNTLLVQVKERPTIASITFSGNKSVKDDMLKQNLEASGVRVGESLDRTTLSDIEKGLEDFYYSVGKYSASVKAVVTPLPR
NRVDLKLVFQEGVSAKIQQINIVGNHAFSTEELISHFQLRDEVPWWNVVGDRKYQKQKLAGDLETLRSYYLDRGYARFNI
DSTQVSLTPDKKGIYITVNITEGDQYKLSGVQVSGNLAGHSAEIENLTKIEPGELYNGTKVTKMEDDIKKLLGRYGYAYP
RVQSQPEINDADKTVKLRVNVDAGNRFYVRKIRFEGNDTSKDSVLRREMRQMEGAWLGSDLVDQGKERLNRLGFFETVDT
DTQRVPGSPDQVDVVYKVKERNTGSFNFGIGYGTESGVSFQAGVQQDNWLGTGYSVGINGTKNDYQTYSELSVTNPYFTV
DGVSLGGRIFYNDFQADDADLSDYTNKSYGTDVTLGFPINEYNTLRAGLGYVHNKLSNMQPQIAMDRYLESMGQSADTSS
FAADDFTFNYGWTYNKLDRGYFPTDGSRVNLTGKVTIPGSDNEYYKVSLDTATYVPIDNDHKWVVLGRTRWGYGDGLGGK
EMPFYENFYAGGSSTVRGFQSNTIGPKAVYKNGAHTSWDDNDDYEDCTQESGCKSDDAVGGNAMAVASLEFITPTPFISE
KYANSVRTSFFWDMGTVWDTNWDPSSAPSDVPDYSDPGNIRMSAGIALQWMSPLGPLVFSYAQPFKKYDGDKAEQFQFNI
GKTW
>P0A943 ~~~bamA~~~Outer membrane protein assembly factor BamA~~~
MAMKKLLIASLLFSSATVYGAEGFVVKDIHFEGLQRVAVGAALLSMPVRTGDTVNDEDISNTIRALFATGNFEDVRVLRD
GDTLLVQVKERPTIASITFSGNKSVKDDMLKQNLEASGVRVGESLDRTTIADIEKGLEDFYYSVGKYSASVKAVVTPLPR
NRVDLKLVFQEGVSAEIQQINIVGNHAFTTDELISHFQLRDEVPWWNVVGDRKYQKQKLAGDLETLRSYYLDRGYARFNI
DSTQVSLTPDKKGIYVTVNITEGDQYKLSGVEVSGNLAGHSAEIEQLTKIEPGELYNGTKVTKMEDDIKKLLGRYGYAYP
RVQSMPEINDADKTVKLRVNVDAGNRFYVRKIRFEGNDTSKDAVLRREMRQMEGAWLGSDLVDQGKERLNRLGFFETVDT
DTQRVPGSPDQVDVVYKVKERNTGSFNFGIGYGTESGVSFQAGVQQDNWLGTGYAVGINGTKNDYQTYAELSVTNPYFTV
DGVSLGGRLFYNDFQADDADLSDYTNKSYGTDVTLGFPINEYNSLRAGLGYVHNSLSNMQPQVAMWRYLYSMGEHPSTSD
QDNSFKTDDFTFNYGWTYNKLDRGYFPTDGSRVNLTGKVTIPGSDNEYYKVTLDTATYVPIDDDHKWVVLGRTRWGYGDG
LGGKEMPFYENFYAGGSSTVRGFQSNTIGPKAVYFPHQASNYDPDYDYECATQDGAKDLCKSDDAVGGNAMAVASLEFIT
PTPFISDKYANSVRTSFFWDMGTVWDTNWDSSQYSGYPDYSDPSNIRMSAGIALQWMSPLGPLVFSYAQPFKKYDGDKAE
QFQFNIGKTW
>Q2LXU2 3.7.1.21~~~bamA~~~6-oxocyclohex-1-ene-1-carbonyl-CoA hydrolase~~~COG1024
MSLDWMPREHGLKNHSRHTEQWWGTEAPCTVYEKRPLKDPKGNVVPGLYSAWIRLNNPGQYNSYTTEMVKGVIAGFENSS
TDREVVAVVFTGTGPNAFCTGGNTKEYSEYYSMRPEEYGSYMELFNNMVDSILMCKKPVICRVNGMRVAGGQEIGTATDI
TVSSDLAIFGQAGPRHGSAPVGGASDFLPWFLSIEDAMWNCVSCEMWSAYKMKAKNLISKALPVLKDDKGNWVRNPQVYT
DTYVKDGEIVYGEPKTGEEAKQARAWVNEKLKNNDYDFSLIDAEVDRIVWVFANLFPGCLMKSIDGIRQKKKFWWDQIKN
DHRYWLGTNMMGEAFLGFGAFNTKKITGKDTIDFIKNRQLIAEGALVDEAFMEQVLGKPLAK
>O87872 3.7.1.21~~~oah~~~6-oxocyclohex-1-ene-1-carbonyl-CoA hydrolase~~~
MNPTTQKLVEQNAPAQLVDHNLVPETVCPGVLYEKRPARNLKGEVVPGLYNVWISLDNPKQYNSYTTDMVKGLILAFRAA
SCARDVASVVFTAVGDKAFCTGGNTKEYAEYYAGNPQEYRQYMRLFNDMVSAILGCDKPVICRVNGMRIGGGQEIGMAAD
FTVAQDLANFGQAGPKHGSAAIGGATDFLPLMIGCEQAMVSGTLCEPFSAHKANRLGICMQIVPALKVDGKFIANPLVVT
DRYLDEFGRIIHGEFKTGDELAAGKELMKRGEIDLSLLDEAVEKLCAKLISTFPECLTKSFEELRKPKLDAWNRNKENSR
AWLALNMMNEARTGFRAFNEGNKETGREIEFTDLRQALAKGMPWTPELIESLMPGAK
>P77774 ~~~bamB~~~Outer membrane protein assembly factor BamB~~~COG1520
MQLRKLLLPGLLSVTLLSGCSLFNSEEDVVKMSPLPTVENQFTPTTAWSTSVGSGIGNFYSNLHPALADNVVYAADRAGL
VKALNADDGKEIWSVSLAEKDGWFSKEPALLSGGVTVSGGHVYIGSEKAQVYALNTSDGTVAWQTKVAGEALSRPVVSDG
LVLIHTSNGQLQALNEADGAVKWTVNLDMPSLSLRGESAPTTAFGAAVVGGDNGRVSAVLMEQGQMIWQQRISQATGSTE
IDRLSDVDTTPVVVNGVVFALAYNGNLTALDLRSGQIMWKRELGSVNDFIVDGNRIYLVDQNDRVMALTIDGGVTLWTQS
DLLHRLLTSPVLYNGNLVVGDSEGYLHWINVEDGRFVAQQKVDSSGFQTEPVAADGKLLIQAKDGTVYSITR
>Q9HXJ7 ~~~bamB~~~Outer membrane protein assembly factor BamB~~~
MVQWKHAALLALALAVVGCSSNSKKELPPAELTDFKEEVVLSKQWSRSVGDGQGDLYNLLEPAVDGSTIYAASAEGRVMA
IQRETGDVLWKKDLERPVSGGVGVGYGLVLVGTLRGDVIALDEATGKKKWTKRVNSEVLSAPATNGDVVVVQTQDDKLIG
LDAASGDQRWIYESTVPVLTLRGTGAPLIAGNMALAGLASGKVVAVDVQRGLPIWEQRVAIPQGRSELDRVVDIDGGLLL
SGDTLYVVSYQGRAAALDVNSGRLLWQREASSYVGVAEGFGNIYVSQASGSVEGLDSRGASSLWNNDALARRQLSAPAVF
SSNVVVGDLEGYVHLLSQVDGRFVGRERVDSDGVRVRPLVVGSWMYVFGNGGKLVAYTIR
>P43973 ~~~~~~Outer membrane protein assembly factor BamC homolog~~~COG3317
MKKIILNLVTAIILAGCSSNPETLKATNDSFQKSETSIPHFSPLATGGVQLPKADDSYSLPNIEVKKGEDIDIRPPLIPL
AIIQNSITKFDGERSLIVYPKQQAKLYNLQQVERLLKEEGISSTTDGSILTTDWAKTERIGDKSIEIKYQIEQVMTADVS
ALTVSILHMRRDGIIFTPNVSDKQYYTSERLNRIVLTLTTAYNKQLRDLSSTLIQ
>P0A903 ~~~bamC~~~Outer membrane protein assembly factor BamC~~~COG3317
MAYSVQKSRLAKVAGVSLVLLLAACSSDSRYKRQVSGDEAYLEAAPLAELHAPAGMILPVTSGDYAIPVTNGSGAVGKAL
DIRPPAQPLALVSGARTQFTGDTASLLVENGRGNTLWPQVVSVLQAKNYTITQRDDAGQTLTTDWVQWNRLDEDEQYRGR
YQISVKPQGYQQAVTVKLLNLEQAGKPVADAASMQRYSTEMMNVISAGLDKSATDAANAAQNRASTTMDVQSAADDTGLP
MLVVRGPFNVVWQRLPAALEKVGMKVTDSTRSQGNMAVTYKPLSDSDWQELGASDPGLASGDYKLQVGDLDNRSSLQFID
PKGHTLTQSQNDALVAVFQAAFSK
>P0AC02 ~~~bamD~~~Outer membrane protein assembly factor BamD~~~COG4105
MTRMKYLVAAATLSLFLAGCSGSKEEVPDNPPNEIYATAQQKLQDGNWRQAITQLEALDNRYPFGPYSQQVQLDLIYAYY
KNADLPLAQAAIDRFIRLNPTHPNIDYVMYMRGLTNMALDDSALQGFFGVDRSDRDPQHARAAFSDFSKLVRGYPNSQYT
TDATKRLVFLKDRLAKYEYSVAEYYTERGAWVAVVNRVEGMLRDYPDTQATRDALPLMENAYRQMQMNAQAEKVAKIIAA
NSSNT
>P44553 ~~~bamD~~~Outer membrane protein assembly factor BamD~~~COG4105
MRKIKSLALLAVAALVIGCSSGSKDVEQASVNELYTKGTTSLQEGSYSEAIRYLKATTERFPGSVYQEQAMLDLIYANYK
TQDYTQVLLMVDSFLHQFTQSPNQAYAVYMAGLTNAATGDNFIQDFFGIDRATRETTSMRTAFSNFQNLVRVFPNSPYSQ
DALARMAYIKDALARHELEIAKFYAKRKAWVAVANRVVGMLKQYPDTKATYEGLFLMQEAYEKMGLTALANDTQKIIDAN
KDKTFAPIEKPNEPDLKVPAVK
>Q9K0B1 ~~~bamD~~~Outer membrane protein assembly factor BamD~~~
MKKILLTVSLGLALSACATQGTVDKDAQITQDWSVEKLYAEAQDELNSSNYTRAVKLYEILESRFPTSRHAQQSQLDTAY
AYYKDDEKDKALAAIDRFRRLHPQHPNMDYALYLRGLVLFNEDQSFLNKLASQDWSDRDPKANREAYQAFAELVQRFPNS
KYAADATARMVKLVDALGGNEMSVARYYMKRGAYIAAANRAQKIIGSYQNTRYVEESLAILELAYKKLDKPRLAADTRRV
LETNFPKSPFLKQPWRSDDMPWWRYWH
>P0A937 ~~~bamE~~~Outer membrane protein assembly factor BamE~~~COG2913
MRCKTLTAAAAVLLMLTAGCSTLERVVYRPDINQGNYLTANDVSKIRVGMTQQQVAYALGTPLMSDPFGTNTWFYVFRQQ
PGHEGVTQQTLTLTFNSSGVLTNIDNKPALSGN
>O68562 ~~~bamE~~~Outer membrane protein assembly factor BamE~~~
MQNAKLMLTCLAFAGLAALAGCSFPGVYKIDIQQGNVVTQDMIDQLRPGMTRRQVRFIMGNPLIVDTFHANRWDYLYSIQ
PGGGRRQQERVSLFFNDSDQLAGLNGDFMPGVSRDEAILGKEGSTTVTQPADQQKPEAQKEEPPKPGSTLEQLQREVDEA
QPVPVPTPEPLDPSPQ
>Q9I061 ~~~bamI~~~Biofilm-associated metzincin protease inhibitor~~~
MAKTWIYAASAAAIGGALIGGWLLDPAPPEASPQARQSPAAQAAAAPTAALAAPAADATRMLAPTPVTTPAPRERVTLWQ
GELRSREGAQGIPEYLAQVEPALLDTLALGQVLEMSLPGRERPLQARLASTHNSAGLPVWRGGLVDGDEAESLTVVRGSL
ETHINVATLDGSYSIIVDNRSGKTRVIDENDIAARSDPHGDHVDAPLAELPPMPPPAQG
>A0MTQ2 3.4.11.25~~~~~~Beta-peptidyl aminopeptidase BapA~~~
MHYLKFPAIIAGMLLAGAASAEGPRARDLGVPFAGKPGANNAITDVAGVEVGYVSLISGEGKLERGKGPVRTGVTAVLPR
GKESRTPVYAGWETSNAAGEMTGTVWLEERGYFDGPMMITNTHSVGVVRDAVVGWLADVKWPGAWFTPVVAETYDGMLND
INGFHVKPEHALRAIQTAASGPVAEGNVGGGVGMQCFGFKGGTGTASRVVEMDGKSYTVGVLVQCNFGMRPWLRVAGAPV
GEELAGKYLPETRGTQTAAATNNGVAPGDGSIIVVMATDAPMLPHQLKRLAKRAAAGMGRMGDAGSNGSGDIFVAFSTAN
ANVQSVGGNVISVETMPNDKLTLIFEAATQATEEAITNVLVAADTLTGVNGYTIQRLPHAELRAILKKYRRLAAAK
>Q52VH2 3.4.11.25~~~bapA~~~Beta-peptidyl aminopeptidase BapA~~~
MTSTQRLWSGALPLLTALIVSIAATASLAGPRARDLGVPFEGTPGALNAITDVAGVEVGHTTVISGDGAMVIGKGPYRTG
VTIIHPLGKTSLDGVAAGRAVINGTGEWTGMHLVDEVGQFLGPIALTGTGNVGLVHQSMMDWSVGKVPEEALFSRLLPVV
AETLDNRLNDVFGHGLTRDHVFAALDGAKGGPVAEGNVGGGTGMIAYTFKGGIGTSSRVVSAGDTRYTVGVLVQANHGDR
NDLRIAGVQIGKEIKGAWPEVNGIVAAGPDAGKPQDKNSLLIVIATDAPLMPHQLERMARRAALGVGRNGSTAGALSGEF
ALAFSTSHVIPLGGKPRLPAIINDTDSETMNALFRGVVQATEEALVNQLVASETMTGANNAKVYGIPHDQLARIMKARFP
RR
>P0AEC6 2.7.13.3~~~barA~~~Signal transduction histidine-protein kinase BarA~~~COG2205
MTNYSLRARMMILILAPTVLIGLLLSIFFVVHRYNDLQRQLEDAGASIIEPLAVSTEYGMSLQNRESIGQLISVLHRRHS
DIVRAISVYDENNRLFVTSNFHLDPSSMQLGSNVPFPRQLTVTRDGDIMILRTPIISESYSPDESPSSDAKNSQNMLGYI
ALELDLKSVRLQQYKEIFISSVMMLFCIGIALIFGWRLMRDVTGPIRNMVNTVDRIRRGQLDSRVEGFMLGELDMLKNGI
NSMAMSLAAYHEEMQHNIDQATSDLRETLEQMEIQNVELDLAKKRAQEAARIKSEFLANMSHELRTPLNGVIGFTRLTLK
TELTPTQRDHLNTIERSANNLLAIINDVLDFSKLEAGKLILESIPFPLRSTLDEVVTLLAHSSHDKGLELTLNIKSDVPD
NVIGDPLRLQQIITNLVGNAIKFTENGNIDILVEKRALSNTKVQIEVQIRDTGIGIPERDQSRLFQAFRQADASISRRHG
GTGLGLVITQKLVNEMGGDISFHSQPNRGSTFWFHINLDLNPNIIIEGPSTQCLAGKRLAYVEPNSAAAQCTLDILSETP
LEVVYSPTFSALPPAHYDMMLLGIAVTFREPLTMQHERLAKAVSMTDFLMLALPCHAQVNAEKLKQDGIGACLLKPLTPT
RLLPALTEFCHHKQNTLLPVTDESKLAMTVMAVDDNPANLKLIGALLEDMVQHVELCDSGHQAVERAKQMPFDLILMDIQ
MPDMDGIRACELIHQLPHQQQTPVIAVTAHAMAGQKEKLLGAGMSDYLAKPIEEERLHNLLLRYKPGSGISSRVVTPEVN
EIVVNPNATLDWQLALRQAAGKTDLARDMLQMLLDFLPEVRNKVEEQLVGENPEGLVDLIHKLHGSCGYSGVPRMKNLCQ
LIEQQLRSGTKEEDLEPELLELLDEMDNVAREASKILG
>P0AEC5 2.7.13.3~~~barA~~~Signal transduction histidine-protein kinase BarA~~~COG2205
MTNYSLRARMMILILAPTVLIGLLLSIFFVVHRYNDLQRQLEDAGASIIEPLAVSTEYGMSLQNRESIGQLISVLHRRHS
DIVRAISVYDENNRLFVTSNFHLDPSSMQLGSNVPFPRQLTVTRDGDIMILRTPIISESYSPDESPSSDAKNSQNMLGYI
ALELDLKSVRLQQYKEIFISSVMMLFCIGIALIFGWRLMRDVTGPIRNMVNTVDRIRRGQLDSRVEGFMLGELDMLKNGI
NSMAMSLAAYHEEMQHNIDQATSDLRETLEQMEIQNVELDLAKKRAQEAARIKSEFLANMSHELRTPLNGVIGFTRLTLK
TELTPTQRDHLNTIERSANNLLAIINDVLDFSKLEAGKLILESIPFPLRSTLDEVVTLLAHSSHDKGLELTLNIKSDVPD
NVIGDPLRLQQIITNLVGNAIKFTENGNIDILVEKRALSNTKVQIEVQIRDTGIGIPERDQSRLFQAFRQADASISRRHG
GTGLGLVITQKLVNEMGGDISFHSQPNRGSTFWFHINLDLNPNIIIEGPSTQCLAGKRLAYVEPNSAAAQCTLDILSETP
LEVVYSPTFSALPPAHYDMMLLGIAVTFREPLTMQHERLAKAVSMTDFLMLALPCHAQVNAEKLKQDGIGACLLKPLTPT
RLLPALTEFCHHKQNTLLPVTDESKLAMTVMAVDDNPANLKLIGALLEDMVQHVELCDSGHQAVERAKQMPFDLILMDIQ
MPDMDGIRACELIHQLPHQQQTPVIAVTAHAMAGQKEKLLGAGMSDYLAKPIEEERLHNLLLRYKPGSGISSRVVTPEVN
EIVVNPNATLDWQLALRQAAGKTDLARDMLQMLLDFLPEVRNKVEEQLVGENPEGLVDLIHKLHGSCGYSGVPRMKNLCQ
LIEQQLRSGTKEEDLEPELLELLDEMDNVAREASKILG
>D0LWX4 ~~~barP~~~Bacterial actin-related protein~~~COG5277
MSDSYVFSSPIIIHPGSDTLQAGLADEEHPGSIFPNIVGRHKLAGLMEWVDQRVLCVGQEAIDQSATVLLRHPVWSGIVG
DWEAFAAVLRHTFYRALWVAPEEHPIVVTESPHVYRSFQLRREQLTRLLFETFHAPQVAVCSEAAMSLYACGLDTGLVVS
LGDFVSYVAPVHRGAIVDAGLTFLEPDGRSITEYLSRLLLERGHVFTSPEALRLVRDIKETLCYVADDVAKEAARNADSV
EATYLLPNGETLVLGNERFRCPEVLFHPDLLGWESPGLTDAVCNAIMKCDPSLQAELFGNIVVTGGGSLFPGLSERLQRE
LEQRAPAEAPVHLLTRDDRRHLPWKGAARFARDAQFAGFALTRQAYERHGAELIYQM
>Q9LBV3 1.1.1.413~~~barS1~~~A-factor type gamma-butyrolactone 1'-reductase (1S-forming)~~~
MTDRQGLLTDRIALITGASSGIGAAQRGLFAREGAAVVVTARREERLAGLVDELRAQGARAAYVVADVTRSEDAVRAVEF
TVERFGRLDAAFNKRRHGAGRTPLHLMDDPVYDDIMDTNVRGVFNCLRPEIAAMLASGAGGSIVNTSSTGGLVATPVAAP
YVVSKHAVLGLTKGPAAEYGAHGIRVNAIAPGTTRSEMVADWFAQNPDAEELLHRATPQPRTAEPQEIAEAAAWLCSERA
SFVTGSTLVVDGGFTIL
>P11540 ~~~~~~Barstar~~~COG2732
MKKAVINGEQIRSISDLHQTLKKELALPEYYGENLDALWDCLTGWVEYPLVLEWRQFEQSKQLTENGAESVLQVFREAKA
EGCDITIILS
>P30843 ~~~basR~~~Transcriptional regulatory protein BasR~~~COG0745
MKILIVEDDTLLLQGLILAAQTEGYACDSVTTARMAEQSLEAGHYSLVVLDLGLPDEDGLHFLARIRQKKYTLPVLILTA
RDTLTDKIAGLDVGADDYLVKPFALEELHARIRALLRRHNNQGESELIVGNLTLNMGRRQVWMGGEELILTPKEYALLSR
LMLKAGSPVHREILYNDIYNWDNEPSTNTLEVHIHNLRDKVGKARIRTVRGFGYMLVANEEN
>P36556 ~~~basR~~~Transcriptional regulatory protein BasR~~~
MKILIVEDDTLLLQGLILAAQTEGYACDGVSTARAAEHSLESGHYSLMVLDLGLPDEDGLHFLTRIRQKKYTLPVLILTA
HDTLNDRITGLDVGADDYLVKPFALEELHARIRALLRRHNNQGESELTVGNLTLNIGRHQAWRDGQELTLTPKEYALLSR
LMLKAGSPVHREILYNDIYNWDNEPSTNTLEVHIHNLRDKVGKSRIRTVRGFGYMLVATEES
>P30844 2.7.13.3~~~basS~~~Sensor protein BasS~~~COG0642
MHFLRRPISLRQRLILTIGAILLVFELISVFWLWHESTEQIQLFEQALRDNRNNDRHIMREIREAVASLIVPGVFMVSLT
LFICYQAVRRITRPLAELQKELEARTADNLTPIAIHSATLEIEAVVSALNDLVSRLTSTLDNERLFTADVAHELRTPLAG
VRLHLELLAKTHHIDVAPLVARLDQMMESVSQLLQLARAGQSFSSGNYQHVKLLEDVILPSYDELSTMLDQRQQTLLLPE
SAADITVQGDATLLRMLLRNLVENAHRYSPQGSNIMIKLQEDDGAVMAVEDEGPGIDESKCGELSKAFVRMDSRYGGIGL
GLSIVSRITQLHHGQFFLQNRQETSGTRAWVRLKKDQYVANQI
>P36557 2.7.13.3~~~basS~~~Sensor protein BasS~~~
MRFQRRAMTLRQRLMLTIGLILLVFQLISTFWLWHESTEQIQLFEQALRDNRNNDRHIMHEIREAVASLIVPGVFMVSLT
LLICYQAVRRITRPLAELQKELEARTADNLAPIAIHSSTLEIESVVSAINQLVTRLTTTLDNERLFTADVAHELRTPLSG
VRLHLELLSKTHNVDVAPLIARLDQMMDSVSQLLQLARVGQSFSSGNYQEVKLLEDVILPSYDELNTMLETRQQTLLLPE
SAADVVVRGDATLLRMLLRNLVENAHRYSPEGTHITIHISADPDAIMAVEDEGPGIDESKCGKLSEAFVRMDSRYGGIGL
GLSIVSRITQLHQGQFFLQNRTERTGTRAWVLLKKA
>E5AV36 ~~~bat1~~~Burkholderia TALE-like protein 1~~~COG2201
MSTAFVDQDKQMANRLNLSPLERSKIEKQYGGATTLAFISNKQNELAQILSRADILKIASYDCAAHALQAVLDCGPMLGK
RGFSQSDIVKIAGNIGGAQALQAVLDLESMLGKRGFSRDDIAKMAGNIGGAQTLQAVLDLESAFRERGFSQADIVKIAGN
NGGAQALYSVLDVEPTLGKRGFSRADIVKIAGNTGGAQALHTVLDLEPALGKRGFSRIDIVKIAANNGGAQALHAVLDLG
PTLRECGFSQATIAKIAGNIGGAQALQMVLDLGPALGKRGFSQATIAKIAGNIGGAQALQTVLDLEPALCERGFSQATIA
KMAGNNGGAQALQTVLDLEPALRKRDFRQADIIKIAGNDGGAQALQAVIEHGPTLRQHGFNLADIVKMAGNIGGAQALQA
VLDLKPVLDEHGFSQPDIVKMAGNIGGAQALQAVLSLGPALRERGFSQPDIVKIAGNTGGAQALQAVLDLELTLVEHGFS
QPDIVRITGNRGGAQALQAVLALELTLRERGFSQPDIVKIAGNSGGAQALQAVLDLELTFRERGFSQADIVKIAGNDGGT
QALHAVLDLERMLGERGFSRADIVNVAGNNGGAQALKAVLEHEATLNERGFSRADIVKIAGNGGGAQALKAVLEHEATLD
ERGFSRADIVRIAGNGGGAQALKAVLEHGPTLNERGFNLTDIVEMAANSGGAQALKAVLEHGPTLRQRGLSLIDIVEIAS
NGGAQALKAVLKYGPVLMQAGRSNEEIVHVAARRGGAGRIRKMVAPLLERQ
>E5AW45 ~~~bat2~~~Burkholderia TALE-like protein 2~~~
MPATSMHQEDKQSANGLNLSPLERIKIEKHYGGGATLAFISNQHDELAQVLSRADILKIASYDCAAQALQAVLDCGPMLG
KRGFSRADIVRIAGNGGGAQALYSVLDVEPTLGKRGFSQVDVVKIAGGGAQALHTVLEIGPTLGERGFSRGDIVTIAGNN
GGAQALQAVLELEPTLRERGFNQADIVKIAGNGGGAQALQAVLDVEPALGKRGFSRVDIAKIAGGGAQALQAVLGLEPTL
RKRGFHPTDIIKIAGNNGGAQALQAVLDLELMLRERGFSQADIVKMASNIGGAQALQAVLNLEPALCERGFSQPDIVKMA
GNSGGAQALQAVLDLELAFRERGFSQADIVKMASNIGGAQALQAVLELEPALHERGFSQANIVKMAGNSGGAQALQAVLD
LELVFRERGFSQPEIVEMAGNIGGAQALHTVLDLELAFRERGVRQADIVKIVGNNGGAQALQAVFELEPTLRERGFNQAT
IVKIAANGGGAQALYSVLDVEPTLDKRGFSRVDIVKIAGGGAQALHTAFELEPTLRKRGFNPTDIVKIAGNKGGAQALQA
VLELEPALRERGFNQATIVKMAGNAGGAQALYSVLDVEPALRERGFSQPEIVKIAGNIGGAQALHTVLELEPTLHKRGFN
PTDIVKIAGNSGGAQALQAVLELEPAFRERGFGQPDIVKMASNIGGAQALQAVLELEPALRERGFSQPDIVEMAGNIGGA
QALQAVLELEPAFRERGFSQSDIVKIAGNIGGAQALQAVLELEPTLRESDFRQADIVNIAGNDGSTQALKAVIEHGPRLR
QRGFNRASIVKIAGNSGGAQALQAVLKHGPTLDERGFNLTNIVKIAGNGGGAQALKAVIEHGPTLQQRGFNLTDIVEMAG
KGGGAQALKAVLEHGPTLRQRGFNLIDIVEMASNTGGAQALKTVLEHGPTLRQRDLSLIDIVEIASNGGAQALKAVLKYG
PVLMQAGRSNEEIVHVAARRGGAGRIRKMVALLLERQ
>Q9I700 2.6.1.18~~~bauA~~~Beta-alanine--pyruvate aminotransferase~~~
MNQPLNVAPPVSSELNLRAHWMPFSANRNFQKDPRIIVAAEGSWLTDDKGRKVYDSLSGLWTCGAGHSRKEIQEAVARQL
GTLDYSPGFQYGHPLSFQLAEKIAGLLPGELNHVFFTGSGSECADTSIKMARAYWRLKGQPQKTKLIGRARGYHGVNVAG
TSLGGIGGNRKMFGQLMDVDHLPHTLQPGMAFTRGMAQTGGVELANELLKLIELHDASNIAAVIVEPMSGSAGVLVPPVG
YLQRLREICDQHNILLIFDEVITAFGRLGTYSGAEYFGVTPDLMNVAKQVTNGAVPMGAVIASSEIYDTFMNQALPEHAV
EFSHGYTYSAHPVACAAGLAALDILARDNLVQQSAELAPHFEKGLHGLQGAKNVIDIRNCGLAGAIQIAPRDGDPTVRPF
EAGMKLWQQGFYVRFGGDTLQFGPTFNARPEELDRLFDAVGEALNGIA
>Q9I701 ~~~bauB~~~Beta-alanine degradation protein BauB~~~
MSGRPQAVPTVQVDNAEVIVTEWRFAPGAETGRHRHGHDYVVVPLTDGTLLLETPEGDRHAPLVAGQAYFRKAGVEHNVI
NASAHEVVFVETEIK
>Q9I702 1.2.1.-~~~bauC~~~Putative 3-oxopropanoate dehydrogenase~~~
MGTLHHLINGEMVADNGRSADVFNPSTGEAIHKVPLADGKTLQKAIDAARAAFPAWRNTPPAKRAQVLYRFKQLLEQNEA
RISKLISEEHGKTLEDAAGELKRGIENVEYACAAPEILKGEYSRNVGPNIDAWSDFQPIGVVAGITPFNFPAMVPLWMYP
LAIACGNTFILKPSERDPSSTLLIAELFHEAGLPKGVLNVVHGDKEAVDGLLQAPEVKAISFVGSTPIAEYIYAEGTKRG
KRVQALGGAKNHAVLMPDADLDNAVSALMGAAYGSCGERCMAISVAVCVGDQVADALIAKLVPQIKALKIGAGTSCGLDM
GPLVTAAAQAKVTGYIDSGVAQGAELVVDGRGYQVAGHENGFFLGGSLFDRVTPEMTIYKEEIFGPVLCVVRVNSLEEAM
QLINDHEYGNGTCIFTRDGEAARLFCDEIEVGMVGVNVPLPVPVAYHSFGGWKRSLFGDLHAYGPDGVRFYTRRKAITQR
WPQRASHEASQFAFPSL
>Q9I703 ~~~bauD~~~Probable GABA permease~~~
MSKVVLASQLPNKRNASLAPGLKQRHVTMLSIAGVIGAGLFVGSGHAIAAAGPAALLAYLIAGTLVVLVMRMLGEMAVAS
PDTGSFSTYADRSIGRWAGFTIGWLYWWFWVLVIPLEAIAAAAILNAWFPAIDTWIFALAVTFLLTVTNLFSVARYGEFE
FWFALLKVIAIIAFIVLGAVAIVGGLPEREVSGLSSLMASHGGFVPNGYGAVLGALLTTMFSFMGTEIVTIAAAESKDPA
KQITRATNSVIWRIGLFYLVSIFIVISIVPWNDPLLIQVGSYQRALELLDIPHAKLIVDLVVLVAVASCLNSAIYTSSRM
VFSLAKRGDAPSVLKLTNTAHVPRPAVLASTAVGFLTTIVNYFAPEKVFTFLLASSGAVALLVYLVIAVAQLRMRKQLQA
SGQPIEFRMWLYPWLTWAVILFIVAALSIMLIMPEHRHEVFATALLTIFTVCLGLLNARRKPRLGEDYAGKTARV
>P80953 ~~~~~~Bacteriocin bavaricin-A~~~
KYYGNGVHXGKHSXTVDWGTAIGNIGNNAAANXATGXNAGG
>P80493 ~~~~~~Bacteriocin bavaricin-MN~~~
TKYYGNGVYCNSKKCWVDWGQAAGGIGQTVVXGWLGGAIPGK
>O31762 ~~~ymfD~~~Bacillibactin exporter~~~COG2814
MKNIIALSSVPLVMTLGNSMLIPVLPMMEKKLSVTSFQVSLIITVYSVVAIICIPIAGYLSDRFGRKKILLPCLLIAGLG
GAVAAFASTYMKNPYAMILAGRVLQGIGSAGAAPIVMPFIGDLFKGDDEKVSAGLGDIETANTSGKVLSPILGALLASWY
WFVPFWFIPFFCLISFLLVLFLVAKPEEDEDAPAVSEFIKSVKKIFKQDGRWLYTVFIIGCVIMFLLFGVLFYLSDTLEN
KYAIDGVAKGGLLAIPLLFLSTSSFIAGKKIGKDKGRMKFCVVTGMILLTLSFIALWWNHSFYFLFVFLSFGGIGIGMAL
PALDALITEGIESEQCGTISSFYNSMRFIGVALGPPVFAALMSNANWIIFILSAFCSIVSLFLVLFTVDAKKSEEEKNLG
TV
>Q9R9H8 3.2.1.-~~~bbmA~~~Intracellular maltogenic amylase~~~
MEYAAIHHQPFSTDAYSYDGRTVHIKIRTKKGDADHIRFIWGDPYEYNDGKWSANEQPMRKIAATEMHDYWFAEVVPPFR
RLQYAFVVTDDHEDIFFGSSGVCPYNEKTLETIHYYFKFPFVHEADTFQAPEWVKSTVWYQIFPERFANGREDLSPKNAL
PWGSKDPGVNDFFGGDLQGIVDKLDYLEDLGVNGIYLTPIFSAPSNHKYDTLDYFSIDPHFGDPEIFRTLVSQLHQRGMR
IMLDAVFNHIGSASPQWQDVVKNGDQSRYKDWFHIHSFPVTDDNYDRFAFTADMPKLNTANPEVQKYLLDIALYWIREFD
IDGWRLDVANEVDHVFWKTFRQAVSTEKPDVYILGEIWHSAEPWLRGDEFHAAMNYPFTEPMIEYFADQTISASRMAHRV
NAHLMNGMKQANEVMFNLLDSHDTKRLLTRCRNDEKKARALLAFMFAQTGSPCIYYGTEIGLDGENDPLCRKCMVWEKEK
QNQDMLQFMKRLIALRKQENTLLTEGHLEWNLLDDKNDFISFSRTLDEKILIYFFNQGNVVQHISLRELNIDRNNKICDA
WTEQPLHYHDVIAVQPGEFLILSAAAPV
>Q14U76 ~~~bbp~~~Bone sialoprotein-binding protein~~~
MINRDNKKAITKKGMISNRLNKFSIRKYTVGTASILVGTTLIFGLGNQEAKAAENTSTENAKQDEASASDNKEVVSETEN
NSTQKNDLTNPIKKETNTDSHQEAKEAPTTSSTQQQQNNATTSTETKPQNIEKENVKPSTDKTATEDTSVILEEKKAPNN
TNNDVTTKPSTSEIQTTPTTPQESTNIENSQPQPTPSKVDNQVTDAINPKEPVNVSKEELKNNPEKLKELVRNDSNTDRS
TKPVATAPTSVAPKRVNAKIRFAVAQPAAVASNNVNDLITVTKQMITEGIKDDGVIQAHDGEHIIYTSDFKIDNAVKAGD
TMTVKYDKHTIPSDITDDFTPVDITDPSGEVIAKGTFDLNTKTITYKFTDYVDRYENVNAKLELNSYIDKKEVPNETNLN
LTFATADKETSKNVKVEYQKPIVKDESNIQSIFSHLDTTKHEVEQTIYVNPLKLNAKNTNVTIKSGGVADNGDYYTGDGS
TIIDSNTEIKVYKVASGQQLPQSNKIYDYSQYEDVTNSVTINKNYGTNMANINFGDIDSAYIVKVVSKYTPGAEDDLAVQ
QGVRMTTTNKYNYSSYAGYTNTILSTTDSGGGDGTVKPEEKLYKIGDYVWEDVDKDGVQGTDSKEKPMANVLVTLTYPDG
TTKSVRTDANGHYEFGGLKDGETYTVKFETPAGYLPTKENGTTDGEKDSNGSSVTVKINGKDDMSLDTGFYKEPKYNLGD
YVWEDTNKDGIQDANEPGIKDVKVTLKDSTGKVIGTTTTDASGKYKFTDLDNGNYTVEFETPAGYTPTVKNTTAEDKDSN
GLTTTGVIKDADNWTLDSGFYKTPKYSLGDYVWYDSNKDGKQDSTEKGIKDVTVTLQNEKGEVIGTTKTDENGKYRFDNL
DSGKYKVIFEKPAGLTQTGTNTTEDDKDADGGEVDVTITDHDDFTLDNGYFEEDTSDSDSDSDSDSDSDSDSDSDSDSDS
DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS
DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDAGKHTPVKPMSATKDHHNKAKALPETGSENN
GSNNATLFGGLFAALGSLLLFGRRKKQNK
>Q9KJF2 ~~~bbsC~~~(2S)-[(R)-hydroxy(phenyl)methyl]succinyl-CoA dehydrogenase subunit BbsC~~~
MKSNSNGKVALIVNADDAVGEAVALRLAGSGVQLALAGADAGRLDKLASQLAGKGATVMAVATAAVEAGAIRDSVAQVKA
RYGRIDVLVHNESALAANLPEISDADVGAALDTGLAAPFHYLRAVVPGMREAGFGRVVNISDLRYLGLANTSSVAAARSG
LFGLTRALALESARDGVTVNTVVMGDVDSETTPAAEREKLAGGIPVKRLGTPADIANAVGFLAADSSKYVTGQTLFVCGG
KSAYFSMSI
>Q9KJF1 1.1.1.429~~~bbsD~~~(2S)-[(R)-hydroxy(phenyl)methyl]succinyl-CoA dehydrogenase subunit BbsD~~~
MGIQNRVALITGSASGMGKQTALRFAEQGAAVVINDIDAEKVRATVDEFSARGHRVLGAVADIGNKAAVDGMVKQTIDAF
GRIDILVNNAGMERAGALRKLSEADWDVTINVNLKGTFLCTQAVHGHMVENKHGRIVNIASRAWLGGAGQTPYSSAKAGV
VGMTRALAIELGRAGITVNCVAPGLIHTPMWDELPEKDQQFLLSRQPTGKLGEPDDIANTLLFLADDDSGFVTGQVLYVC
GGRSLFAG
>Q9KJF0 2.8.3.15~~~bbsE~~~Succinyl-CoA:(R)-benzylsuccinate CoA-transferase subunit BbsE~~~
MGQDFSRFRVVDMTGELGPYTAKMFAGLGADVIHVESPAGDPLRRVGPWFRNQPGVQASLPYLYYNAGKRGFAVDLEHEA
GREVFRTLCSGADLLVESCRPGYLDGLGLSYEELSRDNARLVQTSVTPFGRTGPLAAYPGSDLTCSALSGFLWLAGIDGD
KPVRAPDNQAYRMAEAYAAVGSAIALFSAQRTGKGQLVDVACIEAEAMALENAAQFWDLEGKIRRGRGREAGSATLHPCA
DGYIALVAIMGRNKDMWTPFVRWMEAEGVEEWPLFDDDKWIDYAYRTSEEGYTTFCRVFERYTRSRSKAELYEIGQRFNV
AVTPVSDGRDLLANPQLAHREFWQTQFNDTLGADITYPGAPYEFGELQWQLGRNAPRIGEHTREILVECGYPAFEIDNLL
RMGAVYAEQH
>Q9KJE9 2.8.3.15~~~bbsF~~~Succinyl-CoA:(R)-benzylsuccinate CoA-transferase subunit BbsF~~~
MPNSIERALEGIVVCDFSWVGAGPIATSVLAQCGADVIKIESVKRPDTLRRGEPFKDGIGTGLDRSGYFAARNANKRDIA
LDMSHPRAREVAVRLIEKSDIVINNFRVGQMEKWKLGWEDVQKINPRAIYVTMSMQGIDGPHSRYMGYGVNLNALCGLTA
RAGFPGQAPFGTGTNYTDHVMVPTHTLFGIMAALLEREATGRGQTVSLSQLESAICMTPSAPMAFAANGEALGPQGYGDP
EAAPHGVYTTLGYRKWIAIAVFDDAQWATLRRVMGNPPWAEDERFATIEMRRRHAAELDERIEGWTATQYGDWLMEALLK
AGVAAGEVRDAREAIEDEHLRRRGFWAYLDHPEVGVTLYNRAPIVFSRTPVEMKSAAPSIGQHTREVLGGMLGYSHGEIE
DLAAQQVLV
>Q9KJE8 1.3.8.3~~~bbsG~~~(R)-benzylsuccinyl-CoA dehydrogenase~~~
MDFSLSEEQTMLKEVARRFTANELMPLEKVLLEREMRMWTDGYTLLPEADHARLMKITQEMGFWGIEVDEKLGGQGLGMF
AKTLVVEEMSKSLIGFSHHGFTLPPDAPNLYYLEECGSPAQRDKYVRRYCRGEIDSAMMATEPGAGSDISGLTTTAVREN
GQWVINGSKIFISKCDKDELFFICIAVTDKEAPTKRRFTAFILDKDTPGLRIGAEIPVIGAMPTWSVYLDNVRVGDEAVL
GEVGDAFIPLQNRFGVRRIELAAHCTGMAERLIQMMIDQANLRKTFGVALADRQTVQNWIADSTIELEQVRLQLYFTAWK
SDQGHKDLRLEAASLKIAATEMLTRVADRAIQLHGGLGLSREMGIEYVARMVRIWRVVEGASEIHRMSIAKKLLTDGRTY
SPFVAA
>Q9KJE7 4.2.1.180~~~bbsH~~~(E)-benzylidenesuccinyl-CoA hydratase~~~
MPVTLEVSNHVAYVTLNRPEAMNSLDPESTADLTEIWARVRTDPDIRVAVLTGAGEKSFCTGTDMKKSPPPTECMAATYL
RDGQPILPHMKMWKPIIAAINGYAVGGGLEIALACDLRIASTNAKFGLTEVKVASLAGLNGTQALPRAIPQAVAMKMLLT
GEMISAEEALRYGLVSDVVEPSALADLARSYAEKIASAAPLSVQATKQAAVLGKDMPLEHGILYSHLLWGVLRDTEDRKE
GFKAFGERRAPAFRGA
>B0MC58 2.8.3.-~~~~~~Butyryl-CoA:acetate CoA-transferase~~~COG0427
MSFKEEYQKKLKTADEAVKVVKSGDWLEYGWCVTTPAALDKALAKRMPELENINIRGGIVMWPLEITKIDSPADHFTWNS
WHMGGLERKWIKEGFSYYAPIRYSELPGYYRNYIDHVDVAMMQVAPMDEHGFFNFGPSASHLAAMLEKADCVIVEVNENM
PRCLGGFEEGVHISKVDMIVEGENPAIAELGGGGAATDVDKAVAKLIVDQIPNGACLQLGIGGMPNAVGSMIAESDLKDL
GVHTEMYVDAFVDIAKAGKITGARKNIDRYRQTYAFAAGTKKLYDYLNDNPECMSAPVNYTNDIARVSSIDNFISINNAV
DVDLYGQISAESSGIKQISGAGGQLDFVMGAYLSNGGKSFVCLSSTFTDKAGQMHSRILPTLHNGSIVTDTRANAHYIVT
EYGMANMKGLSAWQRAEALINIAHPDFRDQLIKDAEKAQIWRRSNK
>G2SYC0 2.8.3.-~~~~~~Butyryl-CoA:acetate CoA-transferase~~~COG0427
MDFREEYKQKLVSADEAVKLIKSGDWVDYGWCTNTVDALDQALAKRTDELTDVKLRGGILMKPLAVFAREDAGEHFCWNS
WHMSGIERKMINRGVAYYCPIRYSELPRYYRELDCPDDVAMFQVAPMDAHGYFNFGPSASHLGAMCERAKHIIVEVNENM
PRCLGGTECGIHISDVTYIVEGSNPPIGELGAGGPATDVDKAVAKLIVDEIPNGACLQLGIGGMPNAVGSLIAESDLKDL
GVHTEMYVDAFVDIAKAGKINGSKKNIDRYRQTYAFGAGTKKMYDYLDDNPELMSAPVDYTNDIRSISALDNFISINNAV
DIDLYGQVNAESAGIKQISGAGGQLDFVLGAYLSKGGKSFICLSSTFKTKDGQVQSRIRPTLANGSIVTDARPNTHYVVT
EYGKVNLKGLSTWQRAEALISIAHPDFRDDLIKEAEQMHIWRRSNR
>O07576 ~~~bcaP~~~Branched-chain amino acid permease BcaP~~~COG0531
MKGSVFRKKSIQDLIAATSGEKSLKRELGAFDLTLLGIGAIIGTGIFVLTGTGAVTAGPGLTISFVVAALACLFAALSYA
EFASSVPVSGSVYTFTYATLGELMAFIIGWDLILEYMLAVSAVSVGWSGYFQSFLSGLGIHLPVALTAAPGAVKGTFTLF
NLPAFVIVMAITYLLYLGIKESKRVNNIMVILKILVVLLFIAVAAVYVKPHNWQPFMPMGFGGVFSAAALVFFAFIGFDA
VSSAAEETKNPAKDLPKGIIFSLLVCTILYVTVSAIMTGVIPFAQFAGVDHPVSLVLQSAGQNWVAGIIDIGAVLGMTTV
MLVMLYGQTRVMFAMSRDGLVPGSLSKVHPKHKTPYVATWFFGTMSALLGSLVPLDELAKLVNIGTLSAFVLISVAVIVL
RKKQPDLPRAFKCPGVPVIPGLAILFCLFLILNLGWVTIVRFLVWLLIGLVIYFLYSRKHSKLNQ
>A2RHI9 ~~~bcaP~~~Branched-chain amino acid permease BcaP~~~COG0531
MGFMRKADFELYRDADKHYNQVLTTRDFLALGVGTIISTSIFTLPGQVAAQFAGPGVVFSYLLAALVAGFVALAYAEMST
VMPFAGSAYSWISVLFGEGFGWIAGWALLAEYFIAVAFVGSGFSANLQQLLAPLGFQLPKVLANPFGTDGGIVDIISLLV
ILLSAIIVFRGASDAGRISQILVVLKVAAVIAFIIVGITVIKPANYHPFIPPHNPKTGFGGFSGIWSGVSMIFLAYIGFD
SIAANSAEAKNPQKTMPRGIIGSLLIAVVLFAAVTLVLVGMHPYSAYAGNAAPVGWALQQSGYSVLSEVVTAIALAGMFI
ALLGMVLAGSRLLYAFGRDGLLPKGLGKMNARNLPANGVWTLAIVAIVIGAFFPFAFLAQLISAGTLIAFMFVTLGIYSL
RRRQGKDLPEATYKMPFYPVLPALGFIGSLFVFWGLDVQAKLYSGIWFLIGIAIYFAYGNRRKKK
>K0K750 4.2.3.57~~~ptlA~~~(-)-beta-caryophyllene synthase ((2E,6E)-farnesyl diphosphate cyclizing)~~~COG0664
MGRPATPQQTAFHIPFPRAISPDVSAVHPGSMAWLRRHGMLRSDASARRVDGWRLTELAGRFFPDARGEDLRLGADVMGF
FFLFDDQFDHPGGLRAEAVAVSKRLLHLTSLPAGPAPEGAGPVVAAWADLWNRSCQGMSSAWRVRAAREWRRYFVGNLEE
SVAREGMSGESVEDYLRLRAMTIGTTPVYDLCERTQHFEIPDEVLHSHHVQAMRDLATEIVVLCNDVASTIKESARGETL
NAVLLLERHHEAERGPAVARVQRMVEARLAAFRRLRDRTSRTCAALDLTAEQCDRVDRYVRTALMSVVRGNYDWQQRSAR
FSADDARPGSLPGYLDDLVGHSGVVGPPPVDGS
>Q02192 ~~~bca~~~C protein alpha-antigen~~~
MFRRSKNNSYDTSQTKQRFSIKKFKFGAASVLIGLSFLGGVTQGNLNIFEESIVAASTIPGSAATLNTSITKNIQNGNAY
IDLYDVKLGKIDPLQLIVLEQGFTAKYVFRQGTKYYGDVSQLQSTGRASLTYNIFGEDGLPHVKTDGQIDIVSVALTIYD
STTLRDKIEEVRTNANDPKWTEESRTEVLTGLDTIKTDIDNNPKTQTDIDSKIVEVNELEKLLVLSVPDKDKYDPTGGET
TVPQGTPVSDKEITDLVKIPDGSKGVPTVVGDRPDTNVPGDHKVTVEVTYPDGTKDTVEVTVHVTPKPVPDKDKYDPTGG
ETTVPQGTPVSDKEITDLVKIPDGSKGVPTVVGDRPDTNVPGDHKVTVEVTYPDGTKDTVEVTVHVTPKPVPDKDKYDPT
GGETTVPQGTPVSDKEITDLVKIPDGSKGVPTVVGDRPDTNVPGDHKVTVEVTYPDGTKDTVEVTVHVTPKPVPDKDKYD
PTGGETTVPQGTPVSDKEITDLVKIPDGSKGVPTVVGDRPDTNVPGDHKVTVEVTYPDGTKDTVEVTVHVTPKPVPDKDK
YDPTGGETTVPQGTPVSDKEITDLVKIPDGSKGVPTVVGDRPDTNVPGDHKVTVEVTYPDGTKDTVEVTVHVTPKPVPDK
DKYDPTGGETTVPQGTPVSDKEITDLVKIPDGSKGVPTVVGDRPDTNVPGDHKVTVEVTYPDGTKDTVEVTVHVTPKPVP
DKDKYDPTGGETTVPQGTPVSDKEITDLVKIPDGSKGVPTVVGDRPDTNVPGDHKVTVEVTYPDGTKDTVEVTVHVTPKP
VPDKDKYDPTGGETTVPQGTPVSDKEITDLVKIPDGSKGVPTVVGDRPDTNVPGDHKVTVEVTYPDGTKDTVEVTVHVTP
KPVPDKDKYDPTGGETTVPQGTPVSDKEITDLVKIPDGSKGVPTVVGDRPDTNVPGDHKVTVEVTYPDGTKDTVEVTVHV
TPKPVPDKDKYDPTGKAQQVNGKGNKLPATGENATPFFNVAALTIISSVGLLSVSKKKED
>P33569 1.11.1.-~~~bca~~~Bromoperoxidase-catalase~~~COG0753
MTQGPLTTEAGAPVADNQNSETAGVGGPVLVQDQLLLEKLAHFNRERIPERVVHARGAGAYGTFTLTRDVSRWTRAAFLS
EVGKRTETFLRFSTVAGSLGAADAVRDPRGWALKFYTEEGNYDLVGNNTPVFFIKDAIKFPDFIHTQKRDPYTGSQEADN
VWDFWGLSPESTHQVTWLFGDRGIPASYRHMNGYGSHTYQWNNEAGEVFWVKYHFKTDQGIKNLTQDEANRLAGEDPDSH
QRDLREAIERGDFPTWTVQVQIMPAADAAGYRFNPFDLTKVWPHEDYPPVEIGTLELNRNPENIFAEVEQSIFSPAHFVP
GIGPSPDKMLQGRLFAYGDAHRYRVGINADHLPVNRPHATEARTHSRDGFLYDGRHKGAKNYEPNSFGGPVQTDRPLWQP
TPVTGVTGDHAAPSHAEDDDFTQAGDLYRLMSEDEKGRLIDNLSGFIAKVSRDDIAERAIGNFRRADEDFGKRLEAAVQA
LRG
>P49786 ~~~accB~~~Biotin carboxyl carrier protein of acetyl-CoA carboxylase~~~COG0511
MLNIKEIHELIKAIDESTIDEFVYENEGVSLKLKKHEAGTVQVMQQAPAAPVQAQAPQAVQPQAQQAAAPAQEAPKQDEN
LHKITSPMVGTFYASSSPEAGPYVTAGSKVNENTVVCIVEAMKLFNEIEAEVKGEIVEVLVENGQLVEYGQPLFLVKAE
>P0ABD8 ~~~accB~~~Biotin carboxyl carrier protein of acetyl-CoA carboxylase~~~COG0511
MDIRKIKKLIELVEESGISELEISEGEESVRISRAAPAASFPVMQQAYAAPMMQQPAQSNAAAPATVPSMEAPAAAEISG
HIVRSPMVGTFYRTPSPDAKAFIEVGQKVNVGDTLCIVEAMKMMNQIEADKSGTVKAILVESGQPVEFDEPLVVIE
>P43874 ~~~accB~~~Biotin carboxyl carrier protein of acetyl-CoA carboxylase~~~COG0511
MDIRKIKKLIELVEESGITELEVQEEEGTVRISRAAPVIAPAAVQYAAAPVVAPTPAAAPAQVPAAATTAPAASDELSGH
LVRSPMVGTFYRSPSPEAKAFVEVGQSVKVGDALCIVEAMKMMNRIEADKAGVVKAILINDGNAVEFDEPLIVIE
>Q06881 ~~~accB~~~Biotin carboxyl carrier protein of acetyl-CoA carboxylase~~~COG0511
MPLDFNEIRQLLTTIAQTDIAEVTLKSDDFELTVRKAVGVNNSVVPVVTAPLSGVVGSGLPSAIPIVAHAAPSPSPEPGT
SRAADHAVTSSGSQPGAKIIDQKLAEVASPMVGTFYRAPAPGEAVFVEVGDRIRQGQTVCIIEAMKLMNEIEADVSGQVI
EILVQNGEPVEYNQPLMRIKPD
>P02904 2.1.3.1~~~~~~Methylmalonyl-CoA carboxyltransferase 1.3S subunit~~~
MKLKVTVNGTAYDVDVDVDKSHENPMGTILFGGGTGGAPAPRAAGGAGAGKAGEGEIPAPLAGTVSKILVKEGDTVKAGQ
TVLVLEAMKMETEINAPTDGKVEKVLVKERDAVQGGQGLIKIG
>Q5XAE6 ~~~accB~~~Biotin carboxyl carrier protein of acetyl-CoA carboxylase~~~
MNIQEIKDLMAQFDTSSLREFLFKTNEGELIFSKNEQHLNASISNQEHAVPVPQVQLVPNSTASEASSPASVKDVPVEEQ
PQAESFVAEGDIVESPLVGVAYLAASPDKPPFVAVGDTVKKGQTLVIIEAMKVMNEVPAPCDGVITEILVSNEDVIEFGQ
GLVRIK
>Q87PP5 ~~~~~~Glycine betaine/proline/choline/ectoine transporter VP1456~~~COG1292
MIKFASFLKFRVQINGGRYWSSSPLRSVSNYVKFVFMDNAFKKYSIDTTDYQVGQDNVQKWGFDIHNPVFGISAGLVVFC
LISLLLVEPVTARDALNGIKNGIIEQFDAFFMWSTNFFLLFAVGLLFSPLGKIRLGGKEATPDHSTVSWLSMLFAAGMGI
GLLFWSVAEPTAYFTDWWGTPLNAEAYSADAKSLAMGATMFHWGVHGWSIYALVALALAFFAFNKGLPLSLRAAFYPIFG
DRAWGWLGHVIDILAVLSTLFGLATSLGLGAQQATSGINHVFGLNGGIGTQMVVIAFVTFIAVLSVVRGIDGGVKLLSNV
NMIVAFALLIFITFITFDTAMGSLVDTTMAYIQNIIPLSNPHGREDETWMHGWTVFYWAWWVSWSPFVGMFIARVSKGRT
VREFLFAVIVIPTLVTLVWMSVFGGIALDQVVNKVGELGANGLTDISLTLFHVYDVLPYSSVISILSIVLILVFFITSSD
SGSLVIDSITAGGKIDAPVPQRIFWACIEGSIAAVMLWVGGKEALQALQSGVVATGLPFTFVLLLMCVSLVKGLRTELSA
YR
>Q87NZ5 ~~~~~~Glycine betaine/proline/choline transporter VP1723~~~COG1292
MSTDNNGGIKRPDGKVNAIDTDYQIGQDNVALKVGPFGLDIHNRVFAISGMAIVLFVVATLTFRQQVEPFFAGLRAWLVS
NLDWFFLASGNVFVIVCLVLIVTPLGRVRIGGTEATPDYSYAGWLAMLFAAGMGIGLVFFGVSEPMSHFSSALGGVNIEN
GVRTDWAPLGGAVGDTDAASALGMAATIYHWALHPWSIYALLALGLAIFSFNKGLPLTMRSIFYPLFGERVWGWVGHIID
ILAVVATVFGLATSLGYGASQAATGLNFLFGVPMTDTTQVVLIVVITALALISVVAGLDSGVKRLSEINMILAAMLLFFV
IIVGPTMAILTGFFDNIASYITNIPALSMPFEREDVNYSQGWTAFYWAWWISWSPFVGMFIARVSRGRSVREFIICVILI
PSTVCVLWMTAFGGTAISQYVNDGYEAVFNAELPLKLFAMLDVMPFAEITSVVGIILVVVFFITSSDSGSLVIDTIAAGG
KVDAPTPQRVFWCTFEGLVAIALMLGGGLAAAQAMAVTTGLPFTIVLLVATVSLIKGLMDEPRLSTKAVKKDK
>Q87NG3 ~~~~~~Glycine betaine transporter 1~~~COG1292
MTKGIDKYSIDSTDYTVGQDNVQKWGFDVHNPVFGISAGFIALFLVAALVLDAHTAKTALDGLKWKIIGSFDWLFIIAGN
IFVIFCLALIVSPLGKIRLGGKDAVADYSFMSWLAMLFAAGMGIGLMFWSVAEPVAYFTGWYETPLGVEANSPEAARLAL
GATMFHWGLHPWAIYGVVALSLAFFTYNKGLPLSMRSIFYPLLGDRAWGWAGHIVDILAVLATLFGLATSLGLGAQQAAS
GIHHVFGVEPGLGLQIVVITVVTLLAVVSVVRGIDGGVKVISNINMVVAFLLLILVGLIGWAASLGSIPTTLMAYVENII
PLSNPFGRTDEAWFQGWTVFYWAWWISWSPFVGMFIARVSRGRTVREFITAVLIVPTVVTVVWMSVFGGLAIDQVVNKVG
ELGANGLTDVSLAMFQMFDVLPFGNILSIIAVVLVLVFFITSSDSGSLVIDSITAGGKVDAPVLQRVFWAFMEGAIAVAL
LWIGGSEAVQALQAGAISTALPFTFILLAMCVSLLMGMKTERQ
>Q87J97 ~~~~~~Glycine betaine transporter 2~~~COG1292
MHGFRALKSEVIMSNMTNAAPHTPIQEADYSAIHPPSLLKRLELTNPVFWLSGSFLSLFVLLALTNTESLTAMVNAGFGF
ATKYFGAYWQVLLLLNFLIGLALAFGRTGYVRLGGLAKPDIDTFKWLSIVLCTLLAGGGVFWAAAEPIAHFVTAPPLYGE
ASPKTSAINALSQSFMHWGFLAWAILGCLSSIVLMHLHYDKGLPLKPRTLLYPIFGDKAIHGWIGNLADACSIIAVAAGT
IGPIGFLGLQISYALNSLFGFPDNFITQSMVIVAAIVMYTLSALSGVSKGIQLVSRYNIILSVLLIGYILFFGPTSFIID
GYVQGVGRMVDNFFPMALYRDDTGWLSWWTVFFWGWFIGYGPMMAIFIARISRGRTIRQLILSISIAAPLITCFWFSIVG
GSGLAFELANPGLISSAFEGFNLPAVLLAITGELPFPMIISVLFLILTTTFIVTTGDSMTYTISVVMTGSAEPNAVIRSF
WGLMMGVVAIALISMGSGGITALQSFIVITAVPVSFILLPSILKAPGIANQMAKDQGLV
>Q0AVM4 1.3.8.1~~~~~~Butyryl-CoA dehydrogenase Swol_1933~~~COG1960
MAHENYLYQMRDIKFAVKEWLDMNKLLSCDAYKDYYGIDDIDAFLDVNFKVCRDVLCPANKEADDPGCKFVGGDTQAVIT
PEVFKNAYNTVCEAGLGPQFSDRSAEGRMPLVWEAPILEMQSGASPSIVMFWCLTAGACTVIQHNASEELKERFLPKMYS
GEWGGTMGLTEPGAGSEVGAVATKCFPTDTPGLYKIKGQKCFITSGDHDLASNIIHLVLAKTPDAKPGTSGINCLIVPKF
WVNEDGTQGAWNDVTSTGIEHKMGIHGSSTLSLSFGENDNCYGWMIGDGPVDGRGKGMAQMFQMMNEERLNTGTFAQGCI
GSAYYAALDYCKMRVQSPKFTDPKGPSVRIIEHEDVRRMLLFQKSIMEACRALLYTTYFYQDLSHDAADPAEREYYDDMT
MIQIPLCKAYVSDMAWISTEQAIQCLGGYGFIEEYAPAELARDCKIYSLWEGTNFIQAQDFNNRKTTMKKGEPMKKWVAQ
IADFLATKKDPAFADEFAMMDDAFSAYNEILSTKEAWRASNPQLVQLFATRMLHAASMMICGKLMLDQALLAAKKLAELG
EDHFDAMFYKGKIATARFYVMNVVPGVFGTLKAMKVADTSAIDMPEEAFM
>Q0AVA8 1.3.8.1~~~~~~Butyryl-CoA dehydrogenase Swol_2052~~~COG1960
MPHNNYLYQTRDIKFQIKEWLDINKILSLDAYKDYYGADDFDAICDVNFKICRDVICPANKESDEIGMKHVGGNEKAVIS
PDVFKTVYNTVIEAGMGPQFGDRQVEGRMPLYWYAPILEMQTGASPAMVMLWCLTQGATTVLQYNLSKELQERFLPKMYS
GEWGGSMCLTEPGAGSEVGAVSTKCFPTDTPGLWKVKGQKCFITTGDWDGVDNIIHLVLAKDPDAKPGTAGISCLVVPKF
WVNEDGSMGAWNDVTTTGIEHKLGIHGSATCSLAFGENDNCYGWMIGDGPVDGRGQGMAQMFQMMNEERINTGIFSLGAF
GAAYYAALEYSKARVQSKKSTDPKGPSVRIIEHEDVRRMLLLQKSVMEACRALLYSSYYYIDMSKEAATEEEREYAEDMF
MIQNPLCKAYVSDMAWVMCAEAIQVHGGYGFMEEYAPASLARDCKIYTLWEGTNFIQSQDFTGRKFTMKKGEPFKKWLAE
IGDFIANKKTPEFAAEFAMMEKAFAAFNSIIDMNAAWTTTNKQLKQLFATRIMHAAARVICGKLMLDQGLLAAGKLAELG
DSHFDANFYKGKLASVKFYVMNVVPEIFGTEEAMKAADTSAIDCPEEAIM
>O34697 ~~~bceA~~~Bacitracin export ATP-binding protein BceA~~~COG1136
MVILEANKIRKSYGNKLNKQEVLKGIDIHIEKGEFVSIMGASGSGKTTLLNVLSSIDQVSHGTIHINGNDMTAMKEKQLA
EFRKQHLGFIFQDYNLLDTLTVKENILLPLSITKLSKKEANRKFEEVAKELGIYELRDKYPNEISGGQKQRTSAGRAFIH
DPSIIFADEPTGALDSKSASDLLNKLSQLNQKRNATIIMVTHDPVAASYCGRVIFIKDGQMYTQLNKGGQDRQTFFQDIM
KTQGVLGGVQHEH
>O34741 ~~~bceB~~~Bacitracin export permease protein BceB~~~COG0577
MNINQLILRNLKKNLRNYYLYVFALIFSVALYFAFVTLQYDPAINEVKASIKGAAAIKTASILLVAVVAIFILYANTIFI
KRRSKEIGLFQLIGMTKHKIFRILSAENVMLYFGSLAIGVAAGFSISKLVLMILFKIVDVKADAKLHFSEQALVQTVIVF
CGIYLLIMIMNYTFIKKQSILSLFKVTSSTEDKVKKISFFQMLIGALGIVLILTGYYVSSELFGGKFKTINELFVAMSFI
LGSVIIGTFLFYKGSVTFISNIIRKSKGGYLNISEVLSLSSIMFRMKSNALLLTIITTVSALAIGLLSLAYISYYSSEKT
AEQNVAADFSFMNEKDAKLFENKLRESNISFVKKATPVLQANVDIANIMDGTPKEMQGDPGNMQLAVVSDKDVKGVDVAA
GEAVFSGYTDLLQKIMVFKDSGVIKVKSKHETQPLKYKGLREEFLVSYTFTSGGMPAVIVDDSLFKQLDKDKDPRIQLAQ
STFIGVNVKHDDQMEKANELFQQVNKKNEHLSRLDTSAAQKSLFGMVMFIVGFLGLTFLITSGCILYFKQMGESEDEKPS
YTILRKLGFTQGDLIKGIRIKQMYNFGIPLVVGLFHSYFAVQSGWFLFGSEVWAPMIMVMVLYTALYSIFGFLSVLYYKK
VIKSSL
>Q9F715 1.3.7.7~~~bchB~~~Light-independent protochlorophyllide reductase subunit B~~~COG2710
MRLAFWLYEGTALHGVSRVTNSMKGVHTVYHAPQGDDYITATYTMLERTPEFPKLSISVVRGQDLARGTSRLPGTVEQVD
KHYKPELIVVAPSCSTALLQEDLGQMARASGVDQSKIMVYAVNPFRVAENEAAEGLFTELVRRFAAEQPKTEKPSVNLLG
FTSLGFHLRSNLTSLRRMLKTLGIEVNVVAPWGAGIDDLKKLPAAWVNIAPFREIGCQAAGYLKEKFGMPSITEAPLGVN
ATLRWLRAIIAEVNKIGAEKGMAPMAMPELRDFSLDGQSAPSSVPWFARTADMESFSNKRAFVFGDATQVVGVTKFLKDE
LGMKIIGAGTYLPKQADWVREQLEGYLPGELMVTDKFQEVSAFIEEEMPELVCGTQMERHSCRKLDVPCMVISAPTHIEN
HLLGYYPFFGFDGADVMADRVYTSAKLGLEKHLIDFFGDAGLEYEAEEPEAFTEPTMSGNGTVASVSSAEAPSEAAVVTA
TATGELSWTAEAEKMLGKVPFFVRKKVRKNTDNYAREIGEPVVTADVFRKAKEHLGG
>P26163 1.3.7.7~~~bchB~~~Light-independent protochlorophyllide reductase subunit B~~~COG2710
MKLTLWTYEGPPHVGAMRVATAMKDLQLVLHGPQGDTYADLLFTMIERRNARPPVSFSTFEASHMGTDTAILLKDALAAA
HARYKPQAMAVALTCTAELLQDDPNGISRALNLPVPVVPLELPSYSRKENYGADETFRALVRALAVPMERTPEVTCNLLG
ATALGFRHRDDVAEVTKLLATMGIKVNVCAPLGASPDDLRKLGQAHFNVLMYPETGESAARHLERACKQPFTKIVPIGVG
ATRDFLAEVSKITGLPVVTDESTLRQPWWSASVDSTYLTGKRVFIFGDGTHVIAAARIAAKEVGFEVVGMGCYNREMARP
LRTAAAEYGLEALITDDYLEVEKAIEAAAPELILGTQMERNIAKKLGLPCAVISAPVHVQDFPARYAPQMGFEGANVLFD
TWVHPLVMGLEEHLLTMFREDFEFHDAAGASHHGGKAVAREESPVAPADLAPAATSDTPAAPSPVVVTQASGEIRWMPEA
ERELRKIPFFVRGKAKRNTELYAAHKGVCDITVETLYEAKAHYAR
>O34845 6.6.1.1~~~bchD~~~Magnesium-chelatase 60 kDa subunit~~~COG1240
MPLGPWERVEAALTLLAIDPAGLKGLWLRARASALRDRITGALGALPLPVRRIHPTIGDDALFGGLDLAATLSAGTPVVQ
KGILDEPAVLVLAMAERTLPGLAARLGTALDAPRHCLIALDEGAERDELLPLGLVDRLALFLDLDGLPWGETREIALDPE
RLAAARARLAAVATPPEAAATLARVAAQLGIASLRAPTLALAAARAQAAWEGHAAVTDEDIRRAADLVFAHRAMPASEEA
PPEPEPEPPEDQPDDSPPPPEQQQGEEMFPEEMLVEAVRAALPADLLEQLAAGRAARMARGATGTGSAKAGNRRGRPLPS
RMGRLGTGARIDLVGTLRAAAPWQPLRRRQQKTDAVLLVRPSDIRIKRFRETSDRVLIFAVDASGSSAMARLSEAKGAVE
LLLGQAYARRDHVSLLAFRGRDAELILPPTRSLVQTKRRLAGLPGGGGTPLAHGLRLALAVGLQARARGMTPTVALLTDG
RGNIALDGSANRAQAEEDALKLAASLRGSGLPAVVIDTANRPQPSLAALARALDAPYIALPRADAHKLSNVLGAAMGD
>P26175 6.6.1.1~~~bchD~~~Magnesium-chelatase 60 kDa subunit~~~COG1240
MDHERLKSALAVLTVDPAAVGGLWLRSRAGPIRLAFTDTLAKLPFPMALRRLPPNVDDGALYGGLDVAETLHSGKPVLKG
GLLDRPSVFILPMAERCTAKLGARLAQALDLRQHALIALDEAAEPDEALPHAVADRLGLFVDLSEVRSIDGPGLLPETAQ
IERARELLPQVQMPAERVSEIVEGCRQLGISSLRAPMLALTAARILTALSGRTRVEAEDVLHAAELTLAHRALPLQEAPP
PPPPPPEPPEPNEGENQQDEQDQIDPLDGIPPEIVVEAVRAMLPDNILQTLNMGSRLRAASGGQGAGQEQIGNRRGRPLP
SRKGKLEDDAKIDLVATLRSAAPWQGLRRRQAPAGTERVLLVESSDIHIKRRKEMSDRVLIFAVDASGSAAVARLSEAKG
AVELLLGRAYAARDHVSLITFRGTAAQVLLQPSRSLTQTKRQLQGLPGGGGTPLASGMEMAMVTAKQARSRGMTPTIALL
TDGRGNIALDGTANRELAGEQATKVARAIRASGMPAVIIDTAMRPNPALVDLARTMDAHYIALPRATAHKMADVLGAALE
A
>P26168 1.21.98.3~~~bchE~~~Anaerobic magnesium-protoporphyrin IX monomethyl ester cyclase~~~COG1032
MRILFVHPNYHSGGAEIAGNWPPSWVPYLAGHLKKAGFDDIHFIDAMTLNVSHDELRKKFAELQPDLIGVTSITPSIYEA
EETLKIAKEVVPNAVRVLGGVHATFMFRQVLSEAPWVDAIVRGEGEEIMVELAKCVSEGRWPEDRASIKGLAFHDGTEIV
ATQAAPTVKDIDSLKPDWSLIDWKHYIYIPLGVRVAIPNMARGCPFTCSFCSQWKFWRDYRVRSPKAVVDEIEDLVNNYD
VGFFILADEEPTINKKKFVEFCQEMIDRGLNHKVKWGINTRVTDIYRDRDLLKFYREAGLVHISLGTEAAAQLKLDLFNK
ETTVAENKEAIRLLREADIFTEAQFIVGLDNETKETLEETFQMAWDWQPDLANWSMYTPWPFTPLFQELRDQVEVFDFSK
YNFVTPIMKPKALTRGELLDGVMKNYRRFYMRKALFHYPWRGTGFRRRYLLGCLKAFLKAGVGRTFYDLGKAGYWGPQTK
DTVDFHFDETRKIAEAQVADWEAAADRSRKHKERQEALRAQMKDRAADRNTANFVMPADAEDEFDLSAETHEARSAEHAA
MACGGGKDQMVDAAE
>Q7X2C7 1.21.98.3~~~bchE~~~Anaerobic magnesium-protoporphyrin IX monomethyl ester cyclase~~~
MRVLFIHPNYHSGGAEIAGNWPPAWVAYLAGYLKAGGYTDVIFVDAMTNDLSEDQVREKITTLKPDIVGCTAITPAIYKA
ERTLQIAKEVNPDIVTVLGGIHGTFMYPQVLKEAPWIDAIVRGEGEQVMLNLVTAVDQGRFMADRNCVNGIAYAAPDGKV
VATPAEPPIEDLDRITPDWGILEWEKYIYIPMNKRVAIPNFARGCPFTCSFCSQWKFWRDYRIRDPKKVVDEIEVLVKQH
DVGFFILADEEPTIHRKKFIEFCEELIKRDLGVLWGINTRVTDILRDEKLLPLFRKAGLIHVSLGTEAAAQLKLDMVNKE
TTIEQNKRAIQLLKDNGIVTEAQFIVGLENETAETLEETYKMARDWNPDMANWAMYTPWPFSDLFQELGDKVEVFDFEKY
NFVTPIMKPDAMDRGELLDRVMSNYRRFFMNKAFLQYPFTKDKERRKYLMGCLKAFLKSGFERKFYDLGRVGYWGPQTKK
TVNFSFDKNRRIDAQTADELSRVDDGWVTMHGPKIEMRRRKGDDNFEIAKAAMACGGGTEQLTEEQQAATEVRAS
>O30819 6.6.1.1~~~bchI~~~Magnesium-chelatase 38 kDa subunit~~~COG1239
MKKPFPFSAIVGQEQMKQAMVLTAIDPGIGGVLVFGDRGTGKSTAVRALAALLPLIKAVEGCPVNSARPEDCPEWAHVSS
TTMIERPTPVVDLPLGVTEDRVVGALDIERALTRGEKAFEPGLLARANRGYLYIDEVNLLEDHIVDLLLDVAQSGENVVE
REGLSIRHPARFVLVGSGNPEEGELRPQLLDRFGLSVEVRSPRDVETRVEVITRRDAYDADHDAFMEKWGAEDMQLRGRI
LGARAALPQLKTPNTVLHDCAALCIALGSDGLRGELTLLRAARAQAAFEGAEAVGRSHLRSVATMALSHRLRRDPLDEAG
SVSRVERCVAEVLP
>P26239 6.6.1.1~~~bchI~~~Magnesium-chelatase 38 kDa subunit~~~COG1239
MTTAVARLQPSASGAKTRPVFPFSAIVGQEDMKLALLLTAVDPGIGGVLVFGDRGTGKSTAVRALAALLPEIEAVEGCPV
SSPNVEMIPDWATVLSTNVIRKPTPVVDLPLGVSEDRVVGALDIERAISKGEKAFEPGLLARANRGYLYIDECNLLEDHI
VDLLLDVAQSGENVVERDGLSIRHPARFVLVGSGNPEEGDLRPQLLDRFGLSVEVLSPRDVETRVEVIRRRDTYDADPKA
FLEEWRPKDMDIRNQILEARERLPKVEAPNTALYDCAALCIALGSDGLRGELTLLRSARALAALEGATAVGRDHLKRVAT
MALSHRLRRDPLDEAGSTARVARTVEETLP
>Q9RFD6 1.3.7.7~~~bchL~~~Light-independent protochlorophyllide reductase iron-sulfur ATP-binding protein~~~COG1348
MSPKDLTIPTGADGEGSVQVHLDEADKITGAKVFAVYGKGGIGKSTTSSNLSAAFSILGKRVLQIGCDPKHDSTFTLTGS
LVPTVIDVLKDVDFHPEELRPEDFVFEGFNGVMCVEAGGPPAGTGCGGYVVGQTVKLLKQHHLLDDTDVVIFDVLGDVVC
GGFAAPLQHADQAVVVTANDFDSIYAMNRIIAAVQAKSKNYKVRLAGCVANRSRATDEVDRFCKETNFRRLAHMPDLDAI
RRSRLKKKTLFEMDEDQDVLAARAEYIRLAESLWRGLDPIDPHSLPDRDIFELLGFD
>P0CY53 1.3.7.7~~~bchL~~~Light-independent protochlorophyllide reductase iron-sulfur ATP-binding protein~~~
MSPRDDIPDLKGFDGDGEGSVQVHDSEDIGLDVGGARVFSVYGQGGIGKSTTSSNLSAAFSLLGKRVLQIGCDPKHDSTF
TLTGRLQETVIDILKQVNFHPEELRPEDYVTEGFNGVMCVEAGGPPAGTGCGGYVVGQTVKLLKQHHLLEDTDVVVFDVL
GDVVCGGFAAPLQHADRALIVTANDFDSIYAMNRIIAAVQAKSVNYKVRLAGCVANRSRETNEVDRYCEAANFKRIAHMP
DLDSIRRSRLKKRTLFEMDDAEDVVMARAEYIRLAETLWRSTGEPGLTPEPLTDRHIFELLGFD
>D5ANS3 1.3.7.7~~~bchL~~~Light-independent protochlorophyllide reductase iron-sulfur ATP-binding protein~~~COG1348
MSPRDDIPDLKGFDGDGEGSVQVHDSEDIGLDVGGARVFSVYGKGGIGKSTTSSNLSAAFSLLGKRVLQIGCDPKHDSTF
TLTGRLQETVIDILKQVNFHPEELRPEDYVTEGFNGVMCVEAGGPPAGTGCGGYVVGQTVKLLKQHHLLEDTDVVVFDVL
GDVVCGGFAAPLQHADRALIVTANDFDSIYAMNRIIAAVQAKSVNYKVRLAGCVANRSRETNEVDRYCEAANFKRIAHMP
DLDSIRRSRLKKRTLFEMDDAEDVVMARAEYIRLAETLWRSTGEPGLTPEPLTDRHIFELLGFD
>P26236 2.1.1.11~~~bchM~~~Magnesium-protoporphyrin O-methyltransferase~~~
MPSDYAEIRNRVEHYFDRTATRAWARLTTADEKVSKVRQTVREGRDTMRAVMLSRLPDDLTGCRVMDAGCGTGLTTVELA
RRGADVVAVDISPQLIDIAKDRLPPELRGKVSFHVGDMADPALGQFDYVVAMDSLIYYRAPDIGRVLTELGKRTHSAIVF
TVAPKTAFLMAFWWLGKLFPRSNRSPVMIPHALDKLQRHAGDSLIKIDRVARGFYISECLEYRP
>P26164 1.3.7.7~~~bchN~~~Light-independent protochlorophyllide reductase subunit N~~~COG2710
MSLDSPTFGCTDSPVRRERGQKAVFCGLTSIVWLHRKMQDAFFLVVGSRTCAHLLQAAAGVMIFAEPRFGTAVLEEQDLA
GLADAHKELDREVAKLLERRPDIRQLFLVGSCPSEVLKLDLDRAAERLSGLHAPHVRVYSYTGSGLDTTFTQGEDTCLAA
MVPTLDTTEAAELIVVGALPDVVEDQCLSLLTQLGVGPVRMLPARRSDIEPAVGPNTRFILAQPFLGETTGALERRGAKR
IAAPFPFGEEGTTLWLKAVADAYGVSAEKFEAVTAAPRARAKKAIAAHLETLTGKSLFMFPDSQLEIPLARFLARECGMK
TTEIATPFLHKAIMAPDLALLPSNTALTEGQDLEAQLDRHEAINPDLTVCGLGLANPLEAKGHATKWAIELVFTPVHFYE
QAGDLAGLFSRPLRRRALLNGGAA
>Q8KBK9 2.1.1.332~~~bchQ~~~Bacteriochlorophyllide d C-8(2)-methyltransferase~~~COG1032
MDDDSNQKPLFHMALGVLTSLTPPQHHIELVDEHFHDKINYDGDYDMVGITSRTIEATRAYEIADEFRKRGKTVVLGGLH
ISFNPEEAAAHADCIVVGEADNLWTTLLDDVANNRLKERYDSKDFPPVKAITPLDYARIAKASKRTKVDGTKSIPIYVTR
GCPFNCSFCVTPNFTGKQYRVQDPKLLKHQIEEAKKYFFKANGKNSKPWFMLTDENLGINKKKLWESLDLLKECDITFSV
FLSINFLEDPTTVKKLVDAGCNFVLAGLESIKQSTLEAYNKGHVNSAEKYSKIIEDCRKAGLNIQGNFLFNPAIDTFEDI
DELVQFVKKNHIFMPIFQIITPYPGTQMYHEYRESGLITIEDWEKYNALHLVIKSDRYEPLLFQYKVLKSYVEVYTWKEI
LLRTLYNPRKLINLVTSIAFKKHLAAQLKAFERNHKMNPAMLSGVKPVMNG
>Q8KCU0 2.1.1.331~~~bchR~~~Bacteriochlorophyllide d C-12(1)-methyltransferase~~~COG1032
MSLTNGVAPSLEKLAAEQKSRKKWLLVQPKSQTSMMVDSGAVSMPLNLIMVATLASKYFDVTFLDERTGDTIPQDFSGYD
VVAITSRTLNAKNAYRIGDRAKAQGKIVLIGGVHPTMLTDEASLHCTSVIYGEIESVWEELAIDIFRGKMKSVYKASNLK
PMTTMTPPDFSFALNSPHAKKYSQLIPILATKGCPVGCSFCTTPTVYGKSFRYREIDLVLDEMRAHQERLGKKKVRFSFM
DDNISFRPKYFMELLEGMAKLGVRWNANISMNFLQKPEVAELAGRSGCELMSIGFESLNPDILKSMNKGSNRLQNYEAVV
SNLHKHKIAIQGYFIFGFDDDSEKSFQATYDFIMQNRIEFPVFSLLTPFPGTPYFEEMKDRVRHFDWDKYDTYHYMFEPK
KLGGEKLLENFIKLQREVYKGSAIMKRMQGKPLNWVWFVNFLMNRFTRKLTPEMYL
>Q8KGE0 2.1.1.333~~~bchU~~~Bacteriochlorophyllide d C-20 methyltransferase~~~COG2813
MSNNDLLNYYHRANELVFKGLIEFSCMKAAIELDLFSHMAEGPKDLATLAADTGSVPPRLEMLLETLRQMRVINLEDGKW
SLTEFADYMFSPTPKEPNLHQTPVAKAMAFLADDFYMGLSQAVRGQKNFKGQVPYPPVTREDNLYFEEIHRSNAKFAIQL
LLEEAKLDGVKKMIDVGGGIGDISAAMLKHFPELDSTILNLPGAIDLVNENAAEKGVADRMRGIAVDIYKESYPEADAVL
FCRILYSANEQLSTIMCKKAFDAMRSGGRLLILDMVIDDPENPNFDYLSHYILGAGMPFSVLGFKEQARYKEILESLGYK
DVTMVRKYDHLLVQAVKP
>P26177 1.3.7.15~~~bchX~~~Chlorophyllide reductase 35.5 kDa chain~~~COG1348
MTDAPNLKGFDARLREEAAEEPTLEIPEQPPTKKTQIIAIYGKGGSGKSFTLANLSHMMAEMGKRVLLIGCDPKSDTTSL
LFGGKNCPTIIETATKKKLAGEEVKVGDVCFKSGGVFAMELGGPEVGRGCGGRGIIHGFELLEKLGFHDWDFDFVLLDFL
GDVVCGGFGLPIARDMAQKVIVIGSNDLQSLYVANNVCNAVEYFRKLGGNVGVAGIVINKDDGTGEAQAFAREVGIPILA
AIPADEELRRKSAAYQIVGSHATPWGKLFEELAGNVADAPPLRPRPLSPDALLALFETDEETRVVDLVPATDEDLRGSNA
APKKSLEVIYDDV
>P26178 1.3.7.15~~~bchY~~~Chlorophyllide reductase 52.5 kDa chain~~~COG2710
MTDLPQAEGGCGAGNERLAAQAAAAGNAELMARFKADYPVGPHDKPQTMCPAFGALRVGLRMRRVATVLCGSACCVYGLS
FISHFYGARRSVGYVPFDSETLVTGKLFEDVRASVHDLADPARYDAIVVINLCVPTASGVPLQLLPNEINGVRVVGIDVP
GFGVPTHAEAKDVLSGAMLAYARQEVMAGPVPAPISGRSDRPTVTLLGEMFPADPMVIGAMLAPMGLAVGPTVPMRDWRE
LYAALDSKVVAAIHPFYTAAIRQFEAAGRAIVGSAPVGHDGTMEWLANIGRAYDVSPDKIAAAQNAFGPAIRGAIAGAPI
KGRITVSGYEGSELLVARLLIESGAEVPYVGTAAPRTPWSAWDKDWLESRGVVVKYRASLEDDCAAMEGFEPDLAIGTTP
LVQKAKALGIPALYFTNLISARPLMGPAGAGSLAQVMNAAMGNRERMGKMKAFFEGVGEGDTAGIWQDTPKLYPDFREQQ
RKKMEKAAKLAKAEEMI
>P26179 1.3.7.15~~~bchZ~~~Chlorophyllide reductase subunit Z~~~COG2710
MFLLDHDRAGGYWGAVYTFCAVKGLQVVIDGPVGCENLPVTSVLHYTDGLPPHELPIVVTGLGDAELGREGTEGAMSRAW
KTLDPLLPSVVVTGSIAEMIGGGVTPQGTNLQRFLARTIDEDQWQCADRAMTWLFTEYGMTKGRMPGERMRPDGAKPRVN
ILGPMYGAFNMASDLHEIRRLVEGIGAEVNMVFPLGTHLSEVRNLVNADVNVVMYREFGRNLAEILGKPYLQAPIGLEST
TKFLRSLGELLGLDPEPFIEREKHATLKPLWDLWRSVTQDFFATASFGICATETYARGIKAYLEGDLGLPCAFAVARKAG
EKTKSDEVRGLIRQTRPLVVFGSINEKIYLAETKAGHGPAASFVPASFPGAAIRRATGTPFMGYMGSVYLLQEICNGLFD
ALFNILPLASEMDSAAATPATLRRDMPWDADAQAALDRIVSQHPVLTRISAAKSLRDAAEKAALDQGAERVVLEMVEALG
DATMDRKGGN
>N0DKX5 1.17.98.2~~~bciD~~~Bacteriochlorophyllide c C-7(1)-hydroxylase~~~
MSTKRVITKEDIHLKARLLSEGAKVTVNKPPASGFNPFRAMVLNGSDLATLVRQEPYTRLEVQVNGDDVEFYDCGQHLAS
GRMQEAFSWRSGKLSNGRPVDAAVIGMNQDIINIHYSYSCDNNNTGRSCRFCFFFADQHIGVGKELAKMPFSKIEELAKE
QAEAVKIATDAGWRGTLVIIGGLVDPSRRAQVADLVELVMAPLREQVSPEVLNELHITANLYPPDDFKEMEKWKASGLNS
TEFDLEVTHPDYFKAICPGKSATYPLEYWLEAQEASVKIFGPGRGTTSFILMGLEPMNIMLEGVEERMSKGVYPNMLVYQ
PVPGADMFRMPPPNADWLVEASEKVADLYIKYQDRFDMPLAKDHRPGYTRMGRSQYIILAGDMLAYKLQEQGYELPEAYP
VC
>Q8GQN9 6.2.1.25~~~bclA~~~Benzoate--CoA ligase~~~
MYTLSVADHSNTPPAIKIPERYNAADDLIGRNLLAGRGGKTVYIDDAGSYTYDELALRVNRCGSALRTTLGLQPKDRVLV
CVLDGIDFPTTFLGAIKGGVVPIAINTLLTESDYEYMLTDSAARVAVVSQELLPLFAPMLGKVPTLEHLVVAGGAGEDSL
AALLATGSEQFEAAPTRPDDHCFWLYSSGSTGAPKGTVHIHSDLIHTAELYARPILGIREGDVVFSAAKLFFAYGLGNGL
IFPLAVGATAVLMAERPTPAAVFERLRRHQPDIFYGVPTLYASMLANPDCPKEGELRLRACTSAGEALPEDVGRRWQARF
GVDILDGIGSTEMLHIFLSNRAGDVHYGTSGKPVPGYRLRLIDEDGAEITTAGVAGELQISGPSSAVMYWNNPEKTAATF
MGEWTRSGDKYLVNDEGYYVYAGRSDDMLKVSGIYVSPIEVESALIAHEAVLEAAVVGWEDEDHLIKPKAFIVLKPGYGA
GEALRTDLKAHVKNLLAPYKYPRWIEFVDDLPKTATGKIQRFKLRSA
>M5AW86 4.2.3.188~~~bcl-ts~~~Trifunctional sesterterpene/triterpene/sesquarterpene synthase~~~
MGTVPANPFKIIQLAFKETVPKAHAELQKWHQEALKIEDVEIREQAAWTVNDKTFHCEGGSIFALLAGENKDNHIQFLVA
YQTICDYLDTLCDKNDAHDPNDFRSIHQALLDCLTPDKPYGDYYQYRDRFEDNGYLRKLVDACREATASFPGFADMQTHM
QEVSQFYIDFQVYKHVEEEKREPLLKDFYERNKHFAPTMRWYEFACGTASTLALYCMAAYAAAPVQTAQGQQIKEAYFTW
VQGVHILLDYFIDQEEDRQENEMNFVAYYRDSKEMFERFKYIDEKATEKLQMLPDKKFHLLLKTGLYALYLSDKKVMSHP
RLKAEAKQLIKLGGFPASLFYYNRWIFKRKIS
>P86393 ~~~~~~Bacteriocin SRCAM 602~~~
ATYYGNGLYCNKQKHYTWVDWNKASREIGKITVNGWVQH
>P85833 ~~~~~~Bacteriocin~~~
KKIDTRTGKTMEKTEKKIELSLKNMKTAT
>P86395 ~~~~~~Bacteriocin SRCAM 37~~~
FVYGNGVTSILVQAQFLVNGQRRFFYTPDK
>P08696 ~~~bcn~~~Bacteriocin BCN5~~~
MANNIIPNVSSGDLVGSTPTFPPNAVVRGDFLYLRDVDGNQIPGRTVSDGDEITVLFISNEKNIVLVQYPTSSGYRQGYV
TNATSIIKYKDDYSWVNGSTPEPVYDFDKTTQIGTLDPRERAVVLYKVDGMTYVAYDTGKGKLTKSGLVHYEGSGSSTGG
GSFNGVAPGEVVPGGFTYENNAEVVGDELYLRDANGNLIPGRSVSVGDKITVLDVGYTKQLALVQYPAGDVVRQGYVTNA
TNLIRYFNQYSWHNGSTSEEVLDENGGHLGSLNPYEAATLLYEKNGMKHVVYDTNKGPNTKSGYVKYEGAAATRVDIPYP
SITNAQKIVYGISGRGRELAAYKVGNGSNSLVFVCAIHGWEDNWAADGIELTRIGNGLIEHFQNAGTNNWSLYIIPVANP
DGLSEGFTNNGPGRCTIVGAVDCNRDFPLGFSPGGVPRYHSGSEPLSVSESKSLHDFIQGVKNRTSGEMCVIDLHGWEGA
AIGNPEIGEYFRNQFGFGQRSGYGDNRGFMIGWAKSIGAKAALIELPGSTKSHSDVVNGRYLQKIINAVTNLIGGSGGSS
SGGSSFSDVSYEATGEVINVQSFLNVREGAGLYTNSIGQLRQGNKVNIVAKNGDWYKIKYGSEYGYVNSGYIIILKNNTS
VKLEDWQEDCIKFGWGPITKEKYLEYMDSTRLYKSIENDISQAIKNKSLINVINPLNFSVSEMIACTQIVFNNETTSFFR
DEWYSKSNPNFIVKYKKLSNGQIIVLDRINIKKPEKLKTKIPKAAKGAFKDTIKFEFFKGIDGWFTAISGAISIGSDLSV
FQSNGELKSNEDIAKALAAAVIVNGVETMFCAFLGGFIAQCIAPEFPIVAAVAGAIVSAIAAFAIGYFVDNHEKEKYLMN
SFKGLIDYLF
>P86394 ~~~~~~Bacteriocin SRCAM 1580~~~
VNYGNGVSCSKTKCSVNWGIITHQAFRVTSGVASG
>P15935 ~~~uviA~~~Bacteriocin UviA~~~
MSELYKNIVLCQNGDKKAIEYIINRFEILINKYKMSFLKEIHFNSYDIEDNKQDLIVSLINIVNKIPIDNPQFENEGCLV
NYIYKSILNSRKDMYINKNIKRYFIESQSLSSMVEFKDKPLVKYIESNIEIEDMLKCLTEKEQKVIKYKFLNDKSEVEIA
EIMGTSRQWINRIKNTALKKLKENI
>P15936 ~~~uviB~~~Bacteriocin UviB~~~
MDSELFKLMATQGAFAILFSYLLFYVLKENSKREDKYQNIIEELTELLPKIKEDVEDIKEKLNK
>Q46393 ~~~fmoA~~~Bacteriochlorophyll a protein~~~
MALFGSNDVTTAHSDYEIVLEGGSSSWGKVKARAKVNAPPASPLLPADCDVKLNVKPLDPAKGFVRISAVFESIVDSTKN
KLTIEADIANETKERRISVGEGMVSVGDFSHTFSFEGSVVNLFYYRSDAVRRNVPNPIYMQGRQFHDILMKVPLDNNDLI
DTWEGTVKAIGSTGAFNDWIRDFWFIGPAFTALNEGGQRISRIEVNGLNTESGPKGPVGVSRWRFSHGGSGMVDSISRWA
ELFPSDKLNRPAQVEAGFRSDSQGIEVKVDGEFPGVSVDAGGGLRRILNHPLIPLVHHGMVGKFNNFNVDAQLKVVLPKG
YKIRYAAPQYRSQNLEEYRWSGGAYARWVEHVCKGGVGQFEILYAQ
>P11741 ~~~~~~Bacteriochlorophyll a protein~~~
ALFGTKDTTTAHSDYEIILEGGSSSWGQVKGRAKVNVPAAIPLLPTDCNIRIDAKPLDAQKGVVRFTTKIESVVDSVKNT
LNVEVDIANETKDRRIAVGEGSLSVGDFSHSFSFEGQVVNMYYYRSDAVRRNIPNPIYMQGRQFHDILMKVPLDNNDLVD
TWEGFQQSISGGGANFGDWIREFWFIGPAFAAINEGGQRISPIVVNSSNVEGGEKGPVGVTRWKFSHAGSGVVDSISRWT
ELFPVEQLNKPASIEGGFRSDSQGIEVKVDGNLPGVSRDAGGGLRRILNHPLIPLVHHGMVGKFNDFTVDTQLKIVLPKG
YKIRYAAPQFRSQNLEEYRWSGGAYARWVEHVCKGGTGQFEVLYAQ
>P9WID9 1.11.1.24~~~bcpB~~~Putative peroxiredoxin Rv1608c~~~COG1225
MKTGDTVADFELPDQTGTPRRLSVLLSDGPVVLFFYPAAMTPGCTKEACHFRDLAKEFAEVRASRVGISTDPVRKQAKFA
EVRRFDYPLLSDAQGTVAAQFGVKRGLLGKLMPVKRTTFVIDTDRKVLDVISSEFSMDAHADKALATLRAIRSG
>Q45087 3.1.4.-~~~pehA~~~Multifunctional alkaline phosphatase superfamily protein PehA~~~
MTRKNVLLIVVDQWRADFIPHLMRAEGREPFLKTPNLDRLCREGLTFRNHVTTCVPCGPARASLLTGLYLMNHRAVQNTV
PLDQRHLNLGKALRAIGYDPALIGYTTTTPDPRTTSARDPRFTVLGDIMDGFRSVGAFEPNMEGYFGWVAQNGFELPENR
EDIWLPEGEHSVPGATDKPSRIPKEFSDSTFFTERALTYLKGRDGKPFFLHLGYYRPHPPFVASAPYHAMYKAEDMPAPI
RAENPDAEAAQHPLMKHYIDHIRRGSFFHGAEGSGATLDEGEIRQMRATYCGLITEIDDCLGRVFAYLDETGQWDDTLII
FTSDHGEQLGDHHLLGKIGYNAESFRIPLVIKDAGQNRHAGQIEEGFSESIDVMPTILEWLGGETPRACDGRSLLPFLAE
GKPSDWRTELHYEFDFRDVFYDQPQNSVQLSQDDCSLCVIEDENYKYVHFAALPPLFFDLKADPHEFSNLAGDPAYAALV
RDYAQKALSWRLSHADRTLTHYRSSPQGLTTRNH
>Q83CY8 1.11.1.24~~~bcp~~~Putative peroxiredoxin bcp~~~COG1225
MSIEVGQKAPIFTLPTDEGEMLSLDDLKGKKVILYFYPKDDTPGCTKEACGFRDVWSQLSKAGVVVLGISKDSVKAHQSF
KQKYNLPFTLLSDKDNTVCEQYGVMVDKNRFGKKYKGIERTTFLIDEEGVISAVWPKVKVDGHVAEVVGRL
>P0AE52 1.11.1.24~~~bcp~~~Peroxiredoxin Bcp~~~COG1225
MNPLKAGDIAPKFSLPDQDGEQVNLTDFQGQRVLVYFYPKAMTPGCTVQACGLRDNMDELKKAGVDVLGISTDKPEKLSR
FAEKELLNFTLLSDEDHQVCEQFGVWGEKSFMGKTYDGIHRISFLIDADGKIEHVFDDFKTSNHHDVVLNWLKEHA
>P9WIE1 1.11.1.24~~~bcp~~~Putative peroxiredoxin Rv2521~~~COG1225
MTKTTRLTPGDKAPAFTLPDADGNNVSLADYRGRRVIVYFYPAASTPGCTKQACDFRDNLGDFTTAGLNVVGISPDKPEK
LATFRDAQGLTFPLLSDPDREVLTAWGAYGEKQMYGKTVQGVIRSTFVVDEDGKIVVAQYNVKATGHVAKLRRDLSV
>Q8P9V9 1.11.1.24~~~bcp~~~Peroxiredoxin Bcp~~~COG1225
MTDAVLELPAATFDLPLSLSGGTQTTLRAHAGHWLVIYFYPKDSTPGCTTEGLDFNALLPEFDKAGAKILGVSRDSVKSH
DNFCAKQGFAFPLVSDGDEALCRAFDVIKEKNMYGKQVLGIERSTFLLSPEGQVVQAWRKVKVAGHADAVLAALKAHAKQ
>Q5WNX0 ~~~bcrA~~~Bacitracin transport ATP-binding protein BcrA~~~
MMIMEYVIETENLTKQYGETTVVNKINLHVPKGKIYGLLGRNGAGKTTAMKMMLQLAFPTDGTVRLFGTNYKENIHTLYS
KVGSIIETPGFYSNLTGYENLQILAKLRGGVSKSGVEKALEVVGLHKEKRKVFSDYSLGMKQRLGIAAAIMHEPELLILD
EPINGLDPIGISEIRSFLSKLSHENGTTIFISSHVLSEIEQIADVIGVMHEGHLVEEVNISELHKRNRKYTEFDVSDGKI
AAKILESSYHMTDFTVQDGTIRIYDFSQSVGEINREFARNGLLITRINDSEENLEDYFSKLIGGGGIA
>O87876 1.3.7.8~~~bcrA~~~Benzoyl-CoA reductase subunit A~~~
MECFVGIDLGSTTTKAVVMDDKGQVLGRGITNSRSNYDTAARVSKLEAFIDARLSLIRRELDKEPAVAGRVDEIIDGLTR
NFRREQFIEQLGDLEQTCVANVEGPRFAGKEKAIVGALTEVFRRLREEEADKLFAPDAQRKSDFFRDLAGSRFMQIGEEV
ARANGVEFDHLLHMYDKSIIEVENRPPSADMNRKFRSAMERVRGEMSSALDTAALGAPIDAALEIDMSERYVVGTGYGRV
RLPFPKEHIRSEILCHGLGAHLMYPKTRTVLDIGGQDTKGIQIDDKGIVVNFQMNDRCAAGTGRYLGYVADEMNMGLHEL
GPLAMKSTKSIRINSTCTVFAGAELRDRLALGDKREDILAGLHRAIMLRAMSIISRSGGITDQFTFTGGVAKNEAAVKEL
RQLVKENYGEVQINIDPDSIYTGALGASEFARRAVVEA
>Q5WNX1 ~~~bcrB~~~Bacitracin transport permease protein BcrB~~~
MLNLISCELSKLKRSKMVLISVAGVLSTPLLMLIEALQTHFDKPEIIFTLSDIYSDSVLYIMLLVNMMIYVAIAAYLYSR
EYTENTLKTILPIPISRTKLLIGKFCTLLLWIVMLTLVTWAGIFIVCGLYHVVFTLEGYSLLVAISWLPKFLFGGILMFL
TTSPFVFIAFKTKGFVAPVIASAVIVMGSVALSNQELGALYPWTATFFLIDGRIESTGYPLALAIGIIILVSAVGFFMTF
HHFKKEDLK
>O87875 1.3.7.8~~~bcrB~~~Benzoyl-CoA reductase subunit B~~~
MSAKTNPEVIKESSMVKQKEMIAGNYDRLTGTKESGEKVVSTFVPGNLNELIMCFDMVNNLPETNAIQNGMRKQSGGMIM
DAEKAGHSEDVCTYVKADIGMMGRGNIAPNGKPMPAPDMLLLSYTGCFTFMKWFELLRHEYKCPTVMLQIPYQGDGKITK
NMRDFVVKQLKEEVIPMFEQVSGVKFDIDRLREYLKNSAKAEDDLVWVLESAKNRPSPIDAYFGGVYYIGPMFTAFRGTA
DAVEYYGLLRGEIEQRIREGKGPITPEGDMKEEKYRLVVEGPPNWTSFREFWKLFYDEGAVVVASSYTKVGGLYDQGFRH
DPNDPLGTLADYCLGCYTNNNLPQRVELLEKYMNEYQADGLLINSIKSCNSFSAGQLLMMREIEKRTGKPAAFIETDLVD
PRYFSHANVKNRLESYFQMVDQKRSGASLATA
>P94571 3.6.1.27~~~bcrC~~~Undecaprenyl-diphosphatase BcrC~~~COG0671
MNYEIFKAIHGLSHHNSVLDSIMVFITEYAIVAYALILLAIWLFGNTQSRKHVLYAGITGIAGLVINYLITLVYFEPRPF
VAHTVHTLIPHAADASFPSDHTTGALAISIAMLFRNRKIGWPLVIFGLLTGFSRIWVGHHYPVDVLGSLVVAIIIGFLFF
RFSDLLRPFVDLVVRIYEAIINKLTKKPTDQNF
>O87874 1.3.7.8~~~bcrC~~~Benzoyl-CoA reductase subunit C~~~
MSTADIIARCEALYEDLDFTAARQWKEADPSRKVIAYMPVYVPREIIHAAGMLPLGIMGGGDGLEVIHGDAFYQSYICRI
PRSTIELGLSKRMDFVDGMLFPSICDVIRNLSGMWKLMFPGKYVRYFDVPQNYRDDVGGNYYTAELNELREGLEHLSGRK
ITDDALRASIKVYNENRKLVQDVYGLRSREPWKVPSADVYLLMRAGLVLPVEEHNQMLKDYLAAAVKVEAQKRDNCRVII
NGSFCEQPPLNLIKSIELSGCYIVDDDYMIVHRFLRNEVSTAGDPMQNLSLAFLHESISTAAKYDDKEEDKGKYLLEQVR
TNAAEGVIFAAPSFCDPALLERPMLADRCSENKVPYISFKYAENSGQMQPIREQAGTFADSIKLWS
>O87877 1.3.7.8~~~bcrD~~~Benzoyl-CoA reductase subunit D~~~
MTITAGIDIGTGAVKTVLFRVEGDKTEWLAKRNDRIRQRDPFKLAEEAYNGLLEEAGLKASDVDYVATTGEGESLAFHTG
HFYSMTTHARGAVYLNPEARAVLDIGALHGRAIRNDERGKVETYKMTSQCASGSGQFLENIARYLGIAQDEIGSLSTQAD
NPEVVSSICAVLAETDVINMVSRGISAPNILKGIHISMAGRLAKLLKSVGARDGVVLCTGGLALDEGLLKTLNESIQEQK
MAVVAYNHPDSPYAGAIGAALWGAFRHEKLARLGQQQVAEAA
>Q5WNW9 ~~~bcrR~~~HTH-type transcriptional activator BcrR~~~
MEFNEKLQQLRTGKNLTQEQLAEQLYVSRTAISKWESGKGYPNMESLKCISKFFSVTIDELLSGEELITLAETENRSNLK
KIYNYIYGILDMMAVAFIFLPLYGNSVGGYVYAVNLLSFTATTPFNLAVYWSAFAALIIIGIGKIISTHLDKEKWGGIAT
KCSLTITALAVCFFAAAREPYITVLVFLLLIGKIFVWIKQMGMK
>P28246 ~~~bcr~~~Bicyclomycin resistance protein~~~COG2814
MTTRQHSSFAIVFILGLLAMLMPLSIDMYLPALPVISAQFGVPAGSTQMTLSTYILGFALGQLIYGPMADSFGRKPVVLG
GTLVFAAAAVACALANTIDQLIVMRFFHGLAAAAASVVINALMRDIYPKEEFSRMMSFVMLVTTIAPLMAPIVGGWVLVW
LSWHYIFWILALAAILASAMIFFLIKETLPPERRQPFHIRTTIGNFAALFRHKRVLSYMLASGFSFAGMFSFLSAGPFVY
IEINHVAPENFGYYFALNIVFLFVMTIFNSRFVRRIGALNMFRSGLWIQFIMAAWMVISALLGLGFWSLVVGVAAFVGCV
SMVSSNAMAVILDEFPHMAGTASSLAGTFRFGIGAIVGALLSLATFNSAWPMIWSIAFCATSSILFCLYASRPKKR
>Q48230 ~~~bcs1~~~Bifunctional ribulose 5-phosphate reductase/CDP-ribitol pyrophosphorylase Bcs1~~~
MNKNKNIGIILAGGVGSRMGLGYPKQFSKIAGKTALEHTLAIFQEHKEIDEIIIVSERTSYRRIEDIVSKLDFSKVNRII
FGGKERSDSTLSAITALQDEPENTKLIIHDAVRPLLATEIISECIAKLDKYNAVDVAIPAVDTIVHVNNDTQEIIKIPKR
AEYYQGQTPQAFKLGTLKKAYDIYTQGGIEGTCDCSIVLKTLPEERVGIVSGSETNIKLTRPVDLFIADKLFQSRSHFSL
RNITSIDRLYDMKDQVLVVIGGSYGIGAHIIDIAKKFGIKTYSLSRSNGVDVGDVKSIEKAFAEIYAKEHKIDHIVNTAA
VLNHKTLVSMSYEEILTSINVNYTGMINAVITAYPYLKQTHGSFLGFTSSSYTRGRPFYAIYSSAKAAVVNLTQAISEEW
LPDNIKINCVNPERTKTPMRTKAFGIEPEGTLLDAKTVAFASLVVLASRETGNIIDVVLKDEEYITNILADLYK
>P19449 2.4.1.12~~~bcsA~~~Cellulose synthase catalytic subunit [UDP-forming]~~~
MSEVQSPVPAESRLDRFSNKILSLRGANYIVGALGLCALIAATTVTLSINEQLIVALVCVLVFFIVGRGKSRRTQIFLEV
LSALVSLRYLTWRLTETLDFDTWIQGGLGVTLLMAELYALYMLFLSYFQTIQPLHRAPLPLPDNVDDWPTVDIFIPTYDE
QLSIVRLTVLGALGIDWPPDKVNVYILDDGVRPEFEQFAKDCGALYIGRVDSSHAKAGNLNHAIKRTSGDYILILDCDHI
PTRAFLQIAMGWMVADRKIALMQTPHHFYSPDPFQRNLAVGYRTPPEGNLFYGVIQDGNDFWDATFFCGSCAILRREAIE
SIGGFAVETVTEDAHTALRMQRRGWSTAYLRIPVASGLATERLTTHIGQRMRWARGMIQIFRVDNPMLGGGLKLGQRLCY
LSAMTSFFFAIPRVIFLASPLAFLFFGQNIIAASPLAVLAYAIPHMFHSIATAAKVNKGWRYSFWSEVYETTMALFLVRV
TIITLMFPSKGKFNVTEKGGVLEEEEFDLGATYPNIIFAGIMTLGLLIGLFELTFHFNQLAGIAKRAYLLNCIWAMISLI
ILLAAIAVGRETKQVRYNHRVEAHIPVTVYEAPVAGQPNTYHNATPGMTQDVSMGGVAVHMPWPDVSTGPVKTRIHAVLD
GEEIDIPATMLRCKNGKAVFTWDNNDLDTERDIVRFVFGRADAWLQWNNYEDDRPLRSLWSLLLSIKALFRKKGKMMANS
RPKRKPLALPVERREPTTIQSGQTQEGKISRAAS
>P37653 2.4.1.12~~~bcsA~~~Cellulose synthase catalytic subunit [UDP-forming]~~~COG1215
MSILTRWLLIPPVNARLIGRYRDYRRHGASAFSATLGCFWMILAWIFIPLEHPRWQRIRAEHKNLYPHINASRPRPLDPV
RYLIQTCWLLIGASRKETPKPRRRAFSGLQNIRGRYHQWMNELPERVSHKTQHLDEKKELGHLSAGARRLILGIIVTFSL
ILALICVTQPFNPLAQFIFLMLLWGVALIVRRMPGRFSALMLIVLSLTVSCRYIWWRYTSTLNWDDPVSLVCGLILLFAE
TYAWIVLVLGYFQVVWPLNRQPVPLPKDMSLWPSVDIFVPTYNEDLNVVKNTIYASLGIDWPKDKLNIWILDDGGREEFR
QFAQNVGVKYIARTTHEHAKAGNINNALKYAKGEFVSIFDCDHVPTRSFLQMTMGWFLKEKQLAMMQTPHHFFSPDPFER
NLGRFRKTPNEGTLFYGLVQDGNDMWDATFFCGSCAVIRRKPLDEIGGIAVETVTEDAHTSLRLHRRGYTSAYMRIPQAA
GLATESLSAHIGQRIRWARGMVQIFRLDNPLTGKGLKFAQRLCYVNAMFHFLSGIPRLIFLTAPLAFLLLHAYIIYAPAL
MIALFVLPHMIHASLTNSKIQGKYRHSFWSEIYETVLAWYIAPPTLVALINPHKGKFNVTAKGGLVEEEYVDWVISRPYI
FLVLLNLVGVAVGIWRYFYGPPTEMLTVVVSMVWVFYNLIVLGGAVAVSVESKQVRRSHRVEMTMPAAIAREDGHLFSCT
VQDFSDGGLGIKINGQAQILEGQKVNLLLKRGQQEYVFPTQVARVMGNEVGLKLMPLTTQQHIDFVQCTFARADTWALWQ
DSYPEDKPLESLLDILKLGFRGYRHLAEFAPSSVKGIFRVLTSLVSWVVSFIPRRPERSETAQPSDQALAQQ
>P37716 ~~~bcsB~~~Cyclic di-GMP-binding protein~~~
MKMVSLIALLVFATGAQAAPVASKAPAPQPAGSDLPPLPAAASQAATPAAASADQPATTAPAADAASASAADAVVDNAEN
AIAASDVATVHTYSLKELGAQSALKMQGAATLQGLQFGIPADQLVTSARLIVSGAMSPSLQPDTSAVTITLNEQFIGTLR
PDPTHPTFGPLSFDINPIFFITGNRLNFSFASSSKGCTDPSNGLLWASVSEHSELQITTIPLPPRRQLSRLPQPFFDKNV
KQKIVIPFVLAQTFDPEVLKATGILASWFGQQTDFRGVTFPVFSTIPQTGNAVVVGVADELPSALGRQAVNGPTLMEVAN
PSDPNGTVLLVTGRDRDEVITASKGIGFGSSALPTANRMDVAPIDVGARVAYDAPSFIPTNRPVRLGELVPDSALQAQGY
APGALSVPFRVSPDLYTWRDRPYKLNVRFRAPPGPIVDVSRSSLNVGINDTYLEAYPLREPDSTLDQILRRVGLGRGDDS
VQKHTMPIPPYRVFGQNQLLFYFEMAAMAEPGCKPGPSTFHMSVDPDSTIDLSNSYHITRMPNLAFMASAGYPFTTYADL
SRSAVVLPDHPNGMVVSAYLDLMGFMGATTWYPVSGVDVVSSDHVNDVADRNLIVLSTLANSGDVSQLLSKSSYQISDGR
LHMGLRSTLSGVWNLFQDPMSGISNTAPTDVESTLTGGVAAMIEAESPLASGRTVLALLSGDGQGLNNLVQILAQRKNQA
KIQGDLVLAHGDDLTSYRSSPLYTVGTVPLWLEPDWYMHNHPSRVIVVGLLGCILIVAVMVRALAKHALRRRRELQEERQ
RT
>P37652 ~~~bcsB~~~Cyclic di-GMP-binding protein~~~COG1215
MKRKLFWICAVAMGMSAFPSFMTQATPATQPLINAEPAVAAQTEQNPQVGQVMPGVQGADAPVVAQNGPSRDVKLTFAQI
APPPGSMVLRGINPNGSIEFGMRSDEVVTKAMLNLEYTPSPSLLPVQSQLKVYLNDELMGVLPVTKEQLGKKTLAQMPIN
PLFISDFNRVRLEFVGHYQDVCEKPASTTLWLDVGRSSGLDLTYQTLNVKNDLSHFPVPFFDPSDNRTNTLPMVFAGAPD
VGLQQASAIVASWFGSRSGWRGQNFPVLYNQLPDRNAIVFATNDKRPDFLRDHPAVKAPVIEMINHPQNPYVKLLVVFGR
DDKDLLQAAKGIAQGNILFRGESVVVNEVKPLLPRKPYDAPNWVRTDRPVTFGELKTYEEQLQSSGLEPAAINVSLNLPP
DLYLMRSTGIDMDINYRYTMPPVKDSSRMDISLNNQFLQSFNLSSKQEANRLLLRIPVLQGLLDGKTDVSIPALKLGATN
QLRFDFEYMNPMPGGSVDNCITFQPVQNHVVIGDDSTIDFSKYYHFIPMPDLRAFANAGFPFSRMADLSQTITVMPKAPN
EAQMETLLNTVGFIGAQTGFPAINLTVTDDGSTIQGKDADIMIIGGIPDKLKDDKQIDLLVQATESWVKTPMRQTPFPGI
VPDESDRAAETRSTLTSSGAMAAVIGFQSPYNDQRSVIALLADSPRGYEMLNDAVNDSGKRATMFGSVAVIRESGINSLR
VGDVYYVGHLPWFERVWYALANHPILLAVLAAISVILLAWVLWRLLRIISRRRLNPDNE
>P37650 ~~~bcsC~~~Cellulose synthase operon protein C~~~COG0457
MRKFTLNIFTLSLGLAVMPMVEAAPTAQQQLLEQVRLGEATHREDLVQQSLYRLELIDPNNPDVVAARFRSLLRQGDIDG
AQKQLDRLSQLAPSSNAYKSSRTTMLLSTPDGRQALQQARLQATTGHAEEAVASYNKLFNGAPPEGDIAVEYWSTVAKIP
ARRGEAINQLKRINADAPGNTGLQNNLALLLFSSDRRDEGFAVLEQMAKSNAGREGASKIWYGQIKDMPVSDASVSALKK
YLSIFSDGDSVAAAQSQLAEQQKQLADPAFRARAQGLAAVDSGMAGKAIPELQQAVRANPKDSEALGALGQAYSQKGDRA
NAVANLEKALALDPHSSNNDKWNSLLKVNRYWLAIQQGDAALKANNPDRAERLFQQARNVDNTDSYAVLGLGDVAMARKD
YPAAERYYQQTLRMDSGNTNAVRGLANIYRQQSPEKAEAFIASLSASQRRSIDDIERSLQNDRLAQQAEALENQGKWAQA
AALQRQRLALDPGSVWITYRLSQDLWQAGQRSQADTLMRNLAQQKSNDPEQVYAYGLYLSGHDQDRAALAHINSLPRAQW
NSNIQELVNRLQSDQVLETANRLRESGKEAEAEAMLRQQPPSTRIDLTLADWAQQRRDYTAARAAYQNVLTREPANADAI
LGLTEVDIAAGDKAAARSQLAKLPATDNASLNTQRRVALAQAQLGDTAAAQRTFNKLIPQAKSQPPSMESAMVLRDGAKF
EAQAGDPTQALETYKDAMVASGVTTTRPQDNDTFTRLTRNDEKDDWLKRGVRSDAADLYRQQDLNVTLEHDYWGSSGTGG
YSDLKAHTTMLQVDAPYSDGRMFFRSDFVNMNVGSFSTNADGKWDDNWGTCTLQDCSGNRSQSDSGASVAVGWRNDVWSW
DIGTTPMGFNVVDVVGGISYSDDIGPLGYTVNAHRRPISSSLLAFGGQKDSPSNTGKKWGGVRADGVGLSLSYDKGEANG
VWASLSGDQLTGKNVEDNWRVRWMTGYYYKVINQNNRRVTIGLNNMIWHYDKDLSGYSLGQGGYYSPQEYLSFAIPVMWR
ERTENWSWELGASGSWSHSRTKTMPRYPLMNLIPTDWQEEAARQSNDGGSSQGFGYTARALLERRVTSNWFVGTAIDIQQ
AKDYAPSHFLLYVRYSAAGWQGDMDLPPQPLIPYADW
>P37657 ~~~bcsE~~~Cyclic di-GMP binding protein BcsE~~~
MRDIVDPVFSIGISSLWDELRHMPAGGVWWFNVDRHEDAISLANQTIASQAETAHVAVISMDSDPAKIFQLDDSQGPEKI
KLFSMLNHEKGLYYLTRDLQCSIDPHNYLFILVCANNAWQNIPAERLRSWLDKMNKWSRLNHCSLLVINPGNNNDKQFSL
LLEEYRSLFGLASLRFQGDQHLLDIAFWCNEKGVSARQQLSVQQQNGIWTLVQSEEAEIQPRSDEKRILSNVAVLEGAPP
LSEHWQLFNNNEVLFNEARTAQAATVVFSLQQNAQIEPLARSIHTLRRQRGSAMKILVRENTASLRATDERLLLACGANM
VIPWNAPLSRCLTMIESVQGQKFSRYVPEDITTLLSMTQPLKLRGFQKWDVFCNAVNNMMNNPLLPAHGKGVLVALRPVP
GIRVEQALTLCRPNRTGDIMTIGGNRLVLFLSFCRINDLDTALNHIFPLPTGDIFSNRMVWFEDDQISAELVQMRLLAPE
QWGMPLPLTQSSKPVINAEHDGRHWRRIPEPMRLLDDAVERSS
>A6TFD9 ~~~bcsE~~~Cyclic di-GMP binding protein BcsE~~~
MDNVFTLGISSLWDEVCHMPVGGVWWLNVDRYADAVSLFNQTLAAQAKNSHVAALVMGNKPKDIISLDHTHGPDNIALFT
LPNRPQALEEIHRDLVCSLEPGNYLFILLCAENAWQNINNEKLCAWVEKTSRWAQYHRCAFLAINSAQDIDRQLTPLLRE
YRSLSGLASIRYQGDRHIFDIAWWGSDKGISAQQQLMVQHDDAGWRLAQDAETSVQPRSDEKAILSHVRVLEGAPPLSEY
WTLFDTNDEVFNAGRTAQAATILFSITQNTQIEQLGRYIHTLRRQRGTALKIIVREQTPSLRATDERLLLSSGASLVIPS
SASLSRCLTLIESVQNQKFSRHIPEDFATLLTWSQPLKLRGYQKWDAFCEAVHNVMTNTLLPPDSKGVMVALRPAPGLRV
EQALTLCKPNRMGDIMTIGNNRLVLFLSFCRINDLDTALNHIFPLPTGDIFSNRMVWFEDKQILSEIVIMRGVEPARWNT
PLPLSVGKNETINATHDGRHWRRYPEPHRLTTREEQA
>Q8ZLB5 ~~~bcsE~~~Cyclic di-GMP binding protein BcsE~~~
MRDTVDPVFSLGISSLWDELRHMPTGGVWWVNADRQQDAISLVNQTIASQTENANVAVIGMEGDPGKVIKLDESHGPEKI
RLFTMPASEKGLYSLPHDLLCSVNPTHYFFILICANNTWRNITSESLHKWLEKMNKWTRFHHCSLLVINPCNNSDKQSSL
LMGEYRSLFGLASLRFQGDQHLFDIAFWCNEKGVSARQQLLLCQQDERWTLSHQEETAIQPRSDEKRILSHVAVLEGAPP
LSEHWTLFDNNEALFNDARTAQAATIIFSLTQNNQIEPLARRIHTLRRQRGSALKIVVRENIASLRATDERLLLGCGANM
IIPWNAPLSRCLTLIESVQGQQFSRYVPEDITTLLSMTQPLKLRGFQPWDTFCDAIHTMMSNTLLPADGKGVLVALRPVP
GIRVEQALTLCRPNRTGDIMTIGGNRLVLFLSFCRVNDLDTALNHIFPLPTGDIFSNRMVWFEDKQISAELVQMRLLSPE
LWGTPLPLAKRADPVINAEHDGRIWRRIPEPLRLLDDTAERAS
>P37659 ~~~bcsG~~~Cellulose biosynthesis protein BcsG~~~COG2194
MTQFTQNTAMPSSLWQYWRGLSGWNFYFLVKFGLLWAGYLNFHPLLNLVFAAFLLMPLPRYSLHRLRHWIALPIGFALFW
HDTWLPGPESIMSQGSQVAGFSTDYLIDLVTRFINWQMIGAIFVLLVAWLFLSQWIRITVFVVAILLWLNVLTLAGPSFS
LWPAGQPTTTVTTTGGNAAATVAATGGAPVVGDMPAQTAPPTTANLNAWLNNFYNAEAKRKSTFPSSLPADAQPFELLVI
NICSLSWSDIEAAGLMSHPLWSHFDIEFKNFNSATSYSGPAAIRLLRASCGQTSHTNLYQPANNDCYLFDNLSKLGFTQH
LMMGHNGQFGGFLKEVRENGGMQSELMDQTNLPVILLGFDGSPVYDDTAVLNRWLDVTEKDKNSRSATFYNTLPLHDGNH
YPGVSKTADYKARAQKFFDELDAFFTELEKSGRKVMVVVVPEHGGALKGDRMQVSGLRDIPSPSITDVPVGVKFFGMKAP
HQGAPIVIEQPSSFLAISDLVVRVLDGKIFTEDNVDWKKLTSGLPQTAPVSENSNAVVIQYQDKPYVRLNGGDWVPYPQ
>Q7CPI7 ~~~bcsG~~~Cellulose biosynthesis protein BcsG~~~
MTQHTQTPSMPSPLWQYWRGLSGWNFYFLVKFGLLWAGYLNFHPLLNLVFMAFLLMPIPKYRLHRLRHWIAIPVGFALFW
HDTWLPGPQSIMSQGTQVAEFSSGYLLDLIARFINWQMIGAIFVLLVAWLFLSQWIRVTVFVVAIMVWLNVLTLTGPVFT
LWPAGQPTDTVTTTGGNAAATVATAGDKPVIGDMPAQTAPPTTANLNAWLNTFYAAEEKRKTTFPAQLPPDAQPFDLLVI
NICSLSWSDVEAAGLMSHPLWSHFDILFKHFNSGTSYSGPAAIRLLRASCGQPSHTRLYQPANNECYLFDNLAKLGFTQH
LMMDHNGEFGGFLKEVRENGGMQSELMNQSGLPTALLSFDGSPVYDDLAVLNRWLTGEEREANSRSATFFNLLPLHDGNH
FPGVSKTADYKIRAQKLFDELDAFFTELEKSGRKVMVVVVPEHGGALKGDRMQISGLRDIPSPSITNVPAGVKFFGMKAP
HEGAPIDINQPSSYLAISELVVRAVDGKLFTEDSVNWNKLTSNLPQTAPVSENANAVVIQYQGKPYVRLNGGDWVPYPQ
>P0DP92 ~~~bcsQ~~~Cellulose biosynthesis protein BcsQ~~~
MAVLGLQGVRGGVGTTTITAALAWSLQMLGENVLVVDACPDNLLRLSFNVDFTHRQGWARAMLDGQDWRDAGLRYTSQLD
LLPFGQLSIEEQENPQHWQTRLSDICSGLQQLKASGRYQWILIDLPRDASQITHQLLSLCDHSLAIVNVDANCHIRLHQQ
ALPDGAHILINDFRIGSQVQDDIYQLWLQSQRRLLPMLIHRDEAMAECLAAKQPVGEYRSDALAAEEILTLANWCLLNYS
GLKTPVGSAS
>Q0AVM5 2.8.3.-~~~~~~Probable butyrate:acetyl-CoA coenzyme A-transferase~~~COG0427
MYQKLLEEYKSKLVTADEAAKQVKSGDWVEYGFGINCARDFDEALAKRKDELEDVKIRCDIGAYQHFTAEVDPDNKHFTW
NSWHVAGHDRKFINKNLFYIPMKFHENPMMTRKDCVPTNVAVIQCTAMDKHGYFNFGGSSVNCCAMMETARVTILEVNEK
MPRCLGGNQECLHISQVDYIIQSKNEPIATIGSAEPSPVEIAMAQHIIERLYDGNCIQLGIGGTPNAVGSMVAASDLKDL
GVHTEMYVDAYLLMAKAGKITGARKSIDKYKQVYSFAMGSQELYDYIDDNPGLASYSVDYTNNPWVVAQIDDFVSINACI
EVDLYGQVCAESVGTRHISGTGGQLDFVEGAYKSKNGQSFICLPSTIEIKGEVTSRIKPILTPGAIVTDPRTATHMMVTE
FGIATLKGRSTWERAEELIKIAHPDFQDELVKEAQKMNIWRKSNKIG
>P68571 ~~~bdbB~~~SPbeta prophage-derived disulfide bond formation protein B~~~COG1495
MNTRYVKSFFLLLFFLSFFGTMASLFYSEIMHFKPCVLCWYQRIFLYPIPIILLIGLLKKDLNSIFYVVFLSSIGLIIAF
YHYIIQLTQSKSVVCEIGTNSCAKIEVEYLGFITLPLMSSVCFALIFGIGLKLIIKSKKLKQNQHVYN
>O32217 ~~~bdbC~~~Disulfide bond formation protein C~~~COG1495
MKNRIVFLYASWVVALIAMLGSLYFSEIRKFIPCELCWYQRILMYPLVLILGIATFQGDTRVKKYVLPMAIIGAFISIMH
YLEQKVPGFSGIKPCVSGVPCSGQYINWFGFITIPFLALIAFILIIIFMCLLKGEKSE
>O32218 ~~~bdbD~~~Disulfide bond formation protein D~~~COG1651
MKKKQQSSAKFAVILTVVVVVLLAAIVIINNKTEQGNDAVSGQPSIKGQPVLGKDDAPVTVVEFGDYKCPSCKVFNSDIF
PKIQKDFIDKGDVKFSFVNVMFHGKGSRLAALASEEVWKEDPDSFWDFHEKLFEKQPDTEQEWVTPGLLGDLAKSTTKIK
PETLKENLDKETFASQVEKDSDLNQKMNIQATPTIYVNDKVIKNFADYDEIKETIEKELKGK
>Q44856 ~~~bdb~~~Disulfide bond formation protein~~~
MRAKWLWMTAVGSLLITVLTAWGWAAASSQDSKIVYVFSDSCGYCQTFRPTLETVLQEYPQTSVERLDIREERDLKEALR
LGAEATPTIFVVRDGTVMDKLEGDVAEAVLRSFFQKKS
>P39333 ~~~bdcA~~~Cyclic-di-GMP-binding biofilm dispersal mediator protein~~~COG1028
MGAFTGKTVLILGGSRGIGAAIVRRFVTDGANVRFTYAGSKDAAKRLAQETGATAVFTDSADRDAVIDVVRKSGALDILV
VNAGIGVFGEALELNADDIDRLFKINIHAPYHASVEAARQMPEGGRILIIGSVNGDRMPVAGMAAYAASKSALQGMARGL
ARDFGPRGITINVVQPGPIDTDANPANGPMRDMLHSLMAIKRHGQPEEVAGMVAWLAGPEASFVTGAMHTIDGAFGA
>P39334 ~~~bdcR~~~HTH-type transcriptional repressor BdcR~~~COG1309
MVTKKQSRVPGRPRRFAPEQAISAAKVLFHQKGFDAVSVAEVTDYLGINPPSLYAAFGSKAGLFSRVLNEYVGTEAIPLA
DILRDDRPVGECLVEVLKEAARRYSQNGGCAGCMVLEGIHSHDPQARDIAVQYYHAAETTIYDYIARRHPQRAQCVTDFM
STVMSGLSAKAREGHSIEQLCATAAMAGEAIKTILEE
>O86372 3.2.1.23~~~~~~Beta-D-galactosidase Rv1717~~~COG3837
MKLTRASQAPRYVAPAHHEVSTMRLQGREAGRTERFWVGLSVYRPGGTAEPAPTREETVYVVLDGELVVTVDGAETVLGW
LDSVHLAKGELRSIHNRTDRQALLLVTVAHPVAEVA
>O34788 1.1.1.4~~~bdhA~~~(R,R)-butanediol dehydrogenase~~~COG1063
MKAARWHNQKDIRIEHIEEPKTEPGKVKIKVKWCGICGSDLHEYLGGPIFIPVDKPHPLTNETAPVTMGHEFSGEVVEVG
EGVENYKVGDRVVVEPIFATHGHQGAYNLDEQMGFLGLAGGGGGFSEYVSVDEELLFKLPDELSYEQGALVEPSAVALYA
VRSSKLKAGDKAAVFGCGPIGLLVIEALKAAGATDIYAVELSPERQQKAEELGAIIVDPSKTDDVVAEIAERTGGGVDVA
FEVTGVPVVLRQAIQSTTIAGETVIVSIWEKGAEIHPNDIVIKERTVKGIIGYRDIFPAVLSLMKEGYFSADKLVTKKIV
LDDLIEEGFGALIKEKSQVKILVRPN
>O86034 1.1.1.30~~~bdhA~~~D-beta-hydroxybutyrate dehydrogenase~~~COG1028
MTKTAVITGSTSGIGLAIARTLAKAGANIVLNGFGAPDEIRTVTDEVAGLSSGTVLHHPADMTKPSEIADMMAMVADRFG
GADILVNNAGVQFVEKIEDFPVEQWDRIIAVNLSSSFHTIRGAIPPMKKKGWGRIINIASAHGLVASPFKSAYVAAKHGI
MGLTKTVALEVAESGVTVNSICPGYVLTPLVEKQIPDQARTRGITEEQVINEVMLKGQPTKKFITVEQVASLALYLAGDD
AAQITGTHVSMDGGWTAQ
>Q5FA46 1.1.1.4~~~budC~~~(R,R)-butanediol dehydrogenase~~~
MKAARFYNKGDIRIEDIPEPTVAPGTVGINVAWCGICGTDLHEFMEGPIFIPPCGHPHPISGESAPVTMGHEFSGVVYAV
GEGVDDIKVGQHVVVEPYIIRDDVPTGEGSNYHLSKDMNFIGLGGCGGGLSEKIAVKRRWVHPISDKIPLDQAALIEPLS
VGHHAYVRSGAKAGDVALVGGAGPIGLLLAAVLKAKGIKVIITELSKARKDKARESGVADYILDPSEVDVVEEVKKLTNG
EGVDVAFECTSVNKVLDTLVEACKPAANLVIVSIWSHPATVNVHSVVMKELDVRGTIAYCNDHAETIKLVEEGKINLEPF
ITQRIKLDKLVSEGFERLIHNNESAVKIIVNPNL
>Q9AF95 1.1.2.9~~~bdh~~~1-butanol dehydrogenase (cytochrome c)~~~
MLTTTFARKREESVPLRKGIQRALLGLSCLVLSTTSFAAGGEWRTHGYDDAGTRYSPLAQITPDNAKELGLVWSYDLESS
RGVEATPIVVDGVMYVTAPWSVVHALDVRSGKRLWTYDPEVPREKGKNACCDVVNRGVAVHEGKVFVGSLDGRLVAIDAR
TGKRVWERNTLIDDDKPYTITGAPRVIKGKVVIGNGGAEFGVRGYITAYDPTAASRPGVVPGPGDPSLPFEDASMEAAAK
TWDPAGQVLGSGRRRHGVELDGLYRKAGFCCTSAPATPSPWSHRKRSPAGGDNLYTASIVALRPDTGEYVWHYQQTPADN
WDYTSTQDLILADIELGGKPRKVILHAPKNGFFFVIDRTDGKFISAQNFVPVNWATGYDENGRPIENPEGAWPGHLSMRF
PAPSARTNWHSMSYSPQTGLAYFPAQNIPLVLQEDKNWSYNQAQPGQAMAGIGWNLGMLVNPRPPASQPFGRLIAWDPVQ
QKEVWRKEHVSPWNGGTLVTAGNVVFQGTADARLLAFDARDGKELWSAPMGTGVIAAPVTYEVDGKQYVSIAVGWGGVYG
NFTRASERRTPGTVYTFALGGKAEMPAFTEYQLNNLVSGVDYNPDDVAEGTGLYVTNCVFCHGVPGVDKGGGIPNLGYST
AETIAHLDQFVFKGPFMPRGMPDFTGKLTPEQVEKIKAFILGTADAVRPKK
>Q9I3S1 ~~~bdlA~~~Biofilm dispersion protein BdlA~~~
MAALDRSMARVEFDPDGNITDANENFLTLLGYRRDEILGKPHRQLCDGAYAQSEDYRRFWERLRRGEHFSGRCKRITREG
RPLWLEATYNPVRDGQGRLLKVVKYASDIDAIVHQEHEMQSKLDALSRSMAMIEFDLDGNVLAANDNFLATMGYGRAELA
SANHRQFCEPGYRDGPQYADLWRRLNRGEYVTGQFRRVHRNGQPVWLEASYNPVYDADGKLYKVVKFASDVSDRMRRYQA
EADNAHQAHTLSTETRTVAEHGALIIQSAVEEMLKIANTLDASSLNIGELSQHSQQITSIVNTIREIAEQTNLLALNAAI
EAARAGDQGRGFAVVADEVRQLAERTSKSTKEIADMIGRIQTGTRSVIDDMQHSQEQARRGVELANEAGAAILGIRESTH
KVVEAVQQFSRTLNADL
>B2IZD3 3.6.5.5~~~~~~Bacterial dynamin-like protein~~~COG0699
MVNQVATDRFIQDLERVAQVRSEMSVCLNKLAETINKAELAGDSSSGKLSLERDIEDITIASKNLQQGVFRLLVLGDMKR
GKSTFLNALIGENLLPSDVNPCTAVLTVLRYGPEKKVTIHFNDGKSPQQLDFQNFKYKYTIDPAEAKKLEQEKKQAFPDV
DYAVVEYPLTLLQKGIEIVDSPGLNDTEARNELSLGYVNNCHAILFVMRASQPCTLGERRYLENYIKGRGLTVFFLVNAW
DQVRESLIDPDDVEELQASENRLRQVFNANLAEYCTVEGQNIYDERVFELSSIQALRRRLKNPQADLDGTGFPKFMDSLN
TFLTRERAIAELRQVRTLARLACNHTREAVARRIPLLEQDVNELKKRIDSVEPEFNKLTGIRDEFQKEIINTRDTQARTI
SESFRSYVLNLGNTFENDFLRYQPELNLFDFLSSGKREAFNAALQKAFEQYITDKSAAWTLTAEKDINAAFKELSRSASQ
YGASYNQITDQITEKLTGKDVKVHTTTTAEEDNSPGWAKWAMGLLSLSKGNLAGFALAGAGFDWKNILLNYFTVIGIGGI
ITAVTGILLGPIGFALLGLGVGFLQADQARRELVKTAKKELVKHLPQVAHEQSQVVYNAVKECFDSYEREVSKRINDDIV
SRKSELDNLVKQKQTREINRESEFNRLKNLQEDVIAQLQKIEAAYSNLLAYYS
>P76127 ~~~bdm~~~Protein bdm~~~
MFTYYQAENSTAEPALVNAIEQGLRAQHGVVTEDDILMELTKWVEASDNDILSDIYQQTINYVVSGQHPTL
>P50736 1.8.1.-~~~bdr~~~Bacilliredoxin reductase Bdr~~~COG0492
MIQEKAIIIGGGPCGLSAAIHLKQIGIDALVIEKGNVVNSIYNYPTHQTFFSSSEKLEIGDVAFITENRKPVRIQALSYY
REVVKRKNIRVNAFEMVRKVTKTQNNTFVIETSKETYTTPYCIIATGYYDHPNYMGVPGEDLPKVFHYFKEGHPYFDKDV
VVIGGKNSSVDAALELVKSGARVTVLYRGNEYSPSIKPWILPEFEALVRNGTIRMEFGACVEKITENEVVFRSGEKELIT
IKNDFVFAMTGYHPDHQFLEKIGVEIDKETGRPFFNEETMETNVEGVFIAGVIAAGNNANEIFIENGRFHGGHIAAEIAK
RENH
>Q07946 1.18.1.3~~~bedA~~~Benzene 1,2-dioxygenase system ferredoxin--NAD(+) reductase subunit~~~
MANHVAIIGNGVAGFTTAQALRAEGYEGRISLIGEEQHLPYDRPSLSKAVLDGSFEQPPRLAEADWYSEASIEMLTGSEV
TDLDTQKKMISLNDGSTISADAIVIATGSRARMLSLPGSQLPGVVTLRTYGDVQLLRDSWTPNTRLLIVGGGLIGCEVAT
TARKLGLSVTILEAGDELLVRVLGRRIGAWLRGLLTEQGVQVELKTGVSGFSGEGQLEKVMVNDGRSFIADNALICVGAD
PADQLARQAGLECDRGVVVDHRGATSAKGIFAVGDVATWPLHSGGKRSLETYMNAQRQATAVAKAILGKEVSAPQLPVSW
TEIAGHRMQMAGDIEGPGEYVLRGTLGIGSALLFRLLDGRIQAVVAVDAPRDFALANRLVEAQVIIEPEKLADVSNNMRD
IVRANEGNQK
>Q07944 1.14.12.3~~~bedC1~~~Benzene 1,2-dioxygenase subunit alpha~~~
MNQTETTPIRVRKNWKTSEIETLFDEQAGRIDPRIYTDEDLYQLELERVFARSWLLLGHETHIRKPGDYFTTYMGEDPVV
VVRQKDASIAVFLNQCRHRGMRICRSDAGNAKAFTCSYHGWAYDTAGNLINVPYEAESFACLDKKEWSPLKARVETYKGL
IFANWDENAIDLDTYLGEAKFYMDHMLDRTEAGTEVIPGIQKWVIPCNWKFAAEQFCSDMYHAGTTAHLSGIIAGLPEDL
ELADLAPPKFGKQYRASWGGHGSGFYIGDPNMMLAMMGPKVTSYLTEGPAAEKAAERLGSIERGTKIMLEHMTVFPTCSF
LPGVNTIRTWHPRGPNEVEVWAFTVVDADAPDDIKEEFRRQTLRTFSAGGVFEQDDGENWVEIQHILRGHKARSRPFNAE
MSMGQTVDNDPIYPGRISNNVYSEEAARGLYAHWLKMMTSPDWEALKATR
>Q07945 1.14.12.3~~~bedC2~~~Benzene 1,2-dioxygenase subunit beta~~~
MIDSVNRADLFLRKPAPVALELQNEIEQFYYWEAKLLNDRRFDEWFALLAKDIHYFMPIRTTRIMRDSRLEYSGLRDYAH
FDDDATMMKGRLRKITSDVSWSENPASRTRHIVSNVMIIPTEVEGEYEISSTFIVYRNRLERQLDIFAGERRDRLRRNKG
EAGFEIVNRTILIDQSTILANNLSFFF
>A0A2S3XLU2 ~~~befA~~~Beta cell expansion factor A~~~
MNKRNWLLALSLSLAFSPCYADWAKLKAAASDLGAAVSETSKEVWQDVSDFSKKSWASISAWGEEAFNTAGVWTDKSIAT
GKEWLKAADKELNEMLNPKTAKEARIAINTMADTALIRLFNEQPSAKLLFDKAYGYAVFDSRKFSLMLHTNQGAGVAVNR
KTGKHTYMKMFGAGLAAGIGGKFYQQVILFEDKARFDAFVTQGWEATSEVGVVAGKESAELTAKYNGGMAIYQIGEKGLL
LDANISGSKYWIDKDLTETSR
>Q9HYV7 2.3.1.207~~~~~~Beta-ketodecanoyl-[acyl-carrier-protein] synthase~~~
MESFNTFVRQYNDQHAEAIAKGELEALAESSSAFIEKASGIKSRFVMNKEGILDPQRMVPYLPERSNDEWSILCEMAVAA
AREALQRAGRSAADIDGVIVACSNLQRAYPAIAVEVQAALGIQGYGYDMNVACSSATFGIQAATTAIQTGQARAILMVNP
EICTGHLNFRDRDSHFIFGDACTAVIVERADLAVSKHQFDIVSTRLLTQFSNNIRNNFGFLNRADESGIGKRDKLFVQEG
RKVFKDVCPMVAELIGEHLAANEIQVAEVKRFWLHQANLNMNLLITRKLLGRDAEAHEAPVILDSYANTSSAGSVIALHK
HQDDLPSGAIGVLSSFGAGYSIGSVILRKH
>P07771 ~~~benC~~~Benzoate 1,2-dioxygenase electron transfer component~~~COG0543
MSLYLNRIPAMSNHQVALQFEDGVTRFIRIAQGETLSDAAYRQQINIPMDCREGACGTCRAFCESGNYDMPEDNYIEDAL
TPEEAQQGYVLACQCRPTSDAVFQIQASSEVCKTKIHHFEGTLARVENLSDSTITFDIQLDDGQPDIHFLAGQYVNVTLP
GTTETRSYSFSSQPGNRLTGFVVRNVPQGKMSEYLSVQAKAGDKMSFTGPFGSFYLRDVKRPVLMLAGGTGIAPFLSMLQ
VLEQKGSEHPVRLVFGVTQDCDLVALEQLDALQQKLPWFEYRTVVAHAESQHERKGYVTGHIEYDWLNGGEVDVYLCGPV
PMVEAVRSWLDTQGIQPANFLFEKFSAN
>O68014 ~~~benM~~~HTH-type transcriptional regulator BenM~~~COG0583
MELRHLRYFVAVVEEQSFTKAADKLCIAQPPLSRQIQNLEEELGIQLLERGSRPVKTTPEGHFFYQYAIKLLSNVDQMVS
MTKRIASVEKTIRIGFVGSLLFGLLPRIIHLYRQAHPNLRIELYEMGTKAQTEALKEGRIDAGFGRLKISDPAIKRTLLR
NERLMVAVHASHPLNQMKDKGVHLNDLIDEKILLYPSSPKPNFSTHVMNIFSDHGLEPTKINEVREVQLALGLVAAGEGI
SLVPASTQSIQLFNLSYVPLLDPDAITPIYIAVRNMEESTYIYSLYETIRQIYAYEGFTEPPNW
>Q6G2A9 2.7.7.108~~~bepA~~~Protein adenylyltransferase~~~COG2184
MPKAKAKTKNTEIISPHHYVYPNTTTLKNKYGIKNLNAFLEKCSHDTAKAMINLREESLPEYFDTAYLCHIHQQLFKNTF
EWAGYLRHIPFTFADGTTAAMPEMKRTGWKNAFAIGDEIQEGLQRLDQTLAEKNNLQGLTREEFNSEAIELFNSLNQLHP
FREGNGRTQRLFFENLAKAAGHQLNFSLITKERMMVASVAVAENGDLEPMQHLFEDISNPEKIRLLKEFMHTMKNTGRNV
NDRPVMVAKEGETYTGTYRGAGLEGFALNVKGAYIIGNIDHLPPEQLKILKPGDKITFTAPKAEELKKTLIPKETLVPLT
KLEIAEMVAEDAFVHTCRDQICSLSKIVYGSQGVLNKNIIEIIKNPSKGQQLATQIERTPYSVHSLAGFDLICFKTGARV
RAEKHVALLSCAVANFTHAVKHARQEITKEHQAEQNRLRQEVPMPSQSLQDLLSLPKEFQQKALGVSPLLQKELTSLLQK
VNSRLSSSEQRALRENNHETLAKNLGVSEQKAKEITKTVMKAREVQQKSQTRTVSHSKTLAMAS
>P66948 3.4.-.-~~~bepA~~~Beta-barrel assembly-enhancing protease~~~COG4783
MFRQLKKNLVATLIAAMTIGQVAPAFADSADTLPDMGTSAGSTLSIGQEMQMGDYYVRQLRGSAPLINDPLLTQYINSLG
MRLVSHANSVKTPFHFFLINNDEINAFAFFGGNVVLHSALFRYSDNESQLASVMAHEISHVTQRHLARAMEDQQRSAPLT
WVGALGSILLAMASPQAGMAALTGTLAGTRQGMISFTQQNEQEADRIGIQVLQRSGFDPQAMPTFLEKLLDQARYSSRPP
EILLTHPLPESRLADARNRANQMRPMVVQSSEDFYLAKARTLGMYNSGRNQLTSDLLDEWAKGNVRQQRAAQYGRALQAM
EANKYDEARKTLQPLLAAEPGNAWYLDLATDIDLGQNKANEAINRLKNARDLRTNPVLQLNLANAYLQGGQPQEAANILN
RYTFNNKDDSNGWDLLAQAEAALNNRDQELAARAEGYALAGRLDQAISLLSSASSQVKLGSLQQARYDARIDQLRQLQER
FKPYTKM
>Q8G0Y6 ~~~bepC~~~Outer membrane efflux protein BepC~~~
MRYTVFKACKELVAAAVLLSGTVLTGQAALSETLTGALVKAYKNNAPLNSSRAGVRIQDENVAIAKSAYRPQITGSYNIS
RGKTPATDYRTTGTVGIQLNQMLFDGFQTRNNVAAAETQVFAQRENLRNDEQNTLYQAVAAYMDVYQLRQIAALREKNLA
AMNEQVRAARARLDVGEGTRTDVAQAEASRSTAIAALNAARADVKTAEATYMQVVGSLPDKLTPASAARHLPQSPSQAYA
SALASHPGILATKYAVNAAGYNVKAKEGALLPTIGLTASASQLDTIAGTDMGDGNTASIGVGVNIPIYTGGRTSAQIRQS
KEQLGQARIEVDVVQDKVRQAISSAWSQLEAARASVAANRDGIAAAQLALDGVIEERKVGQRTTLDVLNAQNDLVAVQIA
LVQAEHDVVVASYALLNATGRMTADQLGLQVAQYKPEEHYKAVKDKWFGLRTPDGR
>Q8G2M7 ~~~bepD~~~Efflux pump periplasmic linker BepD~~~
MTLNRTIRCFAAGAAFIVFAAQPALAQAPGGATPPPPQVFVVDIKPHDVPVTYEYAARINAYRNVQVRARVGGILLHRNF
VEGTQVKAGEVLFEIDPAPYQAELEKAQAQVAQAEAQYQQSIRDAERAEQLVQQKVQSAAVRDSAFATRDLNKAAVAAAK
AQLRTAELNLSYTKVTAPISGITSQEQVNEGSLIGTDASSSLLTSVTQLDPVYVNFSFTDTEAAEIAKLRAERGATGEDA
DRLKIKILFGDGKAYDHEGTIDFTSSSLDTETGTLGVRAVVENPNHRLIPGQFVRAEILDIQVKDAITVPKAALMQSAQG
QFVYVVNKDNVVEVRPVTGARELKNDWLISQGLNSGDRVITEGVIKAVPGRPVQPVVQGVDDKAQAEAGKEQAADKK
>Q8G2M6 ~~~bepE~~~Efflux pump membrane transporter BepE~~~
MNRFFVDRPVFAAVISIVLVLAGLICIRILPVAQYPELTPPQVVVSATYPGASAETVAQTVAAPLEQQINGVENMLYMQS
SSLGSGTMQLTVTFALGTDPDQATINVNNRVQRATSSLPQEVQRLGVTVDKRFTTILGMVAMFATTDRYDRTYVGNYALL
NVVDDLKRLPGVGDVQLLGNIDYSMRVWLRPDKLAQYNLTPSDVSAAIQEQNAQFAAGRFGDQPDPHAGPFTYTATTQGR
LPDAAAFENIILRSSSQNAATLRLKDVARVELGTESYLVDSNLNGTPAVPIAIYLQPGANALNTMELIQNRMNELKASFP
AGIDYAIPFDTTKFIKVSIEEVVHTFIEAIILVVLVVFIFLQNWRATLIPVIAVPISIIGTFAGMYVLGFSINLLTLFGL
VLAIGIVVDDAIVVLENVERIMTTEKLSPRKAAIKAMGEVTGPVIAIVLVLCAVFIPVAFMGGLVGEMYKQFAVTIAISV
TLSGLVALTLTPALCALILKPGHHEPILPFRIFNRAFERVTSGYTRGVRFFLKRATIGLIIFAGLLGSTYYLFERVPGSL
LPDEDQGFLFGVAVLPPAASLERTTVVLDQVSENIRKNPAVDNVFAVSGFDLLSGGLKTSAGTMFIMLKDWKERTTPDAD
ARNLPRTIMGMNAGIKDGMVLAFNPPPIMGLSTTGGFELYVQDRTGGGVESLTQATKLITEAAAKRPELQGVRTTFDPNV
PQYDIQLDREKAKAMGVPINSVFTAMQATFGSLYVNDFTLYGRNYQVNLQSEAEFRRDPGDLKHVFVRADSGSMIPLDAL
VTVKRIVGPDQLERFNAFNAAKVTGNPAPGYTSGDAIKAMQEVAAQVLPQGYQIAWTGSAYQEVSTSGTGSQAMIFGLIM
VFLILAAQYERWSLPLAVITAVPFAIFGALLATDLRGLTNDVYFQIGLVTLIGLAAKNAILIVEFAVLERESGKSAIEAA
ASAARLRFRPIVMTSLAFILGVVPLAVSTGAGSASRHSIGTGVIGGMLAATFIATFFIPMFYSLIARKPPKKHEDETADT
PETPGGTGGGI
>Q8FWV8 ~~~bepF~~~Efflux pump periplasmic linker BepF~~~
MVAFWTCRNAWFQHLPFAKRGDENAPSGPRRLRPWFLVLALGLAACSEDKSAPQQAAPLPPIPVGVIKITERPTHPQLSF
VGRVEATDSVDLIARVDGFLDKRTFTEGQAVKTGDLLFVLQKDALQAALDAAQANLAKAQADADNLKLQTERARSLYKQK
TVSQAMLDDRVAAEKQALAVVQQAQASLEQAQINLGYTDIRAPFSGRIGMANFSVGALVGPSSGPLATIVSQDPIYVTFP
VSDKTILDLTEGGRTATDRSNVAVSLTLSNGMTYPQTGAIDFTGIKINPNTDTLMVRAQFPNPNNVLIDGQYVQVTAASK
HPVEALLVPQKAIMTDQSGNYVLAVGEDNKVIQRQITQGSTFGSNVVVKSGLAVGDQVVVDGLQRIRPGQKVDPQIVDAT
TPAQKAMSVGN
>Q8FWV9 ~~~bepG~~~Efflux pump membrane transporter BepG~~~
MLSSVFINRPRLAIVIAIVITLAGLIAVTRIPVAQFPDIVPPQVSVTATYPGASAETVEAAIAQPIEAQVNGVDDMIYMS
STSGNNGTYTLTVTFKVGSDPNLNTVNVQNRVRLAEANLPQEVTRLGVTVKKQSSSFLQIITLLSPDSRYDELFLNNYGV
INVVDRLARVPGVGQAQSFGTFNYSMRIWFNTDALTSLNLTPNDIVNAISSQNVQAAVGRLGAPPMTDQQQIQLTLTTQG
RLTDAKQFENIIIRANPDGSSVRLKDVARVELAAQSYDTIGRLNGKPASVIAVYQAPGSNAVAAAEGVRNVMEQLKQSFP
AGLDYKITYDTTVFVSSTIHEVIKTLLEAFVLVVVVVFIFLGNFRATLIPTLAVPVSLIGTFAVLLVLGFSANTISLFAM
ILAIGIVVDDAIVVVENVERVMAETGLPPKEAAKQAMQEITAPIIAITLVLLSVFVPVAFIPGITGALYAQFALTVSVAM
LISAINALTLSPALCGVFLKPHQGRKKSLYGRTMDKLSSGIEKISDGYAHIVRRLVRMAFLSIVLVAGLGAGAYFLNTIV
PTGFLPEEDQGLFFVQVNLPPAASQSRTAAVVSEIEADITKMAGVADVTSVTGFSFIDGLAVSNAGLMIVTLKPLEERLK
DNITVFDVIAEVNRRTAAIPSAVAITMNLPPILGLGSSGGFQYQLEDQEGQSPQQLASVAQGLVMAANQNPKLSRVFTTF
ATDTPQLNLNIDRQKALSLGVSPNNIIQALQSTLGGYFVNNFNTLGRTWQVIIQGEQQDRKTVEDIYRINVRSSHGDMVP
LRSLVSVEERLGPLYITRYNNYRSASIQGNAAPGVSSGEALAAMAQVSKTTLPSGYGYEWTGTALQELQAAGQTSMILAL
AVLFAYLFLVALYESWTIPVGVLLSVTAGLAGAMLALWITGLSNDIYAQIGIVVLIALASKNGILIVEFAKERREEGVPL
EQAAIIGARQRFRPVMMTSFAFILGLVPLVIAVGAAAASRRAVGTSVFGGMIAASAVGIFLIPMLYVVLERVREWGHARI
LRKPLYEEEKQEKADGDASGPTVPPTQPEDRGLS
>Q8G2M8 ~~~bepR~~~HTH-type transcriptional repressor BepR~~~
MRRTKAEAAETREAILLAAEQVFLERGVNQSTLTEIACYAGVTRGAIYFHFEDKLDIFQSIIGRARFPQEEIMLQAARFD
HPNPLHILEQSIVAALELFATDERQQVVFTIINQRCEYVGEMAPVIDRLKEMRSDVLALFIGLLKVAERRGELASEWSAE
TAAQILLAMVGGFLNEWLHGEKGFDLIIHGSRVISTVIQSLRAPANIPQ
>O32102 3.1.-.-~~~besA~~~Ferri-bacillibactin esterase BesA~~~COG2819
MKEQTTDRTNGGTSNAFTIPGTEVRMMSSRNENRTYHIFISKPSTPPPPAGYPVIYLLDANSVFGTMTEAVRIQGRRPEK
TGVIPAVIVGIGYETAEPFSSARHRDFTMPTAQSKLPERPDGREWPEHGGAEGFFRFIEEDLKPEIERDYQIDKKRQTIF
GHSLGGLFVLQVLLTKPDAFQTYIAGSPSIHWNKPFILKKTDHFVSLTKKNNQPINILLAAGELEQHHKSRMNDNARELY
ERLAVLSEQGIRAEFCEFSGEGHISVLPVLVSRALRFALHPDGPHLSMG
>G8XHD8 6.3.2.-~~~besA~~~L-propargylglycine--L-glutamate ligase~~~
MYGAGVVTRMNEAERFHMTDPGSTKILIYAFNYADRMLEEVPYLRYSAERSLCFLGDLRDPGTRLVVITSEAVDPATLDY
HLRDVFRFDEPALADVRRRLTLLTPASRAARPLDSLVLEDEALVETLRRAVAERPAGTIVDFSASPATDELGRRTGATPE
EGDHAFVARWGSKSGGKEICLRAGVAVPGGTSEVLRSEAEVVEAIHRLSCGTAAARRAMVKLDAITWAASIGNVLIDRDK
LRHTGDLVGSAEVIRLPAEEFRRELAEQGAIVEEFLEEITDSPSGLGHIERDGTVRVVACHDQVLSGGQYWGCRFPADER
WRPEITDAVRRTGEVLSGLGHRGAFGVDFVVAGERGLLAVEINLRKVGPSHVVRYAEALVGARVGADGMLRGADGRPVYY
THGRLLEPETLGKLNPRTAVERLRAEGLLYRHDTGEGVALHVLGALNACGFVELTALARSPEAADGYSRAAQALLTGPYP
SA
>G8XHD7 4.5.1.-~~~besB~~~L-2-amino-4-chloropent-4-enoate dechlorinase/desaturase~~~
MSAGSPAEPRPVPGSVHSVSVSIPDVRSVIGFESGDPATLRRIAWGYPRFRTHPYVARVAALVAGAVGGRAQDLVLTRSV
RAAEAAAAYAGLAPGAVFEASGVRGVRVAEDDPALAAVRGHVQHTGAHLTSREAEDVLLEAGLIEARQAEEAVSEEPAEA
VRSALATAYGAGDPADVSLHNSGMNAVAAAVAAVTDLQRPAGRRRWIQLGWIFFDTMSLLDKRLFGTDHVTVPDPFDLAA
LSRVVAAHPGQLAGIIAELPSNPSLRCPDVPALREIADRAGCALVLDATIATPHNVDVLGYADVVCESLTKYATGSADVL
MGAAVVGSASPWAAQLREGLRRFGDVPYHRDAARVAARIRDYGDRMKRVNAGAVALAGFLERQSAVRAVSWPYDAASQAN
YRKVERLSDAPGGLLMVDLRVPLERVYDRLAVAKGPSFGAEFTMASPQIFIAHFDLLSTPEGRAELRSRGLHRDMLRISV
GVEDPELIAEVFRDAFDGAG
>F8JJ25 1.14.99.-~~~besC~~~4-chloro-allylglycine synthase~~~
MTDLNTPESTSKPVWEHFDHVEPGIRRRIAVADPEIKEYLDGMLARIASHRGVEHPFLNAYRTTALDPEQERHLFSECYY
FFRYLPFYITGMAVKTRDEMILREIILNVADEVGSDPTHSTLFADFLARIGIDKEHLDGYQPLEVTRQLNDGIRHLYTET
SINKALGALYADETMSSIMVSKINDGLRNQGYDDDLRHFWQLHIDVEVGHSNSVFNAIAPYVGSKAARAEFEEGVFEFLG
LVERYWDGVRELVGIGK
>G8XHD5 1.14.20.-~~~besD~~~L-lysine 4-chlorinase~~~
MCAPLEKDDIRRLSQAFHRFGIVTVTELIEPHTRKLVRAEADRLLDQYAERRDLRLATTDYTRRSMSVVPSETIAANSEL
VTGLYAHRELLAPLEAIAGERLHPCPKADEEFLITRQEQRGDTHGWHWGDFSFALIWVLQAPPIDVGGLLQCVPHTTWDK
ASPQINRYLVENPIDTYHFESGDVYFLRTDTTLHRTIPLREDTTRIILNMTWAGERDLSRKLAADDRWWDNAEVSAARAI
KD
>F8JJ27 1.14.11.-~~~besE~~~L-gamma-glutamyl-L-propargylglycine hydroxylase~~~
MSGTTHHHATFPAVEAAAFTRRHLDDLAAGLLGTVRVPGFFGRPALDTMLTSLHRVPVVSFDLDRMHHPMARFGTALNDY
RTPELALDADRYWHDADTARRQWAGIGMTPDPLELALDALGRAWGVRPAPATIGGRPAFVGMLREVNDGTFIHYDDINRE
YRGGLFDQKIVAQLAFNAWLAAPREGGTTTVWRHRWEPADENRRHGYGFQPTAVADDPYVTVAPAAGDALLFNANNYHVV
HPGAPGQRRIALACFLGVTAGGELVVWS
>P17444 1.1.99.1~~~betA~~~Oxygen-dependent choline dehydrogenase~~~COG2303
MQFDYIIIGAGSAGNVLATRLTEDPNTSVLLLEAGGPDYRFDFRTQMPAALAFPLQGKRYNWAYETEPEPFMNNRRMECG
RGKGLGGSSLINGMCYIRGNALDLDNWAQEPGLENWSYLDCLPYYRKAETRDMGENDYHGGDGPVSVTTSKPGVNPLFEA
MIEAGVQAGYPRTDDLNGYQQEGFGPMDRTVTPQGRRASTARGYLDQAKSRPNLTIRTHAMTDHIIFDGKRAVGVEWLEG
DSTIPTRATANKEVLLCAGAIASPQILQRSGVGNAELLAEFDIPLVHELPGVGENLQDHLEMYLQYECKEPVSLYPALQW
WNQPKIGAEWLFGGTGVGASNHFEAGGFIRSREEFAWPNIQYHFLPVAINYNGSNAVKEHGFQCHVGSMRSPSRGHVRIK
SRDPHQHPAILFNYMSHEQDWQEFRDAIRITREIMHQPALDQYRGREISPGVECQTDEQLDEFVRNHAETAFHPCGTCKM
GYDEMSVVDGEGRVHGLEGLRVVDASIMPQIITGNLNATTIMIGEKIADMIRGQEALPRSTAGYFVANGMPVRAKK
>P60337 1.1.99.1~~~betA~~~Oxygen-dependent choline dehydrogenase~~~
MSNKNKSYDYVIIGGGSAGSVLGNRLSEDKDKEVLVLEAGRSDYFWDLFIQMPAALMFPSGNKFYDWIYSTDEEPHMGGR
KVAHARGKVLGGSSSINGMIYQRGNPMDYEGWAEPEGMETWDFAHCLPYFKKLEKTYGAAPYDKFRGHDGPIKLKRGPAT
NPLFQSFFDAGVEAGYHKTPDVNGFRQEGFGPFDSQVHRGRRMSASRAYLHPAMKRKNLTVETRAFVTEIHYEGRRATGV
TYKKNGKLHTIDANEVILSGGAFNTPQLLQLSGIGDSEFLKSKGIEPRVHLPGVGENFEDHLEVYIQHKCKEPVSLQPSL
DIKRMPFIGLQWIFTRTGAAASNHFEGGGFVRSNNEVDYPNLMFHFLPIAVRYDGQKAAVAHGYQVHVGPMYSNSRGSLK
IKSKDPFEKPSIRFNYLSTEEDKKEWVEAIRVARNILSQKAMDPFNGGEISPGPEVQTDEEILDWVRRDGETALHPSCSA
KMGPASDPMAVVDPLTMKVHGMENLRVVDASAMPRTTNGNIHAPVLMLAEKAADIIRGRKPLEPQYIDYYKHGVHDENEG
AIEVKPYAK
>Q8UH56 1.2.1.8~~~betB~~~Betaine aldehyde dehydrogenase~~~COG1012
MTIATPLKAQPKASHFIDGDYVEDNTGTPFESIFPATGEMIAKLHAATPAIVERAIASAKRAQKEWAAMSPMARGRILKR
AADIMRERNDALSTLETLDTGKPIQETIVADPTSGADAFEFFGGIAPSALNGDYIPLGGDFAYTKRVPLGVCVGIGAWNY
PQQIACWKAAPALVAGNAMVFKPSENTPLGALKIAEILIEAGLPKGLFNVIQGDRDTGPLLVNHPDVAKVSLTGSVPTGR
KVAAAAAGHLKHVTMELGGKSPMIVFDDADIESAVGGAMLGNFYSSGQVCSNGTRVFVQKKAKARFLENLKRRTEAMILG
DPLDYATHLGPLVSKAQQEKVLSYIEKGKAEGATLITGGGIPNNVAGEGAYVQPTVFADVTDDMTIAREEIFGPVMCVLD
FDDEDEVLARANATEFGLAGGVFTADLARAHRVVDGLEAGTLWINTYNLCPVEIPFGGSKQSGFGRENSAAALEHYSELK
TVYVSTGKVDAPY
>Q3JLL8 1.2.1.8~~~betB~~~Betaine aldehyde dehydrogenase~~~
MSVYGLQRLYIAGAHADATSGKTFDTFDPATGELLARVQQASADDVDRAVASAREGQREWAAMTAMQRSRILRRAVELLR
ERNDALAELEMRDTGKPIAETRAVDIVTGADVIEYYAGLATAIEGLQVPLRPESFVYTRREPLGVCAGIGAWNYPIQIAC
WKSAPALAAGNAMIFKPSEVTPLSALKLAEIYTEAGVPAGVFNVVQGDGSVGALLSAHPGIAKVSFTGGVETGKKVMSLA
GASSLKEVTMELGGKSPLIVFDDADLDRAADIAVTANFFSAGQVCTNGTRVFVQQAVKDAFVERVLARVARIRVGKPSDS
DTNFGPLASAAQLDKVLGYIDSGKAEGAKLLAGGARLVNDHFASGQYVAPTVFGDCRDDMRIVREEIFGPVMSILSFETE
DEAIARANATDYGLAAGVVTENLSRAHRAIHRLEAGICWINTWGESPAEMPVGGYKQSGVGRENGITTLEHYTRIKSVQV
ELGRYQPVF
>P17445 1.2.1.8~~~betB~~~Betaine aldehyde dehydrogenase~~~COG1012
MSRMAEQQLYIHGGYTSATSGRTFETINPANGNVLATVQAAGREDVDRAVKSAQQGQKIWASMTAMERSRILRRAVDILR
ERNDELAKLETLDTGKAYSETSTVDIVTGADVLEYYAGLIPALEGSQIPLRETSFVYTRREPLGVVAGIGAWNYPIQIAL
WKSAPALAAGNAMIFKPSEVTPLTALKLAEIYSEAGLPDGVFNVLPGVGAETGQYLTEHPGIAKVSFTGGVASGKKVMAN
SAASSLKEVTMELGGKSPLIVFDDADLDLAADIAMMANFFSSGQVCTNGTRVFVPAKCKAAFEQKILARVERIRAGDVFD
PQTNFGPLVSFPHRDNVLRYIAKGKEEGARVLCGGDVLKGDGFDNGAWVAPTVFTDCSDDMTIVREEIFGPVMSILTYES
EDEVIRRANDTDYGLAAGIVTADLNRAHRVIHQLEAGICWINTWGESPAEMPVGGYKHSGIGRENGVMTLQSYTQVKSIQ
VEMAKFQSIF
>Q9HTJ1 1.2.1.8~~~betB~~~NAD/NADP-dependent betaine aldehyde dehydrogenase~~~
MARFEEQKLYIGGRYVEASSGATFETINPANGEVLAKVQRASREDVERAVQSAVEGQKVWAAMTAMQRSRILRRAVDILR
ERNDELAALETLDTGKPLAETRSVDIVTGADVLEYYAGLVPAIEGEQIPLRETSFVYTRREPLGVVAGIGAWNYPVQIAL
WKSAPALAAGNAMIFKPSEVTPLTALKLAEIYTEAGVPDGVFNVLTGSGREVGQWLTEHPLIEKISFTGGTSTGKKVMAS
ASSSSLKEVTMELGGKSPLIIFPDADLDRAADIAVMANFFSSGQVCTNGTRVFIHRSQQARFEAKVLERVQRIRLGDPQD
ENTNFGPLVSFPHMESVLGYIESGKAQKARLLCGGERVTDGAFGKGAYVAPTVFTDCRDDMTIVREEIFGPVMSILVYDD
EDEAIRRANDTEYGLAAGVVTQDLARAHRAIHRLEAGICWINTWGESPAEMPVGGYKQSGVGRENGLTTLAHYTRIKSVQ
VELGDYASVF
>O69787 3.1.6.6~~~betC~~~Choline-sulfatase~~~COG3119
MTTGKPNILIIMVDQLNGKLFPDGPADFLHAPNLKALAKRSARFHNNYTSSPLCAPARASFMAGQLPSRTRVYDNAAEYQ
SSIPTYAHHLRRAGYYTALSGKMHFVGPDQLHGFEERLTTDIYPADFGWTPDYRKPGERIDWWYHNLGSVTGAGVAEITN
QMEYDDEVAFLANQKLYQLSRENDDESRRPWCLTVSFTHPHDPYVARRKFWDLYEDCEHLTPEVGAIPLDEQDPHSQRIM
LSCDYQNFDVTEENVRRSRRAYFANISYLDEKVGELIDTLTRTRMLDDTLILFCSDHGDMLGERGLWFKMNFFEGSARVP
LMIAGPGIAPGLHLTPTSNLDVTPTLADLAGISLEEVRPWTDGVSLVPMVNGVERTEPVLMEYAAEASYAPLVAIREGKW
KYVYCALDPEQLFDLEADPLELTNLAENPRGPVDQATLTAFRDMRAAHWDMEAFDAAVRESQARRWVVYEALRNGAYYPW
DHQPLQKASERYMRNHMNLDTLEESKRYPRGE
>P17446 ~~~betI~~~HTH-type transcriptional regulator BetI~~~COG1309
MPKLGMQSIRRRQLIDATLEAINEVGMHDATIAQIARRAGVSTGIISHYFRDKNGLLEATMRDITSQLRDAVLNRLHALP
QGSAEQRLQAIVGGNFDETQVSSAAMKAWLAFWASSMHQPMLYRLQQVSSRRLLSNLVSEFRRELPREQAQEAGYGLAAL
IDGLWLRAALSGKPLDKTRANSLTRHFITQHLPTD
>Q9X4A5 ~~~betL~~~Glycine betaine transporter BetL~~~COG1292
MKKLTNVFWGSGFLVLLAVLFGAFLPEQFETFTNHIQKFLTSNFGWYYLIVVAIIIIFCLFLVLSPIGSIRLGKPGEEPG
YSNKSWFAMLFSAGMGIGLVFWGAAEPLSHYAVQAPGGEVGTQAAMKDALRYSFFHWGISAWSIYAIVALALAYFKFRKN
APGLISATLYPILGKHAKGPIGQLIDIIAVFATVIGVATTLGLGAQQINGGLTYLFGVPNNFTVQFTIIVIVTILFMLSA
MSGLDKGIQLLSNVNIYVAGVLLVLTLILGPTLFIMNNFTNSFGDYLQNIIQMSFQTAPDAPDARKWIDSWTIFYWAWWL
SWSPFVGIFIARISRGRTIRQFLLGVIVLPALVSVFWFAVFGGSAIFVEQHGNSGLSSLATEQVLFGVFNEFPGGMMLSI
VAMILIAVFFITSADSATFVLGMQTTGGSLNPPNSVKVTWGLLQAGIASVLLYAGGLTALQNASIIAAFPFSIVIILMIV
SLFVSLTREQEKLGLYVRPKKSQRSQL
>P54582 ~~~betP~~~Glycine betaine transporter BetP~~~COG1292
MTTSDPNPKPIVEDAQPEQITATEELAGLLENPTNLEGKLADAEEEIILEGEDTQASLNWSVIVPALVIVLATVVWGIGF
KDSFTNFASSALSAVVDNLGWAFILFGTVFVFFIVVIAASKFGTIRLGRIDEAPEFRTVSWISMMFAAGMGIGLMFYGTT
EPLTFYRNGVPGHDEHNVGVAMSTTMFHWTLHPWAIYAIVGLAIAYSTFRVGRKQLLSSAFVPLIGEKGAEGWLGKLIDI
LAIIATVFGTACSLGLGALQIGAGLSAANIIEDPSDWTIVGIVSVLTLAFIFSAISGVGKGIQYLSNANMVLAALLAIFV
FVVGPTVSILNLLPGSIGNYLSNFFQMAGRTAMSADGTAGEWLGSWTIFYWAWWISWSPFVGMFLARISRGRSIREFILG
VLLVPAGVSTVWFSIFGGTAIVFEQNGESIWGDGAAEEQLFGLLHALPGGQIMGIIAMILLGTFFITSADSASTVMGTMS
QHGQLEANKWVTAAWGVATAAIGLTLLLSGGDNALSNLQNVTIVAATPFLFVVIGLMFALVKDLSNDVIYLEYREQQRFN
ARLARERRVHNEHRKRELAAKRRRERKASGAGKRR
>G3XCN6 ~~~betS~~~Glycine betaine/proline betaine transporter BetS~~~
MQNRVVSCRFTGSTARRATMPEGIRGRSHILFLVPLSRAESVGRLHQVQRFKVNLPVFVGSVAVIALFVGIGVIAPKRAE
SIFSGMQTAILSGFGWLYLLSVAVFLFSMLFLAFSRYGELKLGPDDSEPEFRYLSWIAMLFAAGMGIGLMYFAVGEPMTH
FASPPEAEPLTIAAQREAMSVTFFHWGVHAWAIYSVVGLSLAYFGYRYNLPLTVRSGLYPLLKEGIHGPIGHVVDIFAIC
GTMFGLATSLGFGILQINSGLNYLLGIPQSIYVQLLLVTVVTAIATISVVTGVEKGVRILSETNLFLAVLLMLFVLVVGP
TGTLMRDFVQNIGLYLDSLVLRTFNIYAYEPRPWIDSWTLFYWAWWISWSPFVGMFIARISRGRTVREFVTAVLFVPAMF
TFLWMTVFGNTAIYVDTTIANGELARDVKADLSVALFQFFEYLPWPAVTSTLAVLLVSIFFVTSSDSGSLVIDTIASGGE
TATPALQRIFWCSLSGIVAAVLLSTGGLTALQSATISTALPFSLVMLILVWSLFVGMRADLARTQSPGSLGPRAYPASGV
PWQRRLAMTLSTPDRRAVEKFLQASVLPALEAVARELTRRSRPASVGRDAETGALTLTVPAEGHRDFVYGVQMSEHKLPA
FTAYDATVADVRYEARTFFSDGSRGYDIMGMADNQIINDVLFQFERYTGFVRSPESSLLATSPEER
>Q6FDF6 ~~~betT1~~~Osmo-independent choline transporter BetT1~~~COG1292
MWSKRDEQKTYPPIRLNPFVFWSSAISISIFGMLFVLFPETSQHGLTWIQQQVNQLFGWYYMLVIILSLGFVAWLAFSQV
GNIPLGKAQDKPEFGYLVWTSMLFSAGIGIALLYYGVAEPVDHFLRPPEGQGGTVEAAQNAMMYSFLHWGIHGWVLYALV
GVTLGYFAFRRDLPLALRSALYPIFGERIHGLVGHMVDGFGILATIISLVTNLGIGALVMISGISYLFPDLPNTSSTLVV
TVIMMMLVATLTTVIGIEKGLAWLSRINLRLLYLLLLFVFLTGPTNHLLNGLVQNTGDYLSHFVQKSFDLYLYDKNATGW
LASWTIFYWAWWIAWAPFVGMFIARISKGRTIREVVLGVCLIPLGFTLAWISIFGNTAIDLILNHGQQIIGSLVIQDPAL
SLFKLLEYLPFHPYVAGIVVVICFVLFLTPVGSGTLMIANLSSQGGSSDSDSPIWLRVFWSIAITIVSIGLLLAGSFSAM
QSAVVLCGLPFSVILLLYMFGLAKALKQETQQPVVESHTTETSGSD
>Q6FDF5 ~~~betT2~~~Osmo-dependent choline transporter BetT2~~~COG1292
MATDNPRAVDDQETHPKDRLNRVVFYVSALIILIFSLTTILFNDFANRALNQVLDWVSSTFSWYYLLAATLYMVFVIFIA
CSRYGNIKLGPKHSKPEFSLLSWSAMLFSAGIGIDLMFFSVAEPLSHYMHPPVGEGQTYEAARQGMVWTLFHYGLTGWCM
YALIGMALGYFSYRYNLPLTIRSALYPIFGKKINGPIGHSVDTAAVIGTIFGIATTCGIGVVQLNYGLHVLFDLPENLWV
QTALILVAVIITIISVTSGVNKGLRILSEVNIYVSVGLMLFILFLGNTEFLLNALVQNVGDYLSRFPSLALESFAFDQPK
EWMNSWTLFFWAWWVAWSPFVGLFLARISRGRTIREFVSGTLIIPLLFTLTWLSIFGNSALHNVIFDGNIALAETVLSNP
AHGFYDLLAQYPWFPFIAGVATITGLLFYVTSADSGALVLGNFTTQFTNIDHDAPRWLSVFWAVAIGLLTLAMLMTNGIT
ALQNATIIMGLPFSFVMFLVMAGLYKSLRLEDYRQASASLNAAPVVGNVDILNWKKRLTRVMHHPGTFETKRMLNEICRP
AVHAVAEELQKRAVQVDVLEVPLEEDEELYHLDITIHLEEEQNFIYQIWPVRYIAPNFSERGKRGKQFYYRLETYLYEGS
QGNDLVGYTKEQVINDILDRYERHMTFLHINRISPGNRPLFPDPKA
>P0ABC9 ~~~betT~~~High-affinity choline transport protein~~~COG1292
MTDLSHSREKDKINPVVFYTSAGLILLFSLTTILFRDFSALWIGRTLDWVSKTFGWYYLLAATLYIVFVVCIACSRFGSV
KLGPEQSKPEFSLLSWAAMLFAAGIGIDLMFFSVAEPVTQYMQPPEGAGQTIEAARQAMVWTLFHYGLTGWSMYALMGMA
LGYFSYRYNLPLTIRSALYPIFGKRINGPIGHSVDIAAVIGTIFGIATTLGIGVVQLNYGLSVLFDIPDSMAAKAALIAL
SVIIATISVTSGVDKGIRVLSELNVALALGLILFVLFMGDTSFLLNALVLNVGDYVNRFMGMTLNSFAFDRPVEWMNNWT
LFFWAWWVAWSPFVGLFLARISRGRTIRQFVLGTLIIPFTFTLLWLSVFGNSALYEIIHGGAAFAEEAMVHPERGFYSLL
AQYPAFTFSASVATITGLLFYVTSADSGALVLGNFTSQLKDINSDAPGWLRVFWSVAIGLLTLGMLMTNGISALQNTTVI
MGLPFSFVIFFVMAGLYKSLKVEDYRRESANRDTAPRPLGLQDRLSWKKRLSRLMNYPGTRYTKQMMETVCYPAMEEVAQ
ELRLRGAYVELKSLPPEEGQQLGHLDLLVHMGEEQNFVYQIWPQQYSVPGFTYRARSGKSTYYRLETFLLEGSQGNDLMD
YSKEQVITDILDQYERHLNFIHLHREAPGHSVMFPDA
>P9WQ27 2.4.1.18~~~~~~Probable 1,4-alpha-glucan branching enzyme Rv3031~~~COG1543
MNTSASPVPGLFTLVLHTHLPWLAHHGRWPVGEEWLYQSWAAAYLPLLQVLAALADENRHRLITLGMTPVVNAQLDDPYC
LNGVHHWLANWQLRAEEAASVRYARQSKSADYPSCTPEALRAFGIRECADAARALDNFATRWRHGGSPLLRGLIDAGTVE
LLGGPLAHPFQPLLAPRLREFALREGLADAQLRLAHRPKGIWAPECAYAPGMEVDYATAGVSHFMVDGPSLHGDTALGRP
VGKTDVVAFGRDLQVSYRVWSPKSGYPGHAAYRDFHTYDHLTGLKPARVTGRNVPSEQKAPYDPERADRAVDVHVADFVD
VVRNRLLSESERIGRPAHVIAAFDTELFGHWWYEGPTWLQRVLRALPAAGVRVGTLSDAIADGFVGDPVELPPSSWGSGK
DWQVWSGAKVADLVQLNSEVVDTALTTIDKALAQTASLDGPLPRDHVADQILRETLLTVSSDWPFMVSKDSAADYARYRA
HLHAHATREIAGALAAGRRDTARRLAEGWNRADGLFGALDARRLPK
>Q5SH28 2.4.1.18~~~~~~1,4-alpha-glucan branching enzyme TTHA1902~~~COG1543
MARFALVLHAHLPYVRAHGMWPFGEETLYEAMAETYLPLIRVLERLRAEGVEAPFTLGITPILAEQLADARIKEGFWAYA
KDRLERAQGDYQRYRGTALEASARHQVAFWELTLDHFQRLSGDLVAAFRKAEEGGQVELITSNATHGYSPLLGYDEALWA
QIKTGVSTYRRHFAKDPTGFWLPEMAYRPKGPWKPPVEGPPEGVRPGVDELLMRAGIRYTFVDAHLVQGGEPLSPYGEAA
LGPVESQEATYHVHELESGLRVLARNPETTLQVWSADYGYPGEGLYREFHRKDPLSGLHHWRVTHRKADLAEKAPYDPEA
AFAKTEEHARHFVGLLERLAGRHPEGVILSPYDAELFGHWWYEGVAWLEAVLRLLAQNPKVRPVTAREAVQGPAVRTALP
EGSWGRGGDHRVWLNEKTLDYWEKVYRAEGAMREAARRGVLPEGVLRQAMRELLLLEASDWPFLMETGQAEAYARERYEE
HARAFFHLLKGASPEELRALEERDNPFPEADPRLYLFREA
>H8WR05 2.6.1.-~~~~~~Beta-phenylalanine transaminase~~~
MTHAAIDQALADAYRRFTDANPASQRQFEAQARYMPGANSRSVLFYAPFPLTIARGEGAALWDADGHRYADFIAEYTAGV
YGHSAPEIRDAVIEAMQGGINLTGHNLLEGRLARLICERFPQIEQLRFTNSGTEANLMALTAALHFTGRRKIVVFSGGYH
GGVLGFGARPSPTTVPFDFLVLPYNDAQTARAQIERHGPEIAVVLVEPMQGASGCIPGQPDFLQALRESATQVGALLVFD
EVMTSRLAPHGLANKLGIRSDLTTLGKYIGGGMSFGAFGGRADVMALFDPRTGPLAHSGTFNNNVMTMAAGYAGLTKLFT
PEAAGALAERGEALRARLNALCANEGVAMQFTGIGSLMNAHFVQGDVRSSEDLAAVDGRLRQLLFFHLLNEDIYSSPRGF
VVLSLPLTDADIDRYVAAIGSFIGGHGALLPRAN
>P0AE56 ~~~bfd~~~Bacterioferritin-associated ferredoxin~~~COG2906
MYVCLCNGISDKKIRQAVRQFSPHSFQQLKKFIPVGNQCGKCVRAAREVMEDELMQLPEFKESA
>P58997 ~~~bfpA~~~Major structural subunit of bundle-forming pilus~~~
MVSKIMNKKYEKGLSLIESAMVLALAATVTAGVMFYYQSASDSNKSQNAISEVMSATSAINGLYIGQTSYSGLDSTILLN
TSAIPDNYKDTTNKKITNPFGGELNVGPANNNTAFGYYLTLTRLDKAACVSLATLNLGTSAKGYGVNISSENNITSFGNS
ADQAAKSTAITPAEAATACKNTDSTNKVTYFMK
>P33553 ~~~bfpA~~~Major structural subunit of bundle-forming pilus~~~
MVSKIMNKKYEKGLSLIESAMVLALAATVTAGVMFYYQSASDSNKSQNAISEVMSATSAINGLYIGQTSYSGLDSTILLN
TSAIPDNYKDTTNKKITNPFGGELNVGPANNNTAFGYYLTLTRLDKAACVSLATLNLGTSAKGYGVNISGENNITSFGNS
ADQAAKSTAITPAEAATACKNTDSTNKVTYFMK
>Q9S142 ~~~bfpB~~~Outer membrane lipoprotein BfpB~~~
MKLGRYSLFLLCPLLASCSGNGFYKDNLGVIDKNILHADTSLLKSKNKEHYKSSDMVSKTDSIYIGNSSFQTYHGEPLPG
KLEGVHGIILRSSTPLGFDEVLSMIQDSSGIPIVKHTTKDVISGGVSSKSLAATVAEKMNSATGGKSTDQFDHLLLEVSS
EHQLMDVNYQGALSTFLDKVAANYNLYWTYESGRIAFSNEETKRFSISILPGGKYTSKNSISSDSNSSSGSSGSSGSSSS
DSGAELKFDSDVDFWKDIENSIKLILGSDGSYSISTSTSSVIVRTSSANMKKINEYINTLNAQLERQVTIDVAIYNVTTT
DSSDLAMSLEALLKHNGGVLGSVSTSNFAATSGTPSFTGYLNGNGDSSNQVLLNLLAEKGKVSVVTSASVTTMSGQPVPL
KVGNDRTYVSEIGTVLSQSSTSTTASTSTVTSGFLMNLLPQVADDGNILLQYGVTLSELVGSNNGFDQATVNGTVIQLPN
VDSTTFVQSSMLRNGNTLVLAGYEKKRNESVDQGVGTTSFKLLGGALNGSASRTVTVICITPRIIDLKASGE
>P80893 ~~~Y1-BFP~~~Blue fluorescence protein~~~
MFKGNVQGVGTVENIDKGAKFQSLHGVSLLPIDADLQSHDIIFPEDILEGVTSGELIAINGVRLTVVHTDKSIVRFDIND
ALELTTLGQLKVGDKVNIEKSFKFGDMTGGRSLSGIVTGVADIVEFIEKENNRQIWIEAPEHLTEFLVEKKYIGVDGVYL
VIDAIENNRFCINLLLETDMRWYKKGSKVNIEIPDIAGNW
>O33833 3.2.1.26~~~bfrA~~~Beta-fructosidase~~~COG1621
MFKPNYHFFPITGWMNDPNGLIFWKGKYHMFYQYNPRKPEWGNICWGHAVSDDLVHWRHLPVALYPDDETHGVFSGSAVE
KDGKMFLVYTYYRDPTHNKGEKETQCVAMSENGLDFVKYDGNPVISKPPEEGTHAFRDPKVNRSNGEWRMVLGSGKDEKI
GRVLLYTSDDLFHWKYEGVIFEDETTKEIECPDLVRIGEKDILIYSITSTNSVLFSMGELKEGKLNVEKRGLLDHGTDFY
AAQTFFGTDRVVVIGWLQSWLRTGLYPTKREGWNGVMSLPRELYVENNELKVKPVDELLALRKRKVFETAKSGTFLLDVK
ENSYEIVCEFSGEIELRMGNESEEVVITKSRDELIVDTTRSGVSGGEVRKSTVEDEATNRIRAFLDSCSVEFFFNDSIAF
SFRIHPENVYNILSVKSNQVKLEVFELENIWL
>A0R647 1.16.3.1~~~bfrB~~~Ferritin BfrB~~~COG1528
MTNSGALDTKFHALIQDQIRSEFTASQQYIAIAVFFDGADLPQLAKHFYAQALEERNHAMMLVQYLLDRDVEVEIPGIDP
VCNNFTTPRDALALALDQERTVTEQISRLASVARDEGDHLGEQFMQWFLKEQVEEVAAMTTLVRIADRAGSNLFHIEDFV
AREMSAAGADPTAPRAAGGAL
>H8F1Z2 1.16.3.1~~~bfrB~~~Ferritin BfrB~~~
MTEYEGPKTKFHALMQEQIHNEFTAAQQYVAIAVYFDSEDLPQLAKHFYSQAVEERNHAMMLVQHLLDRDLRVEIPGVDT
VRNQFDRPREALALALDQERTVTDQVGRLTAVARDEGDFLGEQFMQWFLQEQIEEVALMATLVRVADRAGANLFELENFV
AREVDVAPAASGAPHAAGGRL
>P9WNE5 1.16.3.1~~~bfrB~~~Bacterioferritin BfrB~~~COG1528
MTEYEGPKTKFHALMQEQIHNEFTAAQQYVAIAVYFDSEDLPQLAKHFYSQAVEERNHAMMLVQHLLDRDLRVEIPGVDT
VRNQFDRPREALALALDQERTVTDQVGRLTAVARDEGDFLGEQFMQWFLQEQIEEVALMATLVRVADRAGANLFELENFV
AREVDVAPAASGAPHAAGGRL
>P81549 ~~~bfrD~~~Probable TonB-dependent receptor BfrD~~~COG4774
MKFYSSHPMPESLAAAIAVPLLGLLPAAQAASTAVQLPSVTVEGEYSSYQPESAQSPKFTAPLADTPRTVQVIPERLIQD
QGASDLEAVLRNAPGISMTAGEGGRPASDLPFIRGQNSASSLFVDGLRDPSTQSRDTFNLEQVDVVKGPDSVFSGRGGAG
GSINLVTKTPRNQDFTEVQAGIGTAETYRGTIDGNWVLGENTALRLNLLGTRDTVPGRDKAVEFSRVGIAPSLRLGLSGP
TRVTLGLYHYRHRRVPDYSIPYDPRTGTPITETIGVSRRNFYGLVRRDSGDTEDYAATVKWEHDLANGFKVENLARYSRA
TVEQITTMPELKTADLAKGLVYRNLRASYQVNDSFANRTDLRGTFDTGQWRHTFDLGGEFATSRRSRDRYKQEIPDAASP
CSPVTDGNNPALCASLRDPDPHVDFPGTVRRNHNPARYHTDILSLYGFDTIAFDEQWQLNLGLRWDHYKTSGRNLPVRGA
KPPVYERAARTDNLFNYQLGLVYKPRPDGSVYASYGTASTPSAVSDYAPADSISGTSQQLKPERSEAIEIGTKWQVLDRR
LLVTGAMFRETRKNTSIEVAEGLRAPAGKSRVTGMELGVAGSLTPRWDVYGGYALLDSKLVRASHKSGAQGQPLPSAPRH
AFSIWSTYKLLPELTVGAGAFYRSKVYGNADAGYNKDGTPKARWVPAYWRFDAMAAYQLNKHLTAQLNVYNLLDKTYYAK
TYRSHYAALGPGRSAMLTFKLSY
>P22759 1.16.3.1~~~bfr~~~Bacterioferritin~~~
MKGDKIVIQHLNKILGNELIAINQYFLHARMYEDWGLEKLGKHEYHESIDEMKHADKLIKRILFLEGLPNLQELGKLLIG
EHTKEMLECDLKLEQAGLPDLKAAIAYCESVGDYASRELLEDILESEEDHIDWLETQLDLIDKIGLENYLQSQMDE
>Q93PP9 1.16.3.1~~~bfr~~~Bacterioferritin~~~COG2193
MAGNREDRKAKVIEVLNKARAMELHAIHQYMNQHYSLDDMDYGELAANMKLIAIDEMRHAENFAERIKELGGEPTTQKEG
KVVTGQAVPVIYESDADQEDATIEAYSQFLKVCKEQGDIVTARLFERIIEEEQAHLTYYENIGSHIKNLGDTYLAKIAGT
PSSTGTASKGFVTATPAAE
>P0ABD3 1.16.3.1~~~bfr~~~Bacterioferritin~~~COG2193
MKGDTKVINYLNKLLGNELVAINQYFLHARMFKNWGLKRLNDVEYHESIDEMKHADRYIERILFLEGLPNLQDLGKLNIG
EDVEEMLRSDLALELDGAKNLREAIGYADSVHDYVSRDMMIEILRDEEGHIDWLETELDLIQKMGLQNYLQAQIREEG
>P43315 1.16.3.1~~~bfr~~~Bacterioferritin~~~COG2193
MQGDPDVLRLLNEQLTSELTAINQYFLHSKMQENWGFTELAERTRVESFDEMRHAEAITDRILLLDGLPNYQRIGSLRVG
QTLREQFEADLAIEYEVMSRLKPGIIMCREKQDSTSAVLLEKIVADEEEHIDYLETQLALMGQLGEELYSAQCVSRPPS
>P45430 1.16.3.1~~~bfr~~~Bacterioferritin~~~COG2193
MQGDPEVLRLLNEQLTSELTAINQYFLHSKMQDNWGFTELAEHTRAESFDEMRHAEAITDRILLLDGLPNYQRLFSLRIG
QTLREQFEADLAIEYEVMDRLKPAIILCREKQDSTTATLFEQIVADEEKHIDYLETQLELMDKLGVELYSAQCVSRPPS
>P9WPQ9 1.16.3.1~~~bfr~~~Bacterioferritin BfrA~~~COG2193
MQGDPDVLRLLNEQLTSELTAINQYFLHSKMQDNWGFTELAAHTRAESFDEMRHAEEITDRILLLDGLPNYQRIGSLRIG
QTLREQFEADLAIEYDVLNRLKPGIVMCREKQDTTSAVLLEKIVADEEEHIDYLETQLELMDKLGEELYSAQCVSRPPT
>Q9HWF9 1.16.3.1~~~bfr~~~Bacterioferritin~~~
MQGHPEVIDYLNTLLTGELAARDQYFIHSRMYEDWGFSKLYERLNHEMEEETQHADALLRRILLLEGTPRMRPDDIHPGT
TVPEMLEADLKLERHVRAALAKGIALCEQHKDFVSRDILKAQLADTEEDHAYWLEQQLGLIARMGLENYLQSQI
>Q59738 1.16.3.1~~~bfr~~~Bacterioferritin~~~
MKGDAKVIEFLNAALRSELTAISQYWVHFRLQEDWGLAKMAKKSREESIEEMGHADKIIARILFLEGHPNLQKLDPLRIG
EGPRETLECDLAGEHDALKLYREARDYCAEVGDIVSKNIFESLITDEEGHVDFLETQISLYDRLGPQGFALLNAAPMDAA
E
>Q9S2N0 1.16.3.1~~~bfr~~~Bacterioferritin~~~COG2193
MQGDPEVIEFLNEQLTAELTAINQYFLHAKLQDHKGWTKLAKYTRAESFDEMRHAEVLTDRILLLDGLPNYQRLFHVRVG
QSVTEMFQADREVELEAIDRLRRGIEVMRAKHDITSANVFEAILADEEHHIDYLETQLDLIEKLGESLYLSTVIEQTQPD
PSGPGSL
>P24602 1.16.3.1~~~bfr~~~Bacterioferritin~~~COG2193
MKGKPAVLAQLHKLLRGELAARDQYFIHSRMYQDWGLEKLYSRIDHEMQDETAHASLLIERILFLEETPDLSQQDPIRVG
KTVPEMLQYDLDYEYEVIANLKEAMAVCEQEQDYQSRDLLLKILADTEEDHAYWLEKQLGLIEKIGLQNYLQSQMS
>P06864 3.2.1.23~~~ebgA~~~Evolved beta-galactosidase subunit alpha~~~COG3250
MNRWENIQLTHENRLAPRAYFFSYDSVAQARTFARETSSLFLPLSGQWNFHFFDHPLQVPEAFTSELMADWGHITVPAMW
QMEGHGKLQYTDEGFPFPIDVPFVPSDNPTGAYQRIFTLSDGWQGKQTLIKFDGVETYFEVYVNGQYVGFSKGSRLTAEF
DISAMVKTGDNLLCVRVMQWADSTYVEDQDMWWSAGIFRDVYLVGKHLTHINDFTVRTDFDEAYCDATLSCEVVLENLAA
SPVVTTLEYTLFDGERVVHSSAIDHLAIEKLTSASFAFTVEQPQQWSAESPYLYHLVMTLKDANGNVLEVVPQRVGFRDI
KVRDGLFWINNRYVMLHGVNRHDNDHRKGRAVGMDRVEKDLQLMKQHNINSVRTAHYPNDPRFYELCDIYGLFVMAETDV
ESHGFANVGDISRITDDPQWEKVYVERIVRHIHAQKNHPSIIIWSLGNESGYGCNIRAMYHAAKALDDTRLVHYEEDRDA
EVVDIISTMYTRVPLMNEFGEYPHPKPRIICEYAHAMGNGPGGLTEYQNVFYKHDCIQGHYVWEWCDHGIQAQDDHGNVW
YKFGGDYGDYPNNYNFCLDGLIYSDQTPGPGLKEYKQVIAPVKIHARDLTRGELKVENKLWFTTLDDYTLHAEVRAEGET
LATQQIKLRDVAPNSEAPLQITLPQLDAREAFLNITVTKDSRTRYSEAGHPIATYQFPLKENTAQPVPFAPNNARPLTLE
DDRLSCTVRGYNFAITFSKMSGKPTSWQVNGESLLTREPKINFFKPMIDNHKQEYEGLWQPNHLQIMQEHLRDFAVEQSD
GEVLIISRTVIAPPVFDFGMRCTYIWRIAADGQVNVALSGERYGDYPHIIPCIGFTMGINGEYDQVAYYGRGPGENYADS
QQANIIDIWRSTVDAMFENYPFPQNNGNRQHVRWTALTNRHGNGLLVVPQRPINFSAWHYTQENIHAAQHCNELQRSDDI
TLNLDHQLLGLGSNSWGSEVLDSWRVWFRDFSYGFTLLPVSGGEATAQSLASYEFGAGFFSTNLHSENKQ
>A1A3T0 3.2.1.23~~~bgaC~~~Beta-galactosidase BgaC~~~
MADTAELAIVHATTASASWLTDPTVFAANRKPAHSSHRYVIGETREPKQSLDGEWKVRIEQARNVDVESAPFAAVDFEDG
DFGAIEVPGHLQMAGYLKNKYVNIQYPWDGHEDPQAPNIPENNHVAIYRRRFALDAQLARTLENDGTVSLTFHGAATAIY
VWLDGTFVGYGEDGFTPSEFDVTEALRNGNGNAADSPEAEHTLTVACYEYSSASWLEDQDFWRLHGLFRTVELAAQPHTH
VETVQLEADYTAADTAGTADTAELNAALTLRNPADAMTIESTLRDGDGNVVWESTQACNGEIALNSGKMTNIAPWSAESP
TLYTLTVRVVGHDGAIIETVTQKIGFRTFRIENGIMTINGKRIVFKGADRHEFDAKRGRAITREDMLSDVVFCKRHNINA
IRTSHYPNQEYWYDLCDEYGLYLIDETNMETHGTWVANNVERPEDGIPGSRPEWEGACVDRINSMMRRDYNHPSVLIWSL
GNESSAGEVFRAMYRHAHTIDPNRPVHYEGSVHMREFEDVTDIESRMYAHADEIERYLNDGSPAHTDGPKKPYISCEYMH
AMGNSCGNMDEYTALERYPMYQGGFIWDFIDQAIETKLPDGTTRMCYGGDFGDRPSDYEFSGDGLLFADRTPSPKAQEVK
QLYANVKIAVSVDEARITNDNLFVSTGDYRFVLRILADGKPVWSTTRRFDVAAGESASFEVDWPVDDYRSNAEELVLEVS
QQLGNACDWAPAGYELAFGQCVVAGAKTTADAVDAAGAPADGTVTLGRWNAGVRGQGREALFSRTQGGMVSYTFGEREFV
LRRPSITTFRPLTDNDRGAGHAFERAAWAVAGKYARCVDCAIANRGENAVEATYTYELAIPQRTKVTVRYVADTAGLVSL
DVEYPGEKNGDLPTIPAFGIEWALPVEYANLRFYGAGPEETYADRRHAKLGVWSTTAGDDCAPYLLPQETGNHEDVRWAE
ITDDSGHGVRVKRGAGAKPFAMSLLPYSSTMLEEALHQDELPKPRHMFLRLLAAQMGVGGDDSWMSPVHEQYQLPADQPL
SLNVQLKLF
>Q44233 3.2.1.23~~~~~~Beta-galactosidase~~~
MPVPTPLSEGTTPDTAAQELRTNRLWEALPGLSYGGDYIPNSGRNRSARKIYRSCRKPECRPSALASSPGLGLEPVEGSY
DFTWLDEVMDNLAATGIKVALATATAAPPAGWLRKHPEILPVTAEGSTLGPARAHYLVVGMVLFCRPVCGEDDPRLGERY
KDHPALALWHVDNELGCHVSEFYGPRRHRRFPSMAEPTLRHDRGPQRGLGTAFWSQRYSCFEEILTPRPAPTTLNPTQQL
DFQRFSSWGLIDFYSMLARGHFARSHPRCPPRQIWWPQAPPCLWDYFDWAKKLECHRQWSLPGGRRYRCVTSELAFRRRS
DSEAIAGGKPWSPDGALSPCRPCNWLASQHDSRTPGEMARNSLVHVGRGIWMLSCFSSGDRASRVRRNSTRPWCRTPEPT
REYGVKLLSWAQAQSLVRGSRRRGGITHRNRLRLRTLVGKRTGLHPAPMWKYLELLRAFHAPCSCPASPPIWSIPALTLT
AMTWWSSRPCTPSPMPRPAILRQRQNAEPQCSSATSVDIDENDAVRLGGYPGAFRDLLGVNVEEFHPLPENSTVSLDAGW
SGRIWSEHVHLTGAEAKVSFTEAPLTGVPAVTRHAVGTGAAWYLATFPDATGLESLLDSLIAESGVRAPAMAAAGVELSR
RSHADGRSYLFAINHNVTEAAVSAQGTELISGTPFNGTVPAGAVAVIAEG
>O31529 3.2.1.23~~~yesZ~~~Beta-galactosidase YesZ~~~COG1874
MRKLYHGACYYPELWDEETIQQDIDIMREVGVNVVRIGEFAWSVMEPEEGKIDVGFFKEIIARLYDSGIETIMCTPTPTP
PIWFSHGRPERMHANEKREIMGHGSRQHACTNNPYFRKKAAIITTAIAKELGRLPGLIGWQLDNEFKCHVAECMCETCLR
LWHDWLKNRYGVIERLNEAWGTDVWSETYQTFEQVPQPGPAPFLHHASLRTMYQLFSMEMIASFADEQAKIIRCYSDAPI
THNGSVMFSVDNERMFQNLDFASYDTYASQENASAFLLNCDLWRNLKQGRPFWILETSPSYAASLESSAYPHADGYLQAE
AVSSYALGSQGFCYWLWRQQRSGSEISHGSVLSAWGEPTIGYQNVLAVERARKEIEPIILSTEPVQAEAAMTYSDRAKAF
IKTEPHRGLRHRSLVTHFYERILNTGIHRDLIPEGAPLDGYRLLFTPFVPYLSSEFIKKASAFAEAGGIWITGPLTGGRT
CEHTIHTDCGLGELEKTSGIKTLFTFPMNENVNTGKAFGITAPLGLWSAVFDTESGNTLGTVEAGPGAGHAFLTERNYGE
GKIVMLGSLPSGKEGDAMLEALVRHYAEEAVISSRSDVTPGTIVAPRIGENGLVWIVVNMDGKGGSVTLPESGTDLLTHR
LEKAGRLAVGPHEYRVIQFDNHS
>Q5FJ41 3.2.1.23~~~lacZ~~~Beta-galactosidase LacZ~~~COG1874
MTQLSRFLYGGDYNPDQWPEETWSKDIHVFKKADINSATINIFSWALLEPREGKYNFSKLDKVVQQLSDANFDIVMGTAT
AAMPAWMFKKYPDIARVDYQDRRHVFGQRHNFCPNSSNYQRLAGELVKQLVERYKDNKHIVVWHINNEYGGNCYCENCQN
AFRKWLKNKYKTVEGLNKAWNMNVWSHTIYDWDEIVVPNELGDVWGIEGSETIVAGLSIDYLRFQSESMQNLFKMEKKII
KKYDPETPVTTNFHGLPNKMVDYQKWAKGQDIISYDSYPTYDAPAYKAAFLYDLMRSLKHQPFMLMESAPSQVNWQPYSP
LKRPGQMEATEFQAVAHGADTVQFFQLKQAVGGSEKFHSAVIAHSQRTDTRVFKELADLGKKLKNAGPTILGSKTKAKVA
IVFDWSNFWSYEYVDGITQDLNYVDSILDYYRQFYERNIPTDIIGVDDDFSNYDLVVAPVLYMVKHGLDKKINDYVENGG
NFVTTYMSGMVNSSDNVYLGGYPGPLKEVTGIWVEESDAVVPGQKIKVLMNGKDYDTGLICNLIHPNDAKILATYASEFY
AGTPAVTENQYGKGRAWYIGTRLEHQGLTQLFNHIIFETGVESLVCDSHKLEITKRVTEDGKELYFVLNMSNEERTLPSK
FTGYEDILTGEKAHKDMKGWDVQVLRN
>Q8GEA9 3.2.1.23~~~bgaA~~~Beta-galactosidase BgaA~~~
MLGVCYYPEHWPKARWKEDARRMREAGLSYVRVGEFAWALLEPEPGRLEWGWLDEALATLAAEGLKVVLGTPTATPPKWL
VDRYPEVLPVDREGRRRRFGGRRHYCFSSPAYREEARRIVTLLAERYGGLEAVAGFQTDNEYGCHGTVRCYCPRCQEAFR
GWLKARYGTIEALNEAWGTAFWSQRYRNFTEVELPHLTVAEPNPSHLLDYYRFASDQVRAFNRLQVEILRAHAPGKFITH
NFMGFFTDLDAFALAQDLDFASWDSYPLGFTDLMPLPPEEKLRYARTGHPDVAAFHHDLYRGVGRGRFWVMEQQPGPVNW
APHNPSPTPGMVRLWTWEALAHGAEVVSYFRWRQAPFAQEQMHAGLHRPDSAPDQGFFEAKQVAEELAALALPPVAQAPV
ALVFDYEAAWVYEVQPQGAEWSYLGLIYLFYSALRRLGLDVDVVPPGASLRGYALTVVPSLPIVRGEALKAFQEAEGIVL
FGPRSGSKTETFQIPRELPPGPLQALLPLKVVRVESLPPGLLEVAEGPMGRFSLGLWREWVESPLRPWLAFADGGGALYR
EGRYLYLAAWPSPELLGVLLAGLAQEAGLRPVFLPEGLRLRRRGPWVFAFNYGPEAVEAPAPEGARFLLGGKRVGPYDLA
VWEEA
>C7ASJ5 3.2.1.23~~~~~~Beta-galactosidase~~~
MGKRFPSGWFSPRVHPPRRQRSPMTNQATPGTASVWNNIEGIGFGGDYNPEQWPVSVRLEDLELMQEAGVNFLSVGIFSW
ALLEPAEGQYDFGWLDDVMDNLHGIGVKVALATATAAPPAWLVRKHPEILPVTADGTTLGPGSRRHYTPSSAVYRKYAAG
ITRVLAERYKDHPALALWHVDNELGCHVSEFYGEEDAAAFRLWLERRYGTIDALNAAWGTAFWSQHYGSFEEILPPGVAP
STLNPGQQLDFQRFNSWALMDYYRSLVAVLREVTPAVPCTTNLMASSATKSMDYFSWAKDLDVIANDHYLVAADPERHIE
LAFSADLTRGIAGGDPWILMEHSTSAVNWQPRNQPKMPGEMLRNSLAHVARGADAVMFFQWRQSFAGSEKFHSAMVPHGG
RDTRVWREVVDLGAALQLLAPVRGSRVESRAAIVFDYEAWWASEIDSKPSIDVRYLDLLRAFHRSLFLRGVSVDMVHPSA
SLDGYDLVLVCTLYSVTDEAAANIAAAAAGGATVLVSYFSGITDEKDHVRLGGYPGAFRELLGVRVEEFHPLLAGSQLKL
SDGTVSSIWSEHVHLDGAEAFQTFTGYPLEGVPSLTRRAVGTGAAWYLATFPDRDGIESLVDRLLAESGVSPVAEADAGV
ELTRRRSADGGSFLFAINHTRAAASVRASGTDVLSGERFTGTVEAGSVAVIAED
>Q65CX4 3.2.1.23~~~lacA~~~Beta-galactosidase GalA~~~COG1874
MLHGGDYNPDQWLDRPDILADDIKLMKLAHTNTFSVGIFSWSALEPEEGVYTFEWLDDIFESIHRNGGRIILATPSGARP
AWLSQKYPEVLRVNAERVKQLHGGRHNHCFTSYVYREKTKEINRMLAERYGSQHALLMWHVSNEYGGECHCDQCQHAFRD
WLKKKYNHDIKSLNDAWWTPFWSHTFNDWSQIESPSPIGENAVHGLNLDWRRFVTDQTISFFQNEIVPLKEITPNIPITT
NFMADTHDLIPFQGLDYSKFAKHLDVISWDAYPAWHNDWESTADLAMKVGFINDLYRSLKQQPFLLMESTPSAVNWHDFN
KAKRPGMHLLSSVQMIAHGSDSILYFQWRKSRGSSEKFHGAVVGHDNCSENRVFKEVAKVGQTLEALSEVTGTIRPADVA
ILYDWENHWALQDAQGFGMKTKRYPQTLHEHYRAFWERDIPVDVITKEQDFSSYRLLIVPMLYLASEETIARLKAFAANG
GTLVMTYISGIVNESDLTYLGGWPKDLQEMFGMEPVETDTLYPGDKNAVRYQNRSYELKDYATVLKLSTADPEGFYEDDF
YADTTAVTSHPYKQGKTYYIGARLSSQFHRDFYGTLIKELAIQPALDVKHQPGVSVQVRQDEENDYIFIMNFTEKRQPVV
LASAVKDMLTGETLAGEVTLEKYEARIAVKAKE
>O07012 3.2.1.23~~~ganA~~~Beta-galactosidase GanA~~~COG1874
MLHGGDYNPDQWLDRPDILADDIKLMKLSHTNTFSVGIFAWSALEPEEGVYQFEWLDDIFERIHSIGGRVILATPSGARP
AWLSQTYPEVLRVNASRVKQLHGGRHNHCLTSKVYREKTRHINRLLAERYGHHPALLMWHISNEYGGDCHCDLCQHAFRE
WLKSKYDNSLKTLNHAWWTPFWSHTFNDWSQIESPSPIGENGLHGLNLDWRRFVTDQTISFYENEIIPLKELTPDIPITT
NFMADTPDLIPYQGLDYSKFAKHVDAISWDAYPVWHNDWESTADLAMKVGFINDLYRSLKQQPFLLMECTPSAVNWHNVN
KAKRPGMNLLSSMQMIAHGSDSVLYFQYRKSRGSSEKLHGAVVDHDNSPKNRVFQEVAKVGETLERLSEVVGTKRPAQTA
ILYDWENHWALEDAQGFAKATKRYPQTLQQHYRTFWEHDIPVDVITKEQDFSPYKLLIVPMLYLISEDTVSRLKAFTADG
GTLVMTYISGVVNEHDLTYTGGWHPDLQAIFGVEPLETDTLYPKDRNAVSYRSQIYEMKDYATVIDVKTASVEAVYQEDF
YARTPAVTSHEYQQGKAYFIGARLEDQFQRDFYEGLITDLSLSPVFPVRHGKGVSVQARQDQDNDYIFVMNFTEEKQLVT
FDQSVKDIMTGDILSGDLTMEKYEVRIVVNTH
>C6H178 3.2.1.23~~~lacA~~~Beta-galactosidase LacA~~~
MTQLSRFLYGGDYNPDQWPEETWSKDIHVFKKADINSATINIFSWALLEPREGKYNFSKLDKVVQQLSDANFDIVMGTAT
AAMPAWMFKKYPDIARVDYQDRRHVFGQRHNFCPNSSNYQRLAGELEKQLVERYKDNKHIVFWHINNEYGGNCYCENCQN
AFKKWLKNKYKTVEGLNKAWNMNVWSHTIYDWDEIVVPNELGDVWGIKGSETIVAGLSIDYLRFQSESMQNLFKMEKKII
KKFDPETPVTTNFHGLPNKMVDYQKWAKGQDIISYDSYPTYDAPAYKAAFLYDLMRSLKHQPFMLMESAPSQVNWQPYSP
LKRPGQMEATEFQAVAHGADTVQFFRLKQAVGGSEKFHSAVIAHSQRTVTRVFKELADLGKKLKNAGPTILGSKTKARFA
IVFDWSNFWSYEYVDGITQDLNYVDSILDYYRQFYERNIPTDIIGVDDDFSNYDLVVAPVLYMVKHGLDKKINDYVENGG
NFVTTYMSGMVNSSDNVYLGGYPGPLKEVTGIWVEESDAVVPGQKIKVLMNGKDYDTGLICNLIHPNDAKILATYASEFY
AGTPAVTENQYGKGRAWYIGTRLEHQGLTQLFNHIIFETGVESLVCDSHKLEITKRVTEDGKELYFVLNMSNEERTLPSK
FTGYEDILTGEKAHKDMKGWDVQVLRN
>O54315 3.2.1.23~~~bgaA~~~Beta-galactosidase BgaA~~~
MLGVCYYPEHWPEERWEEDFKAMRALGLRYVRLGEFAWSALEPTPGALRWGWLDRVLDLAQKEGLAVVLGTPTATPPKWL
VDRYPEILPVDREGRRRNFGGRRHYCFSSPAYREETARIVALLAERYGRHPAVVGFQVDNEFGCHGTVRCYCPNCREAFR
GWLRAKYGTIDALNAAWGTVFWSQTYRDFGEVELPHLTVAEANPSHLLDYYRFASDQVRAYNRFQVDLLRDNAPGRFITH
NFMGFFTDLDPFALAEDLDFAAWDSYPLGFTDLMPLPQEEKVQWARTGHPDVAAFHHDLYRGVGRGRFWVMEQQPGPVNW
APHNPSPAPGMVRLWTWEAIAHGAEVVSYFRWRQAPFAQEQMQAGFNRPDFQPEVAFFEVQRVAEELSALPLPPAGCAPV
ALVYDSEAAWVFEIQPQGAEWKYLTLVFSFYSVFRRLGLEVDIFKPGAELGGYGLVVVPSLPIVRKEALEALSQADGLVI
VGPRSGSKTEKFQIPPEIPPGALQALLPLKVVRVESLPPGLLEEAEGPWGRFAFGVWREWVETDLPPLLRFTDGGGILFR
RGRYLYLAAWPSPELLFALCQSLAEEAGLHPRFLPEGLRLRRRGPLVFAFNYGPEVVEAPAPPGVRFLLGDRRIPPHDLA
VWEET
>C8WV58 3.2.1.23~~~bglY~~~Beta-galactosidase BglY~~~COG1874
MAKHAPIFPNVQGFLHGGDYNPDQWLAYPDVLEQDVQLMREAKWNVVSLGIFSWVSLEPEEGLFTFEWLDEAIERLTHAG
VRILLATPSGARPAWLSAKYPEVLRVGPDGRRNRHGGRHNHCYTSPIYREKVRIINRKLAERYAHHPGVIGWHVSNEYGG
ECHCPLCQEAFREWLKRKYKTLDALNHAWWTPFWSHTYTDWSQIESPMPHGETSIHGLNLDWKRFVTDQTVDFCRHEIEP
LKQVNPNLPVTTNFMGTYPGLNYWRFRDVLDVISWDSYPRWHAHETLVPEAVHTAMVHDLNRSILKKPFLLMESTPSVTN
WQAVSKQKRPGVHVLVSLQAVAHGADSVQYFQWRKSRGSYEKFHGAVVDHVGHANTRVFRDVQAVGEMLERLAPMAGAEV
KADAAVIFDWENRWALEDAKGPRNIGMHYEETVVNHYAALWRMGVPMDVIDEEQPLDGYKLVVAPMLYMVRPGVAERMKA
FVERGGSLVLTYWSGIVDENDLVFLGGFPGPLRELAGVWAEEIDALYDGERVPVRVADGNPLGLAGHYEARELCEVVHLE
GAEPIAVYGADYYEGMPAATVHRVGKGKVYYVAARLEDAFLRDFFARVAAEAGVARAIERELPDGVSAMVRSGDGVEYVM
LMNFTPEAREVALDEAEYKPLYGEAPTDGAVRLPAYGVSVLERPARNG
>Q59140 3.2.1.23~~~lacZ~~~Beta-galactosidase~~~
MSSSYITDQGPGSGLRVPARSWLNSDAPSLSLNGDWRFRLLPTAPGTPGAGSVLATGETVEAVASESFDDSSWDTLAVPS
HWVLAEDGKYGRPIYTNVQYPFPIDPPFVPDANPTGDYRRTFDVPDSWFESTTAALTLRFDGVESRYKVWVNGVEIGVGS
GSRLAQEFDVSEALRPGKNLLVVRVHQWSAASYLEDQDQWWLPGIFRDVKLQARPVGGLTDVWLRTDWSGSGTITPEITA
DPAAFPVTLRVPELGLEVIWDSPADVAPVSIDAVEPWSAEVPRLYDASVSSAAESISLRLGFRTVKIVGDQFLVNGRKVI
FHGVNRHETNADRGRVFDEASAREDLALMKRFNVNAIRTSHYPPHPRFLDLADELGFWVILECDLETHGFHALKWVGNPS
DDPAWRDALVDRMERTVERDKNHASIVMWSLGNESGTGANLAAMAAWTHARDLSRPVHYEGDYTGAYTDVYSRMYSSIPE
TDSIGRNDSHALLLGCNAIESARQRTRPFILCEYVHAMGNGPGAIDQYEDLVDKYPRLHGGFVWEWRDHGIRTRTADGTE
FFAYGGDFDEVIHDGNFVMDGMILSDSTPTPGLFEYKQIVSPIRLALTLNAEGNAGLTVANLRHTSDASDVVLRWRVEHN
GTRVDAGELTTDGANGPLQAGDSLTLTLPTIVAAAEGETWLSVEAVLREATAWAPAGHPLSETQLDLSPAQPPLRVPRPA
SPIAGAAPVELGPATFDAGSLVTLAGLPVAGPRLELWRAPTDNDKGQGFGAYGPEDPWINSGRGVPAPSSAVVWQQAGLD
RLTRRVEDVAALPQGLRVRSRYAAANSEHDVAVEENWQLSGDELWLRIDIAPSAGWDLVFPRIGVRLDLPSEVDGASWFG
AGPRESYPDSLHSAVVGTHGGSLEELNVNYARPQETGHHSDVRWVELSRDGAPWLRIEADPDALGRRPGFSLAKNTAQEV
ALAPHPHELPESQHSYLYLDAAQHGLGSRACGPDVWPDFALRPEARTLVLRIRAA
>A1A399 3.2.1.23~~~bgaB~~~Beta-galactosidase BgaB~~~
MSARRNFEWPELLTADGRGIAFGGDYNPDQWSEDIWDDDIRLMKQAGVNTVALAIFSWDRIQPTEDRWDFGWLDRIIDKL
GNAGIVVDLASATATAPLWLYESHPEVLPRDKYGHPVNAGSRQSWSPTSPVFKEYALTLCRKLAERYGTNPYVTAWHMGN
EYGWNNREDYSDNALEAFRAWCRRKYGTIDALNQAWGTTFWGQEMNGFDEVLIPRFMGADSMVNPGQKLDFERFGNDMLL
DFYKAERDAIAEICPDKPFTTNFMVSTDQCCMDYAAWAKEVNFVSNDHYFHEGESHLDELACSDALMDSLALGKPWYVME
HSTSAVQWKPLNTRKRKGETVRDSLAHVAMGADAINFFQWRASAFGAEAFHSAMVPHAGEDTKLFRQVCELGASLHTLAD
AGVQGTELAHSDTAILFSAESEWATRSQTLPSMKLNHWHDVRDWYRAFLDAGSRADIVPLAYDWSSYKTVVLPTVLILSA
ADTQRLADFAAAGGRVVVGYATGLIDEHFHTWLGGYPGAGDGLLRSMLGVRGEEFNILGAEAEGEPGEIRLSSADDSAAL
DGTTTRLWQNDVNVTGEHAQVLATYAGEEADEWELDGTAAVTRNPYGSGEAYFVGCDLDVADLTKLVRAYLAASSQENAD
VLHTVRASADATFDFYLPRGKKTVELQGIEGEPVILFQTDREEKPGSYTVRRNGVLVVRR
>Q93GI5 3.2.1.23~~~beta-galIII~~~Beta-galactosidase III~~~
MEHRAFKWPQPLAGNKPRIWYVGDYNPDQWPEEVWDEDVALMQQAGVNLVSVAIFSWAKLEPEEGVYDFDWLDRVIDKLG
KAGIAVDLASGTASPPMWMTQAHPEILWVDYRGDVCQPGARQHWRATSPVFLDYALNLCRKMAEHYKDNPYVVSWHVSNE
YGCHNRFDYSEDAERAFQKWCEKKYGTIDAVNDAWGTAFWAQRMNNFSEIIPPRFIGDGNFMNPGKLLDWKRFSSDALLD
FYKAERDALLEIAPKPQTTNFMVSAGCTVLDYDKWGHDVDFVSNDHYFSPGEAHFDEMAYAACLTDGIARKNPWFLMEHS
TSAVNWRPTNYRLEPGELVRDSLAHLAMGADAICYFQWRQSKAGAEKWHSAMVPHAGPDSQIFRDVCELGADLNKLADEG
LLSTKLVKSKVAVVFDYESQWATEHTATPTQEVRHWTEPLDWFRALADNGLTADVVPVRGPWDEYEAVVLPSLAILSEQT
TRRVREYVANGGKLFVTYYTGLVDDRDHVWLGGYPGSIRDVVGVRVEEFAPMGTDAPGTMDHLDLDNGTVAHDFADVITS
VADTAHVVASFKADKWTGFDGAPAITVNDFGDGKAAYVGARLGREGLAKSLPALLEELGIETSAEDDRGEVVRVERADET
GENHFVFLFNRTHDVAVVDVEGEPLVASLAQVNESEHTAAIQPNGVLVVKL
>Q9RFN0 3.2.1.23~~~bgaB~~~Beta-galactosidase BgaB~~~
MLQQKKLFYGGDYNPEQWSKAIILEDMRLMKKANVNYVSLNIFGWASIQPTEEGFDFSFLDEMLDLLWENGIGIDLANGT
ASPPAWLVKKHPEILPVTSQGTPLVHGSRQHYCPSNKVYRSYVIRLTEEVAKRYATHPGIVMWHVNNEYTCHISECYCES
CEKSFRQWLQMKYKKINTLNECWSTKFWSQSYSQWDEIFLPKEMPTFKNPAHQLDYKRFISDQNLTLFKAEKKAIRSYSK
DIPVMTNLMGLHKHVDGFAFAEEMDVVGWDSYPNPFEEKPYPQFLANDLTRSLKKKPFLVMEQAPSAVNWRRANGAKSPG
QMRLWSYEALAHGADGILFFQWRQSQGGAEKFHSGMVSHNQDTNSRIFKEVVQLGTEMSQLDELVGTNYNAEVAIVFDWE
NWWALELDAKPSGEINYIKQMRDLYTIFHELNIGVDFIHPKEDLSNYKLVLSIAQYLVTDDFSAKVKRYIKAGGHFLTTF
FSGIVDEYDRVYLGGYPGAFKEVLGIYVEEFDPMPIGRKSQIKYGETYYTTELWKEVIHLQGAETIATFTEGYLMGQPAL
TKFGYGKGKTYYMGTKLAKDGNMKFIQTILAESKIQPLNQVEIESENSKISMTCRSNSSHDYIFLLNYGQTSEKVKLKKG
GQSLLDGSMVEGEVSVKANDVKIIKLTK
>P24131 3.2.1.23~~~cbgA~~~Beta-galactosidase~~~
MINNKPSLDWLENPEIFRVNRIDAHSDTWFYEKFEDVKLEDTMPLKQNLNGKWRFSYSENSSLRIKEFYKDEFDVSWIDY
IEVPGHIQLQGYDKCQYINTMYPWEGHDELRPPHISKTYNPVGSYVTFFEVKDELKNKQTFISFQGVETAFYVWVNGEFV
GYSEDTFTPSEFDITDYLREGENKLAVEVYKRSSASWIEDQDFWRFSGIFRDVYLYAVPETHVNDIFIKTDLYDDFKNAK
LNAELKMIGNSETTVETYLEDKEGNKIAISEKIPFSDELTLYLDAQNINLWSAEEPNLYTLYILVNKKDGNLIEVVTQKI
GFRHFEMKDKIMCLKWKRIIFKGVNRHEFSARRGRSITKEDMLWDIKFLKQHNINAVRTSHYPNQSLWYRLCDEYGIYLI
DETNLESHGSWQKMGQIEPSWNVPGSLPQWQAAVLDRASSMVERDKNHPSVLIWSCGNESYAGEDIYQMSKYFRKKDPSR
LVHYEGVTRCREFMTRRHESRMYAKAAEIEEYLNDNPKKPYISCEYMHSMGNSTGGMMKYTELEDKYLMYQGGFIWDYGD
QALYRKLPDGKEVLAYGGDFTDRPTDYNFSGNGLIYADRTISPKAQEVKYLYQNVKLEPDEKGVTIKNQNLFVNTDKYDL
YYIVERDGKLIKDGYLNVSVAPDEEKYIELPIGNYNFPEEIVLTTSLRLAQATLWAEKGYEIAFGQKVIKEKSDMNNHNS
ESKMKIIHGDVNIGVHGKDFKAIFSKQEGGIVSLRYNNKEFITRTPKTFYWRATTDNDRGNRHEFRCSQWLAATMGQKYV
DFSVEEFDEKITLYYTYQLPTVPSTNVKITYEVSGEGIIKVNVKYKGVSGLPELPVLGMDFKLLAEFNSFSWYGMGPEEN
YIDRCEGAKLGIYESTQ
>D9SM34 3.2.1.23~~~bgaA~~~Beta-galactosidase BgaA~~~COG1874
MRIGVDYYPEHWDRQLWEKDAQLMKEIGVKVVRLAEFAWCKLEPIEGQYDFKWLDDVIEIFSVRNIEIVLGTPTNTPPLW
LYEKYPDAIQVNESGERQFIGIRGHRCYNSSSMRKYTKAIVEAMTERYANNKAVIGWQIDNELDATHCCCDNCTEKFRGW
LKNKYSTLENINKEYGNVVWSGEYSAWSQVTAPLGGSPFLNPSYLLDYNRFASDSMVEYIDFQREIIRKNCPSQFITTNT
WFTGNLPNFYDAFENLDFVSYDNYPTTNEITDEEELHSHAFHCDLMRGIKKKNFWIMEQLSGTPGCWMPMQRTPKPGMIK
GYSFQAIGRGAETVVHFRWRNAIIGAEMFWHGILDHSNVKGRRFYEFAELCREVNKINEEIPDYKINNEVAILYSSDQDF
AFKIQPQVEGLYYLQQLKAFHNALIRLGVGTDIINWSESLNKYKVVIAPTLYLTDDNVTTELYRFVEAGGTLILTNRTGV
KNMNNVCLMEQMPSNLKECAGVVVKEYDPIGHSIHTIKDEAGKVYQCKQWCDILEPTTAKVIATYNDDFYIDEAAVTVNK
YKKGNVYYLGTVFNSDYYIELLSKILDEKELPYYKKLPYGLELSVLENENGKYLMVFNNSNEIKCFEGKHEGKSIIRNEL
DGKSFTLEPYGIEVLQLVE
>P00722 3.2.1.23~~~lacZ~~~Beta-galactosidase~~~COG3250
MTMITDSLAVVLQRRDWENPGVTQLNRLAAHPPFASWRNSEEARTDRPSQQLRSLNGEWRFAWFPAPEAVPESWLECDLP
EADTVVVPSNWQMHGYDAPIYTNVTYPITVNPPFVPTENPTGCYSLTFNVDESWLQEGQTRIIFDGVNSAFHLWCNGRWV
GYGQDSRLPSEFDLSAFLRAGENRLAVMVLRWSDGSYLEDQDMWRMSGIFRDVSLLHKPTTQISDFHVATRFNDDFSRAV
LEAEVQMCGELRDYLRVTVSLWQGETQVASGTAPFGGEIIDERGGYADRVTLRLNVENPKLWSAEIPNLYRAVVELHTAD
GTLIEAEACDVGFREVRIENGLLLLNGKPLLIRGVNRHEHHPLHGQVMDEQTMVQDILLMKQNNFNAVRCSHYPNHPLWY
TLCDRYGLYVVDEANIETHGMVPMNRLTDDPRWLPAMSERVTRMVQRDRNHPSVIIWSLGNESGHGANHDALYRWIKSVD
PSRPVQYEGGGADTTATDIICPMYARVDEDQPFPAVPKWSIKKWLSLPGETRPLILCEYAHAMGNSLGGFAKYWQAFRQY
PRLQGGFVWDWVDQSLIKYDENGNPWSAYGGDFGDTPNDRQFCMNGLVFADRTPHPALTEAKHQQQFFQFRLSGQTIEVT
SEYLFRHSDNELLHWMVALDGKPLASGEVPLDVAPQGKQLIELPELPQPESAGQLWLTVRVVQPNATAWSEAGHISAWQQ
WRLAENLSVTLPAASHAIPHLTTSEMDFCIELGNKRWQFNRQSGFLSQMWIGDKKQLLTPLRDQFTRAPLDNDIGVSEAT
RIDPNAWVERWKAAGHYQAEAALLQCTADTLADAVLITTAHAWQHQGKTLFISRKTYRIDGSGQMAITVDVEVASDTPHP
ARIGLNCQLAQVAERVNWLGLGPQENYPDRLTAACFDRWDLPLSDMYTPYVFPSENGLRCGTRELNYGPHQWRGDFQFNI
SRYSQQQLMETSHRHLLHAEEGTWLNIDGFHMGIGGDDSWSPSVSAEFQLSAGRYHYQLVWCQK
>A3FEW8 3.2.1.23~~~lacZ~~~Beta-galactosidase~~~
MFTASPMSLSKILARRDWENPGVTQWHRLPAHAPFNSWRDEASARADDNASRKRSLNGDWQFSYYAAPEQVPDSWVTEDC
ADAVTTPVPSNWQMQGFDTPIYTNDTYPIPVNPPFVPAENPTGCYSLTFEVDEQWLESGQTRIVFDGVNSAFYLWCNGKW
MGYSQDSRLPAEFDLSAVLRPGTNRLAVLVLRWCDGSYLEDQDMWRMSGIFRDVSLLHKPHTHIADYHAVTELNADYDRA
KLQVEVALAGEQFADCEVAVTLWRDGLSVATVSAKPGSAIIDERGNWAERLNVTLPVKDPALWSAETPELYRLTFALRDG
QGEILDVEACDVGFRCVEISNGLLKVNGKPLLIRGVNRHEHHPENGQVMDEATMCRDIELMKQHNFNAVRCSHYPNHPLW
YTLCDRYGLYVVDEANIETHGMVPMSRLADDPRWLPAMSERVTRMVLRDRNHPSIIIWSLGNESGHGANHDALYRWVKTT
DPTRPVQYEGGGANTAATDIVCPMYARVDQDQPFEAVPKWSLKKWIGMPDETRPLILCEYAHAMGNSFGGFAKYWQAFRN
HPRLQGGFVWDWVDQALTKKDDNGNAFWAYGGDFGDTPNDRQFCLNGLVFPDRTPHPALFEAQRAQQFFTFTLVSTSPLV
IDVHSDYLFRQCDNEQLRWNIARDGEVLASGEVALTIAPQQTQRIEIDAPEFAAAAGEIWLNVDIVQTAATAWSPADHRC
AWDQWQLPAPLYIAPPVEGTAKPDLKVKEDVLEVSHQSQRWHFDRASGNLTQWWNNGTATLLAPLSDNFTRAPLDNDIGV
SEATRIDPNAWVERWKAAGMYNLTPRLLLCEGEQLAQAVTITTLHAWESNGKALFLSRKVWKIDRAGVLHGDVQVQVAND
IPQPARIGLSCQLAQTPQTASWLGLGPDENYPDRKLAARQGRWTLPLDALHTAYIFPTDNGLRCDTRELTFDTHQMQGDF
HFSLSRYSQQQLRDTSHHHLLEAEPGCWLNIDAFHMGVGGDDSWSPSVSPEFILQRREMRYAFSWRQD
>P19668 3.2.1.23~~~bgaB~~~Beta-galactosidase bgaB~~~
MNVLSSICYGGDYNPEQWPEEIWYEDAKLMQKAGVNLVSLGIFSWSKIEPSDGVFDFEWLDKVIDILYDHGVYINLGTAT
ATTPAWFVKKYPDSLPIDESGVILSFGSRQHYCPNHPQLITHIKRLVRAIAERYKNHPALKMWHVNNEYACHVSKCFCEN
CAVAFRKWLKERYKTIDELNERWGTNFWGQRYNHWDEINPPRKAPTFINPSQELDYYRFMNDSILKLFLTEKEILREVTP
DIPVSTNFMGSFKPLNYFQWAQHVDIVTWDSYPDPREGLPIQHAMMNDLMRSLRKGQPFILMEQVTSHVNWRDINVPKPP
GVMRLWSYATIARGADGIMFFQWRQSRAGAEKFHGAMVPHFLNENNRIYREVTQLGQELKKLDCLVGSRIKAEVAIIFDW
ENWWAVELSSKPHNKLRYIPIVEAYYRELYKRNIAVDFVRPSDDLTKYKVVIAPMLYMVKEGEDENLRQFVANGGTLIVS
FFSGIVDENDRVHLGGYPGPLRDILGIFVEEFVPYPETKVNKIYSNDGEYDCTTWADIIRLEGAEPLATFKGDWYAGLPA
VTRNCYGKGEGIYVGTYPDSNYLGRLLEQVFAKHHINPILEVAENVEVQQRETDEWKYLIIINHNDYEVTLSLPEDKIYQ
NMIDGKCFRGGELRIQGVDVAVLREHDEAGKV
>P0C1Y0 3.2.1.23~~~lacZ~~~Beta-galactosidase~~~
MSNKLVKEKRVDQADLAWLTDPEVYEVNTIPPHSDHESFQSQEELEEGKSSLVQSLDGDWLIDYAENGQGPVNFYAEDFD
DSNFKSVKVPGNLELQGFGQPQYVNVQYPWDGSEEIFPPQIPSKNPLASYVRYFDLDEAFWDKEVSLKFDGAATAIYVWL
NGHFVGYGEDSFTPSEFMVTKFLKKENNRLAVALYKYSSASWLEDQDFWRMSGLFRSVTLQAKPRLHLEDLKLTASLTDN
YQKGKLEVEANIAYRLPNASFKLEVRDSEGDLVAEKLGPIRSEQLEFTLADLPVAAWSAEKPNLYQVRLYLYQAGSLLEV
SRQEVGFRNFELKDGIMYLNGQRIVFKGANRHEFDSKLGRAITEEDMIWDIKTMKRSNINAVRCSHYPNQSLFYRLCDKY
GLYVIDEANLESHGTWEKVGGHEDPSFNVPGDDQHWLGASLSRVKNMMARDKNHASILIWSLGNESYAGTVFAQMADYVR
KADPTRVQHYEGVTHNRKFDDATQIESRMYAPAKVIEEYLTNKPAKPFISVEYAHAMGNSVGDLAAYTALEKYPHYQGGF
IWDWIDQGLEKDGHLLYGGDFDDRPTDYEFCGNGLVFADRTESPKLANVKALYANLKLEVKDGQLFLKNDNLFTNSSSYY
FLTSLLVDGKLTYQSRPLTFGLEPGESGTFALPWPEVADEKGEVVYRVTAHLKEDLPWADEGFTVAEAEEVAQKLPEFKP
EGRPDLVDSDYNLGLKGNNFQILFSKVKGWPVSLKYAGREYLKRLPEFTFWRALTDNDRGAGYGYDLARWENAGKYARLK
DISCEVKEDSVLVKTAFTLPVALKGDLTVTYEVDGRGKIAVTADFPGAEEAGLLPAFGLNLALPKELTDYRYYGLGPNES
YPDRLEGNYLGIYQGAVKKNFSPYRPQETGNRSKVRWYQLFDEKGGLEFTANGADLNLSALPYSAAQIEAADHAFELTNN
YTWVRALSAQMGVGGDDSWGQKVHPEFCLDAQKARQLRLVIQPLLLK
>Q7WTB4 3.2.1.23~~~lacL~~~Beta-galactosidase large subunit~~~COG3250
MQANINWLDNPEVFRVNQLPAHSDHPFFRDYREWQKQHSSYQQSLNGKWKFHFSANPMDRPQDFYQRDFDSSNFDSIPVP
SEIELSNYTQNQYINVLFPWEGKIFRRPAYALDPNDHEEGSFSKGADNTVGSYLKRFDLSSALIGKDVHIKFEGVEQAMY
VWLNGHFVGYAEDSFTPSEFDLTPYIQDKDNLLAVEVFKHSTASWLEDQDMFRFSGIFRSVELLGIPATHLMDMDLKPRV
ADNYQDGIFNLKLHFIGKKAGSFHLLVKDIKGHTLLEKNEDIKENVQINNEKFENVHLWNNHDPYLYQLLIEVYDEQQNL
LELIPFQFGFRRIEISPEKVVLLNGKRLIINGVNRHEWDAKRGRSITMSDMTTDINTFKENNINAVRTCHYPNQIPWYYL
CDQNGIYVMAENNLESHGTWQKMGEIEPSDNVPGSIPQWKEAVIDRARINYETFKNHTSILFWSLGNESYAGDNIIAMNE
FYKSHDDTRLVHYEGVVHRPELKDKISDVESCMYLPPKKVEEYLQNDPPKPFMECEYMHDMGNSDGGMGSYIKLLDKYPQ
YFGGFIWDFIDQALLVHDEISGHDVLRYGGDFDDRHSDYEFSGDGLMFADRTPKPAMQEVRYYYGLHK
>Q48846 3.2.1.23~~~lacL~~~Beta-galactosidase large subunit~~~
MQPNIQWLDTPAVFRVGQLPAHSDHRYYATLAEMAQQQSSFEQSLNGTWQFHYSVNAASRPKSFYELAFDAQDFEPITVP
QHIELAGYEQLHYINTMYPWEGHYYRRPAFSTSDDKQHLGMFSEADYNPVGSYLHHFDLTPALRNQRVIIRFEGVEQAMY
VWLNGQFIGYAEDSFTPSEFDLTPYLKETDNCLAVEVHKRSSAAFIEDQDFFRFFGIFRDVKLLAKPRTHLEDLWVIPEY
DVVQQTGQVKLRLQFSGDENRVHLRIRDQHQIILTADLTSGAQVNDLYKMPELVQAWSNQTPNLYTLELEVVDQAGETIE
ISQQPFGFRKIEIKDKVMLLNGKRLVINGVNRHEWHPETGRTITAEDEAWDIACMQRNHINAVRTSHYPDRLSFYNGCDQ
AGIYMMAETNLESHGSWQKMGAVEPSWNVPGSYDEWEAATLDRARTNFETFKNHVSILFWSLGNESYAGSVLEKMNAYYK
QQDPTRLVHYEGVFRAPEYKATISDVESRMYATPAEIKAYLDNAPQKPFILCEYMHDMGNSLGGMQSYIDLLSQYDMYQG
GFIWDFIDQALLVTDPVTGQRELRYGGDFDDRPSDYEFSGDGLVFATRDEKPAMQEVRYYYGEHK
>Q02603 3.2.1.23~~~lacL~~~Beta-galactosidase large subunit~~~
MQANLQWLDDPEVFRVNQLPAHSDHHYYHDTAEFKTGSRFIKSLNGAWRFNFAKTPAERPVDFYQPDFDATDFDTIQVPG
HIELAGYGQIQYINTLYPWEGKIYRRPPYTLNQDQLTPGLFSDAADNTVGSYLKTFDLDDVFKGQRIIIQFQGVEEALYV
WLNGHFIGYSEDSFTPSEFDLTPYIQDQGNVLAVRVYKHSTAAFIEDQDMFRFSGIFRDVNILAEPASHITDLDIRPVPN
ANLKSGELNITTKVTGEPATLALTVKDHDGRVLTSQTQTGSGSVTFDTMLFDQLHLWSPQTPYLYQLTIEVYDADHQLLE
VVPYQFGFRTVELRDDKVIYVNNKRLVINGVNRHEWNAHTGRVISMADMRADIQTMLANNINADRTCHYPDQLPWYQLCD
EAGIYLMAETNLESHGSWQKMGAIEPSYNVPGDNPHWPAAVIDRARSNYEWFKNHPSIIFWSLGNESYAGEDIAAMQAFY
KEHDDSRLVHYEGVFYTPELKDRISDVESRMYEKPQNIVAYLEDNPTKPFLNCEYMHDMGNSLGGMQSYNDLIDKYPMYQ
GGFIWDFIDQALFVHDPITDQDVLRYGGDFDERHSDYAFSGNGLMFADRTPKPAMQEVKYYYGLHK
>Q09HN2 3.2.1.23~~~bgaP~~~Beta-galactosidase BgaP~~~
MINDKLPKIWHGGDYNPEQWDSQEIWDEDVRMFKLAGIDVATLNVFSWALNQPNEDTYNFEWLDDKINRLYENGIYTCLA
TSTAAHPAWMAKKYPDVLRVDFYGRKRKFGSRHNSCPNSPTYREYSEKIADKLAERYKDHPAVLIWHVSNEYGGYCYCDN
CQDAFRVWLSDKYGTLEKLNKAWNTGFWGHTFYEWDEIVAPNMLSEEREDNVSDFQGISLDYRRFQSDSLLDCYKLEYNA
IRKHTPNIPITTNLMGTYPMLDYFKWAKEMDVVSWDNYPAIDTPFSYTAMTHDLMRGLKSGQPFMLMEQTPSQQNWQPYN
SLKRPGVMRLWSYQAIGRGADTILYFQLRRSVGACEKYHGAVIEHVGHEHTRVFNEVAQIGKEFNQLGDTLLDARVNARV
AIVFDWENRWATELSSGPSVSLDYVNEVHKYYDALYKLNVQVDMVGVEEDLSQYDVVIAPVLYMVKEGYAAKVESFVENG
GTFITTFFSGIVNETDIVTLGGYPGELRKVLGIWAEEIDALHPDETNEIVVNGSRGSLSGSYSCNLLFDLIHTEGAQAVA
EYGSDFYQGMPVLTVNEFGKGKAWYVASSPDAEFLVDFLQTVCEEAGVEPLLSVPEGVETTERVKDGQTYLFVLNHNNKV
ESIDLKDSQYQELLSTQQLSGTVELEAKGVFILAKV
>Q9KI47 3.2.1.23~~~bgaA~~~Beta-galactosidase BgaA~~~
MINDKLPKIWHGGDYNPEQWDSKEIWDEDVRMFKLAGIDVATLNVFSWALNQPNEDTYNFDWLDEKINRLYENGIYTCLA
TSTAAHPAWMAKKYPDVLRVDFYGRKRKFGSRHNSCPNSPTYRKYSERIAETLAERYKDHPAVLIWHVSNEYGGYCYCDN
CQDAFRNWLSDKYGTLEKLNKAWNTGFWGHTFYEWDEIVAPNMLSEKREDNVSDFQGISLDYRRFQSDRLLDCYKLEYNA
IRKHVPTSIPITTNLMGTYPMLDYFKWAKEMDVVSWDNYPSIDTPFSYTAMTHDLMRGLKGGKPFMLMEQTPSQQNWQPY
NSLKRPGVMRLWSYQAIGRGADTILYFQLRRSVGACEKYHGAVIEHVGHEHTRVFNEVAQLGQELNGLSDTLLDARVNAK
VAIVFDWENRWATELSSGPSVSLDYVNEVHKYYDALYKLNVQVDMIGVEEDLSKYDVVIAPVLYMVKEGYAAKVEKFVEN
GGTFLTTFFSGIVNETDIVTLGGYPGELRKVLGIWAEEIDALHPDETNQIVVKGSRGILSGKYSCNLLFDLIHTEGAEAV
AEYGSDFYKGMPVLTVNKFGKGKAWYVASSPDAEFLVDFLQTVCEEAGVEPLLDVPAGVETTERVKDGQTYLFVLNHNND
EVTIELHGSQYREVLTDEQVSGNLVLKEKGVLILAKV
>P81650 3.2.1.23~~~lacZ~~~Beta-galactosidase~~~COG3250
MTSLQHIINRRDWENPITVQVNQVKAHSPLNGFKTIEDARENTQSQKKSLNGQWDFKLFDKPEAVDESLLYEKISKELSG
DWQSITVPSNWQLHGFDKPIYCNVKYPFAVNPPFVPSDNPTGCYRTEFTITPEQLTQRNHIIFEGVNSAFHLWCNGQWVG
YSQDSRLPSEFDLSELLVVGTNRIAVMVIRWSDGSYLEDQDMWWLSGIFRDVNLLTKPQSQIRDVFITPDLDACYRDATL
HIKTAINAPNNYQVAVQIFDGKTSLCEPKIQSTNNKRVDEKGGWSDVVFQTIAIRSPKKWTAETPYLYRCVVSLLDEQGN
TVDVEAYNIGFRKVEMLNGQLCVNGKPLLIRGVNRHEHHPENGHAVSTADMIEDIKLMKQNNFNAVRTAHYPNHPLFYEL
CDELGLYVVDEANIETHGMFPMGRLASDPLWAGAFMSRYTQMVERDKNHASIIIWSLGNECGHGANHDAMYGWSKSFDPS
RPVQYEGGGANTTATDIICPMYSRVDTDIKDDAVPKYSIKKWLSLPGETRPLILCEYAHAMGNSLGSFDDYWQAFREYPR
LQGGFIWDWVDQGLSKIDENGKHYWAYGGDFGDELNDRQFCINGLLFPDRTPHPSLFEAKYSQQHLQFTLREQNQNQNQN
QYSIDVFSDYVFRHTDNEKLVWQLIQNGVCVEQGEMALNIAPQSTHTLTIKTKTAFEHGAQYYLNLDVALINDSHFANAN
HVMDSEQFKLINSNNLNSKSFASATEKSVISVNETDSHLSIENNTFKLVFNQQSGLIEQWLQDDTQVISSPLVDNFYRAP
LDNDIGVSEVDNLDPNAWEARWSRAGIGQWQRTCSSINAVQSSVDVRITCVFNYEFNGVLQAQTQWLYTLNNTGTISLNV
DVNLNDTLPPMPRIGLSTTINKQSDTKVNWLGLGPFENYPDRKSAARFGYYSLSLNELYTPYIFPTDNGLRSDCQLLSIN
NLIVTGAFLFAASEYSQNMLTQAKHTNELIADDCIHVHIDHQHMGVGGDDSWSPSTHKEYLLEQKNYNYSLTLTGGITT
>Q59750 3.2.1.23~~~lacZ~~~Beta-galactosidase~~~
MRSVTSFNDSWVFSEASTRDAERSGRVSRSACRTNAVELPFNYFDERCYQRAFTYQRVLAWRPDFSQGSRSSSTRQWPMR
SCISTAKRSSRIRDGYTPFEARLTDRLLEGDNLITVKIDGSENPEIPPFGAGIDYLTYAGIYRDVWLKVTDPVSIANIKI
ETRDVLSDHKAVSLRCDLSNPQGLSFSGTISALLKNAAGEVLAEVAGETTGQSLAFEMDGLRGLSLWDIDDPVLYVIEVE
LRTGQGFRLLRRAFRLPHGEFTTEGFRLNGRPLKIRGLNRHQSFPYVGLRMGRTAKGSAHADIMNAHRLHCNLVRTSHYP
QSKWFLDHCDRIGLLVFARNPRLAAYRWGGMETGGNPERPPHRSSATGTTRLSYIWGVRINESQDSHDFYAETNRLAREL
DPTRQTGGVRYITDSEFLEDVYTMNDFILGNEELPGANRPGTALRPQQECTGLPRKVPYLITEFGGHMYPTKIYDQEQRQ
AEHVRRHLEVLNAAYARNPGISGAIGWCMFDYNTTRISAPATGSAITASWTCSASPKFAAYVYASQCDPSEEIVMKPVTF
WARGDDDIGGVLPLIVLTNCDEIELKYGSLTKRVGPDRENFPHLPHPPVVIDHRHFTKDELGVWGMKWESAEFTGFIAGK
PVADLRMAADPVPTTLQVEADSKTLRREGRDTVRLILRALDQAGNVLPFLNDAVDIEIHGPARLVGPARIVLQGGSGFLA
GVHGRRRHASSRSRRRGSAAAKLDLVALADGAASA
>Q9X6C6 3.2.1.23~~~bgaT~~~Beta-galactosidase BgaT~~~
MLGVCYYPEHWPRERWSEDARRMRELGLAYVRVGEFAWALLEPEPGRLDWAWLDEAVAVLAQAGLKVVLGTPTATPPKWL
VDRYPEILPVDREGRRRRFGGRRHYCFSSPVYHEETRRIVTLLGERYGKHPAVAGFQTDNEYGCHGTVRCYCERCQDAFR
KWLEERYGSIDVLNEAWGTVFWSQRYRTFQEVELPNLTVAEANPSHLLDYYRFASEQVRRYNRLQVEILRAHAPGKFVTH
NFMGFFTDLNPFPLGEDLDFASWDSYPLGFTDLMPLPEEEKLRYARTGHPDVAAFHHDLYRAVGRGRFWVMEQQPGPVNW
APHNPNPAPGMVRLWTWEALANGAEVVSYFRWRQVPFAQEQMHAGLHRPDYAPDAAFFEVQRVVEELGALSLPPPGQAPV
ALVYDPEAPWVYEVQPHGAEWNYLALVFLFYSVARRLGLDVDIVPPGAALQGYRLVLVPSLPIVREKALNAFREADGIVL
FGPRSGSKNENFHIPQGLPPGPLQALLPLKVVRVESLPPGLREEVEGPWGRFSSGLWREWVETDLSPLLRFADGLGALFR
AGRYLYLAAWPSPELLGALLVGLAQEGGLSPKPLPSGLRLRWRGHLVFAFNYGPEEVVLPVPSGVRFRLGGPRLSPYEVA
VWEEG
>Q56307 3.2.1.23~~~lacZ~~~Beta-galactosidase~~~COG3250
MPYEWENPQLVSEGTEKSHASFIPYLDPFSGEWEYPEEFISLNGNWRFLFAKNPFEVPEDFFSEKFDDSNWDEIEVPSNW
EMKGYGKPIYTNVVYPFEPNPPFVPKDDNPTGVYRRWIEIPEDWFKKEIFLHFEGVRSFFYLWVNGKKIGFSKDSCTPAE
FRLTDVLRPGKNLITVEVLKWSDGSYLEDQDMWWFAGIYRDVYLYALPKFHIRDVFVRTDLDENYRNGKIFLDVEMRNLG
EEEEKDLEVTLITPDGDEKTLVKETVKPEDRVLSFAFDVKDPKKWSAETPHLYVLKLKLGEDEKKVNFGFRKIEIKDGTL
LFNGKPLYIKGVNRHEFDPDRGHAVTVERMIQDIKLMKQHNINTVRTSHYPNQTKWYDLCDYFGLYVIDEANIESHGIDW
DPEVTLANRWEWEKAHFDRIKRMVERDKNHPSIIFWSLGNEAGDGVNFEKAALWIKKRDNTRLIHYEGTTRRGESYYVDV
FSLMYPKMDILLEYASKKREKPFIMCEYAHAMGNSVGNLKDYWDVIEKYPYLHGGCIWDWVDQGIRKKDENGREFWAYGG
DFGDTPNDGNFCINGVVLPDRTPEPELYEVKKVYQNVKIRQVSKDTYEVENRYLFTNLEMFDGAWKIRKDGEVIEEKTFK
IFAEPGEKRLLKIPLPEMDDSEYFLEISFSLSEDTPWAEKGHVVAWEQFLLKAPAFEKKSISDGVSLREDGKHLTVEAKD
TVYVFSKLTGLLEQILHRRKKILKSPVVPNFWRVPTDNDIGNRMPQRLAIWKRASKERKLFKMHWKKEENRVSVHSVFQL
PGNSWVYTTYTVFGNGDVLVDLSLIPAEDVPEIPRIGFQFTVPEEFGTVEWYGRGPHETYWDRKESGLFARYRKAVGEMM
HRYVRPQETGNRSDVRWFALSDGETKLFVSGMPQIDFSVWPFSMEDLERVQHISELPERDFVTVNVDFRQMGLGGDDSWG
AMPHLEYRLLPKPYRFSFRMRISEEIPSWRVLAAIPETLHVEMSSEDVIREGDTLRVKFSLLNDTPLSKEKQVVLFVDGN
EYSVRRVVIPPFKKEELVFKVEGLKKGEHLIHTNLNTRKTIYVR
>O69315 3.2.1.23~~~~~~Beta-galactosidase~~~
MLGVCYYPEHWPKERWKEDARRMREAGLSHVRIGEFAWALLEPEPGRLEWGWLDEAIATLAAEGLKVVLGTPTATPPKWL
VDRYPEILPVDREGRRRRFGGRRHYCFSSPVYREEARRIVTLLAERYGGLEAVAGFQTDNEYGCHDTVRCYCPRCQEAFR
GWLEARYGTIEALNEAWGTAFWSQRYRSFAEVELPHLTVAEPNPSHLLDYYRFASDQVRAFNRLQVEILRAHAPGKFVTH
NFMGFFTDLDAFALAQDLDFASWDSYPLGFTDLMPLPPEEKLRYARTGHPDVAAFHHDLYRGVGRGRFWVMEQQPGPVNW
APHNPSPAPGMVRLWTWEALAHGAEVVSYFRWRQAPFAQEQMHAGLHRPDSAPDQGFFEAKRVAEELAALALPPVAQAPV
ALVFDYEAAWIYEVQPQGAEWSYLGLVYLFYSALRRLGLDVDVVPPGASLRGYAFAVVPSLPIVREEALEAFREAEGPVL
FGPRSGSKTETFQIPKELPPGPLQALLPLKVVRVESLPPGLLEVAEGALGRFPLGLWREWVEAPLKPLLTFQDGKGALYR
EGRYLYLAAWPSPELAGRLLSALAAEAGLKVLSLPEGLRLRRRGTWVFAFNYGPEAVEAPASEGARFLLGSRRVGPYDLA
VWEEA
>P26257 3.2.1.23~~~lacZ~~~Beta-galactosidase~~~
MRKIIPINNNWYFKADYEEGYEKVDDLRSFENVNLPHTNIELPYNYFDEKMYQIKSCYKYPLHISEKYRDKVIYIHFEGV
MAYAQVYLNGLYIGEHKGGYTPFDIRIDEVYDWKKELNMLTVVVDSTERSDIPPKGGQIDYLTYGGIYREVSLGIYDDVF
IKNIKVETHGIYDNEKSLNLIVHLENLNHQSGNVKFKVKINDKNGKEVFYKEFNTYLDAVKDVYSFNIENLKDIKLWDVD
NPNLYEIKVGMKINNFSDEYDNKFGFREAVFKPDGFYLNGRKLKLRGLNRHQSYPYVGYAMPRRVQEKDAEILKNELHLN
IVRTSHYPQSKHFLNKCDELGLLVFEEIPGWQYIGNSEWKKVAEQNLREMITRDWNHPSIILWGVRINESQDDDAFYKNM
NKIAHEIDPTRQTGGVRYITNSSFLEDVYTFNDFIHDGINKPLRKQQEVTGLEHNVPYLVTEYNGHMYPTKRFDNEERQM
EHCLRHLRIQNASYLDDSISGAIGWCAFDYNTHKDFGSGDRICYHGVMDMFRLPKFASYVYKSQVSPDIEPILEPVTFWA
RGERSIGGVIPLIIFTNCDYIELQYGNKTKIDNIYPNRDAYKGIPYPPIIIDYDIVKPEMIGAWGMVWEDLTLKGFYKGN
KVIERKFSREPIPTYLYVVPDDTILSATQKDATRIVVKILDQYGNLLPFINEVIKIEIEGPAKLQGPNEVALIGGA
>P48982 3.2.1.23~~~bga~~~Beta-galactosidase~~~
MLRTTLAPLVLALALALPAAAATPESWPTFGTQGTQFVRDGKPYQLLSGAIHFQRIPRAYWKDRLQKARALGLNTVETYV
FWNLVEPQQGQFDFSGNNDVAAFVKEAAAQGLNVILRPGPYACAEWEAGGYPAWLFGKGNIRVRSRDPRFLAASQAYLDA
LAKQVQPLLNHNGGPIIAVQVENEYGSYADDHAYMADNRAMYVKAGFDKALLFTSDGADMLANGTLPDTLAVVNFAPGEA
KSAFDKLIKFRPDQPRMVGEYWAGWFDHWGKPHAATDARQQAEEFEWILRQGHSANLYMFIGGTSFGFMNGANFQNNPSD
HYAPQTTSYDYDAILDEAGHPTPKFALMRDAIARVTGVQPPALPAPITTTTLPATPLRESASLWDNLPTPIAIDTPQPME
QFGQDYGYILYRTTITGPRKGPLYLGDVRDVARVYVDQRPVGSVERRLQQVSLEVEIPAGQHTLDVLVENSGRINYGTRM
ADGRAGLVDPVLLDSQQLTGWQAFPLPMRTPDSIRGWTGKAVQGPAFHRGTLRIGTPTDTYLDMRAFGKGFAWANGVNLG
RHWNIGPQTALYLRPSSARVTTRWWSSTWTMLHPSVRG
>Q7WTB3 3.2.1.23~~~lacM~~~Beta-galactosidase small subunit~~~COG3250
MDYTNNQLHIIYGDATLGVNGKDFQYIFSYERGGLESLKVHGKEWLYRVPTPTFWRATTDNDRGSGFNLKAAQWLGADMF
TKCTDIHLKVDRHDFAELPIAPFNNKFSNHEYAKSAEISFTYQTLTTPATNAKIIYNIDDVGHIKVTMRYYGKKGLPPLP
VIGIRLIMPTAATGFDYEGLSGETYPDRMAGAKEGKFHIDGLPVTEYLVPQENGMHMQTKKLTINRETTQNNVDRTNEKF
SLSIQQAEKPFNFSCLPYTAEELENATHIEELPLVRRTVLVIAGAVRGVGGIDSWGTDVESAYHINPELDHEFSFILN
>Q48847 3.2.1.23~~~lacM~~~Beta-galactosidase small subunit~~~
MANTNKRLAVIFGDVTLGLKGPDFHYLFSYQTGGPESLRIQGKEWLYRSPKPTFWRATTDNDRGNQFPLKSGMWLAADQF
IACQSITVAIDGQTIPLPIAPENNRYSGRETAQEVTVTYTYQTITTPQTTVEVSYTIQASGKIRVAVTYHGQAGLPSLPV
FGLRFVMPTPATRFIYQGLSGETYPDRMAGGIAGEYEVTGLPVTPYLVPQDCGVHMATDWVTIYRQAVLDNCLHEPVETG
LKFKMVDQPFAFSCLPYTAEELENATHHSELPAPHRTVLNLLGAVRGVGGIDSWGSDVEVAYQIDATQDRHFEFEISF
>Q02604 3.2.1.23~~~lacM~~~Beta-galactosidase small subunit~~~
MAYTNNQLHVIYGDGSLGLQGANFHYLFSYERGGLESLVVNDKEWLYRTPTPMFWRATTDNDHGSGFSVKSAQWYAADKF
STCQDIELTVDDQPVTPLPIAPLNNKYTDHEIATKVSLAYHFVTTTVPSTIVTVTYTVTADGQINIATHYSGQSDLPELP
AFGLRFIMPTTATGFDYTGLSGETYPDRLAGATHGQFHVDSLPVTPYLVPQECGMHMQTEQVTVTRSTTQNNADHDNTPF
SLTFSQTDAPFAFSCLPYTAAELENATHMEELPLARRTVLSIYGAVRGVGGIDSWGTDVEAPYHILANQDIDFSFNIHF
>A7LXS9 3.2.1.23~~~~~~Beta-galactosidase BoGH2A~~~COG3250
MMIGKLKYLMLGGCLILGSCLALGGCLMLLGACSSSSLVSPRERSDFNADWRFHLGDGLQAAQPGFADNDWRVLDLPHDW
AIEGDFSQENPSGTGGGALPGGVGWYRKTFSVDKADAGKIFRIEFDGVYMNSEVFINGVSLGVRPYGYISFSYDLTPYLK
WDEPNVLAVRVDNAEQPNSRWYSGCGIYRNVWLSKTGPIHVGGWGTYVTTSSVDEKQAVLNLATTLVNESDTNENVTVCS
SLQDAEGREVAETRSSGEAEAGKEVVFTQQLTVKQPQLWDIDTPYLYTLVTKVMRNEECMDRYTTPVGIRTFSLDARKGF
TLNGRQTKINGVCMHHDLGCLGAAVNTRAIERHLQILKEMGCNGIRCSHNPPAPELLDLCDRMGFIVMDEAFDMWRKKKT
AHDYARYFNEWHERDLNDFILRDRNHPSVFMWSIGNEVLEQWSDAKADTLSLEEANLILNFGHSSEMLAKEGEESVNSLL
TKKLVSFVKGLDPTRPVTAGCNEPNSGNHLFRSGVLDVIGYNYHNKDIPNVPANFPDKPFIITESNSALMTRGYYRMPSD
RMFIWPKRWDKSFADSTFACSSYENCHVPWGNTHEESLKLVRDNDFISGQYVWTGFDYIGEPTPYGWPARSSYFGIVDLA
GFPKDVYYLYQSEWTDKQVLHLFPHWNWTPGQEIDMWCYYNQADEVELFVNGKSQGVKRKDLDNLHVAWRVKFEPGTVKV
IARESGKVVAEKEICTAGKPAEIRLTPDRSILTADGKDLCFVTVEVLDEKGNLCPDADNLVNFTVQGNGFIAGVDNGNPV
SMERFKDEKRKAFYGKCLVVIQNDGKPGKAKLTATSEGLRQAVLKISAEEL
>A7LXS8 3.2.1.21~~~~~~Beta-glucosidase BoGH3A~~~COG1472
MIIGIMKTFLLTICFLSVQTGMVAIAQDKEQTPVYLDDTQPIEVRVQDALNRMTVEEKTRLSYAQGKFSSPGCPRLGIPE
LWMSDGPHGVRAEINWNDWGYAGWTNDSCTAFPALTCLAASWNPLLAAKYGYAIGEEARYREKDVLLGPGVNIYRTPLNG
RNFEYMGEDPYLASELCVPYIQGVQKNGVAACVKHYALNNQELWRGHIDVQLSDRALYEIYLPAFKAAVERGKAWSIMGA
YNKVRGTHATHHKLLNNDILKGEWNFDGCVITDWGAAHDTYEAAMYGLDIEMGSYTNGLTSESEFGYDDYYLGKSYLKMV
REGKIPMEVVNDKAARVLRLIFRTAMNRRKPFGALTSEEHYRTAYEIATEGIVLLKNGTGKKQPALLPVPQGKYKRILVV
GDNATRNLMLGGGSSELKVQKVISSLDGIKAKFGDGVVYAQGYTSGRPMYGRADVIPQVTVDSLRNDAVEKAMNSDLVIF
VGGLNKNHFQDCEGGDRLSYELPFAQNELIEALLKVNKNLVAVIVSGNAVEMPWVKEIPSIVQSWYLGSVGGEALADVLS
GEVTPSGKLPFSYPVKLEDCPAHFFGEISYPGDSIRQEYKEDILVGYRWYDTKKVQPLFPFGYGMSYTTFEYSKPVISAQ
TMNTDGSIDVSVKVKNTGKVAGKEIIQLYIGDEECSVLRPVKELKDFRKVQLLPNEEKEVKFTIKPEALQFFDDKQRTWV
AEPGKFKAYIAASSSDIRGTVTFEYIQ
>A7LXU3 3.2.1.21~~~~~~Beta-glucosidase BoGH3B~~~COG1472
MKNIKKMVLVSAFAGTCLTPHAQTASPVIPTDPAIETHIREWLQKMTLEQKIGQMCEITIDVVSDLETSRKKGFCLSEAM
LDTVIGKYKVGSLLNVPLGVAQKKEKWAEAIKQIQEKSMKEIGIPCIYGVDQIHGTTYTLDGTMFPQGINMGATFNRELT
RRGAKISAYETKAGCIPWTFAPVVDLGRDPRWARMWENYGEDCYVNAEMGVSAVKGFQGEDPNRIGEYNVAACMKHYMGY
GVPVSGKDRTPSSISRSDMREKHFAPFLAAVRQGALSVMVNSGVDNGLPFHANRELLTEWLKEDLNWDGLIVTDWADINN
LCTRDHIAATKKEAVKIVINAGIDMSMVPYEVSFCDYLKELVEEGEVSMERIDDAVARVLRLKYRLGLFDHPYWDIKKYD
KFGSKEFAAVALQAAEESEVLLKNDGNILPIAKGKKILLTGPNANSMRCLNGGWSYSWQGHVADEYAQAYHTIYEALCEK
YGKENIIYEPGVTYASYKNDNWWEENKPETEKPVAAAAQADIIITCIGENSYCETPGNLTDLTLSENQRNLVKALAATGK
PIVLVLNQGRPRIINDIVPLAKAVVNIMLPSNYGGDALANLLAGDANFSGKMPFTYPRLINALATYDYKPCENMGQMGGN
YNYDSVMDIQWPFGFGLSYTNYKYSNLKVNKPTFNADDELIFTVDVTNTGKVAGKESVLLFSKDLVASSTPDNIRLRNFE
KVSLEPGETKTVTLKLKGSDLAFVGYDGKWRLEKGDFKIKCGDQWMDIVCDQTKVWNTPNKNTLHK
>A7LXT7 3.2.1.151~~~~~~Xyloglucan-specific endo-beta-1,4-glucanase BoGH5A~~~COG2730
MEKQSFSDGLFSPLGIKRVIFMLVLLTTSFISCSNSDEKGGSLEVAQEYRNLEFDARGSRQTIQIDGPAEWHISTSESWC
KSSHTIGEGKQYVNITVEANDTQKERTATVTVSASGAPDIIINVKQSLYSVPAYDEYIAPDNTGMRDLTSMQLSALMKAG
VNVGNTFEAVIVGNDGSLSGDETCWGNPTPNKVLFEGIKAAGFDVVRIPVAYSHQFEDAATYKIKSAWMDKVEAAVKAAL
DAGLYVIINIHWEGGWLNHPVDANKEALDERLEAMWKQIALRFRDYDDRLLFAGTNEVNNDDANGAQPTEENYRVQNGFN
QVFVNTVRATGGRNHYRHLIVQAYNTDVAKAVAHFTMPLDIVQNRIFLECHYYDPYDFTIMPNDENFKSQWGAAFAGGDV
SATGQEGDIEATLSSLNVFINNNVPVIIGEYGPTLRDQLTGEALENHLKSRNDYIEYVVKTCVKNKLVPLYWDAGYTEKL
FDRTTGQPHNAASIAAIMKGLN
>A7LXT3 3.2.1.151~~~~~~Xyloglucan-specific endo-beta-1,4-glucanase BoGH9A~~~COG3291
MKIVRYIALFGILSGLAVACTPSTSVIPNDAIRLNQLGYYPNQEKIAVVDSGKVEEFVIWDAVSGEQVFVGKSLYTAKSA
WSDKTRTTLDFSAVTTPGKYILKVNGASVTFLIKDSVLSPLADAALKSFYYQRTAMPIEEQYAGQWHRMAGHPDNHVLIH
PSAASPDRPAGTIVSSSKGWYDAGDYNKYIVNSGYSIGLMQSIYQLFLDYFSRQKINIPESNNHTPDLLDEMQFNLDWML
TMQDPEDGGVYHKLTTPFFEGFVKPVDCKQQRYVVQKSVTAALDFAAVMAQSSRLFASYEEDYPGFSKRALLAAEKAYAW
AEKHPEAYYNQNLLNQKYQPAIATGEYGDTHADDEFFWAASELYFSTGKEIYREEAIKKAPQIYTAPGWGNTFALGIFAW
LQPGRELNEADRRFADSLKTELLKYADKVIEGAEQTPFHAPYGNDAKDFFWGCLAEKCMNQGVSLMYAYLQTGKDVYLTN
AYRNMDYILGRNATGFCYVTGLGTKSPKHPHHRLSASDDIEDPIPGFLVGGPNPGQQDGAFYPTASPDESYVDTEDSYAS
NEVAINWNAALVALASSLDALAVYSVK
>P26208 3.2.1.21~~~bglA~~~Beta-glucosidase A~~~COG2723
MSKITFPKDFIWGSATAAYQIEGAYNEDGKGESIWDRFSHTPGNIADGHTGDVACDHYHRYEEDIKIMKEIGIKSYRFSI
SWPRIFPEGTGKLNQKGLDFYKRLTNLLLENGIMPAITLYHWDLPQKLQDKGGWKNRDTTDYFTEYSEVIFKNLGDIVPI
WFTHNEPGVVSLLGHFLGIHAPGIKDLRTSLEVSHNLLLSHGKAVKLFREMNIDAQIGIALNLSYHYPASEKAEDIEAAE
LSFSLAGRWYLDPVLKGRYPENALKLYKKKGIELSFPEDDLKLISQPIDFIAFNNYSSEFIKYDPSSESGFSPANSILEK
FEKTDMGWIIYPEGLYDLLMLLDRDYGKPNIVISENGAAFKDEIGSNGKIEDTKRIQYLKDYLTQAHRAIQDGVNLKAYY
LWSLLDNFEWAYGYNKRFGIVHVNFDTLERKIKDSGYWYKEVIKNNGF
>P42973 3.2.1.86~~~bglA~~~Aryl-phospho-beta-D-glucosidase BglA~~~COG2723
MGNMPKDFLWGGALAAHQFEGGWNQGGKGPSVVDVMTAGAHGVPRKITDTIEENEFYPNHEAIDFYHRYKEDIALFAEMG
LKCLRTSIGWSRIFPKGDEAEPNEAGLQFYDDVFDELLKHGIEPVITLSHFEMPLHLAREYGGFRNRKVVDFFVNFAEAC
FTRYKDKVKYWMTFNEINNQMDVNNPLFLWTNSGVVVGENENAKEVMYQTAHHELVASALAVAKGKDINPEFQIGAMVSH
VPIYPFSSNPEDVMLAEEEMRQRYFFPDVQVRGYYPSYALKEFEREGYNITFEDGDDEILRNGTVDYLGFSYYMSTTVKS
DVKNDNTGDIVNGGLPNGVENPYITSSDWGWAIDPTGLRYTLNRFYDRYQIPLFIVENGFGAVDTLEEDGKVHDPERIQY
LKSHIEALKKAVTYDGVDLIGYTPWGIIDIVSFTTGEMKKRYGMIYVDRDNEGNGSMKRYKKDSFEWYKNVIQTNGEEL
>Q46829 3.2.1.86~~~bglA~~~6-phospho-beta-glucosidase BglA~~~COG2723
MIVKKLTLPKDFLWGGAVAAHQVEGGWNKGGKGPSICDVLTGGAHGVPREITKEVLPGKYYPNHEAVDFYGHYKEDIKLF
AEMGFKCFRTSIAWTRIFPKGDEAQPNEEGLKFYDDMFDELLKYNIEPVITLSHFEMPLHLVQQYGSWTNRKVVDFFVRF
AEVVFERYKHKVKYWMTFNEINNQRNWRAPLFGYCCSGVVYTEHENPEETMYQVLHHQFVASALAVKAARRINPEMKVGC
MLAMVPLYPYSCNPDDVMFAQESMRERYVFTDVQLRGYYPSYVLNEWERRGFNIKMEDGDLDVLREGTCDYLGFSYYMTN
AVKAEGGTGDAISGFEGSVPNPYVKASDWGWQIDPVGLRYALCELYERYQRPLFIVENGFGAYDKVEEDGSINDDYRIDY
LRAHIEEMKKAVTYDGVDLMGYTPWGCIDCVSFTTGQYSKRYGFIYVNKHDDGTGDMSRSRKKSFNWYKEVIASNGEKL
>Q03506 3.2.1.21~~~bglA~~~Beta-glucosidase~~~
MSIHMFPSDFKWGVATAAYQIEGAYNEDGRGMSIWDTFAHTPGKVKNGDNGNVACDSYHRVEEDVQLLKDLGVKVYRFSI
SWPRVLPQGTGEVNRAGLDYYHRLVDELLANGIEPFCTLYHWDLPQALQDQGGWGSRITIDAFAEYAELMFKELGGKIKQ
WITFNEPWCMAFLSNYLGVHAPGNKDLQLAIDVSHHLLVAHGRAVTLFRELGISGEIGIAPNTSWAVPYRRTKEDMEACL
RVNGWSGDWYLDPIYFGEYPKFMLDWYENLGYKPPIVDGDMELIHQPIDFIGINYYTSSMNRYNPGEAGGMLSSEAISMG
APKTDIGWEIYAEGLYDLLRYTADKYGNPTLYITENGACYNDGLSLDGRIHDQRRIDYLAMHLIQASRAIEDGINLKGYM
EWSLMDNFEWAEGYGMRFGLVHVDYDTLVRTPKDSFYWYKGVISRGWLDL
>P22073 3.2.1.21~~~bglA~~~Beta-glucosidase A~~~COG2723
MTIFQFPQDFMWGTATAAYQIEGAYQEDGRGLSIWDTFAHTPGKVFNGDNGNVACDSYHRYEEDIRLMKELGIRTYRFSV
SWPRIFPNGDGEVNQEGLDYYHRVVDLLNDNGIEPFCTLYHWDLPQALQDAGGWGNRRTIQAFVQFAETMFREFHGKIQH
WLTFNEPWCIAFLSNMLGVHAPGLTNLQTAIDVGHHLLVAHGLSVRRFRELGTSGQIGIAPNVSWAVPYSTSEEDKAACA
RTISLHSDWFLQPIYQGSYPQFLVDWFAEQGATVPIQDGDMDIIGEPIDMIGINYYSMSVNRFNPEAGFLQSEEINMGLP
VTDIGWPVESRGLYEVLHYLQKYGNIDIYITENGACINDEVVNGKVQDDRRISYMQQHLVQVHRTIHDGLHVKGYMAWSL
LDNFEWAEGYNMRFGMIHVDFRTQVRTPKESYYWYRNVVSNNWLETRR
>Q08638 3.2.1.21~~~bglA~~~Beta-glucosidase A~~~COG2723
MNVKKFPEGFLWGVATASYQIEGSPLADGAGMSIWHTFSHTPGNVKNGDTGDVACDHYNRWKEDIEIIEKLGVKAYRFSI
SWPRILPEGTGRVNQKGLDFYNRIIDTLLEKGITPFVTIYHWDLPFALQLKGGWANREIADWFAEYSRVLFENFGDRVKN
WITLNEPWVVAIVGHLYGVHAPGMRDIYVAFRAVHNLLRAHARAVKVFRETVKDGKIGIVFNNGYFEPASEKEEDIRAVR
FMHQFNNYPLFLNPIYRGDYPELVLEFAREYLPENYKDDMSEIQEKIDFVGLNYYSGHLVKFDPDAPAKVSFVERDLPKT
AMGWEIVPEGIYWILKKVKEEYNPPEVYITENGAAFDDVVSEDGRVHDQNRIDYLKAHIGQAWKAIQEGVPLKGYFVWSL
LDNFEWAEGYSKRFGIVYVDYSTQKRIVKDSGYWYSNVVKNNGLED
>B9K7M5 3.2.1.74~~~gghA~~~1,4-beta-D-glucan glucohydrolase~~~COG2723
MKKFPEGFLWGVATASYQIEGSPLADGAGMSIWHTFSHTPGNVKNGDTGDVACDHYNRWKEDIEIIEKIGAKAYRFSISW
PRILPEGTGKVNQKGLDFYNRIIDTLLEKNITPFITIYHWDLPFSLQLKGGWANRDIADWFAEYSRVLFENFGDRVKHWI
TLNEPWVVAIVGHLYGVHAPGMKDIYVAFHTVHNLLRAHAKSVKVFRETVKDGKIGIVFNNGYFEPASEREEDIRAARFM
HQFNNYPLFLNPIYRGEYPDLVLEFAREYLPRNYEDDMEEIKQEIDFVGLNYYSGHMVKYDPNSPARVSFVERNLPKTAM
GWEIVPEGIYWILKGVKEEYNPQEVYITENGAAFDDVVSEGGKVHDQNRIDYLRAHIEQVWRAIQDGVPLKGYFVWSLLD
NFEWAEGYSKRFGIVYVDYNTQKRIIKDSGYWYSNVIKNNGLTD
>P14002 3.2.1.21~~~bglB~~~Thermostable beta-glucosidase B~~~COG1472
MAVDIKKIIKQMTLEEKAGLCSGLDFWHTKPVERLGIPSIMMTDGPHGLRKQREDAEIADINNSVPATCFPSAAGLACSW
DRELVERVGAALGEECQAENVSILLGPGANIKRSPLCGRNFEYFSEDPYLSSELAASHIKGVQSQGVGACLKHFAANNQE
HRRMTVDTIVDERTLREIYFASFENAVKKARPWVVMCAYNKLNGEYCSENRYLLTEVLKNEWMHDGFVVSDWGAVNDRVS
GLDAGLDLEMPTSHGITDKKIVEAVKSGKLSENILNRAVERILKVIFMALENKKENAQYDKDAHHRLARQAAAESMVLLK
NEDDVLPLKKSGTIALIGAFVKKPRYQGSGSSHITPTRLDDIYEEIKKAGGDKVNLVYSEGYRLENDGIDEELINEAKKA
ASSSDVAVVFAGLPDEYESEGFDRTHMSIPENQNRLIEAVAEVQSNIVVVLLNGSPVEMPWIDKVKSVLEAYLGGQALGG
ALADVLFGEVNPSGKLAETFPVKLSHNPSYLNFPGEDDRVEYKEGLFVGYRYYDTKGIEPLFPFGHGLSYTKFEYSDISV
DKKDVSDNSIINVSVKVKNVGKMAGKEIVQLYVKDVKSSVRRPEKELKGFEKVFLNPGEEKTVTFTLDKRAFAYYNTQIK
DWHVESGEFLILIGRSSRDIVLKESVRVNSTVKIRKRFTVNSAVEDVMSDSSAAAVLGPVLKEITDALQIDMDNAHDMMA
ANIKNMPLRSLVGYSQGRLSEEMLEELVDKINNVE
>P11988 3.2.1.86~~~bglB~~~6-phospho-beta-glucosidase BglB~~~COG2723
MKAFPETFLWGGATAANQVEGAWQEDGKGISTSDLQPHGVMGKMEPRILGKENIKDVAIDFYHRYPEDIALFAEMGFTCL
RISIAWARIFPQGDEVEPNEAGLAFYDRLFDEMAQAGIKPLVTLSHYEMPYGLVKNYGGWANRAVIDHFEHYARTVFTRY
QHKVALWLTFNEINMSLHAPFTGVGLAEESGEAEVYQAIHHQLVASARAVKACHSLLPEAKIGNMLLGGLVYPLTCQPQD
MLQAMEENRRWMFFGDVQARGQYPGYMQRFFRDHNITIEMTESDAEDLKHTVDFISFSYYMTGCVSHDESINKNAQGNIL
NMIPNPHLKSSEWGWQIDPVGLRVLLNTLWDRYQKPLFIVENGLGAKDSVEADGSIQDDYRIAYLNDHLVQVNEAIADGV
DIMGYTSWGPIDLVSASHSQMSKRYGFIYVDRDDNGEGSLTRTRKKSFGWYAEVIKTRGLSLKKITIKAP
>P22505 3.2.1.21~~~bglB~~~Beta-glucosidase B~~~COG2723
MSENTFIFPATFMWGTSTSSYQIEGGTDEGGRTPSIWDTFCQIPGKVIGGDCGDVACDHFHHFKEDVQLMKQLGFLHYRF
SVAWPRIMPAAGIINEEGLLFYEHLLDEIELAGLIPMLTLYHWDLPQWIEDEGGWTQRETIQHFKTYASVIMDRFGERIN
WWNTINEPYCASILGYGTGEHAPGHENWREAFTAAHHILMCHGIASNLHKEKGLTGKIGITLNMEHVDAASERPEDVAAA
IRRDGFINRWFAEPLFNGKYPEDMVEWYGTYLNGLDFVQPGDMELIQQPGDFLGINYYTRSIIRSTNDASLLQVEQVHME
EPVTDMGWEIHPESFYKLLTRIEKDFSKGLPILITENGAAMRDELVNGQIEDTGRHGYIEEHLKACHRFIEEGGQLKGYF
VWSFLDNFEWAWGYSKRFGIVHINYETQERTPKQSALWFKQMMAKNGF
>P38645 3.2.1.21~~~bglB~~~Thermostable beta-glucosidase B~~~
MTESAMTSRAGRGRGADLVAAVVQGHAAASDAAGDLSFPDGFIWGAATAAYQIEGAWREDGRGLWDVFSHTPGKVASGHT
GDIACDHYHRYADDVRLMAGLGDRVYRFSVAWPRIVPDGSGPVNPAGLDFYDRLVDELLGHGITPYPTLYHWDLPQTLED
RGGWAARDTAYRFAEYALAVHRRLGDRVRCWITLNEPWVAAFLATHRGAPGAADVPRFRAVHHLLLGHGLGLRLRSAGAG
QLGLTLSLSPVIEARPGVRGGGRRVDALANRQFLDPALRGRYPEEVLKIMAGHARLGHPGRDLETIHQPVDLLGVNYYSH
VRLAAEGEPANRLPGSEGIRFERPTAVTAWPGDRPDGLRTLLLRLSRDYPGVGLIITENGAAFDDRADGDRVHDPERIRY
LTATLRAVHDAIMAGADLRGYFVWSVLDNFEWAYGYHKRGIVYVDYTTMRRIPRESALWYRDVVRRNGLRNGE
>P42403 3.2.1.86~~~bglC~~~Aryl-phospho-beta-D-glucosidase BglC~~~COG2723
MIHQHPESFPKHFLWGSASAAYQIEGAWNEDGKGPSVWDVFTKIPGKTFKGTNGEIAVDHYHRFKEDVALMAEMGLKAYR
FSVSWPRVFPKGKGEINEAGLAFYDSLIDELLSHHIEPVLTLYHWDLPQALMDEYGGFESRNIIEDFNHYCITLYKRFGD
RVKYWVTLNEQNYNFNHGFITAMHPPGVKDRKRFYEANHIAFLANAKAIESFREYVPEGKIGPSFAYSPAYPLSSHPEDI
LAFENAEEFTNNWWLDMYCWGTYPQIPFRCLEKQGWAPTIEAGDMDLLAKGKPDFVGVNYYQTITYERNPLDGVSEGKMN
TTGQKGTNQETGIPGVFKTKKNPHLTTSNWDWTIDPIGLRIGLRRITSRYQLPVFITENGLGEFDKVEDGTVQDDYRIDY
LRSHLEQCRQAISDGVDLIGYCSWSFTDLLSWLNGYQKRYGFVYVNRDEESTSDLKRLKKKSFYWYQDVIKTNGESL
>P94248 3.2.1.21~~~~~~Bifunctional beta-D-glucosidase/beta-D-fucosidase~~~
MTMIFPKGFMFGTATAAYQIEGAVAEGGRTPSIWDTFSHTGHTLNGDTGDVADDFYHRWEDDLKLLRDLGVNAYRFSIGI
PRVIPTPDGKPNQEGLDFYSRIVDRLLEYGIAPIVTLYHWDLPQYMASGDGREGGWLERETAYRIADYAGIVAKCLGDRV
HTYTTLNEPWCSAHLSYGGTEHAPGLGAGPLAFRAAHHLNLAHGLMCEAVRAEAGAKPGLSVTLNLQICRGDADAVHRVD
LIGNRVFLDPMLRGRYPDELFSITKGICDWGFVCDGDLDLIHQPIDVLGLNYYSTNLVKMSDRPQFPQSTEASTAPGASD
VDWLPTAGPHTEMGWNIDPDALYETLVRLNDNYPGMPLVVTENGMACPDKVEVGTDGVKMVHDNDRIDYLRRHLEAVYRA
IEEGTDVRGYFAWSLMDNFEWAFGYSKRFGLTYVDYESQERVKKDSFDWYRRFIADHSAR
>P11989 ~~~bglG~~~Cryptic beta-glucoside bgl operon antiterminator~~~COG3711
MNMQITKILNNNVVVVIDDQQREKVVMGRGIGFQKRAGERINSSGIEKEYALSSHELNGRLSELLSHIPLEVMATCDRII
SLAQERLGKLQDSIYISLTDHCQFAIKRFQQNVLLPNPLLWDIQRLYPKEFQLGEEALTIIDKRLGVQLPKDEVGFIAMH
LVSAQMSGNMEDVAGVTQLMREMLQLIKFQFSLNYQEESLSYQRLVTHLKFLSWRILEHASINDSDESLQQAVKQNYPQA
WQCAERIAIFIGLQYQRKISPAEIMFLAINIERVRKEH
>P40740 3.2.1.86~~~bglH~~~Aryl-phospho-beta-D-glucosidase BglH~~~COG2723
MSSNEKRFPEGFLWGGAVAANQVEGAYNEGGKGLSTADVSPNGIMSPFDESMTSLNLYHNGIDFYHRYKEDIALFAEMGF
KAFRTSIAWTRIFPNGDEEEPNEEGLRFYDDLFDELLKHHIEPVVTISHYEMPLGLVKNYGGWKNRKVIEFYERYAKTVF
KRYQHKVKYWMTFNEINVVLHAPFTGGGLVFEEGENKLNAMYQAAHHQFVASALAVKAGHDIIPDSKIGCMIAATTTYPM
TSKPEDVFAAMENERKTLFFSDVQARGAYPGYMKRYLAENNIEIEMAEGDEELLKEHTVDYIGFSYYMSMAASTDPEELA
KSGGNLLGGVKNPYLKSSEWGWQIDPKGLRITLNTLYDRYQKPLFIVENGLGAVDKVEEDGTIQDDYRINYLRDHLIEAR
EAIADGVELIGYTSWGPIDLVSASTAEMKKRYGFIYVDRDNEGNGTFNRIKKKSFNWYQQVIATNGESL
>P26218 ~~~bglH~~~Cryptic outer membrane porin BglH~~~COG4580
MFRRNLITSAILLMAPLAFSAQSLAESLTVEQRLELLEKALRETQSELKKYKDEEKKKYTPATVNRSVSTNDQGYAANPF
PTSSAAKPDAVLVKNEEKNASETGSIYSSMTLKDFSKFVKDEIGFSYNGYYRSGWGTASHGSPKSWAIGSLGRFGNEYSG
WFDLQLKQRVYNENGKRVDAVVMMDGNVGQQYSTGWFGDNAGGENYMQFSDMYVTTKGFLPFAPEADFWVGKHGAPKIEI
QMLDWKTQRTDAAAGVGLENWKVGPGKIDIALVREDVDDYDRSLQNKQQINTNTIDLRYKDIPLWDKATLMVSGRYVTAN
ESASEKDNQDNNGYYDWKDTWMFGTSLTQKFDKGGFNEFSFLVANNSIASNFGRYAGASPFTTFNGRYYGDHTGGTAVRL
TSQGEAYIGDHFIVANAIVYSFGNDIYSYETGAHSDFESIRAVVRPAYIWDQYNQTGVELGYFTQQNKDANSNKFNESGY
KTTLFHTFKVNTSMLTSRPEIRFYATYIKALENELDGFTFEDNKDDQFAVGAQAEIWW
>P39404 ~~~bglJ~~~Transcriptional activator protein BglJ~~~COG2197
MEHSRIKKRNVALIEKCVMSSIGIESLFRKFAGNPYKLHTYTSQESFQDAMSRISFAAVIFSFSAMRSERREGLSCLTEL
AIKFPRTRRLVIADDDIEARLIGSLSPSPLDGVLSKASTLEIFHQELFLSLNGVRQATDRLNNQWYINQSRTLSPTEREI
LRFMSRGYSMTQIAEQLKRNIKTIRAHKFNVMSKLGVSSDAGLLEAADILLCMRHCETSNVLHPY
>Q93LQ8 2.7.1.85~~~bglK~~~Beta-glucoside kinase~~~
MKIAAFDIGGTALKMGVMARDGRLLETARQSINDSDGDRILQAMLSWLAAHPSCEGIAISAPGYIDPHSGLITMGGAIRR
FDNFAMKSWLETRTGLPVSVENDANCVLLAERWQGKAAEMANFLVLTIGTGIGGAIFCQHQLINGARFRAGEFGYMLTDR
PGGRDPRRYSMNENCTLRVLRHRYAQHIGAPLDSVTGELIFDRYDAGDPVCQRLVAEFFNGLGHGLYNLVHIFDPQTIFI
GGGVVERPGFLTLLRQHLAWFGIADYLDTVSHGNDAGLIGAVYHFNQLYRSPDDDRH
>Q926Y3 2.7.1.85~~~bglK~~~Beta-glucoside kinase~~~COG1940
MKIAAFDIGGTALKMGVVLPHGEIILTKSAEIIASDGDQILAEMKLFLAENTDVTGIAVSAPGYVNPKTGLITMGGAIRR
FDNFNLKEWLEAETGLPVAIENDANCALLAEKWLGKGQDLDDFLCLTIGTGIGGGIFSNGALVRGGRFRAGEFGYMFSER
PGAFRPGKYTLNETTTMLVLRRQYAQLTGRPLKEITGEEIFANYDAHDPISERLINEFYTGICTGLYNLIYLFDPTHIFI
GGGITSRPTFITELKHHMASFGLRDTIIETATHKNQAGLLGAVYHFLQEENRHE
>Q8Y3R9 2.7.1.85~~~bglK~~~Beta-glucoside kinase~~~COG1940
MKIAAFDIGGTALKMGVVLPHGEIILTKSAEISGSDGDQILAEMKVFLAENTDVTGIAVSAPGYVNPKTGLITMGGAIRR
FDNFNLKEWLEAETGLPVAIENDANCALLAEKWLGKGQDLDDFLCLTIGTGIGGGIFSNGELVRGGRFRAGEFGYMFSER
PGAFRPGKYTLNETTTMLVLRRQYAELTGRPLEEITGEEIFANYDAHDAVSERLITEFYTGICTGLYNLIYLFDPTHIFI
GGGITSRPTFIAELKHHMESFGLRDTIIETATHKNQAGLLGAVYHFLQEENRHE
>Q5LIC7 3.2.1.31~~~lacZ_4~~~Beta-glucuronidase~~~COG3250
MKQQKCNYFPSLWWRGREKGLSTFLFLLLFSISLHAQRQDILLNNNWNFRFSHQVQGDTRRVDLPHTWNAQDALAGKIDY
KRGIGNYEKALYIRPEWKGKRLFLRFDGVNSIADVFINRKHIGEHRGGYGAFIFEITDLVKYGEKNSVLVRANNGEQLDI
MPLVGDFNFYGGIYRDVHLLITDETCISPLDYASPGVYLVQEVVSPQEAKVCAKVNLSNRAADGTAELQVLVTDGTKVIC
KESRNVSLKQGADILEQLPLLIQKPRLWNGCEDPFMYQVSISLHKDGKQIDSVTQPLGLRYYHTDPDKGFFLNGKHLPLH
GVCRHQDRAEVGNALRPQHHEEDVALMREMGVNAIRLAHYPQATYMYDLMDKHGIVTWAEIPFVGPGGYADKGFVDQASF
RENGKQQLIELIRQHYNHPSICFWGLFNELKEVGDNPVEYVKELNALAKQEDPTRPTTSASNQDGNLNFITENIAWNRYD
GWYGSTPKTLATFLDRTHKKHPELRIGISEYGAGASIYHQQDSLKQPSASGWWHPENWQTYYHMENWKIIAERPFVWGTF
VWNMFDFGAAHRTEGDRPGINDKGLVTFDRKVRKDAFYFYKANWNKQEPMIYLAEKRCRLRYQPEQTFMAFTTAPEAELF
VNGVSCGKQKADTYSTVVWKNVKLTSGENIIRVTTPGKKPLTDEVTVEYKEDRPL
>Q8XP19 3.2.1.31~~~bglR~~~Beta-glucuronidase~~~
MLYPIITESRQLIDLSGIWKFKLNEGNGLTEELSKAPLEDTIEMAVPSSYNDLVESQEVRDHVGWVWYERNFTIPKTLLN
ERIVLRFGSATHEAKVYLNGELLVEHKGGFTPFEAEINDLLVSGDNRLTVAVNNIIDETTLPVGLVKEVEVDGKKVIKNS
VNFDFFNYAGIHRPVKIYTTPKSYIEDITIVTDFKENNGYVNYEVQAVGKCNIKVTIIDEENNIVAEGEGKEGKLTINNV
HLWEPMNAYLYKLKVELLDDEEIIDTYFEEFGVRTVEVKDGKFLINNKPFYFKGFGKHEDSYVNGRGINEAINIKDFNLM
KWIGANSFRTSHYPYSEEIMRLADREGIVVIDETPAVGLHLNFMATGFGGDAPKRDTWKEIGTKEAHERILRELVSRDKN
HPCVVMWSVANEPDSDSEGAKEYFEPLIKLTKELDPQKRPVTVVTYLMSTPDRCKVGDIVDVLCLNRYYGWYVAGGDLEE
AKRMLEDELKGWEERCPKTPIMFTEYGADTVAGLHDTVPVMFTEEYQVEYYKANHEVMDKCKNFVGEQVWNFADFATSQG
IIRVQGNKKGIFTRERKPKMIAHSLRERWTNIPEFGYKK
>P05804 3.2.1.31~~~uidA~~~Beta-glucuronidase~~~COG3250
MLRPVETPTREIKKLDGLWAFSLDRENCGIDQRWWESALQESRAIAVPGSFNDQFADADIRNYAGNVWYQREVFIPKGWA
GQRIVLRFDAVTHYGKVWVNNQEVMEHQGGYTPFEADVTPYVIAGKSVRITVCVNNELNWQTIPPGMVITDENGKKKQSY
FHDFFNYAGIHRSVMLYTTPNTWVDDITVVTHVAQDCNHASVDWQVVANGDVSVELRDADQQVVATGQGTSGTLQVVNPH
LWQPGEGYLYELCVTAKSQTECDIYPLRVGIRSVAVKGEQFLINHKPFYFTGFGRHEDADLRGKGFDNVLMVHDHALMDW
IGANSYRTSHYPYAEEMLDWADEHGIVVIDETAAVGFNLSLGIGFEAGNKPKELYSEEAVNGETQQAHLQAIKELIARDK
NHPSVVMWSIANEPDTRPQGAREYFAPLAEATRKLDPTRPITCVNVMFCDAHTDTISDLFDVLCLNRYYGWYVQSGDLET
AEKVLEKELLAWQEKLHQPIIITEYGVDTLAGLHSMYTDMWSEEYQCAWLDMYHRVFDRVSAVVGEQVWNFADFATSQGI
LRVGGNKKGIFTRDRKPKSAAFLLQKRWTGMNFGEKPQQGGKQ
>Q8E0N2 3.2.1.31~~~~~~Beta-glucuronidase~~~
MLYPLLTKTRNTYDLGGIWNFKLGEHNPNELLPSDEVMVIPTSFNDLMVSKEKRDYIGDFWYEKVIEVPKVSEDEEMVLR
FGSVTHQAKIYVDGVLVGEHKGGFTPFEVLVPECKYNNEKIKVSICANNVLDYTTLPVGNYSEIIQEDGSIKKKVRENFD
FFNYAGVHRPLKLMIRPKNHIFDITITSRLSDDLQSADLHFLVETNQKVDEVRISVFDEDNKLVGETKDSRLFLSDVHLW
EVLNAYLYTARVEIFVDNQLQDVYEENFGLREIEVTNGQFLLNRKPIYFKGFGKHEDTFINGRGLNEAANLMDLNLLKDM
GANSFRTSHYPYSEEMMRLADRMGVLVIDEVPAVGLFQNFNASLDLSPKDNGTWNLMQTKAAHEQAIQELVKRDKNHPSV
VMWVVANEPASHEAGAHDYFEPLVKLYKDLDPQKRPVTLVNILMATPDRDQVMDLVDVVCLNRYYGWYVDHGDLTNAEVG
IRKELLEWQDKFPDKPIIITEYGADTLPGLHSTWNIPYTEEFQCDFYEMSHRVFDGIPNLVGEQVWNFADFETNLMILRV
QGNHKGLFSRNRQPKQVVKEFKKRWMTIPHYHNKKNSVK
>Q9X108 3.2.1.86~~~bglT~~~6-phospho-beta-glucosidase BglT~~~COG1486
MRIAVIGGGSSYTPELVKGLLDISEDVRIDEVIFYDIDEEKQKIVVDFVKRLVKDRFKVLISDTFEGAVVDAKYVIFQFR
PGGLKGRENDEGIPLKYGLIGQETTGVGGFSAALRAFPIVEEYVDTVRKTSNATIVNFTNPSGHITEFVRNYLEYEKFIG
LCNVPINFIREIAEMFSARLEDVFLKYYGLNHLSFIEKVFVKGEDVTEKVFENLKLKLSNIPDEDFPTWFYDSVRLIVNP
YLRYYLMEKKMFKKISTHELRAREVMKIEKELFEKYRTAVEIPEELTKRGGSMYSTAAAHLIRDLETDEGKIHIVNTRNN
GSIENLPDDYVLEIPCYVRSGRVHTLSQGKGDHFALSFIHAVKMYERLTIEAYLKRSKKLALKALLSHPLGPDVEDAKDL
LEEILEANREYVKLG
>A1B8Z3 2.6.1.35~~~bhcA~~~L-aspartate--glyoxylate aminotransferase~~~COG0075
MTSQNPIFIPGPTNIPEEMRKAVDMPTIDHRSPVFGRMLHPALEGVKKVLKTTQAQVFLFPSTGTGGWETAITNTLSPGD
KVLAARNGMFSHRWIDMCQRHGLDVTFVETPWGEGVPADRFEEILTADKGHEIRVVLATHNETATGVKSDIAAVRRALDA
AKHPALLFVDGVSSIGSMDFRMDEWGVDIAVTGSQKGFMLPPGLAIVGFSPKAMEAVETARLPRTFFDIRDMATGYARNG
YPYTPPVGLINGLNASCERILAEGLENVFARHHRIASGVRAAVDAWGLKLCAVRPELYSDSVSAIRVPEGFDANLIVSHA
LETYDMAFGTGLGQVAGKVFRIGHLGSLTDAMALSGIATAEMVMADLGLPIQLGSGVAAAQEHYRQTTAAAQKKAA
>A1B8Z2 4.2.1.-~~~bhcB~~~beta-hydroxyaspartate dehydratase~~~COG1171
MYIPTYEDMLAAHERIKPHIRRTPIRTSDYLNELTGAQLFFKCENFQEPGAFKVRGATNAVFGLDDAQAAKGVATHSSGN
HASCLSYAAMLRGIPCNVVMPRTAPQAKKDTVRRYGGVITECEPSTSSREETFAKVQAETGGDFVHPYNDPRVIAGQGTC
AKELVEQVDGLDAVVAPIGGGGMISGTCLTLSTLAPETRVIAAEPEQADDAYRSFKAGYIIADDAPKTVADGLLVPLKDL
TWHFVKNHVSEIYTASDAEIVDAMKLIWKHLRIVMEPSSAVPLATILKNPEAFAGKRVGVIVTGGNVDLDKLPWN
>Q8GRC8 4.1.3.41~~~dhaa~~~3-hydroxy-D-aspartate aldolase~~~
MNAKTDFSGYEVGYDIPALPGMDESEIQTPCLILDLDALERNIRKMGDYAKAHGMRHRSHGKMHKSVDVQKLQESLGGSV
GVCCQKVSEAEAFARGGIKDVLVTNEVREPAKIDRLARLPKTGATVTVCVDDVQNIADLSAAAQKHGTELGIFVEIDCGA
GRCGVTTKEAVVEIAKAAAAAPNLTFKGIQAYQGRDAAHGQLRGPQGQAGRRHCPGERGRGRAGGRGLAPEFVSGGGTGS
YYFESNSGIYNELQCGSYAFMDADYGRIHDAEGKRIDQGEWENALFILTSVMSHAKPHLAVVDAGLKAQSVDSGLPFVYG
RDDVKYIKCSDEHGVVEDKDGVLKVNDKLRLVPGHCDPTCNVHDWYVGVRNGKVETVWPVSARGKGY
>A1B8Z1 4.1.3.41~~~bhcC~~~3-hydroxy-D-aspartate aldolase~~~COG3616
MNAKTDFSGYEVGYDIPALPGMDESEIQTPCLILDLDALERNIRKMGDYAKAHGMRHRSHGKMHKSVDVQKLQESLGGSV
GVCCQKVSEAEAFARGGIKDVLVTNEVREPAKIDRLARLPKTGATVTVCVDDVQNIADLSAAAQKHGTELGIFVEIDCGA
GRCGVTTKEAVVEIAKAAAAAPNLTFKGIQAYQGAMQHMDSFEDRKAKLDAAIAQVKEAVDALEAEGLAPEFVSGGGTGS
YYFESNSGIYNELQCGSYAFMDADYGRIHDAEGKRIDQGEWENALFILTSVMSHAKPHLAVVDAGLKAQSVDSGLPFVYG
RDDVKYIKCSDEHGVVEDKDGVLKVNDKLRLVPGHCDPTCNVHDWYVGVRNGKVETVWPVSARGKGY
>A1B8Z0 1.4.1.-~~~bhcD~~~Iminosuccinate reductase~~~COG2423
MLVVAEKEIAGLMTPEAAFEAIEAVFASMARRKAYNFPVVREAIGHEDALYGFKGGFDASALVLGLKAGGYWPNNQKHNL
INHQSTVFLFDPDTGRVSAAVGGNLLTALRTAAASAVSIKYLAPKGAKVLGMIGAGHQSAFQMRAAANVHRFEKVIGWNP
HPEMLSRLADTAAELGLPFEAVELDRLGAEADVIVSITSSFSPLLMNEHVKGPTHIAAMGTDTKGKQELDPALVARARIF
TDEVAQSVSIGECQHAIAAGLIREDQVGELGAVVAGDDPGRGDAEVTIFDGTGVGLQDLAVAQAVVELAKHKGVAQEVEI
>A1B8Z4 ~~~bhcR~~~HTH-type transcriptional regulator BhcR~~~COG1414
MSVQIRKRGRPRGRAGGLGAEDSGGIRALDRALDILDLIAVSSGLTLTEIAQRLDMAPSTVHRVLVTLAARGVAESDSQT
QAWHVGPTAFRHGSAFMRRSGLVERARPLLRRLMEVTGETANLGILNGDAVLFLSQAETHETIRAFFPPGTRSALHASGI
GKALLAHARPLDLKRMLREMRLERFTDMTLTDPAALVEDLVQIRARGYALDNEERTPGMRCIAAPIFDLAGEAAAGISVS
GPTLRMSDARLSAMSDAVIEAARELSFGMAPRKDAGERA
>P0AB40 ~~~bhsA~~~Multiple stress resistance protein BhsA~~~
MKNVKTLIAAAILSSMSFASFAAVEVQSTPEGQQKVGTISANAGTNLGSLEEQLAQKADEMGAKSFRITSVTGPNTLHGT
AVIYK
>Q2YJB2 ~~~bhuA~~~Heme transporter BhuA~~~
MKFTRTLVLVSTSLLATVATSQAQEVKRDTKKQGEVVLKPITIISHGKDNIEATGGTVLTYKDIEKLQPANVSELFSRQS
SIAVSGGGGPSKRIHVLGMEQSNLAVSVDGVPQTATSWHHTGSNVIDPAFLKRVEVEAGAAAADSGFGAAAGAIRYETVN
ALDLLEPGKTFGARIIGSYGTNGRGFSGSTAAYGLKDGFDWLLMLHGTSGHNYKNGDGTEILGTEPAARNILGKAGYEFD
GNRIDIGYERSRDKADRLIKMNMGLPGDTEYPLEVARDSVNIKYTRTDATDMWDPEVQFYYNRNDYWRNDYQNRTNGNMI
LKEDLYGGKLQNTFTIDYGKITAGIDFGKHDYNTDNYGHNDRRYRKFNTQQVGAFTQGRFEFDNGFSLSTGARYDYSRFA
DWNDEVFSDSGASVNGTLSYKFNEHIEVFAGASRTWLGYVLGDYGYVHARNNAFYTDPTFSPGRARNYKAGVNFGGADWS
AGITLFDTRIAGLPNYDSQKLGNDPEEYRSRGFTLNARYIWNYTTIGATFTKAKVTAGDDPVLPNSGSFMPIGDMATLFI
DQEIPDYNMKVGATLAWAGRISDEAATAANFYDQPAYTVVNAYAEWNPPAVKNMTLRVGVENLFNENYYERTSFAPSQNR
GGIDAVWAPGRTFTFQTAFKF
>Q14SY0 ~~~bicA~~~Bicarbonate transporter BicA~~~COG0659
MQITNKIHFRNIRGDIFGGLTAAVIALPMALAFGVASGAGAEAGLWGAVLVGFFAALFGGTPTLISEPTGPMTVVMTAVI
AHFTASAATPEEGLAIAFTVVMMAGVFQIIFGSLKLGKYVTMMPYTVISGFMSGIGIILVILQLAPFLGQASPGGGVIGT
LQNLPTLLSNIQPGETALALGTVAIIWFMPEKFKKVIPPQLVALVLGTVIAFFVFPPEVSDLRRIGEIRAGFPELVRPSF
SPVEFQRMILDAAVLGMLGCIDALLTSVVADSLTRTEHNSNKELIGQGLGNLFSGLFGGIAGAGATMGTVVNIQSGGRTA
LSGLVRAFVLLVVILGAASLTATIPLAVLAGIAFKVGVDIIDWSFLKRAHEISPKGALIMYGVILLTVLVDLIVAVGVGV
FVANVLTIERMSNLQSEKVQTVSDADDNIRLTTTEKRWLDEGQGRVLLFQLSGPMIFGVAKAIAREHNAMGDCDALVFDI
GEVPHMGVTASLALENAIEEALDKERQVYIVGAAGQTRRRLEKLKLFKRVPPDKCLMSREEALKNAVLGIYPHLADGVTA
PSSEMG
>Q55415 ~~~bicA~~~Bicarbonate transporter BicA~~~COG0659
MQITNKIHFRNLQGDLFGGVTAAVIALPMALAFGIASGAGATAGLWGAVIVGFFAALFGGTPTLISEPTGPMTVVQTAVI
ASLVAADPDNGLAMAFTVVMMAGLFQIAFGLLKLGKYVTMMPYTVISGFMSGIGIILVILQLAPFLGQASPKGGVIGTLQ
ALPNLVSNVRPVETLLALMTVGIIWFMPSRWKKFAPPQLVALVLGTIISITLFGDLDIRRIGEIQAGLPALQLPVFQADQ
LQRMLIDAAVLGMLGCIDALLTSVVADSLTRTEHNSNKELVGQGIGNVMSGLFGGLGGAGATMGTVVNIQSGGRTALSGL
IRAMVLLVVILGAAKLAATIPLAVLAGIAFKVGVDIIDWGFLKRAHHVSIKGALIMYAVIVLTVLVDLIAAVGIGVFIAN
ILTIDRMSALQSKAVKSISDADDEILLSANEKRWLDEGNGRVLLFQLSGPMIFGVAKAIAREHNAIQECAAIVFDLSDVP
HLGVTASLALENAIEEAAEKGRAVYIVGATGQTKRRLEKLQVFRFVPESNCYDDRSEALKDAVLALGPHESEDSPSSSSV
QTTY
>Q8UAA8 ~~~bigR~~~Biofilm growth-associated repressor~~~COG0640
MVTETPLEKPLDIGEIPLPAMEKRATEVAILLKTLAHPARLMLACTLAQGEFSVGELEAKLDIRQPTLSQQLGVLREAGI
VDTRREAKQIFYRLAEDKAARLIEALYAIFCAPEENL
>Q9PFB1 ~~~bigR~~~Biofilm growth-associated repressor~~~COG0640
MVNEMRDDTRPHMTREDMEKRANEVANLLKTLSHPVRLMLVCTLVEGEFSVGELEQQIGIGQPTLSQQLGVLRESGIVET
RRNIKQIFYRLTEAKAAQLVNALYTIFCAQEKQA
>Q5SHZ8 2.3.1.29~~~~~~8-amino-7-oxononanoate synthase/2-amino-3-ketobutyrate coenzyme A ligase~~~COG0156
MSLDLRARVREELERLKREGLYISPKVLEAPQEPVTRVEGREVVNLASNNYLGFANHPYLKEKARQYLEKWGAGSGAVRT
IAGTFTYHVELEEALARFKGTESALVLQSGFTANQGVLGALLKEGDVVFSDELNHASIIDGLRLTKATRLVFRHADVAHL
EELLKAHDTDGLKLIVTDGVFSMDGDIAPLDKIVPLAKKYKAVVYVDDAHGSGVLGEKGKGTVHHFGFHQDPDVVQVATL
SKAWAGIGGYAAGARELKDLLINKARPFLFSTSHPPAVVGALLGALELIEKEPERVERLWENTRYFKRELARLGYDTLGS
QTPITPVLFGEAPLAFEASRLLLEEGVFAVGIGFPTVPRGKARIRNIVTAAHTKEMLDKALEAYEKVGKRLGIIR
>Q2T6X7 ~~~bimA~~~Autotransporter BimA~~~
MKYRRLSLAHARQDSGQAASNARSRRFARLLCSSIAPLALGFSADAFAADETMASPFNRGAPNDAHGNLLDEIRRGVPLR
HVPASERNTRGAGGSTLADAMRRVIDSRRTAFDSPPATPASPSPSWSDDESPPPTPIATRPASRPESAARSPRHSSPPHS
PPASAESPSPRSPDASPSRTPSPTFSFPSPSRTSTPRTQPPSPLRERPERSPAASPRVASPRSAHSRGSTQPPSNLSTPR
YEPPTPLQEDPERTPVASPRVASPRSAHSRGSTQPPSNLSTPRYEPPTPLQEDPERTPVASPHVTPAEHAQRRPFLLQKP
PQVPSWRKKAPSATLPDSHAPARPGGGQFTTPASGAAKYVAVNSGASDAFAAGVNAVAIGADARAQGQESLATGWRAQAD
GHRAVATGARAIASGRDAVALGAGSIADRDNTVSVGQRGSERQIVHVAPGAQGTDAVNVDQLNLAISNSNAYTNQRIGDL
QQSITETARDAYSGVAAATALTMIPDVDRDKMLSIGVGGAVYKGHRAVALGGTARIGENLKVRAGVAMSAGGNTVGVGMS
WQW
>Q2T6X6 ~~~bimC~~~Inactive autotransporter heptosyltransferase BimC~~~
MPKVTFSGSAPTLGVHAPPALDPRQPASPPPAASNGTHARGFSPPADMPTWSGTEGVRFDFNDGCRVLLPDGDWTVRLRD
MHTDTPLFDAQIGAGVVTSTRKHFVPFLIEIDAGGRRVFKHLFDAHGKPVLIQFEAQRLGEALGWFGYAVKFQRQHRCKL
TCSMPAPLIALLRPGYPDIEFVTPELVKPECYYATYRLGRFAGDEAHAYQPSAPQLVGAHRSAAYMLGVDPREAPPRIEL
TDDSRPLAGPYVCIAAQSALRCARWERPGGWRELQRFLTAAGYRIVCVDSPSPDVADESSALADVAYSLAPDTPWTERAR
WLRHAACLIGVPGDLSWLAWAVGAPVVLISGFTHPVSEFDTPYRVINSHACNSCWNDASANFDDADASSCPRHAGTLRQF
ECARLVSVEQIKRTIRSIPGIAC
>P20384 ~~~bin3~~~Putative transposon Tn552 DNA-invertase bin3~~~
MIIGYARVSSLDQNLERQLENLKTFGAEKIFTEKQSGKSIENRPILQKALNFVRMGDRFIVESIDRLGRNYNEVIHTVNY
LKDKEVQLMITSLPMMNEVIGNPLLDKFMKDLIIQILAMVSEQERNESKRRQAQGIQVAKEKGVYKGRPLLYSPNAKDPQ
KRVIYHRVVEMLEEGQAISKIAKEVNITRQTVYRIKHDNGLS
>P06575 ~~~binA~~~Binary larvicide subunit BinA~~~
MRNLDFIDSFIPTEGKYIRVMDFYNSEYPFCIHAPSAPNGDIMTEICSRENNQYFIFFPTDDGRVIIANRHNGSVFTGEA
TSVVSDIYTGSPLQFFREVKRTMATYYLAIQNPESATDVRALEPHSHELPSRLYYTNNIENNSNILISNKEQIYLTLPSL
PENEQYPKTPVLSGIDDIGPNQSEKSIIGSTLIPCIMVSDFISLGERMKTTPYYYVKHTQYWQSMWSALFPPGSKETKTE
KSGITDTSQISMTDGINVSIGADFGLRFGNKTFGIKGGFTYDTKTQITNTSQLLIETTYTREYTNTENFPVRYTGYVLAS
EFTLHRSDGTQVNTIPWVALNDNYTTIARYPHFASEPLLGNTKIITDDQN
>P05516 ~~~binA~~~Binary larvicide subunit BinA~~~
MRNLDFIDSFIPTEGKYIRVMDFYNSEYPFCIHAPSAPNGDIMTEICSRENNQYFIFFPTDDGRVIIANRHNGSVFTGEA
TSVVSDIYTGSPLQFFREFKRTMSTYYLAIQNPESATDVRALEPNSHELPSRLYFTNNIENNSNILISNKEQIYLTLPSL
PENEQYPKTPVLSGIDDIGPNQSEKSIIGSTLIPCIMVSDFISLGERMKTTPYYYVKHTQYWQSMWSALFPPGSKETKTE
KSGITDTSQISMTDGINVSIGADFGLKFGNKTFGIKGGFTYDTKTQITNTSQLLIETTYTREYTNTENFPVRYTGYVLAS
EFTLHRSDGTQVNTIPWVALNDNYTTIARYPHFASEPLLGNTKIITDDQN
>P10565 ~~~binB~~~Binary larvicide subunit BinB~~~
MCDSKDNSGVSEKCGKKFTNYPLNTTPTSLNYNLPEISKKFYNLKNKYSRNGYGLSKTEFPSSIENCPSNEYSIMYDNKD
PRFLIRFLLDDGRYIIADRDDGEVFDEAPTYLDNNNHPIISRHYTGEERQKFEQVGSGDYITGEQFFQFYTQNKTRVLSN
CRALDSRTILLSTAKIFPIYPPASETQLTAFVNSSFYAAAIPQLPQTSLLENIPEPTSLDDSGVLPKDAVRAVKGSALLP
CIIVHDPNLNNSDKMKFNTYYLLEYKEYWHQLWSQIIPAHQTVKIQERTGISEVVQNSMIEDLNMYIGADFGMLFYFRSS
GFKEQITRGLNRPLSQTTTQLGERVEEMEYYNSNDLDVRYVKYALAREFTLKRVNGEIVKNWVAVDYRLAGIQSYPNAPI
TNPLTLTKHTIIRCENSYDGHIFKTPLIFKNGEVIVKTNEELIPKINQ
>P18568 ~~~binB~~~Binary larvicide subunit BinB~~~
MCDSKDNSGVSEKCGKKFTNYPLNTTPTSLNYNLPEISKKFYNLKNKYSRNGYGLSKTEFPSSIENCPSNEYSIMYDNKD
PRFLIRFLLDDGRYIIADRDDGEVFDEAPTYLDNNNHPIISRHYTGEERQKFEQVGSGDYITGEQFFQFYTQNKTRVLSN
CRALDSRTILLSTAKIFPIYPPASETQLTAFVNSSFYAAAIPQLPQTSLLENIPEPTSLDDSGVLPKDAVRAVKGSALLP
CIIVHDPNLNNSDKMKFNTYYLLEYKEYWHQLWSQIIPAHQTVKIQERTGISEVVQNSMIEDLNMYIGADFGMYFYLRSS
GFKEQITRGLNRPLSQTPTQLGERVEEMEYYNSNDLDVRYVKHALAREFTLKRVNGEIVKNWVAVDYRMAGIQSYPNAPI
TNPLTLTKHTIIRCENSYDGHIFKTPLIFKNGEVIVKTNEELIPKINQ
>P12995 2.6.1.62~~~bioA~~~Adenosylmethionine-8-amino-7-oxononanoate aminotransferase~~~COG0161
MTTDDLAFDQRHIWHPYTSMTSPLPVYPVVSAEGCELILSDGRRLVDGMSSWWAAIHGYNHPQLNAAMKSQIDAMSHVMF
GGITHAPAIELCRKLVAMTPQPLECVFLADSGSVAVEVAMKMALQYWQAKGEARQRFLTFRNGYHGDTFGAMSVCDPDNS
MHSLWKGYLPENLFAPAPQSRMDGEWDERDMVGFARLMAAHRHEIAAVIIEPIVQGAGGMRMYHPEWLKRIRKICDREGI
LLIADEIATGFGRTGKLFACEHAEIAPDILCLGKALTGGTMTLSATLTTREVAETISNGEAGCFMHGPTFMGNPLACAAA
NASLAILESGDWQQQVADIEVQLREQLAPARDAEMVADVRVLGAIGVVETTHPVNMAALQKFFVEQGVWIRPFGKLIYLM
PPYIILPQQLQRLTAAVNRAVQDETFFCQ
>P22805 2.6.1.62~~~bioA~~~Adenosylmethionine-8-amino-7-oxononanoate aminotransferase~~~
MKQVLTELQEKDLQHVWHPCSQMKDYEAFPPIVIKKGEGVWLYDEQNQRYLDAVSSWWVNLFGHANPRISQALSEQAFTL
EHTIFANFSHEPAIKLAQKLVALTPQSLQKVFFADNGSSAIEVALKMSFQYHMQTGKTQKKRFLALTDAYHGETLGALSV
GGVDLYNEVYQPLLLDTVRAQGPDCFRCPFKHHPDSCHAQCISFVEDQLRMHHKEITAVIIEPLIQAAAGMKMYPAIYLR
RLRELCTQYDVHLIADEIAVGFGRTGTLFACEQANISPDFMCLSKGLTGGYLPLSVVMTTNDVYQAFYDDYATMKAFLHS
HSYTGNTLACRVALEVLAIFEEEQYIDVVQDKGERMRKLALEAFSDLPFVGEYRQVGFVGAIELVANRDTKEPLPSEERI
GYQIYKRALAKGLLIRPLGNVLYFMPPYIITDDEMQFMIQTTKDTIVQFFEEREG
>P0A4X7 2.6.1.62~~~bioA~~~Adenosylmethionine-8-amino-7-oxononanoate aminotransferase~~~
MAAATGGLTPEQIIAVDGAHLWHPYSSIGREAVSPVVAVAAHGAWLTLIRDGQPIEVLDAMSSWWTAIHGHGHPALDQAL
TTQLRVMNHVMFGGLTHEPAARLAKLLVDITPAGLDTVFFSDSGSVSVEVAAKMALQYWRGRGLPGKRRLMTWRGGYHGD
TFLAMSICDPHGGMHSLWTDVLAAQVFAPQVPRDYDPAYSAAFEAQLAQHAGELAAVVVEPVVQGAGGMRFHDPRYLHDL
RDICRRYEVLLIFDEIATGFGRTGALFAADHAGVSPDIMCVGKALTGGYLSLAATLCTADVAHTISAGAAGALMHGPTFM
ANPLACAVSVASVELLLGQDWRTRITELAAGLTAGLDTARALPAVTDVRVCGAIGVIECDRPVDLAVATPAALDRGVWLR
PFRNLVYAMPPYICTPAEITQITSAMVEVARLVGSLP
>P9WQ81 2.6.1.62~~~bioA~~~Adenosylmethionine-8-amino-7-oxononanoate aminotransferase~~~COG0161
MAAATGGLTPEQIIAVDGAHLWHPYSSIGREAVSPVVAVAAHGAWLTLIRDGQPIEVLDAMSSWWTAIHGHGHPALDQAL
TTQLRVMNHVMFGGLTHEPAARLAKLLVDITPAGLDTVFFSDSGSVSVEVAAKMALQYWRGRGLPGKRRLMTWRGGYHGD
TFLAMSICDPHGGMHSLWTDVLAAQVFAPQVPRDYDPAYSAAFEAQLAQHAGELAAVVVEPVVQGAGGMRFHDPRYLHDL
RDICRRYEVLLIFDEIATGFGRTGALFAADHAGVSPDIMCVGKALTGGYLSLAATLCTADVAHTISAGAAGALMHGPTFM
ANPLACAVSVASVELLLGQDWRTRITELAAGLTAGLDTARALPAVTDVRVCGAIGVIECDRPVDLAVATPAALDRGVWLR
PFRNLVYAMPPYICTPAEITQITSAMVEVARLVGSLP
>P12996 2.8.1.6~~~bioB~~~Biotin synthase~~~COG0502
MAHRPRWTLSQVTELFEKPLLDLLFEAQQVHRQHFDPRQVQVSTLLSIKTGACPEDCKYCPQSSRYKTGLEAERLMEVEQ
VLESARKAKAAGSTRFCMGAAWKNPHERDMPYLEQMVQGVKAMGLEACMTLGTLSESQAQRLANAGLDYYNHNLDTSPEF
YGNIITTRTYQERLDTLEKVRDAGIKVCSGGIVGLGETVKDRAGLLLQLANLPTPPESVPINMLVKVKGTPLADNDDVDA
FDFIRTIAVARIMMPTSYVRLSAGREQMNEQTQAMCFMAGANSIFYGCKLLTTPNPEEDKDLQLFRKLGLNPQQTAVLAG
DNEQQQRLEQALMTPDTDEYYNAAAL
>P19206 2.8.1.6~~~bioB~~~Biotin synthase~~~
MNWLQLADEVIAGKVISDDEALAILNSDDDDILKLMDGAFAIRKHYYGKKVKLNMIMNAKSGYCPEDCGYCSQSSKSTAP
IEKYPFITKEEILAGAKRAFENKIGTYCIVASGRGPTRKDVNVVSEAVEEIKAKYGLKVCACLGLLKEEQAQQLKEAGVD
RYNHNLNTSERHHSYITTTHTYEDRVNTVEVVKKHGISPCSGAIIGMKETKMDVVEIARALHQLDADSIPVNFLHAIDGT
KLEGTQDLNPRYCLKVLALFRYMNPSKEIRISGGREVNLGFLQPFGLYAANSIFVGDYLTTEGQEANSDYRMLEDLGFEI
ELTQKQEEAFCS
>P9WPQ7 2.8.1.6~~~bioB~~~Biotin synthase~~~COG0502
MTQAATRPTNDAGQDGGNNSDILVVARQQVLQRGEGLNQDQVLAVLQLPDDRLEELLALAHEVRMRWCGPEVEVEGIISL
KTGGCPEDCHFCSQSGLFASPVRSAWLDIPSLVEAAKQTAKSGATEFCIVAAVRGPDERLMAQVAAGIEAIRNEVEINIA
CSLGMLTAEQVDQLAARGVHRYNHNLETARSFFANVVTTHTWEERWQTLSMVRDAGMEVCCGGILGMGETLQQRAEFAAE
LAELGPDEVPLNFLNPRPGTPFADLEVMPVGDALKAVAAFRLALPRTMLRFAGGREITLGDLGAKRGILGGINAVIVGNY
LTTLGRPAEADLELLDELQMPLKALNASL
>Q7A3R9 2.8.1.6~~~bioB~~~Biotin synthase~~~
MNLAKRILQGEQLTKETVLKIYEDTNIDTLDLLNEAYILRKHYFGKKVKLNMILNAKSGICPENCGYCGQSRDIKQKQRY
ALIPEEQIIDGAKVAHDNHIGTYCIVMSGRGPSDKEVDHISNTVRTIKSQHPQLKICACLGLTNDEQAKKLKSAGVDRYN
HNINTSENYHDNVVTTHSYKDRTDTIELMKANNISPCSGVICGMGESNQDIVDMAFALKEMDADSIPINFLHPIKGTKFG
SMDDLTPMKCLRIVALFRLINPTKEIRIAGGREVNLRSLQPLALKAANSIFVGDYLITGGQPNQLDYDMINDLGFEIDYD
TCENKENKNDVSRAN
>P12999 2.1.1.197~~~bioC~~~Malonyl-[acyl-carrier protein] O-methyltransferase~~~COG2226
MATVNKQAIAAAFGRAAAHYEQHADLQRQSADALLAMLPQRKYTHVLDAGCGPGWMSRHWRERHAQVTALDLSPPMLVQA
RQKDAADHYLAGDIESLPLATATFDLAWSNLAVQWCGNLSTALRELYRVVRPKGVVAFTTLVQGSLPELHQAWQAVDERP
HANRFLPPDEIEQSLNGVHYQHHIQPITLWFDDALSAMRSLKGIGATHLHEGRDPRILTRSQLQRLQLAWPQQQGRYPLT
YHLFLGVIARE
>P36571 2.1.1.197~~~bioC~~~Malonyl-[acyl-carrier protein] O-methyltransferase~~~
MTSANDTVNKQAVASAFSRAAGSYDAAAALQRDVGERLLGMGSSHPGEQLLDAGCGTGYFSRMWRERGKRVTALDLAPGM
LDVARQRQAAHHYLLGDIEQVPLPDAAMDICFSSLVVQWCSDLPAALAELYRVTRPGGVILFSTLAAGSLQELGDAWQQV
DGERHVNAFLPLTQIRTACAAYRHELVTELRTLNYPDVMTLMRSLKGIGATHLHQGREGGLMSRGRLAALQAAYPCRQGQ
FPLSYHLAYGVIYRE
>P13000 6.3.3.3~~~bioD1~~~ATP-dependent dethiobiotin synthetase BioD 1~~~COG0132
MSKRYFVTGTDTEVGKTVASCALLQAAKAAGYRTAGYKPVASGSEKTPEGLRNSDALALQRNSSLQLDYATVNPYTFAEP
TSPHIISAQEGRPIESLVMSAGLRALEQQADWVLVEGAGGWFTPLSDTFTFADWVTQEQLPVILVVGVKLGCINHAMLTA
QVIQHAGLTLAGWVANDVTPPGKRHAEYMTTLTRMIPAPLLGEIPWLAENPENAATGKYINLALL
>Q5NGB5 6.3.3.3~~~bioD~~~ATP-dependent dethiobiotin synthetase BioD~~~COG0132
MKKFFIIGTDTEVGKTYISTKLIEVCEHQNIKSLCLKPVASGQSQFSELCEDVESILNAYKHKFTAAEINLISFNQAVAP
HIIAAKTKVDISIENLKQFIEDKYNQDLDILFIEGAGGLLTPYSDHTTQLDLIKALQIPVLLVSAIKVGCINHTLLTINE
LNRHNIKLAGWIANCNDSNIKYIDEQINTIEELSGYKCSAKISRNADYLDFIDLSKILISPEENE
>O24872 6.3.3.3~~~bioD~~~ATP-dependent dethiobiotin synthetase BioD~~~COG0132
MLFISATNTNAGKTTCARLLAQYCNACGVKTILLKPIETGVNDAINHSSDAHLFLQDNRLLDRSLTLKDISFYRYHKVSA
PLIAQQEEDPNAPIDTDNLTQRLHNFTKTYDLVIVEGAGGLCVPITLEENMLDFALKLKAKMLLISHDNLGLINDCLLND
FLLKSHQLDYKIAINLKGNNTAFHSISLPYIELFNTRSNNPIVIFQQSLKVLMSFALK
>P9WPQ5 6.3.3.3~~~bioD~~~Dethiobiotin synthetase BioD~~~COG0132
MTILVVTGTGTGVGKTVVCAALASAARQAGIDVAVCKPVQTGTARGDDDLAEVGRLAGVTQLAGLARYPQPMAPAAAAEH
AGMALPARDQIVRLIADLDRPGRLTLVEGAGGLLVELAEPGVTLRDVAVDVAAAALVVVTADLGTLNHTKLTLEALAAQQ
VSCAGLVIGSWPDPPGLVAASNRSALARIAMVRAALPAGAASLDAGDFAAMSAAAFDRNWVAGLVG
>Q55849 6.3.3.3~~~bioD~~~ATP-dependent dethiobiotin synthetase BioD~~~COG0132
MSNFSRAVDQKTLLVAGCDTGVGKTVTTSALAAYWWKCGKDQSFGLMKLMQTGLGDDELYQQLFGHLTRWDVVTPLKFAT
PLAPPLAADQEGKTIDLGVVWQTLQTMQQNHDHVLVEALGSLGSPVTHELTVADIAALWRLETILVVPVQLGAMGQAIAQ
VALARQTKVKLKGLVLSCASPEAEGKVEDWATPAMLESFTHLPVLGIVPYLTESERENLSRLAEITARFGLEKLAYF
>P9WQ87 2.3.1.47~~~bioF1~~~8-amino-7-oxononanoate synthase 1~~~COG0156
MKAATQARIDDSPLAWLDAVQRQRHEAGLRRCLRPRPAVATELDLASNDYLGLSRHPAVIDGGVQALRIWGAGATGSRLV
TGDTKLHQQFEAELAEFVGAAAGLLFSSGYTANLGAVVGLSGPGSLLVSDARSHASLVDACRLSRARVVVTPHRDVDAVD
AALRSRDEQRAVVVTDSVFSADGSLAPVRELLEVCRRHGALLLVDEAHGLGVRGGGRGLLYELGLAGAPDVVMTTTLSKA
LGSQGGVVLGPTPVRAHLIDAARPFIFDTGLAPAAVGAARAALRVLQAEPWRPQAVLNHAGELARMCGVAAVPDSAMVSV
ILGEPESAVAAAAACLDAGVKVGCFRPPTVPAGTSRLRLTARASLNAGELELARRVLTDVLAVARR
>P53556 2.3.1.47~~~bioF~~~8-amino-7-oxononanoate synthase 2~~~COG0156
MKIDSWLNERLDRMKEAGVHRNLRSMDGAPVPERNIDGENQTVWSSNNYLGLASDRRLIDAAQTALQQFGTGSSGSRLTT
GNSVWHEKLEKKIASFKLTEAALLFSSGYLANVGVLSSLPEKEDVILSDQLNHASMIDGCRLSKADTVVYRHIDMNDLEN
KLNETQRYQRRFIVTDGVFSMDGTIAPLDQIISLAKRYHAFVVVDDAHATGVLGDSGQGTSEYFGVCPDIVIGTLSKAVG
AEGGFAAGSAVFIDFLLNHARTFIFQTAIPPASCAAAHEAFNIIEASREKRQLLFSYISMIRTSLKNMGYVVKGDHTPII
PVVIGDAHKTVLFAEKLQGKGIYAPAIRPPTVAPGESRIRITITSDHSMGDIDHLLQTFHSIGKELHII
>P9WQ85 2.3.1.47~~~bioF2~~~Putative 8-amino-7-oxononanoate synthase 2~~~COG0156
MPTGLGYDFLRPVEDSGINDLKHYYFMADLADGQPLGRANLYSVCFDLATTDRKLTPAWRTTIKRWFPGFMTFRFLECGL
LTMVSNPLALRSDTDLERVLPVLAGQMDQLAHDDGSDFLMIRDVDPEHYQRYLDILRPLGFRPALGFSRVDTTISWSSVE
EALGCLSHKRRLPLKTSLEFRERFGIEVEELDEYAEHAPVLARLWRNVKTEAKDYQREDLNPEFFAACSRHLHGRSRLWL
FRYQGTPIAFFLNVWGADENYILLEWGIDRDFEHYRKANLYRAALMLSLKDAISRDKRRMEMGITNYFTKLRIPGARVIP
TIYFLRHSTDPVHTATLARMMMHNIQRPTLPDDMSEEFCRWEERIRLDQDGLPEHDIFRKIDRQHKYTGLKLGGVYGFYP
RFTGPQRSTVKAAELGEIVLLGTNSYLGLATHPEVVEASAEATRRYGTGCSGSPLLNGTLDLHVSLEQELACFLGKPAAV
LCSTGYQSNLAAISALCESGDMIIQDALNHRSLFDAARLSGADFTLYRHNDMDHLARVLRRTEGRRRIIVVDAVFSMEGT
VADLATIAELADRHGCRVYVDESHALGVLGPDGRGASAALGVLARMDVVMGTFSKSFASVGGFIAGDRPVVDYIRHNGSG
HVFSASLPPAAAAATHAALRVSRREPDRRARVLAAAEYMATGLARQGYQAEYHGTAIVPVILGNPTVAHAGYLRLMRSGV
YVNPVAPPAVPEERSGFRTSYLADHRQSDLDRALHVFAGLAEDLTPQGAAL
>A9AE46 2.3.1.47~~~bioF~~~8-amino-7-oxononanoate synthase~~~COG0156
MNLLDTLQRGLADLDAQGLRRVRRTADSACDAHMTVNGREIVGFASNDYLGLAAHPKLVAAFAEGAQRYGSGSGGSHLLG
GHSRAHAKLEDELAGFAGGFSDAPRALYFSTGYMANLAAVTALAGKDATIFSDALNHASLIDGTRLSRATVQVYPHADTA
TLGALLEACTSQTKLIVTDTVFSMDGDIAPLAELLALAERHGAWLVVDDAHGFGVLGPQGRGALAAAALRSPHLVYVGTL
GKAAGVAGAFVVAHETVIEWLIQRARSYIFTTAAPPAVAHAVSASLKVIAGDEGDARRAHLAALIERTRALLRRTRWQPV
DSHTAVQPLVIGSNEATLAAMRALDAHGLWVPAIRPPTVPAGTSRLRISLSAAHSFDDLARLETALLRASEEAA
>P12998 2.3.1.47~~~bioF~~~8-amino-7-oxononanoate synthase~~~COG0156
MSWQEKINAALDARRAADALRRRYPVAQGAGRWLVADDRQYLNFSSNDYLGLSHHPQIIRAWQQGAEQFGIGSGGSGHVS
GYSVVHQALEEELAEWLGYSRALLFISGFAANQAVIAAMMAKEDRIAADRLSHASLLEAASLSPSQLRRFAHNDVTHLAR
LLASPCPGQQMVVTEGVFSMDGDSAPLAEIQQVTQQHNGWLMVDDAHGTGVIGEQGRGSCWLQKVKPELLVVTFGKGFGV
SGAAVLCSSTVADYLLQFARHLIYSTSMPPAQAQALRASLAVIRSDEGDARREKLAALITRFRAGVQDLPFTLADSCSAI
QPLIVGDNSRALQLAEKLRQQGCWVTAIRPPTVPAGTARLRLTLTAAHEMQDIDRLLEVLHGNG
>P22806 2.3.1.47~~~bioF~~~8-amino-7-oxononanoate synthase~~~
MNDRFRRELQVIEEQGLTRKLRLFSTGNESEVVMNGKKFLLFSSNNYLGLATDSRLKKKATEGISKYGTGAGGSRLTTGN
FDIHEQLESEIADFKKTEAAIVFSSGYLANVGVISSVMKAGDTIFSDAWNHASIIDGCRLSKAKTIVYEHADMVDLERKL
RQSHGDGLKFIVTDGVFSMDGDIAPLPKIVELAKEYKAYIMIDDAHATGVLGNDGCGTADYFGLKDEIDFTVGTLSKAIG
AEGGFVSTSSIAKNYLLNNARSFIFQTALSPSAIEAAREGISIIQNEPERRKQLLKNAQYLRLKLEESGFVMKEGETPII
SLIIGGSHEAMQFSAKLLDEGVFIPAIRPPTVPKGSSRLRITVMATHTIEQLDMVISKIKKIGKEMGIV
>A0QX65 2.3.1.47~~~bioF~~~8-amino-7-oxononanoate synthase~~~COG0156
MTRAGLSPLAWLADIEQRRRAEGLRRELRVRPPVAAELDLASNDYLGLSQHPDVLDGGVEALRTWGGGAGGSRLVTGNTE
LHEAFEHQLASFLGAESALVFSSGYTANLGALVALSGPGSLIVSDALSHASLVDACRLSRARVVVSPHRDVDAVDAALAA
RTEERAVVVTESVFSADGDLAPLRDLHAVCRRHGALLLVDEAHGLGVRGTRGQGLLHEVGLAGAPDIVMTTTLSKALGSQ
GGAVLGPEAVRAHLIDTARSFIFDTGLAPAAVGAASAALRVLDAEPQRARAVLDRAAELATIAGVTEAPVSAVVSVILGD
PEIAVGAAAACLDRGVRVGCFRPPTVPAGTSRLRLAARASLTDDEMALARQVLTDVLATARA
>B2JKH6 2.3.1.47~~~bioF~~~8-amino-7-oxononanoate synthase~~~COG0156
MQLLDTLEQGLKEIDARGLRRRRRTVDSPCSAHMTVDGRNMIGFASNDYLGLAAHPLLVAAITEGARRYGAGSGGSHLLG
GHSRAHAQLEDDLAEFAGGFVDNPRALYFSTGYMANLATLTALAGRGTTLFSDSLNHASLIDGARLSRADIQIYPHADAE
ALGAMLEASDAAVKLIVSDTVFSMDGDIAPLARLLELAEHHGAWLVVDDAHGFGVLGPQGRGAVAEAALRSPHLIVVGTL
GKAAGVSGAFVVAHETVIEWLVQRARPYIFTTASVPSAAHAVSASLRIIGGDEGEHRRAHLRSLIALTRDMLKSTPWLPV
DSHTAVQPLIIGSNEATLDVAASLDRANLWVPAIRPPTVPEGTSRLRISLSAAHSHNDLEQLEHALMKTAEARA
>Q146K3 2.3.1.47~~~bioF~~~8-amino-7-oxononanoate synthase~~~COG0156
MHLLDTLAEGLKEIDARGLRRRRRTADTPCAAHMTVDGRAIIGFASNDYLGLAAHPQLIAAIAEGAQRYGAGSGGSHLLG
GHSRAHAQLEDDLAEFVGGFVENARALYFSTGYMANLATLTALAGRGTTLFSDALNHASLIDGARLSRADVQIYPHCDTD
ALSAMLEASDADVKVIVSDTVFSMDGDIAPLPRLLELAEQHGAWLIVDDAHGFGVLGPQGRGAIAQAALRSPNLISIGTL
GKAAGVSGAFVAAHETVIEWLVQRARPYIFTTASVPAAAHAVSASLRIIGGEEGDARRAHLQQLIGRTRAMLKATPWLPV
DSHTAVQPLIIGANDATLEIAATLDRAGLWVPAIRPPTVPTGTSRLRISLSAAHSQADLDRLEAGLQQLGAKAA
>P13001 3.1.1.85~~~bioH~~~Pimeloyl-[acyl-carrier protein] methyl ester esterase~~~COG0596
MNNIWWQTKGQGNVHLVLLHGWGLNAEVWRCIDEELSSHFTLHLVDLPGFGRSRGFGALSLADMAEAVLQQAPDKAIWLG
WSLGGLVASQIALTHPERVQALVTVASSPCFSARDEWPGIKPDVLAGFQQQLSDDFQRTVERFLALQTMGTETARQDARA
LKKTVLALPMPEVDVLNGGLEILKTVDLRQPLQNVSMPFLRLYGYLDGLVPRKVVPMLDKLWPHSESYIFAKAAHAPFIS
HPAEFCHLLVALKQRV
>A6TF35 3.1.1.85~~~bioH~~~Pimeloyl-[acyl-carrier protein] methyl ester esterase~~~
MNDIWWQTIGEGDCHLVLLHGWGLNAQVWDCITPQLASHFTLHLVDLPGYGRSGGFGAMSLEAMAQRVLEQAPPQAVWLG
WSLGGLVASQVAIMRPERVQALVTVASSPCFAARDDWPGIKPEVLAGFQQQLSDDFQRTVERFLALQTMGTESARQDARA
LKQAVLSLPMPSAEALNGGLEILRTVDLRQALVRLPMPFLRLYGRLDGLVPRKIVPLLDDLWPESESILFDKAAHAPFVS
HPAAFCEPLLALKTRLG
>E6MWF8 3.1.1.85~~~bioH~~~Pimeloyl-[acyl-carrier protein] methyl ester esterase~~~
MRRQRERKSMPDAVKKVYLIHGWGANRHMFDDLMPRLPATWPVSAVDLPGHGDAPFVRPFDIAAAADGIAAQIDAPADIL
GWSLGGLVALYLAARHPDKVRSLCLTASFARLTADEDYPEGLAAPALGKMVGAFRSDYAKHIKQFLQLQLLHTPDADGII
GRILPDLARCGTPQALQEALDAAERADARHLLDKIDVPVLLVFGGKDAITPPRMGEYLHRRLKGSRLVVMEKAAHAPFLS
HAEAFAALYRDFVEGGLR
>Q8ZLI9 3.1.1.85~~~bioH~~~Pimeloyl-[acyl-carrier protein] methyl ester esterase~~~
MNDIWWQTYGEGNCHLVLLHGWGLNAEVWHCIREELGSHFTLHLVDLPGYGRSSGFGAMTLEEMTAQVAKNAPDQAIWLG
WSLGGLVASQMALTHPERVQALVTVASSPCFSAREGWPGIKPEILGGFQQQLSDDFQRTVERFLALQTLGTETARQDART
LKSVVLAQPMPDVEVLNGGLEILKTVDLREALKNVNMPFLRLYGYLDGLVPRKIVPLLDTLWPHSTSQIMAKAAHAPFIS
HPAAFCQALMTLKSSL
>Q8GHL1 3.1.1.85~~~bioH~~~Pimeloyl-[acyl-carrier protein] methyl ester esterase~~~
MTALYWQTIGEGERDLVLLHGWGLNAEVWSCIQALTPHFRLHLVDLPGYGRSQGFGALSLAQMTEIVLAAAPPQAWWLGW
SLGGLVASQAALMQPQRVSGLITVASSPCFAARDEWPGIRPDVLSGFQHQLSLDFQRTVERFLALQTLGTESARQDARQL
KAVVLNQPTPSVEVLNGGLEILRTADLRAPLAELNLPLLRIYGYLDGLVPRKVAELLDAAWPNSTSQIVAKAAHAPFISH
PDEFVTMIEAFIAAH
>Q83PW0 3.1.1.85~~~bioH~~~Pimeloyl-[acyl-carrier protein] methyl ester esterase~~~
MNNIWWQTKGQGNVHLVLLHGWGLNAEVWRCIDEELSSHFTLHLVDLPGFGRSRGFGALSLADMAEAVLQQAPDKAIWLG
WSLGGLVASQIALTHPERVQALVTVASSPCFSARDEWPGIKPDVLAGFQQQLSDDFQRTVERFLALQTMGTETARQDARA
LKKTVLALPMPEVDVLNGGLEILKTVDLRQPLQNVSMPFLRLYGYLDGLVPRKVVPMLDKLWPHSESYIFAKAAHAPFIS
HPVEFCHLLVALKQRVLVVSES
>P53554 1.14.14.46~~~bioI~~~Biotin biosynthesis cytochrome P450~~~COG2124
MTIASSTASSEFLKNPYSFYDTLRAVHPIYKGSFLKYPGWYVTGYEETAAILKDARFKVRTPLPESSTKYQDLSHVQNQM
MLFQNQPDHRRLRTLASGAFTPRTTESYQPYIIETVHHLLDQVQGKKKMEVISDFAFPLASFVIANIIGVPEEDREQLKE
WAASLIQTIDFTRSRKALTEGNIMAVQAMAYFKELIQKRKRHPQQDMISMLLKGREKDKLTEEEAASTCILLAIAGHETT
VNLISNSVLCLLQHPEQLLKLRENPDLIGTAVEECLRYESPTQMTARVASEDIDICGVTIRQGEQVYLLLGAANRDPSIF
TNPDVFDITRSPNPHLSFGHGHHVCLGSSLARLEAQIAINTLLQRMPSLNLADFEWRYRPLFGFRALEELPVTFE
>P53555 2.6.1.105~~~bioK~~~L-Lysine--8-amino-7-oxononanoate transaminase~~~COG0161
MTHDLIEKSKKHLWLPFTQMKDYDENPLIIESGTGIKVKDINGKEYYDGFSSVWLNVHGHRKKELDDAIKKQLGKIAHST
LLGMTNVPATQLAETLIDISPKKLTRVFYSDSGAEAMEIALKMAFQYWKNIGKPEKQKFIAMKNGYHGDTIGAVSVGSIE
LFHHVYGPLMFESYKAPIPYVYRSESGDPDECRDQCLRELAQLLEEHHEEIAALSIESMVQGASGMIVMPEGYLAGVREL
CTTYDVLMIVDEVATGFGRTGKMFACEHENVQPDLMAAGKGITGGYLPIAVTFATEDIYKAFYDDYENLKTFFHGHSYTG
NQLGCAVALENLALFESENIVEQVAEKSKKLHFLLQDLHALPHVGDIRQLGFMCGAELVRSKETKEPYPADRRIGYKVSL
KMRELGMLTRPLGDVIAFLPPLASTAEELSEMVAIMKQAIHEVTSLED
>Q2KBP5 7.6.2.-~~~bioM~~~Biotin transport ATP-binding protein BioM~~~COG1122
MNIQFESAGVSFGARVALEPLTLAITGKRIGVIGLNGSGKTTFARLINGLTKPTTGRVIVNGRDTADEKTVVTDVGFIFQ
SPQNQIILPIVKDDIAFGLKRRGLSKAEIEARVEGVLARFGAEALADRRAHELSGGELQVAALCSVLATGPGILILDEPT
NQLDLKNRALVERIIAGLPESAIVITHDLELIAGFERVLVFHEGRLAADEPAAEAIARYREIAA
>D5ARH0 7.6.2.-~~~bioM~~~Biotin transport ATP-binding protein BioM~~~COG1122
MQAIDIGHVTLERDGTGVFSDLTLRLTERRIGIVGRNGAGKSSLIRLITGLVTPQKGRVVVNGVDVAADRAGALGTVGLL
FQNPDHQIIFPVVRDEIAFGLEQKGLKRAAALARAEAVLAAQGRADWGDRLCHTLSQGQRQLLCLMSILAMEPDWILFDE
PFNALDLPTALSIEARIAGLAQNVVLVTHDPSRLTGFDRILWLEGGRIEADGPPAEVLPRYIAAMQALARAGAC
>Q2KBP6 ~~~bioN~~~Energy-coupling factor transporter transmembrane protein BioN~~~COG0619
MQSLYVEGNSRMHRLSPRAKLLSLTAFAILLFISHNLLLLSGAVLVAAVLYGTVGLPIGEALLRLRPIFLTIAVVALFNL
IFNPWQAALVPVLRLTALMLLAASVTATTTITEFIDEVTALARPLERTGRVQADDIGLALGLVLRFVPEIVNRYQAIREA
HKARGLKVRPTSLLAPLIILTLKDADNVAAAIDARRIRRHGS
>D5ARG9 ~~~bioN~~~Energy-coupling factor transporter transmembrane protein BioN~~~COG0619
MLSLALPCRSWAHRLPAALKFGLLAVAMIALMRIGSLAGQGAAVLVVAALTASLGRKAIRQSLVTLRPLVWIVAVILIWD
SLQGAVAQGVLFGLRVLAMVGLANAVTLTTPLPEIVALIERLAQPLARFGISPRIPAISVALVIRFVPVLRARHDTLAEA
WRARSARKPRGKLLAPLTFSLLDDADHMADALRARGGLALPRKGRDTVGT
>P0ADP5 ~~~bioP~~~Biotin transporter~~~COG0697
MALLIITTILWAFSFSFYGEYLAGHVDSYFAVLVRVGLAALVFLPFLRTRGNSLKTVGLYMLVGAMQLGVMYMLSFRAYL
YLTVSELLLFTVLTPLYITLIYDIMSKRRLRWGYAFSALLAVIGAGIIRYDQVTDHFWTGLLLVQLSNITFAIGMVGYKR
LMETRPMPQHNAFAWFYLGAFLVAVIAWFLLGNAQKMPQTTLQWGILVFLGVVASGIGYFMWNYGATQVDAGTLGIMNNM
HVPAGLLVNLAIWHQQPHWPTFITGALVILASLWVHRKWVAPRSSQTADDRRRDCALSE
>O08250 ~~~bioS~~~Biotin transport regulator~~~
MQIENRLNAAAASGDGLGNLAGRSADPTGAADKGESGVPVPPTGFVDPTPRISLSADALLYLGRAKRTPEKLPPLTKDEW
NNRLSPQLAAREHQAFGRLAETGDYRAYYRAFIDYYDGLRPEDQNSLRYFGTREAAVAGLRSLDYDADSGLDMDAEFENL
VSVFLEEDKIAPSPATTTMSPAERAFFAWDASNISYEVDAPEPRPMTEIERLYSELL
>Q55650 2.6.1.121~~~bioU~~~(S)-8-amino-7-oxononanoate synthase BioU~~~COG1748
MENNSLAPLRVGILGFGGLGQAAARLLAPKQEMKLVAVADRHGYLYDADGIDVDNAVQAYTQQGSVGKAKKGQMSEQSIE
DLIGEGEVDGYFLALPNLPNTFMADVTRQFIASGWQGVLVDALKRTSAVEQLITLREDLAQAGITYMTGCGATPGLLTAA
AAIASQSFQEIHQVKITFGVGIANWEAYRATIREDIAHMPGYNVDKAQAMTDAEVAALLDQTNGILALEDMEHADDIMLE
LAGICHRDQVTVGGVVDTRNPKKPLSTHVKITGRTFEGKISSHTFTLGDETSMAANVCGPAFGYLKAGYGLHRQGLKGLF
TAADVMPKFVR
>O67575 6.2.1.14~~~bioW~~~6-carboxyhexanoate--CoA ligase~~~COG1424
MDLFSVRMRAQKNGKHVSGAERIVKKEELETAVKELLNRPKEFDFMNVKVEKVKDFEVVKFNLKISTYSFKSPEEAREFA
VKKLTQEGIKEEVAKKAVEILSKGANPKGGNMRGAVLMDIETGERLEEDKERGVRTIHFDWKDRKKVTEKLLKEGYTLRT
VDALALTFKNLFCGVVAELCWSDDPDYVTGYVSGKEIGYVRITPLKEKGDPLGGRVYFVSRKELSEIIECLTQKVVLIEL
>P53559 6.2.1.14~~~bioW~~~6-carboxyhexanoate--CoA ligase~~~COG1424
MMQEETFYSVRMRASMNGSHEDGGKHISGGERLIPFHEMKHTVNALLEKGLSHSRGKPDFMQIQFEEVHESIKTIQPLPV
HTNEVSCPEEGQKLARLLLEKEGVSRDVIEKAYEQIPEWSDVRGAVLFDIHTGKRMDQTKEKGVRVSRMDWPDANFEKWA
LHSHVPAHSRIKEALALASKVSRHPAVVAELCWSDDPDYITGYVAGKKMGYQRITAMKEYGTEEGCRVFFIDGSNDVNTY
IHDLEKQPILIEWEEDHDS
>P22822 6.2.1.14~~~bioW~~~6-carboxyhexanoate--CoA ligase~~~
MLETCYSIRMRAAEKNLEGGEKHISGGERIGSEFQIEPIVKQLLNKARNHSRGDADFIQITVEKLTGDQILYMPPLEITT
IDESSIERAHKEARSILTSVGVSKQAQNVAFHLLASNQNLRGAILLHSQTGLRLDNRGLKGVRVSRIDWQDADVGYNERV
REALALATKVANSPYTIAELCWSDDPEYVTGYVSNHEIGYVRITPLKREGCESGGRIFFVSDEVELESYIHYLEREPILI
RGHLK
>A2RI45 ~~~bioY2~~~Biotin transporter BioY2~~~COG1268
MQNTKLYSLTLIALGAAIIAVLSPLAIPIGIVPVTLQTLAVGLVATVLKARETFFAILLYLLLGFIGIPVFTGGTSGIAV
LFGPTGGFLLAFLVMGTLISWGLHQIKYKTIPAFIINIVGHLLMLVIGTLWLKFFTQVDWSLALKLGFTPFVFVEIIKAI
LVTIFGLALIRALSHTNKYFTN
>O07620 ~~~bioY~~~Probable biotin transporter BioY~~~COG1268
MLKLIDMMHIAIFTALMAVLGFMPPLFLSFTPVPITLQTLGVMLAGSILRPKSAFLSQLVFLLLVAFGAPLLPGGRGGFG
VFFGPSAGFLIAYPLASWLISLAANRLRKVTVLRLFFTHIVFGIIFIYLLGIPVQAFIMHIDLSQAAFMSLAYVPGDLIK
AAVSAFLAIKITQALSLSDTMFTKGG
>A2RMJ9 ~~~bioY~~~Biotin transporter BioY~~~COG1268
MTNNQKVKTLTYSAFMTAFIIILGFLPGIPIGFIPVPIILQNMGIMMAGGLLGPKYGTISVGAFLALALIGLPVLTGGNG
GAASFLGPSGGYRIAWLFTPFLIGFFLKKLKITTSQNWFGELIIVLLFGVIFVDFVGAIWLSFQSNIPLLTSLISNLVFI
PGDCIKAILTVVIVRRLRKQGGFELYFRK
>Q2KBP7 ~~~bioY~~~Biotin transporter BioY~~~COG1268
MSTRDLVLTALFAAIIVALGLLPPISLGFIPVPITAQSLGVMMAGVVLGARRGAIAVLIVLLLVAIGLPVLSGGRGGLAI
FASPTAGFLVGWIFGAFVTGYLSERLVNHGQSGLVQTVSFFLAAMIGGIVVLYAFGITYLATVAGLGFTKAFVGSMAFIP
GDVIKAVVAALLGRAVMVGYPLLPARA
>D5ARG8 ~~~bioY~~~Biotin transporter BioY~~~COG1268
MERNVTLIGLFAALIVALGFVPAIPLGFGVPITLQSLGVMLAGAVLGSWRGALAVLLVQALVAIGLPVLAGGRGGLGIFV
GPTAGFLIGWLPAAFVTGLIVERLRRVPVALAAGIGATLGGIIIMYALGILGFWLVKNAGLKPEDAPISLWAATAIMAPF
IPGDLVKVVVTGLVARTIAQYRPSALLARG
>Q9WZQ6 ~~~bioY~~~Biotin transporter BioY~~~COG1268
MRQLIKAGIFTALIVVGAWISIPLGPVPFTLQVFFVFLSAYVLGKKYGTLAVATYVLLGAMGLPVFANFKGGAQVLVGPT
GGYLFGFILGAFVIGLLAEKKESFAWYLASGVAGLGIIYALGVFVLNFYVHDIRKAISVGFVPFVWFDLIKLVVAALIAL
RLKKLEVER
>Q7CTU0 2.3.1.-~~~bioZ~~~3-oxopimeloyl-[acyl-carrier-protein] synthase~~~COG0332
MQTRSSRMAGFGHAVPARCVDNAEIEASLGLEAGWIERRTGIRSRYWAEAGDTLSGLAERAGRMALEDAKINADDIALTL
LATSTPDHLLPPSAPLLAHRLGLTRSGAIDLAGACSGFLYALTLADGFVRTYGRAVLVVAANILSRRINPAERASAVLFA
DAAGAVVLTPCPEVKRGVLSADLVADGSGYDLIQIAAGGSSQPFSAGMIAEDALMTMRDGREVFSRAVALMTNTSQRVLH
EAELTAADISRFVPHQANARMSDAVCGNLGIEREKTVRTIGSFGNSSAATIPLSLSITNAERPLAGGETLLLTAAGAGMT
GGAVVYRV
>Q2YKB4 2.3.1.-~~~bioZ~~~3-oxopimeloyl-[acyl-carrier-protein] synthase~~~
MTVCSSRLAGFGHAVPDRRVENAEIEAQLGLETGWIERRTGIRCRRWAMPDETLSHLAASAADMALSDAGIERSDIALTL
LATSTPDHLLPPTAPLLTHWLNLQNSGAADLAGACTGFLYALVLADGFVRAQGKPVLVVAANLLSRRINMAERASAVLFG
DAAGAVVLAPSAKANSFQSQFITNGSHYDLIKVPAGGSARAYAPERDASEFLMTMQDGRAVFTEAVRIMSGASQNVLASA
AMLPQAIDRFFPHQANIRIVDKVCETIGVPRAKAASTLETYGNSSAATIPLSLSLANLEQPLREGERLLFAAAGAGMTGG
AVLMQV
>D0MCQ4 2.3.1.-~~~bioZ~~~3-oxopimeloyl-[acyl-carrier-protein] synthase~~~COG0332
MLPEQSLTTPLPATATAAPARRAAVLGVGAALPAHREPSTETERRLGLPPGWIARRTGIRERPLVGPDEATSDLAVRAGA
AALAQAELSPERIGLLLLATSTPDHLLPPTAPVVAHRLGLKHAGAVDLAGACSGFLYALALADGYVRLQRTCVLVIGANV
LSRRTNPDDPKTSALFADGAGAVVLGPSEGSRGIVACWLGADGSCWDDLYIPAGGSRRPLTPERVARGEHLMYMKDGRAL
FRRAATGMAEAGRRVLQQAGLDLDDVAWWIPHQANHRLIEEARRQLGMPEARTVNLVDRIGNSSAATIPLALALEAHRFA
PGDLLLLTAVGAGLLSAAVLIQW
>O07631 3.6.5.-~~~bipA~~~Large ribosomal subunit assembly factor BipA~~~COG1217
MKLRNDLRNIAIIAHVDHGKTTLVDQLLHQAGTFRANEQVAERAMDSNDLERERGITILAKNTAINYKDTRINILDTPGH
ADFGGEVERIMKMVDGVVLVVDAYEGCMPQTRFVLKKALEQNLNPVVVVNKIDRDFARPEEVIDEVLDLFIELDANEEQL
EFPVVYASAINGTASLDPKQQDENMEALYETIIKHVPAPVDNAEEPLQFQVALLDYNDYVGRIGIGRVFRGTMKVGQQVS
LMKLDGTAKSFRVTKIFGFQGLKRVEIEEAKAGDLVAVSGMEDINVGETVCPVDHQDPLPVLRIDEPTLQMTFVVNNSPF
AGREGKYVTARKIEERLQSQLQTDVSLRVEPTASPDAWVVSGRGELHLSILIENMRREGYELQVSKPEVIIKEIDGVRCE
PVERVQIDVPEEHTGSVMESMGARKGEMVDMINNGNGQVRLIFTVPSRGLIGYSTEFLSLTRGFGILNHTFDSYQPMQAG
QVGGRRQGVLVSMENGKATSYGIQGIEDRGVIFVEPGTEVYEGMIVGEHNRDNDLVVNVSKMKQQTNVRSATKDQTTTIK
KARIMSLEESLEYLNEDEYCEVTPESIRLRKKILNKNEREKAAKKKKTAGLS
>P0A3B2 3.6.5.-~~~bipA~~~Large ribosomal subunit assembly factor BipA~~~
MIEKLRNIAIIAHVDHGKTTLVDKLLQQSGTFDSRAETQERVMDSNDLEKERGITILAKNTAIKWNDYRINIVDTPGHAD
FGGEVERVMSMVDSVLLVVDAFDGPMPQTRFVTKKAFAYGLKPIVVINKVDRPGARPDWVVDQVFDLFVNLDATDEQLDF
PIVYASALNGIAGLDHEDMAEDMTPLYQAIVDHVPAPDVDLDGPFQMQISQLDYNSYVGVIGIGRIKRGKVKPNQQVTII
DSEGKTRNAKVGKVLGHLGLERIETDLAEAGDIVAITGLGELNISDTVCDTQNVEALPALSVDEPTVSMFFCVNTSPFCG
KEGKFVTSRQILDRLNKELVHNVALRVEETEDADAFRVSGRGELHLSVLIENMRREGFELAVSRPKVIFREIDGRKQEPY
ENVTLDVEEQHQGSVMQALGERKGDLKNMNPDGKGRVRLDYVIPSRGLIGFRSEFMTMTSGTGLLYSTFSHYDDVRPGEV
GQRQNGVLISNGQGKAVAFALFGLQDRGKLFLGHGAEVYEGQIIGIHSRSNDLTVNCLTGKKLTNMRASGTDEAVVLVPP
IRMTLEQALEFIDDDELVEVTPTSIRIRKRHLTENDRRRANRAPKDD
>P0DTT0 3.6.5.-~~~bipA~~~Large ribosomal subunit assembly factor BipA~~~
MIEKLRNIAIIAHVDHGKTTLVDKLLQQSGTFDSRAETQERVMDSNDLEKERGITILAKNTAIKWNDYRINIVDTPGHAD
FGGEVERVMSMVDSVLLVVDAFDGPMPQTRFVTKKAFAYGLKPIVVINKVDRPGARPDWVVDQVFDLFVNLDATDEQLDF
PIVYASALNGIAGLDHEDMAEDMTPLYQAIVDHVPAPDVDLDGPFQMQISQLDYNSYVGVIGIGRIKRGKVKPNQQVTII
DSEGKTRNAKVGKVLGHLGLERIETDLAEAGDIVAITGLGELNISDTVCDTQNVEALPALSVDEPTVSMFFCVNTSPFCG
KEGKFVTSRQILDRLNKELVHNVALRVEETEDADAFRVSGRGELHLSVLIENMRREGFELAVSRPKVIFREIDGRKQEPY
ENVTLDVEEQHQGSVMQALGERKGDLKNMNPDGKGRVRLDYVIPSRGLIGFRSEFMTMTSGTGLLYSTFSHYDDVRPGEV
GQRQNGVLISNGQGKAVAFALFGLQDRGKLFLGHGAEVYEGQIIGIHSRSNDLTVNCLTGKKLTNMRASGTDEAVVLVPP
IRMTLEQALEFIDDDELVEVTPTSIRIRKRHLTENDRRRANRAPKDD
>P44910 3.6.5.-~~~bipA~~~Large ribosomal subunit assembly factor BipA~~~COG1217
MKNEIDIKKLRNIAIIAHVDHGKTTLVDKLLQQSGTFESARGDVDERVMDSNDLEKERGITILAKNTAINWNDYRINIVD
TPGHADFGGEVERVLSMVDSVLLVVDAFDGPMPQTRFVTQKAFAHGLKPIVVINKVDRPGARPDWVVDQVFDLFVNLGAS
DEQLDFPIIYASALNGVAGLEHEDLAEDMTPLFEAIVKHVEPPKVELDAPFQMQISQLDYNNYVGVIGIGRIKRGSIKPN
QPVTIINSEGKTRQGRIGQVLGHLGLQRYEEDVAYAGDIVAITGLGELNISDTICDINTVEALPSLTVDEPTVTMFFCVN
TSPFAGQEGKYVTSRQILERLNKELVHNVALRVEETPNPDEFRVSGRGELHLSVLIENMRREGYELAVSRPKVIYRDIDG
KKQEPYEQVTIDVEEQHQGSVMEALGIRKGEVRDMLPDGKGRVRLEYIIPSRGLIGFRGDFMTMTSGTGLLYSSFSHYDE
IKGGEIGQRKNGVLISNATGKALGYALFGLQERGKLMIDANIEVYEGQIIGIHSRSNDLTVNCLQGKKLTNMRASGKDDA
IVLTTPVKFSLEQAIEFIDDDELVEVTPESIRIRKKLLTENDRKRANRTTTSTSTH
>H9L427 3.6.5.-~~~bipA~~~Large ribosomal subunit assembly factor BipA~~~
MIENLRNIAIIAHVDHGKTTLVDKLLQQSGTFDARAETQERVMDSNDLEKERGITILAKNTAIKWNDYRINIVDTPGHAD
FGGEVERVMSMVDSVLLVVDAFDGPMPQTRFVTKKAFAHGLKPIVVINKVDRPGARPDWVVDQVFDLFVNLDATDEQLDF
PIIYASALNGIAGLDHEDMAEDMTPLYQAIVDHVPAPDVDLDGPLQMQISQLDYNNYVGVIGIGRIKRGKVKPNQQVTII
DSEGKTRNAKVGKVLTHLGLERIDSNIAEAGDIIAITGLGELNISDTICDPQNVEALPALSVDEPTVSMFFCVNTSPFCG
KEGKFVTSRQILDRLNKELVHNVALRVEETEDADAFRVSGRGELHLSVLIENMRREGFELAVSRPKVIFREIDGRKQEPY
ENVTLDVEEQHQGSVMQALGERKGDLKNMNPDGKGRVRLDYVIPSRGLIGFRSEFMTMTSGTGLLYSTFSHYDDIRPGEV
GQRQNGVLISNGQGKAVAFALFGLQDRGKLFLGHGAEVYEGQIIGIHSRSNDLTVNCLTGKKLTNMRASGTDEAVILVPP
IKMSLEQALEFIDDDELVEVTPTSIRIRKRHLTENDRRRANRGQKEE
>Q63K34 ~~~bipB~~~Translocator protein BipB~~~
MSSGVQGGPAANANAYQTHPLRDAASALGTLSPQAYVDVVSAAQRNFLERMSQLASEQCDAQPAAHDARLDDRPALRAPQ
ERDAPPLGASDTGSRASGAAKLTELLGVLMSVISASSLDELKQRSDIWNQMSKAAQDNLSRLSDAFQRATDEAKAAADAA
EQAAAAAKQAGADAKAADAAVDAAQKRYDDAVKQGLPDDRLQSLKAALEQARQQAGDAHGRADALQADATKKLDAASALA
TQARACEQQVDDAVNQATQQYGASASLRTPQSPRLSGAAELTAVLGKLQELISSGNVKELESKQKLFTEMQAKREAELQK
KSDEYQAQVKKAEEMQKTMGCIGKIVGWVITAVSFAAAAFTGGASLALAAVGLALAVGDEISRATTGVSFMDKLMQPVMD
AILKPLMEMISSLITKALVACGVDQQKAELAGAILGAVVTGVALVAAAFVGASAVKAVASKVIDAMAGQLTKLMDSAIGK
MLVQLIEKFSEKSGLQALGSRTATAMTRMRRAIGVEAKEDGMLLANRFEKAGTVMNVGNQVSQAAGGIVVGVERAKAMGL
LADVKEAMYDIKLLGDLLKQAVDAFAEHNRVLAQLMQQMSDAGEMQTSTGKLILRNARAV
>Q63K37 ~~~bipD~~~Translocator protein BipD~~~
MNMHVDMGRALTVRDWPALEALAKTMPADAGARAMTDDDLRAAGVDRRVPEQKLGAAIDEFASLRLPDRIDGRFVDGRRA
NLTVFDDARVAVRGHARAQRNLLERLETELLGGTLDTAGDEGGIQPDPILQGLVDVIGQGKSDIDAYATIVEGLTKYFQS
VADVMSKLQDYISAKDDKNMKIDGGKIKALIQQVIDHLPTMQLPKGADIARWRKELGDAVSISDSGVVTINPDKLIKMRD
SLPPDGTVWDTARYQAWNTAFSGQKDNIQNDVQTLVEKYSHQNSNFDNLVKVLSGAISTLTDTAKSYLQI
>P0CI75 6.3.4.15~~~birA~~~Bifunctional ligase/repressor BirA~~~COG0340
MRSTLRKDLIELFSQAGNEFISGQKISDALGCSRTAVWKHIEELRKEGYEVEAVRRKGYRLIKKPGKLSESEIRFGLKTE
VMGQHLIYHDVLSSTQKTAHELANNNAPEGTLVVADKQTAGRGRMSRVWHSQEGNGVWMSLILRPDIPLQKTPQLTLLAA
VAVVQGIEEAAGIQTDIKWPNDILINGKKTVGILTEMQAEEDRVRSVIIGIGINVNQQPNDFPDELKDIATSLSQAAGEK
IDRAGVIQHILLCFEKRYRDYMTHGFTPIKLLWESYALGIGTNMRARTLNGTFYGKALGIDDEGVLLLETNEGIKKIYSA
DIELG
>P06709 6.3.4.15~~~birA~~~Bifunctional ligase/repressor BirA~~~COG0340
MKDNTVPLKLIALLANGEFHSGEQLGETLGMSRAAINKHIQTLRDWGVDVFTVPGKGYSLPEPIQLLNAKQILGQLDGGS
VAVLPVIDSTNQYLLDRIGELKSGDACIAEYQQAGRGRRGRKWFSPFGANLYLSMFWRLEQGPAAAIGLSLVIGIVMAEV
LRKLGADKVRVKWPNDLYLQDRKLAGILVELTGKTGDAAQIVIGAGINMAMRRVEESVVNQGWITLQEAGINLDRNTLAA
MLIRELRAALELFEQEGLAPYLSRWEKLDNFINRPVKLIIGDKEIFGISRGIDKQGALLLEQDGIIKPWMGGEISLRSAE
K
>I6YFP0 6.3.4.15~~~birA~~~Biotin--[acetyl-CoA-carboxylase] ligase~~~COG0340
MTDRDRLRPPLDERSLRDQLIGAGSGWRQLDVVAQTGSTNADLLARAASGADIDGVVLIAEHQTAGRGRHGRGWAATARA
QIILSVGVRVVDVPVQAWGWLSLAAGLAVLDSVAPLIAVPPAETGLKWPNDVLARGGKLAGILAEVAQPFVVLGVGLNVT
QAPEEVDPDATSLLDLGVAAPDRNRIASRLLRELEARIIQWRNANPQLAADYRARSLTIGSRVRVELPGGQDVVGIARDI
DDQGRLCLDVGGRTVVVSAGDVVHLR
>P20099 1.-.-.-~~~bisC~~~Biotin sulfoxide reductase~~~COG0243
MANSSSRYSVLTAAHWGPMLVETDGETVFSSRGALATGMENSLQSAVRDQVHSNTRVRFPMVRKGFLASPENPQGIRGQD
EFVRVSWDEALDLIHQQHKRIREAYGPASIFAGSYGWRSNGVLHKASTLLQRYMALAGGYTGHLGDYSTGAAQAIMPYVV
GGSEVYQQQTSWPLVLEHSDVVVLWSANPLNTLKIAWNASDEQGLSYFSALRDSGKKLICIDPMRSETVDFFGDKMEWVA
PHMGTDVALMLGIAHTLVENGWHDEAFLARCTTGYAVFASYLLGESDGIAKTAEWAAEICGVGAAKIRELAAIFHQNTTM
LMAGWGMQRQQFGEQKHWMIVTLAAMLGQIGTPGGGFGLSYHFANGGNPTRRSAVLSSMQGSLPGGCDAVDKIPVARIVE
ALENPGGAYQHNGMNRHFPDIRFIWWAGGANFTHHQDTNRLIRAWQKPELVVISECFWTAAAKHADIVLPATTSFERNDL
TMTGDYSNQHLVPMKQVVPPRYEARNDFDVFAELSERWEKGGYARFTEGKSELQWLETFYNVARQRGASQQVELPPFAEF
WQANQLIEMPENPDSERFIRFADFCRDPLAHPLKTASGKIEIFSQRIADYGYPDCPGHPMWLEPDEWQGNAEPEQLQVLS
AHPAHRLHSQLNYSSLRELYAVANREPVTIHPDDAQERGIQDGDTVRLWNARGQILAGAVISEGIKPGVICIHEGAWPDL
DLTADGICKNGAVNVLTKDLPSSRLGNGCAGNTALAWLEKYNGPELTLTAFEPPASS
>Q1M7F4 3.5.1.84~~~biuH~~~Biuret amidohydrolase~~~
MDAMVETNRHFIDADPYPWPYNGALRPDNTALIIIDMQTDFCGKGGYVDHMGYDLSLVQAPIEPIKRVLAAMRAKGYHII
HTREGHRPDLADLPANKRWRSQRIGAGIGDPGPCGRILTRGEPGWDIIPELYPIEGETIIDKPGKGSFCATDLELVLNQK
RIENIILTGITTDVCVSTTMREANDRGYECLLLEDCCGATDYGNHLAAIKMVKMQGGVFGSVSNSAALVEALP
>Q89VI2 2.3.1.228~~~bjaI~~~Isovaleryl-homoserine lactone synthase~~~COG3916
MIHAISAVNRHLYEDVLEQHFRLRHDIFVEERHWETLRRPDGREVDSYDDEDTVYLLALEGRRVVGGHRLYPTTKPSMMS
EVFPHLAAVRGCPSDPLIWEWSRYFVVRDRRDGALNLQLMAAVQEFCLDQGIAQVSAIMETWWLPRFHEAGFVVTPLGLP
ALVENAWTMAATVDIRRQTLDVLHDRIGMPSIVQQDGPRLDAVARANLCGLAAAQRKSA
>P9WIS3 1.2.4.4~~~bkdA~~~3-methyl-2-oxobutanoate dehydrogenase subunit alpha~~~COG1071
MGEGSRRPSGMLMSVDLEPVQLVGPDGTPTAERRYHRDLPEETLRWLYEMMVVTRELDTEFVNLQRQGELALYTPCRGQE
AAQVGAAACLRKTDWLFPQYRELGVYLVRGIPPGHVGVAWRGTWHGGLQFTTKCCAPMSVPIGTQTLHAVGAAMAAQRLD
EDSVTVAFLGDGATSEGDVHEALNFAAVFTTPCVFYVQNNQWAISMPVSRQTAAPSIAHKAIGYGMPGIRVDGNDVLACY
AVMAEAAARARAGDGPTLIEAVTYRLGPHTTADDPTRYRSQEEVDRWATLDPIPRYRTYLQDQGLWSQRLEEQVTARAKH
VRSELRDAVFDAPDFDVDEVFTTVYAEITPGLQAQREQLRAELARTD
>P9WIS1 1.2.4.4~~~bkdB~~~3-methyl-2-oxobutanoate dehydrogenase subunit beta~~~COG0022
MTQIADRPARPDETLAVAVSDITQSLTMVQAINRALYDAMAADERVLVFGEDVAVEGGVFRVTEGLADTFGADRCFDTPL
AESAIIGIAVGLALRGFVPVPEIQFDGFSYPAFDQVVSHLAKYRTRTRGEVDMPVTVRIPSFGGIGAAEHHSDSTESYWV
HTAGLKVVVPSTPGDAYWLLRHAIACPDPVMYLEPKRRYHGRGMVDTSRPEPPIGHAMVRRSGTDVTVVTYGNLVSTALS
SADTAEQQHDWSLEVIDLRSLAPLDFDTIAASIQRTGRCVVMHEGPRSLGYGAGLAARIQEEMFYQLEAPVLRACGFDTP
YPPARLEKLWLPGPDRLLDCVERVLRQP
>O06159 2.3.1.168~~~bkdC~~~Dihydrolipoyllysine-residue acyltransferase component of branched-chain alpha-ketoacid dehydrogenase complex~~~COG0508
MSGEDSIRSFPVPDLGEGLQEVTVTCWSVAVGDDVEINQTLCSVETAKAEVEIPSPYAGRIVELGGAEGDVLKVGAELVR
IDTGPTAVAQPNGEGAVPTLVGYGADTAIETSRRTSRPLAAPVVRKLAKELAVDLAALQRGSGAGGVITRADVLAAARGG
VGAGPDVRPVHGVHARMAEKMTLSHKEIPTAKASVEVICAELLRLRDRFVSAAPEITPFALTLRLLVIALKHNVILNSTW
VDSGEGPQVHVHRGVHLGFGAATERGLLVPVVTDAQDKNTRELASRVAELITGAREGTLTPAELRGSTFTVSNFGALGVD
DGVPVINHPEAAILGLGAIKPRPVVVGGEVVARPTMTLTCVFDHRVVDGAQVAQFMCELRDLIESPETALLDL
>Q0KBP1 2.3.1.16~~~bktB~~~Beta-ketothiolase BktB~~~COG0183
MTREVVVVSGVRTAIGTFGGSLKDVAPAELGALVVREALARAQVSGDDVGHVVFGNVIQTEPRDMYLGRVAAVNGGVTIN
APALTVNRLCGSGLQAIVSAAQTILLGDTDVAIGGGAESMSRAPYLAPAARWGARMGDAGLVDMMLGALHDPFHRIHMGV
TAENVAKEYDISRAQQDEAALESHRRASAAIKAGYFKDQIVPVVSKGRKGDVTFDTDEHVRHDATIDDMTKLRPVFVKEN
GTVTAGNASGLNDAAAAVVMMERAEAERRGLKPLARLVSYGHAGVDPKAMGIGPVPATKIALERAGLQVSDLDVIEANEA
FAAQACAVTKALGLDPAKVNPNGSGISLGHPIGATGALITVKALHELNRVQGRYALVTMCIGGGQGIAAIFERI
>B3U538 3.5.2.6~~~~~~Beta-lactamase OXA-133~~~
MNKYFTCYVVASLFFSGCTVQHNLINETQSQIVQGHNQVIHQYFDEKNTSGVLVIQTDKKINLYGNALSRANTEYVPAST
FKMLNALIGLENQKTDINEIFKWKGEKRSFTTWEKDMTLGEAMKLSAVPVYQELARRIGLDLMQKEVERIDFGNAEIGQQ
VDNFWLIGPLKVTPIQEVEFVSQLAHTQLPFSEKVQANVKNMLLLEENNGYKIFGKTGWAMDIKPQVGWLTGWVEQPDGK
IVAFALNMEMRSEMPASIRNELLMKSLKQLNII
>P10424 3.5.2.6~~~penPC~~~Beta-lactamase 1~~~COG2367
MKNKKMLKIGMCVGILGLSITSLVTFTGGALQVEAKEKTGQVKHKNQATHKEFSQLEKKFDARLGVYAIDTGTNQTIAYR
PNERFAFASTYKALAAGVLLQQNSTKKLDEVITYTKEDLVDYSPVTEKHVDTGMTLGEIAEAAVRYSDNTAGNILFHKIG
GPKGYEKALRKMGDRVTMSDRFETELNEAIPGDIRDTSTAKAIARNLKDFTVGNALPHQKRNILTEWMKGNATGDKLIRA
GVPTDWVDADKSGAGSYGTRNDIAIVWPPNRSPIIIAILSSKDEKEATYDNQLIKEAAEVVIDAIK
>P0AD63 3.5.2.6~~~bla~~~Beta-lactamase SHV-1~~~
MRYIRLCIISLLATLPLAVHASPQPLEQIKLSESQLSGRVGMIEMDLASGRTLTAWRADERFPMMSTFKVVLCGAVLARV
DAGDEQLERKIHYRQQDLVDYSPVSEKHLADGMTVGELCAAAITMSDNSAANLLLATVGGPAGLTAFLRQIGDNVTRLDR
WETELNEALPGDARDTTTPASMAATLRKLLTSQRLSARSQRQLLQWMVDDRVAGPLIRSVLPAGWFIADKTGAGERGARG
IVALLGPNNKAERIVVIYLRDTPASMAERNQQIAGIGAALIEHWQR
>P0AD64 3.5.2.6~~~bla~~~Beta-lactamase SHV-1~~~
MRYIRLCIISLLATLPLAVHASPQPLEQIKLSESQLSGRVGMIEMDLASGRTLTAWRADERFPMMSTFKVVLCGAVLARV
DAGDEQLERKIHYRQQDLVDYSPVSEKHLADGMTVGELCAAAITMSDNSAANLLLATVGGPAGLTAFLRQIGDNVTRLDR
WETELNEALPGDARDTTTPASMAATLRKLLTSQRLSARSQRQLLQWMVDDRVAGPLIRSVLPAGWFIADKTGAGERGARG
IVALLGPNNKAERIVVIYLRDTPASMAERNQQIAGIGAALIEHWQR
>P52700 3.5.2.6~~~~~~Metallo-beta-lactamase L1 type 3~~~
MRSTLLAFALAVALPAAHTSAAEVPLPQLRAYTVDASWLQPMAPLQIADHTWQIGTEDLTALLVQTPDGAVLLDGGMPQM
ASHLLDNMKARGVTPRDLRLILLSHAHADHAGPVAELKRRTGAKVAANAESAVLLARGGSDDLHFGDGITYPPANADRIV
MDGEVITVGGIVFTAHFMAGHTPGSTAWTWTDTRNGKPVRIAYADSLSAPGYQLQGNPRYPHLIEDYRRSFATVRALPCD
VLLTPHPGASNWDYAAGARAGAKALTCKAYADAAEQKFDGQLAKETAGAR
>Q03680 3.5.2.6~~~blaL~~~Beta-lactamase 1~~~
MRIRPTRRLLLGAVAPLALVPLVACGQASGSESGQQPGLGGCGTSAHGSADAHEKEFRALEKKFDAHPGVYAIDTRDGQE
ITHRADERFAYGSTFKALQAGAILAQVLRDGREVRRGAEADGMDKVVHYGQDAILPNSPVTEKHVADGMSLRELCDAVVA
YSDNTAANLLFDQLGGRRGSTRVLKQLGDHTTSMDRYEQELGSAVPGDPRDTSTPRAFAEDLRAFAVEDGEKAALAPNDR
EQLNDWMSGSRTGDALIRAGVPKDWKVEDKSGQVKYGTRNDIAVVRPPGRAPIVVSVMSHGDTQDAEPHDELVAEAGLVV
ADGLK
>Q9S169 3.5.2.6~~~bla~~~Beta-lactamase SHV-24~~~
MRYIRLCIISLLATLPLAVHASPQPLEQIKLSESQLSGRVGMIEMDLASGRTLTAWRADERFPMMSTFKVVLCGAVLARV
DAGDEQLERKIHYRQQDLVDYSPVSEKHLADGMTVGELCAAAITMSDNSAANLLLATVGGPAGLTAFLRQIGDNVTRLDR
WETELNEALPGDARGTTTPASMAATLRKLLTSQRLSARSQRQLLQWMVDDRVAGPLIRSVLPAGWFIADKTGAGERGARG
IVALLGPNNKAERIVVIYLRDTPASMAERNQQIAGIGAALIEHWQR
>P10425 3.5.2.6~~~~~~Metallo-beta-lactamase type 2~~~
MKKNTLLKVGLCVSLLGTTQFVSTISSVQASQKVEQIVIKNETGTISISQLNKNVWVHTELGYFNGEAVPSNGLVLNTSK
GLVLVDSSWDNKLTKELIEMVEKKFQKRVTDVIITHAHADRIGGITALKERGIKAHSTALTAELAKKSGYEEPLGDLQTV
TNLKFGNTKVETFYPGKGHTEDNIVVWLPQYQILAGGCLVKSAEAKNLGNVADAYVNEWSTSIENMLKRYRNINLVVPGH
GKVGDKGLLLHTLDLLK
>P04190 3.5.2.6~~~blm~~~Metallo-beta-lactamase type 2~~~
MKKNTLLKVGLCVGLLGTIQFVSTISSVQASQKVEKTVIKNETGTISISQLNKNVWVHTELGSFNGEAVPSNGLVLNTSK
GLVLVDSSWDDKLTKELIEMVEKKFQKRVTDVIITHAHADRIGGIKTLKERGIKAHSTALTAELAKKNGYEEPLGDLQTV
TNLKFGNMKVETFYPGKGHTEDNIVVWLPQYNILVGGCLVKSTSAKDLGNVADAYVNEWSTSIENVLKRYRNINAVVPGH
GEVGDKGLLLHTLDLLK
>P0A9Z7 3.5.2.6~~~bla~~~Beta-lactamase SHV-2~~~
MRYIRLCIISLLATLPLAVHASPQPLEQIKLSESQLSGRVGMIEMDLASGRTLTAWRADERFPMMSTFKVVLCGAVLARV
DAGDEQLERKIHYRQQDLVDYSPVSEKHLADGMTVGELCAAAITMSDNSAANLLLATVGGPAGLTAFLRQIGDNVTRLDR
WETELNEALPGDARDTTTPASMAATLRKLLTSQRLSARSQRQLLQWMVDDRVAGPLIRSVLPAGWFIADKTGASERGARG
IVALLGPNNKAERIVVIYLRDTPASMAERNQQIAGIGAALIEHWQR
>P06548 3.5.2.6~~~blaZ~~~Beta-lactamase 3~~~
MFVLNKFFTNSHYKKIVPVVLLSCATLIGCSNSNTQSESNKQTNQTNQVKQENKRNHAFAKLEKEYNAKLGIYALDTSTN
QTVAYHADDRFAFASTSKSLAVGALLRQNSIEALDERITYTRKDLSNYNPITEKHVDTGMTLKELADASVRYSDSTAHNL
ILKKLGGPSAFEKILREMGDTVTNSERFEPELNEVNPGETHDTSTPKAIAKTLQSFTLGTVLPSEKRELLVDWMKRNTTG
DKLIRAGVPKGWEVADKTGAGSYGTRNDIAIIWPPNKKPIVLSILSNHDKEDAEYDDTLIADATKIVLETLKVTNK
>P30896 3.5.2.6~~~bla~~~Beta-lactamase SHV-3~~~
MRYIRLCIISLLATLPLAVHASPQPLEQIKLSESQLSGRVGMIEMDLASGRTLTAWRADERFPMMSTFKVVLCGAVLARV
DAGDEQLERKIHYRQQDLVDYSPVSEKHLADGMTVGELCAAAITMSDNSAANLLLATVGGPAGLTAFLRQIGDNVTRLDR
WETELNEALPGDARDTTTPASMAATLRKLLTSQRLSARSQLQLLQWMVDDRVAGPLIRSVLPAGWFIADKTGASERGARG
IVALLGPNNKAERIVVIYLRDTPASMAERNQQIAGIGAALIEHWQR
>P37323 3.5.2.6~~~bla~~~Beta-lactamase SHV-4~~~
SPQPLEQIKLSESQLSGRVGMIEMDLASGRTLTAWRADERFPMMSTFKVVLCGAVLARVDAGDEQLERKIHYRQQDLVDY
SPVSEKHLADGMTVGELCAAAITMSDNSAANLLLATVGGPAGLTAFLRQIGDNVTRLDRWETELNEALPGDARDTTTPAS
MAATLRKLLTSQRLSARSQLQLLQWMVDDRVAGPLIRSVLPAGWFIADKTGASKRGARGIVALLGPNNKAERIVVIYLRD
TPASMAERNQQIAGIGAALIEHWQR
>O08498 3.5.2.6~~~blaB1~~~Metallo-beta-lactamase type 2~~~
MLKKIKISLILALGLTSLQAFGQENPDVKIEKLKDNLYVYTTYNTFNGTKYAANAVYLVTDKGVVVIDCPWGEDKFKSFT
DEIYKKHGKKVIMNIATHSHDDRAGGLEYFGKIGAKTYSTKMTDSILAKENKPRAQYTFDNNKSFKVGKSEFQVYYPGKG
HTADNVVVWFPKEKVLVGGCIIKSADSKDLGYIGEAYVNDWTQSVHNIQQKFSGAQYVVAGHDDWKDQRSIQHTLDLINE
YQQKQKASN
>P26918 3.5.2.6~~~cphA~~~Metallo-beta-lactamase type 2~~~
MMKGWMKCGLAGAVVLMASFWGGSVRAAGMSLTQVSGPVYVVEDNYYVQENSMVYFGAKGVTVVGATWTPDTARELHKLI
KRVSRKPVLEVINTNYHTDRAGGNAYWKSIGAKVVSTRQTRDLMKSDWAEIVAFTRKGLPEYPDLPLVLPNVVHDGDFTL
QEGKVRAFYAGPAHTPDGIFVYFPDEQVLYGNCILKEKLGNLSFADVKAYPQTLERLKAMKLPIKTVIGGHDSPLHGPEL
IDHYEALIKAAPQS
>P14488 3.5.2.6~~~~~~Metallo-beta-lactamase type 2~~~COG0491
MKNTLLKLGVCVSLLGITPFVSTISSVQAERTVEHKVIKNETGTISISQLNKNVWVHTELGYFSGEAVPSNGLVLNTSKG
LVLVDSSWDDKLTKELIEMVEKKFKKRVTDVIITHAHADRIGGMKTLKERGIKAHSTALTAELAKKNGYEEPLGDLQSVT
NLKFGNMKVETFYPGKGHTEDNIVVWLPQYQILAGGCLVKSASSKDLGNVADAYVNEWSTSIENVLKRYGNINLVVPGHG
EVGDRGLLLHTLDLLK
>P25910 3.5.2.6~~~ccrA~~~Metallo-beta-lactamase type 2~~~
MKTVFILISMLFPVAVMAQKSVKISDDISITQLSDKVYTYVSLAEIEGWGMVPSNGMIVINNHQAALLDTPINDAQTEML
VNWVTDSLHAKVTTFIPNHWHGDCIGGLGYLQRKGVQSYANQMTIDLAKEKGLPVPEHGFTDSLTVSLDGMPLQCYYLGG
GHATDNIVVWLPTENILFGGCMLKDNQATSIGNISDADVTAWPKTLDKVKAKFPSARYVVPGHGDYGGTELIEHTKQIVN
QYIESTSKP
>P52664 3.5.2.6~~~blaB~~~Beta-lactamase~~~
MTMFKTTFRQTATIAVSLISLLVSPMLWANTNNTIEEQLSTLEKYSQGRLGVALINTEDNSQITYRGEERFAMASTSKVM
AVAAVLKESEKQAGLLDKNITIKKSDLVAYSPITEKHLVTGMSLAQLSAATLQYSDNTAMNKILDYLGGPAKVTQFARSI
NDVTYRLDRKEPELNTAIHGDPRDTTSPIAMAKSLQALTLGDALGQSQRQQLVTWLKGNTTGDHSIKAGLPKHWIVGDKT
GSGDYGTTNDIAVIWPKNHAPLILVVYFTQQEQDAKYRKDIIVKATEIVTKEISNSPQTK
>Q7WYA8 3.5.2.6~~~~~~Metallo-beta-lactamase type 2~~~
MKKLFVLCVCFFCSITAAGAALPDLKIEKLEEGVFVHTSFEEVNGWGVVTKHGLVVLVNTDAYLIDTPFTATDTEKLVNW
FVERGYEIKGTISSHFHSDSTGGIEWLNSQSIPTYASELTNELLKKSGKVQAKYSFSEVSYWLVKNKIEVFYPGPGHTQD
NLVVWLPESKILFGGCFIKPHGLGNLGDANLEAWPKSAKILMSKYGKAKLVVSSHSEKGDASLMKRTWEQALKGLKESKK
TSSPSN
>P52699 3.5.2.6~~~~~~Metallo-beta-lactamase type 2~~~
MSKLSVFFIFLFCSIATAAESLPDLKIEKLDEGVYVHTSFEEVNGWGVVPKHGLVVLVNAEAYLIDTPFTAKDTEKLVTW
FVERGYKIKGSISSHFHSDSTGGIEWLNSRSIPTYASELTNELLKKDGKVQATNSFSGVNYWLVKNKIEVFYPGPGHTPD
NVVVWLPERKILFGGCFIKPYGLGNLGDANIEAWPKSAKLLKSKYGKAKLVVPSHSEVGDASLLKLTLEQAVKGLNESKK
PSKPSN
>P33652 ~~~blaB~~~Beta-lactamase regulatory protein BlaB~~~
MLNSESLLRELRDALHEGGLTGSFLVRDLYTGEELGIDPDTELPTASLVKLPLALATLERIRLGEVDGAQQIEVAPGRIT
TPGPTGLSRFRHPARVAVDDLLYLSTSVSDGTASDALFEITPPAQVEQMVREWGFRDLTVRHSMRELSETPAERFESADA
HLAHALAISAGTSGRGHRVPQLDVARANTGTARAFVDLLEALWAPVLTGPRPGRTSRALPPEPAARLRELMAANLLRHRL
APDFASDAATWSSKTGTLLNLRHEVGVVEHADGQVFAVAVLTESQVPADSQPGAEALMAQVARRLRDRLREW
>P00808 3.5.2.6~~~penP~~~Beta-lactamase~~~
MKLWFSTLKLKKAAAVLLFSCVALAGCANNQTNASQPAEKNEKTEMKDDFAKLEEQFDAKLGIFALDTGTNRTVAYRPDE
RFAFASTIKALTVGVLLQQKSIEDLNQRITYTRDDLVNYNPITEKHVDTGMTLKELADASLRYSDNAAQNLILKQIGGPE
SLKKELRKIGDEVTNPERFEPELNEVNPGETQDTSTARALVTSLRAFALEDKLPSEKRELLIDWMKRNTTGDALIRAGVP
DGWEVADKTGAASYGTRNDIAIIWPPKGDPVVLAVLSSRDKKDAKYDDKLIAEATKVVMKALNMNGK
>P39824 3.5.2.6~~~penP~~~Beta-lactamase~~~COG2367
MKLKTKASIKFGICVGLLCLSITGFTPFFNSTHAEAKSIEDTNMASCITNKKFVQLEKKFDARLGVYAIDIGSNKTIAYR
PNERFAYASTYKVLAAAAVLKKNSIEKLNEVIHYSKDDLVTYSPITEKHLDTGMSLKEISEAAIRYSDNTAGNILLQQLG
GPKGFEKSLKQIGDHVTKAKRFETDLNSAIPGDIRDTSTAKALATDLKAFTLDNTLTTDKRMILTDWMRGNATGDELIRA
GAPIGWEVGDKSGAGSYGTRNDIAIVWPPNRAPIVVAILSNRFTKDANYDNALIAEAAKVVLNDLK
>P22390 3.5.2.6~~~~~~Beta-lactamase~~~
MFKKRGRQTVLIAAVLAFFTASSPLLARTQGEPTQVQQKLAALEKQSGGRLGVALINTADRSQILYRGDERFAMCSTSKT
MVAAAVLKQSETQHDILQQKMVIKKADLTNWNPVTEKYVDKEMTLAELSAATLQYSDNTAMNKLLEHLGGTSNVTAFARS
IGDTTFRLDRKEPELNTAIPGDERDTTCPLAMAKSLHKLTLGDALAGAQRAQLVEWLKGNTTGGQSIRAGLPEGWVVGDK
TGAGDYGTTNDIAVIWPEDRAPLILVTYFTQPQQDAKGRKDILAAAAKIVTEGL
>A5U493 3.5.2.6~~~blaC~~~Beta-lactamase~~~COG2367
MRNRGFGRRELLVAMAMLVSVTGCARHASGARPASTTLPAGADLADRFAELERRYDARLGVYVPATGTTAAIEYRADERF
AFCSTFKAPLVAAVLHQNPLTHLDKLITYTSDDIRSISPVAQQHVQTGMTIGQLCDAAIRYSDGTAANLLLADLGGPGGG
TAAFTGYLRSLGDTVSRLDAEEPELNRDPPGDERDTTTPHAIALVLQQLVLGNALPPDKRALLTDWMARNTTGAKRIRAG
FPADWKVIDKTGTGDYGRANDIAVVWSPTGVPYVVAVMSDRAGGGYDAEPREALLAEAATCVAGVLA
>P9WKD3 3.5.2.6~~~blaC~~~Beta-lactamase~~~COG2367
MRNRGFGRRELLVAMAMLVSVTGCARHASGARPASTTLPAGADLADRFAELERRYDARLGVYVPATGTTAAIEYRADERF
AFCSTFKAPLVAAVLHQNPLTHLDKLITYTSDDIRSISPVAQQHVQTGMTIGQLCDAAIRYSDGTAANLLLADLGGPGGG
TAAFTGYLRSLGDTVSRLDAEEPELNRDPPGDERDTTTPHAIALVLQQLVLGNALPPDKRALLTDWMARNTTGAKRIRAG
FPADWKVIDKTGTGDYGRANDIAVVWSPTGVPYVVAVMSDRAGGGYDAEPREALLAEAATCVAGVLA
>Q9EZQ7 3.5.2.6~~~bla~~~Beta-lactamase AST-1~~~
MTFSALPFRRADRRRLLAAALAACALTLTAACDSGTVTVPVTDSVTTSAVADPRFAELETTSGARLGVFAVDTGSGRTVA
HRADERFPMASTFKGLACGALLREHPLSTGYFDQVIHYSAAELVEYSPVTETRVETGMTVRELCDAAITVSDNTAGNQLL
KLLGGPEGFTASLRSLGDATSRLDRWETDLNTAIPGDERDTTTPAALAADYRALVVGDVLGAPERDQLKAWLVANTTGAT
RIRAGLPADWTVGDKTGSPAYGSALDVAVAWPPGRAPIVIAVLSTKSEQDAEPDNALLAEATRVVVDALG
>Q5YXD6 3.5.2.6~~~bla~~~Beta-lactamase FAR-1~~~COG2367
MPGVDISFLKKSGRRTMAAAAVIALLGGCGADAGSEPATTAASTTAPSAATDAATAEFAALEQRFGARLGVYAVDTTSGA
VVAYRADERFGMASTFKGLACGALLREHPLSSGYFDQVVRYSREEVVSYSPVTETRVDTGMTVAELCHATITVSDNTAGN
QILKLLGGPAGFTAFLRSLGDEVSRLDRWETELNEVPPGEERDTTTPAAVAANYRALVLGDVLAEPERAQLRDWLVANTT
GDQRIRAGVPAGWTVGDKTGGGSHGGNNDVAVAWTETGDPIVIALLSHRTDPAAKADNALLAEATRAVVTALR
>P30897 3.5.2.6~~~blaP~~~Beta-lactamase~~~
MNVRQHKASFFSVVITFLCLTLSLNANATDSVLEAVTNAETELGARIGLAAHDLETGKRWEHKSNERFPLSSTFKTLACA
NVLQRVDLGKERIDRVVRFSESNLVTYSPVTEKHVGKKGMSLAELCQATLSTSDNSAANFILQAIGGPKALTKFLRSIGD
DTTRLDRWEPELNEAVPGDKRDTTTPIAMVTTLEKLLIDETLSIKSRQQLESWLKGNEVGDALFRKGVPSDWIVADRTGA
GGYGSRAITAVMWPPNRKPIVAALYITETDASFEERNAVIAKIGEQIAKTVLMENSRN
>P80298 3.5.2.6~~~~~~Beta-lactamase~~~
NTNNTIEEQLSTLEKYSQGRLGVALINTEDNSQITYRGEERFAMASTSKVMAVAAILKESEKQAGLLDKNIIITKSDLVA
YSPITEKHLATGMSLAQLSAATLQYSDNTAMNKILDYLGGPSKVTQFARSINDVTYRLDRKEPELNTAIHGDPRDTTSPI
AMAKSLQALTLGDALGQSQRQQLVTWLKGNTTGDHSIKAGLPKHWIVGDKTGSGDYGTTNDIAVIWPKNHAPLILVVYFT
QQEQDAKYRKDIIVKATEIVTKEFSNTSQKK
>P80545 3.5.2.6~~~~~~Beta-lactamase~~~
QPANAKANIQQQLSELEKNSGGRLGVALIDTADNSQILYRGDERFPMCSTSKVMAVSALLKQSETDKNLLAKRMEIKQSD
LVNYNPIAEKHLDTGMTLAEFSAATIQYSDNTAMNKILEHLGGPAKVTEFARTIGDKTFRLDRTEPTLNTAIPGDKRDTT
SPQAMAISLQNLTLGKALAEPQRAQLVEWMKGNTTGGASIRAGLPTTWVVGDKTGSGDYGTTNDIAVIWPANHAPLVLVT
YFTQPQQNAEARKDVLAAAAKIVTAGL
>P00807 3.5.2.6~~~blaZ~~~Beta-lactamase~~~
MKKLIFLIVIALVLSACNSNSSHAKELNDLEKKYNAHIGVYALDTKSGKEVKFNSDKRFAYASTSKAINSAILLEQVPYN
KLNKKVHINKDDIVAYSPILEKYVGKDITLKALIEASMTYSDNTANNKIIKEIGGIKKVKQRLKELGDKVTNPVRYEIEL
NYYSPKSKKDTSTPAAFGKTLNKLIANGKLSKENKKFLLDLMLNNKSGDTLIKDGVPKDYKVADKSGQAITYASRNDVAF
VYPKGQSEPIVLVIFTNKDNKSDKPNDKLISETAKSVMKEF
>P14559 3.5.2.6~~~~~~Beta-lactamase~~~
MHPSTSRPSRRTLLTATAGAALAAATLVPGTAHASSGGRGHGSGSVSDAERRLAGLERASGARLGVYAYDTGSGRTVAYR
ADELFPMCSVFKTLSSAAVLRDLDRNGEFLSRRILYTQDDVEQADGAGPETGKPQNLANAQLTVEELCEVSITASDNCAA
NLMLRELGGPAAVTRFVRSLGDRVTRLDRWEPELNSAEPGRVTDTTSPRAITRTYGRLVLGDALNPRDRRLLTSWLLANT
TSGDRFRAGLPDDWTLGDKTGAGRYGTNNDAGVTWPPGRAPIVLTVLTAKTEQDAARDDGLVADAARVLAETLG
>Q59517 3.5.2.6~~~blaF~~~Beta-lactamase~~~
MTGLSRRNVLIGSLVAAAAVGAGVGGAAPAFAAPIDDQLAELERRDNVLIGLYAANLQSGRRITHRLDEMFAMCSTFKGY
AAARVLQMAEHGEISLDNRVFVDADALVPNSPVTEARAGAEMTLAELCQAALQRSDNTAANLLLKTIGGPAAVTAFARSV
GDERTRLDRWEVELNSAIPGDPRDTSTAAALAVGYRAILAGDALSPPQRGLLEDWMRANQTSSMRAGLPEGWTTADKTGS
GDYGSTNDAGIAFGPDGQRLLLVMMTRSQAHDPKAENLRPLIGELTALVLPSLL
>P06555 ~~~blaI~~~Penicillinase repressor~~~
MKKIPQISDAELEVMKVIWKHSSINTNEVIKELSKTSTWSPKTIQTMLLRLIKKGALNHHKEGRVFVYTPNIDESDYIEV
KSHSFLNRFYNGTLNSMVLNFLENDQLSGEEINELYQILEEHKNRKKE
>P9WMJ5 ~~~blaI~~~Transcriptional regulator BlaI~~~COG3682
MAKLTRLGDLERAVMDHLWSRTEPQTVRQVHEALSARRDLAYTTVMTVLQRLAKKNLVLQIRDDRAHRYAPVHGRDELVA
GLMVDALAQAEDSGSRQAALVHFVERVGADEADALRRALAELEAGHGNRPPAGAATET
>P0A042 ~~~blaI~~~Penicillinase repressor~~~
MANKQVEISMAEWDVMNIIWGKKSVSANEIVVEIQKYKEVSDKTIRTLITRLYKKEIIKRYKSENIYFYSSNIKEDDIKM
KTAKTFLNKLYGGDMKSLVLNFAKNEELNNKEIEELRDILNDISKK
>C7C422 3.5.2.6~~~blaNDM-1~~~Metallo-beta-lactamase type 2~~~
MELPNIMHPVAKLSTALAAALMLSGCMPGEIRPTIGQQMETGDQRFGDLVFRQLAPNVWQHTSYLDMPGFGAVASNGLIV
RDGGRVLVVDTAWTDDQTAQILNWIKQEINLPVALAVVTHAHQDKMGGMDALHAAGIATYANALSNQLAPQEGMVAAQHS
LTFAANGWVEPATAPNFGPLKVFYPGPGHTSDNITVGIDGTDIAFGGCLIKDSKAKSLGNLGDADTEHYAASARAFGAAF
PKASMIVMSHSAPDSRAAITHTARMADKLR
>P52663 3.5.2.6~~~nmcA~~~Imipenem-hydrolyzing beta-lactamase~~~
MSLNVKQSRIAILFSSCLISISFFSQANTKGIDEIKNLETDFNGRIGVYALDTGSGKSFSYRANERFPLCSSFKGFLAAA
VLKGSQDNRLNLNQIVNYNTRSLEFHSPITTKYKDNGMSLGDMAAAALQYSDNGATNIILERYIGGPEGMTKFMRSIGDE
DFRLDRWELDLNTAIPGDERDTSTPAAVAKSLKTLALGNILSEHEKETYQTWLKGNTTGAARIRASVPSDWVVGDKTGSC
GAYGTANDYAVVWPKNRAPLIISVYTTKNEKEAKHEDKVIAEASRIAIDNLK
>P52682 3.5.2.6~~~smeA~~~Carbapenem-hydrolyzing beta-lactamase Sme-1~~~
MSNKVNFKTASFLFSVCLALSAFNAHANKSDAAAKQIKKLEEDFDGRIGVFAIDTGSGNTFGYRSDERFPLCSSFKGFLA
AAVLERVQQKKLDINQKVKYESRDLEYHSPITTKYKGSGMTLGDMASAALQYSDNGATNIIMERFLGGPEGMTKFMRSIG
DNEFRLDRWELELNTAIPGDKRDTSTPKAVANSLNKLALGNVLNAKVKAIYQNWLKGNTTGDARIRASVPADWVVGDKTG
SCGAIGTANDYAVIWPKNRAPLIVSIYTTRKSKDDKHSDKTIAEASRIAIQAID
>C0H419 ~~~yngHB~~~Biotin/lipoyl attachment protein~~~COG0511
MTVSIQMAGNLWKVHVKAGDQIEKGQEVAILESMKMEIPIVADRSGIVKEVKKKEGDFVNEGDVLLELSNSTQ
>P12287 ~~~blaR1~~~Regulatory protein BlaR1~~~
MSSSFFIPFLVSQILLSLFFSIIILIKKLLRTQITVGTHYYISVISLLALIAPFIPFHFLKSHHFDWILNLGGAQSALSQ
THSTDKTTEAIGQHVNWVQDFSLSIEQSSSKMIDSAFFAVWILGVAVMLLATLYSNLKIGKIKKNLQIVNNKELLSLFHT
CKEEIRFHQKVILSRSPLIKSPITFGVIRPYIILPKDISMFSADEMKCVLLHELYHCKRKDMLINYFLCLLKIVYWFNPL
VWYLSKEAKTEMEISCDFAVLKTLDKKLHLKYGEVILKFTSIKQRTSSLLAASEFSSSYKHIKRRIVTVVNFQTASPLLK
AKSALVFTLVLGAILAGTPSVSILAMQKETRFLPGTNVEYEDYSTFFDKFSASGGFVLFNSNRKKYTIYNRKESTSRFAP
ASTYKVFSALLALESGIITKNDSHMTWDGTQYPYKEWNQDQDLFSAMSSSTTWYFQKLDRQIGEDHLRHYLKSIHYGNED
FSVPADYWLDGSLQISPLEQVNILKKFYDNEFDFKQSNIETVKDSIRLEESNGRVLSGKTGTSVINGELHAGWFIGYVET
ADNTFFFAVHIQGEKRAAGSSAAEIALSILDKKGIYPSVSR
>P18357 ~~~blaR1~~~Regulatory protein BlaR1~~~
MAKLLIMSIVSFCFIFLLLLFFRYILKRYFNYMLNYKVWYLTLLAGLIPFIPIKFSLFKFNNVNNQAPTVESKSHDLNHN
INTTKPIQEFATDIHKFNWDSIDNISTVIWIVLVIILSFKFLKALLYLKYLKKQSLYLNENEKNKIDTILFNHQYKKNIV
IRKAETIQSPITFWYGKYIILIPSSYFKSVIDKRLKYIILHEYAHAKNRDTLHLIIFNIFSIIMSYNPLVHIVKRKIIHD
NEVEADRFVLNNINKNEFKTYAESIMDSVLNVPFFNKNILSHSFNGKKSLLKRRLINIKEANLKKQSKLILIFICIFTFL
LMVIQSQFLMGQSITDYNYKKPLHNDYQILDKSKIFGSNSGSFVMYSMKKDKYYIYNEKESRKRYSPNSTYKIYLAMFGL
DRHIINDENSRMSWNHKHYPFDAWNKEQDLNTAMQNSVNWYFERISDQIPKNYTATQLKQLNYGNKNLGSYKSYWMEDSL
KISNLEQVIVFKNMMEQNNHFSKKAKNQLSSSLLIKKNEKYELYGKTGTGIVNGKYNNGWFVGYVITNHDKYYFATHLSD
GKPSGKNAELISEKILKEMGVLNGQ
>P62593 3.5.2.6~~~bla~~~Beta-lactamase TEM~~~
MSIQHFRVALIPFFAAFCLPVFAHPETLVKVKDAEDQLGARVGYIELDLNSGKILESFRPEERFPMMSTFKVLLCGAVLS
RVDAGQEQLGRRIHYSQNDLVEYSPVTEKHLTDGMTVRELCSAAITMSDNTAANLLLTTIGGPKELTAFLHNMGDHVTRL
DRWEPELNEAIPNDERDTTMPAAMATTLRKLLTGELLTLASRQQLIDWMEADKVAGPLLRSALPAGWFIADKSGAGERGS
RGIIAALGPDGKPSRIVVIYTTGSQATMDERNRQIAEIGASLIKHW
>Q48406 3.5.2.6~~~~~~Beta-lactamase TEM-12~~~
MSIQHFRVALIPFFAAFCLPVFAHPETLVKVKDAEDQLGARVGYIELDLNSGKILESFRPEERFPMMSTFKVLLCGAVLS
RVDAGQEQLGRRIHYSQNDLVEYSPVTEKHLTDGMTVRELCSAAITMSDNTAANLLLTTIGGPKELTAFLHNMGDHVTRL
DSWEPELNEAIPNDERDTTMPAAMATTLRKLLTGELLTLASRQQLIDWMEADKVAGPLLRSALPAGWFIADKSGAGERGS
RGIIAALGPDGKPSRIVVIYTTGSQATMDERNRQIAEIGASLIKHW
>P28585 3.5.2.6~~~bla~~~Beta-lactamase CTX-M-1~~~
MVKKSLRQFTLMATATVTLLLGSVPLYAQTADVQQKLAELERQSGGRLGVALINTADNSQILYRADERFAMCSTSKVMAV
AAVLKKSESEPNLLNQRVEIKKSDLVNYNPIAEKHVDGTMSLAELSAAALQYSDNVAMNKLISHVGGPASVTAFARQLGD
ETFRLDRTEPTLNTAIPGDPRDTTSPRAMAQTLRNLTLGKALGDSQRAQLVTWMKGNTTGAASIQAGLPASWVVGDKTGS
GDYGTTNDIAVIWPKDRAPLILVTYFTQPQPKAESRRDVLASAAKIVTNGL
>O33807 3.5.2.6~~~bla~~~Beta-lactamase CTX-M-4~~~
MMTQSIRRSMLTVMATLPLLFSSATLHAQANSVQQQLEALEKSSGGRLGVAQINTADNSQILYVADERFAMCSTSKVMAA
AAVLKQSESDKHLLNQRVEIRASDLVNYNPIAEKHVNGTMTLAELGAGALQYSDNTAMNKLIAHLGGPDKVTAFARSLGD
ETFRLDRTEPTLNSAIPGDPRDTTTPLAMAQTLKNLTLGKALAETQRAQLVTWLKGNTTGSASIRAGMPKSWGVGDKTGS
GDYGTTNDIAVIWPENHAPLVLVTYFTQPEQKAESRRDILAAAAKIVTHGF
>P81781 3.5.2.6~~~carB6~~~Beta-lactamase CARB-6~~~
MKFLLAFSLLIPSVVFASSSKFQQVEQDVKAIEVSLSARIGVSVLDTQNGEYWDYNGNQRFPLTSTFKTIACAKLLYDAE
QGKVNPNSTVEIKKADLVTYSPVIEKQVGQAITLDDACFATMTTSDNTAANIILSAVGGPKGVTDFLRQIGDKETRLDRI
EPDLNEGKLGDLRDTTTPKAIASTLNQLLFGSTLSEASQKKLESWMVNNQVTGNLLRSVLPVKWSIADRSGAGGFGARSI
TAIVWSEEKKTIIVSIYLAQTEASMAERNDAIVKIGRSIFEVYTSQSR
>E1ANH6 3.5.2.6~~~bla~~~Beta-lactamase CTX-M-97~~~
MMTQSIGRSMLTVMATLPLLFSSATLHAQANSVQQQLEALEKSSGGRLGVALINTADNSQILYRADERFAMCSTSKVMAA
AAVLKQSESDKHLLNQRVEIKKSDLVNYNPIAEKHVNGTMTLAELGAAALQYSDNTAMNKLIAHLGGPDKVTAFARSLGD
ETFRLDRTEPTLNTAIPGDPRDTTTPLAMAQTLKNLTLGKALAETQRAQLVTWLKGNTTGSASIRAGLPKSWVVGDKTGS
GDYGTTNDIAVIWPENHAPLVLVTYFTQPEQKAESRRDILAAAAKIVTHGF
>P82243 ~~~~~~Antimicrobial peptide LCI~~~
AIKLVQSPNGNFAASFVLDGTKWIFKSKYYDSSKGYWVGIYEVWDRK
>P0A901 ~~~blc~~~Outer membrane lipoprotein Blc~~~COG3040
MRLLPLVAAATAAFLVVACSSPTPPRGVTVVNNFDAKRYLGTWYEIARFDHRFERGLEKVTATYSLRDDGGLNVINKGYN
PDRGMWQQSEGKAYFTGAPTRAALKVSFFGPFYGGYNVIALDREYRHALVCGPDRDYLWILSRTPTISDEVKQEMLAVAT
REGFDVSKFIWVQQPGS
>P37321 3.5.2.6~~~per1~~~Extended-spectrum beta-lactamase PER-1~~~
MNVIIKAVVTASTLLMVSFSSFETSAQSPLLKEQIESIVIGKKATVGVAVWGPDDLEPLLINPFEKFPMQSVFKLHLAML
VLHQVDQGKLDLNQTVIVNRAKVLQNTWAPIMKAYQGDEFSVPVQQLLQYSVSHSDNVACDLLFELVGGPAALHDYIQSM
GIKETAVVANEAQMHADDQVQYQNWTSMKGAAEILKKFEQKTQLSETSQALLWKWMVETTTGPERLKGLLPAGTVVAHKT
GTSGIKAGKTAATNDLGIILLPDGRPLLVAVFVKDSAESSRTNEAIIAQVAQTAYQFELKKLSALSPN
>P13081 ~~~ble~~~Bleomycin resistance protein~~~
MTDQATPNLPSRDFDSTAAFYERLGFGIVFRDAGWMILQRGDLMLEFFAHPGLDPLASWFSCCLRLDDLAEFYRQCKSVG
IQETSSGYPRIHAPELQEWGGTMAALVDPDGTLLRLIQNELLAGIS
>Q7A8D1 ~~~ble~~~Bleomycin resistance protein~~~
MLQSIPALPVGDIKKSIGFYCDKLGFTLVHHEDGFAVLMCNEVRIHLWEASDEGWRSRSNDSPVCTGAESFIAGTASCRI
EVEGIDELYQHIKPLGILHPNTSLKDQWWDERDFAVIDPDNNLISFFQQIKS
>Q7DJ53 ~~~ble~~~Bleomycin resistance protein~~~
MLQSIPALPVGDIKKSIGFYCDKLGFTLVHHEDGFAVLMCNEVRIHLWEASDEGWRSRSNDSPVCTGAESFIAGTASCRI
EVEGIDELYQHIKPLGILHPNTSLKDQWWDERDFAVIDPDNNLISFFQQIKS
>P17493 ~~~ble~~~Bleomycin resistance protein~~~
MAKLTSAVPVLTARDVAGAVEFWTDRLGFSRDFVEDDFAGVVRDDVTLFISAVQDQVVPDNTLAWVWVRGLDELYAEWSE
VVSTNFRDASGPAMTEIGEQPWGREFALRDPAGNCVHFVAEEQD
>Q8UAA9 3.-.-.-~~~blh~~~Beta-lactamase hydrolase-like protein~~~COG0491
MKAVRINERLTIAGQPMIADFPSLSAQGFKSIINARPDGEEPGQPGNTQEKSAAGAAGMDYGFIPVSGPTITEADIRAFQ
QKMAEAEGPVFAHCKGGTRALTLYVLGEALDGRIQRSDIEDFGKTHGFDLCAATRWLERQSAAVPHIKAFFDPRTWSVQY
VVSDPATGGCAIIDPVYDFDEKSGATGTMNADAILDYVKRHGLSVEWILDTHPHADHFSAADYLKQKTGAKTAIGAKVTG
VQKLWQEKYNWSDFKTDGSQWDQLFEAGDRFSIGSLEARVLFSPGHTLASVTYVVGNAAFVHDTLFMPDSGTARADFPGG
SAKQLWASIQDILALPDDTRLFTGHDYQPGGRAPKWESTVGEQTRSNPHLAGMTEEDFVRLREARDRTLPMPKLILHALQ
VNIRGGRLPEPEANGKHYLKFPLDVLEGSTW
>Q4PNI0 1.13.11.63~~~blh~~~Beta-carotene 15,15'-dioxygenase~~~
MGLMLIDWCALALVVFIGLPHGALDAAISFSMISSAKRIARLAGILLIYLLLATAFFLIWYQLPAFSLLIFLLISIIHFG
MADFNASPSKLKWPHIIAHGGVVTVWLPLIQKNEVTKLFSILTNGPTPILWDILLIFFLCWSIGVCLHTYETLRSKHYNI
AFELIGLIFLAWYAPPLVTFATYFCFIHSRRHFSFVWKQLQHMSSKKMMIGSAIILSCTSWLIGGGIYFFLNSKMIASEA
ALQTVFIGLAALTVPHMILIDFIFRPHSSRIKIKN
>Q9PFB0 3.-.-.-~~~blh~~~Beta-lactamase hydrolase-like protein~~~COG0491
MRIVDINERLAISGQPNTDEFINFARRGYRSIINLRPDGEEPNQPGNDAEQAAARRAGLAYNFVPVIGTSITEADIQAFQ
RAIATTEGSVLVHCKSGTRALMLYALSEVIDGRMKRDEVEALGHAHGFDLGRAVTWLERQAIQTPRVSGFFDPRTSSIQY
VVTDQTTKRCAIIDPVLDFDEKSGATATTNADAILAHVEQQGLTVEWILDTHPHADHFSAAQYLKQRTGAPTAIGTHVTE
VQRLWREIYNWPTLSANGSQWDHLFADGDVFNVGSIKGRVMFSPGHTLASVTYVIGDTAFVHDTIFMPDAGTARADFPGG
SARALWSSIQTILSLPDETRLFTGHDYQPSGRHPRWESTVGEQKKANPHLAGVDETTFVALREARDKTLPMPKLILHALQ
VNVLGGRLPEPETNGRRYLKFPLNALEGAAW
>P35804 ~~~~~~Beta-lactamase inhibitory protein~~~
MRTVGIGAGVRRLGRAVVMAAAVGGLVLGSAGASNAAGVMTGAKFTQIQFGMTRQQVLDIAGAENCETGGSFGDSIHCRG
HAAGDYYAYATFGFTSAAADAKVDSKSQEKLLAPSAPTLTLAKFNQVTVGMTRAQVLATVGQGSCTTWSEYYPAYPSTAG
VTLSLSCFDVDGYSSTGFYRGSAHLWFTDGVLQGKRQWDLV
>Q848S6 3.5.2.6~~~bla~~~Carbapenem-hydrolyzing beta-lactamase KPC~~~
MSLYRRLVLLSCLSWPLAGFSATALTNLVAEPFAKLEQDFGGSIGVYAMDTGSGATVSYRAEERFPLCSSFKGFLAAAVL
ARSQQQAGLLDTPIRYGKNALVPWSPISEKYLTTGMTVAELSAAAVQYSDNAAANLLLKELGGPAGLTAFMRSIGDTTFR
LDRWELELNSAIPGDARDTSSPRAVTESLQKLTLGSALAAPQRQQFVDWLKGNTTGNHRIRAAVPADWAVGDKTGTCGVY
GTANDYAVVWPTGRAPIVLAVYTRAPNKDDKHSEAVIAAAARLALEGLGVNGQ
>Q9F663 3.5.2.6~~~bla~~~Carbapenem-hydrolyzing beta-lactamase KPC~~~
MSLYRRLVLLSCLSWPLAGFSATALTNLVAEPFAKLEQDFGGSIGVYAMDTGSGATVSYRAEERFPLCSSFKGFLAAAVL
ARSQQQAGLLDTPIRYGKNALVPWSPISEKYLTTGMTVAELSAAAVQYSDNAAANLLLKELGGPAGLTAFMRSIGDTTFR
LDRWELELNSAIPGDARDTSSPRAVTESLQKLTLGSALAAPQRQQFVDWLKGNTTGNHRIRAAVPADWAVGDKTGTCGVY
GTANDYAVVWPTGRAPIVLAVYTRAPNKDDKHSEAVIAAAARLALEGLGVNGQ
>P14489 3.5.2.6~~~bla~~~Beta-lactamase OXA-10~~~
MKTFAAYVIIACLSSTALAGSITENTSWNKEFSAEAVNGVFVLCKSSSKSCATNDLARASKEYLPASTFKIPNAIIGLET
GVIKNEHQVFKWDGKPRAMKQWERDLTLRGAIQVSAVPVFQQIAREVGEVRMQKYLKKFSYGNQNISGGIDKFWLEGQLR
ISAVNQVEFLESLYLNKLSASKENQLIVKEALVTEAAPEYLVHSKTGFSGVGTESNPGVAWWVGWVEKETEVYFFAFNMD
IDNESKLPLRKSIPTKIMESEGIIGG
>Q06778 3.5.2.6~~~bla~~~Beta-lactamase OXA-11~~~
MKTFAAYVIIACLSSTALAGSITENTSWNKEFSAEAVNGVFVLCKSSSKSCATNDLARASKEYLPASTFKIPNAIIGLET
GVIKNEHQVFKWDGKPRAMKQWERDLTLRGAIQVSAVPVFQQIAREVGEVRMQKYLKKFSYGSQNISGGIDKFWLEDQLR
ISAVNQVEFLESLYLNKLSASKENQLIVKEALVTEAAPEYLVHSKTGFSGVGTESNPGVAWWVGWVEKETEVYFFAFNMD
IDNESKLPLRKSIPTKIMESEGIIGG
>Q51574 3.5.2.6~~~bla~~~Beta-lactamase OXA-15~~~
MAIRIFAILFSIFSLATFAHAQEGTLERSDWRKFFSEFQAKGTIVVADERQADRAMLVFDPVRSKKRYSPASTFKIPHTL
FALDAGAVRDEFQIFRWDGVNRGFAGHNQDQDLRSAMRNSTVWVYELFAKEIGDDKARRYLKKIDYGNAGPSTSNGDYWI
EGSLAISAQEQIAFLRKLYRNELPFRVEHQRLVKDLMIVEAGRNWILRAKTGWEGRMGWWVGWVEWPTGSVFFALNIDTP
NRMDDLFKREAIVRAILRSIEALPPNPAVNSDAAR
>O07293 3.5.2.6~~~bla~~~Beta-lactamase OXA-18~~~
MQRSLSMSGKRHFIFAVSFVISTVCLTFSPANAAQKLSCTLVIDEASGDLLHREGSCDKAFAPMSTFKLPLAIMGYDADI
LLDATTPRWDYKPEFNGYKSQQKPTDPTIWLKDSIVWYSQELTRRLGESRFSDYVQRFDYGNKDVSGDPGKHNGLTHAWL
ASSLKISPEEQVRFLRRFLRGELPVSEDALEMTKAVVPHFEAGDWDVQGKTGTGSLSDAKGGKAPIGWFIGWATRDDRRV
VFARLTVGARKGEQPAGPAARDEFLNTLPALSENF
>P13661 3.5.2.6~~~bla~~~Beta-lactamase OXA-1~~~
MKNTIHINFAIFLIIANIIYSSASASTDISTVASPLFEGTEGCFLLYDASTNAEIAQFNKAKCATQMAPDSTFKIALSLM
AFDAEIIDQKTIFKWDKTPKGMEIWNSNHTPKTWMQFSVVWVSQEITQKIGLNKIKNYLKDFDYGNQDFSGDKERNNGLT
EAWLESSLKISPEEQIQFLRKIINHNLPVKNSAIENTIENMYLQDLDNSTKLYGKTGAGFTANRTLQNGWFEGFIISKSG
HKYVFVSALTGNLGSNLTSSIKAKKNAITILNTLNL
>P22391 3.5.2.6~~~bla~~~Beta-lactamase OXY-1~~~COG2367
MLKSSWRKTALMAAAAVPLLLASGSLWASADAIQQKLADLEKRSGGRLGVALINTADDSQTLYRGDERFAMCSTGKVMAA
AAVLKQSESNPEVVNKRLEIKKSDLVVWSPITEKHLQSGMTLAELSAAALQYSDNTAMNKMISYLGGPEKVTAFAQSIGD
VTFRLDRTEPALNSAIPGDKRDTTTPLAMAESLRKLTLGNALGEQQRAQLVTWLKGNTTGGQSIRAGLPASWAVGDKTGA
GDYGTTNDIAVIWPENHAPLVLVTYFTQPQQDAKSRKEVLAAAAKIVTEGL
>P23954 3.5.2.6~~~bla~~~Beta-lactamase OXY-2~~~
MIKSSWRKIAMLAAAVPLLLASGALWASTDAIHQKLTDLEKRSGGRLGVALINTADNSQILYRGDERFAMCSTSKVMAAA
AVLKQSESNKEVVNKRLEINAADLVVWSPITEKHLQSGMTLAELSAATLQYSDNTAMNLIIGYLGGPEKVTAFARSIGDA
TFRLDRTEPTLNTAIPGDERDTSTPLAMAESLRKLTLGDALGEQQRAQLVTWLKGNTTGGQSIRAGLPESWVVGDKTGAG
DYGTTNDIAVIWPEDHAPLVLVTYFTQPQQDAKNRKEVLAAAAKIVTEGL
>P0A1V8 3.5.2.6~~~bla~~~Beta-lactamase OXA-2~~~
MAIRIFAILFSIFSLATFAHAQEGTLERSDWRKFFSEFQAKGTIVVADERQADRAMLVFDPVRSKKRYSPASTFKIPHTL
FALDAGAVRDEFQIFRWDGVNRGFAGHNQDQDLRSAMRNSTVWVYELFAKEIGDDKARRYLKKIDYGNADPSTSNGDYWI
EGSLAISAQEQIAFLRKLYRNELPFRVEHQRLVKDLMIVEAGRNWILRAKTGWEGRMGWWVGWVEWPTGSVFFALNIDTP
NRMDDLFKREAIVRAILRSIEALPPNPAVNSDAAR
>Q03170 3.5.2.6~~~pse1~~~Beta-lactamase PSE-1~~~
MKFLLAFSLLIPSVVFASSSKFQQVEQDVKAIEVSLSARIGVSVLDTQNGEYWDYNGNQRFPLTSTFKTIACAKLLYDAE
QGKVNPNSTVEIKKADLVTYSPVIEKQVGQAITLDDACFATMTTSDNTAANIILSAVGGPKGVTDFLRQIGDKETRLDRI
EPDLNEGKLGDLRDTTTPKAIASTLNKFLFGSALSEMNQKKLESWMVNNQVTGNLLRSVLPAGWNIADRSGAGGFGARSI
TAVVWSEHQAPIIVSIYLAQTQASMAERNDAIVKIGHSIFDVYTSQSR
>P16897 3.5.2.6~~~pse4~~~Beta-lactamase PSE-4~~~
MKFLLAFSLLIPSVVFASSSKFQQVEQDVKAIEVSLSARIGVSVLDTQNGEYWDYNGNQRFPLTSTFKTIACAKLLYDAE
QGKVNPNSTVEIKKADLVTYSPVIEKQVGQAITLDDACFATMTTSDNTAANIILSAVGGPKGVTDFLRQIGDKETRLDRI
EPDLNEGKLGDLRDTTTPKAIASTLNKFLFGSALSEMNQKKLESWMVNNQVTGNLLRSVLPAGWNIADRSGAGGFGARSI
TAVVWSEHQAPIIVSIYLAQTQASMEERNDAIVKIGHSIFDVYTSQSR
>P56976 ~~~blr~~~Divisome-associated membrane protein Blr~~~
MNRLIELTGWIVLVVSVILLGVASHIDNYQPPEQSASVQHK
>P0DJQ7 6.3.3.4~~~bls~~~Carboxyethyl-arginine beta-lactam-synthase~~~COG0367
MGAPVLPAAFGFLASARTGGGRAPGPVFATRGSHTDIDTPQGERSLAATLVHAPSVAPDRAVARSLTGAPTTAVLAGEIY
NRDELLSVLPAGPAPEGDAELVLRLLERYDLHAFRLVNGRFATVVRTGDRVLLATDHAGSVPLYTCVAPGEVRASTEAKA
LAAHRDPKGFPLADARRVAGLTGVYQVPAGAVMDIDLGSGTAVTHRTWTPGLSRRILPEGEAVAAVRAALEKAVAQRVTP
GDTPLVVLSGGIDSSGVAACAHRAAGELDTVSMGTDTSNEFREARAVVDHLRTRHREITIPTTELLAQLPYAVWASESVD
PDIIEYLLPLTALYRALDGPERRILTGYGADIPLGGMHREDRLPALDTVLAHDMATFDGLNEMSPVLSTLAGHWTTHPYW
DREVLDLLVSLEAGLKRRHGRDKWVLRAAMADALPAETVNRPKLGVHEGSGTTSSFSRLLLDHGVAEDRVHEAKRQVVRE
LFDLTVGGGRHPSEVDTDDVVRSVADRTARGAA
>Q47066 3.5.2.6~~~bla~~~Beta-lactamase Toho-1~~~
MMTQSIRRSMLTVMATLPLLFSSATLHAQANSVQQQLEALEKSSGGRLGVALINTADNSQILYRADERFAMCSTSKVMAA
AAVLKQSESDKHLLNQRVEIKKSDLVNYNPIAEKHVNGTMTLAELGAAALQYSDNTAMNKLIAHLGGPDKVTAFARSLGD
ETFRLDRTEPTLNTAIPGDPRDTTTPLAMAQTLKNLTLGKALAETQRAQLVTWLKGNTTGSASIRAGLPKSWVVGDKTGS
GDYGTTNDIAVIWPENHAPLVLVTYFTQPEQKAERRRDILAAAAKIVTHGF
>D4FZ53 2.3.1.57~~~bltD~~~Probable spermine N(1)-acetyltransferase~~~
MSINIKAVTDDNRAAILDLHVSQNQLSYIESTKVCLEDAKECHYYKPVGLYYEGDLVGFAMYGLFPEYDEDNKNGRVWLD
RFFIDKHYQGKGLGKKMLKALIQHLAELYKCKRIYLSIFENNIHAIRLYQRFGFQFNGELDFNGEKVMVKEL
>P39909 2.3.1.57~~~bltD~~~Spermine/spermidine N(1)-acetyltransferase~~~COG0456
MSINIKAVTDDNRAAILDLHVSQNQLSYIESTKVCLEDAKECHYYKPVGLYYEGDLVGFAMYGLFPEYDEDNKNGRVWLD
RFFIDERYQGKGLGKKMLKALIQHLAELYKCKRIYLSIFENNIHAIRLYQRFGFQFNGELDFNGEKVMVKEL
>Q92PC8 1.13.11.79~~~bluB~~~5,6-dimethylbenzimidazole synthase~~~COG0778
MLPDPNGCLTAAGAFSSDERAAVYRAIETRRDVRDEFLPEPLSEELIARLLGAAHQAPSVGFMQPWNFVLVRQDETREKV
WQAFQRANDEAAEMFSGERQAKYRSLKLEGIRKAPLSICVTCDRTRGGAVVLGRTHNPQMDLYSTVCAVQNLWLAARAEG
VGVGWVSIFHESEIKAILGIPDHVEIVAWLCLGFVDRLYQEPELAAKGWRQRLPLEDLVFEEGWGVR
>Q2RNG5 1.13.11.79~~~~~~5,6-dimethylbenzimidazole synthase~~~COG0778
MRTGPLFDPSFRDGLDALFQWRRDVRHFRKDPIDEETVARLLACADLAPSVGNSQPWRFVRVDDGARRGVIIDDFTRCNA
AARALQPEERQDAYARLKLEGLREAPLQLAVFCDEATDQGHGLGQATMPETRRYSVVMAIHTLWLAARARGLGVGWVSVL
DPQTVTAALDVPAEWAFVAYLCIGWPREEHPIPELERLGWQSRRPHPVVRR
>P75990 ~~~bluF~~~Blue light- and temperature-regulated antirepressor BluF~~~COG2200
MLTTLIYRSHIRDDEPVKKIEEMVSIANRRNMQSDVTGILLFNGSHFFQLLEGPEEQVKMIYRAICQDPRHYNIVELLCD
YAPARRFGKAGMELFDLRLHERDDVLQAVFDKGTSKFQLTYDDRALQFFRTFVLATEQSTYFEIPAEDSWLFIADGSDKE
LDSCALSPTINDHFAFHPIVDPLSRRIIAFEAIVQKNEDSPSAIAVGQRKDGEIYTADLKSKALAFTMAHALELGDKMIS
INLLPMTLVNEPDAVSFLLNEIKANALVPEQIIVEFTESEVISRFDEFAEAIKSLKAAGISVAIDHFGAGFAGLLLLSRF
QPDRIKISQELITNVHKSGPRQAIIQAIIKCCTSLEIQVSAMGVATPEEWMWLESAGIEMFQGDLFAKAKLNGIPSIAWP
EKK
>P75989 ~~~bluR~~~HTH-type transcriptional repressor BluR~~~COG0789
MAYYSIGDVAERCGINPVTLRAWQRRYGLLKPQRSEGGHRLFDEEDIQRIEEIKRWISNGVPVGKVKALLETTSQDTEDD
WSRLQEEMMSILRMANPAKLRARIISLGREYPVDQLINHVYLPVRQRLVLDHNTSRIMSSMFDGALIEYAATSLFEMRRK
PGKEAILMAWNVEERARLWLEAWRLSLSGWHISVLADPIESPRPELFPTQTLIVWTGMAPTRRQNELLQHWGEQGYKVIF
HAP
>P43506 ~~~bm3R1~~~HTH-type transcriptional repressor Bm3R1~~~
MESTPTKQKAIFSASLLLFAERGFDATTMPMIAENAKVGAGTIYRYFKNKESLVNELFQQHVNEFLQCIESGLANERDGY
RDGFHHIFEGMVTFTKNHPRALGFIKTHSQGTFLTEESRLAYQKLVEFVCTFFREGQKQGVIRNLPENALIAILFGSFME
VYEMIENDYLSLTDELLTGVEESLWAALSRQS
>A0A0H3GGE2 ~~~bmaC~~~Adhesin BmaC autotransporter~~~
MPNLANQDFTQFKREQQTAPAWFRILRGGRMTKWKGKVASYAPHVAPAIGWAFSRTLQLSLISLVMAGTAAASDRYWDSN
GTAVGRGGSGAWNTSNAFWSPSGDGVSGPYSAWNNAAFDTAIFGGTAGTVTLGSPITVGAMTFETTGYILSGNTLTLGTA
TPTITTSSGTTTINSVLAGTNGLTKAGDGILSLTGANTFSGNIIVTGGTLSVNSSAALGAAANEISLANGAGLNSSGSLA
GRSVTLTGGQAAIGGAGVGDAHFTGAGGLRASSSVTLSDDSNDYTGQTSLSSGGTLFFSSIGNLGEVSALGAPVDEAAGT
ISLVVGSSASASATYTGSGASSNRNWQLSSRFYANSTISNRGSGTLTLTGNIFNNHTNSSLSARNINFDAGTADIELLGT
ISSNNNGVGVVFGGTAGRTIKVSGDNTFGGAAIIQNITVQVGSLKNTGDPSALGTGTGAAGAISINSGILSYLGAGDSSD
RNFTAQNNAILANDGTGALTLSGDVALTGTLTLGGSFAGTNTLAGTVSGTGNLRVDGAGSWILSSANTFTGDVGVNSGTL
VVGNMQALGTTPKAATVNGGTLDLGAFDTTLSSLSGTGGNVNLGGATLTVKGSTSTDFAGSMTGSGNLLKQGTSTLTLTG
ASSFTGDTTINGGAISLNFKNATALTDNILSTSSTLNLAGGTFNVIGMDNAANSQTVDGLNVTTGNNKITTTSGSGGTLT
LNLGAINRTGGFIDFGINADTTITTTSATLGGWATVNSTDYAKVDGGVIKALDESDYANKDDAGTWANGDIVSDAGGAAN
TPYFGTVGTGLQLGGLKYTAAANSTVTIAAGQTLGVNGTIIVANTVGNTNQTINGGSLTGITGGGVLGVLQTGTGTFTIQ
STITDNGGAIGFTKAGAGSVTLTGQNTYSGVTTLSGGILTVTQMANAGMASGIGQSTADPANLMLESGTFRYTGGSVTTD
RGFTLVNGGPARVIEVTGSGSNLAFSGLVTSPDDAGFEKKGAGTLTFLNGSNDYIGATTVSGGTLAVSTLADGGQVSSLG
KSGSDATNLILAGGALNYLGSTTSSDRSFTLGAGNGSIGVANAGTTLSMSGTAVGTGGLTKLGDGTLILSGTNTYTGNTA
VNAGVLRAGSAQAFGPSGLMTVGNGASLELGGYDITVSGLLGAGTVDLGGNTLTSSGSAANSFTGKITGTGGFTRTGGST
QTLSGCNSDYTGKTTIASNGTLSVDCLKNGGQASSIGASSNAPDNLVLNNGTLSYTGNTVTTDRGFTIQGGTGAISVTDA
ATTLTFSGQVVGTGALQKRGTGTLVLMNSNSYRGGTSVDAGTLRAGSSGAFGGGSMSLSNAAGAILDLDGFDTSVTSLSG
GGALGGNVALGGATLTISSGNSNGTSYTGAITGIGNFVKNGNGTQRLTGCASSYSGSTTINGGVLEDSCLADGGSVSSIG
MSSADADNLVINGGVLRYTGSGDSTDRQFTLGASGGNSIESEGTGAILFTSNAAVTFAAANTAQTLTLAGTNTDDNELGA
QLTNNGSGITSLTKTDTGTWFLTNSDSTYTGVTKINGGVLSVDKLANGGLASSIGASSSAASNLIIGNDSTLRYLGTGDT
TDRLFTLASGLTYIESSGSGAIVFTDTGQVALADNNQARTIALGGKNTGDNTLAGSIGDAGTGKTTLAKNDDGTWVLTGN
NTFTGPTNINKGLLKIGNGGTTGSLTSDIVVTDGGLIFNRSDTLNYGGLISGAGFVTQSGSGTTILTGANSYTGATSVSA
GTLLVNGDQSAATGQTSVANGSILGGSGIIGGNVVVTDGALAPGSNGAGTLTINGSLALSAGSILSMQLGQAGVAGGALN
DLIEVKGNLTLDGTLDVAETAGGSYGPGIYRLINYTGSLTDNGLDIGMLPNGAGAIQTAVAGQVNLLAGGTNFNFWDGDV
GPKFNSAVDGGNGTWQNSSGNNNWTDATGNINASYSDGAFAIFTGTAGTVTIDNSLGQVKAEGMQFAIDSYAVTGDKLEL
TGPQSTIRVGDGTTAGAAYIATINSVLTGNTQLEKTDAGTLVLTGANSYTGGTAINGGTIRISSDDNLGVASSDISFDGG
ALNTTANIATDRAIILTGAGTLLTDASTTLSLSGPISGTGALTKSGTGTLLLSGTAVHTGGTTITAGTLQIGNGGTDGSI
DGNIVNNGALVFDRAGTLAYTGSISGTGTLTKNGSSTLTMTGTSTYTGETTVSAGTLALQAGGQIKGTASLTVDGGAEVL
IDGSGSQFATGAGASVVGTGTVTVRDGGTASFDSLTTSNATGTNSTITVAGSGSQMTQTGIATFGLAGTATVDILDGGTM
ISSGASVFVGGQLPMDATGQVTISGAGSQWTIANALYARRGSITVDDGGVVTAGSAVIGYADTGINNPETDLVVTGAGSR
FETTGELAITNSAANAARGSITIADGGVVKVGGGALAMGPGNAVLNIGAAAGGSPAHAGTLDAGTVTMAVGSNQINFNHD
DASTTFSATISGAGSVSQNGPGATLLTGNNSYAGLTTVTAGSLYIDGDQSMATGLTTVNPGGTLGGTGTIGGDVTVASGG
AINPGSFGMAPGTLNINGDLTLASGSTQSFSFGQANIPGGPLNDLINVGGDLVLAGTLQVDTSAGGTMDPGIYRVFNYTG
TLSQNAWTVNLPSPDFYVQTSVAQQINLVNTAGLALRFWDGADPQNKNNGKIEGGNGIWQAFGSAPDNGNDNWTETGNIN
APFQDATFAVFTGEKGTVTVDDSKGAINVSGIQFVTDGYIVNGDAINLVGASGSTIRVGDGTTGGTDTVASIDAEITGAS
QLIKADMGTLILTGDNSYTGGTKITGGTLQVAKDSALGARTGELILDGGTLNTTADMTIDRSVTVDQAGTLDIDTGTTLK
IDGVLSGAGAFVKTGAGRLELAGDDHTYNGDASIASGTLALTGALGGTMNVGIDGRLEATGRVGATTNSGVIALDQEGFG
SLTVNGNYTGKDGRLEIATVLGDDTSLTNRLVIDGDMAGTTQVSVTNRGGLGAQTVEGIKIIQVGGASNGMFLLAGDYMF
NGEQAVVAGAYGYRLYKGGVSTPADGNWYLRSALLNPETPTNPTDPETPLYQPGVPLYESYAGSLQQLNKLGTLQQRVGN
RVWAKHPVPAQSDENGAGPSGNNGIWARIEAAHAEFDPKQSTSRASYDADIWKFQTGIDGMFAETASGKFIGGVYVQYET
VSSSVSSPFGNGSIESSGYGAGATLTWYGESDFYIDGVAQINWFDSDLNSATLGRQLVDGNRAVGYSLSVETGQKIEIGE
GWSLTPQAQLAYSAIRFDDFKDAFDTSVSPENDHDLTGRLGLAINRDAEWLDAQGRRVAMHIYGIGNLYYGFAGASKVDV
SSVRFVSGNERLRGGIGLGGTYDWADSKYSLYGETRFDTSLQNFGDSNVIAGSVGLRVRW
>Q8A8Y4 2.4.1.320~~~~~~1,4-beta-mannosyl-N-acetylglucosamine phosphorylase~~~COG2152
MNKIQIPWEERPVGCTDVMWRYSQNPVIGRYHIPSSNSIFNSAVVPFKDGFAGVFRCDNKAVQMNIFTGFSKDGIHWDIS
HEPIQFKAGNTEMIESEYKYDPRVTWIEDRYWVTWCNGYHGPTIGIAYTFDFVDFFQCENAFLPFNRNGVLFPQKIDGKY
AMLSRPSDNGHTPFGDIYISYSPDMKYWGEHRCVMKVTPFPESAWQCTKIGAGSVPFLTDEGWLLFYHGVITTCNGFRYA
MGSAILDKDHPEKVLYRTREYLIGPAAPYELQGDVPNVVFPCAALQDGERVAVYYGAADTVVGMAFGYIQEIIDFTKRTS
II
>Q92DF6 2.4.1.339~~~~~~Beta-1,2-mannobiose phosphorylase~~~COG2152
MNIYRYEENPLITPLDVKPIHEGFEVIGAFNGGVAEYNGEVLLLLRVAEKPVSEDPEIVLAPVYNAKNKELELQSFRLDD
ENYDFEDPRMIRSKAKLEGFSYLTSLSYIRIARSKDGHHFTLDEKPFLYPFNEYQTFGIEDARVTQIGDTYHVNFSAVSE
FGVADALVTTKDFENLEYQGNIFAPENKDVLIFPEKINGKYYALHRPSLKSIGNLDIWIASSPDLRSFGDHRHLLGIRPG
EYDSGRVGGGCVPIKTEEGWLILYHGATEENRYVMGAALLDLNDPTIVLKRTKTPILEPVADYEKNGFFGDVVFACGAIQ
EGDTLHMYYGVADTSMAGCDMKISEILHQLEVEAK
>B0K2C3 2.4.1.339~~~~~~Beta-1,2-mannobiose phosphorylase~~~
MFRLTRLSNKPILSPIKEHEWEKEAVFNAAVIYEGNKFHLFYRASNNKFVLNTEKPEEKYKFVSSIGYAVSEDGINFERF
DKPVLVGEIPQEAWGVEDPRITKIDNKYYMLYTGFGGRDWLDFRICMVWSDDLKNWKGHRIVLDEPNKDAALLSEKINGK
YVLFHRRMPDIWIAYSDDLVNWYNHKIIMSPKSHTWESKKIGIAGPPIKREDGWLLIYHGVDNNNVYRLGVALLDLKDPS
KVIARQKEPILEPELDWEINGLVPNVVFSCGAVEVNDMYYVYYGAADTHIGVAVIEKEKVKF
>D0LID5 ~~~~~~Bacterial microcompartment protein homohexamer~~~COG4577
MADALGMIEVRGFVGMVEAADAMVKAAKVELIGYEKTGGGYVTAVVRGDVAAVKAATEAGQRAAERVGEVVAVHVIPRPH
VNVDAALPLGRTPGMDKSA
>D0LHE5 ~~~~~~Bacterial microcompartment shell vertex protein~~~COG4576
MVLGKVVGTVVASRKEPRIEGLSLLLVRACDPDGTPTGGAVVCADAVGAGVGEVVLYASGSSARQTEVTNNRPVDATIMA
IVDLVEMGGDVRFRKD
>D0LHE3 ~~~~~~Bacterial microcompartment protein trimer-1~~~COG4577
MDHAPERFDATPPAGEPDRPALGVLELTSIARGITVADAALKRAPSLLLMSRPVSSGKHLLMMRGQVAEVEESMIAAREI
AGAGSGALLDELELPYAHEQLWRFLDAPVVADAWEEDTESVIIVETATVCAAIDSADAALKTAPVVLRDMRLAIGIAGKA
FFTLTGELADVEAAAEVVRERCGARLLELACIARPVDELRGRLFF
>D0LID6 ~~~~~~Bacterial microcompartment protein trimer-2~~~COG4577
MSITLRTYIFLDALQPQLATFIGKTARGFLPVPGQASLWVEIAPGIAINRVTDAALKATKVQPAVQVVERAYGLLEVHHF
DQGEVLAAGSTILDKLEVREEGRLKPQVMTHQIIRAVEAYQTQIINRNSQGMMILPGESLFILETQPAGYAVLAANEAEK
AANVHLVNVTPYGAFGRLYLAGSEAEIDAAAEAAEAAIRSVSGVAQESFRDR
>D0LV02 ~~~~~~Bacterial microcompartment protein trimer-3~~~COG4577
MELRAYTVLDALQPQLVAFLQTVSTGFMPMEQQASVLVEIAPGIAVNQLTDAALKATRCQPGLQIVERAYGLIEMHDDDQ
GQVRAAGDAMLAHLGAREADRLAPRVVSSQIITGIDGHQSQLINRMRHGDMIQAGQTLYILEVHPAGYAALAANEAEKAA
PIKLLEVVTFGAFGRLWLGGGEAEIAEAARAAEGALAGLSGRDNRG
>Q8YMK0 2.4.1.336~~~~~~Beta-monoglucosyldiacylglycerol synthase~~~COG1215
MPANSWPDNDSYKELDPLNSLLSDVSTTEESVVETRDLSLPSRFQGRRGKAALVLTIVWSGTIALHLVSWGSIFILGLTT
VLGIHALGVVFARPRHYQKEIQGSLPFVSILVAAKNEEAVIAKLAKNLCNLEYPNGQYEVWIIDDNSTDKTPHILAELAK
EYDKLKVLRRSAQATGGKSGALNQVLPLTQGEIIAVFDADAQVASDMLLHVVPLFQREKVGAVQVRKAIANAKENFWTKG
QMAEMSLDIWFQQQRTALGGIGELRGNGQFVRRQALDSCGGWNEETITDDLDLTFRLHLDKWDIECLFYPAVQEEGVTTA
IALWHQRNRWAEGGYQRYLDYWDLILKNRMGTRKTWDMLMFMLTMYILPTAAIPDLLMALTRHRPPMLGPVTGLSVTMSV
VGMFAGLRRIRQEQKFQVHTPFVLLLQTMRGTLYMLHWLVVMSSTTARMSFRPKRLKWVKTVHTGTGE
>P74165 2.4.1.336~~~~~~Beta-monoglucosyldiacylglycerol synthase~~~COG1215
MPQFPWKDNDAELSPLEAFLAEWDDPEAEEEDFRNDFFRGSEGRRKKAAVMLMAIWTVVITLHYWVWGSWLVWALTGALS
LQALRLMKATPEEAPPLLTGDASTVPYPQVCLMVAAKNEEAVIGKIVQQLCSLDYPGDRHEVWIVDDNSTDRTPAILDQL
RQQYPQLKVVRRGAGASGGKSGALNEVLAQTQGDIVGVFDADANVPKDLLRRVVPYFASPTFGALQVRKAIANEAVNFWT
RGQGAEMALDAYFQQQRIVTGGIGELRGNGQFVARQALDAVGGWNEQTITDDLDLTIRLHLHQWKVGILVNPPVEEEGVT
TAIALWHQRNRWAEGGYQRYLDYWRWICTQPMGWKKKLDLFSFLLMQYLLPTAAVPDLLMALWQRRFPLLTPLSYLAIGF
SCWGMYYGLKRLTPSEGESPWQQMPALLARTIGGTIYMFHWLIIMPAVTARMAFRPKRLKWVKTVHGAATEDALELKQS
>Q3MB01 2.4.1.336~~~~~~Beta-monoglucosyldiacylglycerol synthase~~~COG1215
MPANSWPDNDSYKELDPLNSLLSEVSTTEESVVETRDLSLPSRFQGRRGKAALVLTIVWSGTIALHLVSWGSIFILGLTT
VLGIHALGVVFARPRHYQKEMQGSLPFVSILVAAKNEEAVIAKLARNLCNLEYPNGQYEVWIIDDNSSDKTPHILAELAK
EYDKLKVLRRSAQATGGKSGALNQVLPLTQGEIIAVFDADAQVASDMLLHVVPLFQREKVGAVQVRKAIANAKENFWTKG
QMAEMSLDIWFQQQRTALGGIGELRGNGQFVRRQALDSCGGWNEETITDDLDLTFRLHLDKWDIECLFYPAVQEEGVTTA
IALWHQRNRWAEGGYQRYLDYWDLILKNRMGTRKTWDMLMFMLTMYILPTAAIPDLLMAVVRHRPPMLGPVTGLSVTMSV
VGMFAGLRRIRQEQKFQVHTPFVLLLQTMRGTLYMLHWLVVMSSTTARMSFRPKRLKWVKTVHTGSGE
>A2RK47 ~~~bmpA~~~ABC transporter nucleoside-binding protein BmpA~~~COG1744
MKKRVIAVSAIALASVAVLAGCRSHDASGTSGKVKTDLKAAIVTDANGVNDRSFNQSAWEGLQSWGKENNLKKGTGYTYF
QSNSASDYTTNYNSAEQQGYKLLFGIGFSLQDATSAAAKNNPKSNFVIVDSVIKDQKNVTSATFADNESAYLAGVAAAKA
TKTNKIGFIGGMQSDVITRFEKGYVAGAKSVKSDIKVDIQYAGSFSDAAKGKTIAAAMYGSGDDVVYQCAGGVGTGVFSE
AKALNSSKNEADKVWVIGVDQDQEYLGKYKSKDGKDSNFVLVSTIKEVGTVVKDIADKTKDGKFPGGTIVTYNLKNGGVD
LGLDNATSEIKDAVAKAKTDIIDGKITVPSK
>E4S1L1 ~~~bmpD~~~Basic membrane protein D~~~
MLKKVYYFLIFLFIVACSSSDDGKSEAKTVSLIVDGAFDDKGFNESSSKAIRKLKADLNINIIEKASTGNSYLGDIANLE
DGNSNLIWGIGFRLSDILFQRASENVSVNYAIIEGVYDEIQIPKNLLNISFRSEEVAFLAGYFASKASKTGKIGFVGGVR
GKVLESFMYGYEAGAKYANSNIKVVSQYVGTFGDFGLGRSTASNMYRDGVDIIFAAAGLSGIGVIEAAKELGPDHYIIGV
DQDQSYLAPNNVIVSAVKKVDSLMYSLTKKYLETGVLDGGKTMFLGLKEDGLGLVLNENLKSNYSEIYNKSLKIGQSIMN
GIIKVPYDKVSYDNFVLQMEN
>P96712 ~~~bmr3~~~Multidrug resistance protein 3~~~COG2814
MDTTTAKQASTKFVVLGLLLGILMSAMDNTIVATAMGNIVADLGSFDKFAWVTASYMVAVMAGMPIYGKLSDMYGRKRFF
LFGLIFFLIGSALCGIAQTMNQLIIFRAIQGIGGGALLPIAFTIIFDLFPPEKRGKMSGMFGAVFGLSSVLGPLLGAIIT
DSISWHWVFYINVPIGALSLFFIIRYYKESLEHRKQKIDWGGAITLVVSIVCLMFALELGGKTYDWNSIQIIGLFIVFAV
FFIAFFIVERKAEEPIISFWMFKNRLFATAQILAFLYGGTFIILAVFIPIFVQAVYGSSATSAGFILTPMMIGSVIGSMI
GGIFQTKASFRNLMLISVIAFFIGMLLLSNMTPDTARVWLTVFMMISGFGVGFNFSLLPAASMNDLEPRFRGTANSTNSF
LRSFGMTLGVTIFGTVQTNVFTNKLNDAFSGMKGSAGSGAAQNIGDPQEIFQAGTRSQIPDAILNRIIDAMSSSITYVFL
LALIPIVLAAVTILFMGKARVKTTAEMTKKAN
>O06967 7.6.2.-~~~bmrA~~~Multidrug resistance ABC transporter ATP-binding/permease protein BmrA~~~COG1132
MPTKKQKSKSKLKPFFALVRRTNPSYGKLAFALALSVVTTLVSLLIPLLTKQLVDGFSMSNLSGTQIGLIALVFFVQAGL
SAYATYALNYNGQKIISGLRELLWKKLIKLPVSYFDTNASGETVSRVTNDTMVVKELITTHISGFITGIISVIGSLTILF
IMNWKLTLLVLVVVPLAALILVPIGRKMFSISRETQDETARFTGLLNQILPEIRLVKASNAEDVEYGRGKMGISSLFKLG
VREAKVQSLVGPLISLVLMAALVAVIGYGGMQVSSGELTAGALVAFILYLFQIIMPMGQITTFFTQLQKSIGATERMIEI
LAEEEEDTVTGKQIENAHLPIQLDRVSFGYKPDQLILKEVSAVIEAGKVTAIVGPSGGGKTTLFKLLERFYSPTAGTIRL
GDEPVDTYSLESWREHIGYVSQESPLMSGTIRENICYGLERDVTDAEIEKAAEMAYALNFIKELPNQFDTEVGERGIMLS
GGQRQRIAIARALLRNPSILMLDEATSSLDSQSEKSVQQALEVLMEGRTTIVIAHRLSTVVDADQLLFVEKGEITGRGTH
HELMASHGLYRDFAEQQLKMNADLENKAG
>P39075 ~~~bmrR~~~Multidrug-efflux transporter 1 regulator~~~COG0789
MKESYYSIGEVSKLANVSIKALRYYDKIDLFKPAYVDPDTSYRYYTDSQLIHLDLIKSLKYIGTPLEEMKKAQDLEMEEL
FAFYTEQERQIREKLDFLSALEQTISLVKKRMKRQMEYPALGEVFVLDEEEIRIIQTEAEGIGPENVLNASYSKLKKFIE
SADGFTNNSYGATFSFQPYTSIDEMTYRHIFTPVLTNKQISSITPDMEITTIPKGRYACIAYNFSPEHYFLNLQKLIKYI
ADRQLTVVSDVYELIIPIHYSPKKQEEYRVEMKIRIAE
>P39074 2.7.1.-~~~bmrU~~~Putative lipid kinase BmrU~~~COG1597
MSHRKALLIHNGNAGNKNIEKALGAVVPVLSHHLDEVIIKQTKKKDDAYHFCRSIDDSVDTVFILGGDGTIHQCINAISA
LERKPAVGILPGGTCNDFSRVLGIPQNLAKAAEALMAGKKTSVDVCQMNDRYFLNFWGIGLIAETSNQINETEKALLGKI
SYFTSALRTVSSAASFPMTLKIDGEEIKEEAVMLLVMNGQYIGTNRIPLPDASIEDGLLDVLICRNTNLTALRELMSMEQ
GSIDRFTGELSYVQASRIEIETDTAKKADMDGEVYTHTPAVIQVLPQHIDMLVPANE
>A5W4F2 1.14.12.3~~~bnzA~~~Benzene 1,2-dioxygenase subunit alpha~~~COG4638
MNQTDTSPIRLRRSWNTSEIEALFDEHAGRIDPRIYTDEDLYQLELERVFARSWLLLGHETQIRKPGDYITTYMGEDPVV
VVRQKDASIAVFLNQCRHRGMRICRADAGNAKAFTCSYHGWAYDTAGNLVNVPYEAESFACLNKKEWSPLKARVETYKGL
IFANWDENAVDLDTYLGEAKFYMDHMLDRTEAGTEAIPGVQKWVIPCNWKFAAEQFCSDMYHAGTTSHLSGILAGLPEDL
EMADLAPPTVGKQYRASWGGHGSGFYVGDPNLMLAIMGPKVTSYWTEGPASEKAAERLGSVERGSKLMVEHMTVFPTCSF
LPGINTVRTWHPRGPNEVEVWAFTVVDADAPDDIKEEFRRQTLRTFSAGGVFEQDDGENWVEIQHILRGHKARSRPFNAE
MSMDQTVDNDPVYPGRISNNVYSEEAARGLYAHWLRMMTSPDWDALKATR
>A5W4F1 1.14.12.3~~~bnzB~~~Benzene 1,2-dioxygenase subunit beta~~~COG5517
MIDSANRADVFLRKPAPVAPELQHEVEQFYYWEAKLLNDRRFEEWFALLAEDIHYFMPIRTTRIMRDSRLEYSGSREYAH
FDDDATMMKGRLRKITSDVSWSENPASRTRHLVSNVMIVGAEAEGEYEISSAFIVYRNRLERQLDIFAGERRDTLRRNTS
EAGFEIVNRTILIDQSTILANNLSFFF
>P08086 ~~~bnzC~~~Benzene 1,2-dioxygenase system ferredoxin subunit~~~
MTWTYILRQSDLPPGEMQRYEGGPEPVMVCNVDGDFFAVQDTCTHGDWALSDGYLDGDIVECTLHFGKFCVRTGKVKALP
ACKPIKVFPIKVEGDEVHVDLDNGELK
>P80193 1.14.11.1~~~~~~Gamma-butyrobetaine dioxygenase~~~
NAIADYRTFPLISPLASAASFASGVSVTWADGRVSPFHNLWLRDNCPCGDCVYEVTREQVFLVADVPEDIQVQAVTIGDD
GRLVVQWDDGHASAYHPGWLRAHAYDAQSLAEREAARPHKHRWMQGLSLPVYDHGAVMQDDDTLLEWLLAVRDVGLTQLH
GVPTEPGALIPLAKRISFIRESNFGVLFDVRSKADADSNAYTAFNLPLHTDLPTRELQPGLQFLHCLVNDATGGNSTFVD
GFAIAEALRIEAPAAYRLLCETPVEFRNKDRHSDYRCTAPVIALDSSGEVREIRLANFLRAPFQMDAQRMPDYYLAYRRF
IQMTREPRFCFTRRLEAGQLWCFDNRRVLHARDAFDPASGDRHFQGCYVDRDELLSRILVLQR
>P24282 ~~~bofA~~~Sigma-K factor-processing regulatory protein BofA~~~
MEPIFIIGIILGLVILLFLSGSAAKPLKWIGITAVKFVAGALLLVCVNMFGGSLGIHVPINLVTTAISGILGIPGIAALV
VIKQFII
>O05391 ~~~bofC~~~Protein BofC~~~
MKRFSTAYLLLGILCSAAVFLIGAPSRALGAEVEHYEPLQVHVQLEKVYLDGDVSIEHKHEKVFSMDDFWAAYAGWTLVE
QKKGYVLFRKQMDDISPLSKVNGYIGVSDNGVISTFHGRPEPASEPIQSFFQIDLERLESHMQKNLLKGIPFRTKAEFED
VIEHMKTYSG
>Q9AGW3 1.1.5.11~~~boh~~~1-butanol dehydrogenase (quinone)~~~
MKKSHAKPFALRAIVVATAAALSLPAAAVTDVTWEDIANDHKTTGDVLTYGLGLKAQRHSPLKAINTDNVANLVPAWSFS
FGGEKQRGQEAQVLVHDGVIYATASYSRIFAIDARSGKRLWEYNARLPDDIRPCCDVVNRGAAIYGDKVFFGTLDAAMVA
LDRKTGKVVWRKKFGDHKVGYTMTGAPFVIKDQKSGRTLLVHGSSGDEFGVVGWLFARDPDTGEEVWARPMVEGHMGRLN
GKDSTPTGDPKAPSWPDDPNSPTGKVEAWSQGGGAPWQTASFDVENNMVVIGAGNPAPWNTWKRTAPGDDPRNWDSLFTS
GQAYVDASTGELKGFYQHTPNDAWDFSGNNSVVLFEYKDPKTGKMVNASAHADRNGFFFVTDRDMLAKGAGYPNKPTSLI
GAWPFVDGITWASGFDLKTGKPIEKDNRPPQPKEGADKGESIFVSPPFLGGTNWHPMSYSPDTGLFYIPANHWAMDYWTE
NVTYKAGSAYLGQGFRIKNLFDDHVGILRAIDPSPARSLGAQGRVPAVAGTLTTAGGWVFTGTSDGYLKAFDAKNGKELW
KFQTGSGVVSVPVTWEMDGEQYVAIQSGYGGAVPLWGGDMAELTKQVTQGGSMWVFKLPKASR
>P0ABE2 ~~~bolA~~~DNA-binding transcriptional regulator BolA~~~COG0271
MMIRERIEEKLRAAFQPVFLEVVDESYRHNVPAGSESHFKVVLVSDRFTGERFLNRHRMIYSTLAEELSTTVHALALHTY
TIKEWEGLQDTVFASPPCRGAGSIA
>B0K2C2 2.4.1.340~~~~~~1,2-beta-oligomannan phosphorylase~~~
MIKLKRLSDKPVLMPKAENEWERAAVFNTAAIYDNGLFHLIYRATDIGPHAKYGKYISRLGYAVSKDGINFMRLDKPVMS
NETEQELRGLEDPRIVKIDGIYYMMYTGFGDRFQDDYRICLATSKNLIDWERKGVVLDEPNKDASLFPEKINGKYVMLHR
RYPDIWIAFSDDLKNWYDHKPILKPIPNTWESARVGIGGPPIKTKDGWFLIYHAADDNNVYRLGAVLLDLEDPSKVIARQ
KEPILEPELGWEKEGYIPNVVFSCGNAVKDDTIYVYYGGADTVIGVAILEMKDIKF
>Q63K41 ~~~bopE~~~Guanine nucleotide exchange factor BopE~~~
MTYNPRIGGFTHVKQASFDVHVKRGEAQPRTSFAQQIKRIFSKIGETLGQLFRHRAPDSAPGRVRLQGVRYVGSYRPTGD
AKQAIRHFVDEAVKQVAHARTPEIRQDAEFGRQVYEATLCAIFSEAKDRFCMDPATRAGNVRPAFIEALGDAARATGLPG
ADKQGVFTPSGAGTNPLYTEIRLRADTLMGAELAARPEYRELQPYARQQAIDLVANALPAERSNTLVEFRQTVQTLEATY
RRAAQDASRDEKGATNAADGA
>Q9AIX6 1.14.13.208~~~boxA~~~Benzoyl-CoA oxygenase component A~~~
MNAPAEHANLARQHLIDPEICIRCNTCEEICPVDAITHDSRNYVVKFETCNGCLACISPCPTGAIDSWRNVDKATPHSLA
DQYSWDYLPDTTELDQFEATVMGAAELPAEVQQITEVATAGQGGPAMAPWSASHPYVNLYTPANPITATVTGNYRLTAED
ASSDIHHIVLDFGTTPFPVLEGQSIGIIPPGVDEKGKPHLLRMYSVASPRDGERPHYNNLSLTVKRVVEDHEGNPTRGVA
SNYVCDLKKGDKVQVTGPYGSTYLMPNHPGSSIMMICTGTGSAPMRAMTERRRRRMDRKEGGELVLFFGARAPEELPYFG
PLQKLPKEFIDINFAFSRVPGEPKRYVQDAIRERADKVFQMLQDDNCYIYICGLKGMEAGVLEAFRDICRAKGADWDALR
PQLLSKARFHVETY
>Q9AIX7 1.14.13.208~~~boxB~~~Benzoyl-CoA oxygenase component B~~~
MINYSERIPNNVNLNENKTLQRALEQWQPSFLNWWDDMGPENSSNYDVYLRTAVSVDPKGWADFGYVKMHDYRWGIFLAP
QEGEKKITFGEHKGQDVWQEVPGEYRSTLRRIIVTQGDTEPASVEQQRHLGLTAPSLYDLRNLFQVNVEEGRHLWAMVYL
LHAHFGRDGREEGEALLERRSGDEDNPRILTAFNEKTPDWLSFFMFTFITDRDGKFQLASLAESAFDPLARTCKFMLTEE
AHHLFVGESGIARVIQRTCEVMKELGTDDPAKLRAAGVIDLPTLQKYLNFHYSVTSDLYGAEISSNAATYYTNGLKGRFE
EEKIGDDHKLQNSEYEVMDVAGDKILTRHVPALSALNERLRDDWITDVQAGVDRWNRIPAKFGFDFRFTLPHKGFHRKIG
MFADVHVSPDGRLISEAEWTHQHKNWLPTESDRLYVHSLMGRCLEPGKFANWIAAPARGINNQPVNFEYVRFN
>Q84HH6 4.1.2.44~~~boxC~~~Benzoyl-CoA-dihydrodiol lyase~~~
MQAVANKPVAELVDYRTEPSKYRHWSLATDGEIATLTLNIDEDGGIRPGYKLKLNSYDLGVDIELHDALQRVRFEHPEVR
TVVVTSGKPKIFCSGANIYMLGLSTHAWKVNFCKFTNETRNGIEDSSQYSGLKFLAACNGTTAGGGYELALACDEIVLVD
DRNSSVSLPEVPLLGVLPGTGGLTRVTDKRRVRRDHADIFCTISEGVRGQRAKDWRLVDDVVKQQQFAEHIQARAKALAQ
TSDRPAGAKGVKLTTLERTVDEKGYHYEFVDATIDADGRTVTLTVRAPAAVTAKTAAEIEAQGIKWWPLQMARELDDAIL
NLRTNHLDVGLWQLRTEGDAQVVLDIDATIDANRDNWFVRETIGMLRRTLARIDVSSRSLYALIEPGSCFAGTLLEIALA
ADRSYMLDAAEAKNVVGLSAMNFGTFPMVNGLSRIDARFYQEEAPVAAVKAKQGSLLSPAEAMELGLVTAIPDDLDWAEE
VRIAIEERAALSPDALTGLEANLRFGPVETMNTRIFGRLSAWQNWIFNRPNAVGENGALKLFGSGKKAQFDWNRV
>Q84HH8 1.2.1.77~~~boxD~~~3,4-dehydroadipyl-CoA semialdehyde dehydrogenase~~~
MKLANYVYGQWIEGAGEGAALTDPVTGEALVRVSSDGIDVARALEFARTAGGAALKALTYEERAAKLAAIAELLQAKRAE
YFDISLRNSGATEGDASFDVDGAIFTVKSYARAGKALGAGRHLKEGGRVALAKTDVFQGQHFLMPLTGVAVFINAFNFPA
WGLWEKAAPALLAGVPVFAKPATPTAWLAQRMVADVVEAGILPPGAISIVCGGARDLLDHVTECDVVSFTGSADTAARMR
THPNVVARSVRINIEADSVNSAILGPDAQPGTPEFDLAVKEIVREMTVKTGQKCTAIRRILAPAGVSRALADAVSGKLAG
CKVGNPRSEGVRVGPLVSKAQQAAAFEGLAKLRQECEVVFGGDPDFEPVDADAAVSAFVQPTLLYCDKGLAARHVHDVEV
FGPVATMVPYADTRDAVAIARRGHGSLVASVYSGDAAFLGELVPGIADLHGRVMVVDAAVGANHTGHGNVMPTCLHGGPR
ARRRRRGVGRSARAGDVSPPLRRAGRPRGAGSPVA
>Q44642 ~~~~~~26 kDa periplasmic immunogenic protein~~~
MNTRASNFLAASFSTIMLVGAFSLPAFAQENQMTTQPARIAVTGEGMMTASPDMAILNLSVLRQAKTAREAMTANNEAMT
KVLDAMKKAGIEDRDLQTGGIDIQPIYVYPDDKNNLKEPTITGYSVSTSLTVRVRELANVGKILDESVTLGVNQGGDLNL
VNDNPSAVINEARKRAVANAIAKAKTLADAAGVGLGRVVEISELSRPPMPMPIARGQFRTMLAAAPDNSVPIAAGENSYN
VSVNVVFEIK
>A0A0H2WHF1 ~~~bpaC~~~Autotransporter adhesin BpaC~~~COG5295
MNRIFKSIWCEQTRTWVAASEHAVARGGRASSVVASAGGLEKVLKLSILGAASLIAMGVVGPFAEEAMAANNAGVCLTYN
GSSNNTSGTGGWFADGCKSAGWVQGMVTNSKTDWVGLTADDTQIVLDGSAGSIYFRTGGINGNVLTMSNATGGVLLSGLA
AGVNPTDAVNMSQLTSLSTSTATGITSLSTSTATSIASLSTSMLSLGVGVVTQDASSGAISVGANSPGLTVDFAGGQGPR
TLTGVAAGVNATDAVNVGQLASLSTSTAAGLSTAASGVASLSTSLLGAAGDLASLSTSASTGLATADSGIASLSTSLLGT
ADNVTSLSTSLSTVNANLAGLQTSVDNVVSYDDPSKSAITLGGAGVATPVLLTNVAAGKIAATSTDAVNGSQLYTLQQEF
SQQYDLLTSQVSSLSTSVSGLQGSVSANTGTASGDNSTASGDNATASGTNSTANGTNSTASGDNSTASGTNASASGENST
ATGTDSTASGSNSTANGTNSTASGDNSTASGTNASATGENSTATGTDSTASGSNSTANGTNSTASGDSSTASGTNASATG
ENSTATGTDSTASGSNSTANGTNSTASGDNSTASGTNASATGENSTATGTDSTASGSNSTANGTNSTASGDNSTASGTNA
SATGENSTATGTDSTASGSNSTANGANSTASGENSTATGTDSTASGSNSTANGTNSTASGDNSTASGTNASATGENSTAT
GTASTASGSNSTANGANSTASGAGATATGENAAATGAGATATGNNASASGTSSTAGGANAIASGENSTANGANSTASGNG
SSAFGESAAAAGDGSTALGANAVASGVGSVATGAGSVASGANSSAYGTGSNATGAGSVAIGQGATASGSNSVALGTGSVA
SEDNTVSVGSAGSERRITNVAAGVNATDAVNVGQLNSAVSGIRNQMDGMQGQIDTLARDAYSGIAAATALTMIPDVDPGK
TLAVGIGTANFKGYQASALGATARITQNLKVKTGVSYSGSNYVWGAGMSYQW
>A0A0H3HIJ5 ~~~bpaC~~~Autotransporter adhesin BpaC~~~
MNRIFKSIWCEQTRTWVAASEHAVARGGRASSVVASAGGLEKVLKLSILGAASLIAMGVVGPFAEEAMAANNAGVCLTYN
GSSNNTSGTGGWFADGCKSAGWVQGMVTNSKTDWVGLTADDTQIVLDGSAGSIYFRTGGINGNVLTMSNATGGVLLSGLA
AGVNPTDAVNMSQLTSLSTSTATGITSLSTSTATSIASLSTSMLSLGVGVVTQDASTGAISVGANSPGLTVDFAGGQGPR
TLTGVAAGVNATDAVNVGQLASLSTSTAAGLSTAASGVASLSTSLLGAVGDLASLSTSASTGLATADSGIASLSTSLLGT
ADNVTSLSTSLSTVNANLAGLQTSVDNVVSYDDPSKSAITLGGAGVTTPVLLTNVAAGKIAATSTDAVNGSQLYTLQQEF
SQQYDLLTSQVSSLSTSVSGLQGSVSANTGTASGDNSTASGDNATASGTNSTANGTNSTASGDNSTASGTNASASGENST
ATGTDSTASGSNSTANGTNSTASGDNSTASGTNASATGENSTATGTDSTASGSNSTANGTNSTASGDNSTASGTNASASG
ENSTATGTDSTASGSNSTANGTNSTASGDNSTASGTNASATGENSTATGTDSTASGSNSTANGTNSTASGDNSTASGTNA
SATGENSTATGTDSTASGSNSTANGTNSTASGDNSTASGTNASATGENSTATGTDSTASGSNSTANGTNSTASGDNSTAS
GTNASATGENSTATGTDSTASGSNSTANGANSTASGDNSTASGTNASATGENSTATGTDSTASGSNSTANGTNSTASGNN
STASGTNASATGENSTATGTDSAASGTNSTANGTNSTASGDNSTASGTNASATGENSTATGTASTASGSNSTANGANSTA
SGAGATATGENAAATGAGATATGNNASASGTSSTAGGANAIASGENSTTNGANSTASGNGSSAFGESAAAAGDGSTALGA
NAVASGVGSVATGAGSVASGANSSAYGTGSNATGAGSVAIGQGATASGSNSVALGTGSVASEDNTVSVGSAGSERRITNV
AAGVNATDAVNVGQLNSAVSGIRNQMDGMQGQIDTLARDAYSGIAAATALTMIPDVDPGKTLAVGIGTANFKGYQASALG
ATARITQNLKVKTGVSYSGSNYVWGAGMSYQW
>A0R5Z0 ~~~bpa~~~Bacterial proteasome activator~~~
MTINPDDDNIEILTGAAGGADTEGEGEGEGKSLTDLVEQPAKVMRIGTMIKQLLEEVRAAPLDDASRNRLREIHQTSIRE
LEDGLAPELREELERLTLPFTDDNVPSDAELRIAQAQLVGWLEGLFHGIQTALFAQQMAARAQLEQMRQGALPPGIQVPG
AQRGGATHPGTGQYL
>P9WKX3 ~~~bpa~~~Bacterial proteasome activator~~~
MVIGLSTGSDDDDVEVIGGVDPRLIAVQENDSDESSLTDLVEQPAKVMRIGTMIKQLLEEVRAAPLDEASRNRLRDIHAT
SIRELEDGLAPELREELDRLTLPFNEDAVPSDAELRIAQAQLVGWLEGLFHGIQTALFAQQMAARAQLQQMRQGALPPGV
GKSGQHGHGTGQYL
>Q53122 1.14.12.18~~~bphA1~~~Biphenyl 2,3-dioxygenase subunit alpha~~~
MTDVQCEPALAGRKPKWADADIAELVDERTGRLDPRIYTDEALYEQELERIFGRSWLLMGHETQIPKAGDFMTNYMGEDP
VMVVRQKNGEIRVFLNQCRHRGMRICRADGGNAKSFTCSYHGWAYDTGGNLVSVPFEEQAFPGLRKEDWGPLQARVETYK
GLIFANWDADAPDLDTYLGEAKFYMDHMLDRTEAGTEAIPGIQKWVIPCNWKFAAEQFCSDMYHAGTTSHLSGILAGLPD
GVDLSELAPPTEGIQYRATWGGHGSGFYIGDPNLLLAIMGPKVTEYWTQGPAAEKASERLGSTERGQQLMAQHMTIFPTC
SFLPGINTIRAWHPRGPNEIEVWAFTVVDADAPEEMKEEYRQQTLRTFSAGGVFEQDDGENWVEIQQVLRGHKARSRPFN
AEMGLGQTDSDNPDYPGTISYVYSEEAARGLYTQWVRMMTSPDWAALDATRPAVSESTHT
>Q53123 1.14.12.18~~~bphA2~~~Biphenyl 2,3-dioxygenase subunit beta~~~
MIDAESPTTAFRTKPAPVDPSLQHEIEQFYYWEAKLLNDRRFQEWFDLLAEDIHYFMPIRTTRIMRETAQEYSGAREYAH
FDDNAQMMRGRLRKITSDVSWSENPASRTRHVISNVMIVDGEKPGEYHVSSVFIVYRNRLERQLDIFAGERKDILRRTGS
EAGFELAKRTILIDQSTILSNNLSFFF
>Q52440 ~~~bphA3~~~Biphenyl dioxygenase ferredoxin subunit~~~
MTFTKACSVDEVPPGEALQVSHDAQKVAIFNVDGEFFATQDQCTHGEWSLSEGGYLDGDVVECSLHMGKFCVRTGKVKSP
PPCEPLKVYPIRIEGRDVLVDFSRAALHA
>Q53124 ~~~bphA3~~~Biphenyl 2,3-dioxygenase, ferredoxin component~~~
MALTKICSSGDLAPGEMLRFEEGPEPILVCNVGGEFFATQDTCSHADWALSEGYLEDDVVECTLHWAKFCVRTGKAKALP
ACVPLRTFVVKLEGDDVLVDLEGGVTT
>Q0S032 1.18.1.3~~~bphA4~~~Biphenyl 2,3-dioxygenase, ferredoxin reductase component~~~
MTSDIVVIGGGVAGVTAAQSLRSEGYDGRLVLIGKERELPYDRTALSKAVLAGDLADPPLLFPADWYDEWQIETVLDRTV
LQVDVTRREVLLDGGPWLKVDRVLLATGASARVPSFSGSDLPGVATLRTADDVHRMRRDWEPGQRLVVVGGGLIGCEVAT
TARKLGLEVSILEASDELLQRVLGRRIGGWCRARLMELGISVVLNTGVAEFKGVDRITTVIGTDGRSFVADRAIVCVGAE
PETAIAEQSGLACNRGILVNDSGGTAAEGVFAAGDVASWPLLTGGRRSLETYINSQREATAVASAMLGKAVHGPQLPLSW
TEMAGHRIQMIGDIEGSGEYVMRGDPDDGPALLFRLSDGRVTAAVSVDAPRDFAMATRLVERGAQVGREVLGDTSMELRE
LNRAARERALIAE
>Q46372 1.14.12.18~~~bphA~~~Biphenyl dioxygenase subunit alpha~~~
MSSTMKDTQEAPVRWSRNWTPDAIRALVDQDNGKLDARIYADQDLYQLELERVFGRSWLMLGHETHIPKIGDYLTTYMGE
DPVIMVRQKDQSIKVFLNQCRHRGMRIVRSDGGNAKAFTCTYHGWAYDIAGNLVNVPFEKEAFCDKKEGDCGFDKADWGP
LQARVETYKGLVFANWDPEAPDLKTYLSDAMPYMDVMLDRTEAGTEAIGGIQKWVIPCNWKFAAEQFCSDMYHAGTMSHL
SGVLAGLPPEMDLTQIQLSKNGNQFRSAWGGHGAGWFINDSSILLSVVGPKITQYWTQGPAAEKAARRVPQLPILDMFGQ
HMTVFPTCSFLPGINTIRTWHPRGPNEVEVWAFVLVDADAPEDIKEEFRLQNIRTFNAGGVFEQDDGENWVEIQRVMRGH
KAKSTSLCAKMGLNVPNKNNPAYPGKTAYVYAEEAARGMYHHWSRMMSEPSWDTLKP
>P37333 1.14.12.18~~~bphA~~~Biphenyl dioxygenase subunit alpha~~~COG4638
MSSAIKEVQGAPVKWVTNWTPEAIRGLVDQEKGLLDPRIYADQSLYELELERVFGRSWLLLGHESHVPETGDFLATYMGE
DPVVMVRQKDKSIKVFLNQCRHRGMRICRSDAGNAKAFTCSYHGWAYDIAGKLVNVPFEKEAFCDKKEGDCGFDKAEWGP
LQARVATYKGLVFANWDVQAPDLETYLGDARPYMDVMLDRTPAGTVAIGGMQKWVIPCNWKFAAEQFCSDMYHAGTTTHL
SGILAGIPPEMDLSQAQIPTKGNQFRAAWGGHGSGWYVDEPGSLLAVMGPKVTQYWTEGPAAELAEQRLGHTGMPVRRMV
GQHMTIFPTCSFLPTFNNIRIWHPRGPNEIEVWAFTLVDADAPAEIKEEYRRHNIRNFSAGGVFEQDDGENWVEIQKGLR
GYKAKSQPLNAQMGLGRSQTGHPDFPGNVGYVYAEEAARGMYHHWMRMMSEPSWATLKP
>Q46381 1.3.1.56~~~bphB~~~Cis-2,3-dihydrobiphenyl-2,3-diol dehydrogenase~~~
MKLTGEVALITGGASGLGRALVDRFVAEGARVAVLDKSAERLRELEVAHGGNAVGVVGDVRSLQDQKRAAERCLAAFGKI
DTLIPNAGIWDYSTALADLPEDKIDAAFDDIFHVNVKGYIHAVKACLPALVSSRGSVVFTISNAGFYPNGGGPLYTATKH
AVVGLVRQMAFELAPHVRVNGVAPGGMNTDLRGPSSLGLSEQSISSVPLADMLKSVLPIGRMPALEEYTGAYVFFATRGD
SLPATGALLNYDGGMGVRGFLTAAGGADLPEKLNINREGQE
>P47227 1.3.1.56~~~bphB~~~Cis-2,3-dihydrobiphenyl-2,3-diol dehydrogenase~~~COG1028
MKLKGEAVLITGGASGLGRALVDRFVAEGAKVAVLDKSAERLAELETDHGDNVLGIVGDVRSLEDQKQAASRCVARFGKI
DTLIPNAGIWDYSTALVDLPEESLDAAFDEVFHINVKGYIHAVKACLPALVASRGNVIFTISNAGFYPNGGGPLYTAAKH
AIVGLVRELAFELAPYVRVNGVGSGGINSDLRGPSSLGMGSKAISTVPLADMLKSVLPIGRMPEVEEYTGAYVFFATRGD
AAPATGALLNYDGGLGVRGFFSGAGGNDLLEQLNIHP
>P47232 1.13.11.39~~~bphC2~~~Biphenyl-2,3-diol 1,2-dioxygenase 2~~~
MTATPKFAHVVLQTSRFEAMRDWYCTVLDAHVVYEGHGLCFITFDEEHHRVALLGAPTALEPRNPGAAGMHHTAYTFDTL
GDLLDRYESLKSKGIEPKVPIQHGVTTSLYYQDPDGNFVELQIDNFSTPDEATAYMNGPEYGGNPVGVSFDPVLIPQALS
AGTPVDRITTHAWALETTPDLPNPMIALTS
>Q8GR45 1.13.11.39~~~bphC~~~Manganese-dependent 2,3-dihydroxybiphenyl 1,2-dioxygenase~~~
MTAEIAKFGHIALITPNLEKSVWFFRDIVGLEEVDRQGDTIFLRAWGDWEHHTLSLTPGNRARVDHIAWRTKRPEDVETF
AEQLKAKGTEVQWIEPGEEKGQGKAIRFRLPNGYPFEIYYDVEKPKAPEGKKSRLKNNVYRPSYGIAPRRIDHVNVWTTN
PSEIHQWLKDNMGFKMREYIRLNNGFVAGGWMSVTPLVHDIGVMVDPKGQPNRLHHFAYYLDNVTDILRAADILREHDIT
IEMGGPGRHGISQAFFLYVKDPGSGHRLELFSGGYLIFDPDWEPIEWQEHELQEGLIWYGPEMKPGGPMDDTTEC
>P47228 1.13.11.39~~~bphC~~~Biphenyl-2,3-diol 1,2-dioxygenase~~~COG0346
MSIRSLGYMGFAVSDVAAWRSFLTQKLGLMEAGTTDNGDLFRIDSRAWRIAVQQGEVDDLAFAGYEVADAAGLAQMADKL
KQAGIAVTTGDASLARRRGVTGLITFADPFGLPLEIYYGASEVFEKPFLPGAAVSGFLTGEQGLGHFVRCVPDSDKALAF
YTDVLGFQLSDVIDMKMGPDVTVPAYFLHCNERHHTLAIAAFPLPKRIHHFMLEVASLDDVGFAFDRVDADGLITSTLGR
HTNDHMVSFYASTPSGVEVEYGWSARTVDRSWVVVRHDSPSMWGHKSVRDKAAARNKA
>P17297 1.13.11.39~~~bphC~~~Biphenyl-2,3-diol 1,2-dioxygenase~~~
MSIERLGYLGFAVKDVPAWDHFLTKSVGLMAAGSAGDAALYRADQRAWRIAVQPGELDDLAYAGLEVDDAAALERMADKL
RQAGVAFTRGDEALMQQRKVMGLLCLQDPFGLPLEIYYGPAEIFHEPFLPSAPVSGFVTGDQGIGHFVRCVPDTAKAMAF
YTEVLGFVLSDIIDIQMGPETSVPAHFLHCNGRHHTIALAAFPIPKRIHHFMLQANTIDDVGYAFDRLDAAGRITSLLGR
HTNDQTLSFYADTPSPMIEVEFGWGPRTVDSSWTVARHSRTAMWGHKSVRGQR
>P47229 3.7.1.8~~~bphD~~~2-hydroxy-6-oxo-6-phenylhexa-2,4-dienoate hydrolase~~~COG0596
MTALTESSTSKFVKINEKGFSDFNIHYNEAGNGETVIMLHGGGPGAGGWSNYYRNVGPFVDAGYRVILKDSPGFNKSDAV
VMDEQRGLVNARAVKGLMDALDIDRAHLVGNSMGGATALNFALEYPDRIGKLILMGPGGLGPSMFAPMPMEGIKLLFKLY
AEPSYETLKQMLQVFLYDQSLITEELLQGRWEAIQRQPEHLKNFLISAQKAPLSTWDVTARLGEIKAKTFITWGRDDRFV
PLDHGLKLLWNIDDARLHVFSKCGHWAQWEHADEFNRLVIDFLRHA
>Q52011 3.7.1.8~~~bphD~~~2-hydroxy-6-oxo-6-phenylhexa-2,4-dienoate hydrolase~~~
MTALTESSTSKFVKINEKGFSDFNIHYNEAGNGETVIMLHGGGPGAGGWSNYYRNVGPFVDAGYRVILKDSPGFNKSDAV
VMDEQRGLVNARAVKGLMDALGIDRAHLVGNSMGGATALNFAIEYPERIGKLILMGPGGPGPSMFAPMPMEGIKLLFKLY
AEPSYENLKQMIQVFLYDQSLITEELLQGRWEAIQRQPEHLKNFLISAQKAPLSTWDVTARLGEIKAKTFITWGRDDRFV
PLDHGLKLLWNIDDARLHVFSKCGHWAQWEHADEFNRLAIDFLRQA
>P17548 3.7.1.8~~~bphD~~~2-hydroxy-6-oxo-6-phenylhexa-2,4-dienoate hydrolase~~~
MSELNESSTSKFVTINEKGLSNFRIHLNDAGQGERVIMLHGGGPGAGGWSNYYRNIGPFVEAGYRVLLPDAPGFNKSDTV
VMDEQRGLVNARSVKGMMDVLGIEKAHLVGNSMGGAGALNFALEYPERTGKLILMGPGGLGNSLFTAMPMEGIKLLFKLY
AEPSLETLKQMLNVFLFDQSVITDELLQGRWANIQRNPEHLKNFILSAQKVPLSAWDVSARLGEIKAKTLVTWGRDDRFV
PLDHGLKLIANMQDAHVHVFPRCAHWAQWEHADAFNRLTLDFLANG
>Q46373 1.14.12.18~~~bphE~~~Biphenyl dioxygenase subunit beta~~~
MISTPLSKEFEWPAKPVSLELQHQVEQFYYREAQLLDHHAFQAWFALLAEDIHYWMPIRTVRTAREQGLEYVPAGANAHF
DDTHATMYGRIRQKTSDLNWAEDPPSRTRHLVSNVIVREMDTPGTLEVASAFLLYRSRLERQVDVFAGERRDVLRIADNP
LGFQIAKRTIILDQSTVLANNLSVFF
>P37334 1.14.12.18~~~bphE~~~Biphenyl dioxygenase subunit beta~~~COG5517
MTNPSPHFFKTFEWPSKAAGLELQNEIEQFYYREAQLLDHRAYEAWFALLDKDIHYFMPLRTNRMIREGELEYSGDQDLA
HFDETHETMYGRIRKVTSDVGWAENPPSRTRHLVSNVIVKETATPDTFEVNSAFILYRNRLERQVDIFAGERRDVLRRAD
NNLGFSIAKRTILLDASTLLSNNLSMFF
>P37332 ~~~bphF~~~Biphenyl dioxygenase system ferredoxin subunit~~~COG2146
MKFTRVCDRRDVPEGEALKVESGGTSVAIFNVDGELFATQDRCTHGDWSLSDGGYLEGDVVECSLHMGKFCVRTGKVKSP
PPCEALKIFPIRIEDNDVLVDFEAGYLAP
>O05151 4.1.3.39~~~bphF~~~4-hydroxy-2-oxovalerate aldolase~~~
MQSPINSFKKALAEGRTQIGFWLALGDAYSAEVCAGAGFDWLLIDGEHAPQDLRSVLAQLQVIGAYRDCHAAVRVPSADT
TVIKQYLDLGAQSLLVPMVDTADEAAAVVRACRYPPGGIRGVGGARASRWGRYPRYLHEADEQVCVVVQAETALALSNLE
AIAEVDGIDGVFIGTADLAASLGFPGNPAHPEVQDAILDALQRVRAAGKAPGVLTPVEDLAQKYLAHGAVFVAVGIDTHL
LAKQTSALAARFAQVAYS
>Q9RZA4 2.7.13.3~~~bphP~~~Bacteriophytochrome~~~COG4251
MSRDPLPFFPPLYLGGPEITTENCEREPIHIPGSIQPHGALLTADGHSGEVLQMSLNAATFLGQEPTVLRGQTLAALLPE
QWPALQAALPPGCPDALQYRATLDWPAAGHLSLTVHRVGELLILEFEPTEAWDSTGPHALRNAMFALESAPNLRALAEVA
TQTVRELTGFDRVMLYKFAPDATGEVIAEARREGLHAFLGHRFPASDIPAQARALYTRHLLRLTADTRAAAVPLDPVLNP
QTNAPTPLGGAVLRATSPMHMQYLRNMGVGSSLSVSVVVGGQLWGLIACHHQTPYVLPPDLRTTLEYLGRLLSLQVQVKE
AADVAAFRQSLREHHARVALAAAHSLSPHDTLSDPALDLLGLMRAGGLILRFEGRWQTLGEVPPAPAVDALLAWLETQPG
ALVQTDALGQLWPAGADLAPSAAGLLAISVGEGWSECLVWLRPELRLEVAWGGATPDQAKDDLGPRHSFDTYLEEKRGYA
EPWHPGEIEEAQDLRDTLTGALGERLSVIRDLNRALTQSNAEWRQYGFVISHHMQEPVRLISQFAELLTRQPRAQDGSPD
SPQTERITGFLLRETSRLRSLTQDLHTYTALLSAPPPVRRPTPLGRVVDDVLQDLEPRIADTGASIEVAPELPVIAADAG
LLRDLLLHLIGNALTFGGPEPRIAVRTERQGAGWSIAVSDQGAGIAPEYQERIFLLFQRLGSLDEALGNGLGLPLCRKIA
ELHGGTLTVESAPGEGSTFRCWLPDAGPLPGAADA
>Q9HWR3 2.7.13.3~~~bphP~~~Bacteriophytochrome~~~
MTSITPVTLANCEDEPIHVPGAIQPHGALVTLRADGMVLAASENIQALLGFVASPGSYLTQEQVGPEVLRMLEEGLTGNG
PWSNSVETRIGEHLFDVIGHSYKEVFYLEFEIRTADTLSITSFTLNAQRIIAQVQLHNDTASLLSNVTDELRRMTGYDRV
MAYRFRHDDSGEVVAESRREDLESYLGQRYPASDIPAQARRLYIQNPIRLIADVAYTPMRVFPALNPETNESFDLSYSVL
RSVSPIHCEYLTNMGVRASMSISIVVGGKLWGLFSCHHMSPKLIPYPVRMSFQIFSQVCSAIVERLEQGRIAELLRVSTE
RRLALARRARDADDLFGALAHPDDGIAALIPCDGALVMLGGRTLSIRGDFERQAGNVLQRLQRDPERDIYHTDNWPQPSE
DSPDGGDCCGVLAIRFHRQESGWIFWFRHEEVHRIRWGGKPEKLLTIGPSGPRLTPRGSFEAWEEVVRGHSTPWSETDLA
IAEKLRLDLMELCLNHAAEVDRMRQRLIAVLGHDLRNPLQSISMAAALLSSSDTRTTELRQHISASSSRMERLVSQILDM
SRLQSGIGLTVNPVDTDVSQLVRQIVCETDVAYPGLVIEIAIDPQVRAVVDPDRYAQVAANLLSNARHHGLPGRPVLVTL
TRQGDEVCLSVLNETSGLSEAQLANLFEPFKRESADNQRNRNGLGIGLYISQAIAQAHQGRIDVDCRDDVITFCLRLPVR
QAETGSSS
>A0A0H2XCS3 ~~~bphP~~~Bacteriophytochrome~~~
MSTATNPLDLDVCAREPIHIPGLIQPYGVLLVIDPADGRIVQASTTAADLLGVPMAALLGMPYTQVLTLPEAQPFAVDDQ
PQHLMHAEVRFPQRATPPASAWVAAWHLYPQQWLVEMEPRDARLLDVTLREAMPLLRSVERDPGIAEAAVRVAKGLRSLI
GFDRVMIYRFDEEWNGDIIAEARKPELEAYLGLHYPASDIPAQARALYLRNRVRQIADVGYQPSPIQPTVHPQLGTPVDL
SDVSLRSVSPVHLEYLANMGVTATLVASIVVNDALWGLISCHHYSPHFTNHAMRDVTDAVARTLAGRIGALQAVARARLE
SVLLTVREKLITDFNDAEHMTVELLDDMAPDLMDVVDADGVAIFHGNDISRHGTTPDVAALRRIRDHIESEHHEALREDA
VGALHVDAIGEVFPELADLAPLAAGFIFVPLMPQSRSALLWTRREQIQQIKWAGNPQLAKLEDIPNSRLSPRKSFDLWQQ
TVRGRARRWSPLHLESARSLRVLIELMERKRFQQDFTLLEASLSRLRDGVAIIERGTANAAHRLLFVNTAFADVCGSDVA
ELIGRELQTLYASDAPRANVELLQDALRNGRAAYVTLPLQVSDGAPVYRQFHLEPLPSPSGVTAHWLLQLRDPE
>P29715 1.11.1.-~~~bpoA2~~~Non-haem bromoperoxidase BPO-A2~~~COG2267
MPFITVGQENSTSIDLYYEDHGTGQPVVLIHGFPLSGHSWERQSAALLDAGYRVITYDRRGFGQSSQPTTGYDYDTFAAD
LNTVLETLDLQDAVLVGFSMGTGEVARYVSSYGTARIAKVAFLASLEPFLLKTDDNPDGAAPQEFFDGIVAAVKADRYAF
YTGFFNDFYNLDENLGTRISEEAVRNSWNTAASGGFFAAAAAPTTWYTDFRADIPRIDVPALILHGTGDRTLPIENTARV
FHKALPSAEYVEVEGAPHGLLWTHAEEVNTALLAFLAK
>P9WNH1 ~~~bpoC~~~Putative non-heme bromoperoxidase BpoC~~~COG2267
MINLAYDDNGTGDPVVFIAGRGGAGRTWHPHQVPAFLAAGYRCITFDNRGIGATENAEGFTTQTMVADTAALIETLDIAP
ARVVGVSMGAFIAQELMVVAPELVSSAVLMATRGRLDRARQFFNKAEAELYDSGVQLPPTYDARARLLENFSRKTLNDDV
AVGDWIAMFSMWPIKSTPGLRCQLDCAPQTNRLPAYRNIAAPVLVIGFADDVVTPPYLGREVADALPNGRYLQIPDAGHL
GFFERPEAVNTAMLKFFASVKA
>Q5XAQ1 ~~~~~~Putative bifunctional phosphatase/peptidyl-prolyl cis-trans isomerase~~~
MEESMDAKLKYKAKKIKMVFFDIDDTLRVKDTGYMPESIQRVFKALKAKGILVGIASGRARYGVPQEVQDLHADYCVKLN
GAYVKDDAKTIIFQAPIPADVVVAYKKWADDMGIFYGMAGRHEAVLSARNDMISNAIDNVYAQLEVCPDYNEYHDVYQMW
TFEDKGDGLQLPAELAEHLRLVRWHDNSSDVVLKGTSKALGVSKVVDHLGLKPENILVFGDELNDLELFDYAGISIAMGV
SHPLLQEKADFITKKVEEDGILYALEELGLIDKELQFPQLDLENHTGPKVTIKTNHGDMTLVLFPDHAPKTVANFLGLAK
EGYYDGIIFHRIIPEFMIQGGDPTGTGMGGQSIYGESFEDEFSDELYNLRGALSMANAGPNTNGSQFFIVQNSKIPYAKK
ELERGGWPTPIAAAYAENGGTPHLDRRHTVFGQLVDETSFQVLDLIAGVETGAQDKPKEDVIIETIEVFD
>B1VN94 1.3.1.113~~~bprA~~~(4-alkanoyl-5-oxo-2,5-dihydrofuran-3-yl)methyl phosphate reductase~~~COG0702
MILVTGATGAVGREVAGRLADAGPVRILARRPERLTVRGTGVEVVQGAYGDRAALDRALRGVDAVFLVTNDPTEPDDERV
AAAAAAAGVRHLVKLSMMAVEEPDAEDFITRRQRENEQAVRDSGVPWTFVRPRTFMSNTLSWAPGIRSAGVVRALYGDAP
VACVDPRDVAAVAVAALTGTGHEGRAYAVSGPEAITAREQTAQLSRVLGRPLRFEELGVDAARTALMAKYPPPVAEAFLQ
SAERQRTGAKASVVPTVQELTGRPARPFRDWSAEHAEAFAPE
>P42779 3.4.21.-~~~bprV~~~Extracellular basic protease~~~
MNLSNISAVKVLTLVVSAAIAGQVCAAESIVNYESANAISKQPEGSVRFIVKYKDGTPSSQGLKTRSTTKVMASGMQVAG
FEAQFVRTTGLGAGIFAVPELKTTKEAHLVMDTIASNPDVEFVEVDRLAYPKAAPNDPSYRQQWHYFGNYGVKANKVWDR
GFTGQGVVVSVVDTGILDHVDLNGNMLPGYDFISSAPNARDGDQRDNNPADEGDWFDNWDCGGYPDPRREKKFSTWHGSH
VAGTIAAVTNNGVGVAGVAYGAKVIPVRVLGKCGGYDSDITDGMYWSAGGHIDGVPDNQNPAQVVNMSLGGGGGCSQNSQ
RMIDKTTNLGALIVIAAGNENQDASRTWPSSCNNVLSVGATTPKGKRAPFSNYGARVHLAAPGTNILSTIDVGQAGPVRS
SYGMKAGTSMAAPHVSGVAALVISAANSIGKTLTPSELSDILVRTTSRFNGRLDRGLGSGIVDANAAVNAVLGDQNRAQP
RPPVNQPINSGNKVYRSDRRVAIRDLRSVTSGIRVNDQARVGSANITLTLDIRYGDRSQLAVELIAPSGRVYPIYHDGKR
QPNIVGPATFSVKNERLQGTWTLKVTDKARGVTGSIDSWSLTF
>Q8UG81 2.3.2.29~~~bpt~~~Aspartate/glutamate leucyltransferase~~~COG2935
MNTQATPSPQFYLTAPATCPYLPNQMERKVFTHLVGPRAPEMNDLLTQGGFRRSQNIAYRPACETCRACVSVRILTEQFQ
PTKSMRRVLAANSDVVATVHAAEPSTEQFALFRRYLDHRHQSGGMSDMSALDYAIMVEDTHVNTRIIEYRVREPGSGIDS
SKRGELLAVALSDVMSDGLSMVYSFFNPELEKRSLGTFMIIDHITRTRALGLPHVYLGYWVDGSEKMGYKTRYHPQEHLT
PRGWEIYSPKEE
>Q983E4 2.3.2.29~~~bpt~~~Aspartate/glutamate leucyltransferase~~~COG2935
MTQHPTQSPQFFLTAPSPCPYLDGQFERKVFTHLVGDKASEMNDLLTQGGFRRSQNIAYRPACETCRACVSVRILAQEFT
ASRNMKRVLQHNSDLVGAMHNAEPSTEQYSLFRSYLDARHRRGGMSDMTVLDYAMMVEDTHVDTKVIEYRRRGPDTFITG
KGQGELIAVALTDKMADGLSMVYSYFNPEFEERSLGTFMILDHIARARAMGLPHVYLGYWVNGSRKMNYKMRFMPQEHLG
PKGWERYTNEAVSR
>Q8DAR7 2.3.2.29~~~bpt~~~Aspartate/glutamate leucyltransferase~~~
MSSDIHQIKIGLTDNHPCSYLPERKERVAVALEADMHTADNYEVLLANGFRRSGNTIYKPHCDSCHSCQPIRISVPDIEL
SRSQKRLLAKARSLSWSMKRNMDENWFDLYSRYIVARHRNGTMYPPKKDDFAHFSRNQWLTTQFLHIYEGQRLIAVAVTD
IMDHCASAFYTFFEPEHELSLGTLAVLFQLEFCQEEKKQWLYLGYQIDECPAMNYKVRFHRHQKLVNQRWQG
>P74388 2.1.1.295~~~~~~2-methyl-6-phytyl-1,4-hydroquinone methyltransferase~~~COG2226
MPEYLLLPAGLISLSLAIAAGLYLLTARGYQSSDSVANAYDQWTEDGILEYYWGDHIHLGHYGDPPVAKDFIQSKIDFVH
AMAQWGGLDTLPPGTTVLDVGCGIGGSSRILAKDYGFNVTGITISPQQVKRATELTPPDVTAKFAVDDAMALSFPDGSFD
VVWSVEAGPHMPDKAVFAKELLRVVKPGGILVVADWNQRDDRQVPLNFWEKPVMRQLLDQWSHPAFASIEGFAENLEATG
LVEGQVTTADWTVPTLPAWLDTIWQGIIRPQGWLQYGIRGFIKSVREVPTILLMRLAFGVGLCRFGMFKAVRKNATQA
>O34545 ~~~braB~~~Branched-chain amino acid permease BraB~~~COG1114
MKHSLPVKDTIIIGFMLFALFFGAGNMIYPPELGQAAGHNVWKAIGGFLLTGVGLPLLGIIAIALTGKDAKGLADKAHPV
FGTIFTVVLYLSIGPLFAIPRTGTVSYEIGAVPFLTGVPERLSLLIFTLIFFGVTYYLALNPSKVVDRVGKILTPIKFTI
ILIIVLKAIFTPMGGLGAVTEAYKGTPVFKGFLEGYKTMDALASIVFGVVVVNAVKSKGVTQSKALAAACIKAGVIAALG
LTFIYVSLAYLGATSTNAIGPVGEGAKILSASSHYLFGSLGNIVLGAAITVACLTTSIGLVTSCGQYFSKLIPALSYKIV
VTIVTLFSLIIANFGLAQIIAFSVPILSAIYPLAIVIIVLSFIDKIFKERREVYIACLIGTGLFSILDGIKAAGFSLGSL
DVFLNANLPLYSLGIGWVLPGIVGAVIGYVLTLFIGPSKQLNEIS
>P21175 ~~~braC~~~Leucine-, isoleucine-, valine-, threonine-, and alanine-binding protein~~~
MKKGTQRLSRLFAAMAIAGFASYSMAADTIKIALAGPVTGPVAQYGDMQRAGALMAIEQINKAGGVNGAQLEGVIYDDAC
DPKQAVAVANKVVNDGVKFVVGHVCSSSTQPATDIYEDEGVLMITPSATAPEITSRGYKLIFRTIGLDNMQGPVAGKFIA
ERYKDKTIAVLHDKQQYGEGIATEVKKTVEDAGIKVAVFEGLNAGDKDFNALISKLKKAGVQFVYFGGYHPEMGLLLRQA
KQAGLDARFMGPEGVGNSEITAIAGDASEGMLATLPRAFEQDPKNKALIDAFKAKNQDPSGIFVLPAYSAVTVIAKGIEK
AGEADPEKVAEALRANTFETPTGNLGFDEKGDLKNFDFTVYEWHKDATRTEVK
>Q8XYE3 ~~~~~~TAL effector protein Brg11~~~COG2201
MRIGKSSGWLNESVSLEYEHVSPPTRPRDTRRRPRAAGDGGLAHLHRRLAVGYAEDTPRTEARSPAPRRPLPVAPASAPP
APSLVPEPPMPVSLPAVSSPRFSAGSSAAITDPFPSLPPTPVLYAMARELEALSDATWQPAVPLPAEPPTDARRGNTVFD
EASASSPVIASACPQAFASPPRAPRSARARRARTGGDAWPAPTFLSRPSSSRIGRDVFGKLVALGYSREQIRKLKQESLS
EIAKYHTTLTGQGFTHADICRISRRRQSLRVVARNYPELAAALPELTRAHIVDIARQRSGDLALQALLPVATALTAAPLR
LSASQIATVAQYGERPAIQALYRLRRKLTRAPLHLTPQQVVAIASNTGGKRALEAVCVQLPVLRAAPYRLSTEQVVAIAS
NKGGKQALEAVKAHLLDLLGAPYVLDTEQVVAIASHNGGKQALEAVKADLLDLRGAPYALSTEQVVAIASHNGGKQALEA
VKADLLELRGAPYALSTEQVVAIASHNGGKQALEAVKAHLLDLRGVPYALSTEQVVAIASHNGGKQALEAVKAQLLDLRG
APYALSTAQVVAIASNGGGKQALEGIGEQLLKLRTAPYGLSTEQVVAIASHDGGKQALEAVGAQLVALRAAPYALSTEQV
VAIASNKGGKQALEAVKAQLLELRGAPYALSTAQVVAIASHDGGNQALEAVGTQLVALRAAPYALSTEQVVAIASHDGGK
QALEAVGAQLVALRAAPYALNTEQVVAIASSHGGKQALEAVRALFPDLRAAPYALSTAQLVAIASNPGGKQALEAVRALF
RELRAAPYALSTEQVVAIASNHGGKQALEAVRALFRGLRAAPYGLSTAQVVAIASSNGGKQALEAVWALLPVLRATPYDL
NTAQIVAIASHDGGKPALEAVWAKLPVLRGAPYALSTAQVVAIACISGQQALEAIEAHMPTLRQASHSLSPERVAAIACI
GGRSAVEAVRQGLPVKAIRRIRREKAPVAGPPPASLGPTPQELVAVLHFFRAHQQPRQAFVDALAAFQATRPALLRLLSS
VGVTEIEALGGTIPDATERWQRLLGRLGFRPATGAAAPSPDSLQGFAQSLERTLGSPGMAGQSACSPHRKRPAETAIAPR
SIRRSPNNAGQPSEPWPDQLAWLQRRKRTARSHIRADSAASVPANLHLGTRAQFTPDRLRAEPGPIMQAHTSPASVSFGS
HVAFEPGLPDPGTPTSADLASFEAEPFGVGPLDFHLDWLLQILET
>Q45340 ~~~brkA~~~BrkA autotransporter~~~COG3468
MYLDRFRQCPSSLQIPRSAWRLHALAAALALAGMARLAPAAAQAPQPPVAGAPHAQDAGQEGEFDHRDNTLIAVFDDGVG
INLDDDPDELGETAPPTLKDIHISVEHKNPMSKPAIGVRVSGAGRALTLAGSTIDATEGGIPAVVRRGGTLELDGVTVAG
GEGMEPMTVSDAGSRLSVRGGVLGGEAPGVGLVRAAQGGQASIIDATLQSILGPALIADGGSISVAGGSIDMDMGPGFPP
PPPPLPGAPLAAHPPLDRVAAVHAGQDGKVTLREVALRAHGPQATGVYAYMPGSEITLQGGTVSVQGDDGAGVVAGAGLL
DALPPGGTVRLDGTTVSTDGANTDAVLVRGDAARAEVVNTVLRTAKSLAAGVSAQHGGRVTLRQTRIETAGAGAEGISVL
GFEPQSGSGPASVDMQGGSITTTGNRAAGIALTHGSARLEGVAVRAEGSGSSAAQLANGTLVVSAGSLASAQSGAISVTD
TPLKLMPGALASSTVSVRLTDGATAQGGNGVFLQQHSTIPVAVALESGALARGDIVADGNKPLDAGISLSVASGAAWHGA
TQVLQSATLGKGGTWVVNADSRVQDMSMRGGRVEFQAPAPEASYKTLTLQTLDGNGVFVLNTNVAAGQNDQLRVTGRADG
QHRVLVRNAGGEADSRGARLGLVHTQGQGNATFRLANVGKAVDLGTWRYSLAEDPKTHVWSLQRAGQALSGAANAAVNAA
DLSSIALAESNALDKRLGELRLRADAGGPWARTFSERQQISNRHARAYDQTVSGLEIGLDRGWSASGGRWYAGGLLGYTY
ADRTYPGDGGGKVKGLHVGGYAAYVGDGGYYLDTVLRLGRYDQQYNIAGTDGGRVTADYRTSGAAWSLEGGRRFELPNDW
FAEPQAEVMLWRTSGKRYRASNGLRVKVDANTATLGRLGLRFGRRIALAGGNIVQPYARLGWTQEFKSTGDVRTNGIGHA
GAGRHGRVELGAGVDAALGKGHNLYASYEYAAGDRINIPWSFHAGYRYSF
>P94499 ~~~brnQ~~~Branched-chain amino acid permease BrnQ~~~COG1114
MSKKVSASYIIIIGLMLFALFFGAGNLIFPPMLGQLAGKNVWVANAGFLVTGVGLPLLAITAFVFSGKQNLQSLASRVHP
VFGIVFTTILYLAIGPFFAIPRSGNVSFEIGVKPFLSNDASPVSLIIFTILFFALACLLSLNPSKIIDIVGKFLTPIKLT
FIGLLVAVALIRPIGTIQAPSKGYTSQAFFKGFQEGYLTLDALVAFVFGIIIVNALKEQGASTKKQLIVVCAKAAAIAAV
LLAVMYTALSYMGASSVEELGILENGAEVLAKVSSYYFGSYGSILLGLMITVACLTTSVGLITACSSFFHELFPNISYKK
IAVVLSVFSTLVANIGLTQLIKVSMPVLLTMYPIAISLIFLTFLHSVFKGKTEVYQGSLLFAFIISLFDGLKAAGIKIEV
VNRIFTQILPMYNIGLGWLIPAIAGGICGYILSIFRTKTS
>O06754 ~~~brnQ~~~Branched-chain amino acid permease BrnQ~~~COG1114
MSKKSVLITSLMLFSMFFGAGNLIFPPMLGLSAGTNYLPAILGFLATSVLLPVLAIIAVVLSGENVKDMASRGGKIFGLV
FPIAAYLSIGAFYALPRTGAVSYSTAVGVDNALYSGLFNFVFFAVALALSWNPNGIADKLGKWLTPALLTLIVVLVVLSV
AKLDGTPGEPSSAYAQQPAGAGLLEGYMTMDAIAALAFGIVVISAFKYQKVNKVRTATVVSAFIAGILLALVYLGLGSIG
QVVNGEFADGTAILNYAALSTMGQAGRIMFVAILILACMTTAVGLISATSEFFNSLLPGVKYHVWATVFALISFGVATMG
LDTVLAVAAPVISFIYPSAITLVFLSLIEPLLFRLKWTYLFGIWTAVVWALFMSIPALNPFIEWAPLHSMSLGWVVPVLV
ASAIGLAIDWNKKGAQSVAKKESISV
>P0AD99 ~~~brnQ~~~Branched-chain amino acid permease BrnQ~~~COG1114
MTHQLRSRDIIALGFMTFALFVGAGNIIFPPMVGLQAGEHVWTAAFGFLITAVGLPVLTVVALAKVGGGVDSLSTPIGKV
AGVLLATVCYLAVGPLFATPRTATVSFEVGIAPLTGDSALPLFIYSLVYFAIVILVSLYPGKLLDTVGNFLAPLKIIALV
ILSVAAIVWPAGSISTATEAYQNAAFSNGFVNGYLTMDTLGAMVFGIVIVNAARSRGVTEARLLTRYTVWAGLMAGVGLT
LLYLALFRLGSDSASLVDQSANGAAILHAYVQHTFGGGGSFLLAALIFIACLVTAVGLTCACAEFFAQYVPLSYRTLVFI
LGGFSMVVSNLGLSQLIQISVPVLTAIYPPCIALVVLSFTRSWWHNSSRVIAPPMFISLLFGILDGIKASAFSDILPSWA
QRLPLAEQGLAWLMPTVVMVVLAIIWDRAAGRQVTSSAH
>P54104 ~~~brnQ~~~Branched-chain amino acid permease BrnQ~~~
MKEKLTHAESLTISSMLFGLFFGAGNLIFPAYLGEASGANLWISLLGFLITGVGLPLLAIASLGMTRSEGLLDLSGRVSH
KYSYFFTCLLYLTIGPFFAIPRSFTVPFETGISALLPSGMAKSTGLFIFSLIFFAIMLFFSLRPGQIMDWIGKFLTPAFL
LFFFFIMIMALLHPLGNYHAVKPVGEYASAPLISGVLAGYNTMDALAGLAFGIIVISSIRTFGVTKPEKVASATLKTGVL
TCLLMAVIYAITALVGAQSRTALGLAANGGEALSQIARHYFPGLGAVIFALMIFVACLKTAIGLITACSETFAEMFPKTL
SYNMWAIIFSLLAFGIANVGLTTIISFSLPVLMLLYPLAISLILLALTSKLFDFKQVDYQIMTAVTFLCALGDFFKALPA
GMQVKAVTGLYGHVLPLYQDGLGWLVPVTVIFAILAIKGVISKKRA
>A2RJ04 ~~~brnQ~~~Branched-chain amino acid permease BrnQ~~~COG1114
MKEKLAGKDYLFIGSMLFGLFFGAGNLIFPIHMGQEAGDAISQANFGFLVTAVGFPFLGIIALGISQSNGVFELATRVNR
IYAYIFTILLYLVIGPFFALPRLATTSFEIGISPFLSNKLQTPLLALFSILFFGTAWFLSRKPTKLLDYIGKFLNPLFLV
LLGLILIIAFSHPLGSVHQAEVGKLYQSSAFMNGFTQGYNTLDALAALAFGIIIITTIRQRGVKNSKIIAKETIKAGLIS
VGLMAIIYTCLSYLGAMSVGKFAISENGGIALAQISNYYLGTFGMIILALIVIIACLKTAVGLMSAFSETFVELFPKREY
RFYLMIVSILPCIFANIGLTKIIELSVPVLMFLYPLAITLILLALLGPVFQHSKLVYQITTAFTLLAAIADGLNALPNPL
KSLKPIAAILNFSKDSLPFFSLGMGWILLAVLGFIIGCLVHFLKRITNK
>Q8DVR0 ~~~brpA~~~Biofilm regulatory protein A~~~COG1316
MKIGKKILIMLVTIFLTSLVALGVYATSIYNFSLGEFSKTFKDYGTGSGKDVIADEKPFSILLMGVDTGSSERTSKWEGN
SDSMILVTVNPKTKKTTMTSLERDILVKLSGSKTNDQTGYDAKLNAAYAAGGAKMAIMTVQDMLDIKIDKYVQINMEGLV
QLVDAVGGITVTNHFDFPISIEEHEPEFTASVEPGTHKINGEQALVYSRMRYDDPDGDYGRQKRQREVISKVLKKILALD
SVSKYRKILSAVSKNMQTNIEISSSTIPKLLGYSDALKSIRTYQLKGEGTTIDGGSYQLVTSKELLKAQNRIKGQLGLKK
STAENLKTTASLYENFYGGDTSIYDSSSSASDYSSSGNYSGSSSDYGSSSSYGSNSSSGSSSDYSGQNSYNQGNYQQPAA
GTGIGN
>P0DUF0 ~~~brxA~~~BREX protein BrxA~~~
MVKEQVYKSTIKSRPLLFLEMKKVSILLNKGFKEFEVKEKAISENIFQVNTESRKKEIASSVLARLKILDDYLIREIGHG
DIESSKAIVLYSIMKTDRLFFEFMNEVFKEKFVFGETFLTDADFNIFFENKQQQSEKVASWNDYTFYKLKQVYIRILFEA
GYLKNQKGNREIERPLLNIDVVEHIRALGDGIYMDILVGE
>P54170 ~~~brxA~~~Bacilliredoxin BrxA~~~
MSMAYEEYMRQLVVPMRRELTGAGFEELTTAEEVENFMEKAEGTTLVVVNSVCGCAAGLARPAATQAVLQNDKTPDNTVT
VFAGQDKEATAKMREYFTGQEPSSPSMALLKGKEVVHFIPRHEIEGHDMEEIMKNLTAAFDAHC
>P0DUF6 ~~~brxA~~~BREX protein BrxA~~~
MRAAFQSSEGFSMIKNDKAWIGDLLGGPLMSRESRVIAELLLTDPDEQTWQEQIVGHNILQASSPNTAKRYAATIRLRLN
TLDKSAWTLIAEGSERERQQLLFVALMLHSPVVKDFLAEVVNDLRRQFKEKLPGNSWNEFVNSQVRLHPVLASYSDSSIA
KMGNNLVKALAEAGYVDTPRRRNLQAVYLLPETQAVLQRLGQQDLISILEGKR
>Q2W5N3 ~~~brxA~~~BREX protein BrxA~~~
MAEPRYKADIGGGSLKLPESRIIAGLLLEGVTEDQWRHAIEVENVLQRRSPGTAKRQSSLMRNRLETMGPELWQMVRDGS
TQVAIQAVFAAAIKHSTLLGDFLDLVVRDQFRMFRPDLPRKMWDQYLEQCRNRDPLMPVWQDSTANKLADCVYRILVEVG
YITDSKTYRLKSVRISGEVMSYLRENNEQYVIRCIQVSI
>P0DUF1 ~~~brxB~~~BREX protein BrxB~~~
MRRINERLDEILPKITDASFRENKGLGNEIGFYIFDYDPKYEMLVREHIVYMQERLKNDSSLHIREFDLYEVMLEILEEK
GYLQKNIDMEQKKGSDFILNATRKALRLTSNNDLVVQYITDRVQPNDIVFLTGVGKVFPIIRSHTILNNLHKAVDNVPLV
MFFPGTYDGLELVLFGEIKDDNYYRAFQLIDK
>P54534 ~~~brxB~~~Bacilliredoxin BrxB~~~
MNMDFNLFMNDIVRQARQEITAAGYTELKTAEEVDEALTKKGTTLVMVNSVCGCAGGIARPAAYHSVHYDKRPDQLVTVF
AGQDKEATARARDYFEGYPPSSPSFAILKDGKIMKMVERHEIEGHEPMAVVAKLQEAFEEYCEEV
>P0DUF7 ~~~brxB~~~BREX protein BrxB~~~
MIDPVLEYRLSQIQSRINEDRFLKNNGSGNEIGFWIFDYPAQCELQVREHLKYLLRHLEKDHKFACLNVFQIIIDMLNER
GLFERVCQQEVKVGTETLKKQLAGPLNQKKIADFIAKKVDLAAQDFVILTGMGNAWPLVRGHELMSALQDVMGFTPLLMF
YPGTYSGYNLSPLTDTGSQNYYRAFRLVPDTGPAATLNPQ
>P0DUF2 ~~~brxC~~~Probable ATP-binding protein BrxC~~~
MQIQKMFEKEINRDIKGVIKVGQDDDKNIYQELDEYVVTNELLHHMGEFFKSYKKGITGHTDKMGVWISGFFGSGKSHFL
KILSYLLENKHVMDENEINKDAISFFNNKILDPMVLADMKLAGNTTTDVILFNIDSKSESDSKSDKNAIVKVFNKVFNEM
QGFCGSIPWIADLERQMVKDGRYEEFKAEFEKISGNTWEEAREDFYYEEDSIIEALSKTTKMSEDAARNWYERAEEDYFI
SIDRFAKRVREYVEAKGNNHHVVFLIDELGQYIGNDSQLMLNLQTVVEDIGTQCGGKVWVLVTSQQDIDSVVKVNGNDFS
KIQGRFDTRLSLSSAHVDEVIKKRILLKNEVGKQTLRLLYGDNSSILKNLITFSGDTAEMKIFNNEEDFVDVYPFIPYQF
NLLQKVFTGIRIHGASGKHLAEGERSLLSAFQESAMKYAESETGALIPFSAFYQTIEAFLDSSIRTVIIHAQNNSRLNAY
DVEVLKLLFLIKYVKELPANLENLATLMIQNISDDKIELKKKIESSLIRLSKETLIQKNGQEYIFLTNDEQDVNREIKNM
HVDSAEVIQKIGDVIFNVVYQDKKYRYSPKYHFSFNTIIDDRPIGMQTNDIGLKIITPYFDATTELNDSELKMMSMRESN
VILKLPQDTSYLDEMTEILRIQAYLKIKSGTAASQAIEDIKVRKSREANERKDRVHIYITEALKHAQIFVNSQQLDVKEK
NPVERINDAFKVLIDNLYNKLHYVKKFIDTAKQLNELLVENTTQLTLSDDNEDANQLAVKEVNDYITRLTTRNQQITIKG
ITTHFTKQPYGWKDLDIASFIIKLFKGQEIKLQLNSSNLTTSERDLVNYITKRDYVERVVVKKRERISPALMKVVKDLSK
EVFEVTALPDDEDGLMNRFKELLVSEKNKINVLLVQYRNTFYPGQDVLQDGKESIEQLLNISDTTSFYNKVKQLQNDFLD
YAEDVEPVKAFFETQRDIYDDAVKRLNIFEKNQTYVTDQKVTGFIESISKIVKHKEPYKNIHQLPGLIKEFDELFVELLE
KECEPVKNVIESDYQTVLEELNKHEEIKSMFFNKFKNSFDGIKDRLNRVNNFYEAIAMQTESDRLKVRCIDDIANEVERR
KPPVTSPCTGTTCTTVIDPIVEYKAKKKTISKNTILRGTKTIENEEDIEAVLDEIRKQLQKELEDASVIKLV
>P39914 ~~~brxC~~~Monothiol bacilliredoxin BrxC~~~COG3118
MAKQLIQSEEEFKRIAEQEGVFVFLKHSTTCPISQAAFHEFDAFANQHEDVPAYYLQVQEARPLSNFIAETYGVKHESPQ
IFIIQNGEVKWHTSHSQITEAAIEQHLS
>P0DUF8 ~~~brxC~~~Probable ATP-binding protein BrxC~~~
MNIEQIFKKPLKRNINGVVKAEQTDDASAYIELDEYVITRELENHLRHFFESYVPATGPERIRMENKIGVWVSGFFGSGK
SHFIKILSYLLSNRKVTHNGTERNAYSFFEEKIKDALFLADINKAVHYPTEVILFNIDSRANVDDKEDAILKVFLKVFNE
RIGYCADFPHIAHLERELDKRGQYETFKAAFADINGSRWEDERDAYYFISDDMAQALSQATQQSLEASRQWVEQLDKNFP
LDINNFCQWVKEWLDDNGKNILFMVDEVGQFIGKNTQMMLKLQTITENLGVICGGRAWVIVTSQADINAAIGGMSSRDGQ
DFSKIQGRFSTRLQLSSSNTSEVIQKRLLVKTDEAKAALAKVWQEKGDILRNQLAFDTTTTTALRPFTSEEEFVDNYPFV
PWHYQILQKVFESIRTKGAAGKQLAMGERSQLEAFQTAAQQISAQGLDSLVPFWRFYAAIESFLEPAVSRTITQACQNGI
LDEFDGNLLKTLFLIRYVETLKSTLDNLVTLSIDRIDADKVELRRRVEKSLNTLERLMLIARVEDKYVFLTNEEKEIENE
IRNVDVDFSAINKKLASIIFDDILKSRKYRYPANKQDFDISRFLNGHPLDGAVLNDLVVKILTPKDPTYSFYNSDATCRP
YTSEGDGCILIRLPEEGRTWSDIDLVVQTEKFLKDNAGQRPEQATLLSEKARENSNREKLLRVQLESLLAEADVWAIGER
LPKKSSTPSNIVDEACRYVIENTFGKLKMLRPFNGDISREIHALLTVENDTELDLGNLEESNPDAMREVETWISMNIEYN
KPVYLRDILNHFARRPYGWPEDEVKLLVARLACKGKFSFSQQNNNVERKQAWELFNNSRRHSELRLHKVRRHDEAQVRKA
AQTMADIAQQPFNEREEPALVEHIRQVFEEWKQELNVFRAKAEGGNNPGKNEIESGLRLLNAILNEKEDFALIEKVSSLK
DELLDFSEDREDLVDFYRKQFATWQKLGAALNGSFKSNRSALEKDAAAVKALGELESIWQMPEPYKHLNRITLLIEQVQN
VNHQLVEQHRQHALERIDARIEESRQRLLEAHATSELQNSVLLPMQKARKRAEVSQSIPEILAEQQETKALQMDADKKIN
LWIDELRKKQEAQLRAANEAKRAAESEQTYVVVEKPVIQPVPKKTHLVNVASEMRNATGGEVLETTEQVEKALDTLRTTL
LAAIKAGDRIRLQ
>P0DUF5 ~~~brxL~~~Lon-like protease BrxL~~~
MEDLNIKLNAHFAGKVVRKDLTKKIKEGANVPVYVLEYLLGMYCATDDEKSMNDGVQMVKKILSDNFVRPDEAEKVKSKV
KELGKYTVIDKIGVKLNDKKDIYEAEFSNLGLNGVPISSHYVKEFDKLLAGGIWCIVKMEYYFDEESKGTSPFSIESVTP
IQMPNMDLEEMFEQRRQFSKEEWIDVLIRSTGMEPTQLEDTVKWHLLERMVPLVENNYNLCELGPRGTGKSHIYKEISPN
SILVSGGQTTVANLFYNMSTRKIGLVGMWDTVAFDEVAGITFKDKDGIQIMKDYMASGSFARGREEKNASASMVFVGNIN
QSVDVLLKTSHLFDPFPEAMAYDSAFFDRMHYYLPGWEIPKMRPEFFTNEYGFITDYLAEFLREMRKRSFSDAIDKYFRL
GNNLNQRDVIAVRKTVSGLIKLLYPNGEYIKEDVEEVLRYALIGRRRVKEQLKKIGGMEFYDVNFSYIDNESMNEEFVSV
PEQGGGTLIPEGMNKPGHIYTVARGKTGMIGTYKLETEVVSGNGKFEKTGLNSDRDAKESIDTAFRFFKANNKNISGTIS
TTTKDYLMHIQDIHGVGLTGELSLAAFIALCSGALNKPVQSQMVVLGSISISGTINKVEELANVLQVCFDSGAKKILLPM
VSAVDIPTVPPELFAKFQIGFYQSAEDAVFKALGVE
>P0DUG1 ~~~brxL~~~Lon-like protease BrxL~~~
MQTHHDLPVSGVSAGEIASEGYDLDALLNQHFAGRVVRKDLTKQLKEGANVPVYVLEYLLGMYCASDDDDVVEQGLQNVK
RILADNYVRPDEAEKVKSLIRERGSYKIIDKVSVKLNQKKDVYEAQLSNLGIKDALVPSQMVKDNEKLLTGGIWCMITVN
YFFEEGQKTSPFSLMTLKPIQMPNMDMEEVFDARKHFNRDQWIDVLLRSVGMEPANIEQRTKWHLITRMIPFVENNYNVC
ELGPRGTGKSHVYKECSPNSLLVSGGQTTVANLFYNMASRQIGLVGMWDVVAFDEVAGITFKDKDGVQIMKDYMASGSFS
RGRDSIEGKASMVFVGNINQSVETLVKTSHLLAPFPAAMIDTAFFDRFHAYIPGWEIPKMRPEFFTNRYGLITDYLAEYM
REMRKRSFSDAIDKFFKLGNNLNQRDVIAVRRTVSGLLKLMHPDGAYSKEDVRVCLTYAMEVRRRVKEQLKKLGGLEFFD
VNFSYIDNETLEEFFVSVPEQGGSELIPAGMPKPGVVHLVTQAESGMTGLYRFETQMTAGNGKHSVSGLGSNTSAKEAIR
VGFDYFKGNLNRVSAAAKFSDHEYHLHVVELHNTGPSTATSLAALIALCSILLAKPVQEQMVVLGSMTLGGVINPVQDLA
ASLQLAFDSGAKRVLLPMSSAMDIPTVPAELFTKFQVSFYSDPVDAVYKALGVN
>A0A0P0C3P7 ~~~~~~Bacteriocin BacSp222~~~
MAGLLRFLLSKGRALYNWAKSHVGKVWEWLKSGATYEQIKEWIENALGWR
>P99097 ~~~bsaA~~~Glutathione peroxidase homolog BsaA~~~
METIYDFVVETNKGVTYKLDAYKGDVMLIVNTASECGFTSQFEGLQSLYEKYKDQGFVILGFPCNQFGGQEPGSGEEAAQ
NCKLNYGVTFPMHQKIDVKGEHQLPLFRYLTAAQHGFFNEKIKWNFTKFLVDREGNVVKRFAPQKKPVQIEREIEKLL
>P25152 3.4.11.6~~~ywaD~~~Aminopeptidase YwaD~~~COG2234
MKKLLTVMTMAVLTAGTLLLPAQSVTPAAHAVQISNSERELPFKAKHAYSTISQLSEAIGPRIAGTAAEKKSALLIASSM
RKLKLDVKVQRFNIPDRLEGTLSSAGRDILLQAASGSAPTEEQGLTAPLYNAGLGYQKDFTADAKGKIALISRGDLTYYE
KAKNAEAAGAKAVIIYNNKESLVPMTPNLSGNKVGIPVVGIKKEDGEALTQQKEATLKLKAFTNQTSQNIIGIKKPKNIK
HPDIVYVTAHYDSVPFSPGANDNGSGTSVMLEMARVLKSVPSDKEIRFIAFGAEELGLLGSSHYVDHLSEKELKRSEVNF
NLDMVGTSWEKASELYVNTLDGQSNYVWESSRTAAEKIGFDSLSLTQGGSSDHVPFHEAGIDSANFIWGDPETEEVEPWY
HTPEDSIEHISKERLQQAGDLVTAAVYEAVKKEKKPKTIKKQMKAKASDIFEDIK
>Q9Z8L0 4.1.1.61~~~~~~4-hydroxybenzoate decarboxylase subunit C~~~COG0043
MSFLRRHISLFRSQKQLIDVFAPVSPNLELAEIHRRVIEDQGPALLFHNVIGSSFPVLTNLFGTKHRVDQLFSQAPDNLI
ARVAHLISSTPKLSSLWKSRDLLKRISSLGLKKARFRRFPFVSMSSVNLDHLPLLTSWPEDGGAFLTLPLVYTESPTLTT
PNLGMYRVQRFNQNTMGLHFQIQKGGGMHLYEAEQKKQNLPVSVFLSGNPFLTLSAIAPLPENVSELLFATFLQGAKLLY
KKTNDHPHPLLYDAEFILVGESPAGKRRPEGPFGDHFGYYSLQHDFPEFHCHKIYHRKDAIYPATVVGKPYQEDFYIGNK
LQEYLSPLFPLVMPGVRRLKSYGESGFHALTAAVVKERYWRESLTTALRILGEGQLSLTKFLMVTDQEVPLDRFSVVLET
ILERLQPDRDLIIFSETANDTLDYTGPSLNKGSKGIFMGIGKAIRDLPHGYQGGKIHGVQDIAPFCRGCLVLETSLEDRC
IKSLLHHPDLKSWPLIILADNLRETIQSEKDFLWRTFTRCAPANDLHALHSHFATHRPNYNFPFVIDALMKPSYPKEVEV
DPSTKQKVSERWHAYFPNKETFYI
>C0H3U9 ~~~bsdD~~~Protein BsdD~~~
MHTCPRCDSKKGEVMSKSPVEGAWEVYQCQTCFFTWRSCEPESITNPEKYNPAFKIDPKETETAIEVPAVPERKA
>Q81ST7 2.4.1.-~~~bshA~~~N-acetyl-alpha-D-glucosaminyl L-malate synthase~~~COG0438
MKLKIGITCYPSVGGSGVVGTELGKQLAERGHEIHFITSGLPFRLNKVYPNIYFHEVTVNQYSVFQYPPYDLALASKMAE
VAQRENLDILHVHYAIPHAICAYLAKQMIGERIKIVTTLHGTDITVLGSDPSLNNLIRFGIEQSDVVTAVSHSLINETHE
LVKPNKDIQTVYNFIDERVYFKRDMTQLKKEYGISESEKILIHISNFRKVKRVQDVVQAFAKIVTEVDAKLLLVGDGPEF
CTILQLVKNLHIEDRVLFLGKQDNVAELLAMSDLMLLLSEKESFGLVLLEAMACGVPCIGTRVGGIPEVIQHGDTGYLCE
VGDTTGVADQAIQLLKDEELHRNMGERARESVYEQFRSEKIVSQYETIYYDVLRDDKNGKI
>P42982 2.4.1.-~~~bshA~~~N-acetyl-alpha-D-glucosaminyl L-malate synthase~~~COG0438
MRKLKIGITCYPSVGGSGIIATELGKQLAEKGHEIHFITSSIPFRLNTYHPNIHFHEVEVNQYAVFKYPPYDLTLASKIA
EVAERENLDIIHAHYALPHAVCAYLAKQMLKRNIGIVTTLHGTDITVLGYDPSLKDLIRFAIESSDRVTAVSSALAAETY
DLIKPEKKIETIYNFIDERVYLKKNTAAIKEKHGILPDEKVVIHVSNFRKVKRVQDVIRVFRNIAGKTKAKLLLVGDGPE
KSTACELIRKYGLEDQVLMLGNQDRVEDLYSISDLKLLLSEKESFGLVLLEAMACGVPCIGTNIGGIPEVIKNNVSGFLV
DVGDVTAATARAMSILEDEQLSNRFTKAAIEMLENEFSSKKIVSQYEQIYADLAEPE
>Q81ST8 3.5.1.-~~~bshB1~~~N-acetyl-alpha-D-glucosaminyl L-malate deacetylase 1~~~COG2120
MSGLHILAFGAHADDVEIGMAGTIAKYTKQGYEVGICDLTEADLSSNGTIELRKEEAKAAARIMGVKTRLNLAMPDRGLY
MKEEYIREIVKVIRTYKPKLVFAPYYEDRHPDHANCAKLVEEAIFSAGIRKYMPEVPPHRVESFYHYMINGFHKPNFCID
ISEYVSQKVEALEAYESQFSTGSDGVKTPLTEGYVETVVAREKMFGKEVGVLYAEGFMSKKPVLLHADLIGGCK
>Q81FP2 3.5.1.-~~~bshB1~~~N-acetyl-alpha-D-glucosaminyl L-malate deacetylase 1~~~
MSGLHILAFGAHADDVEIGMAGTIAKYTKQGYEVGICDLTEADLSSNGTIELRKEEAKVAARIMGVKTRLNLAMPDRGLY
MKEEYIREIVKVIRTYKPKLVFAPYYEDRHPDHANCAKLVEEAIFSAGIRKYMPELSPHRVESFYNYMINGFHKPNFCID
ISEYLSIKVEALEAYESQFSTGSDGVKTPLTEGYVETVIAREKMFGKEVGVLYAEGFMSKKPVLLHADLLGGCK
>P42981 3.5.1.-~~~bshB1~~~N-acetyl-alpha-D-glucosaminyl L-malate deacetylase 1~~~COG2120
MYNADVLAFGAHSDDVEIGMGGTIAKFVKQEKKVMICDLTEAELSSNGTVSLRKEEAAEAARILGADKRIQLTLPDRGLI
MSDQAIRSIVTVIRICRPKAVFMPYKKDRHPDHGNAAALVEEAIFSAGIHKYKDEKSLPAHKVSKVYYYMINGFHQPDFV
IDISDTIEAKKQSLNAYKSQFIPSKDSVSTPLTNGYIEIVEAREKLYGKEAGVEYAEGFFSKRMLMLDHDVLGGEQ
>Q81WT0 3.5.1.-~~~bshB2~~~Probable N-acetyl-alpha-D-glucosaminyl L-malate deacetylase 2~~~COG2120
MKNERHVLIVFPHPDDESYCVAGTILAYTQRNVPLTYVCLTLGEMGRAMGNPPFATRESLYAIREKELKRATNILGIKDL
RMMGYRDKTLEFETPGELRRVIQKCVEELNPSLVISFYPGYAVHPDHDATGEAVAEALATIPENKRPTFYAVAFANNHEA
EIGPPHVKNEVKEYVPKKLEALQAHASQFATKVTELKREYEDGVTETVEWLEREPFWIYPFKDKNK
>Q81AU5 3.5.1.-~~~bshB2~~~Probable N-acetyl-alpha-D-glucosaminyl L-malate deacetylase 2~~~
MERHVLVVFPHPDDEAYAAGGTIRLLTDQGVPVTYACGTLGQMGRNMGKNVFANRETIPHIRKKELKDACEAMGIKDLRM
LGFHDKTLEFEDVDFVADKIEAIIQEVNPSRIITFYPEHGVHPDHNAFGRAVVRAVSRMPKEERPVIHAVAITKNREAVL
GEPDVVNNISEVFDHKLTALGAHRSQTEAMLEDTHAKIKNKDAATLKWLQLEQFWTYKWE
>P55342 6.-.-.-~~~bshC~~~Putative cysteine ligase BshC~~~COG4365
MQLTELSIKNQNVFVQHYIDGKEEMSSFFDYSIHHKDMWRERLEDLSSRFFAREELAAYLTSYHNKFGSSAMQSAIEKLK
DPSSAAVVGGQQAGLLTGPLYTIHKIISIIVLAKQQEKELQVPVIPIFWVAGEDHDLDEINFVHTSEENGPVKKKLPQSY
WKKSSAASTSLDQEKCAAWIDDVFAAFEETDHTNTLLDNVKRCLRESVTFTDFFELLIADLFQEEGLVLLNSGDPGIKKL
ETAMFQKILRENDELARAVSDQQAFMRQAGYKPIIESGKEQANLFYEYEDERFLIEKDNGRFVIKELDLGWTRDELHTHM
EEHPERFSNNVVTRPLMQEFLIPTLAFIAGPGEINYWGELKQAFAVMGFKMPPVMPRLNITILERHIEKKLAERNISLQD
AIERGTENQRETYFERQIPEEFTAVMDQAKSQIEAIHKTVRQEALKVDQSLEPLLLKNAAFIQDQLQFLERTVMKRIEEK
EGYVLKDYERIQNSIKPLLAPQERIWNIMYYLNRYGPKFFTTFKNLPFSFQNQHQVVKL
>P71014 ~~~bslA~~~Biofilm-surface layer protein A~~~
MKRKLLSSLAISALSLGLLVSAPTASFAAESTSTKAHTESTMRTQSTASLFATITGASKTEWSFSDIELTYRPNTLLSLG
VMEFTLPSGFTANTKDTLNGNALRTTQILNNGKTVRVPLALDLLGAGEFKLKLNNKTLPAAGTYTFRAENKSLSIGNKFY
AEASIDVAKRSTPPTQPCGCN
>P39632 ~~~bslB~~~Probable biofilm-surface layer protein B~~~
MLKRTSFVSSLFISSAVLLSILLPSGQAHAQSASIEAKTVNSTKEWTISDIEVTYKPNAVLSLGAVEFQFPDGFHATTRD
SVNGRTLKETQILNDGKTVRLPLTLDLLGASEFDLVMVRKTLPRAGTYTIKGDVVNGLGIGSFYAETQLVIDPR
>P39297 ~~~bsmA~~~Lipoprotein BsmA~~~COG3650
MVSRKRNSVIYRFASLLLVLMLSACSALQGTPQPAPPVTDHPQEIRRDQTQGLQRIGSVSTMVRGSPDDALAEIKAKAVA
AKADYYVVVMVDETIVTGQWYSQAILYRK
>Q03091 3.1.-.-~~~bsn~~~Extracellular ribonuclease~~~COG2356
MTKKLWFLPIVCLFFILGWTAPSASAGAPADTNLYSRLAVSTAGGTTLFPQTSSAVITPSADTETYYKEASGKSGTALKS
ALHRIISGHTKLSYSQVWNALKETDEDPANPNNVILLYTQESRAKSKNGGSVGDWNREHVWAKSHGNFGTAAGPGTDIHH
LRPADVQVNSARGNMDFDNGGSEYPKAPGNYYDGDSWEPRDEVKGDVARMLFYMAVRYEGGDGYPDLELNDKTGNGSAPY
MGKLSVLLKWNKQDPVDSKEKRRNEIIYEDYQHNRNPFIDHPEWADEIW
>A0A2K4Z9J5 ~~~bsrE~~~Small toxic protein BsrE~~~
MSTFQALMLMLAIGSFIIALLTYIEKIDLP
>L8EAY0 ~~~bsrG~~~Small toxic protein BsrG~~~
MTVYESLMIMINFGGLILNTVLLIFNIMMIVTSSQKKK
>A0A2K4Z9K4 ~~~bsrH~~~Probable small toxic protein BsrH~~~
MHVSTFQALMLMLAFGSFIIALLTYIKKK
>A0KLG5 5.1.1.10~~~bsr~~~Broad specificity amino-acid racemase~~~COG0787
MHKKTLLATLILGLLAGQAVAAPYLPLASDHRNGEVQTASNAWLEVDLGAFEHNIQTLKDRLGDKGPKICAIMKADAYGH
GIDLLVPSVVKAGIPCIGIASNEEARVAREKGFTGRLMRVRAATPAEVEQALPYKMEELIGSLVSAQGIADIAQRHHTNI
PVHIALNSAGMSRNGIDLRLADSKEDALAMLKLKGITPVGIMTHFPVEEKEDVKMGLAQFKLDSQWLLEAGKLDRSKITI
HAANSFATLEVPDAYFDMVRPGGLLYGDSIPSYTEYKRVMAFKTQVASVNHYPAGNTVGYDRTFTLKRDSWLANLPLGYS
DGYRRALSNKAYVLIQGQKVPVVGKTSMNTIMVDVTDLKGVKPGDEVVLFGRQGEAEVKQADLEEYNGALLADMYTIWGY
TNPKKIKR
>P33967 3.5.4.23~~~bsr~~~Blasticidin-S deaminase~~~
MKTFNISQQDLELVEVATEKITMLYEDNKHHVGAAIRTKTGEIISAVHIEAYIGRVTVCAEAIAIGSAVSNGQKDFDTIV
AVRHPYSDEVDRSIRVVSPCGMCRELISDYAPDCFVLIEMNGKLVKTTIEELIPLKYTRN
>Q88GJ9 5.1.1.10~~~alr~~~Broad specificity amino-acid racemase~~~COG0787
MPFRRTLLAASLALLITGQAPLYAAPPLSMDNGTNTLTVQNSNAWVEVSASALQHNIRTLQAELAGKSKLCAVLKADAYG
HGIGLVMPSIIAQGVPCVAVASNEEARVVRASGFTGQLVRVRLASLSELEDGLQYDMEELVGSAEFARQADAIAARHGKT
LRIHMALNSSGMSRNGVEMATWSGRGEALQITDQKHLKLVALMTHFAVEDKDDVRKGLAAFNEQTDWLIKHARLDRSKLT
LHAANSFATLEVPEARLDMVRTGGALFGDTVPARTEYKRAMQFKSHVAAVHSYPAGNTVGYDRTFTLARDSRLANITVGY
SDGYRRVFTNKGHVLINGHRVPVVGKVSMNTLMVDVTDFPDVKGGNEVVLFGKQAGGEITQAEMEEINGALLADLYTVWG
NSNPKILVD
>I6LNY0 5.1.1.10~~~bar~~~Broad specificity amino-acid racemase~~~
MPFRRTLLAASLVLLITGQAPLYAAPPLSMDNGTNALTVQNSNAWVEVSASALQHNIRTLQAELAGKSRLCAVLKADAYG
HGIGLVMPSIIAQGVPCVAVASNEEARVVRASGFTGQLVRVRAASLSELEDALQYDMEELVGSAEFARQADAIAARHGKT
LRIHLAFNSSGMSRNGVEMATWSGRGEALQITDQKHLELVALMTHFAVEDKDDVRKGLAAFNEQTDWLIKHARLDRSKLT
LHAANSFATLEVPEARLDMVRTGGALFGDTVPGRTEYKRAMQFKSRVAAVHSYPAGNTVGYDRTFTLARDSRLANITVGY
SDGYRRVFTNKGHVLINGHRVPVVGKVSMNTLMVDVTDFPDVKGGNEVVLFGKQAGGEITQAEMEEINGALLADLYTVWG
SSNPKILVD
>I0J1I6 5.1.1.10~~~argR~~~Broad specificity amino-acid racemase~~~
MPFSRTLLALSLGMALLQNPAFAAPPLSMTDGVAQVNTQDSNAWVEINKAAFEHNIRTLQTALAGKSQICAVLKADAYGH
GIGLLMPSVIAMGVPCVGVASNEEARVVRESGFKGQLIRVRTAALSELEAALPYNMEELVGNLDFAVKASLIAEDHGRPL
VVHLGLNSSGMSRNGVDMTTAQGRRDAVAITKVPNLEVRAIMTHFAVEDAADVRAGLKAFNQQAQWLMNVAQLDRSKITL
HAANSFATLEVPESHLDMVRPGGALFGDTVPSHTEYKRVMQFKSHVASVNSYPKGNTVGYGRTYTLGRDSRLANITVGYS
DGYRRAFTNKGIVLINGHRVPVVGKVSMNTLMVDVTDAPDVKSGDEVVLFGHQGKAEITQAEIEDINGALLADLYTVWGN
SNPKILKDQ
>Q9KSE5 5.1.1.10~~~bsrV~~~Broad specificity amino-acid racemase~~~COG0787
MHFKATLLSLSIAATLPSFSLSAAPLHIDTALPDAAQIQQSNSWLEISLGQFQSNIEQFKSHMNANTKICAIMKADAYGN
GIRGLMPTIIAQGIPCVGVASNAEARAVRESGFKGELIRVRSASLSEMSSALDLNIEELIGTHQQALDLAELAKQSGKTL
KVHIALNDGGMGRNGIDMTTEAGKKEAVSIATQPSLSVVGIMTHFPNYNADEVRAKLAQFKESSTWLMQQANLKREEITL
HVANSYTALNVPEAQLDMVRPGGVLFGDLPTNPEYPSIVSFKTRVSSLHHLPKDSTVGYDSTFTTSRDSVLANLPVGYSD
GYPRKMGNKAEVLINGQRAKVVGVTSMNTTVVDVTEIKGVLPGQEVVLFGQQQKQSIAVSEMENNAELIFPELYTLWGTS
NPRFYVK
>O87943 4.1.99.11~~~bssA~~~Benzylsuccinate synthase alpha subunit~~~
MSDVQTLEYKGKVVQFAPENPREAEIPADELHEHLQNPSTERTRRLKARCRWKHAAAGEFCEKGVTAGIERMRLLTESHW
ATRGEPEPIRRAHGLKNILDKSTLVLQTDEFIVGYHAEDPNMFPLYPELSYMAVQDYLKSKYSPQPAKEAQEIVDYWKPF
SLQARCEPYFDPVDLHRGYQVSTIEGPVFATGYNSVIPPYETVLEDGLQARIALAEEKIEHARAEMEKFPWHAPSGLEWI
DKIDNWKAMVIACKAVIAWARRHARLCKIVAEHFETDPKRKAELLEIADICQRMPAEPARGLKDAMQSKWFTFLICHAIE
RYASGFAQKEDSLLWPYYKASVIDKTFQPMEHKDAVELIEMERLKVSEHGAGKSRAYREIFPGSNDLFILTLGGTNGDGS
DACNDMTDAILEATKRIRTTEPSIVFRYSKKNRAKTLRWVFECIRDGLGYPSIKHNELGVQQMLEMAKYSRNGNGATPEE
AHYWVNVLCMAPGLAGRRKAQKTRSEGGSAIFPAKLLEITLNNGYDWSYADMQMGPETGYAKDFATFDQLWEAFRKQYQY
AIALAIRCKDVSRTMECRFLQMPFVSALDDGCMELGMDANALSEQPNGWHNPITSIVAGNSLVAIKKLIYDEKKYTMAQL
MDALQANWEGYEEMRRDFKNAPKWGNDDDDADVLISRFYEEILGGEMMKNINYSGGPVKPTGQAVGLYMEVGSRTGPTPD
GRFGGEAADDGGISPYSGTDKKGPTAVLRSVSKVQKNQKANLLNQRLSVPIMRSKHGFDIWHAYMDTWHDLNIDHVQFNV
VSTEEMKAAQREPEKHQDLIVRVSGFSARFVDIPTYGQNTIIARNEQNFNAQDLEFLNVEL
>O87944 4.1.99.11~~~bssB~~~Benzylsuccinate synthase beta subunit~~~
MSATPHTQVHWEENTARPCRKCKWQTPDPTDPLRGQCTVNRHAMGGVWKRWIRDVEHMTCSRHEEGELSFRDHV
>O87942 4.1.99.11~~~bssC~~~Benzylsuccinate synthase gamma subunit~~~
MTTCKDCAFFFSIPEDADDFEKSKGDCVTQKDDEKGRYWLSKPVFENDQCCGAFHKR
>O87941 1.97.1.-~~~bssD~~~Benzylsuccinate synthase activating enzyme~~~
MKIPLITEIQRFSLQDGPGIRTTIFLKGCPLRCPWCHNPETQDARQEFYFYPDRCVGCGRCVAVCPAETSRLVRNSDGRT
IVQIDRTNCQRCMRCVAACLTEARAIVGQHMSVDEILREALSDSAFYRNSGGGVTISGGDPLYFPDFTRQLASELHARGV
HVAIETSCFPKQGKVVESMIGIVDLFIVDLKTLDAHKHLDVIGWPLAPILANLETLFAAGAKVRIHIPVIPGFNDSHADI
DAYAEYLGKHAAAISGIDLLNFHCYGEGKYTFLGRAGSYQYSGVDETPAEKIVPLAQALKARGLAVTIGGIVGIANGKNE
LTGDIALEVHH
>Q8VVE4 ~~~bssE~~~Putative chaperone BssE~~~
MKNSGLLNSVHVPAADPYYYLNTETLSLLNRIQRISQKHPVNVLVIGKQGCGKSSLVRQYAAVHHLPLATFQIGLLSEPG
QLFGEYALENGETRYKQFLFPQAIQTPGCVIHLEEINRPEHPKALNMLFSILSDDRQVWMDELGLLKVADGVVFFATLNE
GDEFVGTELLDPALRDRFYVTAMDFLPNDVEREVLQKKTGVTIAQAEEIIGVVNSLRASPELGVEVSTRKTLMIGEMIAA
GGSLREAIAASLQTDRETLESVLLSLHVELGKTERGTTEYVLFTPR
>P0AB33 ~~~bssS~~~Biofilm regulator BssS~~~
MEKNNEVIQTHPLVGWDISTVDSYDALMLRLHYQTPNKSEQEGTEVGQTLWLTTDVARQFISILEAGIAKIESGDFQVNE
YRRH
>K4JY29 ~~~bstA~~~Bottromycin D~~~
MGPAVVFDCMTADFLNDDPNNAELSSLEMEELESWGAWSDDTDQSV
>O67791 3.1.3.11~~~suhB~~~Fructose-1,6-bisphosphatase/inositol-1-monophosphatase~~~COG0483
MENLKKYLEVAKIAALAGGQVLKENFGKVKKENIEEKGEKDFVSYVDKTSEERIKEVILKFFPDHEVVGEEMGAEGSGSE
YRWFIDPLDGTKNYINGFPIFAVSVGLVKGEEPIVGAVYLPYFDKLYWGAKGLGAYVNGKRIKVKDNESLKHAGVVYGFP
SRSRRDISIYLNIFKDVFYEVGSMRRPGAAAVDLCMVAEGIFDGMMEFEMKPWDITAGLVILKEAGGVYTLVGEPFGVSD
IIAGNKALHDFILQVAKKYMEVAV
>O33832 3.1.3.11~~~suhB~~~Fructose-1,6-bisphosphatase/inositol-1-monophosphatase~~~COG0483
MDRLDFSIKLLRKVGHLLMIHWGRVDNVEKKTGFKDIVTEIDREAQRMIVDEIRKFFPDENIMAEEGIFEKGDRLWIIDP
IDGTINFVHGLPNFSISLAYVENGEVKLGVVHAPALNETLYAEEGSGAFFNGERIRVSENASLEECVGSTGSYVDFTGKF
IERMEKRTRRIRILGSAALNAAYVGAGRVDFFVTWRINPWDIAAGLIIVKEAGGMVTDFSGKEANAFSKNFIFSNGLIHD
EVVKVVNEVVEEIGGK
>A0A0H3G586 ~~~btaE~~~Autotransporter adhesin BtaE~~~
MFGLSVNHAYAGPGIFINDGTDDGCIWTFDKEDYSPIGDYFGNTAPADKDSAGRNSPASVKYHIPSIQQLGGAATLKCLS
KDRDTQTDRVLFYGNSKEQGSISLTLGGELFVNNGNLGLGGGTDTKAMRIGSMATLTGPSGLRSLAIGAGEIATVASGDD
AIAIGTAAQAAHVGSIALGLQSTTELPSLVKDVTINGIKLSAFAGSNPASVLSIGNDTLKRSITNVGAGRVSKDSTDAVN
GRQLFAVSEQAASGWSLTVNGMDKSRVGPGDTVDLSNSDGNLVLSKKGKDVTFNLASDLKVTSLVAGNTFLDTNGLVITG
GPSMTVSGIDAGQLKISHVADGAVTVTSTDAVNGSQLHRVAHTIAEHLGGDAHVNADGSVIGPQYTVQKKRYKTIYDAFG
GVDENLANINDILHDIESGGGIKYFHANSIGADSRALGTNSIAVGSDSVASGEGSISVGNGAQASAHGSVALGENAAAPD
ANSVALGAGSKTSEVVATKGTTINGQYYDFAGDAPSGTVSVGDKGAERTITNVAAGRISVESTDAVNGSQLNAVNQAIEN
LAAGVTENDKFSVKYDRHSDGTKKNSMTLQGWDSATPVVLANVADGVHKNDAVNVSQLKAGLSTTLGEAKAYTDQTALQT
LDQANAYTDKKFGKLNEDIVATRIEARQAAAIGLAAASLRYDDRPGKISAAIGGGFWRGEGAVALGLGHTSEDQRMRSNL
SAATSGGNWGMGAGFSYTFN
>A0A0H3G4K1 ~~~btaF~~~Autotransporter adhesin BtaF~~~
MKLPPVFVFELVENQGLANIALIRPRVIAPDNNLRPGGIVSGIAGLLTLGQENRNLISENRQVINNNTTAIGQNSDRIDA
NAKGVADNRAAIGQNSGRIDANAKGVADNKAAIGRNSGRIDANAKGVADNKTAIGRNSGRIDTNAKGVADNRAAISQNRG
RINANAAGVASNRAAIRQNSAAISALGQRVDGLQGQINSARKEARAGAANAAALSGLRYDNRPGKVSIATGVGGFKGSTA
LAAGIGYTSKNENARYNVSVAYNEAGTSWNAGASFTLN
>P9WPQ1 ~~~~~~Biotinylated protein TB7.3~~~COG1038
MAEDVRAEIVASVLEVVVNEGDQIDKGDVVVLLESMKMEIPVLAEAAGTVSKVAVSVGDVIQAGDLIAVIS
>Q4H4F5 2.6.1.93~~~btrB~~~Neamine transaminase BtrB~~~
MKQETVKSSEQLLSVLGTYIDSPVDPFRKERVMFSRGSGAYLFDYDGGNYIDLMNGKGSIILGHNDPSVNAALRNFLEQD
REVVTGPSKPIIDLAERIKKDSALPDAKVSFYTTGTAACRAAVYAARDYSGKKIVLSSGYHGWDPMWRQQGPLLEPNEDG
VIEFYFIPELLERALTAHKDQVALVIFSPDYTYLSASTMERILGICRAHGVLVCCDDVKQGYRHRQGSSLELVTTEKADM
YVFSKGLSNGHRISCVVSSDEIMAETKEHTYTAYYQMLPILSSLETLKKMESGKGYDLIRSYGQTLTGNLKELFVQSSLP
IEVNGSSIFQLVFGDEELEEAFYREAFIQGLILFEGDNQSLSLCMDKDVQVDLIRRFANVTDVLSEQFKHLRGKEVTTEQ
TFRTAWNMIDGASDLLPYEKQLKLLDNLIGGG
>Q4H4F3 3.5.1.112~~~btrD~~~2'-N-acetylparomamine deacetylase~~~
MNQDKRAFMFISPHFDDVILSCASTLMELMNQGHTCKVLTVFGGCPSVRFQPGEIARQYAAEDLGLFEDEIEGDHLSILV
ARRLQEDQQAFRHLPGVQVEVLSFPDAIYRENKGQPYYRTEADLFGIPDKQDEDIFLPKIETYLQSCDLARKYTWVFPAI
SKHVDHRLLTKAGLRLMSQGYPVLFYSEFPYWQQHNEFLQDGWRQLELRNSVYTPVKRAAVLEYKTQLLGLFGEEAETKI
NNGGVLSEAELFWIQETDTQAWRVFRSLSPEPLQT
>Q4H4F0 4.3.2.6~~~btrG~~~Gamma-L-glutamyl-butirosin B gamma-glutamyl cyclotransferase~~~
MISWTKAFTKPLKGRIFMPNLFVYGTLREGENNHKYMKEATLLSRKASIAGSLVDTGNGYPGLLLENQLVAGEWYEVSEE
TLKRIDELEEYFGPGDTRNLFDRIECQVNESGGTHLGWTYVYNRDDYLETRFSDWKQYRLQHASGIEEKQDVPHSL
>Q4H4E9 2.3.2.19~~~btrH~~~Ribostamycin:4-(gamma-L-glutamylamino)-(S)-2-hydroxybutanoyl-[BtrI acyl-carrier protein] 4-(gamma-L-glutamylamino)-(S)-2-hydroxybutanoate transferase~~~
MCLTRYDEKFFDCRKSQIIAYLDSQQVPVIPLFYNSYQSTAEIYRQIFIENKSKWKYSEPSFSDDDLLRKGIRPVRASFP
DFSQASDCLKDLLARHKLVFVWGDEYCLPYRKEAFQAIHSTHSLVVTGYDGENKAYYVEDWDGLYGYLPAVHLEAAFDSL
SRQMRTLLVLELNDEEMRENKQEDTDLFRKWLQAFEDDYIFYDRVLLDMRDYEENRLISMDHGLRLIAASRHVFSKFLHY
IDDAPEEVGLLIRNHQLANHIAAIVRRYIIAKQIDWDGAACKIRQLREQEDDFMRKLKSRYG
>Q4H4E7 6.2.1.39~~~btrJ~~~[Butirosin acyl-carrier protein]--L-glutamate ligase~~~
MKFTHPLDYYRLNGKQILWYMNIGEDQDSQASNYFPSVKDPQSEKIVVQQEQQLLFLARPQDTVFFHTMPEQAFLDYWKE
RRLSLPSIICCDKLSQVPDLERYTIIPFIVSDQLLELKRRYPHMDIIAPDLAVCREINHKFNTRRLMERNGFNVTTGYFC
SDIESLEHAYEQLISAGFSKCVLKVPYGSSGKGLKVIDNERNFRFLLNYIQNRQTNVDLLLEGWHPHRLSLTSQLFITEY
EVHLLAVTEQIIDPNGVYKGTNFTPALSQSEAADYREEILRAGELIRQMGYRGVLGIDSILDTNGELIPVIEINARLTQV
TYILPLVIEQKKRYEFVESRVLVFNSRADLDFEDYENDLSEVTRDLPVRIDLYNFCKASGAFKNTYKLFVLVSAHNSEQL
IKARSLLDELNTKMTTAVH
>Q2L4H3 4.1.1.95~~~btrK~~~L-glutamyl-[BtrI acyl-carrier protein] decarboxylase~~~
MNLDQAEITALTKRFETPFYLYDGDFIEAHYRQLRSRTNPAIQFYLSLKANNNIHLAKLFRQWGLGVEVASAGELALARH
AGFSAENIIFSGPGKKRSELEIAVQSGIYCIIAESVEELFYIEELAEKENKTARVAIRINPDKSFGSTAIKMGGVPRQFG
MDESMLDAVMDAVRSLQFTKFIGIHVYTGTQNLNTDSIIESMKYTVDLGRNIYERYGIVCECINLGGGFGVPYFSHEKAL
DIGKITRTVSDYVQEARDTRFPQTTFIIESGRYLLAQAAVYVTEVLYRKASKGEVFVIVDGGMHHHAASTFRGRSMRSNY
PMEYIPVREDSGRRELEKVTIAGPLCTPEDCLGKDVHVPALYPGDLVCVLNSGAYGLSFSPVHFLGHPTPIEILKRNGSY
ELIRRKGTADDIVATQLQTESNLLFVDK
>Q4H4F7 2.4.2.49~~~btrL~~~Neamine phosphoribosyltransferase~~~
MKDNPISLFHELESYVRGSKDTLHRYMYACILEQRLADLLLLSRCGVHPDDEDRLAVRLASQRSLCVQAASDLAEGTEEA
RGSTPAWPDCKVTKSSAAQKRPELLSLVKVMQELRTPECSGYDLCDLEVCREDALHWLRDHADPSSPPVLIGVRTGGAFY
APLWSSALQQRWGNKALYHTVRALRMPSDPGASLYLPEELSILPAAIEPDTDIVILEDQPHTGGTVLELAGRLSAKYRLN
KPVWVSSPGRLFQIENGRLAKCSDRIPLNTGTRKRIWQMLGNSEEVAEFLLPKLGLTDAHPADLEAVPYKSVPQWNDPMY
RREVPFRINPKKTPFYIRRKSTNEPVVFAKFIGKDLFGDFQFHQLKKFEKYFPDILAYQDGYVMTKYEPGLKEMREMFVN
MPSAVTREICQSVSGYWKTLLDSCQISGGHPVHPLADHWQARLGEMEDFIGRKLPYDLDWFERSLHTEWTSSAHVYTSLP
YANQYGHWKARLSAGQRLKTYRFHIDSTWGGTSSIEVELASFLLENRVRPDDFRLLVNQVKKEAGSLTIEAVTDALPIAC
VLQAANLLKQAKDESRLSKELILSEAMQSFEYLQAMKPKLSDVR
>Q8G907 1.1.99.38~~~btrN~~~S-adenosyl-L-methionine-dependent 2-deoxy-scyllo-inosamine dehydrogenase~~~
MDKLFSMIEVEVNSQCNRTCWYCPNSVSKRKETGEMDPALYKTLMEQLSSLDFAGRISFHFYGEPLLCKNLDLFVGMTTE
YIPRARPIIYTNGDFLTEKRLQTLTELGIQKFIVTQHAGAKHKFRGVYDQLAGADKEKVVYLDHSDLVLSNRGGILDNIP
QASKANMSCMVPSNLAVVTVLGNVLPCFEDFNQKMVMGNIGEQHISDIWHNDKFTSFRKMLKEGHRGKSDLCKNCNNVSV
QTEEQYDYVL
>Q4H4E5 1.14.14.13~~~btrO~~~4-(gamma-L-glutamylamino)butanoyl-[BtrI acyl-carrier protein] monooxygenase BtrO~~~
MIALDTIQYYGQIPGVNHYNGKKEFMENAVKIAQLSDAYGIVGSLSFFNHSVLDPWAVSSVIMRHTERHVPLIALQPYMY
PPYTAAKLIQSFTYLYDRRIDLNMITGAVTGELQQTGGYIDHSSRYKKLHEYVQVLRLLLESDSAVSFKGDYYELNNLEF
KPLLPDKRLFPRIFMSGSSEEGLETGLKAADFVVTHPGPLEHFKRHFSEKVQGSAVQSAIRIEIIARESAEQAWKIAHAR
YPGNRQGKIQLRMKTNSESSWQRMLAELALASETYDEVFWMGGYMNGGIYSPVLVGDYEQVAAYLNEYYKLGVKAVLLGS
MYSEEDFIHFSRVKEGISNPV
>Q4H4E4 3.1.3.88~~~btrP~~~5''-phosphoribostamycin phosphatase~~~
MRLILIRHAQARCNILEDDALMDAYDPHCELTEAGIGQAVKLRDEYPVSLTPSVIYSSPLKRARETAGIFRGRYPSVPFV
EDERLSELKAPESFIPPITQGQWDLYLEQRIRSPHLEIVKGLESLDVQRERIERFYKDLFRKYAEEACNIVIFTHAFSIQ
LSILFFLGLGNEQLLQWQIKASNTAMHIIHYDPTSGSFLLESLNNRSHLQTTG
>Q7WLU9 ~~~btrV~~~Putative anti-sigma factor antagonist BtrV~~~COG1366
MKLTMDKIDGMLIACLQGVVNSANAEQLEAELAAQVDKGERRVVLDLGRLDYISSAGLRVVLLVAKQLRQVQGELVLCEL
KPHVREVFEISGFLSIFPVANSREAAAAAFKTALPR
>Q4H4E3 1.14.14.13~~~btrV~~~4-(gamma-L-glutamylamino)butanoyl-[BtrI acyl-carrier protein] monooxygenase BtrO~~~
MDNKERNTLYQVVYAISGVTGEDGDELVETFKQGPMEVDKRDFFEIVQRVESLFDCTLDMNLEGPYLIHADEIVTKITKL
NV
>Q7WLV0 2.7.11.1~~~btrW~~~Serine/threonine-protein kinase BtrW~~~COG2172
MSSKTDTLELSVTATTATDALYWLEHIALRDRWSARLRFTLTLCADEALNNIVSHAFTPGHPAAIHLTLRQTRREVSLHI
ADNGAAYDPTQALSPPLARSLDDAQPGGHGLRLMRHFMHALSYQRRDGWNHLTLTSHSAPES
>P40408 ~~~btr~~~HTH-type transcriptional activator Btr~~~COG0614
MQNAVIYQPVQIEYLKKTSDLFSEQQLADSFVLIFHLKGNGYISIGTNTNPLQKKTLYVCPPNETFGFTPAADGHIDACI
IRLLSYIKETGQDIFTPCTESELAKLKLMNVSHIENLAVRLQELAALWNESSQLSQLKCVIEVQSLIYDLFTASLSDQTD
THSAIEKTKHYIETHADTKITLAQLSQMAGISAKHYSESFKKWTGQSVTEFITKTRITKAKRLMAKSNCKLKEIAHQTGY
QDEFYFSRIFKKYTGCSPTSYMKKRRKKIAAYGRGTMGHLIPLHHIPFAAALHPKWTSYYYQHYSTDIPVQLSAYRFNEK
WEENLYTLSQAEPDVIVSMDSISPEEQDRLNRIAEVMYLPSEESWRTHFLQTASFLKEESEAEKWLADYDQQTTAAKKTL
QHVQGLRFLFLRLHKQNFYLAHNRSVREVFFGDLGFSSATTADTPSEQAISLENIANYQADCMMLFLFKEPETIAYYQQL
QQTEAWQNLSAVRDNRVYLLSLDPWNEYSACGHERIVQQTVSLLSGDCP
>P0AFT5 ~~~btsR~~~Transcriptional regulatory protein BtsR~~~COG3279
MIKVLIVDDEPLARENLRVFLQEQSDIEIVGECSNAVEGIGAVHKLRPDVLFLDIQMPRISGLEMVGMLDPEHRPYIVFL
TAFDEYAIKAFEEHAFDYLLKPIDEARLEKTLARLRQERSKQDVSLLPENQQALKFIPCTGHSRIYLLQMKDVAFVSSRM
SGVYVTSHEGKEGFTELTLRTLESRTPLLRCHRQYLVNLAHLQEIRLEDNGQAELILRNGLTVPVSRRYLKSLKEAIGL
>P0AD14 2.7.13.3~~~btsS~~~Sensor histidine kinase BtsS~~~COG3275
MYDFNLVLLLLQQMCVFLVIAWLMSKTPLFIPLMQVTVRLPHKFLCYIVFSIFCIMGTWFGLHIDDSIANTRAIGAVMGG
LLGGPVVGGLVGLTGGLHRYSMGGMTALSCMISTIVEGLLGGLVHSILIRRGRTDKVFNPITAGAVTFVAEMVQMLIILA
IARPYEDAVRLVSNIAAPMMVTNTVGAALFMRILLDKRAMFEKYTSAFSATALKVAASTEGILRQGFNEVNSMKVAQVLY
QELDIGAVAITDREKLLAFTGIGDDHHLPGKPISSTYTLKAIETGEVVYADGNEVPYRCSLHPQCKLGSTLVIPLRGENQ
RVMGTIKLYEAKNRLFSSINRTLGEGIAQLLSAQILAGQYERQKAMLTQSEIKLLHAQVNPHFLFNALNTIKAVIRRDSE
QASQLVQYLSTFFRKNLKRPSEFVTLADEIEHVNAYLQIEKARFQSRLQVNIAIPQELSQQQLPAFTLQPIVENAIKHGT
SQLLDTGRVAISARREGQHLMLEIEDNAGLYQPVTNASGLGMNLVDKRLRERFGDDYGISVACEPDSYTRITLRLPWRDE
A
>P39396 ~~~btsT~~~Pyruvate/proton symporter BtsT~~~COG1966
MDTKKIFKHIPWVILGIIGAFCLAVVALRRGEHISALWIVVASVSVYLVAYRYYSLYIAQKVMKLDPTRATPAVINNDGL
NYVPTNRYVLFGHHFAAIAGAGPLVGPVLAAQMGYLPGTLWLLAGVVLAGAVQDFMVLFISSRRNGASLGEMIKEEMGPV
PGTIALFGCFLIMIIILAVLALIVVKALAESPWGVFTVCSTVPIALFMGIYMRFIRPGRVGEVSVIGIVLLVASIYFGGV
IAHDPYWGPALTFKDTTITFALIGYAFVSALLPVWLILAPRDYLATFLKIGVIVGLALGIVVLNPELKMPAMTQYIDGTG
PLWKGALFPFLFITIACGAVSGFHALISSGTTPKLLANETDARFIGYGAMLMESFVAIMALVAASIIEPGLYFAMNTPPA
GLGITMPNLHEMGGENAPIIMAQLKDVTAHAAATVSSWGFVISPEQILQTAKDIGEPSVLNRAGGAPTLAVGIAHVFHKV
LPMADMGFWYHFGILFEALFILTALDAGTRSGRFMLQDLLGNFIPFLKKTDSLVAGIIGTAGCVGLWGYLLYQGVVDPLG
GVKSLWPLFGISNQMLAAVALVLGTVVLIKMKRTQYIWVTVVPAVWLLICTTWALGLKLFSTNPQMEGFFYMASQYKEKI
ANGTDLTAQQIANMNHIVVNNYTNAGLSILFLIVVYSIIFYGFKTWLAVRNSDKRTDKETPYVPIPEGGVKISSHH
>A0A0P0FGV9 3.2.2.-~~~~~~2' cyclic ADP-D-ribose synthase BtTIR~~~
MNSSYYQNQINRLEKDIADLQKKIADENKKEIDKNKQIDSVHRTINKNTSISTLNSKQRQIDGYQKDILNCRTKIASYQK
SIATKSAELGKKRQELLKAQQSEQKKLQDDQLKFQKKLQSEIEIQKRHLETLIAQNYSTQNNKLVSTEDIPEPTKQYDFF
ISHASEDKDDIVRDLAEALRNNGFEVWYDEFELKIGDSLRKKIDYGLSNANYGIVIISPSFVKKNWTEYELNGMVAREMN
GHKVILPIWHKITKDEVLRFSPSLADKLALNTSIHTIDDIVENLKNL
>P06129 ~~~btuB~~~Vitamin B12 transporter BtuB~~~COG4206
MIKKASLLTACSVTAFSAWAQDTSPDTLVVTANRFEQPRSTVLAPTTVVTRQDIDRWQSTSVNDVLRRLPGVDITQNGGS
GQLSSIFIRGTNASHVLVLIDGVRLNLAGVSGSADLSQFPIALVQRVEYIRGPRSAVYGSDAIGGVVNIITTRDEPGTEI
SAGWGSNSYQNYDVSTQQQLGDKTRVTLLGDYAHTHGYDVVAYGNTGTQAQTDNDGFLSKTLYGALEHNFTDAWSGFVRG
YGYDNRTNYDAYYSPGSPLLDTRKLYSQSWDAGLRYNGELIKSQLITSYSHSKDYNYDPHYGRYDSSATLDEMKQYTVQW
ANNVIVGHGSIGAGVDWQKQTTTPGTGYVEDGYDQRNTGIYLTGLQQVGDFTFEGAARSDDNSQFGRHGTWQTSAGWEFI
EGYRFIASYGTSYKAPNLGQLYGFYGNPNLDPEKSKQWEGAFEGLTAGVNWRISGYRNDVSDLIDYDDHTLKYYNEGKAR
IKGVEATANFDTGPLTHTVSYDYVDARNAITDTPLLRRAKQQVKYQLDWQLYDFDWGITYQYLGTRYDKDYSSYPYQTVK
MGGVSLWDLAVAYPVTSHLTVRGKIANLFDKDYETVYGYQTAGREYTLSGSYTF
>P06609 ~~~btuC~~~Vitamin B12 import system permease protein BtuC~~~COG4139
MLTLARQQQRQNIRWLLCLSVLMLLALLLSLCAGEQWISPGDWFTPRGELFVWQIRLPRTLAVLLVGAALAISGAVMQAL
FENPLAEPGLLGVSNGAGVGLIAAVLLGQGQLPNWALGLCAIAGALIITLILLRFARRHLSTSRLLLAGVALGIICSALM
TWAIYFSTSVDLRQLMYWMMGGFGGVDWRQSWLMLALIPVLLWICCQSRPMNMLALGEISARQLGLPLWFWRNVLVAATG
WMVGVSVALAGAIGFIGLVIPHILRLCGLTDHRVLLPGCALAGASALLLADIVARLALAAAELPIGVVTATLGAPVFIWL
LLKAGR
>P06611 7.6.2.8~~~btuD~~~Vitamin B12 import ATP-binding protein BtuD~~~COG4138
MSIVMQLQDVAESTRLGPLSGEVRAGEILHLVGPNGAGKSTLLARMAGMTSGKGSIQFAGQPLEAWSATKLALHRAYLSQ
QQTPPFATPVWHYLTLHQHDKTRTELLNDVAGALALDDKLGRSTNQLSGGEWQRVRLAAVVLQITPQANPAGQLLLLDEP
MNSLDVAQQSALDKILSALCQQGLAIVMSSHDLNHTLRHAHRAWLLKGGKMLASGRREEVLTPPNLAQAYGMNFRRLDIE
GHRMLISTI
>P06610 1.11.1.24~~~btuE~~~Thioredoxin/glutathione peroxidase BtuE~~~COG0386
MQDSILTTVVKDIDGEVTTLEKFAGNVLLIVNVASKCGLTPQYEQLENIQKAWVDRGFMVLGFPCNQFLEQEPGSDEEIK
TYCTTTWGVTFPMFSKIEVNGEGRHPLYQKLIAAAPTAVAPEESGFYARMVSKGRAPLYPDDILWNFEKFLVGRDGKVIQ
RFSPDMTPEDPIVMESIKLALAK
>P37028 ~~~btuF~~~Vitamin B12-binding protein~~~COG0614
MAKSLFRALVALSFLAPLWLNAAPRVITLSPANTELAFAAGITPVGVSSYSDYPPQAQKIEQVSTWQGMNLERIVALKPD
LVIAWRGGNAERQVDQLASLGIKVMWVDATSIEQIANALRQLAPWSPQPDKAEQAAQSLLDQYAQLKAQYADKPKKRVFL
QFGINPPFTSGKESIQNQVLEVCGGENIFKDSRVPWPQVSREQVLARSPQAIVITGGPDQIPKIKQYWGEQLKIPVIPLT
SDWFERASPRIILAAQQLCNALSQVD
>A5F5P5 ~~~btuF~~~Vitamin B12-binding protein~~~COG0614
MLVIRLIACTFLFITPSLLAKPFPAERIISLAPHATEIAYAAGLGDKLVAVSEYSDYPPQALELERVANHQTINIEKILT
LKPDLIIAWPAGNPPRELAKLRQLGFTIYDSQTKTLDEIADNIEALSHYSANPEVGQKAAHDFRQRLQDLRTQYASNQPI
RYFYQLSEKPIITLAQGHWPSEVFSLCGGVNIFADSEVPYPQVSIEQVLVKQPQVIFTSEHAIANGHMWRAWQAELSAVQ
NDQVWALNADWLNRPTPRTLDAVEQVCTYLKIAQKQ
>P31570 2.5.1.17~~~btuR~~~Corrinoid adenosyltransferase CobA~~~
MSDERYQQRQQKVKDRVDARVAQAQEERGIIIVFTGNGKGKTTAAFGTAARAVGHGKNVGVVQFIKGTWPNGERNLLEPH
GVEFQVMATGFTWETQNREADTAACMAVWQHGKRMLADPLLDMVVLDELTYMVAYDYLPLEEVISALNARPGHQTVIITG
RGCHRDILDLADTVSELRPVKHAFDAGVKAQMGIDY
>Q9ZNN8 1.1.1.76~~~budC~~~L-2,3-butanediol dehydrogenase~~~
MSKVAMVTGGAQGIGRGISEKLAADGFDIAVADLPQQEEQAAETIKLIEAADQKAVFVGLDVTDKANFDSAIDEAAEKLG
GFDVLVNNAGIAQIKPLLEVTEEDLKQIYSVNVFSVFFGIQAASRKFDELGVKGKIINAASIAAIQGFPILSAYSTTKFA
VRGLTQAAAQELAPKGHTVNAYAPGIVGTGMWEQIDAELSKINGKPIGENFKEYSSSIALGRPSVPEDVAGLVSFLASEN
SNYVTGQVMLVDGGMLYN
>Q48436 1.1.1.304~~~budC~~~Diacetyl reductase [(S)-acetoin forming]~~~
MKKVALVTGAGQGIGKAIALRLVKDGFAVAIADYNDATAKAVASEINQAGGRAMAVKVDVSDRDQVFAAVEQARKTLGGF
DVIVNNAGVAPSTPIESITPEIVDKVYNINVKGVIWGIQAAVEAFKKEGHGGKIINACSQAGHVGNPELAVYSSSKFAVR
GLTQTAARDLAPLGITVNGYCPGIVKTPMWAEIDRQVSEAAGKPLGYGTAEFAKRITLGRLSEPEDVAACVSYLASPDSD
YMTGQSLLIDGGMVFN
>Q97II1 2.7.2.7~~~buk2~~~Butyrate kinase 2~~~COG3426
MKFKLLTINPGSTSTKIAVFENEKEILSETLRHSSKELEAYKNIYEQFEFRKDTILKVLKDKNFNIQNIDAVVGRGGLLK
PIVGGTYKVNEKMLKDLKAGVQGEHASNLGGIIANSIAEAFGVSAYIVDPVVVDEMEDIARFSGIPELPRKSIFHALNQK
AVAKRYAKESERDYEDLNIIVAHMGGGVSVGAHKNGKIIDVNNALDGEGAFSPERSGNLPSGDLVRLCFSGKYTEDEILK
KITGKGGFVAYHGTNNALDVQNAALEGDYDAKMTYNAMGYQVAKDIGSAAAVLDGKVDCIILTGGIAYNKLMTDFIAKKV
SFIAPITIYPGEDEMLALAEGTLRVLSGQEEAKKYK
>Q9X278 2.7.2.7~~~buk2~~~Probable butyrate kinase 2~~~COG3426
MFRILTINPGSTSTKLSIFEDERMVKMQNFSHSPDELGRFQKILDQLEFREKIARQFVEETGYSLSSFSAFVSRGGLLDP
IPGGVYLVDGLMIKTLKSGKNGEHASNLGAIIAHRFSSETGVPAYVVDPVVVDEMEDVARVSGHPNYQRKSIFHALNQKT
VAKEVARMMNKRYEEMNLVVAHMGGGISIAAHRKGRVIDVNNALDGDGPFTPERSGTLPLTQLVDLCFSGKFTYEEMKKR
IVGNGGLVAYLGTSDAREVVRRIKQGDEWAKRVYRAMAYQIAKWIGKMAAVLKGEVDFIVLTGGLAHEKEFLVPWITKRV
SFIAPVLVFPGSNEEKALALSALRVLRGEEKPKNYSEESRRWRERYDSYLDGILR
>P99120 1.1.1.304~~~butA~~~Diacetyl reductase [(S)-acetoin forming]~~~
MTNNKVALVTGGAQGIGFKIAERLVEDGFKVAVVDFNEEGAKAAALKLSSDGTKAIAIKADVSNRDDVFNAVRQTAAQFG
DFHVMVNNAGLGPTTPIDTITEEQFKTVYGVNVAGVLWGIQAAHEQFKKFNHGGKIINATSQAGVEGNPGLSLYCSTKFA
VRGLTQVAAQDLASEGITVNAFAPGIVQTPMMESIAVATAEEAGKPEAWGWEQFTSQIALGRVSQPEDVSNVVSFLAGKD
SDYITGQTIIVDGGMRFR
>A4JS72 ~~~~~~DNA-binding protein Bv3F~~~COG2916
MPVQGRENMDPKSPGYLALIAQRESLDAQIIAARKAEREVAIGQIKALMKEFDLSVLDLQERVQKRNSKRMSTVPKYRDP
ATGKTWSGRGRQPAWLGNDPAAFLIQPDLPAI
>P0A4H2 ~~~bvgA~~~Virulence factors putative positive transcription regulator BvgA~~~COG2197
MYNKVLIIDDHPVLRFAVRVLMEKEGFEVIGETDNGIDGLKIAREKIPNLVVLDIGIPKLDGLEVIARLQSLGLPLRVLV
LTGQPPSLFARRCLNSGAAGFVCKHENLHEVINAAKAVMAGYTYFPSTTLSEMRMGDNAKSDSTLISVLSNRELTVLQLL
AQGMSNKDIADSMFLSNKTVSTYKTRLLQKLNATSLVELIDLAKRNNLA
>P16575 2.7.13.3~~~bvgS~~~Virulence sensor protein BvgS~~~COG0834
MPAPHRLYPRSLICLAQALLAWALLAWAPAQASQELTLVGKAAVPDVEVALDGDDWRWLARKRVLTLGVYAPDIPPFDVT
YGERYEGLTADYMAIIAHNLGMQAKVLRYPTREQALSALESGQIDLIGTVNGTDGRQQSLRLSVPYAADHPVIVMPIGAR
HVPASNLAGQRLAVDINYLPKETLARAYPQATLHYFPSSEQALAAVAYGQADVFIGDALTTSHLVSQSYFNDVRVVAPAH
IATGGESFGVRADNTRLLRVVNAVLEAIPPSEHRSLIYRWGLGSSISLDFAHPAYSAREQQWMADHPVVKVAVLNLFAPF
TLFRTDEQFGGISAAVLQLLQLRTGLDFEIIGVDTVEELIAKLRSGEADMAGALFVNSARESFLSFSRPYVRNGMVIVTR
QDPDAPVDADHLDGRTVALVRNSAAIPLLQRRYPQAKVVTADNPSEAMLMVANGQADAVVQTQISASYYVNRYFAGKLRI
ASALDLPPAEIALATTRGQTELMSILNKALYSISNDELASIISRWRGSDGDPRTWYAYRNEIYLLIGLGLLSALLFLSWI
VYLRRQIRQRKRAERALNDQLEFMRVLIDGTPNPIYVRDKEGRMLLCNDAYLDTFGVTADAVLGKTIPEANVVGDPALAR
EMHEFLLTRVAAEREPRFEDRDVTLHGRTRHVYQWTIPYGDSLGELKGIIGGWIDITERAELLRKLHDAKESADAANRAK
TTFLATMSHEIRTPMNAIIGMLELALLRPTDQEPDRQSIQVAYDSARSLLELIGDILDIAKIEAGKFDLAPVRTALRVLP
EGAIRVFDGLARQKGIELVLKTDIVGVDDVLIDPLRMKQVLSNLVGNAIKFTTEGQVVLAVTARPDGDAAHVQFSVSDTG
CGISEADQRQLFKPFSQVGGSAEAGPAPGTGLGLSISRRLVELMGGTLVMRSAPGVGTTVSVDLRLTMVEKSVQAAPPAA
ATAATPSKPQVSLRVLVVDDHKPNLMLLRQQLDYLGQRVIAADSGEAALALWREHAFDVVITDCNMPGISGYELARRIRA
AEAAPGYGRTRCILFGFTASAQMDEAQRCRAAGMDDCLFKPIGVDALRQRLNEAVARAALPTPPSPQAAAPATDDATPTA
FSAESILALTQNDEALIRQLLEEVIRTNRADVDQLQKLHQQADWPKVSDMAHRLAGGARVVDAKAMIDTVLALEKKAQGQ
AGPSPEIDGLVRTLAAQSAALETQLRAWLEQRPHQDQP
>Q9RL17 1.14.13.-~~~~~~Baeyer-Villiger monooxygenase~~~COG2072
MAHAQELTPEALAGLRERYRRERERRVRPDGTRQYLGADAEFGFYAADPWAGESDVREPVRDRVDVAVVGGGFGGVLAGA
RLRQQGVARVRVVEKGGDFGGTWYWNRYPGIHCDIEAHVYLPMLDETGYVPEWKYAPGEEIRRHAMRIAETFDLYTDVLF
STAVTSLSWDDTTGEWIVETDRHDAFRATYVITATGVLSELKLPGIPGIERFKGHTFHTSRWDYAYTGGGPDGGLTGLAD
KRVGVVGTGATGVQVIPKLAEDAGQLHVFQRTPSSVDVRANRRTTARDVGADRAGWASERRDNFLRVVSGEAVEEDLVAD
RWTATAGLLEKLLPSFRRPDDLAAFEAAYEVADAARMNDIRARVDDLVTDPATADRLKPWYRYACKRPTFSDLYLQAFNR
DNVTLVDTADTHGIERMNERGVVVGDTEYPLDCLVFATGFSVGVSGVHSGRLPVRGRGGVRLRDAWSARGPRTLHGLTSN
GFPNLIQLGGVQSASSVNHTHVLDEHAVHGAALVAAAEAKGAVVEPTREAEDAWIATLAEHAPDHAWFHAECTPGYYNAE
GRGRPNGPTAYPHGAAAFHELLRRWREESMDELLAPRARVRAC
>Q9RKB5 1.14.13.-~~~~~~Baeyer-Villiger monooxygenase~~~COG2072
MAEHEQVHEHVRVAVIGSGFGGLGAAVRLRREGITDFVVLERAGSVGGTWRDNSYPGCACDVPSHLYSFSFAPNPEWPRT
FSGQEHIRAYLEHVADTFGLRPHLRFDSEVKRMAWDTEQLRWEIETVRGTLTADVVVSATGPLSDPKVPDIPGLDTFPGK
VFHSARWDHDYDLAGQRVAMIGTGASAIQIVPSIQPKVDRLTLFQRTPAWVMPRVDRAISGAERALHRALPATTKLRRGL
LWGIRELQVQAFTKHPNELGFVEQIAKRNMGAAIKDPALRAKLTPDYRIGCKRILLSSTYYPALAKPNVDVVASGLSEVR
GSTLVAADGTEAEADAIVFGTGFHVTDMPIAERVVGADGRTLAETWKGGMEALRGGTAAGFPNFMTVIGPNTGLGNSSMI
LMIESQLNYLADYLRQLNVLGGRTALDPRPAAVRNWNHRVQERMKRTVWNTGGCTSWYLDASGRNTTVWPGTTAEFRRET
RRVDLAEYQVLRPAPAQVGAKAAEADTGADTGADAEVSA
>U5S003 1.14.13.-~~~~~~Baeyer-Villiger monooxygenase 4~~~
MPFTLPESKIAIDIDFDPDHLRQRFEADKQARERKDQLAQFQGLDDVLEVDDSDPFSEPITREPVTEELDALVLGGGFGG
LTAGAYLTQNGVENFRLVEYGGDFGGTWYWNRYPGVQCDIESHIYMPLLEETGYVPSQRYADGSEIFEHAQRIGRHYGLY
DRTYFQTRATHARWDEQIQRWEVTTDRGDRFVTRVLLRSNGALTKPQLPKVPGIGDFEGKIFHTSRWDYGYTGGSAAGDL
AHLRDKRVAVVGTGATGVQVVPYLAQDAKELVVVQRTPSVVQPRNNRKTDPEWVASLTPGWQYERHDNFNGIISGHEVEG
NLVDDGWTHLFPELTGQHLVDVPVGELPEGDQALVAELADMNLLMSAHARVDSIVTDPATADGLKPWFGYMCKRPCFNDE
YLEAFNRPNVTLAASPAGIDGITSSGIVVAGTHYEVDCIIFATGFETGSGPAGIYGYDVIGREGHSMQEYFSEGARTFHG
FFTHGFPNFVELGMSQTAYYVNFVYMLDRKARHAARLVRHLLDSGIGTFEPTAEAEADWVAEVRRSNEPREAYWGACTPG
YYNGQGEVSKAVFRDVYNSSEIDFWNMIEAWWNSGRFEGLVFEPARDAVPVA
>A7HU16 1.14.13.-~~~~~~Baeyer-Villiger monooxygenase~~~COG2072
MSSVQSSQTQKNDDAEVFDALIVGAGFNGIYQLHRLRQEGFKVRLFEAGADMGGIWYWNCYPGARVDSHIPIYEFSIEEL
WRDWNWTERFPAWDELRRYFHYVDKKLDLSRDIRFGMRVSAAEFDEARDQWVIRTTDGTVVRARFFILCTGFASKPYIPN
YKGLESFAGESFHTGLWPQEGASFTGKRVGVVGTGASGVQVVQEASKDAAHLTVFQRTPILALPMQQRKLDVETQQRMKA
DYPEIFRIRRETFGGFDILRDERSALEVPPEERCALYEKLWQKGGFHYWIGGFSDILTNEEANRTMYDFWRDKTRARIKN
PALADKLAPMEPPHPFGVKRPSLEQWYYEAFNQDNVSLVDVREMPIVEIVPEGVLTSDGLVELDMLVLATGFDAVTGGLT
QIDIHGTGGITLKEKWTEGARTYLGFATSGFPNMLFLYGPQSPSGFCNGPTCAEMQGEWVVDCLKHMRENNKGRIEATAQ
AEEEWAQLLNSIAGMTLFPRADSWYMGANIPGKPRQLLNFPGVPIYMDQCNTAAAKDYEGFVLD
>Q9I3H5 1.14.13.-~~~~~~Baeyer-Villiger monooxygenase~~~
MYTPANNHNRSLAMSTQPTPAAARHCKVAIIGTGFSGLGMAIRLRQEGEDDFLIFEKDAGVGGTWRVNNYPGCACDVQSH
VYSFSFEANPEWTRMFARQPEIRAYLEKCWEKYRLQEKTLLNTEIGKLAWDERQSLWHLHDAQGNHYTANAVVSGMGGLS
TPAYPRLDGLENFQGKVFHSQQWDHDYDLKGKRVAVIGTGASAIQFVPEIQPLVAALDLYQRTPPWILPKPDRAISETER
RRFRRFPLVQKLWRGGLYSLLEGRVLGFTFAPQVMKLVQRLAIRHIHKQIKDPELRRKVTPDYTIGCKRILMSHNYYPAL
AAANSTVITEGIRAVTANGIVDGNGREREVDAIIFGTGFTANDPIPRGVVFGRDGRDLLDSWSKGPEAYKGTTTAGFPNL
FFLMGPNTGLGHNSMVYMIESQIAYVLDALKLMKRRELLSLEVKAPVQERYNEYLQRKLDRSVWSVGGCKSWYLHPVSGR
NCTLWPGFTWRFRALTRQFDASAYHLTTTPLAALSNEARQQAEGVPA
>A3U3H1 1.14.13.-~~~~~~Baeyer-Villiger monooxygenase~~~
MNIQTENTKTVGADFDAVVIGAGFGGLYAVHKLRNEQGLNVRGYDSASDVGGTWWWNRYPGALSDTESYVYRYSFDKELL
RKGRWKTRYLTQPEILEYMNEVADHLDLRRSYKFDTKVDGAHYNEKTGLWNVITDSGETVTAKYLVTGLGLLSATNVPKF
KGIDDFKGRILHTGAWPEGVDLSNKRVGIIGTGSTGVQVITATAPIAKHLTVFQRSAQYVVPIGNTPQDDATIAEQKANY
DNIWNQVKNSVVAFGFEESAEPAETASPEERERVFEAAWQRGGGFYFMFGTFCDIATSQVANDAAADFIKGKIKQIVKDP
KVAEKLTPKDLYAKRPLCGNNYYEVYNRDNVTLADVKADPIAEFTPNGIRLESGEEHELDIVIFATGFDAVDGNYVKMDL
RGRGGVTMRDTWKEGPLGYLGMMEVDFPNFFMILGPNGPFTNLPPSIETQVEWIADTICAMEEEGVQSVEPTVEARDAWV
GTCREIADMTLFPKAESWIFGANIPGKKNAVMFYMAGIGNYRNAISAVKEEGYTSLIRDRTAEKV
>Q88J44 1.14.13.-~~~~~~Baeyer-Villiger monooxygenase~~~COG2072
MSSHTALPVEPLDVLIMGAGVSGIGAAAYLRRNQPNKTFAILESRERMGGTWDLFRYPGIRSDSDLYTFGFDFKPWTKAK
SLADAADILEYLSEAIDEHQLAPFIQYQQKVISANWQSDKGLWSVRVEDGRTAQIRTVECRWLFSAGGYYRYDQGFSPRF
EGSEQFKGQIIHPQHWPEDLDYTGKRVVVIGSGATAVTLIPAMADKVASITMLQRTPSYIINQPANDGVAAFLRKVLPAQ
TAYSLTRYKNAKITLAFWGFCQRFPKLSKKLLLWLTRKELPKDYPVDVHFNPPYNPWDQRLCSVPEGDLFKAISAGNADI
VTDHIERFTEHGVLLKSGKMLKADIIVTATGLNVQLFGGITLHKDGKPVVLSETLAYKGMMLSGVPNFAFAVGYTNSSWT
LKVCLLCDHFCRLLGLMEREGYNVCEPKAPEGVETRPLLDFGAGYVQRALDSMPRQGPREPWVMSMDYFRDVKLLRRGAV
TDKCLKFTAVPNAPLHADVQLQQQGSRR
>P0DPI1 ~~~botA~~~Botulinum neurotoxin type A~~~
MPFVNKQFNYKDPVNGVDIAYIKIPNAGQMQPVKAFKIHNKIWVIPERDTFTNPEEGDLNPPPEAKQVPVSYYDSTYLST
DNEKDNYLKGVTKLFERIYSTDLGRMLLTSIVRGIPFWGGSTIDTELKVIDTNCINVIQPDGSYRSEELNLVIIGPSADI
IQFECKSFGHEVLNLTRNGYGSTQYIRFSPDFTFGFEESLEVDTNPLLGAGKFATDPAVTLAHELIHAGHRLYGIAINPN
RVFKVNTNAYYEMSGLEVSFEELRTFGGHDAKFIDSLQENEFRLYYYNKFKDIASTLNKAKSIVGTTASLQYMKNVFKEK
YLLSEDTSGKFSVDKLKFDKLYKMLTEIYTEDNFVKFFKVLNRKTYLNFDKAVFKINIVPKVNYTIYDGFNLRNTNLAAN
FNGQNTEINNMNFTKLKNFTGLFEFYKLLCVRGIITSKTKSLDKGYNKALNDLCIKVNNWDLFFSPSEDNFTNDLNKGEE
ITSDTNIEAAEENISLDLIQQYYLTFNFDNEPENISIENLSSDIIGQLELMPNIERFPNGKKYELDKYTMFHYLRAQEFE
HGKSRIALTNSVNEALLNPSRVYTFFSSDYVKKVNKATEAAMFLGWVEQLVYDFTDETSEVSTTDKIADITIIIPYIGPA
LNIGNMLYKDDFVGALIFSGAVILLEFIPEIAIPVLGTFALVSYIANKVLTVQTIDNALSKRNEKWDEVYKYIVTNWLAK
VNTQIDLIRKKMKEALENQAEATKAIINYQYNQYTEEEKNNINFNIDDLSSKLNESINKAMININKFLNQCSVSYLMNSM
IPYGVKRLEDFDASLKDALLKYIYDNRGTLIGQVDRLKDKVNNTLSTDIPFQLSKYVDNQRLLSTFTEYIKNIINTSILN
LRYESNHLIDLSRYASKINIGSKVNFDPIDKNQIQLFNLESSKIEVILKNAIVYNSMYENFSTSFWIRIPKYFNSISLNN
EYTIINCMENNSGWKVSLNYGEIIWTLQDTQEIKQRVVFKYSQMINISDYINRWIFVTITNNRLNNSKIYINGRLIDQKP
ISNLGNIHASNNIMFKLDGCRDTHRYIWIKYFNLFDKELNEKEIKDLYDNQSNSGILKDFWGDYLQYDKPYYMLNLYDPN
KYVDVNNVGIRGYMYLKGPRGSVMTTNIYLNSSLYRGTKFIIKKYASGNKDNIVRNNDRVYINVVVKNKEYRLATNASQA
GVEKILSALEIPDVGNLSQVVVMKSKNDQGITNKCKMNLQDNNGNDIGFIGFHQFNNIAKLVASNWYNRQIERSSRTLGC
SWEFIPVDDGWGERPL
>P0DPI0 ~~~botA~~~Botulinum neurotoxin type A~~~
MPFVNKQFNYKDPVNGVDIAYIKIPNVGQMQPVKAFKIHNKIWVIPERDTFTNPEEGDLNPPPEAKQVPVSYYDSTYLST
DNEKDNYLKGVTKLFERIYSTDLGRMLLTSIVRGIPFWGGSTIDTELKVIDTNCINVIQPDGSYRSEELNLVIIGPSADI
IQFECKSFGHEVLNLTRNGYGSTQYIRFSPDFTFGFEESLEVDTNPLLGAGKFATDPAVTLAHELIHAGHRLYGIAINPN
RVFKVNTNAYYEMSGLEVSFEELRTFGGHDAKFIDSLQENEFRLYYYNKFKDIASTLNKAKSIVGTTASLQYMKNVFKEK
YLLSEDTSGKFSVDKLKFDKLYKMLTEIYTEDNFVKFFKVLNRKTYLNFDKAVFKINIVPKVNYTIYDGFNLRNTNLAAN
FNGQNTEINNMNFTKLKNFTGLFEFYKLLCVRGIITSKTKSLDKGYNKALNDLCIKVNNWDLFFSPSEDNFTNDLNKGEE
ITSDTNIEAAEENISLDLIQQYYLTFNFDNEPENISIENLSSDIIGQLELMPNIERFPNGKKYELDKYTMFHYLRAQEFE
HGKSRIALTNSVNEALLNPSRVYTFFSSDYVKKVNKATEAAMFLGWVEQLVYDFTDETSEVSTTDKIADITIIIPYIGPA
LNIGNMLYKDDFVGALIFSGAVILLEFIPEIAIPVLGTFALVSYIANKVLTVQTIDNALSKRNEKWDEVYKYIVTNWLAK
VNTQIDLIRKKMKEALENQAEATKAIINYQYNQYTEEEKNNINFNIDDLSSKLNESINKAMININKFLNQCSVSYLMNSM
IPYGVKRLEDFDASLKDALLKYIYDNRGTLIGQVDRLKDKVNNTLSTDIPFQLSKYVDNQRLLSTFTEYIKNIINTSILN
LRYESNHLIDLSRYASKINIGSKVNFDPIDKNQIQLFNLESSKIEVILKNAIVYNSMYENFSTSFWIRIPKYFNSISLNN
EYTIINCMENNSGWKVSLNYGEIIWTLQDTQEIKQRVVFKYSQMINISDYINRWIFVTITNNRLNNSKIYINGRLIDQKP
ISNLGNIHASNNIMFKLDGCRDTHRYIWIKYFNLFDKELNEKEIKDLYDNQSNSGILKDFWGDYLQYDKPYYMLNLYDPN
KYVDVNNVGIRGYMYLKGPRGSVMTTNIYLNSSLYRGTKFIIKKYASGNKDNIVRNNDRVYINVVVKNKEYRLATNASQA
GVEKILSALEIPDVGNLSQVVVMKSKNDQGITNKCKMNLQDNNGNDIGFIGFHQFNNIAKLVASNWYNRQIERSSRTLGC
SWEFIPVDDGWGERPL
>Q45894 ~~~botA~~~Botulinum neurotoxin type A2~~~
MPFVNKQFNYKDPVNGVDIAYIKIPNAGQMQPVKAFKIHNKIWVIPERDTFTNPEEGDLNPPPEAKQVPVSYYDSTYLST
DNEKDNYLKGVTKLFERIYSTDLGRMLLTSIVRGIPFWGGSTIDTELKVIDTNCINVIQPDGSYRSEELNLVIIGPSADI
IQFECKSFGHDVLNLTRNGYGSTQYIRFSPDFTFGFEESLEVDTNPLLGAGKFATDPAVTLAHELIHAEHRLYGIAINPN
RVFKVNTNAYYEMSGLEVSFEELRTFGGHDAKFIDSLQENEFRLYYYNKFKDVASTLNKAKSIIGTTASLQYMKNVFKEK
YLLSEDTSGKFSVDKLKFDKLYKMLTEIYTEDNFVNFFKVINRKTYLNFDKAVFRINIVPDENYTIKDGFNLKGANLSTN
FNGQNTEINSRNFTRLKNFTGLFEFYKLLCVRGIIPFKTKSLDEGYNKALNDLCIKVNNWDLFFSPSEDNFTNDLDKVEE
ITADTNIEAAEENISLDLIQQYYLTFDFDNEPENISIENLSSDIIGQLEPMPNIERFPNGKKYELDKYTMFHYLRAQEFE
HGDSRIILTNSAEEALLKPNVAYTFFSSKYVKKINKAVEAFMFLNWAEELVYDFTDETNEVTTMDKIADITIIVPYIGPA
LNIGNMLSKGEFVEAIIFTGVVAMLEFIPEYALPVFGTFAIVSYIANKVLTVQTINNALSKRNEKWDEVYKYTVTNWLAK
VNTQIDLIREKMKKALENQAEATKAIINYQYNQYTEEEKNNINFNIDDLSSKLNESINSAMININKFLDQCSVSYLMNSM
IPYAVKRLKDFDASVRDVLLKYIYDNRGTLVLQVDRLKDEVNNTLSADIPFQLSKYVDNKKLLSTFTEYIKNIVNTSILS
IVYKKDDLIDLSRYGAKINIGDRVYYDSIDKNQIKLINLESSTIEVILKNAIVYNSMYENFSTSFWIKIPKYFSKINLNN
EYTIINCIENNSGWKVSLNYGEIIWTLQDNKQNIQRVVFKYSQMVNISDYINRWIFVTITNNRLTKSKIYINGRLIDQKP
ISNLGNIHASNKIMFKLDGCRDPRRYIMIKYFNLFDKELNEKEIKDLYDSQSNSGILKDFWGNYLQYDKPYYMLNLFDPN
KYVDVNNIGIRGYMYLKGPRGSVVTTNIYLNSTLYEGTKFIIKKYASGNEDNIVRNNDRVYINVVVKNKEYRLATNASQA
GVEKILSALEIPDVGNLSQVVVMKSKDDQGIRNKCKMNLQDNNGNDIGFIGFHLYDNIAKLVASNWYNRQVGKASRTFGC
SWEFIPVDDGWGESSL
>C1FUH8 ~~~ntnh~~~Non-toxic nonhemagglutinin type A~~~
MKINNNFNIDSLIDNRDVAIVRGRKTDTFFKVFQVAPNIWIAPERYYGESLNINEDQKSDGGIYDSNFLSTNDEKDEFLQ
ATVKILQRINNNVIGAKLLSLISTAIPFPYEYKPGDYRQTNYLVSKDNQHYYTANLVIFGPGTNIVENNAIYYKKEDSEN
GMGTMSEIWFQPFLTYKYGQFYVDPALELIKCLIKSLYYLYGIKPSDDLSIPYRLRSELNSFEYSELDMIDFLISGGTEY
KLLDTNPYWFTDNYFIDAPKNFEKYKNDYETKIKNNNDIANSIKLYLEQKFKTNAQDIWELNLSYFSTEFEIMMPEIFNN
ALNHYYRKEYYVIDYFKNYNINGFINGQIKTILPLSKYNKNIINKPELVVNLINENNTVLMKSNVYGDGLKGTMDNFYAA
YKIPYNIGDEYHINYSYLNNVNVEEINNIPPINDADIYPYRKNSDPFIPVYNITETKEINTTTPLSVNYLQAQVTNSNDI
SLSSDFSKVISSKDRSLVYSFLDNTIDYLDSIKYDEPIDTDKKYYLWLKEIFRNYSFDMTETQEVNTPCGINKVVPWLGK
ALNILNTGNSFIEEFKSLGPISLINKKENITMPKIEIDEIPNSMLNLSFKDLSENLFNRFSKNNSYFEKIYYDFLDQWWT
QYYSQYFDLICMAKKSILAQETLIKKIIQKKLSYLIGNSNISSDNLALMNLTTTNTLRDISNESQIAMNNVDSFLNSAAI
CVFEGNIYSKFISFMEQCINNINKNTREFIQKCTNITENEKLQLINQNIFSSLDFDFLNIENLKSLFSSETALLIKEETS
PYELVLYAFQEPDNNAIGDASAKNTSIEYSKDIDLVYGINSDALYLNGSNQSISFSNDFFENGLTNSFSIYFWLRNLGKD
TIKSKLIGSKEDNCGWEIYFQDTGLVFNMIDSNGNEKNIYLSDVSNNSWHYITISVDRLKEQLLIFIDDNLVANESIKEI
LNIYSSNIISLLSENNPSYIEGLTILNKPTTSQEVLNNYFKVLNNSYIRDSNEERLEYNKTYQLYNYVFSDKPICEVKQN
NNIYLTINNTNNLNLQPSKFKLLSINSNKQYVQKFDEVIISILGNMEKYIDISEDNRLQLIDNKNGAKKMIISNDMFISN
CLTLSCGGKYICLSMKDENHNWMICNNDMSKYLYLWSFK
>Q45914 ~~~ant~~~Non-toxic nonhemagglutinin type A~~~
MNINDNLSINSPVDNKNVVVVRARKTDTVFKAFKVAPNIWVAPERYYGESLSIDEEYKVDGGIYDSNFLSQDSEKDKFLQ
AIITLLKRINSTNAGEKLLSLISTAIPFPYGYIGGGYYAPNMITFGSAPKSNKKLNSLISSTIPFPYAGYRETNYLSSED
NKSFYASNIVIFGPGANIVENNTVFYKKEDAENGMGTMTEIWFQPFLTYKYDEFYIDPAIELIKCLIKSLYFLYGIKPSD
DLVIPYRLRSELENIEYSQLNIVDLLVSGGIDPKFINTDPYWFTDNYFSNAKKVFEDHRNIYETEIEGNNAIGNDIKLRL
KQKFRININDIWELNLNYFSKEFSIMMPDRFNNALKHFYRKQYYKIDYPENYSINGFVNGQINAQLSLSDRNQDIINKPE
EIINLLNGNNVSLMRSNIYGDGLKSTVDDFYSNYKIPYNRAYEYHFNNSNDSSLDNVNIGVIDNIPEIIDVNPYKENCDK
FSPVQKITSTREINTNIPWPINYLQAQNTNNEKFSLSSDFVEVVSSKDKSLVYSFLSNVMFYLDSIKDNSPIDTDKKYYL
WLREIFRNYSFDITATQEINTNCGINKVVTWFGKALNILNTSDSFVEEFQNLGAISLINKKENLSMPIIESYEIPNDMLG
LPLNDLNEKLFNIYSKNTAYFKKIYYNFLDQWWTQYYSQYFDLICMAKRSVLAQETLIKRIIQKKLSYLIGNSNISSDNL
ALMNLTTTNTLRDISNESQIAMNNVDSFLNNAAICVFESNIYPKFISFMEQCINNINIKTKEFIQKCTNINEDEKLQLIN
QNVFNSLDFEFLNIQNMKSLFSSETALLIKEETWPYELVLYAFKEPGNNVIGDASGKNTSIEYSKDIGLVYGINSDALYL
NGSNQSISFSNDFFENGLTNSFSIYFWLRNLGKDTIKSKLIGSKEDNCGWEIYFQDTGLVFNMIDSNGNEKNIYLSDVSN
NSWHYITISVDRLKEQLLIFIDDNLVANESIKEILNIYSSNIISLLSENNPSYIEGLTILNKPTTSQEVLSNYFEVLNNS
YIRDSNEERLEYNKTYQLYNYVFSDKPICEVKQNNNIYLTINNTNNLNLQASKFKLLSINPNKQYVQKLDEVIISVLDNM
EKYIDISEDNRLQLIDNKNNAKKMIISNDIFISNCLTLSYNGKYICLSMKDENHNWMICNNDMSKYLYLWSFK
>B1INP5 ~~~botB~~~Botulinum neurotoxin type B~~~
MPVTINNFNYNDPIDNNNIIMMEPPFARGTGRYYKAFKITDRIWIIPERYTFGYKPEDFNKSSGIFNRDVCEYYDPDYLN
TNDKKNIFLQTMIKLFNRIKSKPLGEKLLEMIINGIPYLGDRRVPLEEFNTNIASVTVNKLISNPGEVERKKGIFANLII
FGPGPVLNENETIDIGIQNHFASREGFGGIMQMKFCPEYVSVFNNVQENKGASIFNRRGYFSDPALILMHELIHVLHGLY
GIKVDDLPIVPNEKKFFMQSTDAIQAEELYTFGGQDPSIITPSTDKSIYDKVLQNFRGIVDRLNKVLVCISDPNININIY
KNKFKDKYKFVEDSEGKYSIDVESFDKLYKSLMFGFTETNIAENYKIKTRASYFSDSLPPVKIKNLLDNEIYTIEEGFNI
SDKDMEKEYRGQNKAINKQAYEEISKEHLAVYKIQMCKSVKAPGICIDVDNEDLFFIADKNSFSDDLSKNERIEYNTQSN
YIENDFPINELILDTDLISKIELPSENTESLTDFNVDVPVYEKQPAIKKIFTDENTIFQYLYSQTFPLDIRDISLTSSFD
DALLFSNKVYSFFSMDYIKTANKVVEAGLFAGWVKQIVNDFVIEANKSNTMDKIADISLIVPYIGLALNVGNETAKGNFE
NAFEIAGASILLEFIPELLIPVVGAFLLESYIDNKNKIIKTIDNALTKRNEKWSDMYGLIVAQWLSTVNTQFYTIKEGMY
KALNYQAQALEEIIKYRYNIYSEKEKSNINIDFNDINSKLNEGINQAIDNINNFINGCSVSYLMKKMIPLAVEKLLDFDN
TLKKNLLNYIDENKLYLIGSAEYEKSKVNKYLKTIMPFDLSIYTNDTILIEMFNKYNSEILNNIILNLRYKDNNLIDLSG
YGAKVEVYDGVELNDKNQFKLTSSANSKIRVTQNQNIIFNSVFLDFSVSFWIRIPKYKNDGIQNYIHNEYTIINCMKNNS
GWKISIRGNRIIWTLIDINGKTKSVFFEYNIREDISEYINRWFFVTITNNLNNAKIYINGKLESNTDIKDIREVIANGEI
IFKLDGDIDRTQFIWMKYFSIFNTELSQSNIEERYKIQSYSEYLKDFWGNPLMYNKEYYMFNAGNKNSYIKLKKDSPVGE
ILTRSKYNQNSKYINYRDLYIGEKFIIRRKSNSQSINDDIVRKEDYIYLDFFNLNQEWRVYTYKYFKKEEEKLFLAPISD
SDEFYNTIQIKEYDEQPTYSCQLLFKKDEESTDEIGLIGIHRFYESGIVFEEYKDYFCISKWYLKEVKRKPYNLKLGCNW
QFIPKDEGWTE
>P10844 ~~~botB~~~Botulinum neurotoxin type B~~~
MPVTINNFNYNDPIDNNNIIMMEPPFARGTGRYYKAFKITDRIWIIPERYTFGYKPEDFNKSSGIFNRDVCEYYDPDYLN
TNDKKNIFLQTMIKLFNRIKSKPLGEKLLEMIINGIPYLGDRRVPLEEFNTNIASVTVNKLISNPGEVERKKGIFANLII
FGPGPVLNENETIDIGIQNHFASREGFGGIMQMKFCPEYVSVFNNVQENKGASIFNRRGYFSDPALILMHELIHVLHGLY
GIKVDDLPIVPNEKKFFMQSTDAIQAEELYTFGGQDPSIITPSTDKSIYDKVLQNFRGIVDRLNKVLVCISDPNININIY
KNKFKDKYKFVEDSEGKYSIDVESFDKLYKSLMFGFTETNIAENYKIKTRASYFSDSLPPVKIKNLLDNEIYTIEEGFNI
SDKDMEKEYRGQNKAINKQAYEEISKEHLAVYKIQMCKSVKAPGICIDVDNEDLFFIADKNSFSDDLSKNERIEYNTQSN
YIENDFPINELILDTDLISKIELPSENTESLTDFNVDVPVYEKQPAIKKIFTDENTIFQYLYSQTFPLDIRDISLTSSFD
DALLFSNKVYSFFSMDYIKTANKVVEAGLFAGWVKQIVNDFVIEANKSNTMDKIADISLIVPYIGLALNVGNETAKGNFE
NAFEIAGASILLEFIPELLIPVVGAFLLESYIDNKNKIIKTIDNALTKRNEKWSDMYGLIVAQWLSTVNTQFYTIKEGMY
KALNYQAQALEEIIKYRYNIYSEKEKSNINIDFNDINSKLNEGINQAIDNINNFINGCSVSYLMKKMIPLAVEKLLDFDN
TLKKNLLNYIDENKLYLIGSAEYEKSKVNKYLKTIMPFDLSIYTNDTILIEMFNKYNSEILNNIILNLRYKDNNLIDLSG
YGAKVEVYDGVELNDKNQFKLTSSANSKIRVTQNQNIIFNSVFLDFSVSFWIRIPKYKNDGIQNYIHNEYTIINCMKNNS
GWKISIRGNRIIWTLIDINGKTKSVFFEYNIREDISEYINRWFFVTITNNLNNAKIYINGKLESNTDIKDIREVIANGEI
IFKLDGDIDRTQFIWMKYFSIFNTELSQSNIEERYKIQSYSEYLKDFWGNPLMYNKEYYMFNAGNKNSYIKLKKDSPVGE
ILTRSKYNQNSKYINYRDLYIGEKFIIRRKSNSQSINDDIVRKEDYIYLDFFNLNQEWRVYTYKYFKKEEEKLFLAPISD
SDEFYNTIQIKEYDEQPTYSCQLLFKKDEESTDEIGLIGIHRFYESGIVFEEYKDYFCISKWYLKEVKRKPYNLKLGCNW
QFIPKDEGWTE
>Q00496 ~~~botE~~~Botulinum neurotoxin type E~~~
MPKINSFNYNDPVNDRTILYIKPGGCQEFYKSFNIMKNIWIIPERNVIGTTPQDFHPPTSLKNGDSSYYDPNYLQSDEEK
DRFLKIVTKIFNRINNNLSGGILLEELSKANPYLGNDNTPDNQFHIGDASAVEIKFSNGSQDILLPNVIIMGAEPDLFET
NSSNISLRNNYMPSNHRFGSIAIVTFSPEYSFRFNDNCMNEFIQDPALTLMHELIHSLHGLYGAKGITTKYTITQKQNPL
ITNIRGTNIEEFLTFGGTDLNIITSAQSNDIYTNLLADYKKIASKLSKVQVSNPLLNPYKDVFEAKYGLDKDASGIYSVN
INKFNDIFKKLYSFTEFDLRTKFQVKCRQTYIGQYKYFKLSNLLNDSIYNISEGYNINNLKVNFRGQNANLNPRIITPIT
GRGLVKKIIRFCKNIVSVKGIRKSICIEINNGELFFVASENSYNDDNINTPKEIDDTVTSNNNYENDLDQVILNFNSESA
PGLSDEKLNLTIQNDAYIPKYDSNGTSDIEQHDVNELNVFFYLDAQKVPEGENNVNLTSSIDTALLEQPKIYTFFSSEFI
NNVNKPVQAALFVSWIQQVLVDFTTEANQKSTVDKIADISIVVPYIGLALNIGNEAQKGNFKDALELLGAGILLEFEPEL
LIPTILVFTIKSFLGSSDNKNKVIKAINNALKERDEKWKEVYSFIVSNWMTKINTQFNKRKEQMYQALQNQVNAIKTIIE
SKYNSYTLEEKNELTNKYDIKQIENELNQKVSIAMNNIDRFLTESSISYLMKIINEVKINKLREYDENVKTYLLNYIIQH
GSILGESQQELNSMVTDTLNNSIPFKLSSYTDDKILISYFNKFFKRIKSSSVLNMRYKNDKYVDTSGYDSNININGDVYK
YPTNKNQFGIYNDKLSEVNISQNDYIIYDNKYKNFSISFWVRIPNYDNKIVNVNNEYTIINCMRDNNSGWKVSLNHNEII
WTFEDNRGINQKLAFNYGNANGISDYINKWIFVTITNDRLGDSKLYINGNLIDQKSILNLGNIHVSDNILFKIVNCSYTR
YIGIRYFNIFDKELDETEIQTLYSNEPNTNILKDFWGNYLLYDKEYYLLNVLKPNNFIDRRKDSTLSINNIRSTILLANR
LYSGIKVKIQRVNNSSTNDNLVRKNDQVYINFVASKTHLFPLYADTATTNKEKTIKISSSGNRFNQVVVMNSVGNCTMNF
KNNNGNNIGLLGFKADTVVASTWYYTHMRDHTNSNGCFWNFISEEHGWQEK
>P30995 ~~~~~~Botulinum neurotoxin type E~~~
MPTINSFNYNDPVNNRTILYIKPGGCQQFYKSFNIMKNIWIIPERNVIGTIPQDFLPPTSLKNGDSSYYDPNYLQSDQEK
DKFLKIVTKIFNRINDNLSGRILLEELSKANPYLGNDNTPDGDFIINDASAVPIQFSNGSQSILLPNVIIMGAEPDLFET
NSSNISLRNNYMPSNHGFGSIAIVTFSPEYSFRFKDNSMNEFIQDPALTLMHELIHSLHGLYGAKGITTKYTITQKQNPL
ITNIRGTNIEEFLTFGGTDLNIITSAQSNDIYTNLLADYKKIASKLSKVQVSNPLLNPYKDVFEAKYGLDKDASGIYSVN
INKFNDIFKKLYSFTEFDLATKFQVKCRQTYIGQYKYFKLSNLLNDSIYNISEGYNINNLKVNFRGQNANLNPRIITPIT
GRGLVKKIIRFCKNIVSVKGIRKSICIEINNGELFFVASENSYNDDNINTPKEIDDTVTSNNNYENDLDQVILNFNSESA
PGLSDEKLNLTIQNDAYIPKYDSNGTSDIEQHDVNELNVFFYLDAQKVPEGENNVNLTSSIDTALLEQPKIYTFFSSEFI
NNVNKPVQAALFVGWIQQVLVDFTTEANQKSTVDKIADISIVVPYIGLALNIGNEAQKGNFKDALELLGAGILLEFEPEL
LIPTILVFTIKSFLGSSDNKNKVIKAINNALKERDEKWKEVYSFIVSNWMTKINTQFNKRKEQMYQALQNQVNALKAIIE
SKYNSYTLEEKNELTNKYDIEQIENELNQKVSIAMNNIDRFLTESSISYLMKLINEVKINKLREYDENVKTYLLDYIIKH
GSILGESQQELNSMVIDTLNNSIPFKLSSYTDDKILISYFNKFFKRIKSSSVLNMRYKNDKYVDTSGYDSNININGDVYK
YPTNKNQFGIYNDKLSEVNISQNDYIIYDNKYKNFSISFWVRIPNYDNKIVNVNNEYTIINCMRDNNSGWKVSLNHNEII
WTLQDNSGINQKLAFNYGNANGISDYINKWIFVTITNDRLGDSKLYINGNLIDKKSILNLGNIHVSDNILFKIVNCSYTR
YIGIRYFNIFDKELDETEIQTLYNNEPNANILKDFWGNYLLYDKEYYLLNVLKPNNFINRRTDSTLSINNIRSTILLANR
LYSGIKVKIQRVNNSSTNDNLVRKNDQVYINFVASKTHLLPLYADTATTNKEKTIKISSSGNRFNQVVVMNSVGNCTMNF
KNNNGNNIGLLGFKADTVVASTWYYTHMRDNTNSNGFFWNFISEEHGWQEK
>A7GBG3 ~~~F~~~Botulinum neurotoxin type F~~~
MPVVINSFNYNDPVNDDTILYMQIPYEEKSKKYYKAFEIMRNVWIIPERNTIGTDPSDFDPPASLENGSSAYYDPNYLTT
DAEKDRYLKTTIKLFKRINSNPAGEVLLQEISYAKPYLGNEHTPINEFHPVTRTTSVNIKSSTNVKSSIILNLLVLGAGP
DIFENSSYPVRKLMDSGGVYDPSNDGFGSINIVTFSPEYEYTFNDISGGYNSSTESFIADPAISLAHELIHALHGLYGAR
GVTYKETIKVKQAPLMIAEKPIRLEEFLTFGGQDLNIITSAMKEKIYNNLLANYEKIATRLSRVNSAPPEYDINEYKDYF
QWKYGLDKNADGSYTVNENKFNEIYKKLYSFTEIDLANKFKVKCRNTYFIKYGFLKVPNLLDDDIYTVSEGFNIGNLAVN
NRGQNIKLNPKIIDSIPDKGLVEKIVKFCKSVIPRKGTKAPPRLCIRVNNRELFFVASESSYNENDINTPKEIDDTTNLN
NNYRNNLDEVILDYNSETIPQISNQTLNTLVQDDSYVPRYDSNGTSEIEEHNVVDLNVFFYLHAQKVPEGETNISLTSSI
DTALSEESQVYTFFSSEFINTINKPVHAALFISWINQVIRDFTTEATQKSTFDKIADISLVVPYVGLALNIGNEVQKENF
KEAFELLGAGILLEFVPELLIPTILVFTIKSFIGSSENKNKIIKAINNSLMERETKWKEIYSWIVSNWLTRINTQFNKRK
EQMYQALQNQVDAIKTVIEYKYNNYTSDERNRLESEYNINNIREELNKKVSLAMENIERFITESSIFYLMKLINEAKVSK
LREYDEGVKEYLLDYISEHRSILGNSVQELNDLVTSTLNNSIPFELSSYTNDKILILYFNKLYKKIKDNSILDMRYENNK
FIDISGYGSNISINGDVYIYSTNRNQFGIYSSKPSEVNIAQNNDIIYNGRYQNFSISFWVRIPKYFNKVNLNNEYTIIDC
IRNNNSGWKISLNYNKIIWTLQDTAGNNQKLVFNYTQMISISDYINKWIFVTITNNRLGNSRIYINGNLIDEKSISNLGD
IHVSDNILFKIVGCNDTRYVGIRYFKVFDTELGKTEIETLYSDEPDPSILKDFWGNYLLYNKRYYLLNLLRTDKSITQNS
NFLNINQQRGVYQKPNIFSNTRLYTGVEVIIRKNGSTDISNTDNFVRKNDLAYINVVDRDVEYRLYADISIAKPEKIIKL
IRTSNSNNSLGQIIVMDSIGNNCTMNFQNNNGGNIGLLGFHSNNLVASSWYYNNIRKNTSSNGCFWSFISKEHGWQEN
>P30996 ~~~botF~~~Botulinum neurotoxin type F~~~
MPVAINSFNYNDPVNDDTILYMQIPYEEKSKKYYKAFEIMRNVWIIPERNTIGTNPSDFDPPASLKNGSSAYYDPNYLTT
DAEKDRYLKTTIKLFKRINSNPAGKVLLQEISYAKPYLGNDHTPIDEFSPVTRTTSVNIKLSTNVESSMLLNLLVLGAGP
DIFESCCYPVRKLIDPDVVYDPSNYGFGSINIVTFSPEYEYTFNDISGGHNSSTESFIADPAISLAHELIHALHGLYGAR
GVTYEETIEVKQAPLMIAEKPIRLEEFLTFGGQDLNIITSAMKEKIYNNLLANYEKIATRLSEVNSAPPEYDINEYKDYF
QWKYGLDKNADGSYTVNENKFNEIYKKLYSFTESDLANKFKVKCRNTYFIKYEFLKVPNLLDDDIYTVSEGFNIGNLAVN
NRGQSIKLNPKIIDSIPDKGLVEKIVKFCKSVIPRKGTKAPPRLCIRVNNSELFFVASESSYNENDINTPKEIDDTTNLN
NNYRNNLDEVILDYNSQTIPQISNRTLNTLVQDNSYVPRYDSNGTSEIEEYDVVDFNVFFYLHAQKVPEGETNISLTSSI
DTALLEESKDIFFSSEFIDTINKPVNAALFIDWISKVIRDFTTEATQKSTVDKIADISLIVPYVGLALNIIIEAEKGNFE
EAFELLGVGILLEFVPELTIPVILVFTIKSYIDSYENKNKAIKAINNSLIEREAKWKEIYSWIVSNWLTRINTQFNKRKE
QMYQALQNQVDAIKTAIEYKYNNYTSDEKNRLESEYNINNIEEELNKKVSLAMKNIERFMTESSISYLMKLINEAKVGKL
KKYDNHVKSDLLNYILDHRSILGEQTNELSDLVTSTLNSSIPFELSSYTNDKILIIYFNRLYKKIKDSSILDMRYENNKF
IDISGYGSNISINGNVYIYSTNRNQFGIYNSRLSEVNIAQNNDIIYNSRYQNFSISFWVRIPKHYKPMNHNREYTIINCM
GNNNSGWKISLRTVRDCEIIWTLQDTSGNKENLIFRYEELNRISNYINKWIFVTITNNRLGNSRIYINGNLIVEKSISNL
GDIHVSDNILFKIVGCDDETYVGIRYFKVFNTELDKTEIETLYSNEPDPSILKNYWGNYLLYNKKYYLFNLLRKDKYITL
NSGILNINQQRGVTEGSVFLNYKLYEGVEVIIRKNGPIDISNTDNFVRKNDLAYINVVDRGVEYRLYADTKSEKEKIIRT
SNLNDSLGQIIVMDSIGNNCTMNFQNNNGSNIGLLGFHSNNLVASSWYYNNIRRNTSSNGCFWSSISKENGWKE
>Q60393 ~~~botG~~~Botulinum neurotoxin type G~~~
MPVNIKXFNYNDPINNDDIIMMEPFNDPGPGTYYKAFRIIDRIWIVPERFTYGFQPDQFNASTGVFSKDVYEYYDPTYLK
TDAEKDKFLKTMIKLFNRINSKPSGQRLLDMIVDAIPYLGNASTPPDKFAANVANVSINKKIIQPGAEDQIKGLMTNLII
FGPGPVLSDNFTDSMIMNGHSPISEGFGARMMIRFCPSCLNVFNNVQENKDTSIFSRRAYFADPALTLMHELIHVLHGLY
GIKISNLPITPNTKEFFMQHSDPVQAEELYTFGGHDPSVISPSTDMNIYNKALQNFQDIANRLNIVSSAQGSGIDISLYK
QIYKNKYDFVEDPNGKYSVDKDKFDKLYKALMFGFTETNLAGEYGIKTRYSYFSEYLPPIKTEKLLDNTIYTQNEGFNIA
SKNLKTEFNGQNKAVNKEAYEEISLEHLVIYRIAMCKPVMYKNTGKSEQCIIVNNEDLFFIANKDSFSKDLAKAETIAYN
TQNNTIENNFSIDQLILDNDLSSGIDLPNENTEPFTNFDDIDIPVYIKQSALKKIFVDGDSLFEYLHAQTFPSNIENLQL
TNSLNDALRNNNKVYTFFSTNLVEKANTVVGASLFVNWVKGVIDDFTSESTQKSTIDKVSDVSIIIPYIGPALNVGNETA
KENFKNAFEIGGAAILMEFIPELIVPIVGFFTLESYVGNKGHIIMTISNALKKRDQKWTDMYGLIVSQWLSTVNTQFYTI
KERMYNALNNQSQAIEKIIEDQYNRYSEEDKMNINIDFNDIDFKLNQSINLAINNIDDFINQCSISYLMNRMIPLAVKKL
KDFDDNLKRDLLEYIDTNELYLLDEVNILKSKVNRHLKDSIPFDLSLYTKDTILIQVFNNYISNISSNAILSLSYRGGRL
IDSSGYGATMNVGSDVIFNDIGNGQFKLNNSENSNITAHQSKFVVYDSMFDNFSINFWVRTPKYNNNDIQTYLQNEYTII
SCIKNDSGWKVSIKGNRIIWTLIDVNAKSKSIFFEYSIKDNISDYINKWFSITITNDRLGNANIYINGSLKKSEKILNLD
RINSSNDIDFKLINCTDTTKFVWIKDFNIFGRELNATEVSSLYWIQSSTNTLKDFWGNPLRYDTQYYLFNQGMQNIYIKY
FSKASMGETAPRTNFNNAAINYQNLYLGLRFIIKKASNSRNINNDNIVREGDYIYLNIDNISDESYRVYVLVNSKEIQTQ
LFLAPINDDPTFYDVLQIKKYYEKTTYNCQILCEKDTKTFGLFGIGKFVKDYGYVWDTYDNYFCISQWYLRRISENINKL
RLGCNWQFIPVDEGWTE
>A0A069CUU9 3.4.24.69~~~~~~Putative botulinum-like toxin Wo~~~
MDVLEMFDVNYESPILESFDSTTQSLNDVHVFMSRIQMSAYDADGEGRIEYRNLKLYEISSGIFISTDRLDTGASGVEDD
HEMVDYYSSARLTREFLGESLDSQKSDYFEGIKKVFSFYKNKCNESRYIKEFFEEIQFRNICGFPKQAGTSSTDIFDQFN
SVDVLLQDPVTSVWNKKVGSKKANIVIIPPATNLPITEACATAGFQPEGFPKLGSGSFFTVQFDPFFSTRFKAHETDDVA
LLDPTLTLLHEMTHGLHFQKGIANPVNRSGETPAWATTWGRVTGDNDAFKETPMEELLTFNKHTIDDDIEISDHLKSTYI
GFLYNGRNEDDPTESVDGVYQNVSSFLNQYRGFEISSDFQHFIESCYGVKYNQESKKFIVNPRNIKRYVQDGFFIDEAKF
ARILNIKTRSYYTLMPDNLGVWSYRVDILNRLRETFDEDRGLLSQELDFHTALTPVVSENPALELEVAGMQRMVSLPKIK
ASYLPSDIKIKNFTGQKISHDTILDTNISGIIISKIKYKSDFVVDESMPRSSLNTTNYNLSPIKGTKFETDIRDKTSVKV
TVSEITAPMINHVMKLDNSKVLTERPSLNEDLEETFKNTKDVYIPKTTAMMKLKEGADQTLGAVGFAVWSGQILEDLYNL
AQKKEVSIDQIKDDLMSILPFYCAYKNLSAEKYEQAFANATLDAFLIFATDGGGFAGLGITVGAIAINSMYAKAETMEAY
DSMFGKYVDQYQNDIKNFTLNAYVQWENNILSRLWNESRLAITGFRNMLKTVKTVMEFDATNQAYSEEDRKIIKAKCEEI
FSEFPMLMQTFAKNSMTANLENASKIFNDIVWQKIKEELDQYVIDSKKYFLDSLEEAYNNGSISAESYYKYQTEAREKFV
SPREVIDLYIAAHDTVVKRKRYIRRYSRKYDLATDFKGNTVHLNGLGEGTQDIQDLYGNYSVYADKKTVSTQEGHFDQTI
KIAKDTNTINKVVLAVSSNNGKEYALNKDEQYTISFWLRMPVPSSSEERRIFSYSAVSGVNKEVEELILQVKNNEFVLAT
ANLLRNSEFVIEPRIALNRWVKITIVNENTRIKVYQNDNLLGLIKDSSRKKPIAQRGTFKFYNYNVDYQLDDISYYNGTI
SQRDIKYTFKEDHGQFVYDHWGERLQYNKAYYLLSDDNKSAFETVYETKRLKLKSVPGVDIKYLGMNDRVYGYYGGLQFK
LVPLDSKNMNNYVRWGDKFTMQSIETTNLSLAIIQDNAYFAPTQLKLISNEGKSEEEIFTFDRNIKLQNAAILVGTGNSK
QGPISAYKRGYSGDLWINGARLDGYVTVVNKSNYSNDEIQEKFKWIFVPKDANWVE
>P0DPK1 ~~~~~~Botulinum neurotoxin type X~~~
MKLEINKFNYNDPIDGINVITMRPPRHSDKINKGKGPFKAFQVIKNIWIVPERYNFTNNTNDLNIPSEPIMEADAIYNPN
YLNTPSEKDEFLQGVIKVLERIKSKPEGEKLLELISSSIPLPLVSNGALTLSDNETIAYQENNNIVSNLQANLVIYGPGP
DIANNATYGLYSTPISNGEGTLSEVSFSPFYLKPFDESYGNYRSLVNIVNKFVKREFAPDPASTLMHELVHVTHNLYGIS
NRNFYYNFDTGKIETSRQQNSLIFEELLTFGGIDSKAISSLIIKKIIETAKNNYTTLISERLNTVTVENDLLKYIKNKIP
VQGRLGNFKLDTAEFEKKLNTILFVLNESNLAQRFSILVRKHYLKERPIDPIYVNILDDNSYSTLEGFNISSQGSNDFQG
QLLESSYFEKIESNALRAFIKICPRNGLLYNAIYRNSKNYLNNIDLEDKKTTSKTNVSYPCSLLNGCIEVENKDLFLISN
KDSLNDINLSEEKIKPETTVFFKDKLPPQDITLSNYDFTEANSIPSISQQNILERNEELYEPIRNSLFEIKTIYVDKLTT
FHFLEAQNIDESIDSSKIRVELTDSVDEALSNPNKVYSPFKNMSNTINSIETGITSTYIFYQWLRSIVKDFSDETGKIDV
IDKSSDTLAIVPYIGPLLNIGNDIRHGDFVGAIELAGITALLEYVPEFTIPILVGLEVIGGELAREQVEAIVNNALDKRD
QKWAEVYNITKAQWWGTIHLQINTRLAHTYKALSRQANAIKMNMEFQLANYKGNIDDKAKIKNAISETEILLNKSVEQAM
KNTEKFMIKLSNSYLTKEMIPKVQDNLKNFDLETKKTLDKFIKEKEDILGTNLSSSLRRKVSIRLNKNIAFDINDIPFSE
FDDLINQYKNEIEDYEVLNLGAEDGKIKDLSGTTSDINIGSDIELADGRENKAIKIKGSENSTIKIAMNKYLRFSATDNF
SISFWIKHPKPTNLLNNGIEYTLVENFNQRGWKISIQDSKLIWYLRDHNNSIKIVTPDYIAFNGWNLITITNNRSKGSIV
YVNGSKIEEKDISSIWNTEVDDPIIFRLKNNRDTQAFTLLDQFSIYRKELNQNEVVKLYNYYFNSNYIRDIWGNPLQYNK
KYYLQTQDKPGKGLIREYWSSFGYDYVILSDSKTITFPNNIRYGALYNGSKVLIKNSKKLDGLVRNKDFIQLEIDGYNMG
ISADRFNEDTNYIGTTYGTTHDLTTDFEIIQRQEKYRNYCQLKTPYNIFHKSGLMSTETSKPTFHDYRDWVYSSAWYFQN
YENLNLRKHTKTNWYFIPKDEGWDED
>A0A0K1TPY7 4.1.99.23~~~bzaA~~~5-hydroxybenzimidazole synthase BzaA~~~
MTLLEKAKCGEITAEMQYVAEKEGVRPEFICEGVANGDIVILYSSRENIHPVAVGKGLLTKVSASVGMYEEADTVDGEMA
KIDAAVKAHADTIMDLSVRGPIEEMREKVLSTVDRPVGTLPMYETLSVAEAKYGTALDMTPDDMFDMIEKQASQGVAFIA
VHPGTTLSVIHRAKDEGRIDPLVSYGGSHLIGWMLYNNTENPLYTEFDRLIEICKKYDVVLSFADGMRPGCIADSLDHAQ
VEELVILGGLVRRAREAGVQVMVKGPGHVPLDEIATTVQLEKKLCYGAPYFVFGCLPTDAAAGYDHITSAIGGAVAAYAG
ADFLCYVTPAEHIGMPNVDDVYQGVMASRIAAHAGDVAKGHPQAVKWDLDMSVARRAMNWKEQFKLSIDPETAERVWRER
STSFTSECTMCGKYCAMKIVEKYLRAE
>A0A0K1TQ05 4.1.99.23~~~bzaB~~~5-hydroxybenzimidazole synthase BzaB~~~
MTQMLEARKGHITEEMEKVALIEGVTPEFVREGVAKGHIVIPKNKFRSRDKICGIGGGLDVKVNGLMGTSSDRNDMEMEA
KKLRILEECGANAFMDLSTGDDIDAMRKQSLTISNIAAGCVPVYQASVEAIEKHGSMVGMTEDELFDTVEKQCQEGMDFM
AIHSALNWSVLNALKKSGRVTDVVSRGGSFLTAWMFHNKKENPLYEHFDRLLEILKATDTVLSIGDAIRPGANADSLDSA
QVQGLIVAGELTKRALEAGVQVMIEGPGHVPLNQIATTMQLQKQLCYGVPYYILGFLATDVAPGYDNITGAIGGAFAGMH
GADFLCYLTPAEHLGLPNEDDVRMGVRTTKIAADAANVLKRGGNAWNRSLAMSKARVARDEKVQVANALDPEYLESKLKA
EPESHGCAACGKSKCPADVAAEFFGIA
>Q1JZW3 4.1.99.23~~~bzaF~~~5-hydroxybenzimidazole synthase~~~
MKTQVEHAVDGIITEQMATVAHDEDLSPEYIRTMVAEGKIVIPNNSNSTPKPVGIGKGLRTKVNASIGTSSDIVNYQAEV
RKARIAEQAGADTLMELSVGGNLDRVRREVLAAVNLPVGNVPLYQAFCDATRKYGSADKLDPEELFDLIEQQCEDGLAFM
AIHCGINRYTIERLRKQHYRYGGLVSKGGTSMVSWMEHNNRENPLYEQFDRVVAILKKYDVCLSLGNGLRAGAIHDSHDR
AQMQELIINCELAQLGREMGCQMLVEGPGHMPLDEVEANILIQKRMSNEAPYYMLGPISTDVVPGFDHISSAIGAAQSAR
YGADLICYITPAEHLALPNEDDVRSGVEAARVATYIGDMNKYPDKGRQRDKAMSKARRDLQWDKQFELALMPEQARQVRD
SRLPEEEHSCTMCGNFCAANGSKTLFDGDLQGDKC
>P61425 4.1.99.23~~~bzaF~~~5-hydroxybenzimidazole synthase~~~COG0422
MKTQIEQAREGIITPQMAAVAAEEHVSPEYVCRMVAEGKVVIPWNHVRAPKAVGIGKGLRTKVNASIGTSSDIVDYEAEV
RKARAAQESGADTLMELSVGGDLDRVRREVIAAVDLPVGNVPLYQAFCEAARKYGDPNRLDPEMLFDLIERQCADGMAFM
AVHCGINLYTIERLRRQGYRYGGLVSKGGVSMVGWMMANGRENPLYEQFDRVVGILKKYDTVLSLGNGLRAGAIHDSSDR
AQIQELLINCELAEMGREMGCQMLVEGPGHVPLDEVEGNIQLQKRMSGGAPYYMLGPISTDVAPGFDHITAAIGAAQSSR
FGADLICYITPAEHLALPNEEDVRQGVKAARVAAYIGDMNKYPEKGRERDREMSKARRDLDWQRQFELALYPEDARAIRA
SRTPEDEATCTMCGDFCASRGAGRLFAGDLRGDKV
>P51853 4.1.2.38~~~bznB~~~Benzaldehyde lyase~~~
MAMITGGELVVRTLIKAGVEHLFGLHGAHIDTIFQACLDHDVPIIDTRHEAAAGHAAEGYARAGAKLGVAGHGGRGIYQC
GHAHCQRLAGSQGRCIPHPGSGALRDDETNTLQAGIDQVAMAAPITKWAHRVMATEHIPRLVMQAIRAALSAPRGPVLLD
LPWDILMNQIDEDSVIIPDLVLSAHGARPDPADLDQALALLRKAERPVIVLGSEASRTARKTALSAFVAATGVPVFADYE
GLSMLSGLPDAMRGGLVQNLYSFAKADAAPDLVLMLGARFGLNTGHGSGQLIPHSAQVIQVDPDACELGRLQGIALGIVA
DVGGTIEALAQATAQDAAWPDRGDWCAKVTDLAQERYASIAAKSSSEHALHPFHASQVIAKHVDAGVTVVADGALTYLWL
SEVMSRVKPGGFLCHGYLGSMGVGFGTALGAQVADLEAGRRTILVTGDGSVGYSIGEFDTLVRKQLPLIVIIMNNQSWGA
TLHFQQLAVGPNRVTGTRLENGSYHGVAAAFGADGYHVDSVESFSAALAQALAHNRPACINVAVALDPIPPEELILIGMD
PFA
>Q8RJB2 1.1.1.320~~~yueD~~~Benzil reductase ((S)-benzoin forming)~~~COG1028
MRYVIITGTSQGLGEAIATQLLEESTTVISISRRENKELTKLAEQYNSNCIFHSLDLQDVHNLETNFKEIISSIKEDNVS
SIHLINNAGTVAPMKPIEKAESEQFITNVHINLLAPMILTSTFMKHTKEWKVDKRVINISSGAGKNPYFGWGAYCTTKAG
VNMFTQCVATEEVEKEYPVKIVAFAPGVVDTNMQAQIRETAKEDFTNLDRFIALKEEGKLLSPEYVAKAIRNLLETEEFP
QGEVIRIDE
>Q81BF4 ~~~~~~Bifunctional cytochrome P450/NADPH--P450 reductase~~~
MEKKVSAIPQPKTYGPLGNLPLIDKDKPTLSFIKIAEEYGPIFQIQTLSDTIIVVSGHELVAEVCDETRFDKSIEGALAK
VRAFAGDGLFTSETHEPNWKKAHNILMPTFSQRAMKDYHAMMVDIAVQLVQKWARLNPNENVDVPEDMTRLTLDTIGLCG
FNYRFNSFYRETPHPFITSMTRALDEAMHQLQRLDIEDKLMWRTKRQFQHDIQSMFSLVDNIIAERKSSGDQEENDLLSR
MLNVPDPETGEKLDDENIRFQIITFLIAGHETTSGLLSFAIYFLLKNPDKLKKAYEEVDRVLTDPTPTYQQVMKLKYMRM
ILNESLRLWPTAPAFSLYAKEDTVIGGKYPIKKGEDRISVLIPQLHRDKDAWGDNVEEFQPERFEELDKVPHHAYKPFGN
GQRACIGMQFALHEATLVMGMLLQHFELIDYQNYQLDVKQTLTLKPGDFKIRILPRKQTISHPTVLAPTEDKLKNDEIKQ
HVQKTPSIIGADNLSLLVLYGSDTGVAEGIARELADTASLEGVQTEVVALNDRIGSLPKEGAVLIVTSSYNGKPPSNAGQ
FVQWLEELKPDELKGVQYAVFGCGDHNWASTYQRIPRYIDEQMAQKGATRFSKRGEADASGDFEEQLEQWKQNMWSDAMK
AFGLELNKNMEKERSTLSLQFVSRLGGSPLARTYEAVYASILENRELQSSSSDRSTRHIEVSLPEGATYKEGDHLGVLPV
NSEKNINRILKRFGLNGKDQVILSASGRSINHIPLDSPVSLLALLSYSVEVQEAATRAQIREMVTFTACPPHKKELEALL
EEGVYHEQILKKRISMLDLLEKYEACEIRFERFLELLPALKPRYYSISSSPLVAHNRLSITVGVVNAPAWSGEGTYEGVA
SNYLAQRHNKDEIICFIRTPQSNFELPKDPETPIIMVGPGTGIAPFRGFLQARRVQKQKGMNLGQAHLYFGCRHPEKDYL
YRTELENDERDGLISLHTAFSRLEGHPKTYVQHLIKQDRINLISLLDNGAHLYICGDGSKMAPDVEDTLCQAYQEIHEVS
EQEARNWLDRVQDEGRYGKDVWAGI
>P09662 ~~~~~~Pesticidal crystal protein Cry10Aa~~~
MNPYQNKNEYEIFNAPSNGFSKSNNYSRYPLANKPNQPLKNTNYKDWLNVCQDNQQYGNNAGNFASSETIVGVSAGIIVV
GTMLGAFAAPVLAAGIISFGTLLPIFWQGSDPANVWQDLLNIGGRPIQEIDKNIINVLTSIVTPIKNQLDKYQEFFDKWE
PARTHANAKAVHDLFTTLEPIIDKDLDMLKNNASYRIPTLPAYAQIATWHLNLLKHAATYYNIWLQNQGINPSTFNSSNY
YQGYLKRKIQEYTDYCIQTYNAGLTMIRTNTNATWNMYNTYRLEMTLTVLDLIAIFPNYDPEKYPIGVKSELIREVYTNV
NSDTFRTITELENGLTRNPTLFTWINQGRFYTRNSRDILDPYDIFSFTGNQMAFTHTNDDRNIIWGAVHGNIISQDTSKV
FPFYRNKPIDKVEIVRHREYSDIIYEMIFFSNSSEVFRYSSNSTIENNYKRTDSYMIPKQTWKNEEYGHTLSYIKTDNYI
FSVVRERRRVAFSWTHTSVDFQNTIDLDNITQIHALKALKVSSDSKIVKGPGHTGGDLVILKDSMDFRVRFLKNVSRQYQ
VRIRYATNAPKTTVFLTGIDTISVELPSTTSRQNPNATDLTYADFGYVTFPRTVPNKTFEGEDTLLMTLYGTPNHSYNIY
IDKIEFIPITQSVLDYTEKQNIEKTQKIVNDLFVN
>P21256 ~~~~~~Pesticidal crystal protein Cry11Aa~~~
MEDSSLDTLSIVNETDFPLYNNYTEPTIAPALIAVAPIAQYLATAIGKWAAKAAFSKVLSLIFPGSQPATMEKVRTEVET
LINQKLSQDRVNILNAEYRGIIEVSDVFDAYIKQPGFTPATAKGYFLNLSGAIIQRLPQFEVQTYEGVSIALFTQMCTLH
LTLLKDGILAGSAWGFTQADVDSFIKLFNQKVLDYRTRLMRMYTEEFGRLCKVSLKDGLTFRNMCNLYVFPFAEAWSLMR
YEGLKLQSSLSLWDYVGVSIPVNYNEWGGLVYKLLMGEVNQRLTTVKFNYSFTNEPADIPARENIRGVHPIYDPSSGLTG
WIGNGRTNNFNFADNNGNEIMEVRTQTFYQNPNNEPIAPRDIINQILTAPAPADLFFKNADINVKFTQWFQSTLYGWNIK
LGTQTVLSSRTGTIPPNYLAYDGYYIRAISACPRGVSLAYNHDLTTLTYNRIEYDSPTTENIIVGFAPDNTKDFYSKKSH
YLSETNDSYVIPALQFAEVSDRSFLEDTPDQATDGSIKFARTFISNEAKYSIRLNTGFNTATRYKLIIRVRVPYRLPAGI
RVQSQNSGNNRMLGSFTANANPEWVDFVTDAFTFNDLGITTSSTNALFSISSDSLNSGEEWYLSQLFLVKESAFTTQINP
LLK
>Q45730 ~~~~~~Pesticidal crystal protein Cry11Ba~~~
MQNNNFNTTEINNMINFPMYNGRLEPSLAPALIAVAPIAKYLATALAKWAVKQGFAKLKSEIFPGNTPATMDKVRIEVQT
LLDQRLQDDRVKILEGEYKGIIDVSKVFTDYVNQSKFETGTANRLFFDTSNQLISRLPQFEIAGYEGVSISLFTQMCTFH
LGLLKDGILAGSDWGFAPADKDALICQFNRFVNEYNTRLMVLYSKEFGRLLAKNLNEALNFRNMCSLYVFPFSEAWSLLR
YEGTKLENTLSLWNFVGESINNISPNDWKGALYKLLMGAPNQRLNNVKFNYSYFSDTQATIHRENIHGVLPTYNGGPTIT
GWIGNGRFSGLSFPCSNELEITKIKQEITYNDKGGNFNSIVPAATRNEILTATVPTSADPFFKTADINWKYFSPGLYSGW
NIKFDDTVTLKSRVPSIIPSNILKYDDYYIRAVSACPKGVSLAYNHDFLTLTYNKLEYDAPTTQNIIVGFSPDNTKSFYR
SNSHYLSTTDDAYVIPALQFSTVSDRSFLEDTPDQATDGSIKFTDTVLGNEAKYSIRLNTGFNTATRYRLIIRFKAPARL
AAGIRVRSQNSGNNKLLGGIPVEGNSGWIDYITDSFTFDDLGITTSSTNAFFSIDSDGVNASQQWYLSKLILVKESSFTT
QIPLKPYVIVRCPDTFFVSNNSSSTYEQGYNNNYNQNSSSMYDQGYNNSYNPNSGCTCNQDYNNSYNQNSGCTCNQGYNN
NYPK
>Q9ZIU5 ~~~~~~Pesticidal crystal protein Cry11Bb~~~
MENNSFNVLANNNMSSFPLFNSKIEPSIAPALIAVAPIAKYLATALAKWALKQGFAKLKSEIFPGNETATMEKVRLEVQT
ILNQTLQTDRVATLKAEYEGFIHLGKVFTDYVSQSTFTPATAKTHFLNMSNLLIQRLPQFEIAGYEGVSISLFTQMCTLH
LGLLKDGILAGSDWGFTPEDKDSLICQFNRYVNEYNTRMMGLYSIEFGRLLAKNLNEALNFRNMCSLYVFPFSEAWYLLR
YEGTKLENTLSLWNFVGEDIGGILHNDWKGALYKLLMGATNQRLANVRFNYSYFSDTQGTIHRENILGAHPTYNGEQTPT
GWIGNGRLGRFSAPYSNELEITKVEQEITYNNKGDHSNSIVPANTRNEILTATVPITADPFFKTADINWRYFSQGLYYGW
NIKFDDRVILNSRVPGGIPSNRLEYDGYYIRAVSACPRNVPLSYNHNYLTLTYNRLEYDAPTTQNIIVGFSPNNTKSFYA
RNSHYLSATNDAYVIPALQFATVSDRSFLEDTPDQATDGSIKFTETVLGNEAKYSIRLNTGFNTATRYRLVIRFKATARL
AAGIRVRSQNSGNNRLLGGIPVEGNSGWVDYITDSFTFNDLGITTASTNAFFSIDSDGVNASQQWYLSKLILVKDFVNNS
GFRNQVPLAPYVIARCPNTFFVSNNTSSGYEQGYNDNYNQNTSSGYEQGYNDNYNQNTSSGYEQGYNDNYNQNTSSGYEQ
GYNDNYNQNTSSGYEQGYNDNYNQNTSSGV
>Q45754 ~~~~~~Pesticidal crystal protein Cry12Aa~~~
MATLNEVYPVNYNVLSSDAFQQLDTTGFKSKYDEMIKAFEKKWKKGAKGKDLLDVAWTYITTGEIDPLNVIKGVLSVLTL
IPEVGTVASAASTIVSFIWPKIFGDKPNAKNIFEELKPQIEALIQQDITNYQDAINQKKFDSLQKTINLYTVAIDNNDYV
TAKTQLENLNSILTSDISIFIPEGYETGGLPYYAMVANAHILLLRDAIVNAEKLGFSDKEVDTHKKYIKMTIHNHTEAVI
KAFLNGLDKFKSLDVNSYNKKANYIKGMTEMVLDLVALWPTFDPDHYQKEVEIEFTRTISSPIYQPVPKNMQNTSSSIVP
SDLFHYQGDLVKLEFSTRTDNDGLAKIFTGIRNTFYKSPNTHETYHVDFSYNTQSSGNISRGSSNPIPIDLNNPIISTCI
RNSFYKAIAGSSVLVNFKDGTQGYAFAQAPTGGAWDHSFIESDGAPEGHKLNYIYTSPGDTLRDFINVYTLISTPTINEL
STEKIKGFPAEKGYIKNQGIMKYYGKPEYINGAQPVNLENQQTLIFEFHASKTAQYTIRIRYASTQGTKGYFRLDNQELQ
TLNIPTSHNGYVTGNIGENYDLYTIGSYTITEGNHTLQIQHNDKNGMVLDRIEFVPKDSLQDSPQDSPPEVHESTIIFDK
SSPTIWSSNKHSYSHIHLEGSYTSQGSYPHNLLINLFHPTDPNRNHTIHVNNGDMNVDYGKDSVADGLNFNKITATIPSD
AWYSGTITSMHLFNDNNFKTITPKFELSNELENITTQVNALFASSAQDTLASNVSDYWIEQVVMKVDALSDEVFGKEKKA
LRKLVNQAKRLSKIRNLLIGGNFDNLVAWYMGKDVVKESDHELFKSDHVLLPPPTFHPSYIFQKVEESKLKPNTRYTISG
FIAHGEDVELVVSRYGQEIQKVMQVPYEEALPLTSESNSSCCVPNLNINETLADPHFFSYSIDVGSLEMEANPGIEFGLR
IVKPTGMARVSNLEIREDRPLTAKEIRQVQRAARDWKQNYEQERTEITAIIQPVLNQINALYENEDWNGSIRSNVSYHDL
EQIMLPTLLKTEEINCNYDHPAFLLKVYHWFMTDRIGEHGTILARFQEALDRAYTQLESRNLLHNGHFTTDTANWTIEGD
AHHTILEDGRRVLRLPDWSSNATQTIEIEDFDLDQEYQLLIHAKGKGSITLQHGEENEYVETHTHHTNDFITSQNIPFTF
KGNQIEVHITSEDGEFLIDHITVIEVSKTDTNTNIIENSPINTSMNSNVRVDIPRSL
>P9WPN1 1.14.-.-~~~~~~Putative cytochrome P450 135A1~~~COG2124
MASTLTTGLPPGPRLPRYLQSVLYLRFREWFLPAMHRKYGDVFSLRVPPYADNLVVYTRPEHIKEIFAADPRSLHAGEGN
HILGFVMGEHSVLMTDEAEHARMRSLLMPAFTRAALRGYRDMIASVAREHITRWRPHATINSLDHMNALTLDIILRVVFG
VTDPKVKAELTSRLQQIINIHPAILAGVPYPSLKRMNPWKRFFHNQTKIDEILYREIASRRIDSDLTARTDVLSRLLQTK
DTPTKPLTDAELRDQLITLLLAGHETTAAALSWTLWELAHAPEIQSQVVWAAVGGDDGFLEAVLKEGMRRHTVIASTARK
VTAPAEIGGWRLPAGTVVNTSILLAHASEVSHPKPTEFRPSRFLDGSVAPNTWLPFGGGVRRCLGFGFALTEGAVILQEI
FRRFTITAAGPSKGETPLVRNITTVPKHGAHLRLIPQRRLGGLGDSDPP
>P9WPM9 1.14.-.-~~~~~~Putative cytochrome P450 135B1~~~COG2124
MSGTSSMGLPPGPRLSGSVQAVLMLRHGLRFLTACQRRYGSVFTLHVAGFGHMVYLSDPAAIKTVFAGNPSVFHAGEANS
MLAGLLGDSSLLLIDDDVHRDRRRLMSPPFHRDAVARQAGPIAEIAAANIAGWPMAKAFAVAPKMSEITLEVILRTVIGA
SDPVRLAALRKVMPRLLNVGPWATLALANPSLLNNRLWSRLRRRIEEADALLYAEIADRRADPDLAARTDTLAMLVRAAD
EDGRTMTERELRDQLITLLVAGHDTTATGLSWALERLTRHPVTLAKAVQAADASAAGDPAGDEYLDAVAKETLRIRPVVY
DVGRVLTEAVEVAGYRLPAGVMVVPAIGLVHASAQLYPDPERFDPDRMVGATLSPTTWLPFGGGNRRCLGATFAMVEMRV
VLREILRRVELSTTTTSGERPKLKHVIMVPHRGARIRVRATRDVSATSQATAQGAGCPAARGGGPSRAVGSQ
>Q45755 ~~~~~~Pesticidal crystal protein Cry13Aa~~~
MTCQLQAQPLIPYNVLAGVPTSNTGSPIGNAGNQFDQFEQTVKELKEAWEAFQKNGSFSLAALEKGFDAAIGGGSFDYLG
LVQAGLGLVGTLGAAIPGVSVAVPLISMLVGVFWPKGTNNQENLITVIDKEVQRILDEKLSDQLIKKLNADLNAFTDLVT
RLEEVIIDATFENHKPVLQVSKSNYMKVDSAYFSTGGILTLGMSDFLTDTYSKLTFPLYVLGATMKLSAYHSYIQFGNTW
LNKVYDLSSDEGKTMSQALARAKQHMRQDIAFYTSQALNMFTGNLPSLSSNKYAINDYNVYTRAMVLNGLDIVATWPTLY
PDDYSSQIKLEKTRVIFSDMVGQSESRDGSVTIKNIFDNTDSHQHGSIGLNSISYFPDELQKAQLRMYDYNHKPYCTDCF
CWPYGVILNYNKNTFRYGDNDPGLSGDVQLPAPMSVVNAQTQTAQYTDGENIWTDTGRSWLCTLRGYCTTNCFPGRGCYN
NSTGYGESCNQSLPGQKIHALYPFTQTNVLGQSGKLGLLASHIPYDLSPNNTIGDKDTDSTNIVAKGIPVEKGYASSGQK
VEIIREWINGANVVQLSPGQSWGMDFTNSTGGQYMVRCRYASTNDTPIFFNLVYDGGSNPIYNQMTFPATKETPAHDSVD
NKILGIKGINGNYSLMNVKDSVELPSGKFHVFFTNNGSSAIYLDRLEFVPLDQPAAPTQSTQPINYPITSRLPHRSGEPP
AIIWEKSGNVRGNQLTISAQGVPENSQIYLSVGGDRQILDRSNGFKLVNYSPTYSFTNIQASSSNLVDITSGTITGQVQV
SNL
>Q0P8H8 4.1.1.-~~~~~~L-serine phosphate decarboxylase Cj1436c~~~COG0079
MLIKLNDYEKNITQKIKDLKNASGSHSPSIFTMAEQIPELNIKIDSCFLSNPYATALFLRYLKEELIDGQKLRSVLEFYP
SQNSIIAKTVADFIGIDPKNVFIGNGAIEIIQAVMHNFVGKKIIVNIPTFSSYYEFAKSETNVVYYQLSKEDNYNLNIEH
YLNFVKNENPDSVVLINPNNPDGGYINYEKLRYILSELKYVKNIIIDESFIHFAYENKDYNGINIEYLFKEFHNTIIIKS
MSKDFGVAGIRIGYAIMSEDKIRGLLKNGYLWNSSGLSEYFLRLYVRKNFFDEYDKVRREYIQETQTFFRKLSGIKQFKV
YPSMANFALVELLDGSSSTDFVAKMLIKYGIYMRTCNDKIGLEGEFIRIASRTLEENDMVLKSICDVFKE
>Q0P8H7 2.6.1.-~~~~~~Dihydroxyacetone phosphate transaminase Cj1437c~~~COG0079
MQANKNIQKLTPYLSIPHKIWNSSQSNILKLDWNEATIPPSPYVIESIKKFLVNGNLNWYPNTKNLYLLDKIAEYTKQIN
SSFVELFEGSDSAHECIIDVFLDKCDKIGIVSPTYDNFRSRANGVGIETISFTLDDNFNLDFDSLEYFIHEKRIKLLYLC
NPNNPTGKSYNIQKIKSLIINNPNVMFIIDEAYYEFTSQSVCDLVEQCNNLIITRTFSKAFALASFRIGYIISHPENIES
INKLRNPKSVPMLSQIAANAALEDLQYMRDYVDEVSCARMEFVKFLNTLTTGGGGIFNDSVANFVLIQNENISLFVGFLE
KEGIFIRNYSHLISKNCRISIGTRNQMSYVAEKIQEFAKKQGGFHLV
>Q45710 ~~~~~~Pesticidal crystal protein Cry14Aa~~~
MDCNLQSQQNIPYNVLAIPVSNVNALVDTAGDLKKAWEEFQKTGSFSLTALQQGFSASQGGAFNYLTLLQSGISLAGSFV
PGGTFVAPIVNMVIGWLWPHKNKTADTENLIKLIDEEIQKQLNKALLDQDRNNWTSFLESIFDTSATVSNAIIDAQWSGT
VDTTNRQQKTPTTSDYLNVVGKFDSADSSIITNENQIMNGNFDVAAAPYFVIGATLRLSLYQSYIKFCNSWIDAVGFSTN
DANTQKANLARTKLTMRTTINEYTQRVMKVFKDSKNMPTIGTNKFSVDAYNVYVKGMTLNVLDMVAIWSSLYPNDYTSQT
AIEQTRVTFSNMVGQEEGTDGTLKIYNTFDSLSYQHSLIPNNNVNLISYYTDELQNLELAVYTPKGGSGYAYPYGFILNY
ANSNYKYGDNDPTGKPLNKQDGPIQQINAATQNSKYLDGETINGIGASLPGYCTTGCSATEQPFSCTSTANSYKASCNPS
DTNQKINALYAFTQTNVKGSTGKLGVLASLVPYDLNPKNVFGELDSDTNNVILKGIPAEKGYFPNNARPTVVKEWINGAS
AVPFYSGNTLFMTATNLTATQYKIRIRYANPNSDTQIGVLITQNGSQISNSNLTLYSTTDSSMSSNLPQNVYVTGENGNY
TLLDLYSTTNVLSTGDITLKLTGGNQKIFIDRIEFIPTMPVPAPTNNTNNNNGDNGNNNPPHHGCAIAGTQQLCSGPPKF
EQVSDLEKITTQVYMLFKSSSYEELALKVSSYQINQVALKVMALSDEKFCEEKRLLRKLVNKANQLLEARNLLVGGNFET
TQNWVLGTNAYINYDSFLFNGNYLSLQPASGFFTSYAYQKIDESTLKPYTRYKVSGFIGQSNQVELIISRYGKEIDKILN
VPYAGPLPITADASITCCAPEIDQCDGGQSDSHFFNYSIDVGALHPELNPGIEIGLKIVQSNGYITISNLEIIEERPLTE
MEIQAVNRKDQKWKREKLLECASVSELLQPIINQIDSLFKDANWYNDILPHVTYQTLKNIIVPDLPKLKHWFIDHLPGEY
HEIEQKMKEALKHAFTQLDEKNLIHNGHFATNLIDWQVEGDARMKVLENNALALQLSNWDSSVSQSIDILEFDEDKAYKL
RVYAQGSGTIQFGNCEDEAIQFNTNSFVYKEKIIYFDTPSINLHIQSEGSEFVVSSIDLVELSDDE
>Q9KZF5 1.14.19.69~~~~~~Biflaviolin synthase CYP158A1~~~COG2124
MTQETTTLTGQSPPPVRDWPALDLDGPEFDPVLAELMREGPLTRVRLPHGEGWAWLATRYDDVKAITNDPRFGRAEVTQR
QITRLAPHFKPRPGSLAFADQPDHNRLRRAVAGAFTVGATKRLRPRAQEILDGLVDGILAEGPPADLVERVLEPFPIAVV
SEVMGVPAADRERVHSWTRQIISTSGGAEAAERAKRGLYGWITETVRARAGSEGGDVYSMLGAAVGRGEVGETEAVGLAG
PLQIGGEAVTHNVGQMLYLLLTRRELMARMRERPGARGTALDELLRWISHRTSVGLARIALEDVEVHGTRIAAGEPVYVS
YLAANRDPDVFPDPDRIDLDRDPNPHLAYGNGHHFCTGAVLARMQTELLVDTLLERLPGLRLAVPAEQVAWRRKTMIRGP
RTLPCTW
>Q9FCA6 1.14.19.69~~~~~~Biflaviolin synthase CYP158A2~~~COG2124
MTEETISQAVPPVRDWPAVDLPGSDFDPVLTELMREGPVTRISLPNGEGWAWLVTRHDDVRLVTNDPRFGREAVMDRQVT
RLAPHFIPARGAVGFLDPPDHTRLRRSVAAAFTARGVERVRERSRGMLDELVDAMLRAGPPADLTEAVLSPFPIAVICEL
MGVPATDRHSMHTWTQLILSSSHGAEVSERAKNEMNAYFSDLIGLRSDSAGEDVTSLLGAAVGRDEITLSEAVGLAVLLQ
IGGEAVTNNSGQMFHLLLSRPELAERLRSEPEIRPRAIDELLRWIPHRNAVGLSRIALEDVEIKGVRIRAGDAVYVSYLA
ANRDPEVFPDPDRIDFERSPNPHVSFGFGPHYCPGGMLARLESELLVDAVLDRVPGLKLAVAPEDVPFKKGALIRGPEAL
PVTW
>Q45729 ~~~~~~Pesticidal crystal protein Cry15Aa~~~
MAIMNDIAQDAARAWDIIAGPFIRPGTTPTNRQLFNYQIGNIEVEPGNLNFSVVPELDFSVSQDLFNNTSVQQSQTASFN
ESRTETTSTAVTHGVKSGVTVSASAKFNAKILVKSIEQTITTTVSTEYNFSSTTTRTNTVTRGWSIAQPVLVPPHSRVTA
TLQIYKGDFTVPVLLSLRVYGQTGTLAGNPSFPSLYAATYENTLLGRIREHIAPPALFRASNAYISNGVQAIWRGTATTR
VSQGLYSVVRIDERPLAGYSGETRTYYLPVTLSNSSQILTPGSLGSEIPIINPVPNASCKKENSPIIIHHDREKHRERDY
DKEHICHDQAEKYERDYDKE
>Q9KIZ4 1.14.15.-~~~~~~Epothilone C/D epoxidase~~~
MTQEQANQSETKPAFDFKPFAPGYAEDPFPAIERLREATPIFYWDEGRSWVLTRYHDVSAVFRDERFAVSREEWESSAEY
SSAIPELSDMKKYGLFGLPPEDHARVRKLVNPSFTSRAIDLLRAEIQRTVDQLLDARSGQEEFDVVRDYAEGIPMRAISA
LLKVPAECDEKFRRFGSATARALGVGLVPRVDEETKTLVASVTEGLALLHGVLDERRRNPLENDVLTMLLQAEADGSRLS
TKELVALVGAIIAAGTDTTIYLIAFAVLNLLRSPEALELVKAEPGLMRNALDEVLRFDNILRIGTVRFARQDLEYCGASI
KKGEMVFLLIPSALRDGTVFSRPDVFDVRRDTSASLAYGRGPHVCPGVSLARLEAEIAVGTIFRRFPEMKLKETPVFGYH
PAFRNIESLNVILKPSKAG
>Q45882 ~~~~~~Pesticidal crystal-like protein Cry16Aa~~~
MNTNIFSTHLEFSKGVASVFKVIDTIHNISKNNNFNNILTQDFIIDTILSILWEDPNENEIFSSMIEDGETITNKNLSAQ
TKEGLLLNSNSFGLKFKYYNNAFRSWIDNYNPTSIDDVVYRFKDVNSICENNINEFKVKNYEVTVLPIYMQIANLHLLLL
RDGMIYGDAWNLYRELGFSDQDSFYNHVLDKTKFYINDCLNYYNTGLSNLKLDPNNSWIDITRYCRFMTFYILDMISICP
IYDTKVYDKPINMQTLTRKVYSDPVNFIDENIPISEYEKMYNISPELFSTLFSISFYTNKSGNKFLNGHVNRHVGTDLNY
NGLRETHYGNYGSNYEVESMAFDDIKAYSNNYFNNTQNNNPTSVKSIKFLITKNNDEWIYGEPDSSNIDFTRNIQGYLSN
LNNESYTHSLSDMILANNDKIQINIDTPHSYSYSWIYKGIEDTNYISDKLINQIPLVKEVKLKSRHYSEISVIKGPGFTG
GDLILSKVHKPANQIPAQYMKNKITIPIKTKFPAGSQDFKVRLCYASNHDIGLIRLIAGSKYITTNIQQTFNTTENNPSL
IYDDFKYFNFNETLSITSSGIDELYLEFYYSYTDGNFEDFPKLSIPYTRNYSC
>O05102 ~~~~~~Pesticidal crystal-like protein Cry17Aa~~~
MNNKKIEQNKIVEYNSNLDIQPRELNTLNGLVFTGATVSIILPLIGTTAVVPVVGGVIGIIAALLPVIWPAGTSSNDNLF
DAVMKDTEMIMDEKISEYVVNDAMTRLESLYNILDYYRLSKDFWEKNKDDPLAIAELKERFSKLHSQFIESMAYFKRANY
EVLLLPAYANAANLHLLLLREGLLLNKVIDNFITEGLHYEEFKTKRSTYIAHCSTWYNKGLENIKNKTRDFNKINKYDAY
MNLSVLDIISLFLSYDPYQYDKATKLQTLTRTVFSDPLQRAPRDLYISPKEETLFKNLKGLRAFFAEGDLVLTGFRNYFR
NTYINDQIIEGDLFGYTTNNERYKLFTDSKIYKVTVFIDNVALAIVKLIFHDTDNKEWDFSKTDITDINKYRKEEVYLNL
LSNNEIQKEPSHYLYKMHHYGDNYNDSYLFQWIHQSISPENYLFDKDKDDNYIITQIPAIKASELSNLGELSLQAIKGPR
FTGGNVILSSVSKIDNNDPLYGGTIKIPLLTAFNNTSKFKIRIYYAANHNYNHDYIGALLTINSQHVANFKFKQTFSGED
YSNLSYNNYQFDYLVQTVAFPQNTSDVTLNLQFFYDPKFLNDYKQIVIIDKIEFIPEN
>Q45358 ~~~~~~Parasporal crystal protein Cry18Aa~~~
MNNNFNGGNNTGNNFTGNTLSNGICTKKNMKGTLSRTAIFSDGISDDLICCLDPIYNNNDNNNDAICDELGLTPIDNNTI
CSTDFTPINVMRTDPFRKKSTQELTREWTEWKENSPSLFTPAIVGVVTSFLLQSLKKQATSFLLKTLTDLLFPNNSSLTM
EEILRATEQYVQERLDTDTANRVSQELVGLKNNLTTFNDQVEDFLQNRVGISPLAIIDSINTMQQLFVNRLPQFQVSGYQ
VLLLPLFAQAATLHLTFLRDVIINADEWNIPTAQLNTYTRYFKEYIAEYSNYALSTYDDGFRTRFYPRNTLEDMLQFKTF
MTLNALDLVSIWSLLKYVNLYVSTSANLYNIGDNKVNEGAYPISYGPFFNSYIQTKSNYVLSGVSGIGARFTYSTVLGRY
LHDDLKNIITTYVGGTQGPNIGVQLSTTELDELKKQQQATRDSLVDFQFFTLNCMLPNPITAPYFATSLYESRYSSIGGY
LRKDVFKSEDSTCGLGNPGAWTSYPDYYITNISATVQINGENTDTTPLYFKENRPITSTRGVNKVIAVYNRKANIAGTNQ
NGTMIHQAPPDGTGFTVSPLHPSANTITSYIKENYGNSGDSLHLKGQGYLHYMLSGNGQDRYRLVLRLSGAANQIKLQSP
TTSIYAFDTSTNNEGITDNGSKFKDFAFSTPFVIPEQKEIVLYFEGVGSLDLMNLIFLPADDTPLY
>P57091 ~~~~~~Parasporal crystal protein Cry18Ba~~~
MNNNGNALSRTALTPTNNKVISGDLVTNGLPPIDNNIICSNGFMPINVTRKNPFRKRTTQEFIREWTEWKENSPSLFTAP
IVGVVTSTLLEALKKQVQSRLLLLMTNLLFPNNSTSTMEEILRATEQYVQEQLDTVTWNRVSQELEGLKNNLRTFNDQID
DFLQNRVEISPTAMIDSINTMQQVFVNRLPQFQLSDYQLLLLPLFAQGATLHLTFIRDIIINAGEWNIPEAQLNTCKRYL
KQYVAQYSNYALSTYEGAFRARFYPRATLENMLQFKTFMTLNVLDLVSIWSLLKYMNLYISTSANLYNIGDNKVNEGEYS
ISYWPFFNSYIQTKSNYVLSGVSGYAIRWYYLNTFFGEYIQDNLYNIIASYVGGVNGPKIGVQLSTTELDKQIKQQARAG
MPTGLDDLSFNCTLRNPTTVPYFACNFQELTSSGTAGTGGFIRSDVFRSEDNICGLGTGYASAWTSYPDYYITNISATVQ
VDGINIDITPLCFGEDRAITSTHGVNKVIAVYNRKANIAGTNQNGTMIHQAPNDGTGFTVSPLHLASFTHPSEAHIQENY
GNSGDSLRLTGPTTAITYMLSGDGRTIYKLVLRVSGVITRITAKVRGNSIGYLEYINTVDNNQGITDNGSKFQDFEFRPT
ITIDAQTPIVLEFSATSNFDLMNLIFIPYYDTPIY
>P57092 ~~~~~~Parasporal crystal protein Cry18Ca~~~
MNNYFIGKVLSGHHINNNGNGNTLSRTALTPTNNNVNRGDLVTNGLTPIDNNFIGSNGFIPRNVTRKDPFRKRTTQEFIR
EWTEWKEKSASLFTAPIVGVITSTLLEALKKLVAGRVLMSLTNLLFPNNSTSTMEEILRATEQYIQEQLDTVTWNRVSQE
LEGLKNDLRTFNDQIDDFLQNRVGISPLAIIDSINTMQQLFVNRLPQFQVSDDQVLLLPLFAQAVTLHLTFVRDIIINAD
EWNIPEAQLNTYKRYLKQYVAQYSNYALSTYEEAFRARFYPRNTVENMLEFKTFMTLNVLDLVSMWSLLKYVNLYVSTSA
NLYNIGDNKVNEGEYSISYWPFFNTYIQTKSNYVLSGVSGYAMRWSYTNPFFGEYIQDHLYNITASYIGGVNGPQIGQQL
STTELDQLVQQQARADIPVDFTQIPINCTLRNPLEVPYYATRFNELTSLGTAGVGGFVRSDVFISNDSVCGLGTNYSSGQ
TFYPDYYITNISATVQVNGTNTDISPLYFGENRAITSTNGVNKVIAIYNRKTNYDDFTNIRGTIVHEAPTDSTGFTISPL
HLDTVNINSYLYIQENYGNNGDSLRVINRAIIKYRLSAARSVIYRLVLRVSGTASSIVAIYENYPVGSANQINTGTDNEG
VIDNDSKFIDLIFNTPFSVSGTARELQLQVSGATTSSPLDIMNIILIPINDVPLY
>O32307 ~~~~~~Pesticidal crystal protein Cry19Aa~~~
MHYYGNRNEYDILNASSNDSNMSNTYPRYPLANPQQDLMQNTNYKDWLNVCEGYHIENPREASVRAGLGKGLGIVSTIVG
FFGGSIILDTIGLFYQISELLWPEDDTQQYTWQDIMNHVEDLIDKRITEVIRGNAIRTLADLQGKVDDYNNWLKKWKDDP
KSTGNLSTLVTKFTALDSDFNGAIRTVNNQGSPGYELLLLPVYAQIANLHLLLLRDAQIYGDKWWSARANARDNYYQIQL
EKTKEYTEYCINWYNKGLNDFRTAGQWVNFNRYRREMTLTVLDIISMFPIYDARLYPTEVKTELTREIYSDVINGEIYGL
MTPYFSFEKAESLYTRAPHLFTWLKGFRFVTNSISYWTFLSGGQNKYSYTNNSSINEGSFRGQDTDYGGTSSTINIPSNS
YVYNLWTENYEYIYPWGDPVNITKMNFSVTDNNSSKELIYGAHRTNKPVVRTDFDFLTNKEGTELAKYNDYNHILSYMLI
NGETFGQKRHGYSFAFTHSSVDPNNTIAANKITQIPVVKASSINGSISIEKGPGFTGGDLVKMRADSGLTMRFKAELLDK
KYRVRIRYKCNYSSKLILRKWKGEGYIQQQIHNISPTYGAFSYLESFTITTTENIFDLTMEVTYPYGRQFVEDIPSLILD
KIEFLPTN
>O86170 ~~~~~~Pesticidal crystal protein Cry19Ba~~~
MNSYQNKNEYEILDAKRNTCHMSNCYPKYPLANDPQMYLRNTHYKDWINMCEEASYASSGPSQLFKVGGSIVAKILGMIP
EVGPLLSWMVSLFWPTIEEKNTVWEDMIKYVANLLKQELTNDTLNRATSNLSGLNESLNIYNRALAAWKQNKNNFASGEL
IRSYINDLHILFTRDIQSDFSLGGYETVLLPSYASAANLHLLLLRDVAIYGKELGYPSTDVEFYYNEQKYYTEKYSNYCV
NTYKSGLESKKQIGWSDFNRYRREMTLSVLDIVALFPLYDTGLYPSKDGKIHVKAELTREIYSDVINDHVYGLMVPYISF
EHAESLYTRRPHAFTWLKGFRFVTNSINSWTFLSGGENRYFLTHGEGTIYNGPFLGQDTEYGGTSSYIDISNNSSIYNLW
TKNYEWIYPWTDPVNITKINFSITDNSNSSESIYGAERMNKPTVRTDFNFLLNRAGNGPTTYNDYNHILSYMLINGETFG
QKRHGYSFAFTHSSVDRYNTIVPDKIVQIPAVKTNLVGANIIKGPGHTGGDLLKLEYERFLSLRIKLIASMTFRIRIRYA
SNISGQMMINIGYQNPTYFNIIPTTSRDYTELKFEDFQLVDTSYIYSGGPSISSNTLWLDNFSNGPVIIDKIEFIPLGIT
LNQAQGYDTYDQNANGMYHQNYSNSGYNYNQEYNTYYQSYNN
>O32321 ~~~~~~Pesticidal crystal protein Cry20Aa~~~
MNPYQNNDEIVDVPENYDNNLNRYPYANDPNVAMQNTNYKDWMNGYEEINPSSITAILASIGILNRVIALTGVLGNTQEV
ISIIQDALGFIRNGTGNELLIHVEQLIQQTLATQYRSAATGAIYGISRSYDNYLMFFRQWERNRTRENGQQVESAFTTIN
TLCINALAPQASLSRRGFETLLLPNYAMAANFHLLLLRDAVLYRNQWLSNSISTANVNLNILRAAINEYITHCTRWYQDG
LNRFDRSSRANMNEWRRFNAYRRDMTLSVLDFATVFPTYDPVLFPAATNVELTRVVYTDPIVMAGGRTAIPGFTRMENLV
NSASRVSFLNQMNIYTSFYFRPHNIPRYYWSGNQNFLSNGTSNLYGYRSDGRTTFNVSNIDIFRVNMTTHIGGAFTDDYR
GLHRAEFIGANTQNNQRTSLLYSVEIPSSHFRFENHTVFLPGESGLEPNERNYTHRLFQMMNEVSVNPNARGRVFLHAWT
HRSLRRTNGLRSDQILQIPAVKTISNGGDRAVVLNYGENIMKLDNLTTGLSYKLTAVDSEASNTRFIVRVRYASMNNNKL
NLVLNGAQIASLNVEHTVQRGGSLTDLQYGNFKYATFAGNFKMGSQSILGIFKEIPNIDFVLDKIELIPSNFMSSLEQTQ
NYNTYNQDTIYTHNQGYDTYDQNSSGMYHQSYNNYDQNMDTTYQPSYDNYNQNASGTYDDGYNPNASDSYDQSYTNNYSQ
NTNSMYDQGYYNNNYDQHSGCTCNQGYDNNYLK
>P56956 ~~~~~~Pesticidal crystal protein Cry21Aa~~~
MTNPTILYPSYHNVLAHPIRLDSFFDPFVETFKDLKGAWEEFGKTGYMDPLKQHLQIAWDTSQNGTVDYLALTKASISLI
GLIPGADAVVPFINMFVDFIFPKLFGRGSQQNAQAQFFELIIEKVKELVDEDFRNFTLNNLLNYLDGMQTALSHFQNDVQ
IAICQGEQPGLMLDQTPTACTPTTDHLISVRESFKDARTTIETALPHFKNPMLSTNDNTPDFNSDTVLLTLPMYTTGATL
NLILHQGYIQFAERWKSVNYDESFINQTKVDLQRRIQDYSTTVSTTFEKFKPTLNPSNKESVNKYNRYVRSMTLQSLDIA
ATWPTLDNVNYPSNVDIQLDQTRLVFSDVAGPWEGNDNITSNIIDVLTPINTGIGFQESSDLRKFTYPRIELQSMQFHGQ
YVNSKSVEHCYSDGLKLNYKNKTITAGVSNIDESNQNNKHNYGPVINSPITDINVNSQNSQYLDLNSVMVNGGQKVTGCS
PLSSNGNSNNAALPNQKINVIYSVQSNDKPEKHADTYRKWGYMSSHIPYDLVPENVIGDIDPDTKQPSLLLKGFPAEKGY
GDSIAYVSEPLNGANAVKLTSYQVLQMEVTNQTTQKYRIRIRYATGGDTAASIWFHIIGPSGNDLTNEGHNFSSVSSRNK
MFVQGNNGKYVLNILTDSIELPSGQQTILIQNTNSQDLFLDRIEFISLPSTSTPTSTNFVEPESLEKIINQVNQLFSSSS
QTELAHTVSDYKIDQVVLKVNALSDDVFGVEKKALRKLVNQAKQLSKARNVLVGGNFEKGHEWALSREATMVANHELFKG
DHLLLPPPTLYPSYAYQKIDESKLKSNTRYTVSGFIAQSEHLEVVVSRYGKEVHDMLDIPYEEALPISSDESPNCCKPAA
CQCSSCDGSQSDSHFFSYSIDVGSLQSDVNLGIEFGLRIAKPNGFAKISNLEIKEDRPLTEKEIKKVQRKEQKWKKAFNQ
EQAEVATTLQPTLDQINALYQNEDWNGSVHPASDYQHLSAVVVPTLPKQRHWFMEGREGEHVVLTQQFQQALDRAFQQIE
EQNLIHNGNLANGLTDWTVTGDAQLTIFDEDPVLELAHWDASISQTIEIMDFEGRHRIQTACTWKRQRNSYRSTWRKRLE
TMTFNTTSFTTQEQTFYFEGDTVDVHVQSENNTFLIDSVELIEIIEE
>P56957 ~~~~~~Pesticidal crystal protein Cry22Aa~~~
MKEQNLNKYDEITVQAASDYIDIRPIFQTNGSATFNSNTNITTLTQAINSQAGAIAGKTALDMRHDFTFRADIFLGTKSN
GADGIAIAFHRGSIGFVGTKGGGLGILGAPKGIGFELDTYANAPEDEVGDSFGHGAMKGSFPSFPNGYPHAGFVSTDKNS
RWLSALAQMQRIAAPNGRWRRLEIRWDARNKELTANLQDLTFNDITVGEKPRTPRTATWRLVNPAFELDQKYTFVIGSAT
GASNNLHQIGIIEFDAYFTKPTIEANNVNVPVGATFNPKTYPGINLRATDEIDGDLTSKIIVKANNVNTSKTGVYYVTYY
VENSYGESDEKTIEVTVFSNPTIIASDVEIEKGESFNPLTDSRVGLSAQDSLGNDITQNVKVKSSNVDTSKPGEYEVVFE
VTDSFGGKAEKDFKVTVLGQPSIEANNVELEIDDSLDPLTDAKVGLRAKDSLGNDITKDIKVKFNNVDTSNSGKYEVIFE
VTDRFGKKAEKSIEVLVLGEPSIEANDVEVNKGETFEPLTDSRVGLRAKDSLGNDITKDVKIKSSNVDTSKPGEYEVVFE
VTDRFGKYVEKTIGVIVPVIDDEWEDGNVNGWKFYAGQDIKLLKDPDKAYKGDYVFYDSRHVAISKTIPLTDLQINTNYE
ITVYAKAESGDHHLKVTYKKDPAGPEEPPVFNRLISTGTLVEKDYRELKGTFRVTELNKAPLIIVENFGAGYIGGIRIVK
IS
>O87906 ~~~~~~Pesticidal crystal protein Cry25Aa~~~
MNPYQNKSECEILNAPLNNINMPNRYPFANDPNAVMKNGNYKDWLNECDGITPSIFGTLGVLASIVISTINLATSPSIGD
AFALVSSIGEYWPETKTSFPLSVADVNRLIREALDQNAINRATGKFNGLMDTYNTVYLKNLQDWYDTRIPANPQGDSQLR
EAARRSLEEIERDFRKALAGEFAEAGSQIVLLPIYAQAANIHLLILKDAMQFRTDLGLIRPVGVPITTSAEDPFESEFLL
RIKKYTDHCISYYDDGLAKIRSRGSDGETWWEFNKFRREMTLTVLDLVALYPTHNIKLYPIPTQTELSRVVYTDPVGCFG
NRKSDIFSRLNFDYLENRLTRPREPFNYLNSVQLFASTVSNSNNGEVLRGNLNKIMFEGGWTASRSGDGVTTGTPFSTMD
WSYGWGYPRKHYAEITSRSQALPGLNNSIHVIVGIDSFRAIGPGGQGDHTFSLPGGDMYDCGKVQINPLEDYRNSDHWIS
DMMTINQSVQLASNPTQTFAFSALSLGWHHSSAGNRNVYVYDKITQIPATKTVREHPMIKGPGFTGGDLADLSSNSDILQ
YDLRSDYDDRLTEDVPFRIRIRCASIGVSTISVDNWGSSSPQVTVASTAASLDTLKYESFQYVSIPGNYYFDSAPRIRLL
RQPGRLLVDRIEIIPVNFFPLSEQENKSVDSLFIN
>Q9X597 ~~~~~~Pesticidal crystal protein Cry26Aa~~~
MNSEEMNHVNPFEISDNNDVSIPSQRYPFANDPADSVFCADDFLQSYGEFNMDNFGESEPFIDASGAINAAIGVTGTVLG
FLGVPFAGALTTFYQKLFGFLFPNNNTKQWEEFMKQVEALIDEKISDAVRNKAISELQGLVNNITLYTEALEEWLENKEN
PAVRDRVLQRWRILDGFFEQQMPSFAVKGFEVLLLVVYTQAANLHLLSLRDAYIYGAEWGLTPTNIDQNHTRLLRHSAEY
TDHCVNWYNTGLKQLENSDAKSWFQYNRFRREMTLSVLDVIALFPAYDVKMYPIPTNFQLTREVYTDVIGKIGRNDSDHW
YSANAPSFSNLESTLIRTPHVVDYIKKLKIFYATVDYYGIYGRSGKWVGHIITSATSANTTETRNYGTIVNHDSVELNFE
GKNIYKTGSLPQGVPPYQIGYVTPIYFITRAVNFFTVSGSKTSVEKYYSKKDRYYSEGLPEEQGVFSTEQLPPNSIAEPE
HIAYSHRLCHVTFISVSNGNKYSKDLPLFSWTHSSVDFDNYVYPTKITQLPATKGYNVSIVKEPGFIGGDIGKNNGQILG
KYKVNVEDVSQKYRFRVRYATETEGELGIKIDGRTVNLYQYKKTKAPGDPLTYKAFDYLSFSTPVKFNNASSTIELFLQN
KTSGTFYLAGIEIIPVKSNYEEELTLEEAKKAVSSLFTDARNALKIDVTDYQIDQAANLVECISGDLYAKEKIVLLRAVK
FAKQLSQSQNLLSDPEFNNVNRENSWTASTSVAIIEGDPLYKGRAVQLSSARDENFPTYLYQKIDESTLKPYTRYQLRGF
VEGSENLDVYLIRYGAAHVRMNVPYNLEIIDTSSPVNPCEEVDGLSHRSCNVFDRCKQSISVAPDANTGPDQIDGDPHAF
SFHIDTGTVDSTENLGIWVAFKISELDGSAIFGNLELIEVGPLSGEALAQVQRKEEKWKQVLAKKRETTAQTVCSGEASQ
LTNSSQILKIRNYDLIQNFRIFSLRNTLSIKFKIYTITNYPYSRLNYDLFMELENRIQNASLYMTSNILQNGGFKSDVTS
WETTANAEVQQIDGASVLVLSNWNASVAQSVNVQNDHGYVLRVTAKKEGIGNGYVTILDCANHIDTLTFSACRSDSDTSS
NELTAYVTKTLEIFPDTEQIRIEIGETEGMFYVESVELIRMEN
>Q9S597 ~~~~~~Pesticidal crystal protein Cry27Aa~~~
MNPYQNKNEYEILDAKRNNCHMSNGYPRHPLANDPQMYLRNAHYKDWLSMCNKNNPVGLIPPESFEWTWLNGTVAALTIV
SVIAGILVTAPVSVTAGLITVLGAGAALLAGITPLIWPATTDNTFNKITDATEVLLNKEISEFVRKTANTKIDSLQQLIY
YYQNALENWKKNPNDSAARNTVSTRFQIVNAFFVEAMPALSMPGYEVVQLGAYAQAANLHLILLREGIAYADQWNLARDP
MHAAGDLHYKEFLDYRNQYINHCSTWYNEGQNEANLKNNGLVYQRTMTLFVLDLIAMFSTYDPRLYTMPIKTEILTRTIY
TDGVNRNEPKSIHNPGLFRRLEQMKLHIYEYQGAQFLSGHQNIFRSMNYNHPLIYGPVQGYSSSNINKITTINLGDYDKI
YSINTESRNRLVQGSTTFDKINFYGAFNENWLFSVYNQNGPIIKHSNIPGIDAPSTGLNYSNYTHYLSNCIFQSNRNGGS
APDYNTQSYVFGWNHYTIDPTGNYVTDAFEVNKNLPESRYVPQISQVPAVKASDIFNPGRVVNAKVESGPYFTGGDVIVS
KAQLDGSGLARTLITFPIIPKRYRASGFRVRMYYAANHTGQVSYGVANINTTGYANFQKTFDGWEYFRARHEHFKYIEFD
TTFSLRNSGQLEEHLLHIYYPNTTKISGDQLLIIDKIEFIPVGIPLNQTSEGYNTYDQNTNSYNQNYNNYNQNMDTTYQP
NYDNYKQNSSGMYDNPYNQNPKDSYNQNYTDTYDSGYNNSQNVGSNYNQEYNTYNQDTENMYNQSYNNYNSDNNNYNQNS
DCMCSPGYNGNYECRCNQRANGNYPK
>Q9X682 ~~~~~~Pesticidal crystal protein Cry28Aa~~~
MAQTYYKIGVQSTEVNSESIFFNPEVDSSDTVAVVSAGIVVVGTILTAFASFVNPGVVLISFGTLAPVLWPDPEEDPKKI
WSQFMKHGEDLLNQTISTAVKEIALAHLNGFKDVLTYYERAFNDWKRNPSANTARLVSQRFENAHFNFVSNMPQLQLPTY
DTLLLSCYTEAANLHLNLLHQGVQFADQWNADQPHSPMLKSSGTYYDELLVYIEKYINYCTKTYHKGLNHLKESEKITWD
AYNTYRREMTLIVLDLVATFPFYDIRRFPRGVELELTREVYTSLDHLTRPPGLFTWLSDIELYTESVAEGDYLSGIRESK
YYTGNQFFTMKNIYGNTNRLSKQLITLLPGEFMTHLSINRPFQTIAGINKLYSLIQKIVFTTFKNDNEYQKNFNVNNQNE
PQETTNYPNDYGGSNSQKFKHNLSHFPLIIHKLEFAEYFHSIFALGWTHNSVNSQNLISESVSTQIPLVKAYEVTNNSVI
RGPGFTGGDLIELRDKCSIKCKASSLKKYAISLFYAANNAIAVSIDVGDSGAGVLLQPTFSRKGNNNFTIQDLNYKDFQY
HTLLVDIELPESEEIHIHLKREDDYEEGVILLIDKLEFKPIDENYTNEMNLEKAKKAVNVLFINATNALKMDVTDYHIDQ
VANLVECISDDLYAKEKIKFTPCIKFAKQLSQARNLLSDPNFNNLNAENSWTANTGVTIIEGDPLYKGRAIQLSAARDEN
FPTYLYQKIDESLLKPYTRYQLRGFVEGSQDLELDLVRYGATDIVMNVPGDLEILSYSAPINPCEEIETRLDTTCGALDR
CKQSNYVNSAADVRPDQVNGDPHAFSFHIDTGTTDNNRNLGIWIIFKIATPDGYATFGNLELIELGPLSGEALAQVQRKE
QKWGKNTTQKREEAAKLYAAAKQTINQLFADSQGTKLRFDTEFSNILSADKLVYKIRDVYSEVLSVIPGLNYDLFMELEN
RIQNAIDLYDARNTVTNGEFRNGLANWMASSNTEVRQIQAHPCWYSLGWNAQVAQSLNVKPDHGYVLRVTAKKEGIGNGY
VTILDCANHIDTLTFSSCDSGFTTSSNELAAYVTKTLEIFPDTDQIRIEIGETRSTFYVESVDLIRMED
>Q939T0 ~~~~~~Insecticidal crystal protein Cry34Ab1~~~
MSAREVHIDVNNKTGHTLQLEDKTKLDGGRWRTSPTNVANDQIKTFVAESNGFMTGTEGTIYYSINGEAEISLYFDNPFA
GSNKYDGHSNKSQYEIITQGGSGNQSHVTYTIQTTSSRYGHKS
>Q939S9 ~~~~~~Insecticidal crystal protein Cry35Ab1~~~
MLDTNKVYEISNHANGLYAATYLSLDDSGVSLMNKNDDDIDDYNLKWFLFPIDDDQYIITSYAANNCKVWNVNNDKINVS
TYSSTNSIQKWQIKANGSSYVIQSDNGKVLTAGTGQALGLIRLTDESSNNPNQQWNLTSVQTIQLPQKPIIDTKLKDYPK
YSPTGNIDNGTSPQLMGWTLVPCIMVNDPNIDKNTQIKTTPYYILKKYQYWQRAVGSNVALRPHEKKSYTYEWGTEIDQK
TTIINTLGFQINIDSGMKFDIPEVGGGTDEIKTQLNEELKIEYSHETKIMEKYQEQSEIDNPTDQSMNSIGFLTITSLEL
YRYNGSEIRIMQIQTSDNDTYNVTSYPNHQQALLLLTNHSYEEVEEITNIPKSTLKKLKKYYF
>A9CH00 4.2.1.171~~~lhpI~~~Cis-3-hydroxy-L-proline dehydratase~~~COG1679
MSSAVSTTAAPEARSILAGAAEGKVIATTEALSFWGGVDPATGKVIDVHHPLHGICLTGGVLFMPTSRGSCTGSGVLLDL
ILTGRAPSALVFCEAEDVLTLGALVAAEMFDKALPVIRLDAETFSRFSRAAHVSIDQNTIKADGVSLAIAPPATAHLDLN
DDDRAMLVGRDGIAVRQAMRIIVAMAAQQGASALVDVTQGHIDGCIYASPANLTFAEKMADMGGKVRAPSTMNAISVDKA
NWRAQGVPEDFGDPAARLADAYVRMGCKPTFTCSPYLLDSAPSAGESIAWAESNAVIFANTVLGARTAKHPDFLDLCIAM
TGRAPLSGVYIEENRRPQRIVDVALPAGIDDAFWPLVGYLAGKAAPDCIPLLRGLGAAKPSRDDLKALCAAFGTTSASPM
LHIEGATPEAGLAPLETAETVTISLQDMAAAWSLLNDGPEEVQLVAIGSPHASLEECRALAAVFAGRKRRADVAVIVTAG
QQVIDAAGKDGTLQSLKDSGVQVLPDLCWCSISEPVFPTKTRALMTNSGKYAHYGPGLSGRAVRFGSLADCVESALTGRA
VSRLPVWLS
>Q9I485 4.2.1.171~~~lhpI~~~Cis-3-hydroxy-L-proline dehydratase~~~
MKHAHLIVPRTLVAGSASGELLYAPTGLSFWGGVDPRSAEVIDRHHPLSGRHLHGRLLAIPGGRGSCTGSSVLLELILGG
RAPAAILLREPDEILALGAIVAEELFGRSLPIACLGERFDELAAYPWARLADGRLELHRDAPPPLEARPAEALATDAGPR
LDAFDQALLAGEHGEAARLAMRIVLRMAALQGAQRLIDIQRAHIDACIYTGPAGLRFAETLRDLGARVRVPTTLNAISVD
QRRWREQGVPAALGEPAAALARAYLDMGAQPSFTCAPYLLDDSARAGEQIVWAESNAVLFANSVLGARTNKYADFMDICC
ALTGRAPLAGCHLDEQRQARVLIEVEDLGSVDDAFYPTLGYLCGLLCDGQIPAIDGLRQRQPDHDALKAFGAALGTSSSV
PMFHVIGVTPEAPDLASAFGGRAPRRTLRVGRERLRDAWRELDSAGETRIDLVALGNPHFSASEFAQLAALCHGRRRHPE
VALVITSSRQVVAQAEAAGHLATLQAFGARLVTDTCWCMLDEPLVPPGARTLMTNSAKYAHYAPGLVGRQVRFAGLAGCV
EAAVGGRSPAGLPAWLSEDC
>A0NXQ8 4.2.1.171~~~~~~Cis-3-hydroxy-L-proline dehydratase~~~COG4948
MKITAINVFQVDLPLREGRYSWSNGNFVEVFDSTVVEIETDEGLKGYAECCPLGSAYLPSYALGVRSGLQELAPHLIGKD
PLNIGEINRVMDAALRGHPYAKAPIDIACWDLLGKATGQPLYTLLGGAAQDDVALYRAISQEAPEIMAKKIEGYAAEGYT
KFQLKVGGDANDDINRIHATRSVLKKSDLLVADANTGWTRHEAARVVGAVSSLDVYIEQPCLTYEESVSIRRRTALPFVL
DEVIDGPNTLVRGIAEDAMDCINLKISKVGGLTKAKLMRDLCIAHGIPMTIEDTWGGDIVTAAIAHLARSTPSEFTFSAT
DFNSYGTVDIAEGAPKRVNGRMTTSDLPGLGITPIFDVLGEPVARYS
>D7A0Y2 4.2.1.171~~~~~~Cis-3-hydroxy-L-proline dehydratase~~~COG4948
MKITGIKAWKVGLPLKEGRYNWSNGNFVEVFDSTVVAVETDAGITGYAECCPLGSAYLPAYAHGVRAGLEEIGPKVIGLD
PTDLNVLNRHMDSVLRGHPYVKAPIDIACWDILGKVSGLPVYKLLGGAAQEKVALYRAISQEAPEAMARKIAGYKAEGYT
KFQLKVGGDADQDIDRIRVTREILDATDTLVADANTGWTRAEAARIAAEVGDLDVYIEQPCPTYEECLSVRARTARPFVL
DEVIDGVGTLMKALADDAMDIINLKISKVGGLTKARLMRDICVASGTPMTIEDTWGGDIVTATIAHLARSTPEEFSFSAT
DFNSYGTVDIAKGAPKRVNGFMTASDAPGLGIEPIFEVLGEPVVVIG
>P24469 ~~~cccA~~~Cytochrome c-550~~~COG2010
MKWNPLIPFLLIAVLGIGLTFFLSVKGLDDSREIASGGESKSAEKKDANASPEEIYKANCIACHGENYEGVSGPSLKGVG
DKKDVAEIKTKIEKGGNGMPSGLVPADKLDDMAEWVSKIK
>P83170 ~~~~~~Cytochrome c-551~~~
APDWSKIPAREITVFHAGATSFEWIGSEHPGASIVKAGQPCIICHETKKGLDYTAKKLAPREPDAQAMPKTVSFPVSVQA
AVEKGTLNVRLSFKPPADSKTGSDADNELKAAIMLLDIKVPQAKLAGCWTSCHKDMRGMPGGDAKKGKYVSAGSFELLQW
KSGKTSAALPAGVKVESGKDGDRTTVTFSRKLGGAVVEGRSVPFGIAIHANRAAGRMHYVSLGYRLGIGGAPGEVQAAKQ
>P24037 ~~~nirB~~~Cytochrome c-552~~~
MKKTLMASAVGAVIAFGTHGAMAAAPADWSSVAATDVTLFYPGVSPVEWITKGTEHGGARALKKGETCAGCHSEEASDMG
EKMASGKKLEPSPIAGKAPFINAKVQAANDGENLYLRFTWKQPAASGAAPMDADNPVKIAYMLEGGSKVELAEAGGCWGS
CHGDARTMPGAADTKTKYVKDGSLANGVYYDLNQWRSGENKAFDGYVATERVMEGGQALVDAQGKLDGDTWTVVFTRKFA
GGEGDVTLAPGNLYNFGFAIHDDSATGRYHHVSLGYSLGIDAQGDITAAKQ
>P29967 ~~~cycB~~~Cytochrome c-553I~~~
MTSKTTASLLAICVACAASAIAGTALCADRRNAPAQAGAGAAAAVSGDAHEQPAAEAPAEEEEETPAVAATDGKLVLPNG
QDITPDHMENGRWYTAEDIPTYKIAEEGAVDWATFSGYRRYSAECHVCHGPDGEGSTYAPALRKSVLTMGYYDFLEIAAS
GKQEVNTAANLVMPAFGTNKNVWCYIDDIYAYLLARGTGDLPRGRPAKREDKSDEFVAQEDSCMSG
>Q3J2P2 ~~~cycF~~~Cytochrome c-554~~~COG3909
MRPIPALALTFSLVAMPALAQDARQIERMIEGRHGLMTLMAHELGKLGGMAKEETPYDAEVAGKAASNLSALASVISPEL
FPKGSAVGEAEDSEALPAIWEKPDDFAQKISGMEEAAAKMQAAAGTDLASLQGAMRDLGAACGSCHETYRQKD
>P0C0X7 ~~~cycF~~~Cytochrome c-554~~~
QDARQIERMIEGRHGLMTLMAYELGKLGGMAKEETPYDAEVAGDAASNLSALASVLSPELFPKGSAVGEAEDSEALPAIW
EKPDDFAQKISDMEEAAAKMQAAAGTDLASLQGAMRDLGAGCGSCHETYRQKD
>P33325 ~~~puf2C~~~Cytochrome c-554~~~
MQSSRPSDRQLAIVVSVAVGIVVAVITTATFWWVYDLTLGRAQREAAQTAGARWSPSDGIKVITSSPPVTPTDGRQNWMG
TQAWNEGVQAGQAWIQQYPNTVNVQVLIGMSSAQIWTYMQQYVSGALGVGCQYCHNINNFASDEYPQKIAARNMLRLVRD
VNAEFIVNLPNWQGNYVQCATCHNNAPNNLEGFGAQFINSVPPIKVTVDPLDANGMAILDPAQKPEAIREPVLLKDAILF
YIYNYQVWKPFDPNDPESGRGSLALTYDGGRTQDQVTINQNVMNYQAWSLGVGCTFCHNSRNFVAYELNPAGDNVLNPLY
AYNKLKAQRMLLLTTWLAENWPRYGAIAKPEIPTGSGAASRYSYQRLGDGQIYNVPGCYTCHQGNNIPLASINQANIPSG
DAGIVVLPPQIRGR
>P00105 ~~~~~~Cytochrome c-554(548)~~~
AGDAAAGEDKIGTCVACHGTDGQGLAPIYPNLTGQSATYLESSIKAYRDGQRKGGNAALMTPMAQGLSDEDIADIAAYYS
SQE
>P25938 ~~~~~~Cytochrome c-554(547)~~~
AGDAAAGKTLYDASCASCHGMQAQGQGMFPKLAGLTSERIKTTLVAFKSGDTATLKKEGLGGPMSAIMAPNAAGLSEQDM
DNLSAYIATLK
>D5QVH0 ~~~~~~Cytochrome c-554~~~
MKSISMLTLAASVAFAVTAGQAVAAGDPAAGEKVFNKCKACHQVGETAKNAVAPELNGIDGRKSASAEGYNYSEPFKALG
ITWDEAQFKEFIKNPKAKVPGTKMIFPGLSSENDQANVWAYLSQFGADGKKK
>Q57142 ~~~cycA1~~~Cytochrome c-554~~~COG0737
MKIMIACGLVAAALFTLTSGQSLAADAPFEGRKKCSSCHKAQAQSWKDTAHAKAMESLKPNVKKEAKQKAKLDPAKDYTQ
DKDCVGCHVDGFGQKGGYTIESPKPMLTGVGCESCHGPGRNFRGDHRKSGQAFEKSGKKTPRKDLAKKGQDFHFEERCSA
CHLNYEGSPWKGAKAPYTPFTPEVDAKYTFKFDEMVKEVKAMHEHYKLEGVFEGEPKFKFHDEFQASAKPAKKGK
>Q45234 ~~~cycC~~~Cytochrome c-555~~~COG3909
MKRTMIVVTTLLLGAGAVMAQQEVAVQQDNLMRSQARSLYTVILKMTKGDIPYDQKAADEAIANLETDVAKIAKTFEVNP
KQDVVNATYGASPKVWKNKADFDSKIPPVQKAIAQVKGKITDVASLKAAYTAINDRCTDCHETYRLKLK
>Q8KG93 ~~~~~~Cytochrome c-555~~~COG3245
MSRFVSAALVGAALLVSGNAFAYDAAAGKATYDASCATCHKTGMMGAPKVGDKAAWAPRIAQGMNTLVSKSIKGYKGTKG
MMPAKGGNAKLTDAQVGNAVAYMVGQSK
>P00123 ~~~~~~Cytochrome c-555~~~
YDAAAGKATYDASCAMCHKTGMMGAPKVGDKAAWAPHIAKGMNVMVANSIKGYKGTKGMMPAKGGNPKLTDAQVGNAVAY
MVGQSK
>P04369 ~~~~~~Cytochrome c-555~~~COG2010
MDHKKTSIRTTALAALVLGAVAAPAFSAPVDQATYNGFKIYKQQRCETCHGATGEGSAAFPNLLNSLKNLSKDQFKEVVL
KGRNAMPPFEANKKVAEGIDDLYTYIKGRSDGTVPAGELEKPQ
>P00124 ~~~~~~Cytochrome c-555~~~
AVTKADVEQYDLANGKTVYDANCASCHAAGIMGAPKTGTARKWNSRLPQGLATMIEKSVAGYEGEYRGSKTFMPAKGGNP
DLTDKQVGDAVAYMVNEVL
>P00141 ~~~~~~Cytochrome c-556~~~
ADGGTHDARIALMKKIGGATGALGAIAKGEKPYDAEIVKASLTTIAETAKAFPDQFNPKDSTDAEVNPKIWDNLDDFKAK
AAKLSTDAETALAQLPADQAGVGNTLKTLGGNCGACHQAYRIKKD
>P00139 ~~~~~~Cytochrome c-556~~~
AGEVEKREGMMKQIGGAMGSLAAISKGEKPFDADTVKAAVTTIGTNAKAFPEQFPAGTETGSAAAPAIWENFEDFKAKAA
KLGTDADIVLANLPGDQAGVATAMKTLGADCGTCHQTYRLKK
>P00140 ~~~~~~Cytochrome c-556~~~
AGEVEKREGMMKQIGGSMGALAAISKGQKPYDAEAVKAAVTTISTNAKAFPDQFPPGSETGSAAAPAIWENFDDFKSKAA
KLGADADKVLASLPADQAGVTAAMQTLGADCGACHQTYRLKK
>P00150 ~~~~~~Cytochrome c-556~~~COG3909
MLRTVIVAGALVLTASAVMAQQDLVDKTQKLMKDNGRNMMVLGAIAKGEKPYDQAAVDAALKQFDETAKDLPKLFPDSVK
GLKPFDSKYSSSPKIWAERAKFDTEIADFAKAVDGAKGKIKDVDTLKAAMQPIGKACGNCHENFRDKEG
>P0ABE5 1.10.3.17~~~cybB~~~Superoxide oxidase CybB~~~COG3038
MENKYSRLQISIHWLVFLLVIAAYCAMEFRGFFPRSDRPLINMIHVSCGISILVLMVVRLLLRLKYPTPPIIPKPKPMMT
GLAHLGHLVIYLLFIALPVIGLVMMYNRGNPWFAFGLTMPYASEANFERVDSLKSWHETLANLGYFVIGLHAAAALAHHY
FWKDNTLLRMMPRKRS
>P16670 ~~~~~~Cytochrome b562~~~COG3038
MTQEPGYTRLQITLHWAIAGLVLFNYIFGETMERAYDAVRQNVEPAGVGHYLHVVVGLAVLVLTLVRIGARFVLGVPEKG
TTPGDKVAAGLQGLLYLLTLLVPALGMTAWGGGQAWAAGPHVLAANAIMLLALVHAVSALFHQYVLKDRLLLRMMRPR
>P0ABE7 ~~~cybC~~~Soluble cytochrome b562~~~COG3783
MRKSLLAILAVSSLVFSSASFAADLEDNMETLNDNLKVIEKADNAAQVKDALTKMRAAALDAQKATPPKLEDKSPDSPEM
KDFRHGFDILVGQIDDALKLANEGKVKEAQAAAEQLKTTRNAYHQKYR
>P63727 ~~~cybC~~~Soluble cytochrome b562~~~
MRKSLLAILAVSSLVFGSAVFAADLEDNMDILNDNLKVVEKTDSAPELKAALTKMRAAALDAQKATPPKLEDKAPDSPEM
KDFRHGFDILVGQIDGALKLANEGNVKEAKAAAEALKTTRNTYHKKYR
>P76345 ~~~yodB~~~Cytochrome b561 homolog 1~~~COG3038
MNRFSKTQIYLHWITLLFVAITYAAMELRGWFPKGSSTYLLMRETHYNAGIFVWVLMFSRLIIKHRYSDPSIVPPPPAWQ
MKAASLMHIMLYITFLALPLLGIALMAYSGKSWSFLGFNVSPFVTPNSEIKALIKNIHETWANIGYFLIAAHAGAALFHH
YIQKDNTLLRMMPRRK
>P75925 ~~~yceJ~~~Cytochrome b561 homolog 2~~~COG3038
MSFTNTPERYGVISAAFHWLSAIIVYGMFALGLWMVTLSYYDGWYHKAPELHKSIGILLMMGLVIRVLWRVISPPPGPLP
SYSPMTRLAARAGHLALYLLLFAIGISGYLISTADGKPISVFGWFDVPATLADAGAQADFAGALHFWLAWSVVVLSVMHG
FMALKHHFIDKDDTLKRMLGKSSSDYGV
>P58099 3.4.21.110~~~scpA~~~C5a peptidase~~~
MRKKQKLPFDKLAIALMSTSILLNAQSDIKANTVTEDTPATEQAVETPQPTAVSEEAPSSKETKTPQTPDDAEETIADDA
NDLAPQAPAKTADTPATSKATIRDLNDPSQVKTLQEKAGKGAGTVVAVIDAGFDKNHEAWRLTDKTKARYQSKEDLEKAK
KEHGITYGEWVNDKVAYYHDYSKDGKTAVDQEHGTHVSGILSGNAPSETKEPYRLEGAMPEAQLLLMRVEIVNGLADYAR
NYAQAIIDAVNLGAKVINMSFGNAALAYANLPDETKKAFDYAKSKGVSIVTSAGNDSSFGGKTRLPLADHPDYGVVGTPA
AADSTLTVASYSPDKQLTETATVKTADQQDKEMPVLSTNRFEPNKAYDYAYANRGMKEDDFKDVKGKIALIERGDIDFKD
KIANAKKAGAVGVLIYDNQDKGFPIELPNVDQMPAAFISRKDGLLLKENPQKTITFNATPKVLPTASGTKLSRFSSWGLT
ADGNIKPDIAAPGQDILSSVANNKYAKLSGTSMSAPLVAGIMGLLQKQYETQYPDMTPSERLDLAKKVLMSSATALYDED
EKAYFSPRQQGAGAVDAKKASAATMYVTDKDNTSSKVHLNNVSDKFEVTVTVHNKSDKPQELYYQATVQTDKVDGKLFAL
APKALYETSWQKITIPANSSKQVTIPIDVSQFSKDLLAPMKNGYFLEGFVRFKQDPTKEELMSIPYIGFRGDFGNLSALE
KPIYDSKDGSSYYHEANSDAKDQLDGDGLQFYALKNNFTALTTESNPWTIIKAVKEGVENIEDIESSEITETIFAGTFAK
QDDDSHYYIHRHANGKPYAAISPNGDGNRDYVQFQGTFLRNAKNLVAEVLDKEGNVVWTSEVTEQVVKNYNNDLASTLGS
TRFEKTRWDGKDKDGKVVANGTYTYRVRYTPISSGAKEQHTDFDVIVDNTTPEVATSATFSTEDRRLTLASKPKTSQPVY
RERIAYTYMDEDLPTTEYISPNEDGTFTLPEEAETMEGATVPLKMSDFTYVVEDMAGNITYTPVTKLLEGHSNKPEQDGS
DQAPDKKPETKPEQDGSGQAPDKKPETKPEQDGSGQTPDKKPETKPEQDGSGQTPDKKPETKPEKDSSGQTPGKTPQKGQ
PSRTLEKRSSKRALATKASTKDQLPTTNDKDTNRLHLLKLVMTTFFLGLVAHIFKTKRTED
>P15926 3.4.21.110~~~scpA~~~C5a peptidase~~~COG1404
MRKKQKLPFDKLAIALMSTSILLNAQSDIKANTVTEDTPVTEQAVETPQPTAVSEEVPSSKETKTPQTPDDAEETIADDA
NDLAPQAPAKTADTPATSKATIRDLNDPSQVKTLQEKAGKGAGTVVAVIDAGFDKNHEAWRLTDKTKARYQSKEDLEKAK
KEHGITYGEWVNDKVAYYHDYSKDGKTAVDQEHGTHVSGILSGNAPSETKEPYRLEGAMPEAQLLLMRVEIVNGLADYAR
NYAQAIRDAVNLGAKVINMSFGNAALAYANLPDETKKAFDYAKSKGVSIVTSAGNDSSFGGKTRLPLADHPDYGVVGTPA
AADSTLTVASYSPDKQLTETAMVKTDDQQDKEMPVLSTNRFEPNKAYDYAYANRGMKEDDFKDVKGKIALIERGDIDFKD
KVANAKKAGAVGVLIYDNQDKGFPIELPNVDQMPAAFISRKDGLLLKDNPQKTITFNATPKVLPTASGTKLSRFSSWGLT
ADGNIKPDIAAPGQDILSSVANNKYAKLSGTSMSAPLVAGIMGLLQKQYETQYPDMTPSERLDLAKKVLMSSATALYDED
EKAYFSPRQQGAGAVDAKKASAATMYVTDKDNTSSKVHLNNVSDKFEVTVTVHNKSDKPQELYYQATVQTDKVDGKHFAL
APKVLYEASWQKITIPANSSKQVTVPIDASRFSKDLLAQMKNGYFLEGFVRFKQDPTKEELMSIPYIGFRGDFGNLSAVE
KPIYDSKDGSSYYHEANSDAKDQLDGDGLQFYALKNNFTALTTESNPWTIIKAVKEGVENIEDIESSEITETIFAGTFAK
QDDDSHYYIHRHANGEPYAAISPNGDGNRDYVQFQGTFLRNAKNLVAEVLDKEGNVVWTSEVTEQVVKNYNNDLASTLGS
TRFEKTRWDGKDKDGKVVANGTYTYRVRYTPISSGAKEQHTDFDVIVDNTTPEVATSATFSTEDRRLTLASKPKTSQPVY
RERIAYTYMDEDLPTTEYISPNEDGTFTLPEEAETMEGATVPLKMSDFTYVVEDMAGNITYTPVTKLLEGHSNKPEQDGS
GQTPDKKPEAKPEQDGSDQAPDKKPEAKPEQDGSGQTPDKKPETKPEKDSSGQTPGKTPQKGQPSRTLEKRSSKRALATK
ASTRDQLPTTNDKDTNRLHLLKLVMTTFFFGLVAHIFKTKRQKETKK
>Q8RN04 1.14.-.-~~~~~~Cytochrome P450 165B3~~~COG2124
MSEDDPRPLHIRRQGLDPADELLAAGALTRVTIGSGADAETHWMATAHAVVRQVMGDHQQFSTRRRWDPRDEIGGKGIFR
PRELVGNLMDYDPPEHTRLRRKLTPGFTLRKMQRMAPYIEQIVNDRLDEMERAGSPADLIAFVADKVPGAVLCELVGVPR
DDRDMFMKLCHGHLDASLSQKRRAALGDKFSRYLLAMIARERKEPGEGMIGAVVAEYGDDATDEELRGFCVQVMLAGDDN
ISGMIGLGVLAMLRHPEQIDAFRGDEQSAQRAVDELIRYLTVPYSPTPRIAREDLTLAGQEIKKGDSVICSLPAANRDPA
LAPDVDRLDVTREPIPHVAFGHGVHHCLGAALARLELRTVFTELWRRFPALRLADPAQDTEFRLTTPAYGLTELMVAW
>Q8RN03 1.14.-.-~~~~~~Cytochrome P450 165C4~~~COG2124
MGHDIDQVAPLLREPANFQLRTNCDPHEDNFGLRAHGPLVRIVGESSTQLGRDFVWQAHGYEVVRRILGDHEHFTTRPQF
TQSKSGAHVEAQFVGQISTYDPPEHTRLRKMLTPEFTVRRIRRMEPAIQSLIDDRLDLLEAEGPSADLQGLFADPVGAHA
LCELLGIPRDDQREFVRRIRRNADLSRGLKARAADSAAFNRYLDNLLARQRADPDDGLLGMIVRDHGDNVTDEELKGLCT
ALILGGVETVAGMIGFGVLALLDNPGQIELLFESPEKAERVVNELVRYLSPVQAPNPRLAIKDVVIDGQLIKAGDYVLCS
ILMANRDEALTPDPDVLDANRAAVSDVGFGHGIHYCVGAALARSMLRMAYQTLWRRFPGLRLAVPIEEVKYRSAFVDCPD
QVPVTW
>Q59I44 1.3.1.103~~~~~~2-haloacrylate reductase~~~
MVMAAVIHKKGGPDNFVWEEVKVGSPGPGQVRLRNTAIGVNFLDTYHRAGIPHPLVVGEPPIVVGFEAAAVVEEVGPGVT
DFTVGERVCTCLPPLGAYSQERLYPAEKLIKVPKDLDLDDVHLAGLMLKGMTAQYLLHQTHKVKPGDYVLIHAAAGGMGH
IMVPWARHLGATVIGTVSTEEKAETARKLGCHHTINYSTQDFAEVVREITGGKGVDVVYDSIGKDTLQKSLDCLRPRGMC
AAYGHASGVADPIRVVEDLGVRGSLFITRPALWHYMSNRSEIDEGSKCLFDAVKAGVLHSSVAKTFPLREAAAAHKYMGG
RQTIGSIVLLPQA
>P59807 4.2.2.20~~~~~~Chondroitin sulfate ABC endolyase~~~
MPIFRFTALAMTLGLLSAPYNAMAATSNPAFDPKNLMQSEIYHFAQNNPLADFSSDKNSILTLSDKRSIMGNQSLLWKWK
GGSSFTLHKKLIVPTDKEASKAWGRSSTPVFSFWLYNEKPIDGYLTIDFGEKLISTSEAQAGFKVKLDFTGWRAVGVSLN
NDLENREMTLNATNTSSDGTQDSIGRSLGAKVDSIRFKAPSNVSQGEIYIDRIMFSVDDARYQWSDYQVKTRLSEPEIQF
HNVKPQLPVTPENLAAIDLIRQRLINEFVGGEKETNLALEENISKLKSDFDALNIHTLANGGTQGRHLITDKQIIIYQPE
NLNSQDKQLFDNYVILGNYTTLMFNISRAYVLEKDPTQKAQLKQMYLLMTKHLLDQGFVKGSALVTTHHWGYSSRWWYIS
TLLMSDALKEANLQTQVYDSLLWYSREFKSSFDMKVSADSSDLDYFNTLSRQHLALLLLEPDDQKRINLVNTFSHYITGA
LTQVPPGGKDGLRPDGTAWRHEGNYPGYSFPAFKNASQLIYLLRDTPFSVGESGWNNLKKAMVSAWIYSNPEVGLPLAGR
HPFNSPSLKSVAQGYYWLAMSAKSSPDKTLASIYLAISDKTQNESTAIFGETITPASLPQGFYAFNGGAFGIHRWQDKMV
TLKAYNTNVWSSEIYNKDNRYGRYQSHGVAQIVSNGSQLSQGYQQEGWDWNRMQGATTIHLPLKDLDSPKPHTLMQRGER
GFSGTSSLEGQYGMMAFDLIYPANLERFDPNFTAKKSVLAADNHLIFIGSNINSSDKNKNVETTLFQHAITPTLNTLWIN
GQKIENMPYQTTLQQGDWLIDSNGNGYLITQAEKVNVSRQHQVSAENKNRQPTEGNFSSAWIDHSTRPKDASYEYMVFLD
ATPEKMGEMAQKFRENNGLYQVLRKDKDVHIILDKLSNVTGYAFYQPASIEDKWIKKVNKPAIVMTHRQKDTLIVSAVTP
DLNMTRQKAATPVTINVTINGKWQSADKNSEVKYQVSGDNTELTFTSYFGIPQEIKLSPLP
>C5G6D7 4.2.2.21~~~chonabc~~~Chondroitin sulfate ABC exolyase~~~
MLILSFLCPAFLNAQIVTDERMFSFEEPQLPACITGVQSQLGISGAHYKDGKHSLEWTFEPNGRLELRKDLKFEKKDPTG
KDLYLSAFIVWIYNEQPQDAAIEFEFLKDGRKCASFPFGINFKGWRAAWVCYERDMQGTPEEGMNELRIVAPDAKGRLFI
DHLITATKVDARQQTADLQVPFVNAGTTNHWLVLYKHSLLKPDIELTPVSDKQRQEMKLLEKRFRDMIYTKGKVTEKEAE
TIRKKYDLYQITYKDGQVSGVPVFMVRASEAYERMIPDWDKDMLTKMGIEMRAYFDLMKRIAVAYNNSEAGSPIRKEMRR
KFLAMYDHITDQGVAYGSCWGNIHHYGYSVRGLYPAYFLMKDVLREEGKLLEAERTLRWYAITNEVYPKPEGNGIDMDSF
NTQTTGRIASILMMEDTPEKLQYLKSFSRWIDYGCRPAPGLAGSFKVDGGAFHHRNNYPAYAVGGLDGATNMIYLFSRTS
LAVSELAHRTVKDVLLAMRFYCNKLNFPLSMSGRHPDGKGKLVPMHYAIMAIAGTPDGKGDFDKEMASAYLRLVSSDSSS
AEQAPEYMPKVSNAQERKIAKRLVENGFRAEPDPQGNLSLGYGCVSVQRRENWSAVARGHSRYLWAAEHYLGHNLYGRYL
AHGSLQILTAPPGQTVTPTTSGWQQEGFDWNRIPGVTSIHLPLDLLKANVLNVDTFSGMEEMLYSDEAFAGGLSQGKMNG
NFGMKLHEHDKYNGTHRARKSFHFIDGMIVCLGSDIENTNMDYPTETTIFQLAVTDKAAHDYWKNNAGEGKVWMDHLGTG
YYVPVAARFEKNFPQYSRMQDTGKETKGDWVSLIIDHGKAPKAGSYEYAILPGTDRKTMTAFAKKPAYSVLQQDRNAHIL
ESPSDRITSYVLFETPQSLLPGGLLQRTDTSCLVMVRKESADKVLLTVAQPDLALYRGPSDEAFDKDGKRMERSIYSRPW
IDNESGEIPVTVTLKGRWKVVETPYCKVVSEDKKQTVLRFLCKDGASYEVELEK
>Q8A2I1 4.2.2.21~~~chonabc~~~Chondroitin sulfate ABC exolyase~~~COG5492
MLILSFLCPAFLNAQIVTDERMFSFEEPQLPACITGVQSQLGISGAHYKDGKHSLEWTFEPNGKLELRKDLKFEKKDPTG
KDLYLSAFIVWIYNEQPQDAAIEFEFLKDGRKCASFPFGINFKGWRAAWVCYERDMQGTPEEGMNELRIVAPNAKGRLFI
DHLITATKVDARQQTADLQVPFVNAGTTNHWLVLYKHSLLKPDIELTPVSDRQRQEMKLLEKRFRDMIYTKGKVTEKEAE
TIRKKYDLYQITYKDGQVSGVPIFMVRASEAYERMIPDWDKDMLTKMGIEMRAYFDLMKRIAVAYNNSEAGSPVREEMKR
KFLAMYDHITDQGVAYGSCWGNIHHYGYSVRGLYPAYFLMKDVLREEGKLLEAERTLRWYAITNEVYPKPEGNGIDMDSF
NTQTTGRIASILMMEDTPEKLQYLKSFSRWIDYGCRPAPGLAGSFKVDGGAFHHRNNYPAYAVGGLDGATNMIYLFSRTS
LAVSELAHRTVKDVLLAMRFYCNKLNFPLSMSGRHPDGQGKLVPMHYAMMAIAGTPDGKGDFDKEMASAYLRLVSSDSSS
AEQAPEYMPKVSNAQERKIAKRLVENGFRAESDPQGNLSLGYGCVSVQRRENWSAVARGHSRYLWAAEHYLGHNLYGRYL
AHGSLQILTAPPGQTVTPATSGWQQEGFDWNRIPGVTSIHLPLDLLKANVLNVDTFSGMEEMLYSDEAFAGGLSQGKMNG
NFGMKLHEHDKYNGTHRARKSYHFIDGMIVCLGSDIENTNTDYPTETTIFQLAVTDKAAHDYWKNNAGEGKVWMDHLGTG
YYVPVPARFEKNFPQYSRMQDTGKETKGDWVSLIIDHGKAPKAGSYEYAILPGTDRKTMTAFAKKPAYSVLQQDRNAHIL
ESPSDRITSYVLFETPQSLLPGGLLQRTDTSCLVMVRKESADKVLLTVAQPDLALYRGPSDEAFDKDGKRMERSIYSRPW
IDNESGEIPVTVTLKGRWKVAETPFCKVVSEDKKQTVLRFLCKDGASYEVELEK
>P20021 7.2.2.21~~~cadA~~~Cadmium-transporting ATPase~~~
MSEQKVKLMEEEMNVYRVQGFTCANCAGKFEKNVKKIPGVQDAKVNFGASKIDVYGNASVEELEKAGAFENLKVSPEKLA
NQTIQRVKDDTKAHKEEKTPFYKKHSTLLFATLLIAFGYLSHFVNGEDNLVTSMLFVGSIVIGGYSLFKVGFQNLIRFDF
DMKTLMTVAVIGATIIGKWAEASIVVILFAISEALERFSMDRSRQSIRSLMDIAPKEALVRRNGQEIIIHVDDIAVGDIM
IVKPGEKIAMDGIIVNGLSAVNQAAITGESVPVSKAVDDEVFAGTLNEEGLIEVKITKYVEDTTITKIIHLVEEAQGERA
PAQAFVDKFAKYYTPIIMVIAALVAVVPPLFFGGSWDTWVYQGLAVLVVGCPCALVISTPISIVSAIGNAAKKGVLVKGG
VYLEKLGAIKTVAFDKTGTLTKGVPVVTDFEVLNDQVEEKELFSIITALEYRSQHPLASAIMKKAEQDNIPYSNVQVEEF
TSITGRGIKGIVNGTTYYIGSPKLFKELNVSDFSLGFENNVKILQNQGKTAMIIGTEKTILGVIAVADEVRETSKNVIQK
LHQLGIKQTIMLTGDNQGTANAIGTHVGVSDIQSELMPQDKLDYIKKMQSEYDNVAMIGDGVNDAPALAASTVGIAMGGA
GTDTAIETADIALMGDDLSKLPFAVRLSRKTLNIIKANITFAIGIKIIALLLVIPGWLTLWIAILSDMGATILVALNSLR
LMRVKDK
>O32219 7.2.2.12~~~cadA~~~Cadmium, zinc and cobalt-transporting ATPase~~~COG2217
MRLVKQEYVLDGLDCSNCARKIENGVKGIKGINGCAVNFAASTLTVSADGKEEQWVTNKVEKKVKSIDPHVTVRQKHIKK
SADDGYRNRMVNMLIRMAAAVILGAAAYLVQSGTIEFFLFLGAYLIIGGDIIIRAVKNIIRGQVFDEHFLMALATIGAFL
IQQYPEGVAVMLFYQIGELFQGAAVSRSRKSISALMDIRPDYANLKTKNGIEQVSSEDVQTGDIIVVNPGESIPLDGKVV
QGSAMVDTSALTGESVPRKAAEGQDVMSGFINQNGVLHIEVTKGYQESAVSKILDLVQNASSRKARTENFITKFAKYYTP
AVVIIAVLLAFVPPLVLSGAALSDWVYRALIFLVISCPCALVVSIPLGFFGGIGAASKAGVLVKGSNYLEALNQVKYAVF
DKTGTLTKGSFEVTEIKPAEGFTKDRLLEAAAYAELHSQHPIAESVRKAYGKMLSSDEIESYEEISGHGIFAKVNGTEIL
AGNKKLMEREQIEDVPDENAGTIVHVAVDQRYAGAIIIADEIKEDAAQAVADLKSLGIKQTAMLTGDSKQTGEAVGKQLG
IGEVYAELLPQDKVAQVEALEAKLLPSEKLIFVGDGINDTPVLARADIGVAMGGLGSDAAVEAADIVLMTDQPSKIAEAI
RIAKRTRRIVWQNIGFALGVKAIFLILGAFGIATMWEAVFSDVGVTLLAVANAMRVMRLKNK
>Q60048 7.2.2.21~~~cadA~~~Probable cadmium-transporting ATPase~~~
MAEKTVYRVDGLSCTNCAAKFERNVKEIEGVTEAIVNFGASKITVTGEASIQQVEQAGAFEHLKIIPEKESFTDPEHFTD
HQSFIRKNWRLLLSGLFIAVGYASQIMNGEDFYLTNALFIFAIFIGGYSLFKEGFKNLLKFEFTMETLMTIAIIGAAFIG
EWAEGSIVVILFAVSEALERYSMDKARQSIRSLMDIAPKEALVRRSGTDRMVHVDDIQIGDIMIIKPGQKIAMDGHVVKG
YSAVNQAAITGESIPVEKNIDDSVFAGTLNEEGLLEVAVTKRVEDTTISKIIHLVEEAQGERAPAQAFVDTFAKYYTPAI
IVIAALIATVPPLLFGGNWETWVYQGLSVLVVGCPCALVVSTPVAIVTAIGNAAKNGVLVKGGVYLEEIGGLKAIAFDKT
GTLTKGVPVVTDYIELTEATNIQHNKNYIIMAALEQLSQHPLASAIIKYGETREMDLTSINVNDFTSITGKGIRGTVDGN
TYYVGSPVLFKELLASQFTDSIHRQVSDLQLKGKTAMLFGTNQKLISIVAVADEVRSSSQHVIKRLHELGIEKTIMLTGD
NQATAQAIGQQVGVSEIEGELMPQDKLDYIKQLKINFGKVAMVGDGINDAPALAAATVGIAMGGAGTDTAIETADVALMG
DDLQKLPFTVKLSRKTLQIIKQNITFSLVIKLIALLLVIPGWLTLWIAIMADMGATLLVTLNGLRLMKVKD
>P0AAE8 ~~~cadB~~~Cadaverine/lysine antiporter~~~COG0531
MSSAKKIGLFACTGVVAGNMMGSGIALLPANLASIGGIAIWGWIISIIGAMSLAYVYARLATKNPQQGGPIAYAGEISPA
FGFQTGVLYYHANWIGNLAIGITAVSYLSTFFPVLNDPVPAGIACIAIVWVFTFVNMLGGTWVSRLTTIGLVLVLIPVVM
TAIVGWHWFDAATYAANWNTADTTDGHAIIKSILLCLWAFVGVESAAVSTGMVKNPKRTVPLATMLGTGLAGIVYIAATQ
VLSGMYPSSVMAASGAPFAISASTILGNWAAPLVSAFTAFACLTSLGSWMMLVGQAGVRAANDGNFPKVYGEVDSNGIPK
KGLLLAAVKMTALMILITLMNSAGGKASDLFGELTGIAVLLTMLPYFYSCVDLIRFEGVNIRNFVSLICSVLGCVFCFIA
LMGASSFELAGTFIVSLIILMFYARKMHERQSHSMDNHTASNAH
>P23890 ~~~cadC~~~Transcriptional activator CadC~~~COG3710
MQQPVVRVGEWLVTPSINQISRNGRQLTLEPRLIDLLVFFAQHSGEVLSRDELIDNVWKRSIVTNHVVTQSISELRKSLK
DNDEDSPVYIATVPKRGYKLMVPVIWYSEEEGEEIMLSSPPPIPEAVPATDSPSHSLNIQNTATPPEQSPVKSKRFTTFW
VWFFFLLSLGICVALVAFSSLDTRLPMSKSRILLNPRDIDINMVNKSCNSWSSPYQLSYAIGVGDLVATSLNTFSTFMVH
DKINYNIDEPSSSGKTLSIAFVNQRQYRAQQCFMSIKLVDNADGSTMLDKRYVITNGNQLAIQNDLLESLSKALNQPWPQ
RMQETLQKILPHRGALLTNFYQAHDYLLHGDDKSLNRASELLGEIVQSSPEFTYARAEKALVDIVRHSQHPLDEKQLAAL
NTEIDNIVTLPELNNLSIIYQIKAVSALVKGKTDESYQAINTGIDLEMSWLNYVLLGKVYEMKGMNREAADAYLTAFNLR
PGANTLYWIENGIFQTSVPYVVPYLDKFLASE
>P20047 ~~~cadC~~~Cadmium resistance transcriptional regulatory protein CadC~~~
MKKKDTCEIFCYDEEKVNRIQGDLQTVDISGVSQILKAIADENRAKITYALCQDEELCVCDIANILGVTIANASHHLRTL
YKQGVVNFRKEGKLALYSLGDEHIRQIMMIALAHKKEVKVNV
>O84616 1.3.3.-~~~~~~4-aminobenzoate synthase~~~
MMEVFMNFLDQLDLIIQNKHMLEHTFYVKWSKGELTKEQLQAYAKDYYLHIKAFPKYLSAIHSRCDDLEARKLLLDNLMD
EENGYPNHIDLWKQFVFALGVTPEELEAHEPSEAAKAKVATFMRWCTGDSLAAGVAALYSYESQIPRIAREKIRGLTEYF
GFSNPEDYAYFTEHEEADVRHAREEKALIEMLLKDDADKVLEASQEVTQSLYGFLDSFLDPGTCCSCHQSY
>Q72V44 3.5.4.46~~~add~~~Cyclic adenylate deaminase~~~
MALTFQEILDRIRIIDRDVTELNRLKSRLPADRPYSSSLQISFDKQINELLNERVGLMELEVLDPPSWILGVPTTGISQE
TPVPLKGLFPSGDLSKEKPDDQDVINFLRELPKTEIHLHLEACVNKDTMKRLMAKNGINVTDEEFEAKFNFKDLNSFIQV
FFFIQSLVKEPSDFSFFIESLAEYMRANNILYTEVFFAPSKFIQNGLDFEEMIDFLVNRIREEKENDGIVIRLLVDVSRS
FGPENAMKNLDRVLKLRHPEVIGIGLGGAELMGPARDYQGVFQKAREAGLRVVAHSGEDDGPWAIWEAVELLKAERIGHG
TSAIQDPELVKYLRENHIPIEICVTSNVFTGKYVRKEQNHPVRYYYDQGLPLSINTDDPEIFNVNLTYEYYKLWRFLDFS
LDEIVDLIRQGVFASFHPNKESLWAEMEKNIHLVKTRYGLKR
>P54721 1.13.11.2~~~catE~~~Catechol-2,3-dioxygenase~~~COG2514
MTSIHEDTHIGYAKLTIRSLERSLQFYCNVIGFQVLKKTDRQAELTADGKRVLLILEENPSAVVLPERSVTGLYHFAILL
PDRKELGIALARLIEHGIAIGHGDHAVSEALYLSDPDGNGIEMYADRPRSTWQRDREGNYVMTTTAVDIEGLLEEAGDER
KTSLPNDTIIGHIHLHVSDLKEAKAFYTDVLGFDIVGNYAGMSALFVSAGGYHHHIGLNIWAGRNAPPKPTNASGLDYYT
VVLPHQEELDLVANRVKHAGYSIEETENSFRVKDPVSGAYITFVI
>P0A5N7 ~~~cadI~~~Cadmium-induced protein CadI~~~
MSRVQLALNVDDLEAAITFYSRLFNAEPAKRKPGYANFAIADPPLKLVLLENPGTGGTLNHLGVEVGSSNTVHAEIARLT
EAGLVTEKEIGTTCCFATQDKVWVTGPGGERWEVYTVLADSETFGSGPRHNDTSDGEASMCCDGQVAVGASG
>P9WIR5 ~~~cadI~~~Cadmium-induced protein CadI~~~COG0346
MSRVQLALNVDDLEAAITFYSRLFNAEPAKRKPGYANFAIADPPLKLVLLENPGTGGTLNHLGVEVGSSNTVHAEIARLT
EAGLVTEKEIGTTCCFATQDKVWVTGPGGERWEVYTVLADSETFGSGPRHNDTSDGEASMCCDGQVAVGASG
>P9WHR5 3.1.1.-~~~caeB~~~Carboxylesterase B~~~COG0596
MAAMWRRRPLSSALLSFGLLLGGLPLAAPPLAGATEEPGAGQTPGAPVVAPQQSWNSCREFIADTSEIRTARCATVSVPV
DYDQPGGTQAKLAVIRVPATGQRFGALLVNPGGPGASAVDMVAAMAPAIADTDILRHFDLVGFDPRGVGHSTPALRCRTD
AEFDAYRRDPMADYSPAGVTHVEQVYRQLAQDCVDRMGFSFLANIGTASVARDMDMVRQALGDDQINYLGYSYGTELGTA
YLERFGTHVRAMVLDGAIDPAVSPIEESISQMAGFQTAFNDYAADCARSPACPLGTDSAQWVNRYHALVDPLVQKPGKTS
DPRGLSYADATTGTINALYSPQRWKYLTSGLLGLQRGSDAGDLLVLADDYDGRDADGHYSNDQDAFNAVRCVDAPTPADP
AAWVAADQRIRQVAPFLSYGQFTGSAPRDLCALWPVPATSTPHPAAPAGAGKVVVVSTTHDPATPYQSGVDLARQLGAPL
ITFDGTQHTAVFDGNQCVDSAVMHYFLDGTLPPTSLRCAP
>P26949 ~~~caf1A~~~F1 capsule-anchoring protein~~~
MRYSKLFLCAGLTLATLPCWGRAYTFDSTMLDTNSGESIDVSLFNQGLQLPGNYFVNVFVNGRKVDSGNIDFRLEKHNGK
ELLWPCLSSLQLTKYGIDIDKYPDLIKSGTEQCVDLLAIPHSDVQFYFNQQKLSLIVPPQALLPRFDGIMPMQLWDDGIP
ALFMNYNTNMQTRKFREGGKSLDSYYAQLQPGLNIGAWRFRSSTSWWKQQGWQRSYIYAERGLNTIKSRLTLGETYSDSS
IFDSIPIKGIKIASDESMVPYYQWNFAPVVRGIARTQARVEVLRDGYTVSNELVPSGPFELANLPLGGGSGELKVIIHES
DGTKQVFTVPYDTPAVALRKGYFEYSMMGGEYRPANDLTQTSYVGALGMKYGLPRNLTLYGGLQGSQNYHAAALGIGAML
GDFGAISTDVTQADSQKNKQKKESGQRWRVRYNKYLQSGTSLNIASEEYATEGFNKLADTLNTYCKPNTRNDCRFDYAKP
KNKVQFNLSQSIPGSGTLNFSGYRKNYWRDSRSTTSFSVGYNHFFRNGMSLTLNLSKTQNINKYGEKTSELLSNIWLSFP
LSRWLGNNSINSNYQMTSDSHGNTTHEVGVYGEAFDRQLYWDVRERFNEKGRKYTSNALNLNYRGTYGEISGNYSYDQTQ
SQLGIGVNGNMVITQYGITAGQKTGDTIALVQAPDISGASVGYWPGMKTDFRGYTNYGYLTPYRENKVEINPVTLPNDAE
ITNNIVSVIPTKGAVVLAKFNARIGGRLFLHLKRSDNKPVPFGSIVTIEGQSSSSGIVGDNSGVYLTGLPKKSKILVKWG
RDKNQSCSSNVVLPEKTDISGAYRLSTTCILNN
>P26926 ~~~caf1M~~~Chaperone protein caf1M~~~
MILNRLSTLGIITFGMLSFAANSAQPDIKFASKEYGVTIGESRIIYPLDAAGVMVSVKNTQDYPVLIQSRIYDENKEKES
EDPFVVTPPLFRLDAKQQNSLRIAQAGGVFPRDKESLKWLCVKGIPPKDEDIWVDDATNKQKFNPDKDVGVFVQFAINNC
IKLLVRPNELKGTPIQFAENLSWKVDGGKLIAENPSPFYMNIGELTFGGKSIPSHYIPPKSTWAFDLPKGLAGARNVSWR
IINDQGGLDRLYSKNVTL
>P26948 ~~~caf1~~~F1 capsule antigen~~~
MKKISSVIAIALFGTIATANAADLTASTTATATLVEPARITLTYKEGAPITIMDNGNIDTELLVGTLTLGGYKTGTTSTS
VNFTDAAGDPMYLTFTSQDGNNHQFTTKVIGKDSRDFDISPKVNGENLVGDDVVLATGSQDFFVRSIGSKGGKLAAGKYT
DAVTVTVSNQ
>P55980 ~~~cagA~~~Cytotoxicity-associated immunodominant antigen~~~COG1842
MTNETIDQTRTPDQTQSQTAFDPQQFINNLQVAFIKVDNVVASFDPDQKPIVDKNDRDNRQAFDGISQLREEYSNKAIKN
PTKKNQYFSDFIDKSNDLINKDNLIDVESSTKSFQKFGDQRYQIFTSWVSHQKDPSKINTRSIRNFMENIIQPPIPDDKE
KAEFLKSAKQSFAGIIIGNQIRTDQKFMGVFDESLKERQEAEKNGGPTGGDWLDIFLSFIFNKKQSSDVKEAINQEPVPH
VQPDIATTTTDIQGLPPEARDLLDERGNFSKFTLGDMEMLDVEGVADIDPNYKFNQLLIHNNALSSVLMGSHNGIEPEKV
SLLYAGNGGFGDKHDWNATVGYKDQQGNNVATLINVHMKNGSGLVIAGGEKGINNPSFYLYKEDQLTGSQRALSQEEIRN
KVDFMEFLAQNNTKLDNLSEKEKEKFQNEIEDFQKDSKAYLDALGNDRIAFVSKKDTKHSALITEFNNGDLSYTLKDYGK
KADKALDREKNVTLQGSLKHDGVMFVDYSNFKYTNASKNPNKGVGATNGVSHLEAGFNKVAVFNLPDLNNLAITSFVRRN
LENKLTAKGLSLQEANKLIKDFLSSNKELAGKALNFNKAVAEAKSTGNYDEVKKAQKDLEKSLRKREHLEKEVEKKLESK
SGNKNKMEAKAQANSQKDEIFALINKEANRDARAIAYTQNLKGIKRELSDKLEKISKDLKDFSKSFDEFKNGKNKDFSKA
EETLKALKGSVKDLGINPEWISKVENLNAALNEFKNGKNKDFSKVTQAKSDLENSVKDVIINQKVTDKVDNLNQAVSVAK
AMGDFSRVEQVLADLKNFSKEQLAQQAQKNEDFNTGKNSELYQSVKNSVNKTLVGNGLSGIEATALAKNFSDIKKELNEK
FKNFNNNNNGLKNSTEPIYAKVNKKKTGQVASPEEPIYTQVAKKVNAKIDRLNQIASGLGGVGQAAGFPLKRHDKVDDLS
KVGLSASPEPIYATIDDLGGPFPLKRHDKVDDLSKVGRSRNQELAQKIDNLNQAVSEAKAGFFGNLEQTIDKLKDSTKKN
VMNLYVESAKKVPASLSAKLDNYAINSHTRINSNIQNGAINEKATGMLTQKNPEWLKLVNDKIVAHNVGSVSLSEYDKIG
FNQKNMKDYSDSFKFSTKLNNAVKDIKSGFTHFLANAFSTGYYCLARENAEHGIKNVNTKGGFQKS
>Q06110 ~~~cagA~~~Antitumor antibiotic C-1027 apoprotein~~~
MSLRHMSRRASRFGVVAVASIGLAAAAQSVAFAAPAFSVSPASGLSDGQSVSVSVSGAAAGETYYIAQCAPVGGQDACNP
ATATSFTTDASGAASFSFVVRKSYTGSTPEGTPVGSVDCATAACNLGAGNSGLDLGHVALTFG
>Q48252 7.4.2.8~~~cagE~~~Type IV secretion system protein CagE~~~COG3451
MFVASKQADEQKKLVIEQEVQKRQFKKIEELKADMQKGVNPFFKVLFDGGNRLFGFPETFIYSSIFILFVTIVLSVILFQ
AYEPVLIVAIVIVLVALGFKKDYRLYQRMERAMKFKKPFLFKGVKNKAFMSIFSMKPSKEMANDIHLNPNREDRLVSAAN
SYLANNYECFLDDGVILTNNYSLLGTIKLGGIDFLTTSKKDLIELHASIYSVFRNFVTPEFKFYFHTVKKKIVIDETNRD
YSLIFSNDFMRAYNEKQKRESFYDISFYLTIEQDLLDTLNEPVMNKKHFADNNFEEFQRIIRAKLENFKDRIELIEELLS
KYHPIRLKEYTKDGVIYSKQCEFYNFLVGMNEAPFICNRKDLYLKEKMHGGVKEVYFANKHGKILNDDLSEKYFSAIEIS
EYAPKSQSDLFDKINALDSEFIFMHAYSPKNSQVLKDKLAFTSRRIIISGGSKEQGMTLGCLSELVGNGDITLGSYGNSL
VLFADSFEKMKQSVKECVSSLNAKGFLANAATFSMENYFFAKHCSFITLPFIFDVTSNNFADFIAMRAMSFDGNQENNAW
GNSVMTLKSEINSPFYLNFHMPTDFGSASAGHTLILGSTGSGKTVFMSMTLNAMGQFVHNFPANVSKDKQKLTMVYMDKD
YGAYGNIVAMGGEYVKIELGTDTGLNPFAWAACVQKTNATMEQKQTAISVVKELVKNLATKSDEKDENGNSISFSLADSN
TLAAAVTNLITGDMNLDYPITQLINAFGKDHNDPNGLVARLAPFCKSTNGEFQWLFDNKATDRLDFSKTIIGVDGSSFLD
NNDVSPFICFYLFARIQEAMDGRRFVLDIDEAWKYLGDPKVAYFVRDMLKTARKRNAIVRLATQSITDLLACPIADTIRE
QCPTKIFLRNDGGNLSDYQRLANVTEKEFEIITKGLDRKILYKQDGSPSVIASFNLRGIPKEYLKILSTDTVFVKEIDKI
IQNHSIIDKYQALRQMYQQIKEY
>P97227 ~~~cagS~~~CAG pathogenicity island protein 13~~~
MSNNMRKLFSMIADSKDKKEKLIESLQENELLSTDEKKKIIDQIKTMHDFFKQMHTNKGALDKVLRNYMKDYRAVIKSIG
VDKFKKVYRLLESETMELLHAIAENPNFLFSKFDRSILGIFLPFFSKPIMFKMSIREMDSQIELYGTKLPLLKLFVMTDE
EMNFYANLKTIEQYNDYVRDLLMKFDLEKYMKEKGV
>P97245 ~~~cagT~~~CAG pathogenicity island protein 12~~~
MKLRASVLIGATILCLILSACSNYAKKVVKQKNHVYTPVYNELIEKYSEIPLNDKLKDTPFMVQVKLPNYKDYLLDNKQV
VLTFKLVHHSKKITLIGDANKILQYKNYFQANGARSDIDFYLQPTLNQKGVVMIASNYNDNPNSKEKPQTFDVLQGSQPM
LGANTKNLHGYDVSGANNKQVINEVAREKAQLEKINQYYKTLLQDKEQEYTTRKNNQREILETLSNRAGYQMRQNVISSE
IFKNGNLNMQAKEEEVREKLQEERENEYLRNQIRSLLSGK
>Q5KW03 4.2.1.1~~~~~~Carbonic anhydrase~~~COG0663
MIYPYKGKTPQIAASAFIADYVTITGDVVIGEETSIWFNTVIRGDVAPTVIGNRVNIQDNSILHQSPNNPLIIEDGVTVG
HQVILHSAIVRKNALIGMGSIILDRAEIGEGAFIGAGSLVPPGKKIPPNTLALGRPAKVVRELTEDDIREMERIRREYVE
KGQYYKALQQQRTSCADKKELP
>A8IKD2 3.5.2.15~~~~~~Cyanuric acid amidohydrolase~~~
MPIAKVHRIATASPDDVSGLAAAIATGAIAPAGILAIFGKTEGNGCVNDFSRGFAVQSLQMLLRGHMGAAADEVCLVMSG
GTEGGMSPHFLVFERAEGNAPEAAPALAIGRAHTPDLPFEALGRMGQVRMVAQAVRRAMAAAGITDPEDVHFVQVKCPLL
TAMRVKEAEARGATTATSDTLKSMGLSRGASALGIALALGEVAEDALSDAVICADYGLWSARASCSSGIELLGHEIVVLG
MSEGWSGPLAIAHGVMADAIDVTPVKAALSALGAEAGEATIVLAKAEPSRSGRIRGKRHTMLDDSDISPTRHARAFVAGA
LAGVVGHTEIYVSGGGEHQGPDGGGPVAVIAARTMG
>P94388 3.1.1.41~~~cah~~~Cephalosporin-C deacetylase~~~COG3458
MQLFDLPLDQLQTYKPEKTAPKDFSEFWKLSLEELAKVQAEPDLQPVDYPADGVKVYRLTYKSFGNARITGWYAVPDKEG
PHPAIVKYHGYNASYDGEIHEMVNWALHGYATFGMLVRGQQSSEDTSISPHGHALGWMTKGILDKDTYYYRGVYLDAVRA
LEVISSFDEVDETRIGVTGGSQGGGLTIAAAALSDIPKAAVADYPYLSNFERAIDVALEQPYLEINSFFRRNGSPETEVQ
AMKTLSYFDIMNLADRVKVPVLMSIGLIDKVTPPSTVFAAYNHLETKKELKVYRYFGHEYIPAFQTEKLAFFKQHLKG
>Q89E06 3.5.2.15~~~~~~Cyanuric acid amidohydrolase~~~
MRTTSVGVFKIVTKGPGDVSGLMAMIGSGAIDPKSILAVLGKTEGNGGVNDFTREYAVAALCTALAPQLGLSPEEVEQRI
AFVMSGGTEGVLSPHITVFTRREVERRPAGLSGKRLSIGMAHTRDFLPEELGRAAQIAETAAAVKAAMADAGIADPADVH
FVQIKCPLLTSDRVEAASARGNKTATTSAYGSMAYSRGASALGVAVALGETGSDISDGDVLRRYDLFSKVASTSAGIELM
HNVVIVLGNSAASASEFEIGHAVMNDAIDAAAVTSALKCVGLGVAPQAEAGRELVNIFAKAEASPDGSVRGFRHTMLEDT
DISSTRHARAAVGGLIAGLAGTGAVYVSGGAEHQGPAGGGPVAVIARLSD
>H0SH23 3.5.2.15~~~~~~Cyanuric acid amidohydrolase~~~
MPTTLRRAHVHRLPMRSPDDVAALEAAITQGTIDPAGIVAILGKTEGNGCVNDFTRAFAVRSLEALLGRHLATEAVRQIA
MVMSGGTEGALSPHMIVFEAREVDEGHAPRAFAASLALGRARTPVLPSEHLGRMQQVAQVAAGVRAAMNDAGITDAGDVH
YVQVKCPLLTMERIEAAEARGVRTAVRDTLKSMGFSRGASALGVAVALGELAMDELSDTEICTDYARYSERAATSGGVEL
LDHEIMVAGMSRDWTGPLAIDHGVMRDAIDIEPARAALARLGLDVPGQLPAAARGRIAAVLAKAEAAQSGKVRDVRHTML
DDSDVSSTRHARAFVGGALAGLFGFTDLFVSGGAEHQGPDGGGPVAIIVERT
>P0A3V4 3.5.2.15~~~trzD~~~Cyanuric acid amidohydrolase~~~
MQAQVFRVPMSNPADVSGVAKLIDEGVIRAEEVVCVLGKTEGNGCVNDFTRGYTTLAFKVYFSEKLGVSRQEVGERIAFI
MSGGTEGVMAPHCTIFTVQKTDNKQKTAAEGKRLAVQQIFTREFLPEEIGRMPQVTETADAVRRAMREAGIADASDVHFV
QVKCPLLTAGRMHDAVERGHTVATEDTYESMGYSRGASALGIALALGEVEKANLSDEVITADYSLYSSVASTSAGIELMN
NEIIVMGNSRAWGGDLVIGHAEMKDAIDGAAVRQALRDVGCCENDLPTVDELGRVVNVFAKAEASPDGEVRNRRHTMLDD
SDINSTRHARAVVNAVIASIVGDPMVYVSGGSEHQGPAGGGPVAVIARTA
>B0UET5 3.5.2.15~~~~~~Cyanuric acid amidohydrolase~~~
MPRRAEILRLPMAAPDDVSAIAASLRDGRLDPGDVVAVFAKTEGNGCVNDFTRPLAVQALRGLFGPLIGEAALGRIAMVM
SGGTEGGLSPHWLVIAAREAAGPGPALAVGQARTPPLAAEDLGRRAQVEMVAAGVRAAMREAGLDAGQVHYVQVKCPLLT
SERIGAALARGAAPATRDTLKSMGLSRAAAALGAALALGEVPAAAIGETVAETDPGRHARRCGASAGVELLDHEVVVMGM
SPDWTGPLVIDHAVMADAIDLRPVAACLGRLGLLGPDGFVDEAGRARLAALLAKAEASADGAIRGRRHTMLTDSDIAPTR
HARGFVAGALAGLVGATDLFVSGGAEFQGPDGGGPVAVIARRSGAAGPA
>Q2RGM7 3.5.2.15~~~~~~Cyanuric acid amidohydrolase~~~
MQKVEVFRIPTASPDDISGLATLIDSGKINPAEIVAILGKTEGNGCVNDFTRGFATQSLAMYLAEKLGISREEVVKKVAF
IMSGGTEGVMTPHITVFVRKDVQEPAKPGKRLAVGVAFTRDFLPEELGRMEQVNEVARAVKEAMKDAQIDDPRDVHFVQI
KCPLLTAERIEDAKRRGKDVVVNDTYKSMAYSRGASALGVALALGEISADKISNEAICHDWNLYSSVASTSAGVELLNDE
IIVVGNSTNSASDLVIGHSVMKDAIDADAVRAALKDAGLKFDCCPPAEELAKIVNVLAKAEAASSGTVRGRRNTMLDDSD
INHTRSARAVVNAVIASVVGDPMVYVSGGAEHQGPDGGGPIAVIARV
>Q50940 4.2.1.1~~~cah~~~Carbonic anhydrase~~~
MPRFPRTLPRLTAVLLLACTAFSAAAHGNHTHWGYTGHDSPESWGNLSEEFRLCSTGKNQSPVNITETVSGKLPAIKVNY
KPSMVDVENNGHTIQVNYPEGGNTLTVNGRTYTLKQFHFHVPSENQIKGRTFPMEAHFVHLDENKQPLVLAVLYEAGKTN
GRLSSIWNVMPMTAGKVKLNQPFDASTLLPKRLKYYRFAGSLTTPPCTEGVSWLVLKTYDHIDQAQAEKFTRAVGSENNR
PVQPLNARVVIE
>W6RJ11 3.5.2.15~~~~~~Cyanuric acid amidohydrolase~~~
MKTRVTRLTVAAPNDVSALAQAIESGEVDPTRVIAVLGKTEGNGCVNDFTRAFATSTLKRFFAERLALNETEVDERIAFV
MSGGTEGGLSPHWLVFEVDDSAPSRDTTTPGLAAGVAFTRDLRPEEIGRTSQVELTRDAVLRAMAAAGIQRVEDVHFVQI
KCPLLTAARINEAAARGQSVACHDTYESMGYSRGASALGVAAALGDLPGDVRDEQICREWSLYSSRASSSAGIELLRNEV
LVLGNAPGWDPEYRIGHAVMEDALDAQAIERALASVPGGDKIKLTPERLAGLLVKAEPSASGSIRGNRHVMSDDSDINGS
RHARALVGGVLAGQLGDTRLFVSGGAEHQGPNGGGPLALIVRS
>P58329 3.5.2.15~~~atzD~~~Cyanuric acid amidohydrolase~~~
MYHIDVFRIPCHSPGDTSGLEDLIETGRVAPADIVAVMGKTEGNGCVNDYTREYATAMLAACLGRHLQLPPHEVEKRVAF
VMSGGTEGVLSPHHTVFARRPAIDAHRPAGKRLTLGIAFTRDFLPEEIGRHAQITETAGAVKRAMRDAGIASIDDLHFVQ
VKCPLLTPAKIASARSRGCAPVTTDTYESMGYSRGASALGIALATEEVPSSMLVDESVLNDWSLSSSLASASAGIELEHN
VVIAIGMSEQATSELVIAHGVMSDAIDAASVRRTIESLGIRSDDEMDRIVNVFAKAEASPDGVVRGMRHTMLSDSDINST
RHARAVTGAAIASVVGHGMVYVSGGAEHQGPAGGGPFAVIARA
>P0A3V5 3.5.2.15~~~trzD~~~Cyanuric acid amidohydrolase~~~
MQAQVFRVPMSNPADVSGVAKLIDEGVIRAEEVVCVLGKTEGNGCVNDFTRGYTTLAFKVYFSEKLGVSRQEVGERIAFI
MSGGTEGVMAPHCTIFTVQKTDNKQKTAAEGKRLAVQQIFTREFLPEEIGRMPQVTETADAVRRAMREAGIADASDVHFV
QVKCPLLTAGRMHDAVERGHTVATEDTYESMGYSRGASALGIALALGEVEKANLSDEVITADYSLYSSVASTSAGIELMN
NEIIVMGNSRAWGGDLVIGHAEMKDAIDGAAVRQALRDVGCCENDLPTVDELGRVVNVFAKAEASPDGEVRNRRHTMLDD
SDINSTRHARAVVNAVIASIVGDPMVYVSGGSEHQGPAGGGPVAVIARTA
>F4CUJ4 3.5.2.15~~~~~~Cyanuric acid amidohydrolase~~~
MTVVDIVKRTTSSPDDTALVKTLADAGYSTADVVALVAKTEGNGCVNDFSRTLADHTWDAVLPADAVTVFSGGTEGVLSP
HASAFVGTDRPAAPEGALVAAVGRTASIPIADLGRAGQVRAVAARVRELCADAALEPGDVHLVLVKCPLLTTESISRCLA
DGVEPATRDTLRSMAMSRAASALGVAVALGEISEPDAAAALRGEADVWSSVASISSGAELDDCHILVLGNSPAAHGPLRA
VHGVMRDAMDARTVLDLLDRVSADGGEVVQVLAKAEADPSGSIRGRRHTMLTDSDLSSTRHARAAVGGLLAGLVGDSAIY
VSGGAEHQGPPGGGPVTVVYRVAS
>Q1M7F3 3.5.2.15~~~~~~Cyanuric acid amidohydrolase~~~
MPSLRAHVFRVPADGPDDVAGVEALFASGLQANNIVAVLGKTEGNGCVNDFTRGYATRSFETLFSRYGVDGVSIIMSGGT
EGALSPHWTVFARETVETPGERALAIGVSRTPALSPEHLGRREQILLVAEGVKSAMRDAGIDDPADAHFVQIKCPLLTSR
RIAEAEAAGRTVATHDTLKSMGLSRGASALGVAVALGEIDATSINDADICTRFDLFSRCASTSSGVELTDHEIIVLGMSA
KWSGPLSIDHAVMRDAIDAHSVRKARERLPENSRLAAVLAKAEPDPSGEIDGRRHTMLDDSDIAGTRHARAFVGGVLAGI
FGITDLYVSGGAEHQGPPGGGPVAIIVEKEQ
>C6BAU4 3.5.2.15~~~~~~Cyanuric acid amidohydrolase~~~
MPSLRAHVFRVPADGPDDVAGVEALFASGLQANNVVAVLGKTEGNGCVNDFTRGYATRSFETLFSRYGVDGVSIIMSGGT
EGALSPHWTVFARETVETPGERALAIGVSRTPALPPEHLGRREQILLVAEGVKSAMRDAGIDDPADVHFVQIKCPLLTSR
RIAEAEAAGRTVATHDTLKSMGLSRGASALGVAVALGEIDATSIGDADICTRFDLFSRCASTSSGGELTDHEIIVLGMSA
KWSGPLSIDHAVMLDAIDAHSVRKARERLPENSRLAAVLAKAEPDPSGRIDGRRHTMLDDSDIAGTRHARAFVGGVLAGI
FGITDLYVSGGAEHQGPPGGGPVAIIVEKEQ
>Q9WXT2 3.1.1.41~~~axeA~~~Cephalosporin-C deacetylase~~~COG3458
MAFFDLPLEELKKYRPERYEEKDFDEFWEETLAESEKFPLDPVFERMESHLKTVEAYDVTFSGYRGQRIKGWLLVPKLEE
EKLPCVVQYIGYNGGRGFPHDWLFWPSMGYICFVMDTRGQGSGWLKGDTPDYPEGPVDPQYPGFMTRGILDPRTYYYRRV
FTDAVRAVEAAASFPQVDQERIVIAGGSQGGGIALAVSALSKKAKALLCDVPFLCHFRRAVQLVDTHPYAEITNFLKTHR
DKEEIVFRTLSYFDGVNFAARAKIPALFSVGLMDNICPPSTVFAAYNYYAGPKEIRIYPYNNHEGGGSFQAVEQVKFLKK
LFEKG
>P60584 1.3.8.13~~~caiA~~~Crotonobetainyl-CoA reductase~~~COG1960
MDFNLNDEQELFVAGIRELMASENWEAYFAECDRDSVYPERFVKALADMGIDSLLIPEEHGGLDAGFVTLAAVWMELGRL
GAPTYVLYQLPGGFNTFLREGTQEQIDKIMAFRGTGKQMWNSAITEPGAGSDVGSLKTTYTRRNGKIYLNGSKCFITSSA
YTPYIVVMARDGASPDKPVYTEWFVDMSKPGIKVTKLEKLGLRMDSCCEITFDDVELDEKDMFGREGNGFNRVKEEFDHE
RFLVALTNYGTAMCAFEDAARYANQRVQFGEAIGRFQLIQEKFAHMAIKLNSMKNMLYEAAWKADNGTITSGDAAMCKYF
CANAAFEVVDSAMQVLGGVGIAGNHRISRFWRDLRVDRVSGGSDEMQILTLGRAVLKQYR
>P31572 2.8.3.21~~~caiB~~~L-carnitine CoA-transferase~~~COG1804
MDHLPMPKFGPLAGLRVVFSGIEIAGPFAGQMFAEWGAEVIWIENVAWADTIRVQPNYPQLSRRNLHALSLNIFKDEGRE
AFLKLMETTDIFIEASKGPAFARRGITDEVLWQHNPKLVIAHLSGFGQYGTEEYTNLPAYNTIAQAFSGYLIQNGDVDQP
MPAFPYTADYFSGLTATTAALAALHKVRETGKGESIDIAMYEVMLRMGQYFMMDYFNGGEMCPRMSKGKDPYYAGCGLYK
CADGYIVMELVGITQIEECFKDIGLAHLLGTPEIPEGTQLIHRIECPYGPLVEEKLDAWLATHTIAEVKERFAELNIACA
KVLTVPELESNPQYVARESITQWQTMDGRTCKGPNIMPKFKNNPGQIWRGMPSHGMDTAAILKNIGYSENDIQELVSKGL
AKVED
>Q8GB19 2.8.3.21~~~caiB~~~L-carnitine CoA-transferase~~~
MTEHLPMPQFGPLAGVRVVFSGIEIAGPFAGQMFAEWGAEVIWIENVAWADTIRVQPHYPQLSRRNLHALSLNIFKDGGR
DAFLKLMETTDIFIEASKGPAFARRGITDEVLWEHNPKLVIAHLSGFGQYGDPQYTNLPAYNTIAQAFSGYLIQNGDKDQ
PMPAFPYTADYFSGMTATTSALAALYKVQQTGKGESIDIAMYEVMLRMGQYFMMDYFNGGEICPRMTKGKEPYYAGCGLY
RCQDGYIVMEVVGITQIEEIFKDIGLAHLLGTPEVPKGTQLIHRINCPHGQLFEDELDEWLANQPITAVLKRLSELNIAS
AKVLTIPELEGNPQYVARESITQWKTMSGETCKGPNIMPKFKNNPGKIWRGMPAHGMDTNAILKNIGYSDEQIRELVDKG
LAKIVE
>P31552 6.2.1.48~~~caiC~~~Crotonobetaine/carnitine--CoA ligase~~~COG0318
MDIIGGQHLRQMWDDLADVYGHKTALICESSGGVVNRYSYLELNQEINRTANLFYTLGIRKGDKVALHLDNCPEFIFCWF
GLAKIGAIMVPINARLLCEESAWILQNSQACLLVTSAQFYPMYQQIQQEDATQLRHICLTDVALPADDGVSSFTQLKNQQ
PATLCYAPPLSTDDTAEILFTSGTTSRPKGVVITHYNLRFAGYYSAWQCALRDDDVYLTVMPAFHIDCQCTAAMAAFSAG
ATFVLVEKYSARAFWGQVQKYRATVTECIPMMIRTLMVQPPSANDQQHRLREVMFYLNLSEQEKDAFCERFGVRLLTSYG
MTETIVGIIGDRPGDKRRWPSIGRVGFCYEAEIRDDHNRPLPAGEIGEICIKGIPGKTIFKEYFLNPQATAKVLEADGWL
HTGDTGYRDEEDFFYFVDRRCNMIKRGGENVSCVELENIIAAHPKIQDIVVVGIKDSIRDEAIKAFVVLNEGETLSEEEF
FRFCEQNMAKFKVPSYLEIRKDLPRNCSGKIIRKNLK
>P31551 4.2.1.149~~~caiD~~~Carnitinyl-CoA dehydratase~~~COG1024
MSESLHLTRNGSILEITLDRPKANAIDAKTSFEMGEVFLNFRDDPQLRVAIITGAGEKFFSAGWDLKAAAEGEAPDADFG
PGGFAGLTEIFNLDKPVIAAVNGYAFGGGFELALAADFIVCADNASFALPEAKLGIVPDSGGVLRLPKILPPAIVNEMVM
TGRRMGAEEALRWGIVNRVVSQAELMDNARELAQQLVNSAPLAIAALKEIYRTTSEMPVEEAYRYIRSGVLKHYPSVLHS
EDAIEGPLAFAEKRDPVWKGR
>Q8GB17 4.2.1.149~~~caiD~~~Carnitinyl-CoA dehydratase~~~
MSQSLHLTTRGSVLEIILDRPKANAIDAKTSHEMGEVFMRFRDDPSLRVAIITGAGERFFCAGWDLKAAAEGEAPDADFG
AGGFAGLTELFDLNKPVIAAINGYAFGGGFELALAADMIICSDNASFALPEAQLGIVPDSGGVLRLPKRLPPAIVNEMLM
TGRRMNAQEALRWGIANRVVSATELMDSARELADQIANSAPLAVAALKEIYRATSELSIEEGYKLMRSGVLKYYPRVLHS
EDALEGPLAFAEKRSPEWKGR
>P31553 ~~~caiT~~~L-carnitine/gamma-butyrobetaine antiporter~~~COG1292
MKNEKRKTGIEPKVFFPPLIIVGILCWLTVRDLDAANVVINAVFSYVTNVWGWAFEWYMVVMLFGWFWLVFGPYAKKRLG
NEPPEFSTASWIFMMFASCTSAAVLFWGSIEIYYYISTPPFGLEPNSTGAKELGLAYSLFHWGPLPWATYSFLSVAFAYF
FFVRKMEVIRPSSTLVPLVGEKHAKGLFGTIVDNFYLVALIFAMGTSLGLATPLVTECMQWLFGIPHTLQLDAIIITCWI
ILNAICVACGLQKGVRIASDVRSYLSFLMLGWVFIVSGASFIMNYFTDSVGMLLMYLPRMLFYTDPIAKGGFPQGWTVFY
WAWWVIYAIQMSIFLARISRGRTVRELCFGMVLGLTASTWILWTVLGSNTLLLIDKNIINIPNLIEQYGVARAIIETWAA
LPLSTATMWGFFILCFIATVTLVNACSYTLAMSTCREVRDGEEPPLLVRIGWSILVGIIGIVLLALGGLKPIQTAIIAGG
CPLFFVNIMVTLSFIKDAKQNWKD
>B4EY22 ~~~caiT~~~L-carnitine/gamma-butyrobetaine antiporter~~~COG1292
MSKDNKKAGIEPKVFFPPLIIVGILCWLTVRDLDASNEVINAVFSYVTNVWGWAFEWYMVIMFGGWFWLVFGRYAKKRLG
DEKPEFSTASWIFMMFASCTSAAVLFWGSIEIYYYISSPPFGMEGYSAPAKEIGLAYSLFHWGPLPWATYSFLSVAFAYF
FFVRKMEVIRPSSTLTPLVGEKHVNGLFGTVVDNFYLVALILAMGTSLGLATPLVTECIQYLFGIPHTLQLDAIIISCWI
LLNAICVAFGLQKGVKIASDVRTYLSFLMLGWVFIVGGASFIVNYFTDSVGTLLMYMPRMLFYTDPIGKGGFPQAWTVFY
WAWWVIYAIQMSIFLARISKGRTVRELCLGMVSGLTAGTWLIWTILGGNTLQLIDQNILNIPQLIDQYGVPRAIIETWAA
LPLSTATMWGFFILCFIATVTLINACSYTLAMSTCRSMKEGAEPPLLVRIGWSVLVGIIGIILLALGGLKPIQTAIIAGG
CPLFFVNIMVTLSFIKDAKVHWKDCSPYTQKMTH
>P0DMP5 1.1.1.194~~~calA~~~Coniferyl-alcohol dehydrogenase~~~
MQLTNKKIVVTGVSSGIGAETARVLRSHGATVIGVDRNMPSLTLDAFVQADLSHPEGIDKAISQLPEKIDGLCNIAGVPG
TADPQLVANVNYLGLKYLTEAVLSRIQPGGSIVNVSSVLGAEWPARLQLHKELGSVVGFSEGQAWLKQNPVAPEFCYQYF
KEALIVWSQVQAQEWFMRTSVRMNCIAPGPVFTPILNEFVTMLGQERTQADAHRIKRPAYADEVAAVIAFMCAEESRWIN
GINIPVDGGLASTYV
>O86447 1.2.1.68~~~calB~~~Coniferyl aldehyde dehydrogenase~~~
MSILGLNGAPVGAEQLGSALDRMKKAHLEQGPANLELRLSRLDRAIAMLLENREAIADAVSADFGNRSREQTLLCDIAGS
VASLKDSREHVAKWMEPEHHKAMFPGAEARVEFQPLGVVGVISPWNFPIVLAFGPLAGIFAAGNRAMLKPSELTPRTSAL
LAELIARYFDETELTTVLGDAEVGALFSAQPFDHLIFTGGTAVAKHIMRAAADNLVPVTLELGGKSPVIVSRSADMADVA
QRVLTVKTFNAGQICLAPDYVLLPEESLDSFVAEATRFVAAMYPSLLDNPDYTSIINARNFDRLHRYLTDAQAKGGRVIE
INPAAEELGDSGIRKIAPTLIVNVSDEMLVLNEEIFGPLLPIKTYRDFDSAIDYVNSKQRPLASYFFGEDAVEREQVLKR
TVSGAVVVNDVMSHVMMDTLPFGGVGHSGMGAYHGIYGFRTFSHAKPVLVQSPVGESNLAMRAPYGEAIHGLLSVLLSTE
C
>P16640 1.18.1.5~~~camA~~~Putidaredoxin reductase CamA~~~
MNANDNVVIVGTGLAGVEVAFGLRASGWEGNIRLVGDATVIPHHLPPLSKAYLAGKATAESLYLRTPDAYAAQNIQLLGG
TQVTAINRDRQQVILSDGRALDYDRLVLATGGRPRPLPVASGAVGKANNFRYLRTLEDAECIRRQLIADNRLVVIGGGYI
GLEVAATAIKANMHVTLLDTAARVLERVTAPPVSAFYEHLHREAGVDIRTGTQVCGFEMSTDQQKVTAVLCEDGTRLPAD
LVIAGIGLIPNCELASAAGLQVDNGIVINEHMQTSDPLIMAVGDCARFHSQLYDRWVRIESVPNALEQARKIAAILCGKV
PRDEAAPWFWSDQYEIGLKMVGLSEGYDRIIVRGSLAQPDFSVFYLQGDRVLAVDTVNRPVEFNQSKQIITDRLPVEPNL
LGDESVPLKEIIAAAKAELSSA
>Q93TU6 3.7.1.18~~~camK~~~6-oxocamphor hydrolase~~~
MKQLATPFQEYSQKYENIRLERDGGVLLVTVHTEGKSLVWTSTAHDELAYCFHDIACDRENKVVILTGTGPSFCNEIDFT
SFNLGTPHDWDEIIFEGQRLLNNLLSIEVPVIAAVNGPVTNHPEIPVMSDIVLAAESATFQDGPHFPSGIVPGDGAHVVW
PHVLGSNRGRYFLLTGQELDARTALDYGAVNEVLSEQELLPRAWELARGIAEKPLLARRYARKVLTRQLRRVMEADLSLG
LAHEALAAIDLGMESEQ
>O07431 2.1.1.6~~~~~~Catechol O-methyltransferase~~~COG4122
MGMDQQPNPPDVDAFLDSTLVGDDPALAAALAASDAAELPRIAVSAQQGKFLCLLAGAIQARRVLEIGTLGGFSTIWLAR
GAGPQGRVVTLEYQPKHAEVARVNLQRAGVADRVEVVVGPALDTLPTLAGGPFDLVFIDADKENNVAYIQWAIRLARRGA
VIVVDNVIRGGGILAESDDADAVAARRTLQMMGEHPGLDATAIQTVGRKGWDGFALALVR
>G8T6H8 2.1.1.6~~~~~~Catechol O-methyltransferase~~~COG4122
MNNQIFESVDHYISDLLGYEDDALLAATNSLAEAGMPAISVSPNQGKFLQLLAQLCQAKNILELGTLAGYSTIWMARALP
KNGRLITLEYDPKHAAVAQKNIDRAGLTSQVQIRTGKAIDILPQLVEEGAGPFDMIFIDADKPPYTEYFQWALRLSRPGT
LIVADNVIRDGKVLDENSTEPAVQGARRFNAMLGANTAVDATILQMVGVKEYDGMALAIVK
>Q84CG1 4.2.1.145~~~vioD~~~Capreomycidine synthase~~~
MTGPLGAGPQALPAAPLEDWLRERYFQAKTDISSSGVHNYTFGELRALDPALLGTRELDQLMFRDGPSLGDERLRAAVAA
RVRPGPGHVVMTTHGSSEALYLAFAALVRPGDEVVVATPAYHSLSGLATAAGASLRPWPLRPENGFAPDLDDLRAVLSDR
TRLVVVNFPHNPSGACVDPRGRTELLDLVANSQAVLLWDGAFTDLVHDHPPLAEPSQDLDRVLSFGTLSKAYGLPGLRVG
WCVVPQDLVSELVRIRDYLTLSLSPLVERVAAVAVEHADALITPRLTEARHNRRRVLEWAAASEGAIDCPVPRGGVTAFP
RFTAHTDVTDLCERLLARHGVLVVPGRVFGQADRMRIGFSCPRPELERGLAAISEELGTHARGRRRGTG
>P0DPE4 1.5.1.43~~~~~~Carboxynorspermidine synthase~~~
MAILQIGAGGVGWVVAHKAAQNNDVLGDITIASRTVGKCEKIIESIQKKNNLKDSTKKLEARAVNADDVDSLVALIKEVQ
PDLVINAGPPWVNMSIMEACYQAKVSYLDTSVAVDLCSEGQQVPQAYDWQWGYREKFEEAGITGILGAGFDPGVVSVFAA
YAVKHLFDEIDTIDVMDVNAGDHGKKFATNFDPETNMLEIQGDSFYWENGEWKQVPCHSRMLEFEFPNCGSHKVYSMAHD
EVRSMQEFIPAKRIEFWMGFGDRYLNYFNVMRDIGLLSPDPLTLHDGTVVQPLHVLKALLPDPTSLAPGYTGLTCIGTWV
QGKKDGKERSVFIYNNADHEVAYEDVEHQAISYTTGVPAITAALQFFRGKWADKGVFNMEQLDPDPFLETMPSIGLDWHV
QELEPGQPVIHKLK
>Q9KRL3 1.5.1.43~~~~~~Carboxynorspermidine synthase~~~COG1748
MSILQIGAGGVGWVVAHKAAQNNDVLGDITIASRSIAKCEKIIESIKGKNNLKDSSKKLEARQVNADDIESLVKLINEVK
PDLVINAGPPWVNVAIMEACYQAKVSYLDTSVSVDLCSKGQQVPEAYDAQWAFRDKFKQAGITAILSAGFDPGVVSVFAA
YAAKYLFDEIDTIDVLDINAGDHGKKFATNFDPETNLLEIQGDSIYWDAGEWKRVPCHTRMLEFDFPKCGKFKVYSMSHD
ELRSLKEFIPAKRIEFWMGFGDRYLNYFNMMRDIGLLSPEPLTLQDGTVVKPLQVLKAMLPDPTSLAPGYKGLTCIGTWV
QGKKDGKARSVFIYNHADHEVAYHDVEHQAIAYTTGVPAITAALQFFRGEWAEPGVFNMEQLNPDPFLETMPSIGLGWDV
MELEPGQPDIQVVK
>P61517 4.2.1.1~~~can~~~Carbonic anhydrase 2~~~COG0288
MKDIDTLISNNALWSKMLVEEDPGFFEKLAQAQKPRFLWIGCSDSRVPAERLTGLEPGELFVHRNVANLVIHTDLNCLSV
VQYAVDVLEVEHIIICGHYGCGGVQAAVENPELGLINNWLLHIRDIWFKHSSLLGEMPQERRLDTLCELNVMEQVYNLGH
STIMQSAWKRGQKVTIHGWAYGIHDGLLRDLDVTATNRETLEQRYRHGISNLKLKHANHK
>P45148 4.2.1.1~~~can~~~Carbonic anhydrase 2~~~COG0288
MDKIKQLFANNYSWAQRMKEENSTYFKELADHQTPHYLWIGCSDSRVPAEKLTNLEPGELFVHRNVANQVIHTDFNCLSV
VQYAVDVLKIEHIIICGHTNCGGIHAAMADKDLGLINNWLLHIRDIWFKHGHLLGKLSPEKRADMLTKINVAEQVYNLGR
TSIVKSAWERGQKLSLHGWVYDVNDGFLVDQGVMATSRETLEISYRNAIARLSILDEENILKKDHLENT
>B8G5D6 1.6.99.1~~~~~~NADPH dehydrogenase~~~COG1902
MQPHLFTPLTIGSVTLRNRIGMSPMCQYSAVDGFPTDWHLMHLGARAAGGVGLIILEATAVSPEGRISPFDLGIWSDDHI
AALSRIVKLIESLGAVAGIQLAHAGRKASVGRPWEGGKPIAPANGGWPVVGPTAEPFAPGYPTPIPLDAAGIARVVADFA
TATKRARAAGFRWIEIHAAHGYLLHNFLSPLGNDRNDEYGGDLRGRVRLLSEVTAAVRAEWPSDLPLAVRLSCSDWTPEG
LTIADTVEVARMLREQGVDLIDCSSGGIAPGITIPVGEGYQVPFAAQVRREANIATAAVGLITRPEHADAIVRNGDADLV
LLGRELLRDPHWPLRAARALGHDLAPPPQYLRAW
>A0A5D0EMF2 3.2.2.5~~~~~~CD-NTase-associated protein 12~~~
MKKKIFIGSSSEELRIAEKVKKILEQDNEFEVTIWNDNSIWDNSVFKLNHNFLTDLLNSSLSSDFGILIGTCDDKVIVRG
TERLQPRDNVLFELGFFIGKLGLDNCAFLIDKNIHILSDVQGITLARANMGDPDELKRAVNCIKEHFKHQPNSGINFFPS
STLASVYHENFIKPTCQTIIEDNGILDTAGKKHTNCLVKIIIPNKINIDVNSQFQILKNKISTNTLSFNYKGRPRNISIE
IVESSEENTTIIDFPTIISGIYYAISNLLPQDSETDSITYRNILARELDRFVNTLYKFIKRDGYDEIVKIVYEDKLY
>A0A381HAP5 3.2.2.5~~~~~~CD-NTase-associated protein 12~~~
MRGKRIFIGSSSEELRLAEQAKKILEKNTNYQVTIWNENMWDKAVFRLNNSYLNDLIRATLHFDFGILIGTKDDKVIFRG
SEEIQPRDNVLFELGLFIGRLGLNNCAFLVDEEIKILSDVKGISLARFKEKDSDSFNNAVLSIRESFDRQNDSDINFFPS
STLAAVYYENFIKPTCSHIINNGGLLDKNGYIYKKCTIKIIIPKKLTSDVNSQFQRIKAKIETKELSFEYLGRPRNINVE
IIAEDGEVMIIDFPTILSGINYAISNLLPQDFNSMSVDYEAILSRELERFVYTLKKIALRDGFDDLIKIVDEDN
>P0DUD9 3.2.2.5~~~~~~CD-NTase-associated protein 12~~~
MKTRIFIGSSKEGLEIAEYIKLQLGTKYECYLWTDDIFKFNESFLYTLLKEASLFDFGILVATKDDLSTIRDKSFDTPRD
NVIFEFGLFLGRLGPSRAFVIQESGAKLPTDLLGITVPQFEKTIPLANSTSLNNEIERISKTIDEKITLGELGLLPSTVL
AIGYFYNFVSIVCESIHTKSDIKVDDAIFKKFELNIVIPKDLDADIKKRATVYFKSKTLKEIQFETSSRNFPVFVTYDNQ
SKDVLKLYDMPTTLGGIDKAIEMFMRKGHIGKTSQQKLLEERELRNFQTTLQNLIDNDAFCRNIVKIIQEE
>P0DUE0 3.2.2.5~~~~~~CD-NTase-associated protein 12~~~
MKTRIFIGSSSEGIDVAKRIKTFFAPEYDCFLWTDDIFRNNESFLETLVKSASLFDFGFMVFSADDKTTIRDQHFESPRD
NVLFEYGLFLGRVGLDRAFVIAETDAKIPTDMLGITQTRYETIVNSKGIKVATDSLESSLQKLKKQIDENVQLGHLGLLP
STVIAISYFEGFVKLAAEWLVENTPELMINNHKFNKASLKIVMPESLDTDIKRSAMMYYKRHGLEEARIDTKHRNYPIHF
ASKTEDGILEVYDMPTILTGIDKAIDMYFRVGHIGKTNEQKLAEDHEMNNFKRVLQLLINEDAFCRECVEIIEPQP
>A0A1G6LGU2 3.2.2.5~~~~~~CD-NTase-associated protein 12~~~
MIKKRLFIGSSSEELKTAEIVKEVLLKDFEVTIWNDNVWDTAVFKINQNFLADLLKASLQFDFGILIGTKDDKVMFREVE
MIQPRDNVLFELGLFTGRLGTSKCAFLIDKEIKLPSDFNGLTLARFDSTNEATVIAGANSIKDLFLASADDEINFFPSAT
LASVYYENLIVPICRFIIDNNGFTKGDTHYQKCKLNIIVPERINQDVNLQFEKLKGLFTTENVSFKYSGRPRQISVDTQI
KNDTLEFIDFPTIITGINHAISNLLPNDFNKQSPDYSSILDRELRRFITTLKKLLIRGGFDEMVNVKRDSEL
>A0A2T5Y4G4 3.2.2.5~~~~~~CD-NTase-associated protein 12~~~
MKKRIFIGSSSEQLTILNEIVDLLGDDVECIPWTDAFALNKSGLDSLIKQTRLADYSILIATKDDLTKQRGESLTKPRDN
VVFEFGLFLGAAGPEKCYLIAEEDTDLPTDLDGITVAKFTRNSGQYNSLDKIVESIRTHLVKIAEMSQLGLLPSTALAIG
YYNSFIKRVCEEIHGSECVELEGKKIKVKSFRVDVVIPETLDDNGVGNFTTLYNKRYGLSKATTCTNPALLGTRGFPFHF
KVDPPDANQESPVDIHLLDIPSTLSTIVESLKLYLPSNQVGQDFDMDYLEMRELENFAKVLKYLIGRNAATKGYVNVLTN
VKL
>P0DUD7 ~~~~~~CD-NTase-associated protein 13~~~
MEKIYTWLKTNSYIVHHVSTSLNIISFIIVLIWIFESTIKEKLNIIFTVNLEAIVVFISILIVGLNQLLQKLLIEAEYSP
AFALAVGYFKNFIFPAITQIKENGEVNPKICIYKPKHFDELTSTNIDMIKAELTNKKYNLSEINLSLKGARARDILTLNK
KSKIHSYFDFPNTLLSLYSYVDFKIASSNNNSSELKKKKFVELLIEQFYLKLNELIQENNLTNNITFCDKNLQGL
>A0A150XSR0 ~~~~~~CD-NTase-associated protein 13~~~
MKYGLLQENGKVAEFIQIKTIMKNVIANVSTAITLALMILWIKYPNRIEWEAIIGILLVIKEVTIRWQIGKIESLEFSPA
ISLAHGYVNNFLEPAINELLMKASNNINFSIYIPHDLEELSDQQIDRMKLQIEANGYRLKEIKLKKKTGRPHDLLLVEKQ
EGTLSYFDFPRTLLSLQSYIDYKVDSTKNEFSEEKKIAMGAKLVDAFHNEVDRLIKKKNLEGIVTFVSKDLELY
>P0DTF2 ~~~cap2~~~CD-NTase-associated protein 2~~~
MEEGRLHRVMLSCGYSFTYARNLPEKAILYSQNCQQGYYTKEYTTVAGDVKIALIIRADPFIELPIAYILELPEQFKDRL
MPHISLEGFLCYVEQMEADWDSNNLEDTYREVDAQIQRTLVNSVSAAMEGSNDKKELEGEFTAYWRPSESLFVLSNANRS
TRLKTFISQSVRPNGSSQVEYITVEESSPENSGKVISAWLKLRYLPKNSLKEYHISTHYISVNPSRVAGMKWPPASFRDL
LSWLGKADHNAKNKVVEYIKKEGKKRYVFLFDVLKQDTFGIYVEFDLKSIDLKRYKHSAKNSTAKLSTVLGGKSVCSMYQ
RLGVVRADIETLLSRNTRREGSAKLSEKRIALVGCGTIGGYLAELLLRNGAGCGKGHLHLYDDDLYKPSNFGRHTLSAHD
FGRYKSLSLARKLKDSVHLPTQIIGFEKQFSIRADLMQKYDIIIDATGRPPVSKRMASIVRTIALEKRPKIIHAFNDGNG
RASKVFIDDGRSCYGCMVSNPEKYRNGIDSRFCHIDISREKNKNCGSTYTPYDAAVSNITASLTQMAVLSTLDPELKWTY
SEHMLEGGRSLKSQFLPRQPNCPICNEYE
>P0DTF3 ~~~cap3~~~CD-NTase-associated protein 3~~~
MNMNELVFIDDFDNHVVIMSEVVMRLNSYRQTHYTSTESGGTLIGERRGQHLVITHISEPGQDDVRNRTGLERKGIHHQQ
KVNDLFQQSNGFIVYLGEWHTHPEDFPHPSFIDIKSWVMGIVATEPMIMLIVGRKDIWIGKKIKNDIKKLKKKMVS
>C0VHC9 3.1.-.-~~~cap4~~~CD-NTase-associated protein 4~~~
MSASLLEKQSTGGAIARVGFGYQDAFVLRSLPLWLSQSAFSHIVSEALSDIEVCYFSSEKSLHVMYEAKNHSLTATEFWD
EIRRFKSLFDTHPKNFIWFNLVCPSYNTAISPLISKIDRLRGVGSSYDDDSSVSVNGRSEYLDWCVGKKIDFSLAEFALD
YVGFITFNSENSESIFLSEIQDTINIELLRSQVKQLKDQFKNLISRSSFGPIYRKDFENFICHALEEDRSQWLLDPIKIN
LSASSSQYQDLNLDISDFNGPDRAQKTSSDWNSLIKKAVSIGDFIHNSGDRRTLLIDGKQRMSTACMLGYVFSATRNFLL
EIEHNGLIYRTDDHKQKEGQFFTKIEAVEPQGETEAIVAIGFPTAIGKDIDSTINEVKSLPRLNLESSHAIDNMETLNLA
VREAKSALVSFKSENKLSKLHLFIKAPSVFAMVLGHRLNGICDIQLYDWVDGQYIPTAELNL
>P0DUD5 3.1.-.-~~~cap4~~~CD-NTase-associated protein 4~~~
MATSVLANWHGHDYQARYFWIEASRLKNPQQDFVVEVSYEADGPKAFDDVITRYNPPRRSTGPDRIQADYYQIKFHVTQA
ASFGFEDLIDPAFIGAETFSILERLKQAKGTEPANSAFHLVTTDRIIDEDPLGEIISNVDGSIRLDKLFDGTTDRSRKGK
VRKLWRQHLKLSTDQELEQVLSGFHIQQSQPTLEAMREKVNTCFQIIGLITCETSSDFRFDGAARALRSQERYRFTREQF
TALCEEENWIRSEAPESFRNVALRSFSDGPLDIMDALPEHTLSLLSLFEGRFPSPGIEWNDVIKPQVETFLTGIRQTERK
VRLYLNTHSSIAMLAGKCLGHKSGVEIELVQKGRMGDSIWSENESQDEPDAVIETETVGTGSDVAVVLSITRNALPKARA
YILENQPDIGRIIHVTPANGHGQRSVKNGSHAVAIAEQVSDVVMDADLPVEASLHIFSAAPNAVNFYLGQHTDFLGTCVF
YEFDFQRQRDGSYLPSFKV
>A0A2K8K5C5 3.1.-.-~~~cap4~~~CD-NTase-associated protein 4~~~
MSASLLEKQSTGGAIARVGFEYQDAFVLKNLPLWLSESAFSHIVSESIGDVEVCYFSLEKDFQRVMYEAKNHSLTSTDFW
KEIKRFKEAFDIPSSEFTRFGLVCPLYTSTLHPFLAQIERIRGVGSSYSADSVILQKSRQDITQWCSDKGFETSLAEFAL
DHVDFLSFNAEDSDSVFIGEIEEKLSNIELTTRKAKQLRDQFKNLISRSSFGPIHRKDFENFICHALEEDRTQWLSDPIK
INLSASSSQHQDLNLDISDFNGPDRAQKTSSDWNSLIKKAVSIGDFIHNSGDRRTLLIDGKQRMSTACMLGYVFSATRNF
LLEIEHNGLAYRTDDHKQKEGQFFNKTNSIELHGKTEAIVTIGFPTAIGKDIDSTINEIKNLPRLNLESSNVIDNMETLN
LAVKEAKSALVSFKAENKLSKLHLFIKAPSVFAMVLGHRLNGVCNIQLYDWVNGEYMPTAELNI
>D7Y2H4 ~~~cap6~~~CD-NTase-associated protein 6~~~
MNVKPSLDELFERRINFPDFEPQERLARLVGLDEHKDRLSKILGLLVNPYGIQEWAKKYHPDARAAVDTVLRRPPLVVLA
GDVGSGKTELAETIGDAVARQEDIDITLYPLSLATRGQGRVGEMTQLVSAAFDYTIEAADKLKNTNGKARGAVLLLIDEA
DALAQSRENAQMHHEDRAGVNAFIRGIDRIANQKLPAAVLMCTNRLKALDPAVQRRAAEILTFSRPNDEQRHYLLHSKLT
GLGLNSTAVEELVRLTGPRDPNSPGFTFSDITQRLIPSIILAAYPYNAVSVHSALQVVNKMTPTPAFIDRQ
>P0DTF4 ~~~cap6~~~CD-NTase-associated protein 6~~~
MTKNPSSDATLPKGIHRSWKLPDKSLGDLWDSIVMDEAIKKQLLSQAIVNFTVRPKVERTVLPLHGVILLVGPPGTGKTS
LARGLAHRVAESFSSAKFRLLEVEPHTLTSSAMGKTQRAVADLFSQSIAESAAAGPTIVLLDEVETLAADRAKLSLEANP
VDVHRATDAVLVQLDMLAERNPHLLFVATSNFPQAVDSAFLSRCDMVMEVPLPGKDACKQILVDCLNGLAKTFPGIGKLS
SAHQFDVCAGECVGLDGRAIRKVVANALAADPQVAIDPNKLSVEHLRSAIRQAKQMRLQGGKQK
>D7Y2H3 ~~~cap7~~~CD-NTase-associated protein 7~~~
MSSYSYTVAETQTFSVTHARHMAAKVATDLRRMQRFYGYPSDADIEAYEEELVVFLKAGYLGEVSYGFQKNNNWIEPTLR
YTAGDLLGSGTDDDPGKIRPGKDVSGASFYSFMTYSSKYLNATQSEKDTALKDLPFKRVGAQSPGINGYLENDKTYSAGG
RSLTRTSVRNFV
>P0DTF6 ~~~cap7~~~CD-NTase-associated protein 7~~~
MSTVATYSYTHSVTYVTDNILKSLKDIILLSGLDPEHFADRWESNTRAIKTWLGTGDLRKVILEIYNPATDKLVTRWDID
IVYGWSDGDGSFWTDTEQLKYAIKKAGLLPSQAKYKLMLDTKPGRPDVEGWSKGSYRSTDGMVKQSLGSTVEHSGLAGQA
GYWRQR
>P72367 ~~~cap8A~~~Capsular polysaccharide type 8 biosynthesis protein cap8A~~~
MESTLELTKIKEVLQKNLKILIILPLLFLIISAIVTFFVLSPKYQANTQILVNQTKGDNPQFMAQEVQSNIQLVNTYKEI
VKSPRILDEVSKDLNNKYSPSKLSSMLTITNQENTQLINIQVKSGHKQDSEKIANSFAKVTSKQIPKIMSLDNVSILSKA
DGTAVKVAPKTVVNLIGAFFLGLVVALIYIFFKVIFDKRIKDEEDVEKELGLPVLGSIQKFN
>P0DTF5 ~~~cap8~~~CD-NTase-associated protein 8~~~
MTTVVSRTFRSSPHRDALQTWDAIVELLTQGKDGTARSELRAVTGVAASLIADQAPKSAPIVATCDGPRTRIYCLFDEDA
IDGDDANEEVLGFEPLKGDWGVSLPCPKEQLGWVQSALKKHSSRIIARDLSQGIATQAQADAGQALSLDLGGFLKS
>P19579 ~~~capA~~~Capsule biosynthesis protein CapA~~~
MRRKLTFQEKLLIFIKKTKKKNPRYVAIVLPLIAVILIAATWVQRTEAVAPVKHRENEKLTMTMVGDIMMGRHVKEIVNR
YGTDYVFRHVSPYLKNSDYVSGNFEHPVLLEDKKNYQKADKNIHLSAKEETVKAVKEAGFTVLNLANNHMTDYGAKGTKD
TIKAFKEADLDYVGAGENFKDVKNIVYQNVNGVRVATLGFTDAFVAGAIATKEQPGSLSMNPDVLLKQISKAKDPKKGNA
DLVVVNTHWGEEYDNKPSPRQEALAKAMVDAGADIIVGHHPHVLQSFDVYKQGIIFYSLGNFVFDQGWTRTKDSALVQYH
LRDNGTAILDVVPLNIQEGSPKPVTSALDKNRVYRQLTKDTSKGALWSKKDDKLEIKLNHKHVIEKMKKREKQEHQDKQE
KENQVSVETTT
>P19580 ~~~capB~~~Capsule biosynthesis protein CapB~~~
MKNIKIVRILKHDEAIRIEHRISELYSDEFGVVYAGNHLIFNWYQRLYLSRNILISKKSKSRKGLIQMIFIIGICTVFLI
IYGIWEQRCHQKRLNSIPIRVNINGIRGKSTVTRLITGVVQEAKYKTVGKTTGTSARMIYWFTDEEQPIKRRKEGPNIGE
QRRVVKEAADLEAEALICECMAVQPDYQIIFQNKMIQANVGVIVNVLEDHMDVMGPTLDEVAEAFTATIPYNGHLVTIES
EYLDYFKEVAEERNTKVIVADNSRISEEFLRKFDYMVFPDNASLALAVAEALGIDEETAFRGMLNAHPDPGAMRITRFAD
QSKPAFFVNGFAANDPSSTLRIWERVDDFGYSNLAPIVIMNCRPDRVDRTEQFARDVLPYIKAEIVIAIGETTAPITSAF
EKGDIPTQEYWNLEGWSTSEIMSRMRPYLKNRIVYGVGNIHGAAEPLIDMIMEEQIGKKQAKVI
>P0A106 ~~~capB~~~Cold shock protein CapB~~~COG1278
MSNRQTGTVKWFNDEKGFGFITPQSGDDLFVHFKAIQSDGFKSLKEGQQVSFIATRGQKGMQAEEVQVI
>P19581 ~~~capC~~~Capsule biosynthesis protein CapC~~~
MFGSDLYIALVLGVTLSLIFTERTGILPAGLVVPGYLALVFNQPVFMLVVLFISILTYVIVTYGVSRFMILYGRRKFAAT
LITGICLKLLFDYCYPVMPFEIFEFRGIGVIVPGLIANTIQRQGLPLTIGTTILLSGATFAIMNIYYLF
>Q51693 2.3.2.-~~~capD~~~Capsule biosynthesis protein CapD proenzyme~~~
MNSFKWGKKIILFCLIVSLMGGIGVSCSFNKIKDSVKQKIDSMGDKGTYGVSASHPLAVEEGMKVLKNGGSAVDAAIVVS
YVLGVVELHASGIGGGGGMLIISKDKETFIDYRETTPYFTGNQKPHIGVPGFVAGMEYIHDNYGSLPMGELLQPAINYAE
KGFKVDDSLTMRLDLAKPRIYSDKLSIFYPNGEPIETGETLIQTDLARTLKKIQKEGAKGFYEGGVARAISKTAKISLED
IKGYKVEVRKPVKGNYMGYDVYTAPPPFSGVTLLQMLKLAEKKEVYKDVDHTATYMSKMEEISRIAYQDRKKNLGDPNYV
NMDPNKMVSDKYISTMKNENGDALSEAEHESTTHFVIIDRDGTVVSSTNTLSNFFGTGKYTAGFFLNNQLQNFGSEGFNS
YEPGKRSRTFMAPTVLKKDGETIGIGSPGGNRIPQILTPILDKYTHGKGSLQDIINEYRFTFEKNTAYTEIQLSSEVKNE
LSRKGLNVKKKVSPAFFGGVQALIKDERDNVITGAGDGRRNGTWKSNK
>Q9ZDJ5 5.1.3.2~~~capD~~~UDP-glucose 4-epimerase~~~COG1086
MFVDKTLMITGGTGSFGNAVLSRFLKSNIINDIKEIRIFSRDEKKQEDMRIALNNSKLKFYIGDVRNYQSIDDAMHGVDY
VFHAAALKQVPTCEFYPMEAINTNVLGAENVLSAAINNKVTKVIVLSTDKAVYPINAMGLSKALMEKLAIAKARMRSPGE
TILCVTRYGNVMASRGSVIPLFIHQIKQGKELTITEPSMTRFLMSLVDSVDLVLYAFEHGRQGDIFVQKSPASTIEVLAK
ALQEIFGSKNAIRFIGTRHGEKHYESLVSSEDMAKADDLGGYYRIPMDGRDLNYAKYFVTGEKKVALLDDYTSHNTKRLN
LKEVKELLLTLDYVQKELKNA
>Q6XGD4 3.1.1.32~~~capE~~~cUMP-AMP-activated phospholipase~~~
MTYSVSPSSLLTEYGNDNICRVLALDGGGAKGFYTLGVLKEIEAMLGCPLYKRFDLVFGTSTGAIIAALIALGYEVDQIH
ALYTEHVPRVMSSRSAAARTMALQDLAKEVFQDKTFEDVLMGIGIVATRWMTERPMIFKGNVVQAHGRKGTFSPGFGVSI
ADAVQASCSAYPFFERKVIVTAAGDKVELIDGGYCANNPTLFAIADATVALKKDHKDIRVINVGVGIYPEPKPGLLMRIA
KKWLAVQLLQKTLEINTQSMDQLRDILFKDIPTIRISDTFERPEMATDLLEYNLDKLNTLRQRGRESFGAREAQLREFLI
>A0A2W0EVE0 3.5.2.-~~~capA~~~Caprolactamase subunit alpha~~~
MSKQQYRLGIDAGGTFTDFILADHQGNVQLFKAPSTPHDGTLAIRNGLAQIADALGRTPAEIIADCDLCINGTTVALNAL
IEKTGVKVGLLCTDGHEDSLEIRLGHKEDGHRYDATYPPAHMLVPRHLRRPIGGRIISDGSEFSPLDEAAIHAAIDYFRE
QQVQAVAISFVWSVRNPSHEQRAMAMVRAALPDVFVCSGHEVFPQIREYTRTSTTVVNAYLSPVMGRYIERIDALFEELG
AQQPTRYFQSNGGLAPGVVMRERAVNAINSGPASAPQAGLCVAQPFGIDNVITVDMGGTSFDITLSKGGRTNFSKDSDFL
RYRIGVPMIQVETLGAGGGSIAHLDDFGMLQVGPRSAGANPGPVCYGKGGVEPTVTDANLALGYLADGALLGGSIRLNRQ
AAIDAIRSKIAEPLGISVERAAVGIITLVNLSMVSGIRRVSIERGYDPRDFALIGAGGAAGMHVMRLAEEIGSKVVLIPK
VASGLCAFGQILSDIRYDQLTTLPMRLDDEFVDLEQLNQALQQLRERGMTNLRDDGFGGDNRIECQYSLEIRYLGQIHEC
SVELSCDRLDRSSLAALRESFHQRHKALFSYSEPNSPVELVNLECSVIARLQRPPMPELATPLKATAAIPAGHRPMLFNA
QDDWQDTPVYNGDRIEVGQIIQGPCVIEEATTNILVPPGWRVSLDPSATYELTPGH
>A0A2W0FH34 3.5.2.-~~~capB~~~Caprolactamase subunit beta~~~
MNTVDPITLAVVRGALETAQREMTLTLEKTSRSSVFNLAHDYSNALFDHLPEMILQGQDIPIHLGSLIPAMKCVAGFFGD
EIAEGDVIYHNDPAYMGSHILDCCMYKPVFYKGELVFWTVCKGHLTDIGGPVPAGYNPDAKEIYAEGLRIPPVKLWAQGQ
RREDVINLLLTNMRARAYQEGDLNAQYGACSVGERHLIELLDRYGVDQVRACITELKDMADRHMRALLRDVPDGFYSGTA
ILEDSGHGLGELSITAQVEIRGDEAHVLIESPPQVPYFINSYAGNSISGVYLGLMMFAQVPPPYNEGLYRCVSVDLGPSG
TLCNAQEPAPHVNCTTTPMETLADAVRLALEQAAPERVTASWGHASGINIAGHDPRNNNDEYVTMVLASVISGAGANKAM
DGWPACGPLCCFGALMSGDIELLEYSYPVLIHRYSLMTDSGGAGEFRGGSGTRLELEPLKHAMTVVGFGEGRQLPTAGAA
GAKNVLLEPKLGRLIHRHVDGEEDHYIQNTLLTAQPGERVINVNPGGGGYGDPLRRPLATVLADVRNGLVSIDGARLEYG
VVIDGNGQLDEAATHAHRAAH
>Q8XLE8 4.1.1.31~~~ppcA~~~Phosphoenolpyruvate carboxylase~~~
MKIPCSMMTQHPDNVETYISIQQEPAEAIKGLTPQDKGGLGIEEVMIDFEGKLTPYHQTSQIALGLISNGIIPGKDVRVT
PRIPNANKESVFRQLMSIMSIIETNVQSKELTGTPAISEVVVPMIETGKEISEFQDRVNSVVDMGNKNYKTKLDLNSVRI
IPLVEDVPALANIDRILDEHYEIEKSKGHILKDLRIMIARSDTAMSYGLISGVLSVLMAVDGAYKWGEKHGVTISPILGC
GSLPFRGHFSEENIDEILATYSGIKTFTFQSALRYDHGEEATKHAVRELKEKIAQSKPRNFSEEDKDLMKEFIGICSKHY
LQTFLKVIDTVSFVSDFIPKNRDRLTKAKTGLEYNREVANLDNVADLVKDEVLKQEILSIDNSKEYAVPRAISFTGAMYT
LGMPPELMGMGRALNEIKTKYGQEGIDKLLEIYPILRKDLAFAARFANGGVSKKIIDEEARQEYKEDMKYVNEILNLGLD
YDFLNENEFYHTLLKTTKPIIMHLMGLEENVMRNSTEELKILNEWIVRMGKVRGSIG
>P12880 4.1.1.31~~~ppc~~~Phosphoenolpyruvate carboxylase~~~COG2352
MTDFLRDDIRFLGQILGEVIAEQEGQEVYELVEQARLTSFDIAKGNAEMDSLVQVFDGITPAKATPIARAFSHFALLANL
AEDLYDEELREQALDAGDTPPDSTLDATWLKLNEGNVGAEAVADVLRNAEVAPVLTAHPTETRRRTVFDAQKWITTHMRE
RHALQSAEPTARTQSKLDEIEKNIRRRITILWQTALIRVARPRIEDEIEVGLRYYKLSLLEEIPRINRDVAVELRERFGE
GVPLKPVVKPGSWIGGDHDGNPYVTAETVEYSTHRAAETVLKYYARQLHSLEHELSLSDRMNKVTPQLLALADAGHNDVP
SRVDEPYRRAVHGVRGRILATTAELIGEDAVEGVWFKVFTPYASPEEFLNDALTIDHSLRESKDVLIADDRLSVLISAIE
SFGFNLYALDLRQNSESYEDVLTELFERAQVTANYRELSEAEKLEVLLKELRSPRPLIPHGSDEYSEVTDRELGIFRTAS
EAVKKFGPRMVPHCIISMASSVTDVLEPMVLLKEFGLIAANGDNPRGTVDVIPLFETIEDLQAGAGILDELWKIDLYRNY
LLQRDNVQEVMLGYSDSNKDGGYFSANWALYDAELQLVELCRSAGVKLRLFHGRGGTVGRGGGPSYDAILAQPRGAVQGS
VRITEQGEIISAKYGNPETARRNLEALVSATLEASLLDVSELTDHQRAYDIMSEISELSLKKYASLVHEDQGFIDYFTQS
TPLQEIGSLNIGSRPSSRKQTSSVEDLRAIPWVLSWSQSRVMLPGWFGVGTALEQWIGEGEQATQRIAELQTLNESWPFF
TSVLDNMAQVMSKAELRLAKLYADLIPDTEVAERVYSVIREEYFLTKKMFCVITGSDDLLDDNPLLARSVQRRYPYLLPL
NVIQVEMMRRYRKGDQSEQVSRNIQLTMNGLSTALRNSG
>P00864 4.1.1.31~~~ppc~~~Phosphoenolpyruvate carboxylase~~~COG2352
MNEQYSALRSNVSMLGKVLGETIKDALGEHILERVETIRKLSKSSRAGNDANRQELLTTLQNLSNDELLPVARAFSQFLN
LANTAEQYHSISPKGEAASNPEVIARTLRKLKNQPELSEDTIKKAVESLSLELVLTAHPTEITRRTLIHKMVEVNACLKQ
LDNKDIADYEHNQLMRRLRQLIAQSWHTDEIRKLRPSPVDEAKWGFAVVENSLWQGVPNYLRELNEQLEENLGYKLPVEF
VPVRFTSWMGGDRDGNPNVTADITRHVLLLSRWKATDLFLKDIQVLVSELSMVEATPELLALVGEEGAAEPYRYLMKNLR
SRLMATQAWLEARLKGEELPKPEGLLTQNEELWEPLYACYQSLQACGMGIIANGDLLDTLRRVKCFGVPLVRIDIRQEST
RHTEALGELTRYLGIGDYESWSEADKQAFLIRELNSKRPLLPRNWQPSAETREVLDTCQVIAEAPQGSIAAYVISMAKTP
SDVLAVHLLLKEAGIGFAMPVAPLFETLDDLNNANDVMTQLLNIDWYRGLIQGKQMVMIGYSDSAKDAGVMAASWAQYQA
QDALIKTCEKAGIELTLFHGRGGSIGRGGAPAHAALLSQPPGSLKGGLRVTEQGEMIRFKYGLPEITVSSLSLYTGAILE
ANLLPPPEPKESWRRIMDELSVISCDVYRGYVRENKDFVPYFRSATPEQELGKLPLGSRPAKRRPTGGVESLRAIPWIFA
WTQNRLMLPAWLGAGTALQKVVEDGKQSELEAMCRDWPFFSTRLGMLEMVFAKADLWLAEYYDQRLVDKALWPLGKELRN
LQEEDIKVVLAIANDSHLMADLPWIAESIQLRNIYTDPLNVLQAELLHRSRQAEKEGQEPDPRVEQALMVTIAGIAAGMR
NTG
>A0QWX4 4.1.1.31~~~ppc~~~Phosphoenolpyruvate carboxylase~~~COG2352
MADSNDTALEPFGSVQRTHIGREASEPMREDIRLLGAILGDTVREQNGEEVFDLVERARVESFRVRRSEIDRSELADMFS
GVDAHQAIPVIRAFTHFALLANVAEDIHRERRRAVHVAAGKPPQDSSLAATYRKLDAADLDVDKVADTLTGALVSPVITA
HPTETRRRTVFDTQHRITELMRLRLHGHTRTDDNRDIETELRRHILTLWQTALIRLSRLKISDEIETGLRYYEAAFFDVI
PQVNAEVRDALRKRWPDAKLLEEPILRPGSWIGGDRDGNPNVTPEVVRHATGRAAYVALAHYFEQITALEQELSMSARLV
KVTPALAALADACHEPARADEPYRRALRVIHARLTSTAREILDEQPEHGLDLGLPRYQTPAEFLADLDAVDGSLRANGSR
VLADDRLGRLREAVRVFGFHLSGLDMRQNSDVHEEVVAELLAWAGVHPDYTSLSEPQRVELLAAEIATRRPLIREGAELS
ELAQKELGIVAAAARAVKVFGPQAVPNYIISMCQSVSDMLEAAVLLKEAGLLDISGSTPYAPVGVVPLFETIDDLQRGSS
ILEAALDLPEYRTMVDARDGHQEVMLGYSDSNKDGGYLAANWALYRAELDLVESARKTGIRLRLFHGRGGTVGRGGGPSY
DAILAQPPGAVKGSLRITEQGEVIAAKYAEPRIAHRNLETLLAATLEASLLDVEGLGEEAEPAYQVLDELAALAQRAYSE
LVHETPGFVEYFKTSTPVSEIGALNIGSRPTSRKPTTSIADLRAIPWVLAWSQSRVMLPGWYGTGSAFENWIGTDPDGAR
LRVLQDLYARWPFFRTVLSNMAQVLAKADMGLAARYSELVEDADLRARVFDKIVAEHDRTIRMHRLITGQDDLLADNAAL
ARSVFNRFPYLEPLNHLQVELLRRYRSGETDELVQRGILLTMSGLATALRNSG
>Q59757 4.1.1.31~~~ppc~~~Phosphoenolpyruvate carboxylase~~~
MLPPLQIEIEGTGISRPLSEHVNLLGGLLGQVIQEMAGPEMLELVETLRRLCKQAAQENRPEFREQAYTRIHSATYDELL
WLLRAYTAFFHLVNQAEQQEIIRINRERAQQSTPERPRPESIDEAILALKQQGRTLDDVLTLLERLDIQPTVTAHPTEAR
RRSILYKQQHIAQMLSQQRRCQLTPEEQETLLLDLHNQITLLLGTAEVREERPTVRDEVEQGLYFIQSTIWEAVPRIYED
VRRALRRYYGADVDFRPFLRYRSWIGSDRDGNPYVTPEITRWTALTQRRLALQRYMEELRQLRRRLSLSDRYVAPPEELR
RSLARDAREVSLPPHVLRQFRHESFRLKISYIMGRLHGLLQALDDPTQPAPDYDADAFVEDLRLLQRCLEACGLERIARH
DQLTRLLVLAQTFGFHLVTLDVRQHSSVHEAAVAELLRLAGVENDYRALPESRRQELLAEELSNPRPLLPPGARVSEATR
QVLETFAVIRELVQLDPRLVGSYIVSMTHTVSDLLEPMLLAKEVGLWHYERDPRTGKPGHVRCPIDFVPLFETIEDLEAA
ASRMEAILSHPVYRMQVAARGGFQEIMLGYSDSTKDGGYWMANWALHRAQEQLAEVCLRHGVDFRLFHGRGGTVGRGGGR
ANQAILAMPPVVHNGRIRFTEQGEVISFRYALPEIAHRHLEQIVNAMLRVVGLPAASGTDGTDPATRNRLMDELAARSMR
AYRRLIDAPDFWSWYTRITPIDQISRLPIASRPVSRSSAREVDFESLRAIPWVFAWTQVRYLIPGWFGIGQALDELLQTS
PEHLETLRTWYRSWPFFRTVLQNAQREMVRARLEIAAYYDRLLGDGPTAFHQMIEEDYHRARTAILRITDQESLLDHDPI
IRKSVQLRNPYTDVLNLVQLELMRRIRSGAEADREPLRRALFLSINGIAAAMQSTG
>Q9RNU9 4.1.1.31~~~ppc~~~Phosphoenolpyruvate carboxylase~~~COG2352
MSSADDQTTTTTSSELRADIRRLGDLLGETLVRQEGPELLELVEKVRRLTREDGEAAAELLRGTELETAAKLVRAFSTYF
HLANVTEQVHRGRELGAKRAAEGGLLARTADRLKDADPEHLRETVRNLNVRPVFTAHPTEAARRSVLNKLRRIAALLDTP
VNESDRRRLDTRLAENIDLVWQTDELRVVRPEPADEARNAIYYLDELHLGAVGDVLEDLTAELERAGVKLPDDTRPLTFG
TWIGGDRDGNPNVTPQVTWDVLILQHEHGINDALEMIDELRGFLSNSIRYAGATEELLASLQADLERLPEISPRYKRLNA
EEPYRLKATCIRQKLENTKQRLAKGTPHEDGRDYLGTAQLIDDLRIVQTSLREHRGGLFADGRLARTIRTLAAFGLQLAT
MDVREHADAHHHALGQLFDRLGEESWRYADMPREYRTKLLAKELRSRRPLAPSPAPVDAPGEKTLGVFQTVRRALEVFGP
EVIESYIISMCQGADDVFAAAVLAREAGLIDLHAGWAKIGIVPLLETTDELKAADTILEDLLADPSYRRLVALRGDVQEV
MLGYSDSSKFGGITTSQWEIHRAQRRLRDVAHRYGVRLRLFHGRGGTVGRGGGPTHDAILAQPWGTLEGEIKVTEQGEVI
SDKYLIPALARENLELTVAATLQASALHTAPRQSDEALARWDAAMDVVSDAAHTAYRHLVEDPDLPTYFLASTPVDQLAD
LHLGSRPSRRPGSGVSLDGLRAIPWVFGWTQSRQIVPGWYGVGSGLKALREAGLDTVLDEMHQQWHFFRNFISNVEMTLA
KTDLRIAQHYVDTLVPDELKHVFDTIKAEHELTVAEVLRVTGESELLDADPVLKQTFTIRDAYLDPISYLQVALLGRQRE
AAAANEDPDPLLARALLLTVNGVAAGLRNTG
>P51060 4.1.1.31~~~ppc~~~Phosphoenolpyruvate carboxylase~~~
MSDPFEALKAEVDLLGRLLGEAIRKVSGERFFALVEEVRLLSKARRQGDGAAAEVLSQRVERMPVEEMEALVRAFTHYFH
LVNLAEERHRVRVNRLRTEGETLENPRPEGFLALAKALKERGLSLEEAEAHLNRLALLLTFTAHPTETRRRTLRHHLERL
QEELEGGDRERLLARVVLLYATEEVRKARPSVEDEIKGGLYYLPTTLWRAIPKVVEGLEAALERVYGKRPHLRSPVRFRS
WMGGDRDGNPYVTPEVTAFAGRYAREVAKGRYLEELEALVRDLSLSEARIPVPKEVREGGEGVERFPGEPYRRYFAALYR
ALEGEALSTEGLARALKVAEKGLEGVGLAQVAQAFLRPLEARLSAFGLELAPLDLREESGKLLEAAAELLRLGGVHPDFL
ALSPEEKEALLTEELKTARPLLPVGEVPQGEALRVALGALRAWGDKGAHVVSMTHHPADLLAVFLLAREVGLYRPGKPLP
FDVVPLFETLEDLERAPEVLRRLLANPVFRAHAQGRGGVEVMIGYSDSNKDAGFLMANLALYQAQEALHAVGEAQGIPVF
FFHGRGTSTARGGGPAGRAIAGLPPKSVGHRLRLTEQGEALADRYAHPDLAVRHLEQLLYHFAQAALGDGVEPKAHWREA
LGEAGERSMARYRALLSQEGFFPFFEAFTPIREIGELPIASRPVYRHGRVRDIRDLRAIPWVMAWTQVRLLLPGWYGLSA
LEGLPMPLLREMYREWPFFATTLESAAMALAKADLGIAERYLKLVPEGLQGFYHHLAEEYRRTVALLEAIFEAPLLHNQK
TLERQIALRNPYVDPINFVQVELLARYRAPGGREDEGVRRALLLSLLGVAAGLRNAG
>P0DTE9 3.1.1.32~~~capV~~~cGAMP-activated phospholipase~~~
MSDVSAVDKPRVRVLSLNGGGARGMFTISILAEIERILARKHPHQDIKIGDYFDLITGTSIGGILALGLATGKSARELES
VFFDKAKDIFPTRWSLVNLCKALCAPIYNSSPLRETIEMMIGAETTFNDLTRRVMIPAVNLSTGKPLFFKTPHNPDFTRD
GPLKLIDAALATSAAPTYFAPHYCKDLRSYFADGGLVANNPSYIGLLEVFRDMKSDFDVSHKDVYILNIGTVGEDYSLSP
SLLSKKRWTGYCHLWGMGKRLVLTTMTANQHLHKNMLLRELALHDALDNYLYLDEVIPNEAASDITLDNASDSSLQNLSA
RGKQLANVQFAQNQKLKNFFISPAKPFKRTDVQEKL
>Q9KVG8 3.1.1.-~~~capV~~~cGAMP-activated phospholipase~~~COG3621
MPNPPEYEHLKNQVRILSLNGGGARGLFTISLLAEIERIIEEKQGINGFKVGDYFDLITGTSIGGILALGLAYGKSAREL
EDVFRKQAGYIFPEQKYPRFFPVFRRRYRLARGPLYDSKPLAKTIASMVGEESTFNDLKCRVLIPTVNLSTGKPQFFKTP
HNPEFHRDGRIKLIDAALATSAAPTYFAPHYCVDLDSYFADGGLVANNPSFIGLHEVFRDMATDFPEAKVSDVKILNVGT
LGEEYSLSPSSLAGKSGYLGLWGMGERLVLSAMAANQELHKAMLLREFATHDAIGNFVRLDNNIPHEAASDITLDNASAS
SLSNLASRGRQLATEEFTKNKALADFFKVPARKFK
>P0DQM5 ~~~CapA~~~Capistruin~~~
MVRLLAKLLRSTIHGSNGVSLDAVSSTHGTPGFQTPDARVISRFGFN
>Q8G8B6 1.14.12.22~~~carAa~~~Carbazole 1,9a-dioxygenase, terminal oxygenase component CarAa~~~
MANVDEAILKRVKGWAPYVDAKLGFRNHWYPVMFSKEIDEGEPKTLKLLGENLLVNRIDGKLYCLKDRCLHRGVQLSVKV
ECKTKSTITCWYHAWTYRWEDGVLCDILTNPTSAQIGRQKLKTYPVQEAKGCVFIYLGDGDPPPLARDTPPNFLDDDMEI
LGKNQIIKSNWRLAVENGFDPSHIYIHKDSILVKDNDLALPLGFAPGGDRKQQTRVVDDDVVGRKGVYDLIGEHGVPVFE
GTIGGEVVREGAYGEKIVANDISIWLPGVLKVNPFPNPDMMQFEWYVPIDENTHYYFQTLGKPCANDEERKNYEQEFESK
WKPMALEGFNNDDIWAREAMVDFYADDKGWVNEILFEVDEAIVAWRKLASEHNQGIQTQAHVSG
>D5IGG0 1.14.12.22~~~carAa~~~Carbazole 1,9a-dioxygenase, terminal oxygenase component CarAa~~~
MANQPSIAERRTKVWEPYIRAKLGFRNHWYPVRLASEIAEGTPVPVKLLGEKILLNRVGGKVYAIQDRCLHRGVTLSDRV
ECYSKNTISCWYHGWTYRWDDGRLVDILTNPGSVQIGRRALKTFPVEEAKGLIFVYVGDGEPTPLIEDVPPGFLDENRAI
HGQHRLVASNWRLGAENGFDAGHVLIHKNSILVKGNDIILPLGFAPGDPDQLTRSEVAAGKPKGVYDLLGEHSVPVFEGM
IEGKPAIHGNIGSKRVAISISIWLPGVLKVEPWPDPELTQFEWYVPVDETSHLYFQTLGKVVTSKEAADSFEREFHEKWV
GLALNGFNDDDIMARESMEPFYTDDRGWSEEILFEPDRAIIEWRGLASQHNRGIQEAR
>Q8GI16 ~~~carAc~~~Ferredoxin CarAc~~~
MNQIWLKVCAASDMQPGTIRRVNRVGAAPLAVYRVGDQFYATEDTCTHGIASLSEGTLDGDVIECPFHGGAFNVCTGMPA
SSPCTVPLGVFEVEVKEGEVYVAGEKK
>D5IGG4 ~~~carAc~~~Ferredoxin CarAc~~~
MTAKVRVIFRAAGGFEHLVETEAGVSLMEAAVLNGVDGIEAVCGGACACATCHVYVGPEWLDALKPPSETEDEMLDCVAE
RAPHSRLSCQIRLTDLLDGLTLELPKAQS
>Q8GI14 1.18.1.2~~~carAd~~~Ferredoxin--NAD(P)(+) reductase CarAd~~~
MYQLKIEGQAPGTCGSGKSLLVSALANGIGFPYECASGGCGVCKFELLEGNVQSMWPDAPGLSSRDREKGNRHLACQCVA
LSDLRIKVAVQDKYVPTIPISRMEAEVVEVRALTHDLLSVRLRTDGPANFLPGQFCLVEAEQLPGVVRAYSMANLKNPEG
IWEFYIKRVPTGRFSPWLFENRKEGARLFLTGPMGTSFFRPGTGRKSLCIGGGAGLSYAAAIARASMRETDKPVKLFYGS
RTPRDAVRWIDIDIDEDKLEVVQAVTEDTDSLWQGPTGFIHQVVDAALLETLPEYEIYLAGPPPMVDATVRMLLGKGVPR
DQIHFDAFF
>F1CYZ5 2.8.3.23~~~carA~~~Caffeate CoA-transferase~~~
MAKFISAKEAAKLIPDGSTVGVAGMGLAGWPEEVAVAIADNFKETGHPCNLTMKQGSAMGDWRERGMTRLGLEGLVTKWS
AAHIGSAFAMNDLVRAEKMACHCLPQGVIVNLWREIAAKRPGLITKVGLGTFVDPRLEGGKMNKVTTEDLVELIEFNGEE
YLFYKSFKLDVAMLRGTTADENGNITFENEGPINEGLAVAQAAKNSGGIVIVQVEYQALKNTLKPKDVKIPGALVDYVVV
ATDKNACWQTEGVYYEPAFAGNLRKPLSAIPILPLTERKVMARRAAMELSKGDLVNLGVGIPSDVASIVSEAGYIEEITM
TTEIGGFGGIPASLPNFGSSYNAEANIDHGSMFDLYDGGGIDVAVLGLAQADEAGNINVSKFTIPGLGDRLTGPGGFINI
TQSTQKVVFAGSFNAKCEVEISDGKLIIKKEGRGKKLLKEVEQVTFSGKYAAENGQEILYVTERCVFKLINGKMTVIEIA
PGIDLQKDILDQMDFTPAISADLKEMDSGLFSEKWDGLDTIMGK
>P25993 6.3.5.5~~~pyrAA~~~Carbamoyl-phosphate synthase pyrimidine-specific small chain~~~COG0505
MKRRLVLENGAVFEGEAFGSLEHNMGEVVFNTGMTGYQEILSDPSYCGQIVTLTYPLIGNYGINRDDFESITPFVKGLII
KELCELPSNWRSAYTLDEYLKMKNIPGLQGIDTRKLTRMIRTAGALKGTFASSDEDIEAVLKRLNETELPRNQVSQVSAK
TAYPSPGRGKRIVLVDFGMKHGILRELNKRKCDVIVVPYNITAEEVLQLKPDGIMLSNGPGDPKDVPEAIEMIKGVLGKV
PLFGICLGHQLFALACGANTEKMKFGHRGSNHPVKELATGKVALTSQNHGYTVSSISKTELEVTHIAINDDTIEGLKHKT
LPAFTVQYHPEASPGPEDANHLFDRFIEMIETTEKEGEAVCQNA
>Q726J4 6.3.5.5~~~carA~~~Carbamoyl-phosphate synthase small chain~~~COG0505
MRALLALEDGFVLEGRSFTGPGETGGEAIFNTGMTGYQEVLTDPSYAGQMVCMTYPLVGNYGVTREDMESGKVHVEAFIV
KECCKVPSNWRSEISLPDYLKRHGVMGIEGIDTRALTRHLRIHGAMRGVISTQETDPARLVERARALPSMEGQNLVTRVA
PAAPYRWDGERPQAVTLEPGGCAWVGKGPRLVVYDFGIKWNILRLLAQQGFDMLVVPPSFKAADVAAVGAQAVFLSNGPG
DPATLKDEIAEIAKLAQTYPTAGICLGHQLLGHALGGRTMKLKFGHHGCNHPVKDLTTGRIEISSQNHGFCVDIDSLTDV
EITHVNLNDGTLEGFAHKTKPVIAVQHHPEASPGPNDSRYFFARFRNMVREAAGC
>P0A6F1 6.3.5.5~~~carA~~~Carbamoyl-phosphate synthase small chain~~~COG0505
MIKSALLVLEDGTQFHGRAIGATGSAVGEVVFNTSMTGYQEILTDPSYSRQIVTLTYPHIGNVGTNDADEESSQVHAQGL
VIRDLPLIASNFRNTEDLSSYLKRHNIVAIADIDTRKLTRLLREKGAQNGCIIAGDNPDAALALEKARAFPGLNGMDLAK
EVTTAEAYSWTQGSWTLTGGLPEAKKEDELPFHVVAYDFGAKRNILRMLVDRGCRLTIVPAQTSAEDVLKMNPDGIFLSN
GPGDPAPCDYAITAIQKFLETDIPVFGICLGHQLLALASGAKTVKMKFGHHGGNHPVKDVEKNVVMITAQNHGFAVDEAT
LPANLRVTHKSLFDGTLQGIHRTDKPAFSFQGHPEASPGPHDAAPLFDHFIELIEQYRKTAK
>P9WPK5 6.3.5.5~~~carA~~~Carbamoyl-phosphate synthase small chain~~~COG0505
MSKAVLVLEDGRVFTGRPFGATGQALGEAVFSTGMSGYQETLTDPSYHRQIVVATAPQIGNTGWNGEDSESRGERIWVAG
YAVRDPSPRASNWRATGTLEDELIRQRIVGIAGIDTRAVVRHLRSRGSMKAGVFSDGALAEPADLIARVRAQQSMLGADL
AGEVSTAEPYVVEPDGPPGVSRFTVAALDLGIKTNTPRNFARRGIRCHVLPASTTFEQIAELNPHGVFLSNGPGDPATAD
HVVALTREVLGAGIPLFGICFGNQILGRALGLSTYKMVFGHRGINIPVVDHATGRVAVTAQNHGFALQGEAGQSFATPFG
PAVVSHTCANDGVVEGVKLVDGRAFSVQYHPEAAAGPHDAEYLFDQFVELMAGEGR
>Q50899 ~~~carA~~~HTH-type transcriptional repressor CarA~~~
MTLRIRTIARMTGIREATLRAWERRYGFPRPLRSEGNNYRVYSREEVEAVRRVARLIQEEGLSVSEAIAQVKTEPPREQP
EAERLRERFWSSVGALEGDEVTRVLDDAQTVMDVEAYCDGFLLPLLREMGVRLDVAREHLASALIRQRLRQVYDALSPAP
AGPRALLACPSGDHHEGGLLVLGIHLKRKGWRVTMLGADTPAAALQGACVQVRPDVVALSFVRARAPEEFASVLEDALRA
CAPFPVVVGGLGAREHLKAIFSLGAQYAESSEELVAIWNQVRNAQNRP
>Q9XB61 6.3.3.6~~~carA~~~Carbapenam-3-carboxylate synthase~~~
MSNSFCVVYKGSDTDINNIQRDFDGKGEALSNGYLFIEQNGHYQKCEMERGTAYLIGSLYNRTFLIGLAGVWEGEAYLAN
DAELLALLFTRLGANALALAEGDFCFFIDEPNGELTVITESRGFSPVHVVQGKKAWMTNSLKLVTAAEGEGALWFEEEAL
VCQSLMRADTYTPVKNAQRLKPGAVHVLTHDSEGYSFVESRTLTTPASNQLLALPREPLLALIDRYLNAPLEDLAPRFDT
VGIPLSGGLDSSLVTALASRHFKKLNTYSIGTELSNEFEFSQQVADALGTHHQMKILSETEVINGIIESIYYNEIFDGLS
AEIQSGLFNVYRQAQGQVSCMLTGYGSDLLFGGILKPGAQYDNPNQLLAEQVYRTRWTGEFATHGASCYGIDIRHPFWSH
SLISLCHALHPDYKIFDNEVKNILREYADSLQLLPKDIVWRKKIGIHEGSSVNQAFANVLGSTVDNYQTKSRFTYRVYQA
FLRGRLSITDVTPSQLKDLIKKD
>P99147 6.3.5.5~~~carA~~~Carbamoyl-phosphate synthase small chain~~~
MQSKRYLVLEDGSFYEGYRLGSDNLTVGEIVFNTAMTGYQETISDPSYTGQIITFTYPLIGNYGINRDDFESLVPTLNGI
VVKEASAHPSNFRQQKTLHDVLELHQIPGIAGVDTRSITRKIRQHGVLKAGFTDRKEDIDQLVKHLQQVELPKNEVEIVS
TKTPYVSTGKDLSVVLVDFGKKQNIVRELNVRGCNVTVVPYTTTAEEILAMAPDGVMLSNGPGNPEVVECAIPMIQGILG
KIPFFGICLGHQLFALSQGASSFKMKFGHRGANHPVKNLETGKVDITSQNHGYAIDIDSLKSTDLEVTHLALNDGTVEGL
KHKTLPAFSVQYHPEANPGPSDSNYLFDDFVAMMTNFKEKERHINA
>P25994 6.3.5.5~~~pyrAB~~~Carbamoyl-phosphate synthase pyrimidine-specific large chain~~~COG0458
MPKRVDINKILVIGSGPIIIGQAAEFDYAGTQACLALKEEGYEVILVNSNPATIMTDTEMADRVYIEPLTPEFLTRIIRK
ERPDAILPTLGGQTGLNLAVELSERGVLAECGVEVLGTKLSAIQQAEDRDLFRTLMNELNEPVPESEIIHSLEEAEKFVS
QIGFPVIVRPAYTLGGTGGGICSNETELKEIVENGLKLSPVHQCLLEKSIAGYKEIEYEVMRDSQDHAIVVCNMENIDPV
GIHTGDSIVVAPSQTLSDREYQLLRNVSLKLIRALGIEGGCNVQLALDPDSFQYYIIEVNPRVSRSSALASKATGYPIAK
LAAKIAVGLSLDEMMNPVTGKTYAAFEPALDYVVSKIPRWPFDKFESANRKLGTQMKATGEVMAIGRTLEESLLKAVRSL
EADVYHLELKDAADISDELLEKRIKKAGDERLFYLAEAYRRGYTVEDLHEFSAIDVFFLHKLFGIVQFEKELKANAGDTD
VLRRAKELGFSDQYISREWKMKESELYSLRKQAGIAPVFKMVDTCAAEFESETPYFYSTYEEENESVVTDKKSVMVLGSG
PIRIGQGVEFDYATVHSVWAIKQAGYEAIIVNNNPETVSTDFSISDKLYFEPLTIEDVMHIIDLEQPMGVVVQFGGQTAI
NLADELSARGVKILGTSLEDLDRAEDRDKFEQALGELGVPQPLGKTATSVNQAVSIASDIGYPVLVRPSYVLGGRAMEIV
YHEEELLHYMKNAVKINPQHPVLIDRYLTGKEIEVDAVSDGETVVIPGIMEHIERAGVHSGDSIAVYPPQSLTEDIKKKI
EQYTIALAKGLNIVGLLNIQFVLSQGEVYVLEVNPRSSRTVPFLSKITGIPMANLATKIILGQKLAAFGYTEGLQPEQQG
VFVKAPVFSFAKLRRVDITLGPEMKSTGEVMGKDSTLEKALYKALIASGIQIPNYGSVLLTVADKDKEEGLAIAKRFHAI
GYNILATEGTAGYLKEASIPAKVVGKIGQDGPNLLDVIRNGEAQFVINTLTKGKQPARDGFRIRRESVENGVACLTSLDT
AEAILRVLESMTFRADQMPAVNTNQEAAVTI
>P00968 6.3.5.5~~~carB~~~Carbamoyl-phosphate synthase large chain~~~COG0458
MPKRTDIKSILILGAGPIVIGQACEFDYSGAQACKALREEGYRVILVNSNPATIMTDPEMADATYIEPIHWEVVRKIIEK
ERPDAVLPTMGGQTALNCALELERQGVLEEFGVTMIGATADAIDKAEDRRRFDVAMKKIGLETARSGIAHTMEEALAVAA
DVGFPCIIRPSFTMGGSGGGIAYNREEFEEICARGLDLSPTKELLIDESLIGWKEYEMEVVRDKNDNCIIVCSIENFDAM
GIHTGDSITVAPAQTLTDKEYQIMRNASMAVLREIGVETGGSNVQFAVNPKNGRLIVIEMNPRVSRSSALASKATGFPIA
KVAAKLAVGYTLDELMNDITGGRTPASFEPSIDYVVTKIPRFNFEKFAGANDRLTTQMKSVGEVMAIGRTQQESLQKALR
GLEVGATGFDPKVSLDDPEALTKIRRELKDAGADRIWYIADAFRAGLSVDGVFNLTNIDRWFLVQIEELVRLEEKVAEVG
ITGLNADFLRQLKRKGFADARLAKLAGVREAEIRKLRDQYDLHPVYKRVDTCAAEFATDTAYMYSTYEEECEANPSTDRE
KIMVLGGGPNRIGQGIEFDYCCVHASLALREDGYETIMVNCNPETVSTDYDTSDRLYFEPVTLEDVLEIVRIEKPKGVIV
QYGGQTPLKLARALEAAGVPVIGTSPDAIDRAEDRERFQHAVERLKLKQPANATVTAIEMAVEKAKEIGYPLVVRPSYVL
GGRAMEIVYDEADLRRYFQTAVSVSNDAPVLLDHFLDDAVEVDVDAICDGEMVLIGGIMEHIEQAGVHSGDSACSLPAYT
LSQEIQDVMRQQVQKLAFELQVRGLMNVQFAVKNNEVYLIEVNPRAARTVPFVSKATGVPLAKVAARVMAGKSLAEQGVT
KEVIPPYYSVKEVVLPFNKFPGVDPLLGPEMRSTGEVMGVGRTFAEAFAKAQLGSNSTMKKHGRALLSVREGDKERVVDL
AAKLLKQGFELDATHGTAIVLGEAGINPRLVNKVHEGRPHIQDRIKNGEYTYIINTTSGRRAIEDSRVIRRSALQYKVHY
DTTLNGGFATAMALNADATEKVISVQEMHAQIK
>O32771 6.3.5.5~~~carB~~~Carbamoyl-phosphate synthase large chain~~~COG0458
MPKRNDIKKIMIIGSGPIIIGQAAEFDYAGTQACLALKEEGYEVVLVNSNPATIMTDREIADTVYIEPITLEFVSKILRK
ERPDALLPTLGGQTGLNMAMELSKTGILEELNVELLGTKLSAIDQAEDRELFKELCESINEPLCASDIATTVEEAINIAD
KIGYPIIVRPAFTMGGTGGGICDTEEELREIVANGLKLSPVTQCLIEESIAGYKEIEYEVMRDSADNAIVVCNMENFDPV
GVHTGDSIVFAPSQTLSDNEYQMLRDASLNIIRALKIEGGCNVQLALDPNSYEYRVIEVNPRVSRSSALASKATGYPIAK
MSAKIAIGMTLDEIINPVTNKTYAMFEPALDYVVAKIARFPFDKFENGDRHLGTQMKATGEVMAIGRNIEESLLKAVRSL
EIGVFHNEMTEAIEADDEKLYEKMVKTQDDRLFYVSEAIRRGIPIEEIADLTKIDIFFLDKLLYIVEIENQLKVNIFEPE
LLKTAKKNGFSDREIAKLWNVTPEEVRRRRQENKIIPVYKMVDTCAAEFESSTPYFYSTYEWENESKRSDKEKIIVLGSG
PIRIGQGVEFDYATVHCVKAIQALGKEAIVINSNPETVSTDFSISDKLYFEPLTFEDVMNVIDLEEPLGVIVQFGGQTAI
NLAEPLSKAGVKILGTQVEDLDRAEDRDLFEKALQDLDIPQPPGATATNEEEAVANANKIGYPVLIRPSFVLGGRAMEII
NNEKDLRDYMNRAVKASPEHPVLVDSYLQGQECEVDAICDGKEVLLPGIMEHIERAGVHSGDSMAVYPPQNLSQAIIDTI
VDYTKRLAIGLNCIGMMNIQFVIYEEQVYVIEVNPRASRTVPFLSKVTNIPMAQLATQMILGENLKDLGYEAGLAPTPDM
VHVKAPVFSFTKLAKVDSLLGPEMKSTGEAMGSDVTLEKALYKSFEAAKLHMADYGSVLFTVADEDKEETLALAKDFAEI
GYSLVATAGTAAFLKENGLYVREVEKLAGGEDEEGTLVEDIRQGRVQAVVNTMGNTRASLTTATDGFRIRQEAISRGIPL
FTSLDTVAAILKVMQSRSFTTKNI
>P9WPK3 6.3.5.5~~~carB~~~Carbamoyl-phosphate synthase large chain~~~COG0458
MPRRTDLHHVLVIGSGPIVIGQACEFDYSGTQACRVLRAEGLQVSLVNSNPATIMTDPEFADHTYVEPITPAFVERVIAQ
QAERGNKIDALLATLGGQTALNTAVALYESGVLEKYGVELIGADFDAIQRGEDRQRFKDIVAKAGGESARSRVCFTMAEV
RETVAELGLPVVVRPSFTMGGLGSGIAYSTDEVDRMAGAGLAASPSANVLIEESIYGWKEFELELMRDGHDNVVVVCSIE
NVDPMGVHTGDSVTVAPAMTLTDREYQRMRDLGIAILREVGVDTGGCNIQFAVNPRDGRLIVIEMNPRVSRSSALASKAT
GFPIAKIAAKLAIGYTLDEIVNDITGETPACFEPTLDYVVVKAPRFAFEKFPGADPTLTTTMKSVGEAMSLGRNFVEALG
KVMRSLETTRAGFWTAPDPDGGIEEALTRLRTPAEGRLYDIELALRLGATVERVAEASGVDPWFIAQINELVNLRNELVA
APVLNAELLRRAKHSGLSDHQIASLRPELAGEAGVRSLRVRLGIHPVYKTVDTCAAEFEAQTPYHYSSYELDPAAETEVA
PQTERPKVLILGSGPNRIGQGIEFDYSCVHAATTLSQAGFETVMVNCNPETVSTDYDTADRLYFEPLTFEDVLEVYHAEM
ESGSGGPGVAGVIVQLGGQTPLGLAHRLADAGVPIVGTPPEAIDLAEDRGAFGDLLSAAGLPAPKYGTATTFAQARRIAE
EIGYPVLVRPSYVLGGRGMEIVYDEETLQGYITRATQLSPEHPVLVDRFLEDAVEIDVDALCDGAEVYIGGIMEHIEEAG
IHSGDSACALPPVTLGRSDIAKVRKATEAIAHGIGVVGLLNVQYALKDDVLYVLEANPRASRTVPFVSKATAVPLAKACA
RIMLGATIAQLRAEGLLAVTGDGAHAARNAPIAVKEAVLPFHRFRRADGAAIDSLLGPEMKSTGEVMGIDRDFGSAFAKS
QTAAYGSLPAQGTVFVSVANRDKRSLVFPVKRLADLGFRVLATEGTAEMLRRNGIPCDDVRKHFEPAQPGRPTMSAVDAI
RAGEVNMVINTPYGNSGPRIDGYEIRSAAVAGNIPCITTVQGASAAVQGIEAGIRGDIGVRSLQELHRVIGGVER
>Q9XB60 2.3.1.226~~~carB~~~Carboxymethylproline synthase~~~
MVFEENSDEVRVITLDHPNKHNPFSRTLETSVKDALARANADDSVRAVVVYGGAERSFSAGGDFNEVKQLSRSEDIEEWI
DRVIDLYQAVLNVNKPTIAAVDGYAIGMGFQFALMFDQRLMASTANFVMPELKHGIGCSVGAAILGFTHGFSTMQEIIYQ
CQSLDAPRCVDYRLVNQVVESSALLDAAITQAHVMASYPASAFINTKRAVNKPFIHLLEQTRDASKAVHKAAFQARDAQG
HFKNVLGKKY
>P63740 6.3.5.5~~~carB~~~Carbamoyl-phosphate synthase large chain~~~
MPKRNDIKTILVIGSGPIIIGQAAEFDYAGTQACLALKEEGYRVILVNSNPATIMTDKEIADKVYIEPLTHDFIARIIRK
EQPDALLPTLGGQTGLNMAIQLHESGVLQDNNVQLLGTELTSIQQAEDREMFRTLMNDLNVPVPESDIVNTVEQAFKFKE
QVGYPLIVRPAFTMGGTGGGICHNDEELHEIVSNGLHYSPATQCLLEKSIAGFKEIEYEVMRDKNDNAIVVCNMENIDPV
GIHTGDSIVVAPSQTLSDVEYQMLRDVSLKVIRALGIEGGCNVQLALDPHSFDYYIIEVNPRVSRSSALASKATGYPIAK
LAAKIAVGLTLDEMLNPITGTSYAAFEPTLDYVISKIPRFPFDKFEKGERELGTQMKATGEVMAIGRTYEESLLKAIRSL
EYGVHHLGLPNGESFDLDYIKERISHQDDERLFFIGEAIRRGTTLEEIHNMTQIDYFFLHKFQNIIDIEHQLKEHQGDLE
YLKYAKDYGFSDKTIAHRFNMTEEEVYQLRMENDIKPVYKMVDTCAAEFESSTPYYYGTYETENESIVTDKEKILVLGSG
PIRIGQGVEFDYATVHAVWAIQKAGYEAIIVNNNPETVSTDFSISDKLYFEPLTEEDVMNIINLEKPKGVVVQFGGQTAI
NLADKLAKHGVKILGTSLENLNRAEDRKEFEALLRKINVPQPQGKSATSPEEALANAAEIGYPVVVRPSYVLGGRAMEIV
DNDKELENYMTQAVKASPEHPVLVDRYLTGKEIEVDAICDGETVIIPGIMEHIERAGVHSGDSIAVYPPQTLTEDELATL
EDYTIKLAKGLNIIGLINIQFVIAHDGVYVLEVNPRSSRTVPFLSKITDIPMAQLAMRAIIGEKLTDMGYQEGVQPYAEG
VFVKAPVFSFNKLKNVDITLGPEMKSTGEVMGKDTTLEKALFKGLTGSGVEVKDHGTVLMTVSDKDKEEVVKLAQRLNEV
GYKILATSGTANKLAEYDIPAEVVGKIGGENDLLTRIQNGDVQIVINTMTKGKEVERDGFQIRRTTVENGIPCLTSLDTA
NALTNVIESMTFTMRQM
>P13079 2.1.1.-~~~carB~~~rRNA methyltransferase~~~
MAALLKRILRRRMAEKRSGRGRMAAARTTGAQSRKTAQRSGRSEADRRRRVHGQNFLVDRETVQRFVRFADPDPGEVVLE
VGAGNGAITRELARLCRRVVAYEIDRHFADRLREATAEDPRIEVVAGDFLKTSQPKVPFSVVGNIPFGNTADIVDWCLNA
RRLRTTTLVTQLEYARKRTGGYRRWSRLTVATWPEVEWRMGERISRRWFRPVPAVDSAVLRLERRPVPLIPPGLMHDFRD
LVETGFTGKGGSLDASLRRRFPARRVAAGFRRARLEQGVVVAYVTPGQWITLFEELHGR
>H6LGM6 1.3.1.108~~~carC~~~Caffeyl-CoA reductase-Etf complex subunit CarC~~~COG1960
MYFSEQNKMIRKLARDFAEKELTTEILDEVEESGEFPQEILDKMAKFGFFGIKIPKSLGGSGGDHMSYVICMEEFARVSG
VASVYLSSPNSLAGGPLLLSGTEEQIEKYLKPIITGKKKLAFALTEPGAGSDAGGMSTTAVDMGDYYLLNGRKTFITMAP
LCDDAVIYAKTDMSKGTRGISAFIVDLKSEGVSMGKNEHKMGLIGCATSDIIMEDVKVPKENRLGEVNKGFSNAMKTLDV
GRLGVASQSIGVAQGALDEAIKYAKERKQFGKRIADFQAIAFMIADMATKLEAAKLLVYNAASLMDNKKNATKEASMAKF
YASEICNEICAKAVQIHGGYGYIKEYKVERMYRDCRVFTIYEGTSQVQQMVISGMLLKK
>Q9XB59 1.14.20.3~~~carC~~~(5R)-carbapenem-3-carboxylate synthase~~~
MSEIVKFNPVMASGFGAYIDHRDFLEAKTETIKNLLMRQGFVVVKNLDIDSDTFRDIYSAYGTIVEYADEKIGVGFGYRD
TLKLEGEKGKIVTGRGQLPFHADGGLLLSQVDQVFLYAAEIKNVKFRGATTVCDHALACQEMPAHLLRVLEEETFEVRVL
ERGYYVDVSPDGWFKVPVFTDLGWVRKMLIYFPFDEGQPASWEPRIVGFTDHETQAFFQELGAFLKQPRYYYKHFWEDGD
LLIMDNRRVIHEREEFNDDDIVRRLYRGQTADI
>Q9AQM4 3.7.1.13~~~carC~~~2-hydroxy-6-oxo-6-(2'-aminophenyl)hexa-2,4-dienoic acid hydrolase~~~
MLNKAEQISEKSESAYVERFVNAGGVETRYLEAGKGQPVILIHGGGAGAESEGNWRNVIPILARHYRVIAMDMLGFGKTA
KPDIEYTQDRRIRHLHDFIKAMNFDGKVSIVGNSMGGATGLGVSVLHSELVNALVLMGSAGLVVEIHEDLRPIINYDFTR
EGMVHLVKALTNDGFKIDDAMINSRYTYATDEATRKAYVATMQWIREQGGLFYDPEFIRKVPVPTLVVHGKDDKVVPVET
AYKFLDLIDDSWGYIIPHCGHWAMIEHPEDFANATLSFLSRRADITRAAA
>A4XES9 1.13.11.82~~~~~~8'-apo-carotenoid 13,14-cleaving dioxygenase~~~COG3670
MVTKGVVVVSSFRRSRQDANRPHAFLTGIHAPVKEERTIEDLAVTGTIPAELSGRYVRIGPNPFRADPRGHHWFVGDGMV
HGVCMKGGKALWYRNRYVRSRNLQDAGGPAAAPGPRRSTFDTVNTNVIQHAGRTFALVEAGSFPVELTHDLESFAYSDLG
GTLKGPFSAHPHLDPLTGELHAVTYDGQTLDTVWHVVVDREGRVRREEPVPVAHGPSIHDCAITAKYVLILDLPVTFSMA
ALVGGARFPYRWNPAHRARVGLLPREGTAADVIWCDVDAAYVFHVANAFDNPDGTVTVDLAAYETMFAHGPDGPNGKSLG
MERWTVDPAARKVARKTLDAAPQEFHRPDERFFGQPYRFAWSMGLPAENAEDFLGHAPIYGYDLATGQRSAHDFGPGKIP
GEFVFIPRRADAEEGDGWLMGYVIDLASETTDLAILDARNLAAPPLALIHIPCRIPPGFHGNWLPDAAD
>P75409 2.4.2.-~~~cards~~~ADP-ribosylating toxin CARDS~~~
MPNPVRFVYRVDLRSPEEIFEHGFSTLGDVRNFFEHILSTNFGRSYFISTSETPTAAIRFFGSWLREYVPEHPRRAYLYE
IRADQHFYNARATGENLLDLMRQRQVVFDSGDREMAQMGIRALRTSFAYQREWFTDGPIAAANVRSAWLVDAVPVEPGHA
HHPAGRVVETTRINEPEMHNPHYQELQTQANDQPWLPTPGIATPVHLSIPQAASVADVSEGTSASLSFACPDWSPPSSNG
ENPLDKCIAEKIDNYNLQSLPQYASSVKELEDTPVYLRGIKTQKTFMLQADPQNNNVFLVEVNPKQKSSFPQTIFFWDVY
QRICLKDLTGAQISLSLTAFTTQYAGQLKVHLSVSAVNAVNQKWKMTPQDIAITQFRVSSELLGQTENGLFWNTKSGGSQ
HDLYVCPLKNPPSDLEELQIIVDECTTHAQFVTMRAASTFFVDVQLGWYWRGYYYTPQLSGWSYQMKTPDGQIFYDLKTS
KIFFVQDNQNVFFLHNKLNKQTGYSWDWVEWLKHDMNEDKDENFKWYFSRDDLTIPSVEGLNFRHIRCYADNQQLKVIIS
GSRWGGWYSTYDKVESNVEDKILVKDGFDRF
>H6LGM7 1.3.1.108~~~carD~~~Caffeyl-CoA reductase-Etf complex subunit CarD~~~COG2086
MRILVCAKQVPDTNEVKIDPKTGTMIREGVPSILNPDDANALEAALVIKDENPGTEVIVMTMGPPQASEMLRECLAMGAD
EAYLLSDRAFGGADTWATSATLAAGIKKVKKVDLVLAGRQAIDGDTAQVGSQIAQRLKMPVVTYVEDIKIEDKKAIVHRQ
MEDGYEVIEVQLPCLLTCVKELNDPRYMSVGGIMDAYEQPITIWNHEDIGLSPEACGLNASPTQVFRSFSPPAKGGGEMI
TGTTVNEVAGSLVSKLKEKHII
>A0R561 ~~~carD~~~RNA polymerase-binding transcription factor CarD~~~COG1329
MIFKVGDTVVYPHHGAALIEAIETRTIKGEQKEYLVLKVAQGDLTVRVPADNAEYVGVRDVVGQEGLDKVFQVLRAPHTE
EPTNWSRRYKANLEKLASGDVNKVAEVVRDLWRRDQERGLSAGEKRMLAKARQILVGELALAENTDDAKAETILDEVLAA
AS
>P9WJG3 ~~~carD~~~RNA polymerase-binding transcription factor CarD~~~COG1329
MIFKVGDTVVYPHHGAALVEAIETRTIKGEQKEYLVLKVAQGDLTVRVPAENAEYVGVRDVVGQEGLDKVFQVLRAPHTE
EPTNWSRRYKANLEKLASGDVNKVAEVVRDLWRRDQERGLSAGEKRMLAKARQILVGELALAESTDDAKAETILDEVLAA
AS
>H6LGM8 1.3.1.108~~~carE~~~Caffeyl-CoA reductase-Etf complex subunit CarE~~~COG2025
MAIKVIEEKCIGCSKCQKSCPFDAITIENKIAVIGDACTNCGTCIDVCPTEAILQEGTEKIVRDLSMYKGVWVFAEQREG
KIMPVVFELLGEGKKLANEIGTELCAILCGSNVAELTDELFAYGADKVYLADAPELEKYTTDGYSKIINEAIGLYKPEIV
LYGATHIGRDLAPCLAVKVNTGLTADCTKLEIDPDDKKIRQTRPAFGGNLMATIVCPGSRPQMSTVRPGVMDKAAYDPSQ
KGEVIKLDATFNEGDIRTKVLEIVKTTTDNISISDADFIVSGGMGLGKPEGFELLKQLADKLGGTVATSRACVDAGWADH
AQQVGQTGTTVKPQIYFACGISGAIQHIAGMQDSDIIIAINKNENAPIFEVADYGIVGDLYKVIPAIIEELDKIGK
>Q9AE87 1.14.19.77~~~carF~~~Plasmanylethanolamine desaturase~~~
MKTQEIEKKVRQQDAQVLAQGYSPAIRAMEIAAIVSFVSLEVALVYRLWGTPYAGTWLLLSAVLLGYLAADFVSGFVHWM
GDTWGSTEMPVLGKALIRPFREHHVDEKAITRHDFVETNGNNCLISLPVAIIALCLPMSGPGWVFCASFLGAMIFWVMAT
NQFHKWSHMDSPPALVGFLQRVHLILPPDHHRIHHTKPYNKYYCITVGWMNKPLTMVHFFPTAERLITWATGLLPRQDDI
GAEAARALVVAAGGSEAPVVQAAKELLTQATVQEKPASTRP
>Q50900 ~~~carH~~~HTH-type transcriptional repressor CarH~~~
MAERTYRINIAAELAGVRVELIRAWERRYGVLTPRRTPAGYRAYTDRDVAVLKQLKRLTDEGVAISEAAKLLPQLMEGLE
AEVAGRGASQDARPHAETWRESMLAATQAYDQPRVSDVLDEVLAALPPLKAFDEVLAPLLCDVGERWESGTLTVAQEHLV
SQMVRARLVSLLHAAPLGRHRHGVLACFPEEEHEMGLLGAALRLRHLGVRVTLLGQRVPAEDLGRAVLALRPDFVGLSTV
ASRSAEDFEDTLTRLRQALPRGLPVWVGGAAARSHQAVCERLAVHVFQGEEDWDRLAGT
>Q53W62 ~~~carH~~~HTH-type transcriptional repressor CarH~~~
MTSSGVYTIAEVEAMTGLSAEVLRQWERRYGFPKPRRTPGGHRLYSAEDVEALKTIKRWLEEGATPKAAIRRYLAQEVRP
EDLGTGLLEALLRGDLAGAEALFRRGLRFWGPEGVLEHLLLPVLREVGEAWHRGEIGVAEEHLASTFLRARLQELLDLAG
FPPGPPVLVTTPPGERHEIGAMLAAYHLRRKGVPALYLGPDTPLPDLRALARRLGAGTVVLSAVLSEPLRALPDGALKDL
APRVFLGGQGAGPEEARRLGAEYMEDLKGLAEALWLPRGPEKEAI
>Q06909 ~~~carQ~~~RNA polymerase sigma factor CarQ~~~
MERFRDGAQDAFEDLFARHAPRVQGFLARMVRNGALAEDLLQATFLSVIRSRGRYEPGTRFIPWLMTIAANAARDALRHQ
RHVDAYASREDTATPASAAPDDSDPSLRRHLLDALQQLHPDHREAVVLSKVEGWSFEEIGALRGISPGAARLRAHRGYEK
LRELLGELELEVAR
>Q06910 ~~~carR~~~Carotenogenesis protein CarR~~~
MKPPMDLDSLLTQTPAKDNAALERVLAAARGELALRRPVRRWRTQAVGLMAASAGLGLLAAVVLLAVGAVTGPLLLARAP
LLAMLVGTSAVCAWGALSPKGRWMRRLGVGLAVVSAAALVLARGAPHSPPSFPGWVCTVSHLAIGVVPLVVALFALRGAF
FQPLRAVVAGLSVGSTGALLGELACEQDWRHVLSHHLLAWVVITVVLVVISKSLKPRSYAP
>Q06911 ~~~carS~~~Antirepressor protein CarS~~~
MIQDPSLIICHDVDGAPVRIGAKVKVVPHSEDGTISQRFLGQTGIVVGLVFDDPATQYPDDPLIQVLVEGLGEDLFFPEE
LELAPEWARNRIAQHRQAVRTGGRSSLERLP
>P54324 6.3.5.5~~~carA~~~Carbamoyl-phosphate synthase arginine-specific small chain~~~
MKAYLHVASGKTFSGELAAPLEEKVSGEIVFFTGMTGYQEVLTDPSYKNQIIVFTYPLIGNYGINENDFESKRPHVEAVV
VYEASREGFHYGAKYSLAEYLQHWNIPLLTHVDTRALVKEIRTAGTMMAELSLSPISAVGGVEAVFPVRAVSTRTIETYG
EGGPHLVLVDFGYKKSILQSLLARGCRVTVVPHDTAPEAIDALKPDGLVLSNGPGDPKQLRHQLPAIRQLIDRYPTLAIC
LGHQLVALATGRIRKKLRFGHRGANQPVWDAVKQNVMMTSQNHSYVVKEGSLVGKPFDIRFINVNDGSVEGIVHRHKPIL
SVQYHPEAHPGPHDTGYIFDEFLQTVFKGENVYA
>O08317 6.3.5.5~~~carA~~~Carbamoyl-phosphate synthase arginine-specific small chain~~~COG0505
MNKYLTLADGTQWIGTAIGDCQLEAAGRIVFNTGMTGYQETLTDPSYLNQMIAFTYPLIGNYGIDPTVAQAPTIGAQAII
VHELATFNDHYTSRQSLASFLTIHHVAGIEGVDTRDLTIHIRQTGAQMAILSNHPITDFEAQLATFAPQVLTATPLPVAT
TTIRPRVAILNFGEKAAITAELQARGADVVVLPPTASLKAVAAYHPDGILLSNGPGDPTDYHTYLATIRQLAQRYPLAGI
CLGHQLIALAYGAQTYQLSFGHHGLNHPVQACADGRIIMTSQNHDYAVDPASIKGTPLIVTHTELNDGSIEGLRLPHQAV
MSVQFHPEAHPGPQEAGQFFDDFLLTIQKEAVVNA
>Q9ZB63 6.3.5.5~~~carB~~~Carbamoyl-phosphate synthase arginine-specific large chain~~~
MPKDSSLQSILLIGSGRSSSAKAAEFDYSGTQACIAFKEEGYRVILVNNNPATIMTDDVHADAVYFEPLTVEAVEAIIAK
ETPDGLLATFGGQTGLNLAFQLHEAGVLKKYGVRLLGTPIEAIKRPRGGPRTFRALMHELGEPVPESEIVTSVEEAVAFA
EQIGFPIIIRPAYTLGGTGGGIAENMEQFLALVEKGLNESPIHQCLIERSVAGFKEIEYEVMRDQSNTCITVCNMENVDP
VGIHTGDSIVVAPSQTLTDEEYQMLRSSAVKIISALGIIGGCNIQFALDPNSKQYYLIEVNPRVSRSSALASKATGYPIA
ALPAKLAVGYTLAELVNPVTKTTYASFEPALDYVVVKFPRLPFDKFPHADRKLGTQMKATGEVMAIDRNMERAFQKAVQS
LEGKNNGLLLPELSVKTNDELKQLLVDKDDRRFFAILELLRRGVAVEAIHKWTKIDRFFLCSFERLVALEKQAAAATLDT
IEEQTFRFLKEKGCSDAFLAETWGVTELDVRNKRKELGIVPSYKMVDTCAAEFHSETDYYYSTYFGEDERKQPSGKEKVL
IIGAGPIRIGQGIEFDYSSVHSVFALQKEGYETVMINNNPETVSTDFAVADRLYFEPLTLESVLDVIEAEQIKHVIVQFG
GQTAINLVKGLEEAGVPLLGVTYDMIDQLEDRDRFYQLLEELDIPHVPGLVANNAEELAAKAAEIGYPVLLRPSYVIGGC
GMFIVHSEAQLAALIEQGELTYPILIDAYLDGKEAEADIVTDGTDIVLPVIIEHVEKAGVHSGDSYAWLPAQTLTGEEKA
KIIDYAGRIAKKLGFKGIMNIQYVIADGNVYVLEVNPRASRTVPIVSKTTGVPLAQIATKLLLGKSLVDIVDEKARGLAV
MPYAVLKYPVFSTHKLPGVDPMVGPEMKSTGEGISIAATKEEAAYKAFYPYLQKKANANEVYVIGNIDAELEAEMTAKQL
TIVADVPFSDWVKRDTALAVIDLGKEEGEANKRMTALSRQLLVFTERETLKLFLQALDVDHLDVQPIHGWLEKKKQAEQA
VIA
>Q9RLS9 6.3.5.5~~~carB~~~Carbamoyl-phosphate synthase arginine-specific large chain~~~COG0458
MPKNNAIHSIAVIGSGPIKIGQAAEFDYAGTQACLSLKAAGYHVILINSNPATIMTDTATADEVYLEPLTLTSVTKILRA
AHPDALLPTLGGQTGLNLAMALDQAGVLNELKIQLLGTSLATINQAEDRAAFKTLMKRLHQPIPASTTVHHVTSALSFAE
KIGYPVIVRPAFTLGGSGGGIANNAAELTQTLQRGLTMSPVTECLIEQSIAGFKEIEFEVMRDNQGTKIIVCSMENFDPV
GIHTGDSIVYAPVQTLTDTEYQQLRTAALTIVEALDIRGGYNVQLAQDPNSRQYYVIEVNPRVSRSSALASKATGYPIAK
IAADIAIGLNLSEIKNPVTQTTYAAFEPALDYVVAKIPRFAFDKFPTADAHLGTQMKATGEVMAIGSTLEEATLKAIASL
EIDPKTQASLTPDHHVTTTEYIDQLTHPTDQRLFYLLAALQAGWPLAKLATLTQITPFFLSKLQHIAQLIRNIKQVPTSQ
HLLSAKKYGVSLATMAHYAQTSVATIAAMTADLPFVYKMVDTCAGEFASVTPYFYSTAFGQTNESHPLGHSILVLGSGPI
RIGQGIEFDYTTVHCVKAIQQAGYHAIIVNNNPETVSTDFSTSDKLYFEPLTIERLMPIIALEQPAGVIVQFGGQTAINL
AHQLTKLGVTVLGTSVSATDLTEDRQSFADCLRLLKIAQAAGTTVTELSGARQAAHAIGYPLLVRPSFVLGGRAMAIVHN
DHELTPVIKSAVAAGHGAPILMDQYLAGTECEVDVLSDGTSCFIPGIMEHIEGAGVHSGDSITVYPPQHLSPAVQDKIVT
IATKLAQHLHCVGMMNIQFVVTDDVYVIDVNPRASRTVPYMSKVTHLPLAQLATRLILGQSLATLGLVPGLLTPAPQQIA
IKAPVFSFNKLPQSPVVLSPEMKSTGETLGIGPTFTTAWHAAMADSYHLETWQTVDGLITDIATVAQPAVQELFAANHLT
VTTIANTTDWPAKTKAVGFTLNDQPDNPVAIAALNHGQPLITAIDTLKTLLAVTPTPATI
>B2HN69 1.2.1.-~~~car~~~Carboxylic acid reductase~~~COG1022
MSPITREERLERRIQDLYANDPQFAAAKPATAITAAIERPGLPLPQIIETVMTGYADRPALAQRSVEFVTDAGTGHTTLR
LLPHFETISYGELWDRISALADVLSTEQTVKPGDRVCLLGFNSVDYATIDMTLARLGAVAVPLQTSAAITQLQPIVAETQ
PTMIAASVDALADATELALSGQTATRVLVFDHHRQVDAHRAAVESARERLAGSAVVETLAEAIARGDVPRGASAGSAPGT
DVSDDSLALLIYTSGSTGAPKGAMYPRRNVATFWRKRTWFEGGYEPSITLNFMPMSHVMGRQILYGTLCNGGTAYFVAKS
DLSTLFEDLALVRPTELTFVPRVWDMVFDEFQSEVDRRLVDGADRVALEAQVKAEIRNDVLGGRYTSALTGSAPISDEMK
AWVEELLDMHLVEGYGSTEAGMILIDGAIRRPAVLDYKLVDVPDLGYFLTDRPHPRGELLVKTDSLFPGYYQRAEVTADV
FDADGFYRTGDIMAEVGPEQFVYLDRRNNVLKLSQGEFVTVSKLEAVFGDSPLVRQIYIYGNSARAYLLAVIVPTQEALD
AVPVEELKARLGDSLQEVAKAAGLQSYEIPRDFIIETTPWTLENGLLTGIRKLARPQLKKHYGELLEQIYTDLAHGQADE
LRSLRQSGADAPVLVTVCRAAAALLGGSASDVQPDAHFTDLGGDSLSALSFTNLLHEIFDIEVPVGVIVSPANDLQALAD
YVEAARKPGSSRPTFASVHGASNGQVTEVHAGDLSLDKFIDAATLAEAPRLPAANTQVRTVLLTGATGFLGRYLALEWLE
RMDLVDGKLICLVRAKSDTEARARLDKTFDSGDPELLAHYRALAGDHLEVLAGDKGEADLGLDRQTWQRLADTVDLIVDP
AALVNHVLPYSQLFGPNALGTAELLRLALTSKIKPYSYTSTIGVADQIPPSAFTEDADIRVISATRAVDDSYANGYSNSK
WAGEVLLREAHDLCGLPVAVFRCDMILADTTWAGQLNVPDMFTRMILSLAATGIAPGSFYELAADGARQRAHYDGLPVEF
IAEAISTLGAQSQDGFHTYHVMNPYDDGIGLDEFVDWLNESGCPIQRIADYGDWLQRFETALRALPDRQRHSSLLPLLHN
YRQPERPVRGSIAPTDRFRAAVQEAKIGPDKDIPHVGAPIIVKYVSDLRLLGLL
>Q50631 1.2.1.-~~~car~~~Carboxylic acid reductase~~~COG1022
MSINDQRLTRRVEDLYASDAQFAAASPNEAITQAIDQPGVALPQLIRMVMEGYADRPALGQRALRFVTDPDSGRTMVELL
PRFETITYRELWARAGTLATALSAEPAIRPGDRVCVLGFNSVDYTTIDIALIRLGAVSVPLQTSAPVTGLRPIVTETEPT
MIATSIDNLGDAVEVLAGHAPARLVVFDYHGKVDTHREAVEAARARLAGSVTIDTLAELIERGRALPATPIADSADDALA
LLIYTSGSTGAPKGAMYRESQVMSFWRKSSGWFEPSGYPSITLNFMPMSHVGGRQVLYGTLSNGGTAYFVAKSDLSTLFE
DLALVRPTELCFVPRIWDMVFAEFHSEVDRRLVDGADRAALEAQVKAELRENVLGGRFVMALTGSAPISAEMTAWVESLL
ADVHLVEGYGSTEAGMVLNDGMVRRPAVIDYKLVDVPELGYFGTDQPYPRGELLVKTQTMFPGYYQRPDVTAEVFDPDGF
YRTGDIMAKVGPDQFVYLDRRNNVLKLSQGEFIAVSKLEAVFGDSPLVRQIFIYGNSARAYPLAVVVPSGDALSRHGIEN
LKPVISESLQEVARAAGLQSYEIPRDFIIETTPFTLENGLLTGIRKLARPQLKKFYGERLERLYTELADSQSNELRELRQ
SGPDAPVLPTLCRAAAALLGSTAADVRPDAHFADLGGDSLSALSLANLLHEIFGVDVPVGVIVSPASDLRALADHIEAAR
TGVRRPSFASIHGRSATEVHASDLTLDKFIDAATLAAAPNLPAPSAQVRTVLLTGATGFLGRYLALEWLDRMDLVNGKLI
CLVRARSDEEAQARLDATFDSGDPYLVRHYRELGAGRLEVLAGDKGEADLGLDRVTWQRLADTVDLIVDPAALVNHVLPY
SQLFGPNAAGTAELLRLALTGKRKPYIYTSTIAVGEQIPPEAFTEDADIRAISPTRRIDDSYANGYANSKWAGEVLLREA
HEQCGLPVTVFRCDMILADTSYTGQLNLPDMFTRLMLSLAATGIAPGSFYELDAHGNRQRAHYDGLPVEFVAEAICTLGT
HSPDRFVTYHVMNPYDDGIGLDEFVDWLNSPTSGSGCTIQRIADYGEWLQRFETSLRALPDRQRHASLLPLLHNYREPAK
PICGSIAPTDQFRAAVQEAKIGPDKDIPHLTAAIIAKYISNLRLLGLL
>Q6RKB1 1.2.1.-~~~car~~~Carboxylic acid reductase~~~
MAVDSPDERLQRRIAQLFAEDEQVKAARPLEAVSAAVSAPGMRLAQIAATVMAGYADRPAAGQRAFELNTDDATGRTSLR
LLPRFETITYRELWQRVGEVAAAWHHDPENPLRAGDFVALLGFTSIDYATLDLADIHLGAVTVPLQASAAVSQLIAILTE
TSPRLLASTPEHLDAAVECLLAGTTPERLVVFDYHPEDDDQRAAFESARRRLADAGSLVIVETLDAVRARGRDLPAAPLF
VPDTDDDPLALLIYTSGSTGTPKGAMYTNRLAATMWQGNSMLQGNSQRVGINLNYMPMSHIAGRISLFGVLARGGTAYFA
AKSDMSTLFEDIGLVRPTEIFFVPRVCDMVFQRYQSELDRRSVAGADLDTLDREVKADLRQNYLGGRFLVAVVGSAPLAA
EMKTFMESVLDLPLHDGYGSTEAGASVLLDNQIQRPPVLDYKLVDVPELGYFRTDRPHPRGELLLKAETTIPGYYKRPEV
TAEIFDEDGFYKTGDIVAELEHDRLVYVDRRNNVLKLSQGEFVTVAHLEAVFASSPLIRQIFIYGSSERSYLLAVIVPTD
DALRGRDTATLKSALAESIQRIAKDANLQPYEIPRDFLIETEPFTIANGLLSGIAKLLRPNLKERYGAQLEQMYTDLATG
QADELLALRREAADLPVLETVSRAAKAMLGVASADMRPDAHFTDLGGDSLSALSFSNLLHEIFGVEVPVGVVVSPANELR
DLANYIEAERNSGAKRPTFTSVHGGGSEIRAADLTLDKFIDARTLAAADSIPHAPVPAQTVLLTGANGYLGRFLCLEWLE
RLDKTGGTLICVVRGSDAAAARKRLDSAFDSGDPGLLEHYQQLAARTLEVLAGDIGDPNLGLDDATWQRLAETVDLIVHP
AALVNHVLPYTQLFGPNVVGTAEIVRLAITARRKPVTYLSTVGVADQVDPAEYQEDSDVREMSAVRVVRESYANGYGNSK
WAGEVLLREAHDLCGLPVAVFRSDMILAHSRYAGQLNVQDVFTRLILSLVATGIAPYSFYRTDADGNRQRAHYDGLPADF
TAAAITALGIQATEGFRTYDVLNPYDDGISLDEFVDWLVESGHPIQRITDYSDWFHRFETAIRALPEKQRQASVLPLLDA
YRNPCPAVRGAILPAKEFQAAVQTAKIGPEQDIPHLSAPLIDKYVSDLELLQLL
>E5XP76 1.2.1.-~~~car~~~Carboxylic acid reductase~~~COG0236
MTESQSYETRQARPAGQSLAERVARLVAIDPQAAAAVPDKAVAERATQQGLRLAQRIEAFLSGYGDRPALAQRAFEITKD
PITGRAVATLLPKFETVSYRELLERSHAIASELANHAEAPVKAGEFIATIGFTSTDYTSLDIAGVLLGLTSVPLQTGATT
DTLKAIAEETAPAVFGASVEHLDNAVTTALATPSVRRLLVFDYRQGVDEDREAVEAARSRLAEAGSAVLVDTLDEVIARG
RALPRVALPPATDAGDDSLSLLIYTSGSTGTPKGAMYPERNVAQFWGGIWHNAFDDGDSAPDVPDIMVNFMPLSHVAGRI
GLMGTLSSGGTTYFIAKSDLSTFFEDYSLARPTKLFFVPRICEMIYQHYQSELDRIGAADGSPQAEAIKTELREKLLGGR
VLTAGSGSAPMSPELTAFIESVLQVHLVDGYGSTEAGPVWRDRKLVKPPVTEHKLIDVPELGYFSTDSPYPRGELAIKTQ
TILPGYYKRPETTAEVFDEDGFYLTGDVVAEVAPEEFVYVDRRKNVLKLSQGEFVALSKLEAAYGTSPLVRQISVYGSSQ
RSYLLAVVVPTPEALAKYGDGEAVKSALGDSLQKIAREEGLQSYEVPRDFIIETDPFTIENGILSDAGKTLRPKVKARYG
ERLEALYAQLAETQAGELRSIRVGAGERPVIETVQRAAAALLGASAAEVDPEAHFSDLGGDSLSALTYSNFLHEIFQVEV
PVSVIVSAANNLRSVAAHIEKERSSGSDRPTFASVHGAGATTIRASDLKLEKFLDAQTLAAAPSLPRPASEVRTVLLTGS
NGWLGRFLALAWLERLVPQGGKVVVIVRGKDDKAAKARLDSVFESGDPALLAHYEDLADKGLEVLAGDFSDADLGLRKAD
WDRLADEVDLIVHSGALVNHVLPYSQLFGPNVVGTAEVAKLALTKRLKPVTYLSTVAVAVGVEPSAFEEDGDIRDVSAVR
SIDEGYANGYGNSKWAGEVLLREAYEHAGLPVRVFRSDMILAHRKYTGQLNVPDQFTRLILSLLATGIAPKSFYQLDATG
GRQRAHYDGIPVDFTAEAITTLGLAGSDGYHSFDVFNPHHDGVGLDEFVDWLVEAGHPISRVDDYAEWLSRFETSLRGLP
EAQRQHSVLPLLHAFAQPAPAIDGSPFQTKNFQSSVQEAKVGAEHDIPHLDKALIVKYAEDIKQLGLL
>E6LHV7 3.1.-.-~~~~~~CRISPR system single-strand-specific deoxyribonuclease Cas10/Csm1 (subtype III-A)~~~COG1353
MNKKLELMYGSLLHDIGKIVYRSNSVDFAKGTHSKIGSQFLNKFKPFQLSGIVDSVSYHHYKELASSSLLDDSVAYITYI
ADNIASGTDRRASEGDYEGEGNRQRFDKRAPLASIFNVVNSETKGLANYTYSFEKEQVYRYPTDAKKEYTSSQYAALVNK
MTDDLSNKLKVGPDSFSSLLQWTESLWSYIPSSTDTNQVMDVSLYDHSKITCAIASCIYDYLTEMNCVNYRKELFSPYEK
TKQFYQEDVFLLVSLDMSGIQDFIYNISGSKALKSLRSRSFYLETMLESLVDDLLSDLELSRANLLYTGGGHAYLLLPNT
ERARDVLASFEGEMKEWFIKIFKTDLSVAIAYKACTGEDLMNSNGTYSDLWQTVSRKLSDKKAHKYSLNEIKLFNSTIHA
GTQECKECLRSDIDISEDSLCKICEGIIAISNDLRDYSFFVVSPEGKVPLPRNRYLSVENQDGAERKIKMNKETRIYSKN
QPFVGKQLVTNLWMCDYDFSTLNPETKKQGIASYVNREVGIPRLGVLRADIDNLGTTFIKGIPEQYRSISRTATLSRQLS
MFFKFELSNILKGARISVIYSGGDDLFLIGAWDDVISKALVLRKAFTRFSAGKLTFSAGIGMYPVKYPISKMASETGVLE
DLAKRGEKNQVALWNDSKVFGWSQLEEQILKEKMIPLQEALTNSQEHGKSFLYKMLELLRNEDQINIARLAYLLARSSLS
EELTQSIFAWSQNKQQKVELITAIEYLVYQIREAD
>P71629 3.1.-.-~~~~~~CRISPR system single-strand-specific deoxyribonuclease Cas10/Csm1 (subtype III-A)~~~COG1353
MNPQLIEAIIGCLLHDIGKPVQRAALGYPGRHSAIGRAFMKKVWLRDSRNPSQFTDEVDEADIGVSDRRILDAISYHHSS
ALRTAAENGRLAADAPAYIAYNIAAGTDRRKADSDDGHGASTWDPDTPLYSMFNRFGSGTANLAFAPEMLDDRKPINIPS
PRRIEFDKDRYAAIVNKLKAILVDLERSDTYLASLLNVLEATLSFVPSSTDASEVVDVSLFDHLKLTGALGACIWHYLQA
TGQSDFKSALFDKQDTFYNEKAFLLTTFDVSGIQDFIYTIHSSGAAKMLRARSFYLEMLTEHLIDELLARVGLSRANLNY
SGGGHAYLLLPNTESARKSVEQFEREANDWLLENFATRLFIATGSVPLAANDLMRRPNESASQASNRALRYSGLYRELSE
QLSAKKLARYSADQLRELNSRDHDGQKGDRECSVCHTVNRTVSADDEPKCSLCQALTAASSQIQSESRRFLLISDGATKG
LPLPFGATLTFCSRADADKALQQPQTRRRYAKNKFFAGECLGTGLWVGDYVAQMEFGDYVKRASGIARLGVLRLDVDNLG
QAFTHGFMEQGNGKFNTISRTAAFSRMLSLFFRQHINYVLARPKLRPITGDDPARPREATIIYSGGDDVFVVGAWDDVIE
FGIELRERFHEFTQGKLTVSAGIGMFPDKYPISVMAREVGDLEDAAKSLPGKNGVALFDREFTFGWDELLSKVIEEKYRH
IADYFSGNEERGMAFIYKLLELLAERDDRITKARWVYFLTRMRNPTGDTAPFQQFANRLHQWFQDPTDAKQLKTALHLYI
YRTRKEESE
>A0A0A7HFE1 3.1.-.-~~~~~~CRISPR system single-strand-specific deoxyribonuclease Cas10/Csm1 (subtype III-A)~~~
MKKEKIDLFYGALLHDIGKVIQRATGERKKHALVGADWFDEIADNQVISDQIRYHMANYQSDKLGNDHLAYITYIADNIA
SGVDRRQSNEESDEDASAKIWDTYTNQADIFNVFGAQTDKRYFKPTVLNLKSKPNFASATYEPFSKGDYAAIATRIKNEL
AEFEFNQAQIDSLLNLFEAILSFVPSSTNSKEIADISLAEHSRLTAAFALAIYDYLEDKGRHNYKEDLFTKASAFYEEEA
FLLASFDLSGIQDFIYNIATSGAAKQLKARSLYLDFMSEYIADSLLDKLGLNRANLLYVGGGHAYFVLANTEKTVETLVQ
FEKDFNQFLLANFQTRLYVAFGWGSFAAKDIMSELNSPESYRQIYQKASRMISEKKISRYDYRTLMLLNRGGKSSERECE
ICHSVENLVSYHDQKVCDICRGLYQFSKEIAHDHFIITENEGLPIGPNACLKGVAFEKLSQESFSRVYVKNDYKAGTIKA
THVFVGDYQCDEIHKYAALSKNEDGLGIKRLAVVRLDVDDLGAAFMAGFSRQGNGQYSTLSRSATFSRSMSLFFKVYINQ
FASDKKLSIIYAGGDDVFAIGSWQDIIAFTVELRQNFIKWTNGKLTLSAGIGLFADKTPISLMAHQTGELEEAAKGNEKD
SISLFSSDYTFKFDRFITNVYDDKLEQIRYFFNHQDERGKNFIYKLIELLRNYESEEKMNVARLAYYLTRLEELTDKDER
DKFKQFKKLFFKWYTNNESDRKEAELALLLYVYEIRKD
>A0Q5Y4 3.1.-.-~~~cas1-1~~~CRISPR-associated endonuclease Cas1 1~~~
MKHLIISEYGIYLGLESGRLVVKNKDDKKYFPLNRLATLSIAKKGVSFSSDLVEQFSLRGIKLFFLDFRGVAHSMLVGAN
QHAVVQARINQYRYIDRNALTLSIKLIAAKIKNQRATLNYFNKHHKSINLLNAIEELKRVAQLIKNAKTLNDVLGYEGYA
ANIYFSSLAKDKFLSASFANREGRGSQEIANSMLNFGYAILSSYILNAITNAGLEPYLGFLHQKRPGKMSLVLDLMEEYR
AWVVDRVVIKLREQYKNKQYIDTKLKSILISEIQATIAKKYIYNGKKLKLEHIIQRQVYRLSGEFAGEHNYKPYIFKW
>Q53WG8 3.1.-.-~~~cas1-2~~~CRISPR-associated endonuclease Cas1 2~~~
MPPVSSARNLKELPKFRDGLSYLYVEHAVVEREAGGIGIYDQEGLTLAPVAGLGVLFLGPGTRITHAAVRLLAENGCTVA
WVGEGMARFYAQGLGDTRSAARFYRQARAWADPALHLEVVMRLYRMRFSEPLPEGLTLEQVRGLEGVRVRNAYARWSRET
GVPWYGRSYDRGNWRAADPVNRALSAGASYLYGLAHAAIVSLGFSPALGFIHTGKLLSFVYDIADLYKADYLVPAAFRTV
AESEEAVERRVRRALREAIQEGRLLERMAEDLLNLFRGLGLPEEEDPVEEDPTRPGGLWDLEGEVEGGVAYGGDDPGEGA
EEPEG
>O66692 3.1.-.-~~~cas1~~~CRISPR-associated endonuclease Cas1~~~COG1518
MGRVYYINSHGTLSRHENTLRFENAEVKKDIPVEDVEEIFVFAELSLNTKLLNFLASKGIPLHFFNYYGYYTGTFYPRES
SVSGHLLIKQVEHYLDAQKRLYLAKSFVIGSILNLEYVYKISADTYLNKVKETNSIPELMSVEAEFRKLCYKKLEEVTGW
ELEKRTKRPPQNPLNALISFGNSLTYAKVLGEIYKTQLNPTVSYLHEPSTKRFSLSLDVAEVFKPIFVDNLIIRLIQENK
IDKTHFSTELNMTFLNEIGRKVFLKAFNELLETTIFYPKLNRKVSHRTLIKLELYKLIKHLLEEEVYLPLNYGGLK
>Q46896 3.1.-.-~~~ygbT~~~CRISPR-associated endonuclease Cas1~~~COG1518
MTWLPLNPIPLKDRVSMIFLQYGQIDVIDGAFVLIDKTGIRTHIPVGSVACIMLEPGTRVSHAAVRLAAQVGTLLVWVGE
AGVRVYASGQPGGARSDKLLYQAKLALDEDLRLKVVRKMFELRFGEPAPARRSVEQLRGIEGSRVRATYALLAKQYGVTW
NGRRYDPKDWEKGDTINQCISAATSCLYGVTEAAILAAGYAPAIGFVHTGKPLSFVYDIADIIKFDTVVPKAFEIARRNP
GEPDREVRLACRDIFRSSKTLAKLIPLIEDVLAAGEIQPPAPPEDAQPVAIPLPVSLGDAGHRSS
>P9WPJ5 3.1.-.-~~~cas1~~~CRISPR-associated endonuclease Cas1~~~COG1518
MVQLYVSDSVSRISFADGRVIVWSEELGESQYPIETLDGITLFGRPTMTTPFIVEMLKRERDIQLFTTDGHYQGRISTPD
VSYAPRLRQQVHRTDDPAFCLSLSKRIVSRKILNQQALIRAHTSGQDVAESIRTMKHSLAWVDRSGSLAELNGFEGNAAK
AYFTALGHLVPQEFAFQGRSTRPPLDAFNSMVSLGYSLLYKNIIGAIERHSLNAYIGFLHQDSRGHATLASDLMEVWRAP
IIDDTVLRLIADGVVDTRAFSKNSDTGAVFATREATRSIARAFGNRIARTATYIKGDPHRYTFQYALDLQLQSLVRVIEA
GHPSRLVDIDITSEPSGA
>Q6D0X0 3.1.-.-~~~cas1~~~CRISPR-associated endonuclease Cas1~~~COG1518
MDNAFSPSDLKTILHSKRANVYYLQHCRILVNGGRVEYVTEEGNQSLYWNIPIANTSVVMLGTGTSVTQAAMREFARAGV
MIGFCGGGGTPLFAANEAEVAVSWLSPQSEYRPTEYLQDWVSFWFDDEKRLAAAIAFQQVRITQIRQHWLGSRLSRESRF
TFKSEHLQALLDRYQKGLTDCRTSNDVLVQEAMMTKALYRLAANAVSYGDFTRAKRGGGTDLANRFLDHGNYLAYGLAAV
STWVLGLPHGLAVLHGKTRRGGLVFDVADLIKDALVLPQAFIAAMEGEDEQEFRQRCLTAFQQSEALDVMIGSLQDVASK
LSQVVR
>Q02ML7 3.1.-.-~~~cas1~~~CRISPR-associated endonuclease Cas1~~~
MDDISPSELKTILHSKRANLYYLQHCRVLVNGGRVEYVTDEGRHSHYWNIPIANTTSLLLGTGTSITQAAMRELARAGVL
VGFCGGGGTPLFSANEVDVEVSWLTPQSEYRPTEYLQRWVGFWFDEEKRLVAARHFQRARLERIRHSWLEDRVLRDAGFA
VDATALAVAVEDSARALEQAPNHEHLLTEEARLSKRLFKLAAQATRYGEFVRAKRGSGGDPANRFLDHGNYLAYGLAATA
TWVLGIPHGLAVLHGKTRRGGLVFDVADLIKDSLILPQAFLSAMRGDEEQDFRQACLDNLSRAQALDFMIDTLKDVAQRS
TVSA
>Q05581 1.14.11.21~~~cs1~~~Clavaminate synthase 1~~~COG2175
MTSVDCTAYGPELRALAARLPRTPRADLYAFLDAAHTAAASLPGALATALDTFNAEGSEDGHLLLRGLPVEADADLPTTP
SSTPAPEDRSLLTMEAMLGLVGRRLGLHTGYRELRSGTVYHDVYPSPGAHHLSSETSETLLEFHTEMAYHRLQPNYVMLA
CSRADHERTAATLVASVRKALPLLDERTRARLLDRRMPCCVDVAFRGGVDDPGAIAQVKPLYGDADDPFLGYDRELLAPE
DPADKEAVAALSKALDEVTEAVYLEPGDLLIVDNFRTTHARTPFSPRWDGKDRWLHRVYIRTDRNGQLSGGERAGDVVAF
TPRG
>G3ECR2 3.1.-.-~~~cas1~~~CRISPR-associated endonuclease Cas1~~~COG1518
MAGWRTVVVNIHSKLSYKNNHLIFRNSYKTEMIHLSEIDILLLETTDIVLTTMLVKRLVDENILVIFCDDKRLPTAFLTP
YYARHDSSLQIARQIAWKENVKCEVWTAIIAQKILNQSYYLGECSFFEKSQSIMELYHGLERFDPSNREGHSARIYFNTL
FGNDFTRESDNDINAALDYGYTLLLSMFAREVVVCGCMTQIGLKHANQFNQFNLASDIMEPFRPIIDRIVYQNRHNNFVK
IKKELFSIFSETYLYNGKEMYLSNIVSDYTKKVIKALNQLGEEIPEFRI
>Q9X2B7 3.1.-.-~~~cas1~~~CRISPR-associated endonuclease Cas1~~~COG1518
MESVYLFSSGTLKRKANTICLETESGRKYIPVENVMDIKVFGEVDLNKRFLEFLSQKRIPIHFFNREGYYVGTFYPREYL
NSGFLILKQAEHYINQEKRMLIAREIVSRSFQNMVDFLKKRKVRADSLTRYKKKAEEASNVSELMGIEGNAREEYYSMID
SLVSDERFRIEKRTRRPPKNFANTLISFGNSLLYTTVLSLIYQTHLDPRIGYLHETNFRRFSLNLDIAELFKPAVVDRLF
LNLVNTRQINEKHFDEISEGLMLNDEGKSLFVKNYEQALRETVFHKKLNRYVSMRSLIKMELHKLEKHLIGEQVFGSEE
>A0Q5Y5 3.1.-.-~~~cas2-1~~~CRISPR-associated endoribonuclease Cas2 1~~~
MQLRKEYLIAYDIEDNKTRTIIYKQLLAYGLKAVQKSVFWGYVSIAELNAIKRLFDSSLTISDKVFITRVNMHEQKLDYS
FGYDDKTFKDWDEYGHI
>Q6ZEI1 3.1.-.-~~~cas2-1~~~CRISPR-associated endoribonuclease Cas2 1~~~
MLYLIIYDVPATKAGNKRRTRLFDLLSGYGKWRQFSVFECFLSVKQFAKLQTAMEKLIKLDEDAVCIYVLDENTVQRTIT
YGTPQPEKPGSIII
>Q746F4 3.1.-.-~~~cas2b~~~CRISPR-associated endonuclease Cas2 2~~~COG1343
MGKRLYAVAYDIPDDTRRVKLANLLKSYGERVQLSVFECYLDERLLEDLRRRARRLLDLGQDALRIYPVAGQVEVLGVGP
LPELREVQVL
>Q82W51 3.1.-.-~~~~~~CRISPR-associated endoribonuclease Cas2 3~~~COG1343
MLIIVTYDVSTETRAGRKRLRRVAKLCESIGQRVQKSVFECRINLMQYEELERRLLSEIDEQEDNLRLYRLTEPAELHVK
EYGNFKAIDFEGPLTI
>Q72WF4 3.1.-.-~~~cas2~~~CRISPR-associated endoribonuclease Cas2~~~
MYGNDAMLVLISYDVSFEDPGGQRRLRRIAKACQDYGQRVQYSVFECVVDPAQWAKLKHRLLSEMDKEKDCLRFYYLGAN
WRNKVEHVGAKPAYDPEGPLIL
>P45956 3.1.-.-~~~ygbF~~~CRISPR-associated endoribonuclease Cas2~~~COG0847
MSMLVVVTENVPPRLRGRLAIWLLEVRAGVYVGDVSAKIREMIWEQIAGLAEEGNVVMAWATNTETGFEFQTFGLNRRTP
VDLDGLRLVSFLPV
>Q74H35 3.1.-.-~~~cas2~~~CRISPR-associated endoribonuclease Cas2~~~COG1343
MEHLYIVSYDIRNQRRWRRLFKTMHGFGCWLQLSVFQCRLDRIRIIKMEAAINEIVNHAEDHVLILDLGPAENVKPKVSS
IGKTFDPILRQAVIV
>Q9KFX8 3.1.-.-~~~cas2~~~CRISPR-associated endonuclease Cas2~~~COG1343
MLVLITYDVQTSSMGGTKRLRKVAKACQNYGQRVQNSVFECIVDSTQLTSLKLELTSLIDEEKDSLRIYRLGNNYKTKVE
HIGAKPSIDLEDPLIF
>P9WPJ3 3.1.-.-~~~cas2~~~CRISPR-associated endoribonuclease Cas2~~~COG1343
MPTRSREEYFNLPLKVDESSGTIGKMFVLVIYDISDNRRRASLAKILAGFGYRVQESAFEAMLTKGQLAKLVARIDRFAI
DCDNIRIYKIRGVAAVTFYGRGRLVSAEEFVFF
>Q1CW51 3.1.-.-~~~cas2~~~CRISPR-associated endoribonuclease Cas2~~~COG1343
MAEPRRWYLITYDIRDPKRWRKVHALLKGYGEWLQLSVFRCSLTDRDREKLRWELSRRMDAVDTLLVIGLCGGCVERVRA
INAKEDWPEEPAPFKVL
>Q05582 1.14.11.21~~~cs2~~~Clavaminate synthase 2~~~COG2175
MASPIVDCTPYRDELLALASELPEVPRADLHGFLDEAKTLAARLPEGLAAALDTFNAVGSEDGYLLLRGLPVDDSELPET
PTSTPAPLDRKRLVMEAMRALAGRRLGLHTGYQELRSGTVYHDVYPSPGAHYLSSETSETLLEFHTEMAYHILQPNYVML
ACSRADHENRAETLVGSVRKALPLLDEKTRARLFDRKVPCCVDVAFRGGVDDPGAIANVKPLYGDANDPFLGYDRELLAP
EDPADKEAVAHLSQALDDVTVGVKLVPGDVLIIDNFRTTHARTPFSPRWDGKDRWLHRVYIRTDRNGELSGGERAGDTIS
FSPRR
>G3ECR3 3.1.-.-~~~cas2~~~CRISPR-associated endoribonuclease Cas2~~~COG3512
MSYRYMRMILMFDMPTDTAEERKAYRKFRKFLLSEGFIMHQFSVYSKLLLNHTANTAMVGRLKANNPKKGNITILTVTEK
QFARMIYLYGDKNTSIANSEERLVFLGDNYCDED
>Q9X2B6 3.1.-.-~~~cas2~~~CRISPR-associated endoribonuclease Cas2~~~COG1343
MYVIMVYDVNEKRVAKILKIARKYLKWVQNSVLEGELSPGKYEKLKLEVSRLIDEKEDSVRFYVMDSQKVFNLETLGVEK
GEDGFIF
>Q73QW4 3.1.-.-~~~cas2~~~CRISPR-associated endoribonuclease Cas2~~~COG3512
MRVIVFFDLPVITPENRHNYSVFRKYLIKSGFIMQQKSVYSKLVLNLTNRDSIVKSIEKNKPPEGLVEVLTVTEKQYAKM
EIIIGESKTEYLNTDERLVVL
>P38036 3.1.-.-~~~ygcB~~~CRISPR-associated endonuclease/helicase Cas3~~~COG1203
MEPFKYICHYWGKSSKSLTKGNDIHLLIYHCLDVAAVADCWWDQSVVLQNTFCRNEMLSKQRVKAWLLFFIALHDIGKFD
IRFQYKSAESWLKLNPATPSLNGPSTQMCRKFNHGAAGLYWFNQDSLSEQSLGDFFSFFDAAPHPYESWFPWVEAVTGHH
GFILHSQDQDKSRWEMPASLASYAAQDKQAREEWISVLEALFLTPAGLSINDIPPDCSSLLAGFCSLADWLGSWTTTNTF
LFNEDAPSDINALRTYFQDRQQDASRVLELSGLVSNKRCYEGVHALLDNGYQPRQLQVLVDALPVAPGLTVIEAPTGSGK
TETALAYAWKLIDQQIADSVIFALPTQATANAMLTRMEASASHLFSSPNLILAHGNSRFNHLFQSIKSRAITEQGQEEAW
VQCCQWLSQSNKKVFLGQIGVCTIDQVLISVLPVKHRFIRGLGIGRSVLIVDEVHAYDTYMNGLLEAVLKAQADVGGSVI
LLSATLPMKQKQKLLDTYGLHTDPVENNSAYPLINWRGVNGAQRFDLLAHPEQLPPRFSIQPEPICLADMLPDLTMLERM
IAAANAGAQVCLICNLVDVAQVCYQRLKELNNTQVDIDLFHARFTLNDRREKENRVISNFGKNGKRNVGRILVATQVVEQ
SLDVDFDWLITQHCPADLLFQRLGRLHRHHRKYRPAGFEIPVATILLPDGEGYGRHEHIYSNVRVMWRTQQHIEELNGAS
LFFPDAYRQWLDSIYDDAEMDEPEWVGNGMDKFESAECEKRFKARKVLQWAEEYSLQDNDETILAVTRDGEMSLPLLPYV
QTSSGKQLLDGQVYEDLSHEQQYEALALNRVNVPFTWKRSFSEVVDEDGLLWLEGKQNLDGWVWQGNSIVITYTGDEGMT
RVIPANPK
>Q1CW46 3.1.-.-~~~cas3~~~Putative CRISPR-associated nuclease/helicase Cas3~~~COG1203
MKRLLAKSTATPDRPEGEATLLGHTALVLSAARRLLEHRGRASLLAAGLDPALEPRLRRIVLLAAALHDLGKCSEHFQSM
LRRQREAPQLVRHEALSLWLCWPGQPLSAWLQRDVSELDLCLALVCVAAHHRKFQTEAFAPDGTGAGLSLELRVQHEDFA
RTLGRIAEELALSAPPLFTAPIVLRATRKEHPRDQLQSWQDDFERTVPAGSVDARLLAVCKALVLAADVAGSALPRSGEK
QDWVDRQLTAPHPAEALRAVVERRLAGHTPRPFQEEVARSAAPLTLVRAGCGSGKTAAAYLWAARQHPGRPLWLTYPTMG
TATEGFRDYLHGADVEARLQHSRAEVDFDIFGLRDGAAPGTSSRDQDRLDALRSWGADAMSCTADTVLGLVQSQRQGLYA
WPGLCAACVVFDEVHAYDDRLFGCLLRFLEALPGIPALLMTASLPATRLDVLRGMCERVHGRSLAEVEGPEDLETLPRYQ
RLDVAEPWALVAECLRDHGKVLWVSNTVDRCMRTAEAGTSHGARALLYHSRFRYEDRVRRHGDVIEAFATEGRAAFASTT
QVAEMSLDLSADLLVTDLAPIPALIQRLGRLNRRSTLERPAPVRPFVVLPFDGPPYAAPDLRDARAWMERLGTGPLSQRD
LVDAWGPPGMMDAPRRQSSTWLDGRFDTWPAPCRDGSPSLTVLLEEDARAVLDGAVSAHRVTLPMNLPPESFKWRAWPRA
GRLPYPIPPANALDYDGLRGARWRKP
>Q6D0W9 3.1.-.-~~~cas3~~~CRISPR-associated nuclease/helicase Cas3 subtype I-F/YPEST~~~COG1203
MNILLISECNKRALVETRRILDQFAERKGERSWQTAITLEGLNTLRKLLRKTARRNTAVACHWIRSTNHTELLWVVGNLR
RFNAQGSVPTNTTSRDVLRTKDENPWHSAEVFSLLAAIAGLFHDVGKANALFQAGLSGTGPRSQPYRHEWVSLRLFQAFV
GEQDDKAWLTTLSMITSEAEVALLATLQQDKPTFSDSPFRTLPPLAQTIAWLIVSHHRLPVFNKSTELAPNSSEPQLSFA
ETWLTDHLSPQWNALNHCQIGWTPHEREQNWQFPNGTPLYSAVWREKARKFAGRALKLPSFMHFSQLDQRLTVHLARLAL
MLADHHYSAGAATVGWQDITYPVWANTDRKTGEYKQRLDEHCVGVGQNALLLGRSLPHLRDTLPAITRHKGFRQRSTHPR
FRWQDRAFDLACSIRETSKQHGFFGINMASTGRGKTFANARIMYGLSDESTGCRFSVALGLRTLTLQTGDALRQRLKLDE
DDLAVLIGSQAVQDLHEMRKENQQRQQNTPQTGSESADPLFSEHQYVRYDGSLDDGRLKAWLERSPTLHQLLSAPVLITT
IDHLMPATEALRGGHQIAPMLRLLTADLVLDEPDDFGLEDLPALCRLVNWAGMLGSRVLLSSATLPPALIRALFDAYLDG
RTAWQQAYGTPNTPLNVCCGWFDEFDCQHEQYGDVKDFMVRHDAFVHQRLKNLTKDELPLRFATIVPVSSPSKNADDVYL
AVAQAIHPRMFDLHSQHHQQHENGKTVSLGLVRMANIDPLVAVARQLLAIPSPPDTCIHYCIYHSQHPLAMRSHIEQRLD
AALMRNDADALWQVEEIRQAIEKSSQRHHVFVALATSVAEVGRDHDYDWAIVEPSSMRSLIQPAGRILRHRQDKQYVPKT
PNIYLLSHNIRALRGKDIAYCKPGFESQDDSLDTHDLHQLLQEKEYRHLSAAPRIVQPMSFAKPLSLVALEHAVLGKTLL
GLKNKKLDDLKRPPAAFWWRAHPHWNGELQRRTPFRQSAKDEAYTLWIADDDEEPVFMVQDDGPSGWKQSDIARPVTLNM
AEGVSAWIEPDYHALYQQLAEEKQWELSWVSARFGEIRLREEEDWYWHPLLGVFGALS
>Q02ML8 3.1.-.-~~~cas3~~~CRISPR-associated nuclease/helicase Cas3 subtype I-F/YPEST~~~
MNILLVSQCEKRALSETRRILDQFAERRGERTWQTPITQAGLDTLRRLLKKSARRNTAVACHWIRGRDHSELLWIVGDAS
RFNAQGAVPTNRTCRDILRKEDENDWHSAEDIRLLTVMAALFHDIGKASQAFQAKLRNRGKPMADAYRHEWVSLRLFEAF
VGPGSSDEDWLRRLADKRETGDAWLSQLARDDRQSAPPGPFQKSRLPPLAQAVGWLIVSHHRLPNGDHRGSASLARLPAP
IQSQWCGARDADAKEKAACWQFPHGLPFASAHWRARTALCAQSMLERPGLLARGPALLHDSYVMHVSRLILMLADHHYSS
LPADSRLGDPNFPLHANTDRDSGKLKQRLDEHLLGVALHSRKLAGTLPRLERQLPRLARHKGFTRRVEQPRFRWQDKAYD
CAMACREQAMEHGFFGLNLASTGCGKTLANGRILYALADPQRGARFSIALGLRSLTLQTGQAYRERLGLGDDDLAILVGG
SAARELFEKQQERLERSGSESAQELLAENSHVHFAGTLEDGPLREWLGRNSAGNRLLQAPILACTIDHLMPASESLRGGH
QIAPLLRLMTSDLVLDEVDDFDIDDLPALSRLVHWAGLFGSRVLLSSATLPPALVQGLFEAYRSGREIFQRHRGAPGRAT
EIRCAWFDEFSSQSSAHGAVTSFSEAHATFVAQRLAKLEQLPPRRQAQLCTVHAAGEARPALCRELAGQMNTWMADLHRC
HHTEHQGRRISFGLLRLANIEPLIELAQAILAQGAPEGLHVHLCVYHSRHPLLVRSAIERQLDELLKRSDDDAAALFARP
TLAKALQASTERDHLFVVLASPVAEVGRDHDYDWAIVEPSSMRSIIQLAGRIRRHRSGFSGEANLYLLSRNIRSLEGQNP
AFQRPGFETPDFPLDSHDLHDLLDPALLARIDASPRIVEPFPLFPRSRLVDLEHRRLRALMLADDPPSSLLGVPLWWQTP
ASLSGALQTSQPFRAGAKERCYALLPDEDDEERLHFSRYEEGTWSNQDNLLRNLDLTYGPRIQTWGTVNYREELVAMAGR
EDLDLRQCAMRYGEVRLRENTQGWSYHPYLGFKKYN
>F2XG53 3.1.-.-~~~cas3~~~CRISPR-associated nuclease/helicase Cas3~~~
MKHINDYFWAKKTEENSRLLWLPLTQHLEDTKNIAGLLWEHWLSEGQKVLIENSINVKSNIENQGKRLAQFLGAVHDIGK
ATPAFQTQKGYANSVDLDIQLLEKLERAGFSGISSLQLASPKKSHHSIAGQYLLSHYGVDEDIATIIGGHHGRPVDDLDG
LNSQKSYPSNYYQDEKKDSLVYQKWKSNQEAFLNWALTETGFNSVSQLPKIKQPAQVILSGLLIMSDWIASNEHFFPLLS
LDETDVKNKSQRIETGFKKWKKSNLWQPETFVDLVTLYQERFGFSPRNFQLILSQTIEKTTNPGIVILEAPMGIGKTEAA
LAVSEQLSSKKGCSGLFFGLPTQATSNGIFKRIEQWTENIKGNNSDHFSIQLVHGKAALNTDFIELLKGNTINMDDSENG
SIFVNEWFSGRKTSALDDFVVGTVDQFLMVALKQKHLALRHLGFSKKVIVIDEVHAYDAYMSQYLLEAIRWMGAYGVPVI
ILSATLPAQQREKLIKSYMAGMGVKWRDIENIDQIKIDAYPLITYNDGPDIHQVKMFEKQEQKNIYIHRLPEEQLFDIVK
EGLDNGGVVGIIVNTVRKSQELARNFSDIFGDDMVDLLHSNFIATERIRKEKDLLQEIGKKAIRPPKKIIIGTQVLEQSL
DIDFDVLISDLAPMDLLIQRIGRLHRHKIKRPQKHEVARFYVLGTFEEFDFDEGTRLVYGDYLLARTQYFLPDKIRLPDD
ISPLVQKVYNSDLTITFPKPELHKKYLDAKIEHDDKIKNKETKAKSYRIANPVLKKSRVRTNSLIGWLKNLHPNDSEEKA
YAQVRDIEDTVEVIALKKISDGYGLFIENKDISQNITDPIIAKKVAQNTLRLPMSLSKAYNIDQTINELERYNNSHLSQW
QNSSWLKGSLGIIFDKNNEFILNGFKLLYDEKYGVTIERLDKNESV
>Q53VY2 3.1.-.-~~~cas3~~~CRISPR-associated endonuclease/helicase Cas3~~~
MSVEEAALALWAKSGNPFHPLLAHMLDTAAVALAVLRMEPPRTRALYAEDWGLPEEGALAWAAALVGLHDLGKASPVFQA
GWEEGKERVQRAGLPFGELLDWVAHGVFTELFLRRLLKEKGLPERAANDLAAALGAHHGFPANAEEKSRARRHLRTEDPL
WKEARRWLLEEVFRRLGAPLPPSQGNGEARPEAVLRVMALASFADWVASDPSLFPYGRDPRRGDYLKEALRLAQEALNRL
GWPAFAKAQRREFGELFPYIPKPNALQESVPALLEGACTPVLLLVEAPMGMGKTEAALYAHHLLQAGLGHRGLYVALPTQ
ATANGLFPRVRGFLERLGEGSRLELQLQHGTALLNPHYAGLLERAAPRQVGEEEEGGAVASAWFSARKRAMLAPYGVGTL
DQALLGVLRVKHHFVRLWGLMNRVVVLDEVHAYDVYTSGLLQALLRWLRALGSSAVVMTATLPPSRRRALLEAWAGEEVE
GQDLGPYPRVVLVGEGVKARSLPPAREVEVALEVLREVDVEPLAQRLKGALPGAVGAIVNTVDRAQDLYRALGEGTPLTL
EELARRLGGISGGQAWEEVRQALPERGGEVVGKVLTDGTLVFLLHARFPAEERALRGSVVLALFGKGGPRPPRAILVATQ
VAEQSLDLDFDLLYTDLAPIDLLFQRSGRLHRHERPRPEEHARPRLLLGVPEDLDFGKPLYWDKVYEDYVLLATWRALSG
RDRLRVPGDLEALLEEIYEGENPESFPEGLRERAKKSLKALQERRDREANTARRLSLSELDRLLAYWDEGALVAQERLED
DEEKAETQRLLTRLGDPSVAVVPLFRVGEGLFLDREGRRRAPLKGEVSREEAEALFRRAVRLSRFPLPQELLKEEPPPAW
RKSGLLRGLRPLEVGRVFRSGERAFQVELDPELGVVYLPV
>A0Q5Y6 3.1.12.1~~~cas4~~~CRISPR-associated exonuclease Cas4~~~
MNMDIFDNDLSIPVNLIRQWCFCPRIVYYQELLAIKPNKPLWVAQGEEFHKKVEQLEKRRSFSRYGLENAIRHFNLSIKS
QKYKLHGIVDWVIETDTNVYVVEYKTNPNPNSLGHKLQIAAYALLVQEYFAKPCKTTFLTSDKKSYEIKITDELINKLIK
TISDILSTLDSGNKPDSSASDHQCIQCEYYNFCNDR
>Q65TW5 3.1.-.-~~~cas5d~~~CRISPR pre-crRNA endoribonuclease Cas5d~~~
MANRIRLHIWGDYACFTRPEMKVERVSYDVITPSAARGILSAIHWKPAINWVIDKIYVLKPIRFESVRRNELGAKISESK
VSGAMKRKSVADLYTVIEDDRQQRAATVLKDVAYVIEAHAVMTSKAGVDENTTKHIEMFKRRALKGQCFQQPCMGVREFP
AHFALIDDNDPLPLSQLSESEFNRDLGWMLHDIDFEHGNTPHFFRAELKNGVIDVPPFYAEEVKR
>Q746C2 3.1.-.-~~~cas5d~~~CRISPR pre-crRNA endoribonuclease Cas5d~~~
MARLKVKVWGEYACFSRPEFKVERVSYPVPTPSAARGLLEAIFWKPEFRYEVRRIGVLRLGTPFALLRNEVGNRMGAKPF
FVEDARQQRTSLVLKDVAYLVEADMVLRPHATDPLPKYLEQFERRLKKGQYHHTPYLGTREFPAYFSPPDGEVPDGGLNL
DLGPMLFDLAFVEDPGRPELTFKRPGRGEVQGYALPLFFHARIREGWLEVPAEKYQELYRLEEGHAKGA
>Q46898 ~~~casD~~~CRISPR system Cascade subunit CasD~~~
MRSYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSESVQFAVRCDELILDDRRVSVT
GLRDYHTVLGAREDYRGLKSHETIQTWREYLCDASFTVALWLTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLG
TCQASDPQKALLNYEPVGGDIYSEESVTGHHLKFTARDEPMITLPRQFASREWYVIKGGMDVSQ
>Q46897 3.1.-.-~~~casE~~~CRISPR system Cascade subunit CasE~~~
MYLSKVIIARAWSRDLYQLHQGLWHLFPNRPDAARDFLFHVEKRNTPEGCHVLLQSAQMPVSTAVATVIKTKQVEFQLQV
GVPLYFRLRANPIKTILDNQKRLDSKGNIKRCRVPLIKEAEQIAWLQRKLGNAARVEDVHPISERPQYFSGDGKSGKIQT
VCFEGVLTINDAPALIDLVQQGIGPAKSMGCGLLSLAPL
>Q1RE32 3.1.-.-~~~cas6f~~~CRISPR-associated endonuclease Cas6/Csy4~~~
MDHYLEIRVLPDPEFSSEMLMAALFAKLHRVLGARGQGDIGVSFPDVNVMPGARLRLHGSAQALQALEASTWRKGLTDYC
QCSPVTPVPEIKGWRVVSRVQVKSNPQRLLRRSVKKGWLTEEQAIERLATQAEQRTDLPFLNMKSLSSQQLFKLFIRHGD
LLKEPVKGEFSSYGLSATATIPWF
>P9WPJ1 3.1.-.-~~~cas6~~~CRISPR-associated endoribonuclease Cas6~~~COG5551
MAARRGGIRRTDLLRRSGQPRGRHRASAAESGLTWISPTLILVGFSHRGDRRMTEHLSRLTLTLEVDAPLERARVATLGP
HLHGVLMESIPADYVQTLHTVPVNPYSQYALARSTTSLEWKISTLTNEARQQIVGPINDAAFAGFRLRASGIATQVTSRS
LEQNPLSQFARIFYARPETRKFRVEFLTPTAFKQSGEYVFWPDPRLVFQSLAQKYGAIVDGEEPDPGLIAEFGQSVRLSA
FRVASAPFAVGAARVPGFTGSATFTVRGVDTFASYIAALLWFGEFSGCGIKASMGMGAIRVQPLAPREKCVPKP
>Q1CW45 3.1.-.-~~~cas6~~~CRISPR-associated endonuclease Cas6~~~
MARSVGWRRSALITVSGCVILNWWGNLMVFVDLLFPVQGGPVPLDHAYLLFSALSRHLPALHERSDMGVFSLRGVSNTRE
LLYLGRGTMRLRCPIEAVATLLPLVSAPLEIAGRRLSLGAPSLHALEPVPSLFARLVTFKHAMDEAAFVAAASRALEALG
VQATLKVGRRRIVRITGKKVVGFALELHGLSAEHSLRVQEQGMGGRRHMGCGLFLPPGRAARVQSHGKAA
>Q6D0W5 3.1.-.-~~~cas6f~~~CRISPR-associated endonuclease Cas6f/Csy4~~~
MDHYIDIRVQPDPEFTASQLLNALFAKLHRVLGQLANGKIGISFPEVGKTLGECLRLHGTEDALSTLEKTSWLKGLRDYT
QVSECKVVPNGVKFRTVRRVQLKSSAERLRRRSVSKGWLTAAEAAARIPDAVEKRSALPFVQIKSLSNGQMFFVFVEHGP
LQNAPTAGRFSSYGLSTEATVPWF
>Q02MM2 3.1.-.-~~~cas6f~~~CRISPR-associated endonuclease Cas6/Csy4~~~
MDHYLDIRLRPDPEFPPAQLMSVLFGKLHQALVAQGGDRIGVSFPDLDESRSRLGERLRIHASADDLRALLARPWLEGLR
DHLQFGEPAVVPHPTPYRQVSRVQAKSNPERLRRRLMRRHDLSEEEARKRIPDTVARALDLPFVTLRSQSTGQHFRLFIR
HGPLQVTAEEGGFTCYGLSKGGFVPWF
>A0A0A7HF73 3.1.-.-~~~cas6~~~CRISPR-associated endoribonuclease Cas6~~~
MKKLVFTFKRIDHPAQDLAVKFHGFLMEQLDSDYVDYLHQQQTNPYATKVIQGKENTQWVVHLLTDDHEDKVFMTLLQIK
EVSLNDLPKLSVEKVEIQELGADKLLEIFNSEENQTYFSIIFETPTGFKSQGSYVIFPSMRLIFQSLMQKYGRLVENQPE
IEEDTLDYLSEHSTITNYRLETSYFRVHRQRIPAFRGKLTFKVQGAKTLKAYVKMLLTFGEYSGLGMKTSLGMGGIKLEE
RKD
>Q53WG9 3.1.-.-~~~cse3~~~CRISPR-associated endoribonuclease Cse3~~~
MWLTKLVLNPASRAARRDLANPYEMHRTLSKAVSRALEEGRERLLWRLEPARGLEPPVVLVQTLTEPDWSVLDEGYAQVF
PPKPFHPALKPGQRLRFRLRANPAKRLAATGKRVALKTPAEKVAWLERRLEEGGFRLLEGERGPWVQILQDTFLEVRRKK
DGEEAGKLLQVQAVLFEGRLEVVDPERALATLRRGVGPGKALGLGLLSVAP
>Q03LF7 3.1.-.-~~~cas9-1~~~CRISPR-associated endonuclease Cas9 1~~~
MSDLVLGLDIGIGSVGVGILNKVTGEIIHKNSRIFPAAQAENNLVRRTNRQGRRLARRKKHRRVRLNRLFEESGLITDFT
KISINLNPYQLRVKGLTDELSNEELFIALKNMVKHRGISYLDDASDDGNSSVGDYAQIVKENSKQLETKTPGQIQLERYQ
TYGQLRGDFTVEKDGKKHRLINVFPTSAYRSEALRILQTQQEFNPQITDEFINRYLEILTGKRKYYHGPGNEKSRTDYGR
YRTSGETLDNIFGILIGKCTFYPDEFRAAKASYTAQEFNLLNDLNNLTVPTETKKLSKEQKNQIINYVKNEKAMGPAKLF
KYIAKLLSCDVADIKGYRIDKSGKAEIHTFEAYRKMKTLETLDIEQMDRETLDKLAYVLTLNTEREGIQEALEHEFADGS
FSQKQVDELVQFRKANSSIFGKGWHNFSVKLMMELIPELYETSEEQMTILTRLGKQKTTSSSNKTKYIDEKLLTEEIYNP
VVAKSVRQAIKIVNAAIKEYGDFDNIVIEMARETNEDDEKKAIQKIQKANKDEKDAAMLKAANQYNGKAELPHSVFHGHK
QLATKIRLWHQQGERCLYTGKTISIHDLINNSNQFEVDHILPLSITFDDSLANKVLVYATANQEKGQRTPYQALDSMDDA
WSFRELKAFVRESKTLSNKKKEYLLTEEDISKFDVRKKFIERNLVDTRYASRVVLNALQEHFRAHKIDTKVSVVRGQFTS
QLRRHWGIEKTRDTYHHHAVDALIIAASSQLNLWKKQKNTLVSYSEDQLLDIETGELISDDEYKESVFKAPYQHFVDTLK
SKEFEDSILFSYQVDSKFNRKISDATIYATRQAKVGKDKADETYVLGKIKDIYTQDGYDAFMKIYKKDKSKFLMYRHDPQ
TFEKVIEPILENYPNKQINEKGKEVPCNPFLKYKEEHGYIRKYSKKGNGPEIKSLKYYDSKLGNHIDITPKDSNNKVVLQ
SVSPWRADVYFNKTTGKYEILGLKYADLQFEKGTGTYKISQEKYNDIKKKEGVDSDSEFKFTLYKNDLLLVKDTETKEQQ
LFRFLSRTMPKQKHYVELKPYDKQKFEGGEALIKVLGNVANSGQCKKGLGKSNISIYKVRTDVLGNQHIIKNEGDKPKLD
F
>Q03JI6 3.1.-.-~~~cas9-2~~~CRISPR-associated endonuclease Cas9 2~~~
MTKPYSIGLDIGTNSVGWAVTTDNYKVPSKKMKVLGNTSKKYIKKNLLGVLLFDSGITAEGRRLKRTARRRYTRRRNRIL
YLQEIFSTEMATLDDAFFQRLDDSFLVPDDKRDSKYPIFGNLVEEKAYHDEFPTIYHLRKYLADSTKKADLRLVYLALAH
MIKYRGHFLIEGEFNSKNNDIQKNFQDFLDTYNAIFESDLSLENSKQLEEIVKDKISKLEKKDRILKLFPGEKNSGIFSE
FLKLIVGNQADFRKCFNLDEKASLHFSKESYDEDLETLLGYIGDDYSDVFLKAKKLYDAILLSGFLTVTDNETEAPLSSA
MIKRYNEHKEDLALLKEYIRNISLKTYNEVFKDDTKNGYAGYIDGKTNQEDFYVYLKKLLAEFEGADYFLEKIDREDFLR
KQRTFDNGSIPYQIHLQEMRAILDKQAKFYPFLAKNKERIEKILTFRIPYYVGPLARGNSDFAWSIRKRNEKITPWNFED
VIDKESSAEAFINRMTSFDLYLPEEKVLPKHSLLYETFNVYNELTKVRFIAESMRDYQFLDSKQKKDIVRLYFKDKRKVT
DKDIIEYLHAIYGYDGIELKGIEKQFNSSLSTYHDLLNIINDKEFLDDSSNEAIIEEIIHTLTIFEDREMIKQRLSKFEN
IFDKSVLKKLSRRHYTGWGKLSAKLINGIRDEKSGNTILDYLIDDGISNRNFMQLIHDDALSFKKKIQKAQIIGDEDKGN
IKEVVKSLPGSPAIKKGILQSIKIVDELVKVMGGRKPESIVVEMARENQYTNQGKSNSQQRLKRLEKSLKELGSKILKEN
IPAKLSKIDNNALQNDRLYLYYLQNGKDMYTGDDLDIDRLSNYDIDHIIPQAFLKDNSIDNKVLVSSASNRGKSDDVPSL
EVVKKRKTFWYQLLKSKLISQRKFDNLTKAERGGLSPEDKAGFIQRQLVETRQITKHVARLLDEKFNNKKDENNRAVRTV
KIITLKSTLVSQFRKDFELYKVREINDFHHAHDAYLNAVVASALLKKYPKLEPEFVYGDYPKYNSFRERKSATEKVYFYS
NIMNIFKKSISLADGRVIERPLIEVNEETGESVWNKESDLATVRRVLSYPQVNVVKKVEEQNHGLDRGKPKGLFNANLSS
KPKPNSNENLVGAKEYLDPKKYGGYAGISNSFTVLVKGTIEKGAKKKITNVLEFQGISILDRINYRKDKLNFLLEKGYKD
IELIIELPKYSLFELSDGSRRMLASILSTNNKRGEIHKGNQIFLSQKFVKLLYHAKRISNTINENHRKYVENHKKEFEEL
FYYILEFNENYVGAKKNGKLLNSAFQSWQNHSIDELCSSFIGPTGSERKGLFELTSRGSAADFEFLGVKIPRYRDYTPSS
LLKDATLIHQSVTGLYETRIDLAKLGEG
>J3F2B0 3.1.-.-~~~cas9~~~CRISPR-associated endonuclease Cas9~~~COG3513
MWYASLMSAHHLRVGIDVGTHSVGLATLRVDDHGTPIELLSALSHIHDSGVGKEGKKDHDTRKKLSGIARRARRLLHHRR
TQLQQLDEVLRDLGFPIPTPGEFLDLNEQTDPYRVWRVRARLVEEKLPEELRGPAISMAVRHIARHRGWRNPYSKVESLL
SPAEESPFMKALRERILATTGEVLDDGITPGQAMAQVALTHNISMRGPEGILGKLHQSDNANEIRKICARQGVSPDVCKQ
LLRAVFKADSPRGSAVSRVAPDPLPGQGSFRRAPKCDPEFQRFRIISIVANLRISETKGENRPLTADERRHVVTFLTEDS
QADLTWVDVAEKLGVHRRDLRGTAVHTDDGERSAARPPIDATDRIMRQTKISSLKTWWEEADSEQRGAMIRYLYEDPTDS
ECAEIIAELPEEDQAKLDSLHLPAGRAAYSRESLTALSDHMLATTDDLHEARKRLFGVDDSWAPPAEAINAPVGNPSVDR
TLKIVGRYLSAVESMWGTPEVIHVEHVRDGFTSERMADERDKANRRRYNDNQEAMKKIQRDYGKEGYISRGDIVRLDALE
LQGCACLYCGTTIGYHTCQLDHIVPQAGPGSNNRRGNLVAVCERCNRSKSNTPFAVWAQKCGIPHVGVKEAIGRVRGWRK
QTPNTSSEDLTRLKKEVIARLRRTQEDPEIDERSMESVAWMANELHHRIAAAYPETTVMVYRGSITAAARKAAGIDSRIN
LIGEKGRKDRIDRRHHAVDASVVALMEASVAKTLAERSSLRGEQRLTGKEQTWKQYTGSTVGAREHFEMWRGHMLHLTEL
FNERLAEDKVYVTQNIRLRLSDGNAHTVNPSKLVSHRLGDGLTVQQIDRACTPALWCALTREKDFDEKNGLPAREDRAIR
VHGHEIKSSDYIQVFSKRKKTDSDRDETPFGAIAVRGGFVEIGPSIHHARIYRVEGKKPVYAMLRVFTHDLLSQRHGDLF
SAVIPPQSISMRCAEPKLRKAITTGNATYLGWVVVGDELEINVDSFTKYAIGRFLEDFPNTTRWRICGYDTNSKLTLKPI
VLAAEGLENPSSAVNEIVELKGWRVAINVLTKVHPTVVRRDALGRPRYSSRSNLPTSWTIE
>Q0P897 ~~~cas9~~~CRISPR-associated endonuclease Cas9~~~COG3513
MARILAFDIGISSIGWAFSENDELKDCGVRIFTKVENPKTGESLALPRRLARSARKRLARRKARLNHLKHLIANEFKLNY
EDYQSFDESLAKAYKGSLISPYELRFRALNELLSKQDFARVILHIAKRRGYDDIKNSDDKEKGAILKAIKQNEEKLANYQ
SVGEYLYKEYFQKFKENSKEFTNVRNKKESYERCIAQSFLKDELKLIFKKQREFGFSFSKKFEEEVLSVAFYKRALKDFS
HLVGNCSFFTDEKRAPKNSPLAFMFVALTRIINLLNNLKNTEGILYTKDDLNALLNEVLKNGTLTYKQTKKLLGLSDDYE
FKGEKGTYFIEFKKYKEFIKALGEHNLSQDDLNEIAKDITLIKDEIKLKKALAKYDLNQNQIDSLSKLEFKDHLNISFKA
LKLVTPLMLEGKKYDEACNELNLKVAINEDKKDFLPAFNETYYKDEVTNPVVLRAIKEYRKVLNALLKKYGKVHKINIEL
AREVGKNHSQRAKIEKEQNENYKAKKDAELECEKLGLKINSKNILKLRLFKEQKEFCAYSGEKIKISDLQDEKMLEIDHI
YPYSRSFDDSYMNKVLVFTKQNQEKLNQTPFEAFGNDSAKWQKIEVLAKNLPTKKQKRILDKNYKDKEQKNFKDRNLNDT
RYIARLVLNYTKDYLDFLPLSDDENTKLNDTQKGSKVHVEAKSGMLTSALRHTWGFSAKDRNNHLHHAIDAVIIAYANNS
IVKAFSDFKKEQESNSAELYAKKISELDYKNKRKFFEPFSGFRQKVLDKIDEIFVSKPERKKPSGALHEETFRKEEEFYQ
SYGGKEGVLKALELGKIRKVNGKIVKNGDMFRVDIFKHKKTNKFYAVPIYTMDFALKVLPNKAVARSKKGEIKDWILMDE
NYEFCFSLYKDSLILIQTKDMQEPEFVYYNAFTSSTVSLIVSKHDNKFETLSKNQKILFKNANEKEVIAKSIGIQNLKVF
EKYIVSALGEVTKAEFRQREDFKK
>Q6NKI3 3.1.-.-~~~cas9~~~CRISPR-associated endonuclease Cas9~~~
MKYHVGIDVGTFSVGLAAIEVDDAGMPIKTLSLVSHIHDSGLDPDEIKSAVTRLASSGIARRTRRLYRRKRRRLQQLDKF
IQRQGWPVIELEDYSDPLYPWKVRAELAASYIADEKERGEKLSVALRHIARHRGWRNPYAKVSSLYLPDGPSDAFKAIRE
EIKRASGQPVPETATVGQMVTLCELGTLKLRGEGGVLSARLQQSDYAREIQEICRMQEIGQELYRKIIDVVFAAESPKGS
ASSRVGKDPLQPGKNRALKASDAFQRYRIAALIGNLRVRVDGEKRILSVEEKNLVFDHLVNLTPKKEPEWVTIAEILGID
RGQLIGTATMTDDGERAGARPPTHDTNRSIVNSRIAPLVDWWKTASALEQHAMVKALSNAEVDDFDSPEGAKVQAFFADL
DDDVHAKLDSLHLPVGRAAYSEDTLVRLTRRMLSDGVDLYTARLQEFGIEPSWTPPTPRIGEPVGNPAVDRVLKTVSRWL
ESATKTWGAPERVIIEHVREGFVTEKRAREMDGDMRRRAARNAKLFQEMQEKLNVQGKPSRADLWRYQSVQRQNCQCAYC
GSPITFSNSEMDHIVPRAGQGSTNTRENLVAVCHRCNQSKGNTPFAIWAKNTSIEGVSVKEAVERTRHWVTDTGMRSTDF
KKFTKAVVERFQRATMDEEIDARSMESVAWMANELRSRVAQHFASHGTTVRVYRGSLTAEARRASGISGKLKFFDGVGKS
RLDRRHHAIDAAVIAFTSDYVAETLAVRSNLKQSQAHRQEAPQWREFTGKDAEHRAAWRVWCQKMEKLSALLTEDLRDDR
VVVMSNVRLRLGNGSAHKETIGKLSKVKLSSQLSVSDIDKASSEALWCALTREPGFDPKEGLPANPERHIRVNGTHVYAG
DNIGLFPVSAGSIALRGGYAELGSSFHHARVYKITSGKKPAFAMLRVYTIDLLPYRNQDLFSVELKPQTMSMRQAEKKLR
DALATGNAEYLGWLVVDDELVVDTSKIATDQVKAVEAELGTIRRWRVDGFFSPSKLRLRPLQMSKEGIKKESAPELSKII
DRPGWLPAVNKLFSDGNVTVVRRDSLGRVRLESTAHLPVTWKVQ
>A0Q5Y3 3.1.-.-~~~cas9~~~CRISPR-associated endonuclease Cas9~~~
MNFKILPIAIDLGVKNTGVFSAFYQKGTSLERLDNKNGKVYELSKDSYTLLMNNRTARRHQRRGIDRKQLVKRLFKLIWT
EQLNLEWDKDTQQAISFLFNRRGFSFITDGYSPEYLNIVPEQVKAILMDIFDDYNGEDDLDSYLKLATEQESKISEIYNK
LMQKILEFKLMKLCTDIKDDKVSTKTLKEITSYEFELLADYLANYSESLKTQKFSYTDKQGNLKELSYYHHDKYNIQEFL
KRHATINDRILDTLLTDDLDIWNFNFEKFDFDKNEEKLQNQEDKDHIQAHLHHFVFAVNKIKSEMASGGRHRSQYFQEIT
NVLDENNHQEGYLKNFCENLHNKKYSNLSVKNLVNLIGNLSNLELKPLRKYFNDKIHAKADHWDEQKFTETYCHWILGEW
RVGVKDQDKKDGAKYSYKDLCNELKQKVTKAGLVDFLLELDPCRTIPPYLDNNNRKPPKCQSLILNPKFLDNQYPNWQQY
LQELKKLQSIQNYLDSFETDLKVLKSSKDQPYFVEYKSSNQQIASGQRDYKDLDARILQFIFDRVKASDELLLNEIYFQA
KKLKQKASSELEKLESSKKLDEVIANSQLSQILKSQHTNGIFEQGTFLHLVCKYYKQRQRARDSRLYIMPEYRYDKKLHK
YNNTGRFDDDNQLLTYCNHKPRQKRYQLLNDLAGVLQVSPNFLKDKIGSDDDLFISKWLVEHIRGFKKACEDSLKIQKDN
RGLLNHKINIARNTKGKCEKEIFNLICKIEGSEDKKGNYKHGLAYELGVLLFGEPNEASKPEFDRKIKKFNSIYSFAQIQ
QIAFAERKGNANTCAVCSADNAHRMQQIKITEPVEDNKDKIILSAKAQRLPAIPTRIVDGAVKKMATILAKNIVDDNWQN
IKQVLSAKHQLHIPIITESNAFEFEPALADVKGKSLKDRRKKALERISPENIFKDKNNRIKEFAKGISAYSGANLTDGDF
DGAKEELDHIIPRSHKKYGTLNDEANLICVTRGDNKNKGNRIFCLRDLADNYKLKQFETTDDLEIEKKIADTIWDANKKD
FKFGNYRSFINLTPQEQKAFRHALFLADENPIKQAVIRAINNRNRTFVNGTQRYFAEVLANNIYLRAKKENLNTDKISFD
YFGIPTIGNGRGIAEIRQLYEKVDSDIQAYAKGDKPQASYSHLIDAMLAFCIAADEHRNDGSIGLEIDKNYSLYPLDKNT
GEVFTKDIFSQIKITDNEFSDKKLVRKKAIEGFNTHRQMTRDGIYAENYLPILIHKELNEVRKGYTWKNSEEIKIFKGKK
YDIQQLNNLVYCLKFVDKPISIDIQISTLEELRNILTTNNIAATAEYYYINLKTQKLHEYYIENYNTALGYKKYSKEMEF
LRSLAYRSERVKIKSIDDVKQVLDKDSNFIIGKITLPFKKEWQRLYREWQNTTIKDDYEFLKSFFNVKSITKLHKKVRKD
FSLPISTNEGKFLVKRKTWDNNFIYQILNDSDSRADGTKPFIPAFDISKNEIVEAIIDSFTSKNIFWLPKNIELQKVDNK
NIFAIDTSKWFEVETPSDLRDIGIATIQYKIDNNSRPKVRVKLDYVIDDDSKINYFMNHSLLKSRYPDKVLEILKQSTII
EFESSGFNKTIKEMLGMKLAGIYNETSNN
>Q927P4 3.1.-.-~~~cas9~~~CRISPR-associated endonuclease Cas9~~~COG3513
MKKPYTIGLDIGTNSVGWAVLTDQYDLVKRKMKIAGDSEKKQIKKNFWGVRLFDEGQTAADRRMARTARRRIERRRNRIS
YLQGIFAEEMSKTDANFFCRLSDSFYVDNEKRNSRHPFFATIEEEVEYHKNYPTIYHLREELVNSSEKADLRLVYLALAH
IIKYRGNFLIEGALDTQNTSVDGIYKQFIQTYNQVFASGIEDGSLKKLEDNKDVAKILVEKVTRKEKLERILKLYPGEKS
AGMFAQFISLIVGSKGNFQKPFDLIEKSDIECAKDSYEEDLESLLALIGDEYAELFVAAKNAYSAVVLSSIITVAETETN
AKLSASMIERFDTHEEDLGELKAFIKLHLPKHYEEIFSNTEKHGYAGYIDGKTKQADFYKYMKMTLENIEGADYFIAKIE
KENFLRKQRTFDNGAIPHQLHLEELEAILHQQAKYYPFLKENYDKIKSLVTFRIPYFVGPLANGQSEFAWLTRKADGEIR
PWNIEEKVDFGKSAVDFIEKMTNKDTYLPKENVLPKHSLCYQKYLVYNELTKVRYINDQGKTSYFSGQEKEQIFNDLFKQ
KRKVKKKDLELFLRNMSHVESPTIEGLEDSFNSSYSTYHDLLKVGIKQEILDNPVNTEMLENIVKILTVFEDKRMIKEQL
QQFSDVLDGVVLKKLERRHYTGWGRLSAKLLMGIRDKQSHLTILDYLMNDDGLNRNLMQLINDSNLSFKSIIEKEQVTTA
DKDIQSIVADLAGSPAIKKGILQSLKIVDELVSVMGYPPQTIVVEMARENQTTGKGKNNSRPRYKSLEKAIKEFGSQILK
EHPTDNQELRNNRLYLYYLQNGKDMYTGQDLDIHNLSNYDIDHIVPQSFITDNSIDNLVLTSSAGNREKGDDVPPLEIVR
KRKVFWEKLYQGNLMSKRKFDYLTKAERGGLTEADKARFIHRQLVETRQITKNVANILHQRFNYEKDDHGNTMKQVRIVT
LKSALVSQFRKQFQLYKVRDVNDYHHAHDAYLNGVVANTLLKVYPQLEPEFVYGDYHQFDWFKANKATAKKQFYTNIMLF
FAQKDRIIDENGEILWDKKYLDTVKKVMSYRQMNIVKKTEIQKGEFSKATIKPKGNSSKLIPRKTNWDPMKYGGLDSPNM
AYAVVIEYAKGKNKLVFEKKIIRVTIMERKAFEKDEKAFLEEQGYRQPKVLAKLPKYTLYECEEGRRRMLASANEAQKGN
QQVLPNHLVTLLHHAANCEVSDGKSLDYIESNREMFAELLAHVSEFAKRYTLAEANLNKINQLFEQNKEGDIKAIAQSFV
DLMAFNAMGAPASFKFFETTIERKRYNNLKELLNSTIIYQSITGLYESRKRLDD
>C9X1G5 3.1.-.-~~~cas9~~~CRISPR-associated endonuclease Cas9~~~
MAAFKPNSINYILGLDIGIASVGWAMVEIDEEENPIRLIDLGVRVFERAEVPKTGDSLAMARRLARSVRRLTRRRAHRLL
RTRRLLKREGVLQAANFDENGLIKSLPNTPWQLRAAALDRKLTPLEWSAVLLHLIKHRGYLSQRKNEGETADKELGALLK
GVAGNAHALQTGDFRTPAELALNKFEKESGHIRNQRSDYSHTFSRKDLQAELILLFEKQKEFGNPHVSGGLKEGIETLLM
TQRPALSGDAVQKMLGHCTFEPAEPKAAKNTYTAERFIWLTKLNNLRILEQGSERPLTDTERATLMDEPYRKSKLTYAQA
RKLLGLEDTAFFKGLRYGKDNAEASTLMEMKAYHAISRALEKEGLKDKKSPLNLSPELQDEIGTAFSLFKTDEDITGRLK
DRIQPEILEALLKHISFDKFVQISLKALRRIVPLMEQGKRYDEACAEIYGDHYGKKNTEEKIYLPPIPADEIRNPVVLRA
LSQARKVINGVVRRYGSPARIHIETAREVGKSFKDRKEIEKRQEENRKDREKAAAKFREYFPNFVGEPKSKDILKLRLYE
QQHGKCLYSGKEINLGRLNEKGYVEIDHALPFSRTWDDSFNNKVLVLGSENQNKGNQTPYEYFNGKDNSREWQEFKARVE
TSRFPRSKKQRILLQKFDEDGFKERNLNDTRYVNRFLCQFVADRMRLTGKGKKRVFASNGQITNLLRGFWGLRKVRAEND
RHHALDAVVVACSTVAMQQKITRFVRYKEMNAFDGKTIDKETGEVLHQKTHFPQPWEFFAQEVMIRVFGKPDGKPEFEEA
DTLEKLRTLLAEKLSSRPEAVHEYVTPLFVSRAPNRKMSGQGHMETVKSAKRLDEGVSVLRVPLTQLKLKDLEKMVNRER
EPKLYEALKARLEAHKDDPAKAFAEPFYKYDKAGNRTQQVKAVRVEQVQKTGVWVRNHNGIADNATMVRVDVFEKGDKYY
LVPIYSWQVAKGILPDRAVVQGKDEEDWQLIDDSFNFKFSLHPNDLVEVITKKARMFGYFASCHRGTGNINIRIHDLDHK
IGKNGILEGIGVKTALSFQKYQIDELGKEIRPCRLKKRPPVR
>A1IQ68 3.1.-.-~~~cas9~~~CRISPR-associated endonuclease Cas9~~~
MAAFKPNPINYILGLDIGIASVGWAMVEIDEDENPICLIDLGVRVFERAEVPKTGDSLAMARRLARSVRRLTRRRAHRLL
RARRLLKREGVLQAADFDENGLIKSLPNTPWQLRAAALDRKLTPLEWSAVLLHLIKHRGYLSQRKNEGETADKELGALLK
GVADNAHALQTGDFRTPAELALNKFEKESGHIRNQRGDYSHTFSRKDLQAELILLFEKQKEFGNPHVSGGLKEGIETLLM
TQRPALSGDAVQKMLGHCTFEPAEPKAAKNTYTAERFIWLTKLNNLRILEQGSERPLTDTERATLMDEPYRKSKLTYAQA
RKLLGLEDTAFFKGLRYGKDNAEASTLMEMKAYHAISRALEKEGLKDKKSPLNLSPELQDEIGTAFSLFKTDEDITGRLK
DRIQPEILEALLKHISFDKFVQISLKALRRIVPLMEQGKRYDEACAEIYGDHYGKKNTEEKIYLPPIPADEIRNPVVLRA
LSQARKVINGVVRRYGSPARIHIETAREVGKSFKDRKEIEKRQEENRKDREKAAAKFREYFPNFVGEPKSKDILKLRLYE
QQHGKCLYSGKEINLGRLNEKGYVEIDHALPFSRTWDDSFNNKVLVLGSENQNKGNQTPYEYFNGKDNSREWQEFKARVE
TSRFPRSKKQRILLQKFDEDGFKERNLNDTRYVNRFLCQFVADRMRLTGKGKKRVFASNGQITNLLRGFWGLRKVRAEND
RHHALDAVVVACSTVAMQQKITRFVRYKEMNAFDGKTIDKETGEVLHQKTHFPQPWEFFAQEVMIRVFGKPDGKPEFEEA
DTPEKLRTLLAEKLSSRPEAVHEYVTPLFVSRAPNRKMSGQGHMETVKSAKRLDEGVSVLRVPLTQLKLKDLEKMVNRER
EPKLYEALKARLEAHKDDPAKAFAEPFYKYDKAGNRTQQVKAVRVEQVQKTGVWVRNHNGIADNATMVRVDVFEKGDKYY
LVPIYSWQVAKGILPDRAVVQGKDEEDWQLIDDSFNFKFSLHPNDLVEVITKKARMFGYFASCHRGTGNINIRIHDLDHK
IGKNGILEGIGVKTALSFQKYQIDELGKEIRPCRLKKRPPVR
>Q9CLT2 3.1.-.-~~~cas9~~~CRISPR-associated endonuclease Cas9~~~
MQTTNLSYILGLDLGIASVGWAVVEINENEDPIGLIDVGVRIFERAEVPKTGESLALSRRLARSTRRLIRRRAHRLLLAK
RFLKREGILSTIDLEKGLPNQAWELRVAGLERRLSAIEWGAVLLHLIKHRGYLSKRKNESQTNNKELGALLSGVAQNHQL
LQSDDYRTPAELALKKFAKEEGHIRNQRGAYTHTFNRLDLLAELNLLFAQQHQFGNPHCKEHIQQYMTELLMWQKPALSG
EAILKMLGKCTHEKNEFKAAKHTYSAERFVWLTKLNNLRILEDGAERALNEEERQLLINHPYEKSKLTYAQVRKLLGLSE
QAIFKHLRYSKENAESATFMELKAWHAIRKALENQGLKDTWQDLAKKPDLLDEIGTAFSLYKTDEDIQQYLTNKVPNSVI
NALLVSLNFDKFIELSLKSLRKILPLMEQGKRYDQACREIYGHHYGEANQKTSQLLPAIPAQEIRNPVVLRTLSQARKVI
NAIIRQYGSPARVHIETGRELGKSFKERREIQKQQEDNRTKRESAVQKFKELFSDFSSEPKSKDILKFRLYEQQHGKCLY
SGKEINIHRLNEKGYVEIDHALPFSRTWDDSFNNKVLVLASENQNKGNQTPYEWLQGKINSERWKNFVALVLGSQCSAAK
KQRLLTQVIDDNKFIDRNLNDTRYIARFLSNYIQENLLLVGKNKKNVFTPNGQITALLRSRWGLIKARENNNRHHALDAI
VVACATPSMQQKITRFIRFKEVHPYKIENRYEMVDQESGEIISPHFPEPWAYFRQEVNIRVFDNHPDTVLKEMLPDRPQA
NHQFVQPLFVSRAPTRKMSGQGHMETIKSAKRLAEGISVLRIPLTQLKPNLLENMVNKEREPALYAGLKARLAEFNQDPA
KAFATPFYKQGGQQVKAIRVEQVQKSGVLVRENNGVADNASIVRTDVFIKNNKFFLVPIYTWQVAKGILPNKAIVAHKNE
DEWEEMDEGAKFKFSLFPNDLVELKTKKEYFFGYYIGLDRATGNISLKEHDGEISKGKDGVYRVGVKLALSFEKYQVDEL
GKNRQICRPQQRQPVR
>J7RUA5 3.1.-.-~~~cas9~~~CRISPR-associated endonuclease Cas9~~~
MKRNYILGLDIGITSVGYGIIDYETRDVIDAGVRLFKEANVENNEGRRSKRGARRLKRRRRHRIQRVKKLLFDYNLLTDH
SELSGINPYEARVKGLSQKLSEEEFSAALLHLAKRRGVHNVNEVEEDTGNELSTKEQISRNSKALEEKYVAELQLERLKK
DGEVRGSINRFKTSDYVKEAKQLLKVQKAYHQLDQSFIDTYIDLLETRRTYYEGPGEGSPFGWKDIKEWYEMLMGHCTYF
PEELRSVKYAYNADLYNALNDLNNLVITRDENEKLEYYEKFQIIENVFKQKKKPTLKQIAKEILVNEEDIKGYRVTSTGK
PEFTNLKVYHDIKDITARKEIIENAELLDQIAKILTIYQSSEDIQEELTNLNSELTQEEIEQISNLKGYTGTHNLSLKAI
NLILDELWHTNDNQIAIFNRLKLVPKKVDLSQQKEIPTTLVDDFILSPVVKRSFIQSIKVINAIIKKYGLPNDIIIELAR
EKNSKDAQKMINEMQKRNRQTNERIEEIIRTTGKENAKYLIEKIKLHDMQEGKCLYSLEAIPLEDLLNNPFNYEVDHIIP
RSVSFDNSFNNKVLVKQEENSKKGNRTPFQYLSSSDSKISYETFKKHILNLAKGKGRISKTKKEYLLEERDINRFSVQKD
FINRNLVDTRYATRGLMNLLRSYFRVNNLDVKVKSINGGFTSFLRRKWKFKKERNKGYKHHAEDALIIANADFIFKEWKK
LDKAKKVMENQMFEEKQAESMPEIETEQEYKEIFITPHQIKHIKDFKDYKYSHRVDKKPNRELINDTLYSTRKDDKGNTL
IVNNLNGLYDKDNDKLKKLINKSPEKLLMYHHDPQTYQKLKLIMEQYGDEKNPLYKYYEETGNYLTKYSKKDNGPVIKKI
KYYGNKLNAHLDITDDYPNSRNKVVKLSLKPYRFDVYLDNGVYKFVTVKNLDVIKKENYYEVNSKCYEEAKKLKKISNQA
EFIASFYNNDLIKINGELYRVIGVNNDLLNRIEVNMIDITYREYLENMNDKRPPRIIKTIASKTQSIKKYSTDILGNLYE
VKSKKHPQIIKKG
>Q8DTE3 3.1.-.-~~~cas9~~~CRISPR-associated endonuclease Cas9~~~COG3513
MKKPYSIGLDIGTNSVGWAVVTDDYKVPAKKMKVLGNTDKSHIEKNLLGALLFDSGNTAEDRRLKRTARRRYTRRRNRIL
YLQEIFSEEMGKVDDSFFHRLEDSFLVTEDKRGERHPIFGNLEEEVKYHENFPTIYHLRQYLADNPEKVDLRLVYLALAH
IIKFRGHFLIEGKFDTRNNDVQRLFQEFLAVYDNTFENSSLQEQNVQVEEILTDKISKSAKKDRVLKLFPNEKSNGRFAE
FLKLIVGNQADFKKHFELEEKAPLQFSKDTYEEELEVLLAQIGDNYAELFLSAKKLYDSILLSGILTVTDVGTKAPLSAS
MIQRYNEHQMDLAQLKQFIRQKLSDKYNEVFSDVSKDGYAGYIDGKTNQEAFYKYLKGLLNKIEGSGYFLDKIEREDFLR
KQRTFDNGSIPHQIHLQEMRAIIRRQAEFYPFLADNQDRIEKLLTFRIPYYVGPLARGKSDFAWLSRKSADKITPWNFDE
IVDKESSAEAFINRMTNYDLYLPNQKVLPKHSLLYEKFTVYNELTKVKYKTEQGKTAFFDANMKQEIFDGVFKVYRKVTK
DKLMDFLEKEFDEFRIVDLTGLDKENKVFNASYGTYHDLCKILDKDFLDNSKNEKILEDIVLTLTLFEDREMIRKRLENY
SDLLTKEQVKKLERRHYTGWGRLSAELIHGIRNKESRKTILDYLIDDGNSNRNFMQLINDDALSFKEEIAKAQVIGETDN
LNQVVSDIAGSPAIKKGILQSLKIVDELVKIMGHQPENIVVEMARENQFTNQGRRNSQQRLKGLTDSIKEFGSQILKEHP
VENSQLQNDRLFLYYLQNGRDMYTGEELDIDYLSQYDIDHIIPQAFIKDNSIDNRVLTSSKENRGKSDDVPSKDVVRKMK
SYWSKLLSAKLITQRKFDNLTKAERGGLTDDDKAGFIKRQLVETRQITKHVARILDERFNTETDENNKKIRQVKIVTLKS
NLVSNFRKEFELYKVREINDYHHAHDAYLNAVIGKALLGVYPQLEPEFVYGDYPHFHGHKENKATAKKFFYSNIMNFFKK
DDVRTDKNGEIIWKKDEHISNIKKVLSYPQVNIVKKVEEQTGGFSKESILPKGNSDKLIPRKTKKFYWDTKKYGGFDSPI
VAYSILVIADIEKGKSKKLKTVKALVGVTIMEKMTFERDPVAFLERKGYRNVQEENIIKLPKYSLFKLENGRKRLLASAR
ELQKGNEIVLPNHLGTLLYHAKNIHKVDEPKHLDYVDKHKDEFKELLDVVSNFSKKYTLAEGNLEKIKELYAQNNGEDLK
ELASSFINLLTFTAIGAPATFKFFDKNIDRKRYTSTTEILNATLIHQSITGLYETRIDLNKLGGD
>Q99ZW2 3.1.-.-~~~cas9~~~CRISPR-associated endonuclease Cas9/Csn1~~~
MDKKYSIGLDIGTNSVGWAVITDEYKVPSKKFKVLGNTDRHSIKKNLIGALLFDSGETAEATRLKRTARRRYTRRKNRIC
YLQEIFSNEMAKVDDSFFHRLEESFLVEEDKKHERHPIFGNIVDEVAYHEKYPTIYHLRKKLVDSTDKADLRLIYLALAH
MIKFRGHFLIEGDLNPDNSDVDKLFIQLVQTYNQLFEENPINASGVDAKAILSARLSKSRRLENLIAQLPGEKKNGLFGN
LIALSLGLTPNFKSNFDLAEDAKLQLSKDTYDDDLDNLLAQIGDQYADLFLAAKNLSDAILLSDILRVNTEITKAPLSAS
MIKRYDEHHQDLTLLKALVRQQLPEKYKEIFFDQSKNGYAGYIDGGASQEEFYKFIKPILEKMDGTEELLVKLNREDLLR
KQRTFDNGSIPHQIHLGELHAILRRQEDFYPFLKDNREKIEKILTFRIPYYVGPLARGNSRFAWMTRKSEETITPWNFEE
VVDKGASAQSFIERMTNFDKNLPNEKVLPKHSLLYEYFTVYNELTKVKYVTEGMRKPAFLSGEQKKAIVDLLFKTNRKVT
VKQLKEDYFKKIECFDSVEISGVEDRFNASLGTYHDLLKIIKDKDFLDNEENEDILEDIVLTLTLFEDREMIEERLKTYA
HLFDDKVMKQLKRRRYTGWGRLSRKLINGIRDKQSGKTILDFLKSDGFANRNFMQLIHDDSLTFKEDIQKAQVSGQGDSL
HEHIANLAGSPAIKKGILQTVKVVDELVKVMGRHKPENIVIEMARENQTTQKGQKNSRERMKRIEEGIKELGSQILKEHP
VENTQLQNEKLYLYYLQNGRDMYVDQELDINRLSDYDVDHIVPQSFLKDDSIDNKVLTRSDKNRGKSDNVPSEEVVKKMK
NYWRQLLNAKLITQRKFDNLTKAERGGLSELDKAGFIKRQLVETRQITKHVAQILDSRMNTKYDENDKLIREVKVITLKS
KLVSDFRKDFQFYKVREINNYHHAHDAYLNAVVGTALIKKYPKLESEFVYGDYKVYDVRKMIAKSEQEIGKATAKYFFYS
NIMNFFKTEITLANGEIRKRPLIETNGETGEIVWDKGRDFATVRKVLSMPQVNIVKKTEVQTGGFSKESILPKRNSDKLI
ARKKDWDPKKYGGFDSPTVAYSVLVVAKVEKGKSKKLKSVKELLGITIMERSSFEKNPIDFLEAKGYKEVKKDLIIKLPK
YSLFELENGRKRMLASAGELQKGNELALPSKYVNFLYLASHYEKLKGSPEDNEQKQLFVEQHKHYLDEIIEQISEFSKRV
ILADANLDKVLSAYNKHRDKPIREQAENIIHLFTLTNLGAPAAFKYFDTTIDRKRYTSTKEVLDATLIHQSITGLYETRI
DLSQLGGD
>G3ECR1 3.1.-.-~~~cas9~~~CRISPR-associated endonuclease Cas9~~~COG3513
MLFNKCIIISINLDFSNKEKCMTKPYSIGLDIGTNSVGWAVITDNYKVPSKKMKVLGNTSKKYIKKNLLGVLLFDSGITA
EGRRLKRTARRRYTRRRNRILYLQEIFSTEMATLDDAFFQRLDDSFLVPDDKRDSKYPIFGNLVEEKVYHDEFPTIYHLR
KYLADSTKKADLRLVYLALAHMIKYRGHFLIEGEFNSKNNDIQKNFQDFLDTYNAIFESDLSLENSKQLEEIVKDKISKL
EKKDRILKLFPGEKNSGIFSEFLKLIVGNQADFRKCFNLDEKASLHFSKESYDEDLETLLGYIGDDYSDVFLKAKKLYDA
ILLSGFLTVTDNETEAPLSSAMIKRYNEHKEDLALLKEYIRNISLKTYNEVFKDDTKNGYAGYIDGKTNQEDFYVYLKNL
LAEFEGADYFLEKIDREDFLRKQRTFDNGSIPYQIHLQEMRAILDKQAKFYPFLAKNKERIEKILTFRIPYYVGPLARGN
SDFAWSIRKRNEKITPWNFEDVIDKESSAEAFINRMTSFDLYLPEEKVLPKHSLLYETFNVYNELTKVRFIAESMRDYQF
LDSKQKKDIVRLYFKDKRKVTDKDIIEYLHAIYGYDGIELKGIEKQFNSSLSTYHDLLNIINDKEFLDDSSNEAIIEEII
HTLTIFEDREMIKQRLSKFENIFDKSVLKKLSRRHYTGWGKLSAKLINGIRDEKSGNTILDYLIDDGISNRNFMQLIHDD
ALSFKKKIQKAQIIGDEDKGNIKEVVKSLPGSPAIKKGILQSIKIVDELVKVMGGRKPESIVVEMARENQYTNQGKSNSQ
QRLKRLEKSLKELGSKILKENIPAKLSKIDNNALQNDRLYLYYLQNGKDMYTGDDLDIDRLSNYDIDHIIPQAFLKDNSI
DNKVLVSSASNRGKSDDFPSLEVVKKRKTFWYQLLKSKLISQRKFDNLTKAERGGLLPEDKAGFIQRQLVETRQITKHVA
RLLDEKFNNKKDENNRAVRTVKIITLKSTLVSQFRKDFELYKVREINDFHHAHDAYLNAVIASALLKKYPKLEPEFVYGD
YPKYNSFRERKSATEKVYFYSNIMNIFKKSISLADGRVIERPLIEVNEETGESVWNKESDLATVRRVLSYPQVNVVKKVE
EQNHGLDRGKPKGLFNANLSSKPKPNSNENLVGAKEYLDPKKYGGYAGISNSFAVLVKGTIEKGAKKKITNVLEFQGISI
LDRINYRKDKLNFLLEKGYKDIELIIELPKYSLFELSDGSRRMLASILSTNNKRGEIHKGNQIFLSQKFVKLLYHAKRIS
NTINENHRKYVENHKKEFEELFYYILEFNENYVGAKKNGKLLNSAFQSWQNHSIDELCSSFIGPTGSERKGLFELTSRGS
AADFEFLGVKIPRYRDYTPSSLLKDATLIHQSVTGLYETRIDLAKLGEG
>Q46899 ~~~casC~~~CRISPR system Cascade subunit CasC~~~COG1857
MSNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGESSLRTIHLAQLRDVLRQKLG
ERFDQKIIDKTLALLSGKSVDEAEKISADAVTPWVVGEIAWFCEQVAKAEADNLDDKKLLKVLKEDIAAIRVNLQQGVDI
ALSGRMATSGMMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDLQEQGSAHLGTQEFSSGVFYRYANINLAQLQENLG
GASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVMVNFSDMPLSMANAFEKAVKAKDGFLQPSIQAFNQYWDRVA
NGYGLNGAAAQFSLSDVDPITAQVKQMPTLEQLKSWVRNNGEA
>Q0S4D9 6.2.1.-~~~casG~~~Steroid-24-oyl-CoA synthetase~~~COG0318
MTAPTDPQLHLDQVMSRLTGPGGRFELVEEPVLGTRMPVMKNRGRSVGELLTTSLRWGDRDYLVTADRRMSYTEHAAAVA
ALATALREDYGVRKGDRVAILAANTPEWVVAFWATQVLGAISVGLNGWWVPREVEYGLTHSRPTVVVADAKRAETLAAVG
TDLPVLTMEEDLPALFARYAGSPMPHTDVDEDDPAAILYTSGTSGRPKGALHSQRNILAVVDYHRFSDAVVGEFSGRPVD
PAVPSPLRYLLTSPLFHIASLHNLVIPRLATGGAVVMHQGGFDVDAVLRLVERERVTNWGAVPTMASRLVEHDDLDKYDL
SSLTSFSLASAPSSVAFKERLREKVPFARNALVDSYGLTECSTAIAVATAPELEQFPGTLGRPIITVSMEIRDPYGEWLP
DGVEGEVCVRSPFVMLGYWEDEAATAAAIAPGRWLRTGDYGLVENGRLRLTGRRSDLILRGGENVYPTEIEQCLDEHPEV
LECAVIGTPHEDLGQEVAAVVVLRPGAAATEAELREYAADRLSYFKVPTRWRITTDLLPRNATGKMVRRDITV
>Q0S4D7 6.2.1.-~~~casI~~~Steroid-22-oyl-CoA synthetase~~~COG0318
MELERTTIARMLFDRLGDDRLGVRTREQDWTWDEVVRESAARGAVASSLRRDGPFHVGVLLENTPEFLFWLGGAALAGAA
VVGVNPTRRGAELEAEIRYVDCQLIVTDTAGKAQLAGLDLGLSEDRFLLVDDPAYTELVAAHAVESPAEDPGIDASTLFL
LLFTSGTTGTSKAVRCSQGRLARLAYANTAKYGHVREDVDYCCMPLFHGNALMALWAPALANGATVCLPRKFSASGFLPD
VRFFGATFFTYVGKALAYLMATPEQPDDRDNTLVRGFGTEASPEDKTEFVRRFGAELYEGYGSSEGAGSVTLDPDAPEGA
LGRPANENIVIVDPDTRVEKARARLDEHGRVLNPDEAIGEMVDKAGASRFEGYYKNEDAIADRIRHGWYWTGDLGYVDEA
GFIYFAGRKGDWIRVDGENTSALMVERILRRHPKVVATGVFAVPDPRSGDQVMAAVEVADPTDFDPAEFAAFLGNQDDLG
TKAAPRFVRVSRDLPVTGSNKVLKRTLQEQRWRCDDPVFRWVGRGVPEYHEMTDSEKAVLEQEFHTHGRQRFLHV
>P38946 2.8.3.-~~~cat1~~~Succinyl-CoA:coenzyme A transferase~~~COG0427
MSKGIKNSQLKKKNVKASNVAEKIEEKVEKTDKVVEKAAEVTEKRIRNLKLQEKVVTADVAADMIENGMIVAISGFTPSG
YPKEVPKALTKKVNALEEEFKVTLYTGSSTGADIDGEWAKAGIIERRIPYQTNSDMRKKINDGSIKYADMHLSHMAQYIN
YSVIPKVDIAIIEAVAITEEGDIIPSTGIGNTATFVENADKVIVEINEAQPLELEGMADIYTLKNPPRREPIPIVNAGNR
IGTTYVTCGSEKICAIVMTNTQDKTRPLTEVSPVSQAISDNLIGFLNKEVEEGKLPKNLLPIQSGVGSVANAVLAGLCES
NFKNLSCYTEVIQDSMLKLIKCGKADVVSGTSISPSPEMLPEFIKDINFFREKIVLRPQEISNNPEIARRIGVISINTAL
EVDIYGNVNSTHVMGSKMMNGIGGSGDFARNAYLTIFTTESIAKKGDISSIVPMVSHVDHTEHDVMVIVTEQGVADLRGL
SPREKAVAIIENCVHPDYKDMLMEYFEEACKSSGGNTPHNLEKALSWHTKFIKTGSMK
>P00485 2.3.1.28~~~cat~~~Chloramphenicol acetyltransferase~~~
MNFNKIDLDNWKRKEIFNHYLNQQTTFSITTEIDISVLYRNIKQEGYKFYPAFIFLVTRVINSNTAFRTGYNSDGELGYW
DKLEPLYTIFDGVSKTFSGIWTPVKNDFKEFYDLYLSDVEKYNGSGKLFPKTPIPENAFSLSIIPWTSFTGFNLNINNNS
NYLLPIITAGKFINKGNSIYLPLSLQVHHSVCDGYHAGLFMNSIQELSDRPNDWLL
>P22615 2.3.1.28~~~cmlA~~~Chloramphenicol acetyltransferase 2~~~
MNFTRIDLNTWNRREHFALYRQQIKCGFSLTTKLDITALRTALAETGYKFYPLMIYLISRAVNQFPEFRMALKDNELIYW
DQSDPVFTVFHKETETFSALSCRYFPDLSEFMAGYNAVTAEYQHDTRLFPQGNLPENHLNISSLPWVSFDGFNLNITGND
DYFAPVFTMAKFQQEGDRVLLPVSVQVHHAVCDGFHAARFINTLQLMCDNILK
>P00484 2.3.1.28~~~cat3~~~Chloramphenicol acetyltransferase 3~~~
MNYTKFDVKNWVRREHFEFYRHRLPCGFSLTSKIDITTLKKSLDDSAYKFYPVMIYLIAQAVNQFDELRMAIKDDELIVW
DSVDPQFTVFHQETETFSALSCPYSSDIDQFMVNYLSVMERYKSDTKLFPQGVTPENHLNISALPWVNFDSFNLNVANFT
DYFAPIITMAKYQQEGDRLLLPLSVQVHHAVCDGFHVARFINRLQELCNSKLK
>P06135 2.3.1.28~~~cat~~~Chloramphenicol acetyltransferase~~~
MTFNIIKLENWDRKEYFEHYFNQQTTYSITKEIDITLFKDMIKKKGYEIYPSLIYAIMEVVNKNKVFRTGINSENKLGYW
DKLNPLYTVFNKQTEKFTNIWTESDNNFTSFYNNYKNDLFEYKDKEEMFPKKPIPENTIPISMIPWIDFSSFNLNIGNNS
SFLLPIITIGKFYSENNKIYIPVALQLHHAVCDGYHASLFINEFQDIINKVDDWI
>P26841 2.3.1.28~~~cat~~~Chloramphenicol acetyltransferase~~~
MGNYFESPFRGKLLSEQVSNPNIRVGRYSYYSGYYHGHSFDDCARYLMPDRDDVDKLVIGSFCSIGSGAAFIMAGNQGHR
AEWASTFPFHFMHEEPVFAGAVNGYQPAGDTLIGHDVWIGTEAMFMPGVRVGHGAIIGSRALVTGDVEPYAIVGGNPART
IRKRFSDGDIQNLLEMAWWDWPLADIEAAMPLLCTGDIPALYRHWKQRQATA
>P36882 2.3.1.28~~~cat~~~Chloramphenicol acetyltransferase~~~
MTFNIIKLENWDRKEYFEHYFNQQTTYSITKEIDITLFKDMSKKKGYEIYPSLIYAIMEVVNKNKVFRTGINSENKLGYW
DKLNPLYTVFNKQTEKFTNIWTESDNNFTSFYNNYKNDLLEYKDKEEMFPKKPIPENTLPISMIPWIDFSSFNLNIGNNS
NFLLPIITIGKFYSENNKIYIPVALQLHHAVCDGYHASLFINEFQDIIKKVDDWI
>O33948 1.13.11.1~~~catA1~~~Catechol 1,2-dioxygenase 1~~~
MSIKVFGTKEVQDLLKAATNLEGKGGNARSKQIVHRLLSDLFKAIDDLDITPDEVWAGVNYLNKLGQDGEATLLAAGSGL
EKYLDIRLDAADKAEGIEGGTPRTIEGPLYVAGATVHDGVSKIDINPDEDAGPLVIHGTVTGPDGKPVAGAVVECWHANS
KGFYSHFDPTGAQSDFNLRGAVKTGADGKYEFRTLMPVGYGCPPQGATQQLLNVLGRHGNRPAHVHFFVSSDSARKLTTQ
FNIEGDPLIWDDFAYATREELIPPVTEKKGGTALGLKADTYKDIEFNLTLTSLVKGKDNQVVHRLRAEVAA
>O33950 1.13.11.1~~~catA2~~~Catechol 1,2-dioxygenase 2~~~
MNKQAIDALLQKINDSAINEGNPRTKQIVNRIVRDLFYTIEDLDVQPDEFWTALNYLGDAGRSGELGLLAAGLGFEHFLD
LRMDEAEAKAGVEGGTPRTIEGPLYVAGAPVSDGHARLDDGTDPGQTLVMRGRVFGEDGKPLANALVEVWHANHLGNYSY
FDKSQPAFNLRRSIRTDAEGKYSFRSVVPVGYSVPPQGQTQLLLDQLGRHGHRPAHIHFFVSAPGFRKLTTQINIDGDPY
LWDDFAFATRDGLVPAVRQAEVRKANRTAWTVSSR
>P07773 1.13.11.1~~~catA~~~Catechol 1,2-dioxygenase~~~COG3485
MEVKIFNTQDVQDFLRVASGLEQEGGNPRVKQIIHRVLSDLYKAIEDLNITSDEYWAGVAYLNQLGANQEAGLLSPGLGF
DHYLDMRMDAEDAALGIENATPRTIEGPLYVAGAPESVGYARMDDGSDPNGHTLILHGTIFDADGKPLPNAKVEIWHANT
KGFYSHFDPTGEQQAFNMRRSIITDENGQYRVRTILPAGYGCPPEGPTQQLLNQLGRHGNRPAHIHYFVSADGHRKLTTQ
INVAGDPYTYDDFAYATREGLVVDAVEHTDPEAIKANDVEGPFAEMVFDLKLTRLVDGVDNQVVDRPRLAV
>Q43984 1.13.11.1~~~catA~~~Catechol 1,2-dioxygenase~~~
MMMNRQQIDSLVQQMNVATATGEVNLRVQQIVVRLLGDLFQAIEDLNMSQTELWKGLEYLTDAGQANELGLLAAGLGLEH
YLDLRADEADAKAGITGGTPRTIEGPLYVAGAPESVGFARMDDGSESAHVDALIIEGNVTDTAGQIIPNAKVEIWHANSL
GNYSFFDKSQSAFNLRRSIFTDTQGQYIAQTTMPVGYGCPPEGTTQALLNLLGRHGNRPSHVHYFVSAPGYRKLTTQFNI
EGDKYLWDDFAFATRDGLIATALDVTDLAKIKQYNLNKAFKHIKFNFQLVQDADQVPLQRLIVVE
>O68146 1.11.1.6~~~katA~~~Catalase~~~COG0753
MSKKLTTAAGCPVAHNQNVQTAGKRGPQLLQDVWFLEKLAHFDREVIPERRMHAKGSGAYGTFTVTHDITKYTKAKLFSE
IGKQTELFARFTTVAGERGAADAERDIRGFALKFYTEEGNWDLVGNNTPVFFLRDPLKFPDLNHAVKRDPRTNMRSAKNN
WDFWTSLPEALHQVTIVMSDRGIPATYRHMHGFGSHTFSFINSDNERFWVKFHFKSQQGIKNLSDAEAAQVIGQDRESHQ
RDLLESIDNQDFPKWTLKVQIMPEADAATVPYNPFDLTKVWPHKDYPLIEVGEFELNRNPQNFFAEVEQSAFNPANVVPG
ISFSPDKMLQGRLFAYGDAQRYRLGVNHQHIPVNAPRCPVHSYHRDGAMRVDGNFGSTLGYEPNNEGQWAEQPDFAEPAL
NLDGAAAHWDHREDEDYFSQPGDLFRLMTAEQQAILFDNTARNLNGVPKEIQLRHLRHCYKADPAYGEGIGKLLDIDVSE
FN
>P45737 1.11.1.6~~~katA~~~Catalase~~~
MENKKLTAANGRPIADNQNSQTAGPRGPIMLQDPWLIEKLAHFDREVIPERRMHAKGSGAYGTFTVTHDITKYTRAAIFS
QVGKQTECFVRFSTVAGERGAADAERDIRGFAMKFYTEEGNWDLVGNNTPVFFLRDPLKFPDLNHAVKRDPRNNMRSANN
NWDFWTLLPEALHQVTITMSPRGIPASYRHMHGFGSHTYSFLNAENKRIWVKFHLKTMQGIKNLTDQEAEAIIAKDRESH
QRDLYESIERGDFPKWKFQIQLMTEEEADNYRINPFDLTKVWPHKDFPLQDVGILELNRNPENYFAEVEQSAFNPMNIVE
GIGFSPDKMLQGRLFSYGDAQRYRLGVNSEQIPVNKPRCPFHAFHRDGAMRVDGNYGSAKGYEPNSYGEWQDSPEKKEPP
LKVHGDVFNYNEREYDDDYYSQPGDLFRLMPADEQQLLFENTARAMGDAELFIKQRHVRNCYKADPAYGTGVAQALGIDL
EEALKE
>P26901 1.11.1.6~~~katA~~~Vegetative catalase~~~COG0753
MSSNKLTTSWGAPVGDNQNSMTAGSRGPTLIQDVHLLEKLAHFNRERVPERVVHAKGAGAHGYFEVTNDVTKYTKAAFLS
EVGKRTPLFIRFSTVAGELGSADTVRDPRGFAVKFYTEEGNYDIVGNNTPVFFIRDAIKFPDFIHTQKRDPKTHLKNPTA
VWDFWSLSPESLHQVTILMSDRGIPATLRHMHGFGSHTFKWTNAEGEGVWIKYHFKTEQGVKNLDVNTAAKIAGENPDYH
TEDLFNAIENGDYPAWKLYVQIMPLEDANTYRFDPFDVTKVWSQKDYPLIEVGRMVLDRNPENYFAEVEQATFSPGTLVP
GIDVSPDKMLQGRLFAYHDAHRYRVGANHQALPINRARNKVNNYQRDGQMRFDDNGGGSVYYEPNSFGGPKESPEDKQAA
YPVQGIADSVSYDHYDHYTQAGDLYRLMSEDERTRLVENIVNAMKPVEKEEIKLRQIEHFYKADPEYGKRVAEGLGLPIK
KDS
>P0A323 1.11.1.6~~~katA~~~Catalase~~~COG0753
MNAMTNKTLTTAAGAPVADNNNTMTAGPRGPALLQDVWFLEKLAHFDRERIPERVVHAKGSGAYGTFTVTHDISRYTRAR
IFAEVGKQTPLFLRFSTVAGERGAADAERDVRGFAIKFYTDEGNWDLVGNNTPVFFIRDPLKFPDFIHTQKRDPKTNLRN
ATAAWDFWSLNPESLHQVTILMSDRGLPQNYRQQHGFGSHTYSFVNDAGERFYVKFHFKSQQGIACYTDGEAAELVGRDR
ESAQRDLFQNIEQGQFPRWTLKVQVMPEAEAATYHINPFDLTKVWPHADYPLIEVGVLELNKNPENYFAEVEQAAFTPAN
VVPGIGFSPDKMLQGRLFSYGDTHRYRLGINHHQIPVNAPRCPFHSFHRDGMGRVDGNGGATLNYEPNSFGEWREAKHAA
EPPLALDGQAADRWNHRVDEDYYSQPGALFRLMNDDQKQQLFGNIGRHMAGVPEEIQRRQLEHFRRADPAYAAGVAKALG
LK
>P0A327 1.11.1.6~~~katA~~~Catalase~~~
MTDRPIMTTSAGAPIPDNQNSLTAGERGPILMQDYQLIEKLSHQNRERIPERAVHAKGWGAYGTLTITGDISRYTKAKVL
QPGAQTPMLARFSTVAGELGAADAERDVRGFALKFYTQEGNWDLVGNNTPVFFVRDPLKFPDFIHTQKRHPRTHLRSATA
MWDFWSLSPESLHQVTILMSDRGLPTDVRHINGYGSHTYSFWNDAGERYWVKFHFKTMQGHKHWTNAEAEQVIGRTREST
QEDLFSAIENGEFPKWKVQVQIMPELDADKTPYNPFDLTKVWPHADYPPIDIGVMELNRNPENYFTEVENAAFSPSNIVP
GIGFSPDKMLQARIFSYADAHRHRLGTHYESIPVNQPKCPVHHYHRDGQMNVYGGIKTGNPDAYYEPNSFNGPVEQPSAK
EPPLCISGNADRYNHRIGNDDYSQPRALFNLFDAAQKQRLFSNIAAAMKGVPGFIVERQLGHFKLIHPEYEAGVRKALKD
AHGYDANTIALNEKITAAE
>Q59337 1.11.1.6~~~katA~~~Catalase~~~COG0753
MSDENNKGVGTAVQGVGGPRDGRTAPGEQGTTLTTRQGHPVHDNQNSRTVGSRGPMTLENYQFIEKLSHFDRERIPERVV
HARGVGAHGVFRATGKVGDEPVSKYTRAKLFQEDGKETPVFVRFSTVGHGTHSPETLRDPRGFAVKFYTEDGNWDLVGNN
LKIFFIRDALKFPDLIHSQKPSPTTNIQSQERIFDFFAGSPEATHMITLLYSPWGIPASYRFMQGSGVNTYKWVNDQGEG
VLVKYHWEPVQGVRNLTQMQADEVQATNFNHATQDLHDAIERGDFPQWDLFVQIMEDGEHPELDFDPLDDTKIWPREQFP
WRHVGQMTLNRNPENVFAETEQAAFGTGVLVDGLDFSDDKMLQGRTFSYSDTQRYRVGPNYLQLPINAPKKHVATNQRDG
QMAYRVDTFEGQDQRVNYEPSLLSGPKEAPRRAPEHTPRVEGNLVRAAIERPNPFGQAGMQYRNFADWERDELVSNLSGA
LAGVDKRIQDKMLEYFTAADADYGQRVREGIQAKEAEMKGQKQEAPVYGTEASSLY
>P44390 1.11.1.6~~~katA~~~Catalase~~~COG0753
MSSQCPFSHLAATNLTMGNGAPVADNQNSLTAGPRGPLLAQDLWLNEKLADFVREVIPERRMHAKGSGAFGTFTVTHDIT
KYTRAKIFSEVGKKTEMFARFTTVAGERGAADAERDIRGFALKFYTEEGNWDLVGNNTPVFFLRDPRKFPDLNKAVKRDP
RTNMRSATNNWDFWTLLPEALHQVTVVMSDRGIPASYRHMHGFGSHTYSFWNEAGERFWVKFHFRTQQGIKNLTDAEAAE
IIANDRESHQRDLYEAIERGDFPKWTLFVQIMPEADAEKVPYHPFDLTKVWSKKDYPLIEVGEFELNRNPENFFADVEQS
AFAPSNLVPGIGASPDRMLQARLFNYADAQRYRLGVNYRQIPVNRPRCPVHSNQRDGQGRVDGNYGSLPHYEPNSFSQWQ
QQPDFAEPPLRINGDAAHWDYRNDDNDYFSQPRALFNLMNAEQKQSLFNNTAAAMGDAPDFIKYRHIRNCHWCDAAYGEG
VAKALGLTVEDALKARDTDPALGQGGLL
>P77872 1.11.1.6~~~katA~~~Catalase~~~COG0753
MVNKDVKQTTAFGAPVWDDNNVITAGPRGPVLLQSTWFLEKLAAFDRERIPERVVHAKGSGAYGTFTVTKDITKYTKAKI
FSKVGKKTECFFRFSTVAGERGSADAVRDPRGFAMKYYTEEGNWDLVGNNTPVFFIRDAIKFPDFIHTQKRDPQTNLPNH
DMVWDFWSNVPESLYQVTWVMSDRGIPKSFRHMDGFGSHTFSLINAKGERFWVKFHFHTMQGVKHLTNEEAAEVRKYDPD
SNQRDLFNAIARGDFPKWKLSIQVMPEEDAKKYRFHPFDVTKIWYLQDYPLMEVGIVELNKNPENYFAEVEQAAFSPANV
VPGIGYSPDRMLQGRLFSYGDTHRYRLGVNYPQIPVNKPRCPFHSSSRDGYMQNGYYGSLQNYTPSSLPGYKEDKSARDP
KFNLAHIEKEFEVWNWDYRADDSDYYTQPGDYYRSLPADEKERLHDTIGESLAHVTHKEIVDKQLEHFKKADPKYAEGVK
KALEKHQKMMKDMHGKDMHHTKKKK
>P29422 1.11.1.6~~~katA~~~Catalase~~~
MEHQKTTPHATGSTRQNGAPAVSDRQSLTVGSEGPIVLHDTHLLETHQHFNRMNIPERRPHAKGSGAFGEFEVTEDVSKY
TKALVFQPGTKTETLLRFSTVAGELGSPDTWRDVRGFALRFYTEEGNYDLVGNNTPIFFLRDPMKFTHFIRSQKRLPDSG
LRDATMQWDFWTNNPESAHQVTYLMGPRGLPRTWREMNGYGSHTYLWVNAQGEKHWVKYHFISQQGVHNLSNDEATKIAG
ENADFHRQDLFESIAKGDHPKWDLYIQAIPYEEGKTYRFNPFDLTKTISQKDYPRIKVGTLTLNRNPENHFAQIESAAFS
PSNTVPGIGLSPDRMLLGRAFAYHDAQLYRVGAHVNQLPVNRPKNAVHNYAFEGQMWYDHTGDRSTYVPNSNGDSWSDET
GPVDDGWEADGTLTREAQALRADDDDFGQAGTLVREVFSDQERDDFVETVAGALKGVRQDVQARAFEYWKNVDATIGQRI
EDEVKRHEGDGIPGVEAGGEARM
>Q27710 1.11.1.6~~~cat~~~Catalase~~~
MSQNKTLTTASGPPVADNQNSRSAGPRGPLLLDDFHLIEKLAHFNRENIPERRVHAKGSGAYGTFTVTQDITQYTSAKLF
DSVGKQTPTFLRFSTVGGERGSADTERDPRGFALKFYTEEGNWDIVGNNTPVFFIRDPLKFPDFIHTQKRLPQSNLKSAQ
MMWDFWSHSPEALHQVTILFSDRGIPDGYRHMHGFGSHTYSLINAKGERHWVKWHYKTKQGIKNLAPADAARLAGTDPDY
AQRDLFGAIERGDFPKWRVCIQIMTEAQANAHYENPFDVTKTWSQKEFPLIEVGELELNRNPLNYFAEVEQAAFGPSNMV
PGVGLSPDRMLQGRVFAYADAHRYRVGTNHQQLPVNAPRSPVNSYQRDGSMAFGSNGGAAPNYEPNSYADAPKQAPQYAE
PALALSGAADRYDHREDTDYYSHAGALFRLMNDEQKALLINNIAGAMAGVSSDVVQRQLQYFFKADPAYGEGIASALGVS
LN
>P42321 1.11.1.6~~~katA~~~Catalase~~~
MEKKKLTTAAGAPVVDNNNVITAGPRGPMLLQDVWFLEKLAHFDREVIPERRMHAKGSGAFGTFTVTHDITKYTRAKIFS
EVGKKTEMFARFSTVAGERGAADAERDIRGFALKFYTEEGNWDMVGNNTPVFYLRDPLKFPDLNHIVKRDPRTNMRNMAY
KWDFFSHLPESLHQLTIDMSDRGLPLSYRFVHGFGSHTYSFINKDNERFWVKFHFRCQQGIKNLMDDEAEALVGKDRESS
QRDLFEAIERGDYPRWKLQIQIMPEKEASTVPYNPFDLTKVWPHADYPLMDVGYFELNRNPDNYFSDVEQAAFSPANIVP
GISFSPDKMLQGRLFSYGDAHRYRLGVNHHQIPVNAPKCPFHNYHRDGAMRVDGNSGNGITYEPNSGGVFQEQPDFKEPP
LSIEGAADHWNHREDEDYFSQPRALYELLSDDEHQRMFARIAGELSQASKETQQRQIDLFTKVHPEYGAGVEKAIKVLEG
KDAK
>O52762 1.11.1.6~~~katA~~~Catalase~~~
MEEKTRLTTAAGAPVVDNQNVQTAGPRGPMLLQDVWFLEKLAHFDREVIPERRMHAKGSAAYGTFTVTHDITPYTRAKIF
SQVGKKTDMFLRFSTVAGERGAADAERDIRGFSMRFYTEQGNWDLVGNNTPVFYLRDPLKFPDLNHVVKRDPRTNLRNAT
FKWDFFSHLPESLHQLTIDFSDRGLPKSYRHIHGFGSHTFSFINANNERFWVKFHFKTQQGIENLTNAEAAEVIAQDRES
SQRDLYESIEKGDFPRWKMYVQIMPEKEAATYRYNPFDLTKVWPHGDYPLIEVGFFELNRNPDNYFAEVEQAAFTPANVV
PGIGFSPDKMLQGRLFSYGDAHRYRLGVNHHQIPVNAARCPHQVYHRDGGMRVDGNNAHQRVTYEPNSFNQWQEQPDFSE
PPLSLEGAADHWNHRVDDDYYSQPAALFHLFTDEQKQRLFANIAEDIRDVPEQIQRRQIGLFLKVDPAYGKGVADALGLK
LD
>Q2FYU7 1.11.1.6~~~katA~~~Catalase~~~COG0753
MSQQDKKLTGVFGHPVSDRENSMTAGPRGPLLMQDIYFLEQMSQFDREVIPERRMHAKGSGAFGTFTVTKDITKYTNAKI
FSEIGKQTEMFARFSTVAGERGAADAERDIRGFALKFYTEEGNWDLVGNNTPVFFFRDPKLFVSLNRAVKRDPRTNMRDA
QNNWDFWTGLPEALHQVTILMSDRGIPKDLRHMHGFGSHTYSMYNDSGERVWVKFHFRTQQGIENLTDEEAAEIIATDRD
SSQRDLFEAIEKGDYPKWTMYIQVMTEEQAKNHKDNPFDLTKVWYHDEYPLIEVGEFELNRNPDNYFMDVEQAAFAPTNI
IPGLDFSPDKMLQGRLFSYGDAQRYRLGVNHWQIPVNQPKGVGIENICPFSRDGQMRVVDNNQGGGTHYYPNNHGKFDSQ
PEYKKPPFPTDGYGYEYNQRQDDDNYFEQPGKLFRLQSEDAKERIFTNTANAMEGVTDDVKRRHIRHCYKADPEYGKGVA
KALGIDINSIDLETENDETYENFEK
>Q7A5T2 1.11.1.6~~~katA~~~Catalase~~~
MSQQDKKLTGVFGHPVSDRENSMTAGPRGPLLMQDIYFLEQMSQFDREVIPERRMHAKGSGAFGTFTVTKDITKYTNAKI
FSEIGKQTEMFARFSTVAGERGAADAERDIRGFALKFYTEEGNWDLVGNNTPVFFFRDPKLFVSLNRAVKRDPRTNMRDA
QNNWDFWTGLPEALHQVTILMSDRGIPKDLRHMHGFGSHTYSMYNDSGERVWVKFHFRTQQGIENLTDEEAAEIIATDRD
SSQRDLFEAIEKGDYPKWTMYIQVMTEEQAKNHKDNPFDLTKVWYHDEYPLIEVGEFELNRNPDNYFMDVEQAAFAPTNI
IPGLDFSPDKMLQGRLFSYGDAQRYRLGVNHWQIPVNQPKGVGIENICPFSRDGQMRVVDNNQGGGTHYYPNNHGKFDSQ
PEYKKPPFPTDGYGYEYNQRQDDDNYFEQPGKLFRLQSEDAKERIFTNTANAMEGVTDDVKRRHIRHCYKADPEYGKGVA
KALGIDINSIDLETENDETYENFEK
>P08310 5.5.1.1~~~catB~~~Muconate cycloisomerase 1~~~COG4948
MTSVLIERIEAIIVHDLPTIRPPHKLAMHTMQTQTLVLIRVRCSDGVEGIGEATTIGGLAYGYESPEGIKANIDAHLAPA
LVGLPADNINAAMLKLDKLAKGNTFAKSGIESALLDAQGKRLGLPVSELLGGRVRDSLEVAWTLASGDTARDIAEAQHML
EIRRHRVFKLKIGANPLAQDLKHVVAIKRELGDSASVRVDVNQYWDESQAIRACQVLGDNGIDLIEQPISRINRSGQVRL
NQRSPAPIMADESIESVEDAFSLAADGAASIFALKIAKNGGPRAVLRTAQIAEAAGIALYGGTMLEGSIGTLASAHAFLT
LRQLTWGTELFGPLLLTEEIVNEPPQYRDFQLHIPRTPGLGLTLDEQRLARFARR
>P46206 1.11.1.6~~~katB~~~Catalase~~~
MPLLNWSRHMVCLTAAGLITVPTVYATDTLTRDNGAVVGDNQNSQTAGAQGPVLLQDVQLLQKLQRFDRERIPERVVHAR
GTGVKGEFTASADISDLSKATVFKSGEKTPVFVRFSSVVHGNHSPETLRDPHGFATKFYTADGNWDLVGNNFPTFFIRDA
IKFPDMVHAFKPDPRTNLDNDSRRFDFFSHVPEATRTLTLLYSNEGTPAGYRFMDGNGVHAYKLVNAKGEVHYVKFHWKS
LQGIKNLDPKEVAQVQSKDYSHLTNDLVGAIKKGDFPKWDLYVQVLKPEELAKFDFDPLDATKIWPDVPEKKIGQMVLNK
NVDNFFQETEQVAMAPANLVPGIEPSEDRLLQGRVFSYADTQMYRLGAIGLSLPVNQPKVAVNNGNQDGALNTGHTTSGV
NYEPSRLEPRPADDKARYSELPLSGTTQQAKITREQNFKQAGDLFRSYSAKEKTDLVQRFGESLADTHTESKNIMLSVLY
KEDRHYGTRVAEVAKGDLSKVKSLAASLKD
>P95608 5.5.1.1~~~catB~~~Muconate cycloisomerase 1~~~COG4948
MTDLSIVSVETTILDVPLVRPHKFATTSMTAQPLLLVAVTTAGGVTGYGEGVVPGGPWWGGESVETMQAIVERYIVPVLL
GRGVDEITGIMPDIERVVANARFAKAAVDVALHDAWARSLGVPVHTLLGGAFRKSVDVTWALGAAPAEEIIEEALDLVES
KRHFSFKLKMGALDPAVDTARVVQIAQALQGKAGVRIDVNARWDRLTALKYVPRLVEGGVELIEQPTPGEQLEVLAELNR
LVPVPVMADESVQTPHDALEVARRGAADVIALKTTKCGGLQKSREVVAIAKAAGIACHGATSIEGPIGTAASIHFACAEP
GIDFGTELFGPLLFSEELLQEPIRYADGQVFLPEGPGLGVELNMDAVKTWTRN
>P80573 5.3.3.4~~~catC~~~Muconolactone Delta-isomerase~~~COG4829
MLYLVRMDVNLPHDMPAAQADDIKAREKAYAQQLQHEGKWQQLYRVVGEYANYSIFDVGSHDELHTLLSGLPLFPYMKIH
VTPLAKHPSSIR
>P00948 5.3.3.4~~~catC~~~Muconolactone Delta-isomerase~~~COG4829
MLFHVKMTVKLPVDMDPAKATQLKADEKELAQRLQREGTWRHLWRIAGHYANYSVFDVSSVEACNDTLMQLPLFPYMDIE
VDGLCRHPSSIHSDDR
>P95609 5.3.3.4~~~catC~~~Muconolactone Delta-isomerase~~~COG4829
MALFHVRMDVAIPRDLDPKVRDETIAKEKAYSQELQRSGKWPEIWRIVGQYSNISIFDVESADELHEILWNLPLFPYMNI
EIMPLTKHGSDVK
>P54720 1.-.-.-~~~catD~~~Putative oxidoreductase CatD~~~COG2259
MNKSFEIGTLLLRVITGIIFFVHGLSKFQGMEGTIQFFGSIGLPSFMAYVIAAIELIGGVLVFFGLATRIVGVLFALTLI
GAIITVKLKAPFMGNAEFDYLLLLTSIHLALTGSRFLALDPFVFKGKKNGNVSA
>P21179 1.11.1.6~~~katE~~~Catalase HPII~~~COG0753
MSQHNEKNPHQHQSPLHDSSEAKPGMDSLAPEDGSHRPAAEPTPPGAQPTAPGSLKAPDTRNEKLNSLEDVRKGSENYAL
TTNQGVRIADDQNSLRAGSRGPTLLEDFILREKITHFDHERIPERIVHARGSAAHGYFQPYKSLSDITKADFLSDPNKIT
PVFVRFSTVQGGAGSADTVRDIRGFATKFYTEEGIFDLVGNNTPIFFIQDAHKFPDFVHAVKPEPHWAIPQGQSAHDTFW
DYVSLQPETLHNVMWAMSDRGIPRSYRTMEGFGIHTFRLINAEGKATFVRFHWKPLAGKASLVWDEAQKLTGRDPDFHRR
ELWEAIEAGDFPEYELGFQLIPEEDEFKFDFDLLDPTKLIPEELVPVQRVGKMVLNRNPDNFFAENEQAAFHPGHIVPGL
DFTNDPLLQGRLFSYTDTQISRLGGPNFHEIPINRPTCPYHNFQRDGMHRMGIDTNPANYEPNSINDNWPRETPPGPKRG
GFESYQERVEGNKVRERSPSFGEYYSHPRLFWLSQTPFEQRHIVDGFSFELSKVVRPYIRERVVDQLAHIDLTLAQAVAK
NLGIELTDDQLNITPPPDVNGLKKDPSLSLYAIPDGDVKGRVVAILLNDEVRSADLLAILKALKAKGVHAKLLYSRMGEV
TADDGTVLPIAATFAGAPSLTVDAVIVPCGNIADIADNGDANYYLMEAYKHLKPIALAGDARKFKATIKIADQGEEGIVE
ADSADGSFMDELLTLMAAHRVWSRIPKIDKIPA
>Q9X576 1.11.1.6~~~katE~~~Catalase C~~~COG0753
MAKKPSAPNNTKPATIHDQKATRGNGGELHQIAEGDTPVLTTAQGGPVADDQNSLRAGERGPTLIEDFHFREKIFHFDHE
RIPERVVHARGYGVHGFFETYESLAAYTRADLFQRPGERTPAFVRFSTVAGSKGSFDLARDVRGFAVKIYTKEGNWDLVG
NNIPVFFIQDAIKFPDVIHSVKPEPDREFPQAQSAHDNFWDFISLTPESMHMIMWVMSDRAIPRSFRFMEGFGVHTFRFV
NAKDESTFVKFHWKPKLGLQSVVWNEAVKINGADPDFHRRDMWQAIQSGNFPEWDLHVQLFDQDFADKFDFDILDPTKII
PEEVLPTKPVGRLVLDRMPENFFAETEQVAFMTQNVPPGIDFSDDPLLQGRNFSYLDTQLKRLGSPNFTHLPINAPKCPF
QHFQQDGHMAMRNPVGRVNYQPNSWGEGPRESPMKGFRHFPSEEQGPKLRIRAESFADHYSQARQFFISQTPPEQRHIAD
ALTFELSKVETPVIRERMVAHLLNIDETLGKKVGHALGLETMPKPADAAVATRQDLDPSPALSIIQRGPKRFEGRKLGIL
ATDGADGALLDALIAAVEKEKAAFELIAPKVGGFTASDGKRIAAHQMLDGGPSVLYDAVVLLPSAEAVTDLIDVATARDF
VADAFAHCKYIGYAGAAVPLLERAGIAELLDEGTIELTDAASAAAFLTEIGKLRVWGREPSVKLK
>Q8VPF3 2.8.3.6~~~catI~~~3-oxoadipate CoA-transferase subunit A~~~COG1788
MAELLTLREAVERFVNDGDTVALEGFTHLIPTAASHEIIRQGKKDLHLVRMTPDLVYDLLIGAGCARKLTFSWGGNPGVG
SLHRLRDAVEKGWPNALEIDEHSHADLANSYVAGASGLPFAVLRAYAGSDLPKVNPNIKFINCPFTGEQLAAVPSVRPDV
TVIHAQKADRKGNVLLWGILGVQKEAALAAKRCIVTVEEIVDDLNAPMNSCVLPTWALSAVCHVPGGSHPSYAHGYYERD
NRFYQAWDPIARDRETFTAWIDEYIRGTKDFSEFQAKIAEGK
>Q8VPF2 2.8.3.6~~~catJ~~~3-oxoadipate CoA-transferase subunit B~~~COG2057
MSAYSTNEMMTVAAARRLKNGAVCFVGIGLPSKAANLARLTSSPDVVLIYESGPIGAKPTVLPLSIGDGELAETADTVVP
TGEIFRYWLQGGRIDVGFLGAAQVDRFGNINTTVIGDYNKPKVRLPGAGGAPEIAGSAKEVLIILKQSHRTFVDKLAFIT
SVGHGEGGDHRKQLGLPGKGPVAIITDLCIMEPEAGSNEFIVTSLHPGVTREQVIENTGWAIRFAEQVKETAAPTEVELE
ALRALEARTAAAHGQQGGEE
>P07774 ~~~catM~~~HTH-type transcriptional regulator CatM~~~COG0583
MELRHLRYFVTVVEEQSISKAAEKLCIAQPPLSRQIQKLEEELGIQLFERGFRPAKVTEAGMFFYQHAVQILTHTAQASS
MAKRIATVSQTLRIGYVSSLLYGLLPEIIYLFRQQNPEIHIELIECGTKDQINALKQGKIDLGFGRLKITDPAIRRIVLH
KEQLKLAIHKHHHLNQFAATGVHLSQIIDEPMLLYPVSQKPNFATFIQSLFTELGLVPSKLTEIREIQLALGLVAAGEGV
CIVPASAMDIGVKNLLYIPILDDDAYSPISLAVRNMDHSNYIPKILACVQEVFATHHIRPLIE
>Q5WGA1 2.7.7.-~~~~~~CC-adding tRNA nucleotidyltransferase~~~COG0617
MSIWMEGFKAVRTLNAAGFEAYIVGGAVRDYLLGKDVDDVDIATQASPHQVADIFTKGVHINQEHKTVLIPGEKGPIEIT
TYKGETLAEDLQKRDFTINALAMTETREVIDPYGGRQDLQKRLLRSYDAQKRLSEDPLRMLRAARFISSLGFEADRQLVK
ETTVQKAALQRCARERVVVELEKLLKGMETEAAFAFLQETGAIHSLPGIQITDSQLAELMRLPKQQWDSGDRAWLEFAIC
TGGPSSMAALPLPKKRKQLVAAGLKAFEYRQTQQRWSDWQLYISGLAIAMQIEEIRAGRQLPSIQKEELAEQWSALPIKA
KSDLAITGRDLLHAKAAPGPWMKEALQAAEKAVVTKKCPNEKAAILAFLTMREDGK
>O67911 2.7.7.-~~~~~~CC-adding tRNA nucleotidyltransferase~~~COG0617
MENIEIVSSGKHTLHGLNFYLSYFDDVAKVLPREHYCFIVGGWVRDRILGEPVGYNIDVDFLTTADPVELAKNFAKRIGG
HFFVFEKRGFLIKRPTIASVVLHLPPYRYRFDFSPLKGKDLEKALIEDLKERDFTANAIAVNLDDVLSIGAKQTIVYDPT
GGIKDLEQGLLRPVSIENLKRDPVRVLRGFRIAIEKNLQLTEDFYEFVKEDPRIVLKSAVERITHELFKIMKEKTAHKVI
RELYEYGVLEAIIPEIGRLREVKDQGEHHIYPLDEHTLKTLEYLEQVIEDRAKYLSAELLENFGKKRVLGEFTDVELLKW
GALFHDIGKPQTFAVREGKVTFYEHDKVGAQIVREIGERLRWGDEATEFVAKLVRHHLRPFFLREAFKKGELKRRGMANF
WRECGDIAPHLFLLSIADAMASGDEEEDIKALMETIAELESFNRNEMKEEIQKPLLNGDEIMEILGIKPGKIVGILKKAL
LEAQIDGKVETKEEAIEFIKRSTKNLKPLDEG
>Q9RVP2 2.7.7.-~~~~~~CC-adding tRNA nucleotidyltransferase~~~COG0617
MATPDGEQVWAQLQPQDRAWLNDLSRRAGPDTELALVGGAVRDALLGQTPLDLDIVVAGQDGQGVEALALASGLPFVFHP
AFENATLTLPDGRGADLVRARREHYPQPGRNPEPLPGTLHDDLRRRDFGLNALALRLREDGAPELLDVVGGLRDLERREL
RPLHDRSLHEDASRLVRAARLAARLELHPAPELLAQVPDALALADDTPRLWAELKLLLAEPRPGQAARVLDGWGAGTLLP
GLPLLEALDVQQNAGTPVQPGTYAAAVLSAAPDAAALAERMALGERPAALLARALSDSYFAPGTPELQLRGLLRPESYLP
LTGREVVALGVAPGPAVGRALAHLAGLRQSGAVRSADEERTALRAYLGANPKAT
>Q74B57 2.7.7.-~~~~~~CC-adding tRNA nucleotidyltransferase~~~COG0617
MDHRLLSFISAPLPSLIASLARHGGFGAWFVGGCVRDALLARPSNDIDIVVGPGGEDLPRAVAARIGGSFFPLDEERGHA
RVVLKGEGASCDFAPLQGGTIAADLALRDFTINALAVSCGSDDLLDPLGGAADLAQRVIRACSAGAFAADPLRIVRAYRF
AAHLDFEIHAATLALIPDHAPLLATVAGERIRDELFRMLDLPHAVPYVLKMSCAGVTGAIFGADDLPADTAAGALDRVES
LCRDLSAFGTEAEPVRARLRQEVQPGITIRALAKLAAFLNGAGIPAGIASQRLMLGKAATRLLELLCSSARLTWPAPAAA
PDPHALFTLFCHREPAGCEQLILPLAEGILPEDRCRHLAAYLTRQHIPRGGRLLLTGDDIMILLGLPPGRQVGEAIELLR
AAQSTGEVRTRAEAQRYLAKKQLTTPEPLR
>Q9KC89 2.7.7.-~~~~~~CC-adding tRNA nucleotidyltransferase~~~COG0617
MDDVIKKGLSIVSELRDHGFEAYIVGGAVRDYHIGRKPKDVDVVTSASPEEIRTLYPHAFQINRQFQTLTVHLQKVAIEV
STLRGGSIEDDLCSRDFTINAMALAMNGDIIDPTGGKTDLENGVIRSFHPEARFKEDPLRMLRAPRFASELGFTVAKGTA
EAIKGSCSLLADVAVERVEKELTQLMIGTHRSSGWCLLHETGLYPFIPGVSLSKETVLRMKEISRSPGLLPADGFWAILY
LLENCSMKLPLAKEKKKRIRTIVHYVGERQNHSWNETMLYQASLSVATVVEQIRALFGQASVHEEELRQLWSSLPIQTRT
ELAVTGRDVMAHFQKKGGPWLADTLADIEEAVLLKHIENEKKSIIQWLEERRVES
>Q55428 2.7.7.-~~~~~~CC-adding tRNA nucleotidyltransferase~~~COG0617
MLCPVSHLADLRQQVPFDLALLPPQACLVGGAVRDALLGRRREYLDWDFVVPSGAIETASAIASRYRAGFVVLDKARHIA
RVVFAHGTVDFAQQEGMSLEQDLARRDFTVNAIAYNFQQNKLIDPMAGVGDLQRGQLKMVAAVNLADDPLRLLRAYRQAA
QLQFTLDPDTRTVLRELAPRIKTVAAERVQAEFNYLLGSPRGSQWLLAAWQDGILAHWFSHANLSSLNAIGCIDLAIAAI
KNQLTLVERQQFFQALGKKGIAIAKLASLVCADVKIAEGELQRLKYSRHELRSVQAILQGYPQLSCLENSPTVRQLYFFF
VELGKYLPHFVLYALAHCPHNYHSFIFELLTHYLNSGDRLAHPQPLITGKDLIDKLHIKPSPLIGQLLTEINIAHIEGKI
SNEQEALAYAQELGKS
>Q72K91 2.7.7.-~~~~~~CC-adding tRNA nucleotidyltransferase~~~COG0617
MAHMDFPFYTPKDAFPVGGAVRDLLLGRRPTDLDYAALDPEKAAEEAKRRLGGSLFPLDPKRGHYRLVVGERTLDFTPLE
GRLEEDLLRRDYRVNALLWKGGAVFGLKGVEEDLRRRLLVPVREENLYQDHLRSLRGVRLAATLGFGLPRRTREALGRHA
RFLQAHPEALPARERVKEELARLLLSPRAAFGLRLLERVGLLGVYLPELALLVGLHQGGVHHLPAWEHTLSAVFHLLWLW
PEAPLEARLAALFHDVGKPLTRRFDPEVGRFRFLGHAEVGAEIARASLFWLRFPKEVVERVAGLVRRHMDRLPEERKALR
RFFLRRQDLLPDLVYLMAADRLATRGVEREAWEVLGRYEEVLKDPLPQRPLLSGEEVMALLGLQEGPEVGRALKALLEAQ
AEGRVGTKEEARAFLLYWRGGREAQASGTPDHPH
>P00487 2.3.1.28~~~~~~Chloramphenicol acetyltransferase~~~
MFKQIDENYLRKEHFHHYMTLTRCSYSLVINLDITKLHAILKEKKLKVYPVQIYLLARAVQKIPEFRMDQVNDELGYWEI
LHPSYTILNKETKTFSSIWTPFDENFAQFYKSCVADIETFSKSSNLFPKPHMPENMFNISSLPWIDFTSFNLNVSTDEAY
LLPIFTIGKFKVEEGKIILPVAIQVHHAVCDGYHAGQYVEYLRWLIEHCDEWLNDSLHIT
>P62577 2.3.1.28~~~cat~~~Chloramphenicol acetyltransferase~~~
MEKKITGYTTVDISQWHRKEHFEAFQSVAQCTYNQTVQLDITAFLKTVKKNKHKFYPAFIHILARLMNAHPEFRMAMKDG
ELVIWDSVHPCYTVFHEQTETFSSLWSEYHDDFRQFLHIYSQDVACYGENLAYFPKGFIENMFFVSANPWVSFTSFDLNV
ANMDNFFAPVFTMGKYYTQGDKVLMPLAIQVHHAVCDGFHVGRMLNELQQYCDEWQGGA
>P58777 2.3.1.28~~~cat~~~Chloramphenicol acetyltransferase~~~
MEKKITGYTTVDISQWHRKEHFEAFQSVAQCTYNQTVQLDITAFLKTVKKNKHKFYPAFIHILARLMNAHPEFRMAMKDG
ELVIWDSVHPCYTVFHEQTETFSSLWSEYHDDFRQFLHIYSQDVACYGENLAYFPKGFIENMFFVSANPWVSFTSFDLNV
AAMDNFFAPVFTMGKYYTQGDKVLMPLAIQVHHAVCDGFHVGRMLNELQQYCDEWQGGA
>Q75XW3 ~~~~~~Ca(2+)/H(+) antiporter~~~
MLNKNTIFFGLLLFIPISLLGHWLHWDEVSIFLTASLAIIPLAAFMGEATEEIAIVVGPTLGGLLNATFGNATELILAFI
ALKSGLISVVKATITGSIISNLLLVMGFAMLLGGLRYKEQVFQSEVARVNASSMNLAVIAILLPTAVEHTSNGIGEETLQ
TLSVAVAIVLIIVYGLTLLFSMKTHSYLCEVGEIDQQSSSAMESGDKIPEKKPEDVNLWFWLGILLVVTITVAIESELLV
NSLESATESLGLTALFTGVILLPVIGNAAEHFTAVTVAMKNKMDLSLSVAVGSTLQIALFVAPVLVIAGWIMGQPMDLDF
NPFELVAVAVSVLIANSISSDGKSNWLEGSLLLATYIVIGLAFFFHPVVPEIG
>P74072 ~~~~~~Ca(2+)/H(+) antiporter~~~COG0387
MSTKSKIFLVLLVFCPLSFAAHWLGWGETTVFILAGLAIVPLAAFMGTATEEIAVVIGPNAGGLLNATFGNATELILAYI
ALKEGLIEVVKATLTGSIIGNLLLVMGFAVFLGGLRYKEQNFQPLAARLNASTMNLGVVAILLPTALQYTSTGVEETVLQ
NLSVAVAVVLIGVYLLSLVFSMGTHAYLYDVGVAENMEMPELGEDVSEPEPPTEEEKPNLWLWTGVLLVVTLGVAVESEL
LVGSLEVATESLGLTALFTGVIVLPIIGNAAEHATAVTVAMKDKMDLSMSVVMGSSLQIAFFVAPVLVIVGWAIGQPMDL
NFNPFELVAVLVAVLIVNSISSDGTSNWLEGILLLATYAIVALAFFFHPTLV
>P38582 ~~~~~~Putative carnobacteriocin-B2 immunity protein~~~
MDIKSQTLYLNLSEAYKDPEVKANEFLSKLVVQCAGKLTASNSENSYIEVISLLSRGISSYYLSHKRIIPSSMLTIYTQI
QKDIKNGNIDTEKLRKYEIAKGLMSVPYIYF
>A5JTM6 6.2.1.33~~~~~~4-chlorobenzoate--CoA ligase~~~
MQTVHEMLRRAVSRVPHRWAIVDAARSTFDICRTGETSRNEGSATARLWPQPARPLAVVSGNSVEAVIAVLALHRLQAVP
ALMNPRLKPAEISELVARGEMARAVVANDAGVMEAIRTRVPSVCVLALDDLVSGSRVPEVAGKSLPPPPCEPEQAGFVFY
TSGTTGLPKGAVIPQRAAESRVLFMATQAGLRHGSHNVVLGLMPLYHTIGFFAVLVAAMAFDGTYVVVEEFDAGNVLKLI
ERERVTAMFATPTHLDALTTAVEQAGARLESLEHVTFAGATMPDTVLERVNRFIPGEKVNIYGTTEAMNSLYMRAVRIAG
TVMRPGFFSEVRIVRVGGDVDDGCPTVKRASWRWRRRMRPFQATLTNLRLLQKSFRKAGTGRAICVRDGSGNIVVLGRVD
DMIISGGENIHPSEVERILAAAPGVAEVVVIGVKDERWGQSVVACVVLQPGASASAERLDAFCRASALADFKRPRRYVFL
DELPKSAMNKVLRRQLMQHVSATSSAAVVPAPAVKQRTYAPSGRAIAR
>O85078 3.8.1.7~~~fcbB1~~~4-chlorobenzoyl coenzyme A dehalogenase-1~~~
MSSNSDHHISVEHTDGVATIRFTRPSKHNAASGQLLLETLEALYRLESDDSVGAIVLTGEGAVFSAGFDLEEVPMGPASE
IQSHFRLKALYYHAVIHMLARIEKPTLAAINGPAVGGGLGMSLACDLAVCTDRATFLPAWMSIGIANDASSSFYLPRIVG
YRRAMEWLLTNRTLGADEAYEWGVVNRVFSEADFQSRVGEIARQLAAAPTHLQGLVKNRIQEGSSETLESCTEHEVQNVI
ASVGHPHFAERLAMFRSKEMRSSALAVDLDAVCGGR
>Q9LCU3 3.8.1.7~~~fcbB2~~~4-chlorobenzoyl coenzyme A dehalogenase-2~~~
MSSNSDHHISVEHTDGVATIRFTRPSKHNAASAQLLLETLEALYRLESDDSVGAIVLTGEGAVFSAGFDLEEVPMGPASE
IQSHFRLKALYYHAVIHMLARIEKPTLAAINGPAVGGGLGMSLACDLAVCTDRATFLPAWMSIGIANDASSSFYLPRIVG
YRRAMEWLLTNRTLGADEAYEWGVVNRVFSEADFQSRVGEIARQLAAAPTHLQGLVKNRIQEGSSETLESCTEHEVQNVI
ASVGHPHFAERLAMFRSKEMRSSALAVDLDAVCGGR
>A5JTM5 3.8.1.7~~~~~~4-chlorobenzoyl coenzyme A dehalogenase~~~
MYEAIGHRVEDGVAEITIKLPRHRNALSVKAMQEVTDALNRAEEDDSVGAVMITGAEDAFCAGFYLREIPLDKGVAGVRD
HFRIGALWWHQMIHKIIRVKRPVLAAINGVAAGGGLGISLASDMAICADSAKFVCAWHTIGIGNDTATSYSLARIVGMRR
AMELMLTNRTLYPEEAKDWGLVSRVYPKDDFREVAWKVARELAAAPTHLQVMAKERFHAGWMQPVEECTEFEIQNVIASV
THPHFMPCLTEFLDGHRADRPQVELPAGV
>P38578 ~~~cbnBA~~~Bacteriocin carnobacteriocin-A~~~
MNNVKELSIKEMQQVTGGDQMSDGVNYGKGSSLSKGGAKCGLGIVGGLATIPSGPLGWLAGAAGVINSCMK
>P38579 ~~~cbnBM1~~~Bacteriocin carnobacteriocin BM1~~~
MKSVKELNKKEMQQINGGAISYGNGVYCNKEKCWVNKAENKQAITGIVIGGWASSLAGMGH
>P38580 ~~~cbnB2~~~Bacteriocin carnobacteriocin B2~~~
MNSVKELNVKEMKQLHGGVNYGNGVSCSKTKCSVNWGQAFQERYTAGINSFVSGVASGAGSIGRRP
>P95648 ~~~cbbX~~~Protein CbbX~~~
MTDAATAPTSIDLRAEYEGSGAKEVLEELDRELIGLKPVKDRIRETAALLLVERARQKLGLAHETPTLHMSFTGNPGTGK
TTVALKMAGLLHRLGYVRKGHLVSVTRDDLVGQYIGHTAPKTKEVLKRAMGGVLFIDEAYYLYRPDNERDYGQEAIEILL
QVMENNRDDLVVILAGYADRMENFFQSNPGFRSRIAHHIEFPDYSDEELFEIAGHMLDDQNYQMTPEAETALRAYIGLRR
NQPHFANARSIRNALDRARLRQANRLFTASSGPLDARALSTMAEEDIRASRVFKGGLDSERRAAEALAR
>P95649 3.1.3.-~~~cbbY~~~Protein CbbY~~~
MIEAILFDVDGTLAETEELHRRAFNETFAALGVDWFWDREEYRELLTTTGGKERIARFLRHQKGDPAPLPIADIHRAKTE
RFVALMAEGEIALRPGIADLIAEAKRAGIRLAVATTTSLPNVEALCRACFGHPAREIFDVIAAGDMVAEKKPSPDIYRLA
LRELDVPPERAVALEDSLNGLRAAKGAGLRCIVSPGFYTRHEEFAGADRLLDSFAELGGLAGLDLTAPVA
>P86831 6.2.1.33~~~fcbA1~~~4-chlorobenzoate--CoA ligase~~~
MRTAFELVAWSAHRQPGAVALLDPESGHRLTYSELLKRIEGVATVLASRGVVRDELVATAMANTLDHAIILLALNRLGAI
PVIINPRLKADEMVQLIRRDNIRTVIRTVAEGKSGTPADIDGVEELTLSAEVLSEGLRIDGNATPAFEAPRPEDPAFVFY
TSGTTGLPKGVVIPHRAIEPRVLFMSTQAGLRFGGHNNLLGLMPIHHVIGFFGVFLGSLAFNGTWIPVTAFDPAQAVKWV
EELDVTCLFASPTHFDALLATSEFAPEKLKSVDSVIFAGAAINQSILKRLEKCLQVPIVDIYGTTETMNSLFNPDATQER
GLRPGYHSRVQFASVSESPSVALPAGVEGELVVDASADATFTHYLNNPEATAAKIVDGWYRTGDSGYVDDSGRVILTGRI
DDMINTGAENVHAEEVEQIISRHPAVVEAAVVGLPDTRWGEVVTAVVVVSEPLTADLLDQVCLDSELANFKRPRRYFVVN
ELPRNAAMKVSRRTLREYLGAHAADQPNPETGFIQFTIEESQ
>P86832 6.2.1.33~~~fcbA2~~~4-chlorobenzoate--CoA ligase~~~
MRTAFELVAWSAHRQPGAVALLDPESGHRLTYSELLKRIEGVATVLASRGVVRDELVATAMANTLDHAIILLALNRLGAI
PVIINPRLKADEMVQLIRRDNIRTVIRTVAEGKSGTPADIDGVEELTLSAEVLSEGLRIDGNATPAFEAPRPEDPAFVFY
TSGTTGLPKGVVIPHRAIEPRVLFMSTQAGLRFGGHNNLLGLMPIHHVIGFFGVFLGSLAFNGTWIPVTAFDPAQAVKWV
EELDVTCLFASPTHFDALLATSEFAPEKLKSVDSVIFAGAAINQSILKRLEKCLQVPIVDIYGTTETMNSLFNPDATQER
GLRPGYHSRVQFASVSESPSVALPAGVEGELVVDASADATFTHYLNNPEATAAKIVDGWYRTGDSGYVDDSGRVILTGRI
DDMINTGAENVHAEEVEQIISRHPAVVEAAVVGLPDTRWGEVVTAVVVVSEPLTADLLDQVCLDSELANFKRPRRYFVVN
ELPRNAAMKVSRRTLREYLGAHAADQPNPETGFIQFTIEESQ
>Q51601 1.14.12.13~~~cbdA~~~2-halobenzoate 1,2-dioxygenase large subunit~~~
MSTPLIAGTGPSAVRQLISNAVQNDPVSGNFRCRRDIFTDAALFDYEMKYIFEQNWVFLAHESQVANPDDYLVSNIGRQP
VIITRNKAGDVSAVINACSHRGAELCRRKQGNRSTFTCQFHGWTFSNTGKLLKVKDGQDDNYPEGFNVDGSHDLTRIPSF
ANYRGFLFGSMNPDACPIEEHLGGSKAILDQVIDQTPGELEVLRGSSSYIYDGNWKLQIENGADGYHVGSVHWNYVATIG
RRDRTSDTIRTVDVTTWSKKNIGGTYTFEHGHMLLWTRLPNPEVRPVFARREELKARVGEEVADAIVNQTRNLCIYPNLY
VMDQISTQIRVVRPISVDKTEVTIYCFAPRDESEEVRNARIRQYEDFFNVSGMGTPDDLEEFRACQSGYRGSAREWNDLS
RGAPHWISGPDDNARRLGLAPLMSGARMEDEGLFVQQHTYWAETMLRGIEAEPKVFNVQPVEVAQ
>Q51602 1.14.12.13~~~cbdB~~~2-halobenzoate 1,2-dioxygenase small subunit~~~
MTSLESSYLDVVAFIFREARLLDDRSWDEWLECYDPEAVFWMPCWDDADTLVDDPRKHVSLIYYSDRMGLEDRVFRLRSE
RSGASTPEPRTTHNIANVEILERTERQIEARFNWHTMNYRYKLLDHYFGTSFYTLKVSSSGLSILNKKVVLKNDLIHQVI
DVYHV
>Q51603 ~~~cbdC~~~2-halobenzoate 1,2-dioxygenase electron transfer component~~~
MLHSIALRFEDDVTYFITSSEHETVADAAYQHGIRIPLDCRNGVCGTCKGFCEHGEYDGGDYIEDALSADEAREGFVLPC
QMQARTDCVVRILASSSACQVKKSTMTGQMTEIDRGSSSTLQFTLAIDPSSKVDFLPGQYAQLRIPGTTESRAYSYSSMP
GSSHVTFLVRDVPNGKMSGYLRNQATITETFTFDGPYGAFYLREPVRPILMLAGGTGLAPFLSMLQYMAGLQRNDLPSVR
LVYGVNRDDDLVGLDKLDELATQLSGFSYITTVVDKDSAQLRRGYVTQQITNDDMNGGDVDIYVCGPPPMVEAVRSWLAA
EKLNPVNFYFEKFAPTVGN
>A3DHD2 ~~~~~~Carbohydrate-binding domain-containing protein Cthe_2159~~~
MSIKKLILAASILTTLALTGCGGKGAVQPSGVSTGDVNAKIVFDNDKVNADNVDGLSVSEREVKITKPGMYTFSGTWNDG
QILVDIGKEFEAVLVLDGVNITNTKSAPIYIKSAEKVKIELADGKDNVLTDAEFYEFEDPQDNKPNACIYSRDDITIKGN
GNLTVNANFNNGIGTSNDLKITGGNITVKAFNNGLKGNDSVTISGGNIDITAEADGIKVENTEEPHKGYVNITGGTIKIR
AKDDAIDSVRSVSINNADVKVSVGGKDVKCEGVLNIAEGCLGKLEE
>P76364 ~~~cbeA~~~Cytoskeleton bundling-enhancing antitoxin CbeA~~~
MSDTLPGTTLPDDNHDRPWWGLPCTVTPCFGARLVQEGNRLHYLADRAGIRGLFSDADAYHLDQAFPLLMKQLELMLTSG
ELNPRHQHTVTLYAKGLTCKADTLSSCDYVYLAVYPTPEMKN
>Q0PAS1 5.2.1.8~~~cbf2~~~Putative peptidyl-prolyl cis-trans isomerase Cbf2~~~COG0760
MKKFSLVAATLIAGVVLNVNAATVATVNGKSISDTEVSEFFAPMLRGQDFKTLPDNQKKALIQQYIMQDLILQDAKKQNL
EKDPLYTKELDRAKDAILVNVYQEKILNTIKIDAAKVKAFYDQNKDKYVKPARVQAKHILVATEKEAKDIINELKGLKGK
ELDAKFSELAKEKSIDPGSKNQGGELGWFDQSTMVKPFTDAAFALKNGTITTTPVKTNFGYHVILKENSQAKGQIKFDEV
KQGIENGLKFEEFKKVINQKGQDLLNSAKVEYK
>Q9KK62 3.5.1.-~~~bsh~~~Conjugated bile acid hydrolase~~~
MCTGVRFSDDEGNTYFGRNLDWSFSYGETILVTPRGYHYDTVFGAGGKAKPNAVIGVGVVMADRPMYFDCANEHGLAIAG
LNFPGYASFVHEPVEGTENVATFEFPLWVARNFDSVDEVEETLRNVTLVSQIVPGQQESLLHWFIGDGKRSIVVEQMADG
MHVHHDDVDVLTNQPTFDFHMENLRNYMCVSNEMAEPTSWGKASLTAWGAGVGMHGIPGDVSSPSRFVRVAYTNAHYPQQ
NDEAANVSRLFHTLGSVQMVDGMAKMGDGQFERTLFTSGYSSKTNTYYMNTYDDPAIRSYAMADYDMDSSELISVAR
>P54965 3.5.1.-~~~cbh~~~Conjugated bile acid hydrolase~~~
MCTGLALETKDGLHLFGRNMDIEYSFNQSIIFIPRNFKCVNKSNKKELTTKYAVLGMGTIFDDYPTFADGMNEKGLGCAG
LNFPVYVSYSKEDIEGKTNIPVYNFLLWVLANFSSVEEVKEALKNANIVDIPISENIPNTTLHWMISDITGKSIVVEQTK
EKLNVFDNNIGVLTNSPTFDWHVANLNQYVGLRYNQVPEFKLGDQSLTALGQGTGLVGLPGDFTPASRFIRVAFLRDAMI
KNDKDSIDLIEFFHILNNVAMVRGSTRTVEEKSDLTQYTSCMCLEKGIYYYNTYENNQINAIDMNKENLDGNEIKTYKYN
KTLSINHVN
>Q06115 3.5.1.-~~~cbh~~~Conjugated bile acid hydrolase~~~COG3049
MCTAITYQSYNNYFGRNFDYEISYNEMVTITPRKYPLVFRKVENLDHHYAIIGITADVESYPLYYDAMNEKGLCIAGLNF
AGYADYKKYDADKVNITPFELIPWLLGQFSSVREVKKNIQKLNLVNINFSEQLPLSPLHWLVADKQESIVIESVKEGLKI
YDNPVGVLTNNPNFDYQLFNLNNYRALSNSTPQNSFSEKVDLDSYSRGMGGLGLPGDLSSMSRFVRAAFTKLNSLSMQTE
SGSVSQFFHILGSVEQQKGLCEVTDGKYEYTIYSSCCDMDKGVYYYRTYDNSQINSVSLNHEHLDTTELISYPLRSEAQY
YAVN
>P29946 6.3.5.11~~~cbiA~~~Cobyrinate a,c-diamide synthase~~~
MAARHHAFILAGTGSGCGKTTVTLGLLRLLQKRALRVQPFKVGPDYLDTGWHTAICGVASRNLDSFMLPPPVLNALFCEQ
MRQADIAVIEGVMGLYDGYGVDPNYCSTAAMAKQLGCPVILLVDGKAVSTSLAATVMGFQHFDPTLNLAGVIVNRVTSDA
HYQLLKNAIEHYCSLPVLGYVPPCDGVALPERHLGLITARESLVNQQSWHDFAATLEQTVDVDALLSLSVLSALPAGMWP
ERPDNTAGAGLTLALADDEAFNFYYPDNIDLLERAGVNIVRFSPLHDRALPDCQMIWLGGGYPELYAADLAANTVMLKHL
RAAHQRGVAIYAECGGLMYLGSTLEDSGGEIHQMANIIPGHSKMGKRLTRFGYCEAQAMQPTLLAAPGEIVRGHEFHYSD
FIPETPAVMACRKVRDGRVLQEWTGGWQTGNTFASYLHVHFAQRPEMLQHWLAAARRVL
>Q8EXP7 5.4.99.60~~~cbiC~~~Cobalt-precorrin-8 methylmutase~~~
MRQITNLGRNIENKSFSIIDEEAGPHSFAQEEWEVVRRIIHATADFDYKNITKIHPQAIDSGIQALKKGCPIVCDVQMIL
SGLNPERLKVYGCKTYCFISDEDVIENAKRKNSTRAIESIQKANSFNLLNESIIVIGNAPTALLEIEKLIRQEGIKPALI
VGVPVGFVSAKESKESILKLEYYNVTSIPYILTMGRKGGSTIAVAILHALLLLSSKRGER
>O87692 5.4.99.60~~~cbiC~~~Cobalt-precorrin-8 methylmutase~~~
MDFRTEFKPLTVQPQQIEGKSFEMITEELGPHPFTDEQYPIVQRVIHRSADFELGRSMLFHPDAIQAGIKAIRSGKQVVA
DVQMVQVGTNKQRIEKHGGEIKVYISDSDVMEEAKRLNTTRAIISMRKAIKEADGGIFAIGNAPTALLELIRLIKEGEAK
PGLVIGLPVGFVSAAESKEELAKLYVPFITNIGRKGGSTVTVAALNAISILADSGVTYEGSAKRT
>Q05601 5.4.99.60~~~cbiC~~~Cobalt-precorrin-8 methylmutase~~~
MHYIQQPQTIEANSFTIISDIIRETRPDYRFASPLHEAIIKRVIHTTADFDWLDILWFSADALEQLCDALRHPCIIYTDT
TMALSGINKRLLATFGGECRCYISDPRVVRAAQTQGITRSMAAVDIAIAEEEKNKLFVFGNAPTALFRLLEHNVTVSGVV
GVPVGFVGAAESKEALTHSHFPAVAALGRKGGSNVAAAIVNALLYHLREA
>O87693 2.1.1.195~~~cbiD~~~Cobalt-precorrin-5B C(1)-methyltransferase~~~
MKEVPKEPKKLREGYTTGACATAATRAALLTLISGEVQDESTIYLPVGRFATFHLEECEYRTSSAVASIIKDAGDDPDAT
HGALIISEVSWCNGVDIIIDGGVGVGRVTKPGLPVPVGEAAINPVPRKMLKETAQQLLAEYNIQKGVKVVISVPEGEEMA
KKTLNARLGILGGISILGTRGIVVPFSTAAYKASIVQAISVAKASNCEHVVITTGGRSEKYGMKQFPELPEEAFIQMGDF
VGFTLKQCKKQGMKKVSLVGMMGKFSKVAQGVMMVHSKSAPIDFNFLAKAASESGASAELVEEIKGANTASQVGDLMTQS
GHHQFFEKLCEYCCLSALKEVGDGIDVDTSLYTLKGDFLGQAVQHGN
>O87694 ~~~cbiET~~~Cobalamin biosynthesis bifunctional protein CbiET~~~
MAIKIIGIGDDGKLSLLPMYEQWIYESDVLIGGKRHLDFFQDFQGEKVAIEGGLSSLVERLKNEEGNAVVLASGDPLFYG
IGSYLSTKLDVEIYPYLSSIQLAFSRLKERWQDAYFTSVHGRSIKGLAQRIDGYKKVAILTDEQNSPTALANYLLSFGMT
EYKMFVAENLGGETERCQLLSLEEAANQFFSPLNVVILKQVEESPVWPLGIEDDEFIQRKPDKGLITKKEIRTLSISALQ
LKRDSVVWDIGTCTGSVAIEAAKIAREGQIFAVEKNEADLENCRENLAKFRVDAHTVHGKAPEGLNEFADPDAVFIGGTA
GGMETILDVCCSRLNSGGRIVLNAVTIENLAEAMKAFKERGFETAVTLAQISRSKPILHLTRFDALNPIYIITAKRGE
>P0A2H1 2.1.1.289~~~cbiE~~~Cobalt-precorrin-7 C(5)-methyltransferase~~~
MLTVVGMGPAGRHLMTPAALEAIDHADALAGGKRHLAQFPAFGGERFTLGADIGALLSWIAARRDKGIVVLASGDPLFYG
IGTRLVAHFGIEQVRIIPGISAVQYLCAQAGIDMNDMWLTSSHGRCVSFEQLANHRKVAMVTDARCGPREIARELVARGK
GHRLMVIGENLAMENERIHWLPVSAVNADYEMNAVVILDER
>O87696 2.1.1.271~~~cbiF~~~Cobalt-precorrin-4 C(11)-methyltransferase~~~
MKLYIIGAGPGDPDLITVKGLKLLQQADVVLYADSLVSQDLIAKSKPGAEVLKTAGMHLEEMVGTMLDRMREGKMVVRVH
TGDPAMYGAIMEQMVLLKREGVDIEIVPGVTSVFAAAAAAEAELTIPDLTQTVILTRAEGRTPVPEFEKLTDLAKHKCTI
ALFLSATLTKKVMKEFINAGWSEDTPVVVVYKATWPDEKIVRTTVKDLDDAMRTNGIRKQAMILAGWALDPHIHDKDYRS
KLYDKTFTHGFRKGVKSE
>P0A2G9 2.1.1.271~~~cbiF~~~Cobalt-precorrin-4 C(11)-methyltransferase~~~
MSETFDPRCVWFVGAGPGDRELITLKGYRLLQQAQVVIYAGSLINTELLDYCPAQAERYDSAELHLEQIIELMAAGVKAG
KTVVRLQTGDVSLYGSVREQGEELTRRGIDWQVVPGVSAFLGAAAELGVEYTVPEVSQSLIITRLEGRTPVPAREQLEAF
ASHQTSMAIYLSVQRIHRVAERLIAGGYPATTPVAVIYKATWPESQTVRGTLADISDKVRDAGIRKTALILVGNFLGKEY
HYSRLYAADFSHEYRKA
>O87697 3.7.1.12~~~cbiG~~~Cobalt-precorrin-5A hydrolase~~~
MIQLEEGKKAPITQRGDYAVVAITKHGVEIARNLGRIFQQSDVYYMSKFEKGDEQEQNIQMFSGSVRMLLPSLFESYKGL
IIIISLGAVVRMIAPILKDKKTDPAVVVIDDKGENVISVLSGHIGGANELTREVAAALRAHPVITTASDVQKTIPVDLFG
KRFGWVWESAENVTPVSASVVNEEEIAVVQESGEKSWWHYEHPVPANIKTYSSIQTALEASPHAALVVTHRDLKKEEEAI
LENGVLYRPKVLAIGMGCNRGTSAAEIETVIEKTLAELQFSMKSVKALCTIELKKDEEGLLEVASKYGWEFVYYSPQELN
SISIQQPSDTVFKYTGAYGVSEPAAMLYSGADTLELVKKKSGNVTISVALIPYD
>O87689 2.1.1.272~~~~~~Cobalt-factor III methyltransferase~~~
MKGKLLVIGFGPGSFEHITQRAREAIQESDMIIGYKTYVELIQGLLTNQQIISTGMTEEVSRAQEAVKQAEAGKTVAVIS
SGDAGVYGMAGLVYEVLIEKGWKKETGVELEVIPGISAINSCASLLGAPVMHDACTISLSDHLTPWELIEKRIEAAAQAD
FVVAFYNPKSGRRTRQIVEAQRILLKYRSPDTPVGLVKSAYRDREEVVMTNLKDMLNHEIGMLTTVVVGNSSTFFYDDLM
ITPRGYQRKYTLNQTEQPLRPHQRLRKEAEPWALDQEEAVKQSASAIEAVQNTREETAASRALAEEALQAILGESTSAVV
HQPIESIFEVAVSPGLANKKFTPVQMTTLAEVVGEKGTMEYTPDHQIKLQIPTAHPDMIIEKLQAASFLLSPVGDVFTIK
ACDFCDGEKSDAIPHTEELQKRLGGMDMPKELKLGINGCGMACYGAVQEDIGIVYRKGAFDLFLGAKTVGRNAHSGQIVA
EGIAPDDIVEIVENIIHEYKEKGHPNERFHKFFKRVKNVYGFDYQDITPKIKVEPAPCGD
>Q05590 2.1.1.-~~~cbiH~~~Probable cobalt-factor III C(17)-methyltransferase~~~
MLSVIGIGPGSQAMMTMEAIEALQAAEIVVGYKTYTHLVKAFTGDKQVIKTGMCREIERCQAAIELAQAGHNVALISSGD
AGIYGMAGLVLELVSKQKLDVEVRLIPGMTASIAAASLLGAPLMHDFCHISLSDLLTPWPVIEKRIVAAGEADFVICFYN
PRSRGREGHLARAFDLLAASKSAQTPVGVVKSAGRKKEEKWLTTLGDMDFEPVDMTSLVIVGNKTTYVQDGLMITPRGYT
L
>O87691 1.3.1.106~~~cbiJ~~~Cobalt-precorrin-6A reductase~~~
MILLLAGTSDARALAVQVKKAGYDVTATVVTDNAAIELQRAEVKVKIGRLTKEDMTDFINEHGVKAIVDASHPFAEEASK
NAIGAAAETAIPYIRYERASQAFTYDNMTMVSTYEEAAEVAAEKKGVIMLTTGSKTLQVFTEKLLPLSDVRLVARMLPRL
DNMEKCQQLGLPQKNIIAIQGPFTKEFDRALYKQYGVTVMVTKESGKVGSVDKKVEAAKELGLDIIMIGRPKIEYGTVYS
TFEEVVHALVNQTRS
>Q72CB8 4.99.1.3~~~cbiKc~~~Sirohydrochlorin cobaltochelatase CbiKC~~~COG4822
MVPPRWGSLDSLKPQQHLPMTKKGILLAAFGSGNRQGESTLRLFDERVRERFPGVPVRWAFTSVIMRRRLAAARKKTDSV
LKALQKMWFEKYTHVAVQSLHIIPGAEYGDLVADVEAMRRDDGFTAATVGAPLLAGSGDMERSAAALLAHLPAGRKPDEA
VVFMGHGTRHPAESSYEALAALVRRVDPHVHIGTMGGSRTLDHILPELQQGGVKGVWLMPLLSVVGRHATEDMAGTDPES
WKSRLEASGLRCIPVLRGTAEYEGFVDIWLDHLTAAVSALDD
>Q72EC8 4.99.1.3~~~cbiKp~~~Sirohydrochlorin cobaltochelatase CbiKP~~~COG4822
MSRHPMVTRLLCLVFSCLIILACSPAFAGHGAPKAQKTGILLVAFGTSVEEARPALDKMGDRVRAAHPDIPVRWAYTAKM
IRAKLRAEGIAAPSPAEALAGMAEEGFTHVAVQSLHTIPGEEFHGLLETAHAFQGLPKGLTRVSVGLPLIGTTADAEAVA
EALVASLPADRKPGEPVVFMGHGTPHPADICYPGLQYYLWRLDPDLLVGTVEGSPSFDNVMAELDVRKAKRVWLMPLMAV
AGDHARNDMAGDEDDSWTSQLARRGIEAKPVLHGTAESDAVAAIWLRHLDDALARLN
>Q05592 4.99.1.3~~~cbiK~~~Sirohydrochlorin cobaltochelatase~~~
MKKALLVVSFGTSYHDTCEKNIVACERDLAASCPDRDLFRAFTSGMIIRKLRQRDGIDIDTPLQALQKLAAQGYQDVAIQ
SLHIINGDEYEKIVREVQLLRPLFTRLTLGVPLLSSHNDYVQLMQALRQQMPSLRQTEKVVFMGHGASHHAFAAYACLDH
MMTAQRFPARVGAVESYPEVDILIDSLRDEGVTGVHLMPLMLVAGDHAINDMASDDGDSWKMRFNAAGIPATPWLSGLGE
NPAIRAMFVAHLHQALNMAVEEAA
>Q05593 2.1.1.151~~~cbiL~~~Cobalt-precorrin-2 C(20)-methyltransferase~~~
MNGKLYALSTGPGAPDLITVRAARILGSLDILYAPAGRKGGDSLALSIVRDYLGEQTEVRCCHFPMSADGAEKEAVWNEV
AAALTAEVEAGKQVGFITLGDAMLFSTWIFLLQRIGCPEWLEIVPGVTSFAAIAARAKMPLAIERQSLAVISCTAPEAEI
AQALQQHDSLVLMKVYGRFARIKALLAQAGLLECALMMSEATLPGEQCWRHLHEVNDDRPLPYFSTILVNKQWEYAE
>D5AUZ9 ~~~cbiM~~~Cobalt transport protein CbiM~~~COG0310
MHIMEGYLPVTHAIGWSLAAGPFVVAGAVKIRKIVAERPEARMTLAASGAFAFVLSALKIPSVTGSCSHPTGTGLGAVVF
GPSVMAVLGVIVLLFQALLLAHGGLTTLGANAFSMAIVGPWVAWGVYKLAGKAGASMAVAVFLAAFLGDLATYVTTSLQL
ALAYPDPVSGFLGAALKFGSVFALTQIPLAIAEGFLTVIVVDALAGKVDDKDKLRILAGEAR
>Q05594 ~~~cbiM~~~Cobalt transport protein CbiM~~~
MKLEQQLRQLSFSGLAAALLLMVVPQQAFAMHIMEGFLPPVWALAWWLLFLPCLWYGLVRLRRIVQEDNHQKVLLALCGA
FIFVLSALKIPSVTGSCSHPTGVGLAVILFGPGVVAILGAVVLLFQALLLAHGGLTTLGANGMSMAVIGPVVGYLVWKMA
CRAGLRRDVAVFLCAMLADLATYFVTSVQLGVAFPDPHAGATGSVVKFMGIFCLTQIPVAIAEGLLTVMIYDQLTKRQVI
TVQGH
>O68104 ~~~cbiN~~~Cobalt transport protein CbiN~~~COG1930
MSSKRTLWLLAGTVALVVVPLLMGGEFGGADGQAAELIEATVPGFAPWADPLWEPPSGEVESLFFALQAALGAFVVGLVI
GRRQGAAKTREQNAPAPRSFPAE
>Q05595 ~~~cbiN~~~Cobalt transport protein CbiN~~~
MKKTLMLLAMVVALVILPFFINHGGEYGGSDGEAESQIQAIAPQYKPWFQPLYEPASGEIESLLFTLQGSLGAAVIFYIL
GYCKGKQRRDDRA
>O68106 7.2.2.-~~~cbiO~~~Cobalt import ATP-binding protein CbiO~~~COG1122
MTPILAAEALTYAFPGGVKALDDLSLAVPQGESLAILGPNGAGKSTLLLHLNGTLRPQSGRVLLGGTATGHSRKDLTDWR
RRVGLVLQDADDQLFAATVFEDVSFGPLNLGLSEAEARARVEEALAALSISDLRDRPTHMLSGGQKRRVAIAGAVAMRPE
VLLLDEPTAGLDLAGTEQLLTLLHGLRAAGMTLVFSTHDVELAAALADRVALFRTGRVLAEGAAAAVLSDRATLAQGGLR
PPLVIDLALSRARSRPFGPRSALPRTRDALAAQMAGWTRR
>Q05596 7.-.-.-~~~cbiO~~~Cobalt import ATP-binding protein CbiO~~~
MLATSDLWFRYQNEPVLKGLNMDFSLSPVTGLVGANGCGKSTLFMNLSGLLRPQKGAVLWQGKPLDYSKRGLLALRQQVA
TVFQDPEQQIFYTDIDSDIAFSLRNLGVPEAEITRRVDEALTLVDAQHFRHQPIQCLSHGQKKRVAIAGALVLQARYLLL
DEPTAGLDPAGRTQMIAIIRRIVAQGNHVIISSHDIDLIYEISDAVYVLRQGQILTHGAPGEVFACTEAMEHAGLTQPWL
VKLHTQLGLPLCKTETEFFHRMQKCAFREAS
>D5AUZ7 ~~~cbiQ~~~Cobalt transport protein CbiQ~~~COG0619
MSIASIDRVAAQGRWRNRPLAEKCLIGLGFLALAVTVPPFPGAVLVTVAILAFTFLGARVPLRFWAAVAVLPLGFLTTGA
AVLLIQIGPDGIGLAPQGPAKAAALVMRASAATCCLLFLATTTPAADLLSGLRRWRVPAELIEIALLTYRFVFILAEEAA
AMTTAQRARLGHATRRRWLRSTAQVIAALLPRALDRARRLETGLAARNWQGEMRVLSTRPAASPLVLGLILTLQAAILAA
GVLL
>Q05598 ~~~cbiQ~~~Cobalt transport protein CbiQ~~~
MTGLDRLSYQSRWAHVAPQRKFLLWLAMMILAFVLPPVGQGIELLIIAGLSCWLLRISLWRWCRWMAIPFGFLLVGVITI
IFSISREPQMLLAGISVGPYWIGITRAGVVTANETFWRSLTALSATLWLVMNLPFPQLISLLKRAHIPRLLTEQILLTWR
FLFILLDEAVAIRRAQTLRFGYCSLPNGYRSLAMLAGLLFTRVLMRYQQMTTTLDIKLYQGDFHL
>Q05632 2.1.1.196~~~cbiT~~~Cobalt-precorrin-6B C(15)-methyltransferase (decarboxylating)~~~
MKDELFLRGENVPMTKEAVRALALSKLELHRASHLIDVGAGTGSVSIEAALQFPSLQVTAIERNPAALRLLDENRQRFAC
GNIDILPGEAPMTITGKADAVFMGGSGGHLTALIDWAMGHLHPGGRLVMTFILQENLHSALAHLAHIGACRMDCVQLQLS
SLTPLGAGHYFKPNNPVFVIACQKEENHVRDI
>O87690 4.99.1.3~~~cbiX~~~Sirohydrochlorin cobaltochelatase~~~
MGGHYMKSVLFVGHGSRDPEGNDREFISTMKHDWDASILVETCFLEFERPNVSQGIDTCVAKGAQDVVVIPIMLLPAGHS
KIHIPAAIDEAKEKYPHVNFVYGRPIGVHEEALEILKTRLQESGENLETPAEDTAVIVLGRGGSDPDANSDLYKITRLLW
EKTNYKIVETSFMGVTAPLIDEGVERCLKLGAKKVVILPYFLFTGVLIKRLEEMVKQYKMQHENIEFKLAGYFGFHPKLQ
TILKERAEEGLEGEVKMNCDTCQYRLGIMEHIDHHHHHDHDHDHDHGHHHHDHHHDHHEDKVGELK
>Q55451 4.99.1.3~~~cbiX~~~Sirohydrochlorin cobaltochelatase~~~COG2138
MTLTSVPAPVSLFPELELPPLPYHRPLLMIGHGTRDEDGRQTFLDFVAQYQALDHSRPVIPCFLELTEPNIQAGVQQCVD
QGFEEISALPILLFAARHNKFDVTNELDRSRQAHPQINFFYGRHFGITPAILDLWKARLNQLDSPEANPQGIDRQDTVLL
FVGRGSSDPDANGDVYKMARMLWEGSGYQTVETCFIGISHPRLEEGFRRARLYQPKRIIVLPYFLFMGALVKKIFTITEE
QRATFPEIEIQSLSEMGIQPELLALVREREIETQLGQVAMNCEACKFRLAFKNQGHGHDHGHGHHHHGHDHGHSHGEWVD
TYIEPTAYHEKIWQAP
>Q08432 4.4.1.13~~~patB~~~Cystathionine beta-lyase PatB~~~COG1168
MNFDKREERLGTQSVKWDKTGELFGVTDALPMWVADMDFRAPEAITEALKERLDHGIFGYTTPDQKTKDAVCGWMQNRHG
WKVNPESITFSPGVVTALSMAVQAFTEPGDQVVVQPPVYTPFYHMVEKNGRHILHNPLLEKDGAYAIDFEDLETKLSDPS
VTLFILCNPHNPSGRSWSREDLLKLGELCLEHGVTVVSDEIHSDLMLYGHKHTPFASLSDDFADISVTCAAPSKTFNIAG
LQASAIIIPDRLKRAKFSASLQRNGLGGLNAFAVTAIEAAYSKGGPWLDELITYIEKNMNEAEAFLSTELPKVKMMKPDA
SYLIWLDFSAYGLSDAELQQRMLKKGKVILEPGTKYGPGGEGFMRLNAGCSLATLQDGLRRIKAALS
>Q93QC6 4.4.1.13~~~metC~~~Cystathionine beta-lyase~~~
MRFPELEELKNRRTLKWTRFPEDVLPLWVAESDFGTCPQLKEAMADAVEREVFGYPPDATGLNDALTGFYERRYGFGPNP
ESVFAIPDVVRGLKLAIEHFTKPGSAIIVPLPAYPPFIELPKVTGRQAIYIDAHEYDLKEIEKAFADGAGSLLFCNPHNP
LGTVFSEEYIRELTDIAAKYDARIIVDEIHAPLVYEGTHVVAAGVSENAANTCITITATSKAWNTAGLKCAQIFFSNEAD
VKAWKNLSDITRDGVSILGLIAAETVYNEGEEFLDESIQILKDNRDFAAAELEKLGVKVYAPDSTYLMWLDFAGTKIEEA
PSKILREEGKVMLNDGAAFGGFTTCARLNFACSRETLEEGLRRIASVL
>Q64HC5 4.4.1.13~~~metC~~~Cysteine-S-conjugate beta-lyase~~~
MQFSNLDTLRTRGTRKWTQFDDDVIPMFVAESDFPTAPAIKEAIIDACEREMFGYTPAPHAHHLGEAVADFYDWRYGWRP
DAAKIFPVADVVRGVLLAIQYFTDGDVIVPVPAYFPFLPIAEAAGRNRIDISSDKGLEGGLDMAEVEEAFKNGAGSIIVT
NPFNPGGWMFTSEELDQICDIARRYKGRVLVDEIHAPLTYGKRHVCAAANNPDVCITVTATSKAWNVAGLKCAQMIFTND
EDVKTWNAINPVAKDGVGTLGIIAAEAAYESGREHLDWQVEQLKANRDWLVENLPSLIPGIRFEIPDATYLMFLDFKDTK
LGVDKPAAYLLKHARVALSEGVDFGPGGEHRARMNFATSPEILKEATERIARAIEVV
>Q47083 ~~~cbl~~~HTH-type transcriptional regulator cbl~~~COG0583
MNFQQLKIIREAARQDYNLTEVANMLFTSQSGVSRHIRELEDELGIEIFVRRGKRLLGMTEPGKALLVIAERILNEASNV
RRLADLFTNDTSGVLTIATTHTQARYSLPEVIKAFRELFPEVRLELIQGTPQEIATLLQNGEADIGIASERLSNDPQLVA
FPWFRWHHSLLVPHDHPLTQISPLTLESIAKWPLITYRQGITGRSRIDDAFARKGLLADIVLSAQDSDVIKTYVALGLGI
GLVAEQSSGEQEEENLIRLDTRHLFDANTVWLGLKRGQLQRNYVWRFLELCNAGLSVEDIKRQVMESSEEEIDYQI
>P9WQ83 4.4.1.13~~~~~~Putative cystathionine beta-lyase~~~COG1168
MIPNPLEELTLEQLRSQRTSMKWRAHPADVLPLWVAEMDVKLPPTVADALRRAIDDGDTGYPYGTEYAEAVREFACQRWQ
WHDLEVSRTAIVPDVMLGIVEVLRLITDRGDPVIVNSPVYAPFYAFVSHDGRRVIPAPLRGDGRIDLDALQEAFSSARAS
SGSSGNVAYLLCNPHNPTGSVHTADELRGIAERAQRFGVRVVSDEIHAPLIPSGARFTPYLSVPGAENAFALMSASKAWN
LGGLKAALAIAGREAAADLARMPEEVGHGPSHLGVIAHTAAFRTGGNWLDALLRGLDHNRTLLGALVDEHLPGVQYRWPQ
GTYLAWLDCRELGFDDAASDEMTEGLAVVSDLSGPARWFLDHARVALSSGHVFGIGGAGHVRINFATSRAILIEAVSRMS
RSLLERR
>Q5SHW0 4.4.1.13~~~~~~Cystathionine beta-lyase~~~COG1168
MDLPPRTGSLKWGTYPEDVLPLWVADMDFPPAEAIQQALAERARGFLGYPPREGDRELRELILEALGLEAELAFMPGVVV
GLYAAVAAFTAPGQGVLTQVPIYPPFLAAIRDQRRTVLANPLRETPEGYRLDLAGLERLAFATRLLLFCHPHNPTGRVFG
EEELAALAQIARRHDLIVVSDELHAPLTYEKPHVPLARFLPERTLTLVGPGKTYNLAGLPIGAVLGPKPLVEAVKRHLPH
VFPNVLAMAAWKAALKEGGPWLKATLEQLRANRDRVAAWAKARGLGHHPPEGTYLAWIQTPFPKAAAYFLERARVALNPG
ESFGRGYDTYVRLNFATYPEVLEEALRRLDGALK
>P50848 3.4.17.19~~~ypwA~~~Carboxypeptidase 1~~~COG2317
MEIHTYEKEFFDLLKRISHYSEAVALMHWDSRTGAPKNGSEDRAESIGQLSTDIFNIQTSDRMKELIDVLYERFDDLSED
TKKAVELAKKEYEENKKIPEAEYKEYVILCSKAETAWEEAKGKSDFSLFSPYLEQLIEFNKRFITYWGYQEHPYDALLDL
FEPGVTVKVLDQLFAELKEAIIPLVKQVTASGNKPDTSFITKAFPKEKQKELSLYFLQELGYDFDGGRLDETVHPFATTL
NRGDVRVTTRYDEKDFRTAIFGTIHECGHAIYEQNIDEALSGTNLSDGASMGIHESQSLFYENFIGRNKHFWTPYYKKIQ
EASPVQFKDISLDDFVRAINESKPSFIRVEADELTYPLHIIIRYEIEKAIFSNEVSVEDLPSLWNQKYQDYLGITPQTDA
EGILQDVHWAGGDFGYFPSYALGYMYAAQLKQKMLEDLPEFDALLERGEFHPIKQWLTEKVHIHGKRKKPLDIIKDATGE
ELNVRYLIDYLSNKYSNLYLL
>P42663 3.4.17.19~~~~~~Thermostable carboxypeptidase 1~~~
MTPEAAYQNLLEFQRETAYLGSLGALAAWDQRTMIPRKGHGHRARQMAALARLLHERATDPRIGEWLEKVEGSSLVEDPL
SDAAVNVRAWRRAYERARAIPERLAVELAQARSEGETAWEALRPRDDWQGFLPYLKRLFALAKEEAEILMAVGPDPLDPP
YGELYDALLDGYEPGARARDLEPLFRELSSGLKGLLDRILGSGRRPDVGVLHRHYPKEAQRAFALELLQACGYDLEAGRL
DPTAHPFEIAIGPGDVRITTRYYEDFFNAGIFGTLHEMGHALYEQGLPEAHWGTPRGEAASLGVHESQSRTWENLVGRSL
GFWERFFPRAKEVFSSLADVRLEDFHFAVNAVEPSLIRVEADEVTYNLHILVRLELELALFRGELFLEDLPEAWREKYRA
YLGVAPRDYKDGVMQDVHWSGGMFGYFPTYTLGNLYAAQFFAKAQEELGPLEPLFARGEFTPFLDWTRRKIHAEGSRFRP
RALVERVTGSPPGAQAFLRYLEAKYGALYGF
>Q5SLM3 3.4.17.19~~~~~~Thermostable carboxypeptidase 1~~~COG2317
MTPEAAYQNLLEFQRETAYLASLGALAAWDQRTMIPKKGHEHRARQMAALARLLHQRMTDPRIGEWLEKVEGSPLVQDPL
SDAAVNVREWRQAYERARAIPERLAVELAQAESEAESFWEEARPRDDWRGFLPYLKRVYALTKEKAEVLFALPPAPGDPP
YGELYDALLDGYEPGMRARELLPLFAELKEGLKGLLDRILGSGKRPDTSILHRPYPVEAQRRFALELLSACGYDLEAGRL
DPTAHPFEIAIGPGDVRITTRYYEDFFNAGIFGTLHEMGHALYEQGLPKEHWGTPRGDAVSLGVHESQSRTWENLVGRSL
GFWERFFPRAREVFASLGDVSLEDFHFAVNAVEPSLIRVEADEVTYNLHILVRLELELALFRGELSPEDLPEAWAEKYRD
HLGVAPKDYKDGVMQDVHWAGGLFGYFPTYTLGNLYAAQFFQKAEAELGPLEPRFARGEFQPFLDWTRARIHAEGSRFRP
RVLVERVTGEAPSARPFLAYLEKKYAALYG
>P36659 ~~~cbpA~~~Curved DNA-binding protein~~~COG0484
MELKDYYAIMGVKPTDDLKTIKTAYRRLARKYHPDVSKEPDAEARFKEVAEAWEVLSDEQRRAEYDQMWQHRNDPQFNRQ
FHHGDGQSFNAEDFDDIFSSIFGQHARQSRQRPATRGHDIEIEVAVFLEETLTEHKRTISYNLPVYNAFGMIEQEIPKTL
NVKIPAGVGNGQRIRLKGQGTPGENGGPNGDLWLVIHIAPHPLFDIVGQDLEIVVPVSPWEAALGAKVTVPTLKESILLT
IPPGSQAGQRLRVKGKGLVSKKQTGDLYAVLKIVMPPKPDENTAALWQQLADAQSSFDPRKDWGKA
>B9K7M6 2.4.1.20~~~cbpA~~~Cellobiose phosphorylase~~~COG3459
MKFGYFDDKNREYVIVTPRTPYPWINYLGTEDFFSIISHMAGGYCFYKDARLRRITRFRYNNVPTDAGGRYFYIREEDGD
FWSPTWMPVRRDLSFFEARHGLGYTKIAGERNGLRATITFFVPRHFTGEVHHLVLQNRTERPRRIKLFSFIEFCLWNALD
DMTNFQRNYSTGEVEIEGSVIYHKTEYRERRNHYAFYSVNHSIDGFDTDRESFMGLYNGFEAPQAVVEGNPRNSVASGWA
PIASHYLELEIPPLGEKELIFILGYVENPEEEKWERPGVINKKRAKEMIERFKTGEDVERALKELKEYWDELLGRIQVET
HDEKLNRMVNIWNQYQCMVTFNIARSASYFESGISRGIGFRDSNQDILGFVHMIPEKARQRILDLASIQFEDGSTYHQFQ
PLTKKGNNEIGGGFNDDPLWLILSTSAYIKETGDWSILNEEVPFDNDPDKKATLFEHLKRSFYFTVNNLGPHGLPLIGRA
DWNDCLNLNCFSKNPDESFQTTVNALDGRVAESVFIAGLFVLAGKEFVEICRRLGLEDEAKEAEKHVKKMIETTLEYGWD
GEWFLRAYDAFGRKVGSKECEEGKIFIEPQGMCVMAGIGVENGYAKKALDSVKEHLDTPYGLVLQQPAYSRYYIELGEIS
SYPPGYKENAGIFCHNNPWVAIAETVIGRGDRAFEIYRKITPAYLEDISEIHRTEPYVYAQMVAGKDAPRHGEAKNSWLT
GTAAWSFVAITQYILGVRPTYDGLMVDPCIPEDWDGFKITRRFRGATYEITVKNPHHVSKGVKEIIVDGKKIEGQVLPVF
NDGKVHRVEVLMG
>Q02I11 ~~~cpbD~~~Chitin-binding protein CbpD~~~
MKHYSATLALLPLTLALFLPQAAHAHGSMETPPSRVYGCFLEGPENPKSAACKAAVAAGGTQALYDWNGVNQGNANGNHQ
AVVPDGQLCGAGKALFKGLNLARSDWPSTAIAPDASGNFQFVYKASAPHATRYFDFYITKDGYNPEKPLAWSDLEPAPFC
SITSVKLENGTYRMNCPLPQGKTGKHVIYNVWQRSDSPEAFYACIDVSFSGAVANPWQALGNLRAQQDLPAGATVTLRLF
DAQGRDAQRHSLTLAQGNNGAKQWPLALAQKVNQDSTLVNIGVLDAYGAVSPVASSQDNQVYVRQAGYRFQVDIELPVEG
GGEQPGGDGKVDFDYPQGLQQYDAGTVVRGADGKRYQCKPYPNSGWCKGWDLYYAPGKGMAWQDAWTLL
>Q9I589 ~~~cbpD~~~Chitin-binding protein CbpD~~~
MKHYSATLALLPLTLALFLPQAAHAHGSMETPPSRVYGCFLEGPENPKSAACKAAVAAGGTQALYDWNGVNQGNANGNHQ
AVVPDGQLCGAGKALFKGLNLARSDWPSTAIAPDASGNFQFVYKASAPHATRYFDFYITKDGYNPEKPLAWSDLEPAPFC
SITSVKLENGTYRMNCPLPQGKTGKHVIYNVWQRSDSPEAFYACIDVSFSGAVANPWQALGNLRAQQDLPAGATVTLRLF
DAQGRDAQRHSLTLAQGANGAKQWPLALAQKVNQDSTLVNIGVLDAYGAVSPVASSQDNQVYVRQAGYRFQVDIELPVEG
GGEQPGGDGKVDFDYPQGLQQYDAGTVVRGADGKRYQCKPYPNSGWCKGWDLYYAPGKGMAWQDAWTLL
>P06621 3.4.17.11~~~cpg2~~~Carboxypeptidase G2~~~
MRPSIHRTAIAAVLATAFVAGTALAQKRDNVLFQAATDEQPAVIKTLEKLVNIETGTGDAEGIAAAGNFLEAELKNLGFT
VTRSKSAGLVVGDNIVGKIKGRGGKNLLLMSHMDTVYLKGILAKAPFRVEGDKAYGPGIADDKGGNAVILHTLKLLKEYG
VRDYGTITVLFNTDEEKGSFGSRDLIQEEAKLADYVLSFEPTSAGDEKLSLGTSGIAYVQVNITGKASHAGAAPELGVNA
LVEASDLVLRTMNIDDKAKNLRFNWTIAKAGNVSNIIPASATLNADVRYARNEDFDAAMKTLEERAQQKKLPEADVKVIV
TRGRPAFNAGEGGKKLVDKAVAYYKEAGGTLGVEERTGGGTDAAYAALSGKPVIESLGLPGFGYHSDKAEYVDISAIPRR
LYMAARLIMDLGAGK
>P63264 ~~~cbpM~~~Chaperone modulatory protein CbpM~~~COG0789
MANVTVTFTITEFCLHTGISEEELNEIVGLGVVEPREIQETTWVFDDHAAIVVQRAVRLRHELALDWPGIAVALTLMDDI
AHLKQENRLLRQRLSRFVAHP
>P00733 3.4.17.14~~~~~~Zinc D-Ala-D-Ala carboxypeptidase~~~
MRPRPIRLLLTALVGAGLAFAPVSAVAAPTATASASADVGALDGCYTWSGTLSEGSSGEAVRQLQIRVAGYPGTGAQLAI
DGQFGPATKAAVQRFQSAYGLAADGIAGPATFNKIYQLQDDDCTPVNFTYAELNRCNSDWSGGKVSAATARANALVTMWK
LQAMRHAMGDKPITVNGGFRSVTCNSNVGGASNSRHMYGHAADLGAGSQGFCALAQAARNHGFTEILGPGYPGHNDHTHV
AGGDGRFWSAPSCGI
>P18143 3.4.17.18~~~scpD~~~Zinc carboxypeptidase~~~
MRLVTARRPRPTKGRRNAALTVLLALALAAPATAVATAGNAAPNAAVAADERTLQYEITGRTTPAARTDIARAGVSIDEV
HDHGVVITADAAQARKLRARGHVLEALPAPDAAPRAADGVSALDFPPADSRYHNYAEMNAAIDARIAANPSIMSKRVIGK
TYQGRDVIAVKVSDNVATDEAEPEVLFTAHQHAREHLTVEMALYLLRELGQGYGSDSRITQAVNGRELWIVPDMNPDGGE
YDIASGSYRSWRKNRQPNAGSSAVGTDLNRNWAYKWGCCGGSSSSPSSETYRGAAAESAPETKVVADFVRSRVVGGKQQI
TAAIDFHTYSELVLWPFGYTYNDTAPGMTADDRNAFAAVGQKMAASNGYTAEQSSDLYITDGSIDDWLWGSQKIFGYTFE
MYPRSASGGGFYPPDEVIERETSRNRDAVLQLIENADCMYRSIGKEAQYCS
>P29068 3.4.17.18~~~cpt~~~Carboxypeptidase T~~~
MRKKWLSLSLVLVLIVACVPALGFSQNIENPSIFDLGIKLYKIDGVSTKEQRSAIASTGAAIEEVGKDYVKVLATPSEAK
QIKQKGFTATVDTSLTTQDFPSYDSGYHNYNEMVNKINTVASNYPNIVKKFSIGKSYEGRELWAVKISDNVGTDENEPEV
LYTALHHAREHLTVEMALYTLDLFTQNYNLDSRITNLVNNREIYIVFNINPDGGEYDISSGSYKSWRKNRQPNSGSSYVG
TDLNRNYGYKWGCCGGSSGSPSSETYRGRSAFSAPETAAMRDFINSRVVGGKQQIKTLITFHTYSELILYPYGYTYTDVP
SDMTQDDFNVFKTMANTMAQTNGYTPQQASDLYITDGDMTDWAYGQHKIFAFTFEMYPTSYNPGFYPPDEVIGRETSRNK
EAVLYVAEKADCPYSVIGKSCSTK
>P06495 ~~~~~~Calerythrin~~~
MTTAIASDRLKKRFDRWDFDGNGALERADFEKEAQHIAEAFGKDAGAAEVQTLKNAFGGLFDYLAKEAGVGSDGSLTEEQ
FIRVTENLIFEQGEASFNRVLGPVVKGTWGMCDKNADGQINADEFAAWLTALGMSKAEAAEAFNQVDTNGNGELSLDELL
TAVRDFHFGRLDVELLG
>P31456 ~~~cbrA~~~Protein CbrA~~~COG0644
MEHFDVAIIGLGPAGSALARKLAGKMQVIALDKKHQCGTEGFSKPCGGLLAPDAQRSFIRDGLTLPVDVIANPQIFSVKT
VDVAASLTRNYQRSYININRHAFDLWMKSLIPASVEVYHDSLCRKIWREDDKWHVIFRADGWEQHITARYLVGADGANSM
VRRHLYPDHQIRKYVAIQQWFAEKHPVPFYSCIFDNSITNCYSWSISKDGYFIFGGAYPMKDGQTRFTTLKEKMSAFQFQ
FGKTVKSEKCTVLFPSRWQDFVCGKDNAFLIGEAAGFISASSLEGISYALDSTDILRSVLLKQPEKLNTAYWRATRKLRL
KLFGKIVKSRCLTAPALRKWIMRSGVAHIPQLKD
>P31468 ~~~cbrB~~~Inner membrane protein CbrB~~~
MSVSRRVIHHGLYFAVLGPLIGVLFLVLYIFFAKEPLVLWVIIHPIFLLLSITTGAIPALLTGVMVACLPEKIGSQKRYR
CLAGGIGGVVITEIYCAVIVHIKGMASSELFENILSGDSLVVRIIPALLAGVVMSRIITRLPGLDISCPETDSLS
>P31469 ~~~cbrC~~~UPF0167 protein CbrC~~~COG3196
MTQNIRPLPQFKYHPKPLETGAFEQDKTVECDCCEQQTSVYYSGPFYCVDEVEHLCPWCIADGSAAEKFAGSFQDDASIE
GVEFEYDEEDEFAGIKNTYPDEMLKELVERTPGYHGWQQEFWLAHCGDFCVFIGYVGWNDIKDRLDEFANLEEDCENFGI
RNSDLAKCLQKGGHCQGYLFRCLHCGKLRLWGDFS
>P64524 ~~~cbtA~~~Cytoskeleton-binding toxin CbtA~~~
MKTLPVLPGQAASSRPSPVEIWQILLSRLLDQHYGLTLNDTPFADERVIEQHIEAGISLCDAVNFLVEKYALVRTDQPGF
SACTRSQLINSIDILRARRATGLMTRDNYRTVNNITLGKYPEAK
>O06380 3.4.16.-~~~~~~Carboxypeptidase Rv3627c~~~COG2027
MGPTRWRKSTHVVVGAAVLAFVAVVVAAAALVTTGGHRAGVRAPAPPPRPPTVKAGVVPVADTAATPSAAGVTAALAVVA
ADPDLGKLAGRITDALTGQELWQRLDDVPLVPASTNKILTAAAALLTLDRQARISTRVVAGGQNPQGPVVLVGAGDPTLS
AAPPGQDTWYHGAARIGDLVEQIRRSGVTPTAVQVDASAFSGPTMAPGWDPADIDNGDIAPIEAAMIDAGRIQPTTVNSR
RSRTPALDAGRELAKALGLDPAAVTIASAPAGARQLAVVQSAPLIQRLSQMMNASDNVMAECIGREVAVAINRPQSFSGA
VDAVTSRLNTAHIDTAGAALVDSSGLSLDNRLTARTLDATMQAAAGPDQPALRPLLDLLPIAGGSGTLGERFLDAATDQG
PAGWLRAKTGSLTAINSLVGVLTDRSGRVLTFAFISNEAGPNGRNAMDALATKLWFCGCTT
>P06961 ~~~cca~~~Multifunctional CCA protein~~~COG0617
MKIYLVGGAVRDALLGLPVKDRDWVVVGSTPQEMLDAGYQQVGRDFPVFLHPQTHEEYALARTERKSGSGYTGFTCYAAP
DVTLEDDLKRRDLTINALAQDDNGEIIDPYNGLGDLQNRLLRHVSPAFGEDPLRVLRVARFAARYAHLGFRIADETLALM
REMTHAGELEHLTPERVWKETESALTTRNPQVFFQVLRDCGALRVLFPEIDALFGVPAPAKWHPEIDTGIHTLMTLSMAA
MLSPQVDVRFATLCHDLGKGLTPPELWPRHHGHGPAGVKLVEQLCQRLRVPNEIRDLARLVAEFHDLIHTFPMLNPKTIV
KLFDSIDAWRKPQRVEQLALTSEADVRGRTGFESADYPQGRWLREAWEVAQSVPTKAVVEAGFKGVEIREELTRRRIAAV
ASWKEQRCPKPE
>Q7SIB1 2.7.7.72~~~cca~~~CCA-adding enzyme~~~
MKPPFQEALGIIQQLKQHGYDAYFVGGAVRDLLLGRPIGDVDIATSALPEDVMAIFPKTIDVGSKHGTVVVVHKGKAYEV
TTFKTDGDYEDYRRPESVTFVRSLEEDLKRRDFTMNAIAMDEYGTIIDPFGGREAIRRRIIRTVGEAEKRFREDALRMMR
AVRFVSELGFALAPDTEQAIVQNAPLLAHISVERMTMEMEKLLGGPFAARALPLLAETGLNAYLPGLAGKEKQLRLAAAY
RWPWLAAREERWALLCHALGVQESRPFLRAWKLPNKVVDEAGAILTALADIPRPEAWTNEQLFSAGLERALSVETVRAAF
TGAPPGPWHEKLRRRFASLPIKTKGELAVNGKDVIEWVGKPAGPWVKEALDAIWRAVVNGEVENEKERIYAWLMERNRTR
EKNC
>Q3IZ91 1.3.1.85~~~ccr~~~Crotonyl-CoA carboxylase/reductase~~~COG0604
MALDVQSDIVAYDAPKKDLYEIGEMPPLGHVPKEMYAWAIRRERHGEPDQAMQIEVVETPSIDSHEVLVLVMAAGVNYNG
IWAGLGVPVSPFDGHKQPYHIAGSDASGIVWAVGDKVKRWKVGDEVVIHCNQDDGDDEECNGGDPMFSPTQRIWGYETPD
GSFAQFTRVQAQQLMKRPKHLTWEEAACYTLTLATAYRMLFGHKPHDLKPGQNVLVWGASGGLGSYAIQLINTAGANAIG
VISEEDKRDFVMGLGAKGVINRKDFKCWGQLPKVNSPEYNEWLKEARKFGKAIWDITGKGINVDMVFEHPGEATFPVSSL
VVKKGGMVVICAGTTGFNCTFDVRYMWMHQKRLQGSHFANLKQASAANQLMIERRLDPCMSEVFPWAEIPAAHTKMYRNQ
HKPGNMAVLVQAPRTGLRTFADVLEAGRKA
>P62553 ~~~ccdA~~~Antitoxin CcdA~~~COG5302
MKQRITVTVDSDSYQLLKAYDVNISGLVSTTMQNEARRLRAERWKAENQEGMAEVARFIEMNGSFADENRDW
>P62552 ~~~ccdA~~~Antitoxin CcdA~~~
MKQRITVTVDSDSYQLLKAYDVNISGLVSTTMQNEARRLRAERWKAENQEGMAEVARFIEMNGSFADENRDW
>P62554 ~~~ccdB~~~Toxin CcdB~~~
MQFKVYTYKRESRYRLFVDVQSDIIDTPGRRMVIPLASARLLSDKVSRELYPVVHIGDESWRMMTTDMASVPVSVIGEEV
ADLSHRENDIKNAINLMFWGI
>P0DOA0 ~~~cckA~~~Sensor kinase CckA~~~
MRPKWCKTTVLASFPRGSACTARQLKHSANRIYLDARASVPARRAGILNEDRIELPDDAIGASAERRSTRDLRKPSRAGI
QHERAVGLMSRQTDNTYPKPLVMPKRGPSAALRLLIVGILLMGAAFIYFLFRDQLGDGFALVLMGVLSMVGVFYLFGAAT
GLIQFSQRSDHQDLAHSFMDTQPEGTVISDPRGQIVYANQAYARMTGATDADGIRPLDQVLASEPAASDAIYRLTNAVRD
GLSAQEEVRISGGLSRGANGSLAPVWYRIKARALEGGAEFKGPLVAWQVADISEERAEQERFFQELQEAINHLDHAPAGF
FSANPAGRIIYLNATLAEWLGVDLTQFTPGSLTLNDIVAGSGMALIKAVKAEPGTSRNTVIDLDLIKRNGQSLAVRFYHR
VQTARDGMPGTTRTIVLDRAEGDDSSVAQRSAEVRFTRFFNSAPMAIAAVDAEGHTLRTNARFLDIFSGVVDRDAIDRRV
KLENVVHERDRETFNKALAAAFAGQASISPVDTVLPGNEERHIRFYMSPVTDLGGEAAEEAAVISAVETTEQKALENQMA
QSQKMQAVGQLAGGIAHDFNNVLTAIIMSSDLLLTNHRASDPSFPDIMNIKQNANRAASLVRQLLAFSRRQTLRPEVLDL
TDVLADLRMLLARLVGKDIELKIDHGRDLWPVKADLGQFEQVAVNLAVNARDAMPEGGQITLRTRNIPAADAAKLHYRDL
PEADYVVFEVEDTGTGIPADVLEKIFEPFFTTKEVGKGTGLGLSMVYGIIKQTGGFIYCDSEVGKGTTFKIFLPRLIEEK
RADDAPVAAKEKKVEKATDLSGSATVLLVEDEDAVRMGGVRALQSRGYTVHEAASGVEALEIMEELGGEVDIVVSDVVMP
EMDGPTLLRELRKTHPDIKFIFVSGYAEDAFARNLPADAKFGFLPKPFSLKQLATTVKEMLEKQD
>B2MVM5 ~~~cclA~~~Carnocyclin-A~~~
MLYELVAYGIAQGTAEKVVSLINAGLTVGSIISILGGVTVGLSGVFTAVKAAIAKQGIKKAIQL
>A9WGE2 4.1.3.46~~~ccl~~~(R)-citramalyl-CoA lyase~~~COG0119
MEAVTIVDVAPRDGLQNEPDVLEPATRVELIERLLAAGVPRIEIGSFVNPRQVPQMAGIDQIARMLIERGHNLAARTTND
LFRFTALVPNQRGYELAAAAGLRHVRLVLAASDGLNRANFKRTTAESLIEFSRFALNIRRDGLTFGVAIGAAFGCPFDGY
VSPERVRAIAEHAVDIGAGEIILADTTGMAVPTQVAALCRTILDRIPDVTVTLHLHNTRNTGYANAFAAWQVGIRSFDAA
LGGIGGCPFAPRAVGNIASEDLVHLFNGLGVPTGIDLSALIAASDWLSATLGRPLPALVGKAGPVYPQVVSMAPYLS
>P33931 7.6.2.5~~~ccmA~~~Cytochrome c biogenesis ATP-binding export protein CcmA~~~COG4133
MGMLEARELLCERDERTLFSGLSFTLNAGEWVQITGSNGAGKTTLLRLLTGLSRPDAGEVLWQGQPLHQVRDSYHQNLLW
IGHQPGIKTRLTALENLHFYHRDGDTAQCLEALAQAGLAGFEDIPVNQLSAGQQRRVALARLWLTRATLWILDEPFTAID
VNGVDRLTQRMAQHTEQGGIVILTTHQPLNVAESKIRRISLTQTRAA
>P52218 7.6.2.5~~~ccmA~~~Cytochrome c biogenesis ATP-binding export protein CcmA~~~COG4133
MNLLAVRDLAVARGGLRAVEGVCFNLNAGGALVLRGPNGIGKTTLLRTLAGLQPLVSGVIEAAPDAIAYAGHSDGLKPAL
TVTENLRFWAEIFGGRNIDAALEAMNLRDLANRPAHALSAGQKRRLGLARLMVTGRPVWLLDEPTVSLDRDSVALFAAML
RAHLGRGGAAVIATHIDLGLPEAEILELGPFRASELRRQSRPAGFNEAFG
>P0ABL8 ~~~ccmB~~~Heme exporter protein B~~~COG2386
MMFWRIFRLELRVAFRHSAEIANPLWFFLIVITLFPLSIGPEPQLLARIAPGIIWVAALLSSLLALERLFRDDLQDGSLE
QLMLLPLPLPAVVLAKVMAHWMVTGLPLLILSPLVAMLLGMDVYGWQVMALTLLLGTPTLGFLGAPGVALTVGLKRGGVL
LSILVLPLTIPLLIFATAAMDAASMHLPVDGYLAILGALLAGTATLSPFATAAALRISIQ
>P0ABM1 ~~~ccmC~~~Heme exporter protein C~~~COG0755
MWKTLHQLAIPPRLYQICGWFIPWLAIASVVVLTVGWIWGFGFAPADYQQGNSYRIIYLHVPAAIWSMGIYASMAVAAFI
GLVWQMKMANLAVAAMAPIGAVFTFIALVTGSAWGKPMWGTWWVWDARLTSELVLLFLYVGVIALWHAFDDRRLAGRAAG
ILVLIGVVNLPIIHYSVEWWNTLHQGSTRMQQSIDPAMRSPLRWSIFGFLLLSATLTLMRMRNLILLMEKRRPWVSELIL
KRGRK
>P0ABM5 ~~~ccmD~~~Heme exporter protein D~~~COG3114
MTPAFASWNEFFAMGGYAFFVWLAVVMTVIPLVVLVVHSVMQHRAILRGVAQQRAREARLRAAQQQEAA
>P69490 ~~~ccmE~~~Cytochrome c-type biogenesis protein CcmE~~~COG2332
MNIRRKNRLWIACAVLAGLALTIGLVLYALRSNIDLFYTPGEILYGKRETQQMPEVGQRLRVGGMVMPGSVQRDPNSLKV
TFTIYDAEGSVDVSYEGILPDLFREGQGVVVQGELEKGNHILAKEVLAKHDENYTPPEVEKAMEANHRRPASVYKDPAS
>Q9LA01 ~~~ccmE~~~Cytochrome c-type biogenesis protein CcmE~~~COG2332
MRNLKKTRRIQILLVAGGALVLSTALIGYGMRDGINFFRAPSQIVAEPPAAGEVFRLGGLVEAGTLVRGQGEEITFKVTD
GGASVPVTFTGVLPDLFGEGKGMVGTGEMVEGTFVAREILAKHDENYMPKEVTEALKEQGVYRDPAQPEG
>Q8EK44 ~~~ccmE~~~Cytochrome c-type biogenesis protein CcmE~~~COG2332
MNPRRKKRLTLAVALIGGVAAIASLLLYALNSNLNLFYTPSEIVNGKTDTGVKPEAGQRIRVGGMVTVGSMVRDPNSLHV
QFAVHDSLGGEILVTYDDLLPDLFREGQGIVAQGVLGEDGKLAATEVLAKHDENYMPPEVAEAMGQKHEKLDYSQQKSAT
Q
>P33927 ~~~ccmF~~~Cytochrome c-type biogenesis protein CcmF~~~COG1138
MMPEIGNGLLCLALGIALLLSVYPLWGVARGDARMMASSRLFAWLLFMSVAGAFLVLVNAFVVNDFTVTYVASNSNTQLP
VWYRVAATWGAHEGSLLLWVLLMSGWTFAVAIFSQRIPLDIVARVLAIMGMVSVGFLLFILFTSNPFSRTLPNFPIEGRD
LNPLLQDPGLIFHPPLLYMGYVGFSVAFAFAIASLLSGRLDSTYARFTRPWTLAAWIFLTLGIVLGSAWAYYELGWGGWW
FWDPVENASFMPWLVGTALMHSLAVTEQRASFKAWTLLLAISAFSLCLLGTFLVRSGVLVSVHAFASDPARGMFILAFMV
LVIGGSLLLFAARGHKVRSRVNNALWSRESLLLANNVLLVAAMLVVLLGTLLPLVHKQLGLGSISIGEPFFNTMFTWLMV
PFALLLGVGPLVRWGRDRPRKIRNLLIIAFISTLVLSLLLPWLFESKVVAMTVLGLAMACWIAVLAIAEAALRISRGTKT
TFSYWGMVAAHLGLAVTIVGIAFSQNYSVERDVRMKSGDSVDIHEYRFTFRDVKEVTGPNWRGGVATIGVTRDGKPETVL
YAEKRYYNTAGSMMTEAAIDGGITRDLYAALGEELENGAWAVRLYYKPFVRWIWAGGLMMALGGLLCLFDPRYRKRVSPQ
KTAPEAV
>P0ABM9 ~~~ccmH~~~Cytochrome c-type biogenesis protein CcmH~~~COG3088
MRFLLGVLMLMISGSALATIDVLQFKDEAQEQQFRQLTEELRCPKCQNNSIADSNSMIATDLRQKVYELMQEGKSKKEIV
DYMVARYGNFVTYDPPLTPLTVLLWVLPVVAIGIGGWVIYARSRRRVRVVPEAFPEQSVPEGKRAGYVVYLPGIVVALIV
AGVSYYQTGNYQQVKIWQQATAQAPALLDRALDPKADPLNEEEMSRLALGMRTQLQKNPGDIEGWIMLGRVGMALGNASI
ATDAYATAYRLDPKNSDAALGYAEALTRSSDPNDNRLGGELLRQLVRTDHSNIRVLSMYAFNAFEQQRFGEAVAAWEMML
KLLPANDTRRAVIERSIAQAMQHLSPQESK
>Q9I3N0 ~~~ccmH~~~Cytochrome c-type biogenesis protein CcmH~~~
MKRFLATALLGLALCGVARAAIDTYEFASDAERERFRNLTQELRCPKCQNQDIADSNAPIAADLRKQIYGQLQQGKSDGE
IVDYMVARYGDFVRYKPPVNERTWLLWFGPGALLLFGVLVIGVIVLRRRRTAAKVQTTLSAEEQARLANLLKNDK
>P72760 ~~~ccmK1~~~Carboxysome shell protein CcmK1~~~COG4577
MSIAVGMIETLGFPAVVEAADSMVKAARVTLVGYEKIGSGRVTVIVRGDVSEVQASVTAGIENIRRVNGGEVLSNHIIAR
PHENLEYVLPIRYTEAVEQFREIVNPSIIRR
>Q8DKB3 ~~~ccmK1~~~Carboxysome shell protein CcmK1~~~COG4577
MAIAVGMIETLGFPAVVEAADAMVKAARVTLVGYEKIGSGRVTVIVRGDVSEVQASVAAGVENVKRVNGGQVLSTHIIAR
PHENLEYVLPIRYTEAVEQFRESVSGIRPMGRP
>Q03511 ~~~ccmK2~~~Carboxysome shell protein CcmK2~~~COG4577
MPIAVGMIETLGFPAVVEAADAMVKAARVTLVGYEKIGSGRVTVIVRGDVSEVQASVSAGLDSAKRVAGGEVLSHHIIAR
PHENLEYVLPIRYTEAVEQFRM
>P72761 ~~~ccmK2~~~Carboxysome shell protein CcmK2~~~COG4577
MSIAVGMIETRGFPAVVEAADSMVKAARVTLVGYEKIGSGRVTVIVRGDVSEVQASVSAGIEAANRVNGGEVLSTHIIAR
PHENLEYVLPIRYTEEVEQFRTY
>Q8DKB2 ~~~ccmK2~~~Carboxysome shell protein CcmK2~~~COG4577
MPIAVGMIETRGFPAVVEAADAMVKAARVTLVGYEKIGSGRVTVIVRGDVSEVQASVAAGVDSAKRVNGGEVLSTHIIAR
PHENLEYVLPIRYTEAVEQFRN
>K9Y7V9 ~~~ccmK3~~~Carboxysome shell protein CcmK3~~~COG4577
MPVAVGVIQTDGFPAVLAAADAMVKAASVTLVSFDKAERAQFYVAVRGPVSEVERSMEAGIAAAEETYNGTVITHYMIPN
PPDNVETVMPIAYSDEVEPFRV
>Q31RK3 ~~~ccmK3~~~Carboxysome shell protein CcmK3~~~COG4577
MPIAVGTIQTLGFPPIIAAADAMVKAARVTITQYGLAESAQFFVSVRGPVSEVETAVEAGLKAVAETEGAELINYIVIPN
PQENVETVMPIDFTAESEPFRS
>P73406 ~~~ccmK3~~~Carboxysome shell protein CcmK3~~~COG4577
MPQAVGVIQTLGFPSVLAAADAMLKGGRVTLVYYDLAERGNFVVAIRGPVSEVNLSMKMGLAAVNESVMGGEIVSHYIVP
NPPENVLAVLPVEYTEKVARFRT
>K9Y6N7 ~~~ccmK4~~~Carboxysome shell protein CcmK4~~~COG4577
MSLDAVGSLETKGFPGVLAAADAMVKTGRVTLVGYIRAGSARFTIIIRGDVSEVKTAMDAGIHAVDKAYGAALETWVIIP
RPHENVECVLPIAYNENVERFRESTERPLIGSSQNRS
>Q31RK2 ~~~ccmK4~~~Carboxysome shell protein CcmK4~~~COG4577
MSQQAIGSLETKGFPPILAAADAMVKAGRITIVSYMRAGSARFAVNIRGDVSEVKTAMDAGIEAAKNTPGGTLETWVIIP
RPHENVEAVFPIGFGPEVEQYRLSAEGTGSGRR
>P73407 ~~~ccmK4~~~Carboxysome shell protein CcmK4~~~COG4577
MSAQSAVGSIETIGFPGILAAADAMVKAGRITIVGYIRAGSARFTLNIRGDVQEVKTAMAAGIDAINRTEGADVKTWVII
PRPHENVVAVLPIDFSPEVEPFREAAEGLNRR
>Q7NIT8 ~~~ccmL~~~Carboxysome shell vertex protein CcmL~~~COG4576
MQIGRVRGTVVSSQKEPSMVGVKFLLLQLIDEAGQPLPQYEVAADGVGAGLDEWVLFSRGSAARQVAGSEKRPVDAVVIG
IIDTVSVDNRPLYSKKDQYR
>Q8YYI2 ~~~ccmL~~~Carboxysome shell vertex protein CcmL~~~COG4576
MQIAKVRGTVVSTQKDPSLRGVKLLLLQLVDEEGNLLQKYEVAADNSVGAGFDEWVLISRGSAARQLLGNEQRPVDAAVV
AIIDTIHVEDRLIYSKKDQYR
>Q03512 ~~~ccmL~~~Carboxysome shell vertex protein CcmL~~~COG4576
MRIAKVRGTVVSTYKEPSLQGVKFLVVQFLDEAGQALQEYEVAADMVGAGVDEWVLISRGSQARHVRDCQERPVDAAVIA
IIDTVNVENRSVYDKREHS
>P72759 ~~~ccmL~~~Carboxysome shell vertex protein CcmL~~~COG4576
MQLAKVLGTVVSTSKTPNLTGVKLLLVQFLDTKGQPLERYEVAGDVVGAGLNEWVLVARGSAARKERGNGDRPLDAMVVG
IIDTVNVASGSLYNKRDDGR
>Q8DKB4 ~~~ccmL~~~Carboxysome shell vertex protein CcmL~~~COG4576
MKIARVCGTVTSTQKEDTLTGVKFLVLQYLGEDGEFLPDYEVAADTVGAGQDEWVLVSRGSAARHIINGTDKPIDAAVVA
IIDTVSRDNYLLYSKRTQY
>Q8YYI3 4.2.1.1~~~ccmM~~~Carboxysome assembly protein CcmM~~~COG0663
MAVRSTAAPPTPWSRSLAEAQIHESAFVHPFSNIIGDVHIGANVIIAPGTSIRADEGTPFHIGENTNIQDGVVIHGLEQG
RVVGDDNKEYSVWVGSSASLTHMALIHGPAYVGDNSFIGFRSTVFNAKVGAGCIVMMHALIKDVEVPPGKYVPSGAIITN
QKQADRLPDVQPQDRDFAHHVIGINQALRAGYLCAADSKCIAPLRNDQVKSYTSTTVIGLERSSEVASNSLGAETIEQVR
YLLEQGYKIGSEHVDQRRFRTGSWTSCQPIEARSVGDALAALEACLADHSGEYVRLFGIDPKGKRRVLETIIQRPDGVVA
GSTSFKAPASNTNGNGSYHSNGNGNGYSNGATSGKVSAETVDQIRQLLAGGYKIGTEHVDERRFRTGSWNSCKPIEATSA
GEVVAALEECIDSHQGEYIRLIGIDPKAKRRVLESIIQRPNGQVAPSSSPRTVVSASSASSGTATATATRLSTEVVDQVR
QILGGGYKLSIEHVDQRRFRTGSWSSTGAISATSEREAIAVIEASLSEFAGEYVRLIGIDPKAKRRVLETIIQRP
>Q03513 ~~~ccmM~~~Carboxysome assembly protein CcmM~~~COG0663
MPSPTTVPVATAGRLAEPYIDPAAQVHAIASIIGDVRIAAGVRVAAGVSIRADEGAPFQVGKESILQEGAVIHGLEYGRV
LGDDQADYSVWIGQRVAITHKALIHGPAYLGDDCFVGFRSTVFNARVGAGSVIMMHALVQDVEIPPGRYVPSGAIITTQQ
QADRLPEVRPEDREFARHIIGSPPVIVRSTPAATADFHSTPTPSPLRPSSSEATTVSAYNGQGRLSSEVITQVRSLLNQG
YRIGTEHADKRRFRTSSWQPCAPIQSTNERQVLSELENCLSEHEGEYVRLLGIDTNTRSRVFEALIQRPDGSVPESLGSQ
PVAVASGGGRQSSYASVSGNLSAEVVNKVRNLLAQGYRIGTEHADKRRFRTSSWQSCAPIQSSNERQVLAELENCLSEHE
GEYVRLLGIDTASRSRVFEALIQDPQGPVGSAKAAAAPVSSATPSSHSYTSNGSSSSDVAGQVRGLLAQGYRISAEVADK
RRFQTSSWQSLPALSGQSEATVLPALESILQEHKGKYVRLIGIDPAARRRVAELLIQKP
>P72758 ~~~ccmM~~~Carboxysome assembly protein CcmM~~~COG0663
MLAKSLGWLLAVSRRNYCMGSRTALASRPWSKHLADPQIDPTAYVHSFANVVGDVRIQPGVSVAPGSSIRADEGTPFWIG
GNVLIQHGVVIHGLETGRVLGDDDQEYSVWIGPGTCVAHLALVHGPVYLGANCFIGFRSTVLNARVGDGAVVMMHSLVQD
VEIPPNKLVPSGAMITQQHQADSLPDVQAGDRHFVQQIAAMHGQSASPTQGTDPTVCVLPESLPAVTPVTETPYINSIDN
MSINSDITNQIRSLLAQGYGIGAEHANERRFKTKSWQSCGTADGFRPDQVIATVEGWLQEFAGEYVRLIGIDQGAKRRVV
EVIIQRPGDVPGSPSRGTTTTKALSSGGSGRSAVAHQTGNLAGDSANQLRALLHQGYKIGLEYASARRFKTGSWLTGGTI
GSHREGEALQELNRFLADHTNEYVRIIGIDPAGKRRVAEIVVHRPNGNGNGKPSSSSSSVGYKSAPVSSAGGSSAGGLTP
EVIATVRGLLANGHSIGTEHTDKRRFKAKSWDTCPTIDGGREAEVLAKLEACLADHAGEYVRIIGIDRVGKRRVLEQIIQ
RPGDNVVAGRSPSSSSASTSSSASSNGFGSGNGGGYSNSAVRLDNSVVTQVRSLLAQGYKIGTEHTDKRRFKAKSWQSCA
PITSTHESEVLRALEGCLADHNGEYVRLLGIDPTAKRRVLETIIQRP
>Q8DKB5 4.2.1.1~~~ccmM~~~Carboxysome assembly protein CcmM~~~COG0663
MAVQSYAAPPTPWSRDLAEPEIAPTAYVHSFSNLIGDVRIKDYVHIAPGTSIRADEGTPFHIGSRTNIQDGVVIHGLQQG
RVIGDDGQEYSVWIGDNVSITHMALIHGPAYIGDGCFIGFRSTVFNARVGAGCVVMMHVLIQDVEIPPGKYVPSGMVITT
QQQADRLPNVEESDIHFAQHVVGINEALLSGYQCAENIACIAPIRNELQRQEDPPTLHVEMLTGEKNTMTTDYGTHVRQL
LQQGYQISLEYADARRYRTSSWQSGPTLTGQQESQVMAAIAQLLKEHEGEYVRLIGVDPKAKRRVFEEIIQRPGQAAVAS
SSSSRPSATVNASPVGSLDAAVVAQVRQLLQQGYQIGTEHADARRYRTSSWTSCAPIQSKQEPEVLAALEACLQEHAGEY
VRLIGIDQKQKRRVLEQIIQRPQGPVAIAPKTPTPVATSHASVSSGGNDTLLSADLVNQIQDLLRQGCQVITEYADQRRF
RTSSWQSGIKITSAQQINDLRSFLAEHQRDYIRLVGVNPQAKQRVLETIIHRPNGKAASNGNSTRGQGFTPRPTASSQGS
PSTHSLSQEVIEQVRQLLQQGYTLGLEHVDARRYRTNSWQSGPRIEAKNLNEALAAIQACLQEYSGEYVRLIGINPAGKQ
RVAEILLQQAAK
>P46204 ~~~ccmN~~~Carboxysome assembly protein CcmN~~~COG0663
MHLPPLEPPISDRYFASGEVTIAADVVIAPGVLLIAEADSRIEIASGVCIGLGSVIHARGGAIIIQAGALLAAGVLIVGQ
SIVGRQACLGASTTLVNTSIEAGGVTAPGSLLSAETPPTTATVSSSEPAGRSPQSSAIAHPTKVYGKEQFLRMRQSMFPD
R
>P72757 ~~~ccmN~~~Carboxysome assembly protein CcmN~~~COG0663
MQLPPVHSVSLSEYFVSGNVIIHETAVIAPGVILEAAPDCQITIEAGVCIGLGSVISAHAGDVKIQEQTAIAPGCLVIGP
VTIGATACLGSRSTVFQQDIDAQVLIPPGSLLMNRVADVQTVGASSPTTDSVTEKKSPSTANPIAPIPSPWDNEPPAKGT
DSPSDQAKESIARQSRPSTAEAAEQISSNRSPGESTPTAPTVVTTAPLVSEEVQEKPPVVGQVYINQLLLTLFPERRYFS
S
>P46205 ~~~ccmO~~~Carboxysome assembly protein CcmO~~~COG4577
MSASLPAYSQPRNAGALGVICTRSFPAVVGTADMMLKSADVTLIGYEKTGSGFCTAIIRGGYADIKLALEAGVATARQFE
QYVSSTILPRPQGNLEAVLPISRRLSQEAMATRSHQNVGAIGLIETNGFPALVGAADAMLKSANVKLICYEKTGSGLCTA
IVQGTVSNVTVAVEAGMYAAERIGQLNAIMVIPRPLDDLMDSLPEPQSDSEAAQPLQLPLRVREKQPLLELPELERQPIA
IEAPRLLAEERQSALELAQETPLAEPLELPNPRDDQ
>Q31QW7 ~~~ccmP~~~Carboxysome shell protein CcmP~~~COG4577
MGVELRSYVYLDNLQRQHASYIGTVATGFLTLPGDASVWIEISPGIEINRMMDIALKAAVVRPGVQFIERLYGLMEVHAS
NQGEVREAGRAVLSALGLTERDRLKPKIVSSQIIRNIDAHQAQLINRQRRGQMLLAGETLYVLEVQPAAYAALAANEAEK
AALINILQVSAIGSFGRLFLGGEERDIIAGSRAAVAALENLSGREHPGDRSRE
>D9IA43 7.1.1.9~~~ccoN1~~~Cbb3-type cytochrome c oxidase subunit CcoN1~~~COG3278
MNTATSTAYSYKVVRQFAIMTVVWGIVGMGLGVFIAAQLAWPFLNFDLPWTSFGRLRPLHTNAVIFAFGGCALFATSYYS
VQRTCQTTLFAPKLAAFTFWGWQLVILLAAISLPLGFTSSKEYAELEWPIDILITIVWVAYAVVFFGTLAKRKVKHIYVG
NWFFGAFILTVAILHVVNNLEIPVTAMKSYSLYAGATDAMVQWWYGHNAVGFFLTAGFLGIMYYFVPKQAERPVYSYRLS
IVHFWALITVYIWAGPHHLHYTALPDWAQSLGMVMSLILLAPSWGGMINGMMTLSGAWHKLRSDPILRFLVVSLAFYGMS
TFEGPMMAIKTVNALSHYTDWTIGHVHAGALGWVAMVSIGALYHLVPKVFGREQMHSIGLINTHFWLATIGTVLYIASMW
VNGIAQGLMWRAINDDGTLTYSFVESLEASHPGFVVRMIGGAIFFAGMLVMAYNTWRTVQAAKPAEYDAAAQIA
>D9IA45 ~~~ccoP1~~~Cbb3-type cytochrome c oxidase subunit CcoP1~~~
MSTFWSGYIALLTLGTIVALFWLIFATRKGESAGTTDQTMGHAFDGIEEYDNPLPRWWFLLFIGTLVFGILYLVLYPGLG
NWKGVLPGYEGGWTQEKQWEREVAQADEKYGPIFAKYAAMSVEEVAQDPQAVKMGARLFANYCSICHGSDAKGSLGFPNL
ADQDWRWGGDAASIKTSILNGRIAAMPAWGQAIGEEGVKNVAAFVRKDLAGLPLPEGTDADLSAGKNVYAQTCAVCHGQG
GEGMAALGAPKLNSAAGWIYGSSLGQLQQTIRHGRNGQMPAQQQYLGDDKVHLLAAYVYSLSQKPEQLANQ
>Q8KS19 ~~~ccoP2~~~Cbb3-type cytochrome c oxidase subunit CcoP2~~~COG2010
MTSFWSWYVTLLSLGTIAALVWLLLATRKGQRPDSTEETVGHSYDGIEEYDNPLPRWWFMLFVGTVIFALGYLVLYPGLG
NWKGILPGYEGGWTQVKEWQREMDKANEQYGPLYAKYAAMPVEEVAKDPQALKMGGRLFASNCSVCHGSDAKGAYGFPNL
TDDDWLWGGEPETIKTTILHGRQAVMPGWKDVIGEEGIRNVAGYVRSLSGRDTPEGISVDIEQGQKIFAANCVVCHGPEA
KGVTAMGAPNLTDNVWLYGSSFAQIQQTLRYGRNGRMPAQEAILGNDKVHLLAAYVYSLSQQPEQ
>Q3J015 ~~~ccoP~~~Cbb3-type cytochrome c oxidase subunit CcoP~~~COG2010
MSVKPTKQKPGEPPTTGHSWDGIEEFDNPMPRWWLWTFYVTIVWAIGYSILYPAWPLINGATNGLIGHSTRADVQRDIEA
FAEANATIRQQLVNTDLTAIAADPNLLQYATNAGAAVFRTNCVQCHGSGAAGNVGYPNLLDDDWLWGGDIESIHTTVTHG
IRNTTDDEARYSEMPRFGADGLLDSTQISQVVEYVLQISGQDHDAALSAEGATIFADNCAACHGEDGTGSRDVGAPNLTD
AIWLYGGDRATVTETVTYARFGVMPNWNARLTEADIRSVAVYVHGLGGGE
>A1B348 ~~~ccoP~~~Cbb3-type cytochrome c oxidase subunit CcoP~~~COG2010
MADTDDEHASPQNPDNRIELERQAADEAHKAKILAHPPEGPGGDPLHPPVTPRPGATRVVRDRKGGRRVVEVPSTGHSWD
GIEEYDNPLPRWWLWTFYATIVWGVLYLIAYPAIPLVNGATQGLLGQNYRSDVAAEIQRFNEANAPIQAKLVETPLEEIA
ADPELANYTANAGAAIFRTWCAQCHGSGAGGATGYPSLLDNDWLWGGTLEEIHTTVMHGIRDPKDADTRYSEMPRFGIDG
LLENAQISQVVNHVLELGGLPHDAALAAEGVEVFADNCSSCHAEDGTGDRAQGAPDLTDAVWLYGSDPATITRIVRDGPF
GVMPAWTGRLSEADIVAVAAYVHSLGGGE
>Q52689 ~~~ccoP~~~Cbb3-type cytochrome c oxidase subunit CcoP~~~
MSKKPTTKKEVQTTGHQWDGIEELNTPLPRWWLWTFYATIIWGVAYSIAMPAWPIFSDKATPGLLGSSTRADVEKDIAKF
AEMNKAVEEKLVATDLTAIAADPELVTYTRNAGAAVFRTWCAQCHGAGAGGNTGFPSLLDGDWLHGGAIETIYTNVKHGI
RDPLDPDTLLVANMPAHLTDELLEPAQIDEVVQYVLQISGQPADEVKATAGQQIFAENCASCHGEDAKGLVEMGAPNLTD
GIWLYGGDVATLTSTIQYGRGGVMPSWSWAADGAKPRLSEAQIRAVASYVHSLGGGQ
>D5ARP7 ~~~ccoP~~~Cbb3-type cytochrome c oxidase subunit CcoP~~~COG2010
MSKKPTTKKEVQTTGHSWDGIEELNTPLPRWWLWTFYATIVWGVAYSIAMPAWPIFASGATPGILGSSTRADVEKDIAKF
AEMNKAVEDKLVATDLTAIAADPELVTYTRNAGAAVFRTWCAQCHGAGAGGNTGFPSLLDGDWLHGGSIETIYTNIKHGI
RDPLDPDTLPVANMPAHLTDELLEPAQIDDVVQYVLKISGQPADEARATAGQQVFADNCVSCHGEDAKGMVEMGAPNLTD
GIWLYGGDANTITTTIQLGRGGVMPSWSWAADGAKPRLSEAQIRAVASYVHSLGGGQ
>Q5GCA5 ~~~ccoP~~~Cbb3-type cytochrome c oxidase subunit CcoP~~~
MSDFFNSGWSLYVAGITVVSLIFCLVVLIVASRRKVMADDNTTGHVWDEDLQELNNPLPRWWAGLFLVTIAFAVIYLALY
PGLGSNKGTLDWTSTGQHSAEMEKARAQMAPLYAKFVSQPAEALAKDPQAMAIGERLFANNCAQCHGADARGSKGFPNLT
DNDWLHGGTHDKIKETITGGRVGNMPPMAAAVGTPEDVKNVAQYVLSLSGAPHNEVAAQLGKAKFAVCAACHGPDGKGMQ
AVGSANLTDKIWLHGLRRTGHHRLINNGKTNIMPAQASRLSPEQIHVLGAYVWSLSQTSTVAAR
>P9WPR5 1.13.11.-~~~~~~Carotenoid cleavage oxygenase~~~COG3670
MTTAQAAESQNPYLEGFLAPVSTEVTATDLPVTGRIPEHLDGRYLRNGPNPVAEVDPATYHWFTGDAMVHGVALRDGKAR
WYRNRWVRTPAVCAALGEPISARPHPRTGIIEGGPNTNVLTHAGRTLALVEAGVVNYELTDELDTVGPCDFDGTLHGGYT
AHPQRDPHTGELHAVSYSFARGHRVQYSVIGTDGHARRTVDIEVAGSPMMHSFSLTDNYVVIYDLPVTFDPMQVVPASVP
RWLQRPARLVIQSVLGRVRIPDPIAALGNRMQGHSDRLPYAWNPSYPARVGVMPREGGNEDVRWFDIEPCYVYHPLNAYS
ECRNGAEVLVLDVVRYSRMFDRDRRGPGGDSRPSLDRWTINLATGAVTAECRDDRAQEFPRINETLVGGPHRFAYTVGIE
GGFLVGAGAALSTPLYKQDCVTGSSTVASLDPDLLIGEMVFVPNPSARAEDDGILMGYGWHRGRDEGQLLLLDAQTLESI
ATVHLPQRVPMGFHGNWAPTT
>P25144 ~~~ccpA~~~Catabolite control protein A~~~COG1609
MSNITIYDVAREANVSMATVSRVVNGNPNVKPTTRKKVLEAIERLGYRPNAVARGLASKKTTTVGVIIPDISSIFYSELA
RGIEDIATMYKYNIILSNSDQNMEKELHLLNTMLGKQVDGIVFMGGNITDEHVAEFKRSPVPIVLAASVEEQEETPSVAI
DYEQAIYDAVKLLVDKGHTDIAFVSGPMAEPINRSKKLQGYKRALEEANLPFNEQFVAEGDYTYDSGLEALQHLMSLDKK
PTAILSATDEMALGIIHAAQDQGLSIPEDLDIIGFDNTRLSLMVRPQLSTVVQPTYDIGAVAMRLLTKLMNKEPVEEHIV
ELPHRIELRKSTKS
>P46828 ~~~ccpA~~~Catabolite control protein A~~~
MNVTIYDVAREASVSMATVSRVVNGNPNVKPSTRKKVLETIERLGYRPNAVARGLASKKTTTVGVIIPDISNIFYAELAR
GIEDIATMYKYNIILSNSDQNQDKELHLLNNMLGKQVDGIIFMSGNVTEEHVEELKKSPVPVVLAASIESTNQIPSVTID
YEQAAFDAVQSLIDSGHKNIAFVSGTLEEPINHAKKVKGYKRALTESGLPVRDSYIVEGDYTYDSGIEAVEKLLEEDEKP
TAIFVGTDEMALGVIHGAQDRGLNVPNDLEIIGFDNTRLSTMVRPQLTSVVQPMYDIGAVAMRLLTKYMNKETVDSSIVQ
LPHRIEFRQSTK
>P99175 ~~~ccpA~~~Catabolite control protein A~~~
MTVTIYDVAREARVSMATVSRVVNGNQNVKAETKNKVNEVIKRLNYRPNAVARGLASKKTTTVGVIIPDISNIYYSQLAR
GLEDIATMYKYHSIISNSDNDPEKEKEIFNNLLSKQVDGIIFLGGTITEEMKELINQSSVPVVVSGTNGKDAHIASVNID
FTEAAKEITGELIEKGAKSFALVGGEHSKKAQEDVLEGLTEVLNKNGLQLGDTLNCSGAESYKEGVKAFAKMKGNLPDAI
LCISDEEAIGIMHSAMDAGIKVPEELQIISFNNTRLVEMVRPQLSSVIQPLYDIGAVGMRLLTKYMNDEKIEEPNVVLPH
RIEYRGTTK
>P37517 ~~~ccpB~~~Catabolite control protein B~~~COG1609
MANIKEIARLANVSVSTVSRVLNHHPYVSEEKRKLVHQVMKELDYTPNRTAIDLIRGKTHTVGVILPYSDHPCFDKIVNG
ITKAAFQHEYATTLLPTNYNPDIEIKYLELLRTKKIDGLIITSRANHWDSILAYQEYGPVIACEDTGDIDVPCAFNDRKT
AYAESFRYLKSRGHENIAFTCVREADRSPSTADKAAAYKAVCGRLEDRHMLSGCNDMNDGELAAEHFYMSGRVPTAIYAN
SDEVAAGIHLFAKKNNWDVEIIGEGNTSISRVLGFPSLDLNLEQLGIAAFSLFLQDEPADIKIQHKFKKKA
>O34994 ~~~ccpN~~~Transcriptional repressor CcpN~~~COG0517
MSTIELNKRQEHILQIVKENGPITGEHIAEKLNLTRATLRPDLAILTMSGFLEARPRVGYFYTGKTGTQLLADKLKKLQV
KDFQSIPVVIHENVSVYDAICTMFLEDVGTLFVVDRDAVLVGVLSRKDLLRASIGQQELTSVPVHIIMTRMPNITVCRRE
DYVMDIAKHLIEKQIDALPVIKDTDKGFEVIGRVTKTNMTKILVSLSENEIL
>P55929 1.11.1.5~~~ccp~~~Cytochrome c551 peroxidase~~~COG1858
MIKRTLTVSLLSLSLGAMFASAGVMAANEPIQPIKAVTPENADMAELGKMLFFDPRLSKSGFISCNSCHNLSMGGTDNIT
TSIGHKWQQGPINAPTVLNSSMNLAQFWDGRAKDLKEQAAGPIANPKEMASTHEIAEKVVASMPQYRERFKKVFGSDEVT
IDRITTAIAQFEETLVTPGSKFDKWLEGDKNALNQDELEGYNLFKGSGCVQCHNGPAVGGSSYQKMGVFKPYETKNPAAG
RMDVTGNEADRNVFKVPTLRNIELTYPYFHDGGAATLEQAVETMGRIQLNREFNKDEVSKIVAFLKTLTGDQPDFKLPIL
PPSNNDTPRSQPYE
>P14532 1.11.1.5~~~ccpA~~~Cytochrome c551 peroxidase~~~
MQSSQLLPLGSLLLSFATPLAQADALHDQASALFKPIPEQVTELRGQPISEQQRELGKKLFFDPRLSRSHVLSCNTCHNV
GTGGADNVPTSVGHGWQKGPRNSPTVFNAVFNAAQFWDGRAKDLGEQAKGPIQNSVEMHSTPQLVEQTLGSIPEYVDAFR
KAFPKAGKPVSFDNMALAIEAYEATLVTPDSPFDLYLKGDDKALDAQQKKGLKAFMDSGCSACHNGINLGGQAYFPFGLV
KKPDASVLPSGDKGRFAVTKTQSDEYVFRAAPLRNVALTAPYFHSGQVWELKDAVAIMGNAQLGKQLAPDDVENIVAFLH
SLSGKQPRVEYPLLPASTETTPRPAE
>Q2YMK2 2.1.1.72~~~ccrM~~~DNA methyltransferase CcrM~~~
MSLVRLAHELPIEAPRTAWLDSIIKGDCVSALERLPDHSVDVIFADPPYNLQLGGDLHRPDQSMVSAVDDHWDQFESFQA
YDAFTRAWLLACRRVLKPNGTIWVIGSYHNIFRVGTQLQDLGFWLLNDIVWRKTNPMPNFRGRRFQNAHETLIWASREQK
GKGYTFNYEAMKAANDDVQMRSDWLFPICTGSERLKDENGDKVHPTQKPEALLARIMMASSKPGDVILDPFFGSGTTGAV
AKRLGRHFVGIEREQPYIDAATARINAVEPLGKAELTVMTGKRAEPRVAFTSVMEAGLLRPGTVLCDERRRFAAIVRADG
TLTANGEAGSIHRIGARVQGFDACNGWTFWHFEENGVLKPIDALRKIIREQMAAAGA
>P0CAW2 2.1.1.72~~~ccrMIM~~~DNA methyltransferase CcrM~~~COG2189
MKFGPETIIHGDCIEQMNALPEKSVDLIFADPPYNLQLGGDLLRPDNSKVDAVDDHWDQFESFAAYDKFTREWLKAARRV
LKDDGAIWVIGSYHNIFRVGVAVQDLGFWILNDIVWRKSNPMPNFKGTRFANAHETLIWASKSQNAKRYTFNYDALKMAN
DEVQMRSDWTIPLCTGEERIKGADGQKAHPTQKPEALLYRVILSTTKPGDVILDPFFGVGTTGAAAKRLGRKFIGIEREA
EYLEHAKARIAKVVPIAPEDLDVMGSKRAEPRVPFGTIVEAGLLSPGDTLYCSKGTHVAKVRPDGSITVGDLSGSIHKIG
ALVQSAPACNGWTYWHFKTDAGLAPIDVLRAQVRAGMN
>B8GZ33 2.1.1.72~~~ccrMIM~~~DNA methyltransferase CcrM~~~
MKFGPETIIHGDCIEQMNALPEKSVDLIFADPPYNLQLGGDLLRPDNSKVDAVDDHWDQFESFAAYDKFTREWLKAARRV
LKDDGAIWVIGSYHNIFRVGVAVQDLGFWILNDIVWRKSNPMPNFKGTRFANAHETLIWASKSQNAKRYTFNYDALKMAN
DEVQMRSDWTIPLCTGEERIKGADGQKAHPTQKPEALLYRVILSTTKPGDVILDPFFGVGTTGAAAKRLGRKFIGIEREA
EYLEHAKARIAKVVPIAPEDLDVMGSKRAEPRVPFGTIVEAGLLSPGDTLYCSKGTHVAKVRPDGSITVGDLSGSIHKIG
ALVQSAPACNGWTYWHFKTDAGLAPIDVLRAQVRAGMN
>C0SPC1 2.7.1.15~~~ccrZ~~~Cell cycle regulator CcrZ~~~COG0510
MNIDMNWLGQLLGSDWEIFPAGGATGDAYYAKHNGQQLFLKRNSSPFLAVLSAEGIVPKLVWTKRMENGDVITAQHWMTG
RELKPKDMSGRPVAELLRKIHTSKALLDMLKRLGKEPLNPGALLSQLKQAVFAVQQSSPLIQEGIKYLEEHLHEVHFGEK
VVCHCDVNHNNWLLSEDNQLYLIDWDGAMIADPAMDLGPLLYHYVEKPAWESWLSMYGIELTESLRLRMAWYVLSETITF
IAWHKAKGNDKEFHDAMEELHILMKRIVD
>A0A0H2ZQL5 2.7.1.15~~~ccrZ~~~Cell cycle regulator CcrZ~~~COG0510
MDLGDNELTLTPIPGKSGKAYMGSYPDGKRIFVKMNTSPILPGLAREQIAPQLLWSRRLADGRDMCAQEWLTGKILTPYD
MNRKQIVNILTRLHRSRPLMTQLSRLGYAMETPVDLLQSWQETAPDALRKNHFISEVMADLRQTIPGFREDHATIVHGDV
RHSNWIETDSGLIYLVDWDSVRLTDRMFDVAHMLCHYISEHQWKEWLTYYGYKYNQTVLSKLYWYGQLSYLSQISKYYMN
QDLENVNREIHGLRHFRDKYGKRR
>Q82LU9 1.3.1.86~~~ccrA2~~~Crotonyl-CoA reductase~~~COG0604
MKEILDAIQSQTATSADFAALPLPDSYRAITVHKDETEMFAGLSTRDKDPRKSIHLDDVPVPELGPGEALVAVMASSVNY
NSVWTSIFEPVSTFNFLERYGRLSDLSKRHDLPYHIIGSDLAGVVLRTGPGVNSWKPGDEVVAHCLSVELESSDGHNDTM
LDPEQRIWGFETNFGGLAEIALVKSNQLMPKPDHLSWEEAAAPGLVNSTAYRQLVSRNGAGMKQGDNVLIWGASGGLGSY
ATQFALAGGANPICVVSSEQKADICRSMGAEAIIDRNAEGYKFWKDETTQDPKEWKRFGKRIREFTGGEDIDIVFEHPGR
ETFGASVYVTRKGGTITTCASTSGYMHEYDNRYLWMSLKRIIGSHFANYREAWEANRLVAKGKIHPTLSKVYSLEDTGQA
AYDVHRNLHQGKVGVLALAPREGLGVRDEEKRAQHIDAINRFRNI
>Q53865 1.3.1.86~~~ccr~~~Crotonyl-CoA reductase~~~
MTVKDILDAIQSKDATSADFAALQLPESYRAITVHKDETEMFAGLETRDKDPRKSIHLDEVPVPELGPGEALVAVMASSV
NYNSVWTSIFEPVSTFAFLERYGKLSPLTKRHDLPYHIIGSDLAGVVLRTGPGVNAWQPGDEVVAHCLSVELESPDGHDD
TMLDPEQRIWGFETNFGGLAEIALVKTNQLMPKPKHLTWEEAAAPGLVNSTAYRQLVSRNGAAMKQGDNVLIWGASGGLG
SYATQFALAGGANPICVVSSPQKAEICRSMGAEAIIDRNAEGYKFWKDEHTQDPKEWKRFGKRIRELTGGEDIDIVFEHP
GRETFGASVYVTRKGGTITTCASTSGYMHEYDNRYLWMSLKRIIGSHFANYREAYEANRLIAKGKIHPTLSKTYSLEETG
QAAYDVHRNLHQGKVGVLCLAPEEGLGVRDAEMRAQHIDAINRFRNV
>P73912 ~~~ccsB~~~Cytochrome c biogenesis protein CcsB~~~COG1333
MTIANPSPSNFFQQLGRQCLKTLADLRLAIALLLLIAVFSISGTVIEQGESLSFYQQNYPEDPALFGFLSWQVILQLGLN
QVYRTWWFLGLLILFGSSLTACTFNRQFPALKAARSWQFYHQPRQFKKLALSFSLPDGDINKIESLLRDRGYKIFQEGDS
VYARKGLMGKVGPIIVHGAMLIILGGAIWGALTGFFAQHMIPSGETFQVSNIIEKGPLADSQIPKDWGIKVNRFWINYTE
NGAIDQFYSDLSVVNNQGEELDRQTISVNHPLRHRGVTFYQTNWGIAGVKVQLNNSPVLQLPMAPLQTANGGQLWGAYIP
TKTDFSEGAALLVKDLQGTMIIYDQEGNLTDAVRAGSTVEINGVNITIKELVGSTGLQIKADPGIPFVYLGFGLLMVGVM
MSYVSHSQVWLLSVDGDGQREIYLGGRTNRAQVAFEREILAIAEEAEVSSKTEAKVNA
>P72978 ~~~ccsA~~~Cytochrome c biogenesis protein CcsA~~~COG0755
MNLVSLESFLDNTAFLVLLLTMFAYWVAVVFPKPWLVQGASGAMAIANLTITALLGARWLEAGYFPISNLYESLFFLAWG
ITAVHFIAERMSQSRFVGAVTSPIALGIVAFAALTLPVDMQQSAPLVPALKSNWLMMHVSVMMVSYATLMVGSLLAIAFL
FVTRGQAVELRGSSVGTGGFRQGLVKGNNLNPVGNLNPALEGVSGNSGNVAVLEKTTSTPAITLSPQRLTLADTLDNISY
RIIGLGFPLLTIGIIAGAVWANEAWGSYWSWDPKETWALITWLVFAAYLHARITKGWQGRKPAILAASGFTVVWICYLGV
NLLGKGLHSYGWFL
>A0A2I9 4.1.1.-~~~undec1A~~~Cysteine/Cysteine sulfinic acid decarboxylase~~~
MITPLTLATLSKNPILVDFFDPEDGRWNSHVDLGLWSDLYLIAPATANTIGKMAAGIADNLLLTSYLSARCPVFIAPAMD
VDMLMHPATQRNLGILKSSGNHIIEPGSGELASGLTGKGRMAEPEEIVREVISFFSKKKITEKPLNGRRVFINAGPTIEP
IDPVRFISNYSSGRMGIALADAAAAMGAEVTLVLGPVTLRPSSQDINVIDVRSAAEMKEASVEAFRECDIAILAAAVADF
TPLTTSDKKIKRGSGEMVINLRPTEDIAAELGKMKKKNQLLVGFALETDDEITNASSKLKRKNLDMIVLNSLKDPGAGFG
HETNRITIIDKSNNIDKFELKTKGEVAADIIRKILTLVH
>Q45589 2.7.7.85~~~cdaA~~~Cyclic di-AMP synthase CdaA~~~COG1624
MAFEDIPFLQYLGNAVDILLVWYVIYKLIMVIRGTKAVQLLKGIVVIVLVRMASQYLGLSTLQWLMDQAITWGFLAIIII
FQPELRRALEQLGRGRFFSRSGTPVEEAQQKTIEAITKAINYMAKRRIGALLTIERDTGMGDYIETGIPLNAKVSSELLI
NIFIPNTPLHDGAVIMKNNEIAAAACYLPLSESPFISKELGTRHRAAVGISEVTDSLTIIVSEETGGVSVAKNGDLHREL
TEEALKEMLEAEFKKNTRDTSSNRWYWRGKKNG
>O34659 ~~~cdaR~~~CdaA regulatory protein CdaR~~~COG4856
MDKFLNNRWAVKIIALLFALLLYVAVNSNQAPTPKKPGESFFPTSTTDEATLTDIPVKAYYDDENYVVTGVPQTVNVTIK
GSTSAVKKARQTKNFEIYADMEHLKTGTHKVELKAKNVSDGLTISINPSVTTVTIQERTTKSFPVEVEYYNKSKMKKGYS
PEQPIVSPKNVQITGSKNVIDNISLVKASVNLENADETIEKEAKVTVYDKDGNALPVDVEPSVIKITVPVTSPSKKVPFK
IERTGSLPDGVSIANIESSPSEVTVYGSQDVLDSLEFIDGVSLDLSKINKDSDIEADIPLPDGVKKISPSKVTLHIEVDS
EADQKFENVPIKTVGLSSSQNIEFLDPESQAIDVTAKGSPTNINKLKKSDIELYVNVSDLEDGEHSVKLEVNGPQNVTWS
LGRKNAKIKLTSKKSNTSTNDNSSNTSGNQDTDKQTNDQKNNQQEDTKNTDKNNNDQNQDGNKDQNQDQDEDESTANSQS
SSE
>P37047 ~~~cdaR~~~Carbohydrate diacid regulator~~~COG3835
MAGWHLDTKMAQDIVARTMRIIDTNINVMDARGRIIGSGDRERIGELHEGALLVLSQGRVVDIDDAVARHLHGVRQGINL
PLRLEGEIVGVIGLTGEPENLRKYGELVCMTAEMMLEQSRLMHLLAQDSRLREELVMNLIQAEENTPALTEWAQRLGIDL
NQPRVVAIVEVDSGQLGVDSAMAELQQLQNALTTPERNNLVAIVSLTEMVVLKPALNSFGRWDAEDHRKRVEQLITRMKE
YGQLRFRVSLGNYFTGPGSIARSYRTAKTTMVVGKQRMPESRCYFYQDLMLPVLLDSLRGDWQANELARPLARLKTMDNN
GLLRRTLAAWFRHNVQPLATSKALFIHRNTLEYRLNRISELTGLDLGNFDDRLLLYVALQLDEER
>Q59226 3.2.1.135~~~CDI5~~~Cyclomaltodextrinase~~~
MFLEAVYHRPRKNWSYAYNGTTVHLRIRTKKDDMTAVYALAGDKYMWDHTMEYVPMTKLATDELFDYWECEVTPPYRRVK
YGFLLQQGHEKRWMTEYDFLTEPPRNPDRLFEYPFINPVDVFQPPAWVKDAIFYQIFPERFANGDTRNDPEGTLPWGSAD
PTPSCFFGGDLQGVIDHLDHLSKLGVNAVYFTPLFKATTNHKYDTEDYFQIDPQFGDKDTLKKLVDLCHERGIRVLLDAV
FNHSGRTFPPFVDVLKNGEKSKYKDWFHIRSLPLEVVDGIPTYDTFAFEPLMPKLNTEHPDVKEYLLKAAEYWIRETGID
GWRLDVANEVSHQFWREFRRVVKQANPDAYILGEVWHESSIWLEGDQFDAVMNYPFTNAVLDFFIHQIADAEKFSFMLGK
QLAGYPRQASEVMFNLLDSHDTARLLTQADGDKRKMKLAVLFQFTYFGTPCIYYGDEVGLDGGHDPGCRKCMEWDETKHD
KDLFAFYQTVIRLRQAHAALRTGTFKFLTAEKNSRQIAYLREDDQDTILVVMNNDKAGHTLRCLSGMHSGPICGTTMS
>O31854 2.7.7.85~~~cdaS~~~Cyclic di-AMP synthase CdaS~~~COG1624
MKAMRYEQISENAFKGKIQVYLEQILGDASLILKTLHEKDQCLLCELDDLGHVFQDMQGIASSFYLQSYIEEFTPAFIEL
AKAIKALSEHKHGALIVIERADPVERFIQKGTSLHAEISSSLIESIFFPGNPLHDGALLVRENKLVSAANVLPLTTKEVD
IHLGTRHRAALGMSGYTDALVLVVSEETGKMSFAKDGVLYPLISPRT
>D5TM67 2.7.7.85~~~cdaS~~~Diadenylate cyclase CdaS~~~
MHEWGLSEELKIQTKQMIEIAEKELSIMRNAIDKEDECILCKMEDIHHMLANVQTLAATYYIQAYLSPYTESSSFITTAI
QHLSARKHGALIVVERNETLEALIQTGTTLNAHLTAPLLESIFYPGNPLHDGAVLVKNNHIVSAANILPLTKSTEVDPEL
GTRHRAAIGLSEKSDALILVVSEETGRTSFALNGILYTISL
>A0A7U9P668 3.2.1.54~~~~~~Cyclomaltodextrinase~~~
MRKEAIHHRSTDNFAYAYDSETLHLRLQTKKNDVDHVELLFGDPYEWHDGAWQFQTMPMRKTGSDGLFDYWLAEVKPPYR
RLRYGFVLRAGGEKLVYTEKGFYHEAPSDDTAYYFCFPFLHRVDLFQAPDWVKDTVWYQIFPERFANGNPAISPKGARPW
GSEDPTPTSFFGGDLQGIIDHLDYLADLGITGIYLTPIFRAPSNHKYDTADYFEIDPHFGDKETLKTLVKRCHEKGIRVM
LDAVFNHCGYEFGPFQDVLKNGAASRYKDWFHIREFPLQTEPRPNYDTFAFVPQMPKLNTAHPEVKRYLLDVATYWIREF
DIDGWRLDVANEIDHQFWREFRQAVKALKPDVYILGEIWHDAMPWLRGDQFDAVMNYPLADAALRFFAKEDMSASEFADR
LMHVLHSYPKQVNEAAFNLLGSHDTPRLLTVCGGDVRKVKLLFLFQLTFTGSPCIYYGDEIGMTGGNDPECRKCMVWDPE
KQNKELYEHVKQLIALRKQYRALRRGDVAFLAADDEVNHLVYAKTDGNETVMIIINRSNEAAEIPMPIDARGKWLVNLLT
GERFAAEAETLCVSLPPYGFVLYAVESW
>Q08341 3.2.1.54~~~~~~Cyclomaltodextrinase~~~
MIMLEAVYHRMGQNWSYAYNDSTLHIRIRTKRDNVPRIDLHCGEKYDPEKYKETIPMERMASDGLFDYWQAAVQPRYRRL
VYYFALHSDNGDAVYFMEKGFFDQPPKVMYEGLFDFPYLNRQDVHTPPAWVKEAIFYQIFPERFANGDPSNDPEGVQEWG
GTPSAGNFFGGDLQGVIDHLDYLSDLGVNALYFNPLFAATTNHKYDTADYMKIDPQFGTNEKLKELVDACHARGMRVLLD
AVFNHCGHTFPPFVDVLNNGLNSRYADWFHVREWPLRVVDGIPTYDTFAFEPIMPKLNTGNEEVKAYLLNVGRYWLEEMG
LDGWRLDVANEVDHQFWREFRSEIKRINPSAYILGEIMHDSMPWLQGDQFDAVMNYPFTNILLNFFARRLTNAAEFAQAI
GTQLAGYPQQVTEVSFNLLGSHDTTRLLTLCSGNVERMKLATLFQLTYQGTPCIYYGDEIGMDGEYDPLNRKCMEWDKSK
QNTELLAFFRSMISLRKAHPALRGSGLRFLPVLEHPQLLVYERWDDNERFLIMLNNEDAPVNVVIPAAQPGASWRTVNGE
PCAVVEESSIQAALPPYGYAILHAPIAGTAE
>P29964 3.2.1.54~~~~~~Cyclomaltodextrinase~~~COG0366
MIKEAIFHKSDVPYAYPLNENQLKIVLRTAVFDVDRVYVLYKDRYDWLGKFKIKPMVLTHTNELFDYYETTLELNKKFVY
FFYLVSDGGEKLYYTEAGFYKKRPENHFWGFFHYPYIGEKDVFFAPEWTSDCMVYQIFPERFNNGDKSNDPENVKPWGEK
PTADSFFGGDLQGIIDKIDYLKDLGINAIYLTPIFLSHSTHKYDTTDYYTIDPHFGDTQKARELVQKCHDNGIKVIFDAV
FNHCGYDFFAFQDVIKNGKKSKYWDWFNIYEWPIKTHPKPSYEAFADTVWRMPKLMTKNPEVQKYLLEVAEYWIKEVDID
GWRLDVANEIDHHFWRKFREVVKAAKPEAIIVGEVWHDASPWLRGDQFDSVMNYPFRNAVVDFFAKRKISASRFNTMITE
QLMRHMDSVNRVMFNLIGSHDTERFLTLANGMVARMKLALVFQFTFVGIPYIYYGDEVGMVGDYDPDCRRCMIWEEEKQN
KSIFNFYKKLISIRRENEELKYGSFCTLYAIGRVFAFKREYKGKSIIVVLNNSSKQEVIFLNEVEGKEDILKMKELKRSG
NLLYLQPNSAYILK
>Q47910 7.2.2.10~~~cda~~~Calcium-transporting ATPase~~~
MNFKSTVITAMCCFFSFAVLASEKLEKPKLVVGLVVDQMRWDYLYRYYDRYSENGFKRLLNEGFSSENTLIDYVPTYTAI
GHSTIYTGSVPAINGIAGNDFIIQATGQNMYCTQDDSVQAVGGEGKVGQQSPKNLLVSTITDQLKLATNFQSKVIGIAIK
DRGGILPAGHFANAAYWLDGKTGDWITSTYYMKDLPKWVKGFNKEKVVDQYYKQGWKTLYPIDTYVLSTADDNLYEETFK
GEKTPTFPRDLVKLKKENGYELIKSTPQGNTLTLDFAKRAIENEQLGNNPLQVTDFLAVSLSSTDYIGHQFAINSIEIED
TYLRLDRDIADFLAYLDQNIGKGNYTLFLSADHGAAHNPKFFADQKGNSGYFDTKAIRKDLNEKLASKFGVADLVKSLAN
YQVHLNYEVIEANDVEEDEVIAAAIKLLKKVDGVAFVVDMNEAAESSVPQILRERIINGYNFKRSGAIQLILEPQWFSGS
KDGKGTTHGSWNSYDAHIPAVFLGWGVKPGKTTRQTHMTDIAPTIAQILKIEFPNGNIGTPIQEAIEQ
>I6X7F9 ~~~~~~Transcriptional regulator Rv3488~~~COG1695
MREFQRAAVRLHILHHAADNEVHGAWLTQELSRHGYRVSPGTLYPTLHRLEADGLLVSEQRVVDGRARRVYRATPAGRAA
LTEDRRALEELAREVLGGQSHTAGNGT
>P19079 3.5.4.5~~~cdd~~~Cytidine deaminase~~~COG0295
MNRQELITEALKARDMAYAPYSKFQVGAALLTKDGKVYRGCNIENAAYSMCNCAERTALFKAVSEGDTEFQMLAVAADTP
GPVSPCGACRQVISELCTKDVIVVLTNLQGQIKEMTVEELLPGAFSSEDLHDERKL
>P0ABF6 3.5.4.5~~~cdd~~~Cytidine deaminase~~~COG0295
MHPRFQTAFAQLADNLQSALEPILADKYFPALLTGEQVSSLKSATGLDEDALAFALLPLAAACARTPLSNFNVGAIARGV
SGTWYFGANMEFIGATMQQTVHAEQSAISHAWLSGEKALAAITVNYTPCGHCRQFMNELNSGLDLRIHLPGREAHALRDY
LPDAFGPKDLEIKTLLMDEQDHGYALTGDALSQAAIAAANRSHMPYSKSPSGVALECKDGRIFSGSYAENAAFNPTLPPL
QGALILLNLKGYDYPDIQRAVLAEKADAPLIQWDATSATLKALGCHSIDRVLLA
>A6TBN1 3.5.4.5~~~cdd~~~Cytidine deaminase~~~
MHSRFQAALTTLAADLQAAIAPMLADPHFPALLEADQVATLQHATGLDEDALAFALLPLAAACARPDLSHFNVGAIARGV
SGRWYFGGNMEFLGATMQQTVHAEQSAISHAWLRGETSLRAITVNYTPCGHCRQFMNELNSGLALRIHLPGREAHALEHY
LPDAFGPKDLEIKTLLMDEQDHGFPVSGDALTQAAIQAANRCHAPYSHSPSGVALELKDGTIFSGSYAENAAFNPTLPPL
QGALNLLSLNGYDYPAIQRAILAEKADAALIQWDATVATLKALGCHNIERVLLG
>P9WPH3 3.5.4.5~~~cdd~~~Cytidine deaminase~~~COG0295
MPDVDWNMLRGNATQAAAGAYVPYSRFAVGAAALVDDGRVVTGCNVENVSYGLTLCAECAVVCALHSTGGGRLLALACVD
GHGSVLMPCGRCRQVLLEHGGSELLIDHPVRPRRLGDLLPDAFGLDDLPRERR
>Q9KSM5 3.5.4.5~~~cdd~~~Cytidine deaminase~~~COG0295
MRNRIEQALQQMPASFAPYLRELVLAKDFDATFSAEQYQQLLTLSGLEDADLRVALLPIAAAYSYAPISEFYVGAIVRGI
SGRLYLGANMEFTGAQLGQTVHAEQCAISHAWMKGEKGVADITINFSPCGHCRQFMNELTTASSLKIQLPKRAAKTLQEY
LPESFGPADLGIDSGLMSPVNHGKTSDDDEELIQQALRAMNISHSPYTQNFSGVALKMRSGAIYLGAYAENAAFNPSLPP
LQVALAQAMMMGESFEDIEAAALVESATGKISHLADTQATLEVINPDIPLSYLSL
>Q9HVI1 ~~~~~~Cyclic diguanosine monophosphate-binding protein PA4608~~~
MSDQHDERRRFHRIAFDADSEILQGERRWEVLLHDVSLHGILVGQPQDWNGDPQRPFEARLYLGLDVLIRMEISLAWARD
GLLGFECQHIDLDSISHLRRLVELNLGDEELLERELALLVSAHDD
>Q73R77 ~~~~~~Cyclic di-GMP binding protein TDE_0214~~~
MAFAASQQLNRYYNLYKNIDVTFSKEVVSTLNFEPKQVFVRCSGGQWPCIINSASMTKAKIICGKKSGFLARLRSGITSV
NIRFAFFDTEGKDSLSFFVAAKLVGISSYEAGNQDLVLITFEYTQRAPDDLIEKLGILLEANINSQKRRNERVVITPEIS
RKIGLVEKGTVVYIDAVPRRCLIRDLSFSGAKILLVGIANFLINKEVILRFAFDDPQSVFGIKGKTVRTEPVEGRKDLVA
LAVQYYPKNIPMMYKMYLNKYFSVVRKPASDGFGDDFLEDVAPASSFTPVSSPIGTNTAPLTPPPADSAPEQIS
>P76236 2.7.7.65~~~cdgI~~~Probable diguanylate cyclase CdgI~~~COG2199
MIQSTRISMGLFFKYFLSLTKIDPGQNYISLPSIKSSTHIALLFMVSMGTQKLKAQSFFIFSLLLTLILFCITTLYNENT
NVKLIPQMNYLMVVVALFFLNAVIFLFMLMKYFTNKQILPTLILSLAFLSGLIYLVETIVIIHKPINGSTLIQTKSNDVS
IFYIFRQLSFICLTSLALFCYGKDNILDNNKKKTGILLLALIPFLVFPLLAHNLSSYNADYSLYVVDYCPDNHTATWGIN
YTKILVCLWAFLLFFIIMRTRLASELWPLIALLCLASLCCNLLLLTLDEYNYTIWYISRGIEVSSKLFVVSFLIYNIFQE
LQLSSKLAVHDVLTNIYNRRYFFNSVESLLSRPVVKDFCVMLVDINQFKRINAQWGHRVGDKVLVSIVDIIQQSIRPDDI
LARLEGEVFGLLFTELNSAQAKIIAERMRKNVELLTGFSNRYDVPEQMTISIGTVFSTGDTRNISLVMTEADKALREAKS
EGGNKVIIHHI
>Q9KVK6 3.1.4.52~~~cdgJ~~~Cyclic di-GMP phosphodiesterase CdgJ~~~COG3434
MVRCLWAAECCWLPPKRKQPFEDTMYTTYVARQPILNAKRHTLGYELLFRDGEKNAFPEYMDADRATYRLIVENFLSLGT
NPRIARSRCFINFPHKSLIRRLPLTLPREQIVVEILETCQPTDDLFEAVQELSQRGYLLALDDFVYSPAWERFLPYVQIV
KIDIMAMGLDKACEFVRGRLAQGSRRRFLAERVETEDEFHQARHAGFTFFQGYFFSKPEIIKQRYVSPEHVIAMQLFREV
CQPEVDYVRVERLVAQDIALSYKLLRFVNTMSDRISVSISSFRQALVYLGQDKLRIFVSLAVASYISSKKPKELYNLSLQ
RAQFCQLMATHTHFKAHREQAFLIGMFSVLDALLDTSIEQLVEQLPLADDVKLALREREGPLGTLLDLEECFEKADWQGV
EQHCLELGFDLEDVRQELIEAQRWSQDINRLI
>P30920 2.4.1.19~~~~~~Cyclomaltodextrin glucanotransferase~~~
MFQMAKRAFLSTTLTLGLLAGSALPFLPASAVYADPDTAVTNKQSFSTDVIYQVFTDRFLDGNPSNNPTGAAYDATCSNL
KLYCGGDWQGLINKINDNYFSDLGVTALWISQPVENIFATINYSGVTNTAYHGYWARDFKKTNPYFGTMADFQNLITTAH
AKGIKIVIDFAPNHTSPAMETDTSFAENGRLYDNGTLVGGYTNDTNGYFHHNGGSDFSSLENGIYKNLYDLADFNHNNAT
IDKYFKDAIKLWLDMGVDGIRVDAVKHMPLGWQKSWMSSIYAHKPVFTFGEWFLGSAASDADNTDFANKSGMSLLDFRFN
SAVRNVFRDNTSNMYALDSMINSTATDYNQVNDQVTFIDNHDMDRFKTSAVNNRRLEQALAFTLTSRGVPAIYYGTEQYL
TGNGDPDNRAKMPSFSKSTTAFNVISKLAPLRKSNPAIAYGSTQQRWINNDVYVYERKFGKSVAVVAVNRNLSTSASITG
LSTSLPTGSYTDVLGGVLNGNNITSTNGSINNFTLAAGATAVWQYTTAETTPTIGHVGPVMGKPGNVVTIDGRGFGSTKG
TVYFGTTAVTGAAITSWEDTQIKVTIPSVAAGNYAVKVAASGVNSNAYNNFTILTGDQVTVRFVVNNASTTLGQNLYLTG
NVAELGNWSTGSTAIGPAFNQVIHQYPTWYYDVSVPAGKQLEFKFFKKNGSTITWESGSNHTFTTPASGTATVTVNWQ
>P04830 2.4.1.19~~~cgtM~~~Cyclomaltodextrin glucanotransferase~~~
MKSRYKRLTSLALSLSMALGISLPAWASPDTSVDNKVNFSTDVIYQIVTDRFADGDRTNNPAGDAFSGDRSNLKLYFGGD
WQGIIDKINDGYLTGMGVTALWISQPVENITSVIKYSGVNNTSYHGYWARDFKQTNDAFGDFADFQNLIDTAHAHNIKVV
IDFAPNHTSPADRDNPGFAENGGMYDNGSLLGAYSNDTAGLFHHNGGTDFSTIEDGIYKNLYDLADINHNNNAMDAYFKS
AIDLWLGMGVDGIRFDAVKHMPFGWQKSFVSSIYGGDHPVFTFGEWYLGADQTDGDNIKFANESGMNLLDFEYAQEVREV
FRDKTETMKDLYEVLASTESQYDYINNMVTFIDNHDMDRFQVAGSGTRATEQALALTLTSRGVPAIYYGTEQYMTGDGDP
NNRAMMTSFNTGTTAYKVIQALAPLRKSNPAIAYGTTTERWVNNDVLIIERKFGSSAALVAINRNSSAAYPISGLLSSLP
AGTYSDVLNGLLNGNSITVGSGGAVTNFTLAAGGTAVWQYTAPETSPAIGNVGPTMGQPGNIVTIDGRGFGGTAGTVYFG
TTAVTGSGIVSWEDTQIKAVIPKVAAGKTGVSVKTSSGTASNTFKSFNVLTGDQVTVRFLVNQANTNYGTNVYLVGNAAE
LGSWDPNKAIGPMYNQVIAKYPSWYYDVSVPAGTKLDFKFIKKGGGTVTWEGGGNHTYTTPASGVGTVTVDWQN
>P43379 2.4.1.19~~~cgt~~~Cyclomaltodextrin glucanotransferase~~~
MKKFLKSTAALALGLSLTFGLFSPAQAAPDTSVSNKQNFSTDVIYQIFTDRFSDGNPANNPTGAAFDGTCTNLRLYCGGD
WQGIINKINDGYLTGMGVTAIWISQPVENIYSIINYSGVNNTAYHGYWARDFKKTNPAYGTIADFQNLIAAAHAKNIKVI
IDFAPNHTSPASSDQPSFAENGRLYDNGTLLGGYTNDTQNLFHHNGGTDFSTTENGIYKNLYDLADLNHNNSTVDVYLKD
AIKMWLDLGIDGIRMDAVKHMPFGWQKSFMAAVNNYKPVFTFGEWFLGVNEVSPENHKFANESGMSLLDFRFAQKVRQVF
RDNTDNMYGLKAMLEGSAADYAQVDDQVTFIDNHDMERFHASNANRRKLEQALAFTLTSRGVPAIYYGTEQYMSGGTDPD
NRARIPSFSTSTTAYQVIQKLAPLRKCNPAIAYGSTQERWINNDVLIYERKFGSNVAVVAVNRNLNAPASISGLVTSLPQ
GSYNDVLGGLLNGNTLSVGSGGAASNFTLAAGGTAVWQYTAATATPTIGHVGPMMAKPGVTITIDGRGFGSSKGTVYFGT
TAVSGADITSWEDTQIKVKIPAVAGGNYNIKVANAAGTASNVYDNFEVLSGDQVSVRFVVNNATTALGQNVYLTGSVSEL
GNWDPAKAIGPMYNQVVYQYPNWYYDVSVPAGKTIEFKFLKKQGSTVTWEGGSNHTFTAPSSGTATINVNWQP
>P31835 2.4.1.19~~~~~~Cyclomaltodextrin glucanotransferase~~~
MKKQVKWLTSVSMSVGIALGAALPVWASPDTSVNNKLNFSTDTVYQIVTDRFVDGNSANNPTGAAFSSDHSNLKLYFGGD
WQGITNKINDGYLTGMGITALWISQPVENITAVINYSGVNNTAYHGYWPRDFKKTNAAFGSFTDFSNLIAAAHSHNIKVV
MDFAPNHTNPASSTDPSFAENGALYNNGTLLGKYSNDTAGLFHHNGGTDFSTTESGIYKNLYDLADINQNNNTIDSYLKE
SIQLWLNLGVDGIRFDAVKHMPQGWQKSYVSSIYSSANPVFTFGEWFLGPDEMTQDNINFANQSGMHLLDFAFAQEIREV
FRDKSETMTDLNSVISSTGSSYNYINNMVTFIDNHDMDRFQQAGASTRPTEQALAVTLTSRGVPAIYYGTEQYMTGNGDP
NNRGMMTGFDTNKTAYKVIKALAPLRKSNPALAYGSTTQRWVNSDVYVYERKFGSNVALVAVNRSSTTAYPISGALTALP
NGTYTDVLGGLLNGNSITVNGGTVSNFTLAAGGTAVWQYTTTESSPIIGNVGPTMGKPGNTITIDGRGFGTTKNKVTFGT
TAVTGANIVSWEDTEIKVKVPNVAAGNTAVTVTNAAGTTSAAFNNFNVLTADQVTVRFKVNNATTALGQNVYLTGNVAEL
GNWTAANAIGPMYNQVEASYPTWYFDVSVPANTALQFKFIKVNGSTVTWEGGNNHTFTSPSSGVATVTVDWQN
>P30921 2.4.1.19~~~cgt~~~Cyclomaltodextrin glucanotransferase~~~
MKKISKLTTALALSLSLALSLLGPAHAAPDTSVSNKQNFSTDVIYQIFTDRFSDGNPANNPTGPAFDGTCTNLRLYCGGD
WQGIINKINDGYLTGMGVTAIWISQPVENIYSVINYSGVNNTAYHGYWARDFKKTNPAYGTIADFQNLIAAAHAKNIKVI
IDFAPNHTSPASLDQPSFAENGKLYNNGRDEGGYTNDTHNLFHHNGGTDFSTTENGIYKNLYDLADLNHNNSTVDTYLKD
AIKMWLDLGIDGIRMDAVKHMPFGWQKSFMATVNNYKPVFTFGEWFLGVNEVSAENHKFANVSGMSLLDFRFAQKVRQVF
KDNTDNMYGLKSMLEGSATDYAQMEDQVTFIDNHDMERFHNNSANRRKLEQALAFTLTSRGVPAIYYGTEQYMSGGNDPD
NRARIPSFSTTTTAYQVSKKLAPLRKSNPAIAYGTTQERWINNDVLIYERKFGNNVAVIAVNRNVNTSASITGLVTSLPA
GSYTDVLGGLLNGNNLTVGSGGSASIFTLAAGGTAVWQYTTAVTAPTIGHVGPMMAKPGAAVTIDGRGFGATKGTVYFGT
TAVTGANITAWEDTQIKVKIPAVAGGVYNIKIANSAGTSSNVHDNFEVLSGDQVSVRFVVNNATTALGQNVYLAGSVSEL
GNWDPAKAIGPLYNQVIYQYPTWYYDVTVPAGKTIEFKFLKKQGSTVTWEGGSNHTFTAPTSGTATINVNWQP
>P05618 2.4.1.19~~~cgt~~~Cyclomaltodextrin glucanotransferase~~~
MKRFMKLTAVWTLWLSLTLGLLSPVHAAPDTSVSNKQNFSTDVIYQIFTDRFSDGNPANNPTGAAFDGSCTNLRLYCGGD
WQGIINKINDGYLTGMGITAIWISQPVENIYSVINYSGVNNTAYHGYWARDFKKTNPAYGTMQDFKNLIDTAHAHNIKVI
IDFAPNHTSPASSDDPSFAENGRLYDNGNLLGGYTNDTQNLFHHYGGTDFSTIENGIYKNLYDLADLNHNNSSVDVYLKD
AIKMWLDLGVDGIRVDAVKHMPFGWQKSFMATINNYKPVFTFGEWFLGVNEISPEYHQFANESGMSLLDFRFAQKARQVF
RDNTDNMYGLKAMLEGSEVDYAQVNDQVTFIDNHDMERFHTSNGDRRKLEQALAFTLTSRGVPAIYYGSEQYMSGGNDPD
NRARLPSFSTTTTAYQVIQKLAPLRKSNPAIAYGSTHERWINNDVIIYERKFGNNVAVVAINRNMNTPASITGLVTSLRR
ASYNDVLGGILNGNTLTVGAGGAASNFTLAPGGTAVWQYTTDATTPIIGNVGPMMAKPGVTITIDGRGFGSGKGTVYFGT
TAVTGADIVAWEDTQIQVKIPAVPGGIYDIRVANAAGAASNIYDNFEVLTGDQVTVRFVINNATTALGQNVFLTGNVSEL
GNWDPNNAIGPMYNQVVYQYPTWYYDVSVPAGQTIEFKFLKKQGSTVTWEGGANRTFTTPTSGTATVNVNWQP
>P31746 2.4.1.19~~~cgt~~~Cyclomaltodextrin glucanotransferase~~~
MNDLNDFLKTILLSFIFFLLLSLPTVAEADVTNKVNYSKDVIYQIVTDRFSDGNPGNNPSGAIFSQNCIDLHKYCGGDWQ
GIIDKINDGYLTDLGITALWISQPVENVYALHPSGYTSYHGYWARDYKKTNPYYGNFDDFDRLMSTAHSNGIKVIMDFTP
NHSSPALETNPNYVENGAIYDNGALLGNYSNDQQNLFHHNGGTDFSSYEDSIYRNLYDLADYDLNNTVMDQYLKESIKFW
LDKGIDGIRVDAVKHMSEGWQTSLMSEIYSHKPVFTFGEWFLGSGEVDPQNHHFANESGMSLLDFQFGQTIRNVLKDRTS
NWYDFNEMITSTEKEYNEVIDQVTFIDNHDMSRFSVGSSSNRQTDMALAVLLTSRGVPTIYYGTEQYVTGGNDPENRKPL
KTFDRSTNSYQIISKLASLRQTNSALGYGTTTERWLNEDIYIYERTFGNSIVLTAVNSSNSNQTITNLNTSLPQGNYTDE
LQQRLDGNTITVNANGAVNSFQLRANSVAVWQVSNPSTSPLIGQVGPMMGKAGNTITVSGEGFGDERGSVLFDSTSSEII
SWSNTKISVKVPNVAGGYYDLSVVTAANIKSPTYKEFEVLSGNQVSVRFGVNNATTSPGTNLYIVGNVNELGNWDADKAI
GPMFNQVMYQYPTWYYDISVPAGKNLEYKYIKKDQNGNVVWQSGNNRTYTSPTTGTDTVMINW
>P09121 2.4.1.19~~~cgt~~~Cyclomaltodextrin glucanotransferase~~~
MKRFMKLTAVWTLWLSLTLGLLSPVHAAPDTSVSNKQNFSTDVIYQIFTDRFSDGNPANNPTGAAFDGSCTNLRLYCGGD
WQGIINKINDGYLTGMGITAIWISQPVENIYSVINYSGVHNTAYHGYWARDFKKTNPAYGTMQDFKNLIDTAHAHNIKVI
IDFAPNHTSPASSDDPSFAENGRLYDNGNLLGGYTNDTQNLFHHYGGTDFSTIENGIYKNLYDLADLNHNNSSVDVYLKD
AIKMWLDLGVDGIRVDAVKHMPFGWQKSFMSTINNYKPVFNFGEWFLGVNEISPEYHQFANESGMSLLDFPFAQKARQVF
RDNTDNMYGLKAMLEGSEVDYAQVNDQVTFIDNHDMERFHTSNGDRRKLEQALAFTLTSRGVPAIYYGSEQYMSGGNDPD
NRARIPSFSTTTTAYQVIQKLAPLRKSNPAIAYGSTQERWINNDVIIYERKFGNNVAVVAINRNMNTPASITGLVTSLPQ
GSYNDVLGGILNGNTLTVGAGGAASNFTLAPGGTAVWQYTTDATAPINGNVGPMMAKAGVTITIDGRASARQGTVYFGTT
AVTGADIVAWEDTQIQVKILRVPGGIYDIRVANAAGAASNIYDNFEVLTGDQVTVRFVINNATTALGQNVFLTGNVSELG
NWDPNNAIGPMYNQVVYQYPTWYYDVSVPAGQTIEFKFLKKQGSTVTWEGGANRTFTTPTSGTATVNVNWQP
>P17692 2.4.1.19~~~cgt~~~Cyclomaltodextrin glucanotransferase~~~
MKKFLKMTAAFSLGLSLAFGLFSPAQAAPDTSVSNKQNFSTDVIYQIFTDRFSDGNPANNPTGAAFDGTCTNLRLYCGGD
WQGIINKINDGYLTGMGVTAIWISQPVENIYSIINYSGVNNTAYHGYWARDFKKTNPAYGTIADFQNLIAAAHAKNIKVI
IDFAPNHTSPASSDQPSFAENGRLYDNGTLLGGYTNDTQNLFHHNGGTDFSTTENGIYKNLYDLADLNHNNSTSDVYLKD
AIKMWLDLGIDGIRMDAVKHMPFGWQKSFMAAVNNYKPVFTFGEWFLGVNEVGPENHKFANESGMSLLDFRFAQKVRQVF
RDNTDNMYGLKAMLEGSAADYAQVDDQVTFIDNHDMERFHASNANRRKLEQALAFTLILARVPAIYYGTEQYMSGGTDPD
NRARIPSFSTSTTAYQVIQKLAPLRKSNPAIAYGSTQERWINNDVLIYERKFGSNVAVVAVNRNLNAPASISGLVTSLPQ
GSYNDVLGGLLNGNTLTVGSGGAASNFTLAAGGTAVWQYTAATATPTIGHVGPMMAKPGVTITIDGRGFGSSKGTVYFGT
TAVSGANITSWEDTQIKVKIPAVAGGIYNIKVANAAGTASNVYDNFEVLSGDQVSVRFVVNNATTALGQNLYLTGNVSEL
GNWDPAKAIGPMYNQVVYQYPNWYYDVSVPAGKTIEFKFLKKQGSTVTWEGGSNHTFTAPSSGTATINVNWQP
>P31797 2.4.1.19~~~cgt~~~Cyclomaltodextrin glucanotransferase~~~
MRRWLSLVLSMSFVFSAIFIVSDTQKVTVEAAGNLNKVNFTSDVVYQIVVDRFVDGNTSNNPSGALFSSGCTNLRKYCGG
DWQGIINKINDGYLTDMGVTAIWISQPVENVFSVMNDASGSASYHGYWARDFKKPNPFFGTLSDFQRLVDAAHAKGIKVI
IDFAPNHTSPASETNPSYMENGRLYDNGTLLGGYTNDANMYFHHNGGTTFSSLEDGIYRNLFDLADLNHQNPVIDRYLKD
AVKMWIDMGIDGIRMDAVKHMPFGWQKSLMDEIDNYRPVFTFGEWFLSENEVDANNHYFANESGMSLLDFRFGQKLRQVL
RNNSDNWYGFNQMIQDTASAYDEVLDQVTFIDNHDMDRFMIDGGDPRKVDMALAVLLTSRGVPNIYYGTEQYMTGNGDPN
NRKMMSSFNKNTRAYQVIQKLSSLRRNNPALAYGDTEQRWINGDVYVYERQFGKDVVLVAVNRSSSSNYSITGLFTALPA
GTYTDQLGGLLDGNTIQVGSNGSVNAFDLGPGEVGVWAYSATESTPIIGHVGPMMGQVGHQVTIDGEGFGTNTGTVKFGT
TAANVVSWSNNQIVVAVPNVSPGKYNITVQSSSGQTSAAYDNFEVLTNDQVSVRFVVNNATTNLGQNIYIVGNVYELGNW
DTSKAIGPMFNQVVYSYPTWYIDVSVPEGKTIEFKFIKKDSQGNVTWESGSNHVYTTPTNTTGKIIVDWQN
>P26827 2.4.1.19~~~amyA~~~Cyclomaltodextrin glucanotransferase~~~
MKKTFKLILVLMLSLTLVFGLTAPIQAASDTAVSNVVNYSTDVIYQIVTDRFVDGNTSNNPTGDLYDPTHTSLKKYFGGD
WQGIINKINDGYLTGMGVTAIWISQPVENIYAVLPDSTFGGSTSYHGYWARDFKRTNPYFGSFTDFQNLINTAHAHNIKV
IIDFAPNHTSPASETDPTYAENGRLYDNGTLLGGYTNDTNGYFHHYGGTDFSSYEDGIYRNLFDLADLNQQNSTIDSYLK
SAIKVWLDMGIDGIRLDAVKHMPFGWQKNFMDSILSYRPVFTFGEWFLGTNEIDVNNTYFANESGMSLLDFRFSQKVRQV
FRDNTDTMYGLDSMIQSTASDYNFINDMVTFIDNHDMDRFYNGGSTRPVEQALAFTLTSRGVPAIYYGTEQYMTGNGDPY
NRAMMTSFNTSTTAYNVIKKLAPLRKSNPAIAYGTTQQRWINNDVYIYERKFGNNVALVAINRNLSTSYNITGLYTALPA
GTYTDVLGGLLNGNSISVASDGSVTPFTLSAGEVAVWQYVSSSNSPLIGHVGPTMTKAGQTITIDGRGFGTTSGQVLFGS
TAGTIVSWDDTEVKVKVPSVTPGKYNISLKTSSGATSNTYNNINILTGNQICVRFVVNNASTVYGENVYLTGNVAELGNW
DTSKAIGPMFNQVVYQYPTWYYDVSVPAGTTIQFKFIKKNGNTITWEGGSNHTYTVPSSSTGTVIVNWQQ
>D7REY3 1.17.5.2~~~cdhA~~~Caffeine dehydrogenase subunit alpha~~~
MFADINKGDAFGTWVGKSVPRREDADILAGRAEYIADIKLPGMLEAAFLRSPFAHARIVSIDVSQALALPGVYDVMVGAD
IPDYVKPLPLMITYQNHRETPTSPLARDIVRYAGEPVAVVAAINRYVAEDALELIVVKYEELPVVASIDASLAVDGPRLY
EGWPDNVVAKVSSEIGDVDAAMASADLVFEERFEIQRCHPAPLETRGFIAQWDFKGENLNVWNGTQIINQCRDFMSEVLD
IPASKIRIRSPRLGGGFGAKFHFYVEEPAIVLLAKRVKAPVRWIEDRLEAFSATVHAREQVIDVKLCAMNDGRITGIVAD
IKGDLGASHHTMSMGPVWLTSVMMTGVYLIPNARSVAKAIVTNKPPSGSYRGWGQPQANFAVERMVDLLAHKLQLDPAAV
RRINYVPEARMPYTGLAHTFDSGRYEVLHDRALKTFGYEAWLERQAAAQAQGRRIGIGMSFYAEVSAHGPSRFLNYVGGR
QGGYDIARIRMDTTGDVYVYTGLCDMGQGVTNSLAQIAADALGLNPDDVTVMTGDTALNPYTGWGTGASRSITIGGPAVM
RAATRLREKILSIARHWLQADPDTLVLANRGVMVRDDPGRYVSFASIGRAAYCQIIELPEDVEPGLEAVGVFDTVQLAWP
YGMNLVAVEVDEDTGAVSFLDCMLVHDMGTIVNPMIVDGQLHGGIAQGIAQALYEELRYDENGQLGTGSFADFLMPTASE
IPNMRFDHMVTESPLIPGGMKGVGEGGTIGTPAAVVNAIENALRPITNSKLNRTPVTPDRILTAISAGACA
>D7REY4 1.17.5.2~~~cdhB~~~Caffeine dehydrogenase subunit beta~~~
MKPTAFDYIRPTSLPEALAILAEHSDDVAILAGGQSLMPLLNFRMSRPALVLDINDISELQQVRCENDTLYVGSMVRHCR
VEQEEIFRSTIPLMSEAMTSVAHIQIKTRGTLGGNLCNAHPASEMPAVITALGASMVCKSEKRGERVLTPEEFFEGALQN
GLQSDELLCEIRIPVPSQYVGWAFEEVARRHGDFAQCGAAVLIGAEDRKIDYARIALCSIGETPIRFHALEQWLIGRPVG
NDLPADVKLHCREILDVAEDSTMTAENRAKLASAVTSRAIARAADRIVHLDVKRG
>D7REY5 1.17.5.2~~~cdhC~~~Caffeine dehydrogenase subunit gamma~~~
MSSHVISLTVNGQAIERKVDSRTLLADFLRDELRLTGTHVGCEHGVCGACTIQFDGEPARSCLMLAVQAEGHSIRTVEAL
AVDGCLGALQQAFHEKHGLQCGFCTPGLLMTLDYALTADLHIDFSSDKEIRELISGNLCRCTGYQNIINAIKSVSPTTEI
AKSEELV
>Q9HTH5 ~~~cdhR~~~HTH-type transcriptional regulator CdhR~~~
MSQDFWFLLLPGFSVMGFVSAVEPLRVANRFHADLYRWHVLSADGGPVLASNGMSVNSDGALEPLKKGDLLFVVAGFEPL
RAVTPALVQWLRKLDRNGVTLGGIDTGSVVLAEAGLLDGRRATLHWEAIDAFQESYPQLSVTQELFEIDGPRITSAGGTA
SIDLMLDLIAQAHGPQLAVQVSEQFVLGRIRPRQDHQRLQVATRYGVSNRKLVQVIGEMERHTEPPLTTLELAERIQVTR
RQLERLFRVHLDDTPSNFYLGLRLDKARQLLRQTDLSVLQVSLACGFESPSYFSRSYRARFAASPSQDRAVLPLKAPAAT
PPGAPAGHRTPRAERG
>Q8X7A5 3.6.1.26~~~cdh~~~CDP-diacylglycerol pyrophosphatase~~~COG2134
MKKAGLLFLVMIVIAVVAAGIGYWKLTGEESDTLRKIVLEECLPNQQQNQNPSPCAEVKPNAGYVVLKDLNGPLQYLLMP
TYRINGTESPLLTDPSTPNFFWLAWQARDFMSKKYGQPVPDRAVSLAINSRTGRTQNHFHIHISCIRPDVREQLDNNLAN
ISSRWLPLPGGLRGHEYLARRVTESELVQRSPFMMLAEEVPEAREHMGSYGLAMVRQSDNSFVLLATQRNLLTLNRASAE
EIQDHQCEILR
>P9WPG9 3.6.1.26~~~cdh~~~Probable CDP-diacylglycerol pyrophosphatase~~~COG2134
MPKSRRAVSLSVLIGAVIAALAGALIAVTVPARPNRPEADREALWKIVHDRCEFGYRRTGAYAPCTFVDEQSGTALYKAD
FDPYQFLLIPLARITGIEDPALRESAGRNYLYDAWAARFLVTARLNNSLPESDVVLTINPKNARTQDQLHIHISCSSPTT
SAALRNVDTSEYVGWKQLPIDLGGRRFQGLAVDTKAFESRNLFRDIYLKVTADGKKMENASIAVANVAQDQFLLLLAEGT
EDQPVAAETLQDHDCSITKS
>B3BM48 3.1.-.-~~~cdiA1~~~tRNA nuclease CdiA~~~
MNQPPVHFTYRLLSYLVSAIIAGQPLLPAVGAVITPQNGAGMDKAANGVPVVNIATPNGAGISHNRFTDYNVGKEGLILN
NATGKLNPTQLGGLIQNNPNLKAGGEAKGIINEVTGGNRSLLQGYTEVAGKAANVMVANPYGITCDGCGFINTPHATLTT
GRPVMNADGSLQALEVTEGSITINGAGLDGTRSDAVSIIARATEVNAALHAKDLTVTAGANRITADGRVSALKGEGDVPK
VAVDTGALGGMYARRIHLTSTESGVGVNLGNLYAREGDIILNSAGKLVLKNSLAGGNTTVTGTDVSLSGDNKAGGNLSVT
GTTGLTLNQSRLVTDKNLVLSSSGQIVQNGGELTAGQNAMLSAQHLNQTSGTVNAAENVTLTTTDDTTLKGRSIAGKTLT
VSSGSLNNGGTLVAGRDATVKTGTFSNTGTVQGNGLKVTATDLTSTGSIKSGSTLDISARNATLSGDAGAKDSARVTVSG
TLENRGRLVSDDVLTLSATQINNSGTLSGAKELVASADTLTTTEKSVTHSDGNLMLNSASSTLAGETSAGSTVSVKGNSL
KTTATAQTQGNSVSVDVQNAQLDGTQAARDILTLNASEKLTHSGKSSAPSLSLSAPELTSSGVLVGSALNTQSQTLTNSG
LLQGKASLTVNTQRLDNQQNGTLYSAADLTLDIPDIRNSGLITGDNGLTLNTASLSNPGKIIADTLNVRATTLDGDGLLQ
GAGALALAGDTLSQGRNGRWLTAGDLSLRGKTLHTAGTTQGQNLTVQADNWANSGSVLATGNLTASATGQLTSTGDIMSQ
GDTTLNAATTDNRGSLLSAGTLSLDGNSLDNRGTVQGDHVTIRQNSVTNSGTFTGIAALTLAARMVSPQPALMNNGGSLL
TSGDLTITAGSITSSGHWQGKRVLITADSLANSGAIQAADSLTARLTGELVSAAGSKVTSNGEMALSALNLSNSGQWIAK
NLTLKANSLTSAGDITGVDALTLTVNQTLNNHASGKLLSAGVLTLKADSVTNDGQLQGNATTITAGQLTNGGHLQGETLT
LTASGGVNNRSGGVLMSRNALNVSTATLSNQGTIQGGGGVSLNATDRLQNDGKILSGSNLTLTAQVLANTGSGLVQAATL
LLDVVNTVNGGRVLATGSADVKGTTLNNTGTLQGADLLVNYHTFSNSGTLLGTSGLGVKGSSLLQNGTGRLYSAGNLLLD
AQDFSGQGQVVATGDVTLKLIAALTNHGTLAAGKTLSVTSQNAITNGGVMQGDAMVLGAGEAFTNNGMLTAGKGNSVFSA
QRLFLNAPGSLQAGGDVSLNSRSDITISGFTGTAGSLTMNVAGTLLNSALIYAGNNLKLFTDRLHNQHGDILAGNSLWVQ
KDSSGTANSEIINRSGNIETTRGDITMNTAHLLNSWDAISASHEVIPGSSHGVISPVPENNRWWGVVRHDGVEYLAVYWG
EGATVPDEYRIRTGDTETVTVSASGHAARISGGADMHIRAGRLDNEASFILAGGSMTLSGDTLNNQGWQEGTTGKETVWR
LASGSLPKAWFTEPWYKVYRQVSTDATEASGTSPAGQYRAVISAASDVSASFATDTGNTTVMPRAGGAGNTITVPSLNSL
TPPTVSQGVSGEALLNESGTGITGPVWNDALPDTLKDIPGALSLSGASVSSYPLPSGNNGYFVPSTDPDSPYLITVNPKL
DGLGKVDSSLFAGLYDLLRMQPGQAPRETDPAYTDEKQFLGSSYILDRLGLKPEKDYRFLGDAAFDTRYVSNVILNQTGS
RYINGTGSDLAQMKYLMDSAAAQQKALGLTFGVSLTAGQVAQLTRSILWWESVTINGQTVMVPKLYLSPEDITLHNGSVI
SGNNVQLAGGNITNSGGSINAQNDLLLDSTGSIDNLNAGLINAGGALNLKAIGDIGNISSVISGKTVSLESATGNISNLT
RTEQWAMNNGYNHFSGTDTGPLAAVRATDSLFMGAAGDISITGAAVSAGDSVLLSAGNDLNMNAIQAGERRRYGGSGWYE
THAVAPTVTAGNSLMLSAGRDVNSQAAGIMAENSMDIRAGRDVNMAAESTGTGDHDSTFSMKTVHDSVRQQGTDMTSGGD
ITVTAGRDITSVATAVTAKGDIRVNAGHDIVLGTATESDYHYSESGETRNRLLSHQTTRTITEDSVTREKGSLLSGNRVT
VDAGDNLTVEGSDVVADRDVSLAAGNHVDVLAATSTDTSWRFKETKKSGLMGTGGIGFTIGSSKTTHDRREAGTTQSQSA
STIGSTAGNVSITAGKQAHISGSDVIANRDISITGDSVVVDPGHDRRTVDEKFEQKKSGLTVALSGTVGSAINNAVTSAQ
ETKESSDSRLKALQATKTALSGVQAGQAAAMATATGDPNATGVSLSLTTQKSKSQQHSESDTVSGSTLNAGNNLSVVATG
KNRGDNRGDIVIAGSQLKAGGNTSLDAANDVLLSGAANTQKTTGRNSSSGGGVGVSIGAGGNGAGISVFASVNAAKGSEK
GNGTEWTETTIDSGKTVTINSGRDTVLNGAQVNGNRIIADVGHDLLISSQQDTSKYDSKQTSVAAGGSFTFGSMTGSGYI
AASRDKMKSRFDSVAEQTGMFAGDGGFDITVGRHTQLDGAVIASTATPDKNHLDTGTLGFSDLHNEADYKVSHSGISLSG
GGSFGDKFQGNMPGGMISAGGHSGHAEGTTQAAVAEGTITIRDRDNQKQNLANLSRDPVHANDSISPIFDKEKEQRRLQT
VGLISDIGSQVADIARTQGELNALKAAKEATGETLPANATEKQRQEYLAKLRDTQAYRNEMAKYGTGSEIQRGIQAATAA
LQGLAGGNLAGALAGASAPELAHLLKSTEKYPAVNAIAHAILGGAAAAMQGNNVAAGAAGAATGELAARAIAGMLYPGVK
QSDLSEEQKQTISTLATVSAGLAGGLTGNSTASAAVGAQSGKNAVENNYLSKAQKAQKADELAKCQTAACKAQTEAKWTA
IDLGQDGSFAAGMIAGVPAGLYDAVDSIVKAGSNPTETLEAMKALFNSGDILGSLSDAVKQSYIDRIDRMEAEYQKAGTS
GSFNAGVEGGKLITDIAGLLAGGVGVVKGGAVLTEKVVAKVVGKSESAAAKVGTDIVKTGTVFDSIKATQPAIPGTSIPK
SFELHVNGQTVWVNPNATKHMGEYLTRNGLSHSTAEGSQAMLTSLQSAVKDAFSQGLKFNEKMQVGRWELVFSQRSSDPY
PVLKHALYK
>I1WVY3 3.1.-.-~~~cdiA2~~~tRNA nuclease CdiA-2~~~
MNKNRYRVVFNRARGALMVVQENGRASHGSGSRDARAGVVPAWLSLSPFALRHVALAVLVAAGVVPIWVNAQVVAGGAHA
PSVIQTQNGLQQVNINRPGASGVSMNTYNQFDVPKPGIILNNSPINVQTQLGGIIGGNPNFQAGDAARLIVNQVNSNNPS
FIRGKVEIGGAAAQLVIANQAGLVVDGGGFLNTSRATLTTGNPNFGPDGSLTGFNVNQGLISVVGAGLDTANVDQVDLLA
RAVQINAKAYAKTLNVVAGSNQVDYNTLNATPIAANGPAPTIAIDVSQLGGMYANRVFLVSSENGVGVANAGDIAAQAGD
LTLQANGRLVLSGHTNAAGNMSLSASGGIQNSGVTYGKQSVTITTGADLTNSGALTAQQNLTANVGSLNSTGTLGAGINV
DSTVGTSGDLNVTSSGQLTATGTNSAAGNATFTGSGVNLSNSATAANGNLALTATAGDVNLAGSTVSAKGAVNAQASGTV
VNDRGNLSSGAGMTLGGGSLSNQGGRANSQGPLSVQMAGTVSNQNGMLSSQSTADVRGSAIQNNAGLIQSAGKQTIAGAS
IDNSAGRLISLNADGLSVTATGALTNAAGANVSGDPGGVIGGKGDVTVQGNTVTNSGSMSADATLHVIGQSVDNGNGALH
AGQTTTVDAGNHLSNAGGRVEGQSAVLNGATLDNSQGTVNAATVSLNGTTLLNHGGTVTQTGTGPMTVAITDTLDNSNNG
LIQTRSTDLSLTSTTLINDNGGTITHVGPGTLTVGNGSGTVSNKAGAIASNGRTVLQGKTIDNSAGSASGQTGLSVNAAD
SITNLGGKLTSNANVDVTAGGALVNDGGELGSKTAATTIHSASLSNLNGKIVSPTLTATVAGLLDNSQNGDFEANQLALT
AANLKNQGGHISQWQSGPTTLAVSGTLDNSNGGVIQTNSTDLTLAPAVLDNSKGTITHGGTGTLTLTPGNGAGALQNTGG
TIGTNGQAIVKAGSLDNGSGVIAAKLGLSATIAGAMNNTQGLMRSNAALSIISNGALSNHQGHIEAGTPGDTSTLSIQAA
SIDNTDGAVHDFGTGKMTVQGGSQIVNSHAGGVDGMGQMTGQGDVTIGAASISNTQGGQLMGANLLIQGATLDNSGGQVG
NVANATGDVNVAMSGAVTNTNGSITSTRDLSVAASTLLGGGAYSAARDAAINLQGDFTTTPQTQFNIGRDLTFTLPGTFA
NSANLQSVHNLTVNAGNIVNTGAMTAGSLLSTHSGDLTNYGAMVGGSVAIQASGTVSNLGPVALIGASDTSGLLEIVAHD
IENRDDTTLGDSMPTTTIFGLGKVALAGGKDANGNYTNAALINNSSAAIQSGASMELHADKVTNTRRVMQTSGNTSQVDP
ALLQQLGISMSGCAAYYIAACSGQDVHWINLFHDPNYPDYDPAPIIAALKLQPGGVFTVPPNGGQWNSGYQYTTYEGKAT
ANTVTKLSPGAQIASGGDLDASTVKTFQNYWSSVTAAGNIKQPASLDMDGWGATGQQAPGVTVVYSGYYHYNNYDNSEHN
WTLPFGDKPFVGGPGGYTQAAPADVRQYSLPDYRSTWGANGTISGNGVSVNNTAANATIPSLGLLPGQAVPGLTIGTVSG
NASGTQSGAAAIKGGTPTWVDPVIASATAVNVLSNLTIPQGGLYRPNSAPNPTYLIETNPAFTRMNNFLSSDYYLNQIGV
NPLTTEKRLGDGFYEQQLVRNQVTQLTGKAVLGPYTDLQGMYQSLMLAGAEWSKSLNLPLGMSLSAQQVAALTTNVIIMQ
TETVGGQQVLVPVVYLAKADQQNANGPLITAGNIDLKNTQVFTNSGTVKADTTLALQGKQIDNAFGALQSGGLTSLDTTG
NVDLTSANVKAGSLDLNAGNKLILDTATQTTHQVSRDGATSDKTTLGPAANLNVAGDASIKTGGDFQQNAGNLNVGGNLN
ANIGGNWNLGVQQTGEHKVVQRANGVSDTDLNSATGSTVNVGGKSAIGVGGDLTAQGARLDFGQGGTVAAKGNVTFGAAS
TTSTINANSSGDQGNRSYAETRHGSDQALTGTTVKGGDTLNVVSGKDINVIGSTIDLKKGDANLLAAGDVNVGAVTERHV
YNSRETHSRSGVVSGTKIASSQDATSTVANGSLISADGVSIGSGKDINVQGSTVVGTHDVALNAAHDVNITTSQDTSQSS
TTYQEQHSGLMSGGGLSFSVGNSKLAQQNQSSSVTNNASTVGSVDGNLTVNAGNTLHVKGSDLVAGKDVTGTAANIVVDS
ATDTTRQAQQQQTSKSGLTVGLSGSVGDAINNAISETQAARESAKDSNGRASALHSIAAAGDVAFGGLGAKALLDGAKGP
QAPSIGVQVSVGSSHSSMQSSEDQTIQRGSSINAGGNAKLIATGNGTPKDGNITIAGSNVNAANVALIANNQVNLVNTTD
TDKTQSSNSSSGSSVGVSIGTNGIGVSASMQRAHGDGNSDAAIQNNTHINASQTATIVSGGDTNVIGANVNANKVVADVG
GNLNVASVQDTTVSAAHQSSAGGGFTISQTGGGASFSAQNGHADGNYAGVNEQAGIQAGSGGFDVTVKGNTDLKGAYIAS
TADASKNSLTTGTLTTSDIENHSHYSANSAGFSAGASVGVSTKAVGPSSVSGSGGVTPMVFQNDSGDQSATTKSAVSAGA
INITKPGEQTQDVANLNRDATNLNGTVSKTPDVQKMLSQQADTMNAAQAAGQTVSQGIGLYADGKRKDAIDAAKAAYERG
DLVAMQSYIDQAKSWDEGGASRAGLQATGGALIGGLGGGSVLTAIGGAAGAGTSSLLAGQAEKISKSVGDMTGSSLVGNI
AANVAATVGGALVGGSAGAAMASNVELYNAGNDPQKTDDRATIAGLQGLLNQAVAAGAKGLSTIANARNAIGNAISGALD
SAADQFGTLMKRDAEGKMSQSPAELVSQGVANGINTVLGSKGGEPPLAGPSAVAVDSLTGQAANAALGATDRTPPSNAIL
SNSNSDNNSTQGSQSGTVTKTPNPEATGSLSGKPTQIPPLSDEVTTRSLIRENQSAVTLANKGYDVVQNPEVLGPKNPDY
TINGQVFDNYAPATGNVRNIATTISNKVSSGQASNIVVNLADSSASPAAIEAQINSYPIPGLGKVIVIDKLGNITIIKPK
GN
>B3BM80 3.1.-.-~~~cdiA4~~~Deoxyribonuclease CdiA-o11~~~
MVNATLSVVQKNSAFVGSATGELAARAIGMLYPGVKQSDLSEEQKQTISTLATVSAGLAGGLTGSSTASAAVGAQSGKNA
VENNYLSTNQSLTFDKELSDCRKSGGNCQDIIDKWEKISDEQSAEIDQKLKDNPLEAQVIDKEVAKGGYDMTQRPGWLGN
IGVEVMTSDEAKAYVQKWNGRDLTKIDVNSPEWTKFAVFASDPENQAMLVSGGLLVKDITKAAISFMSRNTATATVNASE
VGMQWGQGNMKQGMPWEDYVGKSLPADARLPKNFKIFDYYDGATKTATSVKSIDTQTMAKLANPNQVYSSIKGNIDAAAK
FKEYALSGRELTSSMISNREIQLAIPADTTKTQWAEINRAIEYGKSQGVKVTVTQVK
>E0SDG8 3.1.-.-~~~cdiA~~~Deoxyribonuclease CdiA~~~COG3210
MAADTLMVTGAWLSNSGTLQGRQSVGLAVGRDFSQTADGVLTSGGTVTVTAGGVATAGALTAQGLALTAGRWRHQGAVTL
GGDGRLVLDELDNGGTLRAGGAWDMQAAALSNGGTLQGGRLALTLSGAAVNRGTLAGERVTLTADSLDNGGTLLGMDALT
LAIAGTARNQASGQWLSQGESRLTAGTLDNQGQWQGDSLSVTADRIRNAGQLLGLSALTLTADGTLTNTATGTLLTQGAA
VLRAATVDNDGEWQAGRLRLTADSLRNGGRIQSDGALDVALSPAGVLTNTGTLAANGDTTLTPGGLDNRGAVSVRGDLTV
TGTDLDNAGQLAARGALTLTGSYAGAGSLYSDAALTLRGTTLANDGGRWQGQTVDIGGGPLTNDGNITGLDSLTVTTTGA
LTNRGRLAGQTLGITADALDNAGTLLGVDALTLAIAGTARNQTSGQWLSNGAGRLTAGTLDNRGQWQGDSLDATADRLDN
AGTLLGLSAMTLTVNGALTNTGRLLTQGAAVLSAATADNDGEWQTGSLWLTADSLRNGGQIHSDGEVRITLPTADGDPLR
PTLRAARQLAQDVEAIGAGRLSNTGVLTAGGDGRITGRGLDNAGTLAAGGALTLAAGDLTNAGRLESRTLSLTGDSLDNG
GTLLAEQGGELTLGGGLHVGADGRLLSNGDWQVQAGTVTSLGQWQGKTLLLSAASLDNGGALLATDAVTLTLTQGYTGGA
GSQVLGSGAVTLTADTVTQQGDIGGDRLALTTGTLTNGGRLVGLSQLDVTSRGQLTNRATGSLLGNGTAGVTAATLDNAG
SVQADTLTLTADTVTNAGRMQGTSALTLNGVSRYTGTDGSQLLSGGTATLAIDNADNAGLWQAGELRFRGASLTSRGQIT
GLDSLTVDAASLTSTGQLTTRGLATLRGQRFDNGGTLTALGGFTARFSDSVTNQGGGQLLSGGTGSLTTGTLVNRGRWQS
DRLTLTADTLRNPGTLLGLDDGNIQLTGAYVGEAGSQVGGNGALSLSAATIDQAGQWQARDVTLRATRLRNQGSITGSGQ
LTATLDEQLENLAGATLLGGTVWLGGATVSNGGQIQGRSGLTVQGGTLLDNQGGGQLLSGGQLALGATQLTNAGWVQGQD
LTLTTAQLDNSGTLQAQSGLTLHLPQWTNRGTVQAGQLDITTDGALDNRGTLLGLTRLALQAASLNNADGARLYSAGGLQ
LRTGQLTQDGQLAALGDLRADIGTPFTFTRTLAAGGQLTLAVTGDLVQAGTLQGHGVTVTSTGTLTQQGRIVAGGGNSTL
SAAAISQTESGSIQGGGPLSLRATGNIVNRGFVGTAGDLLVQAGGVMENGSLLYGGGNLQLLSAALVNRFGNILAGGSLW
IQRDAAGNASDSVLNSSGTIETQRGDITVRTGTLTNQREGLVVTESGSTAADMPDWVGGTTIYIPVERFEVIKDYLVYSF
EHTPGAGSDSPTTYNYFYPFPLSHVSKQEFSASSKIVNIESKGGSSLIHSAGDINIFSSVLVNDASIIASEKNILMNGGV
LKNSSYQSGVMSESLIYEYERDDKDDFLPYIEWLWEKTKREEGVSDYDYWEYLSGYNIHAYNRNILTNDRFKYVLKDRQI
IFTPGQTYAATIQAGGAITANFSQNISNTNLQPGSGGFMPAMATPTLAGVNALGPVGAQADRGLNGGTAGNVSGSTLSGA
GNGVALAGQAGRLNAGYSAVTRDNTASSGSALNPVGIPAGPGTAGGAPVAGASLTPVAPGALALSDLQAALAQGLQQLGS
PSLTDYPLPTSQSGLFVADTAGDSRYLIRTNPTLSQLGQVDNALFGDLRGLLGQTPGTTAPVERSPTLTDPTQVLGSSYL
LGKLNLDAEHDYRFLGDAAFDTRYISNAVLSQTGQRYLNGVGSELAQMQQLMDNAAAEKSRLNLQLGVSLTPEQVAGLSH
SLVWWENITVGGQTVLAPKLYLAQADKTNLQGSRIVANSVSLSAGGDIDNRGSTVTAQDALAVASGGNLTNSEGGLLNAG
GALNLVALGNLTNSSATIQGNTVTLASVGGDIVNTTTTDQWQTAARDGRGRGSLTRTDIGQAGLISAQGGLTLQAGHDIA
LNGAQLSAGGPLQLAAGNDIRLTALSTVTDTVRQDGGATTERRGQGLVQSTVASGGDLSLSAGRDLSGTAAQLSAAGTLA
LSAGRDLSLLSASEEQFSSNAWKRHLDWQQTVTQQGTVLNAGEGLSLRAGQDLTLQGAQAETRGALTAQAGRDLSLLSAT
ESRHDFFEETTVKKGFLSKTTTHTLRETQQTTEKGTLLSAGSVALTAGHDIGVQGSAVAADGEVTLTAGNDITTAASVET
YRNYEEQSRKKSGVFSGGGIGFTIGSTSLRQTLESAGTTQSQSVSTLGSTGGSVRLNAGQAVSMAATDVIAARDIQVTGN
SVTIDPGYDTRKQSRQMEQKTAGLTVTLSGVVGSALNSAVQTVQAVREQSDSRLQALQGMKAALSGYQAYQGTQIDTNNQ
GASSFVGISVSLGAQRSSSSQTSEQSQSFASTLNAGHDISVVARQGDITAVGSQLKAANNVELNASRAINLLSARNTESM
TGSNSSSGGNIGVSFGLSNSGAGFSVFANVNAAKGRELGNGNSWSETTVDAGQQIALTSGGDTRLTGAQVSGERIVANVG
GDLLLKSQQDSNRYDSKQTSVSAGGSFTFGSMTGSGYLSASQDKMHSSFDSVQQQTGLFAGKGGYDISVGNHTQLDGAVI
GSTAGADKNRLDTGTLGFSNIDNRAEFSVSHSGIGLSASPSLSMSDMLKSAALTAPSALMSMGRGGNAGSTTYAAVSDGA
LIIRNQAGQQQDIAGLSREVEHANNALSPIFDKEKEQKRLQTAQMVGELGAQVMDVIRTEGEIRAVRAAEAKGDVKRPPD
NASEKDWDKYKKDLTETPAYKAVMQSYGTGSDLQRATQAATAAIQALAGGGNLQQALAGASAPYLAQLVKGVTMPADESK
ATASDIAANAMGHALMGAVVAQLSGKDAVAGAVGAAGGELTARLLIMKELYSGRDTSDLTEAEKQSVSALASLAAGLASG
IASGNTTGAATGAQAGRNAVENNSLGDIAQAQSEGKTLEQNAGEYVEAENERYKKENCAGLSAEACSVKMYEERREELKE
TLSTGADFVPVIGDIKSFAEAQSALDYLAAAVGLIPGAGDAAGKAIKAAETALKKGELAEASKLINKASDEIQAVKPLDV
GSYKELKDRAVVGDGLEHDHIPSFAALRTAKENELGRKLTPAEEKTLYQNATAVEVPKDVHRAGPTYGGKNTAAQVQQDA
LDLCGAVCRDTDALRTNMIERGYEPALVDDAVKKIIDRNRQIGVIK
>Q0T963 3.1.-.-~~~cdiA~~~tRNA nuclease CdiA~~~
MHQPPVRFTYRLLSYLISTIIAGQPLLPAVGAVITPQNGAGMDKAANGVPVVNIATPNGAGISHNRFTDYNVGKEGLILN
NATGKLNPTQLGGLIQNNPNLKAGGEAKGIINEVTGGNRSLLQGYTEVAGKAANVMVANPYGITCDGCGFINTPHATLTT
GRPVMNADGSLQALEVTEGSITINGAGLDGTRSDAVSIIARATEVNAALHAKDLTVTAGANRITADGRVSALKGEGDVPK
VAVDTGALGGMYARRIHLTSTESGVGVNLGNLYARDGDIILSSAGKLVLKNSLAGGNTTVTGTDVSLSGDNKAGGNLSVT
GTTGLTLNQPRLVTDKNLVLSSSGQIVQNGGELTAGQNAMLSAQHLNQTSGTVNAAENVTLTTTNDTTLKGRSIAGKTLT
VSSGSLNNGGTLVAGRDATVKTGTFSNTGTVQGNGLKVTATDLTSTGSIKSGSTLDISARNATLSGDAGAKDSARVTVSG
TLENRGRLVSDDVLTLSATQINNSGTLSGAKELVASADTLTTTEKSVTNSDGNLMLDSASSTLAGETSAGGTVSVKGNSL
KTTTTAQTQGNSVSVDVQNAQLDGTQAARDILTLNASEKLTHSGKSSAPSLSLSAPELTSSGVLVGSALNTQSQTLTNSG
LLQGEASLTVNTQRLDNQQNGTLYSAADLTLDIPDIRNSGLITGDNGLMLNAVSLSNPGKIIADTLSVRATTLDGDGLLQ
GAGALALAGDTLSQGSHGRWLTADDLSLRGKTLNTAGTTQGQNITVQADRWANSGSVLATGNLTASATGQLTSTGDIMSQ
GDTTLKAATTDNRGSLLSAGTLSLDGNSLDNSGTVQGDHVTIRQNSVTNSGTLTGIAALTLAARMVSPQPALMNNGGSLL
TSGDLTITAGSLVNSGAIQAADSLTARLTGELVSTAGSKVTSNGEMALSALNLSNSGQWIAKNLTLKANSLTSAGDITGV
DTLTLTVNQTLNNQANGKLLSAGVLTLKADSVTNDGQLQGNATTITAGQLTNGGHLQGETLTLAASGGVNNRFGGVLMSR
NALNVSTATLSNQGTIQGGGGVSLNVTDRLQNDSKILSGSNLTLTAQVLANTGSGLVQAATLLLDVVNTVNGGRVLATGS
ADVKGTTLNNTGTLQGADLLVNYHTFSNSGTLLGTSGLGVKGSSLLQHGTGRLYSAGNLLLDAQDFSGQGQVVATGDVTL
KLIAALTNHGTLAAGKTLSVTSQNAITNGGVMQGDAMVLGAGEAFTNNGMLTAGKGNSVFSAQRLFLNAPGSLQAGGDVS
LNSRSDITISGFTGTAGSLTMNVAGTLLNSALIYAGNNLKLFTDRLHNQHGDILAGNSLWVQKDASGGANTEIINTSGNI
ETHQGDIVVRTGHLLNQREGFSATTTTRTNPSSIQGMGNALVDIPLSLLPDGSYGYFTREVENQHGTPCNGHGACNITMD
TLYYYAPFADSATQRFLSSQNITTVTGADNPAGRIASGRNLSAEAERLENRASFILANGDIALSGRELSNQSWQTGTENE
YLVYRYDPKTFYGSYATGSLDKLPLLSPEFENNTIRFSLDGREKDYTPGKTYYSVIQAGGDVKTRFTSSINNGTTTAHAG
SVSPVVSAPVLNTLSQQTGGDSLTQTALQQYEPVVVGSPQWHDELAGALKNIAGGSPLTGQTGISDDWPLPSGNNGYLVP
STDPDSPYLITVNPKLDGLGQVDSHLFAGLYELLGAKPGQAPRETAPSYTDEKQFLGSSYFLDRLGLKPEKDYRFLGDAV
FDTRYVSNAVLSRTGSRYLNGLGSDTEQMRYLMDNAARQQKGLGLEFGVALTAEQIAQLDGSILWWESATINGQTVMVPK
LYLSPEDITLHNGSVISGNNVQLAGGNITNSGSSINAQNGLSLDSTGYIDNLNAGLISAGGSLDLSAIGDISNISSVISG
KTVQLESVSGNISNITRRQQWNAGSDSRYGGVHLSGTDTGPVATIKGTDSLSLDAGKNIDITGATVSSGGTLGMSAGNDI
NIAANLISGSKSQSGFWHTDDNSASSTTSQGSSISAGGNLAMAAGHNLDVTASSVSAGHSALLSAGNDLSLNAVRESKNS
RNGRSESHESHAAVSTVTAGDNLLLVAGRDVASQAAGVAAENNVVIRGGRDVNLVAESAGAGDSYTSKKKKEINETVRQQ
GTEIASGGDTTVNAGRDITAVASSVTATGNISVNAGRDVALTTATESDYHYLETKKKSGGFLSKKTTHTISEDSASREAG
SLLSGNRVTVNAGDNLTVEGSDVVADQDVSLAAGNHVDVLAATSTDTSWRFKETKKSGLMGTGGIGFTIGSSKTTHDRRE
AGTTQSQSASTIGSTAGNVSITAGKQAHISGSDVIANRDISITGDSVVVDPGHDRRTVDEKFEQKKSGLTVALSGTVGSA
INNAVTSAQETKESSDSRLKALQATKTALSGVQAGQAATMASATGDPNATGVSLSLTTQKSKSQQHSESDTVSGSTLNAG
NNLSVVATGKNRGDNRGDIVIAGSQLKVGGNTSLDAANDILLSGAANTQKTTGRNSSSGGGVGVSIGAGGNGAGISVFAG
VNAAKGSEKGNGTEWTETTTDSGKTVTINSGRDTVLNGAQVNGNRIIADVGHDLLISSQQDTSKYDSKQTSVAAGGSFTF
GSMTGSGYIAASRDKMKSRFDSVAEQTGMFAGDGGFDITVGRHTQLDGAVIASTATPDKNHLDTGTLGFSDLHNEADYKV
SHSGISLSGGGSFGDKFQGNMPGGMISAGGHSGHAEGTTQAAVAEGTITIRDRDNQKQNLANLSRDPAHTNDSISPIFDK
EKEQRRLQTVGLISDIGSQVADIARTQGELNALKAAQDKYGPVPADATEEQRQAYLAKLRDTPEYKKEQEKYGTGSDMQR
GIQAATAALQGLVGGNMAGALAGASAPELANIIGHHAGIDDNTAAKAIAHAILGGVTAALQGNSAAAGAIGAGTGEVIAS
AIAKSLYPGVDPSKLTEDQKQTVSTLATLSAGMAGGIASGDVAGAAAGAGAGKNVVENNALSLVARGCAVAAPCRTKVAE
QLLEIGAKAGMAGLAGAAVKDMADRMTSDELEHLITLQMMGNDEITTKYLSSLHDKYGSGAASNPNIGKDLTDAEKVELG
GSGSGTGTPPPSENDPKQQNEKTVDKLNQKQESAIKKIDNTIKNALKDHDIIGTLKDMDGKPVPKENGGYWDHMQEMQNT
LRGLRNHADTLKNVNNPEAQAAYGRATDAINKIESALKGYGI
>Q3YL96 ~~~cdiA~~~Toxin CdiA~~~
MHQPPVRFTYRLLSYLVSAIIAGQPLLPAVGAVITPQNGAGMDKAANGVPVVNIATPNGAGISHNRFTDYNVGKEGLILN
NATGKLNPTQLGGLIQNNPNLKAGGEAKGIINEVTGGKRSLLQGYTEVAGKAANVMVANPYGITCDGCGFINTPHATLTT
GKPVMNADGSLQALEVTEGSITINGAGLDGTRSDAVSIIARATEVNAALHAKDLTVTAGANRITADGRVSALKGEGNVPK
VAVDTGALGGMYARRIHLTSTESGVGVNLGNLYAREGDIILSSSGKLVLKNSLAGGNTTVTGTDVSLSGDNKAGGNLSVT
GTTGLTLNQSRLVTDKNLVLSSSGQIVQNGGELTAGQNAMLSAQHLNQTSGTVNAAENVTLTTTDDTTLKGRSVAGKTLT
VSSGSLNNGGTLVAGRDATVKTGTFSNTGTVQGNGLKVTATDLTSTGSIKSGSTLDISARNATLSGDAGAKDRALVTVSG
TLENRGRLVSDDVLTLSATQINNSGTLSGAKELVASADTLTTTEKSVTNSDGNLMLDSASSTLAGETSAGGTVSVKGNSL
KTTTTAQTQGNSVSVDVQNAQLDGTQAARDILTLNASEKLTHSGKSSAPSLSLSAPELTSSGVLVGSALNTQSQTLTNSG
LLQGKASLTVNTQRLDNQQNGTLYSAADLTLDIPDIRNSGLITGDNGLMLNAVSLSNPGKIIADTLSVRATTLDGDGLLQ
GAGALALAGDTLSLGSNGRWLTAGDLSLRGKTLHTAGTTQGQNLTVQADRWANSGSVQATGNLTASATGQLTSTGDIMSQ
GDTTLNAATTDNRGSLLSAGTLSLDGNSLDNSGTVQGNHVTIRQNGVTNSGTLTGIAALTLAARMDMASPQPALMNNGGS
LLTSGDLTITAGSLANSGAIQAADSLTARLTGELVSTAGSKVTSNGEMALSALNLSNSGQWIAKNLTLKANSLTSAGDIT
GVDALTLTVNQTLNNHASGKLLSAGVLTLKADSVKNDGQLQGNATTITAGQLTNGGHLQGETLTLAASGGVNNRSGGVLM
SRNALNVSTATLSNQGTIQGGGGVSLNATDRLQNDGKILSGSNLTLTAQVLANTGSGLVQAATLLLDVVNTVNGGRVLAT
GSADVKGTTLNNTGTFQGADLLVNYHTFSNSGTLLGTSGLGVKGSSLLQNGTGRLYSAGNLLLDAQDFSGQGQVVATGDV
TLKLIAALTNHGTLAAGKTLSVTSQNAVTNGGVMQGDAMVLGAGEAFTNNGTLTAGKGNSVFSAQRLFLNAPGSLQAGGD
VSLNSRSDITISGFTGTAGSLTMNVAGTLLNSALIYAGNNLKLFTDRLHNQHGDILAGNSLWVQKDSSGTANSEIINRSG
NIETTRGDITMNTAHLLNSWDAISASHEVIPGSSHGVISPVPENNRWWGVVRHDGVEYLAVYWGKGATVPDEYRIRTGDT
ETVTVSASGHAARISGGADMHIRAGRLDNEASFILAGGGMTLSGDTLNNQGWQEGTTGKETVWRLASGSLPKAWFTEPWY
KVYRQVSPDATEASGTSPAGQYRAVISAAGDVSASFATDTGNTTVMPRAGGAGNTITVPSLNSLTPPTVSQGVSGEALLN
ESGTGITGPVWNDALPDTLKDIPGALSLSGASVSSYPLPSGNNGYFVPSTDPDSPYLITVNPKLDGLGKVDSSLFAGLYD
LLRMQPGEAPRETDPAYTDEKQFLGSSYILDRLGLKPEKDYRFLGDAAFDTRYVSNVILNQTGSRYINGTGSDLAQMKYL
MDSAAAQQKALGLTFGVSLTAGQVAQLTRSLLWWESVTINGQTVMVPKLYLSPEDITLHNGSVISGNNVQLAGGNITNSG
SSINAQNDLLLDRTGSIDNLNAGLINAGGALNLKAIGDIGNISSVISGKTVSLESATGNISNLTRTEQWAMNNGYNHFSG
TDTGPLAAVRATDSLFMGAAGDISITGAAVSAGDSVLLAAGNDLNMNAIQAGERRRYGGSGWYETHAVAPTVTAGNSLML
SAGRDVNSQAAGITAENSMDIRAGRDVNMAAESTGAGDHDSTFSMKTVHDSVRQQGTDMTSGGDITVTAGRDITSVATAV
TAKGDIRVNAGHDIVLGTATESDYHYSESGETRNRLLSHQTTRTITEDSVTREKGSLLSGNRVTVNAGNNLTVQGSDVVA
DRDVSLAADNHVDVLAATSTDTSWRFKETKTSGLTGTGGIGFTTGSSKTTHDRREAGTTQSQSASTIGSTAGNVSITAGK
QAHISGSDVIANRDISITGDSVVVDPGHDRRTVDEKFEQKKSGLTVALSGAVGSAINNAVTMAREAKETSDSRLAALKGT
QAVLSGVQAGVNHGLQQQSADPNNGIGVSISLNHQQSKSETKYQHDIVSGSTLSAGNNVSVTATGKNKDHNNSGDMLITG
SQIKSGNDTSLNAQNDILLAAAADTRQTTGKNSSKGGGVGVSFGGGTNGGGLSIFAGINGSEGREKGNGTTWTETTLDAG
KNVSLTSGRDTTLSGAQVSGEKVTADVGNNLTISSLQDSDRYDSRQNRVAAGGSFTFGSMSGSGYASISQDKIKSNYDSV
REQSGIYAGKDGFDVTVGNHTQLNGAVIASTATDDKNSLNTNTLGWSDIHNQADYKASHTGISLSGGSGMSASQMVASNA
IAGAANALTGMSGSSGHAEGTTSSAISGGNLIIRNKESQKQDIAGLSRDPENANGSIAPIFDREKEQKRLQEAQVISQIS
GQMSNIVMTYGETEAMKAARKEHPGMSDAQLRETPEYREVMKGYGTGSTPQMVVQAITGVLGGLNAGNPGQVLAGGLNPA
VAQLIKQATGDNREANLMAHAVWGALAAQLGGNNAASGAAGAFSGELAARYIIDNYYGGRTDNLSEQERQQISMLATIAS
GIAGGLVGNSTSAAGTGAQAGRNSVENNAMSGLEGFGTGFQSYVQAQEALVNNTNLTDKNGKVLNPATPEEIKYASDKLV
TGSIPEGQDPARGLLISWGAGASVFGGELIAPAVGTVAVIGGTLLGGTTDAVKQFLTLKPGEQYSTTDTLIAAGEGGLTQ
GKGVIFSTFINTMGAYLGSKAKGEDPTGPMVGNAIGTALGNKAGDKFTKEMLSRGFGSVTSEVTGTVTGSVIGTVTDYQI
EKLGKGNKEGAK
>P0DSI1 3.1.-.-~~~cdiA~~~tRNA nuclease CdiA~~~
MHQPPVRFTYRLLSYLISTIIAGQPLLPAVGAVITPQNGAGMDKAANGVPVVNIATPDGAGISHNRFTDYNVGKEGLILN
NATGKLNPTQLGGLIQNNPNLKAGGEAKGIINEVTGGNRSLLQGYTEVAGKAANVMVANPYGITCDGCGFINTPHATLTT
GRPVMNADGSLQALEVTEGSITINGAGLDGTRSDAVSIIARATEVNAALHAKDLTVTAGANRITADGRVSALKGEGDVPK
VAVDTGALGGMYARRIHLTSTESGVGVNLGNLYARDGDITLDASGRLTVNNSLATGAVTAKGQGVTLTGDHKAGGNLSVS
SRSDIVLSNGTLNSDKDLSLTAGGRITQQNEKLTAGRDVTLAAKNITQDTASQINAARDIVTVSSDTLTTQGQITAGQNL
TASATTLTQDGTLLAKGHAGLDAGTLNNSGAVQGASLTLGSTTLSNSGSLLSGGPLTVNTRDFTQSGRTGAKGKVDITAS
GKLTSTGSLVSDDVLVLKAQDVTQNGVLSGGKGLTVSAQALSSGKKSVTHSDAAMTLNVTTVALDGENSAGDTLRVQADK
LSTAAGAQLQSGKNLSINARDARLAGTQAAQQTMAVNASEKLTHSGKSSAPSLSLSAPELTSSGVLVGSALNTQSQTLTN
SGLLQGEASLTVNTQRLDNQQNGTLYSAADLTLDIPDIRNSGLITGDNGLTLNTASLSNPGKIIADTLNVRATTLDGDGL
LQGAGALALAGDTLSQGRNGRWLTAGDLSLRGKTLHTAGTTQGQNLTVQADNWANSGSVLATGNLTASATGQLTSTGDIM
SQGDTTLNAATTDNRGSLLSAGTLSLDGNSLDNRGTVQGNHVTIRQNSVTNSGTLTGIAALTLAARMDMASPQPALMNNG
GSLLTSGDLTITAGSITSSGHWQGKQVLITADSLANSGAIQAADSLTARLTGELVSTAGSKVTSNGEMALSALNLSNSGQ
WIAKNLTLKANSLTSAGDITGVDALTLTVNQTLNNHASGKLLSAGVLTLKADSVKNDGQLQGNATTITAGQLTNGGHLQG
ETLTLTASGGVNNRSGGVLMSRNALNVSTATLSNQGTIQGGGGVSLNATDRLQNDGKILSGSNLTLTAQVLANTGSGLVQ
AATLLLDVVNTVNGGRVLATGSADVKGTTLNNTGTLQGADLLVNYHTFSNSGTLLGTSGLGVKGSSLLQNGTGRLYSAGN
LLLDAQDFSGQGQVVATGDVTLKLIAALTNHGTLAAGKTLSVTSQNAITNGGVMQGDAMVLGAGEAFTNNGMLTAGKGNS
VFSAQRLFLNAPGSLQAGGDVSLNSRSDITISGFTGTAGSLTMNVAGTLLNSALIYAGNNLKLFTDRLHNQHGDILAGNS
LWVQKDASGGANTEIINTSGNIETHQGDIVVRTGHLLNQREGFSATTTTRTNPSSIQGMGNALVDIPLSLLPDGSYGYFT
REVENQHGTPCNGHGACNITMDTLYYYAPFADSATQRFLSSQNITTVTGADNPAGRIASGRNLSAEAERLENRASFILAN
GDIALSGRELSNQSWQTGTENEYLVYRYDPKTFYGSYATGSLDKLPLLSPEFENNTIRFSLDGREKDYTPGKTYYSVIQA
GGDVKTRFTSSINNGTTTAHAGSVSPVVSAPVLNTLSQQTGGDSLTQTALQQYEPVVVGSPQWHDELAGALKNIAGGSPL
TGQTGISDDWPLPSGNNGYLVPSTDPDSPYLITVNPKLDGLGQVDSHLFAGLYELLGAKPGQAPRETAPSYTDEKQFLGS
SYFLDRLGLKPEKDYRFLGDAVFDTRYVSNAVLSRTGSRYLNGLGSDTEQMRYLMDNAARQQKGLGLEFGVALTAEQIAQ
LDGSILWWESATINGQTVMVPKLYLSPEDITLHNGSVISGNNVQLAGGNITNSGGSINAQNGLSLDSTGYIDNLNAGLIS
AGGSLDLSAIGDISNISSVISGKTVQLESVSGNISNITRRQQWNAGSDSRYGGVHLSGTDTGPVATIKGTDSLSLDAGKN
IDITGATVSSGGTLGMSAGNDINIAANLISGSKSQSGFWHTDDNSASSTTSQGSSISAGGNLAMAAGHNLDVTASSVSAG
HSALLSAGNDLSLNAVRESKNSRNGRSESHESHAAVSTVTAGDNLLLVAGRDVASQAAGVAAENNVVIRGGRDVNLVAES
AGAGDSYTSKKKKEINETVRQQGTEIASGGDTTVNAGRDITAVASSVTATGNISVNAGRDVALTTATESDYHYLETKKKS
GGFLSKKTTHTISEDSASREAGSLLSGNRVTVNAGDNLTVEGSDVVADQDVSLAAGNHVDVLAATSTDTSWRFKETKKSG
LMGTGGIGFTIGSSKTTHDRREAGTTQSQSASTIGSTAGNVSITAGKQAHISGSDVIANRDISITGDSVVVDPGHDRRTV
DEKFEQKKSGLTVALSGTVGSAINNAVTSAQETKESSDSRLKALQATKTALSGVQAGQAAAMATATGDPNATGVSLSLTT
QKSKSQQHSESDTVSGSTLNAGNNLSVVATGKNRGDNRGDIVIAGSQLKAGGNTSLDAANDVLLSGAANTQKTTGRNSSS
GGGVGVSIGAGGNGAGISVFASVNAAKGSEKGNGTEWTETTIDSGKTVTINSGRDTVLNGAQVNGNRIIADVGHDLLISS
QQDTSKYDSKQTSVAAGGSFTFGSMTGSGYIAASRDKMKSRFDSVAEQTGMFSGDGGFDITVGNHTQLDGAVIASTATAD
KNSLDTGTLGFSDIHNEADYKVSHSGISLSGGGSFGDKFQGNMPGGMISAGGHSGHAEGTTQAAVADGTITIRDRDNQKQ
NLANLSRDPAHANDSISPIFDKEKEQRRLQTVGLISDIGSQVADIARTQGELNALKAAQDKYGPVPADATEEQRQAYLAK
LRDTPEYKKEQEKYGTGSEIQRGIQAATAALQGLAGGNLAGALAGASAPELAHLLKSTEKDPAVNAIAHAILGGTVAAMQ
GNNVAAGAAGAATGELAARAIAGMLYPGVKQSDLSEEQKQTISTLATVSAGLAGGLTGNSTASAAVGAQSGKNAVENNYL
SVSEKTELEIAKQTLKNSKDPAEREKAQQKYDALLEKDIASDKEVIAACSNGNASSSACASARLKVIASKEGYEDGPYNS
KYSQQYADAYGQIVNLLDITSVDAQNQQQVKNAMINYFMVTKGVDRQTAESYTETTQGLEIIAASVTPLIGQAASNKLSY
LGIGKKISFDGDFYTVDGMKFSKSYYEKLWEQGRPAPFVQAREVLNSNPKIEPDPRGAPGYLRYEGAGLEMIYNPKTGQV
GHIQPVKVK
>A0A1S4NYE3 3.1.-.-~~~cdiA~~~tRNA nuclease CdiA~~~
MHQPPVRFPYRLLSYLISTIIAGQPLLPAVGAVITPQNGAGMDKAANGVPVVNIATPNGAGISHNRFTDYNVGKEGLILN
NATGKLNPTQLGGLIQNNPNLKAGGEAKGIINEVTGGNRSLLQGYTEVAGKAANVMVANPYGITCDGCGFINTPRATLTT
GRPVMNADGSLQALEVTEGSITINGAGLDGTRSDAVSIIARATEVNAALHAKDLTVTAGANRVTADGRVSALKGEGDVPK
VAVDTGALGGMYARRIHLTSTESGVGVNLGNLYAREGDIILSSSGKLVLKNSLAGGNTTVTGTDVSLSGDNKAGGNLSVT
GTTGLTLNQSRLVTDKNLVLSSSGQIVQNGGELTAGQNAMLSAQHLNQTSGAVNAAENVTLTTTGGITLKGRSVAGKTLT
VSSGSLNNGGTLGAGRDATVKTGTFSNTGAVQGNGLKVTATDLTSTGSIKSGSTLDISARNATLSGDAGAKDSARVTVSG
TLENRGRLVSDDVLTLSATQINNSGTLSGAKELVASADTLTTTEKSVTNSDGNLMLNSASSTLAGETSAGGTVSVKGNSL
KTTTTAQTQGNSVSVDVQNAQLDGTQAARDILTLNASEKLTHSGKSSAPSLSLSAPELTSSGVLVASALNTQSQTLTNSG
LLQGEASLTVNTQRLDNQQNGTLYSAADLTLDIPDIRNSGLITGDNGLTLNTASLSNPGKITADTLNVRATTLDGDGLLQ
GAAALALAGDTLSQGSHGRWLTAGDLSLRGKTLNTAGTTQGQNLTVQADRWANSGSVLATGNLTASATGQLTSTGDIMSQ
GDTTLNAATTDNRGSLLSAGTLSLDGNSLDNSGTVQGNHVTLHHRSTDNSGTVTGLSGLTLHSADGLTNSGALLSQNSLV
LSAGDVTNSGRIQGQNITLDASSLTSSGAVQSALDLALTLSGDVIAATGSKITALGDARLTGKVLGNQGLISAKTLEVNG
DSLSNSGEISGVNSLNVTLSGNLQQHGKMLTGGALNVNARDISNSGQLQGADNRITASSLANSGRVQGESGLTLTLLNAL
TNQTSGVLLSQNVSALSAPVLTNDGTIQGNGKTTLSAATQAHNSGKILSGGELTFTTPDYSGSGWLQATDLLLNVAKLAG
NGTVMAANQATLTGNSLTNRGLFQAAQLNVNTQTITNSGTLLGNQGLTIKGNNLNNAGGKVFSGGDMLAEMVSLSGAGQL
VALGNLTLKLTRGLTAQGVIAANKQLSVSSQGDITNGATLQGNGITLNAAGRLTNNGQLTAGNGTTALSGSGIAMNASGS
LQAGGDVSLTSRGDITLDAFTGTTGSLMLTAAGAVINTALLYAGNNLSLFASTIRNHHGDMLAGDSLVMQKDVSGAANAE
VINTSGNIETTRGDITIRTGHLLNQREGINETKSYIPVENVAVPDGANSVSVRVGDLGEDGWGYYVKSWSGTAGGGFDAW
AVPTEKGATRKFLTGTTRVDVGATGGDARISAGNNLLIDADKLDNTGSHLLASGFVSLSGSQLNNQSFFGYTQDEYNVYR
YYGKLAMIPNDGHLQYGDASADDRVTFTLSGAPEYVTRDTGQALRAVIQAGKNVTAVFSSDISNTSTTSNAGRITNTLAA
PEINTPAEKNISPRMAQLAPDGTEMLTVTAPDWTDTITRLTIGSGTDLASGIVEGNYPLPSGNNGYFVPSADPDSPYLIT
VNPKLDGLGKVDSSLFAGLYDLLRMHPGQAPRETDPAYTDEKQFPGSSYFLDRLGLKPEKDYRFLGDAAFDTRYVSNYML
NQIGGRYINGVGSDTDQMRYLMDNAARAQKALGLKFGVALTADQVAALDQSILWYKAVTIKGQTVMVPEVYLSPKDVTLQ
NGSIISGQNVHLAGGNVTNSGSTLMAQNNLTIDSADSLGNLESGLINAGGALGLKAMGDINNISATITGKTVRLESLAGN
VNNLTRYSHWQLDAPEDSLALKHTYTGSIASVSAMDSLDIRADKNISVTGAEISAGDRAALIAGNDLSLNAIDRVSSRRH
ANSESHQRSAGLTTITAGDSVMLSAGRDVSSQGAGIAAEDNITVRAGRDVNLLAEESVTGSSSYSKKKTVIDETVRQQGA
EIASGGDTTITAGRDITAVASSVTATGNISVNAGRDVALTTATESDYHYLETKKKSGGFLSKKTTHTISENSATREAGAL
LSGNRVTVNAGDNLTVQGSDVVADRDVSLAAGNHVDVLAATSTDTSWRFKETKKSGLMGTGGIGFTIGSSKTTHDRREAG
TTQSQSASTIGSTAGNVSITAGKQAHISGSDVIANRDISITGDSVVVDPGHDRRTVDEKFEQKKSGLTVALSGTVGSAIN
NAVTSAQETKESSDSRLKALQATKTALSGVQAGQAAAMATATGDPNATGVSLSLTTQKSKSQQHSESDTVSGSTLNAGNN
LSVVATGKNRGDNRGDIVIAGSQLKAGGNTSLDAANDILLSGAANTQKTTGRNSSSGGGVGVSIGAGKGAGISVFASVNA
AKGSEKGNGTEWTETTTDSGKTVTINSGRDTVLNGAQVNGNRIIADVGHDLLISSQQDTSKYDSKQTSVAAGGSFTFGSM
TGSGYIAASRDKMKSRFDSVAEQTGMFAGDGGFDITVGRHTQLDGAVIASTATPDKNHLDTGTLGFSDLHNEADYKVSHS
GISLSGGGSFGDKFQGNMPGGMISAGGHSGHAEGTTQAAVAEGTITIRDRDNQKQNPADLSRDPAHANDSISPIFDKEKE
QRRLQTVGLISDIGSQVADIARTQGELNALKAAKEATGETLPANATEKQRQEYLAKLRDTPEYKKEQEKYGTGSEIQLGI
QAATAALQGLAGGNLAGALAGASAPELAHLLKSTEKDPAVNAIAHAILGGAVAAMQGNNVAAGAAGAATGELAARAIAGM
LYPGVKQSDLSEEQKQTISTLATVSAGLAGGLTGNSSASAAVGAQSGKNAVDNNYLSVSEKTELEIAKQTLKNSKNPAER
EKAQQKYDALLEKDIASDKEVIAACGNGNAGSSACASARLKVIASKEGYEDGPYNSKYSQQYADAYGQIVNLLDITSVDV
QNQQQVKDAMVSYFMATLGVDQKTAQGYVETTQGLEIAAASMTPLFGQAVANKITALVDKANKYPSGIGFKINQPEHLAQ
LDGYSQKKGISGAHNADVFNKAVVDNGVKIISETPTGVRGITQVQYEIPTKDAAGNTTGNYKGNGAKPFEKTIYDPKIFT
DEKMLQLGQEAAAIGYSNAIKNGLQAYDAKAGGVTFRVYIDQKTGIVSNFHPK
>D5CBA0 3.1.-.-~~~cdiA~~~16S rRNA endonuclease CdiA~~~COG3210
MMKQDQVRFSQRALSALLSVLLATQPLLPAVAASITPSGNTQMDKAANGVPVVNIATPNQSGISHNKYNDYNVGKEGLIL
NNATGQLNQTQLGGLIQNNPNLKAGQEAKGIINEVTGANRSNLQGYTEVAGKAANVIVANPYGITCNGCGFINTPNVTLT
TGKPVLDASGKLQSLDVTQGAVTIEGAGLNGSQSDAVSIISRATEINVQLHAKDLRVVAGANRVAADGSVSALKGEGTAP
KVAVDTGALGGMYANRIRLVSSETGVGVNLGNLNARQGDIALSSAGKVVLKNTLASGSTTVSAADVTLRGDHKAGGNVTV
SGQTALTLDQAHVAADNNLQLTTRGTLTQNGGAFTAANDATLAATTLIQSVDAQASAGRHLAVNAEKNAALNGSVVAGQQ
LSVKGGELVQQGNLSASEIALNAQTLTQESRSTTNASGNITLTTSGHSQLKGSTTAGQSLAVSAGSLANHGALAAVADTR
INTGIFSNTGTVQGNSLTVSGTDITSSGALKSASTLDIRADNATLSGETGAKGKTTVTASGNLNNSGTLISDDTLTLNAA
QIVNSGTLSGVRGLTTSGKTFTASATSVTQSDGDVALNNTDTTLAGETSAGGAVTVQGRSLNTTATAQTQGNSVGVAVQN
AKLEGTQAAKGNMTLKADSSLNHTGKSSASGLKVETGHLSNSGTLTASALVIDSPEVINGGLIHAGQTLSLVTRLLDNRS
SGVLYSPSALSLSLSELNNAGIITSDAALSLSGSNLTNSGELSGTSLAIDYETLKNSAEGMLLAQGANRITAQSVSSAGS
MVGNTLTLNADRLESAGLLQGDSALSLTAGILNLLTGSRTLTGGALGLSGTTLTTAGQLQGQDVSIRSHDWTNRGSSLAT
GSLDVTTAGTLSNTGELMSQGNGTLNAVTTVNSGNMLSAGDLSLNGKTLRNSGTLQGNRVTAHQDTITNSGTLTGIAALM
LAARLEMAAPLLTLVNDASGSLLTAGELSVTGGDLRNAGQWQGKRVLIHAQALTNGGAIQAENLLDAQIDSTLTGTAGSK
ITSNGELALSALTLANSGQWIAKHLTLGASTLNNSGEITGVVALSVALTQLNNQAGGKLLSAGALTLDVENATNAGQIQG
KATTVTAGQLINSGRLQGEALTLNASGALNNTASGVLLSENALTVSTATLNNQGTLQGGGESSVKATTRVQNDGKMLSGG
KLTLTAPELANSSSGLVQAVRLLLDVVKAVNGGNVLATTRAELRGSSLDNSGTLQGADLQANYQSVTNSGTVLGTTSLTI
NGDALDNTESGKLYSGDKLLLDVRNYSGRGDVVSLGDTTLKLVNALVNTGTLAASKTLSVSSQNAMTNSGVMQGNAIALS
AGGAFTNNGTLTTGNGSSTFNAQSLLLNASGSLQAGGDVQLTSRENITVNGFTGTAGSLTMTAAGTLLNTALIYAGNNIS
LFAARIHNIYGDILADNSLWMQKNAVGEANAEVVNRSGTIETTRGDITVNTGHLLNEADGLTVSQSEREYPDAIPAADEH
YFSYDLNGRRSDFVLLLEDWKNDGSKVVYDWYEQCLGSGANGSGQCRDRVDYRLTGEDIRQFLLSESVVSVSATGSSARI
AAGRDITINAGTLDNRASHILAGRNAVLAGGTLNNLSAEGGRRVTYVQAEYRCEWFYRDCSDSKWEPLTQYPDGSWGWFD
EDYGWYGWVPYILGERTTEFVADGGVYRSVISAGGNVSANFTSDISNTNVTANSGEFSNTIDAPTLNTLSPEAIGKGLNS
ESLAQGGSADIRFPEQLGNITDALKDISGGSSLSDQNGSSGNYPLPSGNNGYFVPSTDPDSPYLITVNPKLDELGNMDDS
LFNGLYDLLGITPGATPRETNSAYTDRNQFLGSSYFLDRLGLNPDRDYRFLGDAAFDTRYVSNAILNQTGSRYINGIGSD
LDQMRYLMDSAAEQQKTLGLKFGVALTAEQVAALDKSMLWWESATINGQTVMIPKVYLSPKDVTVHSGSVISGNNVQLAG
GNVINSGSTIAAQNGLSIDSSNSLSNLNAGLLSAGGGLNLSALGDINNIGSTISGKTVGLESVAGSINNITRAQQWNVDA
GNVHFSGTDVGKTASITATDGLTMRAGQDINVTGANVSAGGSLGMAAGNDINITANEIVTSEGRAGRNRATTETASVTHQ
GSTLSAGDDLTLQAGNDVNARAAAIAAEGDVGIQAGRDVDLLAEASMERSSSQAKKKTAIDESVRQQGTEIASGGNTVIL
AGRDVTAQAADVTAQGDIGVAAGRDVNLTTATESDYRYREQTKTSSGFLSKKTTHTIEEESATREKGSLLSGDNVTVSAG
NNLRVLGSAVAGDGDVALSAGNNVDIVAATNTDTAWRFKETKKSGLMGTGGIGFTIGSSKSTHDLREQGTTQSESFSTVG
STGGNVSIAAGKQAHIGGADIIAQKDISLTGDSVVIEPGHDKRTRDEKFEQKSSGLTVALSGAAGSAVNNAVTTAQSAKQ
SSDSRLAALQGTQAALSGVQAGQAVALDQVKGDSDKRNNNTIGVSASIGSQSSKSSSHMESETTTGSTLSAGNNVTIKAT
GSDITVAGSQIKAGKDVTLDAARDVNLIASQDTQQTTGKNSSSGGSLGVGVGVGSGGAGISISANANSSKGHEKGNGVWQ
NETTVDAGNRVTINTGRDATIAGAQVSGETVVADIGRDLTIASTQDSDHYNSKQNSVSGGAGYTFGAGGFSGSINVSRDK
MTSDYDSVQEQSGLFAGNGGFDVTVGNHTQLDSGVIASTATADKNRLDTGTLGFSDIHNQADFKTEHQGAGISSGGSIGK
QFAGNMANALLAGGGNSGHAEGTTQAAVSEGTLIIRDKENQKQDVADLSRDAEHANGSISPIFDKEKEQQRLQEVQLIGE
IGSQVVDIANTQGEINGLNAGRKELADKGITEPGADASDEVKAAYQNALRETDAYKTTTAKYGTGSDLQRGIQAATAALQ
GLAGSDLTAALAGASAPELAYRIGHGMGIDNNTAAKTIAHAILGGAVAALQGNSAAAGAAGAATGELAAKAIAGMLYPDV
KDLSTLSEEQKQTVSALATISAGMAGGLAGDSTGSAVAGGQAGKNAAENNSLALVARGCAVAAPCRTKVAEQLLEIGAKA
GIAGLAGAAVKDMADKMTSDELEHLVTLEMMGNDEIIAKYVSLLHDKYAPSHTGGNLLPETLPGHTGNNTGSVDTGPNHT
GNTNRQNDSGSNNTGNTEGAPNTGGNTTITPIPNGPSKDDIAYLALKGKEAQEAASNLGFDRRIPPQKAPFNSHGQPVFY
DGKNYITPDIDSHNVTNGWKMFNSKGKRIGTYDSGLNRIKD
>A0A0H3B0B8 3.1.-.-~~~cdiA~~~Deoxyribonuclease CdiA~~~
MLSAGSIDVSSQNIAVAGSSVVADKDIRLRAQENLTVSTAQQSESGTQLFEQKKSGLMSTGGIGVFIGTSRQKTTDQTQT
VSHIGSTVGSLTGNVRLEAGNQLTLHGSDVVAGKDLALTGADVAISAAENSRSQQYTAESKQSGLTVALSGPVGSAVNTA
VTTAKAAREENTGRLAGLQGVKAALSGVQAVQAGQLVQAQGGGITEMVGVSVSLGSQKSSSQQQQEQTQVSGSALTAGNN
LSIKATGGGNAANSGDILIAGSQLKAGGDTRLDAARDVQLLGAANRQKTDGSNSSRGGSIGVSVGGSGLSVFANANKGQG
NERGDGTFWTETTVDSGGMFSLRSGRDTTLTGAQVSAETVKADVGRNLTLQSQQDRDNYDAKQSRASGGISVPVAGGGAA
VNLSMSRDRLSSQYDSVQAQTGIFAGSGGVDIRVGEHTQLDGAVIASTAAADKNTLDTGTLGFSDIKNKAVFTVEHQGGS
LSTGGPVGSDLLSNLGGMVLAGLGNGGYAEGTTQAAVSEGTITVRDTENQQQNVDDLSRDTGNANGSIGPIFDKEKEQNR
LKEVQLIGEIGGQALDIASTQGKIIATHAANDKMKAVKPEDIVAAEKQWEKAHPGKAATAEDINQQIYQTAYNQAFNESG
FGTGGPVQRGMQAAIAAVQGLAGGNMGAALTGASAPYLAGVIKQSTGDNPAANTMAHAVLGAVTAYASGNHALAGAAGAA
TAELMAPTIISALGWDKNTLTEGQKQAVSALSTLAAGLAGGLTGDSTADALAGGQAGKNAVENNYLNSTQALTFDKELSD
CRKSGGNCQAVIDKWKKVSDEQSVKLDETLKNNPLEAQVWDKEVAQGGIAITERPGWLSSLGADVMSSEEAKAYVQQWNG
QDLSKIDVNSPGWTKFAAFASDPENQVAVASLGMLGKDLTKAALSYMGRNTSTATVSASSVGMKWGQGNMKQGMPWEDYV
GKTLPVGSRLPPNFKTYDYFDRATGAVVSAKSLDTQTMAKLSNPNQVYSSIKKNIDVTAKFEKASLSGVTVNSSMITSKE
VRLAVPVNTTKAQWTEINRAIEYGKNQGVKVTVTQVK
>Q3YL97 ~~~cdiB~~~Outer membrane transporter CdiB~~~
MRHRQDNLLANRTLLPGMASGQYVFRLCTFSPVVRYFSLLPCLCILSFSSPAAMLSPGDRSAIQQQQQQLLDENQRQRDA
LERSAPLTITPSPETSAGTEGPCFTVSSIVVSGATRLTSAETDRLVAPWVNQCLNITGLTAVTDAMTDSYIRRGYITSRA
FLTEQDLSGGVLHITVMEGRLQQIRAEGADLPARTLKMVFPGMEGKVLNLRDIEQGMEQINRLRTEPVQIEISPGDREGW
SVVTLTALPEWPVTGSVGIDNSGQKSTGTGQLNGVLSFNNPLGLADNWFVSGGRSSDFSVSHDARNFAAGVSLPYGYTLV
DYTYSWSDYLSTIDNRGWRWRSTGDLQTHRLGLSHVLFRNGDMKTALTGGLQHRIIHNYLDDVLLQGSSRKLTSFSVGLN
HTHKFLGGVGTLNPVFTRGMPWFGAESDHGKRGDLPVNQFRKWSVSASFQRPVTDRVWWLTSAYAQWSPDRLHGVEQLSL
GGESSVRGFKDQYISGNNGGYLRNELSWSLFSLPYVGTVRAVAALDGGWLHSDSDDPYSSGTLWGAAAGLSTTSGHVSGS
FTAGLPLVYPDWLAPDHLTVYWRVAVAF
>P0DMJ7 ~~~cdiI2~~~Immunity protein CdiI-2~~~
MAIDLFCYLSIDRGAAESDLNKIRSNHSELFEGKFLISPVRDADFSLKEIAAEHGLVAESFFLVSLNDKNSADLIPIVSK
ILVDGFNGGAILILQDNEYRR
>B3BM81 ~~~cdiI4~~~Immunity protein CdiI-o11~~~
MAFNKDQDYWANIFVTPDFLSVETYSGLGMTGRDPLFSPRLLQPDVDDKSLGEEILQALSDSRTLDVLEERVAFFDLEKS
KEQYAAWIATLMEKYGYRTKRALFKNMKKVGIHLVNDVITIRPSFHEKLEAWSGNRINESDYVVLPADSSPTEIGSGLRL
ALSRCKG
>H9T8I0 ~~~cdiI~~~Immunity protein CdiI~~~
MNIDLQRRYDSSDDFFSLGGSVVMKLSADAAIAVCERAGQHGLVVARIEGGIWHFPGFEARLDCIWDGIDPPVDVGVAEQ
NNLAAAEFVRSESQEHDVFVVTAPEITGW
>H9T8G7 ~~~cdiI~~~Immunity protein CdiI~~~
MAGSIVISKEVRVPVSTSQFDYLVSRIGDQFHSSDMWIKDEVYLPMEEGGMSFISTESLNSSGLSIFLATVMRARAASQA
EESFPLYENVWNQLVEKLRQDARLGVSGN
>E0SDG7 ~~~cdiI~~~Immunity protein CdiI~~~
MLAWNNLVEMLGCSKASNEFIYLPQKLNELPVFEEGVLGDRSYYSFFNSGVLFLLEDDLVNQISLYIQADEGFSAYTGEL
PLPVNSRESEIIQVLGTPSGSGGGKMDMLLGYVNRWIKYKTESHTLHIQFDQNDQLCRVTLMQ
>Q0T964 ~~~cdiI~~~Immunity protein CdiI~~~
MITLRKLIGNINMTKEPEQQSPLELWFERIIDVPLEKLTVEDLCRAIRQNLCIDQLMPRVLEVLTKEPLAGEYYDGELIA
ALSTIKGEDLKDQKSTFTQIRQLINQLEPSDINDDLRKDILKINQIIV
>P0DSM8 ~~~cdiI~~~Immunity protein CdiI~~~
MDIWPEFQRDLEMYRDVVLSIKRNLRLYEECIESLVHQIGSTNFDNAQPLFDDLFRMQSELATMLYKYEYKPGKRIQDLI
YHLDRDDFYSRKYWHKKFSDGLAWPE
>A0A1S4NYE4 ~~~cdiI~~~Immunity protein CdiI~~~
MNKYLFELPYERSEPGWTIRSYFDLMYNENRFLDAVENIVNKESYILDGIYCNFPDMNSYDESEHFEGVEFAVGYPPDED
DIVIVSEETCFEYVRLACEKYLQLHPEDTEKVNKLLSKIPS
>A0A023GPJ0 ~~~cdiI~~~Immunity protein CdiI~~~
MFGIFSKGEPVSMEGELVQPSSIVINDYEEELHLPLSYWDIKDYKNSWLKSLGEGLSNKTHSALAVSMYEPEKTNFIFTW
VLYFEDEKVYVQNNVIFLEECHGFSPENINKFIESRTTHDGDGMKISEWHTDLNSVLDFYHSLNN
>A0A0R4I987 ~~~cdiI~~~Immunity protein CdiI-YPIII~~~
MNDIVKSAWASVKMNTDFICVDTYSGYRSNQLDPLGVQHLSSPDVSDLDLGEMVKDALSHSRFVLPAPRTDIWIHPEVTF
DLDLYDSRRTVERYDEWVKKLMVHYGYKTKRALFKDMKSCDICCNHDAITISPTRHEKLEVWGGTGLKGSDNVILSVDSS
PTEIGAGLRLALSRCKG
>Q0P8J9 2.7.1.224~~~~~~Cytidine diphosphoramidate kinase~~~COG0529
MKNNPYIIWLTGLAGSGKTTIGQALYEKLKLKYKNLIYLDGDELREILGHYAYDRQGRIDMALKRAKFAKFLNDQGMMVI
VTTISMFNEIYDYNRKQLKNYYEIYIECDMHELIQRDQKGLYTKALNKEIDNVVGVDIEFDKPEADLVINNSCRNNLEEK
VELIIKKLAL
>Q65EX3 2.3.2.22~~~yvmC~~~Cyclo(L-leucyl-L-leucyl) synthase~~~
MTELIMESKHQLFKTETLTQNCNEILKRRRHVLVGISPFNSRFSEDYIHRLIAWAVREFQSVSVLLAGKEAANLLEALGT
PHGKAERKVRKEVSRNRRFAEKALEAHGGNPEDIHTFSDFANQTAYRNLRMEVEAAFFDQTHFRNACLEMSHAAILGRAR
GTRMDVVEVSADMLELAVEYVIAELPFFIAAPDILGVEETLLAYHRPWKLGEQISRNEFAVKMRPNQGYLMVSEADERVE
SKSMQEERV
>O34351 2.3.2.22~~~yvmC~~~Cyclo(L-leucyl-L-leucyl) synthase~~~
MTGMVTERRSVHFIAEALTENCREIFERRRHVLVGISPFNSRFSEDYIYRLIGWAKAQFKSVSVLLAGHEAANLLEALGT
PRGKAERKVRKEVSRNRRFAERALVAHGGDPKAIHTFSDFIDNKAYQLLRQEVEHAFFEQPHFRHACLDMSREAIIGRAR
GVSLMMEEVSEDMLNLAVEYVIAELPFFIGAPDILEVEETLLAYHRPWKLGEKISNHEFSICMRPNQGYLIVQEMAQMLS
EKRITSEG
>Q4JVS0 2.3.2.22~~~~~~Cyclo(L-leucyl-L-leucyl) synthase~~~
MGESKQEHLIVGVSPFNPRFTPEWLSSAFQWGAERFNTVDVLHPGEISMSLLTSTGTPLGRAKRKVRQQCNRDMRNVEHA
LEISGIKLGRGKPVLISDYLQTQSYQCRRRSVIAEFQNNQIFQDACRAMSRAACQSRLRVTNVNIEPDIETAVKYIFDEL
PAYTHCSDLFEYETAALGYPTEWPIGKLIESGLTSLERDPNSSFIVIDFEKELIDD
>Q7N9M5 2.3.2.22~~~~~~Cyclo(L-leucyl-L-leucyl) synthase~~~
MLHENSPSFTVQGETSRCDQIIQKGDHALIGISPFNSRFSKDYVVDLIQWSSHYFRQVDILLPCEREASRLLVASGIDNV
KAIKKTHREIRRHLRNLDYVISTATLKSKQIRVIQFSDFSLNHDYQSLKTQVENAFNESESFKKSCLDMSFQAIKGRLKG
TGQYFGQIDLQLVYKALPYIFAEIPFYLNTPRLLGVKYSTLLYHRPWSIGKGLFNGSYPIQVADKQSYGIVTQL
>Q4L2X9 2.3.2.22~~~~~~Cyclo(L-leucyl-L-leucyl) synthase~~~
MQNFKVDFLTKNCKQIYQRKKHVILGISPFTSKYNESYIRKIIQWANSNFDDFSILLAGEESKNLLECLGYSSSKANQKV
RKEIKRQIRFCEDEIIKCNKTITNRIHRFSDFKNNIYYIDIYKTIVDQFNTDSNFKNSCLKMSLQALQSKGKNVNTSIEI
TDETLEYAAQYVLAELPFFLNANPIINTQETLMAYHAPWELGTNIINDQFNLKMNEKQGYIILTEKGDNYVKSV
>D7Y2H2 2.7.7.-~~~cdnC~~~Cyclic AMP-AMP-AMP synthase~~~
MSTEHVDHKTIARFAEDKVNLPKVKADDFREQAKRLQNKLEGYLSDHPDFSLKRMIPSGSLAKGTALRSLNDIDVAVYIS
GSDAPQDLRGLLDYLADRLRKAFPNFSPDQVKPQTYSVTVSFRGSGLDVDIVPVLYSGLPDWRGHLISQEDGSFLETSIP
LHLDFIKARKRAAPKHFAQVVRLAKYWARLMKQERPNFRFKSFMIELILAKLLDNGVDFSNYPEALQAFFSYLVSTELRE
RIVFEDNYPASKIGTLSDLVQIIDPVNPVNNVARLYTQSNVDAIIDAAMDAGDAIDAAFYAPTKQLTVTYWQKVFGSSFQ
G
>P0DSP4 2.7.7.-~~~~~~Cyclic AMP-AMP-GMP synthase~~~
MELQPQFNEFLANIRPTDTQKEDWKSGARTLRERLKNFEPLKEIVVSTFLQGSIRRSTAIRPLGDKRPDVDIVVVTNLDH
TRMSPTDAMDLFIPFLEKYYPGKWETQGRSFGITLSYVELDLVITAIPESGAEKSHLEQLYKSESVLTVNSLEEQTDWRL
NKSWTPNTGWLSESNSAQVEDAPASEWKAHPLVLPDREKNEWGRTHPLAQIRWTAEKNRLCNGHYINLVRAVKWWRQQNS
EDLPKYPKGYPLEHLIGNALDNGTTSMAQGLVQLMDTFLSRWAAIYNQKSKPWLSDHGVAEHDVMARLTAEDFCSFYEGI
ASAAEIARNALASEEPQESAQLWRQLFGSKFPLPGPQGGDRNGGFTTPSKPAEPQKTGRFA
>C0VHD2 2.7.7.-~~~~~~Cyclic AMP-AMP-AMP synthase~~~COG1746
MGSERIMTTQQQFLDLLSDIEPSTTTVNDCSSAHNTLRDALKVHNEFSKVHVHTFLSGSYKRNTAVRPTTIGGITQRPDV
DIIALTNHTINDDPQIVLDAVHTALKDIGYTDLTVNRRSVNVKLKKVDMDVVPIISDGYGGYLIPDIHLEEWLVTNPPAH
TEWTVEVNKNANGRFKPLVKLFKWWRRENLSDLKRPKGFILECLVAKHMNYYESNYEKLFVYLLETIRDSYGIYASLGII
PHLEDPGVAGNNVFSAVTADEFKTFFEKVEEQAAIARNALNETDDDKALALWRQVLGNRFPRSASHKSANSADMASSLIR
SALGAGLTFPSTPVYPNKPGGFA
>P0DTF7 2.7.7.-~~~cdnD~~~Cyclic AMP-AMP-AMP synthase~~~
MLSIDEAFRKFKSRLELNEREQKNASQRQNEVRDYLQTKFGIARSFLTGSYARYTKTKPLKDIDIFFVLKDSEKHYHGKA
ASVVLDDFHSALVEKYGSAAVRKQARSINVDFGVHIDAEDNTDYRVVSVDAVPAFDTGDQYEIPDTASGKWIKTDPEIHK
DKATAAHQAYANEWKGLVRMVKYWNNNPKHGDLKPVKPSFLIEVMALECLYGGWGGSFDREIQSFFATLADRVHDEWPDP
AGLGPAISNDMDAARKQRAQQLLFQASQDASIAIDHARRGRNIEALRAWRALFGPKFPLS
>P0DSP3 2.7.7.-~~~~~~Cyclic dipyrimidine nucleotide synthase~~~
MSIDWEQTFRKWSKPSSETESTKAENAERMIKAAINSSQILSTKDISVFPQGSYRNNTNVREDSDVDICVCLNTLVLSDY
SLVPGMNDKLAELRTASYTYKQFKSDLETALKNKFGTLGVSRGDKAFDVHANSYRVDADVVPAIQGRLYYDKNHNAFIRG
TCIKPDSGGTIYNWPEQNYSNGVNKNKSTGNRFKLIVRAIKRLRNHLAEKGYNTAKPIPSYLMECLVYIVPDQYFTGDSY
KTNVENCINYLYNQIDSSDWTEINEIKYLFGSHQMWNKTQVKEFLLTAWSYIQKN
>A0A381HBN1 2.7.7.65~~~cdnE~~~c-di-GMP synthase~~~
MEKKNYSALFENLQNRSNPEKLQEITTKFFSDNPDVKYNDVLKYITLAMNGVSPEYTNKSREAGEKVKLHLQDILLDVEY
QYQGSVMTNTHIKGYSDIDLLVISDKFYTLDERNIIENLEVNKFSLSQEKIQKLQQELLGKKYHSATNDLKNNRLLSEQK
LSSVYEICDITHPKAIKITNKSMGRDVDIVIANWYDDAQSVINNRQIEYRGIQIYNKRSNTIENRDFPFLSIQRINKRSS
ETKGRLKKMIRFLKNLKADSDEKIELSSFDINAICYNIEKNKYLHSNKYQLVPILYEQLNELVSNSNKINSLKSVDGHEY
IFSRNNIDKKESLKMLLQEVKIIYSNLQSYL
>Q6XGD5 2.7.7.-~~~cdnE~~~Cyclic UMP-AMP synthase~~~
MGIPESQLDTWSHQGSIAQSASTYSIIKNALESANTKYHGKNFKVFLQGSYGNDTNIYAESDVDVVICLDDVYYSDLTQL
SPEDKDAYDRAFVPATYSYTQFKQDVLEALTERFGSDVKVGDKAIVVAANGSRRKADVIASMQFRRYWKFKGHYDSQYDE
GICFFNGAGERIANYPKQHSENLTLKHQASNKWLKPMVRVLKNLRSKLIADGKLKSGLAPSYYLEGLLYNVPNEKFGTSY
ADCFVNAMNWIQTEADKDKLVCANEQYYLLWEGTHTSWEKADAEAFIDAAIKMWNEW
>P0DSP2 2.7.7.-~~~~~~Cyclic dipurine nucleotide synthase~~~
MNFSEQQLINWSRPVSTTEDLKCQNAITQITAALRAKFGNRVTIFLQGSYRNNTNVRQNSDVDIVMRYDDAFYPDLQRLS
ESDKAIYNAQRTYSGYNFDELKADTEEALRNVFTTSVERKNKCIQVNGNSNRITADVIPCFVLKRFSTLQSVEAEGIKFY
SDDNKEIISFPEQHYSNGTEKTNQTYRLYKRMVRILKVVNYRLIDDGEIADNLVSSFFIECLVYNVPNNQFISGNYTQTL
RNVIVKIYEDMKNNADYTEVNRLFWLFSNRSPRTRQDALGFMQKCWNYLGYQ
>P0DUE2 2.7.7.65~~~cdnE~~~c-di-GMP synthase~~~
MQKNYLELIKKVRERSNPDLVQMTKMYSETLSGSKLFENKSIEYSDVSIYIKESMKGVAPSYTMNSKVAANKVEAHLKKS
HGNLVDFERQGSVMTNTHILKENDVDLVQITNKSSEFDHKGLEKALNNTSVLKTEEILNLKKHKENFSPYQGNQIDDLKY
VRLKSELVLSSTYKTVDIEKENSIYVKVTEPERDIDVVTATYYKSVDFMKTNDKSRKGIQIYNKKTGKINDVDYPFLSIE
RINVKDIISNRRLKNMIRFLKNIKYDCPHIENKGSIRSFHINAICYNIDVKKYEDLHYLDLVSILYQELTNIISNKSYRD
NIKSVDGCEYIFEFDCAKKLIEIEFLSQELDSIIADLHNQSLLVG
>P0DUE3 2.7.7.65~~~cdnE~~~c-di-GMP synthase~~~
MSSFDYRSRLKELSARYNPEASILVNERMQSEDHYLDTDVVRYVKRSMRAVDDDYTKRTKDAGEAVKQHLNNELINVTYE
YQGSVMTNTHIKGASDIDLLVICEKFEDTEINRVRDCLKTPYGYSNIQLSRLRNYELSFSLYRGDSREDLSNLRRQIESI
MISKYTICDISKAKSVRITNQHLHRDVDIVTSSWFQSLEYVLDGMPKEEKGIKIYNKNLGFAEGPDFPFLSISRINQKSS
ESNGRLKRMIRFLKNVRTDSQKDIQLTSFDINAICYSIPVADYAYLDYKQLVYLLWSTMYHLWYDNKLDKLKSVVGDEYV
FKGKPNKIEALKALEDDVFKIHQDLN
>G2SLH8 2.7.7.-~~~cdnE~~~Cyclic UMP-AMP synthase~~~
MPVPESQLERWSHQGATTTAKKTHESIRAALDRYKWPKGKPEVYLQGSYKNSTNIRGDSDVDVVVQLNSVFMNNLTAEQK
RRFGFVKSDYTWNDFYSDVERALTDYYGASKVRRGRKTLKVETTYLPADVVVCIQYRKYPPNRKSEDDYIEGMTFYVPSE
DRWVVNYPKLHYENGAAKNQQTNEWYKPTIRMFKNARTYLIEQGAPQDLAPSYFLECLLYNVPDSKFGGTFKDTFCSVIN
WLKRADLSKFRCQNGQDDLFGEFPEQWSEEKARRFLRYMDDLWTGWGQ
>A0A150XSC5 2.7.7.65~~~cdnG~~~c-di-GMP synthase~~~
MLITSNAKTQLEDTLAKMAEAAELDKTRWSRLNTAYEAISKWLSDDPEFFGGVEIEIYPQGSVSIGTTTKPYGKTEFDLD
VVIHIKLLSSNYDPKTIFNEVVRRLNENETYRKICEPKSRCVRLNYQGDFHLDVVPGCMVIIYNHELIDITDQKNEIWLR
SSPKGYQKWFLDIANRVELTLLEMTFSAHKVEIEEYAKKKPLQRAVQLIKMRRNIYFDQNPENAPTSIILTTLAAQFYEG
QSSISETFEGIISKLKNHIETLFPNRPFELPNPVNPSENLADIWVDKPELYKHFISFINNLHNEWQDLKKAHGIEEEAIL
MKGMFGNDPYIKAMEARAETVNQKRGNGLGILATGVMVDRAVEKSLPVMPNTFYGD
>O32085 1.13.11.20~~~cdoA~~~Cysteine dioxygenase~~~COG5553
MELYECIQDIFGGLKNPSVKDLATSLKQIPNAAKLSQPYIKEPDQYAYGRNAIYRNNELEIIVINIPPNKETTVHDHGQS
IGCAMVLEGKLLNSIYRSTGEHAELSNSYFVHEGECLISTKGLIHKMSNPTSERMVSLHVYSPPLEDMTVFEEQKEVLEN
S
>Q9KVL2 3.1.4.52~~~cdpA~~~Cyclic di-GMP phosphodiesterase CdpA~~~COG2199
MFTVSRLIPDLASANQVLETMDLPHGQSILVQIFSPLSREHVVQLARLIRSRHPQACLLGCSTEEVIFQGEVHHQVTLLQ
ITVFEQTYLSRAVVDYSDDEAADAERLARQLELTSMSRAVVCFSWQMDTLQVARFALRDTQGAPVPVAGGAAKQTPSGRW
VLLDEACYQNASVAIALHGEALYVETGGYTEWQPVGRTYRVTAVEGDRVLRLDDEPIEAIYQRNLGAQADLPHDWLISFP
LMKGECRHQDLYLPLGLAEEGGLRFNRPLALQDEVRFCFDHPSLTLERVYLTAQQLQAKQCQQVWVFNCALRLNFMHENH
ELQPLQAVAPTDGCYCWGELLYEHGQQQVMHHSMTFLALREGAVRDDLVPIPLPSYPEGMTSPLFNLIRHAFHDLDAMTD
NLAQQIRAQTSLLTASYRRDRRTGLPNRVVLRERLANFAANEHLIALKVTNFNQINEKYGYPVGDKLLRDLSEQFQVFLD
QKLAGQSGLYAIGVGEWATVFRAKLDGKSIHSHFYQFVEQLEHVNFEPYGLPNVDYLSISLCAGLVSQGDFAEHSPDELL
LRAIEARRYAFNNNHHFCNAARLKVQESVRQERLNWLSRVSRAVVRDDVVVYAQPICQARSHIVASYECLVRIEDEGEII
LPGNFLPIITDTHLYTRLSRQMITHTFNMMRHRPEAFSINLSPQDLMSERTLQHLEAAIKSVADPARVGLEVLESEQIKD
YGRMIEVCNHFRTLGATIIVDDFGSGYSNIDEIVKLEPQVIKLDGSLIRNIDQDVKQRRIAEQLVKLCQVLNAKTVAEFV
HNQTVCRISEDMGVDYLQGYFLGRPSRLG
>Q9HWS0 3.1.4.-~~~~~~Cyclic di-GMP phosphodiesterase PA4108~~~
MLKRIPVTQLRLGMFVQSLCGSWLDHPFWKRGGFLLDSQADLQRLRESAVKEVWIDASKGLDLPEEAAVSAAAVLPVTMP
AGPSPARVALEEEIRHAALLCSRAKAAVVSMFRDARMGQAIDTAHASDLVDEISASVLRHPNALLSLVRLKTSDEYTYMH
SVAVCALMIALARQLELPDPLVREAGLAGLLHDIGKMAVPDPILNKPGKLTDPEFGLVRRHPQNGARMLLDCRQVSALVV
DVCLHHHERIDGTGYPFGLAQEQISLLARMGAVCDVYDAITSDRPYKKGWNAAEAIRRMAEWNGHFDPQVFRAFVKAVGI
YPVGALVRLESGRLGVVLEQHGRSLLTPRVKVFFSARSKVPIPQQVVDLGRAGQTDRIVGFEPAEAWNFRNLDEMWTGLA
KSTGSYFDAGTGNP
>Q9KSG1 3.1.4.-~~~~~~Cyclic di-GMP phosphodiesterase VC_1295~~~COG2770
MTIWVLSLAMKNTIYHAPLTFGLYALAGVLFALYTSRVCPFLATLSTREIATQVGAVFILAWLIRHTLLRQHIWARKQQF
IQLDTALLFAMSLPLALYYNLQYQFTLDSNLKVLFGMTLFGFFTGALLQLSSKLRTLRQMPQSQNLALSSDERRSLVKQL
IGLIVLLISTLTVMLSMVAIKDIHWLENNPARLLDGSGKISIVKEFAFLSLVLGGYITAILVLWSRMMKEILDHQERSLQ
AVTQGNLQVRLPVYSNDELGNVAMLTNQMLDSLEATQNEVKTTRDVAIVSLSALAESRDNETGAHILRTQEYVKALAEYL
AAFPQYSTLLTPAYIELLYKSAPLHDVGKVGIPDSVLLKPGKLTDEEFTVMKEHPRIGAQALAIAERHLGTSSFLAIAKE
IALTHHEKWDGTGYPAQLQGEAIPLSGRLMALADVYDALISARVYKPAFSHDKAKAIIVEGSGHHFDPAVVEAFLAVEEK
FVAIAAHFKDAA
>Q9HV27 3.1.4.-~~~~~~Cyclic di-GMP phosphodiesterase PA4781~~~
MESMLDRPEQELVLVVDDTPDNLLLMRELLEEQYRVRTAGSGPAGLRAAVEEPRPDLILLDVNMPGMDGYEVCRRLKADP
LTRDIPLMFLTARADRDDEQQGLALGAVDYLGKPVSPPIVLARVRTHLQLKANADFLRDKSEYLELEVRRRTRQLQQLQD
AVIEALATLGDLRDNPRSRHLPRIERYVRLLAEHLAAQRAFADELTPEAVDLLSKSALLHDIGKVAVPDRVLLNPGQLDA
ADTALLQGHTRAGRDALASAERRLGQPSGFLRFARQIAYSHHERWDGRGFPEGLAGERIPLAARIVALADRYDELTSRHA
YRPPLAHAEAVLLIQAGAGSEFDPRLVEAFVAVADAFAEVARRYADSAEALDVEMQRLEQAVAESIELTAPPA
>Q9KSB1 3.1.4.-~~~~~~Probable cyclic di-GMP phosphodiesterase VC_1348~~~COG3437
MATANIAIKQNSSTNIAFIERMPSPSLHRILYNLFSKFSYSLSNSCHFNLKCLNFVMNRMFHMSMEDLSQCTILIVDDSP
DNIAFMSQGLAQYYRIKAARSGKVALEILAQYPIDLVLLDIVMPEMSGYEVINQIKHNPHTEHIPVIFLTGKSNPEDEQL
GFELGAVDYVFKPVSIPLLKSRVHTHLQNKRSKDILLNQNDYLETEVLRRSGELDRMQDAVVFALASLAETRDPETGNHL
LRTQHYVKVLAQRLATTDKYRDVLSPTVIDTYFKAAPLHDIGKVGIPDNILLKPGKLTPDEFTTMRNHALLGKLALEKAE
KLSGACTALINVAKEIAMGHHEKWDGSGYPLGLKGDDIPLSARLMALADVYDALICRRVYKEPMSHEEAKAIILQGRGSH
FDPMVIDAFLIEEQNFIDIAQKFADEESAMVPIQLGQQASG
>Q9I0R8 3.1.4.52~~~~~~Cyclic di-GMP phosphodiesterase PA2567~~~
MATPPYPEDEHSRQAYVDQLGLLEEGADEVFEEILAAATSYFQTPIALISILDHQRQWFRASIGLDIRQTPRRDSFCAYA
ILGKGVFEVADATLDPRFRDNPYVQGEPRIRFYAGAPLATAEGLNLGSLCVIDREPRGPLAERDVAMLEHFARLVMARIH
TLRSTNYIDEPTGLYNRLRLQEDVSLRLQRDGALTVIAADLLPLALLNTIIRTLGYPFSNDLMLEARDRIRAELPDFTLY
KISPTRFGLLLPRQQQEETESVCLRLLRAFESPVVCRGIPIKANVGLGVLPLADDTLDGDQDWLRLVVSAADDARDRGVG
WARYNPPLDQAQQRAFTLLTSLSQAIGTEEGFHLVYQPKIDLPTGRCTGVEALLRWRHPQLGFVSPAEFVPLAEKTALMR
PLSDWVLRHAMAQLAQWNARNIPLRLAINVSASDMEDSSFLEEAVRLAKTYDIDLSALELEFTESVLIRDASAVGSVLLR
ARELGMGIAVDDFGTGYSNWTYLRDLPITAIKLDQSFTRDLAGSPKAQSVTQAVIGLASQLGYRVVAEGIETHDTFHLLQ
AWGCHEGQGYLIAQPMLPEQLEDWLRR
>Q9WY30 3.1.4.-~~~~~~Cyclic di-GMP phosphodiesterase TM_0186~~~
MTVLIVEDDDITREAMGQYLKLSGFNVIEAENGEKAVELSENVDVALVDVMLPGMSGIEVVNKIKAKNPSCVVFVVTAYD
DTEIVKKCVEAGADDFIKKPVNLELLRLKITHALRNRVFHMYRNSYLKSLKKKLFLLEKTAEEFFTEYEDFLFEVLEILN
MLSEYRDMETHRHTERVGWLSGRIAEEMGMSEVFVTEIQFAAPLHDIGKIGIPDRILLKPGILTPEEFEIMKQHTTIGFR
ILSRSNSPILQLGAEIALTHHERWDGSGYPRGLKEREIPISGLIVAVADSFDAMVSRRPYKNPKPLEEAFREIESLSGKL
YSPEVVEAFLKLEKEITDVYRREKDEDTSHNGGRSHQSSPGEGVEGIR
>Q2FIA5 1.8.1.14~~~cdr~~~Coenzyme A disulfide reductase~~~
MPKIVVVGAVAGGATCASQIRRLDKESDIIIFEKDRDMSFANCALPYVIGEVVEDRRYALAYTPEKFYDRKQITVKTYHE
VIAINDERQTVSVLNRKTNEQFEESYDKLILSPGASANSLGFESDITFTLRNLEDTDAIDQFIKANQVDKVLVVGAGYVS
LEVLENLYERGLHPTLIHRSDKINKLMDADMNQPILDELDKREIPYRLNEEINAINGNEITFKSGKVEHYDMIIEGVGTH
PNSKFIESSNIKLDRKGFIPVNDKFETNVPNIYAIGDIATSHYRHVDLPASVPLAWGAHRAASIVAEQIAGNDTIEFKGF
LGNNIVKFFDYTFASVGVKPNELKQFDYKMVEVTQGAHANYYPGNSPLHLRVYYDTSNRQILRAAAVGKEGADKRIDVLS
MAMMNQLTVDELTEFEVAYAPPYSHPKDLINMIGYKAK
>O52582 1.8.1.14~~~cdr~~~Coenzyme A disulfide reductase~~~COG0446
MPKIVVVGAVAGGATCASQIRRLDKESDIIIFEKDRDMSFANCALPYVIGEVVEDRRYALAYTPEKFYDRKQITVKTYHE
VIAINDERQTVSVLNRKTNEQFEESYDKLILSPGASANSLGFESDITFTLRNLEDTDAIDQFIKANQVDKVLVVGAGYVS
LEVLENLNERGLHPTLIHRSDKINKLMDADMNQPILDELDKREIPYRLNEEINAINGNEITFKSGKVEHYDMIIEGVGTH
PNSKFIESSNIKLDRKGFIPVNDKFETNVPNIYAIGDIATSHYRHVDLPASVPLAWGAHRAASIVAEQIAGNDTIEFKGF
LGNNIVKFFDYTFASVGVKPNELKQFDYKMVEVTQGAHANYYPGNSPLHLRVYYDTSNRQILRAAAVGKEGADKRIDVLS
MAMMNQLTVDELTEFEVAYAPPYSHPKDLINMIGYKAK
>Q7A6H1 1.8.1.14~~~cdr~~~Coenzyme A disulfide reductase~~~
MPKIVVVGAVAGGATCASQIRRLDKESDIIIFEKDRDMSFANCALPYVIGEVVEDRKYALAYTPEKFYDRKQITVKTYHE
VIAINDERQTVTVLNRKTNEQFEESYDKLILSPGASANSLGFESDITFTLRNLEDTDAIDQFIKANQVDKVLVIGAGYVS
LEVLENLYERGLHPTLIHRSDKINKLMDADMNQPILDELDKREIPYRLNEEIDAINGNEITFKSGKVEHYDMIIEGVGTH
PNSKFIESSNIKLDRKGFIPVNDKFETNVPNIYAIGDIATSHYRHVDLPASVPLAWGAHRAASIVAEQIAGNDTIEFKGF
LGNNIVKFFDYTFASVGVKPNELKQFDYKMVEVTQGAHANYYPGNSPLHLRVYYDTSNRQILRAAAVGKEGADKRIDVLS
MAMMNQLTVDELTEFEVAYAPPYSHPKDLINMIGYKAK
>A5U908 4.4.1.1~~~cds1~~~L-cysteine desulfhydrase Cds1~~~COG0031
MSGGACIAVRSLSRSWTDNAIRLIEADARRSADTHLLRYPLPAAWCTDVDVELYLKDETTHITGSLKHRLARSLFLYALC
NGWINENTTVVEASSGSTAVSEAYFAALLGLPFIAVMPAATSASKIALIESQGGRCHFVQNSSQVYAEAERVAKETGGHY
LDQFTNAERATDWRGNNNIAESIYVQMREEKHPTPEWIVVGAGTGGTSATIGRYIRYRRHATRLCVVDPENSAFFPAYSE
GRYDIVMPTSSRIEGIGRPRVEPSFLPGVVDRMVAVPDAASIAAARHVSAVLGRRVGPSTGTNLWGAFGLLAEMVKQGRS
GSVVTLLADSGDRYADTYFSDEWVSAQGLDPAGPAAALVEFERSCRWT
>O69652 4.4.1.1~~~cds1~~~L-cysteine desulfhydrase Cds1~~~COG0031
MSGGACIAVRSLSRSWTDNAIRLIEADARRSADTHLLRYPLPAAWCTDVDVELYLKDETTHITGSLKHRLARSLFLYALC
NGWINENTTVVEASSGSTAVSEAYFAALLGLPFIAVMPAATSASKIALIESQGGRCHFVQNSSQVYAEAERVAKETGGHY
LDQFTNAERATDWRGNNNIAESIYVQMREEKHPTPEWIVVGAGTGGTSATIGRYIRYRRHATRLCVVDPENSAFFPAYSE
GRYDIVMPTSSRIEGIGRPRVEPSFLPGVVDRMVAVPDAASIAAARHVSAVLGRRVGPSTGTNLWGAFGLLAEMVKQGRS
GSVVTLLADSGDRYADTYFSDEWVSAQGLDPAGPAAALVEFERSCRWT
>P0ABG1 2.7.7.41~~~cdsA~~~Phosphatidate cytidylyltransferase~~~COG0575
MLKYRLISAFVLIPVVIAALFLLPPVGFAIVTLVVCMLAAWEWGQLSGFTTRSQRVWLAVLCGLLLALMLFLLPEYHRNI
HQPLVEISLWASLGWWIVALLLVLFYPGSAAIWRNSKTLRLIFGVLTIVPFFWGMLALRAWHYDENHYSGAIWLLYVMIL
VWGADSGAYMFGKLFGKHKLAPKVSPGKTWQGFIGGLATAAVISWGYGMWANLDVAPVTLLICSIVAALASVLGDLTESM
FKREAGIKDSGHLIPGHGGILDRIDSLTAAVPVFACLLLLVFRTL
>P9WPF7 2.7.7.41~~~cdsA~~~Phosphatidate cytidylyltransferase~~~COG4589
MSWLNTKKASCWRSSGRSATKSVTTNDAGTGNPAEQPARGAKQQPATETSRAGRDLRAAIVVGLSIGLVLIAVLVFVPRV
WVAIVAVATLVATHEVVRRLREAGYLIPVIPLLIGGQAAVWLTWPFGAVGALAGFGGMVVVCMIWRLFMQDSVTRPTTGG
APSPGNYLSDVSATVFLAVWVPLFCSFGAMLVYPENGSGWVFCMMIAVIASDVGGYAVGVLFGKHPMVPTISPKKSWEGF
AGSLVCGITATIITATFLVGKTPWIGALLGVLFVLTTALGDLVESQVKRDLGIKDMGRLLPGHGGLMDRLDGILPSAVAA
WIVLTLLP
>Q9X1B7 2.7.7.41~~~cdsA~~~Phosphatidate cytidylyltransferase~~~COG0575
MDDLKTRVITASVVAPFVVLCFVSYESLIGLVSAILILAGYELITLEMKERDARFFYVILLALYPVLYGLVFEEPTQPLS
ILFITGVVFSLITDKDPSQVFKTVAAFSIALIYVTFFLSFFLPIYRDFGAANALLVLTSTWVFDSFAYFTGLKFGRTRIS
PRYSPRKSLEGVIGGFLGVVIYTFLYRLVVNDLLSVNVISFRTFLPFAATVAIMDTFGDIFESALKRHYGVKDSGKTLPG
HGGMLDRIDGLLFVAPVSYIVFKILEGVVR
>O87120 ~~~cdtA~~~Cytolethal distending toxin subunit A~~~
MKKFLPGLLLMGLVACSSNQRMSDYSQPESQSDLAPKSSTTQFQPQPLLSKASSMPLNLLSSSKNGQVSPSEPSNFMTLM
GQNGALLTVWALAKRNWLWAYPNIYSQDFGNIRNWKIEPGKHREYFRFVNQSLGTCIEAYGNGLIHDTCSLDKLAQEFEL
LPTDSGAVVIKSVSQGRCVTYNPVSPTYYSTVTLSTCDGATEPLRDQTWYLAPPVLEATAVN
>O06522 ~~~cdtA~~~Cytolethal distending toxin subunit A~~~
MKKFLPSLLLMGSVACSSNQRMNDYSQPESQSDLAPKSSTIQPQPQPLLSKTPSMSLNLLSSSGPNRQVLPSEPSNFMTL
MGQNGALLTVWALAKRNWLWAYPNIYSQDFGNIRNWKMEPGKHREYFRFVNQSLGTCVEAYGNGLIHDICSLDKLAQEFE
LLPTDSGAVVIKSVSQGRCVTYNPVSTTFYSTVTLSVCDGATEPSRDQTWYLAPPVLEATAVN
>Q46669 3.1.-.-~~~cdtB~~~Cytolethal distending toxin subunit B~~~
MKKYIISLIVFLSFYAQADLTDFRVATWNLQGASATTESKWNINVRQLISGENAVDILAVQEAGSPPSTAVDTGTLIPSP
GIPVRELIWNLSTNSRPQQVYIYFSAVDALGGRVNLALVSNRRADEVFVLSPVRQGGRPLLGIRIGNDAFFTAHAIAMRN
NDAPALVEEVYNFFRDSRDPVHQALNWMILGDFNREPADLEMNLTVPVRRASEIISPAAATQTSQRTLDYAVAGNSVAFR
PSPLQAGIVYGARRTQISSDHFPVGVSRR
>Q8Z6A7 3.1.-.-~~~cdtB~~~Cytolethal distending toxin subunit B homolog~~~COG3021
MKKPVFFLLTMIICSYISFACANISDYKVMTWNLQGSSASTESKWNVNVRQLLSGTAGVDILMVQEAGAVPTSAVPTGRH
IQPFGVGIPIDEYTWNLGTTSRQDIRYIYHSAIDVGARRVNLAIVSRQRADNVYVLRPTTVASRPVIGIGLGNDVFLTAH
ALASGGPDAAAIVRVTINFFRQPQMRHLSWFLAGDFNRSPDRLENDLMTEHLERVVAVLAPTEPTQIGGGILDYGVIVDR
APYSQRVEALRNPQLASDHYPVAFLARSC
>P9WPF9 2.3.2.21~~~~~~Cyclo(L-tyrosyl-L-tyrosyl) synthase~~~
MSYVAAEPGVLISPTDDLQSPRSAPAAHDENADGITGGTRDDSAPNSRFQLGRRIPEATAQEGFLVRPFTQQCQIIHTEG
DHAVIGVSPGNSYFSRQRLRDLGLWGLTNFDRVDFVYTDVHVAESYEALGDSAIEARRKAVKNIRGVRAKITTTVNELDP
AGARLCVRPMSEFQSNEAYRELHADLLTRLKDDEDLRAVCQDLVRRFLSTKVGPRQGATATQEQVCMDYICAEAPLFLDT
PAILGVPSSLNCYHQSLPLAEMLYARGSGLRASRNQGHAIVTPDGSPAE
>B0B9A0 3.4.22.-~~~cdu1~~~Deubiquitinase and deneddylase Dub1~~~
MLSPTNSTSKTAPVPPRDSSKPVLISEEPRNQLLQKVARTALAVLLVVVTLGLILLFYSFSDLQSFPWCCQTHPSTKEQP
TISIPVPLPSPPLAVPRPSTPPPPVISRPSTPSAPKPSTPPPLLPKAPKPVKTQEDLLPLVPEQVFVEMYEDMARRQTIE
ALVPAWDSDIIFKCLCYFHTLYPGLIPLETFPPATIFNFKQKIISILEDKKAVLRGEPIKGPLPICCSKENYRRHLQRTT
LLPVFMWYHPTPKTLSDTMQTMKQLAIKGSVGASHWLLVIVDIQARRLVYFDSLYNYVMPPENMKKELQSFAQQLDQVYP
AYDSKKFSVKIAAKEVIQRGSGSSCGAWCCQFLHWYLKDPLTDALNDLPVDSVERHENLASFVQACEAAVQDLPELSWPE
A
>O84876 3.4.22.-~~~cdu1~~~Deubiquitinase and deneddylase Dub1~~~
MLSPTNSTSKKAPVPPQDSSKPVLISEEPQNQLLQKVARTALAVLLVVVTLGLILLFYSFSDLQSFPWCCQTRPSTKEQP
TISIPVPLPSPPLAVPRPSTPPPPVISRPSTPPAPTPAISPPSTPSAPKPSTPPPLPPKAPKPVKTQEDLLPFVPEQVFV
EMYEDMARRWIIEALVPAWDSDIIFKCLCYFHTLYQGLIPLETFPPATIFNFKQKIISILEDKKAVLRGEPIKGSLPICC
SEENYRRHLHGTTLLPVFMWYHPTPKTLSDTMQTMKQLAIKGSVGASHWLLVIVDIQARRLVYFDSLYNYVMSPEDMEKD
LQSFAQQLDQVYPAYDSQKFSVKIAAKEVIQKGSGSSCGAWCCQFLHWYLRDPFTDALNDLPVDSVERHENLASFVQACE
AAVQDLPELFWPEAKALF
>B0B999 3.4.22.-~~~cdu2~~~Deubiquitinase and deneddylase Dub2~~~
MEPIHNPPPQTCSYSRPSTTYTSFKDASCDTKVTRIIIALFLIVISCGLILCAYTFRDLLDADYLAQEGPQQATKLLQQL
DDVLTGPPLPIWDNEHLFQFSCLMQNKHRRVLPIDICNPLTKFNFLECICNCLMTKQSVNVNETDMCELFCPPTCTPENY
RRLLCTSSVFPFVMWHDPSADTQEAMLTKMDQTMSSGRVGNSHWVLVIVDIEYRCVTFFDSLCDYVASPQQMREQLEGLA
VSLGAIYPKEGGADSDQEELLSPFQVRIGSTVKVQSPGEFTCGAWCCQFLAWYLENPDFDLEEKVPTNPSERRALLADFI
STTEQAMSRYSSLSWPTTD
>O84875 3.4.22.-~~~cdu2~~~Deubiquitinase and deneddylase Dub2~~~
MEPIHNPPPQTCSYSRPSTTYTSFKDASCGTKVTRIIIALFLIVISCGLILCAYTFRDLLDADYSAQEGPQQATKLLQQL
DKVLTGPPLPIWDNEHLFQFSCLMQNKHRRVLPIDICNPLTKFNFLEYICNCLMTKQSVNVNETDMCELFCPPTCTPENY
RRLLCTSSVFPFVMWHDPSADTQEAMLTKMDQTMSSGRVGNSHWVLVIVDIEHRCVTFFDSFYDYIASPQQMREQLEGLA
ASLGAIYPKEGGADSDQEELLSPFQVRIGSTVKVQSPGEFTCGAWCCQFLAWYLENPDFDLEEKVPTNPSERRALLADFI
STTEQAMSRYSSLSWPTTD
>A0A0K2VM55 3.1.1.-~~~~~~Carbohydrate esterase MZ0003~~~
MQRTCVLIVLIVTSTMWTPDPDVYAQPRGFNYDEAQVPKYTLPDPLVMVDGTKVTSAKQWNDKRRDEVQQLFEAYMYGKV
PDGETELIFTDAKGERALGGAAIRKQVKISFGEKEDAPAMDLLIYLPADAKVRVPVFLGLNFHGNHTIHKDKEIWLTESW
VRTNKKFGITKNKANELSRGVAAGRWQIEKAIAKGYGVATIYCGDIDPDFNFPSNGIQAYYYKKDQTIPEKGQWGTIAAW
AFGLSCAMDYFETDTDIDHKKVAVLGHSRLGKTSLWAGAIDTRFALTISNCSGCGGAALSRRRFGETVRRINTSFPHWFC
SRFHQYNDKEDKLPIDQHMLIALCAPRPVLINSATEDKWADPHGEFLAAQGADAVYRMLGTGGLDAKKWPEPNKLVKSTI
GYHLRPGKHDVTARDWDVYIEFADHHMTGGAE
>A9GMG8 3.1.1.-~~~~~~Multifunctional esterase~~~COG3509
MDQFKTHVLGLASLSLALLVALPARGASLQKVNQSEWGADGLPSYVNMYIYVPDKLATKPPIVVAPHHCQGNGQGTFSEM
SSLVSIANTSGFIMIFPEATGQNCWDAGSTRSLNHGGGGDTGAIVQMVKYTLAKYGGDAGRVYSVGGSSGGIMTEALLGV
YPDVFMAGVSLMGVPCGCWAEGYNDVTGTGSSAQWSGPCGGGNVTKTGQQWGDLVRSYYPGYTGHRPRLQHWHGTADTIL
SYKNMAEDIKEWTNVLGLSETPSGTDTPKRGTTRQFWKSACGYTVYEAFSMDGVGHAVPFDGPAVAAYFGLDRAGGQDPE
TAACPGAVPGGSDTGGAGGATGAGGAGGEAGTGGAGGEVGAGGAGGEAGAGAGGAGGVTVSGSGGSAGSGGAMDTASGGA
NGQDASAGTGDSTGGEGSSSGCSCAVGNDARDAGAQAGFLLAALGLLLGRQKRRPR
>B3PIB0 3.1.1.-~~~axe2C~~~Acetylxylan esterase / glucomannan deacetylase~~~COG2755
MKLLFPILLLTGSYFLSACNNTQSLMSSTHTIAASDPHIQVMGRTHINDDASLTFGYPGVSLSTIVAGSRLTAEMQSSNG
NSWIDVIIDNHPPTSIKLDAQQQTVELFHFPNSGEHRVEIIHRSENWHGQVTLKQLTLTGTQFLPAPVLPQRKILVLGDS
VTCGEAIDRVAGEDKNTRWWNARESYGMLTAKALDAQVQLVCWGGRGLIRSWNGKTDDANLPDFYQFTLGDTGQAPQWDH
HRYQPDLIISAIGTNDFSPGIPDRATYINTYTRFVRTLLDNHPQATIVLTEGAILNGDKKAALVSYIGETRQQLHSNRVF
YASSSHHPGDNSDAHPTKDQHAAMARELTPQLRQIMDW
>B3PDE5 3.1.1.-~~~ce2C~~~Acetylxylan esterase / glucomannan deacetylase~~~COG2755
MKPHALIGLLAGMLLSSSLYAADSTKPLPLHIGGRVLVESPANQPVSYTYSWPAVYFETAFKGQSLTLKFDDDQNIFRLI
VDDKAPVVINKPGKVDYPVESLAPGKHRVRLEKLTETQSTSGRFLGFYTDPSAKPLALPKRKRQIEFIGDSFTVGYGNTS
PSRECTDEELFKTTNSQMAFGPLTAKAFDADYQINASSGFGIVRNYNGTSPDKSLLSLYPYTLNNPDQLYHNKHWKPQVI
VIGLGTNDFSTALNDNERWKTREALHADYVANYVKFVKQLHSNNARAQFILMNSDQSNGEIAEQVGKVVAQLKGGGLHQV
EQIVFKGLDYSGCHWHPSANDDQLLANLLITHLQQKKGIW
>P02978 ~~~cea~~~Colicin-E1~~~
METAVAYYKDGVPYDDKGQVIITLLNGTPDGSGSGGGGGKGGSKSESSAAIHATAKWSTAQLKKTQAEQAARAKAAAEAQ
AKAKANRDALTQRLKDIVNEALRHNASRTPSATELAHANNAAMQAEDERLRLAKAEEKARKEAEAAEKAFQEAEQRRKEI
EREKAETERQLKLAEAEEKRLAALSEEAKAVEIAQKKLSAAQSEVVKMDGEIKTLNSRLSSSIHARDAEMKTLAGKRNEL
AQASAKYKELDELVKKLSPRANDPLQNRPFFEATRRRVGAGKIREEKQKQVTASETRINRINADITQIQKAISQVSNNRN
AGIARVHEAEENLKKAQNNLLNSQIKDAVDATVSFYQTLTEKYGEKYSKMAQELADKSKGKKIGNVNEALAAFEKYKDVL
NKKFSKADRDAIFNALASVKYDDWAKHLDQFAKYLKITGHVSFGYDVVSDILKIKDTGDWKPLFLTLEKKAADAGVSYVV
ALLFSLLAGTTLGIWGIAIVTGILCSYIDKNKLNTINEVLGI
>P04419 3.1.-.-~~~col~~~Colicin-E2~~~
MSGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNSGGGSGTGGN
LSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIADIMAALKGPFKFGLWGVALYGVLPSQIAKDDPNMMSKIVTSLP
ADDITESPVSSLPLDKATVNVNVRVVDDVKDERQNISVVSGVPMSVPVVDAKPTERPGVFTASIPGAPVLNISVNNSTPE
VQTLSPGVTNNTDKDVRPAGFTQGGNTRDAVIRFPKDSGHNAVYVSVSDVLSPDQVKQRQDEENRRQQEWDATHPVEAAE
RNYERARAELNQANEDVARNQERQAKAVQVYNSRKSELDAANKTLADAIAEIKQFNRFAHDPMAGGHRMWQMAGLKAQRA
QTDVNNKQAAFDAAAKEKSDADAALSAAQERRKQKENKEKDAKDKLDKESKRNKPGKATGKGKPVGDKWLDDAGKDSGAP
IPDRIADKLRDKEFKNFDDFRKKFWEEVSKDPDLSKQFKGSNKTNIQKGKAPFARKKDQVGGRERFELHHDKPISQDGGV
YDMNNIRVTTPKRHIDIHRGK
>P00646 3.1.-.-~~~ceaC~~~Colicin-E3~~~
MSGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNSGGGSGTGGN
LSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIADIMAALKGPFKFGLWGVALYGVLPSQIAKDDPNMMSKIVTSLP
ADDITESPVSSLPLDKATVNVNVRVVDDVKDERQNISVVSGVPMSVPVVDAKPTERPGVFTASIPGAPVLNISVNNSTPA
VQTLSPGVTNNTDKDVRPAGFTQGGNTRDAVIRFPKDSGHNAVYVSVSDVLSPDQVKQRQDEENRRQQEWDATHPVEAAE
RNYERARAELNQANEDVARNQERQAKAVQVYNSRKSELDAANKTLADAIAEIKQFNRFAHDPMAGGHRMWQMAGLKAQRA
QTDVNNKQAAFDAAAKEKSDADAALSSAMESRKKKEDKKRSAENNLNDEKNKPRKGFKDYGHDYHPAPKTENIKGLGDLK
PGIPKTPKQNGGGKRKRWTGDKGRKIYEWDSQHGELEGYRASDGQHLGSFDPKTGNQLKGPDPKRNIKKYL
>Q47112 3.1.-.-~~~colE7~~~Colicin-E7~~~
MSGGDGRGHNSGAHNTGGNINGGPTGLGGNGGASDGSGWSSENNPWGGGSGSGVHWGGGSGHGNGGGNSNSGGGSNSSVA
APMAFGFPALAAPGAGTLGISVSGEALSAAIADIFAALKGPFKFSAWGIALYGILPSEIAKDDPNMMSKIVTSLPAETVT
NVQVSTLPLDQATVSVTKRVTDVVKDTRQHIAVVAGVPMSVPVVNAKPTRTPGVFHASFPGVPSLTVSTVKGLPVSTTLP
RGITEDKGRTAVPAGFTFGGGSHEAVIRFPKESGQKPVYVSVTDVLTPAQVKQRQDEEKRLQQEWNDAHPVEVAERNYEQ
ARAELNQANKDVARNQERQAKAVQVYNSRKSELDAANKTLADAKAEIKQFERFAREPMAAGHRMWQMAGLKAQRAQTDVN
NKKAAFDAAAKEKSDADVALSSALERRKQKENKEKDAKAKLDKESKRNKPGKATGKGKPVNNKWLNNAGKDLGSPVPDRI
ANKLRDKEFKSFDDFRKKFWEEVSKDPELSKQFSRNNNDRMKVGKAPKTRTQDVSGKRTSFELHHEKPISQNGGVYDMDN
ISVVTPKRHIDIHRGK
>P09883 3.1.-.-~~~col~~~Colicin-E9~~~
MSGGDGRGHNTGAHSTSGNINGGPTGIGVSGGASDGSGWSSENNPWGGGSGSGIHWGGGSGRGNGGGNGNSGGGSGTGGN
LSAVAAPVAFGFPALSTPGAGGLAVSISASELSAAIAGIIAKLKKVNLKFTPFGVVLSSLIPSEIAKDDPNMMSKIVTSL
PADDITESPVSSLPLDKATVNVNVRVVDDVKDERQNISVVSGVPMSVPVVDAKPTERPGVFTASIPGAPVLNISVNDSTP
AVQTLSPGVTNNTDKDVRPAGFTQGGNTRDAVIRFPKDSGHNAVYVSVSDVLSPDQVKQRQDEENRRQQEWDATHPVEAA
ERNYERARAELNQANEDVARNQERQAKAVQVYNSRKSELDAANKTLADAIAEIKQFNRFAHDPMAGGHRMWQMAGLKAQR
AQTDVNNKQAAFDAAAKEKSDADAALSAAQERRKQKENKEKDAKDKLDKESKRNKPGKATGKGKPVGDKWLDDAGKDSGA
PIPDRIADKLRDKEFKSFDDFRKAVWEEVSKDPELSKNLNPSNKSSVSKGYSPFTPKNQQVGGRKVYELHHDKPISQGGE
VYDMDNIRVTTPKRHIDIHRGK
>E4N7E5 4.2.3.163~~~~~~(+)-corvol ether B synthase/(+)-corvol ether A synthase ((2E,6E)-farnesyl diphosphate cyclizing)~~~
MIPRFDFPWPSACHPHARQAEQGALAFAERHGLVPTAAYRSRLERTRYGWLAARCYPDADDVLLQLCADYFIWFFIVDDL
FVDRVDTLSERTIPNLTAMIDVLDHHRPGAEPVFGEHAWLDVCTRLRAYLSDEHFQRFAHGMRMWAATAGLQIANHLGAD
TVDVAPYETIRRHTSGTNPCLALADAAKHGPVTPAEYHSPPVQRLVLHANNVVCWSNDVQSLKMELNQPGQYWNMAAIYA
HRGLSLQQAVDLVALRVRGEIASFQSLALTLEPHASRPLRGFVDGLRHWMRGYQDWVENDTLRYADAFIAEDADDTAVRT
>P05819 ~~~cba~~~Colicin-B~~~
MSDNEGSVPTEGIDYGDTMVVWPSTGRIPGGDVKPGGSSGLAPSMPPGWGDYSPQGIALVQSVLFPGIIRRIILDKELEE
GDWSGWSVSVHSPWGNEKVSAARTVLENGLRGGLPEPSRPAAVSFARLEPASGNEQKIIRLMVTQQLEQVTDIPASQLPA
AGNNVPVKYRLTDLMQNGTQYMAIIGGIPMTVPVVDAVPVPDRSRPGTNIKDVYSAPVSPNLPDLVLSVGQMNTPVRSNP
EIQEDGVISETGNYVEAGYTMSSNNHDVIVRFPEGSGVSPLYISAVEILDSNSLSQRQEAENNAKDDFRVKKEQENDEKT
VLTKTSEVIISVGDKVGEYLGDKYKALSREIAENINNFQGKTIRSYDDAMSSINKLMANPSLKINATDKEAIVNAWKAFN
AEDMGNKFAALGKTFKAADYAIKANNIREKSIEGYQTGNWGPLMLEVESWVISGMASAVALSLFSLTLGSALIAFGLSAT
VVGFVGVVIAGAIGAFIDDKFVDELNHKIIK
>P17998 ~~~cda~~~Colicin-D~~~
MSDYEGSGPTEGIDYGHSMVVWPSTGLISGGDVKPGGSSGIAPSMPPGWGDYSPQGIALVQSVLFPGIIRRIILDKELEE
GDWSGWSVSVHSPWGNEKVSAARTVLENGLRGGLPEPSRPAAVSFARLEPASGNEQKIIRLMVTQQLEQVTDIPASQLPA
AGNNVPVKYRLMDLMQNGTQYMAIIGGIPMTVPVVDAVPVPDRSRPGTNIKDVYSAPVSPNLPDLVLSVGQMNTPVLSNP
EIQEEGVIAETGNYVEAGYTMSSNNHDVIVRFPEGSDVSPLYISTVEILDSNGLSQRQEAENKAKDDFRVKKEEAVARAE
AEKAKAELFSKAGVNQPPVYTQEMMERANSVMNEQGALVLNNTASSVQLAMTGTGVWTAAGDIAGNISKFFSNALEKVTI
PEVSPLLMRISLGALWFHSEEAGAGSDIVPGRNLEAMFSLSAQMLAGQGVVIEPGATSVNLPVRGQLINSNGQLALDLLK
TGNESIPAAVPVLNAVRDTATGLDKITLPAVVGAPSRTILVNPVPQPSVPTDTGNHQPVPVTPVHTGTEVKSVEMPVTTI
TPVSDVGGLRDFIYWRPDAAGTGVEAVYVMLNDPLDSGRFSRKQLDKKYKHAGDFGISDTKKNRETLTKFRDAIEEHLSD
KDTVEKGTYRREKGSKVYFNPNTMNVVIIKSNGEFLSGWKINPDADNGRIYLETGEL
>P05820 ~~~cma~~~Colicin-M~~~
METLTVHAPSPSTNLPSYGNGAFSLSAPHVPGAGPLLVQVVYSFFQSPNMCLQALTQLEDYIKKHGASNPLTLQIISTNI
GYFCNADRNLVLHPGISVYDAYHFAKPAPSQYDYRSMNMKQMSGNVTTPIVALAHYLWGNGAERSVNIANIGLKISPMKI
NQIKDIIKSGVVGTFPVSTKFTHATGDYNVITGAYLGNITLKTEGTLTISANGSWTYNGVVRSYDDKYDFNASTHRGIIG
ESLTRLGAMFSGKEYQILLPGEIHIKESGKR
>P08083 ~~~cna~~~Colicin-N~~~
MGSNGADNAHNNAFGGGKNPGIGNTSGAGSNGSASSNRGNSNGWSWSNKPHKNDGFHSDGSYHITFHGDNNSKPKPGGNS
GNRGNNGDGASAKVGEITITPDNSKPGRYISSNPEYSLLAKLIDAESIKGTEVYTFHTRKGQYVKVTVPDSNIDKMRVDY
VNWKGPKYNNKLVKRFVSQFLLFRKEEKEKNEKEALLKASELVSGMGDKLGEYLGVKYKNVAKEVANDIKNFHGRNIRSY
NEAMASLNKVLANPKMKVNKSDKDAIVNAWKQVNAKDMANKIGNLGKAFKVADLAIKVEKIREKSIEGYNTGNWGPLLLE
VESWIIGGVVAGVAISLFGAVLSFLPISGLAVTALGVIGIMTISYLSSFIDANRVSNINNIISSVIR
>Q9LCV9 2.5.1.66~~~ceaS~~~N(2)-(2-carboxyethyl)arginine synthase~~~
MSRVSTAPSGKPTAAHALLSRLRDHGVGKVFGVVGREAASILFDEVEGIDFVLTRHEFTAGVAADVLARITGRPQACWAT
LGPGMTNLSTGIATSVLDRSPVIALAAQSESHDIFPNDTHQCLDSVAIVAPMSKYAVELQRPHEITDLVDSAVNAAMTEP
VGPSFISLPVDLLGSSEGIDTTVPNPPANTPAKPVGVVADGWQKAADQAAALLAEAKHPVLVVGAAAIRSGAVPAIRALA
ERLNIPVITTYIAKGVLPVGHELNYGAVTGYMDGILNFPALQTMFAPVDLVLTVGYDYAEDLRPSMWQKGIEKKTVRISP
TVNPIPRVYRPDVDVVTDVLAFVEHFETATASFGAKQRHDIEPLRARIAEFLADPETYEDGMRVHQVIDSMNTVMEEAAE
PGEGTIVSDIGFFRHYGVLFARADQPFGFLTSAGCSSFGYGIPAAIGAQMARPDQPTFLIAGDGGFHSNSSDLETIARLN
LPIVTVVVNNDTNGLIELYQNIGHHRSHDPAVKFGGVDFVALAEANGVDATRATNREELLAALRKGAELGRPFLIEVPVN
YDFQPGGFGALSI
>P22522 ~~~cvaC~~~Colicin-V~~~
MRTLTLNELDSVSGGASGRDIAMAIGTLSGQFVAGGIGAAAGGVAGGAIYDYASTHKPNPAMSPSGLGGTIKQKPEGIPS
EAWNYAAGRLCNWSPNNLSDVCL
>P04480 ~~~caa~~~Colicin-A~~~
MPGFNYGGKGDGTGWSSERGSGPEPGGGSHGNSGGHDRGDSSNVGNESVTVMKPGDSYNTPWGKVIINAAGQPTMNGTVM
TADNSSMVPYGRGFTRVLNSLVNNPVSPAGQNGGKSPVQTAVENYLMVQSGNLPPGYWLSNGKVMTEVREERTSGGGGKN
GNERTWTVKVPREVPQLTASYNEGMRIRQEAADRARAEANARALAEEEARAIASGKSKAEFDAGKRVEAAQAAINTAQLN
VNNLSGAVSAANQVITQKQAEMTPLKNELAAANQRVQETLKFINDPIRSRIHFNMRSGLIRAQHNVDTKQNEINAAVANR
DALNSQLSQANNILQNARNEKSAADAALSAATAQRLQAEAALRAAAEAAEKARQRQAEEAERQRQAMEVAEKAKDERELL
EKTSELIAGMGDKIGEHLGDKYKAIAKDIADNIKNFQGKTIRSFDDAMASLNKITANPAMKINKADRDALVNAWKHVDAQ
DMANKLGNLSKAFKVADVVMKVEKVREKSIEGYETGNWGPLMLEVESWVLSGIASSVALGIFSATLGAYALSLGVPAIAV
GIAGILLAAVVGALIDDKFADALNNEIIRPAH
>P0ACU0 ~~~cecR~~~HTH-type transcriptional dual regulator CecR~~~COG1309
MNNPAMTIKGEQAKKQLIAAALAQFGEYGMNATTREIAAQAGQNIAAITYYFGSKEDLYLACAQWIADFIGEQFRPHAEE
AERLFAQPQPDRAAIRELILRACRNMIKLLTQDDTVNLSKFISREQLSPTAAYHLVHEQVISPLHSHLTRLIAAWTGCDA
NDTRMILHTHALIGEILAFRLGKETILLRTGWTAFDEEKTELINQTVTCHIDLILQGLSQRSL
>P0AE60 ~~~cedA~~~Cell division activator CedA~~~
MKKPLRQQNRQIISYVPRTEPAPPEHAIKMDSFRDVWMLRGKYVAFVLMGESFLRSPAFTVPESAQRWANQIRQEGEVTE
>Q5LH66 5.1.3.11~~~bfce~~~Cellobiose 2-epimerase~~~COG2942
MDEILKQEMQKELTTRILPYWMERMVDQENGGFYGRITGQEELMPRADKGAILNARILWTYSAAYRLLGREEYKEMANRA
KRYLIDHFYDSEFGGVYWSLNYRGEPLDTKKQIYAIGFAIYGLSEFHRATGDPEALMYAVRLFNDIESHSFDGLKNGYCE
ALTREWNEIADMRLSEKDANERKTMNTHLHILEPYTNLYRVWKDARLERQLYNLIGLFTEKILDKDTSHLQLFFDNDWQS
KYPVVSYGHDIEASWLLHEAARVLGDAGLIAEIEPVVKKIAAAASEGLTSDGGMIYEKNLTTGHIDGDYHWWVQAETVVG
YYNLFRYFGDRGALQHSIDCWEFIKRHLTDDVHGEWFWSLRADGSLNRDDDKAGFWKCPYHNGRMCIELLGE
>B8DZK4 5.1.3.11~~~~~~Cellobiose 2-epimerase~~~COG2942
MDLKVLKSEIFEHLNNKIIPFWEELKDENNGGYISYVGFDLKPDPYAPKGLVLTSRILWFFSRLYNQLRKEEFINFADHS
YKFLIKSFLDKENKGFYWMVDYKGEPIDKRKHLYGQAFVLYGLSEYYKATQKKESLDLALEIYKIIEEVCKNDVGYKEEF
DEKWNPKENIIVSEYGIICERSMNTLLHILEAYTNLFTATYDQSIKKKIEDLIILFKEKIYDSKTNHLYVFFDKKMNPII
DAISYGHDIEATWLIDEALRYIDNNKLIKEMSEINLKIAEKVLEEAFESGSLLNERVRGIVDKNRIWWVQAEALVGFLNA
YQKSKLDKFLKAVFELWEFIKDFLVDKRAQGEWFWKLDENYIPSPMPEVDLWKCPYHNGRMCLEVIKRI
>B3XZI5 5.1.3.11~~~~~~Cellobiose 2-epimerase~~~
MKNEVVYKQLTEKILPFWNAMRDDENGGFYGYMSEDLHIDDHADKGCILNSRILWFYSTAYMYLQDEKLLDNAKHAFEFL
KTYCFDPMCGGIFWSVRYNGKPADTTKHTYNQAFAIYALSAYYEATGSIEAIAIAEIIYEKIEDTMRDTKGYLEAFTRDF
RPADNDKLSENGVMAERTMNTLLHIIEAYSALVHALRKKVADPAKGDVRDELFMNVVENKLAAALELMRDKFYNSDRHRL
DVFFDKEYESLIDLTSYGHDIEASWLLEWAAGILDDEEITESLHPISSDLVEKVYKEAFDGHSIVNECEDGDVNTDRIWW
VEAESVLGFLKAFEREGKEEYRKAAHEILAFILDKQVDKREGSEWFEMLKEDGTPCHKPMVREWKCPYHNGRMCLEILKS
GIEIG
>F8WRK9 5.1.3.11~~~ce~~~Cellobiose 2-epimerase~~~
MSTETIPDVRRLRALQAEVHEELTENILKFWATRTHDPVHGGFVGRVGPDGRPHPEAPRGAILNARILWTFAAAYRQLGT
PLYREMAERAYRYFVRHFVDAEHGGVYWMVAADGRPLDTRKHVYAQSFAIYALSEWHRATGGEAALALARSIYDLIETHC
ADRVHGGYVEACDRAWRPLEDARLSAKDAPEPRSMNTHLHVLEAYANLYRVWPETELAARLQALIELFLRAIYHPATGHL
ILFFDERWRPRSRAVSFGHDIEASWLLLEAVDVLGQATLRPRVQQASLHLARATLAEGRAPDGSLYYEIGEQGHLDTDRH
WWPQAEALVGFLNAYQESGEVLFYEAAEDVWRYIRERQRDTRGGEWFARVRDDGAPYPDDKVDFWKGPYHNGRACLEAIQ
RLRHLLEHVRSR
>P0DKY4 5.1.3.11~~~ce-ne1~~~Cellobiose 2-epimerase~~~
MMISEIRQELTDHIIPFWNKLRDDENGGFYGYLSYGLGLDKKADKGVILHSRILWFYSNAYMTLGGDELLDNAKHAYEFI
KNNCIDYEYGGVYWMMDFEGKPADTMKHTYNIAFAIYALSSYYRASGDKEALALAYRPFEDIEKNTLYEYGYREAFDRQW
RLVDNEALSENGLKADKTMNAILHLIEAYTELYKADGNEKVADRLKFQLGQMRDIVYTPDTNALKVFFDTAFNLVGDIHS
YGHDIEATWLMDRACDVLGDEDLKKQFAEMDLKISHNIQDIALEDGALNNERDKNEIDKTRVWWVQAEAVVGFINAYQHS
GDEKFLESAKSVWENIKEYIIDKREGGEWYSEVTFDHTPHDYKETVGPWKCPYHNGRMCMEVITRGVDI
>P18549 5.1.1.17~~~cefD~~~Isopenicillin N epimerase~~~COG0520
MAVADWEEARGRMLLDPTVVNLNTGSGGPLPRSAFERVTGFRAHLAAEPMDFLLREVPALLWQARESLARLIGGDPLRLA
LATNVTAAVNLVASSLRLEAPGEILLSDDEYTPMRWCWERVARRHGLELRTFRLPELPSDPAEITAAAVAAMGPRTRLFF
FSHVVSTTGLILPAAELCEEARARGITTVVDGAHAPGFLDLDLSRIPCDFYAGSGHKWLLAPTGVGFLHLAPGRLEELEP
TQVSWAYEPPEGSGPPAARDRFGSTPGLRRLECEGTRDICPWLATPESIDFQAELGPGAIRARRRELTDHARRLLADRPG
RTLLTPDSPELSGGMVAYRLPPGTDAAELRRGLWERFRIEAAVAEQPPGPVLRISANFYTTEEEIDRLADALDALTGE
>P18548 1.14.20.1~~~cefE~~~Deacetoxycephalosporin C synthase~~~COG3491
MDTTVPTFSLAELQQGLHQDEFRRCLRDKGLFYLTDCGLTDTELKSAKDLVIDFFEHGSEAEKRAVTSPVPTMRRGFTGL
ESESTAQITNTGSYSDYSMCYSMGTADNLFPSGDFERIWTQYFDRQYTASRAVAREVLRATGTEPDGGVEAFLDCEPLLR
FRYFPQVPEHRSAEEQPLRMAPHYDLSMVTLIQQTPCANGFVSLQAEVGGAFTDLPYRPDAVLVFCGAIATLVTGGQVKA
PRHHVAAPRRDQIAGSSRTSSVFFLRPNADFTFSVPLARECGFDVSLDGETATFQDWIGGNYVNIRRTSKA
>P42220 1.14.11.26~~~cefF~~~Deacetoxycephalosporin C hydroxylase~~~
MADTPVPIFNLAALREGADQEKFRECVTGMGVFYLTGYGAGDKDHRLATDTAMDFFANGTEAEKAAVTTDVPTMRRGYSA
LEAESTAQVTRTGSYTDYSMSFSMGISGNVFPSPEFERVWTEYFDKLYAAAQETARLVLTASGGYDAEIVGSLDELLDAD
PVLRLRYFPEVPEHRSAEHEPRRMAPHYDLSIITFIHQTPCANGFVSLQAEIGGELVSLPVVEDAVVVMCGAMAPLATQG
ALPAPRHHVRSPGAGMREGSDRTSSVFFLRPTTDFSFSVAKARSYGLAVDLDMETATFGDWIGTNYVTMHAKNEPQAG
>P06716 ~~~cia~~~Colicin-Ia~~~
MSDPVRITNPGAESLGYDSDGHEIMAVDIYVNPPRVDVFHGTPPAWSSFGNKTIWGGNEWVDDSPTRSDIEKRDKEITAY
KNTLSAQQKENENKRTEAGKRLSAAIAAREKDENTLKTLRAGNADAADITRQEFRLLQAELREYGFRTEIAGYDALRLHT
ESRMLFADADSLRISPREARSLIEQAEKRQKDAQNADKKAADMLAEYERRKGILDTRLSELEKNGGAALAVLDAQQARLL
GQQTRNDRAISEARNKLSSVTESLNTARNALTRAEQQLTQQKNTPDGKTIVSPEKFPGRSSTNHSIVVSGDPRFAGTIKI
TTSAVIDNRANLNYLLSHSGLDYKRNILNDRNPVVTEDVEGDKKIYNAEVAEWDKLRQRLLDARNKITSAESAVNSARNN
LSARTNEQKHANDALNALLKEKENIRNQLSGINQKIAEEKRKQDELKATKDAINFTTEFLKSVSEKYGAKAEQLAREMAG
QAKGKKIRNVEEALKTYEKYRADINKKINAKDRAAIAAALESVKLSDISSNLNRFSRGLGYAGKFTSLADWITEFGKAVR
TENWRPLFVKTETIIAGNAATALVALVFSILTGSALGIIGYGLLMAVTGALIDESLVEKANKFWGI
>Q8P3J4 2.4.1.321~~~~~~Cellobionic acid phosphorylase~~~COG3459
MSAASPAPDDLATLMAPSADGMRYALYSPTAMPTAGGFLWNRRMMVQLTCRGYATAQFMQPEPAKYAHAPLLEARNFMMP
EQPYYAHHPGRFFYLKDEDTGALYSVPHEPVRAPAETFEFSAGKHDVRWRVRHDGIVVELCVSLPTDDAVELWECRVHNQ
SGRTRRLSLYPYFPIGYMSWMHQSGGYSPELGGIVCRSVTPYQKVDDYFRQRDFKDCTFLLHEQPPVAWDAQQMAFEGEG
GLHAPSAVQAEQLGNHDAHYENPAAALQYRLSLAPDAATVYRFAFGPAKDDAEIAALRARYLSAEGFAAAAQDYAQYLQA
GRGCVQIATPDAALDNLVNHWLPRQVFYHGDVNRLTTDPQTRNYLQDHMGMAYLQPATARAALLHALSQQEPSGAMPDGI
LLVEGAELKYINQVPHTDHCVWLPIFLDAYLAETGDVAVLDAVVRTHDGQALSVAARLDAAMQWLLDARDARGLSFIAQG
DWNDPMNMVGWRGVGVSGWLTVATAYALRLWSGICAANGRSAQATQFGQAVEEVNAAANRELWDGHWYARGITDDGVRFG
IADDEEGRIYLNPQSWALLAGTADAEQRTALLAAVREQLHTPYGPVMLAPAYTHMRDDVGRLTQKWPGAAENGAVYNHAV
AFYLYSLYQIGDADRAWEILRAMLPGPDMADALQRGHLPVSLPNYYRGAWHQYPRTAGRSSQLFNTGTVAWVYRCVLEGL
FGLVGDGDALAVRPQLPSHWPQAQVTRQFRGAQFEVALTREPGRTQLEVQVDGVVSPDQRVHGIVAGRTYQLQVRLPG
>Q9CJ32 ~~~celB~~~PTS system cellobiose-specific EIIC component~~~COG1455
MNGITAWMEKYLVPVAAKIGSQKHLVALRDSFIGMLPATLAGALAAMISAIVTTFPSAIQQMMLGATAFSKLAPEKVWTL
ANTPIIGDLNNISALVNQGTLTVIGLIFAFSWGYNLARAYGVNDLAGGIVSLATLFAGLPNQMGKFTAALGTGKAGVAAT
DKLNGVLGDQGLAAWKPLFASAHLDAGAYFTVIIMGALAVIIYAKLMLADITIKMPESVPPAVAKAFLAIIPTIAALYIV
GLIYYIIGKLTNDSVINLITHYIAEPFQILSQNIFSVLIVTLFVSVFWFFGLHGPNVLAPVLDGIWGPLGLNNQALYFQV
HSQGIRDLIAKGAVDKAHAINGDYVNLWVRGSWDAFAWFGGSGGTITLVIAIILFSKRKDYKIVGRLGLAPGIFNINEPV
LFGLPVVLNAIFFIPFAVAPLISVIIAYTATALHLVDPVVNAVPWVTPPIMNAFMATGFDWRAIVLTIINLIITFVIWVP
FVIAANKLEETELD
>A0A0U4EBH5 3.2.1.4~~~celDZ1a~~~Cellulase CelDZ1~~~
MNKWHINKWYFFVGMLVIFAVIISLILKDTSLTFSSYDREKFPHLIGNSMVKKPSLAGRLKIIEIDGRKTLGDQHGNPIQ
LRGMSTHGLQWFPQIINNNAFSALSKDWEANVIRLAMYVGEGGYSTDPSVKEKVIEGINLAIKNDMYVIVDWHILNPGDP
NAKIYSGAKEFFKEIASKYPNDLHIIYELANEPNPTESDITNDIAGWEKVKKYAEPIIKMLRDMGNENIIIVGNPEWSTR
PDLAVNDPIDDKNVMYSAHFYTGSASVWENGNKGHIARNIEKALENGLTVFVTEWGTSEASGDGGPYLNEADEWLEFLNS
NNISWVNWSLANKNEASAAFLPTTSLDPGNGKVWAVNQLSLSGEYVRARIKGIPYKPISRETMGK
>P10477 ~~~celE~~~Cellulase/esterase CelE~~~COG2730
MKKIVSLVCVLVMLVSILGSFSVVAASPVKGFQVSGTKLLDASGNELVMRGMRDISAIDLVKEIKIGWNLGNTLDAPTET
AWGNPRTTKAMIEKVREMGFNAVRVPVTWDTHIGPAPDYKIDEAWLNRVEEVVNYVLDCGMYAIINVHHDNTWIIPTYAN
EQRSKEKLVKVWEQIATRFKDYDDHLLFETMNEPREVGSPMEWMGGTYENRDVINRFNLAVVNTIRASGGNNDKRFILVP
TNAATGLDVALNDLVIPNNDSRVIVSIHAYSPYFFAMDVNGTSYWGSDYDKASLTSELDAIYNRFVKNGRAVIIGEFGTI
DKNNLSSRVAHAEHYAREAVSRGIAVFWWDNGYYNPGDAETYALLNRKTLSWYYPEIVQALMRGAGVEPLVSPTPTPTLM
PTPSPTVTANILYGDVNGDGKINSTDCTMLKRYILRGIEEFPSPSGIIAADVNADLKINSTDLVLMKKYLLRSIDKFPAE
DSQTPDEDNPGILYNGRFDFSDPNGPKCAWSGSNVELNFYGTEASVTIKSGGENWFQAIVDGNPLPPFSVNATTSTVKLV
SGLAEGAHHLVLWKRTEASLGEVQFLGFDFGSGKLLAAPKPLERKIEFIGDSITCAYGNEGTSKEQSFTPKNENSYMSYA
AITARNLNASANMIAWSGIGLTMNYGGAPGPLIMDRYPYTLPYSGVRWDFSKYVPQVVVINLGTNDFSTSFADKTKFVTA
YKNLISEVRRNYPDAHIFCCVGPMLWGTGLDLCRSYVTEVVNDCNRSGDLKVYFVEFPQQDGSTGYGEDWHPSIATHQLM
AERLTAEIKNKLGW
>P0C2S1 3.2.1.91~~~celK~~~Cellulose 1,4-beta-cellobiosidase~~~
MNFRRMLCAAIVLTIVLSIMLPSTVFALEDKSSKLPDYKNDLLYERTFDEGLCFPWHTCEDSGGKCDFAVVDVPGEPGNK
AFRLTVIDKGQNKWSVQMRHRGITLEQGHTYTVRFTIWSDKSCRVYAKIGQMGEPYTEYWNNNWNPFNLTPGQKLTVEQN
FTMNYPTDDTCEFTFHLGGELAAGTPYYVYLDDVSLYDPRFVKPVEYVLPQPDVRVNQVGYLPFAKKYATVVSSSTSPLK
WQLLNSANQVVLEGNTIPKGLDKDSQDYVHWIDFSNFKTEGKGYYFKLPTVNSDTNYSHPFDISADIYSKMKFDALAFFY
HKRSGIPIEMPYAGGEQWTRPAGHIGIEPNKGDTNVPTWPQDDEYAGRPQKYYTKDVTGGWYDAGDHGKYVVNGGIAVWT
LMNMYERAKIRGIANQGAYKDGGMNIPERNNGYPDILDEARWEIEFFKKMQVTEKEDPSIAGMVHHKIHDFRWTALGMLP
HEDPQPRYLRPVSTAATLNFAATLAQSARLWKDYDPTFAADCLEKAEIAWQAALKHPDIYAEYTPGSGGPGGGPYNDDYV
GDEFYWAACELYVTTGKDEYKNYLMNSPHYLEMPAKMGENGGANGEDNGLWGCFTWGTTQGLGTITLALVENGLPATDIQ
KARNNIAKAADRWLENIEEQGYRLPIKQAEDERGGYPWGSNSFILNQMIVMGYAYDFTGNSKYLDGMQDGMSYLLGRNGL
DQSYVTGYGERPLQNPHDRFWTPQTSKKFPAPPPGIIAGGPNSRFEDPTITAAVKKDTPPQKCYIDHTDSWSTNEITVNW
NAPFAWVTAYLDEIDLITPPGGVDPEEPEVIYGDCNGDGKVNSTDAVALKRYILRSGISINTDNADVNADGRVNSTDLAI
LKRYILKEIDVLPHK
>P15244 1.5.1.24~~~ceo~~~N(5)-(carboxyethyl)ornithine synthase~~~
MKIGLVKANFPGERRVPLLPKDIKDFKNEILVEEGFGKFLDIDDQEYSDKGCHILSRAEVFAESEAIFSLKLIQPTDYYH
LREGQMIIGWTHPFGSGQSFMKEQALPKKLIVVDLDSNSPCIYYENEIFESGIPKGLLYKNSFYAGYAGVLDALLQYGLI
PTEETKIAILGSGNVAQGAFSSISKYSSNIRMYYRKTMSIFKENYTKYDIIINGIEIGKDDDPILSFSEQKSLKKGTLII
DVAADAGNTIEGSHFTSIDAPIYENAGKYYYVVPNTPSLIYRNVSQELSKILSENIFRKDCSRFIEKVKPLNK
>A5HZ59 1.5.1.24~~~ceo~~~N(5)-(carboxyethyl)ornithine synthase~~~
MKLGFLIPNHPNEKRVALLPEHVKGFNNELVIETGFGETLGISDAEYVKVGCTIASREEIFKTCEGIFSLKVLKPQDYKH
IREGQIIVGWTHPEGSGKIFMEEQGIPKNLIIVDLDNIHPSIYYKDYVIPMEWIPSNFVRKNSYIAGYASTMHAVMNYGS
IPTSETKVAILGSGNVSQGAFSAISKFNPDIRMFYRKTMNQLKDELEEFDIIINGIEMDNPNKHILTLEDQMRLKKNCLI
IDAAANLGKAIEGARHTTASDPIYNKDGKYYYAVNNSPSIFYRQSSKAISEAFSKHVYSKELEFYLDVIAEVEEMIV
>A0A0H3C8X7 1.-.-.-~~~cerR~~~Ceramide reductase~~~
MATDARGVVAITGATGFLGRHLVRALAQDGWRPRVLVRRDPVHPFWRDLEVEVVTGDLGTPRALDRLAKGAEVFIHVAGL
IKARTLEGFNRVNQDGARAAAEAARAAGARFILVSSLAAREPSLSNYAASKRAGEDAVRAADPSALIVRPPAIYGPGDTE
TLGLFQLAARSPVLPVLSQTSRVAMIHVEDAAAKLVAFCRTPVLGLVELSDVRRDGYTWTEIMRGAAHVMGAKPRLIRLP
DPGILTAGALVDAWSSLTNTPSVFGLGKARELLHTDWTPSSAPMAEGVPSKFGLIDGFTHTVDWYRAAGWLPKNIVA
>A0A0H3C8X0 2.3.1.-~~~bcerS~~~Bacterial ceramide synthase~~~
MPFDSTNADLSVIPVKTPAELKRFIALPARLNAKDPNWITPLFMERTDALTPKTNPFFDHAEVQLFLATRGGRDVGRISA
QIDQLTPQPTEGRLDGHFGMIAAEDDPAVFNVLFRAAEDWLRARGRTHAVGPFNLSINEEVGLLVWGFDTPPMVLMGHDP
VYAGPRVEEQGYAKAQDLFAYKADETGDIPEIAQRRVKRGLPSGVVLRQLDMSRYDQEVQTLTEILNDAWSDNWGFTPTT
EAETRQLAKSLKQVIDQRLVWFSEIDGEAAGVVVFLPNVNEAIADLKGKLLPFGWAKLLWRLKVKGVKSARIPLMGVKKK
FQTSQRGRMLPFWMMKASRDMAMSLGYNRYEISWVLEANKAMTHIAENVGGTHYKTYRVYEKAL
>Q9RLB8 ~~~cesA~~~Multidomain esterase~~~
MKKHFVVGETIKRFLRIGTSLALSISTLSLLPSAPRLSSAAGTIKIMPLGDSITYGMADEGGYRKYLSYFLQQKGYTNVD
LVGPEGKDSASFNYNGQSVKYDDNHAGYSGYTITNLPGGWFGQLNGILETMQGGDYIKKYSPDIILLQIGTNDVSNGHLD
GSEERLHKLLDYLRENMPSNGKVFLTTIPDLGNSGWGGNSNGDIAKYNELIKKVANDYSSKNVIYADIHSVIDASKDLAD
GVHPNAGGYEKMGKYWLEQIEGYLKASDGPQQTQPTQPSQGDSGPELIYGDLDGDKTITSFDAVIMRKGLINDFKDNNVK
KAADIDQNGKAEVADLVQLQSFIIGKIKEFTVAEKTVTEKPVFEKSYNFPAVNQLKSSKDIPDPFIFMDGSKVESTDDWW
KRQSEISCMYEYYMYGKWIDGSDDETTYSISGNSMTINVKRKSTGKTASFKAVINLPKNVRHEGGAPVILGMHKGISEST
ATSNGYAVITYDSDGMFSAPGTAQDNNQHKGAFYDLYPYGRNWDEQTGDLMAWSWGISRILDALYNGAAKELNINPDSSI
VTGVSRYGKAASVCGAFDTRIKMCAPSCSGAGGLALYRYSSVGKTYDFSSKGGSSSYTYKENEPLGSLQASGEQGWFNGR
FMEFRNAEQFPMDQHMLGALCCDPDRYLFIIGSCESEDWVNAPSVWMAYLGMKHVWDYVGISDHLAINIHKSGHAVIAED
IEKMVQYFDYHVYGIQPKMNLEELQTSVFALPKNKDSFADTFASKWLY
>P21244 ~~~cesT~~~Tir chaperone~~~
MSSRSELLLDRFAEKIGVGSISFNENRLCSFAIDEIYYISLSDANDEYMMIYGVCGKFPTDNPNFALEILNANLWFAENG
GPYLCYESGAQSLLLALRFPLDDATPEKLENEIEVVVKSMENLYLVLHNQGITLENEHMKIEEISSSDNKHYYAGR
>P58233 ~~~cesT~~~Tir chaperone~~~
MSSRSELLLEKFAEKIGIGSISFNENRLCSFAIDEIYYISLSDANDEYMMIYGVCGKFPTDNSNFALEILNANLWFAENG
GPYLCYEAGAQSLLLALRFPLDDATPEKLENEIEVVVKSMENLYLVLHNQGITLENEHMKIEEISSSDNKHYYAGR
>Q47015 ~~~cesT~~~Tir chaperone~~~
MSSRSELLLDRFAEKIGIGSISFNENRLCSFAIDEIYYISLSDANDEYMMIYGVCGKFPTDNPNFALEILNANLWFAENG
GPYLCYESGAQSLLLALRFPLDDATPEKLENEIEVVVKSMENLYLVLHNQGITLENEHMKIEEISSSDNKYYYAGR
>A1YPR2 4.2.3.152~~~cetA~~~2-epi-5-epi-valiolone synthase~~~
MANQWQAQAEQTITYEVQMTDGVLDPSNRALLDAGATVRTDQPRRFIVIDANVHEIYGDALRKYLAHHNCEYRLCVLSAS
EEAKTMESVFTVVDGLDSFGISRRHEPIIAIGGGIVLDIAGLAASMYRRSTPYVRVPTSLIGLVDAGVGIKTGVNFGSHK
NRLGTYFAPTAALLDRGFLDTVDDRHISNGLAEILKIALVKDAELFRLMEEHAELLLAERLTGRTPTGDVVAREVFSRAV
GGMLEELEPNLWEQELERLVDYGHSFSPTLEMRALPALLHGEAVTVDMALTTVLAEARGLVSTSDRERIFQVMRRLRLPV
WHPLLEAGLLEHALRETTRHRDGLQRMPIPVGIGGARFLHDLTVAELTGAAESLRELGGGE
>A1YPR3 5.1.3.33~~~cetB~~~2-epi-5-epi-valiolone epimerase~~~
MTGRGIPGAVAVHHVAYTVPDLDQAVEFFTEVIGAELAYTLVQDAAGDWMTRKLDVDATATARIAMLRLGPVTNLELFEY
AAPDQRRQLPRNSDWGGHHLAIHVADVDAAAEYLRAQPGVRVLGDPETITDGPIAGDRWVYFATPWGMQLELINLPAGAP
FEQQTEVRLYQPEGSWSDHRGAS
>A2TJI4 ~~~cexE~~~Protein CexE~~~
MKKYILGVILAMGSLSAIAGGGNSERPPSVAAGECVTFNSKLGEIGGYSWKYSNDACNETVAKGYAIGVAMHRTVNYEGG
YSIQSSGIVKPGSDFIMKGGKTYKGHKKVSAGGDTPYWYK
>P40942 3.2.1.4~~~xynB~~~Thermostable celloxylanase~~~
MNKFLNKKWSLILTMGGIFLMATLSLIFATGKKAFNDQTSAEDIPSLAEAFRDYFPIGAAIEPGYTTGQIAELYKKHVNM
LVAENAMKPASLQPTEGNFQWADADRIVQFAKENGMELRFHTLVWHNQTPTGFSLDKEGKPMVEETDPQKREENRKLLLQ
RLENYIRAVVLRYKDDIKSWDVVNEVIEPNDPGGMRNSPWYQITGTEYIEVAFRATREAGGSDIKLYINDYNTDDPVKRD
ILYELVKNLLEKGVPIDGVGHQTHIDIYNPPVERIIESIKKFAGLGLDNIITELDMSIYSWNDRSDYGDSIPDYILTLQA
KRYQELFDALKENKDIVSAVVFWGISDKYSWLNGFPVKRTNAPLLFDRNFMPKPAFWAIVDPSRLRE
>P25393 ~~~cfaD~~~CFA/I fimbrial subunit D~~~
MDFKYTEEKEMIKINNIMIHKYTVLYTSNCIMDIYSEEEKITCFSNRLVFLERGVNISVRIQKKILSERPYVAFRLNGDI
LRHLKNALMIIYGMSKVDTNDCRGMSRKIMTTEVNKTLLDELKNINSHDDSAFISSLIYLISKIENNEKIIESIYISSVS
FFSDKVRNVIEKDLSRKWTLGIIADAFNVSEITIRKRLESENTNFNQILMQLRMSKAALLLLENSYQISQISNMIGISSA
SYFIRVFNKHYGVTPKQFFTYFKGG
>P25734 ~~~cfaE~~~CFA/I fimbrial subunit E~~~
MNKILFIFTLFFSSGFFTFAVSADKNPGSENMTNTIGPHDRGGSSPIYNILNSYLTAYNGSHHLYDRMSFLCLSSQNTLN
GACPSSDAPGTATIDGETNITLQFTEKRSLIKRELQIKGYKQFLFKNANCPSKLALNSSHFQCNREQASGATLSLYIPAG
ELNKLPFGGVWNAVLKLNVKRRYDTTYGTYTINITVNLTDKGNIQIWLPQFKSNARVDLNLRPTGGGTYIGRNSVDMCFY
DGYSTNSSSLEIRFQDDNSKSDGKFYLKKINDDSKELVYTLSLLLAGKNLTPTNGQALNINTASLETNWNRITAVTMPEI
SVPVLCWPGRLQLDAKVKNPEAGQYMGNIKITFTPSSQTL
>P0A9H7 2.1.1.79~~~cfa~~~Cyclopropane-fatty-acyl-phospholipid synthase~~~COG2230
MSSSCIEEVSVPDDNWYRIANELLSRAGIAINGSAPADIRVKNPDFFKRVLQEGSLGLGESYMDGWWECDRLDMFFSKVL
RAGLENQLPHHFKDTLRIAGARLFNLQSKKRAWIVGKEHYDLGNDLFSRMLDPFMQYSCAYWKDADNLESAQQAKLKMIC
EKLQLKPGMRVLDIGCGWGGLAHYMASNYDVSVVGVTISAEQQKMAQERCEGLDVTILLQDYRDLNDQFDRIVSVGMFEH
VGPKNYDTYFAVVDRNLKPEGIFLLHTIGSKKTDLNVDPWINKYIFPNGCLPSVRQIAQSSEPHFVMEDWHNFGADYDTT
LMAWYERFLAAWPEIADNYSERFKRMFTYYLNACAGAFRARDIQLWQVVFSRGVENGLRVAR
>P9WIR3 ~~~~~~Putative glyoxylase CFP32~~~COG3324
MPKRSEYRQGTPNWVDLQTTDQSAAKKFYTSLFGWGYDDNPVPGGGGVYSMATLNGEAVAAIAPMPPGAPEGMPPIWNTY
IAVDDVDAVVDKVVPGGGQVMMPAFDIGDAGRMSFITDPTGAAVGLWQANRHIGATLVNETGTLIWNELLTDKPDLALAF
YEAVVGLTHSSMEIAAGQNYRVLKAGDAEVGGCMEPPMPGVPNHWHVYFAVDDADATAAKAAAAGGQVIAEPADIPSVGR
FAVLSDPQGAIFSVLKPAPQQ
>P9WIR1 ~~~cfp6~~~Low molecular weight protein antigen 6~~~
MAHFAVGFLTLGLLVPVLTWPVSAPLLVIPVALSASIIRLRTLADERGVTVRTLVGSRAVRWDDIDGLRFHRGSWARATL
KDGTELRLPAVTFATLPHLTEASSGRVPNPYR
>Q56336 ~~~cfpA~~~Cytoplasmic filament protein A~~~
MASLDLPKSPNVFHPEKPSAVGSRNSLAQDCRDQQQEVNQLIEEETNKILHHLNTKLPKEVLERLDVMGGLKEKLYNYFN
QNYQNMFNRYMVTAEDEMLKKVRGFIDREEMKVLNRYTPKEIAILLDEVAGADKFNTGEIEKSMVNMYGHLQGHIQRGVN
ELETHTNSLLRQKVDVGAFVRGENAYAVVKCAFKDNLARPKTVTDVKLSINILDSELVSPIFHYQTTVAYLIKDLISNHY
IDAIDKEIDRVKDELIDQGKEEMSDSSIIFEKMKMVSDFTDDDCENPDSKRYELISRELMERISNLRAEIDPETFDQLNV
RENIKKIVDLENIRNRGFNTAINSITSILDTSRMGYQYIENFKNARELILREYDDTDISNLPDERYQLRLKYLDNAQLIE
ERKGYEVMLRSFETEVDHLWDVLRTKYDKSKASRFMAKITDFDDLAKVYKKHIKKHYKDKTGEPVYEDIAKVWDEIAFVK
PAETEVERMNRTFVYEKDKMRRKLILMRGKLKGMYDYQYPIERRVMEERLAFLESEFNRFDYLVNPFHLQPGLLLDIDIT
SIKRKKATLDGMANVLNEFLHGISKGFADAAFASFSRRRSTVRADIGQSFASDGSADQKESSGRVAFMDMVNETPALESS
VAAEQVDVRSDVGMKTRKAGAVDAGKGRRGRRSAIREL
>Q9FBG4 2.1.1.224~~~cfr~~~Ribosomal RNA large subunit methyltransferase Cfr~~~
MNFNNKTKYGKIQEFLRSNNEPDYRIKQITNAIFKQRISRFEDMKVLPKLLREDLINNFGETVLNIKLLAEQNSEQVTKV
LFEVSKNERVETVNMKYKAGWESFCISSQCGCNFGCKFCATGDIGLKKNLTVDEITDQVLYFHLLGHQIDSISFMGMGEA
LANRQVFDALDSFTDPNLFALSPRRLSISTIGIIPSIKKITQEYPQVNLTFSLHSPYSEERSKLMPINDRYPIDEVMNIL
DEHIRLTSRKVYIAYIMLPGVNDSLEHANEVVSLLKSRYKSGKLYHVNLIRYNPTISAPEMYGEANEGQVEAFYKVLKSA
GIHVTIRSQFGIDIDAACGQLYGNYQNSQ
>A5HBL2 2.1.1.224~~~cfr~~~Ribosomal RNA large subunit methyltransferase Cfr~~~
MNFNNKTKYGKIQEFLRSNNEPDYRIKQITNAIFKQRISRFEDMKVLPKLLREDLINNFGETVLNIKLLAEQNSEQVTKV
LFEVSKNERVETVNMKYKAGWESFCISSQCGCNFGCKFCATGDIGLKKNLTVDEITDQVLYFHLLGHQIDSISFMGMGEA
LANRQVFDALDSFTDPNLFALSPRRLSISTIGIIPSIKKITQEYPQVNLTFSLHSPYSEERSKLMPINDRYPIDEVMNIL
DEHIRLTSRKVYIAYIMLPGVNDSLEHANEVVSLLKSRYKSGKLYHVNLIRYNPTISAPEMYGEANEGQVEAFYKVLKSA
GIHVTIRSQFGIDIDAACGQLYGNYQNSQ
>P80200 ~~~cagA~~~Cytotoxicity-associated immunodominant antigen~~~
MTNETIDQQPQTEAAFNPQQFINNLQVAFLKVDNAVASYDPDQKPIVDKNDRDNRQAFEGISQLREEYSNKAIKNPTKKN
QYFSDFINKSNDLINKDNLIDVESSTKSFQKFGDQRYRIFTSWVSHQNDPSKINTRSIRNFMENIIQPPILDDKEKAEFL
KSAKQSFAGIIIGNQIRTDQKFMGVFDESLKERQEAEKNGEPTGGDWLDIFLSFIFDKKQSSDVKEAINQEPVPHVQPDI
ATTTTDIQGLPPEARDLLDERGNFSKFTLGDMEMLDVEGVADIDPNYKFNQLLIHNNALSSVLMGSHNGIEPEKVSLLYG
GNGGPGARHDWNATVGYKDQQGNNVATIINVHMKNGSGLVIAGGEKGINNPSFYLYKEDQLTGSQRALSQEEIQNKIDFM
EFLAQNNAKLDNLSEKEKEKFRTEIKDFQKDSKAYLDALGNDRIAFVSKKDTKHSALITEFGNGDLSYTLKDYGKKADKA
LDREKNVTLQGSLKHDGVMFVDYSNFKYTNASKNPNKGVGVTNGVSHLEVGFNKVAIFNLPDLNNLAITSFVRRNLEDKL
TTKGLSPQEANKLIKDFLSSNKELVGKTLNFNKAVADAKNTGNYDEVKKAQKDLEKSLRKREHLEKEVEKKLESKSGNKN
KMEAKAQANSQKDEIFALINKEANRDARAIAYAQNLKGIKRELSDKLENVNKNLKDFDKSFDEFKNGKNKDFSKAEETLK
ALKGSVKDLGINPEWISKVENLNAALNEFKNGKNKDFSKVTQAKSDLENSVKDVIINQKVTDKVDNLNQAVSVAKATGDF
SRVEQALADLKNFSKEQLAQQAQKNESLNARKKSEIYQSVKNGVNGTLVGNGLSQAEATTLSKNFSDIKKELNAKLGNFN
NNNNNGLKNEPIYAKVNKKKAGQAASLEEPIYAQVAKKVNAKIDRLNQIASGLGVVGQAAGFPLKRHDKVDDLSKVGLSR
NQELAQKIDNLNQAVSEAKAGFFGNLEQTIDKLKDSTKHNPMNLWVESAKKVPASLSAKLDNYATNSHIRINSNIKNGAI
NEKATGMLTQKNPEWLKLVNDKIVAHNVGSVPLSEYDKIGFNQKNMKDYSDSFKFSTKLNNAVKDTNSGFTQFLTNAFST
ASYYCLARENAEHGIKNVNTKGGFQKS
>Q9KLR1 3.1.4.-~~~~~~3'3'-cGAMP-specific phosphodiesterase 1~~~COG2206
MRWSEIGCTMKSVNIEWNVNLRQAFFCIARALDSVGVDDINHGHRVGYMAYSCAQAMEWSEEECQLVFALGLIHDCGVAQ
KRDFYRLLENMQPDNTQQHCVRGNELLSNCPPLAPFADAILYHHTPWDELKNIAISDRNKRFAALIFLADRVDYLKELYP
RDEYGNVTQEARNQVCLEIGRLSGSLFERDLVRTMQHLLSKEFIWFSMEHHHIEAMGHNLPSTPFFEQKLGVEEIMSIAM
LMANVVDAKSQFTFQHSQKVAELCQHLAKELGLNVEMQKALYLTGLVHDIGKLHTPEEILHKPGKLNESEYLCIQRHSTD
SRYTLQMVFGQSVVCEWAGNHHERLDGSGYPRGLQGAAIDLPSRIIAIADVFQALTQARPYRGSMSLNEVMNIMRHEVSC
GRLDSQVFDVIVRNSQQYYQLSIAESPTEWA
>Q9KMV8 3.1.4.-~~~~~~3'3'-cGAMP-specific phosphodiesterase 2~~~COG0784
MKWFKYGDGMDLFADMRQEAAGEKERVVMHSQEPWCVLLVDDDEQMHQITRLALTGFKFQNRPLELISVLSGLEARKVMA
ERSDIALALVDVVMETEHAGLDLVRYIREELQNRQVRLVLRTGQAGQAPEDRVIKEYEIDDYKEKTELTTQKLRTLLYSM
LRAYRDLCLIEDQKLGLSHVIEASANVQNTKSLQSYATAVLNQLTSLLKLHASAFYCVATPCPDSEKCNALTVATTAERV
ELYVESPFKGLPEDVQRRCKEVLSQRTTRDYGDAYVFFKQDERGVDSVLYVGFEQELSELDRKLLEIYMYNIGLTFENIN
LMVDLRETSKELVYNLANAVEARSRETGAHVQRVALYCERLAHLYGLAESEADMIKNASPLHDVGKVAIPDSILHKPGKL
DAQEWAIMQKHVEYGVEILNRSKRRLMQVAKEIAATHHEKWDGSGYPNRLQGDDIPISGRITAIADVFDALGAKRSYKDP
WTDEQIREELMAQKGRHFEPKLVELLLEHWDEFIAIRASLPD
>Q9KL18 3.1.4.-~~~~~~3'3'-cGAMP-specific phosphodiesterase 3~~~COG2206
MSVAQNTFPLSELMISLTTALDMTEGQPPEHCIRCCWIGMHIGMQLELSEPELHDLFFTLLLKDAGCSSNAARICELYAT
DDLTFKRRYKTVGTSLSSVINFIVKNTGSEQSWTERILTTIDILKNGNDYAQELIQTRCTRGADVARELRFSEAVAQGIH
SLDEHWNGQGRPEQRKGEAIPLFSRIALLAQVFDVFQMEHSIEEALQEIMARSGVWFDPKLVEVVEQLVENPRFLSGLKA
TDISQRVMNLPPAQAHLPLDDAYLECIVTAFGKIVDAKSPYTAGHSERVAVYTDLIARQLAISDADRIWLRRAALLHDIG
KLGVSNAILDKPGKLDEAEWRAVQAHAAYTEQILYKLSPFKTLARMAGAHHEKLDGTGYPRGVNGDEISLMTRIITTADI
FDALSAERPYRAAMPIDKALAIMEENLHTAIDPECFAALKKALNLLPDEYTQLPHSSDKT
>O32253 ~~~cggR~~~Central glycolytic genes regulator~~~COG2390
MNQLIQAQKKLLPDLLLVMQKRFEILQYIRLTEPIGRRSLSASLGISERVLRGEVQFLKEQNLVDIKTNGMTLTEEGYEL
LSVLEDTMKDVLGLTLLEKTLKERLNLKDAIIVSGDSDQSPWVKKEMGRAAVACMKKRFSGKNIVAVTGGTTIEAVAEMM
TPDSKNRELLFVPARGGLGEDVKNQANTICAHMAEKASGTYRLLFVPGQLSQGAYSSIIEEPSVKEVLNTIKSASMLVHG
IGEAKTMAQRRNTPLEDLKKIDDNDAVTEAFGYYFNADGEVVHKVHSVGMQLDDIDAIPDIIAVAGGSSKAEAIEAYFKK
PRNTVLVTDEGAAKKLLRDE
>Q8NS43 3.1.1.-~~~~~~Probable esterase Cgl0839~~~COG2021
MLDNSFYTAEVQGPYETASIGRLELEEGGVIEDCWLAYATAGTLNEDKSNAILIPTWYSGTHQTWFQQYIGTDHALDPSK
YFIISINQIGNGLSVSPANTADDSISMSKFPNVRIGDDVVAQDRLLRQEFGITELFAVVGGSMGAQQTYEWIVRFPDQVH
RAAPIAGTAKNTPHDFIFTQTLNETVEADPGFNGGEYSSHEEVADGLRRQSHLWAAMGFSTEFWKQEAWRRLGLESKESV
LADFLDPLFMSMDPNTLLNNAWKWQHGDVSRHTGGDLAAALGRVKAKTFVMPISEDMFFPVRDCAAEQALIPGSELRVIE
DIAGHLGLFNVSENYIPQIDKNLKELFES
>Q9F5I8 3.2.1.157~~~cgiA~~~Iota-carrageenase~~~
MRLYFRKLWLTNLFLGGALASSAAIGAVSPKTYKDADFYVAPTQQDVNYDLVDDFGANGNDTSDDSNALQRAINAISRKP
NGGTLLIPNGTYHFLGIQMKSNVHIRVESDVIIKPTWNGDGKNHRLFEVGVNNIVRNFSFQGLGNGFLVDFKDSRDKNLA
VFKLGDVRNYKISNFTIDDNKTIFASILVDVTERNGRLHWSRNGIIERIKQNNALFGYGLIQTYGADNILFRNLHSEGGI
ALRMETDNLLMKNYKQGGIRNIFADNIRCSKGLAAVMFGPHFMKNGDVQVTNVSSVSCGSAVRSDSGFVELFSPTDEVHT
RQSWKQAVESKLGRGCAQTPYARGNGGTRWAARVTQKDACLDKAKLEYGIEPGSFGTVKVFDVTARFGYNADLKQDQLDY
FSTSNPMCKRVCLPTKEQWSKQGQIYIGPSLAAVIDTTPETSKYDYDVKTFNVKRINFPVNSHKTIDTNTESSRVCNYYG
MSECSSSRWER
>Q9F284 3.2.1.157~~~cgiA~~~Iota-carrageenase~~~
MKLQFKPVYLASIAIMAIGCTKEVTENDTSEISEVPTELRAAASSFYTPPGQNVRANKKNLVTDYGVNHNDQNDDSSKLN
LAIKDLSDTGGILTLPKGKYYLTKIRMRSNVHLEIEKGTVIYPTKGLTPAKNHRIFDFASKTEEKIENASIVGKGGKFIV
DLRGNSSKNQIVADVGNVTNFKISNFTIKDEKTIFASILVSFTDKAGNAWPHKGIIENIDQANAHTGYGLIQAYAADNIL
FNNLSCTGGVTLRLETDNLAMKTAKKGGVRDIFATKIKNTNGLTPVMFSPHFMENGKVTIDDVTAIGCAYAVRVEHGFIE
IFDKGNRASADAFKNYIEGILGAGSVEVVYKRNNGRTWAARIANDFNEAAYNHSNPAVSGIKPGKFATSKVTNVKATYKG
TGAKLKQAFLSYLPCSERSKVCRPGPDGFEYNGPSLGVTIDNTKRDNSLGNYNVNVSTSSVQGFPNNYVLNVKYNTPKVC
NQNLGSITSCN
>P43478 3.2.1.83~~~cgkA~~~Kappa-carrageenase~~~
MKPISIVAFPIPAISMLLLSAVSQAASMQPPIAKPGETWILQAKRSDEFNVKDATKWNFQTENYGVWSWKNENATVSNGK
LKLTTKRESHQRTFWDGCNQQQVANYPLYYTSGVAKSRATGNYGYYEARIKGASTFPGVSPAFWMYSTIDRSLTKEGDVQ
YSEIDVVELTQKSAVRESDHDLHNIVVKNGKPTWMRPGSFPQTNHNGYHLPFDPRNDFHTYGVNVTKDKITWYVDGEIVG
EKDNLYWHRQMNLTLSQGLRAPHTQWKCNQFYPSANKSAEGFPTSMEVDYVRTWVKVGNNNSAPGEGQSCPNTFVAVNSV
QLSAAKQTLRKGQSTTLESTVLPNCATNKKVIYSSSNKNVATVNSAGVVKAKNKGTATITVKTKNKGKIDKLTIAVN
>Q05JY7 3.2.1.162~~~cglA~~~Lambda-carrageenase~~~
MKIKILSAMIASSLLIGCVIPTVKASQSAIKSIETNRTITKVRTGMLSGGSSIITTSYEGTVAAYKFNGEKLWENELSGF
MNHDIWVQDINGDGLVEIFAANADGNVYCINSDGSLKWTFGLNEVPMNSVTVISDADEKYVVAGGYDKNLYYISANGELL
KTIESSAYSEEGVFGDGVKPEARTHTVNFVRPVKSSDGTEKLVVLGTNNSLQSSGRFYIFEPFADLPSEKSRISIKKGIG
DLRTVDFDNDGNDELTLGNSAQIGDAAISVMNLDDLSQKKSQINDIARRIDRFGYRVAQTEVVMNEGTPTYLTLFGSRIL
LTPESFDVNDSEILANKYSYYDIWKDKSSNKLVLASAQSGGSQVHIIDTSNPSWKSAYEELEPQGKLAAIQENTREVERQ
LSNFQKPTRERAPLPVYFISESRNEIPATIERSESLYDSPVFLNYSTLPNVENWDRSEVLADNPKYRDKRDRRKNYTLSS
EEMFNKLSAGYESSDGISQWAGHGNDPYMISLATMKRIISSGDGKKTVNIYPEIEGHGDAFNKVLNDHFYPLAEFSSENN
ANLFMRNKHTFWQSTIYAPEWSELRSGRLADAFVPAMEETTDKSMEMSVAGRMGLWAAGSVDNWGERYARDNPSFDRLRQ
HSHQMVPNHALRQIIYKIASGARYINNFGFNQEYMSLAWELIGKGALYVPKREELLSLSPVHISMKEPDPIYRETSNNVK
WTTFYDEEKDSIPYVFSRLNGTWPGAKTLPWDYSNYAADTKERRLDFIPKFPKGLVLITPVQQGKFKDEGTVRGTLADNM
HPIYKDIMKEYITDGKNYYNANGEQVMAADSVRYRQIKNKIEEKSNLLPMTVSGEAAWVVAQSARKHLRLTLVDSGYLNP
SNKVAKVKFNSVTPVAIVDVLSGETFSPDSNGVVEIPVLAGAFRFIDVKITEDLRNMQSSTL
>Q0JRK4 3.2.1.162~~~cglA~~~Lambda-carrageenase~~~
MKIKILSAMVASSLLIGCVIPTVKASQSAIKSIETNRTITKVRTGMLSGGSSIITTSYEGTVAAYKFNGEKLWENELSGF
MNHDIWVQDINGDGLVEIFAANADGNVYCINSDGSLKWTFGLNEVPMNSVTVISDADKKYVVAGGYDKNLYYISTNGELL
KTIESGTYSEEGVFGDGVKPEARTHTVNFVRPVKSSDGTEKLVVLGTNNSLQSSGRFYIFEPFADLPSEKSRISIKKGIG
DLRTVDFDNDGNDELTLGNSAQIGDAAISVMNLDDLSQKKSQINDIARRIDRFGYRVAQTEVVMNEGTPTYLTLFGSRIL
LTPESFDVNDSEILANKYSYYDMWKDKSSNKLVLASAQSGGSQVHIIDTSNPSWKSAYEELEPQGKLAAIQENTRAIERQ
LSNFQKPTRERAPLPVYFISESRNEIPTTIERSEFLYDSPVFLNYSTLPNVENWDRSEVLADNPKYRDKRDRRKNYTLSS
EEMFNKLSAGYDNSDGISQWAGHGNDPYMISLATMKRIISSGDGKKTVNIYPEIEGHGDAFNKVLSDHFYPLAEFSSENN
ANLFMRNKHTFWQSTIYAPEWSELRSGRLADAFVPAMEETTDKSMEMSVAGRMGLWAAGSVDNWGERYARDNPSFDRLRQ
HSHQMVPNHALRQIIYKIASGARYINNFGFNQEYMSLAWELIGKGALYVPKREELLSLSPVHISMKEPDPIYRETSNNVK
WTTFYDEEKDSIPYVFSRLNGTWPGAKTLPWDYSNYAADTKERRLDFIPKFPKGLVLITPVQQGKFKDEGTVRGTLADNM
HPIYKDIMKEYITDGKNYYNPNGEQVMAADSVRYRQIKNKIEEKSNLLPMTVSGEAAWVVAQSAEKHLRLTLVDSGYLNP
SNKVAKVKFNSVTPVAIVDVLSGETFSPDSNGVVEIPVLAGAFRFIDVKITEDLRNMQSSTL
>P37126 ~~~xcg~~~Chorionicgonadotropic hormone-like protein~~~
MRPWVPVTGASPHAAVGPAWTTGRRCVGVVTTWWWSVSYVERQVQIIDSLYRAMDASAENGYTDAACRFRYLPEEDGSLG
IDSSFFYTIGGVSVSALLNDYGDKGCADLVYDLHDVMYKDIRFGESIRSRMCPRGVNPVPSAPTAPPAGLAAFPSANRRT
ERCLVWIACVHRLSAVGEAGSLNEPEAARLANDPAEHSNRHVDALGRQQLVCAHVVAGVASDGFGVRAGEERTGARGRHS
GSIKPLDNLVVETPIEAARSRHRCAGRPCPRRHRRPCNASKSHRPMRMQQRDRGWTVWQKDFSMLTPLSAGFQHRVLAQQ
LVARRGRMQPSVVQIGVGRIRADVAGRVAGQDALGRIGRILPNAVARLSIAVRFLRAQSINHSIQSKRDRRSWNGCSKGQ
RRRGFISIDKKPGPKNKLALMFIREPDRFISIDRNPARRTSWL
>P32397 1.3.3.15~~~cgoX~~~Coproporphyrinogen III oxidase~~~COG1232
MSDGKKHVVIIGGGITGLAAAFYMEKEIKEKNLPLELTLVEASPRVGGKIQTVKKDGYIIERGPDSFLERKKSAPQLVKD
LGLEHLLVNNATGQSYVLVNRTLHPMPKGAVMGIPTKIAPFVSTGLFSLSGKARAAMDFILPASKTKDDQSLGEFFRRRV
GDEVVENLIEPLLSGIYAGDIDKLSLMSTFPQFYQTEQKHRSLILGMKKTRPQGSGQQLTAKKQGQFQTLSTGLQTLVEE
IEKQLKLTKVYKGTKVTKLSHSGSCYSLELDNGVTLDADSVIVTAPHKAAAGMLSELPAISHLKNMHSTSVANVALGFPE
GSVQMEHEGTGFVISRNSDFAITACTWTNKKWPHAAPEGKTLLRAYVGKAGDESIVDLSDNDIINIVLEDLKKVMNINGE
PEMTCVTRWHESMPQYHVGHKQRIKELREALASAYPGVYMTGASFEGVGIPDCIDQGKAAVSDALTYLFS
>P9WMP1 1.3.3.15~~~cgoX~~~Coproporphyrinogen III oxidase~~~COG1232
MTPRSYCVVGGGISGLTSAYRLRQAVGDDATITLFEPADRLGGVLRTEHIGGQPMDLGAEAFVLRRPEMPALLAELGLSD
RQLASTGARPLIYSQQRLHPLPPQTVVGIPSSAGSMAGLVDDATLARIDAEAARPFTWQVGSDPAVADLVADRFGDQVVA
RSVDPLLSGVYAGSAATIGLRAAAPSVAAALDRGATSVTDAVRQALPPGSGGPVFGALDGGYQVLLDGLVRRSRVHWVRA
RVVQLERGWVLRDETGGRWQADAVILAVPAPRLARLVDGIAPRTHAAARQIVSASSAVVALAVPGGTAFPHCSGVLVAGD
ESPHAKAITLSSRKWGQRGDVALLRLSFGRFGDEPALTASDDQLLAWAADDLVTVFGVAVDPVDVRVRRWIEAMPQYGPG
HADVVAELRAGLPPTLAVAGSYLDGIGVPACVGAAGRAVTSVIEALDAQVAR
>Q2FXA5 1.3.3.15~~~cgoX~~~Coproporphyrinogen III oxidase~~~COG1232
MTKSVAIIGAGITGLSSAYFLKQQDPNIDVTIFEASNRPGGKIQSYRKDGYMIELGPESYLGRKTIMTELAKDIGLEQDI
VTNTTGQSYIFAKNKLYPIPGGSIMGIPTDIKPFVTTKLISPLGKLRAGFDLLKKPTQMQDGDISVGAFFRARLGNEVLE
NLIEPLMGGIYGTDIDKLSLMSTFPNFKEKEEAFGSLIKGMKDEKNKRLKQRQLYPGAPKGQFKQFKHGLSSFIEALEQD
VKNKGVTIRYNTSVDDIITSQKQYKIVYNDQLEEVYDGVLVTTPHQVFLNWFGQDPAFDYFKTMDSTTVATVVLAFDEKD
IENTHDGTGFVIARTSDTDITACTWTSKKWPFTTPEGKVLIRAYVGKPGDTVVDDHTDNELVSIVRRDLSQMMTFKGDPE
FTIVNRLPKSMPQYHVGHIQQIRQIQAHIKQTYPRLRVTGASFEAVGLPDCITQGKVAAEEVIAEL
>C8WLM0 ~~~cgr1~~~Cytochrome c-type protein Cgr1~~~COG3303
MAEEPVVIGDPAPRTRKWPIVVGVVVVVLIAAGAGFWVWHEQPSFCAAICHTPMDEYLETYEQEAGTAGVDKWGNEVANT
NAMLAVSHKAQGKDCMACHVPTLSEQMSEGMNWVTGNYVYPLEERDTEMLTEARGVDADEFCLNESCHNLTRDDLIKATS
DMEFNPHQPQHGEIECSECHKAHRASVMYCTQCHSEAEVPEGWLTVAEANKLSTAA
>C8WLM1 1.3.2.-~~~cgr2~~~Digoxin reductase~~~COG1053
MEYGKCRGIERGMGRRDFLKAATLLGATAAGAGMLAGCAPKSASEAQAQTAPAATGGLDPADVDWKYETDVVIVGSGSGG
TCAAIEAAEAGADVVVFEKDKAMYGGNSALCGGYMLAAGWSTQEEITGYAGDTGEAFANQMLRWSQGLGNQDMIREACLR
SGEAVDWMMDTGRTYEGASPLPPVWSCGDTEADVVPRSVYNHNAYGATEGHMATLKKRAESLSNIEIEMGCEVAHILKNA
EGSVIGVQLADGSFAKARKGVVMACASVDNNLEMSKDLGLMQNVWGLTLEGAGLLAPGNPDMDSNTGDGVRMLREIGAEL
CMQQAVCMNDSIYVGGISDWGMSEILGKDVNIHDSSNIDAILVDKTGRRFCQDDAEWGYVMHECAQAAWKQGFTPDDPTT
GYIFYVYDATGAPFFEMKGHTPDTCDTTFSADSVDGLAEFIGCDPTALASEVERWNSFCEAGLDADFGRRANMAPIATPP
FYCDVVRPGPMGTFAGAKSNVEAEIIGLDGNPIPRLYGAGCIIGGNVSGAFYFGCGWSITNTVVWGREAGRNVAALEPWE
>P77828 ~~~groES1~~~Co-chaperonin GroES 1~~~COG0234
MHFRPLHDRVLVRRIDAEEKTAGGIIIPDTAKEKPQEGEIIAAGSGGRNEQGQLIPIDVKPGDRVLFGKWSGTEVKIDGQ
DYLIMKESDLLGVVDKTGSVKKAA
>P35473 ~~~groES1~~~Co-chaperonin GroES 1~~~COG0234
MASTNFRPLHDRVVVRRVESEEKTKGGIIIPDTAKEKPQEGEIVAVGSGARDESGKVVPLDVKAGDRILFGKWSGTEVKI
NGEDLLIMKEADIMGVIG
>P35863 ~~~groES2~~~Co-chaperonin GroES 2~~~COG0234
MKFRPLHDRVVVKRIDAEEKTAGGIIIPDTVKEKPSQGEVIAVGPGGRDESGKLIPIDVRVGDRVLFGKWSGTEVKIDTQ
ELLIMKESDIMGVLADVSSKKKAA
>P35864 ~~~groES3~~~Co-chaperonin GroES 3~~~COG0234
MKFRPLHDRVVVKRIDAEEKTAGGIIIPDTAKEKPSQGEVIAVGPGGHDDSGKLIPIDIEVGDRVLFGKWSGTEVKIDGQ
DLLIMKESDVMGVLTDVFSKKKAA
>P35474 ~~~groES5~~~Co-chaperonin GroES 5~~~COG0234
MAFRPLHDRILVRRVESEEKTKGGIIIPDTAKEKPQEGEVLAVGPGARGEQGQIQPLDVKVGDRILFGKWSGTEIKIDGE
DLLIMKESDVMGIIEARAAEKIAA
>P48223 ~~~groES~~~Co-chaperonin GroES~~~COG0234
MNIRPLGDRVVVKMVETEETTKSGIVLPGSAKEKPQVAEVVAVGPGTVVDGKEVKMEVKVGDKVIISKYAGTEVKFDGQE
YTILRQNDILAVVE
>P31295 ~~~groES~~~Co-chaperonin GroES~~~
MNIRPLHDRVVVRRMEEERLSAGGIVIPDSATEKPIQGEIIAVGHGKILDNGSVRALDVKVGDSVLFGKYSGTEVKLDGK
EFLVMREEDIMAVVEG
>P28599 ~~~groES~~~Co-chaperonin GroES~~~COG0234
MLKPLGDRVVIELVESEEKTASGIVLPDSAKEKPQEGKIVAAGSGRVLESGERVALEVKEGDRIIFSKYAGTEVKYEGTE
YLILRESDILAVIG
>B2SCZ5 ~~~groES~~~Co-chaperonin GroES~~~
MADIKFRPLHDRVVVRRVESEAKTAGGIIIPDTAKEKPQEGEVVAAGAGARDEAGKLVPLDVKAGDRVLFGKWSGTEVKI
GGEDLLIMKESDILGIVG
>P0A342 ~~~groES~~~Co-chaperonin GroES~~~COG0234
MADIKFRPLHDRVVVRRVESEAKTAGGIIIPDTAKEKPQEGEVVAAGAGARDEAGKLVPLDVKAGDRVLFGKWSGTEVKI
GGEDLLIMKESDILGIVG
>P0A343 ~~~groES~~~Co-chaperonin GroES~~~
MADIKFRPLHDRVVVRRVESEAKTAGGIIIPDTAKEKPQEGEVVAAGAGARDEAGKLVPLDVKAGDRVLFGKWSGTEVKI
GGEDLLIMKESDILGIVG
>B8H164 ~~~groES~~~Co-chaperonin GroES~~~
MKFRPLGDRVLVKRVEEETKTKGGIIIPDTAKEKPQEGEVVAVGPGARNDKGDVVALDVKAGDRILFGKWSGTEVKVDGQ
DLLIMKESDVLGVVEA
>P25969 ~~~groES~~~Co-chaperonin GroES~~~
MAFKPLHDRVLVRRVQSDEKTKGGLIIPDTAKEKPAEGEVVSCGEGARKDSGELIAMSVKAGDRVLFGKWSGTEVTIDGA
ELLIMKESDILGILS
>P0A6F9 ~~~groES~~~Co-chaperonin GroES~~~COG0234
MNIRPLHDRVIVKRKEVETKSAGGIVLTGSAAAKSTRGEVLAVGNGRILENGEVKPLDVKVGDIVIFNDGYGVKSEKIDN
EEVLIMSESDILAIVEA
>P94797 ~~~groES~~~Co-chaperonin GroES~~~
MNIRPLQDRVLVRRAEEEKKSAGGIILTGNAQEKPSQGEVVAVGNGKKLDNGTTLPMDVKVGDKVLFGKYSGSEVKVGDE
TLLMMREEDIMGIIA
>Q07200 ~~~groES~~~Co-chaperonin GroES~~~
MLKPLGDRVVIEVIETEEKTASGIVLPDTAKEKPQEGRVVAVGKGRVLDSGERVAPEVEVGDRIIFSKYAGTEVKYDGKE
YLILRESDILAVIG
>O50304 ~~~groES~~~Co-chaperonin GroES~~~COG0234
MLKPLGDRVVIEQVETEEKTASGIVLPDTAKEKPQEGRVVAVGTGRVTENGEKIALEVKEGDSVIFSKYAGTEVKYDGKE
YLILRESDILAIIG
>P0A0R3 ~~~groES~~~Co-chaperonin GroES~~~COG0234
MKFQPLGERVLVERLEEENKTSSGIIIPDNAKEKPLMGVVKAVSHKISEGCKCVKEGDVIAFGKYKGAEIVLDGTEYMVL
ELEDILGIVGSGSCCHTGNHDHKHAKEHEACCHDHKKH
>P26879 ~~~groES~~~Co-chaperonin GroES~~~COG0234
MKIRPLHDRVVVRRMEEERTTAGGIVIPDSATEKPMRGEIIAVGAGKVLENGDVRALAVKVGDVVLFGKYSGTEVKVDGK
ELVVMREDDIMGVIEK
>P61436 ~~~groES~~~Co-chaperonin GroES~~~
MASIKPLGDRVLVEPRQEAEEKIGSIFVPDTAKEKPQEGKVVEIGSGKYEDGKLIPLEVKVGDTVLYGKYSGTEIKSEGK
EYLIIRESDILAVVKK
>P15020 ~~~groES~~~Co-chaperonin GroES~~~
MAKVNIKPLEDKILVQANEAETTTASGLVIPDTAKEKPQEGTVVAVGPGRWDEDGEKRIPLDVAEGDTVIYSKYGGTEIK
YNGEEYLILSARDVLAVVSK
>P24301 ~~~groES~~~Co-chaperonin GroES~~~COG0234
MAKVKIKPLEDKILVQAGEAETMTPSGLVIPENAKEKPQEGTVVAVGPGRWDEDGAKRIPVDVSEGDIVIYSKYGGTEIK
YNGEEYLILSARDVLAVVSK
>A0QSS3 ~~~groES~~~Co-chaperonin GroES~~~COG0234
MASVNIKPLEDKILVQANEAETTTASGLVIPDTAKEKPQEGTVVAVGPGRWDEDGEKRIPLDVAEGDTVIYSKYGGTEIK
YNGEEYLILSARDVLAVVSK
>P9WPE5 ~~~groES~~~Co-chaperonin GroES~~~COG0234
MAKVNIKPLEDKILVQANEAETTTASGLVIPDTAKEKPQEGTVVAVGPGRWDEDGEKRIPLDVAEGDTVIYSKYGGTEIK
YNGEEYLILSARDVLAVVSK
>P80469 ~~~groES~~~Co-chaperonin GroES~~~COG0234
MSFKPLHDRIAIKPIENEEKTKGGIIIPDTAKEKPMQGEIVAVGNGVLNKNGEIYPLELKVGDKVLYGKWAGTEIEIKGE
KLIVMKESDVFGIIN
>P99104 ~~~groES~~~Co-chaperonin GroES~~~
MLKPIGNRVIIEKKEQEQTTKSGIVLTDSAKEKSNEGVIVAVGTGRLLNDGTRVTPEVKEGDRVVFQQYAGTEVKRDNET
YLVLNEEDILAVIE
>P0A014 ~~~groES~~~Co-chaperonin GroES~~~
MLKPIGNRVIIEKKEQEQTTKSGIVLTDSAKEKSNEGVIVAVGTGRLLNDGTRVTPEVKEGDRVVFQQYAGTEVKRDNET
YLVLNEEDILAVIE
>Q97NV3 ~~~groES~~~Co-chaperonin GroES~~~COG0234
MLKPLGDRVVLKIEEKEQTVGGFVLAGSAQEKTKTAQVVATGQGVRTLNGDLVAPSVKTGDRVLVEAHAGLDVKDGDEKY
IIVGEANILAIIEE
>Q05971 ~~~groES~~~Co-chaperonin GroES~~~COG0234
MAAISINVSTVKPLGDRVFVKVSPAEEKTAGGILLPDNAKEKPQIGEVVQVGPGKRNDDGTYSPVEVKVGDKVLYSKYAG
TDIKLGGDDYVLLTEKDILASVA
>Q60023 ~~~groES~~~Co-chaperonin GroES~~~
MRLKPLGDRVVVKVIQAEEVTKGGVILPGTAKEKPQQGEVVAVGTGEYIDGKKVELEVKVGDRVIFSKYAGTEVKLDGEE
YLLLRESDILAIIE
>P61492 ~~~groES~~~Co-chaperonin GroES~~~COG0234
MAAEVKTVIKPLGDRVVVKRIEEEPKTKGGIVLPDTAKEKPQKGKVIAVGTGRVLENGQRVPLEVKEGDIVVFAKYGGTE
IEIDGEEYVILSERDLLAVLQ
>P61493 ~~~groES~~~Co-chaperonin GroES~~~COG0234
MAAEVKTVIKPLGDRVVVKRIEEEPKTKGGIVLPDTAKEKPQKGKVIAVGTGRVLENGQRVPLEVKEGDIVVFAKYGGTE
IEIDGEEYVILSERDLLAVLQ
>Q2LQN9 1.3.8.10~~~~~~Cyclohex-1-ene-1-carbonyl-CoA dehydrogenase~~~COG1960
MKGPIKFNALSLQGRSVMSNQSNDTTITQRRDTMNELTEEQKLLMEMVRNLAVREIAPRAIEIDENHSFPVHARDLFADL
GLLSPLVPVEYGGTGMDITTFAMVLEEIGKVCASTALMLLAQADGMLSIILDGSPALKEKYLPRFGEKSTLMTAFAATEP
GAGSDLLAMKTRAVKKGDKYVINGQKCFITNGSVADILTVWAYTDPSKGAKGMSTFVVERGTPGLIYGHNEKKMGMRGCP
NSELFFEDLEVPAENLVGEEGKGFAYLMGALSINRVFCASQAVGIAQGALERAMQHTREREQFGKPIAHLTPIQFMIADM
ATEVEAARLLVRKATTLLDAKDKRGPLIGGMAKTFASDTAMKVTTDAVQVMGGSGYMQEYQVERMMREAKLTQIYTGTNQ
ITRMVTGRSLLFPS
>Q9AMJ8 5.6.1.7~~~groEL1~~~Chaperonin GroEL 1~~~
MAKRIIYNENARRALERGIDILAEAVAVTLGPKGRNVVLEKKYGAPQIVNDGVTIAKEIELEDHIENTGVALIRQAASKT
NDVAGDGTTTATVLAHAIVKEGLRNVAAGANAILLKRGIDKATNFLVDRIREHARSVEDSKAIAQVGAISAGNDDEVRQM
IAEALDKVGKEAVISLEEGKSVTTELEVTEGMRFDKGYISPYFATDPERMEAIFDEPFLAVDDKQIALVQDLVPVLEPVA
RAGRPLVIIAEDIEKEALATLVVNRLRGVLNVAAVKAPGFGDRRKAMLEDIAILTGGQLITEDAGLKLDNTKLDSLGKAR
RITITKDSTTIVAEGNDVAVKARVEQIRRQMEETESSYDKGKLQERLAKLSGGVAVVKVGAATETEMKDKKLRLEDAINA
TKAAVEEGIVPGGGTTLAHLTPELEAWANSTLKDEELTGALIVARALPAPLKRIAENAGQNGAVIAERVKEKEFNVDFNA
ATNEFVDMFSAGIVDPAKVTRSALQNALSYACMVLTTGTVVDKPEPKDAAPAGVGGGGGDFDY
>P77829 5.6.1.7~~~groEL1~~~Chaperonin GroEL 1~~~COG0459
MAAKEVKFSTDARDRVLRGVDTLANAVKVTLGPKGRNVVIEKSFGAPRITKDGVTVAKEIELEDKFENMGAQMVREVASK
TSDLAGDGTTTATVLAQAIVKEGAKSVAAGMNPMDLKRGIDLAVEAIVNDLKAHAKKVTTNEEIAQIATISANGDIEIGR
FLADAMQKVGNDGVITVEEAKSLDTELEVVEGMQFDRGYASPYFVTNAEKMRVEFEDPYILIHEKKLSTLQSMLPLLEAV
VQSGKPLLVVAEDVEGEALATLVVNRLRGGLKVAAVKAPGFGDRRKAMLEDIAILTGGQAISEDLGIKLENVTLKMLGRA
KKVVIDKENTTIVNGAGSKKDIEARVTQIKMQIEETTSDYDREKLQERLAKLAGGVAVIRVGGATEVEVKERKDRVDDAM
HATRAAVEEGILPGGGVALLRGLKALDAIKTVNADQKAGVDIVRRAIQVPARQIVQNAGEDGSLVVGKLLENSSYNWGFN
AASGEYQDLAKAGVIDPAKVVRTALQDAASVAALLITTEALIAEKPKKSEPAPAAPPMDF
>P20110 5.6.1.7~~~groEL1~~~Chaperonin GroEL 1~~~
MAAKDVKFDTDARDRMLRGVNILADAVKVTLGPKGRNVVIDKSFGAPRITKDGVSVAKEIELSDKFENMGAQMVKEVASR
TNDEAGDGTTTATVLAQAIIKEGLKAVAAGMNPMDLKRGIDLATSKVVEAIKAAARPVNDSHEVAQVGTISANGEAQIGR
FIADAMQKVGNEGVITVEENKGLETEVEVVEGMQFDRGYLSPYFVTNADKMTAELDDVYILLHEKKLSSLQPMVPLLEAV
IQSQKPLLIIAEDVEGEALATLVVNKLRGGLKIAAVKAPGFGDRRKAMLQDIAILTGGQVISEDLGMKLENVTIDMLGRA
KKISINKDNTTIVDGNGDKAEIDARVAQIRNQIEETSSDYDREKLQERVAKLAGGVAVIRVGGMTEVEVKERKDRVDDAL
NATRAAVQEGIVVGGGVALIQGGKALDGLTGENPDQNAGITIVRRALEAPLRQIAQNAGVDGSVVAGKVRESNEKSFGFN
AQTEEYGDMFKFGVIDPAKVVRTALEDAASVASLLITTEAMIADKPEPKSPAGGPGMGGMGGMDGMM
>P15599 5.6.1.7~~~groEL1~~~Chaperonin GroEL 1~~~COG0459
MAAKNIKYNEDARKKIHKGVKTLAEAVKVTLGPKGRHVVIDKSFGSPQVTKDGVTVAKEIELEDKHENMGAQMVKEVASK
TADKAGDGTTTATVLAEAIYSEGLRNVTAGANPMDLKRGIDKAVKVVVDEIKKISKPVQHHKEIAQVATISANNDAEIGN
LIAEAMEKVGKNGSITVEEAKGFETVLDVVEGMNFNRGYLSSYFSTNPETQECVLEEALVLIYDKKISGIKDFLPVLQQV
AESGRPLLIIAEDIEGEALATLVVNRLRAGFRVCAVKAPGFGDRRKAMLEDIAILTGGQLISEELGMKLENTTLAMLGKA
KKVIVSKEDTTIVEGLGSKEDIESRCESIKKQIEDSTSDYDKEKLQERLAKLSGGVAVIRVGAATEIEMKEKKDRVDDAQ
HATLAAVEEGILPGGGTALVRCIPTLEAFIPILTNEDEQIGARIVLKALSAPLKQIAANAGKEGAIICQQVLSRSSSEGY
DALRDAYTDMIEAGILDPTKVTRCALESAASVAGLLLTTEALIADIPEEKSSSAPAMPGAGMDY
>P31681 5.6.1.7~~~groEL1~~~Chaperonin GroEL 1~~~COG0459
MAAKNIKYNEEARKKIHKGVKTLAEAVKVTLGPKGRHVVIDKSFGSPQVTKDGVTVAKEIELEDKHENMGAQMVKEVASK
TADKAGDGTTTATVLAEAIYSEGLRNVTAGANPMDLKRGIDKAVKVVVDELKKISKPVQHHKEIAQVATISANNDSEIGN
LIAEAMEKVGKNGSITVEEAKGFETVLDVVEGMNFNRGYLSSYFSTNPETQECVLEDALILIYDKKISGIKDFLPVLQQV
AESGRPLLIIAEEIEGEALATLVVNRLRAGFRVCAVKAPGFGDRRKAMLEDIAILTGGQLVSEELGMKLENTTLAMLGKA
KKVIVTKEDTTIVEGLGNKPDIQARCDNIKKQIEDSTSDYDKEKLQERLAKLSGGVAVIRVGAATEIEMKEKKDRVDDAQ
HATIAAVEEGILPGGGTALVRCIPTLEAFLPMLANEDEAIGTRIILKALTAPLKQIASNAGKEGAIICQQVLARSANEGY
DALRDAYTDMIDAGILDPTKVTRSALESAASIAGLLLTTEALIADIPEEKSSSAPAMPSAGMDY
>P0A519 5.6.1.7~~~groEL1~~~Chaperonin GroEL 1~~~
MSKLIEYDETARRAMEVGMDKLADTVRVTLGPRGRHVVLAKAFGGPTVTNDGVTVAREIELEDPFEDLGAQLVKSVATKT
NDVAGDGTTTATILAQALIKGGLRLVAAGVNPIALGVGIGKAADAVSEALLASATPVSGKTGIAQVATVSSRDEQIGDLV
GEAMSKVGHDGVVSVEESSTLGTELEFTEGIGFDKGFLSAYFVTDFDNQQAVLEDALILLHQDKISSLPDLLPLLEKVAG
TGKPLLIVAEDVEGEALATLVVNAIRKTLKAVAVKGPYFGDRRKAFLEDLAVVTGGQVVNPDAGMVLREVGLEVLGSARR
VVVSKDDTVIVDGGGTAEAVANRAKHLRAEIDKSDSDWDREKLGERLAKLAGGVAVIKVGAATETALKERKESVEDAVAA
AKAAVEEGIVPGGGASLIHQARKALTELRASLTGDEVLGVDVFSEALAAPLFWIAANAGLDGSVVVNKVSELPAGHGLNV
NTLSYGDLAADGVIDPVKVTRSAVLNASSVARMVLTTETVVVDKPAKAEDHDHHHGHAH
>A1KPA8 5.6.1.7~~~groEL1~~~Chaperonin GroEL 1~~~
MSKLIEYDETARRAMEVGMDKLADTVRVTLGPRGRHVVLAKAFGGPTVTNDGVTVAREIELEDPFEDLGAQLVKSVATKT
NDVAGDGTTTATILAQALIKGGLRLVAAGVNPIALGVGIGKAADAVSEALLASATPVSGKTGIAQVATVSSRDEQIGDLV
GEAMSKVGHDGVVSVEESSTLGTELEFTEGIGFDKGFLSAYFVTDFDNQQAVLEDALILLHQDKISSLPDLLPLLEKVAG
TGKPLLIVAEDVEGEALATLVVNAIRKTLKAVAVKGPYFGDRRKAFLEDLAVVTGGQVVNPDAGMVLREVGLEVLGSARR
VVVSKDDTVIVDGGGTAEAVANRAKHLRAEIDKSDSDWDREKLGERLAKLAGGVAVIKVGAATETALKERKESVEDAVAA
AKAAVEEGIVPGGGASLIHQARKALTELRASLTGDEVLGVDVFSEALAAPLFWIAANAGLDGSVVVNKVSELPAGHGLNV
NTLSYGDLAADGVIDPVKVTRSAVLNASSVARMVLTTETVVVDKPAKAEDHDHHHGHAH
>A0QSS4 5.6.1.7~~~groEL1~~~Chaperonin GroEL 1~~~COG0459
MSKQIEFNETARRAMEAGVDKLADAVKVTLGPRGRHVVLAKSFGGPQVTNDGVTIAREIDLEDPYENLGAQLVKSVATKT
NDVAGDGTTTATVLAQALVRAGLRNVAAGANPIALGSGISKAADAVSEALLASATPVDDKKAIAQVATVSSRDEQVGELV
GEAMTKVGHDGVVTVEESSTLETYLEVTEGVGFDKGFLSAYFVTDFDSQEAVLEDALVLLHRDKISSLPDLLPLLEKVAE
AGKPLLIVAEDVEGEALSTLVVNAIRKTLKAVAVKAPFFGDRRKAFLDDLAIVTGGQVVNPDVGLLLREVGLEVLGSARR
VVVNKDSTVIVDGGGTAEAIADRVKQIKSEIETTDSDWDREKLQERLAKLAGGVAVIKVGAATETDLKKRKEAVEDAVAA
AKAAVEEGIVTGGGAALVQARSAVEKLRGELSGDEALGVDVFASALSAPLYWIATNAGLDGSVVVNKVSELPKGQGFNAA
TLEFGDLVSAGVVDPAKVTRSAVLNAASVARMILTTETAVVDKPADEDEHGHGHHHGHAH
>P9WPE9 5.6.1.7~~~groEL1~~~Chaperonin GroEL 1~~~COG0459
MSKLIEYDETARRAMEVGMDKLADTVRVTLGPRGRHVVLAKAFGGPTVTNDGVTVAREIELEDPFEDLGAQLVKSVATKT
NDVAGDGTTTATILAQALIKGGLRLVAAGVNPIALGVGIGKAADAVSEALLASATPVSGKTGIAQVATVSSRDEQIGDLV
GEAMSKVGHDGVVSVEESSTLGTELEFTEGIGFDKGFLSAYFVTDFDNQQAVLEDALILLHQDKISSLPDLLPLLEKVAG
TGKPLLIVAEDVEGEALATLVVNAIRKTLKAVAVKGPYFGDRRKAFLEDLAVVTGGQVVNPDAGMVLREVGLEVLGSARR
VVVSKDDTVIVDGGGTAEAVANRAKHLRAEIDKSDSDWDREKLGERLAKLAGGVAVIKVGAATETALKERKESVEDAVAA
AKAAVEEGIVPGGGASLIHQARKALTELRASLTGDEVLGVDVFSEALAAPLFWIAANAGLDGSVVVNKVSELPAGHGLNV
NTLSYGDLAADGVIDPVKVTRSAVLNASSVARMVLTTETVVVDKPAKAEDHDHHHGHAH
>P35469 5.6.1.7~~~groEL1~~~Chaperonin GroEL 1~~~COG0459
MAAKEVKFGRSAREKMLRGVDILADAVKVTLGPKGRNVVIDKSFGAPRITKDGVSVAKEIELEDKFENMGAQMVREVASK
TNDIAGDGTTTATVLAQAIVREGAKAVAAGMNPMDLKRGIDLAVAEVVKDLLAKAKKINTSDEVAQVGTISANGEKQIGL
DIAEAMQKVGNEGVITVEEAKTAETELEVVEGMQFDRGYLSPYFVTNPEKMVADLEDAFILLHEKKLSNLQAMLPVLEAV
VQTGKPLLIIAEDVEGEALATLVVNKLRGGLKIAAVKAPGFGDRRKAMLEDIAILTGGTVISEDLGIKLESVTLDMLGRA
KKVSITKENTTIVDGAGQKSDIEGRVAQIKAQIEETTSDYDREKLQERLAKLAGGVAVIRVGGATEVEVKEKKDRIDDAL
NATRAAVQEGIVPGGGVALLRSSVKITVKGENDDQDAGVNIVRRALQSPARQIVENAGDEASIVVGKILEKNTDDFGYNA
QTGEYGDMIAMGIIDPVKVVRTALQDAASVASLLITTEAMIAELPKKDAPAMPGGMGGMGGMDMM
>Q00767 5.6.1.7~~~groEL1~~~Chaperonin GroEL 1~~~
MAKILKFDEDARRALERGVNQLADTVKVTIGPKGRNVVIDKKFGAPTITNDGVTIAREVECDDPYENLGAQLVKEVATKT
NDIAGDGTTTATVLAQALVREGLRNVAAGASPAALKKGIDAAVAAVSAELLDTARPIDDKSDIAAVAALSAQDKQVGELI
AEAMDKVGKDGVITVEESNTFGVDLDFTEGMAFDKGYLSPYMVTDQERMEAVLDDPYILIHQGKIGSIQDLLPLLEKVIQ
AGGSKPLLIIAEDVEGEALSTLVVNKIRGTFNAVAVKAPGFGDRRKAMLGDMATLTGATVIAEEVGLKLDQAGLDVLGTA
RRVTVTKDDTTIVDGGGNAEDVQGRVAQIKAEIESTDSDWDREKLQERLAKLAGGVCVIRVGAATEVELKERKHRLEDAI
SATRAAVEEGIVSGGGSALVHAVKVLDDNLGRTGDEATGVAVVRRAAVEPLRWIAENAGLEGYVITTKVAELDKGQGFNA
ATGEYGDLVKAGVIDPVKVTRSALENAASIASLLLTTETLVVEKPAEEEPEAGHGHGHSH
>Q05972 5.6.1.7~~~groEL1~~~Chaperonin GroEL 1~~~COG0459
MAKSIIYNDEARRALERGMDILAEAVAVTLGPKGRNVVLEKKFGSPQIINDGITIAKEIELEDHVENTGVSLIRQAASKT
NDVAGDGTTTATVLAHAIVKEGLRNVAAGANPISLKRGIDKATDFLVARIKEHAQPVGDSKAIAQVGAISAGNDEEVGQM
IANAMDKVGQEGVISLEEGKSMTTELEITEGMRFDKGYISPYFVTDAERMEAVLEDPRILITDKKINLVQDLVPILEQVA
RQGKPLLIIAEDIEKEALATLVVNRLRGVLNVAAVKAPGFGDRRKQMLEDIATLTGGQVISEDAGLKLESATVDSLGSAR
RINITKDNTTIVAEGNEAAVKSRCEQIRRQIEETDSSYDKEKLQERLAKLAGGVAVIKVGAATETEMKDRKLRLEDAINA
TKAAVEEGIVPGGGTTLAHLAPQLEDWATGNLKDEELTGALIVARALPAPLKRIAENAGQNGAVISERVKEKEFNVGYNA
ASLEYVDMLAAGIVDPAKVTRSALQNAASIAGMVLTTECIVVDKPEKEKAPAGAPGGDFDY
>Q7WVY0 5.6.1.7~~~groEL2~~~Chaperonin GroEL 2~~~
MAKIISFDEESRRALERGVNALADAVKITLGPKGRNVLLEKKYGTPQIVNDGITVAKEIELEDPLENTGARLIQEVASKT
KDVAGDGTTTATVLVQALIKEGLKNVAAGINPVSLKRGIDKTTEALVEEIAKVAKPVEGSAIAQVATVSAGNDEEVGGMI
AEAVERVTKDGVITVEESKSLTTELDVVEGMHIDRGYISPYFITNNERQTVELENARILITDKKINSIQELVPVLKKVAR
LGQPLLIVAEDVEGDALATLVVNKARGVLSVAAIKAPGFGERRKALLQDIAILTDGQLISEEIGLSLDTASIDALCTART
ITIDKENTTIVAGTTTKPEIQKRIGQIRKQLEETDSEYDKEKLQERIAKLAGGIAVIKVGAVPETELKDRKLRIENALNA
TKAAVAESIGPGGGKTLIYLASKVDPIKAYFEEEEKIGADIVKRALEAPLRQIADNAGEEGSVIVSRVKDSDFNVGYNAA
TGEFEDLIAAGIIDPAKVVRSALQNAASIAGLVLTTEAIVVEKPEKKPAVPADPGMGGMGGMGGMGGMGGMGGMGMF
>P35861 5.6.1.7~~~groEL2~~~Chaperonin GroEL 2~~~COG0459
MSAKEVKFGVDARDRMLRGVDILHNAVKVTLGPKGRNVVLDKSFGAPRITKDGVTVAKEIELEDKFENMGAQMVREVASK
SADAAGDGTTTATVLAAAIVREGAKSVAAGMNPMDLKRGIDMAVEAVVADLVKNSKKVTSNEEIAQVGTISANGDAEIGK
FISDAMKKVGNEGVITVEEAKSLETELEVVEGMQFDRGYISPYFVTNADKMRVEMDDAYVLINEKKLSQLNELLPLLEAV
VQSGKPLVIIAEDVEGEALATLVVNRLRGGLKVAAVKAPGFGDRRKAMLQDIAILTGGQAISEDLGIKLENVTLNMLGRA
KKVMIDKENTTIVSGAGKKADIEARVAQIKAQIEETTSDYDREKLQERLAKLAGGVAVIRVGGATEVEVKERKDRVDDAM
HATRAAVEEGILPGGGVALLRASEHLKGIRTKNDDQKTGVEIVRKALSYPARQIAINAGEDGSVIVGKILEKDQYSYGYD
SQTGEYGNLVSKGIIDPTKVVRVAIQNAASVAALLITTEAMVAEVPKKNTGAGGMPPGGGGMGGMGGMDF
>P0A521 5.6.1.7~~~groEL2~~~Chaperonin GroEL 2~~~
MAKTIAYDEEARRGLERGLNALADAVKVTLGPKGRNVVLEKKWGAPTITNDGVSIAKEIELEDPYEKIGAELVKEVAKKT
DDVAGDGTTTATVLAQALVREGLRNVAAGANPLGLKRGIEKAVEKVTETLLKGAKEVETKEQIAATAAISAGDQSIGDLI
AEAMDKVGNEGVITVEESNTFGLQLELTEGMRFDKGYISGYFVTDPERQEAVLEDPYILLVSSKVSTVKDLLPLLEKVIG
AGKPLLIIAEDVEGEALSTLVVNKIRGTFKSVAVKAPGFGDRRKAMLQDMAILTGGQVISEEVGLTLENADLSLLGKARK
VVVTKDETTIVEGAGDTDAIAGRVAQIRQEIENSDSDYDREKLQERLAKLAGGVAVIKAGAATEVELKERKHRIEDAVRN
AKAAVEEGIVAGGGVTLLQAAPTLDELKLEGDEATGANIVKVALEAPLKQIAFNSGLEPGVVAEKVRNLPAGHGLNAQTG
VYEDLLAAGVADPVKVTRSALQNAASIAGLFLTTEAVVADKPEKEKASVPGGGDMGGMDF
>A0QQU5 5.6.1.7~~~groEL2~~~Chaperonin GroEL 2~~~COG0459
MAKTIAYDEEARRGLERGLNSLADAVKVTLGPKGRNVVLEKKWGAPTITNDGVSIAKEIELEDPYEKIGAELVKEVAKKT
DDVAGDGTTTATVLAQALVREGLRNVAAGANPLGLKRGIEKAVEKVTETLLKSAKEVETKEQIAATAGISAGDQSIGDLI
AEAMDKVGNEGVITVEESNTFGLQLELTEGMRFDKGYISGYFVTDAERQEAVLEDPYILLVSSKVSTVKDLLPLLEKVIQ
SGKPLLIIAEDVEGEALSTLVVNKIRGTFKSVAVKAPGFGDRRKAMLQDMAILTGGQVISEEVGLSLETADVSLLGKARK
VVVTKDETTIVEGAGDAEAIQGRVAQIRAEIENSDSDYDREKLQERLAKLAGGVAVIKAGAATEVELKERKHRIEDAVRN
AKAAVEEGIVAGGGVALLQSAPSLEELSLTGDEATGANIVRVALSAPLKQIALNGGLEPGVVAEKVSNLPAGHGLNAATG
EYEDLLAAGVADPVKVTRSALQNAASIAALFLTTEAVVADKPEKAAAPAGDPTGGMGGMDF
>P9WPE7 5.6.1.7~~~groEL2~~~Chaperonin GroEL 2~~~COG0459
MAKTIAYDEEARRGLERGLNALADAVKVTLGPKGRNVVLEKKWGAPTITNDGVSIAKEIELEDPYEKIGAELVKEVAKKT
DDVAGDGTTTATVLAQALVREGLRNVAAGANPLGLKRGIEKAVEKVTETLLKGAKEVETKEQIAATAAISAGDQSIGDLI
AEAMDKVGNEGVITVEESNTFGLQLELTEGMRFDKGYISGYFVTDPERQEAVLEDPYILLVSSKVSTVKDLLPLLEKVIG
AGKPLLIIAEDVEGEALSTLVVNKIRGTFKSVAVKAPGFGDRRKAMLQDMAILTGGQVISEEVGLTLENADLSLLGKARK
VVVTKDETTIVEGAGDTDAIAGRVAQIRQEIENSDSDYDREKLQERLAKLAGGVAVIKAGAATEVELKERKHRIEDAVRN
AKAAVEEGIVAGGGVTLLQAAPTLDELKLEGDEATGANIVKVALEAPLKQIAFNSGLEPGVVAEKVRNLPAGHGLNAQTG
VYEDLLAAGVADPVKVTRSALQNAASIAGLFLTTEAVVADKPEKEKASVPGGGDMGGMDF
>P35470 5.6.1.7~~~groEL2~~~Chaperonin GroEL 2~~~COG0459
MAAKEVKFTSDARDRMLRGVDIMANAVRVTLGPKGRNVVIDKSFGAPRITKDGVSVAKEIELEDKFENMGAQMLREVASR
TSDIAGDGTTTATVLAQAIVREGAKAVASGMNPMDLKRGIDLAVEAIVKELRNNARKVSKNAEIAQVATISANGDAEIGR
YLAEAMEKVGNEGVITVEEAKTAEIELEVVEGMEFDRGYLSPYFITNQEKMRVELEDAYILLHEKKLSNLQAMIPILESV
IQSGKPLLIIAEDVEGEALATLVVNKLRGGLKIAAVKAPGFGDRRKSMLEDIAILTGGTVISEELGIKLENTTMDTLGRA
KRIMVDKETTTIVDGAGSKEDIGGRVAQIKAQIEDTTSDYDREKLQERLAKLAGGVAVIRVGGSTEVEVKEKKDRVDDAL
HATRAAVEEGILPGGGVALLRVVSALNGLATANDDQRVGIEIVRRAIEAPVRQIAENAGAEGSIIVGKLREKQDFAFGWN
AQTGEFGDLFQMGVIDPAKVVRAALQDAASIAGLLVTTEAMIAEKPKKDGQPQMPPGGGMDF
>Q00768 5.6.1.7~~~groEL2~~~Chaperonin GroEL 2~~~
MAKIIAFDEEARRGLERGMNQLADAVKVTLGPKGRNVVLEKKWGAPTITNDGVSIAKEIELEDPYEKIGAELVKEVAKKT
DDVAGDGTTTATVLAQALVREGLRNVAAGANPMALKRGIEKAVEAVSSALLEQAKDVETKEQIASTASISAADTQIGELI
AEAMDKVGKEGVITVEESQTFGLELELTEGMRFDKGYISAYFATDMERMEASLDDPYILIVNSKIGNVKDLLPLLEKVMQ
SGKPLLIIAEDVEGEALSTLVVNKIRGTFKSVAVKAPGFGDRRKAMLGDIAILTGGTVISEEVGLKLENAGLDLLGRARK
VVITKDETTIVDGAGDTDQVNGRVAQIRAEIENSDSDYDREKLQERLANVAGGVAVIKAGAATEVELKERKHRIEDAVRN
AKAAVEEGIVAGGGVALLQASSVFEKLELEGDEATGAAAVKLALEAPLKQIAVNGGLEGGVVVEKVRNLSVGHGLNAATG
QYVDMIAEGILDPAKVTRSALQNAASIAALFLTTEAVIADKPEKAAAAAPGGMPGGDMDF
>P22034 5.6.1.7~~~groEL2~~~Chaperonin GroEL 2~~~COG0459
MSKLISFKDESRRSLEAGINALADAVRITLGPKGRNVLLEKQYGAPQIVNDGITVAKEIELSNPEENAGAKLIQEVASKT
KEIAGDGTTTATIIAQALVREGLRNVAAGANPVALRRGIEKVTTFLVQEIEAVAKPVEGSAIAQVATVSSGNDPEVGAMI
ADAMDKVTKDGVITVEESKSLNTELEVVEGMQIDRGYISPYFITDSDRQLVEFDNPLILITDKKISAIAELVPVLEAVAR
AGRPLLIIAEDIEGEALATLVVNKARGVLNVAAIKAPAFGDRRKAVLQDIAILTGGSVISEDIGLSLDTVSLDQLGQAVK
ATLEKDNTILVAGADKRASAGVKERIEQLRKEYAASDSDYDKEKIQERIAKLAGGVAVIKVGAATETELKDRKLRIEDAL
NATKAAVEEGIVPGGGTTLIRLAGKIESFKAQLSNDEERVAADIIAKALEAPLHQLASNAGVEGSVIVEKVKEATGNQGY
NVITGKIEDLIAAGIIDPAKVVRSALQNAASIAGMVLTTEALVVEKPEPAAPAMPDMGGMGGMGGMGGMGMM
>P35862 5.6.1.7~~~groEL3~~~Chaperonin GroEL 3~~~COG0459
MSAKEVKFGVNARDRMLRGVDILANAVQVTLGPKGRNVVLDKSFGAPRITKDGVAVAKEIELDDKFENMGAQMVREVASK
AADAAGDGTTTATVLAAAIVREGAKSVAAGMNPMDLKRGIDLAVEAVVADLQKNSKKVTSNDEIAQVGAISANGDQEIGK
FLADAVKKVGNEGVITVEEAKSLETELDVVEGMQFDRGYISPYFVTNADKMRVEMDDAYILINEKKLSSLNELLPLLEAV
VQTGKPLVIVAEDVEGEALATLVVNRLRGGLKVAAVKAPGFGDRRKAMLQDIAILTGGQAISEDLGIKLENVTLNMLGRA
KKVMIDKENTTIVNGAGKKADIEARVAQIKAQIEETTSDYDREKLQERLAKLAGGVAVIRVGGATEVEVKERKDRVDDAM
HATRAAVEEGIVPGGGVALLRASEQLKGLRTENDDQKTGVEIVRKALSWPARQIAINAGEDGSIVVGKVLDNEQYSFGFD
AQTGEYSNLVSKGIIDPAKVVRIAVQNASSVAGLLITTEAMVAELPKKATAGPAMPAAPGMGGMDF
>P35471 5.6.1.7~~~groEL5~~~Chaperonin GroEL 5~~~COG0459
MAAKEVKFQTDARERMLRGVDVLANAVKVTLGPKGRNVVIDKSFGAPRITKDGVSVAKEIELEDKFENMGAQMLREVASR
TNDLAGDGTTTATVLAQAIVREGAKAVASGMNPMDLKRGIDLAVDAVVKELKNNARKISKNSEIAQVGTISANGDTEIGR
YLAEAMEKVGNEGVITVEEAKTAETELEVVEGMQFDRGYLSPYFITNQDKMRVELEDPYILIHEKKLSNLQAMLPVLEAV
VQSGKPLLIIAEDVEGEALATLVVNKLRGGLKVAAVKAPGFGDRRKAMLEDIAILTGGTVVSEDLGIKLESVTLDMLGRA
KKVSIEKENTTIIDGAGSKADIEGRTAQIRAQIEETTSDYDREKLQERLAKLAGGVAVIRVGGSTEVEVKEKKDRVDDAL
HATRAAVEEGILPGGGVALLRAVKALDGLKTANNDQRVGVDLVRRAIEAPVRQIAENAGAEGSIIVGKLREKTEFSYGWN
AQTNEYGDLYAMGVIDPAKVVRTALQDAASVAGLLVTTEAMIAEKPKKEAAPALPAGGGMDF
>P48212 5.6.1.7~~~groEL~~~Chaperonin GroEL~~~COG0459
MAKQIKFGEEARRALERGVNQLADTVKVTLGPKGRNVVLDKKFGSPMITNDGVTIAKEIELEDPFENMGAQLVKEVATKT
NDVAGDGTTTATLLAQAIIREGLKNVAAGANPMLLKKGIAKAVDAAVEGIKEISQKVKGKEDIARVASISANDEVIGELI
ADAMEKVTNDGVITVEEAKTMGTNLEIVEGMQFDRGYVSPYMVTDTEKMEAVLDEPYILITDKKISNIQDILPLLEQIVQ
QGKKLVIIAEDVEGEALATLLVNKLRGTFTCVAVKAPGFGDRRKAMLEDIAILTGGQVITSDLGLELKDTTVEQLGRARQ
VKVQKENTIIVDGAGDPKEIQKRIASIKSQIEETTSDFDREKLQERLAKLAGGVAVIQVGAATETEMKEKKLRIEDALAA
TKAAVEEGIVAGGGTALVNVIPKVAKVLDTVSGDEKTGVQIILRALEEPVRQIAENAGLEGSVIVEKVKASEPGIGFDAY
NEKYVNMIEAGIVDPAKVTRSALQNAASVASMVLTTESVVADIPEKETSGGPGGAGMGGMY
>P46398 5.6.1.7~~~groEL~~~Chaperonin GroEL~~~COG0459
MAAKDVKFGNDARVKMLNGVNILADAVKVTLGPKGRNVVLDKSFGAPTITKDGVSVAREIELEDKFENMGAQMVKEVASK
ANDAAGDGTTTATVLAQAIVNEGLKAVAAGMNPMDLKRGIDKAVNSVVAELKNLSKPCETSKEIEQVGTISANSDSIVGQ
LIAQAMEKVGKEGVITVEDGTGLEDELDVVEGMQFDRGYLSPYFINKPETATVELDNPFILLVDKKISNIRELLPVLEGV
AKAGKPLLIIAEDVEGEALATLVVNTMRGIVKVAAVKAPGFGDRRKAMLQDIAILTAGTVISEEIGMELEKATLEDLGQA
KRIVINKDNTTIIDGIGDEAQIQGRVAQIRQQIEESTSDYDKEKLQERVAKLAGGVAVIKVGAATEVEMKEKKARVEDAL
HATRAAVEEGIVAGGGVALIRAAGRVVGLQGENEEQNVGIKLALRAMEAPLRQIVANAGEEASVIASAVKNGEGNFGYNA
GTEQYGDMIAMGILDPTKVTRSALQFAASVAGLMITTECMVTELPKDDKADLGAAGMGGMGGMGGMM
>P31293 5.6.1.7~~~groEL~~~Chaperonin GroEL~~~
MSAKDVKFGGDARVRMMEGVNILANAVKVTLGPKGRNVVLEKSFGAPTVTKDGVSVAKEIELKDKFENMGAQMVKEVASK
TSDIAGDGTTTATVLAQAMVREGLKAVAAGMNPMDLKRGMDKAVEAATEELKKLSKPCPRPMAIAQVGTISANSDDSIGT
IIAEAMEKVGKEGVITVEDGTSLQNELDVVEGMQFDRGYLSPYFINNQQSQSAELDAPYILLYDKKISNIRDLLPVLEGV
AKAGKPLLIIAEDVEGEALATLVVNTIRGIVKVCAVKAPGFGDRRKAMLQDIAILTGATVISEEVGLSLEKATLTDLGTA
KRVQVGKDETTIIDGSGSEIDIKARCEQIRAQVEETSSDYDREKLQERLAKLAGGVAVIKVGAATEIEMKEKKARVEDAL
HATRAAVEEGIVPGGGVALVRAIAAVKDLKGANHDQDVGIAIARRAMEEPLRQIVANAGEEPSVILHKVAEGTGNFGYNA
ANGEYGDMVEMGILDPTKVTRSALQNSCSVAGLMITTEAMIADEPKDDAPAMPGGGMGDMGGMGMM
>Q4MPR6 5.6.1.7~~~groEL~~~Chaperonin GroEL~~~COG0459
MAKDIKFSEEARRSMLRGVDTLANAVKVTLGPKGRNVVLEKKFGSPLITNDGVTIAKEIELEDAFENMGAKLVAEVASKT
NDVAGDGTTTATVLAQAMIREGLKNVTAGANPMGLRKGIEKAVTAAIEELKTISKPIEGKSSIAQVAAISAADEEVGQLI
AEAMERVGNDGVITLEESKGFTTELDVVEGMQFDRGYASPYMITDSDKMEAVLDNPYILITDKKISNIQEILPVLEQVVQ
QGKPLLIIAEDVEGEALATLVVNKLRGTFNVVAVKAPGFGDRRKAMLEDIAILTGGEVITEELGRDLKSATVESLGRAGK
VVVTKENTTVVEGIGNTQQIEARIGQIRAQLEETTSEFDREKLQERLAKLAGGVAVIKVGAATETELKERKLRIEDALNS
TRAAVEEGIVAGGGTSLMNVYTKVASIVAEGDEATGINIVLRALEEPVRQIAINAGLEGSVVVERLKGEKVGVGFNAATG
EWVNMLETGIVDPAKVTRSALQNAASVAAMFLTTEAVVADKPEPNAPAMPDMGGMGMGGMGGMM
>P28598 5.6.1.7~~~groEL~~~Chaperonin GroEL~~~COG0459
MAKEIKFSEEARRAMLRGVDALADAVKVTLGPKGRNVVLEKKFGSPLITNDGVTIAKEIELEDAFENMGAKLVAEVASKT
NDVAGDGTTTATVLAQAMIREGLKNVTAGANPVGVRKGMEQAVAVAIENLKEISKPIEGKESIAQVAAISAADEEVGSLI
AEAMERVGNDGVITIEESKGFTTELEVVEGMQFDRGYASPYMVTDSDKMEAVLDNPYILITDKKITNIQEILPVLEQVVQ
QGKPLLLIAEDVEGEALATLVVNKLRGTFNAVAVKAPGFGDRRKAMLEDIAVLTGGEVITEDLGLDLKSTQIAQLGRASK
VVVTKENTTIVEGAGETDKISARVTQIRAQVEETTSEFDREKLQERLAKLAGGVAVIKVGAATETELKERKLRIEDALNS
TRAAVEEGIVSGGGTALVNVYNKVAAVEAEGDAQTGINIVLRALEEPIRQIAHNAGLEGSVIVERLKNEEIGVGFNAATG
EWVNMIEKGIVDPTKVTRSALQNAASVAAMFLTTEAVVADKPEENGGGAGMPDMGGMGGMGGMM
>P35635 5.6.1.7~~~groEL~~~Chaperonin GroEL~~~
MAAKEVKFGRDARERLLRGVDILADAVKVTLGPKGRNVVIDKSFGAPRITKDGVSVAKEIELENKFENMGAQMLREVASK
TNDIAGDGTTTATVLGQAIVQEGVKAVAASMNPMDLKRGIDAAVEAVVADLFKKAKKIQTSEEIAQVATISANGAEDIGK
MIADAMEKVGNEGVITVEEAKTAETELEVVEGMQFDRGYLSPYFVTNSEKMMVDLDDPYILIHEKKLSNLQSLLPVLEAV
AQSGKPLLIIAEDVEGEALATLVVNKLRGGLKIAAVKAPGFGDRRKAMLEDIAVLTSGQVISEDVGIKLENVTLEMLGRA
KKVHVSKETTTIVDGAGQKSEINARVSQIKAQIEETTSDYDREKLQERLAKLAGGVAVIRVGGSTEVEVKEKKDRVDDAL
NATRAAVEEGIVPGGGTPLLRAAKALSIKGKNPDQEAGIGIIRRALQAPARQIAHNAGEEAAVIVGKVLENCSDTFGYNT
ATAQFRDLISFGIVDPVKVVRSALQNAASIASLLITTEAMVAEVPKKEAAAPAMPGGGMGGMDF
>B2SCZ4 5.6.1.7~~~groEL~~~Chaperonin GroEL~~~
MAAKDVKFGRTAREKMLRGVDILADAVKVTLGPKGRNVVIEKSFGAPRITKDGVSVAKEVELEDKFENMGAQMLREVASK
TNDTAGDGTTTATVLGQAIVQEGAKAVAAGMNPMDLKRGIDLAVNEVVAELLKKAKKINTSEEVAQVGTISANGEAEIGK
MIAEAMQKVGNEGVITVEEAKTAETELEVVEGMQFDRGYLSPYFVTNPEKMVADLEDAYILLHEKKLSNLQALLPVLEAV
VQTSKPLLIIAEDVEGEALATLVVNKLRGGLKIAAVKAPGFGDRRKAMLEDIAILTGGQVISEDLGIKLESVTLDMLGRA
KKVSISKENTTIVDGAGQKAEIDARVGQIKQQIEETTSDYDREKLQERLAKLAGGVAVIRVGGATEVEVKEKKDRVDDAL
NATRAAVEEGIVAGGGTALLRASTKITAKGVNADQEAGINIVRRAIQAPARQITTNAGEEASVIVGKILENTSETFGYNT
ANGEYGDLISLGIVDPVKVVRTALQNAASVAGLLITTEAMIAELPKKDAAPAGMPGGMGGMGGMDF
>B8H163 5.6.1.7~~~groEL~~~Chaperonin GroEL~~~
MAAKDVYFSSDARDKMLRGVNILANAVKVTLGPKGRNVVIEKSFGAPRTTKDGVSVAKEIELADKFENLGAQMIREVASK
TNDKAGDGTTTATVLAQAIVQEGLKSVAAGMNPMDLKRGIDKAVAIAIEDIKTSSKKVTTNAEIAQVGTISANGDKEVGE
MIAKAMDKVGNEGVITVEEAKTAETELDVVEGMQFDRGYLSPYFITNADKMEVQLEEPLILLFEKKLSSLQPLLPVLEAV
VQSGRPLLIIAEDVEGEALATLVVNKLRGGLRVAAVKAPGFGDRRKAMLEDIAILTGAQVVSEDIGIKLENVSLEMLGRA
KKVSITKDDTTIVDGVGEKADIEARIAQIKRQIEDTTSDYDKEKLQERLAKLAGGVAVIRVGGSTEVEVKEKKDRVDDAL
NATRAAADEGIVPGGGTALLKASKALAGVVGDNDDQTAGIAIVRRALQAPIRQIAENAGVEGSIVVGKILENDNSAFGFN
AQTEQYVDLVVDGVIDPAKVVRTALQNAASVAGLLITTEAAIVEAPKKGGGAPAGGGMPGGMGDMDF
>Q59322 5.6.1.7~~~groEL~~~Chaperonin GroEL~~~COG0459
MVAKNIKYNEEARKKIQKGVKTLAEAVKVTLGPKGRHVVIDKSFGSPQVTKDGVTVAKEVELADKHENMGAQMVKEVASK
TADKAGDGTTTATVLAEAIYTEGLRNVTAGANPMDLKRGIDKAVKVVVDQIKKISKPVQHHKEIAQVATISANNDAEIGN
LIAEAMEKVGKNGSITVEEAKGFETVLDVVEGMNFNRGYLSSYFATNPETQECVLEDALVLIYDKKISGIKDFLPVLQQV
AESGRPLLIIAEDIEGEALATLVVNRIRGGFRVCAVKAPGFGDRRKAMLEDIAILTGGQLISEELGMKLENASLAMLGKA
KKVIVSKEDTTIVEGMGEKEALDARCESIKKQIEDSTSDYDKEKLQERLAKLSGGVAVIRVGAATEIEMKEKKDRVDDAQ
HATIAAVEEGILPGGGTALIRCIPTLEAFLPMLTNEDERIGARIVLKALSAPLKQIAANAGKEGAIIFQQVMSRSANEGY
DALRDAYTDMIEAGILDPAKVTRSALESAASVAGLLLTTEALIAEIPEEKPAAAPAMPGAGMDY
>Q3KMQ9 5.6.1.7~~~groEL~~~Chaperonin GroEL~~~
MVAKNIKYNEEARKKIQKGVKTLAEAVKVTLGPKGRHVVIDKSFGSPQVTKDGVTVAKEVELADKHENMGAQMVKEVASK
TADKAGDGTTTATVLAEAIYTEGLRNVTAGANPMDLKRGIDKAVKVVVDQIKKISKPVQHHKEIAQVATISANNDAEIGN
LIAEAMEKVGKNGSITVEEAKGFETVLDVVEGMNFNRGYLSSYFATNPETQECVLEDALVLIYDKKISGIKDFLPILQQV
AESGRPLLIIAEDIEGEALATLVVNRIRGGFRVCAVKAPGFGDRRKAMLEDIAILTGGQLISEELGMKLENANLAMLGKA
KKVIVSKEDTTIVEGMGEKEALEARCESIKKQIEDSSSDYDKEKLQERLAKLSGGVAVIRVGAATEIEMKEKKDRVDDAQ
HATIAAVEEGILPGGGTALIRCIPTLEAFLPMLTNEDEQIGARIVLKALSAPLKQIAANAGKEGAIIFQQVMSRSANEGY
DALRDAYTDMLEAGILDPAKVTRSALESAASVAGLLLTTEALIAEIPEEKPAAAPAMPGAGMDY
>Q8KF02 5.6.1.7~~~groEL~~~Chaperonin GroEL~~~COG0459
MTAKDILFDAEARTKLKVGVDKLANAVKVTLGPAGRNVLIDKKFGAPTSTKDGVTVAKEIELVDPVENMGAQMVREVASK
TSDVAGDGTTTATVLAQAIYREGLKNVTAGARPIDLKRGIDRAVKEVVAELRNISRSISGKKEIAQVGTISANNDPEIGE
LIAEAMDKVGKDGVITVEEAKGMETELKVVEGMQFDRGYLSPYFVTNSETMEAELDEALILIHDKKISNMKELLPILEKA
AQSGRPLLIIAEDIEGEALATLVVNKLRGTLKVAAVKAPGFGDRRKAMLEDIAILTGGTVISEEKGYKLENATMAYLGQA
ARITIDKDNTTIVEGKGKQEEIKARINEIKGQIEKSTSDYDTEKLQERLAKLSGGVAVLKIGASTEVEMKEKKARVEDAL
HATRAAVQEGIVVGGGVALIRAAKGLAKAVADNEDQKTGIEIIRRALEEPLRQIVANTGTTDGAVVLEKVKNAEGDYGFN
ARTEQYENLIEAGVVDPTKVTRSALENAASVASILLTTEAAITDVKEDKADMPAMPPGGMGGGMY
>P0C0Z7 5.6.1.7~~~groEL~~~Chaperonin GroEL~~~
MVAKNIKYNEEARKKIQKGVKTLAEAVKVTLGPKGRHVVIDKSFGSPQVTKDGVTVAKEVELADKHENMGAQMVKEVASK
TADKAGDGTTTATVLAEAIYTEGLRNVTAGANPMDLKRGIDKAVKVVVDQIRKISKPVQHHKEIAQVATISANNDAEIGN
LIAEAMEKVGKNGSITVEEAKGFETVLDIVEGMNFNRGYLSSYFATNPETQECVLEDALVLIYDKKISGIKDFLPVLQQV
AESGRPLLIIAEDIEGEALATLVVNRIRGGFRVCAVKAPGFGDRRKAMLEDIAILTGGQLISEELGMKLENANLAMLGKA
KKVIVSKEDTTIVEGMGEKEALEARCESIKKQIEDSSSDYDKEKLQERLAKLSGGVAVIRVGAATEIEMKEKKDRVDDAQ
HATIAAVEEGILPGGGTALIRCIPTLEAFLPMLTNEDEQIGARIVLKALSAPLKQIAANAGKEGAIIFQQVMSRSANEGY
DALRDAYTDMLEAGILDPAKVTRSALESAASVAGLLLTTEALIAEIPEEKPAAAPAMPGAGMDY
>P19421 5.6.1.7~~~groEL~~~Chaperonin GroEL~~~COG0459
MAAKVLKFSHEVLHAMSRGVEVLANAVKVTLGPKGRNVVLDKSFGAPTITKDGVSVAKEIELEDKFENMGAQMVKEVASR
TSDDAGDGTTTATVLAQAILVEGIKAVIAGMNPMDLKRGIDKAVTAAVAELKKISKPCKDQKAIAQVGTISANSDKSIGD
IIAEAMEKVGKEGVITVEDGSGLENALEVVEGMQFDRGYLSPYFINNQQNMSAELENPFILLVDKKISNIRELIPLLENV
AKSGRPLLVIAEDIEGEALATLVVNNIRGVVKVAAVKAPGFGDRRKAMLQDIAVLTGGKVISEEVGLSLEAASLDDLGSA
KRVVVTKDDTTIIDGSGDAGDIKNRVEQIRKEIENSSSDYDKEKLQERLAKLAGGVAVIKVGAATEVEMKEKKARVEDAL
HATRAAVEEGVVPGGGVALIRVLKSLDSVEVENEDQRVGVEIARRAMAYPLSQIVKNTGVQAAVVADKVLNHKDVNYGYN
AATGEYGDMIEMGILDPTKVTRTALQNAASIAGLMITTECMVTEAPKKKEESMPGGGDMGGMGGMGGMGGMM
>Q9RWQ9 5.6.1.7~~~groEL~~~Chaperonin GroEL~~~COG0459
MAKQLVFDESARRSLERGVNAVANAVKVTLGPRGRNVVIEKKFGSPTITKDGVTVAKEVELEDKLENIGAQLLKEVASKT
NDITGDGTTTATVLGQAIVKEGLRNVAAGANPLALKRGIDKAVAVAIEEIKKLAVSVEDSEAIKKVAGISANDETVGQEI
ASAMDKVGKEGVITIEESKGFDTEVDVVEGMQFDKGFINPYFITNPEKMEAVLEDAYILINEKKISNLKDMLPVLEKVAQ
TGRPLLIIAEDVEGEALATLVVNKLRGTLNIAAVKAPGFGDRRKEMLRDIAAVTGGEVVSEDLGHKLENVGMEMLGRAAR
IRITKDETTIVDGKGEQAQIDARVNAIKGELDSTDSDYAREKLQERLAKLSGGVAVIRVGAATETELKEKKHRYEDALST
ARSAVEEGIVAGGGTTLLRVIPAVRKAAESLTGDEATGARILIRALEEPARQIAANAGEEGSVIVNAVVGSDKARYGFNA
ATGEYVEDMVAAGIVDPAKVTRTALQNAASIGALILTTEAIVSDKPEKAAPAMPQGGGDMGGMGGMDF
>C5A1D5 5.6.1.7~~~groEL~~~Chaperonin GroEL~~~
MAAKDVKFGNDARVKMLRGVNVLADAVKVTLGPKGRNVVLDKSFGAPTITKDGVSVAREIELEDKFENMGAQMVKEVASK
ANDAAGDGTTTATVLAQAIITEGLKAVAAGMNPMDLKRGIDKAVTAAVEELKALSVPCSDSKAIAQVGTISANSDETVGK
LIAEAMDKVGKEGVITVEDGTGLQDELDVVEGMQFDRGYLSPYFINKPETGAVELESPFILLADKKISNIREMLPVLEAV
AKAGKPLLIIAEDVEGEALATLVVNTMRGIVKVAAVKAPGFGDRRKAMLQDIATLTGGTVISEEIGMELEKATLEDLGQA
KRVVINKDTTTIIDGVGEEAAIQGRVAQIRQQIEEATSDYDREKLQERVAKLAGGVAVIKVGAATEVEMKEKKARVEDAL
HATRAAVEEGVVAGGGVALIRVASKLADLRGQNEDQNVGIKVALRAMEAPLRQIVLNCGEEPSVVANTVKGGDGNYGYNA
ATEEYGNMIDMGILDPTKVTRSALQYAASVAGLMITTECMVTDLPKNDAADLGAAGGMGGMGGMGGMM
>P0A6F5 5.6.1.7~~~groEL~~~Chaperonin GroEL~~~COG0459
MAAKDVKFGNDARVKMLRGVNVLADAVKVTLGPKGRNVVLDKSFGAPTITKDGVSVAREIELEDKFENMGAQMVKEVASK
ANDAAGDGTTTATVLAQAIITEGLKAVAAGMNPMDLKRGIDKAVTAAVEELKALSVPCSDSKAIAQVGTISANSDETVGK
LIAEAMDKVGKEGVITVEDGTGLQDELDVVEGMQFDRGYLSPYFINKPETGAVELESPFILLADKKISNIREMLPVLEAV
AKAGKPLLIIAEDVEGEALATLVVNTMRGIVKVAAVKAPGFGDRRKAMLQDIATLTGGTVISEEIGMELEKATLEDLGQA
KRVVINKDTTTIIDGVGEEAAIQGRVAQIRQQIEEATSDYDREKLQERVAKLAGGVAVIKVGAATEVEMKEKKARVEDAL
HATRAAVEEGVVAGGGVALIRVASKLADLRGQNEDQNVGIKVALRAMEAPLRQIVLNCGEEPSVVANTVKGGDGNYGYNA
ATEEYGNMIDMGILDPTKVTRSALQYAASVAGLMITTECMVTDLPKNDAADLGAAGGMGGMGGMGGMM
>Q1R3B6 5.6.1.7~~~groEL~~~Chaperonin GroEL~~~
MAAKDVKFGNDARVKMLRGVNVLADAVKVTLGPKGRNVVLDKSFGAPTITKDGVSVAREIELEDKFENMGAQMVKEVASK
ANDAAGDGTTTATVLAQAIITEGLKAVAAGMNPMDLKRGIDKAVTAAVEELKALSVPCSDSKAIAQVGTISANSDETVGK
LIAEAMDKVGKEGVITVEDGTGLQDELDVVEGMQFDRGYLSPYFINKPETGAVELESPFILLADKKISNIREMLPVLEAV
AKAGKPLLIIAEDVEGEALATLVVNTMRGIVKVAAVKAPGFGDRRKAMLQDIATLTGGTVISEEIGMELEKATLEDLGQA
KRVVINKDTTTIIDGVGEEAAIQGRVAQIRQQIEEATSDYDREKLQERVAKLAGGVAVIKVGAATEVEMKEKKARVEDAL
HATRAAVEEGVVAGGGVALIRVASKLADLRGQNEDQNVGIKVALRAMEAPLRQIVLNCGEEPSVVANTVKGGDGNYGYNA
ATEEYGNMIDMGILDPTKVTRSALQYAASVAGLMITTECMVTDLPKNDAADLGAAGGMGGMGGMGGMM
>Q5NEE1 5.6.1.7~~~groEL~~~Chaperonin GroEL~~~COG0459
MAAKQVLFSDEARAKMLDGVNTLANAVKVTLGPKGRNVVLDKSFGTPTITKDGVSVAKEIELEDKFENMGAQIVKEVASK
TADVAGDGTTTATVLAQALLTEGLKAVAAGMNPMDLKRGIDKATARLVEELKALSKPCSDPKSIEQVGTISANSDATVGK
LIADAMAKVGKEGVITVEEGKGFEDELDVVEGMQFDRGYLSPYFATNQENMTTDLENPYILIVDKKISNIRDLLPILEGV
SKSGRALLIIAEDVESEALATLVVNNMRGVVKVCAVKAPGFGDRRKAMLEDIATLTGATFVSEDLSMKLEETNMEHLGTA
SRVQVTKDNTTIIDGAGEKEAIAKRINVIKANIAEANSDYDREKLQERLAKLSGGVAVIKVGAVTEAEMKEKKDRVDDAL
HATRAAVEEGIVAGGGVALIRAQKALDGLTGENDDQNYGIALLRKAIEAPLRQIVSNAGGESSVVVNQVKANQGNYGYNA
ANDTYGDMVEMGILDPTKVTRSALQHAASIAGLMITTEAMIGEIKEAAPAMPMGGGMGGMPGMM
>Q07201 5.6.1.7~~~groEL~~~Chaperonin GroEL~~~
MAKEIKFSEEARRAMLRGVDKLADAVKVTLGPKGRNVVLEKKFGSPLITNDGVTIAKEIELEDPFENMGAKLVAEVASKT
NDVAGDGTTTATVLAQAMIREGLKNVTAGANPMGIRKGIEKAVAVAVEELKAISKPIQGKESIAQVAAISAADEEVGQLI
AEAMERVGNDGVITLEESKGFATELDVVEGMQFDRGYVSPYMITDTEKMEAVLENPYILITDKKVSSIQEILPILEQVVQ
QGRPLLIIAEDVEGEALATLVVNKLRGTFNAVAVKAPGFGDRRKAMLEDIAILTGGEVISEELGRELKSATIASLGRASK
VVVTKENTTIVEGAGDSKRIKARINQIRAQLEETTSEFDREKLQERLAKLAGGVAVIKVGAATETELKERKLRIEDALNS
TRAAVEEGIVAGGGTALMNVYSKVAAIEAEGDEATGVKIVLRAIEEPVRQIAQTAGLEGSIIVERLKTEKPGIGFNAATG
EWVDMIEAGIVDPTKVTRSALQNAASVAAMFLTTEAVVADKPEENKGGNPGMPDMGGMM
>O50305 5.6.1.7~~~groEL~~~Chaperonin GroEL~~~COG0459
MAKDIKFSEDARRSMLRGVDKLADAVKVTLGPKGRNVVLEKKFGSPLITNDGVTIAKEIELEDAFENMGAKLVAEVASKT
NDIAGDGTTTATVLAQAMIREGLKNVTSGANPMVIRKGIEKATQVAVEELSKISKPIEGKDSIAQVAAISSADDEVGKII
AEAMERVGNDGVITIEESKGFSTELEVVEGMQFDRGYASPYMVTDSDKMEAVLDNPYVLITDKKISNIQEVLPVLEQVVQ
QGKPILIIAEDVEGEALATLVVNKLRGTFNAVAVKAPGFGDRRKAMLEDIAILTGGEVITEDLGLDLKSANITQLGRASK
VVVTKENTTIVEGAGESDKIAARVNQIKAQIEETTSDFDKEKLQERLAKLAGGVAVLKVGAATETEMKERKLRIEDALNS
TRAAVEEGIVAGGGTALVNVIKAVSSIGAEGDEATGVNIVLRALEEPVRQIAHNAGLEGSVIVERLKKEEAGFGFNAATG
EWVNMVEAGIVDPTKVTRSALQHAASVSAMFLTTEAVIADKPEENEGGGGMPDMGGMGGMGGMM
>P61438 5.6.1.7~~~groEL~~~Chaperonin GroEL~~~
MAKDIEYNETARRKLLEGVNKLANAVKVTLGPKGRNVVIDKKFGAPTITKDGVTVAKEIELEDPLENMGAQMVKEVSTKT
NDVAGDGTTTATILAQSIINEGLKNVTAGANPMSLKKGIDKAVTAAVESIQKRAVKIENKKDIANVASISANNDNTIGNL
IADAMDKVGKDGVITVEEAKSIETTLDVVEGMQFDRGYISPYMVTDAESMVATLNDPFILIYDKKISSMKDLIHILEKVA
QAGKPLVIISEEVEGEALATIVVNTLRKTISCVAVKAPGFGDRRKSMLEDIAILTGGQVISEDLGMKLENTTLQMLGRAN
KVTVDKENTTIIEGKGQTKEIQGRIGQIKKQIEDTTSEYDREKLQERLAKLAGGVAVIHVGAATEVEMKEKKARVEDALS
ATRAAVEEGIVPGGGLTLLKAQEAVGSLKLDGDEATGAKIIFRALEEPIRMITSNAGLEGSVIVEHAKAKKGNEGFNALT
MVWEDMIQAGVVDPAKVVRSALQNAASIGSMILTTEVTITDKPDKDAPNPMAGMGGGGMGGMGGMM
>P78012 5.6.1.7~~~groEL~~~Chaperonin GroEL~~~
MAKELVFGKNARNKLLAGINKLADAVKVTVGPKGQNVILGRKFSNPLITNDGVTIAKEIELTDPLENIGAKVISVAAVST
NDIAGDGTTTATILAQEMTNRGVEAVNNGANPVNVRRGIEDASQLIITELDKRSKKINTNEEIEQVAAISSGSKEIGKLI
AQAMALVGKNGVITTDDAKTINTTLETTEGIEFKGTYASPYMVSDQEKMEVVLDQPKILVSAMKINTIKEILPLLEGSME
NGNPLLIVAPDFAEEVVTTLAVNKLRGTINVVAVKCNEYGERQKAALEDLAISTGTLAYNNELGGGFKDVTVNHLGEARR
VQVAKEKTTVIGGKGSKETIQKHLDLLNGRLKQTTEKYDTDLLKERIAHLSQGVAVVRVGGATELAQKELKLRIEDALNS
TKAAVEEGIISGGGIALLNVSTILNDSKLADKYKAETSAENLKEILVGYEIVRKSLEAPVRQIIENSGVNPVKVFAELRS
EADGVGFDAETKKKVDMIRSGIIDPTKVTKTALEKAASVASSLITTSVAVYDIKENKEGSFQE
>P29842 5.6.1.7~~~groEL~~~Chaperonin GroEL~~~
MAAKDVQFGNEVRQKMVNGVNILANAVRVTLGPKGRNVVVDRAFGGPHITKDGVTVAKEIELKDKFENMGAQMVKEVASK
TNDVAGDGTTTATVLAQSIVAEGIKAVTAGMNPTDLKRGIDKAVAALVEELKNIAKPCDTSKEIAQVGSISANSDEQVGA
IIAEAMEKVGKEGVITVEDGKSLENELDVVEGMQFDRGYLSPYFINDAEKQIAGLDNPFVLLFDKKISNIRDLLPVLEQV
AKASRPLLIIAEDVEGEALATLVVNNIRGVLKTVAVKAPGFGDRRKAMLQDIAILTGAVVISEEVGLSLEKATLDDLGQA
KRIEIGKENTTVIDGFGDAAQIEARVAEIRQQIETATSDYDKEKLQERVAKLAGGVAVIKVGAATEVEMKEKKDRVEDAL
HATRAAVEEGVVAGGGVALLRARAALENLHTGNADQDAGVQIVLRAVESPLRQIVANAGGEPSVVVNKVLEGKGNYGYNA
GSGEYGDMIGMGVLDPAKVTRSALQHAASIAGLMLTTDCMIAEIPEEKPAVPDMGGMGGMGGMM
>P42385 5.6.1.7~~~groEL~~~Chaperonin GroEL~~~
MAAKDVQFGNEVRQKMVNGVNILANAVRVTLGPKGRNVVVDRAFGGPHITKDGVTVAKEIELKDKFENMGAQMVKEVASK
TNDVAGDGTTTATVLAQSIVAEGMKYVTAGMNPTDLKRGIDKAVAALVDELKNIAKPCDTSKEIAQVGSISANSDEQVGA
IIAEAMEKVGKEGVITVEDGKSLENELDVVEGMQFDRGYLSPYFINDAEKQIAALDNPFVLLFDKKISNIRDLLPVLEQV
AKASRPLLIIAEDVEGEALATLVVNNIRGILKTVAVKAPGFGDRRKAMLQDIAILTGGVVISEEVGLSLEKATLDDLGQA
KRIEIGKENTTIIDGFGDAAQIEARVAEIRQQIETATSDYDKEKLQERVAKLAGGVAVIKVGAATEVEMKEKKDRVEDAL
HATRAAVEEGVVAGGGVALLRARAALENLHTGNADQDAGVQIVLRAVESPLRQIVANAGGEPSVVVNKVLEGKGNYGYNA
GSGEYGDMIEMGVLDPAKVTRSALQHAASIAGLMLTTDCMIAEIPEDKPAVPDMGGMGGMGGMM
>Q9Z462 5.6.1.7~~~groEL~~~Chaperonin GroEL~~~
MAAKEVKFNSDARDRMLKGVNILADAVKVTLGPKGRNVVIDKSFGAPRITKDGVSVAKEIELSDKFENMGAQMVREVASR
TNDEAGDGTTTATVLAQAIVREGLKAVAAGMNPMDLKRGIDVATAKVVEAIKSAARPVNDSSEVAQVGTISANGESFIGQ
QIAEAMQRVGNEGVITVEENKGMETEVEVVEGMQFDRGYLSPYFVTNADKMIAELEDAYILLHEKKLSSLQPMVPLLESV
IQSQKPLLIVAEDVEGEALATLVVNKLRGGLKIAAVKAPGFGDRRKAMLQDIAILTGGQVISEDLGMKLENVTIDMLGRA
KKVSINKDNTTIVDGAGEKAEIEARVSQIRQQIEETTSDYDREKLQERVAKLAGGVAVIRVGGMTEIEVKERKDRVDDAL
NATRAAVQEGIVVGGGVALVQGAKVLEGLSGANSDQDAGIAIIRRALEAPMRQIAENAGVDGAVVAGKVRESSDKAFGFN
AQTEEYGDMFKFGVIDPAKVVRTALEDAASVAGLLITTEAMIAEKPEPKAPAGGMPDMGGMGGMM
>P30718 5.6.1.7~~~groEL~~~Chaperonin GroEL~~~
MAAKEVKFGDSARKKMLVGVNVLADAVKATLGPKGRNVVLDKSFGAPTITKDGVSVAKEIELKDKFENMGAQLVKDVASK
ANDAAGDGTTTATVLAQAIVNEGLKAVAAGMNPMDLKRGIDKATVAIVAQLKELAKPCADTKAIAQVGTISANSDESIGQ
IIAEAMEKVGKEGVITVEEGSGLENELSVVEGMQFDRGYLSPYFVNKPDTMAAELDSPLLLLVDKKISNIREMLPVLEAV
AKAGRPLLIVAEDVEGEALATLVVNNMRGIVKVAAVKAPGFGDRRKAMLQDIAILTGGTVISEEVGLSLEGATLEHLGNA
KRVVINKENTTIIDGAGVQADIEARVLQIRKQIEETTSDYDREKLQERLAKLAGGVAVIKVGAATEVEMKEKKARVEDAL
HATRAAVEEGVVPGGGVALVRALQAIEGLKGDNEEQNVGIALLRRAVESPLRQIVANAGDEPSVVVDKVKQGSGNYGFNA
ATGVYGDMIEMGILDPAKVTRSALQAAASIGGLMITTEAMVAEIVEDKPAMGGMPDMGGMGGMGGMM
>P48216 5.6.1.7~~~groEL~~~Chaperonin GroEL~~~COG0459
MAAKDVKFGDSARKKMLVGVNVLADAVKATLGPKGRNVVLAKSFGAPTITKDGVSVAKEIELKDAFENMGAQLVKEVASK
ANDAAGDGTTTATVLAQSIVNEGLKAVAAGMNPMDLKRGIDKATAAVVAELKNLSKPCADSKAIAQVGTISANSDSSIGE
IIAEAMEKVGKEGVITVEEGSGLENELSVVEGMQFDRGYLSPYFVNKPDTMVAELEGPLLLLVDKKISTSRAAASTERAS
AGRPLLIVAEDVEGEALATLVVNNMRGIVKVAAVKAPGFGDRRKAMLQDIAVLTGGQVISEEIGVSLETATLEHLGNAKR
VILSKENTTIIDGAGADTEIEARVKQIRAQIEETSSDYDREKLQERLAKLAGGVAVIKVGAGTEVEMKEKKARVEDALHA
TRAAVEEGVVPGGGVALVRALNAIVDLKGDNEDQNVGIALLRRAVESPLRQITANAGDEPSVVANKVKQGSGNFGYNAAT
GEYGDMIEMGILDPAKVTRSALQAAASIGGLMITTEAMIADAPSDAPAGGGMPDMGGMGGMGGMM
>P34939 5.6.1.7~~~groEL~~~Chaperonin GroEL~~~
MASKEIKFGRTGREKMLRGVDILADAVKVTLGPKGRNVIIDKSFGAPRITKDGVSVAKEIELEDKFENMGAQMVREVASK
TNDIAGDGTTTATVLAQAIVREGNKAVAAGMNPMDLKRGIDLAVADVVKDLQAKAKKISTSEEVAQVGTISANGDKQVGL
DIAEAMQKVGNEGVITVEEAKTAETELEVVEGMQFDRGYLSPYFVTNPEKMIADLEDVFILLHEKKLSNLQSMLPVLEAV
VQTGKPLLIVAEDVEGEALATLVVNKLRGGLKIAAVKAPGFGDRRKRMLEDIAILTGGTVISEDLGIKLESVTLDMLGRA
KKVSISKENTTIVDGSGAKTDIEGRVAQIKAQIEETTSDYDREKLQERLAKLAGGVAVIRVGGSTEVEVKEKKDRIDDAL
NATRAAVQEGIVPGGGIALARSSTKITVKGANDDQEAGINIVRRALQSLVRQIAENAGDEASIVVGKVLDKNEDNFGYNA
QTSEYGDMIAMGIVDPLKVVRTALQNAASVASLLITTEAMIAELPKKDAPAGMPGGMGGMGGMDMM
>P95678 5.6.1.7~~~groEL~~~Chaperonin GroEL~~~
MAAKEVKFSTDARDRMLKGVNILADAVKVTLGPKGRNVVIEKSFGAPRITKDGVSVAKEIELADKFENMGAQMVKEVASR
TNDEAGDGTTTATVLAQAIVREGMKAVAAGMNPMDLKRGIDLATTTVVEAIKAAARPVKDSDEVAQVGTISANGEAQIGR
FIADASQKVGNEGVITVEENKGMDTEVEVVEGMQFDRGYLSPYFVTNPDKMIADLEDAYILLHEKKLSSLQPMVPLLEAV
IQSTRPLIIVAEDVEGEALATLVVNKLRGGLKIAAVKAPGFGDRRKAMLQDIAILTGGQVISDDLGMKLENVTLDMLGRA
KKVTISKENTTIVDGHGDKAEINARVAHIRTQIEETTSDYDREKLQERVAKLAGGVAVIRVGGMTEVEVKERKDRVDDAL
NATRAAVQEGIIVGGGVALVQAAKKLNDLTGANSDQDAGISIVRRALEAPLRQIAENAGVDGAVVAGKVRESADPAFGFN
AQTEEYGDMFGFGVIDPAKVTRTALEDAASIAGLLITTECMIAEKPEPKAAPAGGMGGMGGMDMM
>Q2NW94 5.6.1.7~~~groEL~~~Chaperonin GroEL~~~COG0459
MAAKDVKFGNDARVKMLRGVNVLADAVKVTLGPKGRNVVLDKSFGAPVITKDGVSVAREIELEDKFENMGAQMVKEVASK
ANDAAGDGTTTATVLAQSIVNEGLKAVAAGMNPMDLKRGIDKAVIAAVEELKKLSVPCSDSKAIAQVGTISANADETVGT
LIAEAMAKVGKEGVITVEEGSGLQDELDVVEGMQFDRGYLSPYFVNKPETGAVELESPFILLADKKISNIREMLPVLEAV
AKAGKPLLIIAEDVEGEALATLVVNTMRGIVKIAAVKAPGFGDRRKAMLQDIAILTAGTVISEEIGLELEKATLEDMGQA
KRVVITKDTTTIIDGEGDKALIDSRVTQINQQRDEATSDYDREKLQERVAKLAGGVAVIKVGAATEVEMKEKKARVEDAL
HATRAAVEEGVVAGGGVALIRVANRIAELRGDNEDQNVGIKVARRAMEAPLRQIVANAGEEPSVIANKVKAGEGNTGYNA
ATEEYGNMIDMGILDPTKVTRSALQYAASIAGLMITTECMVTDLPKEDKPDLGGAGGMGGMGGMGGMM
>P99083 5.6.1.7~~~groEL~~~Chaperonin GroEL~~~
MVKQLKFSEDARQAMLRGVDQLANAVKVTIGPKGRNVVLDKEFTAPLITNDGVTIAKEIELEDPYENMGAKLVQEVANKT
NEIAGDGTTTATVLAQAMIQEGLKNVTSGANPVGLRQGIDKAVKVAVEALHENSQKVENKNEIAQVGAISAADEEIGRYI
SEAMEKVGNDGVITIEESNGLNTELEVVEGMQFDRGYQSPYMVTDSDKMVAELERPYILVTDKKISSFQDILPLLEQVVQ
SNRPILIVADEVEGDALTNIVLNRMRGTFTAVAVKAPGFGDRRKAMLEDLAILTGAQVITDDLGLDLKDASIDMLGTASK
VEVTKDNTTVVDGDGDENSIDARVSQLKSQIEETESDFDREKLQERLAKLAGGVAVIKVGAASETELKERKLRIEDALNS
TRAAVEEGIVAGGGTALVNVYQKVSEIEAEGDIETGVNIVLKALTAPVRQIAENAGLEGSVIVERLKNAEPGVGFNAATN
EWVNMLEAGIVDPTKVTRSALQHAASVAAMFLTTEAVVASIPEKNNDQPNMGGMPGMM
>Q08854 5.6.1.7~~~groEL~~~Chaperonin GroEL~~~
MVKQLKFSEDARQAMLRGVDQLANAVKVTIGPKGRNVVLDKEFTAPLITNDGVTIAKEIELEDPYENMGAKLVQEVANKT
NEIAGDGTTTATVLAQAMIQEGLKNVTSGANPVGLRQGIDKAVKVAVEALHENSQKVENKNEIAQVGAISAADEEIGRYI
SEATEKVGNDGVITIITIEESNRLNTELELGMQFDRGYQSPYMVTDSDKMVAELERPYILVTDKKISSFQDILPLLEQVV
QSNRPILIVADEVEGDALTNIVLNRMRGTFTAVAVKAPGFGDRRKAMLEDLAILTGAQVITDDLGLDLKDASIDMLGTAS
KVEVTKDNTTVVDGDGDENSIDARVSQLKSQIEETESDFDREKLQERLAKLAGGVAVIKVGAASETELKERKLRIEDALN
STRAAVEEGIVAGGGTALVNVYQKVSENEAEGDIETGVNIVLKALTAPVRQIAENAGLEGSVIVERLKNAEPGVGFNGAT
NEWVNMLRRGIVDPTKVTRSALQHAASVAAMFLTTEAVVASIPEKNNDQPNMGGMPGMM
>Q5X9L8 5.6.1.7~~~groEL~~~Chaperonin GroEL~~~
MAKDIKFSADARAAMVRGVDMLADTVKVTLGPKGRNVVLEKAFGSPLITNDGVTIAKEIELEDHFENMGAKLVSEVASKT
NDIAGDGTTTATVLTQAIVHEGLKNVTAGANPIGIRRGIETATATAVEALKAIAQPVSGKEAIAQVAAVSSRSEKVGEYI
SEAMERVGNDGVITIEESRGMETELEVVEGMQFDRGYLSQYMVTDNEKMVADLENPFILITDKKVSNIQDILPLLEEVLK
TNRPLLIIADDVDGEALPTLVLNKIRGTFNVVAVKAPGFGDRRKAMLEDIAILTGGTVITEDLGLELKDATMTALGQAAK
ITVDKDSTVIVEGSGSSEAIANRIALIKSQLETTTSDFDREKLQERLAKLAGGVAVIKVGAPTETALKEMKLRIEDALNA
TRAAVEEGIVAGGGTALITVIEKVAALELEGDDATGRNIVLRALEEPVRQIALNAGYEGSVVIDKLKNSPAGTGFNAATG
EWVDMIKTGIIDPVKVTRSALQNAASVASLILTTEAVVANKPEPAAPAPAMPAGMDPGMMGGF
>P0A335 5.6.1.7~~~groEL~~~Chaperonin GroEL~~~COG0459
MSKEIKFSSDARSAMVRGVDILADTVKVTLGPKGRNVVLEKSFGSPLITNDGVTIAKEIELEDHFENMGAKLVSEVASKT
NDIAGDGTTTATVLTQAIVREGIKNVTAGANPIGIRRGIETAVAAAVEALKNNAIPVANKEAIAQVAAVSSRSEKVGEYI
SEAMEKVGKDGVITIEESRGMETELEVVEGMQFDRGYLSQYMVTDSEKMVADLENPYILITDKKISNIQEILPLLESILQ
SNRPLLIIADDVDGEALPTLVLNKIRGTFNVVAVKAPGFGDRRKAMLEDIAILTGGTVITEDLGLELKDATIEALGQAAR
VTVDKDSTVIVEGAGNPEAISHRVAVIKSQIETTTSEFDREKLQERLAKLSGGVAVIKVGAATETELKEMKLRIEDALNA
TRAAVEEGIVAGGGTALANVIPAVATLELTGDEATGRNIVLRALEEPVRQIAHNAGFEGSIVIDRLKNAELGIGFNAATG
EWVNMIDQGIIDPVKVSRSALQNAASVASLILTTEAVVANKPEPVAPAPAMDPSMMGGMM
>P81284 5.6.1.7~~~groEL~~~Chaperonin GroEL~~~
MAKEIKFDMNARDLLKKGVDELANAVKVTLGPKGRNVILEKKFGAPQITKDGVTVAKEIELACPYENMGAQLVKEVASKT
NDKAGDGTTTATVLAQAIIGVGLKNVTAGANPMDLKRGIDKAVSKVVESIASQSEAVGTNMDRIEHVAKISANGDEGIGK
LIAEAMQKVKKEGVITVEEAKGTETTVEVVEGMQFDRGYISAYFVTDTEKMETQFENPYILIYDKKISVLKDLLPILEQM
VQSGRALLIIAEDIDSEALATLVVNRLRGGLKVCAVKAPGFGDRRKAMLEDIAILTGGTVITEEKGMKLEDAKMDMLGSA
DKVTVNKDNTTIVKGNGDKAAIESRIGQIKAQIETTTSDYDKEKLQERLAKLAGGVAVLYVGAPSEVEMKEKKDRVDDAL
HATRAAIEEGTVPGGGVAYLRAIPALEGLKGENEDETTGIEIVKRAIEEPLRQIVNNAGKEGAVVVQKVKEGTGAFGYNA
RTDVYEDLSEAGVVDPAKVTRIALENAASIAGMFLTTECVVADKKEEAPAPPMNPGMGGMGGMM
>P26194 5.6.1.7~~~groEL~~~Chaperonin GroEL~~~
MAKELRFGDDARQQMLAGVNALADAVKATMGPSGRNVVLERSFGAPTVTKDGVSVAKEIEFENRFKNMGAQMVKEVAAKT
SDTAGDGTTTATVLARAIVVEGHKAVAAGMNPMDLKRGIDKAVAAVTKKLQEMSKPCKDGKAIAQVGTISANSDQAIGSI
IAEAMEKVGKEGVITVEDGNSLENELAVVEGMQFDRGYISPYFINNQQNMSAELEHPFILLVDKKISTIRDMLSVLEAVA
KSGPSLLIIAEDVEGEALATLVVNNMRGIVKVCAVKAPGFGDRRKAMLQDIAILTAGEVISEEVGTSLESATLDSLGTAK
RVVVTKENTTIIDGEGKAADINARITQIRAQMEETTSDYDREKLQERVAKLAGGVAVIKLVLLPNRMKRKARVEDALHAT
RAAVEEGIVAGGGVALIRAQKALDGLKGENADQDMGINILRRAIESPLRQIVANAGYEPSVIVNKVAESKDNFGFNAATG
EYGDMVEMGILDPTKVTRTALQNAASVASLMLTTECMVADLPKKEEPMGAGEMGGMGGMGGMGGMM
>Q60024 5.6.1.7~~~groEL~~~Chaperonin GroEL~~~
MAKQIKYGEEARRALERGVNAVADTVKVTLGPRGRNVVLDKKYGSPTVTNDGVTIAREIELEDPFENQGAQLLKEAATKT
NDIAGDGTTTATLLAQAMVREGLKNLAAGANPMLLRRGIAKAVDAAVEGLKRISKPIDNKESIAHVASISAADEEIGKLI
AEAMDKVGKDGVITVEESKTLGTTLEVVEGMQFDRGYISPYMVTDAEKMEAVLEEPVILITDKKISNIQDLLPLLEQIVQ
QGKKLLIIADDVEGEALATLIVNKLRGTFTCVAVKAPGFGDRRKEMLQDIAILTGGQVISEELGYDLKDVRLDMLGRARQ
VKVTKEYTTIVGGAGDPSEIKKRVNQIKAQIEETTSDYDREKLQERLAKLAGGVAVIQAGAATETELKEKKHRIEDALAA
TKAAVEEGIVPGGGIALLNVIEDVQKVVDSLEGDFKTGAKIVLRALEEPVRQIATNAGVDGSVIVEKIKAAKDPNFGYDA
YKEEFTDMFKAGIVDPTKVTRTALQNAASIASMILTTEAIVVDIPEKNTGMPNPGAGMDMM
>P61490 5.6.1.7~~~groEL~~~Chaperonin GroEL~~~COG0459
MAKILVFDEAARRALERGVNAVANAVKVTLGPRGRNVVLEKKFGSPTITKDGVTVAKEVELEDHLENIGAQLLKEVASKT
NDVAGDGTTTATVLAQAIVREGLKNVAAGANPLALKRGIEKAVEAAVEKIKALAIPVEDRKAIEEVATISANDPEVGKLI
ADAMEKVGKEGIITVEESKSLETELKFVEGYQFDKGYISPYFVTNPETMEAVLEDAFILIVEKKVSNVRELLPILEQVAQ
TGKPLLIIAEDVEGEALATLVVNKLRGTLSVAAVKAPGFGDRRKEMLKDIAAVTGGTVISEELGFKLENATLSMLGRAER
VRITKDETTIVGGKGKKEDIEARINGIKKELETTDSEYAREKLQERLAKLAGGVAVIRVGAATETELKEKKHRFEDALNA
TRAAVEEGIVPGGGVTLLRAISAVEELIKKLEGDEATGAKIVRRALEEPARQIAENAGYEGSVIVQQILAETKNPRYGFN
AATGEFVDMVEAGIVDPAKVTRSALQNAASIGALILTTEAVVAEKPEKKESTPASAGAGDMDF
>Q5SLM2 5.6.1.7~~~groEL~~~Chaperonin GroEL~~~COG0459
MAKILVFDEAARRALERGVNAVANAVKVTLGPRGRNVVLEKKFGSPTITKDGVTVAKEVELEDHLENIGAQLLKEVASKT
NDVAGDGTTTATVLAQAIVREGLKNVAAGANPLALKRGIEKAVEAAVEKIKALAIPVEDRKAIEEVATISANDPEVGKLI
ADAMEKVGKEGIITVEESKSLETELKFVEGYQFDKGYISPYFVTNPETMEAVLEDAFILIVEKKVSNVRELLPILEQVAQ
TGKPLLIIAEDVEGEALATLVVNKLRGTLSVAAVKAPGFGDRRKEMLKDIAAVTGGTVISEELGFKLENATLSMLGRAER
VRITKDETTIVGGKGKKEDIEARINGIKKELETTDSEYAREKLQERLAKLAGGVAVIRVGAATETELKEKKHRFEDALNA
TRAAVEEGIVPGGGVTLLRAISAVEELIKKLEGDEATGAKIVRRALEEPARQIAENAGYEGSVIVQQILAETKNPRYGFN
AATGEFVDMVEAGIVDPAKVTRSALQNAASIGALILTTEAVVAEKPEKKESTPASAGAGDMDF
>P23033 5.6.1.7~~~groEL~~~Chaperonin GroEL~~~COG0459
MAKQLLFNEEARKKLLSGVEQISSAVKVTLGPKGRNVLLEKGYGAPTVTKDGVSVAKEVELEDPFENMGAQLLKEVATKT
NDVAGDGTTTATVLAYSMVREGLKAVAAGMTPLELKRGMDKAVAIAVDDIKQNSKGIKSNEEVAHVASVSANNDKEIGRI
LASAIEKVGNDGVIDVDEAQTMETVTEFVEGMQFDRGYISSYFVTDRDRMETVYENPYILIYDKSISTMKDLLPLLEKIA
QTGRPLLIIAEDVEGEALATLVVNSLRGTLKTCAVKAPGFGDRRKEMLEDIAILSGGQVISEDLGLKLESADIALLGQAK
SVKVDKENTTIIDGSGKSKDIKDRIEQIKKQIEASTSDYDSEKLKERLAKLSGGVAVIKIGAVTEVEMKEKKHRVEDALN
ATRAAIEEGIVAGGGLALIQAAAALEKADLSGLTPDEAVGFKIVRRALEEPIRQISENAGIDGAVVAEKAKEKRGIGFDA
SKMEWVDMIKVGIIDPAKVTRSALQNAASVSGLLLTTECAIAAIPEKSSSTPPAPDMGGMGGMY
>Q5GUT1 5.6.1.7~~~groEL~~~Chaperonin GroEL~~~
MAAKDIRFGEDARTRMVRGVNVLANAVKATLGPKGRNVVLEKSFGAPTITKDGVSVAKEIELADKFENMGAQMVKEVASK
TNDNAGDGTTTATVLAQALIREGAKAVAAGMNPMDLKRGIDQAVKAAVVELKNISKPTTDDKAIAQVGTISANSDESIGN
IIAEAMKKVGKEGVITVEEGSGLENELDVVEGMQFDRGYLSPYFINNQQSQSADLDDPFILLHDKKISNVRDLLPVLEGV
AKAGKPLLIVAEEVEGEALATLVVNTIRGIVKVVAVKAPGFGDRRKAMLEDMAVLTGGTVISEEVGLALEKATIKDLGRA
KKVQVSKENTTIIDGAGDSAAIESRVGQIKTQIEDTSSDYDREKLQERVAKLAGGVAVIKVGASTEIEMKEKKARVEDAL
HATRAAVEEGVVPGGGVALVRALVAVGNLTGANEDQTHGIQIALRAMEAPLREIVANAGEEPSVILNKVKEGTGNYGYNA
ANGEFGDMVEFGILDPTKVTRSALQNAASIAGLMITTEAMVADAPKKDEPAMPAGGGMGGMGGMDF
>O34840 ~~~chaA~~~Ca(2+)/H(+) antiporter ChaA~~~COG0387
MNRIFFILVAAGVPLSVIGSLMHWPSAVLFAVYCVTIIALASYMGRATESLSIIAGPRIGGLLNATFGNAVELIISLFAL
KEGLTGIVLASLTGSVLGNLLLVAGLSFFVGGLKYKRQEFNIHDARHNSGLLIFAIIVAFVIPEVFSVGMGNASKLNLSI
GISIIMILLYVAALYFKLVTHRGVYQPNNAAQTEEEEEPEWSGKVATIVLFAATIVVAYISENLVHTFHSVAEQFGWSEL
FIGVIIVAIVGNAAEHASAIIMAFKNKMDIAVEIAVGSTLQIAMFVAPVLVICSIFFPTSMPLVFTLPELVAMVSAVLLM
IAISNDGDSNWFEGATLLAAYVIMAIGFFLL
>P31801 ~~~chaA~~~Sodium-potassium/proton antiporter ChaA~~~COG0387
MSNAQEAVKTRHKETSLIFPVLALVVLFLWGSSQTLPVVIAINLLALIGILSSAFSVVRHADVLAHRLGEPYGSLILSLS
VVILEVSLISALMATGDAAPTLMRDTLYSIIMIVTGGLVGFSLLLGGRKFATQYMNLFGIKQYLIALFPLAIIVLVFPMA
LPAANFSTGQALLVALISAAMYGVFLLIQTKTHQSLFVYEHEDDSDDDDPHHGKPSAHSSLWHAIWLIIHLIAVIAVTKM
NASSLETLLDSMNAPVAFTGFLVALLILSPEGLGALKAVLNNQVQRAMNLFFGSVLATISLTVPVVTLIAFMTGNELQFA
LGAPEMVVMVASLVLCHISFSTGRTNVLNGAAHLALFAAYLMTIFA
>P0AE65 ~~~chaB~~~Putative cation transport regulator ChaB~~~COG4572
MPYKTKSDLPESVKHVLPSHAQDIYKEAFNSAWDQYKDKEDRRDDASREETAHKVAWAAVKHEYAKGDDDKWHKKS
>P39163 4.3.2.7~~~chaC~~~Glutathione-specific gamma-glutamylcyclotransferase~~~COG3703
MITRDFLMNADCKTAFGAIEESLLWSAEQRAASLAATLACRPDEGPVWIFGYGSLMWNPALEFTESCTGTLVGWHRAFCL
RLTAGRGTAHQPGRMLALKEGGRTTGVAYRLPEETLEQELTLLWKREMITGCYLPTWCQLDLDDGRTVNAIVFIMDPRHP
EYESDTRAQVIAPLIAAASGPLGTNAQYLFSLEQELIKLGMQDDGLNDLLVSVKKLLAENFPDGVLRPGFA
>P17411 3.2.1.86~~~chbF~~~6-phospho-beta-glucosidase~~~COG1486
MSQKLKVVTIGGGSSYTPELLEGFIKRYHELPVSELWLVDVEGGKPKLDIIFDLCQRMIDNAGVPMKLYKTLDRREALKD
ADFVTTQLRVGQLPARELDERIPLSHGYLGQETNGAGGLFKGLRTIPVIFDIVKDVEELCPNAWVINFTNPAGMVTEAVY
RHTGFKRFIGVCNIPIGMKMFIRDVLMLKDSDDLSIDLFGLNHMVFIKDVLINGKSRFAELLDGVASGQLKASSVKNIFD
LPFSEGLIRSLNLLPCSYLLYYFKQKEMLAIEMGEYYKGGARAQVVQKVEKQLFELYKNPELKVKPKELEQRGGAYYSDA
ACEVINAIYNDKQAEHYVNIPHHGQIDNIPADWAVEMTCKLGRDGATPHPRITHFDDKVMGLIHTIKGFEIAASNAALSG
EFNDVLLALNLSPLVHSDRDAELLAREMILAHEKWLPNFADCIAELKKAH
>P37794 3.5.1.105~~~chbG~~~Chitooligosaccharide deacetylase ChbG~~~COG3394
MERLLIVNADDFGLSKGQNYGIIEACRNGIVTSTTALVNGQAIDHAVQLSRDEPSLAIGMHFVLTMGKPLTAMPGLTRDG
VLGKWIWQLAEEDALPLEEITQELVSQYLRFIELFGRKPTHLDSHHHVHMFPQIFPIVARFAAEQGIALRADRQMAFDLP
VNLRTTQGFSSAFYGEEISESLFLQVLDDAGHRGDRSLEVMCHPAFIDNTIRQSAYCFPRLTELDVLTSASLKGAIAQRG
YRLGSYRDV
>A6T7U7 3.5.1.105~~~chbG~~~Chitooligosaccharide deacetylase~~~
MERVLIVNADDFGLSKGQNYGIIEACRNGVVTSTTALVNGAAIDHAAQLGRSTPELAVGMHFVLTLGEPLSAMPGLTRDG
RLGKWIWQQAEEDSLPLEEIAHELACQYHRFVELFGHEPTHIDSHHHVHMFAQIYPIVAAFAREKGIALRIDRQVAAQSG
LDQQAARSSAGFSSEFYGEAVSEELFLQTLDASIARGERSLEVMCHPAYVDRIIMGSAYCYPRLDELDVLTAASLKAAVA
DRGYRLGTYRDV
>Q9F8X1 2.4.1.280~~~chbP~~~N,N'-diacetylchitobiose phosphorylase~~~
MKYGYFDNDNREYVITRPDVPAPWTNYLGTEKFCTVISHNAGGYSFYHSPEYNRVTKFRPNFTQDRPGHYIYLRDDETGD
FWSVSWQPVAKNLDDAHYEVRHGLSYSKFRCDYNGIVATKTLFVPKGEDAQVWDVEIENTSDQPRTISAFGYVEFSFSHI
ASDNQNHQMSLYSAGTEYNNGVLEYDLYYNTDDFLGFYYLTATFDADSYDGQRDAFLGMYRDEANPIAVANGRCSNSAQT
CYNHCGALHKQFVLQPGEKVRFAVILGVGKGNGEKLRAKYQDLSQVDAAFAGIKQHWDERCAKFQVRSPNQGLDTMINAW
TLYQAETCVVWSRFASFIEVGGRTGLGYRDTAQDAISVPHTNPAMTRKRLVDLLRGQVKAGYGLHLFDPDWFDPEKADVK
PSKSPTVVPTPSDEDKIHGIKDTCSDDHLWIVPTILNFVKETGDLSFIDEVIPYADGGDATVYQHMMAALDFSAEYVGQT
GICKGLRADWNDCLNLGGGESAMVSFLHFWALEAFLELARHRQDAAAIDKYQAMANGVREACETHLWDDNGGWYIRGLTK
DGDKIGTFEQQEGKVHLESNTLAVLSGAVSQQRGEKAMDAVYEYLFSPYGLHLNAPSFATPNDDIGFVTRVYQGVKENGA
IFSHPNPWAWVAEAKLGRGDRAMEFYDSLNPYNQNDIIETRVAEPYSYVQFIMGRDHQDHGRANHPWLTGTSGWAYYATT
NFILGVRTGFDTLTVDPCIPAAWSGFEVTREWRGATYHISVQNPNGVSKGVQSILVNGEAVDAINAQPAGSENQVTVILG
>Q76IQ9 2.4.1.280~~~chbP~~~N,N'-diacetylchitobiose phosphorylase~~~
MKYGYFDNDNREYVITRPDVPAPWTNYLGTEKFCTVISHNAGGYSFYNSPEYNRVTKFRPNATFDRPGHYVYLRDDDSGD
YWSISWQPVAKSLDEAQYQIRHGLSYSKFQCDYNGIHARKTLFVPKGEDAEIWDVVIKNTSDQVRTISAFSFVEFSFSHI
QSDNQNHQMSLYSAGTAYRPGLIEYDLYYNTDDFEGFYYLASTFDPDSYDGQRDRFLGLYRDEANPLAVEQGRCSNSAQT
CYNHCGSLHKQFTLQPGEEIRFAYILGIGKGNGERLREHYQDVANIDAAFAAIKAHWDERCAKFQVKSPNQGLDTMINAW
TLYQAETCVVWSRFASFIEVGGRTGLGYRDTAQDAISVPHANPEMTRKRIVDLLRGQVKAGYGLHLFDPDWFDPEKEDVA
PSKSPTVVPTPSDEDKIHGIKDTCSDDHLWLIPTICKYVMETGETSFFDQMIPYADGGEASVYEHMKAALDFSAEYVGQT
GICKGLRADWNDCLNLGGGESSMVSFLHFWALQEFIDLAKFLGKDQDVNTYTEMAANVREACETHLWDDEGGWYIRGLTK
NGDKIGTAQQQEGRVHLESNTLAVLSGLASQERGEQAMDAVDEHLFSPYGLHLNAPSFSTPNDDIGFVTRVYQGVKENGA
IFSHPNPWAWVAETKLGRGDRAMKFYDALNPYNQNDIIEKRIAEPYSYVQFIMGRDHQDHGRANHPWLTGTSGWAYFAVT
NYILGVQSGFTGLSVDPCIPSDWPGFEVTRQWRGATYHIQVENPDHVSKGVKSITLNGAPIQGRIPPQAQGSDNQVVVVL
G
>P17410 ~~~chbR~~~HTH-type transcriptional regulator ChbR~~~COG1917
MMQPVINAPEIATAREQQLFNGKNFHVFIYNKTESISGLHQHDYYEFTLVLTGRYFQEINGKRVLLERGDFVFIPLGSHH
QSFYEFGATRILNVGISKRFFEQHYLPLLPYCFVASQVYRTNNAFLTYVETVISSLNFRETGLEEFVEMVTFYVINRLRH
YREEQVIDDVPQWLKSTVEKMHDKEQFSESALENMVALSAKSQEYLTRATQRYYGKTPMQIINEIRINFAKKQLEMTNYS
VTDIAFEAGYSSPSLFIKTFKKLTSFTPKSYRKKLTEFNQ
>Q54468 3.2.1.52~~~chb~~~Chitobiase~~~
MNAFKLSALARLTATMGFLGGMGSAMADQQLVDQLSQLKLNVKMLDNRAGENGVDCAALGADWASCNRVLFTLSNDGQAI
DGKDWVIYFHSPRQTLRVDNDQFKIAHLTGDLYKLEPTAKFSGFPAGKAVEIPVVAEYWQLFRNDFLPRWYATSGDAKPK
MLANTDTENLDQFVAPFTGDQWKRTKDDKNILMTPASRFVSNADLQTLPAGALRGKIVPTPMQVKVHAQDADLRKGVALD
LSTLVKPAADVVSQRFALLGVPVQTNGYPIKTDIQPGKFKGAMAVSGAYELKIGKKEAQVIGFDQAGVFYGLQSILSLVP
SDGSGKIATLDASDAPRFPYRGIFLDVARNFHKKDAVLRLLDQMAAYKLNKFHFHLSDDEGWRIEIPGLPELTEVGGQRC
HDLSETTCLLPQYGQGPDVYGGFFSRQDYIDIIKYAQARQIEVIPEIDMPAHARAAVVSMEARYKKLHAAGKEQEANEFR
LVDPTDTSNTTSVQFFNRQSYLNPCLDSSQRFVDKVIGEIAQMHKEAGQPIKTWHFGGDEAKNIRLGAGYTDKAKPEPGK
GIIDQSNEDKPWAKSQVCQTMIKEGKVADMEHLPSYFGQEVSKLVKAHGIDRMQAWQDGLKDAESSKAFATSRVGVNFWD
TLYWGGFDSVNDWANKGYEVVVSNPDYVYMDFPYEVNPDERGYYWGTRFSDERKVFSFAPDNMPQNAETSVDRDGNHFNA
KSDKPWPGAYGLSAQLWSETQRTDPQMEYMIFPRALSVAERSWHRAGWEQDYRAGREYKGGETHFVDTQALEKDWLRFAN
ILGQRELAKLDKGGVAYRLPVPGARVAAGKLEANIALPGLGIEYSTDGGKQWQRYDAKAKPAVSGEVQVRSVSPDGKRYS
RAEKV
>P13670 3.2.1.52~~~chb~~~N,N'-diacetylchitobiase~~~
MLKHSLIAASVITTLAGCSSLQSSEQQVVNSLADNLDIQYEVLTNHGANEGLACQDMGAEWASCNKVNMTLVNQGEAVDS
KDWAIYFHSIRLILDVDNEQFKISRVTGDLHKLEPTDKFDGFAAGEEVVLPLVGEYWQLFETDFMPGAFVSAPNAEPKMI
ASLNTEDVASFVTGLEGNNLKRTPDDNNVFANAVSRFEKNEDLATQDVSTTLLPTPMHVEAGKGKVDIADGIALPKDAFD
ATQFAAIQDRAEVVGVDVRGDLPVSITVVPADFTGELAKSGAYEMSIKGDGIVIKAFDQAGAFYAVQSIFGLVDSQNADS
LPQLSIKDAPRFDYRGVMVDVARNFHSKDAILATLDQMAAYKMNKLHLHLTDDEGWRLEIPGLPELTEVGANRCFDTQEK
SCLLPQLGSGPTTDNFGSGYFSKADYVEILKYAKARNIEVIPEIDMPAHARAAVVSMEARYDRLMEEGKEAEANEYRLMD
PQDTSNVTTVQFYNKQSFINPCMESSTRFVDKVISEVAAMHQEAGAPLTTWHFGGDEAKNIKLGAGFQDVNAEDKVSWKG
TIDLSKQDKPFAQSPQCQTLITDGTVSDFAHLPSHFAEEVSKIVAEKGIPNFQAWQDGLKYSDGEKAFATENTRVNFWDV
LYWGGTSSVYEWSKKGYDVIVSNPDYVYMDMPYEVDPKERGYYWATRATDTRKMFGFAPENMPQNAETSVDRDGNGFTGK
GEIEAKPFYGLSAQLWSETVRNDEQYEYMVFPRVLAAAQRAWHRADWENDYKVGVEYSQNSNLVDKASLNQDYNRFANVL
GQRELAKLEKSGIDYRLPVPGAKVEDGKLAMNVQFPGVTLQYSLDGENWLTYADNARPNVTGEVFIRSVSATGEKVSRIT
SVK
>P95727 1.3.1.120~~~chcA~~~1-cyclohexenylcarbonyl-CoA reductase~~~
MNSPHQQQTADRRQVSLITGASRGIGRTLALTLARRGGTVVVNYKKNADLAQKTVAEVEEAGGQGFAVQADVETTEGVTA
LFDEVAQRCGRLDHFVSNAAASAFKNIVDLGPHHLDRSYAMNLRPFVLGAQQAVKLMDNGGRIVALSSYGSVRAYPTYAM
LGGMKAAIESWVRYMAVEFAPYGINVNAVNGGLIDSDSLEFFYNVEGMPPMQGVLDRIPARRPGTVQEMADTIAFLLGDG
AGYITGQTLVVDGGLSIVAPPFFADAGEALELPPRPTRDA
>Q2LQP0 1.3.8.11~~~~~~Cyclohexane-1-carbonyl-CoA dehydrogenase~~~COG1960
MYINTETEDLKTAIDAIRKAVKDRIAPLAAEVDDSGVIKPEIYDLLWDLGLMTVTYPPEYGGSETNPGTLLCIGCEEIAK
ACASTALLLIIQAVGSFPLMHGGRKELLDRIAPRIVNNRELAGYLVSEPGAGSDVKAIRTKAVKDGNDWVINGTKCWATN
GPIASFYSCLCRTKDDKGVQGYSFFLVERNTPGLSVGKIEHKMGMRGSQTSEVILEDVRVPAENLLGELNNGFKLAMKDF
DMSRPAIAAQALGISEGAFAQMETYSRERYTFGKPLCEHGMITQIIADSAALIEAGRGLIYQAADLYDKGKKNTKLASMA
KFFMGDAAVKITTDAIQVFGGYGYTHDYPVERMFRDAKLTQIFEGANQIQRIVVAREIRDEQSK
>Q5KUD5 1.3.98.5~~~chdC~~~Coproheme decarboxylase~~~COG3253
MSEAAQTLDGWYCLHDFRTIDWSAWKTLPNEEREAAISEFLALVDQWETTESEKQGSHAVYTIVGQKADILFMILRPTLD
ELHEIETALNKTKLADYLLPAYSYVSVVELSNYLASGSEDPYQIPEVRRRLYPILPKTNYICFYPMDKRRQGNDNWYMLS
MEQRRELMRAHGMTGRKYAGKVTQIITGSVGLDDFEWGVTLFSDDALQFKKLVYEMRFDEVSARFGEFGSFFVGTRLSVE
KVPSFFHV
>A0A0K2H9D8 1.3.98.5~~~chdC~~~Coproheme decarboxylase~~~
MSEAAQTLDGWYCLHDFRTIDWSAWKTLPNEEREAAISEFLALVDQWETTESEKQGSHAVYTIVGQKADILFMILRPTLD
ELHEIETALNKTKLADYLLPAYSYVSVVELSNYLASGSEDPYQIPEVRRRLYPILPKTNYICFYPMDKRRQGNDNWYMLS
MEQRRELMRAHGMTGRKYAGKVTQIITGSVGLDDFEWGVTLFSDDALQFKKLVYEMRFDEVSARFGEFGSFFVGTRLPME
NVSSFFHV
>Q8Y5F1 1.3.98.5~~~chdC~~~Coproheme decarboxylase~~~COG3253
MNEAVKTLDGWFCLHDFRSIDWAAWRELNPGNQELMLNELSHFLSDMEITKNIGEGEHTIYSILGQKADLVFFTLRDSLE
ALNEVENRFNKLAIADYLLPTYSYISVVELSNYLASHMAGGDDPYQNKGVRARLYPALPPKKHICFYPMSKKRDGADNWY
MLPMEERQQLIRDHGLIGRSYAGKVQQIIGGSIGFDDYEWGVTLFSDDALEFKRIVTEMRFDEASARYAEFGSFFIGNLL
LSEQLSKLFTI
>A0QW25 1.3.98.5~~~chdC~~~Coproheme decarboxylase~~~COG3253
MAKLDFDALNSTIRYLMFSVFAVAPGELGEDRADVIDEAATFLKQQEDKGVVVRGLYDVAGLRADADFMIWTHADNVEAL
QSTYSDFRRTTALGRISDPVWSSVALHRPAEFNKSHIPAFLAGEEPGNYICVYPFVRSYEWYLLPDEERRRMLSEHGMAA
RGYKDVRANTVPAFALGDYEWILAFEAPELHRIVDLMRDLRATDARRHTREETPFFTGPRISVENLIAKLP
>P9WL45 1.3.98.5~~~chdC~~~Coproheme decarboxylase~~~COG3253
MARLDYDALNATLRYLMFSVFSVSPGALGDQRDAIIDDASTFFKQQEERGVVVRGLYDVAGLRADADFMVWTHAERVEAL
QATYADFRRTTTLGRACTPVWSGVGLHRPAEFNKSHIPAFLAGEEPGAYICVYPFVRSYEWYLLPDEERRRMLAEHGMAA
RGYKDVRANTVPAFALGDYEWILAFEAPELDRIVDLMRELRATDARRHTRAETPFFTGPRVPVEQLVHSLP
>Q2G0J1 1.3.98.5~~~chdC~~~Coproheme decarboxylase~~~COG3253
MSQAAETLDGWYSLHLFYAVDWASLRIVPKDERDALVTEFQSFLENTATVRSSKSGDQAIYNITGQKADLLLWFLRPEMK
SLNHIENEFNKLRIADFLIPTYSYVSVIELSNYLAGKSDEDPYENPHIKARLYPELPHSDYICFYPMNKRRNETYNWYML
TMEERQKLMYDHGMIGRKYAGKIKQFITGSVGFDDFEWGVTLFSDDVLQFKKIVYEMRFDETTARYGEFGSFFVGHIINT
NEFDQFFAIS
>A6QEP0 1.3.98.5~~~chdC~~~Coproheme decarboxylase~~~
MSQAAETLDGWYSLHLFYAVDWASLRIVPKDERDALVTEFQSFLENTATVRSSKSGDQAIYNITGQKADLLLWFLRPEMK
SLNHIENEFNKLRIADFLIPTYSYVSVIELSNYLAGKSDEDPYENPHIKARLYPELPHSDYICFYPMNKRRNETYNWYML
TMEERQKLMYDHGMIGRKYAGKIKQFITGSVGFDDFEWGVTLFSDDVLQFKKIVYEMRFDETTARYGEFGSFFVGHIINT
NEFDQFFAIS
>Q7A759 1.3.98.5~~~chdC~~~Coproheme decarboxylase~~~
MSQAAETLDGWYSLHLFYAVDWASLRIVPKDERDALVTEFQSFLENTATVRSSKSGDQAIYNITGQKADLLLWFLRPEMK
SLNHIENEFNKLRIADFLIPTYSYVSVIELSNYLAGKSDEDPYENPHIKARLYPELPHSDYICFYPMNKRRNETYNWYML
TMEERQKLMYDHGMIGRKYAGKIKQFITGSVGFDDFEWGVTLFSDDVLQFKKIVYEMRFDETTARYGEFGSFFVGHLINT
NEFDQFFAIS
>Q5SHL6 1.3.98.5~~~chdC~~~Coproheme decarboxylase~~~COG3253
MERHVPEPTHTLEGWHVLHDFRLLDFARWFSAPLEAREDAWEELKGLVREWRELEEAGQGSYGIYQVVGHKADLLFLNLR
PGLDPLLEAEARLSRSAFARYLGRSYSFYSVVELGSQEKPLDPESPYVKPRLTPRVPKSGYVCFYPMNKRRQGQDNWYML
PAKERASLMKAHGETGRKYQGKVMQVISGAQGLDDWEWGVDLFSEDPVQFKKIVYEMRFDEVSARYGEFGPFFVGKYLDE
EALRAFLGL
>P0CH62 3.7.1.11~~~~~~Cyclohexane-1,2-dione hydrolase~~~
MAIKRGADLIVEALEEYGTEQVVGFIGHTSHFVADAFSKSHLGKRVINPATELGGAWMVNGYNYVKDRSAAVGAWHCVGN
LLLHAAMQEARTGRIPAVHIGLNSDGRLAGRSEAAQQVPWQSFTPIARSTQRVERLDKVGEAIHEAFRVAEGHPAGPAYV
DIPFDLTADQIDDKALVPRGATRAKSVLHAPNEDVREAAAQLVAAKNPVILAGGGVARSGGSEALLKLAEMVGVPVVTTS
TGAGVFPETHALAMGSAGFCGWKSANDMMAAADFVLVLGSRLSDWGIAQGYITKMPKFVHVDTDPAVLGTFYFPLLSVVA
DAKTFMEQLIEVLPGTSGFKAVRYQERENFRQATEFRAAWDGWVREQESGDGMPASMFRAMAEVRKVQRPEDIIVTDIGN
HTLPMFGGAILQRPRRLVTSMAEGILGCGFPMALGAQLAEPNSRVFLGTGDGALYYHFNEFRVAVEHKLPVITMVFTNES
YGANWTLMNHQFGQNNWTEFMNPDWVGIAKAFGAYGESVRETGDIAGALQRAIDSGKPALIEIPVSKTQGLASDPVGGVG
PNLLLKGREIPVDTGGSMYPGENLLHLKS
>O25153 2.7.13.3~~~cheAY~~~Sensor histidine kinase CheAY~~~COG0643
MDDLQEIMEDFLIEAFEMNEQLDQDLVELEHNPEDLDLLNRIFRVAHTIKGSSSFLNLNILTHLTHNMEDVLNRARKGEI
KITPDIMDVVLRSIDLMKTLLVTIRDTGSDTNNGKENEIEEAVKQLQAITSQNLESAKERTTEAPQKENKEETKEEAKEE
NKENKAKAPTAENTSSDNPLADEPDLDYANMSAEEVEAEIERLLNKRQEADKERRAQKKQEAKPKQEVTPTKETPKAPKT
ETKAKAKADTEENKAPSIGVEQTVRVDVRRLDHLMNLIGELVLGKNRLIRIYSDVEERYDGEKFLEELNQVVSSISAVTT
DLQLAVMKTRMQPVGKVFNKFPRMVRDLSRELGKSIELIIEGEETELDKSIVEEIGDPLIHIIRNSCDHGIEPLEERRKL
NKPETGKVQLSAYNEGNHIVIKISDDGKGLDPVMLKEKAIEKGVISERDAEGMSDREAFNLIFKPGFSTAKVVSNVSGRG
VGMDVVKTNIEKLNGIIEIDSEVGVGTTQKLKIPLTLAIIQALLVGVQEEYYAIPLSSVLETVRISQDEIYTVDGKSVLR
LRDEVLSLVRLSDIFKVDAILESNSDVYVVIIGLADQKIGVIVDYLIGQEEVVIKSLGYYLKNTRGIAGATVRGDGKITL
IVDVGAMMDMAKSIKVNITTLMNESENTKSKNSPSDYIVLAIDDSSTDRAIIRKCLKPLGITLLEATNGLEGLEMLKNGD
KIPDAILVDIEMPKMDGYTFASEVRKYNKFKNLPLIAVTSRVTKTDRMRGVESGMTEYITKPYSGEYLTTVVKRSIKLEG
DQS
>P29072 2.7.13.3~~~cheA~~~Chemotaxis protein CheA~~~COG0643
MDMNQYLDVFIDESKEHLQTCNEKLLLLEKDPTDLQLVHDIFRAAHTLKGMSATMGYTDLAHLTHLLENVLDAIRNGDME
VTSDWLDILFEALDHLETMVQSIIDGGDGKRDISEVSAKLDVNGAHAESAASAEPAEAQSSASDWEYDEFERTVIQEAEE
QGFKRYEIKISLNENCMLKAVRVYMVFEKLNEVGEVAKTIPSAEVLETEDFGTDFQVCFLTHQSAEDIEQLINGVSEIEH
VEVIQGAPLTSAEKPEESKQEDSPAAAVPAKQEKQKQPAKNDEQAKHSAGGSKTIRVNIDRLDSLMNLFEELVIDRGRLE
QIAKELEHNELTETVERMTRISGDLQSIILNMRMVPVETVFNRFPRMIRQLQKELNKKIELSIIGAETELDRTVIDEIGD
PLVHLIRNSIDHGIEAPETRLQKGKPESGKVVLKAYHSGNHVFIEVEDDGAGLNRKKILEKALERGVITEKEAETLEDNQ
IYELIFAPGFSTADQISDISGRGVGLDVVKNKLESLGGSVSVKSAEGQGSLFSIQLPLTLSIISVLLIKLEEETFAIPIS
SIIETAVIDRKDILQTHDREVIDFRGHIVPVVYLKEEFKIEDTRKDAEQLHIIVVKKGDKPTAFVVDSFIGQQEVVLKSL
GDYLTNVFAISGATILGDGEVALIIDCNALII
>P07363 2.7.13.3~~~cheA~~~Chemotaxis protein CheA~~~COG0643
MSMDISDFYQTFFDEADELLADMEQHLLVLQPEAPDAEQLNAIFRAAHSIKGGAGTFGFSVLQETTHLMENLLDEARRGE
MQLNTDIINLFLETKDIMQEQLDAYKQSQEPDAASFDYICQALRQLALEAKGETPSAVTRLSVVAKSEPQDEQSRSQSPR
RIILSRLKAGEVDLLEEELGHLTTLTDVVKGADSLSAILPGDIAEDDITAVLCFVIEADQITFETVEVSPKISTPPVLKL
AAEQAPTGRVEREKTTRSNESTSIRVAVEKVDQLINLVGELVITQSMLAQRSSELDPVNHGDLITSMGQLQRNARDLQES
VMSIRMMPMEYVFSRYPRLVRDLAGKLGKQVELTLVGSSTELDKSLIERIIDPLTHLVRNSLDHGIELPEKRLAAGKNSV
GNLILSAEHQGGNICIEVTDDGAGLNRERILAKAASQGLTVSENMSDDEVAMLIFAPGFSTAEQVTDVSGRGVGMDVVKR
NIQKMGGHVEIQSKQGTGTTIRILLPLTLAILDGMSVRVADEVFILPLNAVMESLQPREADLHPLAGGERVLEVRGEYLP
IVELWKVFNVAGAKTEATQGIVVILQSGGRRYALLVDQLIGQHQVVVKNLESNYRKVPGISAATILGDGSVALIVDVSAL
QAINREQRMANTAA
>Q52880 2.7.13.3~~~cheA~~~Chemotaxis protein CheA~~~COG0643
MDMNEIKEIFFQECEEQLAELESGLLKLNDGDRDPETVNAVFRAVHSIKGGAGAFGLDDLVSFAHVFETTLDCVRSNRLE
PNQDVLKVMLRSADVLADLTNAARDGGGVDEARSRQLIKELEALANGELPQAAAESAPKTTPAGVAPAAPVVNEEGFQPV
AFSFDDFETGDEPTIEPSTYEIVFKPKSDLYSKGNDATLLLRDLSRLGEMSIHCNMDTLPPLDRMNPEEAYFSWKISLKT
DKGEEAIRSVFEFAEWDCELDVALAGGTVGMDEDLPMQPVPFDLSILEDEAQAPAGEEDRAAASEGDSRNAAVAAAQTAS
NVLQMAQSTARVSPENARNSQSASAAQAAAQQAASAATPTIRVDLDRVDRLINLVGELVINQAMLSQSVIENDTNGTSSI
NMGLEELQQLTREIQDSVMAIRAQPVKPVFQRMSRIVREIADMTGKSVRLITEGENTEVDKTVIDKLAEPLTHMIRNAVD
HGLETPEKRVAAGKNPEGTVRLTAKHRSGRIVIELADDGAGINREKVRQKAIDNDLIAADANLSDEEVDNLIFHAGFSTA
DKISDISGRGVGMDVVKRSIQALGGRINISSKPGQGSIFTMSLPLTLAVLDGMVVTVANQTLVVPLTAIVETLQPEASAI
HSFGSSQRLISIRDSFCPLVDVGRILNFRGAQANPVEGVALLVESEGGGQRALMVDAIQGQRQVVIKSLEANYTHVPGIA
AATILGDGRVALILDVDAIVAASRGQSLKPEMSLAAAG
>P09384 2.7.13.3~~~cheA~~~Chemotaxis protein CheA~~~
MSMDISDFYQTFFDEADELLADMEQHLLDLVPESPDAEQLNAIFRAAHSIKGGAGTFGFTILQETTHLMENLLDEARRGE
MQLNTDIINLFLETKDIMQEQLDAYKNSEEPDAASFEYICNALRQLALEAKGETTPAVVETAALSAAIQEESVAETESPR
DESKLRIVLSRLKANEVDLLEEELGNLATLTDVVKGADSLSATLDGSVAEDDIVAVLCFVIEADQIAFEKVVAAPVEKAQ
EKTEVAPVAPPAVVAPAAKSAAHEHHAGREKPARERESTSIRVAVEKVDQLINLVGELVITQSMLAQRSNELDPVNHGDL
ITSMGQLQRNARDLQESVMSIRMMPMEYVFSRFPRLVRDLAGKLGKQVELTLVGSSTELDKSLIERIIDPLTHLVRNSLD
HGIEMPEKRLEAGKNVVGNLILSAEHQGGNICIEVTDDGAGLNRERILAKAMSQGMAVNENMTDDEVGMLIFAPGFSTAE
QVTDVSGRGVGMDVVKRNIQEMGGHVEIQSKQGSGTTIRILLPLTLAILDGMSVRVAGEVFILPLNAVMESLQPREEDLH
PLAGGERVLEVRGEYLPLVELWKVFDVDGAKTEATQGIVVILQSAGRRYALLVDQLIGQHQVVVKNLESNYRKVPGISAA
TILGDGSVALIVDVSALQGLNREQRMAITAA
>Q56310 2.7.13.3~~~cheA~~~Chemotaxis protein CheA~~~COG0643
MMEEYLGVFVDETKEYLQNLNDTLLELEKNPEDMELINEAFRALHTLKGMAGTMGFSSMAKLCHTLENILDKARNSEIKI
TSDLLDKIFAGVDMITRMVDKIVSEGSDDIGENIDVFSDTIKSFASSGKEKPSEIKNETETKGEEEHKGESTSNEEVVVL
PEEVAHVLQEARNKGFKTFYIKVILKEGTQLKSARIYLVFHKLEELKCEVVRTIPSVEEIEEEKFENEVELFVISPVDLE
KLSEALSSIADIERVIIKEVTAVTEESGAEKRTEKEEKTEKTEEKAERKKVISQTVRVDIEKLDNLMDLMGELVIARSRI
LETLKKYNIKELDESLSHLSRITLDLQNVVMKIRMVPISFVFNRFPRMVRDLAKKMNKEVNFIMRGEDTELDRTFVEEIG
EPLLHLLRNAIDHGIEPKEERIAKGKPPIGTLILSARHEGNNVVIEVEDDGRGIDKEKIIRKAIEKGLIDESKAATLSDQ
EILNFLFVPGFSTKEKVSEVSGRGVGMDVVKNVVESLNGSISIESEKDKGTKVTIRLPLTLAIIQALLVKVNNLVYAIPI
ANIDTILSISKEDIQRVQDRDVIVIRGEVIPVYRLWEVLQIEHKEELEEMEAVIVRVGNRKYGIVVDDLLGQDDIVIKSL
GKVFSEVKEFSGAAILGDGSIALIINVSGIV
>O87125 3.1.1.61~~~cheB1~~~Protein-glutamate methylesterase/protein-glutamine glutaminase 1~~~
MAVKVLVVDDSGFFRRRVSEILSADGQIQVVGTGTNGREAIEQALALRPDVITMDYEMPLMDGITAVRNIMQRCPTPVLM
FSSLTHEGARVTLDALDAGAVDYLPKNFEDISRNPDKVRQLLCEKVLTIARSNRRSISLPPLPSATSSSHAPASSSSVGA
SARVGAGASPAPASTSAAPKRKAYRLVAIGTSTGGPVALQRVLTQLPANFPAPLVLIQHMPAAFTKAFAERLDKLCRINV
KEAEDGDILRPGLALLAPGGKQMMVDGRGTVRILPGDERLNYKPCVDVTFGSAAKAYNDKVLAVVLTGMGADGREGARLL
KQGGSQVWAQDEASCVIYGMPMAVVKANLADAVYGLDDIGRHLVEACQ
>Q8KLS5 3.1.1.61~~~cheB2~~~Protein-glutamate methylesterase/protein-glutamine glutaminase of group 2 operon~~~
MKAAADRLMAARAREGRTRVLIVDDSAMVRQALALGLSTDPRLEVVGTASGAEAARAQMAALKPDVVTLDLEMPQMDGLT
FLRSYMESAPVPTVVISSLTRTSGETAMRAMEAGAVDIISKPSLGAGQGLPAIMRDVCARVWAAARARLALPDGAAPAPV
APGASEDWIHALGASTGGVQALSRILPFFPAQSPGLLVVQHMPEGFTAAFARRLDALCRMRVREAADGDLVLPGLVLIAP
GGLRHMEIERAGGVCRVRLVAGAPVSYSRPSVDRMFLSLAAAAGPRVSAALLTGMGRDGAAGLLAIRRAGGRTFAQDEGS
SAVFGMPLAARDLRAAEEILTLDDIPARMMLAAAADTRAPSLASND
>Q9I6V9 3.1.1.61~~~cheB2~~~Protein-glutamate methylesterase/protein-glutamine glutaminase 2~~~
MPISVLVVDDSALIRSLLKEIIQADPELRLVGCAPDAFVARDLIKQHAPDVISLDVEMPRMDGLTFLDKLMKARPTPVLM
ISSLTERGSEATLRALELGAVDFIAKPRLGIAEGMQAYAEEIRAKLKTVARARLRRRAADAPAPPESAAPLLSTEKIIAL
GASTGGTEALKEVLLGLPAHSPGVVITQHMPPGFTRSFAERLDRLTRLSVSEARDGDRILPGHALVAPGDHHMEVQRSGA
NYVVRLNRQAQVNGHRPAVDVMFESLARCAGRNLLAGLLTGMGKDGARGLLAIRQAGGYTLAQDEATCVVYGMPREAVEL
GAAEDVLPLERIAAALLQQAARRGSGNRL
>O33558 3.1.1.61~~~cheB3~~~Protein-glutamate methylesterase/protein-glutamine glutaminase of group 3 operon~~~
MTTHAAAPSTRVLIVDDSAAARAMFKVIVESDPALQVMAAVPDAFAAARAMRTELPDVILLDLELPSMDGLTFLRKIMQQ
HPIPVVVCSSHVGAGTEAMVSALELGAREVISKPAARNDLERQEASIRICDAIRAATETTRRRSQPEPRPLAPGPKLTAD
EILPARPPRPVPETMPVVCIGASTGGTEALRDVLTALPASAPPIVIVQHMPRGFTAAFARRLDSLCAIEVLEAEDEMQVM
PGRAIIAQGDRHLLLRRRNQGYRVSVLDGAYVCRHRPSVDVLFRSAAQEAGGNALGVIMTGMGDDGARCMAEMRAAGAET
IAQNEESCVVYGMPREAVAHGGVGKVEPLDRLAARIMEFGRRHTERTVR
>P07330 3.1.1.61~~~cheB~~~Protein-glutamate methylesterase/protein-glutamine glutaminase~~~COG2201
MSKIRVLSVDDSALMRQIMTEIINSHSDMEMVATAPDPLVARDLIKKFNPDVLTLDVEMPRMDGLDFLEKLMRLRPMPVV
MVSSLTGKGSEVTLRALELGAIDFVTKPQLGIREGMLAYNEMIAEKVRTAAKASLAAHKPLSAPTTLKAGPLLSSEKLIA
IGASTGGTEAIRHVLQPLPLSSPALLITQHMPPGFTRSFADRLNKLCQIGVKEAEDGERVLPGHAYIAPGDRHMELSRSG
ANYQIKIHDGPAVNRHRPSVDVLFHSVAKQAGRNAVGVILTGMGNDGAAGMLAMRQAGAWTLAQNEASCVVFGMPREAIN
MGGVCEVVDLSQVSQQMLAKISAGQAIRI
>Q6D6I7 3.1.1.61~~~cheB~~~Protein-glutamate methylesterase/protein-glutamine glutaminase~~~COG2201
MSKIRVLCVDDSALMRQIMTEIINSHPDMEVVATAPDPLVARDLIKKFNPQVLTLDVEMPRMDGLDFLEKLMRLRPMPVV
MVSSLTGKGSEITLRALELGAIDFVTKPQLGIREGMLAYSELIAEKIRMAAKARLPQRSTTAEPTKIIQHMPLLSSEKLI
AIGASTGGTEAIRHVLQPLPPTSPALLITQHMPPGFTKSFAERLNKLCQITVKEAEDGERVLPGHAYIAPGARHLELARS
GANYQVRLNDGPPVNRHRPSVDVLFRSVAQYAGRNAVGVILTGMGNDGAAGMLELHQAGAYTLAQNEASCVVFGMPREAI
AMGGVDEVVDLHQVSQRMLAQISAGQALRI
>P04042 3.1.1.61~~~cheB~~~Protein-glutamate methylesterase/protein-glutamine glutaminase~~~
MSKIRVLSVDDSALMRQIMTEIINSHSDMEMVATAPDPLVARDLIKKFNPDVLTLDVEMPRMDGLDFLEKLMRLRPMPVV
MVSSLTGKGSEVTLRALELGAIDFVTKPQLGIREGMLAYSEMIAEKVRTAARARIAAHKPMAAPTTLKAGPLLSSEKLIA
IGASTGGTEAIRHVLQPLPLSSPAVIITQHMPPGFTRSFAERLNKLCQISVKEAEDGERVLPGHAYIAPGDKHMELARSG
ANYQIKIHDGPPVNRHRPSVDVLFHSVAKHAGRNAVGVILTGMGNDGAAGMLAMYQAGAWTIAQNEASCVVFGMPREAIN
MGGVSEVVDLSQVSQQMLAKISAGQAIRI
>Q9WYN9 3.1.1.61~~~cheB~~~Protein-glutamate methylesterase/protein-glutamine glutaminase~~~COG2201
MTDRVIRVLVVDDSAFMRMVLKDIIDSQPDMKVVGFAKDGLEAVEKAIELKPDVITMDIEMPNLNGIEALKLIMKKAPTR
VIMVSSLTEEGAAITIEALRNGAVDFITKPHGSISLTFRQVAPELLEKIRQAMNVDPRTLLFKPKVSRLTITKPAVSGKI
VVIGSSTGGPRSLDMIIPNLPKNFPAPIVVVQHMPPGFTKSLAMRLDSTSELTVKEAEDGEEVKPGFVYIAPGDFHLGLK
AQNGKVFFFLDKSDKINNVRPAVDFTLDKAAEIYKSKTIAVILTGMGKDGTKGAFKVKFYGGTVIAEDKETCVVFGMPKS
VIEEGYADYVLPAYKIPEKLIELV
>P40403 3.-.-.-~~~cheC~~~CheY-P phosphatase CheC~~~COG1776
MSIFNGIKEEQMDILREVGNIGAGHSASAMAQLLNRKIDMEVPFAKLLSFDELVDFFGGADVPVASIFLRMEGDLTGSMF
FIMPFFQAEQFIRELIGNPDFDIEDLGEDHMSSSALHELGNILAGSYLTALADLTKLQLYPSVPEVSLDMFGAVISEGLM
ELSQVGEHAIVVDTSIFDQSHQQELKAHMFMLPDYDSFEKLFVALGASL
>Q9X006 3.-.-.-~~~cheC~~~CheY-P phosphatase CheC~~~COG1776
MKISERQKDLLKEIGNIGAGNAATAISYMINKKVEISVPNVEIVPISKVIFIAKDPEEIVVGVKMPVTGDIEGSVLLIMG
TTVVKKILEILTGRAPDNLLNLDEFSASALREIGNIMCGTYVSALADFLGFKIDTLPPQLVIDMISAIFAEASIEELEDN
SEDQIVFVETLLKVEEEEEPLTSYMMMIPKPGYLVKIFERMGIQE
>P40404 3.5.1.44~~~cheD~~~Chemoreceptor glutamine deamidase CheD~~~COG1871
MSTTEAVVIKVGIADVKIARFPDTIRTSGLGSCVGLVLYDKEKQTAGLVHVMLPDSTLSKTAELNRAKYADTAVQTTIDM
LIEAGCRKFALKAKLAGGSEMFKFKSTNDLMKIGPRNVLAIKEQLSLFNIPIISEDTGGSSGRTIEFEPKSCMLHIRTVK
QGEKTI
>Q9X005 3.5.1.44~~~cheD~~~Chemoreceptor glutamine deamidase CheD~~~COG1871
MKKVIGIGEYAVMKNPGVIVTLGLGSCVAVCMRDPVAKVGAMAHVMLPDSGGKTDKPGKYADTAVKTLVEELKKMGAKVE
RLEAKIAGGASMFESKGMNIGARNVEAVKKHLKDFGIKLLAEDTGGNRARSVEYNIETGKLLVRKVGGGEQLEIKEI
>A2PU44 2.4.2.31~~~~~~NAD(+)--arginine ADP-ribosyltransferase Chelt~~~
MKTIISLIFIMFPLFVSAHNGNFYRADSRSPNEIKDLGGLYPRGYYDFFERGTPMSISLYDHARGAPSGNTRYDDGFVST
TTDIDSAHEIGQNILSGYTEYYIYLIAPAPNLLDVNAVLGRYSPHPQENEYSALGGIPWTQVIGWYVVNNGVLDRNIHRN
RQFRADLFNNLSPALPSESYQFAGFEPEHPAWRQEPWINFAPPGCGRNVRLTKHINQQDCSNSQEELVYKKLQDLRTQFK
VDKKLKLVNKTSSNNIIFPNHDFIREWVDLDGNGDLSYCGFTVDSDGSRKRIVCAHNNGNFTYSSINISLSDYGWPKGQR
FIDANGDGLVDYCRVQYVWTHLYCSLSLPGQYFSLDKDAGYLDAGYNNSRAWAKVIGTNKYSFCRLTSNGYICTDIDSYS
TAFKDDDQGWADSRYWMDIDGNGGDDYCRLVYNWTHLRCNLQGKDGLWKRVESKYLDGGYPSLRFKIKMTSNKDNYCRIV
RNHRVMECAYVSDNGEFHNYSLNMPFSLYNKNDIQFIDIDGDNRDDICRYNSAPNTMECYLNQDKSFSQNKLVLYLSAKP
ISSLGSGSSKIIRTFNSEKNSSAYCYNAGYGTLRCDEFVIY
>O87131 2.1.1.80~~~cheR1~~~Chemotaxis protein methyltransferase 1~~~
MSAANADFELFRVFLEKTCGIVLGSNKQYLVSSRLNKLMEQQGIKSLGELVQRIQTQRGGLREMVVDAMTTNETLWFRDT
YPFEVLKQRVLPELIKANGGQRLRIWSAACSSGQEPYSLSMAIDEFEKTNLGQLKAGVQIVATDLSGSMLTAAKAGEYDT
LAMGRGLSPERLQRYFDAKGPGRWAVKPAIRSRVEFRALNLLDSYASLGKFDMVFCRNVLIYFSAEVKRDILLRIHGTLK
PGGYLFLGASEALNNLPDHYQMVQCSPGIIYRAK
>Q9I6V7 2.1.1.80~~~cheR2~~~Chemotaxis protein methyltransferase 2~~~
MPTSTPSPVFGNQEFHYTREDFQQVRERLYRLTGISLAESKAQLVYSRLSRRLRLLRLGSFAEYFTHLDREPGEQQLFVN
ALTTNLTAFFRERHHFPLLADLARRQLQRHRPLRIWSAAASTGEEPYSIAITLVEALGSFDPPVKIVASDIDTGVLDCAR
QGVYPLERLEQMPAPLKKRFFLRGTGPNAGKARVVEELRQLVEFRQINLLEADWSIAGELDAIFCRNVMIYFDKPTQTRL
LERMVALLRPEGLFFAGHSENFVHASHLVRSVGQTVYSPA
>Q88ER1 2.1.1.80~~~cheR2~~~Chemotaxis protein methyltransferase Cher2~~~COG1352
MSTGNLDFEQFRVFLEKACGILLGENKQYLVSSRLNKLMEQQGIKSLGELVQRIQAQPRGGLREQVVDAMTTNETLWFRD
TYPFEVLKNKVIPEFIRNNPGQRLRMWSAACSSGQEPYSISMAIDEFERSNLGQLKMGAQIVATDLSGTMLTNCKTGEYD
SLAIARGLSQERLQRYFDPKGPGRWAVKPAIRSRVEFRSFNLLDSYASLGKFDIVFCRNVLIYFSAQVKKDILLRIHSTL
KPGGYLFLGASEALNGLPDHYQMVQCSPGIIYQAK
>P31105 2.1.1.80~~~cheR~~~Chemotaxis protein methyltransferase~~~COG1352
MDTYSVFTTKWKQLTGVDLTLYKEAQMKRRLTSLYEKKGFQSFKDFAAALEKDQALLNETLDRMTINVSEFYRNYKRWEV
LETAILPLIKTSRPLKIWSAACSTGEEPYTLAMLLDQQKGLPGYQILATDIDEKALEKAKKGVYQERSLQEVPLSVKDRY
FTQNANRSYEVKTEIKKNITFKKHNLLADRYEQDFDLIVCRNVFIYFTESAKEELYLKMAHSLKKNGVLFVGSTEQIFNP
EKFGLVPADTFFYQKR
>P07801 2.1.1.80~~~cheR~~~Chemotaxis protein methyltransferase~~~
MTSSLPSGQTSVLLQMTQRLALSDAHFRRICQLIYQRAGIVLADHKRDMVYNRLVRRLRALGLDDFGRYLSMLEANQNSA
EWQAFINALTTNLTAFFREAHHFPILAEHARRRHGEYRVWSAAASTGEEPYSIAITLADALGMAPGRWKVFASDIDTEVL
EKARSGIYRLSELKTLSPQQLQRYFMRGTGPHEGLVRVRQELANYVEFSSVNLLEKQYNVPGPFDAIFCRNVMIYFDKTT
QEDILRRFVPLLKPDGLLFAGHSENFSNLVREFSLRGQTVYALSKDKA
>O25337 ~~~cheV2~~~Chemotaxis protein CheV2~~~COG0784
MVRDIDKTTSLHLNNEAQFLCFRLDAEKDAQLYGMNIFKIREIIHYDGEVTEILGGSDGVMLGFLSVRGESIPLVDVKRW
LHYNANDPSRDLKECSVKDDHNLVIVCHFSNHSIALKVLKIERIIHKNWTEISAGDKQGINEEGKLSAITRFDEERVVQI
LDVEKMISDVFPSLKDLDDLTLRCIEAIQSQKLILIAEDSLSALKTLEKIVQTLELRYLAFPNGRELLDYLYEKEHYQQV
GVVITDLEMPNISGFEVLKTIKADHRTEHLPVIINSSMSSDSNRQLAQSLEADGFVVKSNILEIHEMLKKTLS
>P37599 ~~~cheV~~~Chemotaxis protein CheV~~~COG0784
MSLQQYEILLDSGTNELEIVKFGVGENAFGINVMKVREIIQPVEVTSVPHSHQHVEGMIKLRGEILPVISLFSFFGVEPE
GSKDEKYIVTEFNKRKIVFHVGSVSQIHRVSWEAIEKPTSLNQGMERHLTGIIKLEDLMIFLPDYEKIIYDIESDSGVDT
YNMHTEGFDERRTDKKLIIVEDSPLLMRLLQDELKEAGYNNIASFENGKEAYEYIMNLAENETDLSKQIDMIITDIEMPK
MDGHRLTKLLKENPKSSDVPVMIFSSLITDDLRHRGEVVGADEQISKPEISDLIKKVDTYVIE
>P39802 ~~~cheW~~~Chemotaxis protein CheW~~~COG0835
MTAEIKTGEKMIVFMVNGKEYAISVTQVKSIEKWQKPTRVPGVEPYICGVINLRGVVTPVIDLRKRLNLPEYEITDETRI
IIIAYRDIEVGWIVDEANDVITVHESEIESAPEGVQKDTDVSIEQIVKQENRLLNIIDANAVLDKESSQSAVPDQA
>P0A964 ~~~cheW~~~Chemotaxis protein CheW~~~COG0835
MTGMTNVTKLASEPSGQEFLVFTLGDEEYGIDILKVQEIRGYDQVTRIANTPAFIKGVTNLRGVIVPIVDLRIKFSQVDV
DYNDNTVVIVLNLGQRVVGIVVDGVSDVLSLTAEQIRPAPEFAVTLSTEYLTGLGALGDRMLILVNIEKLLNSEEMALLD
SAASEVA
>O25152 ~~~cheW~~~Chemotaxis protein CheW~~~COG0835
MSNQLKDLFERQKEASAGSKQEDNEEVLQFIGFIIGDEEYAIPILNILEIVKPIGYTRVPETPNYVLGVFNLRGNVFPLI
SLRLKFGLKAEKQNKDTRYLVVRHNDQIAGFFIDRLTEAIRIKQTDIDPVPETLSDNNNLTYGIGKQNDRLVTILRVEEI
LKKDF
>P06110 ~~~cheW~~~Chemotaxis protein CheW~~~
MTGMSNVSKLAGEPSGQEFLVFTLGNEEYGIDILKVQEIRGYDQVTRIANTPAFIKGVTNLRGVIVPIVDLRVKFCEGDV
EYDDNTVVIVLNLGQRVVGIVVDGVSDVLSLTAEQIRPAPEFAVTLSTEYLTGLGALGERMLILVNIEKLLNSEEMALLD
IAASHVA
>Q56311 ~~~cheW~~~Chemotaxis protein CheW~~~COG0835
MKTLADALKEFEVLSFEIDEQALAFDVDNIEMVIEKSDITPVPKSRHFVEGVINLRGRIIPVVNLAKILGISFDEQKMKS
IIVARTKDVEVGFLVDRVLGVLRITENQLDLTNVSDKFGKKSKGLVKTDGRLIIYLDIDKIIEEITVKEGV
>Q9X1V3 3.-.-.-~~~cheX~~~CheY-P phosphatase CheX~~~COG1406
MDARIVNALIGSVYETIRDVLGIEPKTGKPSTVSHIEIPHSLVTVIGITGGIEGSLIYSFSSETALKVVSAMMGGMEYNQ
LDELALSAIGELGNMTAGKLAMKLEHLGKHVDITPPTVVSGRDLKIKSFGVILKLPISVFSEEDFDLHLSVKSGG
>P71403 ~~~cheY1~~~Chemotaxis protein CheY1~~~COG0745
MKLLVVDDSSTMRRIIKNTLSRLGYEDVLEAEHGVEAWEKLDANADTKVLITDWNMPEMNGLDLVKKVRSDSRFKEIPII
MITTEGGKAEVITALKAGVNNYIVKPFTPQVLKEKLEVVLGTND
>A0A0H3AMJ9 ~~~cheY-3~~~Chemotaxis protein CheY-3~~~COG0745
MEAILNKNMKILIVDDFSTMRRIVKNLLRDLGFNNTQEADDGLTALPMLKKGDFDFVVTDWNMPGMQGIDLLKNIRADEE
LKHLPVLMITAEAKREQIIEAAQAGVNGYIVKPFTAATLKEKLDKIFERL
>P24072 ~~~cheY~~~Chemotaxis protein CheY~~~COG2201
MAHRILIVDDAAFMRMMIKDILVKNGFEVVAEAENGAQAVEKYKEHSPDLVTMDITMPEMDGITALKEIKQIDAQARIIM
CSAMGQQSMVIDAIQAGAKDFIVKPFQADRVLEAINKTLN
>P0AE67 ~~~cheY~~~Chemotaxis protein CheY~~~COG0745
MADKELKFLVVDDFSTMRRIVRNLLKELGFNNVEEAEDGVDALNKLQAGGYGFVISDWNMPNMDGLELLKTIRADGAMSA
LPVLMVTAEAKKENIIAAAQAGASGYVVKPFTAATLEEKLNKIFEKLGM
>P0A2D5 ~~~cheY~~~Chemotaxis protein CheY~~~
MADKELKFLVVDDFSTMRRIVRNLLKELGFNNVEEAEDGVDALNKLQAGGFGFIISDWNMPNMDGLELLKTIRADSAMSA
LPVLMVTAEAKKENIIAAAQAGASGYVVKPFTAATLEEKLNKIFEKLGM
>Q56312 ~~~cheY~~~Chemotaxis protein CheY~~~COG2201
MGKRVLIVDDAAFMRMMLKDIITKAGYEVAGEATNGREAVEKYKELKPDIVTMDITMPEMNGIDAIKEIMKIDPNAKIIV
CSAMGQQAMVIEAIKAGAKDFIVKPFQPSRVVEALNKVSK
>Q9KQD5 ~~~~~~Chemotaxis protein CheY-3~~~COG0745
MEAILNKNMKILIVDDFSTMRRIVKNLLRDLGFNNTQEADDGLTALPMLKKGDFDFVVTDWNMPGMQGIDLLKNIRADEE
LKHLPVLMITAEAKREQIIEAAQAGVNGYIVKPFTAATLKEKLDKIFERL
>C6EBU6 3.1.3.-~~~cheZ~~~Protein phosphatase CheZ~~~COG3143
MMQPSIKPADEHSAGDIIARIGSLTRMLRDSLRELGLDQAIAEAAEAIPDARDRLYYVVQMTAQAAERALNSVEASQPHQ
DQMEKSAKALTQRWDDWFADPIDLADARELVTDTRQFLADVPAHTSFTNAQLLEIMMAQDFQDLTGQVIKRMMDVIQEIE
RQLLMVLLENIPEQESRPKRENQSLLNGPQVDTSKAGVVASQDQVDDLLDSLGF
>P0A9H9 3.1.3.-~~~cheZ~~~Protein phosphatase CheZ~~~COG3143
MMQPSIKPADEHSAGDIIARIGSLTRMLRDSLRELGLDQAIAEAAEAIPDARDRLYYVVQMTAQAAERALNSVEASQPHQ
DQMEKSAKALTQRWDDWFADPIDLADARELVTDTRQFLADVPAHTSFTNAQLLEIMMAQDFQDLTGQVIKRMMDVIQEIE
RQLLMVLLENIPEQESRPKRENQSLLNGPQVDTSKAGVVASQDQVDDLLDSLGF
>O24976 3.1.3.-~~~cheZ~~~Protein phosphatase CheZ~~~
MTQEELDALMNGGDLENLEALETKEETKEEAKEEAKEEAKEEAKEKEEIKEESSSQKMTVKKEDAEKYGKISPNEWPPPP
PTEEHKVVHQLDDVTRDSEVKATQIFDQLDLIGASAEKIAKMVKKIQEPLQKHQEIFDNLHGHFPHVESFKTALNEQQEI
LNALKSIEEEAANCSDSSMQAMDIMQFQDIHRQKIERVVNVMRALSQYMNSLFEGKIDDSKRVSSATFITGDDDKDLASA
DDIEALIASFGAK
>P07800 3.1.3.-~~~cheZ~~~Protein phosphatase CheZ~~~
MMQPSIKPADEGSAGDIIARIGSLTRMLRDSLRELGLDQAIAEAAEAIPDARDRLDYVVQMTAQAAERALNSVEASQPHQ
DAMEKEAKALTQRWDEWFDNPIELSDARELVTDTRQFLRDVPGHTSFTNAQLLDIMMAQDFQDLTGQVIKRMMDVIQEIE
RQLLMVLLENIPEQSARPKRENESLLNGPQVDTSKAGVVASQDQVDDLLDSLGF
>Q838S2 3.2.1.14~~~~~~Chitinase~~~COG3469
MKLKKIIPAFPLLSTVAVGLWLTPTQASADAADTMVDISGKKVLVGYWHNWASKGRDGYKQGTSASLNLSEVNQAYNVVP
VSFMKSDGTTRIPTFKPYNQTDTAFRQEVAQLNSQGRAVLLALGGADAHIQLVKGDEQAFANEIIRQVETYGFDGLDIDL
EQLAITAGDNQTVIPATLKIVKDHYRAQGKNFIITMAPEFPYLKPGAAYETYITSLNGYYDYIAPQLYNQGGDGVWVDEI
MTWVAQSNDALKYEFLYYMSDSLIHGTRGYLQIPNDKLVLGLPANRDAAGSGYVVEATPVAKTFDQLAKDGNPIRGLMTW
SANWDVGQDVNGKSYNNEFATRYSNLVK
>B6A876 3.2.1.14~~~chi1~~~Chitinase 1~~~
MEKEEKSNLIYDKDPGYVWDNKNECEGAAEETYQELNYEPSISADKLTWTPTRLAKTVFNTYEDDDDFNVLCYFTDWSQY
DPRIINKEIRDTGGRSADILRLNTPDGRPFKRLIYSFGGLIGDKKYSADGNASIAVRLGVATDPDDAIANHKGKTIPVDP
DGAVLASINCGFTKWEAGDANERYNQEKAKGLLGGFRLLHEADKELEFSLSIGGWSMSGLFSEIAKDEILRTNFVEGIKD
FFQRFPMFSHLDIDWEYPGSIGAGNPNSPDDGANFAILIQQITDAKISNLKGISIASSADPAKIDAANIPALMDAGVTGI
NLMTYDFFTLGDGKLSHHTNIYRDPSDVYSKYSIDDAVTHLIDEKKVDPKAIFIGYAGYTRNAKNATITTSIPSEEALKG
TYTDANQTLGSFEYSVLEWTDIICHYMDFEKGEGRNGYKLVHDKVAKADYLYSEATKVFISLDTPRSVRDKGRYVKDKGL
GGLFIWSGDQDNGILTNAAHEGLKRRIKNKVIDMTPFYLDSDEELPTYTEPAEPQCEACNIK
>B6A879 3.2.1.14~~~chi2~~~Chitinase 2~~~
MVNKYTYTSSKAMSDISDVIGEPLAAWDSQVGGRVFNVIFDGKVYTNTYWVERWQVPGIGSSDGNPHNAWKFVRAATADE
INKIGNPTTADVKPTENIPSPILVEDKYTEETYSRPDVNFKEDGSQGNLSYTATRVCAPMYNHYVGDKTKPKLSAYITDW
CQYDARLDGGGSKEEERGRGFDLATLMQNPATYDRLIFSFLGICGDIGNKSKKVQEVWDGWNAQAPSLGLPQIGKGHIVP
LDPYGDLGTARNVGLPPESADTSIESGTFLPYYQQNRAAGLLGGLRELQKKAHAMGHKLDLAFSIGGWSLSSYFSALAEN
PDERRVFVASVVDFFVRFPMFSCVDIDWEYPGGGGDEGNISSDKDGENYVLLIKELRSALDSRFGYSNRKEISIACSGVK
AKLKKSNIDQLVANGLDNIYLMSYDFFGTIWADYIGHHTNLYSPKDPGEQELFDLSAEAAIDYLHNELGIPMEKIHLGYA
NYGRSAVGGDLTTRQYTKNGPALGTMENGAPEFFDIVKNYMDAEHSLSMGKNGFVLMTDTNADADFLFSEAKGHFISLDT
PRTVKQKGEYAAKNKLGGVFSWSGDQDCGLLANAAREGLGYVADSNQETIDMGPLYNPGKEIYLKSISEIKSK
>A0A8G1A3Q5 3.2.1.14~~~~~~Chitinase Chi52~~~
MNQAVRFRPVITFALAFLLLITWFAPRADAAAQWQAGTAYKKGDLVTYQNKDYECIQAHTALTGWEPSVVPALWKYVGEG
SGGETPTPDTAPPSVPAGLTSSSITDTSVSLSWNASTDNVGVAGYEVYRNGVLVTSTSTTTAVVTGLTASTTYAFTVKAK
DAAGNISAASTSLSVTTSNGSSNPGPTGTKWLIGYWHNFDNGSTNIRLRNVSTAYDVINVSFAEPISHGSGTLAFTPYNA
TVAEFKSDIAYLQSQGKKVLLSMGGANGTIELTDATKRQQFEDSLKSIISTYGFNGLDIDLEGSSLSLNAGDTDFRSPTT
PKIVNLIQGVKAVKSHFGANFILTAAPETAYVQGGYLSYGGPWGAYLPVIHALRNELTLLHVQHYNTGSMVGLDGRSYAQ
GTADFHVAMAEMLLQGFHVGGSTGPFFSPLRPDQIAIGVPASQQAAGGGYTTPADLQKALNYLIKGVSYGGSYTLRQSTG
YAGIKGIMTWSINWDAYTNNQFSSAHRPFLNGLSTQTTKEVVY
>P20533 3.2.1.14~~~chiA1~~~Chitinase A1~~~
MINLNKHTAFKKTAKFFLGLSLLLSVIVPSFALQPATAEAADSYKIVGYYPSWAAYGRNYNVADIDPTKVTHINYAFADI
CWNGIHGNPDPSGPNPVTWTCQNEKSQTINVPNGTIVLGDPWIDTGKTFAGDTWDQPIAGNINQLNKLKQTNPNLKTIIS
VGGWTWSNRFSDVAATAATREVFANSAVDFLRKYNFDGVDLDWEYPVSGGLDGNSKRPEDKQNYTLLLSKIREKLDAAGA
VDGKKYLLTIASGASATYAANTELAKIAAIVDWINIMTYDFNGAWQKISAHNAPLNYDPAASAAGVPDANTFNVAAGAQG
HLDAGVPAAKLVLGVPFYGRGWDGCAQAGNGQYQTCTGGSSVGTWEAGSFDFYDLEANYINKNGYTRYWNDTAKVPYLYN
ASNKRFISYDDAESVGYKTAYIKSKGLGGAMFWELSGDRNKTLQNKLKADLPTGGTVPPVDTTAPSVPGNARSTGVTANS
VTLAWNASTDNVGVTGYNVYNGANLATSVTGTTATISGLTAGTSYTFTIKAKDAAGNLSAASNAVTVSTTAQPGGDTQAP
TAPTNLASTAQTTSSITLSWTASTDNVGVTGYDVYNGTALATTVTGTTATISGLAADTSYTFTVKAKDAAGNVSAASNAV
SVKTAAETTNPGVSAWQVNTAYTAGQLVTYNGKTYKCLQPHTSLAGWEPSNVPALWQLQ
>P13656 ~~~chiA~~~Probable bifunctional chitinase/lysozyme~~~COG3979
MKLNIFTKSMIGMGLVCSALPALAMEAWNNQQGGNKYQVIFDGKIYENAWWVSSTNCPGKAKANDATNPWRLKRTATAAE
ISQFGNTLSCEKSGSSSSSNSNTPASNTPANGGSATPAQGTVPSNSSVVAWNKQQGGQTWYVVFNGAVYKNAWWVASSNC
PGDAKSNDASNPWRYVRAATATEISETSNPQSCTSAPQPSPDVKPAPDVKPAPDVQPAPADKSNDNYAVVAWKGQEGSST
WYVIYNGGIYKNAWWVGAANCPGDAKENDASNPWRYVRAATATEISQYGNPGSCSVKPDNNGGAVTPVDPTPETPVTPTP
DNSEPSTPADSVNDYSLQAWSGQEGSEIYHVIFNGNVYKNAWWVGSKDCPRGTSAENSNNPWRLERTATAAELSQYGNPT
TCEIDNGGVIVADGFQASKAYSADSIVDYNDAHYKTSVDQDAWGFVPGGDNPWKKYEPAKAWSASTVYVKGDRVVVDGQA
YEALFWTQSDNPALVANQNATGSNSRPWKPLGKAQSYSNEELNNAPQFNPETLYASDTLIRFNGVNYISQSKVQKVSPSD
SNPWRVFVDWTGTKERVGTPKKAWPKHVYAPYVDFTLNTIPDLAALAKNHNVNHFTLAFVVSKDANTCLPTWGTAYGMQN
YAQYSKIKALREAGGDVMLSIGGANNAPLAASCKNVDDLMQHYYDIVDNLNLKVLDFDIEGTWVADQASIERRNLAVKKV
QDKWKSEGKDIAIWYTLPILPTGLTPEGMNVLSDAKAKGVELAGVNVMTMDYGNAICQSANTEGQNIHGKCATSAIANLH
SQLKGLHPNKSDAEIDAMMGTTPMVGVNDVQGEVFYLSDARLVMQDAQKRNLGMVGIWSIARDLPGGTNLSPEFHGLTKE
QAPKYAFSEIFAPFTKQ
>A5FB63 3.2.1.14~~~chiA~~~Chitinase ChiA~~~COG3325
MKHYYRLLFLLLFPLLASAQPAHGKKVVGYYAQWSIYARDFNVPKIDGSKLTHLNYSFYGTTYDPAHPENTKLKCLDTYA
DFEHMEGGIPWDAPVKGNFYDLMKLKQKYPHLKILISVGGWTKGQDLSPIAASPVARAALAADMANFIVTYPFIDGFDID
WEYPLSGGTDGTEIVNGMPVPPQKYSPDDNKNLVLLLKAMRQAMPNKLVTIAAGNNVRNVSKQYLGPNNRAQYGMTEDIS
TYCDYITYFGYDFGGNWYDKTCYNAPLYASGNPNDPLYGATQSESLDELTNQYLNVIGFPANKLIMGLPFYGKKFDNVAA
NSTNGLFVAAPRYIVPGCTNPQNPTGTWDGSGACEKSGSIEICDLVGNPVTNSHAYLDPNTMMVTPSAASAGWVRYFDNT
TKVPYLYNSTLKQFISYEDKQSMDLKVQYIKSRNLAGGMIWELSQDTRGSIPNSLLNQVDTSFGSVVPGTVSISGSVKNG
SALVTDVTVELRNASNAVIQTVVSANGNFAFNNLTSGQNYSLTALKATYTFTPVTLVNVTVNQTAVVINGTQPTYTVSGT
VLDGSTPVSGVTVTAVSGSTTLTAVSNASGVYSIAGLTAGLNFTVTAAKSGFSYAPASTVYNAIDSNKTLNFTQGAPVVN
YTVSGTVLNSTTPVSGVTVTASFTGGSYAAVTNASGTYSLSLPSGGNYTVTAALTGQTFTPASTVYSNLNANKTLNFTQD
VVVSTSKISGTVKNGTNPVAGAKVELVLPWTDNTHNWKSVIATTDAQGKYSFDNSVVDGYTQVLSLKLNSWQNGEVAYYP
NNLANFAVPANPTVYNFNTSSTAKSALAAAANLISGTVKNGTTPVAGAKVEIVLPWTDNTHNWKSVLATTDASGNYSFDN
SVVAGYTQILSLKLNGWENGDVTYYPNNLANFAVPTTPTIYNFNRQAVVATKPVVTITAPTASAIAINLGSAINFVASVG
LSAVDATTISSVVFSLDGQSLSTANSSGTYTAAWTPAANQFSLSHTLTVTATASNGTTDSKTYSFTLTCSGANCPNALPV
ITWNSPSNTTVYQNTFQVVPISVTAVDSDGTVSGVTITINGGTFNMTAGTNNTYTYNFTPSAYQDYPVVIKATDNKSGVT
TLNNTIKIATVSTNRFIPLPSKIILGYAHSWENAGAPFLYFSQMVGSKFNVVDYSFVETVNRDGYTPILTTNDTRYLTNG
VFNKQLLKNDIKSLRDSGVPVIVSIGGQNGHVVLDNVTQKNIFVNGLKAIIDEYQFDGVDIDFEGGSMNFNAGGLRDISY
AGISAYPRLKNVVDAFKELKAYYGPGFLLTAAPETQYVQGGYTTYTDTFGSFLPIIQNLRNELDLLAVQLYNTGGENGLD
GQYYGTAKKSNMVTALTDMVIKGYNIASTGMRFDGLPASKVLIALPACPSAAGSGYLTPTEGINAMHYLRTGTTFSGRTY
TMQPGGPYPSLRGLMTWSVNWDASSCGNSSELSKAYAAYFASQTAAKTLVLDDISAKSNATIAYFKNNALSVTNENEDIA
QVDVFNVLGQNLVSHRNVQNNKEVLLHNQSFSSKQLFLVVVTDKAGNKKSFKVMNFLN
>P32823 3.2.1.14~~~chiA~~~Chitinase A~~~
MKLNKITSYIGFALLSGGALAAPSTPTLDWQPQQYSFVEVNVDGLGSYKQLVKAKDVVDISIKWNAWSGSGGDNYKVYFD
DLLVNQGSLPAGTKSGVVQFPYTKSGRHQLYLELCEGTVCARSAGKEIVIADTDGAHLAPLPMNVDPNNRNNGTIPGRVT
GAYFVEWGIYGRNYDVTKIPAHNLSHILYGFIPICGPNESLKSIEIGNSWRALQTACADSQDYEVVIHDPWAAVQKSMPG
VDAKDPIRGVYSQLMALKQRYPDLKILPSVGGWTLSDPFHGFTNKANRDTFVASVKQFLKTWKFYDGVDIDWEFPGGDGP
NPDLGDPINDGPAYVALMQELRAMLDELEAETGRQYELTSAIGAGYDKIEDVDYQAAQQYMDYIFAMTYDFYGAWNNETG
HQTGIYCGSHLSTDECNGTGVDDNGVPRKGPAYTGDHAIQLLLQQGVQPSKLVMGVAMYGRGWEGVLDANAAIPGNPMTA
PGNGPLTGSTSEGVWEPGIMDYKAIAANAVGQGGSGVNGYEVGYDEQAQAAYVWNRSNGKLITYDSPRSVIAKGQYANTH
QLAGLFGWEIDADNGDILNAMYDGLTAGEIPNRAPTIGVSGPINVTSGQVVNVDAQASDLDNDPLTYSWVAAPGLALSAN
NTAAVAVTAPSVAQQTSYDLTVTVNDGALSTTKTIVVVVNPEGANAAPVVTPVSDISVNEGASATVNVSATDPEGAALSY
SWSVPAELSVANGSSATITAANVTADTTVPVTVTVSDGVNAVDTTFNVTIKDGAEYPTWDRSTVYVGGDRVIHNSNVFEA
KWWTQGEEPGTADVWKAVTN
>P07254 3.2.1.14~~~chiA~~~Chitinase A~~~
MRKFNKPLLALLIGSTLCSAAQAAAPGKPTIAWGNTKFAIVEVDQAATAYNNLVKVKNAADVSVSWNLWNGDAGTTAKIL
LNGKEAWSGPSTGSSGTANFKVNKGGRYQMQVALCNADGCTASDATEIVVADTDGSHLAPLKEPLLEKNKPYKQNSGKVV
GSYFVEWGVYGRNFTVDKIPAQNLTHLLYGFIPICGGNGINDSLKEIEGSFQALQRSCQGREDFKVSIHDPFAALQKAQK
GVTAWDDPYKGNFGQLMALKQAHPDLKILPSIGGWTLSDPFFFMGDKVKRDRFVGSVKEFLQTWKFFDGVDIDWEFPGGK
GANPNLGSPQDGETYVLLMKELRAMLDQLSAETGRKYELTSAISAGKDKIDKVAYNVAQNSMDHIFLMSYDFYGPFDLKN
LGHQTALNAPAWKPDTAYTTVNGVNALLAQGVKPGKVVVGTAMYGRGWTGVNGYQNNIPFTGTATGPVKGTWKNGIVDYR
QIAGQFMSGEWQYTYDATAEAPYVFKPSTGDLITFDDARSVQAKGKYVLDKQLGGLFSWEIDADNGDILNSMNASLGNSA
GVQ
>P11797 3.2.1.14~~~chiB~~~Chitinase B~~~
MSTRKAVIGYYFIPTNQINNYTETDTSVVPFPVSNITPAKAKQLTHINFSFLDINSNLECAWDPATNDAKARDVVNRLTA
LKAHNPSLRIMFSIGGWYYSNDLGVSHANYVNAVKTPAARTKFAQSCVRIMKDYGFDGVDIDWEYPQAAEVDGFIAALQE
IRTLLNQQTIADGRQALPYQLTIAGAGGAFFLSRYYSKLAQIVAPLDYINLMTYDLAGPWEKITNHQAALFGDAAGPTFY
NALREANLGWSWEELTRAFPSPFSLTVDAAVQQHLMMEGVPSAKIVMGVPFYGRAFKGVSGGNGGQYSSHSTPGEDPYPN
ADYWLVGCDECVRDKDPRIASYRQLEQMLQGNYGYQRLWNDKTKTPYLYHAQNGLFVTYDDAESFKYKAKYIKQQQLGGV
MFWHLGQDNRNGDLLAALDRYFNAADYDDSQLDMGTGLRYTGVGPGNLPIMTAPAYVPGTTYAQGALVSYQGYVWQTKWG
YITSAPGSDSAWLKVGRLA
>P27050 3.2.1.14~~~chiD~~~Chitinase D~~~
MNQAVRFRPVITFALAFILIITWFAPRADAAAQWQAGTAYKQGDLVTYLNKDYECIQPHTALTGWEPSNVPALWKYVGEG
TGGGTPTPDTTPPTVPAGLTSSLVTDTSVNLTWNASTDNVGVTGYEVYRNGTLVANTSTTTAVVTGLTAGTTYVFTVKAK
DAAGNLSAASTSLSVTTSTGSSNPGPSGSKWLIGYWHNFDNGSTNIKLRNVSTAYDVINVSFAEPISPGSGTLAFTPYNA
TVEEFKSDIAYLQSQGKKVLISMGGANGRIELTDATKKRQQFEDSLKSIISTYGFNGLDIDLEGSSLSLNAGDTDFRSPT
TPKIVNLINGVKALKSHFGANFVLTAAPETAYVQGGYLNYGGPWGAYLPVIHALRNDLTLLHVQHYNTGSMVGLDGRSYA
QGTADFHVAMAQMLLQGFNVGGSSGPFFSPLRPDQIAIGVPASQQAAGGGYTAPAELQKALNYLIKGVSYGGSYTLRQPA
GYVGLKGIMTWSINWDAYTNNQFSNAHRPFLNGLSTQKTEEVVY
>P96156 3.2.1.202~~~endo I~~~Chitodextrinase~~~
MRLHRAKVSKSVFTLSTLTASCLMAFNSYAAVDCSALAEWQSDTIYTGGDQVQYNGSAYQANYWTQNNDPEQFSGDYAQW
KLLDACTTDGGDDNQAPNATLTSPSASDVLTTGDVVTLAASASDNDGTIARVDFLVDGVVVAQASSAPYSATWTAVAGTH
QISAIAYDDKALASTASQVSVSVTDSTQPGNEAPTVDITLSASQVDVGDVVTLTANAADADGSVDKVDFYVAGSLVGTVA
STPYTLDYTTTRSGRWLCLRARLITSARQRIRPRRRLTVAAGPWSVPVVLMVCIKPKGQCAVLYGVREDGREKMGADHPR
RVIGYFTSWRAGDDDQTAYLVKDIPWEQLTHINYAFVSIGSDGKVNVGDVNDANNAAVGKEWDGVEIDPTLGFKGHFGAL
ATYKQKYGVKTLISIGGWAETGGHFDNDGNRVADGGFYTMTTNADGSINQQGIETFADSAVEMMRKYRFDGLDIDLRISN
IDGGTGNPDDTAFSESRRAYLMNSYHELMRVLREKLDVASAQDGVHYMLTIAAPSSAYLLRGMETMAVTQYLDYVNIMSY
DLHGAWNDHVGHNAALYDTGKDSELAQWNVYGTAQYGGIGYLNTDWAFHYFRGSMPAGRINIGVPYYTRGWQGVTGGDNG
LWGARLAKSKRVSNRYGEGEKNNCGYGATGLDNMWHDVNAAGDEMGAGSNPMWHAKNLEHGIWGSYLAVYGLDPTTAPLV
GTYARNYDSVAIAPWLWNAEKKVFLSTEDKQSIDVKADYVIDKEIGGIMFWELAGDYNCYVLDANGQRTSIDSTEQACES
GQGEYHMGNTMTKAIYDKFKAATPYGNTVATGAVPSETVDIAVSIGGFKVGDQNYPINPKVTFTNNTGVDIPGGTAFQFD
IPVSAPDNAKDQSGGGLSVIASGHTRADNIGGLDGTMHRVAFSLPAWKTLPAGDTYELDMVYYLPISGPANYSVNINGVD
YAFKFEQPDLPLADLSSGNGGGTGGGDTGGGTTEPGDVVEWVPGSTQVSDGTTVTYNGKCFVAQNSPGVWESPTQTNWFW
EEVTCP
>Q2FWV5 ~~~chp~~~Chemotaxis inhibitory protein~~~
MKKKLATTVLALSFLTAGISTHHHSAKAFTFEPFPTNEEIESNKKLLEKEKAYKESFKNSGLPTTLGKLDERLRNYLKKG
TKNSAQFEKMVILTENKGYYTVYLNTPLAEDRKNVELLGKMYKTYFFKKGESKSSYVINGPGKTNEYAY
>A6QIG7 ~~~chp~~~Chemotaxis inhibitory protein~~~
MKKKLATTVLALSFLTAGISTHHHSAKAFTFEPFPTNEEIESNKKMLEKEKAYKESFKNSGLPTTLGKLDERLRNYLKKG
TKNSAQFEKMVILTENKGYYTVYLNTPLAEDRKNVELLGKMYKTYFFKKGESKSSYVINGPGKTNEYAY
>P75733 ~~~chiP~~~Chitoporin~~~
MRTFSGKRSTLALAIAGVTAMSGFMAMPEARAEGFIDDSTLTGGIYYWQRERDRKDVTDGDKYKTNLSHSTWNANLDFQS
GYAADMFGLDIAAFTAIEMAENGDSSHPNEIAFSKSNKAYDEDWSGDKSGISLYKAAAKFKYGPVWARAGYIQPTGQTLL
APHWSFMPGTYQGAEAGANFDYGDAGALSFSYMWTNEYKAPWHLEMDEFYQNDKTTKVDYLHSFGAKYDFKNNFVLEAAF
GQAEGYIDQYFAKASYKFDIAGSPLTTSYQFYGTRDKVDDRSVNDLYDGTAWLQALTFGYRAADVVDLRLEGTWVKADGQ
QGYFLQRMTPTYASSNGRLDIWWDNRSDFNANGEKAVFFGAMYDLKNWNLPGFAIGASYVYAWDAKPATWQSNPDAYYDK
NRTIEESAYSLDAVYTIQDGRAKGTMFKLHFTEYDNHSDIPSWGGGYGNIFQDERDVKFMVIAPFTIF
>Q7CQY4 ~~~chiP~~~Chitoporin~~~
MRTFSGKRSTLALAIAGITAMSGWIVVPQAQASGFFDDSTLTGGIYYWQRERDRKDVTDGDKYKTNLSHATWNANLDFQS
GYAADMFGLDIAAFTAIEMAENGDSGHPNEIAFSKKNKGYDEDYSGDKSGISLYKAAAKFKYGPVWARAGYIQPTGQTLL
APHWSFMPGTYQGAEAGASFDYGDAGALSFSYMWTNEYKAPWHTEMDKFYQADKKTNVDYLHSIGAKYDFKNDLVLEAAF
GQSEGYVDQYFAKASYKFDLGGNPFTTSYQFYGARDKVDDRSVNDIYDGTAWLQALTFGYKVAEVVDLRLEGTWVKADGQ
QGYFLQRMTPTYASSNGRLDIWWDNRSDFNANGEKAVFFGAMYDLKNWNLPGWAVGASYVYAWDAKPATWQSNPDAYYDK
NRTIEESSYSLDAVYTLQEGRAKGTMFKLHFTEYDNHSNIPSWGGGYGNIFQDERDVKFIVIAPFTIF
>Q9KK91 ~~~chiP~~~Chitoporin~~~
MDKMFKRTVIGAAVALASTGLMAKEVGVNSDFNVDVYGVAAMSVVNYNTTDNRDDSSGYVLENESRIGFRAHKEMFENFT
VTMQIESGYVDSTDWGHGGVSGGVLGFRDTYIGASGDWGNVRVGRVLTPLYEIVDWPFSNPGLGAAFDWGGINAHYDRQS
NQIRYDSPKFGGFSFAASVGRDDNDNGGGAATRDANFFGANARYSFEKITLLGAVESGTRVVAETGGDWELDNTGTAVQN
PVVAGYDDDTFAYLVGFEASLPAGFGLAAAFKGEELDNGIRKHKQDSFSIVGQYWNGPLGIKIGYAANLDSKIDGVKQDD
ANNILSGQVMGVINGFVPYVRVAARSDFTSDKDTDIVTRVGLEYGF
>P75734 ~~~chiQ~~~Uncharacterized lipoprotein ChiQ~~~
MKKLILIAIMASGLVACAQSTAPQEDSRLKEAYSACINTAQGSPEKIEACQSVLNVLKKEKQHQQFADQESVRVLDYQQC
LRATQTGNDQAVKADCDKVWQEIRSNNK
>Q8ZQX4 ~~~chiQ~~~Uncharacterized lipoprotein ChiQ~~~
MKKILLIASMTAGLTACASSPAPEEDSRLKEAYSACINTAQGSPEKIEACQSVLNVLKKERRHQQFANEESVRVLDYQQC
IQATRTGNDQAVKADCDKVWQEIRSHNNVQ
>O07921 3.2.1.132~~~csn~~~Chitosanase~~~COG3409
MKISMQKADFWKKAAISLLVFTMFFTLMMSETVFAAGLNKDQKRRAEQLTSIFENGTTEIQYGYVERLDDGRGYTCGRAG
FTTATGDALEVVEVYTKAVPNNKLKKYLPELRRLAKEESDDTSNLKGFASAWKSLANDKEFRAAQDKVNDHLYYQPAMKR
SDNAGLKTALARAVMYDTVIQHGDGDDPDSFYALIKRTNKKAGGSPKDGIDEKKWLNKFLDVRYDDLMNPANHDTRDEWR
ESVARVDVLRSIAKENNYNLNGPIHVRSNEYGNFVIK
>P33673 3.2.1.132~~~csn~~~Chitosanase~~~
MHMSNARPSKSRTKFLLAFLCFTLMASLFGATALFGPSKAAAASPDDNFSPETLQFLRNNTGLDGEQWNNIMKLINKPEQ
DDLNWIKYYGYCEDIEDERGYTIGLFGATTGGSRDTHPDGPDLFKAYDAAKGASNPSADGALKRLGINGKMKGSILEIKD
SEKVFCGKIKKLQNDAAWRKAMWETFYNVYIRYSVEQARQRGFTSAVTIGSFVDTALNQGATGGSDTLQGLLARSGSSSN
EKTFMKNFHAKRTLVVDTNKYNKPPNGKNRVKQWDTLVDMGKMNLKNVDSEIAQVTDWEMK
>P33665 3.2.1.132~~~csn~~~Chitosanase~~~
MHSQHRTARIALAVVLTAIPASLATAGVGYASTQASTAVKAGAGLDDPHKKEIAMELVSSAENSSLDWKAQYKYIEDIGD
GRGYTGGIIGFCSGTGDMLELVQHYTDLEPGNILAKYLPALKKVNGSASHSGLGTPFTKDWATAAKDTVFQQAQNDERDR
VYFDPAVSQAKADGLRALGQFAYYDAIVMHGPGNDPTSFGGIRKTAMKKARTPAQGGDETTYLNAFLDARKAAMLTEAAH
DDTSRVDTEQRVFLKAGNLDLNPPLKWKTYGDPYVINS
>P14529 3.2.1.14~~~chiA2~~~Chitinase~~~COG3469
MGKLRKNLLAWAGTGVAAACAVTMTAVPALSTPEPEAAGVVSASPYLYNGWGNPPSPTEVMNASGIKNFTLAFILADGTC
NPAWDGNRPLDGQDKATIDAIRGAGGDVIPSIGGYSGSKLGEVCQDSQSLAGAYQKVIDAYGLKAIDVDIEATEFENDAS
QTRVLEALKIVKEANPGLRTVVTFPTLVNGPNDVGKRMIDKAARIGSDVDVWTQMPFNFGGGDMAADTITSTEGLVAHLK
SAFGYDDATAYAHAGISSMNGKSDTGETVDQAAFQKMADYAGEKGLGRLSFWSVNRDRPCDGAPDACGGIDQQPWDFTKI
VAGLQS
>P36909 3.2.1.14~~~chiC~~~Chitinase C~~~
MRFRHKAAALAATLALPLAGLVGLASPAQAATSATATFAKTSDWGTGFGGSWTVKNTGTTSLSSWTVEWDFPTGTKVTSA
WDATVTNSGDHWTAKNVGWNGTLAPGASVSFGFNGSGPGSPSNCKLNGGSCDGTSVPGDAAPSAPGTPTASNITDTSVKL
SWSAATDDKGVKNYDVLRDGAKVATVTGTTYTDNGLTKGTAYSYSVKARDTADQTGPASGAVKVTTTGGGDGGNPGTGAE
VKMGYFTNWGVYGRNYHVKNLVTSGSADKITHINYAFGNVQGGKCTIGDSYADYDKAYTADQSVDGVADTWDQPLRGNFN
QLRKLKAKYPNIKILYSFGGWTWSGGFPDAVKNPAAFAKSCHDLVEDPRWADVFDGIDLDWEYPNACGLSCDETSAPNAF
SSMMKAMRAEFGQDYLITAAVTADGSDGGKIDAADYGEASKYIDWYNVMTYDFFGAWAKNGPTAPHSPLTAYDGIPQQGF
NTADAMAKFKSKGVPADKLLIGIGFYGRGWTGVTQSAPGGTATGPATGTYEAGIEDYKVLKNSCPATGTIAGTAYAHCGS
NWWSYDTPATIKSKMDWAEQQGLGGAFFWEFSGDTANGDWWRHRQRPQVTPAVRTTRRH
>P11220 3.2.1.14~~~chtA~~~Chitinase 63~~~
MRFRHKAAALAATLALPLAGLVGLASPAQAATSATATFQKTSDWGTGFGGKWTVKNTGTTSLSSWTVEWDFPSGTKVTSA
WDATVTNSADHWTAKNVGWNGTLAPGASVSFGFNGSGPGSPSGCKINGGSCDGSSVPGDEAPSAPGTPTASNITDTSVKL
SWSAATDDKGVKNYDVLRDGATVATVTGTTYTDNGLTKGTDYSYSVKARDTGDQTGPASGSVKVTTTGGDGGEPNPNPGA
EVKMGYFTNWGVYGRNYHVKNLVTSGSAEKITHINLRFGNVQGGKCTIGDAYADYDKAYTADQSVDGVADTWDQPLRANF
NQLRNLKAEYPHIKILYSFGGWTWSGGFPDAVKNPAAFAKSCHDLVEDPRWADVFDGIDLDWEYPNACGLSCDETSAPNA
FSSMMKAMRAEFGQDYLITAAVTADGSDGGKIDAADYGEASKYIDWYNVMTYDFFGAWAKNGPTAPHSPLNAYDGIPQQG
FTTADAMAKFKSKGVPADKLLIGIGFYGRGWTGVTQSAPGGTATGPAAGTYEAGIEDYKVLKNSCPATGTVAGTAYAHCG
TNWWSYDTPATIKSKMDWAEQQGLGGAFFWEFSGDTTNGELVSAIDSGLK
>Q05638 3.2.1.14~~~~~~Exochitinase 1~~~
MDRFRPLAVLIAAALTLSGTTALSSAARAADADLARNGGFEAGLDGWTCTAGTTVNSPVRSGSSALKATPAGSDNARCAQ
TVTVQPNSQYTLSGHVQGSYVYLGASGTGTTDVSTWTQSAPDWRQLTTTFRTGPSTTRVTLYTHGWYGTGAYHADDISLV
GPGGGTEQPPAPPTGLRTGSVTATSVALSWSPVTGATGYAVYRDGVKVATASGTSATVTGLTPDTAYAFQVAAVNGAGES
AKSATVTATTAPGTGGGSADLPPHALVGYLHASFANGSGYTRLADVPDSWDVIDLAFGEPTSVTSGDIRFDRCPATECPN
VESDAEFKAAIKAKQAAGKKVLISIGGQNGQVQLTTTAARDTFVSSVSKIIDEYGLDGLDIDFEGHSLSLNADDTDFKNP
KTPVIVNLIQALKTLKAKYGDDFVLTMAPETFFVQLGYQYYGTGKWGGQDPRAGAYLPVIHALRDDLTLLHVQDYNSGPI
MGLDNQYHSMGGADFHIAMTDMLLTGFPVAGDAANVFPPLRADQVAIGMPATTNAGNGHVAPAEAVKTLDCLTRKTNCGS
YATHGTWPALRALMTWSINWDRFGGWEFQRTFDGYFG
>I6YA32 3.4.-.-~~~chiZ~~~Cell wall hydrolase ChiZ~~~COG1388
MTPVRPPHTPDPLNLRGPLDGPRWRRAEPAQSRRPGRSRPGGAPLRYHRTGVGMSRTGHGSRPVPPATTVGLALLAAAIT
LWLGLVAQFGQMITGGSADGSADSTGRVPDRLAVVRVETGESLYDVAVRVAPNAPTRQVADRIRELNGLQTPALAVGQTL
IAPVG
>P95463 1.3.7.7~~~chlB~~~Light-independent protochlorophyllide reductase subunit B~~~
MKLAYWMYAGPAHIGTLRIASSFKNVHAIMHAPLGDDYFNVMRSMLERERDYTPVTTSVVDRHVLARGSQEKVVDNITRK
DAEEHPDLIVLTPTCTSSILQEDLQNFVERAQLEAKGDVMLADVNHYRVNELQAADRTLQQIVQFYIAKARKQGNLVTEK
TEKPSVNIFGMTTLGFHNNHDATELKKLMSDLGIEVNAIVPAGASVHELKSLPRAWFNLVPYRETGLLAAEFLQQEFNMP
YVDITPIGIVETARCIRKIQQVINAQGANVDYEPFIDQQTRFVSQAAWFSRSIDCQNLTGKKAVVFGDNTHAAALTKILA
REMGIHVLLAGTYCKYDEAWFREQVSEYCDDVLVSDDNGQIADAIARLEPAAIFGTQMERHVGKRLDIPCGVIAAPIHIQ
NFPVGYKPFVGYEGSNQIVDLIYNSFTLGMEDHLLEIFGGHDTKEVITKSVSADSDLNWSKDGLAELNRIPGFVRGKVKR
NTEKFARDRNITQITAEVLYAAKEAVGA
>Q7VD38 1.3.7.7~~~chlB~~~Light-independent protochlorophyllide reductase subunit B~~~COG2710
MELTLWTYEGPPHIGAMRIATSMKGLHYVLHAPQGDTYADLLFTMIERRGSRPPVTYTTFQARDLGGDTAELVKGHIFEA
VERFKPEALLVGESCTAELIQDQPGSLAKGMGLNIPIVSLELPAYSKKENWGASETFYQLIRGLLKEISEDSSNNAKQSW
QEEGRRPRVNLLGPSLLGFRCRDDVLEIQKILGENGIDINVIAPLGASPSDLMRLPKADANVCLYPEIAESTCLWLERNF
KTPFTKVVPIGVKATQDFLEELYELLGMEVSNSISNSDQSKLPWYSKSVDSNYLTGKRVFIFGDGTHVLAAARIANEELG
FEVVGIGTYSREMARKVRAAATELGLEALITNDYLEVEESIKECAPELVLGTQMERHSAKRLGIPCAVISTPMHVQDVPA
RYSPQMGWEGANVIFDDWVHPLMMGLEEHLIGMFRHDFEFTDGHQSHLGHLGGHASETKTSSKGINQSPNNHSPAGESIH
WTSEGESELAKIPFFVRGKVRRNTEKYARQAGCREIDGETLLDAKAHFGA
>Q8DGC6 1.3.7.7~~~chlB~~~Light-independent protochlorophyllide reductase subunit B~~~COG2710
MKLAYWMYAGPAHIGTLRIASSFKNVHGIMHAPLGDDYFNVMRSMLERERDFTPVTASIVDRHVLARGSQEKVVDNIIRK
DTEEHPDLIVLTPTCTSSILQEDLQNFVRRASLSTTADVLLADVNHYRVNELQAADRTLEQIVQFYIDKARRQGTLGTSK
TPTPSVNIIGITTLGFHNQHDCRELKQLMADLGIQVNLVIPAAATVHDLQRLPQAWFNLVPYREIGGLTAQYLEREFGQP
SVRITPMGVVETARCIRAIQGVLNAQGAGVNYEAFIEQQTREVSQAAWFSRSIDCQNLTGKKAVVFGDNTHAAAMTKILS
REMGIHVVWAGTYCKYDADWFRAEVAGFCDEVLITDDHTVVGDAIARVEPAAIFGTQMERHVGKRLNIPCGVIAAPIHIQ
DFPVGYRPFLGYEGTNQLVDLIYNSFTLGMEDHLLEIFGGHDTKAVIHKGLSADSDLTWTAAGLAELNKIPGFVRGKVKR
NTEKFAREQGISEITVEVLYAAKEAVGA
>P9WJX7 ~~~~~~Chloramphenicol efflux pump Rv0191~~~COG2814
MTAPTGTSATTTRPWTPRIATQLSVLACAAFIYVTAEILPVGALSAIARNLRVSVVLVGTLLSWYALVAAVTTVPLVRWT
AHWPRRRALVVSLVCLTVSQLVSALAPNFAVLAAGRVLCAVTHGLLWAVIAPIATRLVPPSHAGRATTSIYIGTSLALVV
GSPLTAAMSLMWGWRLAAVCVTGAAAAVALAARLALPEMVLRADQLEHVGRRARHHRNPRLVKVSVLTMIAVTGHFVSYT
YIVVIIRDVVGVRGPNLAWLLAAYGVAGLVSVPLVARPLDRWPKGAVIVGMTGLTAAFTLLTALAFGERHTAATALLGTG
AIVLWGALATAVSPMLQSAAMRSGGDDPDGASGLYVTAFQIGIMAGALLGGLLYERSLAMMLTASAGLMGVALFGMTVSQ
HLFENPTLSPGDG
>P9WMD3 ~~~~~~HTH-type transcriptional repressor Rv1353c~~~COG1309
MQTTPGKRQRRQRGSINPEDIISGAFELAQQVSIDNLSMPLLGKHLGVGVTSIYWYFRKKDDLLNAMTDRALSKYVFATP
YIEAGDWRETLRNHARSMRKTFADNPVLCDLILIRAALSPKTARLGAQEMEKAIANLVTAGLSLEDAFDIYSAVSVHVRG
SVVLDRLSRKSQSAGSGPSAIEHPVAIDPATTPLLAHATGRGHRIGAPDETNFEYGLECILDHAGRLIEQSSKAAGEVAV
RRPTATADAPTPGARAKAVAR
>P0DOC9 1.-.-.-~~~chlF~~~Light-dependent chlorophyll f synthase~~~
MKLESDHVIATSDSSDYTSEPTANKLSKRRKKVNYWEKFCSWVTSTENRLYVGWFGVLMIPCVLTAATVFIIAIIAAPPV
DMDGIGVPISGSILSGNNIITAAVVPTSAAIGLHFYPIWEAASIDEWLYNGGPYQLIVLHFLIGIIAYQDREWELSYRLG
MRPWISLAFTAPVAASVSVLLIYPVGQGSLSAGMPLGISGTFHFMLQFQADHNILMSPLHQLGVIGVLGGAFAAAMHGSL
VTSTLIRSHNHSESESINKGYKLGQQHPTYNFRSAQVYLWHLIWQRVSFPNSRKLHFFLAALPVAGIWSAALGVDIAAFD
FDYLQFHQPELKSQGQIIHTWADTIDWASLGIKVLDERHIYDFPENLTAGEVVPWK
>B4WP19 ~~~chlF~~~Light-dependent chlorophyll f synthase~~~
MIQTGFGRTSALEGFEQPFDPAQAIDLESPLTSTDTSVENTTRNAGALWPSSQPLSPWERFCRWVTSTENRIYIGWFGML
AIPTLATAAIVFVLAIIAAPAVDMDGTGRMVSGSLLDGNNLITAAVVPTSAAIGLHFYPIWEAASLDEWLINGGPYQLIV
LHFIIGIISYQDREWELSYRLKMRPWISLAFTAPVAASVSVLLVYPVGQGGFASGMPLGISGTFTFMMQFQADHNILASP
LHQMGVIGVLGGALLCAVHGSLVTSTVCRAPAQTMALTTTKTGTDRQKPKKAKTYSFEHAQAYQQTLLWRGAKFNSSRAV
HFCLAALPVAGIWSAAIGVDLAAFDFDRLSFELPSHISVRKTVVPTWSDVVNQANLGIHTVGEKTPPKFSESGFPEFKLS
EFVEPIAEDSASTLLSPHS
>P51634 6.6.1.1~~~chlI~~~Magnesium-chelatase subunit ChlI~~~COG1239
MTATLAAPSKTRRVVFPFTAIVGQDEMKLALLLNVIDPKIGGVMIMGDRGTGKSTTIRALADLLPEIEVVANDPFNSSPS
DPEMMSEEVRIRVDSQEPLSIVKKKVTMVDLPLGATEDRVCGTIDIEKALSEGVKAFEPGLLAKANRGILYVDEVNLLDD
HLVDVLLDSAAGGWNTVEREGISIRHPARFVLVGSGNPEEGELRPQLLDRFGMHAEIRTVREPELRVKIVEQRTEFDQNP
HPFCDQYQTEQEALQAKIVNAQNLLPQVTIDYDYRVKVSEVCAELDVDGLRGDIVTNRAAKALAAFEGRTEVTVDDISRV
IVLCLRHRLRKDPLESIDSGSKVEKVFKRVFGVVDEA
>Q7VD39 1.3.7.7~~~chlL~~~Light-independent protochlorophyllide reductase iron-sulfur ATP-binding protein~~~COG1348
MTTTLANRPDGEGSVQVKLDPKVNIEEGALVIAVYGKGGIGKSTTSSNLSAAFSKLGKKVLQIGCDPKHDSTFTLTHKMV
PTVIDILEEVDFHSEELRPQDFMFEGFNGVQCVESGGPPAGTGCGGYVTGQTVKLLKEHHLLEDTDVVIFDVLGDVVCGG
FAAPLQHANYCLIVTANDFDSIFAMNRIVAAINAKAKNYKVRLGGVIANRSAELDQIEKFNEKTGLKTMAHFRNVDAIRR
SRLKKCTIFEMDPEEEGVLEVQNEYLSLAKKMIDNVEPLEAEPLKDREIFDLLGFD
>Q55467 2.1.1.11~~~chlM~~~Magnesium-protoporphyrin O-methyltransferase~~~COG2227
MTNAALDDKTIVRDYFNSTGFDRWRRIYGDGQVNFVQKDIRVGHQQTVDSVVAWLVADGNLPGLLVCDAGCGVGSLSIPL
AQAGALVYGSDISEKMVGEAQQKAQEVLAYGNQPTFMTQDLAQLGGKYDTVICLDVLIHYPTEEASAMISHLASLADRRL
ILSFAPKTLGLTVLKKIGGLFPGPSKTTRAYQHKEADIRKILGDNGFSIARTGMTSTRFYYSRILEAVRS
>Q7VD37 1.3.7.7~~~chlN~~~Light-independent protochlorophyllide reductase subunit N~~~COG2710
MSGSTLLKETGPREVFCGLTSIVWLHRRMPDAFFLVVGSRTCAHLIQSAAGVMIFAEPRFGTAILEERDLAGLADAHEEL
DRVVKSLLKRRPEIRTLFLVGSCPSEVIKIDLSRAAERLSSQFNGQVRILNYSGSGIETTFTQGEDGALKALVPLMPSSQ
EEQLLLAGTLANPVEDRLKTIFNRLGIQKVESFPPRESTKLPAIGPGTKVLLAQPYLTDTARELKDRGAEILQAPFPLGV
EGSQLWIEAAANAFKIKKTLVDATLEPLITRAHKALKPYVEQLSGKKLFLLPESQLEIPLARFLSNECGMKLIEVGVPYL
NREMMGPELDLLPQNTRIVEGQHVEKQLDRVREHHPDLVVCGMGLANPLEAEGISTKWSIEMVFSPIHGIDQASDLAELF
ARPLHRQNLLNKKTLEAV
>Q8DGH2 1.3.7.7~~~chlN~~~Light-independent protochlorophyllide reductase subunit N~~~COG2710
MTVTAPNALNFECETGNYHTFCPISCVAWLYQKIEDSFFLVIGTKTCGYFLQNAMGVMIFAEPRYAMAELEEGDISAQLN
DYEELKRLCLEIKRDRNPSVIVWIGTCTTEIIKMDLEGLAPKLEAEIGIPIVVARANGLDYAFTQGEDTVLAAMAARCPT
STAISDPEERNPIQRLLNFGKKKEEVQAQSSQYHPHPPLVLFGSLPDPVVTQLTLELKKQGIKVSGWLPAKRYTELPVID
EGYYVAGVNPFLSRTATTLIRRRKCQLITAPFPIGPDGTRTWIEQICATFGIQPQGLAEREAETWQKLSDYLELVRGKSV
FFMGDNLLEISLARFLIRCGMRVLEIGIPYMDKRYQAAELALLSQTCAEMGHPLPTIVEKPDNYNQLQRIKALQPDLVIT
GMAHANPLEARGISTKWSVEFTFAQIHGFGNARDILELVTRPLRRNQALAGLGWQKLVAH
>Q5SFA6 1.1.1.364~~~chmD~~~dTDP-4-dehydro-6-deoxy-D-allose reductase~~~
MTADRWAGRTVLVTGALGFIGSHFVRQLEARGAEVLALYRTERPQLQAELAALDRVRLIRTELRDESDVRGAFKYLAPSI
DTVVHCAAMDGNAQFKLERSAEILDSNQRTISHLLNCVRDFGVGEAVVMSSSELYCAPPTAAAHEDDDFRRSMRYTDNGY
VLSKTYGEILARLHREQFGTNVFLVRPGNVYGPGDGYDPSRGRVIPSMLAKADAGEEIEIWGDGSQTRSFIHVTDLVRAS
LRLLETGKYPEMNVAGAEQVSILELARMVMAVLGRPERIRLDPGRPVGAPSRLLDLTRMSEVIDFEPQPLRTGLEETARW
FRHHTR
>Q5SFD1 5.1.3.27~~~chmJ~~~dTDP-4-dehydro-6-deoxyglucose 3-epimerase~~~
MHPLSIEGAWSQEPVIHSDHRGRSHEWFRGESFRQAFGHDFPVAQVNVAVSHRGALRGIHYTEIPPGQAKYSVCVRGAGL
DVVVDVRIGSPTFGRWEIVPMDAERNTAVYLTAGLGRAFLSLTDDATLVYLCSSGYAPAREHSVNPLDPDLGIAWPDDIE
PLLSDRDENAPTLATAERLGLLPTYQAWQEQQQAQR
>P12015 1.14.13.22~~~~~~Cyclohexanone 1,2-monooxygenase~~~
MSQKMDFDAIVIGGGFGGLYAVKKLRDELELKVQAFDKATDVAGTWYWNRYPGALTDTETHLYCYSWDKELLQSLEIKKK
YVQGPDVRKYLQQVAEKHDLKKSYQFNTAVQSAHYNEADALWEVTTEYGDKYTARFLITALGLLSAPNLPNIKGINQFKG
ELHHTSRWPDDVSFEGKRVGVIGTGSTGVQVITAVAPLAKHLTVFQRSAQYSVPIGNDPLSEEDVKKIKDNYDKSLGWCM
NSALAFALNESTVPAMSVSAEERKAVFEKAWQTGGGFRFMFETFGDIATNMEANIEAQNFIKGKIAEIVKDPAIAQKLMP
QDLYAKRPLCDSGYYNTFNRDNVRLEDVKANPIVEITENGVKLENGDFVELDMLICATGFDAVDGNYVRMDIQGKNGLAM
KDYWKEGPSSYMGVTVNNYPNMFMVLGPNGPFTNLPPSIESQVEWISDTIQYTVENNVESIEATKEAEEQWTQTCANIAE
MTLFPKAQSWIFGANIPGKKNTVYFYLGGLKEYRTCASNCKNHAYEGFDIQLQRSDIKQPANA
>P42517 5.4.99.5~~~aroQ~~~Monofunctional chorismate mutase~~~COG1605
MTHFVAIFFSSLFMCSNVFAGSVSSVSLGSLSSALNERMQVMKAVAGYKALHHLPIEDLPREQVVLDHMLQNAQQAGLEP
HSVEPFVHALMNASKTIQYRYRADWLSSPDSAVPVRDLTETRQQIQQLDTQLLTAISQRLMTGAFSQEDKEFLMSHLTAP
HLSESDKNSLFASLSRIQRQH
>A0R3N5 5.4.99.5~~~~~~Intracellular chorismate mutase~~~COG1605
MRPDHRMGPPHDEEPHMPETIDAVPEIDDLRREIDELDATIIAAIQRRTEVSKTIGKARMASGGTRLVHSREMKVIERYI
DALGPEGKDLAMLLLRLGRGRLGY
>P9WIC1 5.4.99.5~~~~~~Intracellular chorismate mutase~~~COG1605
MRPEPPHHENAELAAMNLEMLESQPVPEIDTLREEIDRLDAEILALVKRRAEVSKAIGKARMASGGTRLVHSREMKVIER
YSELGPDGKDLAILLLRLGRGRLGH
>Q9HU05 5.4.99.5~~~aroQ~~~Monofunctional chorismate mutase~~~
MRPSFASWGLLALLLLQGPLLQAQPLSPALQQLLSLSSQRLQLADQVAQSKAQSGKAVQDSPREEQQLQMLAGQAGSHGV
GAEQVRLLFAAQIEANKLVQYRLLSRPLPDAGQAVDLERIRSRLNQLNLELLRGYAPALAELRVDDCRPRLNQALQRQVR
VDRLDELHAIALSRAAGDLCHWAEL
>Q93LJ4 5.4.99.5~~~aroQ~~~Monofunctional chorismate mutase~~~COG1605
MIRHIAIFLCSLLMCSTTFADSVTSVSLGALLTALNERMLLMKDVAAYKMKHHLPIEDFTREQNVFAEAEEEAKNNGLDP
HSITPFIRSLMDASKAIQYRYLAQWRTGSEPSFPIQTLSVTRQRIRQLDNQMLIIISQRLMVGAFSHDDMVWLRAQFNAP
NLNESDISNVLAALSLVRRAR
>Q9F7E0 1.1.1.245~~~chnA~~~Cyclohexanol dehydrogenase~~~
MEKIMSNKFNNKVALITGAGSGIGKSTALLLAQQGVSVVVSDINLEAAQKVVDEIVALGGKAAANKANTAEPEDMKAAVE
FAVSTFGALHLAFNNAGILGEVNSTEELSIEGWRRVIDVNLNAVFYSMHYEVPAILAAGGGAIVNTASIAGLIGIQNISG
YVAAKHGVTGLTKAAALEYADKGIRINSVHPGYIKTPLIAEFEEAEMVKLHPIGRLGQPEEVAQVVAFLLSDDASFVTGS
QYVVDGAYTSK
>Q6TMA3 1.1.1.245~~~chnA~~~Cyclohexanol dehydrogenase~~~
MTDNLPLRGKVALVTGAARGIGRAYALRLAKRGADVAVVDFDLHSYKDYQLEAASMRGDTVVDEIREIGMRALGFQADVT
DATTLNEAVQQIVGEWGRLDIAICNAGGGVGSPEETRASIVEKDLVDVVVARNLTGTIHTCQAVAVPMKEQRSGKIVTVG
SQAGHRIEDNGGYAHYGAAKAAVAKYTQYLARDLGPFGVTVNCVAPGYISTGRLAPILSAMGDAQLLDDVPLGRYGTPED
CAGVIEFLSSDLSDYVTGAIIPVDGGLTYS
>P22637 1.1.3.6~~~choB~~~Cholesterol oxidase~~~
MTDSRANRADATRGVASVSRRRFLAGAGLTAGAIALSSMSTSASAAPSRTLADGDRVPALVIGSGYGGAVAALRLTQAGI
PTQIVEMGRSWDTPGSDGKIFCGMLNPDKRSMWLADKTDQPVSNFMGFGINKSIDRYVGVLDSERFSGIKVYQGRGVGGG
SLVNGGMAVTPKRNYFEEILPSVDSNEMYNKYFPRANTGLGVNNIDQAWFESTEWYKFARTGRKTAQRSGFTTAFVPNVY
DFEYMKKEAAGQVTKSGLGGEVIYGNNAGKKSLDKTYLAQAAATGKLTITTLHRVTKVAPATGSGYSVTMEQIDEQGNVV
ATKVVTADRVFFAAGSVGTSKLLVSMKAQGHLPNLSSQVGEGWGNNGNIMVGRANHMWDATGSKQATIPTMGIDNWADPT
APIFAEIAPLPAGLETYVSLYLAITKNPERARFQFNSGTGKVDLTWAQSQNQKGIDMAKKVFDKINQKEGTIYRTDLFGV
YFKTWGDDFTYHPLGGVLLNKATDNFGRLPEYPGLYVVDGSLVPGNVGVNPFVTITRLAERNMDKIISSDIQ
>P9WMV9 1.1.3.6~~~choD~~~Cholesterol oxidase~~~COG2303
MKPDYDVLIIGSGFGGSVTALRLTEKGYRVGVLEAGRRFSDEEFAKTSWDLRKFLWAPRLGCYGIQRIHPLRNVMILAGA
GVGGGSLNYANTLYVPPEPFFADQQWSHITDWRGELMPHYQQAQRMLGVVQNPTFTDADRIVKEVADEMGFGDTWVPTPV
GVFFGPDGTKTPGKTVPDPYFGGAGPARTGCLECGCCMTGCRHGAKNTLVKNYLGLAESAGAQVIPMTTVKGFERRSDGL
WEVRTVRTGSWLRRDRRTFTATQLVLAAGTWGTQHLLFKMRDRGRLPGLSKRLGVLTRTNSESIVGAATLKVNPDLDLTH
GVAITSSIHPTADTHIEPVRYGKGSNAMGLLQTLMTDGSGPQGTDVPRWRQLLQTASQDPRGTIRMLNPRQWSERTVIAL
VMQHLDNSITTFTKRGKLGIRWYSSKQGHGEPNPTWIPIGNQVTRRIAAKIDGVAGGTWGELFNIPLTAHFLGGAVIGDD
PEHGVIDPYHRVYGYPTLYVVDGAAISANLGVNPSLSIAAQAERAASLWPNKGETDRRPPQGEPYRRLAPIQPAHPVVPA
DAPGALRWLPIDPVSNAG
>P12676 1.1.3.6~~~choA~~~Cholesterol oxidase~~~
MTAQQHLSRRRMLGMAAFGAAALAGGTTIAAPRAAAAAKSAADNGGYVPAVVIGTGYGAAVSALRLGEAGVQTLMLEMGQ
LWNQPGPDGNIFCGMLNPDKRSSWFKNRTEAPLGSFLWLDVVNRNIDPYAGVLDRVNYDQMSVYVGRGVGGGSLVNGGMA
VEPKRSYFEEILPRVDSSEMYDRYFPRANSMLRVNHIDTKWFEDTEWYKFARVSREQAGKAGLGTVFVPNVYDFGYMQRE
AAGEVPKSALATEVIYGNNHGKQSLDKTYLAAALGTGKVTIQTLHQVKTIRQTKDGGYALTVEQKDTDGKLLATKEISCR
YLFLGAGSLGSTELLVRARDTGTLPNLNSEVGAGWGPNGNIMTARANHMWNPTGAHQSSIPALGIDAWDNSDSSVFAEIA
PMPAGLETWVSLYLAITKNPQRGTFVYDAATDRAKLNWTRDQNAPAVNAAKALFDRINKANGTIYRYDLFGTQLKAFADD
FCYHPLGGCVLGKATDDYGRVAGYKNLYVTDGSLIPGSVGVNPFVTITALAERNVERIIKQDVTAS
>Q7X2H8 1.1.3.17~~~codA~~~Choline oxidase~~~
MHIDNIENLSDREFDYIVVGGGSAGAAVAARLSEDPAVSVALVEAGPDDRGVPEVLQLDRWMELLESGYDWDYPIEPQEN
GNSFMRHARAKVMGGCSSHNSCIAFWAPREDLDEWEAKYGATGWNAEAAWPLYKRLETNEDAGPDAPHHGDSGPVHLMNV
PPKDPTGVALLDACEQAGIPRAKFNTGTTVVNGANFFQINRRADGTRSSSSVSYIHPIVEQENFTLLTGLRARQLVFDAD
RRCTGVDIVDSAFGHTHRLTARNEVVLSTGAIDTPKLLMLSGIGPAAHLAEHGIEVLVDSPGVGEHLQDHPEGVVQFEAK
QPMVAESTQWWEIGIFTPTEDGLDRPDLMMHYGSVPFDMNTLRHGYPTTENGFSLTPNVTHARSRGTVRLRSRDFRDKPM
VDPRYFTDPEGHDMRVMVAGIRKAREIAAQPAMAEWTGRELSPGVEAQTDEELQDYIRKTHNTVYHPVGTVRMGAVEDEM
SPLDPELRVKGVTGLRVADASVMPEHVTVNPNITVMMIGERCADLIRSARAGETTTADAELSAALA
>P76213 3.1.25.-~~~cho~~~Excinuclease cho~~~COG0322
MVRRLTSPRLEFEAAAIYEYPEHLRSFLNDLPTRPGVYLFHGESDTMPLYIGKSVNIRSRVLSHLRTPDEAAMLRQSRRI
SWICTAGEIGALLLEARLIKEQQPLFNKRLRRNRQLCALQLNEKRVDVVYAKEVDFSRAPNLFGLFANRRAALQALQTIA
DEQKLCYGLLGLEPLSRGRACFRSALKRCAGACCGKESHEEHALRLRQSLERLRVVCWPWQGAVALKEQHPEMTQYHIIQ
NWLWLGAVNSLEEATTLIRTPAGFDHDGYKILCKPLLSGNYEITELDPANDQRAS
>O07801 2.3.1.284~~~chp1~~~SL1278 acyltransferase Chp1~~~COG5651
MKCPGVSDCVATVRHDNVFAIAAGLRWSAAVPPLHKGDAVTKLLVGAIAGGMLACAAILGDGIASADTALIVPGTAPSPY
GPLRSLYHFNPAMQPQIGANYYNPTATRHVVSYPGSFWPVTGLNSPTVGSSVSAGTNNLDAAIRSTDGPIFVAGLSQGTL
VLDREQARLANDPTAPPPGQLTFIKAGDPNNLLWRAFRPGTHVPIIDYTVPAPAESQYDTINIVGQYDIFSDPPNRPGNL
LADLNAIAAGGYYGHSATAFSDPARVAPRDITTTTNSLGATTTTYFIRTDQLPLVRALVDMAGLPPQAAGTVDAALRPII
DRAYQPGPAPAVNPRDLVQGIRGIPAIAPAIAIPIGSTTGASAATSTAAATAAATNALRGANVGPGANKALSMVRGLLPK
GKKH
>O50440 2.3.1.-~~~chp2~~~Diacyltrehalose acyltransferase Chp2~~~COG5651
MKRVIAGAFAVWLVGWAGGFGTAIAASEPAYPWAPGPPPSPSPVGDASTAKVVYALGGARMPGIPWYEYTNQAGSQYFPN
AKHDLIDYPAGAAFSWWPTMLLPPGSHQDNMTVGVAVKDGTNSLDNAIHHGTDPAAAVGLSQGSLVLDQEQARLANDPTA
PAPDKLQFTTFGDPTGRHAFGASFLARIFPPGSHIPIPFIEYTMPQQVDSQYDTNHVVTAYDGFSDFPDRPDNLLAVANA
AIGAAIAHTPIGFTGPGDVPPQNIRTTVNSRGATTTTYLVPVNHLPLTLPLRYLGMSDAEVDQIDSVLQPQIDAAYARND
NWFTRPVSVDPVRGLDPLTAPGSIVEGARGLLGSPAFGG
>Q8CJY7 ~~~chpA~~~Chaplin-A~~~
MVAAAAATGILSLCGSPALADSHADGAATNSPGAVSGNALQVPVDVPVNACGNTVDVIAALNPAFGNECENASDEKTDGH
GGGYGEDASSSSSSSTSASSSGSHADGATEGSPGVGSGNNAQVPVDVPVNLCGNTVDVIAALNPVFGNKCENDAEEPPGY
GEEEPPPPTTPPGYGEEEPPPPTHEEPPPPSGEEEPPPPSEEEHTPPAPQTEQPPALAETGSEGTLGAAAAGAVLIAGGA
ILYRRGRALSGR
>P33647 3.1.-.-~~~chpB~~~Endoribonuclease toxin ChpB~~~COG2337
MVKKSEFERGDIVLVGFDPASGHEQQGAGRPALVLSVQAFNQLGMTLVAPITQGGNFARYAGFSVPLHCEEGDVHGVVLV
NQVRMMDLHARLAKRIGLAADEVVEEALLRLQAVVE
>Q9X7U2 ~~~chpB~~~Chaplin-B~~~
MRRVTRNGVLAVAASGALAVTMPAYAAFASDGAGAEGSAAGSPGLISGNTVQLPVDVPVDVCGNTVNVVGLLNPAAGNGC
ADSGEPGASYQAAGASGGTSGSATEATSGGAAAEGSGKDSPGVLSGNGVQLPVHLPVNVSGNSVNVVGIGNPAVGNESTN
DSGDHPEPVRPPAEPEPSAPEEERAGPGPSAHAAPPREEVSLAHTGTDRTLPTLAGGAALVLGGTVLYRRFRPGSGD
>Q9AD93 ~~~chpC~~~Chaplin-C~~~
MRQATRKGLMTMAAATGVIAAAGGAAHADSGAHGTSSGSPGVLSGNTVQAPVHVPVNVCGNTVDVVGVLNPAMGNACANQ
GGGASGGHGGHGGHGGYGDSGGEGGSHGGSHAGGHATDSPGVGSGNHVEVPIDVPVNVCGNSIDVVGALNPTTGNDCGNG
GGGDHSTPPGDHETPPGEPHNPGNPGNPDTPDKPSGPDDETPGDSTDGNRPGAQTVDQPRGDAALAETGSDLPLGLALPV
GAGALLAGTVLYRKARASV
>Q9L1J9 ~~~chpD~~~Chaplin-D~~~
MKKSAAVVAGAIMALGMAAPAFADAGAEGAAVGSPGVLSGNVIQVPVHVPVNVCGNSINVVGLLNPAFGNKCEND
>O25089 ~~~chePep~~~Chemotaxis regulatory protein ChePep~~~
MKMILFNQNPMITKLLESVSKKLELPIENFNHYQELSARLKENQEWLLIADDECLEKLDQVDWLELKETISQNKNSVCMY
KKGNEAQPFLEGFEVKIKKPFLPTEMLKVLQKKLGSNASELEPSQNLDPTQEVLETNWDELENLGDLEALVQEEPNNEEQ
LLPTLNDQEEKEEVKEEEKEEVKEEEKEEVKEEEKEEVKETPQEEKKPKDDETQEGETLKDKEVSKELEAPQELEIPKEE
TQEQDPIKEETQENKEEKQEKTQDSPSAQELEAMQELVKEIQENSNGQENKEKTQESAEIPQDKEIQEVVTEKTQAQELE
VPKEKTQESAEALQETQAHELEKQEIAETPQDVEIPQSQDKEVQELEIPKEETQENTETPQDVETPQEKETQEDHYESIE
DIPEPVMAKAMGEELPFLNEAVAKIPNNENDTETPKESVTETSKNENNTETPQEKEESDKTSSPLELRLNLQDLLKSLNQ
ESLKSLLENKTLSIKITLEDKKPNA
>Q9X9Z2 ~~~chpE~~~Chaplin-E~~~
MKNLKKAAAVTMVAGGLIAAGAGMASATDGGAHAHGKAVGSPGVASGNLVQAPIHIPVNAVGNSVNVIGVLNPAFGNLGV
NH
>Q9KYG7 ~~~chpF~~~Chaplin-F~~~
MYNPKEHFSMSRIAKGLALTSVAAAAVAGTAGVAAADSGAQAAAAHSPGVLSGNVVQVPVHIPVNVCGNTIDVIGLLNPA
FGNECEND
>Q9KYH3 ~~~chpG~~~Chaplin-G~~~
MSRIAKAAGVALGTGAVVLSGTGMAMADAGAAGAAVGSPGVLSGNVVQVPVHVPVNLCGNTIDVIGLLNPAFGNACENGD
DDDKSGGYGG
>Q9AD92 ~~~chpH~~~Chaplin-H~~~
MLKKVVAAAAATGGLVLAGAGMAVADSGAQGAAVHSPGVLSGNVVQVPVHVPVNVCGNTISVIGLLNPAFGNVCINK
>P08365 ~~~chpS~~~Antitoxin ChpS~~~COG2336
MRITIKRWGNSAGMVIPNIVMKELNLQPGQSVEAQVSNNQLILTPISRRYSLDELLAQCDMNAAELSEQDVWGKSTPAGD
EIW
>Q2YQA5 2.7.99.-~~~chpT~~~Protein phosphotransferase ChpT~~~
MSLPVTLSALDLGALLCSRICHDIISPVGAINNGLELLEEGGADEDAMALIKSSARNASARLQFARIAFGAAGSAGVQID
TGDAQNVATEYFRNEKPEFTWEGARVLLPKNKVKLLLNMLLIGNGAIPRGGSLAVRLEGSDTDPRFVITVKGRMLRVPPK
FLELHSGAAPEEPIDAHSVQPYYTLLLAEEAGMKISIHATAEDIVFSAE
>Q5PXQ6 1.13.11.37~~~chqB~~~Hydroxyquinol 1,2-dioxygenase~~~COG3485
MSTPVSAEQQAREQDLVERVLRSFDATADPRLKQVMQALTRHLHAFLREVRLTEAEWETGIGFLTDAGHVTNERRQEFIL
LSDVLGASMQTIAMNNEAHGDATEATVFGPFFVEGSPRIESGGDIAGGAAGEPCWVEGTVTDTDGNPVPDARIEVWEADD
DGFYDVQYDDDRTAARAHLLSGPDGGYAFWAITPTPYPIPHDGPVGRMLAATGRSPMRASHLHFMVTAPGRRTLVTHIFV
EGDELLDRDSVFGVKDSLVKSFERQPAGAPTPGGREIDGPWSRVRFDIVLAPA
>P17551 ~~~chrA1~~~Chromate transport protein~~~
MNSPQPPDTTAAGSVHTAPTYTLRQLVMYFLRLGTLGFGGPVALAGYMHRDLVEAKQWITDADYKEGLALAQLAPGPLAA
QLAIYLGYVHYRIVGATLVGVAFVLPSFLMVLALGWAYVRFGGLTWMQSVFYGVGAAVIGIIAISAYKLTKKSVGNDKLL
WFIYLVLVAVTVITESEVAWLFLAAGVLVWFWRAPPKWLRQGKMNAFAATPLPAASGMMSTLDWPLLSQIGVFFAKAGAF
VFGSGLAIVPFLYGGVVTEYHWLNDKQFVDAVAVAMITPGPVVITVGFIGYLVAGLPGACVAAAATFLPCYLFTVLPAPY
FKKYGKLPAILAFVDGVTAAAIGAITGAVIVLAKRSIVDIPTALLALVTVALLLKFKKLSEPMIVAGAALIGLVAYPLLH
H
>P14285 ~~~chrA~~~Chromate transport protein~~~
MSVANEESYRPSKATDATTEAVPPPMSYPQLFARFLKFGLLAWGGPVAQIDMLRRELVDEERWISSKRFNKLLAVMQVLP
GPEAHEICVHLGIRAKGRLGGVLAGLGFMLPGFLLMFALSWLYFQIEFVGTALGAAFLGVQAAVIALIVRAVHRIGEHIL
LDRWLWVIAIVCALAAIGRVDFWITLPAGGLVYALLVLNHRASALLVTLAAVALAAAVALWAAPTAKLVEAVVQGQASVL
LIFASGLKAGLLTFGGAYTAIPFVRNDAVGRGWMTDGQFLDGLALSGVLPAPLIIFATFVGYVAGGPIGAVAMTVGVFLP
AFAFSLIFYDRLEAVVENKRLHAFLDGVAAGVVGLIGATTIDLAQVTAERVPSLTVGMSIFAAGLAFLYAWKNKLNVVVV
ILAAGLAGWLVFPNQG
>P17552 ~~~chrB1~~~Protein ChrB~~~
MNALPSSPETAWLLLVVSLPTSASTARMRFWRGIKALGATALRDGAYLLPNLPGLRAPLQTLATDAASEDGKVWMLSVQA
ADDQQEAEYRALFDRSTEYAEWMVELSSARSTLSDSDEAELLRVARRHGRGIDAIRKVDFFPNEASARAELQWRDFNAAI
DILLSPGEPHGVAGNIPRRDPTQYQGRQWATRQHLWVDRVACAWLIRRFIDPHATFLWLEDVRQCPDDALGFDFDGATFT
HIGDRVSFEVLLASFGLDEDKGLARLGQMIHVLDVGGTPVAEASGFEAVLAGARERLPNDDALLDEVGYVLDSLYTHFSS
PRKR
>P40685 ~~~chrR~~~Anti-sigma-E factor ChrR~~~COG3806
MTIRHHVSDALLTAYAAGTLSEAFSLVVATHLSLCDECRARAGALDAVGGSLMEETAPVALSEGSLASVMAQLDRQIQRP
APARRADPRAPAPLADYVGRRLEDVRWRTLGGGVRQAILPTGGEAIARLLWIPGGQAVPDHGHRGLELTLVLQGAFRDET
DRFGAGDIEIADQELEHTPVAERGLDCICLAATDAPLRFNSFLPKLVQPFFRI
>P0AGE6 1.6.5.2~~~chrR~~~Quinone reductase~~~COG0431
MSEKLQVVTLLGSLRKGSFNGMVARTLPKIAPASMEVNALPSIADIPLYDADVQQEEGFPATVEALAEQIRQADGVVIVT
PEYNYSVPGGLKNAIDWLSRLPDQPLAGKPVLIQTSSMGVIGGARCQYHLRQILVFLDAMVMNKPEFMGGVIQNKVDPQT
GEVIDQGTLDHLTGQLTAFGEFIQRVKI
>Q88FF8 1.6.5.2~~~chrR~~~Quinone reductase~~~COG0431
MSQVYSVAVVVGSLRKESYNRKVARALSELAPSSLALKIVEIGDLPLYNEDIEAEAPPETWKRFRDEIRRSDAVLFVTPE
YNRSVPGCLKNAIDVGSRPYGQSAWSGKPTAVVSVSPGAIGGFGANHAVRQSLVFLDMPCMQMPEAYLGGAASLFEDSGK
LNDKTRPFLQAFVDRFASWVKLNRAV
>Q93T20 1.6.5.2~~~chrR~~~Quinone reductase~~~COG0431
MSQVYSVAVVVGSLRKESYNRKVARALSELAPSSLALKIVEIGDLPLYNEDVEAEAPPEAWKRFREEIRRSDAVLFVTPE
HNRSVPGCLKNAIDVGSRPYGQSAWSGKPTAVVSVSPGAIGGFGANHAVRQSLVFLDMPCMQMPEAYIGGAASLFDDSGK
LNDKTRPFLQAFVDKFASWVKLNRAV
>Q0VZ70 ~~~cmdD~~~Chondramide synthase cmdD~~~
MLREGQGTVAHAPPRPLPHADVLPVSQAQRRLWFLCQLDGASVAYNMPFVTALDGHLDARALQRALDEIIRRHESLRTTF
RLQAEGPVQVIHPPAPLDLPLHDLRSLDEPARAAEIQRRIDRAAHQPFDIERGPLLRAQLLRQSETRHVLCLVIHHIVAD
GWSIGVFVREFEALYGAFSASRPSPLTEPPLQYADFSRWQEERFPPSAVERHLTYWKQKLSDVQPLQLPADHPRPAVESF
RGDHTIFRLDRGLTRGLHELAQCEGVTLFITLLSAFNVLLGRYSGQDDLAIASGTANRKHAELEGLIGFFVNTVVIRTDL
SGNPTFRTVLSRVLASVMEATEHEDLPFERVVEELKPERTASHNPLAQVALTLQSFASNRLTLPGLTTSPCDFRFRTSKL
DLMLLVTEVDGELEVVVEYNTDLFEDATIARMSAHLRTVMAAMVADPGARIGDISLLTTEERHRLLVDWNDTALACPEAE
GVHHAFEQNAARQPDAIAVVFDGDPISRITYGALNERANQLAHHLIQQGVGPDVVVGIHVERSITMIVALLAVLKAGGAY
LPLDPTYPQQRLAFILADAGAQVILTQEKWFDDLPPHTARVLDLDAIAPQLDANATSNPPLRATADHLAYIIYTSGSTGN
PKGVLIPRRDTWSVARALAETYALTPESRVLQFASLNFDGSVVEITMTLFSGAALHVAPQEKLLPGAPLNAFLQRHAITH
VQLAPSLLARLPPEGLEHVRTIMVAGEASSVGTVRGWLPGRRILNGYGPTETTVGAAMIAFTEADDAYLAKLDALPIGRP
FYNKRVYLLDARLQPVPVGVPGEIYVASPGLARGYINRPAATAEKFLPNPFSETPGERIYRTGDLARYLPDGNLVFLGRV
DNQVKLRGLRIELEEIESALKSHPHVGDAAVIVHEAPADQATSERDGKRLVAYVVPRRGWEPEGAQSDHIASWQTLHEQL
LDESQAPEDWSFNITGWKSSYTGEALPAAEMRLWVESTVERILAHGPKDVLEIGTGTGLLLARIAPRVRAYLATDFSLEA
IRYLETCKARAPELSNVTLLQRMADDFTGFSAGQFDTIVLNSVVQYFPTLDYLSAVIEGALRVLKPGGTLFLGDIRNLAL
LDAFHASVQTAKASGTLSRDELRYRVQQGVMNENELVIDPRFFTALSRKFPQITHVEVTPKRGLHRNELTLFRYDVALQV
GGTPKGAPTITWFDWREEGLTSDSLPPWLSDTLATSPDAGVGLRRVPNARLQPDLAILSWLATRAEASLDAWRARQHDVP
EGCAPEALWALETTWPGRVHLSWAAGHPDGSFDLVVTPPQAERRAPWSPAVDLTDEQLSAYVNHPLQAKVVRETLGQELR
RYLQDKLPAYMVPTVLIPLPALPLTSNGKLDRRALPAPDIERRSRASTYVAPRNAREETLVAIWSKVLGVDPIGVEDNFF
ELGGDSILSIQIVGQAKQAGFSLTSRQMFEHQTIAALAEVASASKSIQAEQGLVEGSIPLTPIQRWFFETHQETPDHFNQ
AILLKVSADVSASRLEQAFHHLFTHHDALRMRFSRTADGFEQVNLGPIEGVTVDVIDLAHLPAAEQTRALTEAATSLQQR
LSITSGPLSRIALIHLGAEQPARLLWILHHLVVDGVSWRILLDDLVTVLRQLEAGQPARFPPKTTSFKEWSERLHATAQQ
EQANTASSRAERDAWRSVPVPALPLDHPQGTNRKASAAQVQVALSVADTHALLHDAPRAYGTQVNDLLLTALALAFNAWT
GDATLALDLEGHGREEDLVGADLSRTVGWFTTMHPVALRLPGRELSLALRAVKEQLRAQPGRGIAYGLFRYASGEGSLAS
WPAPQVNFNYLGQLDAMTDTAPLLGFAPEEIGPSDGPTGDRTHLFQVNGMVKDGSLQFTWTYSRELHRPETVQKLAHDFA
ETARRLTQHCLAHESHPTPGDFPAVTLSQNQLDVVLDALGADRDNVAAIYPLTSLQEGLLFHSLSAVPAPVPALADEDDE
EDDELDEEFDAEVDEEDEDEEEEEDDDGENVYVTQLVFRIQGPLDAEKFRTAWQETVQRHPLLRSRFVWEGCERPLQVVL
RSADLRWEEDELEEDSWSSPLRVHARREQQAGMLLDEAPLFRLNLLRAEDTEHHLIWTSHHILLDGWSGPLILKDVFASY
DAQLLGESRTAADPPPYEAYVAWLKRQDGTASERFWRENLRGFSAPTPLVVDNEEPTGKQKHLHHRCKLSAETSQALKAL
AESFRVTLSTVYQAAWALLLHRYSGMSDVLFGVTVSGREADVPGIEEMVGLFIRTVPLRLHVDESQTLGAWLKEVQARQI
EQREHQYVSLVDIQRWSDVPGGTQLFDSMFVFENYPLDSALLEQSGLRLTVSTMASPTHYPLVIAVVPGRTVETLFDHDT
SRLSKHTVERLAAHWVELLTGMARRPDARIHTLPHLTSAEREKLLVTWNARPYVDEQRKYRGEEEPFGEELAAESTFLDL
FQHHVAQTPDALALVGPSLQSTDERPVSRTYRALSARVHLLARHLRGLGVGPEVTVGVCLDRSIELVIGMLAIFEAGGVY
LPLDPSQPLERLAYLVSDARPEVVLTQQRWNDRLPEQATRRVALDTAWAEIEAQPEVSHQHRTAGDNLAYVLYTSGSTGT
PKGVQVTVDNLSRLTPALITAFDVTPRSRVLQYSSLSFDGSISEVAMALGAGAALHLAPAHELVPGPPLQKLLATRAITH
VTLLPAALRWLSPRGLPALDVLIVTGEACPASLVRTWASGRRFVNAYGPTEITVAATAMECPVTMFQETEQPPPIGCPLQ
STEIYILDAHLRPVPVGVPGDLYIGGAKLTRGYIHRPALTAERYIPHPFSDRPGARLYVTGDIARYQLDGTIDFLGRRDN
QVKVRGYRIELGEVEAALNDHPGVREAVVVAQKDGAGDNRLVAYWAAKSTPPTTTEALRDALSKRLAAYMIPSVFVRMDA
LPLNATGKIDRQGLPPVDDTMLDREQFVAPRTATEETLTAIWSSTLGVARVGIRDDFFKLGGHSLLALNITTQIQKRFGH
VITVDSIFRAPTIAVLARVIDEALAPTGARRALSLVVPLRERGTKVPLFFAAGMGMHAHYLRPLAEHLGEDQPFYALQSP
AQGGEITDMATLVDTLIGAIQQIQPSGPYHLGGHSAGARIAFAVALELQRRGAEVPLVSIVDMRPPGRGATSDESAEWTQ
IGGLIGYVTMIKQAIGEGVLFVTPEELRKLDEAAAWQRTLDAFIAARWMPKDADVEQLQHLCAMNQNVVRVVRDHVPTDT
HQGKLLVFSAAFAMRNGRQVSTEGWQAFCANPVTTHEVPGDHMTMLREPDVRGLAIKLRREIDELALERTDEAPGLPTPP
EFPVVWEHPEDARMLWVHDVTHCREQMTPLDFCLRQQAMVEGSNLANLAYGVPFTGEIRLINTYVYQKIIPTTASPTELA
AAMKRAEASVAALLPDLGRWWTETLLPEIEAHLEALDPENNYDFVHRHTLVEALAEAHRRTARLWEIHFRLLQPVMLAIS
RFVDLCKDLSTDDDPIDPYALLVGFPNKTTEGNRALWSLSRLALETPEVASILTSNEASRVSWKLRSTRGGRAFVAQLDA
YLATYGQRNDSTYLDAPTWEEDPTPVIRNLQAYMTQPERDLDAELNALSEQRTQRLDALRARLRHYPRAVVDEFEQALTA
AQTATVLSEDHNYWIDYKITHRLRHLCLYLGEQLKDWELLGDCEEIFYLSMDDVSRAAVETKRGGPFSANQRFYHLACAR
KDEAKRFHGVQPPRFLGTPSPLPALHDALSLASARFTGVAPSPSNDEKEIVGLSGAKGKARGKARVARNLADVPTLEPGE
ILVAMAMLPAWTPLFATVAAIVTDSGGMLSHAAVVAREYGIPAVVGTQVGTQRIRDGQLVEVDGERGVVTLL
>O53547 1.1.1.-~~~chsB1~~~Hydroxyacyl-CoA dehydrogenase ChsB1~~~COG1028
MKLTESNRSPRTTNTTDLSGKVAVVTGAAAGLGRAEALGLARLGATVVVNDVASALDASDVVDEIGAAAADAGAKAVAVA
GDISQRATADELLASAVGLGGLDIVVNNAGITRDRMLFNMSDEEWDAVIAVHLRGHFLLTRNAAAYWRDKAKDAEGGSVF
GRLVNTSSEAGLVGPVGQANYAAAKAGITALTLSAARALGRYGVCANVICPRARTAMTADVFGAAPDVEAGQIDPLSPQH
VVSLVQFLASPAAAEVNGQVFIVYGPQVTLVSPPHMERRFSADGTSWDPTELTATLRDYFAGRDPEQSFSATDLMRQ
>P71857 1.3.99.-~~~~~~Acyl-CoA dehydrogenase FadE28~~~COG1960
MDFDPTAEQQAVADVVTSVLERDISWEALVCGGVTALPVPERLGGDGVGLFEVGALLTEVGRHGAVTPALATLGLGVVPL
LELASAEQQDRFLAGVAKGGVLTAALNEPGAALPDRPATSFVGGRLSGTKVGVGYAEQADWMLVTADNAVVVVSPTADGV
RMVRTPTSNGSDEYVMTMDGVAVADCDILADVAAHRVNQLALAVMGAYADGLVAGALRLTADYVANRKQFGKPLSTFQTV
AAQLAEVYIASRTIDLVAKSVIWRLAEDLDAGDDLGVLGYWVTSQAPPAMQICHHLHGGMGMDVTYPMHRYYSTIKDLTR
LLGGPSHRLELLGARCSLT
>P71858 1.3.99.-~~~~~~Acyl-CoA dehydrogenase FadE29~~~COG1960
MFIDLTPEQRQLQAEIRQYFSNLISPDERTEMEKDRHGPAYRAVIRRMGRDGRLGVGWPKEFGGLGFGPIEQQIFVNEAH
RADVPLPAVTLQTVGPTLQAHGSELQKKKFLPAILAGEAHFAIGYTEPEAGTDLASLRTTAVRDGDHYIVNGQKVFTTGA
HDADYIWLACRTDPNAAKHKGISILIVDTKDPGYSWTPIILADGAHHTNATYYNDVRVPVDMLVGKENDGWRLITTQLNN
ERVMLGPAGRFASIYDRVHAWASVPGGNGVTPIDHDDVKRALGEIRAIWRINELLNWQVASAGEDINMADAAATKVFGTE
RVQRAGRLAEEIVGKYGNPAEPDTAELLRWLDAQTKRNLVITFGGGVNEVMREMIAASGLKVPRVPR
>P96855 1.3.99.-~~~~~~Acyl-CoA dehydrogenase FadE34~~~COG1960
MVATVTDEQSAARELVRGWARTAASGAAATAAVRDMEYGFEEGNADAWRPVFAGLAGLGLFGVAVPEDCGGAGGSIEDLC
AMVDEAARALVPGPVATTAVATLVVSDPKLRSALASGERFAGVAIDGGVQVDPKTSTASGTVGRVLGGAPGGVVLLPADG
NWLLVDTACDEVVVEPLRATDFSLPLARMVLTSAPVTVLEVSGERVEDLAATVLAAEAAGVARWTLDTAVAYAKVREQFG
KPIGSFQAVKHLCAQMLCRAEQADVAAADAARAAADSDGTQLSIAAAVAASIGIDAAKANAKDCIQVLGGIGCTWEHDAH
LYLRRAHGIGGFLGGSGRWLRRVTALTQAGVRRRLGVDLAEVAGLRPEIAAAVAEVAALPEEKRQVALADTGLLAPHWPA
PYGRGASPAEQLLIDQELAAAKVERPDLVIGWWAAPTILEHGTPEQIERFVPATMRGEFLWCQLFSEPGAGSDLASLRTK
AVRADGGWLLTGQKVWTSAAHKARWGVCLARTDPDAPKHKGITYFLVDMTTPGIEIRPLREITGDSLFNEVFLDNVFVPD
EMVVGAVNDGWRLARTTLANERVAMATGTALGNPMEELLKVLGDMELDVAQQDRLGRLILLAQAGALLDRRIAELAVGGQ
DPGAQSSVRKLIGVRYRQALAEYLMEVSDGGGLVENRAVYDFLNTRCLTIAGGTEQILLTVAAERLLGLPR
>I6YCA3 1.3.99.-~~~~~~Acyl-CoA dehydrogenase FadE26~~~COG1960
MRISYTPQQEELRRELRSYFATLMTPERREALSSVQGEYGVGNVYRETIAQMGRDGWLALGWPKEYGGQGRSAMDQLIFT
DEAAIAGAPVPFLTINSVAPTIMAYGTDEQKRFFLPRIAAGDLHFSIGYSEPGAGTDLANLRTTAVRDGDDYVVNGQKMW
TSLIQYADYVWLAVRTNPESSGAKKHRGISVLIVPTTAEGFSWTPVHTMAGPDTSATYYSDVRVPVANRVGEENAGWKLV
TNQLNHERVALVSPAPIFGCLREVREWAQNTKDAGGTRLIDSEWVQLNLARVHAKAEVLKLINWELASSQSGPKDAGPSP
ADASAAKVFGTELATEAYRLLMEVLGTAATLRQNSPGALLRGRVERMHRACLILTFGGGTNEVQRDIIGMVALGLPRANR
>I6Y3Q0 1.3.99.-~~~~~~Acyl-CoA dehydrogenase FadE27~~~COG1960
MDFTTTEAAQDLGGLVDTIVDAVCTPEHQRELDKLEQRFDRELWRKLIDAGILSSAAPESLGGDGFGVLEQVAVLVALGH
QLAAVPYLESVVLAAGALARFGSPELQQGWGVSAVSGDRILTVALDGEMGEGPVQAAGTGHGYRLTGTRTQVGYGPVADA
FLVPAETDSGAAVFLVAAGDPGVAVTALATTGLGSVGHLELNGAKVDAARRVGGTDVAVWLGTLSTLSRTAFQLGVLERG
LQMTAEYARTREQFDRPIGSFQAVGQRLADGYIDVKGLRLTLTQAAWRVAEDSLASRECPQPADIDVATAGFWAAEAGHR
VAHTIVHVHGGVGVDTDHPVHRYFLAAKQTEFALGGATGQLRRIGRELAETPA
>I6XHI0 4.2.1.-~~~chsH1~~~3-oxo-4,17-pregnadiene-20-carboxyl-CoA hydratase beta subunit~~~COG2030
MTVVGAVLPELKLYGDPTFIVSTALATRDFQDVHHDRDKAVAQGSKDIFVNILTDTGLVQRYVTDWAGPSALIKSIGLRL
GVPWYAYDTVTFSGEVTAVNDGLITVKVVGRNTLGDHVTATVELSMRDS
>I6YGF8 4.2.1.-~~~chsH2~~~3-oxo-4,17-pregnadiene-20-carboxyl-CoA hydratase alpha subunit~~~COG1545
MTGVSDIQEAVAQIKAAGPSKPRLARDPVNQPMINNWVEAIGDRNPIYVDDAAARAAGHPGIVAPPAMIQVWTMMGLGGV
RPKDDPLGPIIKLFDDAGYIGVVATNCEQTYHRYLLPGEQVSISAELGDVVGPKQTALGEGWFINQHIVWQVGDEDVAEM
NWRILKFKPAGSPSSVPDDLDPDAMMRPSSSRDTAFFWDGVKAHELRIQRLADGSLRHPPVPAVWQDKSVPINYVVSSGR
GTVFSFVVHHAPKVPGRTVPFVIALVELEEGVRMLGELRGADPARVAIGMPVRATYIDFPDWSLYAWEPDE
>D1AB77 4.2.1.-~~~chsH2~~~Probable enoyl-CoA hydratase alpha subunit~~~COG1545
MSGEDYEKRLQAWVGRTLGEPRRGQDPVNVPMIRHWVEAMGDTNPVYLDEEAARATGRETVVAPASMMQAWTMRGYAATV
NPEPEAGGMEELTALLAEGGYTSVVATDSEFEFHRELVPGDHISVQEQVESISPEKKTALGEGRFITTLRTYRDQRGEVV
ATQRWRLLRFRPKKTEQTEQKPKALRPRPAINRDNAFWFEAAKQRRLVIQRCAACKTLRHPPGPCCPHCGSFDWDTVEAA
GTGQVYSYIVAHHPPHPAFEMPYVVALVELTEGTRLVTNLVGIAPDKIEIGMPVVLDWLEADPELTLPVFRPAVPQEES
>Q6MWW2 4.2.1.-~~~chsH3~~~Enoyl-CoA hydratase ChsH3~~~COG2030
MPIDLDVALGAQLPPVEFSWTSTDVQLYQLGLGAGSDPMNPRELSYLADDTPQVLPTFGNVAATFHLTTPPTVQFPGIDI
ELSKVLHASERVEVPAPLPPSGSARAVTRFTDIWDKGKAAVICSETTATTPDGLLLWTQKRSIYARGEGGFGGKRGPSGS
DVAPERAPDLQVAMPILPQQALLYRLCGDRNPLHSDPEFAAAAGFPRPILHGLCTYGMTCKAIVDALLDSDATAVAGYGA
RFAGVAYPGETLTVNVWKDGRRLVASVVAPTRDNAVVLSGVELVPA
>Q8L0V4 ~~~kfoC~~~Chondroitin synthase~~~
MSILNQAINLYKNKNYRQALSLFEKVAEIYDVSWVEANIKLCQTALNLSEEVDKLNRKAVIDIDAATKIMCSNAKAISLN
EVEKNEIISKYREITAKKSERAELKEVEPIPLDWPSDLTLPPLPESTNDYVWAGKRKELDDYPRKQLIIDGLSIVIPTYN
RAKILAITLACLCNQKTIYDYEVIVADDGSKENIEEIVREFESLLNIKYVRQKDYGYQLCAVRNLGLRAAKYNYVAILDC
DMAPNPLWVQSYMELLAVDDNVALIGPRKYIDTSKHTYLDFLSQKSLINEIPEIITNNQVAGKVEQNKSVDWRIEHFKNT
DNLRLCNTPFRFFSGGNVAFAKKWLFRAGWFDEEFTHWGGEDNEFGYRLYREGCYFRSVEGAMAYHQEPPGKENETDRAA
GKNITVQLLQQKVPYFYRKKEKIESATLKRVPLVSIYIPAYNCSKYIVRCVESALNQTITDLEVCICDDGSTDDTLRILQ
EHYANHPRVRFISQKNKGIGSASNTAVRLCRGFYIGQLDSDDFLEPDAVELCLDEFRKDLSLACVYTTNRNIDREGNLIS
NGYNWPIYSREKLTSAMICHHFRMFTARAWNLTEGFNESISNAVDYDMYLKLSEVGPFKHINKICYNRVLHGENTSIKKL
DIQKENHFKVVNESLSRLGIKKYKYSPLTNLNECRKYTWEKIENDL
>P01555 ~~~ctxA~~~Cholera enterotoxin subunit A~~~
MVKIIFVFFIFLSSFSYANDDKLYRADSRPPDEIKQSGGLMPRGQSEYFDRGTQMNINLYDHARGTQTGFVRHDDGYVST
SISLRSAHLVGQTILSGHSTYYIYVIATAPNMFNVNDVLGAYSPHPDEQEVSALGGIPYSQIYGWYRVHFGVLDEQLHRN
RGYRDRYYSNLDIAPAADGYGLAGFPPEHRAWREEPWIHHAPPGCGNAPRSSMSNTCDEKTQSLGVKFLDEYQSKVKRQI
FSGYQSDIDTHNRIKDEL
>P01556 ~~~ctxB~~~Cholera enterotoxin subunit B~~~
MIKLKFGVFFTVLLSSAYAHGTPQNITDLCAEYHNTQIYTLNDKIFSYTESLAGKREMAIITFKNGAIFQVEVPGSQHID
SQKKAIERMKDTLRIAYLTEAKVEKLCVWNNKTPHAIAAISMAN
>A0A384LP51 2.1.1.342~~~chuW~~~Anaerobilin synthase~~~
MNANNTLDLTPHFALDGDQPFKDRRAMMPFRGAIPVAKEQLAQTWQEMINQTASPRKRLVYLHIPFCATHCTFCGFYQNR
FNEDACAHYTDALIREIEMEADSVLHQSAPIHAVYFGGGTPSALSAHDLARIITTLREKLPLAPDCEITIEGRVLNFDAE
RIDACLDAGANRFSIGIQSFNSKIRKKMARTSDGPTAIAFMESLVKRDRAAVVCDLLFGLPGQDAQTWGEDLAIARDIGL
DGVDLYALNVLSNTPLGKAVENGRTTVPSPAERRDLYLQGCDFMDDAGWRCISNSHWGRTTRERNLYNLLIKQGADCLAF
GSGAGGSINGYSWMNERNLQTWHESVAAGKKPLMLIMRNAERNAQWRHTLQSGVETARVPLDELTPHAEKLAPLLAQWHQ
KGLSRDASTCLRLTNEGRFWASNILQSLNELIQVLNAPAIMREKP
>P25548 ~~~chvE~~~Multiple sugar-binding periplasmic receptor ChvE~~~COG4213
MKSIISLMAACAIGAASFAAPAFAQDKGSVGIAMPTKSSARWIDDGNNIVKQLQEAGYKTDLQYADDDIPNQLSQIENMV
TKGVKVLVIASIDGTTLSDVLKQAGEQGIKVIAYDRLIRNSGDVSYYATFDNFQVGVLQATSITDKLGLKDGKGPFNIEL
FGGSPDDNNAFFFYDGAMSVLKPYIDSGKLVVKSGQMGMDKVGTLRWDPATAQARMDNLLSAYYTDAKVDAVLSPYDGLS
IGIISSLKGVGYGTKDQPLPVVSGQDAEVPSVKSIIAGEQYSTIFKDTRELAKVTVNMVNAVMEGKEPEVNDTKTYENGV
KVVPSYLLKPVAVTKENYKQVLVDGGYYKEDQLK
>Q5EK40 2.4.2.36~~~chxA~~~Cholix toxin~~~
MYLTFYLEKVMKKMLLIAGATVISSMAHPTFAVEDELNIFDECRSPCSLTPEPGKPIQSKLSIPSDVVLDEGVLYYSMTI
NDEQNDIKDEDKGESIITIGEFATVRATRHYVNQDAPFGVIHLDITTENGTKTYSYNRKEGEFAINWLVPIGEDSPASIK
ISVDELDQQRNIIEVPKLYSIDLDNQTLEQWKTQGNVSFSVTRPEHNIAISWPSVSYKAAQKEGSRHKRWAHWHTGLALC
WLVPMDAIYNYITQQNCTLGDNWFGGSYETVAGTPKVITVKQGIEQKPVEQRIHFSKGNAMSALAAHRVCGVPLETLARS
RKPRDLTDDLSCAYQAQNIVSLFVATRILFSHLDSVFTLNLDEQEPEVAERLSDLRRINENNPGMVTQVLTVARQIYNDY
VTHHPGLTPEQTSAGAQAADILSLFCPDADKSCVASNNDQANINIESRSGRSYLPENRAVITPQGVTNWTYQELEATHQA
LTREGYVFVGYHGTNHVAAQTIVNRIAPVPRGNNTENEEKWGGLYVATHAEVAHGYARIKEGTGEYGLPTRAERDARGVM
LRVYIPRASLERFYRTNTPLENAEEHITQVIGHSLPLRNEAFTGPESAGGEDETVIGWDMAIHAVAIPSTIPGNAYEELA
IDEEAVAKEQSISTKPPYKERKDELK
>A0A0H3MDW1 ~~~chxR~~~Atypical response regulator protein ChxR~~~
MAGPKHVLLVSEHWDLFFQTKELLNPEEYRCTIGQQYKQELSADLVVCEYSLLPREIRSPKSLEGSFVLVLLDFFDEETS
VDLLDRGFWYLIRPITPRILKSAISLFLSQHSLHSVPESIRFGPNVFYVLKLTVETPEGSVHLTPSESGILKRLLINKGQ
LCLRKHLLEEIKNHAKAIVARNVDVHIASLRKKLGAYGSRIVTLRGVGYLFSDDGDKKFSQQDTKLS
>P60647 ~~~cidA~~~Holin-like protein CidA~~~COG1380
MHKVQLIIKLLLQLGIIIVITYIGTEIQKIFHLPLAGSIVGLFLFYLLLQFKIVPLTWVEDGANFLLKTMVFFFIPSVVG
IMDVASEITLNYILFFAVIIIGTCIVALSSGYIAEKMSVKHKHRKGVDAYE
>P60639 ~~~cidB~~~Holin-like protein CidB~~~COG1346
MNDYVQALLMILLTVVLYYFAKRLQQKYPNPFLNPALIASLGIIFVLLIFGISYNGYMKGGSWINHILNATVVCLAYPLY
KNREKIKDNVSIIFASVLTGVMLNFMLVFLTLKAFGYSKDVIVTLLPRSITAAVGIEVSHELGGTDTMTVLFIITTGLIG
SILGSMLLRFGRFESSIAKGLTYGNASHAFGTAKALEMDIESGAFSSIGMILTAVISSVLIPVLILLFY
>Q63KH5 3.5.1.44~~~cif~~~Protein-glutamine deamidase Cif~~~
MLEHGVMKIPGINNVGKTGQAGGETERIPSTEPLGSSAATSPAGPLGGLPARSSSISNTNRTGENPMITPIISSNLGLKH
RVTLRKATLASLMQSLSGESSNRVMWNDRYDTLLIARDPREIKNAIEKSVTDFGGLENYKELTGGADPFALMTPVCGLSA
NNIFKLMTEKDVPIDPTSIEYLENTSFAEHVNTLDSHKNYVVIVNDGRLGHKFLIDLPALTQGPRTAYIIQSDLGGGALP
AVRVEDWISRRGSDPVSLDELNQLLSKDFSKMPDDVQTRLLASILQIDKDPHKVDIKKLHLDGKLRFASHEYDFRQFQRN
AQYVAGLG
>P0DUW5 3.5.1.44~~~cif~~~Protein-glutamine deamidase Cif~~~
MKDITLPPPTSASCLTGAISVNTEAVLSPMQHTSALHVRDFASLCSQNLKANVLLNSDDHEVPIHQKNPAAIMQNIDSNI
KQMATDWGMSIEEVEVIIGREKGIVEPSCGVTANAIMKLFLDKDGFSYCFENEQTLSLEQLQERLSCMPECKSFVLRVND
GALGHAYIVDIPKGENSCRPAFLYQSDLGEGVTRKLRFEDWMTHKALTPILLDDICNYFSCMSQNKTDLEQIATLFDIDG
NVKMLRKENIQYQKHDNFSFQLFEYDTDNIEKNIEIIKSLCS
>Q7N439 3.5.1.44~~~cif~~~Protein-glutamine deamidase Cif~~~
MGDDIMPISNLAKESEVRAVKDIPCKNIETDNHLEIGLSSGLSRSKDTSKFKKNSINTIKLIDDIIALHNDPKGNKLLWN
DNWQDKIINRDLANIFEKIDESVSELGGLEMYQEMVGVNPYDPTEPVCGLSAQNIFKLMTEGEHAVDPVEMAQTGKIDGN
EFAESVDQLSSAKNYVALVNDRRLGHMFLIDIPSNDQETVGYIYQSDLGQGALPPLKIADWLNSRGKDAVSLNKLKKLLS
REFNLLSDDEKRALISETLDIHKDVSNVELDRIKRDRGVDIYLTEYDVNNFYENIETLKSKLSNYDKKLSKPK
>A0A0H3B1Q8 3.5.1.44~~~cif~~~Protein-glutamine deamidase Cif~~~
MKISPNTISPSQSDPRMSTNVSQRSRVSGIGVPVSHSINNPSIQHVQDFATLSARSLRANVLLNSDDHSVPIHAKNPSEL
LEAIDNNISQTAQDWGVSIQEVEVILGSSKRIIEPVCGVTANTIMKLFLDNDIFSYSFEKGQSLSLSQLQERLASLPAHK
NFILRVNDGGLGHAYVIDFPATTNPSRDAFLYQSDLGEGVTREVRFEDWMTQKASHPISLDDINTHFIGIAQDQIDLAHI
AKLFDVDGNVKMLRADHLISHKTSEFNFQLFEYDLKNLENNMSIIKTHCN
>Q9KHI5 2.7.13.3~~~cikA~~~Circadian input-output histidine kinase CikA~~~COG2205
MLAPSSNCSLASQRLTPEGFAQLQSALQDFVATLPQAFYWDSRSLHTHLRTQTGDCAIAIAAGFQLLLLGRTAAEYCQPH
PLSEPHHVSVQFGADSIQRYCQATNLPVEYQPALAQLGDLSLNPDLISQFSNLLIAAIAADRAPLAAQYPAVSVCQPLEQ
ALHWQEEQDRLISQVSAQIRLSLDLSEILTTTIREIRQLLNADRAIIYQFKPQCLDAGLDQRWPLYIPSQSYITYEDRRN
EALLSVIDPLVQPGLLITTEEWQRFQQGETLLIDSVGFYKERLPEQYSFYERVQVRSVCKIPILVQGRIWGLLVAHQCQQ
DHRWQPRERDILQHLAEHLSIAIYQAQLYGQLQDQTQTLENRVLERTQELIDALALAQAANAAKGEFLATMSHELRTPLT
CVIGMSSTLLRWAFGPLTERQREYIKAIHDSGEHLLELINDILDLSQIEAGKAALQVRPFSLSRLATQTLNTLQEKARLG
EIQLMLDLQLNNRVDVFRADPKRLRQILINLLSNAVKFTEPQGTVFLRVWREGDRAIFQVSDTGIGIPESEQAQLFQKFQ
QLDTSIRRQYGGTGLGLALTKQLVELHGGHIQIESTVGQGSTFTVWIPEQTLIEPVEPRPSIDNLPAGHILLLEEEDEAA
TVVCEMLTAAGFKVIWLVDGSTALDQLDLLQPIVILMAWPPPDQSCLLLLQHLREHQADPHPPLVLFLGEPPVDPLLTAQ
ASAILSKPLDPQLLLTTLQGLCPPNLSEGDRPSS
>P74111 2.7.13.3~~~cikA~~~Circadian input-output histidine kinase CikA~~~COG0642
MLPAFSPIFRRLLPAVTFERLLRFWRTLAQQTGDGVQCFVGDLPSSLKPPPGPSVLEAEVDHRFALLVSPGQWALLEGEQ
ISPHHYAVSITFAQGIIEDFIQKQNLPVVAEAMPHRPETPSGPTIAEQLTLGLLEILNSDSTSFSPEPSLQDSLQASQVK
LLSQVIAQIRQSLDLSEILNNAVTAVQKFLFVDRLVIYQFHYSQPSLTPLEENQIPAPRPRQQYGEVTYEARRSPEIDTM
LGIMTENDCFSQVFSYEQKYLKGAVVAVSDIENHYSSSYCLVGLLQRYQVRAKLVAPIIVEGQLWGLLIAHQCHHPRQWL
DSEKNFLGQIGEHLAVAIVQSLLYSEVQKQKNNFEKRVIERTKELRDTLMAAQAANLLKSQFINNISHELRTPLTSIIGL
SATLLRWFDHPASLPPAKQQYYLLNIQENGKKLLDQINSIIQLSQLESGQTALNCQSFSLHTLAQTVIHSLLGVAIKQQI
NLELDYQINVGQDQFCADQERLDQILTQLLNNALKFTPAEGTVILRIWKESNQAIFQVEDTGIGINEQQLPVLFEAFKVA
GDSYTSFYETGGVGLALTKQLVELHGGYIEVESSPGQGTIFTTVIPQQNFPPTTKGQVQDKLDAAMPFNSSVIVIEQDEE
IATLICELLTVANYQVIWLIDTTNALQQVELLQPGLIIVDGDFVDVTEVTRGIKKSRRISKVTVFLLSESLSSAEWQALS
QKGIDDYLLKPLQPELLLQRVQSIQQEPLR
>Q8DKG0 2.7.13.3~~~cikA~~~Circadian input-output histidine kinase CikA~~~COG0745
MPQPIFDRILPAFLYERIATVLLAQASRRGATVLTREEVIASTDAPFLIVVAESFALLLQAEPVPQMSTYRVAILTNPRA
IARFLRKIRSQVPVNRRPLIRAVLQQLSPLNAKEQMLPADLAIALMAVLGEETTAQCQSCQPLVTAALNERQAQERLLHQ
VTTQIRQSLELPELLKIAVDRIREFLDVDRLLVGQFAQTEGELRGQITYESCRNSEIPSVLGIWDDCWQWSGLPSSSYQR
LSQGEAIVVSDIQQFYGAVPCLQSFAAHWQIKSWLIVPIIVQDRLWGVLIAHQCDRPRQWQPQEVEFLTHLSQHLSIAIY
QAQLYSELQQQKATLEQRVNERTQALREALSAMEAAHRIKNDFLATMSHELRTPLTCVIGVSATLLRWPLGPLTAKQREY
LEIIHESGTHLLELINSILDLSEAELGRSQLHRSAFSIRQLCADCLEVVKPQAHRHQVNLRHQLMIPPTRDRFWGDYRRI
QQILINLLSNAIKFTPAMGEVILRAWWKEDELIFQVQDTGIGIPAHLQSLLFQKFQQLDSSFGRAYTGAGLGLALTKQWV
DLHHGWIDVDSTEGKGSTFTVGLPAISDPLPDPPKPKLDVPPLATTEVLVEPEGRIVLVSEDEATSTLICSILTTAGYQV
IWLVDGEVERLLALTPIAVLLAEPFSYGDVQELVDQLRQRCTPEQLKIFILGSKGNYQGVDRYIPLPIHPESFLQQVTMG
LTSLATSAQ
>P75726 4.1.3.6~~~citF~~~Citrate lyase alpha chain~~~COG3051
MTQKIEQSQRQERVAAWNRRAECDLAAFQNSPKQTYQAEKARDRKLCANLEEAIRRSGLQDGMTVSFHHAFRGGDLTVNM
VMDVIAKMGFKNLTLASSSLSDCHAPLVEHIRQGVVTRIYTSGLRGPLAEEISRGLLAEPVQIHSHGGRVHLVQSGELNI
DVAFLGVPSCDEFGNANGYTGKACCGSLGYAIVDADNAKQVVMLTEELLPYPHNPASIEQDQVDLIVKVDRVGDAAKIGA
GATRMTTNPRELLIARSAADVIVNSGYFKEGFSMQTGTGGASLAVTRFLEDKMRSRDIRADFALGGITATMVDLHEKGLI
RKLLDVQSFDSHAAQSLARNPNHIEISANQYANWGSKGASVDRLDVVVLSALEIDTQFNVNVLTGSDGVLRGASGGHCDT
AIASALSIIVAPLVRGRIPTLVDNVLTCITPGSSVDILVTDHGIAVNPARPELAERLQEAGIKVVSIEWLRERARLLTGE
PQPIEFTDRVVAVVRYRDGSVIDVVHQVKE
>P45413 4.1.3.6~~~citF~~~Citrate lyase alpha chain~~~
MKETVAMLNQQYVMPNGLTPYAGVTAKSPWLASESEKRQRKICDSLETAIRRSGLQNGMTISFHHAFRGGDKVVNMVVAK
LAEMGFRDLTLASSSLIDAHWPLIEHIKNGVIRQIYTSGLRGKLGEEISAGLMENPVQIHSHGGRVQLIQSGELSIDVAF
LGVPCCDEFGNANGFSGKSRCGSLGYARVDAEHAKCVVLLTEEWVDYPNYPASIAQDQVDLIVQVDEVGDPQKITAGAIR
LTSNPRELLIARQAAKVVEHSGYFKEGFSLQTGTGGASLAVTRFLEDKMRRNGITASFGLGGITGTMVDLHEKGLIKTLL
DTQSFDGDAARSLAQNPNHVEISTNQYASPGSKGASCERLNVVMLSALEIDIDFNVNVMTGSNGVLRGASGGHSDTAAGA
DLTIITAPLVRGRIPCVVEKVLTRVTPGASVDVLVTDHGIAVNPARQDLIDNLRSAGIPLMTIEELQQRAELLTGKPQPI
EFTDRVVAVVRYRDGSVIDVIRQVKNSD
>Q74C76 2.3.3.21~~~cimA~~~(R)-citramalate synthase~~~COG0119
MSLVKLYDTTLRDGTQAEDISFLVEDKIRIAHKLDEIGIHYIEGGWPGSNPKDVAFFKDIKKEKLSQAKIAAFGSTRRAK
VTPDKDHNLKTLIQAEPDVCTIFGKTWDFHVHEALRISLEENLELIFDSLEYLKANVPEVFYDAEHFFDGYKANPDYAIK
TLKAAQDAKADCIVLCDTNGGTMPFELVEIIREVRKHITAPLGIHTHNDSECAVANSLHAVSEGIVQVQGTINGFGERCG
NANLCSIIPALKLKMKRECIGDDQLRKLRDLSRFVYELANLSPNKHQAYVGNSAFAHKGGVHVSAIQRHPETYEHLRPEL
VGNMTRVLVSDLSGRSNILAKAEEFNIKMDSKDPVTLEILENIKEMENRGYQFEGAEASFELLMKRALGTHRKFFSVIGF
RVIDEKRHEDQKPLSEATIMVKVGGKIEHTAAEGNGPVNALDNALRKALEKFYPRLKEVKLLDYKVRVLPAGQGTASSIR
VLIESGDKESRWGTVGVSENIVDASYQALLDSVEYKLHKSEEIEGSKK
>Q8F3Q1 2.3.3.21~~~cimA~~~(R)-citramalate synthase CimA~~~
MTKVETRLEILDVTLRDGEQTRGVSFSTSEKLNIAKFLLQKLNVDRVEIASARVSKGELETVQKIMEWAATEQLTERIEI
LGFVDGNKTVDWIKDSGAKVLNLLTKGSLHHLEKQLGKTPKEFFTDVSFVIEYAIKSGLKINVYLEDWSNGFRNSPDYVK
SLVEHLSKEHIERIFLPDTLGVLSPEETFQGVDSLIQKYPDIHFEFHGHNDYDLSVANSLQAIRAGVKGLHASINGLGER
AGNTPLEALVTTIHDKSNSKTNINEIAITEASRLVEVFSGKRISANRPIVGEDVFTQTAGVHADGDKKGNLYANPILPER
FGRKRSYALGKLAGKASISENVKQLGMVLSEVVLQKVLERVIELGDQNKLVTPEDLPFIIADVSGRTGEKVLTIKSCNIH
SGIGIRPHAQIELEYQGKIHKEISEGDGGYDAFMNALTKITNRLGISIPKLIDYEVRIPPGGKTDALVETRITWNKSLDL
EEDQTFKTMGVHPDQTVAAVHATEKMLNQILQPWQI
>P94363 ~~~cimH~~~Citrate/malate transporter~~~COG3493
MGELQTHMQLQTDTIHEGVRKENWFAKAMNIKVGIIPLPVYALLFILITVFVMHHDVKSDILTSIAVMAFFGFTFAQIGK
SIPIVRSIGGPAILATFIPSAVVYYHLLPNDIVKSTTEFTENSNFLYLFIAGIVVGSILGMKRETLVKAFMKIFIPLIVG
SVTAAIVGLAVGTLLGLGFQHTLLYIVIPIMAGGVGEGAIPLSIGYSDIMPISQGEAFALVLPSIMLGSLCAIILAGLLN
RIGKKKPEWTGNGKVDRSEEESPALEESQSGQQMFNLSLFASGGILAVSLYLVGMLAHDFFGFPAPVAMLLLAVLIKLFR
LVPASIENGAFGVSRFFSTAVTYPLLFAIGVSMTPWDKLVAAFNLSNIITILSVVVTMMAVGFFTGKWLNMYPIETAIIN
ACHSGQGGTGDVAILSAAERLELMPFAQVSTRIGGAITVSLTLLLLHQFY
>P9WPE3 ~~~cinA~~~CinA-like protein~~~COG1058
MAVSARAGIVITGTEVLTGRVQDRNGPWIADRLLELGVELAHITICGDRPADIEAQLRFMAEQGVDLIVTSGGLGPTADD
MTVEVVARYCGRELVLDDELENRIANILKKLMGRNPAIEPANFDSIRAANRKQAMIPAGSQVIDPVGTAPGLVVPGRPAV
MVLPGPPRELQPIWSKAIQTAPVQDAIAGRTTYRQETIRIFGLPESSLADTLRDAEAAIPGFDLVEITTCLRRGEIEMVT
RFEPNAAQVYTQLARLLRDRHGHQVYSEDGASVDELVAKLLTGRRIATAESCTAGLLAARLTDRPGSSKYVAGAVVAYSN
EAKAQLLGVDPALIEAHGAVSEPVAQAMAAGALQGFGADTATAITGIAGPSGGTPEKPVGTVCFTVLLDDGRTTTRTVRL
PGNRSDIRERSTTVAMHLLRRTLSGIPGSP
>Q8VQF6 1.14.14.133~~~cinA~~~1,8-cineole 2-endo-monooxygenase~~~
MTATVASTSLFTTADHYHTPLGPDGTPHAFFEALRDEAETTPIGWSEAYGGHWVVAGYKEIQAVIQNTKAFSNKGVTFPR
YETGEFELMMAGQDDPVHKKYRQLVAKPFSPEATDLFTEQLRQSTNDLIDARIELGEGDAATWLANEIPARLTAILLGLP
PEDGDTYRRWVWAITHVENPEEGAEIFAELVAHARTLIAERRTNPGNDIMSRVIMSKIDGESLSEDDLIGFFTILLLGGI
DNTARFLSSVFWRLAWDIELRRRLIAHPELIPNAVDELLRFYGPAMVGRLVTQEVTVGDITMKPGQTAMLWFPIASRDRS
AFDSPDNIVIERTPNRHLSLGHGIHRCLGAHLIRVEARVAITEFLKRIPEFSLDPNKECEWLMGQVAGMLHVPIIFPKGK
RLSE
>P29827 ~~~cinA~~~Lantibiotic cinnamycin~~~
MTASILQQSVVDADFRAALLENPAAFGASAAALPTPVEAQDQASLDFWTKDIAATEAFACRQSCSFGPFTFVCDGNTK
>Q8DRX2 ~~~cinA~~~Putative competence-damage inducible protein~~~COG1058
MKSEIIAVGTEILTGQIVNTNSQFLSEKFAELGIDVYFQTAVGDNEERLLSVLKIAKERSDLIVLCGGLGPTEDDLTKQT
LAKFLKRELVFDKTAQERLDEFFASRPTSMRTPNNECQAQIIAGSQPLSNKTGLAVGGLLEADGVTYVVLPGPPSELKPM
VNKELLPYLSKTSEKLYSRVLRFFGIGESHLVTLLHDLIAEQTDPTIAPYAKTGEVTIRLSTKAHRQKEADSKLDKLEKK
IITIDNLADYFYGYGEENSLPQVVFDLLKEKGKTITAAESLTAGLFQARLADFAGASDIFKGGFITYSIEEKARMLGIPF
EDLQLHGVVSAFTAEKMAERSRQLTQADLAISLTGVAGPDSLEGQPAGTVFIGLSSSKRTMAIKVLIGGRSRSDVRYIAV
LHAFNLVRQTLLSHKNLV
>P54184 ~~~cinA~~~Putative competence-damage inducible protein~~~COG1058
MKAEIIAVGTEILTGQIVNTNAQFLSEKLAEIGVDVYFQTAVGDNEVRLLSLLEIASQRSSLVILTGGLGPTEDDLTKQT
LAKFLGKALVFDPQAQEKLDIFFTLRPDYARTPNNERQAQIVEGAIPLPNETGLAVGGKLEVDGVTYVVLPGPPSELKPM
VLNQLLPKLMTGSKLYSRVLRFFGIGESQLVTILADLIDNQIDPTLAPYAKTGEVTLRLSTKASSQEEANQALDILENQI
LDCQTFEGISLRDFCYGYGEETSLASIVVEELKRQGKTIAAAESLTAGLFQATVANFSGVSSIFKGGFVTYSLEEKSRML
DIPAKNLEEHGVVSEFTAQKMAEQARSKTQSDFGISLTGVAGPDSLEGHPVGTVFIGLAQDQGTEVIKVNIGGRSRADVR
HIAVMHAFNLVRKALLSD
>Q8VQF4 ~~~cinC~~~Cindoxin~~~
MNALILYGTETGNAEACATTISQVLADTVDTKVHDLADMTPRAMLDSGADLIVFATATYGEGEFAGGGAAFFETLRETKP
DLSGLRFAVFGLGDSYYTTFNQAGATAATILASLGGTQVGDTARHDTSSGDDPEETAEEWAREILTALATPAVS
>Q06851 ~~~cipA~~~Cellulosomal-scaffolding protein A~~~COG2911
MRKVISMLLVVAMLTTIFAAMIPQTVSAATMTVEIGKVTAAVGSKVEIPITLKGVPSKGMANCDFVLGYDPNVLEVTEVK
PGSIIKDPDPSKSFDSAIYPDRKMIVFLFAEDSGRGTYAITQDGVFATIVATVKSAAAAPITLLEVGAFADNDLVEISTT
FVAGGVNLGSSVPTTQPNVPSDGVVVEIGKVTGSVGTTVEIPVYFRGVPSKGIANCDFVFRYDPNVLEIIGIDPGDIIVD
PNPTKSFDTAIYPDRKIIVFLFAEDSGTGAYAITKDGVFAKIRATVKSSAPGYITFDEVGGFADNDLVEQKVSFIDGGVN
VGNATPTKGATPTNTATPTKSATATPTRPSVPTNTPTNTPANTPVSGNLKVEFYNSNPSDTTNSINPQFKVTNTGSSAID
LSKLTLRYYYTVDGQKDQTFWCDHAAIIGSNGSYNGITSNVKGTFVKMSSSTNNADTYLEISFTGGTLEPGAHVQIQGRF
AKNDWSNYTQSNDYSFKSASQFVEWDQVTAYLNGVLVWGKEPGGSVVPSTQPVTTPPATTKPPATTKPPATTIPPSDDPN
AIKIKVDTVNAKPGDTVNIPVRFSGIPSKGIANCDFVYSYDPNVLEIIEIKPGELIVDPNPDKSFDTAVYPDRKIIVFLF
AEDSGTGAYAITKDGVFATIVAKVKSGAPNGLSVIKFVEVGGFANNDLVEQRTQFFDGGVNVGDTTVPTTPTTPVTTPTD
DSNAVRIKVDTVNAKPGDTVRIPVRFSGIPSKGIANCDFVYSYDPNVLEIIEIEPGDIIVDPNPDKSFDTAVYPDRKIIV
FLFAEDSGTGAYAITKDGVFATIVAKVKSGAPNGLSVIKFVEVGGFANNDLVEQKTQFFDGGVNVGDTTEPATPTTPVTT
PTTTDDLDAVRIKVDTVNAKPGDTVRIPVRFSGIPSKGIANCDFVYSYDPNVLEIIEIEPGDIIVDPNPDKSFDTAVYPD
RKIIVFLFAEDSGTGAYAITKDGVFATIVAKVKSGAPNGLSVIKFVEVGGFANNDLVEQKTQFFDGGVNVGDTTEPATPT
TPVTTPTTTDDLDAVRIKVDTVNAKPGDTVRIPVRFSGIPSKGIANCDFVYSYDPNVLEIIEIEPGDIIVDPNPDKSFDT
AVYPDRKIIVFLFAEDSGTGAYAITKDGVFATIVAKVKEGAPNGLSVIKFVEVGGFANNDLVEQKTQFFDGGVNVGDTTE
PATPTTPVTTPTTTDDLDAVRIKVDTVNAKPGDTVRIPVRFSGIPSKGIANCDFVYSYDPNVLEIIEIEPGELIVDPNPT
KSFDTAVYPDRKMIVFLFAEDSGTGAYAITEDGVFATIVAKVKSGAPNGLSVIKFVEVGGFANNDLVEQKTQFFDGGVNV
GDTTEPATPTTPVTTPTTTDDLDAVRIKVDTVNAKPGDTVRIPVRFSGIPSKGIANCDFVYSYDPNVLEIIEIEPGDIIV
DPNPDKSFDTAVYPDRKIIVFLFAEDSGTGAYAITKDGVFATIVAKVKEGAPNGLSVIKFVEVGGFANNDLVEQKTQFFD
GGVNVGDTTVPTTSPTTTPPEPTITPNKLTLKIGRAEGRPGDTVEIPVNLYGVPQKGIASGDFVVSYDPNVLEIIEIEPG
ELIVDPNPTKSFDTAVYPDRKMIVFLFAEDSGTGAYAITEDGVFATIVAKVKEGAPEGFSAIEISEFGAFADNDLVEVET
DLINGGVLVTNKPVIEGYKVSGYILPDFSFDATVAPLVKAGFKVEIVGTELYAVTDANGYFEITGVPANASGYTLKISRA
TYLDRVIANVVVTGDTSVSTSQAPIMMWVGDIVKDNSINLLDVAEVIRCFNATKGSANYVEELDINRNGAINMQDIMIVH
KHFGATSSDYDAQ
>P17315 ~~~cirA~~~Colicin I receptor~~~COG4771
MFRLNPFVRVGLCLSAISCAWPVLAVDDDGETMVVTASSVEQNLKDAPASISVITQEDLQRKPVQNLKDVLKEVPGVQLT
NEGDNRKGVSIRGLDSSYTLILVDGKRVNSRNAVFRHNDFDLNWIPVDSIERIEVVRGPMSSLYGSDALGGVVNIITKKI
GQKWSGTVTVDTTIQEHRDRGDTYNGQFFTSGPLIDGVLGMKAYGSLAKREKDDPQNSTTTDTGETPRIEGFSSRDGNVE
FAWTPNQNHDFTAGYGFDRQDRDSDSLDKNRLERQNYSVSHNGRWDYGTSELKYYGEKVENKNPGNSSPITSESNTVDGK
YTLPLTAINQFLTVGGEWRHDKLSDAVNLTGGTSSKTSASQYALFVEDEWRIFEPLALTTGVRMDDHETYGEHWSPRAYL
VYNATDTVTVKGGWATAFKAPSLLQLSPDWTSNSCRGACKIVGSPDLKPETSESWELGLYYMGEEGWLEGVESSVTVFRN
DVKDRISISRTSDVNAAPGYQNFVGFETGANGRRIPVFSYYNVNKARIQGVETELKIPFNDEWKLSINYTYNDGRDVSNG
ENKPLSDLPFHTANGTLDWKPLALEDWSFYVSGHYTGQKRADSATAKTPGGYTIWNTGAAWQVTKDVKLRAGVLNLGDKD
LSRDDYSYNEDGRRYFMAVDYRF
>A5N6T4 2.3.3.3~~~~~~Citrate (Re)-synthase~~~COG0119
MKKCSYDYKLNNVNDPNFYKDIFPYEEVPKIVFNNIQLPMDLPDNIYITDTTFRDGQQSMPPYTSREIVRIFDYLHELDN
NSGIIKQTEFFLYTKKDRKAAEVCMERGYEFPEVTSWIRADKEDLKLVKDMGIKETGMLMSCSDYHIFKKLKMTRKETMD
MYLDLAREALNNGIRPRCHLEDITRADFYGFVVPFVNELMKMSKEANIPIKIRACDTLGLGVPYNGVEIPRSVQGIIHGL
RNICEVPSESIEWHGHNDFYGVVTNSSTAWLYGASSINTSFLGIGERTGNCPLEAMIFEYAQIKGNTKNMKLHVITELAQ
YFEKEIKYSVPVRTPFVGTDFNVTRAGIHADGILKDEEIYNIFDTDKILGRPVVVAVSQYSGRAGIAAWVNTYYRLKDED
KVNKNDSRIDQIKMWVDEQYRAGRTSVIGNNELELLVSKVMPEVIEKTEERAS
>P0DO96 2.3.3.3~~~~~~Citrate (Re)-synthase~~~
MGKIFIIDVTNRDGVQTARLGLSKLEKTLINIYLDEMGIFQSEFGFPTTKHERGYVEANLELAKMGVIKNLRLEGWIRAI
VADVDLAFRRAPSLKHLNLSISTSEQMINGKFQGRKVFKDIIEDMTIAVNAAYAKGAETVGVNAEDASRTSIVNLIEFGK
AAKEVGATRLRYCDTLGYDNPFTIYETARTLAEKVGMPIEIHCHGDLGMAIGNSLAGAKGVIDGGQDVYVNTTVNGIGER
AGNADLVAFLLAILKSKGFGEKYQLGHEVDLSKAWKIARFASYAFDVEIPINQPGVGRNCFAHASGIHADGVIKDSQNYE
LYGYEELGRGEALMVETGREICAGQYSGISGFRHVMGNMSVELPEDKDEANKILELVRYANVEAHKPLVEDELIFIAKYP
EISRRLLTLTPLMND
>Q2LTE1 2.3.3.3~~~~~~Citrate (Re)-synthase~~~COG0119
MAKWNPQKRVLNHEHTRFWRFELRDVDEPNLQKEVFPYDEVSRIDFDHRIIPIQPAEEIFITDTTFRDGQQARPPYTTQQ
IVDLYQMMSRLGGYNGIIRQTEFFLYSNRDKEAVRMCQDLGLQYPEITGWIRAAREDIPLVKEAGLKETGILTSVSDYHI
FLKLNMTRSQALEEYLGIVKAILDAGIVPRCHFEDITRADIYGFCIPFAIELMKLREESGVDIKIRLCDTMGYGVTYPGA
SLPRGVDKLVRAFIDDADVPGRLLEWHGHNDFHKALINATTAWLYGCSAANSTLLGLGERTGNPPIEGLIIEYIGLMGKT
NGIDTTVITDIANYFKNEIEYKIPSNYPFVGADFNVTRAGVHADGLIKSEEIYNIFNTTKILKRPIVPMITDKSGKAGIA
YWINSHFGLSGDSTVDKRHPGISKINKWIADEYELGRVTTISTEELEAKVRKYMPELFMSDLERIKFKAAEAAIAVLRKI
IDDPAMKTMQPELQEPVMQRFIEEYPSIQFAYVVDMNGKKTTRNITNIVDRAKYENYGVGTDQSDREWFIKPLQTGKLHV
TDFYISKMTGALCFTVSEPITDDNDDMVGIFGVDIRVEDLVKEPEYIAEATQIALKAEYDAKYKSDHWL
>P9WPD5 2.3.3.16~~~gltA2~~~Citrate synthase 1~~~COG0372
MADTDDTATLRYPGGEIDLQIVHATEGADGIALGPLLAKTGHTTFDVGFANTAAAKSSITYIDGDAGILRYRGYPIDQLA
EKSTFIEVCYLLIYGELPDTDQLAQFTGRIQRHTMLHEDLKRFFDGFPRNAHPMPVLSSVVNALSAYYQDALDPMDNGQV
ELSTIRLLAKLPTIAAYAYKKSVGQPFLYPDNSLTLVENFLRLTFGFPAEPYQADPEVVRALDMLFILHADHEQNCSTST
VRLVGSSRANLFTSISGGINALWGPLHGGANQAVLEMLEGIRDSGDDVSEFVRKVKNREAGVKLMGFGHRVYKNYDPRAR
IVKEQADKILAKLGGDDSLLGIAKELEEAALTDDYFIERKLYPNVDFYTGLIYRALGFPTRMFTVLFALGRLPGWIAHWR
EMHDEGDSKIGRPRQIYTGYTERDYVTIDAR
>P39120 2.3.3.16~~~citZ~~~Citrate synthase 2~~~COG0372
MTATRGLEGVVATTSSVSSIIDDTLTYVGYDIDDLTENASFEEIIYLLWHLRLPNKKELEELKQQLAKEAAVPQEIIEHF
KSYSLENVHPMAALRTAISLLGLLDSEADTMNPEANYRKAIRLQAKVPGLVAAFSRIRKGLEPVEPREDYGIAENFLYTL
NGEEPSPIEVEAFNKALILHADHELNASTFTARVCVATLSDIYSGITAAIGALKGPLHGGANEGVMKMLTEIGEVENAEP
YIRAKLEKKEKIMGFGHRVYKHGDPRAKHLKEMSKRLTNLTGESKWYEMSIRIEDIVTSEKKLPPNVDFYSASVYHSLGI
DHDLFTPIFAVSRMSGWLAHILEQYDNNRLIRPRADYTGPDKQKFVPIEERA
>P63778 2.3.3.16~~~citA~~~Putative citrate synthase 2~~~
MTVVPENFVPGLDGVVAFTTEIAEPDKDGGALRYRGVDIEDLVSQRVTFGDVWALLVDGNFGSGLPPAEPFPLPIHSGDV
RVDVQAGLAMLAPIWGYAPLLDIDDATARQQLARASVMALSYVAQSARGIYQPAVPQRIIDECSTVTARFMTRWQGEPDP
RHIEAIDAYWVSAAEHGMNASTFTARVIASTGADVAAALSGAIGAMSGPLHGGAPARVLPMLDEVERAGDARSVVKGILD
RGEKLMGFGHRVYRAEDPRARVLRAAAERLGAPRYEVAVAVEQAALSELRERRPDRAIETNVEFWAAVVLDFARVPANMM
PAMFTCGRTAGWCAHILEQKRLGKLVRPSAIYVGPGPRSPESVDGWERVLTTA
>P9WPD3 2.3.3.16~~~citA~~~Putative citrate synthase 2~~~COG0372
MTVVPENFVPGLDGVVAFTTEIAEPDKDGGALRYRGVDIEDLVSQRVTFGDVWALLVDGNFGSGLPPAEPFPLPIHSGDV
RVDVQAGLAMLAPIWGYAPLLDIDDATARQQLARASVMALSYVAQSARGIYQPAVPQRIIDECSTVTARFMTRWQGEPDP
RHIEAIDAYWVSAAEHGMNASTFTARVIASTGADVAAALSGAIGAMSGPLHGGAPARVLPMLDEVERAGDARSVVKGILD
RGEKLMGFGHRVYRAEDPRARVLRAAAERLGAPRYEVAVAVEQAALSELRERRPDRAIETNVEFWAAVVLDFARVPANMM
PAMFTCGRTAGWCAHILEQKRLGKLVRPSAIYVGPGPRSPESVDGWERVLTTA
>P20901 2.3.3.16~~~aarA~~~Citrate synthase~~~
MSASQKEGKLSTATISVDGKSAEMPVLSGTLGPDVIDIRKLPAQLGVFTFDPGYGETAACNSKITFIDGDKGVLLHRGYP
IAQLDENASYEEVIYLLLNGELPNKVQYDTFTNTLTNHTLLHEQIRNFFNGFRRDAHPMAILCGTVGALSAFYPDANDIA
IPANRDLAAMRLIAKIPTIAAWAYKYTQGEAFIYPRNDLNYAENFLSMMFARMSEPYKVNPVLARAMNRILILHADHEQN
ASTSTVRLAGSTGANPFACIAAGIAALWGPAHGGANEAVLKMLARIGKKENIPAFIAQVKDKNSGVKLMGFGHRVYKNFD
PRAKIMQQTCHEVLTELGIKDDPLLDLAVELEKIALSDDYFVQRKLYPNVDFYSGIILKAMGIPTSMFTVLFAVARTTGW
VSQWKEMIEEPGQRISRPRQLYIGAPQRDYVPLAKR
>P39119 2.3.3.16~~~citA~~~Citrate synthase 1~~~COG0372
MVHYGLKGITCVETSISHIDGEKGRLIYRGHHAKDIALNHSFEEAAYLILFGKLPSTEELQVFKDKLAAERNLPEHIERL
IQSLPNNMDDMSVLRTVVSALGENTYTFHPKTEEAIRLIAITPSIIAYRKRWTRGEQAIAPSSQYGHVENYYYMLTGEQP
SEAKKKALETYMILATEHGMNASTFSARVTLSTESDLVSAVTAALGTMKGPLHGGAPSAVTKMLEDIGEKEHAEAYLKEK
LEKGERLMGFGHRVYKTKDPRAEALRQKAEEVAGNDRDLDLALHVEAEAIRLLEIYKPGRKLYTNVEFYAAAVMRAIDFD
DELFTPTFSASRMVGWCAHVLEQAENNMIFRPSAQYTGAIPEEVLS
>Q9RWB2 2.3.3.16~~~gltA~~~Citrate synthase~~~COG0372
MSNIAKGLEGVLFTESKLTFINGSEGILTHLGIPIQEWAEKSTFEELSLALLDAKLPTAEELAKFDAELKANRAIPDQLV
GIIRDMPKGVHPMQALRTAVSYLGLLDPQAEDITPEARRAISTRMIAQFSTIIAAINRAQEGQDIVAPRADLTHAGNFLY
MLTGNEPTPEQARLFDIALVLHADHGMNASTFTAIATSSTLSDMYSCMVSAIGALKGPLHGGANEAVMTMLDEIGTVDKA
EAYITGKLDNKEKIMGVGHRVYKYFDPRSRVLRDYAEHVANKEGKSNYYQILEAIEKIIVDRMGAKGIYPNVDFYSGTVY
SDLGIKKEYFTPIFALARISGWCASVIEYSQDNRLLRPDAEYTGARDQHYVDIKDRQ
>P0ABH7 2.3.3.16~~~gltA~~~Citrate synthase~~~COG0372
MADTKAKLTLNGDTAVELDVLKGTLGQDVIDIRTLGSKGVFTFDPGFTSTASCESKITFIDGDEGILLHRGFPIDQLATD
SNYLEVCYILLNGEKPTQEQYDEFKTTVTRHTMIHEQITRLFHAFRRDSHPMAVMCGITGALAAFYHDSLDVNNPRHREI
AAFRLLSKMPTMAAMCYKYSIGQPFVYPRNDLSYAGNFLNMMFSTPCEPYEVNPILERAMDRILILHADHEQNASTSTVR
TAGSSGANPFACIAAGIASLWGPAHGGANEAALKMLEEISSVKHIPEFVRRAKDKNDSFRLMGFGHRVYKNYDPRATVMR
ETCHEVLKELGTKDDLLEVAMELENIALNDPYFIEKKLYPNVDFYSGIILKAMGIPSSMFTVIFAMARTVGWIAHWSEMH
SDGMKIARPRQLYTGYEKRDFKSDIKR
>P14165 2.3.3.16~~~gltA~~~Citrate synthase~~~
MADKKAQLIIEGSAPVELPVLSGTMGPDVVDVRGLTATGHFTFDPGFMSTASCESKITYIDGDKGVLLHRGYPIEQLAEK
SDYLETCYLLLNGELPTAAQKEQFVGTIKNHTMVHEQLKTFFNGFRRDAHPMAVMCGVIGALSAFYHDSLDINNPKHREV
SAHRLIAKMPTIAAMVYKYSKGEPMMYPRNDLNYAENFLHMMFNTPCETKPISPVLAKAMDRIFILHADHEQNASTSTVR
LAGSSGANPFACIASGIAALWGPAHGGANEAVLRMLDEIGDVSNIDKFVEKAKDKNDPFKLMGFGHRVYKNFDPRAKVMK
QTCDEVLQELGINDPQLELAMKLEEIARHDPYFVERNLYPNVDFYSGIILKAIGIPTSMFTVIFALARTVGWISHWQEML
SGPYKIGRPRQLYTGHTQRDFTALKDRG
>A0A0S3QTD0 2.3.3.16~~~gltA~~~Citrate synthase~~~
MKLKERLAELIPQWRAEVAEIRKKYGNRKTMDCTIGHAYGGMRGLKALVCDTSEVFPDEGVKFRGYTIPELREGPHKLPT
AEGGFEPLPEGLWYLLLTGELPTEEDVKEISAEFTKRMQNVPQYVFDVLRAMPVDTHPMTMFAAGILAMQRESVFAKRYE
EGMRREEHWEAMLEDSLNMLAALPVIAAYIYRRKYKGDTHIAPDPNLDWSANLAHMMGFDDFEVYELFRLYMFLHSDHEG
GNVSAHTNLLVNSAYSDIYRSFSAAMNGLAGPLHGLANQEVLRWIQMLYKKFGGVPTKEQLERFAWDTLNSGQVIPGYGH
AVLRVTDPRYVAQRDFALKHLPDDELFKIVSLCYEVIPEVLKKHGKAKNPWPNVDAHSGVLLWHYGIREYDFYTVLFGVS
RALGCTAQAILVRGYMLPIERPKSITTRWVKEVAESLPVAGS
>P52687 2.7.13.3~~~citA~~~Sensor histidine kinase CitA~~~
MSIYPMYTRKITHWFARRSFQNRIFLLILFTSTIVMLAMSWYLTDITEERLHYQVGQRALIQAMQISAMPELVEAVQKRD
LARIKALIDPMRSFSDATYITVGDASGQRLYHVNPDEIGKSMEGGDSDEALINAKSYVSVRKGSLGSSLRGKSPIQDATG
KVIGIVSVGYTIEQLENWLSLQISSLLIPMAIMLLLLLFCARRFSLHIKKQMLNMEPQQLSQLLIQQSVLFESVFEGLIA
IDSDYKITAINQTARRLLNLSQPEPTLIGKRISSVISQEVFFYDAPQTNKKDEIVTFNQIKVIASRMAVILNNEPQGWVI
SFRSKDDINTLSLQLSQVQQYADNLRAVQHEHRNLISTIAGLLFLKRYNQALELIQQQSESHQKVIDFIARNFQDNHLAG
LLIGKYYRAKELGLELIFDPACFVDRLPTALSHNEWISIVGNLLDNAYNASLRQPQGSKQIECLINSDGQEVIIEIADQG
CGIDEALRDRIFERGVTSSASKDHGIGLWLVRSYVEQAGGSIVVENNIPFGTIFTLYIPLTRDEHHG
>P52688 ~~~citB~~~Transcriptional regulatory protein CitB~~~
MDSITTLIVEDEPMLAEILVDNIKQFPQFDVIGIADKLESARKQLRLYQPQLILLDNFLPDGKGIDLIRHAVSTHYKGRI
IFITADNHMETISEALRLGVFDYLIKPVHYQRLQHTLERFARYRSSLRSSEQASQLHVDALFNIQAREQTEPASAPLRGI
DESTFQRVLQLFADPTVVHTADSLARILGSSKTTARRYLEQGVKNDFLEAEISYGKVGRPERIYHGKQTYPEQR
>P77390 6.2.1.22~~~citC~~~[Citrate [pro-3S]-lyase] ligase~~~COG3053
MFGNDIFTRVKRSENKKMAEIAQFLHENDLSVDTTVEVFITVTRDEKLIACGGIAGNIIKCVAISESVRGEGLALTLATE
LINLAYERHSTHLFIYTKTEYEALFRQCGFSTLTSVPGVMVLMENSATRLKRYAESLKKFRHPGNKIGCIVMNANPFTNG
HRYLIQQAAAQCDWLHLFLVKEDSSRFPYEDRLDLVLKGTADIPRLTVHRGSEYIISRATFPCYFIKEQSVINHCYTEID
LKIFRQYLAPALGVTHRFVGTEPFCRVTAQYNQDMRYWLETPTISAPPIELVEIERLRYQEMPISASRVRQLLAKNDLTA
IAPLVPAVTLHYLQNLLEHSRQDAAARQKTPA
>O53076 6.2.1.22~~~citC~~~[Citrate [pro-3S]-lyase] ligase~~~
MRRDWQNFLMACGIKNFDDSELNPLDITIAVYENEEIIGTGSIAGDVIKYVAVQETTMSGHSTLFNQLMTKLENFMAVEG
RFHQFVLRNQFTKKVLNTLASKRWLSVNKEFCWKKDYQILRNTCQQFPSQTPIDKVASVVINANPFTNGHRFLIEEASRN
NELVYVFVLNQEASLFHTDERIALVKAGVQDLSNVIVVNGGAYIISYLTFPAYFLKHNDSAIDYQTTIDVRLFKYKIASA
LGITSRYVGSEPLSHTTNLYNQKLISELNPQIEVHVIQRKLAAGDLGVISARTVREAIDKGDEAVWQKMVTETTQHFISN
NLLELQQRIRKGQKINGN
>P69330 ~~~citD~~~Citrate lyase acyl carrier protein~~~COG3052
MKINQPAVAGTLESGDVMIRIAPLDTQDIDLQINSSVEKQFGDAIRTTILDVLARYNVRGVQLNVDDKGALDCILRARLE
ALLARASGIPALPWEDCQ
>P02903 ~~~citD~~~Citrate lyase acyl carrier protein~~~
MEMKIDALAGTLESSDVMVRIGPAAQPGIQLEIDSIVKQQFGAAIEQVVRETLAQLGVKQANVVVDDKGALECVLRARVQ
AAALRAAQQTQLQWSQL
>Q9RUZ0 4.1.-.-~~~~~~Citrate lyase subunit beta-like protein~~~COG2301
MNAPPALLRSVLFAPGNRADLIAKLPRSAPDAVVIDLEDAVPGTAEAKAAARPVAHDAARDLIAAAPHLAVFVRVNALHS
PYFEDDLSVLTPELSGVVVPKLEMGAEARQVAQMLQERSLPLPILAGLETGAGVWNAREIMEVPEVAWAYFGAEDYTTDL
GGKRTPGGLEVLYARSQVALAARLTGVAALDIVVTALNDPETFRADAEQGRALGYSGKLCIHPAQVALAHEYFGPTEADR
ARARALLDAAAAAAQRGHGAFSFEGQMVDEPMLAKARTLLSHEA
>P9WPE1 4.1.-.-~~~citE~~~Citrate lyase subunit beta-like protein~~~COG2301
MNLRAAGPGWLFCPADRPERFAKAAAAADVVILDLEDGVAEAQKPAARNALRDTPLDPERTVVRINAGGTADQARDLEAL
AGTAYTTVMLPKAESAAQVIELAPRDVIALVETARGAVCAAEIAAADPTVGMMWGAEDLIATLGGSSSRRADGAYRDVAR
HVRSTILLAASAFGRLALDAVHLDILDVEGLQEEARDAAAVGFDVTVCIHPSQIPVVRKAYRPSHEKLAWARRVLAASRS
ERGAFAFEGQMVDSPVLTHAETMLRRAGEATSE
>P0A9I1 4.1.3.6~~~citE~~~Citrate lyase subunit beta~~~COG2301
MISASLQQRKTRTRRSMLFVPGANAAMVSNSFIYPADALMFDLEDSVALREKDTARRMVYHALQHPLYRDIETIVRVNAL
DSEWGVNDLEAVVRGGADVVRLPKTDTAQDVLDIEKEILRIEKACGREPGSTGLLAAIESPLGITRAVEIAHASERLIGI
ALGAEDYVRNLRTERSPEGTELLFARCSILQAARSAGIQAFDTVYSDANNEAGFLQEAAHIKQLGFDGKSLINPRQIDLL
HNLYAPTQKEVDHARRVVEAAEAAAREGLGVVSLNGKMVDGPVIDRARLVLSRAELSGIREE
>P17725 4.1.3.6~~~citE~~~Citrate lyase subunit beta~~~
MKPRRSMLFIPGANAAMLSTSFVYGADAVMFDLEDAVSLREKDTARLLVYQALQHPLYQDIETVVRINPLNTPFGLADLE
AVVRAGVDMVRLPKTDSKEDIHELEAHVERIERECGREVGSTKLMAAIESALGVVNAVEIARASPRLAAIALAAFDYVMD
MGTSRGDGTELFYARCAVLHAARVAGIAAYDVVWSDINNEEGFLAEANLAKNLGFNGKSLVNPRQIELLHQVYAPTRKEV
DHALEVIAAAEEAETRGLGVVSLNGKMIDGPIIDHARKVVALSASGIRD
>P77231 2.4.2.52~~~citG~~~2-(5''-triphosphoribosyl)-3'-dephosphocoenzyme-A synthase~~~COG1767
MSMPATSTKTTKLATSLIDEYALLGWRAMLTEVNLSPKPGLVDRINCGAHKDMALEDFHRSALAIQGWLPRFIEFGACSA
EMAPEAVLHGLRPIGMACEGDMFRATAGVNTHKGSIFSLGLLCAAIGRLLQLNQPVTPTTVCSTAASFCRGLTDRELRTN
NSQLTAGQRLYQQLGLTGARGEAEAGYPLVINHALPHYLTLLDQGLDPELALLDTLLLLMAINGDTNVASRGGEGGLRWL
QREAQTLLQKGGIRTPADLDYLRQFDRECIERNLSPGGSADLLILTWFLAQI
>P55069 ~~~citM~~~Mg(2+)/citrate complex secondary transporter~~~COG2851
MLAILGFLMMLVFMALIMTKRLSVLTALVLTPIVFALIAGFGFTEVGDMMISGIQQVAPTAVMIMFAILYFGIMIDTGLF
DPMVGKILSMVKGDPLKIVVGTAVLTMLVALDGDGSTTYMITTSAMLPLYLLLGIRPIILAGIAGVGMGIMNTIPWGGAT
PRAASALGVDPAELTGPMIPVIASGMLCMVAVAYVLGKAERKRLGVIELKQPANANEPAAAVEDEWKRPKLWWFNLLLTL
SLIGCLVSGKVSLTVLFVIAFCIALIVNYPNLEHQRQRIAAHSSNVLAIGSMIFAAGVFTGILTGTKMVDEMAISLVSMI
PEQMGGLIPAIVALTSGIFTFLMPNDAYFYGVLPILSETAVAYGVDKVEIARASIIGQPIHMLSPLVPSTHLLVGLVGVS
IDDHQKFALKWAVLAVIVMTAIALLIGAISISV
>P42308 ~~~citN~~~Citrate transporter~~~COG2851
MLAILGFVMMIVFMYLIMSNRLSALIALIVVPIVFALISGFGKDLGEMMIQGVTDLAPTGIMLLFAILYFGIMIDSGLFD
PLIAKILSFVKGDPLKIAVGTAVLTMTISLDGDGTTTYMITIAAMLPLYKRLGMNRLVLAGIAMLGSGVMNIIPWGGPTA
RVLASLKLDTSEVFTPLIPAMIAGILWVIAVAYILGKKERKRLGVISIDHAPSSDPEAAPLKRPALQWFNLLLTVALMAA
LITSLLPLPVLFMTAFAVALMVNYPNVKEQQKRISAHAGNALNVVSMVFAAGIFTGILSGTKMVDAMAHSLVSLIPDAMG
PHLPLITAIVSMPFTFFMSNDAFYFGVLPIIAEAASAYGIDAAEIGRASLLGQPVHLLSPLVPSTYLLVGMAGVSFGDHQ
KFTIKWAVGTTIVMTIAALLIGIISF
>P31602 ~~~citS~~~Citrate-sodium symporter~~~
MTNMSQPPATEKKGVSDLLGFKIFGMPLPLYAFALITLLLSHFYNALPTDIVGGFAIMFIIGAIFGEIGKRLPIFNKYIG
GAPVMIFLVAAYFVYAGIFTQKEIDAISNVMDKSNFLNLFIAVLITGAILSVNRRLLLKSLLGYIPTILMGIVGASIFGI
AIGLVFGIPVDRIMMLYVLPIMGGGNGAGAVPLSEIYHSVTGRSREEYYSTAIAILTIANIFAIVFAAVLDIIGKKHTWL
SGEGELVRKASFKVEEDEKTGQITHRETAVGLVLSTTCFLLAYVVAKKILPSIGGVAIHYFAWMVLIVAALNASGLCSPE
IKAGAKRLSDFFSKQLLWVLMVGVGVCYTDLQEIINAITFANVVIAAIIVIGAVLGAAIGGWLMGFFPIESAITAGLCMA
NRGGSGDLEVLSACNRMNLISYAQISSRLGGGIVLVIASIVFGMMI
>O34427 2.7.13.3~~~citS~~~Sensor protein CitS~~~COG3290
MVKKRFHFSLQTKIMGLIAALLVFVIGVLTITLAVQHTQGERRQAEQLAVQTARTISYMPPVKELIERKDGHAAQTQEVI
EQMKEQTGAFAIYVLNEKGDIRSASGKSGLKKLERSREILFGGSHVSETKADGRRVIRGSAPIIKEQKGYSQVIGSVSVD
FLQTETEQSIKKHLRNLSVIAVLVLLLGFIGAAVLAKSIRKDTLGLEPHEIAALYRERNAMLFAIREGIIATNREGVVTM
MNVSAAEMLKLPEPVIHLPIDDVMPGAGLMSVLEKGEMLPNQEVSVNDQVFIINTKVMNQGGQAYGIVVSFREKTELKKL
IDTLTEVRKYSEDLRAQTHEFSNKLYAILGLLELGEYDEAIDLIKEEYAIQNEQHDLLFHNIHSQQVQAILLGKISKASE
KKVKLVIDENSSLAPLPAHIGLSHLITIIGNLIDNAFEAVAEQSVKEVLFFITDMGHDIVIEVSDTGPGVPPEKIEAVFE
RGYSSKGMRRGYGLANVKDSVRELGGWIELANQKTGGAVFTVFIPKEKQRGNPFDSHRDCGG
>O34534 ~~~citT~~~Transcriptional regulatory protein CitT~~~COG4565
MIHIAIAEDDFRVAQIHERLIKQLDGFKIIGKAANAKETLALLKEHKADLLLLDIYMPDELGTALIPDIRSRFPEVDIMI
ITAATETRHLQEALRAGIAHYLIKPVTADKFRQVLLQYKEKRKLLMSQPEVSQSMIDHIFGNGVKTALPAEDLPTGINSI
TLRKIKEALQTASEGLTAEELGEKMGASRTTARRYAEYLVSKEEARAELEYGIIGRPERKYYLAAD
>P0A6G5 2.7.7.61~~~citX~~~Apo-citrate lyase phosphoribosyl-dephospho-CoA transferase~~~COG3697
MHLLPELASHHAVSIPELLVSRDERQARQHVWLKRHPVPLVSFTVVAPGPIKDSEVTRRIFNHGVTALRALAAKQGWQIQ
EQAALVSASGPEGMLSIAAPARDLKLATIELEHSHPLGRLWDIDVLTPEGEILSRRDYSLPPRRCLLCEQSAAVCARGKT
HQLTDLLNRMEALLNDVDACNVN
>P37019 ~~~clcA~~~H(+)/Cl(-) exchange transporter ClcA~~~COG0038
MKTDTPSLETPQAARLRRRQLIRQLLERDKTPLAILFMAAVVGTLVGLAAVAFDKGVAWLQNQRMGALVHTADNYPLLLT
VAFLCSAVLAMFGYFLVRKYAPEAGGSGIPEIEGALEDQRPVRWWRVLPVKFFGGLGTLGGGMVLGREGPTVQIGGNIGR
MVLDIFRLKGDEARHTLLATGAAAGLAAAFNAPLAGILFIIEEMRPQFRYTLISIKAVFIGVIMSTIMYRIFNHEVALID
VGKLSDAPLNTLWLYLILGIIFGIFGPIFNKWVLGMQDLLHRVHGGNITKWVLMGGAIGGLCGLLGFVAPATSGGGFNLI
PIATAGNFSMGMLVFIFVARVITTLLCFSSGAPGGIFAPMLALGTVLGTAFGMVAVELFPQYHLEAGTFAIAGMGALLAA
SIRAPLTGIILVLEMTDNYQLILPMIITGLGATLLAQFTGGKPLYSAILARTLAKQEAEQLARSKAASASENT
>P11451 1.13.11.-~~~clcA~~~Chlorocatechol 1,2-dioxygenase~~~
MDKRVAEVAGAIVEAVRKILLDKRVTEAEYRAGVDYLTEVAQTRETALLLDVFLNSTIIEGKAQRSRTSAPAIQGPYFLE
GAPVVEGVLKTYDTDDHKPLIIRGTVRSDTGELLAGAVIDVWHSTPDGLYSGIHDNIPVDYYRGKLVTDSQGNYRVRTTM
PVPYQIPYEGPTGRLLGHLGSHTWRPAHVHFKVRKDGFEPLTTQYYFEGGKWVDDDCCHGVTPDLITPETIEDGVRVMTL
DFVIEREQAEQRKSATETVA
>O67987 1.13.11.-~~~clcA~~~Chlorocatechol 1,2-dioxygenase~~~
MANTRVIELFDEFTDLIRDFIVRHEITTPEYETIMQYMISVGEAGEWPLWLDAFFETTVDSVSYGKGNWTSSAIQGPFFK
EGAPLLTGKPATLPMRADEPGDRMRFTGSVRDTSGTPITGAVIDVWHSTNDGNYSFFSPALPDQYLLRGRVVPAEDGSIE
FHSIRPVPYEIPKAGPTGQLMNSYLGRHSWRPAHIHIRITADGYRPLITQLYFEGDPYLDSDSCSAVKSELVLPVNKIDI
DGETWQLVDFNFILQHN
>Q8ZRP8 ~~~clcA~~~H(+)/Cl(-) exchange transporter ClcA~~~
MKTDTSTFLAQQIVRLRRRDQIRRLMQRDKTPLAILFMAAVVGTLTGLVGVAFEKAVSWVQNMRIGALVQVADHAFLLWP
LAFILSALLAMVGYFLVRKFAPEAGGSGIPEIEGALEELRPVRWWRVLPVKFIGGMGTLGAGMVLGREGPTVQIGGNLGR
MVLDVFRMRSAEARHTLLATGAAAGLSAAFNAPLAGILFIIEEMRPQFRYNLISIKAVFTGVIMSSIVFRIFNGEAPIIE
VGKLSDAPVNTLWLYLILGIIFGCVGPVFNSLVLRTQDMFQRFHGGEIKKWVLMGGAIGGLCGILGLIEPAAAGGGFNLI
PIAAAGNFSVGLLLFIFITRVVTTLLCFSSGAPGGIFAPMLALGTLLGTAFGMAAAVLFPQYHLEAGTFAIAGMGALMAA
SVRAPLTGIVLVLEMTDNYQLILPMIITCLGATLLAQFLGGKPLYSTILARTLAKQDAEQAEKNQNAPADENT
>Q3Z5K2 ~~~clcA~~~H(+)/Cl(-) exchange transporter ClcA~~~
MKTDTPSLETPQAARLRRRQLIRQLLERDKTPLAILFMAAVVGTLVGLAAVAFDKGVAWLQNQRMGALVHTADNYPLLLT
VAFLCSAVLAMFGYFLVRKYAPEAGGSGIPEIEGALEDQRPVRWWRVLPVKFFGGLGTLGGGMVLGREGPTVQIGGNIGR
MVLDIFRLKGDEARHTLLATGAAAGLAAAFNAPLAGILFIIEEMRPQFRYTLISIKAVFIGVIMSTIMYRIFNHEVALID
VGKLSDAPLNTLWLYLILGIIFGIFGPIFNKWVLGMQDLLHRVHGGNITKWVLMGGAIGGLCGLLGFVAPATSGGGFNLI
PIATAGNFSMGMLVFIFVARVITTLLCFSSGAPGGIFAPMLALGTVLGTAFGMVAVELFPQYHLEAGTFAIAGMGALLAA
SIRAPLTGIILVLEMTDNYQLILPMIITGLGATLLAQFTGGKPLYSAILARTLAKQEAEQLARSKAASASENT
>P76175 ~~~clcB~~~Voltage-gated ClC-type chloride channel ClcB~~~COG0038
MFHRLLIATVVGILAAFAVAGFRHAMLLLEWLFLNNDSGSLVNAATNLSPWRRLLTPALGGLAAGLLLMGWQKFTQQRPH
APTDYMEALQTDGQFDYAASLVKSLASLLVVTSGSAIGREGAMILLAALAASCFAQRFTPRQEWKLWIACGAAAGMAAAY
RAPLAGSLFIAEVLFGTMMLASLGPVIISAVVALLVSNLINHSDALLYNVQLSVTVQARDYALIISTGVLAGLCGPLLLT
LMNACHRGFVSLKLAPPWQLALGGLIVGLLSLFTPAVWGNGYSTVQSFLTAPPLLMIIAGIFLCKLCAVLASSGSGAPGG
VFTPTLFIGLAIGMLYGRSLGLWFPDGEEITLLLGLTGMATLLAATTHAPIMSTLMICEMTGEYQLLPGLLIACVIASVI
SRTLHRDSIYRQHTAQHS
>P0A115 3.1.1.45~~~clcD~~~Carboxymethylenebutenolidase~~~COG0412
MLTEGISIQSYDGHTFGALVGSPAKAPAPVIVIAQEIFGVNAFMRETVSWLVDQGYAAVCPDLYARQAPGTALDPQDERQ
REQAYKLWQAFDMEAGVGDLEAAIRYARHQPYSNGKVGLVGYCLGGALAFLVAAKGYVDRAVGYYGVGLEKQLKKVPEVK
HPALFHMGGQDHFVPAPSRQLITEGFGANPLLQVHWYEEAGHSFARTSSSGYVASAAALANERRLDFLAPLQSKKP
>P0A114 3.1.1.45~~~clcD~~~Carboxymethylenebutenolidase~~~
MLTEGISIQSYDGHTFGALVGSPAKAPAPVIVIAQEIFGVNAFMRETVSWLVDQGYAAVCPDLYARQAPGTALDPQDERQ
REQAYKLWQAFDMEAGVGDLEAAIRYARHQPYSNGKVGLVGYCLGGALAFLVAAKGYVDRAVGYYGVGLEKQLKKVPEVK
HPALFHMGGQDHFVPAPSRQLITEGFGANPLLQVHWYEEAGHSFARTSSSGYVASAAALANERRLDFLAPLQSKKP
>A0A0H5BB10 5.5.1.12~~~cldB~~~Copalyl diphosphate synthase~~~
MSSISHQAAPTRAQHTNSYVQLARQLVTSVDNDPWGDVPPSVYETARVTSWAPWLEGHERRLAWLLERQSAAGSWGEGPT
PYRLLPTLSVTEALLSTLRQNTAAGVSRERLAAAVDNGLAALRDLSGTGGWPDTAAIEILAPDLVVLINDHLDQPEVAAL
PRLGPWARGQRLAQPHGFQAALPDRVAERCQVAGGVPLKLHHTFEGVARRLPRMVPGVPGGLLGSSPAATAAWLATGPDE
GRDQAVTALTAVAERYDGLFPEATPISVFERLWISVALARPGLPAACVPTIRAWAAEIYDATGVRGAPGLLPDTDDTAMA
VLASALAGSPRDPSPLSAFEAGDHYDCYVGEDTGSSTANAHALQALTAWLSHRPATGDALQARRDLTRDWLLAQQESDGA
WRDKWHASPYYATERCVTALSGHTGPTTRDAIRSAADWVLDAQSDDGSWGVWGGTAEETAYAVNILLNSPDHTGTPEATQ
ALKLAENVLREAVHSSGHHHPALWHDKTLYAPQAMAQAEVIAALELLQARRP
>A0A0H5BN57 4.2.3.193~~~cldD~~~(12E)-labda-8(17),12,14-triene synthase~~~
MTRTGDAVTILPQPDFTATFPGPFPTSPHGERTERQLLGWLEEYPLLPSARARSVLVNITSHGVSRTLPTADADDLVLFA
ELLLWLTAFDDMHGESNAARDLVALVDRTAELTLVLAGGSPPPLTNPFPAALYDLLARFRARTGPAAYLRLAASLRDTIM
ALVWEAHHVAEPERVALETYLEMRPHTVFVRTIFAAAEIVLDYELTDAQRALAPVRHLETAVANLAGWINDLASYEREAA
RGPAQPLSLPTLLRARHGGSLEEAFARAGGMCENEAAVARQGITSLAGDPPSALTAHARALEDIARSFVWHTSHARYQGP
KRGAAPTSR
>Q47CX0 1.13.11.49~~~~~~Chlorite dismutase~~~COG3253
MTNLSIHNFKLSLVAAVIGSAMVMTSSPVAAQQAMQPMQSMKIERGTILTQPGVFGVFTMFKLRPDWNKVPVAERKGAAE
EVKKLIEKHKDNVLVDLYLTRGLETNSDFFFRINAYDLAKAQTFMREFRSTTVGKNADVFETLVGVTKPLNYISKDKSPG
LNAGLSSATYSGPAPRYVIVIPVKKNAEWWNMSPEERLKEMEVHTTPTLAYLVNVKRKLYHSTGLDDTDFITYFETDDLT
AFNNLMLSLAQVKENKFHVRWGSPTTLGTIHSPEDVIKALAD
>Q9F437 1.13.11.49~~~cld~~~Chlorite dismutase~~~
MKVRCVSLVAAGLLTIAGSAIGQPAPAPMPAMAPAAKPAMNTPVDRAKILSAPGVFVAFSTYKIRPDYFKVALAERKGAA
DEVMAVLEKHKEKVIVDAYLTRGYEAKSDYFLRVHAYDAVAAQAFLVDFRATRFGMYSDVTESLVGITKALNYISKDKSP
DLNKGLSGATYAGDAPRFAFMIPVKKNADWWNLTDEQRLKEMETHTLPTLPFLVNVKRKLYHSTGLDDTDFITYFETNDL
GAFNNLMLSLAKVPENKYHVRWGNPTVLGTIQPIENLVKTLSMGN
>Q2G015 ~~~clfA~~~Clumping factor A~~~COG2931
MNMKKKEKHAIRKKSIGVASVLVGTLIGFGLLSSKEADASENSVTQSDSASNESKSNDSSSVSAAPKTDDTNVSDTKTSS
NTNNGETSVAQNPAQQETTQSSSTNATTEETPVTGEATTTTTNQANTPATTQSSNTNAEELVNQTSNETTSNDTNTVSSV
NSPQNSTNAENVSTTQDTSTEATPSNNESAPQSTDASNKDVVNQAVNTSAPRMRAFSLAAVAADAPVAGTDITNQLTNVT
VGIDSGTTVYPHQAGYVKLNYGFSVPNSAVKGDTFKITVPKELNLNGVTSTAKVPPIMAGDQVLANGVIDSDGNVIYTFT
DYVNTKDDVKATLTMPAYIDPENVKKTGNVTLATGIGSTTANKTVLVDYEKYGKFYNLSIKGTIDQIDKTNNTYRQTIYV
NPSGDNVIAPVLTGNLKPNTDSNALIDQQNTSIKVYKVDNAADLSESYFVNPENFEDVTNSVNITFPNPNQYKVEFNTPD
DQITTPYIVVVNGHIDPNSKGDLALRSTLYGYNSNIIWRSMSWDNEVAFNNGSGSGDGIDKPVVPEQPDEPGEIEPIPED
SDSDPGSDSGSDSNSDSGSDSGSDSTSDSGSDSASDSDSASDSDSASDSDSASDSDSASDSDSDNDSDSDSDSDSDSDSD
SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD
SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSASDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD
SDSDSDSESDSDSDSDSDSDSDSDSDSDSDSASDSDSGSDSDSSSDSDSESDSNSDSESVSNNNVVPPNSPKNGTNASNK
NEAKDSKEPLPDTGSEDEANTSLIWGLLASIGSLLLFRRKKENKDKK
>Q53653 ~~~clfA~~~Clumping factor A~~~
MNMKKKEKHAIRKKSIGVASVLVGTLIGFGLLSSKEADASENSVTQSDSASNESKSNDSSSVSAAPKTDDTNVSDTKTSS
NTNNGETSVAQNPAQQETTQSSSTNATTEETPVTGEATTTTTNQANTPATTQSSNTNAEELVNQTSNETTFNDTNTVSSV
NSPQNSTNAENVSTTQDTSTEATPSNNESAPQSTDASNKDVVNQAVNTSAPRMRAFSLAAVAADAPAAGTDITNQLTNVT
VGIDSGTTVYPHQAGYVKLNYGFSVPNSAVKGDTFKITVPKELNLNGVTSTAKVPPIMAGDQVLANGVIDSDGNVIYTFT
DYVNTKDDVKATLTMPAYIDPENVKKTGNVTLATGIGSTTANKTVLVDYEKYGKFYNLSIKGTIDQIDKTNNTYRQTIYV
NPSGDNVIAPVLTGNLKPNTDSNALIDQQNTSIKVYKVDNAADLSESYFVNPENFEDVTNSVNITFPNPNQYKVEFNTPD
DQITTPYIVVVNGHIDPNSKGDLALRSTLYGYNSNIIWRSMSWDNEVAFNNGSGSGDGIDKPVVPEQPDEPGEIEPIPED
SDSDPGSDSGSDSNSDSGSDSGSDSTSDSGSDSASDSDSASDSDSASDSDSASDSDSASDSDSDNDSDSDSDSDSDSDSD
SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD
SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSASDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD
SDSDSDSESDSDSESDSDSDSDSDSDSDSDSDSDSDSASDSDSGSDSDSSSDSDSESDSNSDSESGSNNNVVPPNSPKNG
TNASNKNEAKDSKEPLPDTGSEDEANTSLIWGLLASIGSLLLFRRKKENKDKK
>Q99VJ4 ~~~clfA~~~Clumping factor A~~~
MNMKKKEKHAIRKKSIGVASVLVGTLIGFGLLSSKEADASENSVTQSDSASNESKSNDSSSVSAAPKTDDTNVSDTKTSS
NTNNGETSVAQNPAQQETTQSSSTNATTEETPVTGEATTTTTNQANTPATTQSSNTNAEELVNQTSNETTSNDTNTVSSV
NSPQNSTNAENVSTTQDTSTEATPSNNESAPQNTDASNKDVVSQAVNPSTPRMRAFSLAAVAADAPAAGTDITNQLTDVK
VTIDSGTTVYPHQAGYVKLNYGFSVPNSAVKGDTFKITVPKELNLNGVTSTAKVPPIMAGDQVLANGVIDSDGNVIYTFT
DYVDNKENVTANITMPAYIDPENVTKTGNVTLTTGIGTNTASKTVLIDYEKYGQFHNLSIKGTIDQIDKTNNTYRQTIYV
NPSGDNVVLPALTGNLIPNTKSNALIDAKNTDIKVYRVDNANDLSESYYVNPSDFEDVTNQVRISFPNANQYKVEFPTDD
DQITTPYIVVVNGHIDPASTGDLALRSTFYGYDSNFIWRSMSWDNEVAFNNGSGSGDGIDKPVVPEQPDEPGEIEPIPED
SDSDPGSDSGSDSNSDSGSDSGSDSTSDSGSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSD
SASDSDSASDSDSASDSDSASDSDSASDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD
SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD
SDSDSDSDSDSDSDSDSDSDSASDSDSDSDSESDSDSDSDSDSDSDSDSDSDSDSESDSDSDSDSDSESDSDSDSDSDSD
SASDSDSGSDSDSSSDSDSDSTSDTGSDNDSDSDSNSDSESGSNNNVVPPNSPKNGTNASNKNEAKDSKEPLPDTGSEDE
ANTSLIWGLLASLGSLLLFRRKKENKDKK
>Q2FUY2 ~~~clfB~~~Clumping factor B~~~COG2931
MKKRIDYLSNKQNKYSIRRFTVGTTSVIVGATILFGIGNHQAQASEQSNDTTQSSKNNASADSEKNNMIETPQLNTTAND
TSDISANTNSANVDSTTKPMSTQTSNTTTTEPASTNETPQPTAIKNQATAAKMQDQTVPQEANSQVDNKTTNDANSIATN
SELKNSQTLDLPQSSPQTISNAQGTSKPSVRTRAVRSLAVAEPVVNAADAKGTNVNDKVTASNFKLEKTTFDPNQSGNTF
MAANFTVTDKVKSGDYFTAKLPDSLTGNGDVDYSNSNNTMPIADIKSTNGDVVAKATYDILTKTYTFVFTDYVNNKENIN
GQFSLPLFTDRAKAPKSGTYDANINIADEMFNNKITYNYSSPIAGIDKPNGANISSQIIGVDTASGQNTYKQTVFVNPKQ
RVLGNTWVYIKGYQDKIEESSGKVSATDTKLRIFEVNDTSKLSDSYYADPNDSNLKEVTDQFKNRIYYEHPNVASIKFGD
ITKTYVVLVEGHYDNTGKNLKTQVIQENVDPVTNRDYSIFGWNNENVVRYGGGSADGDSAVNPKDPTPGPPVDPEPSPDP
EPEPTPDPEPSPDPEPEPSPDPDPDSDSDSDSGSDSDSGSDSDSESDSDSDSDSDSDSDSDSESDSDSESDSESDSDSDS
DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS
DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS
DSDSRVTPPNNEQKAPSNPKGEVNHSNKVSKQHKTDALPETGDKSENTNATLFGAMMALLGSLLLFRKRKQDHKEKA
>O86476 ~~~clfB~~~Clumping factor B~~~
MKKRIDYLSNKQNKYSIRRFTVGTTSVIVGATILFGIGNHQAQASEQSNDTTQSSKNNASADSEKNNMIETPQLNTTAND
TSDISANTNSANVDSTTKPMSTQTSNTTTTEPASTNETPQPTAIKNQATAAKMQDQTVPQEANSQVDNKTTNDANSIATN
SELKNSQTLDLPQSSPQTISNAQGTSKPSVRTRAVRSLAVAEPVVNAADAKGTNVNDKVTASNFKLEKTTFDPNQSGNTF
MAANFTVTDKVKSGDYFTAKLPDSLTGNGDVDYSNSNNTMPIADIKSTNGDVVAKATYDILTKTYTFVFTDYVNNKENIN
GQFSLPLFTDRAKAPKSGTYDANINIADEMFNNKITYNYSSPIAGIDKPNGANISSQIIGVDTASGQNTYKQTVFVNPKQ
RVLGNTWVYIKGYQDKIEESSGKVSATDTKLRIFEVNDTSKLSDSYYADPNDSNLKEVTDQFKNRIYYEHPNVASIKFGD
ITKTYVVLVEGHYDNTGKNLKTQVIQENVDPVTNRDYSIFGWNNENVVRYGGGSADGDSAVNPKDPTPGPPVDPEPSPDP
EPEPTPDPEPSPDPEPEPSPDPDPDSDSDSDSGSDSDSGSDSDSESDSDSDSDSDSDSDSDSESDSDSESDSDSDSDSDS
DSDSDSDSDSDSDSDSDSDSDSDSESDSDSESDSESDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS
DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS
DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSRVTPPNNEQKAPSNPKGEVNHSNKVSKQHKTDALPETGDK
SENTNATLFGAMMALLGSLLLFRKRKQDHKEKA
>Q7A382 ~~~clfB~~~Clumping factor B~~~
MKKRIDYLSNKQNKYSIRRFTVGTTSVIVGATILFGIGNHQAQASEQSNDTTQSSKNNASADSEKNNMIETPQLNTTAND
TSDISANTNSANVDSTTKPMSTQTSNTTTTEPASTNETPQPTAIKNQATAAKMQDQTVPQEANSQVDNKTTNDANSIATN
SELKNSQTLDLPQSSPQTISNAQGTSKPSVRTRAVRSLAVAEPVVNAADAKGTNVNDKVTASNFKLEKTTFDPNQSGNTF
MAANFTVTDKVKSGDYFTAKLPDSLTGNGDVDYSNSNNTMPIADIKSTNGDVVAKATYDILTKTYTFVFTDYVNNKENIN
GQFSLPLFTDRAKAPKSGTYDANINIADEMFNNKITYNYSSPIAGIDKPNGANISSQIIGVDTASGQNTYKQTVFVNPKQ
RVLGNTWVYIKGYQDKIEESSGKVSATDTKLRIFEVNDTSKLSDSYYADPNDSNLKEVTDQFKNRIYYEHPNVASIKFGD
ITKTYVVLVEGHYDNTGKNLKTQVIQENVDPVTNRDYSIFGWNNENVVRYGGGSADGDSAVNPKDPTPGPPVDPEPSPDP
EPEPTPDPEPSPDPEPEPSPDPDPDSDSDSDSGSDSDSGSDSDSESDSDSDSDSDSDSDSDSESDSDSESDSDSDSDSDS
DSDSDSESDSDSDSDSDSDSDSDSESDSDSESDSESDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSESDSDSDS
DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS
DSDSRVTPPNNEQKAPSNPKGEVNHSNKVSKQHKTDALPETGDKSENTNATLFGAMMALLGSLLLFRKRKQDHKEKA
>Q6GDH2 ~~~clfB~~~Clumping factor B~~~
MKKRIDYLSNKQNKYSIRRFTVGTTSVIVGATILFGIGNHQAQASEQSNDTTQSSKNNASADSEKNNTIETPQLNTTAND
TSDISANTNSANVDSTAKTMSTQTSNTTTTEPASTNETPQPTAIKDQATAAKMQDQTVPQEANSQVDNKTTNDANNIATN
SELKNPQTLDLPQSSPQTISNAQGTSKPSVRTRAVRSLAVAEPVVNAADAKGTNVNDKVTASDFKLEKTAFDPNQSGNTF
MAANFKVTGQVKSGDYFTAKLPDSVTGNGDVDYSNSNNTMPIADIKSTNGDVVAKATYDILTKTYTFVFTDYVNDKENIN
GQFSLPLFTDRAKAPKSGTYDANINIADEMFDNKITYNYSSPIAGIDKPNGANISSQIIGVDTASGQNTYKQTVFVNPKQ
RVLGNTWVYIKGYQDKIEESSGKVSATDTKLRIFEVNDTSKLSDSYYADPNDSNLKEVTGEFKDKISYKYDNVASINFGD
INKTYVVLVEGHYDNTGKNLKTQVIQENIDPATGKDYSIFGWNNENVVRYGGGSADGDSAVNPVDPTPGPPVDPEPEPEP
TPDPEPSPEPEPEPTPDPEPSPEPDPDSDSDSDSGSDSDSGSDSDSDSDSDSDSDSDSNSDSESDSDSDSDSDSDSDSDS
DSDSDSDSDSDSESDSDSDSDSDSDSDSDSDSDSDSDSNSDSDSDSDSDSDSDSDSDSDSDSESDSDSDSDSDSDSDSDS
DSDSDSDSDSDSDSDSDSESDSESDSDSDSESDSDSDPDSESDSDSDSDSDSDSDSDSESDSDSESDSDSDSDSDSDSDS
RVTPPNNEQKAPSNPKGEVNHSNKASKQHQTDALPETGDKSENTNATLFDAMVALLGSLLLFRKRKQDHKEKA
>A0QVU1 ~~~clgR~~~Transcriptional regulator ClgR~~~COG1396
MTALLREVIGDVLRNARTDQGRTLREVSDAARVSLGYLSEVERGRKEASSELLSAICDALDVPLSRVLTDAGESMARREH
DAREAEQVAVANLGRIDAATKVVIPQVSMAVA
>P9WMH6 ~~~clgR~~~Transcriptional regulator ClgR~~~
MAALVREVVGDVLRGARMSQGRTLREVSDSARVSLGYLSEIERGRKEPSSELLSAICTALQLPLSVVLIDAGERMARQER
LARATPAGRATGATIDASTKVVIAPVVSLAVA
>P9WMH7 ~~~clgR~~~Transcriptional regulator ClgR~~~COG1396
MAALVREVVGDVLRGARMSQGRTLREVSDSARVSLGYLSEIERGRKEPSSELLSAICTALQLPLSVVLIDAGERMARQER
LARATPAGRATGATIDASTKVVIAPVVSLAVA
>Q8GHB2 2.5.1.111~~~cloQ~~~4-hydroxyphenylpyruvate 3-dimethylallyltransferase~~~
MPALPIDQEFDCERFRADIRATAAAIGAPIAHRLTDTVLEAFRDNFAQGATLWKTTSQPGDQLSYRFFSRLKMDTVSRAI
DAGLLDAAHPTLAVVDAWSSLYGGAPVQSGDFDAGRGMAKTWLYFGGLRPAEDILTVPALPASVQARLKDFLALGLAHVR
FAAVDWRHHSANVYFRGKGPLDTVQFARIHALSGSTPPAAHVVEEVLAYMPEDYCVAITLDLHSGDIERVCFYALKVPKN
ALPRIPTRIARFLEVAPSHDVEECNVIGWSFGRSGDYVKAERSYTGNMAEILAGWNCFFHGEEGRDHDLRALHQHTESTM
GGAR
>Q8GHB1 1.13.11.83~~~cloR~~~4-hydroxy-3-prenylphenylpyruvate oxygenase/4-hydroxy-3-prenylbenzoate synthase~~~
MSKALANMPGDDYFRHPPVFDTYAEHRAYLKFRHAVALRHFARLGFDQDGLAGLITVADPEHADTYWANPLAHPFSTITP
ADLIRVDGDSTETVDGQRRVNIAAFNIHAEIHRARPDVQAVIHLHTVYGRAFSAFARKLPPLTQDACPFFEDHEVFDDYT
GLVLAKDDGRRIAKQLRGHKAILLKNHGLVTVGETLDAAAWWFTLLDTCCHVQLLADAAGGAEPIPAEVARLTGQQLGSH
LLGWNSYQPLHEATLARNPDLAAMAPALPPQTPALAR
>P09870 3.4.22.8~~~cloSI~~~Clostripain~~~
MLRRKVSTLLMTALITTSFLNSKPVYANPVTKSKDNNLKEVQQVTSKSNKNKNQKVTIMYYCDADNNLEGSLLNDIEEMK
TGYKDSPNLNLIALVDRSPRYSSDEKVLGEDFSDTRLYKIEHNKANRLDGKNEFPEISTTSKYEANMGDPEVLKKFIDYC
KSNYEADKYVLIMANHGGGAREKSNPRLNRAICWDDSNLDKNGEADCLYMGEISDHLTEKQSVDLLAFDACLMGTAEVAY
QYRPGNGGFSADTLVASSPVVWGPGFKYDKIFDRIKAGGGTNNEDDLTLGGKEQNFDPATITNEQLGALFVEEQRDSTHA
NGRYDQHLSFYDLKKAESVKRAIDNLAVNLSNENKKSEIEKLRGSGIHTDLMHYFDEYSEGEWVEYPYFDVYDLCEKINK
SENFSSKTKDLASNAMNKLNEMIVYSFGDPSNNFKEGKNGLSIFLPNGDKKYSTYYTSTKIPHWTMQSWYNSIDTVKYGL
NPYGKLSWCKDGQDPEINKVGNWFELLDSWFDKTNDVTGGVNHYQW
>P0ABH9 ~~~clpA~~~ATP-dependent Clp protease ATP-binding subunit ClpA~~~COG0542
MLNQELELSLNMAFARAREHRHEFMTVEHLLLALLSNPSAREALEACSVDLVALRQELEAFIEQTTPVLPASEEERDTQP
TLSFQRVLQRAVFHVQSSGRNEVTGANVLVAIFSEQESQAAYLLRKHEVSRLDVVNFISHGTRKDEPTQSSDPGSQPNSE
EQAGGEERMENFTTNLNQLARVGGIDPLIGREKELERAIQVLCRRRKNNPLLVGESGVGKTAIAEGLAWRIVQGDVPEVM
ADCTIYSLDIGSLLAGTKYRGDFEKRFKALLKQLEQDTNSILFIDEIHTIIGAGAASGGQVDAANLIKPLLSSGKIRVIG
STTYQEFSNIFEKDRALARRFQKIDITEPSIEETVQIINGLKPKYEAHHDVRYTAKAVRAAVELAVKYINDRHLPDKAID
VIDEAGARARLMPVSKRKKTVNVADIESVVARIARIPEKSVSQSDRDTLKNLGDRLKMLVFGQDKAIEALTEAIKMARAG
LGHEHKPVGSFLFAGPTGVGKTEVTVQLSKALGIELLRFDMSEYMERHTVSRLIGAPPGYVGFDQGGLLTDAVIKHPHAV
LLLDEIEKAHPDVFNILLQVMDNGTLTDNNGRKADFRNVVLVMTTNAGVRETERKSIGLIHQDNSTDAMEEIKKIFTPEF
RNRLDNIIWFDHLSTDVIHQVVDKFIVELQVQLDQKGVSLEVSQEARNWLAEKGYDRAMGARPMARVIQDNLKKPLANEL
LFGSLVDGGQVTVALDKEKNELTYGFQSAQKHKAEAAH
>P53533 ~~~clpB1~~~Chaperone protein ClpB 1~~~COG0542
MQPTNPNQFTEKAWEAIVRTTDVAKQAQHQQIESEHLFLALLQEPGLALNILKKAGLEAAQLQQFTERFIARQPKVSGGN
QSVYLGRSLDQLLDQADQFRKDFGDEFISVEHLILSFPRDSRFGRLLSQEFKVDEKQLRQIIQQIRGSQKVTDQNPEGKY
EALEKYGRDLTEMARRGKLDPVIGRDDEIRRTIQILSRRTKNNPVLIGEPGVGKTAIAEGLAQRIINGDVPQSLKDRRLI
ALDMGALIAGAKFRGEFEERLKAVLKEVTDSEGIIILFIDEIHTVVGAGAVQGSMDAGNLLKPMLARGELRCIGATTLDE
YRQYIEKDAALERRFQQVFVDQPTVEDTISILRGLKERYEVHHGVRISDNALVAAAVLSTRYISDRFLPDKAIDLVDESA
ARLKMEITSKPEELDEIDRKILQLEMERLSLQKESDLASQERLQRLEKELADLKEEQRSLSSQWQAEKDVITDIQSVKEE
IDQVNLLIQQAERDYDLNKAAELKYGKLTELQRKLNEMEGGLATTHTSGKSLLREEVTEVDIAEIISKWTGIPVSKLVES
EMQKLLNLDEELHQRVIGQEEAVSAVADAIQRSRAGLSDPKRPIASFIFLGPTGVGKTELAKALAAYLFDTEDAMIRIDM
SEYMEKHAVSRLIGAPPGYVGYDEGGQLTEAVRRRPYSVILFDEIEKAHPDVFNVMLQILDDGRVTDSRGRTVDFKNTIL
ILTSNIGSQYILDVAGDDSRYEEMRSRVTEALRANFRPEFLNRVDETIIFHSLRKDQLQQIVRIQLHRLEERLSDRKLSL
SMSPEAIDFLVEIGFDPVYGARPLKRVIQRELETAIAKAILRGQFSDGDTIQVAVENERLVFKAIATPTAVPLS
>Q7CEG6 ~~~clpB~~~Chaperone protein ClpB~~~
MNIEKYTERVRGFIQSAQTFALSSGNQQFTPEHILKVLIDDDEGLAASLVERAGGRVGDVRMGLQSALEKLPKVSGGNDQ
LYLSQPLAKVFSLAEELASKAGDSFVTVERLLTALAMEKSAKTSEILSAAGVTPTALNRVINDMRKGRTADSASAESNYD
ALKKYARDLTEDARAGKLDPVIGRDEEIRRTIQVLSRRTKNNPVLIGEPGVGKTAIAEGLALRIVNGDVPESLKDKQLMA
LDMGALIAGAKYRGEFEERLKAVLSEVQTAAGQIILFIDEMHTLVGAGKTDGAMDASNLLKPALARGELHCVGATTLEEY
RKYVEKDAALARRFQPVFVDEPTVEDTISILRGLKEKYEQHHKVRVSDSALVAAATLSNRYITDRFLPDKAIDLVDEAAS
RLRMHVDSKPEELDEIDRRIMQLKIEREALKVETDAASKDRLQRIEKELSDLEEESAELTAKWQAEKQKLGLAADLKRQL
EEARNALAIAQRNGEFQKAGELAYGTIPQLEKQLADAESQENKGSLLEETVTPDHVAQVISRWTGIPVDRMLEGEREKLL
RMEDEIGKRVVGQGEAVQAISKAVRRARAGLQDPNRPIGSFIFLGPTGVGKTELTKALASFLFQDDTAMVRIDMSEFMEK
HSVSRLIGAPPGYVGYEEGGVLTEAVRRRPYQVILFDEIEKAHPDVFNVLLQVLDDGRLTDGQGRTVDFRNTVIIMTSNL
GAEYLVNLGENDDVETVRDDVMGVVRASFRPEFLNRVDEIILFHRLRREDMGAIVDIQMQRLQYLLSDRKITLQLEDDAR
EWLANKGYDPAYGARPLKRVIQKEVQDPLAERILLGDILDGSLVKITAGSDRLNFRPISGAFSAAEPEREDEKA
>P53532 ~~~clpB~~~Chaperone protein ClpB~~~COG0542
MSSFNPTTKTNEAMQAALQQASSAGNPDIRPAHLLAAILEQTDGVAAPVLMATGVDPKEILAEAKKLVASYPKASGANMA
NPNFNRDALNAFTAAQELAGELGDEYVSTEVLLAGIARGKSDAADLLTNKGATYDAIKEAFPSVRGSQRVTTQDPEGQFQ
ALEKYSTDLTKLAREGKIDPVIGRDQEIRRVVQVLSRRTKNNPVLIGEPGVGKTAIVEGLARRIVAGDVPESLKGKTLIS
LDLGSMVAGAKYRGEFEERLKAVLDEIKGANGEVVTFIDELHTIVGAGASGESAMDAGNMIKPLLARGELRLVGATTLNE
YRKYIEKDAALERRFQQVYVGEPTVEDAIGILRGLKERYEVHHGVRIQDSALVAAAELSNRYITSRFLPDKAIDLVDEAA
SRLRMEIDSSPQEIDELERIVRRLEIEEMALSKESDAASKERLEKLRSELADEREKLSELKARWQNEKTAIDDVREMKEE
LEALRSESDIAERDGNYGRVAELRYGRIPELEKQIEDAESKVEVNENAMLTEEVTPDTIADVVSAWTGIPAGKMMQGETE
KLLNMERVLGNRVVGQLEAVTAVSDAVRRSRAGVADPNRPTGSFLFLGPTGVGKTELAKAVAEFLFDDDRAMIRIDMSEY
GEKHSVARLVGAPPGYVGYDQGGQLTEAVRRRPYTVVLFDEVEKAHPDVFDILLQVLDEGRLTDGQGRTVDFRNTILILT
SNLGAGGTREQMMDAVKMAFKPEFVNRLDDVVIFDRLSPEQLTSIVDIQIKQLTDRLAGRRLNLRVSDSAKAWLAERGYD
PAYGARPLRRLIQQAIGDTLAKELLAGNVRDGDGVLVDVADGGQKLDVSRAV
>P63284 ~~~clpB~~~Chaperone protein ClpB~~~COG0542
MRLDRLTNKFQLALADAQSLALGHDNQFIEPLHLMSALLNQEGGSVSPLLTSAGINAGQLRTDINQALNRLPQVEGTGGD
VQPSQDLVRVLNLCDKLAQKRGDNFISSELFVLAALESRGTLADILKAAGATTANITQAIEQMRGGESVNDQGAEDQRQA
LKKYTIDLTERAEQGKLDPVIGRDEEIRRTIQVLQRRTKNNPVLIGEPGVGKTAIVEGLAQRIINGEVPEGLKGRRVLAL
DMGALVAGAKYRGEFEERLKGVLNDLAKQEGNVILFIDELHTMVGAGKADGAMDAGNMLKPALARGELHCVGATTLDEYR
QYIEKDAALERRFQKVFVAEPSVEDTIAILRGLKERYELHHHVQITDPAIVAAATLSHRYIADRQLPDKAIDLIDEAASS
IRMQIDSKPEELDRLDRRIIQLKLEQQALMKESDEASKKRLDMLNEELSDKERQYSELEEEWKAEKASLSGTQTIKAELE
QAKIAIEQARRVGDLARMSELQYGKIPELEKQLEAATQLEGKTMRLLRNKVTDAEIAEVLARWTGIPVSRMMESEREKLL
RMEQELHHRVIGQNEAVDAVSNAIRRSRAGLADPNRPIGSFLFLGPTGVGKTELCKALANFMFDSDEAMVRIDMSEFMEK
HSVSRLVGAPPGYVGYEEGGYLTEAVRRRPYSVILLDEVEKAHPDVFNILLQVLDDGRLTDGQGRTVDFRNTVVIMTSNL
GSDLIQERFGELDYAHMKELVLGVVSHNFRPEFINRIDEVVVFHPLGEQHIASIAQIQLKRLYKRLEERGYEIHISDEAL
KLLSENGYDPVYGARPLKRAIQQQIENPLAQQILSGELVPGKVIRLEVNEDRIVAVQ
>O68185 ~~~clpB~~~Chaperone protein ClpB~~~COG0542
MDIEKMTTTMQEALGSAQQIAQVRHHQVIEVPHLWRIFVQPNSFGANFYKDLGIDLDDFTNLIEKEIDKINSVEGSNITY
GQNLSPDLFQVFTEADKIAQKMGDEYLSTEIILLALFELKQNPLTEYLVSHGLTKAKAQAAIEKLRGGDKVTSQNAEETY
KALEKYGVDLVAQVKSGNQDPVIGRDEEIRDVIRVLSRKTKNNPVLIGEPGVGKTAIVEGLAQRIVRKDVPENLKDKTIF
SLDMGALIAGAKYRGEFEERLKAVLNEVKKADGQIILFIDELHTIVGAGKTEGSMDAGNLLKPMLARGELHLIGATTLDE
YRKYMETDKALERRFQKVLVTEPTVEDTISILRGLKERFEIHHGVTIHDNALVAAATLSNRYITDRFLPDKAIDLIDEAS
ATIRVEMNSLPTELDQANRRLMQLEIEEAALKKERDDASKKRLEIIRGEIAELREENNQLKAQWEAEKKEVGNISEKRNE
LEHARHELEEAQNEGNLEKAAALRYGKIPEIEKELKAIEEKAKSDDLSLVQESVTEEQIAEVVGRMTGIPITKLVEGERE
KLLHLPETLHQRVVGQDEAVEAVSDAIIRARAGIQDPNRPLGSFLFLGPTGVGKTELAKALAENLFDSEEHMVRIDMSEY
MEKHSVSRLVGAPPGYVGYDEGGQLTEAVRRNPYTIILLDEIEKAHPDVFNILLQVLDDGRLTDSKGVLVDFKNTVLIMT
SNVGSQYLLDNVGENGEISEETTENVMSQLRAHFKPEFLNRIDDTILFKPLALEDIKNIILKMTSQLAHRLEEMEVELEL
SEEVKVWIAENAYEPAYGARPLKRYLTKVIENPLAKLIIGGKIPPKSKVIVRLIDNKVDFDVQSIAE
>O87444 ~~~clpB~~~Chaperone protein ClpB~~~
MQPTNSEKFTEKVWEAIYRTQEMYKQAQQQQIETEHLMKALLEQDGLAISIFNKLAVPVDRVRDRTDDFIRRQPKVSGSG
TSVYWGRRADALLXRAEEYRKQFEDSFISIEHLLLGYAQDSRFGKALLSEFRYPDEAKLRNAIEQVRGNQKVTDQTPENK
YESLEKYGRDLTQYAREGKLDPVIGRDDEIRRTIQILSRRTKNNPVLIGEPGVGKTAIAEGLAQRILSGDVPQSLKDRKL
IALDMGALIAGAKYRGEFEERLKAVLKEVTDSRGNIILFIDEIHTVVGAGATQGAMDAGNLLKPMLARGELRCIGATTLD
EYRKYIEKDAALERRFQQVFVDQPSVEDTISILRGLKERYEVHHGVKISDSALVAAATLSTRYISDRFLPSKAIDLVDEA
AAKLKMEITSKPEELDEVDRKVLQLEMERLSLQKENDAGSRDRLERLERELADFKEDQSKLNAQWQAEKSVITDLQKLKE
EIDRVNLEIQQAERDYDLNRAAELKYGKLNELNRKVEETESQLSQIQKSGATLLREEVLESDIAEIISKWTGIPVSKLVE
SEMQKLLQLDDVLHQRVIGQDEAVTAVSDAIQRSRAGLSDPNRPTASFIFLGPTGVGKTELAKALAAFLFDTEEAMVRID
MSEYMEKHSVSRLIGAPPGYVGYEEGGQLTEAVRRRPYSVILFDEIEKAHPDVFNVMLQILDDGRVTDSQGRTVDFKNTI
IIMTSNIGSQYIFEYGGDDDRYEEILSRVMEAMLSNFRPEFLNRIDEIIIFHSLQKAQLREIVKIQTHRLESRLARKMSL
KLSDAALDFLAEGFDPVYGARPLKRAIQRELETTIAKEILRSNFTEGDTIFVDVGETERLEFKRLPSEVLTTQ
>G2K265 ~~~clpB~~~Chaperone protein ClpB~~~
MDLQKFTQQVQQTIADAQNLAIASEHQEIDVAHVFKVLLTESDFAKRVYDVAEVDTDALQKVIENTLEKIPVVSGSGVNY
GQAMSQALFQLMRDAEKEQQQLEDDFVSTEHLILAVMDQKSNPITAELKNQHKAKKQIKEAILKIRGGKRVTSQNAEENY
EALTKYGRDLVAEVRSGKLDPVIGRDAEIRNVIRILSRKTKNNPVLIGEPGVGKTAIVEGLAQRIVRKDVPEGLKDKTII
SLDIGSLIAGAKYRGEFEERLKAVLQEVKQSDGQILLFIDEIHTIVGAGKTDGAMDAGNMLKPMLARGELHCIGATTLDE
YRQYIEKDAALERRFQKVLVPEPTVEDTVSILRGLKERFEIHHGVNIHDNALVAAASLSNRYITDRFLPDKAIDLVDEAC
ATIRVEIDSMPSELDEVTRKVMQLEIEEAALKEEKDPASERRLEILQRELADYKEEANQMKSKWESEKNEISKIREVREQ
IDHLRHELEEAENNYDLNKAAELRHGRIPAVEKELLELEAENREKTAQEDRILQEEVTENEIAEIVGRWTGIPVTKLVEG
EREKLLKLADVLHQKVIGQDDAVQLVSDAVLRARAGIKDPKRPIGSFIFLGPTGVGKTELAKALAFNMFDSEDHMIRIDM
SEYMEKHSVSRLVGAPPGYIGYEEGGQLTEAVRRNPYSIVLLDEIEKAHPDVFNILLQVLDDGRITDSQGRLIDFKNTVI
IMTSNIGSNLLLERTEEGEISPELESDVMQILQSEFKPEFLNRVDDIILFKPLTLADIKGIVEKLVEELQIRLADQEITI
TISDDAKAFIAEEAYDPVYGARPLKRYIVRHVETPLAREIVSGKIMPHSSVEIDLADKEFTFKVTE
>Q8Y570 ~~~clpB~~~Chaperone protein ClpB~~~COG0542
MDLQKFTQQVQQTIADAQNLAIASEHQEIDVAHVFKVLLTESDFAKRVYDVAEVDTDALQKVIENTLEKIPVVSGSGVNY
GQAMSQALFQLMRDAEKEQQQLEDDFVSTEHLILAVMDQKSNPITAELKNQHKAKKQIKEAILKIRGGKRVTSQNAEENY
EALTKYGRDLVAEVRSGKLDPVIGRDAEIRNVIRILSRKTKNNPVLIGEPGVGKTAIVEGLAQRIVRKDVPEGLKDKTII
SLDIGSLIAGAKYRGEFEERLKAVLQEVKQSDGQILLFIDEIHTIVGAGKTDGAMDAGNMLKPMLARGELHCIGATTLDE
YRQYIEKDAALERRFQKVLVPEPTVEDTVSILRGLKERFEIHHGVNIHDNALVAAASLSNRYITDRFLPDKAIDLVDEAC
ATIRVEIDSMPSELDEVTRKVMQLEIEEAALKEEKDPASERRLEILQRELADYKEEANQMKSKWESEKNEISKIREVREQ
IDHLRHELEEAENNYDLNKAAELRHGRIPAVEKELLELEAENREKTAQEDRILQEEVTENEIAEIVGRWTGIPVTKLVEG
EREKLLKLADVLHQKVIGQDDAVQLVSDAVLRARAGIKDPKRPIGSFIFLGPTGVGKTELAKALAFNMFDSEDHMIRIDM
SEYMEKHSVSRLVGAPPGYIGYEEGGQLTEAVRRNPYSIVLLDEIEKAHPDVFNILLQVLDDGRITDSQGRLIDFKNTVI
IMTSNIGSNLLLERTEEGEISPELESDVMQILQSEFKPEFLNRVDDIILFKPLTLADIKGIVEKLVEELQIRLADQEITI
TISDDAKAFIAEEAYDPVYGARPLKRYIVRHVETPLAREIVSGKIMPHSSVEIDLADKEFTFKVTE
>P9WPD1 ~~~clpB~~~Chaperone protein ClpB~~~COG0542
MDSFNPTTKTQAALTAALQAASTAGNPEIRPAHLLMALLTQNDGIAAPLLEAVGVEPATVRAETQRLLDRLPQATGASTQ
PQLSRESLAAITTAQQLATELDDEYVSTEHVMVGLATGDSDVAKLLTGHGASPQALREAFVKVRGSARVTSPEPEATYQA
LQKYSTDLTARAREGKLDPVIGRDNEIRRVVQVLSRRTKNNPVLIGEPGVGKTAIVEGLAQRIVAGDVPESLRDKTIVAL
DLGSMVAGSKYRGEFEERLKAVLDDIKNSAGQIITFIDELHTIVGAGATGEGAMDAGNMIKPMLARGELRLVGATTLDEY
RKHIEKDAALERRFQQVYVGEPSVEDTIGILRGLKDRYEVHHGVRITDSALVAAATLSDRYITARFLPDKAIDLVDEAAS
RLRMEIDSRPVEIDEVERLVRRLEIEEMALSKEEDEASAERLAKLRSELADQKEKLAELTTRWQNEKNAIEIVRDLKEQL
EALRGESERAERDGDLAKAAELRYGRIPEVEKKLDAALPQAQAREQVMLKEEVGPDDIADVVSAWTGIPAGRLLEGETAK
LLRMEDELGKRVIGQKAAVTAVSDAVRRSRAGVSDPNRPTGAFMFLGPTGVGKTELAKALADFLFDDERAMVRIDMSEYG
EKHTVARLIGAPPGYVGYEAGGQLTEAVRRRPYTVVLFDEIEKAHPDVFDVLLQVLDEGRLTDGHGRTVDFRNTILILTS
NLGSGGSAEQVLAAVRATFKPEFINRLDDVLIFEGLNPEELVRIVDIQLAQLGKRLAQRRLQLQVSLPAKRWLAQRGFDP
VYGARPLRRLVQQAIGDQLAKMLLAGQVHDGDTVPVNVSPDADSLILG
>Q7WSY8 ~~~clpB~~~Chaperone protein ClpB~~~COG0542
MDTEKLTTMSRDAVTAAVRLALTKGNPTAEPVHLLHAMLMVPESSVAPLLKAVGADAARVDGAASAAIDKLPSSSGSSVA
QPQLSGALARVLADAETRADKLGDQFVSTEHLLIALAEVDSDAKNILASNGVTTAALEKAFNDSRGDKRITSAESEGGES
ALDKYSIDLTQRAKDGKLDPVIGRDSEIRRVAQVLSRRTKNNPVLIGEAGVGKTAVVEGLAQRIVKGDVPDSLKGRRLVS
LDLASMVAGAKYRGEFEERLKAVLNEIKSAEGQIITFIDELHTVVGAGASEGSMDASNMLKPLLARGELRLIGATTLDEY
REHIEKDPALERRFQQVYVGEPSVEDTVAILRGLRERYEAHHKVRITDSALVAAAQLSHRYITGRQLPDKAIDLVDEAAS
RLRMEIDSSPEEIDTLRRQVDRLTMEQFAVEKEEDPGSKARLARINSDLADAKEQLRGLEARWAAEKEGLNKVGELKTRI
DALRTEADKHTRDGDLAKASEILYGEIPELNKQLDEASAAEEDSQGKSMVSEEVTSDDIAEVVSAWTGVPVGKMLEGESE
KLLDMENRIGKRLVGQQAAVKAVSDAVRRSRAGISDPNRPTGSFMFLGPTGVGKTELAKALADFLFDDETAMVRIDMSEY
MEKHSVSRLVGAPPGYVGYEEGGQLTEAVRRRPYSVVLLDEIEKAHPDVFNILLQVLDDGRLTDGQGRTVDFRNVILIMT
SNLGSQFMADPSMSPEERRNQVMAVVKDHFRPEFLNRLDEIVLFDELSREDLDKIVDISLDKLNRRLAERRISIDVSAAA
REWLARTGYDPVYGARPLRRLIQTTVEDQLARAMLAGTISDDQKVSVDMNQAGDGVDVKGEAPVSA
>Q7A6G6 ~~~clpB~~~Chaperone protein ClpB~~~
MDINKMTYAVQSALQQAVELSQQHKLQNIEIEAILSAALNESESLYKSILERANIEVDQLNKAYEDKLNTYASVEGDNIQ
YGQYISQQANQLITKAESYMKEYEDEYISMEHILRSAMDIDQTTKHYINNKVEVIKEIIKKVRGGNHVTSQNPEVNYEAL
AKYGRDLVEEVRQGKMDPVIGRDEEIRNTIRILSRKTKNNPVLIGEPGVGKTAIVEGLAQRIVKKDVPESLLDKTVFELD
LSALVAGAKYRGEFEERLKAVLKEVKESDGRIILFIDEIHMLVGAGKTDGAMDAGNMLKPMLARGELHCIGATTLNEYRE
YIEKDSALERRFQKVAVSEPDVEDTISILRGLKERYEVYHGVRIQDRALVAAAELSDRYITDRFLPDKAIDLVDQACATI
RTEMGSNPTELDQVNRRVMQLEIEESALKNESDNASKQRLQELQEELANEKEKQAALQSRVESEKEKIANLQEKRAQLDE
SRQALEDAQTNNNLEKAAELQYGTIPQLEKELRELEDNFQDEQGEDTDRMIREVVTDEEIGDIVSQWTGIPVSKLVETER
EKLLHLSDILHKRVVGQDKAVDLVSDAVVRARAGIKDPNRPIGSFLFLGPTGVGKTELAKSLAASLFDSEKHMIRIDMSE
YMEKHAVSRLIGAPPGYIGHDEGGQLTEAVRRNPYSVILLDEVEKAHTDVFNVLLQILDEGRLTDSKGRSVDFKNTIIIM
TSNIGSQVLLENVKETGEITESTEKAVMTNLNAYFKPEILNRMDDIVLFKPLSIDDMSMIVDKILTQLNIRLLEQRISIE
VSDDAKAWLGQEAYEPQYGARPLKRFVQRQIETPLARMMIKEGFPEGTTIKVNLNSDNNLTFNVEKIHE
>Q9Z6E4 ~~~clpB~~~Chaperone protein ClpB~~~
MDAELTNRSRDALNAATTRAVSAGNPDLTPAHLLLALLEGQDNENLVDLLAAVERRPGGGSARAPSASLGSLPGVTGSTV
APPQPNRDLLAVIADAGRRAKDLGDEFLSTEHLLIGIRPTARPPRCSPGRAPTPEKLLEAFQNTRGGRRVTTPRPEGQYK
ALEKFGTDFTAAARERKLDPVIGRDQEIRRVVQVLSRRTKNNPVLIGEPGVGKTAVVEGLAQRIVKGDVPESLKDKRLVS
LDLGAMVAGAKYRGEFEERLKTVLSEIKESDGQIVTFIDELHTVVGAGAADSAMDAGNMLKPMLARGELRMVGATTLDEY
RERIEKDPALERRFQQVLVAEPSVEDSIAILRGLKGRYEAHHKVQIADSALVAAATLSDRYITSRFLPDKAIDLVDEAAS
RLRMEIDSSPLEIDELQRSVDRLKMEELALDRETDPASRQRLEKLRRDLADRERSCAAHRPWEKEKQSLNRVGELKERLD
ELRGQAERAQQHGDFDTASKLLYGEIPTLERDLRWRPAEEEAAKDTMVKEEVGPDDIADVVGSWTGIPAGRLLEGETQKL
LRMEAELGRRLIGQSEAVQAVSDAVRTRAGIADPDRPTGSFLFLGPTGVGKTELAKALADFLFDDERAMIRIDMSEYGEK
HSVARLVGAPPGYVGYEEGGQLTEAVRRRPYSVVLLDEVEKAHPGVFDILLQVLDDGRLTDGQGRTVDFRNTILVLTSNL
GSQYLVGSAPEEEKRRQVMEVVRSSFKPEFLNRLDDLVIFSALDEDELARIAGLQIAGLARRLADRRLSLDVTPEALAWL
AKEGFDPAYGARPLRRLIQTAIGDRLAKEILAGEVRDGDTVRVDRVEDGLLVGRAEG
>Q9RA63 ~~~clpB~~~Chaperone protein ClpB~~~COG0542
MNLERWTQAAREALAQAQVLAQRMKHQAIDLPHLWAVLLKDERSLAWRLLEKAGADPKALKELQERELARLPKVEGAEVG
QYLTSRLSGALNRAEALMEELKDRYVAVDTLVLALAEATPGLPGLEALKGALKELRGGRTVQTEHAESTYNALEQYGIDL
TRLAAEGKLDPVIGRDEEIRRVIQILLRRTKNNPVLIGEPGVGKTAIVEGLAQRIVKGDVPEGLKGKRIVSLQMGSLLAG
AKYRGEFEERLKAVIQEVVQSQGEVILFIDELHTVVGAGKAEGAVDAGNMLKPALARGELRLIGATTLDEYREIEKDPAL
ERRFQPVYVDEPTVEETISILRGLKEKYEVHHGVRISDSAIIAAATLSHRYITERRLPDKAIDLIDEAAARLRMALESAP
EEIDALERKKLQLEIEREALKKEKDPDSQERLKAIEAEIAKLTEEIAKLRAEWEREREILRKLREAQHRLDEVRREIELA
ERQYDLNRAAELRYGELPKLEAEVEALSEKLRGARFVRLEVTEEDIAEIVSRWTGIPVSKLLEGEREKLLRLEEELHKRV
VGQDEAIRAVADAIRRARAGLKDPNRPIGSFLFLGPTGVGKTELAKTLAATLFDTEEAMIRIDMTEYMEKHAVSRLIGAP
PGYVGYEEGGQLTEAVRRRPYSVILFDEIEKAHPDVFNILLQILDDGRLTDSHGRTVDFRNTVIILTSNLGSPLILEGLQ
KGWPYERIRDEVFKVLQQHFRPEFLNRLDEIVVFRPLTKEQIRQIVEIQLSYLRARLAEKRISLELTEAAKDFLAERGYD
PVFGARPLRRVIQRELETPLAQKILAGEVKEGDRVQVDVGPAGLVFAVPARVEA
>P9WPC8 ~~~clpC1~~~ATP-dependent Clp protease ATP-binding subunit ClpC1~~~
MFERFTDRARKVVVLAQEEARMLNHNYIGTEHILLGLIHEGEGVAAKSLESLGISLEGVRSQVEEIIGQGQQAPSGHIPF
TPRAKKVLELSLREALQLGHNYIGTEHILLGLIREGEGVAAQVLVKLGAELTRVRQQVIQLLSGYQGKEAAEAGTGGRGG
ESGSPSTSLVLDQFGRNLTAAAMEGKLDPVIGREKEIERVMQVLSRRTKNNPVLIGEPGVGKTAVVEGLAQAIVHGEVPE
TLKDKQLYTLDLGSLVAGSRYRGDFEERLKKVLKEINTRGDIILFIDELHTLVGAGAAEGAIDAASILKPKLARGELQTI
GATTLDEYRKYIEKDAALERRFQPVQVGEPTVEHTIEILKGLRDRYEAHHRVSITDAAMVAAATLADRYINDRFLPDKAI
DLIDEAGARMRIRRMTAPPDLREFDEKIAEARREKESAIDAQDFEKAASLRDREKTLVAQRAEREKQWRSGDLDVVAEVD
DEQIAEVLGNWTGIPVFKLTEAETTRLLRMEEELHKRIIGQEDAVKAVSKAIRRTRAGLKDPKRPSGSFIFAGPSGVGKT
ELSKALANFLFGDDDALIQIDMGEFHDRFTASRLFGAPPGYVGYEEGGQLTEKVRRKPFSVVLFDEIEKAHQEIYNSLLQ
VLEDGRLTDGQGRTVDFKNTVLIFTSNLGTSDISKPVGLGFSKGGGENDYERMKQKVNDELKKHFRPEFLNRIDDIIVFH
QLTREEIIRMVDLMISRVAGQLKSKDMALVLTDAAKALLAKRGFDPVLGARPLRRTIQREIEDQLSEKILFEEVGPGQVV
TVDVDNWDGEGPGEDAVFTFTGTRKPPAEPDLAKAGAHSAGGPEPAAR
>P9WPC9 ~~~clpC1~~~ATP-dependent Clp protease ATP-binding subunit ClpC1~~~COG0542
MFERFTDRARRVVVLAQEEARMLNHNYIGTEHILLGLIHEGEGVAAKSLESLGISLEGVRSQVEEIIGQGQQAPSGHIPF
TPRAKKVLELSLREALQLGHNYIGTEHILLGLIREGEGVAAQVLVKLGAELTRVRQQVIQLLSGYQGKEAAEAGTGGRGG
ESGSPSTSLVLDQFGRNLTAAAMEGKLDPVIGREKEIERVMQVLSRRTKNNPVLIGEPGVGKTAVVEGLAQAIVHGEVPE
TLKDKQLYTLDLGSLVAGSRYRGDFEERLKKVLKEINTRGDIILFIDELHTLVGAGAAEGAIDAASILKPKLARGELQTI
GATTLDEYRKYIEKDAALERRFQPVQVGEPTVEHTIEILKGLRDRYEAHHRVSITDAAMVAAATLADRYINDRFLPDKAI
DLIDEAGARMRIRRMTAPPDLREFDEKIAEARREKESAIDAQDFEKAASLRDREKTLVAQRAEREKQWRSGDLDVVAEVD
DEQIAEVLGNWTGIPVFKLTEAETTRLLRMEEELHKRIIGQEDAVKAVSKAIRRTRAGLKDPKRPSGSFIFAGPSGVGKT
ELSKALANFLFGDDDALIQIDMGEFHDRFTASRLFGAPPGYVGYEEGGQLTEKVRRKPFSVVLFDEIEKAHQEIYNSLLQ
VLEDGRLTDGQGRTVDFKNTVLIFTSNLGTSDISKPVGLGFSKGGGENDYERMKQKVNDELKKHFRPEFLNRIDDIIVFH
QLTREEIIRMVDLMISRVAGQLKSKDMALVLTDAAKALLAKRGFDPVLGARPLRRTIQREIEDQLSEKILFEEVGPGQVV
TVDVDNWDGEGPGEDAVFTFTGTRKPPAEPDLAKAGAHSAGGPEPAAR
>P37571 ~~~clpC~~~Negative regulator of genetic competence ClpC/MecB~~~COG0542
MMFGRFTERAQKVLALAQEEALRLGHNNIGTEHILLGLVREGEGIAAKALQALGLGSEKIQKEVESLIGRGQEMSQTIHY
TPRAKKVIELSMDEARKLGHSYVGTEHILLGLIREGEGVAARVLNNLGVSLNKARQQVLQLLGSNETGSSAAGTNSNANT
PTLDSLARDLTAIAKEDSLDPVIGRSKEIQRVIEVLSRRTKNNPVLIGEPGVGKTAIAEGLAQQIINNEVPEILRDKRVM
TLDMGTVVAGTKYRGEFEDRLKKVMDEIRQAGNIILFIDELHTLIGAGGAEGAIDASNILKPSLARGELQCIGATTLDEY
RKYIEKDAALERRFQPIQVDQPSVDESIQILQGLRDRYEAHHRVSITDDAIEAAVKLSDRYISDRFLPDKAIDLIDEAGS
KVRLRSFTTPPNLKELEQKLDEVRKEKDAAVQSQEFEKAASLRDTEQRLREQVEDTKKSWKEKQGQENSEVTVDDIAMVV
SSWTGVPVSKIAQTETDKLLNMENILHSRVIGQDEAVVAVAKAVRRARAGLKDPKRPIGSFIFLGPTGVGKTELARALAE
SIFGDEESMIRIDMSEYMEKHSTSRLVGSPPGYVGYDEGGQLTEKVRRKPYSVVLLDEIEKAHPDVFNILLQVLEDGRLT
DSKGRTVDFRNTILIMTSNVGASELKRNKYVGFNVQDETQNHKDMKDKVMGELKRAFRPEFINRIDEIIVFHSLEKKHLT
EIVSLMSDQLTKRLKEQDLSIELTDAAKAKVAEEGVDLEYGARPLRRAIQKHVEDRLSEELLRGNIHKGQHIVLDVEDGE
FVVKTTAKTN
>Q2G0P5 ~~~clpC~~~ATP-dependent Clp protease ATP-binding subunit ClpC~~~COG0542
MLFGRLTERAQRVLAHAQEEAIRLNHSNIGTEHLLLGLMKEPEGIAAKVLESFNITEDKVIEEVEKLIGHGQDHVGTLHY
TPRAKKVIELSMDEARKLHHNFVGTEHILLGLIRENEGVAARVFANLDLNITKARAQVVKALGNPEMSNKNAQASKSNNT
PTLDSLARDLTVIAKDGTLDPVIGRDKEITRVIEVLSRRTKNNPVLIGEPGVGKTAIAEGLAQAIVNNEVPETLKDKRVM
SLDMGTVVAGTKYRGEFEERLKKVMEEIQQAGNVILFIDELHTLVGAGGAEGAIDASNILKPALARGELQCIGATTLDEY
RKNIEKDAALERRFQPVQVDEPSVVDTVAILKGLRDRYEAHHRINISDEAIEAAVKLSNRYVSDRFLPDKAIDLIDEASS
KVRLKSHTTPNNLKEIEQEIEKVKNEKDAAVHAQEFENAANLRDKQTKLEKQYEEAKNEWKNAQNGMSTSLSEEDIAEVI
AGWTGIPLTKINETESEKLLSLEDTLHERVIGQKDAVNSISKAVRRARAGLKDPKRPIGSFIFLGPTGVGKTELARALAE
SMFGDDDAMIRVDMSEFMEKHAVSRLVGAPPGYVGHDDGGQLTEKVRRKPYSVILFDEIEKAHPDVFNILLQVLDDGHLT
DTKGRTVDFRNTIIIMTSNVGAQELQDQRFAGFGGSSDGQDYETIRKTMLKELKNSFRPEFLNRVDDIIVFHKLTKEELK
EIVTMMVNKLTNRLSEQNINIIVTDKAKDKIAEEGYDPEYGARPLIRAIQKTIEDNLSELILDGNQIEGKKVTVDHDGKE
FKYDIAEQTSETKTPSQA
>Q2YSD6 ~~~clpC~~~ATP-dependent Clp protease ATP-binding subunit ClpC~~~
MLFGRLTERAQRVLAHAQEEAIRLNHSNIGTEHLLLGLMKEPEGIAAKVLESFNITEDKVIEEVEKLIGHGQDHVGTLHY
TPRAKKVIELSMDEARKLHHNFVGTEHILLGLIRENEGVAARVFANLDLNITKARAQVVKALGNPEMSNKNAQASKSNNT
PTLDSLARDLTVIAKDGTLDPVIGRDKEITRVIEVLSRRTKNNPVLIGEPGVGKTAIAEGLAQAIVNNEVPETLKDKRVM
SLDMGTVVAGTKYRGEFEERLKKVMEEIQQAGNVILFIDELHTLVGAGGAEGAIDASNILKPALARGELQCIGATTLDEY
RKNIEKDAALERRFQPVQVDEPSVVDTVAILKGLRDRYEAHHRINISDEAIEAAVKLSNRYVSDRFLPDKAIDLIDEASS
KVRLKSHTTPNNLKEIEQEIEKVKNEKDAAVHAQEFENAANLRDKQTKLEKQYEEAKNEWKNTQNGMSTSLSEEDIAEVI
AGWTGIPLTKINETESEKLLSLEDTLHERVIGQKDAVNSISKAVRRARAGLKDPKRPIGSFIFLGPTGVGKTELARALAE
SMFGDDDAMIRVDMSEFMEKHAVSRLVGAPPGYVGHDDGGQLTEKVRRKPYSVILFDEIEKAHPDVFNILLQVLDDGHLT
DTKGRTVDFRNTIIIMTSNVGAQELQDQRFAGFGGSSDGQDYETIRKTMLKELKNSFRPEFLNRVDDIIVFHKLTKEELK
EIVTMMVNKLTNRLSEQNINIIVTDKAKDKIAEEGYDPEYGARPLIRAIQKTIEDNLSELILDGNQIEGKKVTVDHDGKE
FKYDIAEQTSETKTPSQA
>Q7A797 ~~~clpC~~~ATP-dependent Clp protease ATP-binding subunit ClpC~~~
MLFGRLTERAQRVLAHAQEEAIRLNHSNIGTEHLLLGLMKEPEGIAAKVLESFNITEDKVIEEVEKLIGHGQDHVGTLHY
TPRAKKVIELSMDEARKLHHNFVGTEHILLGLIRENEGVAARVFANLDLNITKARAQVVKALGNPEMSNKNAQASKSNNT
PTLDSLARDLTVIAKDGTLDPVIGRDKEITRVIEVLSRRTKNNPVLIGEPGVGKTAIAEGLAQAIVNNEVPETLKDKRVM
SLDMGTVVAGTKYRGEFEERLKKVMEEIQQAGNVILFIDELHTLVGAGGAEGAIDASNILKPALARGELQCIGATTLDEY
RKNIEKDAALERRFQPVQVDEPSVVDTVAILKGLRDRYEAHHRINISDEAIEAAVKLSNRYVSDRFLPDKAIDLIDEASS
KVRLKSHTTPNNLKEIEQEIEKVKNEKDAAVHAQEFENAANLRDKQTKLEKQYEEAKNEWKNAQNGMSTSLSEEDIAEVI
AGWTGIPLTKINETESEKLLSLEDTLHERVIGQKDAVNSISKAVRRARAGLKDPKRPIGSFIFLGPTGVGKTELARALAE
SMFGDDDAMIRVDMSEFMEKHAVSRLVGAPPGYVGHDDGGQLTEKVRRKPYSVILFDEIEKAHPDVFNILLQVLDDGHLT
DTKGRTVDFRNTIIIMTSNVGAQELQDQRFAGFGGSSDGQDYETIRKTMLKELKNSFRPEFLNRVDDIIVFHKLTKEELK
EIVTMMVNKLTNRLSEQNINIIVTDKAKDKIAEEGYDPEYGARPLIRAIQKTIEDNLSELILDGNQIEGKKVTVDHDGKE
FKYDIAEQTSETKTPSQA
>Q5XCL7 ~~~clpC~~~Probable ATP-dependent Clp protease ATP-binding subunit~~~
MTHFSGKDPFVNMDDIFNQLMANMGGYRSENPRYLVNGREITPEEFQHYRQTGQLPVATTKATNSQMLTPKADSVLTQLG
TNLTQEARQGHLDPVIGRNKEIQDTAEILARRTKNNPVLVGDAGVGKTAVIEGLAQAIVNGDVPAAIKNKEIVSIDISSL
EAGTQYRGSFEETIQNLIQEVKEAGNIILFFDEIHQIVGAGATSSDSGSKGLADILKPALSRGELTLIGATTQDEYRNTI
LKNAALARRFNEVKVNAPSAEDTFHILMGIRNLYEQHHHITLPDNVLKAAVDYSIQYIPQRSLPDKAIDLLDMTAAHLAA
QHPVTDLKTLETEIAKQKESQEKAVAKEDFEKALAAKTRIETLQKQIEQHNQSQNVTATVNDIAESVERLTGIPVSNMRT
NDLERLKGISSRLKSHVIGQDEAVAAVARAIRRNRAGFDDGNRPIGSFLFVGPTGVGKTELAKQLALDLFGSKDAIIRLD
MSEYNDRTAVSKLIGTTAGYVGYDDNNNTLTERVRRNPYAIVLLDEIEKADPQIITLLLQVLDDGRLTDGQGNTINFKNT
VIIATSNAGFGQQDTETSESNIMDRIAPYFRPEFLNRFNSIIKFNHLQKENLEEIVDLMLAEVNQTTAKKGISLTIADDA
KAHLIDLGYNHAMGARPLRRIIEQEIRDRITDYYLDHPEVKKLQAILKEGQLVIRQNDQ
>O31673 ~~~clpE~~~ATP-dependent Clp protease ATP-binding subunit ClpE~~~COG0542
MRCQHCHQNEATIRLNMQINSVHKQMVLCETCYNELTRKPSMSMGPQSFGFPFEQAFQPKEQSAAKQSEKKGLLDELAQN
ITNGAKAGLIDPVIGRDDEVARVIEILNRRNKNNPVLIGEPGVGKTAIAEGLALKIAEGDVPNKLKNKELYLLDVASLVA
NTGIRGQFEERMKQLITELKERKNVILFIDEIHLLVGAGSAEGSMDAGNILKPALARGELQVIGATTLKEYRQIEKDAAL
ERRFQPVMVQEPSIEQAILILQGIKDKYEAYHGVTFSDEAIKACVTLSSRYIQDRHLPDKAIDLLDEAGSKANLLIDELN
DEDAAERLTAIEAEKTKALEEENYELAAKLRDEELALEKKLNSSSAHTAVTVEAEHIQEIVEQKTGIPVGKLQADEQTKM
KELEAKLHERVIGQEAAVQKVAKAVRRSRAGLKSKNRPVGSFLFVGPTGVGKTELSKTLADELFGTKDAIIRLDMSEYME
KHAVSKIIGSPPGYVGHEEAGQLTEKVRRNPYSIVLLDEIEKAHPDVQHMFLQIMEDGRLTDSQGRTVSFKDTVIIMTSN
AGAGEKQTKVGFQSDDSVIEEQTLIDSLSMFFKPEFLNRFDSIIEFRSLEKEHLVKIVSLLLGELEETLAERGISLNVTD
EAKEKIAELGYHPSFGARPLRRTIQEWVEDEMTDLLLDNGEITSFHVILEDDKIKVRAK
>Q9S5Z2 ~~~clpE~~~ATP-dependent Clp protease ATP-binding subunit ClpE~~~COG0542
MLCQNCNINEATIHLYTSVNGQKKQIDLCQNCYQIMKSGGQEALFGAGNASNGNSDEPFNPFNDIFSALQGQDFNGAASN
QTPPTQTGGRGPRGPQNPRAKQPKGMLEEFGINITESARRGEIDPVIGRDEEIKRVIEILNRRTKNNPVLIGEPGVGKTA
VVEGLAQKIVDGDVPQKLQNKEVIRLDVVSLVQGTGIRGQFEERMQKLMDEIRKRNDVIMFIDEIHEIVGAGSAGDGNMD
AGNILKPALARGELQLVGATTLNEYRIIEKDAALERRMQPVKVDEPSVDETITILRGIQARYEDYHHVKYTDEAIEAAAH
LSNRYIQDRFLPDKAIDLLDESGSKKNLTLKFVDPEDINRRIADAESKKNEATKAEDFEKAAHFRDQISKLRELQKQEVT
DEDMPVITEKDIEQIVEQKTQIPVGDLKEKEQTQLINLADDLKAHVIGQDEAVDKISKAIRRSRVGLGKPNRPIGSFLFV
GPTGVGKTELAKQLAKELFGSSESMIRFDMSEYMEKHSVAKLIGAPPGYVGYEEAGQLTERVRRNPYSLILLDEIEKAHP
DVMHMFLQILEDGRLTDAQGRTVSFKDSLIIMTSNAGTGKVEASVGFGAAREGRTKSVLGQLGDFFSPEFMNRFDGIIEF
SALSKENLLKIVDLMLDEVNEQIGRNDIHLSVTQAAKEKLVDLGYNPAMGARPLRRIIQENIEDSIADFYIEHPEYKQLV
ADLIDDKIVISNQTQETAETTDEEVPAE
>Q2FV74 ~~~clpL~~~ATP-dependent Clp protease ATP-binding subunit ClpL~~~COG0542
MNNGFFNSDFDSIFRRMMKDMQGSNQVGNKKYYINGKEVSPEELAQLTQQGGNHSAEQSAQAFQQAAQRQQGQQGGNGNY
LEQIGRNLTQEARDGLLDPVIGRDKEIQETAEVLSRRTKNNPILVGEAGVGKTAIVEGLAQAIVEGNVPAAIKDKEIISV
DISSLEAGTQYRGAFEENIQKLIEGVKSSQNAVLFFDEIHQIIGSGATGSDSGSKGLSDILKPALSRGEISIIGATTQDE
YRNNILKDAALTRRFNEVLVNEPSAKDTVEILKGIREKFEEHHQVKLPDDVLKACVDLSIQYIPQRLLPDKAIDVLDITA
AHLSAQSPAVDKVETEKRISELENDKRKAVSAEEYKKADDIQNEIKSLQDKLENSNGEHTAVATVHDISDTIQRLTGIPV
SQMDDNDIERLKNISNRLRSKIIGQDQAVEMVSRAIRRNRAGFDDGNRPIGSFLFVGPTGVGKTELAKQLAIDLFGNKDA
LIRLDMSEYSDTTAVSKMIGTTAGYVGYDDNSNTLTEKVRRNPYSVILFDEIEKANPQILTLLLQVMDDGNLTDGQGNVI
NFKNTIIICTSNAGFGNGNDAEEKDIMHEMKKFFRPEFLNRFNGIVEFLHLDKDALQDIVNLLLDDVQVTLDKKGITMDV
SQDAKDWLIEEGYDEELGARPLRRIVEQQVRDKITDYYLDHTDVKHVDIDVEDNELVVKGK
>Q7A3F4 ~~~clpL~~~ATP-dependent Clp protease ATP-binding subunit ClpL~~~
MNNGFFNSDFDSIFRRMMQDMQGSNQVGNKKYYINGKEVSPEELAQLTQQGSNQSAEQSAQAFQQAAQRQQGQQGGNGNY
LEQIGRNLTQEARDGLLDPVIGRDKEIQETAEVLSRRTKNNPILVGEAGVGKTAIVEGLAQAIVEGNVPAAIKDKEIISV
DISSLEAGTQYRGAFEENIQKLIEGVKSSQNAVLFFDEIHQIIGSGATGSDSGSKGLSDILKPALSRGEISIIGATTQDE
YRNNILKDAALTRRFNEVLVNEPSAKDTVEILKGIREKFEEHHQVKLPDDVLKACVDLSIQYIPQRLLPDKAIDVLDITA
AHLSAQSPAVDKVETEKRISELENDKRKAVSAEEYKKADDIQNEIKSLQDKLENSNGEHTAVATVHDISDTIQRLTGIPV
SQMDDNDIERLKNISNRLRSKIIGQDQAVEMVSRAIRRNRAGFDDGNRPIGSFLFVGPTGVGKTELAKQLAIDLFGNKDA
LIRLDMSEYSDTTAVSKMIGTTAGYVGYDDNSNTLTEKVRRNPYSVILFDEIEKANPQILTLLLQVMDDGNLTDGQGNVI
NFKNTIIICTSNAGFGNGNDAEEKDIMHEMKKFFRPEFLNRFNGIVEFLHLDKDALQDIVNLLLDDVQVTLDKKGITMDV
SQDAKDWLIEEGYDEELGARPLRRIVEQQVRDKITDYYLDHTDVKHVDIDVEDNELVVKGK
>B0B803 3.4.21.92~~~clpP1~~~ATP-dependent Clp protease proteolytic subunit 1~~~
MPEGEMMHKLQDVIDRKLLDSRRIFFSEPVTEKSAAEAIKKLWYLELTNPGQPIVFVINSPGGSVDAGFAVWDQIKMISS
PLTTVVTGLAASMGSVLSLCAVPGRRFATPHARIMIHQPSIGGTITGQATDLDIHAREILKTKARIIDVYVEATGQSPEV
IEKAIDRDMWMSANEAMEFGLLDGILFSFNDL
>P9WPC4 3.4.21.92~~~clpP1~~~ATP-dependent Clp protease proteolytic subunit 1~~~
MSQVTDMRSNSQGLSLTDSVYERLLSERIIFLGSEVNDEIANRLCAQILLLAAEDASKDISLYINSPGGSISAGMAIYDT
MVLAPCDIATYAMGMAASMGEFLLAAGTKGKRYALPHARILMHQPLGGVTGSAADIAIQAEQFAVIKKEMFRLNAEFTGQ
PIERIEADSDRDRWFTAAEALEYGFVDHIITRAHVNGEAQ
>P9WPC5 3.4.21.92~~~clpP1~~~ATP-dependent Clp protease proteolytic subunit 1~~~COG0740
MSQVTDMRSNSQGLSLTDSVYERLLSERIIFLGSEVNDEIANRLCAQILLLAAEDASKDISLYINSPGGSISAGMAIYDT
MVLAPCDIATYAMGMAASMGEFLLAAGTKGKRYALPHARILMHQPLGGVTGSAADIAIQAEQFAVIKKEMFRLNAEFTGQ
PIERIEADSDRDRWFTAAEALEYGFVDHIITRAHVNGEAQ
>Q9I2U1 3.4.21.92~~~clpP1~~~ATP-dependent Clp protease proteolytic subunit 1~~~
MSRNSFIPHVPDIQAAGGLVPMVVEQSARGERAYDIYSRLLKERIIFLVGQVEDYMANLVVAQLLFLEAENPEKDIHLYI
NSPGGSVTAGMSIYDTMQFIKPNVSTTCIGQACSMGALLLAGGAAGKRYCLPHSRMMIHQPLGGFQGQASDIEIHAKEIL
FIKERLNQILAHHTGQPLDVIARDTDRDRFMSGDEAVKYGLIDKVMTQRDLAV
>Q9F315 3.4.21.92~~~clpP1~~~ATP-dependent Clp protease proteolytic subunit 1~~~COG0740
MRRPGAVVRRAGGYVTNLMPSAAGEPSIGGGLGDQVYNRLLGERIIFLGQPVDDDIANKITAQLLLLASDPDKDIFLYIN
SPGGSITAGMAIYDTMQYIKNDVVTIAMGLAASMGQFLLSAGTPGKRFALPNAEILIHQPSAGLAGSASDIKIHAERLLH
TKRRMAELTSQHTGQTIEQITRDSDRDRWFDAFEAKEYGLIDDVIATAAGMPGGGGTGA
>O84712 3.4.21.92~~~clpP2~~~ATP-dependent Clp protease proteolytic subunit 2~~~
MTLVPYVVEDTGRGERAMDIYSRLLKDRIVMIGQEITEPLANTVIAQLLFLMSEDPTKDIQIFINSPGGYITAGLAIYDT
IRFLGCDVNTYCIGQAASMGALLLSAGTKGKRYALPHSRMMIHQPSGGIIGTSADIQLQAAEILTLKKHLSNILAECTGQ
SVEKIIEDSERDFFMGAEEAIAYGLIDKVISSAKETKDKSIAS
>P9WPC2 3.4.21.92~~~clpP2~~~ATP-dependent Clp protease proteolytic subunit 2~~~
MNSQNSQIQPQARYILPSFIEHSSFGVKESNPYNKLFEERIIFLGVQVDDASANDIMAQLLVLESLDPDRDITMYINSPG
GGFTSLMAIYDTMQYVRADIQTVCLGQAASAAAVLLAAGTPGKRMALPNARVLIHQPSLSGVIQGQFSDLEIQAAEIERM
RTLMETTLARHTGKDAGVIRKDTDRDKILTAEEAKDYGIIDTVLEYRKLSAQTA
>P9WPC3 3.4.21.92~~~clpP2~~~ATP-dependent Clp protease proteolytic subunit 2~~~COG0740
MNSQNSQIQPQARYILPSFIEHSSFGVKESNPYNKLFEERIIFLGVQVDDASANDIMAQLLVLESLDPDRDITMYINSPG
GGFTSLMAIYDTMQYVRADIQTVCLGQAASAAAVLLAAGTPGKRMALPNARVLIHQPSLSGVIQGQFSDLEIQAAEIERM
RTLMETTLARHTGKDAGVIRKDTDRDKILTAEEAKDYGIIDTVLEYRKLSAQTA
>Q9HYR9 3.4.21.92~~~clpP2~~~ATP-dependent Clp protease proteolytic subunit 2~~~
MKTDDKDREGGDSHGAIGAKLMEYALKVRKVFVTGGVDEKMAKDVVQQLHILASISDDPIYMFVNSPGGHVESGDMIFDA
IRFITPKVIMIGSGSVASAGALIYAAADKENRYSLPNTRFLLHQPSGGIQGPASNIEIYRREIVRMKERLDRIFAEATGQ
TPEKISADTERDFWLNAEEAVQYGLVNKIIVSEREITLPGQ
>Q9ZH58 3.4.21.92~~~clpP2~~~ATP-dependent Clp protease proteolytic subunit 2~~~COG0740
MRAASQGRYTGPQAESRYVIPRFVERTSQGVREYDPYAKLFEERVIFLGVQIDDASANDVMAQLLCLESMDPDRDISVYI
NSPGGSFTALTAIYDTMQYVKPDVQTVCMGQAASAAAVLLAAGTPGKRMALPNARVLIHQPYSETGRGQVSDLEIAANEI
LRMRSQLEEMLAKHSTTPVEKIREDIERDKILTAEDALSYGLIDQIITTRKMDNSSLR
>P80244 3.4.21.92~~~clpP~~~ATP-dependent Clp protease proteolytic subunit~~~COG0740
MNLIPTVIEQTNRGERAYDIYSRLLKDRIIMLGSAIDDNVANSIVSQLLFLAAEDPEKEISLYINSPGGSITAGMAIYDT
MQFIKPKVSTICIGMAASMGAFLLAAGEKGKRYALPNSEVMIHQPLGGAQGQATEIEIAAKRILLLRDKLNKVLAERTGQ
PLEVIERDTDRDNFKSAEEALEYGLIDKILTHTEDKK
>B8GX16 3.4.21.92~~~clpP~~~ATP-dependent Clp protease proteolytic subunit~~~
MYDPVSTAMNLVPMVVEQTSRGERAFDIFSRLLKERIIFLTGPVEDGMASLICAQLLFLESENPKKEIAMYINSPGGVVT
AGLAIYDTMQYIKSPVSTVCMGMAASMGSLLLAAGAAGQRISLPNARIMVHQPSGGFRGQASDIERHAEDIIKTKRRLNE
IYVKHCGRTYEEVERTLDRDHFMSADEAKAWGLVDHVYDSRDAAEAGAE
>Q83DJ2 3.4.21.92~~~clpP~~~ATP-dependent Clp protease proteolytic subunit~~~COG0740
MSVLVPMVVEQTSRGERAYDIYSRLLKDRVIFLVGQVEDHMANLAIAQMLFLESENPNKDINLYINSPGGAVTSAMAIYD
TMQFVKPDVRTLCIGQAASAGALLLAGGAKGKRHCLPHSSVMIHQVLGGYQGQGTDIQIHAKQTQRVSDQLNQILAKHTG
KDIERVEKDTNRDYFLTPEEAVEYGLIDSIFKERP
>P0A6G7 3.4.21.92~~~clpP~~~ATP-dependent Clp protease proteolytic subunit~~~COG0740
MSYSGERDNFAPHMALVPMVIEQTSRGERSFDIYSRLLKERVIFLTGQVEDHMANLIVAQMLFLEAENPEKDIYLYINSP
GGVITAGMSIYDTMQFIKPDVSTICMGQAASMGAFLLTAGAKGKRFCLPNSRVMIHQPLGGYQGQATDIEIHAREILKVK
GRMNELMALHTGQSLEQIERDTERDRFLSAPEAVEYGLVDSILTHRN
>Q5NH47 3.4.21.92~~~clpP~~~ATP-dependent Clp protease proteolytic subunit~~~COG0740
MITNNLVPTVIEKTAGGERAFDIYSRLLKERIVFLNGEVNDHSANLVIAQLLFLESEDPDKDIYFYINSPGGMVTAGMGV
YDTMQFIKPDVSTICIGLAASMGSLLLAGGAKGKRYSLPSSQIMIHQPLGGFRGQASDIEIHAKNILRIKDRLNKVLAHH
TGQDLETIVKDTDRDNFMMADEAKAYGLIDHVIESREAIIK
>P56156 3.4.21.92~~~clpP~~~ATP-dependent Clp protease proteolytic subunit~~~COG0740
MMGYIPYVIENTDRGERSYDIYSRLLKDRIVLLSGEINDSVASSIVAQLLFLEAEDPEKDIGLYINSPGGVITSGLSIYD
TMNFIRPDVSTICIGQAASMGAFLLSCGAKGKRFSLPHSRIMIHQPLGGAQGQASDIEIISNEILRLKGLMNSILAQNSG
QSLEQIAKDTDRDFYMSAKEAKEYGLIDKVLQKNVK
>Q9ZAB0 3.4.21.92~~~clpP~~~ATP-dependent Clp protease proteolytic subunit~~~COG0740
MGYLVPTVIEQSSRGERAYDIYSRLLKDRIIMLTGPVEDGMANSIIAQLLFLDAQDNTKDIYLYVNTPGGSVSAGLAIVD
TMNFIKSDVQTIVMGMAASMGTIIASSGTKGKRFMLPNAEYLIHQPMGGAGQGTQQTDMAIVAEQLLKTRKRLEQILADN
SNRSLEQIHKDAERDHWMDAKETLEYGFIDEIMENNSLK
>Q9RQI6 3.4.21.92~~~clpP~~~ATP-dependent Clp protease proteolytic subunit~~~COG0740
MNLIPTVIEQTSRGERAYDIYSRLLKDRIIMLGSAIDDNVANSIVSQLLFLDAQDPEKDIFLYINSPGGSISAGMAIYDT
MNFVKADVQTIGMGMAASMGSFLLTAGANGKRFALPNAEIMIHQPLGGAQGQATEIEIAARHILKIKERMNTIMAEKTGQ
PYEVIARDTDRDNFMTAQEAKDYGLIDDIIINKSGLKG
>Q9JZ38 3.4.21.92~~~clpP~~~ATP-dependent Clp protease proteolytic subunit~~~
MSFDNYLVPTVIEQSGRGERAFDIYSRLLKERIVFLVGPVTDESANLVVAQLLFLESENPDKDIFFYINSPGGSVTAGMS
IYDTMNFIKPDVSTLCLGQAASMGAFLLSAGEKGKRFALPNSRIMIHQPLISGGLGGQASDIEIHARELLKIKEKLNRLM
AKHCDRDLADLERDTDRDNFMSAEEAKEYGLIDQILENRASLRL
>Q2G036 3.4.21.92~~~clpP~~~ATP-dependent Clp protease proteolytic subunit~~~COG0740
MNLIPTVIETTNRGERAYDIYSRLLKDRIIMLGSQIDDNVANSIVSQLLFLQAQDSEKDIYLYINSPGGSVTAGFAIYDT
IQHIKPDVQTICIGMAASMGSFLLAAGAKGKRFALPNAEVMIHQPLGGAQGQATEIEIAANHILKTREKLNRILSERTGQ
SIEKIQKDTDRDNFLTAEEAKEYGLIDEVMVPETK
>Q2YSF8 3.4.21.92~~~clpP~~~ATP-dependent Clp protease proteolytic subunit~~~
MNLIPTVIETTNRGERAYDIYSRLLKDRIIMLGSQIDDNVANSIVSQLLFLQAQDSEKDIYLYINSPGGSVTAGFAIYDT
IQHIKPDVQTICIGMAASMGSFLLAAGAKGKRFALPNAEVMIHQPLGGAQGQATEIEIAANHILKTREKLNRILSERTGQ
SIEKIQKDTDRDNFLTAEEAKEYGLIDEVMVPETK
>P99089 3.4.21.92~~~clpP~~~ATP-dependent Clp protease proteolytic subunit~~~
MNLIPTVIETTNRGERAYDIYSRLLKDRIIMLGSQIDDNVANSIVSQLLFLQAQDSEKDIYLYINSPGGSVTAGFAIYDT
IQHIKPDVQTICIGMAASMGSFLLAAGAKGKRFALPNAEVMIHQPLGGAQGQATEIEIAANHILKTREKLNRILSERTGQ
SIEKIQKDTDRDNFLTAEEAKEYGLIDEVMVPETK
>P63786 3.4.21.92~~~clpP~~~ATP-dependent Clp protease proteolytic subunit~~~
MNLIPTVIETTNRGERAYDIYSRLLKDRIIMLGSQIDDNVANSIVSQLLFLQAQDSEKDIYLYINSPGGSVTAGFAIYDT
IQHIKPDVQTICIGMAASMGSFLLAAGAKGKRFALPNAEVMIHQPLGGAQGQATEIEIAANHILKTREKLNRILSERTGQ
SIEKIQKDTDRDNFLTAEEAKEYGLIDEVMVPETK
>Q5XDM4 3.4.21.92~~~clpP~~~ATP-dependent Clp protease proteolytic subunit~~~
MIPVVIEQTSRGERSYDIYSRLLKDRIIMLTGPVEDNMANSVIAQLLFLDAQDNTKDIYLYVNTPGGSVSAGLAIVDTMN
FIKADVQTIVMGMAASMGTVIASSGTKGKRFMLPNAEYMIHQPMGGTGGGTQQTDMAIAAEHLLKTRHRLEKILAQNAGK
TIKQIHKDAERDYWMSAEETLAYGFIDEIMENNELK
>P63788 3.4.21.92~~~clpP~~~ATP-dependent Clp protease proteolytic subunit~~~COG0740
MIPVVIEQTSRGERSYDIYSRLLKDRIIMLTGPVEDNMANSVIAQLLFLDAQDSTKDIYLYVNTPGGSVSAGLAIVDTMN
FIKADVQTIVMGMAASMGTVIASSGAKGKRFMLPNAEYMIHQPMGGTGGGTQQTDMAIAAEHLLKTRNTLEKILAENSGQ
SMEKVHADAERDNWMSAQETLEYGFIDEIMANNSLN
>Q5SKM8 3.4.21.92~~~clpP~~~ATP-dependent Clp protease proteolytic subunit~~~COG0740
MVIPYVIEQTARGERVYDIYSRLLKDRIIFLGTPIDAQVANVVVAQLLFLDAQNPNQEIKLYINSPGGEVDAGLAIYDTM
QFVRAPVSTIVIGMAASMAAVILAAGEKGRRYALPHAKVMIHQPWGGVRGTASDIAIQAQEILKAKKLLNEILAKHTGQP
LEKVEKDTDRDYYLSAQEALEYGLIDQVVTREEA
>Q5H434 3.4.21.92~~~clpP~~~ATP-dependent Clp protease proteolytic subunit~~~
MSIVTKALNLVPMVVEQTSRGERAYDIYSRLLKERLIFLVGPIDDHMANVIVAQLLFLEADNPEKDISIYINSPGGVVTA
GMAIYDTMQYIKPDVSTICVGQAASMGALLLASGAAGKRYALPNSRVMIHQPLGGFQGQATDIDIHAREILTLRSRLNEI
LAKHTGQSLETIARDTERDNFKSAVDAQAYGLVDQVLERRPEESIQPS
>P39070 3.4.21.-~~~clpQ~~~ATP-dependent protease subunit ClpQ~~~COG5405
MSSFHATTIFAVQHKGRSAMSGDGQVTFGQAVVMKHTARKVRKLFNGKVLAGFAGSVADAFTLFEKFEAKLEEYNGNLKR
AAVELAKEWRSDKVLRKLEAMLIVMNQDTLLLVSGTGEVIEPDDGILAIGSGGNYALAAGRALKKHAGESMSASEIARAA
LETAGEICVYTNDQIILEELE
>Q8UD95 ~~~clpS2~~~ATP-dependent Clp protease adapter protein ClpS 2~~~COG2127
MSDSPVDLKPKPKVKPKLERPKLYKVMLLNDDYTPREFVTVVLKAVFRMSEDTGRRVMMTAHRFGSAVVVVCERDIAETK
AKEATDLGKEAGFPLMFTTEPEE
>Q9A5I0 ~~~clpS~~~ATP-dependent Clp protease adapter protein ClpS~~~COG2127
MICPPGENKSMAERKQGGQGNGVGSSVVTEVKPKTQKPSLYRVLILNDDYTPMEFVVYVLERFFNKSREDATRIMLHVHQ
NGVGVCGVYTYEVAETKVAQVIDSARRHQHPLQCTMEKD
>P0A8Q6 ~~~clpS~~~ATP-dependent Clp protease adapter protein ClpS~~~COG2127
MGKTNDWLDFDQLAEEKVRDALKPPSMYKVILVNDDYTPMEFVIDVLQKFFSYDVERATQLMLAVHYQGKAICGVFTAEV
AETKVAMVNKYARENEHPLLCTLEKA
>P9WPC1 ~~~clpS~~~ATP-dependent Clp protease adapter protein ClpS~~~COG2127
MAVVSAPAKPGTTWQRESAPVDVTDRAWVTIVWDDPVNLMSYVTYVFQKLFGYSEPHATKLMLQVHNEGKAVVSAGSRES
MEVDVSKLHAAGLWATMQQDR
>Q8GED7 2.3.2.20~~~albC~~~Cyclo(L-leucyl-L-phenylalanyl) synthase~~~
MLAGLVPAPDHGMREEILGDRSRLIRQRGEHALIGISAGNSYFSQKNTVMLLQWAGQRFERTDVVYVDTHIDEMLIADGR
SAQEAERSVKRTLKDLRRRLRRSLESVGDHAERFRVRSLSELQETPEYRAVRERTDRAFEEDAEFATACEDMVRAVVMNR
PGDGVGISAEHLRAGLNYVLAEAPLFADSPGVFSVPSSVLCYHIDTPITAFLSRRETGFRAAEGQAYVVVRPQELADAA
>Q9I742 ~~~clpV1~~~AAA+ ATPase ClpV1~~~
MSEISRVALFGKLNSLAYKAIEAATVFCKLRGNPYVELVHWFHQILQLPDSDLHQIVRQSGIDPARLAKDLTEALDRLPR
GSTSITDLSSHVEEAVERGWVYGSLMFGESQVRTGYLVIGILKTPSLRHALTGLSAEFAKLKVEALTERFDEYVGASPEN
GLSASDGFNAGAAPGEASGALAPSAMGKQEALKRFTVDLTEQARSGKLDPIVGRDEEIRQLVDILMRRRQNNPILTGEAG
VGKTAVVEGFALRIVAGDVPPALKDVELRALDVGLLQAGASMKGEFEQRLRQVIEDVQSSEKPIILFIDEAHTLVGAGGA
AGTGDAANLLKPALARGTLRTVAATTWAEYKKHIEKDPALTRRFQVVQVDEPSEHKAILMMRGVASTMEKHHQVQILDEA
LEAAVRLSHRYIPARQLPDKSVSLLDTACARTAISLHAVPAEVDDSRRRIEALETELAIIRRESAIGVATAERQRNAETL
LAEERERLAALEQRWAEEKRLVDELLETRARLRAAAEAVDAGGVPLGEGEVRLDEEQRQALHARLAELQAQLSALQGEEP
LILPTVDYQAVASVVADWTGIPVGRMARNEIETVLNLDRHLKKRIIGQDHALEMIAKRIQTSRAGLDNPSKPIGVFMLAG
TSGVGKTETALALAEAMYGGEQNVITINMSEFQEAHTVSTLKGAPPGYIGYGEGGVLTEAVRRKPYSVVLLDEVEKAHPD
VHEIFFQVFDKGVMEDGEGRVIDFKNTLILLTTNAGTEMIASLCADPELMPEPEAIAKSLREPLLKIFPPALLGRLVTIP
YYPLSDDMLKAISRLQLGRIKKRVEATHKVPFEFDEGVVDLIVSRCTETESGGRMIDAILTNTLLPDMSREFLTRMLEGK
PLAGVRISSRDNQFHYDFAEAE
>P50866 ~~~clpX~~~ATP-dependent Clp protease ATP-binding subunit ClpX~~~COG1219
MFKFNEEKGQLKCSFCGKTQDQVRKLVAGPGVYICDECIELCTEIVEEELGTEEEVEFKDVPKPQEIREILNEYVIGQDQ
AKKSLAVAVYNHYKRINSNSKVDDVELSKSNISLIGPTGSGKTLLAQTLARILNVPFAIADATSLTEAGYVGEDVENILL
KLIQAADYDVEKAEKGIIYIDEIDKVARKSENPSITRDVSGEGVQQALLKILEGTVASVPPQGGRKHPHQEFIQIDTTNI
LFICGGAFDGIEQIIKRRLGQKVIGFGADNKAADLEKEDLLSKVLPEDLLRFGLIPEFIGRLPVIASLEKLDEEALVAIL
TKPKNALVKQFKKMLELDNVELEFEEEALSEIAKKAIERKTGARGLRSIIEGIMLDVMFELPSRDDIEKCVITGATVTHG
EPPRLLLKDGTEVSQDKTSA
>P0CAU2 ~~~clpX~~~ATP-dependent Clp protease ATP-binding subunit ClpX~~~COG1219
MTKAASGDTKSTLYCSFCGKSQHEVRKLIAGPTVFICDECVELCMDIIREEHKIAFVKSKDGVPTPREICEVLDDYVIGQ
GHAKKVLAVAVHNHYKRLNHASKNNDVELAKSNILLVGPTGTGKTLLAQTLARIIDVPFTMADATTLTEAGYVGEDVENI
VLKLLQAADYNVERAQRGIVYIDEIDKISRKSDNPSITRDVSGEGVQQALLKIMEGTVASVPPQGGRKHPQQEFLQVDTT
NILFICGGAFAGLEKIISARGAAKSIGFGAKVTDPEERRTGEILRNVEPDDLQRFGLIPEFIGRLPVVATLEDLDEAALV
KILTEPKNAFVKQYQRLFEMENIGLTFTEDALHQVAKKAIARKTGARGLRSIMEGILLETMFELPTYEGVEEVVVNAEVV
EGRAQPLLIYAEKKGGAASA
>B8GX14 ~~~clpX~~~ATP-dependent Clp protease ATP-binding subunit ClpX~~~
MTKAASGDTKSTLYCSFCGKSQHEVRKLIAGPTVFICDECVELCMDIIREEHKIAFVKSKDGVPTPREICEVLDDYVIGQ
GHAKKVLAVAVHNHYKRLNHASKNNDVELAKSNILLVGPTGTGKTLLAQTLARIIDVPFTMADATTLTEAGYVGEDVENI
VLKLLQAADYNVERAQRGIVYIDEIDKISRKSDNPSITRDVSGEGVQQALLKIMEGTVASVPPQGGRKHPQQEFLQVDTT
NILFICGGAFAGLEKIISARGAAKSIGFGAKVTDPEERRTGEILRNVEPDDLQRFGLIPEFIGRLPVVATLEDLDEAALV
KILTEPKNAFVKQYQRLFEMENIGLTFTEDALHQVAKKAIARKTGARGLRSIMEGILLETMFELPTYEGVEEVVVNAEVV
EGRAQPLLIYAEKKGGAASA
>P0A6H1 ~~~clpX~~~ATP-dependent Clp protease ATP-binding subunit ClpX~~~COG1219
MTDKRKDGSGKLLYCSFCGKSQHEVRKLIAGPSVYICDECVDLCNDIIREEIKEVAPHRERSALPTPHEIRNHLDDYVIG
QEQAKKVLAVAVYNHYKRLRNGDTSNGVELGKSNILLIGPTGSGKTLLAETLARLLDVPFTMADATTLTEAGYVGEDVEN
IIQKLLQKCDYDVQKAQRGIVYIDEIDKISRKSDNPSITRDVSGEGVQQALLKLIEGTVAAVPPQGGRKHPQQEFLQVDT
SKILFICGGAFAGLDKVISHRVETGSGIGFGATVKAKSDKASEGELLAQVEPEDLIKFGLIPEFIGRLPVVATLNELSEE
ALIQILKEPKNALTKQYQALFNLEGVDLEFRDEALDAIAKKAMARKTGARGLRSIVEAALLDTMYDLPSMEDVEKVVIDE
SVIDGQSKPLLIYGKPEAQQASGE
>O25926 ~~~clpX~~~ATP-dependent Clp protease ATP-binding subunit ClpX~~~COG1219
MNETLYCSFCKKPESRDPKKRRIIFASNLNKDVCVCEYCIDVMHGELHKYDNSLLALKRDRLRRMESSAYEEEFLLSYIP
APKELKAVLDNYVIGQEQAKKVFSVAVYNHYKRLSFKEKLKKQDNQDSNVELEHLEEVELSKSNILLIGPTGSGKTLMAQ
TLAKHLDIPIAISDATSLTEAGYVGEDVENILTRLLQASDWNVQKAQKGIVFIDEIDKISRLSENRSITRDVSGEGVQQA
LLKIVEGSLVNIPPKGGRKHPEGNFIQIDTSDILFICAGAFDGLAEIIKKRTTQNVLGFTQEKMSKKEQEAILHLVQTHD
LVTYGLIPELIGRLPVLSTLDSISLEAMVDILQKPKNALIKQYQQLFKMDEVDLIFEEEAIKEIAQLALERKTGARGLRA
IIEDFCLDIMFDLPKLKGSEVRITKDCVLKQAEPLIIAKTHSKILP
>Q8Y7K9 ~~~clpX~~~ATP-dependent Clp protease ATP-binding subunit ClpX~~~COG1219
MFKFNDEKGQLKCSFCGKTQDQVRKLVAGPGVYICDECIELCNEIIEEELGISEFVDFGEVPKPQEIRHILSDYVIGQER
AKKALAVAVYNHYKRINSNETKEDEVELSKSNICLIGPTGSGKTLLAQTLARILNVPFAIADATSLTEAGYVGEDVENIL
LKLIQSADYDVEKAEKGIIYIDEIDKVARKSENPSITRDVSGEGVQQALLKILEGTVASVPPQGGRKHPHQELIQIDTGN
ILFIVGGAFDGIEQIVKNRMGEKVIGFGTDNAKLKDDETYLSRVVPEDLLKFGLIPEFIGRLPVIATLEQLDEAALVSIL
TEPKNALVKQYKRMLELDDVELEFEPTALIEIAKEAIERKTGARGLRSIIEQIMLEVMFEIPSRDDITKCIITEKAARGE
EEPQLQLEDGSIIPIKTSA
>P9WPB9 ~~~clpX~~~ATP-dependent Clp protease ATP-binding subunit ClpX~~~COG1219
MARIGDGGDLLKCSFCGKSQKQVKKLIAGPGVYICDECIDLCNEIIEEELADADDVKLDELPKPAEIREFLEGYVIGQDT
AKRTLAVAVYNHYKRIQAGEKGRDSRCEPVELTKSNILMLGPTGCGKTYLAQTLAKMLNVPFAIADATALTEAGYVGEDV
ENILLKLIQAADYDVKRAETGIIYIDEVDKIARKSENPSITRDVSGEGVQQALLKILEGTQASVPPQGGRKHPHQEFIQI
DTTNVLFIVAGAFAGLEKIIYERVGKRGLGFGAEVRSKAEIDTTDHFADVMPEDLIKFGLIPEFIGRLPVVASVTNLDKE
SLVKILSEPKNALVKQYIRLFEMDGVELEFTDDALEAIADQAIHRGTGARGLRAIMEEVLLPVMYDIPSRDDVAKVVVTK
ETVQDNVLPTIVPRKPSRSERRDKSA
>Q9JYY3 ~~~clpX~~~ATP-dependent Clp protease ATP-binding subunit ClpX~~~
MSNENRTCSFCGKSKSHVKHLIEGENAFICDECVSNCIEILHEDGNDGTPSESAGGEPEESGKLPTPAEIVANLNDHVIG
QEQAKKALAVSVYNHYKRLRHPKAGANVELSKSNILLIGPTGSGKTLLAQSLARKLDVPFVMADATTLTEAGYVGEDVEQ
IITKLLGKCDFDVEKAQRGIVYIDEIDKISRKSDNPSITRDVSGEGVQQALLKLIEGTVASVPPQGGRKHPNQEFINVDT
TNILFICGGAFAGLEKVIRQRTEKGGIGFGASVHSKDENADITKLFGIVEPEDLIKFGLIPELIGRLPVIATLEELDEDA
LINILTEPKNALVKQYQALFGMENVELEFEEGALRSIARQAMERKTGARGLRSIVERCLLDTMYRLPDLKGLKKVVVGKA
VIEEGREPELVFES
>P63790 ~~~clpX~~~ATP-dependent Clp protease ATP-binding subunit ClpX~~~
MFKFNEDEENLKCSFCGKDQDQVKKLVAGSGVYICNECIELCSEIVEEELAQNTSEAMTELPTPKEIMDHLNEYVIGQEK
AKKSLAVAVYNHYKRIQQLGPKEDDVELQKSNIALIGPTGSGKTLLAQTLAKTLNVPFAIADATSLTEAGYVGDDVENIL
LRLIQAADFDIDKAEKGIIYVDEIDKIARKSENTSITRDVSGEGVQQALLKILEGTTASVPPQGGRKHPNQEMIQIDTTN
ILFILGGAFDGIEEVIKRRLGEKVIGFSSNEADKYDEQALLAQIRPEDLQAYGLIPEFIGRVPIVANLETLDVTALKNIL
TQPKNALVKQYTKMLELDDVDLEFTEEALSAISEKAIERKTGARGLRSIIEESLIDIMFDVPSNENVTKVVITAQTINEE
TEPELYDAEGNLINNSKTSA
>P39778 ~~~clpY~~~ATP-dependent protease ATPase subunit ClpY~~~COG1220
MEKKPLTPRQIVDRLDQYIVGQQNAKKAVAVALRNRYRRSLLDEKLKDEVVPKNILMMGPTGVGKTEIARRIAKLSGAPF
IKIEATKFTEVGYVGRDVESMVRDLVETSVRLIKEEKMNEVKEQAEENANKRIVRLLVPGKKKQSGVKNPFEMFFGGSQP
NGEDEAESQEEANIEEKRKRMAHQLALGELEDYYVTVEVEEQQPSMFDMLQGSGMEQMGMNMQDALSGLMPKKKKRRKMT
VREARKVLTNEEASKLIDMDEVGQEAVQRAEESGIIFIDEIDKIAKNGGASSSADVSREGVQRDILPIVEGSTVVTKYGS
VKTDHVLFIAAGAFHMAKPSDLIPELQGRFPIRVELNKLTVDDFVRILVEPDNALLKQYQALLQTEGISLEFSDEAIHKI
AEVAYHVNQDTDNIGARRLHTILERLLEDLSFEAPDVTMEKITITPQYVEEKLGTIAKNKDLSQFIL
>Q8PQ45 ~~~clp~~~CRP-like protein Clp~~~COG0664
MSPGNTTVVTTTVRNATPSLALDAGTIERFLAHSHRRRYPTRTDVFRPGDPAGTLYYVISGSVSIIAEEDDDRELVLGYF
GSGEFVGEMGLFIESDTREVILRTRTQCELAEISYERLQQLFQTSLSPDAPKILYAIGVQLSKRLLDTTRKASRLAFLDV
TDRIVRTLHDLAKEPEAMSHPQGTQLRVSRQELARLVGCSREMAGRVLKKLQADGLLHARGKTVVLYGTR
>Q4UZF6 ~~~clp~~~CRP-like protein Clp~~~
MSLGNTTVVTTTVRNATPSLTLDAGTIERFLAHSHRRRYPTRTDVFRPGDPAGTLYYVISGSVSIIAEEDDDRELVLGYF
GSGEFVGEMGLFIESDTREVILRTRTQCELAEISYERLQQLFQTSLSPDAPRILYAIGVQLSKRLLDTTRKASRLAFLDV
TDRIVRTLHDLSKEPEAMSHPQGTQLRVSRQELARLVGCSREMAGRVLKKLQADGLLHARGKTVVLYGTR
>P22260 ~~~clp~~~CRP-like protein Clp~~~COG0664
MSLGNTTVVTTTVRNATPSLTLDAGTIERFLAHSHRRRYPTRTDVFRPGDPAGTLYYVISGSVSIIAEEDDDRELVLGYF
GSGEFVGEMGLFIESDTREVILRTRTQCELAEISYERLQQLFQTSLSPDAPRILYAIGVQLSKRLLDTTRKASRLAFLDV
TDRIVRTLHDLSKEPEAMSHPQGTQLRVSRQELARLVGCSREMAGRVLKKLQADGLLHARGKTVVLYGTR
>P60068 1.97.1.1~~~clrA~~~Chlorate reductase subunit alpha~~~
MNSPDEHNGRRRFLQFSAAALASAAASPSLWAFSKIQPIEDPLKDYPYRDWEDLYRKEWTWDSVGVMTHSNGCVAGCAWN
VFVKNGIPMREEQISKYPQLPGIPDMNPRGCQKGAVYCSWSKQPDHIKWPLKRVGERGERKWKRISWDEALTEIADKIID
TTVKRGPGNIYIPKRPFAVITNTAYTRMTKLLGAISPDATSMTGDLYTGIQTVRVPASTVSTFDDWFTSDLILMWHKNPI
VTRIPDAHFLMEARYNGARLVNISADYNPSSIHSDLFVPVTSGTDSHLAAALVNVLIAGKHYKADYLKEQTALPFLVRTD
NGKFLREKDFKADGSDQVFYVWDTKAGKAVLAPGSMGSKDKTLKLGTIDPALEGNFETHGIKVTTVFERLKAEITPYTPE
ATQATTGVHPSVVRQLAGWIAECKALRILDGYNNQKHFDGFQCGRLKILILTLIGHHGTTGSIDTTFEGWRLEGNSELGT
VKGKPGRSVSAVLAQWVWGEQYQRSKDYFNDAQLREELGFGVDEMESMRKESEANGWMPNWQSIKEPVVSITGGINMFAT
SNGYQHLRDNFLKRCELNVVVDFRLNSGAMYADIVLPAAENTEKLDIRETSVTRFIHAFGQPVKPMYERKTDWQIMVALA
AKIQERAKARGIARVDDPEIKSGIDFDKIYDEFTMNGKVVTDEQAVRFVMDNSKALGPGTYEEVMKNGFVAVGPSAGKTG
PVPKDKPYRPFTVNVTDKKPYGTLTGRLQFYVDHDWFQRLGATVPKPQYRGGVLGPKKYPFVRNSPHARWGVHSFARTEQ
WMLRHQRGEPDVRMSPKAMAAKGIKDGDMVRIFNDSGEFFAVVKAMPALPDNMLFTEHGWEQYQYKNMTHYNMVSSELIN
PLELVGGYGHIKYTSGGFNPNRIFYETTVDVEKA
>P60069 ~~~clrB~~~Chlorate reductase subunit beta~~~
MSQRQVAYVFDLNKCIGCHTCTMACKQLWTNRDGREYMYWNNVETRPGKGYPKNWEGKGGGFDQEGKLKTNGIIPIMADY
GGRIGDFNLNEVLLEGKADQVVPHEKATWGPNWDEDEGKGEFPNNHSFYLPRICNHCSNPACLAACPTKAIYKRPEDGIV
VVDQTRCRGYRYCVKACPYGKMYFNLQKGKSEKCIGCYPRVEKGEAPACVKQCSGRIRFWGYRDDKNGPIYKLVEQWKVA
LPLHAEYGTEPNVFYVPPMNTTPPPFEEDGRLGDKPRIPIEDLEALFGPGVKQALATLGGEMAKRRKAQASELTDILIGF
TNKDRYGV
>P60000 ~~~clrC~~~Chlorate reductase subunit gamma~~~
MKTNILVKRMAVIGLAVAAACTGAAAAAQGAVPQAQRIIRVLSVAGGDAASPQAAVWKKAPTTQVTLLTAFPGHISIVGT
AATQKLAAQAVRASGRLFVRLAWSDRTANTVMKDTDQFLDGAAVEFPVNGKVATLPFMGDPVNVVNVWHWRADGRTLNLL
AKGFGTSTPVPTEDLRSASVRTGDGWEVVLSRPLRVKAEEGANLQGRRTMPIGFAAWDGENQERDGLKAVTMEWWQLRF
>P60001 ~~~clrD~~~Chlorate reductase assembly chaperone protein~~~
MNTLIDNPKAMASGYLAMAQMFSYPDADAWRRLTENGLVDPALGRETLEAEYLGLFEMGGGTSTMSLYEGQNRPERGRDG
ILQELLRFYEFFDVHLNQDEREYPDHLVTELEFLAWLCLQEHAALRDGRDAEPFQNAARDFLVRHLAAWLPDFRQRLEAT
ETTYAQYGPTLGELVETHRSRLGDQPQKSREMQ
>P45860 2.7.8.-~~~ywiE~~~Probable cardiolipin synthase YwiE~~~COG1502
MLKRRLEFFFLYMMLIGAYVIWFFPVSRLEFYGGLLCYISIILFSIYSLILENRTSQHTLLWIHILVFFPIVGYVFYLFS
GQLYVKGKLFKTKRMYNREKLRKLFDKEETPEVTGLKDNQERFFTYSIRAAHMNINTKSNIKVLKNGEETFPDIFKAMRK
AESYIHIEYYMFKSDMLGRGMMDIMMEKARQGVEVRFLYDAAGSMKLARRDIMRMKQAGVDIVPFSPLKYGFFNQKLNFR
NHRKIVIIDGKTGFVGGLNVGKEYISRDPYIGFWRDTHLRLEGEIVQTLHAIFMLDWEYVSNEVLIDQEEYNTPVPVEGG
GIYQIVATGPDMKESMSDLYYEMISSAQKSIWIATPYFVPNESIRTALKAAATKGVEVRVMVPEKNDSFLTQYASRSYFP
ELLLEGIEVYSYQKGFMHQKVMIIDGDLASVGTANMDMRSFQLNFEVNVFFTDAEAIRTLEAHFEEDMQESEKLSPVGFY
KRGVADRTKESFARLFSGVL
>P71040 2.7.8.-~~~clsA~~~Major cardiolipin synthase ClsA~~~COG1502
MSISSILLSLFFILNILLAIIVIFKERRDASASWAWLLVLFFIPVLGFILYLLFGHNLRRKHLFQWEDRKKIGIERLLKH
QLEDLETKQFQFNNRATFDNKDLIYMLIMNNHAVFTEDNSVDVITDGRDKFQRLLSDISKAKDHIHLQYYIYKGDELGKK
LRDALIQKAKEGVQVRVLYDELGSRTLRKKFFKELREAGGHVEVFFPSKLRPINLRLNYRNHRKLVIIDGMTGYVGGFNV
GDEYLGLNPKFGYWRDTHIRLQGTAVHAIQTRFILDWNQASHHHTLTYIPNHFPDYGPKGNVGMQIVTSGPDSEWEQIKN
GYIKMISNAKRSILIQTPYFIPDASLLDALRIACLSGIDVNIMIPNKPDHAFVYWATLSYIGDLLKAGATVYIYDNGFIH
AKTIVVDDEIASVGTANIDVRSFRLNFEVNAFIYDITIAKKLVSTFKEDLLVSRKFTYEEYLQRPLWIRIKESVSRLLSP
IL
>P0A6H8 2.7.8.-~~~clsA~~~Cardiolipin synthase A~~~COG1502
MTTVYTLVSWLAILGYWLLIAGVTLRILMKRRAVPSAMAWLLIIYILPLVGIIAYLAVGELHLGKRRAERARAMWPSTAK
WLNDLKACKHIFAEENSSVAAPLFKLCERRQGIAGVKGNQLQLMTESDDVMQALIRDIQLARHNIEMVFYIWQPGGMADQ
VAESLMAAARRGIHCRLMLDSAGSVAFFRSPWPELMRNAGIEVVEALKVNLMRVFLRRMDLRQHRKMIMIDNYIAYTGSM
NMVDPRYFKQDAGVGQWIDLMARMEGPIATAMGIIYSCDWEIETGKRILPPPPDVNIMPFEQASGHTIHTIASGPGFPED
LIHQALLTAAYSAREYLIMTTPYFVPSDDLLHAICTAAQRGVDVSIILPRKNDSMLVGWASRAFFTELLAAGVKIYQFEG
GLLHTKSVLVDGELSLVGTVNLDMRSLWLNFEITLAIDDKGFGADLAAVQDDYISRSRLLDARLWLKRPLWQRVAERLFY
FFSPLL
>P45865 ~~~clsB~~~Minor cardiolipin synthase ClsB~~~COG1502
MKVFIVIMIIVVIFFALILLDIFMGRAGYRKKAYEPVFSKKKSDIELIHCGADLVERMMNDIRQAASSVHMMFFIMKNDE
VSHNMYTLLKTKAQAGVSVYLLLDWAGCRAIKKTALQTMKNAGVHVHVMNRPRFPFFFFHMQKRNHRKITVIDGKIGYIG
GFNIAEEYLGKKAKFGNWEDYHLRMIGEGVHDLQTLFASDLKRNTGIELGSDVWPKLQQGTISHKIYATDGYSLENIYLA
NIAQAKNRLTVCTPYYIPSKPLQEALINARKNGVSVRIIVPMKSDHPLVREAAFTYYSELLDAGCLIYRYYQGFYHVKAL
IIDDHLSIIGTANFDKRSLFLNEEVNVEIDDEAFTSEVYATIEEDMKKSELLTMEDFSKRTFRQRPAEWLGRALSYFL
>P0AA84 2.7.8.-~~~clsB~~~Cardiolipin synthase B~~~COG1502
MKCSWREGNKIQLLENGEQYYPAVFKAIGEAQERIILETFIWFEDDVGKQLHAALLAAAQRGVKAEVLLDGYGSPDLSDE
FVNELTAAGVVFRYYDPRPRLFGMRTNVFRRMHRKIVVIDARIAFIGGLNYSAEHMSSYGPEAKQDYAVRLEGPIVEDIL
QFELENLPGQSAARRWWRRHHKAEENRQPGEAQVLLVWRDNEEHRDDIERHYLKMLTQARREVIIANAYFFPGYRFLHAL
RKAARRGVRIKLIIQGEPDMPIVRVGARLLYNYLVKGGVQVFEYRRRPLHGKVALMDDHWATVGSSNLDPLSLSLNLEAN
VIIHDRHFNQTLRDNLNGIIAADCQQVDETMLPKRTWWNLTKSVLAFHFLRHFPALVGWLPAHTPRLAQVDPPAQPTMET
QDRVETENTGVKP
>P75919 2.7.8.-~~~clsC~~~Cardiolipin synthase C~~~COG1502
MPRLASAVLPLCSQHPGQCGLFPLEKSLDAFAARYRLAEMAEHTLDVQYYIWQDDMSGRLLFSALLAAAKRGVRVRLLLD
DNNTPGLDDILRLLDSHPRIEVRLFNPFSFRLLRPLGYITDFSRLNRRMHNKSFTVDGVVTLVGGRNIGDAYFGAGEEPL
FSDLDVMAIGPVVEDVADDFARYWYCKSVSPLQQVLDVPEGEMADRIELPASWHNDAMTHRYLRKMESSPFINHLVDGTL
PLIWAKTRLLSDDPAKGEGKAKRHSLLPQRLFDIMGSPSERIDIISSYFVPTRAGVAQLLRMVRKGVKIAILTNSLAAND
VAVVHAGYARWRKKLLRYGVELYELKPTREQSSTLHDRGITGNSGASLHAKTFSIDGKTVFIGSFNFDPRSTLLNTEMGF
VIESETLAQLIDKRFIQSQYDAAWQLRLDRWGRINWVDRHAKKEIILKKEPATSFWKRVMVRLASILPVEWLL
>P63801 2.7.8.-~~~cls~~~Cardiolipin synthase~~~
MIELLSIALKHSNIILNSIFIGAFILNLLFAFTIIFMERRSANSIWAWLLVLVFLPLFGFILYLLLGRQIQRDQIFKIDK
EDKKGLELIVDEQLAALKNENFSNSNYQIVKFKEMIQMLLYNNAAFLTTDNDLKIYTDGQEKFDDLIQDIRNATDYIHFQ
YYIIQNDELGRTILNELGKKAEQGVEVKILYDDMGSRGLRKKGLRPFRNKGGHAEAFFPSKLPLINLRMNNRNHRKIVVI
DGQIGYVGGFNVGDEYLGKSKKFGYWRDTHLRIVGDAVNALQLRFILDWNSQATRDHISYDDRYFPDVNSGGTIGVQIAS
SGPDEEWEQIKYGYLKMISSAKKSIYIQSPYFIPDQAFLDSIKIAALGGVDVNIMIPNKPDHPFVFWATLKNAASLLDAG
VKVFHYDNGFLHSKTLVIDDEIASVGTANMDHRSFTLNFEVNAFIYDQQIAKKLKQAFIDDLAVSSELTKARYAKRSLWI
KFKEGISQLLSPIL
>Q6TNA5 6.2.1.46~~~CmaA~~~L-allo-isoleucine:holo-[CmaA peptidyl-carrier protein] ligase~~~
MTSYHSHPPKAYRRFESVCTQAPNAIAVVHEGKPVTYQQLQTQVLERSEALIRQGLADHPYMPLMANRCLEYLITMLACC
KLGITYVSIEPSTPSKRLIAVLEQLGCNHLLLLGQPTDLRPDPTLTCFRLDDCGTLCSDGPALRQPIRRRLDDASVITVM
FTSGTTGVPKGVRISQDGLLNLVDNVQQQVQGKPRSYVHHSSIGFDAALFEVWVPLLTGACVTLQPSEFNIDALDHCVRA
ASCDVLLLTTSLFHLVAQHRLSMLEAVRVLYVGGEVLKPVHARALLLANPRITLVNGYGPTENTVFSTWYSLNKPEDAER
DVIPIGQFLHQVHGKIVDAKLQEVEVGTPGELLLTGANLALGYLDEALTPTRFLQLPEGTYYRTGDYVIQDEHGMLFYQG
RIDEQVKIKGFRVEIAEVEHALTQLPGVAQAVVQAHVMNDLENSLHAFIVFRHGSPTIEESKLMSLLGDRLPHYMVPRRI
HYLAELPLTANGKVDKRSLQPPEKAAVVSPQAGSAVLEIWSGILGTRNLQLEHSIYGYGASSLSVVMAHSRINEILGRTT
PFDEVARLSTFQEWVQYYATHADPVTSLRSQHGNH
>Q6TNA6 2.3.2.28~~~cmaE~~~L-allo-isoleucyltransferase~~~
MHSLFTLNLDLERAGKVICAQSLPFDTARETLLLLSPVGTQCCYMKNAALFLISHFNLIILESDTWLAYANEAGVNPEEG
VADFIRQFNAALPEPVRVDALVGYCSSAPLALLAANQGACRTLLLLNGAYFLKDDGVIKSQYERDVERMMQSIPQGNCAQ
VYEAVSLLHTQSTYTPSDYRYQQVRPLRELSAFRQYLTFLNNLASLELVRIAQAVKTPTLVWCGSQDRYTDTASSRYIAQ
LLPHSELVEDPDGQHHDFVDGHERLYLTMTRFLTRHKQRAIQ
>A5U866 2.1.1.79~~~cmaA1~~~Cyclopropane mycolic acid synthase 1~~~COG2230
MPDELKPHFANVQAHYDLSDDFFRLFLDPTQTYSCAYFERDDMTLQEAQIAKIDLALGKLGLQPGMTLLDVGCGWGATMM
RAVEKYDVNVVGLTLSKNQANHVQQLVANSENLRSKRVLLAGWEQFDEPVDRIVSIGAFEHFGHERYDAFFSLAHRLLPA
DGVMLLHTITGLHPKEIHERGLPMSFTFARFLKFIVTEIFPGGRLPSIPMVQECASANGFTVTRVQSLQPHYAKTLDLWS
AALQANKGQAIALQSEEVYERYMKYLTGCAEMFRIGYIDVNQFTCQK
>P9WPB7 2.1.1.79~~~cmaA1~~~Cyclopropane mycolic acid synthase 1~~~COG2230
MPDELKPHFANVQAHYDLSDDFFRLFLDPTQTYSCAYFERDDMTLQEAQIAKIDLALGKLGLQPGMTLLDVGCGWGATMM
RAVEKYDVNVVGLTLSKNQANHVQQLVANSENLRSKRVLLAGWEQFDEPVDRIVSIGAFEHFGHERYDAFFSLAHRLLPA
DGVMLLHTITGLHPKEIHERGLPMSFTFARFLKFIVTEIFPGGRLPSIPMVQECASANGFTVTRVQSLQPHYAKTLDLWS
AALQANKGQAIALQSEEVYERYMKYLTGCAEMFRIGYIDVNQFTCQK
>P9WPB5 2.1.1.79~~~cmaA2~~~Cyclopropane mycolic acid synthase 2~~~COG2230
MTSQGDTTSGTQLKPPVEAVRSHYDKSNEFFKLWLDPSMTYSCAYFERPDMTLEEAQYAKRKLALDKLNLEPGMTLLDIG
CGWGSTMRHAVAEYDVNVIGLTLSENQYAHDKAMFDEVDSPRRKEVRIQGWEEFDEPVDRIVSLGAFEHFADGAGDAGFE
RYDTFFKKFYNLTPDDGRMLLHTITIPDKEEAQELGLTSPMSLLRFIKFILTEIFPGGRLPRISQVDYYSSNAGWKVERY
HRIGANYVPTLNAWADALQAHKDEAIALKGQETYDIYMHYLRGCSDLFRDKYTDVCQFTLVK
>P9WPB3 2.1.1.79~~~pcaA~~~Cyclopropane mycolic acid synthase 3~~~COG2230
MSVQLTPHFGNVQAHYDLSDDFFRLFLDPTQTYSCAYFERDDMTLQEAQIAKIDLALGKLNLEPGMTLLDIGCGWGATMR
RAIEKYDVNVVGLTLSENQAGHVQKMFDQMDTPRSRRVLLEGWEKFDEPVDRIVSIGAFEHFGHQRYHHFFEVTHRTLPA
DGKMLLHTIVRPTFKEGREKGLTLTHELVHFTKFILAEIFPGGWLPSIPTVHEYAEKVGFRVTAVQSLQLHYARTLDMWA
TALEANKDQAIAIQSQTVYDRYMKYLTGCAKLFRQGYTDVDQFTLEK
>F2RB80 1.14.99.65~~~cmlA~~~4-amino-L-phenylalanyl-[CmlP-peptidyl-carrier-protein] 3-hydroxylase~~~COG2220
MRYSLRQDIAVEPVIAGWYGWSYLLPPQTLARFVHNRFNRIVESYLDDPQVHAAAVRQRRMHGGPWIHAHEHRDAIEAWY
RETAPRRERLDELFEAVRRLEEDILPRHHGECLDPVYQELPAALAGRVEVFYGRDNRTADYRFVEPLMYASEYYDESWQQ
VRFRPVTEDAREFALTTPMLEYGPEQLLVDVPLNSPLLDAVFRGGLTGTELDDLAARFGLDGERAARFASYFEPTPAASE
APAPASSSEEDVLEYVGHACVFARHRGTTFLVDPVLSYSGYPGGAENRFTFADLPERIDHLLITHNHQDHMLFETLLRIR
HRVGRVLVPKSTNASLVDPGLGGILRRLGFTDVVEVDDLETLSCGSAEVVALPFLGEHGDLRIRSKTGWLIRFGERSVLF
AADSTNISPTMYTKVAEVIGPVDTVFIGMESIGAAASWIYGPLYGEPLDRRTDQSRRLNGSNFPQAREIVDALEPDEVYV
YAMGLEPWMGVVMAVDYDESHPAIVDSDLLVRHVQDKGGTAERLHLRRTLRL
>F2RB83 1.14.99.67~~~cmlI~~~Alpha-N-dichloroacetyl-p-aminophenylserinol N-oxygenase~~~
MRDHTDEKSEAAGNDDGHVRIGGLPAFDPDDPAENAVINRLVGNWHRRAAVKREEPDVYALFDPGRPDFREDMIPFRGHP
IWERLSDETRSRLLSWGWVAYNRNTVLIEQRIANPAFELVIGGAYPGLGGQQLELAVAQAMVDEQYHTLMHINGSAVTRR
MRRSDFSDRVLPDSHITTIHQEHLDRCEEPWQRSLTTLGFATVAEISINAYLDLLADDQEIQVVNSTTVKLHNRDEYCHA
SISGEMMKQVYEALPADRRRFLLEKVVAGLEAFVAPDFTTWESIVAFEGVPGWEKAAAEVREAQGGTHLVQDHSGIHTLL
TEMDVLDQVEFGWGTTVTR
>F2RB81 6.2.1.-~~~cmlP~~~Non-ribosomal peptide synthetase CmlP~~~COG1020
MVELPDLLTDLFRRHRGRTALRTAGRTWTYEELDRVTSALARRIDAECPAGRRVLVAGEHTAEAVVWALAAMRSHAVHTP
MNPGLPADRFEEFARVADAALLVCFEREALVRGEKAGLRALYAGDVGWPTDPAPAPADGTADEPARSRVAYSIFTSGSTG
DPKLVDVGHGGLLNLCRSLRRLLDITPDDQVLHFASLSFDASVSEILGTLYAGATLVVPVRDQASWLGSVSRHLAAHGCD
LAMLSPSVYARLDEAARSRIRKVEFCGEALGEGEYDKAARYSRVFNAYGPTEATVCFSLAELTSYTPSIGTPVDGFRAYV
RDPDSGDHATAGTGELVIVGDGVALGYAGGSPAENEVFGTVDGSPAYATGDVVSLSDDGELTYLGRIDEQIKRLGHRVNL
AHVGSTLSRHLGREVALVRQDATILLVTAADGEATEESLMARIRDLVPVWEAPDRLVLVDALPLTSGGKVDRSALRELLA
SPAGAPHGGTDGEDAAELRRVLDVVTAVLGQEIGPETSIFDAGGSSLAMIQIQVKLSDAYGEEAVEAAFAAMDYDFAPAA
FLRHLRGEAVAPAESAVDTLLRRVETERDALRAELPLLRRDTRHEPVPGAADGDREVLLTGASGFIGGHVLDRLLAAGRP
VLVVSTGDPDGVLTGHATRFGRQAADYARVRAISYAELERWVDRRRGPVVDAVVHCGYQVNHLLPLDSHLSGSVRNTALV
VRAAAALGARSFAFLSAASAGADFLPLSAATLTAVGDPYSRSKLISEEYVNTLAVLGCAVSHYRPGLVYGHRPEDRHHLK
DDWFTALLETARRVGAMPRLSGHVPVCDVGTLADTLLGRPDANPGTADGASRTPDSRSAVVVHRTYALDELLRHTGLTEA
DVLAPAAWFERVRDGGEVPAPLLAAMQAALSGPGWPSAHREVDHDILGRLLGTPPDTPAGDRPERTGTTAEAQNGAAHAP
TPR
>P76290 2.1.3.-~~~cmoA~~~Carboxy-S-adenosyl-L-methionine synthase~~~COG2226
MSHRDTLFSAPIARLGDWTFDERVAEVFPDMIQRSVPGYSNIISMIGMLAERFVQPGTQVYDLGCSLGAATLSVRRNIHH
DNCKIIAIDNSPAMIERCRRHIDAYKAPTPVDVIEGDIRDIAIENASMVVLNFTLQFLEPSERQALLDKIYQGLNPGGAL
VLSEKFSFEDAKVGELLFNMHHDFKRANGYSELEISQKRSMLENVMLTDSVETHKARLHNAGFEHSELWFQCFNFGSLVA
LKAEDAA
>P43985 2.1.3.-~~~cmoA~~~Carboxy-S-adenosyl-L-methionine synthase~~~COG4106
MVKDTLFSTPIAKLGDFIFDENVAEVFPDMIQRSVPGYSNIITAIGMLAERFVTADSNVYDLGCSRGAATLSARRNINQP
NVKIIGIDNSQPMVERCRQHIAAYHSEIPVEILCNDIRHVEIKNASMVILNFTLQFLPPEDRIALLTKIYEGLNPNGVLV
LSEKFRFEDTKINHLLIDLHHQFKRANGYSELEVSQKRTALENVMRTDSIETHKVRLKNVGFSQVELWFQCFNFGSMIAV
K
>P76291 2.5.1.-~~~cmoB~~~tRNA U34 carboxymethyltransferase~~~COG0500
MIDFGNFYSLIAKNHLSHWLETLPAQIANWQREQQHGLFKQWSNAVEFLPEIKPYRLDLLHSVTAESEEPLSAGQIKRIE
TLMRNLMPWRKGPFSLYGVNIDTEWRSDWKWDRVLPHLSDLTGRTILDVGCGSGYHMWRMIGAGAHLAVGIDPTQLFLCQ
FEAVRKLLGNDQRAHLLPLGIEQLPALKAFDTVFSMGVLYHRRSPLEHLWQLKDQLVNEGELVLETLVIDGDENTVLVPG
DRYAQMRNVYFIPSALALKNWLKKCGFVDIRIADVSVTTTEEQRRTEWMVTESLADFLDPHDPGKTVEGYPAPKRAVLIA
RKP
>P44167 2.5.1.-~~~cmoB~~~tRNA U34 carboxymethyltransferase~~~COG0500
MIDFRPFYQQIATTNLSDWLETLPCQLKEWETQTHGDYAKWSKIVDFLPNLHADEIDLKSAVKSDRTSPLSEGEKQRIIH
HLKQLMPWRKGPYHLFGIHVDCEWRSDFKWDRVLPHLSPLQGRTILDVGCGSGYHMWRMVGEGAKMVVGIDPTELFLCQF
EAVRKLLNNDRRANLIPLGIEQMQPLAAFDTVFSMGVLYHRKSPLDHLSQLKNQLVKGGELVLETLVVDGDINTVLVPAD
RYAKMKNVYFIPSVATLINWLEKVGFTNVRCVDVATTTLEEQRKTDWLENESLIDFLDPNDHSKTIEGYQAPKRAVILAN
K
>Q87XG5 2.5.1.-~~~cmoB~~~tRNA U34 carboxymethyltransferase~~~COG0500
MIDLAPLVRRLAGTPLAEWANGLQAQLDTKMSKGHGDLQRWQSALDALPALQPEKVDLTDSFTLETECDGETRTVLRKAL
LGLSPWRKGPFNVFGVHIDTEWRSDWKWSRVSPHLDLKGKRVLDVGCGNGYYQWRMLGAGADSVIGVDPNWLFFCQFQAM
QRYLPDLPAWHLPFALEDLPANLEGFDTVFSMGVLYHRKSPIDHLLALKDCLVKGGELVMETLVIPGDVHQVLVPEDRYA
QMRNVWFLPSVPALELWMRRAGFTDVRCVDVSHTTVEEQRSTEWMRFQSLGDYLDPNDHSKTVEGLPAPMRAVIVGRKP
>Q8DAP0 2.5.1.-~~~cmoB~~~tRNA U34 carboxymethyltransferase~~~
MFNFANFYQLIAQDTRLQPWLNVLPQQLTDWQNAEHGDFPRWLKALNKIPEGAPDQIDIKHSVTISNDTPFHQGELKKLE
SLLRTFHPWRKGPYTVHGIHIDTEWRSDWKWDRVLPHISPLKNRSVLDVGCGNGYHMWRMLGEGARLCVGIDPSHLFLIQ
FEAIRKLMGGDQRAHLLPLGIEQLPKLEAFDTVFSMGVLYHRRSPLDHLIQLKDQLVSGGELILETLVIEGDETAVLVPK
ERYAQMRNVYFFPSARALKVWLELVGFEDVRIVDENVTSVDEQRTTNWMTHNSLPDYLDQNDPSKTVEGYPAPRRAILVA
KKP
>O34639 1.8.4.-~~~cmoI~~~N-acetyl-S-hydroxy-L-cysteine reductase~~~COG0695
MSDVVNIVVWSKKGCSYCEEVKNYLNEKGFPFQNIDVSEKEKLRDILQVKYGVRHVPVVEIGRGNQYQGITEIGIEHLDL
ALANHAQIKEAKR
>O34974 1.14.14.-~~~cmoJ~~~N-acetyl-S-alkylcysteine sulfoxide monooxygenase~~~COG2141
MTRADFIQFGAMIHGVGGTTDGWRHPDVDPSASTNIEFYMKKAQTAEKGLFSFIFIADGLFISEKSIPHFLNRFEPITIL
SALASVTKNIGLVGTFSTSFTEPFTISRQLMSLDHISGGRAGWNLVTSPQEGAARNHSKSNLPEHTERYEIAQEHLDVVR
GLWNSWEHDAFIHNKKTGQFFDQAKLHRLNHKGKYFQVEGPLNIGRSKQGEPVVFQAGSSETGRQFAAKNADAIFTHSNS
LEETKAFYADVKSRAADEGRDPSSVRIFPGISPIVADTEEEAEKKYREFAELIPIENAVTYLARFFDDYDLSVYPLDEPF
PDIGDVGKNAFQSTTDRIKREAKARNLTLREVAQEMAFPRTLFIGTPERVASLIETWFNAEAADGFIVGSDIPGTLDAFV
EKVIPILQERGLYRQDYRGGTLRENLGLGIPQHQSVLHSSHH
>Q8XDG3 2.1.1.-~~~cmoM~~~tRNA 5-carboxymethoxyuridine methyltransferase~~~COG2227
MQDRNFDDIAEKFSRNIYGTTKGQLRQAILWQDLDRVLAEMGPQKLRVLDAGGGEGQTAIKMAERGHQVILCDLSAQMID
RAKQAAEAKGVSDNMQFIHCAAQDVASHLETPVDLILFHAVLEWVADPRSVLQTLWSVLRPGGVLSLMFYNAHGLLMHNM
VAGNFDYVQAGMPKKKKRTLSPDYPRDPTQVYLWLEEAGWQIMGKTGVRVFHDYLREKHQQRDCYEALLELETRYCRQEP
YITLGRYIHVTARKPQSKDKV
>P36566 2.1.1.-~~~cmoM~~~tRNA 5-carboxymethoxyuridine methyltransferase~~~COG2227
MQDRNFDDIAEKFSRNIYGTTKGQLRQAILWQDLDRVLAEMGPQKLRVLDAGGGEGQTAIKMAERGHQVILCDLSAQMID
RAKQAAEAKGVSDNMQFIHCAAQDVASHLETPVDLILFHAVLEWVADPRSVLQTLWSVLRPGGVLSLMFYNAHGLLMHNM
VAGNFDYVQAGMPKKKKRTLSPDYPRDPAQVYLWLEEAGWQIMGKTGVRVFHDYLREKHQQRDCYEALLELETRYCRQEP
YITLGRYIHVTARKPQSKDKV
>O34846 1.14.14.-~~~cmoO~~~N-acetyl-S-alkylcysteine monooxygenase~~~COG2141
MIRLSILDQSLIGEGETAADTLQHTVKLAQMAEECGYHRFWVAEHHNNDEIAGSAPEVLLGYLAASTRKIRLASGGVMLQ
HYSSYKVAEQFHLLSALAPGRIDLGVGKAPGGFQLSTDALQAEYKKPVRQFDEKLEELTHFVRDDFPDTHRYAALRPRPQ
VDRKPGIFLLGGSTESAISAAKLGISFVFAYFINGEEEVLKEARRAFDAHLPPGSEAEFHLAPAVFAAHTKEEAEKHIVS
RESIKVVLKDGRKVNVGSREQAEAYLENVTEPYDIIVQKTGIIAGTKEEVAEELTRLSGTYKINDFVIFTPIKNAVEKQL
SYQLLSDAVLAAKR
>P14204 ~~~comA~~~Transcriptional regulatory protein ComA~~~COG2197
MKKILVIDDHPAVMEGTKTILETDSNLSVDCLSPEPSEQFIKQHDFSSYDLILMDLNLGGEVNGMELSKQILQENPHCKI
IVYTGYEVEDYFEEAIRAGLHGAISKTESKEKITQYIYHVLNGEILVDFAYFKQLMTQQKTKPAPSSQKEQDVLTPRECL
ILQEVEKGFTNQEIADALHLSKRSIEYSLTSIFNKLNVGSRTEAVLIAKSDGVL
>P39660 ~~~cmpA~~~Bicarbonate-binding protein CmpA~~~COG0715
MNEFQPVNRRQFLFTLGATAASAILLKGCGNPPSSSGGGTSSTTQPTAAGASDLEVKTIKLGYIPIFEAAPLIIGREKGF
FAKYGLDVEVSKQASWAAARDNVILGSAGGGIDGGQWQMPMPALLTEGAISNGQKVPMYVLACLSTQGNGIAVSNQLKAQ
NLGLKLAPNRDFILNYPQTSGRKFKASYTFPNANQDFWIRYWFAAGGIDPDKDIELLTVPSAETLQNMRNGTIDCFSTGD
PWPSRIAKDDIGYQAALTGQMWPYHPEEFLALRADWVDKHPKATLALLMGLMEAQQWCDQKANRAEMAKILSGRNFFNVP
VSILQPILEGQIKVGADGKDLNNFDAGPLFWKSPRGSVSYPYKGLTLWFLVESIRWGFNKQVLPDIAAAQKLNDRVTRED
LWQEAAKKLGVPAADIPTGSTRGTETFFDGITYNPDSPQAYLQSLKIKRA
>Q55460 ~~~cmpA~~~Bicarbonate-binding protein CmpA~~~COG0715
MGSFNRRKFLLTSAATATGALFLKGCAGNPPDPNAASTGTNPSPQAAGDISPEMMPETANIKLGYIPIVEAAPLIIAQEK
GFFAKYGMTGVEVSKQANWASARDNVTIGSQGGGIDGGQWQMPMPHLITEGIITNGNKVPMYVLAQLITQGNGIAVAPMH
EGKGVNLDITKAADYIKGFNKTNGRKFKAAHTFPNVNQDFWIRYWFAAGGVDPDTDIDLLAVPPAETVQGMRNGTMDAFS
TGDPWPYRIVTENIGYMAGLTAQIWPYHPEEYLAIRADWVDKNPKATKALLKGIMEAQQWIDDPKNRPEVVQIVSGRNYF
NVPTTILESPFKGQYTMGDGQPAIDDFQKGPLYWKDGIGNVSYPYKSHDLWFLTESIRWGFHKNAIPDLDTAQKIIDKVN
REDLWREAATEAGFTADIPSSTSRGVETFFDGITFDPANPSAYLQSLAIKKV
>Q55106 ~~~cmpB~~~Bicarbonate transport system permease protein CmpB~~~COG0600
MVTARETRRNGSRPSGLKKWRQKLDGILLPLAGILGFLIIWQIFSSSGATRLPGPLSLFTEERTRELLLYPFLDRGGLDK
GLFWQTIASLTRVAQGFSIAAIIGISVGILVGLNRQLNAMLDPLFQFLRMIAPLAWVPIALVAFQQNQPAAIFVIFITAV
WPILINTAEGVRQIPQDYNNVARVLRMSKSKYLMKVVLPAALPYIFTGLRIAIGLSWLAIIAAEIVMSGIVGIGFFIWDA
YQQNYVSDIILAVIYIGAVGLLLDRFVAWLQRWILRNM
>Q55461 ~~~cmpB~~~Bicarbonate transport system permease protein CmpB~~~COG0600
MTTTISQRKNRAAGANPLQKFWRKRRGDILPPIFGILGFLLLWQLISSAGLIKLPPPSSLWTDPRTRTLLMYPFYDQGGL
DKGLFWQTLASLGRVAQGYSLAAIVGISTGILVGTQPLLDKALDPIFQFLRMVAPLAWVPIALVALQQNQPAAIFVIFIT
SVWPILINTTEGVKQIPQDYINVRKVLRLSPQKFFFKILIPSALPYIFTGLRIAIGLAWLAIIAAEIVMSGIVGIGFFIW
DAYQQNYISEIILAVFYIGAVGLLLDRGIAYLQKLIAPGQ
>Q55107 7.6.2.-~~~cmpC~~~Bicarbonate transport ATP-binding protein CmpC~~~COG0715
MSLFVAVENIEKSFPLSGGNEYLALKGIDLEIKQGEFISLIGHSGCGKSTLLNLIAGLELPTDGAVSLEGQQITAPGPDR
MVVFQNYSLFPWLTVRENIALAVDEVLRDLPKEERQAIVEEHIQLVGLGHAADKPPAQLSGGMKQRVAIARGLATRPKLL
LLDEPFGALDALTRGNLQEKLMQICEENHVTAVMVTHDVDEAVLLSDRIVMLTNGPGSKIGGILEVDIPRPRKRMDVVHH
PSYYSLRSEIIYFLNQQKRVKKLNARKVTTVARHGLEKVNLEIGYVPLMACAPLVVAQEKAFFAKHGLDEVSLVRETSWR
GIVDGLTENYLDAAQMPAGMPVWMSVGGQGGSPLPIVSSLTMSRNGNGITLSKALYDEGIQTVDDFRNLLRSTADKQHIM
GIVHPASMHNLLLRYWLAANQIDPDRDVQLRTIPPAQMVADLKDGTIDGYCIGEPWNAWAAQKEIGFTIASDLEIWNGHP
GKVLGVREDWANRYPNSHVALVKALLEACQYCEDPANWDELRELLSDRRYLSCPKEYIQFSQSTADDLAVPHHRFAGAGV
NRPSRTEHLWMMTQLARWGDVPFPRNWVEILERVCRVGVFSTAARELGLSEVVNYQRSTPVELFDGVPFNAEDPIAYLNS
LPIHRDFSVAEIALDQPRPIAAA
>Q55462 7.6.2.-~~~cmpC~~~Bicarbonate transport ATP-binding protein CmpC~~~COG0715
MSLFVAVDNIDKVFSLTDGGEYIALKGINLEIKKGEFVSLIGHSGCGKSTLLNMIAGLDLPSEGIVTLEGQRIKQPGPDK
MVIFQNYSLLPWLTVKQNIALAVDEVMKGASAAERKAIVEEHINLVGLGHAVDKRPGELSGGMKQRVAIARALAIRPKLL
LLDEPFGALDALTRGNLQEQLMRICEQYQVTAVMVTHDVDEAVLLSDRIVMLTNGPGSNIGGILEVDIPRPRKRMEVVEH
PSYYSLRSEIIYFLNQQKRIKKLRAKKTTAIARHGLEKVNLELGYVPLVACAPLVVAQEKGFFAKHGLDEVSLVRETSWR
GIVDGIAGGYLDGAQMPAGMPTWLTAGGYREQSIPVVSALTMTRNGNAITLSKKLYDQGIYTAEDFRQLLLASDGDRHTL
GMVHPSSMHNLLLRYWLAAHNINPDRDVHLKTIPPAQMVADLKAGTIDGYCVSEPWNLRASMEGAGFSIATDLEIWQNHP
GKVLGVREDWAIAHPNTHVALVKALLEACAYCADPNHEMEIRELLATRQYLSTNIDYIHLGDPEGRTCRLGNPVEYSHHL
FFGDQFNRPSRTEHLWMMTQMARWGDIPFPRNWVEILERVCRVGVFSTAARELGYDVNYQRQPIALFDGKVFNADDPIAY
LNQTVIHRNFTIAEVHLDNPTPAPVFA
>P0A9J8 ~~~pheA~~~Bifunctional chorismate mutase/prephenate dehydratase~~~COG0077
MTSENPLLALREKISALDEKLLALLAERRELAVEVGKAKLLSHRPVRDIDRERDLLERLITLGKAHHLDAHYITRLFQLI
IEDSVLTQQALLQQHLNKINPHSARIAFLGPKGSYSHLAARQYAARHFEQFIESGCAKFADIFNQVETGQADYAVVPIEN
TSSGAINDVYDLLQHTSLSIVGEMTLTIDHCLLVSGTTDLSTINTVYSHPQPFQQCSKFLNRYPHWKIEYTESTSAAMEK
VAQAKSPHVAALGSEAGGTLYGLQVLERIEANQRQNFTRFVVLARKAINVSDQVPAKTTLLMATGQQAGALVEALLVLRN
HNLIMTRLESRPIHGNPWEEMFYLDIQANLESAEMQKALKELGEITRSMKVLGCYPSENVVPVDPT
>P27603 ~~~pheA~~~Bifunctional chorismate mutase/prephenate dehydratase~~~COG0077
MSEADQLKALRVRIDSLDERILDLISERARCAQEVARVKTASWPKAEEAVFYRPEREAWVLKHIMELNKGPLDNEEMARL
FREIMSSCLALEQPLRVAYLGPEGTFSQAAALKHFGHSVISKPMAAIDEVFREVVAGAVNFGVVPVENSTEGAVNHTLDS
FLEHDIVICGEVELRIHHHLLVGETTKTDRITRIYSHAQSLAQCRKWLDAHYPNVERVAVSSNADAAKRVKSEWNSAAIA
GDMAAQLYGLSKLAEKIEDRPVNSTRFLIIGSQEVPPTGDDKTSIIVSMRNKPGALHELLMPFHSNGIDLTRIETRPSRS
GKWTYVFFIDCMGHHQDPLIKNVLEKIGHEAVALKVLGSYPKAVL
>Q55108 7.6.2.-~~~cmpD~~~Bicarbonate transport ATP-binding protein CmpD~~~COG1116
MQTLRQQQPIVPSSPGHDPFLIVENVSKIYETPKGPYTVLDGVNLTVQEGEFICVIGHSGCGKSTLLNMVSGFNQPSHGS
VRLKGKEIDRPGPDRMVVFQNYALLPWMTAYENVYLAVDCVNPQMREGEKREIVREHLAMVGLTEAAEKKITQISGGMKQ
RVAIARALSIRPEVLILDEPFGALDAITKEELQEELLKIWNDHRCTVLMITHDIDEALFLADRLVMMTNGPAANIGEIMT
IPFPRPRDRERIMEDPQYYDLRNYALDFLYNRFAHDDE
>Q55463 7.6.2.-~~~cmpD~~~Bicarbonate transport ATP-binding protein CmpD~~~COG1116
MQTLKSQRAMPTATPSDQKTTAFLTIENVSKVYPTAKGPYTVLENVNLTVNEGEFICVIGHSGCGKSTLLNMVSGFASPT
DGSVQVGGKIITEPGPDRMVVFQNYALLPWLTALENVYIAVDAVHSQKTEAEKRAIAKDHLAMVGLTDSMDKKPGQISGG
MKQRVSIARALAIRPEVLILDEPFGALDAITKEELQEELLQIWNDHRCTVLMITHDIDEALFLADRLVMMTNGPHANIGE
IMTIPFSRPRDRDRIMEDPTYYQLRNYALDFLYNRFAHDDVA
>Q9F1R2 ~~~cmpR~~~HTH-type transcriptional activator CmpR~~~COG0583
MKNLTLHQLKVFEAAARHSSFTRAAEELYLTQPTVSIQVKQLTKAVGLPLFEQIGKRLYLTEAGRQLYKTTRQVFEQLEQ
LDMTIADLQGMKQGQLRLAVITTAKYFIPRLIGPFCQRYPGINVSLKVTNHEGLINRINDNLDDLYVLSRPPSGFDITVQ
PFLDNPLVVVGPASHPLANQRGISLERLAQEPFILRERGSGTREATEQLFAAHNLNLNVKLDLGSNEAIKQAILGGLGLA
VLSYHTLTSAGATPELKMFEVEGFPIHRQWHAVYPAGKQLSTVAATFLDYLLTESQRIAADIQIPESTTTDPELDAPQPV
VGV
>Q55459 ~~~cmpR~~~HTH-type transcriptional activator CmpR~~~COG0583
MKNATLHQFEVFAAIARTGSFTKAAEELFLTQPTVSQQMKQLTKAIGVPLYEQIGRKIYLTEAGQAVLDASKNITSCLDQ
LQEVIADLQGLKKGNLRLATITTGKYFVPRLLGEFRQQYPGISISLQIGNRQQILERLANNLDDLYFLGKPPSNLDINIR
HFLENPLVVIASRQHPLVKEKKISLERLVNEPLIMRESGSGTRMAVEEFFSENRLKMNVEMEISSNEAIKQAVYGGLGIS
ILSLYSLALEGINGPLAVLDVEGFPLQKHWYIIYQKSKQLSIVAQTFLDYLFAHDEAVSIAQIF
>Q53W05 ~~~cmr5~~~CRISPR system Cmr subunit Cmr5~~~
MRTRSQVWAQKAYEKVREAAKGEGRGEYRDMALKLPVLVRQAGLSQALAFVDSRGKEAHKALGNDLAQVLGYRDLRELAE
AAREAELLQYLRLTREVLAAAEWFKRFAQALIEE
>P9WMH5 ~~~cmr~~~HTH-type transcriptional regulator Cmr~~~COG0664
MADRSVRPLRHLVHAVTGGQPPSEAQVRQAAWIARCVGRGGSAPLHRDDVSALAETLQVKEFAPGAVVFHADQTADGVWI
VRHGLIELAVGSRRRRAVVNILHPGDVDGDIPLLLEMPMVYTGRALTQATCLFLDRQAFERLLATHPAIARRWLSSVAQR
VSTAQIRLMGMLGRPLPAQVAQLLLDEAIDARIELAQRTLAAMLGAQRPSINKILKEFERDRLITVGYAVIEITDQHGLR
ARAQ
>Q51973 1.18.1.3~~~cmtAa~~~p-cumate 2,3-dioxygenase system, ferredoxin--NAD(+) reductase component~~~
MGEDISKIVIIGAGQAGATVAFGLRRNGFAGEITLVGEESHLPYERPQLSKEMLRPEASAHKSIKTRADYEEQSILLELG
CKVVRADAQAHSIVLDDGRQLAFDRLVIATGVQPRRLSSAFQGAHRVHYLRTLEDAARLRADLEAGKSLAIVGGGVIGLE
VAAAARALNCPVTLIEAADRLMSRSVDEVVSAYLDRAHRRNGVDIRYGVAATELLDDGRLRLSDGGTVPAEAVLVGIGVT
PNIEGFEHLDITDATGVRVDAYSQTVVPGIFATGDIASQPNGGGFGRIETWANAQDHALNLVKNLMGEAVPYEAPVWFWS
DQGPINLQVVGDAANGRRIVRGDEHGDVFSVFRLDANQQVIGCATVNSPKDMAVARRWVKQRSSVDPQRLADPTIPLRDC
AV
>Q51974 1.14.12.25~~~cmtAb~~~p-cumate 2,3-dioxygenase system, large oxygenase component~~~
MNNDKNLVEIDDENLLFRVARESFVSEEVLAEEYEKIFDRCWLYVGHTSEFKKPGDFVTRTVARRNLLVTMGTDRTINAF
FNTCPHRGATVCRERSGNSKNFQCFYHGWVFGCDGNLKSQPGKERYCADFITGGAGNLVPVPRFDIYAGFCFVSFNAEVE
PLPDYLAGAKEYLELVSKYSESGMGITTGTQEYAIRANWKLLVENSIDGYHAVSTHASYLDYLKNINDGFSGAKLEGKST
DLGNGHAVIEFSAPWGRPIASWVPIWGEEGKQEIDQIYARLVELHGAEMADRMAYKNRNLLIFPNLIINDIMAITVRTFY
PQAPNYMHVNGWSLAPNEESDWARKYRLSNFLEFLGPGGFATPDDVEALESCQNGFSNYRLVPWSDISKGMGKETANYDD
ELQMRAFWTRWNQFIGGAPTPDSGVQYIPTIALA
>Q51975 ~~~cmtAc~~~p-cumate 2,3-dioxygenase system, small oxygenase component~~~
MSAVALEKEPLRIPSSQDFAAFVTRSEVEDLLYHEAELLDTWHLHDWLALFTEDCSYFVPSTDLPRTASADDSLFYIADD
AVRLRERVIRLMKKTAHAEYPRSRTRHLVSNIRILAANAEEIQVASAFVTYRMKLGNSDAYVGSTHYRLRRIDGQLRIVE
KRCFLDLEALRPHGRVSIIL
>Q51978 ~~~cmtAd~~~p-cumate 2,3-dioxygenase system, ferredoxin component~~~
MTNIIETVDLTDLVGLCATDDVAEGEILRVKLPSGHALAIYCVNGEFFATDDICSHGEASLSEDGSLDGYEVECSWHFGR
FDIRTGHACAMPCEHPLRSWPVTVEGGQIFVDVGAHPV
>P9WMI9 ~~~cmtR~~~HTH-type transcriptional regulator CmtR~~~COG0640
MLTCEMRESALARLGRALADPTRCRILVALLDGVCYPGQLAAHLGLTRSNVSNHLSCLRGCGLVVATYEGRQVRYALADS
HLARALGELVQVVLAVDTDQPCVAERAASGEAVEMTGS
>Q53654 ~~~cna~~~Collagen adhesin~~~
MNKNVLKFMVFIMLLNIITPLFNKNEAFAARDISSTNVTDLTVSPSKIEDGGKTTVKMTFDDKNGKIQNGDMIKVAWPTS
GTVKIEGYSKTVPLTVKGEQVGQAVITPDGATITFNDKVEKLSDVSGFAEFEVQGRNLTQTNTSDDKVATITSGNKSTNV
TVHKSEAGTSSVFYYKTGDMLPEDTTHVRWFLNINNEKSYVSKDITIKDQIQGGQQLDLSTLNINVTGTHSNYYSGQSAI
TDFEKAFPGSKITVDNTKNTIDVTIPQGYGSYNSFSINYKTKITNEQQKEFVNNSQAWYQEHGKEEVNGKSFNHTVHNIN
ANAGIEGTVKGELKVLKQDKDTKAPIANVKFKLSKKDGSVVKDNQKEIEIITDANGIANIKALPSGDYILKEIEAPRPYT
FDKDKEYPFTMKDTDNQGYFTTIENAKAIEKTKDVSAQKVWEGTQKVKPTIYFKLYKQDDNQNTTPVDKAEIKKLEDGTT
KVTWSNLPENDKNGKAIKYLVKEVNAQGEDTTPEGYTKKENGLVVTNTEKPIETTSISGEKVWDDKDNQDGKRPEKVSVN
LLANGEKVKTLDVTSETNWKYEFKDLPKYDEGKKIEYTVTEDHVKDYTTDINGTTITNKYTPGETSATVTKNWDDNNNQD
GKRPTEIKVELYQDGKATGKTAILNESNNWTHTWTGLDEKAKGQQVKYTVEELTKVKGYTTHVDNNDMGNLIVTNKYTPE
TTSISGEKVWDDKDNQDGKRPEKVSVNLLADGEKVKTLDVTSETNWKYEFKDLPKYDEGKKIEYTVTEDHVKDYTTDING
TTITNKYTPGETSATVTKNWDDNNNQDGKRPTEIKVELYQDGKATGKTAILNESNNWTHTWTGLDEKAKGQQVKYTVEEL
TKVKGYTTHVDNNDMGNLIVTNKYTPETTSISGEKVWDDKDNQDGKRPEKVSVNLLANGEKVKTLDVTSETNWKYEFKDL
PKYDEGKKIEYTVTEDHVKDYTTDINGTTITNKYTPGETSATVTKNWDDNNNQDGKRPTEIKVELYQDGKATGKTAILNE
SNNWTHTWTGLDEKAKGQQVKYTVDELTKVNGYTTHVDNNDMGNLIVTNKYTPKKPNKPIYPEKPKDKTPPTKPDHSNKV
KPTPPDKPSKVDKDDQPKDNKTKPENPLKELPKTGMKIITSWITWVFIGILGLYLILRKRFNS
>Q5XW77 1.7.1.16~~~cnbA~~~Chloronitrobenzene nitroreductase~~~
MPTSPFIDDLIRDRRTKRGFLDQPVPIEMVKDILSVAKYTPSSSNTQPWRCYVLTGEARERVTTAAVEAYRGAPEGLKPE
YSYFPEPLHEPYATRFNSFRGQLGDAEGCCRSDITGRRRYVERQFRFFDAPVGLIFTMDRRLEWASFICYGCFLQNIMLA
AKGRGLDTCPQGLWSLQHPVLRTELNLPDDQMVVAGMSLGWADNSMAVNQMSMSRVELEEFTTFVHE
>Q38M35 3.5.99.5~~~cnbH~~~2-amino-5-chloromuconic acid deaminase~~~
MNAAHLSLAEHAARLRRRELTAVALIDTCAQHHARMEPRLNAYKTWDGARARSAAAAVDTLLDQGQDLGPLMGLPVSVKD
LYGVPGLPVFAGSDEALPEAWQAAGPLVARLQRQLGIVVGKTHTVEFAFGGLGVNAHWGTPRNPWSPHEHRVPGGSSAGA
GVSLVQGSALLALGTDTAGSVRVPASMTGQVGLKTTVGRWPVEGIVPLSSSLDTAGVLTRTVEDLAYAFAALDTESQGLP
APAPVRVQGLRVGVPTNHFWDDIDPSIAAAVEAAVQRLAQAGAQVVRFPLPHCEEAFDIFRRGGLAASELAAYLDQHFPH
KVERLDPVVRDRVRWAEQVSSVEYLRRKAVLQRCGAGAARLFDDVDVLLTPTVPASPPRLADIGTVETYAPANMKAMRNT
AISNLFGWCALTMPVGLDANRMPVGLQLMGPPRAEARLIGIALGIEALIGQGHALLGAPDLP
>Q09KQ6 3.5.99.11~~~cnbZ~~~2-amino-5-chloromuconate deaminase~~~
MPDAVVFSPGGYRYIPAVFQYSAGIAAEPGFEIERVRFHRPVPLAEAFVAVESHLRAIGRPTTSFAQCELRSPDPFNDQG
FIDFNTEYVKTLERWGIYKDRVNPVARTNVCPMYDKPTTPSMFAFSYTVPTTSAAKRPSFQLAGGGDARGGSAPYKDRIV
AFGDTSPEGLREKVVFVIEEMESRLKTLGLGWADAVSTQLYTVQNIGHLVGPELARRGCGAGGLVWNYTRPPVIGLEYEM
DVRGAVRETVL
>A0A059WLZ7 1.14.15.23~~~cndA~~~Chloroacetanilide N-alkylformylase, oxygenase component~~~
MFLQNAWYAVAWCDEVTDGIVTRKVLGRELALFRDGEGQPRAILNRCPHRFAPLSLGKRIGDAIQCPYHGLHFGPDGRCV
HNPHGDGVVPDVATPTFPARERHKLIWAWMGDPALATDDIAGGEYGYLDDVELDLLPRGHLHLDCDYRLVIDNLMDPAHV
AVLHDSALASEALIRAVPRVWREEDVIRVESWAPDSKPSFLFGAWLGNHDDPVDHWVASRWQAAGLLSVEGGVVAVGGDR
EDGLRVRGAHMITPETETSAHYFWAVVRNFREDDAEQSEQIRATTAAIFTGEDKWMLEAIERSMDGEEFWSLRPAILGTD
RAAVMVRRALESEIKAEGRPKVVAVSAG
>X5CFH4 ~~~cndB1~~~Chloroacetanilide N-alkylformylase 1, ferredoxin component~~~
MPTIIVTTRDGEELSLEADTGLSLMEVIRDGGADELLALCGGCCSCATCHVKVDPAFLAALPPMSEDESDLLDSSDHRDA
TSRLSCQITVDDGLAGLRVAIAPED
>X5CWH9 ~~~cndB2~~~Chloroacetanilide N-alkylformylase 2, ferredoxin component~~~
MPKLVVVTREGEESVIEAETGLSVMEVIRDAGIDELLALCGGCCSCATCHVFVDPAFNGLLPDMSDDENDLLDSSDHRDD
RSRLSCQLTMTDELDGLTVTIAPED
>X5CY81 1.18.1.3~~~cndC1~~~Chloroacetanilide N-alkylformylase, ferredoxin reductase component~~~
MAQYDVLIVGAGHGGAQAAVALRQNKFEGTIAIVGDEPELPYERPPLSKEYFSGEKSFDRILIRPATFWAERNVDMLLGK
RVASVDPAGHSVTLTDGSTIGYGKLVWATGGAPRKLACSGHHLSGVHGVRTREDADRMLGEMERTTSVVVIGGGYIGLEA
AAVLSKAGKKVTVLEALDRVLARVAGEALSRFYEAEHRAHGVDVQLGAKVDCIVGDDQDRVTGVQMHDGSVIPADMVIVG
IGIIPAVEPLIAAGAAGGNGVDVDEYCRTSLPDIYAIGDCAMHANAFAEGARIRLESVQNANDQATTAAKHILGGTDAYH
AVPWFWSNQYDLRLQTMGLSIGYDETIVRGDPANRSFSVVYLKNGRVLALDCVNAVKDYVQGKALVTGGVSPDKASLANP
EIPLKTLLPA
>Q98GN8 ~~~~~~Cyclic nucleotide-gated potassium channel mll3241~~~COG0664
MSVLPFLRIYAPLNAVLAAPGLLAVAALTIPDMSGRSRLALAALLAVIWGAYLLQLAATLLKRRAGVVRDRTPKIAIDVL
AVLVPLAAFLLDGSPDWSLYCAVWLLKPLRDSTFFPVLGRVLANEARNLIGVTTLFGVVLFAVALAAYVIERDIQPEKFG
SIPQAMWWAVVTLSTTGYGDTIPQSFAGRVLAGAVMMSGIGIFGLWAGILATGFYQEVRRGDFVRNWQLVAAVPLFQKLG
PAVLVEIVRALRARTVPAGAVICRIGEPGDRMFFVVEGSVSVATPNPVELGPGAFFGEMALISGEPRSATVSAATTVSLL
SLHSADFQMLCSSSPEIAEIFRKTALERRGAAASA
>P77395 ~~~cnoX~~~Chaperedoxin~~~COG3118
MSVENIVNINESNLQQVLEQSMTTPVLFYFWSERSQHCLQLTPILESLAAQYNGQFILAKLDCDAEQMIAAQFGLRAIPT
VYLFQNGQPVDGFQGPQPEEAIRALLDKVLPREEELKAQQAMQLMQESNYTDALPLLKDAWQLSNQNGEIGLLLAETLIA
LNRSEDAEAVLKTIPLQDQDTRYQGLVAQIELLKQAADTPEIQQLQQQVAENPEDAALATQLALQLHQVGRNEEALELLF
GHLRKDLTAADGQTRKTFQEILAALGTGDALASKYRRQLYALLY
>P9WP65 3.1.4.16~~~~~~cAMP/cGMP dual specificity phosphodiesterase Rv0805~~~COG1409
MHRLRAAEHPRPDYVLLHISDTHLIGGDRRLYGAVDADDRLGELLEQLNQSGLRPDAIVFTGDLADKGEPAAYRKLRGLV
EPFAAQLGAELVWVMGNHDDRAELRKFLLDEAPSMAPLDRVCMIDGLRIIVLDTSVPGHHHGEIRASQLGWLAEELATPA
PDGTILALHHPPIPSVLDMAVTVELRDQAALGRVLRGTDVRAILAGHLHYSTNATFVGIPVSVASATCYTQDLTVAAGGT
RGRDGAQGCNLVHVYPDTVVHSVIPLGGGETVGTFVSPGQARRKIAESGIFIEPSRRDSLFKHPPMVLTSSAPRSPVD
>Q8YLG0 3.1.4.17~~~~~~3',5'-cyclic-nucleotide phosphodiesterase alr5338~~~COG1409
MNEKLPISIAQITDIHLLASESQRLQGISTTESFLAVMKRLEELRPELDLLLMTGDLSDDGTPESYENLQHYLNSLQIAT
YWLPGNHDCAIAMDKILNLGMVSRRKSFQRGNWNFILLNSSVTDCVYGYLSATTLDWLDSELKMLPNNPTLIALHHPPLS
VNSAWIDRSCLQNSQELFAVIDRYPQVKLVLFGHIHQEFRRQRHNVHYLGSPSTCYQFQSQSTTFAINQELPGFRLLKLY
ADGTWTTKIERVPYSLPIEPTVTVSY
>D4P095 3.1.4.53~~~cpdA~~~3',5'-cyclic adenosine monophosphate phosphodiesterase CpdA~~~COG1409
MSRHSNTPATDASVLLVQLSDSHLFAEDGARLLGMDTAHSLEKVVERVAREQPRIDLILATGDVSQDGSLDSYTRFRRLS
APLAAPLRWFAGNHDEREPMQRATEGSDLLEQIVDVGNWRVVLLDSSIPGAVPGYLEDDQLDLLRRAIDSAGERFLLVSF
HHHPVPIGSDWMDPIGLRNPQALFDLLAPYPQLRCLLWGHIHQEFDRQRGPLRLLASPSTCVQFAPGSSDFTLDRLAPGY
RWLRLHDDGRLETGISRVDDVVFEVDYDTAGY
>O25683 3.1.4.17~~~~~~3',5'-cyclic-nucleotide phosphodiesterase~~~COG2404
MMQVYHLSHIDLDGYACQLVSKQFFKNIQCYNANYGREVSARIYEILNAIAQSKESEFLILVSDLNLNLNEAEYLQDKIQ
EHRLQNKNIQIQLLDHHISGKEVAESFHWYFLDINRCATKIVYEFLKKHYAILEPKNTTWLEPLVEMVNSVDIWDTQGYG
FELGKVCMRMINQSSELNRFMFDDENRNYKLKLLEEVKNYLFLENAPVAYDNDLFKLKKIALGGDPDAETMDNISSNAQT
HLLSLKKHDCSVYYQDKKGFLSYSMGGISVLANLFLTQNPDFDFYMDVNAKGNVSLRANGNCDVCELSQMCFNGGGHRNA
SGGKIDGFRESFNYRDIKEQIEEIFNNA
>P37972 ~~~cnrA~~~Nickel and cobalt resistance protein CnrA~~~
MIESILSGSVRYRWLVLFLTAVVAVIGAWQLNLLPIDVTPDITNKQVQINSVVPTMSPVEVEKRVTYPIETAIAGLNGVE
STRSMSRNGFSQVTVIFKESANLYFMRQQVSERLAQARPNLPENVEPQMGPVSTGLGEVFHYSVEYQYPDGTGASIKDGE
PGWQSDGSFLTERGERLDDRVSRLAYLRTVQDWIIRPQLRTTPGVADVDSLGGYVKQFVVEPDTGKMAAYGVSYADLARA
LEDTNLSVGANFIRRSGESYLVRADARIKSADEISRAVIAQRQNVPITVGQVARVKIGGELRSGAASRNGNETVVGSALM
LVGANSRTVAQAVGDKLEQISKTLPPGVVIVPTLNRSQLVIATIETVAKNLIEGALLVVAILFALLGNWRAATIAALVIP
LSLLVSAIGMNQFHISGNLMSLGALDFGLIIDGAVIIVENSLRRLAERQHREGRLLTLDERLQEVVQSSREMVRPTVYGQ
LVIFMVFLPCLTFQGVEGKMFSPMVITLMLALASAFVLSLTFVPAMVAVMLRKKVAETEVRVIVATKESYRPWLEHAVAR
PMPFIGAGIATVAVATVAFTFVGREFMPTLDELNLNLSSVRIPSTSIDQSVAIDLPLERAVLSLPEVQTVYSKAGTASLA
ADPMPPNASDNYIILKPKSEWPEGVTTKEQVIERIREKTAPMVGNNYDVTQPIEMRFNELIGGVRSDVAVKVYGENLDEL
AATAQRIAAVLKKTPGATDVRVPLTSGFPTFDIVFDRAAIARYGLTVKEVADTISTAMAGRPAGQIFDGDRRFDIVIRLP
GEQRENLDVLGALPVMLPLSEGQARASVPLRQLVQFRFTQGLNEVSRDNGKRRVYVEANVGGRDLGSFVDDAAARIAKEV
KLPPGMYIEWGGQFQNLQAATKRLAIIVPLCFILIAATLYMAIGSAALTATVLTAVPLALAGGVFALLLRGIPFSISAAV
GFIAVSGVAVLNGLVLISAIRKRLDDGMAPDAAVIEGAMERVRPVLMTALVASLGFVPMAIATGTGAEVQKPLATVVIGG
LVTATVLTLFVLPALCGIVLKRRTAGRPEAQAALEA
>P37973 ~~~cnrB~~~Nickel and cobalt resistance protein CnrB~~~
MMKNERRSVNWPMIAGVAAVAAAVGFGAAHLPVSEKSPASTQAPEAQKPQSAPVKPGLKEVKIPATYLAAANIAVEPVAS
AAVGTEILAPATVAALPGSEAVIVSRAAGAVQRVQRRLGDVVKAGDVLALVDSPEAAGMAAERKVAQAKADLARKTYERE
ASLFQQGVTPRQEMEAAKAALDVAQAEALRAATVAQSAHLASDGRSVAVVSPIAGKITAQSVTLGAFVAPQAELFRVAGT
GAVQVEAAVTAADTSRIVAGSEATILLANGSPLSARVQAVTPTVTGSARVATVVVVPAQPTDRLVVGEGVQVRLRTAVAD
AAALSVPEDAVQNLDGRDVLFVRTQEGFRPMPVLVGTRSGGSAQILSGVQAGEQVATRNAFLVKAEMNKGGGDEE
>P37974 ~~~cnrC~~~Nickel and cobalt resistance protein CnrC~~~
MKQVISSFLCRPRFVGSAIWLLPVALSHAAEAPPFPNLLQQSLALAPAMVAQAANVRAAGADAAQAQAWLNPRIDTVLEN
LGAPSSDGLSQRQNTYSITQPFELGGKRGARIEVGERNFAAAQARERQAQVAYAAELAVAYATAEAALGRKILATENLAR
ANEELAAARALVDSGKEASLRSAQAKASVAAAQAAEAAATNDATQALARLSAMSGASEPYTAVTSSLLTTQAVVPNAPAA
LAESPSVRAAEAERNALDAQVDVERKRWIPDVGVSAGVRRYGWTNSSGYVVGVTASIPLFDQNRNGINAAVERVAAAQAR
LDSVRLEANVARQSAISQVATADKQLAAASEGEQAAAEAYRMGRIGYESGKTPLMELLAVRRALVDARQLTIDARLARVR
ALAALAQADGRLAFEESR
>P37978 ~~~cnrH~~~RNA polymerase sigma factor CnrH~~~
MNPEDADRILAAQAASGNQRAFGQLVARHGVALAQAARSFGIPETDVDDVVQDTFVAAWHALDDFDPDRPFRAWLFRIGL
NKMRDLYRFRRVRQFLFGAENLGDLELAGGVANDEPGPEQQVAARLELARVASTLGKLDTGSREVIVLTAIVGMSQPEAA
AVLGLSVKAVEGRIGRARAKLSALLDADSEK
>P37975 ~~~cnrR~~~Nickel and cobalt resistance protein CnrR~~~
MMKSRTRRLSLSTLFGALLGVSVAAAWLYYSHRNEAGHGDLHEILHEAVPLDANEREILELKEDAFAQRRREIETRLRAA
NGKLADAIAKNPAWSPEVEAATQEVERAAGDLQRATLVHVFEMRAGLKPEHRPAYDRVLIDALRRGSQ
>P56621 ~~~cnrY~~~Nickel and cobalt resistance protein CnrY~~~
MADVEEWLTHARKVTQEASIGVDVTSIQECISAEPAQRVLVARRDAWRAICCAAFAALVAFAAINRVATIMLEKPAPTWV
ATPSAASPFGLLIGK
>B5GMG2 4.2.3.108~~~cnsA~~~1,8-cineole synthase~~~COG0664
MPAGHEEFDIPFPSRVNPFHARAEDRHVAWMRAMGLITGDAAEATYRRWSPAKVGARWFYLAQGEDLDLGCDIFGWFFAY
DDHFDGPTGTDPRQTAAFVNRTVAMLDPRADPTGEHPLNIAFHDLWQRESAPMSPLWQRRAVDHWTQYLTAHITEATNRT
RHTSPTIADYLELRHRTGFMPPLLDLIERVWRAEIPAPVYTTPEVQTLLHTTNQNINIVNDVLSLEKEEAHGDPHNLVLV
IQHERQSTRQQALATARRMIDEWTDTFIRTEPRLPALCGRLGIPLADRTSLYTAVEGMRAAIRGNYDWCAETNRYAVHRP
TGTGRATTPW
>D0C9N6 1.14.13.239~~~cntA~~~Carnitine monooxygenase oxygenase subunit~~~
MSAVEKLPEDFCANPDVAWTFPKVFYTSSQVFEHEKEAIFAKSWICVAHSSELAQPNDYITRKVIGENIVIIRGKDSVLR
AFYNVCPHRGHELLSGSGKAKNVITCPYHAWTFKLDGSLALARNCDHVESFDKENSSMVPLKVEEYAGFLFINMDENATC
VEDQLPGFAERLNQACGVIKDLKLAARFVTETPANWKVIVDNYMECYHCGPAHPGFADSVQVDKYWHTTHQNWTLQYGFA
RSSEKSFKLDPSVTDPEFHGFWTWPCTMFNVPPGSNFMTVIYEFPVDAETTLQHYDIYFTNEELTQDQKDLIEWYRNVFR
PEDLNLVESVQRGLKSRGYRGQGRIMTDKQRSGISEHGIAYFQHLVAQYHQ
>F0KFI5 1.14.13.239~~~yeaW~~~Carnitine monooxygenase oxygenase subunit~~~COG4638
MSAVEKLPEDFCANPDVAWTFPKVFYTSSQVFEHEKEAIFAKSWICVAHGSELAQPNDYITRKVIGENIVIIRGKDSVLR
AFYNVCPHRGHELLSGSGKAKNVITCPYHAWTFKLDGSLALARNCDHVESFDKENSSMVPLKVEEYAGFVFINMDENATC
VEDQLPEFAERLNQACSVIKDLKLAARFVTETPANWKVIVDNYLECYHCGPAHPGFADSVQVDKYWHTTHQNWTLQYGFA
RSSEKSFKLDPSVTDPEFHGFWTWPCTMFNVPPGSNFMTVIYEFPVDAETTLQHYDIYFTNEELTQDQKDLIEWYRNVFR
PEDLNLVESVQRGLKSRGYRGQGRIMTDKQRSGISEHGIAYFQHLVAQHHK
>P0ABR7 1.14.13.239~~~yeaW~~~Carnitine monooxygenase oxygenase subunit~~~COG4638
MSNLSPDFVLPENFCANPQEAWTIPARFYTDQNAFEHEKENVFAKSWICVAHSSELANANDYVTREIIGESIVLVRGRDK
VLRAFYNVCPHRGHQLLSGEGKAKNVITCPYHAWAFKLDGNLAHARNCENVANFDSDKAQLVPVRLEEYAGFVFINMDPN
ATSVEDQLPGLGAKVLEACPEVHDLKLAARFTTRTPANWKNIVDNYLECYHCGPAHPGFSDSVQVDRYWHTMHGNWTLQY
GFAKPSEQSFKFEEGTDAAFHGFWLWPCTMLNVTPIKGMMTVIYEFPVDSETTLQNYDIYFTNEELTDEQKSLIEWYRDV
FRPEDLRLVESVQKGLKSRGYRGQGRIMADSSGSGISEHGIAHFHNLLAQVFKD
>Q2FVE7 ~~~cntA~~~Metal-staphylopine-binding protein CntA~~~COG0747
MRKLTKMSAMLLASGLILTGCGGNKGLEEKKENKQLTYTTVKDIGDMNPHVYGGSMSAESMIYEPLVRNTKDGIKPLLAK
KWDVSEDGKTYTFHLRDDVKFHDGTPFDADAVKKNIDAVQENKKLHSWLKISTLIDNVKVKDKYTVELNLKEAYQPALAE
LAMPRPYVFVSPKDFKNGTTKDGVKKFDGTGPFKLGEHKKDESADFNKNDQYWGEKSKLNKVQAKVMPAGETAFLSMKKG
ETNFAFTDDRGTDSLDKDSLKQLKDTGDYQVKRSQPMNTKMLVVNSGKKDNAVSDKTVRQAIGHMVNRDKIAKEILDGQE
KPATQLFAKNVTDINFDMPTRKYDLKKAESLLDEAGWKKGKDSDVRQKDGKNLEMAMYYDKGSSSQKEQAEYLQAEFKKM
GIKLNINGETSDKIAERRTSGDYDLMFNQTWGLLYDPQSTIAAFKEKNGYESATSGIENKDKIYNSIDDAFKIQNGKERS
DAYKNILKQIDDEGIFIPISHGSMTVVAPKDLEKVSFTQSQYELPFNEMQYK
>A0A0H3JTL0 ~~~cntA~~~Metal-staphylopine-binding protein CntA~~~
MRKLTKMSAMLLASGLILTGCGGNKGLEEKKENKQLTYTTVKDIGDMNPHVYGGSMSAESMIYEPLVRNTKDGIKPLLAK
KWDVSEDGKTYTFHLRDDVKFHDGTTFDADAVKKNIDAVQQNKKLHSWLKISTLIDNVKVKDKYTVELNLKEAYQPALAE
LAMPRPYVFVSPKDFKNGTTKDGVKKFDGTGPFKLGEHKKDESADFNKNDQYWGEKSKLNKVQAKVMPAGETAFLSMKKG
ETNFAFTDDRGTDSLDKDSLKQLKDTGDYQVKRSQPMNTKMLVVNSGKKDNAVSDKTVRQAIGHMVNRDKIAKEILDGQE
KPATQLFAKNVTDINFDMPTRKYDLKKAESLLDEAGWKKGKDSDVRQKDGKNLEMAMYYDKGSSSQKEQAEYLQAEFKKM
GIKLNINGETSDKIAERRTSGDYDLMFNQTWGLLYDPQSTIAAFKAKNGYESATSGIENKDKIYNSIDDAFKIQNGKERS
DAYKNILKQIDDEGIFIPISHGSMTVVAPKDLEKVSFTQSQYELPFNEMQYK
>D0C9N8 1.14.13.239~~~cntB~~~Carnitine monooxygenase reductase subunit~~~
MASHYEMFPAVVTRVEQLTPLIKRFTFKRQDGQNFPRFSGGSHIIVKMNEQLSNAYSLMSCTQDLSTYQVCVRKDVEGKG
GSVFMHDQCNEGCEIQISEPKNLFPLAETGNKHILIAGGIGITPFLPQMDELAARGAEYELHYAYRSPEHAALLDELTQK
HAGHVFSYVDSEGSMLNLDELISSQPKGTHVYVCGPKPMIDAVIDCCNKHRYRDEYIHWEQFASTVPEDGEAFTVVLAKS
NQEIEVQSNQTILQAIETLNIDVECLCREGVCGTCETAILEGEAEHFDQYLSDAEKASQKSMMICVSRAKGKKLVLDL
>F0KFI7 1.14.13.239~~~yeaX~~~Carnitine monooxygenase reductase subunit~~~COG1018
MEQLTPLIKRFTFKRQDGQNFPRFSGGSHIIVKMNEQISNAYSLMSCTQDLSTYQVCVRKDVEGKGGSVFMHDQCNEGCE
IQISEPKNLFPLAETGNKHILIAGGIGITPFLPQMDELAARGADFELHYAYRSPEHAALLDELKQKHAKHVFSYVDSEGC
SLKLDELISSQPKGTHVYVCGPKPMIDAVIDCCNKHRYRDEYIHWEQFASTVPEDGEAFTVVLAKSNQEIEVQSNQTILQ
AIETLNIDVECLCREGVCGTCETAILEGEADHFDQYLSDAEKASQKSMMICVSRAKGKKLVLDL
>P76254 1.14.13.239~~~yeaX~~~Carnitine monooxygenase reductase subunit~~~COG1018
MSDYQMFEVQVSQVEPLTEQVKRFTLVATDGKPLPAFTGGSHVIVQMSDGDNQYSNAYSLLSSPHDTSCYQIAVRLEENS
RGGSRFLHQQVKVGDRLTISTPNNLFALIPSARKHLFIAGGIGITPFLSHMAELQHSDVDWQLHYCSRNPESCAFRDELV
QHPQAEKVHLHHSSTGTRLELARLLADIEPGTHVYTCGPEALIEAVRSEAARLDIAADTLHFEQFAIEDKTGDAFTLVLA
RSGKEFVVPEEMTILQVIENNKAAKVECLCREGVCGTCETAILEGEADHRDQYFSDEERASQQSMLICCSRAKGKRLVLD
L
>Q2FVE8 ~~~cntB~~~Metal-staphylopine import system permease protein CntB~~~COG0601
MFKFILKRIALMFPLMIVVSFMTFLLTYITNENPAVTILHAQGTPNVTPELIAETNEKYGFNDPLLIQYKNWLLEAMQFN
FGTSYITGDPVAERIGPAFMNTLKLTIISSVMVMITSIILGVVSALKRGKFTDRAIRSVAFFLTALPSYWIASILIIYVS
VKLNILPTSGLTGPESYILPVIVITIAYAGIYFRNVRRSMVEQLNEDYVLYLRASGVKSITLMLHVLRNALQVAVSIFCM
SIPMIMGGLVVIEYIFAWPGLGQLSLKAILEHDFPVIQAYVLIVAVLFIVFNTLADIINALLNPRLREGAR
>A0A0H3K104 ~~~cntB~~~Metal-staphylopine import system permease protein CntB~~~
MFKFILKRIALMFPLVIVVSFMTFLLTYITNENPAVTILHAQGTPNVTPELIAETNEKYGFNDPLLIQYKNWLLEAMQFN
FGTSYITGDPVAERIGPAFMNTLKLTIISSVMVMITSIILGVVSALKRGKFTDRAIRSVAFFLTALPSYWIASILIIYVS
VKLNILPTSGLTGPESYILPVIVITIAYAGIYFRNVRRSMVEQLNEDYVLYLRASGVKSITLMLHVLRNAIQVAVSIFCM
SIPMIMGGLVVIEYIFAWPGLGQLSLKAILEHDFPVIQAYVLIVAVLFIVFNTLADIINALLNPRLREGAR
>Q2FVE9 ~~~cntC~~~Metal-staphylopine import system permease protein CntC~~~COG1173
MIILKRLLQDKGAVIALGIIVLYVFLGLAAPLVTFYDPNHIDTANKFAGMSFQHLLGTDHLGRDILTRLIYAIRPSLLYV
FVALFVSVLIGSILGFLSGYFQGFVDALIMRACDVMLAFPSYVVTLALIALFGMGAENIIMAFILTRWAWFCRVIRTSVM
QYTASDHVRFAKTIGMNDMKIIHKHIMPLTLADIAIISSSSMCSMILQISGFSFLGLGVKAPTAEWGMMLNEARKVMFTH
PEMMFAPGIAIVIIVMAFNFLSDALQIAIDPRISSKDKLRSVKKGVVQS
>A0A0H3JU73 ~~~cntC~~~Metal-staphylopine import system permease protein CntC~~~
MIILKRLLQDKGAVIALGIIVLYVFLGLAAPLVTFYDPNHIDTANKFAGISFQHLLGTDHLGRDILTRLIYAIRPSLLYV
FVALFVSVLIGSILGFLSGYFQGFVDALIMRACDVMLAFPSYVVTLALIALFGMGAENIIMAFILTRWAWFCRVIRTSVM
QYTASDHVRFAKTIGMNDMKIIHKHIMPLTLADIAIISSSSMCSMILQISGFSFLGLGVKAPTAEWGMMLNEARKVMFTH
PEMMFAPGIAIVIIVMAFNFLSDALQIAIDPRISSKDKLRSVKKGVVQS
>Q2FVF0 7.2.2.-~~~cntD~~~Metal-staphylopine import system ATP-binding protein CntD~~~COG0444
MTLLTVKHLTITDTWTDQPLVSDVNFTLTKGETLGVIGESGSGKSITCKSIIGLNPERLGVTGEIIFDGTSMLSLSESQL
KKYRGKDIAMVMQQGSRAFDPSTTVGKQMFETMKVHTSMSTQEIEKTLIEYMDYLSLKDPKRILKSYPYMLSGGMLQRLM
IALALALKPKLIIADEPTTALDTITQYDVLEAFIDIKKHFDCAMIFISHDLTVINKIADRVVVMKNGQLIEQGTRESVLH
HPEHVYTKYLLSTKKKINDHFKHVMRGDVHD
>A0A0H3JXA3 7.2.2.-~~~cntD~~~Metal-staphylopine import system ATP-binding protein CntD~~~
MTLLTVKHLTITDTWTDQPLVSDVNFTLTKGETLGVIGESGSGKSITCKSIIGLNPERLGVTGEIIFDGTSMLSLSESQR
KKYRGKDIAMVMQQGSRAFDPSTTVGKQMFETMKVHTSMSTQEIEKTLIEYMDYLSLKDPKRILKSYPYMLSGGMLQRLM
IALALALKPKLIIADEPTTALDTITQYDVLEAFIDIKKHFDCAMIFISHDLTVINKIADRVVVMKNGQLIEHGTRESVLH
HPEHVYTKYLLSTKKKINDHFKHVMRGDVHD
>A0A0H3JTK0 ~~~cntE~~~Staphylopine export protein~~~
MKGAMAWPFLRLYILTLMFFSANAILNVFIPLRGHDLGATNTVIGIVMGAYMLTAMVFRPWAGQIIARVGPIKVLRIILI
INAIALIIYGFTGLEGYFVARVMQGVCTAFFSMSLQLGIIDALPEEHRSEGVSLYSLFSTIPNLIGPLVAVGIWNANNIS
LFAIVIIFIALTTTFFGYRVTFAEQEPDTSDKIEKMPFNAVTVFAQFFKNKELLNSGIIMIVASIVFGAVSTFVPLYTVS
LGFANAGIFLTIQAIAVVAARFYLRKYIPSDGMWHPKYMVSVLSLLVIASFVVAFGPQVGAIIFYGSAILIGMTQAMVYP
TLTSYLSFVLPKVGRNMLLGLFIACADLGISLGGALMGPISDLVGFKWMYLICGMLVIVIMIMSFLKKPTPRPASSL
>Q2FVF1 7.2.2.-~~~cntF~~~Metal-staphylopine import system ATP-binding protein CntF~~~COG1124
MIKIKDVEKSYQSAHVFKRRRTPIVKGVSFECPIGATIAIIGESGSGKSTLSRMILGIEKPDKGCVTLNDQPMHKKKVRR
HQIGAVFQDYTSSLHPFQTVREILFEVMCQCDGQPKEVMEVQAITLLEEVGLSKAYMDKYPNMLSGGEAQRVAIARAICI
NPKYILFDEAISSLDMSIQTQILDLLIHLRETRQLSYIFITHDIQAATYLCDQLIIFKNGKIEEQIPTSALHKSDNAYTR
ELIEKQLSF
>A0A0H3JT74 7.2.2.-~~~cntF~~~Metal-staphylopine import system ATP-binding protein CntF~~~
MIKVTDVEKSYQSAHVFKRRRTPIVKGVSFECPIGATIAIIGESGSGKSTLSRMILGIEKPDKGCVTLNDLPMHKKKVRR
HQIGAVFQDYTSSLHPFQTVREILFEVMCQCDGQPKEVMEVQAITLLEEVGLSKAYMDKYPNMLSGGEAQRVAIARAICI
NPKYILFDEAISSLDMSIQTQILDLLIHLRETRQLSYIFITHDIQAATYLCDQLIIFKNGKIEEQIPTSALHKSDNAYTR
ELIEKQLSF
>A0A0H2ZHZ4 ~~~cntI~~~Pseudopaline exporter CntI~~~
MVLDLLKSGVLLAVLASFTFSVMNALVKEASATLPAAEIVFFRSAIGTLLIYLLMRQAGVALSRQGVPMLLVRGVMGALY
LVCYFYAIAHIPLADASILAHMSPFFVILFSALFLGERIPRAVYWLLLVVVLGALMIVKPFSYSSYSVYAVVGLLSAVFA
AGASVAIRQLSARHHTYEIVFYFLAVATLVAIPLMWSDFVVPATLREWGLLLAIGVVSLLGQVFLTRAFSHESATIVAVT
RYIGIVFNAGWGWLFWSEVPDALTIAGGVLIVVACIALSRTKKG
>Q9HUX6 ~~~cntI~~~Pseudopaline exporter CntI~~~
MVLDLLKSGVLLAVLASFTFSVMNALVKEASATLPAAEIVFFRSAIGTLLIYLLMRQAGVALSRQGVPMLLVRGVMGALY
LVCYFYAIAHIPLADASILAHMSPFFVILFSALFLGERIPRAVYWLLLVVVLGALMIVKPFSYSSYSVYAVVGLLSAVFA
AGASVAIRQLSARHHTYEIVFYFLAVATLVAIPLMWNDFVVPATLREWGLLLAIGVVSLLGQVFLTRAFSHESATIVAVT
RYIGIVFNAGWGWLFWSEVPDALTIAGGVLIVVACIALSRTKKG
>A0A0H3JU78 5.1.1.24~~~cntK~~~Histidine racemase~~~
MNRQVIEFSKYNPSGNMTILVHSKHDASEYASIANQLMAATHVCCEQVGFIESTQNDDGNDFHLVMSGNEFCGNATMSYI
HHLQESHLLKDQQFKVKVSGCSDLVQCAIHDCQYYEVQMPQAHRVVPTTINMGNHSWKALEIIYETYVHYVIPVKQVTTE
IQHLVEAFVREQQWSHKYKTVGMMLFDEQRQFLQPLIYIPEIQSLIWENSCGSGTASIGVFNNYQRNDACKDFTVHQPGG
SILVTSKRCHQLGYQTSIKGQVTTVATGKAYIE
>A0A0H2ZI93 ~~~cntO~~~Metal-pseudopaline receptor CntO~~~
MRVSVSLVLGVGLGCSSPALWAETESPAELEVLTVTAEAERAEGPVQGYRANRSASATRTDTRIEDIPQAISVVPRQVLD
DLDSARIERALDFAGGVSRQNNFGGLTMFEYNVRGFTTSEFYRDGFSANRGYMNAPDSATIERVEILKGPASSLYGRGDP
GGTVNLVTKKPQAERFARLHASAGSWDRYRSTLDLNTPLDEEGDLLYRMNLAVEDSKGFRDYADGQRLLVAPSFSWQLDP
DTSLLVEAEVVRNRQVFDRGTVAPHNHLGSLPRSRFFGEPDDGKIDNNNETLQATLRHHLNEQWSLRLASHYKHGHLDGY
ASENSSLAADGYTLRREYRYRDFEWHDSITQLDLLGDLHTGSIRHQLLMGLEYERYHNDELILRSIPSRNPYAIDIRRPV
YGQPKPPFGRDDRNHEEVDAMALNLQDQIEFNEKWRGLLGVRFDRYRQDMNATRLNNGRFRDTSSQQTQRAATPRIGVLY
QATPEVGLFANASKSFKPNGGTDMAGKAFDPEEGRGYEAGVKLDLLDGRLGMTLAAFHLKKKNVLTADPSNPGYQQTAGE
ARSQGFDLQFSGQLTEQLRLIGAYAYIDAEVTKDENIARGSRLLNVPKHSGSLMGVYEFREGWLHGADAGAAVNYVGERA
GDSSDSGFELPAYTTVDLLAHYPLASNATLGVNVNNLFDRRYYERSYNNVWVAPGEPRNLTMSLTLNY
>Q9HUX3 ~~~cntO~~~Metal-pseudopaline receptor CntO~~~
MRVSVSLVLGVGLGCSSPALWAETESPAELEVLTVTAEAERAEGPVQGYRANRTASATRTDTRIEDIPQAISVVPRQVLD
DLDSARIERALDFAGGVSRQNNFGGLTMFEYNVRGFTTSEFYRDGFSANRGYMNAPDSATIERVEILKGPASSLYGRGDP
GGTVNLVTKKPQAERFARLHASAGSWDRYRSTLDLNTPLDEEGDLLYRMNLAVEDSKGFRDYADGQRLLVAPSISWQLDP
DTSLLVEAEVVRNRQVFDRGTVAPHNHLGSLPRSRFFGEPDDGKIDNNNETLQATLRHHFNEQWSLRLASHYKHGHLDGY
ASENSSLAADGYSLRREYRYRDFEWHDSITQLDLLGDLHTGSIRHQLLMGLEYERYHNDELILRSIPSRNPYAIDIRRPV
YGQPKPPFGRDDRNHEEVDAMALNLQDQIEFSEKWRGLLGVRFDRYRQDMNATRLNNGRFRETSSQQTQRAATPRIGVLY
QATPEVGLFANASKSFKPNGGTDMAGKAFDPEEGRGYEAGVKLDLLDGRLGMTLAAFHLKKKNVLTADPSNPGYQQTAGE
ARSQGFDLQFSGQLTEQLRLIGAYAYIDAEVTKDENIARGSRLLNVPKHSGSLMGVYEFREGWLHGADAGAAVNYVGERA
GDSSDSGFELPAYTTVDLLARYPLASNATLGVNVNNLFDRRYYERSYNNVWVAPGEPRNLTMSLTLNY
>P64467 ~~~cnu~~~OriC-binding nucleoid-associated protein~~~
MTVQDYLLKFRKISSLESLEKLYDHLNYTLTDDQELINMYRAADHRRAELVSGGRLFDLGQVPKSVWHYVQ
>Q83EV9 2.7.1.33~~~coaA~~~Pantothenate kinase~~~COG1072
MTVKPELNEITPYLQFNRQQWGNFRKDTPLTLTESDLDKLQGQIEIVSLKEVTEIYLPLSRLLSFYVTARQTLQQATYQF
LGKPEPKVPYIIGIAGSVAVGKSTTSRVLKALLSRWPDHPNVEVITTDGFLYSNAKLEKQGLMKRKGFPESYDMPSLLRV
LNAIKSGQRNVRIPVYSHHYYDIVRGQYEIVDQPDIVILEGLNILQTGVRKTLQQLQVFVSDFFDFSLFVDAQAQVIQKW
YIDRVLSFWRTTFKDPHSYFHYLTQMSETEVAAFAKHVWNEINKVNLMENILPYKNRAQLILEKAADHSIQKVYLRKI
>P0A6I3 2.7.1.33~~~coaA~~~Pantothenate kinase~~~COG1072
MSIKEQTLMTPYLQFDRNQWAALRDSVPMTLSEDEIARLKGINEDLSLEEVAEIYLPLSRLLNFYISSNLRRQAVLEQFL
GTNGQRIPYIISIAGSVAVGKSTTARVLQALLSRWPEHRRVELITTDGFLHPNQVLKERGLMKKKGFPESYDMHRLVKFV
SDLKSGVPNVTAPVYSHLIYDVIPDGDKTVVQPDILILEGLNVLQSGMDYPHDPHHVFVSDFVDFSIYVDAPEDLLQTWY
INRFLKFREGAFTDPDSYFHNYAKLTKEEAIKTAMTLWKEINWLNLKQNILPTRERASLILTKSANHAVEEVRLRK
>B5XYG3 2.7.1.33~~~coaA~~~Pantothenate kinase~~~
MSQKEQTLMTPYLQFNRHQWAALRDSVPMTLTEDEITRLKGINEDLSLEEVAEIYLPLSRLLNFYISSNLRRQAVLEQFL
GTNGQRIPYIISIAGSVAVGKSTTARVLQALLSRWPEHRHVELITTDGFLHPNSVLKERGLMKKKGFPQSYDMHRLVKFV
SDLKSGVPQATAPVYSHLIYDVIPDGDKTVAQPDILILEGLNVLQSGMDYPHDPHHVFVSDFVDFSIYVDAPEELLKSWY
INRFLKFREGAFTDPDSYFHNYAKLSKEEAVDIATSLWNEINLMNLKENILPTRERASLIMTKSANHSVNQVRLRK
>P9WPA7 2.7.1.33~~~coaA~~~Pantothenate kinase~~~COG0572
MSRLSEPSPYVEFDRRQWRALRMSTPLALTEEELVGLRGLGEQIDLLEVEEVYLPLARLIHLQVAARQRLFAATAEFLGE
PQQNPDRPVPFIIGVAGSVAVGKSTTARVLQALLARWDHHPRVDLVTTDGFLYPNAELQRRNLMHRKGFPESYNRRALMR
FVTSVKSGSDYACAPVYSHLHYDIIPGAEQVVRHPDILILEGLNVLQTGPTLMVSDLFDFSLYVDARIEDIEQWYVSRFL
AMRTTAFADPESHFHHYAAFSDSQAVVAAREIWRTINRPNLVENILPTRPRATLVLRKDADHSINRLRLRKL
>P0ABQ0 ~~~coaBC~~~Coenzyme A biosynthesis bifunctional protein CoaBC~~~COG0452
MSLAGKKIVLGVSGGIAAYKTPELVRRLRDRGADVRVAMTEAAKAFITPLSLQAVSGYPVSDSLLDPAAEAAMGHIELGK
WADLVILAPATADLIARVAAGMANDLVSTICLATPAPVAVLPAMNQQMYRAAATQHNLEVLASRGLLIWGPDSGSQACGD
IGPGRMLDPLTIVDMAVAHFSPVNDLKHLNIMITAGPTREPLDPVRYISNHSSGKMGFAIAAAAARRGANVTLVSGPVSL
PTPPFVKRVDVMTALEMEAAVNASVQQQNIFIGCAAVADYRAATVAPEKIKKQATQGDELTIKMVKNPDIVAGVAALKDH
RPYVVGFAAETNNVEEYARQKRIRKNLDLICANDVSQPTQGFNSDNNALHLFWQDGDKVLPLERKELLGQLLLDEIVTRY
DEKNRR
>A0QWT2 ~~~coaBC~~~Coenzyme A biosynthesis bifunctional protein CoaBC~~~COG0452
MSARKRIVVGVAGGIAAYKACTVVRQLTEAGHSVRVVPTESALRFVGAATFEALSGNPVHTGVFTDVHEVQHVRIGQQAD
LVVIAPATADLLARAVAGRADDLLTATLLTARCPVLFAPAMHTEMWLHPATVDNVATLRRRGAVVLEPASGRLTGADSGP
GRLPEAEEITTLAQLLLERADALPYDMAGVKALVTAGGTREPLDPVRFIGNRSSGKQGYAVARVLAQRGADVTLIAGNTA
GLIDPAGVEMVHIGSATQLRDAVSKHAPDANVLVMAAAVADFRPAHVAAAKIKKGASEPSSIDLVRNDDVLAGAVRARAD
GQLPNMRAIVGFAAETGDANGDVLFHARAKLERKGCDLLVVNAVGENRAFEVDHNDGWLLSADGTESALEHGSKTLMATR
IVDSIAAFLKSQDG
>P9WNZ1 ~~~coaBC~~~Coenzyme A biosynthesis bifunctional protein CoaBC~~~COG0452
MVDHKRIPKQVIVGVSGGIAAYKACTVVRQLTEASHRVRVIPTESALRFVGAATFEALSGEPVCTDVFADVPAVPHVHLG
QQADLVVVAPATADLLARAAAGRADDLLTATLLTARCPVLFAPAMHTEMWLHPATVDNVATLRRRGAVVLEPATGRLTGA
DSGAGRLPEAEEITTLAQLLLERHDALPYDLAGRKLLVTAGGTREPIDPVRFIGNRSSGKQGYAVARVAAQRGADVTLIA
GHTAGLVDPAGVEVVHVSSAQQLADAVSKHAPTADVLVMAAAVADFRPAQVATAKIKKGVEGPPTIELLRNDDVLAGVVR
ARAHGQLPNMRAIVGFAAETGDANGDVLFHARAKLRRKGCDLLVVNAVGEGRAFEVDSNDGWLLASDGTESALQHGSKTL
MASRIVDAIVTFLAGCSS
>B2HUN5 2.7.7.3~~~coaD~~~Phosphopantetheine adenylyltransferase~~~
MSKTRVIYPGTFDPITNGHVDLVTRASRMFDEVVVAIAIGHHKNPLFSLEERVALAQSSLGHLSNVEFVGFDGLLVNFFK
EQKATAVLRGLRAVSDFEYEFQLANMNRQLDPHFEAVFLTPSEQYSFISSTLIREIARLKGDVTKFVPQAVVEAFERKHQ
QGW
>B0VTH7 2.7.7.3~~~coaD~~~Phosphopantetheine adenylyltransferase~~~
MSKTSVIYPGTFDPITNGHVDLVTRASRMFDEVVVAIAIGHHKNPLFSLEERVALAQSSLGHLSNVEFVGFDGLLVNFFK
EQKATAVLRGLRAVSDFEYEFQLANMNRQLDPHFEAVFLTPSEQYSFISSTLIREIARLKGDVTKFVPQAVVEAFERKHQ
QGW
>B0V8I3 2.7.7.3~~~coaD~~~Phosphopantetheine adenylyltransferase~~~
MSKTRVIYPGTFDPITNGHVDLVTRASRMFDEVVVAIAIGHHKNPLFSLEERVALAQSSLGHLSNVEFVGFDGLLVNFFK
EQKATAVLRGLRAVSDFEYEFQLANMNRQLDPHFEAVFLTPSEQYSFISSTLIREIARLKGDVTKFVPQAVVEAFERKHQ
QGW
>O34797 2.7.7.3~~~coaD~~~Phosphopantetheine adenylyltransferase~~~COG0669
MASIAVCPGSFDPVTYGHLDIIKRGAHIFEQVYVCVLNNSSKKPLFSVEERCELLREVTKDIPNITVETSQGLLIDYAKR
KNAKAILRGLRAVSDFEYEMQGTSVNRVLDESIETFFMMTNNQYSFLSSSIVKEVARYNGSVSEFVPPEVELALQQKFRQ
G
>Q3JW91 2.7.7.3~~~coaD~~~Phosphopantetheine adenylyltransferase~~~
MVVAVYPGTFDPLTRGHEDLVRRASSIFDTLVVGVADSRAKKPFFSLEERLKIANEVLGHYPNVKVMGFTGLLKDFVRAN
DARVIVRGLRAVSDFEYEFQMAGMNRYLLPDVETMFMTPSDQYQFISGTIVREIAQLGGDVSKFVFPSVEKWLTEKVAAM
AQGPSA
>Q83EM7 2.7.7.3~~~coaD~~~Phosphopantetheine adenylyltransferase~~~COG0669
MKPIAIYPGTFDPLTNGHVDIIERALPLFNKIIVACAPTSRKDPHLKLEERVNLIADVLTDERVEVLPLTGLLVDFAKTH
QANFILRGLRAVSDFDYEFQLAHMNYQLSPEIETIFLPAREGYSYVSGTMVREIVTLGGDVSPFVPPLVARHLQKRREK
>P0A6I6 2.7.7.3~~~coaD~~~Phosphopantetheine adenylyltransferase~~~COG0669
MQKRAIYPGTFDPITNGHIDIVTRATQMFDHVILAIAASPSKKPMFTLEERVALAQQATAHLGNVEVVGFSDLMANFARN
QHATVLIRGLRAVADFEYEMQLAHMNRHLMPELESVFLMPSKEWSFISSSLVKEVARHQGDVTHFLPENVHQALMAKLA
>A4W515 2.7.7.3~~~coaD~~~Phosphopantetheine adenylyltransferase~~~COG0669
MSTKAIYPGTFDPITNGHIDIITRAASMFDRVILAIAASPSKKPMFDLEERVALATTALQHLPNVEVMGFSDLMANFARA
QQANILIRGLRAVADFEYEMQLAHMNRHLMPELESVFLMPSKEWSFISSSLVKEVARHAGDVTHFLPANVHQALMEKLK
>Q831P9 2.7.7.3~~~coaD~~~Phosphopantetheine adenylyltransferase~~~COG0669
MRKIALFPGSFDPMTNGHLNLIERSAKLFDEVIIGVFINTSKQTLFTPEEKKYLIEEATKEMPNVRVIMQETQLTVESAK
SLGANFLIRGIRNVKDYEYEKDIAKMNQHLAPEIETVFLLAEEPYAHVSSSLLKEVLRFGGDVSDYLPPNIYHALKQKKN
DWS
>O26010 2.7.7.3~~~coaD~~~Phosphopantetheine adenylyltransferase~~~COG0669
MQKIGIYPGTFDPVTNGHIDIIHRSSELFEKLIVAVAHSSAKNPMFSLDERLKMIQLATKSFKNVECVAFEGLLANLAKE
YHCKVLVRGLRVVSDFEYELQMGYANKSLNHELETLYFMPTLQNAFISSSIVRSIIAHKGDASHLVPKEIYPLISKA
>Q9XC89 2.7.7.3~~~coaD~~~Phosphopantetheine adenylyltransferase~~~
MSTKAIYPGTFDPITNGHIDIVTRAASMFDKVVLAIAASPSKKPMFSLDERIALAEQATAHLVNVEVIGFSDLMANFARA
QQANILIRGLRAVADFEYEMQLAHMNRHLMPTLESVFLMPCKEWSFISSSLVKEVARHQGDVSHFLPANVHQALLNKLK
>B1MDL6 2.7.7.3~~~coaD~~~Phosphopantetheine adenylyltransferase~~~
MTGAVCPGSFDPVTLGHLDVFERAAAQFDEVIVAVLINPNKAGMFTVDERIEMIRESTADLPNLRVESGQGLLVDFVRER
GLNAIVKGLRTGTDFEYELQMAQMNKHIAGVDTFFVATAPAYSFVSSSLAKEVATYGGDVSALLPASVHQRLLGKLRGQA
Q
>P9WPA5 2.7.7.3~~~coaD~~~Phosphopantetheine adenylyltransferase~~~COG0669
MTGAVCPGSFDPVTLGHVDIFERAAAQFDEVVVAILVNPAKTGMFDLDERIAMVKESTTHLPNLRVQVGHGLVVDFVRSC
GMTAIVKGLRTGTDFEYELQMAQMNKHIAGVDTFFVATAPRYSFVSSSLAKEVAMLGGDVSELLPEPVNRRLRDRLNTER
T
>B7V2S6 2.7.7.3~~~coaD~~~Phosphopantetheine adenylyltransferase~~~
MNRVLYPGTFDPITKGHGDLIERASRLFDHVIIAVAASPKKNPLFSLEQRVALAQEVTKHLPNVEVVGFSTLLAHFVKEQ
KANVFLRGLRAVSDFEYEFQLANMNRQLAPDVESMFLTPSEKYSFISSTLVREIAALGGDISKFVHPAVADALAERFKR
>Q9I6D1 2.7.7.3~~~coaD~~~Phosphopantetheine adenylyltransferase~~~
MNRVLYPGTFDPITKGHGDLIERASRLFDHVIIAVAASPKKNPLFSLEQRVALAQEVTKHLPNVEVVGFSTLLAHFVKEQ
KANVFLRGLRAVSDFEYEFQLANMNRQLAPDVESMFLTPSEKYSFISSTLVREIAALGGDISKFVHPAVADALAERFKR
>P63819 2.7.7.3~~~coaD~~~Phosphopantetheine adenylyltransferase~~~
MEHTIAVIPGSFDPITYGHLDIIERSTDRFDEIHVCVLKNSKKEGTFSLEERMDLIEQSVKHLPNVKVHQFSGLLVDYCE
QVGAKTIIRGLRAVSDFEYELRLTSMNKKLNNEIETLYMMSSTNYSFISSSIVKEVAAYRADISEFVPPYVEKALKKKFK
>P63820 2.7.7.3~~~coaD~~~Phosphopantetheine adenylyltransferase~~~
MEHTIAVIPGSFDPITYGHLDIIERSTDRFDEIHVCVLKNSKKEGTFSLEERMDLIEQSVKHLPNVKVHQFSGLLVDYCE
QVGAKTIIRGLRAVSDFEYELRLTSMNKKLNNEIETLYMMSSTNYSFISSSIVKEVAAYRADISEFVPPYVEKALKKKFK
>Q8DNE6 2.7.7.3~~~coaD~~~Phosphopantetheine adenylyltransferase~~~COG0669
MSDKIGLFTGSFDPMTNGHLDIIERASRLFDKLYVGIFFNPHKQGFLPIENRKRGLEKALGHLENVEVVASHDELVVDVA
KRLGATCLVRGLRNASDLQYEASFDYYNHQLSSDIETIYLHSRPEHLYISSSGVRELLKFGQDIACYVPESILEEIRNEK
KD
>Q9WZK0 2.7.7.3~~~coaD~~~Phosphopantetheine adenylyltransferase~~~COG0669
MKAVYPGSFDPITLGHVDIIKRALSIFDELVVLVTENPRKKCMFTLEERKKLIEEVLSDLDGVKVDVHHGLLVDYLKKHG
IKVLVRGLRAVTDYEYELQMALANKKLYSDLETVFLIASEKFSFISSSLVKEVALYGGDVTEWVPPEVARALNEKLKEGK
R
>Q5SJS9 2.7.7.3~~~coaD~~~Phosphopantetheine adenylyltransferase~~~COG0669
MHVVYPGSFDPLTNGHLDVIQRASRLFEKVTVAVLENPSKRGQYLFSAEERLAIIREATAHLANVEAATFSGLLVDFVRR
VGAQAIVKGLRAVSDYEYELQMAHLNRQLYPGLETLFILAATRYSFVSSTMVKEIARYGGDVSKLVPPATLRALKAKLGQ
>Q8ZJN9 2.7.7.3~~~coaD~~~Phosphopantetheine adenylyltransferase~~~COG0669
MITKAIYPGTFDPITNGHLDLVTRASAMFSHVILAIADSSSKKPMFTLDERVALAKKVTAPLKNVEVLGFSELMAEFAKK
HNANILVRGLRSVSDFEYEWQLANMNRHLMPKLESVFLIPSEKWSFISSSLVKEVARHGGDITPFLPKPVTKALLAKLA
>O67792 2.7.1.24~~~coaE~~~Dephospho-CoA kinase~~~COG0237
MKRIGLTGNIGCGKSTVAQMFRELGAYVLDADKLIHSFYRKGHPVYEEVVKTFGKGILDEEGNIDRKKLADIVFKDEEKL
RKLEEITHRALYKEIEKITKNLSEDTLFILEASLLVEKGTYKNYDKLIVVYAPYEVCKERAIKRGMSEEDFERRWKKQMP
IEEKVKYADYVIDNSGSIEETYKQVKKVYEELTRDP
>Q9PMD9 2.7.1.24~~~coaE~~~Dephospho-CoA kinase~~~COG0237
MKNAFFVTASIACGKSTFIEIANSLGFKSISADKIAHKILDENALELEKIFSPFSLKNLLKKEKKIDRKILGEIVFNNKE
AKKILENFTHPKIRAKILEQMQILDKENKAFFVEIPLFFESGAYENLGKVIVIYTPKELSLKRIMQRDKLSLEAAKARLD
SQIDIEEKLKKADFIIKNTNSYADFRQECVKVIQEISKGNM
>P0A6I9 2.7.1.24~~~coaE~~~Dephospho-CoA kinase~~~COG0237
MRYIVALTGGIGSGKSTVANAFADLGINVIDADIIARQVVEPGAPALHAIADHFGANMIAADGTLQRRALRERIFANPEE
KNWLNALLHPLIQQETQHQIQQATSPYVLWVVPLLVENSLYKKANRVLVVDVSPETQLKRTMQRDDVTREHVEQILAAQA
TREARLAVADDVIDNNGAPDAIASDVARLHAHYLQLASQFVSQEKP
>P44920 2.7.1.24~~~coaE~~~Dephospho-CoA kinase~~~COG0237
MTYIVGLTGGIGSGKTTIANLFTDLGVPLVDADVVAREVVAKDSPLLSKIVEHFGAQILTEQGELNRAALRERVFNHDED
KLWLNNLLHPAIRERMKQKLAEQTAPYTLFVVPLLIENKLTALCDRILVVDVSPQTQLARSAQRDNNNFEQIQRIMNSQV
SQQERLKWADDVINNDAELAQNLPHLQQKVLELHQFYLQQAENKNA
>Q5ZVH3 2.7.1.24~~~coaE~~~Dephospho-CoA kinase~~~COG0237
MVYSVGLTGNIASGKSTVAEFFSELGINVIYADKIAKELTSKNTPCYQDIISHFGSSVVLNNGELDRKRIRDIIFSNSNE
RLWLESLLHPVIRKKIEEQLIVCTSPYCLIEIPLLFNKHHYPYLQKVLLVIAPLESQLDRIVKRDHCTKKQALAILATQP
NLEQRLEAADDVLINESGLSELKAKVNKLHQKYLREAKIKQ
>Q740M4 2.7.1.24~~~coaE~~~Dephospho-CoA kinase~~~COG0237
MLRIGLTGGIGAGKSALSSAFAQCGAVIVDGDVIAREVVRPGTEGLAALVEAFGRDILLADGSLDRPALAAKAFADDAAR
QTLNGIVHPLVGARRAEIIASVPADSVVVEDIPLLVESGMAPLFPLVVIVYADVEVRLRRLVEQRGMAEADARARIAAQA
SDEQRRAVADIWLDNSGSPAELVQRAQQVWNERIVPFAHNLSTRQIARAPVRLVPPDPEWPAQAQRIVNRLKTASGHRAL
RVDHVGSTALPGDPDFAAKDVIDIQITVESLAAADELVEPLLAAGYPRLEHITADVAKPDARSTVERYDHTGDPALWHKR
IHASADPGRPTNVHIRVDGWPGQQFALLFVDWLTADPDARADYLAVKRSAEQRADGDIDAYVAVKEPWFRDAYRRAWDWA
DSTGWKP
>P9WPA3 2.7.1.24~~~coaE~~~Dephospho-CoA kinase~~~COG0237
MLRIGLTGGIGAGKSLLSTTFSQCGGIVVDGDVLAREVVQPGTEGLASLVDAFGRDILLADGALDRQALAAKAFRDDESR
GVLNGIVHPLVARRRSEIIAAVSGDAVVVEDIPLLVESGMAPLFPLVVVVHADVELRVRRLVEQRGMAEADARARIAAQA
SDQQRRAVADVWLDNSGSPEDLVRRARDVWNTRVQPFAHNLAQRQIARAPARLVPADPSWPDQARRIVNRLKIACGHKAL
RVDHIGSTAVSGFPDFLAKDVIDIQVTVESLDVADELAEPLLAAGYPRLEHITQDTEKTDARSTVGRYDHTDSAALWHKR
VHASADPGRPTNVHLRVHGWPNQQFALLFVDWLAANPGAREDYLTVKCDADRRADGELARYVTAKEPWFLDAYQRAWEWA
DAVHWRP
>Q4UN30 2.7.1.24~~~coaE~~~Dephospho-CoA kinase~~~COG0237
MLAIGITGSYASGKTFILDYLAEKGYKTFCADRCIKELYQDLSVQTQILKLLPELESFNIGKISNLIYNNDLAREKLQNF
IYPLLIDKLILFKKENANSKFGFAEIPLLYEAKFDKYFDFVVTIYCSEEIRMQRAITRTSFDIEIYNKIKEIQLSQESKI
AKADFAINSGVDMLDLEKQIEKLILVIARKL
>P63831 2.7.1.24~~~coaE~~~Dephospho-CoA kinase~~~
MPKVIGLTGGIASGKSTVSELLSVFGFKVVDADKAAREAVKKGSKGLAQVREVFGDEAIDENGEMNRRYMGDLVFNHPEK
RLELNAIIHPIVRDIMEEEKQEYLKQGYNVIMDIPLLFENELENTVDEVWVVYTSESIQMDRLMQRNNLSLEDAKARVYS
QISIDKKSRMADHVIDNLGDKLELKQNLERLLEEEGYIEKPNYGEED
>Q9X1A7 2.7.1.24~~~coaE~~~Dephospho-CoA kinase~~~COG0237
MVIGVTGKIGTGKSTVCEILKNKYGAHVVNVDRIGHEVLEEVKEKLVELFGGSVLEDGKVNRKKLAGIVFESRENLKKLE
LLVHPLMKKRVQEIINKTSGLIVIEAALLKRMGLDQLCDHVITVVASRETILKRNREADRRLKFQEDIVPQGIVVANNST
LEDLEKKVEEVMKLVWEKRE
>Q56416 2.7.1.24~~~coaE~~~Dephospho-CoA kinase~~~COG0237
MGHEAKHPIIIGITGNIGSGKSTVAALLRSWGYPVLDLDALAARARENKEEELKRLFPEAVVGGRLDRRALARLVFSDPE
RLKALEAVVHPEVRRLLMEELSRLEAPLVFLEIPLLFEKGWEGRLHGTLLVAAPLEERVRRVMARSGLSREEVLARERAQ
MPEEEKRKRATWVLENTGSLEDLERALKAVLAELTGGAKGGRG
>P96877 2.8.3.-~~~~~~Probable fatty acyl-CoA transferase Rv3272~~~COG1804
MPTSNPAKPLDGFRVLDFTQNVAGPLAGQVLVDLGAEVIKVEAPGGEAARQITSVLPGRPPLATYFLPNNRGKKSVTVDL
TTEQAKQQMLRLADTADVVLEAFRPGTMEKLGLGPDDLRSRNPNLIYARLTAYGGNGPHGSRPGIDLVVAAEAGMTTGMP
TPEGKPQIIPFQLVDNASGHVLAQAVLAALLHRERNGVADVVQVAMYDVAVGLQANQLMMHLNRAASDQPKPEPAPKAKR
RKGVGFATQPSDAFRTADGYIVISAYVPKHWQKLCYLIGRPDLVEDQRFAEQRSRSINYAELTAELELALASKTATEWVQ
LLQANGLMACLAHTWKQVVDTPLFAENDLTLEVGRGADTITVIRTPARYASFRAVVTDPPPTAGEHNAVFLARP
>Q2FWC7 2.7.1.33~~~coaW~~~Type II pantothenate kinase~~~COG5146
MKVGIDAGGTLIKIVQEQDNQRTFKTELTKNIDQVVEWLNQQQIEKLCLTGGNAGVIAENINIPAQIFVEFDAASQGLGI
LLKEQGHDLADYIFANVGTGTSLHYFDGQSQRRVGGIGTGGGMIQGLGYLLSQITDYKQLTDMAQHGDRNTIDLKVRHIY
KDTEPPIPGDLTAANFGHVLHHLDADFTPSNKLAAVIGVVGEVVTTMAITVAREFKTENIVYIGSSFHNNALLRKVVEDY
TVLRGCKPYYVENGAFSGAIGALYLEK
>Q6G7I0 2.7.1.33~~~coaW~~~Type II pantothenate kinase~~~
MKVGIDAGGTLIKIVQEQDNQRTFKTELTKNIDQVVEWLNQQQIEKLCLTGGNAGVIAENINIPAQIFVEFDAASQGLGI
LLKEQGHDLADYIFANVGTGTSLHYFDGQSQRRVGGIGTGGGMIQGLGYLLSQITDYKQLTDMAQHGDRNTIDLKVRHIY
KDTEPPIPGDLTAANFGHVLHHLDADFTPSNKLAAVIGVVGEVVTTMAITVAREFKTENIVYIGSSFHNNALLRKVVEDY
TVLRGCKPYYVENGAFSGAIGALYLEK
>Q8NVG0 2.7.1.33~~~coaW~~~Type II pantothenate kinase~~~
MKVGIDAGGTLIKIVQEQDNQRTFKTELTKNIDQVVEWLNQQQIEKLCLTGGNAGVIAENINIPAQIFVEFDAASQGLGI
LLKEQGHDLADYIFANVGTGTSLHYFDGQSQRRVGGIGTGGGMIQGLGYLLSQITDYKQLTDMAQHGDRNTIDLKVRHIY
KDTEPPIPGDLTAANFGHVLHHLDADFTPSNKLAAVIGVVGEVVTTMAITVAREFKTENIVYIGSSFHNNALLRKVVEDY
TVLRGCKPYYVENGAFSGAIGALYLEK
>Q81VX4 2.7.1.33~~~coaX~~~Type III pantothenate kinase~~~COG1521
MIFVLDVGNTNAVLGVFEEGELRQHWRMETDRHKTEDEYGMLVKQLLEHEGLSFEDVKGIIVSSVVPPIMFALERMCEKY
FKIKPLVVGPGIKTGLNIKYENPREVGADRIVNAVAGIHLYGSPLIIVDFGTATTYCYINEEKHYMGGVITPGIMISAEA
LYSRAAKLPRIEITKPSSVVGKNTVSAMQSGILYGYVGQVEGIVKRMKEEAKQEPKVIATGGLAKLISEESNVIDVVDPF
LTLKGLYMLYERNANLQHEKGE
>P37564 2.7.1.33~~~coaX~~~Type III pantothenate kinase~~~COG1521
MLLVIDVGNTNTVLGVYHDGKLEYHWRIETSRHKTEDEFGMILRSLFDHSGLMFEQIDGIIISSVVPPIMFALERMCTKY
FHIEPQIVGPGMKTGLNIKYDNPKEVGADRIVNAVAAIHLYGNPLIVVDFGTATTYCYIDENKQYMGGAIAPGITISTEA
LYSRAAKLPRIEITRPDNIIGKNTVSAMQSGILFGYVGQVEGIVKRMKWQAKQEPKVIATGGLAPLIANESDCIDIVDPF
LTLKGLELIYERNRVGSV
>B4E9P3 2.7.1.33~~~coaX~~~Type III pantothenate kinase~~~COG1521
MSEPHLLIDAGNSRIKWALADARRTLVDTGAFGHTRDGGADPDWSRLPRPRGAWISNVAGADVAARIDALLDARWPGLPR
TTIRSRPAQCGVTNGYTTPEQLGSDRWAGLIGAHAAFPGEHLLIATFGTATTLEALRADGCFTGGLIAPGWALMMRALGT
HTAQLPTLTTDIASGLLAGAQAEPFQVDTPRSLSAGCLYAQAGLIERAWRDLVAAWQAPVRLVLAGGAADDVARALTIAH
TRHDTLILSGLALIAADAADPATAPD
>Q2T1M2 2.7.1.33~~~coaX~~~Type III pantothenate kinase~~~
MSGVCLLIDAGNSRIKWALADTGRHFVTSGAFEHADDTPDWSTLPAPRGAWISNVAGDAAAARIDALIDAHWPALPRTVV
RACAAQCGVTNGYAEPARLGSDRWAGLIGAHAAFPGEHLLIATFGTATTLEALRADGRFTGGLIAPGWALMMRSLGMHTA
QLPTVSIDAATSLLDELAANDAHAPFAIDTPHALSAGCLQAQAGLIERAWRDLEKAWKAPVRLVLSGGAADAIVRALTVP
HTRHDTLVLTGLALIAHSA
>Q0PBB6 2.7.1.33~~~coaX~~~Type III pantothenate kinase~~~COG1521
MLLCDIGNSNANFLDDNKYFTLNIDQFLEFKNEQKIFYINVNEHLKEHLKNQKNFINLEPYFLFDTIYQGLGIDRIAACY
TIEDGVVVDAGSAITIDIISNSIHLGGFILPGIANYKKIYSHISPRLKSEFNTQVSLDAFPQKTMDALSYGVFKGIYLLI
KDAAQNKKLYFTGGDGQFLANYFDHAIYDKLLIFRGMKKIIKENPNLLY
>O25533 2.7.1.33~~~coaX~~~Type III pantothenate kinase~~~COG1521
MPARQSFTDLKNLVLCDIGNTRIHFAQNYQLFSSAKEDLKRLGIQKEIFYISVNEENEKALLNCYPNAKNIAGFFHLETD
YVGLGIDRQMACLAVNNGVVVDAGSAITIDLIKEGKHLGGCILPGLAQYIHAYKKSAKILEQPFKALDSLEVLPKSTRDA
VNYGMVLSVIACIQHLAKNQKIYLCGGDAKYLSAFLPHSVCKERLVFDGMEIALKKAGILECK
>Q5ZX22 2.7.1.33~~~coaX~~~Type III pantothenate kinase~~~COG1521
MILCIDVGNSHIYGGVFDGDEIKLRFRHTSKVSTSDELGIFLKSVLRENNCSPETIRKIAICSVVPQVDYSLRSACVKYF
SIDPFLLQAGVKTGLNIKYRNPVEVGADRIANAIAATHSFPNQNIIVIDFGTATTFCAISHKKAYLGGAILPGLRLSADA
LSKNTAKLPSVEIIKTESVVGRSTIESIQSGVYYGVLGACKELIQRIHHEAFNGDQILILATGGFASLFDKQGLYDHLVP
DLVLQGIRLAAMMNTA
>P9WPA1 2.7.1.33~~~coaX~~~Type III pantothenate kinase~~~COG1521
MLLAIDVRNTHTVVGLLSGMKEHAKVVQQWRIRTESEVTADELALTIDGLIGEDSERLTGTAALSTVPSVLHEVRIMLDQ
YWPSVPHVLIEPGVRTGIPLLVDNPKEVGADRIVNCLAAYDRFRKAAIVVDFGSSICVDVVSAKGEFLGGAIAPGVQVSS
DAAAARSAALRRVELARPRSVVGKNTVECMQAGAVFGFAGLVDGLVGRIREDVSGFSVDHDVAIVATGHTAPLLLPELHT
VDHYDQHLTLQGLRLVFERNLEVQRGRLKTAR
>Q9HWC1 2.7.1.33~~~coaX~~~Type III pantothenate kinase~~~
MILELDCGNSLIKWRVIEGAARSVAGGLAESDDALVEQLTSQQALPVRACRLVSVRSEQETSQLVARLEQLFPVSALVAS
SGKQLAGVRNGYLDYQRLGLDRWLALVAAHHLAKKACLVIDLGTAVTSDLVAADGVHLGGYICPGMTLMRSQLRTHTRRI
RYDDAEARRALASLQPGQATAEAVERGCLLMLRGFVREQYAMACELLGPDCEIFLTGGDAELVRDELAGARIMPDLVFVG
LALACPIE
>Q9WZY5 2.7.1.33~~~coaX~~~Type III pantothenate kinase~~~COG1521
MYLLVDVGNTHSVFSITEDGKTFRRWRLSTGVFQTEDELFSHLHPLLGDAMREIKGIGVASVVPTQNTVIERFSQKYFHI
SPIWVKAKNGCVKWNVKNPSEVGADRVANVVAFVKEYGKNGIIIDMGTATTVDLVVNGSYEGGAILPGFFMMVHSLFRGT
AKLPLVEVKPADFVVGKDTEENIRLGVVNGSVYALEGIIGRIKEVYGDLPVVLTGGQSKIVKDMIKHEIFDEDLTIKGVY
HFCFGD
>P9WP97 6.3.5.9~~~cobB~~~Hydrogenobyrinate a,c-diamide synthase~~~COG1797
MRVSAVAVAAPASGSGKTTIATGLIGALRQAGHTVAPFKVGPDFIDPGYHALAAGRPGRNLDPVLVGERLIGPLYAHGVA
GADIAVIEGVLGLFDGRIGPAGGAPAAGSTAHVAALLGAPVILVVDARGQSHSVAALLHGFSTFDTATRIAGVILNRVGS
ARHEQVLRQACDQAGVAVLGAIPRTAELELPTRYLGLVTAVEYGRRARLAVQAMTAVVARHVDLAAVIACAGSQAAHPPW
DPVIAVGNTARQPATVAIAAGRAFTFGYAEHAEMLRAAGAEVVEFDPLSETLPEGTDAVVLPGGFPEQFTAELSANDTVR
RQINELAAAGAPVHAECAGLLYLVSELDGHPMCGVVAGSARFTQHLKLGYRDAVAVVDSALYSVGERVVGHEFHRTAVTF
ADSYQPAWVYQGQDVDDVRDGAVHSGVHASYLHTHPAATPGAVARFVAHAACNTPRA
>P21632 6.3.5.9~~~cobB~~~Hydrogenobyrinate a,c-diamide synthase~~~
MSGLLIAAPASGSGKTTVTLGLMRALKRRGVAIAPGKAGPDYIDPAFHAAATGEPCFNYDPWAMRPELLLANASHVASGG
RTLIVEAMMGLHDGAADGSGTPADLAATLNLAVILVVDCARMSQSVAALVRGYADHRDDIRVVGVILNKVGSDRHEMMLR
DALGKVRMPVFGVLRQDSALQLPERHLGLVQAGEHSALEGFIEAAAARVEAACDLDAIRLIATIFPQVPAAADAERLRPL
GQRIAVARDIAFAFCYEHLLYGWRQGGAEISFFSPLADEGPDAAADAVYLPGGYPELHAGQLSAAARFRSGMHSAAERGA
RIFGECGGYMVLGEGLVAADGTRYDMLGLLPLVTSFAERRRHLGYRRVVPVDNAFFDGPMTAHEFHYATIVAEGAADRLF
AVSDAAGEDLGQAGLRRGPVAGSFMHLIDVAGAA
>P52086 3.1.3.73~~~cobC~~~Adenosylcobalamin/alpha-ribazole phosphatase~~~COG0406
MRLWLIRHGETQANIDGLYSGHAPTPLTARGIEQAQNLHTLLHGVSFDLVLCSELERAQHTARLVLSDRQLPVQIIPELN
EMFFGDWEMRHHRDLMQEDAENYSAWCNDWQHAIPTNGEGFQAFSQRVERFIARLSEFQHYQNILVVSHQGVLSLLIARL
IGMPAEAMWHFRVDQGCWSAIDINQKFATLRVLNSRAIGVENA
>P39701 3.1.3.73~~~cobC~~~Adenosylcobalamin/alpha-ribazole phosphatase~~~
MRLWLVRHGETEANVAGLYSGHAPTPLTEKGIGQAKTLHTLLRHAPFDRVLCSELERARHTARLVLEGRDVPQHILPELN
EMYFGDWEMRHHRDLTHEDAESYAAWCTDWQNAVPTNGEGFQAFTRRVERFISRLDAFSDCQNLLIVSHQGVLSLLIARL
LTMPAASLWHFRVEQGCWSAIDICEGFATLKVLNSRAVWRPE
>P9WP93 ~~~cobD~~~Cobalamin biosynthesis protein CobD~~~COG1270
MFASTWQTRAVGVLIGCLLDVVFGDPKRGHPVALFGRAAAKLEQITYRDGRVAGAVHVGLLVGAVGLLGAALQRLPGRSW
PVAATATATWAALGGTSLARTGRQISDLLERDDVEAARRLLPSLCGRDPAQLGGPGLTRAALESVAENTADAQVVPLLWA
ASSGVPAVLGYRAINTLDSMIGYRSPRYLRFGWAAARLDDWANYVGARATAVLVVICAPVVGGSPRGAVRAWRRDAARHP
SPNAGVVEAAFAGALDVRLGGPTRYHHELQIRPTLGDGRSPKVADLRRAVVLSRVVQAGAAVLAVMLVYRRRP
>P97084 4.1.1.81~~~cobD~~~Threonine-phosphate decarboxylase~~~
MALFNSAHGGNIREAATVLGISPDQLLDFSANINPLGMPVSVKRALIDNLDCIERYPDADYFHLHQALARHHQVPASWIL
AGNGETESIFTVASGLKPRRAMIVTPGFAEYGRALAQSGCEIRRWSLREADGWQLTDAILEALTPDLDCLFLCTPNNPTG
LLPERPLLQAIADRCKSLNINLILDEAFIDFIPHETGFIPALKDNPHIWVLRSLTKFYAIPGLRLGYLVNSDDAAMARMR
RQQMPWSVNALAALAGEVALQDSAWQQATWHWLREEGARFYQALCQLPLLTVYPGRANYLLLRCEREDIDLQRRLLTQRI
LIRSCANYPGLDSRYYRVAIRSAAQNERLLAALRNVLTGIAPAD
>P9WP87 5.4.99.61~~~cobH~~~Precorrin-8X methylmutase~~~COG2082
MLDYLRDAAEIYRRSFAVIRAEADLARFPADVARVVVRLIHTCGQVDVAEHVAYTDDVVARAGAALAAGAPVLCDSSMVA
AGITTSRLPADNQIVSLVADPRATELAARRQTTRSAAGVELCAERLPGAVLAIGNAPTALFRLLELVDEGAPPPAAVLGG
PVGFVGSAQAKEELIERPRGMSYLVVRGRRGGSAMAAAAVNAIASDRE
>P21638 5.4.99.61~~~cobH~~~Precorrin-8X methylmutase~~~
MPEYDYIRDGNAIYERSFAIIRAEADLSRFSEEEADLAVRMVHACGSVEATRQFVFSPDFVSSARAALKAGAPILCDAEM
VAHGVTRARLPAGNEVICTLRDPRTPALAAEIGNTRSAAALKLWSERLAGSVVAIGNAPTALFFLLEMLRDGAPKPAAIL
GMPVGFVGAAESKDALAENSYGVPFAIVRGRLGGSAMTAAALNSLARPGL
>P9WGB3 ~~~cobIJ~~~Cobalamin biosynthesis protein CobIJ~~~COG1010
MSARGTLWGVGLGPGDPELVTVKAARVIGEADVVAYHSAPHGHSIARGIAEPYLRPGQLEEHLVYPVTTEATNHPGGYAG
ALEDFYADATERIATHLDAGRNVALLAEGDPLFYSSYMHLHTRLTRRFNAVIVPGVTSVSAASAAVATPLVAGDQVLSVL
PGTLPVGELTRRLADADAAVVVKLGRSYHNVREALSASGLLGDAFYVERASTAGQRVLPAADVDETSVPYFSLAMLPGGR
RRALLTGTVAVVGLGPGDSDWMTPQSRRELAAATDLIGYRGYLDRVEVRDGQRRHPSDNTDEPARARLACSLADQGRAVA
VVSSGDPGVFAMATAVLEEAEQWPGVRVRVIPAMTAAQAVASRVGAPLGHDYAVISLSDRLKPWDVIAARLTAAAAADLV
LAIYNPASVTRTWQVGAMRELLLAHRDPGIPVVIGRNVSGPVSGPNEDVRVVKLADLNPAEIDMRCLLIVGSSQTRWYSV
DSQDRVFTPRRYPEAGRATATKSSRHSD
>P21639 2.1.1.130~~~cobI~~~Precorrin-2 C(20)-methyltransferase~~~
MSGVGVGRLIGVGTGPGDPELLTVKAVKALGQADVLAYFAKAGRSGNGRAVVEGLLKPDLVELPLYYPVTTEIDKDDGAY
KTQITDFYNASAEAVAAHLAAGRTVAVLSEGDPLFYGSYMHLHVRLANRFPVEVIPGITAMSGCWSLAGLPLVQGDDVLS
VLPGTMAEAELGRRLADTEAAVIMKVGRNLPKIRRALAASGRLDQAVYVERGTMKNAAMTALAEKADDEAPYFSLVLVPG
WKDRP
>P21640 2.1.1.131~~~cobJ~~~Precorrin-3B C(17)-methyltransferase~~~
MTGTLYVVGTGPGSAKQMTPETAEAVAAAQEFYGYFPYLDRLNLRPDQIRVASDNREELDRAQVALTRAAAGVKVCMVSG
GDPGVFAMAAAVCEAIDKGPAEWKSVELVITPGVTAMLAVAARIGAPLGHDFCAISLSDNLKPWEVITRRLRLAAEAGFV
IALYNPISKARPWQLGEAFELLRSVLPASVPVIFGRAAGRPDERIAVMPLGEADANRADMATCVIIGSPETRIVERDGQP
DLVYTPRFYAGASQ
>P9WP89 1.3.1.54~~~cobK~~~Precorrin-6A reductase~~~COG2099
MTRVLLLGGTAEGRALAKELHPHVEIVSSLAGRVPNPALPIGPVRIGGFGGVEGLRGWLREERIDAVVDATHPFAVTITA
HAAQVCGELGLPYLVLARPPWDPGTAIIAVSDIEAADVVAEQGYSRVFLTTGRSGIAAFANSDAWFLIRVVTAPDGTALP
RRHKLVLSRGPYGYHDEFALLREQRIDALVTKNSGGKMTRAKLDAAAALGISVVMIARPLLPAGVAAVDSVHRAAMWVAG
LPSR
>O68098 1.3.1.54~~~cobK~~~Precorrin-6A reductase~~~COG2099
MTRLLVLGGTTEASRLAKTLADQGFEAVFSYAGRTGAPVAQPLPTRIGGFGGVAGLVDYLTREGVSHVIDATHPFAAQMS
ANAVAACAQTGVALCAFERAPWTAQAGDRWTHVPDLAAAVAALPQAPARVFLAIGKQHLRDFSAAPQHHYLLRLVDPPEG
PLPLPDARAVIARGPFTVQGDTELLRSETITHVVAKNAGGAGAEAKLIAARSLGLPVILIDRPAVPARDICATLEGVMGW
LADHGATPRGV
>P21920 1.3.1.54~~~cobK~~~Precorrin-6A reductase~~~
MAGSLFDTSAMEKPRILILGGTTEARELARRLAEDVRYDTAISLAGRTADPRPQPVKTRIGGFGGADGLAHFVHDENIAL
LVDATHPFAARISHNAADAAQRTGVALIALRRPEWVPLPGDRWTAVDSVVEAVSALGDRRRRVFLAIGRQEAFHFEVAPQ
HSYVIRSVDPVTPPLNLPDQEAILATGPFAEADEAALLRSRQIDVIVAKNSGGSATYGKIAAARRLGIEVIMVERRKPAD
VPTVGSCDEALNRIAHWLAPA
>P9WGA9 2.1.1.132~~~cobL~~~Precorrin-6Y C(5,15)-methyltransferase [decarboxylating]~~~COG2241
MIIVVGIGADGMTGLSEHSRSELRRATVIYGSKRQLALLDDTVTAERWEWPTPMLPAVQGLSPDGADLHVVASGDPLLHG
IGSTLIRLFGHDNVTVLPHVSAVTLACARMGWNVYDTEVISLVTAQPHTAVRRGGRAIVLSGDRSTPQALAVLLTEHGRG
DSKFSVLEQLGGPAERRRDGTARAWACDPPLDVDELNVIAVRYLLDERTSWAPDEAFAHDGQITKHPIRVLTLAALAPRP
GQRLWDVGAGSGAIAVQWCRSWPGCTAVAFERDERRRRNIGFNAAAFGVSVDVRGDAPDAFDDAARPSVIFLGGGVTQPG
LLEACLDSLPAGGNLVANAVTVESEAALAHAYSRLGGELRRFQHYLGEPLGGFTGWRPQLPVTQWSVTKR
>P21921 2.1.1.132~~~cobL~~~Precorrin-6Y C(5,15)-methyltransferase [decarboxylating]~~~
MADVSNSEPAIVSPWLTVIGIGEDGVAGLGDEAKRLIAEAPVVYGGHRHLELAASLITGEAHNWLSPLERSVVEIVARRG
SPVVVLASGDPFFFGVGVTLARRIASAEIRTLPAPSSISLAASRLGWALQDATLVSVHGRPLDLVRPHLHPGARVLTLTS
DGAGPRDLAELLVSSGFGQSRLTVLEALGGAGERVTTQIAARFMLGLVHPLNVCAIEVAADEGARILPLAAGRDDALFEH
DGQITKREVRALTLSALAPRKGELLWDIGGGSGSIGIEWMLADPTMQAITIEVEPERAARIGRNATMFGVPGLTVVEGEA
PAALAGLPQPDAIFIGGGGSEDGVMEAAIEALKSGGRLVANAVTTDMEAVLLDHHARLGGSLIRIDIARAGPIGGMTGWK
PAMPVTQWSWTKG
>P9WGB1 2.1.1.133~~~cobM~~~Precorrin-4 C(11)-methyltransferase~~~COG2875
MTVYFIGAGPGAADLITVRGQRLLQRCPVCLYAGSIMPDDLLAQCPPGATIVDTGPLTLEQIVRKLADADADGRDVARLH
SGDPSLYSALAEQCRELDALGIGYEIVPGVPAFAAAAAALKRELTVPGVAQTVTLTRVATLSTPIPPGEDLAALARSRAT
LVLHLAAAQIDAIVPRLLDGGYRPETPVAVVAFASWPQQRTLRGTLADIAARMHDAKITRTAVIVVGDVLTAEGFTDSYL
YSVARHGRYAQ
>O68100 2.1.1.133~~~cobM~~~Precorrin-4 C(11)-methyltransferase~~~COG2875
MTVHFIGAGPGAADLITIRGRDLIASCPVCLYAGSLVPEALLAHCPPGAKIVNTAPMSLDAIIDTIAEAHAAGQDVARLH
SGDLSIWSAMGEQLRRLRALNIPYDVTPGVPSFAAAAATLGAELTLPGVAQSVILTRTSGRASAMPAGETLENFARTGAV
LAIHLSVHVLDEVVQKLVPHYGEDCPVAIVWRASWPDQRVVRATLATLQTSLGAELERTALILVGRSLATEDFDESRLYA
GDYDRRYRPLGTHPRFPEGSE
>P21922 2.1.1.133~~~cobM~~~Precorrin-4 C(11)-methyltransferase~~~
MTVHFIGAGPGAADLITVRGRDLIGRCPVCLYAGSIVSPELLRYCPPGARIVDTAPMSLDEIEAEYVKAEAEGLDVARLH
SGDLSVWSAVAEQIRRLEKHGIAYTMTPGVPSFAAAASALGRELTIPAVAQSLVLTRVSGRASPMPNSETLSAFGATGST
LAIHLAIHALQQVVEELTPLYGADCPVAIVVKASWPDERVVRGTLGDIAAKVAEEPIERTALIFVGPGLEASDFRESSLY
DPAYQRRFRGRGE
>P29929 6.6.1.2~~~cobN~~~Aerobic cobaltochelatase subunit CobN~~~
MHLLLAQKGTIADGNEAIDLGQTPADILFLSAADTELSSIAAAHGRRDGGLSLRIASLMSLMHPMSVDTYVERTARHAKL
IVVRPLGGASYFRYLLEALHAAAVTHRFEIAVLPGDDKPDPGLEPFSTVAADDRQRLWAYFTEGGSDNAGLFLDYAAALV
TGAEKPQPAKPLLKAGIWWPGAGVIGVSEWQSLVQGRMVAREGFEPPTVGICFYRALVQSGETRPVEALIDALEAEGVRA
LPVFVSSLKDAVSVGTLQAIFSEAAPDVVMNATGFAVSSPGADRQPTVLESTGAPVLQVIFSGSSRAQWETSPQGLMARD
LAMNVALPEVDGRILARAVSFKAASIYDAKVEANIVGHEPLEGRVRFAADLAVNWANVRRAEPAERRIAIVMANYPNRDG
RLGNGVGLDTPAGTVEVLSAMAREGYAVGEVPADGDALIRFLMAGPTNAASHDREIRERISLNDYKTFFDSLPKQIKDEV
AGRWGVPEADPFFLDGAFALPLARFGEVIVGIQPARGYNIDPKESYHSPDLVPPHGYLAFYAFLRQQFGAQAIVHMGKHG
NLEWLPGKALALSETCYPEAIFGPLPHIYPFIVNDPGEGTQAKRRTSAVIIDHLTPPLTRAESYGPLKDLEALVDEYYDA
AGGDPRRLRLLSRQILDLVRDIGLDSDAGIDRGDSDDKALEKLDAYLCDLKEMQIRDGLHIFGVAPEGRLLTDLTVALAR
VPRGLGEGGDQSLQRAIAADAGLRGFAIPTSAGGNPARDAQPFDPLDCVMSDTWTGPKPSILADLSDAPWRTAGDTVERI
ELLAANLVSGELACPDHWANTRAVLGEIETRLKPSISNSGAAEMTGFLTGLSGRFVAPGPSGAPTRGRPDVLPTGRNFYS
VDSRAVPTPAAYELGKKSAELLIRRYLQDHGEWPSSFGLTAWGTANMRTGGDDIAQALALIGAKPTWDMVSRRVMGYEIV
PLAVLGRPRVDVTLRISGFFRDAFPDQIALFDKAIRAVALEEDDADNMIAARMRAESRRLEAEGVEAAEAARRASYRVFG
AKPGAYGAALQALIDEKGWETKADLAEAYLTWGAYAYGAGEEGKAERDLFEERLRTIEAVVQNQDNREHDLLDSDDYYQF
EGGMSAAAEQLGGHRPAIYHNDHSRPEKPVIRSLEEEIGRVVRARVVNPKWIDGVMRHGYKGAFEIAATVDYMFAFAATT
GAVRDHHFEAAYQAFIVDERVADFMRDKNPAAFAELAERLLEAIDRNLWTPRSNSARFELAGIGTAATRLRAGNE
>P29930 2.5.1.17~~~cobO~~~Corrinoid adenosyltransferase~~~
MSDETTVGGEAPAEKDDARHAMKMAKKKAAREKIMATKTDEKGLIIVNTGKGKGKSTAGFGMIFRHIAHGMPCAVVQFIK
GAMATGERELIEKHFGDVCQFYTLGEGFTWETQDRARDVAMAEKAWEKAKELIRDERNSMVLLDEINIALRYDYIDVAEV
VRFLKEEKPHMTHVVLTGRNAKEDLIEVADLVTEMELIKHPFRSGIKAQQGVEF
>P29931 2.7.1.156~~~cobP~~~Bifunctional adenosylcobalamin biosynthesis protein CobP~~~
MSSLSAGPVLVLGGARSGKSSFSERLVEASGFTMHYVATGRAWDDEMRERIDHHRTRRGEGWTTHEEPLDLVGILRRIDD
PSHVVLIDCLTLWVTNLMLEERDMTAEFAALVAYLPEARARLVFVSNEVGLGIVPENRMAREFRDHAGRLHQIVAEKSAE
VYFVAAGLPLKMKG
>P9WP95 ~~~cobQ~~~Cobyric acid synthase~~~COG1492
MSGLLVAGTTSDAGKSAVTAGLCRALARRGVRVAPFKAQNMSNNSMVCRGPDGTGVEIGRAQWVQALAARTTPEAAMNPV
LLKPASDHRSHVVLMGKPWGEVASSSWCAGRRALAEAACRAFDALAARYDVVVAEGAGSPAEINLRAGDYVNMGLARHAG
LPTIVVGDIDRGGVFAAFLGTVALLAAEDQALVAGFVVNKFRGDSDLLAPGLRDLERVTGRRVYGTLPWHPDLWLDSEDA
LDLQGRRAAGTGARRVAVVRLPRISNFTDVDALGLEPDLDVVFASDPRALDDADLIVLPGTRATIADLAWLRARDLDRAL
LVHVAAGKPLLGICGGFQMLGRVIRDPYGIEGPGGQVTEVEGLGLLDVETAFSPHKVLRLPRGEGLGVPASGYEIHHGRI
TRGDTAEEFLGGARDGPVFGTMWHGSLEGDALREAFLRETLGLAPSGSCFLAARERRLDLLGDLVERHLDVDALLNLARH
GCPPTLPFLAPGAP
>P29932 ~~~cobQ~~~Cobyric acid synthase~~~
MTRRIMLQGTGSDVGKSVLVAGLCRLAANQGLKVRPFKPQNMSNNAAVSDDGGEIGRAQWLQALAARVPSSVHMNPVLLK
PQSDVGSQIVVQGKVAGQARGREYQALKPKLLGAVMESFEQISAGADLVVVEGAGSPAEINLRPGDIANMGFATRANVPV
VLVGDIDRGGVIASLVGTHAILPEEDRRMVTGYLINKFRGDVTLFDDGIAAVNRYTGWPCFGVVPWLKAAARLPAEDSVV
LEKLTRGEGRALKVAVPVLSRIANFDDLDPLAAEPEIDLVFVRPGSPIPVDAGLVVIPGSKSTIGDLIDFRAQGWDRDLE
RHVRRGGRVIGICGGYQMLGRRVTDPLGIEGGERAVEGLGLLEVETEMAPEKTVRNSRAWSLEHDVVLEGYEIHLGKTQG
ADCGRPSVRIDNRADGALSADGRVMGTYLHGLFTSDAYRGALLKSFGIEGGANNYRQSVDAALDDVANELEAVLDRRWLD
ELLRH
>P36561 2.7.8.26~~~cobS~~~Adenosylcobinamide-GDP ribazoletransferase~~~COG0368
MSKLFWAMLSFITRLPVPRRWSQGLDFEHYSRGIITFPLIGLLLGAISGLVFMVLQAWCGAPLAALFSVLVLVLMTGGFH
LDGLADTCDGVFSARSRDRMLEIMRDSRLGTHGGLALIFVVLAKILVLSELALRGESILASLAAACAVSRGTAALLMYRH
RYAREEGLGNVFIGKIDGRQTCVTLGLAAIFAAVLLPGMHGVAAMVVTMVAIFILGQLLKRTLGGQTGDTLGAAIELGEL
VFLLALL
>Q05602 2.7.8.26~~~cobS~~~Adenosylcobinamide-GDP ribazoletransferase~~~
MSKLFWAMLAFISRLPVPSRWSQGLDFEQYSRGIVMFPFIGLILGGVSGLIFILLQPWCGIPLAALFCILALALLTGGFH
LDGLADTCDGIFSARRRERMLEIMRDSRLGTHGGLALIFVLLAKILVVSELALRGTPMLAALAAACAAGRGSAVLLMYRH
RYAREEGLGNVFIGKVSGRQTCITLGLAVIVATVLLPGMQGLAAMVVTCAAIFILGQLLKRTLGGQTGDTLGAAIELGEL
IFLLALL
>P29933 6.6.1.2~~~cobS~~~Aerobic cobaltochelatase subunit CobS~~~
MMSKIDLDISNLPDTTISVREVFGIDTDLRVPAYSKGDAYVPDLDPDYLFDRETTLAILAGFAHNRRVMVSGYHGTGKST
HIEQVAARLNWPCVRVNLDSHVSRIDLVGKDAIVVKDGLQVTEFKDGILPWAYQHNVALVFDEYDAGRPDVMFVIQRVLE
SSGRLTLLDQSRVIRPHPAFRLFATANTVGLGDTTGLYHGTQQINQAQMDRWSIVTTLNYLPHDKEVDIVAAKVKGFTAD
KGRETVSKMVRVADLTRAAFINGDLSTVMSPRTVITWAENAHIFGDIAFAFRVTFLNKCDELERALVAEHYQRAFGIELK
ECAANIVLEATA
>P9WP85 2.4.2.21~~~cobT~~~Nicotinate-nucleotide--dimethylbenzimidazole phosphoribosyltransferase~~~COG2038
MIGFAPVSTPDAAAEAAARARQDSLTKPRGALGSLEDLSVWVASCQQRCPPRQFERARVVVFAGDHGVARSGVSAYPPEV
TAQMVANIDAGGAAINALADVAGATVRVADLAVDADPLSERIGAHKVRRGSGNIATEDALTNDETAAAITAGQQIADEEV
DAGADLLIAGDMGIGNTTAAAVLVAALTDAEPVAVVGFGTGIDDAGWARKTAAVRDALFRVRPVLPDPVGLLRCAGGADL
AAIAGFCAQAAVRRTPLLLDGVAVTAAALVAERLAPGAHRWWQAGHRSSEPGHGLALAALGLDPIVDLHMRLGEGTGAAV
ALMVLRAAVAALSSMATFTEAGVSTRSVDGVDRTAPPAVSP
>Q05603 2.4.2.21~~~cobT~~~Nicotinate-nucleotide--dimethylbenzimidazole phosphoribosyltransferase~~~
MQTLHALLRDIPAPDAEAMARAQQHIDGLLKPPGSLGRLETLAVQLAGMPGLNGTPQVGEKAVLVMCADHGVWDEGVAVS
PKIVTAIQAANMTRGTTGVCVLAAQAGAKVHVIDVGIDAEPIPGVVNMRVARGCGNIAVGPAMSRLQAEALLLEVSRYTC
DLAQRGVTLFGVGELGMANTTPAAAMVSVFTGSDAKEVVGIGANLPPSRIDNKVDVVRRAIAINQPNPRDGIDVLSKVGG
FDLVGMTGVMLGAARCGLPVLLDGFLSYSAALAACQIAPAVRPYLIPSHFSAEKGARIALAHLSMEPYLHMAMRLGEGSG
AALAMPIVEAACAMFHNMGELAASNIVLPEGNANAT
>P29934 6.6.1.2~~~cobT~~~Aerobic cobaltochelatase subunit CobT~~~
MSSNSKAKPTTRENAAEPFKRALSGCIRSIAGDAEVEVAFANERPGMTGERIRLPELSKRPTLQELAVTRGLGDSMALRK
ACTHARIQRTMSPQGADARAIFDAVEQARVEAIGSLRMAGVAKNLNVMLEEKYAKANFATIERQADAPLGEAVALLVREK
LTGQKPPASAGKVLDLWREFIEGKAAGDIEHLSSTINNQQAFARVVRDMLTSMEVAEKYGDDDNEPDEQESETDEDQPRS
QEQDENASDEEAGDDAAPADENQAAEEQMEEGEMDGAEISDDDLQDEGDEDSETPGEVKRPNQPFADFNEKVDYAVFTRE
FDETIASEELCDEAELDRLRAFLDKQLAHLQGAVGRLANRLQRRLMAQQNRSWEFDLEEGYLDSARLQRIIIDPMQPLSF
KREKDTNFRDTVVTLLIDNSGSMRGRPITVAATCADILARTLERCGVKVEILGFTTKAWKGGQSREKWLAGGKPQAPGRL
NDLRHIVYKSADAPWRRARRNLGLMMREGLLKENIDGEALIWAHERLMARREQRRILMMISDGAPVDDSTLSVNPGNYLE
RHLRAVIEQIETRSPVELLAIGIGHDVTRYYRRAVTIVDADELAGAMTEQLAALFEDESQRRGSSRLRRAG
>Q7SIC7 2.4.2.21~~~cobT~~~Nicotinate-nucleotide--dimethylbenzimidazole phosphoribosyltransferase~~~
MDPEVFAQARLRMDQLTKPPRALGYLEEVALRLAALQGRVKPELGRGAVVVAAADHGVVAEGVSAYPQEVTRQMVLNFLR
GGAAINQFALAADCAVYVLDVGVVGELPDHPGLLKRKVRPGTANLAQGPAMTPEEAERALLAGREAARRAIAEGATLLAA
GDMGIGNTTAAAALTAALLGLPPEAVVGRGTGVGEEGLRRKRQAVARALARLHPGMGPLEVAAEVGGLELVAIAGIYLEG
YEAGLPLVLDGFPVTAGALLAWKMAPGLRDHLFAGHLSREPGHRHQLEALGLRPLLDLDLALGEGTGAVLAMPLLRAAAR
ILHMATFQEAGVSRG
>A1JTP8 2.4.2.21~~~cobT~~~Nicotinate-nucleotide--dimethylbenzimidazole phosphoribosyltransferase~~~COG2038
MQTLSSILRTIAPLDSKAMARATTRLDGLLKPQGSLGRLEQLAIQLAGMRGLYGHQVDRKQIIVMAADHGVYDEGVAISP
RVVTMVQALNMVRGVTGVCVLAANAGAEVKIVDVGIDSDTLPGVIDMKVARGSGNIARGAAMTRQQAEDLLIASATLTLQ
QAAGGVKVFGVGELGMANTTPAAAMVSVFTDSDPELAVGIGANFPSEQLHHKVAVVRRAIETNQPDASDGIDVLAKVGGF
DLVGMTGVMLGAAAAGLPVVLDGFLSYASALAACRIEAKVRDYLIPSHLSAEKGAVIALNHLQLEPYLQMGMRLGEGSGA
ALAMHLVDAACAMYNNMGSLAESNIELPGCVN
>P0AE76 2.7.1.156~~~cobU~~~Bifunctional adenosylcobalamin biosynthesis protein CobU~~~COG2087
MMILVTGGARSGKSRHAEALIGDSSQVLYIATSQILDDEMAARIEHHRQGRPEHWRTVERWQHLDELIHADINPNEVVLL
ECVTTMVTNLLFDYGGDKDPDEWDYQAMEQAINAEIQSLIAACQRCPAKVVLVTNEVGMGIVPESRLARHFRDIAGRVNQ
QLAAAANEVWLVVSGIGVKIK
>Q05599 2.7.1.156~~~cobU~~~Bifunctional adenosylcobalamin biosynthesis protein CobU~~~
MMILVTGGARSGKSRHAEALIGDAPQVLYIATSQILDDEMAARIQHHKDGRPAHWRTAECWRHLDTLITADLAPDDAILL
ECITTMVTNLLFALGGENDPEQWDYAAMERAIDDEIQILIAACQRCPAKVVLVTNEVGMGIVPENRLARHFRDIAGRVNQ
RLAAAADEVWLVVSGIGVKIK
>P29935 2.4.2.21~~~cobU~~~Nicotinate-nucleotide--dimethylbenzimidazole phosphoribosyltransferase~~~
MSASGLPFDDFRELLRNLPGPDAAALVAARERDAQLTKPPGALGRLEEIAFWLAAWTGKAPVVNRPLVAIFAGNHGVTRQ
GVTPFPSSVTAQMVENFAAGGAAINQICVSHDLGLKVFDLALEYPTGDITEEAALSERDCAATMAFGMEAIAGGTDLLCI
GEMGIGNTTIAAAINLGLYGGTAEEWVGPGTGSEGEVLKRKIAAVEKAVALHRDHLSDPLELMRRLGGREIAAMAGAILA
ARVQKVPVIIDGYVATAAASILKAANPSALDHCLIGHVSGEPGHLRAIEKLGKTPLLALGMRLGEGTGAALAAGIVKAAA
ACHSGMATFAQAGVSNKE
>P29936 2.7.8.26~~~cobV~~~Adenosylcobinamide-GDP ribazoletransferase~~~
MGFVGDFCDDVARSIGFLSRIPMPARHFEGYDGRLSRAVRAFPFAGLAIALPSAAVAMALMALQVSSLFAAFVVVAIQAL
VTGALHEDGLGDTADGFGGGRDREAALAIMKDSRIGTYAAVALILSFGLRVSAFASILPLFSPLGAAMAILGAACLSRAA
MVWHWSSLPPARSSGVAASAGEPEPAATRFALAFGLLVAMLLFYLAQVPALGVIAALVAFLATVKGFARLAMRKIGGQTG
DTIGATQQLTEIAVLGALALTV
>Q9L9D7 3.1.1.84~~~cocE~~~Cocaine esterase~~~
MVDGNYSVASNVMVPMRDGVRLAVDLYRPDADGPVPVLLVRNPYDKFDVFAWSTQSTNWLEFVRDGYAVVIQDTRGLFAS
EGEFVPHVDDEADAEDTLSWILEQAWCDGNVGMFGVSYLGVTQWQAAVSGVGGLKAIAPSMASADLYRAPWYGPGGALSV
EALLGWSALIGTGLITSRSDARPEDAADFVQLAAILNDVAGAASVTPLAEQPLLGRLIPWVIDQVVDHPDNDESWQSISL
FERLGGLATPALITAGWYDGFVGESLRTFVAVKDNADARLVVGPWSHSNLTGRNADRKFGIAATYPIQEATTMHKAFFDR
HLRGETDALAGVPKVRLFVMGIDEWRDETDWPLPDTAYTPFYLGGSGAANTSTGGGTLSTSISGTESADTYLYDPADPVP
SLGGTLLFHNGDNGPADQRPIHDRDDVLCYSTEVLTDPVEVTGTVSARLFVSSSAVDTDFTAKLVDVFPDGRAIALCDGI
VRMRYRETLVNPTLIEAGEIYEVAIDMLATSNVFLPGHRIMVQVSSSNFPKYDRNSNTGGVIAREQLEEMCTAVNRIHRG
PEHPSHIVLPIIKR
>P25524 3.5.4.1~~~codA~~~Cytosine deaminase~~~COG0402
MSNNALQTIINARLPGEEGLWQIHLQDGKISAIDAQSGVMPITENSLDAEQGLVIPPFVEPHIHLDTTQTAGQPNWNQSG
TLFEGIERWAERKALLTHDDVKQRAWQTLKWQIANGIQHVRTHVDVSDATLTALKAMLEVKQEVAPWIDLQIVAFPQEGI
LSYPNGEALLEEALRLGADVVGAIPHFEFTREYGVESLHKTFALAQKYDRLIDVHCDEIDDEQSRFVETVAALAHHEGMG
ARVTASHTTAMHSYNGAYTSRLFRLLKMSGINFVANPLVNIHLQGRFDTYPKRRGITRVKEMLESGINVCFGHDDVFDPW
YPLGTANMLQVLHMGLHVCQLMGYGQINDGLNLITHHSARTLNLQDYGIAAGNSANLIILPAENGFDALRRQVPVRYSVR
GGKVIASTQPAQTTVYLEQPEAIDYKR
>P0AA82 ~~~codB~~~Cytosine permease~~~COG1457
MSQDNNFSQGPVPQSARKGVLALTFVMLGLTFFSASMWTGGTLGTGLSYHDFFLAVLIGNLLLGIYTSFLGYIGAKTGLT
THLLARFSFGVKGSWLPSLLLGGTQVGWFGVGVAMFAIPVGKATGLDINLLIAVSGLLMTVTVFFGISALTVLSVIAVPA
IACLGGYSVWLAVNGMGGLDALKAVVPAQPLDFNVALALVVGSFISAGTLTADFVRFGRNAKLAVLVAMVAFFLGNSLMF
IFGAAGAAALGMADISDVMIAQGLLLPAIVVLGLNIWTTNDNALYASGLGFANITGMSSKTLSVINGIIGTVCALWLYNN
FVGWLTFLSAAIPPVGGVIIADYLMNRRRYEHFATTRMMSVNWVAILAVALGIAAGHWLPGIVPVNAVLGGALSYLILNP
ILNRKTTAAMTHVEANSVE
>Q81WK7 ~~~codY~~~Global transcriptional regulator CodY~~~COG4465
MELLAKTRKLNALLQSAAGKPVNFREMSDTMCEVIEANVFVVSRRGKLLGYAIHQQIENERMKQMLAERQFPEEYTQSLF
NITETSSNLDVNSAYTAFPVENKELFGQGLTTIVPIVGGGERLGTLVLARLGQEFLDDDLILAEYSSTVVGMEILREKAE
EIEEEARSKAVVQMAISSLSYSELEAIEHIFEELNGTEGLLVASKIADRVGITRSVIVNALRKLESAGVIESRSLGMKGT
YIKVLNDKFLHELAKLKTN
>Q819X8 ~~~codY~~~Global transcriptional regulator CodY~~~
MELLAKTRKLNALLQSAAGKPVNFREMSDTMCEVIEANVFVVSRRGKLLGYAIHQQIENERMKQMLAERQFPEEYTQSLF
NITETSSNLDVNSAYTAFPVENRELFGQGLTTIVPIVGGGERLGTLVLARLGQEFLDDDLILAEYSSTVVGMEILREKAE
EIEEEARSKAVVQMAISSLSYSELEAIEHIFEELNGTEGLLVASKIADRVGITRSVIVNALRKLESAGVIESRSLGMKGT
YIKVLNDKFLQELAKLKTN
>P39779 ~~~codY~~~Global transcriptional regulator CodY~~~COG4465
MALLQKTRIINSMLQAAAGKPVNFKEMAETLRDVIDSNIFVVSRRGKLLGYSINQQIENDRMKKMLEDRQFPEEYTKNLF
NVPETSSNLDINSEYTAFPVENRDLFQAGLTTIVPIIGGGERLGTLILSRLQDQFNDDDLILAEYGATVVGMEILREKAE
EIEEEARSKAVVQMAISSLSYSELEAIEHIFEELDGNEGLLVASKIADRVGITRSVIVNALRKLESAGVIESRSLGMKGT
YIKVLNNKFLIELENLKSH
>A2RHP2 ~~~codY~~~Global transcriptional regulator CodY~~~COG4465
MATLLEKTRKITAILQDGVTDLQQELPYNSMTERLANVIDCNACVINTKGELLGYSLPYNTNNDRVDQFFYDRKLPDEYV
RAAVRIYDTMANVPVDRPLAIFPEESLSDFPKGVTTLAPIYGSGMRLGTFIMWREDGEFTDDDLVLVELATTVIGVQLSN
LKLEQMEENIRKDTMATMAVNTLSYSEMKAVKAIIEELDGEEGHVIASVIADKIGITRSVIVNALRKLESAGVIESRSLG
MKGTYLKVLNTGLFDKLAGRNF
>A7X1N2 ~~~codY~~~Global transcriptional regulator CodY~~~
MSLLSKTRELNTLLQKHKGIAVDFKDVAQTISSVTVTNVFIVSRRGKILGSSLNELLKSQRIIQMLEERHIPSEYTERLM
EVKQTESNIDIDNVLTVFPPENRELFIDSRTTIFPILGGGERLGTLVLGRVHDDFNENDLVLGEYAATVIGMEILREKHS
EVEKEARDKAAITMAINSLSYSEKEAIEHIFEELGGTEGLLIASKVADRVGITRSVIVNALRKLESAGVIESRSLGMKGT
FIKVKKEKFLDELEKSK
>Q2FHI3 ~~~codY~~~Global transcriptional regulator CodY~~~
MSLLSKTRELNTLLQKHKGIAVDFKDVAQTISSVTVTNVFIVSRRGKILGSSLNELLKSQRIIQMLEERHIPSEYTERLM
EVKQTESNIDIDNVLTVFPPENRELFIDSRTTIFPILGGGERLGTLVLGRVHDDFNENDLVLGEYAATVIGMEILREKHS
EVEKEARDKAAITMAINSLSYSEKEAIEHIFEELGGTEGLLIASKVADRVGITRSVIVNALRKLESAGVIESRSLGMKGT
FIKVKKEKFLDELEKSK
>Q2FZ27 ~~~codY~~~Global transcriptional regulator CodY~~~COG4465
MSLLSKTRELNTLLQKHKGIAVDFKDVAQTISSVTVTNVFIVSRRGKILGSSLNELLKSQRIIQMLEERHIPSEYTERLM
EVKQTESNIDIDNVLTVFPPENRELFIDSRTTIFPILGGGERLGTLVLGRVHDDFNENDLVLGEYAATVIGMEILREKHS
EVEKEARDKAAITMAINSLSYSEKEAIEHIFEELGGTEGLLIASKVADRVGITRSVIVNALRKLESAGVIESRSLGMKGT
FIKVKKEKFLDELEKSK
>P63844 ~~~codY~~~Global transcriptional regulator CodY~~~
MSLLSKTRELNTLLQKHKGIAVDFKDVAQTISSVTVTNVFIVSRRGKILGSSLNELLKSQRIIQMLEERHIPSEYTERLM
EVKQTESNIDIDNVLTVFPPENRELFIDSRTTIFPILGGGERLGTLVLGRVHDDFNENDLVLGEYAATVIGMEILREKHS
EVEKEARDKAAITMAINSLSYSEKEAIEHIFEELGGTEGLLIASKVADRVGITRSVIVNALRKLESAGVIESRSLGMKGT
FIKVKKEKFLDELEKSK
>Q04JG7 ~~~codY~~~Global transcriptional regulator CodY~~~COG4465
MAHLLEKTRKITSILKRSEEQLQDELPYNAITRQLADIIHCNACIINSKGRLLGYFMRYKTNTDRVEQFFQTKIFPDDYV
QGANMIYETEANLPVEHDMSIFPVESRDDFPDGLTTIAPIHVSGIRLGSLIIWRNDKKFEDEDLVLVEIASTVVGIQLLN
FQREEDEKNIRRRTAVTMAVNTLSYSELRAVSAILGELNGNEGQLTASVIADRIGITRSVIVNALRKLESAGIIESRSLG
MKGTYLKVLISDIFEEVKKRDY
>Q97PM1 ~~~codY~~~Global transcriptional regulator CodY~~~COG4465
MAHLLEKTRKITSILKRSEEQLQDELPYNAITRQLADIIHCNACIINSKGRLLGYFMRYKTNTDRVEQFFQTKIFPDDYV
QGANMIYETEANLPVEHDMSIFPIESRDDFPDGLTTIAPIHVSGIRLGSLIIWRNDKKFEDEDLVLVEIASTVVGIQLLN
FQREEDEKNIRRRTAVTMAVNTLSYSELRAVSAILGELNGNEGKLTASVIADRIGITRSVIVNALRKLESAGIIESRSLG
MKGTYLKVLISDIFEEVKKRDY
>P46891 3.6.1.-~~~cof~~~HMP-PP phosphatase~~~COG0561
MARLAAFDMDGTLLMPDHHLGEKTLSTLARLRERDITLTFATGRHALEMQHILGALSLDAYLITGNGTRVHSLEGELLHR
DDLPADVAELVLYQQWDTRASMHIFNDDGWFTGKEIPALLQAFVYSGFRYQIIDVKKMPLGSVTKICFCGDHDDLTRLQI
QLYEALGERAHLCFSATDCLEVLPVGCNKGAALTVLTQHLGLSLRDCMAFGDAMNDREMLVSVGSGFIMGNAMPQLRAEL
PHLPVIGHCRNQAVSHYLTHWLDYPHLPYSPE
>O31604 ~~~coiA~~~Competence protein CoiA~~~COG4469
MFHLLGAQQNQKLKRRRFFCPVCGGELAVKLGLQKAPHFAHKQNKSCAIDIEPESAYHLEGKRQLYVWLKTQRASPILEP
YIRTINQRPDVMARIKEHMLAVEYQCATIAPDVFQKRTEGFKQEGIIPQWIMGYSRLKRTASSFYQLSTFHWQFINASPY
RELICYCPERRSFLRLSHIIPFYTNHSYSSVQTIPIHRAGAGDLFFTEPKPSIQYSGWTKAIHRFRHKPHRFNSKETNRL
RLLFYEKRQTPFSFLPTEVFVPVRKGAVFKSPVFVWQGFLYLFMTDLGGKRAPIRFSAVLQQCKLHIHNKNIALRSECSE
ECLSEAVKQYIDFLCKKGFLRETQKEVYVLNQPAGGIHSMQDLIERDRSCFIE
>Q81BJ6 3.4.24.3~~~colA~~~Collagenase ColA~~~
MNKNLRFTQMMIGISTMALSFGSIQTQVSAEETAPYNILQMKPMGTETSKDEIVHATKADETLNFEERLKIGDFSQRPTL
VMKRDEIQLKQSYTLAELNKMPNSELIDTLSKISWNQITDLFQFNQDTKAFYQNKERMNVIINELGQRGRTFTKENSKGI
ETFVEVLRSAFYVGYYNNELSYLKERSFHDKCLPALKAIAKNRNFTLGTAEQDRVVTAYGKLIGNASSDTETVQYAVNVL
KHYNDNLSTYVSDYAKGQAVYEIVKGIDYDIQSYLQDTNKQPNETMWYGKIDNFINEVNRIALVGNITNENSWLINNGIY
YAGRLGKFHSNPNKGLEVITQAMNLYPRLSGAYFVAVEQMKTNYGGKDYSGNAVDLQKIREEGKQQYLPKTYTFDDGSIV
FKTGDKVTEEKIKRLYWAAKEVKAQYHRVIGNDKALEPGNADDVLTIVIYNNPDEYQLNRQLYGYETNNGGIYIEEKGTF
FTYERTPKQSIYSLEELFRHEFTHYLQGRYEVPGLFGSGEMYQNERLTWFQEGNAEFFAGSTRTNNVVPRKSMISGLSSD
PASRYTVKQTLFSKYGSWDFYKYSFALQSYLYNHQFETFDKLQDLIRANDVKNYDSYRESLSNNTQLNAEYQAYMQQLID
NQDKYNVPKVTNDYLIQHAPKPLAEVKNEIVDVANIKDAKITKYESQFFNTFTVEGKYTGGTSKGESEDWKAMSKQVNQT
LEQLSQKGWSGYKTVTAYFVNYRVNAANQFEYDIVFHGVATEEKEKTTTIVNMNGPYSGIVNEEIQFHSDGTKSENGKVI
SYLWNFGDGTTSTEANPTHVYGEKGTYTVELTVKDSRGKESKEQTKVTVKQDPQTSESYEEEKVLPFNTLVKGNLITPDQ
TDVYTFNVTDPKEVDISVVNEQNIGMTWVLYHESDMQNYVACGEDEGNVIKGKFAAKPGKYYLNVYKFDDKNGEYSLLVK
>P43153 3.4.24.3~~~colA~~~Collagenase ColA~~~
MKKNLKRGELTKLKLVERWSATFTLAAFILFNSSFKVLAADKKVENSNNGQITREINADQISKTELNNEVATDNNRPLGP
SIAPSRARNNKIYTFDELNRMNYSDLVELIKTISYENVPDLFNFNDGSYTFFSNRDRVQAIIYGLEDSGRTYTADDDKGI
PTLVEFLRAGYYLGFYNKQLSYLNTPQLKNECLPAMKAIQYNSNFRLGTKAQDGVVEALGRLIGNASADPEVINNCIYVL
SDFKDNIDKYGSNYSKGNAVFNLMKGIDYYTNSVIYNTKGYDAKNTEFYNRIDPYMERLESLCTIGDKLNNDNAWLVNNA
LYYTGRMGKFREDPSISQRALERAMKEYPYLSYQYIEAANDLDLNFGGKNSSGNDIDFNKIKADAREKYLPKTYTFDDGK
FVVKAGDKVTEEKIKRLYWASKEVKAQFMRVVQNDKALEEGNPDDILTVVIYNSPEEYKLNRIINGFSTDNGGIYIENIG
TFFTYERTPEESIYTLEELFRHEFTHYLQGRYVVPGMWGQGEFYQEGVLTWYEEGTAEFFAGSTRTDGIKPRKSVTQGLA
YDRNNRMSLYGVLHAKYGSWDFYNYGFALSNYMYNNNMGMFNKMTNYIKNNDVSGYKDYIASMSSDYGLNDKYQDYMDSL
LNNIDNLDVPLVSDEYVNGHEAKDINEITNDIKEVSNIKDLSSNVEKSQFFTTYDMRGTYVGGRSQGEENDWKDMNSKLN
DILKELSKKSWNGYKTVTAYFVNHKVDGNGNYVYDVVFHGMNTDTNTDVHVNKEPKAVIKSDSSVIVEEEINFDGTESKD
EDGEIKAYEWDFGDGEKSNEAKATHKYNKTGEYEVKLTVTDNNGGINTESKKIKVVEDKPVEVINESEPNNDFEKANQIA
KSNMLVKGTLSEEDYSDKYYFDVAKKGNVKITLNNLNSVGITWTLYKEGDLNNYVLYATGNDGTVLKGEKTLEPGRYYLS
VYTYDNQSGTYTVNVKGNLKNEVKETAKDAIKEVENNNDFDKAMKVDSNSKIVGTLSNDDLKDIYSIDIQNPSDLNIVVE
NLDNIKMNWLLYSADDLSNYVDYANADGNKLSNTCKLNPGKYYLCVYQFENSGTGNYIVNLQNK
>P43154 3.4.24.3~~~~~~Microbial collagenase~~~COG3291
MELKILSVAIATTLTSTGVFALSEPVSQVTEQHAHSAHTHGVEFNRVEYQPTATLPIQPSKATRVQSLESLDESSTACDL
EALVTESSNQLISEILSQGATCVNQLFSAESRIQESVFSSDHMYNIAKHTTTLAKGYTGGGSDELETLFLYLRAGYYAEF
YNDNISFIEWVTPAVKESVDAFVNTASFYENSDRHGKVLSEVIITMDSAGLQHAYLPQVTQWLTRWNDQYAQHWYMRNAV
NGVFTILFGGQWNEQFVQIIGNQTDLAKALGDFALRASSIGAEDEFMAANAGRELGRLTKYTGNASSVVKSQLSRIFEQY
EMYGRGDAVWLAAADTASYYADCSEFGICNFETELKGLVLSQTYTCSPTIRILSQNMTQEQHAAACSKMGYEEGYFHQSL
ETGEQPVKDDHNTQLQVNIFDSSTDYGKYAGPIFDISTDNGGMYLEGDPSQPGNIPNFIAYEASYANADHFVWNLEHEYV
HYLDGRFDLYGGFSHPTEKIVWWSEGIAEYVAQENDNQAALETILDGSTYTLSEIFETTYDGFDVDRIYRWGYLAVRFMF
ENHKDDVNQMLVETRQGNWINYKATITQWANLYQSEFEQWQQTLVSNGAPNAVITANSKGKVGESITFSSENSTDPNGKI
VSVLWDFGDGSTSTQTKPTHQYGSEGEYSVSLSVTDSEGLTATATHTVVISALGGNDTLPQDCAVQSKVSGGRLTAGEPV
CLANQQTIWLSVPAVNESSNLAITTGNGTGNLKLEYSNSGWPDDTNLHGWSDNIGNGECITLSNQSNYWGYVKVSGDFEN
AAIVVDFDAQKCRQ
>Q56696 3.4.24.3~~~prt~~~Microbial collagenase~~~COG4934
MSHIRFFPRHRLALACMLASVSSFSFAQNQCAVADLQQSRDLAAAVSGAEYDCYHAWFSAPSATLNDIYSEASLSRIQVA
LDQEIARYRGEAEQARVLENLGEFVRAAYYVRYNAGTGTPEFSEALSQRFAQSTNLFLNNPHALDQGREQVGAMKSLTLM
VDNVKQLPLTMDSMMAALMHFNRDTAKDTQWVDGLNNLFRSMAGHAANDAFYRYMANNTHHIDTLARFASDNAWALDTDA
NFIVFNALRETGRLLASPDQETKRKALAVMQQVMQRYPLGSEHDKLWLAAVEMMSYYAPEGLNGLNLEQAKQDLAARVMP
NRFECQGPAIIRSEDLTDAQAAKACEVLAAKEADFHQVANTGNQPVADDLNDRVEVAVFASNDSYVDYSSFLFGNTTDNG
GQYLEGTPSRADNTARFVAYRYANGEDLSILNLEHEYTHYLDARFNQYGSFSDNLAHGHIVWWLEGFAEYMHYKQGYKAA
IDLIPSGKLSLSTVFDTTYSHDSNRIYRWGYLAVRFMLENHPQDVESLLALSRSGQFAQWAQQVTVLGQQYDAEFERWLD
TLEVVVEPEQPGTDPEEPSEPTDPEVQVTELAANQSLQLSGEAYSEKLFYVDVPANTVRFNVSIEGAGDADLYMSYNKVA
HYYDFEMSQYADGSNEEIQFAPEQNGYVKAGRYYISLTGRDSYDSVNLVAALEVEAQTPPTQVQDDLAPVVLESGEAKVL
TVHQQRYAAVYVPEGVKEVRVWMSSQSNANDPYGAGNVDLYASRKHWPTAEQHEYASNYAGSNEYLAIPVTEAGYVHFSL
QAPQQGDDVEMLVYFF
>G4WJD3 1.1.1.356~~~colC~~~GDP-L-colitose synthase~~~
MKILLTGAGGMVGKNILAHTKSKDYEFITPSSKELDLLEKKHITTYLKHHKPNFIIHAAGIVGGIHANINNPVKFLVENM
QMGINLLTAAKDNNIRKLLNLGSSCMYPKDCDSGLTEDMILTGELESTNEGYALAKITSAKLCEYINREDSEFQYKTAIP
CNLYGKYDKFDENNSHMIPAVIKKIVTAIETGKSEVEIWGDGEARREFMYAEDLADFIFYTINNFTKMPQNINVGLGQDY
TITEYYKVIAKILGYKGTFVYDKSKPVGMRRKLIDNTLLSEFGWSNKVDLESGISKTCQYFLNEKNND
>D3QY10 4.2.1.168~~~colD~~~GDP-4-keto-6-deoxy-D-mannose 3-dehydratase~~~
MINYPLASSTWDDLEYKAIQSVLDSKMFTMGEYVKQYETQFAKTFGSKYAVMVSSGSTANLLMIAALFFTKKPRLKKGDE
IIVPAVSWSTTYYPLQQYGLRVKFVDIDINTLNIDIESLKEAVTDSTKAILTVNLLGNPNNFDEINKIIGGRDIILLEDN
CESMGATFNNKCAGTFGLMGTFSSFYSHHIATMEGGCIVTDDEEIYHILLCIRAHGWTRNLPKKNKVTGVKSDDQFEESF
KFVLPGYNVRPLEMSGAIGIEQLKKLPRFISVRRKNAEYFLDKFKDHPYLDVQQETGESSWFGFSFIIKKDSGVIRKQLV
ENLNSAGIECRPIVTGNFLKNTDVLKYFDYTVHNNVDNAEYLDKNGLFVGNHQIELFDEIDYLREVLK
>G4WJD4 4.2.1.168~~~colD~~~GDP-4-keto-6-deoxy-D-mannose 3-dehydratase~~~
MRKIMINFPLASSTWDEKELNAIQRIIDSNMFTMGESVKQYEKDFAEYFGSKYSVMVSSGSTANLLMIAALFFTKKPKFK
RGDEVIVPAVSWSTTYFPLQQYGLNVRFVDIDKKTLNIDLDKLKSAITEKTKAILAVNLLGNPNDFDAITKITEGKDIFI
LEDNCESMGARLNGKQAGTYGLMGTFSSFFSHHIATMEGGCVITDDEELYHILLCIRAHGWTRNLPEFNHITGQKSIDPF
EESFKFVLPGYNVRPLEMSGAIGIEQLKKLPSFIEMRRKNATIFKELFSSHPYIDIQQETGESSWFGFALILKESSPITR
AELVKKLIEAGIECRPIVTGNFLKNKEVLKFFDYTIAGEVTDAEYIDKHGLFVGNHQIDLSEQIKNLFNILKK
>Q9X721 3.4.24.3~~~colG~~~Collagenase ColG~~~
MKKNILKILMDSYSKESKIQTVRRVTSVSLLAVYLTMNTSSLVLAKPIENTNDTSIKNVEKLRNAPNEENSKKVEDSKND
KVEHVKNIEEAKVEQVAPEVKSKSTLRSASIANTNSEKYDFEYLNGLSYTELTNLIKNIKWNQINGLFNYSTGSQKFFGD
KNRVQAIINALQESGRTYTANDMKGIETFTEVLRAGFYLGYYNDGLSYLNDRNFQDKCIPAMIAIQKNPNFKLGTAVQDE
VITSLGKLIGNASANAEVVNNCVPVLKQFRENLNQYAPDYVKGTAVNELIKGIEFDFSGAAYEKDVKTMPWYGKIDPFIN
ELKALGLYGNITSATEWASDVGIYYLSKFGLYSTNRNDIVQSLEKAVDMYKYGKIAFVAMERITWDYDGIGSNGKKVDHD
KFLDDAEKHYLPKTYTFDNGTFIIRAGDKVSEEKIKRLYWASREVKSQFHRVVGNDKALEVGNADDVLTMKIFNSPEEYK
FNTNINGVSTDNGGLYIEPRGTFYTYERTPQQSIFSLEELFRHEYTHYLQARYLVDGLWGQGPFYEKNRLTWFDEGTAEF
FAGSTRTSGVLPRKSILGYLAKDKVDHRYSLKKTLNSGYDDSDWMFYNYGFAVAHYLYEKDMPTFIKMNKAILNTDVKSY
DEIIKKLSDDANKNTEYQNHIQELADKYQGAGIPLVSDDYLKDHGYKKASEVYSEISKAASLTNTSVTAEKSQYFNTFTL
RGTYTGETSKGEFKDWDEMSKKLDGTLESLAKNSWSGYKTLTAYFTNYRVTSDNKVQYDVVFHGVLTDNADISNNKAPIA
KVTGPSTGAVGRNIEFSGKDSKDEDGKIVSYDWDFGDGATSRGKNSVHAYKKAGTYNVTLKVTDDKGATATESFTIEIKN
EDTTTPITKEMEPNDDIKEANGPIVEGVTVKGDLNGSDDADTFYFDVKEDGDVTIELPYSGSSNFTWLVYKEGDDQNHIA
SGIDKNNSKVGTFKSTKGRHYVFIYKHDSASNISYSLNIKGLGNEKLKEKENNDSSDKATVIPNFNTTMQGSLLGDDSRD
YYSFEVKEEGEVNIELDKKDEFGVTWTLHPESNINDRITYGQVDGNKVSNKVKLRPGKYYLLVYKYSGSGNYELRVNK
>Q46085 3.4.24.3~~~colH~~~Collagenase ColH~~~
MKRKCLSKRLMLAITMATIFTVNSTLPIYAAVDKNNATAAVQNESKRYTVSYLKTLNYYDLVDLLVKTEIENLPDLFQYS
SDAKEFYGNKTRMSFIMDEIGRRAPQYTEIDHKGIPTLVEVVRAGFYLGFHNKELNEINKRSFKERVIPSILAIQKNPNF
KLGTEVQDKIVSATGLLAGNETAPPEVVNNFTPILQDCIKNIDRYALDDLKSKALFNVLAAPTYDITEYLRATKEKPENT
PWYGKIDGFINELKKLALYGKINDNNSWIIDNGIYHIAPLGKLHSNNKIGIETLTEVMKVYPYLSMQHLQSADQIKRHYD
SKDAEGNKIPLDKFKKEGKEKYCPKTYTFDDGKVIIKAGARVEEEKVKRLYWASKEVNSQFFRVYGIDKPLEEGNPDDIL
TMVIYNSPEEYKLNSVLYGYDTNNGGMYIEPEGTFFTYEREAQESTYTLEELFRHEYTHYLQGRYAVPGQWGRTKLYDND
RLTWYEEGGAELFAGSTRTSGILPRKSIVSNIHNTTRNNRYKLSDTVHSKYGASFEFYNYACMFMDYMYNKDMGILNKLN
DLAKNNDVDGYDNYIRDLSSNYALNDKYQDHMQERIDNYENLTVPFVADDYLVRHAYKNPNEIYSEISEVAKLKDAKSEV
KKSQYFSTFTLRGSYTGGASKGKLEDQKAMNKFIDDSLKKLDTYSWSGYKTLTAYFTNYKVDSSNRVTYDVVFHGYLPNE
GDSKNSLPYGKINGTYKGTEKEKIKFSSEGSFDPDGKIVSYEWDFGDGNKSNEENPEHSYDKVGTYTVKLKVTDDKGESS
VSTTTAEIKDLSENKLPVIYMHVPKSGALNQKVVFYGKGTYDPDGSIAGYQWDFGDGSDFSSEQNPSHVYTKKGEYTVTL
RVMDSSGQMSEKTMKIKITDPVYPIGTEKEPNNSKETASGPIVPGIPVSGTIENTSDQDYFYFDVITPGEVKIDINKLGY
GGATWVVYDENNNAVSYATDDGQNLSGKFKADKPGRYYIHLYMFNGSYMPYRINIEGSVGR
>B9J3S4 3.4.24.3~~~colQ1~~~Collagenase ColQ1~~~
MNKKSKINKVMLSISTMALSLGALQAPASAEEKVPYNVLKTKPVGIEKPVDEIGHVSKAEETLSFQERLKVGDFSQRPAS
IPNKAAVKQVKESYSMADLNKMNDQELVETLGCIKWHQITDLFQFNEDAKAFYKDKGKMQVIIDELAHRGSTFTRDDSKG
IQTFTEVLRSAFYLAFYNNELSELNERSFQDKCLPALKAIAKNPNFKLGTAEQDTVVSAYGKLISNASSDVETVQYASNI
LKQYNDNFNTYVNDRMKGQAIYDIMQGIDYDIQSYLIEARKEANETMWYGKVDGFINEINRIALLNEVTPENKWLVNNGI
YFASRLGKFHSNPNKGLEVVTQAMHMYPRLSEPYFVAVEQITTNYNGKDYSGNTVDLEKIRKEGKEQYLPKTYTFDDGSI
VFKTGDKVSEEKIKRLYWAAKEVKAQYHRVIGNDKALEPGNADDILTIVIYNSPEEYQLNRQLYGYETNNGGIYIEETGT
FFTYERTPEQSIYSLEELFRHEFTHYLQGRYEVPGLFGRGDMYQNERLTWFQEGNAEFFAGSTRTNNVVPRKSIISGLSS
DPASRYTAERTLFAKYGSWDFYNYSFALQSYLYTHQFETFDKIQDLIRANDVKNYDAYRENLSKDPKLNKEYQEYMQQLI
DNQDKYNVPAVADDYLAEHAPKSLTAVEKEMTETLPMKDAKMTKHSSQFFNTFTLEGTYTGSVTKGESEDWNAMSKKVNE
VLEQLAQKEWSGYKTVTAYFVNYRVNSSNEFEYDVVFHGIAKDDGENKAPTVNINGPYNGLVKEGIQFKSDGSKDEDGKI
VSYLWDFGDGRTSTEVNPVHVYEREGSYKVALIVKDDKGKESKSETTVTVKDGSLTESEPNNRPEEANRIGLNTTIKGSL
IGGDHTDVYTFNVASAKDIDISVLNEYGIGMTWVLHHESDMQNYAAYGQANGNHIEANFNAKPGEYYLYVYKYDNGDGTY
KLSVK
>Q899Y1 3.4.24.3~~~colT~~~Collagenase ColT~~~
MKKKFIKMLCSIAIGCMISTSYSIKVSAFSNGNTKTNPNGEFKSLSLNSTNPYKTKYSFNDLNKLSNKEILDLTSKIKWS
DISDLFQYNKDSYTFYSNKERVQALIDGLYEKGCNYTSTDDKGIDTLVEILRSGFYLGYYNDSLKYLNDKSFKDKCIPAM
IAIENNKNFKLGENGQDTVVHALGKLIGNTSCNDEVVNKTIPILEQYYNEIDKYSKDRLKSNAVYNFMKEINYDISQYEY
AHNIRDYKNTPWSGKIDSFIDTISKFASISNVTKDNGWIINNSIYYTAKLSKYHSNPSIPHSVIDNCIEIFPDYSEQYFT
AIEAIKEDFNSRDSKGNVIDINKLIEEGKKHYLPKTYTFDNGKIIIKAGDKVEESKIQKLYWASKEVKSQFHRIIGNDKP
LEVGNADDILTIVIYNNPEEYKLNKTLYGYSVDNGGIYIEGIGTFFTYERTPQESIYSLEELFRHEFTHYLQGRYLIPGL
FNKGDFYKGNNGRITWFEEGSAEFFAGSTRTSVLPRKSMVGGLSKNPKERFNADKLLHSKYSDGWDFYKYGYAFSDYMYN
NNKKLFSDLVSTMKNNDVKGYEALIEESSKDSKINKDYEYHMENLVNNYDNYTIPLVSDDYMKQYDNKSLHEIKSDIEKA
MDVKNSQITKESSQYFDTYNLKATYTLSSNKGEISNWNYMNNKINEALNKLDNLSWGGYKTVTAYFSNPRLNSNNEVVYD
IVFHGLLSHNKNSNEKVEVKEEPEIKDKDSFENVIYEKENNDSFDKANKIHKNQIVMATLDTEDYRDTFYFDALTSGSID
ITIENIHGNSDAFNWLVYNDEDLNNYIAYPTKKEDNKLMGSFKVHKPGRYYILVYKTSLNKVNYKLNISDATNMAPVIKK
IHEKENNDSFETANKITLDTLVLGNLDYKDVSDIYSFDIENTKDLNIKLTNLNNLGIAWNLYKESDLNNYIAYGAKSDNA
IVGKCNLSPGKYYLYVYKYSGDKGNYSVIIN
>P17811 3.4.23.48~~~pla~~~Coagulase/fibrinolysin~~~
MKKSSIVATIITILSGSANAASSQLIPNISPDSFTVAASTGMLSGKSHEMLYDAETGRKISQLDWKIKNVAILKGDISWD
PYSFLTLNARGWTSLASGSGNMDDYDWMNENQSEWTDHSSHPATNVNHANEYDLNVKGWLLQDENYKAGITAGYQETRFS
WTATGGSYSYNNGAYTGNFPKGVRVIGYNQRFSMPYIGLAGQYRINDFELNALFKFSDWVRAHDNDEHYMRDLTFREKTS
GSRYYGTVINAGYYVTPNAKVFAEFTYSKYDEGKGGTQTIDKNSGDSVSIGGDAAGISNKNYTVTAGLQYRF
>Q97E82 3.1.3.71~~~comB~~~Probable 2-phosphosulfolactate phosphatase~~~COG2045
MKIDLIISADDIKEEKVKNKTAVVIDMLRATSVITTALNNGCKRVVPVLTVEEALKKVKEYGKDAILGGERKGLKIEGFD
FSNSPMEYTEDVVKGKTLIMTTTNGTRAIKGSETARDILIGSVLNGEAVAEKIVELNNDVVIVNAGTYGEFSIDDFICSG
YIINCVMDRMKKLELTDAATTAQYVYKTNEDIKGFVKYAKHYKRIMELGLKKDFEYCCKKDIVKLVPQYTNGEIL
>Q9WZQ4 3.1.3.71~~~comB~~~Probable 2-phosphosulfolactate phosphatase~~~COG2045
MVDVVMAPCSPVECRTAVVIDVLRATSTIVTALSNGASGVIPVKTIEEALEKKKEGVLICGERNAQKPKGFNLGNSPLEY
RKEKISGKTIVLTTTNGTQVIEKIRSEEIIAASFLNLSAVVEYLKSKEDILLVCAGTNGRFSLEDFLLAGAIVKRLKRND
LGDGAHAAERYFESVENTREEIKKHSSHAKRLISLGFENDIEFCTTEDLFKTVPALVNGVFILKEFP
>Q5SID6 3.1.3.71~~~comB~~~Probable 2-phosphosulfolactate phosphatase~~~COG2045
MRLRVDVIPGEHLAYPDVVLVVDVIRATTTAAAFLEAGAEALYWTPSLESALAFKDEDVVLAGETGGLKPPRFDLGNSPR
EALSAQVAGRVVVMSTTNGTKAAHAAARTAKHVLLASLYNAHAAARLARELATEEVAILCAGKEGRAGLDDLYTAGVLAE
YLGFLGEVEPEDGARVALAVKRAYPDPLEALSLSAAALALKQVGLEADVPFCAQVAKSAAVPVLRGRVGEALIFKRA
>Q1QWN5 1.1.1.338~~~comC~~~(2R)-3-sulfolactate dehydrogenase (NADP(+))~~~COG2055
MSRMISLREAETLAVAALEAVGVPRWEAEVTARALIDAERDGLASHGLSRLPFYLAQARSGKVVADAQARVEVAGSVIRV
DARHGLAFPAIARGVERAIPLARELGLVAVAIGGSHHFGVAGAPVERLAREGLVAMAFSNAPSAMAPWGGKRPLYGTNPI
AFATPRRGTDPLVIDLSLSKVARGKVMLAKKAGEPIPEGWALDIEGRPTTDPDAAIAGSMVPAGDAKGASLALMVELLTA
GLTGSHFGFQASSFFEPEGEAPSVGHLMLAFDPAHFSDGYLEHIEALFQAMLEQEGVRLPGTRRHALRRERGESLELPEA
VVDELRAYAVSRV
>P39694 ~~~comEA~~~ComE operon protein 1~~~COG1555
MNWLNQHKKAIILAASAAVFTAIMIFLATGKNKEPVKQAVPTETENTVVKQEANNDESNETIVIDIKGAVQHPGVYEMRT
GDRVSQAIEKAGGTSEQADEAQVNLAEILQDGTVVYIPKKGEETAVQQGGGGSVQSDGGKGALVNINTATLEELQGISGV
GPSKAEAIIAYREENGRFQTIEDITKVSGIGEKSFEKIKSSITVK
>P39695 ~~~comEC~~~ComE operon protein 3~~~COG0658
MRNSRLLLPMAAASATAGITAAAYFPAIFLFILFLLIILIKTRHAFLIIVCFFSFILFFVLYAVTDSQNVSSYRQGTYQF
KAVIDTIPKIDGDRMSMMVETPDKEKWAAAYRIQSAGEKEQLLYIEPGMSCELTGTLEEPNHATVPGAFDYNEYLYRQHI
HWNYSVTSIQNCSEPENFKYKVLSLRKHIISFTNSLLPPDSTGIVQALTVGDRFYVEDEVLTAYQKLGVVHLLAISGLHV
GILTAGLFYIMIRLGITREKASILLLLFLPLYVMLTGAAPSVLRAALMSGVYLAGSLVKWRVRSATAICLSYIVLLLFNP
YHLFEAGFQLSFAVSFSLILSSSIFQQVKTSLGQLTIVSLIAQLGSLPILLYHFHQFSIISVPMNMLMVPFYTFCILPGA
VAGVLLLSLSASFGRLFFSWFDLLISWINRLITNIADVDVFTIMIAHPAPVLLFLFTVTIILLLMAIEKRSLSQLMVTGG
ICCTVMFLLFIYPCLSSEGEVDMIDIGQGDSMFVGAPHQRGRVLIDTGGTLSYSSEPWREKQHPFSLGEKVLIPFLTAKG
IKQLDALILTHADQDHIGEAEILLKHHKVKRLVIPKGFVSEPKDEKVLQAAREEGVAIEEVKRGDVLQIKDLQFHVLSPE
APDPASKNNSSLVLWMETGGMSWILTGDLEKEGEQEVMNVFPNIKADVLKVGHHGSKGSTGEEFIQQLQPKTAIISAGKN
NRYHHPHQKVLQLLQRHSIRVLRTDQNGTIQYRYKNRVGTFSVYPPYDTSDITETN
>P39145 3.6.4.12~~~comFA~~~ComF operon protein 1~~~COG4098
MNVPVEKNSSFSKELQQTLRSRHLLRTELSFSDEMIEWHIKNGYITAENSISINKRRYRCNRCGQTDQRYFSFYHSSGKN
KLYCRSCVMMGRVSEEVPLYSWKEENESNWKSIKLTWDGKLSSGQQKAANVLIEAISKKEELLIWAVCGAGKTEMLFPGI
ESALNQGLRVCIATPRTDVVLELAPRLKAAFQGADISALYGGSDDKGRLSPLMISTTHQLLRYKDAIDVMIIDEVDAFPY
SADQTLQFAVQKARKKNSTLVYLSATPPKELKRKALNGQLHSVRIPARHHRKPLPEPRFVWCGNWKKKLNRNKIPPAVKR
WIEFHVKEGRPVFLFVPSVSILEKAAACFKGVHCRTASVHAEDKHRKEKVQQFRDGQLDLLITTTILERGVTVPKVQTGV
LGAESSIFTESALVQIAGRTGRHKEYADGDVIYFHFGKTKSMLDARKHIKEMNELAAKVECTD
>P39146 ~~~comFB~~~ComF operon protein 2~~~
MLVNSKEIVMKELLDRYMDQLHMACTCQVCQNDVLALSLNKVSPSYVTDFKKIAYTKAELVDKQKNTAMLVILAESAAVV
SESPSDLCQTKQEEAFIN
>P25953 ~~~comGA~~~ComG operon protein 1~~~COG2804
MDSIEKVSKNLIEEAYLTKASDIHIVPRERDAIIHFRVDHALLKKRDMKKEECVRLISHFKFLSAMDIGERRKPQNGSLT
LKLKEGNVHLRMSTLPTINEESLVIRVMPQYNIPSIDKLSLFPKTGATLLSFLKHSHGMLIFTGPTGSGKTTTLYSLVQY
AKKHFNRNIVTLEDPVETRDEDVLQVQVNEKAGVTYSAGLKAILRHDPDMIILGEIRDAETAEIAVRAAMTGHLVLTSLH
TRDAKGAIYRLLEFGINMNEIEQTVIAIAAQRLVDLACPFCENGCSSVYCRQSRNTRRASVYELLYGKNLQQCIQEAKGN
HANYQYQTLRQIIRKGIALGYLTTNNYDRWVYHEKD
>P25955 ~~~comGC~~~ComG operon protein 3~~~COG4537
MNEKGFTLVEMLIVLFIISILLLITIPNVTKHNQTIQKKGCEGLQNMVKAQMTAFELDHEGQTPSLADLQSEGYVKKDAV
CPNGKRIIITGGEVKVEH
>P25959 ~~~comGG~~~ComG operon protein 7~~~
MYRTRGFIYPAVLFVSALVLLIVNFVAAQYISRCMFEKETKELYIGENLLQNGVLLSIRHVLEERKGQEGTQQFLYGRVS
YYIHDTSIKEQKEINLRVSTDSGTERTAQIVFDQKQKKLLRWTE
>P40396 ~~~comK~~~Competence transcription factor~~~COG4903
MSQKTDAPLESYEVNGATIAVLPEEIDGKICSKIIEKDCVFYVNMKPLQIVDRSCRFFGSSYAGRKAGTYEVTKISHKPP
IMVDPSNQIFLFPTLSSTRPQCGWISHVHVKEFKATEFDDTEVTFSNGKTMELPISYNSFENQVYRTAWLRTKFQDRIDH
RVPKRQEFMLYPKEERTKMIYDFILRELGERY
>P45049 ~~~comM~~~Competence protein ComM~~~COG0606
MSLAIVYSRASMGVQAPLVTIEVHLSNGKPGFTLVGLPEKTVKEAQDRVRSALMNAQFKYPAKRITVNLAPADLPKEGGR
FDLPIAIGILAASDQLDASHLKQFEFVAELALTGQLRGVHGVIPAILAAQKSKRELIIAKQNANEASLVSDQNTYFAQTL
LDVVQFLNGQEKLPLATEIVKESAVNFSGKNTLDLTDIIGQQHAKRALTIAAAGQHNLLFLGPPGTGKTMLASRLTGLLP
EMTDLEAIETASVTSLVQNELNFHNWKQRPFRAPHHSASMPALVGGGTIPKPGEISLATNGVLFLDELPEFERKVLDALR
QPLESGEIIISRANAKIQFPARFQLVAAMNPSPTGHYTGTHNRTSPQQIMRYLNRLSGPFLDRFDLSIEVPLLPQGSLQN
TGDRGETSAQVREKVLKVREIQMERAGKINAYLNSKEIERDCKLNDKDAFFLEKALNKLGLSVRAYHRILKVSRTIADLQ
GEQQIFQPHLAEALGYRAMVRLLQKLSNM
>O32049 ~~~comN~~~Post-transcriptional regulator ComN~~~
MEKHPADLYKDHVRPFLYSKLEEFKILGYDDVELESLWSYLTDKKWKKKTELSIYELASDILSVKIGEFMNYATVESFKT
SNWLGSEEGQEALEELLR
>A0A2K4Z9G8 ~~~cmpA~~~Cortex morphogenetic protein A~~~
MPNWLKKQMQKAFLEKDNYQIKLLNQCWYFYRKKHCS
>O30583 ~~~comP~~~Pilin-like competence factor ComP~~~COG4969
MNAQKGFTLIELMIVIAIIGILAAIAIPAYTDYTVRARVSEGLTAASSMKTTVSENILNAGALVAGTPSTAGSSCVGVQE
ISASNATTNVATATCGASSAGQIIVTMDTTKAKGANITLTPTYASGAVTWKCTTTSDKKYVPSECRG
>Q99027 2.7.13.3~~~comP~~~Sensor histidine kinase ComP~~~COG4585
MKNLIKKFTIAVIVLSILYISYTTYISMNGIIIGTKIHKNDKSQFMIEEISESSYGQFVGLRQGDIILKINKEKPSDKHL
KWGYLSHINSLDILRSGKKIHLKDFDLVTLNRPYSFFLFVLPLFFYFLSIICIFYILKVNKKRRSFAAYILILLLLDISI
AYISAGGPFRGHIINRYINLFTFISSPILYLQFIQRYLGEIGKTFLNRISFLYIIPIFNLGIEFFQDYLQVDIDFLATLN
LVSFATLTLFSFSAIYLHLNKYKYAEHSFILKLLILTNTLSFAPFLIFFVLPIIFTGNYIFPALASASLLVLIPFGLVYQ
FVANKMFDIEFILGRMRYYALLAMIPTLLIVGALVLFDVMDIQMNPVRQTVFFFVVMFAVFYFKEVMDFKFRLKRFSEKF
NYQDSIFKYTQLMRGVTSLQQVFKELKNTILDVLLVSKAYTFEVTPDHKVIFLDKHEVGPDWNFYQEEFENVTSEIGKII
EVNQGFLMKVGERGGSSYVLLCLSNINTPRLTRDEISWLKTLSFYTSVSMENVLHIEELMEHLKDLKQEGTNPIWLKKLM
FAIEEKQRSGLARDLHDSVLQDLISLKRQCELFLGDFKKDDNPCREEVQDKLVQMNEQMSDVISMTRETCHELRPQLLYD
LGLVKALSKLVAQQQERVPFHIRLNTGRFTASLDLDSQLNLYRIIQEFLSNAVKHSQATDVLIMLISIQNKIVLHYEDDG
VGFDQEKNTEHSMSMGLSGIKERVRALDGRLRIETSEGKGFKADIEIEL
>D4G0R4 2.5.1.-~~~comQ~~~Tryptophan prenyltransferase ComQ~~~
MSHLVKWNGRGEVVIEQICLDSVRIKEKMKEIVDENILNEDLKVKLISFIKEKKQFSFAELAYYHYIAFDGKNDKAIELL
ASGIELLILSADIFDDIEDKDNLQASWMKLDPSIATNAATALYTLSLQVIGSVSNHPKLLSLTLQYSLQSLQGQHVDLNL
TASSESEYIEMIKLKSGSLVTLPSILGVYLATGEYNETVEEYSRYLGIVEQIANDHYGLYYPNYNDFKTRHTLAFNYLKN
KFNQSSIDLLNFYAQENHMINNLEDLKGKLRESGVIQYLNVIKNLAVENFKESFKKLRLDEQRKNKLLIQLLRGI
>P0DV09 2.5.1.-~~~comQ~~~Tryptophan prenyltransferase ComQ~~~
MKEIVHEKIQNLDLKEYLINFIDEKNHFSFGILSFKHYVALSGNRSSHILTLAGGIELLILAFDIFDDLEDEDNIEIKWM
KIDPSLALNAATTLYTLGLETICSISNSAEFHRLTLKYALNAMQGQHEDLRNSPETEEECIQMMKQKAGSLTAMSAVLAA
MLANGEFNQTIEDYAYKIGIIKQLENDYYGLVNDQRSDIRKKRKTLIYLFLNRKFNEASEKILKLINSHTSYHSFISDSS
KFDELLFEAGLNQYVSMLIKLYEEEITASMNQLNINIKL
>P33690 2.5.1.-~~~comQ~~~Tryptophan prenyltransferase ComQ~~~COG0142
MKEIVEQNIFNEDLSQLLYSFIDSKETFSFAESTILHYVVFGGENLDVATRLGAGIEILILSSDIMDDLEDEDNHHALWM
KINRSESLNAALSLYTVGLTSIYSLNNNPLIFKYVLKYVNEAMQGQHDDITNKSKTEDESLEVIRLKCGSLIALANVAGV
LLATGEYNETVERYSYYKGIIAQISGDYYVLLSGNRSDIEKNKHTLIYLYLKRLFNDASEDLLYLISHKDLYYKSLLDKE
KFQEKLIKAGVTQYISVLLEIYKQKCISAIEQLNLDKEKKELIKECLLSYTKGDTRCKT
>P75952 ~~~comR~~~HTH-type transcriptional repressor ComR~~~COG1309
MATDSTQCVKKSRGRPKVFDRDAALDKAMKLFWQHGYEATSLADLVEATGAKAPTLYAEFTNKEGLFRAVLDRYIDRFAA
KHEAQLFCEEKSVESALADYFAAIANCFTSKDTPAGCFMINNCTTLSPDSGDIANTLKSRHAMQERTLQQFLCQRQARGE
IPPHCDVTHLAEFLNCIIQGMSISAREGASLEKLMQIAGTTLRLWPELVK
>P80355 ~~~comS~~~Competence protein S~~~
MNRSGKHLISSIILYPRPSGECISSISLDKQTQATTSPLYFCWREK
>Q9K5K2 ~~~comX~~~ComX pheromone~~~
MQEIVGYLVKNPEVLDEVMKGRASLLNIDKDQLKSIVDAFGGLQIYTNGNWVPS
>Q9K5K3 ~~~comX~~~ComX pheromone~~~
MMQDLINYFLSYPEVLKKLKNREACLIGFSSNETETIIKAYNDYHLSSPTTREWDG
>Q9K5K8 ~~~comX~~~ComX pheromone~~~
MQEMVGYLIKYPNVLREVMEGNACLLGVDKDQSECIINGFKGLEIYSMLDWKY
>D4G0R3 ~~~comX~~~ComX pheromone~~~
MKHIDKIISHLVNNPEAFDQFKNGNLTLLNINEKEKKAILYAFEQGEVPRTSKWPPIEAISNFFEDDKRKSLI
>P0CY51 ~~~comX~~~ComX pheromone~~~
MKQDMIDYLMKNPQVLTKLENGEASLIGIPDKLIPSIVDIFNKKMTLSKKCKGIFWEQ
>P45453 ~~~comX~~~ComX pheromone~~~
MQDLINYFLNYPEALKKLKNKEACLIGFDVQETETIIKAYNDYYLADPITRQWGD
>P31894 ~~~cooF~~~Iron-sulfur protein~~~
MPPIKENVIIYANPDHCLSCHSCELACAVAHSGGHDMIEAIAANLPLHARNKVVSVDGTAMPMQCRQCEDAPCTFACPTG
ACRQADGQVQIVEQHCIGCKLCVMVCPFGAITVRSETVVEQGACTNRGVAKKCDLCVDWRASTGKTAPACVEACPTKAIR
MVDLDAYRIALREARAREIAKSHRHMRVQF
>P59934 1.2.7.4~~~cooS1~~~Carbon monoxide dehydrogenase 1~~~COG1151
MSNWKNSVDPAVDYLLPIAKKAGIETAWDRYEAMQPQCGFGELGVCCRICWKGPCRIDPFGNGPQRGVCGADAHTIVARN
LIRMIAAGAAAHSEHGRHIALTLLEVGEGHAPAYRIKDEQKLRKIAEKLNLAPAGKDIRQVAKEVALASLDDYSRQKRNV
PCNWAKETLTAERVDKLAELGVMPHNIDAVITEIMGRTHVGCDADAVNLLLGGIKGALADYTGMCLSTELSDVIFGTPKP
VITQANLGVLKEDAVNIAVHGHNPLLSEIICDVALKMNEEAKKAGAKEGINVVGICCTGNEVMMRRGIPLATNYLSQEMA
IITGALDAMVVDVQCIMPALTSVAECFHTEIITTMAENKITGATHIEFREDSAVESAKKIVEVAIEAFKKRDKRKVNIPD
CKQTAITGFSAEAIMAVLSKLNANDPLKPLIDNIINGNIQGIALFAGCNNPKAIHDNSFITIAKELAKNNVLMLATGCGA
GAFAKNGLMTQEATEAYAGESLKAVLTALGKAAGLNGPLPLVLHMGSCVDNSRAVNVAVAIANKLGVDLDKLPLVASAPE
FMSEKAVAIGTWAVTLGIPTHIGIVPQIMGSSVVVEFLTEKAKDLLGGYFIVETNPELAAAKLVAVIKERRRGLGI
>Q9F8A8 1.2.7.4~~~cooS2~~~Carbon monoxide dehydrogenase 2~~~COG1151
MAKQNLKSTDRAVQQMLDKAKREGIQTVWDRYEAMKPQCGFGETGLCCRHCLQGPCRINPFGDEPKVGICGATAEVIVAR
GLDRSIAAGAAGHSGHAKHLAHTLKKAVQGKAASYMIKDRTKLHSIAKRLGIPTEGQKDEDIALEVAKAALADFHEKDTP
VLWVTTVLPPSRVKVLSAHGLIPAGIDHEIAEIMHRTSMGCDADAQNLLLGGLRCSLADLAGCYMGTDLADILFGTPAPV
VTESNLGVLKADAVNVAVHGHNPVLSDIIVSVSKEMENEARAAGATGINVVGICCTGNEVLMRHGIPACTHSVSQEMAMI
TGALDAMILDYQCIQPSVATIAECTGTTVITTMEMSKITGATHVNFAEEAAVENAKQILRLAIDTFKRRKGKPVEIPNIK
TKVVAGFSTEAIINALSKLNANDPLKPLIDNVVNGNIRGVCLFAGCNNVKVPQDQNFTTIARKLLKQNVLVVATGCGAGA
LMRHGFMDPANVDELCGDGLKAVLTAIGEANGLGGPLPPVLHMGSCVDNSRAVALVAALANRLGVDLDRLPVVASAAEAM
HEKAVAIGTWAVTIGLPTHIGVLPPITGSLPVTQILTSSVKDITGGYFIVELDPETAADKLLAAINERRAGLGLPW
>P31896 1.2.7.4~~~cooS~~~Carbon monoxide dehydrogenase~~~
MTHHDCAHCSSDACATEMLNLAEANSIETAWHRYEKQQPQCGFGSAGLCCRICLKGPCRIDPFGEGPKYGVCGADRDTIV
ARHLVRMIAAGTAAHSEHGRHIALAMQHISQGELHDYSIRDEAKLYAIAKTLGVATEGRGLLAIVGDLAAITLGDFQNQD
YDKPCAWLAASLTPRRVKRLGDLGLLPHNIDASVAQTMSRTHVGCDADPTNLILGGLRVAMADLDGSMLATELSDALFGT
PQPVVSAANLGVMKRGAVNIAVNGHNPMLSDIICDVAADLRDEAIAAGAAEGINIIGICCTGHEVMMRHGVPLATNYLSQ
ELPILTGALEAMVVDVQCIMPSLPRIAECFHTQIITTDKHNKISGATHVPFDEHKAVETAKTIIRMAIAAFGRRDPNRVA
IPAFKQKSIVGFSAEAVVAALAKVNADDPLKPLVDNVVNGNIQGIVLFVGCNTTKVQQDSAYVDLAKSLAKRNVLVLATG
CAAGAFAKAGLMTSEATTQYAGEGLKGVLSAIGTAAGLGGPLPLVMHMGSCVDNSRAVALATALANKLGVDLSDLPLVAS
APECMSEKALAIGSWAVTIGLPTHVGSVPPVIGSQIVTKLVTETAKDLVGGYFIVDTDPKSAGDKLYAAIQERRAGLGL
>P25921 ~~~~~~Cop-6 protein~~~
MVVDRKEEKKVAVTLRLTTEENEILNRIKEKYNISKSDATGILIKKYAKEEYGAF
>O32220 7.2.2.8~~~copA~~~Copper-exporting P-type ATPase~~~COG2217
MSEQKEIAMQVSGMTCAACAARIEKGLKRMPGVTDANVNLATETSNVIYDPAETGTAAIQEKIEKLGYHVVTEKAEFDIE
GMTCAACANRIEKRLNKIEGVANAPVNFALETVTVEYNPKEASVSDLKEAVDKLGYKLKLKGEQDSEAAAKKKEERKQTA
RLIFSAVLSFPLLWAMVSHFTFTSFIWVPDIFLNPWMQFALATPVQFLIGWPFYVGAYKALRNKSANMDVLVALGTTAAY
AYSLYLTFQSIGSHGHTDGLYYETSAILLTLILLGKLFETKAKGRSSDAIKKLMKLQAKTATVVRDGQEQIIPIDEVLVN
DIVYVKPGERIPVDGEVVEGRSAVDESMITGESLPVDKNPGDSVTGSTVNANGFLKIKAVNVGKDTALSHIIKIVEEAQG
SKAPIQRLADQISGIFVPIVLGIAVLTFLIWYLWAAPGDFAEAISKFIAVLVIACPCALGLATPTSIMAGSGRAAEFGIL
FKGGEHLEKTHRLDTIVLDKTGTVTNGKPRLTDAIPFGRFEEKDLLQFAAAAETGSEHPLGEAIIAGVKDKGLEIPKLTR
FEAKVGAGILAEAGGKSILVGTRKLMESEQVEHGALLAQMEELEAEGKTVMLVSIDGEAAGLVAVADTIKDTSRKAVARL
KELGLDVIMMTGDNRRTAEAIAKEAGIANIIAEVLPEQKAAEIARLQKEGRQTAMVGDGINDAPALATADIGMAIGTGTD
IAMETADITLIRGDLNSIADAIRMSRLTMKNIKQNLFWALGYNSLGIPIAALGFLAPWIAGAAMAFSSVSVVLNALRLQK
VK
>Q59385 7.2.2.8~~~copA~~~Copper-exporting P-type ATPase~~~COG2217
MSQTIDLTLDGLSCGHCVKRVKESLEQRPDVEQADVSITEAHVTGTASAEQLIETIKQAGYDASVSHPKAKPLAESSIPS
EALTAVSEALPAATADDDDSQQLLLSGMSCASCVTRVQNALQSVPGVTQARVNLAERTALVMGSASPQDLVQAVEKAGYG
AEAIEDDAKRRERQQETAVATMKRFRWQAIVALAVGIPVMVWGMIGDNMMVTADNRSLWLVIGLITLAVMVFAGGHFYRS
AWKSLLNGAATMDTLVALGTGVAWLYSMSVNLWPQWFPMEARHLYYEASAMIIGLINLGHMLEARARQRSSKALEKLLDL
TPPTARLVTDEGEKSVPLAEVQPGMLLRLTTGDRVPVDGEITQGEAWLDEAMLTGEPIPQQKGEGDSVHAGTVVQDGSVL
FRASAVGSHTTLSRIIRMVRQAQSSKPEIGQLADKISAVFVPVVVVIALVSAAIWYFFGPAPQIVYTLVIATTVLIIACP
CALGLATPMSIISGVGRAAEFGVLVRDADALQRASTLDTVVFDKTGTLTEGKPQVVAVKTFADVDEAQALRLAAALEQGS
SHPLARAILDKAGDMQLPQVNGFRTLRGLGVSGEAEGHALLLGNQALLNEQQVGTKAIEAEITAQASQGATPVLLAVDGK
AVALLAVRDPLRSDSVAALQRLHKAGYRLVMLTGDNPTTANAIAKEAGIDEVIAGVLPDGKAEAIKHLQSEGRQVAMVGD
GINDAPALAQADVGIAMGGGSDVAIETAAITLMRHSLMGVADALAISRATLHNMKQNLLGAFIYNSIGIPVAAGILWPFT
GTLLNPVVAGAAMALSSITVVSNANRLLRFKPKE
>P32113 7.2.2.8~~~copA~~~Probable copper-importing P-type ATPase A~~~COG2217
MATNTKMETFVITGMTCANCSARIEKELNEQPGVMSATVNLATEKASVKYTDTTTERLIKSVENIGYGAILYDEAHKQKI
AEEKQTYLRKMKFDLIFSAILTLPLMLAMIAMMLGSHGPIVSFFHLSLVQLLFALPVQFYVGWRFYKGAYHALKTKAPNM
DVLVAIGTSAAFALSIYNGFFPSHSHDLYFESSSMIITLILLGKYLEHTAKSKTGDAIKQMMSLQTKTAQVLRDGKEETI
AIDEVMIDDILVIRPGEQVPTDGRIIAGTSALDESMLTGESVPVEKKEKDMVFGGTINTNGLIQIQVSQIGKDTVLAQII
QMVEDAQGSKAPIQQIADKISGIFVPIVLFLALVTLLVTGWLTKDWQLALLHSVSVLVIACPCALGLATPTAIMVGTGVG
AHNGILIKGGEALEGAAHLNSIILDKTGTITQGRPEVTDVIGPKEIISLFYSLEHASEHPLGKAIVAYGAKVGAKTQPIT
DFVAHPGAGISGTINGVHYFAGTRKRLAEMNLSFDEFQEQALELEQAGKTVMFLANEEQVLGMIAVADQIKEDAKQAIEQ
LQQKGVDVFMVTGDNQRAAQAIGKQVGIDSDHIFAEVLPEEKANYVEKLQKAGKKVGMVGDGINDAPALALADVGIAMGS
GTDIAMETADVTLMNSHLTSINQMISLSAATLKKIKQNLFWAFIYNTIGIPFAAFGFLNPIIAGGAMAFSSISVLLNSLS
LNRKTIK
>Q5ZWR1 7.2.2.8~~~copA~~~Copper-exporting P-type ATPase~~~COG2217
MKHDHHQGHTHSGKGHACHHEHNSPKTQQASSKMEGPIVYTCPMHPEIRQSAPGHCPLCGMALEPETVTVSEVVSPEYLD
MRRRFWIALMLTIPVVILEMGGHGLKHFISGNGSSWIQLLLATPVVLWGGWPFFKRGWQSLKTGQLNMFTLIAMGIGVAW
IYSMVAVLWPGVFPHAFRSQEGVVAVYFEAAAVITTLVLLGQVLELKAREQTGSAIRALLKLVPESAHRIKEDGSEEEVS
LDNVAVGDLLRVRPGEKIPVDGEVQEGRSFVDESMVTGEPIPVAKEASAKVIGATINQTGSFVMKALHVGSDTMLARIVQ
MVSDAQRSRAPIQRLADTVSGWFVPAVILVAVLSFIVWALLGPQPALSYGLIAAVSVLIIACPCALGLATPMSIMVGVGK
GAQSGVLIKNAEALERMEKVNTLVVDKTGTLTEGHPKLTRIVTDDFVEDNALALAAALEHQSEHPLANAIVHAAKEKGLS
LGSVEAFEAPTGKGVVGQVDGHHVAIGNARLMQEHGGDNAPLFEKADELRGKGASVMFMAVDGKTVALLVVEDPIKSSTP
ETILELQQSGIEIVMLTGDSKRTAEAVAGTLGIKKVVAEIMPEDKSRIVSELKDKGLIVAMAGDGVNDAPALAKADIGIA
MGTGTDVAIESAGVTLLHGDLRGIAKARRLSESTMSNIRQNLFFAFIYNVLGVPLAAGVLYPLTGLLLSPMIAAAAMALS
SVSVIINALRLKRVTL
>P12374 ~~~copA~~~Copper resistance protein A~~~
MESRTSRRTFVKGLAAAGVLGGLGLWRSPSWAASGSPALSVLSGTEFDLSIGEMPVNITGRRRTAMAINGGLPGPLLRWK
EGDTVTLRVRNRLDAATSIHWHGIILPPNMDGVPGLSFAGIEPGGVYVYQFKVQQNGTYWYHSHSGFQEQVGVYGPLVIE
AKEPEPFKYDSEHVVMLTDWTDEDPVSLMRTLKKQSDYYNFHKRTVGDFVNDVADKGWAATVADRKMWAEMKMNPTDLAD
VSGATYTYLLNGQAPNMNWTGLFRPGEKLRLRFINGSAMTYFDIRIPGLKMTVVASDGQFVNPVEVDELRIAVAETFDVI
VEPTAEAYTVFAQSMDRTGYARGTLAVREGLVAQVPPLDPRPLVTMDDMGMGGMDHGSMDGMSGMDSGADDGMQTMSSMG
GDSMPAMDHSKMSTMQGMDHGAMSGMDHGAMGGMVMQSHPASENDNPLVDMQAMSPTAKLNDPGLGLRNNGRKVLTYADL
KSTFEDPDGREPSRTIELHLTGHMEKFAWSFDGIKFADAQPLILKYGERVRIVLVNDTMMTHPIHLHGMWSDLEDEDGNF
RVRKHTIDMPPGSKRSYRVTADALGRWAYHCHLLYHMEMGMFREVRVEE
>Q8ZR95 7.2.2.8~~~copA~~~Copper-exporting P-type ATPase~~~
MSQTIDLTLDGLSCGHCVKRVKESLEQRPDVELADVTVTEAHVTGTASADALIETIKQAGYGATLSHPKAKPLTESSIPS
EALAAVPHELPVATADEESQQLLLSGMSCASCVTRVQHALQSVPGVTQARVNLAERTALVMGSASAADLVQAVEKAGYGA
EAIEDDIKRRERQQETAIATMKRFRWQAIVALAVGIPVMVWGMIGDNMMVTDDNRSLWLAIGLITLAVMVFAGGHFYRNA
WKSLLNGTATMDTLVALGTGVAWLYSMSVNLWPQWFPMEARHLYYEASAMIIGLINLGHMLEARARQRSSKALEKLLDLT
PPTARVVTEDGEKSVPLADVQPGMLLRLTTGDRVPVDGEITQGEAWLDEAMLTGEPIPQQKGEGDSVHAGTVVQDGSVLF
RASAVGSHTTLSRIIRMVRQAQSSKPEIGQLADKISAVFVPVVVAIALFSAAIWYFFGPAPQIVYTLVIATTVLIIACPC
ALGLATPMSIISGVGRAAEFGVLVRDADALQRASTLDTLVFDKTGTLTEGKPQVVAIKTFNGVEEAQALRLAAALEQGSS
HPLAHAILEKAGDDKLPQVNGFRTLRGLGVSGEAEGHQLLLGNQALLNEQHVATDDMTAEITAQASQGSTPVLLAIDGKA
AALLAVRDPLRSDSIAALERLHNAGYRLVMLTGDNPTTANAIAKEAGIDEVIAGVLPDGKADAIKRLQSQGRQVAMVGDG
INDAPALAQADVGIAMGGGSDVAIETAAITLMRHSLMGVADALAISRATLRNMKQNLLGAFIYNSIGIPVAAGILWPFTG
TLLNPVVAGAAMALSSITVVSNANRLLRFKPKA
>Q2FV64 7.2.2.8~~~copA~~~Copper-exporting P-type ATPase~~~COG2217
MANTKKTTLDITGMTCAACSNRIEKKLNKLDDVNAQVNLTTEKATVEYNPDQHDVQEFINTIQHLGYGVAVETVELDITG
MTCAACSSRIEKVLNKMDGVQNATVNLTTEQAKVDYYPEETDADKLVTRIQKLGYDASIKDNNKDQTSRKAEALQHKLIK
LIISAVLSLPLLMLMFVHLFNMHIPALFTNPWFQFILATPVQFIIGWQFYVGAYKNLRNGGANMDVLVAVGTSAAYFYSI
YEMVRWLNGSTTQPHLYFETSAVLITLILFGKYLEARAKSQTTNALGELLSLQAKEARILKDGNEVMIPLNEVHVGDTLI
VKPGEKIPVDGKIIKGMTAIDESMLTGESIPVEKNVDDTVIGSTMNKNGTITMTATKVGGDTALANIIKVVEEAQSSKAP
IQRLADIISGYFVPIVVGIALLTFIVWITLVTPGTFEPALVASISVLVIACPCALGLATPTSIMVGTGRAAENGILFKGG
EFVERTHQIDTIVLDKTGTITNGRPVVTDYHGDNQTLQLLATAEKDSEHPLAEAIVNYAKEKQLILTETTTFKAVPGHGI
EATIDHHHILVGNRKLMADNDISLPKHISDDLTHYERDGKTAMLIAVNYSLTGIIAVADTVKDHAKDAIKQLHDMGIEVA
MLTGDNKNTAQAIAKQVGIDTVIADILPEEKAAQIAKLQQQGKKVAMVGDGVNDAPALVKADIGIAIGTGTEVAIEAADI
TILGGDLMLIPKAIYASKATIRNIRQNLFWAFGYNIAGIPIAALGLLAPWVAGAAMALSSVSVVTNALRLKKMRLEPRRK
DA
>Q7A3E6 7.2.2.8~~~copA~~~Copper-exporting P-type ATPase~~~
MANTKKTTLDITGMTCAACSNRIEKKLNKLDDVNAQVNLTTEKATVEYNPDQHDVQEFINTIQHLGYGVTVETVELDITG
MTCAACSSRIEKVLNKMNGVQNATVNLTTEQAKVDYYPEETDADKLVTRIQKLGYDASIKDNNKDQTSRKAEALQHKLIK
LIISAVLSLPLLMLMFVHLFNMHIPALFTNPWFQFILATPVQFIIGWQFYVGAYKNLRNGGANMDVLVAVGTSAAYFYSI
YEMVRWLNGSTTQPHLYFETSAVLLTLILFGKYLEARAKSQTTNALGELLSLQAKEARILKDGNEVMIPLNEVHVGDTLI
VKPGEKIPVDGKIIKGMTAIDESMLTGESIPVEKNVDDTVIGSTMNKNGTITMTATKVGGDTALANIIKVVEEAQSSKAP
IQRLADIISGYFVPIVVGIALLIFIVWITLVTPGTFEPALVASISVLVIACPCALGLATPTSIMVGTGRAAENGILFKGG
EFVERTHQIDTIVLDKTGTITNGRPVVTDYHGDNQTLQLLATAEKDSEHPLAEAIVNYAKEKQLTLTETTTFKAVPGHGI
EATIDHHHILVGNRKLMADNDISLPKHISDDLTHYERDGKTAMLIAVNYSLTGIIAVADTVKDHAKDAIKQLHDMGIEVA
MLTGDNKNTAQAIAKQVGIDTVIADILPEEKAAQIAKLQQQGKKVAMVGDGVNDAPALVKADIGIAIGTGTEVAIEAADI
TILGGDLMLIPKAIYASKATIRNIRQNLFWAFGYNIAGIPIAALGLLAPWVAGAAMALSSVSVVTNALRLKKMRLEPRRK
DA
>P05425 7.2.2.8~~~copB~~~Copper-exporting P-type ATPase B~~~COG2217
MNNGIDPENETNKKGAIGKNPEEKITVEQTNTKNNLQEHGKMENMDQHHTHGHMERHQQMDHGHMSGMDHSHMDHEDMSG
MNHSHMGHENMSGMDHSMHMGNFKQKFWLSLILAIPIILFSPMMGMSFPFQVTFPGSNWVVLVLATILFIYGGQPFLSGA
KMELKQKSPAMMTLIAMGITVAYVYSVYSFIANLINPHTHVMDFFWELATLIVIMLLGHWIEMNAVSNASDALQKLAELL
PESVKRLKKDGTEETVSLKEVHEGDRLIVRAGDKMPTDGTIDKGHTIVDESAVTGESKGVKKQVGDSVIGGSINGDGTIE
ITVTGTGENGYLAKVMEMVRKAQGEKSKLEFLSDKVAKWLFYVALVVGIIAFIAWLFLANLPDALERMVTVFIIACPHAL
GLAIPLVVARSTSIAAKNGLLLKNRNAMEQANDLDVIMLDKTGTLTQGKFTVTGIEILDEAYQEEEILKYIGALEAHANH
PLAIGIMNYLKEKKITPYQAQEQKNLAGVGLEATVEDKDVKIINEKEAKRLGLKIDPERLKNYEAQGNTVSFLVVSDKLV
AVIALGDVIKPEAKEFIQAIKEKNIIPVMLTGDNPKAAQAVAEYLGINEYYGGLLPDDKEAIVQRYLDQGKKVIMVGDGI
NDAPSLARATIGMAIGAGTDIAIDSADVVLTNSDPKDILHFLELAKETRRKMIQNLWWGAGYNIIAIPLAAGILAPIGLI
LSPAVGAVLMSLSTVVVALNALTLK
>P12375 ~~~copB~~~Copper resistance protein B~~~
MTVLNRLHVCSLLAVSSLGMLPVGVFAAEAAMPGVDHSQMQGMDHSKMQGMDHSQMQGMDHSKMQGMDHSQMQGMDSDMT
TMAPSKPAAPTQSRTPIAPVTDANRAAVYRSAKGHTVHDEAANYFLLFDQLEWQDADNGSVLNWDVNGWVGGDIDRLWIR
SEGERTNGKTESAELQALWGHAISPWWDLVGGVRQDFKPGSPQTWAAFGLQGLALYNFEAEATAFLGEGGQTGLRLEGDY
DILLTNRLILQPTAEVNFYGQSDPQRGIGSGLSETEVGVRLRYEIRREFAPYIGVTWNRSYGNTADFAREEGEDRSEARL
VLGVRMWF
>Q7NWF2 4.3.99.-~~~copC~~~Arginine ADP-riboxanase CopC~~~
MRVENHSPSLSKLNPPEAGSGDPTAIGRRLSGIRRAPLPHVSAGSDGEAAAAGKIGAFLRKAVAAQSYGLMFANGKLFEA
TGDALEKRGQYGFSALQRLDGLSRRNLAAVEARLGALDSAERGLKERIMTGAWHFRHQSNAALDDGKTAAIASNHLLARE
SRSSGGNTFAGDKALLSNHDFVFFGVEFSGRGKQDKPLNHKHSTMDFGANAYVVPDTLPACRHGYLTLTDHFFNRVPGGR
EAEHQDFVGSFPQMGAETGRWIHEGKYRQNAPIFNYRDMKAAVALHLIEFLRDSKDAAFKAYVFDQAMQSGQALDRVLNS
VFQAEFHIPRLMATTDYAKHPLRPMLLKEAVDSVNLPALSGLVSSKGDAVTAMWHAIDKGKDAVAAHLLGNWRFEAGDFA
SAPPGFYHELNYALSEHGASVYILDQFLSRGWAAVNAPFEHVNSGETMLDNAVKYGNREMAAALIKHGADRNLLSEWNGG
KLDALLA
>P12376 ~~~copC~~~Copper resistance protein C~~~
MLLNRTSFVTLFAAGMLVSALAQAHPKLVSSTPAEGSEGAAPAKIELHFSENLVTQFSGAKLVMTAMPGMEHSPMAVKAA
VSGGGDPKTMVITPASPLTAGTYKVDWRAVSSDTHPITGSVTFKVK
>P12377 ~~~copD~~~Copper resistance protein D~~~
MEDPLSIAVRFALYTDLMMLFGLALFGLYSLRGAERRSGAVLPFRPLLSATALIGLLLSVVSIVLMAKAMSGASEWLEAV
PHAEMMVTQTELGTAWLIRMAALVGAAVTIAFNLRVPMASLLMVSLLGGVALATLAWTGHGAMDEGSRRFWHFSADILHL
WSSGGWFGALVAFALMLRPNKVETLQSVQVLSRTLSGFERAGAVIVAFIVLSGVVNYLFIVGPQVSGVVESTYGVLLLGK
LALFGLMVGLASANRFVLSPAFERAVHRGEYARAARSIRYSMALELGAAVLVLGLIAWLGTLSPEMEAGM
>P13920 ~~~copG~~~Protein CopG~~~
MKKRLTITLSESVLENLEKMAREMGLSKSAMISVALENYKKGQEK
>Q9I036 ~~~copI~~~Copper-resistant cuproprotein CopI~~~
MFPRRLLPASLIVLGVLFGASAQASPAHGQAFGKPAQAAQASRSIEVVLGDMYFKPRAIEVKAGETVRFVLKNEGKLLHE
FNLGDAAMHAEHQKEMLEMQQSGMLTPTGMASMDHSQMGHGMADMDHGRMMKHDDPNSVLVEPGKSAELTWTFTKATRLE
FACNIPGHYQAGMVGQLTVQP
>W8FLH9 ~~~copI~~~Copper-resistant cuproprotein CopI~~~
MKNRILRPALLCVAALFATTAQADAGHDHGSAHAGAHAHDADTPYGRPGDAAKAQRTVRVVMSDTMRFDPATITVRRGET
VRFVAANGGRIEHEFVLGTTASLKAHAQEMRAMPDMQHADPGAVRVAAGASGEIVWQFTKAGSFEFACLIPGHFEAGMVG
KVVVR
>Q9KMQ9 ~~~copI~~~Copper-resistant cuproprotein CopI~~~COG4454
MIKKTLLVIALTFTVTTAFAHSMDHSKMDHGAMPMDHSQMMGMEGMSDVGMPAPGAKANKVVHVILSDDMKITFKKDVTI
EPNDVVQFVVMNTGKIDHEFSIGSAVEQLKHREMMRQMGNHEHDSGSTVTVKPGKTKELLWHFQGDNKVEFACNIPGHAE
AGMVKSIEL
>Q58AD3 ~~~copK~~~Copper resistance protein K~~~
MKQKLMVGAFIAAVSLSAAAVDMSNVVKTYDLQDGSKVHVFKDGKMGMENKFGKSMNMPEGKVMETRDGTKIIMKGNEIF
RLDEALRKGHSEGG
>Q48271 ~~~copP~~~COP-associated protein~~~COG2608
MKATFQVPSITCNHCVDKIEKFVGEIEGVSFIDVSVEKKSVVVEFDAPATQDLIKEALLDAGQEVV
>Q02540 ~~~copR~~~Transcriptional activator protein CopR~~~
MKLLVAEDEPKTGIYLQQGLREAGFNVDRVVTGTDAVDQALNEAYDLLILDVMMPGLDGWEVIRRLRTAGQPVPVLFLTA
RDGVDDRVKGLELGADDYLVKPFALSELLARVRTLLRRGSSLQVQTSLQIGDLQVDLLKRRATRGGKRIELTAKEFALLE
LLMRRQGEVLSKSLIASQVWDMNFDSDTNVIEVAIRRLRAKIDDDFEVKLLHTCRGMGYMLEAQDEG
>Q47839 ~~~copY~~~Transcriptional repressor CopY~~~COG3682
MEEKRVLIKISDSEWEVMRVIWTLGQANAQQITQILADSMDWKVATVKTLLGRLVKKEALWTEQEGKKFIYHPAVSEMEN
VRSATENLFSHICAKRVGATIADLVEEATLTQEDIQQIMKQLNKKEPVETIECNCIPGQCECKKQ
>O32221 ~~~copZ~~~Copper chaperone CopZ~~~COG2608
MEQKTLQVEGMSCQHCVKAVETSVGELDGVSAVHVNLEAGKVDVSFDADKVSVKDIADAIEDQGYDVAK
>Q47840 ~~~copZ~~~Copper chaperone CopZ~~~COG2608
MKQEFSVKGMSCNHCVARIEEAVGRISGVKKVKVQLKKEKAVVKFDEANVQATEICQAINELGYQAEVI
>Q2FV63 ~~~copZ~~~Copper chaperone CopZ~~~COG2608
MSQEILNVEGMSCGHCKSAVESALNNIDGVTSADVNLENGQVSVQYDDSKVAVSQMKDAIEDQGYDVV
>Q7A3E5 ~~~copZ~~~Copper chaperone CopZ~~~
MSQEILNVEGMSCGHCKSAVESALNNIDGVTSADVNLENGQVSVQYDDSKVAVSQMKDAIEDQGYDVV
>Q9I5R6 1.14.99.60~~~coq7~~~3-demethoxyubiquinol 3-hydroxylase~~~
MSADRHYSPIDRFLLQADSALRTLLPFSGQPARPSPAIVEPDGELSEEDTRHIAGLMRINHTGEVCAQALYQGQSLTAKL
PEVREAMEEAAEEEIDHLAWCEQRIRQLGSRPSVLNPIFYGLSFGVGAAAGLVSDRVSLGFVAATEDQVCKHLDEHLAQI
PQEDRKSRAILEQMRIDEEQHSSNALAAGGLRFPAPVKLGMSLLAKVMTKSTYRI
>Q2RNK3 1.14.99.60~~~coq7~~~3-demethoxyubiquinol 3-hydroxylase~~~COG2941
MTSPSSRTPRGSTPPFEPSADELVLHASGRKTAEDRLPGDPSPAALIDRFLRVDQAGEHGAVRIYQGQLAVLGRRSANVG
VLRHMLAQEEVHLATFDKLVADRRARPTLLGPLWHVAGFALGAGTALLGEKAAMACTTAIEEAIDGHYKDQYDRLGDDEL
PLKATIDTFRREELEHRDIGYANGARQAPAFPVLSGAIKAGAKLAIWVSERV
>P40948 ~~~corA~~~Magnesium transport protein CorA~~~COG0598
MKAHTGKDWFWYQMGPQERSKARDLIHFSHWPQCEKWFENNHHVNFLRVDTTETENEAVFGSIVYDQGLGEEKDHTVFHF
YITRQYFFTINFDFSILREIKGKEVVRQMERADNAIEGFLILLGELMNAYLIGVDEFEVKLRKLRWQIKDDNSKSILNRV
HLLRHELMIWKNLILSAKKIEMALKETFLPQNEGKKDYQRTQLKIDRGFTYISEFEGELNNLLHSEEVITSHRGNEIVKA
LTIFTTLFTPITALGALWGMNFSVMPELNWKYGYLFSLLLIVTSTVLIYLYLRKKGWTGDMLQERKKKKKPRKRRTL
>P0ABI4 ~~~corA~~~Magnesium transport protein CorA~~~COG0598
MLSAFQLENNRLTRLEVEESQPLVNAVWIDLVEPDDDERLRVQSELGQSLATRPELEDIEASARFFEDDDGLHIHSFFFF
EDAEDHAGNSTVAFTIRDGRLFTLRERELPAFRLYRMRARSQSMVDGNAYELLLDLFETKIEQLADEIENIYSDLEQLSR
VIMEGHQGDEYDEALSTLAELEDIGWKVRLCLMDTQRALNFLVRKARLPGGQLEQAREILRDIESLLPHNESLFQKVNFL
MQAAMGFINIEQNRIIKIFSVVSVVFLPPTLVASSYGMNFEFMPELKWSFGYPGAIIFMILAGLAPYLYFKRKNWL
>O25901 ~~~corA~~~Magnesium transport protein CorA~~~COG0598
MVNVFFKQQKFVIKKRFNDFNGFDIEENEVLWFELINPTPNELATLSQEYAIHYNTDHSQRVSSVTKYWEDSSSVTINAF
FTNQDENETFHTEMATFILSNNILFTIYYGTLEIFDSIQKKVLASPKKFEDGFDILTKIFEVYFEKGVECLEWINKQTSL
LRKNIIFKETSTHDDILVRLSNLQEFNVTLRDSFFDKRRIITALLRSNKVDSDTKNNLNIILTDFSSLVESTTVNLNSLD
NIQNLFASQVNVEQNKIIKLFTVATMAMMPPTLIGTIYGMNFKFMPELEWQYGYLFALIVMAISTILPVIYFKKKGWL
>P0A2R8 ~~~corA~~~Magnesium transport protein CorA~~~
MLSAFQLEKNRLTRLEVEESQSLIDAVWVDLVEPDDDERLRVQSELGQSLATRPELEDIEASARFFEDEDGLHIHSFFFF
EDAEDHAGNSTVAFTIRDGRLFTLRERELPAFRLYRMRARSQAMVDGNAYELLLDLFETKIEQLADEIENIYSDLEKLSR
VIMEGHQGDEYDEALSTLAELEDIGWKVRLCLMDTQRALNFLVRKARLPGGQLEQAREILRDIESLLPHNESLFQKVNFL
MQAAMGFINIEQNRIIKIFSVVSVVFLPPTLVASSYGMNFEFMPELKWSFGYPGAIIFMILAGLAPYLYFKRKNWL
>Q9WZ31 ~~~corA~~~Cobalt/magnesium transport protein CorA~~~COG0598
MEEKRLSAKKGLPPGTLVYTGKYREDFEIEVMNYSIEEFREFKTTDVESVLPFRDSSTPTWINITGIHRTDVVQRVGEFF
GIHPLVLEDILNVHQRPKVEFFENYVFIVLKMFTYDKNLHELESEQVSLILTKNCVLMFQEKIGDVFDPVRERIRYNRGI
IRKKRADYLLYSLIDALVDDYFVLLEKIDDEIDVLEEEVLERPEKETVQRTHQLKRNLVELRKTIWPLREVLSSLYRDVP
PLIEKETVPYFRDVYDHTIQIADTVETFRDIVSGLLDVYLSSVSNKTNEVMKVLTIIATIFMPLTFIAGIYGMNFEYMPE
LRWKWGYPVVLAVMGVIAVIMVVYFKKKKWL
>P0AE78 ~~~corC~~~Magnesium and cobalt efflux protein CorC~~~COG4535
MSDDNSHSSDTISNKKGFFSLLLSQLFHGEPKNRDELLALIRDSGQNDLIDEDTRDMLEGVMDIADQRVRDIMIPRSQMI
TLKRNQTLDECLDVIIESAHSRFPVISEDKDHIEGILMAKDLLPFMRSDAEAFSMDKVLRQAVVVPESKRVDRMLKEFRS
QRYHMAIVIDEFGGVSGLVTIEDILELIVGEIEDEYDEEDDIDFRQLSRHTWTVRALASIEDFNEAFGTHFSDEEVDTIG
GLVMQAFGHLPARGETIDIDGYQFKVAMADSRRIIQVHVKIPDDSPQPKLDE
>P0A2L3 ~~~corC~~~Magnesium and cobalt efflux protein CorC~~~
MSDDNSHSSDTVNSKKGFFSLLLSQLFHGEPKNRDELLALIRDSGQNELIDEDTRDMLEGVMDIADQRVRDIMIPRSQMI
TLKRNQTLDECLDVIIESAHSRFPVISEDKDHIEGILMAKDLLPFMRSDAEAFSMDKVLRTAVVVPESKRVDRMLKEFRS
QRYHMAIVIDEFGGVSGLVTIEDILELIVGEIEDEYDEEDDIDFRQLSRHTWTIRALASIEDFNDAFGTHFSDEEVDTIG
GLVMQAFGHLPARGETIDIDGYQFKVAMADSRRIIQVHVRIPDDSPQPKLDE
>H1AAP2 3.13.1.7~~~cos~~~Carbonyl sulfide hydrolase~~~
MEKSNTDALLENNRLYAGGQATHRPGHPGMQPIQPSRRVAVVACMDARLDVEDLLGLQTGEAHIIRNAGGVINEDAIRCL
IISHHLLNTHEIILVHHTRCGMLAFTDDLLRAGLEGDAAAEKLIGQATGRAFVSAGKASASPAAFQAFRGPPEPLDAPRS
DASTERIAADVRRGLSIILNHPWLPTAGPDAITVRGFIYDVDTGRLEEVSYPGPMGGFG
>A0A7T1FRB0 1.10.3.2~~~cotA~~~Laccase~~~
MNLEKFVDELPIPEVAEPVKKNPRQTYYEIAMEEVFLKVHRDLPPTKLWTYNGSLPGPTIKANRNEKVKVKWMNKLPLKH
FLPVDHTIHAGHHDEPEVKTVVHLHGGVTPASSDGYPEAWFSRDFEATGPFFEREVYEYPNHQQACTLWYHDHAMALTRL
NVYAGLAGFYLISDAFEKSLELPKDEYDIPLMIMDRTFQEDGALFYPSRPNNTPEDSDLPDPSIVPFFCGETILVNGKVW
PYLEVEPRKYRFRILNASNTRTYELHLDNDATILQIGSDGGFLPRPVHHQSFSIAPAERFDVIIDFSAYENKTIVLKNSA
GCGQDVNPETDANIMQFKVTRPLKGRAAKTLRPIFKPLPPLRPSRADNERTLTLTGTQDKYGRPILLLDNQFWNDPVTEN
PRLGSVEVWNIVNPTRGTHPIHLHLVQFRVIDRRPFDTDIYQSTGEIVYTGPNEAPPLHEQGYKDTIQAHAGEVIRIIAR
FVPYSGRYVWHCHILEHEDYDMMRPMDIIQ
>P07788 1.10.3.2~~~cotA~~~Laccase~~~COG2132
MTLEKFVDALPIPDTLKPVQQSKEKTYYEVTMEECTHQLHRDLPPTRLWGYNGLFPGPTIEVKRNENVYVKWMNNLPSTH
FLPIDHTIHHSDSQHEEPEVKTVVHLHGGVTPDDSDGYPEAWFSKDFEQTGPYFKREVYHYPNQQRGAILWYHDHAMALT
RLNVYAGLVGAYIIHDPKEKRLKLPSDEYDVPLLITDRTINEDGSLFYPSAPENPSPSLPNPSIVPAFCGETILVNGKVW
PYLEVEPRKYRFRVINASNTRTYNLSLDNGGDFIQIGSDGGLLPRSVKLNSFSLAPAERYDIIIDFTAYEGESIILANSA
GCGGDVNPETDANIMQFRVTKPLAQKDESRKPKYLASYPSVQHERIQNIRTLKLAGTQDEYGRPVLLLNNKRWHDPVTET
PKVGTTEIWSIINPTRGTHPIHLHLVSFRVLDRRPFDIARYQESGELSYTGPAVPPPPSEKGWKDTIQAHAGEVLRIAAT
FGPYSGRYVWHCHILEHEDYDMMRPMDITDPHK
>C9K1X5 4.2.3.146~~~CotB2~~~Cyclooctat-9-en-7-ol synthase~~~
MTTGLSTAGAQDIGRSSVRPYLEECTRRFQEMFDRHVVTRPTKVELTDAELREVIDDCNAAVAPLGKTVSDERWISYVGV
VLWSQSPRHIKDMEAFKAVCVLNCVTFVWDDMDPALHDFGLFLPQLRKICEKYYGPEDAEVAYEAARAFVTSDHMFRDSP
IKAALCTTSPEQYFRFRVTDIGVDFWMKMSYPIYRHPEFTEHAKTSLAARMTTRGLTIVNDFYSYDREVSLGQITNCFRL
CDVSDETAFKEFFQARLDDMIEDIECIKAFDQLTQDVFLDLIYGNFVWTTSNKRYKTAVNDVNSRIQ
>C9K1X6 1.14.99.61~~~cotB3~~~Cyclooctat-9-en-7-ol 5-monooxygenase~~~
MRERGPVTPAKSSAPPERPWTTGTAPGSVPLLGHTMALWRRPLQFLASLPAHGDLVEVRLGPSRAYLACHPELVRQVLLN
PRVFDKGGVFDKARQLLGNSLSVSRGEDHRYQRRMIQPAFHTPKIAAYTAAVADDTRAAIGSWEPGRTLDISDTMHALLM
RVAARTLFSTGIDEATIDEARHCLRIVSDGIYKRTMAPLGIMEKLPTPGNRRYDRANARLRQIVDEMIRERRRSGADHGD
LLSTLLRAEHPETGKGLDDGEVLDQVVTFLVAGSETTASTLAFVFHLLGAHPEVEKRVHAEIDEILEGRSPTFEDLPSLE
YTRGVITESLRLYPPSWMAMRVTAAETELGGRTVPAGTMILYSAQALHHNPELFPDPERFDPERWLGDRAKEVERGALLP
FGAGSHKCIGDVLALTETALIVATIASRWRLRPVPGTTLRPEPKATLEPGPLPMVCEPR
>C9K1X7 1.14.99.62~~~cotB4~~~Cyclooctatin synthase~~~
MKDFFRMRTAQQPATRHWRHTVAPGGLPLAGHALLMARKPLQFLASLPAHGDLVELRLGPRPVYLPCHPELVQQVLVNAR
VYDTGGPVKEKAKPILGNGLITSDWADHRRQRRLVQPAFHTARIAKYAEVMERECEAESTAWTARRPIDVSHEMLALTAR
VTARALFSTDMAPHAVAEIQHCLPIVVEGAYRQAIDPTGLLAKLPLAANRRFDDALARLNQLIDRMIDDYKASDDGDRGD
VLSALFAAQDDETGGTMSDQEIHDQVMTLLLAGIETTASALTWAWFLLGRNPGAEAALHAEVDEVLGGRAPRYADVPRLA
YTQRVFSEALRLFPPAWLFTRTTTETTELGGRRLPPASDVLISPYVLHRDPALFPRPDSFDPDRWLPERAKEVTRGSYLP
FGGGSRKCIGDVFGMTEATLALAAIAGRWRMRPIPGTKIRPRPQMSLTAGPLRMIPEPR
>P07790 ~~~cotC~~~Spore coat protein C~~~
MGYYKKYKEEYYTVKKTYYKKYYEYDKKDYDCDYDKKYDDYDKKYYDHDKKDYDYVVEYKKHKKHY
>P14016 ~~~cotE~~~Spore coat protein E~~~
MSEYREIITKAVVAKGRKFTQCTNTISPEKKPSSILGGWIINHKYDAEKIGKTVEIEGYYDINVWYSYADNTKTEVVTER
VKYVDVIKLRYRDNNYLDDEHEVIAKVLQQPNCLEVTISPNGNKIVVQAEREFLAEVVGETKVVVEVNPDWEEDDEEDWE
DELDEELEDINPEFLVGDPEE
>P23261 ~~~cotF~~~Spore coat protein F~~~COG5577
MDERRTLAWHETLEMHELVAFQSNGLIKLKKMIREVKDPQLRQLYNVSIQGVEQNLRELLPFFPQAPHREDEEEERADNP
FYSGDLLGFAKTSVRSYAIAITETATPQLRNVLVKQLNAAIQLHAQVYRYMYQHGYYPSYNLSELLKNDVRNANRAISMK
>P39801 ~~~cotG~~~Spore coat protein G~~~
MGHYSHSDIEEAVKSAKKEGLKDYLYQEPHGKKRSHKKSHRTHKKSRSHKKSYCSHKKSRSHKKSFCSHKKSRSHKKSYC
SHKKSRSHKKSYRSHKKSRSYKKSYRSYKKSRSYKKSCRSYKKSRSYKKSYCSHKKKSRSYKKSCRTHKKSYRSHKKYYK
KPHHHCDDYKRHDDYDSKKEYWKDGNCWVVKKKYK
>Q45535 ~~~cotH~~~Inner spore coat protein H~~~COG5337
MKNQSNLPLYQLFVHPKDLRELKKDIWDDDPVPAVMKVNQKRLDIDIAYRGSHIRDFKKKSYHISFYQPKTFRGAREIHL
NAEYKDPSLMRNKLSLDFFSELGTLSPKAEFAFVKMNGKNEGVYLELESVDEYYLAKRKLADGAIFYAVDDDANFSLMSD
LERETKTSLELGYEKKTGTEEDDFYLQDMIFKINTVPKAQFKSEVTKHVDVDKYLRWLAGIVFTSNYDGFVHNYALYRSG
ETGLFEVIPWDYDATWGRDIHGERMAADYVRIQGFNTLTARILDESEFRKSYKRLLEKTLQSLFTIEYMEPKIMAMYERI
RPFVLMDPYKKNDIERFDREPDVICEYIKNRSQYLKDHLSIL
>O34656 ~~~cotI~~~Spore coat protein I~~~COG2334
MCPLMAENHEVIEEGNSSELPLSAEDAKKLTELAENVLQGWDVQAEKIDVIQGNQMALVWKVHTDSGAVCLKRIHRPEKK
ALFSIFAQDYLAKKGMNVPGILPNKKGSLYSKHGSFLFVVYDWIEGRPFELTVKQDLEFIMKGLADFHTASVGYQPPNGV
PIFTKLGRWPNHYTKRCKQMETWKLMAEAEKEDPFSQLYLQEIDGFIEDGLRIKDRLLQSTYVPWTEQLKKSPNLCHQDY
GTGNTLLGENEQIWVIDLDTVSFDLPIRDLRKMIIPLLDTTGVWDDETFNVMLNAYESRAPLTEEQKQVMFIDMLFPYEL
YDVIREKYVRKSALPKEELESAFEYERIKANALRQLI
>Q45058 ~~~cotM~~~Spore coat protein M~~~COG0071
MWRNASMNHSKRNDANDFDSMDEWLRQFFEDPFAWYDETLPIDLYETSQQYIIEADLTFLQPTQVTVTLSGCEFILTVKS
SGQTFEKQMMLPFYFNDKNIQVECENQILTVAVNKETEDGSSFSLQFPLS
>P96698 ~~~cotP~~~Spore coat protein P~~~COG0071
MDFEKIRKWLEITNEYKQSDFWTNVLKYKAPEHFFDSEASTFVYDFYQDEEYNFIIVEMPGVYEEELTIRLLSKTQLLIK
GTITPVFPAEMEVLRERYYGEIERIIQLPEAAETHLLQIQLLNGLLHISYPRQVETVAFNKGL
>O06996 3.1.1.-~~~cotR~~~Putative sporulation hydrolase CotR~~~COG3621
MAKYRIMTFDGGGTLGALSLQLLNRLARQNPKLISRTHVFSGNSIGSFTALALASGRSPRETLQYFEDEILPAFSISRPG
GPVFNQQLPYSGFIKAVRNFFPADLQLIDLRKRIVVPSFKLYSQKLDRWTPVLFHNFPGSPYLNEKVSDVILRSSGAPAT
QRAYQNYVDGYVVATNPSTASIAFAVGKANVPLDQIAVLSIGTGEAPTRLRRDTRGWGMVSADNIRPENLKNLPPNWGVL
LDRSPNEPLLPFLQMIAGGNGYYESMVSANLLGDRFFRLDPRIPNFSKTDPAVVPAVIEIANKTNLQPANQFIEKNWGSK
>P46915 2.4.-.-~~~cotSA~~~Spore coat protein SA~~~COG0438
MKIALIATEKLPVPSVRGGAIQIYLEAVAPLIAKKHEVTVFSIKDPNLADREKVDGVHYVHLDEDRYEEAVGAELKKSRF
DLVHVCNRPSWVPKLKKQAPDAVFILSVHNEMFAYDKISQAEGEICIDSVAQIVTVSDYIGQTITSRFPSARSKTKTVYS
GVDLKTYHPRWTNEGQRAREEMRSELGLHGKKIVLFVGRLSKVKGPHILLQALPDIIEEHPDVMMVFIGSKWFGDNELNN
YVKHLHTLGAMQKDHVTFIQFVKPKDIPRLYTMSDVFVCSSQWQEPLARVHYEAMAAGLPIITSNRGGNPEVIEEGKNGY
IIHDFENPKQYAERINDLLSSSEKRERLGKYSRREAESNFGWQRVAENLLSVYEKNR
>P46914 ~~~cotS~~~Spore coat protein S~~~COG2334
MYQKEHEEQIVSEILSYYPFHIDHVALKSNKSGRKIWEVETDHGPKLLKEAQMKPERMLFITQAHAHLQEKGLPIAPIHQ
TKNGGSCLGTDQVSYSLYDKVTGKEMIYYDAEQMKKVMSFAGHFHHASKGYVCTDESKKRSRLGKWHKLYRWKLQELEGN
MQIAASYPDDVFSQTFLKHADKMLARGKEALRALDDSEYETWTKETLEHGGFCFQDFTLARLTEIEGEPFLKELHSITYD
LPSRDLRILLNKVMVKLSVWDTDFMVALLAAYDAVYPLTEKQYEVLWIDLAFPHLFCAIGHKYYLKQKKTWSDEKYNWAL
QNMISVEESKDSFLDKLPELYKKIKAYREAN
>P11863 ~~~cotT~~~Spore coat protein T~~~
MDYPLNEQSFEQITPYDERQPYYYPRPRPPFYPPYYYPRPYYPFYPFYPRPPYYYPRPRPPYYPWYGYGGGYGGGYGGGY
GY
>Q08309 ~~~cotV~~~Spore coat protein V~~~
MSFEEKVESLHPAIFEQLSSEFEQQIEVIDCENITIDTSHITAALSIQAFVTTMIIVATQLVIADEDLADAVASEILILD
SSQIKKRTIIKIINSRNIKITLSADEIITFVQILLQVLNSILSELDVL
>Q08310 ~~~cotW~~~Spore coat protein W~~~
MSDNDKFKEELAKLPEVDPMTKMLVQNIFSKHGVTKDKMKKVSDEEKEMLLNLVKDLQAKSQALIENQKKKKEEAAAQEQ
KNTKPLSRREQLIEQIRQRRKNDNN
>Q08313 ~~~cotX~~~Spore coat protein X~~~
MESRPYSWVALDPDCDHPLDDKEKDKEKHERKCHCDVCCNGNGFFGNDNAFIDQDLAQANLNKQVSDETIIIRDSCDINV
TSTDVQAVTSVVTALNAAVVTATLTSIADGVIAELVAQDLLQLTANKQVNRQKLLIECSRGVNVTTVDADIATLISTATN
TLVAILVITLVL
>Q08311 ~~~cotY~~~Spore coat protein Y~~~
MSCGKTHGRHENCVCDAVEKILAEQEAVEEQCPTGCYTNLLNPTIAGKDTIPFLVFDKKGGLFSTFGNVGGFVDDMQCFE
SIFFRVEKLCDCCATLSILRPVDVKGDTLSVCHPCDPDFFGLEKTDFCIEVDLGCFCAIQCLSPELVDRTSPHKDKKHHH
NG
>Q08312 ~~~cotZ~~~Spore coat protein Z~~~
MSQKTSSCVREAVENIEDLQNAVEEDCPTGCHSKLLSVSHSLGDTVPFAIFTSKSTPLVAFGNVGELDNGPCFNTVFFRV
ERVHGSCATLSLLIAFDEHKHILDFTDKDTVCEVFRLEKTNYCIEVDLDCFCAINCLNPRLINRTHHH
>Q9F8T9 2.1.1.-~~~couO~~~C-methyltransferase CouO~~~
MKIEPITGSEAEAFHRMGSRAFERYNEFVDLLVGAGIADGQTVVDLCCGSGELEIILTSRFPSLNLVGVDLSEDMVRIAR
DYAAEQGKELEFRHGDAQSPAGMEDLLGKADLVVSRHAFHRLTRLPAGFDTMLRLVKPGGAILNVSFLHLSDFDEPGFRT
WVRFLKERPWDAEMQVAWALAHYYAPRLQDYRDALAQAADETPVSEQRIWVDDQGYGVATVKCFARRAAA
>Q2RNI5 ~~~cowN~~~N(2)-fixation sustaining protein CowN~~~
MTMDGPAHMPRRYVTFQGVNVEGLSQQLIARILFHVADPAKSNAFWEHFKAKLADADKTLARTADSLCLLCGAIGYIDEL
FEDNDDEEGLTILRRLEDELC
>P98005 7.1.1.9~~~caaA~~~Cytochrome c oxidase polypeptide I+III~~~COG0843
MAITAKPKAGVWAVLWDLLTTVDHKKIGLMYTATAFFAFALAGVFSLLIRTQLAVPNNQFLTGEQYNQILTLHGATMLFF
FIIQAGLTGFGNFVVPLMLGARDVALPRVNAFSYWAFLGAIVLALMSYFFPGGAPSVGWTFYYPFSAQSESGVDFYLAAI
LLLGFSSLLGNANFVATIYNLRAQGMSLWKMPIYVWSVFAASVLNLFSLAGLTAATLLVLLERKIGLSWFNPAVGGDPVL
FQQFFWFYSHPTVYVMLLPYLGILAEVASTFARKPLFGYRQMVWAQMGIVVLGTMVWAHHMFTVGESTLFQIAFAFFTAL
IAVPTGVKLFNIIGTLWGGKLQMKTPLYWVLGFIFNFLLGGITGVMLSMTPLDYQFHDSYFVVAHFHNVLMAGSGFGAFA
GLYYWWPKMTGRMYDERLGRLHFWLFLVGYLLTFLPQYALGYLGMPRRYYTYNADIAGWPELNLLSTIGAYILGLGGLVW
IYTMWKSLRSGPKAPDNPWGGYTLEWLTASPPKAHNFDVKLPTEFPSERPLYDWKKKGVELKPEDPAHIHLPNSSFWPFY
SAATLFAFFVAVAALPVPNVWMWVFLALFAYGLVRWALEDEYSHPVEHHTVTGKSNAWMGMAWFIVSEVGLFAILIAGYL
YLRLSGAATPPEERPALWLALLNTFLLVSSSFTVHFAHHDLRRGRFNPFRFGLLVTIILGVLFFLVQSWEFYQFYHHSSW
QENLWTAAFFTIVGLHGLHVVIGGFGLILAYLQALRGKITLHNHGTLEAASMYWHLVDAVWLVIVTIFYVW
>P98002 7.1.1.9~~~ctaDII~~~Cytochrome c oxidase subunit 1-beta~~~
MADAAVHGHGDHHDTRGFFTRWFMSTNHKDIGILYLFTAGIVGLISVCFTVYMRMELQHPGVQYMCLEGARLIADASAEC
TPNGHLWNVMITYHGVLMMFFVVIPALFGGFGNYFMPLHIGAPDMAFPRLNNLSYWMYVCGVALGVASLLAPGGNDQMGS
GVGWVLYPPLSTTEAGYSMDLAIFAVHVSGASSILGAINIITTFLNMRAPGMTLFKVPLFAWSVFITAWLILLSLPVLAG
AITMLLMDRNFGTQFFDPAGGGDPVLYQHILWFFGHPEVYIIILPGFGIISHVISTFAKKPIFGYLPMVLAMAAIGILGF
VVWAHHMYTAGMSLTQQAYFMLATMTIAVPTGIKVFSWIATMWGGSIEFKTPMLWAFGFLFLFTVGGVTGVVLSQAPLDR
VYHDTYYVVAHFHYVMSLGAVFGIFAGVYYWIGKMSGRQYPEWAGQLHFWMMFIGSNLIFFPQHFLGRQGMPRRYIDYPV
EFAYWNNISSIGAYISFASFLFFIGIVFYTLFAGKRVNVPNYWNEHADTLEWTLPSPPPEHTFETLPKREDWDRAHAH
>Q04440 7.1.1.9~~~ctaD~~~Cytochrome c oxidase subunit 1~~~COG0843
MATQKQEKSVIWDWLTTVDHKKIAIMYLIAGTLFFVKAGVMALFMRIQLMYPEMNFLSGQTFNEFITMHGTIMLFLAATP
LLFAFMNYVIPLQIGARDVAFPFVNALGFWIFFFGGLLLSLSWFFGGGPDAGWTAYVPLSSRDYGGLGIDFYVLGLQVSG
IGTLISAINFLVTIVNMRAPGMTMMRLPLFVWTSFISSTLILFAFTPLAAGLALLMLDRLFEAQYFIPSMGGNVVLWQHI
FWIFGHPEVYILVLPAFGIISEVIPAFSRKRLFGYTAMVFATMIIAFLGFMVWAHHMFTVGMGPVANSIFAVATMTIAVP
TGIKIFNWLFTMWGGKITFNTAMLFASSFVPTFVLGGVTGVMLAMAPVDYLYHDTYFVVAHFHYIIVGGIVLSLFAGLFY
WYPKMFGHMLNETLGKLFFWVFYIGFHLTFFVQHLLGLMGMPRRVYTYLGDQGLDAFNFISTIGTFFMSAGVILLVINVI
YSAFKGERVTVADPWDARTLEWATPTPVPEYNFAQTPQVRSLDPLFYEKIHGDGTMKPAEPVTDIHMPNGSILPFIMSIG
LFFAGFGLIMLNMDNPIINPWIVAIGGLALTFGCMFVRSIKEDHGYHIPAEQVKADLAELKKGGN
>P16262 7.1.1.9~~~ctaD~~~Cytochrome c oxidase subunit 1~~~
MSTIARKKGVGAVLWDYLTTVDHKKIAHLYLISGGFFFLLGGLEALFIRIQLAKPNNDFLVGGLYNEVLTMHGTTMIFLA
AMPLVFAFMNAVVPLQIGARDVAFPFLNALGFWMFFFGGLFLNCSWFLGGAPDAGWTSYASLSLDSKAHHGIDFYTLGLQ
ISGFGTIMGAINFLVTIINMRAPGMTFMRMPMFTWATFVTSALILFAFPPLTVGLIFMMMDRLFGGNFFNPAAGGNTIIW
EHLFWVFGHPEVYILVLPAFGIFSEIFATFSRKRLFGYSSMVFATVLIAFLGFMVWAHHMFTVGMGPIANAIFAVATMTI
AVPTGVKIFNWLFTMWGGSIKFTTPMHYAVAFIPSFVMGGVTGVMLASAAADYQYHDSYFVVAHFHYVIVGGVVFALLAG
THYWWPKMFGRMLNETLGKITFWLFFIGFHLTFFIQHFLGLTGMPRRVFTYLPHQGWETGNLISTIGAFFIAAATVILLI
NIVVTTAKGEKVPGDAWGDGRTLEWAIASPPPVYNFAQTPLVRGLDAFWLEKMEGKKELTPAEPLGDIHMPNSSFLPFVI
AFGLFVAAFGFTYHNDAGWGLPVAILGLLITLGSMFLRSVIDDHGFHIHKEEVLEL
>P31833 7.1.1.9~~~ctaD~~~Cytochrome c oxidase subunit 1~~~COG0843
MATSAAAHGDHAQDHGHDEHAHPTGWRRYVYSTNHKDIGTMYLIFAVIAGVIGAAMSIAIRAELMYPGVQIFHETHTYNV
FVTSHGLIMIFFMVMPAMIGGFGNWFVPLMIGAPDMAFPRMNNISFWLLPASFGLLLMSTFVEGEPGANGVGAGWTMYVP
LSSSGHPGPAVDFAILSLHLAGASSILGAINFITTIFNMRAPGMTLHKMPLFVWSILVTVFLLLLSLPVLAGAITMLLTD
RNFGTTFFAPDGGGDPVLFQHLFWFFGHPEVYILILPGFGMISQIVSTFSRKPVFGYLGMAYAMVAIGGIGFVVWAHHMY
TVGMSSATQAYFVAATMVIAVPTGVKIFSWIATMWGGSIEFRAPMIWAVGFIFLFTVGGVTGVVLANAGVDRVLQETYYV
VAHFHYVLSLGAVFAIFAGWYYWFPKMTGYMYNETLAKAHFWVTFIGVNLVFFPQHFLGLSGMPRRYVDYPDAFAGWNLV
SSVGSYISGFGVLIFLYCVIDAFAKKVPAGDNPWGAGATTLEWTLPSPPPFHQFEVLPRVQ
>P33517 7.1.1.9~~~ctaD~~~Cytochrome c oxidase subunit 1~~~
MADAAIHGHEHDRRGFFTRWFMSTNHKDIGVLYLFTGGLVGLISVAFTVYMRMELMAPGVQFMCAEHLESGLVKGFFQSL
WPSAVENCTPNGHLWNVMITGHGILMMFFVVIPALFGGFGNYFMPLHIGAPDMAFPRMNNLSYWLYVAGTSLAVASLFAP
GGNGQLGSGIGWVLYPPLSTSESGYSTDLAIFAVHLSGASSILGAINMITTFLNMRAPGMTMHKVPLFAWSIFVTAWLIL
LALPVLAGAITMLLTDRNFGTTFFQPSGGGDPVLYQHILWFFGHPEVYIIVLPAFGIVSHVIATFAKKPIFGYLPMVYAM
VAIGVLGFVVWAHHMYTAGLSLTQQSYFMMATMVIAVPTGIKIFSWIATMWGGSIELKTPMLWALGFLFLFTVGGVTGIV
LSQASVDRYYHDTYYVVAHFHYVMSLGAVFGIFAGIYFWIGKMSGRQYPEWAGKLHFWMMFVGANLTFFPQHFLGRQGMP
RRYIDYPEAFATWNFVSSLGAFLSFASFLFFLGVIFYTLTRGARVTANNYWNEHADTLEWTLTSPPPEHTFEQLPKREDW
ERAPAH
>Q79VD7 7.1.1.9~~~ctaD~~~Cytochrome c oxidase subunit 1~~~COG0843
MTAVAPRVDGHVAPQRPEPTGHARKGSKAWLMMTTTDHKQLGIMYIIMSFSFFFLGGLMALLIRAELFTPGLQFLSNEQF
NQLFTMHGTVMLLLYGTPIVWGFANYVLPLQIGAPDVAFPRLNAFGFWITTVGGVAMLTGFLTPGGAADFGWTMYSPLSD
AIHSPGLGSDMWIVGVGATGIGSVASAINMLTTILCLRAPGMTMFRMPIFTWNIFVVSVLALLIFPLLLAAALGVLYDRK
LGGHLYDPANGGSLLWQHLFWFFGHPEVYVLALPFFGIVSEIIPVFSRKPMFGYVGLIFATLSIGALSMAVWAHHMFVTG
AVLLPFFSFMTFLISVPTGVKFFNWVGTMWKGHITWETPMIWSVGFMATFLFGGLTGIMLASPPLDFHLADSYFLIAHFH
YTLFGTVVFASCAGVYFWFPKMTGRMMDERLGKIHFWLTFVGFHGTFLIQHWVGNMGMPRRYADYLDSDGFTIYNQISTV
FSFLLGLSVIPFIWNVFKSWRYGELVTVDDPWGYGNSLEWATSCPPPRHNFASLPRIRSERPAFELHYPHMIERMRAEAH
TGHHDDINAPELGTAPALASDSSR
>P9WP71 7.1.1.9~~~ctaD~~~Probable cytochrome c oxidase subunit 1~~~COG0843
MTAEAPPLGELEAIRPYPARTGPKGSLVYKLITTTDHKMIGIMYCVACISFFFIGGLLALLMRTELAAPGLQFLSNEQFN
QLFTMHGTIMLLFYATPIVFGFANLVLPLQIGAPDVAFPRLNAFSFWLFVFGATIGAAGFITPGGAADFGWTAYTPLTDA
IHSPGAGGDLWIMGLIVAGLGTILGAVNMITTVVCMRAPGMTMFRMPIFTWNIMVTSILILIAFPLLTAALFGLAADRHL
GAHIYDAANGGVLLWQHLFWFFGHPEVYIIALPFFGIVSEIFPVFSRKPIFGYTTLVYATLSIAALSVAVWAHHMFATGA
VLLPFFSFMTYLIAVPTGIKFFNWIGTMWKGQLTFETPMLFSVGFMVTFLLGGLTGVLLASPPLDFHVTDSYFVVAHFHY
VLFGTIVFATFAGIYFWFPKMTGRLLDERLGKLHFWLTFIGFHTTFLVQHWLGDEGMPRRYADYLPTDGFQGLNVVSTIG
AFILGASMFPFVWNVFKSWRYGEVVTVDDPWGYGNSLEWATSCPPPRHNFTELPRIRSERPAFELHYPHMVERLRAEAHV
GRHHDEPAMVTSS
>Q08855 7.1.1.9~~~ctaD~~~Cytochrome c oxidase subunit 1~~~
MATSAAAHGEHAEDHGHDEHAHPTGWRRSTNHKDIGTLYLIFAIIAGVIGAAMSLAIRAELMYPGVEYFHNTHLYNVFVT
SHGVIMIFFMVMPAMIGGFGNWFLPLMIGAPDMAFPRMNNISFWLLPASFGLLLMSTFVEGEPGANGAGAGWTMYVPLSS
SGHPGPAVDLAIFSLHIAGASSILGAINFITTILNMRAPGMTLHKMPLFAWSVLITAFLLLLSLPVLAGAITMLLTDRNF
GTTFFAPEGGGDPLLYQHLFWFFGHPEVYILILPGFGMISHIISTFSRKPVFGYIGMVYAMAAIGGLGFVVWAHHMYIVG
MDLDTEAYFVSATMIIAVPTGIKIFSWIATMWGGSIEFATPMLWALAFIFLFTVGGVTGVVLANASLDRVLHDTYYVVAH
FHYVLSLGAIFAIFAGWYYWFPKMSGYMYNETLAEAHFWLIFIGVNLIFFPEHFLGISGMPRRYIDYPDAFAGWNLVSSI
GSYISGFSVLLFIYCVYDAFAKNVPVGDNPWGAGATTLEWTLPSPPPVHEFEVLPRVE
>Q5SJ79 7.1.1.9~~~cbaA~~~Cytochrome c oxidase subunit 1~~~COG0843
MAVRASEISRVYEAYPEKKATLYFLVLGFLALIVGSLFGPFQALNYGNVDAYPLLKRLLPFVQSYYQGLTLHGVLNAIVF
TQLFAQAIMVYLPARELNMRPNMGLMWLSWWMAFIGLVVAALPLLANEATVLYTFYPPLKGHWAFYLGASVFVLSTWVSI
YIVLDLWRRWKAANPGKVTPLVTYMAVVFWLMWFLASLGLVLEAVLFLLPWSFGLVEGVDPLVARTLFWWTGHPIVYFWL
LPAYAIIYTILPKQAGGKLVSDPMARLAFLLFLLLSTPVGFHHQFADPGIDPTWKMIHSVLTLFVAVPSLMTAFTVAASL
EFAGRLRGGRGLFGWIRALPWDNPAFVAPVLGLLGFIPGGAGGIVNASFTLDYVVHNTAWVPGHFHLQVASLVTLTAMGS
LYWLLPNLTGKPISDAQRRLGLAVVWLWFLGMMIMAVGLHWAGLLNVPRRAYIAQVPDAYPHAAVPMVFNVLAGIVLLVA
LLLFIYGLFSVLLSRERKPELAEAPLPFAEVISGPEDRRLVLAMDRIGFWFAVAAILVVLAYGPTLVQLFGHLNPVPGWR
LW
>Q04441 7.1.1.9~~~ctaC~~~Cytochrome c oxidase subunit 2~~~COG1622
MKLWKTASRFLPLSFLTLFLTGCLGEENLTALDPKGPQAQWIYDNMILSIIVMALVSIVVFAIFFIILAKYRRKPGDDEI
PKQVHGNTALEITWTVIPIILLVILAVPTITGTFMFADKDPDPEVGDNTVYIKVTGHQFWWQFDYENEGFTAGQDVYIPV
GEKVIFELHAQDVLHSFWVPALGGKIDTVPGITNHMWLEADEPGVFKGKCAELCGPSHALMDFKLIALERDEYDAWVEGM
SAEVEEPTETLANQGRQVFEENSCIGCHAVGGTGTAAGPAFTNFGEREVIAGYLENNDENLEAWIRDPQSLKQGNVMPAY
PDMSEEDMEALIAYLRSLKVME
>Q03438 7.1.1.9~~~ctaC~~~Cytochrome c oxidase subunit 2~~~
MNKGLCNWRLFSLFGMMALLLAGCGKPFLSTLQPAGEVADMQYSLMLLSTSIMVLVIVVVAIIFVYVVIRFRRRKGEENK
IPKQVEGSHKLEIIWTVIPIILLLILAVPTVLTTFKLADVKAMNDKKRDKNTVVVNVRANQYWWEFEYPDYGIITSQDLV
VPTNEKVYFNLIASDVKHSFWIPAVGGKMDTNTDNKNQFWLVFDQKATDKAGGVFYGKCAELCGPSHALMDFKVRPLPRD
QFDAWVKKMQNAKKPVVTDPVAKEGEAIFNKSCIGCHAVTPLDKRPAQRRTAPNLADFGDRERIAGILEHNEENLKKWLR
DPNSVKPGNKMAGTYGHLTEEQIDALTKYLMSLKVE
>Q03736 7.1.1.9~~~ctaC~~~Cytochrome c oxidase subunit 2~~~
MRHSTTLTGCATGAAGLLAATAAAAQQQSLEIIGRPQPGGTGFQPSASPVATQIHWLDGFILVIIAAITIFVTLLILYAV
WRFHEKRNKVPARFTHNSPLEIAWTIVPIVILVAIGAFSLPVLFNQQEIPEADVTVKVTGYQWYWGYEYPDEEISFESYM
IGSPATGGDNRMSPEVEQQLIEAGYSRDEFLLATDTAMVVPVNKTVVVQVTGADVIHSWTVPAFGVKQDAVPGRLAQLWF
RAEREGIFFGQCSELCGISHAYMPITVKVVSEEAYAAWLEQARGGTYELSSVLPATPAGVSVE
>Q8NNK2 7.1.1.9~~~ctaC~~~Cytochrome c oxidase subunit 2~~~COG1622
MEQQNKRGLKRKALLGGVLGLGGLAMAGCEVAPPGGVLGDFLRMGWPDGITPEAVAMGNFWSWVWVAAWIIGIIMWGLFL
TAIFAWGAKRAEKRGEGEFPKQLQYNVPLELVLTIVPIIIVMVLFFFTVQTQDKVTALDKNPEVTVDVTAYQWNWKFGYS
EIDGSLAPGGQDYQGSDPERQAAAEASKKDPSGDNPIHGNSKSDVSYLEFNRIETLGTTDEIPVMVLPVNTPIEFNLASA
DVAHSFWVPEFLFKRDAYAHPEANKSQRVFQIEEITEEGAFVGRCAEMCGTYHAMMNFELRVVDRDSFAEYISFRDSNPD
ATNAQALEHIGQAPYATSTSPFVSDRTATRDGENTQSNA
>P9WP69 7.1.1.9~~~ctaC~~~Cytochrome c oxidase subunit 2~~~COG1622
MTPRGPGRLQRLSQCRPQRGSGGPARGLRQLALAAMLGALAVTVSGCSWSEALGIGWPEGITPEAHLNRELWIGAVIASL
AVGVIVWGLIFWSAVFHRKKNTDTELPRQFGYNMPLELVLTVIPFLIISVLFYFTVVVQEKMLQIAKDPEVVIDITSFQW
NWKFGYQRVNFKDGTLTYDGADPERKRAMVSKPEGKDKYGEELVGPVRGLNTEDRTYLNFDKVETLGTSTEIPVLVLPSG
KRIEFQMASADVIHAFWVPEFLFKRDVMPNPVANNSVNVFQIEEITKTGAFVGHCAEMCGTYHSMMNFEVRVVTPNDFKA
YLQQRIDGKTNAEALRAINQPPLAVTTHPFDTRRGELAPQPVG
>P08306 7.1.1.9~~~ctaC~~~Cytochrome c oxidase subunit 2~~~
MMAIATKRRGVAAVMSLGVATMTAVPALAQDVLGDLPVIGKPVNGGMNFQPASSPLAHDQQWLDHFVLYIITAVTIFVCL
LLLICIVRFNRRANPVPARFTHNTPIEVIWTLVPVLILVAIGAFSLPILFRSQEMPNDPDLVIKAIGHQWYWSYEYPNDG
VAFDALMLEKEALADAGYSEDEYLLATDNPVVVPVGKKVLVQVTATDVIHAWTIPAFAVKQDAVPGRIAQLWFSVDQEGV
YFGQCSELCGINHAYMPIVVKAVSQEKYEAWLAGAKEEFAADASDYLPASPVKLASAE
>Q5SJ80 7.1.1.9~~~cbaB~~~Cytochrome c oxidase subunit 2~~~COG1622
MVDEHKAHKAILAYEKGWLAFSLAMLFVFIALIAYTLATHTAGVIPAGKLERVDPTTVRQEGPWADPAQAVVQTGPNQYT
VYVLAFAFGYQPNPIEVPQGAEIVFKITSPDVIHGFHVEGTNINVEVLPGEVSTVRYTFKRPGEYRIICNQYCGLGHQNM
FGTIVVKE
>Q03439 7.1.1.9~~~ctaE~~~Cytochrome c oxidase subunit 3~~~
MHAEEKLTAETFPAAPERNATLEGKNKFLGFWLFLGGETVLFASLFATYLALKDKTNGGPSAEELFQMPVVFMATMLLLT
SSLTSVYAIYHMKNFDFKKMQLWFGITVLLGAGFLGLEIYEFNEYVHEGHKFTTSAFASAFYTLVGTHGSHVAFGLLWIL
TLMIRNAKRGLNLYNAPKFYVASLYWHFIDVVWVFIFTVVYLMGMVG
>Q9AEL8 7.1.1.9~~~ctaE~~~Cytochrome c oxidase subunit 3~~~COG1845
MTSAVGNTGMAAPQRVAALNRPNMVSVGTIVFLSQELMFFAGLFAMYFVSRANGLANGSWGEQTDHLNVPYALLITVILV
SSSVTCQFGVFAAERGDVYGLRKWFLVTIILGSIFVIGQGYEYITLVGHGLTIQSSVYGSAFFITTGFHALHVIAGVMAF
VVVLMRIHKSKFTPAQATAAMVVSYYWHFVDVVWIGLFITIYFIQ
>P9WP67 7.1.1.9~~~ctaE~~~Probable cytochrome c oxidase subunit 3~~~COG1845
MTSAVGTSGTAITSRVHSLNRPNMVSVGTIVWLSSELMFFAGLFAFYFSARAQAGGNWPPPPTELNLYQAVPVTLVLIAS
SFTCQMGVFAAERGDIFGLRRWYVITFLMGLFFVLGQAYEYRNLMSHGTSIPSSAYGSVFYLATGFHGLHVTGGLIAFIF
LLVRTGMSKFTPAQATASIVVSYYWHFVDIVWIALFTVIYFIR
>P06030 7.1.1.9~~~ctaE~~~Cytochrome c oxidase subunit 3~~~
MAHVKNHDYQILPPSIWPFFGAIGAFVMLTGAVAWMKGITFFGLPVEGPWMFLIGLVGVLYVMFGWWADVVNEGETGEHT
PVVRIGLQYGFILFIMSEVMFFVAWFWAFIKNALYPMGPDSPIKDGVWPPEGIVTFDPWHLPLINTLILLLSGVAVTWAH
HAFVLEGDRKTTINGLIVAVILGVCFTGLQAYEYSHAAFGLADTVYAGAFYMATGFHGAHVIIGTIFLFVCLIRLLKGQM
TQKQHVGFEAAAWYWHFVDVVWLFLFVVIYIWGR
>Q03440 7.1.1.9~~~caaD~~~Cytochrome c oxidase subunit 4B~~~
MANQTNSGNERVDLAYRRRKNAEEMRHQMIAFVLMILLTLIAFAAVGYEEFSHWFVVPFILLLAAVQVAFQLYYFMHMSH
KGHEFPAMFIYGGVAVMLLLVWAFTTVVWW
>Q8NNK3 7.1.1.9~~~ctaF~~~Cytochrome c oxidase polypeptide 4~~~
MKSSAKLMYGPTVFMAAMAVIYIFATMHVSDGGSVKGVEWVGSVALVLSAGLTLMLGVYLHFTEVRVDVLPEDWEEAEVA
DKAGTLGFFSPSSIWPAAMSGAVGFLAFGVVYFHYWMIAVGLMLLIFTITKLNLQYGVPKEKH
>P77921 7.1.1.9~~~ctaH~~~Cytochrome c oxidase subunit 4~~~
MASHHEITDHKHGEMDIRHQQATFAGFIKGATWVSILSIAVLVFLALANS
>P94446 ~~~coxA~~~Sporulation cortex protein CoxA~~~
MGKKMTIASLILMTAGLTACGANDNAMNDTRNNGNTRPIGYYTNENDADRQGDGIDHDGPVSELMEDQNDGNRNTTNVNN
RDRVTADDRVPLATDGTYNNTNNRNMNRNAANNGYDNQENRRLAAKIANRVKQVKNVNDTQVMVSDDRVVIAVKSHREFT
KSDRDNVVKAARNYANGRDVQVSTDKGLFRKLHKMNNR
>P82543 7.1.1.9~~~cbaD~~~Cytochrome c oxidase polypeptide 2A~~~
MEEKPKGALAVILVLTLTILVFWLGVYAVFFARG
>P98053 7.1.1.9~~~coxM~~~Alternative cytochrome c oxidase subunit 2~~~COG1622
MAVALILLLIAIGSVLFHLFSPWWWTPIATNWGYIDDTINITFWITGFVFTAVILFMAYCVFRFHHKEGRQAAYNPENKK
LEWWLSVGTGVGVAAMLAPGLVVWHQFVTVPADATEVEIMGQQWQWSFRLPGKDGRLGTSDVRNISPENPMGLNRDDPHG
QDDVVIENGDLHLPIGKPVKVLLRSVDVLHDFYVPEFRAKMDMVPGMVTYFWIRPIRTGTFDVLCAELCGAAHYQMRAKV
IVEAESDYHAWLEQQKTFAGLSGRNAVVRAKYNSGDD
>P98000 7.1.1.9~~~coxN~~~Alternative cytochrome c oxidase subunit 1~~~COG0843
MVDVPYDRIADIPPAEVPDVELYHPRSWWTRYVFSQDAKVIAIQYSLTASAIGLVALVLSWLMRLQLGFPGTFSFIDANQ
YLQFITMHGMIMVIYLLTALFLGGFGNYLIPLMVGARDMVFPYVNMLSYWVYLLAVLVLASAFFVPGGPTGAGWTLYPPQ
AILSGTPGQDWGIVLMLSSLILFIIGFTMGGLNYVVTVLQARTRGMTLMRLPLTVWGIFTATVMALLAFPALFVGSVMLL
LDRLLGTSFFMPTLVEMGQLSKYGGGSPLLFQHLFWFFGHPEVYIVALPAFGIVSDLISTHARKNIFGYRMMVWAIVAIG
ALSFVVWAHHMYVSGMYPYFGFFFATTTLIIAIPTAIKVYNWVLTLWHGDIHLTVPMLFALGFIITFVNGGLTGLFLGNV
VVDVPLSDTMFVVAHFHMVMGVAPIMVVLGAIYHWYPKVTGRMLNDVLGKFHFWVTFLGAYLIFFPMHYLGLLGVPRRYF
ELGDAAFIPPSAHSLNAFITVVALTVGFAQMVFLFNLVWSLFEGEPSGGNPWRATTLEWQTPETPPGHGNWGKQLPIVYR
WAYDYSVPGAAQDFIPQNQPPPTGAVQGVAP
>O31652 2.5.1.141~~~ctaB1~~~Protoheme IX farnesyltransferase 1~~~COG0109
MENTRDSAAISETKYIKASNRVTIYDFIKLAKPGIIISNSIAAFGGFWIAFASAEKTLTGLAFLMTMVTAMLGTAFVMAS
GTVYNNYFDRHMDAKMARTRSRASVTGKMPPAMILTYGSVLGIAGLAMLYSLNPLTAFLGLAAFIFYAIIYTVWVKRTSV
WSTFVGSFPGAAPPLMGYCAVTGDFSMTAVLLYTIMFLWQPPHFWAIGIRRKEEYRAAGVPLLPVVKGNHVTKIKMMQYI
AVLVPVTLLFPFSLGTGHISPFYFLAALVLGGIWIKKSIKGFKTDDDVKWAKDMFVYSLIYFCLLFFIMMIDSFMMFLIR
>P24009 2.5.1.141~~~ctaB2~~~Protoheme IX farnesyltransferase 2~~~COG0109
MANSRILNDTAIDGQIEETTAWKDFLSLIKIGIVNSNLITTFTGMWLALHISGLSFLGNINTVLLTLIGSSLIIAGSCAI
NNWYDRDIDHLMERTKVRPTVTGKIQPSQALWSGILLVALGLIMLLMTTVMAAVIGFIGVFTYVVLYTMWTKRRYTINTV
VGSVSGAVPPLIGWTAVEGNIGVVAWVLFMILFIWQIPHFLALAIKKTEDYRAANIPMLPVVYGFEVTKRQIIVWVACLM
PLPFFLGSLGLPIVILGLLLNIGWLILGLMGFRSKNIMKWATQMFVYSLNYMTIYFVAMVVLTLF
>Q3J5F9 2.5.1.141~~~ctaB~~~Protoheme IX farnesyltransferase~~~COG0109
MTDIRITGIPKEAGFGDYVALLKPRVMSLVVFTALVGLLVAPVTVHPMIALTGILFIALGAGASGALNMWWDEDIDRVMK
RTRNRPVPSGTVAPGEALGIGLALSGIAVVMLGLATNLFAAGLLAFTIFFYAVVYSMWLKRTTPQNIVIGGAAGAFPPMI
GWAVATGGVSVESLFMFALIFMWTPPHFWSLALFMKSDYSDAGVPMLTVTHGRRVTRAHVLVYSLLLAPLAVAGAFTGIG
GPLYLATALALNGWLLVGAVRIWRRDEAQAEADRYRVEKGFFRFSLYYLFLHFGAILAEAALKPYGLGGW
>P56940 ~~~ctaG~~~Cytochrome c oxidase assembly protein CtaG~~~
MSLSPHQKTAGGLVLVVAVMGAASFAAVPFYNWFCRVTGFAGTTAVATEAPAEVLDRTVKVRFDASREAGMPWEFRPLQR
EMKLKIGETGLAFYEAYNPTDRTVAGTASYNVTPDAAGGYFAKIACFCFTEQVLAPGERVEMPVTFYVDPAIIDDPDGRY
VRQITLSYTFHETALTEEQAALAAESATDVN
>Q92RG6 ~~~ctaG~~~Cytochrome c oxidase assembly protein CtaG~~~COG3175
MADNGQADRKERSNGVIVGTCLAFVAGMIGMAYAAVPLYDMFCRVTGYNGTTQRVEQASDLILDEKIKVTFDANVAAGLP
WEFVPVQRDIDVRIGETVQIMYRAKNLASTPTTGQATFNVTPMAAGAYFNKVQCFCFTETTLEPGEEMEMPVVFFVDPEI
VKPVETQGIKTLTLSYTFYPREPSKPVAQVKAKAENKL
>D5E3H2 1.14.-.-~~~~~~Cytochrome P450 CYP107DY1~~~
MKKVTVDDFSSPENMHDVIGFYKKLTEHQEPLIRLDDYYGLGPAWVALRHDDVVTILKNPRFLKDVRKFTPLQDKKDSID
DSTSASKLFEWMMNMPNMLTVDPPDHTRLRRLASKAFTPRMIENLRPRIQQITNELLDSVEGKRNMDLVADFSFPLPIIV
ISEMLGIPPLDQKRFRDWTDKLIKAAMDPSQGAVVMETLKEFIDYIKKMLVEKRNHPDDDVMSALLQAHEQEDKLSENEL
LSTIWLLITAGHETTAHLISNGVLALLKHPEQMRLLRDNPSLLPSAVEELLRYAGPVMIGGRFAGEDIIMHGKMIPKGEM
VLFSLVAANIDSQKFSYPEGLDITREENEHLTFGKGIHHCLGAPLARMEAHIAFGTLLQRFPDLRLAIESEQLVYNNSTL
RSLKSLPVIF
>Q59990 1.14.-.-~~~~~~Putative cytochrome P450 120~~~COG2124
MITSPTNLNSLPIPPGDFGLPWLGETLNFLNDGDFGKKRQQQFGPIFKTRLFGKNVIFISGALANRFLFTKEQETFQATW
PLSTRILLGPNALATQMGEIHRSRRKILYQAFLPRTLDSYLPKMDGIVQGYLEQWGKANEVIWYPQLRRMTFDVAATLFM
GEKVSQNPQLFPWFETYIQGLFSLPIPLPNTLFGKSQRARALLLAELEKIIKARQQQPPSEEDALGILLAARDDNNQPLS
LPELKDQILLLLFAGHETLTSALSSFCLLLGQHSDIRERVRQEQNKLQLSQELTAETLKKMPYLDQVLQEVLRLIPPVGG
GFRELIQDCQFQGFHFPKGWLVSYQISQTHADPDLYPDPEKFDPERFTPDGSATHNPPFAHVPFGGGLRECLGKEFARLE
MKLFATRLIQQFDWTLLPGQNLELVVTPSPRPKDNLRVKLHSLM
>P0A515 1.14.-.-~~~~~~Cytochrome P450 121~~~
MTATVLLEVPFSARGDRIPDAVAELRTREPIRKVRTITGAEAWLVSSYALCTQVLEDRRFSMKETAAAGAPRLNALTVPP
EVVNNMGNIADAGLRKAVMKAITPKAPGLEQFLRDTANSLLDNLITEGAPADLRNDFADPLATALHCKVLGIPQEDGPKL
FRSLSIAFMSSADPIPAAKINWDRDIEYMAGILENPNITTGLMGELSRLRKDPAYSHVSDELFATIGVTFFGAGVISTGS
FLTTALISLIQRPQLRNLLHEKPELIPAGVEELLRINLSFADGLPRLATADIQVGDVLVRKGELVLVLLEGANFDPEHFP
NPGSIELDRPNPTSHLAFGRGQHFCPGSALGRRHAQIGIEALLKKMPGVDLAVPIDQLVWRTRFQRRIPERLPVLW
>P9WPP7 1.14.19.70~~~~~~Mycocyclosin synthase~~~COG2124
MTATVLLEVPFSARGDRIPDAVAELRTREPIRKVRTITGAEAWLVSSYALCTQVLEDRRFSMKETAAAGAPRLNALTVPP
EVVNNMGNIADAGLRKAVMKAITPKAPGLEQFLRDTANSLLDNLITEGAPADLRNDFADPLATALHCKVLGIPQEDGPKL
FRSLSIAFMSSADPIPAAKINWDRDIEYMAGILENPNITTGLMGELSRLRKDPAYSHVSDELFATIGVTFFGAGVISTGS
FLTTALISLIQRPQLRNLLHEKPELIPAGVEELLRINLSFADGLPRLATADIQVGDVLVRKGELVLVLLEGANFDPEHFP
NPGSIELDRPNPTSHLAFGRGQHFCPGSALGRRHAQIGIEALLKKMPGVDLAVPIDQLVWRTRFQRRIPERLPVLW
>P9WPP5 1.14.-.-~~~~~~Putative cytochrome P450 123~~~COG2124
MTVRVGDPELVLDPYDYDFHEDPYPYYRRLRDEAPLYRNEERNFWAVSRHHDVLQGFRDSTALSNAYGVSLDPSSRTSEA
YRVMSMLAMDDPAHLRMRTLVSKGFTPRRIRELEPQVLELARIHLDSALQTESFDFVAEFAGKLPMDVISELIGVPDTDR
ARIRALADAVLHREDGVADVPPPAMAASIELMRYYADLIAEFRRRPANNLTSALLAAELDGDRLSDQEIMAFLFLMVIAG
NETTTKLLANAVYWAAHHPGQLARVFADHSRIPMWVEETLRYDTSSQILARTVAHDLTLYDTTIPEGEVLLLLPGSANRD
DRVFDDPDDYRIGREIGCKLVSFGSGAHFCLGAHLARMEARVALGALLRRIRNYEVDDDNVVRVHSSNVRGFAHLPISVQ
AR
>P9WPP3 1.14.15.14~~~~~~Methyl-branched lipid omega-hydroxylase~~~COG2124
MGLNTAIATRVNGTPPPEVPIADIELGSLDFWALDDDVRDGAFATLRREAPISFWPTIELPGFVAGNGHWALTKYDDVFY
ASRHPDIFSSYPNITINDQTPELAEYFGSMIVLDDPRHQRLRSIVSRAFTPKVVARIEAAVRDRAHRLVSSMIANNPDRQ
ADLVSELAGPLPLQIICDMMGIPKADHQRIFHWTNVILGFGDPDLATDFDEFMQVSADIGAYATALAEDRRVNHHDDLTS
SLVEAEVDGERLSSREIASFFILLVVAGNETTRNAITHGVLALSRYPEQRDRWWSDFDGLAPTAVEEIVRWASPVVYMRR
TLTQDIELRGTKMAAGDKVSLWYCSANRDESKFADPWTFDLARNPNPHLGFGGGGAHFCLGANLARREIRVAFDELRRQM
PDVVATEEPARLLSQFIHGIKTLPVTWS
>P63710 1.14.15.29~~~~~~Steroid C26-monooxygenase~~~
MSWNHQSVEIAVRRTTVPSPNLPPGFDFTDPAIYAERLPVAEFAELRSAAPIWWNGQDPGKGGGFHDGGFWAITKLNDVK
EISRHSDVFSSYENGVIPRFKNDIAREDIEVQRFVMLNMDAPHHTRLRKIISRGFTPRAVGRLHDELQERAQKIAAEAAA
AGSGDFVEQVSCELPLQAIAGLLGVPQEDRGKLFHWSNEMTGNEDPEYAHIDPKASSAELIGYAMKMAEEKAKNPADDIV
TQLIQADIDGEKLSDDEFGFFVVMLAVAGNETTRNSITQGMMAFAEHPDQWELYKKVRPETAADEIVRWATPVTAFQRTA
LRDYELSGVQIKKGQRVVMFYRSANFDEEVFQDPFTFNILRNPNPHVGFGGTGAHYCIGANLARMTINLIFNAVADHMPD
LKPISAPERLRSGWLNGIKHWQVDYTGRCPVAH
>A0R4Y3 1.14.15.29~~~~~~Steroid C26-monooxygenase~~~COG2124
MPTPNIPSDFDFLDATLNLERLPVEELAELRKSEPIHWVDVPGGTGGFGDKGYWLVTKHADVKEVSRRSDVFGSSPDGAI
PVWPQDMTREAVDLQRAVLLNMDAPQHTRLRKIISRGFTPRAIGRLEDELRSRAQKIAQTAAAQGAGDFVEQVSCELPLQ
AIAELLGVPQDDRDKLFRWSNEMTAGEDPEYADVDPAMSSFELISYAMKMAEERAVNPTEDIVTKLIEADIDGEKLSDDE
FGFFVVMLAVAGNETTRNSITHGMIAFAQNPDQWELYKKERPETAADEIVRWATPVSAFQRTALEDVELGGVQIKKGQRV
VMSYRSANFDEEVFEDPHTFNILRSPNPHVGFGGTGAHYCIGANLARMTINLIFNAIADNMPDLKPIGAPERLKSGWLNG
IKHWQVDYTGAGKASVSGAPGTCPVAH
>P9WPP0 1.14.15.29~~~~~~Steroid C26-monooxygenase~~~
MSWNHQSVEIAVRRTTVPSPNLPPGFDFTDPAIYAERLPVAEFAELRSAAPIWWNGQDPGKGGGFHDGGFWAITKLNDVK
EISRHSDVFSSYENGVIPRFKNDIAREDIEVQRFVMLNMDAPHHTRLRKIISRGFTPRAVGRLHDELQERAQKIAAEAAA
AGSGDFVEQVSCELPLQAIAGLLGVPQEDRGKLFHWSNEMTGNEDPEYAHIDPKASSAELIGYAMKMAEEKAKNPADDIV
TQLIQADIDGEKLSDDEFGFFVVMLAVAGNETTRNSITQGMMAFAEHPDQWELYKKVRPETAADEIVRWATPVTAFQRTA
LRDYELSGVQIKKGQRVVMFYRSANFDEEVFQDPFTFNILRNPNPHVGFGGTGAHYCIGANLARMTINLIFNAVADHMPD
LKPISAPERLRSGWLNGIKHWQVDYTGRCPVAH
>P9WPP1 1.14.15.29~~~~~~Steroid C26-monooxygenase~~~COG2124
MSWNHQSVEIAVRRTTVPSPNLPPGFDFTDPAIYAERLPVAEFAELRSAAPIWWNGQDPGKGGGFHDGGFWAITKLNDVK
EISRHSDVFSSYENGVIPRFKNDIAREDIEVQRFVMLNMDAPHHTRLRKIISRGFTPRAVGRLHDELQERAQKIAAEAAA
AGSGDFVEQVSCELPLQAIAGLLGVPQEDRGKLFHWSNEMTGNEDPEYAHIDPKASSAELIGYAMKMAEEKAKNPADDIV
TQLIQADIDGEKLSDDEFGFFVVMLAVAGNETTRNSITQGMMAFAEHPDQWELYKKVRPETAADEIVRWATPVTAFQRTA
LRDYELSGVQIKKGQRVVMFYRSANFDEEVFQDPFTFNILRNPNPHVGFGGTGAHYCIGANLARMTINLIFNAVADHMPD
LKPISAPERLRSGWLNGIKHWQVDYTGRCPVAH
>Q0S7M1 1.14.15.29~~~~~~Steroid C26-monooxygenase~~~COG2124
MGSFPCPQKIEQVLLSGQGLNELSFASRPACASMLVERVPHHGVVYGLGQETAVAQPNLPEGFDFTDPDVYAERIPYQEF
AELRKTAPIWWNPQPPEIGGFHDDGYWVVSKLEDVKEVSRRSDVFSTHENTAIVRFADDIPRENIEMQRFILINKDAPEH
TKLRKLVSRGFTPRAINSLREELTERAEKIVKEAAESGAGDFVTQVACELPLQAIAELLGVPQEDRLKVFDWSNQMTGYD
DPELDIDPQAASMEILGYAYQMADERKKCPADDIVTTLIEADIDGNELSPEEFGFFVILLAVAGNETTRNAITHGMMAFL
DHPDQWELYKKERPKTTADEIVRWATPVNSFQRTALEDTELGGVQIKKGQRVVMLYGSANFDEDAFENPEKFDIMRENNP
HVGFGGTGAHFCLGANLARLEIDLIFNAIADHLPDISKLGDPRRLRSGWLNGIKEFQVDYKTASGGCPVRH
>P9WPN9 1.14.-.-~~~~~~Putative cytochrome P450 126~~~COG2124
MTTAAGLSGIDLTDLDNFADGFPHHLFAIHRREAPVYWHRPTEHTPDGEGFWSVATYAETLEVLRDPVTYSSVTGGQRRF
GGTVLQDLPVAGQVLNMMDDPRHTRIRRLVSSGLTPRMIRRVEDDLRRRARGLLDGVEPGAPFDFVVEIAAELPMQMICI
LLGVPETDRHWLFEAVEPGFDFRGSRRATMPRLNVEDAGSRLYTYALELIAGKRAEPADDMLSVVANATIDDPDAPALSD
AELYLFFHLLFSAGAETTRNSIAGGLLALAENPDQLQTLRSDFELLPTAIEEIVRWTSPSPSKRRTASRAVSLGGQPIEA
GQKVVVWEGSANRDPSVFDRADEFDITRKPNPHLGFGQGVHYCLGANLARLELRVLFEELLSRFGSVRVVEPAEWTRSNR
HTGIRHLVVELRGG
>P9WPN7 1.14.15.27~~~~~~Beta-dihydromenaquinone-9 omega-hydroxylase~~~COG2124
MTATQSPPEPAPDRVRLAGCPLAGTPDVGLTAQDATTALGVPTRRRASSGGIPVATSMWRDAQTVRTYGPAVAKALALRV
AGKARSRLTGRHCRKFMQLTDFDPFDPAIAADPYPHYRELLAGERVQYNPKRDVYILSRYADVREAARNHDTLSSARGVT
FSRGWLPFLPTSDPPAHTRMRKQLAPGMARGALETWRPMVDQLARELVGGLLTQTPADVVSTVAAPMPMRAITSVLGVDG
PDEAAFCRLSNQAVRITDVALSASGLISLVQGFAGFRRLRALFTHRRDNGLLRECTVLGKLATHAEQGRLSDDELFFFAV
LLLVAGYESTAHMISTLFLTLADYPDQLTLLAQQPDLIPSAIEEHLRFISPIQNICRTTRVDYSVGQAVIPAGSLVLLAW
GAANRDPRQYEDPDVFRADRNPVGHLAFGSGIHLCPGTQLARMEGQAILREIVANIDRIEVVEPPTWTTNANLRGLTRLR
VAVTPRVAP
>P9WPN5 1.14.-.-~~~~~~Cytochrome P450 130~~~COG2124
MTSVMSHEFQLATAETWPNPWPMYRALRDHDPVHHVVPPQRPEYDYYVLSRHADVWSAARDHQTFSSAQGLTVNYGELEM
IGLHDTPPMVMQDPPVHTEFRKLVSRGFTPRQVETVEPTVRKFVVERLEKLRANGGGDIVTELFKPLPSMVVAHYLGVPE
EDWTQFDGWTQAIVAANAVDGATTGALDAVGSMMAYFTGLIERRRTEPADDAISHLVAAGVGADGDTAGTLSILAFTFTM
VTGGNDTVTGMLGGSMPLLHRRPDQRRLLLDDPEGIPDAVEELLRLTSPVQGLARTTTRDVTIGDTTIPAGRRVLLLYGS
ANRDERQYGPDAAELDVTRCPRNILTFSHGAHHCLGAAAARMQCRVALTELLARCPDFEVAESRIVWSGGSYVRRPLSVP
FRVTS
>P9WPN3 1.14.-.-~~~~~~Putative cytochrome P450 132~~~COG2124
MATATTQRPLKGPAKRMSTWTMTREAITIGFDAGDGFLGRLRGSDITRFRCAGRRFVSISHPDYVDHVLHEARLKYVKSD
EYGPIRATAGLNLLTDEGDSWARHRGALNSTFARRHLRGLVGLMIDPIADVTAARVPGAQFDMHQSMVETTLRVVANALF
SQDFGPLVQSMHDLATRGLRRAEKLERLGLWGLMPRTVYDTLIWCIYSGVHLPPPLREMQEITLTLDRAINSVIDRRLAE
PTNSADLLNVLLSADGGIWPRQRVRDEALTFMLAGHETTANAMSWFWYLMALNPQARDHMLTELDDVLGMRRPTADDLGK
LAWTTACLQESQRYFSSVWIIAREAVDDDIIDGHRIRRGTTVVIPIHHIHHDPRWWPDPDRFDPGRFLRCPTDRPRCAYL
PFGGGRRICIGQSFALMEMVLMAAIMSQHFTFDLAPGYHVELEATLTLRPKHGVHVIGRRR
>P9WPM7 1.14.-.-~~~~~~Putative cytochrome P450 136~~~COG2124
MATIHPPAYLLDQAKRRFTPSFNNFPGMSLVEHMLLNTKFPEKKLAEPPPGSGLKPVVGDAGLPILGHMIEMLRGGPDYL
MFLYKTKGPVVFGDSAVLPGVAALGPDAAQVIYSNRNKDYSQQGWVPVIGPFFHRGLMLLDFEEHMFHRRIMQEAFVRSR
LAGYLEQMDRVVSRVVADDWVVNDARFLVYPAMKALTLDIASMVFMGHEPGTDHELVTKVNKAFTITTRAGNAVIRTSVP
PFTWWRGLRARELLENYFTARVKERREASGNDLLTVLCQTEDDDGNRFSDADIVNHMIFLMMAAHDTSTSTATTMAYQLA
AHPEWQQRCRDESDRHGDGPLDIESLEQLESLDLVMNESIRLVTPVQWAMRQTVRDTELLGYYLPKGTNVIAYPGMNHRL
PEIWTDPLTFDPERFTEPRNEHKRHRYAFTPFGGGVHKCIGMVFDQLEIKTILHRLLRRYRLELSRPDYQPRWDYSAMPI
PMDGMPIVLRPR
>P9WPM5 1.14.-.-~~~~~~Putative cytochrome P450 137~~~COG2124
MVLRSLASPAALTDPKRCASVVGVAAFAVRREHAPDALGGPPGLPAPRGFRAAFAAAYAVAYLAGGERRMLRLIRRYGPI
MTMPILSLGDVAIVSDSALAKEVFTAPTDVLLGGEGVGPAAAIYGSGSMFVQEEPEHLRRRKLLTPPLHGAALDRYVPII
ENSTRAAMHTWPVDRPFAMLTVARSLMLDVIVKVIFGVDDPEEVRRLGRPFERLLNLGVSEQLTVRYALRRLGALRVWPA
RARANTEIDDVVMALIAQRRADPRLGERHDVLSLLVSARGESGEQLSDSEIRDDLITLVLAGHETTATTLAWAFDLLLHH
PDALRRVRAEAVGGGEAFTTAVINETLRVRPPAPLTARVAAQPLTIGGYRVEAGTRIVVHIIAINRSAEVYEHPHEFRPE
RFLGTRPQTYAWVPFGGGVKRCLGANFSMRELITVLHVLLREGEFTAVDDEPERIVRRSIMLVPRRGTRVRFRPAR
>P9WPM3 1.14.-.-~~~~~~Putative cytochrome P450 138~~~COG2124
MSEVVTAAPAPPVVRLPPAVRGPKLFQGLAFVVSRRRLLGRFVRRYGKAFTANILMYGRVVVVADPQLARQVFTSSPEEL
GNIQPNLSRMFGSGSVFALDGDDHRRRRRLLAPPFHGKSMKNYETIIEEETLRETANWPQGQAFATLPSMMHITLNAILR
AIFGAGGSELDELRRLIPPWVTLGSRLAALPKPKRDYGRLSPWGRLAEWRRQYDTVIDKLIEAERADPNFADRTDVLALM
LRSTYDDGSIMSRKDIGDELLTLLAAGHETTAATLGWAFERLSRHPDVLAALVEEVDNGGHELRQAAILEVQRARTVIDF
AARRVNPPVYQLGEWVIPRGYSIIINIAQIHGDPDVFPQPDRFDPQRYIGSKPSPFAWIPFGGGTRRCVGAAFANMEMDV
VLRTVLRHFTLETTTAAGERSHGRGVAFTPKDGGRVVMRRR
>P9WPM1 1.14.-.-~~~~~~Putative cytochrome P450 139~~~COG2124
MRYPLGEALLALYRWRGPLINAGVGGHGYTYLLGAEANRFVFANADAFSWSQTFESLVPVDGPTALIVSDGADHRRRRSV
VAPGLRHHHVQRYVATMVSNIDTVIDGWQPGQRLDIYQELRSAVRRSTAESLFGQRLAVHSDFLGEQLQPLLDLTRRPPQ
VMRLQQRVNSPGWRRAMAARKRIDDLIDAQIADARTAPRPDDHMLTTLISGCSEEGTTLSDNEIRDSIVSLITAGYETTS
GALAWAIYALLTVPGTWESAASEVARVLGGRVPAADDLSALTYLNGVVHETLRLYSPGVISARRVLRDLWFDGHRIRAGR
LLIFSAYVTHRLPEIWPEPTEFRPLRWDPNAADYRKPAPHEFIPFSGGLHRCIGAVMATTEMTVILARLVARAMLQLPAQ
RTHRIRAANFAALRPWPGLTVEIRKSAPAQ
>P9WPL9 1.14.-.-~~~~~~Putative cytochrome P450 140~~~COG2124
MKDKLHWLAMHGVIRGIAAIGIRRGDLQARLIADPAVATDPVPFYDEVRSHGALVRNRANYLTVDHRLAHDLLRSDDFRV
VSFGENLPPPLRWLERRTRGDQLHPLREPSLLAVEPPDHTRYRKTVSAVFTSRAVSALRDLVEQTAINLLDRFAEQPGIV
DVVGRYCSQLPIVVISEILGVPEHDRPRVLEFGELAAPSLDIGIPWRQYLRVQQGIRGFDCWLEGHLQQLRHAPGDDLMS
QLIQIAESGDNETQLDETELRAIAGLVLVAGFETTVNLLGNGIRMLLDTPEHLATLRQHPELWPNTVEEILRLDSPVQLT
ARVACRDVEVAGVRIKRGEVVVIYLAAANRDPAVFPDPHRFDIERPNAGRHLAFSTGRHFCLGAALARAEGEVGLRTFFD
RFPDVRAAGAGSRRDTRVLRGWSTLPVTLGPARSMVSP
>P9WPL7 1.14.-.-~~~~~~Putative cytochrome P450 141~~~COG2124
MTSTSIPTFPFDRPVPTEPSPMLSELRNSCPVAPIELPSGHTAWLVTRFDDVKGVLSDKRFSCRAAAHPSSPPFVPFVQL
CPSLLSIDGPQHTAARRLLAQGLNPGFIARMRPVVQQIVDNALDDLAAAEPPVDFQEIVSVPIGEQLMAKLLGVEPKTVH
ELAAHVDAAMSVCEIGDEEVSRRWSALCTMVIDILHRKLAEPGDDLLSTIAQANRQQSTMTDEQVVGMLLTVVIGGVDTP
IAVITNGLASLLHHRDQYERLVEDPGRVARAVEEIVRFNPATEIEHLRVVTEDVVIAGTALSAGSPAFTSITSANRDSDQ
FLDPDEFDVERNPNEHIAFGYGPHACPASAYSRMCLTTFFTSLTQRFPQLQLARPFEDLERRGKGLHSVGIKELLVTWPT
>A0R4Q6 1.14.15.28~~~~~~Steroid C26-monooxygenase~~~COG2124
MTQMLTRPDVDLVNGMFYADGGAREAYRWMRANEPVFRDRNGLAAATTYQAVLDAERNPELFSSTGGIRPDQPGMPYMID
MDDPQHLLRRKLVNAGFTRKRVMDKVDSIGRLCDTLIDAVCERGECDFVRDIAAPLPMAVIGDMLGVLPTERDMLLKWSD
DLVCGLSSHVDEAAIQKLMDTFAAYTEFTKDVITKRRAEPTDDLFSVLVNSEVEGQRMSDDEIVFETLLILIGGDETTRH
TLSGGTEQLLRHRDQWDALVADVDLLPGAIEEMLRWTSPVKNMCRTLTADTVFHGTELRAGEKIMLMFESANFDESVFGD
PDNFRIDRNPNSHVAFGFGTHFCLGNQLARLELRLMTERVLRRLPDLRLADDAPVPLRPANFVSGPESMPVVFTPSAPVL
A
>P9WPL5 1.14.15.28~~~~~~Steroid C26-monooxygenase~~~COG2124
MTEAPDVDLADGNFYASREARAAYRWMRANQPVFRDRNGLAAASTYQAVIDAERQPELFSNAGGIRPDQPALPMMIDMDD
PAHLLRRKLVNAGFTRKRVKDKEASIAALCDTLIDAVCERGECDFVRDLAAPLPMAVIGDMLGVRPEQRDMFLRWSDDLV
TFLSSHVSQEDFQITMDAFAAYNDFTRATIAARRADPTDDLVSVLVSSEVDGERLSDDELVMETLLILIGGDETTRHTLS
GGTEQLLRNRDQWDLLQRDPSLLPGAIEEMLRWTAPVKNMCRVLTADTEFHGTALCAGEKMMLLFESANFDEAVFCEPEK
FDVQRNPNSHLAFGFGTHFCLGNQLARLELSLMTERVLRRLPDLRLVADDSVLPLRPANFVSGLESMPVVFTPSPPLG
>P9WPL3 1.14.-.-~~~~~~Cytochrome P450 143~~~COG2124
MTTPGEDHAGSFYLPRLEYSTLPMAVDRGVGWKTLRDAGPVVFMNGWYYLTRREDVLAALRNPKVFSSRKALQPPGNPLP
VVPLAFDPPEHTRYRRILQPYFSPAALSKALPSLRRHTVAMIDAIAGRGECEAMADLANLFPFQLFLVLYGLPLEDRDRL
IGWKDAVIAMSDRPHPTEADVAAARELLEYLTAMVAERRRNPGPDVLSQVQIGEDPLSEIEVLGLSHLLILAGLDTVTAA
VGFSLLELARRPQLRAMLRDNPKQIRVFIEEIVRLEPSAPVAPRVTTEPVTVGGMTLPAGSPVRLCMAAVNRDGSDAMST
DELVMDGKVHRHWGFGGGPHRCLGSHLARLELTLLVGEWLNQIPDFELAPDYAPEIRFPSKSFALKNLPLRWS
>P9WPL1 1.14.-.-~~~~~~Cytochrome P450 144~~~COG2124
MRRSPKGSPGAVLDLQRRVDQAVSADHAELMTIAKDANTFFGAESVQDPYPLYERMRAAGSVHRIANSDFYAVCGWDAVN
EAIGRPEDFSSNLTATMTYTAEGTAKPFEMDPLGGPTHVLATADDPAHAVHRKLVLRHLAAKRIRVMEQFTVQAADRLWV
DGMQDGCIEWMGAMANRLPMMVVAELIGLPDPDIAQLVKWGYAATQLLEGLVENDQLVAAGVALMELSGYIFEQFDRAAA
DPRDNLLGELATACASGELDTLTAQVMMVTLFAAGGESTAALLGSAVWILATRPDIQQQVRANPELLGAFIEETLRYEPP
FRGHYRHVRNATTLDGTELPADSHLLLLWGAANRDPAQFEAPGEFRLDRAGGKGHISFGKGAHFCVGAALARLEARIVLR
LLLDRTSVIEAADVGGWLPSILVRRIERLELAVQ
>P9WPP9 1.14.15.36~~~~~~Sterol 14alpha-demethylase~~~COG2124
MSAVALPRVSGGHDEHGHLEEFRTDPIGLMQRVRDECGDVGTFQLAGKQVVLLSGSHANEFFFRAGDDDLDQAKAYPFMT
PIFGEGVVFDASPERRKEMLHNAALRGEQMKGHAATIEDQVRRMIADWGEAGEIDLLDFFAELTIYTSSACLIGKKFRDQ
LDGRFAKLYHELERGTDPLAYVDPYLPIESFRRRDEARNGLVALVADIMNGRIANPPTDKSDRDMLDVLIAVKAETGTPR
FSADEITGMFISMMFAGHHTSSGTASWTLIELMRHRDAYAAVIDELDELYGDGRSVSFHALRQIPQLENVLKETLRLHPP
LIILMRVAKGEFEVQGHRIHEGDLVAASPAISNRIPEDFPDPHDFVPARYEQPRQEDLLNRWTWIPFGAGRHRCVGAAFA
IMQIKAIFSVLLREYEFEMAQPPESYRNDHSKMVVQLAQPACVRYRRRTGV
>P07125 4.-.-.-~~~cpcE~~~Phycocyanobilin lyase subunit alpha~~~COG1413
MIEPSVEEFPAENGPQLTPELAIANLQSSDLSLRYYAAWWLGKYRVKESAAVDALIAALEDEADRTELGGYPLRRNAARA
LGKLGNRKAVPGLINCLECPDFYVREAAAQSLEMLKDKTAAPALIKLLDGGVAQAVQVTGRPHLVQPYEAVLEALGAIGA
TDAIPLIQPFLEHPVSRVQCAAARAMYQLTQEPVYGELLVKVLAGNDLNLRRVALGDLGAIGYLAAAEAIANAKAENSFK
LIALKGLLEHQMSAESNALSISDQAIRVMNLMDSLL
>P31967 4.-.-.-~~~cpcE~~~Phycocyanobilin lyase subunit alpha~~~COG1413
MSDWQMAEAWTLEEAIANIQQTEDTGKRYYAAWWFGKFRVQDERAVNALLAALKDETDRSPDGGYPLRRNAAKALGKLGN
LAAVQPLIESLESPDYYVRESAAQSLEMLGDRQAIPALQALLAGGVAAAVKAEGKPHLVQPYEAVIEALGTIGATAAIAE
IEPFLDHEFAKIRYAALRALYQLTQEAHYAEQLMEALNGNQLQLRRSALLDLGAIGYVPAGQAIAKAYAENSLKLISLKG
ILESHLQRTAETLDADGLQLLELMDSLL
>P29985 4.-.-.-~~~cpcF~~~Phycocyanobilin lyase subunit beta~~~COG1413
MTNELINGVALADTPEKLVKAVQELALAKDVAAIPTLIAVFGYNNPTAAAIASTALVQLGEVAVPQLLTQIDDYNYGARA
YSIRTLAAIADPRALDVLIDAAATDFAPSVRRAAAKGLGNLHWHKLEFPDNQTAPKKALETLLFISQDAEWSIRYAAIVG
LQGLVNIPDLQQPIHTRLKEMLASDAEKAVRARILLAQSQ
>P31968 4.-.-.-~~~cpcF~~~Phycocyanobilin lyase subunit beta~~~COG1413
MTVDVLIRAVNNPTSAQDLVKNVAQLAATKDEQAIPTLVEVLKFNNPGAAVAAVNGLINIGEAVVPYLLENVDGYNYGAR
AWMLRIFAGIGDPRALDLLIEAANKDFAFSVRRSAAKGLGNIQWHKVPDSEREVQQQKVCDCLFLALEDGEWVVRYGAIA
GLEGLSQAIPEARKIVIKNKLTEFLTTEPEAAIRARIQKAILSLP
>P29988 ~~~cpcL~~~Photosystem I-associated linker protein CpcL~~~COG0237
MALPLLEYKPTTQNQRVQSFGTADVNEDTPYIYRLENANSPSEIEELIWAAYRQVFNEQEILKFNRQIGLETQLKNRSIT
VKDFIRGLAKSERFYQLVVTPNNNYRLVEMSLKRLLGRSPYNEEEKIAWSIQIASKGWGGFVDALIDSTEYEQAFGDNTV
PYQRKRLTTDRPFSFTPRYGADYRDRAGIVRPGRMSNWNNSANQNYDGVAILGVLLAISAGMTFLFVLNWLGISSSF
>P74625 ~~~cpcL~~~Photosystem I-associated linker protein CpcL~~~COG0448
MTLPLIAYAPVSQNQRVTNYEVSGDEHARIFTTEGTLSPSAMDNLIAAAYRQVFNEQQMIQSNRQIALESQFKNQQITVR
DFIRGLALSDSFRRRNFEVNNNYRFVQMCIQRLLGRDVYSEEEKIAWSIVIATKGLPGFINELLNSQEYLENFGYDTVPY
QRRRILPQRISGELPFARMPRYGADHREKLEAIGYFRNQAPLTYRWEWQKQPYPAGVYLAGKVVLYVGGALVSLGIIAVA
LSAWGIIGL
>Q8YZ70 4.-.-.-~~~cpcS1~~~Phycocyanobilin lyase CpcS 1~~~
MNIEEFFELSAGKWFSHRTSHHLAFKQSEDGKSDLVIESLAADHPEVIKLCELYEVPASAASCGARVSWNGTMEWDEEKH
TGSTVLATVPDVDNPNEGRLLREMGYAEKAPVAGRYKMGDDGALTLTTEYETMWSEERLWFASPNLRMRVSVLKRFGGFS
MASFTSEIRMGGSPAAAKAEEAANSASS
>Q8YLK6 4.-.-.-~~~cpeS2~~~Putative phycocyanobilin lyase CpcS 2~~~
MKSLSKLVRTVDESQIIEFFQESVGEWCSQRRYYTLPDGETKEMMSMITIRFLEQGCDELQKLAQIHKLAESVFLICGAE
VTWCSTDVLKNRSESEGSTLFGALGNILYRDRGFATSKPVTAQYNFPNPKTLCLRTEYNGSVFEEELKLIGSKYRTRQTI
ISRAGEQLMIGQYIEKRIVQ
>A8HTM2 4.-.-.-~~~cpcS~~~Phycocyanobilin lyase subunit CpcS~~~
MQSFADAKEFFQYSAGQWQSRRVTHHLPFRRAESGGSNIQVETLEKDDPRIIEICQMHDMDASLSVGGSYVTWAGTMQWD
KDDENHEGSTVFALIPDADNPRQGKLLRERGYAEIVPVAGEYHLDHEDGLVLTTEYETMTIYERFWFANPDLRLRTSTVK
RFGGFNTTTFCMEERIQTSPVTATAAAETNPLYAISGW
>Q8YZ40 ~~~cpcT2~~~Phycocyanobilin lyase CpcT homolog~~~
MSFSPQLVNLGNYLAGEFDNREQALGEPIWFVHLRLWQRPVDLFSDDSITLFAEQANIVNLDRPYRQRILRLMPAPDSET
GLYVQYYMPKNPSALIGAGRHPDLLKTLTPQQLELLPGCVLSVSQQTVAPNSYQFTASPLPNTCCTFSYLENTVQVSLGF
AVTETELHTYDKGIDQETGKATWGAIVGPYRYTKREQY
>Q8YLF9 4.-.-.-~~~cpcT1~~~Phycocyanobilin lyase CpcT~~~
MTHSTDIATLARWMAADFSNQAQAFENPPFYAHIRVCMRPLPWEVLSGVGFFVEQAYDYMLNDPYRLRVLKLMIVGDRIH
IENYTVKQEENFYGASRDLNRLQTLTSESLEKLPGCNMIVEWTGNSFKGTVEPGKGCIVVRKGQKTYLDSEFEINEEKFI
SLDRGRDLETDAHIWGSVAGPFYFVRLHNFADEVKISAE
>B1XI94 4.-.-.-~~~cpcT~~~Phycocyanobilin lyase CpcT~~~
MSHSTDAHTLARWMAGEFSNEAQALANPPLWAHIKVCMRPLPNQFFDGYGLYLEQAYSSDTSAPYRLRLFHIKPVDDHME
LVHYKPKDDAKTKYMGAARNPAMMQHFDMADLDPMPGCDMIVTWSGTSFKGTVQAGKGCRVVRYNKESYLDNSFEITDNA
LISIDRGRDPVTNEILWGSLAGAFEFEKINNFSGEVQPH
>A8HTN2 4.-.-.-~~~cpcU~~~Phycocyanobilin lyase subunit CpcU~~~
MDINAFIHQSAGNWFAQRTFYQAHHPEPDNGKANLAFELLPLDHPEVSRFAAAIAQAPNEHWRVFQSSWDTSVDWGKPKA
VGSSLFAFVINPDQPTQGQAFSLDERTLAQGQYRLGEDQILIVTLEAGEVKIIERQWFGNENLRLRTNIVTGKTGVLQTA
FYSEIRRIIEPEKTAEVTAEASN
>P0AEW4 3.1.4.53~~~cpdA~~~3',5'-cyclic adenosine monophosphate phosphodiesterase CpdA~~~COG1409
MESLLTLPLAGEARVRILQITDTHLFAQKHEALLGVNTWESYQAVLEAIRPHQHEFDLIVATGDLAQDQSSAAYQHFAEG
IASFRAPCVWLPGNHDFQPAMYSALQDAGISPAKRVFIGEQWQILLLDSQVFGVPHGELSEFQLEWLERKLADAPERHTL
LLLHHHPLPAGCSWLDQHSLRNAGELDTVLAKFPHVKYLLCGHIHQELDLDWNGRRLLATPSTCVQFKPHCSNFTLDTIA
PGWRTLELHADGTLTTEVHRLADTRFQPDTASEGY
>Q8DEI1 3.1.4.53~~~cpdA~~~3',5'-cyclic adenosine monophosphate phosphodiesterase CpdA~~~
MQHTSSDTLSENSIKLLQITDTHLFASDEGSLLSVKTLQSFQAVVEQVMARHVEFDYLLATGDISQDHSAASYQRFADGI
APLEKACFWLPGNHDYKPNMSSVLPSPQITTPEQVELNAHWQLILLDSQVVGVPHGRLSDQQLLMLEHHLQASPEKNTLI
LLHHHPLLVGSAWLDQHTLKDAEAFWQIVERFPMVKGIVCGHVHQDMNVMHKGIRVMATPSTCVQFKPKSDDFALDTVSP
GWRELTLHANGEITTQVQRLASGSFLPDFTSSGY
>P08331 3.1.3.6~~~cpdB~~~2',3'-cyclic-nucleotide 2'-phosphodiesterase/3'-nucleotidase~~~COG0737
MIKFSATLLATLIAASVNAATVDLRIMETTDLHSNMMDFDYYKDTATEKFGLVRTASLINDARNEVKNSVLVDNGDLIQG
SPLADYMSAKGLKAGDIHPVYKALNTLDYTVGTLGNHEFNYGLDYLKNALAGAKFPYVNANVIDARTKQPMFTPYLIKDT
EVVDKDGKKQTLKIGYIGVVPPQIMGWDKANLSGKVTVNDITETVRKYVPEMREKGADVVVVLAHSGLSADPYKVMAENS
VYYLSEIPGVNAIMFGHAHAVFPGKDFADIEGADIAKGTLNGVPAVMPGMWGDHLGVVDLQLSNDSGKWQVTQAKAEARP
IYDIANKKSLAAEDSKLVETLKADHDATRQFVSKPIGKSADNMYSYLALVQDDPTVQVVNNAQKAYVEHYIQGDPDLAKL
PVLSAAAPFKVGGRKNDPASYVEVEKGQLTFRNAADLYLYPNTLIVVKASGKEVKEWLECSAGQFNQIDPNSTKPQSLIN
WDGFRTYNFDVIDGVNYQIDVTQPARYDGECQMINANAERIKNLTFNGKPIDPNAMFLVATNNYRAYGGKFAGTGDSHIA
FASPDENRSVLAAWIADESKRAGEIHPAADNNWRLAPIAGDKKLDIRFETSPSDKAAAFIKEKGQYPMNKVATDDIGFAI
YQVDLSK
>Q2YIF7 ~~~cpdR~~~Response regulator receiver protein CpdR~~~
MKRILLAEDDNDMRRFLVKALEKAGYHVTHFDNGASAYERLQEEPFSLLLTDIVMPEMDGIELARRATEIDPDLKIMFIT
GFAAVALNPDSDAPRDAKVLSKPFHLRDLVNEIEKMLIAA
>Q9ALZ8 4.-.-.-~~~cpeS~~~Putative phycoerythrobilin lyase CpeS~~~
METKVLMNITKFVANSIGHWRSQRSAHHLAFGHFEAVQSEIDIIALPHDDPAVIDLCKSYNIDPQTVVSPFRMTWEGQSD
WDDSEIKGTCVLVPIPDPDSPHRGKLLRSRLCRNNRCRGDYYFTEHGTFVLVTAYERAAAEEKIWFVNPNVRCLCVSLIK
TSVRFSELLLPHFLQKFARIFRIKLYMTIFKSYRYKLRACSLYINAIALALPCHFFINTGA
>Q7V2Z2 4.-.-.-~~~cpeS~~~Phycoerythrobilin lyase CpeS~~~
MTKNLITINQFIQKSLGEWKSIRSTHSLAFQEVENSTSKIEIKELESNNKNVLGLLEKYNYTSKPSFIALSISWKAISDW
EIDQKIEQDKTILLFLPKDKNKGIVLRNKGYTESVISSSEYLIDENENLNIKTIYSSTASEERICFLSNHIRSRYSVIRN
NENNTVIQTSHTSEIRNMSILKD
>Q9ALZ7 4.-.-.-~~~cpeT~~~Probable phycoerythrobilin lyase CpeT~~~
MTSSLPKIPDTVSPNLITLARWMAGDFSNYQQAFENSKDYAHIHVFFRPLPFEFFSGIGLYSEQVYDYDLWRPYRQGVHR
LIDKGDEIYIENYSLKQALYYAGAARDLNILKTITPNCIERRYHCSMIFKREGDKFIGGVEPGNLCLIEKNGCQTYLDSY
VEITETTWVSLDKGMDVNTHQQVWGSTFGPLRFEKRESFADEIPNIL
>Q81U22 4.99.1.9~~~cpfC1~~~Coproporphyrin III ferrochelatase 1~~~COG0276
MKKKIGLLVMAYGTPYKEEDIERYYTHIRRGRKPSPEMLEDLTERYRAIGGISPLATITLEQAKKLEKRLNEVQDEVEYH
MYLGLKHIEPFIEDAVKEMHNDGIQDAIALVLAPHYSTFSVKSYVGRAQEEAEKLGNLTIHGIDSWYKEPKFIQYWVDAV
KSIYSGMSDAEREKAVLIVSAHSLPEKIIAMGDPYPDQLNETADYIARGAEVANYAVGWQSAGNTPDPWIGPDVQDLTRE
LNEKYGYTSFVYAPVGFVAEHLEVLYDNDFECKVVTDEIGAKYYRPEMPNASDAFIDCLTDVVVKKKESVM
>P32396 4.99.1.9~~~cpfC~~~Coproporphyrin III ferrochelatase~~~COG0276
MSRKKMGLLVMAYGTPYKEEDIERYYTHIRRGRKPEPEMLQDLKDRYEAIGGISPLAQITEQQAHNLEQHLNEIQDEITF
KAYIGLKHIEPFIEDAVAEMHKDGITEAVSIVLAPHFSTFSVQSYNKRAKEEAEKLGGLTITSVESWYDEPKFVTYWVDR
VKETYASMPEDERENAMLIVSAHSLPEKIKEFGDPYPDQLHESAKLIAEGAGVSEYAVGWQSEGNTPDPWLGPDVQDLTR
DLFEQKGYQAFVYVPVGFVADHLEVLYDNDYECKVVTDDIGASYYRPEMPNAKPEFIDALATVVLKKLGR
>Q8Y565 4.99.1.9~~~cpfC~~~Coproporphyrin III ferrochelatase~~~COG0276
MTKKVGLLVMAYGTPYKDEDIERYYTDIRHGHKPSEEMIADLRGRYHAIGGLSPLAKITEAQAYGLEKALNDSQDEVEFK
AYIGLKHIEPFIEDAVEAMHKDGIEEAISIVLAPHYSSFSVEAYNKRAKEAADKLGGPRINAINDWYKQPKFIQMWADRI
NETAKQIPADELLDTVLIVSAHSLPEKIKQHNDPYPNQLQETADFIFEKVVVPHYALGWQSEGKTGEPWLGPDVQDLTRE
LYGREKYKHFIYTPVGFVAEHLEVLYDNDYECKVVTDEVGAAYHRPPMPNSDPEFLEVLRTVVWEKYSN
>P9WNE3 4.99.1.9~~~cpfC~~~Coproporphyrin III ferrochelatase~~~COG0276
MQFDAVLLLSFGGPEGPEQVRPFLENVTRGRGVPAERLDAVAEHYLHFGGVSPINGINRTLIAELEAQQELPVYFGNRNW
EPYVEDAVTAMRDNGVRRAAVFATSAWSGYSSCTQYVEDIARARRAAGRDAPELVKLRPYFDHPLFVEMFADAITAAAAT
VRGDARLVFTAHSIPTAADRRCGPNLYSRQVAYATRLVAAAAGYCDFDLAWQSRSGPPQVPWLEPDVTDQLTGLAGAGIN
AVIVCPIGFVADHIEVVWDLDHELRLQAEAAGIAYARASTPNADPRFARLARGLIDELRYGRIPARVSGPDPVPGCLSSI
NGQPCRPPHCVASVSPARPSAGSP
>Q2FXA4 4.99.1.9~~~cpfC~~~Coproporphyrin III ferrochelatase~~~COG0276
MTKKMGLLVMAYGTPYKESDIEPYYTDIRHGKRPSEEELQDLKDRYEFIGGLSPLAGTTDDQADALVSALNKAYADVEFK
LYLGLKHISPFIEDAVEQMHNDGITEAITVVLAPHYSSFSVGSYDKRADEEAAKYGIQLTHVKHYYEQPKFIEYWTNKVN
ETLAQIPEEEHKDTVLVVSAHSLPKGLIEKNNDPYPQELEHTALLIKEQSNIEHIAIGWQSEGNTGTPWLGPDVQDLTRD
LYEKHQYKNFIYTPVGFVCEHLEVLYDNDYECKVVCDDIGANYYRPKMPNTHPLFIGAIIDEIKSIF
>P64125 4.99.1.9~~~cpfC~~~Coproporphyrin III ferrochelatase~~~
MTKKMGLLVMAYGTPYKESDIEPYYTDIRHGKRPSEEELQDLKDRYEFIGGLSPLAGTTDDQADALVSALNKAYADVEFK
LYLGLKHISPFIEDAVEQMHNDGITEAITVVLAPHYSSFSVGSYDKRADEEAAKYGIQLTHVKHYYEQPKFIEYWTNKVN
ETLAQIPEEEHKDTVLVVSAHSLPKGLIEKNNDPYPQELEHTALLIKEQSNIEHIAIGWQSEGNTGTPWLGPDVQDLTRD
LYEKHQYKNFIYTPVGFVCEHLEVLYDNDYECKVVCDDIGANYYRPKMPNTHPLFIGAIVDEIKSIF
>P28784 3.4.22.37~~~rgpA~~~Gingipain R1~~~
MKNLNKFVSIALCSSLLGGMAFAQQTELGRNPNVRLLESTQQSVTKVQFRMDNLKFTEVQTPKGMAQVPTYTEGVNLSEK
GMPTLPILSRSLAVSDTREMKVEVVSSKFIEKKNVLIAPSKGMIMRNEDPKKIPYVYGKSYSQNKFFPGEIATLDDPFIL
RDVRGQVVNFAPLQYNPVTKTLRIYTEITVAVSETSEQGKNILNKKGTFAGFEDTYKRMFMNYEPGRYTPVEEKQNGRMI
VIVAKKYEGDIKDFVDWKNQRGLRTEVKVAEDIASPVTANAIQQFVKQEYEKEGNDLTYVLLVGDHKDIPAKITPGIKSD
QVYGQIVGNDHYNEVFIGRFSCESKEDLKTQIDRTIHYERNITTEDKWLGQALCIASAEGGPSADNGESDIQHENVIANL
LTQYGYTKIIKCYDPGVTPKNIIDAFNGGISLVNYTGHGSETAWGTSHFGTTHVKQLTNSNQLPFIFDVACVNGDFLFSM
PCFAEALMRAQKDGKPTGTVAIIASTINQSWASPMRGQDEMNEILCEKHPNNIKRTFGGVTMNGMFAMVEKYKKDGEKML
DTWTVFGDPSLLVRTLVPTKMQVTAPAQINLTDASVNVSCDYNGAIATISANGKMFGSAVVENGTATINLTGLTNESTLT
LTVVGYNKETVIKTINTNGEPNPYQPVSNLTATTQGQKVTLKWDAPSTKTNATTNTARSVDGIRELVLLSVSDAPELLRS
GQAEIVLEAHDVWNDGSGYQILLDADHDQYGQVIPSDTHTLWPNCSVPANLFAPFEYTVPENADPSCSPTNMIMDGTASV
NIPAGTYDFAIAAPQANAKIWIAGQGPTKEDDYVFEAGKKYHFLMKKMGSGDGTELTISEGGGSDYTYTVYRDGTKIKEG
LTETTYRDAGMSAQSHEYCVEVKYAAGVSPKVCVDYIPDGVADVTAQKPYTLTVVGKTITVTCQGEAMIYDMNGRRLAAG
RNTVVYTAQGGYYAVMVVVDGKSYVEKLAVK
>P95493 3.4.22.37~~~rgpB~~~Gingipain R2~~~COG2957
MKKNFSRIVSIVAFSSLLGGMAFAQPAERGRNPQVRLLSAEQSMSKVQFRMDNLQFTGVQTSKGVAQVPTFTEGVNISEK
GTPILPILSRSLAVSETRAMKVEVVSSKFIEKKDVLIAPSKGVISRAENPDQIPYVYGQSYNEDKFFPGEIATLSDPFIL
RDVRGQVVNFAPLQYNPVTKTLRIYTEIVVAVSETAEAGQNTISLVKNSTFTGFEDIYKSVFMNYEATRYTPVEEKENGR
MIVIVPKKYEEDIEDFVDWKNQRGLRTEVKVAEDIASPVTANAIQQFVKQEYEKEGNDLTYVLLVGDHKDIPAKITPGIK
SDQVYGQIVGNDHYNEVFIGRFSCESKEDLKTQIDRTIHYERNITTEDKWLGQALCIASAEGGPSADNGESDIQHENIIA
NLLTQYGYTKIIKCYDPGVTPKNIIDAFNGGISLANYTGHGSETAWGTSHFGTTHVKQLTNSNQLPFIFDVACVNGDFLY
NVPCFAEALMRAQKDGKPTGTVAIIASTINQSWASPMRGQDEMNEILCEKHPNNIKRTFGGVTMNGMFAMVEKYKKDGEK
MLDTWTVFGDPSLLVRTLVPTKMQVTAPANISASAQTFEVACDYNGAIATLSDDGDMVGTAIVKDGKAIIKLNESIADET
NLTLTVVGYNKVTVIKDVKVEGTSIADVANDKPYTVAVSGKTITVESPAAGLTIFDMNGRRVATAKNRMVFEAQNGVYAV
RIATEGKTYTEKVIVK
>P56947 6.3.2.29~~~cphA~~~Cyanophycin synthetase~~~
MKILKTQTLRGPNYWSIRRQKLIQMRLDLEDVAEKPSNLIPGFYEGLVKILPSLVEHFCSRDHRGGFLERVQEGTYMGHI
VEHIALELQELAGMPVGFGRTRETSTPGIYNVVFEYVYEEAGRYAGRVAVRLCNSIITTGAYGLDELAQDLSDLKDLRAN
SALGPSTETIIKEAEARQIPWMLLSARAMVQLGYGANQQRIQATLSNKTGILGVELACDKEGTKTTLAEAGIPVPRGTVI
YYADELADAIADVGGYPIVLKPLDGNHGRGITIDINSQQEAEEAYDLASAASKTRSVIVERYYKGNDHRVLVINGKLVAV
SERIPAHVTGNGSSTIEELIQETNEHPDRGDGHDNVLTRISIDRTSLGVLKRQGFEMDTVLKKGEVAYLRATANLSTGGI
AIDRTDEIHPQNIWIAERVAKIIGLDIAGIDVVTPDITKPLTEVDGVIVEVNAAPGFRMHVAPSQGLPRNVAAPVIDMLF
PDNHPSRIPILAVTGTNGKTTTTRLLAHIYRQTGKVVGYTSTDGIYLGDYMVEKGDNTGPVSAGVILRDPTVEVAVLECA
RGGILRSGLAFESCDVGVVLNVAEDHLGLGDIDTIEQMAKVKGVIAESVNADGYAVLNADDPLVAQMAKNVKGKIAYFSM
SKDNPIIIDHLRRNGMAAVYENGYLSIFEGEWTLRIEKAENIPVTMKAMAPFMIANALAASLAAFVHGIDIELIRQGVRS
FNPGANQTPGRMNLFDMKDFSVLIDYAHNPAGYLAVGSFVKNWKGDRLGVIGGPGDRRDEDLMLLGKIASQIFDHIIIKE
DDDNRGRDRGTVADLIAKGIVAENPNASYDDILDETEAIETGLKKVDKGGLVVIFPESVTGSIEMIEKYHLSSE
>O86109 6.3.2.29~~~cphA~~~Cyanophycin synthetase~~~COG0189
MRILKIQTLRGPNYWSIRRHKLIVMRLDLETLAETPSNEIPGFYEGLVEALPSLEGHYCSPGCHGGFLMRVREGTMMGHI
VEHVALELQELAGMHVGFGRTRETATPGIYQVVIEYLNEEAGRYAGRAAVRLCQSIVDRGRYPKAELEQDIQDLKDLWRD
ASLGPSTEAIVKEAEKRGIPWMQLSARFLIQLGYGVNHKRMQATMTDKTGILGVELACDKEATKRILAASGVPVPRGTVI
NFLDDLEEAIEYVGGYPIVIKPLDGNHGRGITIDIRSWEEAEAAYEAARQVSRSIIVERYYVGRDHRVLVVDGKVVAVAE
RVPAHVIGNGRSTIAELIEEINQDPNRGDGHDKVLTKIELDRTSYQLLERAGYTLNSVPPKGTICYLRATANLSTGGTAV
DRTDEIHPENIWLAQRVVKIIGLDIAGLDIVTTDISRPLRELDGVIVEVNAAPGFRMHVAPSQGIPRNVAGAVMDMLFPN
EQSGRIPILSVTGTNGKTTTTRLLAHIYKQTGKVVGYTTTDGTYIGDYLVESGDNTGPQSAHVILQDPTVEVAVLETARG
GILRSGLGFESANVGVVLNVAADHLGIGDIDTIDQLANLKSVVAESVYPDGYAVLNADDRRVAAMAEKTKANIAYFTMNP
DSELVRKHIQKGGVAAVYENGYLSIVKGDWTHRIERAEQIPLTMGGRAPFMIANALAASLAAFVQNVSIEQIRAGLRTFR
ASVSQTPGRMNLFNLGNYHALVDYAHNPASYEAVGAFVRNWTSGQRIGVVGGPGDRRDEDFVTLGKLAAEIFDYIIVKED
DDTRGRPRGSASALITKGITQVKPDARYESILDETQAINKGLDMAPANGLVVILPESVSRAIKLIKLRGLVKEEIQQQNP
STTVIDNQNGVASSSVINTLL
>P73832 3.4.15.6~~~cphB~~~Cyanophycinase~~~COG4242
MPLSSQPAILIIGGAEDKVHGREILQTFWSRSGGNDAIIGIIPSASREPLLIGERYQTIFSDMGVKELKVLDIRDRAQGD
DSGYRLFVEQCTGIFMTGGDQLRLCGLLADTPLMDRIRQRVHNGEISLAGTSAGAAVMGHHMIAGGSSGEWPNRALVDMA
VGLGIVPEIVVDQHFHNRNRMARLLSAISTHPELLGLGIDEDTCAMFERDGSVKVIGQGTVSFVDARDMSYTNAALVGAN
APLSLHNLRLNILVHGEVYHQVKQRAFPRVT
>Q8KQN8 3.4.15.6~~~cphE~~~Cyanophycinase~~~
MIRSFIRSSALLLALLPVTGYSAGPLILVGGGLKDDNTAIYQRLIQLAGGNGQARIGVITAASIPESDDPDAGTADAANS
KANGEFYAQLLETYGAADAQWIPIDLDQISNNSNPQVVAQINSMTGFFFGGGDQSRLTQTLQTATRADSPALAAIRARHN
AGAVLAGTSAGTAIMVQGPMVTGGESYDGLRYGVYTTPSGDDLSYDMQGGFGFFNYGLLDTHFSERGRQGRIVRLADHTQ
VPFAFGVDENTALLVQNNATLGQVEMEVIGENGVFIFDLRNKERGTGSTYALYDVLGSYLTAGDRYRPVTGQFVIASGKT
SLRGRERYSAAMTVTTDIFSSPNNSGANGRRKPREFVKVSADLFDSRVTSTLGRTYETNPLSRRSVQKHAVRQPWLPGHR
WRQEHAVLPAFADGFPS
>Q6F9U4 ~~~cpiA~~~PilB-specific inhibitory protein CpiA~~~
MIMKLVSDQIQERKFIIEFQLFNLIDELGLSVKKDHERKIFIYALLYVQLMHQPYSAMNTISQLEPVTTYEKIKISYYTN
YLSTYLSTSYSQERERIISIISKTFIDSINPEMNLYNFNDFVTMFAFKKLLQQVFHEN
>Q8GFE2 ~~~~~~Chlorophenol reductase~~~
MKKTLGIILSISLAFSVLALPIFAAVDTTTSATVEAATPAAPATPAATAPAAAPSVDTSKIAAGTYYTVVSGDFFWQIAA
KHGLTIDALAKLNPQIKNVNLIFPGQKILVKAEEAAAASTSTSTAAVAPAAKKLYQGIGMAANYRDNTARQKDHDNLNIT
TVAALFDDAGKIVKLQFDVVEILPDMFPGWMDPEAADKSFYKDAQANGFNWETKKEEGDAYGMKASAVSGKEWWEQMNFY
EEYFKGKTVAEVQDWFAKYCDANGRPYKMAYPEKLTDADKAKVATFTEAEKKMLVDVTTGATMSLQDPHSRFIDALVKAY
EVRKEVK
>Q8GAW0 1.14.13.16~~~cpnB~~~Cyclopentanone 1,2-monooxygenase~~~
MTTMTTMTTEQLGMNNSVNDKLDVLLIGAGFTGLYQLYHLRKLGYKVHLVDAGADIGGIWHWNCYPGARVDTHCQIYQYS
IPELWQEFNWKELFPNWAQMREYFHFADKKLDLSKDISFNTRVQSAVFDEGTREWTVRSIGHQPIQARFVIANLGFGASP
STPNVDGIETFKGQWYHTALWPQEGVNMAGKRVAIIGTGSSGVQVAQEAALDAKQVTVYQRTPNLALPMHQKQLSAEDNL
RMKPELPAAFERRGKCFAGFDFDFIAKNATELSAAERTEILEELWNAGGFRYWLANFQDYLFDDKANDYVYEFWRDKVRA
RIKDPKVAEKLAPMKKPHPYGAKRPSLEQWYYEIFNQNNVTLVDVNETPVLRITEKGIVTAEGEAEFDLIVFATGFDAVT
GGLTSIDFRNNQGQSFKDVWSDGIRTQLGVATAGFPNLLFGYGPQSPAGFCNGPSSAEYQGDLLIQLMNYLRDNNISRIE
AQSEAQEEWSKLIADFWDSSLFPRAKSWYQGSNIPGKKVESLNFPLGLPTYISKFNESAEKGYAGFSLAS
>Q8GAV9 1.1.1.163~~~cpnA~~~Cyclopentanol dehydrogenase~~~
MGRVNDKVVLVTGGAMGMGLTHCTLLAREGATVYLSDMNEELGHQAVAEIRRQGGKAHFLHLDVTNENHWTGAVDTILAE
SDRLDALVNNAGILTLKPVQDTSNEEWDRIFEINVRSVFLGTRAVIEPMRKAHKGCIVNVSSIYGLVGAPGAAAYEASKG
AVRLFTKACAVDLAPFNIRVNSVHPGVIATPMTQQILDAPQSARALLGPTLLGRAAQPMEVSQAVLFLVSDEASFVHGSE
LVVDGGYTAN
>Q937L4 1.1.1.163~~~cpnA~~~Cyclopentanol dehydrogenase~~~
MGRVNDKVVLVTGGAMGMGLTHCTLLAREGATVYLSDMNEELGHQAVAEIRRQGGKAHFLHLDVTNENHWTGAVDTILAE
SDRLDALVNNAGILTLKPVQDTSNEEWDRIFEINVRSVFLGTRAVIEPMRKAHKGCIVNVSSIYGLVGAPGAAAYEASKG
AVRLFTKACAVDLAPFNIRVNSVHPGVIATPMTQQILDAPQSARALLGPTLLGRAAQPMEVSQAVLFLVSDEASFVHGSE
LVVDGGYTAN
>O05442 ~~~cpnT~~~Outer membrane channel protein CpnT~~~COG0803
MAPLAVDPAALDSAGGAVVAAGAGLGAVISSLTAALAGCAGMAGDDPAGAVFGRSYDGSAAALVQAMSVARNGLCNLGDG
VRMSAHNYSLAEAMSDVAGRAAPLPAPPPSGCVGVGAPPSAVGGGGGAPKGWGWVAPYIGMIWPNGDSTKLRAAAVAWRS
AGTQFALTEIQSTAGPMGVIRAQQLPEAGLIESAFADAYASTTAVVGQCHQLAAQLDAYAARIDAVHAAVLDLLARICDP
LTGIKEVWEFLTDQDEDEIQRIAHDIAVVVDQFSGEVDALAAEITAVVSHAEAVITAMADHAGKQWDRFLHSNPVGVVID
GTGQQLKGFGEEAFGMAKDSWDLGPLRASIDPFGWYRSWEEMLTGMAPLAGLGGENAPGVVESWKQFGKSLIHWDEWTTN
PNEALGKTVFDAATLALPGGPLSKLGSKGRDILAGVRGLKERLEPTTPHLEPPATPPRPGPQPPRIEPPESGHPAPAPAA
KPAPVPANGPLPHSPTESKPPPVDRPAEPVAPSSASAGQPRVSAATTPGTHVPHGLPQPGEHVPAQAPPATTLLGGPPVE
SAPATAHQPQWATTPAAPAAAPHSTPGGVHSTESGPHGRSLSAHGSEPTHDGASHGSGHGSGSEPPGLHAPHREQQLAMH
SNEPAGEGWHRLSDEAVDPQYGEPLSRHWDFTDNPADRSRINPVVAQLMEDPNAPFGRDPQGQPYTQERYQERFNSVGPW
GQQYSNFPPNNGAVPGTRIAYTNLEKFLSDYGPQLDRIGGDQGKYLAIMEHGRPASWEQRALHVTSLRDPYHAYTIDWLP
EGWFIEVSEVAPGCGQPGGSIQVRIFDHQNEMRKVEELIRRGVLRQ
>P33912 1.11.1.-~~~bpoA1~~~Non-heme chloroperoxidase CPO-A1~~~COG2267
MPICTTRDGVEIFYKDWGQGRPVVFIHGWPLNGDAWQDQLKAVVDAGYRGIAHDRRGHGHSTPVWDGYDFDTFADDLNDL
LTDLDLRDVTLVAHSMGGGELARYVGRHGTGRLRSAVLLSAIPPVMIKSDKNPDGVPDEVFDALKNGVLTERSQFWKDTA
EGFFSANRPGNKVTQGNKDAFWYMAMAQTIEGGVRCVDAFGYTDFTEDLKKFDIPTLVVHGDDDQVVPIDATGRKSAQII
PNAELKVYEGSSHGIAMVPGDKEKFNRDLLEFLNK
>P45955 ~~~cpoB~~~Cell division coordinator CpoB~~~COG1729
MSSNFRHQLLSLSLLVGIAAPWAAFAQAPISSVGSGSVEDRVTQLERISNAHSQLLTQLQQQLSDNQSDIDSLRGQIQEN
QYQLNQVVERQKQILLQIDSLSSGGAAAQSTSGDQSGAAASTTPTADAGTANAGAPVKSGNANTDYNAAIALVQDKSRQD
DAMVAFQNFIKNYPDSTYLPNANYWLGQLNYNKGKKDDAAYYFASVVKNYPKSPKAADAMFKVGVIMQDKGDTAKAKAVY
QQVISKYPGTDGAKQAQKRLNAM
>P11435 2.7.8.23~~~bcpA~~~Carboxyvinyl-carboxyphosphonate phosphorylmutase~~~
MAVTKARTFRELMNAPEILVVPSAYDALSAKVIQQAGFPAVHMTGSGTSASMLGLPDLGFTSVSEQAINLKNIVLTVDVP
VIMDADAGYGNAMSVWRATREFERVGIVGYHLEDQVNPKRCGHLEGKRLISTEEMTGKIEAAVEAREDEDFTIIARTDAR
ESFGLDEAIRRSREYVAAGADCIFLEAMLDVEEMKRVRDEIDAPLLANMVEGGKTPWLTTKELESIGYNLAIYPLSGWMA
AASVLRKLFTELREAGTTQKFWDDMGLKMSFAELFEVFEYSKISELEARFVRDQD
>P81594 3.8.1.-~~~cprA~~~3-chloro-4-hydroxyphenylacetate reductive dehalogenase~~~
MENNQKRQQSGMSRRSFLKVGAAATTMGVIGAIKAPAKVANAAETLNYVPGSGKIRSKLRPVHDFAGAKVRFVENNDQWL
GTTKILTKVKMTSEADAGFMQAVRGLYGPEPQKGFFQFIAKDPFGGSISWARNLIAPEDVVDGAPEATKTPIPDPEQMSQ
HIRDCCYFLRADEVGIGKMPEYGYYTHHVADTVGLMTKPVEECVTPVTKIYPNVIVVMIDQGIETMWASTGYDGISGAMS
MKSYFTSGCIAVILAKYIRTLGYNARAHHAKNYEAIMPVCIMAAGLGELSRTGDSAIHPRLGFRHKVAAVTTDLPLAPDQ
PLDFGLLDFCRVCKKCADNCPNEAISFDEDPIEYNGYLRWNSDFRKCTEFRTTNEEGSSCGTCMKVCPWNSKEDSWFHKA
GVWVGSKGETASTFLKSIDDIFGYGTETIEKYKWWLEWPEKYVMK
>Q69AA9 2.7.-.-~~~~~~Capsular polysaccharide phosphotransferase cps12A~~~
MNKMNRKFSKLLKNPHIFFRDFLNKKYPIKNTELPFSESEEANLIEANQKLDKIIQKNTLQQANIDVVFTWVDGSDPSWQ
AKYSQYAPNYQAKSALYATDIARFEDHNELYYSVHAVLKYMPWVRHIFIITDNQKPKWLDETRQEKITLIDHQDIIDKEY
LPTFNSHVIEAFLHKIPNLSENFIYFNDDVFIARELQAEHFFQANGIASIFVSEKSLSKMRDKGIITPTLSASEYSIRLL
NKYYDTNIDSPLVHTYIPLKKSMYELAWLRYEKAILGFLPNKLRTNNDLNFANFLIPWLMYFEGKAMPKIDICYYFNIRS
PNAISLYKKLLLKQQMGEEPNSFCANDFNSNYSIENYRNNLISTLNNYYKF
>Q59831 1.14.-.-~~~~~~Cytochrome P450 105A3~~~
MTEMTEKATTFLTSQEAPAFPADRTCPYQLPTAYSRLRDEPDALRPVTLYDGRRAWVVTKHEAARRLLADPRLSSDRLHA
DFPATSPRFKAFRQGSPAFIGMDPPEHGTRRRMTISEFTVKRIKGMRPDVERIVHGFIDDMLAAGPTADLVSQFALPVPS
MVICHMLGVPYADHEFFQDASKRLVQAVDADSAVAARDDFERYLDGLITKLESEPGTGLLGKLVTHQLADGEIDRAELIS
TALLLLVAGHETTASMTSLSVITLLEHPDQHAALRADPSLVPGAVEELLRVLAIADIAGGRIATADIEIDGQLIRAGEGV
IVTNSIANRDSSVFENPDRLDVHRSARHHLSFGYGVHQCLGQNLARLELEVILTVLFDRIPTLRLAVPVEQLTLRPGTTI
QGVNELPVTW
>Q54518 3.1.3.48~~~cpsB~~~Tyrosine-protein phosphatase CpsB~~~
MIDIHSHIVFDVDDGPKSREESKALLAESYRQGVRTIVSTSHRRKGMFETPEEKIAENFLQVREIAKEVADDLVIAYGAE
IYYTLDALEKLEKKEIPTLNDSRYALIEFSMHTSYRQIHTGLSNILMLGITPVIAHIERYDALENNEKRVRELIDMGCYT
QINSYHVSKPKFFGEKYKFMKKRARYFLERDLVHVVASDMHNLDSRPPYMQQAYDIIAKKYGAKKAKELFVDNPRKIIMD
QLI
>Q9AHD4 3.1.3.48~~~cpsB~~~Tyrosine-protein phosphatase CpsB~~~COG4464
MIDIHSHIVFDVDDGPKSREESKALLAESYRQGVRTIVSTSHRRKGMFETPEEKIAENFLQVREIAKEVASDLVIAYGAE
IYYTPDVLDKLEKKRIPTLNDSRYALIEFSMNTPYRDIHSALSKILMLGITPVIAHIERYDALENNEKRVRELIDMGCYT
QVNSSHVLKPKLFGERYKFMKKRAQYFLEQDLVHVIASDMHNLDGRPPHMAEAYDLVTQKYGEAKAQELFIDNPRKIVMD
QLI
>Q54520 2.7.10.2~~~cpsD~~~Tyrosine-protein kinase CpsD~~~
MPTLEIAQKKLEFIKKAEEYYNALCTNIQLSGDKLKVISVTSVNPGEGKTTTSVNIARSFARAGYKTLLIDGDTRNSVMS
GFFKSREKITGLTEFLSGTADLSHGLCDTNIENLFVVQSGTVSPNPTALLQSKNFNDMIETLRKYFDYIIVDTAPIGIVI
DAAIITQKCDASILVTATGEVNKRDVQKAKQQLEQTGKLFLGVVFNKLDISVDKYGVYGFYGNYGKK
>P9WGD1 2.7.-.-~~~cpsY~~~Exopolysaccharide phosphotransferase CpsY~~~COG0438
MPKISSRDGGRPAQRTVNPIIVTRRGKIARLESGLTPQEAQIEDLVFLRKVLNRADIPYLLIRNHKNRPVLAINIELRAG
LERALAAACATEPMYAKTIDEPGLSPVLVATDGLSQLVDPRVVRLYRRRIAPGGFRYGPAFGVELQFWVYEETVIRCPVE
NSLSRKVLPRNEITPTNVKLYGYKWPTLDGMFAPHASDVVFDIDMVFSWVDGSDPEFRARRMAQMSQYVVGEGDDAEARI
RQIDELKYALRSVNMFAPWIRRIFIATDSTPPPWLAEHPKITIVRAEDHFSDRSALPTYNSHAVESQLHHIPGLSEHFLY
SNDDMFFGRPLKASMFFSPGGVTRFIEAKTRIGLGANNPARSGFENAARVNRQLLFDRFGQVITRHLEHTAVPLRKSVLI
EMEREFPEEFARTAASPFRSDTDISVTNSFYHYYALMTGRAVPQEKAKVLYVDTTSYAGLRLLPKLRKHRGYDFFCLNDG
SFPEVPAAQRAERVVSFLERYFPIPAPWEKIAADVSRRDFAVPRTSAPSEGA
>Q7CPC0 2.7.-.-~~~cptA~~~Phosphoethanolamine transferase CptA~~~
MQSTLLQTKPAFSWKALGWALLYFWFFSTLLQAIIYLTGYSGTNGLRDSLLYSSLWLIPVFLFPGRIRVIAAVIGVVLWA
ASLAALSYYVIYGQEFSQSVLFVMFETNANEASEYLSQYFSLKIVLVALAYTVAAILLWTRLRPVYIPSPWRYLVSFALL
YGLILHPIAMNTFIKHKSMEKTLDSLASRMEPAAPWQFITGYYQYRLQLASLNKLLNENDALPPLANFQDHSGDAPRTLV
LVIGESTQRGRMSLYGYPRETTPELDALHKTDPGLTVFNNVVTSRPYTIEILQQALTFADEKNPDWYLTKPSLMNMMKQA
GYKTFWITNQQTMTARNTMLTVFSKQTDKQFYMNQQRTQSAREYDSNVLAPFKAVLADPAPKKFIIVHLLGTHIKYKFRY
PENQGKFDGKTDHVPPGLSSDELESYNDYDNANLYNDYVVASLIKDYKATDPNGFLLYFSDHGEEVYDTPPHKTQGRNED
SPTRHMYTVPFLLWTSEKWQAAHPRDFSQDVDRKYSSSELIHTWSDLAGLTYDGYDPTRSITNPQFKETTRWIGNPYKKN
ALIDYDTLPYGDQVGNQ
>Q56148 2.7.1.-~~~~~~Chloramphenicol 3-O phosphotransferase~~~COG3896
MTTRMIILNGGSSAGKSGIVRCLQSVLPEPWLAFGVDSLIEAMPLKMQSAEGGIEFDADGGVSIGPEFRALEGAWAEGVV
AMARAGARIIIDDVFLGGAAAQERWRSFVGDLDVLWVGVRCDGAVAEGRETARGDRVAGMAAKQAYVVHEGVEYDVEVDT
THKESIECAWAIAAHVVP
>C4B644 1.14.15.15~~~vdh~~~Vitamin D(3) 25-hydroxylase~~~
MALTTTGTEQHDLFSGTFWQNPHPAYAALRAEDPVRKLALPDGPVWLLTRYADVREAFVDPRLSKDWRHTLPEDQRADMP
ATPTPMMILMDPPDHTRLRKLVGRSFTVRRMNELEPRITEIADGLLAGLPTDGPVDLMREYAFQIPVQVICELLGVPAED
RDDFSAWSSVLVDDSPADDKNAAMGKLHGYLSDLLERKRTEPDDALLSSLLAVSDEDGDRLSQEELVAMAMLLLIAGHET
TVNLIGNGVLALLTHPDQRKLLAEDPSLISSAVEEFLRFDSPVSQAPIRFTAEDVTYSGVTIPAGEMVMLGLAAANRDAD
WMPEPDRLDITRDASGGVFFGHGIHFCLGAQLARLEGRVAIGRLFADRPELALAVGLDELVYRESTLVRGLSRMPVTMGP
RSA
>P0AE82 2.7.13.3~~~cpxA~~~Sensor histidine kinase CpxA~~~COG2205
MIGSLTARIFAIFWLTLALVLMLVLMLPKLDSRQMTELLDSEQRQGLMIEQHVEAELANDPPNDLMWWRRLFRAIDKWAP
PGQRLLLVTTEGRVIGAERSEMQIIRNFIGQADNADHPQKKKYGRVELVGPFSVRDGEDNYQLYLIRPASSSQSDFINLL
FDRPLLLLIVTMLVSTPLLLWLAWSLAKPARKLKNAADEVAQGNLRQHPELEAGPQEFLAAGASFNQMVTALERMMTSQQ
RLLSDISHELRTPLTRLQLGTALLRRRSGESKELERIETEAQRLDSMINDLLVMSRNQQKNALVSETIKANQLWSEVLDN
AAFEAEQMGKSLTVNFPPGPWPLYGNPNALESALENIVRNALRYSHTKIEVGFAVDKDGITITVDDDGPGVSPEDREQIF
RPFYRTDEARDRESGGTGLGLAIVETAIQQHRGWVKAEDSPLGGLRLVIWLPLYKRS
>P00183 1.14.15.1~~~camC~~~Camphor 5-monooxygenase~~~
MTTETIQSNANLAPLPPHVPEHLVFDFDMYNPSNLSAGVQEAWAVLQESNVPDLVWTRCNGGHWIATRGQLIREAYEDYR
HFSSECPFIPREAGEAYDFIPTSMDPPEQRQFRALANQVVGMPVVDKLENRIQELACSLIESLRPQGQCNFTEDYAEPFP
IRIFMLLAGLPEEDIPHLKYLTDQMTRPDGSMTFAEAKEALYDYLIPIIEQRRQKPGTDAISIVANGQVNGRPITSDEAK
RMCGLLLVGGLDTVVNFLSFSMEFLAKSPEHRQELIERPERIPAACEELLRRFSLVADGRILTSDYEFHGVQLKKGDQIL
LPQMLSGLDERENACPMHVDFSRQKVSHTTFGHGSHLCLGQHLARREIIVTLKEWLTRIPDFSIAPGAQIQHKSGIVSGV
QALPLVWDPATTKAV
>P14779 ~~~~~~Bifunctional cytochrome P450/NADPH--P450 reductase~~~
MTIKEMPQPKTFGELKNLPLLNTDKPVQALMKIADELGEIFKFEAPGRVTRYLSSQRLIKEACDESRFDKNLSQALKFVR
DFAGDGLFTSWTHEKNWKKAHNILLPSFSQQAMKGYHAMMVDIAVQLVQKWERLNADEHIEVPEDMTRLTLDTIGLCGFN
YRFNSFYRDQPHPFITSMVRALDEAMNKLQRANPDDPAYDENKRQFQEDIKVMNDLVDKIIADRKASGEQSDDLLTHMLN
GKDPETGEPLDDENIRYQIITFLIAGHETTSGLLSFALYFLVKNPHVLQKAAEEAARVLVDPVPSYKQVKQLKYVGMVLN
EALRLWPTAPAFSLYAKEDTVLGGEYPLEKGDELMVLIPQLHRDKTIWGDDVEEFRPERFENPSAIPQHAFKPFGNGQRA
CIGQQFALHEATLVLGMMLKHFDFEDHTNYELDIKETLTLKPEGFVVKAKSKKIPLGGIPSPSTEQSAKKVRKKAENAHN
TPLLVLYGSNMGTAEGTARDLADIAMSKGFAPQVATLDSHAGNLPREGAVLIVTASYNGHPPDNAKQFVDWLDQASADEV
KGVRYSVFGCGDKNWATTYQKVPAFIDETLAAKGAENIADRGEADASDDFEGTYEEWREHMWSDVAAYFNLDIENSEDNK
STLSLQFVDSAADMPLAKMHGAFSTNVVASKELQQPGSARSTRHLEIELPKEASYQEGDHLGVIPRNYEGIVNRVTARFG
LDASQQIRLEAEEEKLAHLPLAKTVSVEELLQYVELQDPVTRTQLRAMAAKTVCPPHKVELEALLEKQAYKEQVLAKRLT
MLELLEKYPACEMKFSEFIALLPSIRPRYYSISSSPRVDEKQASITVSVVSGEAWSGYGEYKGIASNYLAELQEGDTITC
FISTPQSEFTLPKDPETPLIMVGPGTGVAPFRGFVQARKQLKEQGQSLGEAHLYFGCRSPHEDYLYQEELENAQSEGIIT
LHTAFSRMPNQPKTYVQHVMEQDGKKLIELLDQGAHFYICGDGSQMAPAVEATLMKSYADVHQVSEADARLWLQQLEEKG
RYAKDVWAG
>P24466 1.14.-.-~~~~~~Cytochrome P450-pinF1, plant-inducible~~~
MIANSSTDVSVADQKFLNVAKSNQIDPDAVPISRLDSEGHSIFAEWRPKRPFLRREDGIFLVLRADHIFLLGTDPRTRQI
ETELMLNRGVKAGAVFDFIDHSMLFSNGETHGKRRSGLSKAFSFRMVEALRPEIAKITECLWDDLQKVDDFNFTEMYASQ
LPALTIASVLGLPSEDTPFFTRLVYKVSRCLSPSWRDEEFEEIEASAIELQDYVRSVIADSGRRMRDDFLSRYLKAVREA
GTLSPIEEIMQLMLIILAGSDTTRTAMVMVTALALQNPALWSSLRGNQSYVAAAVEEGLRFEPPVGSFPRLALKDIDLDG
YVLPKGSLLALSVMSGLRDEKHYEHPQLFDVGRQQMRWHLGFGAGVHRCLGETLARIELQEGLRTLLRRAPNLAVVGDWP
RMMGHGGIRRATDMMVKLSFDL
>P24467 1.14.-.-~~~~~~Cytochrome P450-pinF2, plant-inducible~~~
MEERRVSISSITWRFPMLFAPVDDVTTIDDLTLDPYPIYRRMRVQNPVVHVASVRRTFLTKAFDTKMVKDDPSRFSSDDP
STPMKPAFQAHTLMRKDGTEHARERMAMARAFAPKAIADHWAPIYRDIVNEYLDRLPRGDTVDLFAEICGPVAARILAHI
LGICEASDVEIIRWSQRLIDGAGNFGWRSELFERSDEANAEMNCLFNDLVKKHRSAPNPSAFATMLNAPDPIPLSQIYAN
IKIAIGGGVNEPRDALGTILYGLLTNPEQLEEVKRQQCWGQAFEEGLRWVAPIQASSRLVREDTEIRGFIVPKGDIVMTI
QASANRDEDVFEDGESFNVFRPKSAHQSFGSGPHHCPGAQISRQTVGAIMLPILFDRFPDMILPHPELVQWRGFGFRGPI
NLPVTLR
>P18326 1.14.15.22~~~~~~Vitamin D3 dihydroxylase~~~
MTDTATTPQTTDAPAFPSNRSCPYQLPDGYAQLRDTPGPLHRVTLYDGRQAWVVTKHEAARKLLGDPRLSSNRTDDNFPA
TSPRFEAVRESPQAFIGLDPPEHGTRRRMTISEFTVKRIKGMRPEVEEVVHGFLDEMLAAGPTADLVSQFALPVPSMVIC
RLLGVPYADHEFFQDASKRLVQSTDAQSALTARNDLAGYLDGLITQFQTEPGAGLVGALVADQLANGEIDREELISTAML
LLIAGHETTASMTSLSVITLLDHPEQYAALRADRSLVPGAVEELLRYLAIADIAGGRVATADIEVEGHLIRAGEGVIVVN
SIANRDGTVYEDPDALDIHRSARHHLAFGFGVHQCLGQNLARLELEVILNALMDRVPTLRLAVPVEQLVLRPGTTIQGVN
ELPVTW
>P18327 1.14.-.-~~~~~~Cytochrome P450-SU2~~~
MTTAERTAPPDALTVPASRAPGCPFDPAPDVTEAARTEPVTRATLWDGSSCWLVTRHQDVRAVLGDPRFSADAHRTGFPF
LTAGGREIIGTNPTFLRMDDPEHARLRRMLTADFIVKKVEAMRPEVQRLADDLVDRMTTGRTSADLVTEFALPLPSLVIC
LLLGVPYEDHAFFQERSRVLLTLRSTPEEVRAAQDELLEYLARLARTKRERPDDAIISRLVARGELDDTQIATMGRLLLV
AGHETTANMTALSTLVLLRNPDQLARLRAEPALVKGAVEELLRYLTIVHNGVPRIATEDVLIGGRTIAAGEGVLCMISSA
NRDAEVFPGGDDLDVARDARRHVAFGFGVHQCLGQPLARVELQIAIETLLRRLPDLRLAVPHEEIPFRGDMAIYGVHSLP
IAW
>P14762 1.14.14.-~~~~~~Cytochrome P450(BM-1)~~~
MNKEVIPVTEIPKFQSRAEEFFPIQWYKEMLNNSPVYFHEETNTWNVFQYEHVKQVLSNYDFFSSDGQRTTIFVGDNSKK
KSTSPITNLTNLDPPDHRKARSLLAAAFTPRSLKNWEPRIKQIAADLVEAIQKNSTINIVDDLSSPFPSLVIADLFGVPV
KDRYQFKKWVDILFQPYDQERLEEIEQEKQRAGAEYFQYLYPIVIEKRSNLSDDIISDLIQAEVDGETFTDEEIVHATML
LLGAGVETTSHAIANMFYSFLYDDKSLYSELRNNRELAPKAVEEMLRYRFHISRRDRTVKQDNELLGVKLKKGDVVIAWM
SACNMDETMFENPFSVDIHRPTNKKHLTFGNGPHFCLGAPLARLEMKIILEAFLEAFSHIEPFEDFELEPHLTASATGQS
LTYLPMTVYR
>Q00441 1.14.15.35~~~eryF~~~6-deoxyerythronolide B hydroxylase~~~COG2124
MTTVPDLESDSFHVDWYRTYAELRETAPVTPVRFLGQDAWLVTGYDEAKAALSDLRLSSDPKKKYPGVEVEFPAYLGFPE
DVRNYFATNMGTSDPPTHTRLRKLVSQEFTVRRVEAMRPRVEQITAELLDEVGDSGVVDIVDRFAHPLPIKVICELLGVD
EKYRGEFGRWSSEILVMDPERAEQRGQAAREVVNFILDLVERRRTEPGDDLLSALIRVQDDDDGRLSADELTSIALVLLL
AGFEASVSLIGIGTYLLLTHPDQLALVRRDPSALPNAVEEILRYIAPPETTTRFAAEEVEIGGVAIPQYSTVLVANGAAN
RDPKQFPDPHRFDVTRDTRGHLSFGQGIHFCMGRPLAKLEGEVALRALFGRFPALSLGIDADDVVWRRSLLLRGIDHLPV
RLDG
>P33271 1.14.-.-~~~~~~Cytochrome P450 107B1~~~COG2124
MTTGEVPDLLAFDDAFAQDRHNRYARMREEPVQRIRTVNGLDAWLITRYEDVKQALLDPRIAKDFGRTQQIIEKRLADAE
RRPGFSPDLGPHMLNTDPPDHTRLRKLVVKAFTARRVEGLRPRIEQITDDLLDRLAGRSEVDLIDEFAFPLPITVISELM
GVEDSRRDDFRSWTNVLVDGSQPEAQAQASVAMVEYLTELIAKKRTEPGDDLLTALLEAVEDGDRLSEGELIAMVFLLLV
AGHETTVNLIGNCVLSLLGNPDQLAALRNDPSLLPGAIEETLRYESPVANGTFRHTAEAVRFGDVVIPEGELVWVALGAA
NRDGERFEDPDRFDITRETTGHVAFGHGIHFCVGAALARLEAQIAVGRLLERFPDLRMAASPDDLRWRFSVLMRGLEKLP
VRPGA
>P33006 1.14.-.-~~~~~~Cytochrome P450-terp~~~
MDARATIPEHIARTVILPQGYADDEVIYPAFKWLRDEQPLAMAHIEGYDPMWIATKHADVMQIGKQPGLFSNAEGSEILY
DQNNEAFMRSISGGCPHVIDSLTSMDPPTHTAYRGLTLNWFQPASIRKLEENIRRIAQASVQRLLDFDGECDFMTDCALY
YPLHVVMTALGVPEDDEPLMLKLTQDFFGVHEPDEQAVAAPRQSADEAARRFHETIATFYDYFNGFTVDRRSCPKDDVMS
LLANSKLDGNYIDDKYINAYYVAIATAGHDTTSSSSGGAIIGLSRNPEQLALAKSDPALIPRLVDEAVRWTAPVKSFMRT
ALADTEVRGQNIKRGDRIMLSYPSANRDEEVFSNPDEFDITRFPNRHLGFGWGAHMCLGQHLAKLEMKIFFEELLPKLKS
VELSGPPRLVATNFVGGPKNVPIRFTKA
>Q06069 1.14.99.-~~~~~~Cytochrome P450(MEG)~~~
MKEVIAVKEITRFKTRTEEFSPYAWCKRMLENDPVSYHEGTDTWNVFKYEDVKRVLSDYKHFSSVRKRTTISVGTDSEEG
SVPEKIQITESDPPDHRKRRSLLAAAFTPRSLQNWEPRIQEIADELIGQMDGGTEIDIVASLASPLPIIVMADLMGVPSK
DRLLFKKWVDTLFLPFDREKQEEVDKLKQVAAKEYYQYLYPIVVQKRLNPADDIISDLLKSEVDGEMFTDDEVVRTTMLI
LGAGVETTSHLLANSFYSLLYDDKEVYQELHENLDLVPQAVEEMLRFRFNLIKLDRTVKEDNDLLGVELKEGDSVVVWMS
AANMDEEMFEDPFTLNIHRPNNKKHLTFGNGPHFCLGAPLARLEAKIALTAFLKKFKHIEAVPSFQLEENLTDSATGQTL
TSLPLKASRM
>Q59723 1.14.14.84~~~linC~~~Linalool 8-monooxygenase~~~
MERPDLKNPDLYTQQVPHDIFARLRREEPVYWNPESDGSGFWAVLRHKDIIEVSRQPLLFSSAYENGGHRIFNENEVGLT
NAGEAAVGVPFISLDPPVHTQYRKVIMPALSPARLGDIEQRIRVRAEALIERIPLGEEVDLVPLLSAPLPLLTLAELLGL
DPDCWYELYNWTNAFVGEDDPEFRKSPEDMAKVLGEFMGFCQELFESRRANPGPDIATLLANAEINGQPVALRDFIGNLT
LTLVGGNETTRNSISHTIVTLSQQPDQWDILRQRPELLKTATAEMVRHASPVLHMRRTAMEDTEIGGQAIAKGDKVVLWY
ASGNRDESVFSDADRFDVTRTGVQHVGFGSGQHVCVGSRLAEMQLRVVFEILSTRVKRFELCSKSRRFRSNFLNGLKNLN
VVLVPK
>P0AE85 ~~~cpxP~~~Periplasmic protein CpxP~~~COG3678
MRIVTAAVMASTLAVSSLSHAAEVGSGDNWHPGEELTQRSTQSHMFDGISLTEHQRQQMRDLMQQARHEQPPVNVSELET
MHRLVTAENFDENAVRAQAEKMANEQIARQVEMAKVRNQMYRLLTPEQQAVLNEKHQQRMEQLRDVTQWQKSSSLKLLSS
SNSRSQ
>P0AE88 ~~~cpxR~~~Transcriptional regulatory protein CpxR~~~COG0745
MNKILLVDDDRELTSLLKELLEMEGFNVIVAHDGEQALDLLDDSIDLLLLDVMMPKKNGIDTLKALRQTHQTPVIMLTAR
GSELDRVLGLELGADDYLPKPFNDRELVARIRAILRRSHWSEQQQNNDNGSPTLEVDALVLNPGRQEASFDGQTLELTGT
EFTLLYLLAQHLGQVVSREHLSQEVLGKRLTPFDRAIDMHISNLRRKLPDRKDGHPWFKTLRGRGYLMVSAS
>Q8DI91 4.-.-.-~~~cpcS~~~Chromophore lyase CpcS/CpeS~~~
MCIGMDIRDFFAQSAGRWFSQRTSHHLAFKQTESGKSQLTIELLSVDDPAVIALCQQYDMDPAWAVCGARVSWDGTMEWD
NEKHEGSTVLVPIMDQGSRMEGKLLREMGYAEKAPVAGRFSMGSDGALTLITEYETIYSEERLWFASPNLRLRTSILKRF
GGFSMASFCSEIRLGVTQPANS
>A7N6R9 2.3.-.-~~~cqsA~~~CAI-1 autoinducer synthase~~~
MSDKPKTKPLPSFVEGRLDFYIQDLIEQNENQKHLVLGKRPQQGAVVMQSNDYLSLSHNLQIQQAHRDAIYEHDDNVVMS
AIFLQDDDSKPAFETQLAEYVGMGSCLLSQSGWAANIGLLQTICPPETPVYIDFFAHMSLWEGIRAAGAQAHPFMHNNMN
HLRKQIQRNGSGVIVVDSVYSTIGTIAPLRDIYEMAREFDCALVVDESHSLGTHGPNGSGLVKALELTEQVDFITVSLAK
TFAYRAGAILGPEKLARTLPFVAFPAIFSSTVLPQEIVRLEKTLEVIRSADDKRTMLFKRAKELRTGLKQIGFHIRSESQ
IVALECGSERNTERVRDFLEERNVFGAVFCRPATGKNKNIIRFSINADMTSRDIDHVLTACQEAYNHPELEFA
>Q9KM65 2.3.-.-~~~cqsA~~~CAI-1 autoinducer synthase~~~COG0156
MNKPQLPDFIQNKIDHYIENYFDINKNGKHLVLGKQASPDDIILQSNDYLALANHPLIKARLAKSLLEEQQSLFMSASFL
QNDYDKPMIEKRLAKFTGFDECLLSQSGWNANVGLLQTICQPNTNVYIDFFAHMSLWEGARYANAQAHPFMHNNCDHLRM
LIQRHGPGIIVVDSIYSTLGTIAPLAELVNISKEFGCALLVDESHSLGTHGPNGAGLLAELGLTREVHFMTASLAKTFAY
RAGAIWCNNEVNRCVPFISYPAIFSSTLLPYEAAGLETTLEIIESADNRRQHLDRMARKLRIGLSQLGLTIRSESQIIGL
ETGDERNTEKVRDYLESNGVFGSVFCRPATSKNKNIIRLSLNSDVNDEQIAKIIEVCSDAVNYGDFYFR
>A7N6S2 2.7.13.3~~~cqsS~~~CAI-1 autoinducer sensor kinase/phosphatase CqsS~~~
MDAIRKVYQYAEPNLSLVGWMGFIGFPIYYIVWEFMFPQPYENLPLRILCSVLFFGIIYRNRTPFEWRGFLPAYYQVVTT
LCLPCFFFYMLLMNNWSNVWVMSFMSAIFLHILLVHITSVMFVQTFVGIGLATFFAWVAQGFHLELTMDWTHVPIFLFIY
LFGNLFYFRNQVEHEAKVSIAKSFGAGIAHEMRNPLSGLLTSIDVIQSVLPNPKEGKKEQYTLSDEDVTLLREVSSDAMK
IIHSGNETIDLLLTSIDENRVSRSTFKKHSAQSVVESAIESFSYKRSTDRFAISLDVRSEFDFLGSDTLLKYVMYNLFKN
AFHHRSSEDFHIHVTMYSDEFANQIVVTDNGSGIAPEVLQSIFQDFYTTGKSGNYGLGLPFCKKVMRSFGGDIRCQSEVG
EWSQFTMTFPTIGSSAVKEIKSELTKLKTILFVSEQNILVSKVTDIARFMRFELTVLDVPAVLKNKEYEFEFDLILIDME
SLDASGSHIDKVESLLSFTEARIVYMFEHHPIQRARSVSFEPIWVETQAWLLNTRATIDRLLYDANYVVPSMPAKPLDST
NKRTIMVVDDNESLRKFTAMLLEKQGFEVIQTEDGLQAINALNENNVDLILMDIEMPVMDGVEASRQIRGSNKAYASVPI
IAHTGDSSPITLDKIGSSGMSDFIVKPADKNRLFDKIANWI
>Q9KM66 2.7.13.3~~~cqsS~~~CAI-1 autoinducer sensor kinase/phosphatase CqsS~~~COG0784
MIVSMDVIKRVYQYAEPNLSLVGWMGMLGFPAYYFIWEYWFPQSYENLGLRCAAAVLFGGLVFRDSMPKKWQRYMPGYFL
FTIGFCLPFFFAFMMLMNDWSTIWAMSFMASIFLHILLVHDTRVMALQALFSVLVAYLAVYGLTDFHPTTLIEWQYIPIF
LFTYVFGNLCFFRNQISHETKVSIAKTFGAGIAHEMRNPLSALKTSIDVVRTMIPKPQTAAHTDYSLDAQELDLLHQILN
EADDVIYSGNNAIDLLLTSIDENRVSPASFKKHSVVDVIEKAVKTFPYKNAADQHSVELEVHQPFDFFGSDTLLTYALFN
LLKNAFYYQKEHFSVCISIEQTSEHNLIRVRDNGVGIAPEMLEDIFRDFYTFGKNGSYGLGLPFCRKVMSAFGGTIRCAS
QQGQWTEFVLSFPRYDSDTVNEIKTELLKTKSLIYIGSNQAIVRELNQLAVEDEFGFTAISAQQAVRRQDYEFEFDLILL
DLDDATAQGELLPKLEGTLSFAEGCIGYVYDPGKTYAVNINRYLRIQPISIHSILRKPRKIIERLLFEQESLSMNRNVIP
LQKSRHERRILVVDDNQSIRTFTAILLEQQGYEVVQANDGSEVLKHMESQNIDLVLMDIEMPNVGGLEATRLIRNSEHEY
KNIPIIGYTGDNSPKTLALVQTSGMNDFIVKPADRDVLLNKVAAWV
>P0A367 ~~~cry1Aa~~~Pesticidal crystal protein Cry1Aa~~~
MDNNPNINECIPYNCLSNPEVEVLGGERIETGYTPIDISLSLTQFLLSEFVPGAGFVLGLVDIIWGIFGPSQWDAFLVQI
EQLINQRIEEFARNQAISRLEGLSNLYQIYAESFREWEADPTNPALREEMRIQFNDMNSALTTAIPLFAVQNYQVPLLSV
YVQAANLHLSVLRDVSVFGQRWGFDAATINSRYNDLTRLIGNYTDYAVRWYNTGLERVWGPDSRDWVRYNQFRRELTLTV
LDIVALFSNYDSRRYPIRTVSQLTREIYTNPVLENFDGSFRGMAQRIEQNIRQPHLMDILNSITIYTDVHRGFNYWSGHQ
ITASPVGFSGPEFAFPLFGNAGNAAPPVLVSLTGLGIFRTLSSPLYRRIILGSGPNNQELFVLDGTEFSFASLTTNLPST
IYRQRGTVDSLDVIPPQDNSVPPRAGFSHRLSHVTMLSQAAGAVYTLRAPTFSWQHRSAEFNNIIPSSQITQIPLTKSTN
LGSGTSVVKGPGFTGGDILRRTSPGQISTLRVNITAPLSQRYRVRIRYASTTNLQFHTSIDGRPINQGNFSATMSSGSNL
QSGSFRTVGFTTPFNFSNGSSVFTLSAHVFNSGNEVYIDRIEFVPAEVTFEAEYDLERAQKAVNELFTSSNQIGLKTDVT
DYHIDQVSNLVECLSDEFCLDEKQELSEKVKHAKRLSDERNLLQDPNFRGINRQLDRGWRGSTDITIQGGDDVFKENYVT
LLGTFDECYPTYLYQKIDESKLKAYTRYQLRGYIEDSQDLEIYLIRYNAKHETVNVPGTGSLWPLSAQSPIGKCGEPNRC
APHLEWNPDLDCSCRDGEKCAHHSHHFSLDIDVGCTDLNEDLGVWVIFKIKTQDGHARLGNLEFLEEKPLVGEALARVKR
AEKKWRDKREKLEWETNIVYKEAKESVDALFVNSQYDQLQADTNIAMIHAADKRVHSIREAYLPELSVIPGVNAAIFEEL
EGRIFTAFSLYDARNVIKNGDFNNGLSCWNVKGHVDVEEQNNQRSVLVVPEWEAEVSQEVRVCPGRGYILRVTAYKEGYG
EGCVTIHEIENNTDELKFSNCVEEEIYPNNTVTCNDYTVNQEEYGGAYTSRNRGYNEAPSVPADYASVYEEKSYTDGRRE
NPCEFNRGYRDYTPLPVGYVTKELEYFPETDKVWIEIGETEGTFIVDSVELLLMEE
>P0A368 ~~~cry1Aa~~~Pesticidal crystal protein Cry1Aa~~~
MDNNPNINECIPYNCLSNPEVEVLGGERIETGYTPIDISLSLTQFLLSEFVPGAGFVLGLVDIIWGIFGPSQWDAFPVQI
EQLINQRIEEFARNQAISRLEGLSNLYQIYAESFREWEADPTNPALREEMRIQFNDMNSALTTAIPLLAVQNYQVPLLSV
YVQAANLHLSVLRDVSVFGQRWGFDAATINSRYNDLTRLIGNYTDYAVRWYNTGLERVWGPDSRDWVRYNQFRRELTLTV
LDIVALFSNYDSRRYPIRTVSQLTREIYTNPVLENFDGSFRGMAQRIEQNIRQPHLMDILNSITIYTDVHRGFNYWSGHQ
ITASPVGFSGPEFAFPLFGNAGNAAPPVLVSLTGLGIFRTLSSPLYRRIILGSGPNNQELFVLDGTEFSFASLTTNLPST
IYRQRGTVDSLDVIPPQDNSVPPRAGFSHRLSHVTMLSQAAGAVYTLRAPTFSWQHRSAEFNNIIPSSQITQIPLTKSTN
LGSGTSVVKGPGFTGGDILRRTSPGQISTLRVNITAPLSQRYRVRIRYASTTNLQFHTSIDGRPINQGNFSATMSSGSNL
QSGSFRTVGFTTPFNFSNGSSVFTLSAHVFNSGNEVYIDRIEFVPAEVTFEAEYDLERAQKAVNELFTSSNQIGLKTDVT
DYHIDQVSNLVECLSDEFCLDEKQELSEKVKHAKRLSDERNLLQDPNFRGINRQLDRGWRGSTDITIQGGDDVFKENYVT
LLGTFDECYPTYLYQKIDESKLKAYTRYQLRGYIEDSQDLEIYLIRYNAKHETVNVPGTGSLWPLSAQSPIGKCGEPNRC
APHLEWNPDLDCSCRDGEKCAHHSHHFSLDIDVGCTDLNEDLGVWVIFKIKTQDGHARLGNLEFLEEKPLVGEALARVKR
AEKKWRDKREKLEWETNIVYKEAKESVDALFVNSQYDQLQADTNIAMIHAADKRVHSIREAYLPELSVIPGVNAAIFEEL
EGRIFTAFSLYDARNVIKNGDFNNGLSCWNVKGHVDVEEQNNQRSVLVVPEWEAEVSQEVRVCPGRGYILRVTAYKEGYG
EGCVTIHEIENNTDELKFSNCVEEEIYPNNTVTCNDYTVNQEEYGGAYTSRNRGYNEAPSVPADYASVYEEKSYTDGRRE
NPCEFNRGYRDYTPLPVGYVTKELEYFPETDKVWIEIGETEGTFIVDSVELLLMEE
>P0A366 ~~~cry1Aa~~~Pesticidal crystal protein Cry1Aa~~~
MDNNPNINECIPYNCLSNPEVEVLGGERIETGYTPIDISLSLTQFLLSEFVPGAGFVLGLVDIIWGIFGPSQWDAFPVQI
EQLINQRIEEFARNQAISRLEGLSNLYQIYAESFREWEADPTNPALREEMRIQFNDMNSALTTAIPLLAVQNYQVPLLSV
YVQAANLHLSVLRDVSVFGQRWGFDAATINSRYNDLTRLIGNYTDYAVRWYNTGLERVWGPDSRDWVRYNQFRRELTLTV
LDIVALFSNYDSRRYPIRTVSQLTREIYTNPVLENFDGSFRGMAQRIEQNIRQPHLMDILNSITIYTDVHRGFNYWSGHQ
ITASPVGFSGPEFAFPLFGNAGNAAPPVLVSLTGLGIFRTLSSPLYRRIILGSGPNNQELFVLDGTEFSFASLTTNLPST
IYRQRGTVDSLDVIPPQDNSVPPRAGFSHRLSHVTMLSQAAGAVYTLRAPTFSWQHRSAEFNNIIPSSQITQIPLTKSTN
LGSGTSVVKGPGFTGGDILRRTSPGQISTLRVNITAPLSQRYRVRIRYASTTNLQFHTSIDGRPINQGNFSATMSSGSNL
QSGSFRTVGFTTPFNFSNGSSVFTLSAHVFNSGNEVYIDRIEFVPAEVTFEAEYDLERAQKAVNELFTSSNQIGLKTDVT
DYHIDQVSNLVECLSDEFCLDEKQELSEKVKHAKRLSDERNLLQDPNFRGINRQLDRGWRGSTDITIQGGDDVFKENYVT
LLGTFDECYPTYLYQKIDESKLKAYTRYQLRGYIEDSQDLEIYLIRYNAKHETVNVPGTGSLWPLSAQSPIGKCGEPNRC
APHLEWNPDLDCSCRDGEKCAHHSHHFSLDIDVGCTDLNEDLGVWVIFKIKTQDGHARLGNLEFLEEKPLVGEALARVKR
AEKKWRDKREKLEWETNIVYKEAKESVDALFVNSQYDQLQADTNIAMIHAADKRVHSIREAYLPELSVIPGVNAAIFEEL
EGRIFTAFSLYDARNVIKNGDFNNGLSCWNVKGHVDVEEQNNQRSVLVVPEWEAEVSQEVRVCPGRGYILRVTAYKEGYG
EGCVTIHEIENNTDELKFSNCVEEEIYPNNTVTCNDYTVNQEEYGGAYTSRNRGYNEAPSVPADYASVYEEKSYTDGRRE
NPCEFNRGYRDYTPLPVGYVTKELEYFPETDKVWIEIGETEGTFIVDSVELLLMEE
>P0A372 ~~~cry1Ab~~~Pesticidal crystal protein Cry1Ab~~~
MDNNPNINECIPYNCLSNPEVEVLGGERIETGYTPIDISLSLTQFLLSEFVPGAGFVLGLVDIIWGIFGPSQWDAFLVQI
EQLINQRIEEFARNQAISRLEGLSNLYQIYAESFREWEADPTNPALREEMRIQFNDMNSALTTAIPLFAVQNYQVPLLSV
YVQAANLHLSVLRDVSVFGQRWGFDAATINSRYNDLTRLIGNYTDHAVRWYNTGLERVWGPDSRDWIRYNQFRRELTLTV
LDIVSLFPNYDSRTYPIRTVSQLTREIYTNPVLENFDGSFRGSAQGIEGSIRSPHLMDILNSITIYTDAHRGEYYWSGHQ
IMASPVGFSGPEFTFPLYGTMGNAAPQQRIVAQLGQGVYRTLSSTLYRRPFNIGINNQQLSVLDGTEFAYGTSSNLPSAV
YRKSGTVDSLDEIPPQNNNVPPRQGFSHRLSHVSMFRSGFSNSSVSIIRAPMFSWIHRSAEFNNIIPSSQITQIPLTKST
NLGSGTSVVKGPGFTGGDILRRTSPGQISTLRVNITAPLSQRYRVRIRYASTTNLQFHTSIDGRPINQGNFSATMSSGSN
LQSGSFRTVGFTTPFNFSNGSSVFTLSAHVFNSGNEVYIDRIEFVPAEVTFEAEYDLERAQKAVNELFTSSNQIGLKTDV
TDYHIDQVSNLVECLSDEFCLDEKKELSEKVKHAKRLSDERNLLQDPNFRGINRQLDRGWRGSTDITIQGGDDVFKENYV
TLLGTFDECYPTYLYQKIDESKLKAYTRYQLRGYIEDSQDLEIYLIRYNAKHETVNVPGTGSLWPLSAPSPIGKCAHHSH
HFSLDIDVGCTDLNEDLGVWVIFKIKTQDGHARLGNLEFLEEKPLVGEALARVKRAEKKWRDKREKLEWETNIVYKEAKE
SVDALFVNSQYDRLQADTNIAMIHAADKRVHSIREAYLPELSVIPGVNAAIFEELEGRIFTAFSLYDARNVIKNGDFNNG
LSCWNVKGHVDVEEQNNHRSVLVVPEWEAEVSQEVRVCPGRGYILRVTAYKEGYGEGCVTIHEIENNTDELKFSNCVEEE
VYPNNTVTCNDYTATQEEYEGTYTSRNRGYDGAYESNSSVPADYASAYEEKAYTDGRRDNPCESNRGYGDYTPLPAGYVT
KELEYFPETDKVWIEIGETEGTFIVDSVELLLMEE
>P0A371 ~~~cry1Ab~~~Pesticidal crystal protein Cry1Ab~~~
MDNNPNINECIPYNCLSNPEVEVLGGERIETGYTPIDISLSLTQFLLSEFVPGAGFVLGLVDIIWGIFGPSQWDAFLVQI
EQLINQRIEEFARNQAISRLEGLSNLYQIYAESFREWEADPTNPALREEMRIQFNDMNSALTTAIPLFAVQNYQVPLLSV
YVQAANLHLSVLRDVSVFGQRWGFDAATINSRYNDLTRLIGNYTDHAVRWYNTGLERVWGPDSRDWIRYNQFRRELTLTV
LDIVSLFPNYDSRTYPIRTVSQLTREIYTNPVLENFDGSFRGSAQGIEGSIRSPHLMDILNSITIYTDAHRGEYYWSGHQ
IMASPVGFSGPEFTFPLYGTMGNAAPQQRIVAQLGQGVYRTLSSTLYRRPFNIGINNQQLSVLDGTEFAYGTSSNLPSAV
YRKSGTVDSLDEIPPQNNNVPPRQGFSHRLSHVSMFRSGFSNSSVSIIRAPMFSWIHRSAEFNNIIPSSQITQIPLTKST
NLGSGTSVVKGPGFTGGDILRRTSPGQISTLRVNITAPLSQRYRVRIRYASTTNLQFHTSIDGRPINQGNFSATMSSGSN
LQSGSFRTVGFTTPFNFSNGSSVFTLSAHVFNSGNEVYIDRIEFVPAEVTFEAEYDLERAQKAVNELFTSSNQIGLKTDV
TDYHIDQVSNLVECLSDEFCLDEKKELSEKVKHAKRLSDERNLLQDPNFRGINRQLDRGWRGSTDITIQGGDDVFKENYV
TLLGTFDECYPTYLYQKIDESKLKAYTRYQLRGYIEDSQDLEIYLIRYNAKHETVNVPGTGSLWPLSAPSPIGKCAHHSH
HFSLDIDVGCTDLNEDLGVWVIFKIKTQDGHARLGNLEFLEEKPLVGEALARVKRAEKKWRDKREKLEWETNIVYKEAKE
SVDALFVNSQYDRLQADTNIAMIHAADKRVHSIREAYLPELSVIPGVNAAIFEELEGRIFTAFSLYDARNVIKNGDFNNG
LSCWNVKGHVDVEEQNNHRSVLVVPEWEAEVSQEVRVCPGRGYILRVTAYKEGYGEGCVTIHEIENNTDELKFSNCVEEE
VYPNNTVTCNDYTATQEEYEGTYTSRNRGYDGAYESNSSVPADYASAYEEKAYTDGRRDNPCESNRGYGDYTPLPAGYVT
KELEYFPETDKVWIEIGETEGTFIVDSVELLLMEE
>P0A370 ~~~cry1Ab~~~Pesticidal crystal protein Cry1Ab~~~
MDNNPNINECIPYNCLSNPEVEVLGGERIETGYTPIDISLSLTQFLLSEFVPGAGFVLGLVDIIWGIFGPSQWDAFLVQI
EQLINQRIEEFARNQAISRLEGLSNLYQIYAESFREWEADPTNPALREEMRIQFNDMNSALTTAIPLFAVQNYQVPLLSV
YVQAANLHLSVLRDVSVFGQRWGFDAATINSRYNDLTRLIGNYTDHAVRWYNTGLERVWGPDSRDWIRYNQFRRELTLTV
LDIVSLFPNYDSRTYPIRTVSQLTREIYTNPVLENFDGSFRGSAQGIEGSIRSPHLMDILNSITIYTDAHRGEYYWSGHQ
IMASPVGFSGPEFTFPLYGTMGNAAPQQRIVAQLGQGVYRTLSSTLYRRPFNIGINNQQLSVLDGTEFAYGTSSNLPSAV
YRKSGTVDSLDEIPPQNNNVPPRQGFSHRLSHVSMFRSGFSNSSVSIIRAPMFSWIHRSAEFNNIIPSSQITQIPLTKST
NLGSGTSVVKGPGFTGGDILRRTSPGQISTLRVNITAPLSQRYRVRIRYASTTNLQFHTSIDGRPINQGNFSATMSSGSN
LQSGSFRTVGFTTPFNFSNGSSVFTLSAHVFNSGNEVYIDRIEFVPAEVTFEAEYDLERAQKAVNELFTSSNQIGLKTDV
TDYHIDQVSNLVECLSDEFCLDEKKELSEKVKHAKRLSDERNLLQDPNFRGINRQLDRGWRGSTDITIQGGDDVFKENYV
TLLGTFDECYPTYLYQKIDESKLKAYTRYQLRGYIEDSQDLEIYLIRYNAKHETVNVPGTGSLWPLSAPSPIGKCAHHSH
HFSLDIDVGCTDLNEDLGVWVIFKIKTQDGHARLGNLEFLEEKPLVGEALARVKRAEKKWRDKREKLEWETNIVYKEAKE
SVDALFVNSQYDRLQADTNIAMIHAADKRVHSIREAYLPELSVIPGVNAAIFEELEGRIFTAFSLYDARNVIKNGDFNNG
LSCWNVKGHVDVEEQNNHRSVLVVPEWEAEVSQEVRVCPGRGYILRVTAYKEGYGEGCVTIHEIENNTDELKFSNCVEEE
VYPNNTVTCNDYTATQEEYEGTYTSRNRGYDGAYESNSSVPADYASAYEEKAYTDGRRDNPCESNRGYGDYTPLPAGYVT
KELEYFPETDKVWIEIGETEGTFIVDSVELLLMEE
>P05068 ~~~cry1Ac~~~Pesticidal crystal protein Cry1Ac~~~
MDNNPNINECIPYNCLSNPEVEVLGGERIETGYTPIDISLSLTQFLLSEFVPGAGFVLGLVDIIWGIFGPSQWDAFLVQI
EQLINQRIEEFARNQAISRLEGLSNLYQIYAESFREWEADPTNPALREEMRIQFNDMNSALTTAIPLFAVQNYQVPLLSV
YVQAANLHLSVLRDVSVFGQRWGFDAATINSRYNDLTRLIGNYTDYAVRWYNTGLERVWGPDSRDWVRYNQFRRELTLTV
LDIVALFPNYDSRRYPIRTVSQLTREIYTNPVLENFDGSFRGSAQGIERSIRSPHLMDILNSITIYTDAHRGYYYWSGHQ
IMASPVGFSGPEFTFPLYGTMGNAAPQQRIVAQLGQGVYRTLSSTLYRRPFNIGINNQQLSVLDGTEFAYGTSSNLPSAV
YRKSGTVDSLDEIPPQNNNVPPRQGFSHRLSHVSMFRSGFSNSSVSIIRAPMFSWIHRSAEFNNIIASDSITQIPAVKGN
FLFNGSVISGPGFTGGDLVRLNSSGNNIQNRGYIEVPIHFPSTSTRYRVRVRYASVTPIHLNVNWGNSSIFSNTVPATAT
SLDNLQSSDFGYFESANAFTSSLGNIVGVRNFSGTAGVIIDRFEFIPVTATLEAEYNLERAQKAVNALFTSTNQLGLKTN
VTDYHIDQVSNLVTYLSDEFCLDEKRELSEKVKHAKRLSDERNLLQDSNFKDINRQPERGWGGSTGITIQGGDDVFKENY
VTLSGTFDECYPTYLYQKIDESKLKAFTRYQLRGYIEDSQDLEIYLIRYNAKHETVNVPGTGSLWPLSAQSPIGKCGEPN
RCAPHLEWNPDLDCSCRDGEKCAHHSHHFSLDIDVGCTDLNEDLGVWVIFKIKTQDGHARLGNLEFLEEKPLVGEALARV
KRAEKKWRDKREKLEWETNIVYKEAKESVDALFVNSQYDQLQADTNIAMIHAADKRVHSIREAYLPELSVIPGVNAAIFE
ELEGRIFTAFSLYDARNVIKNGDFNNGLSCWNVKGHVDVEEQNNQRSVLVVPEWEAEVSQEVRVCPGRGYILRVTAYKEG
YGEGCVTIHEIENNTDELKFSNCVEEEIYPNNTVTCNDYTVNQEEYGGAYTSRNRGYNEAPSVPADYASVYEEKSYTDGR
RENPCEFNRGYRDYTPLPVGYVTKELEYFPETDKVWIEIGETEGTFIVDSVELLLMEE
>Q03744 ~~~cry1Ad~~~Pesticidal crystal protein Cry1Ad~~~
MEIMNNQNQCVPYNCLNDPTIEILEGERIETGYTPIDISLSLTQFLLSEFVPGAGFVLGLIDLIWGFVGPSQWDAFLVQI
EQLINQRIEEFARNQAISRLEGLSNLYQIYAEAFREWEADPTNPALTEEMRIQFNDMNSALTTAIPLFTVQNYQVPLLSV
YVQAANLHLSVLRDVSVFGQRWGFDVATINSRYNDLTRLIGTYTDYAVRWYNTGLERVWGPDSRDWVRYNQFRRELTLTV
LDIVSLFPNYDSRTYPIRTVSQLTREIYTNPVLENFDGSFRGMAQRIEQNIRQPHLMDLLNSITIYTDVHRGFNYWSGHQ
ITASPVGFAGPEFTFPRYGTMGNAAPPVLISTTGLGIFRTLSSPLYRRIILGSGPNNQNLFVLDGTEFSFASLTADLPST
IYRQRGTVDSLDVIPPQDNSVPARAGFSHRLSHVTMLSQAAGAVYTLRAPTFSWRHRSAEFSNLIPSSQITQIPLTKSIN
LGSGTSVVKGPGFTGGDILRITSPGQISTLRVTITAPLSQRYRVRIRYASTTNLQFHTSIDGRPINQGNFSATMSSGGNL
QSGSFRTAGFTTPFNFSNGSSIFTLSAHVFNSGNEVYIERIEFVPAEVTFEAEYDLERAQEAVNALFTSSNQLGLKTNVT
DYHIDQVSNLVECLSGEFCLDEKRELSEKVKHANRLSDERNLLQDPNFRGINRQPDRGWRGSTDITIQGGDDVFKENYVT
LPGTFNECYPTYLYQKIDESKLKAYTRYQLRGYIEDSQHLEIYLIRYNTKHETVNVPGTGSLWPLSVENPIGKCGEPNRC
APQLEWNPDLDCSCRDGEKCAHHSHHFSLDIDIGCTDLNENLGVWVIFKIKMQDGHARLGNLEFLEEKPLVGESLARVKR
AEKKWRDKREKLQVETNIVYKEAKESVDALFVNSQYDRLQADTDIAMIHAADKRVHRIREAYLPELSVIPGVNAGIFEEL
EGRIFTAYSLYDARNVIKNGDFNNGLSCWNVKGHVDVEEQNNHRSVLVVPEWEAEVSQEVRVCPGRGYILRVTAYKEGYG
EGCVTIHEIEDNTDELKFSNCVEEEVYPNNTVTCNDYTANQEEYGGAYTSRNRGYGESYESNSSIPAEYAPVYEEAYIDG
RKENPCESNRGYGDYTPLPAGYVTKELEYFPETDKVWIEIGETEGTFIVDSVELLLMEE
>Q03748 ~~~cry1Ae~~~Pesticidal crystal protein Cry1Ae~~~
MDNNPKINECIPYNCLSNPEVEVLGGERIETGYTPIDISLSLTQFLLSEFVPGAGFVLGLIDLIWGFVGPSQWDAFLVQI
EQLISQRIEEFARNQAISRLEGLSNLYQIYAEAFREWEADPTNPALREEMRIQFNDMNSALTTAIPLFTVQNYQVPLLSV
YVQAVNLHLSVLRDVSVFGQRWGLDVATINSRYNDLTRLIGTYTDYAVRWYNTGLERVWGPDSRDWVRYNQFRRELTLTV
LDIVSLFPNYDSRTYPIRTVSQLTREIYTNPVLENFDGSFRGSAQRIEQSIRSPHLMDILNSITIYTDAHGGYYYWSGHQ
IMASPVGFSGPEFTFPLYGTMGNAAPQQRIVAQLGQGVYRTLSSTFYRNPFIIGINNQRLSVLDGTEFAYGSSSNLPSAV
YRKSGTVDSLDEIPPQDNNVPPRQGFSHRLSHVSMFRSGFSNSSVSIIRAPMFSWIHRSAEFNNIIPSSQITQIPLTKST
NLGSGTSVVKGPGFTGGDILRRTSPGQISTLRVNITAPLSQRYRVRIRYASTTNLQFHTSIDGRPINQGNFSATMSSGGN
LQSGSFRTVGFTTPFNFSNGSSVFTLSAHVFNSGNEVYIDRIEFVPAEVTFEAEYDLERAQEAVNALFTSPNQIGLKTDV
TDYHIDQVSNLVECLSDEFCLDEKRELSEKVKHAKRLSDERNLLQDPNFRGINRQPDRGWRGSTDITIQGGDDVFKENYV
TLPGTFDECYPTYLYQKIDESKLKAYTRYELRGYIEDSQDLEIYLIRYNAKHETVNVPGTGSLWPLSFESSIGKCGEPNR
CAPHLEWNPDLDCSCRDGEKCAHHSHHFSLDIDVGCIDLNEDLGVWVIFKIKTQDGHARLGNLEFLEEKPLVGEALARVK
RAEKKWRDKREKLQLETNIVYKEAKESVDALFVNSQYDQLQADTNIAMIHTADKRVHRIQEAYLPELSVIPGVNAGIFEE
LEGRIFTAYSLYDARNVIKNGDFNNGLSCWNVKGHVDVEEQNNHRSVLVVPEWEAEVSQEVRVCPGRGYILRVTAYKEGY
GEGCVTIHEIENNTDELKFSNCVEEEVYPNNTVTCNEYTANQEEYGGAYTSCNRGYDETYGSNYSVPADYASVYEEKAYT
DGRRENPCESNRGYGDYTPLPAGYVTKQLEYFPETDKVWIEIGETEGTFIVDSVELFLMEE
>Q9S515 ~~~cry1Ag~~~Pesticidal crystal protein Cry1Ag~~~
MDNNPNINECIPYNCLSNPEVEVLGGERIETGYTPIDISLSLTQFLLSEFVPGAGFVLGLVDIIWGIFGPSQWDAFLVQI
EQLINQRIEEFARNQAISRLEGLSNLYQIYAESFREWEADPTNPALREEMRIQFNDMNSALTTAIPLLAVQNYQVPLLSV
YVQAANLHLSVLRDVSVFGQRWGFDAATINSRYNDLTRLIGNYTDYAVRWYNTGLERVWGPDSRDWVRYNQFRRELTLTV
LDIVALFSNYDSRRYPIRTVSQLTREIYTNPVLENFDGSFRGMAQRIEPEYRQPHLMDILNSISIYTDVHRGFNYWSGHQ
ITTSPVGFSGPEFTFPLYGTYGNAAPPQRIAQTGLGIFRTLSSPLYRRIILGSGPNNQELFVLDGTEFSFASLTTNLPST
IYRQRGTVDSLDVIPPQDNSVPPRAGFSHRLSHVPMLSQAAGAVYTLRASLFLLLVLLIHARSIFNNIIPSSQITQSFKK
IISWTSVVKGPGFTGGDILRRPSPGLISTLRVNITAPLSQRYRVRIRYAFTTNLQFLTSIDGRPINQGNFYATMSSGSNL
QSGSFRTVGFTTPFNFSNGSSVFTLSAHVFNSGNEVYIDRIEFVPAEVTFEAEYDLERAQNGVNQLFTSSNQIGLKTDGT
DYHIDQVSNLVECLSDEFCLDEKQELSEKVKHAKRLSDERNLLQDPNFRGINRQLDRGWRGSHDITIQGGDDVFKENYVT
LLGTFDECYPTYLYQKIDQSKLKAYTSYQLRGYIEDSQDLEIYLIGYNAKQQTVNVPGTGSLWPVYAPKPIGKCGEPNRC
APHLEWNPDLDCSCRDGEKCAHHSLHFSIDIDVGCTDLNEDLGVWVIFKIKTEDGHARLGNLEFLEEKPLVGEALARVKR
AEKKWRDKRVKLEWETNIVYKEAKESVDALFVNSQYDQLQADTNIAMIHAADKRVHSIREAYLPELSVIPGVNAAIFEEL
EGRIFTAFSLYDARNVIKNGDFNNGLSCWNVKGHVDVEEQNNQRSVLVVPEWEAEVSQEVRVCPGRGYFPRVTAYKEGYG
EGCVTIHEIENNTDELKFSNCVEEEIYPNNTVTCNDYTVNQEEYGGAYTSRNRGYNEAPSVPADYASVYEEKSYTDGRRE
NPCEFNRGYRDYTGLPVGYVTKALEYFPETDKVWIEIGETEGTFIVDSVELLLMEE
>P0A373 ~~~cry1Ba~~~Pesticidal crystal protein Cry1Ba~~~
MTSNRKNENEIINAVSNHSAQMDLLPDARIEDSLCIAEGNNIDPFVSASTVQTGINIAGRILGVLGVPFAGQLASFYSFL
VGELWPRGRDQWEIFLEHVEQLINQQITENARNTALARLQGLGDSFRAYQQSLEDWLENRDDARTRSVLYTQYIALELDF
LNAMPLFAIRNQEVPLLMVYAQAANLHLLLLRDASLFGSEFGLTSQEIQRYYERQVERTRDYSDYCVEWYNTGLNSLRGT
NAASWVRYNQFRRDLTLGVLDLVALFPSYDTRTYPINTSAQLTREVYTDAIGATGVNMASMNWYNNNAPSFSAIEAAAIR
SPHLLDFLEQLTIFSASSRWSNTRHMTYWRGHTIQSRPIGGGLNTSTHGATNTSINPVTLRFASRDVYRTESYAGVLLWG
IYLEPIHGVPTVRFNFTNPQNISDRGTANYSQPYESPGLQLKDSETELPPETTERPNYESYSHRLSHIGIILQSRVNVPV
YSWTHRSADRTNTIGPNRITQIPMVKASELPQGTTVVRGPGFTGGDILRRTNTGGFGPIRVTVNGPLTQRYRIGFRYAST
VDFDFFVSRGGTTVNNFRFLRTMNSGDELKYGNFVRRAFTTPFTFTQIQDIIRTSIQGLSGNGEVYIDKIEIIPVTATFE
AEYDLERAQEAVNALFTNTNPRRLKTDVTDYHIDQVSNLVACLSDEFCLDEKRELLEKVKYAKRLSDERNLLQDPNFTSI
NKQPDFISTNEQSNFTSIHEQSEHGWWGSENITIQEGNDVFKENYVTLPGTFNECYPTYLYQKIGESELKAYTRYQLRGY
IEDSQDLEIYLIRYNAKHETLDVPGTESLWPLSVESPIGRCGEPNRCAPHFEWNPDLDCSCRDGEKCAHHSHHFSLDIDV
GCTDLHENLGVWVVFKIKTQEGHARLGNLEFIEEKPLLGEALSRVKRAEKKWRDKREKLQLETKRVYTEAKEAVDALFVD
SQYDRLQADTNIGMIHAADKLVHRIREAYLSELPVIPGVNAEIFEELEGHIITAISLYDARNVVKNGDFNNGLTCWNVKG
HVDVQQSHHRSDLVIPEWEAEVSQAVRVCPGCGYILRVTAYKEGYGEGCVTIHEIENNTDELKFKNREEEEVYPTDTGTC
NDYTAHQGTAGCADACNSRNAGYEDAYEVDTTASVNYKPTYEEETYTDVRRDNHCEYDRGYVNYPPVPAGYVTKELEYFP
ETDTVWIEIGETEGKFIVDSVELLLMEE
>Q45739 ~~~cry1Bb~~~Pesticidal crystal protein Cry1Bb~~~
MTSNRKNENEIINALSIPTVSNPSTQMNLSPDARIEDSLCVAEVNNIDPFVSASTVQTGINIAGRILGVLGVPFAGQLAS
FYSFLVGELWPSGRDPWEIFLEHVEQLIRQQVTENTRNTAIARLEGLGRGYRSYQQALETWLDNRNDARSRSIILERYVA
LELDITTAIPLFRIRNEEVPLLMVYAQAANLHLLLLRDASLFGSEWGMASSDVNQYYQEQIRYTEEYSNHCVQWYNTGLN
NLRGTNAESWLRYNQFRRDLTLGVLDLVALFPSYDTRTYPINTSAQLTREIYTDPIGRTNAPSGFASTNWFNNNAPSFSA
IEAAIFRPPHLLDFPEQLTIYSASSRWSSTQHMNYWVGHRLNFRPIGGTLNTSTQGLTNNTSINPVTLQFTSRDVYRTES
NAGTNILFTTPVNGVPWARFNFINPQNIYERGATTYSQPYQGVGIQLFDSETELPPETTERPNYESYSHRLSHIGLIIGN
TLRAPVYSWTHRSADRTNTIGPNRITQIPLVKALNLHSGVTVVGGPGFTGGDILRRTNTGTFGDIRLNINVPLSQRYRVR
IRYASTTDLQFFTRINGTTVNIGNFSRTMNRGDNLEYRSFRTAGFSTPFNFLNAQSTFTLGAQSFSNQEVYIDRVEFVPA
EVTFEAEYDLERAQKAVNALFTSTNPRRLKTDVTDYHIDQVSNMVACLSDEFCLDEKRELFEKVKYAKRLSDERNLLQDP
NFTFISGQLSFASIDGQSNFPSINELSEHGWWGSANVTIQEGNDVFKENYVTLPGTFNECYPNYLYQKIGESELKAYTRY
QLRGYIEDSQDLEIYLIRYNAKHETLDVPGTDSLWPLSVESPIGRCGEPNRCAPHFEWNPDLDCSCRDGERCAHHSHHFT
LDIDVGCTDLHENLGVWVVFKIKTQEGYARLGNLEFIEEKPLIGEALSRVKRAEKKWRDKREKLQLETKRVYTEAKEAVD
ALFVDSQYDQLQADTNIGMIHAADKLVHRIREAYLSELPVIPGVNAEIFEELEGHIITAMSLYDARNVVKNGDFNNGLTC
WNVKGHVDVQQSHHRSDLVIPEWEAEVSQAVRVCPGRGYILRVTAYKEGYGEGCVTIHEIENNTDELKFKNCEEEEVYPT
DTGTCNDYTAHQGTAACNSRNAGYEDAYEVDTTASVNYKPTYEEETYTDVRRDNHCEYDRGYVNYPPVPAGYVTKELEYF
PETDTVWIEIGETEGKFIVDSVELLLMEE
>Q45774 ~~~cry1Bc~~~Pesticidal crystal protein Cry1Bc~~~
MTSNRKNENEIINALSIPTVSNPSTQMNLSPDARIEDSLCVAEVNNIDPFVSASTVQTGINIAGRILGVLGVPFAGQLAS
FYSFLVGELWPSGRDPWEIFLEHVEQLIRQQVTENTRNTAIARLEGLGRGYRSYQQALETWLDNRNDARSRSIILERYVA
LELDITTAIPLFRIRNEEVPLLMVYAQAANLHLLLLRDASLFGSEWGMASSDVNQYYQEQIRYTEEYSNHCVQWYNTGLN
NLRGTNAESWLRYNQFRRDLTLGVLDLVALFPSYDTRTYPINTSAQLTREIYTDPIGRTNAPSGFASTNWFNNNAPSFSA
IEAAIFRPPHLLDFPEQLTIYSASSRWSSTQHMNYWVGHRLNFRPIGGTLNTSTQGLTNNTSINPVTLQFTSRDVYRTES
NAGTNILFTTPVNGVPWARFNFINPQNIYERGATTYSQPYQGVGIQLFDSETELPPETTERPNYESYSHRLSHIGLIIGN
TLRAPVYSWTHRSADRTNTIGPNRITQIPLVKALNLHSGVTVVGGPGFTGGDILRRTNTGTFGDIRLNINVPLSQRYRVR
IRYASTTDLQFFTRINGTTVNIGNFSRTMNRGDNLEYRSFRTAGFSTPFNFLNAQSTFTLGAQSFSNQEVYIDRVEFVPA
EVTFEAEYDLERAQKAVNALFTSTNPRRLKTDVTDYHIDQVSNMVACLSDEFCLDEKRELFEKVKYAKRLSDERNLLQDP
NFTFISGQLSFASIDGQSNFPSINELSEHGWWGSANVTIQEGNDVFKENYVTLPGTFNECYPNYLYQKIGESELKAYTRY
QLRGYIENSQDLEIYLIRYNAKHEAINVPGTESIWSISAESTIGKCTEPNRCAPHYEWNPDLDCSCRDGEKCAHHSHHST
LDIDVGCTDLHENLGVWLIFKIKTQDGHARLGNLEYLEEKPLLGEALRRVKRTEKKWRDKREKLHLETKRVYTEAKESVD
ALFVDSQYDRLQANSNIGMIHAADKLVHSIREAYLSELPVIRGVNADIFEELEGHILTAFSLYDARNAVKNGDFNNGLTC
WNVKGHVDVQQSHHRFDLVVPEWKAEVSQAVRVCPGCGYILRVTAYKEGYGEGCVTIHEIEENTDELNFKNRVEEEIYPP
DTGTCKYYTENQGTRTCGNECGSRNEGYDNAYEINAKSSLEYRPTYEEETYTDVRRENHCEYARGYINYSPVPAGYVTKE
LEYFPETDTVWIEIGETEGKFIVDSVELLLMEE
>Q9ZAZ5 ~~~cry1Bd~~~Pesticidal crystal protein Cry1Bd~~~
MTSNRKNENEIINALSIPAVSNHSAQMDLSLDARIEDSLCIAEGNNINPLVSASTVQTGINIAGRILGVLGVPFAGQLAS
FYSFLVGELWPSGRDPWEIFLEHVEQLIRQQVTENTRNTAIARLEGLGRGYRSYQQALETWLDNRNDARSRSIILERYVA
LELDITTAIPLFRIRNEEVPLLMVYAQAANLHLLLLRDASLFGSEWGMASSDVNQYYQEQIRYTEEYSNHCVQWYNTGLN
NLRGTNAESWLRYNQFRRDLTLGVLDLVALFPSYDTRTYPINTSAQLTREIYTDPIGRTNAPSGFASTNWFNNNAPSFSA
IEAAIFRPPHLLDFPEQLTIYSASSRWSSTQHMNYWVGHRLNFRPIGGTLNTSTQGLTNNTSINPVTLQFTSRDVYRTES
NAGTNILFTTPVNGVPWARFNFINPQNIYERGATTYSQPYQGVGIQLFDSETELPPETTERPNYESYSHRLSHIGLIIGN
TLRAPVYSWTHRSADRTNTIGPNRITQIPAVKGRFLFNGSVISGPGFTGGDVVRLNRNNGNIQNRGYIEVPIQFTSTSTR
YRVRVRYASVTSIELNVNLGNSSIFTNTLPATAASLDNLQSGDFGYVEINNAFTSATGNIVGARNFSANAEVIIDRFEFI
PVTATFEAEYDLERAQKAVNALFTSTNPRRLKTDVTDYHIDQVSNMVACLSDEFCLDEKRELFEKVKYAKRLSDERNLLQ
DPNFTFISGQLSFASIDGQSNFTSINELSEHGWWGSENVTIQEGNDVFKENYVTLPGTFNECYPNYLYQKIGESELKAYT
RYQLRGYIEDSQDLEIYLIRYNAKHETLDVPGTDSLWPLSVKSPIGRCGEPNRCAPHFEWNPDLDCSCRDGERCAHHSHH
FTLDIDVGCTDLHENLGVWVVFKIKTQEGYARLGNLEFIEEKPLIGEALSRVKRAEKKWRDKREKLQLETKRVYTEAKET
VDALFVDSHYNRLQADTNIGMIHAADRLVHRIHEAYLPELPFIPGINAVIFEELENRISTAFSLYDARNVIKNGDFNNGL
SCWNVKGHVDVQQSHHRSDLVIPEWEAEVSQAVRVCPGRGYILRVTAYKEGYGEGCVTIHEIENNTDELKFKNCEEEEVY
PTDTGTCNDYTAHQGTAACNSRNAGYEDAYEVDTTASVNYKPTYEEETYTDVRRDNHCEYDRGYVNYPPVPAGYVTKELE
YFPETDTVWIEIGETEGKFIVDSVELLLMEE
>O85805 ~~~cry1Be~~~Pesticidal crystal protein Cry1Be~~~
MTSNRKNENEIINALSIPAVSNHSAQMNLSTDARIEDSLCIAEGNNIDPFVSASTVQTGINIAGRILGVLGVPFAGQIAS
FYSFLVGELWPRGRDPWEIFLEHVEQLIRQQVTENTRDTALARLQGLGNSFRAYQQSLEDWLENRDDARTRSVLYTQYIA
LELDFLNAMPLFAIRNQEVPLLMVYAQAANLHLLLLRDASLFGSEFGLTSQEIQRYYERQVEKTREYSDYCARWYNTGLN
NLRGTNAESWLRYNQFRRDLTLGVLDLVALFPSYDTRVYPMNTSAQLTREIYTDPIGRTNAPSGFASTNWFNNNAPSFSA
IEAAVIRPPHLLDFPEQLTIFSVLSRWSNTQYMNYWVGHRLESRTIRGSLSTSTHGNTNTSINPVTLQFTSRDVYRTESF
AGINILLTTPVNGVPWARFNWRNPLNSLRGSLLYTIGYTGVGTQLFDSETELPPETTERPNYESYSHRLSNIRLISGNTL
RAPVYSWTHRSADRTNTISSDSITQIPLVKSFNLNSGTSVVSGPGFTGGDIIRTNVNGSVLSMGLNFNNTSLQRYRVRVR
YAASQTMVLRVTVGGSTTFDQGFPSTMSANESLTSQSFRFAEFPVGISASGSQTAGISISNNAGRQTFHFDKIEFIPITA
TFEAEYDLERAQEAVNALFTNTNPRRLKTGVTDYHIDEVSNLVACLSDEFCLDEKRELLEKVKYAKRLSDERNLLQDPNF
TSINKQPDFISTNEQSNFTSIHEQSEHGWWGSENITIQEGNDVFKENYVILPGTFNECYPTYLYQKIGEAELKAYTRYQL
SGYIEDSQDLEIYLIRYNAKHETLDVPGTESVWPLSVESPIGRCGEPNRCAPHFEWNPDLDCSCRDGEKCAHHSHHFSLD
IDVGCIDLHENLGVWVVFKIKTQEGHARLGNLEFIEEKPLLGEALSRVKRAEKKWRDKREKLQLETKRVYTEAKEAVDAL
FVDSQYDRLQADTNIGMIHAADKLVHRIREAYLSELSVIPGVNAEIFEELEGRIITAISLYDARNVVKNGDFNNGLACWN
VKGHVDVQQSHHRSVLVIPEWEAEVSQAVRVCPGRGYILRVTAYKEGYGEGCVTIHEIENNTDELKFKNCEEEEVYPTDT
GTCNDYTAHQGTAACNSRNAGYEDAYEVDTTASVNYKPTYEEETYTDVRRDNHCEYDRGYVNYPPVPAGYMTKELEYFPE
TDKVWIEIGETEGKFIVDSVELLLMEE
>P0A376 ~~~cry1Ca~~~Pesticidal crystal protein Cry1Ca~~~
MEENNQNQCIPYNCLSNPEEVLLDGERISTGNSSIDISLSLVQFLVSNFVPGGGFLVGLIDFVWGIVGPSQWDAFLVQIE
QLINERIAEFARNAAIANLEGLGNNFNIYVEAFKEWEEDPNNPATRTRVIDRFRILDGLLERDIPSFRISGFEVPLLSVY
AQAANLHLAILRDSVIFGERWGLTTINVNENYNRLIRHIDEYADHCANTYNRGLNNLPKSTYQDWITYNRLRRDLTLTVL
DIAAFFPNYDNRRYPIQPVGQLTREVYTDPLINFNPQLQSVAQLPTFNVMESSAIRNPHLFDILNNLTIFTDWFSVGRNF
YWGGHRVISSLIGGGNITSPIYGREANQEPPRSFTFNGPVFRTLSNPTLRLLQQPWPAPPFNLRGVEGVEFSTPTNSFTY
RGRGTVDSLTELPPEDNSVPPREGYSHRLCHATFVQRSGTPFLTTGVVFSWTHRSATLTNTIDPERINQIPLVKGFRVWG
GTSVITGPGFTGGDILRRNTFGDFVSLQVNINSPITQRYRLRFRYASSRDARVIVLTGAASTGVGGQVSVNMPLQKTMEI
GENLTSRTFRYTDFSNPFSFRANPDIIGISEQPLFGAGSISSGELYIDKIEIILADATFEAESDLERAQKAVNALFTSSN
QIGLKTDVTDYHIDQVSNLVDCLSDEFCLDEKRELSEKVKHAKRLSDERNLLQDPNFRGINRQPDRGWRGSTDITIQGGD
DVFKENYVTLPGTVDECYPTYLYQKIDESKLKAYTRYELRGYIEDSQDLEIYLIRYNAKHEIVNVPGTGSLWPLSAQSPI
GKCGEPNRCAPHLEWNPDLDCSCRDGEKCAHHSHHFTLDIDVGCTDLNEDLGLWVIFKIKTQDNHARLGNLEFLEEKPLL
GEALARVKRAEKKWRDKREKLQLETNIVYKEAKESVDALFVNSQYDRLQVNTNIAMIHAADKRVHRIREAYLPELSVIPG
VNAAIFEELEGRIFTAYSLYDARNVIKNGDFNNGLLCWNVKGHVDVEEQNNHRSVLVIPEWEAEVSQEVRVCPGRGYILR
VTAYKEGYGEGCVTIHEIEDNTDELKFSNCVEEEVYPNNTVTCNNYTGTQEEYEGTYTSRNQGYDEAYGNNPSVPADYAS
VYEEKSYTDGRRENPCESNRGYGDYTPLPAGYVTKDLEYFPETDKVWIEIGETEGTFIVDSVELLLMEE
>P0A375 ~~~cry1Ca~~~Pesticidal crystal protein Cry1Ca~~~
MEENNQNQCIPYNCLSNPEEVLLDGERISTGNSSIDISLSLVQFLVSNFVPGGGFLVGLIDFVWGIVGPSQWDAFLVQIE
QLINERIAEFARNAAIANLEGLGNNFNIYVEAFKEWEEDPNNPETRTRVIDRFRILDGLLERDIPSFRISGFEVPLLSVY
AQAANLHLAILRDSVIFGERWGLTTINVNENYNRLIRHIDEYADHCANTYNRGLNNLPKSTYQDWITYNRLRRDLTLTVL
DIAAFFPNYDNRRYPIQPVGQLTREVYTDPLINFNPQLQSVAQLPTFNVMESSRIRNPHLFDILNNLTIFTDWFSVGRNF
YWGGHRVISSLIGGGNITSPIYGREANQEPPRSFTFNGPVFRTLSNPTLRLLQQPWPAPPFNLRGVEGVEFSTPTNSFTY
RGRGTVDSLTELPPEDNSVPPREGYSHRLCHATFVQRSGTPFLTTGVVFSWTDRSATLTNTIDPERINQIPLVKGFRVWG
GTSVITGPGFTGGDILRRNTFGDFVSLQVNINSPITQRYRLRFRYASSRDARVIVLTGAASTGVGGQVSVNMPLQKTMEI
GENLTSRTFRYTDFSNPFSFRANPDIIGISEQPLFGAGSISSGELYIDKIEIILADATFEAESDLERAQKAVNALFTSSN
QIGLKTDVTDYHIDQVSNLVDCLSDEFCLDEKRELSEKVKHAKRLSDERNLLQDPNFRGINRQPDRGWRGSTDITIQGGD
DVFKENYVTLPGTVDECYPTYLYQKIDESKLKAYTRYELRGYIEDSQDLEIYLIRYNAKHEIVNVPGTGSLWPLSAQSPI
GKCGEPNRCAPHLEWNPDLDCSCRDGEKCAHHSHHFTLDIDVGCTDLNEDLGVWVIFKIKTQDGHARLGNLEFLEEKPLL
GEALARVKRAEKKWRDKREKLQLETNIVYKEAKESVDALFVNSQYDRLQVDTNIAMIHAADKRVHRIREAYLPELSVIPG
VNAAIFEELEGRIFTAYSLYDARNVIKNGDFNNGLLCWNVKGHVDVEEQNNHRSVLVIPEWEAEVSQEVRVCPGRGYILR
VTAYKEGYGEGCVTIHEIEDNTDELKFSNCVEEEVYPNNTVTCNNYTGTQEEYEGTYTSRNQGYDEAYGNNPSVPADYAS
VYEEKSYTDGRRENPCESNRGYGDYTPLPAGYVTKDLEYFPETDKVWIEIGETEGTFIVDSVELLLMEE
>P56953 ~~~cry1Cb~~~Pesticidal crystal protein Cry1Cb~~~
MENNIQNQCVPYNCLSNPEEILLDGERISTGNSSIDISLSLVQLLVSNFVPGGGFLVGLLDFVWGIVGPSPWDAFLVQIE
QLINERIAAYARSAAISNLEGLGNNFNIYVEAFKEWEADPDNPVTRTRVVDRFRILDGLLERDIPSFRIAGFEVPLLSVY
AQAANLHLAILRDSSIFGARWGLTTINVNENYNRLIRHIDEYANHCADTYNRGLNNLPKSTYQDWITYNRLRRDLTLTVL
DIAAFFPSYDNRRYPIQSVGQLTREIYTDPLITFNPQLQSVAQLPTFNVMESNAIRTPHLFDVLNNLTIFTDWFSVGRNF
YWGGHRVISNRIGGGNITSPIYGREANQEPPRSFTFNGPVFRTLSNPTFRPLQQPWPAPPFNLRGVEGVEFSTPLNSFTY
RGRGTVDSLTELPPEDNSVPPREGYSHRLCHATFVQRSGTPFLTTGPVFSWTHRSATDRNIIYPDVINQIPLVKAFNLTS
GTSVVRGPGFTGGDIIRTNVNGSVLSMSLNFSNTTLQRYRVRVRYAASQTMVMSVTVGGSTTGNQGFPSTMSANGALTSQ
SFRFAEFPVGISASGSQGASISISNNVGRQMFHLDRIEFLPVTSTFEEEYDLERAQEAVNALFTSTNQLGLKTDVTDYHI
DQVSNLVECLSDEFCLDEKRELSEKVKHAKRLSDERNLLQDRNFRSINGQLDRGWRGSTDITIQGGDDVFKENYVTLPGT
FDECYPTYLYQKIDESKLKSYTRYELRGYIEDSQDLEIYLIRYNAKHEIVNVPGTGSLWPLSIENSIGPCGEPNRCAPHL
EWNPNLDCSCRDGEKCAHHSHHFSLDIDVGCTDLNEDLGVWVIFKIKTQDGHARLGNLEFLEEKPLLGEALARVKRAEKK
WRDKREKLEWETNIVYKEAKESVDALFVNSQYDRLQADTNIAMIHAADKRVHRIREAYLPELSVIPGVNAGIFEELEGRI
FTAYSLYDARNVIKNGDFNNGLLCWNLKGHVDVEEQNNHRSVLVVPEWEAEVSQEVRVCPGRGYILRVTAYKEGYGEGCV
TIHEIEDNTDELKFSNCVEEEVYPNNTVTCNDYTATQEEYGGAYTSRNHGYGKSYESNSSVQADYASVYEEKADTDGRRD
NHCESNRGYGDYTPLPAGYVTKELEYFPETDKVWVEIGETEGTFIVDSVELLLMEE
>P19415 ~~~cry1Da~~~Pesticidal crystal protein Cry1Da~~~
MEINNQNQCVPYNCLSNPKEIILGEERLETGNTVADISLGLINFLYSNFVPGGGFIVGLLELIWGFIGPSQWDIFLAQIE
QLISQRIEEFARNQAISRLEGLSNLYKVYVRAFSDWEKDPTNPALREEMRIQFNDMNSALITAIPLFRVQNYEVALLSVY
VQAANLHLSILRDVSVFGERWGYDTATINNRYSDLTSLIHVYTNHCVDTYNQGLRRLEGRFLSDWIVYNRFRRQLTISVL
DIVAFFPNYDIRTYPIQTATQLTREVYLDLPFINENLSPAASYPTFSAAESAIIRSPHLVDFLNSFTIYTDSLARYAYWG
GHLVNSFRTGTTTNLIRSPLYGREGNTERPVTITASPSVPIFRTLSYITGLDNSNPVAGIEGVEFQNTISRSIYRKSGPI
DSFSELPPQDASVSPAIGYSHRLCHATFLERISGPRIAGTVFSWTHRSASPTNEVSPSRITQIPWVKAHTLASGASVIKG
PGFTGGDILTRNSMGELGTLRVTFTGRLPQSYYIRFRYASVANRSGTFRYSQPPSYGISFPKTMDAGEPLTSRSFAHTTL
FTPITFSRAQEEFDLYIQSGVYIDRIEFIPVTATFEAEYDLERAQKVVNALFTSTNQLGLKTDVTDYHIDQVSNLVACLS
DEFCLDEKRELSEKVKHAKRLSDERNLLQDPNFRGINRQPDRGWRGSTDITIQGGDDVFKENYVTLPGTFDECYPTYLYQ
KIDESKLKAYTRYQLRGYIEDSQDLEIYLIRYNAKHEIVNVPGTGSLWPLSVENQIGPCGEPNRCAPHLEWNPDLHCSCR
DGEKCAHHSHHFSLDIDVGCTDLNEDLGVWVIFKIKTQDGHARLGNLEFLEEKPLLGEALARVKRAEKKWRDKRETLQLE
TTIVYKEAKESVDALFVNSQYDRLQADTNIAMIHAADKRVHRIREAYLPELSVIPGVNAAIFEELEERIFTAFSLYDARN
IIKNGDFNNGLLCWNVKGHVEVEEQNNHRSVLVIPEWEAEVSQEVRVCPGRGYILRVTAYKEGYGEGCVTIHEIENNTDE
LKFNNCVEEEVYPNNTVTCINYTATQEEYEGTYTSRNRGYDEAYGNNPSVPADYASVYEEKSYTDRRRENPCESNRGYGD
YTPLPAGYVTKELEYFPETDKVWIEIGETEGTFIVDSVELLLMEE
>Q45747 ~~~cry1Db~~~Pesticidal crystal protein Cry1Db~~~
MDINHQNQCIPYNCLSNPDAILLDAERLETGNTVADISLGLINFLYSNFVPGGGFIVGLLELIWGFVGPSQWEIFLAQIE
QLISQRIEEFARNQAISRLEGLSNNYEIYTETFRAWEKDPSNPALREEMRTQFNVMNSALIAAIPLLRVRNYEVALLSVY
VQAANLHLSVLRDVSVYGQRWGFDPATVNSRYSDLTRLIHVYTDHCVDTYNDGLKNLEGSRLSDWVVYNRFRRRLTISVL
DIIAFFPNYDIEAYPIQTASQLTREVYLDLPFVNETLSPPASYPTFSAAESAIIRSPHLVDFLNSFTIYTDSLASYAYWG
GHLVNSFRTGTTTNLIRSPLYGREGNTERPVTISASPSVPIFRTLSYFTGLNNNNPVAGIEGVEFQNTISRSIYRKSGPI
DSFSELPPQDVSVSPAIGYSHRLCHATFLERISGPRIAGTVFSWTHRSASPINEVSPSRITQIPWVKAHTLASGASVIKG
PGFTGGDILTRNSMGDLGALRVTFTGRLPQSYYIRFRYASVANRSGTFRYSQPPSYGISFPKTMDAGEALTSRSFAHTTL
FTPITFSRAQEEFDLYIQSGVYIDRIEFIPVDATFESEINLERAQKAVNALFTSTNQLGLKTDVTDYHIDQVSNLVECLS
DEFCLDEKRELSEKVKHAKRLSDERNLLQDPNFRGINRQPDRGWRGSTDITIQGGDDVFKENYVTLTGTFDECYPTYLYQ
KIDESKLKAYTRYQLRGYIEDSQDLEIYLIRYNAKHEIVNVPGTGSLWPLSVQSPIGKCGEPNRCAPHLEWNPDLDCSCR
DEEKCAHHSHHFSLDIDVGCTDLNEDLGVWVIFKIKTQDGHARLGNLEFLEEKPLVGEALARVKRAEKKWRDKREKLELE
TNIVYKEAKESVDALFVNSQYDQLQADTNIAMIHAADKRVHSIREAYLPELSVIPGVNAGIFEELEGRIFTAYSLYDARN
VIKNGDFNNGLSCWNVKGHVDVEEQNNHRSVLVVPEWEAEVSQEVRVCPGRGYILRVTAYKEGYGEGCVTIHEVDNNTDE
LKFSNCEKEQVYPGNTVACNDYNKNHGANACSSRNRGYDESYESNSSIPADYAPVYEEEAYTDGQRGNPCEFNRGHTPLP
AGYVTAELEYFPETDTVWVEIGETEGTFIVDSVELLLMEE
>Q57458 ~~~cry1Ea~~~Pesticidal crystal protein Cry1Ea~~~
MEIVNNQNQCVPYNCLNNPENEILDIERSNSTVATNIALEISRLLASATPIGGILLGLFDAIWGSIGPSQWDLFLEQIEL
LIDQKIEEFARNQAISRLEGISSLYGIYTEAFREWEADPTNPALKEEMRTQFNDMNSILVTAIPLFSVQNYQVPFLSVYV
QAANLHLSVLRDVSVFGQAWGFDIATINSRYNDLTRLIPIYTDYAVRWYNTGLDRLPRTGGLRNWARFNQFRRELTISVL
DIISFFRNYDSRLYPIPTSSQLTREVYTDPVINITDYRVGPSFENIENSAIRSPHLMDFLNNLTIDTDLIRGVHYWAGHR
VTSHFTGSSQVITTPQYGITANAEPRRTIAPSTFPGLNLFYRTLSNPFFRRSENITPTLGINVVQGVGFIQPNNAEVLYR
SRGTVDSLNELPIDGENSLVGYSHRLSHVTLTRSLYNTNITSLPTFVWTHHSATNTNTINPDIITQIPLVKGFRLGGGTS
VIKGPGFTGGDILRRNTIGEFVSLQVNINSPITQRYRLRFRYASSRDARITVAIGGQIRVDMTLEKTMEIGESLTSRTFS
YTNFSNPFSFRANPDIIRIAEELPIRGGELYIDKIELILADATFEEEYDLERAQKAVNALFTSTNQLGLKTDVTDYHIDQ
VSNLVECLSDEFCLDEKRELSEKVKHAKRLSDERNLLQDPNFRGINRQPDRGWRGSTDITIQGGDDVFKENYVTLPGTFD
ECYPTYLYQKIDESKLKAYTRYELRGYIEDSQDLEIYLIRYNAKHETVNVPGTGSLWPLSAQSPIGKCGEPNRCAPHLEW
NPNLDCSCRDGEKCAHHSHHFSLDIDVGCTDLNEDLGVWVIFKIKTQDGYARLGNLEFLEENPLLGEALARVKRAEKKWR
DKCEKLEWETNIVYKEAKESVDALFVNSQYDRLQADTNIAMIHAADKRVHSIREAYLPELSVIPGVNAAIFEELEGRIFT
AFSLYDARNVIKNGDFNNGLSCWNVKGHVDVEEQNNHRSVLVVPEWEAEVSQEVRVCPGRGYILRVTAYKEGYGEGCVTI
HEIEDNTDELKFSNCVEEEVYPNNTVTCNNYTATQEEHEGTYTSRNRGYDEAYESNSSVHASVYEEKSYTDRRRENPCES
NRGYGDYTPLPAGYVTKELEYFPETDKVWIEIGETEGTFIVDSVELLLMEE
>Q03745 ~~~cry1Eb~~~Pesticidal crystal protein Cry1Eb~~~
MENNIENQCIPYNCLNNPEVEILGIERSNSNVAAEIGLGLSRLLVSRIPLGDFILGLFDVIWGAIGPSQWDIFLEQIELL
IGQRIEEFARNQAISRLQGLSNLYRIYTNAFKNWEVDPTNPALREEMRIQFNDMNSALTTAIPLFSVQGYEIPLLSVYVQ
AANLHLSVLRDVSVFGQRWGFDVATINSRYNDLTRLIGEYTDYAVRWYNTGLNRLPRNEGVRGWARFNRFRRELTISVLD
IISFFQNYDSRLYPIPTIYQLTREVYTDPVINITDYRVTPSFESIENSAIRSPHLMDFLNNIIIDTDLIRGVHYWAGHRV
TSHFTGSSQVISSPQYGITANAEPSRTIAPSTFPGLNLFYRTLSDPFFRRSDNIMPTLGINVVQGVGFIQPNNGEVLYRR
RGTVDSLDELPIDGENSLVGYSHRLSHVTLTRSLYNTNITSLPTFVWTHHSATDRNIIYPDVITQIPLVKSFSLTSGTSV
VRGPGFTGGDIIRTNVNGNVLSMSLNFSNTSLQRYRVRVRYAASQTMVMRVNVGGSTTFDQGFPSTMSANGSLTSQSFRF
AEFPVGISTSGSQTAGISISNNPGRQTFHLDRIEFIPVDATFEAEYDLERAQKAVNSLFTSSNQIELKTDVTDYHIDQVS
NLVDCLSDEFCLDEKRELSEKVKHAKRLSDERNLLQDPNFRGINRQPDRGWRGSTDITIQGGDDVFKENYVTLPGTFDEC
YPTYLYQKIDESKLKAYNRYQLRGYIEDSQDLEIYLIRYNAKHETVNVPGTGSLWPLSVESPIGRCGEPNRCVPHLEWNP
DLDCSCRDGEKCAHHSHHFSLDIDVGCTDLQEDLGVWVVFKIKTQEGYARLGNLEFIEEKPLIGEALSRVKRAEKKWRDK
REKLQLETKRVYTEAKEAVDALFVDSQYDRLQADTNIGMIHAADRLVHQIHEAYLPELPFIPGINVVIFEELENRISTAL
SLYDARNVIKNGDFNNGLSCWNVKGHVDVVEQNNHRSVLVVPEWEAEVSQTIRVCPGRGYILRVTAYKEGYGEGCVTIHE
IENNTDELKFKNCEEEEVYPTDTGTCNDYTAHQGTAGSTDSCNSRNIRYEDAYEMNTTASVNYKPTYEEERYTDVQGDNH
CEYDRGYVNYRPVPAGYVTKELEYFPETDKVWIEIGETEGKFIVDNVELLLMEE
>Q03746 ~~~cry1Fa~~~Pesticidal crystal protein Cry1Fa~~~
MENNIQNQCVPYNCLNNPEVEILNEERSTGRLPLDISLSLTRFLLSEFVPGVGVAFGLFDLIWGFITPSDWSLFLLQIEQ
LIEQRIETLERNRAITTLRGLADSYEIYIEALREWEANPNNAQLREDVRIRFANTDDALITAINNFTLTSFEIPLLSVYV
QAANLHLSLLRDAVSFGQGWGLDIATVNNHYNRLINLIHRYTKHCLDTYNQGLENLRGTNTRQWARFNQFRRDLTLTVLD
IVALFPNYDVRTYPIQTSSQLTREIYTSSVIEDSPVSANIPNGFNRAEFGVRPPHLMDFMNSLFVTAETVRSQTVWGGHL
VSSRNTAGNRINFPSYGVFNPGGAIWIADEDPRPFYRTLSDPVFVRGGFGNPHYVLGLRGVAFQQTGTNHTRTFRNSGTI
DSLDEIPPQDNSGAPWNDYSHVLNHVTFVRWPGEISGSDSWRAPMFSWTHRSATPTNTIDPERITQIPLVKAHTLQSGTT
VVRGPGFTGGDILRRTSGGPFAYTIVNINGQLPQRYRARIRYASTTNLRIYVTVAGERIFAGQFNKTMDTGDPLTFQSFS
YATINTAFTFPMSQSSFTVGADTFSSGNEVYIDRFELIPVTATFEAEYDLERAQKAVNALFTSINQIGIKTDVTDYHIDQ
VSNLVDCLSDEFCLDEKRELSEKVKHAKRLSDERNLLQDPNFKGINRQLDRGWRGSTDITIQRGDDVFKENYVTLPGTFD
ECYPTYLYQKIDESKLKPYTRYQLRGYIEDSQDLEIYLIRYNAKHETVNVLGTGSLWPLSVQSPIRKCGEPNRCAPHLEW
NPDLDCSCRDGEKCAHHSHHFSLDIDVGCTDLNEDLDVWVIFKIKTQDGHARLGNLEFLEEKPLVGEALARVKRAEKKWR
DKREKLELETNIVYKEAKESVDALFVNSQYDQLQADTNIAMIHAADKRVHRIREAYLPELSVIPGVNVDIFEELKGRIFT
AFFLYDARNVIKNGDFNNGLSCWNVKGHVDVEEQNNHRSVLVVPEWEAEVSQEVRVCPGRGYILRVTAYKEGYGEGCVTI
HEIENNTDELKFSNCVEEEVYPNNTVTCNDYTANQEEYGGAYTSRNRGYDETYGSNSSVPADYASVYEEKSYTDGRRDNP
CESNRGYGDYTPLPAGYVTKELEYFPETDKVWIEIGETEGTFIVDSVELLLMEE
>O66377 ~~~cry1Fb~~~Pesticidal crystal protein Cry1Fb~~~
MKNNIQNQCVPYNCLSNPEVEILSEERSTGRLPLDISLSLTRFLLSEFVPGVGVAFGLFDLIWGFITPSEWSLFLLQIEQ
LIEQRIETLERNRAITTLRGLADSYEVYLEALREWEENPNNAQLREDVRIRFANTDDALITAINNFTLTSFEIPLLSVYV
QAANLHLSLLRDAVSFGQGWGLDIATVNNHYNRLINLIHRYTEHCLDTYNQGLENLRGTNTRQWSRFNQFRRELTLTVLD
IVALFPNYDARAYPIQTSSQLTREIYTSSVIEDSPVSANIPNGFNRAEFGVRPPHLMDFMNSLFVTAETVRSQTVWGGHL
VSSRNTAGNPINFPIYGVFNPGGAIWIADEDPRPFYRTLSDPVFVRGGFGNPHYVLGLRGVGFQQTGTNHTRTFRNSGTI
DSLDEIPPQDNSGAPWNDYSHVLNHVTFVRWPGEIAGSDSWRAPMFSWTHRSADRTNIINPNIITQIPAVKAHNLHSGST
VVRGPGFTGGDLLRRTNTGTFADIRVNITGPLSQRYRVRIRYASTTDLQFFTRINGTSVNQGNFQRTMNRGGNLESGNFR
TAGFSTPFSFSNAQSTFTLGTQAFSNQEVYIDRIEFVPAEVTFEAESDLERAQKAVNALFTSTSQLGLKTNVTGYHIDQV
SNLVACLSDEFCLDEKRELSEKVKHAKRLSDKRNLLQDPNFRGINRQPDHGWRGSTDITIQGGDDVFKENYVTLPGTFDE
CYPTYLYQKIDESKLKAYTRYQLRGYIEDSQDLEIYLIRYNSKHEIVNVPGTGSLWPLSVENQIGPCGEPNRCAPHLEWN
PDLHCSCRDGEKCVHHSHHFSLDIDVGCTDLNEDLGVWLIFKIKTQDGHARLGNLEFLEEEPLLGEALARVKRAEKKWRD
KREKLQLETNIVYKEAKESVDALFVNSQYDRLQADTNIAMIHAADKRVHRIREAYLPELSVIPGVNAAIFEELEGRIFTA
YSLYDARNVIKNGNFNNGLLCWNVKGHVDVEEQNNHRSVLVVPEWEAEVSQEVRVCPGRGYILRVTAYKEGYGEGCVTIH
EVDNNTDELKFSSNCEKEQVYPGNTVACNDYNKNHGANACSSRNGGYDESYESNSSIPADYAPVYEEEAYTDGQRGNPCE
FNRGHTPLPAGYVTAELEYFPETDTVWVEIGETEGTFIVDSVELLLMEE
>Q45746 ~~~cry1Ga~~~Pesticidal crystal protein Cry1Ga~~~
MEISDQNQYIPYNCLNNPESEIFNARNSNFGLVSQVSSGLTRFLLEAAVPEAGFALGLFDIIWGALGVDQWSLFLRQIEQ
LIRQEITELERNRATAILTGLSSSYNLYVEALREWENDPNNPASQERVRTRFRLTDDAIVTGLPTLAIRNLEVVNLSVYT
QAANLHLSLLRDAVYFGERWGLTQANIEDLYTRLTSNIQEYSDHCARWYNQGLNEIGGISRRYLDFQRDLTISVLDIVAL
FPNYDIRTYPIPTQSQLTREIYTSPVVAGNINFGLSIANVLRAPHLMDFIDRIVIYTNSVRSTPYWAGHEVISRRTGQGQ
GNEIRFPLYGVAANAEPPVTIRPTGFTDEQRQWYRARSRVVSFRSSGQDFSLVDAVGFLTIFSAVSIYRNGFGFNTDTID
EIPIEGTDPFTGYSHRLCHVGFLASSPFISQYARAPIFSWTHRSATLTNTIAPDVITQIPLVKAFNLHSGATIVKGPGFT
GGDILRRTNVGSFGDMRVNITAPLSQRYRVRIRYASTTDLQFYTNINGTTINIGNFSSTMDSGDDLQYGRFRVAGFTTPF
TFSDANSTFTIGAFGFSPNNEVYIDRIEFVPAEVTFEAEYDLEKAQKAVNALFTSSNQIGLKTDVTDYHIDKVSNLVECL
SDEFCLDEKRELSEKVKHAKRLSDERNLLQDPNFRGINRQPDRGWRGSTDITIQGGDDVFKENYVTLPGTFDGCYPTYLY
QKIDESKLKVYTRYQLRGYIEDSQDLEIYLIRYNAKHETVNVPGTGSLWPLSAQSPIGKCGEPNRCAPHLEWNPDLDCSC
RNGEKCAHHSHHFSLDIDVGCTDLNEDLGVWVIFKIKTQDGHARLGNLEFLEEKPLLGEALARVKRAEKKWRDKREKLEL
ETNIVYKEAKESVDALFVNSQYDQLQADTNIAMIHAADKRVHSIREAYLPELSVIPGVNAAIFEELEGRIFTAFSLYDAR
NVIKNGDFNNGLSCWNVKGHVDVEEQNNHRSVLVVPEWEAEVSQEVRVCPGRGYILRVTAYKEGYGEGCVTIHEIENNTD
ELKFSNCVEEEVYPNNTVTCNDYTANQEEYKGAYTSHNRGYDEAYGNNPSVPADYTPVYEEKAYTDGRRENPCESNRGYG
DYTPLPAGYVTKELEYFPETDKVWIEIGETEGTFIVESVELLLMEE
>Q9ZAZ6 ~~~cry1Gb~~~Pesticidal crystal protein Cry1Gb~~~
MEINNQNQCVPYNCLNNPESEILNVAIFSSEQVAEIHLKITRLILENFLPGGSFAFGLFDLIWGIFNEDQWSAFLRQVEE
LINQRITEFARGQAIQRLVGFGRSYDEYILALKEWENDPDNPASKERVRTRFRTTDDALLTGVPLMAIPGFELATLSVYA
QSANLHLALLRDAVFFGERWGLTQTNINDLYSRLKNSIRDYTNHCVRFYNIGLGNLNVIRPEYYRFQRELTISVLDLVAL
FPNYDIRTYPIPTKSQLTREIYTDPIISPGAQAGYTLQDVLREPHLMDFLNRLIIYTGEYRGIRHWAGHEVESSRTGMMT
NIRFPLYGTAATAEPTRFITPSTFPGLNLFYRTLSAPIFRDEPGANIIIRYRTSLVEGVGFIQPNNGEQLYRVRGTLDSL
DQLPLEGESSLTEYSHRLCHVRFAQSLRNAEPLDYARVPMFSWTHRSATPTNTIDPDVITQIPLVKAFNLHSGATVVRGP
GFTGGDILRRTNAGNFGDMRVNITAPLSQRYRVRIRYASTANLQFHTSINGRAINQANFPATMNSGENLQSGSFRVAGFT
TPFTFSDALSTFTIGAFSFSSNNEVYIDGIEFVPAEVTFATESDQDRAQKAVNALFTSSNQIGLKTDVTNYHIDQVSNLV
ECLSDEFCLDEKRELSEKVKHAKRLCDERNLLQDPNFRGINREPDRGWRGSTDITIQRGDDVFKENYVTLPGTFDECYPT
YLYQKIDESKLKAYTRYELRGYIEDSQDLEIYLIRYNAKHETVNVPGTGSLWPLSAQSPIGKCGEPNRCATHLEWNPDLD
CSCRDGEKCAHHSHHFSLDIDVGCTDLNEDLGVWVIFKIKTQDGHARLGNLEFLEEKPLLGEALARVKRAEKKWRDKREK
LELETNIVYKEAKKSVDALFVNSQYDRLQADTNIAIIHAADKRVHSIREAYLPELSVIPGVNAAIFEELEGRIFTAYSLY
DARNVIKNGDFNNGLSCWNVKGHVDVEEQNNHRSVLVVPEWEAEVSQEVRVCPGRGYILRVTAYKEGYGEGCVTIHEIED
NTDELKFSNCVEEEIYPNNTVTCNDYTATQEEYEGTYTSRNRGYDGAYESNSSVPADYASAYEEKAYTDGRRDNTCESNR
GYGDYTPLPAGYVTKELEYFPETDKVWIEIGETEGTFIVDSVELLLMEE
>Q45748 ~~~cry1Ha~~~Pesticidal crystal protein Cry1Ha~~~
MEIINNQNQYVPYNCLSNPENEILDIESLSSRSREQVAEISLGLTRFLLESLLPGASFGFALFDIIWGVIGPDQWNLFLA
QIEQLIDQRIEAHVRNQAISRLEGLGDSYEVYIESLREWEASPNNEALQQDVRNRFSNTDNALITAIPILREQGFEIPLL
SVYVQAANLHLSLLRDAVYFGQRWGLDTVTVNNHYNRLINLINTYSDHCAQWFNRGLDNFGGVSARYLDFQREVTISVLD
IVALFPNYDIRTYPISTQSQLTREIYTSPVAEPGASLNANLQNILREPHLMDFLTRLVIYTGVQSGIYHWAGHEISSRTT
GNLSSNIQFPLYGTAASADRAFNMNIHHSETIYRTLSAPIYSVSGGISPNRTRVVEGVRFLIARDNNLDSLPFLYRKEGT
LDSFTELPPEDESTPPYIGYSHRLCHARFARSPVILEPSNFARLPVFSWTHRSASPTNEVSPSRITQIPWVKAHTLASGA
SVIKGPGFTGGDIMTRNNINLGDLGTLRVTVTGRLPQSYYIRLRYASVANSSGVFRHLPQPSYGISFPRTMGTDEPLTSR
SFALTTLFTPITLTRAQEEFNLTIPRGVYIDRIEFVPVDATFEAGYDLERAQKAVNALFTSTNQRGLKTDITDYHIDQVS
NLVECLSDEFCLDEKRELSEKVKHAKRLSDGRNLLQDRNFISINGLLDRGWRGSTDITIQGSDDVFKENYVTLPGTFDEC
YPTYLYQKIDESKLKAYTRYQLRGYIEDSQDLEIYLIRYNAKHEIVNVPGTGSLWPLSVENSIGPCGESNRCAPHLEWNP
NLDCSCRDGEKCAHHSHHFSLDIDVGCTDLNEDLGVWVIFKIKTQDGHARIGNLEFLEEKPLVGEALARVKRAEKKWRDK
RKKLEFETNIVYKEAKESVDALFVNSQYDKLKADTNIAMIHAADKRVHRIREAYLPELSVIPGVNADIFEELEGRIFTAY
SLYDARNVIKNGDFNNGLLCWNVKGHVDVEEQNNHRSVLVVPEWEAEVSQEVRVCPGRGYILRVTAYKEGYGEGCVTIHE
IEDNTDELKFSNCVEEEVYPSNTVTCNDYTANQEEYEGTYTSRNQGYDEAYESNSSVPANYASVYEEKAYTDGRRENSCE
FNRGYRDYTPLPAGYVTKELEYFPGTAKVWIEIGETEGTFIVDSVELLLMEE
>Q45718 ~~~cry1Hb~~~Pesticidal crystal protein Cry1Hb~~~
MEVNHQNECVPYNCLKNPKIEMLDIEGISSRSREQVAEISLGLTRFLLESLLPGASFGFGLFDIIWGVIGPDQWSLFLTQ
IEQLIDQRIEAHVRNQAISRLEGLGDSYEVYIESLREWEASPNNESLQQDVRNRFSNTDNALITAIPILREQGFEIPLLT
VYVQAANLHLSLLRDAVYFGQRWGLDTATVNNHYNRLINLINTYSDHCAQWFNRGLDNFGVVTARYLDFQREVTISVLDI
VALFPNYDIRTYPIQTLSQLTREIYTSPVAEPGASLNVDLRNILREPHLMDFLTRLVIYTGVQGGIYHWAGHEISSRTTG
NLSSNIQFPLYGTSANADRPFNLAIHYSETIYRTLSAPIYSVSGGISPNRTRAVEGVRFLTARDNNLNSLPFLYRKEGSL
DSFTELPPEDENEPPYIGYSHRLCHARFARSSVVLEPSNFARIPVFSWTHRSAGPTNEVSSSRITQIPWVKAHTLDSGAF
VIKGPGFTGGDILTRPNLGTLGALRVTLTGQLPQTYNIRIRYASIANRGGTLIFSQPPSYGLTFPKTMDIDEPLTSRSFA
RTTLFTPITFTQAQAELNLTIQQGVYIDRIEFIPVNATFEAEYDLERAQEAVNALFTSSNQLGLKTDLTDYHIDQVSNLV
DCLSDEFCIDEKRELSEKVKHAKRLSDERNLLQDSNFRGINRQPDRGWRGSTDITIQGGNDVFKENYVTLPGTFDECYPT
YLYQKIDESKLKAYTRYQLRGYIEDSQDLEIYLIRYNAKHETVNVPGTGSLWPLSVESPIGKCGEPNRCVPQLEWNSNLD
CSCRDGEKCAHHSHHFSLDIDVGCTDLNEDLGVWVIFKIKTQDGHARLGNLEFLEEKPLLGEALARVKRAEKKWRDKRET
LQLETNIVYKEAKESVDALFANSQYNRLQADTNIAMIHAADKRVHRIREAYLPELSVIPGVNAGIFEELEGRIFTAFSLY
DARNVIKNSDFNNGLSCWNVKGHVDIEEQNNHRSVLVVPEWEAEVSQKVHVCPGRGYILRVTAYKEGYGEGCVTIHEIED
HTDELKFRNCEEDEVYPNNTRTCNAYPADQEGYEGACTSRNRGYDEVYGNTPSLPADYAPIYEENAYTDGRRGNPCESSR
GYGDYTPLPAGYETKELEYFPETDTVWPRNRYSVD
>Q45752 ~~~cry1Ia~~~Pesticidal crystal protein Cry1Ia~~~
MKLKNQDKHQSFSSNAKVDKISTDSLKNETDIELQNINHEDCLKMSEYENVEPFVSASTIQTGIGIAGKILGTLGVPFAG
QVASLYSFILGELWPKGKNQWEIFMEHVEEIINQKISTYARNKALTDLKGLGDALAVYHDSLESWVGNRNNTRARSVVKS
QYIALELMFVQKLPSFAVSGEEVPLLPIYAQAANLHLLLLRDASIFGKEWGLSSSEISTFYNRQVERAGDYSDHCVKWYS
TGLNNLRGTNAESWVRYNQFRRDMTLMVLDLVALFPSYDTQMYPIKTTAQLTREVYTDAIGTVHPHPSFTSTTWYNNNAP
SFSAIEAAVVRNPHLLDFLEQVTIYSLLSRWSNTQYMNMWGGHKLEFRTIGGTLNISTQGSTNTSINPVTLPFTSRDVYR
TESLAGLNLFLTQPVNGVPRVDFHWKFVTHPIASDNFYYPGYAGIGTQLQDSENELPPEATGQPNYESYSHRLSHIGLIS
ASHVKALVYSWTHRSADRTNTIEPNSITQIPLVKAFNLSSGAAVVRGPGFTGGDILRRTNTGTFGDIRVNINPPFAQRYR
VRIRYASTTDLQFHTSINGKAINQGNFSATMNRGEDLDYKTFRTVGFTTPFSFLDVQSTFTIGAWNFSSGNEVYIDRIEF
VPVEVTYEAEYDFEKAQEKVTALFTSTNPRGLKTDVKDYHIDQVSNLVESLSDEFYLDEKRELFEIVKYAKQLHIERNM
>Q45709 ~~~cry1Ib~~~Pesticidal crystal protein Cry1Ib~~~
MKLKNPDKHQSLSSNAKVDKIATDSLKNETDIELKNMNNEDYLRMSEHESIDPFVSASTIQTGIGIAGKILGTLGVPFAG
QIASLYSFILGELWPKGKSQWEIFMEHVEEIINQKILTYARNKALSDLRGLGDALAVYHESLESWVENRNNTRARSVVKN
QYIALELMFVQKLPSFAVSGEEVPLLPIYAQAANLHLLLLRDASIFGKEWGLSASEISTFYNRQVERTRDYSDHCIKWYN
TGLNNLRGTNAKSWVRYNQFRKDMTLMVLDLVALFPSYDTLVYPIKTTSQLTREVYTDAIGTVHPNQAFASTTWYNNNAP
SFSAIEAAVIRSPHLLDFLEKVTIYSLLSRWSNTQYMNMWGGHRLESRPIGGALNTSTQGSTNTSINPVTLQFTSRDVYR
TESLAGLNLFLTQPVNGVPRVDFHWKFPTLPIASDNFYYLGYAGVGTQLQDSENELPPETTGQPNYESYSHRLSHIGLIS
ASHVKALVYSWTHRSADRTNTIEPNSITQIPLVKAFNLSSGAAVVRGPGFTGGDILRRTNTGTFGDIRVNINPPFAQRYR
VRIRYASTTDLQFHTSINGKAINQGNFSATMNRGEDLDYKTFRTIGFTTPFSFSDVQSTFTIGAWNFSSGNEVYIDRIEF
VPVEVTYEAEYDFEKAQEKVTALFTSTNPRGLKTDVKDYHIDQVSNLVESLSDEFYLDEKRELFEIVKYAKQIHIERNM
>O87404 ~~~cry1Ic~~~Pesticidal crystal protein Cry1Ic~~~
MKLKNPDKHQTLSSNAKVDKIATDSLKNETDIELKNMNNEDYLRMSEHESIDPFVSASTIQTGIGIAGKILGTLGVPFPG
QIASLYSFILGELWPKGKSQWEIFMEHVEAIINRKISTYARNKALTDLKGLGDALAVYHESLESWVGNRNNTRARSVVKN
QYIALELMFVQKLPSFAVSGEEVPLLPIYAQAANLHLLLLRDASIFEKNGGLSASEISTFYNRQVERTRDYSYHCVKWNN
TGLNNLRATNGQSWVRYNQFRKDIELMVLDLVRVFPSYDTLVYPIKTTSQLTREVYTDAIGTVDPNQALRSTTWYNNNAP
SFSAIEAAVIRSPHLLDFLEKVTIYSLLSRWSNTQYMNMWGGHRLESRPIGGALNTSTQGSTNTSINPVTLQFTSRDFYR
TESWAGLNLFLTQPVNGVPRVDFHWKFPTLPIASDNFYYLGYAGVGTQLQDSENELPPETTGQPNYESYSHRLSHIGLIS
GSHVKALVYSWTHRSADRTNTIEPNSITQIPLVKAFNLSSGAAVVRGPGFTGGHILRRTKSGTFGHIRVNINPPFAQRYR
VRMSYASTTDLQFHTSINGKAINQGNFSATMNRGEDLDYKTFRTVGFTTPFSFSDVQSTFTIGAWNFSSGNEVYIGRIEF
VPVEVTYEAEYDFEKAQEKVTALFTSTNPRGLKTDVKDYHIDQVSNLVESLSDELYLDEKRELFEIVKYAKQIHIERNM
>Q9XDL1 ~~~cry1Id~~~Pesticidal crystal protein Cry1Id~~~
MKSKNQNMYRSFSSNATVDKSFTDPLEHNTNMELQNSNHEDCLKMSEYESVEPFVSVSTIQTGIGIAGKILGNLGVPFAG
QVASLYSFILGELWPKGKSQWEIFMEHVEELINQKISTYARNKALADLKGLGDALAVYHESLESWIENRNNTRVRSVVKN
QYIALELMFVQKLPSFAVSGEEVPLLPIYAQAANLHLLLLRDASIFGKEWGLSESEISTFYNRQSSQTQEYSDYCSEWYN
TGLNRLRGTNAESWVRYNQFRRDMTLMVLDLVALFPSYDTRMYPIPTSAQLTREVYTDAIGTVHPNASFASTTWYNNNAP
SFSTIEAAVVRNPHLLDFLEQVTIYSLLSRWSNTQYMNMWGGHKLEFRTIGGTLNTSTQGSTNTSINPVTLPFTSRDVYR
TESLAGLNLFLTQPVNGVPRVDFHWKFVTHPIASDNFYYPGYAGIGTQLQDSENELPPETTGQPNYESYSHRLSHIGLIS
ASHVKALVYSWTHRSADRTNTINSDSITQIPLVKAFNLPSGASVVRGPGFTGGDILQRTNTGTFGDIRVNINPPFAQRYR
LRIRYASTTNLEFHTSINGKAINQGNFSATMNRGEDLDYKAFRTVGFTTPFSFSNAQSTFTIGAWNFSLGNEVYIDRIEF
VPVEVTYEAEYDLKKAQDEITAMFTSTNLRRLKTNVTDCHIDQVSNLVESLSDEFYLDEKRELFEIVKYAKQLNIERNM
>Q45738 ~~~cry1Ja~~~Pesticidal crystal protein Cry1Ja~~~
MEINNQKQCIPYNCLSNPEEVLLDGERILPDIDPLEVSLSLLQFLLNNFVPGGGFISGLVDKIWGALRPSEWDLFLAQIE
RLIDQRIEATVRAKAITELEGLGRNYQIYAEAFKEWESDPDNEAAKSRVIDRFRILDGLIEANIPSFRIIGFEVPLLSVY
VQAANLHLALLRDSVIFGERWGLTTKNVNDIYNRQIREIHEYSNHCVDTYNTELERLGFRSIAQWRIYNQFRRELTLTVL
DIVALFPNYDSRLYPIQTFSQLTREIVTSPVSEFYYGVINSGNIIGTLTEQQIRRPHLMDFFNSMIMYTSDNRREHYWSG
LEMTAYFTGFAGAQVSFPLVGTRGESAPPLTVRSVNDGIYRILSAPFYSAPFLGTIVLGSRGEKFDFALNNISPPPSTIY
RHPGTVDSLVSIPPQDNSVPPHRGSSHRLSHVTMRASSPIFHWTHRSATTTNTINPNAIIQIPLVKAFNLHSGATVVRGP
GFTGGDILRRTNTGTFADMRVNITGPLSQRYRVRIRYASTTDLQFFTRINGTSVNQGNFQRTMNRGDNLESGNFRTAGFS
TPFSFSNAQSTFTLGTQAFSNQEVYIDRIEFVPAEVTFEAESDLERAQKAVNALFTSTNQLGLKTDVTDYQIDQVSNLVE
CLSDEFCLDEKRELSEKVKHAKRLSDKRNLLQDPNFTSINRQLDRGWRGSTDITIQGGNDVFKENYVTLPGTFDECYPTY
LYQKIDESKLKAYTRYELRGYIEDSQDLEVYLIRYNAKHETVNVPGTGSLWPLSVESPIGRCGEPNRCVPHIEWNPDLDC
SCRDGEKCAHHSHHFSLDIDVGCTDLNEDLGVWVIFKIKTQDGHARLGNLEFLEEKPLLGEALARVKRAEKKWRDKREQL
QFETNIVYKEAKESVDALFVDSHYNRLQADTNITMIHAADKRVHRIREAYLPELSVIPGVNADIFEELEGLIFTAFSLYD
ARNIIKNGDFNNGLSCWNVKGHVDIQQNDHRSVLVVPEWESEVSQEVRVCPGRGYILRVTAYKEGYGEGCVTIHEIEDNT
DELKFSNCIEEEVYPTDTGNDYTAHQGTTGCADACNSRNVGYEDGYEINTTASVNYKPTYEEEMYTDVRRDNHCEYDRGY
GNHTPLPAGYVTKELEYFPETDTVWIEIGETEGTFIVDSVELLLMEE
>Q45716 ~~~cry1Jb~~~Pesticidal crystal protein Cry1Jb~~~
MEINNQNQCIPYNCLSNPEEVLLDGERILPDIDPLEVSMSLLQFLLNNFVPGGGFISGLFDKIWGALRPSDWELFLAQIE
QLIDQRIEATVRAKAIAELEGLGRSFQLYVEAFKEWEETPDNTAARSRVTERFRIIDAQIEANIPSFRIPGFEVPLLSVY
AQAANLHLALLRDSVIFGERWGLTTTNVNDIYNRQVKRIHEYSDHCVDTYKTELERLGFTSRAQWKIYNQFRRELTLTVL
DIVAVFPNYDGKLYPIQTKSELTREIYTSPVSEYYYGAINNYNQNGIQTERQIRQPHLMDFFNTMTMYTSYNRREYYWSG
LEMTAYFTGFAGPQVSFPLAGTRGDAAPPFNVRSVNDGIYRILSAPFYSAPFLGTSVLGSRGEEFMFALNNISPPPSARY
RNPGTVDSLVSIPPQDNSVPPHRGSSHRLSHVTMRNSSPIFHWTHRSATTTNRINSDVITQIPMVKAYNLHAGATVVRGP
GFTGGDILRRTSNGMVVTLRVDASAVRNQRYRIRFRYAATSNFYFVVRRGNLGVNGREIMKTMSTGEELKSASFVLGEFI
TPFNFFENQVPLQIEIQSLSPGGEVYLDKIEFIPADTTFEAEYDLERAQKAVNALFTSTNQRGLKTDVTDYHIDQVSNLV
ECLSDEFCLDEKRELSEKVKHAKRLSDERNLLQDPNFTSINGQLDRGWRGSTDITIQGGNDVFKENYVTLPGTFDECYPT
YLYQKIDESKLKAYTRYELRGYIEDSQDLEVYLIRYNAKHETLNVPGTDSLRTLSVESQNGRCGELNRCMPHIKWNPDVD
CSCRDGEKCAHHSHHFSLDIDVGCTDLQEDLGVWVVFKIKTQEGYARLGNLEFIEEKPLVGEALSRVKRAEKKWRDKREK
LELETKRVYTEAKEAVDALFVDSQYDRLQADTNIGMIHAADKLVHRICETYLPELPFIPGINAIIFEELENRISTAFFLY
EARNVINNGDFNNGLTCWNVKGHVDVQQSHHRSVLVIPEWEAEVSQKVRVCPGRGYILRVTAYKEGYGEGCVTIHEIEDN
TDELKFRNCEEEGDYSNDTGTCNDYPASQGAAGCADVCNSRNVGYKDAYETNTSASVNYKPTYEEETYTDVREDNHCEYD
RGYVNYPPLPAGYVTKELEYFPETDTVWIEIGETEGKFIVDSVELLLMEE
>Q45715 ~~~cry1Ka~~~Pesticidal crystal protein Cry1Ka~~~
MNSNRKNENEIINALSIPAVSNHSAQMDLSPDARIEDSLCVAEGNNIDPFVSASTVQTGISIAGRILGVLGVPFAGQLAS
FYSFLVGELWPSGRDPWEIFMEHVEQIVRQQQITDSVRDTAIARLEGLGRGYRSYQQALETWLDNRNDARSRSIIRERYI
ALELDITTAIPLFSIRNEEVPLLMVYAQAANLHLLLLRDASLFGSEWGMSSADVNQYYQEQIRYTEEYSNHCVQWYNTGL
NRLRGTTAETWVRYNQFRRDLTLGVLDLVALFPSYDTRTYPIPTTAQLTREVYTDPNGVVAGPNNSWFRNGASFSAIENA
IIRQPHLYDFLTNLTIYTRRSQVGTTIMNLWAGHRITFNRIQGGSTSEMVYGAITNPVSVSDIPFVNRDVYRTVSLAGGL
GSLSGIRYGLTRVDFDMIFRNHPDIVTGLFYHPGHAGIATQVKDSDTELPPETTEQPNYRAFSHLLSHISMGPTTQDVPP
VYSWTHQSADRTNTINSDRITQIPLVKAHTLQSGTTVVKGPGFTGGDILRRTSGGPFAFSNVNLDFNLSQRYRARIRYAS
TTNLRIYVTVAGERIFAGQFDKTMDAGAPLTFQSFSYATINTAFTFPERSSSLTIGADTFSSGNEVYVDRFELIQVTATF
EAESDLERARKAVNALFTSTNPRGLKTDVTDYHIDQVSNLVECLSDEFCLDKKRELLEEVKYAKRLSDERNLLQDPTFTS
ISGQTDRGWIGSTGISIQGGDDIFKENYVRLPGTVDECYPTYLYQKIDESQLKSYTRYQLRGYIEDSQDLEIYLIRYNAK
HETLSVPGTESPWPSSGVYPSGRCGEPNRCAPRIEWNPDLDCSCRYGEKCVHHSHHFSLDIDVGCTDLNEDLGVWVIFKI
KTQDGHAKLGNLEFIEEKPLLGKALSRVKRAEKKWRDKYEKLQLETKRVYTEAKESVDALFVDSQYDKLQANTNIGIIHG
ADKQVHRIREPYLSELPVIPSINAAIFEELEGHIFKAYSLYDARNVIKNGDFNNGLSCWNVKGHVDVQQNHHRSVLVLSE
WEAEVSQKVRVCPDRGYILRVTAYKEGYGEGCVTIHEFEDNTDVLKFRNFVEEEVYPNNTVTCNDYTTNQSAEGSTDACN
SYNRGYEDGYENRYEPNPSAPVNYTPTYEEGMYTDTQGYNHCVSDRGYRNHTPLPAGYVTLELEYFPETEQVWIEIGETE
GTFIVGSVELLLMEE
>P0A377 ~~~cry2Aa~~~Pesticidal crystal protein Cry2Aa~~~
MNNVLNSGRTTICDAYNVVAHDPFSFEHKSLDTIQKEWMEWKRTDHSLYVAPVVGTVSSFLLKKVGSLIGKRILSELWGI
IFPSGSTNLMQDILRETEQFLNQRLNTDTLARVNAELIGLQANIREFNQQVDNFLNPTQNPVPLSITSSVNTMQQLFLNR
LPQFQIQGYQLLLLPLFAQAANMHLSFIRDVILNADEWGISAATLRTYRDYLRNYTRDYSNYCINTYQTAFRGLNTRLHD
MLEFRTYMFLNVFEYVSIWSLFKYQSLMVSSGANLYASGSGPQQTQSFTAQNWPFLYSLFQVNSNYILSGISGTRLSITF
PNIGGLPGSTTTHSLNSARVNYSGGVSSGLIGATNLNHNFNCSTVLPPLSTPFVRSWLDSGTDREGVATSTNWQTESFQT
TLSLRCGAFSARGNSNYFPDYFIRNISGVPLVIRNEDLTRPLHYNQIRNIESPSGTPGGARAYLVSVHNRKNNIYAANEN
GTMIHLAPEDYTGFTISPIHATQVNNQTRTFISEKFGNQGDSLRFEQSNTTARYTLRGNGNSYNLYLRVSSIGNSTIRVT
INGRVYTVSNVNTTTNNDGVNDNGARFSDINIGNIVASDNTNVTLDINVTLNSGTPFDLMNIMFVPTNLPPLY
>P21254 ~~~cry2Ab~~~Pesticidal crystal protein Cry2Ab~~~
MNSVLNSGRTTICDAYNVAAHDPFSFQHKSLDTVQKEWTEWKKNNHSLYLDPIVGTVASFLLKKVGSLVGKRILSELRNL
IFPSGSTNLMQDILRETEKFLNQRLNTDTLARVNAELTGLQANVEEFNRQVDNFLNPNRNAVPLSITSSVNTMQQLFLNR
LPQFQMQGYQLLLLPLFAQAANLHLSFIRDVILNADEWGISAATLRTYRDYLKNYTRDYSNYCINTYQSAFKGLNTRLHD
MLEFRTYMFLNVFEYVSIWSLFKYQSLLVSSGANLYASGSGPQQTQSFTSQDWPFLYSLFQVNSNYVLNGFSGARLSNTF
PNIVGLPGSTTTHALLAARVNYSGGISSGDIGASPFNQNFNCSTFLPPLLTPFVRSWLDSGSDREGVATVTNWQTESFET
TLGLRSGAFTARGNSNYFPDYFIRNISGVPLVVRNEDLRRPLHYNEIRNIASPSGTPGGARAYMVSVHNRKNNIHAVHEN
GSMIHLAPNDYTGFTISPIHATQVNNQTRTFISEKFGNQGDSLRFEQNNTTARYTLRGNGNSYNLYLRVSSIGNSTIRVT
INGRVYTATNVNTTTNNDGVNDNGARFSDINIGNVVASSNSDVPLDINVTLNSGTQFDLMNIMLVPTNISPLY
>Q45743 ~~~cry2Ac~~~Pesticidal crystal protein Cry2Ac~~~
MNTVLNNGRNTTCHAHNVVAHDPFSFEHKSLNTIEKEWKEWKRTDHSLYVAPIVGTVGSFLLKKVGSLVGKRILSELQNL
IFPSGSIDLMQEILRATEQFINQRLNADTLGRVNAELAGLQANVAEFNRQVDNFLNPNQNPVPLAIIDSVNTLQQLFLSR
LPQFQIQGYQLLLLPLFAQAANFNLSFIRGVILNADEWGISAATVRTYRDHLRKFHRDYSNYCINPYQTAFRGLNHRLPD
MLEFRTYMFLNVFEYVSIWSLFKYQSLLVSSGANLYASGSGPTQSFTAQNWPFLYSLFQVNSNYVLNGLSGARTTITFPN
IGGLPVYHNSTLHFARINYRGGVSSSRIGQANLNQNFNISTLFNPLQTPFIRSWLDSGTDREGVATSTNWQSGAFETTLL
RFSIFSARGNSNFFPDYFIRNISGVVGTISNADLARPLHFNEIRDIGTTAVASLVTVHNRKNNIYDTHENGTMIHLAPND
YTGFTVSPIHATQVNNQIRTFISEKYGNQGDSLRFELSNPTARYTLRGNGNSYNLYLRVSSIGSSTIRVTINGRVYTANV
NTTTNNDGVLDNGARFSDINIGNVVASANTNVPLDIQVTFNGNPQFELMNIMFVPTNLPPLY
>Q9RMG3 ~~~cry2Ad~~~Pesticidal crystal protein Cry2Ad~~~
MNSVLNSGRNTICDAYNVVVHDPFSFQHKSLDTIQKEWMEWKKDNHSLYVDPIVGTVASFLLKKLGSLIGKRILSELRNL
IFPSGSTNLMEDILRETEKFLNQKLNTDTLSRVNAELTGLQANVEEFNRQVDNFLNPNRNAVPLSITSSVNTMQQLFLNR
LSQFQMQGYQLLLLPLFAQAANLHLSFIRDVILNAEEWGISAATLRTYQNHLRNYTRDYSNYCIDTYQTAFRGLNTRLHD
MLEFRTYMFLNVFEYVSIWSLFKYQSLLVSSGANLYASGSGPQQTQLFTSQDWPFLYSLFQVNSNYVLSGFSGASLFTTF
PNIGGLPGSTTTQALLAARVNYSGGITSGSIGGSNFNQNFNCNTISPPLSTSFVRSWLDSGSDRQGVNTVTNWQTESFET
TSGLRCGAFTPRGNSNYYPGYFIRNISGVSLVLRNEDLKRPLYYNEKRNIESPSGTPGGARAYMVSVHNKKNNIYAVHEN
GTMIHLAPEDNTGFTISPIHATQVNNQTRTFISEKFGNQGDSLRFEQSNTTARYTLRGNGNSYNLYLRVSSIGNSTIRVT
INGRVYTASNVNTTTNNDGVNDNGARFSDINIGNVVASSNSDVPLDINVTLNSGTQFDLMNIMLVPTNISPLY
>P0A381 ~~~cry3Aa~~~Pesticidal crystal protein Cry3Aa~~~
MNPNNRSEHDTIKTTENNEVPTNHVQYPLAETPNPTLEDLNYKEFLRMTADNNTEALDSSTTKDVIQKGISVVGDLLGVV
GFPFGGALVSFYTNFLNTIWPSEDPWKAFMEQVEALMDQKIADYAKNKALAELQGLQNNVEDYVSALSSWQKNPVSSRNP
HSQGRIRELFSQAESHFRNSMPSFAISGYEVLFLTTYAQAANTHLFLLKDAQIYGEEWGYEKEDIAEFYKRQLKLTQEYT
DHCVKWYNVGLDKLRGSSYESWVNFNRYRREMTLTVLDLIALFPLYDVRLYPKEVKTELTRDVLTDPIVGVNNLRGYGTT
FSNIENYIRKPHLFDYLHRIQFHTRFQPGYYGNDSFNYWSGNYVSTRPSIGSNDIITSPFYGNKSSEPVQNLEFNGEKVY
RAVANTNLAVWPSAVYSGVTKVEFSQYNDQTDEASTQTYDSKRNVGAVSWDSIDQLPPETTDEPLEKGYSHQLNYVMCFL
MQGSRGTIPVLTWTHKSVDFFNMIDSKKITQLPLVKAYKLQSGASVVAGPRFTGGDIIQCTENGSAATIYVTPDVSYSQK
YRARIHYASTSQITFTLSLDGAPFNQYYFDKTINKGDTLTYNSFNLASFSTPFELSGNNLQIGVTGLSAGDKVYIDKIEF
IPVN
>P0A380 ~~~cry3Aa~~~Pesticidal crystal protein Cry3Aa~~~
MNPNNRSEHDTIKTTENNEVPTNHVQYPLAETPNPTLEDLNYKEFLRMTADNNTEALDSSTTKDVIQKGISVVGDLLGVV
GFPFGGALVSFYTNFLNTIWPSEDPWKAFMEQVEALMDQKIADYAKNKALAELQGLQNNVEDYVSALSSWQKNPVSSRNP
HSQGRIRELFSQAESHFRNSMPSFAISGYEVLFLTTYAQAANTHLFLLKDAQIYGEEWGYEKEDIAEFYKRQLKLTQEYT
DHCVKWYNVGLDKLRGSSYESWVNFNRYRREMTLTVLDLIALFPLYDVRLYPKEVKTELTRDVLTDPIVGVNNLRGYGTT
FSNIENYIRKPHLFDYLHRIQFHTRFQPGYYGNDSFNYWSGNYVSTRPSIGSNDIITSPFYGNKSSEPVQNLEFNGEKVY
RAVANTNLAVWPSAVYSGVTKVEFSQYNDQTDEASTQTYDSKRNVGAVSWDSIDQLPPETTDEPLEKGYSHQLNYVMCFL
MQGSRGTIPVLTWTHKSVDFFNMIDSKKITQLPLVKAYKLQSGASVVAGPRFTGGDIIQCTENGSAATIYVTPDVSYSQK
YRARIHYASTSQITFTLSLDGAPFNQYYFDKTINKGDTLTYNSFNLASFSTPFELSGNNLQIGVTGLSAGDKVYIDKIEF
IPVN
>P0A379 ~~~cry3Aa~~~Pesticidal crystal protein Cry3Aa~~~
MNPNNRSEHDTIKTTENNEVPTNHVQYPLAETPNPTLEDLNYKEFLRMTADNNTEALDSSTTKDVIQKGISVVGDLLGVV
GFPFGGALVSFYTNFLNTIWPSEDPWKAFMEQVEALMDQKIADYAKNKALAELQGLQNNVEDYVSALSSWQKNPVSSRNP
HSQGRIRELFSQAESHFRNSMPSFAISGYEVLFLTTYAQAANTHLFLLKDAQIYGEEWGYEKEDIAEFYKRQLKLTQEYT
DHCVKWYNVGLDKLRGSSYESWVNFNRYRREMTLTVLDLIALFPLYDVRLYPKEVKTELTRDVLTDPIVGVNNLRGYGTT
FSNIENYIRKPHLFDYLHRIQFHTRFQPGYYGNDSFNYWSGNYVSTRPSIGSNDIITSPFYGNKSSEPVQNLEFNGEKVY
RAVANTNLAVWPSAVYSGVTKVEFSQYNDQTDEASTQTYDSKRNVGAVSWDSIDQLPPETTDEPLEKGYSHQLNYVMCFL
MQGSRGTIPVLTWTHKSVDFFNMIDSKKITQLPLVKAYKLQSGASVVAGPRFTGGDIIQCTENGSAATIYVTPDVSYSQK
YRARIHYASTSQITFTLSLDGAPFNQYYFDKTINKGDTLTYNSFNLASFSTPFELSGNNLQIGVTGLSAGDKVYIDKIEF
IPVN
>P17969 ~~~cry3Ba~~~Pesticidal crystal protein Cry3Ba~~~
MIRMGGRKMNPNNRSEYDTIKVTPNSELPTNHNQYPLADNPNSTLEELNYKEFLRMTADNSTEVLDSSTVKDAVGTGISV
VGQILGVVGVPFAGALTSFYQSFLNAIWPSDADPWKAFMAQVEVLIDKKIEEYAKSKALAELQGLQNNFEDYVNALDSWK
KAPVNLRSRRSQDRIRELFSQAESHFRNSMPSFAVSKFEVLFLPTYAQAANTHLLLLKDAQVFGEEWGYSSEDIAEFYQR
QLKLTQQYTDHCVNWYNVGLNSLRGSTYDAWVKFNRFRREMTLTVLDLIVLFPFYDVRLYSKGVKTELTRDIFTDPIFTL
NALQEYGPTFSSIENSIRKPHLFDYLRGIEFHTRLRPGYSGKDSFNYWSGNYVETRPSIGSNDTITSPFYGDKSIEPIQK
LSFDGQKVYRTIANTDIAAFPDGKIYFGVTKVDFSQYDDQKNETSTQTYDSKRYNGYLGAQDSIDQLPPETTDEPLEKAY
SHQLNYAECFLMQDRRGTIPFFTWTHRSVDFFNTIDAEKITQLPVVKAYALSSGASIIEGPGFTGGNLLFLKESSNSIAK
FKVTLNSAALLQRYRVRIRYASTTNLRLFVQNSNNDFLVIYINKTMNIDGDLTYQTFDFATSNSNMGFSGDTNDFIIGAE
SFVSNEKIYIDKIEFIPVQ
>Q06117 ~~~cry3Bb~~~Pesticidal crystal protein Cry3Bb~~~
MNPNNRSEHDTIKVTPNSELQTNHNQYPLADNPNSTLEELNYKEFLRMTEDSSTEVLDNSTVKDAVGTGISVVGQILGVV
GVPFAGALTSFYQSFLNTIWPSDADPWKAFMAQVEVLIDKKIEEYAKSKALAELQGLQNNFEDYVNALNSWKKTPLSLRS
KRSQDRIRELFSQAESHFRNSMPSFAVSKFEVLFLPTYAQAANTHLLLLKDAQVFGEEWGYSSEDVAEFYHRQLKLTQQY
TDHCVNWYNVGLNGLRGSTYDAWVKFNRFRREMTLTVLDLIVLFPFYDIRLYSKGVKTELTRDIFTDPIFSLNTLQEYGP
TFLSIENSIRKPHLFDYLQGIEFHTRLQPGYFGKDSFNYWSGNYVETRPSIGSSKTITSPFYGDKSTEPVQKLSFDGQKV
YRTIANTDVAAWPNGKVYLGVTKVDFSQYDDQKNETSTQTYDSKRNNGHVSAQDSIDQLPPETTDEPLEKAYSHQLNYAE
CFLMQDRRGTIPFFTWTHRSVDFFNTIDAEKITQLPVVKAYALSSGASIIEGPGFTGGNLLFLKESSNSIAKFKVTLNSA
ALLQRYRVRIRYASTTNLRLFVQNSNNDFLVIYINKTMNKDDDLTYQTFDLATTNSNMGFSGDKNELIIGAESFVSNEKI
YIDKIEFIPVQL
>Q45744 ~~~cry3Ca~~~Pesticidal crystal protein Cry3Ca~~~
MNPNNRSEHDTIKATENNEVSNNHAQYPLADTPTLEELNYKEFLRRTTDNNVEALDSSTTKDAIQKGISIIGDLLGVVGF
PYGGALVSFYTNLLNTIWPGEDPLKAFMQQVEALIDQKIADYAKDKATAELQGLKNVFKDYVSALDSWDKTPLTLRDGRS
QGRIRELFSQAESHFRRSMPSFAVSGYEVLFLPTYAQAANTHLLLLKDAQIYGTDWGYSTDDLNEFHTKQKDLTIEYTNH
CAKWYKAGLDKLRGSTYEEWVKFNRYRREMTLTVLDLITLFPLYDVRTYTKGVKTELTRDVLTDPIVAVNNMNGYGTTFS
NIENYIRKPHLFDYLHAIQFHSRLQPGYFGTDSFNYWSGNYVSTRSSIGSDEIIRSPFYGNKSTLDVQNLEFNGEKVFRA
VANGNLAVWPVGTGGTKIHSGVTKVQFSQYNDRKDEVRTQTYDSKRNVGGIVFDSIDQLPPITTDESLEKAYSHQLNYVR
CFLLQGGRGIIPVFTWTHKSVDFYNTLDSEKITQIPFVKAFILVNSTSVVAGPGFTGGDIIKCTNGSGLTLYVTPAPDLT
YSKTYKIRIRYASTSQVRFGIDLGSYTHSISYFDKTMDKGNTLTYNSFNLSSVSRPIEISGGNKIGVSVGGIGSGDEVYI
DKIEFIPMD
>P16480 ~~~cry4Aa~~~Pesticidal crystal protein Cry4Aa~~~
MNPYQNKNEYETLNASQKKLNISNNYTRYPIENSPKQLLQSTNYKDWLNMCQQNQQYGGDFETFIDSGELSAYTIVVGTV
LTGFGFTTPLGLALIGFGTLIPVLFPAQDQSNTWSDFITQTKNIIKKEIASTYISNANKILNRSFNVISTYHNHLKTWEN
NPNPQNTQDVRTQIQLVHYHFQNVIPELVNSCPPNPSDCDYYNILVLSSYAQAANLHLTVLNQAVKFEAYLKNNRQFDYL
EPLPTAIDYYPVLTKAIEDYTNYCVTTYKKGLNLIKTTPDSNLDGNINWNTYNTYRTKMTTAVLDLVALFPNYDVGKYPI
GVQSELTREIYQVLNFEESPYKYYDFQYQEDSLTRRPHLFTWLDSLNFYEKAQTTPNNFFTSHYNMFHYTLDNISQKSSV
FGNHNVTDKLKSLGLATNIYIFLLNVISLDNKYLNDYNNISKMDFFITNGTRLLEKELTAGSGQITYDVNKNIFGLPILK
RRENQGNPTLFPTYDNYSHILSFIKSLSIPATYKTQVYTFAWTHSSVDPKNTIYTHLTTQIPAVKANSLGTASKVVQGPG
HTGGDLIDFKDHFKITCQHSNFQQSYFIRIRYASNGSANTRAVINLSIPGVAELGMALNPTFSGTDYTNLKYKDFQYLEF
SNEVKFAPNQNISLVFNRSDVYTNTTVLIDKIEFLPITRSIREDREKQKLETVQQIINTFYANPIKNTLQSELTDYDIDQ
AANLVECISEELYPKEKMLLLDEVKNAKQLSQSRNVLQNGDFESATLGWTTSDNITIQEDDPIFKGHYLHMSGARDIDGT
IFPTYIFQKIDESKLKPYTRYLVRGFVGSSKDVELVVSRYGEEIDAIMNVPADLNYLYPSTFDCEGSNRCETSAVPANIG
NTSDMLYSCQYDTGKKHVVCQDSHQFSFTIDTGALDTNENIGVWVMFKISSPDGYASLDNLEVIEEGPIDGEALSRVKHM
EKKWNDQMEAKRSETQQAYDVAKQAIDALFTNVQDEALQFDTTLAQIQYAEYLVQSIPYVYNDWLSDVPGMNYDIYVELD
ARVAQARYLYDTRNIIKNGDFTQGVMGWHVTGNADVQQIDGVSVLVLSNWSAGVSQNVHLQHNHGYVLRVIAKKEGPGNG
YVTLMDCEENQEKLTFTSCEEGYITKTVDVFPDTDRVRIEIGETEGSFYIESIELICMNE
>P84613 ~~~~~~Insecticidal crystal toxin protein~~~
MDFFITNGTRLLEKELTAGSGQITYDVNKNIFGLPILKRRENQGNPTLFPTYDNYSHILSFIKSLSIPATYKTQVYTFAW
THSSVDPKNTIYTHLTTQIPAVKANSLGTASKVVQGPGHTGGDLIDFKDHFKITCQHSNFQQSYFIRIRYASNGSANTRA
VINLSIPGVAELGMALNPTFSGTDYTNLKYKDFQYLEFSNEVKFAPNQNISLVFNRSDVYTNTTVLIDKIEFLPITRSIR
EDREKQKLETVQQIINTFYANPIKNTLQSELTDYDIDQAAN
>P05519 ~~~cry4Ba~~~Pesticidal crystal protein Cry4Ba~~~
MNSGYPLANDLQGSMKNTNYKDWLAMCENNQQYGVNPAAINSSSVSTALKVAGAILKFVNPPAGTVLTVLSAVLPILWPT
NTPTPERVWNDFMTNTGNLIDQTVTAYVRTDANAKMTVVKDYLDQYTTKFNTWKREPNNQSYRTAVITQFNLTSAKLRET
AVYFSNLVGYELLLLPIYAQVANFNLLLIRDGLINAQEWSLARSAGDQLYNTMVQYTKEYIAHSITWYNKGLDVLRNKSN
GQWITFNDYKREMTIQVLDILALFASYDPRRYPADKIDNTKLSKTEFTREIYTALVESPSSKSIAALEAALTRDVHLFTW
LKRVDFWTNTIYQDLRFLSANKIGFSYTNSSAMQESGIYGSSGFGSNLTHQIQLNSNVYKTSITDTSSPSNRVTKMDFYK
IDGTLASYNSNITPTPEGLRTTFFGFSTNENTPNQPTVNDYTHILSYIKTDVIDYNSNRVSFAWTHKIVDPNNQIYTDAI
TQVPAVKSNFLNATAKVIKGPGHTGGDLVALTSNGTLSGRMEIQCKTSIFNDPTRSYGLRIRYAANSPIVLNVSYVLQGV
SRGTTISTESTFSRPNNIIPTDLKYEEFRYKDPFDAIVPMRLSSNQLITIAIQPLNMTSNNQVIIDRIEIIPITQSVLDE
TENQNLESEREVVNALFTNDAKDALNIGTTDYDIDQAANLVECISEELYPKEKMLLLDEVKNAKQLSQSRNVLQNGDFES
ATLGWTTSDNITIQEDDPIFKGHYLHMSGARDIDGTIFPTYIFQKIDESKLKPYTRYLVRGFVGSSKDVELVVSRYGEEI
DAIMNVPADLNYLYPSTFDCEGSNRCETSAVPANIGNTSDMLYSCQYDTGKKHVVCQDSHQFSFTIDTGALDTNENIGVW
VMFKISSPDGYASLDNLEVIEEGPIDGEALSRVKHMEKKWNDQMEAKRSETQQAYDVAKQAIDALFTNVQDEALQFDTTL
AQIQYAEYLVQSIPYVYNDWLSDVPGMNYDIYVELDARVAQARYLYDTRNIIKNGDFTQGVMGWHVTGNADVQQIDGVSV
LVLSNWSAGVSQNVHLQHNHGYVLRVIAKKEGPGNGYVTLMDCEENQEKLTFTSCEEGYITKTVDVFPDTDRVRIEIGET
EGSFYIESIELICMNE
>Q45760 ~~~cry5Aa~~~Pesticidal crystal protein Cry5Aa~~~
MAILNELYPSVPYNVLAYTPPSFLPDAGTQATPADLTAYEQLLKNLEKGINAGTYSKAIADVLKGIFIDDTINYQTYVNI
GLSLITLAVPEIGIFTPFIGLFFAALNKHDAPPPPNAKDIFEAMKPAIQEMIDRTLTADEQTFLNGEISGLQNLAARYQS
TMDDIQSHGGFNKVDSGLIKKFTDEVLSLNSFYTDRLPVFITDNTADRTLLGLPYYAILASMHLMLLRDIITKGPTWDSK
INFTPDAIDSFKTDIKNNIKLYSKTIYDVFQKGLASYGTPSDLESFAKKQKYIEIMTTHCLDFARLFPTFDPDLYPTGSG
DISLQKTRRILSPFIPIRTADGLTLNNTSIDTSNWPNYENGNGAFPNPKERILKQFKLYPSWRAGQYGGLLQPYLWAIEV
QDSVETRLYGQLPAVDPQAGPNYVSIDSSNPIIQINMDTWKTPPQGASGWNTNLMRGSVSGLSFLQRDGTRLSAGMGGGF
ADTIYSLPATHYLSYLYGTPYQTSDNYSGHVGALVGVSTPQEATLPNIIGQPDEQGNVSTMGFPFEKASYGGTVVKEWLN
GANAMKLSPGQSIGIPITNVTSGEYQIRCRYASNDNTNVFFNVDTGGANPIFQQINFASTVDNNTGVQGANGVYVVKSIA
TTDNSFTEIPAKTINVHLTNQGSSDVFLDRIEFIPFSLPLIYHGSYNTSSGADDVLWSSSNMNYYDIIVNGQANSSSIAS
SMHLLNKGKVIKTIDIPGHSETFFATFPVPEGFNEVRILAGLPEVSGNITVQSNNPPQPSNNGGGDGGGNGGGDGGQYNF
SLSGSDHTTIYHGKLETGIHVQGNYTYTGTPVLILNAYRNNTVVSSIPVYSPFDITIQTEADSLELELQPRYGFATVNGT
ATVKSPNVNYDRSFKLPIDLQNITTQVNALFASGTQNMLAHNVSDHDIEEVVLKVDALSDEVFGDEKKALRKLVNQAKRL
SRARNLLIGGSFENWDAWYKGRNVVTVSDHELFKSDHVLLPPPGLSPSYIFQKVEESKLKPNTRYIVSGFIAHGKDLEIV
VSRYGQEVQKVVQVPYGEAFPLTSNGPVCCPPRSTSNGTLGDPHFFSYSIDVGALDLQANPGIEFGLRIVNPTGMARVSN
LEIREDRPLAANEIRQVQRVARNWRTEYEKERAEVTSLIQPVINRINGLYENGNWNGSIRSDISYQNIDAIVLPTLPKLR
HWFMSDRFSEQGDIMAKFQGALNRAYAQLEQSTLLHNGHFTKDAANWTIEGDAHQITLEDGRRVLRLPDWSSSVSQMIEI
ENFNPDKEYNLVFHGQGEGTVTLEHGEETKYIETHTHHFANFTTSQRQGLTFESNKVTVTISSEDGEFLVDNIALVEAPL
PTDDQNSEGNTASSTNSDTSMNNNQ
>Q45753 ~~~cry5Ab~~~Pesticidal crystal protein Cry5Ab~~~
MAILNELYPSVPYNVLAYTPPSFLPDAGTQATPADLTAYEQLLKNLEKGINAGTYSKAIADVLKGIFIDDTINYQTYVNI
GLSLITLAVPEIGIFTPFIGLFFAALNKHDAPPPPNAKDIFEAMKPAIQEMIDRTLTADEQTFLNGEISGLQNLAARYQS
TMDDIQSHGGFNKVDSGLIKKFTDEVLSLNSFYTDRLPVFITDNTADRTLLGLPYYAILASMHLMLLRDIITKGPTWDSK
INFTPDAIDSFKTDIKNNIKLYSKTIYDVFQKGLASYGTPSDLESFAKKQKYIEIMTTHCLDFARLFPTFDPDLYPTGSG
DISLQKTRRILSPFIPIRTADGLTLNNTSIDTSNWPNYENGNGAFPNPKERILKQFKLYPSWRAAQYGGLLQPYLWAIEV
QDSVETRLYGQLPAVDPQAGPNYVSIDSSNPIIQINMDTWKTPPQGASGWNTNLMRGSVSGLSFLQRDGTRLSAGMGGGF
ADTIYSLPATHYLSYLYGTPYQTSDNYSGHVGALVGVSTPQEATLPNIIGQPDEQGNVSTMGFPFEKASYGGTVVKEWLN
GANAMKLSPGQSIGIPITNVTSGEYQIRCRYASNDNTNVFFNVDTGGANPIFQQINFASTVDNNTGVQGANGVYVVKSIA
TTDNSFTVKIPAKTINVHLTNQGSSDVFLDRIEFVPILESNTVTIFNNSYTTGSANLIPAIAPLWSTSSDKALTGSMSIT
GRTTPNSDDALLRFFKTNYDTQTIPIPGSGKDFTNTLEIQDIVSIDIFVGSGLHGSDGSIKLDFTNNNSGSGGSPKSFTE
QNDLENITTQVNALFTSNTQDALATDVSDHDIEEVVLKVDALSDEVFGKEKKTLRKFVNQAKRLSKARNLLVGGNFDNLD
AWYRGRNVVNVSNHELLKSDHVLLPPPGLSPSYIFQKVEESKLKRNTRYTVSGFIAHATDLEIVVSRYGQEIKKVVQVPY
GEAFPLTSSGPVCCIPHSTSNGTLGNPHFFSYSIDVGALDVDTNPGIEFGLRIVNPTGMARVSNLEIREDRPLAANEIRQ
VQRVARNWRTEYEKERAEVTSLIQPVINRINGLYDNGNWNGSIRSDISYQNIDAIVLPTLPKLRHWFMSDRFSEQGDIMA
KFQGALNRAYAQLEQNTLLHNGHFTKDAANWTVEGDAHQVVLEDGKRVLRLPDWSSSVSQTIEIENFDPDKEYQLVFHGQ
GEGTVTLEHGEETKYIETHTHHFANFTTSQRQGLTFESNKVTVTISSEDGEFLVDNIALVEAPLPTDDQNSEGNTASSTN
SDTSMNNNQ
>P56955 ~~~cry5Ac~~~Pesticidal crystal protein Cry5Ac~~~
MAILNELYPSVPYNVLAYTPPSFLPDAGTQATPADLTAYEQLLKNLEKGINAGTYSKAIADVLKGIFIDDTINYQTYVNI
GLSLITLAVPEIGIFTPFIGLFFAALNKHDAPPPPNAKDIFEAMKPAIQEMIDRTLTADEQTFLNGEISGLQNLAARYQS
TMDDIQSHGGFNKVDSGLIKKFTDEVLSLNSFYTDRLPVFITDNTADRTLLGLPYYAILASMHLMLLRDIITKGPTWDSK
INFTPDAIDSFKTDIKNNIKLYSKTIYDVFQKGLASYGTPSDLESFAKKKKYIEIMTTHCLDFARLFPTFDPDLYPTGSG
DISLQKTRRILSPFIPIRTADGLTLNNTSIDTSNWPNYENGNGAFPNPKERILKQFKLYPSWRAGQYGGLLQPYLWAIEV
QDSVETRLYGQLPAVDPQAGPNYVSIDSSNPIIQINMDTWKTPPQGASGWNTNLMRGSVSGLSFLQRDGTRLSAGMGGGF
ADTIYSLPATHYLSYLYGTPYQTSDNYSGHVGALVGVSTPQEATLPNIIGQPDEQGNVSTMGFPFEKASYGGTVVKEWLN
GANAMKLSPGQSIGIPITNVTKHNYQVRCRYASNSDNPVFFNVDTGGANPIFQQINFASTVDSNMGVKEENGVYVVKSIK
TVEIPAGSFYVHVTNQGSSDLFLDRIEFVPKIQFQFCDNNNLHCDCNNPVDTDCTFCCVCTSLTDCDCNNPRGIDCTLCC
QVENQLPSFVTLTDLRNITSQVNGLFAPGTQNRLAQNISDHDIEEVVLKVDALSDEIFGTNKKALRKLVNQAKRLSRARN
LLIGGSFENWDAWYKGRNVVTVSDHELFKSDHVLLPPPGLSPSYIFQKVEESKLKANTRYTVSGFIAHATDLEIVVSRYG
QEIKKVVQVPYGEAFPLTSSGPVCCIPHSTSNGTLGNPHFFSYSIDVGALDVDTNPGIEFGLRIVNPTGMARVSNLEIRE
DRPLAANEIRQVQRVARNWRTEYEKERAEVTSLIQPVINRINGLYENENWNGSIRSDISYQNIDAIVLPTLPTLRHWFMS
DRFSEQGDIMAKFQGALNRAYAQLEQSTLLHNGHFTKDAANWTIEGDAHQITLEDGRRVLRLPDWSSSVSQMIEIENFNP
DKEYNLVFHGQGEGTVTLEHGEETKYIETHTHHFANFTTSQRQGLTFESNKVTVTISSEDGEFLVDNIALVEAPLPTDDQ
NSEGNTAFSTNSDTSMNNNQ
>Q45712 ~~~cry5Ba~~~Pesticidal crystal protein Cry5Ba~~~
MATINELYPVPYNVLAHPIKEVDDPYSWSNLLKGIQEGWEEWGKTGQKKLFEDHLTIAWNLYKTGKLDYFALTKASISLI
GFIPGAEAAVPFINMFVDFVWPKLFGANTEGKDQQLFNAIMDAVNKMVDNKFLSYNLSTLNKTIEGLQGNLGLFQNAIQV
AICQGSTPERVNFDQNCTPCNPNQPCKDDLDRVASRFDTANSQFTQHLPEFKNPWSDENSTQEFKRTSVELTLPMYTTVA
TLHLLLYEGYIEFMTKWNFHNEQYLNNLKVELQQLIHSYSETVRTSFLQFLPTLNNRSKSSVNAYNRYVRNMTVNCLDIA
ATWPTFDTHNYHQGGKLDLTRIILSDTAGPIEEYTTGDKTSGPEHSNITPNNILDTPSPTYQHSFVSVDSIVYSRKELQQ
LDIATYSTNNSNNCHPYGLRLSYTDGSRYDYGDNQPDFTTSNNNYCHNSYTAPITLVNARHLYNAKGSLQNVESLVVSTV
NGGSGSCICDAWINYLRPPQTSKNESRPDQKINVLYPITETVNKGTGGNLGVISAYVPMELVPENVIGDVNADTKLPLTQ
LKGFPFEKYGSEYNNRGISLVREWINGNNAVKLSNSQSVGIQITNQTKQKYEIRCRYASKGDNNVYFNVDLSENPFRNSI
SFGSTESSVVGVQGENGKYILKSITTVEIPAGSFYVHITNQGSSDLFLDRIEFVPKIQFQFCDNNNLHCDCNNPVDTDCT
FCCVCTSLTDCDCNNPRGLDCTLCCQVENQLPSFVTLTDLQNITTQVNALVASSEHDTLATDVSDYEIEEVVLKVDALSG
EVFGKEKKALRKLVNHTKRLSKARNLLIGGNFDNLDAWYRGRNVVNVSDHELFKSDHVLLPPPTLYSSYMFQKVEESKLK
ANTRYTVSGFIAHAEDLEIVVSRYGQEVKKVVQVPYGEAFPLTSRGAICCPPRSTSNGKPADPHFFSYSIDVGTLDVEAN
PGIELGLRIVERTGMARVSNLEIREDRPLKKNELRNVQRAARNWRTAYDQERAEVTALIQPVLNQINALYENEDWNGAIR
SGVSYHDLEAIVLPTLPKLNHWFMSDMLGEQGSILAQFQEALDRAYTQLEESTILHNGHFTTDAANWTIEGDAHHAILED
GRRVLRLPDWSSSVSQTIEIENFDPDKEYQLVFHAQGEGTVSLQHGEEGEYVETHPHKSANFTTSHRQGVTFETNKVTVE
ITSEDGEFLVDHIALVEAPLPTDDQSSDGNTTSNTNSNTSMNNNQ
>Q03749 ~~~cry7Aa~~~Pesticidal crystal protein Cry7Aa~~~
MNLNNLDGYEDSNRTLNNSLNYPTQKALSPSLKNMNYQDFLSITEREQPEALASGNTAINTVVSVTGATLSALGVPGASF
ITNFYLKIAGLLWPENGKIWDEFMTEVEALIDQKIEEYVRNKAIAELDGLGSALDKYQKALADWLGKQDDPEAILSVATE
FRIIDSLFEFSMPSFKVTGYEIPLLTVYAQAANLHLALLRDSTLYGDKWGFTQNNIEENYNRQKKRISEYSDHCTKWYNS
GLSRLNGSTYEQWINYNRFRREMILMALDLVAVFPFHDPRRYSMETSTQLTREVYTDPVSLSISNPDIGPSFSQMENTAI
RTPHLVDYLDELYIYTSKYKAFSHEIQPDLFYWSAHKVSFKKSEQSNLYTTGIYGKTSGYISSGAYSFHGNDIYRTLAAP
SVVVYPYTQNYGVEQVEFYGVKGHVHYRGDNKYDLTYDSIDQLPPDGEPIHEKYTHRLCHATAIFKSTPDYDNATIPIFS
WTHRSAEYYNRIYPNKITKIPAVKMYKLDDPSTVVKGPGFTGGDLVKRGSTGYIGDIKATVNSPLSQKYRVRVRYATNVS
GQFNVYINDKITLQTKFQNTVETIGEGKDLTYGSFGYIEYSTTIQFPDEHPKITLHLSDLSNNSSFYVDSIEFIPVDVNY
AEKEKLEKAQKAVNTLFTEGRNALQKDVTDYKVDQVSILVDCISGDLYPNEKRELQNLVKYAKRLSYSRNLLLDPTFDSI
NSSEENGWYGSNGIVIGNGDFVFKGNYLIFSGTNDTQYPTYLYQKIDESKLKEYTRYKLKGFIESSQDLEAYVIRYDAKH
RTLDVSDNLLPDILPENTCGEPNRCAAQQYLDENPSPECSSMQDGILSDSHSFSLNIDTGSINHNENLGIWVLFKISTLE
GYAKFGNLEVIEDGPVIGEALARVKRQETKWRNKLAQLTTETQAIYTRAKQALDNLFANAQDSHLKRDVTFAEIAAARKI
VQSIREAYMSWLSVVPGVNHPIFTELSGRVQRAFQLYDVRNVVRNGRFLNGLSDWIVTSDVKVQEENGNNVLVLNNWDAQ
VLQNVKLYQDRGYILHVTARKIGIGEGYITITDEEGHTDQLRFTACEEIDASNAFISGYITKELEFFPDTEKVHIEIGET
EGIFLVESIELFLMEELC
>Q45707 ~~~cry7Ab~~~Pesticidal crystal protein Cry7Ab~~~
MNLNNLGGYEDSNRTLNNSLNYPTQKALSPSLKNMNYQDFLSITEREQPEALASGNTAINTVVSVTGATLSALGVPGASF
ITNFYLKITGLLWPHNKNIWDEFMTEVETLIEQKIEQYARNKALAELEGLGNNLTIYQQALEDWLNNPDDPATITRVIDR
FRILDALFESYMPSFRVAGYEIPLLTVYAQAANLHLALLRDSTLYGDKWGFTQNNIEENYNRQKKHISEYSNHCVKWYNS
GLSRLNGSTYEQWINYNRFRREMILMVLDIAAVFPIYDPRMYSMETSTQLTREVYTDPISLSISNPDIGPSFSQMENTAF
RTPHLVDYLDELYIYTSKYKAFSHEIQPDLFYWCVHKVSFKKSEQSNLYTTGIYGKTSGYISSGAYSFRGNDIYRTLAAP
SVVVYPYTQNYGVEQVEFYGVKGHVHYRGDNKYDLTYDSIDQLPPDGEPIHEKYTHRLCHATAISKSTPDYDNATIPIFS
WTHRSAEYYNRIYPNKIKKIPAVKMYKLDDLSTVVKGPGFTGGDLVKRGSNGYIGDIKATVNSPLSQKYRVRVRYATSVS
GLFNVFINDEIALQKNFQSTVETIGEGKDLTYGSFGYIEYSTTIQFPNEHPKITLHLNHLSNNSPFYVDSIEFIPVDVNY
DEKEKLEKAQKAVNTLFTEGRNALQKYVTDYKVDQVSILVDCISGDLYPNEKRELQNLVKYAKRLSYSRNLLLDPTFDSI
NSSEENGWYGSNGIVIGNGDFVFKGNYLIFSGTNDTQYPTYLYQKIDESKLKEYSRYKLKGFIESSQDLEAYVIRYDAKH
RTLDVSDNLLPDILPENTCGEPNRCAAQQYLDENPSSECSSMQDGILSDSHSFSLNIDTGSINHNENLGIWVLFKISTLE
GYAKFGNLEVIEDGPVIGEALARVKRQETKWRNKLAQMTTETQAIYTRAKQALDNLFANAQDSHLKIDVTFAEIAAARKI
VQSIREVYMSWLSVVPGVNHPIFTELSGRVQRAFQLYDVRNVVRNGRFLNGLSDWIVTSDVNVQEENGNNVLVLNNWDAQ
VLRNVKLYQDRGYVLRVTARKIGIGEGYITITDEEGHTDQLRFTACEEIDASNAFISGYITKELEFFPDTEKVHIEIGET
EGIFLVESIELFLMEELC
>Q45708 ~~~cry7Ab~~~Pesticidal crystal protein Cry7Ab~~~
MNLNNLGGYEDSNRTLNNSLNYPTQKALSPSLKNMNYQDFLSITEREQPEALASGNTAINTVVSVTGATLSALGVPGASF
ITNFYLKITGLLWPHDKNIWDEFMTEVETLIEQKIEQYARNKALAELEGLGNNLTIYQQALEDWLNNPDDPATITRVIDR
FRILDALFESYMPSFRVAGYEIPLLTVYAQAANLHLALLRDSTLYGDKWEFTQNNIEENYNRQKKHISEYSNHCVKWYNS
GLSRLNGSTYEQWINYNRFRREMILMVLDIAAVFPIYDPRMYSMETSTQLTREVYTDPISLSISNPGIGPSFSQMENTAI
RTPHLVDYLDELYIYTSKYKAFSHEIQPDLFYWSAHKVSFKQSEQSNLYTTGIYGKTSGYISSGAYSFRGNDIYRTLAAP
SVVVYPYTQNYGVEQVEFYGVKGHVHYRGDNKYDLTYDSIDQLPPDGEPIHEKYTHRLCHATAISKSTPDYDNATIPIFS
WTHRSAEYYNRIYPNKITKIPAVKMYKLGDTSTVVKGPGFTGGDLVKRGSNGYIGDIKATVNSPLSQNYRVRVRYATNVS
GQFNVYINDKITLQRKFQNTVETIGEGKDLTYGSFGYIEYSTTIQFPDKHPKITLHLSDLSNNSSFYVDSIEFIPVDVNY
DEKEKLEKAQKAVNTLFTEGRNALQKDVTDYKVDQVSILVDCISGDLYPNEKRELQNLVKYAKRLSYSRNLLLDPTFDSI
NSSEENGWYGSNGIVIGNGDFVFKGNYLIFSGTNDTQYPTYLYQKIDESKLKEYTRYKLKGFIESSQDLEAYVIRYDAKH
RTLDVSDNLLPDILPENTCGEPNRCAAQQYLDENPSSECSSMQDGILSDSHSFSLNIDIGSINHNENLGIWVLFKISTLE
GYAKFGNLEVIEDGPVIGEALARVKRQETKWRNKLAQLTTETQAIYTRAKQALDNLFANAQDSHLKIDVTFAEIAAARKI
VQSIREAYMSWLSVVPGVNHPIFTELSERVQRAFQLYDVRNVVRNGRFLNGLSDWIVTSDVKVQEENGNNVLVLNNWDAQ
VLQNVKLYQDRGYILRVTARKIGIGEGYITITDEEGHTVQLRFTACEVIDASNAFISGYITKELEFFPDTEKVHIEIGET
EGIFLVESIELFLMEELC
>Q45704 ~~~cry8Aa~~~Pesticidal crystal protein Cry8Aa~~~
MSPNNQNEYEIIDATPSTSVSSDSNRYPFANEPTDALQNMNYKDYLKMSGGENPELFGNPETFISSSTIQTGIGIVGRIL
GALGVPFASQIASFYSFIVGQLWPSKSVDIWGEIMERVEELVDQKIEKYVKDKALAELKGLGNALDVYQQSLEDWLENRN
DARTRSVVSNQFIALDLNFVSSIPSFAVSGHEVLLLAVYAQAVNLHLLLLRDASIFGEEWGFTPGEISRFYNRQVQLTAE
YSDYCVKWYKIGLDKLKGTTSKSWLNYHQFRREMTLLVLDLVALFPNYDTHMYPIETTAQLTRDVYTDPIAFNIVTSTGF
CNPWSTHSGILFYEVENNVIRPPHLFDILSSVEINTSRGGITLNNDAYINYWSGHTLKYRRTADSTVTYTANYGRITSEK
NSFALEDRDIFEINSTVANLANYYQKAYGVPGSWFHMVKRGTSSTTAYLYSKTHTALQGCTQVYESSDEIPLDRTVPVAE
SYSHRLSHITSHSFSKNGSAYYGSFPVFVWTHTSADLNNTIYSDKITQIPAVKGDMLYLGGSVVQGPGFTGGDILKRTNP
SILGTFAVTVNGSLSQRYRVRIRYASTTDFEFTLYLGDTIEKNRFNKTMDNGASLTYETFKFASFITDFQFRETQDKILL
SMGDFSSGQEVYIDRIEFIPVDETYEAEQDLEAAKKAVNALFTNTKDGLRPGVTDYEVNQAANLVECLSDDLYPNEKRLL
FDAVREAKRLSGARNLLQDPDFQEINGENGWAASTGIEIVEGDAVFKGRYLRLPGAREIDTETYPTYLYQKVEEGVLKPY
TRYRLRGFVGSSQGLEIYTIRHQTNRIVKNVPDDLLPDVSPVNSDGSINRCSEQKYVNSRLEGENRSGDAHEFSLPIDIG
ELDYNENAGIWVGFKITDPEGYATLGNLELVEEGPLSGDALERLQREEQQWKIQMTRRREETDRRYMASKQAVDRLYADY
QDQQLNPDVEITDLTAAQDLIQSIPYVYNEMFPEIPGMNYTKFTELTDRLQQAWNLYDQRNAIPNGDFRNGLSNWNATPG
VEVQQINHTSVLVIPNWDEQVSQQFTVQPNQRYVLRVTARKEGVGNGYVSIRDGGNQSETLTFSASDYDTNGVYNDQTGY
ITKTVTFIPYTDQMWIEISETEGTFYIESVELIVDVE
>Q45705 ~~~cry8Ba~~~Pesticidal crystal protein Cry8Ba~~~
MSPNNQNEYEIIDATPSTSVSNDSNRYPFANEPTNALQNMDYKDYLKMSAGNVSEYPGSPEVFLSEQDAVKAAIDIVGKL
LTGLGVPFVGPIVSLYTQLIDILWPSKQKSQWEIFMEQVEELINQKIAEYARNKALSELEGLGNNYQLYLTALEEWKENP
NGSRALRDVRNRFEILDSLFTQYMPSFRVTNFEVPFLTVYTMAANLHLLLLRDASIFGEEWGLSTSTINNYYNRQMKLTA
EYSDHCVKWYETGLAKLKGSSAKQWIDYNQFRREMTLTVLDVVALFSNYDTRTYPLATTAQLTREVYTDPLGAVDVPNIG
SWYDKAPSFSEIEKAAIRPPHVFDYITGLTVYTKKRSFTSDRYMRYWAGHQISYKHIGTSSTFTQMYGTNQNLQSTSNFD
FTNYDIYKTLSNGAVLLDIVYPGYTYTFFGMPETEFFMVNQLNNTRKTLTYKPASKDIIDRTRDSELELPPETSGQPNYE
SYSHRLGHITFIYSSSTSTYVPVFSWTHRSADLTNTVKSGEITQIPGGKSSTIGRNTYIIKGRGYTGGDLVALTDRIGSC
EFQMIFPESQRFRIRIRYASNETSYISLYGLNQSGTLKFNQTYSNKNENDLTYNDFKYIEYPRVISVNASSNIQRLSIGI
QTNTNLFILDRIEFIPVDETYEAETDLEAAKKAVNALFTNTKDGLQPGVTDYEVNQAANLVECLSDDLYPNEKRLLFDAV
REAKRLSEARNLLQDPDFQEINGENGWTASTGIEVIEGDAVFKGRYLRLPGAREIDTETYPTYLYQKVEEGVLKPYTRYR
LRGFVGSSQGLEIYTIRHQTNRIVKNVPDDLLPDVPPVNNDGRINRCSEQKYVNSRLEVENRSGEAHEFSIPIDTGELDY
NENAGIWVGFKITDPEGYATLGNLELVEEGPLSGDALERLQKEEQQWKIQMTRRREETDRRYMASKQAVDRLYADYQDQQ
LNPNVEITDLTAAQDLIQSIPYVYNEMFPEIPGMNYTKFTELTDRLQQAWGLYDQRNAIPNGDYRNELSNWNTTSGVNVQ
QINHTSVLVIPNWNEQVSQKFTVQPNQRYVLRVTARKEGVGNGYVSIRDGGNQSETLTFSASDYDTNGMYDTQASNTNGY
NTNSVYMIKPAISRKTVDISSVYNQMWIEISETEGTFYIESVELIVDVE
>Q45706 ~~~cry8Ca~~~Pesticidal crystal protein Cry8Ca~~~
MSPNNQNEYEIIDALSPTSVSDNSIRYPLANDQTNTLQNMNYKDYLKMTESTNAELSRNPGTFISAQDAVGTGIDIVSTI
ISGLGIPVLGEVFSILGSLIGLLWPSNNENVWQIFMNRVEELIDQKILDSVRSRAIADLANSRIAVEYYQNALEDWRKNP
HSTRSAALVKERFGNAEAILRTNMGSFSQTNYETPLLPTYAQAASLHLLVMRDVQIYGKEWGYPQNDIDLFYKEQVSYTA
RYSDHCVQWYNAGLNKLRGTGAKQWVDYNRFRREMNVMVLDLVALFPNYDARIYPLETNAELTREIFTDPVGSYVTGQSS
TLISWYDMIPAALPSFSTLENLLRKPDFFTLLQEIRMYTSFRQNGTIEYYNYWGGQRLTLSYIYGSSFNKYSGVLAGAED
IIPVGQNDIYRVVWTYIGRYTNSLLGVNPVTFYFSNNTQKTYSKPKQFAGGIKTIDSGEELTYENYQSYSHRVSYITSFE
IKSTGGTVLGVVPIFGWTHSSASRNNFIYATKISQIPINKASRTSGGAVWNFQEGLYNGGPVMKLSGSGSQVINLRVATD
AKGASQRYRIRIRYASDRAGKFTISSRSPENPATYSASIAYTNTMSTNASLTYSTFAYAESGPINLGISGSSRTFDISIT
KEAGAANLYIDRIEFIPVNTLFEAEEDLDVAKKAVNGLFTNEKDALQTSVTDYQVNQAANLIECLSDELYPNEKRMLWDA
VKEAKRLVQARNLLQDTGFNRINGENGWTGSTGIEVVEGDVLFKDRSLRLTSAREIDTETYPTYLYQQIDESLLKPYTRY
KLKGFIGSSQDLEIKLIRHRANQIVKNVPDNLLPDVRPVNSCGGVDRCSEQQYVDANLALENNGENGNMSSDSHAFSFHI
DTGEIDLNENTGIWIVFKIPTTNGNATLGNLEFVEEGPLSGETLEWAQQQEQQWQDKMARKRAASEKTYYAAKQAIDRLF
ADYQDQKLNSGVEMSDLLAAQNLVQSIPYVYNDALPEIPGMNYTSFTELTNRLQQAWNLYDLQNAIPNGDFRNGLSNWNA
TSDVNVQQLSDTSVLVIPNWNSQVSQQFTVQPNYRYVLRVTARKEGVGDGYVIIRDGANQTETLTFNICDDDTGVLSTDQ
TSYITKTVEFTPSTEQVWIDMSETEGVFNIESVELVLEEE
>Q99031 ~~~cry9Aa~~~Pesticidal crystal protein Cry9Aa~~~
MNQNKHGIIGASNCGCASDDVAKYPLANNPYSSALNLNSCQNSSILNWINIIGDAAKEAVSIGTTIVSLITAPSLTGLIS
IVYDLIGKVLGGSSGQSISDLSICDLLSIIDLRVSQSVLNDGIADFNGSVLLYRNYLEALDSWNKNPNSASAEELRTRFR
IADSEFDRILTRGSLTNGGSLARQNAQILLLPSFASAAFFHLLLLRDATRYGTNWGLYNATPFINYQSKLVELIELYTDY
CVHWYNRGFNELRQRGTSATAWLEFHRYRREMTLMVLDIVASFSSLDITNYPIETDFQLSRVIYTDPIGFVHRSSLRGES
WFSFVNRANFSDLENAIPNPRPSWFLNNMIISTGSLTLPVSPSTDRARVWYGSRDRISPANSQFITELISGQHTTATQTI
LGRNIFRVDSQACNLNDTTYGVNRAVFYHDASEGSQRSVYEGYIRTTGIDNPRVQNINTYLPGENSDIPTPEDYTHILST
TINLTGGLRQVASNRRSSLVMYGWTHKSLARNNTINPDRITQIPLTKVDTRGTGVSYVNDPGFIGGALLQRTDHGSLGVL
RVQFPLHLRQQYRIRVRYASTTNIRLSVNGSFGTISQNLPSTMRLGEDLRYGSFAIREFNTSIRPTASPDQIRLTIEPSF
IRQEVYVDRIEFIPVNPTREAKEDLEAAKKAVASLFTRTRDGLQVNVKDYQVDQAANLVSCLSDEQYGYDKKMLLEAVRA
AKRLSRERNLLQDPDFNTINSTEENGWKASNGVTISEGGPFYKGRAIQLASARENYPTYIYQKVDASELKPYTRYRLDGF
VKSSQDLEIDLIHHHKVHLVKNVPDNLVSDTYPDDSCSGINRCQEQQMVNAQLETEHHHPMDCCEAAQTHEFSSYIDTGD
LNSSVDQGIWAIFKVRTTDGYATLGNLELVEVGPLSGESLEREQRDNTKWSAELGRKRAETDRVYQDAKQSINHLFVDYQ
DQQLNPEIGMADIMDAQNLVASISDVYSDAVLQIPGINYEIYTELSNRLQQASYLYTSRNAVQNGDFNNGLDSWNATAGA
SVQQDGNTHFLVLSHWDAQVSQQFRVQPNCKYVLRVTAEKVGGGDGYVTIRDDAHHTETLTFNACDYDINGTYVTDNTYL
TKEVVFHPETQHMWVEVNETEGAFHIDSIEFVETEK
>Q45733 ~~~cry9Ca~~~Pesticidal crystal protein Cry9Ca~~~
MNRNNQNEYEIIDAPHCGCPSDDDVRYPLASDPNAALQNMNYKDYLQMTDEDYTDSYINPSLSISGRDAVQTALTVVGRI
LGALGVPFSGQIVSFYQFLLNTLWPVNDTAIWEAFMRQVEELVNQQITEFARNQALARLQGLGDSFNVYQRSLQNWLADR
NDTRNLSVVRAQFIALDLDFVNAIPLFAVNGQQVPLLSVYAQAVNLHLLLLKDASLFGEGWGFTQGEISTYYDRQLELTA
KYTNYCETWYNTGLDRLRGTNTESWLRYHQFRREMTLVVLDVVALFPYYDVRLYPTGSNPQLTREVYTDPIVFNPPANVG
LCRRWGTNPYNTFSELENAFIRPPHLFDRLNSLTISSNRFPVSSNFMDYWSGHTLRRSYLNDSAVQEDSYGLITTTRATI
NPGVDGTNRIESTAVDFRSALIGIYGVNRASFVPGGLFNGTTSPANGGCRDLYDTNDELPPDESTGSSTHRLSHVTFFSF
QTNQAGSIANAGSVPTYVWTRRDVDLNNTITPNRITQLPLVKASAPVSGTTVLKGPGFTGGGILRRTTNGTFGTLRVTVN
SPLTQQYRLRVRFASTGNFSIRVLRGGVSIGDVRLGSTMNRGQELTYESFFTREFTTTGPFNPPFTFTQAQEILTVNAEG
VSTGGEYYIDRIEIVPVNPAREAEEDLEAAKKAVASLFTRTRDGLQVNVTDYQVDQAANLVSCLSDEQYGHDKKMLLEAV
RAAKRLSRERNLLQDPDFNTINSTEENGWKASNGVTISEGGPFFKGRALQLASARENYPTYIYQKVDASVLKPYTRYRLD
GFVKSSQDLEIDLIHHHKVHLVKNVPDNLVSDTYSDGSCSGINRCDEQHQVDMQLDAEHHPMDCCEAAQTHEFSSYINTG
DLNASVDQGIWVVLKVRTTDGYATLGNLELVEVGPLSGESLEREQRDNAKWNAELGRKRAEIDRVYLAAKQAINHLFVDY
QDQQLNPEIGLAEINEASNLVESISGVYSDTLLQIPGINYEIYTELSDRLQQASYLYTSRNAVQNGDFNSGLDSWNTTMD
ASVQQDGNMHFLVLSHWDAQVSQQLRVNPNCKYVLRVTARKVGGGDGYVTIRDGAHHQETLTFNACDYDVNGTYVNDNSY
ITEEVVFYPETKHMWVEVSESEGSFYIDSIEFIETQE
>O06014 ~~~cry9Da~~~Pesticidal crystal protein Cry9Da~~~
MNRNNQNEYEVIDAPHCGCPADDVVKYPLTDDPNAGLQNMNYKEYLQTYGGDYTDPLINPNLSVSGKDVIQVGINIVGRL
LSFFGFPFSSQWVTVYTYLLNSLWPDDENSVWDAFMERVEELIDQKISEAVKGRALDDLTGLQYNYNLYVEALDEWLNRP
NGARASLVSQRFNILDSLFTQFMPSFGSGPGSQNYATILLPVYAQAANLHLLLLKDADIYGARWGLNQTQIDQFHSRQQS
LTQTYTNHCVTAYNDGLAELRGTTAESWFKYNQYRREMTLTAMDLVALFPYYNLRQYPDGTNPQLTREVYTDPIAFDPLE
QPTTQLCRSWYINPAFRNHLNFSVLENSLIRPPHLFERLSNLQILVNYQTNGSAWRGSRVRYHYLHSSIIQEKSYGLLSD
PVGANINVQNNDIYQIISQVSNFASPVGSSYSVWDTNFYLSSGQVSGISGYTQQGIPAVCLQQRNSTDELPSLNPEGDII
RNYSHRLSHITQYRFQATQSGSPSTVSANLPTCVWTHRDVDLDNTITANQITQLPLVKAYELSSGATVVKGPGFTGGDVI
RRTNTGGFGAIRVSVTGPLTQRYRIRFRYASTIDFDFFVTRGGTTINNFRFTRTMNRGQESRYESYRTVEFTTPFNFTQS
QDIIRTSIQGLSGNGEVYLDRIEIIPVNPAREAEEDLEAAKKAARQNLFTRTRDGLQVNVTDYQVDQAANLVSCLSDEQY
GHDKKMLLEAVRAAKRLSRERNLLQDPDFNTINSTEENGWKASNGVTISEGGPFFKGRALQLASARENYPTYIYQKVDAS
VLKPYTRYRLDGFVKSSQDLEIDLIHYHKVHLVKNVPDNLVSDTYSDGSCSGMNRCEEQQMVNAQLETEHHHPMDCCEAA
QTHEFSSYINTGDLNASVDQGIWVVLKVRTTDGYATLGNLELVEVGPLSGESLEREQRDNAKWNAELGRKRAEIDRVYLA
AKQAINHLFVDYQDQQLNPEIGLAEINEASNLVESISGVYSDTLLQIPGINYEIYTELSDRLQQASYLYTSRNAVQNGDF
NSGLDSWNTTTDASVQQDGNMHFLVLSHWDAQVSQQLRVNPNCKYVLRVTARKVGGGDGYVTIRDGAHHQETLTFNACDY
DVNGTYVNDNSYITEEVVFYPETKHMWVEVSESEGSFYIDSIEFIETQE
>Q9ZNL9 ~~~cry9Ea~~~Pesticidal crystal protein Cry9Ea~~~
MNRNNPNEYEIIDAPYCGCPSDDDVRYPLASDPNAAFQNMNYKEYLQTYDGDYTGSLINPNLSINPRDVLQTGINIVGRI
LGFLGVPFAGQLVTFYTFLLNQLWPTNDNAVWEAFMAQIEELIDQKISAQVVRNALDDLTGLHDYYEEYLAALEEWLERP
NGARANLVTQRFENLHTAFVTRMPSFGTGPGSQRDAVALLTVYAQAANLHLLLLKDAEIYGARWGLQQGQINLYFNAQQE
RTRIYTNHCVETYNRGLEDVRGTNTESWLNYHRFRREMTLMAMDLVALFPFYNVRQYPNGANPQLTREIYTDPIVYNPPA
NQGICRRWGNNPYNTFSELENAFIRPPHLFERLNRLTISRNRYTAPTTNSFLDYWSGHTLQSQHANNPTTYETSYGQITS
NTRLFNTTNGARAIDSRARNFGNLYANLYGVSSLNIFPTGVMSEITNAANTCRQDLTTTEELPLENNNFNLLSHVTFLRF
NTTQGGPLATLGFVPTYVWTREDVDFTNTITADRITQLPWVKASEIGGGTTVVKGPGFTGGDILRRTDGGAVGTIRANVN
APLTQQYRIRLRYASTTSFVVNLFVNNSAAGFTLPSTMAQNGSLTYESFNTLEVTHTIRFSQSDTTLRLNIFPSISGQEV
YVDKLEIVPINPTREAEEDLEDAKKAVASLFTRTRDGLQVNVTDYQVDQAANLVSCLSDEQYGHDKKMLLEAVRAAKRLS
RERNLLQDPDFNEINSTEENGWKASNGVTISEGGPFFKGRALQLASARENYPTYIYQKVDASTLKPYTRYKLDGFVQSSQ
DLEIDLIHHHKVHLVKNVPDNLVSDTYSDGSCSGINRCEEQHQVDVQLDAEDHPKDCCEAAQTHEFSSYIHTGDLNASVD
QGIWVVLQVRTTDGYATLGNLELVEVGPLSGESLEREQRDNAKWNEEVGRKRAETDRIYQDAKQAINHLFVDYQDQQLSP
EVGMADIIDAQNLIASISDVYSDAVLQIPGINYEMYTELSNRLQQASYLYTSRNVVQNGDFNSGLDSWNATTDTAVQQDG
NMHFLVLSHWDAQVSQQFRVQPNCKYVLRVTAKKVGNGDGYVTIQDGAHHRETLTFNACDYDVNGTHVNDNSYITKELVF
YPKTEHMWVEVSETEGTFYIDSIEFIETQE
>D0CCT2 ~~~craA~~~Chloramphenicol resistance protein CraA~~~
MYKLMKNIQTTALNRTTLMFPLALVLFEFAVYIGNDLIQPAMLAITEDFGVSATWAPSSMSFYLLGGASVAWLLGPLSDR
LGRKKVLLSGVLFFALCCFLILLTRQIEHFLTLRFLQGIGLSVISAVGYAAIQENFAERDAIKVMALMANISLLAPLLGP
VLGAFLIDYVSWHWGFVAIALLALLSWVGLKKQMPSHKVSVTKQPFSYLFDDFKKVFSNRQFLGLTLALPLVGMPLMLWI
ALSPIILVDELKLTSVQYGLAQFPVFLGLIVGNIVLIKIIDRLALGKTVLIGLPIMLTGTLILILGVVWQAYLIPCLLIG
MTLICFGEGISFSVLYRFALMSSEVSKGTVAAAVSMLLMTSFFAMIELVRYLYTQFHLWAFVLSAFAFIALWFTQPRLAL
KREMQERVAQDLH
>Q4MV79 2.4.2.-~~~~~~Putative ADP-ribosyltransferase Certhrax~~~
MKEIIRNLVRLDVRSDVDENSKKTQELVEKLPHEVLELYKNVGGEIYITDKRLTQHEELSDSSHKDMFIVSSEGKSFPLR
EHFVFAKGGKEPSLIIHAEDYASHLSSVEVYYELGKAIIRDTFPLNQKELGNPKFINAINEVNQQKEGKGVNAKADEDGR
DLLFGKELKKNLEHGQLVDLDLISGNLSEFQHVFAKSFALYYEPHYKEALKSYAPALFNYMLELDQMRFKEISDDVKEKN
KNVLDFKWYTRKAESWGVQTFKNWKENLTISEKDIITGYTGSKYDPINEYLRKYDGEIIPNIGGDLDKKSKKALEKIENQ
IKNLDAALQKSKITENLIVYRRVSELQFGKKYEDYNLRQNGIINEEKVMELESNFKGQTFIQHNYMSTSLVQDPHQSYSN
DRYPILLEITIPEGVHGAYIADMSEYPGQYEMLINRGYTFKYDKFSIVKPTREEDKGKEYLKVNLSIYLGNLNREK
>P0ACP1 ~~~cra~~~Catabolite repressor/activator~~~COG1609
MKLDEIARLAGVSRTTASYVINGKAKQYRVSDKTVEKVMAVVREHNYHPNAVAAGLRAGRTRSIGLVIPDLENTSYTRIA
NYLERQARQRGYQLLIACSEDQPDNEMRCIEHLLQRQVDAIIVSTSLPPEHPFYQRWANDPFPIVALDRALDREHFTSVV
GADQDDAEMLAEELRKFPAETVLYLGALPELSVSFLREQGFRTAWKDDPREVHFLYANSYEREAAAQLFEKWLETHPMPQ
ALFTTSFALLQGVMDVTLRRDGKLPSDLAIATFGDNELLDFLQCPVLAVAQRHRDVAERVLEIVLASLDEPRKPKPGLTR
IKRNLYRRGVLSRS
>Q0AVM1 4.2.1.150~~~~~~Crotonyl-CoA hydratase~~~COG1024
MAYENIILEKEEKLAVLYINRPKAMNALNKDTLLEIKDAVTAVNDDPAVELLIITGSGDKSFVAGADIAFMQNLSAMEAR
EFGALGQKVFRLIEAMEKPVIAAVNGFALGGGCELAMCCDFRIAASNAKFGQPEVGLGITPGFGGTQRLPRLVGPGMAKQ
LLYTADVINADEAFRIGLVNKVVQPEELLPEVKKIAGRILSKGQLAVRLSKAAANEGMQTDIDRAMSIEADAFGLCFATQ
DQKEGMTAFLEKRKANFISK
>P38487 3.5.3.3~~~~~~Creatinase~~~
MQQITDLERTKILQNGGEKVKPTFSKEEMTRRNTRLREYMAKAGIDAVMFTSYHNINYYSDFLYTSFNRSYALVVTQDKH
VTVSANIDAGMPWRRSFDENIVYTDWKRDNFLYAVKKVLNEGGFSSGRLGVENDHMTLDLRRQVQDALPNTELVDVSQAV
MGHRMFKSDEEIDLIKNGARIADIGGAAVVEAIREGVPEYEVALHGTEAMVREIARTYPHAELRDTWIWFQSGINTDGAH
NWATSRKLQRGDILSLNCFPMIAGYYTALERTLFLEEVSDRHLELWEINCKVHRRGLELIKPGARCMDIAAELNEIYREH
DLLANRTFGYGHSFGVLSHYYGREAGLELREDIETVLEPGMVVSMEPMIMIPEGEPGAGGYREHDILVISENGTENITKF
PFGPEHNIIKK
>P19213 3.5.3.3~~~~~~Creatinase~~~
MQMPKTLRIRNGEKVKSTFSAQEYANRHAKLRAHLAAENIDAAVFTSYHNINYYSDFLYCSFGRPYALVVTQDDVISISA
NIDGGQPWRRTVGTDNIVYTDWQRDNYFVAIQQALPRARRIGIEHDHLNLQNRDKLAARYPDAELVDVAAACMRMRMIKS
AEEHEMIRHGARVADIGGAAIVEALRDQVPEYEVALHATQAMVRAIAETFDNVELMDTWTWFQSGINTDGAHNPVTTRKV
NKGDILSLNCFPMIAGYYTALERTLFLDHCSDDHLRMWQANVEVHEAGLKLIKPGMRCSDIAKELNEIFLKHDLLQYRTF
GYGHSFGTLSHYYGREAGLELREDIDTVLEPGMVVSMEPMIMLPEGRPGAGGYREHDILIVNENGAENITKFPYGPERNI
IRK
>P38488 3.5.3.3~~~~~~Creatinase~~~
MQMPKTLRIRNGDKVRSTFSAQEYANRQARLRAHLAAENIDAAIFTSYHNINYYSDFLYCSFGRPYALVVTEDDVISISA
NIDGGQPWRRTVGTDNIVYTDWQRDNYFAAIQQALPKARRIGIEHDHLNLQNRDKLAARYPDAELVDVAAACMRMRMIKS
AEEHVMIRHGARIADIGGAAVVEALGDQVPEYEVALHATQAMVRAIADTFEDVELMDTWTWFQSGINTDGAHNPVTTRKV
NKGDILSLNCFPMIAGYYTALERTLFLDHCSDDHLRLWQVNVEVHEAGLKLIKPGARCSDIARELNEIFLKHDVLQYRTF
GYGHSFGTLSHYYGREAGLELREDIDTVLEPGMVVSMEPMIMLPEGLPGAGGYREHDILIVNENGAENITKFPYGPEKNI
IRK
>P08368 ~~~creB~~~Transcriptional regulatory protein CreB~~~COG0745
MQRETVWLVEDEQGIADTLVYMLQQEGFAVEVFERGLPVLDKARKQVPDVMILDVGLPDISGFELCRQLLALHPALPVLF
LTARSEEVDRLLGLEIGADDYVAKPFSPREVCARVRTLLRRVKKFSTPSPVIRIGHFELNEPAAQISWFDTPLALTRYEF
LLLKTLLKSPGRVWSRQQLMDSVWEDAQDTYDRTVDTHIKTLRAKLRAINPDLSPINTHRGMGYSLRGL
>P08401 2.7.13.3~~~creC~~~Sensor protein CreC~~~COG2205
MRIGMRLLLGYFLLVAVAAWFVLAIFVKEVKPGVRRATEGTLIDTATLLAELARPDLLSGDPTHGQLAQAFNQLQHRPFR
ANIGGINKVRNEYHVYMTDAQGKVLFDSANKAVGQDYSRWNDVWLTLRGQYGARSTLQNPADPESSVMYVAAPIMDGSRL
IGVLSVGKPNAAMAPVIKRSERRILWASAILLGIALVIGAGMVWWINRSIARLTRYADSVTDNKPVPLPDLGSSELRKLA
QALESMRVKLEGKNYIEQYVYALTHELKSPLAAIRGAAEILREGPPPEVVARFTDNILTQNARMQALVETLLRQARLENR
QEVVLTAVDVAALFRRVSEARTVQLAEKKITLHVTPTEVNVAAEPALLEQALGNLLDNAIDFTPESGCITLSAEVDQEHV
TLKVLDTGSGIPDYALSRIFERFYSLPRANGQKSSGLGLAFVSEVARLFNGEVTLRNVQEGGVLASLRLHRHFT
>K4R6W4 4.3.99.5~~~~~~Nitrosuccinate lyase~~~COG0015
MTFQLSPELAAVVDSGLLSPVRAGTPVEAAVSDAAWLAAMVEAETALVRAQARLGTVPESAAAAIVEAARPERLDLVALA
RASRETANPVVGFVKALTAVVAAEDPAAAEYVHRGSTSQDILDTATMLVVRRAGVLIRADLDRCAAALERLARTHRATPM
AGRTLTLHAVPTTFGLKAAGWLHLVTEARRRTAALAAALPVELGGAAGTLAGYLEHANNPGDDYADRLVEAYAHETGLAP
ATLPWHVLRTPIADTGAVCAFLAAALGKIAVDVQSLARTEVGEVTEPAVAGRGASSAMPHKRNPVLATLIRSAALQVPQH
AAVLYGAMLAEDERSGGAWHAEWQPLRECLRLAGGAAHTAVELLTGLTVDADRMRANLDLTGGQIVSERVAAVLTPLLGK
AQARALLTRASHEAADRGTSLAEVLSAAPEVTRHLTAAELEELLDPTRYLGAAPGLVDRAVGGPDRAVPHRSAA
>P08369 ~~~creD~~~Inner membrane protein CreD~~~COG4452
MLKSPLFWKMTSLFGAVLLLLIPIMLIRQVIVERADYRSDVEDAIRQSTSGPQKLVGPLIAIPVTELYTVQEEDKTVERK
RSFIHFWLPESLMVDGNQNVEERKIGIYTGQVWHSDLTLKADFDVSRLSELNAPNITLGKPFIVISVGDARGIGVVKAPE
VNGTALTIEPGTGLEQGGQGVHIPLPEGDWRKQNLKLNMALNLSGTGDLSVVPGGRNSEMTLTSNWPHPSFLGDFLPAKR
EVSESGFQAHWQSSWFANNLGERFASGNDTGWENFPAFSVAVTTPADQYQLTDRATKYAILLIALTFMAFFVFETLTAQR
LHPMQYLLVGLSLVMFYLLLLALSEHTGFTVAWIIASLIGAIMNGIYLQAVLKGWCNSMLFTLALLLLDGVMWGLLNSAD
SALLLGTSVLVVALAGMMFVTRNIDWYAFSLPKMKASKEVTTDDELRIWK
>A0A0K2JL82 4.3.99.5~~~creD~~~Nitrosuccinate lyase~~~
MTRPPAPPPGAPGADELLDCGLLSPVRAGTPVEALVCDSAWLQAMLDAEAALTRAQARTGFLPAAAAEAITAAARADRID
LLAVARGARETANPVVGLVAALTAAVRRDDPAAAEYVHRGSTSQDVLDTGAMLVARRALRLIGDDLDRAADALAALAADH
RDTPMAGRTLALHAVPTTFGLKAAGWLELVSEAAGRVARLRDGLPFSLGGAAGTLAGYFGDRTDRGDPAVLLDRLLDAYA
AETGLARPVLPWHVLRTPVADLAAVLAFTAGALGKIAVDVQSLARTEVAEVAEPAVEGRGASSAMPHKRNPVLSTLIRSA
ALQVPALATGLTQCLVSEDERSAGAWHAEWQPLRECLRLTGGAARTAVELAAGLEVDAARMRANLDLTDGRIVSESVAVA
LTPLLGRQAAKELLTRAAFTAGHEGRTLGEVLGELPELDGVLPKERWEALLDPARATGVAGALVDGALARRRPPAR
>K4QXN8 1.14.13.248~~~~~~L-aspartate N-monooxygenase (nitrosuccinate-forming)~~~COG4529
MTTRQHTVCVIGAGPRGLSVIERLCAAARAAAPDTAIGIHVVDPHAPGAGQVWRTSQSAHLLMNTVAGQISVFTDASVEL
TGPLEPGPSLHAWAEALLAGDIPGHYPDQVLDEARALGPDTYPTRAFYGHYLRWAFDRTVAGAPDCVTVTWHRSRAVALD
DEAVALATGHPPRQRVTLEDGEVLEHLDAVVLSQGHLPARPTAVEEQFAALAREHGHTHLPPANPADTELECVQPGQPVL
LRGLGLNFFDYMALLTTGRGGSFTRVYGRLVYLPSGREPRMYAGSRRGVPYQARGENEKGPHGRHEPLLLTPARIARLQA
ARGTDGADFQRDVWPLVAKEVETVYYRTLLTARGRGDRAESFQRAFLRALPGTSAEETVLDAYEIGEKDRWDWDRLSRPY
QDQEFGSPEDFTAWLLDHLREDVAEARSGNVSGPLKAALDVLRDLRNEIRLVVDHGGLHGDSHREHLDRWYTPLNAFLSI
GPPASRIEEAAALIEAGVLHIIGPDLRVDVDRQGFHAHSPLVPGSRISAEVLIEARLPDITLSRTEDPLLRRLLDSGQCA
THRIATRNGTWIYTEGVAVTPRPFQLLDAARRPHPRRYAFGVPTESVHWVTAAGIRPGVNSVTLTDSDAIAQAALGAVLE
QRDSLERTAA
>A0A0K2JL70 1.14.13.248~~~creE~~~L-aspartate N-monooxygenase (nitrosuccinate-forming)~~~
MSVRRLTVCIVGAGPRGLSVLERFCAHERKSASHPAVTVHVVDPARPGAGRVWRTGQPRQLLMNTVASQVTVFTDGSVDM
AGPVEAGPSLHEWARELAALTPVEELLGGHDDATLAEARALGADSYPTRAFYGCYLEEMFRRVVCGAPAHLEVRVHRSTA
VSLADETPGSGGAQSLLLADGTRLAGLDAVVLALGHVRAEEPGAPDPRAAALGLAHFPPANPADLDLSGIAPGTPVLLRG
LGLNFFDHMALFTLGRGGAFSRRPHGLRYHPSGLEPRLYAGSRRGVPYHARGENEKGVDGRHTPLLLTPERIAELTGRHR
EGPGLSFLRTLWPLIAREVECVYYGTLLASRGRAAERDAFVTAYLAGGDDTDRGGVLERFGIGPADRWCWERTASPHPRH
GFTGPDGHRRWLLEHLAQDVRRARAGNVSDPHKAALDVLRDLRNEIRLVVDHGGLDGLSHRDDLDGWYTGLNAFLSIGPP
ASRIEEMAALIEAGVLDVVGPGLEVDIDEADAAFVARSPLVPGRPVRAHVLIEARLPVTDLRRTADPLLRDLLRSGQCRS
YRIPAGRAPEGYETGGLEVTRRPYRLVDALGRAHPRRFAFGVPTEAVHWVTAAGARPGVNSVTLGDADAIAHAVASLTPA
AAPRLPAYEDPGVRCPSDDRLTEVTA
>A0A0K2JLU1 6.7.1.1~~~creM~~~3-amino-2-hydroxy-4-methoxybenzoate diazotase~~~
MHALLRRVTRGNGFYIGDLFHAAAGHDATTPVTLDQPLQYAPELGTRFTVGQLAEQTDELAARLWAAGVRPTERVALYKK
DNFDIALHACAAARIGAVPALLSPALEAPVVRTLLDRLDGPWLLTDATTLAGPLGTVLAPASVRAVLLTTGTPAEGALPA
AGATGPAREGDAPPPAPVVRLADHRGAPKRPPVFLHPRQPSLITHSSGTTGVPKLAVHCPEAGWHRLVPQKVVSWPIRGK
EKAALCMTFVHSRFYQGLAMFLSHGNPMLIAVDPDPSRIGPLFARERPGYIETHPNTYVDWEALADAPGEPLAGVRVFGA
TFDAMHPRTIQRMLGASRRTRPLFVQFYGQSEIGPMAGRWYTRRSAARMNGRCVGLPLPGFISLRVVDDAGKRLRGGRTG
HLEVRSRTRILTYLGEDQRYAEQLHDGWWRVGDMGRRDRWGLLHLLDREVDRIGDLESSLAVEDLLMSRLEELREVVLVP
GADGEPVPVVATVDESPLDAARWQRATSDLPTMAPVRQFRFEDLPRTSTRKIQKPELARLIQGVRATDQGVGA
>A0A0K2JL91 2.1.1.380~~~creN~~~3-amino-4-hydroxybenzoate 4-O-methyltransferase~~~
MTVPENAQHTAPDQTQHTAPDRTRQAQQAAPDTAGRRLIELMAGFWKTQAIYLAAESGLVDAIAAAGRAPAVELANRTGT
DPDALGRLLLFLESLDVVSGEDPAGYALTPVGELLRTGTQDSMRDHVRIYGSHFYRAWGALDHSLRTGRSAFTEVYGSDL
FRYLNQHPDLSLTYERAMVAGTPFFAQVPEVHDFSGARLIVDVAGGHGALLHEILKSCPEPRAVLFDAPHVIAETADRPI
ASEHGDRVTLVPGDFFEGVPQGGDVYLLSRILHCFDDEACLRILAHCRSAMAPGGRLVVVERLLTRGTGSSLAQGYNMHM
LVVLGGGRERDEDAYRTLLEKAGFQLDSVTTLPLETHLMAATLRR
>P0DM85 ~~~crfC~~~Clamp-binding protein CrfC~~~COG0699
MYTQTLYELSQEAERLLQLSRQQLQLLEKMPLSVPGDDAPQLALPWSQPNIAERHAMLNNELRKISRLEMVLAIVGTMKA
GKSTTINAIVGTEVLPNRNRPMTALPTLIRHTPGQKEPVLHFSHVAPIDCLIQQLQQRLRDCDIKHLTDVLEIDKDMRAL
MQRIENGVAFEKYYLGAQPIFHCLKSLNDLVRLAKALDVDFPFSAYAAIEHIPVIEVEFVHLAGLESYPGQLTLLDTPGP
NEAGQPHLQKMLNQQLARASAVLAVLDYTQLKSISDEEVREAILAVGQSVPLYVLVNKFDQQDRNSDDADQVRALISGTL
MKGCITPQQIFPVSSMWGYLANRARYELANNGKLPPPEQQRWVEDFAHAALGRRWRHADLADLEHIRHAADQLWEDSLFA
QPIQALLHAAYANASLYALRSAAHKLLNYAQQAREYLDFRAHGLNVACEQLRQNIHQIEESLQLLQLNQAQVSGEIKHEI
ELALTSANHFLRQQQDALKVQLAALFQDDSEPLSEIRTRCETLLQTAQNTISRDFTLRFAELESTLCRVLTDVIRPIEQQ
VKMELSESGFRPGFHFPVFHGVVPHFNTRQLFSEVISRQEATDEQSTRLGVVRETFSRWLNQPDWGRGNEKSPTETVDYS
VLQRALSAEVDLYCQQMAKVLAEQVDESVTAGMNTFFAEFASCLTELQTRLRESLALRQQNESVVRLMQQQLQQTVMTHG
WIYTDAQLLRDDIQTLFTAERY
>P9WP57 ~~~crgA~~~Cell division protein CrgA~~~
MPKSKVRKKNDFTVSAVSRTPMKVKVGPSSVWFVSLFIGLMLIGLIWLMVFQLAAIGSQAPTALNWMAQLGPWNYAIAFA
FMITGLLLTMRWH
>Q9XA10 ~~~crgA~~~Cell division protein CrgA~~~
MPKSRIRKKADYTPPPSKQATSIKLTSRGWVAPVMLAMFVIGLAWIVVFYVTDGSLPIDSLGNWNIVVGFGFIAAGFGVS
TQWK
>Q55804 3.6.4.13~~~crhR~~~RNA helicase CrhR~~~COG0513
MTNTLTSTFADLGLSEKRCQLLADIGFEAPTQIQTEAIPLLLSGRDMLAQSQTGTGKTAAFALPLMDRIDPEGDLQALIL
TPTRELAQQVAEAMKDFSHERRLFILNVYGGQSIERQIRSLERGVQIVVGTPGRVIDLIDRKKLKLETIQWVVLDEADEM
LSMGFIDDVKTILRKTPPTRQTACFSATMPREIKELVNQFLNDPALVTVKQTQSTPTRIEQQLYHVPRGWSKAKALQPIL
EMEDPESAIIFVRTKQTAADLTSRLQEAGHSVDEYHGNLSQSQRERLVHRFRDGKIKLVVATDIAARGLDVNNLSHVVNF
DLPDNAETYIHRIGRTGRAGKTGKAIALVEPIDRRLLRSIENRLKQQIEVCTIPNRSQVEAKRIEKLQEQLKEALTGERM
ASFLPLVRELSDEYDAQAIAAAALQMIYDQSCPHWMKSDWEVPEVDFNKPVLRRGRNAGGGQNKSGGGYQGKPGKPRRSS
GGRRPAYSDRQQ
>O06976 ~~~crh~~~HPr-like protein Crh~~~COG1925
MVQQKVEVRLKTGLQARPAALFVQEANRFTSDVFLEKDGKKVNAKSIMGLMSLAVSTGTEVTLIAQGEDEQEALEKLAAY
VQEEV
>P9WPG5 2.7.8.41~~~pgsA2~~~Putative cardiolipin synthase~~~COG0558
MEPVLTQNRVLTVPNMLSVIRLALIPAFVYVVLSAHANGWGVAILVFSGVSDWADGKIARLLNQSSRLGALLDPAVDRLY
MVTVPIVFGLSGIVPWWFVLTLLTRDALLAGTLPLLWSRGLSALPVTYVGKAATFGFMVGFPTILLGQCDPLWSHVLLAC
GWAFLIWGMYAYLWAFVLYAVQMTMVVRQMPKLKGRAHRPAAQNAGERG
>P24251 ~~~crl~~~Sigma factor-binding protein Crl~~~
MTLPSGHPKSRLIKKFTALGPYIREGKCKDNRFFFDCLAVCVNVKPAPEVREFWGWWMELEAQESRFTYSYQFGLFDKAG
DWKSVPVKDTEVVERLEHTLREFHEKLRELLTTLNLKLEPADDFRDEPVKLTA
>Q7CR52 ~~~crl~~~Sigma factor-binding protein Crl~~~
MTLPSGHPKSRLIKKFTALGPYIREGQCEDNRFFFDCLAVCVNVKPAPEKREFWGWWMELEAQEKRFTYRYQFGLFDKEG
NWTVVPINETEVVERLEYTLREFHEKLRDLLISMELALEPSDDFNDEPVKLSA
>P83772 3.5.2.10~~~crnA~~~Creatinine amidohydrolase~~~
MSKSVFVGELTWKEYEARVAAGDCVLMLPVGALEQHGHHMCMNVDVLLPTAVCKRVAERIGALVMPGLQYGYKSQQKSGG
GNHFPGTTSLDGATLTGTVQDIIRELARHGARRLVLMNGHYENSMFIVEGIDLALRELRYAGIQDFKVVVLSYWDFVKDP
AVIQQLYPEGFLGWDIEHGGVFETSLMLALYPDLVDLDRVVDHPPATFPPYDVFPVDPARTPAPGTLSSAKTASREKGEL
ILEVCVQGIADAIREEFPPT
>Q73WB6 1.11.1.-~~~~~~Catalase-related peroxidase~~~COG0753
MSGGLTPDQAIDAIRGTGGAQPGCRALHAKGTLYRGTFTATRDAVMLSAAPHLDGSTVPALIRFSNGSGNPKQRDGAPGV
RGMAVKFTLPDGSTTDVSAQTARLLVSSTPEGFIDLLKAMRPGLTTPLRLATHLLTHPRLLGALPLLREANRIPASYATT
EYHGLHAFRWIAADGSARFVRYHLVPTAAEEYLSASDARGKDPDFLTDELAARLQDGPVRFDFRVQIAGPTDSTVDPSSA
WQSTQIVTVGTVTITGPDTEREHGGDIVVFDPMRVTDGIEPSDDPVLRFRTLVYSASVKLRTGVDRGAQAPPV
>Q79VI7 ~~~glxR~~~CRP-like cAMP-activated global transcriptional regulator~~~COG0664
MEGVQEILSRAGIFQGVDPTAVNNLIQDMETVRFPRGATIFDEGEPGDRLYIITSGKVKLARHAPDGRENLLTIMGPSDM
FGELSIFDPGPRTSSAVCVTEVHAATMNSDMLRNWVADHPAIAEQLLRVLARRLRRTNASLADLIFTDVPGRVAKTLLQL
ANRFGTQEAGALRVNHDLTQEEIAQLVGASRETVNKALATFAHRGWIRLEGKSVLIVDTEHLARRAR
>P9WMH2 ~~~crp~~~CRP-like cAMP-activated global transcriptional regulator~~~
MDEILARAGIFQGVEPSAIAALTKQLQPVDFPRGHTVFAEGEPGDRLYIIISGKVKIGRRAPDGRENLLTIMGPSDMFGE
LSIFDPGPRTSSATTITEVRAVSMDRDALRSWIADRPEISEQLLRVLARRLRRTNNNLADLIFTDVPGRVAKQLLQLAQR
FGTQEGGALRVTHDLTQEEIAQLVGASRETVNKALADFAHRGWIRLEGKSVLISDSERLARRAR
>P9WMH3 ~~~crp~~~CRP-like cAMP-activated global transcriptional regulator~~~COG0664
MDEILARAGIFQGVEPSAIAALTKQLQPVDFPRGHTVFAEGEPGDRLYIIISGKVKIGRRAPDGRENLLTIMGPSDMFGE
LSIFDPGPRTSSATTITEVRAVSMDRDALRSWIADRPEISEQLLRVLARRLRRTNNNLADLIFTDVPGRVAKQLLQLAQR
FGTQEGGALRVTHDLTQEEIAQLVGASRETVNKALADFAHRGWIRLEGKSVLISDSERLARRAR
>P0ACJ8 ~~~crp~~~DNA-binding transcriptional dual regulator CRP~~~COG0664
MVLGKPQTDPTLEWFLSHCHIHKYPSKSTLIHQGEKAETLYYIVKGSVAVLIKDEEGKEMILSYLNQGDFIGELGLFEEG
QERSAWVRAKTACEVAEISYKKFRQLIQVNPDILMRLSAQMARRLQVTSEKVGNLAFLDVTGRIAQTLLNLAKQPDAMTH
PDGMQIKITRQEIGQIVGCSRETVGRILKMLEDQNLISAHGKTIVVYGTR
>P29281 ~~~crp~~~cAMP-activated global transcriptional regulator CRP~~~COG0664
MSNELTEIDEVVTSSQEEATQRDPVLDWFLTHCHLHKYPAKSTLIHAGEDATTLYYVIKGSVMVSSKDDEGKEMILTYLG
AGQFFGEAGLFDEGSKRSAWVKTKTTCEIAEISYKKYRQLIQANPEILMFLTAQLARRLQNTSRQVTNLAFLDVAGRIAQ
TLMNLAKQPEAMTHPDGMQIKITRQEIGQMVGCSRETVGRIIKMLEDQNLIHAHGKTIVVYGAR
>P0A2T7 ~~~crp~~~cAMP-activated global transcriptional regulator CRP~~~
MVLGKPQTDPTLEWFLSHCHIHKYPSKSTLIHQGEKAETLYYIVKGSVAVLIKDEEGKEMILSYLNQGDFIGELGLFEEG
QERSAWVRAKTACEVAEISYKKFRQLIQVNPDILMRLSSQMARRLQVTSEKVGNLAFLDVTGRIAQTLLNLAKQPDAMTH
PDGMQIKITRQEIGQIVGCSRETVGRILKMLEDQNLISAHGKTIVVYGTR
>P0A2T6 ~~~crp~~~cAMP-activated global transcriptional regulator CRP~~~
MVLGKPQTDPTLEWFLSHCHIHKYPSKSTLIHQGEKAETLYYIVKGSVAVLIKDEEGKEMILSYLNQGDFIGELGLFEEG
QERSAWVRAKTACEVAEISYKKFRQLIQVNPDILMRLSSQMARRLQVTSEKVGNLAFLDVTGRIAQTLLNLAKQPDAMTH
PDGMQIKITRQEIGQIVGCSRETVGRILKMLEDQNLISAHGKTIVVYGTR
>Q5SID7 ~~~~~~Cyclic AMP receptor protein~~~COG0664
MKGSPLFHGLAPEEVDLALSYFQRRLYPQGKPIFYQGDLGQALYLVASGKVRLFRTHLGGQERTLALLGPGELFGEMSLL
DEGERSASAVAVEDTELLALFREDYLALIRRLPLVAHNLAALLARRLREADLELDLLSFEEARNRVAYALLKLLRQGLGP
LFQIRHHELAALAGTSRETVSRVLHALAEEGVVRLGPGTVEVREAALLEEIAFGLA
>P17055 1.14.15.9~~~crtA~~~Spheroidene monooxygenase~~~
MPVASLSLFRFDGTSSLPWVISQMILSRRPLNDEPRVKFYKLCGSGTGEGFTPKPNWRVWAIMAAFDTEADARDVTANHP
VWKRWRAHAAETLVLHLQPLSARGTWGGVNPFLPEQVAEPSPDEPVVALTRAAIKPHKANAFWSRVPKISEKVGEDQNLM
FKIGIGEIPLFHQVTFSIWPDVAKMNAFARGDTPHGKAIRAAREEGWFTEELYARFRLLGTEGSWMGKDPLASKVLERET
A
>D5KXJ0 2.5.1.32~~~crtB~~~15-cis-phytoene synthase~~~COG1562
MEVGSKSFATASKLFDAKTRRSVLMLYAWCRHCDDVIDDQVLGFSNDTPSLQSAEQRLAQLEMKTRQPMRIQMHEPAFAA
FQEVAMAHDILPAYAFDHLAGFAMDVHETRYQTLDDTLRYCYHVAGVVGLMMAQIMGVRDNATLDRACDLGLAFQLTNIA
RDIVEDAEAGRCYLPAAWLAEEGLTRENLADPQNRKALSRVARRLVETAEPYYRSASAGLPGLPLRSAWAIATAQQVYRK
IGMKVVQAGSQAWEQRQSTSTPEKLALLVAASGQAVTSRVARHAPRSADLWQRPV
>P9WHP3 2.5.1.-~~~crtB~~~Phytoene synthase~~~COG1562
MTEIEQAYRITESITRTAARNFYYGIRLLPREKRAALSAVYALGRRIDDVADGELAPETKITELDAIRKSLDNIDDSSDP
VLVALADAARRFPVPIAMFAELIDGARMEIDWTGCRDFDELIVYCRRGAGTIGKLCLSIFGPVSTATSRYAEQLGIALQQ
TNILRDVREDFLNGRIYLPRDELDRLGVRLRLDDTGALDDPDGRLAALLRFSADRAADWYSLGLRLIPHLDRRSAACCAA
MSGIYRRQLALIRASPAVVYDRRISLSGLKKAQVAAAALASSVTCGPAHGPLPADLGSHPSH
>P21683 2.5.1.32~~~crtB~~~15-cis-phytoene synthase~~~
MNNPSLLNHAVETMAVGSKSFATASKLFDAKTRRSVLMLYAWCRHCDDVIDDQTLGFQARQPALQTPEQRLMQLEMKTRQ
AYAGSQMHEPAFAAFQEVAMAHDIAPAYAFDHLEGFAMDVREAQYSQLDDTLRYCYHVAGVVGLMMAQIMGVRDNATLDR
ACDLGLAFQLTNIARDIVDDAHAGRCYLPASWLEHEGLNKENYAAPENRQALSRIARRLVQEAEPYYLSATAGLAGLPLR
SAWAIATAKQVYRKIGVKVEQAGQQAWDQRQSTTTPEKLTLLLAASGQALTSRMRAHPPRPAHLWQRPL
>P54975 2.5.1.32~~~crtB~~~15-cis-phytoene synthase~~~
MSDLVLTSTEAITQGSQSFATAAKLMPPGIRDDTVMLYAWCRHADDVIDGQALGSRPEAVNDPQARLDGLRADTLAALQG
DGPVTPPFAALRAVARRHDFPQAWPMDLIEGFAMDVEARDYRTLDDVLEYSYHVAGIVGVMMARVMGVRDDPVLDRACDL
GLAFQLTNIARDVIDDARIGRCYLPGDWLDQAGARVDGPVPSPELYTVILRLLDAAELYYASARVGLADLPPRCAWSIAA
ALRIYRAIGLRIRKGGPEAYRQRISTSKAAKIGLLGIGGWDVARSRLPGAGVSRQGLWTRPHHA
>P37269 2.5.1.32~~~crtB~~~15-cis-phytoene synthase~~~COG1562
MLQMIPARPSRALASLSEAYEECRQITARYAKTFYLGTLLMPEAKRQAIWAIYVWCRRTDELVDGPQAAQTNFATLDAWE
RRLERLFAGEPEDDCDVALVDTLARYPLDIQPFRDMIEGQRMDLLQNRYSTFEDLYTYCYRVAGTVGLMSQPVMGIESTN
SRAPWDPTTPPDPTQEALALGIANQLTNILRDVGEDARRGRIYLPQEELAQFNYSEQDLFNGVIDDRWRAFMQFQLDRAR
DYFEQAERGIRQLSHDARWPVWASLMLYREILDVIEQNNYDVFRKRAYVPTWRKLCSLPVAMLRATVL
>P37294 2.5.1.32~~~crtB~~~15-cis-phytoene synthase~~~COG1562
MANGQISPQRAVTKPQSWWLTSEPRPSLMLQLPKPSPSAKPCASVEEAYEICRQVTAQYAKTFYLGTLLMPPAKRRAIWA
IYLWCRRTDELVDGPQAATTTPETLDHWERRLEGIFAGQPQDDADVALVDTLETFPLDIQPFRDMIAGQRMDLYRSRYQT
FEELDLYCYRVAGTVGLMSSAVLGVDTGNGQAPWQPDAVYIPQEEAIALGVANQLTNILRDVGEDVERGRIYLPLEDLER
FNYSEQDLLNGVNDDRWRSLMKFEIDRARHYFEDAERGIRALNRDARWPVWTALMLYKGILDVIEANNYNVFNRRAYVPT
PKKLLYLPVAWLRAQVL
>P17058 4.2.1.131~~~crtC~~~Acyclic carotenoid 1,2-hydratase~~~COG5621
MIAFIGSVFSPWYRWSGRREPQNHCCINMVTTGTDGRFTMTDRGRSALRQSRDSFQVGPSKLTWTGKELVIDVDEWGALP
KLGKLKGRVVLTPRAVTGVEVRLTPDAGHTWRPFAPIADVEVDLAPGHKWTGHGYFDANFGTRALEEDFSFWTWGRFPLK
DRTVCFYDATRLDRTKVALAVQINPDGSVCEIDSPPPLVKMKRTPWFVRRETRCDAGAHPAGVYSLLEAPFYSRALMRTQ
IDGEETVGMHEALDLVRFRQPVLKPMLAVRVPRRAGWKHKD
>P95619 4.2.1.131~~~crtC~~~Acyclic carotenoid 1,2-hydratase~~~
MRAAESGADARVRPVDRVEPADAPAGDAGGLRAAVPGDGGSAVRPGDARLDVLVPPGLVDEPAAGALPGGGQRAPGAGRA
DGGDVRPVGGRDADGAPRFDQPVPPGGYLWWYVDAVSDDGRHGLTFIAFVGSVFSPYYAWAGGPKADRADPENHCALNIA
LYGDAGKRWTMTERGRRWMRRSRDEFVIGPSRLHWDGESLLVEFDEVGVPIPRRVKGRVRVWPKALCRFVTSLDSGGRHR
WGPIAPCSRIEVELDSPRVRWSGHAYLDSNEGDEPIDRPFREWDWSRATMADSSTAVIYDVRQKRDGDRVIAERFLLDGS
TESFEAPPRQPLPTTLWRIGRTMRTEPGVPALVEQTLEDTPFYARSMVRSGLLGEVVTSVHETMLLPRVITLPVRLMLPW
RMPRRA
>Q7X3G5 4.2.1.131~~~crtC~~~Acyclic carotenoid 1,2-hydratase~~~
MRAAGILTPGALWAPGPSDTRDERRHDAGRLQPALPGDGRGPLRPGVTRMEGLLHASSVAQQGAGALSGRGERASGSRGT
DGGHVGTPGGSDPARGPRFDLRITPGGYLWWYLDALSDDGDHGLTIIAMLGSVFSPYYAWARRRGNPDPLNHCALNVALY
GKAGKRWTMTERGRKALRQAPGRLDIGPSHLTWDGTALTIDVNEITAPIPSRVRGRIRVIPAAVNAREFTLDPAERHVWW
PIAPISRVEVDLEKPALRWSGHGYLDSNRGEEPLEDAFQCWDWSRANTPSGTTMLYDVTARHGTGASLALRFNASGEVEE
FPPPPRVRLPTTGIWRIKRGTQCEAGHQARVVETLEDTPFYARSLVETRLAGETATCVHESLSLDRFASPVVQLMLPFRM
PRVGG
>Q7WT72 1.3.99.27~~~crtD~~~1-hydroxycarotenoid 3,4-desaturase~~~
MKKAIIIGSGIAGLAAALRLKKKGYQVSVFEKNDYAGGKLHAIELGGYRFDLGPSLFTLPHLIDELHQLFPDVEIDFNYI
KKKTACHYFWEDGTSFEAPADLENFAVKAAEIFDEKQNTLSKYLQNSKMKYESTKSLFLEKSLHKSNTYFSKQTLKAILK
IPFLGINNTLDQENKKFSNPKLNQLFNRYATYNGSSPYLTPGIMSMIPYLELGLGTYYPQGGMHRISQSLFELAQKVGVE
FRFRKNVKKINHSNNKVTGVTTEKGTHDADIVLCNMDVFPTYRQLLQDIKAPEKTLKQERSSSALIFYWGIKKSFPQLDL
HNILFSENYKAEFEAIFNNKSLYEDPTVYINITSKQSPQDAPKGCENWFVMINTPGDYGQNWENLVIKAKKNILSKIKRC
LNIDVEELIDVEYVLTPQGIEKNTSSYRGALYGAASNNKFAAFLRHPNFNKTIGNLYHVGGSVHPGGGIPLCLLSAKITA
DLIPNTNA
>L7WC64 1.3.99.27~~~crtD~~~1-hydroxycarotenoid 3,4-desaturase~~~COG1233
MKNAIVIGAGIGGLAAALRLRHQGYSVTIFEKNDYAGGKLHAIEKDGYRFDLGPSLFTLPHLVENLFALFPEEIIDFGYK
SKAISFHYFWDDGTLFKASTDSSQFIEDASKVFKEEKSTIKKYLAKSKSKYELTKSLFLEKSLHKATTYFSLDTVKAIVH
APFLGLNNTLNDENSKFKNPKLTQLFNRYATYNGSSPYQTPGIMTMIPHLELGLGTYYPDGGMHRISQSLFELAQKVGVK
FRFRESVTNITTSKNKVTGVETKNGSYLSDLVVSNMDIVPTYRNLMKDVPAPEKTLSQERSSSALIFYWGIDREFPELDL
HNILFSEDYKTEFEHIFEHKTLAQDPTVYINITSKESSNDAPAGHENWFVMINAPGDYGQDWEQLVEESKKQIIAKIKKC
LHVDISKHITTEYILTPQGIEKNTSSYRGALYGAASNNKFAAFLRHPNFNGKIKNLYHVGGSVHPGGGIPLCLLSAQITA
DLIQKEQ
>Q01671 1.3.99.27~~~crtD~~~Hydroxyneurosporene desaturase~~~COG1233
MRQIVPKVVVVGAGMGGLASAIRLARAGCEVTLLEAREAPGGRMRTLPSVAGPVDAGPTVLTLREVFDDIFEVCGQKLDH
HLTLLPQPLLARHWWLDGSTLDLTTDLEANVEAVAAFAGAREARAFRRFHDLSARLYDAFDRPMMRAARPDLRAIATGAL
KAPRTWPALLPGMTLDRLLRLFFRDRRLRQLFGRYATYVGGTPYGAPGVLALIWAAEARGVWAIEGGMHRLALALARLAD
DQGVRLRYGAPVARILRQGGRATGVQLADGRTLPADHIVFNGDPAALLAGCLGDGPQDAVPEDRIHPRSLSAWVWSYAAR
ASGPPLVHHNVFFADDPRREFGPIAAGQMPEDATLYICAEDRSGGQLPDGPERFEIIMNGPPGRPAKPEDFAQCRSRTFD
RLRQFGLTFDPVPGETSLTAPSGFASLFPASQGSIYGLSPHGALASLKRPLARTALPGLWLAGGGAHPGAGVPMAALSGR
HAAEAILADLASRSR
>Q9JPB5 1.3.99.27~~~crtD~~~Hydroxyneurosporene desaturase~~~
MAEPLRTHRVVVVGAGIAGLTSALLLAARGLDVTLVDKAATPGGKMRQVMVDGAPVDAGPTVFTMRWVFDQIFAAAGATV
EEHLKLQPLGVLARHAWRGHEPRLDLFADIRRSAEAIGEFSGPQEAQRFLGFCRQARQLYDHLEGPYIRSERPTLGSMVG
DLGPRGLMALMQIGPFSNLWRSLSRHFRDPKLQQLFARYATYCGASPWMAPATLMLVAQVELDGVWAVEGGMHAVAKAFS
ALAEARGVKTRYGCGCEQILVRDGRAVGVRLAGGEEITADSVVFNGDVNALAQGLLGDPPRRATAAVAPARRSLSALTWL
VNARTSGFPLVRHNVFFDEDYASEFDDIFRQRQLPRRGTVYVCAQDRTDEGIGSDAPERLLCLVNAPADGDRRPFDHSET
DPCEQRSLALMRECGLTIDWSPQTHRLVTPANFERLFPATGGALYGPATHGWMSSFHRASSTSRLPGLYLAGGSVHPGPG
VPMAAMSGRLAAETLMAHLDSTSRSRRVVISGGMSTRSATTGGMA
>Q93QX2 2.5.1.149~~~crtEb~~~Lycopene elongase/hydratase~~~
MMEKIRLILLSSRPISWVNTAYPFGLAYLLNAGEIDWLFWLGIVFFLIPYNIAMYGINDVFDYESDIRNPRKGGVEGAVL
PKSSHSTLLWASAISTIPFLVILFIFGTWMSSLWLTISVLAVIAYSAPKLRFKERPFIDALTSSTHFTSPALIGATITGT
SPSAAMWIALGSFFLWGMASQILGAVQDVNADREANLSSIATVIGARGAIRLSVVLYLLAAVLVTTLPNPAWIIGIAILT
YVFDAARFWNITDASCEQANRSWKVFLWLNYFVGAVITILLIAIHQI
>P22873 2.5.1.29~~~crtE~~~Geranylgeranyl diphosphate synthase~~~
MVSGSKAGVSPHREIEVMRQSIDDHLAGLLPETDSQDIVSLAMREGVMAPGKRIRPLLMLLAARDLRYQGSMPTLLDLAC
AVELTHTASLMLDDMPCMDNAELRRGQPTTHKKFGESVAILASVGLLSKAFGLIAATGDLPGERRAQAVNELSTAVGVQG
LVLGQFRDLNDAALDRTPDAILSTNHLKTGILFSAMLQIVAIASASSPSTRETLHAFALDFGQAFQLLDDLRDDHPETGK
DRNKDAGKSTLVNRLGADAARQKLREHIDSADKHLTFACPQGGAIRQFMHLWFGHHLADWSPVMKIA
>B1XJV9 2.5.1.29~~~crtE~~~Geranylgeranyl pyrophosphate synthase~~~COG0142
MVVADAHTQGFSLAQYLQEQKTIVETALDQSLVITEPVTIYEAMRYSLLAGGKRLRPILCLAACEMLGGTAAMAMNTACA
LEMIHTMSLIHDDLPAMDNDDLRRGKPTNHKVYGEDIAILAGDALLSYAFEYVARTPDVPAERLLQVIVRLGQAVGAEGL
VGGQVVDLESEGKTDVAVETLNFIHTHKTGALLEVCVTAGAILAGAKPEEVQLLSRYAQNIGLAFQIVDDILDITATAEE
LGKTAGKDLEAQKVTYPSLWGIEKSQAEAQKLVAEAIASLEPYGEKANPLKALAEYIVNRKN
>D5AP78 2.1.1.210~~~crtF~~~Demethylspheroidene O-methyltransferase~~~COG0500
MPKDDHTGATADRTAQPTGTGKQPLVPGQPGAAPVQPGRVNFFTRIALSQRLHEIFERLPLMNRVTRREGEALFDIVSGF
VQSQVLLAIVEFRVLHILAGASWPLPQLAERTGLAEDRLAVLMQAAAALKLVKFRRGLWQLAPRGAAFITVPGLEAMVRH
HPVLYRDLADPVAFLKGDIEPELAGFWPYVFGPLAQEDAGLAERYSQLMADSQRVVADDTLRLVDLRDAKRVMDVGGGTG
AFLRVVAKLYPELPLTLFDLPHVLSVADRFSPKLDFAPGSFRDDPIPQGADVITLVRVLYDHPDSVVEPLLAKVHAALPP
GGRLIISEAMAGGAKPDRACDVYFAFYTMAMSSGRTRSPEEIKQMLEKAGFTKVSKPRTLRPFITSVIEAERG
>Q02861 1.3.99.26~~~carC~~~All-trans-zeta-carotene desaturase~~~
MASEGGSVRHVIVVGAGPGGLSAAINLAGQGFRVTVVEKDAVPGGRMKGLTLGASGEYAVDTGPSILQLPGVLEQIFRRA
ARRLEDYVKLLPLDVNTRVHFWDGTHLDTTRHLDRMEAELAKFGPRQASALRQWMEDGREKYGIAYQKFICTSADNLGYY
APWRLAPTLRFKPWQTLYRQLDGFFHDDRVTYALAYPSKYLGLHPTTCSSVFSVIPFLELAFGVWHVEGGFRELSRGMMR
CARDLGATFRMGTPVEKVRVDAGRAVGVKLVGGEVLDADAVVVNADLAYAARSLIPAEAREGSRLTDAALERAKYSCSTF
MAYYGLDTVYADLPHHLIYLSESARRTDRDALEDRHVDLEDPPFYVCNPGVTDPSGAPAGHSTLYVLVPTPNTGRPVDWV
KTEQALRERIPAMLEKVGLKGVREHIREERYFTAETWRDDFNVFRGAVFNLSHTWLQLGPLRPKVKNRDIEGLYFVGGGT
HPGSGLLTIMESANIAADYLTREAGKGPLPGWPYVPPLEPESPVQARAG
>P21685 1.3.99.31~~~crtI~~~Phytoene desaturase (lycopene-forming)~~~
MKPTTVIGAGFGGLALAIRLQAAGIPVLLLEQRDKPGGRAYVYEDQGFTFDAGPTVITDPSAIEELFALAGKQLKEYVEL
LPVTPFYRLCWESGKVFNYDNDQTRLEAQIQQFNPRDVEGYRQFLDYSRAVFKEGYLKLGTVPFLSFRDMLRAAPQLAKL
QAWRSVYSKVASYIEDEHLRQAFSFHSLLVGGNPFATSSIYTLIHALEREWGVWFPRGGTGALVQGMIKLFQDLGGEVVL
NARVSHMETTGNKIEAVHLEDGRRFLTQAVASNADVVHTYRDLLSQHPAAVKQSNKLQTKRMSNSLFVLYFGLNHHHDQL
AHHTVCFGPRYRELIDEIFNHDGLAEDFSLYLHAPCVTDSSLAPEGCGSYYVLAPVPHLGTANLDWTVEGPKLRDRIFAY
LEQHYMPGLRSQLVTHRMFTPFDFRDQLNAYHGSAFSVEPVLTQSAWFRPHNRDKTITNLYLVGAGTHPGAGIPGVIGSA
KATAGLMLEDLI
>P17054 1.3.99.28~~~crtI~~~Phytoene desaturase (neurosporene-forming)~~~COG1233
MSKNTEGMGRAVVIGAGLGGLAAAMRLGAKGYKVTVVDRLDRPGGRGSSITKGGHRFDLGPTIVTVPDRLRELWADCGRD
FDKDVSLVPMEPFYTIDFPDGEKYTAYGDDAKVKAEVARISPGDVEGFRHFMWDAKARYEFGYENLGRKPMSKLWDLIKV
LPTFGWLRADRSVYGHAKKMVKDDHLRFALSFHPLFIGGDPFHVTSMYILVSQLEKKFGVHYAIGGVQAIADAMAKVITD
QGGEMRLNTEVDEILVSRDGKATGIRLMDGTELPAQVVVSNADAGHTYKRLLRNRDRWRWTDEKLDKKRWSMGLFVWYFG
TKGTAKMWKDVGHHTVVVGPRYKEHVQDIFIKGELAEDMSLYVHRPSVTDPTAAPKGDDTFYVLSPVPNLGFDNGVDWSV
EAEKYKAKVLKVIEERLLPGVAEKITEEVVFTPETFRDRYLSPLGAGFSLEPRILQSAWFRPHNASEEVDGLYLVGAGTH
PGAGVPSVIGSGELVAQMIPDAPKPETPAAAAPKARTPRAKAAQ
>P54979 1.3.99.29~~~carA2~~~zeta-carotene-forming phytoene desaturase~~~
MSASTQGRRIVVVGAGVGGLAAAARLAHQGFDVQVFEKTQGPGGRCNRLQVDGFTWDLGPTIVLMPEVFEETFRAVGRRI
EDYLTLLRCDPNYRVHFRDRSDVTFTSELCAMGRELERVEPGSYARYLAFLAQGRVQYRTSLDHLVGRNYAGLRDYLSPR
VLARIFQVRAHRRMYADVSRFFQDERLRAAMTFQTMYLGVSPYASPAVYGLLPFTELGVGIWFPKGGLYAIPQALERLAR
EEGVRFHYGAPVERILTDGGRTRGVRLEGGEVVEADAVLCNADLPYAYEKLLDPKATTLKRKEKLRYTSSGYMLYLGMKR
RYPELLHHNVVFGRDYKGSFDDIFEFRVPEDPSFYVNAPTRTDASLAPEGKDALYVLVPVPHQHPDLDWKVEGPKVRAKF
FARMAELGFPSLESDIEVERRSSTPDDWAGTFNLARGSGFGLSQNFTQIGPFRPSNQDARVKNLFFVGASTQPGTGLPTV
LISARLVTERLMTWAHAQGVSLSPRTAAATPLEGVAA
>Q2FV59 2.5.1.96~~~crtM~~~4,4'-diapophytoene synthase~~~COG1562
MTMMDMNFKYCHKIMKKHSKSFSYAFDLLPEDQRKAVWAIYAVCRKIDDSIDVYGDIQFLNQIKEDIQSIEKYPYEHHHF
QSDRRIMMALQHVAQHKNIAFQSFYNLIDTVYKDQHFTMFETDAELFGYCYGVAGTVGEVLTPILSDHETHQTYDVARRL
GESLQLINILRDVGEDFDNERIYFSKQRLKQYEVDIAEVYQNGVNNHYIDLWEYYAAIAEKDFQDVMDQIKVFSIEAQPI
IELAARIYIEILDEVRQANYTLHERVFVDKRKKAKLFHEINSKYHRI
>O07854 2.5.1.96~~~crtM~~~4,4'-diapophytoene synthase~~~
MTMMDMNFKYCHKIMKKHSKSFSYAFDLLPEDQRKAVWAIYAVCRKIDDSIDVYGDIQFLNQIKEDIQSIEKYPYEHHHF
QSDRRIMMALQHVAQHKNIAFQSFYNLIDTVYKDQHFTMFETDAELFGYCYGVAGTVGEVLTPILSDHETHQTYDVARRL
GESLQLINILRDVGEDFDNERIYFSKQRLKQYEVDIAEVYQNGVNNHYIDLWEYYAAIAEKDFQDVMDQIKVFSIEAQPI
IELAARIYIEILDEVRQANYTLHERVFVDKRKKAKLFHEINSKYHRI
>A9JQL9 2.5.1.96~~~crtM~~~4,4'-diapophytoene synthase~~~
MTMMDMNFKYCHKIMKKHSKSFSYAFDLLPEDQRKAVWAIYAVCRKIDDSIDVYGDIQFLNQIKEDIQSIEKYPYEYHHF
QSDRRIMMALQHVAQHKNIAFQSFYNLIDTVYKDQHFTMFETDAELFGYCYGVAGTVGEVLTPILSDHETHQTYDVARRL
GESLQLINILRDVGEDFENERIYFSKQRLKQYEVDIAEVYQNGVNNHYIDLWEYYAAIAEKDFRDVMDQIKVFSIEAQPI
IELAARIYIEILDEVRQANYTLHERVFVEKRKKAKLFHEINSKYHRI
>P0DPF0 1.2.99.10~~~crtNc~~~4,4'-diapolycopene-4,4'-dial dehydrogenase~~~
MPDNDSHSLKSLPERQREDLFSAGSPSLEARKKQLSRLKTMIVDHEEAFTRALHADLGKPAFESFSSEIAVLLNEIDHVC
KHIAKWNRQSRSRYLKMGYVESIKRKRHPYGSVLIIGSWNYPLQLSLMPAIGAIAAGNRCVIKPSEHAPATAELLKKIIN
DAFPPEQLLVVTGDAQTASHLTAAPFDLIFFTGSGQTGKAVAEQAARQLTPVILELGGKNPCIIDETGFSKEAVREIVWG
KFLNAGQTCIAPDTLFVHQSVYEKMLNEISAAVSAFYGEQPRESSDYGRICTDDHFQKVIEFIGQGDVRHGGSYDRSDRF
IAPTVLTDIEPGSPILQEEIFGPVLPVIPYTDMRTLLSSGRIQRDALTGYIFSKNKDNIQLFKEHMRSSTISVNQVIHHA
ASPHIAFGGVGTSGYGAYHGKAGFLAFSYEKQNTEHIITSIFKVNSRHILIQI
>P0DPE9 1.2.99.10~~~crtNc~~~4,4'-diapolycopen-4-al dehydrogenase~~~
MKIAIAGGGIGGLISALMLRKKGYEVSLFEKRDRLGGRLAFTEEQGYRIDEGPTIVLLPEMLTSILNEAGISRDQYELIN
INPLYKLHFKDGSSYTKYNSIERQIQEIKENFPGNEEGFVQFMKDMEIRFNLGKDQFLEKSFHDKRTFWTRNNLKTLVHL
KAYKSVNNSLKAYFQDERIRQAYSLQTLYIGGNPLDSPALYSLISFSEHKHGIYYLKGGYASLVTVLENALLNSGVKVMK
NATVERVVTEGEQAAALIVNGEEVKADAFVLNGDFPGASKMIPKESMPARNYTASSSCVLLYFGLDKVYRDSPVHQFFMG
SNFQQHMKEIFETKEVPSDPSIYAFHPSVIDSSLAPEGHGVLYALVPVPSGSPINWGEQEGFVEKVIDQLEERGFPGLRK
SIQWKKVRTPDDKEMEGLFQGGSFGIAPTLFQSGVFRPQVKPSKLTNVYAAGASIHPGGGIPIVMQGAKLMVSAILSDHQ
NKEREGVSLSG
>Q4VKV1 1.3.8.2~~~crtN~~~4,4'-diapophytoene desaturase (4,4'-diapolycopene-forming)~~~
MANTKHIIIVGAGPGGLCAGMLLSQRGFKVSIFDKHAEIGGRNRPINMNGFTFDTGPTFLLMKGVLDEMFELCERRSEDY
LEFLPLSPMYRLLYDDRDIFVYSDRENMRAELQRVFDEGTDGYEQFMEQERKRFNALYPCITRDYSSLKSFLSLDLIKAL
PWLAFPKSVFNNLGQYFNQEKMRLAFCFQSKYLGMSPWECPALFTMLPYLEHEYGIYHVKGGLNRIAAAMAQVIAENGGE
IHLNSEIESLIIENGAAKGVKLQHGAELRGDEVIINADFAHAMTHLVKPGVLKKYTPENLKQREYSCSTFMLYLGLDKIY
DLPHHTIVFAKDYTTNIRNIFDNKTLTDDFSFYVQNASASDDSLAPAGKSALYVLVPMPNNDSGLDWQAHCQNVREQVLD
TLGARLGLSDIRAHIECEKIITPQTWETDEHVYKGATFSLSHKFSQMLYWRPHNRFEELANCYLVGGGTHPGSGLPTIYE
SARISAKLISQKHRVRFKDIAHSAWLKKAKA
>O07855 1.3.8.-~~~crtN~~~4,4'-diapophytoene desaturase (4,4'-diaponeurosporene-forming)~~~
MKIAVIGAGVTGLAAAARIASQGHEVTIFEKNNNVGGRMNQLKKDGFTFDMGPTIVMMPDVYKDVFTACGKNYEDYIELR
QLRYIYDVYFDHDDRITVPTDLAELQQMLESIEPGSTHGFMSFLTDVYKKYEIARRYFLERTYRKPSDFYNMTSLVQGAK
LKTLNHADQLIEHYIDNEKIQKLLAFQTLYIGIDPKRGPSLYSIIPMIEMMFGVHFIKGGMYGMAQGLAQLNKDLGVNIE
LNAEIEQIIIDPKFKRADAIKVNGDIRKFDKILCTADFPSVAESLMPDFAPIKKYPPHKIADLDYSCSAFLMYIGIDIDV
TDQVRLHNVIFSDDFRGNIEEIFEGRLSYDPSIYVYVPAVADKSLAPEGKTGIYVLMPTPELKTGSGIDWSDEALTQQIK
EIIYRKLATIEVFEDIKSHIVSETIFTPNDFEQTYHAKFGSAFGLMPTLAQSNYYRPQNVSRDYKDLYFAGASTHPGAGV
PIVLTSAKITVDEMIKDIERGV
>Q7A3E2 1.3.8.-~~~crtN~~~4,4'-diapophytoene desaturase (4,4'-diaponeurosporene-forming)~~~
MKIAVIGAGVTGLAAAARIASQGHEVTIFEKNNNVGGRMNQLKKDGFTFDMGPTIVMMPDVYKDVFTACGKNYEDYIELR
QLRYIYDVYFDHDDRITVPTDLAELQQMLESIEPGSTHGFMSFLTDVYKKYEIARRYFLERTYRKPSDFYNMTSLVQGAK
LKTLNHADQLIEHYIDNEKIQKLLAFQTLYIGIDPKRGPSLYSIIPMIEMMFGVHFIKGGMYGMAQGLAQLNKDLGVNIE
LNAEIEQIIIDPKFKRADAIKVNGDIRKFDKILCTADFPSVAESLMPDFAPIKKYPPHKIADLDYSCSAFLMYIGIDIDV
TDQVRLHNVIFSDDFRGNIEEIFEGRLSYDPSIYVYVPAVADKSLAPEGKTGIYVLMPTPELKTGSGIDWSDEALTQQIK
EIIYRKLATIEVFEDIKSHIVSETIFTPNDFEQTYHAKFGSAFGLMPTLAQSNYYRPQNVSRDYKDLYFAGASTHPGAGV
PIVLTSAKITVDEMIKDIERGV
>Q4VKU9 1.14.99.44~~~crtNb~~~4,4'-diapolycopene oxygenase~~~
MNSNDNQRVIVIGAGLGGLSAAISLATAGFSVQLIEKNDKVGGKLNIMTKDGFTFDLGPSILTMPHIFEALFTGAGKNMA
DYVQIQKVEPHWRNFFEDGSVIDLCEDAETQRRELDKLGPGTYAQFQRFLDYSKNLCTETEAGYFAKGLDGFWDLLKFYG
PLRSLLSFDVFRSMDQGVRRFISDPKLVEILNYFIKYVGSSPYDAPALMNLLPYIQYHYGLWYVKGGMYGMAQAMEKLAV
ELGVEIRLDAEVSEIQKQDGRACAVKLANGDVLPADIVVSNMEVIPAMEKLLRSPASELKKMQRFEPSCSGLVLHLGVDR
LYPQLAHHNFFYSDHPREHFDAVFKSHRLSDDPTIYLVAPCKTDPAQAPAGCEIIKILPHIPHLDPDKLLTAEDYSALRE
RVLVKLERMGLTDLRQHIVTEEYWTPLDIQAKYYSNQGSIYGVVADRFKNLGFKAPQRSSELSNLYFVGGSVNPGGGMPM
VTLSGQLVRDKIVADLQ
>Q2FV57 1.14.99.-~~~crtP~~~4,4'-diaponeurosporene oxygenase~~~COG1233
MTKHIIVIGGGLGGISAAIRMAQSGYSVSLYEQNNHIGGKVNRHESDGFGFDLGPSILTMPYIFEKLFEYSKKQMSDYVT
IKRLPHQWRSFFPDGTTIDLYEGIKETGQHNAILSKQDIEELQNYLNYTRRIDRITEKGYFNYGLDTLSQIIKFHGPLNA
LINYDYVHTMQQAIDKRISNPYLRQMLGYFIKYVGSSSYDAPAVLSMLFHMQQEQGLWYVEGGIHHLANALEKLAREEGV
TIHTGARVDNIKTYQRRVTGVRLDTGEFVKADYIISNMEVIPTYKYLIHLDTQRLNKLEREFEPASSGYVMHLGVACQYP
QLAHHNFFFTENAYLNYQQVFHEKVLPDDPTIYLVNTNKTDHTQAPVGYENIKVLPHIPYIQDQPFTTEDYAKFRDKILD
KLEKMGLTDLRKHIIYEDVWTPEDIEKNYRSNRGAIYGVVADKKKNKGFKFPKESQYFENLYFVGGSVNPGGGMPMVTLS
GQQVADKINAREAKNRK
>Q99R73 1.14.99.-~~~crtP~~~4,4'-diaponeurosporene oxygenase~~~
MTKHIIVIGGGLGGISAAIRMAQSGYSVSLYEQNTHIGGKVNRHESDGFGFDLGPSILTMPYIFEKLFEYSKKQMSDYVT
IKRLPHQWRSFFPDGTTIDLYEGIKETGQHNAILSKQDIEELQNYLNYTRRIDRITEKGYFNYGLDTLSQIIKFHGPLNA
LINYDYVHTMQQAIDKRISNPYLRQMLGYFIKYVGSSSYDAPAVLSMLFHMQQEQGLWYVEGGIHHLANALEKLAREEGV
TIHTGARVDNIKTYQRRVTGVRLDTGEFVKADYIISNMEVIPTYKYLIHLDTQRLNKLEREFEPASSGYVMHLGVACQYP
QLAHHNFFFTENAYLNYQQVFHEKVLPDDPTIYLVNTNKTDHTQAPVGYENIKVLPHIPYIQDQPFTTEDYAKFRDKILD
KLEKMGLTDLRKHIIYEDVWTPEDIEKNYRSNRGAIYGVVADKKKNKGFKFPKESQYFENLYFVGGSVNPGGGMPMVTLS
GQQVADKINAREAKNRK
>Q7A3D9 1.14.99.-~~~crtP~~~4,4'-diaponeurosporene oxygenase~~~
MTKHIIVIGGGLGGISAAIRMAQSGYSVSLYEQNTHIGGKVNRHESDGFGFDLGPSILTMPYIFEKLFEYSKKQMSDYVT
IKRLPHQWRSFFPDGTTIDLYEGIKETGQHNAILSKQDIEELQNYLNYTRRIDRITEKGYFNYGLDTLSQIIKFHGPLNA
LINYDYVHTMQQAIDKRISNPYLRQMLGYFIKYVGSSSYDAPAVLSMLFHMQQEQGLWYVEGGIHHLANALEKLAREEGV
TIHTGARVDNIKTYQRRVTGVRLDTGEFVKADYIISNMEVIPTYKYLIHLDTQRLNKLEREFEPASSGYVMHLGVACQYP
QLAHHNFFFTENAYLNYQQVFHEKVLPDDPTIYLVNTNKTDHTQAPVGYENIKVLPHIPYIQDQPFTTEDYAKFRDKILD
KLEKMGLTDLRKHIIYEDVWTPEDIEKNYRSNRGAIYGVVADKKKNKGFKFPKESQYFENLYFVGGSVNPGGGMPMVTLS
GQQVADKINAREAKNRK
>Q7A3E0 2.4.1.-~~~crtQ~~~4,4'-diaponeurosporenoate glycosyltransferase~~~
MKWLSRILTVIVTMSMACGALIFNRRHQLKTKTLNFNHKALTIIIPARNEEKRIGHLLHSIIQQQVPVDVIVMNDGSTDE
TARVARSYGATVVDVVDDTDGKWYGKSHACYQGVTHACTNRIAFVDADVTFLRKDAVETLINQYQLQGEKGLLSVQPYHI
TKRFYEGFSAIFNLMTVVGMNVFSTLDDGRTNQHAFGPVTLTNKEDYYATGGHKSANRHIIEGFALGSAYTSQSLPVTVY
EGFPFVAFRMYQEGFQSLQEGWTKHLSTGAGGTKPKIMTAIVLWLFGSIASILGLCLSLKYRQMSVRKMVALYLSYTTQF
IYLHRRVGQFSNLLMVCHPLLFMFFTKIFIQSWKQTHRYGVVEWKGRQYSISKEQ
>P72449 1.3.99.39~~~crtU~~~Carotenoid phi-ring synthase~~~
MFARDSGRGHRHGRDRQAAVVPAPAGRARFTGDAPAVAVVGGGIAGIAAATLLAERGVRVTLYEREPGLGGRLSGWPTEL
TDGTTVTMSRGFHAFFRQYYNLRGLLRRVDPDLGSLTRLPDYPLWHGSGLRDSFARVPRTPPLSAMGFVALSPTFGLRDL
VRINPRAAVGLLDVRVPEVYERLDGISATDFLDRIRFPEAAHHLAFEVFSRSFFADPRELSAAELALMFHIYFLGSSEGL
LFDVPGEPFPAALWEPLHHYLEVHRVDVRTRTPLRQVRPRPGGGLDLTTDDRTTRYDALVLALDSGALRRLVAASPELGD
TDWRARIARLRTAPPFLVSRLWLDRPVAHDRPGFLGTSGYGPLDNVSVLDRWEGEAARWARRTRGSVVELHAYAVAPDAD
RSAVQDEALRQLHRVYPETRSARLLDARHEWRADCPMFPVGGYRDRPGVRSPDPAVTVAGDMVRTELPVALMERAATSGF
LAANALLERWGVRGQTLWTVPRAGRSAVLRRLAALAD
>Q01330 2.4.1.276~~~crtX~~~Zeaxanthin glucosyltransferase~~~
MSHFAIVAPPLYSHAVALHALALEMAQRGHRVTFLTGNVASLAEQETERVAFYPLPASVQQAQRNVQQQSNGNLLRLIAA
MSSLTDVLCQQLPAILQRLAVDALIVDEMEPAGSLVAEALGLPFISIACALPVNREPGLPLPVMPFHYAEDKRALRRFQV
SERIYDALMYPHGQTILRHAQRFGLPERRRLDECLSPLAQISQSVPALDFPRRALPNCFHYVGALRYQPPPQVERSPRST
PRIFASLGTLQGHRLRLFQKIARACASVGAEVTIAHCDGLTPAQADSLYACGATEVVSFVDQPRYVAEANLVITHGGLNT
VLDALAAATPVLAVPLSFDQPAVAARLVYNGLGRRVSRFARQQTLADEIAQLLGDETLHQRLATARQQLNDAGGTPRAAT
LIEQAIAGSESVS
>P21687 5.5.1.19~~~crtY~~~Lycopene beta-cyclase~~~
MQPHYDLILVGAGLANGLIALRLQQQQPDMRILLIDAAPQAGGNHTWSFHHDDLTESQHRWIAPLVVHHWPDYQVRFPTR
RRKLNSGYFCITSQRFAEVLQRQFGPHLWMDTAVAEVNAESVRLKKGQVIGARAVIDGRGYAANSALSVGFQAFIGQEWR
LSHPHGLSSPIIMDATVDQQNGYRFVYSLPLSPTRLLIEDTHYIDNATLDPECARQNICDYAAQQGWQLQTLLREEQGAL
PITLSGNADAFWQQRPLACSGLRAGLFHPTTGYSLPLAVAVADRLSALDVFTSASIHHAITHFARERWQQQGFFRMLNRM
LFLAGPADSRWRVMQRFYGLPEDLIARFYAGKLTLTDRLRILSGKPPVPVLAALQAIMTTHR
>P54974 5.5.1.19~~~crtY~~~Lycopene beta-cyclase~~~
MTHDVLLAGAGLANGLIALALRAARPDLRVLLLDHAAGPSDGHTWSCHDPDLSPDWLARLKPLRRANWPDQEVRFPRHAR
RLATGYGSLDGAALADAVVRSGAEIRWDSDIALLDAQGATLSCGTRIEAGAVLDGRGAQPSRHLTVGFQKFVGVEIETDR
PHGVPRPMIMDATVTQQDGYRFIYLLPFSPTRILIEDTRYSDGGDLDDDALAAASHDYARQQGWTGAEVRRERGILPIAL
AHDAAGFWADHAAGPVPVGLRAGFFHPVTGYSLPYAAQVADVVAGLSGPPGTDALRGAIRDYAIDRARRDRFLRLLNRML
FRGCAPDRRYTLLQRFYRMPHGLIERFYAGRLSVADQLRIVTGKPPIPLGTAIRCLPERPLLKENA
>Q01331 5.5.1.19~~~crtY~~~Lycopene beta-cyclase~~~
MRDLILVGGGLANGLIAWRLRQRYPQLNLLLIEAGEQPGGNHTWSFHEDDLTPGQHAWLAPLVAHAWPGYEVQFPDLRRR
LARGYYSITSERFAEALHQALGENIWLNCSVSEVLPNSVRLANGEALLAGAVIDGRGVTASSAMQTGYQLFLGQQWRLTQ
PHGLTVPILMDATVAQQQGYRFVYTLPLSADTLLIEDTRYANVPQRDDNALRQTVTDYAHSKGWQLAQLEREETGCLPIT
LAGDIQALWADAPGVPRSGMRAGLFHPTTGYSLPLAVALADAIADSPRLGSVPLYQLTRQFAERHWRRQGFFRLLNRMLF
LAGREENRWRVMQRFYGLPEPTVERFYAGRLSLFDKARILTGKPPVPLGEAWRAALNHFPDRRDKG
>P52046 4.2.1.150~~~crt~~~Short-chain-enoyl-CoA hydratase~~~COG1024
MELNNVILEKEGKVAVVTINRPKALNALNSDTLKEMDYVIGEIENDSEVLAVILTGAGEKSFVAGADISEMKEMNTIEGR
KFGILGNKVFRRLELLEKPVIAAVNGFALGGGCEIAMSCDIRIASSNARFGQPEVGLGITPGFGGTQRLSRLVGMGMAKQ
LIFTAQNIKADEALRIGLVNKVVEPSELMNTAKEIANKIVSNAPVAVKLSKQAINRGMQCDIDTALAFESEAFGECFSTE
DQKDAMTAFIEKRKIEGFKNR
>Q9KS67 ~~~cry2~~~Cryptochrome-like protein cry2~~~COG0415
MEKINLVWLKRDLRLTDHAPLQAALTSGRPTLLLYLFEPMLLGDAHYSERHWRFVWQSLQAINRDLAQSKGEVLIVTSDW
QTCFARIAERYAIEAIYSHQEVGLACTFQRDLALAQWCQQHDIVWHEFPYAAVIRGAQTRKNWDEHWQQVMRSPCCDPDL
TRANWLKLDAATLGLRSDIPATWQSKQAGMQTGGSDMAWATLEDFFARRGREYYRSISSPSLARHACSRMSPYLAWGNIS
LREMYQTLLKHWSVAGFRRSLIALSSRLHWHCHFIQKFESECEMEFRCVNRAYDSLLQQSSDAPAAQLAAWQTGHTGIPL
VDACMRCLIQTGYLNFRMRAMLVSVLTHHMNVDWRAGVTYLAQLFLDFEPGIHYPQFQMQAGVTGTNTIRIYNPTKQAQE
HDSEGQFIHKWVPELAQVPVPLLFEPWLMTPLEAQMYQVPLESPYLKPVMDLEASAKQARDRLWQWQKLPAVQAEAMRIL
ARHVRQAKPRTSPRQPNKRQPEMD
>P77967 ~~~cry~~~Cryptochrome DASH~~~COG0415
MKHVPPTVLVWFRNDLRLHDHEPLHRALKSGLAITAVYCYDPRQFAQTHQGFAKTGPWRSNFLQQSVQNLAESLQKVGNK
LLVTTGLPEQVIPQIAKQINAKTIYYHREVTQEELDVERNLVKQLTILGIEAKGYWGSTLCHPEDLPFSIQDLPDLFTKF
RKDIEKKKISIRPCFFAPSQLLPSPNIKLELTAPPPEFFPQINFDHRSVLAFQGGETAGLARLQDYFWHGDRLKDYKETR
NGMVGADYSSKFSPWLALGCLSPRFIYQEVKRYEQERVSNDSTHWLIFELLWRDFFRFVAQKYGNKLFNRGGLLNKNFPW
QEDQVRFELWRSGQTGYPLVDANMRELNLTGFMSNRGRQNVASFLCKNLGIDWRWGAEWFESCLIDYDVCSNWGNWNYTA
GIGNDARDFRYFNIPKQSQQYDPQGTYLRHWLPELKNLPGDKIHQPWLLSATEQKQWGVQLGVDYPRPCVNFHQSVEARR
KIEQMGVIA
>Q9KR33 ~~~cry1~~~Cryptochrome DASH~~~COG0415
MSKKIGLYWFTNDLRVNDNPLLEQASQQVDRLICLYCYPSITPFLARYAQQTQWGEAKKRFLNQTLADLDHSLSTLGQKL
WVTPLLPYQALRHLLTQVEITDIYVDAVAGSDERQAIARIHQDFSSVHIHQQALHSLLSEPQLPFALEALPSTFTQFRKQ
VETISLSAPMGYPHVLPPIEQGWQLPLMDIVTEPNHSAFVGGEQAGLTHCQNYFSSLLPSRYKETRNGLDGMDYSTKFSP
WLALGAVSPKTIYAMLQRYEAVHGANDSTYWIFFELLWREYFYWYARRYGAKLFRFSGIGEKKPLTSFYAQRFLQWKHGE
TPFPIVNACMRQLNQTGYMSNRGRQLVASCLVHELGLDWRYGAAYFETQLVDYDVGSNWGNWQYLAGVGADPRGSRQFNL
EKQAHTYDPKGEFVAKWCGTACDKLNALENLALDSVDMVDWPIAASAYLLIHHPQNKESSS
>U2UMQ6 3.1.21.1~~~~~~CRISPR-associated endonuclease Cas12a~~~
MTQFEGFTNLYQVSKTLRFELIPQGKTLKHIQEQGFIEEDKARNDHYKELKPIIDRIYKTYADQCLQLVQLDWENLSAAI
DSYRKEKTEETRNALIEEQATYRNAIHDYFIGRTDNLTDAINKRHAEIYKGLFKAELFNGKVLKQLGTVTTTEHENALLR
SFDKFTTYFSGFYENRKNVFSAEDISTAIPHRIVQDNFPKFKENCHIFTRLITAVPSLREHFENVKKAIGIFVSTSIEEV
FSFPFYNQLLTQTQIDLYNQLLGGISREAGTEKIKGLNEVLNLAIQKNDETAHIIASLPHRFIPLFKQILSDRNTLSFIL
EEFKSDEEVIQSFCKYKTLLRNENVLETAEALFNELNSIDLTHIFISHKKLETISSALCDHWDTLRNALYERRISELTGK
ITKSAKEKVQRSLKHEDINLQEIISAAGKELSEAFKQKTSEILSHAHAALDQPLPTTLKKQEEKEILKSQLDSLLGLYHL
LDWFAVDESNEVDPEFSARLTGIKLEMEPSLSFYNKARNYATKKPYSVEKFKLNFQMPTLASGWDVNKEKNNGAILFVKN
GLYYLGIMPKQKGRYKALSFEPTEKTSEGFDKMYYDYFPDAAKMIPKCSTQLKAVTAHFQTHTTPILLSNNFIEPLEITK
EIYDLNNPEKEPKKFQTAYAKKTGDQKGYREALCKWIDFTRDFLSKYTKTTSIDLSSLRPSSQYKDLGEYYAELNPLLYH
ISFQRIAEKEIMDAVETGKLYLFQIYNKDFAKGHHGKPNLHTLYWTGLFSPENLAKTSIKLNGQAELFYRPKSRMKRMAH
RLGEKMLNKKLKDQKTPIPDTLYQELYDYVNHRLSHDLSDEARALLPNVITKEVSHEIIKDRRFTSDKFFFHVPITLNYQ
AANSPSKFNQRVNAYLKEHPETPIIGIDRGERNLIYITVIDSTGKILEQRSLNTIQQFDYQKKLDNREKERVAARQAWSV
VGTIKDLKQGYLSQVIHEIVDLMIHYQAVVVLENLNFGFKSKRTGIAEKAVYQQFEKMLIDKLNCLVLKDYPAEKVGGVL
NPYQLTDQFTSFAKMGTQSGFLFYVPAPYTSKIDPLTGFVDPFVWKTIKNHESRKHFLEGFDFLHYDVKTGDFILHFKMN
RNLSFQRGLPGFMPAWDIVFEKNETQFDAKGTPFIAGKRIVPVIENHRFTGRYRDLYPANELIALLEEKGIVFRDGSNIL
PKLLENDDSHAIDTMVALIRSVLQMRNSNAATGEDYINSPVRDLNGVCFDSRFQNPEWPMDADANGAYHIALKGQLLLNH
LKESKDLKLQNGISNQDWLAYIQELRN
>A0Q7Q2 3.1.21.1~~~~~~CRISPR-associated endonuclease Cas12a~~~
MSIYQEFVNKYSLSKTLRFELIPQGKTLENIKARGLILDDEKRAKDYKKAKQIIDKYHQFFIEEILSSVCISEDLLQNYS
DVYFKLKKSDDDNLQKDFKSAKDTIKKQISEYIKDSEKFKNLFNQNLIDAKKGQESDLILWLKQSKDNGIELFKANSDIT
DIDEALEIIKSFKGWTTYFKGFHENRKNVYSSNDIPTSIIYRIVDDNLPKFLENKAKYESLKDKAPEAINYEQIKKDLAE
ELTFDIDYKTSEVNQRVFSLDEVFEIANFNNYLNQSGITKFNTIIGGKFVNGENTKRKGINEYINLYSQQINDKTLKKYK
MSVLFKQILSDTESKSFVIDKLEDDSDVVTTMQSFYEQIAAFKTVEEKSIKETLSLLFDDLKAQKLDLSKIYFKNDKSLT
DLSQQVFDDYSVIGTAVLEYITQQIAPKNLDNPSKKEQELIAKKTEKAKYLSLETIKLALEEFNKHRDIDKQCRFEEILA
NFAAIPMIFDEIAQNKDNLAQISIKYQNQGKKDLLQASAEDDVKAIKDLLDQTNNLLHKLKIFHISQSEDKANILDKDEH
FYLVFEECYFELANIVPLYNKIRNYITQKPYSDEKFKLNFENSTLANGWDKNKEPDNTAILFIKDDKYYLGVMNKKNNKI
FDDKAIKENKGEGYKKIVYKLLPGANKMLPKVFFSAKSIKFYNPSEDILRIRNHSTHTKNGSPQKGYEKFEFNIEDCRKF
IDFYKQSISKHPEWKDFGFRFSDTQRYNSIDEFYREVENQGYKLTFENISESYIDSVVNQGKLYLFQIYNKDFSAYSKGR
PNLHTLYWKALFDERNLQDVVYKLNGEAELFYRKQSIPKKITHPAKEAIANKNKDNPKKESVFEYDLIKDKRFTEDKFFF
HCPITINFKSSGANKFNDEINLLLKEKANDVHILSIDRGERHLAYYTLVDGKGNIIKQDTFNIIGNDRMKTNYHDKLAAI
EKDRDSARKDWKKINNIKEMKEGYLSQVVHEIAKLVIEYNAIVVFEDLNFGFKRGRFKVEKQVYQKLEKMLIEKLNYLVF
KDNEFDKTGGVLRAYQLTAPFETFKKMGKQTGIIYYVPAGFTSKICPVTGFVNQLYPKYESVSKSQEFFSKFDKICYNLD
KGYFEFSFDYKNFGDKAAKGKWTIASFGSRLINFRNSDKNHNWDTREVYPTKELEKLLKDYSIEYGHGECIKAAICGESD
KKFFAKLTSVLNTILQMRNSKTGTELDYLISPVADVNGNFFDSRQAPKNMPQDADANGAYHIGLKGLMLLGRIKNNQEGK
KLNLVIKNEEYFEFVQNRNN
>T0D7A2 3.1.-.-~~~~~~CRISPR-associated endonuclease Cas12b~~~COG0675
MAVKSIKVKLRLDDMPEIRAGLWKLHKEVNAGVRYYTEWLSLLRQENLYRRSPNGDGEQECDKTAEECKAELLERLRARQ
VENGHRGPAGSDDELLQLARQLYELLVPQAIGAKGDAQQIARKFLSPLADKDAVGGLGIAKAGNKPRWVRMREAGEPGWE
EEKEKAETRKSADRTADVLRALADFGLKPLMRVYTDSEMSSVEWKPLRKGQAVRTWDRDMFQQAIERMMSWESWNQRVGQ
EYAKLVEQKNRFEQKNFVGQEHLVHLVNQLQQDMKEASPGLESKEQTAHYVTGRALRGSDKVFEKWGKLAPDAPFDLYDA
EIKNVQRRNTRRFGSHDLFAKLAEPEYQALWREDASFLTRYAVYNSILRKLNHAKMFATFTLPDATAHPIWTRFDKLGGN
LHQYTFLFNEFGERRHAIRFHKLLKVENGVAREVDDVTVPISMSEQLDNLLPRDPNEPIALYFRDYGAEQHFTGEFGGAK
IQCRRDQLAHMHRRRGARDVYLNVSVRVQSQSEARGERRPPYAAVFRLVGDNHRAFVHFDKLSDYLAEHPDDGKLGSEGL
LSGLRVMSVDLGLRTSASISVFRVARKDELKPNSKGRVPFFFPIKGNDNLVAVHERSQLLKLPGETESKDLRAIREERQR
TLRQLRTQLAYLRLLVRCGSEDVGRRERSWAKLIEQPVDAANHMTPDWREAFENELQKLKSLHGICSDKEWMDAVYESVR
RVWRHMGKQVRDWRKDVRSGERPKIRGYAKDVVGGNSIEQIEYLERQYKFLKSWSFFGKVSGQVIRAEKGSRFAITLREH
IDHAKEDRLKKLADRIIMEALGYVYALDERGKGKWVAKYPPCQLILLEELSEYQFNNDRPPSENNQLMQWSHRGVFQELI
NQAQVHDLLVGTMYAAFSSRFDARTGAPGIRCRRVPARCTQEHNPEPFPWWLNKFVVEHTLDACPLRADDLIPTGEGEIF
VSPFSAEEGDFHQIHADLNAAQNLQQRLWSDFDISQIRLRCDWGEVDGELVLIPRLTGKRTADSYSNKVFYTNTGVTYYE
RERGKKRRKVFAQEKLSEEEAELLVEADEAREKSVVLMRDPSGIINRGNWTRQKEFWSMVNQRIEGYLVKQIRSRVPLQD
SACENTGDI
>A0A2U3D0N8 3.1.-.-~~~~~~CRISPR-associated endodeoxyribonuclease Cas12f1~~~
MIKVYRYEIVKPLDLDWKEFGTILRQLQQETRFALNKATQLAWEWMGFSSDYKDNHGEYPKSKDILGYTNVHGYAYHTIK
TKAYRLNSGNLSQTIKRATDRFKAYQKEILRGDMSIPSYKRDIPLDLIKENISVNRMNHGDYIASLSLLSNPAKQEMNVK
RKISVIIIVRGAGKTIMDRILSGEYQVSASQIIHDDRKNKWYLNISYDFEPQTRVLDLNKIMGIDLGVAVAVYMAFQHTP
ARYKLEGGEIENFRRQVESRRISMLRQGKYAGGARGGHGRDKRIKPIEQLRDKIANFRDTTNHRYSRYIVDMAIKEGCGT
IQMEDLTNIRDIGSRFLQNWTYYDLQQKIIYKAEEAGIKVIKIDPQYTSQRCSECGNIDSGNRIGQAIFKCRACGYEANA
DYNAARNIAIPNIDKIIAESIK
>P0DW62 3.1.-.-~~~~~~CRISPR-associated endodeoxyribonuclease Cas12f1~~~
MGESVKAIKLKILDMFLDPECTKQDDNWRKDLSTMSRFCAEAGNMCLRDLYNYFSMPKEDRISSKDLYNAMYHKTKLLHP
ELPGKVANQIVNHAKDVWKRNAKLIYRNQISMPTYKITTAPIRLQNNIYKLIKNKNKYIIDVQLYSKEYSKDSGKGTHRY
FLVAVRDSSTRMIFDRIMSKDHIDSSKSYTQGQLQIKKDHQGKWYCIIPYTFPTHETVLDPDKVMGVDLGVAKAVYWAFN
SSYKRGCIDGGEIEHFRKMIRARRVSIQNQIKHSGDARKGHGRKRALKPIETLSEKEKNFRDTINHRYANRIVEAAIKQG
CGTIQIENLEGIADTTGSKFLKNWPYYDLQTKIVNKAKEHGITVVAINPQYTSQRCSMCGYIEKTNRSSQAVFECKQCGY
GSRTICINCRHVQVSGDVCEECGGIVKKENVNADYNAAKNISTPYIDQIIMEKCLELGIPYRSITCKECGHIQASGNTCE
VCGSTNILKPKKIRKAK
>A0A0H5SJ89 3.1.-.-~~~~~~CRISPR-associated endoribonuclease Cas13a~~~
MKLTRRRISGNSVDQKITAAFYRDMSQGLLYYDSEDNDCTDKVIESMDFERSWRGRILKNGEDDKNPFYMFVKGLVGSND
KIVCEPIDVDSDPDNLDILINKNLTGFGRNLKAPDSNDTLENLIRKIQAGIPEEEVLPELKKIKEMIQKDIVNRKEQLLK
SIKNNRIPFSLEGSKLVPSTKKMKWLFKLIDVPNKTFNEKMLEKYWEIYDYDKLKANITNRLDKTDKKARSISRAVSEEL
REYHKNLRTNYNRFVSGDRPAAGLDNGGSAKYNPDKEEFLLFLKEVEQYFKKYFPVKSKHSNKSKDKSLVDKYKNYCSYK
VVKKEVNRSIINQLVAGLIQQGKLLYYFYYNDTWQEDFLNSYGLSYIQVEEAFKKSVMTSLSWGINRLTSFFIDDSNTVK
FDDITTKKAKEAIESNYFNKLRTCSRMQDHFKEKLAFFYPVYVKDKKDRPDDDIENLIVLVKNAIESVSYLRNRTFHFKE
SSLLELLKELDDKNSGQNKIDYSVAAEFIKRDIENLYDVFREQIRSLGIAEYYKADMISDCFKTCGLEFALYSPKNSLMP
AFKNVYKRGANLNKAYIRDKGPKETGDQGQNSYKALEEYRELTWYIEVKNNDQSYNAYKNLLQLIYYHAFLPEVRENEAL
ITDFINRTKEWNRKETEERLNTKNNKKHKNFDENDDITVNTYRYESIPDYQGESLDDYLKVLQRKQMARAKEVNEKEEGN
NNYIQFIRDVVVWAFGAYLENKLKNYKNELQPPLSKENIGLNDTLKELFPEEKVKSPFNIKCRFSISTFIDNKGKSTDNT
SAEAVKTDGKEDEKDKKNIKRKDLLCFYLFLRLLDENEICKLQHQFIKYRCSLKERRFPGNRTKLEKETELLAELEELME
LVRFTMPSIPEISAKAESGYDTMIKKYFKDFIEKKVFKNPKTSNLYYHSDSKTPVTRKYMALLMRSAPLHLYKDIFKGYY
LITKKECLEYIKLSNIIKDYQNSLNELHEQLERIKLKSEKQNGKDSLYLDKKDFYKVKEYVENLEQVARYKHLQHKINFE
SLYRIFRIHVDIAARMVGYTQDWERDMHFLFKALVYNGVLEERRFEAIFNNNDDNNDGRIVKKIQNNLNNKNRELVSMLC
WNKKLNKNEFGAIIWKRNPIAHLNHFTQTEQNSKSSLESLINSLRILLAYDRKRQNAVTKTINDLLLNDYHIRIKWEGRV
DEGQIYFNIKEKEDIENEPIIHLKHLHKKDCYIYKNSYMFDKQKEWICNGIKEEVYDKSILKCIGNLFKFDYEDKNKSSA
NPKHT
>P0DPB7 3.1.-.-~~~~~~CRISPR-associated endoribonuclease Cas13a~~~
MKISKVREENRGAKLTVNAKTAVVSENRSQEGILYNDPSRYGKSRKNDEDRDRYIESRLKSSGKLYRIFNEDKNKRETDE
LQWFLSEIVKKINRRNGLVLSDMLSVDDRAFEKAFEKYAELSYTNRRNKVSGSPAFETCGVDAATAERLKGIISETNFIN
RIKNNIDNKVSEDIIDRIIAKYLKKSLCRERVKRGLKKLLMNAFDLPYSDPDIDVQRDFIDYVLEDFYHVRAKSQVSRSI
KNMNMPVQPEGDGKFAITVSKGGTESGNKRSAEKEAFKKFLSDYASLDERVRDDMLRRMRRLVVLYFYGSDDSKLSDVNE
KFDVWEDHAARRVDNREFIKLPLENKLANGKTDKDAERIRKNTVKELYRNQNIGCYRQAVKAVEEDNNGRYFDDKMLNMF
FIHRIEYGVEKIYANLKQVTEFKARTGYLSEKIWKDLINYISIKYIAMGKAVYNYAMDELNASDKKEIELGKISEEYLSG
ISSFDYELIKAEEMLQRETAVYVAFAARHLSSQTVELDSENSDFLLLKPKGTMDKNDKNKLASNNILNFLKDKETLRDTI
LQYFGGHSLWTDFPFDKYLAGGKDDVDFLTDLKDVIYSMRNDSFHYATENHNNGKWNKELISAMFEHETERMTVVMKDKF
YSNNLPMFYKNDDLKKLLIDLYKDNVERASQVPSFNKVFVRKNFPALVRDKDNLGIELDLKADADKGENELKFYNALYYM
FKEIYYNAFLNDKNVRERFITKATKVADNYDRNKERNLKDRIKSAGSDEKKKLREQLQNYIAENDFGQRIKNIVQVNPDY
TLAQICQLIMTEYNQQNNGCMQKKSAARKDINKDSYQHYKMLLLVNLRKAFLEFIKENYAFVLKPYKHDLCDKADFVPDF
AKYVKPYAGLISRVAGSSELQKWYIVSRFLSPAQANHMLGFLHSYKQYVWDIYRRASETGTEINHSIAEDKIAGVDITDV
DAVIDLSVKLCGTISSEISDYFKDDEVYAEYISSYLDFEYDGGNYKDSLNRFCNSDAVNDQKVALYYDGEHPKLNRNIIL
SKLYGERRFLEKITDRVSRSDIVEYYKLKKETSQYQTKGIFDSEDEQKNIKKFQEMKNIVEFRDLMDYSEIADELQGQLI
NWIYLRERDLMNFQLGYHYACLNNDSNKQATYVTLDYQGKKNRKINGAILYQICAMYINGLPLYYVDKDSSEWTVSDGKE
STGAKIGEFYRYAKSFENTSDCYASGLEIFENISEHDNITELRNYIEHFRYYSSFDRSFLGIYSEVFDRFFTYDLKYRKN
VPTILYNILLQHFVNVRFEFVSGKKMIGIDKKDRKIAKEKECARITIREKNGVYSEQFTYKLKNGTVYVDARDKRYLQSI
IRLLFYPEKVNMDEMIEVKEKKKPSDNNTGKGYSKRDRQQDRKEYDKYKEKKKKEGNFLSGMGGNINWDEINAQLKN
>C7NBY4 3.1.-.-~~~~~~CRISPR-associated endoribonuclease Cas13a~~~
MKVTKVGGISHKKYTSEGRLVKSESEENRTDERLSALLNMRLDMYIKNPSSTETKENQKRIGKLKKFFSNKMVYLKDNTL
SLKNGKKENIDREYSETDILESDVRDKKNFAVLKKIYLNENVNSEELEVFRNDIKKKLNKINSLKYSFEKNKANYQKINE
NNIEKVEGKSKRNIIYDYYRESAKRDAYVSNVKEAFDKLYKEEDIAKLVLEIENLTKLEKYKIREFYHEIIGRKNDKENF
AKIIYEEIQNVNNMKELIEKVPDMSELKKSQVFYKYYLDKEELNDKNIKYAFCHFVEIEMSQLLKNYVYKRLSNISNDKI
KRIFEYQNLKKLIENKLLNKLDTYVRNCGKYNYYLQDGEIATSDFIARNRQNEAFLRNIIGVSSVAYFSLRNILETENEN
DITGRMRGKTVKNNKGEEKYVSGEVDKIYNENKKNEVKENLKMFYSYDFNMDNKNEIEDFFANIDEAISSIRHGIVHFNL
ELEGKDIFAFKNIAPSEISKKMFQNEINEKKLKLKIFRQLNSANVFRYLEKYKILNYLKRTRFEFVNKNIPFVPSFTKLY
SRIDDLKNSLGIYWKTPKTNDDNKTKEIIDAQIYLLKNIYYGEFLNYFMSNNGNFFEISKEIIELNKNDKRNLKTGFYKL
QKFEDIQEKIPKEYLANIQSLYMINAGNQDEEEKDTYIDFIQKIFLKGFMTYLANNGRLSLIYIGSDEETNTSLAEKKQE
FDKFLKKYEQNNNIKIPYEINEFLREIKLGNILKYTERLNMFYLILKLLNHKELTNLKGSLEKYQSANKEEAFSDQLELI
NLLNLDNNRVTEDFELEADEIGKFLDFNGNKVKDNKELKKFDTNKIYFDGENIIKHRAFYNIKKYGMLNLLEKIADKAGY
KISIEELKKYSNKKNEIEKNHKMQENLHRKYARPRKDEKFTDEDYESYKQAIENIEEYTHLKNKVEFNELNLLQGLLLRI
LHRLVGYTSIWERDLRFRLKGEFPENQYIEEIFNFENKKNVKYKGGQIVEKYIKFYKELHQNDEVKINKYSSANIKVLKQ
EKKDLYIRNYIAHFNYIPHAEISLLEVLENLRKLLSYDRKLKNAVMKSVVDILKEYGFVATFKIGADKKIGIQTLESEKI
VHLKNLKKKKLMTDRNSEELCKLVKIMFEYKMEEKKSEN
>P0DOC6 3.1.-.-~~~~~~CRISPR-associated endoribonuclease Cas13a~~~
MGNLFGHKRWYEVRDKKDFKIKRKVKVKRNYDGNKYILNINENNNKEKIDNNKFIRKYINYKKNDNILKEFTRKFHAGNI
LFKLKGKEGIIRIENNDDFLETEEVVLYIEAYGKSEKLKALGITKKKIIDEAIRQGITKDDKKIEIKRQENEEEIEIDIR
DEYTNKTLNDCSIILRIIENDELETKKSIYEIFKNINMSLYKIIEKIIENETEKVFENRYYEEHLREKLLKDDKIDVILT
NFMEIREKIKSNLEILGFVKFYLNVGGDKKKSKNKKMLVEKILNINVDLTVEDIADFVIKELEFWNITKRIEKVKKVNNE
FLEKRRNRTYIKSYVLLDKHEKFKIERENKKDKIVKFFVENIKNNSIKEKIEKILAEFKIDELIKKLEKELKKGNCDTEI
FGIFKKHYKVNFDSKKFSKKSDEEKELYKIIYRYLKGRIEKILVNEQKVRLKKMEKIEIEKILNESILSEKILKRVKQYT
LEHIMYLGKLRHNDIDMTTVNTDDFSRLHAKEELDLELITFFASTNMELNKIFSRENINNDENIDFFGGDREKNYVLDKK
ILNSKIKIIRDLDFIDNKNNITNNFIRKFTKIGTNERNRILHAISKERDLQGTQDDYNKVINIIQNLKISDEEVSKALNL
DVVFKDKKNIITKINDIKISEENNNDIKYLPSFSKVLPEILNLYRNNPKNEPFDTIETEKIVLNALIYVNKELYKKLILE
DDLEENESKNIFLQELKKTLGNIDEIDENIIENYYKNAQISASKGNNKAIKKYQKKVIECYIGYLRKNYEELFDFSDFKM
NIQEIKKQIKDINDNKTYERITVKTSDKTIVINDDFEYIISIFALLNSNAVINKIRNRFFATSVWLNTSEYQNIIDILDE
IMQLNTLRNECITENWNLNLEEFIQKMKEIEKDFDDFKIQTKKEIFNNYYEDIKNNILTEFKDDINGCDVLEKKLEKIVI
FDDETKFEIDKKSNILQDEQRKLSNINKKDLKKKVDQYIKDKDQEIKSKILCRIIFNSDFLKKYKKEIDNLIEDMESENE
NKFQEIYYPKERKNELYIYKKNLFLNIGNPNFDKIYGLISNDIKMADAKFLFNIDGKNIRKNKISEIDAILKNLNDKLNG
YSKEYKEKYIKKLKENDDFFAKNIQNKNYKSFEKDYNRVSEYKKIRDLVEFNYLNKIESYLIDINWKLAIQMARFERDMH
YIVNGLRELGIIKLSGYNTGISRAYPKRNGSDGFYTTTAYYKFFDEESYKKFEKICYGFGIDLSENSEINKPENESIRNY
ISHFYIVRNPFADYSIAEQIDRVSNLLSYSTRYNNSTYASVFEVFKKDVNLDYDELKKKFKLIGNNDILERLMKPKKVSV
LELESYNSDYIKNLIIELLTKIENTNDTL
>U2PSH1 3.1.-.-~~~~~~CRISPR-associated endoribonuclease Cas13a~~~
MYMKITKIDGVSHYKKQDKGILKKKWKDLDERKQREKIEARYNKQIESKIYKEFFRLKNKKRIEKEEDQNIKSLYFFIKE
LYLNEKNEEWELKNINLEILDDKERVIKGYKFKEDVYFFKEGYKEYYLRILFNNLIEKVQNENREKVRKNKEFLDLKEIF
KKYKNRKIDLLLKSINNNKINLEYKKENVNEEIYGINPTNDREMTFYELLKEIIEKKDEQKSILEEKLDNFDITNFLENI
EKIFNEETEINIIKGKVLNELREYIKEKEENNSDNKLKQIYNLELKKYIENNFSYKKQKSKSKNGKNDYLYLNFLKKIMF
IEEVDEKKEINKEKFKNKINSNFKNLFVQHILDYGKLLYYKENDEYIKNTGQLETKDLEYIKTKETLIRKMAVLVSFAAN
SYYNLFGRVSGDILGTEVVKSSKTNVIKVGSHIFKEKMLNYFFDFEIFDANKIVEILESISYSIYNVRNGVGHFNKLILG
KYKKKDINTNKRIEEDLNNNEEIKGYFIKKRGEIERKVKEKFLSNNLQYYYSKEKIENYFEVYEFEILKRKIPFAPNFKR
IIKKGEDLFNNKNNKKYEYFKNFDKNSAEEKKEFLKTRNFLLKELYYNNFYKEFLSKKEEFEKIVLEVKEEKKSRGNINN
KKSGVSFQSIDDYDTKINISDYIASIHKKEMERVEKYNEEKQKDTAKYIRDFVEEIFLTGFINYLEKDKRLHFLKEEFSI
LCNNNNNVVDFNININEEKIKEFLKENDSKTLNLYLFFNMIDSKRISEFRNELVKYKQFTKKRLDEEKEFLGIKIELYET
LIEFVILTREKLDTKKSEEIDAWLVDKLYVKDSNEYKEYEEILKLFVDEKILSSKEAPYYATDNKTPILLSNFEKTRKYG
TQSFLSEIQSNYKYSKVEKENIEDYNKKEEIEQKKKSNIEKLQDLKVELHKKWEQNKITEKEIEKYNNTTRKINEYNYLK
NKEELQNVYLLHEMLSDLLARNVAFFNKWERDFKFIVIAIKQFLRENDKEKVNEFLNPPDNSKGKKVYFSVSKYKNTVEN
IDGIHKNFMNLIFLNNKFMNRKIDKMNCAIWVYFRNYIAHFLHLHTKNEKISLISQMNLLIKLFSYDKKVQNHILKSTKT
LLEKYNIQINFEISNDKNEVFKYKIKNRLYSKKGKMLGKNNKFEILENEFLENVKAMLEYSE
>P0DPB8 3.1.-.-~~~~~~CRISPR-associated endoribonuclease Cas13a~~~
MWISIKTLIHHLGVLFFCDYMYNRREKKIIEVKTMRITKVEVDRKKVLISRDKNGGKLVYENEMQDNTEQIMHHKKSSFY
KSVVNKTICRPEQKQMKKLVHGLLQENSQEKIKVSDVTKLNISNFLNHRFKKSLYYFPENSPDKSEEYRIEINLSQLLED
SLKKQQGTFICWESFSKDMELYINWAENYISSKTKLIKKSIRNNRIQSTESRSGQLMDRYMKDILNKNKPFDIQSVSEKY
QLEKLTSALKATFKEAKKNDKEINYKLKSTLQNHERQIIEELKENSELNQFNIEIRKHLETYFPIKKTNRKVGDIRNLEI
GEIQKIVNHRLKNKIVQRILQEGKLASYEIESTVNSNSLQKIKIEEAFALKFINACLFASNNLRNMVYPVCKKDILMIGE
FKNSFKEIKHKKFIRQWSQFFSQEITVDDIELASWGLRGAIAPIRNEIIHLKKHSWKKFFNNPTFKVKKSKIINGKTKDV
TSEFLYKETLFKDYFYSELDSVPELIINKMESSKILDYYSSDQLNQVFTIPNFELSLLTSAVPFAPSFKRVYLKGFDYQN
QDEAQPDYNLKLNIYNEKAFNSEAFQAQYSLFKMVYYQVFLPQFTTNNDLFKSSVDFILTLNKERKGYAKAFQDIRKMNK
DEKPSEYMSYIQSQLMLYQKKQEEKEKINHFEKFINQVFIKGFNSFIEKNRLTYICHPTKNTVPENDNIEIPFHTDMDDS
NIAFWLMCKLLDAKQLSELRNEMIKFSCSLQSTEEISTFTKAREVIGLALLNGEKGCNDWKELFDDKEAWKKNMSLYVSE
ELLQSLPYTQEDGQTPVINRSIDLVKKYGTETILEKLFSSSDDYKVSAKDIAKLHEYDVTEKIAQQESLHKQWIEKPGLA
RDSAWTKKYQNVINDISNYQWAKTKVELTQVRHLHQLTIDLLSRLAGYMSIADRDFQFSSNYILERENSEYRVTSWILLS
ENKNKNKYNDYELYNLKNASIKVSSKNDPQLKVDLKQLRLTLEYLELFDNRLKEKRNNISHFNYLNGQLGNSILELFDDA
RDVLSYDRKLKNAVSKSLKEILSSHGMEVTFKPLYQTNHHLKIDKLQPKKIHHLGEKSTVSSNQVSNEYCQLVRTLLTMK
>E4T0I2 3.1.-.-~~~~~~CRISPR-associated endoribonuclease Cas13a~~~
MRVSKVKVKDGGKDKMVLVHRKTTGAQLVYSGQPVSNETSNILPEKKRQSFDLSTLNKTIIKFDTAKKQKLNVDQYKIVE
KIFKYPKQELPKQIKAEEILPFLNHKFQEPVKYWKNGKEESFNLTLLIVEAVQAQDKRKLQPYYDWKTWYIQTKSDLLKK
SIENNRIDLTENLSKRKKALLAWETEFTASGSIDLTHYHKVYMTDVLCKMLQDVKPLTDDKGKINTNAYHRGLKKALQNH
QPAIFGTREVPNEANRADNQLSIYHLEVVKYLEHYFPIKTSKRRNTADDIAHYLKAQTLKTTIEKQLVNAIRANIIQQGK
TNHHELKADTTSNDLIRIKTNEAFVLNLTGTCAFAANNIRNMVDNEQTNDILGKGDFIKSLLKDNTNSQLYSFFFGEGLS
TNKAEKETQLWGIRGAVQQIRNNVNHYKKDALKTVFNISNFENPTITDPKQQTNYADTIYKARFINELEKIPEAFAQQLK
TGGAVSYYTIENLKSLLTTFQFSLCRSTIPFAPGFKKVFNGGINYQNAKQDESFYELMLEQYLRKENFAEESYNARYFML
KLIYNNLFLPGFTTDRKAFADSVGFVQMQNKKQAEKVNPRKKEAYAFEAVRPMTAADSIADYMAYVQSELMQEQNKKEEK
VAEETRINFEKFVLQVFIKGFDSFLRAKEFDFVQMPQPQLTATASNQQKADKLNQLEASITADCKLTPQYAKADDATHIA
FYVFCKLLDAAHLSNLRNELIKFRESVNEFKFHHLLEIIEICLLSADVVPTDYRDLYSSEADCLARLRPFIEQGADITNW
SDLFVQSDKHSPVIHANIELSVKYGTTKLLEQIINKDTQFKTTEANFTAWNTAQKSIEQLIKQREDHHEQWVKAKNADDK
EKQERKREKSNFAQKFIEKHGDDYLDICDYINTYNWLDNKMHFVHLNRLHGLTIELLGRMAGFVALFDRDFQFFDEQQIA
DEFKLHGFVNLHSIDKKLNEVPTKKIKEIYDIRNKIIQINGNKINESVRANLIQFISSKRNYYNNAFLHVSNDEIKEKQM
YDIRNHIAHFNYLTKDAADFSLIDLINELRELLHYDRKLKNAVSKAFIDLFDKHGMILKLKLNADHKLKVESLEPKKIYH
LGSSAKDKPEYQYCTNQVMMAYCNMCRSLLEMKK
>S5FT07 3.13.1.5~~~csh~~~Carbon disulfide hydrolase~~~
MSTLKEQLTAHVASYDHWAQRRRYGPDGHNNRSLWVLACMDERLPVDEALGIHVDTPAGGGDAHCFRNAGGIVTDDAIRS
AMLTCNFFGTKEIVIVQHTQCGMLSGNANEMEKVLREKGMDTDNITLDPTLPELQLAKGAFAKWIGMMDDVDETCMKTIN
AFKNHPLIPKDIVVSGWVWEVENRRLRAPTLDKEKRARTDCTPTPYGVKGNQPPRWK
>S5FU55 3.13.1.5~~~csh~~~Carbon disulfide hydrolase~~~
MSLKQQLESDFEGHKRWALRRQMGIPNNRRLWVCACMDERLPVDDALGIRGDRGDAHVFRNAGGLITDDAIRSAMLTCNF
FGTEEIVIINHTECGMMSAQTDTIVKALKDKGIDLDNLQLDPDLPELTLKAGMFGKWVKMYQDVDETCARQVEYMRNHPL
IPKHVTISGWIWEVETGHLRPPHFRIGEKVNTNKAMGAK
>Q74H36 3.1.-.-~~~cas4-cas1~~~CRISPR-associated exonuclease Cas4/endonuclease Cas1 fusion~~~COG1468
MAETDGSIPLIPVRMLNEHVYCPRLAYLMWVQGEFSHNEFTVDGVIRHRRVDAGGGVLPSETQEDSRIHARSVSLSSERL
GITAKIDLVEGEGAYVSPVDYKRGKRPHVAGGAYEPERVQLCAQGLLLREHGFASDGGALYFVASRERVPVAFDDELIGR
TLAAIDEMGRTALSGTMPPPLEDSPKCPRCSLVGICLPDEVRFLSHLSVEPRPIIPADGRGLPLYVQSPKAYVRKDGDCL
VIEEERVRVAEARLGETSQVALFGNATLTTAALHECLRREIPVTWLSYGGWFMGHTVSTGHRNVETRTYQYQRSFDPETC
LNLARRWIVAKIANCRTLLRRNWRGEGDEAKAPPGLLMSLQDDMRHAMRAPSLEVLLGIEGASAGRYFQHFSRMLRGGDG
EGMGFDFTTRNRRPPKDPVNALLSFAYAMLTREWTVALAAVGLDPYRGFYHQPRFGRPALALDMMEPFRPLIADSTVLMA
INNGEIRTGDFVRSAGGCNLTDSARKRFIAGFERRMEQEVTHPIFKYTISYRRLLEVQARLLTRYLSGEIPAYPNFVTR
>Q1CW50 3.1.-.-~~~cas4-cas1~~~CRISPR-associated exonuclease Cas4/endonuclease Cas1 fusion~~~COG1468
MSVVVTRYRGGGPQYMNASSTSPKPVVGEPSIRTHALHALAYCERLFYLEEVEELRVADAAVFAGRRLHVQLQEEGEHVE
LELASEALGLHGRVDAVKTREGTLVVYEHKRGRHAPGGDAPEAWPSDRLQAGAYALLVEERFPGAPVECRVRYHQTDTTV
RFPLDAALRGAVVAAVARARLLRASRERPPVTQEERKCAKCSLAPVCLPEEERQVVGEERPRLFPEDDVRQVLHVATPGT
RVGRAAEELVVTPPEGEGAPSRQPGRMVSALIAHGAVQVSAQALAYCVENDIGVHWFTSGGRYLGGLGGGAGNVHRRLRQ
FEALRQASVCLGLARRLVAAKLEGQLRFLLRASRGDSESRQVLASAVRDLRALLPKCEEAPSLEVLLGLEGAGAARYFGA
LPYLQGEDVDTRLRFEGRNRRPPRDRFNAVLGFLFGLVHREVEAAIRAVGLDVAFGFYHQPRGTAGPLGLDVMELFRVPL
ADMPLVASVNRRAWDADADFEVTSEHVWLSKAGRAKAIELYERRKRETWKNNVLGYSLSYARLVELEVRLLEKEWTGKPG
LFATFRLR
>P0DV90 ~~~~~~Retron Se72 cold shock-like protein~~~
MENGFVNFYDHVKGYGFIRRERGRDVFFRYDDFLFLGHDVDICKGILVRFKLEKTDKGFKAVAIQKV
>P37584 ~~~csaA~~~Probable chaperone CsaA~~~COG0073
MAVIDDFEKLDIRTGTIVKAEEFPEARVPAIKLVIDFGTEIGIKQSSAQITKRYKPEGLINKQVIAVVNFPPRRIAGFKS
EVLVLGGIPGQGDVVLLQPDQPVPNGTKIG
>P37953 ~~~csbA~~~Protein CsbA~~~COG4897
MITKAVFALFFPFMLVVLFTRVTFNHYVAIALTAALLFASYLKGYTETYFIVGLDVVSLVAGGLYMAKKAAEKKEE
>Q45539 2.4.-.-~~~csbB~~~Putative glycosyltransferase CsbB~~~COG0463
MKQGLISIIIPSYNEGYNVKLIHESLKKEFKNIHYDYEIFFINDGSVDDTLQQIKDLAATCSRVKYISFSRNFGKEAAIL
AGFEHVQGEAVIVMDADLQHPTYLLKEFIKGYEEGYDQVIAQRNRKGDSFVRSLLSSMYYKFINKAVEVDLRDGVGDFRL
LSRQAVNALLKLSEGNRFSKGLFCWIGFDQKIVFYENVERKNGTSKWSFSSLFNYGMDGVVSFNHKPLRLCFYTGIFILL
LSIIYIIATFVKILTNGISVPGYFTIISAVLFLGGVQLLSLGIIGEYIGRIYYETKKRPHYLIKEANIPNKDLPETNELK
SMRRLTKMH
>P46333 ~~~csbC~~~Probable metabolite transport protein CsbC~~~COG2814
MKKDTRKYMIYFFGALGGLLYGYDTGVISGALLFINNDIPLTTLTEGLVVSMLLLGAIFGSALSGTCSDRWGRRKVVFVL
SIIFIIGALACAFSQTIGMLIASRVILGLAVGGSTALVPVYLSEMAPTKIRGTLGTMNNLMIVTGILLAYIVNYLFTPFE
AWRWMVGLAAVPAVLLLIGIAFMPESPRWLVKRGSEEEARRIMNITHDPKDIEMELAEMKQGEAEKKETTLGVLKAKWIR
PMLLIGVGLAIFQQAVGINTVIYYAPTIFTKAGLGTSASALGTMGIGILNVIMCITAMILIDRVGRKKLLIWGSVGITLS
LAALSGVLLTLGLSASTAWMTVVFLGVYIVFYQATWGPVVWVLMPELFPSKARGAATGFTTLVLSAANLIVSLVFPLMLS
AMGIAWVFMVFSVICLLSFFFAFYMVPETKGKSLEEIEASLKKRFKKKKSTQNQVLNERTL
>P70964 ~~~csbD~~~Stress response protein CsbD~~~COG3237
MGNDSVKDKMKGGFNKAKGEVKDKVGDMADRTDMQAEGKKDKAKGEIQKDIGKAKDKFSDKD
>O05390 ~~~csbX~~~Alpha-ketoglutarate permease~~~COG2814
MNTVHAKGNVLNKIGIPSHMVWGYIGVVIFMVGDGLEQGWLSPFLVDHGLSMQQSASLFTMYGIAVTISAWLSGTFVQTW
GPRKTMTVGLLAFILGSAAFIGWAIPHMYYPALLGSYALRGLGYPLFAYSFLVWVSYSTSQNILGKAVGWFWFMFTCGLN
VLGPFYSSYAVPAFGEINTLWSALLFVAAGGILALFFNKDKFTPIQKQDQPKWKELSKAFTIMFENPKVGIGGVVKTINA
IGQFGFAIFLPTYLARYGYSVSEWLQIWGTLFFVNIVFNIIFGAVGDKLGWRNTVMWFGGVGCGIFTLALYYTPQLIGHQ
YWVLMIIACCYGAALAGYVPLSALLPTLAPDNKGAAMSVLNLGSGLCAFIAPGIVSLFIGPLGAGGVIWIFAALYFFSAF
LTRFLTISEQSTDVYTEERFVRENVQTNFDKTVKQ
>P30000 ~~~cscB~~~Sucrose permease~~~
MALNIPFRNAYYRFASSYSFLFFISWSLWWSLYAIWLKGHLGLTGTELGTLYSVNQFTSILFMMFYGIVQDKLGLKKPLI
WCMSFILVLTGPFMIYVYEPLLQSNFSVGLILGALFFGLGYLAGCGLLDSFTEKMARNFHFEYGTARAWGSFGYAIGAFF
AGIFFSISPHINFWLVSLFGAVFMMINMRFKDKDHQCIAADAGGVKKEDFIAVFKDRNFWVFVIFIVGTWSFYNIFDQQL
FPVFYAGLFESHDVGTRLYGYLNSFQVVLEALCMAIIPFFVNRVGPKNALLIGVVIMALRILSCALFVNPWIISLVKLLH
AIEVPLCVISVFKYSVANFDKRLSSTIFLIGFQIASSLGIVLLSTPTGILFDHAGYQTVFFAISGIVCLMLLFGIFFLSK
KREQIVMETPVPSAI
>Q46925 2.8.1.7~~~csdA~~~Cysteine desulfurase CsdA~~~COG0520
MNVFNPAQFRAQFPALQDAGVYLDSAATALKPEAVVEATQQFYSLSAGNVHRSQFAEAQRLTARYEAAREKVAQLLNAPD
DKTIVWTRGTTESINMVAQCYARPRLQPGDEIIVSVAEHHANLVPWLMVAQQTGAKVVKLPLNAQRLPDVDLLPELITPR
SRILALGQMSNVTGGCPDLARAITFAHSAGMVVMVDGAQGAVHFPADVQQLDIDFYAFSGHKLYGPTGIGVLYGKSELLE
AMSPWLGGGKMVHEVSFDGFTTQSAPWKLEAGTPNVAGVIGLSAALEWLADYDINQAESWSRSLATLAEDALAKRPGFRS
FRCQDSSLLAFDFAGVHHSDMVTLLAEYGIALRAGQHCAQPLLAELGVTGTLRASFAPYNTKSDVDALVNAVDRALELLV
D
>P0AGF2 ~~~csdE~~~Sulfur acceptor protein CsdE~~~COG2166
MTNPQFAGHPFGTTVTAETLRNTFAPLTQWEDKYRQLIMLGKQLPALPDELKAQAKEIAGCENRVWLGYTVAENGKMHFF
GDSEGRIVRGLLAVLLTAVEGKTAAELQAQSPLALFDELGLRAQLSASRSQGLNALSEAIIAATKQV
>P9WQ69 2.8.1.7~~~csd~~~Probable cysteine desulfurase~~~COG0520
MTASVNSLDLAAIRADFPILKRIMRGGNPLAYLDSGATSQRPLQVLDAEREFLTASNGAVHRGAHQLMEEATDAYEQGRA
DIALFVGADTDELVFTKNATEALNLVSYVLGDSRFERAVGPGDVIVTTELEHHANLIPWQELARRTGATLRWYGVTDDGR
IDLDSLYLDDRVKVVAFTHHSNVTGVLTPVSELVSRAHQSGALTVLDACQSVPHQPVDLHELGVDFAAFSGHKMLGPNGI
GVLYGRRELLAQMPPFLTGGSMIETVTMEGATYAPAPQRFEAGTPMTSQVVGLAAAARYLGAIGMAAVEAHERELVAAAI
EGLSGIDGVRILGPTSMRDRGSPVAFVVEGVHAHDVGQVLDDGGVAVRVGHHCALPLHRRFGLAATARASFAVYNTADEV
DRLVAGVRRSRHFFGRA
>P99177 2.8.1.7~~~csd~~~Probable cysteine desulfurase~~~
MAEHSFDVNEVIKDFPILDQKVNGKRLAYLDSTATSQTPMQVLNVLEDYYKRYNSNVHRGVHTLGSLATDGYENARETVR
RFINAKYFEEIIFTRGTTASINLVAHSYGDANVEEGDEIVVTEMEHHANIVPWQQLAKRKNATLKFIPMTADGELNIEDI
KQTINDKTKIVAIAHISNVLGTINDVKTIAEIAHQHGAIISVDGAQAAPHMKLDMQEMNADFYSFSGHKMLGPTGIGVLF
GKRELLQKMEPIEFGGDMIDFVSKYDATWADLPTKFEAGTPLIAQAIGLAEAIRYLERIGFDAIHKYEQELTIYAYEQMS
AIEGIEIYGPPKDRRAGVITFNLQDVHPHDVATAVDTEGVAVRAGHHCAQPLMKWLNVSSTARASFYIYNTKEDIDQLIN
ALKQTKEFFSYEF
>Q55793 2.8.1.7~~~csd~~~Probable cysteine desulfurase~~~COG0520
MVALQIPSLAATVRQDFPILNQEINGHPLVYLDNAATSQKPRAVLEKLMHYYENDNANVHRGAHQLSVRATDAYEAVRNK
VAKFINARSPREIVYTRNATEAINLVAYSWGMNNLKAGDEIITTVMEHHSNLVPWQMVAAKTGAVLKFVQLDEQESFDLE
HFKTLLSEKTKLVTVVHISNTLGCVNPAEEIAQLAHQAGAKVLVDACQSAPHYPLDVQLIDCDWLVASGHKMCAPTGIGF
LYGKEEILEAMPPFFGGGEMIAEVFFDHFTTGELPHKFEAGTPAIAEAIALGAAVDYLTDLGMENIHNYEVELTHYLWQG
LGQIPQLRLYGPNPKHGDRAALASFNVAGLHASDVATMVDQDGIAIRSGHHCTQPLHRLFDASGSARASLYFYNTKEEID
LFLQSLQATIRFFSDDDFTV
>O31700 ~~~~~~Sporulation protein cse15~~~
MKRSGPFFHDVSQENLYLKSELSRCHKLISELEASYFHQKNNKLLKENTDMKEKLQQLSAELTHMSTKEKHASHTSQTLH
QIRAELLDKIVVLQELLSAETYKRRAEIEEKHKLHIAKVKIEEENKNLHKRISELQASIEQEQNALLQAKQQAELIKAEN
GRLKEQMVEKEYQLKHIKIEVDHMKDRIIETKERLLDIEKTKEKLFHETIISYKRQLDESDAWIASHFADIDGGTKQKEK
TEEEAPAAYAQPNHVETILEDVTKQIHVLQKQLAHAQSSDQAKSHTIEELKNRAAEEKPYQKWVYKLNLEKENKPSQKKP
Q
>Q46901 ~~~casA~~~CRISPR system Cascade subunit CasA~~~
MNLLIDNWIPVRPRNGGKVQIINLQSLYCSRDQWRLSLPRDDMELAALALLVCIGQIIAPAKDDVEFRHRIMNPLTEDEF
QQLIAPWIDMFYLNHAEHPFMQTKGVKANDVTPMEKLLAGVSGATNCAFVNQPGQGEALCGGCTAIALFNQANQAPGFGG
GFKSGLRGGTPVTTFVRGIDLRSTVLLNVLTLPRLQKQFPNESHTENQPTWIKPIKSNESIPASSIGFVRGLFWQPAHIE
LCDPIGIGKCSCCGQESNLRYTGFLKEKFTFTVNGLWPHPHSPCLVTVKKGEVEEKFLAFTTSAPSWTQISRVVVDKIIQ
NENGNRVAAVVNQFRNIAPQSPLELIMGGYRNNQASILERRHDVLMFNQGWQQYGNVINEIVTVGLGYKTALRKALYTFA
EGFKNKDFKGAGVSVHETAERHFYRQSELLIPDVLANVNFSQADEVIADLRDKLHQLCEMLFNQSVAPYAHHPKLISTLA
LARATLYKHLRELKPQGGPSNG
>Q53VY1 ~~~cse1~~~CRISPR-associated protein CasA/Cse1~~~
MGSLEKFNLIDEPWIPVLKGGRVVEVGIGEALLRAHEFARIETPSPLEEAVLHRLLLAVLHRALSGPRCPEDVLDWWRKG
GFPQDPIRDYLNRFRDRFFLFHPEAPFLQVADLPEENPLPWSKLLPELASGNNPTLFDHTTEENLPKATYAQAARALLVH
QAFAPGGLLRRYGVGSAKDAPVARPALFLPTGQNLLETLLLNLVPYTPEDDAPIWEVPPLRLGDLEGARTKWPLTGRTRV
YTWPARGVRLLDEGDGVRFMGYGPGVEPLEATHRDPMVAQRLDAKGNLLVLRLSEERSFWRDFSAMLPRQGGKVAATLEH
AENLQGELEDEGLEGRITLRVLGQVSDQAKVLDIRREVYPLPSGLLTPKAEENLEKALKMAEELGQGLKHLAQEVAKAVV
GERDRGHGRSPYLEELTKLANSLPLERLYWHALDGAFPRFFARVEEEASLDLWREALRGAALEAWKATRRFLGTGARHLK
ALAQGEQEFGRLLGELGEEVRT
>P76632 ~~~casB~~~CRISPR system Cascade subunit CasB~~~
MADEIDAMALYRAWQQLDNGSCAQIRRVSEPDELRDIPAFYRLVQPFGWENPRHQQALLRMVFCLSAGKNVIRHQDKKSE
QTTGISLGRALANSGRINERRIFQLIRADRTADMVQLRRLLTHAEPVLDWPLMARMLTWWGKRERQQLLEDFVLTTNKNA
>Q53VY0 ~~~cse2~~~CRISPR-associated protein Cse2~~~
MSPGERFLDWLKRLQGQKAWTAARAAFRRSLAFPPGAYPRAMPYVEPFLAKGDWRQEEREAHYLVAALYALKDGDHQVGR
TLARALWEKAQGSASVEKRFLALLEADRDQIAFRLRQAVALVEGGIDFARLLDDLLRWFSPERHVQARWAREYYGAGASE
EEKKKEVEA
>P94496 ~~~~~~Sporulation protein cse60~~~
MLKVAVFDEEHEKDLQTEINSFLKGISEEQLIDIKYTVSAACDPDGEQLYCFSALILYRK
>Q9ZEP5 ~~~cseA~~~Lipoprotein CseA~~~
MRGLTDGRTPRGTRRTTQAASTAVAVFVALGVSLAGCGTGGTGARDEGPAHADAVGGAGSASPAPAAKASPSKAPDRVDA
VRLVKADPKVSPEVKRELKPCVADEYPIDVSYGKVTDGSADDVVVNVLTCGDAVGVGSYVYREEDGAYQNVFKAEEPPVY
AEIDRGDLVVTKQVYDKGDPVSSPSGENVITYRWASDRFTEEYRTHNDYSKAAGNAPTPAPEPDS
>Q9ZEP4 ~~~cseB~~~Transcriptional regulatory protein CseB~~~COG0745
MADQTHVLFVEDDDVIREATQLALERDGFAVTAMPDGLSGLESFRADRPDIALLDVMLPGLDGVSLCRRIRDESTVPVIM
LSARADSIDVVLGLEAGADDYVTKPFDGAVLVARIRAVLRRFGHAGGGDRTEGAGAAETGGVLTFGDLEVDTDGMEVRRA
GRPVGLTPTEMRLLLEFSSAPGTVLSRDKLLERVWDYGWGGDTRVVDVHVQRLRTKIGQDRIETVRGFGYKLKA
>Q9ZEP3 2.7.13.3~~~cseC~~~Sensor protein CseC~~~COG2205
MRGFFRQRRSVSPPGHPYDRTGPGEHAGPGARTGPGGRPRVLGVRGLRARGIRTGLRWKLSAAIALVGALVAIALSLVVH
NAARVSMLDNARDLADDRVLIAQRNYELSGRQNFPNAQIDDPALPPELRRKIDAGRRATYVSERPDGVTDIWAAVPLKDG
HVMSLHSGFTDRSADILSDLDQALVIGSIAVVLGGSALGVLIGGQLSRRLREAAAAANRVASGEPDVRVRDAIGGVVRDE
TDDVARAVDAMADALQQRIEAERRVTADIAHELRTPVTGLLTAAELLPPGRPTELVLDRAKAMRTLVEDVLEVARLDGAS
ERAELQDIMLGDFVSRRVAAKDPAVEVRVIHESEVTTDPRRLERVLFNLLANAARHGRSPVEVSVEGRVIRVRDHGPGFP
EDLLAEGPSRFRTGSTDRAGRGHGLGLTIAAGQARVLGARLTFRNVRPAGAPAHIPAEGAVAVLWLPEHAPTNTGSYPML
PDRSKSGASSSARDMSREASQGMSRKP
>P54379 ~~~csgA~~~Sigma-G-dependent sporulation-specific SASP protein~~~
MDVTLGYLRESLSNHLENEVCQRICKKMLAKRYANEEEFVKDLDDNEMSFLNHVLEKEIKYAQNEQDQKRAKELNEVYEL
LL
>P28307 ~~~csgA~~~Major curlin subunit~~~
MKLLKVAAIAAIVFSGSALAGVVPQYGGGGNHGGGGNNSGPNSELNIYQYGGGNSALALQTDARNSDLTITQHGGGNGAD
VGQGSDDSSIDLTQRGFGNSATLDQWNGKNSEMTVKQFGGGNGAAVDQTASNSSVNVTQVGFGNNATAHQY
>P21158 ~~~csgA~~~C-signal~~~
MRAFATNVCTGPVDVLINNAGVSGLWCALGDVDYADMARTFTINALGPLRVTSAMLPGLRQGALRRVAHVTSRMGSLAAN
TDGGAYAYRMSKAALNMAVRSMSTDLRPEGFVTVLLHPGWVQTDMGGPDATLPAPDSVRGMLRVIDGLNPEHSGRFFDYQ
GTEVPW
>P0A1E7 ~~~csgA~~~Major curlin subunit~~~
MKLLKVAAFAAIVVSGSALAGVVPQWGGGGNHNGGGNSSGPDSTLSIYQYGSANAALALQSDARKSETTITQSGYGNGAD
VGQGADNSTIELTQNGFRNNATIDQWNAKNSDITVGQYGGNNAALVNQTASDSSVMVRQVGFGNNATANQY
>P0ABK7 ~~~csgB~~~Minor curlin subunit~~~
MKNKLLFMMLTILGAPGIAAAAGYDLANSEYNFAVNELSKSSFNQAAIIGQAGTNNSAQLRQGGSKLLAVVAQEGSSNRA
KIDQTGDYNLAYIDQAGSANDASISQGAYGNTAMIIQKGSGNKANITQYGTQKTAIVVQRQSQMAIRVTQR
>P0A1E9 ~~~csgB~~~Minor curlin subunit~~~
MKNKLLFMMLTILGAPGIATATNYDLARSEYNFAVNELSKSSFNQAAIIGQVGTDNSARVRQEGSKLLSVISQEGGNNRA
KVDQAGNYNFAYIEQTGNANDASISQSAYGNSAAIIQKGSGNKANITQYGTQKTAVVVQKQSHMAIRVTQR
>P52107 ~~~csgC~~~Curli assembly protein CsgC~~~
MNTLLLLAALSSQITFNTTQQGDVYTIIPEVTLTQSCLCRVQILSLREGSSGQSQTKQEKTLSLPANQPIALTKLSLNIS
PDDRVKIVVTVSDGQSLHLSQQWPPSSEKS
>P0A1Z9 ~~~csgC~~~Curli assembly protein CsgC~~~
MHTLLLLAALSNQITFTTTQQGDIYTVIPQVTLNEPCVCQVQILSVRDGVGGQSHTQQKQTLSLPANQPIELSRLSVNIS
SEDSVKIIVTVSDGQSLHLSQQWPPSAQ
>P52106 ~~~csgD~~~CsgBAC operon transcriptional regulatory protein~~~COG2197
MFNEVHSIHGHTLLLITKSSLQATALLQHLKQSLAITGKLHNIQRSLDDISSGSIILLDMMEADKKLIHYWQDTLSRKNN
NIKILLLNTPEDYPYRDIENWPHINGVFYSMEDQERVVNGLQGVLRGECYFTQKLASYLITHSGNYRYNSTESALLTHRE
KEILNKLRIGASNNEIARSLFISENTVKTHLYNLFKKIAVKNRTQAVSWANDNLRR
>O54294 ~~~csgD~~~Probable csgAB operon transcriptional regulatory protein~~~
MFNEVHSSHGHTLLLITKPSLQATALLQHLKQSLAITGKLHNIQRSLEDISAGCIVLMDMMEADKKLIHYWQDNLSRKNN
NIKTLLLNTPDDYPYREIENWPHINGVFYATEDQEHVVSGLQGILRGECYFSQKLASYLITHSGNYRYNSTESALLTHRE
KEILNKLRIGASNNEIARSLFISENTVKTHLYNLFKKIAVKNRTQAVSWANDNLRR
>P0AE95 ~~~csgE~~~Curli production assembly/transport component CsgE~~~
MKRYLRWIVAAEFLFAAGNLHAVEVEVPGLLTDHTVSSIGHDFYRAFSDKWESDYTGNLTINERPSARWGSWITITVNQD
VIFQTFLFPLKRDFEKTVVFALIQTEEALNRRQINQALLSTGDLAHDEF
>P0AE98 ~~~csgF~~~Curli production assembly/transport component CsgF~~~
MRVKHAVVLLMLISPLSWAGTMTFQFRNPNFGGNPNNGAFLLNSAQAQNSYKDPSYNDDFGIETPSALDNFTQAIQSQIL
GGLLSNINTGKPGRMVTNDYIVDIANRDGQLQLNVTDRKTGQTSTIQVSGLQNNSTDF
>P0AEA4 ~~~csgG~~~Curli production assembly/transport component CsgG~~~COG1462
MQRLFLLVAVMLLSGCLTAPPKEAARPTLMPRAQSYKDLTHLPAPTGKIFVSVYNIQDETGQFKPYPASNFSTAVPQSAT
AMLVTALKDSRWFIPLERQGLQNLLNERKIIRAAQENGTVAINNRIPLQSLTAANIMVEGSIIGYESNVKSGGVGARYFG
IGADTQYQLDQIAVNLRVVNVSTGEILSSVNTSKTILSYEVQAGVFRFIDYQRLLEGEVGYTSNEPVMLCLMSAIETGVI
FLINDGIDRGLWDLQNKAERQNDILVKYRHMSVPPES
>P0AEA2 ~~~csgG~~~Curli production assembly/transport component CsgG~~~COG1462
MQRLFLLVAVMLLSGCLTAPPKEAARPTLMPRAQSYKDLTHLPAPTGKIFVSVYNIQDETGQFKPYPASNFSTAVPQSAT
AMLVTALKDSRWFIPLERQGLQNLLNERKIIRAAQENGTVAINNRIPLQSLTAANIMVEGSIIGYESNVKSGGVGARYFG
IGADTQYQLDQIAVNLRVVNVSTGEILSSVNTSKTILSYEVQAGVFRFIDYQRLLEGEVGYTSNEPVMLCLMSAIETGVI
FLINDGIDRGLWDLQNKAERQNDILVKYRHMSVPPES
>Q81IT9 3.6.4.13~~~cshA~~~DEAD-box ATP-dependent RNA helicase CshA~~~
MTTFRELGLSDSLLQSVESMGFEEATPIQAETIPHALQGKDIIGQAQTGTGKTAAFGLPLLDKVDTHKESVQGIVIAPTR
ELAIQVGEELYKIGKHKRVRILPIYGGQDINRQIRALKKHPHIIVGTPGRILDHINRKTLRLQNVETVVLDEADEMLNMG
FIEDIEAILTDVPETHQTLLFSATMPDPIRRIAERFMTEPQHIKVKAKEVTMPNIQQFYLEVQEKKKFDVLTRLLDIQSP
ELAIVFGRTKRRVDELSEALNLRGYAAEGIHGDLTQAKRMSVLRKFKEGSIEVLVATDVAARGLDISGVTHVYNFDIPQD
PESYVHRIGRTGRAGKKGIAMLFVTPRESGQLKNIERTTKRKMDRMDAPTLDEALEGQQRLIAEKLQSTIENENLAYYKR
IAEEMLEENDSVTVVAAALKMMTKEPDTTPIALTSEPPVVSRGGGSKKRGGNGGGYRDGNRNRSRDGRGGGDGRNRDRNR
DGRNRDGNRDRNRDGNRDRNRDGGSRGRRGEGQGRPGSSNGRGERKHHSRPQA
>P96614 3.6.4.13~~~cshA~~~DEAD-box ATP-dependent RNA helicase CshA~~~COG0513
MTITFQDFNLSSDLMKAINRMGFEEATPIQAQTIPLGLSNKDVIGQAQTGTGKTAAFGIPLVEKINPESPNIQAIVIAPT
RELAIQVSEELYKIGQDKRAKVLPIYGGQDIGRQIRALKKNPNIIVGTPGRLLDHINRRTIRLNNVNTVVMDEADEMLNM
GFIDDIESILSNVPSEHQTLLFSATMPAPIKRIAERFMTEPEHVKVKAKEMTVSNIQQFYLEVQERKKFDTLTRLLDIQS
PELAIVFGRTKRRVDELAEALNLRGYAAEGIHGDLTQAKRMVALRKFKEGAIEVLVATDVAARGLDISGVTHVYNFDVPQ
DPESYVHRIGRTGRAGKTGMAMTFITPREKSMLRAIEQTTKRKMDRMKEPTLDEALEGQQQVTVERLRTTISENNLNFYM
TAAAELLEDHDAVTVVAAAIKMATKEPDDTPVRLTDEAPMVSKRYKNQRSSKRRDGQGGGYRGGKGKSNNRSSYDKKRSN
DRRSSGDRRQKKSY
>Q8Y8N0 3.6.4.13~~~cshA~~~ATP-dependent RNA helicase CshA~~~COG0513
MTKFSEFGLDEKIVKSVNRMGFEEATPIQEKTIPLGLEGKDLIGQAQTGTGKTAAFGLPMIHKIDQKSNNVQALIIAPTR
ELAIQVSEELYKLSYDKHVRVLAVYGGSDISRQIRSLKKNPQIVVGTPGRILDHINRRTLKLDHVETLVLDEADEMLNMG
FIDDIETILKEVPAERQTLLFSATMPDPIRRIGERFMHSPELIRIKAKEMTALLIEQFFVKVHEKEKFDVLSRLLDVQAP
ELAIVFGRTKRRVDELSRALDMRGYVAEGIHGDLTQAKRMSVLRKFKEGKIDVLVATDVAARGLDISGVTHVYNYDIPQD
PESYVHRIGRTGRAGKEGMAITFVQPREMGYLRIVEETTKKRMQPLQAPTWDEAFAGQLRVATEKIQEAITEENLADYKT
FANELLEKYDATDIAAAMLKMLAKEPDKTPVHITEERPLPSRGGGGYKGKNGKGGKGGGYRGGSGKGGSYRDRNNSGKGR
RSGGGSGGGSGSGGGGNRDRRGGGEQRSGGNKGNYSQKSK
>Q2FWH5 3.6.4.13~~~cshA~~~DEAD-box ATP-dependent RNA helicase CshA~~~COG0513
MQNFKELGISDNTVQSLESMGFKEPTPIQKDSIPYALQGIDILGQAQTGTGKTGAFGIPLIEKVVGKQGVQSLILAPTRE
LAMQVAEQLREFSRGQGVQVVTVFGGMPIERQIKALKKGPQIVVGTPGRVIDHLNRRTLKTDGIHTLILDEADEMMNMGF
IDDMRFIMDKIPAVQRQTMLFSATMPKAIQALVQQFMKSPKIIKTMNNEMSDPQIEEFYTIVKELEKFDTFTNFLDVHQP
ELAIVFGRTKRRVDELTSALISKGYKAEGLHGDITQAKRLEVLKKFKNDQINILVATDVAARGLDISGVSHVYNFDIPQD
TESYTHRIGRTGRAGKEGIAVTFVNPIEMDYIRQIEDANGRKMSALRPPHRKEVLQAREDDIKEKVENWMSKESESRLKR
ISTELLNEYNDVDLVAALLQELVEANDEVEVQLTFEKPLSRKGRNGKPSGSRNRNSKRGNPKFDSKSKRSKGYSSKKKST
KKFDRKEKSSGGSRPMKGRTFADHQK
>Q99SH6 3.6.4.13~~~cshA~~~DEAD-box ATP-dependent RNA helicase CshA~~~
MQNFKELGISDNTVQSLESMGFKEPTPIQKDSIPYALQGIDILGQAQTGTGKTGAFGIPLIEKVVGKQGVQSLILAPTRE
LAMQVAEQLREFSRGQGVQVVTVFGGMPIERQIKALKKGPQIVVGTPGRVIDHLNRRTLKTDGIHTLILDEADEMMNMGF
IDDMRFIMDKIPAVQRQTMLFSATMPKAIQALVQQFMKSPKIIKTMNNEMSDPQIEEFYTIVKELEKFDTFTNFLDVHQP
ELAIVFGRTKRRVDELTSALISKGYKAEGLHGDITQAKRLEVLKKFKNDQINILVATDVAARGLDISGVSHVYNFDIPQD
TESYTHRIGRTGRAGKEGIAVTFVNPIEMDYIRQIEDANGRKMSALRPPHRKEVLQAREDDIKEKVENWMSKESESRLKR
ISTELLNEYNDVDLVAALLQELVEANDEVEVQLTFEKPLSRKGRNGKPSGSRNRNSKRGNPKFDSKSKRSKGYSSKKKST
KKFDRKEKSSGGSRPMKGRTFADHQK
>Q7A4G0 3.6.4.13~~~cshA~~~DEAD-box ATP-dependent RNA helicase CshA~~~
MQNFKELGISDNTVQSLESMGFKEPTPIQKDSIPYALQGIDILGQAQTGTGKTGAFGIPLIEKVVGKQGVQSLILAPTRE
LAMQVAEQLREFSRGQGVQVVTVFGGMPIERQIKALKKGPQIVVGTPGRVIDHLNRRTLKTDGIHTLILDEADEMMNMGF
IDDMRFIMDKIPAVQRQTMLFSATMPKAIQALVQQFMKSPKIIKTMNNEMSDPQIEEFYTIVKELEKFDTFTNFLDVHQP
ELAIVFGRTKRRVDELTSALISKGYKAEGLHGDITQAKRLEVLKKFKNDQINILVATDVAARGLDISGVSHVYNFDIPQD
TESYTHRIGRTGRAGKEGIAVTFVNPIEMDYIRQIEDANGRKMSALRPPHRKEVLQAREDDIKEKVENWMSKESESRLKR
ISTELLNEYNDVDLVAALLQELVEANDEVEVQLTFEKPLSRKGRNGKPSGSRNRNSKRGNPKFDSKSKRSKGYSSKKKST
KKFDRKEKSSGGSRPMKGRTFADHQK
>Q818H2 3.6.4.13~~~cshB~~~DEAD-box ATP-dependent RNA helicase CshB~~~
MTQQTFTQYDFKPFLIDAVRELRFTEPTGIQQKIFPVVKKGVSVIGQSQTGSGKTHAYLLPTLNRINASREEVQLVITAP
TRELAQQIYEEIVKLTKFCAEDQMITARCLIGGTDKQRSIEKLKKQPHIVVGTPGRIKDLVEEQALFVHKANTIIVDEAD
LMLDMGFIHDVDKIAARMPKNLQMLVFSATIPQKLKPFLKKYMENPEHIHINPKQVAAGNIEHYLVPSKHRNKIDLVHKM
LLQFKPYLAVVFTNTKKMADQVADGLMERGLKVGRIHGDLSPRDRKKMMKQIRDLEFQYIVATDLAARGIDIQGISHVIN
YQPPSDLDFFVHRVARTARAGHSGIAVTIYDPANEEALDSLEKQRHIEFKHVDLRGDEWADLGERRRRKSRKKPNDELDV
MATKVIKKPKKVKPNYKRKLATERDKVKRKYSNKKR
>P54475 3.6.4.13~~~cshB~~~DEAD-box ATP-dependent RNA helicase CshB~~~COG0513
MKETKFELYELKPFIIDAVHRLGFYEPTDIQKRLIPAVLKKESVIGQSQTGTGKTHAYLLPLLNKIDPAKDVVQVVITAP
TRELANQIYQEALKITQGEEGSQIRSKCFIGGTDKQKSIDKLKIQPHLVVGTPGRIADLIKEQALSVHKAESLVIDEADL
MLDMGFLADVDYIGSRMPEDLQMLVFSATIPEKLKPFLKKYMENPKYAHVEPKQVTAAKIEHILIPSKHRDKDKLLFDIM
SHLNPYLGIVFANTKNTADHIAQYLTGKGMKIGLLHGGLTPRERKKVMKQINDLEFTYIIATDLAARGIDIKGVSHVINY
ELPDDLDFYVHRVGRTARAGSSGQAMTIYELTDEDALVRLEKMGIEFEYLELEKGEWKKGDDRQRRKKRKKTPNEADEIA
HRLVKKPKKVKPGYKKKMSYEMEKIKKKQRRNQSKKRK
>Q8Y755 3.6.4.13~~~cshB~~~DEAD-box ATP-dependent RNA helicase CshB~~~COG0513
MTKKSRFDQFGFQPFIGLAIDKLGFYEPTEVQQKLIPGILKGESIIGQSQTGTGKTHTFILPIINNVNPEKDAVQAVITA
PSRELATQIYNEIRKVTKYSEKEIAVQLVIGGTDKQRAIDKLKKQPQIIVGTPGRINDLIREQALFVHTAKTLVIDEADM
TLDMGFLNDVDHIAGKMPANLQMLVFSATIPQKLKPFLSKYMENPRYEHIQPKVAASKTVEHRIMATRSRNKLDLLKNVL
VGSQPYLAIVFTNTKTTADEVANGLIERGLKVAKIHGDVNPRERKRTMKQIENLDYQYVVATDLAARGIDIQGISHVVNY
ELPDDLDFYIHRTGRTGRAGHSGIALTLFEPADEDRLNQLEKMGIEFKHVDWKNKEFVTLEDRNRRAKREAKRETADPRE
IGMRKKAKQKGKPNYKKKINYKMNEIKRRERRKKR
>Q81E85 3.6.4.13~~~cshC~~~DEAD-box ATP-dependent RNA helicase CshC~~~
MIKDMQPFLQQAWEKAGFKELTEIQKQAIPTILEGQDVIAESPTGTGKTLAYLLPLLHKINPEVKQPQVVVLAPTRELVM
QIHEEVQKFTAGTEISGASLIGGADIKRQVEKLKKHPRVIVGSPGRILELIRMKKLKMHEVKTIVFDEFDQIVKQKMMGA
VQDVIKSTMRDRQLVFFSATMTKAAEDAARDLAVEPQLVRVTRAESKSLVEHTYIICERREKNDYVRRIMHMGDVKAVAF
LNDPFRLDEITEKLKFRKMKAAALHAEASKQEREATMRAFRGGKLEILLATDIAARGIDIDDLTHVIHLELPDTVDQYIH
RSGRTGRMGKEGTVVSLVTPQEERKLLQFAKKLGIVFTKQEMFKGSFVETKPKAPKKKKPAFTGKKKPR
>Q81DF9 3.6.4.13~~~cshE~~~DEAD-box ATP-dependent RNA helicase CshE~~~
MVYLKNFLELGISETFNHTLRENGITEATPIQEKAIPVILSGKDIIGQAKTGTGKTLAFVLPILEKIDPECSDVQALIVA
PTRELALQITTEIKKMLVQREDINVLAIYGGQDVAQQLRKLKGNTHIVVATPGRLLDHIRRETIDLSNLSTIVLDEADQM
LYFGFLYDIEDILDETPGSKQTMLFSATMPKDIKKLAKRYMDEPQMIQVQSEEVTVDTIEQRVIETTDRAKPDALRFVMD
RDQPFLAVIFCRTKVRASKLYDNLKGLGYNCAELHGDIPQAKRERVMKSFREAKIQYLIATDVAARGLDVDGVTHVFNYD
IPEDVESYIHRIGRTGRAGGSGLAITFVAAKDEKHLEEIEKTLGAPIQREIIEQPKIKRVDENGKPVPKPAPKKSGQNRQ
RDSREGSRSDSRRDSRNSSRSDSRNSSRNSSRNENNRSFNKPSNKKGSTKQGQQRRGR
>P32400 3.5.1.59~~~~~~N-carbamoylsarcosine amidase~~~
MTETSGTFNDIEARLAAVLEEAFEAGTSIYNERGFKRRIGYGNRPAVIHIDLANAWTQPGHPFSCPGMETIIPNVQRINE
AARAKGVPVFYTTNVYRNRDASSGTNDMGLWYSKIPTETLPADSYWAQIDDRIAPADGEVVIEKNRASAFPGTNLELFLT
SNRIDTLIVTGATAAGCVRHTVEDAIAKGFRPIIPRETIGDRVPGVVQWNLYDIDNKFGDVESTDSVVQYLDALPQFEDT
VPKTLSDPQPEVEAPADPVFAEQH
>P94497 ~~~~~~Protein csk22~~~
MHLTLQSVYPAIIIIFFLYKKIKRSIGYQPLKPRWLFTRIILFSLFAFGLSIFSAIHPFLYGYLILGILGGWLLVFFAKK
NISFEKRRGKIYFRTHIWVEVILLTLFLSRFLYRVTELYLTSPDLNRLGSYSQSIGTDPLTIGVCFLIAVYYIGFSSFII
KLSRNELEQHEYNKEKDILAR
>Q59288 4.2.2.5~~~cslA~~~Chondroitinase-AC~~~COG5492
MKKLFVTCIVFFSILSPALLIAQQTGTAELIMKRVMLDLKKPLRNMDKVAEKNLNTLQPDGSWKDVPYKDDAMTNWLPNN
HLLQLETIIQAYIEKDSHYYGDDKVFDQISKAFKYWYDSDPKSRNWWHNEIATPQALGEMLILMRYGKKPLDEALVHKLT
ERMKRGEPEKKTGANKTDIALHYFYRALLTSDEALLSFAVKELFYPVQFVHYEEGLQYDYSYLQHGPQLQISSYGAVFIT
GVLKLANYVRDTPYALSTEKLAIFSKYYRDSYLKAIRGSYMDFNVEGRGVSRPDILNKKAEKKRLLVAKMIDLKHTEEWA
DAIARTDSTVAAGYKIEPYHHQFWNGDYVQHLRPAYSFNVRMVSKRTRRSESGNKENLLGRYLSDGATNIQLRGPEYYNI
MPVWEWDKIPGITSRDYLTDRPLTKLWGEQGSNDFAGGVSDGVYGASAYALDYDSLQAKKAWFFFDKEIVCLGAGINSNA
PENITTTLNQSWLNGPVISTAGKTGRGKITTFKAQGQFWLLHDAIGYYFPEGANLSLSTQSQKGNWFHINNSHSKDEVSG
DVFKLWINHGARPENAQYAYIVLPGINKPEEIKKYNGTAPKVLANTNQLQAVYHQQLDMVQAIFYTAGKLSVAGIEIETD
KPCAVLIKHINGKQVIWAADPLQKEKTAVLSIRDLKTGKTNRVKIDFPQQEFAGATVELK
>Q46079 4.2.2.19~~~cslB~~~Chondroitinase-B~~~COG3420
MKMLNKLAGYLLPIMVLLNVAPCLGQVVASNETLYQVVKEVKPGGLVQIADGTYKDVQLIVSNSGKSGLPITIKALNPGK
VFFTGDAKVELRGEHLILEGIWFKDGNRAIQAWKSHGPGLVAIYGSYNRITACVFDCFDEANSAYITTSLTEDGKVPQHC
RIDHCSFTDKITFDQVINLNNTARAIKDGSVGGPAMYHRVDHCFFSNPQKPGNAGGGIRIGYYRNDIGRCLVDSNLFMRQ
DSEAEIITSKSQENVYYGNTYLNCQGTMNFRHGDHQVAINNFYIGNDQRFGYGGMFVWGSRHVIACNYFELSETIKSRGN
AALYLNPGAMASEHALAFDMLIANNAFINVNGYAIHFNPLDERRKEYCAANRLKFETPHQLMLKGNLFFKDKPYVYPFFK
DDYFIAGKNSWTGNVALGVEKGIPVNISANRSAYKPVKIKDIQPIEGIALDLNALISKGITGKPLSWDEVRPYWLKEMPG
TYALTARLSADRAAKFKAVIKRNKEH
>E6LHV6 ~~~csm2~~~CRISPR system Cms protein Csm2~~~COG1421
MELAKTKTGEMIDLNFARKVVEENKRVKDNRGRQEIVLFNGLTTSKLRNLLELINHVYTKVYNSDDTTLSEDVRDELEYL
KVKFAYESGREPAVRTFIEKTYVDKLVDVVLKKNTKKIFLDYCKYFEALVAYAKFYRMGD
>P9WJG1 ~~~csm2~~~CRISPR system Cms protein Csm2~~~COG1421
MSVIQDDYVKQAEVIRGLPKKKNGFELTTTQLRVLLSLTAQLFDEAQQSANPTLPRQLKEKVQYLRVRFVYQSGREDAVK
TFVRNAKLLEALEGIGDSRDGLLRFCRYMEALAAYKKYLDPKDK
>A0A0A7HIX1 ~~~csm2~~~CRISPR system Cms protein Csm2~~~
MAILTDENYVDKAERAISLLEKDNKGNYLLTTSQIRKLLSLCSSLYDRSKERKFDELINDVSYLRVQFVYQAGREIAVKD
LIEKAQILEALKEIKDRETLQRFCRYMEALVAYFKFYGGKD
>E6LHV5 3.1.-.-~~~csm3~~~CRISPR system Cms endoribonuclease Csm3~~~COG1337
MYSKIRIVGKIDVLTGLHIGGGGETSMIGAIDSPVVRDPYSRLPIIPGSSIKGKMRSLLAKHIGLIPGQKMHNQDAPEIL
RLFGSSQKGAIQSSRLQISDAFFSKASQEEFDKKDLAYTETKFENTISRLTAVANPRQIERVTRGASFDFHIIYNVENIN
EVMADFENIKTAIHLLENDYLGGGGTRGNGRIRFVIDSIDTVVGDFDSSNLSIK
>P9WJF9 3.1.-.-~~~csm3~~~CRISPR system Cms endoribonuclease Csm3~~~COG1337
MTTSYAKIEITGTLTVLTGLQIGAGDGFSAIGAVDKPVVRDPLSRLPMIPGTSLKGKVRTLLSRQYGADTETFYRKPNED
HAHIRRLFGDTEEYMTGRLVFRDTKLTNKDDLEARGAKTLTEVKFENAINRVTAKANLRQMERVIPGSEFAFSLVYEVSF
GTPGEEQKASLPSSDEIIEDFNAIARGLKLLELDYLGGSGTRGYGQVKFSNLKARAAVGALDGSLLEKLNHELAAV
>A0A0A7HIF0 3.1.-.-~~~csm3~~~CRISPR system Cms endoribonuclease Csm3~~~COG1337
MTFAKIKFSAQIRLETGLHIGGSDAFAAIGAIDSPVIKDPITNIPIIPGSSLKGKMRTLLAKVYNEKVAEKPSDDSDILS
RLFGNSKDKRFKMGRLIFRDAFLSNADELDSLGVRSYTEVKFENTIDRITAEANPRQIERAIRNSTFDFELIYEITDENE
NQVEEDFKVIRDGLKLLELDYLGGSGSRGYGKVAFEKLKATTVFGNYDVKTLNELLTAEV
>E6LHV4 ~~~csm4~~~CRISPR system Cms protein Csm4~~~COG1567
MNQLVVKLVKLTFKSPVHFGMKRLSDSNHTIAADTLFSALIIEALQQQLELSHLLNNLVITDLFPYNKTSYFLPKPLIRI
EGKKGDESGYKAFKKLTYIPVENYSEYLRGEIDSLEASKIAESLNLGKASLSTKVSLQAVDHNGESEPYSVGNFTFYPES
GLYFLAKGNADTIGQLEILMHALQYSGIGGKRSAGYGQFRCTIEDSGKFDSLLSQTGNIAILLSSAMASDEELVDCLEDA
RYLLKKRTGFVQSKTYADQLVKKKDFYAFSAGSTFYQKFNGKIFDVSDNGRHSVYRYAKAFWLEGKI
>P9WJF7 ~~~csm4~~~CRISPR system Cms protein Csm4~~~COG1567
MNSRLFRFDFDRTHFGDHGLESSTISCPADTLYSALCVEALRMGGQQLLGELVACSTLRLTDLLPYVGPDYLVPKPLHSV
RSDGSSMQKKLAKKIGFLPAAQLGSFLDGTADLKELAARQTKIGVHAVSAKAAIHNGKKDADPYRVGYFRFELDAGLWLL
ATGSESELGLLTRLLKGISALGGERTSGFGAFNLTESEAPAALTPTVDAASLMTLTTSLPTDDELEAALAGATYRLVKRS
GFVASSTYADMPLRKRDIYKFAAGSVFSRPFQGGILDVSLGGNHPVYSYARPLFLALPESAA
>A0A0A7HGA1 ~~~csm4~~~CRISPR system Cms protein Csm4~~~COG1567
MTYKLYIMTFQNAHFGSGTLDSSKLTFSADRIFSALVLESLKMGKLDAFLAEANQDKFTLTDAFPFQFGPFLPKPIGYPK
HDQIDQSVDVKEVRRQAKLSKKLQFLALENVDDYLNGELFENEEHAVIDTVTKNQPHKDGNLYQVATTRFSNDTSLYVIA
NESDLLNELMSSLQYSGLGGKRSSGFGRFELDIQNIPLELSDRLTKNHSDKVMSLTTALPVDADLEEAMEDGHYLLTKSS
GFAFSHATNENYRKQDLYKFASGSTFSKTFEGQIVDVRPLDFPHAVLNYAKPLFFKLEV
>E6LHV3 ~~~csm5~~~CRISPR system Cms protein Csm5~~~COG1332
MIEKVYQVKLKVYGPVHIGSGKIIRKQEYIYDRRKSLAHIVDGPNLVKFLNKKGKFTAYLQYLNTTKERADLYTFLRQEQ
IDTNDWKTFVLYTERVNQGKIDMKDHNPYSRTSTNRRQVDKGMNDLHLFVRDGRGDLYIPGSSLKGALRTVLEGANQSAE
AFHSLSISDSLPIDPKNLAIYQKIDINKELKPMPLYRECVNVGTTVEFTMKINSDDWTIEKIEKQIQQAYLQYWNKWFVG
MVTTPGGKAFIKGGGLPSVLHAKHRPTVLFLGGGTGFPSKTTHYLQKPKEQAQKDIFAILQRRFRNVYGKMATVPKNVPM
VLKGTVNDSTNKWYQQGVCLLEFQPIGEA
>P9WJF5 ~~~csm5~~~CRISPR system Cms protein Csm5~~~COG1332
MNTYLKPFELTLRCLGPVFIGSGEKRTSKEYHVEGDRVYFPDMELLYADIPAHKRKSFEAFVMNTDGAQATAPLKEWVEP
NAVKLDPAKHRGYEVKIGSIEPRRASRGRGGRMTRKKLTLNEIHAFIKDPLGRPYVPGSTVKGMLRSIYLQSLVHKRTAQ
PVRVPGHQTREHRQYGERFERKELRKSGRPNTRPQDAVNDLFQAIRVTDSPALRTSDLLICQKMDMNVHGKPDGLPLFRE
CLAPGTSISHRVVVDTSPTARGGWREGERFLETLAETAASVNQARYAEYRAMYPGVNAIVGPIVYLGGGAGYRSKTFVTD
QDDMAKVLDAQFGKVVKHVDKTRELRVSPLVLKRTKIDNICYEMGQCELSIRRAE
>A0A0A7HF79 ~~~csm5~~~CRISPR system Cms protein Csm5~~~
MKNDYRTFKLSLLTLAPIHIGNGEKYTSREFIYENKKFYFPDMGKFYNKMVEKRLAEKFEAFLIQTRPNARNNRLISFLN
DNRIAERSFGGYSISETGLESDRNPNSAGAINEVNKFIRDAFGNPYIPGSSLKGAIRTILMNTTPKWNNENAVNDFGRFP
KENKNLIPWGPKKGKEYDDLFNAIRVSDSKPFDNKRLILVQKWDYSAKTNKAKPLPLYRESISPLTKIEFEITTTTDEAG
RLIEELGKRAQAFYKDYKAFFLSEFPDDKIQANLQYPIYLGAGSGAWTKTLFKQADGILQRRYSRMKTKMVKKGVLKLTK
APLKIVKIPSGNHSLIKNHESFYEMGKANFMIKEIDK
>A0A0A7HIX6 3.1.-.-~~~csm6~~~CRISPR system endoribonuclease Csm6~~~
MKILISAVGTTDPISNNHDAALLHIARNYRPDKIVLVYSQEMMVKQDLINKVLLSIEGYNPIIEIDSTILNNDEVFLFDK
MYEVMGQIVQKYTNDDNEIILNLSSGTPQIISALFALNRINDYNTQAIQVATPKNRANREYTALTESEIDALIMENQDNR
LDFVDRSIKDKSEKFTQALVKRHLRSLIASFDYQAAEAIINRKEYNKLLSKKKIAYIREKLYDFSRVFKNQSILSDILSF
PLDDSQKKALNYYLMIDVLKEREHIADVLIKAKSLAEFVIEETIKKDHEGLIVFDGNLPKLNPSFPDCEAILDDIDKKMK
KSRGIEDTEERIFSVQSTLNLLSYLNILEFYEYDSQLQTAINGILSLNGERNKVAHGLSEIDTRLLSRKKLKQLSENLRL
LLVDCLGIDSSYFNYYDKQNKELIKMLE
>A0A0A7HFE6 3.1.-.-~~~csm6'~~~CRISPR system endoribonuclease Csm6'~~~
MRVLISAVGDTDPFRNFHDGSLIHIARKYRPEKVILIFSEHTAKKQGNIEKALFSIAPNYEPELIIHDPIISDNEVHIFD
VMFQRFSDILQEYYTKEDEFILNLSSATPQIKSALFVINRLNGINVKAVQVSSPEHASNENIGHDNDENIDELIEVNKDN
KVNFIDRTIEDNAEKFSQALLKKTARDFIEKFDYKAALDILDQLSDFPNLKSVREEIRDVVNCLSKQDVPKGLRHKKLKE
EEQKILSAYLTIELQRERGNVSESFIRIKNLTEFILEDYIEKRYPGLIDEYCEDIQKYYLSLFDYSKLLKATKEFKLKRT
IAPIIDMNSSRNKVAHSLSPLDSDAVKQLGIAMKTLKTLVREQYHFSQSDFNFYHDLNKILLTKLN
>E6LHV2 3.1.-.-~~~csm6~~~CRISPR system endoribonuclease Csm6~~~
MKILFSPIGNTDPWRNDRDGAMLHIVRHYQPDRVVLFFTESIWQGNQHFSGQQAFDWVKIIQSINENCQIEIKCDTIEVE
NDFDAYKDLFHQYLVEEKRKYPNAEIFLNVTSGTPQMETTLCLEYVTYPDKMRCIQVSTPLKTSNAKTKYAQADCQEVDL
EIVNEEESQQPSRCHKIAILSFREAIVRNQIKSLLDNYDYEAALQLVASQKSFRNGKEIRKKLKELIDDIKMHRVFSYLI
KQYPRNEKLQKALLHTILLEMRHQRGDIAETLIRVKSIAEYIVEQYIQKNYPYLIIYKEDKPYFNVSYSQELTESYLALM
DSRNKKTNKKMTVDSLDRILGFPAYRDFLQLLEASNEMTNEMNKVNEINNLRNKVAHNLDSLNLDRDKNGRKITNAVTAV
RTMLLAVFPEVQENDFHYLKQFNQSIKELL
>P71635 3.1.-.-~~~csm6~~~CRISPR system endoribonuclease Csm6~~~
MILFSPIGTADPITALGDGPMLHIVRHYRPIVVVLFLSAEIAAFENADRRYSAAITRLAPETDVRIVTYTNPSVHRFDLF
VPVFRNHLVELSAEFPDRTILLNTSSGTPAMQAALVAINVFGIPRTTAVQVSTPARALSKPGDRESPDAYDLELMWDAND
DNQPGAPNRCFEATSAALGALLERANLKQLIVSYDYSAAVTIAADSRLPDQVSNLIRGAMHRSRLEHLVAPKFFKDTAFT
YDPANKVAEYISALALLAKREQWAEFARSATPAITIVLRAAVAKHLPEDRYLDDMGRVDRRKLEREPEIRCALKHPPKSP
NAEWYLYTKDWLALLRQFAPDRVGALEVLGRFESRVRNTAAHEIVSISEDRITKDGGLLPEQLLKILARETGADLTLYDR
LNDEIIRQIDMAPLG
>Q53W17 3.1.-.-~~~csm6~~~CRISPR system endoribonuclease Csm6~~~
MEDLDALWERYREAVRAGGNPQALYQEMVWPALLALWREKPRVYPFPQAFAVSVHTLGTSPEATALAILGAGAERVYVLH
TPESARFLPRLRQDTGKDLYPVEIGKSDVEAIYREVKRLLEKHPEVPVALDLTSGTKAMSAGLAAAGFFFQRFYPKVRVV
YVDNEDYDPELRRPRAGTEKLRILPNPHEALAEVDALFAKELYGKGEFGQAAAYFRGMVGRTGNQAYALYALLAEMYRAW
RALDFGEALKAGRKLLGQLSQNVWLNHPLNARREALEAQVALLEAVDRFLKARDFALKEGVYGLARTLLHLAQEAKEEAA
VLAALYAYRALELLLQERLALLGRRAEAPGLSPEEAEALRKALAELLGVLPEEVRLPAKLGLLDLLAFLRLKGDEALGRL
SLAELRGLAGALKGRNSALLVHGFDVPSPKAVEGIARLAQGLLQDLEARTALGPLSPEPVPLGF
>P09928 ~~~cmsA~~~Bacteriochlorophyll c-binding protein~~~
MATRGWFSESSAQVAQIGDIMFQGHWQWVSNALQATAAAVDNINRNAYPGVSRSGSGEGAFSSSPSNGFRPKRIRSRFNR
>P0A314 ~~~csmA~~~Bacteriochlorophyll c-binding protein~~~
MSGGGVFTDILAAAGRIFEVMVEGHWETVGMLFDSLGKGTMRINRNAYGSMGGGSLRGSSPEVSGYAVPTKEVESKFAK
>P15527 ~~~csmA~~~Bacteriochlorophyll c-binding protein~~~
MSGGGVFTDILAAAGRIFEVMVEGHWETVGMLFDSLGKGTMRINRNAYGSMGGGTSLRGS
>P15528 ~~~csmA~~~Bacteriochlorophyll c-binding protein~~~
MSGGGVFTDILAAAGRIFEVMVEGHWETVGMLFDSLGKGTMRINRNAYGNLGGGGGSLRGSSPEVSGFAVPTKAVESKFA
K
>Q46383 ~~~csmB~~~Chlorosome envelope protein B~~~
MSNGTNIDVAGAINTLAETFGKLFQMQIDVANTALKALADVAEPLGKTATDLIGSFTGAATQVLQSVSSAIAPKK
>P15523 ~~~csmB~~~Chlorosome envelope protein B~~~
SNGTNIDVAGAINTLTETFGKLFQMQLDVANTALKALADVAEPLGKTATDLVGNFAGAATQILQSVSAAIAPKK
>Q46473 ~~~csmB~~~Chlorosome envelope protein B~~~
MSNGTNIDVAGAINTLTETFGKLFQMQVDVANNSLKALAEVAEPLGKTATDLVASFANVATQVLQNVSSAVAPKK
>O68988 ~~~csmI~~~Chlorosome protein I~~~COG0633
MNLIINDKTASSSVGQTIGKAARLNHAHVGYVCGGHGLCQACYITVQEGADCLAPLTDVEKAFLSPRQIAAGGRIACQAT
IAKEGTVKVLSRPEEVRRMVFSNPFQLIGYAADMGKDTAQQIVPGVQNLIGRIQRGEMGGKDALGDMIESIQGAAGLVVE
AIQQGPMALPIPFKEQIADLISKLPLPQIQLPSISLPQLPSISFPQLPFSLPKLPFSLPFLPQQPQATASLEKVTITVQP
PAKD
>C7UDU4 ~~~csn2~~~CRISPR-associated protein Csn2~~~
MRVNFSLLEEPIEIEKATFLTIKDVQSFAHLVKLIYQYDGENELKLFDAQQKGLKPTELFVVTDILGYDVNSAATLKLIY
GDLEAQLNDKPEVKSMIEKLTGTISQLIGYELLEHEMDLEEDGIIVQELFKALGIKIETTSDTIFEKVMEITQVHRYLSK
KKLLIFINACTYLTEDEVQQVVEYISLNNVDVLFLEQRVVQNRFQYILDENFYLSYEKA
>E7S4M0 ~~~csn2~~~CRISPR-associated protein Csn2~~~
MIKINFPILDEPLVLSNATILTIEDVSVYSSLVKHFYQYDVDEHLKLFDDKQKSLKATELMLVTDILGYDVNSAPILKLI
HGDLENQFNEKPEVKSMVEKLAATITELIAFECLENELDLEYDEITILELIKVLGVKIETQSDTIFEKCFEIIQVYNYLT
KKNLLVFVNSGAYLTKDEVIKLCEYINLMQKSVLFLEPRRLYDLPQYVIDKDYFLIGENMV
>Q99ZV9 ~~~csn2~~~CRISPR-associated protein Csn2~~~
MNLNFSLLDEPIPLRGGTILVLEDVCVFSKIVQYCYQYEEDSELKFFDHKMKTIKESEIMLVTDILGFDVNSSTILKLIH
ADLESQFNEKPEVKSMIDKLVATITELIVFECLENELDLEYDEITILELIKSLGVKVETQSDTIFEKCLEILQIFKYLTK
KKLLIFVNSGAFLTKDEVASLQEYISLTNLTVLFLEPRELYDFPQYILDEDYFLITKNMV
>G3ECR4 ~~~csn2~~~CRISPR-associated protein Csn2~~~
MKINFSLLDEPMEVNLGTVLVIEDVSVFAQLVKEFYQYDEQSNLTIFDSKIRSIRSSELLLITDILGYDINTSQVLKLLH
TDIVSQLNDKPEVRSEIDSLVSLITDIIMAECIENELDIEYDEITLLELIKALGVRIETKSCTVFEKIFEILQIFKYLVK
KRILVFVNSLSYFSKDEIYQILEYTKLSQADVLFLEPRQIEGIQQFILDKDYILMPYNN
>P0ABW8 ~~~csoA~~~CS1 fimbrial subunit A~~~
MKLKKTIGAMALATLFATMGASAVEKTISVTASVDPTVDLLQSDGSALPNSVALTYSPAVNNFEAHTINTVVHTNDSDKG
VVVKLSADPVLSNVLNPTLQIPVSVNFAGKPLSTTGITIDSNDLNFASSGVNKVSSTQKLSIHADATRVTGGALTAGQYQ
GLVSIILTKST
>P0ABW7 ~~~csoA~~~CS1 fimbrial subunit A~~~
MKLKKTIGAMALATLFATMGASAVEKTISVTASVDPTVDLLQSDGSALPNSVALTYSPAVNNFEAHTINTVVHTNDSDKG
VVVKLSADPVLSNVLNPTLQIPVSVNFAGKPLSTTGITIDSNDLNFASSGVNKVSSTQKLSIHADATRVTGGALTAGQYQ
GLVSIILTKST
>O85042 4.2.1.1~~~csoS3~~~Carboxysome shell carbonic anhydrase~~~
MNTRNTRSKQRAPFGVSSSVKPRLDLIEQAPNPAYDRHPACITLPERTCRHPLTDLEANEQLGRCEDSVKNRFDRVIPFL
QVVAGIPLGLDYVTRVQELAQSSLGHTLPEELLKDNWISGHNLKGIFGYATAKALTAATEQFSRKIMSEKDDSASAIGFF
LDCGFHAVDISPCADGRLKGLLPYILRLPLTAFTYRKAYAGSMFDIEDDLAQWEKNELRRYREGVPNTADQPTRYLKIAV
YHFSTSDPTHSGCAAHGSNDRAALEAALTQLMKFREAVENAHCCGASIDILLIGVDTDTDAIRVHIPDSKGFLNPYRYVD
NTVTYAQTLHLAPDEARVIIHEAILNANRSDGWAKGNGVASEGMRRFIGQLLINNLSQIDYVVNRHGGRYPPNDIGHAER
YISVGDGFDEVQIRNLAYYAHLDTVEENAIDVDVGIKIFTKLNLSRGLPIPIAIHYRYDPNVPGSRERTVVKARRIYNAI
KERFSSLDEQNLLQFRLSVQAQDIGSPIEEVASA
>Q31HD6 4.2.1.1~~~csoS3~~~Carboxysome shell carbonic anhydrase~~~COG0288
MNRLKKSHRQKSLFWRPIAPNPRWQKENPTAHGSTDTGGFGYNGGNEEVKTSSTMMNGIHALVNERQNEWLRGYEVDIKS
RFDNIESVLKDILAQQSQLNFVSWANQQLFAKLGVSLTEQDWQSGVQLQSQKGFQFLYGKTLFAQFMRMSEDFFVNDPLS
GQRKQEAERMFKEAGFHAVGIAPCADGRLAHILSYVLRLPYALARRKAHAGVMFDVSESVRNWVFIEHTRFRDGQPNLAD
EPTRYLKIAVYHFSKADPTHQGCAAHGSDDHKAAQAALQKLKDFKQAIENRFGCGSTVQTLLLGLNTDDDSMKVHIPNAS
GEVCLDRYVETEQLYQATMNLPDSEAKQALENAIVTCNQALGSTAPQPELVKLLSWLIGNNFSQIAYVNQYENGCYSDIG
HAERFIGIGNGFEEVQLRNLSYYSFLDTVEEGVNDVDVGIKIFKGLNVKKGLPIPIIIRCDYDGRVPGSKDRAEAKALRI
EKALHNRYQELSAPGLLQTLPTLRDFTSCKPAERLPGLADLSAKQRTA
>Q7TTT8 4.2.1.1~~~csoS3~~~Carboxysome shell carbonic anhydrase~~~
MVRSMPLRGGRPQAPTAPTRWQLQNTVFAAESQNQPSASEAVVSSTRDAALQRRRALTTDGKAATLVQGSVGGGRVRSAR
DQRQPGWVRRDKGATSGVPFNLSRSSLPITNRQHPLTDTAANARLRAYEQEVKGRFDRIVPLLQRVSALQHEPDFIEQAQ
RLTRAELGFDLPQHILERAWVRPLDMRALFAWCVFESHRLFSDRFFQDDPLDGAAGSVAARDFEQFLLDCGIHLLDVSPC
ADGRLAHTVAYALRIPFSAVRRRSHAGAMFDVENTVNRWVKTEHRRHREGMPNPSTEPTRYLKVVTYHFSSLDPQHQGCA
AHGSNDELAAAAGHQRLLDFREAVENSFCCGASVDLLLIGLDTDTDAIRVHPPSRDSEMVLDQWLCARELHAATASMTAD
QAMAQIAEAIEAGASGPMEPGMVAFLTRLIANNCSQIDYVQDLHGAPYPDAGHAERFIGVGIGFKEVHLRNLTYFAHLDT
VEEGAPDLDVGVKIFKGLNVSRDLPIPVVVRFDYSGRVPGARERAIADCQRVNQAIADRYGELVNQGLLHTCLTVRDRNQ
TAPAEVVGSTLAPPLQEAH
>Q7V6G1 4.2.1.1~~~csoS3~~~Carboxysome shell carbonic anhydrase~~~
MAYRNRNLASQTQRPLAPTAPRRRPVVTPQISDRSRLRGFANVGSGPCHPLTDRAANQHLQTYEANVKGSFELIVPFLKR
ISALQHDQDFVSKCQSLARSELGFDLPSHLLEQAWVRALDMRALFAWCVFQSHQHVSDHFFQEDPLQGGEGSIQAKHFQS
FLVDCGFHLLDVSPCADGRLAHTIAYALRIPFSSVRRRSHAGALFDVEKTVNRWIKTEHRRYREGVPNSADSPTRYLKVV
TYHFSSLDPSHQGCAAHGSDDALAASAGQQRLLDFRESVENSFCCGASVDLLLIGLDTDTDAIRVHVPAADGSIVLDEWL
SAEDLYHETLSLTSDEAMQHIAERVEAIAPKKPDEGMVAFIVKLIANNFSQIDYVKQSHAGPYPDAGHAERFIGVGIGFK
EVHLRNLTYFAHLDTVEVGAPDLDVGVKIFKGLNVSRDLPIPVVVRFDYSGQVPGARDRAVTDCYRVNQAIAERYSELFD
QGLLHTFLTIRDRDKKDTSEVVGSSLEPVHQEAH
>Q7V2C7 4.2.1.1~~~csoS3~~~Carboxysome shell carbonic anhydrase~~~
MPLRGLAKAKNFTLGPTAPMKTFTENVHSQNNEINNLKKIDKTHNLTNNSQNEKLYKYESQIKSSFDRIVPTLKEIARIQ
HHEDFINTAQSISKQNLGINLPTHILDKSWVKPLDMRALYAWCAFKQHEKLSDNFFENDPLEGSFGSPNANNFETALLDC
GIHLLDITPCSDGRLAHSVAYVMRIPFSAVRRRSHAGALFDIENTVNRWVKTEHKRYRENNPNEAHEDTRYLKIVTYHFS
SVDPLHQGCAAHGSDDKLAAKEGSEKLLAFKEAVENSFCCGASVDLMLIGLDTDTDSLKIHLSSSDGKIDLENTISSLDI
YNSTINFSKDEAEKEICQIISGNSNKVQLKGLDKFVYKLIVNNISQIDYVKKFHKGSYEDIGHAERFIGVGIGFKEVHLR
NLTYFAHLDTVEEGAPDLDVGVKIFTGLNVSQDLPIPIVIRFDYSGKVPGAKERAAKDCYRVNNAISIRYKSLVDKGLLH
TCLTIRDRDNIHSAQIIGMSLDQKTKEAH
>O32222 ~~~csoR~~~Copper-sensing transcriptional repressor CsoR~~~COG1937
MEKHNEHKTLNHKSSKEKDQITNRLKRIEGQVRGIQNMVENDRYCVDILVQISAVQAAMKNVALHLLEDHAHHCVADAIK
SGDGEQAISELLDVFKKFTKS
>P9WP49 ~~~csoR~~~Copper-sensing transcriptional repressor CsoR~~~COG1937
MSKELTAKKRAALNRLKTVRGHLDGIVRMLESDAYCVDVMKQISAVQSSLERANRVMLHNHLETCFSTAVLDGHGQAAIE
ELIDAVKFTPALTGPHARLGGAAVGESATEEPMPDASNM
>P0A329 ~~~csoS1~~~Carboxysome shell protein CsoS1~~~COG4577
MANETMGIALGMIETRGLVPAIEAADAMTKAAEVRLIGREFVGGGYVTVLVRGETGAVNAAVRAGADACERVGDGLVAAH
IIARPHREVEPALGNGNFLGQKD
>Q7V6F7 ~~~csoS1~~~Carboxysome shell protein CsoS1~~~COG4577
MASETMGIALGMIETRGLVPAIEAADAMTKAAEVRLIGREFVGGGYVTVLVRGETGAVNAAVRAGADACERVGDGLVAAH
IIARPHREVEPALGNGNFLGQKD
>Q7V2D1 ~~~csoS1~~~Major carboxysome shell protein CsoS1~~~COG4577
MATETMGIALGMIETRGLVPAIEAADAMTKAAEVRLIGREFVGGGYVTVLVRGETGAVNAAVRAGADACERVGDGLVAAH
IIARPHREVEPALGNGDFLGQKD
>P0A330 ~~~csoS1~~~Carboxysome shell protein CsoS1~~~COG4577
MANETMGIALGMIETRGLVPAIEAADAMTKAAEVRLIGREFVGGGYVTVLVRGETGAVNAAVRAGADACERVGDGLVAAH
IIARPHREVEPALGNGNFLGQKD
>O85041 ~~~csoS2~~~Carboxysome assembly protein CsoS2B~~~
MPSQSGMNPADLSGLSGKELARARRAALSKQGKAAVSNKTASVNRSTKQAASSINTNQVRSSVNEVPTDYQMADQLCSTI
DHADFGTESNRVRDLCRQRREALSTIGKKAAKTTGKPSGRVRPQQSVVHNDAMIENAGDTNQSSSTSLNNELSEICSIAD
DMPERFGSQAKTVRDICRARRQALSERGTRAVPPKPQSQGGPGRNGYQIDGYLDTALHGRDAAKRHREMLCQYGRGTAPS
CKPTGRVKNSVQSGNAAPKKVETGHTLSGGSVTGTQVDRKSHVTGNEPGTCRAVTGTEYVGTEQFTSFCNTSPKPNATKV
NVTTTARGRPVSGTEVSRTEKVTGNESGVCRNVTGTEYMSNEAHFSLCGTAAKPSQADKVMFGATARTHQVVSGSDEFRP
SSVTGNESGAKRTITGSQYADEGLARLTINGAPAKVARTHTFAGSDVTGTEIGRSTRVTGDESGSCRSISGTEYLSNEQF
QSFCDTKPQRSPFKVGQDRTNKGQSVTGNLVDRSELVTGNEPGSCSRVTGSQYGQSKICGGGVGKVRSMRTLRGTSVSGQ
QLDHAPKMSGDERGGCMPVTGNEYYGREHFEPFCTSTPEPEAQSTEQSLTCEGQIISGTSVDASDLVTGNEIGEQQLISG
DAYVGAQQTGCLPTSPRFNQTGNVQSMGFKNTNQPEQNFAPGEVMPTDFSIQTPARSAQNRITGNDIAPSGRITGPGMLA
TGLITGTPEFRHAARELVGSPQPMAMAMANRNKAAQAPVVQPEVVATQEKPELVCAPRSDQMDRVSGEGKERCHITGDDW
SVNKHITGTAGQWASGRNPSMRGNARVVETSAFANRNVPKPEKPGSKITGSSGNDTQGSLITYSGGARG
>Q31HD7 ~~~csoS2~~~Carboxysome assembly protein CsoS2~~~
MSTSNAQSGRAAAIARRNAQVKGKGYTASAAPAAPRKPAAPVAEPVVAAAPAPSQPSRSRRKVSVAPTATPAASAAGREA
AKLKRQQQKNGKSSAGAANAMPHPKAKAKQKPEEPIVEPRQAKAEKPTKRSERRTGVKPQVASQQPSGRLQSKAYRKAQA
KGKAGQEAFKSNGSSQSGAKAKLANPDASTREIAQQVRAERCAQGKTCSTGGSRPMRKRRNAKEAPQKVGESQTLHGQSV
SGTQVGQGEKKMTGSESGACQLVSGTEYLGAEEFSKNCDVQPTPQPAKVTQTQTTRGQVVSGSTKVGRSDKMTGNETGTC
SAITGTEYLPADQSKMYCGETPAKSKATGFSVMSQATQKSEQKVTGGDSRKSQSTTFKPKNPASAPHKVMPSQTAKGNTT
TGSQVGRLESVTGGERGSCHAVTGTGYQGAEEAKACDMPMTETADKVTASGTAGGQKVTGDRSGAYYGMTGAEAGDCKTI
TGTSYTGTEQFQFCSVDEQNEMKVRQRKGANPSISGVQPGPQGLTGAQKGACELVTGSHYQGGDQTAMVCDSTNAAAPGE
SDFPAMIGQAQPAFSTNEVEPMVDEGSKITGDGWDRGSKVTGTDGPWAAQRNASIRGVAGQSPMGASQYRPVNNEVPMSP
ITGSSGNTDTGAKVTLSGGARA
>Q7U5I9 ~~~csoS2~~~Carboxysome assembly protein CsoS2~~~
MARLSSRELALERRKALTTSGKKSSVAAGDGANRVRTASDVRPTRTDAAAAVEPTAPAVSAPVKPTVSFTPASPSSSSHV
KPQRHPSRDLVLARRDALSRRGKTADTSRDRNRADVARQTQAAAPVAASAEEQKTCGCGGKRAAGKVQLSAPTTSLKPRS
DRRSAAPKRRAIENPSRALVLARREAMAKHGKTAGKQPTSAAAVARQANPDLTSRELAQQVRELRTKAGARNKQSAGATR
PTGPNRHGAKQAAAADAHWKVGESTTSTGQTVTGTQANRSVKTTGNEASTCRSITGTEYLGAEVFQTFCQQAPEPTTPAK
VRVTATSHGNRVTGNEVGRSEKVTGDEPGTCKSVTGTEYISANQSAAYCGSSQVSQRKVGHSLTQQGRPVSGVMVGRSSS
VTGDEAGAGRSLTGDQYLGSDPLPDGRPAAKVGQSGTLSGTGVTGTLVGRSSQVTGNEFGSCHRVTGDQYISAEQVNAFC
GSKPEPEAAKVGFSITNRNQVVSGTRTGRSERVTGDEPGTCKAVTGTPYAGLENAGQHCGTSAVQAIRERTPVRLGTPSA
AMTGIQPGVGGVMTGDEKGACEAVTGTPYVGADQLATACGNEAPAGTDSHGQAPEGAAWTRFSVMSPARAAQQQRDDQGA
VTGTSYEQGNRITGPFDLAGGKVTGTEQFRFDNREFQRRQFQPTVAVVSEPAEQPASRVTGEGSSTKITGDDWDRGEHVT
GTEGVSARRRNPTRPGPMSATVPHERKRNEENEWPVSRVTGSSGNTEKGSLITVSGGARG
>Q7V6G0 ~~~csoS2~~~Carboxysome assembly protein CsoS2~~~
MAKQSSRELALERRKALSNSGKKSTTLNGSSPNRIRTASDARLTRTDQSFVKAGKESVQLTAPKREQLDTSFVASRESSG
ASRRQVKTIRNSSRELVLARRDELSRRGQPAAKSKDRTRAEVEKISSKVSQQDAAKKQVNDLASDQKGVDESSSKSLKSL
DTVSRLSSRNSTSRPSAKRRSIQNPSRALVLARREAQSKHGKTAANQPTSAASVARQGDPDLSSREISQRVRELRSKSGA
TGKKRSGACRPCGPNRNGSKQAVAADAHWKVGLSETSTGQVVTGTQANRSSKTTGNEASTCRSITGTQYLGSEVFDTFCQ
SAPQPGQPLKVAVTNTSHGNRVTGNEVGRSEKVTGDEPGTCKTLTGTEYISANQANQYCGVSQPSPRKVGQSVTEDGRKV
SGVMVGRSEKVTGDEAGSNRQLTGDQYLGVDPLPEGRSAEKVGSFNTLRGAGVTGTNVARSEYVTGNEPGSCKRVTGDEY
VGPQQYNTFCGGKPNPEAAKVGLSLTNKSQTVSGTLTGRSELVTGDEPGTCKAVTGTPYSGVEQASGWCDTNSVREIQDR
TPKLLGTPGAVMTGLQPGVGGVMTGAEKGACEPLTGTPYVGGDQLVQACGSDAPAGSNDHQGSSESSPWTHFSVQSPARA
MQLQRDPRSGVTGTSYEQGSQITGPFNMAVDKITGTEQFRFDRKQRHFKSVPVEATPNDVSQTRPESRVTGEGQSAGLNI
TGDDWDRSERVTGTEGASARRRNPTRPGPMSAMPAADLKRNEEVSQPMSRVTGSSGNTDQGSLITVSGGARG
>Q7V2C8 ~~~csoS2~~~Carboxysome assembly protein CsoS2~~~
MSTKTSREIALERRKAMSDGGKKAALHSSSTKDRVRSSQDINSTGATSSNKKVLTSPSKSNIPANKIARKSTSSKLSSKE
LGIERRKAMSTHGKSAINSSDRTRTDVKSDIKVNKVISTEKPQALKDHNNNIKDNQVVKQNIKRRINQKRKPITNTSRDI
VLARREAQSKHGKSASKQNTSAASLARRGDPDLSSREISQRVRELRSKTGSTSKQGNGKCRPCGPNKNGSKLNIADASWK
VGKSETDSGQTVTGTQANRSLKTTGNEASTCRTVTGTQYMGAEVTGQFCQDKPKYKQPIRASVTTTTSGNKVTGNEVGRS
EKVTGDEPGTCKNLTGTEYISANQSKKYCGEVIKKPSKVMQSITTDGLKVSGSLPGRSSLVTGDESGSGKQLTGDQYLGS
EPSPKGKSFEKVGSYDTLNGNNVTGTGVGRSDYVTGNEYGSCKNLTGDEYIGSQQYEKFCGSTPKPEARKVGLSLSSKSN
LISGTMTGRSKIVTGDEPGSCKVLTGTPYAGLDQINDNCNAEIADDMKSRATVNSGNNSNARLTGLQPGIGGVMTGATKG
SCKNLTGTPYIGGDQFLSNCETPPNDASYANQEKSASNSWKEFSVNSPSREKYSAKNTEGVTGNRYEDSSKITGPFDMAE
DKVTGTEQFRFEPNKNMTYKQKMKQEESQNIDIPTDKKEPSKITGEGQSAGNITGDDWDRGDKVTGTEGVSARKRNPSRA
GFMGAMPPVDNKRNDETEKPDFLITGSSGNTRDGQLVTFSGGARG
>P45689 ~~~csoS1A~~~Major carboxysome shell protein CsoS1A~~~COG4577
MADVTGIALGMIETRGLVPAIEAADAMTKAAEVRLVGRQFVGGGYVTVLVRGETGAVNAAVRAGADACERVGDGLVAAHI
IARVHSEVENILPKAPQA
>Q31HD3 ~~~csoS1A~~~Carboxysome shell protein CsoS1A~~~COG4577
MSDYGIALGMIETRGLVPAIEAADAMTKAAEVRLVSREFVGGGYVTVLVRGETGAVNAAVRAGADACERVGDGLVAAHII
ARPHKEVEPVLTMEQK
>P45690 ~~~csoS1B~~~Carboxysome shell protein CsoS1B~~~COG4577
MATTHGIALGMIETRGLVPAIEAADAMTKAAEVRLVGRSFVGGGYVTVMVRGETGAVNAAVRAGADACERVGDGLVAAHI
IARVHSEVEIILPETPEDSDSAWCIANLNS
>P45688 ~~~csoS1C~~~Carboxysome shell protein CsoS1C~~~COG4577
MAAVTGIALGMIETRGLVPAIEAADAMTKAAEVRLVGRQFVGGGYVTVLVRGETGAVNAAVRAGADACERVGDGLVAAHI
IARVHSEVENILPKAPEA
>Q31HD1 ~~~csoS1C~~~Carboxysome shell protein CsoS1C~~~COG4577
MSTEYGIALGMIETRGLVPAIEAADAMTKAAEVRLVSREFVGGGYVTVLVRGETGAVNAAVRAGADACERVGDGLVAAHI
IARPHKEVEPVLALGNSSPDRS
>D0KZ73 ~~~csoS1D~~~Carboxysome shell protein CsoS1D~~~COG4577
MNNIDLRVYSFIDSLQPQLASYLATSSQGFLPVPGDACLWIEVAPGMAVHRLSDIALKATNVRLGEQVVERAFGSMEIHY
RNQSDVLASGEAVLREINHAQEDRLPCRIAWKEIIRAITPDHATLINRQLRKGSMLLPGKSMFILETEPAGYIVQAANEA
EKAAHVTLIDVRAFGNFGRLTMMGSEAETEEAMRAAEATIASINARARRAEGF
>Q7V2D3 ~~~csoS1D~~~Carboxysome shell protein CsoS1D~~~COG4577
MEPTSSLNRGDRKKGSSLVTGSEVQSQSNGASCFITTDSEKSLVSRQASQVEQIELRTYVFLDSLQPQLAAYMGTVSRGF
LPIPGDSCLWMEVSPGMAVHRVTDIALKASNVRLGQMIVERAFGSLALYHKDQSTVLHSGDVVLDAIGSEVRKRTKPSTS
WTEVICAITPDHAVLINRQNRSGSMIQSGMSMFILETEPAGYVLKAANEAEKSANITIIDVKAVGAFGRLTLAGKEGDVE
EAAAAAIRAIDQISNY
>P0C1D6 ~~~csp1~~~Protein PS1~~~COG0627
MRDTAFRSIKAKAQAKRRSLWIAAGAVPTAIALTMSLAPMASAQSSNLSSDAVIGSIAQGVTDGLTDYLKPRVEELPAGE
VTYPEIAGLPDGVRVISAEWATSKHVILTIQSAAMPERPIKVQLLLPRDWYSSPNREFPEIWALDGLRAIEEQSGWTIET
NIEQYYADKNAIVVLPVGGESSFYSDWEGPNNGKNYQWETFLTQELAPILDKGFRSNTDRAITGISMGGTAAVNIATHHP
DMFKFVGSFSGYLDTTSAGMPIAISAALADAGGYDANAMWGPVGSERWQENDPKSNVDKLKGKTIYVSSGNGADDFGKEG
SVAIGPANAAGVGLEVISRMTSQTFVDRASQAGVEVVASFRPSGVHSWEYWQFEMTQAFPHIANALGMSTEDRGVECAPV
GAIADAVADGAMGTCLTNEYDVTGGKAQDFANGRAYWSANTGAFGLVGRINARYSELGGPASWLGYPTSSELKTPDGRGR
FVTFEHGSIYWTATTGPWEIPGDMLAAWGTQDYEKGSLGYPTGAAVEYNGGLRQQFEGGYVFRTSNNQSYWVRGEISKKY
AEDGIFAQLGFPTGNEKLINGGAFQEFEKGNIYWSASTGAHVILHGDIFDAWGAKGWEQGEYGFPTSDQTAITAGGQTID
FQNGTIRQVNGRIEESR
>P71478 ~~~csp~~~Cold shock protein 1~~~COG1278
MKNGTVKWFNADKGYGFITGEDGNDVFVHFSAIQTDGFKTLEEGQKVTFDEESSDRGPQAANVVPQ
>P60242 ~~~comC1~~~Competence-stimulating peptide type 1~~~
MKNTVKLEQFVALKEKDLQKIKGGEMRLSKFFRDFILQRKK
>P60243 ~~~comC1~~~Competence-stimulating peptide type 1~~~
MKNTVKLEQFVALKEKDLQKIKGGEMRLSKFFRDFILQRKK
>P96349 ~~~cspL~~~Cold shock protein 2~~~COG1278
MKNGTVKWFNADKGFGFITGEDGTDVFVHFSAIQTDGFKTLDEGQKVTYDEEQGDRGPQATNVQPQ
>P72507 ~~~comC2~~~Competence-stimulating peptide type 2~~~
MKNTVKLEQFVALKEKDLQKIKGGEMRISRIILDFLFLRKK
>Q01761 ~~~~~~Cold shock-like protein 7.0~~~COG1278
MATGTVKWFNAEKGFGFIAQDGGGPDVFVHYSAINATGFRSLEENQVVNFDVTHGEGPQAENVSPA
>Q45096 ~~~cspA~~~Major cold shock protein CspA~~~COG1278
MTVTGQVKWFNNEKGFGFIEVPGENDVFVHFSAIETDGFKSLEEGQKVSFEIEDGNRGPQAKNVIKL
>P0A9X9 ~~~cspA~~~Cold shock protein CspA~~~COG1278
MSGKMTGIVKWFNADKGFGFITPDDGSKDVFVHFSAIQNDGYKSLDEGQKVSFTIESGAKGPAAGNVTSL
>P0A355 ~~~cspLA~~~Cold shock-like protein CspLA~~~COG1278
MEQGTVKWFNAEKGFGFIERENGDDVFVHFSAIQGDGFKSLDEGQAVTFDVEEGQRGPQAANVQKA
>A0R5E1 ~~~cspA~~~Probable cold shock protein A~~~COG1278
MPQGTVKWFNAEKGFGFIAPEDGSADVFVHYTEIQGSGFRTLEENQKVEFEVGQSPKGPQATGVRTI
>P9WP75 ~~~cspA~~~Probable cold shock protein A~~~COG1278
MPQGTVKWFNAEKGFGFIAPEDGSADVFVHYTEIQGTGFRTLEENQKVEFEIGHSPKGPQATGVRSL
>P95459 ~~~cspA~~~Major cold shock protein CspA~~~
MSNRQNGTVKWFNDAKGFGFITPESGNDLFVHFRSIQGTGFKSLQEGQKVSFVVVNGQKGLQADEVQVV
>Q9Z3S6 ~~~cspA~~~Cold shock protein CspA~~~COG1278
MNSGTVKWFNSTKGFGFIQPDDGATDVFVHASAVERAGMRSLVEGQKVTYDIVRDTKSGKSSADNLRAA
>Q9S1B7 ~~~cspA~~~Cold shock-like protein CspA~~~COG1278
MSDSNTGTVKWFNEDKGFGFLTQDNGGADVFVHFRAIASEGFKTLDEGQKVTFEVEQGPKGLQASNVIAL
>Q2FYN2 ~~~cspA~~~Cold shock protein CspA~~~COG1278
MKQGTVKWFNAEKGFGFIEVEGENDVFVHFSAINQDGYKSLEEGQAVEFEVVEGDRGPQAANVVKL
>Q5HG18 ~~~cspA~~~Cold shock protein CspA~~~
MKQGTVKWFNAEKGFGFIEVEGENDVFVHFSAINQDGYKSLEEGQAVEFEVVEGDRGPQAANVVKL
>P41016 ~~~cspB~~~Cold shock protein CspB~~~
MQRGKVKWFNNEKGYGFIEVEGGSDVFVHFTAIQGEGFKTLEEGQEVSFEIVQGNRGPQAANVVKL
>P32081 ~~~cspB~~~Cold shock protein CspB~~~COG1278
MLEGKVKWFNSEKGFGFIEVEGQDDVFVHFSAIQGEGFKTLEEGQAVSFEIVEGNRGPQAANVTKEA
>P36995 ~~~cspB~~~Cold shock-like protein CspB~~~COG1278
MSNKMTGLVKWFNADKGFGFISPVDGSKDVFVHFSAIQNDNYRTLFEGQKVTFSIESGAKGPAAANVIITD
>P42016 ~~~cspB~~~Cold shock protein CspB~~~
MQRGKVKWFNNEKGYGFIEVEGGSDVFVHFTAIQGEGFKSLEEGQEVSFEIVQGNRGPQAANVVKL
>P39158 ~~~cspC~~~Cold shock protein CspC~~~COG1278
MEQGTVKWFNAEKGFGFIERENGDDVFVHFSAIQSDGFKSLDEGQKVSFDVEQGARGAQAANVQKA
>P0A9Y6 ~~~cspC~~~Cold shock-like protein CspC~~~COG1278
MAKIKGQVKWFNESKGFGFITPADGSKDVFVHFSAIQGNGFKTLAEGQNVEFEIQDGQKGPAAVNVTAI
>E0J500 ~~~cspC~~~Cold shock-like protein CspC~~~
MAKIKGQVKWFNESKGFGFITPADGSKDVFVHFSAIQGNGFKTLAEGQNVEFEIQDGQKGPAAVNVTAI
>Q45099 ~~~cspD~~~Cold shock-like protein CspD~~~COG1278
MQTGKVKWFNGEKGFGFIEVEGGEDVFVHFSAIQGDGFKTLEEGQEVSFEIVDGNRGPQAANVTKN
>P51777 ~~~cspD~~~Cold shock protein CspD~~~COG1278
MQNGKVKWFNNEKGFGFIEVEGGDDVFVHFTAIEGDGYKSLEEGQEVSFEIVEGNRGPQASNVVKL
>P0A968 ~~~cspD~~~Cold shock-like protein CspD~~~COG1278
MEKGTVKWFNNAKGFGFICPEGGGEDIFAHYSTIQMDGYRTLKAGQSVQFDVHQGPKGNHASVIVPVEVEAAVA
>P0A972 ~~~cspE~~~Cold shock-like protein CspE~~~COG1278
MSKIKGNVKWFNESKGFGFITPEDGSKDVFVHFSAIQTNGFKTLAEGQRVEFEITNGAKGPSAANVIAL
>E0J1Q3 ~~~cspE~~~Cold shock-like protein CspE~~~
MSKIKGNVKWFNESKGFGFITPEDGSKDVFVHFSAIQTNGFKTLAEGQRVEFEITNGAKGPSAANVIAL
>P48859 ~~~scoF~~~Cold shock protein ScoF~~~COG1278
MASGTVKWFNSEKGFGFIAQDGGGPDVFAHYSNINAQGYRELQEGQAVTFDITQGQKGPQAENITPA
>P0A978 ~~~cspG~~~Cold shock-like protein CspG~~~COG1278
MSNKMTGLVKWFNADKGFGFITPDDGSKDVFVHFTAIQSNEFRTLNENQKVEFSIEQGQRGPAAANVVTL
>Q9S170 ~~~cspG~~~Cold shock-like protein CspG~~~COG1278
MSNSTTGLVKWFNEEKGFGFITQDNGGDDVFVHFRSITSDGFKTLAEGQKVSFEVEQGQKGLQAANVVAL
>E1WGN1 ~~~cspJ~~~Cold shock-like protein CspJ~~~
MTTKITGLVKWFNPEKGFGFITPKDGSKDVFVHFSAIQSNEFRTLNENQEVEFSVEQGPKGPSAVNVVAL
>Q9KL16 ~~~cspV~~~Cold shock protein CspV~~~COG1278
MSTKMTGSVKWFNETKGFGFLTQDNGGNDVFVHFNSIQSEGFKTLAEGQRVSFIVEQGKKGPQASNVVAL
>P54584 ~~~csp~~~Cold shock protein~~~
MAQGTVKWFNAEKGFGFITPDDSDGDVFVHYSEIQTGGFKTLDENARVQFEIGQGAKGPQATGVTLV
>O54310 ~~~csp~~~Cold shock-like protein~~~COG1278
MRGKVKWFDSKKGYGFITKDEGGDVFVHWSAIEMEGFKTLKEGQVVEFEIQEGKKGPQAAHVKVVE
>P32144 ~~~csqR~~~HTH-type transcriptional repressor CsqR~~~COG1349
MSLTELTGNPRHDQLLMLIAERGYMNIDELANLLDVSTQTVRRDIRKLSEQGLITRHHGGAGRASSVVNTAFEQREVSQT
EEKKAIAEAVADYIPDGSTIFITIGTTVEHVARALLNHNHLRIITNSLRVAHILYHNPRFEVMVPGGTLRSHNSGIIGPS
AASFVADFRADYLVTSVGAIESDGALMEFDVNEANVVKTMMAHARNILLVADHTKYHASAAVEIGNVAQVTALFTDELPP
AALKSRLQDSQIEIILPQEDA
>P0DPC3 ~~~csrA1~~~Translational regulator CsrA1~~~COG1551
MLILTRKVGESINIGDDITITILGVSGQQVRIGINAPKDVAVHREEIYQRIQAGLTAPDKRETP
>P69920 ~~~csrA2~~~Translational regulator CsrA2~~~COG1551
MLILTRRCAESLIIGDGEITVTVLGVKGNQVRIGVNAPKEVAVHREEIYLRIKKEKDEEPSH
>P33911 ~~~csrA~~~Translational regulator CsrA~~~COG1551
MLVLSRKINEAIQIGADIEVKVIAVEGDQVKLGIDAPKHIDIHRKEIYLTIQEENNRAAALSSDVISALSSQKK
>P0DPD4 ~~~csrA~~~Translational regulator CsrA~~~
MLILSRKENESIIIGEGIEIKVVQTGKGYAKIGIEAPKSLMILRKELVQQVKDENLHSVVQNDIKLDDLSKKLIK
>Q0P9F1 ~~~csrA~~~Translational regulator CsrA~~~COG1551
MLILSRKENESIIIGEGIEIKVVQTGKGYAKIGIEAPKSLMILRKELVQQVKDENLHSVVQNDIKLDDLSKKLIK
>P69913 ~~~csrA~~~Carbon storage regulator~~~COG1551
MLILTRRVGETLMIGDEVTVTVLGVKGNQVRIGVNAPKEVSVHREEIYQRIQAEKSQQSSY
>A4ISU9 ~~~csrA~~~Translational regulator CsrA~~~COG1551
MLVLTRKLKEAIQIGDDIEITVLAIQGDQVKLGINAPKHVEIHRKEIYLAIQAENNAASHASKSSLKRLNEQLKHLKGGK
QA
>O69078 ~~~csrA~~~Translational regulator CsrA~~~
MLILTRRVGETLMVGDDVTVTVLGVKGNQVRIGVNAPKEVAVHREEIYQRIQKEKDQEPNH
>O85735 ~~~csrA~~~Translational regulator CsrA~~~
MLILTRRVGETLMIGDEVTVTVLGVKGNQVRIGVNAPKEVSVHREEIYQRIQAEKSADDLLIPKQRLVA
>A1JK11 ~~~csrA~~~Translational regulator CsrA~~~COG1551
MLILTRRVGETLMIGDEVTVTVLGVKGNQVRIGVNAPKEVSVHREEIYQRIQAEKSQPTTY
>P13518 ~~~csrD~~~RNase E specificity factor CsrD~~~COG2199
MRLTTKFSAFVTLLTGLTIFVTLLGCSLSFYNAIQYKFSHRVQAVATAIDTHLVSNDFSVLRPQITELMMSADIVRVDLL
HGDKQVYTLARNGSYRPVGSSDLFRELSVPLIKHPGMSLRLVYQDPMGNYFHSLMTTAPLTGAIGFIIVMLFLAVRWLQR
QLAGQELLETRATRILNGERGSNVLGTIYEWPPRTSSALDTLLREIQNAREQHSRLDTLIRSYAAQDVKTGLNNRLFFDN
QLATLLEDQEKVGTHGIVMMIRLPDFNMLSDTWGHSQVEEQFFTLTNLLSTFMMRYPGALLARYHRSDFAALLPHRTLKE
AESIAGQLIKAVDTLPNNKMLDRDDMIHIGICAWRSGQDTEQVMEHAESATRNAGLQGGNSWAIYDDSLPEKGRGNVRWR
TLIEQMLSRGGPRLYQKPAVTREGQVHHRELMCRIFDGNEEVSSAEYMPMVLQFGLSEEYDRLQISRLIPLLRYWPEENL
AIQVTVESLIRPRFQRWLRDTLMQCEKSQRKRIIIELAEADVGQHISRLQPVIRLVNALGVRVAVNQAGLTLVSTSWIKE
LNVELLKLHPGLVRNIEKRTENQLLVQSLVEACSGTSTQVYATGVRSRSEWQTLIQRGVTGGQGDFFASSQPLDTNVKKY
SQRYSV
>O85043 ~~~csoS4A~~~Carboxysome shell vertex protein CsoS4A~~~COG4576
MKIMQVEKTLVSTNRIADMGHKPLLVVWEKPGAPRQVAVDAIGCIPGDWVLCVGSSAAREAAGSKSYPSDLTIIGIIDQW
NGE
>Q31HD5 ~~~csoS4A~~~Carboxysome shell vertex protein CsoS4A~~~COG4576
MKIYKVDKTLVSTNRIAMMEHKPLLVVREKDGGTPQVAVDPVGCKPGDWVICCGSSAARDATGVKGYPSDLTIVGIIDKW
EVPQDADTTS
>O85044 ~~~csoS4B~~~Carboxysome shell vertex protein CsoS4B~~~COG4576
MEVMRVRSDLIATRRIPGLKNISLRVMEDATGKVSVACDPIGVPEGCWVFTISGSAARFGVGDFEILTDLTIGGIIDHWV
T
>P53508 ~~~cssA~~~CS6 fimbrial subunit A~~~
MKKTIGLILILASFGSHARTEIATKNFPVSTTISKSFFAPEPQIQPSFGKNVGKEGGLLFSVSLTVPENVSQVTVYPVYD
EDYGLGRLVNTADDSQSIIYQIVDDKGRKMLKDHGAEVTPNQQITFRALNYTSGDKEIPPGIYNDQVMVGYYVN
>P53510 ~~~cssB~~~CS6 fimbrial subunit B~~~
MLKKIIPAIVLIAGTSGVVNAGNWQYKSLDVNVNIEQNFIPDIDSAVRIIPVNYDSDPKLNSQLYTVEMTIPAGVSAVKI
VPTDSLTSSGQQIGKLVNVNNPDQNMNYYIRKDSGAGKFMAGQKGSFSVKENTSYTFSAIYTGGEYPNSGYSSGTYAGHL
TVSFYSN
>O32192 ~~~cssR~~~Transcriptional regulatory protein CssR~~~COG0745
MSYTIYLVEDEDNLNELLTKYLENEGWNITSFTKGEDARKKMTPSPHLWILDIMLPDTDGYTLIKEIKAKDPDVPVIFIS
ARDADIDRVLGLELGSNDYISKPFLPRELIIRVQKLLQLVYKEAPPVQKNEIAVSSYRVAEDAREVYDENGNIINLTSKE
FDLLLLFIHHKGHPYSREDILLKVWGHDYFGTDRVVDDLVRRLRRKMPELKVETIYGFGYRMMSS
>O32193 2.7.13.3~~~cssS~~~Sensor histidine kinase CssS~~~COG2205
MKNKPLAFQIWVVISGILLAISILLLVLFSNTLRDFFTNETYTTIENEQHVLTEYRLPGSIERRYYSEEATAPTTVRSVQ
HVLLPENEEASSDKDLSILSSSFIHKVYKLADKQEAKKKRYSADVNGEKVFFVIKKGLSVNGQSAMMLSYALDSYRDDLA
YTLFKQLLFIIAVVILLSWIPAIWLAKYLSRPLVSFEKHVKRISEQDWDDPVKVDRKDEIGKLGHTIEEMRQKLVQKDET
ERTLLQNISHDLKTPVMVIRGYTQSIKDGIFPKGDLENTVDVIECEALKLEKKIKDLLYLTKLDYLAKQKVQHDMFSIVE
VTEEVIERLKWARKELSWEIDVEEDILMPGDPEQWNKLLENILENQIRYAETKIEISMKQDDRNIVITIKNDGPHIEDEM
LSSLYEPFNKGKKGEFGIGLSIVKRILTLHKASISIENDKTGVTYRIAVPK
>P15078 ~~~cstA~~~Peptide transporter CstA~~~COG1966
MNKSGKYLVWTVLSVMGAFALGYIALNRGEQINALWIVVASVCIYLIAYRFYGLYIAKNVLAVDPTRMTPAVRHNDGLDY
VPTDKKVLFGHHFAAIAGAGPLVGPVLAAQMGYLPGMIWLLAGVVLAGAVQDFMVLFVSTRRDGRSLGELVKEEMGPTAG
VIALVACFMIMVIILAVLAMIVVKALTHSPWGTYTVAFTIPLALFMGIYLRYLRPGRIGEVSVIGLVFLIFAIISGGWVA
ESPTWAPYFDFTGVQLTWMLVGYGFVAAVLPVWLLLAPRDYLSTFLKIGTIVGLAVGILIMRPTLTMPALTKFVDGTGPV
WTGNLFPFLFITIACGAVSGFHALISSGTTPKMLANEGQACFIGYGGMLMESFVAIMALVSACIIDPGVYFAMNSPMAVL
APAGTADVVASAAQVVSSWGFSITPDTLNQIASEVGEQSIISRAGGAPTLAVGMAYILHGALGGMMDVAFWYHFAILFEA
LFILTAVDAGTRAARFMLQDLLGVVSPGLKRTDSLPANLLATALCVLAWGYFLHQGVVDPLGGINTLWPLFGIANQMLAG
MALMLCAVVLFKMKRQRYAWVALVPTAWLLICTLTAGWQKAFSPDAKVGFLAIANKFQAMIDSGNIPSQYTESQLAQLVF
NNRLDAGLTIFFMVVVVVLALFSIKTALAALKDPKPTAKETPYEPMPENVEEIVAQAKGAH
>J7T3V5 ~~~csxA~~~Exosporium protein A~~~
MFLLYEKEDIYMAINSKDFIPRPGFVNKQGCLPDPVEITCIQVPKVFDQCLIKECLKPTDDCEQLCKQIPNITDPAQVRC
VGCCKDLKVKVNSVTKCPVSNGKPGHKKVTINFTVTFDVDVDVEINGVIHTETLNFSVNRTITASNLYCPDAIAKTIIGK
ECTSAEEIDQQFIKLEVVGECLSTDISKIDCDNDCCSCSCTCEDNGDKKVFLCITLGLFIIIKCEIVVQLMVPAYGYCPV
PEECKCSHDPCKEFMERELPTLYPPQEMDNLFDDYDERQDERHIHDRKHIEEEEERGNLVTSSIIASN
>J7T0S1 ~~~csxB~~~Exosporium protein B~~~
MSKSSEEKMENKEVLNINSFNISEFCNAEEGSNFIHFKPCEICKRAILDPINVADTSRLLQVNVALRNVCIGKELTVGCI
LIDRTGTVLAFKSQTFTVGHGGSGCGCSEDKHGSPCTNTSRRFSFILPTRDLCSSMDLKVKIIANYTHPCN
>J7SGD5 ~~~csxC~~~Exosporium protein C~~~
MMSMDEMRGNYDSNSYKSADHDCHKDCGRVIESQTLPLCEGTNITPTAVTTPVVAKIPVVLAEKEVQIDVEARMKLKEKF
FEIKRIKKDVFLTQCELLPRAGVIENGVPITGKLFLSGFVRKNIEFATADCVKHDVVSGEIKHTTEKIPFTCVTEVTYIT
PPILGNRGIQQKTDLFCGRCDCECECEEERLGKLTCEEFLEDSITLVEKPFCELLGARIFEADIQRKPCYEHGEKVFDEL
LEKMVVHIRVKVLQLQQVAVNDTDAGTGSMCRK
>Q6D0W8 ~~~csy1~~~CRISPR-associated protein Csy1~~~
MRNGLPEFILSYINNRKQAKLDAFDKEAEKKRATLSGEALSVAELELAKARREIEQKHEVRNWLTDAASRAGQISLVTHA
LKFTHSDAKGSSVFNAETVEDATTLSTATLAQPAIDAVGNAAALDVAKLLQTEHDGDSLVAALQRGDNRALEALAENPEQ
LAQWLTGFQQVFTNRQPSSHKLAKQIYFPLANGEYHLLSPLYSSSLAHALHQRISAVRFGDEAKAIRQAQRTNQWHDQLS
ISYPNLAVQNMGGTKPQNISALNSSRSGRSYLLSSAPPQWNSIEKPPQQHESIFRPRGEVDYHTRATLAQMQRFLLSVKD
VENNRDIRQQRLHYLDQLIDQLFFYVASVQNLPVGWSAESELKRAQQLWLDPYRAETDTVFRREREAGDWQQAVAYEFGR
WLNRRLKHENLIFGEVERREWSTAALFKRRMREMESALKEELA
>Q02ML9 ~~~csy1~~~CRISPR-associated protein Csy1~~~
MTSPLPTPTWQELRQFIESFIQERLQGKLDKLQPDEDDKRQTLLATHRREAWLADAARRVGQLQLVTHTLKPIHPDARGS
NLHSLPQAPGQPGLAGSHELGDRLVSDVVGNAAALDVFKFLSLQYQGKNLLNWLTEDSAEALQALSDNAEQAREWRQAFI
GITTVKGAPASHSLAKQLYFPLPGSGYHLLAPLFPTSLVHHVHALLREARFGDAAKAAREARSRQESWPHGFSEYPNLAI
QKFGGTKPQNISQLNNERRGENWLLPSLPPNWQRQNVNAPMRHSSVFEHDFGRTPEVSRLTRTLQRFLAKTVHNNLAIRQ
RRAQLVAQICDEALQYAARLRELEPGWSATPGCQLHDAEQLWLDPLRAQTDETFLQRRLRGDWPAEVGNRFANWLNRAVS
SDSQILGSPEAAQWSQELSKELTMFKEILEDERD
>Q6D0W7 ~~~csy2~~~CRISPR-associated protein Csy2~~~
MSTLIILRRIQVENANAIAGLTYGFPAITHFLGFTHALSRKLQASHGLTLEGCGVVSHQHQLHAYGSSWERSFALTRNPL
TKEAKTAAFNEEGRMHMTVSLLIRCDGQIPADTTALCEHLKQQAQCQRLAGGTVIDIERVTVQSLPVDEAETRGVMRRLL
PGFVLRDRTSLLHRHFQTLQQAKPQAEMIDAWLDFAALKMQAERDPSDETVQWKYLPKPGDGGFLTPLMIGYRAISPLYA
PGEVDKTRDPHTPFCFAEAAYGIGEWQGAHRISDISQILWEYDYQNGDYHCRQVADTHSVAEDTSYEFDY
>Q02MM0 ~~~csy2~~~CRISPR-associated protein Csy2~~~
MSVTDPEALLLLPRLSIQNANAISSPLTWGFPSPGAFTGFVHALQRRVGISLDIELDGVGIVCHRFEAQISQPAGKRTKV
FNLTRNPLNRDGSTAAIVEEGRAHLEVSLLLGVHGDGLDDHPAQEIARQVQEQAGAMRLAGGSILPWCNERFPAPNAELL
MLGGSDEQRRKNQRRLTRRLLPGFALVSREALLQQHLETLRTTLPEATTLDALLDLCRINFEPPATSSEEEASPPDAAWQ
VRDKPGWLVPIPAGYNALSPLYLPGEVRNARDRETPLRFVENLFGLGEWLSPHRVAALSDLLWYHHAEPDKGLYRWSTPR
FVEHAIA
>Q6D0W6 ~~~csy3~~~CRISPR-associated protein Csy3~~~
MAKAATTLKTASVLAFERKLANSDALMYAGNWAQQDNWTAIAIQEKSVRGTISNRLKNALTSDPAKLDAEIQKANLQKVD
VAALPFGADTLKIVFTLRVLGNLAQPSVCNDQDYQTALGDIITGYAQEQGFSTLAARYAENIANGRFLWRNRVGAEAIRV
VVTKKGERSWEFNGEDYSLRQFSQPAGDLAALTQAIEKGLAGDASALFTVEAYVQLGNGQEVFPSQELVLDEKARNGKSK
ILYQVNDVAAIHSQKIGNALRTIDDWYPAADEAGPIAVEPYGSVTSRGKAYRQPREKMDFYTLLDNWVIKGDVPMPEQQH
YVIATLIRGGVFGEKGE
>Q02MM1 ~~~csy3~~~CRISPR-associated protein Csy3~~~
MSKPILSTASVLAFERKLDPSDALMSAGAWAQRDASQEWPAVTVREKSVRGTISNRLKTKDRDPAKLDASIQSPNLQTVD
VANLPSDADTLKVRFTLRVLGGAGTPSACNDAAYRDKLLQTVATYVNDQGFAELARRYAHNLANARFLWRNRVGAEAVEV
RINHIRQGEVARAWRFDALAIGLRDFKADAELDALAELIASGLSGSGHVLLEVVAFARIGDGQEVFPSQELILDKGDKKG
QKSKTLYSVRDAAAIHSQKIGNALRTIDTWYPDEDGLGPIAVEPYGSVTSQGKAYRQPKQKLDFYTLLDNWVLRDEAPAV
EQQHYVIANLIRGGVFGEAEEK
>P0A382 ~~~cyt1Aa~~~Type-1Aa cytolytic delta-endotoxin~~~
MENLNHCPLEDIKVNPWKTPQSTARVITLRVEDPNEINNLLSINEIDNPNYILQAIMLANAFQNALVPTSTDFGDALRFS
MPKGLEIANTITPMGAVVSYVDQNVTQTNNQVSVMINKVLEVLKTVLGVALSGSVIDQLTAAVTNTFTNLNTQKNEAWIF
WGKETANQTNYTYNVLFAIQNAQTGGVMYCVPVGFEIKVSAVKEQVLFFTIQDSASYNVNIQSLKFAQPLVSSSQYPIAD
LTSAINGTL
>P0A383 ~~~cyt1Aa~~~Type-1Aa cytolytic delta-endotoxin~~~
MENLNHCPLEDIKVNPWKTPQSTARVITLRVEDPNEINNLLSINEIDNPNYILQAIMLANAFQNALVPTSTDFGDALRFS
MAKGLEIANTITPMGAVVSYVDQNVTQTNNQVSVMINKVLEVLKTVLGVALSGSVIDQLTAAVTNTFTNLNTQKNEAWIF
WGKETANQTNYTYNVLFAIQNAQTGGVMYCVPVGFEIKVSAVKEQVLFFTIQDSASYNVNIQSLKFAQPLVSSSQYPIAD
LTSAINGTL
>P94594 ~~~cyt1Ab1~~~Type-1Ab cytolytic delta-endotoxin~~~
MENPNHCPLEDIQVNPWKTPQSKARVITLRIDDPNEINNLLSINEIENTNYLLQAIMLANAFQKALVPTSTEFAEDALQF
SMTKGLEVANTISPPGAVVQYVDQNVSQTNNQVSAMINKVLDVLKSILGVALGQSVIEQLTSAVTNTFTNLNTQKNEAWI
FWGRETSTQTNYTYNVLFAIQNGQTGGVMYCVPVGFEIKVSAVKERVLFLTIQDSASYNVNIQSLKFAQPLVSASEYPIA
DLTSAINGTL
>Q45790 ~~~cyt1Ba1~~~Type-1Ba cytolytic delta-endotoxin~~~
MKESIYYNEENEIQISQGNCFPEELGHNPWRQPQSTARVIYLKVKDPIDTTQLLEITEIENPNYVLQAIQLAAAFQDALV
PTETEFGEAIRFSMPKGLEVAKTIQPKGAVVAYTDQTLSQSNNQVSVMIDRVISVLKTVMGVALSGSIITQLTAAITDTF
TNLNTQKDSAWVFWGKETSHQTNYTYNVMFAIQNETTGRVMMCVPIGFEIRVFTDKRTVLFLTTKDYANYSVNIQTLRFA
QPLIDSRALSINDLSEALRSSKYLY
>Q04470 ~~~cyt2Aa1~~~Type-2Aa cytolytic delta-endotoxin~~~
MYTKNFSNSRMEVKGNNGCSAPIIRKPFKHIVLTVPSSDLDNFNTVFYVQPQYINQALHLANAFQGAIDPLNLNFNFEKA
LQIANGIPNSAIVKTLNQSVIQQTVEISVMVEQLKKIIQEVLGLVINSTSFWNSVEATIKGTFTNLDTQIDEAWIFWHSL
SAHNTSYYYNILFSIQNEDTGAVMAVLPLAFEVSVDVEKQKVLFFTIKDSARYEVKMKALTLVQALHSSNAPIVDIFNVN
NYNLYHSNHKIIQNLNLSN
>Q45723 ~~~cyt2Ba1~~~Type-2Ba cytolytic delta-endotoxin~~~
MHLNNLNNFNNLENNGEYHCSGPIIKKPFRHIALTVPSSDITNFNEIFYVEPQYIAQAIRLTNTFQGAIDPLTLNFNFEK
ALQIANGLPNAGVTGTINQSVIHQTIEVSVMISQIKEIIRSVLGLVINSANFWNSVVSAITNTFTNLEPQVDENWIVWRN
LSATQTSYFYKILFSIQNEDTGRFMAILPIAFEITVDVQKQQLLFITIKDSARYEVKMKALTVVQALDSYNAPIIDVFNV
RNYSLHRPNHNILQNLNVNPIKS
>O32322 ~~~cyt2Bb1~~~Type-2Bb cytolytic delta-endotoxin~~~
MYTKNLNSLEINEDYQYSRPIIKKPFRHITLTVPSSDIASFNEIFYLEPQYVAQALRLTNTFQAAIDPLTLNFDFEKALQ
IANGLPNAGITGTLNQSVIQQTIEISVMISQIKEIIRNVLGLVINSTNFWNSVLAAITNTFTNLEPQVDENWIVWRNLSA
THTSYYYKILFSIQNEDTGAFMAVLPIAFEITVDVQKQQLLFITIRDSARYEVKMKALTVVQLLDSYNAPIIDVFNVHNY
GLYQSNHPNHHILQNLNLNKIKG
>P94286 2.4.1.248~~~~~~Cycloisomaltooligosaccharide glucanotransferase~~~
MVRFMYALRKRRLSLLLAMSLLVMCVASVVSPPPQALASGSGGIERVFTDKARYNPGDAVSIRVQAKNGTGSSWSGAARL
EIFHLENSVYTSSQSLSLTNGQSTTLTFTWTAPSTDFRGYFVRIDAGTLGQGATAIDVSSDFTKYPRYGYISEFESGETA
LESKAKVDQLAQDYHINAWQFYDWMWRHDKMIKRTGGSIDSTWLDLFNREISWSTLQNQIDAVHDVNGKAMAYAMIYASR
ENYSPLGISPTWGIYEDSSHTNQFDVDFGDGSTYLYMSDPQNPNWQNYIHAEYIDSINTAGFDGIHVDQMGQRSNVYDYN
GNSIDLSTRFSPFLDQAKSVLSANNPARDNLTYNIVDGTVNGWAVNDVSKNADLDFLYSEIWYLSDSYNQLKNYIEQLRA
NGGNKAVVLAAYMNYADNAGTRYEAESASMTNVSTNTNHAGYTGSGFVDQFASTGDKVSFAINAPEAGDYSLVFRYGNNT
GANSTLNLYVDGNFVQKLYFFNQSSWGTWKHDAWYQVPLTQGAHTVELRYESGNVGAVNLDSLTLGTFDEHSVRLADAMM
SASGATHIELGDDNQMLPHEYYPNRSKTMRSSLKNAMKDHYNFITAYENLLFDSDVVPNDTGSQFVNLTGVSASGDGSAN
TVWYINKRTSDYNIVHLINLLGNDNQWRNTASQPSFQTNLPAKIYIGADETISDVYLASPDLSGGETQELAFTSGTDAGG
KYVSFTVPELKYWNMIYMKRTFSVPANDIYEAETAIKSNVSTNTNHAGYTGSGFVDGFSSTNDGVSFVVKSTASDDYALR
FRYANGGSDATRDVYVDGKLAGTVSFKSTGSWSTWSYGEITARLEPGHHTIVLWQTSGNTGAINLDHLDLDKTYIWQFDR
QIVSVPAGYRITFRTGLPGWVHWGVNGWTGVTDTPLRSNGSLDGNLDHETSIGPFATGTAVDVTFLWDDNNNGILEPSTD
RWEGTDFGINVS
>P12946 1.17.99.9~~~ctaA~~~Heme A synthase~~~COG1612
MNKALKALGVLTTFVMLIVLIGGALVTKTGSGQGCGRQWPLCHGRFFPELNPASIIEWSHRFASGISIILVLSLAFWSWR
KITPIFRETTFLAIMSIIFLFLQALLGALAVVFGSNALIMALHFGISLISFASVLILTLLIFEADKSVRTLVKPLQIGKK
MQFHMIGILIYSYIVVYTGAYVRHTESSLACPNVPLCSPLNNGLPTQFHEWVQMGHRAAALLLFVWIIVAAVHAITSYKD
QKQIFWGWISCLIFITLQALSGIMIVYSELALGFALAHSFFIACLFGVLCYFLLLIARFRYESRQS
>Q3IXW9 1.17.99.9~~~ctaA~~~Heme A synthase~~~COG1612
MAVKKRSIFEEVGQGAKAPVPQGGSIDRGHGGARRGIRLWLMALFLLVMAMIVVGGLTRLTDSGLSITEWRPVTGAVPPL
NETQWAAEFDKYRDSPQYRLMNAGMTLAEFQRIYWWEWGHRQLGRVIGLVWAVGFLGFLAARRIPRGWWPRLLALGALGG
LQGGIGWWMVASGLEGDKVTVESTRLATHLGLAFIILGLIAWQALLLGRSESDLLQARRQKDGRLVTLTTVLIGVAFLQI
VLGALVAGIDAGRGFPTWPDMNGTFLPAEMFYVPGVETDWRNPAWWLGLLQNPGFVQFLHRMAGYTLAALGLIFWIFGRR
SRHRATRGAFDLLAMALLAQILLGVGTVLSAAEWQVAIAHQVGAVVIWVLILHARHLALYPRVGSIRKGTL
>P94346 1.17.99.9~~~ctaA~~~Heme A synthase~~~
MQRSLKWFASTTTVAMLFVLIGGALVTKTDSGMGCGRSWPLCHGQWIPDDITPQLVIELSHRLVSGLAAIMVLILCIRSW
RVMGHVRETKPLAVLSFVFLVLQSLIGAAAVVWGQSDFVMALHFGISLISFAAVLLLTLLIFVVDKKFSPTSLQLDGQMR
FHIYGIIIYSYLVVYTGALVRHTNASLACPSWPLCAKSRLLPVQFHEWVQMGHRLAAAVIIIWIAVATVHAARYYREQPV
IYYGWIISLLLVLAQMVTGALVVFTELNLYISLAHAFFISCLFGVLSYLLLLALRTRRRPATAAGRSVEDTASAPLK
>A0A1E7MYN1 1.14.19.49~~~ctcP~~~Tetracycline 7-halogenase~~~
MTDTTADQTRHGDRPYDVVIIGSGLSGTMLGSILAKHGFRIMLLDGAHHPRFAVGESTIGQTLVVLRLISDRYGVPEIAN
LASFQDVLANVSSSHGQKSNFGFMFHRDGEEPDPNETSQFRIPSIVGNAAHFFRQDTDSYMFHAAVRYGCDARQYYRVEN
IEFDDGGVTVSGADGSTVRARYLVDASGFRSPLARQLGLREEPSRLKHHARSIFTHMVGVDAIDDHVDTPAELRPPVPWN
DGTMHHIFERGWMWIIPFNNHPGATNPLCSVGIQLDERRYPARPDLTPEEEFWSHVDRFPAVQRQLKGARSVREWVRTDR
MQYSSSRTVGERWCLMSHAAGFIDPLFSRGLSNTCEIINALSWRLMAALREDDFAVERFAYVEELEQGLLDWNDKLVNNS
FISFSHYPLWNSVFRIWASASVIGGKRILNALTRTKETGDDSHCQALDDNPYPGLWCPLDFYKEAFDELTELCEAVDAGH
TTAEEAARVLEQRVRESDWMLPALGFNDPDTHHINPTADKMIRIAEWATGHHRPEIRELLAASAEEVRAAMRVKP
>S4S3E3 1.5.1.36~~~ctcQ~~~Flavin reductase (NADH)~~~
MPPEPLSLPLDLAPGLVDGDTFLSIMGALPTGVTVVTTLGPDGEPYGLTCSAACSVSKAPPLLLVCINRDSRVLKALLER
GEFAVNVLRGGGESTSARFAAPVDDRFRDVRWEPGSAGGVPVMSADVVAHAECRVAAALDAGDHTIVIGAVVAGGPRPEV
PSPLMYWRRSYARWPVEEDPRTAALTLAAEG
>P14194 ~~~ctc~~~General stress protein Ctc~~~COG1825
MATLTAKERTDFTRSSLRNIRTSGHVPGIIYGKDTGNKPVSLDSVELIKTLRDEGKNAVITLEVSGEKHSVMVTDLQTDP
LKNEITHADFQVVNMSEDIEVEVPIHLTGEAIGVKNGGVLQQPLYALTVKAKPKAIPQTIEADISSLDVNEVLTIADLPA
GGDYSFNHESDEVVASILPPQQQEAAEVDEEESADAQPEGENEQ
>Q0P8J8 2.7.7.103~~~~~~CTP:phosphoglutamine cytidylyltransferase~~~COG1213
MNAIILAAGFGSRLMPLTKDQPKCMVEYKNKKIIDYEIEALKSAGINEIAVVGGYLNDVLKNYLNKYDIEHFFINSKYDK
TNMVHTFFCAKDFMLKCIEEKQDLIISYADIVYFQDCVQKLINAKEELAIVVDKSWCKLWSKRFANPLEDAETLKMTNGY
IIELGKKANAYDEIEAQYIGLFKFSYQFLSEVIAFYEMLDRDILYDNKNFENMYMTSFLQALIEKYNNAKAVEIDGNWCE
IDFMSDLEVQIEK
>Q7NY09 2.4.2.-~~~cteC~~~NAD(+)--protein-threonine ADP-ribosyltransferase~~~
MLFFTGLQMIWIDKAMNISLSSAVAAASVSTASVGGVPHEITGGNRQEKLAQLMRQFESGGLYLRTVSDHRDEFENTFMP
KLDACLGHGCDERYWSSATFIQQGLNGKVHDPHADRTGLIISADARLGGFSTFDAATANVPSGLEPSQYFPGQFPKFDMM
GAYQATWNEDIFSVDATAVSEQQMDELGIPDEYRSVFDFDRIQEKMAQPRLAGREVEPTEAKICYQPKDVLGIYVDVDSP
ASQSKARELQQAMREQGFDLPFIAYRGGAAQELASV
>P33752 2.8.3.9~~~ctfA~~~Acetoacetyl-CoA:acetate/butyrate CoA transferase alpha subunit~~~
MNSKIIRFENLRSFFKDGMTIMIGGFLNCGTPTKLIDFLVNLNIKNLTIISNDTCYPNTGIGKLISNNQVKKLIASYIGS
NPDTGKKLFNNELEVELSPQGTLVERIRAGGSGLGGVLTKTGLGTLIEKGKKKISINGTEYLLELPLTADVALIKGSIVD
EAGNTFYKGTTKNFNPYMAMAAKTVIVEAENLVSCEKLEKEKAMTPGVLINYIVKEPA
>P23673 2.8.3.9~~~ctfB~~~Acetoacetyl-CoA:acetate/butyrate CoA-transferase beta subunit~~~
MINDKNLAKEIIAKRVARELKNGQLVNLGVGLPTMVADYIPKNFKITFQSENGIVGMGASPKINEADKDVVNAGGDYTTV
LPDGTFFDSSVSFSLIRGGHVDVTVLGALQVDEKGNIANWIVPGKMLSGMGGAMDLVNGAKKVIIAMRHTNKGQPKILKK
CTLPLTAKSQANLIVTELGVIEVINDGLLLTEINKNTTIDEIRSLTAADLLISNELRPMAV
>Q9ZKJ5 2.7.11.1~~~ctkA~~~Serine/threonine-protein kinase CtkA~~~COG3550
MPTIDFTFCEINPKKGFGGANGNKISLFYNNELYMVKFPPKPSTHKEMSYTNGCFSEYVACHIVNSLGLKVQETLLGTYK
NKIVVACKDFTTHQYELVDFLSLKNTMIELEKSGKDTNLNDVLYAIDNQHFIEPKVLKCFFWDMFVADTLLGNFDRHNGN
WGFLRASNSKEYQIAPIFDCGSCLYPQADDVVCQKVLSNIDELNARIYNFPQSILKDDNDKKINYYDFLTQTNNKDCLDA
LLRIYPRIDMNKIHSIIDNTPFMSEIHKEFLHTMLDERKSKIIDVAHTRAIELSLQHKQAHSNPYDNADDLDNSNEYTPT
PKRRR
>Q7A5M9 3.4.21.-~~~~~~Probable CtpA-like serine protease~~~
MDDKQHTSSSDDERAEIATSNQDQETNSSKRVHLKRWQFISILIGTTLITAVITVVAYIFINQKISGLNKTDQANLNKIE
NVYKILNSDYYKKQDSDKLSKAAIDGMVKELKDPYSEYLTKEQTKSFNEGVSGDFVGIGAEMQKKNDQIMVTSPMKGSPA
ERAGIRPKDVITKVNGKSIKGKALDEVVKDVRGKENTEVTLTVQRGSEEKDVKIKREKIHVKSVDYKKKGKVGVITINKF
QNDTSGELKDAVLKAHKDGLKKIVLDLRNNPGGLLDEAVKMANIFIDKGKTVVKLEKGKDTEAIQTSNDSLKEAKDMDIS
ILVNEGSASASEVFTGALKDYNKAKVYGSKTFGKGVVQTTREFKDGSLLKYTEMKWLTPDGHYIHGKGIKPDVTIDTPKY
QSLNVIPNTKTFKVGDDDKNIKTIKIGLSALGYKVDNESTQFDQALENQVKAFQQANKLEVTGEFNKETNNKFTELLVEK
ANKHDDVLDKLINILK
>O34666 3.4.21.102~~~ctpA~~~Carboxy-terminal processing protease CtpA~~~COG0793
MKRQLKLFFIVLITAVVASALTLFITGNSSILGQKSASTGDSKFDKLNKAYEQIKSDYYQKTDDDKLVDGAIKGMIQSLD
DPYSTYMDQEQAKSFDETISASFEGIGAQVEEKDGEILIVSPIKGSPAEKAGIKPRDQIIKVNGKSVKGMNVNEAVALIR
GKKGTKVKLELNRAGVGNIDLSIKRDTIPVETVYSEMKDNNIGEIQITSFSETTAKELTDAIDSLEKKGAKGYILDLRGN
PGGLMEQAITMSNLFIDKGKNIMQVEYKNGSKEVMKAEKERKVTKPTVVLVNDGTASAAEIMAAALHESSNVPLIGETTF
GKGTVQTAKEYDDGSTVKLTVAKWLTADGEWIHKKGIKPQVKAELPDYAKLPYLDADKTYKSGDTGTNVKVAQKMLKALG
YKVKVNSMYDQDFVSVVKQFQKKEKLNETGILTGDTTTKLMIELQKKLSDNDTQMEKAIETLKKEM
>P9WPU1 7.2.2.8~~~ctpA~~~Copper-exporting P-type ATPase~~~COG2217
MTTAVTGEHHASVQRIQLRISGMSCSACAHRVESTLNKLPGVRAAVNFGTRVATIDTSEAVDAAALCQAVRRAGYQADLC
TDDGRSASDPDADHARQLLIRLAIAAVLFVPVADLSVMFGVVPATRFTGWQWVLSALALPVVTWAAWPFHRVAMRNARHH
AASMETLISVGITAATIWSLYTVFGNHSPIERSGIWQALLGSDAIYFEVAAGVTVFVLVGRYFEARAKSQAGSALRALAA
LSAKEVAVLLPDGSEMVIPADELKEQQRFVVRPGQIVAADGLAVDGSAAVDMSAMTGEAKPTRVRPGGQVIGGTTVLDGR
LIVEAAAVGADTQFAGMVRLVEQAQAQKADAQRLADRISSVFVPAVLVIAALTAAGWLIAGGQPDRAVSAALAVLVIACP
CALGLATPTAMMVASGRGAQLGIFLKGYKSLEATRAVDTVVFDKTGTLTTGRLQVSAVTAAPGWEADQVLALAATVEAAS
EHSVALAIAAATTRRDAVTDFRAIPGRGVSGTVSGRAVRVGKPSWIGSSSCHPNMRAARRHAESLGETAVFVEVDGEPCG
VIAVADAVKDSARDAVAALADRGLRTMLLTGDNPESAAAVATRVGIDEVIADILPEGKVDVIEQLRDRGHVVAMVGDGIN
DGPALARADLGMAIGRGTDVAIGAADIILVRDHLDVVPLALDLARATMRTVKLNMVWAFGYNIAAIPVAAAGLLNPLVAG
AAMAFSSFFVVSNSLRLRKFGRYPLGCGTVGGPQMTAPSSA
>O35002 3.4.21.102~~~ctpB~~~Carboxy-terminal processing protease CtpB~~~COG0793
MNQKIMAVIAAGSMLFGGAGVYAGINLLEMDKPQTAAVPATAQADSERDKAMDKIEKAYELISNEYVEKVDREKLLEGAI
QGMLSTLNDPYSVYMDKQTAKQFSDSLDSSFEGIGAEVGMEDGKIIIVSPFKKSPAEKAGLKPNDEIISINGESMAGKDL
NHAVLKIRGKKGSSVSMKIQRPGTKKQLSFRIKRAEIPLETVFASEKKVQGHSVGYIAISTFSEHTAEDFAKALRELEKK
EIEGLVIDVRGNPGGYLQSVEEILKHFVTKDQPYIQIAERNGDKKRYFSTLTHKKAYPVNVITDKGSASASEILAGALKE
AGHYDVVGDTSFGKGTVQQAVPMGDGSNIKLTLYKWLTPNGNWIHKKGIEPTIAIKQPDYFSAGPLQLKEPLKVDMNNED
VKHAQVLLKGLSFDPGREDGYFSKDMKKAVMAFQDQNKLNKTGVIDTRTAETLNQQIEKKKSDEKNDLQLQTALKSLFVN
>P9WPT9 7.2.2.-~~~ctpB~~~Cation-transporting P-type ATPase B~~~COG2217
MAAPVVGDADLQSVRRIRLDVLGMSCAACASRVETKLNKIPGVRASVNFATRVATIDAVGMAADELCGVVEKAGYHAAPH
TETTVLDKRTKDPDGAHARRLLRRLLVAAVLFVPLADLSTLFAIVPSARVPGWGYILTALAAPVVTWAAWPFHSVALRNA
RHRTTSMETLISVGIVAATAWSLSSVFGDQPPREGSGIWRAILNSDSIYLEVAAGVTVFVLAGRYFEARAKSKAGSALRA
LAELGAKNVAVLLPDGAELVIPASELKKRQRFVTRPGETIAADGVVVDGSAAIDMSAMTGEAKPVRAYPAASVVGGTVVM
DGRLVIEATAVGADTQFAAMVRLVEQAQTQKARAQRLADHIAGVFVPVVFVIAGLAGAAWLVSGAGADRAFSVTLGVLVI
ACPCALGLATPTAMMVASGRGAQLGIFIKGYRALETIRSIDTVVFDKTGTLTVGQLAVSTVTMAGSGTSERDREEVLGLA
AAVESASEHAMAAAIVAASPDPGPVNGFVAVAGCGVSGEVGGHHVEVGKPSWITRTTPCHDAALVSARLDGESRGETVVF
VSVDGVVRAALTIADTLKDSAAAAVAALRSRGLRTILLTGDNRAAADAVAAQVGIDSAVADMLPEGKVDVIQRLREEGHT
VAMVGDGINDGPALVGADLGLAIGRGTDVALGAADIILVRDDLNTVPQALDLARATMRTIRMNMIWAFGYNVAAIPIAAA
GLLNPLIAGAAMAFSSFFVVSNSLRLRNFGAQ
>B2HEM2 7.2.2.12~~~ctpC~~~Zinc-exporting P-type ATPase~~~COG2217
MTLAIVKEVPAGADGDDTTDLVVLSDAAGRMRVRADWVRGNSRRAVAVEEAVAKQDGVRVVHAYPRTGSVVVWYSPRRCD
RAAVLEAIGGAKHVAAELIPARAPHSTEIRNTDVLRMVIGGAALALLGVRRYVFARPPLLGPSGRMVATGVTIFTGYPFL
RGALRSLRSGKAGTDALVSAATIASLILRENVVALTVLWLLNIGEYLQDLTLRRTRRAISELLRGNQDTAWIRLTDGPEA
GTEVQVPIDSVQIGDEVVVHDHVAIPVDGEVVDGEAIVNQSAITGENLPVSVVAGATVHAGSVVVRGRLVVRAQAVGNQT
TIGRIITRVEEAQNDRAPIQTVGENFSRRFVPTSFIVSAITLLVTGDVRRAMTMLLIACPCAVGLSTPTAISAAIGNGAR
RGILIKGGSHLEQAGRVDAIVFDKTGTLTVGRPVVTNIIAMHKDWEPEQVLAYAASSEIHSRHPLAEAVIRSTEERRISI
PPHEECEVLVGLGMRTWADGRTLLLGSPSLLESEQVKVSKKASEWVGKLRQQAETPLLLAVDGTLVGLISLRDEVRPEAA
EVLTKLRDNGVRRIVMLTGDHPDIAKVVAEELGIDEWRAEVMPEDKLEVVRDLQDEGYVVGMVGDGINDAPALAAADIGI
AMGLAGTDVAVETADVALANDDLHRLLDVRDLGGRAVDVIRQNYGMSIAVNAAGLLIGAGGALSPVLAAILHNASSVAVV
ANSSRLIRYRLE
>P9WPT5 7.2.2.22~~~ctpC~~~Manganese-exporting P-type ATPase~~~COG2217
MTLEVVSDAAGRMRVKVDWVRCDSRRAVAVEEAVAKQNGVRVVHAYPRTGSVVVWYSPRRADRAAVLAAIKGAAHVAAEL
IPARAPHSAEIRNTDVLRMVIGGVALALLGVRRYVFARPPLLGTTGRTVATGVTIFTGYPFLRGALRSLRSGKAGTDALV
SAATVASLILRENVVALTVLWLLNIGEYLQDLTLRRTRRAISELLRGNQDTAWVRLTDPSAGSDAATEIQVPIDTVQIGD
EVVVHEHVAIPVDGEVVDGEAIVNQSAITGENLPVSVVVGTRVHAGSVVVRGRVVVRAHAVGNQTTIGRIISRVEEAQLD
RAPIQTVGENFSRRFVPTSFIVSAIALLITGDVRRAMTMLLIACPCAVGLSTPTAISAAIGNGARRGILIKGGSHLEQAG
RVDAIVFDKTGTLTVGRPVVTNIVAMHKDWEPEQVLAYAASSEIHSRHPLAEAVIRSTEERRISIPPHEECEVLVGLGMR
TWADGRTLLLGSPSLLRAEKVRVSKKASEWVDKLRRQAETPLLLAVDGTLVGLISLRDEVRPEAAQVLTKLRANGIRRIV
MLTGDHPEIAQVVADELGIDEWRAEVMPEDKLAAVRELQDDGYVVGMVGDGINDAPALAAADIGIAMGLAGTDVAVETAD
VALANDDLHRLLDVGDLGERAVDVIRQNYGMSIAVNAAGLLIGAGGALSPVLAAILHNASSVAVVANSSRLIRYRLDR
>A0R3A7 7.2.2.-~~~ctpD~~~Probable cobalt/nickel-exporting P-type ATPase~~~COG2217
MTALYPAVEPAPAARPARPRSGGWLWTVPSVRWAAAALALFLTGLAAQLLGAPQAVVWTLYLACYVVGGWEPAWVGVRAL
RNRTLDVDLLMIVAAIGAATIGQVFDGALLIVIFATSGALEDVATTRTERSVRGLLDLAPEHATLLGDGSQRVVAAADLR
PGDVIVVRPGERISADGTVIGGASEVDQSSITGEPLPAAKDVGDDVFAGTVNGSGALRVEVTREPSQTVVARIVAMVTEA
SATKATTQLFIEKIEQRYSAGVVVATLALLTVPLMFGADLRSTLLRAMTFMIVASPCAVVLATMPPLLSAIANASRHGVL
VKSAVAMERLADTDVVVLDKTGTLTAGEPVISRVTVLIDGADVLGMAAAAEQFSEHPLGRAIVAAARGRVVPEAGDFTAL
PGRGVRARVAGHVVEVVSPAAYAGENAAVREHCAAIENDGGTAVVVLEDGLPVGVIGLADRLRPDAPAAVMQLAQLTKHP
PMLLTGDNRRAAGRLAEEAGIADVHAELLPDGKAAAVQKLQRDNTHVLVVGDGVNDAPAMAAAHTSIAMGRAGADLTVQT
ADVVTIRDELATVPAVIALARRARRVVIANLVMAGAAITTLVLWDLFGQLPLPLGVAGHEGSTILVALNGLRLLSNRAWI
SPGATPT
>A0R3Y2 7.2.2.10~~~ctpE~~~Calcium-transporting ATPase CtpE~~~COG0474
MTTMAAAGLTDAEVAQRIAEGKTNDVPTRAARTVSEIVRANVFTRINAILGVLFVIVLSTGSVINGAFGLLIIANSAIGI
IQEIRAKQTLDKLAIVGQAKPTVRRQSGTRAVLPSEVVLDDIIELGPGDQIVVDGEVVEETNLEVDESLLTGEADPIAKD
AGDPVMSGSFVVAGSGAYRATKVGREAYAAKLAEEASKFTLVKSELRNGINKILQFITYLLVPAGLLTIYTQLFTTDAGW
REAVLRMVGALVPMVPEGLVLMTSIAFAVGVVRLGRRQCLVNELPAIEGLARVDVVCADKTGTLTENGMRVSDLKSLTEG
HVADVLAQLASDDARPNASMAAIAEAYQTPPGWSATATAPFKSATKWSGTSYGEHGNWVIGAPDVLLDPASPVAEEAERI
GAQGLRVLLLGSSDRSVDAPDAPGVVTPAALVVLEQRIRPDAGDTLDYFASQHVSVKVISGDNAVSVGAVAGKLGLHGET
MDARRLPEQPEKLAETLEECTTFGRVRPDQKRAMVHALQSRGHTVAMTGDGVNDVLALKDSDIGVAMGSGSSASRAVAQI
VLLDNKFATLPYVVGEGRRVIGNIERVSNLFLTKTVYSVLLAILVGIGGLSAKIFGTDPLLFPFQPIHVTIAAWFTIGIP
AFILSLAPNNERAKTGFVRRVMTAALPSGLVVGTATFVSYLVAYQGREATPVEQTQASTAALITLLASSLWVLAVVARPY
QWWRVLLVACSMLAYVLIFSIPLAQELFMLDPTNMKVTSVALGIGLAGAALIEVLWWVQGRVLGEERRVWR
>P9WPT1 7.2.2.10~~~ctpE~~~Calcium-transporting ATPase CtpE~~~COG0474
MTRSASATAGLTDAEVAQRVAEGKSNDIPERVTRTVGQIVRANVFTRINAILGVLLLIVLATGSLINGMFGLLIIANSVI
GMVQEIRAKQTLDKLAIIGQAKPLVRRQSGTRTRSTNEVVLDDIIELGPGDQVVVDGEVVEEENLEIDESLLTGEADPIA
KDAGDTVMSGSFVVSGAGAYRATKVGSEAYAAKLAAEASKFTLVKSELRNGINRILQFITYLLVPAGLLTIYTQLFTTHV
GWRESVLRMVGALVPMVPEGLVLMTSIAFAVGVVRLGQRQCLVQELPAIEGLARVDVVCADKTGTLTESGMRVCEVEELD
GAGRQESVADVLAALAAADARPNASMQAIAEAFHSPPGWVVAANAPFKSATKWSGVSFRDHGNWVIGAPDVLLDPASVAA
RQAERIGAQGLRVLLLAAGSVAVDHAQAPGQVTPVALVVLEQKVRPDARETLDYFAVQNVSVKVISGDNAVSVGAVADRL
GLHGEAMDARALPTGREELADTLDSYTSFGRVRPDQKRAIVHALQSHGHTVAMTGDGVNDVLALKDADIGVAMGSGSPAS
RAVAQIVLLNNRFATLPHVVGEGRRVIGNIERVANLFLTKTVYSVLLALLVGIECLIAIPLRRDPLLFPFQPIHVTIAAW
FTIGIPAFILSLAPNNERAYPGFVRRVMTSAVPFGLVIGVATFVTYLAAYQGRYASWQEQEQASTAALITLLMTALWVLA
VIARPYQWWRLALVLASGLAYVVIFSLPLAREKFLLDASNLATTSIALAVGVVGAATIEAMWWIRSRMLGVKPRVWR
>P9WPS8 7.2.2.-~~~ctpF~~~Probable cation-transporting ATPase F~~~
MSASVSATTAHHGLPAHEVVLLLESDPYHGLSDGEAAQRLERFGPNTLAVVTRASLLARILRQFHHPLIYVLLVAGTITA
GLKEFVDAAVIFGVVVINAIVGFIQESKAEAALQGLRSMVHTHAKVVREGHEHTMPSEELVPGDLVLLAAGDKVPADLRL
VRQTGLSVNESALTGESTPVHKDEVALPEGTPVADRRNIAYSGTLVTAGHGAGIVVATGAETELGEIHRLVGAAEVVATP
LTAKLAWFSKFLTIAILGLAALTFGVGLLRRQDAVETFTAAIALAVGAIPEGLPTAVTITLAIGMARMAKRRAVIRRLPA
VETLGSTTVICADKTGTLTENQMTVQSIWTPHGEIRATGTGYAPDVLLCDTDDAPVPVNANAALRWSLLAGACSNDAALV
RDGTRWQIVGDPTEGAMLVVAAKAGFNPERLATTLPQVAAIPFSSERQYMATLHRDGTDHVVLAKGAVERMLDLCGTEMG
ADGALRPLDRATVLRATEMLTSRGLRVLATGMGAGAGTPDDFDENVIPGSLALTGLQAMSDPPRAAAASAVAACHSAGIA
VKMITGDHAGTATAIATEVGLLDNTEPAAGSVLTGAELAALSADQYPEAVDTASVFARVSPEQKLRLVQALQARGHVVAM
TGDGVNDAPALRQANIGVAMGRGGTEVAKDAADMVLTDDDFATIEAAVEEGRGVFDNLTKFITWTLPTNLGEGLVILAAI
AVGVALPILPTQILWINMTTAIALGLMLAFEPKEAGIMTRPPRDPDQPLLTGWLVRRTLLVSTLLVASAWWLFAWELDNG
AGLHEARTAALNLFVVVEAFYLFSCRSLTRSAWRLGMFANRWIILGVSAQAIAQFAITYLPAMNMVFDTAPIDIGVWVRI
FAVATAITIVVATDTLLPRIRAQPP
>P9WPS9 7.2.2.-~~~ctpF~~~Probable cation-transporting ATPase F~~~COG0474
MSASVSATTAHHGLPAHEVVLLLESDPYHGLSDGEAAQRLERFGPNTLAVVTRASLLARILRQFHHPLIYVLLVAGTITA
GLKEFVDAAVIFGVVVINAIVGFIQESKAEAALQGLRSMVHTHAKVVREGHEHTMPSEELVPGDLVLLAAGDKVPADLRL
VRQTGLSVNESALTGESTPVHKDEVALPEGTPVADRRNIAYSGTLVTAGHGAGIVVATGAETELGEIHRLVGAAEVVATP
LTAKLAWFSKFLTIAILGLAALTFGVGLLRRQDAVETFTAAIALAVGAIPEGLPTAVTITLAIGMARMAKRRAVIRRLPA
VETLGSTTVICADKTGTLTENQMTVQSIWTPHGEIRATGTGYAPDVLLCDTDDAPVPVNANAALRWSLLAGACSNDAALV
RDGTRWQIVGDPTEGAMLVVAAKAGFNPERLATTLPQVAAIPFSSERQYMATLHRDGTDHVVLAKGAVERMLDLCGTEMG
ADGALRPLDRATVLRATEMLTSRGLRVLATGMGAGAGTPDDFDENVIPGSLALTGLQAMSDPPRAAAASAVAACHSAGIA
VKMITGDHAGTATAIATEVGLLDNTEPAAGSVLTGAELAALSADQYPEAVDTASVFARVSPEQKLRLVQALQARGHVVAM
TGDGVNDAPALRQANIGVAMGRGGTEVAKDAADMVLTDDDFATIEAAVEEGRGVFDNLTKFITWTLPTNLGEGLVILAAI
AVGVALPILPTQILWINMTTAIALGLMLAFEPKEAGIMTRPPRDPDQPLLTGWLVRRTLLVSTLLVASAWWLFAWELDNG
AGLHEARTAALNLFVVVEAFYLFSCRSLTRSAWRLGMFANRWIILGVSAQAIAQFAITYLPAMNMVFDTAPIDIGVWVRI
FAVATAITIVVATDTLLPRIRAQPP
>P9WPS7 7.2.2.-~~~ctpG~~~Probable cation-transporting ATPase G~~~COG2217
MTTVVDAEVQLTVVSDAAGRMRVQATGFQFDAGRAVAIEDTVGKVAGVQAVHAYPRTASIVIWYSRAICDTAAILSAIID
AETVPAAAVPAYASRSASNRKAGVVQKIIDWSTRTLSGVRRDVAAQPSGETSDACCDGEDNEDREPEQLWQVAKLRRAAF
SGVLLTASLVAAWAYPLWPVVLGLKALALAVGASTFVPSSLKRLAEGRVGVGTLMTIAALGAVALGELGEAATLAFLFSI
SEGLEEYATARTRRGLRALLSLVPDQATVLREGTETIVASTELHVGDQMIVKPGERLATDGIIRAGRTALDVSAITGESV
PVEVGPGDEVFAGSINGLGVLQVGVTATAANNSLARIVHIVEAEQVRKGASQRLADCIARPLVPSIMIAAALIAGTGSVL
GNPLVWIERALVVLVAAAPCALAIAVPVTVVASIGAASRLGVLIKGGAALETLGTIRAVALDKTGTLTANRPVVIDVATT
NGATREEVLAVAAALEARSEHPLAVAVLAATQATTAASDVQAVPGAGLIGRLDGRVVRLGRPGWLDAAELADHVACMQQA
GATAVLVERDQQLLGAIAVRDELRPEAAEVVAGLRTGGYQVTMLTGDNHATAAALAAQAGIEQVHAELRPEDKAHLVAQL
RARQPTAMVGDGVNDAPALAAADLGIAMGAMGTDVAIETADVALMGQDLRHLPQALDHARRSRQIMVQNVGLSLSIITVL
MPLALFGILGLAAVVLVHEFTEVIVIANGVRAGRIKPLAGPPKTPDRTIPG
>G3XDA3 ~~~ctpH~~~Methyl-accepting chemotaxis protein CtpH~~~
MPASPGHRDVLGCLVAACVPVQPGNPSRRSMLQQSLRAQILVLLGGSLAALLLIALACFGSLTGDVRAYRELLGGPVRAA
QLIDEANLQFRGQVQEWKNVLLRGRQTEAQTKYWSQFEAQERAVQDILGRLGSVAEGELKDRVERLREEHRRLGTAYRQG
RQRFLEAGADPIAGDQAVTGIDRATTAQMQTLRDELHQASDLHSSSISAEARRTMLLGSLVLIGASLAVALLSLWLVNRN
LVRPVQRLIEHIAQLSHGDFGERIEIRRKDELGKLALAANTLRDFLVDIFDRLRRSTRDLDSASGSLNAIASLMAAGTRE
QFSRTDQVATAMQEMSATAQEVARYAGDAARAADEADDSAQRGEDVMEETIRSIGEMRKEIDHTVEVIRQLESDSGRIGK
VLDVIRGIAEQTNLLALNAAIEAARAGDAGRGFAVVADEVRTLAQRTAESIAEIHQIIDTVQNGAVNAARAIESGQSRSE
AGAEQVANAGAMLRQITASVESIRDMNRQIATAAEEQTAVAEEISRNLTEIASIASSNQEQVEQTEAASRDLHGLSAQLG
DALQRLRA
>P9WPS5 7.2.2.-~~~ctpI~~~Probable cation-transporting ATPase I~~~COG0474
MKIPGVATVLGGVTNGVAQTVRAGARLPGSAAAAVQTLASPVLELTGPVVQSVVQTTGRAIGVRGSHNESPDGMTPPVRW
RSGRRVHFDLDPLLPFPRWHEHAAMVEEPVRRIPGVAEAHVEGSLGRLVVELEPDADSDIAVDEVRDVVSAVAADIFLAG
SVSSPNSAPFADPGNPLAILVPLTAAAMDLVAMGATVTGWVARLPAAPQTTRALAALINHQPRMVSLMESRLGRVGTDIA
LAATTAAANGLTQSLGTPLLDLVQRSLQISEAAAHRRVWRDREPALASPRRPQAPVVPIISSAGAKSQEPRHSWAAAAAG
EASHVVVGGSIDAAIDTAKGSRAGPVEQYVNQAANGSLIAAASALVAGGGTEDAAGAILAGVPRAAHMGRQAFAAVLGRG
LANTGQLVLDPGALRRLDRVRVVVIDGAALRGDNRAVLHAQGDEPGWDDDRVYEVADALLHGEQAPEPDPDELPATGARL
RWAPAQGPSATPAQGLEHADLVVDGQCVGSVDVGWEVDPYAIPLLQTAHRTGARVVLRHVAGTEDLSASVGSTHPPGTPL
LKLVRELRADRGPVLLITAVHRDFASTDTLAALAIADVGVALDDPRGATPWTADLITGTDLAAAVRILSALPVARAASES
AVHLAQGGTTLAGLLLVTGEQDKTTNPASFRRWLNPVNAAAATALVSGMWSAAKVLRMPDPTPQPLTAWHALDPEIVYSR
LAGGSRPLAVEPGIPAWRRILDDLSYEPVMAPLRGPARTLAQLAVATRHELADPLTPILAVGAAASAIVGSNIDALLVAG
VMTVNAITGGVQRLRAEAAAAELFAEQDQLVRRVVVPAVATTRRRLEAARHATRTATVSAKSLRVGDVIDLAAPEVVPAD
ARLLVAEDLEVDESFLTGESLPVDKQVDPVAVNDPDRASMLFEGSTIVAGHARAIVVATGVGTAAHRAISAVADVETAAG
VQARLRELTSKVLPMTLAGGAAVTALALLRRASLRQAVADGVAIAVAAVPEGLPLVATLSQLAAAQRLTARGALVRSPRT
IEALGRVDTICFDKTGTLTENRLRVVCALPSSTAAERDPLPQTTDAPSAEVLRAAARASTQPHNGEGHAHATDEAILAAA
SALAGSLSSQGDSEWVVLAEVPFESSRGYAAAIGRVGTDGIPMLMLKGAPETILPRCRLADPGVDHEHAESVVRHLAEQG
LRVLAVAQRTWDNGTTHDDETDADAVDAVAHDLELIGYVGLADTARSSSRPLIEALLDAERNVVLITGDHPITARAIARQ
LGLPADARVVTGAELAVLDEEAHAKLAADMQVFARVSPEQKVQIVAALQRCGRVTAMVGDGANDAAAIRMADVGIGVSGR
GSSAARGAADIVLTDDDLGVLLDALVEGRSMWAGVRDAVTILVGGNVGEVLFTVIGTAFGAGRAPVGTRQLLLVNLLTDM
FPALAVAVTSQFAEPDDAEYPTDDAAERAQREHRRAVLIGPTPSLDAPLLRQIVNRGVVTAAGATAAWAIGRWTPGTERR
TATMGLTALVMTQLAQTLLTRRHSPLVIATALGSAGVLVGIIQTPVISHFSGVPRWDRSPGRASSAPRQEPPQSQRWHRS
GWQAQSVSCNLMNALTTRKTLTRVDRTYRRPR
>P9WPT7 7.2.2.-~~~ctpJ~~~Probable cation-transporting P-type ATPase J~~~COG2217
MAVRELSPARCTSASPLVLARRTKLFALSEMRWAALALGLFSAGLLTQLCGAPQWVRWALFLACYATGGWEPGLAGLQAL
QRRTLDVDLLMVVAAIGAAAIGQIAEGALLIVIFATSGALEALVTARTADSVRGLMGLAPGTATRVGAGGGEETVNAADL
RIGDIVLVRPGERISADATVLAGGSEVDQATVTGEPLPVDKSIGDQVFAGTVNGTGALRIRVDRLARDSVVARIATLVEQ
ASQTKARTQLFIEKVEQRYSIGMVAVTLAVFAVPPLWGETLQRALLRAMTFMIVASPCAVVLATMPPLLAAIANAGRHGV
LAKSAIVMEQLGTTTRIAFDKTGTLTRGTPELAGIWVYERRFTDDELLRLAAAAEYPSEHPLGAAIVKAAQSRRIRLPTV
GEFTAHPGCRVTARVDGHVIAVGSATALLGTAGAAALEASMITAVDFLQGEGYTVVVVVCDSHPVGLLAITDQLRPEAAA
AISAATKLTGAKPVLLTGDNRATADRLGVQVGIDDVRAGLLPDDKVAAVRQLQAGGARLTVVGDGINDAPALAAAHVGIA
MGSARSELTLQTADAVVVRDDLTTIPTVIAMSRRARRIVVANLIVAVTFIAGLVVWDLAFTLPLPLGVARHEGSTIIVGL
NGLRLLRHTAWRRAAGTAHR
>Q9HUW6 ~~~ctpL~~~Methyl-accepting chemotaxis protein CtpL~~~
MRLKQLTNLNTLLLLTVCLALGITLWWSQRAMERPFQLLDQYLELSQRFDEQVARNIRQYLGSGDAVRQQAALQALESLA
EALPELPPDLARTLAPSLAELREFSAGDLLAAGKLAGDPQGLLLQAERDLTGNLEQWSAYLDAAAGQPQAGAYRTPLLLA
SLHLTRLSLARAKLVESANPALAGDVERELANLREQAGRIEALPLLGVLDEQRSASDDFAAMMGLAGDAEAGAGNAEDRG
VALRRELASLLQRYPDELRRTRDLIERRQQLSADTGARLDAVRQALATLEPQVRGERQRLQGQVRLIQGGMIALILLIAL
AIDSLQRRLARVLGQLVPALSAWADGDFSRPISLRTRTEDLRNLEDSLNRLRSFLAELVGAIHRRAEQVAGSSQTLAEVS
SGLHAGVERQAGDTGQIRDALGDMEAAIQQVAGDASQTADASRSAGQAVEHGQRVIGESLGGLRELVDEVQGNAQSIERL
AEESATIGSVLTVIRSIAEQTNLLALNAAIEAARAGDQGRGFAVVAEEVRSLAQRTAGATEEIQQLIGRLQQAARQSVEA
MRSQVEHAERTAEQAGAAEGALDEVVAAIHTIGVMAERIAEGSTQQSQAVGEIRSHSERIHALGGENLRLIGHSREQGEQ
LRQLGGDLRTTVQAFRL
>Q9I0I6 ~~~ctpM~~~Methyl-accepting chemotaxis protein CtpM~~~
MMRLTLKSKVLLLAMVPVLLFALVLSGGAVLILKKQADAEVKDTRERLLGDRRAELEHYVQIAMGSIQAEYDRSANGDLN
ARAEAIARLSKIKYGKDGYIFGYDSQVVRLFRGDSPVDVGKSFRDRRDPSGVYLNRELVEAGRNGSHYVTYTSPLPGNES
VMVPKLSYTLYLPKWDMVIGSAINLDGVEAQLVEIKQDIDERIGTLIASIVGIAGVLLVVLLVIGLAVANAMLRPLHQIR
QNLDDIAAGEGDLTRRLPVTSYDELGELAGSFNRFVEKIHGLVRQIAGMTGDLKQLVEQMSAQAERSEQAMERQRHETDQ
VATAINEMSAAAHEVAQSAQRAAEAAQQTDHEGQAAKRVVDGSIERIHALVDEIRDSGTSLDSLQQDVQSIVSVLGVIRS
IAEQTNLLALNAAIEAARAGEAGRGFAVVADEVRALASRTQQSTQEIQGMIDRLQQGTNAAVDAMRRSGEAGEGTSNQAN
QAGDSLDAIAQLIATINAMNAQIASAAEEQTAVAEEINRSVHQIAGAVDSVADEAQQGAQTARSLAQLGQGLGRLVGQFR
I
>P9WPS3 7.2.2.8~~~ctpV~~~Probable copper-exporting P-type ATPase V~~~COG2217
MRVCVTGFNVDAVRAVAIEETVSQVTGVHAVHAYPRTASVVIWYSPELGDTAAVLSAITKAQHVPAELVPARAPHSAGVR
GVGVVRKITGGIRRMLSRPPGVDKPLKASRCGGRPRGPVRGSASWPGEQNRRERRTWLPRVWLALPLGLLALGSSMFFGA
YPWAGWLAFAATLPVQFVAGWPILRGAVQQARALTSNMDTLIALGTLTAFVYSTYQLFAGGPLFFDTSALIIAFVVLGRH
LEARATGKASEAISKLLELGAKEATLLVDGQELLVPVDQVQVGDLVRVRPGEKIPVDGEVTDGRAAVDESMLTGESVPVE
KTAGDRVAGATVNLDGLLTVRATAVGADTALAQIVRLVEQAQGDKAPVQRLADRVSAVFVPAVIGVAVATFAGWTLIAAN
PVAGMTAAVAVLIIACPCALGLATPTAIMVGTGRGAELGILVKGGEVLEASKKIDTVVFDKTGTLTRARMRVTDVIAGQR
RQPDQVLRLAAAVESGSEHPIGAAIVAAAHERGLAIPAANAFTAVAGHGVRAQVNGGPVVVGRRKLVDEQHLVLPDHLAA
AAVEQEERGRTAVFVGQDGQVVGVLAVADTVKDDAADVVGRLHAMGLQVAMITGDNARTAAAIAKQVGIEKVLAEVLPQD
KVAEVRRLQDQGRVVAMVGDGVNDAPALVQADLGIAIGTGTDVAIEASDITLMSGRLDGVVRAIELSRQTLRTIYQNLGW
AFGYNTAAIPLAALGALNPVVAGAAMGFSSVSVVTNSLRLRRFGRDGRTA
>Q2YQA4 ~~~ctrA~~~Cell cycle response regulator CtrA~~~
MRVLLIEDDSAIAQSIELMLKSESFNVYTTDLGEEGIDLGKLYDYDIILLDLNLPDMSGYEVLRTLRLSKVKTPILILSG
MAGIEDKVRGLGFGADDYMTKPFHKDELIARIHAIVRRSKGHAQSVITTGDLVVNLDAKTVEVAGQRVHLTGKEYQMLEL
LSLRKGTTLTKEMFLNHLYGGMDEPELKIIDVFICKLRKKLDAVSGNQSYIETVWGRGYVLREPDAEMRESA
>Q9ZHS1 ~~~ctrA~~~Cell cycle response regulator CtrA~~~
MRVLLIEDDSAIAQSIELMLKSESFNVYTTDLGEEGIDLGKLYDYDIILLDLNLPDMSGYEVLRTLRLSKVKTPILILSG
MAGIEDKVRGLGFGADDYMTKPFHKDELIARIHAIVRRSKGHAQSVITTGDLVVNLDAKTVEVAGQRVHLTGKEYQMLEL
LSLRKGTTLTKEMFLNHLYGGMDEPELKIIDVFICKLRKKLDAVSGNQSYIETVWGRGYVLREPDAEMRESA
>Q7CNV1 ~~~ctrA~~~Cell cycle response regulator CtrA~~~COG0745
MRVLLIEDDSAIAQSIELMLKSESFNVYTTDLGEEGIDLGKLYDYDIILLDLNLPDMSGYEVLRTLRLSKVKTPILILSG
MAGIEDKVRGLGFGADDYMTKPFHKDELIARIHAIVRRSKGHAQSVITTGDLVVNLDAKTVEVAGQRVHLTGKEYQMLEL
LSLRKGTTLTKEMFLNHLYGGMDEPELKIIDVFICKLRKKLDAVSGNQSYIETVWGRGYVLREPDAEMRESA
>A5VRW9 ~~~ctrA~~~Cell cycle response regulator CtrA~~~
MRVLLIEDDSAIAQSIELMLKSESFNVYTTDLGEEGIDLGKLYDYDIILLDLNLPDMSGYEVLRTLRLSKVKTPILILSG
MAGIEDKVRGLGFGADDYMTKPFHKDELIARIHAIVRRSKGHAQSVITTGDLVVNLDAKTVEVAGQRVHLTGKEYQMLEL
LSLRKGTTLTKEMFLNHLYGGMDEPELKIIDVFICKLRKKLDAVSGNQSYIETVWGRGYVLREPDAEMRESA
>Q8FZ93 ~~~ctrA~~~Cell cycle response regulator CtrA~~~
MRVLLIEDDSAIAQSIELMLKSESFNVYTTDLGEEGIDLGKLYDYDIILLDLNLPDMSGYEVLRTLRLSKVKTPILILSG
MAGIEDKVRGLGFGADDYMTKPFHKDELIARIHAIVRRSKGHAQSVITTGDLVVNLDAKTVEVAGQRVHLTGKEYQMLEL
LSLRKGTTLTKEMFLNHLYGGMDEPELKIIDVFICKLRKKLDAVSGNQSYIETVWGRGYVLREPDAEMRESA
>P37568 ~~~ctsR~~~Transcriptional regulator CtsR~~~COG4463
MGHNISDIIEQYLKRVLDQNGKEILEIKRSEIADKFQCVPSQINYVINTRFTSERGYIVESKRGGGGYIRIIKIKMNNEV
VLINNIISQINTHLSQAASDDIILRLLEDKVISEREAKMMVSVMDRSVLHIDLPERDELRARMMKAMLTSLKLK
>C3W947 ~~~ctsr~~~Transcriptional regulator CtsR~~~
MPNISDIIEQYLKQVLNMSDQDIVEIKRSEIANKFRCVPSQINYVINTRFTLERGYIVESKRGGGGYIRIMKVKTKSEAQ
LIDQLLELIDHRISQSSAEDVIKRLMEEKVISEREAKMMLSVMDRSVLYIDLPERDELRARMLKAMLTSLKYK
>Q88XZ6 ~~~ctsR~~~Transcriptional regulator CtsR~~~COG4463
MQSQNISDIIEKYLKSILADSEHVEIRRSEIADLFNVVPSQINYVIKTRFTIQNGYLVESKRGGGGYIRIEKVNLVDDAD
VLDALIQVIGDSITQRDAYAVVQSLYEDDVLNRREAQLILVAIDHETLGLTDRDLENSLRARIIIGILNHLRYES
>Q2G0P8 ~~~ctsR~~~Transcriptional regulator CtsR~~~COG4463
MHNMSDIIEQYIKRLFEESNEDVVEIQRANIAQRFDCVPSQLNYVIKTRFTNEHGYEIESKRGGGGYIRITKIENKDATG
YINHLLQLIGPSISQQQAYYIIDGLLDKMLINEREAKMIQAVIDRETLSMDMVSRDIIRANILKRLLPVINYY
>Q7A799 ~~~ctsR~~~Transcriptional regulator CtsR~~~
MHNMSDIIEQYIKRLFEESNEDVVEIQRANIAQRFDCVPSQLNYVIKTRFTNEHGYEIESKRGGGGYIRITKIENKDATG
YINHLLQLIGPSISQQQAYYIIDGLLDKMLINEREAKMIQAVIDRETLSMDMVSRDIIRANILKRLLPVINYY
>Q5NHL7 3.5.1.20~~~ctu~~~Citrullinase~~~
MANIKVAVVQLSFNDNEAENLAKLESKIIQAAKNGAKIILTPELPSYLYFCKKQNSKYFDLAKTIDESPIVKLYKLLAHK
YNIVLPASFFERDGNACYNSIAMIDADGSIMGVYRKAHIPDGIGYQEKYYFSPGSAGFKVWDTKYAKVGVGICWDQWFPE
AARVMALKGAEILLYPTAIGSEPHLPDYDSKDHWQRVMQGHAAANMLPVLASNRYATEANDDITATYYGSSFITDHTGDK
IAEADRSGDDILYATFDFAELQQQRFYWGLFRDRRPELYDEIVRKY
>P14608 ~~~ctx~~~Cytotoxin~~~
MNDIDTITNAWGRWKTAQYGTTCWFTESTQYGRNKDTRSYMQHQTNVSAPKDLVYSNFVQQDGGSTLLGQYDMINEGSQV
IELAVNLQQGLVDTFTWSVTEQLKVGVEVKVKANIPLVGGAEITSTVELSLSSTQGASTSKSSNYGASTKVLISPHSHGW
GEVALSFTELRTQWVGNVGLQGYVAIWFNNKVALNNDGDYHYLWFIPVEQVFWECVQHNIVNTSGYVVQGNGVLAQATGT
FHSSVGLNLKTIAHERPYPETSEAVRTFYNYASLVPDLETRVRSAE
>C0HK82 ~~~~~~Bacteriocin curvaticin DN317~~~
IPYGGNGVHHGKAGDSXTVDTAIGNIGNNAASIIGGMISGWASGLAG
>P36649 1.16.3.4~~~cueO~~~Multicopper oxidase CueO~~~COG2132
MQRRDFLKYSVALGVASALPLWSRAVFAAERPTLPIPDLLTTDARNRIQLTIGAGQSTFGGKTATTWGYNGNLLGPAVKL
QRGKAVTVDIYNQLTEETTLHWHGLEVPGEVDGGPQGIIPPGGKRSVTLNVDQPAATCWFHPHQHGKTGRQVAMGLAGLV
VIEDDEILKLMLPKQWGIDDVPVIVQDKKFSADGQIDYQLDVMTAAVGWFGDTLLTNGAIYPQHAAPRGWLRLRLLNGCN
ARSLNFATSDNRPLYVIASDGGLLPEPVKVSELPVLMGERFEVLVEVNDNKPFDLVTLPVSQMGMAIAPFDKPHPVMRIQ
PIAISASGALPDTLSSLPALPSLEGLTVRKLQLSMDPMLDMMGMQMLMEKYGDQAMAGMDHSQMMGHMGHGNMNHMNHGG
KFDFHHANKINGQAFDMNKPMFAAAKGQYERWVISGVGDMMLHPFHIHGTQFRILSENGKPPAAHRAGWKDTVKVEGNVS
EVLVKFNHDAPKEHAYMAHCHLLEHEDTGMMLGFTV
>Q8ZRS2 1.16.3.4~~~cueO~~~Multicopper oxidase CueO~~~
MLRRDFLKYSVALGVASALPLWSRAAFAAERPALPIPDLLTADASNRMQLIVKAGQSTFAGKNATTWGYNGNLLGPAVQL
HKGKSVTVDIHNQLAEDTTLHWHGLEIPGIVDGGPQGIIPAGGTRTVTFTPEQRAATCWIHPHKHGKTGRQVAMGLAGLV
LIEDDEIRKLRLPKQWGIDDVPVIIQDKRFSADGQIDYQLDIMTAAVGWFGDTLLTNGAIYPQHSAPKGWLRLRLLNGCN
ARSLNIAASDNRPLYVIASDGGLLAEPVKVTELPLLMGERFEVLVDISDGKAFDLVTLPVSQMGMAIAPFDKPHPVMRIQ
PLRITASGTLPDTLTTMPALPSLEGLTVRNLKLSMDPRLDMMGMQMLMKKYGAQAMSGMDHDSMNAHMQGGNMGHGEMDH
GNMDHSGMNHGAMGNMNHGGKFDFHNANFINGQVFDMNKPMFAAQKGRHERWVISGVGDMMLHPFHIHGTQFRILSENGK
APAAHRTGWKDTVRVEGGISEVLVKFDHDAPKEHAYMAHCHLLEHEDTGMMLGFTV
>P0A9G4 ~~~cueR~~~HTH-type transcriptional regulator CueR~~~COG0789
MNISDVAKITGLTSKAIRFYEEKGLVTPPMRSENGYRTYTQQHLNELTLLRQARQVGFNLEESGELVNLFNDPQRHSADV
KRRTLEKVAEIERHIEELQSMRDQLLALANACPGDDSADCPIIENLSGCCHHRAG
>Q93CH6 ~~~cueR~~~HTH-type transcriptional regulator CueR~~~
MNISDVAKKTGLTSKAIRFYEEKGLVTPPLRSENGYRTYTQKHLNELTLLRQARQVGFNLEECGELVNLFNDPRRHSADV
KKRTLEKVAEIERHISELQSMRDQLLALAESCPGDDSADCPIIDNLSGCCHHKAQKPR
>P74285 2.7.7.9~~~cugP~~~UTP--glucose-1-phosphate uridylyltransferase~~~COG1208
MKAMILAAGKGTRVRPITHTIPKPMIPILQKPVMEFLLELLRQHGFDQIMVNVSHLAEEIESYFRDGQRFGVQIAYSFEG
NIVDGDLVGKALGSAGGLKKIQEFNPFFDDTFVVLCGDALIDLDLTTAVKLHREKGAIATIITKTVPQELVSSYGVVVTD
DNGKILTFQEKPAVEEALSTEINTGIYIFEPEVIDYIPSGQEYDLGGDLFPKLVDSGLPFYAVNMDFEWVDIGKVPDYWQ
AIRGVLSREIKNVQIPGIEVRPGVYTGINVAANWDNIEIEGPVYIGGMTRIEDGVKIIGPSMIGPSCLICQGAVVDNSVI
FEYSRLGPGARLVDKLVFGRYCVDKTGAAIDVQAAALDWLITDARHAAVQYRQEYPSQREISKLLQPE
>P9WP43 3.1.1.-~~~cut7~~~Carboxylesterase Culp1~~~
MTPRSLVRIVGVVVATTLALVSAPAGGRAAHADPCSDIAVVFARGTHQASGLGDVGEAFVDSLTSQVGGRSIGVYAVNYP
ASDDYRASASNGSDDASAHIQRTVASCPNTRIVLGGYSQGATVIDLSTSAMPPAVADHVAAVALFGEPSSGFSSMLWGGG
SLPTIGPLYSSKTINLCAPDDPICTGGGNIMAHVSYVQSGMTSQAATFAANRLDHAG
>P9WP41 3.1.1.-~~~cut2~~~Probable carboxylesterase Culp2~~~
MNDLLTRRLLTMGAAAAMLAAVLLLTPITVPAGYPGAVAPATAACPDAEVVFARGRFEPPGIGTVGNAFVSALRSKVNKN
VGVYAVKYPADNQIDVGANDMSAHIQSMANSCPNTRLVPGGYSLGAAVTDVVLAVPTQMWGFTNPLPPGSDEHIAAVALF
GNGSQWVGPITNFSPAYNDRTIELCHGDDPVCHPADPNTWEANWPQHLAGAYVSSGMVNQAADFVAGKLQ
>P9WP39 3.1.1.-~~~cut3~~~Probable carboxylesterase Culp3~~~
MNNRPIRLLTSGRAGLGAGALITAVVLLIALGAVWTPVAFADGCPDAEVTFARGTGEPPGIGRVGQAFVDSLRQQTGMEI
GVYPVNYAASRLQLHGGDGANDAISHIKSMASSCPNTKLVLGGYSQGATVIDIVAGVPLGSISFGSPLPAAYADNVAAVA
VFGNPSNRAGGSLSSLSPLFGSKAIDLCNPTDPICHVGPGNEFSGHIDGYIPTYTTQAASFVVQRLRAGSVPHLPGSVPQ
LPGSVLQMPGTAAPAPESLHGR
>O06319 3.1.1.-~~~cut4~~~Phospholipase Culp4~~~
MIPRPQPHSGRWRAGAARRLTSLVAAAFAAATLLLTPALAPPASAGCPDAEVVFARGTGEPPGLGRVGQAFVSSLRQQTN
KSIGTYGVNYPANGDFLAAADGANDASDHIQQMASACRATRLVLGGYSQGAAVIDIVTAAPLPGLGFTQPLPPAADDHIA
AIALFGNPSGRAGGLMSALTPQFGSKTINLCNNGDPICSDGNRWRAHLGYVPGMTNQAARFVASRI
>O53581 3.1.1.-~~~cut6~~~Carboxylesterase/lipase Culp6~~~COG4223
MAKNSRRKRHRILAWIAAGAMASVVALVIVAVVIMLRGAESPPSAVPPGVLPPGPTPAHPHKPRPAFQDASCPDVQMISV
PGTWESSPQQNPLNPVQFPKALLLKVTGPIAQQFAPARVQTYTVAYTAQFHNPLTTDNQMSYNDSRAEGTRAMVAAMTDM
NNRCPLTSYVLIGFSQGAVIAGDVASDIGNGRGPVDEDLVLGVTLIADGRRQQGVGNQVPPSPRGEGAEITLHEVPVLSG
LGLTMTGPRPGGFGALDGRTNEICAQGDLICAAPAQAFSPANLPTTLNTLAGGAGQPVHAMYATPEFWNSDGEPATEWTL
NWAHQLIENAPHPKHR
>Q79FA4 3.1.1.-~~~cut5b~~~Probable carboxylesterase Culp7~~~
MAPGSHLVLAASEDCSSTHCVSQVGAKSLGVYAVNYPASNDFASSDFPKTVIDGIRDAGSHIQSMAMSCPQTRQVLGGYS
QGAAVAGYVTSAVVPPAVPVQAVPAPMAPEVANHVAAVTLFGAPSAQFLGQYGAPPIAIGPLYQPKTLQLCADGDSICGD
GNSPVAHGLYAVNGMVGQGANFAASRL
>Q8NLR5 3.1.1.-~~~~~~Carboxylesterase Culp6 homolog~~~COG4223
MRKTITVIAVLIVLALIGVGIVQYVNTSDDSDFIGQPGEPTGTETTEPPVQPDWCPAVEVIAAPGTWESAANDDPINPTA
NPLSFMLSITQPLQERYSADDVKVWTLPYTAQFRNINSQNEMSYDDSRNEGTAKMNEELINTHNECPATEFIIVGFSQGA
VIAGDVAAQIGSEQGVIPADSVRGVALIADGRREPGVGQFPGTFVDGIGAEVTLQPLNLLVQPIVPGATMRGGRAGGFGV
LNDRVQDICAPNDAICDAPVNVGNALDRALAMVSANGVHALYATNPDVFPGTTTNAWVVDWATNLIDNG
>A0R619 3.1.1.-~~~~~~Carboxylesterase Culp6 homolog~~~COG4223
MAKNARRKRHRILALIAAAAMALVVVLVVTIVVVIMRRPDTPATPPPSAEPPGGVVVPPGTRKPRPEFQSADCPDVMMVS
IPGTWESSPTDDPFNPTQFPLSLMSNISKPLAEQFGPDRLQVYTTPYTAQFHNPFAADKQMSYNDSRAEGMRTTVKAMTD
MNDRCPLTSYVIAGFSQGAVIAGDIASDIGNGRGPVDEDLVLGVTLIADGRRQMGVGQDVGPNPAGQGAEITLHEVPALS
ALGLTMTGPRPGGFGALDNRTNQICGSGDLICSAPEQAFSVFNLPKTLETLSGSAAGPVHALYNTPQFWVENGQTATQWT
LEWARNLVENAPHPKHG
>Q2RSU5 2.8.4.6~~~cupin~~~1-methylthio-D-xylulose 5-phosphate methylsulfurylase~~~COG0662
MSDSNDDRPFRPFQSQYRWPGVDLLAYKEEGSAPFRSVTRQVLFSGNGLTGELRYFEVGPGGHSTLERHQHAHGVMILKG
RGHAMVGRAVSAVAPYDLVTIPGWSWHQFRAPADEALGFLCMVNAERDKPQLPTEADLAMLRADDAVAAFLDGLAG
>Q0RYE0 ~~~~~~Cupin 2 conserved barrel domain-containing protein~~~
MTTDSVTQGVAHSVNGRLPELDFENRPSGAKLGIFDLPKLEVSVAPFTLAHIRVPGGVTTAEDHHEVREIWLVQSGSGIL
TLDGVRSRVRAGDTLYYESYRRHQLHNDGDSPVEIVSIWWRP
>P76113 1.3.1.n3~~~curA~~~NADPH-dependent curcumin reductase~~~COG2130
MGQQKQRNRRWVLASRPHGAPVPENFRLEEDDVATPGEGQVLLRTVYLSLDPYMRGRMSDEPSYSPPVDIGGVMVGGTVS
RVVESNHPDYQSGDWVLGYSGWQDYDISSGDDLVKLGDHPQNPSWSLGVLGMPGFTAYMGLLDIGQPKEGETLVVAAATG
PVGATVGQIGKLKGCRVVGVAGGAEKCRHATEVLGFDVCLDHHADDFAEQLAKACPKGIDIYYENVGGKVFDAVLPLLNT
SARIPVCGLVSSYNATELPPGPDRLPLLMATVLKKRIRLQGFIIAQDYGHRIHEFQREMGQWVKEDKIHYREEITDGLEN
APQTFIGLLKGKNFGKVVIRVAGDD
>P38054 ~~~cusA~~~Cation efflux system protein CusA~~~COG3696
MIEWIIRRSVANRFLVLMGALFLSIWGTWTIINTPVDALPDLSDVQVIIKTSYPGQAPQIVENQVTYPLTTTMLSVPGAK
TVRGFSQFGDSYVYVIFEDGTDPYWARSRVLEYLNQVQGKLPAGVSAELGPDATGVGWIYEYALVDRSGKHDLADLRSLQ
DWFLKYELKTIPDVAEVASVGGVVKEYQVVIDPQRLAQYGISLAEVKSALDASNQEAGGSSIELAEAEYMVRASGYLQTL
DDFNHIVLKASENGVPVYLRDVAKVQIGPEMRRGIAELNGEGEVAGGVVILRSGKNAREVIAAVKDKLETLKSSLPEGVE
IVTTYDRSQLIDRAIDNLSGKLLEEFIVVAVVCALFLWHVRSALVAIISLPLGLCIAFIVMHFQGLNANIMSLGGIAIAV
GAMVDAAIVMIENAHKRLEEWQHQHPDATLDNKTRWQVITDASVEVGPALFISLLIITLSFIPIFTLEGQEGRLFGPLAF
TKTYAMAGAALLAIVVIPILMGYWIRGKIPPESSNPLNRFLIRVYHPLLLKVLHWPKTTLLVAALSVLTVLWPLNKVGGE
FLPQINEGDLLYMPSTLPGISAAEAASMLQKTDKLIMSVPEVARVFGKTGKAETATDSAPLEMVETTIQLKPQEQWRPGM
TMDKIIEELDNTVRLPGLANLWVPPIRNRIDMLSTGIKSPIGIKVSGTVLADIDAMAEQIEEVARTVPGVASALAERLEG
GRYINVEINREKAARYGMTVADVQLFVTSAVGGAMVGETVEGIARYPINLRYPQSWRDSPQALRQLPILTPMKQQITLAD
VADIKVSTGPSMLKTENARPTSWIYIDARDRDMVSVVHDLQKAIAEKVQLKPGTSVAFSGQFELLERANHKLKLMVPMTL
MIIFVLLYLAFRRVGEALLIISSVPFALVGGIWLLWWMGFHLSVATGTGFIALAGVAAEFGVVMLMYLRHAIEAVPSLNN
PQTFSEQKLDEALYHGAVLRVRPKAMTVAVIIAGLLPILWGTGAGSEVMSRIAAPMIGGMITAPLLSLFIIPAAYKLMWL
HRHRVRK
>P77239 ~~~cusB~~~Cation efflux system protein CusB~~~COG0845
MKKIALIIGSMIAGGIISAAGFTWVAKAEPPAEKTSTAERKILFWYDPMYPNTRFDKPGKSPFMDMDLVPKYADEESSAS
GVRIDPTQTQNLGVKTATVTRGPLTFAQSFPANVSYNEYQYAIVQARAAGFIDKVYPLTVGDKVQKGTPLLDLTIPDWVE
AQSEYLLLRETGGTATQTEGILERLRLAGMPEADIRRLIATQKIQTRFTLKAPIDGVITAFDLRAGMNIAKDNVVAKIQG
MDPVWVTAAIPESIAWLVKDASQFTLTVPARPDKTLTIRKWTLLPGVDAATRTLQLRLEVDNADEALKPGMNAWLQLNTA
SEPMLLIPSQALIDTGSEQRVITVDADGRFVPKRVAVFQASQGVTALRSGLAEGEKVVSSGLFLIDSEANISGALERMRS
ESATHAH
>P77211 ~~~cusC~~~Cation efflux system protein CusC~~~COG1538
MSPCKLLPFCVALALTGCSLAPDYQRPAMPVPQQFSLSQNGLVNAADNYQNAGWRTFFVDNQVKTLISEALVNNRDLRMA
TLKVQEARAQYRLTDADRYPQLNGEGSGSWSGNLKGNTATTREFSTGLNASFDLDFFGRLKNMSEAERQNYLATEEAQRA
VHILLVSNVAQSYFNQQLAYAQLQIAEETLRNYQQSYAFVEKQLLTGSSNVLALEQARGVIESTRSDIAKRQGELAQANN
ALQLLLGSYGKLPQAQTVNSDSLQSVKLPAGLSSQILLQRPDIMEAEHALMAANANIGAARAAFFPSISLTSGISTASSD
LSSLFNASSGMWNFIPKIEIPIFNAGRNQANLDIAEIRQQQSVVNYEQKIQNAFKEVADALALRQSLNDQISAQQRYLAS
LQITLQRARALYQHGAVSYLEVLDAERSLFATRQTLLDLNYARQVNEISLYTALGGG
>P77214 ~~~cusF~~~Cation efflux system protein CusF~~~COG5569
MKKALQVAMFSLFTVIGFNAQANEHHHETMSEAQPQVISATGVVKGIDLESKKITIHHDPIAAVNWPEMTMRFTITPQTK
MSEIKTGDKVAFNFVQQGNLSLLQDIKVSQ
>P0ACZ8 ~~~cusR~~~Transcriptional regulatory protein CusR~~~COG0745
MKLLIVEDEKKTGEYLTKGLTEAGFVVDLADNGLNGYHLAMTGDYDLIILDIMLPDVNGWDIVRMLRSANKGMPILLLTA
LGTIEHRVKGLELGADDYLVKPFAFAELLARVRTLLRRGAAVIIESQFQVADLMVDLVSRKVTRSGTRITLTSKEFTLLE
FFLRHQGEVLPRSLIASQVWDMNFDSDTNAIDVAVKRLRGKIDNDFEPKLIQTVRGVGYMLEVPDGQ
>P77485 2.7.13.3~~~cusS~~~Sensor histidine kinase CusS~~~COG5002
MVSKPFQRPFSLATRLTFFISLATIAAFFAFAWIMIHSVKVHFAEQDINDLKEISATLERVLNHPDETQARRLMTLEDIV
SGYSNVLISLADSQGKTVYHSPGAPDIREFTRDAIPDKDAQGGEVYLLSGPTMMMPGHGHGHMEHSNWRMINLPVGPLVD
GKPIYTLYIALSIDFHLHYINDLMNKLIMTASVISILIVFIVLLAVHKGHAPIRSVSRQIQNITSKDLDVRLDPQTVPIE
LEQLVLSFNHMIERIEDVFTRQSNFSADIAHEIRTPITNLITQTEIALSQSRSQKELEDVLYSNLEELTRMAKMVSDMLF
LAQADNNQLIPEKKMLNLADEVGKVFDFFEALAEDRGVELRFVGDKCQVAGDPLMLRRALSNLLSNALRYTPTGETIVVR
CQTVDHLVQVIVENPGTPIAPEHLPRLFDRFYRVDPSRQRKGEGSGIGLAIVKSIVVAHKGTVAVTSDARGTRFVITLPA
>P69488 ~~~cutA~~~Divalent-cation tolerance protein CutA~~~COG1324
MLDEKSSNTASVVVLCTAPDEATAQDLAAKVLAEKLAACATLIPGATSLYYWEGKLEQEYEVQMILKTTVSHQQALLECL
KSHHPYQTPELLVLPVTHGDTDYLSWLNASLR
>Q7CPA2 ~~~cutA~~~Divalent-cation tolerance protein CutA~~~
MLDVKSQDISIPEAVVVLCTAPDEATAQDLAAKVLAEKLAACATLLPGATSLYYWEGKLEQEYEVQMILKTTVSHQQALI
DCLKSHHPYQTPELLVLPVTHGDTDYLSWLNASLR
>Q9X0E6 ~~~cutA~~~Divalent-cation tolerance protein CutA~~~COG1324
MILVYSTFPNEEKALEIGRKLLEKRLIACFNAFEIRSGYWWKGEIVQDKEWAAIFKTTEEKEKELYEELRKLHPYETPAI
FTLKVENVLTEYMNWLRESVL
>Q7SIA8 ~~~cutA~~~Divalent-cation tolerance protein CutA~~~COG1324
MEEVVLITVPSEEVARTIAKALVEERLAACVNIVPGLTSIYRWQGEVVEDQELLLLVKTTTHAFPKLKERVKALHPYTVP
EIVALPIAEGNREYLDWLRENTG
>Q74XD3 ~~~cutA~~~Divalent-cation tolerance protein CutA~~~COG1324
MSDSDAMTDPNAVSYSNAIVVLCTAPDEASAQNLAAQVLGEKLAACVTLLPGATSLYYWEGKLEQEYEVQLLFKSNTDHQ
QALLTYIKQHHPYQTPELLVLPVRDGDKDYLSWLNASLL
>Q830V2 ~~~cutC~~~PF03932 family protein CutC~~~COG3142
MIKEFCAENFTKIPQAIQKGANRIELCDNLAVGGTTPSTGVIEEVLAYAGEHSVPVMTIIRPRGGNFVYNDIELKIMHTD
LIEAKKLGTDGIVIGCLTEDGWLDEEALDLFIETAEGLQITFHMAFDALSKENQFKAIDWLAERGVTRILTHGGPAGTPI
EDNFDHLKELIVYADQRILILPGGGISTENVQTVMDTLKVTEVHGTKIV
>Q30W70 4.3.99.4~~~cutC~~~Choline trimethylamine-lyase~~~COG1882
MDLQDFSHKLAEATKNLTPAERASLKKIFEGVSAEVFSQPAPVSAVATGAESGIPDGPTPRHVKLKENFLKQVPSITVQR
AVAITKIAKENPGLPKPLLRAKTFRYCCETAPLVIQDHELIVGSPNGAPRAGAFSPEVAWRWLQDELDTIGSRPQDPFYI
SEEDKKVLREEVFPFWQNKSVDEFCEGQYREADLWEMSGESFVSDCSYHAVNGGGDSNPGYDVILMKKGMLDIQREAREK
LEQLDYANPEDIDKIYFYKSVIETAEGVMIYARRLSAYAAELAARETDPRRKAELQKISEVNARVPAHAPSNFWEAIQAV
WTVESLLVVEENQTGMSIGRVDQYMYPFYRADIDSGRLTEYEAFDLAGCMLVKMSEMMWITSEGASKFFAGYQPFVNMCV
GGVTREGHDATNDLTYMLMDAVRHVRIYQPTLATRVHNKSPQKYLKKIVDVIRSGMGFPAVHFDDAHIKMMLAKGVSIED
ARDYCLMGCVEPQKSGRLYQWTSTGYTQWPICIELVLNHGVPLWYGKKVTPDMGDLSQYDTYEKFEAAVKEQIRWITKNT
SVATVISQRAHRELAPKPLMSLMYEGCMESGRDVSAGGAMYNFGPGVVWSGLATYVDSMAAIKKLVYDDRKYTLAQLNEA
LKADFAGYDQILADCLAAPKYGNDDDYADMIAADLVHFTETEHRKYKTLYSVLSHGTLSISNNTPFGQLLGASANGRRAW
MPLSDGISPTQGADYKGPTAIIKSVSKMANDNMNIGMVHNFKLMSGLLDTPEGENGLITLIRTACMLGNGEMQFNYLDNE
LLLDAQKHPEKYRDLVVRVAGYSAFFVELCKDVQDEIISRTMLHGF
>P67825 ~~~cutC~~~PF03932 family protein CutC~~~
MALLEICCYSMECALTAQQNGADRVELCAAPKEGGLTPSLGVLKSVRQRVTIPVHPIIRPRGGDFCYSDGEFAAILEDVR
TVRELGFPGLVTGVLDVDGNVDMPRMEKIMAAAGPLAVTFHRAFDMCANPLYTLNNLAELGIARVLTSGQKSDALQGLSK
IMELIAHRDAPIIMAGAGVRAENLHHFLDAGVLEVHSSAGAWQASPMRYRNQGLSMSSDEHADEYSRYIVDGAAVAEMKG
IIERHQAK
>Q5XDL5 ~~~cutC~~~PF03932 family protein CutC~~~
MIKEFCAENLTLLPTLDAGQVSRVELCDNLAVGGTTPSYGVIKEACQLLHDKKISVATMIRPRGGDFVYNDLELKAMEED
ILKAVEAGSDALVLGLLTTENQLDTDAIEQLLPATQGLPLVFHMAFDHIPTDHQHQALDQLIDYGFVRVLTHGSPEATPI
TDNVEQLKSLVTYANKRIEIMIGGGVTAENCQNLSQLTGTAIVHGTKII
>Q30W71 1.97.1.-~~~cutD~~~Choline trimethylamine-lyase activating enzyme~~~COG1180
MIERKALIFNIQKYNMYDGPGVRTLVFFKGCPLRCKWCSNPEGQLRQYQVLYKENLCVHCGACVPVCPAGVHTISASTLR
HGFAEGAQCIGCRRCEDVCPSSALAVVGEQKTISELLEVIEEDRPFYETSGGGVTLGGGEVLMQPEAAVNLLAACKQHGI
NTAIETCGYAKQEVVMKAAQYVDLFLYDVKHIDSARHYELTGVRNELILSNLTWLLENKHNVKIRVPLLRGVNDSEDDLR
GLVEYLRPYQDYKNFKGIDLLPYHKMGVGKYKQLGWEYPIEGNPALSDADLERVEACIRKYDFPVSVIRH
>A6WFI5 3.1.1.74~~~cut~~~Cutinase~~~
MLRARPSHRLASAAAVVAATGAALLAGSSPAAAATCSDVDVVFARGTGETPGLGVVGGPFVRSLTGELSDRTVTSHAVDY
AASSSQASAGPGATAMSAHVREVAAACPSTRFVLGGYSQGATVTDIALGIRTGTTTGTPVPAELAGRVAAVVVFGNPLGL
SGRTIATASSTYGPKSKDYCNSSDSVCGSAPKTGTGGHLSYASNGSTTDGARFAAGLVRAAGTPTTPTPTPTPTPVPTTC
VRDSTRDHVAADRAVSLYGRAYARGSRDSLGATSSYNVVSLQQVEGGWRLVTAC
>A0A0E2IV13 ~~~cutR~~~Bacterial microcompartment shell protein CutR~~~
MIEELGKIDRIIQESVPGKQITLAHVIAAPIEAVYECLGVDHEGAIGVVSLTPNETAIIAADIAGKAANIDICFVDRFTG
SVMFSGDIQSVETSLEDILEYFKNSLGFSTVPLTKS
>A3SQG3 4.4.1.25~~~cuyA~~~L-cysteate sulfo-lyase~~~COG2515
MHLARFPRRFIAHLPTPLERLDRLSAELGGPEIWIKRDDCTGLSTGGNKTRKLEFLMAEAELQGAEIVMTQGATQSNHAR
QTAAFAAKLGMKCHILLEDRTGSNEANYNHNGNVLLDHLHGATTEKRPGGGDMNAEMEKLADEWRADGKKVYTIPGGGSN
PTGALGYVNCAFELLAQANDGGLKIDHIVHATGSAGTQAGLITGLKAMNAQIPLLGIGVRAPKPKQEENVYNLACATAEK
LGCPGVVAREDVVANTDYVGQGYGIPTESGMEAIKMFAELESILLDPVYSAKGAAGFIDLIRKGHFKKGERVVFLHTGGA
AALFGYDGAFDFSSRWVG
>Q5LL69 4.4.1.25~~~cuyA~~~L-cysteate sulfo-lyase~~~COG2515
MHLARYPRRFIAHLPTPLERLDRLTAELGGPEIWIKRDDCTGLSTGGNKTRKLEFLMAEAELQGADMVMTQGATQSNHAR
QTAAFAAKLGMDCHILLEDRTGSNNANYNNNGNVLLDHLHGATTEKRPGSGLDMNAEMEKVAEKFRADGRKVYTIPGGGS
NPTGALGYVNCAFEMLNQFNERGLKVDHIVHATGSAGTQAGLITGLQAMNAQIPLLGIGVRAPKPKQEENVYNLACATAE
KLGCPGVVAREDVVANTDYVGEGYGIPTESGLEAIRMFAELEAILLDPVYSAKGAAGFIDLIRKGHFKKGERVVFLHTGG
AVALFGYDNAFDYSGRWVA
>P22519 ~~~cvaA~~~Colicin V secretion protein CvaA~~~
MKWQGRAILLPGIPLWLIMLGSIVFITAFLMFIIVGTYSRRVNVSGEVTTWPRAVNIYSGVQGFVVRQFVHEGQLIKKGD
PVYLIDISKSTRNGIVTDNHRRDIENQLVRVDNIISRLEESKKITLDTLEKQRLQYTDAFRRSSDIIQRAEEGIKIMKNN
MENYRYYQSKGLINKDQLTNQVALYYQQQNNLLSLSGQNEQNALQITTLESQIQTQAADFDNRIYQMELQRLELQKELVN
TDVEGEIIIRALSDGKVDSLSVTVGQMVNTGDSLLQVIPENIENYYLILWVPNDAVPYISAGDKVNIRYEAFPSEKFGQF
SATVKTISRTPASTQEMLTYKGAPQNTPGASVPWYKVIATPEKQIIRYDEKYLPLENGMKAESTLFLEKRRIYQWMLSPF
YDMKHSATGPIND
>Q2FYP3 ~~~cvfB~~~Conserved virulence factor B~~~COG2996
MALDKDIVGSIEFLEVVGLQGSTYLLKGPNGENVKLNQSEMNDDDELEVGEEYSFFIYPNRSGELFATQNMPDITKDKYD
FAKVLKTDRDGARIDVGLPREVLVPWEDLPKVKSLWPQPGDYLLVTLRIDRENHMYGRLASESVVENMFTPVHDDNLKNE
VIEAKPYRVLRIGSFLLSESGYKIFVHESERKAEPRLGESVQVRIIGHNDKGELNGSFLPLAHERLDDDGQVIFDLLVEY
DGELPFWDKSSPEAIKEVFNMSKGSFKRAIGHLYKQKIINIETGKIALTKKGWSRMDSKE
>Q7A5Q1 ~~~cvfB~~~Conserved virulence factor B~~~
MALDKDIVGSIEFLEVVGLQGSTYLLKGPNGENVKLNQSEMNDDDELEVGEEYSFFIYPNRSGELFATQNMPDITKDKYD
FAKVLKTDRDGARIDVGLPREVLVPWEDLPKVKSLWPQPGDHLLVTLRIDRENHMYGRLASESVVENMFTPVHDDNLKNE
VIEAKPYRVLRIGSFLLSESGYKIFVHESERKAEPRLGESVQVRIIGHNDKGELNGSFLPLAHERLDDDGQVIFDLLVEY
DGELPFWDKSSPEAIKEVFNMSKGSFKRAIGHLYKQKIINIETGKITLTKKGWSRIDSKE
>P81180 ~~~~~~Cyanovirin-N~~~
LGKFSQTCYNSAIQGSVLTSTCERTNGGYNTSSIDLNSVIENVDGSLKWQPSNFIETCRNTQLAGSSELAAECKTRAQQF
VSTKINLDDHIANIDGTLKYE
>P08550 ~~~cvpA~~~Colicin V production protein~~~COG1286
MVWIDYAIIAVIAFSSLVSLIRGFVREALSLVTWGCAFFVASHYYTYLSVWFTGFEDELVRNGIAIAVLFIATLIVGAIV
NFVIGQLVEKTGLSGTDRVLGVCFGALRGVLIVAAILFFLDSFTGVSKSEDWSKSQLIPQFSFIIRCFFDYLQSSSSFLP
RA
>Q02113 ~~~lytB~~~Amidase enhancer~~~COG2247
MKSCKQLIVCSLAAILLLIPSVSFAADSNISVKLLNYIGNKSSISLSPTGFYKVTGDNVAVTDRFAGASRYETATLASNS
QWKNPNTVILVNRDIFIDALPVIPLAKKLNAPVLFTQPDTLTKTTERQIAKFNPDNILIIGGARSISKDVENKLKSYGAV
KRISGKNRYVLSENIAKQMGSYDKAIVVTGRVFQDALAIAPYAAAHGYPILLTEKDKLPDYDLPKQVIIIGSSFSVSDSV
ENQIKKTSTVQRIPGSTRYELTANIIKQLKLKADKVVMTNGTKYADVLIGASLASKKNSQILFVKQDSVPAAAKSITKDK
ATYAYDFIGSTSSISAEVENSLADEFYLADGGTYNLKINSGKLNLENIKTYGNSLRIKPENYSTSNRISLDGKQYLGTVN
FSIESTKYIRPVNENIPFEDYLKGVIPNEMPASWSLEALKAQTVAARTYSITKTGTTVPDTTAFQVYGGYSWNSNTNKAV
EQTKGKVLKYNGSLITAAYSSSNGGYTEASNEVWSSSVPYLIAKKDTKDPQIGWTLTLSKQQLDTKSLDLTKPSSWWSSA
TETDSARLSGVKNWILKNKETSADSVKIASIDDLSFSGTTQGQRAKTASMKVKYFVKSSTGSYNLSKITTISVPTSELRT
MIGATVFKSTYVTVKKDTSKYTISGKGYGHGIGMSQYGAKARAEAGDSYSSILKFYYPGTTLTSY
>P81717 3.5.1.28~~~cwhA~~~N-acetylmuramoyl-L-alanine amidase A~~~
AVDFGEAIWNPASSSNYSTASNQTSAVIMHTMEGSYAGSISWFQNPSAQVSAHYLIRKSDGQITQMVREYHQAWHAKNHN
YYTIGIEHDGRAADAGNWSAAMVNASARLTKSICARRGVNCASAWKGPGYDTFHLVPDSVRVKGHGMLSGNENRYDPGKY
FPWSNYYNLINGGGGNP
>P24808 3.5.1.28~~~cwlA~~~N-acetylmuramoyl-L-alanine amidase CwlA~~~COG3409
MAIKVVKNLVSKSKYGLKCPNPMKAEYITIHNTANDASAANEISYMKNNSSSTSFHFAVDDKQVIQGIPTNRNAWHTGDG
TNGTGNRKSIGVEICYSKSGGVRYKAAEKLAIKFVAQLLKERGWGIDRVRKHQDWNGKYCPHRILSEGRWIQVKTAIEAE
LKKLGGKTNSSKASVAKKKTTNTSSKKTSYALPSGIFKVKSPMMRGEKVTQIQKALAALYFYPDKGAKNNGIDGVYGPKT
ADAIRRFQSMYGLTQDGIYGPKTKAKLEALLK
>Q06320 3.5.1.28~~~cwlC~~~Sporulation-specific N-acetylmuramoyl-L-alanine amidase~~~COG0860
MVKIFIDPGHGGSDPGATGNGLQEKTLTLQIALALRTILTNEYEGVSLLLSRTSDQYVSLNDRTNAANNWGADFFLSIHV
NSGGGTGFESYIYPDVGAPTTTYQSTIHSEVIQAVDFADRGKKTANFHVLRESAMPALLTENGFIDTVSDANKLKTSSFI
QSLARGHANGLEQAFNLKKTSSSGLYKVQIGAFKVKANADSLASNAEAKGFDSIVLLKDGLYKVQIGAFSSKDNADTLAA
RAKNAGFDAIVILES
>P50864 3.5.1.28~~~cwlD~~~Germination-specific N-acetylmuramoyl-L-alanine amidase~~~COG0860
MRKKLKWLSFLLGFIILLFLFKYQFSNNDSWKPWSLPLSGKIIYLDPGHGGPDGGAVGGKLLEKDVTLEVAFRVRDYLQE
QGALVIMTRESDTDLAPEGTKGYSRRKAEDLRQRVKLINHSEAELYISIHLNAIPSQKWSGAQSFYYGKYAENEKVAKYI
QDELRRNLENTTRKAKRIHGIYLMQNVTKPGALIEVGFLSNPSEATLLGKPKYQDKVASSIYKGILRYFTEKGDPPE
>P54450 3.5.1.28~~~cwlH~~~N-acetylmuramoyl-L-alanine amidase CwlH~~~COG3409
MVTIKKDFIPVSNDNRPGYAMAPAYITVHNTANTAKGADAKMHAKFVKNPNTSESWHFTVDDSVIYQHLPIDENGWHAGD
GTNGTGNRKSIGIEICENADGDFEKATSNAQWLIRKLMKENNIPLNRVVPHKKWSGKECPRKLLDHWNSFLNGISSSDTP
PKETSPSYPLPSGVIKLTSPYRKGTNILQLQKALAVLHFYPDKGAKNNGIDGVYGPKTANAVKRFQLMNGLTADGIYGPK
TKAKLKSKLK
>P42249 ~~~cwlJ~~~Cell wall hydrolase CwlJ~~~COG3773
MAVVRATSADVDLMARLLRAEAEGEGKQGMLLVGNVGINRLRANCSDFKGLRTIRQMIYQPHAFEAVTHGYFYQRARDSE
RALARRSINGERRWPAKFSLWYFRPQGDCPAQWYNQPFVARFKSHCFYQPTAETCENVYNTF
>O34360 3.4.-.-~~~cwlK~~~Peptidoglycan L-alanyl-D-glutamate endopeptidase CwlK~~~COG1876
MNLPAKTFVILCILFLLDLCFSYIRHEWHSQNALQDMPVPSDLHPIVKQNADALKAAAANKGIDVVITEGFRSFKEQDEL
YKQGRTKKGNIVTYARGGESYHNYGLAIDFALQKKDGSIIWDMEYDGNQNGKSDWLEVVEIAKTLGFEWGGDWKRFKDYP
HLEMIPN
>L7N653 3.5.1.28~~~cwlM~~~N-acetylmuramoyl-L-alanine amidase CwlM~~~COG0860
MPSPRREDGDALRCGDRSAAVTEIRAALTALGMLDHQEEDLTTGRNVALELFDAQLDQAVRAFQQHRGLLVDGIVGEATY
RALKEASYRLGARTLYHQFGAPLYGDDVATLQARLQDLGFYTGLVDGHFGLQTHNALMSYQREYGLAADGICGPETLRSL
YFLSSRVSGGSPHAIREEELVRSSGPKLSGKRIIIDPGRGGVDHGLIAQGPAGPISEADLLWDLASRLEGRMAAIGMETH
LSRPTNRSPSDAERAATANAVGADLMISLRCETQTSLAANGVASFHFGNSHGSVSTIGRNLADFIQREVVARTGLRDCRV
HGRTWDLLRLTRMPTVQVDIGYITNPHDRGMLVSTQTRDAIAEGILAAVKRLYLLGKNDRPTGTFTFAELLAHELSVERA
GRLGGS
>P40767 3.4.-.-~~~cwlO~~~Peptidoglycan DL-endopeptidase CwlO~~~COG0791
MRKSLITLGLASVIGTSSFLIPFTSKTASAETLDEKKQKIESKQSEVASSIEAKEKELTELQENQSKIEKELKDINDKAL
DTSNKIEDKKEENDKTKEEIKKLKKEIKETEARIEKRNEILKKRVRSLQESGGSQGYIDVLLGSTSFGDFISRATAVSSI
VDADKDLIKQQEQDKAKLEDSEADLNDKLKEVQAALAKLETMQKDLDKQLNEKDKLFDEAKASQKKTAKAISELKSEASE
LANQKANTEAEQARIKKEQEAAAALIKKQEEAQKASDETQTDDSQTATTESSKASSSDDSSDNSSDNSSNGSSNSSSNGS
SSKKSSGSNSNSGGTVISNSGGIEGAISVGSSIVGQSPYKFGGGRTQSDINNRIFDCSSFVRWAYASAGVNLGPVGGTTT
DTLVGRGQAVSASEMKRGDLVFFDTYKTNGHVGIYLGNGTFLNDNTSHGVSVDSMSNPYWKAAFKGVVRRVVQ
>O31608 3.2.1.17~~~cwlQ~~~Bifunctional muramidase/lytic transglycosylase CwlQ~~~COG0741
MLNSANTTAPSLLSAYGLNSYTSSNSGSVTKAAESTETAVADSASNKHEANQIRSGDFSIDSAIKKAADKYGVDEKLIRA
VIKQESGFNAKAVSGAGAMGLMQLMPSTASSLGVSNPLDPQQNVEGGTKYLKQMLDKYDGNVSMALAAYNAGPGNVDRYG
GIPPFQETQNYVKKITSVYYA
>O31852 3.4.19.11~~~cwlS~~~D-gamma-glutamyl-meso-diaminopimelic acid endopeptidase CwlS~~~COG0791
MKKKIVAGLAVSAVVGSSMAAAPAEAKTIKVKSGDSLWKLSRQYDTTISALKSENKLKSTVLYVGQSLKVPESSKKSTTS
PSNSSKTSTYTVAYGDSLWMIAKNHKMSVSELKSLNSLSSDLIRPGQKLKIKGTSSSSGSNGSKKNSGSNSSGSSKSTYT
VKLGDSLWKIANSLNMTVAELKTLNGLTSDTLYPKQVLKIKGSSSPKNGNSGSKKPSNSNPSKTTTYKVKAGDSLWKIAN
RLGVTVQSIRDKNNLSSDVLQIGQVLTISGASKSNPSNPTKPTKPKDNSGSNIQIGSKIDRMITEAKKYVGVPYRWGGNT
PAGFDCSGFIYYLINNVSSISRLSTAGYWNVMQKVSQPSVGDFVFFTTYKSGPSHMGIYLGGGDFIHASSSGVDISNLSN
SYWKQRYLGARSYF
>P96645 3.2.1.17~~~cwlT~~~Bifunctional muramidase/DL-endopeptidase CwlT~~~COG0741
MISKKVVLPLVFSAPFIFFFVLCIVVVMTISRENQVGDDFIGGGDGEYETVGIAPEVERFRAVFEKYARQEGVFDQVNII
MALTMQESGGRSLDIMQSSESIGLPPNSITDPERSIEVGIKHFKKVFKQAGGDVRLTLQAYNFGSGFIDYVKKNGGKYTK
KLALDFSRLQAFKMGWKSYGDPSYVDHVMRYVKGSDKNVKPVKGSMDFYETVMKEALKYEGQPYAWGGSNPETGFDCSGL
VQWSFAKAGITLPRTAQEQHGATKKISEKEATAGDLVFFGGTYEGKAITHVGIYVGNGRMFNSNDSGIQYSDLKSGYWRD
HLVSFGRIK
>P0DX27 3.4.-.-~~~~~~Peptidoglycan hydrolase Cwl0971~~~
MIVVKKAIAALGIGAVAVSVSSINASALEKGTVTASALNIRSGPSSDCDKVAKLYKGKTVEILEKSNGWYKVRVSSSVVG
WGSAKYISTSGSSEGTSNQNNPTSSGTTISGNGKVNVSSRLNVRSGAGTNYSLVGKANNGDVVKLLEQSNGWYKIKLSNG
VTGWASSQYISKTSEDVGTNNSSNSNSTNNSDKKPSSEESIEGKNGKVTSAVSLNVRSGPGTSYSIIGKLNGGDVVELKS
KNNGWYKVKLSSGTIGWVSASYISETNEGTKEKPNSSSNQNSQSNSNSKPSFTGNSDKSTAKGSTIVDFAYTLIGIPYQW
GASGPDKFDCSGFTQYVFKHSVGVSIPRVSREQANFGSAISMGNYAPGDLVYFDTDGDGTTNHVGIYVGNSKFIHCSGTQ
TNPNKVKVDNLTSSYWSKVLLGARRFV
>A0QNF5 ~~~cwsA~~~Cell wall synthesis protein CwsA~~~
MPARADVRLAPRQRLTRGLKYTAVGPVDITRGVLGIGADTAQATAAELRRRYASGKLQRQLAAAAEAVAALPETIQEAVQ
EVVSPPKKRRRRPLLIAAVAVTVLGGGAAAFSIVRRRSRPQEPPTLAPSVEVAPKP
>P9WJF3 ~~~cwsA~~~Cell wall synthesis protein CwsA~~~
MSEQVETRLTPRERLTRGLAYSAVGPVDVTRGLLELGVGLGLQSARSTAAGLRRRYREGRLAREVAAAQETLAQELTAAQ
DVVANLPQALQDARTQRRSKHHLWIFAGIAAAILAGGAVAFSIVRRSSRPEPSPRPPSVEVQPRS
>Q02760 ~~~petC~~~Cytochrome c1~~~
MIRKLTLTAATALALSGGAAMAAGGGHVEDVPFSFEGPFGTFDQHQLQRGLQVYTEVCAACHGMKFVPIRSLSEPGGPEL
PEDQVRAYATQFTVTDEETGEDREGKPTDHFPHSALENAADLSLMAKARAGFHGPMGTGISQLFNGIGGPEYIYSVLTGF
PEEPPKCAEGHEPDGFYYNRAFQNGSVPDTCKDANGVKTTAGSWIAMPPPLMDDLVEYADGHDASVHAMAEDVSAFLMWA
AEPKLMARKQAGFTAVMFLTVLSVLLYLTNKRLWAGVKGKKKTNV
>P13627 ~~~petC~~~Cytochrome c1~~~
MTLRNASLTAVAALTVALAGGAVAQDASTAPGTTAPAGSSYHTNEAAPAAADTAPAAEAADEPAAEEAEAGEAEVTEEPA
ATETPAEEPAADEPAATEEPDAEAEPAAEEAQATTEEAPAEEPAAEEPAAEEPAEEPAADAPAEEAAAEEAPAEPEAAAE
EPAAEEPEATEEEAPAEEAAAEEAPAEEVVEDEAAADHGDAAAQEAGDSHAAAHIEDISFSFEGPFGKFDQHQLQRGLQV
YTEVCSACHGLRYVPLRTLADEGGPQLPEDQVRAYAANFDITDPETEEDRPRVPTDHFPTVSGEGMGPDLSLMAKARAGF
HGPYGTGLSQLFNGIGGPEYIHAVLTGYDGEEKEEAGAVLYHNAAFAGNWIQMAAPLSDDQVTYEDGTPATVDQMATDVA
AFLMWTAEPKMMDRKQVGFVSVIFLIVLAALLYLTNKKLWQPIKHPRKPE
>P0CY49 ~~~petC~~~Cytochrome c1~~~
MKKLLISAVSALVLGSGAALANSNVQDHAFSFEGIFGKFDQAQLRRGFQVYSEVCSTCHGMKFVPIRTLSDDGGPQLDPT
FVREYAAGLDTIIDKDSGEERDRKETDMFPTRVGDGMGPDLSVMAKARAGFSGPAGSGMNQLFKGIGGPEYIYRYVTGFP
EENPACAPEGIDGYYYNEVFQVGGVPDTCKDAAGIKTTHGSWAQMPPALFDDLVTYEDGTPATVDQMGQDVASFLMWAAE
PKLVARKQMGLVAVVMLGLLSVMLYLTNKRLWAPYKRQKA
>D5ANZ4 ~~~petC~~~Cytochrome c1~~~COG2857
MKKLLISAVSALVLGSGAAFANSNVPDHAFSFEGIFGKYDQAQLRRGFQVYNEVCSACHGMKFVPIRTLADDGGPQLDPT
FVREYAAGLDTIIDKDSGEERDRKETDMFPTRVGDGMGPDLSVMAKARAGFSGPAGSGMNQLFKGMGGPEYIYNYVIGFE
ENPECAPEGIDGYYYNKTFQIGGVPDTCKDAAGVKITHGSWARMPPPLVDDQVTYEDGTPATVDQMAQDVSAFLMWAAEP
KLVARKQMGLVAMVMLGLLSVMLYLTNKRLWAPYKGHKA
>P23135 ~~~petC~~~Cytochrome c1~~~
MTTIVKRALVAAGMVLAIGGAAQANEGGVSLHKQDWSWKGIFGRYDQPQLQRGFQVFHEVCSTCHGMKRVAYRNLSALGF
SEDGIKELAAEKEFPAGPDDNGDMFTRPGTPADHIPSPFANDKAAAAANGGAAPPDLSLLAKARPGGPNYIYSLLEGYAS
DSPGEPAEWWVKQQQEKGLEVAFNEAKYFNDYFPGHAISMPPPLMDDLITYEDGTAATKDQMAQDVVAYLNWAAEPELDA
RKSLGLKVLLFLGVLTAMLLALKLAIWRDVKH
>P09787 ~~~pchC~~~4-cresol dehydrogenase [hydroxylating] cytochrome c subunit~~~
MTFPFSGAAVKRMLVTGVVLPFGLLVAAGQAQADSQWGSGKNLYDKVCGHCHKPEVGVGPVLEGRGLPEAYIKDIVRNGF
RAMPAFPASYVDDESLTQVAEYLSSLPAPAAQP
>Q45233 ~~~cycA~~~Cytochrome c-550~~~COG3474
MTKLTFGALVALAMTAAASTAMSSKAMAQDAAAGKTSFNKCLACHAIGEGAKNKVGPELNGLNGRKSGTAPDYSYSDANK
NSGITWDEATFKEYIKDPKAKIPGTKMAFAGIKNETEINNLWTFVSQFDKDGKIKQ
>P82603 ~~~psbV~~~Cytochrome c-550~~~
LTEELRTFPINAQGDTAVLSLKEIKKGQQVFNAACAQCHALGVTRTNPDVNLSPEALALATPPRDNIAALVDYIKNPTTY
DGFVEISELHPSLKSSDIFPKMRNISEDDLYNVAGYILLQPKVRGEQWG
>P19129 ~~~psbV~~~Cytochrome c-550~~~
LELDEKTLTITLNDAGESVTLTSEQATEGQKLFVANCTKCHLQGKTKTNNNVSLGLGDLAKAEPPRDNLLALIDYLEHPT
SYDGEDDLSELHPNVSRPDIYPELRNLTEDDVYNVAAYMLVAPRLDERWGGTIYF
>P00085 ~~~~~~Cytochrome c-550~~~
GDVEAGKAAFNKCKACHEIGESAKNKVGPELDGLDGRHSGAVEGYAYSPANKASGITWTEAEFKEYIKDPKAKVPGTKMV
FAGIKKDSELDNLWAYVSQFDKDGKVKAK
>P12832 ~~~~~~Cytochrome c-550~~~
GDAAKGANVAKSCGTCHSFEQGGAKKQGPNLFGITTRGPGKAEGFNYSPSYKAAAAKGFAWDAATLQDYITDPTAFLSNK
TGDAAARDKMTFKLAKPDERADVIAYLATLK
>P00096 ~~~cycA~~~Cytochrome c-550~~~
MKISIYATLAAITLALPAAAQDGDAAKGEKEFNKCKACHMIQAPDGTDIIKGGKTGPNLYGVVGRKIASEEGFKYGEGIL
EVAEKNPDLTWTEADLIEYVTDPKPWLVKMTDDKGAKTKMTFKMGKNQADVVAFLAQNSPDAGGDGEAAAEGESN
>P80288 ~~~~~~Cytochrome c-550~~~COG3474
QEGDAAKGEKEFNKCKACHMVQAPDGTDIVKGGKTGPNLYGVVGRKIASEEGFKYGDGILEVAEKNPDLVWTEADLIEYV
TDPKPWLVEKTGDSAAKTKMTFKLGKNQADVVAFLAQNSPDAGAEAAPAEDAAD
>Q00499 ~~~cyc~~~Cytochrome c-550~~~COG3474
MKISIYATLAALSLALPAVAQEGDAAKGEKEFNKCKACHMVQAPDGTDIVKGGKTGPNLYGVVGRKIASVEGFKYGDGIL
EVAEKNPDMVWSEADLIEYVTDPKPWLVEKTGDSAAKTKMTFKLGKNQADVVAFLAQHSPDAGAEAAPAEGAAN
>Q9I2C5 ~~~exaB~~~Cytochrome c550~~~
MNKNNVLRGLLVLAGLSLSSLALAHGDVTPQAVDTKGLEPLGKEWRDTNPYRKPYAKHDLAVEIGASAYNQNCARCHGLE
AKSGGIAPDLRLLETGAEGDEWFKERVINGAVRDGAVYMPKMADFISQEGLWAIRSYLESVHVDE
>Q55210 ~~~psbV~~~Cytochrome c-550~~~COG2010
MNKILGIDPLKKFIFGISAFVLLFWQLNVGAANATALREVDRTVNLNETETVVLSDQQVAKGERIFINTCSTCHNSGRTK
SNPNVTLSLVDLEGAEPRRDNILAMVDYLKNPTSYDGELDLSQLHPNTVRADIWSSMRNLNEEDLQNVSGYVLVQAQVRG
VAWGGGKTVN
>Q55013 ~~~psbV~~~Cytochrome c-550~~~COG2010
MKRFFLVAIASVLFFFNTMVGSANAVELTESTRTIPLDEAGGTTTLTARQFTNGQKIFVDTCTQCHLQGKTKTNNNVSLG
LADLAGAEPRRDNVLALVEFLKNPKSYDGEDDYSELHPNISRPDIYPEMRNYTEDDIFDVAGYTLIAPKLDERWGGTIYF
>P0A386 ~~~psbV~~~Cytochrome c-550~~~COG2010
MLKKCVWLAVALCLCLWQFTMGTALAAELTPEVLTVPLNSEGKTITLTEKQYLEGKRLFQYACASCHVGGITKTNPSLDL
RTETLALATPPRDNIEGLVDYMKNPTTYDGEQEIAEVHPSLRSADIFPKMRNLTEKDLVAIAGHILVEPKILGDKWGGGK
VYY
>P0A387 ~~~psbV~~~Cytochrome c-550~~~
MLKKCVWLAVALCLCLWQFTMGTALAAELTPEVLTVPLNSEGKTITLTEKQYLEGKRLFQYACASCHVGGITKTNPSLDL
RTETLALATPPRDNIEGLVDYMKNPTTYDGEQEIAEVHPSLRSADIFPKMRNLTEKDLVAIAGHILVEPKILGDKWGGGK
VYY
>P80549 ~~~~~~Cytochrome c-551~~~COG4654
MAFTAMTVAPSALADLVLAQKSGCTVCHSVEAAIVGPAYKDVAAKYRGDAAAQDRLVAKVMAGGVGNWGQVPMPPNAHVP
AADIKALVTWILGL
>P00104 ~~~~~~Cytochrome c-551~~~
ETGEELYKTKGCTVCHAIDSKLVGPSFKEVTAKYAGQAGIADTLAAKIKAGGSGNWGQIPMPPNPVSEAEAKTLAEWVLT
HK
>Q56247 ~~~cccA~~~Cytochrome c-551~~~
MKWKLAAMFLGVSLALAACGGGGDNAGEKNGGSNGGGDTAAAAEQIFKQNCASCHGQDLSGGVGPNLQKVGSKYSKDEIK
NIIANGRGAMPAGIIKGEDADKVAEWLAAKK
>O34594 ~~~cccB~~~Cytochrome c-551~~~COG2010
MKSKLSILMIGFALSVLLAACGSNDAKEEKTDTGSKTEATASEGEELYQQSCVGCHGKDLEGVSGPNLQEVGGKYDEHKI
ESIIKNGRGNMPKGLVDDNEAAVIAKWLSEKK
>B3QM18 ~~~pscC~~~Photosynthetic reaction center cytochrome c-551~~~COG3245
MDNKSNGKLIALAIGGAVLMGTLFFLVSFLTGYSPAPNHSAILTPLRSFMGWFLLIFCASLIIMGLGKMSGAISDKWFLS
FPLSIFVIVMVMFFSLRFYWEKGRTTTVDGKYIRSVEQLNDFLNKPAATSDLPPVPADFDFAAAEKLTDAKCNKCHTLGS
VADLFRTKYKKTGQVKLIVKRMQGFPGANISDDEVIEIGTWLQEKF
>O07091 ~~~pscC~~~Cytochrome c~~~COG3245
MDKNSNGKLIALAVGGAVLMGALFFSVSFLTGYIPAPNHSAILTPLRSFMGWFLLIFCASIIIMGLGKMSSAISDKWFLS
FPLSIFVIVMVMFLSLRVYWEKGRTTTVDGKYIRTTAELKEFLNKPAATSDVPPAPAGFDFDAAKKLVDVRCNKCHTLDS
VADLFRTKYKKTGQVNLIVKRMQGFPGSGISDDDAKTIGIWLHEKF
>P00122 ~~~~~~Cytochrome c-551~~~
DGESIYINGTAPTCSSCHDRGVAGAPELNAPEDWADRPSSVDELVESTLAGKGAMPAYDGRADREDLVKAIEYMLSTL
>P38587 ~~~~~~Cytochrome c-551~~~
DGQSIYESGTSPTCASCHDRGTAGAPKINEPGDWDGIDLDAEALVDSTMDGKGAMPAYDGRADRDEVKEAVEYMLSTIE
>P00099 ~~~nirM~~~Cytochrome c-551~~~
MKPYALLSLLATGTLLAQGAWAEDPEVLFKNKGCVACHAIDTKMVGPAYKDVAAKFAGQAGAEAELAQRIKNGSQGVWGP
IPMPPNAVSDDEAQTLAKWVLSQK
>P00103 ~~~~~~Cytochrome c-551~~~
STGEELFKAKACVACHSVDKKLVGPAFHDVAAKYGAQGDGVAHITNSIKTGSKGNWGPIPMPPNAVSPEEAKTLAEWIVT
LK
>P00100 ~~~~~~Cytochrome c-551~~~
EDGAALFKSKPCAACHTIDSKMVGPALKEVAAKNAGVKDADKTLAGHIKNGTQGNWGPIPMPPNQVTDAEALTLAQWVLS
LK
>P00102 ~~~~~~Cytochrome c-551~~~
ASGEELFKSKPCGACHSVQAKLVGPALKDVAAKNAGVDGAADVLAGHIKNGSTGVWGAMPMPPNPVTEEEAKTLAEWVLT
LK
>P07625 ~~~~~~Cytochrome c-551~~~COG3474
MTRTLAVVLAMTFSAAPVFAEGDIEAGEKAFNKCKSCHQIVSDAGEEIVKGGRTGPNLYGVLGRQAGTADFRYGDDLVAA
GEAGLVWDADNFVEYVTDPRAFLRAYLDDSKAKSKMAYKLRSGGEDIAAYLASVSGSSS
>P00101 ~~~nirM~~~Cytochrome c-551~~~
MKKILIPMLALGGALAMQPALAQDGEALFKSKPCAACHSVDTKMVGPALKEVAAKNAGVEGAADTLALHIKNGSQGVWGP
IPMPPNPVTEEEAKILAEWVLSLK
>P74917 ~~~cyc1~~~Cytochrome c-552~~~
MTTYLSQDRLRNKENDTMTYQHSKMYQSRTFLLFSALLLVAGQASAAVGSADAPAPYRVSSDCMVCHGMTGRDTLYPIVP
RLAGQHKSYMEAQLKAYKDHSRADQNGEIYMWPVAQALDSAKITALADYFNAQKPPMQSSGIKHAGAKEGKAIFNQGVTN
EQIPACMECHGSDGQGAGPFPRLAGQRYGYIIQQLTYFHNGTRVNTLMNQIAKNITVAQMKDVAAYLSSL
>P24059 ~~~cycB~~~Cytochrome c-552~~~COG2010
MHLHLRGICLVLAVASSSSSALAADAGHGADLAKRWCASCHVVANGQAVASADVPSFASVARRPDFSSEKLAFFLLDPHP
KMPSFPLSRTEAGDIAAYIGSLRP
>P15452 ~~~~~~Cytochrome c-552~~~COG4654
MKKFLLVAVVGLAGITFANEQLAKQKGCMACHDLKAKKVGPAYADVAKKYAGRKDAVDYLAGKIKKGGSGVWGSVPMPPQ
NVTDAEAKQLAQWILSIK
>P82903 ~~~~~~Cytochrome c-552~~~
AGDIEAGKAKAAVCAACHGQNGISQVPIYPNLAGQKEQYLVAALKAYKAGQRQGGQAPVMQGQATALSDADIANLAAYYA
SLPADGQG
>P95339 ~~~cyt~~~Cytochrome c-552~~~COG4654
MKTAWLGTFAASALLVAGYAQADADLAKKNNCIACHQVETKVVGPALKDIAAKYADKDDAATYLAGKIKGGSSGVWGQIP
MPPNVNVSDADAKALADWILTLK
>P54820 ~~~cycM~~~Cytochrome c-552~~~
MFDTMTVTKAAGALIGSLLFLLLMSWAASGIFHVGTSGHGAEGEEHAQAYTYPVESAGGAEGEAVDEGPDFATVLASADP
AAGEKVFGKCKACHKLDGNDGVGPHLNGVVGRTVAGVDGFNYSDPMKAHGGDWTPEALQEFLTNPKAVVKGTKMAFAGLP
KIEDRANLIAYLEGQQ
>Q5SME3 ~~~cycA~~~Cytochrome c-552~~~COG2010
MKRTLMAFLLLGGLALAQADGAKIYAQCAGCHQQNGQGIPGAFPPLAGHVAEILAKEGGREYLILVLLYGLQGQIEVKGM
KYNGVMSSFAQLKDEEIAAVLNHIATAWGDAKKVKGFKPFTAEEVKKLRAKKLTPQQVLAERKKLGLK
>P31330 ~~~~~~Cytochrome c-553~~~
SGDLGAERYAKMCKSCHGADGSNAAMSRALKGLPAEEVKAALIGYKEQTYGGKKKGMMERVVKSLTDEDIEVLATHIGTF
>P04032 ~~~cyf~~~Cytochrome c-553~~~COG2863
MKRVLLLSSLCAALSFGLAVSGVAADGAALYKSCIGCHGADGSKAAMGSAKPVKGQGAEELYKKMKGYADGSYGGERKAM
MTNAVKKYSDEELKALADYMSKL
>P00120 ~~~~~~Cytochrome c-553~~~COG2863
MKRILVVMSICAALAFGVSAAMAADGAALYKSCVGCHGADGSKQAMGVGHAVKGQKADELFKKLKGYADGSYGGEKKAVM
TNLVKRYSDEEMKAMADYMSKL
>O25825 ~~~~~~Cytochrome c-553~~~COG2863
MKKVIMALGVLAFANALMATDVKALAKSCAACHGVKFEKKALGKSKIVNMMSEAEIEKDLMDFKSGANKNPIMSAQAKKL
SDEDIKALAKYIPTLK
>P82599 ~~~~~~Cytochrome c-553~~~
GGGNDTSNETDTGTSGGETAAVDAEAVVQQKCISCHGGDLTGASAPAIDKAGANYSEEEILDIILNGQGGMPGGIAKGAE
AEAVAAWLAEKK
>P9WQ35 4.6.1.1~~~cya~~~Adenylate cyclase~~~COG2114
MAARKCGAPPIAADGSTRRPDCVTAVRTQARAPTQHYAESVARRQRVLTITAWLAVVVTGSFALMQLATGAGGWYIALIN
VFTAVTFAIVPLLHRFGGLVAPLTFIGTAYVAIFAIGWDVGTDAGAQFFFLVAAALVVLLVGIEHTALAVGLAAVAAGLV
IALEFLVPPDTGLQPPWAMSVSFVLTTVSACGVAVATVWFALRDTARAEAVMEAEHDRSEALLANMLPASIAERLKEPER
NIIADKYDEASVLFADIVGFTERASSTAPADLVRFLDRLYSAFDELVDQHGLEKIKVSGDSYMVVSGVPRPRPDHTQALA
DFALDMTNVAAQLKDPRGNPVPLRVGLATGPVVAGVVGSRRFFYDVWGDAVNVASRMESTDSVGQIQVPDEVYERLKDDF
VLRERGHINVKGKGVMRTWYLIGRKVAADPGEVRGAEPRTAGV
>P40136 4.6.1.1~~~cya~~~Calmodulin-sensitive adenylate cyclase~~~
MTRNKFIPNKFSIISFSVLLFAISSSQAIEVNAMNEHYTESDIKRNHKTEKNKTEKEKFKDSINNLVKTEFTNETLDKIQ
QTQDLLKKIPKDVLEIYSELGGEIYFTDIDLVEHKELQDLSEEEKNSMNSRGEKVPFASRFVFEKKRETPKLIINIKDYA
INSEQSKEVYYEIGKGISLDIISKDKSLDPEFLNLIKSLSDDSDSSDLLFSQKFKEKLELNNKSIDINFIKENLTEFQHA
FSLAFSYYFAPDHRTVLELYAPDMFEYMNKLEKGGFEKISESLKKEGVEKDRIDVLKGEKALKASGLVPEHADAFKKIAR
ELNTYILFRPVNKLATNLIKSGVATKGLNVHGKSSDWGPVAGYIPFDQDLSKKHGQQLAVEKGNLENKKSITEHEGEIGK
IPLKLDHLRIEELKENGIILKGKKEIDNGKKYYLLESNNQVYEFRISDENNEVQYKTKEGKITVLGEKFNWRNIEVMAKN
VEGVLKPLTADYDLFALAPSLTEIKKQIPQKEWDKVVNTPNSLEKQKGVTNLLIKYGIERKPDSTKGTLSNWQKQMLDRL
NEAVKYTGYTGGDVVNHGTEQDNEEFPEKDNEIFIINPEGEFILTKNWEMTGRFIEKNITGKDYLYYFNRSYNKIAPGNK
AYIEWTDPITKAKINTIPTSAEFIKNLSSIRRSSNVGVYKDSGDKDEFAKKESVKKIAGYLSDYYNSANHIFSQEKKRKI
SIFRGIQAYNEIENVLKSKQIAPEYKNYFQYLKERITNQVQLLLTHQKSNIEFKLLYKQLNFTENETDNFEVFQKIIDEK
>J7QLC0 ~~~cya~~~Bifunctional hemolysin/adenylate cyclase~~~COG2931
MQQSHQAGYANAADRESGIPAAVLDGIKAVAKEKNATLMFRLVNPHSTSLIAEGVATKGLGVHAKSSDWGLQAGYIPVNP
NLSKLFGRAPEVIARADNDVNSSLAHGHTAVDLTLSKERLDYLRQAGLVTGMADGVVASNHAGYEQFEFRVKETSDGRYA
VQYRRKGGDDFEAVKVIGNAAGIPLTADIDMFAIMPHLSNFRDSARSSVTSGDSVTDYLARTRRAASEATGGLDRERIDL
LWKIARAGARSAVGTEARRQFRYDGDMNIGVITDFELEVRNALNRRAHAVGAQDVVQHGTEQNNPFPEADEKIFVVSATG
ESQMLTRGQLKEYIGQQRGEGYVFYENRAYGVAGKSLFDDGLGAAPGVPSGRSKFSPDVLETVPASPGLRRPSLGAVERQ
DSGYDSLDGVGSRSFSLGEVSDMAAVEAAELEMTRQVLHAGARQDDAEPGVSGASAHWGQRALQGAQAVAAAQRLVHAIA
LMTQFGRAGSTNTPQEAASLSAAVFGLGEASSAVAETVSGFFRGSSRWAGGFGVAGGAMALGGGIAAAVGAGMSLTDDAP
AGQKAAAGAEIALQLTGGTVELASSIALALAAARGVTSGLQVAGASAGAAAGALAAALSPMEIYGLVQQSHYADQLDKLA
QESSAYGYEGDALLAQLYRDKTAAEGAVAGVSAVLSTVGAAVSIAAAASVVGAPVAVVTSLLTGALNGILRGVQQPIIEK
LANDYARKIDELGGPQAYFEKNLQARHEQLANSDGLRKMLADLQAGWNASSVIGVQTTEISKSALELAAITGNADNLKSV
DVFVDRFVQGERVAGQPVVLDVAAGGIDIASRKGERPALTFITPLAAPGEEQRRRTKTGKSEFTTFVEIVGKQDRWRIRD
GAADTTIDLAKVVSQLVDANGVLKHSIKLDVIGGDGDDVVLANASRIHYDGGAGTNTVSYAALGRQDSITVSADGERFNV
RKQLNNANVYREGVATQTTAYGKRTENVQYRHVELARVGQLVEVDTLEHVQHIIGGAGNDSITGNAHDNFLAGGSGDDRL
DGGAGNDTLVGGEGQNTVIGGAGDDVFLQDLGVWSNQLDGGAGVDTVKYNVHQPSEERLERMGDTGIHADLQKGTVEKWP
ALNLFSVDHVKNIENLHGSRLNDRIAGDDQDNELWGHDGNDTIRGRGGDDILRGGLGLDTLYGEDGNDIFLQDDETVSDD
IDGGAGLDTVDYSAMIHPGRIVAPHEYGFGIEADLSREWVRKASALGVDYYDNVRNVENVIGTSMKDVLIGDAQANTLMG
QGGDDTVRGGDGDDLLFGGDGNDMLYGDAGNDTLYGGLGDDTLEGGAGNDWFGQTQAREHDVLRGGDGVDTVDYSQTGAH
AGIAAGRIGLGILADLGAGRVDKLGEAGSSAYDTVSGIENVVGTELADRITGDAQANVLRGAGGADVLAGGEGDDVLLGG
DGDDQLSGDAGRDRLYGEAGDDWFFQDAANAGNLLDGGDGRDTVDFSGPGRGLDAGAKGVFLSLGKGFASLMDEPETSNV
LRNIENAVGSARDDVLIGDAGANVLNGLAGNDVLSGGAGDDVLLGDEGSDLLSGDAGNDDLFGGQGDDTYLFGVGYGHDT
IYESGGGHDTIRINAGADQLWFARQGNDLEIRILGTDDALTVHDWYRDADHRVEIIHAANQAVDQAGIEKLVEAMAQYPD
PGAAAAAPPAARVPDTLMQSLAVNWR
>P0DKX7 ~~~cya~~~Bifunctional hemolysin/adenylate cyclase~~~COG2931
MQQSHQAGYANAADRESGIPAAVLDGIKAVAKEKNATLMFRLVNPHSTSLIAEGVATKGLGVHAKSSDWGLQAGYIPVNP
NLSKLFGRAPEVIARADNDVNSSLAHGHTAVDLTLSKERLDYLRQAGLVTGMADGVVASNHAGYEQFEFRVKETSDGRYA
VQYRRKGGDDFEAVKVIGNAAGIPLTADIDMFAIMPHLSNFRDSARSSVTSGDSVTDYLARTRRAASEATGGLDRERIDL
LWKIARAGARSAVGTEARRQFRYDGDMNIGVITDFELEVRNALNRRAHAVGAQDVVQHGTEQNNPFPEADEKIFVVSATG
ESQMLTRGQLKEYIGQQRGEGYVFYENRAYGVAGKSLFDDGLGAAPGVPSGRSKFSPDVLETVPASPGLRRPSLGAVERQ
DSGYDSLDGVGSRSFSLGEVSDMAAVEAAELEMTRQVLHAGARQDDAEPGVSGASAHWGQRALQGAQAVAAAQRLVHAIA
LMTQFGRAGSTNTPQEAASLSAAVFGLGEASSAVAETVSGFFRGSSRWAGGFGVAGGAMALGGGIAAAVGAGMSLTDDAP
AGQKAAAGAEIALQLTGGTVELASSIALALAAARGVTSGLQVAGASAGAAAGALAAALSPMEIYGLVQQSHYADQLDKLA
QESSAYGYEGDALLAQLYRDKTAAEGAVAGVSAVLSTVGAAVSIAAAASVVGAPVAVVTSLLTGALNGILRGVQQPIIEK
LANDYARKIDELGGPQAYFEKNLQARHEQLANSDGLRKMLADLQAGWNASSVIGVQTTEISKSALELAAITGNADNLKSV
DVFVDRFVQGERVAGQPVVLDVAAGGIDIASRKGERPALTFITPLAAPGEEQRRRTKTGKSEFTTFVEIVGKQDRWRIRD
GAADTTIDLAKVVSQLVDANGVLKHSIKLDVIGGDGDDVVLANASRIHYDGGAGTNTVSYAALGRQDSITVSADGERFNV
RKQLNNANVYREGVATQTTAYGKRTENVQYRHVELARVGQLVEVDTLEHVQHIIGGAGNDSITGNAHDNFLAGGSGDDRL
DGGAGNDTLVGGEGQNTVIGGAGDDVFLQDLGVWSNQLDGGAGVDTVKYNVHQPSEERLERMGDTGIHADLQKGTVEKWP
ALNLFSVDHVKNIENLHGSRLNDRIAGDDQDNELWGHDGNDTIRGRGGDDILRGGLGLDTLYGEDGNDIFLQDDETVSDD
IDGGAGLDTVDYSAMIHPGRIVAPHEYGFGIEADLSREWVRKASALGVDYYDNVRNVENVIGTSMKDVLIGDAQANTLMG
QGGDDTVRGGDGDDLLFGGDGNDMLYGDAGNDTLYGGLGDDTLEGGAGNDWFGQTQAREHDVLRGGDGVDTVDYSQTGAH
AGIAAGRIGLGILADLGAGRVDKLGEAGSSAYDTVSGIENVVGTELADRITGDAQANVLRGAGGADVLAGGEGDDVLLGG
DGDDQLSGDAGRDRLYGEAGDDWFFQDAANAGNLLDGGDGRDTVDFSGPGRGLDAGAKGVFLSLGKGFASLMDEPETSNV
LRNIENAVGSARDDVLIGDAGANVLNGLAGNDVLSGGAGDDVLLGDEGSDLLSGDAGNDDLFGGQGDDTYLFGVGYGHDT
IYESGGGHDTIRINAGADQLWFARQGNDLEIRILGTDDALTVHDWYRDADHRVEIIHAANQAVDQAGIEKLVEAMAQYPD
PGAAAAAPPAARVPDTLMQSLAVNWR
>P00936 4.6.1.1~~~cyaA~~~Adenylate cyclase~~~COG3072
MYLYIETLKQRLDAINQLRVDRALAAMGPAFQQVYSLLPTLLHYHHPLMPGYLDGNVPKGICLYTPDETQRHYLNELELY
RGMSVQDPPKGELPITGVYTMGSTSSVGQSCSSDLDIWVCHQSWLDSEERQLLQRKCSLLENWAASLGVEVSFFLIDENR
FRHNESGSLGGEDCGSTQHILLLDEFYRTAVRLAGKRILWNMVPCDEEEHYDDYVMTLYAQGVLTPNEWLDLGGLSSLSA
EEYFGASLWQLYKSIDSPYKAVLKTLLLEAYSWEYPNPRLLAKDIKQRLHDGEIVSFGLDPYCMMLERVTEYLTAIEDFT
RLDLVRRCFYLKVCEKLSRERACVGWRRAVLSQLVSEWGWDEARLAMLDNRANWKIDQVREAHNELLDAMMQSYRNLIRF
ARRNNLSVSASPQDIGVLTRKLYAAFEALPGKVTLVNPQISPDLSEPNLTFIYVPPGRANRSGWYLYNRAPNIESIISHQ
PLEYNRYLNKLVAWAWFNGLLTSRTRLYIKGNGIVDLPKLQEMVADVSHHFPLRLPAPTPKALYSPCEIRHLAIIVNLEY
DPTAAFRNQVVHFDFRKLDVFSFGENQNCLVGSVDLLYRNSWNEVRTLHFNGEQSMIEALKTILGKMHQDAAPPDSVEVF
CYSQHLRGLIRTRVQQLVSECIELRLSSTRQETGRFKALRVSGQTWGLFFERLNVSVQKLENAIEFYGAISHNKLHGLSV
QVETNHVKLPAVVDGFASEGIIQFFFEETQDENGFNIYILDESNRVEVYHHCEGSKEELVRDVSRFYSSSHDRFTYGSSF
INFNLPQFYQIVKVDGREQVIPFRTKSIGNMPPANQDHDTPLLQQYFS
>P40134 4.6.1.1~~~cyaA~~~Adenylate cyclase~~~COG3072
MECNLAQAKQWVSALDQRRFERALQGSGDAFQHVLAIVPLLLHLNHPQLPGYVIHAPSGIASFLASDYQKKWLTNEYGIH
YADHKPSTLKSAVNFHEVFPPILGVYVMGSFGSISQTSSSDLDTWICVRDGLSLDEYTLLTQKAKRISEWAMQFNVEINF
YLMDQQRFRNEHYADPLTIENSGSAQYMLLLDEFYRSAVRLAGKPLLWLHLWVENEKDYEKEVARLITEGEIDPNDWVDF
GGLGQFSANEYFGASLWHLYKGIDSPYKSVLKILLLEAYSKEYPNTCLIARTFKRDLLAGNTNPDHHFDPYIAILAKVTQ
YLTALSEFKRLDFVHRCFYVKATEDFARYQANNWRIRYMEILAQEWGWSAETVKHLNKRPFWKIKAVKENHDNIMKFLML
SYRNLVEFARKHHIHSSVVPQDINILSRKLYTAFEELPGKVSLLNTQISHNLSEAHLTFVEVRGNKHFKDGWYLINQPIH
HIMFSKERVIEYGESLNKLVSWAYFNHLLTEKTELSIFSKNVTLSTLQRFVTNLRQSFPSTIAKQPKNSDLLNQCEIRSL
FIAINLTTDPTSKVEEVLTGISSRDLFSFGSLEQSLVGSIDFTYRNVWNEIRTLHFEGQNAILLALKVLSNKIYRGVNRP
DSIQVYCYSERYRQDLRQLVMGLVNRCVSIQVGDIQQPCQTSRLRVAGKNWQLFFEDRGISLQEIGNESVCNEAESAVDF
DEVLQTPIEDGETNQESRRYPPEMDAFASEGFLQFFFEDNSDHSFNVYILDESNHLEIYRHCDGEKDEKVREINQLYQNA
KQEGDKNPYNIVQHNFNYPQFYQLQNGKNGISIVPFKFRQMNK
>P40137 4.6.1.1~~~cyaA~~~Adenylate cyclase 1~~~
MMDAAGRTTQETLARTLESEQGHNALNLSWVRLLATSAVLIVSLYFGRVRGMTDWDVYTPPFAAYWSVTALTLVALYRFE
RLRRWAGLSLALVDVPAIYWLQHIALPLSPSPGGVAGFTLGLYATLILLSALSLRRTMTLVVTACAAVGEVALQREAHIS
LGAQLTAVVVLGACAAGACHLLLRIRTLLTTATQQELKRARLGRYFSPAVAERLQDLDRSETSPELREVTLLFADIRDFT
SLSERLRPEQVVTLLNEYYGRMVEVVFRHGGTLDKFIGDALMVYFGAPIADPAHARRGVQCALDMVQELETVNALRSARG
EPCLRIGVGVHTGPAVLGNIGSATRRLEYTAIGDTVNLASRIESLTKTRDVPILASRATREQAGDTFLWNEMAPASVPGK
SQPVAIFTPRNRTPAQQAGAPAAA
>O69199 4.6.1.1~~~cyaB~~~Adenylate cyclase CyaB~~~
MSSQHFQGRFEVEFKYRLSDVDAFTCALAALNPEVMLEDNQEQDSYFDTPEHSLAAEGKSLVIRTMQPSGIQLWIVKGPE
ADRCEAVNITDADKAASMLRTLGYRQVLAISKRRSIYFVGPFHVTRDHLEGIGDFAELAIMTDDEALLPDYRQQLQDLAT
RLGLSSAQLETRSYRTLCEQSLTLNSEKVPS
>P40138 4.6.1.1~~~cyaB~~~Adenylate cyclase 2~~~
MLQGHHRVMVVDDSPLACDFVKEGLEALGLGYEVMCFQDPYEALEQVGKVQPAIVLSDLDMPGIDGLELCWRLKESPSRQ
VPVIILTANDSEAERVKGLRAGADDYVNKSASMAELSARIESVMRRTSETERMRKLFARYTSDAVVEEILKSPDTVVLTG
EKPEVTVLFADIRNFTGLAESLPPEQVVGVLNQVLGRLSDAVLTCGGTLDKFLGDGLMAVWGAPVHRTDDALRALQAAKM
MMTAMVELRQAAQAEWAANERLGRPLVLELGIGINSGLAVAGNIGGSMRTEYTCIGDAVNVAARLCALAGPGEILAGERT
RELVSHREMPFEDLPPVRLKGKQQPVPLYRVL
>P0A3I5 2.3.1.-~~~cyaC~~~Protein-lysine palmitoyltransferase CyaC~~~COG2994
MLPSAQAPSLLNPTDDFAALGNIAWLWMNSPMHRDWPVHLLARNTLAPIQLGQYILLRCNDVPVAYCSWALMDADTELSY
VMAPSSLGGNAWNCGDRLWIIDWIAPFSRDDNRALRRALAERHPDSVGRSLRVRRGGDTARVKEYRGRALDAAAARAQLD
RYHAELIAGLRASNGGYAPRGRGTA
>E3JD18 3.5.2.-~~~~~~Cyclic amide hydrolase~~~
MPNPEASLSPRSARYPKASMHVVPMLAPNDTAAFRALFASGAVDPASVVALIAKSEGSGLHNDHARVFADVSLRTALAEA
RGCPVEDLADSVTVAVSGGSPGVISPHVTVVTQEWVADLPAGLPGVGLVVGRGHTEPILPEDIGRTAQVDKVADAVAAAM
LDAGVTDPDDVHLVMVKGPALSSRAVADALSRGKTVVTGDYGIGPMGSMCWSNDASALGVAVALGEVKRDLVADDRIRSD
WDLFSAVAATSSGGEKRGGEVLLLANSAQSASELRIGHGITRDMADTEGIKTAIRTAGVDFDCCLSPAQQAQVVQVFGKF
VLPGSDVLRGQHITALDDHEAHHVAKAVGGALVVSITGQPMSFISGGERNSHMGPPGGNPVAAVVRRLPA
>B4E5Z6 ~~~cyaY~~~Iron-sulfur cluster assembly protein CyaY~~~COG1965
MSDTEYLARAEAVLAAVERTVDVANDGDHDIDLERNGSVLTLTFENGSKIIVNLQPPMKEVWIAAKAGGFHYRFIDGEWR
DTRTGTEFFSALTDYATQQAGLPITFSA
>Q8XAP0 ~~~cyaY~~~Iron-sulfur cluster assembly protein CyaY~~~COG1965
MNDSEFHRLADQLWLTIEERLDDWDGDSDIDCEINGGVLTITFENGSKIIINRQEPLHQVWLATKQGGYHFDLKGDEWIC
DRSGETFRDLLEQAATQQAGETVSFR
>P27838 ~~~cyaY~~~Iron-sulfur cluster assembly protein CyaY~~~COG1965
MNDSEFHRLADQLWLTIEERLDDWDGDSDIDCEINGGVLTITFENGSKIIINRQEPLHQVWLATKQGGYHFDLKGDEWIC
DRSGETFWDLLEQAATQQAGETVSFR
>A1SR01 ~~~cyaY~~~Iron-sulfur cluster assembly protein CyaY~~~COG1965
MNDSEFIQLADQLYQKIEEKIEESGADVDYDQNGSLLTLEFENHTKLIINRQQPLHQVWLATLENGHHYDYNNGKWIDDR
SGDEFLTFLSAAIFKQSKETVDFTE
>P82291 ~~~~~~Soluble cytochrome b558~~~
MNETEATLPVFTLEQVAEHHSPDDCWMAIHGKVYDLTPYVPNHPGPAGMMLVWCGQESTEAWETKSYGEPHSSLAARLLQ
RYLIGTLEEIT
>Q59297 ~~~petB~~~Cytochrome bc complex cytochrome b subunit~~~
MAENTPKPAAGTAPAKPKPAAPGAAKPAAPKAARPGAAKPAAKPAAPRAAAPSGVYKKPPVDRPDPNPFKDSKRDAVAGW
FQERFYVLNPIIDYLKHKEVPKHALSFWYYFGGLGLFFFVIQILTGLLLLQYYKPTETDAFASFLFIQGEVPFGWLLRQI
HAWSANLMIMMLFIHMFSTFFMKSYRKPRELMWVSGFVLLLLSLGFGFTGYLLPWNELAFFATQVGTEVPKVAPGGAFLV
EILRGGPEVGGETLTRMFSLHVVLLPGLVMLVLAAHLTLVQILGTSAPIGYKEAGLIKGYDKFFPTFLAKDGIGWLIGFA
LLIYLAVMFPWEIGVKANPLSPAPLGIKPEWYFWAQFQLLKDFKFEGGELLAIILFTIGGVVWLLVPFIDRQASEEKKSP
IFTIFGILVLAFLLINTYRVYAEYSMLK
>P83791 ~~~petB~~~Cytochrome b6~~~
MANVYDWFQERLEIQALADDVTSKYVPPHVNIFYCLGGITLTCFLIQFATGFAMTFYYKPTVTEAYASVQYIMNEVSFGW
LIRSIHRWSASMMVLMMILHVFRVYLTGGFKKPRELTWISGVILAVITVSFGVTGYSLPWDQVGYWAVKIVSGVPEAIPV
VGVLISDLLRGGSSVGQATLTRYYSAHTFVLPWLIAVFMLLHFLMIRKQGISGPL
>P0A384 ~~~petB~~~Cytochrome b6~~~COG1290
MANVYDWFEERLEIQAIAEDVTSKYVPPHVNIFYCLGGITLVCFLIQFATGFAMTFYYKPTVAEAYSSVQYIMNEVNFGW
LIRSIHRWSASMMVLMMILHVFRVYLTGGFKKPRELTWVSGVILAVITVSFGVTGYSLPWDQVGYWAVKIVSGVPEAIPV
VGVLISDLLRGGSSVGQATLTRYYSAHTFVLPWLIAVFMLFHFLMIRKQGISGPL
>P28056 ~~~petB~~~Cytochrome b6~~~COG1290
MFTKEVTDSKLYKWFNERLEIQAISDDISSKYVPPHVNIFYCLGGITLTCFIIQFATGFAMTFYYKPTVAEAFTSVQYIM
NEVNFGWLIRSIHRWSASMMVLMMILHIFRVYLTGGFKRPRELTWITGVIMATITVSFGVTGYSLPWDQVGYWAVKIVSG
VPAAIPVVGDQMVELLRGGASVGQATLTRFYSLHTFVLPWLIAVFMLAHFLMIRKQGISGPL
>Q57038 ~~~petB~~~Cytochrome b6~~~COG1290
MFSKEVTESKVFQWFNDRLEVQAISDDIASKYVPPHVNIFYCLGGLTLTCFLIQFATGFAMTFYYKPTVTEAFASVQYIM
NEVNFGWLIRSIHRWSASMMVLMMILHVFRVYLTGGFKKPRELTWVVGVMLAVTTVTFGVTGYSLPWDQVGYWAVKIVSG
VPAAIPVVGDQLVTLMRGSESVGQATLTRFYSLHTFVLPWAIAVLLLLHFLMIRKQGISGPL
>P51131 ~~~fbcH~~~Cytochrome b/c1~~~COG1290
MSGPSDYQPSNPALQWIERRLPILGLMHSSFVAYPTPRNLNYWWTFGAILSFMLGMQILTGVILAMHYTPHADLAFKSVE
LIVRDVNYGWLLRNMHACGASMFFFAVYVHMLRGLYYGSYKEPREVLWILGVIIYLLMMATGFMGYVLPWGQMSFWGATV
ITNLFSAIPYFGESIVTLLWGGYSVGNPTLNRFFSLHYLLPFLIAGVVVLHVWALHVAGQNNPEGVEPKSEKDTVPFTPH
ATIKDMFGVACFLLLYAWFIFYMPNYLGDADNYIPANPGVTPPHIVPEWYYLPFYAILRSIPNKLAGVIGMFSAIIILCF
LPWLDAAKTRSSKYRPLAKQFFWIFVAVCILLGYLGAQPPEGIYVIAGRVLTVCYFAYFLIVLPLLSRIETPRPVPNSIS
EAILAKGGKAVASVAIALVAAGALFLGSLQDARANEGSDKPPGNKWSFAGPFGKFDRGALQRGLKVYKEVCASCHGLSYI
AFRNLAEAGGPSYSVAQVAAFASDYKIKDGPNDAGDMFERPGRPADYFPSPFPNEQAARAANGGAAPPDLSLITKARSYG
RGFPWFIFDFFTQYQEQGPDYVSAVLQGFEEKVPEGVTIPEGSYYNKYFPGHAIKMPKPLSDGQVTYDDGSPATVAQYSK
DVTTFLMWTAEPHMEARKRLGFQVFVFLIIFAGLMYFTKKKVWADSH
>P0AAM1 ~~~hyaC~~~Probable Ni/Fe-hydrogenase 1 B-type cytochrome subunit~~~COG1969
MQQKSDNVVSHYVFEAPVRIWHWLTVLCMAVLMVTGYFIGKPLPSVSGEATYLFYMGYIRLIHFSAGMVFTVVLLMRIYW
AFVGNRYSRELFIVPVWRKSWWQGVWYEIRWYLFLAKRPSADIGHNPIAQAAMFGYFLMSVFMIITGFALYSEHSQYAIF
APFRYVVEFFYWTGGNSMDIHSWHRLGMWLIGAFVIGHVYMALREDIMSDDTVISTMVNGYRSHKFGKISNKERS
>P16145 ~~~hupC~~~Probable Ni/Fe-hydrogenase B-type cytochrome subunit~~~
MKGVSDERINAPVRGPDEIFEASRLTGDATREDLESIRRRTSVYVYEAPVRVWHWVNALAITILVVTGYFIASPLPSMQI
GEATDQFVMGYIRFAHFAAGVVMSVAFFGRIYWAFVGNRHAWQMFYIPIFNKRYWKEFVFELRWYFFLEEEPKKYIGHNP
LAHAAMFTFITLGITFMMITGWALYAEGAGQGGVTDSLFGWVLGYVQNSQRLHTLHHLGMWAIVIFAIIHIYAAVREDVM
SRQSMVSTMISGHRTFKDDRIE
>P31875 ~~~hydC~~~Quinone-reactive Ni/Fe-hydrogenase B-type cytochrome subunit~~~COG1969
MENRANTEGTFERELEFTALTRWFHWIRAIAIFVLIVTGFYIAYPFLTPIKNSEPTNFMYALARSWHQIFGFALIAVTIF
RVYLFIFDKGCRVERASFWDLINPLTWFRQLRNYMLLGPHPHLKGVYNPVQLAAYMGLMVLILLISVTGIILYYNVYHDG
LGAILFAIFKPLEVMFGGLANVRAIHHITTWAFVIFIPVHIYMATWNSARYPNGGIDSIFSGFRYHKKHY
>Q02761 ~~~petB~~~Cytochrome b~~~
MSGIPHDHYEPRTGIEKWLHSRLPIVALAYDTIMIPTPRNLNWMWIWGVVLAFCLVLQIVTGIVLAMHYTPHVDLAFASV
EHIMRNVNGGFMLRYLHANGASLFFIAVYLHIFRGLYYGSYKAPREVTWIVGMLIYLAMMATAFMGYVLPWGQMSFWGAT
VITGLFGAIPGIGHSIQTWLLGGPAVDNATLNRFFSLHYLLPFVIAALVAIHIWAFHSTGNNNPTGVEVRRTSKAEAQKD
TVPFWPYFIIKDVFALAVVLLVFFAIVGFMPNYLGHPDNYIEANPLSTPAHIVPEWYFLPFYAILRAFTADVWVVQIANF
ISFGIIDAKFFGVLAMFGAILVMALVPWLDTSPVRSGRYRPMFKIYFWLLAADFVILTWVGAQQTTFPYDWISLIASAYW
FAYFLVILPILGAIEKPVAPPATIEEDFNAHYSPATGGTKTVVAE
>P05418 ~~~petB~~~Cytochrome b~~~
MAGIPHDHYEPKTGFERWLHRRLPIVSLVYDTLMIPTPKNLNWWWIWGIVLAFCLVLQIATGIVLVMHYTPHVDLAFASV
EHIMRDVNGGYMLRYLHANGASLFFLAVYIHIFRGLYYGSYKAPREVTWIVGMLIYLMMMGTAFMGYVLPWGQMSFWGAT
VITGLFGAIPGVGEAIQTWLLGGPAVDNPTLNRFFSLHYLLPFVIAALVVVHIWAFHTTGNNNPTGVEVRRGSKEEAKKD
TLPFWPYFVIKDLFALAVVLVVFFAIVGFMPNYLGHPDNYIEANPLVTPAHIVPEWYFLPFYAILRAFTADVWVVMLVNW
LSFGIIDAKFFGVIAMFGAILVMALVPWLDTSRVRSGQYRPLFKWWFWLLAVDFVVLMWVGAMPAEGIYPYIALAGSAYW
FAYFLIILPLLGIIEKPDAMPQTIEEDFNAHYGPETHPAE
>P0CY47 ~~~petB~~~Cytochrome b~~~
MSGIPHDHYEPKTGIEKWLHDRLPIVGLVYDTIMIPTPKNLNWWWIWGIVLAFTLVLQIVTGIVLAIDYTPHVDLAFASV
EHIMRDVNGGWAMRYIHANGASLFFLAVYIHIFRGLYYGSYKAPREITWIVGMVIYLLMMGTAFMGYVLPWGQMSFWGAT
VITGLFGAIPGIGPSIQAWLLGGPAVDNATLNRFFSLHYLLPFVIAALVAIHIWAFHTTGNNNPTGVEVRRTSKADAEKD
TLPFWPYFVIKDLFALALVLLGFFAVVAYMPNYLGHPDNYIQANPLSTPAHIVPEWYFLPFYAILRAFAADVWVVILVDG
LTFGIVDAKFFGVIAMFGAIAVMALAPWLDTSKVRSGAYRPKFRMWFWFLVLDFVVLTWVGAMPTEYPYDWISLIASTYW
FAYFLVILPLLGATEKPEPIPASIEEDFNSHYGNPAE
>D5ANZ3 ~~~petB~~~Cytochrome b~~~COG1290
MSGIPHDHYEPKTGIEKWLHDRLPIVGLVYDTIMIPTPKNLNWWWIWGIVLAFTLVLQIVTGIVLAMHYTPHVDLAFASV
EHIMRDVNGGWAMRYIHANGASLFFLAVYIHIFRGLYYGSYKAPREITWIVGMVIYLLMMGTAFMGYVLPWGQMSFWGAT
VITGLFGAIPGIGPSIQAWLLGGPAVDNATLNRFFSLHYLLPFVIAALVAIHIWAFHTTGNNNPTGVEVRRTSKADAEKD
TLPFWPYFVIKDLFALALVLLGFFAVVAYMPNYLGHPDNYVQANPLSTPAHIVPEWYFLPFYAILRAFAADVWVVILVDG
LTFGIVDAKFFGVIAMFGAIAVMALAPWLDTSKVRSGAYRPKFRMWFWFLVLDFVVLTWVGAMPTEYPYDWISLIASTYW
FAYFLVILPLLGATEKPEPIPASIEEDFNSHYGNPAE
>P23134 ~~~petB~~~Cytochrome b~~~
MYTPPRWNNKALKWFDERLPVLTVAHKELVVYPAPRNLNYFWNFGSLAGIAMIIMIATGIFLAMSYTAHVDHAFDSVERI
MRDVNYGWLMRYMHANGASMFFIVVYVHMFRGLYYGSYKPPREVLWWLGLVILLLMMATAFMGYVLPWGQMSFWGATVIT
NLFSAIPVVGDDIVTLLWGGFSVDNPTLNRFFSLHYLFPMLLFAVVFLHMWALHVKKSNNPLGIDAKGPFDTIPFHPYYT
VKDAFGLGIFLMVFCFFVFFAPNFFGEPDNYIPANPMVTPTHIVPEWYFLPFYAILRAVPDKLGGVLAMFGAILILFVLP
WLDTSKVRSATFRPVFKGFFWVFLADCLLLGYLGAMPAEEPYVTITQLATIYYFLHFLVITPLVGWFEKPKPLPVSISSP
VTTQA
>Q9AJE4 5.5.1.15~~~cyc1~~~Terpentedienyl-diphosphate synthase~~~
MKDRAADPVTKFSPSPYETGQFLRISERADVGTPQIDYLLATQRPDGLWGSVGFELVPTLGAVAGLSSRPEYADRAGVTD
AVARACEKLWELALGEGGLPKLPDTVASEIIVPSLIDLLSEVLQRHRPAVGGKAGQEQEFPSPPGANAELWRQLSDRIAR
GQAIPKTAWHTLEAFHPLPKQFAATVTPAADGAVTCSPSSTAAWLSAVGTDAGASTRAYLDEAQSRYGGAIPMGSSMPYF
EVLWVLNLVLKYFPDVPIPREIIEEIAAGFSDSGIGGGPGLPPDGDDTAYANLAGDKLGAPTHPEILMKFWAEDHFVSYP
GEQTPSETVNAHALEYLNHLRMRRGITEFGAVEDACAEWVISQQTEDGCWYDKWNVSPYYSTAACVEALLDARKQDEPQL
DSLRRAREWLLRHQTDSGGWGMAEPSPEETAYAVLALDLFASRGGEGAEECAAAISRAKEFFTDESRENPPLWMGKDLYT
PFRIVDVTVMCGRAVVGRY
>Q9K499 4.2.3.37~~~cyc1~~~Epi-isozizaene synthase~~~
MHAFPHGTTATPTAIAVPPSLRLPVIEAAFPRQLHPYWPKLQETTRTWLLEKRLMPADKVEEYADGLCYTDLMAGYYLGA
PDEVLQAIADYSAWFFVWDDRHDRDIVHGRAGAWRRLRGLLHTALDSPGDHLHHEDTLVAGFADSVRRLYAFLPATWNAR
FARHFHTVIEAYDREFHNRTRGIVPGVEEYLELRRLTFAHWIWTDLLEPSSGCELPDAVRKHPAYRRAALLSQEFAAWYN
DLCSLPKEIAGDEVHNLGISLITHHSLTLEEAIGEVRRRVEECITEFLAVERDALRFADELADGTVRGKELSGAVRANVG
NMRNWFSSVYWFHHESGRYMVDSWDDRSTPPYVNNEAAGEK
>P00086 ~~~~~~Cytochrome c2 iso-1~~~
ADAPTAFNQCKACHSIEAGKNGVGPSLSGAYGRKVGLAPNYKYSAAHLASGMTIDEAMLTNYLANPKATIPGNKMGASFG
GLKKPEDVKAVIEYLKTVK
>P00087 ~~~~~~Cytochrome c2 iso-1~~~
ADAPPPAFNQCKACHSIDAGKNGVGPSLSGAYGRKVGLAPNYKYSPAHLASGMTIDDAMLTKYLANPKETIPGNKMGAAF
GGLKNPADVAAVIAYLKTVK
>P81153 ~~~~~~Cytochrome c2 iso-1~~~
QDGDPVKGEAVFKKCMACHRIGPDAKNLVGPVLTGVVGRQAGVAPGFSYSALNHAAGEAGLHWTAENIMAYLPDPNAFLR
KFVTDAGNPEAAKGSTKMVFKLPNEQERKDVVAYLKTFSN
>P00090 ~~~cycA~~~Cytochrome c2~~~
QDAAKGEAVFKQCMTCHRADKNMVGPALGGVVGRKAGTAAGFTYSPLNHNSGEAGLVWTQENIIAYLPDPNAYLKKFLTD
KGQADKATGSTKMTFKLANDQQRKDVAAYLATLK
>P86322 ~~~~~~Cytochrome c2~~~
QDVEAGAVSFRKCAPCHAVGEGAANKVGPVLNGLPGRKSGTIAGFNYSDANKNSGITWDKATFKTYITDPRAKIPGTKMV
FAGIKNDKEQDDLWAYLSQFGPDGKKK
>P00089 ~~~~~~Cytochrome c2 iso-2~~~
ADAPPAFGMCKACHSVEAGKNGVGPSLAGVYGRKAGTLAGFKFSDPHAKSGLTWDEPTLTKYLADPKGVIPGNKMVFAGL
KNPADVAAVIAYLKSL
>P00088 ~~~~~~Cytochrome c2 iso-2~~~
ADAPAGFTLCKACHSVEAGKNGVGPSLAGVYGRKAGTISGFKFSDPHIKSGLTWDEPTLTKYLADPKTVIPGNKMVFAGL
KNPDDVKAVIEYLKTLK
>P81154 ~~~~~~Cytochrome c2 iso-2~~~
EDGDPAKGEAVFKKCMACHRVGPDAKNLVGPALTGVIDRQAGTAPGFNYSAINHAAGEAGLHWTPENIIAYLPDPNAFLR
KFLADAGHAEQAKGSTKMVFKLPDEQERKDVVAYLKQFSPQ
>P00091 ~~~cycA~~~Cytochrome c2~~~COG3474
MVKKLLTILSIAATAGSLSIGTASAQDAKAGEAVFKQCMTCHRADKNMVGPALGGVVGRKAGTAAGFTYSPLNHNSGEAG
LVWTADNIINYLNDPNAFLKKFLTDKGKADQAVGVTKMTFKLANEQQRKDVVAYLATLK
>P86323 ~~~~~~Cytochrome c2~~~
QQVAAGAVSFRKCTPCHNIGEGATNKVGPVLDGLEGRHSGSIPGFNYSEANKKSGLTWDKATFKSYIADPRAKIPGTKMV
FAGIKNEKEQEDLWAFLTQYGPDGKKK
>P86317 ~~~~~~Cytochrome c2~~~
QDAAKGEAVFKQCMTCHRADKNMVGPALAGVVGRKAGTAPGFSYSPLNHHSGEAGLVWTQENIITYLADPNAFLKKFLTD
KGHADQAVGATKMTFKLANEQQRKDVVAYLATLK
>P86324 ~~~~~~Cytochrome c2~~~
QEAPKAFNQCQACHKVEAGEDGVGPSLFGLFGHKLGQAPGFKYSEAHLKFAQQTVDEPFLTKYLADPKASLPGNKMVFAG
LKNPDDVKAVLAYLKTIK
>P86318 ~~~~~~Cytochrome c2~~~
QDAAKGEALFKQCQTCHRADKNMVGPALAGVVGRKAGTAPGFSYSPLNHAAGEAGLVWSQENVVEYLADPNAFLKKFLTD
KGQADKATGSTKMTFKLANEQQRKDVAAYLATLK
>P86321 ~~~~~~Cytochrome c2~~~
QXGDLVAAGKKVYRQCHACHSVGEGAKNRVGPEQNNLFGRVAGSLPDFRYSKAMKSAGENGLVWTDKSLHEYLKNPRAYV
PKTKMIFAGLKKPEDIDAVVAYLKTFDTDGMPDPDASTYTPPNG
>P00081 ~~~~~~Cytochrome c2~~~
EGDVAKGEAAFKRCSACHAIGEGAKNKVGPQLNGIIGRTAGGDPDYNYSNAMKKAGGEGLVWTPQELRDFLSAPKKKIPG
NKMALAGISKPEELDNLIAYLIFSASSKPAZ
>P00083 ~~~cycA~~~Cytochrome c2~~~
MRKLVFGLFVLAASVAPAAAQDAASGEQVFKQCLVCHSIGPGAKNKVGPVLNGLFGRHSGTIEGFAYSDANKNSGITWTE
EVFREYIRDPKAKIPGTKMIFAGVKDEQKVSDLIAYIKQFNADGSKK
>Q3J164 ~~~cycA~~~Cytochrome c2~~~COG3474
MKFQVKALAAIAAFAALPALAQEGDPEAGAKAFNQCQTCHVIVDDSGTTIAGRNAKTGPNLYGVVGRTAGTQADFKGYGE
GMKEAGAKGLAWDEEHFVQYVQDPTKFLKEYTGDAKAKGKMTFKLKKEADAHNIWAYLQQVAVRP
>P0C0X8 ~~~cycA~~~Cytochrome c2~~~
QEGDPEAGAKAFNQCQTCHVIVDDSGTTIAGRNAKTGPNLYGVVGRTAGTQADFKGYGEGMKEAGAKGLAWDEEHFVQYV
QDPTKFLKEYTGDAKAKGKMTFKLKKEADAHNIWAYLQQVAVRP
>P86319 ~~~~~~Cytochrome c2~~~
QDAPTGDAAAGAKVFNKCQTCHMVVAPDGTVLAGKAGKTGNPLYGLDGRAPASYPDFAYGDGIKELGAAGEVWNEADFLQ
YVADPTKFLKTKTGDTKAKGKMTFKLPNEKEAHDVWAFLNSLAPAPAAAEAAPAADAAAPAAADAAAPAEPAAEGAAT
>Q9AJE3 4.2.3.36~~~cyc2~~~Terpentetriene synthase~~~
MPDAIEFEHEGRRNPNSAEAESAYSSIIAALDLQESDYAVISGHSRIVGAAALVYPDADAETLLAASLWTACLIVNDDRW
DYVQEDGGRLAPGEWFDGVTEVVDTWRTAGPRLPDPFFELVRTTMSRLDAALGAEAADEIGHEIKRAITAMKWEGVWNEY
TKKTSLATYLSFRRGYCTMDVQVVLDKWINGGRSFAALRDDPVRRAIDDVVVRFGCLSNDYYSWGREKKAVDKSNAVRIL
MDHAGYDESTALAHVRDDCVQAITDLDCIEESIKRSGHLGSHAQELLDYLACHRPLIYAAATWPTETNRYR
>P00093 ~~~~~~Cytochrome c2~~~
AGDAAVGEKIAKAKCTACHDLNKGGPIKVGPPLFGVFGRTTGTFAGYSYSPGYTVMGQKGHTWDDNALKAYLLDPKGYVQ
AKSGDPKANSKMIFRLEKDDDVANVIAYLHTMK
>P00084 ~~~~~~Cytochrome c2~~~
AGDPDAGQKVFLKCAACHKIGPGAKNGVGPSLNGVANRKAGQAEGFAYSDANKNSGLTWDEATFKEYITAPQKKVPGTKM
TFPGLPNEADRDNIWAYLSQFKADGSK
>P86320 ~~~~~~Cytochrome c2~~~
VDVSGDAAAGEKAFRQCITCHVVVDDSGETLAGRNAKVGPNLYKVPGRHAGQIEGFRYSDSMSQAGQNGLVWVEEEFVKY
VQDPTGYLREYLGDSKARGAMTHKVRKEDEAVDIYAYLASLGVHEE
>P00094 ~~~cycA~~~Cytochrome c2~~~COG3474
MKISLTAATVAALVLAAPAFAGDAAKGEKEFNKCKTCHSIIAPDGTEIVKGAKTGPNLYGVVGRTAGTYPEFKYKDSIVA
LGASGFAWTEEDIATYVKDPGAFLKEKLDDKKAKTGMAFKLAKGGEDVAAYLASVVK
>P00080 ~~~~~~Cytochrome c2~~~
GSAPPGDPVEGKHLFHTICILCHTDIKGRNKVGPSLYGVVGRHSGIEPGYNYSEANIKSGIVWTPDVLFKYIEHPQKIVP
GTKMGYPGQPDPQKRADIIAYLETLK
>P0C189 ~~~cycA~~~Cytochrome c2~~~
EGDAAAGEKVSKKCLACHTFDQGGANKVGPNLFGVFENTAAHKDNYAYSESYTEMKAKGLTWTEANLAAYVKNPKAFVLE
KSGDPKAKSKMTFKLTKDDEIENVIAYLKTLK
>P00098 ~~~~~~Cytochrome c2~~~
ADESALAQTKGCLACHNPEKKVVGPAYGWVAKKYAGQAGAEAKLVAKVMAGGQGVWAKQLGAEIPMPANNVTKEEATRLV
KWVLSLKQIDYK
>P00082 ~~~~~~Cytochrome c2~~~COG3474
MKAIKIAMVGAALVWSASAYAAGDPVKGEQVFKQCKICHQVGPTAKPGVGPVQNNVVGSKAGSRPGFNYSDAMKNSGLTW
DEATLDKYLENPKAVVPGTKMVFVGLKNPQDRADVIAFLATQHGQ
>P00097 ~~~~~~Cytochrome c2~~~
ATPAELATKAGCAVCHQPTAKGLGPSYQEIAKKYKGQAGAPALMAERVRKGSVGIFGKLPMTPTPPARISDADLKLVIDW
ILKTP
>Q9X839 ~~~cyc2~~~Germacradienol/geosmin synthase~~~
MTQQPFQLPHFYLPHPARLNPHLDEARAHSTTWAREMGMLEGSGVWEQSDLEAHDYGLLCAYTHPDCDGPALSLITDWYV
WVFFFDDHFLEKYKRSQDRLAGKAHLDRLPLFMPLDDAAGMPEPRNPVEAGLADLWTRTVPAMSADWRRRFAVATEHLLN
ESMWELSNINEGRVANPVEYIEMRRKVGGAPWSAGLVEYATAEVPAAVAGTRPLRVLMETFSDAVHLRNDLFSYQREVED
EGELSNGVLVLETFFGCTTQEAADLVNDVLTSRLHQFEHTAFTEVPAVALEKGLTPLEVAAVGAYTKGLQDWQSGGHEWH
MRSSRYMNKGERPLAGWQALTGPGTSAADVGALLADAVAQRARSYTYVPFQKVGPSVIPDIRMPYPLELSPALDGARRHL
SEWCREMGILSEGVWDEDKLESCDLPLCAAGLDPDATQDQLDLASGWLAFGTYGDDYYPLVYGHRRDLAAARLTTTRLSD
CMPLDGEPVPPPGNAMERSLIDLWVRTTAGMTPEERRPLKKAVDDMTEAWLWELSNQIQNRVPDPVDYLEMRRATFGSDL
TLGLCRAGHGPAVPPEVYRSGPVRSLENAAIDYACLLNDVFSYQKEIEYEGEIHNAVLVVQNFFGVDYPAALGVVQDLMN
QRMRQFEHVVAHELPVVYDDFQLSEEARTVMRGYVTDLQNWMAGILNWHRNVPRYKAEYLAGRTHGFLPDRIPAPPVPRS
SPALTH
>P00136 ~~~~~~Cytochrome c3, 13 kDa~~~
ADAPGDDYVISAPEGMKAKPKGDKPGALQKTVPFPHTKHATVECVQCHHTLEADGGAVKKCTTSGCHDSLEFRDKANAKD
IKLVENAFHTQCIDCHKALKKDKKPTGPTACGKCHTTN
>P38554 ~~~~~~Cytochrome c3, 26 kDa~~~
ETFEIPESVTMSPKQFEGYTPKKGDVTFNHASHMDIACQQCHHTVPDTYTIESCMTEGCHDNIKERTEISSVYRTFHTTK
DSEKSCVGCHRELKRQGPSDAPLACNSCHVQ
>Q727P6 ~~~~~~Cytochrome c3~~~
MRYLVISLFAVSLLMAGSALVGNAADAAKAPKKAIELKHGTSKRMHVMFNHTTHKDIACEQCHHDSPAPDKPYASCTDND
CHATPGPRERDTMSMFVAYHAKDTDRSCYGCHKKMAAQHPEFTGCRPCHMSQQARKEAAASEKK
>P94690 ~~~~~~Acidic cytochrome c3~~~
MFKHTLIALTLLAAATLFSLPAFSQEDMTHVPTDAFGKLERPAAVFNHDEHNEKAGIESCNACHHVWVNGVLAEDEDSVG
TPCSDCHALEQDGDTPGLQDAYHQQCWGCHEKQAKGPVMCGECHVKN
>P94691 ~~~~~~Basic cytochrome c3~~~
MKKLFSMLVAAALVGTMAMAAQAVPQVPADVVIDHLSNPNAKLEYKVKFSHKAHASLGTDAAACQKCHHKWDGKSEIGGC
ATEGCHADTTSFKATEKDPKFLMTAFHSKSPMSCQGCHKEMKTAKKTTGPTACAQCHNQK
>P00137 ~~~cyd~~~Cytochrome c3~~~
ADVVTYENKKGNVTFDHKAHAEKLGCDACHEGTPAKIAIDKKSAHKDACKTCHKSNNGPTKCGGCHIK
>P00134 ~~~~~~Cytochrome c3~~~
VDAPADMVIKAPAGAKVTKAPVAFSHKGHASMDCKTCHHKWDGAGAIQPCQASGCHANTESKKGDDSFYMAFHERKSEKS
CVGCHKSMKKGPTKCTECHPKN
>P00131 ~~~~~~Cytochrome c3~~~
MRKLFFCGVLALAVAFALPVVAAPKAPADGLKMEATKQPVVFNHSTHKSVKCGDCHHPVNGKEDYRKCGTAGCHDSMDKK
DKSAKGYYHVMHDKNTKFKSCVGCHVEVAGADAAKKKDLTGCKKSKCHE
>P00132 ~~~~~~Cytochrome c3~~~
MKKMFLTGVLALAVAIAMPALAAAPKAPADGLKMDKTKQPVVFNHSTHKAVKCGDCHHPVNGKEDYQKCATAGCHDNMDK
KDKSAKGYYHAMHDKGTKFKSCVGCHLETAGADAAKKKELTGCKGSKCHS
>P00135 ~~~~~~Cytochrome c3~~~
VDAPADMVLKAPAGAKMTKAPVDFSHKGHAALDCTKCHHKWDGKAEVKKCSAEGCHVBTSKKGKKSTPKFYSAFHSKSDI
SCVGCHKALKKATGPTKCGDCHPKKK
>P00133 ~~~~~~Cytochrome c3~~~
VDVPADGAKIDFIAGGEKNLTVVFNHSTHKDVKCDDCHHDPGDKQYAGCTTDGCHNILDKADKSVNSWYKVVHDAKGGAK
PTCISCHKDKAGDDKELKKKLTGCKGSACHPS
>O33731 ~~~cctA~~~Tetraheme cytochrome c-type~~~COG1053
MSNKLLSALFAAGFAVMMMSSASFAADETLAEFHVEMGGCENCHADGEPSKDGAYEFEQCQSCHGSLAEMDDNHKPHDGL
LMCADCHAPHEAKVGEKPTCDTCHDDGRTAK
>P43302 ~~~cycA~~~Cytochrome c4~~~
MNKALVTLLLTLGITGLAHAAGDAAAGQGKAAVCGACHGPDGNSAAPNFPKLAGQGERYLLKQMQDIKAGTKPGAPEGSG
RKVLEMTGMLDNFSDQDLADLAAYFTSQKPTVGAADPQLVEAGETLYRGGKLADGMPACTGCHSPNGEGNTPAAYPRLSG
QHAQYVAKQLTDFREGARTNDGDNMIMRSIAAKLSNKDIAAISSYIQGLH
>P00106 ~~~cc4~~~Cytochrome c4~~~
MNKLLVSLLLTLGLTGLAHAAGDAAAGQAKAAVCGACHGADGNSPAPNFPKLAGQGERYLLKQMHDIKDGKRTVLEMTGL
LTNLSDQDLADIAAYFASQKMSVGMADPNLVAQGEALFRGGKIAEGMPACTGCHSPSGVGIATAGFPHLGGQHATYVAKQ
LTDFREGTRTNDGDTKIMQSIAAKLSNKDIAAISSYIQGLH
>Q52369 ~~~cc4~~~Cytochrome c4~~~
MNKVLVSLLLTLGITGMAHAAGDAEAGQGKVAVCGACHGVDGNSPAPNFPKLAGQGERYLLKQLQDIKAGSTPGAPEGVG
RKVLEMTGMLDPLSDQDLEDIAAYFSSQKGSVGYADPALAKQGEKLFRGGKLDQGMPACTGCHAPNGVGNDLAGFPKLGG
QHAAYTAKQLTDFREGNRTNDGDTMIMRGVAAKLSNKDIEALSSYIQGLH
>P86052 ~~~~~~Cytochrome c4~~~
TDGHQAAAPQVGDPQAGEAKANGVCLACHGPQGNSLVPIWPKLAGQHPEYIVKQLMDFKQRRANEQMTPMAMPLTDQEVL
DLAAYYATQPKTPGAADPELASKGESLYRWGNPETGVPACSGCHGPAGGAGQSLAKFPRLSAQHADYTKQTLEHFRGALR
ANDPNGMMRGAAARLSDQEIAAVSQYLQGLSQ
>P11732 ~~~~~~Cytochrome c5~~~
GGGARSGDDVVAKYCNACHGTGLLNAPKVGDSAAWKTRADAKGGLDGLLAQSLSGLNAMPPKGTCADCSDDELKAAIGKM
SGL
>P00121 ~~~~~~Cytochrome c5~~~
AASAGGGARSADDIIAKHCNACHGAGVLGAPKIGDTAAWKERADHQGGLDGILAKAISGINAMPPKGTCADCSDDELREA
IQKMSGL
>P0C180 ~~~petJ~~~Cytochrome c6~~~
ADSVNGAKIFSANCASCHAGGKNLGVAQKTLKKADLEKYGAYSAMAIGAQVTNGKNAMPAFKGRLKPEEIXXVAAYVLGK
AEAEWK
>P00116 ~~~petJ~~~Cytochrome c6~~~
ADTVSGAALFKANCAQCHVGGGNLVNRAKTLKKEALEKYNMYSAKAIIAQVTHGKGAMPAFGIRLKAEQIENVAAYVLEQ
ADNGWKK
>P00117 ~~~petJ~~~Cytochrome c6~~~
ADAAAGGKVFNANCAACHASGGGQINGAKTLKKNALTANGKDTVEAIVAQVTNGKGAMPAFKGRLSDDQIQSVALYVLDK
AEKGW
>P00118 ~~~petJ~~~Cytochrome c6~~~
GDVAAGASVFSANCAACHMGGRNVIVANKTLSKSDLAKYLKGFDDDAVAAVAYQVTNGKNAMPGFNGRLSPKQIEDVAAY
VVDQAEKGW
>P00112 ~~~petJ~~~Cytochrome c6~~~
DGASIFSANCASCHMGGKNVVNAAKTLKKEDLVKYGKDSVEAIVTQVTKGMGAMPAFGGRLSAEDIEAVANYVLAQAEKG
W
>P0A3X7 ~~~petJ~~~Cytochrome c6~~~COG2010
MKKIFSLVLLGIALFTFAFSSPALAADSVNGAKIFSANCASCHAGGKNLVQAQKTLKKADLEKYGMYSAEAIIAQVTNGK
NAMPAFKGRLKPEQIEDVAAYVLGKADADWK
>P0A3Y0 ~~~petJ~~~Cytochrome c6~~~
ADLANGAKVFSGNCAACHMGGGNVVMANKTLKKEALEQFGMYSEDAIIYQVQHGKNAMPAFAGRLTDEQIQDVAAYVLDQ
AAKGWAG
>O30881 ~~~petJ~~~Cytochrome c6~~~COG2010
MKKLLAIALTVLATVFAFGTPAFAADAAAGAQVFAANCAACHAGGNNAVMPTKTLKADALKTYLAGYKDGSKSLEEAVAY
QVTNGQGAMPAFGGRLSDADIANVAAYIADQAENNKW
>P00115 ~~~petJ~~~Cytochrome c6~~~COG2010
MKTLLTILALTLVTLTTWLSTPAFAADIADGAKVFSANCAACHMGGGNVVMANKTLKKEALEQFGMNSADAIMYQVQNGK
NAMPAFGGRLSEAQIENVAAYVLDQSSKNWAG
>P07497 ~~~petJ~~~Cytochrome c6~~~COG2010
MKRILGTAIAALVVLLAFIAPAQAADLAHGGQVFSANCAACHLGGRNVVNPAKTLQKADLDQYGMASIEAITTQVTNGKG
AMPAFGSKLSADDIADVASYVLDQSEKGWQG
>P46445 ~~~petJ~~~Cytochrome c6~~~COG2010
MFKLFNQASRIFFGIALPCLIFLGGIFSLGNTALAADLAHGKAIFAGNCAACHNGGLNAINPSKTLKMADLEANGKNSVA
AIVAQITNGNGAMPGFKGRISDSDMEDVAAYVLDQAEKGW
>P00114 ~~~petJ~~~Cytochrome c6~~~
ADLANGAKVFSGNCAACHMGGGNVVMANKTLKKEALEQFGMNSEDAIIYQVQHGKNAMPAFAGRLTDEQIQDVAAYVLDQ
AAKGWAG
>P0A3X9 ~~~petJ~~~Cytochrome c6~~~COG2010
MKKRFISVCAIAIALLVSLTPAALAADLANGAKVFSGNCAACHMGGGNVVMANKTLKKEALEQFGMYSEDAIIYQVQHGK
NAMPAFAGRLTDEQIQDVAAYVLDQAAKGWAG
>P81894 ~~~~~~Cytochrome c7~~~
MKRIIASLALSVFCAGLAFAADELTFKAKNGDVKFPHKKHQQVVGNCKKCHEKGPGKIEGFGKDWAHKTCKGCHEEMKKG
PTKCGDCHKK
>Q9RN68 ~~~~~~Nine-heme cytochrome c~~~COG0484
MRNGTSLLLLAAIALAGAACLTAMGGTAKAAALEPTDSGAPSAIVMFPVGEKPNPKGAAMKPVVFNHLIHEKKIDNCETC
HHTGDPVSCSTCHTVEGKAEGNYITLDRAMHATNIAKRAKGNTPVSCVSCHEQQTKERRECAGCHAIVTPKRDEAWCATC
HNITPSMTPEQMQKGINGTLLPGDNEALAAETVLAQKTVEPVSPMLAPYKVVIDALADKYEPSNFTHRRHLTSLMERIKD
DKLAQAFHNKPEILCATCHHRSPLSLTPPKCGSCHTKEIDKANPGRPNLMAAYHLQCMGCHKGMDVARPRDTDCTTCHKA
APKSAD
>A0A0H2VDI7 ~~~cycA~~~D-serine/D-alanine/glycine transporter~~~COG1113
MVDQVKVVADDQAPAEQSLRRNLTNRHIQLIAIGGAIGTGLFMGSGKTISLAGPSIIFVYMIIGFMLFFVMRAMGELLLS
NLEYKSFSDFASDLLGPWAGYFTGWTYWFCWVVTGMADVVAITAYAQFWFPGLSDWVASLSVIILLLVLNLATVKMFGEM
EFWFAMIKIVAIVSLIVVGLVMVAMHFQSPTGVEASFAHLWNDGGWFPKGLSGFFAGFQIAVFAFVGIELVGTTAAETKD
PEKSLPRAINSIPIRIIMFYVFSLIVIMSVTPWSSVVPEKSPFVELFVLVGLPAAASVINFVVLTSAASSANSGVFSTSR
MLFGLAQEGVAPKAFAKLSKRAVPAKGLTFSCICLLGGVVMLYVNPSVIGAFTMITTVSAILFMFVWTIILCSYLVYRKQ
RPHLHEKSIYKMPLGKLMCWVCMAFFVFVLVLLTLEDDTRQALLVTPLWFIALGLGWLFIGKKRAAELRK
>P0AAE0 ~~~cycA~~~D-serine/D-alanine/glycine transporter~~~COG1113
MVDQVKVVADDQAPAEQSLRRNLTNRHIQLIAIGGAIGTGLFMGSGKTISLAGPSIIFVYMIIGFMLFFVMRAMGELLLS
NLEYKSFSDFASDLLGPWAGYFTGWTYWFCWVVTGMADVVAITAYAQFWFPDLSDWVASLAVIVLLLTLNLATVKMFGEM
EFWFAMIKIVAIVSLIVVGLVMVAMHFQSPTGVEASFAHLWNDGGWFPKGLSGFFAGFQIAVFAFVGIELVGTTAAETKD
PEKSLPRAINSIPIRIIMFYVFALIVIMSVTPWSSVVPEKSPFVELFVLVGLPAAASVINFVVLTSAASSANSGVFSTSR
MLFGLAQEGVAPKAFAKLSKRAVPAKGLTFSCICLLGGVVMLYVNPSVIGAFTMITTVSAILFMFVWTIILCSYLVYRKQ
RPHLHEKSIYKMPLGKLMCWVCMAFFVFVVVLLTLEDDTRQALLVTPLWFIALGLGWLFIGKKRAAELRK
>Q9RQB9 ~~~cycA~~~Cytochrome c''~~~
MKIKTIIAVFGVLFSAHALADVTNAEKLVYKYTNIAHSANPMYEAPSITDGKIFFNRKFKTPSGKEAACASCHTNNPANV
GKNIVTGKEIPPLAPRVNTKRFTDIDKVEDEFTKHCNDILGADCSPSEKANFIAYLLTETKPTK
>Q749K5 ~~~omcB~~~C-type polyheme cytochrome OmcB~~~COG3005
MSRKVTKYSAVLAVSLFAAALAGCGSENKEGTVGTGPGGVATVGDSACVQCHSAVTEALTGESLIAQYQKSSPHNTAGLG
CESCHGGGAQHNGVGPIPFAQPDASRCADCHDGTTAVATNSDTAFAESRHNIQTIRSGATCRRCHTHEGAVLSNIAGYTG
DLATLEDTVNQNKVPLVSSYSQISCATCHEHGGGLRTIKATNGAAGPVVNWDPNNNRTVDQFDLCTSCHNMYSYNGSTLL
TNGVPVNGVATGTVGHHETTWYRIIATTHFDNYSTGPQAGAGASGTNAKVEGYVLRRTGANPCFDCHGHEAKTNTRPGRD
ATIHTDWAKSAHAGGLLTAKYNAVGALTGAAAVNAAMNAYVDDTTAIAWTHYNWDASSRGSCQRCHTATGAANFMSNPAG
YDPTGAGNSFSHLQGWSAANGSKQNELLYCWGCHTNAGTGELRNPGAITENYAGVNSTSTGTTGTAVTISYPDIAGSNVC
MTCHLGREAGENIKAITDADGILGFVNSHYLAAGGQLFGKTGYEYATRSYAKPTFFAHDKIGTAAAPGTGTNGPCAGCHM
TTPNSHSFLPVTKDGTGAVTAITSTACATCHAGAYALTPEALTAEEEEYVASLEALKAALAGKGILFFNAHPYFYRDTNA
NGIGDPGELVSSNAFTNWAGVYGLALWKDVMGAAFNANLLIHDPGGYAHNRFYVKRLIWDSIDFIYDGVLNNDVTAAIDA
QVTATRLDSATATAAKAYLGTTRP
>Q749L1 ~~~omcC~~~C-type polyheme cytochrome OmcC~~~COG3005
MSRKVTKYSAVLAVSLFAAALAGCGSENKEGTVGTGPGGVATVGDTACVQCHSAVVDPLTGESIITQYTRSFHYSKGVGC
EGCHGGGAQHNGVGPLPFPLAGQSEAQIAARCASCHNGVIAPLSSSPNFVNGNHANPFGGEEAKENLCSRCHSHEGAIFG
AQAGFTGDGNILRNAAYQPVYPQDPETFNVMTCATCHQHGGAQRQVFTQISTAGVPNSRRTVAWDPNRNSINDQYDLCTS
CHTVNTMTGTLIGSGNVLQIFTSNAVGSGTKSVTTAPFYHNTRWFRTLPSTHYDFPESKTTASGTTIEGYVIRRNTANPC
FDCHGHEFQTNTRRLAGADRPNTIFLDWGQSAHGGKLLQAKVAAAALASSGAAEVDDVMKAGATDATAPGWTHYNWDDTA
SRGACQRCHTSTGASNFLNNPAGYDRTGAGNSFTHLAGWTSSNKRSDQNELLYCWGCHTKAGTGELRNPGAITEVYPGIN
STSTGTTGLDVTVSYPDIKGSNVCMGCHLGREVGDNIKAITDADGILGFVNSHYLTAGGQLFGTTGYEYATRSYANPAFF
QHDKIGTAAAPGTGTNGPCAGCHMTTPTSHLFLPVTKDGTGAITAITSTACVTCHAGTFALTPEGLTAEEEEYVASLEAL
KAALAGKGILFFNAHPYFYRDTNANGIADPGETVSSNAFTNWAGVYGLALWQDVMGAAFNANLLIHDPGGYAHNRFYSKR
LIWDSIDFIFDGVLNNDVTAAIDAQVTAARLDSATATAAKAYLGATRP
>P14774 ~~~moxG~~~Cytochrome c-L~~~COG2010
MMNRVKIGTALLGLTLAGIALPALAQPQSGPQTGVVFRNTVTGEALDVSQGKEGGRDTPAVKKFLETGENLYIDDKSCLR
NGESLFATSCSGCHGHLAEGKLGPGLNDNYWTYPSNTTDVGLFATIFGGANGMMGPHNENLTPDEMLQTIAWIRHLYTGP
KQDAVWLNDEQKKAYTPYKQGEVIPKDAKGQCKPLDE
>P29899 ~~~moxG~~~Cytochrome c-L~~~
MTKPRILAAFAMTLIIPVAAMAAPQFFNIIDGSPLNFDDAMEEGRDTEAVKHFLETGENVYNEDPEILPEAEELYAGMCS
GCHGHYAEGKIGPGLNDAYWTYPGNETDVGLFSTLYGGATGQMGPMWGSLTLDEMLRTMAWVRHLYTGDPKDASWLTDEQ
KAGFTPFQPKSSGEDQS
>P00138 ~~~~~~Cytochrome c'~~~COG3909
QFAKPEDAVKYRQSALTLMASHFGRMTPVVKGQAPYDAAQIKANVEVLKTLSALPWAAFGPGTEGGDARPEIWSDAASFK
QKQQAFQDNIVKLSAAADAGDLDKLRAAFGDVGASCKACHDAYRKKK
>P00154 ~~~cycA~~~Cytochrome c'~~~COG3909
MKHVLASTAAGLMALGLASSAIAAGLSPEEQIETRQAGYEFMGWNMGKIKANLEGEYNAAQVEAAANVIAAIANSGMGAL
YGPGTDKNVGDVKTRVKPEFFQNMEDVGKIAREFVGAANTLAEVAATGEAEAVKTAFGDVGAACKSCHEKYRAK
>P00148 ~~~cycP~~~Cytochrome c'~~~COG3909
MRRVLLATLMAALPAAAMAADAEHVVEARKGYFSLVALEFGPLAAMAKGEMPYDAAAAKAHASDLVTLTKYDPSDLYAPG
TSADDVKGTAAKAAIWQDADGFQAKGMAFFEAVAALEPAAGAGQKELAAAVGKVGGTCKSCHDDFRVKR
>P00143 ~~~~~~Cytochrome c'~~~
AEPEDAIHYRQSALSVMGWQMGPMGAMAQGDIEYDADEFATRANNLAAVAHLPWEGFTEGTLQGDDHGVETDALADIGDD
WEGFEERQETFKQEAATLAQMVDDGEEFSALRRQVGAVGKSCKGCHDDFRAE
>P00151 ~~~~~~Cytochrome c'~~~
QQSKPEELLKLRQGLMQTLKSQWAPIAGFAAGKADLPADAAQRAENMVLVAKLAPIGWAKGTEALPNSETKAEAFGAKSA
QFMEGWKAMAAESTKLAAAAKAGPDALKAQAAATGKVCKACHEEFKQD
>P00152 ~~~~~~Cytochrome c'~~~
QQSKPEDLLKLRQGLMQTLKSQWVPIAGFAAGKADLPADAAQRAENMAMVAKLAPIGWAKGTEALPNGETKPEAFGSKSA
EFLEGWKALATESTKLAAAAKAGPDALKAQAAATGKVCKACHEEFKQD
>P00145 ~~~~~~Cytochrome c'~~~
ASPEAYVEYRKQALKASGDHMKALSAIVKGQLPLNAEAAKHAEAIAAIMESLPAAFPEGTAGIAKTEAKAVVWSKADEFK
ADAVKSADAAKALAQAATAGDTAQMGKALAALGGTCKGCHETFRE
>P00147 ~~~~~~Cytochrome c'~~~
ADTKEVLEAREAYFKSLGKSMKAMTGVAKSFDAEAAKAEAAALEKILATDVAPLFPAGTSSTDLPGQTEAKAAIWTNMAD
FGAKGKAMNDAGAEVIAAANAGDATAFGAALQKLGGTCKACHDDYREED
>P00149 ~~~cycA~~~Cytochrome c'~~~COG3909
MKLRIATIAGLVVLGSGFAVAQTDVIAQRKAILKQMGEATKPIAAMLKGEAKFDQAVVQKSLAAIADDSKKLPALFPADS
KTGGDTAALPKIWEDKAKFDDLFAKLAAAATAAQGTIKDEASLKANIGGVLGNCKSCHDDFRAKKS
>P00144 ~~~~~~Cytochrome c'~~~COG3909
MKRMMIVAALAALTTTTVAQAADPAAYVEYRKSVLSATSNYMKAIGITLKEDLAVPNQTADHAKAIASIMETLPAAFPEG
TAGIAKTEAKAAIWKDFEAFKVASKKSQDAALELASAAETGDKAAIGAKLQALGGTCKACHKEFKAD
>P00146 ~~~~~~Cytochrome c'~~~
DGMETVKARQDYFKSLGGAMKALSGVAKNYDAEAAKAEAAKLEAILATDIKPLFAPGTSDADFPGESEAKASIWENMEDF
GAKGQAMHEAGMELIAAANTGEASAFGPALKKLGGTCKACHDDYRAEH
>P00153 ~~~~~~Cytochrome c'~~~
EPAKSEDLIKWRQSAYQVLHWNMDRLKANIDSPQYNKDDGIKAANTIAAIANSGMGSLFAAGTETGKGWHPTSVKPAFFT
DGKKVGEVAVAFNKEANELAKVAATGDAAAVKAQFGKVGQTCKACHDDFRRKD
>P00142 ~~~~~~Cytochrome c'~~~
QFQKPGDAIEYRQSAFTLIANHFGRVAAMAQGKAPFDAKVAAENIALVSTLSKLPLTAFGPGTDKGHGTEAKPAVWSDAA
GFKAAADKFAAAVDKLDAAGKTGDFAQIKAAVGETGGACKGCHDKFKEK
>P07173 ~~~pufC~~~Photosynthetic reaction center cytochrome c subunit~~~
MKQLIVNSVATVALASLVAGCFEPPPATTTQTGFRGLSMGEVLHPATVKAKKERDAQYPPALAAVKAEGPPVSQVYKNVK
VLGNLTEAEFLRTMTAITEWVSPQEGCTYCHDENNLASEAKYPYVVARRMLEMTRAINTNWTQHVAQTGVTCYTCHRGTP
LPPYVRYLEPTLPLNNRETPTHVERVETRSGYVVRLAKYTAYSALNYDPFTMFLANDKRQVRVVPQTALPLVGVSRGKER
RPLSDAYATFALMMSISDSLGTNCTFCHNAQTFESWGKKSTPQRAIAWWGIRMVRDLNMNYLAPLNASLPASRLGRQGEA
PQADCRTCHQGVTKPLFGASRLKDYPELGPIKAAAK
>D2Z0P5 ~~~pufC~~~Photosynthetic reaction center cytochrome c subunit~~~
MSPAQQLTLPAVIVVASVMLLGCEGPPPGTEQIGYRGVGMENYYNKRQRALSIQANQPVESLPAADSTGPKASEVYQNVQ
VLKDLSVGEFTRTMVAVTTWVSPKEGCNYCHVPGNWASDDIYTKVVSRRMFELVRAANSDWKAHVAETGVTCYTCHRGNP
VPKYAWVTDPGPKYPSGLKPTGQNYGSKTVAYASLPFDPLTPFLDQANEIRITGNAALAGSNPASLKQAEWTFGLMMNIS
DSLGVGCTFCHNTRAFNDWTQSTPKRTTAWYAIRHVRDINQNYIWPLNDVLPASRKGPYGDPLRVSCMTCHQAVNKPLYG
AQMAKDYPGLYKTAVTQEALAGSAPASEAAPAAATEAAPEAPAQEVPAAEAVPAAAEPGAAEAAGSVEPAPVEEVAPAPA
AQRL
>P81040 ~~~~~~Split-Soret cytochrome c~~~
MNIGRRDLICGLGGLAVGGAMLGLGSVEARAAGQAQPASGRFDQVGGAFGWKPHKLDPKECAQVAYDGYWYKGFGCGFGA
FYSIVGLMGEKYGAPYNQFPFAMLEANKGGISDWGTICGALYGAAATFSLFWGRKEVHPMVNELFRWYEVTKLPIFNPGD
AAQGVKGDLPMSASDSVLCHISVSKWCYENKIEATSKQRSERCGRLTADAAFKAAEIINTKIDQGKDFKSTFPMQASVSS
CGECHMTKGNDANWAKGIMDCTPCHSGTAATQNKFVNHP
>P30960 ~~~cycY~~~Thiol:disulfide interchange protein CycY~~~COG0526
MSEQSTSANPQRRTFLMVLPLIAFIGLALLFWFRLGSGDPSRIPSALIGRPAPQTALPPLEGLQADNVQVPGLDPAAFKG
KVSLVNVWASWCVPCHDEAPLLTELGKDKRFQLVGINYKDAADNARRFLGRYGNPFGRVGVDANGRASIEWGVYGVPETF
VVGREGTIVYKLVGPITPDNLRSVLLPQMEKALK
>Q05389 ~~~cycY~~~Cytochrome c-type cyt cy~~~COG3474
MLVKTHITKIGVTLFAVALFYGFIYMLSNSLFATRPATAVAVGADGKALLPSVDEAAMPAKAPAAAAPAAETAEAAAPAE
PAAPPPPAYVEVDPATITGDAKAGEEKFNKTCKACHKIDGKNAVGPHLNGVIGRATATVEGFKYSTAMKNHVGNWTPERL
DIYLVSPKAEVPGTKMSFVGLPEAADRANVIAYLNTLPR
>P0ABJ9 7.1.1.7~~~cydA~~~Cytochrome bd-I ubiquinol oxidase subunit 1~~~COG1271
MLDIVELSRLQFALTAMYHFLFVPLTLGMAFLLAIMETVYVLSGKQIYKDMTKFWGKLFGINFALGVATGLTMEFQFGTN
WSYYSHYVGDIFGAPLAIEGLMAFFLESTFVGLFFFGWDRLGKVQHMCVTWLVALGSNLSALWILVANGWMQNPIASDFN
FETMRMEMVSFSELVLNPVAQVKFVHTVASGYVTGAMFILGISAWYMLKGRDFAFAKRSFAIAASFGMAAVLSVIVLGDE
SGYEMGDVQKTKLAAIEAEWETQPAPAAFTLFGIPDQEEETNKFAIQIPYALGIIATRSVDTPVIGLKELMVQHEERIRN
GMKAYSLLEQLRSGSTDQAVRDQFNSMKKDLGYGLLLKRYTPNVADATEAQIQQATKDSIPRVAPLYFAFRIMVACGFLL
LAIIALSFWSVIRNRIGEKKWLLRAALYGIPLPWIAVEAGWFVAEYGRQPWAIGEVLPTAVANSSLTAGDLIFSMVLICG
LYTLFLVAELFLMFKFARLGPSSLKTGRYHFEQSSTTTQPAR
>P0ABK2 7.1.1.7~~~cydB~~~Cytochrome bd-I ubiquinol oxidase subunit 2~~~COG1294
MIDYEVLRFIWWLLVGVLLIGFAVTDGFDMGVGMLTRFLGRNDTERRIMINSIAPHWDGNQVWLITAGGALFAAWPMVYA
AAFSGFYVAMILVLASLFFRPVGFDYRSKIEETRWRNMWDWGIFIGSFVPPLVIGVAFGNLLQGVPFNVDEYLRLYYTGN
FFQLLNPFGLLAGVVSVGMIITQGATYLQMRTVGELHLRTRATAQVAALVTLVCFALAGVWVMYGIDGYVVKSTMDHYAA
SNPLNKEVVREAGAWLVNFNNTPILWAIPALGVVLPLLTILTARMDKAAWAFVFSSLTLACIILTAGIAMFPFVMPSSTM
MNASLTMWDATSSQLTLNVMTWVAVVLVPIILLYTAWCYWKMFGRITKEDIERNTHSLY
>P23886 7.4.2.-~~~cydC~~~Glutathione/L-cysteine transport system ATP-binding/permease protein CydC~~~COG4987
MRALLPYLALYKRHKWMLSLGIVLAIVTLLASIGLLTLSGWFLSASAVAGVAGLYSFNYMLPAAGVRGAAITRTAGRYFE
RLVSHDATFRVLQHLRIYTFSKLLPLSPAGLARYRQGELLNRVVADVDTLDHLYLRVISPLVGAFVVIMVVTIGLSFLDF
TLAFTLGGIMLLTLFLMPPLFYRAGKSTGQNLTHLRGQYRQQLTAWLQGQAELTIFGASDRYRTQLENTEIQWLEAQRRQ
SELTALSQAIMLLIGALAVILMLWMASGGVGGNAQPGALIALFVFCALAAFEALAPVTGAFQHLGQVIASAVRISDLTDQ
KPEVTFPDTQTRVADRVSLTLRDVQFTYPEQSQQALKGISLQVNAGEHIAILGRTGCGKSTLLQQLTRAWDPQQGEILLN
DSPIASLNEAALRQTISVVPQRVHLFSATLRDNLLLASPGSSDEALSEILRRVGLEKLLEDAGLNSWLGEGGRQLSGGEL
RRLAIARALLHDAPLVLLDEPTEGLDATTESQILELLAEMMREKTVLMVTHRLRGLSRFQQIIVMDNGQIIEQGTHAELL
ARQGRYYQFKQGL
>P29018 7.4.2.-~~~cydD~~~Glutathione/L-cysteine transport system ATP-binding/permease protein CydD~~~COG4988
MNKSRQKELTRWLKQQSVISQRWLNISRLLGFVSGILIIAQAWFMARILQHMIMENIPREALLLPFTLLVLTFVLRAWVV
WLRERVGYHAGQHIRFAIRRQVLDRLQQAGPAWIQGKPAGSWATLVLEQIDDMHDYYARYLPQMALAVSVPLLIVVAIFP
SNWAAALILLGTAPLIPLFMALVGMGAADANRRNFLALARLSGHFLDRLRGMETLRIFGRGEAEIESIRSASEDFRQRTM
EVLRLAFLSSGILEFFTSLSIALVAVYFGFSYLGELDFGHYDTGVTLAAGFLALILAPEFFQPLRDLGTFYHAKAQAVGA
ADSLKTFMETPLAHPQRGEAELASTDPVTIEAEELFITSPEGKTLAGPLNFTLPAGQRAVLVGRSGSGKSSLLNALSGFL
SYQGSLRINGIELRDLSPESWRKHLSWVGQNPQLPAATLRDNVLLARPDASEQELQAALDNAWVSEFLPLLPQGVDTPVG
DQAARLSVGQAQRVAVARALLNPCSLLLLDEPAASLDAHSEQRVMEALNAASLRQTTLMVTHQLEDLADWDVIWVMQDGR
IIEQGRYAELSVAGGPFATLLAHRQEEI
>Q2YKD6 7.1.1.7~~~cydX~~~Cytochrome bd ubiquinol oxidase subunit X~~~
MWYFSWLLGLPLAAAFAVLNAMWYELMDDRARKRLAADPTAELALEGNKHH
>P56100 7.1.1.7~~~cydX~~~Cytochrome bd-I ubiquinol oxidase subunit X~~~COG4890
MWYFAWILGTLLACSFGVITALALEHVESGKAGQEDI
>Q9KII6 2.8.1.7~~~cyd~~~Probable cysteine desulfurase~~~COG0520
MTRSPCSTTSRWISSMSTSEYRAVDAESDLPISAAELAALASQLYAASIRPGPDSPPQQAPVAPRGSVPDATAATSAGRT
AAGTADVYPGPVPQVGGRDVYLPPPASPAPEAPPQAAPPAPRGSAPDATAATSAGRAAAGTSDVYSSWVPQLGVADIYLG
APTPAGPEAPPQSAPPAPRGQVPDTTAAATAYGADLSAFAVPTGIVSTAPGVQAGTAPPVPVVPRAATAPSWLPEAPSVA
DLGWSDAPAPDAPAGDEHDYHFLTKTDPVPQFRDEHEVFDVAAIRSDFPILKETVNGKPLIWFDNAATTQKPQVVIDRLS
HFYAHENSNIHRAAHELAARATDAYEEARDTVAEFIGAPSSDNIVFVRGTTEAINLVAHAWGAKHLQPGDEIVITHLEHH
ANIVPWQLISQKTGAILKVAPIDDAGNLLLSEFEGLLGPRTKLVAASHVSNALGTVMPVDKIVELGHRYGARVLIDGAQS
IQHIPIDVAELGADFFVFSGHKIYGPTGIGALYGTEEALTETPPWQGGGHMIADVTLERSLYQGPPTKFEAGTGNIADAV
GLTEALRYVQRLGVERIAAYEHALLEYATPRLADIPGVRLIGTAQEKASVLSFVLAGHEPLEVGKALNAEGIAVRAGHHC
AQPALRRLGLEATVRPSFAFYNTFEEIDVFLRAVRRIAEGGANVG
>Q8KUU5 2.8.1.7~~~cyd~~~Cysteine desulfurase~~~COG0520
MTNTVPSVPAVPNLPTQSDPFFNERSLEQLTQTVLQDLQQAGVSEAESAPTPLSVPTPALPTTSALAVPQSPTAIANVPA
PPSSIDERSLAQLAQAVLQDPQLASAIASIFPSVTLPTSASVPRSVPVPPSFLPSLVPTAPPIHDEVGVIPHHQLPVPSQ
PTPAGLQQTASSKSGSGFYFIDEQVETAIAALHSNLTVFPQLTTSSIPTLTGAHSAGAVGFDIHQVRRDFPILQERVNGR
PLVWFDNAATTQKPQVVIDRLSHYYQHENSNIHRAAHELAARSTDAYEAAREQVRHFLNAASTEEVVFVRGTTEAINLVA
KSWGSQNLKEGDEIVITWLEHHANIVPWQQLSAETGARLRVVPVDDYGQVRLDEYQKLLSDRTKIVSFTQVSNALGTITP
AKEIIELAHRYGAKVLLDGAQSVSHLAVDVQALDCDWFVFSGHKVFGPTGIGVLYGKQELLDATLPWQSGGNMIADVTFE
KTVYQPAPARFEAGTGNIADAVGLGAALEYVQKIGLEAIAAYEHELLVHGTALLSQIPGLRLIGTAPHKAAVLSFVLEGF
SPEAIGQALNREGIAVRAGHHCAQPILRRFGLETTVRPSLAFYNTFEELETLAAAIRRIQTGSLAL
>P83793 ~~~petA~~~Cytochrome f~~~
MRNSCKKARRTRPLKATIQALLVAIATMTFFFTSDIALPQSAAAYPFWAQQTYPETPREPTGRIVCANCHLAAKPAEVEV
PQSVLPDTVFKAVVKIPYDTKLQQVAADGSKVGLNVGAVLMLPEGFKIAPEERIPEELKKEVGDVYFQPYKEGQDNVLLV
GPLPGEQYQEIVFPVLSPNPTTDKNIHFGKYAIHLGANRGRGQIYPTGEKSNNNVFTASATGTITKIAKEEDEYGNVKYQ
VSIQTDSGKTVVDTIPAGPELIVSEGQAVKAGEALTNNPNVGGFGQDDTEIVLQDPNRVKWMIAFICLVMLAQLMLILKK
KQVEKVQAAEMNF
>Q93SW9 ~~~petA~~~Cytochrome f~~~COG3258
MRNACTRARLTRTARAMVKTLFIAIASVTFFFTSDLALPQSAAAYPFWAQQTYPETPREPTGRIVCANCHLAAKPTEVEV
PQSVLPDTVFKAVVKIPYDTSVQQVGADGSKVGLNVGAVLMLPEGFKIAPEDRIPEELKEEIGDVYFQPYGEDKDNIVIV
GPLPGEQYQEIVFPVLSPNPANDKNIHFGKYSVHVGGNRGRGQVYPTGEKSNNNLYSAAATGTISKIAKQEGEDGSVKYL
VDIKTESGEVVSDTIPAGPELIVSEGQAVTAGDALTNNPNVGGFGQLDAEIVLQDANRVGWLIAFVALVMLAQVMLVLKK
KQVEKVQAAEMNF
>P95522 ~~~petA~~~Cytochrome f~~~
MNFKVCSFPSRRQSIAAFVRVLMVILLTLGALVSSDVLLPQPAAAYPFWAQQNYANPREATGRIVCANCHLAAKPAEIEV
PQAVLPDSVFKAVVKIPYDHSVQQVQADGSKGPLNVGAVLMLPEGFTIAPEDRIPEEMKEEVGPSYLFQPYADDKQNIVL
VGPLPGDQYEEIVFPVLSPNPATNKSVAFGKYSIHLGANRGRGQIYPTGEKSNNAVYNASAAGVITAIAKADDGSAEVKI
RTEDGTTIVDKIPAGPELIVSEGEEVAAGAALTNNPNVGGFGQKDTEIVLQSPNRVKGRIAFLAAITLTQILLVLKKKQV
ERVQAGRDDLLKAAFIAG
>P26287 ~~~petA~~~Cytochrome f~~~COG3258
MRNPDTLGLWTKTMVALRRFTVLAIATVSVFLITDLGLPQAASAYPFWAQETAPLTPREATGRIVCANCHLAQKAAEVEI
PQAVLPDTVFEAVVKIPYDLDSQQVLGDGSKGGLNVGAVLMLPEGFKIAPPDRLSEGLKEKVGGTYFQPYREDMENVVIV
GPLPGEQYQEIVFPVLSPDPAKDKSINYGKFAVHLGANRGRGQIYPTGLLSNNNAFKAPNAGTISEVNALEAGGYQLILT
TADGTETVDIPAGPELIVSAGQTVEAGEFLTNNPNVGGFGQKDTEVVLQNPTRIKFLVLFLAGIMLSQILLVLKKKQIEK
VQAAELNF
>O34527 ~~~cymR~~~HTH-type transcriptional regulator CymR~~~COG1959
MKISTKGRYGLTIMIELAKKHGEGPTSLKSIAQTNNLSEHYLEQLVSPLRNAGLVKSIRGAYGGYVLGSEPDAITAGDII
RVLEGPISPVEVLEDEEPAKRELWIRIRDAVKEVLDSTTLEDLASYTDGEQEAYMFYI
>O33453 ~~~cymR~~~HTH-type transcriptional regulator CymR~~~
MSPKRRTQAERAMETQGKLIAAALGVLREKGYAGFRIADVPGAAGVSRGAQSHHFPTKLELLLATFEWLYEQITERSRAR
LAKLKPEDDVIQQMLDDAAEFFLDDDFSISLDLIVAADRDPALREGIQRTVERNRFVVEDMWLGVLVSRGLSRDDAEDIL
WLIFNSVRGLAVRSLWQKDKERFERVRNSTLEIARERYAKFKR
>P27111 ~~~cynR~~~HTH-type transcriptional regulator CynR~~~COG0583
MLSRHINYFLAVAEHGSFTRAASALHVSQPALSQQIRQLEESLGVPLFDRSGRTIRLTDAGEVWRQYASRALQELGAGKR
AIHDVADLTRGSLRIAVTPTFTSYFIGPLMADFYARYPSITLQLQEMSQEKIEDMLCRDELDVGIAFAPVHSPELEAIPL
LTESLALVVAQHHPLAVHEQVALSRLHDEKLVLLSAEFATREQIDHYCEKAGLHPQVVIEANSISAVLELIRRTSLSTLL
PAAIATQHDGLKAISLAPPLLERTAVLLRRKNSWQTAAAKAFLHMALDKCAVVGGNESR
>P00816 4.2.1.104~~~cynS~~~Cyanate hydratase~~~COG1513
MIQSQINRNIRLDLADAILLSKAKKDLSFAEIADGTGLAEAFVTAALLGQQALPADAARLVGAKLDLDEDSILLLQMIPL
RGCIDDRIPTDPTMYRFYEMLQVYGTTLKALVHEKFGDGIISAINFKLDVKKVADPEGGERAVITLDGKYLPTKPF
>A8GBZ7 4.2.1.104~~~cynS~~~Cyanate hydratase~~~COG1513
MTQSLHYSSPRETLTDTIMMAKIRKNLTFEAINQGTGLSLAFVTAALLGQHPLPEQAARVVAEKLDLDEDAIRLLQTIPL
RGSIPGGVPTDPTIYRFYEMVQIYGSTLKALVHEQFGDGIISAINFKLDIKKVPDPDGGERAVITLDGKYLPTKPF
>Q59948 4.2.1.104~~~cynS~~~Cyanate hydratase~~~COG1513
MTSAITEQLLKAKKAKGITFTELEQLLGRDEVWIASVFYRQSTASPEEAEKLLTALGLDLALADELTTPPVKGCLEPVIP
TDPLIYRFYEIMQVYGLPLKDVIQEKFGDGIMSAIDFTLDVDKVEDPKGDRVKVTMCGKFLAYKKW
>Q55367 4.2.1.104~~~cynS~~~Cyanate hydratase~~~COG1513
MAGTEISAITTKLLEAKKAKGITFADLEQLLGRDEVWIAAVIYRQASASVDEAEKLLHCLGLSDDLVPELTAPSVKGLGP
VVPTDPLIYRFYEIMQVYGMPMKEVIHEKFGDGIMSAIDFTLDIEKEADPKGDRVKVTMNGKFLPYKKW
>P0ABE9 4.2.1.1~~~cynT~~~Carbonic anhydrase 1~~~COG0288
MKEIIDGFLKFQREAFPKREALFKQLATQQSPRTLFISCSDSRLVPELVTQREPGDLFVIRNAGNIVPSYGPEPGGVSAS
VEYAVAALRVSDIVICGHSNCGAMTAIASCQCMDHMPAVSHWLRYADSARVVNEARPHSDLPSKAAAMVRENVIAQLANL
QTHPSVRLALEEGRIALHGWVYDIESGSIAAFDGATRQFVPLAANPRVCAIPLRQPTAA
>A0R566 4.2.1.1~~~cynT~~~Carbonic anhydrase~~~COG0288
MPNSNPVAAWKALKDGNARFVAGQPLHPSQGIERRASLTQAQRPTAVVFGCGDSRVAAEILFDQGLGDMFVVRTAGHVID
NAVLGSIEYAVTVLKVPLIVVLGHDSCGAVKATLSALDEGEVPSGFVRDIVERVTPSILLGRKAGLSRVDEFEAQHVNET
VAQLQMRSTAIAQGLAAGTQAIVGTTYHLADGRVELRSHLGDIGEV
>P27134 4.2.1.1~~~ccaA~~~Carbonic anhydrase~~~COG0288
MRKLIEGLRHFRTSYYPSHRDLFEQFAKGQHPRVLFITCSDSRIDPNLITQSGMGELFVIRNAGNLIPPFGAANGGEGAS
IEYAIAALNIEHVVVCGHSHCGAMKGLLKLNQLQEDMPLVYDWLQHAQATRRLVLDNYSGYETDDLVEILVAENVLTQIE
NLKTYPIVRSRLFQGKLQIFGWIYEVESGEVLQISRTSSDDTGIDECPVRLPGSQEKAILGRCVVPLTEEVAVAPPEPEP
VIAAVAAPPANYSSRGWLAPEQQQRIYRGNAS
>Q54735 4.2.1.1~~~ccaA~~~Carbonic anhydrase~~~COG0288
MQRLIEGLQKFREGYFSSHRDLFEQLSHGQHPRILFICCSDSRVDPNLITQSEVGDLFVIRNAGNIIPPYGAANGGEGAA
MEYALVALEINQIIVCGHSHCGAMKGLLKLNSLQEKLPLVYDWLKHTEATRRLVLDNYSHLEGEDLIEVAVAENILTQLK
NLQTYPAIHSRLHRGDLSLHGWIYRIEEGEVLAYDGVLHDFVAPQSRINALEPEDEYAPHPNSPLISYDAFKVPGKERPG
REKATESPAPQLSPLPGFGHLPREQAERIYRGSR
>P0ABJ1 ~~~cyoA~~~Cytochrome bo(3) ubiquinol oxidase subunit 2~~~COG1622
MRLRKYNKSLGWLSLFAGTVLLSGCNSALLDPKGQIGLEQRSLILTAFGLMLIVVIPAILMAVGFAWKYRASNKDAKYSP
NWSHSNKVEAVVWTVPILIIIFLAVLTWKTTHALEPSKPLAHDEKPITIEVVSMDWKWFFIYPEQGIATVNEIAFPANTP
VYFKVTSNSVMNSFFIPRLGSQIYAMAGMQTRLHLIANEPGTYDGISASYSGPGFSGMKFKAIATPDRAAFDQWVAKAKQ
SPNTMSDMAAFEKLAAPSEYNQVEYFSNVKPDLFADVINKFMAHGKSMDMTQPEGEHSAHEGMEGMDMSHAESAH
>P0ABI8 7.1.1.3~~~cyoB~~~Cytochrome bo(3) ubiquinol oxidase subunit 1~~~COG0843
MFGKLSLDAVPFHEPIVMVTIAGIILGGLALVGLITYFGKWTYLWKEWLTSVDHKRLGIMYIIVAIVMLLRGFADAIMMR
SQQALASAGEAGFLPPHHYDQIFTAHGVIMIFFVAMPFVIGLMNLVVPLQIGARDVAFPFLNNLSFWFTVVGVILVNVSL
GVGEFAQTGWLAYPPLSGIEYSPGVGVDYWIWSLQLSGIGTTLTGINFFVTILKMRAPGMTMFKMPVFTWASLCANVLII
ASFPILTVTVALLTLDRYLGTHFFTNDMGGNMMMYINLIWAWGHPEVYILILPVFGVFSEIAATFSRKRLFGYTSLVWAT
VCITVLSFIVWLHHFFTMGAGANVNAFFGITTMIIAIPTGVKIFNWLFTMYQGRIVFHSAMLWTIGFIVTFSVGGMTGVL
LAVPGADFVLHNSLFLIAHFHNVIIGGVVFGCFAGMTYWWPKAFGFKLNETWGKRAFWFWIIGFFVAFMPLYALGFMGMT
RRLSQQIDPQFHTMLMIAASGAVLIALGILCLVIQMYVSIRDRDQNRDLTGDPWGGRTLEWATSSPPPFYNFAVVPHVHE
RDAFWEMKEKGEAYKKPDHYEEIHMPKNSGAGIVIAAFSTIFGFAMIWHIWWLAIVGFAGMIITWIVKSFDEDVDYYVPV
AEIEKLENQHFDEITKAGLKNGN
>P0ABJ3 ~~~cyoC~~~Cytochrome bo(3) ubiquinol oxidase subunit 3~~~COG1845
MATDTLTHATAHAHEHGHHDAGGTKIFGFWIYLMSDCILFSILFATYAVLVNGTAGGPTGKDIFELPFVLVETFLLLFSS
ITYGMAAIAMYKNNKSQVISWLALTWLFGAGFIGMEIYEFHHLIVNGMGPDRSGFLSAFFALVGTHGLHVTSGLIWMAVL
MVQIARRGLTSTNRTRIMCLSLFWHFLDVVWICVFTVVYLMGAM
>P0ABJ6 ~~~cyoD~~~Cytochrome bo(3) ubiquinol oxidase subunit 4~~~COG3125
MSHSTDHSGASHGSVKTYMTGFILSIILTVIPFWMVMTGAASPAVILGTILAMAVVQVLVHLVCFLHMNTKSDEGWNMTA
FVFTVLIIAILVVGSIWIMWNLNYNMMMH
>P0AEA5 2.5.1.141~~~cyoE~~~Protoheme IX farnesyltransferase~~~COG0109
MMFKQYLQVTKPGIIFGNLISVIGGFLLASKGSIDYPLFIYTLVGVSLVVASGCVFNNYIDRDIDRKMERTKNRVLVKGL
ISPAVSLVYATLLGIAGFMLLWFGANPLACWLGVMGFVVYVGVYSLYMKRHSVYGTLIGSLSGAAPPVIGYCAVTGEFDS
GAAILLAIFSLWQMPHSYAIAIFRFKDYQAANIPVLPVVKGISVAKNHITLYIIAFAVATLMLSLGGYAGYKYLVVAAAV
SVWWLGMALRGYKVADDRIWARKLFGFSIIAITALSVMMSVDFMVPDSHTLLAAVW
>Q825I8 1.14.15.11~~~~~~Pentalenic acid synthase~~~COG2124
MTEPGTSVSAPVAFPQDRTCPYDPPTAYDPLREGRPLSRVSLYDGRSVWVVTGHAAARALLSDQRLSSDRTLPRFPATTE
RFEAVRTRRVALLGVDDPEHRTQRRMLVPSFTLKRAAALRPRIQETVDGLLDAMEAQGPPAELVSAFALPLPSMVICALL
GVPYADHDFFESQSRRLLRGPGIAEVQDARAQLDDYLYALIDRKRKEPGDGLLDDLIQEQLNRGTVDRAELVSLATLLLI
AGHETTANMISLGTFTLLRHPEQLAELRAEPGLMPAAVEELLRFLSIADGLLRVATEDIEVAGTTIRADEGVVFATSVIN
RDAAGFAEPDALDWHRSARHHVAFGFGIHQCLGQNLARAEMEIALGTLFERLPGLRLAAPADEIPFKPGDTIQGMLELPV
TW
>Q6N8N2 1.14.99.15~~~~~~Cytochrome p450 CYP199A2~~~COG2124
MTTAPSLVPVTTPSQHGAGVPHLGIDPFALDYFADPYPEQETLREAGPVVYLDKWNVYGVARYAEVYAVLNDPLTFCSSR
GVGLSDFKKEKPWRPPSLILEADPPAHTRTRAVLSKVLSPATMKRLRDGFAAAADAKIDELLARGGNIDAIADLAEAYPL
SVFPDAMGLKQEGRENLLPYAGLVFNAFGPPNELRQSAIERSAPHQAYVAEQCQRPNLAPGGFGACIHAFSDTGEITPEE
APLLVRSLLSAGLDTTVNGIAAAVYCLARFPDEFARLRADPSLARNAFEEAVRFESPVQTFFRTTTRDVELAGATIGEGE
KVLMFLGSANRDPRRWDDPDRYDITRKTSGHVGFGSGVHMCVGQLVARLEGEVVLAALARKVAAIEIAGPLKRRFNNTLR
GLESLPIQLTPA
>E5KIB6 ~~~cypA~~~Cypemycin~~~
MRSEMTLTSTNSAEALAAQDFANTVLSAAAPGFHADCETPAMATPATPTVAQFVIQGSTICLVC
>O08336 ~~~cypB~~~Bifunctional cytochrome P450/NADPH--P450 reductase 2~~~COG0369
MKQASAIPQPKTYGPLKNLPHLEKEQLSQSLWRIADELGPIFRFDFPGVSSVFVSGHNLVAEVCDEKRFDKNLGKGLQKV
REFGGDGLFTSWTHEPNWQKAHRILLPSFSQKAMKGYHSMMLDIATQLIQKWSRLNPNEEIDVADDMTRLTLDTIGLCGF
NYRFNSFYRDSQHPFITSMLRALKEAMNQSKRLGLQDKMMVKTKLQFQKDIEVMNSLVDRMIAERKANPDENIKDLLSLM
LYAKDPVTGETLDDENIRYQIITFLIAGHETTSGLLSFAIYCLLTHPEKLKKAQEEADRVLTDDTPEYKQIQQLKYIRMV
LNETLRLYPTAPAFSLYAKEDTVLGGEYPISKGQPVTVLIPKLHRDQNAWGPDAEDFRPERFEDPSSIPHHAYKPFGNGQ
RACIGMQFALQEATMVLGLVLKHFELINHTGYELKIKEALTIKPDDFKITVKPRKTAAINVQRKEQADIKAETKPKETKP
KHGTPLLVLFGSNLGTAEGIAGELAAQGRQMGFTAETAPLDDYIGKLPEEGAVVIVTASYNGAPPDNAAGFVEWLKELEE
GQLKGVSYAVFGCGNRSWASTYQRIPRLIDDMMKAKGASRLTAIGEGDAADDFESHRESWENRFWKETMDAFDINEIAQK
EDRPSLSITFLSEATETPVAKAYGAFEGIVLENRELQTAASTRSTRHIELEIPAGKTYKEGDHIGILPKNSRELVQRVLS
RFGLQSNHVIKVSGSAHMAHLPMDRPIKVVDLLSSYVELQEPASRLQLRELASYTVCPPHQKELEQLVSDDGIYKEQVLA
KRLTMLDFLEDYPACEMPFERFLALLPSLKPRYYSISSSPKVHANIVSMTVGVVKASAWSGRGEYRGVASNYLAELNTGD
AAACFIRTPQSGFQMPNDPETPMIMVGPGTGIAPFRGFIQARSVLKKEGSTLGEALLYFGCRRPDHDDLYREELDQAEQD
GLVTIRRCYSRVENEPKGYVQHLLKQDTQKLMTLIEKGAHIYVCGDGSQMAPDVERTLRLAYEAEKAASQEESAVWLQKL
QDQRRYVKDVWTGM
>I3DZK9 1.11.2.4~~~cypC~~~Fatty-acid peroxygenase~~~COG2124
MSNINQMPREEGIDSTWRLMEEGYMYILNRRHSFNSDIFETRLLGKKAICMGGKEAAEIFYDTEKFKRKDAAPNRVVQTL
FGKNGVQALDGQTHKHRKEMFMSIMSPDELEKLTDITKKQWEIAVDKWEQMDKVILYEEAKEIMCRTACQWAGVPVQENE
VKRLTKNLGAMFESAAAVGLKHWLGRHARNYEEIWIEELIDRVRDGKVNPPENTTLHKFSWYRDLEGNLLDTETAAVEVI
NILRPIVAIAIFINFIALALHHYPEEKEKLKSGDKKYSQMFVQEVRRFYPFFPFVVALVKKDFTWKGYKFEEGTLTLLDL
YGTNHDPEIWKNPDVFSPDRFAKWEGSPFSFIPQGGGDYFMGHRCAGEWVTIEVMKVSLDYLTNRMDYEVPDQDLSFSMA
SMPSIPHSKVVIKNVKKRI
>O31440 1.11.2.4~~~cypC~~~Fatty-acid peroxygenase~~~COG2124
MNEQIPHDKSLDNSLTLLKEGYLFIKNRTERYNSDLFQARLLGKNFICMTGAEAAKVFYDTDRFQRQNALPKRVQKSLFG
VNAIQGMDGSAHIHRKMLFLSLMTPPHQKRLAELMTEEWKAAVTRWEKADEVVLFEEAKEILCRVACYWAGVPLKETEVK
ERADDFIDMVDAFGAVGPRHWKGRRARPRAEEWIEVMIEDARAGLLKTTSGTALHEMAFHTQEDGSQLDSRMAAIELINV
LRPIVAISYFLVFSALALHEHPKYKEWLRSGNSREREMFVQEVRRYYPFGPFLGALVKKDFVWNNCEFKKGTSVLLDLYG
TNHDPRLWDHPDEFRPERFAEREENLFDMIPQGGGHAEKGHRCPGEGITIEVMKASLDFLVHQIEYDVPEQSLHYSLARM
PSLPESGFVMSGIRRKS
>P23154 ~~~~~~Putative polyketide cyclase~~~COG2867
MAGHTDNEITIAAPMELVWNMTNDIEKWPGLFSEYASVEVLGRDDDKVTFRLTMHPDADGKVWSWVSERVADPVTRTVRA
QRVETGPFQYMNIVWEYAETAEGTVMRWTQDFAMKPDAPVDDAWMTDNINRNSRTQMALIRDRIEQAAGERRTASVLAD
>O08394 ~~~cypD~~~Bifunctional cytochrome P450/NADPH--P450 reductase 1~~~COG0369
MKETSPIPQPKTFGPLGNLPLIDKDKPTLSLIKLAEEQGPIFQIHTPAGTTIVVSGHELVKEVCDEERFDKSIEGALEKV
RAFSGDGLFTSWTHEPNWRKAHNILMPTFSQRAMKDYHEKMVDIAVQLIQKWARLNPNEAVDVPGDMTRLTLDTIGLCGF
NYRFNSYYRETPHPFINSMVRALDEAMHQMQRLDVQDKLMVRTKRQFRYDIQTMFSLVDSIIAERRANGDQDEKDLLARM
LNVEDPETGEKLDDENIRFQIITFLIAGHETTSGLLSFATYFLLKHPDKLKKAYEEVDRVLTDAAPTYKQVLELTYIRMI
LNESLRLWPTAPAFSLYPKEDTVIGGKFPITTNDRISVLIPQLHRDRDAWGKDAEEFRPERFEHQDQVPHHAYKPFGNGQ
RACIGMQFALHEATLVLGMILKYFTLIDHENYELDIKQTLTLKPGDFHISVQSRHQEAIHADVQAAEKAAPDEQKEKTEA
KGASVIGLNNRPLLVLYGSDTGTAEGVARELADTASLHGVRTKTAPLNDRIGKLPKEGAVVIVTSSYNGKPPSNAGQFVQ
WLQEIKPGELEGVHYAVFGCGDHNWASTYQYVPRFIDEQLAEKGATRFSARGEGDVSGDFEGQLDEWKKSMWADAIKAFG
LELNENADKERSTLSLQFVRGLGESPLARSYEASHASIAENRELQSADSDRSTRHIEIALPPDVEYQEGDHLGVLPKNSQ
TNVSRILHRFGLKGTDQVTLSASGRSAGHLPLGRPVSLHDLLSYSVEVQEAATRAQIRELASFTVCPPHRRELEELSAEG
VYQEQILKKRISMLDLLEKYEACDMPFERFLELLRPLKPRYYSISSSPRVNPRQASITVGVVRGPAWSGRGEYRGVASND
LAERQAGDDVVMFIRTPESRFQLPKDPETPIIMVGPGTGVAPFRGFLQARDVLKREGKTLGEAHLYFGCRNDRDFIYRDE
LERFEKDGIVTVHTAFSRKEGMPKTYVQHLMADQADTLISILDRGGRLYVCGDGSKMAPDVEAALQKAYQAVHGTGEQEA
QNWLRHLQDTGMYAKDVWAGI
>E5KIB9 1.3.99.36~~~cypD~~~Cypemycin cysteine dehydrogenase (decarboxylating)~~~
MNVEKFEGAELHVHVTGSISAALVPWWIHWLREFQPELVVNVSVTPAASRFLAVRALRHLANGKVWVDSWDDPDVPPEVN
SGKSGASECFLVFPATLDTVMRLAQGRADSPALMMLQLTDAPLVIADTFPGSNEIVENNVQTLKLRPNVEFAPRVNGVRA
SNRQTAEVGFNLPGALAAANRMRKEGRSGE
>E5KIC0 2.1.1.301~~~cypM~~~Cypemycin N-terminal methyltransferase~~~
MSDPSVYDETAIEAYDLVSSMLSPGAGLVAWVSSHRPLDGRTVLDLGCGTGVSSFALAEAGARVVAVDASRPSLDMLEKK
RLDRDVEAVEGDFRDLTFDSTFDVVTMSRNTFFLAQEQEEKIALLRGIARHLKPGGAAFLDCTDPAEFQRAGGDARSVTY
PLGRDRMVTVTQTADRAGQQILSIFLVQGATTLTAFHEQATWATLAEIRLMARIAGLEVTGVDGSYAGEPYTARSREMLV
VLERQ
>O34926 1.14.15.13~~~cypX~~~Pulcherriminic acid synthase~~~COG2124
MSQSIKLFSVLSDQFQNNPYAYFSQLREEDPVHYEESIDSYFISRYHDVRYILQHPDIFTTKSLVERAEPVMRGPVLAQM
HGKEHSAKRRIVVRSFIGDALDHLSPLIKQNAENLLAPYLERGKSDLVNDFGKTFAVCVTMDMLGLDKRDHEKISEWHSG
VADFITSISQSPEARAHSLWCSEQLSQYLMPVIKERRVNPGSDLISILCTSEYEGMALSDKDILALILNVLLAATEPADK
TLALMIYHLLNNPEQMNDVLADRSLVPRAIAETLRYKPPVQLIPRQLSQDTVVGGMEIKKDTIVFCMIGAANRDPEAFEQ
PDVFNIHREDLGIKSAFSGAARHLAFGSGIHNCVGAAFAKNEIEIVANIVLDKMRNIRLEEDFCYAESGLYTRGPVSLLV
AFDGA
>P16676 7.3.2.3~~~cysA~~~Sulfate/thiosulfate import ATP-binding protein CysA~~~COG1118
MSIEIANIKKSFGRTQVLNDISLDIPSGQMVALLGPSGSGKTTLLRIIAGLEHQTSGHIRFHGTDVSRLHARDRKVGFVF
QHYALFRHMTVFDNIAFGLTVLPRRERPNAAAIKAKVTKLLEMVQLAHLADRYPAQLSGGQKQRVALARALAVEPQILLL
DEPFGALDAQVRKELRRWLRQLHEELKFTSVFVTHDQEEATEVADRVVVMSQGNIEQADAPDQVWREPATRFVLEFMGEV
NRLQGTIRGGQFHVGAHRWPLGYTPAYQGPVDLFLRPWEVDISRRTSLDSPLPVQVLEASPKGHYTQLVVQPLGWYNEPL
TVVMHGDDAPQRGERLFVGLQHARLYNGDERIETRDEELALAQSA
>P9WQM1 7.3.2.3~~~cysA~~~Sulfate/thiosulfate import ATP-binding protein CysA~~~COG1118
MTYAIVVADATKRYGDFVALDHVDFVVPTGSLTALLGPSGSGKSTLLRTIAGLDQPDTGTITINGRDVTRVPPQRRGIGF
VFQHYAAFKHLTVRDNVAFGLKIRKRPKAEIKAKVDNLLQVVGLSGFQSRYPNQLSGGQRQRMALARALAVDPEVLLLDE
PFGALDAKVREELRAWLRRLHDEVHVTTVLVTHDQAEALDVADRIAVLHKGRIEQVGSPTDVYDAPANAFVMSFLGAVST
LNGSLVRPHDIRVGRTPNMAVAAADGTAGSTGVLRAVVDRVVVLGFEVRVELTSAATGGAFTAQITRGDAEALALREGDT
VYVRATRVPPIAGGVSGVDDAGVERVKVTST
>P14788 7.3.2.3~~~cysA~~~Sulfate/thiosulfate import ATP-binding protein CysA~~~COG1118
MPKDKAVGIQVSQVSKQFGSFQAVKDVDLTVETGSLVALLGPSGSGKSTLLRLIAGLEQPDSGRIFLTGRDATNESVRDR
QIGFVFQHYALFKHLTVRKNIAFGLELRKHTKEKVRARVEELLELVQLTGLGDRYPSQLSGGQRQRVALARALAVQPQVL
LLDEPFGALDAKVRKDLRSWLRKLHDEVHVTTVFVTHDQEEAMEVADQIVVMNHGKVEQIGSPAEIYDNPATPFVMSFIG
PVNVLPNSSHIFQAGGLDTPHPEVFLRPHDIEIAIDPIPETVPARIDRIVHLGWEVQAEVRLEDGQVLVAHLPRDRYRDL
QLEPEQQVFVRPKQARSFPLNYSI
>B4E8V9 ~~~cysB~~~HTH-type transcriptional regulator CysB~~~COG0583
MNLHQFRFVREAVRQNFNLTEAAKALYTSQPGVSKAIIELEDELGVEIFTRHGKRVRSLTEPGRIILASVERILQEVESL
KRVGKDYAAQDQGNLTIAATHTQARYSLPAAIAEFKKRFPKVHLSILQGSPTQVAEMVIHDQADLAIATEAISDYKELVS
LPCFQWHHAAVVPADHPLLERKPVTLDDLAQYPLITYDDAFAGRKKINHAFALRGLSPDIVLEAIDADVIKTYVELGLGV
GIMADIAFNPERDRGLRLIPVGHLFGSNVTRVALKQGAYLRSYVYTLVELLSPTLNRKLIEQALKGEAESYEL
>P45600 ~~~cysB~~~HTH-type transcriptional regulator CysB~~~
MKLQQLRYIVEVVNHNLNVSSTAEGLYTSQPGISKQVRMLEDELGIQIFARSGKHLTQVTPAGQEIIRIAREVLSKVDAI
KSVAGEHTWPDKGSLYVATTHTQARYALPGVIKGFIERYPRVSLHMHQGSPTQIAEAVSKGNADFAIATEALHLYDDLVM
LPCYHWNRSIVVTPEHPLATKGSVSIEELAQYPLVTYTFGFTGRSELDTAFNRAGLTPRIVFTATDADVIKTYVRLGLGV
GVIASMAVDPVSDPDLVKLDANGIFSHSTTKIGFRRSTFLRSYMYDFIQRFAPHLTRDVVDTAVALRSNEDIEAMFKDIK
LPEK
>P06614 ~~~cysB~~~HTH-type transcriptional regulator CysB~~~
MKLQQLRYIVEVVNHNLNVSSTAEGLYTSQPGISKQVRMLEDELGIQIFARSGKHLTQVTPAGQEIIRIAREVLSKVDAI
KSVAGEHTWPDKGSLYIATTHTQARYALPGVIKGFIERYPRVSLHMHQGSPTQIAEAVSKGNADFAIATEALHLYDDLVM
LPCYHWNRSIVVTPDHPLAATSSVTIEALAQYPLVTYTFGFTGRSELDTAFNRAGLTPRIVFTATDADVIKTYVRLGLGV
GVIASMAVDPLADPDLVRIDAHDIFSHSTTKIGFRRSTFLRSYMYDFIQRFAPHLTRDVVDTAVALRSNEEIEAMFQDIK
LPEK
>O34577 2.7.1.25~~~cysC~~~Probable adenylyl-sulfate kinase~~~COG0529
MTNRDIVWHEASITKEEYQQKNKHKSSILWLTGLSGSGKSTIANAAARELFEQGYQVIVLDGDNIRHGLNRDLGFSDEDR
KENIRRIGEVAKLFVQQGTIVITAFISPFREDRQQVRELVEAGEFNEVYIKCDLDICEQRDPKGLYKKARNGEIPFFTGI
DSPYEEPEAPELVLDSGQHDREACKNQLIEFVKQKLS
>P0A6J1 2.7.1.25~~~cysC~~~Adenylyl-sulfate kinase~~~COG0529
MALHDENVVWHSHPVTVQQRELHHGHRGVVLWFTGLSGSGKSTVAGALEEALHKLGVSTYLLDGDNVRHGLCSDLGFSDA
DRKENIRRVGEVANLMVEAGLVVLTAFISPHRAERQMVRERVGEGRFIEVFVDTPLAICEARDPKGLYKKARAGELRNFT
GIDSVYEAPESAEIHLNGEQLVTNLVQQLLDLLRQNDIIRS
>P72940 2.7.1.25~~~cysC~~~Probable adenylyl-sulfate kinase~~~COG0529
MQQRGVTIWLTGLSGAGKTTITHALEKKLRDSGYRLEVLDGDVVRTNLTKGLGFSKEDRDTNIRRIGFVSHLLTRNGVIV
LVSAISPYAAIRQEVKHTIGDFLEVFVNAPLAVCEERDVKGLYAKARSGEIKGFTGIDDPYEPPTNPDVECRTDLEELDE
SVGKIWQKLVDLKYIEG
>Q06529 ~~~fccA~~~Cytochrome subunit of sulfide dehydrogenase~~~COG2863
MTQSTPRLMLAASVLALGLASNAGAEPTAEMLTNNCAGCHGTHGNSVGPASPSIAQMDPMVFVEVMEGFKSGEIASTIMG
RIAKGYSTADFEKMAGYFKQQTYQPAKQSFDTALADTGAKLHDKYCEKCHVEGGKPLADEEDYHILAGQWTPYLQYAMSD
FREERRPMEKKMASKLRELLKAEGDAGLDALFAFYASQQ
>P20958 ~~~fccA~~~Cytochrome subunit of sulfide dehydrogenase~~~
APEQSKSIPRGEILSLSCAGCHGTDGKSESIIPTIYGRSAEYIESALLDFKSGARPSTVMGRHAKGYSDEEIHQIAEYFG
SLSTMNN
>P21156 2.7.7.4~~~cysD~~~Sulfate adenylyltransferase subunit 2~~~COG0175
MDQIRLTHLRQLEAESIHIIREVAAEFSNPVMLYSIGKDSSVMLHLARKAFYPGTLPFPLLHVDTGWKFREMYEFRDRTA
KAYGCELLVHKNPEGVAMGINPFVHGSAKHTDIMKTEGLKQALNKYGFDAAFGGARRDEEKSRAKERIYSFRDRFHRWDP
KNQRPELWHNYNGQINKGESIRVFPLSNWTEQDIWQYIWLENIDIVPLYLAAERPVLERDGMLMMIDDNRIDLQPGEVIK
KRMVRFRTLGCWPLTGAVESNAQTLPEIIEEMLVSTTSERQGRVIDRDQAGSMELKKRQGYF
>P9WIK1 2.7.7.4~~~cysD~~~Sulfate adenylyltransferase subunit 2~~~COG0175
MTSDVTVGPAPGQYQLSHLRLLEAEAIHVIREVAAEFERPVLLFSGGKDSIVMLHLALKAFRPGRLPFPVMHVDTGHNFD
EVIATRDELVAAAGVRLVVASVQDDIDAGRVVETIPSRNPIQTVTLLRAIRENQFDAAFGGARRDEEKARAKERVFSFRD
EFGQWDPKAQRPELWNLYNGRHHKGEHIRVFPLSNWTEFDIWSYIGAEQVRLPSIYFAHRRKVFQRDGMLLAVHRHMQPR
ADEPVFEATVRFRTVGDVTCTGCVESSASTVAEVIAETAVARLTERGATRADDRISEAGMEDRKRQGYF
>Q87WW0 2.7.7.4~~~cysD~~~Sulfate adenylyltransferase subunit 2~~~COG0175
MVDKLTHLKQLEAESIHIIREVAAEFDNPVMLYSIGKDSAVMLHLARKAFFPGKLPFPVMHVDTRWKFQEMYRFRDQMVE
EMGLDLITHINPDGVAQGINPFTHGSAKHTDIMKTEGLKQALDKHGFDAAFGGARRDEEKSRAKERVYSFRDSKHRWDPK
NQRPELWNVYNGNVNKGESIRVFPLSNWTELDIWQYIYLEGIPIVPLYFAAERDVIEKNGTLIMIDDERILEHLTDEEKS
RIVKKKVRFRTLGCYPLTGAVESEATSLTDIIQEMLLTRTSERQGRVIDHDGAGSMEEKKRQGYF
>Q06750 2.3.1.30~~~cysE~~~Serine acetyltransferase~~~COG1045
MFFRMLKEDIDTVFDQDPAARSYFEVILTYSGLHAIWAHRIAHALYKRKFYFLARLISQVSRFFTGIEIHPGATIGRRFF
IDHGMGVVIGETCEIGNNVTVFQGVTLGGTGKEKGKRHPTIKDDALIATGAKVLGSITVGEGSKIGAGSVVLHDVPDFST
VVGIPGRVVVQNGKKVRRDLNHQDLPDPVADRFKSLEQQILELKAELEDRKERINQK
>P0A9D4 2.3.1.30~~~cysE~~~Serine acetyltransferase~~~COG1045
MSCEELEIVWNNIKAEARTLADCEPMLASFYHATLLKHENLGSALSYMLANKLSSPIMPAIAIREVVEEAYAADPEMIAS
AACDIQAVRTRDPAVDKYSTPLLYLKGFHALQAYRIGHWLWNQGRRALAIFLQNQVSVTFQVDIHPAAKIGRGIMLDHAT
GIVVGETAVIENDVSILQSVTLGGTGKSGGDRHPKIREGVMIGAGAKILGNIEVGRGAKIGAGSVVLQPVPPHTTAAGVP
ARIVGKPDSDKPSMDMDQHFNGINHTFEYGDGI
>P43886 2.3.1.30~~~cysE~~~Serine acetyltransferase~~~COG1045
MTLDVWQHIRQEAKELAENEPMLASFFHSTILKHQNLGGALSYLLANKLANPIMPAISLREIIEEAYQSNPSIIDCAACD
IQAVRHRDPAVELWSTPLLYLKGFHAIQSYRITHYLWNQNRKSLALYLQNQISVAFDVDIHPAAKIGHGIMFDHATGIVV
GETSVIENDVSILQGVTLGGTGKESGDRHPKVREGVMIGAGAKILGNIEVGKYAKIGANSVVLNPVPEYATAAGVPARIV
SQDKAAKPAFDMNQYFIGIDDGMNLNI
>A0A120HUS7 2.3.1.30~~~cysE~~~Serine O-acetyltransferase~~~COG1897
MEKSPLKIGILNVMHDKADTKTRLQHVLTHTAIPVDLHFYYPMTHYAGRTVPEAVSSILDPLDIHEVATMDGFIITGSPI
ETLEFDQVHYIAEVRTLLKTLSQHVPNQLYLCWGGMVALNYFFGISKLILPHKLFGVYPQTILEPHPLLKGLKNDFKSPH
ARYAEMDVRGIHADPRLTINATTTKGKLFMVTEPTDTQTFVFSHIEYDRWGLDSEYKREVAAHPEIDYVRAKHYYHHKND
YDHPKFNWKKTQRTIFDNWIQHVADHRNDNH
>P95231 2.3.1.30~~~cysE~~~Serine acetyltransferase~~~COG1045
MLTAMRGDIRAARERDPAAPTALEVIFCYPGVHAVWGHRLAHWLWQRGARLLARAAAEFTRILTGVDIHPGAVIGARVFI
DHATGVVIGETAEVGDDVTIYHGVTLGGSGMVGGKRHPTVGDRVIIGAGAKVLGPIKIGEDSRIGANAVVVKPVPPSAVV
VGVPGQVIGQSQPSPGGPFDWRLPDLVGASLDSLLTRVARLEALGGGPQAAGVIRPPEAGIWHGEDFSI
>P29847 2.3.1.30~~~cysE~~~Serine acetyltransferase~~~
MPCEELEIVWKNIKAEARALADCEPMLASFYHATLLKHENLGSALSYMLANKLASPIMPAIAIREVVEEAYAADPEMIAS
AACDIQAVRTRDPAVDKYSTPLLYLKGFHALQAYRIGHWLWNKGRRALAIFLQNQVSVSFQVDIHPAAKIGRGIMLDHAT
GIVVGETAVIEDDVSILQSVTLGGTGKTSGDRHPKIREGVMIGAGAKILGNIEVGRGAKIGAGSVVLQPVPPHTTAAGVP
ARIVGKPGSDKPSMDMDQHFNGIHHTFEYGDGI
>P67765 2.3.1.30~~~cysE~~~Serine acetyltransferase~~~
MLKRMRDDIKMVFEQDPAARSTLEVITTYAGLHAVWSHLIAHKLYNQKKYVAARAISQISRFFTGIEIHPGAKIGKRLFI
DHGMGVVIGETCTIGDNVTIYQGVTLGGTGKERGKRHPDIGDNVLIAAGAKVLGNIKINSNVNIGANSVVLQSVPSYSTV
VGIPGHIVKQDGVRVGKTFDHRHLPDPIYEQIKHLERQLEKTRNGEIQDDYII
>P0AEA8 ~~~cysG~~~Siroheme synthase~~~COG0007
MDHLPIFCQLRDRDCLIVGGGDVAERKARLLLDAGARLTVNALAFIPQFTAWADAGMLTLVEGPFDESLLDTCWLAIAAT
DDDALNQRVSEAAEARRIFCNVVDAPKAASFIMPSIIDRSPLMVAVSSGGTSPVLARLLREKLESLLPLHLGQVAKYAGQ
LRGRVKQQFATMGERRRFWEKLFVNDRLAQSLANNDQKAITETTEQLINEPLDHRGEVVLVGAGPGDAGLLTLKGLQQIQ
QADVVVYDRLVSDDIMNLVRRDADRVFVGKRAGYHCVPQEEINQILLREAQKGKRVVRLKGGDPFIFGRGGEELETLCNA
GIPFSVVPGITAASGCSAYSGIPLTHRDYAQSVRLITGHLKTGGELDWENLAAEKQTLVFYMGLNQAATIQQKLIEHGMP
GEMPVAIVENGTAVTQRVIDGTLTQLGELAQQMNSPSLIIIGRVVGLRDKLNWFSNH
>P25924 ~~~cysG~~~Siroheme synthase~~~
MDHLPIFCQLRDRDCLIVGGGDVAERKARLLLEAGARLTVNALTFIPQFTVWANEGMLTLVEGPFDETLLDSCWLAIAAT
DDDTVNQRVSDAAESRRIFCNVVDAPKAASFIMPSIIDRSPLMVAVSSGGTSPVLARLLREKLESLLPQHLGQVARYAGQ
LRARVKKQFATMGERRRFWEKFFVNDRLAQSLANADEKAVNATTERLFSEPLDHRGEVVLVGAGPGDAGLLTLKGLQQIQ
QADIVVYDRLVSDDIMNLVRRDADRVFVGKRAGYHCVPQEEINQILLREAQKGKRVVRLKGGDPFIFGRGGEELETLCHA
GIPFSVVPGITAASGCSAYSGIPLTHRDYAQSVRLVTGHLKTGGELDWENLAAEKQTLVFYMGLNQAATIQEKLIAFGMQ
ADMPVALVENGTSVKQRVVHGVLTQLGELAQQVESPALIIVGRVVALRDKLNWFSNH
>P94498 1.8.4.10~~~cysH~~~Adenosine 5'-phosphosulfate reductase 1~~~COG0175
MLTYDNWEEPTITFPEDDPYKGALSVLKWAYGHYGDQLVYACSFGIEGIVLIDLIYKVKKDAEIVFLDTGLHFKETYETI
ERVKERYPGLNIILKKPDLTLEEQAEEHGDKLWEREPNQCCYLRKVVPLREALSGHPAWLSGLRRDQGPSRANTNFLNKD
EKFKSVKVCPLIHWTWKDIWRYTSRNELDYNPLHDQGYPSIGCAPCTSPAFTAEDLRSGRWNGMAKTECGLHE
>Q8UH67 1.8.4.10~~~cysH~~~Adenosine 5'-phosphosulfate reductase~~~COG0175
MEIPDVTMTINSTNASADTASLDATLAGLDLAGRLSFVAGLGGRAVFTTSLGIEDQVITAAIGTHRLPIDVVTLETGRLF
KETVDLIDETEERFGIEIRRFRPEQDDIDAYAAKYGLNGFYESVEARHACCHVRKLIPLGKALEGAAFWITGLRRGQSGN
RAATPFAEFDAERNLIKINALADWDIEQIRAYVAEENIPVNPLHQRGYPSIGCEPCTRAIKPGEPERAGRWWWENDEKRE
CGLHVAGAEQTPPVSAIPQR
>P17854 1.8.4.8~~~cysH~~~Phosphoadenosine 5'-phosphosulfate reductase~~~COG0175
MSKLDLNALNELPKVDRILALAETNAELEKLDAEGRVAWALDNLPGEYVLSSSFGIQAAVSLHLVNQIRPDIPVILTDTG
YLFPETYRFIDELTDKLKLNLKVYRATESAAWQEARYGKLWEQGVEGIEKYNDINKVEPMNRALKELNAQTWFAGLRREQ
SGSRANLPVLAIQRGVFKVLPIIDWDNRTIYQYLQKHGLKYHPLWDEGYLSVGDTHTTRKWEPGMAEEETRFFGLKRECG
LHEG
>A0R0W2 1.8.4.10~~~cysH~~~Adenosine 5'-phosphosulfate reductase~~~COG0175
MTDVTTSTENELRELAERGAAELADASAEELLRWTDEHFGGNYVVASNMQDAVLVEMAAKVRPGVDVLFLDTGYHFAETI
GTRDAVEAVYDVHVVNVTPERTVAEQDELLGKNLFARDPGECCRLRKVVPLTNALKGYSAWVTGIRRVEAPTRANAPLIS
WDNAFGLVKINPIAAWTDEDMQNYIDANGILVNPLVYEGYPSIGCAPCTSKPIPGADPRSGRWAGLSKTECGLHVS
>P9WIK3 1.8.4.10~~~cysH~~~Adenosine 5'-phosphosulfate reductase~~~COG0175
MSGETTRLTEPQLRELAARGAAELDGATATDMLRWTDETFGDIGGAGGGVSGHRGWTTCNYVVASNMADAVLVDLAAKVR
PGVPVIFLDTGYHFVETIGTRDAIESVYDVRVLNVTPEHTVAEQDELLGKDLFARNPHECCRLRKVVPLGKTLRGYSAWV
TGLRRVDAPTRANAPLVSFDETFKLVKVNPLAAWTDQDVQEYIADNDVLVNPLVREGYPSIGCAPCTAKPAEGADPRSGR
WQGLAKTECGLHAS
>O05927 1.8.4.10~~~cysH~~~Adenosine 5'-phosphosulfate reductase~~~
MLPFATIPATERNSAAQHQDPSPMSQPFDLPALASSLADKSPQDILKAAFEHFGDELWISFSGAEDVVLVDMAWKLNRNV
KVFSLDTGRLHPETYRFIDQVREHYGIAIDVLSPDPRLLEPLVKEKGLFSFYRDGHGECCGIRKIEPLKRKLAGVRAWAT
GQRRDQSPGTRSQVAVLEIDGAFSTPEKPLYKFNPLSSMTSEEVWGYIRMLELPYNSLHERGYISIGCEPCTRPVLPNQH
EREGRWWWEEATHKECGLHAGNLISKA
>P56891 1.8.4.10~~~cysH~~~Adenosine 5'-phosphosulfate reductase~~~COG0175
MTTQSLKAEAVALEADVMALDAEAKALNDKLESLDLAGRLALIAGLEGRAVFTTSLGIEDQVITAAIGSNRLDIEVATLK
TGRLFNETVALIDQTEETYDILIKRYYPEKADIDAYVAQYGMNGFYESVEARHACCGVRKLKPLARALDGASYWITGLRR
GQSGNRATTPFAEADVERGLIKINPLADWGIETIQAHVAAEGIPVNPLHSRGYPSIGCEPCTRAIKPGEPERAGRWWWEN
DEKRECGLHVPEAASSIIPNASNAA
>I3X057 1.8.4.10~~~cysH~~~Adenosine 5'-phosphosulfate reductase~~~COG0175
MTTQSLEAEAKALNDKLESLDLAGRLAMVAGLDGRAVFTTSLGIEDQVITAAIGINRLDIEVATLKTGRLFNETVALVEE
TEETYNILIKRYYPEKADIEAYVAQYGMNGFYESVEARHACCGVRKLKPLARALEGASYWITGLRRGQSGNRATTPFAEA
DLERGLIKINPLADWDIETIRAHVAAEAIPVNPLHGRGYPSIGCEPCTRAIKPGEPERAGRWWWENDEKRECGLHVAEAA
SSIIPNASSAA
>C3MIE1 1.8.4.10~~~cysH~~~Adenosine 5'-phosphosulfate reductase~~~COG0175
MTTQSLEAEAKALNDRLEGLDLAGRLALVAGLEGRAVFTTSLGIEDQVITAALGSNRLDIEVATLKTGRLFNETVALIEA
TEEAYDILIKRYYPEKADIEDYVAQYGMNGFYESVEARHACCGVRKLRPLARALEGASYWITGLRRGQSGNRAATPYAEA
DLERGLIKINPLADWDIDVIRAHVAAEAIPVNPLHGRGYPSIGCEPCTRAIKPGEPERAGRWWWENDEKRECGLHVAEAA
SSIIPNASSAA
>Q8CWK6 1.8.4.8~~~cysH~~~Phosphoadenosine 5'-phosphosulfate reductase~~~
MLDSVASTLQLSELLSLTKAEQSIRLAEINVELEMLSAQERVAWALQNLEGAHAVSSSFGIQAAVMLHLVSKQQADIPVI
LTDTGYLFPETYQFIDELTKSLNLNLKVYRANESANWQEARYGKLWEQGIEGIEKYNKLNKVEPMRRALNELNVKTWFSG
LRREQSQSRAGLPILSIQNGVFKFLPVVDWSNKDVHYYLKEHGLSYHPLWEQGYLSVGDTHTTQKWEPGMSEEETRFFGL
KRECGLHEEDNEQDGSGI
>O32213 1.8.1.2~~~cysI~~~Sulfite reductase [NADPH] hemoprotein beta-component~~~COG0155
MVTKILKAPDGSPSDVERIKKESDYLRGTLKEVMLDRISAGIPDDDNRLMKHHGSYLQDDRDLRNERQKQKLEPAYQFML
RVRMPGGVSTPEQWLVMDDLSQKYGNGTLKLTTRETFQMHGILKWNMKKTIQTIHSALLDTIAACGDVNRNVMCASNPYQ
SEIHSEVYEWSKKLSDDLLPRTRAYHEIWLDEERVAGTPEEEVEPMYGPLYLPRKFKIGIAVPPSNDIDVFSQDLGFIAI
VEDGKLIGFNVAIGGGMGMTHGDTATYPQLAKVIGFCRPEQMYDVAEKTITIQRDYGNRSVRKNARFKYTVDRLGLENVK
EELENRLGWSLEEAKPYHFDHNGDRYGWVEGIEDKWHFTLFVEGGRITDYDDYKLMTGLREIAKVHTGEFRLTANQNLMI
ANVSSDKKEEISALIEQYGLTDGKHYSALRRSSMACVALPTCGLAMAEAERYLPTLLDKIEEIIDENGLRDQEITIRMTG
CPNGCARHALGEIGFIGKAPGKYNMYLGAAFDGSRLSKMYRENIGEADILSELRILLSRYAKEREEGEHFGDFVIRAGII
KATTDGTNFHD
>P17846 1.8.1.2~~~cysI~~~Sulfite reductase [NADPH] hemoprotein beta-component~~~COG0155
MSEKHPGPLVVEGKLTDAERMKHESNYLRGTIAEDLNDGLTGGFKGDNFLLIRFHGMYQQDDRDIRAERAEQKLEPRHAM
LLRCRLPGGVITTKQWQAIDKFAGENTIYGSIRLTNRQTFQFHGILKKNVKPVHQMLHSVGLDALATANDMNRNVLCTSN
PYESQLHAEAYEWAKKISEHLLPRTRAYAEIWLDQEKVATTDEEPILGQTYLPRKFKTTVVIPPQNDIDLHANDMNFVAI
AENGKLVGFNLLVGGGLSIEHGNKKTYARTASEFGYLPLEHTLAVAEAVVTTQRDWGNRTDRKNAKTKYTLERVGVETFK
AEVERRAGIKFEPIRPYEFTGRGDRIGWVKGIDDNWHLTLFIENGRILDYPARPLKTGLLEIAKIHKGDFRITANQNLII
AGVPESEKAKIEKIAKESGLMNAVTPQRENSMACVSFPTCPLAMAEAERFLPSFIDNIDNLMAKHGVSDEHIVMRVTGCP
NGCGRAMLAEVGLVGKAPGRYNLHLGGNRIGTRIPRMYKENITEPEILASLDELIGRWAKEREAGEGFGDFTVRAGIIRP
VLDPARDLWD
>Q8KQT8 1.8.1.2~~~cysI~~~Sulfite reductase [NADPH] hemoprotein beta-component~~~
MSHSVEDIKSESRRLRGSLEQSLADAVTGALREDDQTLIKYHGSYQQDDRDIRDERRQQKLEPAYQFMIRTRTPGGVITP
AQWLALDGIATRYANHSLRITTRQAFQFHGVIKRELKATMQAINATLIDTLAACGDVNRNVQVAANPLLSQAHATLYADA
ACVSEHLLPNTRAYYEIWLDEERVSGSGNEDEPIYGDRYLPRKFKIGFAAPPLNDVDVFANDLGFIAILRDGRLLGYNVS
IGGGMGASHGDAQTWPRVANVIGFVTRDQLLDIATAVVTTQRDFGNRAVRKRARFKYTIDDHGLDTIVAEIARRAGFALQ
PAQPFAFEHNGDRYGWVEGEDGLWHLTLSLPAGRIADTDTATHLSGLRAIAQLNVGEFRMTPNQNLVIAGVPASERARVD
ALVAQYALDAGNRSASALARGAMACVALPTCGLAMAEAERYLPDFSAALQPLLQQHGLADTPIVLRLSGCPNGCSRPYLA
EIALVGKAPGRYNLMLGGDRRGQRLNTLYRENITEPEILAALEPLLARYAAERDHANDEGFGDFLHRAGLIALPSYPTHR
RLDLELLA
>O32214 1.8.1.2~~~cysJ~~~Sulfite reductase [NADPH] flavoprotein alpha-component~~~COG0369
MQLQVMNSPFNQEQAELLNRLLPTLTESQKIWLSGYLSAQSVSAQEAAGTPAAAVSAEAPAPAVSKEVTVLYGSQTGNAQ
GLAENAGKQLEQSGFQVTVSSMSDFKPNQLKKVTNLLIVVSTHGEGEPPDNALSFHEFLHGRRAPKLEDLRFSVLALGDS
SYEFFCQTGKEFDQRLEELGGKRISPRVDCDLDYDEPAAEWLEGVLKGLNEAGGGSAAPAPAAASQTGESSYSRTNPFRA
EVLENLNLNGRGSNKETRHVELSLEGSGLTYEPGDSLGVYPENDPELVELLLKEMNWDPEEIVTLNKQGDVRPLKEALIS
HYEITVLTKPLLEQAAQLTGNDELRELLAPGNEENVKAYIEGRDLLDLVRDYGPFSVSAQEFVSILRKMPARLYSIASSL
SANPDEVHLTIGAVRYDAHGRERKGVCSILCAERLQPGDTLPVYVQHNQNFKLPKDPETPIIMVGPGTGVAPFRSFMQER
EETGAEGKAWMFFGDQHFVTDFLYQTEWQNWLKDGVLTKMDVAFSRDTEEKVYVQHRMLEHSAELFEWLQEGAAVYICGD
EKHMAHDVHNTLLEIIEKEGNMSREEAEAYLADMQQQKRYQRDVY
>P38038 1.8.1.2~~~cysJ~~~Sulfite reductase [NADPH] flavoprotein alpha-component~~~COG0369
MTTQVPPSALLPLNPEQLARLQAATTDLTPTQLAWVSGYFWGVLNQQPAALAATPAPAAEMPGITIISASQTGNARRVAE
ALRDDLLAAKLNVKLVNAGDYKFKQIASEKLLIVVTSTQGEGEPPEEAVALHKFLFSKKAPKLENTAFAVFSLGDSSYEF
FCQSGKDFDSKLAELGGERLLDRVDADVEYQAAASEWRARVVDALKSRAPVAAPSQSVATGAVNEIHTSPYSKDAPLVAS
LSVNQKITGRNSEKDVRHIEIDLGDSGMRYQPGDALGVWYQNDPALVKELVELLWLKGDEPVTVEGKTLPLNEALQWHFE
LTVNTANIVENYATLTRSETLLPLVGDKAKLQHYAATTPIVDMVRFSPAQLDAEALINLLRPLTPRLYSIASSQAEVENE
VHVTVGVVRYDVEGRARAGGASSFLADRVEEEGEVRVFIEHNDNFRLPANPETPVIMIGPGTGIAPFRAFMQQRAADEAP
GKNWLFFGNPHFTEDFLYQVEWQRYVKDGVLTRIDLAWSRDQKEKVYVQDKLREQGAELWRWINDGAHIYVCGDANRMAK
DVEQALLEVIAEFGGMDTEAADEFLSELRVERRYQRDVY
>Q79FV4 2.8.5.1~~~cysK2~~~S-sulfocysteine synthase~~~COG0031
MRSRQTRDRYRLLPEGYQVTPGRNRHPGTMVGNTPVLWIPELSGTSDPDRGFWAKLEGFNPGGMKDRPALYMVECARARG
DIAPGAAIVESTGGTLGLGLALAGKVYRHPVTLVTDPGLEPIIARMLTAYGAGVDMVTQPHPVGGWQQARKDRVAQLMAE
YPGAWNPNQYGNPDNVGAYRSLALELVAQLGRIDVLVCSVGTGGHSAGVARVLREFNPDMRLIGVDTIGSTIFGQPASNR
LMRGLGSSIYPRNVDYRAFDEVHWVAPPEAVWACRSLAATHYASGGWSVGAVALVAGWAARNLPADTTIAAVFPDGPQRY
FDTIYNDAYCNEHELLGGQPPTEPDEIASPLDAVVTRWTRSTTVIDPTQVVS
>P37887 2.5.1.47~~~cysK~~~Cysteine synthase~~~COG0031
MVRVANSITELIGNTPIVKLNRLADENSADVYLKLEYMNPGSSVKDRIGLAMIEAAEKEGKLKAGNTIIEPTSGNTGIGL
AMVAAAKGLKAILVMPDTMSMERRNLLRAYGAELVLTPGAEGMKGAIKKAEELAEKHGYFVPQQFNNPSNPEIHRQTTGK
EIVEQFGDDQLDAFVAGIGTGGTITGAGEVLKEAYPSIKIYAVEPSDSPVLSGGKPGPHKIQGIGAGFVPDILNTEVYDE
IFPVKNEEAFEYARRAAREEGILGGISSGAAIYAALQVAKKLGKGKKVLAIIPSNGERYLSTPLYQFD
>P0ABK5 2.5.1.47~~~cysK~~~Cysteine synthase A~~~COG0031
MSKIFEDNSLTIGHTPLVRLNRIGNGRILAKVESRNPSFSVKCRIGANMIWDAEKRGVLKPGVELVEPTSGNTGIALAYV
AAARGYKLTLTMPETMSIERRKLLKALGANLVLTEGAKGMKGAIQKAEEIVASNPEKYLLLQQFSNPANPEIHEKTTGPE
IWEDTDGQVDVFIAGVGTGGTLTGVSRYIKGTKGKTDLISVAVEPTDSPVIAQALAGEEIKPGPHKIQGIGAGFIPANLD
LKLVDKVIGITNEEAISTARRLMEEEGILAGISSGAAVAAALKLQEDESFTNKNIVVILPSSGERYLSTALFADLFTEKE
LQQ
>P45040 2.5.1.47~~~cysK~~~Cysteine synthase~~~COG0031
MAIYADNSYSIGNTPLVRLKHFGHNGNVVVKIEGRNPSYSVKCRIGANMVWQAEKDGTLTKGKEIVDATSGNTGIALAYV
AAARGYKITLTMPETMSLERKRLLCGLGVNLVLTEGAKGMKGAIAKAEEIVASDPSRYVMLKQFENPANPQIHRETTGPE
IWKDTDGKVDVVVAGVGTGGSITGISRAIKLDFGKQITSVAVEPVESPVISQTLAGEEVKPGPHKIQGIGAGFIPKNLDL
SIIDRVETVDSDTALATARRLMAEEGILAGISSGAAVAAADRLAKLPEFADKLIVVILPSASERYLSTALFEGIEG
>P9WP55 2.5.1.47~~~cysK1~~~O-acetylserine sulfhydrylase~~~COG0031
MSIAEDITQLIGRTPLVRLRRVTDGAVADIVAKLEFFNPANSVKDRIGVAMLQAAEQAGLIKPDTIILEPTSGNTGIALA
MVCAARGYRCVLTMPETMSLERRMLLRAYGAELILTPGADGMSGAIAKAEELAKTDQRYFVPQQFENPANPAIHRVTTAE
EVWRDTDGKVDIVVAGVGTGGTITGVAQVIKERKPSARFVAVEPAASPVLSGGQKGPHPIQGIGAGFVPPVLDQDLVDEI
ITVGNEDALNVARRLAREEGLLVGISSGAATVAALQVARRPENAGKLIVVVLPDFGERYLSTPLFADVAD
>Q7DDL5 2.5.1.47~~~cysK~~~Cysteine synthase~~~
MKIANSITELIGNTPLVKLNRLTEGLKAEVAVKLEFFNPGSSVKDRIAEAMIEGAEKAGKINKNTVIVEATSGNTGVGLA
MVCAARGYKLAITMPESMSKERKMLLRAFGAELILTPAAEGMAGAIAKAKSLVDAHPDTYFMPRQFDNEANPEVHRKTTA
EEIWRDTDGKVDVFVAGVGTGGTITGVGEVLKKYKPEVKVVAVEPEASPVLSGGEKGPHPIQGIGAGFIPTVLNTKIYDS
ITKVSNEAAFETARAIAEKEGILVGISSGAAVWSALQLAKQPENEGKLIVVLLPSYGERYLSTPLFADLA
>P0A1E3 2.5.1.47~~~cysK~~~Cysteine synthase A~~~
MSKIYEDNSLTIGHTPLVRLNRIGNGRILAKVESRNPSFSVKCRIGANMIWDAEKRGVLKPGVELVEPTSGNTGIALAYV
AAARGYKLTLTMPETMSIERRKLLKALGANLVLTEGAKGMKGAIQKAEEIVASDPQKYLLLQQFSNPANPEIHEKTTGPE
IWEDTDGQVDVFISGVGTGGTLTGVTRYIKGTKGKTDLITVAVEPTDSPVIAQALAGEEIKPGPHKIQGIGAGFIPGNLD
LKLIDKVVGITNEEAISTARRLMEEEGILAGISSGAAVAAALKLQEDESFTNKNIVVILPSSGERYLSTALFADLFTEKE
LQQ
>P63871 2.5.1.47~~~cysK~~~Cysteine synthase~~~
MAQKPVDNITQIIGGTPVVKLRNVVDDNAADVYVKLEYQNPGGSVKDRIALAMIEKAEREGKIKPGDTIVEPTSGNTGIG
LAFVCAAKGYKAVFTMPETMSQERRNLLKAYGAELVLTPGSEAMKGAIKKAKELKEEHGYFEPQQFENPANPEVHELTTG
PELLQQFEGKTIDAFLAGVGTGGTLSGVGKVLKKEYPNIEIVAIEPEASPVLSGGEPGPHKLQGLGAGFIPGTLNTEIYD
SIIKVGNDTAMEMSRRVAKEEGILAGISSGAAIYAAIQKAKELGKGKTVVTVLPSNGERYLSTPLYSFDD
>Q5XAQ3 2.5.1.47~~~cysK~~~Cysteine synthase~~~
MTKIYKTITELVGQTPIIKLNRLIPNEAADVYVKLEAFNPGSSVKDRIALSMIEAAEAEGLISPGDVIIEPTSGNTGIGL
AWVGAAKGYRVIIVMPETMSLERRQIIQAYGAELVLTPGAEGMKGAIAKTETLAIELGAWMPMQFNNPANPSIHEKTTAQ
EILEAFKEISLDAFVSGVGTGGTLSGVSHVLKKASPETVIYAVEAEESAVLSGQEPGPHKIQGISAGFIPNTLDTKAYDQ
IIRVKSKDALETARLTGAKEGFLVGISSGAALYAAIEVAKQLGKGKHVLTILPDNGERYLSTELYDVPVIKTK
>Q59966 2.5.1.47~~~srpG~~~Cysteine synthase~~~COG0031
MSTSGTFFADNSQTIGKTPLVRLNRIVKGAPATVLAKIEGRNPAYSVKCRIGAAMIWDAEQRGLLGPGKELIEPTSGNTG
IALAFVAAARGIPLTLTMPETMSLERRKLLAAYGAKLVLTEGVKGMTGAVRRAEDIAASDPDRYVLLQQFRNPANPAIHE
QTTGPEIWEDTGGAIDILVSGVGTGGTITGVSRYIKQTQGKPILSVAVEPEASPVISQQRSGLPLKPGPHKIQGIGAGFI
PENLDLSLVDQVERVSNEEAIAYARRLAQEEGLISGISCGAAVAAAVRLAQQSEHAGKTIVVVLPDSGERYLSTALFDGI
FNEQGLAVV
>P39647 ~~~cysL~~~HTH-type transcriptional regulator CysL~~~COG0583
MYYDVLKTFIAVVEEKNFTKAAEKLMISQPSVSLHIKNLEKEFQTALLNRSPKHFTTTPTGDILYQRAKQMVFLYEQAKA
EIYAHHHYVKGELKIAASFTIGEYILPPLLAQLQKLYPELNLDVMIGNTEEVSERVRMLQADIGLIEGHTNENELEIEPF
MEDEMCIAAPNQHPLAGRKEISISDLQNEAWVTREKGSGTREYLDHVLSSNGLRPKSMFTISSNQGVKEAVINGMGLSVL
SRSVLRKDLIHREISILHINNFSLKRKLSYIHSPLMENTKNKEIFITMLKSNYQSQLLK
>P16703 2.5.1.47~~~cysM~~~Cysteine synthase B~~~COG0031
MSTLEQTIGNTPLVKLQRMGPDNGSEVWLKLEGNNPAGSVKDRAALSMIVEAEKRGEIKPGDVLIEATSGNTGIALAMIA
ALKGYRMKLLMPDNMSQERRAAMRAYGAELILVTKEQGMEGARDLALEMANRGEGKLLDQFNNPDNPYAHYTTTGPEIWQ
QTGGRITHFVSSMGTTGTITGVSRFMREQSKPVTIVGLQPEEGSSIPGIRRWPTEYLPGIFNASLVDEVLDIHQRDAENT
MRELAVREGIFCGVSSGGAVAGALRVAKANPDAVVVAIICDRGDRYLSTGVFGEEHFSQGAGI
>P56067 2.5.1.47~~~cysM~~~Cysteine synthase~~~COG0031
MMIITTMQDAIGRTPVFKFTNKDYPIPLNSAIYAKLEHLNPGGSVKDRLGQYLIGEGFKTGKITSKTTIIEPTAGNTGIA
LALVAIKHHLKTIFVVPEKFSTEKQQIMRALGALVINTPTSEGISGAIKKSKELAESIPDSYLPLQFENPDNPAAYYHTL
APEIVQELGTNLTSFVAGIGSGGTFAGTARYLKERIPAIRLIGVEPEGSILNGGEPGPHEIEGIGVEFIPPFFENLDIDG
FETISDEEGFSYTRKLAKKNGLLVGSSSGAAFVAALKEAQRLPEGSQVLTIFPDVADRYLSKGIYL
>P9WP53 2.5.1.113~~~cysM~~~O-phosphoserine sulfhydrylase~~~COG0031
MTRYDSLLQALGNTPLVGLQRLSPRWDDGRDGPHVRLWAKLEDRNPTGSIKDRPAVRMIEQAEADGLLRPGATILEPTSG
NTGISLAMAARLKGYRLICVMPENTSVERRQLLELYGAQIIFSAAEGGSNTAVATAKELAATNPSWVMLYQYGNPANTDS
HYCGTGPELLADLPEITHFVAGLGTTGTLMGTGRFLREHVANVKIVAAEPRYGEGVYALRNMDEGFVPELYDPEILTARY
SVGAVDAVRRTRELVHTEGIFAGISTGAVLHAALGVGAGALAAGERADIALVVADAGWKYLSTGAYAGSLDDAETALEGQ
LWA
>P29848 2.5.1.47~~~cysM~~~Cysteine synthase B~~~
MNTLEQTIGNTPLVKLQRLGPDNGSEIWVKLEGNNPAGSVKDRAALSMIVEAEKRGEIKPGDVLIEATSGNTGIALAMIA
ALKGYRMKLLMPDNMSQERRAAMRAYGAELILVTKEQGMEGARDLALAMSERGEGKLLDQFNNPDNPYAHYTTTGPEIWR
QTSGRITHFVSSMGTTGTITGVSRFLREQEKPVTIVGLQPEEGSSIPGIRRWPAEYMPGIFNASLVDEVLDIHQNDAENT
MRELAVREGIFCGVSSGGAVAGALRVARATPGAIVVAIICDRGDRYLSTGVFGEEHFSQGAGI
>P9WNM5 ~~~cysNC~~~Bifunctional enzyme CysN/CysC~~~COG0529
MTTLLRLATAGSVDDGKSTLIGRLLYDSKAVMEDQWASVEQTSKDRGHDYTDLALVTDGLRAEREQGITIDVAYRYFATP
KRKFIIADTPGHIQYTRNMVTGASTAQLVIVLVDARHGLLEQSRRHAFLASLLGIRHLVLAVNKMDLLGWDQEKFDAIRD
EFHAFAARLDVQDVTSIPISALHGDNVVTKSDQTPWYEGPSLLSHLEDVYIAGDRNMVDVRFPVQYVIRPHTLEHQDHRS
YAGTVASGVMRSGDEVVVLPIGKTTRITAIDGPNGPVAEAFPPMAVSVRLADDIDISRGDMIARTHNQPRITQEFDATVC
WMADNAVLEPGRDYVVKHTTRTVRARIAGLDYRLDVNTLHRDKTATALKLNELGRVSLRTQVPLLLDEYTRNASTGSFIL
IDPDTNGTVAAGMVLRDVSARTPSPNTVRHRSLVTAQDRPPRGKTVWFTGLSGSGKSSVAMLVERKLLEKGISAYVLDGD
NLRHGLNADLGFSMADRAENLRRLSHVATLLADCGHLVLVPAISPLAEHRALARKVHADAGIDFFEVFCDTPLQDCERRD
PKGLYAKARAGEITHFTGIDSPYQRPKNPDLRLTPDRSIDEQAQEVIDLLESSS
>P23845 2.7.7.4~~~cysN~~~Sulfate adenylyltransferase subunit 1~~~COG2895
MNTALAQQIANEGGVEAWMIAQQHKSLLRFLTCGSVDDGKSTLIGRLLHDTRQIYEDQLSSLHNDSKRHGTQGEKLDLAL
LVDGLQAEREQGITIDVAYRYFSTEKRKFIIADTPGHEQYTRNMATGASTCELAILLIDARKGVLDQTRRHSFISTLLGI
KHLVVAINKMDLVDYSEETFTRIREDYLTFAGQLPGNLDIRFVPLSALEGDNVASQSESMPWYSGPTLLEVLETVEIQRV
VDAQPMRFPVQYVNRPNLDFRGYAGTLASGRVEVGQRVKVLPSGVESNVARIVTFDGDREEAFAGEAITLVLTDEIDISR
GDLLLAADEALPAVQSASVDVVWMAEQPLSPGQSYDIKIAGKKTRARVDGIRYQVDINNLTQREVENLPLNGIGLVDLTF
DEPLVLDRYQQNPVTGGLIFIDRLSNVTVGAGMVHEPVSQATAAPSEFSAFELELNALVRRHFPHWGARDLLGDK
>P9WP33 ~~~cysO~~~Sulfur carrier protein CysO~~~COG1977
MNVTVSIPTILRPHTGGQKSVSASGDTLGAVISDLEANYSGISERLMDPSSPGKLHRFVNIYVNDEDVRFSGGLATAIAD
GDSVTILPAVAGG
>P16700 ~~~cysP~~~Thiosulfate-binding protein~~~COG4150
MAVNLLKKNSLALVASLLLAGHVQATELLNSSYDVSRELFAALNPPFEQQWAKDNGGDKLTIKQSHAGSSKQALAILQGL
KADVVTYNQVTDVQILHDKGKLIPADWQSRLPNNSSPFYSTMGFLVRKGNPKNIHDWNDLVRSDVKLIFPNPKTSGNARY
TYLAAWGAADKADGGDKGKTEQFMTQFLKNVEVFDTGGRGATTTFAERGLGDVLISFESEVNNIRKQYEAQGFEVVIPKT
NILAEFPVAWVDKNVQANGTEKAAKAYLNWLYSPQAQTIITDYYYRVNNPEVMDKLKDKFPQTELFRVEDKFGSWPEVMK
THFTSGGELDKLLAAGRN
>P22255 3.1.3.7~~~cysQ~~~3'(2'),5'-bisphosphate nucleotidase CysQ~~~COG1218
MLDQVCQLARNAGDAIMQVYDGTKPMDVVSKADNSPVTAADIAAHTVIMDGLRTLTPDVPVLSEEDPPGWEVRQHWQRYW
LVDPLDGTKEFIKRNGEFTVNIALIDHGKPILGVVYAPVMNVMYSAAEGKAWKEECGVRKQIQVRDARPPLVVISRSHAD
AELKEYLQQLGEHQTTSIGSSLKFCLVAEGQAQLYPRFGPTNIWDTAAGHAVAAAAGAHVHDWQGKPLDYTPRESFLNPG
FRVSIY
>P9WKJ1 3.1.3.7~~~cysQ~~~3'-phosphoadenosine 5'-phosphate phosphatase~~~COG1218
MVSPAAPDLTDDLTDAELAADLAADAGKLLLQVRAEIGFDQPWTLGEAGDRQANSLLLRRLQAERPGDAVLSEEAHDDLA
RLKSDRVWIIDPLDGTREFSTPGRDDWAVHIALWRRSSNGQPEITDAAVALPARGNVVYRTDTVTSGAAPAGVPGTLRIA
VSATRPPAVLHRIRQTLAIQPVSIGSAGAKAMAVIDGYVDAYLHAGGQWEWDSAAPAGVMLAAGMHASRLDGSPLRYNQL
DPYLPDLLMCRAEVAPILLGAIADAWR
>P26264 3.1.3.7~~~cysQ~~~3'(2'),5'-bisphosphate nucleotidase CysQ~~~
MLEQVCQLARNAGDAIMQVYDGAKPMEYARKQDDSPVTAADIAAHTVILEGLRTLTPDIPVLSEEDPPAWEVRQHWQRYW
LVDPLDGTKEFIKRNGEFTVNIALIEQGKPVLGVVYAPVLKVMYYAAEGKAWKEECGVRKQIQVRDARPPLVVISRSHTD
DELTEYLQQLGEHQTTSIGSSLKFCLVAEGQAQLYPRFGPTSVWDTAAGHAIAVAAGAHVHDWQGKTLDYTPRESFLNPG
FRVTIY
>P27369 ~~~cysR~~~Regulatory protein CysR~~~COG0664
MVREPASTLLPPTSPATPAPHRLLIGRRGMVPTGANVIWKIQSGLVRSSTWGEEGDMISLGLWGPGDLIGRPLSCLDPYE
LECLTAVEVVAVSDPALESHESLVRSLRYTERLLSITRLRRAEAKLASLLGWIGERFGQPGATGWEIDLRRIPLTHQVIA
ELSGSTRVTTTRLLGEFRQAGRIHRRDRALIVRYPETLYPPARLSA
>P27367 ~~~cysT~~~Sulfate transport system permease protein CysT~~~COG0555
MSLRLPSLSFTWLTRLSWSWRFTWVYLTLILFIPIIALFLKSASLPLGRIWELATQPVAVAAYEVTFGLSLAAAALNGVF
GVIIAWVLTRYDFPGKKLFDSFIDLPFALPTAVAGLTLATVYSDKGWIGQFIAPFGVQIAFTRWGVLLAMVFISLPFVVR
TVEPLLLELEVEAEEAAASLGASPSETFWRVILPPILPGVLAGVAQGFSRAVGEFGSVVIISGNLPFDDLIAPVLIFERL
EQYDYAGATVIGSVLLLFSLVILFVINALQNWSSRYNG
>Q01895 ~~~cysT~~~Sulfate transport system permease protein CysT~~~COG0555
MTTNLPFSSPSKQLNRFSFWQSISIPWVVTIIYLLLILVLPIAALLVKSASLGLEGFWQIATTPIAISTYNVTFITALAA
GLVNGVMGTLVAWVLVRCQFPGKKIVDAMVDLPFALPTSVAGLVLATLYSQTGWVGRFFAPFGIQIAFSRLGVFVAMVFI
SLPFIVRTLQPVLQELEEEAEEAAWSLGATEFQTFWRVIFPPLIPPILTGIALGFSRAVGEYGSVVLIASNIPFKDLIAP
VLVFERLEQYDYPAATVIGAVLLSVSLILLLIINLLQQWGRRYAND
>P0AEB0 ~~~cysW~~~Sulfate transport system permease protein CysW~~~COG4208
MAEVTQLKRYDARPINWGKWFLIGIGMLVSAFILLVPMIYIFVQAFSKGLMPVLQNLADPDMLHAIWLTVMIALIAVPVN
LVFGILLAWLVTRFNFPGRQLLLTLLDIPFAVSPVVAGLVYLLFYGSNGPLGGWLDEHNLQIMFSWPGMVLVTIFVTCPF
VVRELVPVMLSQGSQEDEAAILLGASGWQMFRRVTLPNIRWALLYGVVLTNARAIGEFGAVSVVSGSIRGETLSLPLQIE
LLEQDYNTVGSFTAAALLTLMAIITLFLKSMLQWRLENQEKRAQQEEHHEH
>P27370 ~~~cysW~~~Sulfate transport system permease protein CysW~~~COG4208
MVAFVKARRQIAGVKESKSLLPLALIGISLLYVGLIIIIPAANVAVQAFSEGLSGFIKNLGDRNLQEAIRLTLLMGVISV
PLNTLFGLAAAFAIARKQFPGKSLLLSVIDLPFSISPVVAGLMIVLLYGRNGWLGPLLESNDIKIIFAWPGMALATIFVS
MPFVAREVIPNLEEIGTDAEEAASTLGANGWQTFWRVTLPSIKWSMLYGVVLTTARALGEFGAVSVVSGSITGKTQTLPL
FVEEAYKQYQTTLSYTAALLLGGISLVTLVLKALLEARTGRQSRIH
>P0A6J3 ~~~cysZ~~~Sulfate transporter CysZ~~~COG2981
MVSSFTSAPRSGFYYFAQGWKLVSQPGIRRFVILPLLVNILLMGGAFWWLFTQLDVWIPTLMSYVPDWLQWLSYLLWPLA
VISVLLVFGYFFSTIANWIAAPFNGLLAEQLEARLTGATPPDTGIFGIMKDVPRIMKREWQKFAWYLPRAIVLLILYFIP
GIGQTVAPVLWFLFSAWMLAIQYCDYPFDNHKVPFKEMRTALRTRKITNMQFGALTSLFTMIPLLNLFIMPVAVCGATAM
WVDCYRDKHAMWR
>P0ACN7 ~~~cytR~~~HTH-type transcriptional repressor CytR~~~COG1609
MKAKKQETAATMKDVALKAKVSTATVSRALMNPDKVSQATRNRVEKAAREVGYLPQPMGRNVKRNESRTILVIVPDICDP
FFSEIIRGIEVTAANHGYLVLIGDCAHQNQQEKTFIDLIITKQIDGMLLLGSRLPFDASIEEQRNLPPMVMANEFAPELE
LPTVHIDNLTAAFDAVNYLYEQGHKRIGCIAGPEEMPLCHYRLQGYVQALRRCGIMVDPQYIARGDFTFEAGSKAMQQLL
DLPQPPTAVFCHSDVMALGALSQAKRQGLKVPEDLSIIGFDNIDLTQFCDPPLTTIAQPRYEIGREAMLLLLDQMQGQHV
GSGSRLMDCELIIRGSTRALP
>P13511 ~~~czcA~~~Cobalt-zinc-cadmium resistance protein CzcA~~~
MFERIISFAIQQRWLVLLAVFGMAGLGIFSYNRLPIDAVPDITNVQVQVNTSAPGYSPLETEQRATYPIEVVMAGLPGLE
QTRSLSRYGLSQVTVIFKDGTDVYFARQLVNQRIQEAKDNLPEGVVPAMGPISTGLGEIYLWTVEAEEGARKADGTAYTP
TDLREIQDWVVRPQLRNVPGVTEINTIGGFNKQYLVAPSLERLASYGLTLTDVVNALNKNNDNVGAGYIERRGEQYLVRA
PGQVASEDDIRNIIVGTAQGQPIRIRDIGDVEIGKELRTGAATENGKEVVLGTVFMLIGENSRAVSKAVDEKVASINRTM
PEGVKIVTVYDRTRLVDKAIATVKKNLLEGAVLVIVILFLFLGNIRAALITATIIPLAMLFTFTGMVNYKISANLMSLGA
LDFGIIIDGAVVIVENCVRRLAHAQEHHGRPLTRSERFHEVFAAAKEARRPLIFGQLIIMIVYLPIFALTGVEGKMFHPM
AFTVVLALLGAMILSVTFVPAAVALFIGERVAEKENRLMLWAKRRYEPLLEKSLANTAVVLTFAAVSIVLCVAIAARLGS
EFIPNLNEGDIAIQALRIPGTSLSQSVEMQKTIETTLKAKFPEIERVFARTGTAEIASDLMPPNISDGYIMLKPEKDWPE
PKKTHAELLSAIQEEAGKIPGNNYEFSQPIQLRFNELISGVRSDVAVKIFGDDNNVLSETAKKVSAVLQGIPGAQEVKVE
QTTGLPMLTVKIDREKAARYGLNMSDVQDAVATGVGGRDSGTFFQGDRRFDIVVRLPEAVRGEVEALRRLPIPLPKGVDA
RTTFIPLSEVATLEMAPGPNQISRENGKRRIVISANVRGRDIGSFVPEAEAAIQSQVKIPAGYWMTWGGTFEQLQSATTR
LQVVVPVALLLVFVLLFAMFNNIKDGLLVFTGIPFALTGGILALWIRGIPMSITAAVGFIALCGVAVLNGLVMLSFIRSL
REEGHSLDSAVRVGALTRLRPVLMTALVASLGFVPMAIATGTGAEVQRPLATVVIGGILSSTALTLLVLPVLYRLAHRKD
EDAEDTREPVTQTHQPDQGRQPA
>P13510 ~~~czcB~~~Cobalt-zinc-cadmium resistance protein CzcB~~~
MAISNKQKAAIAAIVLVGGVATGGVLLSGRSAPEEQGGHSESKGHGDTEHHGKQAAEADHKDDKSHGDGEHHEVKKGPNG
GALFSRDGYDVEIGTAESKGEARIRLWVSKSGKAVANGVAATGQLVRATGESQALKFVVSGDALESQQPVAEPHVFDVTA
NVTLPGSSSPLAVRLSKEEGKIELTADQLAKTGVVVQTAGSAKVQAGVQFPGEIRFNEDKTAHVVPRLAGVVESVPANIG
QQVKKGQVLAVIASTGLSDQRSELLAAQKRLDLARVTYDREKKLWEQKISAEQDYLSARNALQEAQISVQNAQQKLTAIG
ASNSSTALNRYELRAPFDGMIVEKHISLGEAVADNANVFTLSDLSSVWAEFVVSAKDVERVRIGEKASINSASSDVKADG
TVSYVGSLLGEQTRTAKARVTLTNPQMAWRPGLFVTVDVFGADVEVPVAVKTEAVQDVNGESVVFVAVQGGFVPQPVKVG
RTNGKVIEIVEGLKPGARYAAANSFVLKAELGKSSAEHGH
>P13509 ~~~czcC~~~Cobalt-zinc-cadmium resistance protein CzcC~~~
MRRLFLPLGLAVAFLSPNFAVAQSDTGTSMVPVFPREAAGPLTLEAALSLAAGSNFNLSAAAKELDSTEGGIMQARVIPN
PELKTLVEDTRKSTRTSTAQMNIPIELGGKRSARINAAERTRELAQATLAGVRGDIRAQVIESFFSVLIAQERVKLATGS
ADIAARGAQAASRRVAAGKISPVDETKARVEQANAELELAEATASLQSARQALTALWGNASPQFAEAQGNLDALPSRPAP
ELLQKELENSPLVAASRAELDRRQALVGVERSRQYPDLTVSLGAKRDTEANRNMAVIGVAIPLPIFDRNQGNLYSAIRQA
DKAQDEYLANRISLTRNLLMASNQLSVSRASAQTLKQTVLPGAEQAFNAATIGFEAGKFNYLDVLDAQRTLFQARIRYLG
VLGQTYQAATTIDRILGR
>O07084 ~~~czcD~~~Cadmium, cobalt and zinc/H(+)-K(+) antiporter~~~COG1230
MGHNHNEGANKKVLLISFIMITGYMIIEAIGGFLTNSLALLSDAGHMLSDSISLMVALIAFTLAEKKANHNKTFGYKRFE
ILAAVINGAALILISLYIIYEAIERFSNPPKVATTGMLTISIIGLVVNLLVAWIMMSGGDTKNNLNIRGAYLHVISDMLG
SVGAILAAILIIFFGWGWADPLASIIVAILVLRSGYNVTKDSIHILMEGTPENIDVSDIIRTIEGTEGIQNIHDLHIWSI
TSGLNALSCHAVVDDQLTISESENILRKIEHELEHKGITHVTIQMETEAHNHDNAILCQPKMEKQRDHHHH
>P13512 ~~~czcD~~~Metal cation efflux system protein CzcD~~~
MGAGHSHDHPGGNERSLKIALALTGTFLIAEVVGGVMTKSLALISDAAHMLTDTVALAIALAAIAIAKRPADKKRTFGYY
RFEILAAAFNALLLFGVAIYILYEAYLRLKSPPQIESTGMFVVAVLGLIINLISMRMLSSGQSSSLNVKGAYLEVWSDLL
GSVGVIAGAIIIRFTGWAWVDSAIAVLIGLWVLPRTWILLKSSLNVLLEGVPDDVDLAEVEKQILATPGVKSFHDLHIWA
LTSGKASLTVHVVNDTAVNPEMEVLPELKQMLADKFDITHVTIQFELAPCEQADAAQHFNASPALVGSKSLAAGGN
>Q44009 ~~~czcI~~~Cobalt-zinc-cadmium resistance protein CzcI~~~
MRRFVLIFVLLILPFQFSWAAAARYCQHEKATATWHLGHHEHRHQQPEGKTDAEKKPFVDTDCGVCHLVSLPFVYGQTQD
VLIANRVEVTDTQHSSEFSSLNARAPDRPQWQRLA
>O07085 1.-.-.-~~~czcO~~~Uncharacterized oxidoreductase CzcO~~~COG2072
MYDTIVIGAGQAGISIGYYLKQSDQKFIILDKSHEVGESWKDRYDSLVLFTSRMYSSLPGMHLEGEKHGFPSKNEIVAYL
KKYVKKFEIPIQLRTEVISVLKIKNYFLIKTNREEYQTKNLVIATGPFHTPNIPSISKDLSDNINQLHSSQYKNSKQLAY
GNVLVVGGGNSGAQIAVELSKERVTYLACSNKLVYFPLMIGKRSIFWWFDKLGVLHASHTSIVGKFIQKKGDPVFGHELK
HAIKQKEIILKKRVIAAKQNEIIFKDSSTLEVNNIIWATGFRNPLCWINIKGVLDQEGRIIHHRGVSPVEGLYFIGLPWQ
HKRGSALLQGVGNDAEYIVKQMNGE
>Q44006 ~~~czcR~~~Transcriptional activator protein CzcR~~~
MRVLVVEDEPRTAEYLQKGLSESGFVVDIANNGGDGLHMAEETDYDVIILDVMLPGMDGWTVIKSIRSKSETPVLFLTAL
DDVADRVRGFELGADDYLVKPFAFAELLARIRRCLRQSTSKESERLRIADLDIDVLGRRVFRGTTRIELTNQEFSLLHLL
MRRRGEVLSRTTIASQVWGVNFDTDTNVVDVAIRRLRSKVDDPFDQKLIHTVRGMGYVLDPERGR
>Q44007 2.7.13.3~~~czcS~~~Sensor protein CzcS~~~
MRPGTSITPLSLTRRLGLFFALVLSIALASMGAFAYYSLAAQLEARDDEVVKGKLEQVEHFLREVDGVQGVPAAQHRFDD
LVRGYSDLIVRVTALDGRLLFRTGNDALLEGTDQAAVTGKSSLMFQSADAVLGRDGTRATVFVAKSGEDRKQVTARFRTT
LVLGTTVGVILTALVGAAITRRELEPAHVLIKQINRISVERLSYRVDMPPKPTEVRDIASAFNAMLQRLEDGYQKLSRFS
ADLAHDLRTPLNNLIGHAEVALSRDRTGPEYVALVEESLVEYQRLARMIDAMLFLARADSANVALELTELQLNAELRKLS
AYFSVLAEERSVVIRVSGDATLVADAILFQRAINNVLSNAVRHAWPNSMIDLVVRREAAHCCIDITNVGDPIPERELSLI
FDRFFRGDRARSNSSQSTGLGLAIVLSIMELHGGDASAVSGLDGKTRFTLRFPLNGAEASARVSVGRPSQDRPVVG
>O31844 ~~~czrA~~~HTH-type transcriptional repressor CzrA~~~COG0640
MTEFRETEQSAADLDEETLFLVAQTFKALSDPTRIRILHLLSQGEHAVNGIAEKLNLLQSTVSHQLRFLKNLRLVKSRRE
GTSIYYSPEDEHVLDVLQQMIHHTQHD
>P77748 1.1.99.39~~~ydiJ~~~D-2-hydroxyglutarate dehydrogenase~~~COG0247
MIPQISQAPGVVQLVLNFLQELEQQGFTGDTATSYADRLTMSTDNSIYQLLPDAVVFPRSTADVALIARLAAQERYSSLI
FTPRGGGTGTNGQALNQGIIVDMSRHMNRIIEINPEEGWVRVEAGVIKDQLNQYLKPFGYFFAPELSTSNRATLGGMINT
DASGQGSLVYGKTSDHVLGVRAVLLGGDILDTQPLPVELAETLGKSNTTIGRIYNTVYQRCRQQRQLIIDNFPKLNRFLT
GYDLRHVFNDEMTEFDLTRILTGSEGTLAFITEARLDITRLPKVRRLVNVKYDSFDSALRNAPFMVEARALSVETVDSKV
LNLAREDIVWHSVSELITDVPDQEMLGLNIVEFAGDDEALIDERVNALCARLDELIASHQAGVIGWQVCRELAGVERIYA
MRKKAVGLLGNAKGAAKPIPFAEDTCVPPEHLADYIAEFRALLDSHGLSYGMFGHVDAGVLHVRPALDMCDPQQEILMKQ
ISDDVVALTAKYGGLLWGEHGKGFRAEYSPAFFGEELFAELRKVKAAFDPHNRLNPGKICPPEGLDAPMMKVDAVKRGTF
DRQIPIAVRQQWRGAMECNGNGLCFNFDARSPMCPSMKITQNRIHSPKGRATLVREWLRLLADRGVDPLKLEQELPESGV
SLRTLIARTRNSWHANKGEYDFSHEVKEAMSGCLACKACSTQCPIKIDVPEFRSRFLQLYHTRYLRPLRDHLVATVESYA
PLMARAPKTFNFFINQPLVRKLSEKHIGMVDLPLLSVPSLQQQMVGHRSANMTLEQLESLNAEQKARTVLVVQDPFTSYY
DAQVVADFVRLVEKLGFQPVLLPFSPNGKAQHIKGFLNRFAKTAKKTADFLNRMAKLGMPMVGVDPALVLCYRDEYKLAL
GEERGEFNVLLANEWLASALESQPVATVSGESWYFFGHCTEVTALPGAPAQWAAIFARFGAKLENVSVGCCGMAGTYGHE
AKNHENSLGIYELSWHQAMQRLPRNRCLATGYSCRSQVKRVEGTGVRHPVQALLEIIK
>Q57252 1.1.99.39~~~~~~D-2-hydroxyglutarate dehydrogenase~~~COG0247
MLPNLNRIPQVEQYVLDYLDDLQCQHFEGDIATNYADRLSLATDNSVYQQLPQAILFPKTVADIVRITKLANLPEYQSIS
FTPRGGGTGTNGQSINNNIIVDLSRHMTAILELNVKERWVRVQAGVVKDQLNQFLKPHGLFFAPELSTSNRATLGGMINT
DASGQGSLQYGKTSNHVLALRAVLINGEILDTSAVNSVDVLENIDALELSESSKKLHQTIAQHCKEKRAAIIKDLPQLNR
FLTGYDLKNVFNEDESEFNLTRILTGSEGSLAFICEAKLNLLLIPQYRTLINIKYRSFDAALRNAPFMVKANALSVETVD
SKVLNLAKQDIIWHSVNELLTEDEKDPILGLNIVEFAGNNKEKIDRQVTALCRLLDEKIEHNQDHIIGYQVCSDLPSIER
IYAMRKKAVGLLGNAKGAAKPIPFVEDTCVPPENLADYISEFRALLDQHNLQYGMFGHVDAGVLHVRPALDLCDKEQVKL
FKQISDEVAELTIKYGGLLWGEHGKGVRSHYGEKFFTPELWHELRYIKTLFDPNNRLNPGKICTPLDSKDELYSILSPMR
ADKDRQIPIQIRDEFKGAMNCNGNGLCFNFDEHSIMCPSMKVSKNRVFSPKGRAAMVREWLRLMANENVSPEQLDFRKTE
IKLTALVKRLSNTVQKWRGNYDFSHEVKAAMDTCLACKACASQCPIKIDVPSFRAKFFHFYHSRYLRPTKDHIVANLEIA
APYMAKQAKFFNYFTKLKVTQTLVEKTLGMTDLPLLSEPSLQQQLVEIHYQGKSLEELESLSAVEKNDILFIVQDPYTSY
YDAKVIRDFVMLTQKLGFKPILLPFKPNGKAMHIKGFLKRFSKTAQNQAEFLNRMAKLGIPLVGVDPAIVLSYRDEYKEA
LQEKRGDFHVLTAHEWLKQRLQNADLQEKLKNIAKTDRTLGWYLFPHCTESTFMPNSPKEWQEIFGRFGQQLNVEKVGCC
GMAGVFGHEVQNQKMSREIYDVSWHKKLHGKDPHFCLATGYSCRSQVKRYEHVVLKHPVQALLEVLK
>A0A0H3KZS3 1.1.99.39~~~ydiJ~~~D-2-hydroxyglutarate dehydrogenase~~~COG0247
MVTYSMIPQISQAPGLIQRVLTFLETLKAQGFTGDTATSYADRLSLSTDNSIYQLLPDAVLFPRSTADVALIARLAGEAA
FSSLVFTPRGGGTGTNGQSLNQGIIVDMSRHMNRILEINTEQRWVRVEAGVVKDQLNAYLKPFGFFFSPELSTSNRATLG
GMINTDASGQGSLVYGKTSDHVLGLRAVLLGGDILDTRPVPTALAENLAQTPTPEGRIYQQVLTRCREHRELILEKFPKL
NRFLTGYDLRHVFSDDMQTFDLTRLLCGAEGTLAFISEARLDITPLPKVRRVVNIKYDAFDSALRNAPLMVEAQALSVET
VDSKVLNLAREDIVWHSVRELITAIPDKEMLGLNIVEFAGDDAGQIDRQITQLCARLDTLMTQQQGGVIGYQLCDDLDGI
ERIYNMRKKAVGLLGNAKGRAKPIPFVEDTAVPPEHLADYIVEFRALLDSHGLSYGMFGHVDAGVLHVRPALDMCDPHQE
MMMKQISDEVVALTARYGGLLWGEHGKGFRAQYSPAFFGETLFNELRRIKAAFDPHNRLNPGKICTPFDSEAAMMQVDAT
KRGSYDRQIPLQVRETWRGALECNGNGLCFNFDARSPMCPSMKITRNRIHSPKGRATLTREWLRLLAEQGADPVMLEKKL
PESSLSLRALISRMRNTWYANKGEYDFSHEVKEAMSGCLACKACSTQCPIKIDVPAFRSRFLQLYHTRYLRPLSDHLVAG
VESYAPLMAKAPGVFNFFLKQPWATSFSKTHIGMVDLPLLSSPTLKQQLSGHPAMNMTLEQLEALSETQRAQKVLVVQDP
FTSFYEAKLVHDFIRLIEKLGYQPVLLPFSPNGKAQHVKGFLQRFARTASKTADFLNRVAKLGMPMVGIDPATVLCYRDE
YHQMLGEARGDFNVLLVHEWLHQALQEREVQVTSGEAWYLFAHCTEVTALPGTPGQWQAIFSRFGAKLENINVGCCGMAG
TYGHESQNLENSLGIYALSWHPQLQKLPRQRCLATGFSCRSQVKRVEGNGMRHPLQALLELI
>Q88EH0 1.1.99.39~~~ydiJ~~~D-2-hydroxyglutarate dehydrogenase~~~COG0247
MIAQLSTVAPSANYPEFLEALRNSGFRGQISADYATRTVLATDNSIYQRLPQAAVFPLDADDVARVATLMGEPRFQQVKL
TPRGGGTGTNGQSLTDGIVVDLSRHMNNILEINVEERWVRVQAGTVKDQLNAALKPHGLFFAPELSTSNRATVGGMINTD
ASGQGSCTYGKTRDHVLELHSVLLGGERLHSLPIDDAALEQACAAPGRVGEVYRMAREIQETQAELIETTFPKLNRCLTG
YDLAHLRDEQGRFNLNSVLCGAEGSLGYVVEAKLNVLPIPKYAVLVNVRYTSFMDALRDANALMAHKPLSIETVDSKVLM
LAMKDIVWHSVAEYFPADPERPTLGINLVEFCGDEPAEVNAKVQAFIQHLQSDTSVERLGHTLAEGAEAVTRVYTMRKRS
VGLLGNVEGEVRPQPFVEDTAVPPEQLADYIADFRALLDGYGLAYGMFGHVDAGVLHVRPALDMKDPVQAALVKPISDAV
AALTKRYGGLLWGEHGKGLRSEYVPEYFGELYPALQRLKGAFDPHNQLNPGKICTPLGSAEGLTPVDGVTLRGDLDRTID
ERVWQDFPSAVHCNGNGACYNYDPNDAMCPSWKATRERQHSPKGRASLMREWLRLQGEANIDVLAAARNKVSWLKGLPAR
LRNNRARNQGQEDFSHEVYDAMAGCLACKSCAGQCPIKVNVPDFRSRFLELYHGRYQRPLRDYLIGSLEFTIPYLAHAPG
LYNAVMGSKWVSQLLADKVGMVDSPLISRFNFQATLTRCRVGMATVPALRELTPAQRERSIVLVQDAFTRYFETPLLSAF
IDLAHRLGHRVFLAPYSANGKPLHVQGFLGAFAKAAIRNATQLKALADCGVPLVGLDPAMTLVYRQEYQKVPGLEGCPKV
LLPQEWLMDVLPEQAPAAPGSFRLMAHCTEKTNVPASTRQWEQVFARLGLKLVTEATGCCGMSGTYGHEARNQETSRTIF
EQSWATKLDKDGEPLATGYSCRSQVKRMTERKMRHPLEVVLQYAQR
>A4VGK4 1.1.99.39~~~d2hgdh~~~D-2-hydroxyglutarate dehydrogenase~~~COG0277
MTDPALIDELKTLVEPGKVLTDADSLNAYGKDWTKHFAPAPSAIVFPKSIEQVQAIVRWANAHKVALVPSGGRTGLSAAA
VAANGEVVVSFDYMNQILEFNEMDRTAVCQPGVVTAQLQQFAEDKGLYYPVDFASAGSSQIGGNIGTNAGGIKVIRYGMT
RNWVAGMKVVTGKGDLLELNKDLIKNATGYDLRQLFIGAEGTLGFVVEATMRLERQPTNLTALVLGTPDFDSIMPVLHAF
QDKLDLTAFEFFSDKALAKVLGRGDVPAPFETDCPFYALLEFEATTEERAEQALATFEHCVEQGWVLDGVMSQSEQQLQN
LWKLREYISETISHWTPYKNDISVTVGKVPAFLKEIDAIVGEHYPDFEIVWFGHIGDGNLHLNILKPDAMDKDEFFGKCA
TVNKWVFETVQKYNGSISAEHGVGMTKRDYLEYSRSPAEIEYMKAVKAVFDPNGIMNPGKIFAA
>P0DV35 1.1.99.39~~~~~~D-2-hydroxyglutarate dehydrogenase~~~
MTDPRILSLQQAVPALRLKTEPADLEHYGRDWTRRWTPNPLAIALPGSVEEVQAVVRWANAQAVAVVPSGGRTGLSGGAV
AANGELVLSLERLNKPLDFNAVDRTLTVQAGMPLEAVHNAAREQGLVYPVDFAARGSCSIGGNIATNAGGIRVIRYGNTR
EWVAGLKVVTGSGELLELNNALVKNSSGYDFRHLMIGSEGTLGIVVEATLRLTDPPPPSNVMLLALPSFDVLMQVFAAFR
AQLRLEAFEFFTDRALEHVLAHGAQAPFAEIHPYYVVTEFAAGDEAQEAAAMAAFETCMEQGWVSDGVISQSDAQAAQLW
RLREGITEALARYTPYKNDVSVRISAMPAFLAETQALLHDAYPDFDVVWFGHIGDGNLHINVLKPDATSQADFVAACDQV
TKLLAQALQRFDGSISAEHGIGLVKKSYLWSTRSAEEIALMRGIKHVLDPHLLLNPGKLFETHDAPTNIPAG
>P19938 2.6.1.21~~~dat~~~D-alanine aminotransferase~~~
MGYTLWNDQIVKDEEVKIDKEDRGYQFGDGVYEVVKVYNGEMFTVNEHIDRLYASAEKIRITIPYTKDKFHQLLHELVEK
NELNTGHIYFQVTRGTSPRAHQFPENTVKPVIIGYTKENPRPLENLEKGVKATFVEDIRWLRCDIKSLNLLGAVLAKQEA
HEKGCYEAILHRNNTVTEGSSSNVFGIKDGILYTHPANNMILKGITRDVVIACANEINMPVKEIPFTTHEALKMDELFVT
STTSEITPVIEIDGKLIRDGKVGEWTRKLQKQFETKIPKPLHI
>P99090 2.6.1.21~~~dat~~~D-alanine aminotransferase~~~
MEKIFLNGEFVSPSEAKVSYNDRGYVFGDGIYEYIRVYNGKLFTVTEHYERFLRSANEIGLDLNYSVEELIELSRKLVDM
NQIETGAIYIQATRGVAERNHSFPTPEVEPAIVAYTKSYDRPYDHLENGVNGVTVEDIRWLRCDIKSLNLLGNVLAKEYA
VKYNAVEAIQHRGETVTEGSSSNAYAIKDGVIYTHPINNYILNGITRIVIKKIAEDYNIPFKEETFTVDFLKNADEVIVS
STSAEVTPVIKLDGEPINDGKVGPITRQLQEGFEKYIESHSI
>P13719 ~~~daaE~~~F1845 fimbrial protein~~~
MKKLAIMAAASMIFTVGSAQATFQASGTTGITTLTVTEECRVQVGNVTATLARSKLKDDTAIGVIGVTALGCNGLQAALQ
ADPDNYDATNLYMTSRNHDKLNVKLKATDGSSWTYGNGVFYKTEGGNWGGHVGISVDGNQTDKPTGEYTLNLTGGYWTN
>D0KWS7 ~~~dabA2~~~Probable inorganic carbon transporter subunit DabA2~~~COG3002
MTTLTSLQRSEAQRNHIVDLIDKACLRIAPIWPLDSFVAVNPYLGLIDQPFDTVGRYLEQTVGESLFMDHGWFADKIAQG
EITDDDLAQAAQQLDPSISLDTIKQQLAVHRQPAPALPLVTNELDRRDAPPVSEFVIEQVSQFMANYYDRGQALWHLPKE
ASASLFAQWRRYTLINRSASAVGLKQVRQHLLAVPSDAIDALFWALDQINLPESRLPDYLFTLLKTIGGWASWCRYLHFQ
AGLHGESQHDLRDLLIIRLVWVALVIKETSSAGRQQWRAKLNDWFDPAKLVASPSATATASTKAQSSRIDEILLAAAEQA
FRRRINAGLNRQPADAPDQQAERPTVQAAFCIDVRSEVFRRHLEASSPGLETIGFAGFFGLPIDYCRMGESEARLQNPVL
INPAYRAQETGDPAIAQHRHARQSRGAIWKQFKLSAASCFTFVESAGLSYVPRLLADSLGWHRSSLPPDAPGLTPEERAR
LHPQLVKLDGGALSTQEKVDLAEKVLRGLGLTHTFAPIVLLAGHGSSTTNNPHRAGLDCGACAGQAGDVNARVAVQLLNE
AAVRLGLIERGIAIPRDTRFVAALHDTTTDHIELLDLDQSGIESDQLSSLTQALKQAGELTRLERLVTLEAQVDTVDAEK
QATFRGRDWSQVRPEWGLAGNAAFIAAPRWRTRGLDLGGRAFLHDYDWRHDKEFGVLNVIMTAPLIVANWINLQYYGSTV
DNLHQGAGNKVLHNVVGGTVGVIEGNGGDLRVGLAMQSLHDGEQWRHEPLRLSAYIEAPIAEIDKIIAGHDMLNALINNR
WMHILHIDDNGIPHRRHAHGDWRPEPI
>Q31HC3 ~~~dabA~~~Probable inorganic carbon transporter subunit DabA~~~COG3002
MMLHNAKASDNNELDQAKPLIPQLSDKQKEALNDACGRIAPTWPLDELIAVNPWWEMRDQHISKVSAKLSALSQAQCVMP
KSYFQEVWMETLQPQHVQQAIDEMEKDYTVDNLERYLLEEDEHTHWHNVSDFVDSGRDRKYKMAWRDEITHQISQFCADF
FRLKDSQGTFSKTYQGLYHEWLATTRQDKGIEILMGEDGLTEHFMDLPESSESLLAEALVGLRVPDSQIADYAHALLLDA
NGWASWVAYLRWQDRLSNAENDLMMDFLAIRVAWEWVLWQHQKDSDRSVFNELKVMWHHQMSILPDLIATHEAAQAKSWI
WQRAAEIAYQSELQQQLKHASRTDQKVETESPPVLLQAAFCIDVRSEVIRRALEAQDSRVETLGFAGFFGLPIEYQPAGT
DVSRPQLPGLLKSGIKVTPVMTKVSKGATKQALNRKARWIEWGNAPPATFSMVEATGLMYAFKLLRNSLFPESHTNPINA
IPATDAFELTQNDSPLTLDQKVELAAGILHAMGLDHDLAETVMLVGHGSTSCNNPHAAGLDCGACGGQTGEINVRVLAFL
LNDESVRQGLLEKDIKIPAQTRFVAAMHNTTTDEFTCFGLNHVDETIQKWLARATEFARQERSTRLGLNHLEGQNLHQSI
QRRAKDWSQVRPEWGLSNNAAFIVAPRARTRGVDFQGRAFLHDYDWQQDADNSLLTLIMTAPMVVTNWINLQYYASVCDN
HVYGSGNKVLHNVVDGCIGVFEGNGGDLRIGLPMQSLHNGEKWMHEPLRLSVYIDAPQKTIAQVVAENDVVRHLIDNEWL
YCFSWAPDGRIHRYFNNQWLESA
>D0KWS8 ~~~dabB2~~~Probable inorganic carbon transporter subunit DabB2~~~COG1009
MTIEFSHTTGLLLLAMPALLLMAAVIKPKQGAAYARAYRQRVQWTSFAAFGLALLAVVSFLFARQQNLMLGAGSLPAGLG
LLALSIQVNGLTLVLASLVSFVLSVIARYSVQYLDGDPQQARFFRLLAVTGGFFLLVVISGNLGLFTLAIIATGFGLHRL
LSFYADRPRAIMATHKKSIFSRTADALLLAATVLIGHQIGSLEFSQISAYVHAQDHLSIALHVAAWLIVLAAILKSAQFP
FHGWLIQVMEAPTPVSALMHAGVVYSGAIIVLRTSELLAADGTALLLLALIGLMTLAIGSLVMLTQSAIKSSLAWSTAAQ
LGFMMLELGLGLFGLALLHLVGHSLYKAHAFLSSGSMTDHLRQAKVLKNRPISVVAWFTTVIVSGLFTLGIAAAMGLSID
QEPMLPAVLTIIALATAQLMLKALSHGTWREILVAAGAAIAMTGVYVFLHEVFITGFADTLAATPQRAPLLDLLLMAITI
ITFLFVAWLQGPGKTLMSPERQFALFVHLNNGLYLDRWVERLAFRFWPEKVGRAPKKSCAVIPPNPSGIEP
>Q31HC4 ~~~dabB~~~Probable inorganic carbon transporter subunit DabB~~~COG1009
MNMQWVGASMMLLIPVLFFLGSLTNERRANWNLASAISLLGLFLSMFLGIAVYFEWVNLSISGQWVGVSKMSLVMLGLVC
FIAFVNVRYSSAYMAGNVDEEKRYLRWLMVTLGCVVTVIISNHMLVLMVAWIAISLSLHRLLVFFPNRQRAVLAAHKKFI
FARVAEACLLGAILILYYEHGTWFISDIYQNVSLSTSLTTLDQFAAMLLALAALVKCAQLPLHGWLIQVVEAPTPVSALL
HAGIINLGGYLLIIFAPLIVLSDMAQWILLIVGGITTVLAALVMMTRTSVKVRLAWSTMSQMGLMLVECALGLFELALLH
LVAHSCYKAYAFLNAGSEVESSMKRRLSRAVAPSVKEWWFAGIMSAAMVVGLIWLADLSGPYSPWLLFAIAVTLLIAERR
GRLTSSSVIGMVGLGVVLLVVYTLQKNGASLIVSSMETSVGWKGDLWIGFLLVMFMVGYFLLRYHSEHIWMRKVRRAFYA
GFYLDEWVTRLNLRIYPTRLPVRFKPKKLQVPKEELFQ
>P08750 3.4.16.4~~~dacA~~~D-alanyl-D-alanine carboxypeptidase DacA~~~COG1686
MNIKKCKQLLMSLVVLTLAVTCLAPMSKAKAASDPIDINASAAIMIEASSGKILYSKNADKRLPIASMTKMMTEYLLLEA
IDQGKVKWDQTYTPDDYVYEISQDNSLSNVPLRKDGKYTVKELYQATAIYSANAAAIAIAEIVAGSETKFVEKMNAKAKE
LGLTDYKFVNATGLENKDLHGHQPEGTSVNEESEVSAKDMAVLADHLITDYPEILETSSIAKTKFREGTDDEMDMPNWNF
MLKGLVSEYKKATVDGLKTGSTDSAGSCFTGTAERNGMRVITVVLNAKGNLHTGRFDETKKMFDYAFDNFSMKEIYAEGD
QVKGHKTISVDKGKEKEVGIVTNKAFSLPVKNGEEKNYKAKVTLNKDNLTAPVKKGTKVGKLTAEYTGDEKDYGFLNSDL
AGVDLVTKENVEKANWFVLTMRSIGGFFAGIWGSIVDTVTGWF
>P0AEB2 3.4.16.4~~~dacA~~~D-alanyl-D-alanine carboxypeptidase DacA~~~COG1686
MNTIFSARIMKRLALTTALCTAFISAAHADDLNIKTMIPGVPQIDAESYILIDYNSGKVLAEQNADVRRDPASLTKMMTS
YVIGQAMKAGKFKETDLVTIGNDAWATGNPVFKGSSLMFLKPGMQVPVSQLIRGINLQSGNDACVAMADFAAGSQDAFVG
LMNSYVNALGLKNTHFQTVHGLDADGQYSSARDMALIGQALIRDVPNEYSIYKEKEFTFNGIRQLNRNGLLWDNSLNVDG
IKTGHTDKAGYNLVASATEGQMRLISAVMGGRTFKGREAESKKLLTWGFRFFETVNPLKVGKEFASEPVWFGDSDRASLG
VDKDVYLTIPRGRMKDLKASYVLNSSELHAPLQKNQVVGTINFQLDGKTIEQRPLVVLQEIPEGNFFGKIIDYIKLMFHH
WFG
>Q05523 3.4.16.4~~~dacA~~~D-alanyl-D-alanine carboxypeptidase DacA~~~
MRRRKQNWLFWLLSICLCLTFGPFQQTVKAESAPLDIRADAAILVDAQTGRILYEKNIDTVLGIASMTKMMTEYLLLDAI
KAKRVKWDQMYTPSDYVYRLSQDRALSNVPLRKDGKYTVRELYEAMAIYSANGATVAIAEIIAGSEKNFVKMMNDKAKEL
GLKDYKFVNATGLSNKDLKGFHPEGTSTNEENVMSARAMAMLAYRLLKDHPEVLKTASIPHKVFREGTKDEIKMDNWNWM
LPGLVYGYEGVDGLKTGYTEFAGNCFTGTAKRNGVRLISVVMNAKDASGKTTKEARFKETEKLFNYGFNQYSLETLYPKG
YQLKGKETLPVVKGKEKEVRVATGKNLDLLVKNGEEKQYKPVYVLDKKKMTKEGKLVAPLKKGETVGYMTLEYKGDDSLA
FLSPDMQKNIRVPLVTTAEVEKANWFVLSMRAVGGLFVDLWTSVAKTVKGWL
>P44466 3.4.16.4~~~dacA~~~D-alanyl-D-alanine carboxypeptidase DacA~~~COG1686
MLKRTTKIAFLSSFVALSAFSVSAEDMQFGVTPPQITAQTYVLMDYNSGAILTALNPDQRQYPASLTKMMTSYVVGVALK
QGKIHNTDMVTIGESAWGRNFPDSSKMFLDLNTQVSVADLNRGVIVVSGNDATVALAEHISGNVPNFVETMNKYVQQFGL
KNTNFTTPHGLDDPNQYSSARDMAIIGAHIIRDLPEEYKIYSEKNFTFNKIKQANRNGLLWDKTINVDGMKTGHTSQAGY
NLVASATTSNNMRLISVVMGVPTYKGREVESKKLLQWGFANFETFKTLEAGKEISEQRVYYGDKNSVKLGALMDHFITIP
KGKQSEVKARYELADKNLQAPLVKGQVIGKVVYQLDGKDIASANLQVMNDVGEAGIFGKLWDWLVLTVKGLFS
>Q8Y5E4 2.7.7.85~~~dacA~~~Diadenylate cyclase~~~COG1624
MDFSNMSILHYLANIVDILVVWFVIYKVIMLIRGTKAVQLLKGIFIIIAVKLLSGFFGLQTVEWITDQMLTWGFLAIIII
FQPELRRALETLGRGNIFTRYGSRIEREQHHLIESIEKSTQYMAKRRIGALISVARDTGMDDYIETGIPLNAKISSQLLI
NIFIPNTPLHDGAVIIKGNEIASAASYLPLSDSPFLSKELGTRHRAALGISEVTDSITIVVSEETGGISLTKGGELFRDV
SEEELHKILLKELVTVTAKKPSIFSKWKGGKSE
>O53380 3.4.16.-~~~dacB1~~~D-alanyl-D-alanine carboxypeptidase DacB1~~~COG1686
MAFLRSVSCLAAAVFAVGTGIGLPTAAGEPNAAPAACPYKVSTPPAVDSSEVPAAGEPPLPLVVPPTPVGGNALGGCGII
TAPGSAPAPGDVSAEAWLVADLDSGAVIAARDPHGRHRPASVIKVLVAMASINTLTLNKSVAGTADDAAVEGTKVGVNTG
GTYTVNQLLHGLLMHSGNDAAYALARQLGGMPAALEKINLLAAKLGGRDTRVATPSGLDGPGMSTSAYDIGLFYRYAWQN
PVFADIVATRTFDFPGHGDHPGYELENDNQLLYNYPGALGGKTGYTDDAGQTFVGAANRDGRRLMTVLLHGTRQPIPPWE
QAAHLLDYGFNTPAGTQIGTLIEPDPSLMSTDRNPADRQRVDPQAAARISAADALPVRVGVAVIGALIVFGLIMVARAMN
RRPQH
>I6Y204 3.4.16.-~~~dacB2~~~D-alanyl-D-alanine carboxypeptidase DacB2~~~COG1686
MRKLMTATAALCACAVTVSAGAAWADADVQPAGSVPIPDGPAQTWIVADLDSGQVLAGRDQNVAHPPASTIKVLLALVAL
DELDLNSTVVADVADTQAECNCVGVKPGRSYTARQLLDGLLLVSGNDAANTLAHMLGGQDVTVAKMNAKAATLGATSTHA
TTPSGLDGPGGSGASTAHDLVVIFRAAMANPVFAQITAEPSAMFPSDNGEQLIVNQDELLQRYPGAIGGKTGYTNAARKT
FVGAAARGGRRLVIAMMYGLVKEGGPTYWDQAATLFDWGFALNPQASVGSL
>P35150 3.4.16.4~~~dacB~~~D-alanyl-D-alanine carboxypeptidase DacB~~~COG1686
MRIFKKAVFVIMISFLIATVNVNTAHAAIDVSAKSAIIIDGASGRVLYAKDEHQKRRIASITKIMTAVLAIESGKMDQTV
TVSANAVRTEGSAIYLTEGQKVKLKDLVYGLMLRSGNDAAVAIAEHVGGSLDGFVYMMNQKAEQLGMKNTRFQNPHGLDD
HENHYSTAYDMAILTKYAMKLKDYQKISGTKIYKAETMESVWKNKNKLLTMLYPYSTGGKTGYTKLAKRTLVSTASKDGI
DLIAVTINDPNDWDDHMKMFNYVFEHYQTYLIAKKGDIPKLKGTFYESKAFIKRDITYLLTEEEKENVKINTTLLKPKKA
WEKDASKIPDIVGHMEIMFNDATIAKVPIYYENERHQKPKKQFFETFKSIFLNAAGGAKWSI
>P24228 3.4.16.4~~~dacB~~~D-alanyl-D-alanine carboxypeptidase DacB~~~COG2027
MRFSRFIIGLTSCIAFSVQAANVDEYITQLPAGANLALMVQKVGASAPAIDYHSQQMALPASTQKVITALAALIQLGPDF
RFTTTLETKGNVENGVLKGDLVARFGADPTLKRQDIRNMVATLKKSGVNQIDGNVLIDTSIFASHDKAPGWPWNDMTQCF
SAPPAAAIVDRNCFSVSLYSAPKPGDMAFIRVASYYPVTMFSQVRTLPRGSAEAQYCELDVVPGDLNRFTLTGCLPQRSE
PLPLAFAVQDGASYAGAILKDELKQAGITWSGTLLRQTQVNEPGTVVASKQSAPLHDLLKIMLKKSDNMIADTVFRMIGH
ARFNVPGTWRAGSDAVRQILRQQAGVDIGNTIIADGSGLSRHNLIAPATMMQVLQYIAQHDNELNFISMLPLAGYDGSLQ
YRAGLHQAGVDGKVSAKTGSLQGVYNLAGFITTASGQRMAFVQYLSGYAVEPADQRNRRIPLVRFESRLYKDIYQNN
>P45161 3.4.16.4~~~dacB~~~D-alanyl-D-alanine carboxypeptidase DacB~~~COG2027
MKKLSSISTALGSFLLSVSFSLPTFANINVSDLTQKLPEGSNVGFIAKNINQNQIIADYNGSTFMLSASTQKVFTAVAAK
LALDDQFQFETALLSNGKIQNGNLDGNLIVRFTGDPDLTRGQLYSLLAELKKQGIKKINGDLVLDTSVFSSHDRGLGWIW
NDLTMCFNSPPAAANIDNNCFYAELDANKNPGEIVKINVPAQFPIQVFGQVYVADSNEAPYCQLDVVVHDNNRYQVKGCL
ARQYKPFGLSFAVQNTDAYAAEIIQRQLRQLGIEFNGKVLLPQKPQQGQLLAKHLSKPLPDLLKKMMKKSDNQIADSLFR
AVAFNYYKRPASFQLGTLAVKSILQKQGIRFGNSILADGSGLSRHNLVAPKTMLSVLEYIAKNEDKLHLMETFPIAGVDG
TISGRGGLISPPLVKNVIAKTGSLKGVYNLAGFMTNARGEKVAFVQFINGYSTGDLESKTKRAPLVQFERNLYNELYKY
>P39844 3.4.16.4~~~dacC~~~D-alanyl-D-alanine carboxypeptidase DacC~~~COG2027
MKKSIKLYVAVLLLFVVASVPYMHQAALAAEKQDALSGQIDKILADHPALEGAMAGITVRSAETGAVLYEHSGDTRMRPA
SSLKLLTAAAALSVLGENYSFTTEVRTDGTLKGKKLNGNLYLKGKGDPTLLPSDFDKMAEILKHSGVKVIKGNLIGDDTW
HDDMRLSPDMPWSDEYTYYGAPISALTASPNEDYDAGTVIVEVTPNQKEGEEPAVSVSPKTDYITIKNDAKTTAAGSEKD
LTIEREHGTNTITIEGSVPVDANKTKEWISVWEPAGYALDLFKQSLKKQGITVKGDIKTGEAPSSSDVLLSHRSMPLSKL
FVPFMKLSNNGHAEVLVKEMGKVKKGEGSWEKGLEVLNSTLPEFGVDSKSLVLRDGSGISHIDAVSSDQLSQLLYDIQDQ
SWFSAYLNSLPVAGNPDRMVGGTLRNRMKGTPAQGKVRAKTGSLSTVSSLSGYAETKSGKKLVFSILLNGLIDEEDGKDI
EDQIAVILANQ
>P08506 3.4.16.4~~~dacC~~~D-alanyl-D-alanine carboxypeptidase DacC~~~COG1686
MTQYSSLLRGLAAGSAFLFLFAPTAFAAEQTVEAPSVDARAWILMDYASGKVLAEGNADEKLDPASLTKIMTSYVVGQAL
KADKIKLTDMVTVGKDAWATGNPALRGSSVMFLKPGDQVSVADLNKGVIIQSGNDACIALADYVAGSQESFIGLMNGYAK
KLGLTNTTFQTVHGLDAPGQFSTARDMALLGKALIHDVPEEYAIHKEKEFTFNKIRQPNRNRLLWSSNLNVDGMKTGTTA
GAGYNLVASATQGDMRLISVVLGAKTDRIRFNESEKLLTWGFRFFETVTPIKPDATFVTQRVWFGDKSEVNLGAGEAGSV
TIPRGQLKNLKASYTLTEPQLTAPLKKGQVVGTIDFQLNGKSIEQRPLIVMENVEEGGFFGRVWDFVMMKFHQWFGSWFS
>P33013 3.4.16.4~~~dacD~~~D-alanyl-D-alanine carboxypeptidase DacD~~~COG1686
MKRRLIIAASLFVFNLSSGFAAENIPFSPQPPEIHAGSWVLMDYTTGQILTAGNEHQQRNPASLTKLMTGYVVDRAIDSH
RITPDDIVTVGRDAWAKDNPVFVGSSLMFLKEGDRVSVRDLSRGLIVDSGNDACVALADYIAGGQRQFVEMMNNYAEKLH
LKDTHFETVHGLDAPGQHSSAYDLAVLSRAIIHGEPEFYHMYSEKSLTWNGITQQNRNGLLWDKTMNVDGLKTGHTSGAG
FNLIASAVDGQRRLIAVVMGADSAKGREEEARKLLRWGQQNFTTVQILHRGKKVGTERIWYGDKENIDLGTEQEFWMVLP
KAEIPHIKAKYTLDGKELTAPISAHQRVGEIELYDRDKQVAHWPLVTLESVGEGSMFSRLSDYFHHKA
>P38422 3.4.16.4~~~dacF~~~D-alanyl-D-alanine carboxypeptidase DacF~~~COG1686
MKRLLSTLLIGIMLLTFAPSAFAKQDGKRTSELAHEAKSAVLIERDTGKVLYNKNSNERLAPASMTKIMTMLLIMEALDK
GKIKMSDKVRTSEHAASMGGSQIFLEPGEEMTVKEMLKGIAIASGNDASVAMAEFISGSEEEFVKKMNKKAKELGLKNTS
FKNPTGLTEEGHYSSAYDMAIMAKELLKYESITKFTGTYEDYLRENTDKKFWLVNTNRLIKFYPGVDGVKTGYTGEAKYC
LTASAKKGNMRAIAVVFGASTPKERNAQVTKMLDFAFSQYETHPLYKRNQTVAKVKVKKGKQKFIELTTSEPISILTKKG
EDMNDVKKEIKMKDNISAPIQKGQELGTLVLKKDGEVLAESPVAAKEDMKKAGFITFLKRTMGDWTKFK
>P39042 3.4.16.4~~~~~~D-alanyl-D-alanine carboxypeptidase~~~
MRLRRAAATVITTGALLAAGTLGATPATAVTKPTIAAVGGYAMNNGTGTTLYTKAADTRRSTGSTTKIMTAKVVLAQSNL
NLDAKVTIQKAYSDYVVANKPSQAHLIVGDKVTVRQLLYGLMLPSGCDAAYALADKYGSGSQAAARVKSFIGKMNTAATN
LGLHNTHFDSFDGIGNGANYSTPRHLTKIASSAMKNSTFRTVVKTKAYTAKTVTKTGSIRTMDTWKNTNGLLSSYSGAIG
VKTGSGPEAKYCLVFAATRGGKTVIGTVLASTSIPARESDATKIMNYGFAL
>P39045 3.4.16.4~~~dac~~~D-alanyl-D-alanine carboxypeptidase~~~
MKQSSPEPLRPRRTGGRGGARRAAALVTIPLLPMTLLGASPALADASGARLTELREDIDAILEDPALEGAVSGVVVVDTA
TGEELYSRDGGEQLLPASNMKLFTAAAALEVLGADHSFGTEVAAESAPGRRGEVQDLYLVGRGDPTLSAEDLDAMAAEVA
ASGVRTVRGDLYADDTWFDSERLVDDWWPEDEPYAYSAQISALTVAHGERFDTGVTEVSVTPAAEGEPADVDLGAAEGYA
ELDNRAVTGAAGSANTLVIDRPVGTNTIAVTGSLPADAAPVTALRTVDEPAALAGHLFEEALESNGVTVKGDVGLGGVPA
DWQDAEVLADHTSAELSEILVPFMKFSNNGHAEMLVKSIGQETAGAGTWDAGLVGVEEALSGLGVDTAGLVLNDGSGLSR
GNLVTADTVVDLLGQAGSAPWAQTWSASLPVAGESDPFVGGTLANRMRGTAAEGVVEAKTGTMSGVSALSGYVPGPEGEL
AFSIVNNGHSGPAPLAVQDAIAVRLAEYAGHQAPEGARMMRGPVQGSGELECSWVQAC
>P15555 3.4.16.4~~~~~~D-alanyl-D-alanine carboxypeptidase~~~
MVSGTVGRGTALGAVLLALLAVPAQAGTAAAADLPAPDDTGLQAVLHTALSQGAPGAMVRVDDNGTIHQLSEGVADRATG
RAITTTDRFRVGSVTKSFSAVVLLQLVDEGKLDLDASVNTYLPGLLPDDRITVRQVMSHRSGLYDYTNDMFAQTVPGFES
VRNKVFSYQDLITLSLKHGVTNAPGAAYSYSNTNFVVAGMLIEKLTGHSVATEYQNRIFTPLNLTDTFYVHPDTVIPGTH
ANGYLTPDEAGGALVDSTEQTVSWAQSAGAVISSTQDLDTFFSALMSGQLMSAAQLAQMQQWTTVNSTQGYGLGLRRRDL
SCGISVYGHTGTVQGYYTYAFASKDGKRSVTALANTSNNVNVLNTMARTLESAFCGKPTTAKLRSATSSATTVERHEDIA
PGIARD
>Q9HTQ0 1.4.99.-~~~dadA1~~~D-amino acid dehydrogenase 1~~~
MRVLVLGSGVIGTASAYYLARAGFEVVVVDRQDGPALETSFANAGQVSPGYASPWAAPGIPLKAMKWLLEKHAPLAIKLT
SDPSQYAWMLQMLRNCTAERYAVNKERMVRLSEYSRDCLDELRAETGIAYEGRTLGTTQLFRTQAQLDAAGKDIAVLERS
GVPYEVLDRDGIARVEPALAKVADKLVGALRLPNDQTGDCQLFTTRLAEMAKGLGVEFRFGQNIERLDFAGDRINGVLVN
GELLTADHYVLALGSYSPQLLKPLGIKAPVYPLKGYSLTVPITNPEMAPTSTILDETYKVAITRFDQRIRVGGMAEIAGF
DLSLNPRRRETLEMITTDLYPEGGDISQATFWTGLRPATPDGTPIVGATRYRNLFLNTGHGTLGWTMACGSGRYLADLMA
KKRPQISTEGLDISRYSNSPENAKNAHPAPAH
>P0A6J7 1.4.99.-~~~dadA~~~D-amino acid dehydrogenase~~~COG0665
MRVVILGSGVVGVASAWYLNQAGHEVTVIDREPGAALETSAANAGQISPGYAAPWAAPGVPLKAIKWMFQRHAPLAVRLD
GTQFQLKWMWQMLRNCDTSHYMENKGRMVRLAEYSRDCLKALRAETNIQYEGRQGGTLQLFRTEQQYENATRDIAVLEDA
GVPYQLLESSRLAEVEPALAEVAHKLTGGLQLPNDETGDCQLFTQNLARMAEQAGVKFRFNTPVDQLLCDGEQIYGVKCG
DEVIKADAYVMAFGSYSTAMLKGIVDIPVYPLKGYSLTIPIAQEDGAPVSTILDETYKIAITRFDNRIRVGGMAEIVGFN
TELLQPRRETLEMVVRDLYPRGGHVEQATFWTGLRPMTPDGTPVVGRTRFKNLWLNTGHGTLGWTMACGSGQLLSDLLSG
RTPAIPYEDLSVARYSRGFTPSRPGHLHGAHS
>P0A6J6 1.4.99.-~~~dadA~~~D-amino acid dehydrogenase~~~COG0665
MRVVILGSGVVGVASAWYLNQAGHEVTVIDREPGAALETSAANAGQISPGYAAPWAAPGVPLKAIKWMFQRHAPLAVRLD
GTQFQLKWMWQMLRNCDTSHYMENKGRMVRLAEYSRDCLKALRAETNIQYEGRQGGTLQLFRTEQQYENATRDIAVLEDA
GVPYQLLESSRLAEVEPALAEVAHKLTGGLQLPNDETGDCQLFTQNLARMAEQAGVKFRFNTPVDQLLCDGEQIYGVKCG
DEVIKADAYVMAFGSYSTAMLKGIVDIPVYPLKGYSLTIPIAQEDGAPVSTILDETYKIAITRFDNRIRVGGMAEIVGFN
TELLQPRRETLEMVVRDLYPRGGHVEQATFWTGLRPMTPDGTPVVGRTRFKNLWLNTGHGTLGWTMACGSGQLLSDLLSG
RTPAIPYEDLSVARYSRGFTPSRPGHLHGAHS
>P0A6J5 1.4.99.-~~~dadA~~~D-amino acid dehydrogenase~~~COG0665
MRVVILGSGVVGVASAWYLNQAGHEVTVIDREPGAALETSAANAGQISPGYAAPWAAPGVPLKAIKWMFQRHAPLAVRLD
GTQFQLKWMWQMLRNCDTSHYMENKGRMVRLAEYSRDCLKALRAETNIQYEGRQGGTLQLFRTEQQYENATRDIAVLEDA
GVPYQLLESSRLAEVEPALAEVAHKLTGGLQLPNDETGDCQLFTQNLARMAEQAGVKFRFNTPVDQLLCDGEQIYGVKCG
DEVIKADAYVMAFGSYSTAMLKGIVDIPVYPLKGYSLTIPIAQEDGAPVSTILDETYKIAITRFDNRIRVGGMAEIVGFN
TELLQPRRETLEMVVRDLYPRGGHVEQATFWTGLRPMTPDGTPVVGRTRFKNLWLNTGHGTLGWTMACGSGQLLSDLLSG
RTPAIPYEDLSVARYSRGFTPSRPGHLHGAHS
>A3KEZ1 1.4.5.1~~~dadA~~~D-amino acid dehydrogenase~~~COG0665
MKKEVVVIGGGIVGLSCAYSMHKLGHKVCVIEKSDGANGTSFGNAGLISAFKKAPLSCPGVVLDTLKLMLKNQAPLKFHF
GLNLKLYQWILKFVKSANAKSTHRTMALFERYGWLSVDIYHQMLKDGMDFWYKEDGLLMIYTLEESFEKKLKTCDDSGAY
KILSAKETKEYMPIVNDNICGSVLLTENAHVDPGEVMHSLQEYLQNAGVEFLYNEEVIDFEFKNNLIEGVITHKEKIQAE
TIILATGANPTLIKKTKNDFLMMGAKGYSITFKMPEELKPKTSSLFADIFMAMTPRRDTVRITSKLELNTNNALIDKEQI
ANMKKNLAAFTQPFEMKDAIEWCGFRPLTPNDIPYLGYDKRYKNLIHATGLGWLGITFGPAIGKIIANLSQDGANEKNAD
IMLFSAFFRD
>O30745 1.4.99.-~~~dadA~~~D-amino acid dehydrogenase~~~
MRVVILGSGVFGVASAWYLSQAGHDVTVIDRQPGPAEETSAANAGQISPGYAAPWAAPGVPLKAIKWMFQRHAPLAIGLD
GTSFQLKWMWQMLRNCDTRHYMENKGRMVRLAEYSRDCLKALRDTTGIQYEGRQGGTLQLFRTAKQYENATRDIAVLEDA
GVPYQLLEAKRLAEVEPALAEVSHKLTGGLRLPNDETGDCQLFTTRLAAMADQAGVTFRFNTAVDALLHEGDRIAGVKCG
MRIIKGDAYVMAFGSYSTAMLKGLVDIPVYPLKGYSLTIPIAQEDGAPVSTILDVTYTIAITRFDQRIRVGGMAEIVGFN
KTLLQPRRETLEMVVRDLFPRGGHVEQATFWTGLRPMTPDGTPVVGRTAYKNLWLNTGHGTLGWTMACGSGQLISDLISG
RTPAIPYDDLAVARYSPGFTPARPQHLHGAHN
>C8WLC6 1.-.-.-~~~~~~Uncharacterized oxidoreductase Elen_0471~~~COG0243
MGNLTMSRRTFVKTAAITGAAAAAFGASTHTALAEETYSSVSGNDTVAVKTCCRGCGKMECGVKVIVQNGRAIRVEGDEG
AFQSMGNCCTKSQSSIQAAYHPDRLHYPMKRTNPKGEEPGWQRISWDEAMQSIVDNFMDIKAKHGGEAIACQVGTSRIWC
MHSESILKNMLETPNNVEAWQICKGPRHFATTMVSQFAMSWMETITRPKVYVQWGGASELSNYDDSCRTTVDVASRADVH
ISVDPRMANMGKEADYWQHLRPGTDGALALAWTNVIIEKKLYDELYVKKWTNAPFLVCEDMEPSGFPTVRTDGSYWDVKT
ALLKESDIKEGGSPYKFLVYDNNWEKLKAEGVEHEYGAFTWFNADQEGVIDETGGFWEGENYDSEKARQGREAAQDNLLP
GQTQGWLPDPMPFDPAIDPALEGEFEITLKDGKTVKVKPVWEHYKARAAEYKPEVAAEITGIPASEIEAAATAYGTRIDP
STGYGNGGIQYMLAVEHFCSAIQNCSAFDNLVGITGNMDTPGGNRGPTIVPIDGDLQGFSAWAPGATTPPEEVNRKQIGI
DKFPLLGWWQYWCDSHSLWDAVITGDPYPVRALWNESGNFMSQTNTTRAWEALCSLDFYVDLNLWHTPQNDTADIILPVA
HWIELNSPRASQGSAGAMGATVKCVQPPAEAKYDPEIVMDLARRMNWKWTDEPGNEWPDINWQLDDSIKLLTDDELTYTT
WHVENGKPTFERHGVPMAEVTPKYKTWDEYVKAFQEHGWWQAKDIEPRNWGTYRRYQTGAMRARDRVWGRLDYTAGKGIG
DWKPGWFTPTMKQEIWSTVMESHHPDHPEWRLPTYTEPPHGPKDGDRIKEYPLTATTGRRIPVYFHSEHRQLPWCRELWP
VPRVEINPKTAAEYGIEQGDWVWIETEWGKIREVADLYYGVKEDVINLEHTWWYPEVKDAGHGWQFSQVNQLIDHYAQDP
HSGTSNLRAYQVKIYKATPENSPFNNPVPCDSTGTPIIHTSDDPRLKEWLPTYEGRE
>A0A369NIV7 1.1.-.-~~~dadH~~~Dopamine dehydroxylase~~~
MGNLTMSRRTFVKTAAITGAAAAAFGASTHTALAEETYSSVSGNDTVAVKTCCRGCGKMECGVKVIVQNGRAIRVEGDEG
AFQSMGNCCTKSQSSIQAAYHPDRLHYPMKRTNPKGEEPGWQRISWDEAMQSIVDNFMDIKAKHGGEAIACQVGTSRIWC
MHSESILKNMLETPNNVEAWQICKGPRHFATTMVSQFAMSWMETITRPKVYVQWGGASELSNYDDSCRTTVDVASRADVH
ISVDPRMANMGKEADYWQHLRPGTDGALALAWTNVIIEKKLYDELYVKKWTNAPFLVCEDMEPSGFPTVRTDGSYWDVKT
ALLKESDIKEGGSPYKFLVYDNNWEKLKAEGVEHEYGAFTWFNADQEGVIDETGGFWEGENYDSEKARQGREAAQDNLLP
GQTQGWLPDPMPFDPAIDPALEGEFEITLKDGKTVKVKPVWEHYKARAAEYKPEVAAEITGIPASEIEAAATAYGTRIDP
STGYGNGGIQYMLAVEHFCSAIQNCRAFDNLVGITGNMDTPGGNRGPTIVPIDGDLQGFSAWAPGATTPPEEVNRKQIGI
DKFPLLGWWQYWCDSHSLWDAVITGDPYPVRALWNESGNFMSQTNTTRAWEALCSLDFYVDLNLWHTPQNDTADIILPVA
HWIELNSPRASQGSAGAMGATVKCVQPPAEAKYDPEIVMDLARRMNWKWTDEPGNEWPDINWQLDDSIKLLTDDELTYTT
WHVENGKPTFERHGVPMAEVTPKYKTWDEYVKAFQEHGWWQAKDIEPRNWGTYRRYQTGAMRARDRVWGRLDYTAGKGIG
DWKPGWFTPTMKQEIWSTVMESHHPDHPEWRLPTYTEPPHGPKDGDRIKEYPLTATTGRRIPVYFHSEHRQLPWCRELWP
VPRVEINPKTAAEYGIEQGDWVWIETEWGKIREVADLYYGVKEDVINLEHTWWYPEVKDAGHGWQFSQVNQLIDHYAQDP
HSGTSNLRAYQVKIYKATPENSPFNNPVPCDSTGTPIIHTSDDPRLKEWLPTYEGRE
>Q88CB2 5.1.1.1~~~dadX~~~Alanine racemase, catabolic~~~COG0787
MRPARALIDLQALRHNYRLARELTGAKALAVIKADAYGHGAVRCALALEAEADGFAVACIEEALELRAAGIKAPVLLLEG
FFEASELALIAEHDLWCVVHSLWQLEAIEKTPLHKPLNVWLKLDSGMHRVGLHPKDYHDAYQRLLASGKVSRIVLMTHFA
RADELDADATSQQIAVFEAARQGLAAECSLRNSPGVLGWPQAPSDWVRPGLMLYGATPFEVAQAEAARLQPVMTLQSRVI
SVRELPAGEPVGYGAKFVSPRPTRVGVVAMGYADGYPRQAPNGTPVLVAGKRTQLIGRVSMDMLSIDLTDVPQATVGSPV
EFWGKQVLASEVAAHAGTIPYQIFCNLKRVPRDYIGE
>P77527 ~~~dafA~~~Protein DafA~~~COG0789
MLARSGWLSLEALSEYGLSLAAVRAYVEIGFVEPLEVGGAWYFREEDLLRMAKAERIRKDLGANLIGAALVVEILERT
>P30144 ~~~dagA~~~Na(+)-linked D-alanine glycine permease~~~COG1115
MLGGAVWFPYVLLGVGLFFTIYLKFPQIRYFKHACQVVSGKFDKKDTEGDTTHFQALATALSGTVGTGNIGGVALAISIG
GPAALFWMWMTAFFGMTTKFVEVTLSHKYREKTEDGTMSGGPMYYMDKRLNMKWLAILFAVATVISSFGTGSLPQINNIA
QGMEATFGFAPMATGAVLSILLALVILGGIKRIAAITSRVVPLMAAIYIIGALAVIFYNAENIGPSFSAVFMDAFSGSAA
AGGFLGASFAYAFNRGVNRGLFSNEAGQGSAPIAHASAKADEPVSEGIVSILEPFIDTIIICTLTGLVILSSGVWNEKFQ
THFERSAMSIIKGDYTEENQTQREDLYKYLNGQKSNIETFTGNIEVVNGEALSTGFTVLHSRSIAEDVRFGITEKHKYTG
VVEVIDGMPTDDSISLVGKSLVHSAELTTKAFKRGYFGDSGQYIVSIGLLLFAFSTAIAWSYYGDRAMIYLLGHRSVMPY
RVFYVAAFFWASFADTTLVWKLAAVAIVVMTLPNLIGIMLLRKEMKESVDDYWVKFKKDNEK
>O31502 2.7.1.107~~~dagK~~~Diacylglycerol kinase~~~COG1597
MKRARIIYNPTSGREIFKKHLAQVLQKFEQAGYETSTHATTCAGDATHAAKEAALREFDLIIAAGGDGTINEVVNGLAPL
DNRPTLGVIPVGTTNDFARALGIPREDILKAADTVINGVARPIDIGQVNGQYFINIAGGGRLTELTYDVPSKLKTMLGQL
AYYLKGMEMLPSLRPTEVEIEYDGKLFQGEIMLFLVTLTNSVGGFEKLAPDSSLNDGMFDLMILKKANLAEFIRVATMAL
RGEHINDQHIIYTKANRVKVNVSEKMQLNLDGEYGGMLPGEFVNLYRHIHVVMPKEKAEQLDD
>P9WP28 2.7.1.107~~~dagK~~~Diacylglycerol kinase~~~
MSAGQLRRHEIGKVTALTNPLSGHGAAVKAAHGAIARLKHRGVDVVEIVGGDAHDARHLLAAAVAKGTDAVMVTGGDGVV
SNALQVLAGTDIPLGIIPAGTGNDHAREFGLPTKNPKAAADIVVDGWTETIDLGRIQDDNGIEKWFGTVAATGFDSLVND
RANRMRWPHGRMRYYIAMLAELSRLRPLPFRLVLDGTEEIVADLTLADFGNTRSYGGGLLICPNADHSDGLLDITMAQSD
SRTKLLRLFPTIFKGAHVELDEVSTTRAKTVHVECPGINVYADGDFACPLPAEISAVPAALQVLRPRHG
>P9WP29 2.7.1.107~~~dagK~~~Diacylglycerol kinase~~~COG1597
MSAGQLRRHEIGKVTALTNPLSGHGAAVKAAHGAIARLKHRGVDVVEIVGGDAHDARHLLAAAVAKGTDAVMVTGGDGVV
SNALQVLAGTDIPLGIIPAGTGNDHAREFGLPTKNPKAAADIVVDGWTETIDLGRIQDDNGIEKWFGTVAATGFDSLVND
RANRMRWPHGRMRYYIAMLAELSRLRPLPFRLVLDGTEEIVADLTLADFGNTRSYGGGLLICPNADHSDGLLDITMAQSD
SRTKLLRLFPTIFKGAHVELDEVSTTRAKTVHVECPGINVYADGDFACPLPAEISAVPAALQVLRPRHG
>Q6GFF9 2.7.1.107~~~dagK~~~Diacylglycerol kinase~~~
MRKRARIIYNPTSGKEQFKRELPDALIKLEKAGYETSAYATEKIGDATLEAERAMHENYDVLIAAGGDGTLNEVVNGIAE
KPNRPKLGVIPMGTVNDFGRALHIPNDIMGALDVIIEGHSTKVDIGKMNNRYFINLAAGGQLTQVSYETPSKLKSIVGPF
AYYIKGFEMLPQMKAVDLRIEYDGNVFQGEALLFFLGLTNSMAGFEKLVPDAKLDDGYFTLIIVEKSNLAELGHIMTLAS
RGEHTKHPKVIYEKAKAINISSFTDLQLNVDGEYGGKLPANFLNLERHIDVFAPNDIVNEELINNDHVDDNLIEE
>P84908 ~~~daip~~~Dispase autolysis-inducing protein~~~
MKRMGWAVTAAVTTIVLAQSSLAAQAADSTSGWRAPSCTKVTGDGAVTFTTDDGATLAPTTGTLQSVSYTHGLVALDTPN
TLLATHNDELQRSTDAGCTWTKVATLGSGSTWLTAATGGRAFAWEKNGGYLARVDGRTVTKLSSPSADIVGVGTDKARRD
HVRLAGSDGQLYDSTDAGATWKPLGKLAFGPGASVYTVSFDPADLDHAVAGGMTTGGAVTTDGGATWTAATGLSATAGGK
SNLFAASVSPADRNVVYALGIDLVEAAPNSGAEGRHLYRSTDGGRTYTRIVDDTPDTELTNSTLLAPSPVDPNVLYFEYG
TYFQAYGTDLYRYDARTGKVGKTHNAHDGISAIAFNPARPSVMYLGLEEVQIHH
>P11557 ~~~damX~~~Cell division protein DamX~~~COG3266
MDEFKPEDELKPDPSDRRTGRSRQSSERSERTERGEPQINFDDIELDDTDDRRPTRAQKERNEEPEIEEEIDESEDETVD
EERVERRPRKRKKAASKPASRQYMMMGVGILVLLLLIIGIGSALKAPSTTSSDQTASGEKSIDLAGNATDQANGVQPAPG
TTSAENTQQDVSLPPISSTPTQGQTPVATDGQQRVEVQGDLNNALTQPQNQQQLNNVAVNSTLPTEPATVAPVRNGNASR
DTAKTQTAERPSTTRPARQQAVIEPKKPQATVKTEPKPVAQTPKRTEPAAPVASTKAPAATSTPAPKETATTAPVQTASP
AQTTATPAAGAKTAGNVGSLKSAPSSHYTLQLSSSSNYDNLNGWAKKENLKNYVVYETTRNGQPWYVLVSGVYASKEEAK
KAVSTLPADVQAKNPWAKPLRQVQADLK
>P9WP27 1.4.3.3~~~aao~~~Probable D-amino-acid oxidase~~~COG0665
MAIGEQQVIVIGAGVSGLTSAICLAEAGWPVRVWAAALPQQTTSAVAGAVWGPRPKEPVAKVRGWIEQSLHVFRDLAKDP
ATGVRMTPALSVGDRIETGAMPPGLELIPDVRPADPADVPGGFRAGFHATLPMIDMPQYLDCLTQRLAATGCEIETRPLR
SLAEAAEAAPIVINCAGLGARELAGDATVWPRFGQHVVLTNPGLEQLFIERTGGSEWICYFAHPQRVVCGGISIPGRWDP
TPEPEITERILQRCRRIQPRLAEAAVIETITGLRPDRPSVRVEAEPIGRALCIHNYGHGGDGVTLSWGCAREVVNLVGGG
>Q6F3I7 3.4.14.5~~~dap4~~~Dipeptidyl aminopeptidase 4~~~
MRLALFALFALMTVATALPAHAEKLTLEAITGSAPLSGPTLTKPQIAPDGSRVTFLRGKDRDRNRLDLWEYDIASGQTRL
LVDSSVVLPGEEVLSDEEKARRERQRIAALSGIVDYQWSPDGKALLFPLGGELYFYDLTKSGRDAVRKLTNGGGFATDPK
ISPKGGFVSFIRDRNLWAIDLASGKEVQLTRDGSDTIGNGVAEFVADEEMDRHTGYWWAPDDAAIAFARIDETPVPVQKR
YEVYPDRTEVVEQRYPAAGDHNVRVQLGVIAPKTGARPRWIDLGKDPDIYLARVDWRDPQRLTFQRQSRDQKKIELIETT
LTNGTQRTLVTETSTTWVPLHNDLRFLKDGRFLWSSERSGFEHLYVASEDGSTLTALTQGEWVVDSLLAIDEAAGLAYVS
GTRDGATEAHVYAVPLSGGEPRRLTQAPGMHAATFARNASVFVDSWSSDTTLPQIELFKADGTKLATLLVNDVSDATHPY
AKYRAAHQPTAYGTLTAADGTTPLHYSLIKPAGFDPKKQYPVVVFVYGGPAAQTVTRAWPGRSDSFFNQYLAQQGYVVFT
LDNRGTPRRGAAFGGALYGKQGTVEVDDQLRGIEWLKSQAFVDPARIGVYGWSNGGYMTLMLLAKHDEAYACGVAGAPVT
DWALYDTHYTERYMDLPKANEAGYREASVFTHVDGIGAGKLLLIHGMADDNVLFTNSTKLMSELQKRGTPFELMTYPGAK
HGLRGSDLLHRYRLTEDFFARCLKP
>O69782 4.-.-.-~~~dapAL~~~Uncharacterized DapA-like lyase~~~
MALRTDWNGVFPAMVTPFRENGSFDEASFKALIELYISEGVKGVVVTGSTGEWYSMSDAERATVWEVAVEASSGRITVIA
GTSAVGTREALALTRTAKAVGVDGCMLLPPGGIFAARNEVVNYFHTLAGVGLPIMVYNNPPRTGVNMDADMVAEIAKSEE
IVSFKDINRYLYAASEIIYRVCDKLAVFTGLEPYASSVLPRGAVGVVSTISNICAANMVSYYNAVISGDSATAYKTQKLI
DQLYHFLPTLGAPAFVSVKAAMKLLGRPGGEIRLPHLPANEALIGKLREELRRLKMMTLN
>A3DK17 2.6.1.83~~~dapL~~~LL-diaminopimelate aminotransferase~~~COG0436
MAFINENYLKLPGSYLFSEIARRVDNFRKENPNAKIIRLGIGDVTKPLAPAVIDALHKAVDEMAKEETFKGYGPEQGYSF
LVSKIIEYDYMPRGIRLDEDEVFVSDGAKSDTGNFQEIFGLDNKVAVTDPVYPVYVDSNVMAGRTGKYLANGYFENITYL
PCTAENNFIPELPKEKVDIIYLCFPNNPTGMTLSREELKKWVDYARENRAIILFDSAYEAYIREKDVPHSIYEVEGADEV
AIEFRSFSKTAGFTGTRCAYTVVPKKVVAYTKNGEAHQLNSLWNRRQTTKFNGVPYIIQRAAAAVYTPEGQKQTKETIDY
YMENAKIIKQGLEDIGLTVFGGVNAPYIWLKTPDGISSWEFFDIMLKEINVVGTPGSGFGPSGEGYFRLTAFGSRENTLE
AVERFKNLKF
>Q5LC03 2.6.1.83~~~dapL~~~LL-diaminopimelate aminotransferase~~~COG0436
MALVNEHFLKLPGSYLFSDIAKKVNTFKITHPKRDIIRLGIGDVTRPLPKACIEAMHKAVEEMTSAETFRGYGPEQGYDF
LIEAIIKNDYAPRGIHLSPTEVFVNDGAKSDTGNIGDILRHDNSVGVTDPIYPVYIDSNVMCGRAGVLDTESGKWSNVTY
MPCTAENHFIPAIPEKRIDIVYLCYPNNPTGTTLTKAELKKWVDYALANDTLILFDAAYEAYIREPDIPHSIYEIKGAKK
CAIEFRSFSKTAGFTGVRCGYTVVPKELTAATLEGERIPLNRLWNRRQCTKFNGTSYITQRAAEAIYTPEGKEQIQETIN
YYMTNARIMKEGLESTGLKVYGGVNAPYLWVKTPKGTSSWRFFDQMLYEANVVGTPGVGFGPSGEGYIRLTAFGERDDCI
EAMRRIKNRL
>O84395 2.6.1.83~~~dapL~~~LL-diaminopimelate aminotransferase~~~
MKRNPHFVSLTKNYLFADLQKRVAQFRLENPQHTVINLSIGDTTQPLNASVAEAFASSIARLSSPTTCRGYGPDFGLPAL
RQKLSEDFYRGFVDAKEIFISDGAKVDLFRLLSFFGPNQTVAIQDPSYPAYLDIARLTGAKEIIALPCLQENAFFPEFPE
DTHIDILCLCSPNNPTGTVLNKDQLRAIVHYAIEHEILILFDAAYSTFISDPSLPKSIFEIPDARFCAIEINSFSKPLGF
AGIRLGWTVIPQELTYADGHFVIQDWERFLSTTFNGASIPAQEAGVAGLSILPQLEAIHYYRENSDLLRKALLATGFEVF
GGEHAPYLWVKPTQANISDRDLFDFFLREYHIAITPGIGFGRSGSGFVRFSSLGKREDILAACERLQMAPALQS
>Q18T09 2.6.1.83~~~dapL~~~LL-diaminopimelate aminotransferase~~~
MAQINENYLKLPGSYLFSEIARRVNEFKVQNPDADIIRLGIGDVTRPLAPVVVEAMKQAVEEMGRAETFRGYGPEQGYDF
LIEKIIANDYAPRGVQLGMDEVFVSDGAKSDTANFQEIFGVDNIMAVTDPVYPVYVDSNVMAGRTGNYDTEKGQYGRIIY
LPCTEEGDMKPELPTAPVDMIYLCFPNNPTGMTLTKEELKVWVDYARENKAIILFDSAYEAFIREEGVPRSIYEVEGARE
VAVEFRSFSKTAGFTGTRCAYTVVPKDIMIYDSTGEGHSLNKLWLRRQTTKFNGVSYPVQAGAAAVYTEEGKKQIQATID
YYMENARIIREGLQEAGFKVFGGVNAPYIWMKTPGTMGSWEFFDKLMTEAHVVGTPGAGFGANGEGFFRLTAFGTRENTE
KAIERIKARMK
>Q7NDX4 2.6.1.83~~~dapL~~~LL-diaminopimelate aminotransferase~~~COG0436
MKTAARLDRIPPYLFAEIDRRRDEAVARGVDIINMGIGDPDKPTPPVVLEAMHAAIDDPSTHNYPPYKGTKAYREAAAAW
FERRFGVGGFHPDTEVISSIGSKEAIHNTFLAFVDPGDYTLIPDPAYPVYRTSTIFAGGEFFAMPLLPENQLLPDLEAVP
ETVARKAKLLWLNYPNNPTGAVASLEFFEKVVHFAKKHDILVCHDNAYSEMAYDGYKPPSILQVPGARDVAIEFLSCSKA
YNMTGWRVGFVIGNRTGIAGLGQVKTNIDSGVFKAIQQAAIAAFGLDDERLHALMAVYQNRRNIIVEGLRSLGWPLEAPK
ATLYVWAPIPKSFGSSVEFVGALLDKCGIIVPPGNGYGEHGEGFFRIALTVPDERMREAIGRMEAAGIRFEG
>Q72NJ3 2.6.1.83~~~dapL~~~LL-diaminopimelate aminotransferase~~~
MANINENYLKLKAGYLFPEISKRVKIYSEKNPSAKIIRLGIGDVTLPIVPSVVDAMVEASKEMGTVGGFHGYGPEQGYSF
LLKSIADHDYGSLGIKIDESEIFVSDGSKCDCGNIQEIFSTDSKIAVADPVYPVYVDTNVMAGRTGEIGPDGRYSNLIYM
PATKENGFQPEIPKEKADIVYLCYPNNPTGTVTTKESLKAWVEYAKKNNSIILYDSAYEAFISEPGVPRSIYEVEGAKEV
AIEFRSFSKTAGFTGLRCAYIVIPKELKGRTRSGEEVSLNSLWNRRHTTKFNGVSYVTQKGAEACYSPQGKKEIQTSIAY
YMANASKIRDGLKKAGYEVFGGVNAPYIWLKTSDNLSSWDFFDKLLNKAQVVGTPGSGFGPAGEGYFRLSAFGKKEDVEE
AIARITSL
>Q2RK33 2.6.1.83~~~dapL~~~LL-diaminopimelate aminotransferase~~~COG0436
MQEARRIRELPPYLFARIEKKIAEARERGVDIISLGIGDPDMPTPSHVIDKLVAEAHNPENHRYPTSEGLLAFRQAVADW
YQRLYGVDLDPRREVVTLIGSKEGIAHISLCYVDPGDINLVPDPGYPVYNIGTLLAGGESYFMPLTAANGFLPDLGAIPS
DVARRAKLMFINYPNNPTGAVADLKFFQEVVEFARSYDLIVCHDAAYSEITYDGYRAPSFLQAPGAKEVGIEFNSVSKPY
NMTGWRLGWACGRADVIEALARIKSNIDSGAFQAVQYAGIAALTGPQEGLAEVRRVYQERRDIIVEGFNSLGWHLEKPKA
TFYVWAPVPRGYTSASFAEMVLEKAGVIITPGNGYGNYGEGYFRIALTISKERMQEAIERLRRVLGKVEF
>Q6MDE0 2.6.1.83~~~dapL~~~LL-diaminopimelate aminotransferase~~~COG0436
MVKRNVHLTKLQSGYLFPEINRRKNEFLKKHPSAQLINLGIGDTTQPIPLYISEAMQNFAKQLASEKTYRGYGTEQGSIL
LREAIAEQYYQGKIDPQEVFVSDGSKCDVGRLQILFGSDATIAVQNPTYPAYVDTGVINGQASFFQTSTKQYQRITYMSC
LPENNFFPDLANLPKTDLIYFCSPNNPTGSAATNEQLRELVQFAKKRQSIIIFDAAYASFVRSSHIPRSIYEIEGAKEVA
IEVGSFSKMIGFTGVRLGWSVVPKQLRFEDGHSVQQDWERIVCTFFNGASNIAQAGGLAALQKEGLQAIDELSSYYMKNS
NILKKAFEECGYKVYGGENVPYLWVHFPQLTSWEAFEILLKQSQLVSVPGSGFGSAGEGFLRFSAFGKQSDITVALPRIK
HALLKIKPTVY
>A0LEA5 2.6.1.83~~~dapL~~~LL-diaminopimelate aminotransferase~~~COG0436
MAFVKAERLKLLPPYLFQEIDRLKAELTAKGVDVINLGVGDPDLPTPDHIIARLKTAAEDPSTHQYPSYSGMNDFKVSVA
GWYKRRFGVELDPLSEVLTLIGSKEGLAHFPLAVINPGDLALVPTPAYPVYHVATMFAGGESYFMPLVRENGFLPDLDSI
PADVARRAKVMFINYPNNPTGATAERDFFEKVIAFAREYDVIVCHDAAYTEMAFGGYRPLSFLELPGAGEVGVEFHSLSK
TYNMTGWRLGFAVGNADILAGLGQVKSNIDSGAFNAVQWAGITALEGDQGCVVEMQRIYKERLDILIEGLKRIGLHPEVP
RATFYVWCPTPPGYSSKDFSSLLLREAGIVATPGSGFGAPGEGYIRMALTVDKERVREAVERMRKLSF
>Q55828 2.6.1.83~~~dapL~~~LL-diaminopimelate aminotransferase~~~COG0436
MASINDNYLKLKAGYLFPEIARRVNAFTTANPNAQVIKLGIGDVTEPLPLACRQAMAKAIDDMGDRQTFKGYGPEQGYAW
LREKIAQHDFQARGCEVNAEEIFISDGSKCDTGNILDIFGKDNTIAVTDPVYPVYVDTNVMAGHTGDANEKGEYGGLVYL
PISAENDFVAAIPSKKVDLIYLCFPNNPTGATATKAYLKQWVDYALAHGSIIFFDAAYEAFITDPTLPHSIYEIEGARDC
AIEFRSFSKNAGFTGTRCALTVVPKTLTAKAADGSDVELWKLWNRRQSTKFNGVSYIIQRGAEAVYSPEGQAQVQELIAF
YLENARIIREKLAAAGLQVYGGINAPYVWVKTPHGLSSWDFFDKLLHTVNVVGTPGSGFGAAGEGYFRISAFNSRANVEE
AMERITSTLKLG
>Q8UGL3 4.3.3.7~~~dapA~~~4-hydroxy-tetrahydrodipicolinate synthase~~~COG0329
MFKGSIPALITPFTDNGSVDEKAFAAHVEWQIAEGSNGLVPVGTTGESPTLSHDEHKRVVELCIEVAAKRVPVIAGAGSN
NTDEAIELALHAQEAGADALLVVTPYYNKPTQKGLFAHFSAVAEAVKLPIVIYNIPPRSVVDMSPETMGALVKAHKNIIG
VKDATGKLDRVSEQRISCGKDFVQLSGEDGTALGFNAHGGVGCISVTANVAPRLCSEFQAAMLAGDYAKALEYQDRLMPL
HRAIFMEPGVCGTKYALSKTRGGNRRVRSPLMSTLEPATEAAIDAALKHAGLMN
>O67216 4.3.3.7~~~dapA~~~4-hydroxy-tetrahydrodipicolinate synthase~~~COG0329
MFQGSIVALITPFKEGEVDYEALGNLIEFHVDNGTDAILVCGTTGESPTLTFEEHEKVIEFAVKRAAGRIKVIAGTGGNA
THEAVHLTAHAKEVGADGALVVVPYYNKPTQRGLYEHFKTVAQEVDIPIIIYNIPSRTCVEISVDTMFKLASECENIVAS
KESTPNMDRISEIVKRLGESFSVLSGDDSLTLPMMALGAKGVISVANNVMPREVKELIRAALEGDFRRAREIHYYLHDLF
KVLFIETNPIPVKTACWMLGMCEKEFRLPLTEMSPENENKLREVLKKYNLPLKN
>Q04796 4.3.3.7~~~dapA~~~4-hydroxy-tetrahydrodipicolinate synthase~~~COG0329
MNFGNVSTAMITPFDNKGNVDFQKLSTLIDYLLKNGTDSLVVAGTTGESPTLSTEEKIALFEYTVKEVNGRVPVIAGTGS
NNTKDSIKLTKKAEEAGVDAVMLVTPYYNKPSQEGMYQHFKAIAAETSLPVMLYNVPGRTVASLAPETTIRLAADIPNVV
AIKEASGDLEAITKIIAETPEDFYVYSGDDALTLPILSVGGRGVVSVASHIAGTDMQQMIKNYTNGQTANAALIHQKLLP
IMKELFKAPNPAPVKTALQLRGLDVGSVRLPLVPLTEDERLSLSSTISEL
>Q6G468 4.3.3.7~~~dapA~~~4-hydroxy-tetrahydrodipicolinate synthase~~~COG0329
MLKGAVTALITPFDDNGAIDEKAFCNFVEWQITQGINGVSPVGTTGESPTLTHEEHKRIIELCVEQVAKRVPVVAGAGSN
STSEAVELAKHAEKAGADAVLVVTPYYNRPNQRGLYTHFSSIAKAISIPIIIYNIPSRSVIDMAVETMRDLCRDFKNIIG
VKDATGKIERASEQREKCGKDFVQLSGDDCTALGFNAHGGVGCISVSSNVAPKLCAQLHAACLCSDYKTALKLNDLLMPL
NRAVFIEPSPAGIKYAAAKLGLCGTIVRSPIVPLSDTTKKIIDEALYHAGLLKE
>Q8G1R0 4.3.3.7~~~dapA~~~4-hydroxy-tetrahydrodipicolinate synthase~~~
MLKGSITALVTPFDREGAFDEKAFRAFVNWQIEEGTKGLVPVGTTGETPTLSHDEHKRVIEVCIEVAAGRVPVIAGAGSN
NTVEAIELAQHAEKAGADAVLVVTPYYNKPNQRGLYEHFSRVVRSISIPLVIYNIPGRSIIDMTPETMGALVRDCKNIVG
VKDATGKIERVSEQRAICGKEFIQLSGEDATALGFNAHGGVGCISVTSNIAPRLCAEFQEACQAGNFAKALELQDRLMPL
HKALFLEPNPSGPKYALSRLGRIENVLRSPMVTIEAATAEKIDHAMKHAGLIN
>Q9PPB4 4.3.3.7~~~dapA~~~4-hydroxy-tetrahydrodipicolinate synthase~~~COG0329
MDKNIIIGAMTALITPFKNGKVDEQSYARLIKRQIENGIDAVVPVGTTGESATLTHEEHRTCIEIAVETCKGTKVKVLAG
AGSNATHEAVGLAKFAKEHGADGILSVAPYYNKPTQQGLYEHYKAIAQSVDIPVLLYNVPGRTGCEISTDTIIKLFRDCE
NIYGVKEASGNIDKCVDLLAHEPRMMLISGEDAINYPILSNGGKGVISVTSNLLPDMISALTHFALDENYKEAKKINDEL
YNINKILFCESNPIPIKTAMYLAGLIESLEFRLPLCSPSKENFAKIEEVMKKYKIKGF
>P19808 4.3.3.7~~~dapA~~~4-hydroxy-tetrahydrodipicolinate synthase~~~COG0329
MSTGLTAKTGVEHFGTVGVAMVTPFTESGDIDIAAGREVAAYLVDKGLDSLVLAGTTGESPTTTAAEKLELLKAVREEVG
DRAKLIAGVGTNNTRTSVELAEAAASAGADGLLVVTPYYSKPSQEGLLAHFGAIAAATEVPICLYDIPGRSGIPIESDTM
RRLSELPTILAVKDAKGDLVAATSLIKETGLAWYSGDDPLNLVWLALGGSGFISVIGHAAPTALRELYTSFEEGDLVRAR
EINAKLSPLVAAQGRLGGVSLAKAALRLQGINVGDPRLPIMAPNEQELEALREDMKKAGVL
>P0A6L2 4.3.3.7~~~dapA~~~4-hydroxy-tetrahydrodipicolinate synthase~~~COG0329
MFTGSIVAIVTPMDEKGNVCRASLKKLIDYHVASGTSAIVSVGTTGESATLNHDEHADVVMMTLDLADGRIPVIAGTGAN
ATAEAISLTQRFNDSGIVGCLTVTPYYNRPSQEGLYQHFKAIAEHTDLPQILYNVPSRTGCDLLPETVGRLAKVKNIIGI
KEATGNLTRVNQIKELVSDDFVLLSGDDASALDFMQLGGHGVISVTANVAARDMAQMCKLAAEGHFAEARVINQRLMPLH
NKLFVEPNPIPVKWACKELGLVATDTLRLPMTPITDSGRETVRAALKHAGLL
>P9WP25 4.3.3.7~~~dapA~~~4-hydroxy-tetrahydrodipicolinate synthase~~~COG0329
MTTVGFDVAARLGTLLTAMVTPFSGDGSLDTATAARLANHLVDQGCDGLVVSGTTGESPTTTDGEKIELLRAVLEAVGDR
ARVIAGAGTYDTAHSIRLAKACAAEGAHGLLVVTPYYSKPPQRGLQAHFTAVADATELPMLLYDIPGRSAVPIEPDTIRA
LASHPNIVGVKDAKADLHSGAQIMADTGLAYYSGDDALNLPWLAMGATGFISVIAHLAAGQLRELLSAFGSGDIATARKI
NIAVAPLCNAMSRLGGVTLSKAGLRLQGIDVGDPRLPQVAATPEQIDALAADMRAASVLR
>Q9JZR4 4.3.3.7~~~dapA~~~4-hydroxy-tetrahydrodipicolinate synthase~~~
MLQGSLVALITPMNQDGSIHYEQLRDLIDWHIENGTDGIVAVGTTGESATLSVEEHTAVIEAVVKHVAKRVPVIAGTGAN
NTVEAIALSQAAEKAGADYTLSVVPYYNKPSQEGIYQHFKTIAEATSIPMIIYNVPGRTVVSMTNDTILRLAEIPNIVGV
KEASGNIGSNIELINRAPEGFVVLSGDDHTALPFMLCGGHGVITVAANAAPKLFADMCRAALQGDIALARELNDRLIPIY
DTMFCEPSPAAPKWAVSALGRCEPHVRLPLVPLTENGQAKVRAALKASGQL
>Q9I4W3 4.3.3.7~~~dapA~~~4-hydroxy-tetrahydrodipicolinate synthase~~~
MIAGSMVALVTPFDAQGRLDWDSLAKLVDFHLQEGTNAIVAVGTTGESATLDVEEHIQVIRRVVDQVKGRIPVIAGTGAN
STREAVALTEAAKSGGADACLLVTPYYNKPTQEGMYQHFRHIAEAVAIPQILYNVPGRTSCDMLPETVERLSKVPNIIGI
KEATGDLQRAKEVIERVGKDFLVYSGDDATAVELMLLGGKGNISVTANVAPRAMSDLCAAAMRGDAAAARAINDRLMPLH
KALFIESNPIPVKWALHEMGLIPEGIRLPLTWLSPRCHEPLRQAMRQTGVLA
>Q07607 4.3.3.7~~~dapA~~~4-hydroxy-tetrahydrodipicolinate synthase~~~
MFEGSITALVTPFADDRIDEVALHDLVEWQIEEGSFGLVPCGTTGESPTLSKSEHEQVVEITIKTANGRVPVIAGAGSNS
TAEAIAFVRHAQNAGADGVLIVSPYYNKPTQEGIYQHFKAIDAASTIPIIVYNIPGRSAIEIHVETLARIFEDCPNVKGV
KDATGNLLRPSLERMACGEDFNLLTGEDGTALGYMAHGGHGCISVTANVAPALCADFQQACLNGDFAAALKLQDRLMPLH
RALFLETNPAGAKYALQRLGRMRGDLRLPLVTISPSFQEEIDDAMRHAGILL
>Q8ZN71 4.3.3.7~~~dapA~~~4-hydroxy-tetrahydrodipicolinate synthase~~~
MFTGSIVALVTPMDEKGNVSRSCLKKLIDYHVANGTSAIVSVGTTGESATLSHDEHGDVVMMTLELADGRIPVIAGTGAN
ATAEAISLTQRFNDSGIVGCLTVTPYYNRPTQEGLFQHFKAIAEHTDLPQILYNVPSRTGCDMLPETVGRLAEIKNIIAI
KEATGNLTRVHQIKELVSDDFILLSGDDASALDFMQLGGHGVISVTANVAAREMADMCKLAAEGQFAEARAINQRLMPLH
NKLFVEPNPIPVKWACKALGLVATDTLRLPMTPITDHGRDIVKAALQHAGLL
>Q5HG25 4.3.3.7~~~dapA~~~4-hydroxy-tetrahydrodipicolinate synthase~~~
MTHLFEGVGVALTTPFTNNKVNIEALKTHVNFLLENNAQAIIVNGTTAESPTLTTDEKERILKTVIDLVDKRVPVIAGTG
TNDTEKSIQASIQAKALGADAIMLITPYYNKTNQRGLVKHFEAIADAVKLPVVLYNVPSRTNMTIEPETVEILSQHPYIV
ALKDATNDFEYLEEVKKRIDTNSFALYSGNDDNVVEYYQRGGQGVISVIANVIPKEFQALYDAQQSGLDIQDQFKPIGTL
LSALSVDINPIPIKALTSYLGFGNYELRLPLVSLEDTDTKVLRETYDTFKAGENE
>Q6GH13 4.3.3.7~~~dapA~~~4-hydroxy-tetrahydrodipicolinate synthase~~~
MTHLFEGVGVALTTPFTNNKVNLEALKAHVNFLLENNAQAIIVNGTTAESPTLTTDEKELILKTVIDLVDKRVPVIAGTG
TNDTEKSIQASIQAKALGADAIMLITPYYNKTNQRGLVKHFEAIADAVKLPVVLYNVPSRTNMTIEPETVEILSQHPYIV
ALKDATNDFEYLEEVKKRIDTNSFALYSGNDDNVVEYYQRGGQGVISVIANVIPKEFQALYDAQQSGLDIQDQFKPIGTL
LSALSVDINPIPIKALTSYLGFGNYELRLPLVSLEDTDTKVLREAYDTFKAGENE
>Q97R25 4.3.3.7~~~dapA~~~4-hydroxy-tetrahydrodipicolinate synthase~~~COG0329
MSYQDLKKCKIITAFITPFHEDGSINFDAIPALIEHLLAHHTDGILLAGTTAESPTLTHDEELELFAAVQKVVNGRVPLI
AGVGTNDTRDSIEFVKEVAEFGGFAAGLAIVPYYNKPSQEGMYQHFKTIADASDLPIIIYNIPGRVVVELTPETMLRLAD
HPNIIGVKECTSLANMAYLIEHKPEEFLIYTGEDGDAFHAMNLGADGVISVASHTNGDEMHEMFTAIAESDMKKAAAIQR
KFIPKVNALFSYPSPAPVKAILNYMGFEAGPTRLPLVPAPEEDAKRIIKVVVDGDYEATKATVTGVLRPDY
>Q9X1K9 4.3.3.7~~~dapA~~~4-hydroxy-tetrahydrodipicolinate synthase~~~COG0329
MFRGVGTAIVTPFKNGELDLESYERLVRYQLENGVNALIVLGTTGESPTVNEDEREKLVSRTLEIVDGKIPVIVGAGTNS
TEKTLKLVKQAEKLGANGVLVVTPYYNKPTQEGLYQHYKYISERTDLGIVVYNVPGRTGVNVLPETAARIAADLKNVVGI
KEANPDIDQIDRTVSLTKQARSDFMVWSGNDDRTFYLLCAGGDGVISVVSNVAPKQMVELCAEYFSGNLEKSREVHRKLR
PLMKALFVETNPIPVKAALNLMGFIENELRLPLVPASEKTVELLRNVLKESGLL
>O07834 3.4.14.-~~~dapb1~~~Dipeptidyl aminopeptidase BI~~~
MKPTSLLLAATVLMSTPITSALAASATPPDVAKKPHVVKAPHGAERNDEYYWLRDDKRENKEMLAYLNAENAYTDAVMAP
LKPLEDKLYDEVVARIKQDDASVPYRERGWWYYARFVTGKDYPVHARRKDGPGVDAVSIQAANAAGDFAGEQVLLDVNAL
GAGKDYYNVGDYEVSQDNRLLAYADDTNGRRQYTIRFKNLDTGELLPDTVTNAEPNLVWSDDGRTLFYVDKDPETLLSKR
VKAHVLGTPASQDALVYEEEDDSFYMGIGRSRDDKFICISVESTVSSEMRCTPAASPGVFTVLAPRERDVEYQADHLGDR
WVIRTNADGATNFKIVTAPTDSTSRKDWKDWVAHRDDVFVEGFELFDGFSVVAERANALESLRVIKADGSSDYVKADESA
YSMGLSANPETGTDWLRYSYTSMTTPATTYEINTKTGERRQLKQQPVPGYDASKYVTERVWAPARDGKTKIPVTLVYRKD
VARDGKAPMLQYAYGSYGASMDPNFSITNVSLLDRGVVYALAHIRGGQEMGRAWYDDGKLYNKINTFTDFIDVTDYLVKE
GYAAKDRVAAMGGSAGGLLMGAVSNMAPEKYKVILTLVPFVDVVTTMLDPTIPLTTNEYDEWGNPEEKGYYDYILTYSPY
DNLQAKAYPAMFVGTGLWDSQVQYWEPAKYVARLRDLNTGKGPVVFRTNMEAGHGGKSGRFRQYRERAEMFAFMLDQLGV
ASK
>V5YM14 3.4.14.-~~~dapb2~~~Dipeptidyl aminopeptidase BII~~~
MRPNLLAAAIAVPLSLLAAQIAQAGEGMWVPQQLPEIAGPLKKAGLKLSPQQISDLTGDPMGAVVALGGCTASFVSPNGL
VVTNHHCAYGAIQLNSTAENNLIKNGFNAPTTADEVSAGPNARVFVLDEITDVTKDAKAAIAAAGDDALARTKALEAFEK
KLIADCEAEAGFRCRLYSFSGGNTYRLFKNLEIKDVRLAYAPPGSVGKFGGDIDNWMWPRHTGDFAFYRAYVGKDGKPAA
FSKDNVPYQPKHWLKFADQPLGAGDFVMVAGYPGSTNRYALAAEFDNTAQWTYPTIARHYKNQIAMVEAAGKQNADIQVK
YAATMAGWNNTSKNYDGQLEGFKRIDAAGQKLREEAAVLGWLKGQGAKGQPALDAHAKLLDLLEQSKATRDRDLTLALFN
NTAMLGSATQLYRLSIEREKPNAERESGYQERDLPAIEGGLKQLERRYVAAMDRQLQEYWLNEYIKLPADQRVAAVDAWL
GGNDAAAVKRALDRLAGTKLGSTEERLKWFAADRKAFEASNDPAIQYAVAVMPTLLKLEQERKTRAGENLAARPVYLQAL
ADYKKSQGEFVYPDANLSLRITFGNVMGYAPKDGMEYTPFTTLEGVVAKETGQDPFDSPKALLDAVAAKRYGGLEDKRIG
SVPVNYLSDLDITGGNSGSPVLDAHGKLVGLAFDGNWESVSSNWVFDPKMTRMIAVDGRYLRWIMQEVYPAPQLLKEMNV
GK
>V5YMB3 3.4.14.-~~~dapb3~~~Dipeptidyl aminopeptidase BIII~~~
MRHPAFRLTLLASTVAFALAPQAAQAAPSAADRIAGTELIARDALFGNPERANVQISPDGKYLSWVAAVDGVLNVWIAPA
DNPSQARAVTQDTARGIRSYFWSYQPDTLLYLRDSGGDEDFHLYAVDLKTGQAKDLTPFPKTTAQVAGVSPKHPGTILVG
MNDRDAQWHDIYKVDLASGNRTLLEKNDAQIAGYIADADYTLKYAQRSRPDGGADVLRRGANGAWEKFDDIPFEDVLTTS
PGGLTLDGKTLYFTDSRGRNTAALFAIDVASGKRTLVLEDARADVGGTLADPATGKVQAVSVDYLRDEWKVVDPAIRADL
EKLEAIGPGDVSVNTRTLDDKTWIVAYSAAEAPLVYYRYDRSAGTLTKLFSARPKLEGKPLVPQWPVEIASRDNKTLVSY
LTLPRSADANNDGKADAPVPLVLLVHGGPWARDSYGYGGYNQWLANRGYAVLSVNFRGSTGFGKDFTNAGNGEWAGKMHD
DLIDAVQWAVKQGVTTQDQVAIMGGSYGGYATLTGLTFTPDAFACGVDIVGPSNLNTLLSTVPPYWASFFEQLAKRMGDP
RTDAGKKWLTERSPLTRADQIKKPLLIGQGANDPRVKQAESDQIVKAMQAKNIPVTYVLFPDEGHGFARPENNKAFNAVT
EGFLAQCLGGRAEPIGKDFTGSSISVPVGADGVPGLAEALKGHTQEVKK
>B0VA26 1.17.1.8~~~dapB~~~4-hydroxy-tetrahydrodipicolinate reductase~~~
MSAAPRIGILGAGGRMGRILIQAVQQAGYQLGAAVVRPESTLIGADAGELAGIGSIGVKLTGSLAEVLEDCDVVIDFSTP
AATSEHLKLCREAGVAIVIGTTGMSDEQKAELDETAKHIPVVYAANYSVGVNVSIKLLELAAKVFGDTVDIEVIEAHHRH
KVDAPSGTALMMGEAIADTLGRNLKEVAVYGREGHTGPRDRQTIGFETIRGGDIVGEHTVMFIGEGERVEVTHKATNRMN
FAAGAVRAAAWVVGREARKYDMKDVLGLNDVQV
>Q6G2G3 1.17.1.8~~~dapB~~~4-hydroxy-tetrahydrodipicolinate reductase~~~COG0289
MRLTVVGANGRMGRELITAIQRRKDVELCAVLVRKGSSFVDKDASILIGSDFLGVRITDDPESAFSNTEGILDFSQPQAS
VLYANYAAQKSLIHIIGTTGFSKTEEAQIADFAKYTTIVKSGNMSLGVNLLANLVKRAAKALDDDFDIEIYEMHHANKVD
SPSGTALLLGQAAAEGRNIMLKNVSVNGRSGHTGKREKGTIGFACSRGGTVIGDHSITFAGENERIVLSHIAQERSIFAN
GALKAALWAKNHENGLYSMLDVLGLNE
>Q2SZ94 1.17.1.8~~~dapB~~~4-hydroxy-tetrahydrodipicolinate reductase~~~
MSSMKIAIAGASGRMGRMLIEAVLAAPDATLVGALDRTGSPQLGQDAGAFLGKQTGVALTDDIERVCAEADYLIDFTLPE
GTLVHLDAALRHDVKLVIGTTGFSEPQKAQLRAAGEKIALVFSANMSVGVNVTMKLLEFAAKQFAQGYDIEIIEAHHRHK
VDAPSGTALMMGETIAAATGRSLDDCAVYGRHGVTGERDPSTIGFSAIRGGDIVGDHTVLFAGIGERIEITHKSASRVSY
AQGALRAARFLAGRDAGFFDMQDVLGLR
>P40110 1.17.1.8~~~dapB~~~4-hydroxy-tetrahydrodipicolinate reductase~~~COG0289
MGIKVGVLGAKGRVGQTIVAAVNESDDLELVAEIGVDDDLSLLVDNGAEVVVDFTTPNAVMGNLEFCINNGISAVVGTTG
FDDARLEQVRDWLEGKDNVGVLIAPNFAISAVLTMVFSKQAARFFESAEVIELHHPNKLDAPSGTAIHTAQGIAAARKEA
GMDAQPDATEQALEGSRGASVDGIPVHAVRMSGMVAHEQVIFGTQGQTLTIKQDSYDRNSFAPGVLVGVRNIAQHPGLVV
GLEHYLGL
>P24703 1.17.1.8~~~dapB~~~4-hydroxy-tetrahydrodipicolinate reductase~~~COG0289
MAINVIINGINGKMGRVVKENITAQSDLELVSGTGRQDDLAKTIQTTHADVVIDFTTPQSVFHNAEIIIQSGARPVIGTT
GLTLEQIALLDKQCRNKKLGAIVAPNFSVGAVLMMKYAKEAAHYFPDVEIIEMHHSQKIDAPSGTAIKTAQMIGEMRSSK
KDEPFKDRARGEIKNGIPIHSIRLPGLFSHQSVIFGSNGETLTIRHDGMDRNCTMPGIFMACRKVMELDYLVYGLENLL
>P04036 1.17.1.8~~~dapB~~~4-hydroxy-tetrahydrodipicolinate reductase~~~COG0289
MHDANIRVAIAGAGGRMGRQLIQAALALEGVQLGAALEREGSSLLGSDAGELAGAGKTGVTVQSSLDAVKDDFDVFIDFT
RPEGTLNHLAFCRQHGKGMVIGTTGFDEAGKQAIRDAAADIAIVFAANFSVGVNVMLKLLEKAAKVMGDYTDIEIIEAHH
RHKVDAPSGTALAMGEAIAHALDKDLKDCAVYSREGHTGERVPGTIGFATVRAGDIVGEHTAMFADIGERLEITHKASSR
MTFANGAVRSALWLSGKESGLFDMRDVLDLNNL
>P94844 1.17.1.8~~~dapB~~~4-hydroxy-tetrahydrodipicolinate reductase~~~COG0289
MKIGVYGASGRIGKLLLEELKGGYKGLALSSVFVRQKCETDFSSFSHAPLVTNDLKAFVRACECVIDFSLPKGVDNLLEA
LLECPKILVSGTTGLEKETLEKMQQLALKAPLLHAHNMSIGIMMLNQLAFLTSLKLKDADIEIIETHHNLKKDIPSGTAL
SLYETCAKARGYDEKNALITHREGLRSKESIGIAALRGGDVAGKHTIGFYLEGEYIELSHTATNRSIFAKGALEVALWLK
DKAAKKYEINEMFG
>A0QVR3 1.17.1.8~~~dapB~~~4-hydroxy-tetrahydrodipicolinate reductase~~~COG0289
MRVGVLGARGKVGATMVAAVEAAEDLTFSAGVDAGDDLSLLTESKTEVVIDFTHPDVVMDNLKFVIDNGIHAVVGTTGFT
WERIEQVEAWVKAKPGASVLIAPNFAIGAVLSMHFAKQAAKYFESVEIIELHHPHKADAPSGTAARTAKLIAEARKGLPP
NPDATSTGLDGARGADVDGIPVHSVRLAGLVAHQEVLFGTQGETLTIRHDSLDRTSFVPGVLLAVRKISGLQGLTVGIEP
LLDLS
>P9WP23 1.17.1.8~~~dapB~~~4-hydroxy-tetrahydrodipicolinate reductase~~~COG0289
MRVGVLGAKGKVGATMVRAVAAADDLTLSAELDAGDPLSLLTDGNTEVVIDFTHPDVVMGNLEFLIDNGIHAVVGTTGFT
AERFQQVESWLVAKPNTSVLIAPNFAIGAVLSMHFAKQAARFFDSAEVIELHHPHKADAPSGTAARTAKLIAEARKGLPP
NPDATSTSLPGARGADVDGIPVHAVRLAGLVAHQEVLFGTEGETLTIRHDSLDRTSFVPGVLLAVRRIAERPGLTVGLEP
LLDLH
>Q5F5Y7 1.17.1.8~~~dapB~~~4-hydroxy-tetrahydrodipicolinate reductase~~~
MIPLKIAIAGANGRMGRVLVEAVNNHPDTVLSGALEHSGSEALGLDAGYAVGLKTGIAISDDVDAVLAQSDVLIDFTRPE
PTLKHLQKCVEKQVNIIIGTTGFDDAGKAAIRAAAEKTGIVFAANFSVGVNLTFHILDTVARVLNEGYDIEIIEGHHRHK
VDAPSGTALRMGEVIAGALGRDLKQCAVYGREGHTGPRDPSTIGFATVRAGDIVGDHTALFATDGERVEITHKAGSRMTF
AAGAVRAAVWVNGKTGLYDMQDVLGLNNR
>Q9K1F1 1.17.1.8~~~dapB~~~4-hydroxy-tetrahydrodipicolinate reductase~~~
MTPLKIAIAGANGRMGRVLVEAVNNHPDTVLSGALEHSGSEALGLDAGYAVGLKTGIAISDDVDAVLAQSDVLIDFTRPE
PTLKHLQKCVEKQVNIIIGTTGFDDTGKAAIHTAAEKTGIVFAANFSVGVNLTFHILDTVARVLNEGYDIEIIEGHHRHK
VDAPSGTALRMGEVIAGALGRDLKQCAVYGREGHTGPRDPSTIGFATVRAGDIVGDHTALFATDGERVEITHKASSRMTF
AAGAVRAAVWVNGKTGLYDMQDVLGLNNR
>P38103 1.17.1.8~~~dapB~~~4-hydroxy-tetrahydrodipicolinate reductase~~~
MRRIAVVGAAGRMGKNLIEAVQQTGGAAGLTAAVDRPDSTLVGADAGELAGLGRIGVPLSGDLGKVCEEFDVLIDFTHPS
VTLKNIEQCRKARRAMVIGTTGFSADEKLLLAEAAKDIPIVFAANFSVGVNLCLKLLDTAARVLGDEVDIEIIEAHHRHK
VDAPSGTALRMGEVVAQALGRDLQEVAVYGREGQTGARARETIGFATVRAGDVVGDHTVLFAAEGERVEITHKASSRMTF
ARGAVRAALWLEGKENGLYDMQDVLGLR
>Q5HG24 1.17.1.8~~~dapB~~~4-hydroxy-tetrahydrodipicolinate reductase~~~
MKILLIGYGAMNQRVARLAEEKGHEIVGVIENTPKATTPYQQYQHIADVKGADVAIDFSNPNLLFPLLDEDFHLPLVVAT
TGEKEKLLNKLDELSQNMPVFFSANMSYGVHALTKILAAAVPLLDDFDIELTEAHHNKKVDAPSGTLEKLYDVIVSLKEN
VTPVYDRHELNEKRQPQDIGIHSIRGGTIVGEHEVLFAGTDETIQITHRAQSKDIFANGAIQAAERLVNKPNGFYTFDNL
>P63895 1.17.1.8~~~dapB~~~4-hydroxy-tetrahydrodipicolinate reductase~~~COG0289
MSIRVIIAGFKGKMGQAACQMVLTDPDLDLVAVLDPFESESEWQGIPVFKDKADLAGFEADVWVDFTTPAVAYENTRFAL
ENGFAPVVGTTGFTSEEIAELKEFSRAQDLGGLIAPNFALGAVLLMQFATQAAKYFPNVEIIELHHDKKKDAPSGTAIKT
AELMAEVRESIQQGAADEEELIAGARGADFDGMRIHSVRLPGLVAHQEVIFGNQGEGLTLRHDSYDRISFMTGVNLGIKE
VVKRHELVYGLEHLL
>Q9X1K8 1.17.1.8~~~dapB~~~4-hydroxy-tetrahydrodipicolinate reductase~~~COG0289
MKYGIVGYSGRMGQEIQKVFSEKGHELVLKVDVNGVEELDSPDVVIDFSSPEALPKTVDLCKKYRAGLVLGTTALKEEHL
QMLRELSKEVPVVQAYNFSIGINVLKRFLSELVKVLEDWDVEIVETHHRFKKDAPSGTAILLESALGKSVPIHSLRVGGV
PGDHVVVFGNIGETIEIKHRAISRTVFAIGALKAAEFLVGKDPGMYSFEEVIFGGE
>Q3MFY8 1.17.1.8~~~dapB~~~4-hydroxy-tetrahydrodipicolinate reductase~~~COG0289
MTNQAPIPVIVNGAAGKMGREVVKAIAQAPDLNLLGAIDSSPEHQGKDAGELAGLSEPLEVPITNQLEPMLGYVAGERQG
PPGVIVDFTHPDSVYDNVRSAIAYGIRPVVGTTGLSPAQIQNLADFAEKASTGCLIIPNFSIGMVLLQQAAVTASQYFDH
VEIIELHHNQKADAPSGTAIQTAELLAELGKTFNSAIVEETEKIPGARGSLAGEGIRIHSVRLPGLIAHQEVIFGAPGQI
YTLRHDTSDRACYMPGVLLAIRKVLQLKSLVYGLEKIL
>Q8DEM0 1.17.1.8~~~dapB~~~4-hydroxy-tetrahydrodipicolinate reductase~~~
MVRIAVAGAAGRMGRNLVKAAHHNPVAKVAAGSERPESSLVGVDLGELCGEGKFDVVVCDDLAKQIDQFDVIIDFTAPAS
TLNNLALCQQYGKSIVIGTTGFTEEQREQIDLVAQQVPVVMAPNYSVGVNLVFKLLEKAAKVMGDYCDIEIVEAHHRHKV
DAPSGTAIGMGEAIAGAMGNKLSDVAVYAREGITGERTKDEIGFATIRAGDIVGEHTAMFADIGERVEITHKATDRMTFA
NGAVKAAVWLHEKPAGFYTMTDVLGLNDL
>P9WPZ4 2.6.1.17~~~dapC~~~Probable N-succinyldiaminopimelate aminotransferase DapC~~~
MTVSRLRPYATTVFAEMSALATRIGAVNLGQGFPDEDGPPKMLQAAQDAIAGGVNQYPPGPGSAPLRRAIAAQRRRHFGV
DYDPETEVLVTVGATEAIAAAVLGLVEPGSEVLLIEPFYDSYSPVVAMAGAHRVTVPLVPDGRGFALDADALRRAVTPRT
RALIINSPHNPTGAVLSATELAAIAEIAVAANLVVITDEVYEHLVFDHARHLPLAGFDGMAERTITISSAAKMFNCTGWK
IGWACGPAELIAGVRAAKQYLSYVGGAPFQPAVALALDTEDAWVAALRNSLRARRDRLAAGLTEIGFAVHDSYGTYFLCA
DPRPLGYDDSTEFCAALPEKVGVAAIPMSAFCDPAAGQASQQADVWNHLVRFTFCKRDDTLDEAIRRLSVLAERPAT
>P9WPZ5 2.6.1.17~~~dapC~~~Probable N-succinyldiaminopimelate aminotransferase DapC~~~COG0436
MTVSRLRPYATTVFAEMSALATRIGAVNLGQGFPDEDGPPKMLQAAQDAIAGGVNQYPPGPGSAPLRRAIAAQRRRHFGV
DYDPETEVLVTVGATEAIAAAVLGLVEPGSEVLLIEPFYDSYSPVVAMAGAHRVTVPLVPDGRGFALDADALRRAVTPRT
RALIINSPHNPTGAVLSATELAAIAEIAVAANLVVITDEVYEHLVFDHARHLPLAGFDGMAERTITISSAAKMFNCTGWK
IGWACGPAELIAGVRAAKQYLSYVGGAPFQPAVALALDTEDAWVAALRNSLRARRDRLAAGLTEIGFAVHDSYGTYFLCA
DPRPLGYDDSTEFCAALPEKVGVAAIPMSAFCDPAAGQASQQADVWNHLVRFTFCKRDDTLDEAIRRLSVLAERPAT
>A3DDX7 1.4.1.16~~~ddh~~~Meso-diaminopimelate D-dehydrogenase~~~COG1748
MGGVTLEKIRIGIVGYGNLGKGAELGIRQNKDMELVGIFTRRNPNSIKPLTEGVKVYSVDSARDMADKIDVMLLCSGSRT
DLPVQGPEFAAMFNIVDGFDTHNKIQEYFESVDAKAKESKKVAVIACGWDPGMFSLNRLFGEVILPEGKTYTFWGKGVSQ
GHSDAIRRVKGVVDAKQYTIPVESAIELVRKGENPELTTRQKHIRECFVVVEEGADKERIEREIKTMPDYFADYDTIVHF
ISLEELKEKHSGIPHGGFSIRTGRTGINNENKHTIEYSLKLDSNPDFTANTLLAYARAAYRLNKEGVFGAKTVFDIPPAY
LSPKSAEELRRSLL
>Q5L9Q6 1.4.1.16~~~ddh~~~Meso-diaminopimelate D-dehydrogenase~~~COG0673
MKKVRAAIVGYGNIGRYVLEALQAAPDFEIAGVVRRAGAENKPAELNDYAVVKDIKELQGVDVAILCTPTRSVEKYAKEI
LAMGINTVDSFDIHTGIVDLRRELGACAKEHGAVSIISAGWDPGSDSIVRTMLEAIAPKGITYTNFGPGMSMGHTVAVKA
IDGVKAALSMTIPTGTGIHRRMVYIELKDGYKFEEVAAAIKSDAYFVNDETHVKQVPSVDALLDMGHGVNLTRKGVSGKT
QNQLFEFNMRINNPALTAQVLVCVARASMKQQPGCYTMVEVPVIDLLPGDREEWIGHLV
>P04964 1.4.1.16~~~ddh~~~Meso-diaminopimelate D-dehydrogenase~~~COG1748
MTNIRVAIVGYGNLGRSVEKLIAKQPDMDLVGIFSRRATLDTKTPVFDVADVDKHADDVDVLFLCMGSATDIPEQAPKFA
QFACTVDTYDNHRDIPRHRQVMNEAATAAGNVALVSTGWDPGMFSINRVYAAAVLAEHQQHTFWGPGLSQGHSDALRRIP
GVQKAVQYTLPSEDALEKARRGEAGDLTGKQTHKRQCFVVADAADHERIENDIRTMPDYFVGYEVEVNFIDEATFDSEHT
GMPHGGHVITTGDTGGFNHTVEYILKLDRNPDFTASSQIAFGRAAHRMKQQGQSGAFTVLEVAPYLLSPENLDDLIARDV
>Q9KWR0 1.4.1.16~~~dapdh~~~Meso-diaminopimelate D-dehydrogenase~~~
MSAIRVGIVGYGNLGRGVEFAISQNPDMELVAVFTRRDPSTVSVASNASVYLVDDAEKFQDDIDVMILCGGSATDLPEQG
PHFAQWFNTIDSFDTHAKIPEFFDAVDAAAQKSGKVSVISVGWDPGLFSLNRVLGEAVLPVGTTYTFWGDGLSQGHSDAV
RRIEGVKNAVQYTLPIKDAVERVRNGENPELTTREKHARECWVVLEEGADAPKVEQEIVTMPNYFDEYNTTVNFISEDEF
NANHTGMPHGGFVIRSGESGANDKQILEFSLKLESNPNFTSSVLVAYARAAHRLSQAGEKGAKTVFDIPFGLLSPKSAAQ
LRKELL
>G1UII1 1.4.1.16~~~ddh~~~Meso-diaminopimelate D-dehydrogenase~~~
MSKIRIGIVGYGNLGRGVEAAIQQNPDMELVAVFTRRDPKTVAVKSNVKVLHVDDAQSYKDEIDVMILCGGSATDLPEQG
PYFAQYFNTIDSFDTHARIPDYFDAVNAAAEQSGKVAIISVGWDPGLFSLNRLLGEVVLPVGNTYTFWGKGVSQGHSDAI
RRIQGVKNAVQYTIPIDEAVNRVRSGENPELSTREKHARECFVVLEEGADPAKVEHEIKTMPNYFDEYDTTVHFISEEEL
KQNHSGMPHGGFVIRSGKSDEGHKQIIEFSLNLESNPMFTSSALVAYARAAYRLSQNGDKGAKTVFDIPFGLLSPKSPED
LRKELL
>Q5DL43 2.3.1.117~~~dapD~~~2,3,4,5-tetrahydropyridine-2,6-dicarboxylate N-succinyltransferase~~~COG2171
MSQLSTIIEQAFEDRANFTAADCPSEIRQAVEEAIAGLDNGTLRVAEKINGEWVVHQWLKKAVLLSFKLNDNKPIESCDL
RFYDKVETKFSGWTEEQFKAAGVRVVPPAVARRGSFQAKNVVLMPSYVNIGAYVDEGTMVDTWATVGSCAQIGKNVHLSG
GVGIGGVLEPLQANPTIIEDNCFIGARSEIVEGVIVEEGSVISMGVYIGQSTRIYDRETGEIHYGRVPAGSVVVPGNLPS
ADGKYSLYAAIIVKKVDAQTRAKTSLNDLLRAD
>Q6G549 2.3.1.117~~~dapD~~~2,3,4,5-tetrahydropyridine-2,6-dicarboxylate N-succinyltransferase~~~COG2171
MTDLTQLEMIIEKAFDDRNSINTTTKGEILESVEHALNLLDKGEVRVVKRQKNGKWHVHQWLKKAVLLSFRLNPMQIMTG
GVNGTSWWDKVPSKFSHWQEADFKKADFRSVPGAIVRHSAYIAPNVILMPSFVNLGAFVDEGTMVDTWATVGSCAQIGKH
VHLSGGVGIGGVLEPLQANPTIIEDHCFIGARSEVVEGCIIREGSVLGMGVFIGKSTKIIDRTTGEIFIGEVPPYSVVVP
GSLPGKPLPNGEIGPNLYCAVIVKRVDQKTREKTSINDLLRD
>Q8FV25 2.3.1.117~~~dapD~~~2,3,4,5-tetrahydropyridine-2,6-dicarboxylate N-succinyltransferase~~~
MTKPDLASLEKTIEKAFDERDGINTATRGEVREAVEQSLILLDRGEVRVAEKQADGNWHVNQWLKKAVLLSFRLNPMEVI
KGGPGQSSWWDKVPSKFDGWTANEFEKAGFRAVPNCIVRHSAYIAPNAILMPSFVNLGAYVDKGAMIDTWATVGSCAQIG
KNVHLSGGVGIGGVLEPMQAGPTIIEDNCFIGARSEVVEGCIVREGSVLGMGVFIGKSTKIVDRATGEVFYGEVPPYSVV
VAGTMPGKNVPGENWGPSLYCAVIVKRADEKTRSKTSINELLRD
>Q0P823 2.3.1.117~~~dapD~~~2,3,4,5-tetrahydropyridine-2,6-dicarboxylate N-succinyltransferase~~~COG2171
MINTKEDFLLLIKQIEQKSGYKKPKAFGIARLDRGQLNKNKILQASFALINYEQNFGSAAIMLEAFMQRGVEIDFNASEF
VQTLKLEDIDFALSCFKPFLEEDGHQNIDLLKIIKDKFKDDEFSFVCLFEDKEPLSVESIYLKLYLLSTKKVPLRSINLN
GAFGLLSNVAWSDDKPIELEYLRANEMRLKMSNQYPKIDFVDKFPRFLAHIIPEDNTRILESSKVRMGASLAAGTTIMPG
ASYVNFNAGTTGACMVEGRISSSAIVGEGSDVGGGASILGVLSGTSGNAISVGKACLLGANSVTGIPLGDNCIVDAGIAV
LEGTKFLLKDAEELAKLNPYFNFDKEIYKGLELKGLNGLHFRQDSISGAMIVALNKKAVKLNEALH
>Q8NRE3 2.3.1.117~~~dapD~~~2,3,4,5-tetrahydropyridine-2,6-dicarboxylate N-succinyltransferase~~~COG2171
MTTASATGIATLTSTGDVLDVWYPEIGSTDQSALTPLEGVDEDRNVTRKIVTTTIDTDAAPTDTYDAWLRLHLLSHRVFR
PHTINLDGIFGLLNNVVWTNFGPCAVDGFALTRARLSRRGQVTVYSVDKFPRMVDYVVPSGVRIGDADRVRLGAYLADGT
TVMHEGFVNFNAGTLGASMVEGRISAGVTVDDGTDVGGGASIMGTLSGGGQHVISLGKRCLLGANSGCGIPLGDDCIIEA
GLYITAGTKVLFDGSLHKASTLAGSNGLIFRRDSVSGQVVAVPNTKVVELNTALHSN
>Q8X8Y7 2.3.1.117~~~dapD~~~2,3,4,5-tetrahydropyridine-2,6-dicarboxylate N-succinyltransferase~~~COG2171
MQQLQNIIETAFERRAEITPANADTVTREAVNQVIALLDSGALRVAEKIDGQWVTHQWLKKAVLLSFRINDNQVIEGAES
RYFDKVPMKFADYDEARFQKEGFRVVPPAAVRQGAFIARNTVLMPSYVNIGAYVDEGTMVDTWATVGSCAQIGKNVHLSG
GVGIGGVLEPLQANPTIIEDNCFIGARSEVVEGVIVEEGSVISMGVYIGQSTRIYDRETGDIHYGRVPAGSVVVSGNLPS
KDGKYSLYCAVIVKKVDAKTRGKVGINELLRTID
>P0A9D8 2.3.1.117~~~dapD~~~2,3,4,5-tetrahydropyridine-2,6-dicarboxylate N-succinyltransferase~~~COG2171
MQQLQNIIETAFERRAEITPANADTVTREAVNQVIALLDSGALRVAEKIDGQWVTHQWLKKAVLLSFRINDNQVIEGAES
RYFDKVPMKFADYDEARFQKEGFRVVPPAAVRQGAFIARNTVLMPSYVNIGAYVDEGTMVDTWATVGSCAQIGKNVHLSG
GVGIGGVLEPLQANPTIIEDNCFIGARSEVVEGVIVEEGSVISMGVYIGQSTRIYDRETGEIHYGRVPAGSVVVSGNLPS
KDGKYSLYCAVIVKKVDAKTRGKVGINELLRTID
>Q5ZX45 2.3.1.117~~~dapD~~~2,3,4,5-tetrahydropyridine-2,6-dicarboxylate N-succinyltransferase~~~COG2171
MNSLQDLIEQAFENRQNLSLDTASSDLINAINEVLSGLDNGQFRVAEKINGEWTVHQWLKKAVLLSFKLFPNQIIDAGFC
KFYDKIPLKYTDCSNEQFQQSGVRVVPHAMVRRGAYIAKNTVLMPSYVNIGAYIDEGVMVDTWATVGSCAQIGKNVHISG
GAGIGGVLEPLQANPTIIEDNCFIGARSEIVEGVIVEKNSVISMGVFLGQSTKIYNRITGEVSYGRIPAGSVVVAGNLPS
HDGSHSLYCAVIVKQVDEKTRAKVSINDLLRANQDD
>P9WP21 2.3.1.117~~~dapD~~~2,3,4,5-tetrahydropyridine-2,6-dicarboxylate N-succinyltransferase~~~COG2171
MSTVTGAAGIGLATLAADGSVLDTWFPAPELTESGTSATSRLAVSDVPVELAALIGRDDDRRTETIAVRTVIGSLDDVAA
DPYDAYLRLHLLSHRLVAPHGLNAGGLFGVLTNVVWTNHGPCAIDGFEAVRARLRRRGPVTVYGVDKFPRMVDYVVPTGV
RIADADRVRLGAHLAPGTTVMHEGFVNYNAGTLGASMVEGRISAGVVVGDGSDVGGGASIMGTLSGGGTHVISIGKRCLL
GANSGLGISLGDDCVVEAGLYVTAGTRVTMPDSNSVKARELSGSSNLLFRRNSVSGAVEVLARDGQGIALNEDLHAN
>G3XD76 2.3.1.117~~~dapD~~~2,3,4,5-tetrahydropyridine-2,6-dicarboxylate N-succinyltransferase~~~
MSQSLFSLAFGVGTQNRQEAWLEVFYALPLLKPSSEIVAAVAPILGYAAGNQALTFTSQQAYQLADALKGIDAAQSALLS
RLAESQKPLVATLLAEDAAPSSTAEAYLKLHLLSHRLVKPHAVNLSGIFPLLPNVAWTNIGAVDLAELAELQLEARLKGK
LLEVFSVDKFPKMTDYVVPAGVRIADTARVRLGAYIGEGTTVMHEGFVNFNAGTEGPGMIEGRVSAGVFVGKGSDLGGGC
STMGTLSGGGNIVISVGEGCLIGANAGIGIPLGDRNIVEAGLYITAGTKVALLDEQNALVKVVKARDLAGQPDLLFRRNS
QNGAVECKTNKTAIELNEALHAHN
>P56220 2.3.1.117~~~dapD~~~2,3,4,5-tetrahydropyridine-2,6-dicarboxylate N-succinyltransferase~~~
MQQLQNVIESAFERRADITPANVDTVTREAVNQVIGLLDSGALRVAEKIDGQWVTHQWLKKAVLLSFRINDNKVMDGAET
RYYDKVPMKFADYDEARFQKEGFRVVPPATVRQGAFIARNTVLMPSYVNIGAYVDEGTMVDTWATVGSCAQIGKNVHLSG
GVGIGGVLEPLQANPTIIEDNCFIGARSEVVEGVIVEEGSVISMGVYLGQSTRIYDRETGEIHYGRVPAGSVVVSGNLPS
KDGSYSLYCAVIVKKVDAKTRGKVGINELLRTID
>Q8ZH69 2.3.1.117~~~dapD~~~2,3,4,5-tetrahydropyridine-2,6-dicarboxylate N-succinyltransferase~~~COG2171
MQQLQNVIETAFERRADITPANVDTVTREAITHVIDLLDTGALRVAEKIDGQWVTHQWLKKAVLLSFRINDNQVMEGAET
RYYDKVPMKFAGYDEARFQREGFRVVPPATVRKGAFIARNTVLMPSYVNIGAFVDEGTMVDTWATVGSCAQIGKNVHLSG
GVGIGGVLEPLQANPTIIEDNCFVGARSEVVEGVIVEEGSVISMGVFIGQSTRIYDRETGEVHYGRVPAGSVVVSGNLPS
KDGSYSLYCAVIVKKVDAKTRSKVGINELLRTID
>O34916 3.5.1.47~~~ykuR~~~N-acetyldiaminopimelate deacetylase~~~COG1473
MKIEELIAIRRDLHRIPELGFQEFKTQQYLLNVLEQYPQDRIEIEKWRTGLFVKVNGTAPEKMLAYRADIDALSIEEQTG
LPFASEHHGNMHACGHDLHMTIALGIIDHFVHHPVKHDLLFLFQPAEEGPGGAEPMLESDVLKKWQPDFITALHIAPELP
VGTIATKSGLLFANTSELVIDLEGKGGHAAYPHLAEDMVVAASTLVTQLQTIISRNTDPLDSAVITVGTITGGSAQNIIA
ETAHLEGTIRTLSEESMKQVKERIEDVVKGIEIGFRCKGKVTYPSVYHQVYNTSGLTEEFMSFVAEHQLATVIEAKEAMT
GEDFGYMLKKYPGFMFWLGADSEHGLHHAKLNPDENAIETAVHVMTGYFSVYAN
>D5E0A1 3.5.1.47~~~~~~N-acetyldiaminopimelate deacetylase~~~COG1473
MAENEFVKIRRELHKIPELGFQEVKTQRFLLDYINTLPQERLEVKTWKTGLFVKVHGTNPTKTIGYRADIDGLPITEETN
YSFQSQHEGLMHACGHDMHMAIGLGVLTYFAQHEIKDNVLFIFQPAEEGPGGAQPMLQSDIMKEWLPDFIFALHVAPEYP
VGSIALKEGLLFANTSELFIDLKGKGGHAAYPHTTNDMVVAACQLVSQLQTIVARNVDPLDSAVITVGKIQGGTVQNIIA
ERARIEGTIRTLSPESMTRVKERIEAIVKGVEVGYQCETAIDYGCMYHQVYNHHEVTREFMEFAKEQTDVDVIECKEAMT
GEDFGYMLKDIPGFMFWLGVQSEYGLHHAKLQPHEGAIDIAISLITKYFEHKGNQ
>A3M8H2 3.5.1.18~~~dapE~~~Succinyl-diaminopimelate desuccinylase~~~
MNHSDTLSLSLELLQQPSVTPIDHTCQTIMADRLAKVGFHIEPMRFGDVDNLWARRGTEGPVFCFAGHTDVVPTGRLDAW
NSDPFAPEIRDGKLYGRGSADMKTALAAMVVASERFVAKHPNHKGSIAFLITSDEEGPAVNGTVKVIETLEKRNEKITWC
LVGEPSSTHKLGDIVKNGRRGSLNAVLKVQGKQGHVAYPHLARNPIHEASPALAELCQTVWDNGNEYFPATSFQISNIHA
GTGATNVIPGALEVTFNFRYSTEVTAEQLKQRVHEILDKHGLQYEIVWNLSGLPFLTPVGELVNAAQTAILNVTGTETEL
STSGGTSDGRFIAPTGAQVLELGVLNATIHQINEHVDVHDLDPLTDIYEQILENLLAQ
>Q59284 3.5.1.18~~~dapE~~~Succinyl-diaminopimelate desuccinylase~~~COG0624
MNSELKPGLDLLGDPIVLTQRLVDIPSPSGQEKQIADEIEDALRNLNLPGVEVFRFNNNVLARTNRGLASRVMLAGHIDT
VPIADNLPSRVEDGIMYGCGTVDMKSGLAVYLHTFATLATSTELKHDLTLIAYECEEVADHLNGLGHIRDEHPEWLAADL
ALLGEPTGGWIEAGCQGNLRIKVTAHGVRAHSARSWLGDNAMHKLSPIISKVAAYKAAEVNIDGLTYREGLNIVFCESGV
ANNVIPDLAWMNLNFRFAPNRDLNEAIEHVVETLELDGQDGIEWAVEDGAGGALPGLGQQVTSGLIDAVGREKIRAKFGW
TDVSRFSAMGIPALNFGAGDPSFAHKRDEQCPVEQITDVAAILKQYLSE
>P0AED7 3.5.1.18~~~dapE~~~Succinyl-diaminopimelate desuccinylase~~~COG0624
MSCPVIELTQQLIRRPSLSPDDAGCQALLIERLQAIGFTVERMDFADTQNFWAWRGQGETLAFAGHTDVVPPGDADRWIN
PPFEPTIRDGMLFGRGAADMKGSLAAMVVAAERFVAQHPNHTGRLAFLITSDEEASAHNGTVKVVEALMARNERLDYCLV
GEPSSIEVVGDVVKNGRRGSLTCNLTIHGVQGHVAYPHLADNPVHRAAPFLNELVAIEWDQGNEFFPATSMQIANIQAGT
GSNNVIPGELFVQFNFRFSTELTDEMIKAQVLALLEKHQLRYTVDWWLSGQPFLTARGKLVDAVVNAVEHYNEIKPQLLT
TGGTSDGRFIARMGAQVVELGPVNATIHKINECVNAADLQLLARMYQRIMEQLVA
>P44514 3.5.1.18~~~dapE~~~Succinyl-diaminopimelate desuccinylase~~~COG0624
MKEKVVSLAQDLIRRPSISPNDEGCQQIIAERLEKLGFQIEWMPFNDTLNLWAKHGTSEPVIAFAGHTDVVPTGDENQWS
SPPFSAEIIDGMLYGRGAADMKGSLAAMIVAAEEYVKANPNHKGTIALLITSDEEATAKDGTIHVVETLMARDEKITYCM
VGEPSSAKNLGDVVKNGRRGSITGNLYIQGIQGHVAYPHLAENPIHKAALFLQELTTYQWDKGNEFFPPTSLQIANIHAG
TGSNNVIPAELYIQFNLRYCTEVTDEIIKQKVAEMLEKHNLKYRIEWNLSGKPFLTKPGKLLDSITSAIEETIGITPKAE
TGGGTSDGRFIALMGAEVVEFGPLNSTIHKVNECVSVEDLGKCGEIYHKMLVNLLDS
>P9WHS9 3.5.1.18~~~dapE~~~Putative succinyl-diaminopimelate desuccinylase DapE~~~COG0624
MLDLRGDPIELTAALIDIPSESRKEARIADEVEAALRAQASGFEIIRNGNAVLARTKLNRSSRVLLAGHLDTVPVAGNLP
SRRENDQLHGCGAADMKSGDAVFLHLAATLAEPTHDLTLVFYDCEEIDSAANGLGRIQRELPDWLSADVAILGEPTAGCI
EAGCQGTLRVVLSVTGTRAHSARSWLGDNAIHKLGAVLDRLAVYRARSVDIDGCTYREGLSAVRVAGGVAGNVIPDAASV
TINYRFAPDRSVAAALQHVHDVFDGLDVQIEQTDAAAGALPGLSEPAAKALVEAAGGQVRAKYGWTDVSRFAALGIPAVN
YGPGDPNLAHCRDERVPVGNITAAVDLLRRYLGG
>Q9JYL2 3.5.1.18~~~dapE~~~Succinyl-diaminopimelate desuccinylase~~~
MTETQSLELAKELISRPSVTPDDRDCQKLLAERLHKIGFAAEELHFGDTKNIWLRRGTKAPVVCFAGHTDVVPTGPVEKW
DSPPFEPAERDGRLYGRGAADMKTSIACFVTACERFVAKHPNHQGSIALLITSDEEGDALDGTTKVVDVLKARDELIDYC
IVGEPTAVDKLGDMIKNGRRGSLSGNLTVKGKQGHIAYPHLAINPVHTFAPALLELTQEVWDEGNEYFPPTSFQISNING
GTGATNVIPGELNVKFNFRFSTESTEAGLKQRVHAILDKHGVQYDLQWSCSGQPFLTQAGKLTDVARAAIAETCGIEAEL
STTGGTSDGRFIKAIAQELIELGPSNATIHQINENVRLNDIPKLSAVYEGILARLLAGNAV
>Q8ZN75 3.5.1.18~~~dapE~~~Succinyl-diaminopimelate desuccinylase~~~
MSCPVIELTQQLIRRPSLSPDDAGCQALMIERLRKIGFTIEHMDFGDTQNFWAWRGRGETLAFAGHTDVVPAGDVDRWIN
PPFEPTIRDGMLFGRGAADMKGSLAAMVVAAERFVAQHPHHRGRLAFLITSDEEASAKNGTVKVVEALMARNERLDYCLV
GEPSSTEIVGDVVKNGRRGSLTCNLTIHGVQGHVAYPHLADNPVHRAAPFLNELVAIEWDRGNDFFPATSMQVANIQAGT
GSNNVIPGELFVQFNFRFSTELTDEMIKERVHALLEKHQLRYTVDWWLSGQPFLTARGKLVDAVVNAIEHYNEIKPQLLT
TGGTSDGRFIARMGAQVVELGPVNATIHKINECVNAADLQLLARMYQRIMEQLVA
>P0AED8 3.5.1.18~~~dapE~~~Succinyl-diaminopimelate desuccinylase~~~
MSCPVIELTQQLIRRPSLSPDDAGCQALLIERLQAIGFTVERMDFADTQNFWAWRGQGETLAFAGHTDVVPPGDADRWIN
PPFEPTIRDGMLFGRGAADMKGSLAAMVVAAERFVAQHPNHTGRLAFLITSDEEASAHNGTVKVVEALMARNERLDYCLV
GEPSSIEVVGDVVKNGRRGSLTCNLTIHGVQGHVAYPHLADNPVHRAAPFLNELVAIEWDQGNEFFPATSMQIANIQAGT
GSNNVIPGELFVQFNFRFSTELTDEMIKAQVLALLEKHQLRYTVDWWLSGQPFLTARGKLVDAVVNAVEHYNEIKPQLLT
TGGTSDGRFIARMGAQVVELGPVNATIHKINECVNAADLQLLARMYQRIMEQLVA
>Q99SN6 3.5.1.18~~~dapE~~~Probable succinyl-diaminopimelate desuccinylase~~~
MTTFSEKEKIQLLADIVELQTENNNEIDVCNYLKDLFDKYDIKSEILKVNEHRANIVAEIGNGSPILALSGHMDVVDAGN
QDNWTYPPFQLTEKAGKLYGRGTTDMKGGLMALVITLIELKEQNQLPQGTIRLLATAGEEKEQEGAKLLADKGYLDDVDG
LIIAEPTGSGIYYAHKGSMSCKVTATGKAVHSSVPFIGDNAIDTLLEFYNQFKEKYSELKKHDTKHELDVAPMFKSLIGK
EISEEDANYASGLTAVCSIINGGKQFNSVPDEASLEFNVRPVPEYDNDFIESFFQNIINDVDSNKLSLDIPSNHRPVTSD
KNSKLITTIKDVASSYVEQDEIFVSALVGATDASSFLGDNKDNVDLAIFGPGNPLMAHQIDEYIEKDMYLKYIDIFKEAS
IQYLKEK
>Q9KQ52 3.5.1.18~~~dapE~~~Succinyl-diaminopimelate desuccinylase~~~COG0624
MTDSPVLALAKELISRQSVTPADAGCQDLMIERLKALGFEIESMVFEDTTNFWARRGTQSPLFVFAGHTDVVPAGPLSQW
HTPPFEPTVIDGFLHGRGAADMKGSLACMIVAVERFIAEHPDHQGSIGFLITSDEEGPFINGTVRVVETLMARNELIDMC
IVGEPSSTLAVGDVVKNGRRGSITGDLKVKGTQGHVAYPHLANNPVHKALPALAELAATQWDEGNAYFPPTSFQIPNLQA
GTGASNVIPGEFDVQFNFRFSTELTDEEIKRRVHSVLDAHGLDYDVKWTLSGQPFLTDTGELLAAVVAAVEEVNHQAPAL
LTTGGTSDGRFIAQMGAQVVELGPVNATIHKVNECVRIADLEKLTDMYQKTLNHLLG
>B7GY71 5.1.1.7~~~dapF~~~Diaminopimelate epimerase~~~
MLLEFTKMHGLGNDFMVVDLISQRAYLDTATIQRLADRHFGVGFDQLLIVEPPDVPEADFKYRIFNADGSEVEQCGNGVR
CFARFVHERHLTNKTNITVQTKAGIVKPELGQNGWVRVNMGYPKFLPNEIPFVAEEPEALYTLELANDQNISIDVVNMGN
PHAVTIVPDVLTADVAGIGPQVESHKRFPERVNAGFMQVIDDKHVRLRVFERGVGETLACGTGACAAAVSGMRRGLLANS
VEVELAGGKLQIEWQEGDVVWMTGPTTHVYDGRLDLRYFQG
>Q81XR2 5.1.1.7~~~dapF~~~Diaminopimelate epimerase~~~COG0253
MSQFSFTKMHGLGNSYIYVNMFEEQIPEEDLALVAEKVSNINTGIGADGMILICPSDVAPVKMRMFNNDGSEGKSCGNGL
RCVAKYAYEHKLVEDTVFTIETLAGIVTAEVTVEEGKVTLAKIDMGAPRLTRAEIPMLGEGETPFIRENFLYNNHRYAFT
AVSMGNPHAVIFVDDVEQAPLTTLGPVLETHEMFPERVNVEFIEILNEEEMNFRVWERGSGVTQACGTGACAAVVASILN
GKMERGKEITVHLAGGDLMIAWTEEGNVLMKGPAEVICRGVYEYKIEA
>Q8NP73 5.1.1.7~~~dapF~~~Diaminopimelate epimerase~~~COG0253
MNLTIPFAKGHATENDFIIIPDEDARLDLTPEMVVTLCDRRAGIGADGILRVVKAADVEGSTVDPSLWFMDYRNADGSLA
EMCGNGVRLFAHWLYSRGLVDNTSFDIGTRAGVRHVDILQADQHSAQVRVDMGIPDVTGLSTCDINGQVFAGLGVDMGNP
HLACVVPGLSASALADMELRAPTFDQEFFPHGVNVEIVTELEDDAVSMRVWERGVGETRSCGTGTVAAACAALADAGLGE
GTVKVCVPGGEVEVQIFDDGSTLTGPSAIIALGEVQI
>P0A6K1 5.1.1.7~~~dapF~~~Diaminopimelate epimerase~~~COG0253
MQFSKMHGLGNDFMVVDAVTQNVFFSPELIRRLADRHLGVGFDQLLVVEPPYDPELDFHYRIFNADGSEVAQCGNGARCF
ARFVRLKGLTNKRDIRVSTANGRMVLTVTDDDLVRVNMGEPNFEPSAVPFRANKAEKTYIMRAAEQTILCGVVSMGNPHC
VIQVDDVDTAAVETLGPVLESHERFPERANIGFMQVVKREHIRLRVYERGAGETQACGSGACAAVAVGIQQGLLAEEVRV
ELPGGRLDIAWKGPGHPLYMTGPAVHVYDGFIHL
>P44859 5.1.1.7~~~dapF~~~Diaminopimelate epimerase~~~COG0253
MQFSKMHGLGNDFVVVDGVTQNVFFTPETIRRLANRHCGIGFDQLLIVEAPYDPELDFHYRIFNADGSEVSQCGNGARCF
ARFVTLKGLTNKKDISVSTQKGNMVLTVKDDNQIRVNMGEPIWEPAKIPFTANKFEKNYILRTDIQTVLCGAVSMGNPHC
VVQVDDIQTANVEQLGPLLESHERFPERVNAGFMQIINKEHIKLRVYERGAGETQACGSGACAAVAVGIMQGLLNNNVQV
DLPGGSLMIEWNGVGHPLYMTGEATHIYDGFITL
>P9WP19 5.1.1.7~~~dapF~~~Diaminopimelate epimerase~~~COG0253
MIFAKGHGTQNDFVLLPDVDAELVLTAARVAALCDRRKGLGADGVLRVTTAGAAQAVGVLDSLPEGVRVTDWYMDYRNAD
GSAAQMCGNGVRVFAHYLRASGLEVRDEFVVGSLAGPRPVTCHHVEAAYADVSVDMGKANRLGAGEAVVGGRRFHGLAVD
VGNPHLACVDSQLTVDGLAALDVGAPVSFDGAQFPDGVNVEVLTAPVDGAVWMRVHERGVGETRSCGTGTVAAAVAALAA
VGSPTGTLTVHVPGGEVVVTVTDATSFLRGPSVLVARGDLADDWWNAMG
>Q9X1L0 5.1.1.7~~~dapF~~~Diaminopimelate epimerase~~~
MCYSANGNTFLIVDNTQKRIPEEKKPDFVRENVGDLDGVIFVELVDGKYFMDYYNRDGSMAAFCGNGARAFSQYLIDRGW
IKEKEFTFLSRAGEIKVIVDDSIWVRMPGVSEKKEMKVDGYEGYFVVVGVPHFVMEVKGIDELDVEKLGRDLRYKTGANV
DFYEVLPDRLKVRTYERGVERETKACGTGVTSVFVVYRDKTGAKEVKIQVPGGTLFLKEENGEIFLRGDVKRCSEE
>Q81MQ2 2.3.1.89~~~dapH~~~2,3,4,5-tetrahydropyridine-2,6-dicarboxylate N-acetyltransferase~~~COG2171
MKMMDANEIISFIQKSEKKTPVKVYIKGDLKEVTFPETVQAFVNKKSGVLFGEWSEIKTILDENSKYIVDYVVENDRRNS
AIPMLDLKGIKARIEPGAIIRDHVEIGDNAVIMMNATINIGAVIGEGSMIDMNAVLGGRATVGKNCHVGAGAVLAGVIEP
PSAKPVIVEDDVVIGANVVVLEGVTVGKGAVVAAGAVVTEDVPPYTVVAGTPARVIKEIDEKTKAKTEIKQELRQLNPEK
>O34981 2.3.1.89~~~dapH~~~2,3,4,5-tetrahydropyridine-2,6-dicarboxylate N-acetyltransferase~~~COG2171
MKMMDANEIISFIQNSTKSTPVKVYVKGELEGINFGESAKAFINGNTGVVFGEWSEIQTAIEENQSKIEDYVVENDRRNS
AIPMLDLKNIKARIEPGAIIRDQVEIGDNAVIMMGASINIGSVIGEGTMIDMNVVLGGRATVGKNCHIGAGSVLAGVIEP
PSAKPVVIEDDVVIGANAVVLEGVTVGKGAVVAAGAIVVNDVEPYTVVAGTPAKKIKDIDEKTKGKTEIKQELRQL
>Q836H8 2.3.1.89~~~dapH~~~2,3,4,5-tetrahydropyridine-2,6-dicarboxylate N-acetyltransferase~~~COG2171
MDAYEIIQYIGDAKKQTLVKVTLKGQLKEVTFPETIKVFNNCKTGTLFGDWADVKPFLEANKEKIEDYVVENDARNSAIP
FLDLKDINARIEPGALIREKVEIGDQAVIMMGAILNIGAVVGAGTMIDMGAVLGGRATVGKHCHIGAGTVLAGVIEPPSA
APVVIENEVVIGANAVVLEGVRVGEGAVVAAGAVVVEDVPAHTVVAGVPAKVIKQIDDKTKSKTEILEELRKL
>Q7A2S0 2.3.1.89~~~dapH~~~2,3,4,5-tetrahydropyridine-2,6-dicarboxylate N-acetyltransferase~~~
MVQHLTAEEIIQYISDAKKSTPIKVYLNGNFEGITYPESFKVFGSEQSKVIFCEADDWKPFYEAYGSQFEDIEIEMDRRN
SAIPLKDLTNTNARIEPGAFIREQAIIEDGAVVMMGATINIGAVVGEGTMIDMNATLGGRATTGKNVHVGAGAVLAGVIE
PPSASPVIIEDDVLIGANAVILEGVRVGKGAIVAAGAIVTQDVPAGAVVAGTPAKVIKQASEVQDTKKEIVAALRKLND
>P16524 2.6.1.-~~~dapX~~~Probable N-acetyl-LL-diaminopimelate aminotransferase~~~COG0436
MEHLLNPKAREIEISGIRKFSNLVAQHEDVISLTIGQPDFFTPHHVKAAAKKAIDENVTSYTPNAGYLELRQAVQLYMKK
KADFNYDAESEIIITTGASQAIDAAFRTILSPGDEVIMPGPIYPGYEPIINLCGAKPVIVDTTSHGFKLTARLIEDALTP
NTKCVVLPYPSNPTGVTLSEEELKSIAALLKGRNVFVLSDEIYSELTYDRPHYSIATYLRDQTIVINGLSKSHSMTGWRI
GFLFAPKDIAKHILKVHQYNVSCASSISQKAALEAVTNGFDDALIMREQYKKRLDYVYDRLVSMGLDVVKPSGAFYIFPS
IKSFGMTSFDFSMALLEDAGVALVPGSSFSTYGEGYVRLSFACSMDTLREGLDRLELFVLKKREAMQTINNGV
>Q9ZBA9 3.4.11.19~~~dap~~~D-aminopeptidase~~~
MSKFDTSALEAFVRHIPQNYKGPGGVVAVVKDGEVVLQHAWGFADLRTRTPMTLDTRMPICSVSKQFTCAVLLDAVGEPE
LLDDALEAYLDKFEDERPAVRDLCNNQSGLRDYWALSVLCGADPEGVFLPAQAQSLLRRLKTTHFEPGSHYSYCNGNFRI
LADLIEAHTGRTLVDILSERIFAPAGMKRAELISDTALFDECTGYEGDTVRGFLPATNRIQWMGDAGICASLNDMIAWEQ
FIDATRDDESGLYRRLSGPQTFKDGVAAPYGFGLNLHETGGKRLTGHGGALRGWRCQRWHCADERLSTIAMFNFEGGASE
VAFKLMNIALGVSSSEVSRVEADSAWFGSWLDDETGLVLSLEDAGHGRMKARFGTSPEMMDVVSANEARSAVTTIRRDGE
TIELVRASENLRLSMKRVKGEAKHDIIGRYHSDELDADLLLVSEGGAIYGAFEGFLGKSDMYPLYSVGSDVWLLPVQRSM
DAPSPGEWKLVFRRDDKGEITGLSVGCWLARGVEYRRVQP
>P37538 ~~~darA~~~Cyclic di-AMP receptor A~~~COG3870
MKLIVAVVQDQDSNRLLKTLTDHNFRVTKLATTGGFLKSGNTTFMIGVEDIRVNKALSLIKENGQKRDQMIAPVSPMGGN
ADSYVPYPVEVEVGGATVFVLPVDEFHQF
>O31698 ~~~darB~~~Cyclic di-AMP receptor B~~~COG0517
MISLQSDQLLEATVGQFMIEADKVAHVQVGNNLEHALLVLTKTGYTAIPVLDPSYRLHGLIGTNMIMNSIFGLERIEFEK
LDQITVEEVMLTDIPRLHINDPIMKGFGMVINNGFVCVENDEQVFEGIFTRRVVLKELNKHIRSLNK
>B7UP19 3.2.2.-~~~darG~~~DNA ADP-ribosyl glycohydrolase~~~
MITYTQGNLLDAPVEALVNTVNTVGVMGKGIALMFKERFPENMKVYALACKQKQVITGKMFITETGELMGPRWIVNFPTK
QHWRADSRMEWIEDGLQDLRRFLIEENVQSIAIPPLGAGNGGLNWPDVRAQIESALGDLQDVDILIYQPTEKYQNVAKST
GVKKLTPARAAIAELVRRYWVLGMECSLLEIQKLAWLLQRAIEQHQQDDILKLRFEAHYYGPYAPNLNHLLNALDGTYLK
AEKRIPDSQPLDVIWFNDQKKEHVNAYLNNEAREWLPALEQVSQLIDGFESPFGLELLATVDWLLSRGECQPTLDSVKEG
LHQWPAGERWASRKLRLFDNNNLQFAINRVMEFHC
>A0A0H3M776 3.2.2.-~~~darG~~~DNA ADP-ribosyl glycohydrolase~~~
MITYGSGDLLRADTEALVNTVNCVGVMGKGIALQFKRRYPEMFTAYEKACKRGEVTIGKMFVVDTGQLDGPKHIINFPTK
KHWRAPSKLAYIDAGLIDLIRVIRELNIASVAVPPLGVGNGGLDWEDVEQRLVSAFQQLPDVDAVIYPPSGGSRAIEGVE
GLRMTWGRAVILEAMRRYLQQRRAMEPWEDPAGISHLEIQKLMYFANEADPDLALDFTPGRYGPYSERVRHLLQGMEGAF
TVGLGDGTARVLANQPISLTTKGTDAITDYLATDAAADRVSAAVDTVLRVIEGFEGPYGVELLASTHWVATREGAKEPAT
AAAAVRKWTKRKGRIYSDDRIGVALDRILMTA
>O53605 3.2.2.-~~~darG~~~DNA ADP-ribosyl glycohydrolase~~~COG2110
MITYGSGDLLRADTEALVNTVNCVGVMGKGIALQFKRRYPEMFTAYEKACKRGEVTIGKMFVVDTGQLDGPKHIINFPTK
KHWRAPSKLAYIDAGLIDLIRVIRELNIASVAVPPLGVGNGGLDWEDVEQRLVSAFQQLPDVDAVIYPPSGGSRAIEGVE
GLRMTWGRAVILEAMRRYLQQRRAMEPWEDPAGISHLEIQKLMYFANEADPDLALDFTPGRYGPYSERVRHLLQGMEGAF
TVGLGDGTARVLANQPISLTTKGTDAITDYLATDAAADRVSAAVDTVLRVIEGFEGPYGVELLASTHWVATREGAKEPAT
AAAAVRKWTKRKGRIYSDDRIGVALDRILMTA
>P0DV57 3.2.2.-~~~darG~~~DNA ADP-ribosyl glycohydrolase~~~
MLRFVRGNLLEAPVEALVNTVNTVGVMGKGVALQFKRAFPDNYQAYVKACERGQVQIGRIFVYDRGPLAQPRYIFNFPTK
KHWRHPSRMEYVEEGLKDLVCRIQELRVRSIALPPLGAGNGGLPWPEVKQRIQEALEALEGVEVWVYEPVENPKAHSIVP
LKTKPRLTPARAALLKLFGLYGALGEPLGRLEAQKLAYFLQEAGLDLKLDFACKQFGPYAEPLNHVLARLEGHYIQGFGD
RTGISQIRLKPQALDEAVLFLADYPKADEAATRAADWVKGFETPYGLELLATVHWAVRHEGARDWASLQKRLQAWNPRKA
TFPKTHLQVALDALLKRGALRPEEWQDRPPKLPANVAQEA
>K2PFJ6 2.4.2.-~~~darTG~~~DNA ADP-ribosyl transferase-DNA ADP-ribosyl glycohydrolase fusion protein~~~
MFPRFRELYYITHIDNVPSILEKGILSHAEIERQSINCKKVYDNSIVLKRKSRLLADNRSLWEFANLYFQPRNPMLYRLL
VQGLKPKDLAIVAVKWTIMKRDDILITDGNAASSETQIYRKSEIKNIKNIISVKDMEYWREEDGSKRKIMAECLVPQCVD
PRYISAIYVSDHEVASNLKKAINNRNIPVIPDPTFFFLPNREIKLTQNLSLVEGDMFFSRMQTLTVSVNTVGVMGKGLAS
RVKYQFPDVYVVFQDACKKKELEFGKPYLYKRESSLDAFLAEDGEKLSDLNHQTWFLLFPTKRHWKNMSEIKGIESGLRW
IVENYKKEGIKSLAVPALGCGLGGLEWSIVGPLMCRYLTKLEIPVQIYLPLEKRIPDVQLSPKFLLDS
>B7UP20 2.4.2.-~~~darT~~~DNA ADP-ribosyl transferase~~~
MAYDYSASLNPQKALIWRIVHRDNIPWILDNGLHCGNSLVQAENWINIGNPELIGKRAGHPVPVGTGGTLHDYVPFYFTP
FSPMLMNIHSGRGGIKRRPNEEIVILVSNLRNVAAHDVPFVFTDSHAYYNWTNYYTSLNSLDQIDWPILQARDFRRDPDD
PAKFERYQAEALIWQHCPISLLDGIICYSEEVRLQLEQWLFQRNLTMSVHTRSGWYFS
>A0A0H3M0L1 2.4.2.-~~~darT~~~DNA ADP-ribosyl transferase~~~
MITRYKPESGFVARSGGPDRKRPHDWIVWHFTHADNLPGIITAGRLLADSAVTPTTEVAYNPVKELRRHKVVAPDSRYPA
SMASDHVPFYIAARSPMLYVVCKGHSGYSGGAGPLVHLGVALGDIIDADLTWCASDGNAAASYTKFSRQVDTLGTFVDFD
LLCQRQWHNTDDDPNRQSRRAAEILVYGHVPFELVSYVCCYNTETMTRVRTLLDPVGGVRKYVIKPGMYY
>O53604 2.4.2.-~~~darT~~~DNA ADP-ribosyl transferase~~~COG4948
MITRYKPESGFVARSGGPDRKRPHDWIVWHFTHADNLPGIITAGRLLADSAVTPTTEVAYNPVKELRRHKVVAPDSRYPA
SMASDHVPFYIAARSPMLYVVCKGHSGYSGGAGPLVHLGVALGDIIDADLTWCASDGNAAASYTKFSRQVDTLGTFVDFD
LLCQRQWHNTDDDPNRQSRRAAEILVYGHVPFELVSYVCCYNTETMTRVRTLLDPVGGVRKYVIKPGMYY
>P0DV56 2.4.2.-~~~darT~~~DNA ADP-ribosyl transferase~~~
MPQQGLAYPVPTLIYHITHLNNLQGILQRGGLLPYSQRPPTQQNVAYGHIQAHRAQVVVPVGPRGKLHDYVPFYFCPRSP
MLYAIHTQQTDYQGDQRPILHLVSSAQKVAEARIPFVFTDRHAAVQYVCFFHKLEHLKALDWQAIQASYWANVREKKQAE
FLVKDFFPWELVEEIGVIDKTIQAQVESILAQFPDLHHPPVRVRRSWYYKKRLCSASCEATF
>A0A0B0SG80 2.4.2.-~~~darT~~~DNA ADP-ribosyl transferase~~~
MKRTYPEPTPIYHITHIDNLKGILRMGKLLAHNQSPPKQRSIAYAHIQERRNRAKVPQPPGGVLHDYVPFYFCPRSPMLY
AIYSGATEYQGGQEPILHLVSSAQAVHKAGLPFVFTDRHGVLSHARFFRQLEELAQLDWEAIQASYWADPPELREKKQAE
FLVYKAFPWALIEEIAVYSQRVGEEVLKILKQFPEARRPRVCIRKDWYY
>Q9K491 ~~~dasA~~~Diacetylchitobiose binding protein DasA~~~COG2182
MKRKLIAAIGIAGMMVSIAACGGDSDDDGKKAGADGYAGETLTVWVMDGSSPDDWQADLAKDFEAKTKAKVKFEIQKWNG
IQQKLTTALSEENPPDVFEIGNTQTPAYAKTGGLADLSDLKGEIGTDWSESLNKSAVFDGKQYAAPWFVVNRVVVYNKKI
WADAGIKELPKTRDEFYNDLKTIGEKTDAEPIYLPGQNWYHFVGLVIGEGGELVKKDGDKYVSNLADPKVAAATETYKKF
QALSKAPKDKDEATPQQGEIFAKGKTGSFIGMGWEGATAIATNPAIEKDLGYFTIPGPTADKPEGVFLGGSNLAVAAGSK
KQDLAKEFLKLALSDKYEGGLAKANGVIPNKEALQSNLKGNAAAEAAAPAAGTGDTTPLIPEWAAVENDPNPIKTYLTAV
MKGKSPADAAKQVEGEFNKRLAQQQ
>Q9K490 ~~~dasB~~~Diacetylchitobiose uptake system permease protein DasB~~~COG1175
MTVQTERPPSGPSDVRKADGGGTGGTRARAASRAGALAPYLLLLPAAAATVLLLGWPLVKDGLLSFQNLNMAQLIQHVTE
WTGFDNYKEVLTGEDFWRVTVRSIIFTAVNVVLTMVVGGLIGLLLARLGRVMRFVLMIGLVLAWAMPVVAATTVYQWLFA
QRFGVVNWVLDKLGWHSMADFSWTGSQFSTFFVVTVLIVWMSVPFVAINLYAATTTIPDELYEAAALDGAGMWRSFTSVT
LPFLRPFLYATTFLEVIWIFKAFVQVYTFNGGGPDRLTEILPVYAYIEGVGNQHYGMGAAIAVLTILILLGLTAYYLRIV
LKQEEDEL
>Q9K489 ~~~dasC~~~Diacetylchitobiose uptake system permease protein DasC~~~COG0395
MKRSLFGRVWPNVTAVVLFIGLVFPVYWMFATAFKPTGDIISENPVWFPTDITFEHFKTATEADHFWTYVSNSLIVTVCA
VVFSLVIALAGSFALARMRFKGRRGFIVGFMLAQMAPWEVMVIAIYMIVRDASMLNSLVPLTLFYMMMILPFTILTLRGF
VAAVPKELEESAMVDGCTRAQAFRRVILPLLAPGLMSTSMFGFITAWNELPLVLVVNKEAESQTLPLWLTSFQTVFGDNW
GATMAASSLFAIPILILFVYLQRKAVSGLTAGAVKG
>Q9K492 ~~~dasR~~~HTH-type transcriptional repressor DasR~~~COG2188
MSTDVSSAENEGGATVRTARVPKYYRLKKHLLDMTRTQTPGTPVPPERTLAAEFDTSRTTVRQALQELVVEGRLERIQGK
GTFVAKPKVSQALQLTSYTEDMRAQGLEPTSQLLDIGYITADDRLAGLLDITAGGRVLRIERLRMANGEPMAIETTHLSA
KRFPALRRSLVKYTSLYTALAEVYDVHLAEAEETIETSLATPREAGLLGTDVGLPMLMLSRHSQDRTGQPVEWVRSVYRG
DRYKFVARLKRPQD
>Q8VV01 ~~~dasR~~~HTH-type transcriptional repressor DasR~~~
MGAEGAVRGARPVPVRAQRVPKYYRLKRHLLDMTDTLPPGTPVPPERTLAAEFDTSRTTVPQALQELVVEGRLERIQGKG
TFVAKPKVSQALQLTSYTEDMRAQGLEPTSQLLDIGYVTADDTLAGLLDISTGGRVLRIERLRLASGEPMAIETTHLSAK
RFPALRRSLVKYTSLYTALAEVYDVRLAEAEETIETSLATPREAGLLGTDVGLPMLMLSRHSVDGQGEPVEWVRSVYRGD
RYKFVARLKRGTD
>P9WMA8 ~~~~~~Dormancy associated translation inhibitor~~~
MEPKRSRLVVCAPEPSHAREFPDVAVFSGGRANASQAERLARAVGRVLADRGVTGGARVRLTMANCADGPTLVQINLQVG
DTPLRAQAATAGIDDLRPALIRLDRQIVRASAQWCPRPWPDRPRRRLTTPAEALVTRRKPVVLRRATPLQAIAAMDAMDY
DVHLFTDAETGEDAVVYRAGPSGLRLARQHHVFPPGWSRCRAPAGPPVPLIVNSRPTPVLTEAAAVDRAREHGLPFLFFT
DQATGRGQLLYSRYDGNLGLITPTGDGVADGLA
>P9WMA9 ~~~~~~Dormancy associated translation inhibitor~~~COG1544
MEPKRSRLVVCAPEPSHAREFPDVAVFSGGRANASQAERLARAVGRVLADRGVTGGARVRLTMANCADGPTLVQINLQVG
DTPLRAQAATAGIDDLRPALIRLDRQIVRASAQWCPRPWPDRPRRRLTTPAEALVTRRKPVVLRRATPLQAIAAMDAMDY
DVHLFTDAETGEDAVVYRAGPSGLRLARQHHVFPPGWSRCRAPAGPPVPLIVNSRPTPVLTEAAAVDRAREHGLPFLFFT
DQATGRGQLLYSRYDGNLGLITPTGDGVADGLA
>P56744 2.6.1.76~~~dat~~~Diaminobutyrate--2-oxoglutarate aminotransferase~~~COG0160
MSVTSVNPATNATNEYYLTRQSQMESNVRSYPRKLPLAIAKAQGCWVTDVEGTQYLDCLAGAGTLALGHNHPAVIQSIQD
TLASGLPLHTLDLTTPLKDAFTEALLAYLPGGKEEYCLQFCGPSGADATEAAIKLAKTYTGRSSVISFSGGYHGMTHGSL
AMTGNLSAKNAVNGLMPGVQFMPYPHEYRCPLGLGGEAGVDALTYYFENFIEDVESGVTKPAAVILEAIQGEGGVVTAPV
KWLQKIREVTEKHNIVLILDEVQAGFARSGKMFAFEHAGIEPDVVVMSKAVGGGLPLAVLGIKRKFDAWQPAGHTGTFRG
NQLAMGTGLVVLETIKEQNLAQNAQERGEFPCIGNVRGRGLMIGVEIVDERKPADRIGSHPADSQLAAAIQTACFNNNLL
LEKGGRNGTVIRLLCPLIITQEECVEVIARFKKAVAEALVAVRGA
>P44951 2.6.1.76~~~dat~~~Diaminobutyrate--2-oxoglutarate aminotransferase~~~COG0160
MTMITPVQAILASNQHFLDRQDVMESNVRSYPRKLPFAYAKAQGCWVTDVEGNEYLDFLAGAGTLALGHNHPILMQAIKD
VLDSGLPLHTLDLTTPLKDAFSEELLSFFPKDKYILQFTGPSGADANEAAIKLAKTYTGRGNIIAFSGGFHGMTQGALAL
TGNLGAKNAVENLMPGVQFMPYPHEYRCPFGIGGEAGAKAVEQYFENFIEDVESGVVKPAAVILEAIQGEGGVVSAPISF
LQKVREVTQKHGILMIVDEVQAGFCRSGRMFAFEHAGIEPDIIVMSKAVGGSLPLAVLAIRKEFDAWQPAGHTGTFRGNQ
LAMATGYASLKIMRDENLAQNAQERGEYLTNALRELSKEYPCIGNVRGRGLMMGIDIVDERQSKDATGAYPRDCELAAAI
QKACFKNKLLLERGGRGGNVVRVLCAVNINQSECEEFIKRFKQSVVDALKVVRS
>P0AFR2 ~~~dauA~~~C4-dicarboxylic acid transporter DauA~~~COG0659
MNKIFSSHVMPFRALIDACWKEKYTAARFTRDLIAGITVGIIAIPLAMALAIGSGVAPQYGLYTAAVAGIVIALTGGSRF
SVSGPTAAFVVILYPVSQQFGLAGLLVATLLSGIFLILMGLARFGRLIEYIPVSVTLGFTSGIGITIGTMQIKDFLGLQM
AHVPEHYLQKVGALFMALPTINVGDAAIGIVTLGILVFWPRLGIRLPGHLPALLAGCAVMGIVNLLGGHVATIGSQFHYV
LADGSQGNGIPQLLPQLVLPWDLPNSEFTLTWDSIRTLLPAAFSMAMLGAIESLLCAVVLDGMTGTKHKANSELVGQGLG
NIIAPFFGGITATAAIARSAANVRAGATSPISAVIHSILVILALLVLAPLLSWLPLSAMAALLLMVAWNMSEAHKVVDLL
RHAPKDDIIVMLLCMSLTVLFDMVIAISVGIVLASLLFMRRIARMTRLAPVVVDVPDDVLVLRVIGPLFFAAAEGLFTDL
ESRLEGKRIVILKWDAVPVLDAGGLDAFQRFVKRLPEGCELRVCNVEFQPLRTMARAGIQPIPGRLAFFPNRRAAMADL
>Q9HXE3 1.4.99.6~~~dauA~~~FAD-dependent catabolic D-arginine dehydrogenase DauA~~~
MIEADYLVIGAGIAGASTGYWLSAHGRVVVLEREAQPGYHSTGRSAAHYTVAYGTPQVRALTAASRAFFDNPPAGFCEHP
LLSPRPEMVVDFSDDPEELRRQYESGKALVPQMRLLDAEQACSIVPVLRRDKVFGATYDPTGADIDTDALHQGYLRGIRR
NQGQVLCNHEALEIRRVDGAWEVRCDAGSYRAAVLVNAAGAWCDAIAGLAGVRPLGLQPKRRSAFIFAPPPGIDCHDWPM
LVSLDESFYLKPDAGMLLGSPANADPVEAHDVQPEQLDIATGMYLIEEATTLTIRRPEHTWAGLRSFVADGDLVAGYAAN
AEGFFWVAAQGGYGIQTSAAMGEASAALIRHQPLPAHLREHGLDEAMLSPRRLSP
>Q9HXE4 1.4.1.25~~~dauB~~~NAD(P)H-dependent anabolic L-arginine dehydrogenase DauB~~~
MSAATPLIVQQAEAEQLLARIDVLQAMRQLFLDLAAGQALQPAQQLVEFPAGRGDFINYLGVLAQEQVYGVKTSPYIVRE
QGPLVTAWTLLMSMQTGQPLLLCDAARLTTARTAATTAVAVDALAPAEACRLALIGSGPVAHAHLQYVKGLRDWQGVRVH
SPCLDERRLQSLRAIDPRAEAAGSLEEALDEADVILLCTSSARAVIDPRQLKRPALVTSISTNAPRAHEVPAESLAAMDV
YCDYRHTTPGSAGEMLIAAEQHGWSPEAIRGDLAELLSAQAPRPEYRRPAFFRSIGLGLEDVALANALYRLRQAG
>Q9HXE2 ~~~dauR~~~Transcriptional regulator DauR~~~
MSPSQDPSLENYRAIADGIATLFFPHAEVVLHDLRSQRVDYIANNLSKREVGDDSALEDMLEGDSDERNIGPYEKLNWDG
QKIRSVSTVLRDSAGQPLAVLCINLNISLFESAKAALDLFLSPSKLIPQPDALFRDDWQERINTFLHGWLRQRQLGLNLL
TREHKRELVLALHAEGAFKGKSAANYVANVLNMGRATVYKHLKELKEGGD
>Q9I6M5 1.2.1.-~~~davD~~~Glutarate-semialdehyde dehydrogenase~~~
MQLKDAKLFRQQAYVDGAWVDADNGQTIKVNNPATGEIIGSVPKMGAAETRRAIEAADKALPAWRALTAKERANKLRRWF
DLMIENQDDLARLMTIEQGKPLAEAKGEIAYAASFLEWFGEEAKRIYGDTIPGHQPDKRIIVIKQPIGVTAAITPWNFPS
AMITRKAGPALAAGCTMVLKPASQTPYSALALAELAERAGIPKGVFSVVTGSAGEVGGELTSNPIVRKLTFTGSTEIGRQ
LMAECAQDIKKVSLELGGNAPFIVFDDADLDAAVEGALISKYRNNGQTCVCANRLYVQDGVYDAFVDKLKAAVAKLNIGN
GLEAGVTTGPLIDAKAVAKVEEHIADAVSKGAKVVSGGKPHALGGTFFEPTILVDVPKNALVSKDETFGPLAPVFRFKDE
AEVIAMSNDTEFGLASYFYARDLARVFRVAEQLEYGMVGINTGLISNEVAPFGGIKASGLGREGSKYGIEDYLEIKYLCL
GGI
>Q9I6M4 2.6.1.48~~~davT~~~5-aminovalerate aminotransferase DavT~~~
MSKTNESLLKRRQAAVPRGVGQIHPVVAERAENSTVWDVEGREYIDFAGGIAVLNTGHLHPKVIAAVQEQLGKLSHTCFQ
VLAYEPYIELAEEIAKRVPGDFPKKTLLVTSGSEAVENAVKIARAATGRAGVIAFTGAYHGRTMMTLGLTGKVVPYSAGM
GLMPGGIFRALAPCELHGVSEDDSIASIERIFKNDAQPQDIAAIIIEPVQGEGGFYVNSKSFMQRLRALCDQHGILLIAD
EVQTGAGRTGTFFATEQLGIVPDLTTFAKSVGGGFPISGVAGKAEIMDAIAPGGLGGTYAGSPIACAAALAVLKVFEEEK
LLERSQAVGERLKAGLREIQAKHKVIGDVRGLGSMVAIELFEGGDTHKPAAELVSKIVVRAREKGLILLSCGTYYNVIRF
LMPVTIPDAQLEKGLAILAECFDELA
>Q88RB9 2.6.1.48~~~davT~~~5-aminovalerate aminotransferase DavT~~~COG0160
MSKTNESLMQRRVAAVPRGVGQIHPIFVDTAKNSTVIDVEGRELIDFAGGIAVLNTGHLHPKVVAAVQEQLTKVSHTCFQ
VLAYEPYVELCEKINKLVPGDFDKKTLLVTTGSEAVENAVKIARAATGRAGVIAFTGGYHGRTMMTLGLTGKVVPYSAGM
GLMPGGIFRALFPSELHGISVDDAIASVERIFKNDAEPRDIAAIILEPVQGEGGFLPAPKELMKRLRALCDQHGILLIAD
EVQTGAGRTGTFFAMEQMGVAPDLTTFAKSIAGGFPLAGVCGKAEYMDAIAPGGLGGTYAGSPIACAAALAVIEVFEEEK
LLDRSKAVGERLTAGLREIQKKYPIIGDVRGLGSMIAVEVFEKGTHTPNAAAVGQVVAKAREKGLILLSCGTYGNVLRIL
VPLTAEDALLDKGLAIIEECFAEIA
>P47243 1.13.11.-~~~dbfB~~~2,2',3-trihydroxybiphenyl dioxygenase~~~
MSVKQLGYLIFECRADVLEQMVVVYQDIIGAVVERDEGGRALVRLDGRPFRIRLDPGPANRLAAIGWNVDPSDLAAIAEQ
VEKACYSVVTADAELAADRAAAQVRQFADNDGFTHELYVESSFPTDPVLESLFVCGEEANGIFGLGHLVVIVADRAKTQS
FFTDVLGFGLSDRVTWPEADIFFLHCNQRHHTVALSAPALGLKPGMVHHLMLEAKSKEQVDRAFAAVKRLGYDVLMTIGQ
HSNDKVYSFYMMAPAGFAVELGFGGQVIGDLESWHVGFYDAPSIWGHELQLPAH
>P08821 ~~~hbs~~~DNA-binding protein HU 1~~~COG0776
MNKTELINAVAEASELSKKDATKAVDSVFDTILDALKNGDKIQLIGFGNFEVRERSARKGRNPQTGEEIEIPASKVPAFK
PGKALKDAVAGK
>P17615 ~~~hup~~~DNA-binding protein HB1~~~
MAYNKSDLVSKIAQKSNLTKAQAEAAVNAFQDVFVEAMKSGEGLKLTGLFSAERVKRAARTGRNPRTGEQIDIPASYGVR
ISAGSLLKKAVTE
>P0A3H3 ~~~~~~DNA-binding protein HRL18~~~
MNKNELVSAVAEKAGLTKADAASAVDAVFETVQSELKNGGDIRLAGFGSFSVSRREASKGRNPSTGAEVDIPARNVPKFS
AGKGLKDAVNS
>P0A3H4 ~~~~~~DNA-binding protein HRL18~~~COG0776
MNKNELVSAVAEKAGLTKADAASAVDAVFETVQSELKNGGDIRLAGFGSFSVSRREASKGRNPSTGAEVDIPARNVPKFS
AGKGLKDAVNS
>P0A3H5 ~~~hup1~~~DNA-binding protein HU 1~~~COG0776
MNRSELVAALADRAEVTRKDADAVLAAFAEVVGDIVSKGDEKVTIPGFLTFERTHRAARTARNPQTGEPIQIPAGYSVKV
SAGSKLKEAAKGK
>P0A3H7 ~~~hup2~~~DNA-binding protein HU 2~~~COG0776
MNKAQLVEAIADKLGGRQQAADAVDAVLDALVRAVVAGDRVSVTGFGSFEKVDRPARYARNPQTGERVRVKKTSVPRFRA
GQGFKDLVSGSKKLPKNDIAVKKAPKGSLSGPPPTISKAAGKKAAAKKATGAAKKTTGAAKKTSAAAKKTTAKKTTGAAK
TTAKKTTAKKSAAKTTTAAAKKTAAKKAPAKKATAKKAPAKKSTARKTTAKKATARKK
>P02348 ~~~~~~DNA-binding protein HRL53~~~COG0776
MNKNELVSAVAEKAGLTKSDAASAVDAVFDVVQAELKNKGDIRLAGFGSFTVSHRAATKGRNPSTGAEVDIPARNVPKFT
PGKGLKDAVNG
>P0ACF0 ~~~hupA~~~DNA-binding protein HU-alpha~~~COG0776
MNKTQLIDVIAEKAELSKTQAKAALESTLAAITESLKEGDAVQLVGFGTFKVNHRAERTGRNPQTGKEIKIAAANVPAFV
SGKALKDAVK
>E0J6W8 ~~~hupA~~~DNA-binding protein HU-alpha~~~
MNKTQLIDVIAEKAELSKTQAKAALESTLAAITESLKEGDAVQLVGFGTFKVNHRAERTGRNPQTGKEIKIAAANVPAFV
SGKALKDAVK
>P0ACF4 ~~~hupB~~~DNA-binding protein HU-beta~~~COG0776
MNKSQLIDKIAAGADISKAAAGRALDAIIASVTESLKEGDDVALVGFGTFAVKERAARTGRNPQTGKEITIAAAKVPSFR
AGKALKDAVN
>P05384 ~~~hupB~~~DNA-binding protein HU-beta~~~
MNKSELIDAIAASADIPKAVAGRALDAVIESVTGALKAGDSVVLVGFGTFAVKERAARTGRNPQTGKPIKIAAAKIPGFK
AGKALKDAVN
>P0A3H1 ~~~hup~~~DNA-binding protein HU~~~
MNKTELINAVAETSGLSKKDATKAVDAVFDSITEALRKGDKVQLIGFGNFEVRERAARKGRNPQTGEEMEIPASKVPAFK
PGKALKDAVK
>Q57267 ~~~hup~~~DNA-binding protein HU~~~
MSFSRRPKVTKSDIVDQISLNIKNNNLKLEKKYIRLVIDAFFEELKSNLCSNNVIEFRSFGTFEVRKRKGRLNARNPQTG
EYVKVLDHHVAYFRPGKDLKERVWGIKG
>Q46121 ~~~hup~~~DNA-binding protein HU~~~COG0776
MTKADFISLVAQTAGLTKKDATTATDAVISTITDVLAKGDSISFIGFGTFSTQERAAREARVPSTGKTIKVPATRVAKFK
VGKNLKEAVAKASGKKKK
>P05385 ~~~hup~~~DNA-binding protein HU~~~
MNKAELITSMAEKSKLTKKDAELALKALIESVEEALEKGEKVQLVGFGTFETRERAAREGRNPRTKEVINIPATTVPVFK
AGKEFKDKVNK
>P0A3H0 ~~~hup~~~DNA-binding protein HU~~~
MNKTELINAVAETSGLSKKDATKAVDAVFDSITEALRKGDKVQLIGFGNFEVRERAARKGRNPQTGEEMEIPASKVPAFK
PGKALKDAVK
>Q9CI64 ~~~hup~~~DNA-binding protein HU~~~COG0776
MANKQDLIAEVAAKTGLTKKDSEKAVNAFGEVVTEFLAKGEKVQLIGFGTFETRERAAREGRNPQTGEAIKIAATVVPAF
KAGKALKDAVK
>Q9XB18 1.16.3.1~~~hupB~~~DNA-binding protein HupB~~~
MNKAELIDVLTQKLGSDRRQATAAVENVVDTIVRAVHKGDSVTITGFGVFEQRRRAARVARNPRTGETVKVKPTSVPAFR
PGAQFKAVVSGAQRLPAEGPAVKRGVGASAAKKVAKKAPAKKATKAAKKAATKAPARKAATKAPAKKAATKAPAKKAVKA
TKSPAKKVTKAVKKTAVKASVRKAATKAPAKKAAAKRPATKAPAKKATARRGRK
>O33125 1.16.3.1~~~hupB~~~DNA-binding protein HupB~~~COG0776
MNKAELIDVLTQKLGSDRRQATAAVENVVDTIVRAVHKGDSVTITGFGVFEQRRRAARVARNPRTGETVKVKPTSVPAFR
PGAQFKAVVAGAQRLPLEGPAVKRGVATSAAKKAAIKKAPVKKALAKKAATKAPAKKAVKAPAKKITTAVKVPAKKATKV
VKKVAAKAPVRKATTRALAKKAAVKKAPAKKVTAAKRGRK
>Q9ZHC5 1.16.3.1~~~hup~~~DNA-binding protein HupB~~~COG0776
MNKAELIDVLTTKMGTDRRQATAAVENVVDTIVRAVHKGDSVTITGFGVFEQRRRAARVARNPRTGETVKVKPTSVPAFR
PGAQFKAVISGAQKLPADGPAVKRGVTAGPAKKAAKKAPAKKAAAKKTATKAAAKKAPAKKAATKAPAKKAATKAPAKKA
ATKAPAKKAATKAPAKKAAAKAPAKKAATKAPAKKAAAKKAPAKKGRR
>A5U6Z7 1.16.3.1~~~hupB~~~DNA-binding protein HupB~~~COG0776
MNKAELIDVLTQKLGSDRRQATAAVENVVDTIVRAVHKGDSVTITGFGVFEQRRRAARVARNPRTGETVKVKPTSVPAFR
PGAQFKAVVSGAQRLPAEGPAVKRGVGASAAKKVAKKAPAKKATKAAKKAATKAPARKAATKAPAKKAATKAPAKKAVKA
TKSPAKKVTKAVKKTAVKASVRKAATKAPAKKAAAKRPATKAPAKKATARRGRK
>P9WMK7 1.16.3.1~~~hupB~~~DNA-binding protein HupB~~~COG0776
MGMNKAELIDVLTQKLGSDRRQATAAVENVVDTIVRAVHKGDSVTITGFGVFEQRRRAARVARNPRTGETVKVKPTSVPA
FRPGAQFKAVVSGAQRLPAEGPAVKRGVGASAAKKVAKKAPAKKATKAAKKAATKAPARKAATKAPAKKAATKAPAKKAV
KATKSPAKKVTKAVKKTAVKASVRKAATKAPAKKAAAKRPATKAPAKKATARRGRK
>P05514 ~~~hup~~~DNA-binding protein HU~~~COG0776
MNKGELVDAVAEKASVTKKQADAVLTAALETIIEAVSSGDKVTLVGFGSFESRERKAREGRNPKTNEKMEIPATRVPAFS
AGKLFREKVAPPKA
>P02344 ~~~hupB~~~DNA-binding protein HRm~~~COG0776
MNKNELVAAVADKAGLSKADASSAVDAVFETIQGELKNGGDIRLVGFGNFSVSRREASKGRNPSTGAEVDIPARNVPKFT
AGKGLKDAVN
>Q99U17 ~~~hup~~~DNA-binding protein HU~~~
MNKTDLINAVAEQADLTKKEAGSAVDAVFESIQNSLAKGEKVQLIGFGNFEVRERAARKGRNPQTGKEIDIPASKVPAFK
AGKALKDAVK
>Q7A5J1 ~~~hup~~~DNA-binding protein HU~~~
MNKTDLINAVAEQADLTKKEAGSAVDAVFESIQNSLAKGEKVQLIGFGNFEVRERAARKGRNPQTGKEIDIPASKVPAFK
AGKALKDAVK
>Q9XB21 ~~~hup~~~DNA-binding protein HU~~~COG0776
MANKQDLIAKVAEATELTKKDSAAAVDAVFSAVSSYLAKGEKVQLIGFGNFEVRERAARKGRNPQTGEEIKIKASKVPAF
KAGKALKDAVK
>P36206 ~~~hup~~~DNA-binding protein HU~~~COG0776
MTKKELIDRVAKKAGAKKKDVKLILDTILETITEALAKGEKVQIVGFGSFEVRKAAARKGVNPQTRKPITIPERKVPKFK
PGKALKEKVK
>P19436 ~~~~~~DNA-binding protein HU~~~COG0776
MAAKKTVTKADLVDQVAQATGLKKKDVKAMVDALLAKVEEALANGSKVQLTGFGTFEVRKRKARTGVKPGTKEKIKIPAT
QYPAFKPGKALKDKVKK
>Q814I2 3.6.4.13~~~dbpA~~~ATP-dependent RNA helicase DbpA~~~
MSKKSFSDYKLSKEIVRALTGLGYDHPTEVQGEVIPVALQKKDLVVKSQTGSGKTASFGIPLCEMVEWEENKPQALVLTP
TRELAVQVKEDITNIGRFKRIKAAAVYGKSPFARQKLELKQKTHIVVGTPGRVLDHIEKGTLSLECLKYLVIDEADEMLN
MGFIDQVEAIIDELPTKRMTMLFSATLPEDVEKLSRTYMNSPTHIEIKAAGITTDKIEHTLFEVIEDEKLSLLKDVTTIE
NPDSCIIFCRTQENVDHVYRQLKRANYPCDKIHGGMVQEDRFEVMDDFRKGKFRYLVATDVAARGIDIDNITHVINYDIP
LEKESYVHRTGRTGRAGNSGKAITFITPYENRFLEEIEEYIGFEIPKAIGPSKEEVMKEKAAFEEKLHAKPIIKKDKNAD
INKGIMKLYFNGGKKKKIRAVDFVGTIAKIKGVTAEDIGIITIQDNVSYVEILNGKGPLVLKVMKTTTIKGKQLKVHEAI
K
>P42305 3.6.4.13~~~dbpA~~~ATP-dependent RNA helicase DbpA~~~COG0513
MSHFKNYQISHDILRALEGLGYTEPTKVQQSVIPAALERKDLVVKSQTGSGKTASFGIPLCELANWDENKPQALILTPTR
ELAVQVKEDITNIGRFKRIKATAVFGKSSFDKQKAELKQKSHIVVGTPGRVLDHIEKGTLPLDRLSYLVIDEADEMLNMG
FIEQVEAIIKHLPTERTTMLFSATLPQDIEKLSRQYMQNPEHIEVKAAGLTTRNIEHAVIQVREENKFSLLKDVLMTENP
DSCIIFCRTKEHVNQLTDELDDLGYPCDKIHGGMIQEDRFDVMNEFKRGEYRYLVATDVAARGIDIENISLVINYDLPLE
KESYVHRTGRTGRAGNKGKAISFVTAFEKRFLADIEEYIGFEIQKIEAPSQEEVARKKPEFLAKLNDRPESKKDKSEELN
KDIMKLYFNGGKKKKIRAVDFVGTIAKIDGVSADDIGIITIMDNASYVEILNGKGPHVLKVMKNTTVKGKQLKVNKANK
>O50917 ~~~dbpA~~~Decorin-binding protein A~~~
MIKCNNKTFNNLLKLTILVNLLISCGLTGATKIRLERSAKDITDEIDAIKKDAALKGVNFDAFKDKKTGSGVSENPFILE
AKVRATTVAEKFVIAIEEEATKLKETGSSGEFSAMYDLMFEVSKPLQKLGIQEMTKTVSDAAEENPPTTAQGVLEIAKKM
REKLQRVHTKNYCTLKKKENSTFTDEKCKNN
>P21693 3.6.4.13~~~dbpA~~~ATP-dependent RNA helicase DbpA~~~COG0513
MTAFSTLNVLPPAQLTNLNELGYLTMTPVQAAALPAILAGKDVRVQAKTGSGKTAAFGLGLLQQIDASLFQTQALVLCPT
RELADQVAGELRRLARFLPNTKILTLCGGQPFGMQRDSLQHAPHIIVATPGRLLDHLQKGTVSLDALNTLVMDEADRMLD
MGFSDAIDDVIRFAPASRQTLLFSATWPEAIAAISGRVQRDPLAIEIDSTDALPPIEQQFYETSSKGKIPLLQRLLSLHQ
PSSCVVFCNTKKDCQAVCDALNEVGQSALSLHGDLEQRDRDQTLVRFANGSARVLVATDVAARGLDIKSLELVVNFELAW
DPEVHVHRIGRTARAGNSGLAISFCAPEEAQRANIISDMLQIKLNWQTPPANSSIATLEAEMATLCIDGGKKAKMRPGDV
LGALTGDIGLDGADIGKIAVHPAHVYVAVRQAVAHKAWKQLQGGKIKGKTCRVRLLK
>Q8Y7M8 3.6.4.13~~~dbpA~~~ATP-dependent RNA helicase DbpA~~~COG0513
MNNLKLSEEIKRAINELGYTEATPVQKAVIPVALTGEDIVAKSQTGSGKTAAFAIPIAEQVEWEENKPQALIIVPTRELA
MQVKTECTNIGRFKRVKAAAIYGQSPFAKQKLELSQKNHIVVGTPGRLLDHIEKGSLNVDKVAHLVLDEVDEMLSMGFID
QVEDILSRLPKQRQNLFFSATMPEEMQDLIKRYQDNPMVIEMASEKTNPIFHVEMQTDNKEKTLKDVLITENPDSAIIFC
NTKNQVDELTDLLDVKASKIHGGLRQEDRFRAMDDFKSGKSRFLIATDVAGRGIDVDNVSLVINYDLPIEKENYVHRIGR
TGRAGKSGKAISFVKTNENPLLRDIEEMLDVTIEKKRKPTVIEVKVNEDAFRKKQQKRPTIKKARGEKLNKNIMKLYFNG
GKKKKIRAVDFVGTISKLEGITAEDIGIITIEDHVSFVEILNGKGPAVLEMMRSRKVKGRRLKVNEARKR
>P0CL68 ~~~dbpB~~~Decorin-binding protein B~~~
MKIGKLNSIVIALFFKLLVACSIGLVERTNAALESSSKDLKNKILKIKKEATGKGVLFEAFTGLKTGSKVTSGGLALREA
KVQAIVETGKFLKIIEEEALKLKETGNSGQFLAMFDLMLEVVESLEDVGIIGLKARVLEESKNNPINTAERLLAAKAQIE
NQLKVVKEKQNIENGGEKKNNKSKKKK
>B5GS26 4.2.3.97~~~~~~(-)-delta-cadinene synthase~~~
MSTRPVEGSAIWDVLSPHSPHAAAADGKTLVWVEAGELCGHDTAESANLARIRPGLLAAFCHPKATEDDLTLITKWMAWL
FLLDDRIDESDLGRDADLLDGHLQDLQGVALGIRTASGPMSRALEEIITQASAGMGDAWQLRFRRNISDYLLACVWQAAH
RQAGEFPDPEVFPHWRRAFGAIMPSFDLIERTDGGALPSCVYYSRPYQSLLTAAADLVCWTNDLMTVDKEAAHGDLHNLV
LVTEHDRHQDRRTASAAVSAACEQRMRAHTSARRDLTGLTAALGLPDTVRTHADDCAASLLVWVRGHLEWGLETPRYRPG
TTGTGTD
>P60327 3.5.1.77~~~~~~N-carbamoyl-D-amino acid hydrolase~~~
MTRQMILAVGQQGPIARAETREQVVVRLLDMLTKAASRGANFIVFPELALTTFFPRWHFTDEAELDSFYETEMPGPVVRP
LFEKAAELGIGFNLGYAELVVEGGVKRRFNTSILVDKSGKIVGKYRKIHLPGHKEYEAYRPFQHLEKRYFEPGDLGFPVY
DVDAAKMGMFICNDRRWPEAWRVMGLRGAEIICGGYNTPTHNPPVPQHDHLTSFHHLLSMQAGSYQNGAWSAAAGKVGME
ENCMLLGHSCIVAPTGEIVALTTTLEDEVITAAVDLDRCRELREHIFNFKQHRQPQHYGLIAEL
>Q5S260 3.5.1.77~~~~~~N-carbamoyl-D-amino acid hydrolase~~~
MTRQMILAVGQQGPIARAETREQVVVRLLYMLTKAASRGANFIVFPELAFTTFFPRWHFTDEAELDSFYETEMPGPVVRP
LFEKAAELGIGFNLGYAELVVEGGVKRRFNTSILVDKPGKIVGKYRKIHLPGHKEYEAYRPFQHLEKRYFEPGDLGFPVY
DVDAAKMGMFICNDRRWPEAWRVMGLRGAEIICGGYNTPTHNPPVPQHDHLTSFHHLLSMQAGSYQNGAWSAAAGKVGME
ENCMLLGHSCIVAPTGEIVALTTTLEDEVITAAVCLDRCRELREHIFNFKQHRQPQHYGLIAEL
>Q44185 3.5.1.77~~~~~~N-carbamoyl-D-amino acid hydrolase~~~
MTRQMILAVGQQGPIARAETREQVVGRLLDMLTNAASRGVNFIVFPELALTTFFPRWHFTDEAELDSFYETEMPGPVVRP
LFETAAELGIGFNLGYAELVVEGGVKRRFNTSILVDKSGKIVGKYRKIHLPGHKEYEAYRPFQHLEKRYFEPGDLGFPVY
DVDAAKMGMFICNDRRWPETWRVMGLKGAEIICGGYNTPTHNPPVPQHDHLTSFHHLLSMQAGSYQNGAWSAAAGKVGME
EGCMLLGHSCIVAPTGEIVALTTTLEDEVITAAVDLDRCRELREHIFNFKAHRQPQHYGLIAEF
>O67262 4.1.1.20~~~lysA~~~Diaminopimelate decarboxylase~~~COG0019
MELLKEYNPYLEYRDGELFIEGVSLKELAQTFGTPLYVYSSNFIKERFEAYRKAFPDALICYAVKANFNPHLVKLLGELG
AGADIVSGGELYLAKKAGIPPERIVYAGVGKTEKELTDAVDSEILMFNVESRQELDVLNEIAGKLGKKARIAIRVNPDVD
PKTHPYIATGMQKSKFGVDIREAQKEYEYASKLENLEIVGIHCHIGSQILDISPYREAVEKVVSLYESLTQKGFDIKYLD
IGGGLGIKYKPEDKEPAPQDLADLLKDLLENVKAKIILEPGRSIMGNAGILITQVQFLKDKGSKHFIIVDAGMNDLIRPS
IYNAYHHIIPVETKERKKVVADIVGPICETGDFLALDREIEEVQRGEYLAVLSAGAYGFAMSSHYNMRPRAAEVLVENGS
VKLIRKRENYDYIVEPSLDI
>P09890 4.1.1.20~~~lysA~~~Diaminopimelate decarboxylase~~~COG0019
MATVENFNELPAHVWPRNAVRQEDGVVTVAGVPLPDLAEEYGTPLFVVDEDDFRSRCRDMATAFGGPGNVHYASKAFLTK
TIARWVDEEGLALDIASINELGIALAAGFPASRITAHGNNKGVEFLRALVQNGVGHVVLDSAQELELLDYVAAGEGKIQD
VLIRVKPGIEAHTHEFIATSHEDQKFGFSLASGSAFEAAKAANNAENLNLVGLHCHVGSQVFDAEGFKLAAERVLGLYSQ
IHSELGVALPELDLGGGYGIAYTAAEEPLNVAEVASDLLTAVGKMAAELGIDAPTVLVEPGRAIAGPSTVTIYEVGTTKD
VHVDDDKTRRYIAVDGGMSDNIRPALYGSEYDARVVSRFAEGDPVSTRIVGSHCESGDILINDEIYPSDITSGDFLALAA
TGAYCYAMSSRYNAFTRPAVVSVRAGSSRLMLRRETLDDILSLEA
>P00861 4.1.1.20~~~lysA~~~Diaminopimelate decarboxylase~~~COG0019
MPHSLFSTDTDLTAENLLRLPAEFGCPVWVYDAQIIRRQIAALKQFDVVRFAQKACSNIHILRLMREQGVKVDSVSLGEI
ERALAAGYNPQTHPDDIVFTADVIDQATLERVSELQIPVNAGSVDMLDQLGQVSPGHRVWLRVNPGFGHGHSQKTNTGGE
NSKHGIWYTDLPAALDVIQRHHLQLVGIHMHIGSGVDYAHLEQVCGAMVRQVIEFGQDLQAISAGGGLSVPYQQGEEAVD
TEHYYGLWNAAREQIARHLGHPVKLEIEPGRFLVAQSGVLITQVRSVKQMGSRHFVLVDAGFNDLMRPAMYGSYHHISAL
AADGRSLEHAPTVETVVAGPLCESGDVFTQQEGGNVETRALPEVKAGDYLVLHDTGAYGASMSSNYNSRPLLPEVLFDNG
QARLIRRRQTIEELLALELL
>E0IWI3 4.1.1.20~~~lysA~~~Diaminopimelate decarboxylase~~~
MPHSLFSTDTDLTAENLLRLPAEFGCPVWVYDAQIIRRQIAALKQFDVVRFAQKACSNIHILRLMREQGVKVDSVSLGEI
ERALAAGYNPQTHPDDIVFTADVIDQATLERVSELQIPVNAGSVDMLDQLGQVSPGHRVWLRVNPGFGHGHSQKTNTGGE
NSKHGIWYTDLPAALDVIQRHHLQLVGIHMHIGSGVDYAHLEQVCGAMVRQVIEFGQDLQAISAGGGLSVPYQQGEEAVD
TEHYYGLWNAAREQIARHLGHPVKLEIEPGRFLVAQSGVLITQVRSVKQMGSRHFVLVDAGFNDLMRPAMYGSYHHISAL
AADGRSLEHAPTVETVVAGPLCESGDVFTQQEGGNVETRALPEVKAGDYLVLHDTGAYGASMSSNYNSRPLLPEVLFDNG
QARLIRRRQTIEELLALELL
>B4XMC6 4.1.1.20~~~lysA~~~Diaminopimelate decarboxylase~~~COG0019
MFNYEELFQTHKTPFYLYDFDKIKQAFLNYKEAFKGRKSLICYALKANSNLSILSLLAHLESGADCVSIGEIQRALKAGI
KPYRIVFSGVGKSAFEIEQALKLNILFLNVESFMELKTIETIAQSLGIKARISIRINPNIDAKTHPYISTGLKENKFGVG
EKEALEMFLWAKKSAFLEPVSVHFHIGSQLLDLEPIIEASQKVAKIAKSLIALGIDLRFFDVGGGIGVSYENEETIKLYD
YAQGILNALQGLDLTIICEPGRSIVAESGELITQVLYEKKAQNKRFVIVDAGMNDFLRPSLYHAKHAIRVITPSKGREIS
PCDVVGPVCESSDTFLKDAHLPELEPGDKIAIEKVGAYGSSMASQYNSRPKLLELALEDHKIRVIRKREALEDLWRLEEE
GLKGV
>P9WIU7 4.1.1.20~~~lysA~~~Diaminopimelate decarboxylase~~~COG1166
MNELLHLAPNVWPRNTTRDEVGVVCIAGIPLTQLAQEYGTPLFVIDEDDFRSRCRETAAAFGSGANVHYAAKAFLCSEVA
RWISEEGLCLDVCTGGELAVALHASFPPERITLHGNNKSVSELTAAVKAGVGHIVVDSMTEIERLDAIAGEAGIVQDVLV
RLTVGVEAHTHEFISTAHEDQKFGLSVASGAAMAAVRRVFATDHLRLVGLHSHIGSQIFDVDGFELAAHRVIGLLRDVVG
EFGPEKTAQIATVDLGGGLGISYLPSDDPPPIAELAAKLGTIVSDESTAVGLPTPKLVVEPGRAIAGPGTITLYEVGTVK
DVDVSATAHRRYVSVDGGMSDNIRTALYGAQYDVRLVSRVSDAPPVPARLVGKHCESGDIIVRDTWVPDDIRPGDLVAVA
ATGAYCYSLSSRYNMVGRPAVVAVHAGNARLVLRRETVDDLLSLEVR
>Q9X1K5 4.1.1.20~~~lysA~~~Diaminopimelate decarboxylase~~~COG0019
MDILRKVAEIHGTPTYVYFEETLRKRSRLVKEVFEGVNLLPTFAVKANNNPVLLKILREEGFGMDVVTKGELLAAKLAGV
PSHTVVWNGNGKSRDQMEHFLREDVRIVNVDSFEEMEIWRELNPEGVEYFIRVNPEVDAKTHPHISTGLKKHKFGIPLED
LDSFMERFRSMNIRGLHVHIGSQITRVEPFVEAFSKVVRASERYGFEEINIGGGWGINYSGEELDLSSYREKVVPDLKRF
KRVIVEIGRYIVAPSGYLLLRVVLVKRRHNKAFVVVDGGMNVLIRPALYSAYHRIFVLGKQGKEMRADVVGPLCESGDVI
AYDRELPEVEPGDIIAVENAGAYGYTMSNNYNSTTRPAEVLVRENGRISLIRRRETEMDIFKDVVM
>Q9KVL7 4.1.1.20~~~lysA~~~Diaminopimelate decarboxylase~~~COG0019
MDYFNYQEDGQLWAEQVPLADLANQYGTPLYVYSRATLERHWHAFDKSVGDYPHLICYAVKANSNLGVLNTLARLGSGFD
IVSVGELERVLAAGGDPSKVVFSGVGKTEAEMKRALQLKIKCFNVESEPELQRLNKVAGELGVKAPISLRINPDVDAKTH
PYISTGLRDNKFGITFDRAAQVYRLAHSLPNLDVHGIDCHIGSQLTALAPFIDATDRLLALIDSLKAEGIHIRHLDVGGG
LGVVYRDELPPQPSEYAKALLDRLERHRDLELIFEPGRAIAANAGVLVTKVEFLKHTEHKNFAIIDAAMNDLIRPALYQA
WQDIIPLRPRQGEAQTYDLVGPVCETSDFLGKDRDLVLQEGDLLAVRSSGAYGFTMSSNYNTRPRVAEVMVDGNKTYLVR
QREELSSLWALESVLPE
>Q9KFV3 3.5.4.30~~~dcd~~~dCTP deaminase, dUMP-forming~~~COG0717
MILSGKTISEKLTEKELEITPLTEEQIQPASVDLRLGPHFVTIDDSKEAVISFERPIRYREWTTSDETIVLPPHTFLLAT
TMETVKLPNHLTAFVEGRSSVGRLGLFIQNAGWVDPGFNGQITLELFNANRLPIELPIGRRICQLVFAEVTGEVAPYQGK
YLFQKGATMSEIYKDAF
>A0QQ98 3.5.4.30~~~dcd~~~dCTP deaminase, dUMP-forming~~~COG0717
MLLSDRDIRAEIAAKRLALEPFDDALVQPSSIDVRLDRMFRVFNNTRYTHIDPAMQQDELTTLVEPAEGEPFVLHPGEFV
LGSTLELCTLPDDLAGRLEGKSSLGRLGLLTHSTAGFIDPGFSGHITLELSNVANLPITLWPGMKIGQLCLLRLTSPAEN
PYGSAAVGSKYQGQRGPTPSRSHLNFIKS
>P9WP17 3.5.4.30~~~dcd~~~dCTP deaminase, dUMP-forming~~~COG0717
MLLSDRDLRAEISSGRLGIDPFDDTLVQPSSIDVRLDCLFRVFNNTRYTHIDPAKQQDELTSLVQPVDGEPFVLHPGEFV
LGSTLELFTLPDNLAGRLEGKSSLGRLGLLTHSTAGFIDPGFSGHITLELSNVANLPITLWPGMKIGQLCMLRLTSPSEH
PYGSSRAGSKYQGQRGPTPSRSYQNFIRST
>Q2GLJ4 3.5.4.13~~~dcd~~~dCTP deaminase~~~COG0717
MSVMPDHWIKERALKDGMISPFVDHKEGTGVLSYGLSSYGYDARLDNKFKIFANTHSVVVDPKNFSQDSFVDREGDFCII
PPNSFMLAKTVEYFNIPRDVMVVCVGKSTYARCGIVVNVTPLEPGWSGYVTLEFSNTSPLPVKVYAFEGACQFLFFSGKE
RCSKSYDEAGGKYMGQSDVTLPIIS
>Q2T083 3.5.4.13~~~dcd~~~dCTP deaminase~~~
MSIKSDKWIRRMAEEHKMIEPFVPDQVRAAEDGRRIVSYGTSSYGYDIRCADEFKIFTNINSTIVDPKNFDEGSFVDFKG
DVCIIPPNSFALARTVEYFRIPRTVLTVCLGKSTYARCGIIVNVTPFEPEWEGYVTLEFSNTTPLPAKIYANEGVAQVLF
FESDEVCDVSYADRGGKYQGQRGVTLPKT
>Q9PN07 3.5.4.13~~~dcd~~~dCTP deaminase~~~COG0717
MGLKADNWIRKMALEHKMIEPFCEANIGKGVVSYGLSSYGYDIRVGREFKIFTNVNSTVVDPKNFVEENVVDFEGDVCIV
PANSFALARTIEYFKMPDNVLAICLGKSTYARCGIIVNVTPFEPGFEGHITIEISNTTPLPAKIYANEGIAQVLFLQGDE
KCDTTYKDKKGKYQAQTGITLPRILK
>P28248 3.5.4.13~~~dcd~~~dCTP deaminase~~~COG0717
MRLCDRDIEAWLDEGRLSINPRPPVERINGATVDVRLGNKFRTFRGHTAAFIDLSGPKDEVSAALDRVMSDEIVLDEGEA
FYLHPGELALAVTLESVTLPADLVGWLDGRSSLARLGLMVHVTAHRIDPGWSGCIVLEFYNSGKLPLALRPGMLIGALSF
EPLSGPAVRPYNRREDAKYRNQQGAVASRIDKD
>P69909 4.1.1.15~~~gadA~~~Glutamate decarboxylase alpha~~~COG0076
MDQKLLTDFRSELLDSRFGAKAISTIAESKRFPLHEMRDDVAFQIINDELYLDGNARQNLATFCQTWDDENVHKLMDLSI
NKNWIDKEEYPQSAAIDLRCVNMVADLWHAPAPKNGQAVGTNTIGSSEACMLGGMAMKWRWRKRMEAAGKPTDKPNLVCG
PVQICWHKFARYWDVELREIPMRPGQLFMDPKRMIEACDENTIGVVPTFGVTYTGNYEFPQPLHDALDKFQADTGIDIDM
HIDAASGGFLAPFVAPDIVWDFRLPRVKSISASGHKFGLAPLGCGWVIWRDEEALPQELVFNVDYLGGQIGTFAINFSRP
AGQVIAQYYEFLRLGREGYTKVQNASYQVAAYLADEIAKLGPYEFICTGRPDEGIPAVCFKLKDGEDPGYTLYDLSERLR
LRGWQVPAFTLGGEATDIVVMRIMCRRGFEMDFAELLLEDYKASLKYLSDHPKLQGIAQQNSFKHT
>P69908 4.1.1.15~~~gadA~~~Glutamate decarboxylase alpha~~~COG0076
MDQKLLTDFRSELLDSRFGAKAISTIAESKRFPLHEMRDDVAFQIINDELYLDGNARQNLATFCQTWDDENVHKLMDLSI
NKNWIDKEEYPQSAAIDLRCVNMVADLWHAPAPKNGQAVGTNTIGSSEACMLGGMAMKWRWRKRMEAAGKPTDKPNLVCG
PVQICWHKFARYWDVELREIPMRPGQLFMDPKRMIEACDENTIGVVPTFGVTYTGNYEFPQPLHDALDKFQADTGIDIDM
HIDAASGGFLAPFVAPDIVWDFRLPRVKSISASGHKFGLAPLGCGWVIWRDEEALPQELVFNVDYLGGQIGTFAINFSRP
AGQVIAQYYEFLRLGREGYTKVQNASYQVAAYLADEIAKLGPYEFICTGRPDEGIPAVCFKLKDGEDPGYTLYDLSERLR
LRGWQVPAFTLGGEATDIVVMRIMCRRGFEMDFAELLLEDYKASLKYLSDHPKLQGIAQQNSFKHT
>P69910 4.1.1.15~~~gadB~~~Glutamate decarboxylase beta~~~COG0076
MDKKQVTDLRSELLDSRFGAKSISTIAESKRFPLHEMRDDVAFQIINDELYLDGNARQNLATFCQTWDDENVHKLMDLSI
NKNWIDKEEYPQSAAIDLRCVNMVADLWHAPAPKNGQAVGTNTIGSSEACMLGGMAMKWRWRKRMEAAGKPTDKPNLVCG
PVQICWHKFARYWDVELREIPMRPGQLFMDPKRMIEACDENTIGVVPTFGVTYTGNYEFPQPLHDALDKFQADTGIDIDM
HIDAASGGFLAPFVAPDIVWDFRLPRVKSISASGHKFGLAPLGCGWVIWRDEEALPQELVFNVDYLGGQIGTFAINFSRP
AGQVIAQYYEFLRLGREGYTKVQNASYQVAAYLADEIAKLGPYEFICTGRPDEGIPAVCFKLKDGEDPGYTLYDLSERLR
LRGWQVPAFTLGGEATDIVVMRIMCRRGFEMDFAELLLEDYKASLKYLSDHPKLQGIAQQNSFKHT
>Q9CG20 4.1.1.15~~~gadB~~~Glutamate decarboxylase~~~COG0076
MLYGKENRDEAEFLEPIFGSESEQVDLPKYKLAQQSIEPRVAYQLVQDEMLDEGNARLNLATFCQTYMEPEAVKLMSQTL
EKNAIDKSEYPRTTEIENRCVNMIADLWNASEKEKFMGTSTIGSSEACMLGGMAMKFSWRKRAEKLGLDINAKKPNLVIS
SGYQVCWEKFCIYWDIEMREVPMDKEHMSINLDKVMDYVDEYTIGVVGIMGITYTGRYDDIKALDNLIEEYNKQTDYKVY
IHVDAASGGLYAPFVEPELEWDFRLKNVISINTSGHKYGLVYPGVGWVLWRDKKYLPEELIFKVSYLGGELPTMAINFSH
SASQLIGQYYNFVRYGFDGYKAIHERTHKVAMFLAKEIEKTGMFEIMNDGSQLPIVCYKLKEDSNRGWNLYDLADRLLMK
GWQVPAYPLPKNLENEIIQRLVIRADFGMNMAFNYVQDMQEAIEALNKAHILYHEEPENKTYGFTH
>O30418 4.1.1.15~~~gadB~~~Glutamate decarboxylase~~~COG0076
MLYGKENRDEAEFLEPIFGSESEQVDLPKYKLAQQSIEPRVAYQLVQDEMLDEGNARLNLATFCQTYMEPEAVKLMSQTL
EKNAIDKSEYPRTTEIENRCVNMIADLWNASEKEKFMGTSTIGSSEACMLGGMAMKFSWRKRAEKLGLDINAKKPNLVIS
SGYQVCWEKFCVYWDIEMREVPMDREHMSINLEKVMDYVDEYTIGVVGIMGITYTGRYDDIKALDNLIEEYNKQTDYKVY
IHVDAASGGLYAPFVEPELEWDFRLKNVISINTSGHKYGLVYPGVGWVLWRDKKYLPEELIFKVSYLGGELPTMAINFSH
SASQLIGQYYNFVRYGFDGYKAIHERTHKVAMYLAEEIEKTGMFEIMNDGAQLPIVCYKLKENSNRGWNLYDLADRLLMK
GWQVPAYPLPKNLENEIIQRLVIRADFGMNMAFNYVQDMQEAIDALNKAHILFHQEPENKTYGFTH
>P0C2E5 4.1.1.22~~~hdc~~~Histidine decarboxylase proenzyme~~~
MNKNLEANRNRTLSEGIHKNIKVRAPKIDKTAISPYDRYCDGYGMPGAYGNGYVSVLKVSVGTVKKTDDILLDGIVSYDR
AEINDAYVGQINMLTASSFCGVAGQVWGHDLATHDSIAKDEIKPLYELKQFDGTPLKVYDAKPLLEAGIELFGTEKNRRF
TTAPGAHVICANKSATAYRPKENRPLKEGEAYGVWSFIALSLSNDRDHCADLFIEDAGLWTKNDNPEDLKKFLEDHRKAV
TWSVVECGRDSHVVFERTYIGFAYVIMKPGEIGNALTCAPYVTLARDAVPSEGFPSLNRISLSQWLDDMNFDSLVNPSKK
>P00862 4.1.1.22~~~hdcA~~~Histidine decarboxylase proenzyme~~~
MSELDAKLNKLGVDRIAISPYKQWTRGYMEPGNIGNGYVTGLKVDAGVRDKSDDDVLDGIVSYDRAETKNAYIGQINMTT
ASSFTGVQGRVIGYDILRSPEVDKAKPLFTETQWDGSELPIYDAKPLQDALVEYFGTEQDRRHYPAPGSFIVCANKGVTA
ERPKNDADMKPGQGYGVWSAIAISFAKDPTKDSSMFVEDAGVWETPNEDELLEYLEGRRKAMAKSIAECGQDAHASFESS
WIGFAYTMMEPGQIGNAITVAPYVSLPIDSIPGGSILTPDKDMEIMENLTMPEWLEKMGYKSLSANNALKY
>P05034 4.1.1.22~~~hdc~~~Histidine decarboxylase~~~
MTLSINDQNKLDAFWAYCVKNQYFNIGYPESADFDYTNLERFLRFSINNCGDWGEYCNYLLNSFDFEKEVMEYFADLFKI
PFEQSWGYVTNGGTEGNMFGCYLGREIFPDGTLYYSKDTHYSVAKIVKLLRIKSQVVESQPNGEIDYDDLMKKIADDKEA
HPIIFANIGTTVRGAIDDIAEIQKRLKAAGIKREDYYLHADAALSGMILPFVDDAQPFTFADGIDSIGVSGHKMIGSPIP
CGIVVAKKENVDRISVEIDYISAHDKTITGSRNGHTPLMLWEAIRSHSTEEWKRRITRSLDMAQYAVDRMQKAGINAWRN
KNSITVVFPCPSERVWREHCLATSGDVAHLITTAHHLDTVQIDKLIDDVIADFNLHAA
>Q83WC8 3.1.1.35~~~dch~~~Bifunctional esterase/perhydrolase DCH~~~
MGYVTTKDGVDIFYKDWGPRDAPVIFFHHGWPLSSDDWDAQMLFFLKEGFRVVAHDRRGHGRSTQVWDGHDMDHYADDVA
AVVEYLGVQGAVHVGHSTGGGEVAYYVARYPNDPVAKAVLISAVPPLMVKTESNPDGLPKEVFDDLQNQLFKNRSQFYHD
VPAGPFYGFNRPGAKVSEPVVLNWWRQGMMGGAKAHYDGIVAFSQTDFTEALKKIEVPVLILHGEDDQVVPFEISGKKSA
ELVKNGKLISYPGFPHGMPTTEAETINKDLLAFIRS
>O87873 4.2.1.100~~~dch~~~Cyclohexa-1,5-dienecarbonyl-CoA hydratase~~~
MSEASSPLKVWLERDGSLLRLRLARPKANIVDAAMIAAMRQALGEHLQAPALRAVLLDAEGPHFSFGASVDEHMPDQCAQ
MLKSLHGLVREMLDSPVPILVALRGQCLGGGLEVAAAGNLLFAAPDAKFGQPEIRLGVFAPAASCLLPPRVGQACAEDLL
WSGRSIDGAEGHRIGLIDVLAEDPEAAALRWFDEHIARLSASSLRFAVRAARCDSVPRIKQKLDTVEALYLEELMASHDA
VEGLKAFLEKRSANWENR
>P51852 4.1.1.74~~~ipdC~~~Indole-3-pyruvate decarboxylase~~~
MKLAEALLRALKDRGAQAMFGIPGDFALPFFKVAEETQILPLHTLSHEPAVGFAADAAARYSSTLGVAAVTYGAGAFNMV
NAVAGAYAEKSPVVVISGAPGTTEGNAGLLLHHQGRTLDTQFQVFKEITVAQARLDDPAKAPAEIARVLGAARALSRPVY
LEIPRNMVNAEVEPVGDDPAWPVDRDALAACADEVLAAMRSATSPVLMVCVEVRRYGLEAKVAELAQRLGVPVVTTFMGR
GLLADAPTPPLGTYIGVAGDAEITRLVEESDGLFLLGAILSDTNFAVSQRKIDLRKTIHAFDRAVTLGYHTYADIPLAGL
VDALLEGLPPSDRTTRGKEPHAYPTGLQADGEPIAPMDIARAVNDRVRAGQEPLLIAADMGDCLFTAMDMIDAGLMAPGY
YAGMGFGVPAGIGAQCVSGGKRILTVVGDGAFQMTGWELGNCRRLGIDPIVILFNNASWEMLRTFQPESAFNDLDDWRFA
DMAAGMGGDGVRVRTRAELKAALDKAFATRGRFQLIEAMIPRGVLSDTLARFVQGQKRLHAAPRE
>P23234 4.1.1.74~~~ipdC~~~Indole-3-pyruvate decarboxylase~~~COG3961
MRTPYCVADYLLDRLTDCGADHLFGVPGDYNLQFLDHVIDSPDICWVGCANELNASYAADGYARCKGFAALLTTFGVGEL
SAMNGIAGSYAEHVPVLHIVGAPGTAAQQRGELLHHTLGDGEFRHFYHMSEPITVAQAVLTEQNACYEIDRVLTTMLRER
RPGYLMLPADVAKKAATPPVNALTHKQAHADSACLKAFRDAAENKLAMSKRTALLADFLVLRHGLKHALQKWVKEVPMAH
ATMLMGKGIFDERQAGFYGTYSGSASTGAVKEAIEGADTVLCVGTRFTDTLTAGFTHQLTPAQTIEVQPHAARVGDVWFT
GIPMNQAIETLVELCKQHVHAGLMSSSSGAIPFPQPDGSLTQENFWRTLQTFIRPGDIILADQGTSAFGAIDLRLPADVN
FIVQPLWGSIGYTLAAAFGAQTACPNRRVIVLTGDGAAQLTIQELGSMLRDKQHPIILVLNNEGYTVERAIHGAEQRYND
IALWNWTHIPQALSLDPQSECWRVSEAEQLADVLEKVAHHERLSLIEVMLPKADIPPLLGALTKALEACNNA
>P37529 2.7.1.74~~~dck~~~Deoxyadenosine/deoxycytidine kinase~~~COG1428
MKEHHIPKNSIITVAGTVGVGKSTLTKTLAKRLGFKTSLEEVDHNPYLEKFYHDFERWSFHLQIYFLAERFKEQKTIFEA
GGGFVQDRSIYEDTGIFAKMHADKGTMSKVDYKTYTSLFEAMVMTPYFPHPDVLIYLEGDLENILNRIEQRGREMELQTS
RSYWEEMHTRYENWISGFNACPVLKLRIEDYDLLNDENSIENIVDQIASVIHDNQKK
>O50657 4.1.1.17~~~ldc~~~Lysine/ornithine decarboxylase~~~
MKNFRLSEKEVKTLAKRIPTPFLVASLDKVEENYQFMRRHLPRAGVFYAMKANPTPEILSLLAGLGSHFDVASAGEMEIL
HELGVDGSQMIYANPVKDARGLKAAADYNVRRFTFDDPSEIDKMAKAVPGADVLVRIAVRNNKALVDLNTKFGAPVEEAL
DLLKAAQDAGLHAMGICFHVGSQSLSTAAYEEALLVARRLFDEAEEMGMHLTDLDIGGGFPVPDAKGLNVDLAAMMEAIN
KQIDRLFPDTAVWTEPGRYMCGTAVNLVTSVIGTKTRGEQPWYILDEGIYGCFSGIMYDHWTYPLHCFGKGNKKPSTFGG
PSCDGIDVLYRDFMAPELKIGDKVLVTEMGSYTSVSATRFNGFYLAPTIIFEDQPEYAARLTEDDDVKKKAAV
>P21161 4.5.1.3~~~dcmA~~~Dichloromethane dehalogenase~~~
MSPNPTNIHTGKTLRLLYHPASQPCRSAHQFMYEIDVPFEEEVVDISTDITERQEFRDKYNPTGQVPILVDGEFTVWESV
AIARYVNEKFDGAGNWFGRGTQERAQINQFLQWYAYTLRLGGGAFHWNIFGCLIYGEKPYSPKFTAEQNKGRTLLYEAMG
TLENYWLRDREYVCGDEVSYADLAAFHEFVSHEAGKIIPDRVWQGFPKIAAWFKKLSERPHAKTVSEWQYTNVGKIIRGE
LTASMFKRKTAVLKGTEVFSGHNHGIPYLNEKAEDYFKRVEKEGAAVA
>P43387 4.5.1.3~~~dcmA~~~Dichloromethane dehalogenase~~~
MSTKLRYLHHPASQPCRAVHQFMLENNIEFQEEIVDITTDINEQPEFRERYNPTGQVPILVDGDFTIWESAAIVYYLSEK
YDCSSSWWGSTLEERGHIQQYMHWYAYTLRLGGGAFHWTIFAPMIYGYDKDFTVEVTKGRFLLYESFDILEKYWLKDGDY
LCGNTLSYPDLATCQDLVSHDAGRIIPTSMWDSHPKVKAWFARMMDREHAKTVSAWQYENVRKYLDDGVKLNFQRKTAVL
KGTEVYSGHNNGIIYNGDDDSFVTQHG
>P27988 2.3.1.169~~~~~~Carbon monoxide dehydrogenase/acetyl-CoA synthase subunit alpha~~~
MTDFDKIFEGAIPEGKEPVALFREVYHGAITATSYAEILLNQAIRTYGPDHPVGYPDTAYYLPVIRCFSGEEVKKLGDLP
PILNRKRAQVSPVLNFENARLAGEATWYAAEIIEALRYLKYKPDEPLLPPPWTGFIGDPVVRRFGIKMVDWTIPGEAIIL
GRAKDSKALAKIVKELMGMGFMLFICDEAVEQLLEENVKLGIDYIAYPLGNFTQIVHAANYALRAGMMFGGVTPGAREEQ
RDYQRRRIRAFVLYLGEHDMVKTAAAFGAIFTGFPVITDQPLPEDKQIPDWFFSVEDYDKIVQIAMETRGIKLTKIKLDL
PINFGPAFEGESIRKGDMYVEMGGNRTPAFELVRTVSESEITDGKIEVIGPDIDQIPEGSKLPLGILVDIYGRKMQADFE
GVLERRIHDFINYGEGLWHTGQRNINWLRVSKDAVAKGFRFKNYGEILVAKMKEEFPAIVDRVQVTIFTDEAKVKEYMEV
AREKYKERDDRMRGLTDETVDTFYSCVLCQSFAPNHVCIVTPERVGLCGAVSWLDAKASYEINHAGPNQPIPKEGEIDPI
KGIWKSVNDYLYTASNRNLEQVCLYTLMENPMTSCGCFEAIMAILPECNGIMITTRDHAGMTPSGMTFSTLAGMIGGGTQ
TPGFMGIGRTYIVSKKFISADGGIARIVWMPKSLKDFLHDEFVRRSVEEGLGEDFIDKIADETIGTTVDEILPYLEEKGH
PALTMDPIM
>P27989 1.2.7.4~~~~~~Carbon monoxide dehydrogenase/acetyl-CoA synthase subunit beta~~~
MPRFRDLSHNCRPSEAPRVMEPKNRDRTVDPAVLEMLVKSKDDKVITAFDRFVAQQPQCKIGYEGICCRFCMAGPCRIKA
TDGPGSRGICGASAWTIVARNVGLMILTGAAAHCEHGNHIAHALVEMAEGKAPDYSVKDEAKLKEVCRRVGIEVEGKSVL
ELAQEVGEKALEDFRRLKGEGEATWLMTTINEGRKEKFRTHNVVPFGIHASISELVNQAHMGMDNDPVNLVFSAIRVALA
DYTGEHIATDFSDILFGTPQPVVSEANMGVLDPDQVNFVLHGHNPLLSEIIVQAAREMEGEAKAAGAKGINLVGICCTGN
EVLMRQGIPLVTSFASQELAICTGAIDAMCVDVQCIMPSISAVAECYHTRIITTADNAKIPGAYHIDYQTATAIESAKTA
IRMAIEAFKERKESNRPVYIPQIKNRVVAGWSLEALTKLLATQNAQNPIRVLNQAILDGELAGVALICGCNNLKGFQDNS
HLTVMKELLKNNVFVVATGCSAQAAGKLGLLDPANVETYCGDGLKGFLKRLGEGANIEIGLPPVFHMGSCVDNSRAVDLL
MAMANDLGVDTPKVPFVASAPEAMSGKAAAIGTWWVSLGVPTHVGTMPPVEGSDLIYSILTQIASDVYGGYFIFEMDPQV
AARKILDALEYRTWKLGVHKEVAERYETKLCQGY
>P19919 1.2.5.3~~~coxL~~~Carbon monoxide dehydrogenase large chain~~~
MNIQTTVEPTSAERAEKLQGMGCKRKRVEDIRFTQGKGNYVDDVKLPGMLFGDFVRSSHAHARIKSIDTSKAKALPGVFA
VLTAADLKPLNLHYMPTLAGDVQAVLADEKVLFQNQEVAFVVAKDRYVAADAIELVEVDYEPLPVLVDPFKAMEPDAPLL
REDIKDKMTGAHGARKHHNHIFRWEIGDKEGTDATFAKAEVVSKDMFTYHRVHPSPLETCQCVASMDKIKGELTLWGTFQ
APHVIRTVVSLISGLPEHKIHVIAPDIGGGFGNKVGAYSGYVCAVVASIVLGVPVKWVEDRMENLSTTSFARDYHMTTEL
AATKDGKILAMRCHVLADHGAFDACADPSKWPAGFMNICTGSYDMPVAHLAVDGVYTNKASGGVAYRCSFRVTEAVYAIE
RAIETLAQRLEMDSADLRIKNFIQPEQFPYMAPLGWEYDSGNYPLAMKKAMDTVGYHQLRAEQKAKQEAFKRGETREIMG
IGISFFTEIVGAGPSKNCDILGVSMFDSAEIRIHPTGSVIARMGTKSQGQGHETTYAQIIATELGIPADDIMIEEGNTDT
APYGLGTYGSRSTPTAGAATAVAARKIKAKAQMIAAHMLEVHEGDLEWDVDRFRVKGLPEKFKTMKELAWASYNSPPPNL
EPGLEAVNYYDPPNMTYPFGAYFCIMDIDVDTGVAKTRRFYALDDCGTRINPMIIEGQVHGGLTEAFAVAMGQEIRYDEQ
GNVLGASFMDFFLPTAVETPKWETDYTVTPSPHHPIGAKGVGESPHVGGVPCFSNAVNDAYAFLNAGHIQMPHDAWRLWK
VGEQLGLHV
>P19913 1.2.5.3~~~cutL~~~Carbon monoxide dehydrogenase large chain~~~
MNAPVQDAEARELALAGMRPRACAKEDARFIQGKGNYVDDIKMPGMLHMDIVRAPIAHGRIKKIHKDAALAMPGVHAVLT
AEDLKPLKLHWMPTLAGDVAAVLADEKVHFQMQEVAIVIADDRYIAADAVEAVKVEYDELPVVIDPIDALKPDAPVLRED
LAGKTSGAHGPREHHNHIFTWGAGDKAATDAVFANAPVTVSQHMYYPRVHPCPLETCGCVASFDPIKGDLTTYITSQAPH
VVRTVVSMLSGIPESKVRIVSPDIGGGFGNKVGIYPGYVCAIVASIVLGRPVKWVEDRVENISTTAFARDYHMDGELAAT
PDGKILGLRVNVVADHGAFDACADPTKFPAGLFHICSGSYDIPRAHCSVKGVYTNKAPGGVAYRCSFRVTEAVYLIERMV
DVLAQKLNMDKAEIRAKNFIRKEQFPYTTQFGFEYDSGDYHTALKKVLDAVDYPAWRAEQAARRADPNSPTLMGIGLVTF
TEVVGAGPSKMCDILGVGMFDSCEIRIHPTGSAIARMGTITQGQGHQTTYAQIIATELGIPSEVIQVEEGDTSTAPYGLG
TYGSRSTPVAGAAIALAARKIHAKARKIAAHMLEVNENDLDWEVDRFKVKGDDSKFKTMADIAWQAYHQPPAGLEPGLEA
VHYYDPPNFTYPFGIYLCVVDIDRATGETKVRRFYALDDCGTRINPMIIEGQIHGGLTEGYAVAMGQQMPFDAQGNLLGN
TLMDYFLPTAVETPHWETDHTVTPSPHHPIGAKGVAESPHVGSIPTFTAAVVDAFAHVGVTHLDMPHTSYRVWKSLKEHN
LAL
>P19920 1.2.5.3~~~coxM~~~Carbon monoxide dehydrogenase medium chain~~~
MIPGSFDYHRPKSIADAVALLTKLGEDARPLAGGHSLIPIMKTRLATPEHLVDLRDIGDLVGIREEGTDVVIGAMTTQHA
LIGSDFLAAKLPIIRETSLLIADPQIRYMGTIGGNAANGDPGNDMPALMQCLGAAYELTGPEGARIVAARDYYQGAYFTA
IEPGELLTAIRIPVPPTGHGYAYEKLKRKIGDYATAAAAVVLTMSGGKCVTASIGLTNVANTPLWAEEAGKVLVGTALDK
PALDKAVALAEAITAPASDGRGPAEYRTKMAGVMLRRAVERAKARAKN
>P19914 1.2.5.3~~~cutM~~~Carbon monoxide dehydrogenase medium chain~~~
MIPPRFEYHAPKSVGEAVALLGQLGSDAKLLAGGHSLLPMMKLRFAQPEHLIDINRIPELRGIREEGSTVVIGAMTVEND
LISSPIVQARLPLLAEAAKLIADPQVRNRGTIGGDIAHGHPGNDHPALSIAVEAHFVLEGPNGRRTVPADGFFLGTYMTL
LEENEVMVEIRVPAFAQGTGWAYEKLKRKTGDWATAGCAVVMRKSGNTVSHIRIALTNVAPTALRREGGRSRLLGKAFTK
EAVQAAADAAIAICEPAEDLRGDADYKTAMAGQMVKRALNAAWARCA
>P19921 1.2.5.3~~~coxS~~~Carbon monoxide dehydrogenase small chain~~~
MAKAHIELTINGHPVEALVEPRTLLIHFIREQQNLTGAHIGCDTSHCGACTVDLDGMSVKSCTMFAVQANGASITTIEGM
AAPDGTLSALQEGFRMMHGLQCGYCTPGMIMRSHRLLQENPSPTEAEIRFGIGGNLCRCTGYQNIVKAIQYAAAKINGVP
FEEAAE
>P19915 1.2.5.3~~~cutS~~~Carbon monoxide dehydrogenase small chain~~~
MAKKIITVNVNGKAQEKAVEPRTLLIHFLREELNLTGAHIGCETSHCGACTVDIDGRSVKSCTHLAVQCDGSEVLTVEGL
ANKGVLHAVREGFYKEHGLQCGFCTPGMLMRAYRFLQENPNPTEAEIRMGMTGNLCRCTGYQNIVKAVQYAARKLQEPST
AAA
>P0AED9 2.1.1.37~~~dcm~~~DNA-cytosine methyltransferase~~~COG0270
MQENISVTDSYSTGNAAQAMLEKLLQIYDVKTLVAQLNGVGENHWSAAILKRALANDSAWHRLSEKEFAHLQTLLPKPPA
HHPHYAFRFIDLFAGIGGIRRGFESIGGQCVFTSEWNKHAVRTYKANHYCDPATHHFNEDIRDITLSHKEGVSDEAAAEH
IRQHIPEHDVLLAGFPCQPFSLAGVSKKNSLGRAHGFACDTQGTLFFDVVRIIDARRPAMFVLENVKNLKSHDQGKTFRI
IMQTLDELGYDVADAEDNGPDDPKIIDGKHFLPQHRERIVLVGFRRDLNLKADFTLRDISECFPAQRVTLAQLLDPMVEA
KYILTPVLWKYLYRYAKKHQARGNGFGYGMVYPNNPQSVTRTLSARYYKDGAEILIDRGWDMATGEKDFDDPLNQQHRPR
RLTPRECARLMGFEAPGEAKFRIPVSDTQAYRQFGNSVVVPVFAAVAKLLEPKIKQAVALRQQEAQHGRRSR
>P13187 7.2.4.2~~~oadA~~~Oxaloacetate decarboxylase alpha chain~~~
MTVAITDVVLRDAHQSLFATRLRLDDMLPVAAQLDDVGYRSLECWGGATFDACIRFLGEDPWVRLRELKKAMPKTPLQML
LRGQNLLGYRHYADDVVERFVERAVKNGMDVFRVFDAMNDPRNMQAALQAVRRHGAHAQGTLSYTTSPAHTLQTWLDLTE
QLLETGVDSVAIKDMSGILTPHAAFELVSEIKKRYDVTLHLHCHATTGMAEMALLKAIEAGVDGVDTAISSMSATYGHPA
TEALVATLAGTPYDTGLDIHKLESIAAYFREVRKKYHAFEGQLKGTDSRILVAQVPGGMLTNLEGQLKQQSAAHRLDEVL
AEIPRVREDLGFIPLVTPTSQIVGTQAVLNVLTGERYKTIAKETAGILKGEYGRTPAPVNAALQARVLDGADPVTCRPAD
LLKPELAQLEADVRRQAQEKGITLAENAIDDVLTVALFPQPGLKFLENRHNPAAFEPVPQAEAAQPVAKAEKPAASGVYT
VEVEGKAFVVKVSDGGDVSQLTAAAPAPAPAPAPASAPAAAAPAGAGTPVTAPLAGTIWKVLASEGQTVAAGEVLLILEA
MKMETEIRAAQAGTVRGIAVKAGDAVAVGDTLMTLA
>P21169 4.1.1.17~~~speC~~~Constitutive ornithine decarboxylase~~~COG1982
MKSMNIAASSELVSRLSSHRRVVALGDTDFTDVAAVVITAADSRSGILALLKRTGFHLPVFLYSEHAVELPAGVTAVING
NEQQWLELESAACQYEENLLPPFYDTLTQYVEMGNSTFACPGHQHGAFFKKHPAGRHFYDFFGENVFRADMCNADVKLGD
LLIHEGSAKDAQKFAAKVFHADKTYFVLNGTSAANKVVTNALLTRGDLVLFDRNNHKSNHHGALIQAGATPVYLEASRNP
FGFIGGIDAHCFNEEYLRQQIRDVAPEKADLPRPYRLAIIQLGTYDGTVYNARQVIDTVGHLCDYILFDSAWVGYEQFIP
MMADSSPLLLELNENDPGIFVTQSVHKQQAGFSQTSQIHKKDNHIRGQARFCPHKRLNNAFMLHASTSPFYPLFAALDVN
AKIHEGESGRRLWAECVEIGIEARKAILARCKLFRPFIPPVVDGKLWQDYPTSVLASDRRFFSFEPGAKWHGFEGYAADQ
YFVDPCKLLLTTPGIDAETGEYSDFGVPATILAHYLRENGIVPEKCDLNSILFLLTPAESHEKLAQLVAMLAQFEQHIED
DSPLVEVLPSVYNKYPVRYRDYTLRQLCQEMHDLYVSFDVKDLQKAMFRQQSFPSVVMNPQDAHSAYIRGDVELVRIRDA
EGRIAAEGALPYPPGVLCVVPGEVWGGAVQRYFLALEEGVNLLPGFSPELQGVYSETDADGVKRLYGYVLK
>P43099 4.1.1.17~~~odcI~~~Inducible ornithine decarboxylase~~~
MSSSLKIASTQEARQYFDTDRVVVDAVGSDFTDVGAVIAMDYETDVIDAADATKFGIPVFAVTKDAQAISADELKKIFHI
IDLENKFDATVNAREIETAVNNYEDSILPPFFKSLKEYVSRGLIQFDCPGHQGGQYYRKHPAGREFYDFFGETVFRADLC
NADVALGDLLIHEGPAVAAEKHAARVYNADKTYFVLGGSSNANNTVTSALVSNGDLVLFDRNNHKSVYNSALAMAGGRPV
YLQTNRNPYGFIGGIYDSDFDEKKIRELAAKVDPERAKWKRPFRLAVIQLGTYDGTIYNAHEVVKRIGHLCDYIEFDSAW
VGYEQFIPMMRNSSPLLIDDLGPEDPGIIVVQSVHKQQAGFSQTSQIHKKDSHIKGQLRYCDHKHFNNSFNLFMSTSPFY
PMYAALDVNAAMQEGEAGRKLWHDLLITTIEARKKLIKAGSMFRPFVPPVVNGKKWEDGDTEDMANNIDYWRFEKGAKWH
AYEGYGDNQYYVDPNKFMLTTPGINPETGDYEDFGVPATIVANYLRDHGIIPEKSDLNSILFLMTPAETPAKMNNLITQL
LQLQRLIEEDAPLKQVLPSIYAANEERYNGYTIRELCQELHDFYKNNNTFTYQKRLFLREFFPEQGMLPYEARQEFIRNH
NKLVPLNKIEGEIALEGALPYPPGVFCVAPGEKWSETAVKYFTILQDGINNFPGFAPEIQGVYFKQEGDKVVAYGEVYDA
EVAKNDDRYNN
>P24169 4.1.1.17~~~speF~~~Inducible ornithine decarboxylase~~~COG1982
MSKLKIAVSDSCPDCFTTQRECIYINESRNIDVAAIVLSLNDVTCGKLDEIDATGYGIPVFIATENQERVPAEYLPRISG
VFENCESRREFYGRQLETAASHYETQLRPPFFRALVDYVNQGNSAFDCPGHQGGEFFRRHPAGNQFVEYFGEALFRADLC
NADVAMGDLLIHEGAPCIAQQHAAKVFNADKTYFVLNGTSSSNKVVLNALLTPGDLVLFDRNNHKSNHHGALLQAGATPV
YLETARNPYGFIGGIDAHCFEESYLRELIAEVAPQRAKEARPFRLAVIQLGTYDGTIYNARQVVDKIGHLCDYILFDSAW
VGYEQFIPMMADCSPLLLDLNENDPGILVTQSVHKQQAGFSQTSQIHKKDSHIKGQQRYVPHKRMNNAFMMHASTSPFYP
LFAALNINAKMHEGVSGRNMWMDCVVNGINARKLILDNCQHIRPFVPELVDGKPWQSYETAQIAVDLRFFQFVPGEHWHS
FEGYAENQYFVDPCKLLLTTPGIDARNGEYEAFGVPATILANFLRENGVVPEKCDLNSILFLLTPAEDMAKLQQLVALLV
RFEKLLESDAPLAEVLPSIYKQHEERYAGYTLRQLCQEMHDLYARHNVKQLQKEMFRKEHFPRVSMNPQEANYAYLRGEV
ELVRLPDAEGRIAAEGALPYPPGVLCVVPGEIWGGAVLRYFSALEEGINLLPGFAPELQGVYIEEHDGRKQVWCYVIKPR
DAQSTLLKGEKL
>P24171 3.4.15.5~~~dcp~~~Dipeptidyl carboxypeptidase~~~COG0339
MTTMNPFLVQSTLPYLAPHFDQIANHHYRPAFDEGMQQKRAEIAAIALNPQMPDFNNTILALEQSGELLTRVTSVFFAMT
AAHTNDELQRLDEQFSAELAELANDIYLNGELFARVDAVWQRRESLGLDSESIRLVEVIHQRFVLAGAKLAQADKAKLKV
LNTEAATLTSQFNQRLLAANKSGGLVVNDIAQLAGMSEQEIALAAEAAREKGLDNKWLIPLLNTTQQPALAEMRDRATRE
KLFIAGWTRAEKNDANDTRAIIQRLVEIRAQQATLLGFPHYAAWKIADQMAKTPEAALNFMREIVPAARQRASDELASIQ
AVIDKQQGGFSAQPWDWAFYAEQVRREKFDLDEAQLKPYFELNTVLNEGVFWTANQLFGIKFVERFDIPVYHPDVRVWEI
FDHNGVGLALFYGDFFARDSKSGGAWMGNFVEQSTLNKTHPVIYNVCNYQKPAAGEPALLLWDDVITLFHEFGHTLHGLF
ARQRYATLSGTNTPRDFVEFPSQINEHWATHPQVFARYARHYQSGAAMPDELQQKMRNASLFNKGYEMSELLSAALLDMR
WHCLEENEAMQDVDDFELRALVAENMDLPAIPPRYRSSYFAHIFGGGYAAGYYAYLWTQMLADDGYQWFVEQGGLTRENG
LRFREAILSRGNSEDLERLYRQWRGKAPKIMPMLQHRGLNI
>P27236 3.4.15.5~~~dcp~~~Dipeptidyl carboxypeptidase~~~
MSTNPLLDQSMLPYQAPRFDRIKDCHYRPAFDEGVRQKRVEIEAIVNHPAAPDFTNTLLALEQSGALLSRVTSVFFAMTA
AHTNDELQRLDEAFSAELAALSNDIYLNSALFARVDAVWQQRHSLGLDDESLRLVDVIHQRFVLAGAQLAEEDKARLKVL
NTESATLMSQFNQRLLAASKAGGLAVDDAHCLAGLSPEEMTVAAEAAREKGLEERWFIPLLNTTQQPALATLRDRQTREN
LFAASWTRAEKGDAHDTRAIVQRLVEIRRCQAKLLGFPNYAAWKMADQMAKTPQAALSFMRGIVPPARQRVLNEQAEIQN
VIDGEQGGYTVQAWDWMFYAEQVRREKYALDEAQLKPYFALNTVLQEGVFWTANQLFGITFVERFDIPVYHPDVRVWEIF
DSDGVGMALFYGDFFARDSKSGGAWMGNFVEQSTLNETRPVIYNVCNYQKPVDGQPALLLWDDVITLFHEFGHTLHGLFA
VQRYATLSGTNTPRDFVEFPSQINEHWASHPRVFERYARHVDSGEKMPADLQERMRKASLFNKGYDMTELLGAALLDMRW
HMLEESVAEQSVAEFEQQALAAEHLDLPAVPPRYRSSYFAHIFGGGYAAGYYAYLWTQMLADDGYQWFVEQGGLTRENGQ
RFRDAILARGNSTDLETLYSAWRGHEPHIDPMLQYRGLDR
>P0AEE1 ~~~dcrB~~~Inner membrane lipoprotein DcrB~~~
MRNLVKYVGIGLLVMGLAACDDKDTNATAQGSVAESNATGNPVNLLDGKLSFSLPADMTDQSGKLGTQANNMHVWSDATG
QKAVIVIMGDDPKEDLAVLAKRLEDQQRSRDPQLQVVTNKAIELKGHKMQQLDSIISAKGQTAYSSVILGNVGNQLLTMQ
ITLPADDQQKAQTTAENIINTLVIQ
>D2Z025 3.5.3.25~~~dcsB~~~N(omega)-hydroxy-L-arginine amidinohydrolase~~~
MIDLIVSQGRVADRAAWMIEGAARTARALEERYGLKGHYVGEPAPHADDDWSVALPQARETLVAVREAATESIKGDNLTV
LVNNTCSVSLATLPVVAREHPDAVVLYIDGHGDFNTPETTDTGYLGGMVLSGACGLWDSGHGAGLRPEQAVLVGSRDIDE
GERELIRKAGVRVIPPGEATAQAVLDAVKDAPVWIHIDWDVLEPGSIPADYTVPDGMLPAQIRAVFEAIPAERLIGVELA
ELNAPADSERAEQAVAVILDMVAPAFDAAAARP
>D2Z026 5.1.1.19~~~dcsC~~~O-ureido-serine racemase~~~
MIRMRTPSTLPFTKMHGAGNDFVVLDLRDGPDPSPELCRALADRHKGVGCDLVLGIREPRSARAVAAFDIWTADGSRSAQ
CGNGARCVAAWAVRAGLARGPRFALDSPSGTHEVDVLDADTFRVALAVPRFAPESIPLFGHDGEQDLYEADLGDGTRVRF
AAVSMGNPHAVIEVDDTATAPVARVGRAVQASGLFLPTVNVGFARVESRDRVHLRVHEYGAGETLACGSGACAAAAVLMR
RGRVDRNVSVVLPGGELRISWPDDAADVLMTGPAAFVYEGTFLHASV
>D2Z027 2.6.99.3~~~dcsD~~~O-ureido-L-serine synthase~~~
MPLFNSILDTIGRTPIVRLQRMAPEHTSVYVKVESFNPGGSVKDRLALSVVLDAEAKGLLKPGDTIVECTSGNVGIALAM
VAAARGYRFVAVMGDTYSVERRKLIRAYGGKLVLFPGHLGSKGGNLIADELAEKYGWFRARQFDNPANPSYHRETTASEI
LADFAGKRLDHFVTGFGTTGTLTGVGQMLRVARPEVRVVALEPSNAAMLARGEWSPHQIQGLAPNFVPGVLDRSVIDDLV
TMDEVTARDTSRRLAAEEGIFAGISAGATVATALSIAEHAPEGTVLLAMLPDTGERYLSTFLFDGVDEGSDDAWLASLDT
GSGL
>D2Z028 2.3.1.30~~~dcsE~~~L-serine/homoserine O-acetyltransferase~~~
MREFIPPASRFIELPDGFAMRRGGALYGARIAYETFGSLNAARDNAVLVLTGLSPDAHAASRPDDPTPGWWEAMVGPGKP
VDTDLWHVICVNSLGSCKGSTGPASTDPRTGEPYRLSFPELSIEDIADAAAHTVRALGISRLACVVGASMGGMSALALLA
RHPELARTHISLSGAVHALPFSIAVRSLQREAIRSDPGWLQGHYDEGEGPRRGMLTARKLGMMTYRSAQEWDCRFGRTRI
GERRRADQGRFGPEFEVESYLDFHAQRFADRFDPNSYLYLSHAMDQFDLGDGGGGGGGAPGALSRMRVERALVMGARTDI
LFPLSQQQEIADGLSAGGADVSFLPVDTPAGHDAFLVDIERFGPPVAKFLAIVA
>D2Z030 6.3.3.5~~~dcsG~~~Cycloserine biosynthesis protein DcsG~~~
MGILALVTDAVSLPIDYDMPPLLEACRTVGITAEVCDWEDGTVDWSRFEAVVFRSPWTWAERQAEFLAFCERVSHVTRLI
TPMPLVRWALDKRYLADLAAHGVPVIPTTVVAPGSDALAAVRDFLAARPEAREFVVKPTDGCYSKDVQRYQRSLAEPASR
HVARLLANGSHVILQPYVESVDRHGETDLTFFDGVYSHAIHKGAMLMPDGTVHVPTLDFRQARDADEDQRAVAAAALAAS
VAHLGLDLPLVCGRVDLVRGADGSPMVLEMELCEPSLNLTFSEDGALRFAQALAERLKP
>Q9I4F5 ~~~dctA2~~~C4-dicarboxylate transport protein 2~~~
MTKQPFYKSLYVQVLVAIAIGIALGHWYPETAVAMKPFGDGFVKLIKMAIAPIIFCTVVTGIAGMQSMKSVGKTGGMALL
YFEVVSTVALIIGLVVVNVVQPGAGMHVDPNTLDTSKIAAYAAAGEKQSTVDFLMNVIPGTVVGAFANGDILQVLFFSVL
FGYALHRLGSYGKPVFEFIERVSHVMFNIINVIMKVAPIGAFGAMAFTIGAYGVGSLVQLGQLMLCFYITCILFVLIVLG
GIARAHGFSILRFIRYIREELLIVLGTSSSESALPRMIDKMEKLGCNKSVVGLVIPTGYSFNLDGTSIYLTMAAVFIAQA
TDTPMDITHQITLLLVLLIASKGAAGVTGSGFIVLAATLSAVGHLPVAGLALILGIDRFMSEARALTNLVGNGVATVVVS
KWCKQLDEGTLQRELAGEGNASSPASDIPVGGREAV
>P96603 ~~~dctA~~~C4-dicarboxylate transport protein~~~COG1301
MKLFKNLTVQVITAVIIGVIVGLVWPDVGKEMKPLGDTFINAVKMVIAPIIFFTIVLGIAKMGDMKKVGKVGGKAFIYFE
VVTTLALIIGLFVVNIMKPGAGLDYSKLEKGDVSQYTQNGGQGIDWIEFITHIVPSNMVDAFAKGDILQVLFFSILFGVG
LAALGEKGKSVIDFFDKVSHVFFKIIGYIMRAAPIGAFGAMAYTIGHFGLDSIKPLASLMMSVYITMFLFVFVALNIICK
LYGFSLWNYLRFIKDELLIVLGTSSSESVLPRMMDKMERYGCSKSVVGLVIPTGYSFNLDGTSIYLSMATVFLAQVFGVD
LSIGQQITIILVLMLTSKGAAGVTGSGFIVLASTLSALQVIPLEGLALLLGVDRFMSEGRAIVNLIGNGIATIIVAKSEN
EFDEAKSIEAVEGMKKMKTAV
>P0A830 ~~~dctA~~~Aerobic C4-dicarboxylate transport protein~~~COG1301
MKTSLFKSLYFQVLTAIAIGILLGHFYPEIGEQMKPLGDGFVKLIKMIIAPVIFCTVVTGIAGMESMKAVGRTGAVALLY
FEIVSTIALIIGLIIVNVVQPGAGMNVDPATLDAKAVAVYADQAKDQGIVAFIMDVIPASVIGAFASGNILQVLLFAVLF
GFALHRLGSKGQLIFNVIESFSQVIFGIINMIMRLAPIGAFGAMAFTIGKYGVGTLVQLGQLIICFYITCILFVVLVLGS
IAKATGFSIFKFIRYIREELLIVLGTSSSESALPRMLDKMEKLGCRKSVVGLVIPTGYSFNLDGTSIYLTMAAVFIAQAT
NSQMDIVHQITLLIVLLLSSKGAAGVTGSGFIVLAATLSAVGHLPVAGLALILGIDRFMSEARALTNLVGNGVATIVVAK
WVKELDHKKLDDVLNNRAPDGKTHELSS
>P20672 ~~~dctA~~~C4-dicarboxylate transport protein~~~COG1301
MIIEHSAEVRGKTPLYRHLYVQVLAAIAAGILLGHFYPDIGTELKPLGDAFIRLVKMIIAPVIFLTVATGIAGMTDLAKV
GRVAGKAMIYFLAFSTLALVVGLVVANVVQPGAGMHIDPASLDAKAVATYAEKAHEQSITGFLMNIIPTTLVGAFAEGDI
LQVLFISVLFGISLAIVGKKAEPVVDFLQALTLPIFRLVAILMKAAPIGAFGAMAFTIGKYGIASIANLAMLIGTFYLTS
FLFVFIVLGAVARYNGFSILSLIRYIKEELLLVLGTSSSEAALPGLMNKMEKAGCKRSVVGLVIPTGYSFNLDGTNIYMT
LAALFIAQATDTPLSYGDQILLLLVAMLSSKGAAGITGAGFITLAATLSVVPSVPVAGMALILGIDRFMSECRALTNFVG
NAVATIVVAKWEGELDQAQLSAALGGEASVEAIPAVVQPAE
>P31601 ~~~dctA1~~~C4-dicarboxylate transport protein~~~COG1301
MHQVEEIILIVENLAEVRGKTPHYRHLYVQVLAAIAVGILLGYFYPDVGSKMKPLGDAFIMLVKMIIAPVIFLTVATGIA
GMTDLAKVGRVAGKAMIYFLTFSTLALLVGLVVANVVQPGAGMHIDPASLDAKAIATYAEKAHEQSVTGFLMNIIPTTLV
GAFAEGDILQVLFISVLFGISLAIVGKKAEAVVDFLHALTLPIFRLVAILMKAAPIGAFGAMAFTIGKYGVASIANLAML
IGTFYLTSFLFVFMVLGAVARYNGFSIVALIRYIKEELLLVLGTSSSEAALPGLMNKMEKAGCKRSVVGLVIPTGYSFNL
DGTNIYMTLAALFIAQATDTPISYGDQILLLLIAMLSSKGAAGITGAGFITLAATLSAVPSVPVAGMALILGIDRFMSEC
RAITNIIGNAVATIVVAKWEGELAPAQLATTLAGKAPVETMSGLSSQRSDTVELGQKVLFGATNSADRTLAGRPGGRDSR
RIAPDHSAQVFGGPLSL
>P13633 2.7.13.3~~~dctB~~~C4-dicarboxylate transport sensor protein DctB~~~COG4191
MHHVRMVKLPAEASDPHALRSRARRSWLVFAAVALVLLAAGLLLARDYGRSQALAGLAGQSRIDASLKASLLRAVVERQR
ALPLVLADDAAIRGALLSPDRPSLDRINRKLEALATSAEAAVIYLIDRSGVAVAASNWQEPTSFVGNDYAFRDYFRLAVR
DGMAEHFAMGTVSNRPGLYISRRVDGPGGPLGVIVAKLEFDGVEADWQASGKPAYVTDRRGIVLITSLPSWRFMTTKPIA
EDRLAPIRESLQFGDAPLLPLPFRKIEARPDGSSTLDALLPGDSTAAFLRVETMVPSTNWRLEQLSPLKAPLAAGAREAQ
LLTLAALVPLLALAALLLRRRQVVAMRSAEERLARNALEASVEERTRDLRMARDRLETEIADHRQTTEKLQAVQQDLVQA
NRLAILGQVAAGVAHEINQPVATIRAYADNARTFLHRGQTVTAAENMESIAELTERVGAITDELRRFARKGHFAAGPTAM
KEVVEGALMLLRSRFAGRMDAIRLDLPPDGLQALGNRIRLEQVLINLLQNALEAIGDSEDGAIQVRCEEAAGGIALTVAD
NGPGIAADVREELFTPFNTSKEDGLGLGLAISKEIVSDYGGTIEVESGPSGTTFAVNLKKA
>P13632 ~~~dctD~~~C4-dicarboxylate transport transcriptional regulatory protein DctD~~~COG2204
MSAAPSVFLIDDDRDLRKAMQQTLELAGFTVSSFASATEALAGLSADFAGIVISDIRMPGMDGLALFRKILALDPDLPMI
LVTGHGDIPMAVQAIQDGAYDFIAKPFAADRLVQSARRAEEKRRLVMENRSLRRAAEAASEGLPLIGQTPVMERLRQTLK
HIADTDVDVLVAGETGSGKEVVATLLHQWSRRRTGNFVALNCGALPETVIESELFGHEPGAFTGAVKKRIGRIEHASGGT
LFLDEIEAMPPATQVKMLRVLEAREITPLGTNLTRPVDIRVVAAAKVDLGDPAARGDFREDLYYRLNVVTLSIPPLRERR
DDIPLLFSHFLARASERFGREVPAISAAMRAYLATHSWPGNVRELSHFAERVALGVEGNLGVPAAAPASSGATLPERLER
YEADILKQALTAHCGDVKETLQALGIPRKTFYDKLQRHGINRADYVERAGPGRPNAISKT
>Q312S1 ~~~dctMQ~~~Isethionate TRAP transporter permease protein DctMQ~~~COG1593
MSDPNVTATIMNAQGECSSGSLESRPGILGWLDANFEKPFLVAGMLAIIFIITFQTLYRYIGVYLHEGAAAAVWTEEMAR
FIFIWISYLAVPVAIKNRSSIRVDIIFDRLPVRFQNISWIIVDVCFLTLAATVLWQSLDLIKMQLTYPQTSPALQLPYYI
PYLVLPVSFGLMAVRLLQDLAGQVRICGAADTVIGLILCAVLAAPLFIADYIDPLPVLFGYFALFLVVGVPIAIGLGLAA
LATIVAAGSLPIDYVAQIAFTSIDSFPIMAIPFFIAAGVFMGAGGLSRRLLNLADEMLGALPGGMALATIGTCMFFAAIS
GSGPATVAAIGSLTIPAMVERGYCKYFSAAIVAAAGAIGVMIPPSNPFVVYGVSAQASIGKLFMGGIVPGLLTGLALMAY
SYWYSKKRGWKGEVRDRNLKTFMHAVWEAKWALMVPVIVLGGIYGGIMTPTEAAALAAFYGLIIGCFVHRELSCGSFYDC
VVEAAGTSAMVIVLMSMATIFGNIMTIEEVPTTIAQAMLGLTTDKIAILLMINVLLLIIGTFMEALAAIVILTPILLPIV
LKVGVDPVHFGIIMVVNLAIGFVTPPVGVNLFVASGVANAKIEQLSKVVLPLIALMLAVLLITTYVPAIPMFFAG
>Q9HU16 ~~~dctM~~~C4-dicarboxylate TRAP transporter large permease protein DctM~~~
MTILFLFLLLFLLMFIGVPIAVSLGLSGALTILLFSPDSVRSLAIKLFETSEHYTLLAIPFFLLSGAFMTTGGVARRLID
FANACVGHIRGGLAIAAVLACMLFAALSGSSPATVAAVGSIAIAGMVRSGYPQAFGAGIVCNAGTLGILIPPSIVMVVYA
AATETSVGKLFIAGVVPGLLLGLILMVVIYIVARVKKLPAMPRVSLREWLASARKALWGLLLMVIILGGIYSGAFTPTEA
AAVAAVYSAFVALFVYRDMRLSECPKVLLESGKLTIMLMFIIANAMLFAHVLTTEQIPQSIASWVTELGLSPWMFLLVVN
IVLLIAGNFMEPSAIILILAPIFFPIAMELGIDPIHLGIIMVVNMEIGLITPPVGLNLFVTSAVTGMPLGATIRAALPWL
MILLVFLIIVTYIPAVSLALPNWLGMS
>O07838 ~~~dctM~~~C4-dicarboxylate TRAP transporter large permease protein DctM~~~
MSALIIFGLLIALMLTGMPISISLGLTVLTFLFTMTQVPIDTVALKLFTGIEKFEIMAIPFFILAGNFLTHGGVAKRMIN
FATAMVGHWHGGLGLAGVIACALFAAVSGSSPATVVAIGSVILPAMVNQGFPKQFGAGVITTSGALGILIPPSIVMVMYA
VATSGMVVTGPDGQPVSSASVGELFMAGVVPGLMLAGFLAFTTWNRARKFGYPRLEKASLRQRWTAFREAAWGLMLIVVV
IGGIYAGIFTPTEAAAMSAVYAFFISVFVYKDLTLRDVPRVLLSSANMSAMLLYIITNAVLFSFLMAHEGIPQALGEWMV
NAGLSWWMFLIIVNILLLAAGNFMEPSSIVLIMAPILFPVAVRLGIDPVHFGIMIVVNMEVGMCHPPVGLNLYVASGITK
MGITELTVAVWPWLLTMLAFLVLVTYVPAISLALPNLLGM
>Q128M1 ~~~~~~Solute-binding protein Bpro_3107~~~COG1638
MTNRTPRISAIRSAALAALLAGLGMGAAQATEFRSADTHNADDYPTVAAVKYMGELLEKKSGGKHKIKVFNKQALGSEKE
TIDQVKIGALDFTRVNVGPMNAICPLTQVPTMPFLFSSIAHMRKSLDGPVGDEILKSCESAGFIGLAFYDSGARSIYAKK
PIRTVADAKGLKIRVQQSDLWVALVSAMGANATPMPYGEVYTGLKTGLIDAAENNIPSFDTAKHVEAVKVYSKTEHSMAP
EILVMSKIIYDKLPKAEQDMIRAAAKESVAFERQKWDEQEAKSLANVKAAGAEIVEVDKKSFQAVMGPVYDKFMTTPDMK
RLVKAVQDTK
>Q315G1 ~~~~~~Solute-binding protein Dde_0634~~~COG1638
MKSTFAALLIMVGCLVSGALLTGSEAAAAQPVTLNYANFPPASTFPCIQMEQWAHEVRTRTRGKVDVLTYPGGTLLGARN
MLRGVMSGQADIGCISLAYHPGVFPVMSVFELPLGFTSAEAASSVLWELYSGLRPAELERVKVLTMFTSAPSHFMTVTPV
RSLRDLQGMEIRGAGTLSAILEKLGATPVSMPMPEVPEAVQKGIIKGLFTSLDVMKDMNFAEMTGHVTRADQAVYPFAVI
MNREAWERLSPDVQQVLDGLAAEHAAWTGRYLDAHVQDSMRWAEEKHGVQVHTLPEEDIAAMRRSVQPLFDAWAQRAADK
GADPDAVMRTVDALKAQYGG
>Q122C7 ~~~~~~Solute-binding protein Bpro_4736~~~COG1638
MKTRTLKVLKPTLALLLAASFSAGALAQEVTLRLVSAFPENGIYVQRLLPWIAKVNAEGKGVLQINFLGGPKAIPTFEAG
NAVKTGVVDMAMNTGAFYTNVMPEADFLKLTQIPVAEQRKNGAFDAINKVWNEKGNTQYLARMVENQPFHIYTNKKIDKP
DLSGQKIRISPVYRDFFQALNANVVTTPPGEVYTALERGVVDGYGWPIGGIFDLNWQEKTKFRVDPGFYDAEVSLTMNLP
AYKKLTDAQRNYLQKQLLVLEAENTFWTRYGNVETARQETAGIQTIKFDAATSKAFREKAYEVGWAGAMKQSPEVAARFK
TLFSK
>Q21XD7 ~~~~~~Solute-binding protein Rfer_1840~~~COG1638
MQRRQLLQSMGGLAASTMPFSLAFAQTSALKISHQFPGGTIKEGDFRDRLVRNFAAEVEKRSKGAMKFEIYPGSSLMKTN
AQFSSMRKGALDMALIPLSYAGGEVPELNIGLMPGLVVSYEQAYSWKTKPVGIELTRVLQEKGIVLISWIWQAGGVASRG
KPVVEPEDAKGMKIRGGSREMDMILKDAGAAVVSLPSNEIYAAMQTGAMDAAMTSSTSFISFRLEEVAKALTTGRTGAYW
FMFEPLMMSKAIFDKLPKDQRDMLMTVGAEMEKFALEAAKKDDIDVAAVYQKAGAKVVDLSDGTIKKWQDIARKTAWKDY
GAKNEGCAKLLALAQQTL
>Q0B2F6 ~~~~~~Solute-binding protein Bamb_6123~~~COG1638
MTHRFPRSRTALAVALMAGFAMSAQARVFRSADVHGDSFPTNMAVKFMGDELSKLTGGKDSIKVFGNSALGSEKDTVDQV
RIGAIDMARVNGASFNEIVPESLIPSFPFLFRDVDHFRKAMYGPAGQKILDAFAAKGMIALTFYESGARSIYAKRPVRTP
ADMKGLKVRVQPSDLMVDEIRAMGGTPTPMPFAEVYTGLKTGLVDAAENNLPSYEETKHFEVAPDYSETQHAMTPEVLVF
SKKIWDTLSPQEQAAIRKAAADSVPYYQKLWTAREASAQQAVTKGGANILPAAQVDRAAFVKAMQPLWTKYEKTPQMKQI
VDEIEATK
>Q1QUN2 ~~~dctP~~~Solute-binding protein Csal_2479~~~COG1638
MQTNKRLKMASCVKAAAMLGMLLSVSISTTAQADSWRGWNIHPPSYPNGKALESFAKEVAEKTEGRVEPKVYHNAVLGDQ
PDAIEQTRSGALDFANFNMGPMGPIVPAANVLSLPFIFKSPDDMYRIMDGEIGERFADALAEKNLIVLSWFGSGARSLYN
TDHPVETPDDVEGLKVRVMNNDLYVQMIDEMGGNATPMAYGEVYQSLKTGVIDGAENNYPSYESSGHYEVANYYSLTEHL
ILPECLCVAKASWEELSEKDRQAIREAAEDAAKEQRALWEEGVQASKQKILDAGVKINEVDDKSAFQAKMQPIYDQFVQE
HPELESLVTDIQDAQS
>Q312S0 ~~~dctP~~~Isethionate-binding periplasmic protein DctP~~~COG1638
MKHLLKAGALVALACIVTLTAGAQAHAAKRINIRLAHPMAPGNNVTVGYEKFKELVAEKSNGRVRIQLFGNCMLGSDRVT
MEAAQRGTLEMASSSSPNMANFSKQWMVFDLPYITSPEHQQKLYKAIDDGELGKKLDEIAASIGLKPIMYSEYGYRNFVT
TKKPIKTADDLKNLKVRTTDSPIEVAVAAALGMAPTPISWGETYTALQQGTVDGEGNTFSLLNDAKHTEVLKYAIDSAHN
YSMHLLMMNKAYYDSLPANVQQILTEAGREALTYQRSITSELEKKAEDAFIEQGITVTRLSPEERAKLVERTRPVWDKFK
DDIPAELIKLVQETQQ
>Q9HU18 ~~~dctP~~~C4-dicarboxylate-binding periplasmic protein DctP~~~
MLKHTAKALVCALSLTVAGIVQAADPIVIKFSHVVAEHTPKGQGALLFKKLVEERLPGKVKVEVYPNSSLFGDGKEMEAL
LLGDVQIIAPSLAKFEQYTKKLQIFDLPFLFDNIQAVDRFQQSPQGKELLTSMQDKGITGLGYWHNGMKQLSANKPLREP
KDARGLKFRVQASKVLEEQFKAVRANPRKMSFAEVYQGLQTGVVNGTENPWSNIYSQKMHEVQKYITESDHGVLDYMVIT
NTKFWNGLPEDVRGVLAKTMDEVTVEVNKQAEALNQGDKQRIVEAKTSEIIELTPEQRAEWRKAMQPVWKKFEGEIGADL
IKAAEAANQAQ
>P37735 ~~~dctP~~~C4-dicarboxylate-binding periplasmic protein DctP~~~
MLTRRILGALVGATALSLALSVPALAEPIVIKFSHVVAPDTPKGKGAAKFEELAEKYTNGAVDVEVYPNSQLYKDKEELE
ALQLGAVQMLAPSLAKFGPLGVQDFEVFDLPYIFKDYEALHKVTQGEAGKMLLSKLEAKGITGLAFWDNGFKIMSANTPL
TMPDDFLGLKMRIQSSKVLEAEMNALGAVPQVMAFSEVYQALQTGVVDGTENPPSNMFTQKMNEVQKHATVSNHGYLGYA
VIVNKQFWDGLPADVRTGLEKAMAESTDYANGIAKEENEKALQAMKDAGTTEFHELTAEERAAWEEVLTPVHDEMAERIG
AETIAAVKAATAE
>Q16BC9 ~~~~~~Solute-binding protein RD1_1052~~~COG1638
MRLFTKIKGLAAVTCVAALASSAAFAQEMTLKLGHLANEQNAWHLAAVKFGEELSTLTDGRIAVEVFPNESLGKEIDLIN
GMQLGTVDMTITGESLQNWAPMAALLAVPYAYKSLEHMDEVASGEIGEQIKQQIIEKAQVRPIAFFARGPRNLTSQRPIT
SPADLDGMKMRVPNVPLFVDVWSALGASPTPMAFSEVFTSLQNGVIDGQENPLALIRSANFNEVQGYVNQTEHVRSWIYL
TIAESTWAKLSEDDQNAVMQAAATAQEYERGLLLESLAEDRGYLESKGMTFVEVDGAAFQAAAKDAVLANVSEEIRPIVE
SLFSE
>Q5LSJ5 ~~~dctP~~~Solute-binding protein SPO1773~~~COG1638
MTISFKGLARGVACAALVLAALPAAAKEFRLGLITPPPHTWTKAAEAFGAELSEKSGGAHSVSVFPARQLGNEAQMLQQL
QTGALDMAFMTVAEVSNRVPNMGAFYAPYLAGDINHAAAILRSDTARGMLAVLPQEAGVVGVGFGSAGMRQILSRGAVNS
AADLSGLKLRITPFDPILDFYNALGAAPTPMPLPAVYDALANGQVDAIDMDVELINVLKYHEHADTILISNHMMFPMVGL
ISARVYAGMSDADKAMISELMAKHVDSTLDVYMVKEPEWTDALTKVGKTFKRVDQSFFGDAIAQWETIWADKAPSLPELR
KTAADLQ
>A3QCW5 ~~~dctP~~~C4-dicarboxylate-binding periplasmic protein DctP~~~COG1638
MTRLNTCTFIKQIVKMTSIAALLGASLNSWAAPTEIKFSHVVAENTPKGQMALKFKQLVEERLPGEYQVNVFPNSQLFGD
NNELSALLLNDVQFVAPSLSKFERYTKKLQLFDLPFLFKDMDAVNRFQQSDAGQQLLNSMKRKGVVGLGYLHNGMKQFSA
SSPLVLPEDAQGKKFRIMASDVLAAQFQAVEAIPVKKPFSEVFTLLQTRAIDGQENTWSNIYSKKFYEVQSNITESNHGV
LDYMVVTSNTFWKSLPADKRKVIKASLDEAIAYGNEIAAAKVNKDKQAIIDSKRSEVTYLTPEQRAAWVNAMKPVWAQFE
DKIGKDLIDAAVASNE
>A3T0D1 ~~~~~~Solute-binding protein NAS141_03721~~~
MSFFTKTAQLVSGAAVAATLFTATAQAETVLRGASMFDEEHAFTKTLRKFEELVDEKYDGDVTFDLRLNGELGVESDYVT
FLNQGVAIDYTILAPSNMAKFAPSIPLMDMPFLFRDLDHWNAVLSSDVLAPLEDELLEKADIKIVGYTGGGTRNLLSKQP
VVTFDDLKGHKMRVMGAPIQAQIFQALTAAPSAIAYNEVYNAIQTGVIAGFENEAASIQNLKFYEVAPNLTLTRHSITVR
PIVMSGKTFNSLPADLQAVVLEAGEEAGAYGRELESREDGVKLQEMVDAGQLTVSEFENRDKMLEMVKPVQDAYAAEIGA
SDLLEAVRAK
>A1WPV4 ~~~~~~Solute-binding protein Veis_3954~~~COG1638
MPSTRPLPRPSSRSLRRLALGLGLAFGLGATAAAQTTMRINISTAQNSHQGVAIDTFAKEVEKRTGGRYKVQTFYNAALG
AERESVEAVQLGTHELTFSSSGPIPNFVPETKILDVPFLFRDKAHARAVLDGPIGQELLTRFDGKGFKALAWAENGFRHM
SNSKRAVKEPGDLKGLKMRTMENPVHIAAYKGFGIVTTPMAFSEVFTALQQGTVDGQENPLSVIISAKFDQVQKHLTLTG
HVYSPALFLMNKALFDKLPAADQQAFIDAARQGAKLNRARVDEDDAKGVADLRAKGMTVIDNIDKARFVAALAPVNAQFE
KQFGKAALEQIRSAQ
>Q9HU17 ~~~dctQ~~~C4-dicarboxylate TRAP transporter small permease protein DctQ~~~
MHALARVWARLEEGLIAFLLAAMTLVTFVYVVLNNLYTLLYDLADLWEGGNETLLAIGDGVLTLAQEMTWSNALTKALFA
WLIFLGIAYGVRTAGHLGVDVLVKLASRPVQRVLGVIACLACLGYAGLLCVASYDWVKTLFIAGIGAEDLDHFGIRQWHI
GLIVPVGFALVFIRFAEILVRILRNRQTGLGLADEAADALKLTEHEEPKA
>O07837 ~~~dctQ~~~C4-dicarboxylate TRAP transporter small permease protein DctQ~~~
MLRILDRAEEVLIAALIATATVLIFVSVTHRFTLGFVADFVGFFRGHGMTGAAAAAKSLYTTLRGINLVWAQELCIILFV
WMAKFGAAYGVRTGIHVGIDVLINRLDAPKRRFFILLGLGAGALFTGIIATLGANFVLHMYHASSTSPDLELPMWLVYLA
IPMGSSLMCFRFLQVAFGFARTGELPHHDHGHVDGVDTENEGIDAEGDVLLHSPLTPRDLVEKPKDN
>P37195 ~~~dctR~~~HTH-type transcriptional regulator DctR~~~COG2197
MFLIITRDTMFFTAMKNILSKGNVVHIQNEEEIDVMLHQNAFVIIDTLMNNVFHSNFLTQIERLKPVHVIIFSPFNIKRC
LGKVPVTFVPRTITIIDFVALINGSYCSVPEAAVSLSRKQHQVLSCIANQMTTEDILEKLKISLKTFYCHKHNIMMILNL
KRINELVRHQHIDYLV
>P0ABN5 ~~~dcuA~~~C4-dicarboxylate transporter DcuA~~~COG2704
MLVVELIIVLLAIFLGARLGGIGIGFAGGLGVLVLAAIGVKPGNIPFDVISIIMAVIAAISAMQVAGGLDYLVHQTEKLL
RRNPKYITILAPIVTYFLTIFAGTGNISLATLPVIAEVAKEQGVKPCRPLSTAVVSAQIAITASPISAAVVYMSSVMEGH
GISYLHLLSVVIPSTLLAVLVMSFLVTMLFNSKLSDDPIYRKRLEEGLVELRGEKQIEIKSGAKTSVWLFLLGVVGVVIY
AIINSPSMGLVEKPLMNTTNAILIIMLSVATLTTVICKVDTDNILNSSTFKAGMSACICILGVAWLGDTFVSNNIDWIKD
TAGEVIQGHPWLLAVIFFFASALLYSQAATAKALMPMALALNVSPLTAVASFAAVSGLFILPTYPTLVAAVQMDDTGTTR
IGKFVFNHPFFIPGTLGVALAVCFGFVLGSFML
>P0ABN9 ~~~dcuB~~~Anaerobic C4-dicarboxylate transporter DcuB~~~COG2704
MLFTIQLIIILICLFYGARKGGIALGLLGGIGLVILVFVFHLQPGKPPVDVMLVIIAVVAASATLQASGGLDVMLQIAEK
LLRRNPKYVSIVAPFVTCTLTILCGTGHVVYTILPIIYDVAIKNNIRPERPMAASSIGAQMGIIASPVSVAVVSLVAMLG
NVTFDGRHLEFLDLLAITIPSTLIGILAIGIFSWFRGKDLDKDEEFQKFISVPENREYVYGDTATLLDKKLPKSNWLAMW
IFLGAIAVVALLGADSDLRPSFGGKPLSMVLVIQMFMLLTGALIIILTKTNPASISKNEVFRSGMIAIVAVYGIAWMAET
MFGAHMSEIQGVLGEMVKEYPWAYAIVLLLVSKFVNSQAAALAAIVPVALAIGVDPAYIVASAPACYGYYILPTYPSDLA
AIQFDRSGTTHIGRFVINHSFILPGLIGVSVSCVFGWIFAAMYGFL
>P0ABP3 ~~~dcuC~~~Anaerobic C4-dicarboxylate transporter DcuC~~~COG3069
MLTFIELLIGVVVIVGVARYIIKGYSATGVLFVGGLLLLIISAIMGHKVLPSSQASTGYSATDIVEYVKILLMSRGGDLG
MMIMMLCGFAAYMTHIGANDMVVKLASKPLQYINSPYLLMIAAYFVACLMSLAVSSATGLGVLLMATLFPVMVNVGISRG
AAAAICASPAAIILAPTSGDVVLAAQASEMSLIDFAFKTTLPISIAAIIGMAIAHFFWQRYLDKKEHISHEMLDVSEITT
TAPAFYAILPFTPIIGVLIFDGKWGPQLHIITILVICMLIASILEFLRSFNTQKVFSGLEVAYRGMADAFANVVMLLVAA
GVFAQGLSTIGFIQSLISIATSFGSASIILMLVLVILTMLAAVTTGSGNAPFYAFVEMIPKLAHSSGINPAYLTIPMLQA
SNLGRTLSPVSGVVVAVAGMAKISPFEVVKRTSVPVLVGLVIVIVATELMVPGTAAAVTGK
>P45428 ~~~dcuD~~~Putative cryptic C4-dicarboxylate transporter DcuD~~~COG3069
MFGIIISVIVLITMGYLILKNYKPQVVLAAAGIFLMMCGVWLGFGGVLDPTKSSGYLIVDIYNEILRMLSNRIAGLGLSI
MAVGGYARYMERIGASRAMVSLLSRPLKLIRSPYIILSATYVIGQIMAQFITSASGLGMLLMVTLFPTLVSLGVSRLSAV
AVIATTMSIEWGILETNSIFAAQVAGMKIATYFFHYQLPVASCVIISVAISHFFVQRAFDKKDKNINHEQAEQKALDNVP
PLYYAILPVMPLILMLGSLFLAHVGLMQSELHLVVVMLLSLTVTMFVEFFRKHNLRETMDDVQAFFDGMGTQFANVVTLV
VAGEIFAKGLTTIGTVDAVIRGAEHSGLGGIGVMIIMALVIAICAIVMGSGNAPFMSFASLIPNIAAGLHVPAVVMIMPM
HFATTLARAVSPITAVVVVTSGIAGVSPFAVVKRTAIPMAVGFVVNMIATITLFY
>O66667 4.1.1.37~~~hemE~~~Uroporphyrinogen decarboxylase~~~COG0407
MPKNDLLLRSLRGEPIGRFPVWLMRQAGRYMPEYRKIRNRVKNFLELCKNVDLATEISLLPLKILGVDAIIIFSDILVPL
EPLGVKVEFVEGEGPKLSWSGKVSDLKKYDPSQNAYVYEIIKRVKEAQDEVPVIGFAGAPFTLLSYLIEGGASKDFKSTK
LFMWENPKEYKRLMDILTETVLAYLKEQIKAGADVVQIFDSWVNNLSLEDYGEYVYPYVNYLISELKDFSDTPVIYFFRG
SSSFIDLAVDYRADALSVDWSVDIPELFKIYDKGFQGNLEPAVLYASEEVIEEKTLGLLRRIPVKTRYVFNLGHGLAPDM
ELEKVKYLVDLVKSFPLT
>P32395 4.1.1.37~~~hemE~~~Uroporphyrinogen decarboxylase~~~COG0407
MSKRETFNETFLKAARGEKADHTPVWYMRQAGRSQPEYRKLKEKYGLFEITHQPELCAYVTRLPVEQYGVDAAILYKDIM
TPLPSIGVDVEIKNGIGPVIDQPIRSLADIEKLGQIDPEQDVPYVLETIKLLVNEQLNVPLIGFSGAPFTLASYMIEGGP
SKNYNKTKAFMYSMPDAWNLLMSKLADMIIVYVKAQIEAGAKAIQIFDSWVGALNQADYRTYIKPVMNRIFSELAKENVP
LIMFGVGASHLAGDWHDLPLDVVGLDWRLGIDEARSKGITKTVQGNLDPSILLAPWEVIEQKTKEILDQGMESDGFIFNL
GHGVFPDVSPEVLKKLTAFVHEYSQNKKMGQYS
>Q2STF3 4.1.1.37~~~hemE~~~Uroporphyrinogen decarboxylase~~~
MAQTLINDTFLRALLREPTDYTPIWLMRQAGRYLPEYNATRARAGSFLGLAKHPDYATEVTLQPLERFPLDAAILFSDIL
TIPDAMGLGLDFAAGEGPKFAHPVRTEADVAKLAVPDIGATLGYVTDAVREIRRALTDGEGRQRVPLIGFSGSPWTLACY
MVEGGGSDDFRTVKSMAYARPDLMHRILDVNAQAVAAYLNAQIEAGAQAVMIFDTWGGALADGAYQRFSLDYIRRVVAQL
KREHDGARVPAIAFTKGGGLWLEDLAATGVDAVGLDWTVNLGRARERVAGRVALQGNLDPTILFAPPEAIRAEARAVLDS
YGNHPGHVFNLGHGISQFTPPEHVAELVDEVHRHSRAIRSGTGS
>P32920 4.1.1.37~~~hemE~~~Uroporphyrinogen decarboxylase~~~COG0407
MTKTMLRALKGETLPTPPIWLMRQAGRYLPEYRATRAQAGDFLSLCYTPDLAAEVTLQPIRRYGFDAAILFADILLLPQA
LGADLWFETGEGPRMSTITDMAGVTALKGRDDIHETLAPVYETCRILARELPKETTFIGFAGMPWTVATYMIAGRGSKDQ
AAAHKLKDTDRPAFEALMDRVTEATIEYLAKQVEAGCEVVKLFDSWAGSLKGQDFEDFAVAPAKRIVSELKARFQGLPVI
AFPREAGEGYIGFAEKTGADCVAIDNSVSPEWAAEKVQAGRTCVQGNLDPKYMVTGGEELVQATKRVVEAFRNGPHIFNL
GHGITPEADPENVTLLVETIRGK
>A0QW23 4.1.1.37~~~hemE~~~Uroporphyrinogen decarboxylase~~~COG0407
MNTRRELPESPYLAAASGRSPHRVPVWFMRQAGRSLPEYRELRAQHRMLQACFDAELVCEITMQPVRRHKVDAAILFSDI
VVPLKAAGIGLDIVPDVGPVIDNPIRTLGDVQAMPALESPQVAPVAEAVRLLTAELGDVPLIGFAGAPFTLASYLVEGGP
SRHHERTKAMMLGESSTWHALMTALTDLTIAFLQAQVDAGVDALQVFDSWAGTLSLTDYREYVLPHSSRVFATMAAAGVP
MTHFGVGTAELLGAMSEALAPGAARVVGVDWRTSLADAAARVLPGAALQGNLDPVVLLAGWPVVEKAVRRVVEDGRAAVA
AGAAGHIFNLGHGVLPATDPGIITDAVELVHSL
>P9WFE1 4.1.1.37~~~hemE~~~Uroporphyrinogen decarboxylase~~~COG0407
MSTRRDLPQSPYLAAVTGRKPSRVPVWFMRQAGRSLPEYRALRERYSMLAACFEPDVACEITLQPIRRYDVDAAILFSDI
VVPLRAAGVDLDIVADVGPVIADPVRTAADVAAMKPLDPQAIQPVLVAASLLVAELGDVPLIGFAGAPFTLASYLVEGGP
SRHHAHVKAMMLAEPASWHALMAKLTDLTIAFLVGQIDAGVDAIQVFDSWAGALSPIDYRQYVLPHSARVFAALGEHGVP
MTHFGVGTAELLGAMSEAVTAGERPGRGAVVGVDWRTPLTDAAARVVPGTALQGNLDPAVVLAGWPAVERAARAVVDDGR
RAVDAGAAGHIFNLGHGVLPESDPAVLADLVSLVHSL
>P95458 4.1.1.37~~~hemE~~~Uroporphyrinogen decarboxylase~~~
MTALKNDRFLRALLKQPVDVTPVWMMRQAGRYLPEYRATRAKAGDFMSLCMNPELACEVTLQPLDRYPQLDAAILFSDIL
TIPDAMGQGLYFETGEGPRFRKVVSSLADIEALPVPDPEQDLGYVMDAVRTIRRELNGRVPLIGFSGSPWTLATYMVEGG
SSKDFRKSKAMLYDNPKAMHALLDKLAQSVTSYLNGQIHAGAQAVQIFDSWGGSLSAAAYQEFSLAYMRKIVDGLIREHD
GRRVPVILFTKGGGLWLESMAEVGAEALGLDWTCDIGSARARVGERVALQGNMDPSVLYANPAAIRAEVARILAAYGKGT
GHVFNLGHGITPEVDPAHAGAFFEAVHELSAQYHG
>Q83PB7 4.1.1.37~~~hemE~~~Uroporphyrinogen decarboxylase~~~
MTELKNDRYLRALLRQPVDVTPVWMMRQAGRYLPEYKATRAQAGDFMSLCKNAELACEVTLQPLRRYPLDAAILFSDILT
VPDAMGLGLYFEAGEGPRFTSPVTCKADVDKLPIPDPEDELGYVMNAVRTIRHELKGEVPLIGFSGSPWTLATYMVEGGS
SKAFTVIKKMMYADPQALHALLDKLAKSVTLYLNAQIKAGAQAVMIFDTWGGVLTGRDYQQFSLYYMHKIVDGLLRENDG
RRVPVTLFTKGGGQWLEAMAETGCDALGLDWTTDIADARRRVGNKVALQGNMDPSMLYAPPARIEEEVATILAGFGHGEG
HVFNLGHGIHQDVPPEHAGVFVEAVHRLSEQYHR
>P67420 4.1.1.37~~~hemE~~~Uroporphyrinogen decarboxylase~~~
MVHNKNNTILKMIKGEETSHTPVWFMRQAGRSQPEYRKLKEKYSLFDITHQPELCAYVTHLPVDNYHTDAAILYKDIMTP
LKPIGVDVEIKSGIGPVIHNPIKTIQDVEKLSQIDPERDVPYVLDTIKLLTEEKLNVPLIGFTGAPFTLASYMIEGGPSK
NYNFTKAMMYRDEATWFALMNHLVDVSVKYVTAQVEAGAELIQIFDSWVGALNVEDYRRYIKPHMIRLISEVKEKHDVPV
ILFGVGASHLINEWNDLPIDVLGLDWRTSINQAQQLGVTKTLQGNLDPSILLAPWNVIEERLKPILDQGMENGKHIFNLG
HGVFPEVQPETLRKVSEFVHTYTQR
>P0AD01 ~~~dcuR~~~Transcriptional regulatory protein DcuR~~~COG4565
MINVLIIDDDAMVAELNRRYVAQIPGFQCCGTASTLEKAKEIIFNSDTPIDLILLDIYMQKENGLDLLPVLHNARCKSDV
IVISSAADAATIKDSLHYGVVDYLIKPFQASRFEEALTGWRQKKMALEKHQYYDQAELDQLIHGSSSNEQDPRRLPKGLT
PQTLRTLCQWIDAHQDYEFSTDELANEVNISRVSCRKYLIWLVNCHILFTSIHYGVTGRPVYRYRIQAEHYSLLKQYCQ
>P0AEC8 2.7.13.3~~~dcuS~~~Sensor histidine kinase DcuS~~~COG3290
MRHSLPYRMLRKRPMKLSTTVILMVSAVLFSVLLVVHLIYFSQISDMTRDGLANKALAVARTLADSPEIRQGLQKKPQES
GIQAIAEAVRKRNDLLFIVVTDMQSLRYSHPEAQRIGQPFKGDDILKALNGEENVAINRGFLAQALRVFTPIYDENHKQI
GVVAIGLELSRVTQQINDSRWSIIWSVLFGMLVGLIGTCILVKVLKKILFGLEPYEISTLFEQRQAMLQSIKEGVVAVDD
RGEVTLINDAAQELLNYRKSQDDEKLSTLSHSWSQVVDVSEVLRDGTPRRDEEITIKDRLLLINTVPVRSNGVIIGAIST
FRDKTEVRKLMQRLDGLVNYADALRERSHEFMNKLHVILGLLHLKSYKQLEDYILKTANNYQEEIGSLLGKIKSPVIAGF
LISKINRATDLGHTLILNSESQLPDSGSEDQVATLITTLGNLIENALEALGPEPGGEISVTLHYRHGWLHCEVNDDGPGI
APDKIDHIFDKGVSTKGSERGVGLALVKQQVENLGGSIAVESEPGIFTQFFVQIPWDGERSNR
>P76316 4.4.1.15~~~dcyD~~~D-cysteine desulfhydrase~~~COG2515
MPLHNLTRFPRLEFIGAPTPLEYLPRFSDYLGREIFIKRDDVTPMAMGGNKLRKLEFLAADALREGADTLITAGAIQSNH
VRQTAAVAAKLGLHCVALLENPIGTTAENYLTNGNRLLLDLFNTQIEMCDALTDPNAQLEELATRVEAQGFRPYVIPVGG
SNALGALGYVESALEIAQQCEGAVNISSVVVASGSAGTHAGLAVGLEHLMPESELIGVTVSRSVADQLPKVVNLQQAIAK
ELELTASAEILLWDDYFAPGYGVPNDEGMEAVKLLARLEGILLDPVYTGKAMAGLIDGISQKRFKDEGPILFIHTGGAPA
LFAYHPHV
>Q8ZNT7 4.4.1.15~~~dcyD~~~D-cysteine desulfhydrase~~~
MPLHHLTRFPRLEFIGAPTPLEYLPRLSDYLGREIYIKRDDVTPIAMGGNKLRKLEFLVADALREGADTLITAGAIQSNH
VRQTAAVAAKLGLHCVALLENPIGTTAENYLTNGNRLLLDLFNTQIEMCDALTDPDAQLQTLATRIEAQGFRPYVIPVGG
SSALGAMGYVESALEIAQQCEEVVGLSSVVVASGSAGTHAGLAVGLEHLMPDVELIGVTVSRSVAEQKPKVIALQQAIAG
QLALTATADIHLWDDYFAPGYGVPNDAGMEAVKLLASLEGVLLDPVYTGKAMAGLIDGISQKRFNDDGPILFIHTGGAPA
LFAYHPHV
>E2JA29 ~~~ddaD~~~Dapdiamide synthesis protein DdaD~~~
MHSVETFNLPALNSLLETTARRFGNRLAVQDDNGSLTFADFVEKVGILSAKLRLVIKRGEHVAVQLPRGINYIVAAYAIW
EAGGVYLPLDNQWPSSRIEGILHRSHVRVLIHTSQADQGLELTELPAETRAESPVAGTPAYIIHTSGTTGEPKGVVVSHE
SLIHLVESHQRDIYQAYDVTEGPVAINASFCFDSALERMALVALGYSLHVVSDQVRKSPYELVKYLRDNSIVNVDLVPSH
LKVLLSAGLNEKCDALRLVIVGGEAIDAELWREIVQNQAIYINVYGPTENTINTSFCEIRGETPHIGRPFKNVTCLLLNE
NGERCAAGEEGELLVAGRHLAQGYYNAPDLTDRVFVHIDGIRYYRTGDRVRQNEQGNLLYLGRIDDQVKINGFRIELADV
QHNLTQLPGVKYAAVTPIKLPTGQGLLASIVWNSDAPEQTFSNLEALLGEKLPSYMVPTRWQKLDALPLTDNLKLDHKSL
LSHWKNSQEQIGEKFAAESISATEHQIKNLWQKILRQPSLSPDAHFFASGGDSMAAMTLLVELKKVTPQDVSLGDIFKYP
TIRKMAAWLDASSVQAES
>E2JA31 6.3.2.47~~~ddaF~~~Dapdiamide A synthase~~~
MSILNNKEVIVIIDAWSGGKHLIPAFQALGYFCLHVQSTFLPEVFIADNQLAIARSDRHIVHDGNIETLLSQLQPYTIKA
ILAGSEGAVGLADCLNDALELTFSNQFELSAARRNKYLMQEQLALKGVASINQQLAGHSDELKQWLAGHAHWPVVLKPIQ
SAGTDGVFICHDLAQALQAFEAILAKKDFFGSPNREVLCQEFLAGEEFVVNGIACQGEYFFTELWQSKKQQRNGFPVYET
QYLHYQNDAGFDVLTAYTVQVCQTLGINNGAFHAEVMMTSGGPVLIEIGARVAGGADPYIIEECLGHSQISKLAQAVLHP
AKFLQECRRQHDFSGHRRAAYVYMISPSPGRVQVSPEEKFIKIDGVISINYHYAPGDIQQETCDLLSSPGVIIAIRDNPA
LLKQTIAEIRDVEADFYHLGLIDE
>E2JA32 6.3.2.46~~~ddaG~~~Fumarate--(S)-2,3-diaminopropanoate ligase~~~
MNKGENMNLEMWVQNISATFPWYAQLLRDKQADLSRLESLPLITEELLTQHYYHAENSFPGEHHSYLTSGTTSGKRKRIF
YSDNDQRIYLQQRMDIIRDFCGEGHTRACADLGTGHAAATAGEIFQAMGCDVELIDFTRPIEQHIEVLNRFKPDIFFTMP
MILDSLIATGKLDFQPKRIILLGDVASLNWQNKVADYFHIQPAQVLDLFGSIEIGSIAFYNHAQKRYQFDSYVRPEVVPV
QSLYPGAKYGGDGGILLLTSFAREYFPAVRFVTNDLIEGFAQENVGGRTVYTYQRCLGRFAGEFKHGEKINLSDISDALA
NNLPYHKYDLADHEGGLVIRIAAKSIPTEVIEAIKHDLLARNPDIAQMISSGLVGDIRIQCVDAQEITGNVSKRRY
>P71889 3.5.3.18~~~~~~N(G),N(G)-dimethylarginine dimethylaminohydrolase~~~COG1834
MENTQRPSFDCEIRAKYRWFMTDSYVAAARLGSPARRTPRTRRYAMTPPAFFAVAYAINPWMDVTAPVDVQVAQAQWEHL
HQTYLRLGHSVDLIEPISGLPDMVYTANGGFIAHDIAVVARFRFPERAGESRAYASWMSSVGYRPVTTRHVNEGQGDLLM
VGERVLAGYGFRTDQRAHAEIAAVLGLPVVSLELVDPRFYHLDTALAVLDDHTIAYYPPAFSTAAQEQLSALFPDAIVVG
SADAFVFGLNAVSDGLNVVLPVAAMGFAAQLRAAGFEPVGVDLSELLKGGGSVKCCTLEIHP
>Q9I4E3 3.5.3.18~~~~~~N(G),N(G)-dimethylarginine dimethylaminohydrolase~~~
MFKHIIARTPARSLVDGLTSSHLGKPDYAKALEQHNAYIRALQTCDVDITLLPPDERFPDSVFVEDPVLCTSRCAIITRP
GAESRRGETEIIEETVQRFYPGKVERIEAPGTVEAGDIMMVGDHFYIGESARTNAEGARQMIAILEKHGLSGSVVRLEKV
LHLKTGLAYLEHNNLLAAGEFVSKPEFQDFNIIEIPEEESYAANCIWVNERVIMPAGYPRTREKIARLGYRVIEVDTSEY
RKIDGGVSCMSLRF
>Q9X7M4 3.5.3.18~~~ddaH~~~N(G),N(G)-dimethylarginine dimethylaminohydrolase~~~COG1834
MPSKKALVRRPSPRLAEGLVTHVEREKVDHGLALEQWDAYVEALGAHGWETLEVDPADDCPDSVFVEDAVVVFRNVALIT
RPGAESRRAETAGVEEAVARLGCSVNWVWEPGTLDGGDVLKIGDTIYVGRGGRTNAAGVQQLRAAFEPLGARVVAVPVSK
VLHLKSAVTALPDGTVIGHIPLTDVPSLFPRFLPVPEESGAHVVLLGGSRLLMAASAPKTAELLADLGHEPVLVDIGEFE
KLEGCVTCLSVRLRELYD
>Q43908 4.1.1.86~~~ddc~~~L-2,4-diaminobutyrate decarboxylase~~~COG0076
MVDFAEHRKALLCNDAQSIADYESAMGEAVKAVSAWLQNEKMYTGGSIKELRSAISFQPSKEGMGVQQSLQRMIELFLNK
SLKVHHPHSLAHLHCPTMVMSQIAEVLINATNQSMDSWDQSPAGSLMEVQLIDWLRQKVGYGSGQAGVFTSGGTQSNLMG
VLLARDWCIAKNWKDENGNPWSVQRDGIPAEAMKNVKVICSENAHFSVQKNMAMMGMGFQSVVTVPVNENAQMDVDALEK
TMAHLQAEGKVVACVVATAGTTDAGAIHPLKKIREITNKYGSWMHIDAAWGGALILSNTYRAMLDGIELSDSITLDFHKH
YFQSISCGAFLLKDEANYRFMHYEAEYLNSAYDEEHGVPNLVSKSLQTTRRFDALKLWMTIESLGEELYGSMIDHGVKLT
REVADYIKATEGLELLVEPQFASVLFRVVPEGYPVEFIDSLNQNVADELFARGEANIGVTKVGNVQSLKMTTLSPVVTVD
NVKNLLAQVLAEAERIKDAIASGNYVPPID
>P71362 4.1.1.86~~~ddc~~~L-2,4-diaminobutyrate decarboxylase~~~COG0076
MSNLKQHKQALFCNDNEAINDYETAMHNAVQAVSAWLKNEKMYTGGSIKQMRALISGFNPTKEGMGVQKSLDHLVEIFLN
PSLKVHHPHSLAHLHCPTMVTSQIAEVLINATNQSMDSWDQSPAGSIMEEHLINWLRQKAGYGEGTSGVFTSGGTQSNLM
GVLLARDWAIANHWKNEDGSEWSVQRDGIPAEAMQKVKVICSENAHFSVQKNMAMMGMGFQSVVTVPSNANAQMDLIALK
QTLAQLKADGKITACIVATAGTTDAGAIDDLKAIRKLADEYQAWLHVDAAWGGALLLSKDYRYFLDGIELTDSITLDFHK
HFFQTISCGAFLLKDPENYRFIDYKADYLNSEYDEAHGVPNLVAKSLQTTRRFDALKLWFTLEALGEDLYASMIDHGVKL
TKEVEQYINDTPDLEMLVPSQFASVLFRVVPKDYPAEFIDALNQNVADELFARGEANIGVTKVGDKQSLKMTTLSPIATL
ENVKALLTQVLTEANRIKDDIKNGTYTPPID
>P0DUH5 3.5.4.-~~~dddA~~~Double-stranded DNA deaminase toxin A~~~
MYEAARVTDPIDHTSALAGFLVGAVLGIALIAAVAFATFTCGFGVALLAGMMAGIGAQALLSIGESIGKMFSSQSGNIIT
GSPDVYVNSLSAAYATLSGVACSKHNPIPLVAQGSTNIFINGRPAARKDDKITCGATIGDGSHDTFFHGGTQTYLPVDDE
VPPWLRTATDWAFTLAGLVGGLGGLLKASGGLSRAVLPCAAKFIGGYVLGEAFGRYVAGPAINKAIGGLFGNPIDVTTGR
KILLAESETDYVIPSPLPVAIKRFYSSGIDYAGTLGRGWVLPWEIRLHARDGRLWYTDAQGRESGFPMLRAGQAAFSEAD
QRYLTRTPDGRYILHDLGERYYDFGQYDPESGRIAWVRRVEDQAGQWYQFERDSRGRVTEILTCGGLRAVLDYETVFGRL
GTVTLVHEDERRLAVTYGYDENGQLASVTDANGAVVRQFAYTNGLMTSHMNALGFTSSYVWSKIEGEPRVVETHTSEGEN
WTFEYDVAGRQTRVRHADGRTAHWRFDAQSQIVEYTDLDGAFYRIKYDAVGMPVMLMLPGDRTVMFEYDDAGRIIAETDP
LGRTTRTRYDGNSLRPVEVVGPDGGAWRVEYDQQGRVVSNQDSLGRENRYEYPKALTALPSAHFDALGGRKTLEWNSLGK
LVGYTDCSGKTTRTSFDAFGRICSRENALGQRITYDVRPTGEPRRVTYPDGSSETFEYDAAGTLVRYIGLGGRVQELLRN
ARGQLIEAVDPAGRRVQYRYDVEGRLRELQQDHARYTFTYSAGGRLLTETRPDGILRRFEYGEAGELLGLDIVGAPDPHA
TGNRSVRTIRFERDRMGVLKVQRTPTEVTRYQHDKGDRLVKVERVPTPSGIALGIVPDAVEFEYDKGGRLVAEHGSNGSV
IYTLDELDNVVSLGLPHDQTLQMLRYGSGHVHQIRFGDQVVADFERDDLHREVSRTQGRLTQRSGYDPLGRKVWQSAGID
PEMLGRGSGQLWRNYGYDGAGDLIETSDSLRGSTRFSYDPAGRLISRANPLDRKFEEFAWDAAGNLLDDAQRKSRGYVEG
NRLLMWQDLRFEYDPFGNLATKRRGANQTQRFTYDGQDRLITVHTQDVRGVVETRFAYDPLGRRIAKTDTAFDLRGMKLR
AETKRFVWEGLRLVQEVRETGVSSYVYSPDAPYSPVARADTVMAEALAATVIDSAKRAARIFHFHTDPVGAPQEVTDEAG
EVAWAGQYAAWGKVEATNRGVTAARTDQPLRFAGQYADDSTGLHYNTFRFYDPDVGRFINQDPIGLNGGANVYHYAPNPV
GWVDPWGLAGSYALGPYQISAPQLPAYNGQTVGTFYYVNDAGGLESKVFSSGGPTPYPNYANAGHVEGQSALFMRDNGIS
EGLVFHNNPEGTCGFCVNMTETLLPENAKMTVVPPEGAIPVKRGATGETKVFTGNSNSPKSPTKGGC
>A6W2K8 2.-.-.-~~~dddD~~~CoA-transferase/lyase DddD~~~COG1804
MNKQNQLPLVGVRVADFGQQIAGPAVAMVLADLGATVVHIDPPGGPSWKHPANAILNRNKASLCIDLKTQAGLDQALELI
ENVDIVIESFRPGVMKRLGIDFVALRESRPELITLSMPGFASNDELHRDWKATEAIVAATSGTFTDMGFNRVLMGLNPSF
SPLPLGSSYAISLAASSIALALFEREKTGRGDNIEVPIAAALMEGLSYNSYVVDQLPERYKTMRELEIEHRKSNNIKMDV
SYAQLQEYLDPFYRTYVCADGRQFYCVCPSHRNHAERALKVLGIYDELVAEGLPEVKDLHVPISEWDGETSIGVYPLPKK
WADLISEKMKKAFLQKTSDEWGVIFGEGQIPGAPHRSTEEWVNSEHCNASGLIVEVEGTEFGTMKQPGPIVWFENESEAM
LKPKPQEHVSFEQALARLQSVAKIEKISRPTGQDIQPASGKGWLDGVKILDLTNVIAGPHSTAFMSRFGAEITKLDPVTP
LYDPLIGILFTFQTGVGKQSALVNIMTKEGREVFERLVRSVDIVVINAPDRQMKPLGLDQDSLSAINPDVLFCRLDCFGG
PRTGSKTNYIGYDDIIQANSGIMSRFGKPETPEEHAHLGTLDVNCGFAAALGMVIALYQKRKTGKVCRVRTSLSAVTNIA
QIPFAFDYEGRAPFNEASGREAMGNHALSHFYRTNSGWVFLDSHQGELAKLDAIKGLNGIQQSQDMGQFLRDQLVKESSA
YWLKEFAAADIACAEPFSIEYLREHNSRVADQKVGTDLGSYAFSIFPDHPSGHCITQVDPYSIRPREAKIRAVTPTEKFG
CSTIKVLQGLGYSESDINDMLEKKIAATGWGREFLPS
>P0DUH6 ~~~dddI~~~Double-stranded DNA deaminase immunity protein~~~
MYADDFDGEIEIDEVDSLVEFLSRRPAFDANNFVLTFEESGFPQLNIFAKNDIAVVYYMDIGENFVSKGNSASGGTEKFY
ENKLGGEVDLSKDCVVSKEQMIEAAKQFFATKQRPEQLTWSEL
>Q3J6L0 4.4.1.3~~~dddL~~~Putative dimethylsulfonioproprionate lyase DddL~~~COG0662
MHSLSERVEQLRLNDCPDWLYLLHEFDALYRQGSDGGSRPIRTHRKRVRDSLALIVEANPAVNDRPPEVKPVTAHLGRAL
DLGERGAVQGMSRALARVAGRLTWEYGYEKVPKALARKYAYCEILGPRGPICAERLILGFVLFAPSTTYPQHSHKDIEES
YISVAGAWSENDAAVHAPGSLILNRPGLEHRITTGDLSPCLLAYAWTGSEERLNQPGMKLSSPRKARIEKGI
>Q166H0 4.4.1.3~~~dddP~~~Dimethylsulfonioproprionate lyase DddP~~~COG0006
MNRHFNATRKIDPSRGATLGDGSPNDMNRVEIGPTQLAFAEWHTARLDLPDLAAMRRFRHRRLTDHVVARGYAGLLMFDP
LNIRYATDSTNMQLWNTHNPFRATLLCADGYMVMWDYKNSPFLSEFNPLVREQRAGADLFYFDRGDKVDVAADVFANEVR
ILLRDHAPGLRRLAVDKVMLHGLRALQAQGFEIMDGEEVTEKARSVKGPDEIRAMRCASHACEVAVRKMEDFARSKVGDG
VTCENDIWAILHSENVRRGGEWIETRLLASGPRSNPWFQECGPRVCQRNEIISFDTDLVGAYGICTDISRSWWIGDQKPR
ADMIYAMQHGVEHIRTNMEMLKPGVMIPELSANTHVLDAKFQKQKYGCLMHGVGLCDEWPLVAYPDHAVAGAYDYPLEPG
MTLCVEALISEEGGDFSIKLEDQVLITEDGYENLTKYPFDPALMGVE
>A3SK19 4.4.1.3~~~dddP~~~Dimethylsulfonioproprionate lyase DddP~~~COG0006
MNQHYSETRKIDPSRGATLGDNTPNDNNRIEIGPTQLAFGEWATAGLALPDLQRMREFRWNRLTQAVVDRDYGGVLMFDP
LNIRYATDSTNMQLWNAHNPFRALLVCADGYMVIWDYKNSPFLSTFNPLVREQRFGADLFYFDRGDKVDVAADAFSNEVR
TLIAEHGGGNMRLAVDKIMLHGLRALEAQGFEIMEGEELTEKTRAIKGPDEILAMRCAVHACETSVAAMEHFAREAVPQG
NTSEDDVWAVLHAENIKRGGEWIETRLLASGPRTNPWFQECGPRIIQNNEIISFDTDLIGSYGICVDISRSWWVGDAAPP
ADMVYAMQHAHEHIMTNMEMLKPGVTIPELSERSHRLDEQFQAQKYGCLMHGVGLCDEWPLVAYPDQAVPGSYDYPLEPG
MVLCVEAAVGAVGGNFTIKLEDQVLITETGYENLTSYPFDPALMGR
>D0CY60 4.4.1.3~~~dddQ~~~Dimethylsulfonioproprionate lyase DddQ~~~COG1917
MTLENVLEAARHLHQTLPALSEFGNWPTDLTATGLQPRAIPATPLVQALDQPGSPRTTGLVQAIRSAAHLAHWKRTYTEA
EVGADFRNRYGYFELFGPTGHFHSTQLRGYVAYWGAGLDYDWHSHQAEELYLTLAGGAVFKVDGERAFVGAEGTRLHASW
QSHAMSTGDQPILTFVLWRGEGLNALPRMDAA
>Q5LT18 4.4.1.3~~~dddQ~~~Dimethylsulfonioproprionate lyase DddQ~~~COG1917
MTQTDPAFQNLLAEFQALHAREPALAGFVALPDSLTPQPVTPVRIPPAALMESDPDLTTTAYAAIRDAFIAAGAVAQWRL
TYQGSRLGADFMDRFACYCLIGEGGPFASDSLAAYVVYMPAGLYYPFHQHPAEEIYFILAGEAEFLMEGHPPRRLGPGDH
VFHPSGHPHATRTYDRPFMALVLWRGDLETAPVLTYPEGEI
>Q5LW89 4.4.1.3~~~dddW~~~Dimethylsulfonioproprionate lyase DddW~~~COG0662
MTAMLDSFATDLTAATSLHQPGNLPHPPHAIDAENVPLSGGTDPTYGEVRWRTLINGTEAAPRDMVLGIAEFGPGHQLRP
HRHTPPEFYLGLEGSGIVTIDGVPHEIRAGVALYIPGDAEHGTVAGPEGLRFAYGFASASFEAIEYRFTASA
>Q15SS1 1.1.1.389~~~~~~2-dehydro-3-deoxy-L-galactonate 5-dehydrogenase~~~COG1063
MFAAQYIGNKSFNVVEGHAIAPQAGEVRLDVGYVGICGTDMHIYHGVMDQRVSIPQTIGHEISGVVAQIGEGVEGFTVGE
KVVVRPLDWCGECPTCEAGLTHICQNLKFMGIDTPGAFQSSWTVKARTLHKLPAGVDLKQGALVEPLSVACHDVRRSRLK
AGEKAVILGGGPIGQLVAAVAKSVGAEVLVSEPNDSRREFADELGVKSVNPMDTDLAAYVDQWTGTKGADVVFEVSGVLP
AIQSMTQIAGRRGRIVMVAIHSTAPPIDLFQFFWKELELLGARVYEAADFDWAIELIASGQIDLKPFISSVSPLADIGSA
FANMDGNPQGMKALVECNAEQ
>Q15SS0 1.1.1.127~~~~~~2-dehydro-3-deoxy-D-gluconate 5-dehydrogenase~~~COG1028
MLEKFSLEGKVALVTGCKRGIGKGIALGLAEAGADIIGVSASLALEGSDVENEVKALGRNFKGYQCDFSDRDALYAFIKE
VKADFPKIDILVNNAGTILRAPAAEHGDDLWDKVIDVNLNSQFILSREIGKEMVARQSGKIIFTASLLTFQGGITVPGYA
ASKGAIGQLVMALSNEWAGKGVNVNAIAPGYIDTDNTQALREDSERSAAILGRIPQGRWGNPDDFKGPAVFLASDAASYV
NGAILLVDGGWMGR
>B2J6X9 4.2.3.154~~~~~~Demethyl-4-deoxygadusol synthase~~~COG0337
MSNVQASFEATEAEFRVEGYEKIEFSLVYVNGAFDISNREIADSYEKFGRCLTVIDANVNRLYGKQIKSYFRHYGIDLTV
VPIVITEPTKTLATFEKIVDAFSDFGLIRKEPVLVVGGGLTTDVAGFACAAYRRKSNYIRVPTTLIGLIDAGVAIKVAVN
HRKLKNRLGAYHAPLKVILDFSFLQTLPTAQVRNGMAELVKIAVVANSEVFELLYEYGEELLSTHFGYVNGTKELKAIAH
KLNYEAIKTMLELETPNLHELDLDRVIAYGHTWSPTLELAPMIPLFHGHAVNIDMALSATIAARRGYITSGERDRILSLM
SRIGLSIDHPLLDGDLLWYATQSISLTRDGKQRAAMPKPIGECFFVNDFTREELDAALAEHKRLCATYPRGGDGIDAYIE
TQEESKLLGV
>Q3M6C3 4.2.3.154~~~~~~Demethyl-4-deoxygadusol synthase~~~COG0337
MSIVQAKFEAKETSFHVEGYEKIEYDLVYVDGIFEIQNSALADVYQGFGRCLAIVDANVSRLYGNQIQAYFQYYGIELRL
FPITITEPDKTIQTFERVIDVFADFKLVRKEPVLVVGGGLITDVVGFACSTYRRSSNYIRIPTTLIGLIDASVAIKVAVN
HRKLKNRLGAYHASRKVFLDFSLLRTLPTDQVRNGMAELVKIAVVAHQEVFELLEKYGEELLRTHFGNIDATPEIKEIAH
RLTYKAIHKMLELEVPNLHELDLDRVIAYGHTWSPTLELAPRLPMFHGHAVNVDMAFSATIAARRGYITIAERDRILGLM
SRVGLSLDHPMLDIDILWRGTESITLTRDGLLRAAMPKPIGDCVFVNDLTREELAAALADHKELCTSYPRGGEGVDVYPV
YQKELIGSVK
>Q8GPG4 1.8.2.4~~~ddhA~~~Dimethylsulfide dehydrogenase subunit alpha~~~COG0243
MLRTTRRTLMQGASLVGAGLFAAGRGWALNRLEPIGDTLAEEYPYRDWEDLYRNEFTWDYVGKAAHCINCLGNCAFDIYV
KDGIVIREEQLAKYPQISPDIPDANPRGCQKGAIHSTSMYEADRLRYPMKRVGARGEGKWQRISWDQATEEIADKIIDIY
EKYGPGKLMTHTGSGNMSMMRMAAPYRFASLVGGVQLDIFTDVGDLNTGAHLAYGNALESFTSDAWFGADYIMFLLFNPV
ATRIPDAHFLWEAKWNGARVVSVAPDYNPSSIHSDLWMPIKQGADPFLAMSMVNVIIEGKLYNEAFMKEQTDLPILVRSD
NGMLLREADLEEGGSDQVFYHWDSRTGAAVKVKGSMGSEEKTLVLGDVDPALEGSFEVGGIPVTTVFEKVRAEAAKYPPE
ETAAITGIGPGVVRAEAETFARAKKALLMTGFNIGRYSNGIYTSWALTLMLALTGHGGRTGGLDTSWIAWNQPALLELAF
FDFKKLPRLEAGGLGEFVRGGMMEHSRQHYDNDKLKARTGFDLDELQEMIDESIDAGWMPYYGDMKGLISIADNKFRRNK
NAEAYRERILEEVEELFVDINVRMDSTAQWADYLLPAAAHYEAWDLRSIAFHRFVNVFSRPVPPIGEAKSDWEIMEILTR
KIQERAIARGITGYEDGDVTRDFATIHDDYTMDGTLMTDHDVVSWLVENGPEFAGATLEEGVERGFFVMGEDAGPTQKLR
PSEPYHAFLQQTEGKEPYKTMTGRITFFVDHPRFVRLGATVPTARHHAGRDASNYPLNFFSPHTRWGIHSNWRSNKFMLR
LQRGEPNIYISPQLAAAKGIADGAQVRVFNELSFFFAQAKFYPSLPPDTIMMEHGWEPHQFPNWRPMNVCMATLLQPLEL
VGGWGHLNFSLWHWNANQLAHESSVDIEPA
>Q8GPG3 ~~~ddhB~~~Dimethylsulfide dehydrogenase subunit beta~~~COG1140
MVKRQISMVLDLNKCIGCQTCTSACKLQWTNRNGREYMYWNNVETHPGPGYPRNYEHSGGGFDEEGALKIGITPSAEDYG
IPWEYNYEEALMTGTDPWLRPNVKPTWGANWNEDEGRGEYPNSYYFYLPRICNHCANPGCLAACARNAIYKRQEDGIVLV
DQERCRGYRYCITACPYKKVYFNEQISKAEKCIFCYPRIEKGLPTACAKQCVGRIRFIGYLDDEAGPVHLLVERYKVAIP
LHPEWGTKPSVFYVPPLAPPRIGDDGEPTEETRVPLAYLKELFGEAVVPALETLKTERAKKQSGAESELMDTLIGYRHPE
MFKLS
>Q8GPG1 ~~~ddhC~~~Dimethylsulfide dehydrogenase subunit gamma~~~COG2010
MPGFRFLLAATAAFLATSPALPLSADSLNAGNIRLVDPEETVPVIKIPDGIYLRTPNDPDDIIWARVPEFRVEMVMAPPV
HPSVGLRYRDEYPEQDLVVQLARTSERFYVRLRWVDPTRDMSTLRDRFRDGAAIEFSESDDSVSYMMGTDAESPVNIWYW
HPDGDRVESLAAGSPGSLTRLDRQPVTGASEYRTGHGPDDSQWIVVMSRPLASEGDHQVSFERDTIPVAFALWQGADAQR
DGLKLVSLNWIFARMTPDAAPAPGN
>P0A6J8 6.3.2.4~~~ddlA~~~D-alanine--D-alanine ligase A~~~COG1181
MEKLRVGIVFGGKSAEHEVSLQSAKNIVDAIDKSRFDVVLLGIDKQGQWHVSDASNYLLNADDPAHIALRPSATSLAQVP
GKHEHQLIDAQNGQPLPTVDVIFPIVHGTLGEDGSLQGMLRVANLPFVGSDVLASAACMDKDVTKRLLRDAGLNIAPFIT
LTRANRHNISFAEVESKLGLPLFVKPANQGSSVGVSKVTSEEQYAIAVDLAFEFDHKVIVEQGIKGREIECAVLGNDNPQ
ASTCGEIVLTSDFYAYDTKYIDEDGAKVVVPAAIAPEINDKIRAIAVQAYQTLGCAGMARVDVFLTPENEVVINEINTLP
GFTNISMYPKLWQASGLGYTDLITRLIELALERHAADNALKTTM
>P0A1F0 6.3.2.4~~~ddlA~~~D-alanine--D-alanine ligase A~~~
MAKLRVGIVFGGKSAEHEVSLQSAKNIVDAIDKTRFDVVLLGIDKAGQWHVNDAENYLQNADDPAHIALRPSAISLAQVP
GKHQHQLINAQNGQPLPTVDVIFPIVHGTLGEDGSLQGMLRVANLPFVGSDVLSSAACMDKDVAKRLLRDAGLNIAPFIT
LTRTNRHAFSFAEVESRLGLPLFVKPANQGSSVGVSKVANEAQYQQAVALAFEFDHKVVVEQGIKGREIECAVLGNDNPQ
ASTCGEIVLNSEFYAYDTKYIDDNGAQVVVPAQIPSEVNDKIRAIAIQAYQTLGCAGMARVDVFLTADNEVVINEINTLP
GFTNISMYPKLWQASGLGYTDLISRLIELALERHTANNALKTTM
>P07862 6.3.2.4~~~ddlB~~~D-alanine--D-alanine ligase B~~~COG1181
MTDKIAVLLGGTSAEREVSLNSGAAVLAGLREGGIDAYPVDPKEVDVTQLKSMGFQKVFIALHGRGGEDGTLQGMLELMG
LPYTGSGVMASALSMDKLRSKLLWQGAGLPVAPWVALTRAEFEKGLSDKQLAEISALGLPVIVKPSREGSSVGMSKVVAE
NALQDALRLAFQHDEEVLIEKWLSGPEFTVAILGEEILPSIRIQPSGTFYDYEAKYLSDETQYFCPAGLEASQEANLQAL
VLKAWTTLGCKGWGRIDVMLDSDGQFYLLEANTSPGMTSHSLVPMAARQAGMSFSQLVVRILELAD
>B2I1J3 6.3.2.4~~~ddl~~~D-alanine--D-alanine ligase~~~
MSNATKFGKVAVLLGGKSAERAVSLDSGQAVLDALLRSGVQAEAFDPQDRSVTELVNYDRAFIVLHGRGGEDGQIQGVLE
WLNIPYTGTGVQGSAIGMDKVKTKQIWQGSDLPTAPYRIITKETDLDSVIAELGLPVIIKPVHEGSSVGMSKVEKAEDFA
AAIEKATQHDAVVMAEKWITGREFTISFLNGQPLPVIRLQPPADVAFYDYEAKYQRNDVEYGIPCGLSETEEKKLQALCL
RAFQAVGAEGWGRIDAMQDEQGNFWLLEVNTVPGMTSHSLVPKAAKAVGYSFDELCVAILEQTLEGTA
>A0KKW8 6.3.2.4~~~ddl~~~D-alanine--D-alanine ligase~~~COG1181
MKNIHVLLLCGGGGSEHEVSLRSANFLEKQLSLLPGVEVTRVEMFADRWLSADGRECKLGLDKLLSFDSVARPVDYVVPC
IHGYPGETGDLQSFLELAGLPYLGCDAEASKICFNKISTKLWLSAIGIPNTPYLFLTEQNDAALSEAKAALAKWGKVFIK
AASQGSSVGCYSASNEADLVKGIADAFGYSEQVLIEKAVKPRELEVAVYQYGDELVATYPGEICVPQDKFYTYEEKYSSA
SHTETALRAEGLTQAQADAIHEYALKAFRQLKLTHLSRIDFFLTEEGEILLNEINTFPGMTSISMFPKLLEHHGHRFADY
LEQILRKAV
>Q81Q29 6.3.2.4~~~ddl~~~D-alanine--D-alanine ligase~~~COG1181
MRIGVIMGGVSSEKQVSIMTGNEMIANLDKNKYEIVPITLNEKMDLIEKAKDIDFALLALHGKYGEDGTVQGTLESLGIP
YSGSNMLSSGICMDKNISKKILRYEGIETPDWIELTKMEDLNFDELDKLGFPLVVKPNSGGSSVGVKIVYDKDELISMLE
TVFEWDSEVVIEKYIKGEEITCSIFDGKQLPIISIRHAAEFFDYNAKYDDASTIEEVIELPAELKERVNKASLACYKALK
CSVYARVDMMVKDGIPYVMEVNTLPGMTQASLLPKSADAAGIHYSKLLDMIIETSLRVRKEEGF
>B1YSS6 6.3.2.4~~~ddl~~~D-alanine--D-alanine ligase~~~
MSGIDPKRFGKVAVLFGGESAEREVSLTSGRLVLQGLRDAGIDAHPFDPAERPLSALKDEGFVRAFNALHGGYGENGQIQ
GALDFYGIRYTGSGVLGSALGLDKFRTKLVWQQTGVPTPPFETVMRGDDYAARATDIVAKLGLPLFVKPASEGSSVAVLK
VKTADALPAALSEAATHDKIVIVEKSIEGGGEYTACIAGDLDLPLIKIVPAGEFYDYHAKYVANDTQYLIPCGLPAEQET
ELKRIARRAFDVLGCTDWGRADFMLDAAGNAYFLEVNTAPGMTDHSLPPKAARSIGIGYSELVVKVLSLTLND
>A3NZL3 6.3.2.4~~~ddl~~~D-alanine--D-alanine ligase~~~
MSGIDPKRFGKVAVLLGGDSAEREVSLNSGRLVLQGLRDAGIDAHPFDPAQRPLAALKDEGFVRAFNALHGGYGENGQIQ
GALDFYGIRYTGSGVLGSALGLDKFRTKLVWQQTGIPTPPFETVMRGDDYAARAQDIVAKLGVPLFVKPASEGSSVAVEK
VKSADALPAALEEAAKHDKIVIVEKSIEGGGEYTACIAADLDLPLIRIVPAGEFYDYHAKYIANDTQYLIPCGLDAAKEA
EFKRIARRAFDVLGCTDWGRADFMLDAAGNPYFLEVNTAPGMTDHSLPPKAARAVGIGYSELVVKVLSLTLD
>Q3JNE1 6.3.2.4~~~ddl~~~D-alanine--D-alanine ligase~~~
MSGIDPKRFGKVAVLLGGDSAEREVSLNSGRLVLQGLRDAGIDAHPFDPAQRPLAALKDEGFVRAFNALHGGYGENGQIQ
GALDFYGIRYTGSGVLGSALGLDKFRTKLVWQQTGIPTPPFETVMRGDDYAARAQDIVAKLGVPLFVKPASEGSSVAVEK
VKSADALPAALEEAAKHDKIVIVEKSIEGGGEYTACIAADLDLPLIRIVPAGEFYDYHAKYIANDTQYLIPCGLDAAKEA
EFKRIARRAFDVLGCTDWGRADFMLDAAGNPYFLEVNTAPGMTDHSLPPKAARAVGIGYSELVVKVLSLTLD
>Q83BZ9 6.3.2.4~~~ddl~~~D-alanine--D-alanine ligase~~~COG1181
MAEKLHISVLCGGQSTEHEISIQSAKNIVNTLDAAKYLISVIFIDHVGRWYLIDQPEMFLAHSPDHLVKEGSARPITIAF
GDAAKPWQSLNGDGRRYSADCVFPMVHGTQGEDGALQGLLELLNLPYVGANVQSSAVCMEKDLTKTVLRAGGIPVVDWHT
LSPRDATEGVYQRLLDRWGTSELFVKAVSLGSSVATLPVKTETEFTKAVKEVFRYDDRLMVEPRIRGREIECAVLGNGAP
KASLPGEIIPHHDYYSYDAKYLDPNGATTTTSVDLSESVTKQIQQIAIDAFKMVHCSGMARVDFFVTPNNKVLVNEINTI
PGFTNISMYPKMWEASGLPCPNLLDQLIELAIDRHQEQQKLIRCYEVKARSL
>P56191 6.3.2.4~~~ddl~~~D-alanine--D-alanine ligase~~~COG1181
MEFCVLFGGASFEHEISIVSAIALKGVLKDRIKYFIFLDENHHFYLIEESNMHSKYFAQIKEKKLPPLILTHNGLLKNSF
LGAKIIELPLVINLVHGGDGEDGKLASLLEFYRIAFIGPRIEASVLSYNKYLTKLYAKDLGVKTLDHVLLNEKNRANALD
LMNFNFPFIIKPNNAGSSLGVNVVKEEKELVYALDGAFEYSKEVLIEPFIQGVKEYNLAGCKIKKDFCFSYVEEPNKQEF
LDFKQKYLDFSRNKAPKANLSNALEEQLKENFKKLYNDLFDGAIIRCDFFVIKNEVYLNEINPIPGSLANYLFDDFKTTL
ENLAQSLPKTPKIQIKNSYLLQIQKNK
>Q03ZI1 6.3.2.4~~~ddl~~~D-alanine--D-alanine ligase~~~COG1181
MTKKRVALIFGGNSSEHDVSKRSAQNFYNAIEATGKYEIIVFAIAQNGFFLDTESSKKILALEDEQPIVDAFMKTVDASD
PLARIHALKSAGDFDIFFPVVHGNLGEDGTLQGLFKLLDKPYVGAPLRGHAVSFDKALTKELLTVNGIRNTKYIVVDPES
ANNWSWDKIVAELGNIVFVKAANQGSSVGISRVTNAEEYTEALSDSFQYDYKVLIEEAVNGARELEVGVIGNDQPLVSEI
GAHTVPNQGSGDGWYDYNNKFVDNSAVHFEIPAQLSPEVTKEVKQMALDAYKVLNLRGEARMDFLLDENNVPYLGEPNTL
PGFTNMSLFKRLWDYSDINNAKLVDMLIDYGFEDFAQNKKLSYSFVSLGEEKIGKFN
>P9WP31 6.3.2.4~~~ddl~~~D-alanine--D-alanine ligase~~~COG1181
MSANDRRDRRVRVAVVFGGRSNEHAISCVSAGSILRNLDSRRFDVIAVGITPAGSWVLTDANPDALTITNRELPQVKSGS
GTELALPADPRRGGQLVSLPPGAGEVLESVDVVFPVLHGPYGEDGTIQGLLELAGVPYVGAGVLASAVGMDKEFTKKLLA
ADGLPVGAYAVLRPPRSTLHRQECERLGLPVFVKPARGGSSIGVSRVSSWDQLPAAVARARRHDPKVIVEAAISGRELEC
GVLEMPDGTLEASTLGEIRVAGVRGREDSFYDFATKYLDDAAELDVPAKVDDQVAEAIRQLAIRAFAAIDCRGLARVDFF
LTDDGPVINEINTMPGFTTISMYPRMWAASGVDYPTLLATMIETTLARGVGLH
>Q13TZ4 6.3.2.4~~~ddl~~~D-alanine--D-alanine ligase~~~COG1181
MSSIDPKQFGKVAVLLGGNSAEREVSLNSGRLVLQGLRDAGIDAHPFDPAERPLAALKEEGFVRAFNALHGGYGENGQIQ
GALDFYGIRYTGSGVLGSALGLDKFRTKLVWQQLGIPTPPFEAVLRGDDYEARAKEIVAKLGLPLFVKPASEGSSVAVIK
VKSADALPAALIEAVKFDRIVVVEKSIEGGGEYTACIAGNLDLPVIRIVPAGEFYDYHAKYIANDTQYLIPCGLTADEEA
RLKVLARRAFDVLGCTDWGRADFMLDADGNPYFLEVNTAPGMTDHSLPPKAARAVGISYQELVVAVLALTLKD
>Q2FWH3 6.3.2.4~~~ddl~~~D-alanine--D-alanine ligase~~~COG1181
MTKENICIVFGGKSAEHEVSILTAQNVLNAIDKDKYHVDIIYITNDGDWRKQNNITAEIKSTDELHLENGEALEISQLLK
ESSSGQPYDAVFPLLHGPNGEDGTIQGLFEVLDVPYVGNGVLSAASSMDKLVMKQLFEHRGLPQLPYISFLRSEYEKYEH
NILKLVNDKLNYPVFVKPANLGSSVGISKCNNEAELKEGIKEAFQFDRKLVIEQGVNAREIEVAVLGNDYPEATWPGEVV
KDVAFYDYKSKYKDGKVQLQIPADLDEDVQLTLRNMALEAFKATDCSGLVRADFFVTEDNQIYINETNAMPGFTAFSMYP
KLWENMGLSYPELITKLIELAKERHQDKQKNKYKID
>Q5HEB7 6.3.2.4~~~ddl~~~D-alanine--D-alanine ligase~~~
MTKENICIVFGGKSAEHEVSILTAQNVLNAIDKDKYHVDIIYITNDGDWRKQNNITAEIKSTDELHLENGEALEISQLLK
ESSSGQPYDAVFPLLHGPNGEDGTIQGLFEVLDVPYVGNGVLSAASSMDKLVMKQLFEHRGLPQLPYISFLRSEYEKYEH
NILKLVNDKLNYPVFVKPANLGSSVGISKCNNEAELKEGIKEAFQFDRKLVIEQGVNAREIEVAVLGNDYPEATWPGEVV
KDVAFYDYKSKYKDGKVQLQIPADLDEDVQLTLRNMALEAFKATDCSGLVRADFFVTEDNQIYINETNAMPGFTAFSMYP
KLWENMGLSYPELITKLIELAKERHQDKQKNKYKID
>P63892 6.3.2.4~~~ddl~~~D-alanine--D-alanine ligase~~~
MTKENICIVFGGKSAEHEVSILTAQNVLNAIDKDKYHVDIIYITNDGDWRKQNNITAEIKSTDELHLENGEALEISQLLK
ESSSGQPYDAVFPLLHGPNGEDGTIQGLFEVLDVPYVGNGVLSAASSMDKLVMKQLFEHRGLPQLPYISFLRSEYEKYEH
NILKLVNDKLNYPVFVKPANLGSSVGISKCNNEAELKEGIKEAFQFDRKLVIEQGVNAREIEVAVLGNDYPEATWPGEVV
KDVAFYDYKSKYKDGKVQLQIPADLDEDVQLTLRNMALEAFKATDCSGLVRADFFVTEDNQIYINETNAMPGFTAFSMYP
KLWENMGLSYPELITKLIELAKERHQDKQKNKYKID
>P95803 6.3.2.4~~~ddl~~~D-alanine--D-alanine ligase~~~COG1181
MSKETLVLLYGGRSAERDVSVLSAESVMRAINYDNFLVKTYFITQAGDFIKTQEFDSQPSETDKLMTNDTIIASQKIKPS
DIYEEEAVVFPVLHGPMGEDGSIQGFLEVLKMPYVGTNILSSSVAMDKITTNQVLESATTIPQVAYVALIEGEPLESKLA
EVEEKLIYPVFVKPANMGSSVGISKAENRTDLKQAIALALKYDSRVLIEQGVDAREIEVGILGNTDVKTTLPGEIVKDVA
FYDYEAKYIDNKITMAIPAEIDPVIVEKMRDYAATAFRTLGCCGLSRCDFFLTEDGKVYLNELNTMPGFTQWSMYPLLWE
NMGLSYSVLIEELVSLAKEMFDKRESHLV
>Q5SHZ3 6.3.2.4~~~ddl~~~D-alanine--D-alanine ligase~~~COG1181
MRVLLIAGGVSPEHEVSLLSAEGVLRHIPFPTDLAVIAQDGRWLLGEKALTALEAKAAPEGEHPFPPPLSWERYDVVFPL
LHGRFGEDGTVQGFLELLGKPYVGAGVAASALCMDKDLSKRVLAQAGVPVVPWVAVRKGEPPVVPFDPPFFVKPANTGSS
VGISRVERFQDLEAALALAFRYDEKAVVEKALSPVRELEVGVLGNVFGEASPVGEVRYEAPFYDYETKYTPGRAELLIPA
PLDPGTQETVQELALKAYKVLGVRGMARVDFFLAEGELYLNELNTIPGFTPTSMYPRLFEAGGVAYPELLRRLVELALT
>Q9KM17 6.3.2.4~~~ddl~~~D-alanine--D-alanine ligase~~~COG1181
MTKTTILLLCGGGSSEHEISLVSANYIQQQLELTPEFHVIRVEMKKEGWFSEQGALVYLDTNSATLNSDKASYPIDFVVP
CIHGFPGETGDIQSMLELAGIPYLGCGPEASANSFNKITSKLWYDALDIPNTPYLFLTQNTPSSIDKAKQAFGHWGSIFV
KAARQGSSVGCYKVTTEDQIAPAIEAAFGFSEQVLVEQAVKPRELEVSAYEMNGKLYISKPGEVIAPEGTFYSYEEKYSA
NSHARTVLEAENLTEKHKELIQTYAERVFIHMKLRHLSRIDFFLTQEGQIYLNEVNTFPGMTPISMFPKMLEHNGHRFSE
FLVQCVTNTLVNAK
>Q8ZIE7 6.3.2.4~~~ddl~~~D-alanine--D-alanine ligase~~~COG1181
MAEKVAVLLGGTSAEREVSLLSGQAVLAGLKEAGIDAYGVDTKDFPVTQLKEQGFDKVFIALHGRGGEDGTLQGVLEFLQ
LPYTGSGVMASALTMDKLRTKLVWQALGLPISPYVALNRQQFETLSPEELVACVAKLGLPLIVKPSHEGSSVGMSKVDHA
SELQKALVEAFQHDSDVLIEKWLSGPEFTVAILGDEVLPSIRIQPPGVFYDYDAKYLSDKTQYFCPSGLSDESEQQLAAL
ALQAYHALDCSGWGRVDVMQDRDGHFYLLEVNTSPGMTSHSLVPMAARQYGLSFSQLVARILMLAD
>Q5S3I2 1.18.1.3~~~ddmA1~~~Dicamba O-demethylase 1, ferredoxin reductase component~~~
MSKADVVIVGAGHGGAQCAIALRQNGFEGTITVIGREPEYPYERPPLSKEYFAREKTFDRLYIRPPTFWAEKNIEFKLGT
EVTKVDPKAHELTLSNGESYGYGKLVWATGGDPRRLSCQGADLTGIHAVRTREDCDTLMAEVDAGTKNIVVIGGGYIGLE
AAAVLSKMGLKVTLLEALPRVLARVAGEDLSTFYQKEHVDHGVDLRTEVMVDSLVGENGKVTGVQLAGGEVIPAEGVIVG
IGIVPAVGPLIAAGAAGANGVDVDEYCRTSLPDIYAIGDCAAFACDYAGGNVMRVESVQNANDMGTCVAKAICGDEKPYK
AFPWFWSNQYDLKLQTAGINLGFDKTVIRGNPEERSFSVVYLKDGRVVALDCVNMVKDYVQGRKLVEAGATPDLEALADA
GKPLKELL
>Q5S3I1 1.18.1.3~~~ddmA2~~~Dicamba O-demethylase 2, ferredoxin reductase component~~~
MQRADVVIVGAGHGGAQCAIALRQNGFEGTITVIGREPEYPYERPPLSKEYFAREKTFDRLYIRPPTFWAEKNIEFKLGT
EVTKVDPKAHELTLSNGESYGYGKLVWATGGDPRRLSCQGADLTGIHAVRTREDCDTLMAEVDAGTKNIVVIGGGYIGLE
AAAVLSKMGLKVTLLEALPRVLARVAGEDLSTFYQKEHVDHGVDLRTEVMVDSLVGENGKVTGVQLAGGEVIPAEGVIVG
IGIVPAIGPLIAAGAAGANGVDVDEYCRTSLPDIYAIGDCAAFACDYAGGNVMRVESVQNANDMGTCVAKAICGDEKPYK
AFPWFWSNQYDLKLQTAGINLGFDKTVIRGNPEERSFSVVYLKDGRVVALDCVNMVKDYVQGRKLVEAGATPDLEALADA
GKPLKELQY
>Q5S3I4 ~~~ddmB~~~Dicamba O-demethylase, ferredoxin component~~~
MPQITVVNQSGEESSVEASEGRTLMEVIRDSGFDELLALCGGCCSCATCHVHIDPAFMDKLPEMSEDENDLLDSSDHRNE
YSRLSCQIPVTGALEGIKVTIAQED
>Q5S3I3 1.14.15.-~~~ddmC~~~Dicamba O-demethylase, oxygenase component~~~
MTFVRNAWYVAALPEELSEKPLGRTILDTPLALYRQPDGVVAALLDICPHRFAPLSDGILVNGHLQCPYHGLEFDGGGQC
VHNPHGNGARPASLNVRSFPVVERDALIWIWPGDPALADPGAIPDFGCRVDPAYRTVGGYGHVDCNYKLLVDNLMDLGHA
QYVHRANAQTDAFDRLEREVIVGDGEIQALMKIPGGTPSVLMAKFLRGANTPVDAWNDIRWNKVSAMLNFIAVAPEGTPK
EQSIHSRGTHILTPETEASCHYFFGSSRNFGIDDPEMDGVLRSWQAQALVKEDKVVVEAIERRRAYVEANGIRPAMLSCD
EAAVRVSREIEKLEQLEAA
>P9WP15 1.-.-.-~~~ddn~~~Deazaflavin-dependent nitroreductase~~~COG0748
MPKSPPRFLNSPLSDFFIKWMSRINTWMYRRNDGEGLGGTFQKIPVALLTTTGRKTGQPRVNPLYFLRDGGRVIVAASKG
GAEKNPMWYLNLKANPKVQVQIKKEVLDLTARDATDEERAEYWPQLVTMYPSYQDYQSWTDRTIPIVVCEP
>P76128 ~~~ddpA~~~Probable D,D-dipeptide-binding periplasmic protein DdpA~~~COG0747
MKRSISFRPTLLALVLATNFPVAHAAVPKDMLVIGKAADPQTLDPAVTIDNNDWTVTYPSYQRLVQYKTDGDKGSTDVEG
DLASSWKASDDQKEWTFTLKDNAKFADGTPVTAEAVKLSFERLLKIGQGPAEAFPKDLKIDAPDEHTVKFTLSQPFAPFL
YTLANDGASIINPAVLKEHAADDARGFLAQNTAGSGPFMLKSWQKGQQLVLVPNPHYPGNKPNFKRVSVKIIGESASRRL
QLSRGDIDIADALPVDQLNALKQENKVNVAEYPSLRVTYLYLNNSKAPLNQADLRRAISWSTDYQGMVNGILSGNGKQMR
GPIPEGMWGYDATAMQYNHDETKAKAEWDKVTSKPTSLTFLYSDNDPNWEPIALATQSSLNKLGIIVKLEKLANATMRDR
VGKGDYDIAIGNWSPDFADPYMFMNYWFESDKKGLPGNRSFYENSEVDKLLRNALATTDQTQRTRDYQQAQKIVIDDAAY
VYLFQKNYQLAMNKEVKGFVFNPMLEQVFNINTMSK
>P77308 ~~~ddpB~~~Probable D,D-dipeptide transport system permease protein DdpB~~~COG0601
MTFWSILRQRCWGLVLVVAGVCVITFIISHLIPGDPARLLAGDRASDAIVENIRQQLGLDQPLYVQFYRYVSDLFHGDLG
TSIRTGRPVLEELRIFFPATLELAFGALLLALLIGIPLGILSAVWRNRWLDHLVRIMAITGISTPAFWLGLGVIVLFYGH
LQILPGGGRLDDWLDPPTHVTGFYLLDALLEGNGEVFFNALQHLILPALTLAFVHLGIVARQIRSAMLEQLSEDYIRTAR
ASGLPGWYIVLCYALPNALIPSITVLGLALGDLLYGAVLTETVFAWPGMGAWVVTSIQALDFPAVMGFAVVVSFAYVLVN
LVVDLLYLWIDPRIGRGGGE
>P77463 ~~~ddpC~~~Probable D,D-dipeptide transport system permease protein DdpC~~~COG1173
MMLSEETSAVRPQKQTRFNGAKLVWMLKGSPLTVTSAVIIVLMLLMMIFSPWLATHDPNAIDLTARLLPPSAAHWFGTDE
VGRDLFSRVLVGSQQSILAGLVVVAIAGMIGSLLGCLSGVLGGRADAIIMRIMDIMLSIPSLVLTMALAAALGPSLFNAM
LAIAIVRIPFYVRLARGQALVVRQYTYVQAAKTFGASRWHLINWHILRNSLPPLIVQASLDIGSAILMAATLGFIGLGAQ
QPSAEWGAMVANGRNYVLDQWWYCAFPGAAILLTAVGFNLFGDGIRDLLDPKAGGKQS
>P77268 ~~~ddpD~~~Probable D,D-dipeptide transport ATP-binding protein DdpD~~~COG0444
MTQPVLDIQQLHLSFPGFNGDVHALNNVSLQINRGEIVGLVGESGSGKSVTAMLIMRLLPTGSYCVHRGQISLLGEDVLN
AREKQLRQWRGARVAMIFQEPMTALNPTRRIGLQMMDVIRHHQPISRREARAKAIDLLEEMQIPDAVEVMSRYPFELSGG
MRQRVMIALAFSCEPQLIIADEPTTALDVTVQLQVLRLLKHKARASGTAVLFISHDMAVVSQLCDSVYVMYAGSVIESGV
TADVIHHPRHPYTIGLLQCAPEHGVPRQLLPAIPGTVPNLTHLPDGCAFRDRCYAAGAQCENVPALTACGDNNQRCACWY
PQQEVISV
>P77622 ~~~ddpF~~~Probable D,D-dipeptide transport ATP-binding protein DdpF~~~COG4608
MSDTLLTLRDVHINFPARKNWLGKTTEHVHAINGIDLQIRRGETLGIVGESGCGKSTLAQLLMGMLQPSHGQYIRSGSQR
IMQMVFQDPLSSLNPRLPVWRIITEPLWIAKRSSEQQRRALAEELAVQVGIRPEYLDRLPHAFSGGQRQRIAIARALSSQ
PDVIVLDEPTSALDISVQAQILNLLVTLQENHGLTYVLISHNVSVIRHMSDRVAVMYLGQIVELGDAQQVLTAPAHPYTR
LLLDSLPAIDKPLEEEWALRKTDLPGNRTLPQGCFFYERCPLATHGCEVRQSLAIREDGRELRCWRAL
>P77790 3.4.13.22~~~ddpX~~~D-alanyl-D-alanine dipeptidase~~~COG2173
MSDTTELVDLAVIFPDLEIELKYACADNITGKAIYQQARCLLHKDAITALAKSISIAQLSGLQLVIYDAYRPQQAQAMLW
QACPDPQYVVDVTVGSNHSRGTAIDLTLRDEHGNILDMGAGFDEMHERSHAYHPSVPPAAQRNRLLLNAIMTGGGFVGIS
SEWWHFELPQAASYPLLADQFSCFISPGTQHVS
>P74268 3.4.13.22~~~ddpX~~~D-alanyl-D-alanine dipeptidase~~~COG2173
MSFDTPLKPYLAIPIQDCGEPLAPINLEGVKSLKPHPYAQVGADYQGRSPYVLRTGVLKRLDQARLTLADIEPSWEILVF
DAYRPIAVQQFMVDHTFAEIVARDGLQGQVLTPEQKENIYHQVYQIWAVPNNNPLTPPPHSTGAALDITLLDDLGQPVDM
GGEIDELSARSLPNYYQTVEPNSDRQRKEFEQYQRRRELLNTIMESAGFLRHPGEWWHFSQGDQLWAWQYNQRHPDHQKI
AYYGRVE
>C1D1R8 ~~~ddrA~~~Single-stranded DNA-binding protein DdrA~~~COG4712
MKLSDVQKRLQAPFPAHAVAWKPGVITKDRSRALMLAHIDARNVQDRLDAVCPDAWSFEVEVVPGTRLPTVKGRLTVLGV
SREDIGEAPEGDLGTLKAAASDALKRCAVQFGIGRYLYDLPKQWVAWNDAKREPVSPPELPEWARPDHERSPGGAHLVQA
MDQLRYEMPEDLELQREVYKHLKAALGSLHPISGGNQGRAA
>Q9RX92 ~~~ddrA~~~Single-stranded DNA-binding protein DdrA~~~COG4712
MKLSDVQKRLQAPFPAHTVSWKPAAFNAERTRALLLAHVDARAVQDRLDAVCPDDWSFEMEVVSGAEVPTVKGRLTVLGV
TREDIGEAPEGSMAAYKAAASDAMKRCAVQFGIGRYLYDLPKQWADWDDARRGPKHLPELPEWARPDHERTPGGAHLVQA
MEQLRYELPEDLDLQREVYKHLKAALGSIHPVPTGPVPTNPVQGGRAA
>O68195 ~~~ddrA~~~Diol dehydratase-reactivating factor large subunit~~~
MRYIAGIDIGNSSTEVALATLDEAGALTITHSALAETTGIKGTLRNVFGIQEALALVARGAGIAVSDISLIRINEATPVI
GDVAMETITETIITESTMIGHNPKTPGGAGLGTGITITPQELLTRPADAPYILVVSSAFDFADIASVINASLRAGYQITG
VILQRDDGVLVSNRLEKPLPIVDEVLYIDRIPLGMLAAIEVAVPGKVIETLSNPYGIATVFNLSPEETKNIVPMARALIG
NRSAVVVKTPSGDVKARAIPAGNLELLAQGRSVRVDVAAGAEAIMKAVDGCGRLDNVTGESGTNIGGMLEHVRQTMAELT
NKPSSEIFIQDLLAVDTSVPVSVTGGLAGEFSLEQAVGIASMVKSDRLQMAMIAREIEQKLNIDVQIGGAEAEAAILGAL
TTPGTTRPLAILDLGAGSTDASIINPKGDIIATHLAGAGDMVTMIIARELGLEDRYLAEEIKKYPLAKVESLFHLRHEDG
SVQFFSTPLPPAVFARVCVVKADELVPLPGDLALEKVRAIRRSAKERVFVTNALRALRQVSPTGNIRDIPFVVLVGGSSL
DFEVPQLVTDALAHYRLVAGRGNIRGSEGPRNAVATGLILSWHKEFAHER
>Q1J1N6 ~~~ddrB~~~Single-stranded DNA-binding protein DdrB~~~
MLHIEFITDLGAKVTVDVESADKLLDVQRQYGRLGWTSGEVPVGGYQFPLENEPDFDWSLIGARKWTNPEGEEMILHRGH
AYRRRELEAVDSRKMKLPAAVKYSRGAKNTDPEHVREKADGEFEYVTLAIFRGGKRQERYAVPGSNRPQAGAPARSAATR
AQGARPGAVAVQDEETPF
>Q9RY80 ~~~ddrB~~~Single-stranded DNA-binding protein DdrB~~~
MLQIEFITDLGARVTVNVEHESRLLDVQRHYGRLGWTSGEIPSGGYQFPIENEADFDWSLIGARKWKSPEGEELVIHRGH
AYRRRELEAVDSRKLKLPAAIKYSRGAKVSDPQHVREKADGDIEYVSLAIFRGGKRQERYAVPGGAAGNGQGRPAPQGQP
AQARPQATAARPAARPPVQPGQEEETPF
>O68196 ~~~ddrB~~~Diol dehydratase-reactivating factor small subunit~~~
MNGNHSAPAIAIAVIDGCDGLWREVLLGIEEEGIPFRLQHHPAGEVVDSAWQAARSSPLLVGIACDRHMLVVHYKNLPAS
APLFTLMHHQDSQAHRNTGNNAARLVKGIPFRDLNSEATGEQQDE
>Q9RYE6 ~~~ddrC~~~DNA damage response protein C~~~
MKNAPLTLNFGSVRLPVSADGLLHAPTAQQQLGLTQSWEAALVEHGLPETYRDFGAGPEAAVSVPDFVALAFALDTPEAR
RWQKRARELLARAMQGDVRVAAQIAERNPEPDARRWLAARLESTGARRELLATVARHGGEGRVYGQLGSISNRTVLGKDS
ASVRQERGVKATRDGLTSAELLRLAYIDTVTARAIQESEARGNAAILTLHEQVARSERQSWERAGQVQRVG
>Q9RXI7 ~~~ddrD~~~DNA damage response protein D~~~
MDTLKKAGTMLAHLDLFHSMLDLRRLLQLAAYMKERGDRAMLISAGEITLIGSESMTAPEVVTSKGETIDAATAYRVLGQ
LEGYEAPEYAVNREALAALNARAVAELEGSEALRAFGDTLARISAAPTDPAGPERPGTDRAERTAAERTASERATHDRAS
TERPARPRRSAEPEAVRTEDAPQPNAEASEAGENTPAA
>C1CYP4 ~~~ddrOC~~~HTH-type transcriptional regulator DdrOC~~~COG1396
MKLHERLRELRSERGLRLKDVAEVAQISVPYLSDLERGRTNPSLETLQTLAGAYNITVHDLLEGVEFYGASTEGALPKGL
SDLIADPTLGPQITPDWVRTLSRIELRGKRPRDKQDWYEIYLHLKRILS
>C1D3U5 ~~~ddrOP3~~~HTH-type transcriptional regulator DdrOP3~~~
MKLCERLRELRQERGLRLKDIAGAAQISVPYLSDLERGRTNPSLETLQSLASTYGITVHDLLEGVEFYGDQTAGALPQGL
ADLIADPALGAQLTPDWIRTLARIELRGKRPRDKQDWFEIYLHLKRILD
>Q99PX1 3.5.1.105~~~deaA~~~Chitin disaccharide deacetylase~~~COG0726
MKLNKLAIATLVSAALSQYAFAQTDTKGTIYLTFDDGPINASIDVINVLNQEEVKATFYFNAWHLDGIGDENEDRALEAL
KLALDSGHIVANHSYDHMVHNCVEEFGPNSAAECNATGDHQINSYQDPAYDASMFAENLSVLEKYLPNITSYPNYKANEF
ARLPYTNGWRVTKDFKADGLCATSDDLKPWEPGYSCDTANPSNSVKAAIAVQNILANNGYQTHGWDVDWAPENWGIAMPA
NSLTEAEPFLGYVDSALNTCAPTTINPINSKAQEFPCGTPLHADKVIVLTHEFLFEDGKRGMGATQNLPKLAKFIQLAKQ
AGYVFDTMDNYTPNWQVGNNYSAGDYVLHLGTVYQAVTSHTAQQDWAPSPTSSLWTNADPATNWTQNVSYKQGDVVTYQG
LRYLVNVPHVSQADWTPNSQNTLFTAL
>Q7CS13 3.1.1.-~~~~~~Deacetylase Atu3266~~~COG3964
MTSGEQAKTPLQAPILLTNVKPVGFGKGASQSSTDILIGGDGKIAAVGSALQAPADTQRIDAKGAFISPGWVDLHVHIWH
GGTDISIRPSECGAERGVTTLVDAGSAGEANFHGFREYIIEPSRERIKAFLNLGSIGLVACNRVPELRDIKDIDLDRILE
CYAENSEHIVGLKVRASHVITGSWGVTPVKLGKKIAKILKVPMMVHVGEPPALYDEVLEILGPGDVVTHCFNGKSGSSIM
EDEDLFNLAERCAGEGIRLDIGHGGASFSFKVAEAAIARGLLPFSISTDLHGHSMNFPVWDLATTMSKLLSVDMPFENVV
EAVTRNPASVIRLDMENRLDVGQRADFTVFDLVDADLEATDSNGDVSRLKRLFEPRYAVIGAEAIAASRYIPRARKLVRH
SHGYSWR
>A6X391 3.1.1.-~~~~~~Deacetylase Oant_2987~~~COG3964
MISGEQAKPLLITNVKPVAFGVEHSDATTDILVGKDGSISAIGKSLNAPADVERVDGKGAWISPGWVDLHVHIWHGGTDI
SIRPSECGAERGVTTLVDAGSAGEANFHGFREYIIEPSKERIKAFLNLGSIGLVACNRVPELRDIKDIDLDRILECYAAN
SEHIVGIKVRASHVITGSWGVTPVKLGKKIAKILKVPMMVHVGEPPALYDEVLEILGPGDVVTHCFNGKSGSSIMEDEDL
FNLAERCSGEGIRLDIGHGGASFSFKVAEAAIERGLLPFSISTDLHGHSMNFPVWDLATTMSKLLSVNMPFENVIEAVTH
NPASVIKLSMENRLSVGQRADFTIFDLVDADLEATDSNGDVSRLNRLFEPRYAVIGAEAITASRYIPRARKLVRHSHGYS
WR
>Q837K0 3.1.1.-~~~~~~Deacetylase EF_0837~~~COG3964
MDYDLLIKNGQTVNGMPVEIAIKEKKIAAVAATISGSAKETIHLEPGTYVSAGWIDDHVHCFEKMALYYDYPDEIGVKKG
VTTVIDAGTTGAENIHEFYDLAQQAKTNVFGLVNISKWGIVAQDELADLSKVQASLVKKAIQELPDFVVGIKARMSRTVI
GDNGITPLELAKQIQQENQEIPLMVHIGSAPPHLDEILALMEKGDVLTHCFNGKENGILDQATDKIKDFAWQAYNKGVVF
DIGHGTDSFNFHVAETALREGMKAASISTDIYIRNRENGPVYDLATTMEKLRVVGYDWPEIIEKVTKAPAENFHLTQKGT
LEIGKDADLTIFTIQAEEKTLTDSNGLTRVAKEQIRPIKTIIGGQIYDN
>P0A9P6 3.6.4.13~~~deaD~~~ATP-dependent RNA helicase DeaD~~~COG0513
MAEFETTFADLGLKAPILEALNDLGYEKPSPIQAECIPHLLNGRDVLGMAQTGSGKTAAFSLPLLQNLDPELKAPQILVL
APTRELAVQVAEAMTDFSKHMRGVNVVALYGGQRYDVQLRALRQGPQIVVGTPGRLLDHLKRGTLDLSKLSGLVLDEADE
MLRMGFIEDVETIMAQIPEGHQTALFSATMPEAIRRITRRFMKEPQEVRIQSSVTTRPDISQSYWTVWGMRKNEALVRFL
EAEDFDAAIIFVRTKNATLEVAEALERNGYNSAALNGDMNQALREQTLERLKDGRLDILIATDVAARGLDVERISLVVNY
DIPMDSESYVHRIGRTGRAGRAGRALLFVENRERRLLRNIERTMKLTIPEVELPNAELLGKRRLEKFAAKVQQQLESSDL
DQYRALLSKIQPTAEGEELDLETLAAALLKMAQGERTLIVPPDAPMRPKREFRDRDDRGPRDRNDRGPRGDREDRPRRER
RDVGDMQLYRIEVGRDDGVEVRHIVGAIANEGDISSRYIGNIKLFASHSTIELPKGMPGEVLQHFTRTRILNKPMNMQLL
GDAQPHTGGERRGGGRGFGGERREGGRNFSGERREGGRGDGRRFSGERREGRAPRRDDSTGRRRFGGDA
>P9WH05 3.6.4.13~~~deaD~~~ATP-dependent RNA helicase DeaD~~~COG0513
MAFPEYSPAASAATFADLQIHPRVLRAIGDVGYESPTAIQAATIPALMAGSDVVGLAQTGTGKTAAFAIPMLSKIDITSK
VPQALVLVPTRELALQVAEAFGRYGAYLSQLNVLPIYGGSSYAVQLAGLRRGAQVVVGTPGRMIDHLERATLDLSRVDFL
VLDEADEMLTMGFADDVERILSETPEYKQVALFSATMPPAIRKLSAKYLHDPFEVTCKAKTAVAENISQSYIQVARKMDA
LTRVLEVEPFEAMIVFVRTKQATEEIAEKLRARGFSAAAISGDVPQAQRERTITALRDGDIDILVATDVAARGLDVERIS
HVLNYDIPHDTESYVHRIGRTGRAGRSGAALIFVSPRELHLLKAIEKATRQTLTEAQLPTVEDVNTQRVAKFADSITNAL
GGPGIELFRRLVEEYEREHDVPMADIAAALAVQCRGGEAFLMAPDPPLSRRNRDQRRDRPQRPKRRPDLTTYRVAVGKRH
KIGPGAIVGAIANEGGLHRSDFGQIRIGPDFSLVELPAKLPRATLKKLAQTRISGVLIDLRPYRPPDAARRHNGGKPRRK
HVG
>P0ACJ5 ~~~decR~~~DNA-binding transcriptional activator DecR~~~COG1522
MLDKIDRKLLALLQQDCTLSLQALAEAVNLTTTPCWKRLKRLEDDGILIGKVALLDPEKIGLGLTAFVLIKTQHHSSEWY
CRFVTVVTEMPEVLGFWRMAGEYDYLMRVQVADMKRYDEFYKRLVNSVPGLSDVTSSFAMEQIKYTTSLPIE
>O51266 ~~~~~~Inner membrane protein BB_0250~~~
MTKMYINTIIEYIDSNIAYSPIVFFSLLILAGLNVPISEDAIVLMGGILSSRKNEYTILIFLGIFWGAYLGDIISFYIGK
LMGNKLFKNKKDNNLLDKINYYYGQYGVLTLFIGRFIPFGVRNAIFMSAGMGNMKSNLFIVSDFFATLLSIVVYFTLSFK
LGQSFEIIFSKIKIIIFAIFIAVIATTIIIYVIKKNKKVDKNLK
>P0ABP6 ~~~dedA~~~Protein DedA~~~COG0586
MDLIYFLIDFILHIDVHLAELVAEYGVWVYAILFLILFCETGLVVTPFLPGDSLLFVAGALASLETNDLNVHMMVVLMLI
AAIVGDAVNYTIGRLFGEKLFSNPNSKIFRRSYLDKTHQFYEKHGGKTIILARFVPIVRTFAPFVAGMGHMSYRHFAAYN
VIGALLWVLLFTYAGYFFGTIPMVQDNLKLLIVGIIVVSILPGVIEIIRHKRAAARAAK
>P09549 ~~~dedD~~~Cell division protein DedD~~~COG3147
MASKFQNRLVGTIVLVALGVIVLPGLLDGQKKHYQDEFAAIPLVPKAGDRDEPDMMPAATQALPTQPPEGAAEEVRAGDA
AAPSLDPATIAANNTEFEPEPAPVAPPKPKPVEPPKPKVEAPPAPKPEPKPVVEEKAAPTGKAYVVQLGALKNADKVNEI
VGKLRGAGYRVYTSPSTPVQGKITRILVGPDASKDKLKGSLGELKQLSGLSGVVMGYTPN
>Q819U0 3.5.1.88~~~def1~~~Peptide deformylase 1~~~
MAVLEIIKHPNEVLETPCERVINFDKKLVKLLKDMHETMLIADGVGLAAPQVGVSLQVAVVDVDDDTGKIELINPSILEK
RGEQVGPEGCLSFPGLYGEVERADYIKVRAQNRRGKVFLLEAEGFLARAIQHEIDHLHGVLFTSKVTRYYEENELE
>A0A0H3KB98 3.5.1.88~~~def1~~~Peptide deformylase 1~~~COG0242
MANAAHRFTEYRKTMALLNILHYPDKRLHKVAKPVDKVDDRIRKLVADMAETMYAAPGIGLAATQVDVHERVIVIDVSED
KNELRAFINPEIIWSSDGKQVYEEGCLSVPGIYDEVERPDRVRVRALNEQGETFELDCEGLLAVCIQHEMDHLMGRVFVE
YLSPLKQSRIKTKMKKLERAM
>Q9KVU3 3.5.1.88~~~def1~~~Peptide deformylase 1~~~COG0242
MSVLQVLTFPDDRLRTVAKPVEQVTPEIQQIVDDMLETMYAEEGIGLAATQVDIHQRIVVIDISETRDQPMVLINPEIIE
KRGEDGIEEGCLSVPGARALVPRAAEVTVKALDRNGQEYQFDADDLLAICVQHELDHLAGKLFVDYLSPLKRNRIKEKLE
KIKRFNEKK
>Q819K2 3.5.1.88~~~def2~~~Peptide deformylase 2~~~
MLTMKDVIREGDPILRNVAEEVSLPASEEDTTTLKEMIEFVINSQDPEMAEKYSLRPGIGLAAPQIGVSKKMIAVHVTDA
DGTLYSHALFNPKIISHSVERTYLQGGEGCLSVDREVPGYVPRYTRITVKATSINGEEVKLRLKGLPAIVFQHEIDHLNG
VMFYDHINKENPFAAPDDSKPLER
>Q45495 3.5.1.88~~~defB~~~Peptide deformylase 2~~~COG0242
MITMENIVRDGHPALRETAEPVELPPTDAEKQQLADMIEFVKNSQNPELAEKYKLRPGVGLAAPQINIKKRMIAVHAEDA
SGKLYSYALFNPKIVSHSVEKSYLTSGEGCLSVDEAIPGYVPRYARIRVKGTTLEGENIDIRLKGFPAIVFQHEIDHLNG
VMFYDHIDKENPFKEPENAIAIER
>O31410 3.5.1.88~~~~~~Peptide deformylase 2~~~
MITMKDIIKEGHPTLRKVAEPVPLPPSEEDKRILQSLLDYVKMSQDPELAAKYGLRPGIGLAAPQINVSKRMIAVHVTDE
NGTLYSYALFNPKIVSHSVQQCYLTTGEGCLSVDRDVPGYVLRYARITVTGTTLDGEEVTLRLKGLPAIVFQHEIDHLNG
IMFYDRINPADPFQVPDGAIPIGR
>Q9KN16 3.5.1.88~~~def2~~~Peptide deformylase 2~~~COG0242
MAVLEILTAPDPRLRVQSKQVTDVASVQTLIDDLLDTLYATDNGIGLAAPQVGREEAIVVIDLSDNRDQPLVLINPKVVS
GSNKEMGQEGCLSVPDYYADVERYTSVVVEALDREGKPLRIETSDFLAIVMQHEIDHLSGNLFIDYLSPLKQQMAMKKVK
KHVKNRAR
>B0VNL8 3.5.1.88~~~def~~~Peptide deformylase~~~
MALLPILSFPDPRLRTIAKPVEEVTDEIRQLAADMFETMYAAPGIGLAASQVDRHIQLIVMDLSESKDEPMVFINPKVTP
LTEETQPYEEGCLSVPQIYDKVDRPSRVKIEAINLEGQAFEIEADGLLAVCIQHEMDHLNGKLFVDYLSPLKRQRVREKV
EKIVRQREREKVAVKR
>P0A6K3 3.5.1.88~~~def~~~Peptide deformylase~~~COG0242
MSVLQVLHIPDERLRKVAKPVEEVNAEIQRIVDDMFETMYAEEGIGLAATQVDIHQRIIVIDVSENRDERLVLINPELLE
KSGETGIEEGCLSIPEQRALVPRAEKVKIRALDRDGKPFELEADGLLAICIQHEMDHLVGKLFMDYLSPLKQQRIRQKVE
KLDRLKARA
>Q82ZJ0 3.5.1.88~~~def~~~Peptide deformylase~~~COG0242
MITMKDIIREGNPTLRAVAEEVPVPITEEDRQLGEDMLTFLKNSQDPVKAEELQLRGGVGLAAPQLDISKRIIAVHVPSN
DPENETPSLSTVMYNPKILSHSVQDVCLGEGEGCLSVDRDVPGYVVRHNKITVSYFDMAGEKHKVRLKNYEAIVVQHEID
HINGIMFYDHINKENPFALKEGVLVIE
>Q4QMV6 3.5.1.88~~~def~~~Peptide deformylase~~~
MTALNVLIYPDDHLKVVCEPVTEVNDAIRKIVDDMFDTMYQEKGIGLAAPQVDILQRIITIDVEGDKQNQFVLINPEILA
SEGETGIEEGCLSIPGFRALVPRKEKVTVRALDRDGKEFTLDADGLLAICIQHEIDHLNGILFVDYLSPLKRQRIKEKLI
KYKKQIAKS
>Q93LE9 3.5.1.88~~~def~~~Peptide deformylase~~~
MSVRKILRMGDPILRKISEPVTEDEIQTKEFKKLIRDMFDTMRHAEGVGLAAPQIGILKQIVVVGSEDNERYPGTPDVPE
RIILNPVITPLTKDTSGFWEGCLSVPGMRGYVERPNQIRMQWMDEKGNQFDETIDGYKAIVYQHECDHLQGILYVDRLKD
TKLFGFNETLDSSHNVLD
>P9WIJ3 3.5.1.88~~~def~~~Peptide deformylase~~~COG0242
MAVVPIRIVGDPVLHTATTPVTVAADGSLPADLAQLIATMYDTMDAANGVGLAANQIGCSLRLFVYDCAADRAMTARRRG
VVINPVLETSEIPETMPDPDTDDEGCLSVPGESFPTGRAKWARVTGLDADGSPVSIEGTGLFARMLQHETGHLDGFLYLD
RLIGRYARNAKRAVKSHGWGVPGLSWLPGEDPDPFGH
>Q9I7A8 3.5.1.88~~~def~~~Peptide deformylase~~~
MAILNILEFPDPRLRTIAKPVEVVDDAVRQLIDDMFETMYEAPGIGLAATQVNVHKRIVVMDLSEDKSEPRVFINPEFEP
LTEDMDQYQEGCLSVPGFYENVDRPQKVRIKALDRDGNPFEEVAEGLLAVCIQHECDHLNGKLFVDYLSTLKRDRIRKKL
EKQHRQQA
>Q5HGZ3 3.5.1.88~~~def~~~Peptide deformylase~~~
MLTMKDIIRDGHPTLRQKAAELELPLTKEEKETLIAMREFLVNSQDEEIAKRYGLRSGVGLAAPQINISKRMIAVLIPDD
GSGKSYDYMLVNPKIVSHSVQEAYLPTGEGCLSVDDNVAGLVHRHNRITIKAKDIEGNDIQLRLKGYPAIVFQHEIDHLN
GVMFYDHIDKDHPLQPHTDAVEV
>P99077 3.5.1.88~~~def~~~Peptide deformylase~~~
MLTMKDIIRDGHPTLRQKAAELELPLTKEEKETLIAMREFLVNSQDEEIAKRYGLRSGVGLAAPQINISKRMIAVLIPDD
GSGKSYDYMLVNPKIVSHSVQEAYLPTGEGCLSVDDNVAGLVHRHNRITIKAKDIEGNDIQLRLKGYPAIVFQHEIDHLN
GVMFYDHIDKNHPLQPHTDAVEV
>P68826 3.5.1.88~~~def~~~Peptide deformylase~~~
MLTMKDIIRDGHPTLRQKAAELELPLTKEEKETLIAMREFLVNSQDEEIAKRYGLRSGVGLAAPQINISKRMIAVLIPDD
GSGKSYDYMLVNPKIVSHSVQEAYLPTGEGCLSVDDNVAGLVHRHNRITIKAKDIEGNDIQLRLKGYPAIVFQHEIDHLN
GVMFYDHIDKNHPLQPHTDAVEV
>Q8E378 3.5.1.88~~~def~~~Peptide deformylase~~~COG0242
MSAIDKLVKASHLIDMNDIIREGNPTLRKVAEEVTFPLSEKEEILGEKMMQFLKHSQDPIMAEKLGLRGGVGLAAPQLDI
SKRIIAVLVPNVEDAQGNPPKEAYSLQEVMYNPKVVSHSVQDAALSDGEGCLSVDREVPGYVVRHARVTIEYFDKTGEKH
RLKLKGYNSIVVQHEIDHIDGIMFYDRINEKNPFAVKEGLLILE
>Q8DWC2 3.5.1.88~~~def~~~Peptide deformylase~~~COG0242
MSAIKTITKASHLIDMNDIIREGHPTLRAVAQDVTFPLNEDDIILGEKMLQFLKNSQDPVTAEKMELRGGVGLAAPQLDI
SKRIIAVLIPNPEDKDGNPPKEAYALKEVMYNPRIIAHSVQDAALADGEGCLSVDRVVEGYVIRHSRVTIEYYDKNSDKK
KLKLKGYQSIVVQHEIDHTNGIMFFDRINEKNPFEIKEGLLLIE
>P68771 3.5.1.88~~~def~~~Peptide deformylase~~~
MSAQDKLIKPSHLITMDDIIREGNPTLRAVAKEVSLPLCDEDILLGEKMMQFLKHSQDPVMAEKLGLRAGVGLAAPQIDV
SKRIIAVLVPNLPDKEGNPPKEAYSWQEVLYNPKIVSHSVQDAALSDGEGCLSVDRVVEGYVVRHARVTVDYYDKEGQQH
RIKLKGYNAIVVQHEIDHINGVLFYDRINAKNPFETKEELLILD
>Q5X9V1 3.5.1.88~~~def~~~Peptide deformylase~~~
MSAQDKLIKPSHLITMDDIIREGNPTLRAVAKEVSLPLCDEDILLGEKMMQFLKHSQDPVMAEKLGLRAGVGLAAPQIDV
SKRIIAVLVPNLPDKEGNPPKEAYSWQEVLYNPKIVSHSVQDAALSDGEGCLSVDRVVEGYVVRHARVTVDYYDKEGQQH
RIKLKGYNAIVVQHEIDHINGVLFYDRINAKNPFETKEELLILD
>Q9F2F0 3.5.1.88~~~def~~~Peptide deformylase~~~COG0242
MSAIERITKAAHLIDMNDIIREGNPTLRAIAEEVTFPLSDQEIILGEKMMQFLKHSQDPVMAEKMGLRGGVGLAAPQLDI
SKRIIAVLVPNIVEEGETPQEAYDLEAIMYNPKIVSHSVQDAALGEGEGCLSVDRNVPGYVVRHARVTVDYFDKDGEKHR
IKLKGYNSIVVQHEIDHINGIMFYDRINEKDPFAVKDGLLILE
>Q8DP79 3.5.1.88~~~def~~~Peptide deformylase~~~COG0242
MSAIERITKAAHLIDMNDIIREGNPTLRTVAEEVTFPLSDQEIILGEKMMQFLKHSQDPVMAEKMGLRGGVGLAAPQLDI
SKRIIAVLVPNIVEEGETPQEAYDLEAIMYNPKIVSHSVQDAALGEGEGCLSVDRNVPGYVVRHARVTVDYFDKDGEKHR
IKLKGYNSIVVQHEIDHINGIMFYDRINEKDPFAVKDGLLILE
>P96113 3.5.1.88~~~def~~~Peptide deformylase~~~COG0242
MYRIRVFGDPVLRKRAKPVTKFDENLKKTIERMIETMYHYDGVGLAAPQVGISQRFFVMDVGNGPVAVINPEILEIDPET
EVAEEGCLSFPEIFVEIERSKRIKVKYQNTRGEYVEEELEGYAARVFQHEFDHLNGVLIIDRISPAKRLLLRKKLMDIAR
TVKR
>P43522 3.5.1.88~~~def~~~Peptide deformylase~~~
MVYPIRLYGDPVLRRKARPVEDFSGIKRLAEDMLETMFEAKGVGLAAPQIGLSQRLFVAVEYADEPEGEEERPLRELVRR
VYVVANPVITYREGLVEGTEGCLSLPGLYSEEVPRAERIRVEYQDEEGRGRVLELEGYMARVFQHEIDHLDGILFFERLP
KPKREAFLEANRAELVRFQKEARALLKELSQG
>P37947 ~~~degA~~~HTH-type transcriptional regulator DegA~~~COG1609
MKTTIYDVAKAAGVSITTVSRVINNTGRISDKTRQKVMNVMNEMAYTPNVHAAALTGKRTNMIALVAPDISNPFYGELAK
SIEERADELGFQMLICSTDYDPKKETKYFSVLKQKKVDGIIFATGIESHDSMSALEEIASEQIPIAMISQDKPLLPMDIV
VIDDVRGGYEAAKHLLSLGHTNIACIIGDGSTTGEKNRIKGFRQAMEEAGVPIDESLIIQTRFSLESGKEEAGKLLDRNA
PTAIFAFNDVLACAAIQAARIRGIKVPDDLSIIGFDNTILAEMAAPPLTTVAQPIKEMGAERHRTAGRSNRGKRKAKQKI
VLPPELVVRHSTSPLNT
>P0C0V0 3.4.21.107~~~degP~~~Periplasmic serine endoprotease DegP~~~COG0265
MKKTTLALSALALSLGLALSPLSATAAETSSATTAQQMPSLAPMLEKVMPSVVSINVEGSTTVNTPRMPRNFQQFFGDDS
PFCQEGSPFQSSPFCQGGQGGNGGGQQQKFMALGSGVIIDADKGYVVTNNHVVDNATVIKVQLSDGRKFDAKMVGKDPRS
DIALIQIQNPKNLTAIKMADSDALRVGDYTVAIGNPFGLGETVTSGIVSALGRSGLNAENYENFIQTDAAINRGNSGGAL
VNLNGELIGINTAILAPDGGNIGIGFAIPSNMVKNLTSQMVEYGQVKRGELGIMGTELNSELAKAMKVDAQRGAFVSQVL
PNSSAAKAGIKAGDVITSLNGKPISSFAALRAQVGTMPVGSKLTLGLLRDGKQVNVNLELQQSSQNQVDSSSIFNGIEGA
EMSNKGKDQGVVVNNVKTGTPAAQIGLKKGDVIIGANQQAVKNIAELRKVLDSKPSVLALNIQRGDSTIYLLMQ
>Q99039 ~~~degQ~~~Degradation enzyme regulation protein DegQ~~~
MEKKLEEVKQLLFRLELDIKETTDSLRNINKSIDQLDKYNYAMKIS
>P39099 3.4.21.107~~~degQ~~~Periplasmic pH-dependent serine endoprotease DegQ~~~COG0265
MKKQTQLLSALALSVGLTLSASFQAVASIPGQVADQAPLPSLAPMLEKVLPAVVSVRVEGTASQGQKIPEEFKKFFGDDL
PDQPAQPFEGLGSGVIINASKGYVLTNNHVINQAQKISIQLNDGREFDAKLIGSDDQSDIALLQIQNPSKLTQIAIADSD
KLRVGDFAVAVGNPFGLGQTATSGIVSALGRSGLNLEGLENFIQTDASINRGNSGGALLNLNGELIGINTAILAPGGGSV
GIGFAIPSNMARTLAQQLIDFGEIKRGLLGIKGTEMSADIAKAFNLDVQRGAFVSEVLPGSGSAKAGVKAGDIITSLNGK
PLNSFAELRSRIATTEPGTKVKLGLLRNGKPLEVEVTLDTSTSSSASAEMITPALEGATLSDGQLKDGGKGIKIDEVVKG
SPAAQAGLQKDDVIIGVNRDRVNSIAEMRKVLAAKPAIIALQIVRGNESIYLLMR
>P68731 ~~~degR~~~Regulatory protein DegR~~~
MDDKDLKLILHKTFIEIYSDLEELADIAKKGKPSMEKYVEEIEQRCKQNILAIEIQMKIK
>P13799 2.7.13.3~~~degS~~~Signal transduction histidine-protein kinase/phosphatase DegS~~~COG4585
MNKTKMDSKVLDSILMKMLKTVDGSKDEVFQIGEQSRQQYEQLVEELKQIKQQVYEVIELGDKLEVQTRHARNRLSEVSR
NFHRFSEEEIRNAYEKAHKLQVELTMIQQREKQLRERRDDLERRLLGLQEIIERSESLVSQITVVLNYLNQDLREVGLLL
ADAQAKQDFGLRIIEAQEEERKRVSREIHDGPAQMLANVMMRSELIERIFRDRGAEDGFQEIKNLRQNVRNALYEVRRII
YDLRPMALDDLGLIPTLRKYLYTTEEYNGKVKIHFQCIGETEDQRLAPQFEVALFRLAQEAVSNALKHSESEEITVKVEI
TKDFVILMIKDNGKGFDLKEAKEKKNKSFGLLGMKERVDLLEGTMTIDSKIGLGTFIMIKVPLSL
>P0AEE3 3.4.21.107~~~degS~~~Serine endoprotease DegS~~~COG0265
MFVKLLRSVAIGLIVGAILLVAMPSLRSLNPLSTPQFDSTDETPASYNLAVRRAAPAVVNVYNRGLNTNSHNQLEIRTLG
SGVIMDQRGYIITNKHVINDADQIIVALQDGRVFEALLVGSDSLTDLAVLKINATGGLPTIPINARRVPHIGDVVLAIGN
PYNLGQTITQGIISATGRIGLNPTGRQNFLQTDASINHGNSGGALVNSLGELMGINTLSFDKSNDGETPEGIGFAIPFQL
ATKIMDKLIRDGRVIRGYIGIGGREIAPLHAQGGGIDQLQGIVVNEVSPDGPAANAGIQVNDLIISVDNKPAISALETMD
QVAEIRPGSVIPVVVMRDDKQLTLQVTIQEYPATN
>P13800 ~~~degU~~~Transcriptional regulatory protein DegU~~~COG2197
MTKVNIVIIDDHQLFREGVKRILDFEPTFEVVAEGDDGDEAARIVEHYHPDVVIMDINMPNVNGVEATKQLVELYPESKV
IILSIHDDENYVTHALKTGARGYLLKEMDADTLIEAVKVVAEGGSYLHPKVTHNLVNEFRRLATSGVSAHPQHEVYPEIR
RPLHILTRRECEVLQMLADGKSNRGIGESLFISEKTVKNHVSNILQKMNVNDRTQAVVVAIKNGWVEMR
>P32436 ~~~degV~~~Protein DegV~~~COG1307
MNIAVVTDSTAYIPKEMREQHQIHMIPLQVVFREETYREEIELDWKSFYEEVKKHNELPTTSQPPIGELVALYEELGKSY
DAVISIHLSSGISGTFSSAAAADSMVDNIDVYPFDSEISCLAQGFYALKAAELIKNGASSPEDIIKELEEMKKTVRAYFM
VDDLAHLQRGGRLSSAQAFIGSLLKVKPILHFDNKVIVPFEKIRTRKKAISRIYELLDEDASKGLPMRAAVIHANREEEA
AKIIEELSAKYPHVEFYNSYFGAVIGTHLGEGALGICWCFK
>Q01398 3.8.1.3~~~dehH1~~~Haloacetate dehalogenase H-1~~~
MDFPGFKNSTVTVDGVDIAYTVSGEGPPVLMLHGFPQNRAMWARVAPQLAEHHTVVCADLRGYGDSDKPKCLPDRSNYSF
RTFAHDQLCVMRHLGFERFHLVGHDRGGRTGHRMALDHPEAVLSLTVMDIVPTYAMFMNTNRLVAASYWHWYFLQQPEPF
PEHMIGQDPDFFYETCLFGWGATKVSDFDQQMLNAYRESWRNPAMIHGSCSDYRAAATIDLEHDSADIQRKVECPTLVFY
GSKGQMGQLFDIPAEWAKRCNNTTNASLPGGHFFVDQFPAETSEILLKFLARNG
>Q01399 3.8.1.3~~~dehH2~~~Haloacetate dehalogenase H-2~~~
MKKIEAIAFDMYGTLYDVHSVVDACEKQYPGKGKDISVLWRQKQLEYAWLRCLMGQYIKFEEATANALTYTCNQMKLDCD
EGSAMRLTEEYLRLKPFPEVRGALRALRQRGMRLAILSNGSTETIHDVVHNSGVEGEFEHLISVDSARAYKPHPLAYELG
EEAFGISRESILFVSSNPWDVSGAKAFGYQVCWINRYGFAFDELGQTPDFTVPVMDAIVHLIAV
>Q1JU72 3.8.1.3~~~fac-dex~~~Fluoroacetate dehalogenase~~~
MFEGFERRLVDVGDVTINCVVGGSGPALLLLHGFPQNLHMWARVAPLLANEYTVVCADLRGYGGSSKPVGAPDHANYSFR
AMASDQRELMRTLGFERFHLVGHDRGGRTGHRMALDHPDSVLSLAVLDIIPTYVMFEEVDRFVARAYWHWYFLQQPAPYP
EKVIGADPDTFYEGCLFGWGATGADGFDPEQLEEYRKQWRDPAAIHGSCCDYRAGGTIDFELDHGDLGRQVQCPALVFSG
SAGLMHSLFEMQVVWAPRLANMRFASLPGGHFFVDRFPDDTARILREFLSDARSGIHQTERRES
>Q6NAM1 3.8.1.3~~~~~~Fluoroacetate dehalogenase~~~COG0596
MPDLADLFPGFGSEWINTSSGRIFARVGGDGPPLLLLHGFPQTHVMWHRVAPKLAERFKVIVADLPGYGWSDMPESDEQH
TPYTKRAMAKQLIEAMEQLGHVHFALAGHDRGARVSYRLALDSPGRLSKLAVLDILPTYEYWQRMNRAYALKIYHWSFLA
QPAPLPENLLGGDPDFYVKAKLASWTRAGDLSAFDPRAVEHYRIAFADPMRRHVMCEDYRAGAYADFEHDKIDVEAGNKI
PVPMLALWGASGIAQSAATPLDVWRKWASDVQGAPIESGHFLPEEAPDQTAEALVRFFSAAP
>Q0KBD2 1.1.1.410~~~denD~~~D-erythronate dehydrogenase~~~COG0451
MNVLITGGAGFLGLQLARLLLQRGTLNLDGQPVAIKRLTLLDVVAPQGLDDARVRVVTGDLSDPAVLRQAIDTDTGAVFH
LAAVVSGQAEADFDLGMRVNLDASRALLETCRELGHQPRVLFTSSVAVYGGQLPPVVQDDTALNPQSSYGVQKAIGELLL
SDYSRRGFVDGRVLRLPTISVRPGKPNAAASSFASGIIREPLSGVAANCPVAPETPLWLLSPRAAVAALVNGIELAGERL
GNRRVVNLPGLSVTAAGMIEALRRVAGNAVADRVTWEREARVENIVGTWPAAWNAERALALGFQSDASFDEVIRAYMEDA
GLAK
>P44094 1.1.1.410~~~denD~~~D-erythronate dehydrogenase~~~COG0451
MKVVITGGQGFLGQRLAKTLLAQNNVHIDDLILIDVVKPIAPNNDPRVRCYEMNLRYPTGLDELITEETDAIFHLAAIVS
SHAEQDPDLGYETNFLATRNILEICRKNNPKVRFIFSSSLAIFGGELPETILDSTAFTPQSTYGTQKAMCELLINDYSRK
GFVDGIVVRLPTICIRPGKPNKAASSFVSSIMREPLHGEDAVCPVSEELRLWLSSPNTVVANFIHALQLPSLPLRSWHTI
NLPGFSVTVKQMLSDLTQVKGEAILEHIKFEFDESINNIVASWPSRIDNTQALALGFKVDSNFQNVIQQFIEYDM
>B0TBI9 2.7.1.220~~~denK~~~D-erythronate kinase~~~COG3395
MSNVVIIADDLTGANATGVLLARKGYKTATFLQLPQDPLENGNRFDVISITTDSRAVAPEEAYRRVAEAARAMLGNKPGL
FTKRIDSTLRGNLGPEIDAMLDVLGPDSLAVVVAAFPTSGRITVGGYLLVHSIPLEQTDVARDPKTPVHQTLVADIVAAQ
SKHSVGFIPLATVLQGSTAVMEALGAQKEAGKRIVVMDAATQKDLDTIAHGAYLSGLSVVAVDPGPFTEALAAYVLPKPK
QGRGKKVLMVVGSVTALTRQQLKAVENAYSTCFTTVDVHALIDPWRNAEEIERVSGEVLDHLDDHQVLGVRTVEEAGQVL
DLASVALAYMISEEEIASRIADGLAAIARRVLQVSHGEVGGLYTSGGDVTVAVCQALAASGVEVKDEVVPLAAYGRLIGG
AFHQTPIITKGGLVGNSDAACTCVDYLLTKISNETYPAE
>Q818Z9 5.4.2.7~~~deoB~~~Phosphopentomutase~~~
MNKYKRIFLVVMDSVGIGEAPDAEQFGDLGSDTIGHIAEHMNGLQMPNMVKLGLGNIREMKGISKVEKPLGYYTKMQEKS
TGKDTMTGHWEIMGLYIDTPFQVFPEGFPKELLDELEEKTGRKIIGNKPASGTEILDELGQEQMETGSLIVYTSADSVLQ
IAAHEEVVPLDELYKICKIARELTLDEKYMVGRVIARPFVGEPGNFTRTPNRHDYALKPFGRTVMNELKDSDYDVIAIGK
ISDIYDGEGVTESLRTKSNMDGMDKLVDTLNMDFTGLSFLNLVDFDALFGHRRDPQGYGEALQEYDARLPEVFAKLKEDD
LLLITADHGNDPIHPGTDHTREYVPLLAYSPSMKEGGQELPLRQTFADIGATVAENFGVKMPEYGTSFLNELKK
>P0A6K6 5.4.2.7~~~deoB~~~Phosphopentomutase~~~COG1015
MKRAFIMVLDSFGIGATEDAERFGDVGADTLGHIAEACAKGEADNGRKGPLNLPNLTRLGLAKAHEGSTGFIPAGMDGNA
EVIGAYAWAHEMSSGKDTPSGHWEIAGVPVLFEWGYFSDHENSFPQELLDKLVERANLPGYLGNCHSSGTVILDQLGEEH
MKTGKPIFYTSADSVFQIACHEETFGLDKLYELCEIAREELTNGGYNIGRVIARPFIGDKAGNFQRTGNRHDLAVEPPAP
TVLQKLVDEKHGQVVSVGKIADIYANCGITKKVKATGLDALFDATIKEMKEAGDNTIVFTNFVDFDSSWGHRRDVAGYAA
GLELFDRRLPELMSLLRDDDILILTADHGCDPTWTGTDHTREHIPVLVYGPKVKPGSLGHRETFADIGQTLAKYFGTSDM
EYGKAMF
>P99100 5.4.2.7~~~deoB~~~Phosphopentomutase~~~
MTRPFNRVHLIVMDSVGIGEAPDAADFKDEGSHTLRHTLEGFDQTLPNLEKLGLGNIDKLPVVNAVEQPEAYYTKLSEAS
VGKDTMTGHWEIMGLNIMQPFKVYPNGFPEELIQQIEEMTGRKVVANKPASGTQIIDEWGEHQMKTGDLIVYTSADPVLQ
IAAHEDIIPLEELYDICEKVRELTKDPKYLIGRIIARPYVGEPGNFTRTSNRHDYALKPFGKTVLDHLKDGGYDVIAIGK
INDIYDGEGVTEAVRTKSNMDGMDQLMKIVKKDFTGISFLNLVDFDALYGHRRDKPGYAQAIKDFDDRLPELFSNLKEDD
LVIITADHGNDPTAPGTDHTREYIPVIMYSPKFKGGHALESDTTFSSIGATIADNFNVTLPEFGKSYLKELK
>Q8DTU0 5.4.2.7~~~deoB~~~Phosphopentomutase~~~COG1015
MSTFNRIHLVVLDSVGIGAAPDANNFSNAGVPDGASDTLGHISKTVGLNVPNMAKIGLGNIPRDTPLKTVPAENHPTGYV
TKLEEVSLGKDTMTGHWEIMGLNITEPFDTFWNGFPEEIISKIEKFSGRKVIREANKPYSGTAVIDDFGPRQMETGELII
YTSADPVLQIAAHEDVIPLDELYRICEYARSITLERPALLGRIIARPYVGKPRNFTRTANRHDYALSPFAPTVLNKLADA
GVSTYAVGKINDIFNGSGITNDMGHNKSNSHGVDTLIKTMGLSAFTKGFSFTNLVDFDALYGHRRNAHGYRDCLHEFDER
LPEIIAAMKVDDLLLITADHGNDPTYAGTDHTREYVPLLAYSPSFTGNGVLPVGHYADISATIADNFGVDTAMIGESFLD
KLI
>Q8DQD0 5.4.2.7~~~deoB~~~Phosphopentomutase~~~COG1015
MSKFNRIHLVVLDSVGIGAAPDANNFVNAGVPDGASDTLGHISKTVGLNVPNMAKIGLGNIPRETPLKTVAAESNPTGYA
TKLEEVSLGKDTMTGHWEIMGLNITEPFDTFWNGFPEEILTKIEEFSGRKVIRETNKPYSGTAVIYDFGPRQMETGELII
YTSADPVLQIAAHEDIIPLDELYRICEYARSITLERPALLGRIIARPYVGEPGNFTRTANRRDLAVSPFSPTVLDKLNEA
GIDTYAVGKINDIFNGAGINHDMGHNKSNSHGIDTLLKTMGLAEFEKGFSFTNLVDFDALYGHRRNAHGYRDCLHEFDER
LPEIIAAMRENDLLLITADHGNDPTYAGTDHTREYIPLLAYSPAFKGNGLIPVGHFADISATVADNFGVETAMIGESFLD
KLV
>P99102 4.1.2.4~~~deoC1~~~Deoxyribose-phosphate aldolase 1~~~
MKFEKYIDHTLLKPESTRTQIDQIIDEAKAYNFKSVCVNPTHVKYAAERLADSEVLVCTVIGFPLGASTTATKAFETEDA
IQNGADEIDMVINIGALKDGRFDDVQQDIEAVVKAAKGHTVKVIIETVLLDHDEIVKASELTKAAGADFVKTSTGFAGGG
ATAEDVKLMKDTVGADVEVKASGGVRNLEDFNKMVEAGATRIGASAGVQIMQGLEADSDY
>P99174 4.1.2.4~~~deoC2~~~Deoxyribose-phosphate aldolase 2~~~
MNSAKLIDHTLLKPESTRTQIDQIIDEAKAYHFKSVCVNPTHVKYAAERLADSEVLVCTVIGFPLGASTTATKAFETEDA
IQNGADEIDMVINIGALKDGRFDDVQQDIEAVVKAAKGHTVKVIIETVLLDHDEIVKASELTKVAGADFVKTSTGFAGGG
ATAEDVKLMKDTVGADVEVKASGGVRNLEDFNKMVEAGATRIGASAGVQIMQGLEADSDY
>O66540 4.1.2.4~~~deoC~~~Deoxyribose-phosphate aldolase~~~COG0274
MIDVRKYIDNAALKPHLSEKEIEEFVLKSEELGIYAVCVNPYHVKLASSIAKKVKVCCVIGFPLGLNKTSVKVKEAVEAV
RDGAQELDIVWNLSAFKSEKYDFVVEELKEIFRETPSAVHKVIVETPYLNEEEIKKAVEICIEAGADFIKTSTGFAPRGT
TLEEVRLIKSSAKGRIKVKASGGIRDLETAISMIEAGADRIGTSSGISIAEEFLKRHLI
>P39121 4.1.2.4~~~deoC~~~Deoxyribose-phosphate aldolase~~~COG0274
MSLANIIDHTALKPHTQKADILKLIEEAKTYKFASVCVNPTWVELAAKELKGTGVDVCTVIGFPLGANTTETKAFETKDA
ISKGATEVDMVINIAALKDKEDDVVEADIRGVVEAVAGKALVKVIIETCLLTDEEKERACRLAVSAGADFVKTSTGFSTG
GATKEDIALMRKTVGPDIGVKASGGVRTKEDVDTMVEAGASRIGASAGVSIVKGENASGGDNY
>P0A6L0 4.1.2.4~~~deoC~~~Deoxyribose-phosphate aldolase~~~COG0274
MTDLKASSLRALKLMDLTTLNDDDTDEKVIALCHQAKTPVGNTAAICIYPRFIPIARKTLKEQGTPEIRIATVTNFPHGN
DDIDIALAETRAAIAYGADEVDVVFPYRALMAGNEQVGFDLVKACKEACAAANVLLKVIIETGELKDEALIRKASEISIK
AGADFIKTSTGKVAVNATPESARIMMEVIRDMGVEKTVGFKPAGGVRTAEDAQKYLAIADELFGADWADARHYRFGASSL
LASLLKALGHGDGKSASSY
>Q9KD67 4.1.2.4~~~deoC~~~Deoxyribose-phosphate aldolase~~~COG0274
MSRSIAQMIDHTLLKPNTTEDQIVKLCEEAKEYSFASVCVNPTWVALAAQLLKDAPDVKVCTVIGFPLGATTPEVKAFET
TNAIENGATEVDMVINIGALKDKQYELVGRDIQAVVKAAEGKALTKVIIETSLLTEEEKKAACELAVKAGADFVKTSTGF
SGGGATAEDIALMRKVVGPNLGVKASGGVRDLSDAKAMIDAGATRIGASAGVAIVNGERSEGSY
>A0QLL2 4.1.2.4~~~deoC~~~Deoxyribose-phosphate aldolase~~~
MTPTRAQLAAFVDHTLLKPEATAADVAALVTEAAELGVYAVCVSPPMVPAAVQAGAGVRVASVAGFPSGKHVSAVKAHEA
ALAVASGAAEIDMVIDVGAALAGDLDGVRADIAAVRGAVGGAVLKVIVESSALLALADEHTLVRVCRAAEDAGADFVKTS
TGFHPSGGASVRAVALMAEAVGGRLGVKASGGIRTAADALAMLDAGATRLGLSGTRAVLDGLG
>P9WP03 4.1.2.4~~~deoC~~~Deoxyribose-phosphate aldolase~~~COG0274
MLGQPTRAQLAALVDHTLLKPETTRADVAALVAEAAELGVYAVCVSPSMVPVAVQAGGVRVAAVTGFPSGKHVSSVKAHE
AAAALASGASEIDMVIDIGAALCGDIDAVRSDIEAVRAAAAGAVLKVIVESAVLLGQSNAHTLVDACRAAEDAGADFVKT
STGCHPAGGATVRAVELMAETVGPRLGVKASGGIRTAADAVAMLNAGATRLGLSGTRAVLDGLS
>Q4ZMV1 4.1.2.4~~~deoC~~~Deoxyribose-phosphate aldolase~~~COG0274
MNSLEPAALAQAIDHTLLAADASREQIATLCAEAREHGFYSVCVNSSQVPFAARQLAGSAVKVCAVVGFPLGAGLSASKA
SEAALTIAAGAQEIDMVLNIGWLKDGLFDEVRDDIAAVLQACGKVPLKVILETCLLDEAQKVRACEICRELGVAFVKTST
GFSRSGATLEDVALMRRVVGPDIGVKASGGVRDVATARAMIEAGATRLGTSSGIAIVTGAGTGAGY
>C0ZUQ6 4.1.2.4~~~deoC~~~Deoxyribose-phosphate aldolase~~~COG0274
MSEAALTRSQVAAMVDHTLLKPEATAADVTALIDEARSLGVLAVCVSPSMLPIRADGLVTAAVVGFPSGKHHSLVKGAEA
RLAVDQGATEIDMVIDVGAAVAGDYSAVLADILTVREAMGESAILKVILETAALSDEAIVECCRAAVRAGANFVKTSTGF
HPAGGATVEAVELMARTVGPGVGVKASGGIRTTQAALDMIAAGATRLGLSGTRAVLDGLTD
>Q8ZJV8 4.1.2.4~~~deoC~~~Deoxyribose-phosphate aldolase~~~
MTDLKASSLRALKLMDLTTLNDDDTNEKVIALCHQAKTPVGNTAAICIYPRFIPIARKTLKEQGTPDIRIATVTNFPHGN
DDIDIALAETRAAIAYGADEVDVVFPYRALIAGNEQVGFDLVKACKDACAAANVLLKVIIETGELKEEALIRKASEISIK
AGADFIKTSTGKVPVNATPESARIMMEVIRDMGVSKTVGFKPAGGVRTAEDAQKFLAIADELFGADWADSRHYRFGASSL
LASLLKALGHGDGKSASSY
>B0TQ91 4.1.2.4~~~deoC~~~Deoxyribose-phosphate aldolase~~~COG0274
MSDLKKAAQQAISLMDLTTLNDDDTDQKVIELCHKAKTPAGDTAAICIYPRFIPIARKTLNEIGGDDIKIATVTNFPHGN
DDIAIAVLETRAAVAYGADEVDVVFPYRALMEGNETVGFELVKACKEACGEDTILKVIIESGVLADPALIRKASELSIDA
GADFIKTSTGKVAVNATLEAAEIMMTVISEKNPKVGFKPAGGVKDAAAAAEFLGVAARLLGDDWATPATFRFGASSLLTN
LLHTLELADAPQGAQGY
>Q5XA31 4.1.2.4~~~deoC~~~Deoxyribose-phosphate aldolase~~~
MEVKDILKTVDHTLLATTATWPEIQTILDDAMAYETASACIPASYVKKAAEYVSGKLAICTVIGFPNGYSTTAAKVFECQ
DAIQNGADEIDMVINLTDVKNGDFDTVEEEIRQIKAKCQDHILKVIVETCQLTKEELIELCGVVTRSGADFIKTSTGFST
AGATFEDVEVMAKYVGEGVKIKAAGGISSLEDAKTFIALGASRLGTSRIIKIVKNEATKPDSY
>Q9X1P5 4.1.2.4~~~deoC~~~Deoxyribose-phosphate aldolase~~~COG0274
MIEYRIEEAVAKYREFYEFKPVRESAGIEDVKSAIEHTNLKPFATPDDIKKLCLEARENRFHGVCVNPCYVKLAREELEG
TDVKVVTVVGFPLGANETRTKAHEAIFAVESGADEIDMVINVGMLKAKEWEYVYEDIRSVVESVKGKVVKVIIETCYLDT
EEKIAACVISKLAGAHFVKTSTGFGTGGATAEDVHLMKWIVGDEMGVKASGGIRTFEDAVKMIMYGADRIGTSSGVKIVQ
GGEERYGG
>Q5SJ28 4.1.2.4~~~deoC~~~Deoxyribose-phosphate aldolase~~~COG0274
MDLAAHIDHTLLKPTATLEEVAKAAEEALEYGFYGLCIPPSYVAWVRARYPHAPFRLVTVVGFPLGYQEKEVKALEAALA
CARGADEVDMVLHLGRAKAGDLDYLEAEVRAVREAVPQAVLKVILETGYFSPEEIARLAEAAIRGGADFLKTSTGFGPRG
ASLEDVALLVRVAQGRAQVKAAGGIRDRETALRMLKAGASRLGTSSGVALVAGEGGTLGY
>Q9KPM0 2.4.2.1~~~deoD1~~~Purine nucleoside phosphorylase DeoD-type 1~~~COG0813
MATPHINAQMGDFADVVLMPGDPLRAKYIAENFLDNAVQVCDVRNMFGYTGTYKGRKISVMGHGMGIPSCSIYVTELIKD
YGVKKIIRVGSCGAVNEGIKVRDVVIGMGACTDSKVNRIRFKDHDFAAIADYKMVKAAEEAAKARGIDVKVGNLFSAELF
YTPDPSMFDVMDKYGIVGVEMEAAGIYGVAAEYGAKALAICTVSDHIKTGEQTTSEERQNTFNEMIEIALDSVLIGDQAG
Y
>Q5DYV8 2.4.2.1~~~deoD3~~~Purine nucleoside phosphorylase DeoD-type~~~COG0813
MSTPHINAPLDAFADTILMPGDPLRAKLIAETYLENVVQVTDVRGMLGFTGEFKGRKISVMGHGMGAPSASIYFHELMTT
YKVKNFIRIGSCGAIHDDVKLKDLIVAIGASTDSKMNRIRFKDNDFAATANYNMLSECVNTLKTTDINYLVGNVFSSDLF
YRPDEEQYDMMARYGILGVEMEVNALYSAAAENHCNAVALCTVTDHIKNHEHLTADERRTELHEMINVALDVALKLPTE
>Q8EDM4 2.4.2.1~~~deoD3~~~Purine nucleoside phosphorylase DeoD-type~~~COG0813
MTAHINAQPTDFAETVIMPGDPLRAKYIAETYLTDAVEVTNVRNMLGYTGYYQGQRISVMGHGMGISSMVLYGHELINFF
GVKRIIRIGSLGATQQHVEMRDVILAQAAGTDSPTNAKRSSGYHMATSATFSLLHKAYTKANEKGISVKVGNVFSGDLYY
DPDEDMIPALERFGVLGIDMEVAGLYGLAHQQGIESLAILTVSDHCLTGEETTAQERQLSFNNMIELALETALN
>Q81T09 2.4.2.1~~~deoD~~~Purine nucleoside phosphorylase DeoD-type~~~COG0813
MSVHIEAKQGEIAESILLPGDPLRAKYIAETFLEDVTCYNNVRGMLGFTGTYKGKRVSVQGTGMGVPSISIYVNELIQSY
GVKNLIRVGTCGAIQKDVKVRDVIIAMTACTDSNMNRLTFPGFDFAPAANFDLLKKAYDAGTEKGLHVRVGNVLTADVFY
RESMDMVKKLGDYGVLAVEMETTALYTLAAKYGVNALSVLTVSDHIFTGEETTSEERQTTFNEMIEIALDAAIQQ
>Q5EEL8 2.4.2.1~~~deoD~~~Purine nucleoside phosphorylase DeoD-type~~~COG0813
MSVHIEAKQGEIAESILLPGDPLRAKYIAETFLEDVTCYNNVRGMLGFTGTYKGKRVSVQGTGMGVPSISIYVNELIQSY
GVKNLIRVGTCGAIQKDVKVRDVIIAMTACTDSNMNRLTFPGFDFAPAANFDLLKKAYDAGTEKGLHVRVGNVLTADVFY
RESMDMVKKLGDYGVLAVEMETTALYTLAAKYGVNALSVLTVSDHIFTGEETTSEERQTTFNEMIEIALDAAIQQ
>O34925 2.4.2.1~~~deoD~~~Purine nucleoside phosphorylase DeoD-type~~~COG0813
MSVHIGAEKGQIADTVLLPGDPLRAKFIAETYLENVECYNEVRGMYGFTGTYKGKKISVQGTGMGVPSISIYVNELIQSY
DVQNLIRVGSCGAIRKDVKVRDVILAMTSSTDSQMNRVAFGSVDFAPCADFELLKNAYDAAKDKGVPVTVGSVFTADQFY
NDDSQIEKLAKYGVLGVEMETTALYTLAAKHGRKALSILTVSDHVLTGEETTAEERQTTFHDMIEVALHSVSQ
>P0ABP9 2.4.2.1~~~deoD~~~Purine nucleoside phosphorylase DeoD-type~~~COG0813
MATPHINAEMGDFADVVLMPGDPLRAKYIAETFLEDAREVNNVRGMLGFTGTYKGRKISVMGHGMGIPSCSIYTKELITD
FGVKKIIRVGSCGAVLPHVKLRDVVIGMGACTDSKVNRIRFKDHDFAAIADFDMVRNAVDAAKALGIDARVGNLFSADLF
YSPDGEMFDVMEKYGILGVEMEAAGIYGVAAEFGAKALTICTVSDHIRTHEQTTAAERQTTFNDMIKIALESVLLGDKE
>P0ABP8 2.4.2.1~~~deoD~~~Purine nucleoside phosphorylase DeoD-type~~~COG0813
MATPHINAEMGDFADVVLMPGDPLRAKYIAETFLEDAREVNNVRGMLGFTGTYKGRKISVMGHGMGIPSCSIYTKELITD
FGVKKIIRVGSCGAVLPHVKLRDVVIGMGACTDSKVNRIRFKDHDFAAIADFDMVRNAVDAAKALGIDARVGNLFSADLF
YSPDGEMFDVMEKYGILGVEMEAAGIYGVAAEFGAKALTICTVSDHIRTHEQTTAAERQTTFNDMIKIALESVLLGDKE
>P77835 2.4.2.1~~~deoD~~~Purine nucleoside phosphorylase DeoD-type~~~
MSVHIGAKEHEIADKILLPGDPLRAKYIAETFLEGATCYNQVRGMLGFTGTYKGHRISVQGTGMGVPSISIYITELMQSY
NVQTLIRVGTCGAIQKDVKVRDVILAMTSSTDSQMNRMTFGGIDYAPTANFDLLKTAYEIGKEKGLQLKVGSVFTADMFY
NENAQFEKLARYGVLAVEMETTALYTLAAKFGRKALSVLTVSDHILTGEETTAEERQTTFNEMIEVALETAIRQ
>P44417 2.4.2.1~~~deoD~~~Purine nucleoside phosphorylase DeoD-type~~~COG0813
MTPHINAPEGAFADVVLMPGDPLRAKYIAETFLQDVVEVTNVRNMLGFTGTYKGRKISIMGHGMGIPSCSIYAKELITEY
GVKKIIRVGSCGTVRMDVKVRDVIIGLGACTDSKVNRIRFKDNDFAAIADFDMAQAAVQAAKAKGKVVRVGNLFSADLFY
TPDVEMFDVMEKYGILGVEMEAAGIYGVAAEYGAKALTICTVSDHIRTHEQTTAEERQLTFNDMIEIALDSVLIGDAL
>P56463 2.4.2.1~~~deoD~~~Purine nucleoside phosphorylase DeoD-type~~~COG0813
MTPHINAKIGDFYPQCLLCGDPLRVSYIAKKFLQDAKEITNVRNMLGFSGKYKGRGISLMGHGMGIASCTIYVTELIKTY
QVKELLRIGTCGAISPKVGLKDIIMATGASTDSKTNRVRFLNHDLSATPDFELSLRAYQTAKRLGIDLKVGNVFSSDFFY
SFETHAFDLMAKYNHLAIEMEAAGLYATAMELNAKALCLCSVSDHLITKEALSPKERVESFDNMIILALEMMS
>Q8ZJV7 2.4.2.1~~~deoD~~~Purine nucleoside phosphorylase DeoD-type~~~
MATPHINAEMGDFADVVLMPGDPLRAKHIAETFLENVREVNNVRGMLGFTGTYKGRKISVMGHGMGIPSCSIYTKELITD
FGVKKIIRVGSCGAVRMDVKLRDVVIGMGACTDSKVNRIRFKDHDFAAIADFDMVRNAVDAAKALGVDARVGNLFSADLF
YSPDGEMFDVMEKYGVLGVEMEAAGIYGVAAEFGAKALTICTVSDHIRTHEQTTAAERQTTFNDMIKIALESVLLGDKE
>B1JL34 2.4.2.1~~~deoD~~~Purine nucleoside phosphorylase DeoD-type~~~
MATPHINAEMGDFADVVLMPGDPLRAKFIAETFLQDVREVNNVRGMLGFTGTYKGRKISVMGHGMGIPSCSIYAKELITD
FGVKKIIRVGSCGAVRTDVKLRDVVIGMGACTDSKVNRMRFKDHDYAAIADFEMTRNAVDAAKAKGVNVRVGNLFSADLF
YTPDPQMFDVMEKYGILGVEMEAAGIYGVAAEFGAKALTICTVSDHIRTGEQTTAAERQTTFNDMIEIALESVLLGDNA
>P39140 ~~~deoR~~~Deoxyribonucleoside regulator~~~COG2390
MDREKQQLSIEAARLYYQSDYSQQQIAEQLNISRPTVSRLLQYAKEKGYVQIRVMDPFEDLDALGSILEEKYGLLEAHVV
FSPTPDYAGITHDLSRYGAEYMHETVKDGDIVGVSWGTTMYQIAQNMQPKQVKGVEVVQLKGGISHSRVNTYSAETIQLF
AEAFQTMPRYLPLPVVFDNADVKRMVEKDRHIERIIEMGKQANIALFTVGTVRDEALLFRLGYFNEEEKALLKKQAVGDI
CSRFFDAKGNICSSAINDRTIGVELQDLRLKERSILVAGGSRKVSSIHGALTGKYANVLIIDQHTARALVNDL
>P0ACK5 ~~~deoR~~~Deoxyribose operon repressor~~~COG1349
METRREERIGQLLQELKRSDKLHLKDAAALLGVSEMTIRRDLNNHSAPVVLLGGYIVLEPRSASHYLLSDQKSRLVEEKR
RAAKLAATLVEPDQTLFFDCGTTTPWIIEAIDNEIPFTAVCYSLNTFLALKEKPHCRAFLCGGEFHASNAIFKPIDFQQT
LNNFCPDIAFYSAAGVHVSKGATCFNLEELPVKHWAMSMAQKHVLVVDHSKFGKVRPARMGDLKRFDIVVSDCCPEDEYV
KYAQTQRIKLMY
>A0QXE5 5.3.1.34~~~derI1~~~D-erythrulose-4-phosphate isomerase 1~~~COG0698
MALRVVVGADDAGYEYKEALKGDLAADDRVTEVIDVGVGADEDTAYPHVAVAAARLIAEGKADRALLVCGTGLGVAISAN
KVPGIRAVTAHDSFSVERSVLSNNAQVLCFGQRVVGLELARRLAREWLGYEFDPTSKSAEKVQAICGYEPATS
>A0R757 5.3.1.34~~~derI2~~~D-erythrulose-4-phosphate isomerase 2~~~COG0698
MALKIVIGGDNAGFNYKEALRKDLEADDRVASVEDVGVGGVDDTTSYPNVAVAAAEKVARGEADRALLICGTGLGVAIAA
NKVKGIRAVTAHDVYSVQRSVLSNNAQVLCMGERVVGLELARALVKEWLGLEFDPQSASAAKVNDICAYEGA
>B9JN19 5.3.1.34~~~derI~~~D-erythrulose-4-phosphate isomerase~~~COG0698
MKLAIAGDSAGEGLAKVLADHLKDRYDVSEVSRTDAGPDAFYANLADRVASGVIDGTYDKAILVCGTGIGVSISANKVPG
IRAALTHDTYSAERAALSNNAQIITMGARVIGTELAKAIADAFLARTFDTNGRSAGNVQAIDEVDAKYNAR
>Q6D8V9 5.3.1.34~~~derI~~~D-erythrulose-4-phosphate isomerase~~~COG0698
MLSIAIGADSAAIDLKNTITDYLQQKGLTVTDYSYDPTGENPIYPDVAYTLAHAIKDGKHQRGILLCGTGIGMCIVANKV
NGIRAAQCHDTYSAQRARKSNNAQVIALGARVIGPELAKEIIGAWLDAEFEGGGSAPKVEKIGYYEHQEKHQ
>A0QXE4 2.7.1.210~~~derK~~~D-erythrulose kinase~~~COG2376
MTKLFNDPARFTEDMLVGFLDANSRYVVGVPGGVVRAQTTRPGKVAVVIGGGSGHYPAFCGTVGPGFADGAVVGNIFTSP
SAEEAASVARAAHSDAGVLLTTGNYAGDVMNFNLAVDQLRSEGIEAQYFAVTDDVASAERGQEAKRRGIAGDFTVFKCAS
AAAEEGLDLAGVVRVAEAANAATRTLGVAFDGCTLPGADHPLFTVPEGHMGLGLGIHGEPGVSEEKMPTAAGLAATLVDG
VLGDRPDAPEKRIAVILNGLGRTKYEELFVVWGEVSRLLRDRGYTIVEPEVGELVTSLDMAGCSLTVMWLDEELERYWAA
PADTPAYKKGAAQQHVSGERRSEATARSASSGPKLAELSDEDGRAGARLVARAFDAMAEALADAEEELGRIDAVAGDGDH
GRGMVKGSSAAREAAASALSEGAGQGSVLNAAGKAWAAKAGGTSGVLWGALLTALGARLGDTGRPDSSVIAAGVRDAYDA
LIRLGGAAPGDKTMLDAMLPFTEELERRVAQDESWQSAWRAAADVATEAARATADLRPKIGRARPLAERSVGTPDAGATS
LALCARTVADCVTLSTQGEN
>P50743 ~~~der~~~GTPase Der~~~COG1160
MGKPVVAIVGRPNVGKSTIFNRIAGERISIVEDTPGVTRDRIYSSAEWLNYDFNLIDTGGIDIGDEPFLAQIRQQAEIAM
DEADVIIFMVNGREGVTAADEEVAKILYRTKKPVVLAVNKLDNTEMRANIYDFYSLGFGEPYPISGTHGLGLGDLLDAVA
EHFKNIPETKYNEEVIQFCLIGRPNVGKSSLVNAMLGEERVIVSNVAGTTRDAVDTSFTYNQQEFVIVDTAGMRKKGKVY
ETTEKYSVLRALKAIDRSEVVAVVLDGEEGIIEQDKRIAGYAHEAGKAVVIVVNKWDAVDKDESTMKEFEENIRDHFQFL
DYAPILFMSALTKKRIHTLMPAIIKASENHSLRVQTNVLNDVIMDAVAMNPTPTHNGSRLKIYYATQVSVKPPSFVVFVN
DPELMHFSYERFLENRIRDAFGFEGTPIKIFARARK
>Q83C83 ~~~der~~~GTPase Der~~~COG1160
MLPVIAIVGRPNVGKSTLFNYLTKSRAALVADVPGVTRDRQYGETTIDSQRLLLVDTGGLVDTENKEVAPLAETQVEQAI
DESDCILFLVDAKAGLVPADEIIAERLRKKGKKIFLAVNKADRARAAVVQSDFYKLGFGEPYVIAAASGRGVKDLMTQVL
ENLPEEKEVIEKEVGIKIAMIGRPNVGKSTLINRLLGEERVIVYDQPGTTRDSIYIPFARNDENYTLIDTAGIRRRAKIQ
DYVEKFSMIKSLQAMHAADVVIFLLDARQGVTEQDLRLLNRIVEAGVSLIIAVNKWDGLNIEERDNVRNAIDRRMPFVDF
ARRYFISALHGTGVGKLFRAIQESYQSIQQELTTGQLTRALEKAVAEHEPPLVKGRRIRLRYAHLGARHPLTIVVHGKQT
KSLPQSYSRYLANYFRKTFNFIGVPVHIKLKTDPNPYEGQEER
>P0A6P5 ~~~der~~~GTPase Der~~~COG1160
MVPVVALVGRPNVGKSTLFNRLTRTRDALVADFPGLTRDRKYGRAEIEGREFICIDTGGIDGTEDGVETRMAEQSLLAIE
EADVVLFMVDARAGLMPADEAIAKHLRSREKPTFLVANKTDGLDPDQAVVDFYSLGLGEIYPIAASHGRGVLSLLEHVLL
PWMEDLAPQEEVDEDAEYWAQFEAEENGEEEEEDDFDPQSLPIKLAIVGRPNVGKSTLTNRILGEERVVVYDMPGTTRDS
IYIPMERDGREYVLIDTAGVRKRGKITDAVEKFSVIKTLQAIEDANVVMLVIDAREGISDQDLSLLGFILNSGRSLVIVV
NKWDGLSQEVKEQVKETLDFRLGFIDFARVHFISALHGSGVGNLFESVREAYDSSTRRVGTSMLTRIMTMAVEDHQPPLV
RGRRVKLKYAHAGGYNPPIVVIHGNQVKDLPDSYKRYLMNYFRKSLDVMGSPIRIQFKEGENPYANKRNTLTPTQMRKRK
RLMKHIKKNK
>P9WNL3 ~~~der~~~GTPase Der~~~COG1160
MTQDGTWVDESDWQLDDSEIAESGAAPVVAVVGRPNVGKSTLVNRILGRREAVVQDIPGVTRDRVCYDALWTGRRFVVQD
TGGWEPNAKGLQRLVAEQASVAMRTADAVILVVDAGVGATAADEAAARILLRSGKPVFLAANKVDSEKGESDAAALWSLG
LGEPHAISAMHGRGVADLLDGVLAALPEVGESASASGGPRRVALVGKPNVGKSSLLNKLAGDQRSVVHEAAGTTVDPVDS
LIELGGDVWRFVDTAGLRRKVGQASGHEFYASVRTHAAIDSAEVAIVLIDASQPLTEQDLRVISMVIEAGRALVLAYNKW
DLVDEDRRELLQREIDRELVQVRWAQRVNISAKTGRAVHKLVPAMEDALASWDTRIATGPLNTWLTEVTAATPPPVRGGK
QPRILFATQATARPPTFVLFTTGFLEAGYRRFLERRLRETFGFDGSPIRVNVRVREKRAGKRR
>B4RKD2 ~~~der~~~GTPase Der~~~
MKPTIALIGRPNVGKSTLFNRLTRTKDALVHDLPGLTRDRHYGHGKVGSKPYFVIDTGGFEPVVDSGILHEMAKQTLQAV
DEADAVVFLVDGRTGLTPQDKIIADRLRQSPRPVYLAVNKGEGGDRAVLAAEFYELALGEPHVISGAHGDGVYYLIEEIL
ENFPEPEAEEADAKHPVFAVIGRPNVGKSTLVNAILGEKRVIAFDMAGTTRDSIHIDFEREGKPFTIIDTAGVRRRGKVD
EAVEKFSVIKAMQAVEAANVAVLVLDAQQDIADQDATIAGFALEAGRALVVAVNKWDGISEERREQVKRDISRKLYFLDF
AKFHFISALKERGIDGLFESIQAAYNAAMIKMPTPKITRVLQTAVGRQQPPRAGLVRPKMRYAHQGGMNPPVIVVHGNSL
HAISDSYTRYLTQTFRKAFNLQGTPLRIQYNVSENPYENAEDKPKKKPLRRVSLSNRIEKREGRKEEKNRFKKKTKVSVK
KQFSK
>Q9XCI8 ~~~der~~~GTPase Der~~~
MVPVVALVGRPNVGKSTLFNRLTRTRDALVADFPGLTRDRKYGRAEVEGREFICIDTGGIDGTEDGVETRMAEQSLLAIE
EADVVLFMVDARAGLMPADEAIAKHLRSREKPTFLVANKTDGLDPDQAVVDFYSLGLGEIYPIAASHGRGVLSLLEHVLL
PWMDDVAPQEEVDEDAEYWAQFEAEQNGEEAPEDDFDPQSLPIKLAIVGRPNVGKSTLTNRILGEERVVVYDMPGTTRDS
IYIPMERDEREYVLIDTAGVRKRGKITDAVEKFSVIKTLQAIEDANVVLLVIDAREGISDQDLSLLGFILNSGRSLVIVV
NKWDGLSQEVKEQVKETLDFRLGFIDFARVHFISALHGSGVGNLFESVREAYDSSTRRVSTAMLTRIMTMAVEDHQPPLV
RGRRVKLKYAHAGGYNPPIVVIHGNQVKDLPDSYKRYLMNYFRKSLEVMGTPIRIQFKEGENPYANKRNTLTPTQMRKRK
RLMKHIKKSK
>P64060 ~~~der~~~GTPase Der~~~
MTKPIVAIVGRPNVGKSTIFNRIVGERVSIVEDTPGVTRDRIYSSGEWLTHDFNIIDTGGIEIGDAPFQTQIRAQAEIAI
DEADVIIFMVNVREGLTQSDEMVAQILYKSKKPVVLAVNKVDNMEMRTDVYDFYSLGFGEPYPISGSHGLGLGDLLDAVV
SHFGEEEEDPYDEDTIRLSIIGRPNVGKSSLVNAILGEDRVIVSNVAGTTRDAIDTEYSYDGQDYVLIDTAGMRKKGKVY
ESTEKYSVLRALKAIERSNVVLVVIDAEQGIIEQDKRVAGYAHEQGKAVVIVVNKWDTVEKDSKTMKKFEDEVRKEFQFL
DYAQIAFVSAKERTRLRTLFPYINEASENHKKRVQSSTLNEVVTDAISMNPTPTDKGRRLNVFYATQVAIEPPTFVVFVN
DVELMHFSYKRYLENQIRAAFGFEGTPIHIIARKRN
>Q9X1F8 ~~~der~~~GTPase Der~~~COG1160
MATVLIVGRPNVGKSTLFNKLVKKKKAIVEDEEGVTRDPVQDTVEWYGKTFKLVDTCGVFDNPQDIISQKMKEVTLNMIR
EADLVLFVVDGKRGITKEDESLADFLRKSTVDTILVANKAENLREFEREVKPELYSLGFGEPIPVSAEHNINLDTLLETI
IKKLEEKGLDLESKPEITDAIKVAIVGRPNVGKSTLFNAILNKERALVSPIPGTTRDPVDDEVFIDGRKYVFVDTAGLRR
KSRVEPRTVEKYSNYRVVDSIEKADVVVIVLDATQGITRQDQRIAGLVERRGRASVVVFNKWDLVEHREKRYDEFTKLFR
EKLYFIDYSPLIFTSADKGWNIDRVIDAINLAYASYTTKVPSSAINSALQKVLAFTNLPRGLKIFFGLQVDIKPPTFLFF
VNSIEKVKNPQKIFLRKLIRDYVFPFEGSPIFLKFKRSR
>Q9ZGH7 2.4.1.277~~~desVII~~~10-deoxymethynolide desosaminyltransferase~~~
MRVLLTSFAHHTHYYGLVPLAWALLAAGHEVRVASQPALTDTITGSGLAAVPVGTDHLIHEYRVRMAGEPRPNHPAIAFD
EARPEPLDWDHALGIEAILAPYFHLLANNDSMVDDLVDFARSWQPDLVLWEPTTYAGAVAAQVTGAAHARVLWGPDVMGS
ARRKFVALRDRQPPEHREDPTAEWLTWTLDRYGASFEEELLTGQFTIDPTPPSLRLDTGLPTVGMRYVPYNGTSVVPDWL
SEPPARPRVCLTLGVSAREVLGGDGVSQGDILEALADLDIELVATLDASQRAEIRNYPKHTRFTDFVPMHALLPSCSAII
HHGGAGTYATAVINAVPQVMLAELWDAPVKARAVAEQGAGFFLPPAELTPQAVRDAVVRILDDPSVATAAHRLREETFGD
PTPAGIVPELERLAAQHRRPPADARH
>Q9ZGH8 ~~~desVIII~~~Protein DesVIII~~~
MTDDLTGALTQPPLGRTVRAVADRELGTHLLETRGIHWIHAANGDPYATVLRGQADDPYPAYERVRARGALSFSPTGSWV
TADHALAASILCSTDFGVSGADGVPVPQQVLSYGEGCPLEREQVLPAAGDVPEGGQRAVVEGIHRETLEGLAPDPSASYA
FELLGGFVRPAVTAAAAAVLGVPADRRADFADLLERLRPLSDSLLAPQSLRTVRAADGALAELTALLADSDDSPGALLSA
LGVTAAVQLTGNAVLALLAHPEQWRELCDRPGLAAAAVEETLRYDPPVQLDARVVRGETELAGRRLPAGAHVVVLTAATG
RDPEVFTDPERFDLARPDAAAHLALHPAGPYGPVASLVRLQAEVALRTLAGRFPGLRQAGDVLRPRRAPVGRGPLSVPVS
SS
>P9WNZ7 1.14.19.-~~~desA1~~~Putative acyl-[acyl-carrier-protein] desaturase DesA1~~~COG0208
MSAKLTDLQLLHELEPVVEKYLNRHLSMHKPWNPHDYIPWSDGKNYYALGGQDWDPDQSKLSDVAQVAMVQNLVTEDNLP
SYHREIAMNMGMDGAWGQWVNRWTAEENRHGIALRDYLVVTRSVDPVELEKLRLEVVNRGFSPGQNHQGHYFAESLTDSV
LYVSFQELATRISHRNTGKACNDPVADQLMAKISADENLHMIFYRDVSEAAFDLVPNQAMKSLHLILSHFQMPGFQVPEF
RRKAVVIAVGGVYDPRIHLDEVVMPVLKKWRIFEREDFTGEGAKLRDELALVIKDLELACDKFEVSKQRQLDREARTGKK
VSAHELHKTAGKLAMSRR
>P9WNZ5 1.14.19.-~~~desA2~~~Putative acyl-[acyl-carrier-protein] desaturase DesA2~~~COG0208
MAQKPVADALTLELEPVVEANMTRHLDTEDIWFAHDYVPFDQGENFAFLGGRDWDPSQSTLPRTITDACEILLILKDNLA
GHHRELVEHFILEDWWGRWLGRWTAEEHLHAIALREYLVVTREVDPVANEDVRVQHVMKGYRAEKYTQVETLVYMAFYER
CGAVFCRNLAAQIEEPILAGLIDRIARDEVRHEEFFANLVTHCLDYTRDETIAAIAARAADLDVLGADIEAYRDKLQNVA
DAGIFGKPQLRQLISDRITAWGLAGEPSLKQFVTG
>P9WNZ3 1.14.19.n4~~~desA3~~~NADPH-dependent stearoyl-CoA 9-desaturase~~~COG3239
MAITDVDVFAHLTDADIENLAAELDAIRRDVEESRGERDARYIRRTIAAQRALEVSGRLLLAGSSRRLAWWTGALTLGVA
KIIENMEIGHNVMHGQWDWMNDPEIHSSTWEWDMSGSSKHWRYTHNFVHHKYTNILGMDDDVGYGMLRVTRDQRWKRYNI
FNVVWNTILAIGFEWGVALQHLEIGKIFKGRADREAAKTRLREFSAKAGRQVFKDYVAFPALTSLSPGATYRSTLTANVV
ANVIRNVWSNAVIFCGHFPDGAEKFTKTDMIGEPKGQWYLRQMLGSANFNAGPALRFMSGNLCHQIEHHLYPDLPSNRLH
EISVRVREVCDRYDLPYTTGSFLVQYGKTWRTLAKLSLPDKYLRDNADDAPETRSERMFAGLGPGFAGADPVTGRRRGLK
TAIAAVRGRRRSKRMAKSVTEPDDLAA
>Q54794 1.14.19.6~~~desA~~~Delta(12)-fatty-acid desaturase~~~
MTLSIVKSEDSSSRPSAVPSDLPLEEDIINTLPSGVFVQDRYKAWMTVIINVVMVGLGWLGIAIAPWFLLPVVWVFTGTA
LTGFFVIGHDCGHRSFSRNVWVNDWVGHILFLPIIYPFHSWRIGHNQHHKYTNRMELDNAWQPWRKEEYQNAGKFMQVTY
DLFRGRAWWIGSILHWASIHFDWTKFEGKQRQQVKFSSLLVIGAAAIAFPTMILTIGVWGFVKFWVIPWLVFHFWMSTFT
LLHHTIADIPFREPEQWHEAESQLSGTVHCNYSRWGEFLCHDINVHIPHHVTTAIPWYNLRTPTPVYRKIGGEYLYPECD
FSWGLMKQVVDHAICMMRITIISQSLTTKRV
>G2IJ05 2.1.1.-~~~desA~~~Syringate O-demethylase~~~COG0404
MAKSLQDVLDNAGNAVDFLRNQQTGPNVYPGVPAEYSNWRNEQRAWAKTAVLFNQSYHMVELMVEGPDAFAFLNYLGINS
FKNFAPGKAKQWVPVTAEGYVIGDVILFYLAENQFNLVGRAPAIEWAEFHAATGKWNVTLTRDERTALRTDGVRRHYRFQ
LQGPNAMAILTDAMGQTPPDLKFFNMADIQIAGKTVGALRHGMAGQPGYELYGPWADYEAVHSALVAAGKNHGLALVGGR
AYSSNTLESGWVPSPFPGYLFGEGSADFRKWAGENSYGAKCSIGGSYVPESLEGYGLTPWDIGYGIIVKFDHDFIGKEAL
EKMANEPHLEKVTLALDDEDMLRVMSSYFSDSGRAKYFEFPSAVYSMHPYDSVLVDGKHVGVSTWVGYSSNEGKMLTLAM
IDPKYAKPGTEVSLLWGEPNGGTSKPTVEPHEQTEIKAVVAPVPYSAVARTGYADSWRTKKA
>P20388 1.14.19.6~~~desA~~~Delta(12)-fatty-acid desaturase~~~COG3239
MTATIPPLTPTVTPSNPDRPIADLKLQDIIKTLPKECFEKKASKAWASVLITLGAIAVGYLGIIYLPWYCLPITWIWTGT
ALTGAFVVGHDCGHRSFAKKRWVNDLVGHIAFAPLIYPFHSWRLLHDHHHLHTNKIEVDNAWDPWSVEAFQASPAIVRLF
YRAIRGPFWWTGSIFHWSLMHFKLSNFAQRDRNKVKLSIAVVFLFAAIAFPALIITTGVWGFVKFWLMPWLVYHFWMSTF
TIVHHTIPEIRFRPAADWSAAEAQLNGTVHCDYPRWVEVLCHDINVHIPHHLSVAIPSYNLRLAHGSLKENWGPFLYERT
FNWQLMQQISGQCHLYDPEHGYRTFGSLKKV
>P9WNE9 1.-.-.-~~~~~~NADPH oxidoreductase~~~COG1018
MSKKHTTLNASIIDTRRPTVAGADRHPGWHALRKIAARITTPLLPDDYLHLANPLWSARELRGRILGVRRETEDSATLFI
KPGWGFSFDYQPGQYIGIGLLVDGRWRWRSYSLTSSPAASGSARMVTVTVKAMPEGFLSTHLVAGVKPGTIVRLAAPQGN
FVLPDPAPPLILFLTAGSGITPVMSMLRTLVRRNQITDVVHLHSAPTAADVMFGAELAALAADHPGYRLSVRETRAQGRL
DLTRIGQQVPDWRERQTWACGPEGVLNQADKVWSSAGASDRLHLERFAVSKTAPAGAGGTVTFARSGKSVAADAATSLMD
AGEGAGVQLPFGCRMGICQSCVVDLVEGHVRDLRTGQRHEPGTRVQTCVSAASGDCVLDI
>Q9ZGH1 4.3.1.30~~~desII~~~dTDP-4-amino-4,6-dideoxy-D-glucose ammonia-lyase~~~
MTAPALSATAPAERCAHPGADLGAAVHAVGQTLAAGGLVPPDEAGTTARHLVRLAVRYGNSPFTPLEEARHDLGVDRDAF
RRLLALFGQVPELRTAVETGPAGAYWKNTLLPLEQRGVFDAALARKPVFPYSVGLYPGPTCMFRCHFCVRVTGARYDPSA
LDAGNAMFRSVIDEIPAGNPSAMYFSGGLEPLTNPGLGSLAAHATDHGLRPTVYTNSFALTERTLERQPGLWGLHAIRTS
LYGLNDEEYEQTTGKKAAFRRVRENLRRFQQLRAERESPINLGFAYIVLPGRASRLLDLVDFIADLNDAGQGRTIDFVNI
REDYSGRDDGKLPQEERAELQEALNAFEERVRERTPGLHIDYGYALNSLRTGADAELLRIKPATMRPTAHPQVAVQVDLL
GDVYLYREAGFPDLDGATRYIAGRVTPDTSLTEVVRDFVERGGEVAAVDGDEYFMDGFDQVVTARLNQLERDAADGWEEA
RGFLR
>O34757 2.7.13.3~~~desK~~~Sensor histidine kinase DesK~~~COG4585
MIKNHFTFQKLNGITPYIWTIFFILPFYFIWKSSSTFVIIVGIILTLLFFSVYRFAFVSKGWTIYLWGFLLIGISTASIT
LFSYIYFAFFIAYFIGNIKERVPFHILYYVHLISAAVAANFSLVLKKEFFLTQIPFVVITLISAILLPFSIKSRKERERL
EEKLEDANERIAELVKLEERQRIARDLHDTLGQKLSLIGLKSDLARKLIYKDPEQAARELKSVQQTARTSLNEVRKIVSS
MKGIRLKDELINIKQILEAADIMFIYEEEKWPENISLLNENILSMCLKEAVTNVVKHSQAKTCRVDIQQLWKEVVITVSD
DGTFKGEENSFSKGHGLLGMRERLEFANGSLHIDTENGTKLTMAIPNNSK
>O34723 ~~~desR~~~Transcriptional regulatory protein DesR~~~COG2197
MISIFIAEDQQMLLGALGSLLNLEDDMEVVGKGTTGQDAVDFVKKRQPDVCIMDIEMPGKTGLEAAEELKDTGCKIIILT
TFARPGYFQRAIKAGVKGYLLKDSPSEELANAIRSVMNGKRIYAPELMEDLYSEANPLTDREKEVLELVADGKNTKEIAQ
ELSIKSGTVRNYISMILEKLEVKNRIEAITRSKEKGWFK
>P00273 ~~~dsr~~~Desulforedoxin~~~
MANEGDVYKCELCGQVVKVLEEGGGTLVCCGEDMVKQ
>P02966 ~~~tps~~~Development-specific protein S~~~
MANITVFYNEDFQGKQVDLPPGNYTRAQLAALGIENNTISSVKVPPGVKAILYQNDGFAGDQIEVVANAEELGPLNNNVS
SIRVISVPVQPRARFFYKEQFDGKEVDLPPGQYTQAELERYGIDNNTISSVKPQGLAVVLFKNDNFSGDTLPVNSDAPTL
GAMNNNTSSIRIS
>Q9ZGH6 2.1.1.234~~~desVI~~~dTDP-3-amino-3,4,6-trideoxy-alpha-D-glucopyranose N,N-dimethyltransferase~~~
MYEVDHADVYDLFYLGRGKDYAAEASDIADLVRSRTPEASSLLDVACGTGTHLEHFTKEFGDTAGLELSEDMLTHARKRL
PDATLHQGDMRDFRLGRKFSAVVSMFSSVGYLKTTEELGAAVASFAEHLEPGGVVVVEPWWFPETFADGWVSADVVRRDG
RTVARVSHSVREGNATRMEVHFTVADPGKGVRHFSDVHLITLFHQAEYEAAFTAAGLRVEYLEGGPSGRGLFVGVPA
>Q9ZGH4 2.6.1.106~~~desV~~~dTDP-3-amino-3,4,6-trideoxy-alpha-D-glucose transaminase~~~
MSSRAETPRVPFLDLKAAYEELRAETDAAIARVLDSGRYLLGPELEGFEAEFAAYCETDHAVGVNSGMDALQLALRGLGI
GPGDEVIVPSHTYIASWLAVSATGATPVPVEPHEDHPTLDPLLVEKAITPRTRALLPVHLYGHPADMDALRELADRHGLH
IVEDAAQAHGARYRGRRIGAGSSVAAFSFYPGKNLGCFGDGGAVVTGDPELAERLRMLRNYGSRQKYSHETKGTNSRLDE
MQAAVLRIRLAHLDSWNGRRSALAAEYLSGLAGLPGIGLPVTAPDTDPVWHLFTVRTERRDELRSHLDARGIDTLTHYPV
PVHLSPAYAGEAPPEGSLPRAESFARQVLSLPIGPHLERPQALRVIDAVREWAERVDQA
>O34653 1.14.19.-~~~des~~~Fatty acid desaturase~~~COG3239
MTEQTIAHKQKQLTKQVAAFAQPETKNSLIQLLNTFIPFFGLWFLAYLSLDVSYLLTLALTVIAAGFLTRIFIIFHDCCH
QSFFKQKRYNHILGFLTGVLTLFPYLQWQHSHSIHHATSSNLDKRGTGDIWMLTVNEYKAASRRTKLAYRLYRNPFIMFI
LGPIYVFLITNRFNKKGARRKERVNTYLTNLAIVALAAACCLIFGWQSFLLVQGPIFLISGSIGVWLFYVQHTFEDSYFE
ADENWSYVQAAVEGSSFYKLPKLLQWLTGNIGYHHVHHLSPKVPNYKLEVAHEHHEPLKNVPTITLKTSLQSLAFRLWDE
DNKQFVSFRAIKHIPVSLPPDSPEKQKLRKNA
>P9WMF8 ~~~devR~~~DNA-binding transcriptional activator DevR/DosR~~~
MVKVFLVDDHEVVRRGLVDLLGADPELDVVGEAGSVAEAMARVPAARPDVAVLDVRLPDGNGIELCRDLLSRMPDLRCLI
LTSYTSDEAMLDAILAGASGYVVKDIKGMELARAVKDVGAGRSLLDNRAAAALMAKLRGAAEKQDPLSGLTDQERTLLGL
LSEGLTNKQIADRMFLAEKTVKNYVSRLLAKLGMERRTQAAVFATELKRSRPPGDGP
>P9WMF9 ~~~devR~~~DNA-binding transcriptional activator DevR/DosR~~~COG2197
MVKVFLVDDHEVVRRGLVDLLGADPELDVVGEAGSVAEAMARVPAARPDVAVLDVRLPDGNGIELCRDLLSRMPDLRCLI
LTSYTSDEAMLDAILAGASGYVVKDIKGMELARAVKDVGAGRSLLDNRAAAALMAKLRGAAEKQDPLSGLTDQERTLLGL
LSEGLTNKQIADRMFLAEKTVKNYVSRLLAKLGMERRTQAAVFATELKRSRPPGDGP
>Q07765 ~~~devR~~~CRISPR-associated protein Cas7/Cst2/DevR~~~COG1857
MSLHVFAAFVTPLGTAANNRGLTEGNITSLQKLVWNGQVHTTVSAESIRFALRRRLNEQEPCNRTYDDASRANAWKDAAF
SAWSGKSKEKTYIDDDLLGFMSAEGAKQEKEKGTAKVRRAVLEVSRAVSLTPWSGDVTFNAASPGATPSAQKKGSNPVPY
GTEMHATRYQYGVALTPEALRVPARAVTALNQLCALGPVAGNHGRFLFDFSPESVVFRLTQEAAPRILYAFEPSSRAGGV
ELAALLRKVKSGDVPAKELVLGGQVVEGLGAEEREVLSGAELHTGVVAACRAACKRLEVRKK
>P9WGK2 2.7.13.3~~~devS~~~Oxygen sensor histidine kinase response regulator DevS/DosS~~~
MTTGGLVDENDGAAMRPLRHTLSQLRLHELLVEVQDRVEQIVEGRDRLDGLVEAMLVVTAGLDLEATLRAIVHSATSLVD
ARYGAMEVHDRQHRVLHFVYEGIDEETVRRIGHLPKGLGVIGLLIEDPKPLRLDDVSAHPASIGFPPYHPPMRTFLGVPV
RVRDESFGTLYLTDKTNGQPFSDDDEVLVQALAAAAGIAVANARLYQQAKARQSWIEATRDIATELLSGTEPATVFRLVA
AEALKLTAADAALVAVPVDEDMPAADVGELLVIETVGSAVASIVGRTIPVAGAVLREVFVNGIPRRVDRVDLEGLDELAD
AGPALLLPLRARGTVAGVVVVLSQGGPGAFTDEQLEMMAAFADQAALAWQLATSQRRMRELDVLTDRDRIARDLHDHVIQ
RLFAIGLALQGAVPHERNPEVQQRLSDVVDDLQDVIQEIRTTIYDLHGASQGITRLRQRIDAAVAQFADSGLRTSVQFVG
PLSVVDSALADQAEAVVREAVSNAVRHAKASTLTVRVKVDDDLCIEVTDNGRGLPDEFTGSGLTNLRQRAEQAGGEFTLA
SVPGASGTVLRWSAPLSQ
>P9WGK3 2.7.13.3~~~devS~~~Oxygen sensor histidine kinase response regulator DevS/DosS~~~COG2203
MTTGGLVDENDGAAMRPLRHTLSQLRLHELLVEVQDRVEQIVEGRDRLDGLVEAMLVVTAGLDLEATLRAIVHSATSLVD
ARYGAMEVHDRQHRVLHFVYEGIDEETVRRIGHLPKGLGVIGLLIEDPKPLRLDDVSAHPASIGFPPYHPPMRTFLGVPV
RVRDESFGTLYLTDKTNGQPFSDDDEVLVQALAAAAGIAVANARLYQQAKARQSWIEATRDIATELLSGTEPATVFRLVA
AEALKLTAADAALVAVPVDEDMPAADVGELLVIETVGSAVASIVGRTIPVAGAVLREVFVNGIPRRVDRVDLEGLDELAD
AGPALLLPLRARGTVAGVVVVLSQGGPGAFTDEQLEMMAAFADQAALAWQLATSQRRMRELDVLTDRDRIARDLHDHVIQ
RLFAIGLALQGAVPHERNPEVQQRLSDVVDDLQDVIQEIRTTIYDLHGASQGITRLRQRIDAAVAQFADSGLRTSVQFVG
PLSVVDSALADQAEAVVREAVSNAVRHAKASTLTVRVKVDDDLCIEVTDNGRGLPDEFTGSGLTNLRQRAEQAGGEFTLA
SVPGASGTVLRWSAPLSQ
>Q07766 ~~~devS~~~CRISPR-associated protein Cas5~~~
MIALELSVPVACWRKGRARELVETEVLPPPATCYGALLSLVGEQDRERHRGCRVTAGVLNAPVISTVLRTFWRSKNLKVA
KGNDENAAPDQQQLVIDARLVVWCDSREEPDSGESLEDRVVRAMREPGSVTRAGGWSLGESTHLINDARLLPEGRPPAGC
RAFLTASTGALTLPVWVDHVGTRGTRYEVGRLEEVLAAPEVQRLPRIPLAEGAG
>P38586 ~~~devT~~~CRISPR-associated protein Cas8a1/Csx13~~~
MACMAPRGPAAIPHPSSERAGLRRPARCAMAKAVNPRKKALPAPLSIRLYAPGMTPLLRAGAGGLAASLRAILGSASPAA
PWPSPVRLGPGTATVEQEAIHLDWGGKAPEATLRALFGASFRVKQGFIDLPGTRPPGAPEPPPELAAALHDALKVTFLQH
GKSTQGGARRRVTFEVDARPVIVESQGYDSFVHQTAWQSVLEALEVGSTSLASWAYPGAAERHIGVRVTKVEYTAAEALC
ACFALVGCVSYKLPQLRGGAFVALAPTNLVRFAELRPGLTPKRLRDVAVAGASDAVLAAQLVMAQEAGKKRLGAVLGTTE
AVALRQMPWNAQQKIRGAVVRQDAVLEEVLDRYEAAAAALPHTLRVRKPEGKATGEASYFIAISALRAFITENLAASRPW
YADFATATTAEGRFIHDYRDRDNLGALLWHERKGLIAMHPYLGEAEQWLVQSVHLALRSRFKSIYADTKESAPATRSNRL
KGERERLRLSFAGAKTPEQVRAALADLWSRAGTNRELQEHWRDILQLLGPERWRAARDLALVALASYQGKGGEAAELEDA
DEAAGASEQS
>Q99040 3.2.1.70~~~dexB~~~Glucan 1,6-alpha-glucosidase~~~COG0366
MQKHWWHKATVYQIYPKSFMDTNGDGIGDLKGITSKLDYLQKLGVMAIWLSPVYDSPMDDNGYDIANYEAITDIFGNMAD
MDNLLTQAKMRGIKIIMDLVVNHTSDEHAWFIEAREHPDSSERDYYIWCDQPNDLESIFGGSAWQYDDKSDQYYLHFFSK
KQPDLNWENANLRQKIYDMMNFWIDKGIGGFRMDVIDMIGKIPAQHIVSNGPKLHAYLKEMNAASFGQHDLLTVGETWGA
TPEIAKQYSNPVNHELSMVFQFEHIGLQHKPEAPKWDYVKELNVPALKTIFNKWQTELELGQGWNSLFWNNHDLPRVLSI
WGNTGKYREKSAKALAILLHLMRGTPYIYQGEEIGMTNYPFKDLNELDDIESLNYAKEAFTNGKSMETIMDSIRMIGRDN
ARTPMQWDASQNAGFSTADKTWLPVNPNYKDINVQAALKNSNSIFYTYQQLIQLRKENDWLVDADFELLPTADKVFAYLR
KVREERYLIVVNVSDQEEVLEIDVDKQETLISNTNESAALANHKLQPWDAFCIKIN
>P39652 3.2.1.11~~~~~~Dextranase~~~
MPGTGLGRLAKRMTAAAAVFFISTSAVLPAQAATAPAAAPPGVPAALKAERAITTVDNGNLHTWWHDNGVFSPATPTQSS
EVRRSSFYDVQVAQANQPQKLYDAFSYMSIPRSGKGKIGYTEEDGAEFSSDARLTMSWSSFEYAKDVWVEVSLRTGQTIS
SADQVQIRPSSYNFEKQLVDADTVRIKVPYSDAGYRFSVEFEPQLYTAYNDMSGDSGKLTTEAAGNRPIHTEPRNSMMVF
AEPKLRGEQKERLVPTEESGSIHYPEPGEVRNLNSVSEEIIYFRPGTYSMGPDYHAVLPANVKWVYLAPGAYVKGAFRFL
HDTQSQYKVTGYGVLSGEQYVYEADTNNSYHHLSGASNCHSSCVKMLQFASADAEQKLDLQGVTVAEPPYHSFVVYGNEQ
TFHMNVENYKQVGSWYWQTDGIELYKGSTMKNTFFNANDDVLKMYHSDVTIDNTVIWKNENGPVIQWGWTPRNIDNVNVA
NTTVIHNRMYWKDVKYNTCIFNSSSHWEDMGSTTKADPNTTVKNMRFENTAVEGMTNCAIRVYALSDTENIHIKNFNIGA
WNGLEWTSQVSHLKRYTNSAGEKVTIGNEVPDGNGLALENYSVGGQVIEKTGGNSSDYQLGRLGFDGENWENWNAWKSAP
>P39653 3.2.1.11~~~dex~~~Dextranase~~~
MNNRMLSFPSMLFLLAFGIVLSVSAGTTHADELANQTAKVADEASIVVSTSQAAVEQTQSQEKEISPAMEEDTSNLSLKP
NAQQESQSPDSSTELQDPAEQTPPETSDASAPATTSADSVEKYAQDATQNQSSTSNGPGVIRATSAQVTATRSVVSSQSG
DAIVDLSADKASYRQGEDVNLSVDFKNTTDKEQDVTVYADVYYIDNKLGTYKFSKHLKAGEGYKMQSGDLKIPASQFENN
HGYLLKVRVRDADNNTLSEVNKAIAVESDWTKFPRYGIVGGSQDTNNSLLSKDADRYRAEIEKMKNMNINSYFFYDVYKT
ATNPFPSDEATFKQDWNTWSGSEIDTQAVKDIVNQVHDGGAVAMLYNMILAENTNTGEAPVLPETEYAYNSDDRGYGAQG
QPMSYTVKIPKDGQEEDVEIQRYYNPTSKLWQDYIADKMGQAMKNGGFDGWQGDTIGDNEVYSYADKDSNDPSKKFWLTE
GYAEFLRAIKEKLPNYYLTVNDVNGEQIYRLKDGNQDVIYNEIWPFGPALPSEMAAVKPNTVTSRPVLTKVRQGDWKISI
VGAYMEGSENGGSKADAEAGKSLQTDAVLLTSASIAAAGGYHMSLAALANQQDETDGGQGIGVLQTAYYPTQSLKTSSEL
TRKNNDYQQFITAYENVLRDGVENDDAQVNTFDSNGQKLSTDAKGITGNQVWTYGKKGDNFRTVQLLNLMGINSDWKNED
GSAANKTPDEQTNLTVKYALGDVSMEDAQRMANQTYVTSPDDWSKSNLQKVSASVKTDENGKPVLVINVPKLTLWDVVYI
SNANQESAPEADQAQTPAAQSSDDKVAENETSQPAAEDAKEQTSEPAQDQAAPAEQGQAINQAESPATEPEAEVTPATAE
PAKVDAPEANQAADQAVSPEPASQEQAASQSQPEANQTPASNETPATQGNSEQPELNEPTAQTQPSSQVSPANTSVTPVA
EQPTNQGQAADKADQAPTNSTSTPESTSPVEPAATDQSSDTPIVTAGNLSVQPAETETPTVPDKQGDSKANQSSTETPVA
DQVPAVAEQPQATEPNQAKPSVDKAAAPEALSLIQLKQQTPAIQAKEADDPEVDETKSEVTPDSGTDKAPEAGQVDSDKA
PTVKPSTPENNDNQPNNANDADKNKTNEADSNKANQDSTKGSSADQSGKSTTPEDGPDNSSPEDPETKPSDPNTDTSDQE
QVKPSLPVVPNQTVDDPKTDDTDTPANTDSAKSKKVADADKNKVATDSEGRQKSSEFPKEATDLEKVGQPASPQVAGVKS
SVATSPEKKSEPVSKTSTTSSSDKLPKTGDHKTVVLIIVLGLVFVGMTGLLARHEKK
>Q54443 3.2.1.11~~~dexA~~~Dextranase~~~COG5297
MEQSNRQTAEPAIRSAETVDSTINSFQETDLKVQEKEDVAAAVQTESASIDSNEQGQSVSANTNTQSQAKKLSNNSHQEP
MQMVSAANKERAVLETAQNQKNGNMINLTTDKAVYQAGEAVHLNLTLNNTTSLAQNITATAEVYSLENKLKTLQYTKYLL
PNESYTTQKGEFVIPANSLANNRGYLLKVNISDSQNNILEQGNRAIAVEDDWRTFPRYAAIGGSQKDNNSVLTKNLPDYY
RELEQMKNMNINSYFFYDVYKSATNPFPNVPKFDQSWNWWSHSQVETDAVKALVNRVHQTGAVAMLYNMILAQNANETAV
LPDTEYIYNYETGGYGQNGQVMTYSIDDKPLQYYYNPLSKSWQNYISNAMAQAMKNGGFDGWQGDTIGDNRVLSHNQKDS
RDIAHSFMLSDVYAEFLNKMKEKLPQYYLTLNDVNGENISKLANSKQDVIYNELWPFGTSALGNRPQESYGDLKARVDQV
RQATGKSLIVGAYMEEPKFDDNRVPLNGAARDVLASATYQTDAVLLTTAAIAAAGGYHMSLAALANPNDGGGVGVLETAY
YPTQSLKVSKELNRKNYHYQQFITAYENLLRDKVENDSAEPQTFTANGRQLSQDALGINGDQVWTYAKKGNDFRTIQLLN
LMGITSDWKNEDGYENNKTPDEQTNLLVTYPLTGVSMAEADRIAKQVYLTSPDDWLQSSMISLATQIKTNENGDPVLYIQ
VPRLTLWDMIYINETIKPETPKVPEQPQHPARTLEPAIPQTPEAVSPLPVANKQAVDENKNEIVSALTGEENDLQLPTLS
KRSLSISQAELPQTGDNNETRSNLLKVIGAGALLIGAAGLLSLIKGRKKD
>Q55393 1.-.-.-~~~dfa1~~~Diflavin flavoprotein A 1~~~COG0426
MFTTPLPPQKRLSTQTEAIAKNITAIRSLDWDRDRFDIEFGLQNGTTYNSYLIQADKVALVDSSHEKFRQLYLDLLQGLI
DPQRIDYLIVSHTEPDHSGLVKDILQLNPRITVVATKVALQFLDNFVHQPFERIQVKSGDRLDLGQGHDLEFVSAPNLHW
PDTMLTYDPATEILFTCDVFGMHYCSDAVFDIDLGKIAPDYQFYYDCLMGPNARSVLAAMKRMDNLGTISTVANGHGPLL
RHNVGELLHRYRHWSESQSKAEKTVVVFYVADYGYGDRLSQAIAKGITKTGVGVDMVDLSSADPQEIQELVGHASGVVLG
MPPLQANADLSTNFGAVLAAMQPKQVFGLYESYGGDDEPIDPLRTKFLDLGLREAFKVIKVKDTPSESTYQLCDESGTDL
GQNLIQAAKIKQLKSLDSDLEKAIGRISGGLYIITAQKGEVKGAMLASWVSQASFNPPGFTVAVAKDRAIESLMQVGDRF
VLNILEEGNYQILMKHFLKRFPPGADRFAGVKTQTASNGSPILTDALAYLECEVASRMECSDHWIVYSQVTNGRVAKAEG
LTAVHHRKVGNYY
>Q8YQD8 1.-.-.-~~~dfa3~~~Putative diflavin flavoprotein A 3~~~COG0426
MVALTEKTEKRLTIQTADIAQDTTAIRSLDWERDRFDIEFGLQNGTTYNSFLIRGEQIALVDTSHEKFRQLYFDTLTGLI
NPTEINYLIISHTEPDHSGLVKDLLQMAPEITVVGSKVAIQFLEDLVHQPFKRKIVKNGDRLDLGNGHEFEFVIAPNLHW
PDTIFSFDHKTQTLYTCDAFGMHYCSDIVFDEDLKTIEPDFHYYYDCLMGPNARSVLSALKRMGELPSVKMIATGHGPLL
YHNVEELTGRYRTWSQNQTKAETSIGVFYVSEYGYSDRLAQAIINGITKTGVGVDVVDLGAAVDLQELRELVGRCTGLVI
GMSPAASAASIQGALSTILGSVNEKQAVGIFETGGGDDEPIDPLLSKFRNLGLTTAFPAIRIKQTPTENTYKLCEEAGTD
LGQWVTRDRSIKAMKSLGADLDKALGRLSGGLYIITAKKGDVSSAMLASWVNQASFKPLGFSIAVAKDRAIESLMQVGDR
FVLNVLEEGNYQPLMRHFLKRFAPGADRFEGVKTQPAENGAPILGDALAYMECEVVSRMDCGDHWAVYSTVYAGRVSKSE
ALTAVHHRKVGNHY
>P74373 1.-.-.-~~~dfa3~~~Putative diflavin flavoprotein A 3~~~COG0426
MGIHAKLETVQLPLLVSCLFPPLTMPAKDVQICPIAVDTTVFRSRTWDRLKFEIEYGLQRGTTANSYLISADKIALFDPP
GESFTDNFVGTLIQRLDLNSLDYVILGHVNANRAHTLKLLLSLAPQATIICSNPAAQNLEKLLADAEVNNPIQVMKGNDH
LDLGRGHELTFIPTPSPRYPGQLCTYDPRTEILFTDKLFGAHVCGDQVFDEGWTIYQEDRRYYFDCLLAPAAAQVSAALN
KLEAYPAQTYAPSHGPLVRYGLRELTRNYQQWLSEQQAQALNVALIYASAYGNTSTLAQAIARGITKAGVAVTAINAETS
NAEEIKEAIGKSAGFIFGSPTLGGHAPTPIQTALGITLANASKTQLCGVFGSFGWSGEAIDMLENKFRDAGFSFGFDTIR
VKFKPTDQTLKMCEEAGTDFAQALKKAEKRRQPKSALPESESARTEQALGRLVGSLCVVTAQQGELSSAMLASWVSQATF
SPPGLTVAVAKERAIESLLHKNSCFVLNILQEGNHLGLMKHFLKPFAPGGDRFADVATETAENGAPILTESLAYLECRVQ
QRLECGDHWVLYAVTDRGALLKDGVTAVHHRKSGDHY
>Q8Z0C1 1.-.-.-~~~dfa5~~~Putative diflavin flavoprotein A 5~~~COG0426
MSDSKPRDVQVLPIATNTKVLRARSWSRLRFEIEYALERGTTSNSYVIEGDKTAIIDPPVESFMKIYLEALQQTVNLKKL
DYVILGHFSPNRIPTFKALLELAPQITFVCSLPAAGDLRAAFPDDNLNILPMRGKETLDLGKGHVLKFLPIPSPRWPAGL
CTYDVQTQILYTDKIFGAHICGDDVFDDNWESFKEDQRYYFNCLMAPHAIHVEAALEKISDLQVRLYAVGHGPLVRTSLI
ALTQAYADWSKAQKDREISVALLYASAYGNTATIARAIALGLTKGGVAVKSINCEFATPEEIQTNLEQVDGFLIGSPTIG
GHAPTPINTALGIVLKVGDNNKLAGVFGSYGWSGEALDMIEGKLRDAGYRFGLDTLKVKFKPDDVTLKFCEEVGTDFAQT
LKKAKKVRVPQQAATPVEQAVGRIVGSVCVITAKQGDVSTGMLGSWVSQATFNPPGLTVAIAKERAIESLMYPGGKFALN
ILSEGNHLEYMKHFRKNFAPGEDRFANFTTTEADNGCTVLADALAYVECSVDQRLECGDHWVVYATVDNGKLLKPDDVTA
INHRKTGNHY
>Q97GB9 1.15.1.2~~~dfx~~~Desulfoferrodoxin~~~COG2033
MNNDLSIYVSKNSGTAVLLLQGNGTDLTCGSEPMAKIVANTTDAAQEKHVPHITKNGNNIDVSVGSVEHPMTPEHFIEWI
ILVSGDRLEMAKLTPDMKPRAQFHNVTSGTVYAYCNLHSLWKADI
>Q46495 1.15.1.2~~~dfx~~~Desulfoferrodoxin~~~COG2033
MPERLQVYKCEVCGNIVEVLNGGIGELVCCNQDMKLMSENTVDAAKEKHVPVIEKIDGGYKVKVGAVAHPMEEKHYIQWI
ELLADDKCYTQFLKPGQAPEAVFLIEAAKVVAREYCNIHGHWKAEN
>P22076 1.15.1.2~~~dfx~~~Desulfoferrodoxin~~~COG2033
MPKHLEVYKCTHCGNIVEVLHGGGAELVCCGEPMKHMVEGSTDGAMEKHVPVIEKVDGGYLIKVGSVPHPMEEKHWIEWI
ELLADGRSYTKFLKPGDAPEAFFAIDASKVTAREYCNLHGHWKAEN
>P20418 1.15.1.2~~~dfx~~~Desulfoferrodoxin~~~COG2033
MPNQYEIYKCIHCGNIVEVLHAGGGDLVCCGEPMKLMKEGTSDGAKEKHVPVIEKTANGYKVTVGSVAHPMEEKHWIEWI
ELVADGVSYKKFLKPGDAPEAEFCIKADKVVAREYCNLHGHWKAEA
>D0ZLR3 4.3.1.29~~~dgaE~~~D-glucosaminate-6-phosphate ammonia lyase~~~
MTPNIYQQLGLKKVINACGKMTILGVSSVAPEVMQATARAASAFVEIDALVEKTGELVSRYTGAEDSYITSCASAGIAIA
VAAAITHGDRARVALMPDSSGMANEVVMLRGHNVDYGAPVTSAIRLGGGRIVEVGSSNLATRWQLESAINEKTAALLYVK
SHHCVQKGMLSIDDFVQVAQANHLPLIVDAAAEEDLRGWVASGADMVIYSGAKAFNAPTSGFITGRKTWIAACKAQHQGI
ARAMKIGKENMVGLVYALENYHQGQTTVTAAQLQPVAEAISAIHGLYADIEQDEAGRAIWRIRVRVNASELGLNAQDVEA
QLRGGEIAIYARKYQLHQGVFSLDPRTVAEGEMALIVARLREIAEHAAD
>D0ZLR2 4.1.2.14~~~dgaF~~~2-dehydro-3-deoxy-phosphogluconate aldolase~~~
MQQINFYRQRVAINVLAKDIANAKAIYEAAEGHAVIGVLSAQFATVEEGVPEVKRWMAEVPSISVGLGAGDPAQYYKAAM
IAAHTHPAHVNQTFTGSGFAAGALAATGGEQTHINALVSPTGTPGEVVISTGVSSSQGTPARVSCEAAVRMMQDMGAHAA
KFFPMGGEKSLPELYALATTAARHGMTLIEPTGGISLDNFGIILQTCLEAGVPRVMPHVYSSIIDPQTGNTRPEDIIRLM
EIVKALV
>D0ZLR9 ~~~dgaR~~~Transcriptional regulatory protein DagR~~~
MRRIEIVLGELERLTRGLCLADLAQETAFTAEAIGFNLGLARNSVSKDLNQLWNDGLAIKSRGRPVYFLHRQALETLLGR
QLEESEREVRSVADVLPHEEHYAPDDPFTSLIGYDRSLRDAVEKGRAAVLYPHGLHVLLTGPSGVGKTFFAELMHRFACE
QASGAIPPLVYFNCAEYAHNPELLSSHLFGHRQGAFTGANEHKTGLVEQADGGYLLLDEVHRLSYEGQEKLFSILDKGEY
RPLGVSSQPRSISVRLICATTEPVGSALLRTFQRRIQVCIDLPGIHQRSVEEQIELIVGFLQRESRKIERTVSIDKPLLL
WLLNKPLEGNIGQLKSDIQFLCAQAWASGMTEHNDTLQLDKRLAEMSVNPTPEQRLLVDTLFEGKARLNIDARTLPALKT
SLATGAEIEESDLFYSFLTREYVNLRNSNVPPAETLAILKNKLSSIFEYGLYSRDSVAHPPRYGDQIEERVTLLIGCVEQ
VLGFSLPENLVNPLRKHFLALIGYVQRGLIPQLYSSSLILDRCKDEYDNATLLCRKINELLHIQCPATEVVWLCLFLKEC
RHYRQRIDASPDCGVILIAHGATTATSQAQYVNRVLERELFSAIDMPFEQSVHDTLETLTQMIQTRQYRRLILLVDIGSL
IHFGSTISKLFQIDVLLMPNITLTSLLEVGLDLSYETSDLPQLTALLQSKNIPCQLCTPQQENGGKVLVISCITGMGTAE
KIKKVLEESFGELMSQDTRMVILDYNEVRSLERVQQALNASERLAGIVGTFQPGLPDIPFISLEELFSEQGPELVLSLLT
PDLSNAERRLEMERSAMRFISALTMESIINHISVLNPQRILKEMEGVFNHLTSSLSLKPSRQVTLRFLIHCCCMVERIVI
NRKPLQMALESQPNLDARAFSVIKSAFLPIEDAYAIRLSDAEYFYIYELLYS
>Q73RG3 2.7.7.65~~~dgcA~~~Diguanylate cyclase A~~~COG3706
MKTTPNEKLLKKALHSCNNKKYADKILHQEKEIFDLKQLLQISKSLNSVLEFDRLIEAILYIVMAQLKTLGAAIFTKKSF
DDNLFVLNRDHYGFDIIRDAQYSINVDHPLINFLDKSDSGCTPDEISKNIKTDKIVKDLFSLSPSFFVPLKAKNRMIGFL
LLGEKMESSHQFTDYEKNIIENIASLAAIAINNSQLLEMTTTDIMTHLKLKHYFFTLLMEHLYTINSSGEKKETLSILMI
DIDFFKNINDTYGHAAGDIVLEEVAKIIKSCTRNADTAARYGGEEFIVMLNNTSASAAMAVAERIRKSVEEKSIMYDGKK
INVTISIGVSSYNFDLESAKSIVERADKALYESKQNGRNRVTLSKNNLPKA
>P0AAP1 2.7.7.65~~~dgcC~~~Probable diguanylate cyclase DgcC~~~COG3706
MFPKIMNDENFFKKAAAHGEEPPLTPQNEHQRSGLRFARRVRLPRAVGLAGMFLPIASTLVSHPPPGWWWLVLVGWAFVW
PHLAWQIASRAVDPLSREIYNLKTDAVLAGMWVGVMGVNVLPSTAMLMIMCLNLMGAGGPRLFVAGLVLMVVSCLVTLEL
TGITVSFNSAPLEWWLSLPIIVIYPLLFGWVSYQTATKLAEHKRRLQVMSTRDGMTGVYNRRHWETMLRNEFDNCRRHNR
DATLLIIDIDHFKSINDTWGHDVGDEAIVALTRQLQITLRGSDVIGRFGGDEFAVIMSGTPAESAITAMLRVHEGLNTLR
LPNTPQVTLRISVGVAPLNPQMSHYREWLKSADLALYKAKKAGRNRTEVAA
>P38097 2.7.7.65~~~dgcE~~~Probable diguanylate cyclase DgcE~~~COG3447
MSKQSQHVLIALPHPLLHLVSLGLVSFIFTLFSLELSQFGTQLAPLWFPTSIMMVAFYRHAGRMWPGIALSCSLGNIAAS
ILLFSTSSLNMTWTTINIVEAVVGAVLLRKLLPWYNPLQNLADWLRLALGSAIVPPLLGGVLVVLLTPGDDPLRAFLIWV
LSESIGALALVPLGLLFKPHYLLRHRNPRLLFESLLTLAITLTLSWLSMLYLPWPFTFIIVLLMWSAVRLPRMEAFLIFL
TTVMMVSLMMAADPSLLATPRTYLMSHMPWLPFLLILLPANIMTMVMYAFRAERKHISESETHFRNAMEYSAIGMALVGT
EGQWLQTNKALCQFLGYSQEELRGLTFQQLTWPEDLNKDLQQVEKLISGEINTYSMEKRYYNRNGDVVWALLAVSLVRHT
DGTPLYFIAQIEDINELKRTEQVNQQLMERITLANEAGGIGIWEWELKPNIFSWDKRMFELYEIPPHIKPNWQVWYECVL
PEDRQHAEKVIRDSLQSRSPFKLEFRITVKDGIRHIRALANRVLNKEGEVERLLGINMDMTEVKQLNEALFQEKERLHIT
LDSIGEAVVCIDMAMKITFMNPVAEKMSGWTQEEALGVPLLTVLHITFGDNGPLMENIYSADTSRSAIEQDVVLHCRSGG
SYDVHYSITPLSTLDGSNIGSVLVIQDVTESRKMLRQLSYSASHDALTHLANRASFEKQLRILLQTVNSTHQRHALVFID
LDRFKAVNDSAGHAAGDALLRELASLMLSMLRSSDVLARLGGDEFGLLLPDCNVESARFIATRIISAVNDYHFIWEGRVH
RVGASAGITLIDDNNHQAAEVMSQADIACYASKNGGRGRVTVYEPQQAAAHSERAAMSLDEQWRMIKENQLMMLAHGVAS
PRIPEARNLWLISLKLWSCEGEIIDEQTFRRSFSDPALSHALDRRVFHEFFQQAAKAVASKGISISLPLSVAGLSSATLV
NDLLEQLENSPLPPRLLHLIIPAEAILDHAESVQKLRLAGCRIVLSQVGRDLQIFNSLKANMADYLLLDGELCANVQGNL
MDEMLITIIQGHAQRLGMKTIAGPVVLPLVMDTLSGIGVDLIYGEVIADAQPLDLLVNSSYFAIN
>P76237 2.7.7.65~~~dgcJ~~~Probable diguanylate cyclase DgcJ~~~COG2199
MKLHHRMLRHFIAASVIVLTSSFLIFELVASDRAMSAYLRYIVQKADSSFLYDKYQNQSIAAHVMRALAAEQSEVSPEQR
RAICEAFESANNTHGLNLTAHKYPGLRGTLQTASTDCDTIVEAAALLPAFDQAVEGNRHQDDYGSGLGMAEEKFHYYLDL
NDRYVYFYEPVNVEYFAMNNWSFLQSGSIGIDRKDIEKVFTGRTVLSSIYQDQRTKQNVMSLLTPVYVAGQLKGIVLLDI
NKNNLRNIFYTHDRPLLWRFLNVTLTDTDSGRDIIINQSEDNLFQYVSYVHDLPGGIRVSLSIDILYFITSSWKSVLFWI
LTALILLNMVRMHFRLYQNVSRENISDAMTGLYNRKILTPELEQRLQKLVQSGSSVMFIAIDMDKLKQINDTLGHQEGDL
AITLLAQAIKQSIRKSDYAIRLGGDEFCIILVDSTPQIAAQLPERIEKRLQHIAPQKEIGFSSGIYAMKENDTLHDAYKA
SDERLYVNKQNKNSRS
>P77302 2.7.7.65~~~dgcM~~~Diguanylate cyclase DgcM~~~COG3706
MITHNFNTLDLLTSPVWIVSPFEEQLIYANSAAKLLMQDLTFSQLRTGPYSVSSQKELPKYLSDLQNQHDIIEILTVQRK
EEETALSCRLVLRKLTETEPVIIFEGIEAPATLGLKASRSANYQRKKQGFYARFFLTNSAPMLLIDPSRDGQIVDANLAA
LNFYGYNHETMCQKHTWEINMLGRRVMPIMHEISHLPGGHKPLNFVHKLADGSTRHVQTYAGPIEIYGDKLMLCIVHDIT
EQKRLEEQLEHAAHHDAMTGLLNRRQFYHITEPGQMQHLAIAQDYSLLLIDTDRFKHINDLYGHSKGDEVLCALARTLES
CARKGDLVFRWGGEEFVLLLPRTPLDTALSLAETIRVSVAKVSISGLPRFTVSIGVAHHEGNESIDELFKRVDDALYRAK
NDGRNRVLAA
>P46139 2.7.7.65~~~dgcN~~~Diguanylate cyclase DgcN~~~COG2199
MMDNDNSLNKRPTFKRALRNISMTSIFITMMLIWLLLSVTSVLTLKQYAQKNLALTAATMTYSLEAAVVFADGPAATETL
AALGQQGQFSTAEVRDKQQNILASWHYTRKDPGDTFSNFISHWLFPAPIIQPIRHNGETIGEVRLTARDSSISHFIWFSL
AVLTGCILLASGIAITLTRHLHNGLVEALKNITDVVHDVRSNRNFSRRVSEERIAEFHRFALDFNSLLDEMEEWQLRLQA
KNAQLLRTALHDPLTGLANRAAFRSGINTLMNNSDARKTSALLFLDGDNFKYINDTWGHATGDRVLIEIAKRLAEFGGLR
HKAYRLGGDEFAMVLYDVQSESEVQQICSALTQIFNLPFDLHNGHQTTMTLSIGYAMTIEHASAEKLQELADHNMYQAKH
QRAEKLVR
>P76245 2.7.7.65~~~dgcP~~~Diguanylate cyclase DgcP~~~COG2199
MSDQIIARVSQSLAKEQSLESLVRQLLEMLEMVTDMESTYLTKVDVEARLQHIMFARNSQKMYIPENFTVSWDYSLCKRA
IDENCFFSDEVPDRWGDCIAARNLGITTFLSTPIHLPDGSFYGTLCAASSEKRQWSERAEQVLQLFAGLIAQYIQKEALV
EQLREANAALIAQSYTDSLTGLPNRRAIFENLTTLFSLARHLNHKIMIAFIDLDNFKLINDRFGHNSGDLFLIQVGERLN
TLQQNGEVIGRLGGDEFLVVSLNNENADISSLRERIQQQIRGEYHLGDVDLYYPGASLGIVEVDPETTDADSALHAADIA
MYQEKKHKQKTPFVAHPALHS
>A0A0H2ZJS2 2.7.7.65~~~dgcP~~~Diguanylate cyclase DgcP~~~
MSRDDVQRWKDKYLENIEQQERLQRRWDARIDLLRRGLVRSSLAAEGSDKAVDQCMKELREILRRDDMDAGLSGLIPRLE
KAVLDSEQRRQQRTQQNIDALGELAQQLLALDLPRELRKPLKQFARDIEERARQSREIPILLSELSRLQRQALAERKGGD
AEDGRPSLLQRLFGGKESETAAEPSASVPSVVAASNTPIQPAAAAPSLPVAEHDEAPGGPPQPLPASRVAAIESAPAGWV
GVAERGEPNQILLDEPREIWLDSLPLPAGLSFSETLEDAGAESPPAVPADVESAPEAPATPVDNLDGQAVDEAYELPPPI
PEPGYSAVAPHIEASLLRLLDGLSLPSSHQPQAEALRERIDGSLNWYELVPVLDDLAVLVLSLADSGQRDFEEYLRQLNE
RLESFLGHLGDAHAGYTDVLDNARGFDQSLREQVSGLQASVQQATDLNSLKLAVDSRLNGLLASMDEHQREQAEHEQEVS
GRLQALMERVNSMEQDAKAFHSHLEDQRQKALTDPLTGLPNRAALSERLEQEVARRHRDGGDLLLAVLDIDHFKRINDDF
GHLAGDKVLKIIAGELRKRLRQADFIARFGGEEFVVLLPATSLEAGRQLLERLRAAIAACPFHFKGEPLSITCSAGITAF
EGNEAGEVVFERADQALYRAKRAGRDRLEVA
>Q9HT84 2.7.7.65~~~dgcP~~~Diguanylate cyclase DgcP~~~
MSRDDVQRWKDKYLENIEQQERLQRRWDARIDLLRRGLVRSSLAAEGSDKAVDQCMKELREILRRDDMDAGLSGLIPRLE
KAVLDSEQRRQQRTQQNIDALGELAQQLLALDLPRELRKPLKQFARDIEERARQSREIPILLSELSRLQRQALAERKGGD
AEDGRPSLLQRLFGGKESETTAEPSASVPSVVAASNTPIQPAAAAPSLPVAEHDEAPGGPPQPLPARTVAAIESAPAGWV
GVAERGEPNQILLDEPREIWLDSLPLPAGLSFSETLEEAGAEPSPAMPADVESAPEAPATPVDNLDGQAVDEAYELPPPI
PEPGYSAVAPHIEASLLRLLDGLSLPSSHQPQAEALRERIDGSLNWYELVPVLDDLAVLVLSLADSGQRDFEEYLRQLNE
RLESFLGHLGDAHAGYTDVLDNARGFDQSLREQVSGLQASVQQATDLNSLKLAVDSRLNGLLASMDEHQREQAEHEQEVS
GRLQALMERVNSMEQDAKAFHSHLEDQRQKALTDPLTGLPNRAALSERLEQEVARRHRDGGDLLLAVLDIDHFKRINDDF
GHLAGDKVLKIIAGELRKRLRQADFIARFGGEEFVVLLPATSLEAGRQLLERLRAAIAACPFHFKGEPLSITCSAGITAF
EGNEAGEAVFERADQALYRAKRAGRDRLEVA
>P76330 2.7.7.65~~~dgcQ~~~Probable diguanylate cyclase DgcQ~~~COG3706
MQHETKMENQSWLKKLARRLGPGHVVNLCFIVVLLFSTLLTWREVVVLEDAYISSQRNHLENVANALDKHLQYNVDKLIF
LRNGMREALVAPLDFTSLRDAVTEFEQHRDEHAWKIELNRRRTLPVNGVSDALVSEGNLLSRENESLDNEITAALEVGYL
LRLAHNSSSMVEQAMYVSRAGFYVSTQPTLFTRNVPTRYYGYVTQPWFIGHSQRENRHRAVRWFTSQPEHASNTEPQVTV
SVPVDSNNYWYGVLGMSIPVRTMQQFLRNAIDKNLDGEYQLYDSKLRFLTSSNPDHPTGNIFDPRELALLAQAMEHDTRG
GIRMDSRYVSWERLDHFDGVLVRVHTLSEGVRGDFGSISIALTLLWALFTTMLLISWYVIRRMVSNMYVLQSSLQWQAWH
DTLTRLYNRGALFEKARPLAKLCQTHQHPFSVIQVDLDHFKAINDRFGHQAGDRVLSHAAGLISSSLRAQDVAGRVGGEE
FCVILPGASLTEAAEVAERIRLKLNEKEMLIAKSTTIRISASLGVSSSEETGDYDFEQLQSLADRRLYLAKQAGRNRVFA
SDNA
>Q8EGF8 2.7.7.65~~~dgcS~~~Diguanylate cyclase DgcS~~~COG2199
MDFGLATTLYPDEYNYQTDAYSPTSSPVDLVQVIQQLHASLDPRTVFACYGKVLGQHLPIQGVRLQSEQHKLSWGKRYGI
SLKRQIICGGTPLTLQYQLLTPLTPSQSICLQEIEPLLLQPLLNAMQYQEMSMQAMFDALTGLGNRHYYSQSLKNAVARA
QRKQGSVSLIVLDLDNFKKLNDKYGHKCGDYILKEFGDIIRSSIRSTDQAFRIGGDEFVVIVQGNIHAAGLLCERIVSAT
NTHASFHQFGVSCSLGAAEASETMEAEQLYEQADKTLYQAKASGRNCYKLSPTQLS
>P75908 2.7.7.65~~~dgcT~~~Probable diguanylate cyclase DgcT~~~COG3706
MEKDYLRISSTVLVSLLFGLALVLVNSWFNQPGVEEVVPRSTYLMVMIALFFIDTVAFIFMQLYFIYDRRQFSNCVLSLA
FLSCLIYFVITVIIIQQIIEERLTSSVVQNDIAIYYLFRQMSLCILIFLALVNKVSENTKQRNLFSKKMTLCISLFFVFG
GPIVAHILSSHYESYNLHIAELTNENGQVVWKASYVTIMIFMWLTLLSVNLYFNGLRYDIWNGVTVIAFCAVLYNISLLF
MSRYSVSTWYISRTIEVVSKLTVMVIFMCHIFSALRVTKNIAHRDPLTNIFNRNYFFNELTVQSASAQKTPYCVMIMDID
HFKKVNDTWGHPVGDQVIKTVVNIIGKSIRPDDLLARVGGEEFGVLLTDIDTERAKALAERIRENVERLTGDNPEYAIPQ
KVTISIGAVVTQENALNPNEIYRLADNALYEAKETGRNKVVVRDVVNFCESP
>P31129 2.7.7.65~~~dgcZ~~~Diguanylate cyclase DgcZ~~~COG3706
MIKKTTEIDAILLNLNKAIDAHYQWLVSMFHSVVARDASKPEITDNHSYGLCQFGRWIDHLGPLDNDELPYVRLMDSAHQ
HMHNCGRELMLAIVENHWQDAHFDAFQEGLLSFTAALTDYKIYLLTIRSNMDVLTGLPGRRVLDESFDHQLRNAEPLNLY
LMLLDIDRFKLVNDTYGHLIGDVVLRTLATYLASWTRDYETVYRYGGEEFIIIVKAANDEEACRAGVRICQLVDNHAITH
SEGHINITVTAGVSRAFPEEPLDVVIGRADRAMYEGKQTGRNRCMFIDEQNVINRV
>A0A0H3AFM6 2.7.7.65~~~~~~Diguanylate cyclase~~~COG2199
MKNWLCQAVRGEPMIELNRIEELFDNQQFSLHELVLNELGVYVFVKNRRGEYLYANPLTLKLFETNAQSLLGKTDHDFFH
DDQLSDILAADQQVFETRLSVVHEERAIAKSNGLVRIYRAVKHPILHRVTGEVIGLIGVSTDITDIVELREQLYQLANTD
SLTQLCNRRKLWADFRAAFARAKRLRQPLSCISIDIDNFKLINDQFGHDKGDEVLCFLAKLFQSVISDHHFCGRVGGEEF
IIVLENTHVETAFHLAEQIRQRFAEHPFFEQNEHIYLCAGVSSLHHGDHDIADIYRRSDQALYKAKRNGRNRCCIYRQST
E
>P16932 4.1.1.64~~~dgdA~~~2,2-dialkylglycine decarboxylase~~~COG0160
MSLNDDATFWRNARQHLVRYGGTFEPMIIERAKGSFVYDADGRAILDFTSGQMSAVLGHCHPEIVSVIGEYAGKLDHLFS
GMLSRPVVDLATRLANITPPGLDRALLLSTGAESNEAAIRMAKLVTGKYEIVGFAQSWHGMTGAAASATYSAGRKGVGPA
AVGSFAIPAPFTYRPRFERNGAYDYLAELDYAFDLIDRQSSGNLAAFIAEPILSSGGIIELPDGYMAALKRKCEARGMLL
ILDEAQTGVGRTGTMFACQRDGVTPDILTLSKTLGAGLPLAAIVTSAAIEERAHELGYLFYTTHVSDPLPAAVGLRVLDV
VQRDGLVARANVMGDRLRRGLLDLMERFDCIGDVRGRGLLLGVEIVKDRRTKEPADGLGAKITRECMNLGLSMNIVQLPG
MGGVFRIAPPLTVSEDEIDLGLSLLGQAIERAL
>D4GJ14 4.2.1.-~~~rspA~~~D-galactonate dehydratase family member RspA~~~COG4948
MSNLFITNVKTILTAPGGIDLVVVKIETNEPGLYGLGCATFTQRIYAVQSAIDEYLAPFLIGKDPARIEDIWQSAAVSGY
WRNGPVMNNALSGIDMALWDIKGKQAGLPVYELLGGKCRDGIALYVHTDGADEVEVEDSARAKMEEGYQYIRCQMGMYGG
AGTDDLRLIANRMVKAKNIQPKRSPRTKAPGIYFDPEAYAKSIPRLFDHLRNKLGFSVELLHDAHERITPINAIHMAKAL
EPYQLFFLEDPVAPENTEWLKMLRQQSSTPIAMGELFVNVNEWKPLIDNKLIDYIRCHISSIGGITPAKKIAIYSELNGV
RTAWHSPGDISPIGVCANMHLDLSSPNFGIQEYTPMNDALREVFPGCPEVDQGYAYVNDKPGLGIDINEALAAKFPCEGG
NPTWTMARTPDGTVWRP
>B5R541 4.2.1.-~~~~~~D-galactonate dehydratase family member SEN1436~~~
MKVSNLKITNVKTILTAPGGIDLAVVKIETNEPGLYGLGCATFTQRIFAVKSAIDEYMAPFLVGKDPTRIEDIWQSGVVS
GYWRNGPIMNNALSGVDMALWDIKGKLAGMPVYDLLGGKCRDGIPLYCHTDGGDEVEVEDNIRARMEEGYQYVRCQMGMY
GGAGTDDLKLIATQLARAKNIQPKRSPRSKTPGIYFDPDAYAKSVPRLFDHLRNKLGFGIEFIHDVHERVTPVTAINLAK
TLEQYQLFYLEDPVAPENIDWLKMLRQQSSTPISMGELFVNVNEWKPLIDNRLIDYIRCHVSTIGGITPARKLAVYSELN
GVRTAWHGPGDISPVGVCANMHLDLSSPNFGIQEYTPMNDALRDVFPGCPEIDHGYAYLNDKPGLGIDIDEAKAAKYPCE
GGIPSWTMARTPDGTASRP
>B5QBD4 4.2.1.-~~~~~~D-galactonate dehydratase family member SeV_A0456~~~
MSNLKITNVKTILTAPGGIDLAVVKIETNEPGLYGLGCATFTQRIFAVKSAIDEYMAPFLVGKDPTRIEDIWQSGVVSGY
WRNGPIMNNALSGVDMALWDIKGKLAGMPVYDLLGGKCRDGIPLYCHTDGGDEVEVEDNIRARMEEGYQYVRCQMGMYGG
AGTDDLKLIATQLARAKNIQPKRSPRSKTPGIYFDPDAYAKSVPRLFDHLRNKLGFGIEFIHDVHERVTPVTAINLAKTL
EQYQLFYLEDPVAPENIDWLKMLRQQSSTPISMGELFVNVNEWKPLIDNRLIDYIRCHVSTIGGITPAKKLAVYSELNGV
RTAWHGPGDISPVGVCANMHLDLSSPNFGIQEYTPMNDALRDVFPGCPEIDHGYAYLNDKPGLGIDIDEAKAAKYPCEGG
IPSWTMARTPDGTASRP
>D7BPX0 4.2.1.-~~~~~~D-galactonate dehydratase family member SBI_01856~~~COG4948
MSRPAHKTDTIVAVDVLVTSPGRNFVALKITTEQGLVGWGDATLNGRELAVASYLRDHVAPLLIGRDPARIEDTWQYLYR
GAYWRRGPVTMTSIGAVDLALWDIKGKATGQPVYQLLGGAVRDRILTYTHASGWEIPQLLDAVDERREQGFLAVRAQSGI
PGLATVYGVSSGEAGYEPADRGAAPAVEVWDTDSYLRHAPRVLAAVREHVGPELKLLHDAHHRLTPGQAARLGRALEEVD
LYWLEDVTPAENQEVLRHIRHHTTVPLAIGEVFNTVWECQTLITEQLIDFVRTCVTHAGGISHLRRIAALAEVWQVRLGP
HGPSDVSPVALAASLHVGLATPNFAIQEYMGYEPVVHEVFRHAWSYADGHLHPGDQPGLGVEVDEALAARFPYEPAYLPI
ARRRDGSMTDW
>Q74HC3 2.7.1.76~~~~~~Deoxyadenosine kinase~~~COG1428
MTVIVLSGPIGAGKSSLTSLLAEHLGTQAFYEGVDNNPILPLYYKDMAHYTFLLNTYLLNHRLAQINQAIRDHNSVSDRS
IYEDALFFKMNVDSGIADPTEFKIYDSLLENMMEQAPGNPSKKPDLLIYIHVSLDTMLHRIQKRGRKFEQLSTDPSLKDY
YARLLSYYEPWYEKYNASPKMMIDGDKYDFVANEDARRKVINAIDQKLIDIGNLN
>Q74HC2 2.7.1.113~~~~~~Deoxyguanosine kinase~~~COG1428
MTVIVLSGPIGAGKSSLTGILSKYLGTNPFYESVDDNPVLPLFYENPKKYAFLLQVYFLNTRFRSIKSALTDDNNVLDRS
IYEDALFFQMNADIGRATPEEVDTYYELLHNMMSELDRMPKKNPDLLVHIDVSYDTMLKRIQKRGRNYEQLSYDPTLEDY
YKRLLRYYKPWYEKYDYSPKMTIDGDKLDFMASEEDRQEVLNQIVAKLKEMGKFEDDWKPNLVK
>P37530 2.7.1.113~~~dgk~~~Deoxyguanosine kinase~~~COG1428
MNTAPFIAIEGPIGAGKTTLATMLSQKFGFPMINEIVEDNPYLDKFYDNIKEWSFQLEMFFLCHRYKQLEDTSDHFLKKG
QPVIADYHIYKNVIFAERTLSPHQLEKYKKIYHLLTDDLPKPNFIIYIKASLPTLLHRIEKRGRPFEKKIETSYLEQLIS
DYEVAIKQLQEADPELTVLTVDGDSKDFVLNKSDFERIAAHVKELIV
>Q6BF16 4.1.2.21~~~dgoA~~~2-dehydro-3-deoxy-6-phosphogalactonate aldolase~~~COG0800
MQWQTKLPLIAILRGITPDEALAHVGAVIDAGFDAVEIPLNSPQWEQSIPAIVDAYGDKALIGAGTVLKPEQVDALARMG
CQLIVTPNIHSEVIRRAVGYGMTVCPGCATATEAFTALEAGAQALKIFPSSAFGPQYIKALKAVLPSDIAVFAVGGVTPE
NLAQWIDAGCAGAGLGSDLYRAGQSVERTAQQAAAFVKAYREAVQ
>Q6BF17 4.2.1.6~~~dgoD~~~D-galactonate dehydratase~~~COG4948
MKITKITTYRLPPRWMFLKIETDEGVVGWGEPVIEGRARTVEAAVHELGDYLIGQDPSRINDLWQVMYRAGFYRGGPILM
SAIAGIDQALWDIKGKVLNAPVWQLMGGLVRDKIKAYSWVGGDRPADVIDGIKTLREIGFDTFKLNGCEELGLIDNSRAV
DAAVNTVAQIREAFGNQIEFGLDFHGRVSAPMAKVLIKELEPYRPLFIEEPVLAEQAEYYPKLAAQTHIPLAAGERMFSR
FDFKRVLEAGGISILQPDLSHAGGITECYKIAGMAEAYDVTLAPHCPLGPIALAACLHIDFVSYNAVLQEQSMGIHYNKG
AELLDFVKNKEDFSMVGGFFKPLTKPGLGVEIDEAKVIEFSKNAPDWRNPLWRHEDNSVAEW
>B2UCA8 4.2.1.6~~~dgoD~~~D-galactonate dehydratase~~~COG4948
MKITRLTTYRLPPRWMFLKVETDEGVTGWGEPVIEGRARTVEAAVHELSDYLIGQDPSRINDLWQTMYRAGFYRGGPILM
SAIAGIDQALWDIKGKVLGVPVYELLGGLVRDKMRTYSWVGGDRPADVIAGMKALQAGGFDHFKLNGCEEMGIIDTSRAV
DAAVARVAEIRSAFGNTVEFGLDFHGRVSAPMAKVLIKELEPYRPLFIEEPVLAEQAETYARLAAHTHLPIAAGERMFSR
FDFKRVLEAGGVSILQPDLSHAGGITECVKIAAMAEAYDVALAPHCPLGPIALAACLHVDFVSWNATLQEQSMGIHYNKG
AELLDYVRNKADFALEGGYIRPPRLPGLGVDIDEALVIERSKEAPDWRNPVWRHADGSVAEW
>P31459 2.7.1.58~~~dgoK~~~2-dehydro-3-deoxygalactonokinase~~~COG3734
MTARYIAIDWGSTNLRAWLYQGDHCLESRQSEAGVTRLNGKSPAAVLAEVTTDWREEKTPVVMAGMVGSNVGWKVAPYLS
VPACFSSIGEQLTSVGDNIWIIPGLCVSHDDNHNVMRGEETQLIGARALAPSSLYVMPGTHCKWVQADSQQINDFRTVMT
GELHHLLLNHSLIGAGLPPQENSADAFTAGLERGLNTPAILPQLFEVRASHVLGTLPREQVSEFLSGLLIGAEVASMRDY
VAHQHAITLVAGTSLTARYQQAFQAMGCDVTAVAGDTAFQAGIRSIAHAVAN
>Q92RN7 2.7.1.58~~~dgoK1~~~Probable 2-dehydro-3-deoxygalactonokinase DgoK1~~~COG3734
MTTAGYYAAVDWGTSSFRLWIIGEDGAVLAERRSAEGMTTAAKTGFHTILDGHLAAVSAPAHLPIIICGMAGARQGWKEA
GYIETPAALAEIAGRATAIPDVDRDIRILPGLAQRDRRHPDVMRGEETQLLGAAAHLGAGSHLVCMPGTHSKWVRLADDR
VEGFSTFMTGELFDTIARHTILSHAVAEADTFAAGSAAFTDAVSRTRENPALATNLLFSVRAGQLLHGTAAADARAQLSG
TLIGLEIAGALAGSGSVDGVCLVGSGGLGTLYRTALESQGLNVRAVDADEAVRAGLSAAARAIWPL
>P31460 ~~~dgoR~~~Galactonate operon transcriptional repressor~~~COG2186
MTLNKTDRIVITLGKQIVHGKYVPGSPLPAEAELCEEFATSRNIIREVFRSLMAKRLIEMKRYRGAFVAPRNQWNYLDTD
VLQWVLENDYDPRLISAMSEVRNLVEPAIARWAAERATSSDLAQIESALNEMIANNQDREAFNEADIRYHEAVLQSVHNP
VLQQLSIAISSLQRAVFERTWMGDEANMPQTLQEHKALFDAIRHQDGDAAEQAALTMIASSTRRLKEIT
>P0AA76 ~~~dgoT~~~D-galactonate transporter~~~COG2271
MDIPVNAAKPGRRRYLTLVMIFITVVICYVDRANLAVASAHIQEEFGITKAEMGYVFSAFAWLYTLCQIPGGWFLDRVGS
RVTYFIAIFGWSVATLFQGFATGLMSLIGLRAITGIFEAPAFPTNNRMVTSWFPEHERASAVGFYTSGQFVGLAFLTPLL
IWIQEMLSWHWVFIVTGGIGIIWSLIWFKVYQPPRLTKGISKAELDYIRDGGGLVDGDAPVKKEARQPLTAKDWKLVFHR
KLIGVYLGQFAVASTLWFFLTWFPNYLTQEKGITALKAGFMTTVPFLAAFVGVLLSGWVADLLVRKGFSLGFARKTPIIC
GLLISTCIMGANYTNDPMMIMCLMALAFFGNGFASITWSLVSSLAPMRLIGLTGGVFNFAGGLGGITVPLVVGYLAQGYG
FAPALVYISAVALIGALSYILLVGDVKRVG
>Q9KQL9 ~~~~~~Deoxyguanosinetriphosphate triphosphohydrolase-like protein 1~~~COG0232
MQVSLNPEWLARNNDEHKIRRNDHRSPFQRDRARILHSAAFRRLQAKTQVHGTSLNDFHRTRLTHSLEAAQIGTGIVAQI
KLKQPEFRELLPSDSLIDSLCLAHDIGHPPYGHGGEIALNYMMRDHGGFEGNAQTFRIVTSLEPYTEHHGMNLSRRTLLG
LLKYPALLSATRAAIPPPAVAHQRQLKAKDWSPAKGIYDCDLASLDWVLEPLCESDRELLGQMRAEPSSPKEHRKTRFKS
LDCSIMELADDIAYGVHDLEDAIVLGMVTRAQWQEAAAAQLAECGDPWFEEHIAELSEMLFSGKHYVRKDAIGGIVNALL
TSISVKPVEAPFHNELLAFNAYIEPHMGNALEVLKHFVSQYVIQIPQVQRFEYKGQQLIMDLFEALSADPERLLPQATGE
KWRKAQEQDEGMRVICDYIAAMTDAYAQRLHQQLFSAQSHY
>P9WNY7 ~~~dgt~~~Deoxyguanosinetriphosphate triphosphohydrolase-like protein~~~COG0232
MSASEHDPYDDFDRQRRVAEAPKTAGLPGTEGQYRSDFARDRARVLHSAALRRLADKTQVVGPREGDTPRTRLTHSLEVA
QIGRGMAIGLGCDLDLVELAGLAHDIGHPPYGHNGERALDEVAASHGGFEGNAQNFRILTSLEPKVVDAQGLSAGLNLTR
ASLDAVTKYPWMRGDGLGSQRRKFGFYDDDRESAVWVRQGAPPERACLEAQVMDWADDVAYSVHDVEDGVVSERIDLRVL
AAEEDAAALARLGEREFSRVSADELMAAARRLSRLPVVAAVGKYDATLSASVALKRLTSELVGRFASAAIATTRAAAGPG
PLVRFRADLQVPDLVRAEVAVLKILALQFIMSDPRHLETQARQRERIHRVAHRLYSGAPQTLDPVYAAAFNTAADDAARL
RVVVDQIASYTEGRLERIDADQLGVSRNALD
>P15723 3.1.5.1~~~dgt~~~Deoxyguanosinetriphosphate triphosphohydrolase~~~COG0232
MAQIDFRKKINWHRRYRSPQGVKTEHEILRIFESDRGRIINSPAIRRLQQKTQVFPLERNAAVRTRLTHSMEVQQVGRYI
AKEILSRLKELKLLEAYGLDELTGPFESIVEMSCLMHDIGNPPFGHFGEAAINDWFRQRLHPEDAESQPLTDDRCSVAAL
RLRDGEEPLNELRRKIRQDLCHFEGNAQGIRLVHTLMRMNLTWAQVGGILKYTRPAWWRGETPETHHYLMKKPGYYLSEE
AYIARLRKELNLALYSRFPLTWIMEAADDISYCVADLEDAVEKRIFTVEQLYHHLHEAWGQHEKGSLFSLVVENAWEKSR
SNSLSRSTEDQFFMYLRVNTLNKLVPYAAQRFIDNLPAIFAGTFNHALLEDASECSDLLKLYKNVAVKHVFSHPDVERLE
LQGYRVISGLLEIYRPLLSLSLSDFTELVEKERVKRFPIESRLFHKLSTRHRLAYVEAVSKLPSDSPEFPLWEYYYRCRL
LQDYISGMTDLYAWDEYRRLMAVEQ
>Q9I4L1 3.1.5.1~~~dgt~~~Probable deoxyguanosinetriphosphate triphosphohydrolase~~~
MPGAVDFKERISRQRPHDRETYGHAGNTDLQDIVYQLESDRGRIVNSAAVRRLQQKTQVFPLERNAAVRSRLTHSLEVQQ
TGRFIVRTLFRQLGPRAAEVGLDGLEGALESLVEMACLMHDVGNPPFGHFGEYAINDWFERNLDALFERRIPPGQGDGLL
QQRMLTDLKHFEGNAQAIRLVVKLLRLNLTYTQTAGLLKYVRPAYEPKPDKAAANHYLNKKPGFYLSEEAFVDELRRVLG
MRPGTRHPVAYIMEAADDISYCLADIEDSVEKGILDIRQLADLLVKKFAVHHSPDAPIPGDADNMSFQRMVDYSLEKAER
EPINKVSEFFIRLRVKMIHPLVQHAAQQFIDNFEAVHAGTLGRALMEDGSLPHAIVQTFKDVAMEWVFCHPEVETLELQG
YRIIQGLLDFYAPLLRLPAEEFQALAEGRQAAAPHPQLLVRRLPSQQIKAYLEAMKGVAEDPLQRQWEFYHRCRMLQDFV
SGMTDQHAQDEYRALSAL
>Q59827 3.1.5.1~~~dgt~~~Deoxyguanosinetriphosphate triphosphohydrolase~~~
MAQIDFRKKINWHRRYRSPQGVKTEHEILRIFESDRGRIINSPAIRRLQQKTQVFPLERNAAVRTRLTHSMEVQQVGRYI
AKEILSRMKELKLLEAYGLDELTGPFESIVEMSCLMHDIGNPPFGHFGEAAINDWFRQRLHPEDAESQPLTDDRCRVRGL
RLRDGEEPLNELRRKIRQDLCHFEGNAQGIRLVHTLMRMNLTWAQVGGILKYTRPAWWRGETPETHHYLMKKPGYYLSEE
AYIARLRKELNLALYSRFPLTWIMEAADDISYCVADLEDAVEKRIYTLEQLYHHLHDAWGQHEKGSLFSLVVENAWEKSR
SNSLSRSTEDQFFMYLRVNTLNKLVPYAAQRFIDNLPAIFAGTFNHALLEDASECSDLLKLYKNVAVKHVFSHPDVERLE
LQGYRVISGLLEIYRPLLSLSLSDFTELVEKERVKRFPIESRLFHKLSTPHRLAYVEAVSKLPSDSPEFPLWEYYYRCRL
LQDYISGMTDLYAWDEYRRLMAVEQ
>P09788 1.17.9.1~~~pchF~~~4-cresol dehydrogenase [hydroxylating] flavoprotein subunit~~~
MSEQNNAVLPKGVTQGEFNKAVQKFRALLGDDNVLVESDQLVPYNKIMMPVENAAHAPSAAVTATTVEQVQGVVKICNEH
KIPIWTISTGRNFGYGSAAPVQRGQVILDLKKMNKIIKIDPEMCYALVEPGVTFGQMYDYIQENNLPVMLSFSAPSAIAG
PVGNTMDRGVGYTPYGEHFMMQCGMEVVLANGDVYRTGMGGVPGSNTWQIFKWGYGPTLDGMFTQANYGICTKMGFWLMP
KPPVFKPFEVIFEDEADIVEIVDALRPLRMSNTIPNSVVIASTLWEAGSAHLTRAQYTTEPGHTPDSVIKQMQKDTGMGA
WNLYAALYGTQEQVDVNWKIVTDVFKKLGKGRIVTQEEAGDTQPFKYRAQLMSGVPNLQEFGLYNWRGGGGSMWFAPVSE
ARGSECKKQAAMAKRVLHKYGLDYVAEFIVAPRDMHHVIDVLYDRTNPEETKRADACFNELLDEFEKEGYAVYRVNTRFQ
DRVAQSYGPVKRKLEHAIKRAVDPNNILAPGRSGIDLNNDF
>P99151 1.4.1.1~~~ald1~~~Alanine dehydrogenase 1~~~
MLVAVVKELKQGEGRVACTPENVRKLTDAGHKVIVEKNAGIGSGFSNDMYEKEGAKIVTHEQAWEADLVIKVKEPHESEY
QYFKKNQIIWGFLHLASSKEIVEKMQEVGVTAISGETIIKNGKAELLAPMSAIAGQRSAIMGAYYSEAQHGGQGTLVTGV
HENVDIPGSTYVIFGGGVAATNAANVALGLNAKVIIIELNDDRIKYLEDMYAEKDVTVVKSTPENLAEQIKKADVFISTI
LIPGAKPPKLVTREMVKSMKKGSVLIDIAIDQGGTIETIRPTTISDPVYEEEGVIHYGVPNQPGAVPRTSTMALAQGNID
YILEICDKGLEQAIKDNEALSTGVNIYQGQVTNQGLASSHDLDYKEILNVIE
>P46368 1.2.1.3~~~acoD~~~Acetaldehyde dehydrogenase 2~~~COG1012
MNMAEIAQLGVSNPYKQQYENYIGGAWVPPAGGEYFESTTPITGKPFTRVPRSGQQDVDAALDAAHAAKAAWARTSTTER
ANILNRIADRIEANLKLLAVAESIDNGKPVRETTAADLPLAVDHFRYFAGCIRAQEGGISEIDADTIAYHFHEPLGVVGQ
IIPWNFPLLMATWKLAPALAAGNCVVLKPAEQTPASILVLMEVIGDLLPPGVVNVINGFGLEAGKPLASSPRISKVAFTG
ETTTGRLIMQYASQNLIPVTLELGGKSPNIFFEDVLAADDAFFDKALEGFAMFALNQGEVCTCPSRALIQESIYDRFMER
ALKRVAAIRQGHPLDTGTMIGAQASAEQLEKILSYIDLGRKEGAQCLTGGERNVLDGDLAGGYYVKPTVFAGHNKMRIFQ
EEIFGPVVSVTTFKDEEEALAIANDTLYGLGAGVWTRDGARAFRMGRGIQAGRVWTNCYHAYPAHAAFGGYKQSGIGREN
HRMMLDHYQQTKNLLVSYSPNALGFF
>Q99TF4 1.4.1.1~~~ald2~~~Alanine dehydrogenase 2~~~
MKIGIPREIKNNENRVGLSPSGVHALVESGHTVLVETNAGSGSFFEDVDYKEAGAEIVAEQAKVWDVDMVIKVKEPLESE
YPYFKEGLVLFTYLHLANEEKLTQALIDRKVISIAYETVQLPDRSSPLLSPMSEVAGRMSAQVGAEFLQKLNGGMGILLG
GVPGVPKGKVTIIGGGQAGTNAAKIALGLGADVTILDVNPKRLQQLDDLFGGRVHTIMSNPLNIELYVKQSDLVIGAVLI
PGAKAPRLVTEDMIKQMKNGSVIIDIAIDQGGIFETTDKITTHDDPTYIKHGVVHYAVANMPGAVPRTSTLALNNATLPY
ALMLANKGYREAFKSNQPLSLGLNTYKGHVTNKGVAEAFEMEYKSVEEALQL
>Q8U671 3.8.1.5~~~dhaA~~~Haloalkane dehalogenase~~~
MKEHRHMTEKSPHSAFGDGAKAYDVPAFGLQIHTVEHGSGAPIVFLHGNPTSSYLWRHIFRRLHGHGRLLAVDLIGYGQS
SKPDIEYTLENQQRYVDAWFDALDLRNVTLVLQDYGAAFGLNWASRNPDRVRAVAFFEPVLRNIDSVDLSPEFVTRRAKL
RQPGEGEIFVQQENRFLTELFPWFFLTPLAPEDLRQYQTPFPTPHSRKAILAGPRNLPVDGEPASTVAFLEQAVNWLNTS
DTPKLLLTFKPGFLLTDAILKWSQVTIRNLEIEAAGAGIHFVQEEQPETIARLLDAWLTRIAGN
>P59337 3.8.1.5~~~dhaA~~~Haloalkane dehalogenase~~~COG0596
MSKPIEIEIRRAPVLGSSMAYRETGAQDAPVVLFLHGNPTSSHIWRNILPLVSPVAHCIAPDLIGFGQSGKPDIAYRFFD
HVRYLDAFIEQRGVTSAYLVAQDWGTALAFHLAARRPDFVRGLAFMEFIRPMPTWQDFHHTEVAEEQDHAEAARAVFRKF
RTPGEGEAMILEANAFVERVLPGGIVRKLGDEEMAPYRTPFPTPESRRPVLAFPRELPIAGEPADVYEALQSAHAALAAS
SYPKLLFTGEPGALVSPEFAERFAASLTRCALIRLGAGLHYLQEDHADAIGRSVAGWIAGIEAVRPQLAA
>Q9ZER0 3.8.1.5~~~dhaAF~~~Haloalkane dehalogenase~~~
MSEIGTGFPFDPHYVEVLGERMHYVDVGPRDGTPVLFLHGNPTSSYLWRNIIPHVAPSHRCIAPDLIGMGKSDKPDLDYF
FDDHVRYLDAFIEALGLEEVVLVIHDWGSALGFHWAKRNPERVKGIACMEFIRPIPTWDEWPEFARETFQAFRTADVGRE
LIIDQNAFIEGALPKFVVRPLTEVEMDHYREPFLKPVDREPLWRFPNELPIAGEPANIVALVEAYMNWLHQSPVPKLLFW
GTPGVLISPAEAARLAESLPNCKTVDIGPGLHFLQEDNPDLIGSEIARWLPALIVGKSIEFDGGWAT
>P9WMR9 3.8.1.5~~~dhaA~~~Haloalkane dehalogenase 3~~~COG0596
MTAFGVEPYGQPKYLEIAGKRMAYIDEGKGDAIVFQHGNPTSSYLWRNIMPHLEGLGRLVACDLIGMGASDKLSPSGPDR
YSYGEQRDFLFALWDALDLGDHVVLVLHDWGSALGFDWANQHRDRVQGIAFMEAIVTPMTWADWPPAVRGVFQGFRSPQG
EPMALEHNIFVERVLPGAILRQLSDEEMNHYRRPFVNGGEDRRPTLSWPRNLPIDGEPAEVVALVNEYRSWLEETDMPKL
FINAEPGAIITGRIRDYVRSWPNQTEITVPGVHFVQEDSPEEIGAAIAQFVRRLRSAAGV
>P0A3G4 3.8.1.5~~~dhaA~~~Haloalkane dehalogenase~~~
MSEIGTGFPFDPHYVEVLGERMHYVDVGPRDGTPVLFLHGNPTSSYLWRNIIPHVAPSHRCIAPDLIGMGKSDKPDLDYF
FDDHVRYLDAFIEALGLEEVVLVIHDWGSALGFHWAKRNPERVKGIACMEFIRPIPTWDEWPEFARETFQAFRTADVGRE
LIIDQNAFIEGALPKCVVRPLTEVEMDHYREPFLKPVDREPLWRFPNELPIAGEPANIVALVEAYMNWLHQSPVPKLLFW
GTPGVLIPPAEAARLAESLPNCKTVDIGPGLHYLQEDNPDLIGSEIARWLPAL
>P0A3G2 3.8.1.5~~~dhaA~~~Haloalkane dehalogenase~~~
MSEIGTGFPFDPHYVEVLGERMHYVDVGPRDGTPVLFLHGNPTSSYLWRNIIPHVAPSHRCIAPDLIGMGKSDKPDLDYF
FDDHVRYLDAFIEALGLEEVVLVIHDWGSALGFHWAKRNPERVKGIACMEFIRPIPTWDEWPEFARETFQAFRTADVGRE
LIIDQNAFIEGALPKCVVRPLTEVEMDHYREPFLKPVDREPLWRFPNELPIAGEPANIVALVEAYMNWLHQSPVPKLLFW
GTPGVLIPPAEAARLAESLPNCKTVDIGPGLHYLQEDNPDLIGSEIARWLPAL
>P59336 3.8.1.5~~~dhaA~~~Haloalkane dehalogenase~~~
MSEIGTGFPFDPHYVEVLGERMHYVDVGPRDGTPVLFLHGNPTSSYLWRNIIPHVAPSHRCIAPDLIGMGKSDKPDLDYF
FDDHVRYLDAFIEALGLEEVVLVIHDWGSALGFHWAKRNPERVKGIACMEFIRPIPTWDEWPEFARETFQAFRTADVGRE
LIIDQNAFIEGVLPKCVVRPLTEVEMDHYREPFLKPVDREPLWRFPNEIPIAGEPANIVALVEAYMNWLHQSPVPKLLFW
GTPGVLIPPAEAARLAESLPNCKTVDIGPGLHYLQEDNPDLIGSEIARWLPGLA
>P0A3G3 3.8.1.5~~~dhaA~~~Haloalkane dehalogenase~~~
MSEIGTGFPFDPHYVEVLGERMHYVDVGPRDGTPVLFLHGNPTSSYLWRNIIPHVAPSHRCIAPDLIGMGKSDKPDLDYF
FDDHVRYLDAFIEALGLEEVVLVIHDWGSALGFHWAKRNPERVKGIACMEFIRPIPTWDEWPEFARETFQAFRTADVGRE
LIIDQNAFIEGALPKCVVRPLTEVEMDHYREPFLKPVDREPLWRFPNELPIAGEPANIVALVEAYMNWLHQSPVPKLLFW
GTPGVLIPPAEAARLAESLPNCKTVDIGPGLHYLQEDNPDLIGSEIARWLPAL
>P45514 4.2.1.30~~~dhaB~~~Glycerol dehydratase large subunit~~~
MRRSKRFEVLAQRPVNQDGLIGEWPEEGLIAMESPYDPASSVKVENGRIVELDGKSRAEFDMIDRFIADYAINVPEAERA
MQLDALEIARMLVDIHVSREEIIAITTAITPAKRLEVMAQMNVVEMMMALQKMRARRTPSNQCHVTNLKDNPVQIAADAA
EAGIRGFSEQETTVGIARYAPFNALALLVGSQCGAPGVLTQCSVEEATELELGMRGLTSYAETVSVYGTESVFTDGDDTP
WSKAFLASAYASRGLKMRYTSGTGSEALMGYSESKSMLYLESRCIFITKGAGVQGLQNGAVSCIGMTGAVPSGIRAVLAE
NLIASMLDLEVASANDQTFSHSDIRRTARTLMQMLPGTDFIFSGYSAVPNYDNMFAGSNFDAEDFDDYNILQRDLMVDGG
LRPVTEEETIAIRNKAARAIQAVFRELGLPLISDEEVDAATYAHGSKDMPARNVVEDLAAVEEMMKRNITGLDIVGALSS
SGFEDIASNILNMLRQRVTGDYLQTSAILDRQFDVVSAVNDINDYQGPGTGYRISAERWAEIKNIAGVVQPGSIE
>Q92EU2 2.7.1.121~~~dhaK-2~~~PTS-dependent dihydroxyacetone kinase 2, dihydroxyacetone-binding subunit DhaK~~~COG2376
MRRLVNDGYEAVEEMLAGYVAAQGKYVDFAENDKRVIVSKQMSEEPRVRIIVGGGSGHEPLFLGYVGKDFADAAVVGNIN
TSPSPEPCYNAVKAVDSGKGCLYMYGNYAGDVMNFDMGAEMAADDGIRVETVLVTDDIYSAENVEDRRGVAGDLIVFKAA
ASAAAKGLDLDAVKQAAEKANANTFSMGVALSSSTLPVTGKAIFEMKEGEMEVGMGIHGEPGIKRTSIEPADKVVDQIMG
YLIEEMKLTAGEEVHVLINGLGGLPVMDQYICYRRVDEILKEKGVHIHSPLVGNYATSMDMIGMSITLVRLDDELKDLLD
TPCDTPYFKVD
>P45510 2.7.1.29~~~dhaK~~~Dihydroxyacetone kinase~~~
MSQFFFNQRTHLVSDVIDGAIIASPWNNLARLESDPAIRIVVRRDLNKNNVAVISGGGSGHEPAHVGFIGKGMLTAAVCG
DVFASPSVDAVLTAIQAVTGEAGCLLIVKNYTGDRLNFGLAAEKARRLGYNVEMLIVGDDISLPDNKHPRGIAGTILVHK
IAGYFAERGYNLATVLREAQYAASNTFSLGVALSSCHLPQETDAAPRHHPGHAELGMGIHGEPGASVIDTQNSAQVVNLM
VDKLLAALPETGRLAVMINNLGGVSVAEMAIITRELASSPLHSRIDWLIGPASLVTALDMKGFSLTAIVLEESIEKALLT
EVETSNWPTPVPPREITCVVSSHASARVEFQPSANALVAGIVELVTATLSDLETHLNALDAKVGDGDTGSTFAAAAREIA
SLLHRQQLPLNNLATLFALIGERLTVVMGGSSGVLMSIFFTAAGQKLEQGANVVEALNTGLAQMKFYGGADEGDRTMIDA
LQPALTSLLAQPKNLQAAFDAAQAGAERTCLSSKANAGRASYLSSESLLGNMDPGAQRLAMVFKALAESELG
>P76015 2.7.1.121~~~dhaK~~~PEP-dependent dihydroxyacetone kinase, dihydroxyacetone-binding subunit DhaK~~~COG2376
MKKLINDVQDVLDEQLAGLAKAHPSLTLHQDPVYVTRADAPVAGKVALLSGGGSGHEPMHCGYIGQGMLSGACPGEIFTS
PTPDKIFECAMQVDGGEGVLLIIKNYTGDILNFETATELLHDSGVKVTTVVIDDDVAVKDSLYTAGRRGVANTVLIEKLV
GAAAERGDSLDACAELGRKLNNQGHSIGIALGACTVPAAGKPSFTLADNEMEFGVGIHGEPGIDRRPFSSLDQTVDEMFD
TLLVNGSYHRTLRFWDYQQGSWQEEQQTKQPLQSGDRVIALVNNLGATPLSELYGVYNRLTTRCQQAGLTIERNLIGAYC
TSLDMTGFSITLLKVDDETLALWDAPVHTPALNWGK
>Q9CIV8 2.7.1.121~~~dhaK~~~PTS-dependent dihydroxyacetone kinase, dihydroxyacetone-binding subunit DhaK~~~COG2376
MSDEKIINQPQDVVSEMLDGLTYAYGDLIEKVPDFEIIQRKSPKSGKVALVSGGGSGHKPAHAGFVGEGMLSAAVCGAIF
TSPTPDQIYEAIKSADEGAGVLLIIKNYLGDVMNFEMAREMAEMEEIKVEQIIVDDDIAVENSLYTQGRRGVAGTVLVHK
ILGAAAHQEASLDEIKDLADKVVKNIKTIGLALSAATVPEVGKPGFVLDDNEIEYGVGIHSEPGYRREKMKTSYELATEL
VGKLKEEFKFEAGQKYGILVNGMGATPLMEQFIFMNDVAKLLTEENIEILFKKVGNYMTSIDMAGLSLTMIKLEDDQWLK
NLNEDVKTISWG
>Q92EU3 2.7.1.121~~~dhaL-2~~~PEP-dependent dihydroxyacetone kinase 2, ADP-binding subunit DhaL~~~COG1461
MSELVMDSAFFGHVLQDMGALIEKERDYLTGLDSDIGDGDHGINLSIGFREVNKQLDELLTVSPDIATLLKKSGMILLGK
VGGASGPLYGSFFMKCGADVPGKTEVNFDELCGMIINGAAAVQHRGKAELGDKTMMDAFLPGVEVLQNRDTNADPIETFS
AFVDAMHAGAQSTIPLIAKKGRALRLGERAIGHLDPGSESSWMLMNVILENLKKAV
>P76014 2.7.1.121~~~dhaL~~~PEP-dependent dihydroxyacetone kinase, ADP-binding subunit DhaL~~~COG1461
MSLSRTQIVNWLTRCGDIFSTESEYLTGLDREIGDADHGLNMNRGFSKVVEKLPAIADKDIGFILKNTGMTLLSSVGGAS
GPLFGTFFIRAAQATQARQSLTLEELYQMFRDGADGVISRGKAEPGDKTMCDVWVPVVESLRQSSEQNLSVPVALEAASS
IAESAAQSTITMQARKGRASYLGERSIGHQDPGATSVMFMMQMLALAAKE
>Q9CIV7 2.7.1.121~~~dhaL~~~PTS-dependent dihydroxyacetone kinase, ADP-binding subunit DhaL~~~COG1461
MLTIDTTIEWLGKFNEKIQENKAYLSELDGPIGDGDHGANMARGMSETMKALEVSNFGNVSEIFKKVAMTLMSKVGGASG
PLYGSAFLAMSKTAIETLDTSELIYAGLEAIQKRGKAQVGEKTMVDIWSAFLNDLQTDSASKDNLEKVVKASAGLLATKG
RASYLGERSIGHIDPGTQSSAYLFETLLEVVA
>Q92ET9 2.7.1.121~~~dhaM-2~~~PEP-dependent dihydroxyacetone kinase 2, phosphoryl donor subunit DhaM~~~COG3412
MISIVLVSHSQKITEGLQEMIVEMVGDTVHIISSGGTGDGRLGTNALMIADNIATCTNSEHIYIFCDIGSAILSAETALE
LLDTELLEKTTIIDAPLVEGAFTAAVQSLVNPSKEAILQELTNVH
>P37349 2.7.1.121~~~dhaM~~~PEP-dependent dihydroxyacetone kinase, phosphoryl donor subunit DhaM~~~COG1080
MVNLVIVSHSSRLGEGVGELARQMLMSDSCKIAIAAGIDDPQNPIGTDAVKVMEAIESVADADHVLVMMDMGSALLSAET
ALELLAPEIAAKVRLCAAPLVEGTLAATVSAASGADIDKVIFDAMHALEAKREQLGLPSSDTEISDTCPAYDEEARSLAV
VIKNRNGLHVRPASRLVYTLSTFNADMLLEKNGKCVTPESINQIALLQVRYNDTLRLIAKGPEAEEALIAFRQLAEDNFG
ETEEVAPPTLRPVPPVSGKAFYYQPVLCTVQAKSTLTVEEEQDRLRQAIDFTLLDLMTLTAKAEASGLDDIAAIFSGHHT
LLDDPELLAAASELLQHEHCTAEYAWQQVLKELSQQYQQLDDEYLQARYIDVDDLLHRTLVHLTQTKEELPQFNSPTILL
AENIYPSTVLQLDPAVVKGICLSAGSPVSHSALIARELGIGWICQQGEKLYAIQPEETLTLDVKTQRFNRQG
>Q9CIV6 2.7.1.121~~~dhaM~~~PTS-dependent dihydroxyacetone kinase, phosphotransferase subunit DhaM~~~COG3412
MTYGIVIVSHSPEIASGLKKLIREVAKNISLTAIGGLENGEIGTSFDRVMNAIEENEADNLLTFFDLGSARMNLDLVSEM
TDKELTIFNVPLIEGAYTASALLEAGATFEAIKEQLEKMLIEK
>P17201 1.2.5.2~~~~~~Membrane-bound aldehyde dehydrogenase [pyrroloquinoline-quinone]~~~
MGRLNRFRLGKDGRREQASLSRRGFLVTSLGAGVMFGFARPSSANQIFPLDRSLPGDGAFEPTIWCSIAPDGEITVNIIR
AEMGQHIGTALARIIADEMEADWSKVRINYVDTDPKWGLMVTGGSWSVWMTWDVFRQAGAATRTAMVEEGARLLGTTPDK
CTVASSIVSAGGKQISFGDIVAKGHPSHAFTPEEMAKLPLKPASERRLIGNAELKALDIPAKTNGTAIYGIDAKVEGMLY
GRPKMPPTRYGSKVRSVDDTEAKKIKGYVRYLLIDDPSQVVQGWVVVLAESYSAAIRATDALKVEWTPGETIHTSERDIQ
DRGRELINNKAGGVYIFNDDGVDQAFGSAHTVMDQEYTCASVLHYQLEPTNALAFEKDGVYEIHAGNQWQSLILPTLAKS
LQVPESKVILRSYLLGGGFGRRLNGDYMIPAALASKALGGKPVKLILTRSDDMQFDSFRSPSVQRVRMAFDASDRITAMD
YQAAAGWPTGVMAEAFMEKGVDGKPYDQFAIAGGDHWYEVGAFRVRALRNDLAEKTFRPGWLRSVSPGWTSWGVECFLDE
VAHRQKKDPAQFRLELLTGQGRNKGQAPDSVGGALRQAAVVRRLMEKVNWGKTSLPKDTAMGLATTAGQERGMPTWDRCV
AQVHVDRSTGVVTCQKLTILVDAGTVVDPDGAKAQTEGAALWGLSMVLFENTEIVNGMPVDRNLNTYTPLRIADTPEMDI
EFLPSTEKPMGLGEPGTTVVGPAIGNAIFNAVGVRLRHMPVRPADVLRGLQNG
>Q9CIW0 ~~~dhaQ~~~DhaKLM operon coactivator DhaQ~~~COG2376
MKFYNSTNEIPEEMLKGIDLTYPQLTYLPETGILYDNTYNEKTVPIISGGGSGHEPAHVGYVGSGMLAAAVTGPLFIPPK
SKNILKAIRQVNSGKGVFVIIKNFEADLKEFNEAIKEARTEGIDVRYIVSHDDISVNAYNFHKRHRGVAGTILLHKILGA
FAKEGGSIDEIEQLALSLSPEIYTLGVALAPVHFPHQKTSFVLAEDEVSFGIGIHGEPGYRVEKFEGSERIAIELVNKLK
AEINWQKKANKNYILLVNGLGSTTLMELYSFQYDVMRLLELEGLSVKFCKVGNLMTSCDMSGISLTLCSVKDPKWLDYLN
VPTGAFAW
>P76016 ~~~dhaR~~~PTS-dependent dihydroxyacetone kinase operon regulatory protein~~~COG3284
MSGAFNNDGRGISPLIATSWERCNKLMKRETWNVPHQAQGVTFASIYRRKKAMLTLGQAALEDAWEYMAPRECALFILDE
TACILSRNGDPQTLQQLSALGFNDGTYCAEGIIGTCALSLAAISGQAVKTMADQHFKQVLWNWAFCATPLFDSKGRLTGT
IALACPVEQTTAADLPLTLAIAREVGNLLLTDSLLAETNRHLNQLNALLESMDDGVISWDEQGNLQFINAQAARVLRLDA
TASQGRAITELLTLPAVLQQAIKQAHPLKHVEATFESQHQFIDAVITLKPIIETQGTSFILLLHPVEQMRQLMTSQLGKV
SHTFAHMPQDDPQTRRLIHFGRQAARSSFPVLLCGEEGVGKALLSQAIHNESERAAGPYIAVNCELYGDAALAEEFIGGD
RTDNENGRLSRLELAHGGTLFLEKIEYLAVELQSALLQVIKQGVITRLDARRLIPIDVKVIATTTADLAMLVEQNRFSRQ
LYYALHAFEITIPPLRMRRGSIPALVNNKLRSLEKRFSTRLKIDDDALARLVSCAWPGNDFELYSVIENLALSSDNGRIR
VSDLPEHLFTEQATDDVSATRLSTSLSFAEVEKEAIINAAQVTGGRIQEMSALLGIGRTTLWRKMKQHGIDAGQFKRRV
>Q9KQG2 1.2.1.11~~~asd1~~~Aspartate-semialdehyde dehydrogenase 1~~~COG0136
MRVGLVGWRGMVGSVLMQRMVEERDFDLIEPVFFSTSQIGVPAPNFGKDAGMLHDAFDIESLKQLDAVITCQGGSYTEKV
YPALRQAGWKGYWIDAASTLRMDKEAIITLDPVNLKQILHGIHHGTKTFVGGNCTVSLMLMALGGLYERGLVEWMSAMTY
QAASGAGAQNMRELISQMGVINDAVSSELANPASSILDIDKKVAETMRSGSFPTDNFGVPLAGSLIPWIDVKRDNGQSKE
EWKAGVEANKILGLQDSPVPIDGTCVRIGAMRCHSQALTIKLKQNIPLDEIEEMIATHNDWVKVIPNERDITARELTPAK
VTGTLSVPVGRLRKMAMGDDFLNAFTVGDQLLWGAAEPLRRTLRIILAEK
>P23247 1.2.1.11~~~asd2~~~Aspartate-semialdehyde dehydrogenase 2~~~COG0136
MSQQFNVAIFGATGAVGETMLEVLQEREFPVDELFLLASERSEGKTYRFNGKTVRVQNVEEFDWSQVHIALFSAGGELSA
KWAPIAAEAGVVVIDNTSHFRYDYDIPLVVPEVNPEAIAEFRNRNIIANPNCSTIQMLVALKPIYDAVGIERINVTTYQS
VSGAGKAGIDELAGQTAKLLNGYPAETNTFSQQIAFNCIPQIDQFMDNGYTKEEMKMVWETQKIFNDPSIMVNPTCVRVP
VFYGHAEAVHVETRAPIDAEQVMDMLEQTDGIELFRGADFPTQVRDAGGKDHVLVGRVRNDISHHSGINLWVVADNVRKG
AATNAVQIAELLVRDYF
>Q04797 1.2.1.11~~~asd~~~Aspartate-semialdehyde dehydrogenase~~~COG0136
MGRGLHVAVVGATGAVGQQMLKTLEDRNFEMDTLTLLSSKRSAGTKVTFKGQELTVQEASPESFEGVNIALFSAGGSVSQ
ALAPEAVKRGAIVIDNTSAFRMDENTPLVVPEVNEADLHEHNGIIANPNCSTIQMVAALEPIRKAYGLNKVIVSTYQAVS
GAGNEAVKELYSQTQAILNKEEIEPEIMPVKGDKKHYQIAFNAIPQIDKFQDNGYTFEEMKMINETKKIMHMPDLQVAAT
CVRLPIQTGHSESVYIEIDRDDATVEDIKNLLKEAPGVTLQDDPSQQLYPMPADAVGKNDVFVGRIRKDLDRANGFHLWV
VSDNLLKGAAWNSVQIAESLKKLNLV
>P0A9Q9 1.2.1.11~~~asd~~~Aspartate-semialdehyde dehydrogenase~~~COG0136
MKNVGFIGWRGMVGSVLMQRMVEERDFDAIRPVFFSTSQLGQAAPSFGGTTGTLQDAFDLEALKALDIIVTCQGGDYTNE
IYPKLRESGWQGYWIDAASSLRMKDDAIIILDPVNQDVITDGLNNGIRTFVGGNCTVSLMLMSLGGLFANDLVDWVSVAT
YQAASGGGARHMRELLTQMGHLYGHVADELATPSSAILDIERKVTTLTRSGELPVDNFGVPLAGSLIPWIDKQLDNGQSR
EEWKGQAETNKILNTSSVIPVDGLCVRVGALRCHSQAFTIKLKKDVSIPTVEELLAAHNPWAKVVPNDREITMRELTPAA
VTGTLTTPVGRLRKLNMGPEFLSAFTVGDQLLWGAAEPLRRMLRQLA
>P44801 1.2.1.11~~~asd~~~Aspartate-semialdehyde dehydrogenase~~~COG0136
MKNVGFIGWRGMVGSVLMDRMSQENDFENLNPVFFTTSQAGQKAPVFGGKDAGDLKSAFDIEELKKLDIIVTCQGGDYTN
EVYPKLKATGWDGYWVDAASALRMKDDAIIVLDPVNQHVISEGLKKGIKTFVGGNCTVSLMLMAIGGLFEKDLVEWISVA
TYQAASGAGAKNMRELLSQMGLLEQAVSSELKDPASSILDIERKVTAKMRADNFPTDNFGAALGGSLIPWIDKLLPETGQ
TKEEWKGYAETNKILGLSDNPIPVDGLCVRIGALRCHSQAFTIKLKKDLPLEEIEQIIASHNEWVKVIPNDKEITLRELT
PAKVTGTLSVPVGRLRKLAMGPEYLAAFTVGDQLLWGAAEPVRRILKQLVA
>Q9CIV9 ~~~dhaS~~~HTH-type dhaKLM operon transcriptional activator DhaS~~~COG1309
MSAFFLNMKKSIITQKIIAKAFKDLMQSNAYHQISVSDIMQTAKIRRQTFYNYFQNQEELLSWIFENDFAELINDNSDYY
GWQNELLLLLRYLDENQIFYQKIFVIDKNFEHFFLIQWENLLDKVIFDQEKKSDYHWSDLEKSFICRYNAAAICAITRES
IIRGNSLEKLYSQIVNLLLAQIKIFES
>P9WNX5 1.2.1.11~~~asd~~~Aspartate-semialdehyde dehydrogenase~~~COG0136
MGLSIGIVGATGQVGQVMRTLLDERDFPASAVRFFASARSQGRKLAFRGQEIEVEDAETADPSGLDIALFSAGSAMSKVQ
APRFAAAGVTVIDNSSAWRKDPDVPLVVSEVNFERDAHRRPKGIIANPNCTTMAAMPVLKVLHDEARLVRLVVSSYQAVS
GSGLAGVAELAEQARAVIGGAEQLVYDGGALEFPPPNTYVAPIAFNVVPLAGSLVDDGSGETDEDQKLRFESRKILGIPD
LLVSGTCVRVPVFTGHSLSINAEFAQPLSPERARELLDGATGVQLVDVPTPLAAAGVDESLVGRIRRDPGVPDGRGLALF
VSGDNLRKGAALNTIQIAELLTADL
>Q51344 1.2.1.11~~~asd~~~Aspartate-semialdehyde dehydrogenase~~~
MKRVGLIGWRGMVGSVLMQRMLEERDFDLIEPVFFTTSNVGGQGPEVGKDIAPLKDAYSIDELKTLDVILTCQGGDYTSE
VFPKLREAGWQGYWIDAASSLRMEDDAVIVLDPVNRKVIDQALDAGTRNYIGGNCTVSLMLMALGGLFDAGLVEWMSAMT
YQAASGAGAQNMRELLKQMGAAHASVADDLANPASAILDIDRKVAETLRSEAFPTEHFGAPLGGSLIPWIDKELPNGQSR
EEWKAQAETNKILARFKNPIPVDGICVRVGAMRCHSQALTIKLNKDVPLTDIEGLISQHNPWVKLVPNHREVSVRELTPA
AVTGTLSVPVGRLRKLNMGSQYLGAFTVGDQLLWGAAEPLRRMLRILLER
>P0A1F8 1.2.1.11~~~asd~~~Aspartate-semialdehyde dehydrogenase~~~
MKNVGFIGWRGMVGSVLMQRMVEERDFDAIRPVFFSTSQFGQAAPTFGDTSTGTLQDAFDLDALKALDIIVTCQGGDYTN
EIYPKLRESGWQGYWIDAASTLRMKDDAIIILDPVNQDVITDGLNNGVKTFVGGNCTVSLMLMSLGGLFAHNLVDWVSVA
TYQAASGGGARHMRELLTQMGQLYGHVADELATPSSAILDIERKVTALTRSGELPVDNFGVPLAGSLIPWIDKQLDNGQS
REEWKGQAETNKILNTASVIPVDGLCVRVGALRCHSQAFTIKLKKEVSIPTVEELLAAHNPWAKVVPNDRDITMRELTPA
AVTGTLTTPVGRLRKLNMGPEFLSAFTVGDQLLWGAAEPLRRMLRQLA
>P45513 1.1.1.202~~~dhaT~~~1,3-propanediol dehydrogenase~~~
MSYRMFDYLVPNVNFFGPNAISVVGERCKLLGGKKALLVTDKGLRAIKDGAVDKTLTHLREAGIDVVVFDGVEPNPKDTN
VRDGLEVFRKEHCDIIVTVGGGSPHDCGKGIGIAATHEGDLYSYAGIETLTNPLPPIVAVNTTAGTASEVTRHCVLTNTK
TKVKFVIVSWRNLPSVSINDPLLMLGKPAPLTAATGMDALTHAVEAYISKDANPVTDAAAIQAIRLIARNLRQAVALGSN
LKARENMAYASLLAGMAFNNANLGYVHAMAHQLGGLYDMPHGVANAVLLPHVARYNLIANPEKFADIAEFMGENTDGLST
MDAAELAIHAIARLSADIGIPQHLRDLGVKEADFPYMAEMALKDGNAFSNPRKGNEKEIAEIFRQAF
>Q59477 1.1.1.202~~~dhaT~~~1,3-propanediol dehydrogenase~~~
MSYRMFDYLVPNVNFFGPNAISVVGERCQLLGGKKALLVTDKGLRAIKDGAVDKTLHYLREAGIEVAIFDGVEPNPKDTN
VRDGLAVFRREQCDIIVTVGGGSPHDCGKGIGIAATHEGDLYQYAGIETLTNPLPPIVAVNTTAGTASEVTRHCVLTNTE
TKVKFVIVSWRNLPSVSINDPLLMIGKPAALTAATGMDALTHAVEAYISKDANPVTDAAAMQAIRLIARNLRQAVALGSN
LQARENMAYASLLAGMAFNNANLGYVHAMAHQLGGLYDMPHGVANAVLLPHVARYNLIANPEKFADIAELMGENITGLST
LDAAEKAIAAITRLSMDIGIPQHLRDLGVKEADFPYMAEMALKDGNAFSNPRKGNEQEIAAIFRQAF
>Q08352 1.4.1.1~~~ald~~~Alanine dehydrogenase~~~COG0686
MIIGVPKEIKNNENRVALTPGGVSQLISNGHRVLVETGAGLGSGFENEAYESAGAEIIADPKQVWDAEMVMKVKEPLPEE
YVYFRKGLVLFTYLHLAAEPELAQALKDKGVTAIAYETVSEGRTLPLLTPMSEVAGRMAAQIGAQFLEKPKGGKGILLAG
VPGVSRGKVTIIGGGVVGTNAAKMAVGLGADVTIIDLNADRLRQLDDIFGHQIKTLISNPVNIADAVAEADLLICAVLIP
GAKAPTLVTEEMVKQMKPGSVIVDVAIDQGGIVETVDHITTHDQPTYEKHGVVHYAVANMPGAVPRTSTIALTNVTVPYA
LQIANKGAVKALADNTALRAGLNTANGHVTYEAVARDLGYEYVPAEKALQDESSVAGA
>E5Y944 1.4.1.1~~~ald~~~Alanine dehydrogenase~~~COG0686
MRVGIPTEIKVQEFRVGITPAGVHALKEAGHTVLVQKGAGLGSMITDEEYVAAGAQMVATAKECWDCDMVVKVKEPLAPE
YDLFHEGLILYTYLHLAPEPALTKALLEKKVIGIAYETVQFDNGFLPLLAPMSEVAGRMATQVGAQMLTKIEGGMGLLMG
GTAGVQAAHVVILGAGTVGLSAAKVAMGMGARVTILDSNLFRLRQIDDLFGGRIQTLASNAFNIAAATKDADLLVGSVLI
PGALTPKLVTEAMVKTMKPGSAIVDVAIDQGGCIEPTAKHGATYHDKPTFKYPVNGGEVVCYSVGNMPGAVARTSTFTLT
NATMPYMVDLANKGWKKACQDDKALARGINTYDGKVYFKGVSDALGYELHCTCDILK
>Q9AIK2 1.4.1.1~~~ald~~~Alanine dehydrogenase~~~
MRVGIPTEIKVQEFRVGITPAGVHALKEAGHTVLVQKGAGLGSMITDEEYVAAGAQMVATAKECWDCDMVVKVKEPLAPE
YDLFHEGLILYTYLHLAPEPALTKALLEKKVIGIAYETVQFDNGFLPLLAPMSEVAGRMATQVGAQMLTKIEGGMGLLMG
GTAGVQAAHVVILGAGTVGLSAAKVAMGMGARVTILDSNLFRLRQIDDLFGGRIQTLASNAFNIAAATKDADLLVGSVLI
PGALTPKLVTEAMVKTMKPGSAIVDVAIDQGGCIEPTAKHGATYHDKPTFKYPVNGGEVVCYSVGNMPGAVARTSTFTLT
NATMPYMVDLANKGWKKACQDDKALARGINTYDGKVYFKGVSDALGYELHCTCDILK
>P17557 1.4.1.1~~~ald~~~Alanine dehydrogenase~~~
MKIGIPKEIKNNENRVAITPAGVMTLVKAGHEVYVETEGGAGSGFSDSEYEKAGAADRCRTWRDAWTAEMVLKVKEPLAR
EFRYFRPGLILFTYLHLAAAERVTKAVVEQKVVGIAYETVQLANGSLPLLTPMSEVAGRMSVQVGAQFLEKPHGGKGILL
GGVPGVRRGKVTIIGGGTAGTNAAKIGVGLGADVTILDINAERLRELDDLFGDHVTTLMSNSYHIAECVRESDLVVGAVL
IPGAKAKLVTEEMVRSMTPGSVLVDIAIDQGGIFETTDRVTTHDDPTYVKHGVVHYAVANMPGAVPRTSTFALTNVTIPY
ALQIANKGYRAGCLDNPALLKGINTLDGHIVYEAVAAAHNMPYTDVHSLLHG
>E1V931 1.4.1.1~~~ald~~~Alanine dehydrogenase~~~COG0686
MKIAVPKEIKNHEYRVALTPSGARELVGRGHDVIVQAAAGEGAGFSDADFEAAGARLEADVAKLWDDAELILKVKEPQAE
EVARLSAGQTLFTYLHLAAEESLTKGLLDSGATCIAYETITAPEGGLPLLAPMSTVAGRMAVQAGAHSLEKAQGGAGILL
PGVPGVAPARVTVIGGGVVGENAARMALGLGAEVTVLDKSIPRLETLDDRYQGRMKTVFSTADALEEAVRESDLIIGAVL
VPGAAAPKLITRDMLSDMKPGSVLVDVAIDQGGCFETSKPTTHAEPTYVVDGVVHYCVANMPGAVARTSTQALTNATLPF
VVALADKGWQKALADDDHFAAGLNVHDGKLTYRAVAEAFGLEYVEAASLIG
>P17556 1.4.1.1~~~ald~~~Alanine dehydrogenase~~~
MKIGIPKEIKNNENRVAMTPAGVVSLTHAGHERLAIETGGGIGSSFTDAEYVAAGAAYRCIGKEAWAQEMILKVKEPVAS
EYDYFYEGQILFTYLHLAPRAELTQALIDKKVVGIAYETVQLANGSLPLLTPMSEVAGKMATQIGAQYLEKNHGGKGILL
GGVSGVHARKVTVIGGGIAGTNAAKIAVGMGADVTVIDLSPERLRQLEDMFGRDVQTLMSNPYNIAESVKHSDLVVGAVL
IPGAKAPKLVSEEMIQSMQPGSVVVDIAIDQGGIFATSDRVTTHDDPTYVKHGVVHYAVANMPGAVPRTSTIALTNNTIP
YALQIANKGYKQACIDNPALKKGVNALEGHITYKAVAEAQGLPYVNVDELIQ
>A0QVQ8 1.4.1.1~~~ald~~~Alanine dehydrogenase~~~COG0686
MLVGIPTEIKNNEYRVAITPAGVAELTRRGHEVIIQAGAGEGSAISDRDFKAAGAEIVNTADQVWSEAELLLKVKEPIEP
EYSRMRKGQTLFTYLHLAASKPCTDALLASGTTSIAYETVQTAEGALPLLAPMSEVAGRLSAQVGAYHLMRSYGGRGVLM
GGVPGVAPAEVVVIGAGTAGYNAARVAAGMGAHVTVFDLNINTLRRVDGEFGGRIETRYSSSLELEEAVKKADLVIGAVL
VPGAKAPKLVTNSTVAHMKPGAVLVDIAIDQGGCFEDSRPTTHDEPTFKVHDTIFYCVANMPGAVPRTSTFALTNSTMPY
VLKLADKGWQAACASDSALAKGLSTHDGKLLSEAVAKDLDLPFTDAAQFLA
>P9WQB1 1.4.1.1~~~ald~~~Alanine dehydrogenase~~~COG0686
MRVGIPTETKNNEFRVAITPAGVAELTRRGHEVLIQAGAGEGSAITDADFKAAGAQLVGTADQVWADADLLLKVKEPIAA
EYGRLRHGQILFTFLHLAASRACTDALLDSGTTSIAYETVQTADGALPLLAPMSEVAGRLAAQVGAYHLMRTQGGRGVLM
GGVPGVEPADVVVIGAGTAGYNAARIANGMGATVTVLDINIDKLRQLDAEFCGRIHTRYSSAYELEGAVKRADLVIGAVL
VPGAKAPKLVSNSLVAHMKPGAVLVDIAIDQGGCFEGSRPTTYDHPTFAVHDTLFYCVANMPASVPKTSTYALTNATMPY
VLELADHGWRAACRSNPALAKGLSTHEGALLSERVATDLGVPFTEPASVLA
>P39071 1.3.1.28~~~dhbA~~~2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase~~~COG1028
MNAKGIEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEGRHAEAFPADVRDSAAIDEITARIE
REMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSVSKYMMDRRSGSIVTVGSNAAGVPRTSMAAYASS
KAAAVMFTKCLGLELAEYNIRCNIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVS
GQAGHITMHNLCVDGGATLGV
>P45743 3.3.2.1~~~dhbB~~~Isochorismatase~~~COG1535
MAIPAIQPYQMPTASDMPQNKVSWVPDPNRAVLLIHDMQNYFVDAFTAGASPVTELSANIRKLKNQCVQLGIPVVYTAQP
GSQNPDDRALLTDFWGPGLNSGPYEEKIITELAPEDDDLVLTKWRYSAFKRTNLLEMMRKEGRDQLIITGIYAHIGCLVT
ACEAFMEDIKAFFVGDAVADFSLEKHQMALEYAAGRCAFTVMTDSLLDQLQNAPADVQKTSANTGKKNVFTCENIRKQIA
ELLQETPEDITDQEDLLDRGLDSVRIMTLVEQWRREGAEVTFVELAERPTIEEWQKLLTTRSQQVLPNADYL
>P45744 5.4.4.2~~~dhbC~~~Isochorismate synthase DhbC~~~COG1169
MLDQNVITETKAEHLLHEYQPGAFFLASPHRVLLAKGICEIVPEADGQNQMETLSGRIAEALRQAKQSGQSRPLVVGAVP
FDQVKAARLVVPEEVRWSGPLQFDHEEKEQQAGHTYHIKPVPEPEDYKNGVEQGLARIADGTLSKIVLSRSLHLTSPEPI
QTDELLRHLAQHNSHGYTFAADVSSQEETSPRRTLLGASPELLVSRMGTQVVSNPLAGSRPRSNDPVEDQRRAAELLSSA
KDLHEHAVVADAVAAALRPFCRTLEVPEKPSLIKTETMWHLSSVIKGELSDPSVTALELAAALHPTPAVCGTPTDLAREA
ILSIEPFDRGFFTGMVGWCDDAGDGEWIVTIRCAEAEERSLRLYAGAGVVAGSKPEDELQETSAKFRTMLRAMGVDHI
>P40871 6.2.1.71~~~dhbE~~~2,3-dihydroxybenzoate-AMP ligase~~~COG1021
MLKGFTPWPDELAETYRKNGCWAGETFGDLLRDRAAKYGDRIAITCGNTHWSYRELDTRADRLAAGFQKLGIQQMDRVVV
QLPNIKEFFEVIFALFRLGALPVFALPSHRSSEITYFCEFAEAAAYIIPDAYSGFDYRSLARQVQSKLPTLKNIIVAGEA
EEFLPLEDLHAEPVKLPEVKSSDVAFLQLSGGSTGLSKLIPRTHDDYIYSLKRSVEVCWLDHSTVYLAALPMAHNYPLSS
PGVLGVLYAGGRVVLSPSPSPDDAFPLIEREKVTITALVPPLAMVWMDAASSRRDDLSSLQVLQVGGAKFSAEAARRVKA
VFGCTLQQVFGMAEGLVNYTRLDDPEEIIVNTQGKPMSPYDEMRVWDDHDRDVKPGETGHLLTRGPYTIRGYYKAEEHNA
ASFTEDGFYRTGDIVRLTRDGYIVVEGRAKDQINRGGEKVAAEEVENHLLAHPAVHDAAMVSMPDQFLGERSCVFIIPRD
EAPKAAELKAFLRERGLAAYKIPDRVEFVESFPQTGVGKVSKKALREAISEKLLAGFKK
>P45745 6.2.1.66~~~dhbF~~~Dimodular nonribosomal peptide synthase~~~COG1020
MPDTKDLQYSLTGAQTGIWFAQQLDPDNPIYNTAEYIEINGPVNIALFEEALRHVIKEAESLHVRFGENMDGPWQMINPS
PDVQLHVIDVSSEPDPEKTALNWMKADLAKPVDLGYAPLFNEALFIAGPDRFFWYQRIHHIAIDGFGFSLIAQRVASTYT
ALIKGQTAKSRSFGSLQAILEEDTDYRGSEQYEKDRQFWLDRFADAPEVVSLADRAPRTSNSFLRHTAYLPPSDVNALKE
AARYFSGSWHEVMIAVSAVYVHRMTGSEDVVLGLPMMGRIGSASLNVPAMVMNLLPLRLTVSSSMSFSELIQQISREIRS
IRRHHKYRHEELRRDLKLIGENHRLFGPQINLMPFDYGLDFAGVRGTTHNLSAGPVDDLSINVYDRTDGSGLRIDVDANP
EVYSESDIKLHQQRILQLLQTASAGEDMLIGQMELLLPEEKEKVISKWNETAKSEKLVSLQDMFEKQAVLTPERIALMCD
DIQVNYRKLNEEANRLARLLIEKGIGPEQFVALALPRSPEMVASMLGVLKTGAAYLPLDPEFPADRISYMLEDAKPSCII
TTEEIAASLPDDLAVPELVLDQAVTQEIIKRYSPENQDVSVSLDHPAYIIYTSGSTGRPKGVVVTQKSLSNFLLSMQEAF
SLGEEDRLLAVTTVAFDISALELYLPLISGAQIVIAKKETIREPQALAQMIENFDINIMQATPTLWHALVTSEPEKLRGL
RVLVGGEALPSGLLQELQDLHCSVTNLYGPTETTIWSAAAFLEEGLKGVPPIGKPIWNTQVYVLDNGLQPVPPGVVGELY
IAGTGLARGYFHRPDLTAERFVADPYGPPGTRMYRTGDQARWRADGSLDYIGRADHQIKIRGFRIELGEIDAVLANHPHI
EQAAVVVREDQPGDKRLAAYVVADAAIDTAELRRYMGASLPDYMVPSAFVEMDELPLTPNGKLDRKALPAPDFSTSVSDR
APRTPQEEILCDLFAEVLGLARVGIDDSFFELGGHSLLAARLMSRIREVMGAELGIAKLFDEPTVAGLAAHLDLAQSACP
ALQRAERPEKIPLSFAQRRLWFLHCLEGPSPTYNIPVAVRLSGELDQGLLKAALYDLVCRHESLRTIFPESQGTSYQHIL
DADRACPELHVTEIAEKELSDRLAEAVRYSFDLAAEPAFRAELFVIGPDEYVLLLLVHHIVGDGWSLTPLTRDLGTAYAA
RCHGRSPEWAPLAVQYADYALWQQELLGNEDDPNSLIAGQLAFWKETLKNLPDQLELPTDYSRPAEPSHDGDTIHFRIEP
EFHKRLQELARANRVSLFMVLQSGLAALLTRLGAGTDIPIGSPIAGRNDDALGDLVGLFINTLVLRTDTSGDPSFRELLD
RVREVNLAAYDNQDLPFERLVEVLNPARSRATHPLFQIMLAFQNTPDAELHLPDMESSLRINSVGSAKFDLTLEISEDRL
ADGTPNGMEGLLEYSTDLFKRETAQALADRLMRLLEAAESDPDEQIGNLDILAPEEHSSMVTDWQSVSEKIPHACLPEQF
EKQAALRPDAIAVVYENQELSYAELNERANRLARMMISEGVGPEQFVALALPRSLEMAVGLLAVLKAGAAYLPLDPDYPA
DRIAFMLKDAQPAFIMTNTKAANHIPPVENVPKIVLDDPELAEKLNTYPAGNPKNKDRTQPLSPLNTAYVIYTSGSTGVP
KGVMIPHQNVTRLFAATEHWFRFSSGDIWTMFHSYAFDFSVWEIWGPLLHGGRLVIVPHHVSRSPEAFLRLLVKEGVTVL
NQTPSAFYQFMQAEREQPDLGQALSLRYVIFGGEALELSRLEDWYNRHPENRPQLINMYGITETTVHVSYIELDRSMAAL
RANSLIGCGIPDLGVYVLDERLQPVPPGVAGELYVSGAGLARGYLGRPGLTSERFIADPFGPPGTRMYRTGDVARLRADG
SLDYVGRADHQVKIRGFRIELGEIEAALVQHPQLEDAAVIVREDQPGDKRLAAYVIPSEETFDTAELRRYAAERLPDYMV
PAAFVTMKELPLTPNGKLDRKALPAPDFAAAVTGRGPRTPQEEILCDLFMEVLHLPRVGIDDRFFDLGGHSLLAVQLMSR
IREALGVELSIGNLFEAPTVAGLAERLEMGSSQSALDVLLPLRTSGDKPPLFCVHPAGGLSWCYAGLMTNIGTDYPIYGL
QARGIGQREELPKTLDDMAADYIKQIRTVQPKGPYHLLGWSLGGNVVQAMATQLQNQGEEVSLLVMLDAYPNHFLPIKEA
PDDEEALIALLALGGYDPDSLGEKPLDFEAAIEILRRDGSALASLDETVILNLKNTYVNSVGILGSYKPKTFRGNVLFFR
STIIPEWFDPIEPDSWKPYINGQIEQIDIDCRHKDLCQPEPLAQIGKVLAVKLEELNK
>P17584 1.1.1.-~~~~~~D-2-hydroxyisocaproate dehydrogenase~~~
MKIIAYGARVDEIQYFKQWAKDTGNTLEYHTEFLDENTVEWAKGFDGINSLQTTPYAAGVFEKMHAYGIKFLTIRNVGTD
NIDMTAMKQYGIRLSNVPAYSPAAIAEFALTDTLYLLRNMGKVQAQLQAGDYEKAGTFIGKELGQQTVGVMGTGHIGQVA
IKLFKGFGAKVIAYDPYPMKGDHPDFDYVSLEDLFKQSDVIDLHVPGIEQNTHIINEAAFNLMKPGAIVINTARPNLIDT
QAMLSNLKSGKLAGVGIDTYEYETEDLLNLAKHGSFKDPLWDELLGMPNVVLSPHIAYYTETAVHNMVYFSLQHLVDFLT
KGETSTEVTGPAK
>P94316 1.4.1.2~~~gdhB~~~NAD-specific glutamate dehydrogenase~~~
MNIEKIMSSLEAKHPGESEYLQAVKEVLLSIEDIYNQHPEFEKSKIIERLVEPDRIFTFRVTWVDDKGEVQTNLGYRVQF
NNAIGPYKGGIRFHASVNLSILKFLGFEQTFKNALTTLPMGGGKGGSDFSPRGKSDAEIMRFCQAFMLELWRHLGPDMDV
PAGDIGVGGREVGYMFGMYKKLTREFTGTFTGKGLEFGGSLIRPEATGFGGLYFVNQMLQTKGIDIKGKTVAISGFGNVA
WGAATKATELGAKVVTISGPDGYIYDPNGISGEKIDYMLELRASGNDIVAPYADEFPGSTFVAGKRPWEVKADIALPCAT
QNELNGEDAKNLIDNNVLCVGEISNMGCTPEAIDLFIEHKTMYAPGKAVNAGGVATSGLEMSQNAMHLSWSAAEVDEKLH
SIMHGIHAQCVKYGTEPDGYINYVKGANIAGFMKVAHAMMGQGII
>P39633 1.4.1.2~~~rocG~~~Catabolic NAD-specific glutamate dehydrogenase RocG~~~COG0334
MSAKQVSKDEEKEALNLFLSTQTIIKEALRKLGYPGDMYELMKEPQRMLTVRIPVKMDNGSVKVFTGYRSQHNDAVGPTK
GGVRFHPEVNEEEVKALSIWMTLKCGIANLPYGGGKGGIICDPRTMSFGELERLSRGYVRAISQIVGPTKDIPAPDVYTN
SQIMAWMMDEYSRLREFDSPGFITGKPLVLGGSQGRETATAQGVTICIEEAVKKKGIKLQNARIIIQGFGNAGSFLAKFM
HDAGAKVIGISDANGGLYNPDGLDIPYLLDKRDSFGMVTNLFTDVITNEELLEKDCDILVPAAISNQITAKNAHNIQASI
VVEAANGPTTIDATKILNERGVLLVPDILASAGGVTVSYFEWVQNNQGYYWSEEEVAEKLRSVMVSSFETIYQTAATHKV
DMRLAAYMTGIRKSAEASRFRGWV
>P24295 1.4.1.2~~~gdh~~~NAD-specific glutamate dehydrogenase~~~COG0334
MSKYVDRVIAEVEKKYADEPEFVQTVEEVLSSLGPVVDAHPEYEEVALLERMVIPERVIEFRVPWEDDNGKVHVNTGYRV
QFNGAIGPYKGGLRFAPSVNLSIMKFLGFEQAFKDSLTTLPMGGAKGGSDFDPNGKSDREVMRFCQAFMTELYRHIGPDI
DVPAGDLGVGAREIGYMYGQYRKIVGGFYNGVLTGKARSFGGSLVRPEATGYGSVYYVEAVMKHENDTLVGKTVALAGFG
NVAWGAAKKLAELGAKAVTLSGPDGYIYDPEGITTEEKINYMLEMRASGRNKVQDYADKFGVQFFPGEKPWGQKVDIIMP
CATQNDVDLEQAKKIVANNVKYYIEVANMPTTNEALRFLMQQPNMVVAPSKAVNAGGVLVSGFEMSQNSERLSWTAEEVD
SKLHQVMTDIHDGSAAAAERYGLGYNLVAGANIVGFQKIADAMMAQGIAW
>E1V4J5 1.4.1.2~~~gdh~~~NAD-specific glutamate dehydrogenase~~~COG2902
MLHVAQEEARLDLLKQLKERLQSRLDKDKAAEVDTFAHLFYAAVPLEDLADRRLDDLYGATLSVWHFIQQFDPEAPKVRV
LNPDFEEHGWQSTHTFIAVLHEDMPFLVDSVRVELNRRGMTVHAIHNAVLAVGRDDEHRLQRVASPEETDAPEARESLIA
IEVDRHSNPAELEEIEASLLEVLREVRTAVSDFDPMRAQARAAIEELEATRPAQVDPADHREAIEFLQWLLQDNFTFLGY
DEYEVREDQGRQRLDKVQNSELGVFRLDQPRYRERIRTDLGVEGDHYVPMPQLMSFAKSAHHARIHRPTYPDYISIDRYD
DQGRVIGERRFLGMFTATVYNESPRNVPILRRKLQAVMDIAGFSPKGHNGKQLLQILEVYPRDDLFQIDIEELAQTALGI
LDIRERRRVRLFIREDTFGKFYSCLVFVPRDVFSTELRVRLQELLCEELDATFGDFNTYLSESVLARIQFILRFNGEKPV
EYDIKRLEEKLVKLARNWRDDLLNASIEGFGEESANLLMSRFRDAFPASYREDFSARTAVYDLQHIGELDEGAPLALSLY
RLIEEEGSGVNLKLFHRGAPIPLSDVLPMMENLGLRVIGERPYEVQASDASYWIHDFNLEHHTSVEMNLQEMRGPFIEAF
QRIWAGEADNDAFNRLIIGANLDWREVAMLRAYARYLKQIRFGMSQDYIATTLGSHPEITRELVSLFELRFDPAERPGEG
DIEECESRILTLLDEVPSLNDDQLLRRYMELIKATLRTNYYQRTEEGRYKDYLAFKLDPSQVSGIPKPCPAYEIFVCSPR
VEGVHLRGGKVARGGLRWSDRHEDFRTEVLGLVKAQQVKNAVIVPMGAKGGFVCKRMPEGADREATQKEGIACYQIFIRA
LLDVTDNLVGGEVVPPRDVVRHDDNDPYLVVAADKGTATFSDIANEISTEYGHWLGDAFASGGANGYDHKKMAITAKGAW
ESVKRHFRGLGVNTQEDEFSVVGIGDMAGDVFGNGMLLSDKIRLVGAFNHLHIFVDPTPDAAASFAERQRLFDMPRSSWE
DYNTELISEGGGIFPRSAKSITITPQMKKVFGIREDKLSPNELIRAMLVSKVDLVWNGGIGTYVKSSEETDAEVGDKAND
ALRIDGRELNCRVVGEGGNLGLTQRGRMEAAAKGVRVNTDFIDNAGGVNCSDHEVNIKILIDEVVSRGDLTEKQRNQLLA
DMTDEVSELVLLDNYRQTQALDLAELLSRQGIGPYRRFISELEAAGQIDRELEFLPSDEELLERTQHNQGMTLPELSVLI
SYAKSVLKGDLIASDVPDDPTIMRFVERVFPSMLAERYRDEMYEHRLKREIVATQVANDLVDYMGVVFVRRLMDSTGADR
ADIARAYVIARDSFQLPRLWEQIEALDNKVPSQVQYSMMLDLMRMLRRSTRWFLRQRTGMSTRDTIDYFAPRLAQLQENI
GKRLRGEEQEQWSARRQELVKAGVPEALASTVAAAGSLYAALGIIQTARQTDEKPQRVAEIFYEVGARLELPWIIQQVTR
LEVRDGWQAKARDTFRDDIDRQQLALTASVLGMDGGPRDSAERVDRWLSLHEGMHQRWRHLLEEVGSGSQGGFPLFAVAV
RELVDLAESNSEA
>A0R1C2 1.4.1.2~~~gdh~~~NAD-specific glutamate dehydrogenase~~~COG2902
MIRRLSVAFLSTYRGPQADAPGVTSTGPLAVAAHDDLVSDDLVAAHYRLASMRAPGETKAAVYPGDAGSGAALQIVTDQA
PMLVDSVTVLLHRHGIAYTAIMNPVFRVRRGLDGELLDVRPAAEAAPGDGADECWILVPITAAADGEALTEATRLVPGIL
AEARQIGLDSGAMIAALHGLANDLATDLEGHFPNAERKEVAALLRWLADGHFVLLGYQQCVVGDGNAEVDPASRLGVLRL
RNDVLPPLTDSDDLLVLAQATMPSYLRYGAYPYIVVVRESPGASRVIEHRFVGLFTVAAMNANALEIPLISRRVEEALAM
AHRDPSHPGQLLRDIIQTIPRPELFALSSKQLLEMALAVVDLGSRRRTLLFLRADHLAHFVSCLVYLPRDRYTTAVRLEM
QDILVRELGGAGIDYSARVSESPWAVVHFTVRLPEGTAADSVDTSLENESRIQDLLTEATRNWGDRMISAAAAASISPAA
LEHYAHAFPEDYKQAFAPQDAIADISLIEALQDDSVKLVLADTAEDRVWKLTWYLGGHSASLSELLPMLQSMGVVVLEER
PFTLRRTDGLPVWIYQFKISPHPSIPHAPDAEAQRDTAQRFADAVTAIWHGRVEIDRFNELVMRAGLTWQQVVVLRAYAK
YLRQAGFPYSQSHIESVLNENPHTTRSLIDLFEALFDPSQETDGRRDAQGAAAAVAADIDALVSLDTDRVLRAFANLIEA
TLRTNYFVARPDSARARNVLAFKLNPLVIKELPLPRPKFEIFVYSPRVEGVHLRFGFVARGGLRWSDRREDFRTEILGLV
KAQAVKNAVIVPVGAKGGFVVKRPPTLTGDAAADREATRAEGVECYRLFISGLLDVTDNVDKATGAVVTPPEVVRRDGED
AYLVVAADKGTATFSDIANEVAKSYGFWLGDAFASGGSIGYDHKAMGITAKGAWESVKRHFREMGVDTQTQDFTVVGIGD
MSGDVFGNGMLLSKHIRLVAAFDHRDIFLDPNPDAGRSWDERKRLFDLPRSSWADYDKSLISEGGGVYSRQQKSIPISPQ
VRTALGLDADVEELTPPALIKAILKAPVDLLWNGGIGTYIKAETEADADVGDRANDQIRVCGNQVRAKVIGEGGNLGVTA
LGRIEFDLAGGRINTDALDNSAGVDCSDHEVNIKILIDSAVTAGKVTPEERTELLLSMTDEVGELVLADNRDQNDLMGTS
RANAASLLSVHARMIKDLVDNRGLNRELEALPSEKEIRRRADAGIGLTSPELATLMAHVKLALKDDVLASDLPDQEVFAS
RLPYYFPTRLREELHGEIRSHQLRREIITTMLVNDLVDTAGISYAYRITEDVGVGPVDAVRSYVAINAIFGIGDVWRRIR
AAGDAGVPTSVTDRMTLDLRRLVDRAGRWLLNYRPQPLAVGAEINRFGAKVAALTPRMSEWLRGDDKAIVSKEAGDFASH
GVPEDLAYHIATGLYQYSLLDVIDIADIVDREPDEVADTYFALMDHLGADALLTAVSRLSRDDRWHSLARLAIRDDIYGS
LRALCFDVLAVGEPDENGEEKIAEWETTNSSRVTRARRTLTEIYKDGEQDLATLSVAARQIRSMTRTSGTGTTG
>O53203 1.4.1.2~~~gdh~~~NAD-specific glutamate dehydrogenase~~~COG2902
MTIDPGAKQDVEAWTTFTASADIPDWISKAYIDSYRGPRDDSSEATKAAEASWLPASLLTPAMLGAHYRLGRHRAAGESC
VAVYRADDPAGFGPALQVVAEHGGMLMDSVTVLLHRLGIAYAAILTPVFDVHRSPTGELLRIEPKAEGTSPHLGEAWMHV
ALSPAVDHKGLAEVERLLPKVLADVQRVATDATALIATLSELAGEVESNAGGRFSAPDRQDVGELLRWLGDGNFLLLGYQ
RCRVADGMVYGEGSSGMGVLRGRTGSRPRLTDDDKLLVLAQARVGSYLRYGAYPYAIAVREYVDGSVVEHRFVGLFSVAA
MNADVLEIPTISRRVREALAMAESDPSHPGQLLLDVIQTVPRPELFTLSAQRLLTMARAVVDLGSQRQALLFLRADRLQY
FVSCLVYMPRDRYTTAVRMQFEDILVREFGGTRLEFTARVSESPWALMHFMVRLPEVGVAGEGAAAPPVDVSEANRIRIQ
GLLTEAARTWADRLIGAAAAAGSVGQADAMHYAAAFSEAYKQAVTPADAIGDIAVITELTDDSVKLVFSERDEQGVAQLT
WFLGGRTASLSQLLPMLQSMGVVVLEERPFSVTRPDGLPVWIYQFKISPHPTIPLAPTVAERAATAHRFAEAVTAIWHGR
VEIDRFNELVMRAGLTWQQVVLLRAYAKYLRQAGFPYSQSYIESVLNEHPATVRSLVDLFEALFVPVPSGSASNRDAQAA
AAAVAADIDALVSLDTDRILRAFASLVQATLRTNYFVTRQGSARCRDVLALKLNAQLIDELPLPRPRYEIFVYSPRVEGV
HLRFGPVARGGLRWSDRRDDFRTEILGLVKAQAVKNAVIVPVGAKGGFVVKRPPLPTGDPAADRDATRAEGVACYQLFIS
GLLDVTDNVDHATASVNPPPEVVRRDGDDAYLVVAADKGTATFSDIANDVAKSYGFWLGDAFASGGSVGYDHKAMGITAR
GAWEAVKRHFREIGIDTQTQDFTVVGIGDMSGDVFGNGMLLSKHIRLIAAFDHRHIFLDPNPDAAVSWAERRRMFELPRS
SWSDYDRSLISEGGGVYSREQKAIPLSAQVRAVLGIDGSVDGGAAEMAPPNLIRAILRAPVDLLFNGGIGTYIKAESESD
ADVGDRANDPVRVNANQVRAKVIGEGGNLGVTALGRVEFDLSGGRINTDALDNSAGVDCSDHEVNIKILIDSLVSAGTVK
ADERTQLLESMTDEVAQLVLADNEDQNDLMGTSRANAASLLPVHAMQIKYLVAERGVNRELEALPSEKEIARRSEAGIGL
TSPELATLMAHVKLGLKEEVLATELPDQDVFASRLPRYFPTALRERFTPEIRSHQLRREIVTTMLINDLVDTAGITYAFR
IAEDVGVTPIDAVRTYVATDAIFGVGHIWRRIRAANLPIALSDRLTLDTRRLIDRAGRWLLNYRPQPLAVGAEINRFAAM
VKALTPRMSEWLRGDDKAIVEKTAAEFASQGVPEDLAYRVSTGLYRYSLLDIIDIADIADIDAAEVADTYFALMDRLGTD
GLLTAVSQLPRHDRWHSLARLAIRDDIYGALRSLCFDVLAVGEPGESSEQKIAEWEHLSASRVARARRTLDDIRASGQKD
LATLSVAARQIRRMTRTSGRGISG
>P28997 1.4.1.2~~~~~~NAD-specific glutamate dehydrogenase~~~
MTDTLNPLVAAQEKVRIACEKLGCDPAVYELLKEPQRVIEISIPVKMDDGTVKVFKGWRSAHSSAVGPSKGGVRFHPNVN
MDEVKALSLWMTFKGGALGLPYGGGKGGICVDPAELSERELEQLSRGWVRGLYKYLGDRIDIPAPDVNTNGQIMSWFVDE
YVKLNGERMDIGTFTGKPVAFGGSEGRNEATGFGVAVVVRESAKRFGIKMEDAKIAVQGFGNVGTFTVKNIERQGGKVCA
IAEWDRNEGNYALYNENGIDFKELLAYKEANKTLIGFPGAERITDEEFWTKEYDIIVPAALENVITGERAKTINAKLVCE
AANGPTTPEGDKVLTERGINLTPDILTNSGGVLVSYYEWVQNQYGYYWTEAEVEEKQEADMMKAIKGVFAVADEYNVTLR
EAVYMYAIKSIDVAMKLRGWY
>P0C934 1.4.1.2~~~gdh~~~NAD-specific glutamate dehydrogenase~~~COG0334
MKTQEIMTMLEAKHPGESEFLQAVKEVLLSVEEVYNQHPEFEKNGIIERIVEPDRVFTFRVPWVDDQGKVQVNIGYRVQF
NNAIGPYKGGIRFHPSVNLSILKFLGFEQMFKNALTTLPMGGGKGGADFSPKGKSEAEIMRFCQSFMTELWRNIGPDTDI
PAGDIGVGGREVGYMFGMYKKLAREHTGTLTGKGFEFGGSRLRPESTGFGAVYFVQNMCKQNGVDYKGKTLAISGFGNVA
WGVAQKATELGIKVVTISGPDGYVYDPDGINTPEKFRCMLDLRDSGNDVVSDYVKRFPNAQFFPGKKPWEQKVDFAMPCA
TQNEMNLEDAKTLHKNGVTLVAETSNMGCTAEASEYYVANKMLFAPGKAVNAGGVSCSGLEMTQNAMHLVWTNEEVDKWL
HQIMQDIHEQCVTYGKDGNYIDYVKGANIAGFMKVAKAMVAQGVC
>Q9HZE0 1.4.1.2~~~gdhB~~~NAD-specific glutamate dehydrogenase~~~
MAFFTAASKADFQHQLQTALAQHLGDKALPQVTLFAEQFFSLISLDELTQRRLSDLVGCTLSAWRLLERFDRDQPEVRVY
NPDYEKHGWQSTHTAVEVLHPDLPFLVDSVRMELNRRGYSIHTLQTNVLSVRRSAKGELKEILPKGSQGKDVSQESLMYL
EIDRCAHAGELRALEKAILEVLGEVRVTVADFEPMKAKARELLTWLGKAKLKVPAEELKEVRSYLEWLLDNHFTFLGYEE
FSVADEADGGRMVYDEKSFLGLTRLLRAGLSKDDLHIEDYAVAYLREPVLLSFAKAAHPSRVHRPAYPDYVSIRELDGKG
RVIRECRFMGLFTSSVYNESVNDIPFIRGKVAEVMRRSGFDTKAHLGKELAQVLEVLPRDDLFQTPVDELFSTALAIVRI
QERNKIRVFLRKDPYGRFCYCLAYVPRDVYSTETRLKIQQVLMERLQASDCEFWTFFSESVLARVQFILRVDPKSRIDID
PARLEEEVIQACRSWQDDYSSLVVENLGEAKGTNVLADFPKGFPAGYRERFAPHFAVVDLQHLLSLSEQRPLVMSFYQPL
AQGEQQLHCKLYHADTPLALSDVLPILENLGLRVLGEFPYRLRHQNGREYWIHDFAFTYAEGLDVDIQQLNEILQDAFVH
IVSGDAENDAFNRLVLTANLPWRDVALLRAYARYLKQIRLGFDLGYIASALNAHTDIARELVRLFKTRFYLARKLTAEDL
EDKQQKLEQAILGALDEVQVLNEDRILRRYLDLIKATLRTNFYQPDGNGQNKSYFSFKFNPKAIPELPRPVPKYEIFVYS
PRVEGVHLRGGKVARGGLRWSDREEDFRTEVLGLVKAQQVKNAVIVPVGAKGGFVPRRLPLGGSRDEIQAEAIACYRIFI
SGLLDITDNLKEGEVVPPANVVRHDEDDPYLVVAADKGTATFSDIANGIAAEYGFWLGDAFASGGSAGYDHKGMGITAKG
AWVSVQRHFRERGIDVQKDNISVIGIGDMAGDVFGNGLLMSDKLQLVAAFNHMHIFIDPNPDAASSFVERQRLFNLPRSS
WADYDAKLISAGGGIFLRSAKSIAITPEMKARFDIQADRLAPTELIHALLKAPVDLLWNGGIGTYVKSSKETHADVGDKA
NDGLRVDGRELRAKVVGEGGNLGMTQLARVEFGLHGGANNTDFIDNAGGVDCSDHEVNIKILLNEVVQAGDMTEKQRNAL
LVKMTDAVGALVLGNNYKQTQALSLAQRRARERIAEYKRLMGDLEARGKLDRALEFLPSDEELAERISAGQGLTRAELSV
LISYSKIDLKESLLKSLVPDDDYLTRDMETAFPALLAEKFGDAMRRHRLKREIVSTQIANDLVNHMGITFVQRLKESTGM
SAANVAGAYVIVRDVFHLPHWFRQIENLDYQVPADIQLTLMDELMRLGRRATRWFLRSRRNELDAARDVAHFGPRIAALG
LKLNELLEGPTRELWQARYQTYVDAGVPELLARMVAGTSHLYTLLPIIEASDVTGQDTAEVAKAYFAVGSALDLTWYLQQ
ITNLPVENNWQALAREAFRDDLDWQQRAITVSVLQMQDGPKEVEARVGLWLEQHLPLVERWRAMLVELRAASGTDYAMYA
VANRELMDLAQSSQHGVCIP
>Q7A6H8 1.4.1.2~~~gluD~~~NAD-specific glutamate dehydrogenase~~~
MTENNNLVTSTQGIIKEALHKLGFDEGMYDLIKEPLRMLQVRIPVRMDDGTVKTFTGYRAQHNDAVGPTKGGVRFHPDVD
EEEVKALSMWMTLKCGIVNLPYGGGKGGIVCDPRQMSIHEVERLSRGYVRAISQFVGPNKDIPAPDVFTNSQIMAWMMDE
YSALDKFNSPGFITGKPIVLGGSHGRDRSTALGVVIAIEQAAKRRNMQIEGAKVVIQGFGNAGSFLAKFLYDLGAKIVGI
SDAYGALHDPNGLDIDYLLDRRDSFGTVTNLFEETISNKELFELDCDILVPAAISNQITEDNAHDIKASIVVEAANGPTT
PEATRILTERGILLVPDVLASAGGVTVSYFEWVQNNQGYYWSEEEVNEKLREKLEAAFDTIYELSQNRKIDMRLAAYIIG
IKRTAEAARYRGWA
>P96110 1.4.1.3~~~gdhA~~~Glutamate dehydrogenase~~~COG0334
MPEKSLYEMAVEQFNRAASLMDLESDLAEVLRRPKRVLIVEFPVRMDDGHVEVFTGYRVQHNVARGPAKGGIRYHPDVTL
DEVKALAFWMTWKTAVMNLPFGGGKGGVRVDPKKLSRNELERLSRRFFSEIQVIIGPYNDIPAPDVNTNADVMAWYMDTY
SMNVGHTVLGIVTGKPVELGGSKGREEATGRGVKVCAGLAMDVLGIDPKKATVAVQGFGNVGQFAALLISQELGSKVVAV
SDSRGGIYNPEGFDVEELIRYKKEHGTVVTYPKGERITNEELLELDVDILVPAALEGAIHAGNAERIKAKAVVEGANGPT
TPEADEILSRRGILVVPDILANAGGVTVSYFEWVQDLQSFFWDLDQVRNALEKMMKGAFNDVMKVKEKYNVDMRTAAYIL
AIDRVAYATKKRGIYP
>P31026 1.4.1.4~~~gdh~~~NADP-specific glutamate dehydrogenase~~~COG0334
MTVDEQVSNYYDMLLKRNAGEPEFHQAVAEVLESLKIVLEKDPHYADYGLIQRLCEPERQLIFRVPWVDDQGQVHVNRGF
RVQFNSALGPYKGGLRFHPSVNLGIVKFLGFEQIFKNSLTGLPIGGGKGGSDFDPKGKSDLEIMRFCQSFMTELHRHIGE
YRDVPAGDIGVGGREIGYLFGHYRRMANQHESGVLTGKGLTWGGSLVRTEATGYGCVYFVSEMIKAKGESISGQKIIVSG
SGNVATYAIEKAQELGATVIGFSDSSGWVHTPNGVDVAKLREIKEVRRARVSVYADEVEGATYHTDGSIWDLKCDIALPC
ATQNELNGENAKTLADNGCRFVAEGANMPSTPEAVEVFRERDIRFGPGKAANAGGVATSALEMQQNASRDSWSFEYTDER
LQVIMKNIFKTCAETAAEYGHENDYVVGANIAGFKKVADAMLAQGVI
>P00370 1.4.1.4~~~gdhA~~~NADP-specific glutamate dehydrogenase~~~COG0334
MDQTYSLESFLNHVQKRDPNQTEFAQAVREVMTTLWPFLEQNPKYRQMSLLERLVEPERVIQFRVVWVDDRNQIQVNRAW
RVQFSSAIGPYKGGMRFHPSVNLSILKFLGFEQTFKNALTTLPMGGGKGGSDFDPKGKSEGEVMRFCQALMTELYRHLGA
DTDVPAGDIGVGGREVGFMAGMMKKLSNNTACVFTGKGLSFGGSLIRPEATGYGLVYFTEAMLKRHGMGFEGMRVSVSGS
GNVAQYAIEKAMEFGARVITASDSSGTVVDESGFTKEKLARLIEIKASRDGRVADYAKEFGLVYLEGQQPWSLPVDIALP
CATQNELDVDAAHQLIANGVKAVAEGANMPTTIEATELFQQAGVLFAPGKAANAGGVATSGLEMAQNAARLGWKAEKVDA
RLHHIMLDIHHACVEHGGEGEQTNYVQGANIAGFVKVADAMLAQGVI
>P95544 1.4.1.3~~~gdhA~~~NAD(P)-specific glutamate dehydrogenase~~~
MKATEVIEKLKAKFPGQPEYIQAVSQVLGTIEEEYNKHPEFEKANLIERLCVPDRILQFRVSWVDDNGNVQTNLGYRVQH
NNAIGPYKGGLRFHKSVNASILKFLAFEQTFKNSLTTLPMGGAKGGSDFDPHGKSDMEVMRFCQAFMNELYRLIGPDEDV
PAGDIGVGGREVGYMFGQYKKLTHQFQGILTGKGLEFGGSLIRPEATGYGNVYFLEDMLKTRGESLEGKTVLVSGSGNVA
QYTIEKLLQLGAKPVTCSDSNGYIYDPDGIDAEKLAFIMELKNVKRGRIKEYAEKYGVKYVENARPWGEKADIATPCATQ
DEINEAEAKTLIANGVFAVSEGANMPTEPAAIKVFQDAKILYCPGKASNAGGVATSGLEMSQNSERLSWTREEVDTKLHN
IMDEIHANCVKYGTEPDGYINYVKGANVAGFMKVAKAMMAQGIY
>Q9S1F9 1.4.1.4~~~gdhA~~~NADP-specific glutamate dehydrogenase~~~
MSISKAIEKVEARYAHQPEFIQAVKEVAITIKPLYDAHPEYDKLKVFERLVEPDRVFGFRVNWEDDNGEIQINRGWRVQF
SNALGPYKGGLRFHPTVNQSVLKFLGFEQIFKNALTGLPIGGGKGGSDFDPKGKTDSEIRRFCYAFMRELHHYVNKDMDV
PAGDIGVGGREVSYMFAMYKNLTRESTGVITGKGVGFGGSLMRTEATGYGAVYFLQNMLAAQNESIEGKKVLVSGAGNVS
LHAAEKATLIGAIVLTVSDSKGTIYDAKGLNQEKIDWLKVQKDQHKPLADYVEVFGGEWMADQKPWSIKADIAIPSATQN
EINEEDAKLLVDNGVKYIVEGANMPLTAEAIDYIRLHRVHYAPGKAANAGGVAVSALEMSQNSVRQYQTFEQVDERLQGI
MKDIHDSSAQASEMYGQTDEGYIDYMSGANMVGFKRVADALVAFGILN
>P39482 1.1.1.47~~~gdhI~~~Glucose 1-dehydrogenase 1~~~
MYKDLEGKVVVITGSSTGLGKAMAIRFATEKAKVVVNYRSKEEEANSVLEEIKKVGGEAIAVKGDVTVESDVINLVQSSI
KEFGKLDVMINNAGMENPVSSHEMSLSDWNKVIDTNLTGAFLGSREAIKYFVENDIKGTVINMSSVHEKIPWPLFVHYAA
SKGGMKLMTETLALEYAPKGIRVNNIGPGAINTPINAEKFADPEQRADVESMIPMGYIGEPEEIAAVAAWLASSEASYVT
GITLFADGGMTQYPSFQAGRG
>P80869 1.1.1.47~~~ycdF~~~Glucose 1-dehydrogenase 2~~~COG1028
MYKDLTGKTAIVTGSSKGIGKAIAERFGKEKMNVVVNYHSDPSGADETLEIIKQNGGKAVSVEADVSKEEGIQALLDTAL
EHFGTLDVMVNNSGFNGVEAMPHEMSLEDWQRVIDVNVTGTFLGAKAALNHMMKNNIKGNVLNISSVHQQIPRPVNVQYS
TSKGGIKMMTETLALNYADKGIRVNAIAPGTIATESNVDTKKEESRQKQLKKIPMKAFGKPEEVAAAAAWLVSEEASYVT
GATLFVDGGMTLYPSQLE
>P39485 1.1.1.47~~~gdhIV~~~Glucose 1-dehydrogenase 4~~~
MYTDLKDKVVVITGGSTGLGRAMAVRFGQEEAKVVINYYNNEEEALDAKKEVEEAGGQAIIVQGDVTKEEDVVNLVQTAI
KEFGTLDVMINNAGVENPVPSHELSLDNWNKVIDTNLTGAFLGSREAIKYFVENDIKGNVINMSSVHEMIPWPLFVHYAA
SKGGMKLMTETLALEYAPKGIRVNNIGPGAMNTPINAEKFADPVQRADVESMIPMGYIGKPEEVAAVAAFLASSQASYVT
GITLFADGGMTKYPSFQAGRG
>P13650 1.1.5.2~~~gdhB~~~Quinoprotein glucose dehydrogenase B~~~
MNKHLLAKIALLSAVQLVTLSAFADVPLTPSQFAKAKSENFDKKVILSNLNKPHALLWGPDNQIWLTERATGKILRVNPE
SGSVKTVFQVPEIVNDADGQNGLLGFAFHPDFKNNPYIYISGTFKNPKSTDKELPNQTIIRRYTYNKSTDTLEKPVDLLA
GLPSSKDHQSGRLVIGPDQKIYYTIGDQGRNQLAYLFLPNQAQHTPTQQELNGKDYHTYMGKVLRLNLDGSIPKDNPSFN
GVVSHIYTLGHRNPQGLAFTPNGKLLQSEQGPNSDDEINLIVKGGNYGWPNVAGYKDDSGYAYANYSAAANKSIKDLAQN
GVKVAAGVPVTKESEWTGKNFVPPLKTLYTVQDTYNYNDPTCGEMTYICWPTVAPSSAYVYKGGKKAITGWENTLLVPSL
KRGVIFRIKLDPTYSTTYDDAVPMFKSNNRYRDVIASPDGNVLYVLTDTAGNVQKDDGSVTNTLENPGSLIKFTYKAK
>P07999 1.1.1.47~~~gdhB~~~Glucose 1-dehydrogenase B~~~
MYKDLEGKVVVITGSSTGLGKSMAIRFATEKAKVVVNYRSKEDEANSVLEEEIKKVGGEAIAVKGDVTVESDVINLVQSA
IKEFGKLDVMINNAGMENPVSSHEMSLSDWNKVIDTNLTGAFLGSREAIKYFVENDIKGTVINMSSVHEWKIPWPLFVHY
AASKGGMKLMTETLALEYAPKGIRVNNIGPGAINTPINAEKFADPEQRADVESMIPMGYIGEPEEIAAVAWLASSEASYV
TGITLFADGGMTQYPSFQAGRG
>P36234 1.1.1.29~~~~~~Glycerate dehydrogenase~~~
MSKKKILITWPLPEAAMARARESYDVIAHGDDPKITIDEMIETAKSVDALLITLNEKCRKEVIDRIPENIKCISTYSIGF
DHIDLDACKARGIKVGNAPHGVTVATAEIAMLLLLGSARRAGEGEKMIRTRSWPGWEPLELVGEKLDNKTLGIYGFGSIG
QALAKRAQGFDMDIDYFDTHRASSSDEASYQATFHDSLDSLLSVSQFFSLNAPSTPETRYFFNKATIKSLPQGAIVVNTA
RGDLVDNELVVAALEAGRLAYAGFDVFAGEPNINEGYYDLPNTFLFPHIGSAATQAREDMAHQANDLIDALFGGADMSYA
LA
>Q59516 1.1.1.29~~~hprA~~~Glycerate dehydrogenase~~~COG1052
MTKKVVFLDRESLDATVREFNFPHEYKEYESTWTPEEIVERLQGAEIAMINKVPMRADTLKQLPDLKLIAVAATGTDVVD
KAAAKAQGITVVNIRNYAFNTVPEHVVGLMFALRRAIVPYANSVRRGDWNKSKQFCYFDYPIYDIAGSTLGIIGYGALGK
SIAKRAEALGMKVLAFDVFPQDGLVDLETILTQSDVITLHVPLTPDTKNMIGAEQLKKMKRSAILINTARGGLVDEAALL
QALKDGTIGGAGFDVVAQEPPKDGNILCDADLPNLIVTPHVAWASKEAMQILADQLVDNVEAFVAGKPQNVVEA
>P12310 1.1.1.47~~~gdh~~~Glucose 1-dehydrogenase~~~COG1028
MYPDLKGKVVAITGAASGLGKAMAIRFGKEQAKVVINYYSNKQDPNEVKEEVIKAGGEAVVVQGDVTKEEDVKNIVQTAI
KEFGTLDIMINNAGLENPVPSHEMPLKDWDKVIGTNLTGAFLGSREAIKYFVENDIKGNVINMSSVHEVIPWPLFVHYAA
SKGGIKLMTETLALEYAPKGIRVNNIGPGAINTPINAEKFADPKQKADVESMIPMGYIGEPEEIAAVAAWLASKEASYVT
GITLFADGGMTQYPSFQAGRG
>P15877 1.1.5.2~~~gcd~~~Quinoprotein glucose dehydrogenase~~~COG4993
MAINNTGSRRLLVTLTALFAALCGLYLLIGGGWLVAIGGSWYYPIAGLVMLGVAWMLWRSKRAALWLYAALLLGTMIWGV
WEVGFDFWALTPRSDILVFFGIWLILPFVWRRLVIPASGAVAALVVALLISGGILTWAGFNDPQEINGTLSADATPAEAI
SPVADQDWPAYGRNQEGQRFSPLKQINADNVHNLKEAWVFRTGDVKQPNDPGEITNEVTPIKVGDTLYLCTAHQRLFALD
AASGKEKWHYDPELKTNESFQHVTCRGVSYHEAKAETASPEVMADCPRRIILPVNDGRLIAINAENGKLCETFANKGVLN
LQSNMPDTKPGLYEPTSPPIITDKTIVMAGSVTDNFSTRETSGVIRGFDVNTGELLWAFDPGAKDPNAIPSDEHTFTFNS
PNSWAPAAYDAKLDLVYLPMGVTTPDIWGGNRTPEQERYASSILALNATTGKLAWSYQTVHHDLWDMDLPAQPTLADITV
NGQKVPVIYAPAKTGNIFVLDRRNGELVVPAPEKPVPQGAAKGDYVTPTQPFSELSFRPTKDLSGADMWGATMFDQLVCR
VMFHQMRYEGIFTPPSEQGTLVFPGNLGMFEWGGISVDPNREVAIANPMALPFVSKLIPRGPGNPMEQPKDAKGTGTESG
IQPQYGVPYGVTLNPFLSPFGLPCKQPAWGYISALDLKTNEVVWKKRIGTPQDSMPFPMPVPVPFNMGMPMLGGPISTAG
NVLFIAATADNYLRAYNMSNGEKLWQGRLPAGGQATPMTYEVNGKQYVVISAGGHGSFGTKMGDYIVAYALPDDVK
>P40288 1.1.1.47~~~~~~Glucose 1-dehydrogenase~~~
MYKDLEGKVVVITGSSTGLGKSMAIRFATEKAKVVVNYRSKEDEANSVLEEIKKVGGEAIAVKGDVTVESDVINLVQSAI
KEFGKLDVMINNAGLENPVSSHEMSLSDWNKVIDTNLTGAFLGSREAIKYFVENDIKGTVINMSSVHEKIPWPLFVHYAA
SKGGMKLMTETLALEYAPKGIRVNNIGPGAINTPINAEKFADPEQRADVESMIPMGYIGEPEEIAAVAAWLASSEASYVT
GITLFADGGMTQYPSFQAGRG
>P14295 1.1.1.-~~~~~~L-2-hydroxyisocaproate dehydrogenase~~~
MARKIGIIGLGNVGAAVAHGLIAQGVADDYVFIDANEAKVKADQIDFQDAMANLEAHGNIVINDWAALADADVVISTLGN
IKLQQDNPTGDRFAELKFTSSMVQSVGTNLKESGFHGVLVVISNPVDVITALFQHVTGFPAHKVIGTGTLLDTARMQRAV
GEAFDLDPRSVSGYNLGEHGNSQFVAWSTVRVMGQPIVTLADAGDIDLAAIEEEARKGGFTVLNGKGYTSYGVATSAIRI
AKAVMADAHAELVVSNRRDDMGMYLSYPAIIGRDGVLAETTLDLTTDEQEKLLQSRDYIQQRFDEIVDTL
>P22643 3.8.1.5~~~dhlA~~~Haloalkane dehalogenase~~~
MINAIRTPDQRFSNLDQYPFSPNYLDDLPGYPGLRAHYLDEGNSDAEDVFLCLHGEPTWSYLYRKMIPVFAESGARVIAP
DFFGFGKSDKPVDEEDYTFEFHRNFLLALIERLDLRNITLVVQDWGGFLGLTLPMADPSRFKRLIIMNACLMTDPVTQPA
FSAFVTQPADGFTAWKYDLVTPSDLRLDQFMKRWAPTLTEAEASAYAAPFPDTSYQAGVRKFPKMVAQRDQACIDISTEA
ISFWQNDWNGQTFMAIGMKDKLLGPDVMYPMKALINGCPEPLEIADAGHFVQEFGEQVAREALKHFAETE
>P0A393 1.4.1.9~~~ldh~~~Leucine dehydrogenase~~~COG0334
MTLEIFEYLEKYDYEQVVFCQDKESGLKAIIAIHDTTLGPALGGTRMWTYDSEEAAIEDALRLAKGMTYKNAAAGLNLGG
AKTVIIGDPRKDKSEAMFRALGRYIQGLNGRYITAEDVGTTVDDMDIIHEETDFVTGISPSFGSSGNPSPVTAYGVYRGM
KAAAKEAFGTDNLEGKVIAVQGVGNVAYHLCKHLHAEGAKLIVTDINKEAVQRAVEEFGASAVEPNEIYGVECDIYAPCA
LGATVNDETIPQLKAKVIAGSANNQLKEDRHGDIIHEMGIVYAPDYVINAGGVINVADELYGYNRERALKRVESIYDTIA
KVIEISKRDGIATYVAADRLAEERIASLKNSRSTYLRNGHDIISRR
>Q53560 1.4.1.9~~~ldh~~~Leucine dehydrogenase~~~
MELFRYMEQYDYEQLVFCQDKQSGLKAIIAIHDTTLGPALGGTRMWTYESEEAAIEDALRLARGMTYKNAAAGLNLGGGK
TVIIGDPRKDKNEEMFRAFGRYIQGLNGRYITAEDVGTTVEDMDIIHDETDFVTGISPAFGSSGNPSPVTAYGVYKGMKA
AAKAAFGTDSLEGKTVAVQGVGNVAYNLCRHLHEEGAKLIVTDINKEAVERAVAEFGARAVDPDDIYSQECDIYAPCALG
ATINDDTIPQLKAKVIAGAANNQLKETRHGDQIHDMGIVYAPDYVINAGGVINVADELYGYNSERALKKVEGIYGNIERV
LEISKRDRIPTYLAADRLAEERIERMRQSRSQFLQNGHHILSRR
>P13154 1.4.1.9~~~ldh~~~Leucine dehydrogenase~~~
MELFKYMETYDYEQVLFCQDKESGLKAIIAIHDTTLGPALGGTRMWMYNSEEEALEDALRLARGMTYKNAAAGLNLGGGK
TVIIGDPRKDKNEAMFRAFGRFIQGLNGRYITAEDVGTTVADMDIIYQETDYVTGISPEFGSSGNPSPATAYGVYRGMKA
AAKEAFGSDSLEGKVVAVQGVGNVAYHLCRHLHEEGAKLIVTDINKEVVARAVEEFGAKAVDPNDIYGVECDIFAPCALG
GIINDQTIPQLKAKVIAGSADNQLKEPRHGDIIHEMGIVYAPDYVINAGGVINVADELYGYNRERAMKKIEQIYDNIEKV
FAIAKRDNIPTYVAADRMAEERIETMRKARSPFLQNGHHILSRRRAR
>Q60030 1.4.1.9~~~ldh~~~Leucine dehydrogenase~~~
MKIFDYMEKYDYEQLVMCQDKESGLKAIICIHVTTLGPALGGMRMWTYASEEEAIEDALRLGRGMTYKNAAAGLNLGGGK
TVIIGDPRKDKNEAMFRALGRFIQGLNGRYITAEDVGTTVEDMDIIHEETRYVTGVSPAFGSSGNPSPVTAYGVYRGMKA
AAKEAFGDDSLEGKVVAVQGVGHVAYELCKHLHNEGAKLIVTDINKENADRAVQEFGAEFVHPDKIYDVECDIFAPCALG
AIINDETIERLKCKVVAGSANNQLKEERHGKMLEEKGIVYAPDYVINAGGVINVADELLGYNRERAMKKVEGIYDKILKV
FEIAKRDGIPSYLAADRMAEERIEMMRKTRSTFLQDQRNLINFNNK
>P16027 1.1.2.7~~~moxF~~~Methanol dehydrogenase [cytochrome c] subunit 1~~~COG4993
MSRFVTSVSALAMLALAPAALSSGAYANDKLVELSKSDDNWVMPGKNYDSNNFSDLKQINKGNVKQLRPAWTFSTGLLNG
HEGAPLVVDGKMYIHTSFPNNTFALGLDDPGTILWQDKPKQNPAARAVACCDLVNRGLAYWPGDGKTPALILKTQLDGNV
AALNAETGETVWKVENSDIKVGSTLTIAPYVVKDKVIIGSSGAELGVRGYLTAYDVKTGEQVWRAYATGPDKDLLLASDF
NIKNPHYGQKGLGTGTWEGDAWKIGGGTNWGWYAYDPGTNLIYFGTGNPAPWNETMRPGDNKWTMTIFGRDADTGEAKFG
YQKTPHDEWDYAGVNVMMLSEQKDKDGKARKLLTHPDRNGIVYTLDRTDGALVSANKLDDTVNVFKSVDLKTGQPVRDPE
YGTRMDHLAKDICPSAMGYHNQGHDSYDPKRELFFMGINHICMDWEPFMLPYRAGQFFVGATLNMYPGPKGDRQNYEGLG
QIKAYNAITGDYKWEKMERFAVWGGTMATAGDLVFYGTLDGYLKARDSDTGDLLWKFKIPSGAIGYPMTYTHKGTQYVAI
YYGVGGWPGVGLVFDLADPTAGLGAVGAFKKLANYTQMGGGVVVFSLDGKGPYDDPNVGEWKSAAK
>P38539 1.1.2.7~~~~~~Methanol dehydrogenase [cytochrome c] subunit 1~~~
MADADLDKQVNTAGAWPIATGGYYSQHNSPLAQINKSNVKNVKAAWSFSTGVLNGHEGAPLVIGDMMYVHSAFPNNTYAL
NLNDPGKIVWQHKPKQDASTKAVMCCDVVDRGLAYGAGQIVKKQANGHLLALDAKTGKINWEVEVCDPKVGSTLTQAPFV
AKDTVLMGCSGAELGVRGAVNAFDLKTGELKWRAFATGSDDSVRLAKDFNSANPHYGQFGLGTKTWEGDAWKIGGGTNWG
WYAYDPKLNLFYYGSGNPAPWNETMRPGDNKWTMTIWGRDLDTGMAKWGYQKTPHDEWDFAGVNQMVLTDQPVNGKMTPL
LSHIDRNGILYTLNRENGNLIVAEKVDPAVNVFKKVDLKTGTPVRDPEFATRMDHKGTNICPSAMGFHNQGVDSYDPESR
TLYAGLNHICMDWEPFMLPYRAGQFFVGATLAMYPGPNGPTKKEMGQIRAFDLTTGKAKWTKWEKFAAWGGTLYTKGGLV
WYATLDGYLKALDNKDGKELWNFKMPSGGIGSPMTYSFKGKQYIGSMYGVGGWPGVGLVFDLTDPSAGLGAVGAFRELQN
HTQMGGGLMVFSL
>P15279 1.1.2.7~~~moxF~~~Methanol dehydrogenase [cytochrome c] subunit 1~~~
MSRFVTSVSALAMLALAPAALSSVAYANDKLVELSKSDDNWVMPGKNYDSNNYSELKQVNKSNVKQLRPAWTFSTGLLNG
HEGAPLVVDGKMYVHTSFPNNTFALDLDDPGHILWQDKPKQNPAARAVACCDLVNRGLAYWPGDGKTPALILKTQLDRHV
VALNAETGETVWKVENSDIKVGSTLTIAPYVVKDKVIIGSSGAELGVRGYLTAYDVKTGGQVWRAYATGPDKDLLLADDF
NVKNAHYGQKGLGTATWEGDAWKIGGGTNWGWYAYDPGTNLIYFGTGNPAPWNETMRPGDNKWTMTIFGRDADTGEAKFG
YQKTPHDEWDYAGVNVMMPSEQKDKDGKTRKLLTHPDRNGIVYTLDRTDGALVSANKLDDTVNVFKTVDLKTGQPVRDPE
YGTRMDHLAKDVCPSAMGYHNQGHDSYDPKRELFFMGINHICMDWEPFMLPYRAGQFFVGATLNMYPGPKGDRQNYEGLG
QIKAYNAITGSYKWEKMERFAVWGGTLATAGDLVFYGTLDGYLKARDSDTGDLLWKFKIPSGAIGYPMTYTHKGTQYVAI
YYGVGGWPGVGLVFDLADPTAGLGAVGAFKKLANYTQQGGGVIVFSLDGKGPYDDPNVGEWKSASK
>P12293 1.1.2.7~~~moxF~~~Methanol dehydrogenase [cytochrome c] subunit 1~~~
MNRNTPKARGASSLAMAVAMGLAVLTTAPATANDQLVELAKDPANWVMTGRDYNAQNYSEMTDINKENVKQLRPAWSFST
GVLHGHEGTPLVVGDRMFIHTPFPNTTFALDLNEPGKILWQNKPKQNPTARTVACCDVVNRGLAYWPGDDQVKPLIFRTQ
LDGHIVAMDAETGETRWIMENSDIKVGSTLTIAPYVIKDLVLVGSSGAELGVRGYVTAYDVKSGEMRWRAFATGPDEELL
LAEDFNAPNPHYGQKNLGLETWEGDAWKIGGGTNWGWYAYDPEVDLFYYGSGNPAPWNETMRPGDNKWTMAIWGREATTG
EAKFAYQKTPHDEWDYAGVNVMMLSEQEDKQGQMRKLLTHPDRNGIVYTLDRTNGDLISADKMDDTVNWVKEVQLDTGLP
VRDPEFGTRMDHKARDICPSAMGYHNQGHDSYDPERKVFMLGINHICMDWEPFMLPYRAGQFFVGATLTMYPGPKATAER
AGAGQIKAYDAISGEMKWEKMERFSVWGGTMATAGGLTFYVTLDGFIKARDSDTGDLLWKFKLPSGVIGHPMTYKHDGRQ
YVAIMYGVGGWPGVGLVFDLADPTAGLGSVGAFKRLQEFTQMGGGVMVFSLDGESPYSDPNVGEYAPGEPT
>P14775 1.1.2.7~~~moxI~~~Methanol dehydrogenase [cytochrome c] subunit 2~~~
MKTTLIAAAIVALSGLAAPALAYDGTKCKAAGNCWEPKPGFPEKIAGSKYDPKHDPKELNKQADSIKQMEERNKKRVENF
KKTGKFEYDVAKISAN
>P38540 1.1.2.7~~~moxI~~~Methanol dehydrogenase [cytochrome c] subunit 2~~~
MKHVLTLLALASVFAVSNQALAYDGQNCKEPGNCWENKPGYPEKIAGSKYDPKHDPVELNKQEESIKAMDARNAKRIANA
KSSGNFVFDVK
>P29898 1.1.2.7~~~moxI~~~Methanol dehydrogenase [cytochrome c] subunit 2~~~
MKRILTLTVAALALGTPALAYDGTNCKAPGNCWEPKPDYPAKVEGSKYDPQHDPAELSKQGESLAVMDARNEWRVWNMKK
TGKFEYDVKKIDGYDETKAPPAE
>P9WMS3 3.8.1.5~~~dhmA1~~~Haloalkane dehalogenase 1~~~COG0596
MDVLRTPDSRFEHLVGYPFAPHYVDVTAGDTQPLRMHYVDEGPGDGPPIVLLHGEPTWSYLYRTMIPPLSAAGHRVLAPD
LIGFGRSDKPTRIEDYTYLRHVEWVTSWFENLDLHDVTLFVQDWGSLIGLRIAAEHGDRIARLVVANGFLPAAQGRTPLP
FYVWRAFARYSPVLPAGRLVNFGTVHRVPAGVRAGYDAPFPDKTYQAGARAFPRLVPTSPDDPAVPANRAAWEALGRWDK
PFLAIFGYRDPILGQADGPLIKHIPGAAGQPHARIKASHFIQEDSGTELAERMLSWQQAT
>P9WMS1 3.8.1.5~~~dhmA2~~~Haloalkane dehalogenase 2~~~COG0596
MSIDFTPDPQLYPFESRWFDSSRGRIHYVDEGTGPPILLCHGNPTWSFLYRDIIVALRDRFRCVAPDYLGFGLSERPSGF
GYQIDEHARVIGEFVDHLGLDRYLSMGQDWGGPISMAVAVERADRVRGVVLGNTWFWPADTLAMKAFSRVMSSPPVQYAI
LRRNFFVERLIPAGTEHRPSSAVMAHYRAVQPNAAARRGVAEMPKQILAARPLLARLAREVPATLGTKPTLLIWGMKDVA
FRPKTIIPRLSATFPDHVLVELPNAKHFIQEDAPDRIAAAIIERFG
>Q9A919 3.8.1.5~~~dhmA~~~Haloalkane dehalogenase~~~COG0596
MDVLRTPDERFEGLADWSFAPHYTEVTDADGTALRIHHVDEGPKDQRPILLMHGEPSWAYLYRKVIAELVAKGHRVVAPD
LVGFGRSDKPAKRTDYTYERHVAWMSAWLEQNDLKDIVLFCQDWGGLIGLRLVAAFPERFSAVVVSNTGLPIGVGKSEGF
EAWLNFSQNTPELPVGFILNGGTARDLSDAERSAYDAPFPDESYKEGARIFPALVPITPEHASVEENKAAWAVLETFDKP
FVTAFSDADPITRGGEAMFLARVPGTKNVAHTTLKGGHFVQEDSPVEIAALLDGLVAGLPQA
>P22441 1.1.1.233~~~~~~N-acylmannosamine 1-dehydrogenase~~~
MTTAGVSRRPGRLAGKAAIVTGAAGGIGRATVEAYLREGASVVAMDLAPRLAATRYEEPGAIPIACDLADRAAIDAAMAD
AVARLGGLDILVAGGALKGGTGNFLDLSDADWDRYVDVNMTGTFLTCRAGARAMVAAGAGKDGRSARIITIGSVNSFMAE
PEAAAYVAAKGGVAMLTRAMAVDLARHGILVNMIAPGPVDVTGNNTGYSEPRLAEQVLDEVALGRPGLPEEVATAAVFLA
EDGSSFITGSTITIDGGLSAMIFGGMREGRR
>Q93K00 3.8.1.5~~~dhmA~~~Haloalkane dehalogenase~~~
MHVLRTPDSRFENLEDYPFVAHYLDVTARDTRPLRMHYLDEGPIDGPPIVLLHGEPTWSYLYRTMITPLTDAGNRVLAPD
LIGFGRSDKPSRIEDYSYQRHVDWVVSWFEHLNLSDVTLFVQDWGSLIGLRIAAEQPDRVGRLVVANGFLPTAQRRTPPA
FYAWRAFARYSPVLPAGRIVSVGTVRRVSSKVRAGYDAPFPDKTYQAGARAFPQLVPTSPADPAIPANRKAWEALGRWEK
PFLAIFGARDPILGHADSPLIKHIPGAAGQPHARINASHFIQEDRGPELAERILSWQQALL
>Q1QBB9 3.8.1.5~~~dhmA~~~Haloalkane dehalogenase~~~COG0596
MKILRTPDSRFANLPDYNFDPHYLMVDDSEDSELRVHYLDEGPRDADPVLLLHGEPSWCYLYRKMIPILTAAGHRVIAPD
LPGFGRSDKPASRTDYTYQRHVNWMQSVLDQLDLNNITLFCQDWGGLIGLRLVAENPDRFARVAAGNTMLPTGDHDLGEG
FRKWQQFSQEIPQFHVGGTIKSGTVTKLSQAVIDAYNAPFPDESYKEGARQFPLLVPSTPDDPASENNRAAWIELSKWTK
PFITLFSDSDPVTAGGDRIMQKIIPGTKGQAHTTIANGGHFLQEDQGEKVAKLLVQFIHDNPR
>P29894 1.4.9.1~~~mauB~~~Methylamine dehydrogenase heavy chain~~~
MALPPNFMPLFRASLIGLGLGCSALALAASAQDAPEAETQAQETQGQAAARAAAADLAAGQDDEPRILEAPAPDARRVYV
NDPAHFAAVTQQFVIDGEAGRVIGMIDGGFLPNPVVADDGSFIAHASTVFSRIARGERTDYVEVFDPVTLLPTADIELPD
APRFLVGTYPWMTSLTPDGKTLLFYQFSPAPAVGVVDLEGKAFKRMLDVPDCYHIFPTAPDTFFMHCRDGSLAKVAFGTE
GTPEITHTEVFHPEDEFLINHPAYSQKAGRLVWPTYTGKIHQIDLSSGDAKFLPAVEALTEAERADGWRPGGWQQVAYHR
ALDRIYLLVDQRDEWRHKTASRLLVVLDAKTGERLAKFEMGHEIDSINVSQDEKPLLYALSTGDKTLYIHDAESGEELRS
VNQLGHGPQVITTADMG
>P23006 1.4.9.1~~~mauB~~~Methylamine dehydrogenase heavy chain~~~COG3391
MASARESTPRYLTLIGATLACSALALGAAQAQTEPAEPEAPAETAAADAAGQTEGQRGAAEAAAALAAGEADEPVILEAP
APDARRVYIQDPAHFAAITQQFVIDGSTGRILGMTDGGFLPHPVAAEDGSFFAQASTVFERIARGKRTDYVEVFDPVTFL
PIADIELPDAPRFLVGTYQWMNALTPDNKNLLFYQFSPAPAVGVVDLEGKTFDRMLDVPDCYHIFPASPTVFYMNCRDGS
LARVDFADGETKVTNTEVFHTEDELLINHPAFSLRSGRLVWPTYTGKIFQADLTAEGATFRAPIEALTEAERADDWRPGG
WQQTAYHRQSDRIYLLVDQRDEWKHKAASRFVVVLNAETGERINKIELGHEIDSINVSQDAEPLLYALSAGTQTLHIYDA
ATGEELRSVDQLGRGPQIITTHDMDS
>P00372 1.4.9.1~~~mauA~~~Methylamine dehydrogenase light chain~~~
MLGKSQFDDLFEKMSRKVAGHTSRRGFIGRVGTAVAGVALVPLLPVDRRGRVSRANAAESAGDPRGKWKPQDNDVQSCDY
WRHCSIDGNICDCSGGSLTSCPPGTKLASSSWVASCYNPTDKQSYLISYRDCCGANVSGRCACLNTEGELPVYRPEFGND
IIWCFGAEDDAMTYHCTISPIVGKAS
>P22619 1.4.9.1~~~mauA~~~Methylamine dehydrogenase light chain~~~
MLGNFRFDDMVEKLSRRVAGQTSRRSVIGKLGTAMLGIGLVPLLPVDRRGRVSRANAADAPAGTDPRAKWVPQDNDIQAC
DYWRHCSIDGNICDCSGGSLTNCPPGTKLATASWVASCYNPTDGQSYLIAYRDCCGYNVSGRCPCLNTEGELPVYRPEFA
NDIIWCFGAEDDAMTYHCTISPIVGKAS
>P22641 1.4.9.1~~~mauA~~~Methylamine dehydrogenase light chain~~~
MLGNFRFDDMVEKLSRRVAGRTSRRGAIGRLGTVLAGAALVPLLPVDRRGRVSRANAAGPAEGVDPRAKWQPQDNDIQAC
DYWRHCSIDGNICDCSGGSLTNCPPGTKLATASWVASCYNPTDGQSYLIAYRDCCGYNVSGRCPCLNTEGELPVYRPEFA
NDIIWCFGAEDDAMTYHCTISPIVGKAS
>P42974 7.1.1.2~~~ahpF~~~NADH dehydrogenase~~~COG3634
MVLDANIKAQLNQYMQLIENDIVLKVSAGEDDTSKDMLALVDELASMSSKISVEKAELNRTPSFSVNRVGEDTGVTFAGI
PLGHEFTSLVLALLQVSGRPPKVDQKVIDQVKKISGEYHFESYISLTCHNCPDVVQALNMMSVLNPNITHTMIDGAAYKA
EVESKNIMAVPTVYLNGESFGSGRMTLEEILAKMGSGTDASEFADKEPFDVLVVGGGPAGASAAIYTARKGIRTGVVAER
FGGQVLDTMSIENFISVKATEGPKLAASLEEHVKEYDIDVMNLQRAKRLEKKDLFELELENGAVLKSKTVILSTGARWRN
VNVPGEQEFKNKGVAYCPHCDGPLFEGKDVAVIGGGNSGIEAAIDLAGIVNHVTVLEFAPELKADEVLQKRLYSLPNVTV
VKNAQTKEITGDQSVNGITYVDRETGEEKHVELQGVFVQIGLVPNTEWLEGTVERNRMGEIIVDKHGATSVPGLFAAGDC
TDSAYNQIIISMGSGATAALGAFDYLIRN
>P26829 7.1.1.2~~~ahpF~~~NADH dehydrogenase~~~
MVLEPQIKSQLNQYLQLMEGDVLLKVSAGNDKVSEDMLSLVDELASMSSRITVEKTNLERTPSFSVNRPGEDTGIVFAGI
PLGHEFTSLVLALLQVSGRAPKAEQNVIDQIKNIEGEYHFESYISLSCQNCPDVVQALNLMSVLNPGISHTMIDGAAFKD
EVESKDIMAVPTVYLNGESFTSGRMTVEEILAQLGSGPDASELADKDPFDVLVVGGGPAGASSAIYAARKGIRTGIVADR
FGGQIMDTLSIENFISQKYTEGPKLAASLEEHVKEYDIDVMKLQRAKRLEKKDLIEIELENGAVLKSKSVILSTGARWRN
VGVPGEQEFKNKGVAYCPHCDGPLFEGKDVAVIGGGNSGVEAAIDLAGIVNHVTVLEFMPELKADEVLQERLNSLPNVTV
IKNAQTKEITGDDKVNGISYMDRDTEEVHHIELAGVFVQIGLVPNTDWLDGTLERNRFGEIVVDSHGATNVPGVFAAGDC
TNSAYKQIIISMGSGATAALGAFDYLIRNTTPAESAAAK
>P19582 1.1.1.3~~~hom~~~Homoserine dehydrogenase~~~COG0460
MKAIRVGLLGLGTVGSGVVKIIQDHQDKLMHQVGCPVTIKKVLVKDLEKKREVDLPKEVLTTEVYDVIDDPDVDVVIEVI
GGVEQTKQYLVDALRSKKHVVTANKDLMAVYGSELLAEAKENGCDIYFEASVAGGIPILRTLEEGLSSDRITKMMGIVNG
TTNFILTKMIKEKSPYEEVLKEAQDLGFAEADPTSDVEGLDAARKMAILARLGFSMNVDLEDVKVKGISQITDEDISFSK
RLGYTMKLIGIAQRDGSKIEVSVQPTLLPDHHPLSAVHNEFNAVYVYGEAVGETMFYGPGAGSMPTATSVVSDLVAVMKN
MRLGVTGNSFVGPQYEKNMKSPSDIYAQQFLRIHVKDEVGSFSKITSVFSERGVSFEKILQLPIKGHDELAEIVIVTHHT
SEADFSDILQNLNDLEVVQEVKSTYRVEGNGWS
>P08499 1.1.1.3~~~hom~~~Homoserine dehydrogenase~~~COG0460
MTSASAPSFNPGKGPGSAVGIALLGFGTVGTEVMRLMTEYGDELAHRIGGPLEVRGIAVSDISKPREGVAPELLTEDAFA
LIEREDVDIVVEVIGGIEYPREVVLAALKAGKSVVTANKALVAAHSAELADAAEAANVDLYFEAAVAGAIPVVGPLRRSL
AGDQIQSVMGIVNGTTNFILDAMDSTGADYADSLAEATRLGYAEADPTADVEGHDAASKAAILASIAFHTRVTADDVYCE
GISNISAADIEAAQQAGHTIKLLAICEKFTNKEGKSAISARVHPTLLPVSHPLASVNKSFNAIFVEAEAAGRLMFYGNGA
GGAPTASAVLGDVVGAARNKVHGGRAPGESTYANLPIADFGETTTRYHLDMDVEDRVGVLAELASLFSEQGISLRTIRQE
ERDDDARLIVVTHSALESDLSRTVELLKAKPVVKAINSVIRLERD
>P9WPX1 1.1.1.3~~~hom~~~Homoserine dehydrogenase~~~COG0460
MPGDEKPVGVAVLGLGNVGSEVVRIIENSAEDLAARVGAPLVLRGIGVRRVTTDRGVPIELLTDDIEELVAREDVDIVVE
VMGPVEPSRKAILGALERGKSVVTANKALLATSTGELAQAAESAHVDLYFEAAVAGAIPVIRPLTQSLAGDTVLRVAGIV
NGTTNYILSAMDSTGADYASALADASALGYAEADPTADVEGYDAAAKAAILASIAFHTRVTADDVYREGITKVTPADFGS
AHALGCTIKLLSICERITTDEGSQRVSARVYPALVPLSHPLAAVNGAFNAVVVEAEAAGRLMFYGQGAGGAPTASAVTGD
LVMAARNRVLGSRGPRESKYAQLPVAPMGFIETRYYVSMNVADKPGVLSAVAAEFAKREVSIAEVRQEGVVDEGGRRVGA
RIVVVTHLATDAALSETVDALDDLDVVQGVSSVIRLEGTGL
>Q5F8J4 1.1.1.3~~~~~~NAD(+)-dependent homoserine dehydrogenase~~~
MKPVNIGLLGLGTVGGGAAAVLRDNAEEISRRLGREIRISAMCDLSEEKARQICPSAAFVKDPFELVARKDVDVVVELFG
GTGIAKEAVLKAIENGKHIVTANKKLLAEYGNEIFPLAEKQNVIVQFEAAVAGGIPIIKALREGLAANRIKSIAGIINGT
SNFILSEMREKGSAFADVLKEAQALGYAEADPTFDIEGNDAGHKITIMSALAFGTPMNFSACYLEGISKLDSRDIKYAEE
LGYRIKLLGVTRKTGKGIELRVHPTLIPESRLLANVDGVMNAVRVNADMVGETLYYGAGAGALPTASAVVADIIDIARLV
EADTAHRVPHLAFQPAQVQAQTILPMDEITSSYYLRVQAKDEPGTLGQIAALLAQENVSIEALIQKGVIDQTTAEIVILT
HSTVEKHIKSAIAAIEALDCVEKPITMIRMESLHD
>Q59224 1.4.1.20~~~pdh~~~Phenylalanine dehydrogenase~~~
MSLVEKTSIIKDFTLFEKMSEHEQVVFCNDPATGLRAIIAIHDTTLGPALGGCRMQPYNSVEEALEDALRLSKGMTYKCA
ASDVDFGGGKAVIIGDPQKDKSPELFRAFGQFVDSLGGRFYTGTDMGTNMEDFIHAMKETNCIVGVPEAYGGGGDSSIPT
AMGVLYGIKATNKMLFGKDDLGGVTYAIQGLGKVGYKVAEGLLEEGAHLFVTDINEQTLEAIQEKAKTTSGSVTVVASDE
IYSQEADVFVPCAFGGVVNDETMKQFKVKAIAGSANNQLLTEDHGRHLADKGILYAPDYIVNSGGLIQVADELYEVNKER
VLAKTKHIYDAILEVYQQAELDQITTMEAANRMCEQRMAARGRRNSFFTSSVKPKWDIRN
>F5L9G2 1.4.1.20~~~~~~Phenylalanine dehydrogenase~~~COG0334
MSTVTFDQISEHEQVMFCNDPHTGLKAIIAIHNTTLGPALGGCRMLPYKSEEEALTDVLRLSKGMTYKCVAADVDFGGGK
AVIIGDPRKDKTPELFRAFGQFVQSLNGRFYTGTDMGTTPEDFVQAYKETSFIVGLPEEYGGNGDSSVTTAFGVMQGLRA
VSQFLWGTDVLTERVFAVQGLGKVGFKVAEGLLKEGANVYVTDVDPETIAKLEEKAYQYPGHVQAVTADDIYGVGADVFV
PCAIGGIINDETIERLKVKAVCGAANNQLLEDRHGKVLQAKNILYAPDYIVNAGGLIQVSDELYGPNKARVLKKTRALYD
TLFEIFQSAEKKAVSTVEAANQFVEERLQKRARLNSFFSPDNPPKWRVRR
>P23307 1.4.1.20~~~pdh~~~Phenylalanine dehydrogenase~~~
MAKQLEKSSKIGNEDVFQKIANHEQIVFCNDPVSGLQAIIAIHDTTLGPALGGTRMYPYKNVDEALEDVLRLSEGMTYKC
AAADIDFGGGKAVIIGDPEKDKSPALFRAFGQFVESLNGRFYTGTDMGTTMDDFVHAQKETNFINGIPEQYGGSGDSSIP
TAQGVIYALKATNQYLFGSDSLSGKTYAIQGLGKVGYKVAEQLLKAGADLFVTDIHENVLNSIKQKSEELGGSVTIVKSD
DIYSVQADIFVPCAMGGIINDKTIPKLKVKAVVGSANNQLKDLRHANVLNEKGILYAPDYIVNAGGLIQVADELYGPNKE
RVLLKTKEIYRSLLEIFNQAALDCITTVEAANRKCQKTIEGQQTRNSFFSRGRRPKWNIKE
>Q93NG3 1.14.13.10~~~dhpH~~~2,6-dihydroxypyridine 3-monooxygenase~~~
MSPTTDRIAVVGGSISGLTAALMLRDAGVDVDVYERSPQPLSGFGTGIVVQPELVHYLLEQGVELDSISVPSSSMEYVDA
LTGERVGSVPADWRFTSYDSIYGGLYELFGPERYHTSKCLVGLSQDSETVQMRFSDGTKAEANWVIGADGGASVVRKRLL
GIEPTYAGYVTWRGVLQPGEVADDVWNYFNDKFTYGLLDDGHLIAYPIPGRENAESPRLNFQWYWNVAEGPDLDELMTDV
RGIRLPTSVHNNSLNPHNLRQFHSKGESLFKPFRDLVLNASSPFVTVVADATVDRMVHGRVLLIGDAAVTPRPHAAAGGA
KACDDARTLAEVFTKNHDLRGSLQSWETRQLQQGHAYLNKVKKMASRLQHGGSFEPGNPAFAFGLPKVDEPSVVTNS
>Q59771 1.4.1.20~~~pdh~~~Phenylalanine dehydrogenase~~~
MSIDSALNWDGEMTVTRFDRETGAHFVIRLDSTQLGPAAGGTRAAQYSQLADALTDAGKLAGAMTLKMAVSNLPMGGGKS
VIALPAPRHSIDPSTWARILRIHAENIDKLSGNYWTGPDVNTNSADMDTLNDTTEFVFGRSLERGGAGSSAFTTAVGVFE
AMKATVAHRGLGSLDGLTVLVQGLGAVGGSLASLAAEAGAQLLVADTDTERVAHAVALGHTAVALEDVLSTPCDVFAPCA
MGGVITTEVARTLDCSVVAGAANNVIADEAASDILHARGILYAPDFVANAGGAIHLVGREVLGWSESVVHERAVAIGDTL
NQVFEISDNDGVTPDEAARTLAGRRAREASTTTATA
>P22823 1.4.1.20~~~pdh~~~Phenylalanine dehydrogenase~~~
MRDVFEMMDRYGHEQVIFCRHPQTGLKAIIALHNTTAGPALGGCRMIPYASTDEALEDVLRLSKGMTYKCSLADVDFGGG
KMVIIGDPKKDKSPELFRVIGRFVGGLNGRFYTGTDMGTNPEDFVHAARESKSFAGLPKSYGGKGDTSIPTALGVFHGMR
ATARFLWGTDQLKGRVVAIQGVGKVGERLLQLLVEVGAYCKIADIDSVRCEQLKEKYGDKVQLVDVNRIHKESCDIFSPC
AKGGVVNDDTIDEFRCLAIVGSANNQLVEDRHGALLQKRSICYAPDYLVNAGGLIQVADELEGFHEERVLAKTEAIYDMV
LDIFHRAKNENITTCEAADRIVMERLKKLTDIRRILLEDPRNSARR
>Q93NG6 3.7.1.19~~~~~~2,6-dihydropseudooxynicotine hydrolase~~~
MTVTSQVKPEDEMLNWGRLILDGVSYSDMVGARDRPKEITWFDYWMSLANEYEQEAERKVALGHDLSAGELLMSAALCAQ
YAQFLWFDERRQKGQARKVELYQKAAPLLSPPAERHELVVDGIPMPVYVRIPEGPGPHPAVIMLGGLESTKEESFQMENL
VLDRGMATATFDGPGQGEMFEYKRIAGDYEKYTSAVVDLLTKLEAIRNDAIGVLGRSLGGNYALKSAACEPRLAACISWG
GFSDLDYWDLETPLTKESWKYVSKVDTLEEARLHVHAALETRDVLSQIACPTYILHGVHDEVPLSFVDTVLELVPAEHLN
LVVEKDGDHCCHNLGIRPRLEMADWLYDVLVAGKKVAPTMKGWPLNG
>O25830 ~~~folP~~~Bifunctional dihydropteroate synthase/dihydropteroate reductase~~~COG0294
MIVKRLNPDALKNALQKIGPEKIAQDRMHQKGVSFVFEIQHLPLSATLILKQEAISVGGDFATPRDCILAKEPFYDGVLI
ASAKQLERLIVKCHSQPFGLKHLAQELKSHLKAPKPNTPQIMAVLNLTPDSFYEKSRFDSKKALEEIYQWLEKGITLIDI
GAASSRPESEIIDPKIEQDRLKEILLEIKSQKLYQCAKFSIDTYHATTAQMALEHYFSILNDVSGFNSAEMLEVAKDYKP
TCILMHTQKTPKDMQENVFYHNLFDEMDRFFKEKLEVLEKYVLQDIILDIGFGFAKLKEHNLALIKHLSHFLKFKKPLLV
GASRKNTIGLITGREVQDRLAGTLSLHLMALQNGASVLRVHDIDEHIDLIKVFKSLEETD
>P0C0X1 2.5.1.15~~~folP1~~~Dihydropteroate synthase~~~COG0294
MSLAPVQVIGVLNVTDNSFSDGGRYLDPDDAVQHGLAMVAEGAAIVDVGGESTRPGAIRTDPRVELSRIVPVVKELAAQG
ITVSIDTTRADVARAALQSGARIVNDVSGGRADPAMAPLVAEAGVAWVLMHWRLMSAERPYEAPNYRDVVAEVRADLLAG
VDQAVAAGVDPGSLVIDPGLGFAKTGQHNWALLNALPELVATGVPILLGASRKRFLGRLLAGADGAVRPPDGRETATAVI
SALAALHGAWGVRVHDVRASVDALKVVGAWLHAGPQIEKVRCDG
>P9WND1 2.5.1.15~~~folP1~~~Dihydropteroate synthase~~~COG0294
MSPAPVQVMGVLNVTDDSFSDGGCYLDLDDAVKHGLAMAAAGAGIVDVGGESSRPGATRVDPAVETSRVIPVVKELAAQG
ITVSIDTMRADVARAALQNGAQMVNDVSGGRADPAMGPLLAEADVPWVLMHWRAVSADTPHVPVRYGNVVAEVRADLLAS
VADAVAAGVDPARLVLDPGLGFAKTAQHNWAILHALPELVATGIPVLVGASRKRFLGALLAGPDGVMRPTDGRDTATAVI
SALAALHGAWGVRVHDVRASVDAIKVVEAWMGAERIERDG
>P9WNC9 ~~~folP2~~~Inactive dihydropteroate synthase 2~~~COG0294
MRSTPPASAGRSTPPALAGHSTPPALAGHSTLCGRPVAGDRALIMAIVNRTPDSFYDKGATFSDAAARDAVHRAVADGAD
VIDVGGVKAGPGERVDVDTEITRLVPFIEWLRGAYPDQLISVDTWRAQVAKAACAAGADLINDTWGGVDPAMPEVAAEFG
AGLVCAHTGGALPRTRPFRVSYGTTTRGVVDAVISQVTAAAERAVAAGVAREKVLIDPAHDFGKNTFHGLLLLRHVADLV
MTGWPVLMALSNKDVVGETLGVDLTERLEGTLAATALAAAAGARMFRVHEVAATRRVLEMVASIQGVRPPTRTVRGLA
>P0AC13 2.5.1.15~~~folP~~~Dihydropteroate synthase~~~COG0294
MKLFAQGTSLDLSHPHVMGILNVTPDSFSDGGTHNSLIDAVKHANLMINAGATIIDVGGESTRPGAAEVSVEEELQRVIP
VVEAIAQRFEVWISVDTSKPEVIRESAKVGAHIINDIRSLSEPGALEAAAETGLPVCLMHMQGNPKTMQEAPKYDDVFAE
VNRYFIEQIARCEQAGIAKEKLLLDPGFGFGKNLSHNYSLLARLAEFHHFNLPLLVGMSRKSMIGQLLNVGPSERLSGSL
ACAVIAAMQGAHIIRVHDVKETVEAMRVVEATLSAKENKRYE
>P64141 2.5.1.15~~~folP~~~Dihydropteroate synthase~~~
MTKTKIMGILNVTPDSFSDGGKFNNVETAINRVKAMIDEGADIIDVGGVSTRPGHEMVTLEEELNRVLPVVEAIVGFDVK
ISVDTFRSEVAEACLKLGVDMINDQWAGLYDHRMFQIVAKYDAEIILMHNGNGNRDEPVVEEMLTSLLAQAHQAKIAGIP
SNKIWLDPGIGFAKTRNEEAEVMARLDELVATEYPVLLATSRKRFTKEMMGYDTTPVERDEVTAATTAYGIMKGVRAVRV
HNVELNAKLAKGIDFLKENENARHNLS
>O05701 2.5.1.15~~~folP~~~Dihydropteroate synthase~~~
MTKTKIMGILNVTPDSFSDGGKFNNVESAVTRVKAMMDEGADIIDVGGVSTRPGHEMITVEEELNRVLPVVEAIVGFDVK
ISVDTFRSEVAEACLKLGVDIINDQWAGLYDHRMFQVVAKYDAEIVLMHNGNGNRDEPVVEEMLTSLLAQAHQAKIAGIP
SNKIWLDPGIGFAKTRNEEAEVMARLDELVATEYPVLLATSRKRFTKEMMGYDTTPVERDEVTAATTAYGIMKGVRAVRV
HNVELNAKLAKGIDFLKENENARHNFS
>Q5XCA8 2.5.1.15~~~folP~~~Dihydropteroate synthase~~~
MKIGKFVIEGNAAIMGILNVTPDSFSDGGSYTTVQKALDHVEQMIADGAKIIDVGGESTRPGCQFVSATDEIDRVVPVIK
AIKENYDILISIDTYKTETARAALEAGADILNDVWAGLYDGQMFALAAEYDAPIILMHNQDEEVYQEVTQDVCDFLGNRA
QAALDAGVPKNNIWIDPGFGFAKSVQQNMELLKGLDRVCQLGYPVLFGISRKRVVDALLGGNTKAKERDGATAALSAYAL
GKGCQIVRVHDVKANQDIVAVLSQLM
>P05382 2.5.1.15~~~sulA~~~Dihydropteroate synthase~~~COG0294
MSSKANHAKTVICGIINVTPDSFSDGGQFFALEQALQQARKLIAEGASMLDIGGESTRPGSSYVEIEEEIQRVVPVIKAI
RKESDVLISIDTWKSQVAEAALAAGADLVNDITGLMGDEKMAYVVAEARAKVVIMFNPVMARPQHPSSLIFPHFGFGQTF
TEKELADFETLPIEDLMVAFFERALARAAEAGIAPENILLDPGIGFGLTKKENLLLLRDLDKLHQKGYPIFLGVSRKRFV
INILEENGFEVNPETELGFRNRDTASAHVTSIAARQGVEVVRVHDVASHRMAVEIASAIRLADEAENLDLKQYK
>P59655 2.5.1.15~~~sulA~~~Dihydropteroate synthase~~~COG0294
MSSKANHAKTVICGIINVTPDSFSDGGQFFALEQALQQARKLIAEGASMLDIGGESTRPGSSYVEIEEEIQRVVPVIKAI
RKESDVLISIDTWKSQVAEAALAAGADLVNDITGLMGDEKMPHVVAEARAQVVIMFNPVMARPQHPSSLIFPHFGFGQAF
TEEELADFETLPIEELMEAFFERALARAAEAGIAPENILLDPGIGFGLTKKENLLLLRDLDKLHQKGYPIFLGVSRKRFV
INILEENGFEVNPETELGFRNRDTASAHVTSIAARQGVEVVRVHDVASHRMAVEIASAIRLADEAENLDLKQYK
>P08064 ~~~sdhC~~~Succinate dehydrogenase cytochrome b558 subunit~~~COG2009
MSGNREFYFRRLHSLLGVIPVGIFLIQHLVVNQFAARGAEAFNSAAHFMDSLPFRYALEIFIIFLPLIYHAVYGVYIAFT
AKNNAGQYSYMRNWLFVLQRVTGIITLIFVSWHVWETRIAAQMGAEVNFDMMANILSSPAMLGFYIVGVLSTIFHFSNGL
WSFAVTWGITVTPRSQRISTYVTLIIFVALSYVGLKAIFAFV
>P69054 ~~~sdhC~~~Succinate dehydrogenase cytochrome b556 subunit~~~COG2009
MIRNVKKQRPVNLDLQTIRFPITAIASILHRVSGVITFVAVGILLWLLGTSLSSPEGFEQASAIMGSFFVKFIMWGILTA
LAYHVVVGIRHMMMDFGYLEETFEAGKRSAKISFVITVVLSLLAGVLVW
>P51057 ~~~sdhD~~~Succinate dehydrogenase hydrophobic membrane anchor subunit~~~COG2142
MDMVDRTSRRGYRDWFVQRITALLSGIYAVFVIVFLLVHHPISYPQWHALFSHLIMKIFTLIVIFSILWHAWIGMWTIFT
DYVKNKPIRLALETLVCLLLVGYFVWAIEFLWIAR
>P0AC44 ~~~sdhD~~~Succinate dehydrogenase hydrophobic membrane anchor subunit~~~COG2142
MVSNASALGRNGVHDFILVRATAIVLTLYIIYMVGFFATSGELTYEVWIGFFASAFTKVFTLLALFSILIHAWIGMWQVL
TDYVKPLALRLMLQLVIVVALVVYVIYGFVVVWGV
>Q06004 1.1.1.-~~~gutB~~~Sorbitol dehydrogenase~~~COG1063
MTHTVPQNMKAAVMHNTREIKIETLPVPDINHDEVLIKVMAVGICGSDLHYYTNGRIGNYVVEKPFILGHECAGEIAAVG
SSVDQFKVGDRVAVEPGVTCGRCEACKEGRYNLCPDVQFLATPPVDGAFVQYIKMRQDFVFLIPDSLSYEEAALIEPFSV
GIHAAARTKLQPGSTIAIMGMGPVGLMAVAAAKAFGAGTIIVTDLEPLRLEAAKKMGATHIINIREQDALEEIKTITNDR
GVDVAWETAGNPAALQSALASVRRGGKLAIVGLPSQNEIPLNVPFIADNEIDIYGIFRYANTYPKGIEFLASGIVDTKHL
VTDQYSLEQTQDAMERALQFKNECLKVMVYPNR
>P16421 1.12.-.-~~~~~~Soluble hydrogenase 42 kDa subunit~~~
MDDKLMLMIPGPTPVPEAALLALAKHPIGHRTSEFSNMMGEVTQNLKWLHQTESDVLMLNVSGTGAVEAGMINFLSPGDR
ILVGSNGKFGERWVEVGQAFGLNVEAITAEWGQPLDPDKFAQKLQADTNKEIKAVIITHSETSTGVINDLVAINSHVKEH
GQALIIVDAVTSLGAYNVPVDALGLDVVASGSQKGYMIPPGLGFVSVSPKAWEAYKTAKLPKYYLDLGKYRKATAKNTTP
FTPPVNLMVALHTTLGMMKKEGLESIFTRHERQKNATRAAMKALNLPLFAADECASPAITAVATPGMEADKIRSLMKKRF
DIALAGGQDHLSNKIFRVGHLGFVSDRDILSCIASLEVVLLELGHENFNSGAGVAAAARVFSN
>Q06530 1.8.2.3~~~fccB~~~Sulfide dehydrogenase [flavocytochrome c] flavoprotein chain~~~COG0446
MTLNRRDFIKTSGAAVAAVGILGFPHLAFGAGRKVVVVGGGTGGATAAKYIKLADPSIEVTLIEPNTDYYTCYLSNEVIG
GDRKLESIKHGYDGLRAHGIQVVHDSATGIDPDKKLVKTAGGAEFGYDRCVVAPGIELIYDKIEGYSEEAAAKLPHAWKA
GEQTAILRKQLEDMADGGTVVIAPPAAPFRCPPGPYERASQVAYYLKAHKPKSKVIILDSSQTFSKQSQFSKGWERLYGF
GTENAMIEWHPGPDSAVVKVDGGEMMVETAFGDEFKADVINLIPPQRAGKIAQIAGLTNDAGWCPVDIKTFESSIHKGIH
VIGDACIANPMPKSGYSANSQGKVAAAAVVALLKGEEPGTPSYLNTCYSILAPAYGISVAAIYRPNADGSAIESVPDSGG
VTPVDAPDWVLEREVQYAYSWYNNIVHDTFG
>P16099 1.5.8.2~~~tmd~~~Trimethylamine dehydrogenase~~~
MARDPKHDILFEPIQIGPKTLRNRFYQVPHCIGAGSDKPGFQSAHRSVKAEGGWAALNTEYCSINPESDDTHRLSARIWD
EGDVRNLKAMTDEVHKYGALAGVELWYGGAHAPNMESRATPRGPSQYASEFETLSYCKEMDLSDIAQVQQFYVDAAKRSR
DAGFDIVYVYGAHSYLPLQFLNPYYNKRTDKYGGSLENRARFWLETLEKVKHAVGSDCAIATRFGVDTVYGPGQIEAEVD
GQKFVEMADSLVDMWDITIGDIAEWGEDAGPSRFYQQGHTIPWVKLVKQVSKKPVLGVGRYTDPEKMIEIVTKGYADIIG
CARPSIADPFLPQKVEQGRYDDIRVCIGCNVCISRWEIGGPPMICTQNATAGEEYRRGWHPEKFRQTKNKDSVLIVGAGP
SGSEAARVLMESGYTVHLTDTAEKIGGHLNQVAALPGLGEWSYHRDYRETQITKLLKKNKESQLALGQKPMTADDVLQYG
ADKVIIATGARWNTDGTNCLTHDPIPGADASLPDQLTPEQVMDGKKKIGKRVVILNADTYFMAPSLAEKLATAGHEVTIV
SGVHLANYMHFTLEYPNMMRRLHELHVEELGDHFCSRIEPGRMEIYNIWGDGSKRTYRGPGVSPRDANTSHRWIEFDSLV
LVTGRHSECTLWNELKARESEWAENDIKGIYLIGDAEAPRLIADATFTGHRVAREIEEANPQIAIPYKRETIAWGTPHMP
GGNFKIEYKV
>P66817 ~~~diaA~~~DnaA initiator-associating protein DiaA~~~COG0279
MQERIKACFTESIQTQIAAAEALPDAISRAAMTLVQSLLNGNKILCCGNGTSAANAQHFAASMINRFETERPSLPAIALN
TDNVVLTAIANDRLHDEVYAKQVRALGHAGDVLLAISTRGNSRDIVKAVEAAVTRDMTIVALTGYDGGELAGLLGPQDVE
IRIPSHRSARIQEMHMLTVNCLCDLIDNTLFPHQDD
>Q2FWX9 1.2.1.-~~~aldH1~~~4,4'-diaponeurosporen-aldehyde dehydrogenase~~~COG1012
MNIIEQKFYDSKAFFNTQQTKDISFRKEQLKKLSKAIKSYESDILEALYTDLGKNKVEAYATEIGITLKSIKIARKELKN
WTKTKNVDTPLYLFPTKSYIKKEPYGTVLIIAPFNYPFQLVFEPLIGAIAAGNTAIIKPSELTPNVARVIKRLINETFDA
NYIEVIEGGIEETQTLIHLPFDYVFFTGSENVGKIVYQAASENLVPVTLEMGGKSPVIVDETANIKVASERICFGKFTNA
GQTCVAPDYILVHESVKDDLITALSKTLREFYGQNIQQSPDYGRIVNLKHYHRLTSLLNSAQMNIVFGGHSDEDERYIEP
TLLDHVTSDSAIMQEEIFGPILPILTYQSLDEAIAFIHQRPKPLSLYLFSEDENATQRVINELSFGGGAINDTLMHLANP
KLPFGGVGASGMGRYHGKYSFDTFTHEKSYIFKSTRLESGVHLPPYKGKFKYIKAFFKN
>P80702 1.1.1.50~~~hsdA~~~3-alpha-hydroxysteroid dehydrogenase/carbonyl reductase~~~
MSIIVISGCATGIGAATRKVLEAAGHQIVGIDIRDAEVIADLSTAEGRKQAIADVLAKCSKGMDGLVLCAGLGPQTKVLG
NVVSVNYFGATELMDAFLPALKKGHQPAAVVISSVASAHLAFDKNPLALALEAGEEAKARAIVEHAGEQGGNLAYAGSKN
ALTVAVRKRAAAWGEAGVRLNTIAPGATETPLLQAGLQDPRYGESIAKFVPPMGRRAEPSEMASVIAFLMSPAASYVHGA
QIVIDGGIDAVMRPTQF
>P64426 3.2.1.-~~~digH~~~Glycosyl hydrolase DigH~~~COG1649
MDICSRNKKLTIRRPAILVALALLLCSCKSTPPESMVTPPAGSKPPATTQQSSQPMRGIWLATVSRLDWPPVSSVNISNP
TSRARVQQQAMIDKLDHLQRLGINTVFFQVKPDGTALWPSKILPWSDLMTGKIGENPGYDPLQFMLDEAHKRGMKVHAWF
NPYRVSVNTKPGTIRELNSTLSQQPASVYVQHRDWIRTSGDRFVLDPGIPEVQDWITSIVAEVVSRYPVDGVQFDDYFYT
ESPGSRLNDNETYRKYGGAFASKADWRRNNTQQLIAKVSHTIKSIKPGVEFGVSPAGVWRNRSHDPLGSDTRGAAAYDES
YADTRRWVEQGLLDYIAPQIYWPFSRSAARYDVLAKWWADVVKPTRTRLYIGIAFYKVGEPSKIEPDWMINGGVPELKKQ
LDLNDAVPEISGTILFREDYLNKPQTQQAVSYLQSRWGS
>P28303 ~~~dinF~~~DNA damage-inducible protein F~~~COG0534
MPPGVAVCFSSLFIRLVCMAFLTSSDKALWHLALPMIFSNITVPLLGLVDTAVIGHLDSPVYLGGVAVGATATSFLFMLL
LFLRMSTTGLTAQAYGAKNPQALARTLVQPLLLALGAGALIALLRTPIIDLALHIVGGSEAVLEQARRFLEIRWLSAPAS
LANLVLLGWLLGVQYARAPVILLVVGNILNIVLDVWLVMGLHMNVQGAALATVIAEYATLLIGLLMVRKILKLRGISGEM
LKTAWRGNFRRLLALNRDIMLRSLLLQLCFGAITVLGARLGSDIIAVNAVLMTLLTFTAYALDGFAYAVEAHSGQAYGAR
DGSQLLDVWRAACRQSGIVALLFSVVYLLAGEHIIALLTSLTQIQQLADRYLIWQVILPVVGVWCYLLDGMFIGATRATE
MRNSMAVAAAGFALTLLTLPWLGNHALWLALTVFLALRGLSLAAIWRRHWRNGTWFAAT
>P27296 3.6.4.12~~~dinG~~~ATP-dependent DNA helicase DinG~~~COG1199
MALTAALKAQIAAWYKALQEQIPDFIPRAPQRQMIADVAKTLAGEEGRHLAIEAPTGVGKTLSYLIPGIAIAREEQKTLV
VSTANVALQDQIYSKDLPLLKKIIPDLKFTAAFGRGRYVCPRNLTALASTEPTQQDLLAFLDDELTPNNQEEQKRCAKLK
GDLDTYKWDGLRDHTDIAIDDDLWRRLSTDKASCLNRNCYYYRECPFFVARREIQEAEVVVANHALVMAAMESEAVLPDP
KNLLLVLDEGHHLPDVARDALEMSAEITAPWYRLQLDLFTKLVATCMEQFRPKTIPPLAIPERLNAHCEELYELIASLNN
ILNLYMPAGQEAEHRFAMGELPDEVLEICQRLAKLTEMLRGLAELFLNDLSEKTGSHDIVRLHRLILQMNRALGMFEAQS
KLWRLASLAQSSGAPVTKWATREEREGQLHLWFHCVGIRVSDQLERLLWRSIPHIIVTSATLRSLNSFSRLQEMSGLKEK
AGDRFVALDSPFNHCEQGKIVIPRMRVEPSIDNEEQHIAEMAAFFRKQVESKKHLGMLVLFASGRAMQRFLDYVTDLRLM
LLVQGDQPRYRLVELHRKRVANGERSVLVGLQSFAEGLDLKGDLLSQVHIHKIAFPPIDSPVVITEGEWLKSLNRYPFEV
QSLPSASFNLIQQVGRLIRSHGCWGEVVIYDKRLLTKNYGKRLLDALPVFPIEQPEVPEGIVKKKEKTKSPRRRRR
>P9WMR5 3.6.4.12~~~dinG~~~Probable ATP-dependent helicase DinG homolog~~~COG1199
MSESVSMSVPELLAIAVAALGGTRRRGQQEMAAAVAHAFETGEHLVVQAGTGTGKSLAYLVPAIIRALCDDAPVVVSTAT
IALQRQLVDRDLPQLVDSLTNALPRRPKFALLKGRRNYLCLNKIHNSVTASDHDDERPQEELFDPVAVTALGRDVQRLTA
WASTTVSGDRDDLKPGVGDRSWSQVSVSARECLGVARCPFGSECFSERARGAAGLADVVVTNHALLAIDAVAESAVLPEH
RLLVVDEAHELADRVTSVAAAELTSATLGMAARRITRLVDPKVTQRLQAASATFSSAIHDARPGRIDCLDDEMATYLSAL
RDAASAARSAIDTGSDTTTASVRAEAGAVLTEISDTASRILASFAPAIPDRSDVVWLEHEDNHESARAVLRVAPLSVAEL
LATQVFARATTVLTSATLTIGGSFDAMATAWGLTADTPWRGLDVGSPFQHAKSGILYVAAHLPPPGRDGSGSAEQLTEIA
ELITAAGGRTLGLFSSMRAARAATEAMRERLSTPVLCQGDDSTSTLVEKFTADAATSLFGTLSLWQGVDVPGPSLSLVLI
DRIPFPRPDDPLLSARQRAVAARGGNGFMTVAASHAALLLAQGSGRLLRRVTDRGVVAVLDSRMATARYGEFLRASLPPF
WQTTNATQVRAALRRLARADAKAH
>Q2FYH5 3.1.-.-~~~dinG~~~3'-5' exonuclease DinG~~~COG0847
MGMATYAVVDLETTGNQLDFDDIIQIGITFVRNNQIIDTYHSMIRTNLEIPPFIQALTSIEENMLQQAPYFNQVAQEIYD
KIKDCIFVAHNVDFDLNFIKKAFKDCNIQYRPKKVIDTLEIFKIAFPTDKSYQLSELAEAHGITLANAHRADEDAATTAK
LMILAFEKFEKLPLDTLKQLYYLSKQLKYDLYDIFFEMVRQYDAKPLDKSYEKFEQIIYRKQVDFKKPTTNYNGSLKSLY
SKAVDQLGLTYRPQQLYLAETILDQLMHSEKAMIEASLGSGKSLAYLLAALMYNIETGKHVMISTNTKLLQSQLLEKDIP
AMNEALNFKINALLIKSKSDYISLGLISQILKDDTSNYEVNILKMQLLIWITETPSGDIQELNLKGGQKMYFDQKIETYV
PARHDVHYYNFIKRNAQNIQIGITNHAHLIHSDVENSIYQLFDDCIVDEAHRLPDYALNQVTNELSYADIKYQLGLIGKN
ENEKLLKAIDQLEKQRILEKLDIAPIDIFGLKASMNEIHELNEQLFSTIFTIINDSDVYDDDIHRFHNVFTFETKDILKD
LHAIIDKLNKTLEIFNGISHKTVKSLRKQLLYLKDKFKNIEQSLKAGHTSFISIKNLSQKSTIRLYVKDYAVKDVLTKQV
LEKFKSLIFISGTLKFNHSFEAFKQLFNKDVHFNTFEVNTSLQSAKNTSVFIPSDVASYQYKNIDEYVASIVSYIIEYTT
ITSSKCLVLFTSYKMMHMVQDMLNELPEFEDYVVLTQQQNQNYKIVQQFNNFDKAILLGTSTFFEGFDFQANGIKCVMIA
KLPFMNKHNAKYWLMDSEFTSTFKEYVLPDAVTRFRQGLGRLIRNENDRGIIVSFDDRLINSNYKNFFEQTLENYRQKKG
DIQQFGKLLRQIQKKKK
>Q7A5K4 3.1.-.-~~~dinG~~~3'-5' exonuclease DinG~~~
MGMATYAVVDLETTGNQLDFDDIIQIGITFVRNNQIIDTYHSMIRTNLEIPPFIQALTSIEENMLQQAPYFNQVAQEIYD
KIKDCIFVAHNVDFDLNFIKKAFKDCNIQYRPKKVIDTLEIFKIAFPTDKSYQLSELAEAHGITLANAHRADEDAATTAK
LMILAFEKFEKLPLDTLKQLYYLSKQLKYDLYDIFFEMVRQYDAKPLDKSYEKFEQIIYRKQVDFKKPTTNYNGSLKSLY
SKAVDQLGLTYRPQQLYLAETILDQLMHSEKAMIEASLGSGKSLAYLLAALMYNIETGKHVMISTNTKLLQSQLLEKDIP
AMNEALNFKINALLIKSKSDYISLGLISQILKDDTSNYEVNILKMQLLIWITETPSGDIQELNLKGGQKMYFDQKIETYV
PARHDVHYYNFIKRNAQNIQIGITNHAHLIHSDVENSIYQLFDDCIVDEAHRLPDYALNQVTNELSYADIKYQLGLIGKN
ENEKLLKAIDQLEKQRILEKLDIAPIDIFGLKASMNEIHELNEQLFSTIFTIINDSDVYDDDIHRFHNVFTFETKDILKD
LHAIIDKLNKTLEIFNGISHKTVKSLRKQLLYLKDKFKNIEQSLKAGHTSFISIKNLSQKSTIRLYVKDYAVKDVLTKQV
LEKFKSLIFISGTLKFNHSFEAFKQLFNKDVHFNTFEVNTSLQSAKNTSVFIPSDVASYQYKNIDEYVASIVSYIIEYTT
ITSSKCLVLFTSYKMMHMVQDMLNELPEFEDYVVLTQQQNQNYKIVQQFNNFDKAILLGTSTFFEGFDFQANGIKCVMIA
KLPFMNKHNAKYWLMDSEFTSTFKEYVLPDAVTRFRQGLGRLIRNENDRGIIVSFDDRLINSNYKNFFEQTLENYRQKKG
DIQQFGKLLRQIQKKKK
>Q6GGV4 3.1.-.-~~~dinG~~~3'-5' exonuclease DinG~~~
MGMATYAVVDLETTGNQLDFDDIIQIGITFVRNNQIIDTYHSMIRTNLEIPPFIQALTSIEENMLQQAPYFNQVAEEIYD
KIKDCIFVAHNVDFDLNFIKKAFKDCNIQYRPKKVIDTLEIFKIAFPTDKSYQLSELAEAHGITLANAHRADEDAATTAK
LMILAFEKFEKLPLDTLKQLYYLSKQLKYDLYDIFFEMVRQYDAKPLDKSYEKFEQIIYRKQVDFKKPTTNYNGSLKSLY
SKAVDQLGLTYRPQQLYLAETILDQLMHSEKAMIEASLGSGKSLAYLLAALMYNIETGKHVMISTNTKLLQSQLLEKDIP
AMNEALNFKINALLIKSKSDYISLGLISQILKDDTSNYEVNILKMQLLIWITETPSGDIQELNLKGGQKMYFDQKIETYV
PARHDVHYYNFIKRNAQNIQIGITNHAHLIHSDVENSIYQLFDDCIVDEAHRLPDYALNQVTNELSYADIKYQLGLIGKN
ENEKLLKAIDQLEKQRILEKLDIAPIDIFGLKASMNEIHELNEQLFSTIFTIINDSDVYDDDIHRFHNVFTFETKDILKD
LHAIIDKLNKTLEIFNGISHKTVKSLRKQLLYLKDKFKNIEQSLKAGHTSFISIKNLSQKSTIRLYVKDYAVKDVLTKQV
LEKFKSLIFISGTLKFNHSFDAFKQLFNKDVHFNTFEVNTSLQSAKNTSVFIPSDVASYQYKNIDEYVASIVSYIIEYTT
ITSSKCLVLFTSYKMMHMVQDMLNELPEFEDYVVLTQQQNQNYKIVQQFNNFDKAILLGTSTFFEGFDFQANGIKCVMIA
KLPFMNKHNAKYWLMDSEFTSTFKEYVLPDAVTRFRQGLGRLIRNENDRGIIVSFDDRLINSNYKNFFEQTLENYRQKKG
DIQQFGKLLRQIQKKKK
>P0ABR1 ~~~dinI~~~DNA damage-inducible protein I~~~
MRIEVTIAKTSPLPAGAIDALAGELSRRIQYAFPDNEGHVSVRYAAANNLSVIGATKEDKQRISEILQETWESADDWFVS
E
>A0A140ND86 ~~~dinJ~~~Antitoxin DinJ~~~COG3077
MAANAFVRARIDEDLKNQAADVLAGMGLTISDLVRITLTKVAREKALPFDLREPNQLTIQSIKNSEAGVDVHKAKDADDL
FDKLGI
>Q47150 ~~~dinJ~~~Antitoxin DinJ~~~COG3077
MAANAFVRARIDEDLKNQAADVLAGMGLTISDLVRITLTKVAREKALPFDLREPNQLTIQSIKNSEAGIDVHKAKDADDL
FDKLGI
>A5A624 ~~~dinQ~~~Uncharacterized protein DinQ~~~
MIDKAIIVLGALIALLELIRFLLQLLN
>Q9HVS1 ~~~~~~Dipeptide-binding protein~~~
MRKILPLRAWLAAGLILGSPFSHAASNLVFCSEGSPAGFDPAQYTTGTDYDATSVTLFNRLVQFERGGTRAIPALAESWD
IGDDGKTYTFHLRKGVKFHSTDYFKPTREFNADDVLFTFQRMLDKNHPFRKAYPTEFPYFTDMGLDKNIARVEKLDEHRV
KFTLNEVDAAFIQNLAMDVASIQSAEYAGQLLEAGKPQQINQKPIGTGPFILSRYQKDAQIRFKGNKDYWKPEDVKIDNL
IFSINTDAAVRAQKLKAGECQITLNPRPADLKALQEAANLKVPSQPGFNLGYIAYNVTHKPFDQLEVRQALDMAVNKQAI
IDAVYQGAGQLAVNGMPPTQWSYDETIKDAPFDPAKARELLKKAGVAEGTEITLWAMPVQRPYNPNAKLMAEMIQADWAK
IGIKARIVSYEWGEYIKRAHAGEHDAMLFGWTGDNGDPDNWLATLYGCDSINGNNVSKWCDAAYDKLVKAAKRVSDQDKR
SELYKQAQHILKEQVPITPIAHSTVYQPMSKSVHGFKISPFSRNAFYGVANQP
>O67379 ~~~spsI~~~Bifunctional IPC transferase and DIPP synthase~~~COG0558
MVETAVILAGGEGNRLKPLTEEVPKALLKVAGRELLYRTIKQLQDVGVKNFVIVVNKKFEGKVKAFLKEHNFEAEVIPNE
HPEKENGYSLYLAKGRIKGEFAVVMSDHIYEKAFLEKAVEGKGLIVDRLGLYINKNEATKVKCEEGRIKYIGKNLEKYDG
FDTGFFVLDESIFEVAEEALKEQKKLTMSELAKRAQIPCTEVSGYFWMDVDTPEDVEKAKKYLVKTAIKGVGDGFISRNL
NRKVSTRISPYLVDKFTPNQLTVLTFLLGMFSALVAYFSPALGGILLQINSMLDGLDGEVARAQMRTTKFGAWLDSVLDR
YVDFAFLSALAMHLKPSWDFMPWVFAALFGSVMVSYSTERYKGAYCEDAYAVIKELRYLLGKRDERIFMIMIFTILGWIK
ALFVVLAIITNLRVILTIYLVWKKKGNV
>Q1AWQ0 ~~~~~~Bifunctional IPC transferase and DIPP synthase~~~COG0558
MPDERTTGREGVGAAVLAAGFGERLRECGRPKPLARVAGLTLLERTVRTLRAGGLEGEIVVVVGHRGEEVAGHCKARGLP
VRVVENPDYPRGNGTSVLAAMRFLPERFVVAMVDHIHTPESVRRLLRCEGDFVAAVDTRPVYADPGEATRVRLEGGRVVE
FGKNLPRYDGLDAGLFLCSRPALERLREASGGERLSWNDLKRAWLASGGEVVACDLAGAPWTDVDTPQDLRLSEEMVLGW
AASGNDGPVSRHINRRISRRITRRLLDTPLSPDQVSLLSFALAALGAGLLAAGRLRLGGALVQLASIVDGCDGELARARL
ESSPRGAVFDATLDRWADALIISGLALGAGTRLAAAAGYPALAGALLVSYTRARWEAALGRMPSRFTGLGATRDVRLAVL
ALGGLLGAPGAALLATGALGNAEALRRLLALKRGRS
>P9WG63 ~~~dipZ~~~Protein DipZ~~~COG0526
MVESRRAAAAASAYASRCGIAPATSQRSLATPPTISVPSGEGRCRCHVARGAGRDPRRRLRRRRWCGRCGYHSHLTGGEF
DVNRLCQQRSRERSCQLVAVPADPRPKRQRITDVLTLALVGFLGGLITGISPCILPVLPVIFFSGAQSVDAAQVAKPEGA
VAVRRKRALSATLRPYRVIGGLVLSFGMVTLLGSALLSVLHLPQDAIRWAALVALVAIGAGLIFPRFEQLLEKPFSRIPQ
KQIVTRSNGFGLGLALGVLYVPCAGPILAAIVVAGATATIGLGTVVLTATFALGAALPLLFFALAGQRIAERVGAFRRRQ
REIRIATGSVTILLAVALVFDLPAALQRAIPDYTASLQQQISTGTEIREQLNLGGIVNAQNAQLSNCSDGAAQLESCGTA
PDLKGITGWLNTPGNKPIDLKSLRGKVVLIDFWAYSCINCQRAIPHVVGWYQAYKDSGLAVIGVHTPEYAFEKVPGNVAK
GAANLGISYPIALDNNYATWTNYRNRYWPAEYLIDATGTVRHIKFGEGDYNVTETLVRQLLNDAKPGVKLPQPSSTTTPD
LTPRAALTPETYFGVGKVVNYGGGGAYDEGSAVFDYPPSLAANSFALRGRWALDYQGATSDGNDAAIKLNYHAKDVYIVV
GGTGTLTVVRDGKPATLPISGPPTTHQVVAGYRLASETLEVRPSKGLQVFSFTYG
>A0R8F5 2.7.7.85~~~disA~~~DNA integrity scanning protein DisA~~~
MEENKQRVKSMINILQLVAPGTPLREGIDNVLRAQTGGLIVLGYNEQIKSIVDGGFHINCAFSPASLYELAKMDGALILN
ETGSKILIANAQLVPESSIDSIETGMRHRTAERVAKQTGSLVVAISQRRNVITLYQGNLRYTLKDIGVILTKANQAIQTL
EKYKAVWNDGITNLGILEFEEVVTMSEVVHVLHSVEMVLRIKNEILSYIHELGTEGRLIRLQLTELLADLEAEAALLIKD
YYQEKTQDHHQILKKLQELANTQLLEDSDLVKLLGYPGQTSLEESVTPRGYRITSKISRVPPLIIENLINRFKTLQGVCR
ATINELDDVEGIGEVRAKKIREGLKRIQEHLYMSRHN
>Q6HPT4 2.7.7.85~~~disA~~~DNA integrity scanning protein DisA~~~
MEENKQRVKSMINILQLVAPGTPLREGIDNVLRAQTGGLIVLGYNEQIKSIVDGGFHINCAFSPASLYELAKMDGALILN
ETGSKILIANAQLVPESSIDSIETGMRHRTAERVAKQTGSLVVAISQRRNVITLYQGNLRYTLKDIGVILTKANQAIQTL
EKYKAVWNDGITNLGILEFEEVVTMSEVVHVLHSVEMVLRIKNEILSYIHELGTEGRLIRLQLTELLADLEAEAALLIKD
YYQEKTQDHHQILKKLQELANTQLLEDSDLVKLLGYPGQTSLEESVTPRGYRITSKISRVPPLIIENLINRFKTLQGVCR
ATINELDDVEGIGEVRAKKIREGLKRIQEHLYMSRHN
>P37573 2.7.7.85~~~disA~~~DNA integrity scanning protein DisA~~~COG1623
MEKEKKGAKHELDLSSILQFVAPGTPLRAGMENVLRANTGGLIVVGYNDKVKEVVDGGFHINTAFSPAHLYELAKMDGAI
ILSDSGQKILYANTQLMPDATISSSETGMRHRTAERVAKQTGCLVIAISERRNVITLYQENMKYTLKDIGFILTKANQAI
QTLEKYKTILDKTINALNALEFEELVTFSDVLSVMHRYEMVLRIKNEINMYIKELGTEGHLIKLQVIELITDMEEEAALF
IKDYVKEKIKDPFVLLKELQDMSSYDLLDDSIVYKLLGYPASTNLDDYVLPRGYRLLNKIPRLPMPIVENVVEAFGVLPR
IIEASAEELDEVEGIGEVRAQKIKKGLKRLQEKHYLDRQL
>A0R564 2.7.7.85~~~disA~~~DNA integrity scanning protein DisA~~~COG1623
MAVKSGARSGRNVVHLARPTLRETLGRLAPGTPLRDGLERILRGRTGALIVLGYDDSVEAICDGGFVLDVRYAPTRLREL
SKMDGAVVLSSDGSRILRANVQLVPDPSIPTDESGTRHRSAERTAIQTGYPVISVSHSMSIVTVYVAGERHVVPDSATIL
SRANQTIATLERYKGRLDEVSRQLSTAEIEDFVTLRDVMTVVQRLEMVRRISLEIDADVVELGTDGRQLKLQLDELVGDN
ETARELIVRDYHANPDPPTAAQVAATLEELDSLSDSELLDFTVLARVFGYPSTAEAQDSAMSSRGYRAMAAIPRLQFAHV
DLLVRSFGSLQNLLAASADDLQSVDGIGSMWARHIREGLSLLAESTIADRLA
>P9WNW5 2.7.7.85~~~disA~~~DNA integrity scanning protein DisA~~~COG1623
MHAVTRPTLREAVARLAPGTGLRDGLERILRGRTGALIVLGHDENVEAICDGGFSLDVRYAATRLRELCKMDGAVVLSTD
GSRIVRANVQLVPDPSIPTDESGTRHRSAERAAIQTGYPVISVSHSMNIVTVYVRGERHVLTDSATILSRANQAIATLER
YKTRLDEVSRQLSRAEIEDFVTLRDVMTVVQRLELVRRIGLVIDYDVVELGTDGRQLRLQLDELLGGNDTARELIVRDYH
ANPEPPSTGQINATLDELDALSDGDLLDFTALAKVFGYPTTTEAQDSTLSPRGYRAMAGIPRLQFAHADLLVRAFGTLQG
LLAASAGDLQSVDGIGAMWARHVREGLSQLAESTISDQ
>Q9X8L6 2.7.7.85~~~disA~~~DNA integrity scanning protein DisA~~~COG1623
MAANDRAAAPGKSGGSAGADGLMRASLSAVAPGTSLRDGLERVLRGNTGGLIVLGSDKTVESMCTGGFVLDVEFTATRLR
ELCKLDGGIVLSSDLSKILRAGVQLLPDPTIPTEETGTRHRTADRVSKQVGFPVVSVSQSMRLIALYVDGQRRVLEDSAA
ILSRANQALATLERYKLRLDEVAGTLSALEIEDLVTVRDVSAVAQRLEMVRRIATEIAEYVVELGTDGRLLALQLDELIA
GVEPERELVVRDYVPEPTAKRSRTVDEALAELDKLSHAELLELSTVARALGYTGSPETLDSAVSPRGFRLLAKVPRLPGA
IIDRLVEHFGGLQKLLAASVDDLQTVDGVGEARARSVREGLSRLAESSILERYV
>Q9WY43 2.7.7.85~~~disA~~~DNA integrity scanning protein DisA~~~COG1623
MGVKSLVPQELIEKIKLISPGTELRKALDDIINANFGALIFLVDDPKKYEDVIQGGFWLDTDFSAEKLYELSKMDGAIVL
SEDITKIYYANVHLVPDPTIPTGETGTRHRTAERLAKQTGKVVIAVSRRRNIISLYYKNYKYVVNQVDFLISKVTQAIST
LEKYKDNFNKLLSELEVLELENRVTLADVVRTLAKGFELLRIVEEIRPYIVELGEEGRLARMQLRELTEDVDDLLVLLIM
DYSSEEVEEETAQNILQDFITRREPSPISISRVLGYDVQQAAQLDDVLVSARGYRLLKTVARIPLSIGYNVVRMFKTLDQ
ISKASVEDLKKVEGIGEKRARAISESISSLKHRKTSE
>O50406 5.5.1.16~~~~~~Type B diterpene cyclase~~~COG1657
METFRTLLAKAALGNGISSTAYDTAWVAKLGQLDDELSDLALNWLCERQLPDGSWGAEFPFCYEDRLLSTLAAMISLTSN
KHRRRRAAQVEKGLLALKNLTSGAFEGPQLDIKDATVGFELIAPTLMAEAARLGLAICHEESILGELVGVREQKLRKLGG
SKINKHITAAFSVELAGQDGVGMLDVDNLQETNGSVKYSPSASAYFALHVKPGDKRALAYISSIIQAGDGGAPAFYQAEI
FEIVWSLWNLSRTDIDLSDPEIVRTYLPYLDHVEQHWVRGRGVGWTGNSTLEDCDTTSVAYDVLSKFGRSPDIGAVLQFE
DADWFRTYFHEVGPSISTNVHVLGALKQAGYDKCHPRVRKVLEFIRSSKEPGRFCWRDKWHRSAYYTTAHLICAASNYDD
ALCSDAIGWILNTQRPDGSWGFFDGQATAEETAYCIQALAHWQRHSGTSLSAQISRAGGWLSQHCEPPYAPLWIAKTLYC
SATVVKAAILSALRLVDESNQ
>P84962 ~~~~~~Bacteriocin divergicin M35~~~
TKYYGNGVYCNSKKCWVDWGTAQGCIDVVIGQLGGGIPGKGKC
>P71021 ~~~divIVA~~~Septum site-determining protein DivIVA~~~COG3599
MPLTPNDIHNKTFTKSFRGYDEDEVNEFLAQVRKDYEIVLRKKTELEAKVNELDERIGHFANIEETLNKSILVAQEAAED
VKRNSQKEAKLIVREAEKNADRIINESLSKSRKIAMEIEELKKQSKVFRTRFQMLIEAQLDLLKNDDWDHLLEYEVDAVF
EEKE
>Q8CWP9 ~~~divIVA~~~Cell division protein DivIVA~~~COG3599
MPITSLEIKDKTFGTRFRGFDPEEVDEFLDIVVRDYEDLVRANHDKNLRIKSLEERLSYFDEIKDSLSQSVLIAQDTAER
VKQAAHERSNNIIHQAEQDAQRLLEEAKYKANEILRQATDNAKKVAVETEELKNKSRVFHQRLKSTIESQLAIVESSDWE
DILRPTATYLQTSDEAFKEVVSEVLGEPIPAPIEEEPIDMTRQFSQAEMEELQARIEVADKELSEFEAQIKQEVETPTPV
VSPQVEEEPLLIQLAQCMKNQK
>P16655 ~~~divIB~~~Cell division protein DivIB~~~COG1589
MNPGQDREKIVNIEERIPKIKEQRKQKANRRLISFIMLFFIMVLIIVYLQTPISKVSTISVTGNENVSKKEIIDLSDINS
GDTEFWSLDKQKTEKKIQQNKLVKKAEISKSLPNKINIAIEEYKAIAYLEKDDVYYEVLENGSVLPNEVTPDDAGPILVN
WTNAKKRSQMAKQLDALSNSLKQSISEIYYTPVKMDENRIKLYMNDGYVVTASIKTFADRMKTYPSIISQLSSNKKGIIH
LEVATYFEEFGKNDKAAKKEDEN
>Q5L0X5 ~~~divIB~~~Cell division protein DivIB~~~COG1589
MEKGKVVVLEDRVPKLKERRRQKANRRLIAYLSFFFLFILCVLYFQSPLGAVGHVEVSGNRHLTAERIISLSGITKRTSF
WKVNEQNVEKKLTRHPEIKEATVEKQLPNTIAIHVREWRRIAYVYDRQTFFPLLENGRLLKQEGTKTAPSDAPVLVGWKD
GDAIAEMTGQLAELPAAVLGAMSEIHYKPTREYEDRVIVYMNDGYEVSATIRQFADKLSHYPAIAAALDRNVKGVIHLEV
GSYFVPYSPPKKEDGDETTSP
>Q8DQM0 ~~~divIB~~~Cell division protein DivIB~~~COG1589
MSKDKKNEDKETLEELKELSEWQKRNQEYLKKKAEEEVALAEEKEKERQARMGEESEKSEDKQDQESETDQEDSESAKEE
SEEKVASSEADKEKEEPESKEKEEQDKKLAKKATKEKPAKAKIPGIHILRAFTILFPSLLLLIVSAYLLSPYATMKDIRV
EGTVQTTADDIRQASGIQDSDYTINLLLDKAKYEKQIKSNYWVESAQLVYQFPTKFTIKVKEYDIVAYYISGENHYPILS
SGQLETSSVSLNSLPETYLSVLFNDSEQIKVFVSELAQISPELKAAIQKVELAPSKVTSDLIRLTMNDSDEVLVPLSEMS
KKLPYYSKIKPQLSEPSVVDMEAGIYSYTVADKLIMEAEEKAKQEAKEAEKKQEEEQKKQEEESNRNQTNQRSSRR
>P37471 ~~~divIC~~~Cell division protein DivIC~~~COG2919
MNFSRERTITEIQNDYKEQVERQNQLKKRRRKGLYRRLTVFGALVFLTAIVLASSVWSQTSSLSAKEEKKEQLEKELKSL
KTKQTDLKEEISKLKDEDYVTELARRDLFMSGDGEIIFNVEKKSK
>Q03228 2.7.13.3~~~divJ~~~Histidine protein kinase DivJ~~~COG2205
MILPTALKSRLALEFETLPDPFRRPAARAAGLDPAHAWRLGWLAAVCLAAAAALFTADSGGWPVWAALGAGALPALVSLI
FTREDERTQSWLLVLWAVGGSLAAVLTGGVGGAMAAWCLAPVAAASTQDQPKRLAEGAALALIGACVAALTQLSGLAPAA
PTGPLAFVLGFLALVTTGLGLAAGLLIGRRRQGARDDRYASEIIGLETLLDGLPHLAIAVRGQGQVTAVRGAAPPGVTRA
DLVNRGLTGAAAPGDRQRLTAAIAQAHREGSASLTFNPALGVERVVALDMHRVAPNQLVGVLRDITVERHREHALDQARI
DAEALAAGRARFLANMSHELRTPLNAIMGFSDIMRARMFGPLSDRYAEYAELIHESGGHLLDLINDVLDMSKIEAERFEL
QRGVFDAREAVQAAMRLLRVQSDTAGVQLRGVLPPGELEVDADRRALKQIVLNLVSNALKFTPRGGQVTVTAHGYDGVLE
IVVADTGVGISPEDLERLGRPYEQAGGAEQRARGTGLGLSLVRAFAQLHGGEMVIESRLGAGTTVSVRLPVLLAPMVAAT
PTPPAAPEAPSAPEPAPTVEEPPPASLGDNVIAFAPR
>Q7BBW0 ~~~divK~~~Polar-differentiation response regulator DivK~~~
MTKSVMIVEDNELNMKLFRDLIEASGYETIRTRSGLEALDLAREHHPDLILMDIQLPEVSGLEVTKWLKDDEELRHIPVI
AVTAFAMKGDEERIRQGGCEAYISKPISVPRFIETIKSYLGDA
>Q9RQQ9 2.7.13.3~~~divL~~~Sensor protein DivL~~~COG2205
MTSYDLILAAAAGAVCLAISVALWSHGQRRNLEARIVALKTRLIQQGGSDDAPAWLDAFDTAVIAVEGGRANLVAGGEGL
IACAKALGADAEVSAVVAALSDADPNYAQKLTALFERGEPCVFEARGPHGLVSVEGRAAGALAWLRLAPIDRADSGLPTA
ARFAAFVDSVVEPCWIAGADGQAIWGNAAFVRAVGAASAQAPALAGKSFDRGADAVVVEAAGKGERREALRWINVEGRRR
AFRLSAQPLDGGGVGVFCADVTEIEDVRDAFKKHVEAHDETLNHIAEAVAIFSQTRRLSYHNTAFAELWGLEPAWLADRP
THGEVLDRLRQRRRLPETIDYAGWKAAELARYEDLGPQADDLWDLPDGRTLKVVRQPHPLGGMLLIYSDITGELRLKAQY
NALIQVQQATLDKLNDAVAVFGSDGRLRLHNEAFETFWNVTPHALEAAGDFEGVVELCVPRLHDLSFWRELKGRVADPDP
QMRAPTSGEVRTSDSRIVLYQSRPLPDGATLIAFADVTDTRDLQSALADRSAALAEAERLKRDFVGNVSYELRTPLTTII
GYSELLERADGISERGRNHVAAVRAAATQLARSIDDVLDMAQIDAGEMALEIEDIRVSDLLLNAQERALKDAQLGGVTLA
VECEEDVGLIRGDGKRLAQTLDHLVENALRQTPPGGRVTLSARRALGEVRLDVSDTGRGVPFHVQAHIFDRFVGRDRGGP
GLGLALVKALVELHGGWVALESEPGNGSTFTCHLPETQQPGAMQPELGF
>P31680 ~~~djlA~~~Co-chaperone protein DjlA~~~COG1076
MQYWGKIIGVAVALLMGGGFWGVVLGLLIGHMFDKARSRKMAWFANQRERQALFFATTFEVMGHLTKSKGRVTEADIHIA
SQLMDRMNLHGASRTAAQNAFRVGKSDNYPLREKMRQFRSVCFGRFDLIRMFLEIQIQAAFADGSLHPNERAVLYVIAEE
LGISRAQFDQFLRMMQGGAQFGGGYQQQTGGGNWQQAQRGPTLEDACNVLGVKPTDDATTIKRAYRKLMSEHHPDKLVAK
GLPPEMMEMAKQKAQEIQQAYELIKQQKGFK
>Q8GNT2 1.13.11.50~~~dke1~~~Acetylacetone-cleaving enzyme~~~
MDYCNKKHTAEEYVKISDNNYVPFPEAFSDGGITWQLLHSSPETSSWTAIFNCPAGSSFASHIHAGPGEYFLTKGKMEVR
GGEQEGGSTAYAPSYGFESSGALHGKTFFPVESQFYMTFLGPLNFIDDNGKVIASIGWAEAQGAWLATKNEAA
>P06632 1.1.1.346~~~dkgA~~~2,5-diketo-D-gluconic acid reductase A~~~
MTVPSIVLNDGNSIPQLGYGVFKVPPADTQRAVEEALEVGYRHIDTAAIYGNEEGVGAAIAASGIARDDLFITTKLWNDR
HDGDEPAAAIAESLAKLALDQVDLYLVHWPTPAADNYVHAWEKMIELRAAGLTRSIGVSNHLVPHLERIVAATGVVPAVN
QIELHPAYQQREITDWAAAHDVKIESWGPLGQGKYDLFGAEPVTAAAAAHGKTPAQAVLRWHLQKGFVVFPKSVRRERLE
ENLDVFDFDLTDTEIAAIDAMDPGDGSGRVSAHPDEVD
>Q46857 1.1.1.274~~~dkgA~~~2,5-diketo-D-gluconic acid reductase A~~~COG0656
MANPTVIKLQDGNVMPQLGLGVWQASNEEVITAIQKALEVGYRSIDTAAAYKNEEGVGKALKNASVNREELFITTKLWND
DHKRPREALLDSLKKLQLDYIDLYLMHWPVPAIDHYVEAWKGMIELQKEGLIKSIGVCNFQIHHLQRLIDETGVTPVINQ
IELHPLMQQRQLHAWNATHKIQTESWSPLAQGGKGVFDQKVIRDLADKYGKTPAQIVIRWHLDSGLVVIPKSVTPSRIAE
NFDVWDFRLDKDELGEIAKLDQGKRLGPDPDQFGG
>P15339 1.1.1.274~~~dkgB~~~2,5-diketo-D-gluconic acid reductase B~~~
MPNIPTISLNDGRPFAEPGLGTYNLRGDEGVAAMVAAIDSGYRLLDTAVNYENESEVGRAVRASSVDRDELIVASKIPGR
QHGRAEAVDSIRGSLDRLGLDVIDLQLIHWPNPSVGRWLDTWRGMIDAREAGLVRSIGVSNFTEPMLKTLIDETGVTPAV
NQVELHPYFPQAALRAFHDEHGIRTESWSPLARRSELLTEQLLQELAVVYGVTPTQVVLRWHVQLGSTPIPKSADPDRQR
ENADVFGFALTADQVDAISGLERGRLWDGDPDTHEEM
>P30863 1.1.1.346~~~dkgB~~~2,5-diketo-D-gluconic acid reductase B~~~COG0656
MAIPAFGLGTFRLKDDVVISSVITALELGYRAIDTAQIYDNEAAVGQAIAESGVPRHELYITTKIWIENLSKDKLIPSLK
ESLQKLRTDYVDLTLIHWPSPNDEVSVEEFMQALLEAKKQGLTREIGISNFTIPLMEKAIAAVGAENIATNQIELSPYLQ
NRKVVAWAKQHGIHITSYMTLAYGKALKDEVIARIAAKHNATPAQVILAWAMGEGYSVIPSSTKRKNLESNLKAQNLQLD
AEDKKAIAALDCNDRLVSPEGLAPEWD
>P0ABS1 ~~~dksA~~~RNA polymerase-binding transcription factor DksA~~~COG1734
MQEGQNRKTSSLSILAIAGVEPYQEKPGEEYMNEAQLAHFRRILEAWRNQLRDEVDRTVTHMQDEAANFPDPVDRAAQEE
EFSLELRNRDRERKLIKKIEKTLKKVEDEDFGYCESCGVEIGIRRLEARPTADLCIDCKTLAEIREKQMAG
>P11959 1.8.1.4~~~pdhD~~~Dihydrolipoyl dehydrogenase~~~
MVVGDFAIETETLVVGAGPGGYVAAIRAAQLGQKVTIVEKGNLGGVCLNVGCIPSKALISASHRYEQAKHSEEMGIKAEN
VTIDFAKVQEWKASVVKKLTGGVEGLLKGNKVEIVKGEAYFVDANTVRVVNGDSAQTYTFKNAIIATGSRPIELPNFKFS
NRILDSTGALNLGEVPKSLVVIGGGYIGIELGTAYANFGTKVTILEGAGEILSGFEKQMAAIIKKRLKKKGVEVVTNALA
KGAEEREDGVTVTYEANGETKTIDADYVLVTVGRRPNTDELGLEQIGIKMTNRGLIEVDQQCRTSVPNIFAIGDIVPGPA
LAHKASYEGKVAAEAIAGHPSAVDYVAIPAVVFSDPECASVGYFEQQAKDEGIDVIAAKFPFAANGRALALNDTDGFLKL
VVRKEDGVIIGAQIIGPNASDMIAELGLAIEAGMTAEDIALTIHAHPTLGEIAMEAAEVALGTPIHIITK
>Q9I1L9 1.8.1.4~~~lpdV~~~Dihydrolipoyl dehydrogenase~~~
MSQILKTSLLIVGGGPGGYVAAIRAGQLGIPTVLVEGAALGGTCLNVGCIPSKALIHAAEEYLKARHYASRSALGIQVQA
PSIDIARTVEWKDAIVDRLTSGVAALLKKHGVDVVQGWARILDGKSVAVELAGGGSQRIECEHLLLAAGSQSVELPILPL
GGKVISSTEALAPGSLPKRLVVVGGGYIGLELGTAYRKLGVEVAVVEAQPRILPGYDEELTKPVAQALRRLGVELYLGHS
LLGPSENGVRVRDGAGEEREIAADQVLVAVGRKPRSEGWNLESLGLDMNGRAVKVDDQCRTSMRNVWAIGDLAGEPMLAH
RAMAQGEMVAELIAGKRRQFAPVAIPAVCFTDPEVVVAGLSPEQAKDAGLDCLVASFPFAANGRAMTLEANEGFVRVVAR
RDNHLVVGWQAVGKAVSELSTAFAQSLEMGARLEDIAGTIHAHPTLGEAVQEAALRALGHALHI
>P09063 1.8.1.4~~~lpdV~~~Dihydrolipoyl dehydrogenase~~~COG1249
MQQTIQTTLLIIGGGPGGYVAAIRAGQLGIPTVLVEGQALGGTCLNIGCIPSKALIHVAEQFHQASRFTEPSPLGISVAS
PRLDIGQSVAWKDGIVDRLTTGVAALLKKHGVKVVHGWAKVLDGKQVEVDGQRIQCEHLLLATGSSSVELPMLPLGGPVI
SSTEALAPKALPQHLVVVGGGYIGLELGIAYRKLGAQVSVVEARERILPTYDSELTAPVAESLKKLGIALHLGHSVEGYE
NGCLLANDGKGGQLRLEADRVLVAVGRRPRTKGFNLECLDLKMNGAAIAIDERCQTSMHNVWAIGDVAGEPMLAHRAMAQ
GEMVAEIIAGKARRFEPAAIAAVCFTDPEVVVVGKTPEQASQQGLDCIVAQFPFAANGRAMSLESKSGFVRVVARRDNHL
ILGWQAVGVAVSELSTAFAQSLEMGACLEDVAGTIHAHPTLGEAVQEAALRALGHALHI
>Q9I3D1 1.8.1.4~~~lpdG~~~Dihydrolipoyl dehydrogenase~~~
MSQKFDVVVIGAGPGGYVAAIRAAQLGLKTACIEKYIGKEGKVALGGTCLNVGCIPSKALLDSSYKYHEAKEAFKVHGIE
AKGVTIDVPAMVARKANIVKNLTGGIATLFKANGVTSFEGHGKLLANKQVEVTGLDGKTQVLEAENVIIASGSRPVEIPP
APLTDDIIVDSTGALEFQAVPKKLGVIGAGVIGLELGSVWARLGAEVTVLEALDKFLPAADEQIAKEALKVLTKQGLNIR
LGARVTASEVKKKQVTVTFTDANGEQKETFDKLIVAVGRRPVTTDLLAADSGVTLDERGFIYVDDHCKTSVPGVFAIGDV
VRGAMLAHKASEEGVMVAERIAGHKAQMNYDLIPSVIYTHPEIAWVGKTEQTLKAEGVEVNVGTFPFAASGRAMAANDTT
GLVKVIADAKTDRVLGVHVIGPSAAELVQQGAIGMEFGTSAEDLGMMVFSHPTLSEALHEAALAVNGHAIHIANRKKR
>P31052 1.8.1.4~~~lpdG~~~Dihydrolipoyl dehydrogenase~~~COG1249
MTQKFDVVVIGAGPGGYVAAIKAAQLGLKTACIEKYTDAEGKLALGGTCLNVGCIPSKALLDSSWKYKEAKESFNVHGIS
TGEVKMDVAAMVGRKAGIVKNLTGGVATLFKANGVTSIQGHGKLLAGKKVEVTKADGTTEVIEAENVILASGSRPIDIPP
APVDQNVIVDSTGALEFQAVPKRLGVIGAGVIGLELGSVWARLGAEVTVLEALDTFLMAADTAVSKEAQKTLTKQGLDIK
LGARVTGSKVNGNEVEVTYTNAEGEQKITFDKLIVAVGRRPVTTDLLAADSGVTIDERGYIFVDDYCATSVPGVYAIGDV
VRGMMLAHKASEEGIMVVERIKGHKAQMNYDLIPSVIYTHPEIAWVGKTEQALKAEGVEVNVGTFPFAASGRAMAANDTG
GFVKVIADAKTDRVLGVHVIGPSAAELVQQGAIAMEFGTSAEDLGMMVFSHPTLSEALHEAALAVNGGAIHVANRKKR
>P31046 1.8.1.4~~~lpd3~~~Dihydrolipoyl dehydrogenase 3~~~COG1249
MKSYDVVIIGGGPGGYNAAIRAGQLGLTVACVEGRSTLGGTCLNVGCMPSKALLHASELYEAASGDEFAHLGIEVKPTLN
LAQMMKQKDESVTGLTKGIEYLFRKNKVDWIKGWGRLDGVGKVVVKAEDGSETALQAKDIVIATGSEPTPLPGVTIDNQR
IIDSTGALSLPQVPKHLVVIGAGVIGLELGSVWRRLGSQVTVIEYLDRICPGTDTETAKTLQKALAKQGMVFKLGSKVTQ
ATASADGVSLVLEPAAGGTAESLQADYVLVAIGRRPYTKGLNLESVGLETDKRGMLAQRTPPTSVPGVWVIGDVTSGPML
AHKAEDEAVACIERIAGKPHEVNYNLIPGVIYTRPELATVGKTEEQLKAEGRAYKVGKFPFTANSRAKINHETEGFAKVI
ADAETDEVLGVHLVGPSVSEMIGEFCVAMEFSASAEDIALTCHPHPTRSEALRQAAMNVDGMAMQI
>P18925 1.8.1.4~~~~~~Dihydrolipoyl dehydrogenase~~~
MSQKFDVIVIGAGPGGYVAAIKSAQLGLKTALIEKYKGKEGKTALGGTCLNVGCIPSKALLDSSYKFHEAHESFKLHGIS
TGEVAIDVPTMIARKDQIVRNLTGGVASLIKANGVTLFEGHGKLLAGKKVEVTAADGSSQVLDTENVILASGSKPVEIPP
APVDQDVIVDSTGALDFQNVPGKLGVIGAGVIGLELGSVWARLGAEVTVLEAMDKFLPAVDEQVAKEAQKILTKQGLKIL
LGARVTGTEVKNKQVTVKFVDAEGEKSQAFDKLIVAVGRRPVTTDLLAADSGVTLDERGFIYVDDYCATSVPGVYAIGDV
VRGAMLAHKASEEGVVVAERIAGHKAQMNYDLIPAVIYTHPEIAGVGKTEQALKAEGVAINVGVFPFAASGRAMAANDTA
GFVKVIADAKTDRVLGVHVIGPSAAELVQQGAIAMEFGTSAEDLGMMVFAHPALSEALHEAALAVSGHAIHVANRKK
>Q8NTE1 1.8.1.4~~~lpd~~~Dihydrolipoyl dehydrogenase~~~COG1249
MTEHYDVVVLGAGPGGYVSAIRAAQLGKKVAVIEKQYWGGVCLNVGCIPSKSLIKNAEVAHTFTHEKKTFGINGEVTFNY
EDAHKRSRGVSDKIVGGVHYLMKKNKIIEIHGLGNFKDAKTLEVTDGKDAGKTITFDDCIIATGSVVNTLRGVDFSENVV
SFEEQILNPVAPKKMVIVGAGAIGMEFAYVLGNYGVDVTVIEFMDRVLPNEDAEVSKVIAKAYKKMGVKLLPGHATTAVR
DNGDFVEVDYQKKGSDKTETLTVDRVMVSVGFRPRVEGFGLENTGVKLTERGAIEIDDYMRTNVDGIYAIGDVTAKLQLA
HVAEAQGIVAAETIAGAETQTLGDYMMMPRATFCNPQVSSFGYTEEQAKEKWPDREIKVASFPFSANGKAVGLAETDGFA
KIVADAEFGELLGAHLVGANASELINELVLAQNWDLTTEEISRSVHIHPTLSEAVKEAAHGISGHMINF
>P0A9P0 1.8.1.4~~~lpdA~~~Dihydrolipoyl dehydrogenase~~~COG1249
MSTEIKTQVVVLGAGPAGYSAAFRCADLGLETVIVERYNTLGGVCLNVGCIPSKALLHVAKVIEEAKALAEHGIVFGEPK
TDIDKIRTWKEKVINQLTGGLAGMAKGRKVKVVNGLGKFTGANTLEVEGENGKTVINFDNAIIAAGSRPIQLPFIPHEDP
RIWDSTDALELKEVPERLLVMGGGIIGLEMGTVYHALGSQIDVVEMFDQVIPAADKDIVKVFTKRISKKFNLMLETKVTA
VEAKEDGIYVTMEGKKAPAEPQRYDAVLVAIGRVPNGKNLDAGKAGVEVDDRGFIRVDKQLRTNVPHIFAIGDIVGQPML
AHKGVHEGHVAAEVIAGKKHYFDPKVIPSIAYTEPEVAWVGLTEKEAKEKGISYETATFPWAASGRAIASDCADGMTKLI
FDKESHRVIGGAIVGTNGGELLGEIGLAIEMGCDAEDIALTIHAHPTLHESVGLAAEVFEGSITDLPNPKAKKK
>P75393 1.8.1.4~~~pdhD~~~Dihydrolipoyl dehydrogenase~~~
MNYDLIIIGAGPAGYVAAEYAGKHKLKTLVVEKEYFGGVCLNVGCIPTKTLLKRAKIVDYLRHAQDYGISINGQVALNWN
QLLEQKGKVVSKLVGGVKAIIASAKAETVMGEAKVLDPNTVEVAGKTYTTKSIVVATGSRPRYLTLPGFAEARQNGFVID
STQALSLEGVPRKLVVVGGGVIGIEFAFLYASLGSEVTILQGVDRILEIFDTEVSDLVAKLLQTKNVKIITNAQVTRANN
NEVFYSQNGQEGSVVGDRILVSIGRIPNTECLDGLNLQRDERNRIVLNQDLQTSIPNIYIVGDANAQLMLAHFAYQQGRY
AVNHILNKKQVKPAQKLTCPSCIYTNPEVASVGYTEMELKKQGIPYVKTNLVLAHCGKAIADNETNGFVKMMFDPQTGKI
LGCCIIAATASDMIAELALAMGAGLTVFDIANSISPHPTINEMIADVCKKALFDHFK
>P9WHH9 1.8.1.4~~~lpdC~~~Dihydrolipoyl dehydrogenase~~~COG1249
MTHYDVVVLGAGPGGYVAAIRAAQLGLSTAIVEPKYWGGVCLNVGCIPSKALLRNAELVHIFTKDAKAFGISGEVTFDYG
IAYDRSRKVAEGRVAGVHFLMKKNKITEIHGYGTFADANTLLVDLNDGGTESVTFDNAIIATGSSTRLVPGTSLSANVVT
YEEQILSRELPKSIIIAGAGAIGMEFGYVLKNYGVDVTIVEFLPRALPNEDADVSKEIEKQFKKLGVTILTATKVESIAD
GGSQVTVTVTKDGVAQELKAEKVLQAIGFAPNVEGYGLDKAGVALTDRKAIGVDDYMRTNVGHIYAIGDVNGLLQLAHVA
EAQGVVAAETIAGAETLTLGDHRMLPRATFCQPNVASFGLTEQQARNEGYDVVVAKFPFTANAKAHGVGDPSGFVKLVAD
AKHGELLGGHLVGHDVAELLPELTLAQRWDLTASELARNVHTHPTMSEALQECFHGLVGHMINF
>P14218 1.8.1.4~~~lpd~~~Dihydrolipoyl dehydrogenase~~~
MSQKFDVVVIGAGPGGYVAAIRAAQLGLKTACIEKYIGKEGKVALGGTCLNVGCIPSKALLDSSYKYHEAKEAFKVHGIE
AKGVTIDVPAMVARKANIVKNLTGGIATLFKANGVTSFEGHGKLLANKQVEVTGLDGKTQVLEAENVIIASGSRPVEIPP
APLSDDIIVDSTGALEFQAVPKKLGVIGAGVIGLELGSVWARLGAEVTVLEALDKFLPAADEQIAKEALKVLTKQGLNIR
LGARVTASEVKKKQVTVTFTDANGEQKETFDKLIVAVGRRPVTTDLLAADSGVTLDERGFIYVDDHCKTSVPGVFAIGDV
VRGAMLAHKASEEGVMVAERIAGHKAQMNYDLIPSVIYTHPEIAWVGKTEQTLKAEGVEVNVGTFPFAASGRAMAANDTT
GLVKVIADAKTDRVLGVHVIGPSAAELVQQGAIGMEFGTSAEDLGMMVFSHPTLSEALHEAALAVNGHAIHIANRKKR
>P99084 1.8.1.4~~~pdhD~~~Dihydrolipoyl dehydrogenase~~~
MVVGDFPIETDTIVIGAGPGGYVAAIRAAQLGQKVTIVEKGNLGGVCLNVGCIPSKALLHASHRFVEAQHSENLGVIAES
VSLNFQKVQEFKSSVVNKLTGGVEGLLKGNKVNIVKGEAYFVDNNSLRVMDEKSAQTYNFKNAIIATGSRPIEIPNFKFG
KRVIDSTGALNLQEVPGKLVVVGGGYIGSELGTAFANFGSEVTILEGAKDILGGFEKQMTQPVKKGMKEKGVEIVTEAMA
KSAEETDNGVKVTYEAKGEEKTIEADYVLVTVGRRPNTDELGLEELGVKFADRGLLEVDKQSRTSISNIYAIGDIVPGLP
LAHKASYEAKVAAEAIDGQAAEVDYIGMPAVCFTEPELATVGYSEAQAKEEGLAIKASKFPYAANGRALSLDDTNGFVKL
ITLKEDDTLIGAQVVGTGASDIISELGLAIEAGMNAEDIALTIHAHPTLGEMTMEAAEKAIGYPIHTM
>P72740 1.8.1.4~~~lpdA~~~Dihydrolipoyl dehydrogenase~~~COG1249
MSQDFDYDLVIIGAGVGGHGAALHAVKCGLKTAIIEAKDMGGTCVNRGCIPSKALLAASGRVREMSDQDHLQQLGIQING
VTFTREAIAAHANDLVSKIQSDLTNSLTRLKVDTIRGWGKVSGPQEVTVIGDNETRILKAKEIMLCPGSVPFVPPGIEID
HKTVFTSDEAVKLETLPQWIAIIGSGYIGLEFSDVYTALGCEVTMIEALPDLMPGFDPEIAKIAERVLIKSRDIETYTGV
FATKIKAGSPVEIELTDAKTKEVIDTLEVDACLVATGRIPATKNLGLETVGVETDRRGFIEVNDQMQVIKDGKPVPHLWA
VGDATGKMMLAHAASGQGVVAVENICGRKTEVDYRAIPAAAFTHPEISYVGLTEAQAKELGEKEGFVVSTAKTYFKGNSK
ALAEKETDGIAKVVYRQDTGELLGAHIIGIHASDLIQEAAQAIADRKSVRELAFHVHAHPTLSEVLDEAYKRAV
>P85207 1.8.1.4~~~lpd~~~Dihydrolipoyl dehydrogenase~~~COG1249
MKTYDLIVIGTGPGGYPAAIRGAQLGLKVLAVEAAEVGGVCLNVGCIPTKALLHAAETVHHLKGAEGFGLKAKPELDLKK
LGAWRDGVVKKLTGGVAGLLKGNKVELLRGFARFKGPREIEVNGETYGAQSFIIATGSEPMPLKGFPFGEDVWDSTRALR
VEEGIPKRLLVIGGGAVGLELGQIYHRLGSEVTLIEYMPEILPAGDRETAALLRKALEKEGLKVRTGTKAVGYEKKQDGL
HVLLEAAQGGSQEEIVVDKILVAVGRRPRTEGLGLEKAGVKVDERGFIQVNARMETSAPGVYAIGDVARPPLLAHKAMKE
GLVAAENAAGKNALFDFQVPSVVYTGPEWAGVGLTEEEARKAGYNVKVGKFPFSASGRALTLGGAEGLIKVVGDAETDLL
LGVFVVGPQAGELIAEATLALEMGATVSDLGLTIHPHPTLSEGLMEAAEALHKQAIHILNR
>D4MUV9 1.1.99.6~~~~~~D-lactate dehydrogenase~~~
MSEYQYNKVTPEMIEKFKEIAPKRVLVGDEINEDFTHDEMAIYGKARPEVLVEATSTEEVAAVVKLCNENKIPVTPSGAR
TGLVGGAVSIGGGVMISLTKMNKILGYDKENFVVKIQSGVLLNDLAQDAEKQGLLYPPDPGEKFATVGGNVATNAGGMRA
VKYGCTRDYVRAMTVVLPTGEIVKLGATVSKTSSGYSLLNLMIGSEGTLGIITELTLKVIPAPKSVISLIIPYENLEDCI
ATVPQFFMHHLAPQALEFMEKEVVMDTEKFLGKQVYPKELEGTEIGAYLLATFDGNSEEQLEDIIEQASEVVLEAGAIDV
LVADTPALKKDAWAVRGALLEAIEADTVLLDECDVVVPTNKIAEFLTYTKSLEAEADFRVKSFGHAGDGNLHIYTCSNDM
EEGEFKKQVAVFMDKVYAKATEFGGMISGEHGIGHGKMDYLAESLGPVQMRIMEGVKEVFDPNMILNPGKICYKL
>Q8NRY8 1.1.5.12~~~dld~~~Quinone-dependent D-lactate dehydrogenase~~~COG0277
MTQPGQTTTTSHEAIDAFKRIVGDEHVLTSERATMPFSKGYRFGGGPVFAVVRPGTLVEMWRALQVSVDNNLIVIPQASN
TGLTGGSGPGFQDYDRPIVIISTHRIDEVHLINDAREAISLAGTPLTHLTDALAKHQREPHSVIGSTSIGASVIGGIANN
SGGSQIRKGPAFTREAIFARVNDDGKVELVNHLGISLGDDPEVALDRLQRGEWSPEDVTPAPEDSNETEYAEHLRKIVPS
PARYNANPEYLFEASGSAGKLMVFAVRTRTFPREVHPTVFYIGTNNTHELEEIRRLFLEADMPLPISGEYMGRSAFDLAE
KYGKDTFVFLKFMSPALQTRMFSFKTWANGLFSKIPGIGPTFADTVSQAMFSVLPNQLPKRMMEYRNRFEHHLLLTVSES
QKAASEKMLKEFFAEPEHTGEFFICTSDEEKSASLNRFGAASAATRYAALKRRHIAGLIPIDVALRRDDWNWLEVLPEEI
DDQLEVKAYYGHFFCHVMHQDYVAKQGVDLEALHDRIQHLLEERGAKLPAEHNYGRIYKLPESMEEHFKELDPTNTFNAG
IGGTSPHKDWA
>P06149 1.1.5.12~~~dld~~~Quinone-dependent D-lactate dehydrogenase~~~COG0277
MSSMTTTDNKAFLNELARLVGSSHLLTDPAKTARYRKGFRSGQGDALAVVFPGSLLELWRVLKACVTADKIILMQAANTG
LTEGSTPNGNDYDRDVVIISTLRLDKLHVLGKGEQVLAYPGTTLYSLEKALKPLGREPHSVIGSSCIGASVIGGICNNSG
GSLVQRGPAYTEMSLFARINEDGKLTLVNHLGIDLGETPEQILSKLDDDRIKDDDVRHDGRHAHDYDYVHRVRDIEADTP
ARYNADPDRLFESSGCAGKLAVFAVRLDTFEAEKNQQVFYIGTNQPEVLTEIRRHILANFENLPVAGEYMHRDIYDIAEK
YGKDTFLMIDKLGTDKMPFFFNLKGRTDAMLEKVKFFRPHFTDRAMQKFGHLFPSHLPPRMKNWRDKYEHHLLLKMAGDG
VGEAKSWLVDYFKQAEGDFFVCTPEEGSKAFLHRFAAAGAAIRYQAVHSDEVEDILALDIALRRNDTEWYEHLPPEIDSQ
LVHKLYYGHFMCYVFHQDYIVKKGVDVHALKEQMLELLQQRGAQYPAEHNVGHLYKAPETLQKFYRENDPTNSMNPGIGK
TSKRKNWQEVE
>P37672 1.1.1.130~~~dlgD~~~2,3-diketo-L-gulonate reductase~~~COG2055
MKVTFEQLKAAFNRVLISRGVDSETADACAEMFARTTESGVYSHGVNRFPRFIQQLENGDIIPDAQPKRITSLGAIEQWD
AQRSIGNLTAKKMMDRAIELAADHGIGLVALRNANHWMRGGSYGWQAAEKGYIGICWTNSIAVMPPWGAKECRIGTNPLI
VAIPSTPITMVDMSMSMFSYGMLEVNRLAGRQLPVDGGFDDEGNLTKEPGVIEKNRRILPMGYWKGSGMSIVLDMIATLL
SDGASVAEVTQDNSDEYGISQIFIAIEVDKLIDGPTRDAKLQRIMDYVTSAERADENQAIRLPGHEFTTLLAENRRNGIT
VDDSVWAKIQAL
>A0A6M7H989 5.3.1.15~~~~~~D-lyxose/D-mannose isomerase~~~
MKRSAINDILGHTRQFFSQHDVHLPPFASFSPAQWQQLDTAAWEEVFDLKLGWDVTAFGRNNFAAHGLTLFTLRNGSAKG
MPYVKCYAEKIMHVRDAQVTPMHFHWRKREDIINRGGGNLIVELWNADSNEQTADSDITVVIDGCRQKHTAGSQLRLSPG
ESICLPPGLYHSFWAEAGFGDVLVGEVSSVNDDDHDNHFLQPLDRYNLIDEDEPAQLVLCNEYRQFR
>D5MTT1 5.3.1.15~~~lyxi~~~D-lyxose/D-mannose isomerase~~~
MKPSAVNQILQQTQHFFARFDVHLPPFAHFSPAVWQQLDRQPWQEVFDLKLGWDVTAFGDDDFARKGLTLFTLRNGSPGG
KPYAKGYAEKIMHCREAQVTPMHFHWRKREDIINRGGGNLIVELHNADTRDGLAETAVTVTLDGCRQTHAAGSRLRLAPG
ESICLTPAIYHSFWGEEGFGDVLVGEVSTVNDDDNDNRFLQPLSRFSQIEEDQPPQWLLCHEYLRFIA
>D9RZ53 5.3.1.15~~~~~~D-lyxose/D-mannose isomerase~~~COG3822
MLKKSKVKEIQEKVYEALKKANIAITPEEKENIEVADFGLGDLENTGLQLLVYVNTDRYCAKELVLFPGQTCPEHRHPPV
NGKPGKQETFRCRYGKVYLYVEGEKTENPHCRPPKGSEQYYTVWHEIELNPGEQYTIEPNTLHWFQAGEEGAIVSEFSSH
SDDESDIFTDPRIKRIPEIED
>A0A0H3PJL7 ~~~dlp1~~~Dynamin-like protein 1~~~COG0699
MKELFQKIWQNELQFLNFDAKFQDKSKLDTAECAIILSVNKDNYERYFLLKEFQELCKKIDLRVDIFSMQNAQICILNLF
KSGFISKQDLLKALKILEKISKNTEIFDFILQEKVQSIDQKALFQNDFKELNTINLELQKLSFDENLKSRLQKTLEKFQN
LEFNIAITGVMNAGKSSLLNALLKEDFLGVSNIPETANLTVLSYGKSEEAKIYFWDKKEWQNILESSHFNADLKEFIDKL
DKSVNIEDFIKDKPLIQNIALCELKNFSSAKNKISALIKKIEIKSHLEFLKNNISIVDTPGLDDVVVQREIVTNEYLRES
DFLIHLMNASQSLTQKDADFLVHCLLNSRLSKFLIVLTKADLLSKKDLEEVIVYTKESLKSRLVDLDENLVEKIDFLCVS
AKMASDFYKGLASKESLQKSGMQEFENYLFNELYAGEKSKIALRAYKKELHLELKNILSEYEMQNRLIKENKQGVSEENQ
KLLLELQKQNTLLKEAQDEISNSIAKLKNIDSGIDNLVLLLAKKLKERLIDEFKYLKNNAQKLNLSRILNIVDITTKDGI
NDILREIKFENIKKIEELKTNLSLKYDFLKDDFDNGFEGFKDGISKNIDSIFQSEKFALLRLKIEKLSNLKSDLYELETN
LDTVIFDTFKEFKMSEILNSLNINGAFFEFLNDKLKHYEKNQKSKLESLEKVLQSLKNQDANILNSFEENLEKIEKLKQL
EMGLLNAD
>A0A0H3PJK4 ~~~dlp2~~~Dynamin-like protein 2~~~COG0699
MQINLLNDFIKAYENTYSVSFDDSFKGRIQELCKELNEPFMHASYALENELKELVFSLDKNVNIAIIGQFSSGKSSLLNL
ILGRDCLPTGVVPVTFKPTFLRYAKEYFLRVEFEDGSDIITNIEKLAFYTDQRNEVKQAKSLHIFAPIPLLEKITLVDTP
GLNANENDTLTTLDELKNIHGAIWLSLIDNAGKKSEEDAIKANLELLGENSICVLNQKDKLSAEELDNVLNYAKSVFLKY
FNELIAISCKEAKDEQSYEKSNFQSLLDFLTQLDTTVLKEKFVKRKILNLCEILEDENQLFVGIFDRLLNQFQSYEKHLL
LAYENFLKEIEILNHQILEQLKSISERISSEIFASVKEKDAYFYKESKGFLKKDLYTRYDYKAPYISSDDAFLAMFYNSD
VMSKEFKKIKNELYKSFEEIKMKLKDFINILEREILLFKAEFSNIQKDHIFQSDKNFSELRAFCNASDEYFLKDFKELLF
KSILELDLFFEKLNLKAFTNYENATKLSLAFFSRKINESRVLYELDSSEFVLFYPKKSEIYERVLNELNVYEFETLLINK
PILTKIAKNFLEQSQNLIQEKNKFLDLKKAELQKRRAQILNVRESIKED
>Q8XAF5 ~~~dlsT~~~Serine transporter~~~COG0814
MEIASNKGVIADASTPAGRAGMSESEWREAIKFDSTDTGWVIMSIGMAIGAGIVFLPVQVGLMGLWVFLLSSVIGYPAMY
LFQRLFINTLAESPECKDYPSVISGYLGKNWGILLGALYFVMLVIWMFVYSTAITNDSASYLHTFGVTEGLLSDSPFYGL
VLICILVAISSRGEKLLFKISTGMVLTKLLVVAALGVSMVGMWHLYNVGSLPPLGLLVKNAIITLPFTLTSILFIQTLSP
MVISYRSREKSIEVARHKALRAMNIAFGILFIIVFFYAVSFTLAMGHDEAVKAYEQNISALAIAAQFISGDGAAWVKVVS
VILNIFAVMTAFFGVYLGFREATQGIVMNILRRKMPAEKINENLVQRGIMIFAILLAWSAIVLNAPVLSFTSICSPIFGL
VGCLIPAWLVYKVPALHKYKGMSLYLIIVTGLLLCVSPFLAFS
>P42628 ~~~dlsT~~~Probable serine transporter~~~COG0814
MEIASNKGVIADASTPAGRAGMSESEWREAIKFDSTDTGWVIMSIGMAIGAGIVFLPVQVGLMGLWVFLLSSVIGYPAMY
LFQRLFINTLAESPECKDYPSVISGYLGKNWGILLGALYFVMLVIWMFVYSTAITNDSASYLHTFGVTEGLLSDSPFYGL
VLICILVAISSRGEKLLFKISTGMVLTKLLVVAALGVSMVGMWHLYNVGSLPPLGLLVKNAIITLPFTLTSILFIQTLSP
MVISYRSREKSIEVARHKALRAMNIAFGILFVTVFFYAVSFTLAMGHDEAVKAYEQNISALAIAAQFISGDGAAWVKVVS
VILNIFAVMTAFFGVYLGFREATQGIVMNILRRKMPAEKINENLVQRGIMIFAILLAWSAIVLNAPVLSFTSICSPIFGM
VGCLIPAWLVYKVPALHKYKGMSLYLIIVTGLLLCVSPFLAFS
>Q81G39 6.2.1.54~~~dltA~~~D-alanine--D-alanyl carrier protein ligase~~~
MKLLEQIEKWAAETPDQTAFVWRDAKITYKQLKEDSDALAHWISSEYPDDRSPIMVYGHMQPEMIINFLGCVKAGHAYIP
VDLSIPADRVQRIAENSGAKLLLSATAVTVTDLPVRIVSEDNLKDIFFTHKGNTPNPEHAVKGDENFYIIYTSGSTGNPK
GVQITYNCLVSFTKWAVEDFNLQTGQVFLNQAPFSFDLSVMDIYPSLVTGGTLWAIDKDMIARPKDLFASLEQSDIQVWT
STPSFAEMCLMEASFSESMLPNMKTFLFCGEVLPNEVARKLIERFPKATIMNTYGPTEATVAVTGIHVTEEVLDQYKSLP
VGYCKSDCRLLIMKEDGTIAPDGEKGEIVIVGPSVSVGYLGSPELTEKAFTMIDGERAYKTGDAGYVENGLLFYNGRLDF
QIKLHGYRMELEEIEHHLRACSYVEGAVIVPIKKGEKYDYLLAVVVPGEHSFEKEFKLTSAIKKELNERLPNYMIPRKFM
YQSSIPMTPNGKVDRKKLLSEVTA
>P39581 6.2.1.54~~~dltA~~~D-alanine--D-alanyl carrier protein ligase~~~COG1020
MKLLHAIQTHAETYPQTDAFRSQGQSLTYQELWEQSDRAAAAIQKRISGEKKSPILVYGHMEPHMIVSFLGSVKAGHPYI
PVDLSIPSERIAKIIESSGAELLIHAAGLSIDAVGQQIQTVSAEELLENEGGSVSQDQWVKEHETFYIIYTSGSTGNPKG
VQISAANLQSFTDWICADFPVSGGKIFLNQAPFSFDLSVMDLYPCLQSGGTLHCVTKDAVNKPKVLFEELKKSGLNVWTS
TPSFVQMCLMDPGFSQDLLPHADTFMFCGEVLPVSVAKALLERFPKAKIFNTYGPTEATVAVTSVEITNDVISRSESLPV
GFAKPDMNIFIMDEEGQPLPEGEKGEIVIAGPSVSRGYLGEPELTEKAFFSHEGQWAYRTGDAGFIQDGQIFCQGRLDFQ
IKLHGYRMELEEIEFHVRQSQYVRSAVVIPYQPNGTVEYLIAAIVPEEHEFEKEFQLTSAIKKELAASLPAYMIPRKFIY
QDHIQMTANGKIDRKRIGEEVLV
>P35854 6.2.1.54~~~dltA~~~D-alanine--D-alanyl carrier protein ligase~~~COG1020
MIDNVITAIDRVAAEHPTRVAYDYEGTQYTYAQLKEGSDRLAGFFAESLPAGEPIIVYGGQTFDMVEVFLGLSKSGHAYI
PIDTHSPNERITQVQDVAHAPAVIEVAPLPITVPDVKIIRAPALHQAEQSHAPIHSLQHAVAGDDNYYIIFTSGTTGKPK
GVQISHDNLLSYVNWNISDFGLEEGVVAMSQPPYSFDLSVMDLYPTLVLGGTLKALPKEVTDNFKELFATLPKLGLNEWV
STPSFVEIALLDPNFKQENYPNLTHFLFCGEELVNKTAQALITRFPKATVYNTYGPTEATVAVTGMAITQAIVDQYPRLP
IGYAKPDTNVYVVDEQGEQVSAGTEGELMIVGPSVSKGYLNNPDKTAAAFFKAGNQRGYRSGDLVTMTADGMVFYRGRTD
FQVKLHGYRIELEDVDHNLNQVSYIKQASTVPRYDKDHKVAQLIAFAVAKPNDFDSEMKLTQAIKAELGKMVMEYMIPQR
IIYRDQLPLTANGKVDRKALIAEVNH
>P68876 6.2.1.54~~~dltA~~~D-alanine--D-alanyl carrier protein ligase~~~
MTDIINKLQAFADANPQSIAVRHTTDELTYQQLMDESSKLAHRLQGSKKPMILFGHMSPYMIVGMIGAIKAGCGYVPVDT
SIPEDRIKMIINKVQPEFVFNTTDESFESLEGEVFTIEDIKTSQDPVIFDSQIKDNDTVYTIFTSGSTGEPKGVQIEYAS
LVQFTEWMLELNKSGNKQQWLNQAPFSFDLSVMAIYPCLASGGTLNLVDKNMINKPKLLNEMLTATPINIWVSTPSFMEM
CLLLPTLNEEQYGSLNEFFFCGEILPHRAAKALVSRFPSATIYNTYGPTEATVAVTSIQITQEILDQYPTLPVGVERLGA
RLSTTDDGELVIEGQSVSLGYLKNDQKTAEVFNFDDGIRTYHTGDKAKFENGQWFIQGRIDFQIKLNGYRMELEEIETQL
RQSEFVKEAIVVPVYKNDKVIHLIGAIVPTTEVTDNAEMTKNIKNDLKSRLPEYMIPRKFEWMEQLPLTSNGKIDRKKIA
EVING
>P99107 6.2.1.54~~~dltA~~~D-alanine--D-alanyl carrier protein ligase~~~
MTDIINKLQAFADANPQSIAVRHTTDELTYQQLMDESSKLAHRLQGSKKPMILFGHMSPYMIVGMIGAIKAGCGYVPVDT
SIPEDRIKMIINKVQPEFVFNTTDESFESLEGEVFTIEDIKTSQDPVIFDSQIKDNDTVYTIFTSGSTGEPKGVQIEYAS
LVQFTEWMLELNKSGNKQQWLNQAPFSFDLSVMAIYPCLASGGTLNLVDKNMINKPKLLNEMLTATPINIWVSTPSFMEM
CLLLPTLNEEQYGSLNEFFFCGEILPHRAAKALVSRFPSATIYNTYGPTEATVAVTSIQITQEILDQYPTLPVGVERLGA
RLSTTDDGELVIEGQSVSLGYLKNDQKTAEVFNFDDGIRTYHTGDKAKFENGQWFIQGRIDFQIKLNGYRMELEEIETQL
RQSEFVKEAIVVPVYKNDKVIHLIGAIVPTTEVTDNAEMTKNIKNDLKSRLPEYMIPRKFEWMEQLPLTSNGKIDRKKIA
EVING
>Q99ZA6 6.2.1.54~~~dltA~~~D-alanine--D-alanyl carrier protein ligase~~~
MIKDMIDSIEQFAQTQADFPVYDCLGERRTYGQLKRDSDSIAAFIDSLALLAKSPVLVFGAQTYDMLATFVALTKSGHAY
IPVDVHSAPERILAIIEIAKPSLIIAIEEFPLTIEGISLVSLSEIESAKLAEMPYERTHSVKGDDNYYIIFTSGTTGQPK
GVQISHDNLLSFTNWMIEDAAFDVPKQPQMLAQPPYSFDLSVMYWAPTLALGGTLFALPKELVADFKQLFTTIAQLPVGI
WTSTPSFADMAMLSDDFCQAKMPALTHFYFDGEELTVSTARKLFERFPSAKIINAYGPTEATVALSAIEITREMVDNYTR
LPIGYPKPDSPTYIIDEDGKELSSGEQGEIIVTGPAVSKGYLNNPEKTAEAFFTFKGQPAYHTGDIGSLTEDNILLYGGR
LDFQIKYAGYRIELEDVSQQLNQSPMVASAVAVPRYNKEHKVQNLLAYIVVKDGVKERFDRELELTKAIKASVKDHMMSY
MMPSKFLYRDSLPLTPNGKIDIKTLINEVNNR
>Q5XBN5 6.2.1.54~~~dltA~~~D-alanine--D-alanyl carrier protein ligase~~~
MIKDMIDSIEQFAQTQADFPVYDCLGERRTYGQLKRDSDSIAAFIDSLALLAKSPVLVFGAQTYDMLATFVALTKSGHAY
IPVDVHSAPERILAIIEIAKPSLIIAIEEFPLTIEGISLVSLSEIESAKLAEMPYERTHSVKGDDNYYIIFTSGTTGQPK
GVQISHDNLLSFTNWMIEDAAFDVPKQPQMLAQPPYSFDLSVMYWAPTLALGGTLFALPKELVADFKQLFTTIAQLPVGI
WTSTPSFADMAMLSDDFCQAKMPALTHFYFDGEELTVSTARKLFERFPSAKIINAYGPTEATVALSAIEITREMVDNYTR
LPIGYPKPDSPTYIIDEDGKELSSGEQGEIIVTGPAVSKGYLNNPEKTAEAFFTFKGQPAYHTGDIGSLTEDNILLYGGR
LDFQIKYAGYRIELEDVSQQLNQSPMVASAVAVPRYNKEHKVQNLLAYIVVKDGVKERFDRELELTKAIKASVKDHMMSY
MMPSKFLYRDSLPLTPNGKIDIKTLINEVNNR
>P39580 2.3.1.-~~~dltB~~~Teichoic acid D-alanyltransferase~~~COG1696
MTPYSSFLFFILLGILLLPTIILGLNGKRFQAYNMFISIIILALIFSHDLHGVIALCLFTIWQVLLISGYLAYRQKANSG
FVFCGAVIASILPLFLSKIWPFLSHPQPHHPPHNLISFLGISYLTFKGVQLIMEARDGLLKEQLPLHRLLYFILFFPTIS
SGPIDRYRRFVKDEQKAWTKEEYADLLYTGIHKIFIGFLYKFIIGYAINTYFIMNLPAITHNKILGNLLYMYGYSMYLFF
DFAGYTMFAVGVSYIMGIKSPENFNKPFISKNIKDFWNRWHMSLSFWFRDYVFMRFVFWMTKKKWIKNRMAVSNIGYFLL
FMLMGVWHGLAPQYIIYGLYHAVLMTCYNFFEKWNKKYKWLPSNRWTTILAIVITFHFVCFGFYIFSGKPFHHHH
>Q5M4V4 2.3.1.-~~~dltB~~~Teichoic acid D-alanyltransferase~~~COG1696
MIDFLKQLPHLEPYGNPFYFIYLGIALLPIFIGLFFKKRFAIYECLVSITFIVLALTGTHASQILALLFYIVWQIIWVYS
YKRYRSQRDNKWVFYLHSFLVVLPLILVKVEPTINGTQSLLNFLGISYLTFRAVGMIIEMRDGVLKEFTLGEFLRFMLFM
PTFTSGPIDRFKRFNEDYQSIPNRDELLNMLEQAVKYIMLGFLYKFVLAQIFGSMLLPPLKAQALSQGGIFNLPTLGVMY
VYGFDLFFDFAGYSMFALAVSNLMGIKSPINFDKPFISRDMKEFWNRWHMSLSFWFRDFVFMRLVIVLMRNKVFKNRNTT
SNVAYIINMMVMGFWHGITWYYIAYGIFHGIGLVINDAWLRKKKTINKDRKKAGLKPLPENKWTKALGIFITFNTVMLSF
LIFSGFLNDLWFTKK
>Q88VM8 ~~~dltC1~~~D-alanyl carrier protein 1~~~COG0236
MTMDDTKATVLSILADLTGEDVSSNMDVNLFDEGILDSMGSVQLLLELQNQLGIEVPVSEFQRSEWDTPAKIVAKVENLQ
>P39579 ~~~dltC~~~D-alanyl carrier protein~~~COG0236
MDFKQEVLDVLAEVCQDDIVKENPDIEIFEEGLLDSFGTVELLLAIENRFDILVPITEFDRDVWNTPNNIVNQLSELK
>Q03AZ0 ~~~dltC~~~D-alanyl carrier protein~~~
MADEAIKNGVLDILADLTGSDDVKTNLDLNLFETGLLDSMGTVQLLLELQSQFGVEAPVSEFDRSQWDTPNKIIAKVEQA
Q
>P55153 ~~~dltC~~~D-alanyl carrier protein~~~COG0236
MADEAIKNGVLDILADLTGSDDVKKNLDLNLFETGLLDSMGTVQLLLELQSQFGVDAPVSEFDRKEWDTPNKIIAKVEQA
Q
>P63957 ~~~dltC~~~D-alanyl carrier protein~~~COG0236
MDIKSEVIEIIDELFMEDVSDMMDEDLFDAGVLDSMGTVELIVEIENRFDIRVPVTEFGRDDWNTANKIIAGIVELQNA
>Q5M4V3 ~~~dltC~~~D-alanyl carrier protein~~~COG0236
MDVKAEVIEIIDELFMEDVSDMMDEDLFDAGVLDSMGTVELIVELESRFDIRVPVSEFGRDDWNTANKIVEGVTELRNA
>Q2FZW3 ~~~dltD~~~Protein DltD~~~COG3966
MKLKPFLPILISGAVFIVFLLLPASWFTGLVNEKTVEDNRTSLTDQVLKGTLIQDKLYESNKYYPIYGSSELGKDDPFNP
AIALNKHNANKKAFLLGAGGSTDLINAVELASQYDKLKGKKLTFIISPQWFTNHGLTNQNFDARMSQTQINQMFQQKNMS
TELKRRYAQRLLQFPHVHNKEYLKSYAKNPKETKDSYISGFKENQLIKIEAIKSLFAMDKSPLEHVKPATKPDASWDEMK
QKAVEIGKADTTSNKFGIRDQYWKLIQESKRKVRRDYEFNVNSPEFQDLELLVKTMRAAGADVQYVSIPSNGVWYDHIGI
DKERRQAVYKKIHSTVVDNGGKIYDMTDKDYEKYVISDAVHIGWKGWVYMDEQIAKHMKGEPQPEVDKPKN
>P96578 5.3.1.15~~~ydaE~~~Probable D-lyxose ketol-isomerase~~~COG1917
MGITKEEVNSYYQKAGIVLTDEEVDQIQLMDYGLGKERKVGLQLFVYVNTDRYCSKELVLFPGQTCPEHRHPPVDGQEGK
QETFRCRYGKVYLYVEGEKTPLPKVLPPQEDREHYTVWHEIELEPGGQYTIPPNTKHWFQAGEEGAVVTEMSSTSTDKHD
IFTDPRI
>A3E7Z6 5.3.1.15~~~~~~D-lyxose ketol-isomerase~~~
MRGTEWREARDRVAEMFRKAGIALTPSELEKVEVADFGLGNLAVQGLQLVTYINTDRYCAKELALFPHQTCPEHLHPPVG
GDPGKMETFRCRWGKVFLYVEGEPAASVQAAVPPGSEAYYTVFHEIVLTPGEQYTIPPGTKHWFQGGPEGAIVSEFSSTS
RDEFDIFTDPKVERMPVIEFDD
>P0AEE8 2.1.1.72~~~dam~~~DNA adenine methylase~~~COG0338
MKKNRAFLKWAGGKYPLLDDIKRHLPKGECLVEPFVGAGSVFLNTDFSRYILADINSDLISLYNIVKMRTDEYVQAAREL
FVPETNCAEVYYQFREEFNKSQDPFRRAVLFLYLNRYGYNGLCRYNLRGEFNVPFGRYKKPYFPEAELYHFAEKAQNAFF
YCESYADSMARADDASVVYCDPPYAPLSATANFTAYHTNSFTLEQQAHLAEIAEGLVERHIPVLISNHDTMLTREWYQRA
KLHVVKVRRSISSNGGTRKKVDELLALYKPGVVSPAKK
>P0DMP4 2.1.1.72~~~dam~~~DNA adenine methylase~~~COG0338
MKKNRAFLKWAGGKYPLLDDIKRHLPKGECLVEPFVGAGSVFLNTDFSRYILADINSDLISLYNIVKLRTDEYVQASREL
FMPETNQAEVYYQLREEFNTCQDPFRRAVLFLYLNRYGYNGLCRYNLRGEFNVPFGRYKRPYFPEAELYHFAEKAQNAFF
YCESYADSMARADKSSVVYCDPPYAPLSATANFTAYHTNSFSLTQQAHLAEIAENLVSNRIPVLISNHDTALTREWYQLA
KLHVVKVRPSISSNGGTRKKVDELLALYQPGVATPARK
>Q0QLE2 4.2.1.85~~~dmdA~~~2,3-dimethylmalate dehydratase large subunit~~~
MGMTMTQKILAAHASLDSVKAGDLIMADLDMVLANDITGPVAINVFGTIDKEKVFDKDKIALVPDHFAPAKDIKSAQQCK
QVRCFACDQEITNYFEIGEMGIEHALLPEKGLVAAGDVVIGADSHTCTYGALGAFSTGVGSTDMAVGMATGKAWFKVPAA
LRFNLTGTLNKNVSGKDLILHIIGMIGVDGALYRSMEFTGPGVACLSMDDRFTISNMAIEAGGKNGIFPVDDQTISYMEE
HGSGDYKVYAADADAVYEKTFDIDLSQLKSTVAFPHLPENTKTVDAIEEPVTIDQVVIGSCTNGRFEDLKRAADILRGKH
VKKGVRMLVIPATHKIYLDAMEAGYLREFIEAGATISTPTCGPCLGGYMGILAEGERCVSTTNRNFVGRMGHVDSEVYLA
SPEVAAASAILGRIATPDEL
>Q4FP21 2.1.1.269~~~dmdA~~~Dimethylsulfonioproprionate demethylase DmdA~~~COG0404
MKNFSIAKSRRLRSTPYTSRIEKQGVTAYTIYNHMLLPAAFGSIEDSYKHLKEHVQIWDVAAERQVEISGKDSAELVQLM
TCRDLSKSKIGRCYYCPIIDENGNLVNDPVVLKLDENKWWISIADSDVIFFAKGLASGHKFDVKIVEPVVDIMAIQGPKS
FALMEKVFGKKITELKFFGFDYFDFEGTKHLIARSGWSKQGGYEVYVENTQSGQKLYDHLFEVGKEFNVGPGCPNLIERI
ESALLSYGNDFDNNDNPFECGFDQYVSLDSDINFLGKEKLKEIKLKGPQKKLRGVKIDIKEISLTGSKNIYDENNNVIGE
LRSACYSPHFQKVIGIAMIKKSHWEASQGFKIQINDNTINGNVCDLPFI
>Q5LS57 2.1.1.269~~~dmdA~~~Dimethylsulfonioproprionate demethylase DmdA~~~COG0404
MASIFPSRRVRRTPFSAGVEAAGVKGYTVYNHMLLPTVFDSLQADCAHLKEHVQVWDVACERQVSIQGPDALRLMKLISP
RDMDRMADDQCYYVPTVDHRGGMLNDPVAVKLAADHYWLSLADGDLLQFGLGIAIARGFDVEIVEPDVSPLAVQGPRADD
LMARVFGEAVRDIRFFRYKRLAFQGVELVVARSGWSKQGGFEIYVEGSELGMPLWNALFAAGADLNVRAGCPNNIERVES
GLLSYGNDMTRENTPYECGLGKFCNSPEDYIGKAALAEQAKNGPARQIRALVIGGEIPPCQDAWPLLADGRQVGQVGSAI
HSPEFGVNVAIGMVDRSHWAPGTGMEVETPDGMRPVTVREGFWR
>Q0QLE1 4.2.1.85~~~DmdB~~~2,3-dimethylmalate dehydratase small subunit~~~
MKAKGSVFRYGDNVDTDVIIPARFLNTSDPLELAAHCMEDIDADFSSKVNAGDIIVADDNFGCGSSREHAPISIKASGVS
CVIANSFARIFYRNAINIGLPILECPEAVAVIEAGDEVEVDFDSGVITDVTKGQSFQGQAFPEFMQTLIAAGGLVNYINA
TEK
>Q5LRT0 6.2.1.44~~~dmdB~~~3-methylmercaptopropionyl-CoA ligase~~~COG0318
MLGQMMYQPLLISSLIDHAARYHGEAQIWSVSTEGGVEETNWAGIADNARRLGSVLTDAGLAPQSRVATLAWNNRRHLEI
YYGVSGAGFVLHTINPRLFPEQLVYILNHAEDRILFFDATFLPLVEGIRPHLTTVERLVLMGPRDEAAAARIEGLEFYDE
FVATGDAGFDWPDLDERTASSLCYTSGTTGNPKGVLYSHRSTVLHSFGSNTRDCIGFSARDVVMPVVPMFHVNAWGTPYA
CAMSGSCMVLPGPDLHGEALVGLIDRYRVTIALGVPTIWQGLLATARAKGSTLESLTRTVIGGAACPPSMIAEFRDRYGV
DTVHAWGMSEMSPLGTTNQPLAKHGALPIEAQHKLRENQGRPPYGVELKIVDDDGNTLPNDGQTQGDLMVRGHWVLDSYF
QLQDQPILSDGWFATGDVATLDRDGYMTIRDRSKDIIKSGGEWISSVELENIAVAHPKLATAAVIGVPHPKWDERPLLVA
VKAEGETPDEAELLAFFDGKIAKWQVPDRVVFVEALPLNATGKVLKRTLREQFRDVLTG
>Q5LLW7 1.3.8.-~~~dmdC~~~3-methylmercaptopropionyl-CoA dehydrogenase~~~COG1960
MTYQAPVRDIMFAIEHLSQWPQVEALQTYSEIELDDARAALEEFGRFCGEMIAPLSTIGDTEGARLENGRVVLPEGYKTA
YDQFVDMGWQSLSHPAEHGGMGLPKVVGAAATEIVNSADMSFGLCPLLTNGAIDALSITGSDAQKAFYLDKLITGRWSGT
MNLTEPQAGSDLSRVRCTAVPQDDGTYAISGTKIFITFGEHDLSENIVHLVLARTPDAPEGVRGLSLFVVPKLLAGEGGE
TSQRNTLGCVSLEHKLGVRASPTAVMEYDNATGYLVGEENSGLRYMFIMMTSARYAVGVQGVAIAERAYQHALSYARDRI
QSRPVDGSAQDAVPIIQHPDVRRMLLRMRALTEGGRALAIATGGWLDLAEHGPEEARAEAQSMAEFLVPLVKGFCTERAV
EVASLGVQIHGGMGFIEETGVAQFYRDARILPIYEGTTAIQANDLLGRKVLRDGGRTARRFAEMIAATEGELSKGGAAAQ
RIAQRLAEARAAFAAGLDHLLATAGQDPNRAYAGSVPFLMLTGNLATGWQLGLSALAAEAELAKGGDAEFLQAKIATADI
FAQQVLVECSAEHSRITDTGDSLLTASL
>Q5LLW6 4.2.1.155~~~dmdD~~~Methylthioacryloyl-CoA hydratase~~~COG1024
MTQDVTSGYSNLDLDLRDNGVCVVTLNRPDKRNALDVATIEELVTFFSTAHRKGVRAVVLTGAGDHFCAGLDLVEHWKAD
RSADDFMHVCLRWHEAFNKMEYGGVPIIAALRGAVVGGGLELASAAHLRVMDQSTYFALPEGQRGIFTGGGATIRVSDMI
GKYRMIDMILTGRVYQGQEAADLGLAQYITEGSSFDKAMELADKIASNLPLTNFAICSAISHMQNMSGLDAAYAEAFVGG
IVNTQPAARERLEAFANKTAARVRPNS
>Q9LCC1 3.5.1.56~~~dmfA1~~~N,N-dimethylformamidase alpha subunit~~~
MTEASESCVRDPSNYRDRSADWYAFYDERRRKEIIDIIDEHPEIVEEHAANPFGYRKHPSPYLQRVHNYFRMQPTFGKYY
IYSEREWDAYRIATIREFGELPELGDERFKTEEEAMHAVFLRRIEDVRAELA
>C9DQ22 3.5.1.56~~~dmfA1~~~N,N-dimethylformamidase alpha subunit~~~
MNQRRENYVSDPSAYPDRSADWYEYFDRKRREEIIEIIDSHPEIIDEHERNPFGYRNHPSPHLQRVHNYFRMQPTFGKYY
IYTEREWSSYRIAEIREFGKLPVLTDDSFATEEEAMHAVFLKRIEDVRNELSQAEQREIAN
>Q9LCC0 3.5.1.56~~~dmfA2~~~N,N-dimethylformamidase beta subunit~~~
MKDIAIRGYCDRPSVATGETIRFYVSANETRGTFDAELVRLIHGDSNPAGPGYKEEAIKSDLEGQYPARFQRTQFGSYVE
VADPDAGLQPDGAFSVHLFLWSTTPSRGRQGIASRWNDERQSGWNLAIEDGRVVFTIGDGSGATSSVVSDRPLFQQIWYS
ITGVYDPEKKQLRLYQKSVVNRTNSRFGLVVPLDSDCAVSADATVKAADSETSLLIAGLGEAAAQDGRTWCIAHYNGKVD
APKIYGCALGQDDAEKLSRGEIVRPISRLAHWDFSAGIGLNGIPTDHVVDASGNGHHGRCMNQPDRGSTGWNWDGHEENF
IHCPEQYGALWFHEDCLDDCRWEKDFEFTVPEGLKSDFYAVKIRYEDTEDYIPFFVLPPRGTATAPILVIASTLSYLAYA
NEQIMHKADIGQAVAGHTPVLNENDVELHKNLSYYGLSTYDGHIDGRGVQYTSWRRPIMNLRPKHRQGFGSIWELPADLH
LIDWLNHNGFEYDVATEHDLNDQGAELLRRYKVVLTGSHPEYQTWANADAWEDYLADGGRGMYLAANGMYWIVEVHPEKP
WVMEVRKELGVTAWEAPPGEYHYSTNGRRGGRFRGRARATQKIWGTGMSSFGFDHSGYFVQMPDSQDERVAWIMEGIDPE
ERIGDGGLVGGGAGGYELDRYDLALGTPPNTLLLASSVEHSVVYTVIPDDKAFPHPGMNGGEHPFVRADITYFSTANGGG
MFATSSISWLGSLSWNDYDNNVSKMTKNVLNQFIKDEPAPRV
>C9DQ21 3.5.1.56~~~dmfA2~~~N,N-dimethylformamidase beta subunit~~~
MKTVKIRGYCDRLSAAPGETIRFYVSADTSDGSYEAELVRLIHGDTNPAGPGYKEESVKSAADGSYPARFQRTQFGSFVE
VPDASGALLADGAFSVHTFLWSTTPGRGRQGLVSRWDDERQCGWSLAIEEGRLVFTIGDESGTTNRVISDRPLFQEVWYS
VTAVFDPVQKTISLHQKSVVNRTNSRFGLVVPLDSDTTVSAVSQVSPGDSRTSLLIAGLGEAAAADGRTWCIANFNGKID
APKLYGRALSSEEAMKLAEGTVAEPWSRLAHWDFSAGIGPDGIPTDHVVDISGNGHHGQCVNQPDRGSTGWNWDGHEENF
IHCPQQYGALWFHEDCLDDCRWDKDFEITLPDGLKSDFYAMKIRYGDSEDYIPFFVLPPRGKATAKILVLASTFSYLAYA
NEQIMHKADIGQAVAGHTPVLNENDVELHRNLDYYGLSTYDGHVDGRGVQYTSWRRPILNLRPKHRQGFGSIWELPADLH
LIDWLNHNGFDYDVATEHDLNEQGVDLLRRYNVVLTGSHPEYQTWANADAWEDYLADGGRGMYLAANGMYWIVSVHPEKP
WVMEVRKELGVTAWEAPPGEYHYSTNGRRGGRFRGRARATQKIWGTGMSSFGFDHSGYFVQMPDSQDKRAAWIMDGIDPD
ERIGDGGLVGGGAGGYELDRYDLSLGTPPNTLLLASSVEHSVVYTVIPDDKSFPHPGMNGGEHPFVRADITYFSTANGGG
MFSTSSISWLGSLSWNNYDNNVSRMTRNVLTQFMKDEPAPLV
>Q1QT89 4.2.1.-~~~manD~~~D-galactonate dehydratase family member ManD~~~COG4948
MKIRDAYTIVTCPGRNFVTLKIVTESGTHGIGDATLNGREMAVAAYLDEHVVPALIGRDAGRIEDTWQYLYRGAYWRRGP
VTMTAIAAVDMALWDIKAKAAGMPLYQLLGGKSRERVMTYAHCTGQTIEDCLGEVARHVELGYRAVRVQSGVPGIETTYG
VAKTPGERYEPADSSLPAEHVWSTEKYLNHAPKLFAAVRERFGDDLHVLHDVHHRLTPIEAARLGKAVEPYHLFWLEDCV
PAENQESLRLIREHTTTPLAIGEVFNSIHDCRELIQNQWIDYIRMPLTHGGGITAMRRVADLASLYHVRTGFHGPTDLSP
VCLGAAIHFDTWVPNFGIQEHMPHTDETDAVFPHDYRFEDGHFLAGESPGHGVDIDEELAAKYPYERASLPVNRLEDGTL
WHW
>E1V4Y0 4.2.1.-~~~rspA~~~D-galactonate dehydratase family member RspA~~~COG4948
MKIERAYTIVTAPGRNFVTLKIVTDEGTYGIGDATLNGREMAVVAYLEEHVIPALIGRDPQRIEDIWHYLYRGAYWRRGP
VTMSAIGAVDMALWDIKAKVAGMPLYQLLGGKSRERVMVYGHATGKDIEACLDEVARHVEEGYRAVRVQAGVPGIASIYG
VAKKPGERYEPADAELPAEHVWNTAKYLNHAPKLFAAVRERFGDDLHVLHDVHHRLTPIEAARLGKEVEPFNLFWLEDCV
PAENQESFGLIRQHTTTPLAVGEVFNSLYDAKALIENQWIDYIRAPLTHAGGITHVRRLADLAGLYHVRTGFHGPTDLSP
VCLGAAIHFDTWVPNFGIQEYMPHEAVTDEVFPHDYRFEDGHFLVGETPGHGVDIDEEKARKYPYRRASLPVNRLEDGTL
WHW
>C6CBG9 4.2.1.-~~~~~~D-galactonate dehydratase family member Dd703_0947~~~COG4948
MSKLKITNVKTILTAPGGIDLAVVKVETNEPGLYGLGCATFTQRIFAVKSAIDEYMAPFLIGKDPTRIEDIWQSAAVSGY
WRNGPIMNNALSGVDMALWDIKGKLAGMPVYELLGGKCRDGIPLYCHTDGGDEVEVEDNIRARMEEGYQYVRCQMGMYGG
AGTDDLKLIATQLARAKNIQPKRSPRSKTPGIYFDPEAYAKSVPRLFEHLRNKLGFGIEFIHDVHERVTPVTAIQLAKTL
EPYQLFYLEDPVAPENIDWLRMLRQQSSTPISMGELFVNINEWKPLIDNKLIDYIRCHVSTIGGITPAKKLAVYSELNGV
RTAWHGPGDISPVGVCANMHLDMSSPNFGIQEYTPMNDALREVFPGCPEIDQGYAYVNDKPGLGIDINETLAEKYPCDGG
IPSWTMARTPDGTASRP
>Q9AGP8 1.5.3.10~~~dmg~~~Dimethylglycine oxidase~~~
MASTPRIVIIGAGIVGTNLADELVTRGWNNITVLDQGPLNMPGGSTSHAPGLVFQTNPSKTMASFAKYTVEKLLSLTEDG
VSCFNQVGGLEVATTETRLADLKRKLGYAAAWGIEGRLLSPAECQELYPLLDGENILGGLHVPSDGLASAARAVQLLIKR
TESAGVTYRGSTTVTGIEQSGGRVTGVQTADGVIPADIVVSCAGFWGAKIGAMIGMAVPLLPLAHQYVKTTPVPAQQGRN
DQPNGARLPILRHQDQDLYYREHGDRYGIGSYAHRPMPVDVDTLGAYAPETVSEHHMPSRLDFTLEDFLPAWEATKQLLP
ALADSEIEDGFNGIFSFTPDGGPLLGESKELDGFYVAEAVWVTHSAGVAKAMAELLTTGRSETDLGECDITRFEDVQLTP
EYVSETSQQNFVEIYDVLHPLQPRLSPRNLRVSPFHARHKELGAFFLEAGGWERPYWFEANAALLKEMPAEWLPPARDAW
SGMFSSPIAAAEAWKTRTAVAMYDMTPLKRLEVSGPGALKLLQELTTADLAKKPGAVTYTLLLDHAGGVRSDITVARLSE
DTFQLGANGNIDTAYFERAARHQTQSGSATDWVQVRDTTGGTCCIGLWGPLARDLVSKVSDDDFTNDGLKYFRAKNVVIG
GIPVTAMRLSYVGELGWELYTSADNGQRLWDALWQAGQPFGVIAAGRAAFSSLRLEKGYRSWGTDMTTEHDPFEAGLGFA
VKMAKESFIGKGALEGRTEEASARRLRCLTIDDGRSIVLGKEPVFYKEQAVGYVTSAAYGYTVAKPIAYSYLPGTVSVGD
SVDIEYFGRRITATVTEDPLYDPKMTRLRG
>P76251 1.1.1.83~~~dmlA~~~D-malate dehydrogenase [decarboxylating]~~~COG0473
MMKTMRIAAIPGDGIGKEVLPEGIRVLQAAAERWGFALSFEQMEWASCEYYSHHGKMMPDDWHEQLSRFDAIYFGAVGWP
DTVPDHISLWGSLLKFRREFDQYVNLRPVRLFPGVPCPLAGKQPGDIDFYVVRENTEGEYSSLGGRVNEGTEHEVVIQES
VFTRRGVDRILRYAFELAQSRPRKTLTSATKSNGLAISMPYWDERVEAMAENYPEIRWDKQHIDILCARFVMQPERFDVV
VASNLFGDILSDLGPACTGTIGIAPSANLNPERTFPSLFEPVHGSAPDIYGKNIANPIATIWAGAMMLDFLGNGDERFQQ
AHNGILAAIEEVIAHGPKTPDMKGNATTPQVADAICKIILR
>P76250 ~~~dmlR~~~HTH-type transcriptional regulator DmlR~~~COG0583
MNNLPLLNDLRVFMLVARRAGFAAVAEELGVSPAFVSKRIALLEQTLNVVLLHRTTRRVTITEEGERIYEWAQRILQDVG
QMMDELSDVRQVPQGMLRIISSFGFGRQVVAPALLALAKAYPQLELRFDVEDRLVDLVNEGVDLDIRIGDDIAPNLIARK
LATNYRILCASPEFIAQHGAPKHLTDLSALPCLVIKERDHPFGVWQLRNKEGPHAIKVTGPLSSNHGEIVHQWCLDGQGI
ALRSWWDVSENIASGHLVQVLPEYYQPANVWAVYVSRLATSAKVRITVEFLRQYFAEHYPNFSLEHA
>Q0QLE4 4.1.3.32~~~Dml~~~2,3-dimethylmalate lyase~~~
MNTAAKMRELLSTKKMVVAPGAHDAMTAKVIGRLGFDAVYMTGYGQSASHLGQPDVGLLTMTEMVARANAIVEAAGVPVI
ADADTGFGNAVNVMRTVREYEKAGVAVIQLEDQVMPKKCGHMVGREIVSKEEMVGKIKAAVDTRVNPDFMIMARTDARTT
KGIDEALERGLAYKEAGADIIFIESPEGEEEMKRINETIPGYTLANMVEGGRTPLLKNAELEALGYNITIYPTASIYVAT
KAMVDLWTALKNDDTTAGVMDTMVTFSEFNDLMGLEKIREVEHNYATGR
>E9JFX9 1.14.13.131~~~dmoA~~~Dimethyl-sulfide monooxygenase~~~
MKKRIVLNAFDMTCVSHQSAGTWRHPSSQAARYNDLEYWTNMAMELERGCFDCLFIADVVGVYDVYRGSAEMALRDADQV
PVNDPFGAISAMAAVTEHVGFGVTAAITFEQPYLLARRLSTLDHLTKGRVAWNVVSSYLNSAALNIGMDQQLAHDERYEM
ADEYMEVMYKLWEGSWEDDAVKRDKKSGVFTDGSKVHPINHQGKYYKVPGFHICEPSPQRTPVIFQAGASGRGSKFAASN
AEGMFILTTSVEQARQITTDIRNQAEAAGRSRDSIKIFMLLTVITGDSDEAAEAKYQEYLSYANPEGMLALYGGWTGIDF
AKLDPDEPLQAMENDSLRTTLESLTHGENAKKWTVRDVIRERCIGGLGPVLVGGPQKVADELERWVDEGGVDGFNLAYAV
TPGSVTDFIDYIVPELRKRGRAQDSYKPGSLRRKLIGTNDGRVESTHPAAQYRDAYVGKESVADRTQPSPFANAKAPVAE
>P19729 ~~~dmpK~~~Phenol 2-monooxygenase, auxiliary component DmpK~~~
MTVTNTPTPTFDQLTRYIRVRSEPEAKFVEFDFAIGHPELFVELVLPQDAFVKFCQHNRVVAMDEAMAKAVDDDMVKWRF
GDVGRRLPKDPG
>Q7WTJ6 1.14.13.244~~~mphL~~~Phenol 2-monooxygenase, oxygenase component MhpL~~~
MTLEIKTSNVEPIRQNYAYIERRFGSKPATRYQEVSFDVQAETNFHYRPLWKPEKTLNDKTHTALQMQDWYAFKDPRQFY
YGTYVQHRARLQDTAESNFAFFEKRQLAEHLSNEVKAKVIECLLPFRHVEQTANLHMMSGSAYGYGTVLTQACIYAAMDH
LGIAQYISRIGLALDGNSGDSLQQAKQAWMQHPAWQGLRRLCEESLTEQDYFKLFLLQNLVIDGFVTELVYQQFDQWLVS
QNARDLAMLTEFMKDTLGDLRKWSDTVIKTAAAESDHNKQLLNEWFTESLAQVKAAFTPWATAALTVDAVDQAEQAVIER
AKKLGLQPLTNA
>P19730 1.14.13.244~~~dmpL~~~Phenol 2-monooxygenase, oxygenase component DmpL~~~
MSVEIKTNTVDPIRQTYGNLQRRFGDKPASRYQEASYDIEAVTNFHYRPLWDPQHELHDPTRTAIRMTDWHKVTDPRQFY
YGAYVQTRARMQEATEHAYGFCEKRELLSRLPAELQAKLLRCLVPLRHAELGANMNNSSIAGDSIAATVTQMHIYQAMDR
LGMGQYLSRIGLLLDGGTGEALDQAKAYWLDDPIWQGLRRYVEDSFVIRDWFELGLAQNLVLDGLLQPLMYQRFDQWLTE
NGGSDVAMLTEFMRDWYGESTRWVDAMFKTVLAENDANREQVQAWLEVWEPRAYEALLPLAEEATGIAALDEVRSAFATR
LQKIGLKSREE
>P19731 1.14.13.244~~~dmpM~~~Phenol 2-monooxygenase, stimulatory component DmpM~~~
MSSLVYIAFQDNDNARYVVEAIIQDNPHAVVQHHPAMIRIEAEKRLEIRRETVEENLGRAWDVQEMLVDVITIGGNVDED
DDRFVLEWKN
>P19732 1.14.13.244~~~dmpN~~~Phenol 2-monooxygenase, oxygenase component DmpN~~~
MATHNKKRLNLKDKYRYLTRDLAWETTYQKKEDVFPLEHFEGIKITDWDKWEDPFRLTMDTYWKYQAEKEKKLYAIFDAF
AQNNGHQNISDARYVNALKLFLTAVSPLEYQAFQGFSRVGRQFSGAGARVACQMQAIDELRHVQTQVHAMSHYNKHFDGL
HDFAHMYDRVWYLSVPKSYMDDARTAGPFEFLTAVSFSFEYVLTNLLFVPFMSGAAYNGDMATVTFGFSAQSDEARHMTL
GLEVIKFMLEQHEDNVPIIQRWIDKWFWRGYRLLTLIGMMMDYMLPNKVMSWSEAWGVYFEQAGGALFKDLERYGIRPPK
YVEQTTIGKEHITHQVWGALYQYSKATSFHTWIPGDEELNWLSEKYPDTFDKYYRPRFEFWREQQAKGERFYNDTLPHLC
QVCQLPVIFTEPDDPTKLSLRSLVHEGERYQFCSDGCCDIFKNEPVKYIQAWLPVHQIYQGNCEGGDVETVVQKYYHIKS
GVDNLEYLGSPEHQRWLALKGQTPPTAAPADKSLGAA
>P19733 1.14.13.244~~~dmpO~~~Phenol 2-monooxygenase, oxygenase component DmpO~~~
MTVNSIGEYTATPRDVQANFNGMQLLYLYWEEHLMYCSALAFLVAPGMPFAEFLEQVLKPAIHAHPDSAKIDFSQALWQL
NDQPFTPDYAASLEANGIDHKSMLRLNTPGLNGIQGSCS
>Q7WTJ2 1.14.13.7~~~mphP~~~Phenol hydroxylase P5 protein~~~COG2871
MSYQVTIEPIGTTIEVEEDQTILDAALRQGVWLPFACGHGTCGTCKVQVTDGFYDVGEASPFALMDIERDENKVLACCCK
PQSDMVIEADVDEDPDFLGHLVQDYQATVIEIKDLSPTIKGIRLQLDRPIEFQAGQYINVQFPNIEGTRAFSIANSPSEV
GIVELHIRKVEGGAATTYVHEQLATGDQLDISGPYGQFFVRKSDDQNAIFIAGGSGLSSPQSMILDLLESGDSRTIYLFQ
GARDLAELYNRELFEQLVKDYPNFRYIPALNAPKPEDQWTGFTGFVHEAVADYFENRCGGHKAYLCGPPIMIDSAISTLM
QSRLFERDIHTERFLSAADGAAGQSRSALFKHI
>P19734 1.14.13.244~~~dmpP~~~Phenol 2-monooxygenase, reductase component DmpP~~~
MSYNVTIEPTGEVIEVEDGQTILQAALRQGVWLPFACGHGTCATCKVQVVEGEVDIGEASPFALMDIERDERKVLACCAI
PLSDLVIEADVDADPDFLGHPVEDYRGVVSALVDLSPTIKGLHIKLDRPMPFQAGQYVNLALPGIDGTRAFSLANPPSRN
DEVELHVRLVEGGAATGFIHKQLKVGDAVELSGPYGQFFVRDSQAGDLIFIAGGSGLSSPQSMILDLLERGDTRRITLFQ
GARNRAELYNCELFEELAARHPNFSYVPALNQANDDPEWQGFKGFVHDAAKAHFDGRFGGQKAYLCGPPPMIDAAITTLM
QGRLFERDIFMERFYTAADGAGESSRSALFKRI
>C5B2R8 1.5.1.47~~~dmrA~~~Dihydromethanopterin reductase~~~COG0262
MIDVRCICAIGQRGQLGLNGHLPWEGNTDPLFVEDVTRFFALTMGHVLIAGPKTVASVPEFAFKDRTIDVIRSHEDPEAV
LKRYPGRRIFVGGGIAVWNVYAKYIQHWDVTRLPYDGEADRWFDPAWLVGGPLRS
>G0FUS0 2.1.1.315~~~~~~27-O-demethylrifamycin SV methyltransferase~~~
MTKPTPNEIGKGYDAFADLLDQLWGENLHHGYWDDESATLEEATTRLTDRLAGMLPLRAGDRLLDIGCGNGEPAIRMATA
NDVMVTGISISEKQVERANDRAYKADVDDRVVFEYADAMELPYPDASFDVVWALESLHHMPDRWHVIRQAARVLRPGGRL
ALGDFLLVPSPAGLEADAERVREVGKGVVAVVSLDEYQAHLREAGLEPESAEDVSQYTRPSWTKAAERFEGLREQALQHI
EAAQFEVTLGRFRAFSEEPSLGYVLLTARKPD
>P18775 1.8.5.3~~~dmsA~~~Dimethyl sulfoxide reductase DmsA~~~COG0243
MKTKIPDAVLAAEVSRRGLVKTTAIGGLAMASSALTLPFSRIAHAVDSAIPTKSDEKVIWSACTVNCGSRCPLRMHVVDG
EIKYVETDNTGDDNYDGLHQVRACLRGRSMRRRVYNPDRLKYPMKRVGARGEGKFERISWEEAYDIIATNMQRLIKEYGN
ESIYLNYGTGTLGGTMTRSWPPGNTLVARLMNCCGGYLNHYGDYSSAQIAEGLNYTYGGWADGNSPSDIENSKLVVLFGN
NPGETRMSGGGVTYYLEQARQKSNARMIIIDPRYTDTGAGREDEWIPIRPGTDAALVNGLAYVMITENLVDQAFLDKYCV
GYDEKTLPASAPKNGHYKAYILGEGPDGVAKTPEWASQITGVPADKIIKLAREIGSTKPAFISQGWGPQRHANGEIATRA
ISMLAILTGNVGINGGNSGAREGSYSLPFVRMPTLENPIQTSISMFMWTDAIERGPEMTALRDGVRGKDKLDVPIKMIWN
YAGNCLINQHSEINRTHEILQDDKKCELIVVIDCHMTSSAKYADILLPDCTASEQMDFALDASCGNMSYVIFNDQVIKPR
FECKTIYEMTSELAKRLGVEQQFTEGRTQEEWMRHLYAQSREAIPELPTFEEFRKQGIFKKRDPQGHHVAYKAFREDPQA
NPLTTPSGKIEIYSQALADIAATWELPEGDVIDPLPIYTPGFESYQDPLNKQYPLQLTGFHYKSRVHSTYGNVDVLKAAC
RQEMWINPLDAQKRGIHNGDKVRIFNDRGEVHIEAKVTPRMMPGVVALGEGAWYDPDAKRVDKGGCINVLTTQRPSPLAK
GNPSHTNLVQVEKV
>P18776 ~~~dmsB~~~Anaerobic dimethyl sulfoxide reductase chain B~~~COG0437
MTTQYGFFIDSSRCTGCKTCELACKDYKDLTPEVSFRRIYEYAGGDWQEDNGVWHQNVFAYYLSISCNHCEDPACTKVCP
SGAMHKREDGFVVVDEDVCIGCRYCHMACPYGAPQYNETKGHMTKCDGCYDRVAEGKKPICVESCPLRALDFGPIDELRK
KHGDLAAVAPLPRAHFTKPNIVIKPNANSRPTGDTTGYLANPKEV
>P18777 ~~~dmsC~~~Anaerobic dimethyl sulfoxide reductase chain C~~~COG3302
MGSGWHEWPLMIFTVFGQCVAGGFIVLALALLKGDLRAEAQQRVIACMFGLWVLMGIGFIASMLHLGSPMRAFNSLNRVG
ASALSNEIASGSIFFAVGGIGWLLAMLKKLSPALRTLWLIVTMVLGVIFVWMMVRVYNSIDTVPTWYSIWTPMGFFLTMF
MGGPLLGYLLLSLAGVDGWAMRLLPAISVLALVVSGVVSVMQGAELATIHSSVQQAAALVPDYGALMSWRIVLLAVALCL
WIAPQLKGYQPAVPLLSVSFILLLAGELIGRGVFYGLHMTVGMAVAS
>P69853 ~~~dmsD~~~Tat proofreading chaperone DmsD~~~COG3381
MTHFSQQDNFSVAARVLGALFYYAPESAEAAPLVAVLTSDGWETQWPLPEASLAPLVTAFQTQCEETHAQAWQRLFVGPW
ALPSPPWGSVWLDRESVLFGDSTLALRQWMREKGIQFEMKQNEPEDHFGSLLLMAAWLAENGRQTECEELLAWHLFPWST
RFLDVFIEKAEHPFYRALGELARLTLAQWQSQLLIPVAVKPLFR
>Q8ZPK0 ~~~dmsD~~~Tat proofreading chaperone DmsD~~~
MTTFLQRDDFAVTARVLGALFYYSPESHETAPLVQALLNDDWQAQWPLDAEALAPVAAMFKTHSEESLPQAWQRLFIGPY
ALPSPPWGSVWLDRESVLFGDSTLALRQWMRENGIQFEMQQNEPEDHFGSLLLLAAWLAENDRHHECEQLLAWHLFPWSS
RFLDVFIDHAGHPFYQALGQLARLTLAQWQAQLIIPVAVKPLFR
>O66659 ~~~dnaA~~~Chromosomal replication initiator protein DnaA~~~COG0593
MELNALIKKIESVDSYAREFLKKFEIKQEKGKFLFIAPGEDYREWLETIVNTFLEEEVRKLIEVKEKEEKKKVEIKDFLN
PKYTLENFIVGEGNRLAYEVVKEALENLGSLYNPIFIYGSVGTGKTHLLQAAGNEAKKRGYRVIYSSADDFAQAMVEHLK
KGTINEFRNMYKSVDLLLLDDVQFLSGKERTQIEFFHIFNTLYLLEKQIILASDRHPQKLDGVSDRLVSRFEGGILVEIE
LDNKTRFKIIKEKLKEFNLELRKEVIDYLLENTKNVREIEGKIKLIKLKGFEGLERKERKERDKLMQIVEFVANYYAVKV
EDILSDKRNKRTSEARKIAMYLCRKVCSASLIEIARAFKRKDHTTVIHAIRSVEEEKKKDRKFKHLVGFLEKQAFDKIC
>P05648 ~~~dnaA~~~Chromosomal replication initiator protein DnaA~~~COG0593
MENILDLWNQALAQIEKKLSKPSFETWMKSTKAHSLQGDTLTITAPNEFARDWLESRYLHLIADTIYELTGEELSIKFVI
PQNQDVEDFMPKPQVKKAVKEDTSDFPQNMLNPKYTFDTFVIGSGNRFAHAASLAVAEAPAKAYNPLFIYGGVGLGKTHL
MHAIGHYVIDHNPSAKVVYLSSEKFTNEFINSIRDNKAVDFRNRYRNVDVLLIDDIQFLAGKEQTQEEFFHTFNTLHEES
KQIVISSDRPPKEIPTLEDRLRSRFEWGLITDITPPDLETRIAILRKKAKAEGLDIPNEVMLYIANQIDSNIRELEGALI
RVVAYSSLINKDINADLAAEALKDIIPSSKPKVITIKEIQRVVGQQFNIKLEDFKAKKRTKSVAFPRQIAMYLSREMTDS
SLPKIGEEFGGRDHTTVIHAHEKISKLLADDEQLQQHVKEIKEQLK
>P03004 ~~~dnaA~~~Chromosomal replication initiator protein DnaA~~~COG0593
MSLSLWQQCLARLQDELPATEFSMWIRPLQAELSDNTLALYAPNRFVLDWVRDKYLNNINGLLTSFCGADAPQLRFEVGT
KPVTQTPQAAVTSNVAAPAQVAQTQPQRAAPSTRSGWDNVPAPAEPTYRSNVNVKHTFDNFVEGKSNQLARAAARQVADN
PGGAYNPLFLYGGTGLGKTHLLHAVGNGIMARKPNAKVVYMHSERFVQDMVKALQNNAIEEFKRYYRSVDALLIDDIQFF
ANKERSQEEFFHTFNALLEGNQQIILTSDRYPKEINGVEDRLKSRFGWGLTVAIEPPELETRVAILMKKADENDIRLPGE
VAFFIAKRLRSNVRELEGALNRVIANANFTGRAITIDFVREALRDLLALQEKLVTIDNIQKTVAEYYKIKVADLLSKRRS
RSVARPRQMAMALAKELTNHSLPEIGDAFGGRDHTTVLHACRKIEQLREESHDIKEDFSNLIRTLSS
>O26057 ~~~dnaA~~~Chromosomal replication initiator protein DnaA~~~COG0593
MDTNNNIEKEILALVKQNPKVSLIEYENYFSQLKYNPNASKSDIAFFYAPNQVLCTTITAKYGALLKEILSQNKVGMHLA
HSVDVRIEVAPKIQINAQSNINYKAIKTSVKDSYTFENFVVGSCNNTVYEIAKKVAQSDTPPYNPVLFYGGTGLGKTHIL
NAIGNHALEKHKKVVLVTSEDFLTDFLKHLDNKTMDSFKAKYRHCDFFLLDDAQFLQGKPKLEEEFFHTFNELHANSKQI
VLISDRSPKNIAGLEDRLKSRFEWGITAKVMPPDLETKLSIVKQKCQLNQITLPEEVMEYIAQHISDNIRQMEGAIIKIS
VNANLMNASIDLNLAKTVLEDLQKDHAEGSSLENILLAVAQSLNLKSSEIKVSSRQKNVALARKLVVYFARLYTPNPTLS
LAQFLDLKDHSSISKMYSGVKKMLEEEKSPFVLSLREEIKNRLNELNDKKTAFNSSE
>P35888 ~~~dnaA~~~Chromosomal replication initiator protein DnaA~~~COG0593
MEQFNAFKSLLKKHYEKTIGFHDKYIKDINRFVFKNNVLLILLENEFARNSLNDNSEIIHLAESLYEGIKSVNFVNEQDF
FFNLAKLEENSRDTLYQNSGLSKNYTFQNFVISEGNKRAYEAGVRLAETQDNEFSPLFIYGETGLGKTHLLQAIGNEKFR
NFPNARVKYVVSSDFAQEVVDAFYQRDKGIEKLKKNYENLDLVLIDDTQIFGRKEKTLEILFNIFNNLVLNKKQIVLVSD
KAPDELIDIDARMISRFKSGLLLKIEKHNLSSLCEILTVKLKEKDPNIQITNEARHDAAQISGNDVRALNGIATKLLFFA
KTSKQNLINTENLKEILFEEFEKFHKKSFDPYLLIENVCRRFNVPMDSVLSENRKAELVRVRDVCNYLLRQKYNMQFQQI
GKIFKRSHSSVLMAVKRVAKMIENDSSLRDVITSLVI
>A0R7K1 ~~~dnaA~~~Chromosomal replication initiator protein DnaA~~~COG0593
MTADPDPPFVAVWNSVVAELNGDVNGDRQGDPSLPVLTPQQRAWLKLVKPLVIAEGFALLSVPTPFVQNEIERHLREPIV
TALSRKLGQRVELGVRIATPTDEPEDAPDSFADSPAPASVPAGPADADEIDDDRDARVNAQESWPKYFSRPEPDTSSDDS
NAVNLNRRYTFDTFVIGASNRFAHAATLAIAEAPARAYNPLFIWGESGLGKTHLLHAAGNYAQRLFPGMRVKYVSTEEFT
NDFINSLRDDRKASFKRSYRDIDILLVDDIQFIEGKEGIQEEFFHTFNTLHNSNKQIVISSDRPPKQLATLEDRLRTRFE
WGLITDVQPPELETRIAILRKKAQMDRLDVPDDVLELIASSIERNIRELEGALIRVTAFASLNKTRIDRSLAEVVLRDLI
ADATTMQISTAAIMAVTAEYFETTVEELRGPGKTRALAQSRQIAMYLCRELTDLSLPKIGQAFGRDHTTVMYAEKKIRGE
MAERREVFDHVKELTTRIRQRAKR
>A5TY69 ~~~dnaA~~~Chromosomal replication initiator protein DnaA~~~COG0593
MTDDPGSGFTTVWNAVVSELNGDPKVDDGPSSDANLSAPLTPQQRAWLNLVQPLTIVEGFALLSVPSSFVQNEIERHLRA
PITDALSRRLGHQIQLGVRIAPPATDEADDTTVPPSENPATTSPDTTTDNDEIDDSAAARGDNQHSWPSYFTERPHNTDS
ATAGVTSLNRRYTFDTFVIGASNRFAHAAALAIAEAPARAYNPLFIWGESGLGKTHLLHAAGNYAQRLFPGMRVKYVSTE
EFTNDFINSLRDDRKVAFKRSYRDVDVLLVDDIQFIEGKEGIQEEFFHTFNTLHNANKQIVISSDRPPKQLATLEDRLRT
RFEWGLITDVQPPELETRIAILRKKAQMERLAVPDDVLELIASSIERNIRELEGALIRVTAFASLNKTPIDKALAEIVLR
DLIADANTMQISAATIMAATAEYFDTTVEELRGPGKTRALAQSRQIAMYLCRELTDLSLPKIGQAFGRDHTTVMYAQRKI
LSEMAERREVFDHVKELTTRIRQRSKR
>P9WNW3 ~~~dnaA~~~Chromosomal replication initiator protein DnaA~~~COG0593
MTDDPGSGFTTVWNAVVSELNGDPKVDDGPSSDANLSAPLTPQQRAWLNLVQPLTIVEGFALLSVPSSFVQNEIERHLRA
PITDALSRRLGHQIQLGVRIAPPATDEADDTTVPPSENPATTSPDTTTDNDEIDDSAAARGDNQHSWPSYFTERPHNTDS
ATAGVTSLNRRYTFDTFVIGASNRFAHAAALAIAEAPARAYNPLFIWGESGLGKTHLLHAAGNYAQRLFPGMRVKYVSTE
EFTNDFINSLRDDRKVAFKRSYRDVDVLLVDDIQFIEGKEGIQEEFFHTFNTLHNANKQIVISSDRPPKQLATLEDRLRT
RFEWGLITDVQPPELETRIAILRKKAQMERLAVPDDVLELIASSIERNIRELEGALIRVTAFASLNKTPIDKALAEIVLR
DLIADANTMQISAATIMAATAEYFDTTVEELRGPGKTRALAQSRQIAMYLCRELTDLSLPKIGQAFGRDHTTVMYAQRKI
LSEMAERREVFDHVKELTTRIRQRSKR
>P68866 ~~~dnaA~~~Chromosomal replication initiator protein DnaA~~~
MSEKEIWEKVLEIAQEKLSAVSYSTFLKDTELYTIKDGEAIVLSSIPFNANWLNQQYAEIIQAILFDVVGYEVKPHFITT
EELANYSNNETATPKETTKPSTETTEDNHVLGREQFNAHNTFDTFVIGPGNRFPHAASLAVAEAPAKAYNPLFIYGGVGL
GKTHLMHAIGHHVLDNNPDAKVIYTSSEKFTNEFIKSIRDNEGEAFRERYRNIDVLLIDDIQFIQNKVQTQEEFFYTFNE
LHQNNKQIVISSDRPPKEIAQLEDRLRSRFEWGLIVDITPPDYETRMAILQKKIEEEKLDIPPEALNYIANQIQSNIREL
EGALTRLLAYSQLLGKPITTELTAEALKDIIQAPKSKKITIQDIQKIVGQYYNVRIEDFSAKKRTKSIAYPRQIAMYLSR
ELTDFSLPKIGEEFGGRDHTTVIHAHEKISKDLKEDPIFKQEVENLEKEIRNV
>Q9ZH75 ~~~dnaA~~~Chromosomal replication initiator protein DnaA~~~
MADVPADLAAVWPRVLEQLLGEGQQGIEPKDKQWIERCQPLALVADTALLAVPNEWGKRVLEGRLAPLISETLTRECGRP
IRIAITVDDSAGEPPSPPAPPMHQSHQSQQGHRYPAQQRDDAPRGDAYDGYGHRPSDDGMPTRRPAYPDYQQQRPEPGAW
PRTQEDLSWQQPRHGGYQDREQPSGEPYRESESYRERENEQYREQAPEQWRQPYGTGRPQQPQHDYRSGPPEHQGYEQQR
PDRQDQGQGPRQGGHGPGRTGGSVPGPMGAQPAPAPGPGEPHARLNPKYLFDTFVIGASNRFAHAAAVAVAEAPAKAYNP
LFIYGESGLGKTHLLHAIGHYARSLYPGTRVRYVSSEEFTNEFINSIRDGKGDTFRKRYRDVDILLVDDIQFLASKESTQ
EEFFHTFNTLHNANKQIVLSSDRPPKQLVTLEDRLRNRFEWGLTTDVQPPELETRIAILRKKAVQEQLNAPPEVLEFIAS
RISRNIRELEGALIRVTAFASLNRQPVDLGLTEIVLKDLIPGGEESAPEITAPAIMAATADYFGLTVDDLCGSSRTRVLV
TARQIAMYLCRELTDLSLPKIGAQFGGRDHTTVMHADRKIRALMAERRSIYNQVTELTNRIKNG
>Q04N63 ~~~dnaA~~~Chromosomal replication initiator protein DnaA~~~COG0593
MKEKQFWNRILEFAQERLTRSMYDFYAIQAELIKVEENVATIFLPRSEMEMVWEKQLKDIIVVAGFEIYDAEITPHYIFT
KPQDTTSSQVEEATNLTLYDYSPKLVSIPYSDTGLKEKYTFDNFIQGDGNVWAVSAALAVSEDLALTYNPLFIYGGPGLG
KTHLLNAIGNEILKNIPNARVKYIPAESFINDFLDHLRLGEMEKFKKTYRSLDLLLIDDIQSLSGKKVATQEEFFNTFNA
LHDKQKQIVLTSDRSPKHLEGLEERLVTRFSWGLTQTITPPDFETRIAILQSKTEHLGYNFQSDTLEYLAGQFDSNVRDL
EGAINDITLIARVKKIKDITIDIAAEAIRARKQDVSQMLVIPIDKIQTEVGNFYGVSIKEMKGSRRLQNIVLARQVAMYL
SRELTDNSLPKIGKEFGGKDHTTVIHAHAKIKSLIDQDDNLRLEIESIKKKIK
>P46798 ~~~dnaA~~~Chromosomal replication initiator protein DnaA~~~COG0593
MKERILQEIKTRVNRKSWELWFSSFDVKSIEGNKVVFSVGNLFIKEWLEKKYYSVLSKAVKVVLGNDATFEITYEAFEPH
SSYSEPLVKKRAVLLTPLNPDYTFENFVVGPGNSFAYHAALEVAKHPGRYNPLFIYGGVGLGKTHLLQSIGNYVVQNEPD
LRVMYITSEKFLNDLVDSMKEGKLNEFREKYRKKVDILLIDDVQFLIGKTGVQTELFHTFNELHDSGKQIVICSDREPQK
LSEFQDRLVSRFQMGLVAKLEPPDEETRKSIARKMLEIEHGELPEEVLNFVAENVDDNLRRLRGAIIKLLVYKETTGKEV
DLKEAILLLKDFIKPNRVKAMDPIDELIEIVAKVTGVPREEILSNSRNVKALTARRIGMYVAKNYLKSSLRTIAEKFNRS
HPVVVDSVKKVKDSLLKGNKQLKALIDEVIGEISRRALSG
>Q9X9D5 ~~~dnaA~~~Chromosomal replication initiator protein DnaA~~~COG0593
MSHEAVWQHVLEHIRRSITEVEFHTWFERIRPLGIRDGVLELAVPTSFALDWIRRHYAGLIQEALGLLGAQAPRFELRVV
PGVVVQEDIFQAAPAEAPRPKLNPKYTFENFVVGPNNSMAHAAAVAVAESPGRAYNPLFIYGGVGLGKTHLMHAVGHSVA
KRFPHLRIEYVSTETFTNELINAIREDRMTEFRERYRSVDLLLVDDVQFIAGKERTQEEFFHTFNALYEAHKQIILSSDR
PPKDILTLEARLRSRFEWGLITDIQPPDLETRIAILKMNAEQRGLRIPEDALEYIARQVTSNIRELEGALMRAIAFASLN
GVELTRAVAAKALSDIFAPRELEADPLEIIRKVADHFGLKPEELTGSGRKKEVVLPRQLAMYLVRELTRASLPEIGQLFG
GRDHTTVLYAIQKVQELAESDREVQGLLRTLREACT
>O83047 ~~~dnaA~~~Chromosomal replication initiator protein DnaA~~~COG0593
MDAVGYEVFWNETLSQIRSESTEAEFNMWFAHLFFIASFENAIEIAVPSDFFRIQFSQKYQEKLERKFLELSGHPIKLLF
AVKKGTPHGNTAPPKHVHTYLEKNSPAEVPSKKSFHPDLNRDYTFENFVSGEETKFSHSAAISVSKNPGTSYNPLLIYGG
VGLGKTHLMQAIGHEIYKTTDLNVIYVTAENFGNEFISTLLNKKTQDFKKKYRYTADVLLIDDIHFFENKDGLQEELFYT
FNELFEKKKQIIFTCDRPVQELKNLSSRLRSRCSRGLSTDLNMPCFETRCAILIKKIQNYNSTYPHKAIHISDDVVRLVS
ENISSNIRDLEGALTKIIAFIEVSGSITIDIVPSLLKEFFLSARPKHITVETILHVVADHFNISYSDLKGKKRNKSVVYP
RQIAMFLSKELTELSTTELGIEFGGRDHSTVIYGCQKIEGEILTNPSLQANLDLLKSKVQDSIR
>Q5E7H1 ~~~~~~DNA-binding protein VF_0530~~~COG4628
MALIMTQQNNPLHGITLQKLLTELVEHYGWEELSYMVNINCFKKDPSIKSSLKFLRKTDWARERVENIYLKLQRHKERNQ
>P07908 ~~~dnaB~~~Replication initiation and membrane attachment protein~~~COG3611
MADYWKDVLPVDPYVVKSRSMLQDIDRQIITQLYQPLIGPVAFSLYMTLWGELEQNRLWGGESTHRQLMGMTQSNLKTIH
QEQGKLEGIGLLKVYMKESERQERLFIYELLPPLRPNEFFEDGMLNVFLYNRVGKTKYQQLKQFFTHPAISEDAKDITRP
FNHAFESLQPSEWKLTSDMEETVRLAEGSEYTSVGQSPSYTITEDVFDFDLFLAGLSETMIPRKAMTQQVRDTIKKLSYL
YGIDPLQMQNVVMSAIDERDVITTEALRKAASDWYQIERNGQLPDLVEKTQPVHLREGEQPAEEDSLDGKLIALLEAISP
KKLLQDIADGTEPSKADLKIIEEIMFEQKLEPGVTNVLIYYVMLKTDMKLSKNYIQKIASHWARKKVKTVREAMKLAIEE
NRQYLEWAEGKTKSSKRNQKVIREEKLPDWMTEKETASDSESGQQKLHPQDLEEQKKKMMEEMQKLKKYSAY
>P0ACB0 3.6.4.12~~~dnaB~~~Replicative DNA helicase~~~COG0305
MAGNKPFNKQQAEPRERDPQVAGLKVPPHSIEAEQSVLGGLMLDNERWDDVAERVVADDFYTRPHRHIFTEMARLQESGS
PIDLITLAESLERQGQLDSVGGFAYLAELSKNTPSAANISAYADIVRERAVVREMISVANEIAEAGFDPQGRTSEDLLDL
AESRVFKIAESRANKDEGPKNIADVLDATVARIEQLFQQPHDGVTGVNTGYDDLNKKTAGLQPSDLIIVAARPSMGKTTF
AMNLVENAAMLQDKPVLIFSLEMPSEQIMMRSLASLSRVDQTKIRTGQLDDEDWARISGTMGILLEKRNIYIDDSSGLTP
TEVRSRARRIAREHGGIGLIMIDYLQLMRVPALSDNRTLEIAEISRSLKALAKELNVPVVALSQLNRSLEQRADKRPVNS
DLRESGSIEQDADLIMFIYRDEVYHENSDLKGIAEIIIGKQRNGPIGTVRLTFNGQWSRFDNYAGPQYDDE
>O25916 3.6.4.12~~~dnaB~~~Replicative DNA helicase~~~COG0305
MDHLKHLQQLQNIERIVLSGIVLANHKIEEVHSVLEPSDFYYPPNGLFFEIALKLHEEDCPIDENFIRQKMPKDKQIKEE
DLVAIFAASPIDNIEAYVEEIKNASIKRKLFGLANTIREQALESAQKSSDILGAVEREVYALLNGSTIEGFRNIKEVLES
AMDLITENQRKGSLEVTGIPTGFVQLDNYTSGFNKGSLVIIGARPSMGKTSLMMNMVLSALNDDRGVAVFSLEMSAEQLA
LRALSDLTSINMHDLESGRLDDDQWENLAKCFDHLSQKKLFFYDKSYVRIEQIRLQLRKLKSQHKELGIAFIDYLQLMSG
SKATKERHEQIAEISRELKTLARELEIPIIALVQLNRSLENRDDKRPILSDIKDSGGIEQDADIVLFLYRGYIYQMRAED
NKIDKLKKEGKIEEAQELYLKVNEERRIHKQNGSIEEAEIIVAKNRNGATGTVYTRFNAPFTRYEDMPIDSHLEEGQETK
VDYDIVTT
>P9WMR3 3.6.4.12~~~dnaB~~~Replicative DNA helicase~~~COG0305
MAVVDDLAPGMDSSPPSEDYGRQPPQDLAAEQSVLGGMLLSKDAIADVLERLRPGDFYRPAHQNVYDAILDLYGRGEPAD
AVTVAAELDRRGLLRRIGGAPYLHTLISTVPTAANAGYYASIVAEKALLRRLVEAGTRVVQYGYAGAEGADVAEVVDRAQ
AEIYDVADRRLSEDFVALEDLLQPTMDEIDAIASSGGLARGVATGFTELDEVTNGLHPGQMVIVAARPGVGKSTLGLDFM
RSCSIRHRMASVIFSLEMSKSEIVMRLLSAEAKIKLSDMRSGRMSDDDWTRLARRMSEISEAPLFIDDSPNLTMMEIRAK
ARRLRQKANLKLIVVDYLQLMTSGKKYESRQVEVSEFSRHLKLLAKELEVPVVAISQLNRGPEQRTDKKPMLADLRESGC
LTASTRILRADTGAEVAFGELMRSGERPMVWSLDERLRMVARPMINVFPSGRKEVFRLRLASGREVEATGSHPFMKFEGW
TPLAQLKVGDRIAAPRRVPEPIDTQRMPESELISLARMIGDGSCLKNQPIRYEPVDEANLAAVTVSAAHSDRAAIRDDYL
AARVPSLRPARQRLPRGRCTPIAAWLAGLGLFTKRSHEKCVPEAVFRAPNDQVALFLRHLWSAGGSVRWDPTNGQGRVYY
GSTSRRLIDDVAQLLLRVGIFSWITHAPKLGGHDSWRLHIHGAKDQVRFLRHVGVHGAEAVAAQEMLRQLKGPVRNPNLD
SAPKKVWAQVRNRLSAKQMMDIQLHEPTMWKHSPSRSRPHRAEARIEDRAIHELARGDAYWDTVVEITSIGDQHVFDGTV
SGTHNFVANGISLHNSLEQDADVVILLHRPDAFDRDDPRGGEADFILAKHRNGPTKTVTVAHQLHLSRFANMAR
>Q55418 3.6.4.12~~~dnaB~~~Replicative DNA helicase~~~COG0305
MAANPALPPQNIEAEECILGGILLDPEAMGRIIDLLVVDAFYVKAHRLIYEAMLSLHGQSQPTDLMSVSSWLQDHHHFEA
IGGMVKLTQLLDRTISAVNIDRFAALIMDKYLRRQLIAAGHDIVDLGYETSKELETIFDESEQKIFRLTQSRPQAGLVPL
SETLVNTFIELDKLHEKLSSPGVETQFYDLDAMTGGLQRADLIILAGRPSMGKTAFGLGIAANIAKNQNLPVAIFSLEMS
KEQLALRLVASESLIDSNRLRTGHFSQAEFEPLTAAMGTLSSLPIYIDDTASISVTQMRSQVRRLQSEQKGPLGMVLIDY
LQLMEGGSDNRVQELSKITRSLKGLAREINAPVIALSQLSRAVESRTNKRPMMSDLRESGCISGDSLISLASTGKRVSIK
DLLDEKDFEIWAINEQTMKLESAKVSRVFCTGKKLVYILKTRLGRTIKATANHRFLTIDGWKRLDELSLKEHIALPRKLE
SSSLQLMSDEELGLLGHLIGDGCTLPRHAIQYTSNKIELAEKVVELAKAVFGDQINPRISQERQWYQVYIPASYRLTHNK
KNPITKWLENLDVFGLRSYEKFVPNQVFEQPQRAIAIFLRHLWSTDGCVKLIVEKSSRPVAYYATSSEKLAKDVQSLLLK
LGINARLSKISQNGKGRDNYHVTITGQADLQIFVDQIGAVDKDKQASVEEIKTHIAQHQANTNRDVIPKQIWKTYVLPQI
QIKGITTRDLQMRLGNAYCGTALYKHNLSRERAAKIATITQSPEIEKLSQSDIYWDSIVSITETGVEEVFDLTVPGPHNF
VANDIIVHNSIEQDADLIMMIYRDEYYNPDTPDPGVAELLIVKHRNGPTGVVKLLFKPEFTQFLNLQRSNDY
>O83097 3.6.4.12~~~dnaB~~~Replicative DNA helicase~~~COG0305
MPGMPNPTQELKGKIPPHNLEAERAVLGAVLLDDSALSTATEQLSASSFYSAAHQRIFQALVELSDLGQRPDILVLSEHL
RSCEALDFVGGSAYVASLTDAVPSAANVEYYTRIVCDAAMRRSLLKVARIITAEAFNDTVSGNIVLETAQREIYDLTNAR
RVATFKLLKNLIPDLVNTIETRYRNQSDLVGIATGLTALDNLTGGFQNSELIVIGARPSMGKTALAMTMASNIAIRQRIP
TAFFSLEMSNLLLMQRLIAAESGVSATNLRKGLLQLSDFGRIQNAAGEMYDAPLYIVDVPNMKLLDLRAVARRLCVQEKI
QIIFVDYLGLIVADNPFAPRYEQFAAISQSLKSLARELDIPIVALSQVGRPAEGSAPNLADIRGSGAIEQDADVVMFLHR
DRNETETQLILAKQRNGPIGTVELEFQASFTRFVCKSP
>P37469 3.6.4.12~~~dnaC~~~Replicative DNA helicase~~~COG0305
MTDLLNDRLPPQNIEAEQAVLGAIFLQPSALTLASEVLIPDDFYRMSHQKIYNAMLVLGDRGEPVDLVTVTSELANTDLL
EEVGGISYLTDIANSVPTAANIEYYAKIVEEKSILRRLIRTATTIAQDGYTREDEVEDLLSEAEKTIMEVAQRKNTSAFQ
NIKDVLVQTYDNIEQLHNRKGDITGIPTGFTELDRMTAGFQRNDLIIVAARPSVGKTAFALNIAQNVATKTDESVAIFSL
EMGAEQLVMRMLCAEGNINAQNLRTGNLTEEDWGKLTMAMGSLSNSGIYIDDTPGIRVSEIRAKCRRLKQESGLGMILID
YLQLIQGSGRSKDNRQQEVSEISRELKSIARELQVPVIALSQLSRGVEQRQDKRPMMSDIRESGSIEQDADIVAFLYRDD
YYDKETENKNIIEIIIAKQRNGPVGTVSLAFVKEYNKFVNLERRFDDAGVPPGA
>P0AEF0 ~~~dnaC~~~DNA replication protein DnaC~~~COG1484
MKNVGDLMQRLQKMMPAHIKPAFKTGEELLAWQKEQGAIRSAALERENRAMKMQRTFNRSGIRPLHQNCSFENYRVECEG
QMNALSKARQYVEEFDGNIASFIFSGKPGTGKNHLAAAICNELLLRGKSVLIITVADIMSAMKDTFRNSGTSEEQLLNDL
SNVDLLVIDEIGVQTESKYEKVIINQIVDRRSSSKRPTGMLTNSNMEEMTKLLGERVMDRMRLGNSLWVIFNWDSYRSRV
TGKEY
>P39787 ~~~dnaD~~~DNA replication protein DnaD~~~COG3935
MKKQQFIDMQEQGTSTIPNLLLTHYKQLGLNETELILLLKIKMHLEKGSYFPTPNQLQEGMSISVEECTNRLRMFIQKGF
LFIEECEDQNGIKFEKYSLQPLWGKLYEYIQLAQNQTQERKAEGEQKSLYTIFEEEFARPLSPLECETLAIWQDQDQHDA
QLIKHALKEAVLSGKLSFRYIDRILFEWKKNGLKTVEQAKIHSQKFRRVQAKQNEPQKEYKRQVPFYNWLEQ
>B8H427 2.7.7.7~~~dnaE2~~~Error-prone DNA polymerase~~~
MRPPVYAELQATTNFSFLRGASHAEELALTAEALGLTAIGIVDRNSLAGVVRAWTAAKRRSIRVLTGCRLDFMDGAPSLL
CYPTDREAFGRLTRLLTIGQLRAEKGECHLTWRDFLDHSEGQLGLIVPPRVLDDSFEQHLTRMAGDLRGRSWLAASRAYA
ARDLQRLARLESLGRTSGAPIVATNDVLYHGPERRPLQDVISCVREHCAIQEAGFRLEANAERHIKSPEEMARLFDRWPR
AVERTVEIVERIGFDLKDIREQYPDEPVPPGKTAMQHLTDLTWKGAAWRYPNGVSPKVTAQIQEELRLIEKMDYPNYFIT
VHDIVREARSMGILCQGRGSAANSSVCFCLGVTAIDPTEHRLLFTRFISENRGEPPDIDVDFEHDRREEVMQYVFQRYGR
AYAAICGTVIHYRPRSAIRDVGKALGLTEDVTSLLAGTVWGSWGDGLPEDHLRNAGLDPKAPEIARAVGLANDLIGFPRH
LSQHVGGFVLTKRRLDETVPIGKAAMKDRTFIEWDKDDIDSLGLMKVDILALGMLHAIQRAMTMLREDHGQDWLKDLADI
PKEVPGVYDMLCAADSVGVFQVESRAQMSMLPRLRPREFYDLVIQVAIVRPGPIQGDMVHPYLKRRNKEEPVDWPKPSPE
HGPPDELQEILGKTFGVPLFQEQAMSLAIEAAKFTPDEADGLRKAMATFRNLGSPDAYRNKFIEGMVGRGYERAFAERCF
KQIEGFSHYGFPESHAASFAKLVYVSAWIKWAWPDVFCAALINAQPMGFYQPAQLVRDAREHGVEVLPPDILTSDWDCTL
APISTGFRPPRVRHDKVACQETRPRWKAVRLGFRQIKGLRETLDIPPLLKARGEGARTPAEFAQGGVPQKALELLAEADA
FASVGLSRREALWAVKGLKGEHKAPVQAPLLAGLPLFEERVALPAMAATQEVAEDYRTTSLSLKAHPIGFYRSMLAARGV
VPAERLLSLKDGARVSVAGLVLIRQRPGTAKGVVFVTLEDETGVANAVVWKDRFDAARNVVMTASFLIVHGRVQRADNVI
HVVAERFTDLSAELSSLRDEPGAPAPRIRQKVSGRLLRSRDFH
>P9WNT5 2.7.7.7~~~dnaE2~~~Error-prone DNA polymerase~~~COG0587
MFDILWNVGWSNGPPSWAEMERVLNGKPRHAGVPAFDADGDVPRSRKRGAYQPPGRERVGSSVAYAELHAHSAYSFLDGA
STPEELVEEAARLGLCALALTDHDGLYGAVRFAEAAAELDVRTVFGAELSLGATARTERPDPPGPHLLVLARGPEGYRRL
SRQLAAAHLAGGEKGKPRYDFDALTEAAGGHWHILTGCRKGHVRQALSQGGPAAAQRALADLVDRFTPSRVSIELTHHGH
PLDDERNAALAGLAPRFGVGIVATTGAHFADPSRGRLAMAMAAIRARRSLDSAAGWLAPLGGAHLRSGEEMARLFAWCPE
AVTAAAELGERCAFGLQLIAPRLPPFDVPDGHTEDSWLRSLVMAGARERYGPPKSAPRAYSQIEHELKVIAQLRFPGYFL
VVHDITRFCRDNDILCQGRGSAANSAVCYALGVTAVDPVANELLFERFLSPARDGPPDIDIDIESDQREKVIQYVYHKYG
RDYAAQVANVITYRGRSAVRDMARALGFSPGQQDAWSKQVSHWTGQADDVDGIPEQVIDLATQIRNLPRHLGIHSGGMVI
CDRPIADVCPVEWARMANRSVLQWDKDDCAAIGLVKFDLLGLGMLSALHYAKDLVAEHKGIEVDLARLDLSEPAVYEMLA
RADSVGVFQVESRAQMATLPRLKPRVFYDLVVEVALIRPGPIQGGSVHPYIRRRNGVDPVIYEHPSMAPALRKTLGVPLF
QEQLMQLAVDCAGFSAAEADQLRRAMGSKRSTERMRRLRGRFYDGMRALHGAPDEVIDRIYEKLEAFANFGFPESHALSF
ASLVFYSAWFKLHHPAAFCAALLRAQPMGFYSPQSLVADARRHGVAVHGPCVNASLAHATCENAGTEVRLGLGAVRYLGA
ELAEKLVAERTANGPFTSLPDLTSRVQLSVPQVEALATAGALGCFGMSRREALWAAGAAATGRPDRLPGVGSSSHIPALP
GMSELELAAADVWATGVSPDSYPTQFLRADLDAMGVLPAERLGSVSDGDRVLIAGAVTHRQRPATAQGVTFINLEDETGM
VNVLCTPGVWARHRKLAHTAPALLIRGQVQNASGAITVVAERMGRLTLAVGARSRDFR
>O67465 2.7.7.101~~~dnaG~~~DNA primase~~~COG0358
MSSDIDELRREIDIVDVISEYLNLEKVGSNYRTNCPFHPDDTPSFYVSPSKQIFKCFGCGVGGDAIKFVSLYEDISYFEA
ALELAKRYGKKLDLEKISKDEKVYVALDRVCDFYRESLLKNREASEYVKSRGIDPKVARKFDLGYAPSSEALVKVLKEND
LLEAYLETKNLLSPTKGVYRDLFLRRVVIPIKDPRGRVIGFGGRRIVEDKSPKYINSPDSRVFKKGENLFGLYEAKEYIK
EEGFAILVEGYFDLLRLFSEGIRNVVAPLGTALTQNQANLLSKFTKKVYILYDGDDAGRKAMKSAIPLLLSAGVEVYPVY
LPEGYDPDEFIKEFGKEELRRLINSSGELFETLIKTARENLEEKTREFRYYLGFISDGVRRFALASEFHTKYKVPMEILL
MKIEKNSQEKEIKLSFKEKIFLKGLIELKPKIDLEVLNLSPELKELAVNALNGEEHLLPKEVLEYQVDNLEKLFNNILRD
LQKSGKKRKKRGLKNVNT
>P05096 2.7.7.101~~~dnaG~~~DNA primase~~~COG0358
MGNRIPDEIVDQVQKSADIVEVIGDYVQLKKQGRNYFGLCPFHGESTPSFSVSPDKQIFHCFGCGAGGNVFSFLRQMEGY
SFAESVSHLADKYQIDFPDDITVHSGARPESSGEQKMAEAHELLKKFYHHLLINTKEGQEALDYLLSRGFTKELINEFQI
GYALDSWDFITKFLVKRGFSEAQMEKAGLLIRREDGSGYFDRFRNRVMFPIHDHHGAVVAFSGRALGSQQPKYMNSPETP
LFHKSKLLYNFYKARLHIRKQERAVLFEGFADVISAVSSDVKESIATMGTSLTDDHVKILRRNVEEIILCYDSDKAGYEA
TLKASELLQKKGCKVRVAMIPDGLDPDDYIKKFGGEKFKNDIIDASVTVMAFKMQYFRKGKNLSDEGDRLAYIKDVLKEI
STLSGSLEQEVYVKQLASEFSLSQESLTEQLSVFSKQNKPADNSGETKTRRAHLTTKARQKRLRPAYENAERLLLAHMLR
DRSVIKKVIDRVGFQFNIDEHRALAAYLYAFYEEGAELTPQHLMARVTDDHISQLLSDILMLQVNQELSEAELSDYVKKV
LNQRNWSMIKEKEAERAEAERQKDFLRAASLAQEIVTLNRSLK
>P0ABS5 2.7.7.101~~~dnaG~~~DNA primase~~~COG0358
MAGRIPRVFINDLLARTDIVDLIDARVKLKKQGKNFHACCPFHNEKTPSFTVNGEKQFYHCFGCGAHGNAIDFLMNYDKL
EFVETVEELAAMHNLEVPFEAGSGPSQIERHQRQTLYQLMDGLNTFYQQSLQQPVATSARQYLEKRGLSHEVIARFAIGF
APPGWDNVLKRFGGNPENRQSLIDAGMLVTNDQGRSYDRFRERVMFPIRDKRGRVIGFGGRVLGNDTPKYLNSPETDIFH
KGRQLYGLYEAQQDNAEPNRLLVVEGYMDVVALAQYGINYAVASLGTSTTADHIQLLFRATNNVICCYDGDRAGRDAAWR
ALETALPYMTDGRQLRFMFLPDGEDPDTLVRKEGKEAFEARMEQAMPLSAFLFNSLMPQVDLSTPDGRARLSTLALPLIS
QVPGETLRIYLRQELGNKLGILDDSQLERLMPKAAESGVSRPVPQLKRTTMRILIGLLVQNPELATLVPPLENLDENKLP
GLGLFRELVNTCLSQPGLTTGQLLEHYRGTNNAATLEKLSMWDDIADKNIAEQTFTDSLNHMFDSLLELRQEELIARERT
HGLSNEERLELWTLNQELAKK
>Q9X4D0 2.7.7.101~~~dnaG~~~DNA primase~~~
MGHRIPEETIEAIRRGVDIVDVIGEYVQLKRQGRNYFGLCPFHGEKTPSFSVSPEKQIFHCFGCGAGGNAFTFLMDIEGI
PFVEAAKRLAAKAGVDLSVYELDVRGRDDGQTDEAKAMTEAHALLKRFYHHLLVHTKEGQAALDYLQARGWTKETIDRFE
IGYAPDAPDAAAKLLESHSFSLPVMEKAGLLTKKEDGRYVGRFRNRIMFPIHDHRGETVGFSGRLLGEGHPKYVNSPETP
VFRKGAILYHFHAARVPIRKRQEALLVEGFADVISAAQAGIDYAIATMGTSLTEEQARILRPCDTITICYDGDRAGIEAA
WAAAEQLSALGCRVKVASLPNGLDPDEYIRVYGGERFAGEAGCRRPLVAFKMAYLRRGKNLQHEGERLRYIDEALREIGK
LSSPVEQDYYLRQLAEEFSLSLSALHEQLSRSQRERTKPREAPDGETARPMLAKKLLPAFQNAERLLLAHMMRSRDVALV
VQERIGGRFNIEEHRALAAYIYAFYEEGHEADPGALISRIPGELQPLASDVSLLLIADDVSEQELEDYIRHVLNRPKWLM
LKVKEQEKTEAERRKDFLTAARIAKEMIEMKKMLSSS
>P56064 2.7.7.101~~~dnaG~~~DNA primase~~~COG0358
MILKSSIDRLLQTIDIVEVISSYVNLRKSGSSYMACCPFHEERSASFSVNQIKGFYHCFGCGASGDSIKFVMAFEKLSFV
EALEKLAHRFNIVLEYDKGVYYDHKEDYHLLEMVSSLYQEELFNAPFFLNYLQKRGLSLESIKAFKLGLCTNRIDYGIEN
KGLNKDKLIELGVLGKSDNDQKTYLRFLDRIMFPIYSPSAQVVGFGGRTLKEKAAKYINSPQSKLFDKSSLLYGYHLAKE
HIYKQKQVIVTEGYLDVILLHQAGFKNAIATLGTALTPSHLPLLKKGDPEILLSYDGDKAGRNAAYKASLMLAKEQRRGG
VILFENNLDPADMIANGQIETLKNWLSHPMAFIEFVLRRMADSYLLDDPLEKDKALKEMLGFLKNFSLLLQSEYKPLIAT
LLQAPLHVLGIRERVSFQPFYPKTEKPNRPQRFAHVSSAPSLEFLEKLVIRYLLEDRSLLDLAVGYIHSGVFLHKKQEFD
ALCQEKLDDPKLVALLLDANLPLKKGGFEKELRLLILRYFERQLKEIPKSSLPFSEKMICLKKARQAIMKLKQGELVAI
>P9WNW1 2.7.7.101~~~dnaG~~~DNA primase~~~COG0358
MSGRISDRDIAAIREGARIEDVVGDYVQLRRAGADSLKGLCPFHNEKSPSFHVRPNHGHFHCFGCGEGGDVYAFIQKIEH
VSFVEAVELLADRIGHTISYTGAATSVQRDRGSRSRLLAANAAAAAFYAQALQSDEAAPARQYLTERSFDAAAARKFGCG
FAPSGWDSLTKHLQRKGFEFEELEAAGLSRQGRHGPMDRFHRRLLWPIRTSAGEVVGFGARRLFDDDAMEAKYVNTPETL
LYKKSSVMFGIDLAKRDIAKGHQAVVVEGYTDVMAMHLAGVTTAVASCGTAFGGEHLAMLRRLMMDDSFFRGELIYVFDG
DEAGRAAALKAFDGEQKLAGQSFVAVAPDGMDPCDLRLKCGDAALRDLVARRTPLFEFAIRAAIAEMDLDSAEGRVAALR
RCVPMVGQIKDPTLRDEYARQLAGWVGWADVAQVIGRVRGEAKRTKHPRLGRLGSTTIARAAQRPTAGPPTELAVRPDPR
DPTLWPQREALKSALQYPALAGPVFDALTVEGFTHPEYAAVRAAIDTAGGTSAGLSGAQWLDMVRQQTTSTVTSALISEL
GVEAIQVDDDKLPRYIAGVLARLQEVWLGRQIAEVKSKLQRMSPIEQGDEYHALFGDLVAMEAYRRSLLEQASGDDLTA
>Q9I5W0 2.7.7.101~~~dnaG~~~DNA primase~~~
MAGLIPQSFIDDLLNRTDIVEVVSSRIQLKKTGKNYSACCPFHKEKTPSFTVSPDKQFYYCFGCGAGGNALGFVMDHDQL
EFPQAVEELAKRAGMDVPREERGGRGHTPRQPTDSPLYPLLSAAAEFYKQALKSHPARKAAVNYLKGRGLTGEIARDFGL
GFAPPGWDNLLKHLGGDNLQLKAMLDAGLLVENSDTGKRYDRFRDRVMFPIRDSRGRIIAFGGRVLGDDKPKYLNSPETP
VFHKGQELYGLYEARQKNRDLDEIMVVEGYMDVIALAQQGIRNAVATLGTATSEEHIKRLFRLVPSILFCFDGDQAGRKA
AWRALESVLPNLQDGKRVRFLFLPEGEDPDSLVRAEGEDAFRARITQQAQPLAEYFFQQLMLEADPATLEGKAHLATLAA
PLLEKIPGNNLRLLMRQRLSEITGLSGENIGQLAHHSPPPSSMDHGASGVLDGDDYFAASAYYENEPSHAPFDAAPGYVE
AQPRKSWNKDKKPWDGKKWDGKKKWDKGGRGDFKAPQRTPVSVESTTLNALRTLLHHPQLALKVDDAGTLAREQDTYAQL
LVSLLEALQKNPRQSSMQLIARWHGTPQGRLLQALGEKEWLIVQENLEKQFFDTITKLSESQRFGEREERLRSVMQKSYS
ELTDEEKALLREHYSVAASSPSQS
>P63964 2.7.7.101~~~dnaG~~~DNA primase~~~
MRIDQSIINEIKDKTDILDLVSEYVKLEKRGRNYIGLCPFHDEKTPSFTVSEDKQICHCFGCKKGGNVFQFTQEIKDISF
VEAVKELGDRVNVAVDIEATQSNSNVQIASDDLQMIEMHELIQEFYYYALTKTVEGEQALTYLQERGFTDALIKERGIGF
APDSSHFCHDFLQKKGYDIELAYEAGLLSRNEENFSYYDRFRNRIMFPLKNAQGRIVGYSGRTYTGQEPKYLNSPETPIF
QKRKLLYNLDKARKSIRKLDEIVLLEGFMDVIKSDTAGLKNVVATMGTQLSDEHITFIRKLTSNITLMFDGDFAGSEATL
KTGQHLLQQGLNVFVIQLPSGMDPDEYIGKYGNDAFTTFVKNDKKSFAHYKVSILKDEIAHNDLSYERYLKELSHDISLM
KSSILQQKAINDVAPFFNVSPEQLANEIQFNQAPANYYPEDEYGGYDEYGGYIEPEPIGMAQFDNLSRQEKAERAFLKHL
MRDKDTFLNYYESVDKDNFTNQHFKYVFEVLHDFYAENDQYNISDAVQYVNSNELRETLISLEQYNLNDEPYENEIDDYV
NVINEKGQETIESLNHKLREATRIGDVELQKYYLQQIVAKNKERM
>O05338 2.7.7.101~~~dnaG~~~DNA primase~~~
MRIDQSIINEIKDKTDILDLVSEYVKLEKRGRNYIGLCPFHDEKTPSFTVSEDKQICHCFGCKKGGNVFQFTQEIKDISF
VEAVKELGDRVNVAVDIEATQSNSNVQIASDDLQMIEMHELIQEFYYYALTKTVEGEQALTYLQERGFTDALIKERGIGF
APDSSHFCHDFLQKKGYDIELAYEAGLLSRNEENFSYYDRFRNRIMFPLKNAQGRIVGYSGRTYTGQEPKYLNSPETPIF
QKRKLLYNLDKARKSIRKLDEIVLLEGFMDVIKSDTAGLKNVVATMGTQLSDEHITFIRKLTSNITLMFDGDFAGSEATL
KTGQHLLQQGLNVFVIQLPSGMDPDEYIGKYGNDAFTTFVKNDKKSFAHYKVSILKDEIAHNDLSYERYLKELSHDISLM
KSSILQQKAINDVAPFFNVSPEQLANEIQFNQAPANYYPEDEYGGYDEYGGYIEPEPIGMAQFDNLSRREKAERAFLKHL
MRDKDTFLNYYESVDKDNFTNQHFKYVFEVLHDFYAENDQYNISDAVQYVNSNELRETLISLEQYNLNGEPYENEIDDYV
NVINEKGQETIESLNHKLREATRIGDVELQKYYLQQIVAKNKERM
>P74893 2.7.7.101~~~dnaG~~~DNA primase~~~COG0358
MDTPRLHPETIAAVKERADIVDIVSEQVVLKKRGKDFVGLCPFHDDKSPSFTVSPAKQFYYCFSCGAGGNPIKFLMELGK
QSFSEVVLDLAKRYQVPVRTLEVQQHQELQRQLSRRERLYEVLAVATQFYEQSLRRPEGAAALDYLRRSRQLQESTIQKF
QLGYAPAQWASLATHLIEQKRFPADLVEEAGLVVARRNGQGYYDRFRDRLMIPIHDLQGRVVGFGGRTLTGEEPKYLNSP
ETTLFEKGKLLFGLDKARAAIAKQDQAVVVEGYFDVIALHAAGIDHAVASLGTALSRQQVKLLSRYSESNQIVLNFDADR
AGAKAAERAIGEVEDLAYQGQVQLRVLNLPGGKDADEYLQRHSVADYRELLARSPLWLDWQIDQLLRDRNLDQADQFQAV
VQAIVQLLGKLPNTPLRTHYVHQVAERLSQGEARTAVQLASDLRAQVRGQRWHGQASRWEKPGDVSIREQAEAQILKVYL
HCPRLRLAVRKTLHDREIQGFSLQPHRLLWQAIAEIEEAHLGFAAMYQVERGEGNGDDLAAIDLVPILRDRLDQLTGVSL
GGFLELSENDHADLTHPLPLLRGAVALVERLRCEKRCRHLLDSWARQSIHTFEHCIEQLLQAGIGEDVDAEAQITALHEQ
LNQEALHFQKLYYNERRYLQQLDQERCLNPQAFLGMTEHDATAIAPTTPQPISA
>P06567 ~~~dnaI~~~Primosomal protein DnaI~~~COG1484
MEPIGRSLQGVTGRPDFQKRLEQMKEKVMKDQDVQAFLKENEEVIDQKMIEKSLNKLYEYIEQSKNCSYCSEDENCNNLL
EGYHPKLVVNGRSIDIEYYECPVKRKLDQQKKQQSLMKSMYIQQDLLGATFQQVDISDPSRLAMFQHVTDFLKSYNETGK
GKGLYLYGKFGVGKTFMLAAIANELAEKEYSSMIVYVPEFVRELKNSLQDQTLEEKLNMVKTTPVLMLDDIGAESMTSWV
RDEVIGTVLQHRMSQQLPTFFSSNFSPDELKHHFTYSQRGEKEEVKAARLMERILYLAAPIRLDGENRRHP
>P9WNV9 ~~~dnaJ1~~~Chaperone protein DnaJ 1~~~COG0484
MAQREWVEKDFYQELGVSSDASPEEIKRAYRKLARDLHPDANPGNPAAGERFKAVSEAHNVLSDPAKRKEYDETRRLFAG
GGFGGRRFDSGFGGGFGGFGVGGDGAEFNLNDLFDAASRTGGTTIGDLFGGLFGRGGSARPSRPRRGNDLETETELDFVE
AAKGVAMPLRLTSPAPCTNCHGSGARPGTSPKVCPTCNGSGVINRNQGAFGFSEPCTDCRGSGSIIEHPCEECKGTGVTT
RTRTINVRIPPGVEDGQRIRLAGQGEAGLRGAPSGDLYVTVHVRPDKIFGRDGDDLTVTVPVSFTELALGSTLSVPTLDG
TVGVRVPKGTADGRILRVRGRGVPKRSGGSGDLLVTVKVAVPPNLAGAAQEALEAYAAAERSSGFNPRAGWAGNR
>P9WNV7 ~~~dnaJ2~~~Chaperone protein DnaJ 2~~~COG0484
MARDYYGLLGVSKNASDADIKRAYRKLARELHPDVNPDEAAQAKFKEISVAYEVLSDPDKRRIVDLGGDPLESAAAGGNG
FGGFGGLGDVFEAFFGGGFGGGAASRGPIGRVRPGSDSLLRMRLDLEECATGVTKQVTVDTAVLCDRCQGKGTNGDSVPI
PCDTCGGRGEVQTVQRSLLGQMLTSRPCPTCRGVGVVIPDPCQQCMGDGRIRARREISVKIPAGVGDGMRVRLAAQGEVG
PGGGPAGDLYVEVHEQAHDVFVREGDHLHCTVSVPMVDAALGVTVTVDAILDGLSEITIPPGTQPGSVITLRGRGMPHLR
SNTRGDLHVHVEVVVPTRLDHQDIELLRELKGRRDREVAEVRSTHAAAGGLFSRLRETFTGR
>Q56237 ~~~dnaJ2~~~Chaperone protein DnaJ 2~~~COG0484
MAAKKDYYAILGVPRNATQEEIKRAYKRLARQYHPDVNKSPEAEEKFKEINEAYAVLSDPEKRRIYDTYGTTEAPPPPPP
GGYDFSGFDVEDFSEFFQELFGPGLFGGFGRRSRKGRDLRAELPLTLEEAFHGGERVVEVAGRRVSVRIPPGVREGSVIR
VPGMGGQGNPPGDLLLVVRLLPHPVFRLEGQDLYATLDVPAPIAVVGGKVRAMTLEGPVEVAVPPRTQAGRKLRLKGKGF
PGPAGRGDLYLEVRITIPERLTPEEEALWKKLAEAYYARA
>P47442 ~~~~~~DnaJ-like protein MG200~~~COG0484
MAEQKRDYYEVLGITPDADQSEIKKAFRKLAKKYHPDRNNAPDAAKIFAEINEANDVLSNPKKRANYDKYGFDGVDGEPA
FNFQADVFQSFFEEIAKSGVFNNQTNPEQKEKKKRYHWFSKKPKQEQPEINLDHVVEQTIKKVQQNQNQNKDPDELRSKV
PGEVTASDWEALVGDTRYGYFDETGDWSWKGYFDEQGKWVWNEPVDSETSEVSVEPEPTPVAPEASFEEAQPEINAEPEA
SFESTPTPEPVAPEASFEEAQPEPTPIPEPIPTPVQVQPLLLDLNLFTIPTKATKDDLLFDNINLTTYEQVVDYLNSQAT
PNLAKTDGELQTIDGTNPLLLEQCKKIKKQAEQLFKKLFLKKQLPFITQPEVVEESKTSFDENNVNLVYFEKVPEILFIN
QQPKEVKYTRQVFDGLTNKTTSETITLEIQLLQTPKETVSAIFKGFGNDHGKGCGDLKIVFEKIKSPFFQVNEDGLHSAC
IIDPLVAYNGGIIDVFGPYTNFQVKVDGEIDINAIMKFEKLGIAKTKRKGDLFVHLYYSSVPKKKLTTNPQVQQFLELLQ
AEYELLQDNIKSLKYFKNNLVIPKKPLDQQSYQYLSQEPIS
>P50018 ~~~dnaJ~~~Chaperone protein DnaJ~~~COG0484
MAKADFYETLGVSKTADEKELKSAFRKLAMKFHPDKNPDDADSERKFKEINEAYETLKDPQKRAAYDRFGHAAFENGGMG
GGGMGGGGFANGGFSDIFEDIFGEMMGGGRARRSSGGRERGADLRYNMEITLEEAFAGKTAQIRVPTSITCDVCSGSGAK
PGTQPKTCATCQGSGRVRAAQGFFSVERTCPTCHGRGQTISDPCGKCHGQGRVTEERSLSVNIPSGIEDGTRIRLQGEGE
AGMRGGPAGDLYIFLSVRPHEFFQRDGADLYCTVPISMTTAALGGTFDVTTLDGTKSRVTVPEGTQPGKQFRLKGKGMPV
LRSAQTGDLYIQIQIETPQKLSKRQRELLQEFEQLSSKENNPESTGFFARMKEFFDG
>P17631 ~~~dnaJ~~~Chaperone protein DnaJ~~~COG0484
MSKRDYYEVLGVSKSASKDEIKKAYRKLSKKYHPDINKEAGSDEKFKEVKEAYETLSDDQKRAHYDQFGHTDPNQGFGGG
GFGGGDFGGFGFDDIFSSIFGGGTRRRDPNAPRQGADLQYTMTLSFEDAAFGKETTIEIPREETCETCKGSGAKPGTNPE
TCSHCGGSGQLNVEQNTPFGKVVNRRVCHHCEGTGKIIKNKCADCGGKGKIKKRKKINVTIPAGVDDGQQLRLSGQGEPG
INGGPAGDLFVVFHVRAHEFFERDGDDIYCEMPLTFAQAALGDEVEVPTLHGKVKLKIPAGTQTGTKFRLRGKGVQNVRG
YGQGDQHIVVRVVTPTNLTDKQKDIIREFAEVSGNLPDEQEMSFFDKVKRAFKGD
>P28616 ~~~dnaJ~~~Chaperone protein DnaJ~~~
MKKDYYEILGLSKGASKDEIKKAYRKIAIKYHPDRNQGNEEAASIFKEATQAYEILIDDNKKAKYDRFGHSAFEGGGFEG
FSGGFSGFSDIFEDFGDIFDSFFTGNKGQERNRKHAKGEDLGYNIEISLENAYFGYKNNINITRQMLCDSCLGKKSEKGT
SPSICNMCNGSGRVVQGGGFFRVTTTCSKCYGEGKIISNPCKSCKGKGSLTKQETIQLNIPPGIDNNQQIKMKGKGNVNP
DNQEYGDLYVKILIRSHKVFKRNGKDLYAMLPISFTQAALGKEVKIKTIASKEIKIHIPKGINNEEQILIKNAGMPILQT
EKFGNLILITKIKTPKNLNSNAIKLFENLGKELKDGDEIDLLKA
>Q05980 ~~~dnaJ~~~Chaperone protein DnaJ~~~
MKIDYYEALGVTRTADDKTLKAAFRKLAMQYHPDRNPDDPEAERKFKEIGEAYETLKDPQKRAAYDRFGHAAFENGGMGG
GFGNGFGGAGGFADIFEDIFGEMMGGGRRRSNGGRERGADLRYNMEVTLEEAYAGKTAQIRVPTSITCDECSGSGAKPGS
QPTTCTMCSGSGRVRAAQGFFSVERTCPGCNGRGQIIKDPCEKCHGQGRVTQERSLSVNIPAGIEDGTRIRLAGEGEAGL
RGGPAGDLYIFLSVKPHEFFQRDGADLYCKVPISMTTAALGGQFEVSTLDGTQTRVKVPEGTQNGKQFRLKGKGMPVLRQ
SVTGDLYIQIDIETPQNLSKRQRELLEEFEKLSWQENSPKSAGLFSRMKEFFEGIGE
>P30725 ~~~dnaJ~~~Chaperone protein DnaJ~~~COG0484
MANKDYYEVLGLEKGASDDEIKKAFRKLAIKYHPDKNRGNKEAEEKFKEINEAYQVLSDPDKKANYDRFGTADFNGGGGF
GDFSGGFGDFGDLGDIFNSFFGGGFSGGSSRARKDAPQRGNDMEYSISLTFEEAVFGVEKSINITRSENCETCGGTGAKK
GTSPKTCDKCGGTGTIRVQRNTPLGSFVTQSSCDKCGGRGTIISDPCHECHGAGHVRKKRKISVKIPAGVDTGNVIPLRG
QGEHGKNGGPAGDLYISIKVTPHKKFKREGFDIYIDTHISFPKAALGTDMTVPTIDGDVKYTIPAGTQSGTVFRLKGKGV
QRVNGGGRGNQYVKVIVDTPKALNDKQREALKMFMEASGEAKSEKKSGFKRFFE
>P08622 ~~~dnaJ~~~Chaperone protein DnaJ~~~COG0484
MAKQDYYEILGVSKTAEEREIRKAYKRLAMKYHPDRNQGDKEAEAKFKEIKEAYEVLTDSQKRAAYDQYGHAAFEQGGMG
GGGFGGGADFSDIFGDVFGDIFGGGRGRQRAARGADLRYNMELTLEEAVRGVTKEIRIPTLEECDVCHGSGAKPGTQPQT
CPTCHGSGQVQMRQGFFAVQQTCPHCQGRGTLIKDPCNKCHGHGRVERSKTLSVKIPAGVDTGDRIRLAGEGEAGEHGAP
AGDLYVQVQVKQHPIFEREGNNLYCEVPINFAMAALGGEIEVPTLDGRVKLKVPGETQTGKLFRMRGKGVKSVRGGAQGD
LLCRVVVETPVGLNERQKQLLQELQESFGGPTGEHNSPRSKSFFDGVKKFFDDLTR
>Q05646 ~~~dnaJ~~~Chaperone protein DnaJ~~~
MADKRDFYEILGVSKSATDAEIKKAYRQLAKKYHPDINKEDGAEAKFKEVQEAYEVLSDSQKRANYDQFGHAAFDQGAGG
FGGGFSGGFDDFGDIFSSFFGGGGGGQRRNPNGPMKGQDRFMSMRIDFMEAVFGANKSVTLNVDEECTSCHGSGAHSKDD
IKTCSRCGGTGQTVTQQRTPFGVFQSQATCPDCGGSGKTITKRCGECHGKGFNTKRVEVDIKIPAGIVTGQQLRVSGKGE
RGANGGPNGDLFIEIVVGTHKHFRREGNDIHINIPLSVIDATLGTEIEVPTVHGDVKLTIPAGTQPNTKFRLREKGVQDL
RSGRMGDQYVEVKLEVPTKLSRQQREHLEALKETEVKGDSVFDRFKKAFK
>O87778 ~~~dnaJ~~~Chaperone protein DnaJ~~~
MAEKRDYYDVLGVGRDASDDEIKKAYRKLSKKYHPDINKAPDAEAKFKEVTEAYEALSDPQKRAAYDQYGHAGMNGGFGG
GAGAGQGFGGFGGGAEGFGGFDDIFSSFFGGGARQQPNGPRQGSDLQYRMDLKFEEAVFGKETKISYSREAECHTCHGSG
AKPGTSAETCHKCHGAGQIQVERQTPLGRMMSRETCDVCGGTGKEIKSKCDTCHGTGREEERHTVKVKVPAGVEDGQQMR
LQGQGEAGSNGGPYGDLFIVFRVAPSDEFERDGAQIFVEVPISFVQAALGDEIEVNTVHGPVKLKIPAGTQTNTVFRLRG
KGAPKLHGTGNGDQKVTVNVVTPKSLNSKQRDALKAFAVASGDSVNPQDNNLFDKILNKKHKK
>Q6RSN5 ~~~dnaJ~~~Chaperone protein DnaJ~~~COG0484
MAKADFYETLGVSKTADEKELKSAFRKLAMKYHPDKNPDDADSERKFKEINEAYETLKDPQKRAAYDRFGHAAFENGGMG
GGGGGFGGGGFANGGFSDIFEDIFGEMMGGGRARRSSGGRERGADLRYNMEITLEEAFTGKTAQIRVPTSITCDVCSGSG
AKPGTQPKTCATCQGSGRVRAAQGFFSVERTCPTCHGRGQTISDPCGKCHGQGRVTEERSLSVNIPSGIEDGTRIRLQGE
GEAGMRGGPAGDLYIFLSVRPHEFFQRDGADLYCTVPISMTTAALGGTFDVTTLDGTKSRVTVPEGTQPGKQFRLKGKGM
PVLRSAQTGDLYIQIQIETPQKLSKRQRELLQEFEQLSSKENNPESTGFFARMKKFFDG
>P63971 ~~~dnaJ~~~Chaperone protein DnaJ~~~
MAKRDYYEVLGISKDASKDEIKKAYRKLSKKYHPDINKEEGADEKFKEISEAYEVLSDDNKRASYDQFGHDGPQGFGGQG
FNGSDFGGFSGFGGGGFEDIFSSFFGGGRQRDPNAPQKGDDLQYTMTLTFEEAVFGTTKEISIRKDVTCETCHGDGAKPG
TSKKTCSYCNGAGHVAVEQNTILGRVRTEQVCPKCNGSGQEFEEACPTCHGKGTENKTVKLEVKVPEGVDNEQQIRLAGE
GSPGVNGGPAGDLYVVFRVKPSETFKRDGDDIYYKLNVSFPQAALGDEIKIPTLNNEVMLTIPAGTQTGKQFRLKEKGIK
NVHGYGYGDLYVDIKVVTPTKLTDRQKELMKEFAQLNGEEINEQPSNFKDRAKRFFKGE
>P95830 ~~~dnaJ~~~Chaperone protein DnaJ~~~COG0484
MNNTEFYDRLGVSKNASADEIKKAYRKLSKKYHPDINKEPGAEDKYKEVQEAYETLSDDQKRAAYDQYGAAGANGGFGGA
GGFGGFNGAGGFGGFEDIFSSFFGGGGSSRNPNAPRQGDDLQYRVNLTFEEAIFGTEKEVKYHREAGCRTCNGSGAKPGT
SPVTCGRCHGAGVINVDTQTPLGMMRRQVTCDVCHGRGKEIKYPCTTCHGTGHEKQAHSVHVKIPAGVETGQQIRLAGQG
EAGFNGGPYGDLYVVVSVEASDKFEREGTTIFYNLNLNFVQAALGDTVDIPTVHGDVELVIPEGTQTGKKFRLRSKGAPS
LRGGAVGDQYVTVNVVTPTGLNDRQKVALKEFAAAGDLKVNPKKKGFFDHIKDAFDGE
>P22358 ~~~dnaK2~~~Chaperone protein DnaK2~~~COG0443
MGKVVGIDLGTTNSCVAVMEGGKPTVIANAEGFRTTPSVVGYAKNGDRLVGQIAKRQAVMNPGNTFYSVKRFIGRKFDEI
TNEATEVAYSVVKDGNGNVKLDCPAQGKQFAPEEISAQVLRKLVDDASKYLGETVTQAVITVPAYFNDSQRQATKDAGKI
AGIEVLRIINEPTAASLAYGLDKKDNETILVFDLGGGTFDVSILEVGEGVFEVLATSGDTHLGGDDFDKKIVDFLAGEFQ
KAEGIDLRKDKQALQRLTEAAEKAKIELSGVSQTEINLPFITATQDGPKHLDTTLSRAKFEEICSDLIDRCGIPVENAIR
DAKIDKSALDEIVLVGGSTRIPAVQEVVKKILGKDPNQGVNPDEVVAVGAAIQGGVLSGEVKDILLLDVSPLSLGVETLG
GVMTKIIPRNTTIPTKKSETFSTAVDGQSNVEIHVLQGEREMANDNKSLGTFRLDGIPPAPRGVPQIEVTFDIDANGILN
VTAKDRGTGKEQSISITGASTLPDTEVDRMVKEAESNAAADKERREKIDRKNQADSLVYQAEKQITELGDKVPAADKIKA
EGLIKDLKEAVAQEDDAKIQTVMPELQQVLYSIGSNMYQQAGAEAGVGAPGAGPEAGTSSGGGDDVIDAEFSEPEK
>P50019 ~~~dnaK~~~Chaperone protein DnaK~~~COG0443
MAKVIGIDLGTTNSCVAVMDGKDTKVIENAEGARTTPSMVAFSDDGERLVGQPAKRQAVTNPTNTLFAVKRLIGRRYEDP
TVEKDKALVPFEIVKGDNGDAWVKAQDKNYSPSQISAMILQKMKETAESYLGEKVEKAVITVPAYFNDAQRQATKDAGRI
AGLDVLRIINEPTAAALAYGLDKKEGKTIAVYDLGGGTFDISVLEIGDGVFEVKSTNGDTFLGGEDFDMRLVEYLAGEFK
KDQGIDLKNDKLALQRLKEAAEKAKIELSSSQQTEINLPFITADASGPKHLTLKLTRAKFESLVDDLVQRTVAPCKAALK
DAGVTAAEIDEVVLVGGMSRMPKVQEVVKQLFGKEPHKGVNPDEVVAMGAAIQAGVLQGDVKDVLLLDVTPLSLGIETLG
GVFTRLIDRNTTIPTKKSQTFSTAEDNQSAVTIRVSQGEREMAQDNKLLGQFDLVGLPPSPRGVPQIEVTFDIDANGIVQ
VSAKDKGTGKEQQIRIQASGGLSDADIEKMVKDAEANAEADKKRRAGVEAKNQAESLIHSTEKSVKEYGDKVSETDRKAI
EDAIASLKTAVEAAEPDADDIQAKTQTLMEVSMKLGQAIYEAQQAEAGDASAEGKDDVVDADYEEIKDDKKSA
>O52960 ~~~dnaK~~~Chaperone protein DnaK~~~
MGKVIGIDLGTTNSCFAVLEGGKPLVIENVEGGRTTPSIVAFTKEKERLVGQLAKRQAVTNAENTIYSIKRFIGRRWEDT
EQERNRVSYHCVPGRDKTVDVKCWGKQYTPQELSAMILQTLKAGAEAYLNETVTEAVITVPAYFTDAQRQATKDAGTIAG
LNVMRIINEPTAAALAFGLDKQEQREQVLVYDLGGGTLDVSILQLGDGVFEVKATAGNNHLGGDDFDNVIVEWMLSEFQA
QEGINLNEDKMAMQRLREAAERAKIELSTRPTTSINLPFITAFSKGGGEVGPKHLKLNLSRAKFNELSRPLVEKTIDPLK
QAIEDSGLSVDQIDRILLVGGSTRIPEVQEALKKFFNGKEPDRSINPDEAVALGAAIQGGVLGQEQEVEDLLLLVVIPLS
LGIETLGEVFTKVIERNTTIPTSKSQVFSTASDGQTSVEIHVLQGERAMANDNKTLGKFLLTGIPAAPRGVPQIEVSFDI
DVNGILKVSAEDKGTGREQGIVIKETGGLSQQEIERMQQEAEQYADEDRKRMQRIELSNQADSLFYSHEATLKDNQGLIP
QKVKATANKKKEELMAILEDPNVELGVIQTRLEDYRQAVLSMGSEVYSQGSSKSSARDYETVGEEEMTRETGNSQESNAA
FSTQPEGTVNNDNLELEAVFSTYNDHTQGSDVATSSHDEIDLGDASDNSQTNFDLDDEDFNPFEDFDEEEEASTDDYEAV
E
>P17820 ~~~dnaK~~~Chaperone protein DnaK~~~COG0443
MSKVIGIDLGTTNSCVAVLEGGEPKVIANAEGNRTTPSVVAFKNGERQVGEVAKRQSITNPNTIMSIKRHMGTDYKVEIE
GKDYTPQEVSAIILQHLKSYAESYLGETVSKAVITVPAYFNDAERQATKDAGKIAGLEVERIINEPTAAALAYGLDKTDE
DQTILVYDLGGGTFDVSILELGDGVFEVRSTAGDNRLGGDDFDQVIIDHLVSEFKKENGIDLSKDKMALQRLKDAAEKAK
KDLSGVSSTQISLPFITAGEAGPLHLELTLTRAKFEELSSHLVERTMGPVRQALQDAGLSASEIDKVILVGGSTRIPAVQ
EAIKKETGKEAHKGVNPDEVVALGAAIQGGVITGDVKDVVLLDVTPLSLGIETMGGVFTKLIDRNTTIPTSKSQVFSTAA
DNQTAVDIHVLQGERPMSADNKTLGRFQLTDIPPAPRGVPQIEVSFDIDKNGIVNVRAKDLGTGKEQNITIKSSSGLSDE
EIERMVKEAEENADADAKKKEEIEVRNEADQLVFQTEKTLKDLEGKVDEEQVKKANDAKDALKAAIEKNEFEEIKAKKDE
LQTIVQELSMKLYEEAAKAQQAQGGANAEGKADDNVVDAEYEEVNDDQNKK
>P0C922 ~~~dnaK~~~Chaperone protein DnaK~~~
MGKIIGIDLGTTNSCVAIMEHGKPVVIQNSEGGRTTPSIVAYTNKGERLVGQVAKNQMVTNPENTIYSIKRFMGRRFEEV
ASEIKMVPYKIEKGLNGDARVNISNIKKQMSPPEISAATLTKMKETAEAYLGEKVTEAVITVPAYFNDAQRQATKDAGKI
AGLEVKRIVNEPTAAALAYGIEKKHEEIVAVYDLGGGTFDISILELGDGVFEVKSTNGDTHLGGDNFDDEIIKHLISEFK
KESAIDLSNDKMALQRLKEAAEKAKIELSGAQEASINLPFITADANGPKHLQYTLTRAKFEQMVDHLVQKTKEPCLKAIK
DAGLKASDINEVILVGGSTRIPAIQKIVKDIFGQDPNKGVNPDEAVAIGAAIQGGILTGETKDMVLLDVTPLSLGIETLG
GVMTKLIERNTTIPTKKSQVFSTAADNQTSVDIKVLQGEREMAAQNRILGNFILDGIPAAPRGVPQIEVSFDIDANGIVH
VSAKDMGTGKEQKIRIESSSGLSESEIDRMVKDAEAHAEEDKKLKENIEAKNTANSLIYQTEKSLKEYSEKISSEDKEAI
ESKIKELKESLEKEDISLIKSRTEELQKASYKIAEMMYKDSSQQNANSQQENGPQSNTSEEGKEADYEVVDEDKK
>B7J282 ~~~dnaK~~~Chaperone protein DnaK~~~
MGKIIGIDLGTTNSCVAIMEHGKPVVIQNSEGGRTTPSIVAYTNKGERLVGQVAKNQMVTNPENTIYSIKRFMGRRFEEV
ASEIKMVPYKIEKGLNGDARVNISNIKKQMSPPEISAATLTKMKETAEAYLGEKVTEAVITVPAYFNDAQRQATKDAGKI
AGLEVKRIVNEPTAAALAYGIEKKHEEIVAVYDLGGGTFDISILELGDGVFEVKSTNGDTHLGGDNFDDEIIKHLISEFK
KESAIDLSNDKMALQRLKEAAEKAKIELSGAQEASINLPFITADANGPKHLQYTLTRAKFEQMVDHLVQKTKEPCLKAIK
DAGLKASDINEVILVGGSTRIPAIQKIVKDIFGQDPNKGVNPDEAVAIGAAIQGGILTGETKDMVLLDVTPLSLGIETLG
GVMTKLIERNTTIPTKKSQVFSTAADNQTSVDIKVLQGEREMAAQNRILGNFILDGIPAAPRGVPQIEVSFDIDANGIVH
VSAKDMGTGKEQKIRIESSSGLSESEIDRMVKDAEAHAEEDKKLKENIEAKNTANSLIYQTEKSLKEYSEKISSEDKEAI
ESKIKELKESLEKEDISLIKSRTEELQKASYKIAEMMYKDSSQQNANSQQENGPQSNTSEEGKEADYEVVDEDKK
>Q05981 ~~~dnaK~~~Chaperone protein DnaK~~~
MAKVIGIDLGTTNSCVAVMDGKNAKVIENAEGARTTPSIIAFTDGDERLAGQPAKRQAVTNPEGTLFAVKRLIGRRYDDP
MVTKDKDLVPYKIVKGDNGDAWVEVHGKKYSPSQISAMILQKMKETAESYLGETVTQAVITVPAYFNDAQRQATKDAGKI
AGLEVLRIINEPTAAALAYGLDKSEGKTIAVYDLGGGTFDVSVLEIGDGVFEVKSTNGDTFLGGEDFDIRLVEYLVAEFK
KESGIDLKNDKLALQRLKEAAEKAKIELSSSQQTEINLPFITADQTGPKHLAIKLSRAKFESLVDDLVQRTVEPCKAALK
DAGLKAGEIDEVVLVGGMTRMPKIQEVVKAFFGKEPHKGVNPDEVVAMGAAIQGGVLQGDVKDVLLLDVTPLSLGIETLG
GVFTRLIERNTTIPTKKSQTFSTAEDNQSAVTIRVFQGEREMAADNKLLGQFDLVGIPPAPRGVPQIEVTFDIDANGIVN
VSAKDKGTGKEHQIRIQASGGLSDADIEKMVKDAEANAEADKKRRESVEAKNQAESLVHSTEKSLAEYGDKVSADDKKAI
EDAIAALKTSLEGEDAEDIKAKTQALAEVSMKLGQAMYEAAQAAEGAGAEGGEQASSSKDDVVDADYEEIDDNKKSS
>P20442 ~~~dnaK~~~Chaperone protein DnaK~~~COG0443
MSKIIGIDLGTTNSCVAIMDGKTPKVIENAEGARTTPSVVAFLEDGERLIGQPAKRQAVTNPTNTLFAIKRLIGRTASDP
VVEKDKGMVPYEIVKGPTGDAWVKAHGKDYSPQEVSAFILQKMKEAAEAHLGEPVTKAVITVPAYFNDAQRQATKDAGKI
AGLEVLRIINEPTAAALAYGLDKNDGKKIAVYDLGGGTFDVSILEIGDGVFEVKSTNGDTFLGGEDFDLRIVDYLADEFK
KEQGVDLRKDKLALQRLREEAEKAKKELSSTAQYEVNLPFISMNASGPLHLNIKLSRAKLEALVDDLIARTIGPCEQALK
DAGLKKSDIDEVILVGGMSRMPKVQQAVQDFFGREPHKGVNPDEVVALGAAVQAGVLQGDVKDVLLLDVTPLTLGIETLG
GVFTPLIERNTTIPTKRSQTFSTADDNQSAVTIRVFQGERPMAQDNKMLGQFDLVGIPPAPRGVPQIEVTFDIDANGIVQ
VHAKDKATNKEQSIRIQANGGLSDSDIERMVKEAEANKAADEKKKQLVEAKNQGEAILHSTEKALAEHGDKVGAAEKTAI
ETGITELKTALEGEDVEAIQAKTQALIQASMKLGEAMYAAQQGSAEGGDAKADDGVVDAEFEEVDDNKPAA
>P30721 ~~~dnaK~~~Chaperone protein DnaK~~~COG0443
MSKVIGIDLGTTNSCVAVMEGGDPAVIANSEGARTTPSVVSFQKNGERLVGQVAKRQSITNPDKTIISIKRKMGTAEKVA
IDDKNYTPQEISAMILQKLKADAEAYLGETVTQAVITVPAYFNDSQRQATKDAGKIAGLEVLRIINEPTAASLAYGLDKM
DTNQKILVYDLGGGTFDVSVLELGDGVFEVKSTNGNTHLGGDDFDEKIMDYIAEEFKKDNGIDLRNDKMALQRLKEAAEK
AKIELSSSTQTNINLPFITADATGPKHIDMNLTRAKFNELTEGLVQDTIEPMKKALSDAGLSINDIDKIVLVGGSTRIPA
VQEAVKNYTGKDPSKGVNPDECVAIGAAIQAGVLTGDVKDVLLLDVSPLTLGIETLGGVATPLIERNTTIPTRKSQVFST
AADNQPSVEINIVQGERKMAADNKSLGRFTLDGIAPAPRGVPQIEVTFDIDANGIVNVSAKDKGTGKESHITITASTNLS
DEEIDKAVKDAEKFAEEDKKKKENIEVKNNADQIVFQTDKALKDLGDKVSAEDKSNIEAKKEALSKVKDGDDIEAIKKAT
EDLTQALYAITTKMYEQSGAQGAPGADPNAGASQKTNGGADDNVVDADFKVDNDK
>O87712 ~~~dnaK~~~Chaperone protein DnaK~~~COG0443
MAEIIGIDLGTTNSCVAVMEGGKVRVIENAEGSRTTPSIVAYTKDGEVLVGASAKRQAVTNADRTLYAIKRLIGRRFDDN
VVQKDIKMVPYKIIKADNGDAWVEVKDKEGKSQKLAPPQISAQVLIKMKKTAEDYLGHEVKDAVITVPAYFNDSQRQATK
DAGKIAGLNVKRIINEPTAAALAYGMDKKKGDRKIAVYDLGGGTFDISIIEIAEVDGEHQFEVLATNGDTFLGGEDFDLR
LIDYLAGEFKKDEGVDLHNDPLALQRLKEAAEKAKIELSSSQQTDVNLPYITADASGPKHLNIRLTRAKLESLVEDLVER
TIEPCKVAIKDAGLKVSEIDDVILVGGQTRMPKVQEAVKNFFGKEARKDVNPDEAVAIGAAIQGAVLSGEVKDVLLLDVT
PLSLGIETLGGVMTKLIEKNTTIPTKANQVFSTADDNQTAVTVHVLQGEREMASANKSLGRFDLSDIPPAPRGVPQIEVT
FDIDANGILHVSAKDKATGKEQSIVIKASSGLSDEEVEKMVKDAEAHRDSDRKFHELVDARNQADAMIHAAEKSVKDLGS
EVSADEKSAIEKAVNELKEAMKGNDKDAIEAKTKALTEHSSKLAERVYAKKGGAAGAPPGGEAEGEPQAQAGGKKEDVVD
AEFEEVKDEKKKDEDK
>Q9RY23 ~~~dnaK~~~Chaperone protein DnaK~~~COG0443
MAKAVGIDLGTTNSVIAVMEGGRPEVIVNAEGGRTTPSVVAYKGDEILVGQIARRQAALNPAATLFEVKRFIGRRWDEVK
EEAARSPFTVKEGPSGSVRIEVNGKDLAPEQVSAEVLRKLVSDASAKLGNKITDAVITVPAYFDNSQREATRQAGEIAGL
NVLRVINEPTAAALAYGLERKGNETVLVFDLGGGTFDVTILELGDGVFEVKSTAGDTHLGGADFDYRIVDWLAGEFQKEH
NFDLRKDKQALQRLIEAAEKAKIDLSNASETSISLPFITFDPETRTPMHLERSLSRAKFEELTADLLRRVRQPVEQALSD
AKLSAGDINEVILVGGSTRIPAVKRIVQDLVGKTPNESVNPDEAVALGAAVQAGIIQGDSSLGDIVLVDVTPLTLGVEVK
GGMIAPMITRNTTVPAKKTEIYTTAENNQPGVEINVLQGERPMAADNKSLGRFKLEGIPPMPAGRAQIEVTFDIDANGIL
HVTAKEKTSGKESSITIENTTTLDKTDVERMVQEAEQNAAADKQRKEKVEKRNNLDSLRVQAVQQLEEQEGAAQDAKDRL
KAAADEAEEAVRSEDDSKIADAQKKLEEELRSFMTANQASTQGQPEGTQAQANKADDDVIDADFKPAE
>P0A6Y8 ~~~dnaK~~~Chaperone protein DnaK~~~COG0443
MGKIIGIDLGTTNSCVAIMDGTTPRVLENAEGDRTTPSIIAYTQDGETLVGQPAKRQAVTNPQNTLFAIKRLIGRRFQDE
EVQRDVSIMPFKIIAADNGDAWVEVKGQKMAPPQISAEVLKKMKKTAEDYLGEPVTEAVITVPAYFNDAQRQATKDAGRI
AGLEVKRIINEPTAAALAYGLDKGTGNRTIAVYDLGGGTFDISIIEIDEVDGEKTFEVLATNGDTHLGGEDFDSRLINYL
VEEFKKDQGIDLRNDPLAMQRLKEAAEKAKIELSSAQQTDVNLPYITADATGPKHMNIKVTRAKLESLVEDLVNRSIEPL
KVALQDAGLSVSDIDDVILVGGQTRMPMVQKKVAEFFGKEPRKDVNPDEAVAIGAAVQGGVLTGDVKDVLLLDVTPLSLG
IETMGGVMTTLIAKNTTIPTKHSQVFSTAEDNQSAVTIHVLQGERKRAADNKSLGQFNLDGINPAPRGMPQIEVTFDIDA
DGILHVSAKDKNSGKEQKITIKASSGLNEDEIQKMVRDAEANAEADRKFEELVQTRNQGDHLLHSTRKQVEEAGDKLPAD
DKTAIESALTALETALKGEDKAAIEAKMQELAQVSQKLMEIAQQQHAQQQTAGADASANNAKDDDVVDAEFEEVKDKK
>Q5KWZ7 ~~~dnaK~~~Chaperone protein DnaK~~~COG0443
MSKIIGIDLGTTNSCVAVLEGGEVKVIPNPEGNRTTPSVVAFKNGERLVGEVAKRQAITNPNTIISIKRHMGTDYKVEIE
GKQYTPQEISAIILQYLKSYAEDYLGEPVTRAVITVPAYFNDAQRQATKDAGRIAGLEVERIINEPTAAALAYGLDKEED
QTILVYDLGGGTFDVSILELGDGVFEVKATAGDNHLGGDDFDQVIIDYLVNQFKQEHGIDLSKDKMALQRLKDAAEKAKK
ELSGVTQTQISLPFISANENGPLHLEMTLTRAKFEELSAHLVERTMGPVRQALQDAGLTPADIDKVILVGGSTRIPAVQE
AIKRELGKEPHKGVNPDEVVAIGAAIQGGVIAGEVKDVVLLDVTPLSLGIETMGGVFTKLIERNTTIPTSKSQVFTTAAD
NQTTVDIHVLQGERPMAADNKSLGRFQLTGIPPAPRGVPQIEVTFDIDANGIVHVRAKDLGTNKEQSITIKSSSGLSEEE
IQRMIKEAEENAEADRKRKEAAELRNEADQLIFMTDKTLKEVEGKVSADEIKKAQDAKEALKAALEKNDIDDIRKKKDAL
QEAVQQLSIKLYEQAAKQAQSAGSQGGAANHKDNVVDAEFEEVNDDK
>O87777 ~~~dnaK~~~Chaperone protein DnaK~~~
MSKVIGIDLGTTNSAVAVLEGGQPKIITNPEGARTTPSVVSFKNGEIQVGEVAKRQAITNPDTIASIKRHIGEAGYKVTV
GDKSYTPQEVSAMILQYIKKFAEDYLGEEVTEAVITVPAYFNDSQRQATKDAGKIAGLDVKRIINEPTASALAYGLDKTE
TDEKVLVYDLGGGTFDVSVLELGDGVFQVLSTNGDTRLGGDDFDEAIMNWLVENFKSDNGIDLSKDKMAMQRLKDAAEKA
KKDLSGVSSTQISLPFISAGENGPLHLEMTLSRTEFDRLTSDLVDRTKAPVMNALKDAGLDANEIDKVILNGGSTRIPAV
QEAVKNWTGKEPDHSINPDEAVALGAAVQGGVISGDVKDVVLLDVTPLSLGIETMGGVFTKLIDRNTTIPTSKAQTFSTA
ADNQPAVDIHVLQGERPMAADNKTLGRFQLTDIPAAPRGVPQIEVKFDIDKNGIVNVSAKDLGTNKEQKITIKSNSGLSD
EEIDRMMKEAQENEEADTKRKEEVDLKNDVDQLIFQTDKTLKELEGKVSDEELQKAKDAKEELVKAQQENNLEDMKTKRD
ALSEIVQELTVKLYQQAQEAQQAAGGAEGNATDAKTDDGTVDGDFEEVKDDKE
>A1KFH2 ~~~dnaK~~~Chaperone protein DnaK~~~
MARAVGIDLGTTNSVVSVLEGGDPVVVANSEGSRTTPSIVAFARNGEVLVGQPAKNQAVTNVDRTVRSVKRHMGSDWSIE
IDGKKYTAPEISARILMKLKRDAEAYLGEDITDAVITTPAYFNDAQRQATKDAGQIAGLNVLRIVNEPTAAALAYGLDKG
EKEQRILVFDLGGGTFDVSLLEIGEGVVEVRATSGDNHLGGDDWDQRVVDWLVDKFKGTSGIDLTKDKMAMQRLREAAEK
AKIELSSSQSTSINLPYITVDADKNPLFLDEQLTRAEFQRITQDLLDRTRKPFQSVIADTGISVSEIDHVVLVGGSTRMP
AVTDLVKELTGGKEPNKGVNPDEVVAVGAALQAGVLKGEVKDVLLLDVTPLSLGIETKGGVMTRLIERNTTIPTKRSETF
TTADDNQPSVQIQVYQGEREIAAHNKLLGSFELTGIPPAPRGIPQIEVTFDIDANGIVHVTAKDKGTGKENTIRIQEGSG
LSKEDIDRMIKDAEAHAEEDRKRREEADVRNQAETLVYQTEKFVKEQREAEGGSKVPEDTLNKVDAAVAEAKAALGGSDI
SAIKSAMEKLGQESQALGQAIYEAAQAASQATGAAHPGGEPGGAHPGSADDVVDAEVVDDGREAK
>P47547 ~~~dnaK~~~Chaperone protein DnaK~~~COG0443
MSADNGLIIGIDLGTTNSCVSVMEGGRPVVLENPEGKRTTPSIVSYKNNEIIVGDAAKRQMVTNPNTIVSIKRLMGTSNK
VKVQNADGTTKELSPEQVSAQILSYLKDFAEKKIGKKISRAVITVPAYFNDAERNATKTAGKIAGLNVERIINEPTAAAL
AYGIDKASREMKVLVYDLGGGTFDVSLLDIAEGTFEVLATAGDNRLGGDDWDNKIIEYISAYIAKEHQGLNLSKDKMAMQ
RLKEAAERAKIELSAQLETIISLPFLTVTQKGPVNVELKLTRAKFEELTKPLLERTRNPISDVIKEAKIKPEEINEILLV
GGSTRMPAVQKLVESMVPGKKPNRSINPDEVVAIGAAIQGGVLRGDVKDVLLLDVTPLTLSIETLGGVATPLIKRNTTIP
VSKSQIFSTAQDNQESVDVVVCQGERPMSRDNKSLGRFNLGGIQPAPKGKPQIEITFSLDANGILNVKAKDLTTQKENSI
TISDNGNLSEEEIQKMIRDAEANKERDNIIRERIELRNEGEGIVNTIKEILASPDAKNFPKEEKEKLEKLTGNIDAAIKA
NDYAKLKVEIENFKKWREEMAKKYNPTGEQGPQAK
>P19993 ~~~dnaK~~~Chaperone protein DnaK~~~COG0443
MARAVGIDLGTTNSVVSVLEGGDPVVVANSEGSRTTPSTVAFARNGEVLVGQPAKNQAVTNVDRTIRSVKRHMGSDWSIE
IDGKKYTAQEISARVLMKLKRDAEAYLGEDITDAVITTPAYFNDAQRQATKEAGQIAGLNVLRIVNEPTAAALAYGLDKG
EREQTILVFDLGGGTFDVSLLEIGEGVVEVRATSGDNHLGGDDWDDRIVNWLVDKFKGTSGIDLTKDKMAMQRLREAAEK
AKIELSSSQSTSVNLPYITVDSDKNPLFLDEQLIRAEFQRITQDLLDRTRQPFQSVVKDAGISVSEIDHVVLVGGSTRMP
AVTDLVKELTGGKEPNKGVNPDEVVAVGAALQAGVLKGEVKDVLLLDVTPLSLGIETKGGVMTKLIERNTTIPTKRSETF
TTADDNQPSVQIQVYQGEREIASHNKLLGSFELTGIPPAPRGVPQIEVTFDIDANGIVHVTAKDKGTGKENTIKIQEGSG
LSKEEIDRMVKDAEAHAEEDRKRREEADVRNQAETLVYQTEKFVKEQRETENGSRVPEDTLNKVEAAVAEAKTALGGTDI
SAIKSAMEKLGQDSQALGQAIYEATQAASKVGGEASAPGGSNSTDDVVDAEVVDDERESK
>A0QQC8 ~~~dnaK~~~Chaperone protein DnaK~~~COG0443
MARAVGIDLGTTNSVVAVLEGGDPVVVANSEGSRTTPSVVAFARNGEVLVGQPAKNQAVTNVDRTIRSVKRHVGTDWNIE
IDDKKYTPQEISARVLMKLKRDAESYLGEDITDAVITVPAYFNDAQRQATKEAGQIAGLNVLRIVNEPTAAALAYGLDKG
EKEQTILVFDLGGGTFDVSLLEIGDGVVEVRATSGDNHLGGDDWDDRIVTWLVDKFKGSSGIDLTKDKMAMQRLREAAEK
AKIELSSSQSTSINLPYITVDADKNPLFLDEQLTRAEFQRITQDLLDRTRQPFQQVIKDAGISVSDIDHVVLVGGSTRMP
AVTDLVKELTGGKEPNKGVNPDEVVAVGAALQAGVLKGEVKDVLLLDVTPLSLGIETKGGVMTKLIERNTTIPTKRSETF
TTADDNQPSVQIQVYQGEREIASHNKLLGSFELTGIPPAPRGVPQIEVTFDIDANGIVHVTAKDKGTGKENTIKIQEGSG
LSKEEIDRMIKDAEAHAEEDRKRREEADVRNQAESLVYQTEKFVAEQRGAASDGGGSKVPEETLAKVDSAIADAKKALEG
TDISAIKSAMEKLGVESQALGQAIYEATQAEQPAGGSDNGAPGDDNVVDAEVVDDDAGKENK
>P9WMJ9 ~~~dnaK~~~Chaperone protein DnaK~~~COG0443
MARAVGIDLGTTNSVVSVLEGGDPVVVANSEGSRTTPSIVAFARNGEVLVGQPAKNQAVTNVDRTVRSVKRHMGSDWSIE
IDGKKYTAPEISARILMKLKRDAEAYLGEDITDAVITTPAYFNDAQRQATKDAGQIAGLNVLRIVNEPTAAALAYGLDKG
EKEQRILVFDLGGGTFDVSLLEIGEGVVEVRATSGDNHLGGDDWDQRVVDWLVDKFKGTSGIDLTKDKMAMQRLREAAEK
AKIELSSSQSTSINLPYITVDADKNPLFLDEQLTRAEFQRITQDLLDRTRKPFQSVIADTGISVSEIDHVVLVGGSTRMP
AVTDLVKELTGGKEPNKGVNPDEVVAVGAALQAGVLKGEVKDVLLLDVTPLSLGIETKGGVMTRLIERNTTIPTKRSETF
TTADDNQPSVQIQVYQGEREIAAHNKLLGSFELTGIPPAPRGIPQIEVTFDIDANGIVHVTAKDKGTGKENTIRIQEGSG
LSKEDIDRMIKDAEAHAEEDRKRREEADVRNQAETLVYQTEKFVKEQREAEGGSKVPEDTLNKVDAAVAEAKAALGGSDI
SAIKSAMEKLGQESQALGQAIYEAAQAASQATGAAHPGGEPGGAHPGSADDVVDAEVVDDGREAK
>Q9K0N4 ~~~dnaK~~~Chaperone protein DnaK~~~
MAKVIGIDLGTTNSCLAISENGQTKVIENAEGARTTPSVIAYLDGGEILVGAPAKRQAVTNAKNTIYAAKRLIGHKFEDK
EVQRDIESMPFEIIKANNGDAWVKAQGKELSPPQISAEVLRKMKEAAEAYLGEKVTEAVITVPAYFNDSQRQATKDAGRI
AGLDVKRIINEPTAAALAFGMDKGDNKDRKVAVYDLGGGTFDISIIEIANLDGDKQFEVLATNGDTFLGGEDFDQRLIDH
IIAEFKKEQGIDLKQDVMALQRLKEAAEKAKIELSSGQQTEINLPYITMDATGPKHLAMKITRAKFESLVEDLITRSIEP
CKIALKDAGLSTGDIDDVILVGGQSRMPKVQEAVKAFFGKEPRKDVNPDEAVAVGAAIQGEVLSGGRSDVLLLDVTPLSL
GIETMGGVMTKLIQKNTTIPTKASQVFSTAEDNQSAVTIHVLQGERERASANKSLGQFNLGDIAPAPRGMPQIEVTFDID
ANGILHVSAKDKGTGKAANITIQGSSGLSEEEIERMVKDAEANAEEDKKLTELVASRNQAEALIHSVKKSLADYGDKLDA
AEKEKIEAALKEAEEAVKGDDKAAIDAKTEALGAASQKLGEMVYAQAQAEAQAGESEQANASAKKDDDVVDADFEEVKDD
KK
>P99110 ~~~dnaK~~~Chaperone protein DnaK~~~
MSKIIGIDLGTTNSCVTVLEGDEPKVIQNPEGSRTTPSVVAFKNGETQVGEVAKRQAITNPNTVQSIKRHMGTDYKVDIE
GKSYTPQEISAMILQNLKNTAESYLGEKVDKAVITVPAYFNDAERQATKDAGKIAGLEVERIINEPTAAALAYGLDKTDK
DEKVLVFDLGGGTFDVSILELGDGVFEVLSTAGDNKLGGDDFDQVIIDYLVAEFKKENGVDLSQDKMALQRLKDAAEKAK
KDLSGVSQTQISLPFISAGENGPLHLEVNLTRSKFEELSDSLIRRTMEPTRQAMKDAGLTNSDIDEVILVGGSTRIPAVQ
EAVKKEIGKEPNKGVNPDEVVAMGAAIQGGVITGDVKDVVLLDVTPLSLGIEILGGRMNTLIERNTTIPTSKSQIYSTAV
DNQPSVDVHVLQGERPMAADNKTLGRFQLTDIPPAERGKPQIEVTFDIDKNGIVNVTAKDLGTNKEQRITIQSSSSLSDE
EIDRMVKDAEVNAEADKKRREEVDLRNEADSLVFQVEKTLTDLGENIGEEDKKSAEEKKDALKTALEGQDIEDIKSKKEE
LEKVIQELSAKVYEQAAQQQQQAQGANAGQNNDSTVEDAEFKEVKDDDKK
>O06942 ~~~dnaK~~~Chaperone protein DnaK~~~COG0443
MSKIIGIDLGTTNSAVAVLEGTESKIIANPEGNRTTPSVVSFKNGEIIVGDAAKRQAVTNPETILSIKSKMGTSEKVSAN
GKEYTPQEISAMILQYLKGYAEDYLGEKVEKAVITVPAYFNDAQRQATKDAGKIAGLEVERIVNEPTAAALAYGLDKTDK
DEKILVFDLGGGTFDVSILELGDGVFDVLATAGDNKLGGDNFDQKVIDWLVEEFKKENGIDLSTDKMALQRLKDAAEKAK
KDLSGVTSTQISLPFITAGEAGPLHLETSLSRAKFDDLTRDLVERTKTPVRQALSDAGLSLSEIDEVILVGGSTRIPAVV
DAVKAETGKEPNKSVNPDEVVAMGAAIQGGVITGDVKDVVLLDVTPLSLGIETMGGVFTKLIDRNTTIPTSKSQVFSTAA
DNQPAVDIHVLQGERPMAADNKTLGRFQLTDIPAAPRGVPQIEVTFDIDKNGIVSVKAKDLGTQKEQTIVIQSNSGLTDE
EIDKMMKDAEANAEADAKRKEEVDLKNEVDQAIFTTEKTIKETEGKGFDTERDAAQAALDDLKKAQESGNLDDMKAKLEA
LNEKAQALAMKLYEQAAAAQQAQAGQEGAQSSDSDSSDKGGDDVVDGEFTEK
>Q5XAD6 ~~~dnaK~~~Chaperone protein DnaK~~~
MSKIIGIDLGTTNSAVAVLEGTESKIIANPEGNRTTPSVVSFKNGEIIVGDAAKRQAVTNPETVISIKSKMGTSEKVSAN
GKEYTPQEISAMILQYLKGYAEDYLGEKVEKAVITVPAYFNDAQRQATKDAGKIAGLEVERIVNEPTAAALAYGMDKTDK
DEKILVFDLGGGTFDVSILELGDGVFDVLATAGDNKLGGDDFDQKIIDFLVAEFKKENGIDLSQDKMALQRLKDAAEKAK
KDLSGVTQTQISLPFITAGSAGPLHLEMSLSRAKFDDLTRDLVERTKTPVRQALSDAGLSLSEIDEVILVGGSTRIPAVV
EAVKAETGKEPNKSVNPDEVVAMGAAIQGGVITGDVKDVVLLDVTPLSLGIETMGGVFTKLIDRNTTIPTSKSQVFSTAA
DNQPAVDIHVLQGERPMAADNKTLGRFQLTDIPAAPRGIPQIEVTFDIDKNGIVSVKAKDLGTQKEQHIVIKSNDGLSEE
EIDRMMKDAEANAEADAKRKEEVDLKNEVDQAIFATEKTIKETEGKGFDTERDAAQSALDELKAAQESGNLDDMKAKLEA
LNEKAQALAVKMYEQAAAAQQAAQGAEGAQANDSANNDDVVDGEFTEK
>Q56235 ~~~dnaK~~~Chaperone protein DnaK~~~COG0443
MAKAVGIDLGTTNSVIAVLEGGKPVVLENAEGERVTPSVVAFRDGETLVGRMAKRQAVLNPEGTIFEIKRFIGRRFEEVQ
EEAKRVPYKVVPGPDGGVRVEVKGKLYTPEEISAMILRKLVEDASKKLGEKITKAVITVPAYFNNAQREATANAGRIAGL
EVLRIINEPTAAALAYGLDKKGNETVLVFDLGGGTFDVTILEIGEGVFEVKATSGDTHLGGSDMDHAIVNWLAEEFKKEH
GVDLKADRQALQRLIEAAEKAKIELSSTLETTISLPFIALDPASKTPLHLEKKLTRAKFEELIQPLLKRLRGPVEQALKD
AGLTPAQIDEVILVGGATRVPAVQQVVRELLGKEPNRSVNPDEVVAMGAAIQAGVLMGEVRDVVLLDVTPLSLGVETKGG
VMTVLIPRNTTIPTRKCEIFTTAEHNQTAVEIHVLQGERPMAQDNKSLGRFRLEGIPPMPAGVPQIEVCFDIDANGILHV
TAKERSTGREASITIQNTTTLSEEEIQRIIEEAKRHAEEDRRRREHAELKNALDSARVQAERVLQERQGAPEARARLEAA
IGKAKELVERDAPDPELKAATEELLKAVEEYEKGAQAASGKGPDDVIDADYKPAD
>D3SGB1 3.1.-.-~~~~~~Deoxyribonuclease~~~COG1555
MMHLLRRGAFAILLIVLLPSAALADLRLASWNIQHLGWNVGKDYPAVARIAAQFDFLAIQEVMNAEGIYRLRDTLEDATG
AEWSVLYSDALGRNTYREKYAFLWREAAVEYVGGALTYIDEADRFAREPFSAVFRSRGTDQHFLAATVHITYGDRVADRV
EEIEALRRYWDWLADVMPEYAGERILFGDFNLPPHHDGWASMRAVAEPLVTEGATTLSTHDRRYANLYDNLWVPKDHTLP
LGDAGILPFPVVLSEVTGVYWDHEKARDRVSDHAPVYVLFEGNTLHDAVVAEIADQEAGCIDLNRASVSELTALPHIGEA
RAEAIKDGRPWNAVRDLKEIRGIGAGRLEEIKARGEACIEP
>P0A8J2 ~~~dnaT~~~Primosomal protein 1~~~
MSSRVLTPDVVGIDALVHDHQTVLAKAEGGVVAVFANNAPAFYAVTPARLAELLALEEKLARPGSDVALDDQLYQEPQAA
PVAVPMGKFAMYPDWQPDADFIRLAALWGVALREPVTTEELASFIAYWQAEGKVFHHVQWQQKLARSLQIGRASNGGLPK
RDVNTVSEPDSQIPPGFRG
>P67524 ~~~dnaT~~~Primosomal protein 1~~~
MSSRILTSDVIGIDVLLHDHHAVLAKSTGGAVAVFANNAPAFYAVTPARMAELLALEEKLSRPGSDVALDAQFYEEPEAA
PVAIPCGKFAMYPAWQPDADFQRQAALWGVALREPVTAEELAAFIAYWQAEGKVFHHIQWQQKLARSVQISRSSNGGMPQ
RDINSVSEPDNHIPPGFRG
>Q7V4A7 3.1.2.28~~~~~~1,4-dihydroxy-2-naphthoyl-CoA hydrolase~~~COG0824
MNPENWLLLRRVVRFGDTDAAGVMHFHQLFRWCHESWEESLESYGLNPADIFPGSRKSEVTPEVALPIIHCQADFRRPIH
TGDALAMELRPERLNPNSFQVHFEFRCEEQIAAHALIRHLAINAQTRHRCALPEGIDRWLEASGVGKIGSI
>Q55777 3.1.2.28~~~~~~1,4-dihydroxy-2-naphthoyl-CoA hydrolase~~~COG0824
MGTFTYERQVYLADTDGAGVVYFNQFLQMCHEAYESWLSSEHLSLQNIISVGDFALPLVHASIDFFAPAHCGDRLLVNLT
ITQASAHRFCCDYEISQAESAQLLARAQTHHVCIALPERKKAPLPQPWQTAICDLDHP
>Q6XGD8 2.7.7.-~~~dncV~~~Cyclic GMP-AMP synthase~~~
MPWDFNNYYSHNMDGLISKLKLSKTESDKLKALRQIVRERTRDVFQEARQVAIDVRRQALTLESVRLKLEKTNVRYLSPE
ERADLARLIFEMEDEARDDFIKFQPRFWTQGSFQYDTLNRPFHPGQEMDIDDGTYMPMTVFESEPSIGHTLLLLLVDTSL
KSLEAENDGWVFEEKNTCGRIKIYREKTHIDVPMYAIPKEQFQKKQTAADSAHLIKSDSVFESFALNRGGREAYAVESDK
VNLALREGVRRWSVSDPKIVEDWFNESCKRIGGHLRSVCRFMKAWRDAQWEVGGPSSISLMTAVVNILDRESHNGSDLTG
TMKLIARLLPEEFNRGVESPDDTDEKPLFPAESNHNVHHRAIVETMEGLYGILLAAEQSESREEALRKINEAFGKRVTNA
LLITSSAAAPAFLNAPSKEPSSKPINKTMVSG
>P0DTF0 2.7.7.-~~~dncV~~~Cyclic GMP-AMP synthase~~~
MHWDLNNYYSNNMDGLISKLKLSKTESTKLKELRQIVRERTRDVFKEARAVAADVKKHTLTLEGVRLKLGQTNVRYLSTA
DQAEVARLIFEMDDDARNDFINLQPRFWTQGSFQYDTLNKPFQPGQEMDIDDGTYMPMTVFESEPRIGHTLLLLLVDTSL
KSLEAENDGWRFEEKNTCGRIKIPHEKTHIDVPMYAIPKNQFQTKQTAADSAHILKSESIFESVALNRDSREAYLVESDK
VNLALREGAKRWSISDPKIVEDWFNDSCKRIGGHVRSICRFMKAWRDAQWDVGGPSSISLMTAVVNILNREEHNDSDLAG
TMKLVAKLLPDEFNRGLESPDDTDTKLLFPAEWDQNVHQKTIVETMKTLYEILVDAENANTREDALHKMNEAFGKRVTNA
QLITSIAAAPAFHVSPSREPEPRKINKTMVSG
>Q9KVG7 2.7.7.-~~~dncV~~~Cyclic GMP-AMP synthase~~~
MRMTWNFHQYYTNRNDGLMGKLVLTDEEKNNLKALRKIIRLRTRDVFEEAKGIAKAVKKSALTFEIIQEKVSTTQIKHLS
DSEQREVAKLIYEMDDDARDEFLGLTPRFWTQGSFQYDTLNRPFQPGQEMDIDDGTYMPMPIFESEPKIGHSLLILLVDA
SLKSLVAENHGWKFEAKQTCGRIKIEAEKTHIDVPMYAIPKDEFQKKQIALEANRSFVKGAIFESYVADSITDDSETYEL
DSENVNLALREGDRKWINSDPKIVEDWFNDSCIRIGKHLRKVCRFMKAWRDAQWDVGGPSSISLMAATVNILDSVAHDAS
DLGETMKIIAKHLPSEFARGVESPDSTDEKPLFPPSYKHGPREMDIMSKLERLPEILSSAESADSKSEALKKINMAFGNR
VTNSELIVLAKALPAFAQEPSSASKPEKISSTMVSG
>A0A096ZEC9 3.3.2.14~~~dnhA~~~2,4-dinitroanisole O-demethylase subunit alpha~~~
MSVTSQTSSSGSAAVSDCHRGIIDISGPVPGYEWEPSMTTEPVRGRVWTITDGVFRTLAIEGDTGVIAVDTFWSPGSARQ
YRRALQSHFPRKPVHTIIYTHDHLDHTGFGADFAPDADQILAHELTAEVIARRSSDGQLPATRTWSGERLEVSIDGAEFE
LIYPGPTHGTGNTALYFPNERFLYMADTVFTGPTYNIVPDFLWTSWIPNTRRLLGLDWDLYVPGHFWRLSRREFEADFEL
WDATAACALDALRAGVDIDNFADVKKFTYERMDEPFGSRTFRFDEFAAINVLTHMVHYQTGGWGLRDYEPYSNEPFKTTL
PQRLGSPL
>A0A096ZED0 3.3.2.14~~~dnhB~~~2,4-dinitroanisole O-demethylase subunit beta~~~
MTGRQRTTVVAPDRPVQDATISQLTTRVWTVAIDGYRTIVVEGETGIVAINSFGTPSAQTKYRELITQTFGDKPVVAVVA
SIDHLDHTGRLGPFANGAEVIGHELGQAIAFGRGLPEQKLADTVVTGPVTEIERAGVRLVLRYPAPTVGTGNLAVDLPDD
DVVFMVGLQSGARYGIFPDFHFKHFLRATSEIAALGRRYFVPGRSEVMDAGQVRQALEYVNDFQNACQRCLAGGEVPHWL
LEPTTAYLHDELSSKWSHLEGYDPVAVGLGGLRVVCHYYMGGWWLDDTDHHELLYDHLTVRTYREYRERLATAGTGRA
>P44121 6.5.1.1~~~ligA~~~DNA ligase~~~COG1793
MKFYRTLLLFFASSFAFANSDLMLLHTYNNQPIEGWVMSEKLDGVRGYWNGKQLLTRQGQRLSPPAYFIKDFPPFAIDGE
LFSERNHFEEISTITKSFKGDGWEKLKLYVFDVPDAEGNLFERLAKLKAHLLEHPTTYIEIIEQIPVKDKTHLYQFLAQV
ENLQGEGVVVRNPNAPYERKRSSQILKLKTARGEECTVIAHHKGKGQFENVMGALTCKNHRGEFKIGSGFNLNERENPPP
IGSVITYKYRGITNSGKPRFATYWREKK
>P9WNV5 6.5.1.1~~~ligB~~~DNA ligase B~~~COG1793
MLLHDVAITSMDVAATSSRLTKVARIAALLHRAAPDTQLVTIIVSWLSGELPQRHIGVGWAALRSLPPPAPQPALTVTGV
DATLSKIGTLPGKGSQAQRAALVAELFSAATEAEQTFLLRLLGGELRQGAKGGIMADAVAQAAGLPAATVQRAAMLGGDL
AAAAAAGLSGAALDTFTLRVGRPIGPMLAQTATSVHDALERHGGTTIFEAKLDGARVQIHRANDQVRIYTRSLDDVTARL
PEVVEATLALPVRDLVADGEAIALCPDNRPQRFQVTASRFGRSVDVAAARATQPLSVFFFDILHRDGTDLLEAPTTERLA
ALDALVPARHRVDRLITSDPTDAANFLDATLAAGHEGVMAKAPAARYLAGRRGAGWLKVKPVHTLDLVVLAVEWGSGRRR
GKLSNIHLGARDPATGGFVMVGKTFKGMTDAMLDWQTTRFHEIAVGPTDGYVVQLRPEQVVEVALDGVQRSSRYPGGLAL
RFARVVRYRADKDPAEADTIDAVRALY
>P15042 6.5.1.2~~~ligA~~~DNA ligase~~~COG0272
MESIEQQLTELRTTLRHHEYLYHVMDAPEIPDAEYDRLMRELRELETKHPELITPDSPTQRVGAAPLAAFSQIRHEVPML
SLDNVFDEESFLAFNKRVQDRLKNNEKVTWCCELKLDGLAVSILYENGVLVSAATRGDGTTGEDITSNVRTIRAIPLKLH
GENIPARLEVRGEVFLPQAGFEKINEDARRTGGKVFANPRNAAAGSLRQLDPRITAKRPLTFFCYGVGVLEGGELPDTHL
GRLLQFKKWGLPVSDRVTLCESAEEVLAFYHKVEEDRPTLGFDIDGVVIKVNSLAQQEQLGFVARAPRWAVAFKFPAQEQ
MTFVRDVEFQVGRTGAITPVARLEPVHVAGVLVSNATLHNADEIERLGLRIGDKVVIRRAGDVIPQVVNVVLSERPEDTR
EVVFPTHCPVCGSDVERVEGEAVARCTGGLICGAQRKESLKHFVSRRAMDVDGMGDKIIDQLVEKEYVHTPADLFKLTAG
KLTGLERMGPKSAQNVVNALEKAKETTFARFLYALGIREVGEATAAGLAAYFGTLEALEAASIEELQKVPDVGIVVASHV
HNFFAEESNRNVISELLAEGVHWPAPIVINAEEIDSPFAGKTVVLTGSLSQMSRDDAKARLVELGAKVAGSVSKKTDLVI
AGEAAGSKLAKAQELGIEVIDEAEMLRLLGS
>Q837V6 6.5.1.2~~~ligA~~~DNA ligase~~~COG0272
MEQQPLTLTAATTRAQELRKQLNQYSHEYYVKDQPSVEDYVYDRLYKELVDIETEFPDLITPDSPTQRVGGKVLSGFEKA
PHDIPMYSLNDGFSKEDIFAFDERVRKAIGKPVAYCCELKIDGLAISLRYENGVFVRGATRGDGTVGENITENLRTVRSV
PMRLTEPISVEVRGECYMPKQSFVALNEEREENGQDIFANPRNAAAGSLRQLDTKIVAKRNLNTFLYTVADFGPMKAKTQ
FEALEELSAIGFRTNPERQLCQSIDEVWAYIEEYHEKRSTLPYEIDGIVIKVNEFALQDELGFTVKAPRWAIAYKFPPEE
AETVVEDIEWTIGRTGVVTPTAVMAPVRVAGTTVSRASLHNADFIQMKDIRLNDHVIIYKAGDIIPEVAQVLVEKRAADS
QPYEMPTHCPICHSELVHLDEEVALRCINPKCPAQIKEGLNHFVSRNAMNIDGLGPRVLAQMYDKGLVKDVADLYFLTEE
QLMTLDKIKEKSANNIYTAIQGSKENSVERLIFGLGIRHVGAKAAKILAEHFGDLPTLSRATAEEIVALDSIGETIADSV
VTYFENEEVHELMAELEKAQVNLTYKGLRTEQLAEVESPFKDKTVVLTGKLAQYTREEAKEKIENLGGKVTGSVSKKTDI
VVAGEDAGSKLTKAESLGVTVWNEQEMVDALDASHF
>O87703 6.5.1.2~~~ligA~~~DNA ligase~~~
MDRQQAERRAAELRELLNRYGYEYYVLDRPSVPDAEYDRLMQELIAIEEQYPELKTSDSPTQRIGGPPLEAFRKVAHRVP
MMSLANAFGEGDLRDFDRRVRQEVGEAAYVCELKIDGLAVSVRYEDGYFVQGATRGDGTTGEDITENLKTIRSLPLRLKE
PVSLEARGEAFMPKASFLRLNEERKARGEELFANPRNAAAGSLRQLDPKVAASRQLDLFVYGLADAEALGIASHSEALDY
LQALGFKVNPERRRCANIDEVIAFVSEWHDKRPQLPYEIDGIVIKVDSFAQQRALGATAKSPRWAIAYKFPAEEVVTTLI
GIEVNVGRTGVVTPTAILEPVRVAGTTVQRATLHNEDFIREKDIRIGDAVIIKKAGDIIPEVVGVVVDRRDGDETPFAMP
THCPECESELVRLEGEVALRCLNPNCPAQLRERLIHFASRAAMNIEGLGEKVVTQLFNAGLVRDVADLYCLTKEQLVGLE
RMGEKSAANLLAAIEASKQNSLERLLFGLGIRYVGAKAAQLLAEHFETMERLERATKEELMAVPEIGEKMADAITAFFAQ
PEATELLQELRAYGVNMAYKGPKRSAEAPADSAFAGKTVVLTGKLASMSRNEAKEQIERLGGRVTGSVSRSTDLVIAGED
AGSKLEKAQQLGIEIWDESRFLQEINRGKR
>P43813 6.5.1.2~~~ligA~~~DNA ligase~~~COG0272
MTNIQTQLDNLRKTLRQYEYEYHVLDNPSVPDSEYDRLFHQLKALELEHPEFLTSDSPTQRVGAKPLSGFSQIRHEIPML
SLDNAFSDAEFNAFVKRIEDRLILLPKPLTFCCEPKLDGLAVSILYVNGELTQAATRGDGTTGEDITANIRTIRNVPLQL
LTDNPPARLEVRGEVFMPHAGFERLNKYALEHNEKTFANPRNAAAGSLRQLDPNITSKRPLVLNAYGIGIAEGVDLPTTH
YARLQWLKSIGIPVNPEIRLCNGADEVLGFYRDIQNKRSSLGYDIDGTVLKINDIALQNELGFISKAPRWAIAYKFPAQE
ELTLLNDVEFQVGRTGAITPVAKLEPVFVAGVTVSNATLHNGDEIERLNIAIGDTVVIRRAGDVIPQIIGVLHERRPDNA
KPIIFPTNCPVCDSQIIRIEGEAVARCTGGLFCAAQRKEALKHFVSRKAMDIDGVGGKLIEQLVDRELIHTPADLFKLDL
TTLTRLERMGAKSAENALNSLENAKSTTLARFIFALGIREVGEATALNLANHFKTLDALKDANLEELQQVPDVGEVVANR
IFIFWREAHNVAVVEDLIAQGVHWETVEVKEASENLFKDKTVVLTGTLTQMGRNEAKALLQQLGAKVSGSVSSKTDFVIA
GDAAGSKLAKAQELNITVLTEEEFLAQITR
>O25336 6.5.1.2~~~ligA~~~DNA ligase~~~COG0272
MIKSQKEYLERIAYLNTLSHHYYNLDEPIVSDAIYDELYQELKAYEEKNPNGIQANSPTQKVGATTTNSFNKNPHLMRMW
SLDDVFNQSELQAWLQRILKAYPSASFVCSPKLDGVSLNLLYQHGKLVKATTRGNGLEGELVSANAKHIANIPHAIAYNG
EIEIRGEVIISKKDFDALNQERLNANEPLFANPRNAASGSLRQLDSEITKKRKLQFIPWGVGKHSLNFLSFKECLDFIVS
LGFSAIQYLSLNKNHQEIEDNYHTLIREREGFFALLDGMVIVVNELNIQKELGYTQKSPKFACAYKFPALEKHTKIVGVI
NQVGRSGAITPVALLEPVEIAGAMINRATLHNYSEIEKKNIMLSDRVVVIRSGDVIPKIIKPLESYRDGSQHKIERPKVC
PICSHELLCEEIFTYCQNLNCPARLKESLIHFASKDALNIQGLGDKVIEQLFEEKLIFNALDLYALKLEDLMRLDKFKIK
KAQNLLDAILKSKNPPLWRLINALGIEHIGKGASKTLAKYGLNVLEKSEAEFLEMEGFGVEMARSLVNFYASNQEFIRSL
FELLNPKNSDMAEEKQKSSSVFNNKTIVLTGTLSKPRQEYAQMLENLGAKISSSVSAKTDFLIAGENPGSKLALAQKHGV
SVLNEEELLKRLKELD
>A0QUW7 6.5.1.2~~~ligA~~~DNA ligase A~~~COG0272
MSEKATGEVEAELPEHPDADERRRWQELADEVREHQFRYYVKDAPIISDAEFDKLLRELQALEDAHPELRTPDSPTQLVG
GAGFATEFAPAEHLERMLSLDNVFDSDELTAWAARISSETGDAAHFLCELKIDGVALSLVYRDGRLERGATRGDGRTGED
VTLNARTIDDIPERLTPSDEFPVPAVLEVRGEVFFRVADFEELNAGLVAEGKPPFANPRNSAAGSLRQKNPAVTARRKLR
MICHGIGYTEGFTPASLHDAYRALGAWGLPVSEHTTKVSTVAEVAERIAYWGEHRHDVEHEIDGVVVKVDEVALQRRLGA
TSRAPRWAVAYKYPPEEATTKLLDIRVNVGRTGRVTPFAYMEPVKVAGSTVGLATLHNASEVKRKGVLIGDTVVIRKAGD
VIPEVLGPVVDLRDGTEREFEFPTHCPECGTELAPAKEGDADIRCPNSRSCPAQLRERLFHLAGRGAFDIEGLGYEAATA
LLAAQVIPDEGDLFTLTADDLLRTELFTTKKGELSANGKRLLANLTKAKEQPLWRVLVALSIRHVGPTAARALATEFGSL
EAIEAASEEELAAVEGVGPTIAAAVKDWFTVDWHRAIVDKWRAAGVRMADERDASIERTLEGLSIVVTGSLAGFSRDQAK
EAIIARGGKAAGSVSKKTAYVVAGDAPGSKYDKAVELGVPVLDEDGFRRLLEQGPPVEPAE
>P9WNV1 6.5.1.2~~~ligA~~~DNA ligase A~~~COG0272
MSSPDADQTAPEVLRQWQALAEEVREHQFRYYVRDAPIISDAEFDELLRRLEALEEQHPELRTPDSPTQLVGGAGFATDF
EPVDHLERMLSLDNAFTADELAAWAGRIHAEVGDAAHYLCELKIDGVALSLVYREGRLTRASTRGDGRTGEDVTLNARTI
ADVPERLTPGDDYPVPEVLEVRGEVFFRLDDFQALNASLVEEGKAPFANPRNSAAGSLRQKDPAVTARRRLRMICHGLGH
VEGFRPATLHQAYLALRAWGLPVSEHTTLATDLAGVRERIDYWGEHRHEVDHEIDGVVVKVDEVALQRRLGSTSRAPRWA
IAYKYPPEEAQTKLLDIRVNVGRTGRITPFAFMTPVKVAGSTVGQATLHNASEIKRKGVLIGDTVVIRKAGDVIPEVLGP
VVELRDGSEREFIMPTTCPECGSPLAPEKEGDADIRCPNARGCPGQLRERVFHVASRNGLDIEVLGYEAGVALLQAKVIA
DEGELFALTERDLLRTDLFRTKAGELSANGKRLLVNLDKAKAAPLWRVLVALSIRHVGPTAARALATEFGSLDAIAAAST
DQLAAVEGVGPTIAAAVTEWFAVDWHREIVDKWRAAGVRMVDERDESVPRTLAGLTIVVTGSLTGFSRDDAKEAIVARGG
KAAGSVSKKTNYVVAGDSPGSKYDKAVELGVPILDEDGFRRLLADGPASRT
>P49421 6.5.1.2~~~ligA~~~DNA ligase~~~
METHTAPQTAEARLLEATHTLLQTVRQRDLEAIDRKEAEALAARLREVLNQHAYRYYVLDNPLIPDADYDLLMQALRKLE
ARFPELVTPDSPTQRVGGPPLGRFEKVRHPEPLLSLNNAFGEEDVRVWYERCCRMLAERLGQPVQPAVTAELKIDGLAMA
LTYENGVLSVGATRGDGIEGENVTQNVRTIPAIPLRIPVDPAVGPPPTRLEVRGEVYMRKRDFERLNEQLQARGERPFAN
PRNAAAGSVRQLNPQVTALRPLSFFAYGIGPVEGAEVPDSQYEVLQWLGRLGFPVNEHARRFEHLDDVLEYCRYWTEHRD
ELDYEIDGVVLKIDHRPWQALLGAISNAPRWAVAYKFPAREAITRLLDIMVSVGRTGVVKPVAVLEPVEVGGVTVSQATL
HNEDYVRSRDIRIGDLVVVIRAGDVIPQVVRPVVEARTGNERPWRMPERCPSCGSQLVRLPGEADYYCVASDCPAQFVRL
LEHFAGRDAMDIEGMGSQVARQLAESGLVRPLSDLYRLKLEDLLKLEGFAETRARNLLRAIEASKQRPLSRLLFGLGIRH
VGKTTAELLVQRFASIDELAAATIDELAALEGVGPITAESIANWFRVEDNRRLIEELKELGVNTQRLPEEAPAAESPVRG
KTFVLTGALPHLTRKEAEELIKRAGGRVASSVSRNTDYVVVGENPGSKYDRARQLGIPMLDEDGLLRLLGMK
>Q9AIU7 6.5.1.2~~~ligA~~~DNA ligase~~~
MADLSSRVNELHDLLNQYSYEYYVEDNPSVPDSEYDKLLHELIKIEEEHPEYKTVDSPTVRVGGEAQASFNKVNHDTPML
SLGNAFNEDDLRKFDQRIREQIGNVEYMCELKIDGLAVSLKYVDGYFVQGLTRGDGTTGEDITENLKTIHAIPLKMKEPL
NVEVRGEAYMPRRSFLRLNEEKEKNDEQLFANPRNAAAGSLRQLDSKLTAKRKLSVFIYSVNDFTDFNARSQSEALDELD
KLGFTTNKNRARVNNIDGVLEYIEKWTSQRESLPYDIDGIVIKVNDLDQQDEMGFTQKSPRWAIAYKFPAEEVVTKLLDI
ELSIGRTGVVTPTAILEPVKVAGTTVSRASLHNEDLIHDRDIRIGDSVVVKKAGDIIPEVVRSIPERRPEDAVTYHMPTH
CPSCGHELVRIEGEVALRCINPKCQAQLVEGLIHFVSRQAMNIDGLGTKIIQQLYQSELIKDVADIFYLTEEDLLPLDRM
GQKKVDNLLAAIQQAKDNSLENLLFGLGIRHLGVKASQVLAEKYETIDRLLTVTEAELVEIHDIGDKVAQSVVTYLENED
IRALIQKLKDKHVNMIYKGIKTSDIEGHPEFSGKTIVLTGKLHQMTRNEASKWLASQGAKVTSSVTKNTDVVIAGEDAGS
KLTKAQSLGIEIWTEQQFVDKQNELNS
>C1CKI0 6.5.1.2~~~ligA~~~DNA ligase~~~
MNKRMNELVALLNRYATEYYTSDNPSVSDSEYDRLYRELVELETAYPEQVLADSPTHRVGGKVLDGFEKYSHQYPLYSLQ
DAFSREELDAFDARVRKEVAHPTYICELKIDGLSISLTYEKGILVAGVTRGDGSIGENITENLKRVKDIPLTLPEELDIT
VRGECYMPRASFDQVNQARQENGEPEFANPRNAAAGTLRQLDTAVVAKRNLATFLYQEASPSTRDSQEKGLKYLEQLGFV
VNPKRILAENIDEIWNFIQEVGQERENLPYDIDGVVIKVNDLASQEELGFTVKAPKWAVAYKFPAEEKEAQLLSVDWTVG
RTGVVTPTANLTPVQLAGTTVSRATLHNVDYIAEKDIRKDDTVIVYKAGDIIPAVLRVVESKRVSEEKLDIPTNCPSCNS
DLLHFEDEVALRCINPRCPAQIMEGLIHFASRDAMNITGLGPSIVEKLFAANLVKDVADIYRLQEEDFLLLEGVKEKSAA
KLYQAIQASKENSAEKLLFGLGIRHVGSKASQLLLQYFHSIENLYQADSEEVASIESLGGVIAKSLQTYFATEGSEILLR
ELKETGVNLDYKGQTVVADAALSGLTVVLTGKLERLKRSEAKSKLESLGAKVTGSVSKKTDLVVVGADAGSKLQKAQELG
IQVRDEAWLESL
>Q9ZHI0 6.5.1.2~~~ligA~~~DNA ligase~~~
MTREEARRRINELRDLIRYHNYRYYVLADPEISDAEYDRLLRELKELEERFPEFKSPDSPTEQVGARPLEPTFRPVRHPT
RMYSLDNAFTYEEVLAFEERLERALGRKRPFLYTVEHKVDGLSVNLYYEEGVLVFGATRGDGEVGEEVTQNLLTIPTIPR
RLKGVPDRLEVRGEVYMPIEAFLRLNEELEERGEKVFKNPRNAAAGSLRQKDPRVTAKRGLRATFYALGLGLEESGLKSQ
YELLLWLKEKGFPVEHGYEKALGAEGVEEVYRRFLAQRHALPFEADGVVVKLDDLALWRELGYTARAPRFALAYKFPAEE
KETRLLDVVFQVGRTGRVTPVGVLEPVFIEGSEVSRVTLHNESYIEELDIRIGDWVLVHKAGGVIPEVLRVLKERRTGEE
RPIRWPETCPECGHRLVKEGKVHRCPNPLCPAKRFEAIRHYASRKAMDIEGLGEKLIERLLEKGLVRDVADLYHLRKEDL
LGLERMGEKSAQNLLRQIEESKHRGLERLLYALGLPGVGEVLARNLARRFGTMDRLLEASLEELLEVEEVGELTARAILE
TLKDPAFRDLVRRLKEAGVSMESKEEVSDLLSGLTFVLTGELSRPREEVKALLQRLGAKVTDSVSRKTSYLVVGENPGSK
LEKARALGVAVLTEEEFWRFLKEKGAPVPA
>P49422 6.5.1.2~~~ligA~~~DNA ligase~~~
MTLEEARKRVNELRDLIRYHNYRYYVLADPEISDAEYDRLLRELKELEERFPELKSPDSPTEQVGAKPLEATFRPIRHPT
RMYSLDNAFNFDELKAFEERIGRALGREGPFAYTVEHKVDGLSVNLYYEDGVLVWGATRGDGEVGEEVTQNLLTIPTIPR
RVKGVPERLEVRGEVYMPIEAFLRLNEELEEKGEKIFKNPRNAAAGSLRQKDPRITARRGLRATFYALGLGLEESGLKTQ
LDLLHWLREKGFPVEHGFARAEGAEGVERIYQGWLKERRSLPFEADGVVVKLDELSLWRELGYTARAPRFAIAYKFPAEE
KETRLLQVVFQVGRTGRVTPVGILEPVFIEGSVVSRVTLHNESYIEELDVRIGDWVLVHKAGGVIPEVLRVLKEKRTGEE
RPIRWPETCPECGHRLVKEGKVHRCPNPLCPAKRFEAIRHYASRKAMDIGGLGEKLIEKLLEKGLVKDVADLYRLKKEDL
LGLERMGEKSAQNLLRQIEESKGRGLERLLYALGLPGVGEVLARNLAAHFGTMDRLLEASLEELLQVEEVGELTARGIYE
TLQDPAFRDLVRRLKEAGVVMEAKERGEEALKGLTFVITGELSRPREEVKALLRRLGAKVTDSVSRKTSYLVVGENPGSK
LEKARALGVPTLTEEELYRLIEERTGKPVETLAS
>P26996 6.5.1.2~~~ligA~~~DNA ligase~~~COG0272
MTLEEARKRVNELRDLIRYHNYRYYVLADPEISDAEYDRLLRELKELEERFPELKSPDSPTLQVGARPLEATFRPVRHPT
RMYSLDNAFNLDELKAFEERIERALGRKGPFAYTVEHKVDGLSVNLYYEEGVLVYGATRGDGEVGEEVTQNLLTIPTIPR
RLKGVPERLEVRGEVYMPIEAFLRLNEELEERGERIFKNPRNAAAGSLRQKDPRITAKRGLRATFYALGLGLEEVEREGV
ATQFALLHWLKEKGFPVEHGYARAVGAEGVEAVYQDWLKKRRALPFEADGVVVKLDELALWRELGYTARAPRFAIAYKFP
AEEKETRLLDVVFQVGRTGRVTPVGILEPVFLEGSEVSRVTLHNESYIEELDIRIGDWVLVHKAGGVIPEVLRVLKERRT
GEERPIRWPETCPECGHRLLKEGKVHRCPNPLCPAKRFEAIRHFASRKAMDIQGLGEKLIERLLEKGLVKDVADLYRLRK
EDLVGLERMGEKSAQNLLRQIEESKKRGLERLLYALGLPGVGEVLARNLAARFGNMDRLLEASLEELLEVEEVGELTARA
ILETLKDPAFRDLVRRLKEAGVEMEAKEKGGEALKGLTFVITGELSRPREEVKALLRRLGAKVTDSVSRKTSYLVVGENP
GSKLEKARALGVPTLTEEELYRLLEARTGKKAEELV
>O31504 2.1.1.72~~~dnmA~~~DNA methyltransferase A~~~COG1002
MALIDLEDKIAEIVNREDHSDFLYELLGVYDVPRATITRLKKGNQNLTKRVGEVHLKNKVWFKEAKKGKLFDALIDIEQQ
VEYLSAKPRYLLVTDYDGVLAKDTKTLEALDVKFEELPQYFDFFLAWKGIEKVEFEKENPADIKAAERFARIYDVLRKEN
NIIETNRGLDLFLIRLLFCFFAEDTDIFKRNSFTNLIKTLTEEDGSNLNKLFADLFIVLDKNERDDVPSYLKEFPYVNGQ
LFTEPHTELEFSAKSRKLIIECGELLNWAKINPDIFGSMIQAVASEESRSYLGMHYTSVPNIMKVIKPLFLDKLNQSFLD
AYDDYTKLENLLTRIGKIKFFDPACGSGNFLIITYKELRRMEINIIKRLQELLGEYLYVPSVTLSQFYGIEIEDFAHDVA
KLSLWIAEHQMNEELKNEVHNAVRPTLPLHTAGDIRCANAIRVEWTEVCPAQGSEEVYVFGNPPYLGSKKQNKEHKSDML
SIFGKVKNGKMLDYISAWFYFGAKYASTTNAKVAFVSTNSVTQGEQVSILWNELFKFGIQINFAYKSFKWANNAKNNAAV
IVVIVGFGPLDTKVNKYLFVDETKKLVSNISPYLTDGENILVSSRTKPISDLPKLHFGNMPNDGGGLLFTITEYTDAINK
YPELVPYFKKFIGSVEFINGGLRYCLWLNEAKYEKIKSNPLIQERISISKNHREKSTDKGTNKLALTPWKFRDTHETTNY
SIVVPSVSSENRFYIPMGLAGADTILSNLIYVIYDAEIYLLGILMSRMHMTWVKAVAGRLKTDYRYSAGLCYNTFPIPEL
STRRKNEIEEAILEILDLREEQGGTLAELYNPSTMPIELKVAHEKLDGIVERAYRQKQFESDEERLEVLLKLYQEMTER
>Q54818 2.1.1.288~~~dnrC~~~Aklanonic acid methyltransferase DnrC~~~
MQDSSYKEQVTQAFDQSSSTYDRLGVEFFTPMGRPLVEISEPVTGERVLDIGCGRGACLFPAAEKVGPQGRVHGIDIAPG
MIEEARKEAAERGLRNIALDVMDAETPELPARSFDLVMGSYSVIFLPDAVGALARYAGILDHGGRIAFTSPVFRAGTFPF
LPPEFTPLIPQALLEHLPEQWRPEALVRRFNSWLERAEDLLRTLERCGYTSVAVTDEPVRMTALSSEAWVDWSHTQGMRL
LWQNLPQAQRTELRARLVEGLDKLSDATGALAIDVPVRFVTARVAH
>Q55214 2.1.1.288~~~dauC~~~Aklanonic acid methyltransferase DauC~~~
MQDSSYKKQVTQAFDQSSSTYDRLGVEFFTPMGRRLVDISEPVTGERVLDIGCGRGACLFPAAEKVGSQGCVHGIDIAPG
MIEEARKEATERGLRNISLMVMDAETPGFPARSFDLVMGSYSVIFLPDAVGALARYADILDHGGRIAFTSPVFRAGTFPF
LPPEFTPLIPQALLEHLPEQWRPEALVRRFNSWLERAEDLVRTLEGCGYARLRQSTSRCG
>O52646 5.5.1.23~~~acma~~~Aklanonic acid methyl ester cyclase AcmA~~~
MSEQIAAVRRMVEAYNTGKTDDVADYIHPEYMNPGTLEFTSLRGPELFAINVAWVKKTFSEEARLEEVGIEERADWVRAR
LVLYGRHVGEMVGMAPTGRLFSGEQIHLLHFVDGKIHHHRDWPDYQGTYRQLGEPWPETEHRRP
>Q54808 5.5.1.23~~~dnrD~~~Aklanonic acid methyl ester cyclase DnrD~~~
MSTQIDLVRRMVEAYNTGKTDDVAEFIHLEYLNPGALEHNPELRGPEAFAAAVTWLKYAFSEEAHLEEIEYEENGPWVRA
KLALYGRHVGNLVGMPATGRRFSGEQIHLIRIVDGKIRDHRDWPDYLGTYRQLGEPWPTPEGWRP
>Q55215 5.5.1.23~~~dauD~~~Aklanonic acid methyl ester cyclase DauD~~~
MSPQIDLVRRMVEAYNTGKTDDVAEFILHEYLNPGALEHNPELRGPEAFAAAVTWLKYAFSEEAHLEEIGYEENGPWVRA
KLALYGRHVGNLVGMPATGRRFSGEQIHLIRIVDGKIRDHRDWPDYLGTYRQLGEPWPTPEGWRPCPPPPRRRHDRSTDT
P
>Q53882 1.1.1.362~~~dauE~~~Aklaviketone reductase DauE~~~
MENTQRSVIVTGGGSGIGRAVARAFAARGDRVLVVGRTAGPLAETVDGHKEAHTLAVDITDPAAPQAVVREVRERLGGVV
DVLVNNAATAVFGHLGELDRTAVEAQVATNLVAPVLLTQALLDPLETASGLVVNIGSAGALGRRAWPGNAVYGAAKAGLD
LLTRSWAVELGPRGIRVIGVAPGVIETGAGVRAGMSQEAYDGFLEAMGQRVPLGRVGRPEDVAWWVVRLADPEAAYASGA
VLAVDGGLSVT
>P72495 1.14.13.180~~~dnrF~~~Aklavinone 12-hydroxylase DnrF~~~
MALTKPDVDVLVVGGGLGGLSTALFLARRGARVLLVERHASTSVLPKAAGQNPRTMELFRFGGVADEILATDDIRGAQGD
FTIKVVERVGGRVPAQLRESFEELVGATEQCTPMPWALAPQDRVEPVLVAHAAKHGAEIRFATELTSFQAGDDGVTARLR
DLGTGAESTVSARYLVAADGPRSAIRESLGITRHGHGTLAHFMGVIFEADLTAVVPPGSTGWYYLQHPDFTGTFGPTDRP
NRHTFYVRYDPERGERPEDYTPQRCTELIRLAVDAPGLVPDILDIQAWDMAAYIADRWREGPVLLVGDAAKVTPPTGGMG
GNTAIGHGFDVAWKLAAVLRGEAGERLLDSYGADGSLVSRLVVDESLAIYAQRMAPHLLGSVPEERGTAQVVLGFRYRST
AVAAEDDDPEPTEDPRRPSGRPGFRAPHVWIEQDGTRRSTVELFGDCWVLLAAPEGGAWGQAAARAARIWASASTSISSA
AMSPPPPAN
>Q54530 1.14.13.180~~~rdmE~~~Aklavinone 12-hydroxylase RdmE~~~
MNDHEVDVLVVGAGLGGLSTAMFLARQGVRVLVVERRPGLSPYPRAAGQNPRTMELLRIGGVADEVVRADDIRGTQGDFV
IRLAESVRGEILRTVSESFDDMVAATEPCTPAGWAMLSQDKLEPILLAQARKHGGAIRFGTRLLSFRQHDDDAGAGVTAR
LAGPDGEYDLRAGYLVGADGNRSLVRESLGIGRYGHGTLTHMVGVIFDADLSGIMEPGTTGWYYLHHPEFKGTFGPTDRP
DRHTLFVEYDPDEGERPEDFTPQRCVELIGLALDAPEVKPELVDIQGWEMAARIAERWREGRVFLAGDAAKVTPPTGGMS
GNAAVADGFDLAWKLAAVLQGQAGAGLLDTYEDERKVAAELVVAEALAIYAQRMAPHMAEVWDKSVGYPETLLGFRYRSS
AVLATDDDPARVENPLTPSGRPGFRGPHVLVSRHGERLSTVDLFGDGWTLLAGELGADWVAAAEAVSAELGVPVRAYRVG
AGLTDPESAVSERYGIGKAGASLVRPDGIVAWRTDEAAADAAQTLEGVLRRVLDR
>P32009 1.14.13.180~~~dnrF~~~Aklavinone 12-hydroxylase DnrF~~~
MALTKPDVDVLVVGGGLGGLSTALFLARRGARVLLVERHASTSVLPKAAGQNPRTMELFRFGGVADEILATDDIRGAQGD
FTIKVVERVGGRVLHSFAESFEELVGATEQCTPMPWALAPQDRVEPVLVAHAAKHGAEIRFATELTSFQAGDDGVTARLR
DLGTGAESTVSARYLVAADGPRSAIRESLGITRHGHGTLAHFMGVIFEADLTAVVPPGSTGWYYLQHPDFTGTFGPTDRP
NRHTFYVATTPERGERPEDYTPQRCTELIRLAVDAPGLVPDILDIQAWDMAAYIADRWREGPVLLVGDAAKVTPPTGGMG
GNTAIGDGFDVAWKLAAVLRGEAGERLLDSYGAERSLVSRLVVDESLAIYAQRMAPHLLGSVPEERGTAQVVLGFRYRST
AVAAEDDDPEPTEDPRRPSGRPGFRAPHVWIEQDGTRRSTVELFGDCWVLLAAPEGGAWPGRPPAPPRIWASASTSISSA
AMSPPPPAN
>Q06528 2.1.1.292~~~dnrK~~~Carminomycin 4-O-methyltransferase DnrK~~~
MTAEPTVAARPQQIDALRTLIRLGSLHTPMVVRTAATLRLVDHILAGARTVKALAARTDTRPEALLRLIRHLVAIGLLEE
DAPGEFVPTEVGELLADDHPAAQRAWHDLTQAVARADISFTRLPDAIRTGRPTYESIYGKPFYEDLAGRPDLRASFDSLL
ACDQDVAFDAPAAAYDWTNVRHVLDVGGGKGGFAAAIARRAPHVSATVLEMAGTVDTARSYLKDEGLSDRVDVVEGDFFE
PLPRKADAIILSFVLLNWPDHDAVRILTRCAEALEPGGRILIHERDDLHENSFNEQFTELDLRMLVFLGGALRTREKWDG
LAASAGLVVEEVRQLPSPTIPYDLSLLVLAPAATGA
>Q55216 2.1.1.292~~~dauK~~~Carminomycin 4-O-methyltransferase DauK~~~
MTAEPTVAARPQQIDALRTLIRLGSLHTPMVVRTAATLRLVDHILAGARTVKALAARTDTRPEALLRLIRHLVAIGLLEE
DAPGEFAPTEVGKLLADDHPAAQRAWHDLTQAVARADISFTRLPEAIRSGRPTYESVYGKPFYEDLAGRPDLRASFDSLL
ACDQDVAFDAPAAAHDWTNVRHVLDVGGGKGGFAAAIARRAPHVSATVLEMAGTVDTARSYLRDAGLSDRVDVVEGDFFE
PLPRRADAIILSFVLLNWPDHDAVRILTRCAEALEPGGRILIHERDDLHENSFNEQFTELDLRMLVFLGGALRTREKWDG
LAASAGLVVEEVRQLPSPTIPYDLSLLVLAPASTGA
>D6JLP0 ~~~dnrN~~~Iron-sulfur cluster repair protein DnrN~~~
MTDFSVWEAAPFGATVDHILQRYHNVHRAQFEELVPLAQKVAQVHADTFPAEIAGLLADMRDELLMHMMKEERMLFPMIN
QGVGRGAAMPISVMMHEHEEHDRAIARLKELTGNFHAPEGACGSWTRLYALAKEMADDLNDHIHLENDILFARVLDS
>Q55217 3.1.1.-~~~dauP~~~Rhodomycin D methylesterase DauP~~~
MPTRMITKDEVTLWSEGIGDPADAPLLLIAGGNLSARSWPDEFVERLAAAGHFVIRYDHRDTGRSSRYDFALHPYGFDEL
ATDALAVLDAWQVRAAHVVGMSLGNTIGQLLALDAPERLLTLTVMLGGALDVDFDADLEAALKGEPSVSGLPVPSRRFLD
MMMLLQQPAGTDEELLERRVEKWRLLNGEGVPFDSDEFRRRELLAAGHAGTFDEPIVHHMIPQPPVSRGAELARITTPVL
AIQAMCDPAAPPPHARHLADRIPGARVVEIENMGHALPLAVHEPLAAAICAHTRAATV
>Q2T8B0 3.1.-.-~~~~~~Putative deoxyribonuclease-2~~~
MAISPRDEQNRSVDLWFAYKVPKLTKDADSDSASGYEYVYYDRQVGAVQKSPNLMNDPKGALFYTLDSVFGDPGDTTGWI
LYNDEMPADANRSNNATLGHTKGVIAFDIASSSALWLLHSWPKYASPSVPGVPTPLYGQTFLCLSLDLATAGKLAAQMAL
HQQPQVYLPRTGGLDHTSPLYALTQPLNASAPGDSDSLDFKTRGGVPFKVIAKNRKWGKDFWNDLVGPTLKADMYVETWI
RGKIPPVLDSDGVHKTYDIKFIDLRKLGAPWAWPETQDHAKWGITTTDNWVCVGDINRMVTQEKRGGGTIAFQDPKLWKA
LCETDLIIPPPGKTDAQARAMIRKTHEPAE
>Q45692 1.18.1.3~~~dntAa~~~2,4-dinitrotoluene dioxygenase system ferredoxin--NAD(+), reductase component~~~
MELVVEPLNLHLNAETGSTLLDVLRSNEVPISYSCMSGRCGTCRCRVIAGHLRDNGPETGRPQAGKGAYVLACQAVLTED
CTIEIPESDEIVVHPARIVKGTVTAIDEATHDIRRLRIKLAKPLEFSPGQYATVQFTPECVRPYSMAGLPSDAEMEFQIR
AVPGGHVSNYVFNELSVGASVRISGPLGTAYLRRTHTGPMLCVGGGTGLAPVLSIVRGALESGMSYPIHLYFGVRSEQDI
YDEERLHALAARFPNLKVNVVVATGPAGPGHRSGLVTDLIGRDLPNLAGWRLHPVWRSGHGRGPEPARCSPRHSTRAHPC
RCVLSQRRLSEGTMRTQFNPRIPSHE
>Q45694 ~~~dntAb~~~2,4-dinitrotoluene dioxygenase system, ferredoxin component~~~
MSENWIDAAARDEVPRGRRDRHQYRRQGDCLYEVAGEIYATDNTCTHGAARMSDGFLEGREIECPLHQGRFDVCTGKALC
TPLTQDIKTYPVKIENMRVMLKLD
>Q45695 1.14.12.24~~~dntAc~~~2,4-dinitrotoluene dioxygenase system, large oxygenase component~~~
MRQAIMSYQNLVSEAGLTQKHLIYGDKELFQHELKTIFARNWLFLTHDSLIPSPGDYVKAKMGVDEVIVSRQNDGSVRAF
LNVCRHRGKTIVDAEAGNAKGFVCGYHGWGYGSNGELQSVPFEKELYGDAIKKKCLGLKEVPRIESFHGFIYGCFDAEAP
PLIDYLGDVAWYLEPTFKHSGGLELVGPPAKVVVKGNWKVFAENFVGDIYHIGWTHASILRAGQAIFAPLAGNAMLPPEG
TGLQATTKYGSGIGVSLDAYSGVQSADLVPEMMAFGGAKQEKLAKEIGDVRARIYRSQVNGTVFPNNCFLTGAGVFKVFN
PIDENTTEAWTYAIVEKDMPEDLKRRLADAAQRSTGPAGYWESDDNDNMVLSQNAKKYQSSNSDLIADLGFGKDVYGDEC
YPGVVSKSAFSETNHRGFYRAYQAHISSSNWAEFENTSRNWHTELTKTTDR
>Q45696 ~~~dntAd~~~2,4-dinitrotoluene dioxygenase system, small oxygenase component~~~
MMINTQEDKLVSAHDAEEFHRFFVGHDSDLQQEVTTLLTREADLLDIQAYKAWLEHCVAPEIKYQVISRELRSTSERRYQ
LNDAVNIYNENYQQLKVRVEHQMDPQNWYNSPKIRFTRFVTNVTAAKDKSAPEMLHVRSNLILHRARRGNQVDVFYATRE
DKWKRIEGGGIKLVERFVDYPERSPQTHNLMIFL
>Q2PWU9 1.14.13.210~~~dntB~~~4-methyl-5-nitrocatechol 5-monooxygenase~~~
MTRPLETPPDIEVPVLIVGGSMVGLSTALFLSHYGIQAMAVERHERTAIHPRAGHFHLRTLELLRSVGLEEVVARTSAEA
FFPNGGINAVQSLAGGETASFISNLNAGVEEFSPTRRLFIAQQALEPILRSRAEELGADLRYSTEVVSVVDDGEGVTTVI
RDKASGQERTVRSRYLVASDGWRSQRRAQLGIETRGQGLLSRSATIYFRADCRELLAGTHLGVIYVLNERLRGFFRFEKS
LQSGFLGVATLGDPTRPGALDVSAGFTTDTAVELVRAAIGVPDIDVEIQDVAHWEATAALADRYRGGRIFLAGDAAHVVP
PYGGFGGNTGVQDAHNLASKLALVLDGTAGEALLDTYEAERRPVGALTVDQAFSRYIRRLAPEFLDEQTPELVDDFSMEL
GYRYHSPAVLTEDDDKAVDQAVVGHPREALGRPGSRAPHVALRVDDHDRSVLDLLGRDFVVLAGPAGQVWAEAAERASKE
LGLPLSAYVVGSDTPVADVEGRFADAYGLSDAGVALVRPDGFIAWRSRDLAEDPEAALTDALRAVLCR
>A0QRY0 ~~~doc~~~Toxin Doc~~~COG3654
MTEYLDREDVLTAGSIAFGGELKVRDYGLLDAAVARPQATVYGVDAYPRLWDKAAALLQSLARNHALVDGNKRTAWAAAW
TFLHINGVQLAADFDVDRAEDLMNEVATRDCDLDSIAAELAGFAAAAQTG
>A1WUH0 ~~~~~~Dodecin~~~COG3360
MSDHVYKIVELTGSSPNGIEEAVNNAIARAGETLRHLRWFEVVDTRGHIEGGRVNHWQVTVKVGFTLEGG
>Q6MX43 ~~~secE2~~~Calcium dodecin~~~COG3360
MSVYKVIDIIGTSPTSWEQAAAEAVQRARDSVDDIRVARVIEQDMAVDSAGKITYRIKLEVSFKMRPAQPR
>Q5SIE3 ~~~~~~Dodecin~~~COG3360
MGKVYKKVELVGTSEEGLEAAIQAALARARKTLRHLDWFEVKEIRGTIGEAGVKEYQVVLEVGFRLEET
>E1V7W1 3.5.4.44~~~doeA~~~Ectoine hydrolase~~~COG0006
MIQVSLPFTREEYAGRLWKVRTEMASRGIDVLVISDPSNMAWLTGYDGWSFYVHQCVLLGLEGEPVWYGRRMDANGALRT
CWMDPDNITYYPDHYVQNPDMHPMDYLAQTILPDRGWHEGVVGMEMDNYYFSAKAYQCLLRELPHARFADANSLVNWCRA
IKSPQEIEYMRVAGKIVAGMHSRILEVIEPGLPKSKLVSEIYRVGIEGWTSPEGKVFGGDYPAIVPMLPTGKDAAAPHLT
WDDSPFREGEGTFFEIAGVYKRYHAPMSRTVYLGRPPSEFVRAESALLEGIENGLEVAKPGNRTADIAMALGAAMDKYGF
DRGGARCGYPIGISYPPDWGERTMSLRPSDETILEPGMTFHFMPGLWVEDWGLEITESILITESGCETLADFPRQLFVK
>E1V7W0 3.5.1.125~~~doeB~~~N-alpha-acetyl-L-2,4-diaminobutyric acid deacetylase~~~COG3608
MSKQPGQQRPSPISATVDFEADGVQHGFLKLPISNDESAWGAVMIPVTVVKRGEGPTALLTGGNHGDEYEGITALQKLSS
RLRAEDVQGRVIIVPMMNTPACTAGRRTSPMDGGNLNRSFPGDPDGSVTEKIADYFTRVLVPMSDVVLDLHSGGRTLDII
PFAASHVLDDAEQQRRALEGAKAFGAPYAIMMFELDAEALFDTAVERQGKIFVATELGGGGTSTPESLAITERGIDNFLV
HYGLVEGELQVPDEPQIYLDMPDASCYVQSEHTGLLELTVALGDPVTQGQVIARVYDMTRSGVAPVEYRAERDGVLAARR
FPASVNMGDTIAVIAEVVESLG
>Q53U21 1.1.1.329~~~neoA~~~2-deoxy-scyllo-inosamine dehydrogenase~~~
MKALVFEAPERAVLTHRDIPAPAPGEALVRVAYNSVCGSDLSFYKGVWHGFTYPVVPGHEWSGSVVDVNGPRGADLVGRN
VVGDLTCSCGTCAHCAAGTPTLCEDLGELGFTRDGACAEYMTVPVANLRPLPDTLPLRTACQVEPLAVALNAVDRLGVTP
GEKVAVMGAGGIGLLLVQAVRLRGGTVTAVAEPVPERRAAALALGVPAAVGGDPGALVELTRSDPAAVPDVVLEASGYPT
AVQEAVEAVRPGGRVGLVGYRIEEAAVMAPHHIVLKVLTVRASMGPGTRFEEAVDVLASGAVDVDALLSHEFALDDYAKA
LDVALRRADGNTRSYFNLRA
>Q70KD0 4.2.3.124~~~gtmA~~~2-deoxy-scyllo-inosose synthase~~~
MEVEIRLGSVRYPFRLGTDCLGAIVEDLVAMSASRLLIVCDSNTGPLFGAELVERLSPRVPANLLIHRAGEPYKDLQAVG
TLADSALQLGADRASVVVAVGGGVIGNIAGLMAALLFRGIRLVHIPTSLIAMSDSVLSLKQAVNACVGKNLMGTFYAPES
VLADTAMLRSLPFRETVSGLCEVVKNSLAIRPSMVEMLRTSLRQDAVYDDETMYEIISESILAKASVTVDDMHECRAGLV
LEYGHTVGHAIEYTAAGGLSHGQAIGLGMVVAAEVSHRLGHLDQEAVALHRELLTRAGAMVTIPEEVDLDEVMHRLRFDN
KRGYLADPAESSAMVLLGGLGEPLWHDGRPLVSVPMALVGEVVNEIARPEIPNFELVAPVETVEEGRVPDTVGAADG
>Q9S5E2 4.2.3.124~~~btrC~~~2-deoxy-scyllo-inosose synthase~~~
MTTKQICFADRCFNFAFGEHVLESVESYIPRDEFDQYIMISDSGVPDSIVHYAAEYFGKLAPVHILRFQGGEEYKTLSTV
TNLQERAIALGANRRTAIVAVGGGLTGNVAGVAAGMMFRGIALIHVPTTFLAASDSVLSIKQAVNLTSGKNLVGFYYPPR
FVFADTRILSESPPRQVKAGMCELVKNMLILENDNKEFTEDDLNSANVYSPKQLETFINFCISAKMSVLSEDIYEKKKGL
IFEYGHTIGHAIELAEQGGITHGEAIAVGMIYAAKIANRMNLMPEHDVSAHYWLLNKIGALQDIPLKSDPDSIFHYLIHD
NKRGYIKLDEDNLGMILLSGVGKPAMYNQTLLTPVRKTLIKEVIREGL
>Q6L738 4.2.3.124~~~kanC~~~2-deoxy-scyllo-inosose synthase~~~
MQVTTITMDDVQYPYRLGTDCLDGIVTRLGELGASRYLIVSDPRVAELYGQGLRERLAEQAGPAELITHASGEQNKGLPA
LHDLAEEALRRGADRQSIVVALGGGVTGNIAGLLAALLFRGIRLVHVPTTVVAMLDSVLSLKQAVNAGVGKNLVGTFYQP
VEVLADTAMLRTLPVREVRSGMCEVVKNSLAIRPSMIDQLSAGLRPDGRYPDDTMHWIIYESLAAKAQVTAYDKYERGEG
LILEYGHTVGHAVEHSSQGAVPHGAAVALGMIAAAQVSHRAGWASAELVDLHRELVAKTGVARRIPSDIPLSAVRHRLSF
DNKRGYLPASADTYPMVLLESPGKVLRSEGTVLTAAPRDLVDAVVDELAEPPRPAAARTDDAATVLGGAG
>Q8ZNC4 4.1.1.116~~~dokD~~~D-ornithine/D-lysine decarboxylase~~~
MTDSIMQNYNQLREQVINGDRRFQHKDGHLCFEGVDLDALARQYPTPFYVFSEPEIIRNIHEIQQAFAAHKNTKTFFASK
TCSVMGVLKAIRDAGICAEANSQYEVRKCLEIGFRGDQIVFNGVVKKPADLEYAIANDLYLINVDSLYELEHIDAISRKL
KKVANVCVRVEPNVPSATHAELVTAFHAKSGLDLEQAEETCRRILAMPYVHLRGLHMHVGDQVPESEPFAKATKVLVDES
RRLEEVLGIKFDLINVGGGIPVPYKYDDENGDPLKDNMYAGITAQDFADAVIREVHKWRTDVEICIEPGRKVTGSAAVLL
TEVSCEKRKTNYDLNGNVECHVEWKFVDAGYSVLSDSQHFDWFFYVYNASRMTAAHDAWIKLAGPLCDGGDYFHMGVKGE
EFLLPKETHVGDIVAFLDAGAYTIESQTVYNNRPRTGVVMIDKNGDTRLIRREDSYEDMVKYDIY
>P64596 ~~~dolP~~~Outer membrane lipoprotein DolP~~~COG2823
MKALSPIAVLISALLLQGCVAAAVVGTAAVGTKAATDPRSVGTQVDDGTLEVRVNSALSKDEQIKKEARINVTAYQGKVL
LVGQSPNAELSARAKQIAMGVDGANEVYNEIRQGQPIGLGEASNDTWITTKVRSQLLTSDLVKSSNVKVTTENGEVFLMG
LVTEREAKAAADIASRVSGVKRVTTAFTFIK
>Q7CPQ6 ~~~dolP~~~Outer membrane lipoprotein DolP~~~
MKAFSPLAVLISALLLQGCVAAAVVGTAAVGTKAATDPRSVGTQVDDGTLELRVSSALSKDEQIKKETRINVTAYQGKVL
LVGQSPNSELSARAKQIAMGVEGTTEVYNEIRQGQPIGLGTASNDTWITTKVRSQLLTSDQVKSSNVKVTTENGEVFLLG
LVTEREGKAAADIASRVSGVKRVTTAFTYIK
>A0LU48 3.4.-.-~~~dop~~~Depupylase~~~COG4122
MHRVMGIETEYGISVPHQPNANAMAASSQVVNAYAPIGAPAQRQARWDFEEENPLRDARGFEVAREAADPSQLTDEDLGL
ANVILTNGARLYVDHAHPEYSTPEVTNPRDAVLWDKAGERIMAEAARRAADLPMGWTIQLYKNNTDNKGASYGCHENYLM
NRSTPFADIVRHLIPFFVTRQVFCGAGRVGIGADGRGEGFQLSQRADFFEVEVGLETTLKRPIINTRDEPHADPEKYRRL
HVIIGDANMSEIATYLKLGTTALVLAMIEDGFLSQDFSVESPVGALRAVSHDPTLRYQLRLHDGRRLTAVQLQMEYLEQA
RKYVEDRFGTDVDDMTRDVLDRWETTLVRLADDPMQLSRDLDWVAKLSILEGYRQRENLPWSAHKLQLVDLQYHDVRPDR
GLYNRLVARGRMNLLVDEAAVRTAMHEPPNDTRAYFRGRCLAKFGAEIAAASWDSVIFDLPGRDSLQRVPTLEPLRGTRA
HVGDLLDRCRSATELVAALTGGR
>A0QZ49 3.4.-.-~~~dop~~~Pup deamidase/depupylase~~~COG4122
MQRIIGTEVEYGISSPSDPTANPILTSTQAVLAYAAAAGIQRAKRTRWDYEVESPLRDARGFDLSRSSGPPPIVDADEVG
AANMILTNGARLYVDHAHPEYSAPECTDPMDAVIWDKAGERVMEAAARHVASVPGAAKLQLYKNNVDGKGASYGSHENYL
MSRQTPFSAVIAGLTPFMVSRQVVTGSGRVGIGPSGDEPGFQLSQRADYIEVEVGLETTLKRGIINTRDEPHADADKYRR
LHVIIGDANLAETSTYLKLGTTSLVLDLIEEGVDLSDLALARPVHAVHVISRDPSLRATVALADGRELTALALQRIYLDR
VAKLVDSRDPDPRASHVIETWANVLDLLERDPMECAEILDWPAKLRLLEGFRQRENLTWQAPRLHLVDLQYSDVRLDKGL
YNRLVARGSMKRLVTEQQVLDAVENPPTDTRAYFRGECLRRFGADIAAASWDSVIFDLGGDSLVRIPTLEPLRGSKAHVG
ALLDSVDSAVELVEQLTN
>P9WNU9 3.4.-.-~~~dop~~~Pup deamidase/depupylase~~~COG4122
MQRIIGTEVEYGISSPSDPTANPILTSTQAVLAYAAAAGIQRAKRTRWDYEVESPLRDARGFDLSRSAGPPPVVDADEVG
AANMILTNGARLYVDHAHPEYSAPECTDPLDAVIWDKAGERVMEAAARHVASVPGAAKLQLYKNNVDGKGASYGSHENYL
MSRQTPFSAIITGLTPFLVSRQVVTGSGRVGIGPSGDEPGFQLSQRSDYIEVEVGLETTLKRGIINTRDEPHADADRYRR
LHVIIGDANLAETSTYLKLGTTALVLDLIEEGPAHAIDLTDLALARPVHAVHAISRDPSLRATVALADGRELTGLALQRI
YLDRVAKLVDSRDPDPRAADIVETWAHVLDQLERDPMDCAELLDWPAKLRLLDGFRQRENLSWSAPRLHLVDLQYSDVRL
DKGLYNRLVARGSMKRLVTEHQVLSAVENPPTDTRAYFRGECLRRFGADIAAASWDSVIFDLGGDSLVRIPTLEPLRGSK
AHVGALLDSVDSAVELVEQLTAEPR
>P0AA89 2.7.7.65~~~dosC~~~Diguanylate cyclase DosC~~~COG3706
MEMYFKRMKDEWTGLVEQADPPIRAKAAEIAVAHAHYLSIEFYRIVRIDPHAEEFLSNEQVERQLKSAMERWIINVLSAQ
VDDVERLIQIQHTVAEVHARIGIPVEIVEMGFRVLKKILYPVIFSSDYSAAEKLQVYHFSINSIDIAMEVMTRAFTFSDS
SASKEDENYRIFSLLENAEEEKERQIASILSWEIDIIYKILLDSDLGSSLPLSQADFGLWFNHKGRHYFSGIAEVGHISR
LIQDFDGIFNQTMRNTRNLNNRSLRVKFLLQIRNTVSQIITLLRELFEEVSRHEVGMDVLTKLLNRRFLPTIFKREIAHA
NRTGTPLSVLIIDVDKFKEINDTWGHNTGDEILRKVSQAFYDNVRSSDYVFRYGGDEFIIVLTEASENETLRTAERIRSR
VEKTKLKAANGEDIALSLSIGAAMFNGHPDYERLIQIADEALYIAKRRGRNRVELWKASL
>P76129 3.1.4.52~~~dosP~~~Oxygen sensor protein DosP~~~COG2199
MKLTDADNAADGIFFPALEQNMMGAVLINENDEVMFFNPAAEKLWGYKREEVIGNNIDMLIPRDLRPAHPEYIRHNREGG
KARVEGMSRELQLEKKDGSKIWTRFALSKVSAEGKVYYLALVRDASVEMAQKEQTRQLIIAVDHLDRPVIVLDPERHIVQ
CNRAFTEMFGYCISEASGMQPDTLLNIPEFPADNRIRLQQLLWKTARDQDEFLLLTRTGEKIWIKASISPVYDVLAHLQN
LVMTFSDITEERQIRQLEGNILAAMCSSPPFHEMGEIICRNIESVLNESHVSLFALRNGMPIHWASSSHGAEIQNAQSWS
ATIRQRDGAPAGILQIKTSSGAETSAFIERVADISQHMAALALEQEKSRQHIEQLIQFDPMTGLPNRNNLHNYLDDLVDK
AVSPVVYLIGVDHIQDVIDSLGYAWADQALLEVVNRFREKLKPDQYLCRIEGTQFVLVSLENDVSNITQIADELRNVVSK
PIMIDDKPFPLTLSIGISYDLGKNRDYLLSTAHNAMDYIRKNGGNGWQFFSPAMNEMVKERLVLGAALKEAISNNQLKLV
YQPQIFAETGELYGIEALARWHDPLHGHVPPSRFIPLAEEIGEIENIGRWVIAEACRQLAEWRSQNIHIPALSVNLSALH
FRSNQLPNQVSDAMHAWGIDGHQLTVEITESMMMEHDTEIFKRIQILRDMGVGLSVDDFGTGFSGLSRLVSLPVTEIKID
KSFVDRCLTEKRILALLEAITSIGQSLNLTVVAEGVETKEQFEMLRKIHCRVIQGYFFSRPLPAEEIPGWMSSVLPLKI
>P9WGK1 2.7.13.3~~~dosT~~~Oxygen sensor histidine kinase response regulator DosT~~~COG2203
MTHPDRANVNPGSPPLRETLSQLRLRELLLEVQDRIEQIVEGRDRLDGLIDAILAITSGLKLDATLRAIVHTAAELVDAR
YGALGVRGYDHRLVEFVYEGIDEETRHLIGSLPEGRGVLGALIEEPKPIRLDDISRHPASVGFPLHHPPMRTFLGVPVRI
RDEVFGNLYLTEKADGQPFSDDDEVLVQALAAAAGIAVDNARLFEESRTREAWIEATRDIGTQMLAGADPAMVFRLIAEE
ALTLMAGAATLVAVPLDDEAPACEVDDLVIVEVAGEISPAVKQMTVAVSGTSIGGVFHDRTPRRFDRLDLAVDGPVEPGP
ALVLPLRAADTVAGVLVALRSADEQPFSDKQLDMMAAFADQAALAWRLATAQRQMREVEILTDRDRIARDLHDHVIQRLF
AVGLTLQGAAPRARVPAVRESIYSSIDDLQEIIQEIRSAIFDLHAGPSRATGLRHRLDKVIDQLAIPALHTTVQYTGPLS
VVDTVLANHAEAVLREAVSNAVRHANATSLAINVSVEDDVRVEVVDDGVGISGDITESGLRNLRQRADDAGGEFTVENMP
TGGTLLRWSAPLR
>Q5ZYC6 ~~~dotL~~~Type 4 coupling protein DotL~~~COG3505
MMRGIDSRHELDPTLLLRDTRTFTQRLADFFADPTNISIVLISLAAVSYYFSEAATFLLIMGGIFFLYSYTRKQKLPFRL
PQISRAKDYNDLKPGINKPNIARGITFFGNDRKTGEELWFANDDMRTHALIFGSTGSGKTETLVSLSYNALVQGSGFIYV
DGKGDNSLYAKVFSMVRSMGREDDLLLINFMTGARDIVGPQEKRLSNTLNPFCQGSSSMLTQLVVSLMGSSGQSSDGDMW
KGRAIAFVEALMRLLVYMRDEGAILLDANTIRNYFDLQRLESIVIDKVFPRDDQESVNIETIPKLVTDPLRNYLNTLPGY
NKEKKGKQVSQVLEQHGFITMQLVRSFSSLADTYGHIIRTNLAEVDFKDVVLNRRILVVLLPALEKSPDELSNLGKIIVS
SLKAMMAAGLGEEVEGDYRDVILRKPTNAPTPYMCILDEYGYYAVQGFAVVPAQARSLGFSAIFAGQDLPAFQKASKEEA
ASIGANTNIKICMKLEDPTETWDFFTKTAGEAYVTKVDSFQTKETSIANSYMDTKSSSFEKRARVDLLDLKEQTEGEAHI
FFKSKIVRARMFYANPKPVKQLKINQFLKVEPPPDDYLMKLQKQLASFQSILESGDLSINKAVENEEITLISKALKESTI
VEPIERGVAALIAFHGQNEPEPVEDIVEEEVEGALTIFSKLRIDPNAPPILVADKEVFSEPLLPINETRNQMITIERLAG
AKDKYAGTVANELIKDFQIATSYPPEERDVIDVQELTGIIRDLSAKISAEREKANKKAAEELT
>Q5ZYC7 ~~~dotM~~~Type 4 apparatus protein DotM~~~
MYIEMAQQQQQSGSDNSMAPVWIVILLFITAYFVWALAHQYIVSFVFTINIWQARLVNLFLNNQLLANQIYLMQTLDPNT
VNWDQMVTVMRAVGDYMRYPVICILVVLAFVLYNSNVTLKYRKTYDMKSLRAQEQFNWPAIMPIVKEDLVSQDVNKGPWA
MALTPMEFARKYNLLRKDDALLDNPVPGEEMTAGIRRGDAKRVFTMQLGPYWDGFERCSPQAYALSAVFMARMNRDRDAA
NNILKVLDKTFVDGKPDFSVARPVMKKYQNSELVQEVVAKHAYVLTVIASLLEAAREDGVVPSSEFLWLKPVDRRLWYML
NCVGRQTPYSEVAGPFAHWKAEKEMGRRSLVPMIDEAIRALEIAVKEVRLTPRQMEELEP
>Q5ZYB7 ~~~dotN~~~Type 4 apparatus protein DotN~~~COG1403
MNVNQTMADNQQRCELKLIASPGSWRLYSARKIDERFKSYEQKIFQRDRYTCQFCGFQARLYQDIVNLDGDYTNNRLSNL
VTACCFCAQCFFVESVGVGGYGGGTLIYLPELTQAELNSLCHVLFCAITNDTGYKSSAQNIYRSFKFRSQIVEEKFGEGT
SDPAIFGQLMIDSGVNSEEIREKLFKNIRLLPSRAKFRKQIEKWAASALEEIAD
>Q5ZYR7 ~~~dotY~~~Type 4 apparatus protein DotY~~~
MPKYTLPTRDALLKAMQVGETSIEAAEYMATRFEQILTKAKLLPECNDMLEKIKEYAQFVKFKLLSSAQVWSGQERPTSD
YQNTQENKAEFLASHLEGLPSGLKLEVAIGDDAKILRGFSSNGKMVEGDQLKTMDGLLEGWLAKNSLAISGGAVVKIDNT
GNQTKVDPQEIRQLINDSEKGVAKYFADKGVGMEVAQRTYQEPKALETKREEIRQEIESGAEAPTTQSIR
>Q5ZV91 ~~~dotZ~~~Type 4 apparatus protein DotZ~~~
MDEIKKDDELSQWLSTYGTITAERILGRYNISLPQDEILEAINIPSSFYRHLLQIPLKNVLNGIVIQQASDYHVYAQKLL
IDYLLSGESSKEPDSQGAGTRESLEDERQRLVQLGDEFHKLELEQDNLIASSQASLMKISIDWNTKLETTLSKLNSLYKN
TNSKIKKNAIRKALIKAFIHCDLVKDQSQKNKYQLIDKLNQTLAVSVGAELKESILTNLSELFQILEALNTKLDEFTDRT
NHLSQQAKSFRTQFYEVILRIIELIKLLPEYKIDPAQDAINREPLYFDRTIGER
>Q93MI2 1.14.13.181~~~doxA~~~Cytochrome P-450 monooxygenase DoxA~~~
MSGEAPRVAVDPFACPMMTMQRKPEVHDAFREAGPVVEVNAPAGGPAWFITDDALSRYVLADPRLVKDPDLAPAAWRGVV
DGLDIPVPELRPFTLIAVDGEAHRRLHRIHAPAFNPRRLAERTDRIAAIAGRLLTELADASGRSGEPAELIGGFAYHFPL
LVICELLGVPVTVPMAREAVSVLKALASAAQSGGGDGTDPAGGVPDTSALESLLLEAVHSARRNDTPTMTRVLYEHTQAE
FGSVSDNQLVYMITGIIFAGHERTGSFLGFLLAEVLAGRLAADADEDAVSRFVEEAVRYHPPVPYTLWRFAATEVTIGGV
RLPPGAPVLVDIEGTNTDGRHHDAPHAFHPDRPSWRRLTFGDGPHYCIGEQLAQLESRTMIGVLRSRFPEARLAVPYDEL
RWCRNGAQTARLTELPVWLR
>Q9ZAU3 1.14.13.181~~~doxA~~~Cytochrome P-450 monooxygenase DoxA~~~
MAVDPFACPMMTMQRKPEVHDAFREAGPVVEVNAPAGGPAWVITDDALAREVLADPRFVKDPDLAPAAWRGVDDGLDIPV
PELRPFTLIAVDGEAHRRLRRIHAPAFNPRRLAERTDRIAAIAGRLLTELADASGRSGKPAELIGGFAYHFPLLVICELL
GVPVTDPAMAREAVSVLKALGLGGPQSGGGDGTDPAGGVPDTSALESLLLEAVHSARRNDTPTMTRVLYERAQAEFGSVS
DDQLVYMITGLIFAGHDTTGSFLGFLLAEVLAGRLAADADEDAVSRFVEEALRYHPPVPYTLWRFAATEVTIGGVRLPRG
APVLVDIEGTNTDGRHHDAPHAFHPDRPSWRRLTFGDGPHYCIGEQLAQLESRTMIGVLRSRFPEARLAVPYDELRWCRK
GAQTARLTELPVWLR
>Q59971 1.14.13.181~~~doxA~~~Cytochrome P-450 monooxygenase DoxA~~~
MSGEAPRVAVDPFSCPMMTMQRKPEVHDAFREAGPVVEVNAPAGGPAWVITDDALAREVLADPRFVKDPDLAPTAWRGVD
DGLDIPVPELRPFTLIAVDGEDHRRLRRIHAPAFNPRRLAERTDRIAAIADRLLTELADSSDRSGEPAELIGGFAYHFPL
LVICELLGVPVTDPAMAREAVGVLKALGLGGPQSAGGDGTDPAGDVPDTSALESLLLEAVHAARRKDTRTMTRVLYERAQ
AEFGSVSDDQLVYMITGLIFAGHDTTGSFLGFLLAEVLAGRLAADADGDAISRFVEEALRHHPPVPYTLWRFAATEVVIR
GVRLPRGAPVLVDIEGTNTDGRHHDAPHAFHPDRPSRRRLTFGDGPHYCIGEQLAQLESRTMIGVLRSRFPQARLAVPYE
ELRWCRKGAQTARLTDLPVWLR
>Q04809 1.3.1.-~~~dpaA~~~Dipicolinate synthase subunit A~~~COG1052
MLTGLKIAVIGGDARQLEIIRKLTEQQADIYLVGFDQLDHGFTGAVKCNIDEIPFQQIDSIILPVSATTGEGVVSTVFSN
EEVVLKQDHLDRTPAHCVIFSGISNAYLENIAAQAKRKLVKLFERDDIAIYNSIPTVEGTIMLAIQHTDYTIHGSQVAVL
GLGRTGMTIARTFAALGANVKVGARSSAHLARITEMGLVPFHTDELKEHVKDIDICINTIPSMILNQTVLSSMTPKTLIL
DLASRPGGTDFKYAEKQGIKALLAPGLPGIVAPKTAGQILANVLSKLLAEIQAEEGK
>Q04810 1.3.1.-~~~dpaB~~~Dipicolinate synthase subunit B~~~COG0452
MSSLKGKRIGFGLTGSHCTYEAVFPQIEELVNEGAEVRPVVTFNVKSTNTRFGEGAEWVKKIEDLTGYEAIDSIVKAEPL
GPKLPLDCMVIAPLTGNSMSKLANAMTDSPVLMAAKATIRNNRPVVLGISTNDALGLNGTNLMRLMSTKNIFFIPFGQDD
PFKKPNSMVAKMDLLPQTIEKALMHQQLQPILVENYQGND
>P66899 4.3.1.15~~~ygeX~~~Diaminopropionate ammonia-lyase~~~COG1171
MSVFSLKIDIADNKFFNGETSPLFSQSQAKLARQFHQKIAGYRPTPLCALDDLANLFGVKKILVKDESKRFGLNAFKMLG
GAYAIAQLLCEKYHLDIETLSFEHLKNAIGEKMTFATTTDGNHGRGVAWAAQQLGQNAVIYMPKGSAQERVDAILNLGAE
CIVTDMNYDDTVRLTMQHAQQHGWEVVQDTAWEGYTKIPTWIMQGYATLADEAVEQMREMGVTPTHVLLQAGVGAMAGGV
LGYLVDVYSPQNLHSIIVEPDKADCIYRSGVKGDIVNVGGDMATIMAGLACGEPNPLGWEILRNCATQFISCQDSVAALG
MRVLGNPYGNDPRIISGESGAVGLGVLAAVHYHPQRQSLMEKLALNKDAVVLVISTEGDTDVKHYREVVWEGKHAVAP
>P40817 4.3.1.15~~~dpaL~~~Diaminopropionate ammonia-lyase~~~
MHELIKYQFNTRRKKYGTGAALSLLNGNVGHEVLAFHKKLPNYAVTPLHNLAHLSQRLGLGSIHIKDESWRFGLNAFKGL
GGSYAVGKYLADKLQCDINSLSFAALNTPEIKEKIKDCVFVTATDGNHGRGVAWAAEQLGLKAVVYMPKGSSLIRAENIR
HHGAECTITDLNYDDAVRLAHRMAQTKGWVLLQDTAWTGYEEIPTWIMQGYMTLAVEAYEQLAETNSPLPTHLILQAGVG
SFAGSVMGYFVEKMQENIPNIIVVEPHQANCLYQSAVMDDGQPHCVTGDMATIMAGLACGEPNIISWPIIRDNTSCFISA
DDCLAAKGMRISAAPRPGTDTPFISGESGAIGVGLLYELMNNMHYQDLANRLQLDASAHVLLISTEGDTSPDIYEDIVWN
GRSA
>Q9WXG7 1.3.1.49~~~phnB~~~Cis-3,4-dihydrophenanthrene-3,4-diol dehydrogenase~~~
MAWLEGQSVFLTGGVAGLGRALVKRLVEEGANVTVLDRNARGLDELVESFKGRVAGSPGDVRNLADNRKAVELAVERFGK
LDTFNRQRRHLGLLCPPCRPAGRCHQRSFDEVIGINLMGYVMGIKAAAPALVRSRGSVILTLSSSAFYAGGGGVLYTVAK
HAAVGLIKQAAHELAPYVRVNGVAPGGIASDLRGPKSLGMGEQSITSVPLADLVKDIAPIGRLSDTEEYTGSYVYLASAR
NSAPATGVIINCDGGMGVRSVLGPASGGKGLLEKFGG
>Q47RM6 2.5.1.88~~~~~~Trans,polycis-polyprenyl diphosphate synthase ((2Z,6E)-farnesyl diphosphate specific)~~~COG0020
MSPKTVFSTDTHREPIPPQPHPSGARPPQLPRELIPRHVAIVMDGNGRWAKQRGLPRTEGHKAGESSLFDVIEGALELGV
PYLSAYAFSTENWKRSPDEVRFLMGFNRDVIRRRRDELHARGVRVRWAGRPGRLWKSVIKELTEAEELTKHNTKLTLQFC
VNYGGRAEIADAAAALARDVAAGRLSPNRVTEATLARYLYHPDIPDVDLFIRSSGEQRLSNFLLWQSSYAEFVFLDTLWP
DFDRRHFWQACEIYARRDRRYGGAEPNPVGPPQSAAGAQGQD
>A0R0S4 2.5.1.86~~~uppS~~~Decaprenyl diphosphate synthase~~~COG0020
MATTRGKKTYPQLPPAPDDYPTFPDKSTWPVVFPEIPAGTNGRFARPPQHTSKAAAPKIPADQVPNHVAVVMDGNGRWAT
QRGLGRTEGHKMGEAVLIDITCGAIEIGIKHLTVYAFSTENWKRSTEEVRFLMGFNREVVRRRRENLNDMGVRMRWVGSR
PRMWRSVIKEFDIAEQMTVDNDVITINYCVNYGGRTEIVEAARALAQEAVDGKINPARISEAMFAKHLHRADIPDVDLFI
RTSGEQRASNFLLWQAAYAEYVFQDKLWPDYDRRDLWAACEEYVNRNRRFGRA
>P9WFF7 2.5.1.86~~~uppS~~~Decaprenyl diphosphate synthase~~~COG0020
MARDARKRTSSNFPQLPPAPDDYPTFPDTSTWPVVFPELPAAPYGGPCRPPQHTSKAAAPRIPADRLPNHVAIVMDGNGR
WATQRGLARTEGHKMGEAVVIDIACGAIELGIKWLSLYAFSTENWKRSPEEVRFLMGFNRDVVRRRRDTLKKLGVRIRWV
GSRPRLWRSVINELAVAEEMTKSNDVITINYCVNYGGRTEITEATREIAREVAAGRLNPERITESTIARHLQRPDIPDVD
LFLRTSGEQRSSNFMLWQAAYAEYIFQDKLWPDYDRRDLWAACEEYASRTRRFGSA
>A9CH28 5.1.3.30~~~dpe~~~D-psicose 3-epimerase~~~COG1082
MKHGIYYSYWEHEWSAKFGPYIEKVAKLGFDIIEVAAHHINEYSDAELATIRKSAKDNGIILTAGIGPSKTKNLSSEDAA
VRAAGKAFFERTLSNVAKLDIHTIGGALHSYWPIDYSQPVDKAGDYARGVEGINGIADFANDLGINLCIEVLNRFENHVL
NTAAEGVAFVKDVGKNNVKVMLDTFHMNIEEDSFGDAIRTAGPLLGHFHTGESNRRVPGKGRMPWHEIGLALRDINYTGA
VIMEPFVKTGGTIGSDIKVWRDLSGGADIAKMDEDARNALAFSRFVLGG
>A8RG82 5.1.3.30~~~~~~D-psicose 3-epimerase~~~COG1082
MRYFKEEVAGMKYGIYFAYWTKEWFADYKKYMDKVSALGFDVLEISCAALRDVYTTKEQLIELREYAKEKGLVLTAGYGP
TKAENLCSEDPEAVRRAMTFFKDLLPKLQLMDIHILGGGLYSYWPVDFTINNDKQGDRARAVRNLRELSKTAEECDVVLG
MEVLNRYEGYILNTCEEAIDFVDEIGSSHVKIMLDTFHMNIEETNMADAIRKAGDRLGHLHLGEQNRLVPGKGSLPWAEI
GQALRDINYQGAAVMEPFVMQGGTIGSEIKVWRDMVPDLSEEALDRDAKGALEFCRHVFGI
>B8I944 5.1.3.30~~~~~~D-psicose 3-epimerase~~~COG1082
MKHGIYYAYWEQEWEADYKYYIEKVAKLGFDILEIAASPLPFYSDIQINELKACAHGNGITLTVGHGPSAEQNLSSPDPD
IRKNAKAFYTDLLKRLYKLDVHLIGGALYSYWPIDYTKTIDKKGDWERSVESVREVAKVAEACGVDFCLEVLNRFENYLI
NTAQEGVDFVKQVDHNNVKVMLDTFHMNIEEDSIGGAIRTAGSYLGHLHTGECNRKVPGRGRIPWVEIGEALADIGYNGS
VVMEPFVRMGGTVGSNIKVWRDISNGADEKMLDREAQAALDFSRYVLECHKHS
>Q939X3 2.3.1.246~~~dpgA~~~3,5-dihydroxyphenylacetyl-CoA synthase~~~
MGVDVSMTTSIEPAEDLSVLSGLTEITRFAGVGTAVSASSYSQSEVLDILDVEDPKIRSVFLNSAIDRRFLTLPPESPGG
GRVSEPQGDLLDKHKELAVDMGCRALEACLKSAGATLSDLRHLCCVTSTGFLTPGLSALIIRELGIDPHCSRSDIVGMGC
NAGLNALNVVAGWSAAHPGELGVVLCSEACSAAYALDGTMRTAVVNSLFGDGSAALAVISGDGRVPGPRVLKFASYIITD
ALDAMRYDWDRDQDRFSFFLDPQIPYVVGAHAEIVADRLLSGTGLRRSDIGHWLVHSGGKKVIDSVVVNLGLSRHDVRHT
TGVLRDYGNLSSGSFLFSYERLAEEGVTRPGDYGVLMTMGPGSTIEMALIQW
>G4V4T4 2.3.1.246~~~dpgA~~~3,5-dihydroxyphenylacetyl-CoA synthase~~~COG3424
MDVSMTTGIELTEELSVLNGLTEITRFAGVGTAVSETSYSQTELLDILDVEDPKIRSVFLNSAIDRRFLTLPPENPGGGR
LAEPQGDLLDKHKKIAVDMGCRALEACLKSAGATLSDLRHLCCVTSTGFLTPGLSALIIREMGIDPHCSRSDIVGMGCNA
GLNALNVVSGWSAAHPGELGVVLCSEACSAAYALDGTMRTAVVNSLFGDGSAALAVISGDGRVAGPRVLKFASYIITDAV
DAMRYDWDRDQDRFSFFLDPQIPYVVGAHAEIVVDRLLSGTGLRRSDIGHWLVHSGGKKVVDAVVVNLGLSRHDVRHTTG
VLRDYGNLSSGSFLFSYERLSEEDVTRPGDYGVLMTMGPGSTIEMALIQW
>Q8KLK5 2.3.1.246~~~~~~3,5-dihydroxyphenylacetyl-CoA synthase~~~COG3424
MGVDLQVTVNLDHPELLDAPVLETGVLSAEGRALPTPPRPRIVGVGTAVTRTSYSQQEVLDAFGITDRKVRSIFLNSAIE
RRNLTLPPMDSDSVRVSESQGDLLDKHKKLAIEMGAEALHACLKRCGAELSDLRHLCCVTSTGFLTPGLSALLIRELGID
RHCSRSDIVGMGCNAGLNALNVVAGWSAAHPGELAVVLCAEACSAAYTMDSTMRTAVVNSLFGDGAAAVALLAGPGGATP
ATSEGPTVLKFASCIIPEAVDAMRYDWDRTQGRFSFFLDPQIPYVVGAHAETVVDRLLSGTGLRRSDIGHWLVHSGGKKV
IDAVVVNLGLTRHDVRHTIGVLRDQGNVSSGSFLFSYERLLEEGITRPGEYGVLMTMGPGSTIETALVQW
>G4V4T5 4.2.1.17~~~dpgB~~~Enoyl-CoA-hydratase~~~COG1024
MNGELVLRLDGARPLSPASVEELSALCDRAEDDREAGPVTVHVTGVPSAGWTAGLTVGLVSKWERVVRRFERLGRLTIAV
ASGECAGTALDLLLAADLRIVTPGTRLRLAPVGGSTWPGMSVYRLTQQAGAAGIRRAVLLGTPIEVDRALALNLVDEVSD
DPAKTLAGLAEAAAALDGAETAIRRQLIFEDGSTTFEDALGAHLAAADRALRREATS
>G4V4T6 1.13.11.80~~~dpgC~~~(3,5-dihydroxyphenyl)acetyl-CoA 1,2-dioxygenase~~~COG1024
MTTDSPTLSLSPGLDHRALAKAAQRVDELLDGLPSPSARTPAQREAASSALDEIRAARTEYVEAHAEEIYDRLTDGRTRY
LRLDELVRAAASAYPGLVPTEAQMAAERSRRQAEKEGREIDQGIFLRGILSAPKAGPHLLDAMLRPTARALELLPEFVET
GVVRMEAASLERRDGVAYLTLCRDDCLNAEDAQQVDDMETAVDLALLDPAVRVGMLRGGVMSHPRYAGRRVFCAGINLKK
LSSGDIPLVDFLLRRELGYIHKIVRGVATDGSWRARVIDKPWLAAVDSFAIGGGAQLLLVFDHVVAASDAYFSLPAATEG
IIPGAANYRLSRFTGPRLARQVILGGRRITADEPDARLLVDQVVPPEEMDAAIESALAALDGDAVRANRRMVNLAEEPPD
GFRRYMAEFALQQALRIYGADVIGKVGGFAAGSR
>Q8KLK7 1.13.11.80~~~~~~(3,5-dihydroxyphenyl)acetyl-CoA 1,2-dioxygenase~~~COG1024
MTTVLPPLEDTDGLWAALTEAAASVEKLLATLPEHGARSSAERAEIAAAHDAARALRVRFLDTHADAVYDRLTDHRRVHL
RLAELVEAAATAFPGLVPTQQQLAVERSLPQAAKEGHEIDQGIFLRAVLRSPLAGPHLLDAMLRPTPRALELLPEFVRTG
EVEMEAVHLERRDGVARLTMCRDDRLNAEDGQQVDDMETAVDLALLDPGVRVGLLRGGVMSHPRYRGKRVFSAGINLKYL
SQGGISLVDFLMRRELGYIHKLVRGVLTNDDRPGWWHSPRIEKPWVAAVDGFAIGGGAQLLLVFDRVLASSDAYFSLPAA
KEGIIPGAANLRLGRFAGPRVSRQVILEGRRIWAKEPEARLLVDEVVEPDELDAAIERSLTRLDGDAVLANRRMLNLADE
SPDGFRAYMAEFALMQALRLYGHDVIDKVGRFGGRPPA
>G4V4T7 4.2.1.17~~~dpgD~~~Enoyl-CoA-hydratase~~~COG1024
MSETRVRYEKKDHVAYVTMDRPAVLNAMDRRMHEELAGIWDDVEADDDVRAVVLTGAGDRAFSVGQDLKERARLNESGVA
PTTFGSGGQAGHPRLTDRFTLSKPVVARVRGYALGGGFELVLACDIVIAAEDAVFALPEVRLGLIAGAGGVFRLPRQLPQ
KVAMGYLLTGRRMDAATALRHGLVNEVVPAAELDQCVADWTDSLVRAAPLSVRAIKEAALRSVDLPLEEAFTTSYHWEER
RRRSADAIEGVRAFAEKRDPIWTGQ
>P0AEF4 ~~~dpiA~~~Transcriptional regulatory protein DpiA~~~COG4565
MTAPLTLLIVEDETPLAEMHAEYIRHIPGFSQILLAGNLAQARMMIERFKPGLILLDNYLPDGRGINLLHELVQAHYPGD
VVFTTAASDMETVSEAVRCGVFDYLIKPIAYERLGQTLTRFRQRKHMLESIDSASQKQIDEMFNAYARGEPKDELPTGID
PLTLNAVRKLFKEPGVQHTAETVAQALTISRTTARRYLEYCASRHLIIAEIVHGKVGRPQRIYHSG
>P77510 2.7.13.3~~~dpiB~~~Sensor histidine kinase DpiB~~~COG3290
MLQLNENKQFAFFQRLAFPLRIFLLILVFSIFVIAALAQYFTASFEDYLTLHVRDMAMNQAKIIASNDSVISAVKTRDYK
RLATIANKLQRDTDFDYVVIGDRHSIRLYHPNPEKIGYPMQFTKQGALEKGESYFITGKGSMGMAMRAKTPIFDDDGKVI
GVVSIGYLVSKIDSWRAEFLLPMAGVFVVLLGILMLLSWFLAAHIRRQMMGMEPKQIARVVRQQEALFSSVYEGLIAVDP
HGYITAINRNARKMLGLSSPGRQWLGKPIVEVVRPADFFTEQIDEKRQDVVANFNGLSVIANREAIRSGDDLLGAIISFR
SKDEISTLNAQLTQIKQYVESLRTLRHEHLNWMSTLNGLLQMKEYDRVLAMVQGESQAQQQLIDSLREAFADRQVAGLLF
GKVQRARELGLKMIIVPGSQLSQLPPGLDSTEFAAIVGNLLDNAFEASLRSDEGNKIVELFLSDEGDDVVIEVADQGCGV
PESLRDKIFEQGVSTRADEPGEHGIGLYLIASYVTRCGGVITLEDNDPCGTLFSIYIPKVKPNDSSINPIDR
>P30313 2.7.7.7~~~polA~~~DNA polymerase I, thermostable~~~
MAMLPLFEPKGRVLLVDGHHLAYRTFFALKGLTTSRGEPVQAVYGFAKSLLKALKEDGDVVVVVFDAKAPSFRHEAYEAY
KAGRAPTPEDFPRQLALIKELVDLLGLVRLEVPGFEADDVLATLAKRAEKEGYEVRILTADRDLYQLLSERIAILHPEGY
LITPAWLYEKYGLRPEQWVDYRALAGDPSDNIPGVKGIGEKTAQRLIREWGSLENLFQHLDQVKPSLREKLQAGMEALAL
SRKLSQVHTDLPLEVDFGRRRTPNLEGLRAFLERLEFGSLLHEFGLLEGPKAAEEAPWPPPEGAFLGFSFSRPEPMWAEL
LALAGAWEGRLHRAQDPLRGLRDLKGVRGILAKDLAVLALREGLDLFPEDDPMLLAYLLDPSNTTPEGVARRYGGEWTED
AGERALLAERLFQTLKERLKGEERLLWLYEEVEKPLSRVLARMEATGVRLDVAYLQALSLEVEAEVRQLEEEVFRLAGHP
FNLNSRDQLERVLFDELGLPAIGKTEKTGKRSTSAAVLEALREAHPIVDRILQYRELTKLKNTYIDPLPALVHPKTGRLH
TRFNQTATATGRLSSSDPNLQNIPVRTPLGQRIRRAFVAEEGWVLVVLDYSQIELRVLAHLSGDENLIRVFQEGRDIHTQ
TASWMFGVSPEGVDPLMRRAAKTINFGVLYGMSAHRLSGELSIPYEEAVAFIERYFQSYPKVRAWIEGTLEEGRRRGYVE
TLFGRRRYVPDLNARVKSVREAAERMAFNMPVQGTAADLMKLAMVRLFPRLQELGARMLLQVHDELVLEAPKDRAERVAA
LAKEVMEGVWPLQVPLEVEVGLGEDWLSAKE
>Q04957 2.7.7.7~~~polA~~~DNA polymerase I~~~
MKKKLVLIDGSSVAYRAFFALPLLHNDKGIHTNAVYGFTMMLNKILAEEEPTHMLVAFDAGKTTFRHEAFQEYKGGRQQT
PPELSEQFPLLRELLRAYRIPAYELENYEADDIIGTLAARAEQEGFEVKVISGDRDLTQLASPHVTVDITKKGITDIEPY
TPEAVREKYGLTPEQIVDLKGLMGDKSDNIPGVPGIGEKTAVKLLRQFGTVENVLASIDEIKGEKLKETLRQHREMALLS
KKLAAIRRDAPVELSLDDIAYQGEDREKVVALFKELGFQSFLEKMESPSSEEEKPLAKMAFTLADRVTEEMLADKAALVV
EVVEENYHDAPIVGIAVVNEHGRFFLRPETALADPQFVAWLGDETKKKSMFDSKRAAVALKWKGIELCGVSFDLLLAAYL
LDPAQGVDDVAAAAKMKQYEAVRPDEAVYGKGAKRAVPDEPVLAEHLVRKAAAIWALERPFLDELRRNEQDRLLVELEQP
LSSILAEMEFAGVKVDTKRLEQMGEELAEQLRTVEQRIYELAGQEFNINSPKQLGVILFEKLQLPVLKKSKTGYSTSADV
LEKLAPYHEIVENILQHYRQLGKLQSTYIEGLLKVVRPDTKKVHTIFNQALTQTGRLSSTEPNLQNIPIRLEEGRKIRQA
FVPSESDWLIFAADYSQIELRVLAHIAEDDNLMEAFRRDLDIHTKTAMDIFQVSEDEVTPNMRRQAKAVNFGIVYGISDY
GLAQNLNISRKEAAEFIERYFESFPGVKRYMENIVQEAKQKGYVTTLLHRRRYLPDITSRNFNVRSFAERMAMNTPIQGS
AADIIKKAMIDLNARLKEERLQARLLLQVHDELILEAPKEEMERLCRLVPEVMEQAVTLRVPLKVDYHYGSTWYDAK
>P00582 2.7.7.7~~~polA~~~DNA polymerase I~~~COG0258
MVQIPQNPLILVDGSSYLYRAYHAFPPLTNSAGEPTGAMYGVLNMLRSLIMQYKPTHAAVVFDAKGKTFRDELFEHYKSH
RPPMPDDLRAQIEPLHAMVKAMGLPLLAVSGVEADDVIGTLAREAEKAGRPVLISTGDKDMAQLVTPNITLINTMTNTIL
GPEEVVNKYGVPPELIIDFLALMGDSSDNIPGVPGVGEKTAQALLQGLGGLDTLYAEPEKIAGLSFRGAKTMAAKLEQNK
EVAYLSYQLATIKTDVELELTCEQLEVQQPAAEELLGLFKKYEFKRWTADVEAGKWLQAKGAKPAAKPQETSVADEAPEV
TATVISYDNYVTILDEETLKAWIAKLEKAPVFAFDTETDSLDNISANLVGLSFAIEPGVAAYIPVAHDYLDAPDQISRER
ALELLKPLLEDEKALKVGQNLKYDRGILANYGIELRGIAFDTMLESYILNSVAGRHDMDSLAERWLKHKTITFEEIAGKG
KNQLTFNQIALEEAGRYAAEDADVTLQLHLKMWPDLQKHKGPLNVFENIEMPLVPVLSRIERNGVKIDPKVLHNHSEELT
LRLAELEKKAHEIAGEEFNLSSTKQLQTILFEKQGIKPLKKTPGGAPSTSEEVLEELALDYPLPKVILEYRGLAKLKSTY
TDKLPLMINPKTGRVHTSYHQAVTATGRLSSTDPNLQNIPVRNEEGRRIRQAFIAPEDYVIVSADYSQIELRIMAHLSRD
KGLLTAFAEGKDIHRATAAEVFGLPLETVTSEQRRSAKAINFGLIYGMSAFGLARQLNIPRKEAQKYMDLYFERYPGVLE
YMERTRAQAKEQGYVETLDGRRLYLPDIKSSNGARRAAAERAAINAPMQGTAADIIKRAMIAVDAWLQAEQPRVRMIMQV
HDELVFEVHKDDVDAVAKQIHQLMENCTRLDVPLLVEVGSGENWDQAH
>P52026 2.7.7.7~~~polA~~~DNA polymerase I~~~
MKNKLVLIDGNSVAYRAFFALPLLHNDKGIHTNAVYGFTMMLNKILAEEQPTHILVAFDAGKTTFRHETFQDYKGGRQQT
PPELSEQFPLLRELLKAYRIPAYELDHYEADDIIGTMAARAEREGFAVKVISGDRDLTQLASPQVTVEITKKGITDIESY
TPETVVEKYGLTPEQIVDLKGLMGDKSDNIPGVPGIGEKTAVKLLKQFGTVENVLASIDEIKGEKLKENLRQYRDLALLS
KQLAAICRDAPVELTLDDIVYKGEDREKVVALFQELGFQSFLDKMAVQTDEGEKPLAGMDFAIADSVTDEMLADKAALVV
EVVGDNYHHAPIVGIALANERGRFFLRPETALADPKFLAWLGDETKKKTMFDSKRAAVALKWKGIELRGVVFDLLLAAYL
LDPAQAAGDVAAVAKMHQYEAVRSDEAVYGKGAKRTVPDEPTLAEHLVRKAAAIWALEEPLMDELRRNEQDRLLTELEQP
LAGILANMEFTGVKVDTKRLEQMGAELTEQLQAVERRIYELAGQEFNINSPKQLGTVLFDKLQLPVLKKTKTGYSTSADV
LEKLAPHHEIVEHILHYRQLGKLQSTYIEGLLKVVHPVTGKVHTMFNQALTQTGRLSSVEPNLQNIPIRLEEGRKIRQAF
VPSEPDWLIFAADYSQIELRVLAHIAEDDNLIEAFRRGLDIHTKTAMDIFHVSEEDVTANMRRQAKAVNFGIVYGISDYG
LAQNLNITRKEAAEFIERYFASFPGVKQYMDNIVQEAKQKGYVTTLLHRRRYLPDITSRNFNVRSFAERTAMNTPIQGSA
ADIIKKAMIDLSVRLREERLQARLLLQVHDELILEAPKEEIERLCRLVPEVMEQAVTLRVPLKVDYHYGPTWYDAK
>P9WNU5 2.7.7.7~~~polA~~~DNA polymerase I~~~COG0258
MVTTASAPSEDRAKPTLMLLDGNSLAFRAFYALPAENFKTRGGLTTNAVYGFTAMLINLLRDEAPTHIAAAFDVSRQTFR
LQRYPEYKANRSSTPDEFAGQIDITKEVLGALGITVLSEPGFEADDLIATLATQAENEGYRVLVVTGDRDALQLVSDDVT
VLYPRKGVSELTRFTPEAVVEKYGLTPRQYPDFAALRGDPSDNLPGIPGVGEKTAAKWIAEYGSLRSLVDNVDAVRGKVG
DALRANLASVVRNRELTDLVRDVPLAQTPDTLRLQPWDRDHIHRLFDDLEFRVLRDRLFDTLAAAGGPEVDEGFDVRGGA
LAPGTVRQWLAEHAGDGRRAGLTVVGTHLPHGGDATAMAVAAADGEGAYLDTATLTPDDDAALAAWLADPAKPKALHEAK
AAVHDLAGRGWTLEGVTSDTALAAYLVRPGQRSFTLDDLSLRYLRRELRAETPQQQQLSLLDDDDTDAETIQTTILRARA
VIDLADALDAELARIDSTALLGEMELPVQRVLAKMESAGIAVDLPMLTELQSQFGDQIRDAAEAAYGVIGKQINLGSPKQ
LQVVLFDELGMPKTKRTKTGYTTDADALQSLFDKTGHPFLQHLLAHRDVTRLKVTVDGLLQAVAADGRIHTTFNQTIAAT
GRLSSTEPNLQNIPIRTDAGRRIRDAFVVGDGYAELMTADYSQIEMRIMAHLSGDEGLIEAFNTGEDLHSFVASRAFGVP
IDEVTGELRRRVKAMSYGLAYGLSAYGLSQQLKISTEEANEQMDAYFARFGGVRDYLRAVVERARKDGYTSTVLGRRRYL
PELDSSNRQVREAAERAALNAPIQGSAADIIKVAMIQVDKALNEAQLASRMLLQVHDELLFEIAPGERERVEALVRDKMG
GAYPLDVPLEVSVGYGRSWDAAAH
>P19821 2.7.7.7~~~polA~~~DNA polymerase I, thermostable~~~
MRGMLPLFEPKGRVLLVDGHHLAYRTFHALKGLTTSRGEPVQAVYGFAKSLLKALKEDGDAVIVVFDAKAPSFRHEAYGG
YKAGRAPTPEDFPRQLALIKELVDLLGLARLEVPGYEADDVLASLAKKAEKEGYEVRILTADKDLYQLLSDRIHVLHPEG
YLITPAWLWEKYGLRPDQWADYRALTGDESDNLPGVKGIGEKTARKLLEEWGSLEALLKNLDRLKPAIREKILAHMDDLK
LSWDLAKVRTDLPLEVDFAKRREPDRERLRAFLERLEFGSLLHEFGLLESPKALEEAPWPPPEGAFVGFVLSRKEPMWAD
LLALAAARGGRVHRAPEPYKALRDLKEARGLLAKDLSVLALREGLGLPPGDDPMLLAYLLDPSNTTPEGVARRYGGEWTE
EAGERAALSERLFANLWGRLEGEERLLWLYREVERPLSAVLAHMEATGVRLDVAYLRALSLEVAEEIARLEAEVFRLAGH
PFNLNSRDQLERVLFDELGLPAIGKTEKTGKRSTSAAVLEALREAHPIVEKILQYRELTKLKSTYIDPLPDLIHPRTGRL
HTRFNQTATATGRLSSSDPNLQNIPVRTPLGQRIRRAFIAEEGWLLVALDYSQIELRVLAHLSGDENLIRVFQEGRDIHT
ETASWMFGVPREAVDPLMRRAAKTINFGVLYGMSAHRLSQELAIPYEEAQAFIERYFQSFPKVRAWIEKTLEEGRRRGYV
ETLFGRRRYVPDLEARVKSVREAAERMAFNMPVQGTAADLMKLAMVKLFPRLEEMGARMLLQVHDELVLEAPKERAEAVA
RLAKEVMEGVYPLAVPLEVEVGIGEDWLSAKE
>P80194 2.7.7.7~~~polA~~~DNA polymerase I, thermostable~~~
MEAMLPLFEPKGRVLLVDGHHLAYRTFFALKGLTTSRGEPVQAVYGFAKSLLKALKEDGYKAVFVVFDAKAPSFRHEAYE
AYKAGRAPTPEDFPRQLALIKELVDLLGFTRLEVPGYEADDVLATLAKNPEKEGYEVRILTADRDLDQLVSDRVAVLHPE
GHLITPEWLWQKYGLKPEQWVDFRALVGDPSDNLPGVKGIGEKTALKLLKEWGSLENLLKNLDRVKPENVREKIKAHLED
LRLSLELSRVRTDLPLEVDLAQGREPDREGLRAFLERLEFGSLLHEFGLLEAPAPLEEAPWPPPEGAFVGFVLSRPEPMW
AELKALAACRDGRVHRAADPLAGLKDLKEVRGLLAKDLAVLASREGLDLVPGDDPMLLAYLLDPSNTTPEGVARRYGGEW
TEDAAHRALLSERLHRNLLKRLQGEEKLLWLYHEVEKPLSRVLAHMEATGVRLDVAYLQALSLELAEEIRRLEEEVFRLA
GHPFNLNSRDQLERVLFDELRLPALGKTQKTGKRSTSAAVLEALREAHPIVEKILQHRELTKLKNTYVDPLPSLVHPNTG
RLHTRFNQTATATGRLSSSDPNLQNIPVRTPLGQRIRRAFVAEAGWALVALDYSQIELRVLAHLSGDENLIRVFQEGKDI
HTQTASWMFGVPPEAVDPLMRRAAKTVNFGVLYGMSAHRLSQELAIPYEEAVAFIERYFQSFPKVRAWIEKTLEEGRKRG
YVETLFGRRRYVPDLNARVKSVREAAERMAFNMPVQGTAADLMKLAMVKLFPRLREMGARMLLQVHDELLLEAPQAGAEE
VAALAKEAMEKAYPLAVPLEVEVGMGEDWLSAKG
>O52225 2.7.7.7~~~polA~~~DNA polymerase I, thermostable~~~
MTPLFDLEEPPKRVLLVDGHHLAYRTFYALSLTTSRGEPVQMVYGFARSLLKALKEDGQAVVVVFDAKAPSFRHEAYEAY
KAGRAPTPEDFPRQLALVKRLVDLLGLVRLEAPGYEADDVLGTLAKKAEREGMEVRILTGDRDFFQLLSEKVSVLLPDGT
LVTPKDVQEKYGVPPERWVDFRALTGDRSDNIPGVAGIGEKTALRLLAEWGSVENLLKNLDRVKPDSLRRKIEAHLEDLH
LSLDLARIRTDLPLEVDFKALRRRTPDLEGLRAFLEELEFGSLLHEFGLLGGEKPREEAPWPPPEGAFVGFLLSRKEPMW
AELLALAAASEGRVHRATSPVEALADLKEARGFLAKDLAVLALREGVALDPTDDPLLVAYLLDPANTHPEGVARRYGGEF
TEDAAERALLSERLFQNLFPRLSEKLLWLYQEVERPLSRVLAHMEARGVRLDVPLLEALSFELEKEMERLEGEVFRLAGH
PFNLNSRDQLERVLFDELGLTPVGRTEKTGKRSTAQGALEALRGAHPIVELILQYRELSKLKSTYLDPLPRLVHPRTGRL
HTRFNQTATATGRLSSSDPNLQNIPVRTPLGQRIRKAFVAEEGWLLLAADYSQIELRVLAHLSGDENLKRVFREGKDIHT
ETAAWMFGLDPALVDPKMRRAAKTVNFGVLYGMSAHRLSQELGIDYKEAEAFIERYFQSFPKVRAWIERTLEEGRTRGYV
ETLFGRRRYVPDLASRVRSVREAAERMAFNMPVQGTAADLMKIAMVKLFPRLKPLGAHLLLQVHDELVLEVPEDRAEEAK
ALVKEVMENAYPLDVPLEVEVGVGRDWLEAKQD
>P21189 2.7.7.7~~~polB~~~DNA polymerase II~~~COG0417
MAQAGFILTRHWRDTPQGTEVSFWLATDNGPLQVTLAPQESVAFIPADQVPRAQHILQGEQGFRLTPLALKDFHRQPVYG
LYCRAHRQLMNYEKRLREGGVTVYEADVRPPERYLMERFITSPVWVEGDMHNGTIVNARLKPHPDYRPPLKWVSIDIETT
RHGELYCIGLEGCGQRIVYMLGPENGDASSLDFELEYVASRPQLLEKLNAWFANYDPDVIIGWNVVQFDLRMLQKHAERY
RLPLRLGRDNSELEWREHGFKNGVFFAQAKGRLIIDGIEALKSAFWNFSSFSLETVAQELLGEGKSIDNPWDRMDEIDRR
FAEDKPALATYNLKDCELVTQIFHKTEIMPFLLERATVNGLPVDRHGGSVAAFGHLYFPRMHRAGYVAPNLGEVPPHASP
GGYVMDSRPGLYDSVLVLDYKSLYPSIIRTFLIDPVGLVEGMAQPDPEHSTEGFLDAWFSREKHCLPEIVTNIWHGRDEA
KRQGNKPLSQALKIIMNAFYGVLGTTACRFFDPRLASSITMRGHQIMRQTKALIEAQGYDVIYGDTDSTFVWLKGAHSEE
EAAKIGRALVQHVNAWWAETLQKQRLTSALELEYETHFCRFLMPTIRGADTGSKKRYAGLIQEGDKQRMVFKGLETVRTD
WTPLAQQFQQELYLRIFRNEPYQEYVRETIDKLMAGELDARLVYRKRLRRPLSEYQRNVPPHVRAARLADEENQKRGRPL
QYQNRGTIKYVWTTNGPEPLDYQRSPLDYEHYLTRQLQPVAEGILPFIEDNFATLMTGQLGLF
>B8GWS6 2.7.7.7~~~dnaE1~~~DNA polymerase III subunit alpha~~~
MSDAEGQGFVHLRVRSAYSLLEGAIKADQIGKLAAEAKMPAAGLADRANLFGALEYSSYAKDAGVQPIIGCAIPVVGIGG
GPTERWARAPTLMLLAQNERGYLNLSELSSIAYLDSAELPEPVVPWAKVAEHSEGLILLSGGTDGPVDALFAAGKTAEAS
AALAEMHRVFGDRFYVELQRHGLPRQAAAEPGLVNWAYDHDVPLVATNDVYFAKPGFYDAHDALLCISDGAFVGQDERRR
VTPEHWFKPAEEMRKLFADLPEACDNTLDIARRCAFMVHKRDPILPSFPTGDGRNEAEELEHQAREGLKMRLEGLTLSAP
EEEYWKRLDFELGIIKKMGFPGYFLIVSDFIKWGKAHGIPVGPGRGSGAGSLVAWVLTITDLDPLRFGLLFERFLNPERV
SMPDFDVDFCQERREEVISYVQEKYGRDRVAQIITFGSLQARAVLRDVGRVMQLPLGLVDRLCKMVPNNPAAPVTLAQAI
DLEPRLKQAKKEDANVSACLDVALQLEGLFRNASTHAAGLVIGDRPLTQLTPLYKDPRSDLPATQFNMKWVESAGLVKFD
FLGLKTLTVLDRAVKHLKKRGFEIDLGKLPFDDAKTYELLASGQTVGVFQLESQGMRDTLRKMRCGSIEEITALISLYRP
GPMDNIDTFVDCKFGRKPVDTLHPSLEAVLKETYGVIVYQEQVMQIAQILAGYSLGEADLLRRAMGKKKKEEMDLQKIRF
VSGAKEKNVPEEQSGSIFELVAKFAGYGFNKSHAAAYAFISYQTAWLKANTPVEFFAASMSLDLSNTDKLAVFHQDARRF
GITVRAPDVNRSGADFEVENGEVLYALGAIRNVGLEAMKHLVAVRAEGGPFRDVFDFVERIDPRQVNKRAIENLARAGAF
DSIHKNRAQIVASADVLIAHAQSCHADRQGGQGGLFGSDPGAGRPRLSKTENWNQVDLLDEELSAVGFYLTGHPLEDMVG
MLRRRRTVMLAEAMAQAEAGAEAFRMCGVVRRRQERASQSGEKFAFVSLSDPTGEYEVLYPPESLRKCRDVLEPGKAVAI
KVRAKARDGEVRFFGDDAEPIEKAVENVVAGLRVHLSPSAAEIDALKRRLEPAQAQKGGEVTFVAAIGGGREIELRLPGR
YTLDAALRGALKTAPGVALLEDV
>Q9RX08 2.7.7.7~~~dnaE~~~DNA polymerase III subunit alpha~~~COG0587
MTVSDAPTPHIHLPDGSCCQPKKFAHLHQHTQYSLLDGAAKLKDLLKWAKEVTPEGQTPALAMTDHGNMHGAVHFYNYAM
GMEVKPIIGYEAYVVPGFGTRRDRSRAQDGEKGIFHLTLLARDFEGYQNLCRLSSRGYTEGYYYKPRIDHELLQEHHKGI
IAFSGCLGSEVQQLLMQGREDDARARLLWYRELFGDNYFIEIQDHGLPEQKKNNPILKAWAQELGIGMVATNDGHYVKKS
DATAHETLLAIQTKATLADENRFKFPCDEFYVKNLEEMQRALPVSEWGEEPFDNTAHIAELCNVELPVGKKRQYQMPQLP
IPEGRSMAEELRVQTYAGAVKRYPAHLTEGLLRDYAARSLAELGEADAARVLKRTGGCDSASCDLDTLYTLLAFLGSEWE
ARGKEAGEKYTPYPALEKMEQDGESGTLPAYAHADCRAARRQDSDTSIELDPDTDDEETTRSHHRYALKLLRRAEYELSV
INNMGFPDYFLIVADYINWAKDHDISVGPGRGSGAGSLVAYAIRITNLDPLEFELLFERFLNPDRISMPDFDIDFNDARR
TEVIGYVQEKYGTDKVAQIATFGTMASKACLKDVARVMGLEYAKVDKVSKLIPIKFGKSYSLEQAREAVPDIQQMLAEDA
QLLEAYEFAQKLEGLTRHASVHAAGVVIGREELTNLVPVMRDTSGEGQVCQYDMKSVEDIGLIKMDFLGLRTLSFLDEAK
RILRESGTDFEEKYGDFDHIPFDDEKTYELMSRGDTKGVFQLEGAGIADASRRLKPRRLADIIALSALYRPGPMENIPTY
VRRHHGIEEVDYDKDGFPNSKQWLEKILQETYGIPVYQEQIMQIASEVAGYSLGGADLLRRAMGKKDAEEMKRQRQLFVV
GAKEKGVPEDEGNKLFDMLDAFANYGFNKSHSAAYGVITYQTAWLKANYPVQFMAALLTVERRDSDKVAEYVSDARKMDL
HVLPPDINRSSSDFAVAGEEILFGLYAIKGLGESAVLRILEEREKAGAFKSLADFCSRLGNKVCNRKALESLIKSGAFDA
FGERHQLIESLEDALEDAAGTAEINARAQSGMSMMFGMEEVKKERPLRSSIAPYSDLERLAIEKEALGLYISGHPLEQHE
GLREAASCRVSDLDAWFALQNVAPGKRQKAVLAGMIEGVVKKPTKSGGMMARFILADESGQMELVAFSRAYDRIEPKLVN
DTPALVIVELEAEDGGLRAIAEEIVSIEQLSEVPKVMYVTIDLETASPDALGDFQSVLDEYAGSMPTYLRLETPEQFVVY
QLDHGMGSPEAIRALNQTFAWADAHLAYDQQTILGRFAPKPPAWMNRQQGGGMRA
>P10443 2.7.7.7~~~dnaE~~~DNA polymerase III subunit alpha~~~COG0587
MSEPRFVHLRVHSDYSMIDGLAKTAPLVKKAAALGMPALAITDFTNLCGLVKFYGAGHGAGIKPIVGADFNVQCDLLGDE
LTHLTVLAANNTGYQNLTLLISKAYQRGYGAAGPIIDRDWLIELNEGLILLSGGRMGDVGRSLLRGNSALVDECVAFYEE
HFPDRYFLELIRTGRPDEESYLHAAVELAEARGLPVVATNDVRFIDSSDFDAHEIRVAIHDGFTLDDPKRPRNYSPQQYM
RSEEEMCELFADIPEALANTVEIAKRCNVTVRLGEYFLPQFPTGDMSTEDYLVKRAKEGLEERLAFLFPDEEERLKRRPE
YDERLETELQVINQMGFPGYFLIVMEFIQWSKDNGVPVGPGRGSGAGSLVAYALKITDLDPLEFDLLFERFLNPERVSMP
DFDVDFCMEKRDQVIEHVADMYGRDAVSQIITFGTMAAKAVIRDVGRVLGHPYGFVDRISKLIPPDPGMTLAKAFEAEPQ
LPEIYEADEEVKALIDMARKLEGVTRNAGKHAGGVVIAPTKITDFAPLYCDEEGKHPVTQFDKSDVEYAGLVKFDFLGLR
TLTIINWALEMINKRRAKNGEPPLDIAAIPLDDKKSFDMLQRSETTAVFQLESRGMKDLIKRLQPDCFEDMIALVALFRP
GPLQSGMVDNFIDRKHGREEISYPDVQWQHESLKPVLEPTYGIILYQEQVMQIAQVLSGYTLGGADMLRRAMGKKKPEEM
AKQRSVFAEGAEKNGINAELAMKIFDLVEKFAGYGFNKSHSAAYALVSYQTLWLKAHYPAEFMAAVMTADMDNTEKVVGL
VDECWRMGLKILPPDINSGLYHFHVNDDGEIVYGIGAIKGVGEGPIEAIIEARNKGGYFRELFDLCARTDTKKLNRRVLE
KLIMSGAFDRLGPHRAALMNSLGDALKAADQHAKAEAIGQADMFGVLAEEPEQIEQSYASCQPWPEQVVLDGERETLGLY
LTGHPINQYLKEIERYVGGVRLKDMHPTERGKVITAAGLVVAARVMVTKRGNRIGICTLDDRSGRLEVMLFTDALDKYQQ
LLEKDRILIVSGQVSFDDFSGGLKMTAREVMDIDEAREKYARGLAISLTDRQIDDQLLNRLRQSLEPHRSGTIPVHLYYQ
RADARARLRFGATWRVSPSDRLLNDLRGLIGSEQVELEFD
>P9WNT7 2.7.7.7~~~dnaE1~~~DNA polymerase III subunit alpha~~~COG0587
MSGSSAGSSFVHLHNHTEYSMLDGAAKITPMLAEVERLGMPAVGMTDHGNMFGASEFYNSATKAGIKPIIGVEAYIAPGS
RFDTRRILWGDPSQKADDVSGSGSYTHLTMMAENATGLRNLFKLSSHASFEGQLSKWSRMDAELIAEHAEGIIITTGCPS
GEVQTRLRLGQDREALEAAAKWREIVGPDNYFLELMDHGLTIERRVRDGLLEIGRALNIPPLATNDCHYVTRDAAHNHEA
LLCVQTGKTLSDPNRFKFDGDGYYLKSAAEMRQIWDDEVPGACDSTLLIAERVQSYADVWTPRDRMPVFPVPDGHDQASW
LRHEVDAGLRRRFPAGPPDGYRERAAYEIDVICSKGFPSYFLIVADLISYARSAGIRVGPGRGSAAGSLVAYALGITDID
PIPHGLLFERFLNPERTSMPDIDIDFDDRRRGEMVRYAADKWGHDRVAQVITFGTIKTKAALKDSARIHYGQPGFAIADR
ITKALPPAIMAKDIPLSGITDPSHERYKEAAEVRGLIETDPDVRTIYQTARGLEGLIRNAGVHACAVIMSSEPLTEAIPL
WKRPQDGAIITGWDYPACEAIGLLKMDFLGLRNLTIIGDAIDNVRANRGIDLDLESVPLDDKATYELLGRGDTLGVFQLD
GGPMRDLLRRMQPTGFEDVVAVIALYRPGPMGMNAHNDYADRKNNRQAIKPIHPELEEPLREILAETYGLIVYQEQIMRI
AQKVASYSLARADILRKAMGKKKREVLEKEFEGFSDGMQANGFSPAAIKALWDTILPFADYAFNKSHAAGYGMVSYWTAY
LKANYPAEYMAGLLTSVGDDKDKAAVYLADCRKLGITVLPPDVNESGLNFASVGQDIRYGLGAVRNVGANVVGSLLQTRN
DKGKFTDFSDYLNKIDISACNKKVTESLIKAGAFDSLGHARKGLFLVHSDAVDSVLGTKKAEALGQFDLFGSNDDGTGTA
DPVFTIKVPDDEWEDKHKLALEREMLGLYVSGHPLNGVAHLLAAQVDTAIPAILDGDVPNDAQVRVGGILASVNRRVNKN
GMPWASAQLEDLTGGIEVMFFPHTYSSYGADIVDDAVVLVNAKVAVRDDRIALIANDLTVPDFSNAEVERPLAVSLPTRQ
CTFDKVSALKQVLARHPGTSQVHLRLISGDRITTLALDQSLRVTPSPALMGDLKELLGPGCLGS
>Q9JXZ2 2.7.7.7~~~dnaE~~~DNA polymerase III subunit alpha~~~
MTEPTYIPLRLHTEFSITDGMVRIKKLIAKAQEYGLPALGISDLMNEFGLVKFYKACRSAGIKPIGAADVRIGNPDAPDK
PFRAMLIIRNDAGYLRLSELLTAAYVGKDRNVHHAELNPEWLENGDNSGLICLSGAHYGEVGVNLLNGNEDAARTAALKY
AAWFPDAFYMELQRLPERPEWEACVSGSVKLAEELGLPVVATHPTQFMSRDDFNAHEARVCIAGGWVLTDKKRPRDFTPG
QFFIPPETMAERFADLPEALENTVEIAKRCNLHITLGKNFLPLFPTPDGLSLDDYLIKLSNEGLQERMVQLYPDEAERAA
KMPEYQERLDFELNIIIQMKFPGYFLIVQDFINWAKTHGCPVGPGRGSGAGSLVAYSLKITDLDPLKYALLFERFLNPER
VSMPDFDVDFCQSNRGRVIEYVREKYGAEAVSQIVTFGTMSSKAVIRDVGRVLELPFMLCDKLSKLIPLEANKPLSLEKA
METEPQIQELIEAEEADELITLAKKLEDLTRGLGMHAGGVLIAPGKISDYSPVYQADESASPVSMYDKGDVEDVGLVKFD
FLGLRNLTIIEMAQNNIKNTTGDIIDVGKIPLDDQVAYQIFRDANTTAVFQFESTGMKKMLKTAHTTKFEELIAFVSLYR
PGPMDNIPDFVARMKGQEFQYIHPLLEGILAPTYGIMVYQEQVMQAAQIIGGYSLGGADLLRRAMGKKKPEEMVKHREIF
AEGAAKQGISREKSDEIFNYMEKFAGYGFNKSHAAAYALISYQTAWLKAHYPAEFMAATMSSELDNTDQLKHFYDDCRAN
GIEFLPPDINESDYRFTPYPDMKIRYALGAIKGTGEAAVESITAARQSGGKFTGLLDFCERVGKEHMNRRTLEALIRGGA
FDSIEPNRAMLLANIDLAMDNADQKAANANQGGLFDMMEDAIEPVRLIDAPMWSESEKLAEEKTVIGFYLSGHPFGPYAQ
EVRQIAPTKLDRLKPQDSVRLAGFVTAVRTMMGKRGKIAFVSLEDLSGQVEIMVGGQTLENCADCLKADQVLIIESKVSR
DDYGGGDGLRILANQVMTLQTARERYARSLSLALAPHHDIGGLVRLLAAHQLPDTPRIPLQLSYANEKASGRLQVPPKWT
VTPSSALFGELETLLGSRSVRVNW
>Q9XDH5 2.7.7.7~~~dnaE~~~DNA polymerase III subunit alpha~~~
MGSKLKFAHLHQHTQFSLLDGAAKLQDLLKWVKETTPEDPALAMTDHGNLFGAVEFYKKATAMGVKPIIGYEAYVAAESR
FDRKRGKGLDGGYFHLTLLAKDFTGYQNLVRLASRAYLEGFYEKPRIDREILREHAQGLIALSGCLGAEIPQFILQDRLD
LAEARLNEDLSIFGDRFFIEIQNHGLPEQKKVNQVLKEFARKYGLGMVATNDGHYVRKEDARAHEVLLAIQSKTTLDDPE
RWRFPCDEFYVKTPEEMRAMLPEAEWGDEPFDNTVEIARMCDVDLPIGDKMVYRIPRFPLPEGRTEAQYLRELTFLGLLR
RYPDRITEAFYREVLRLLGTMPPHGDERALAEALARVEEKAWEELRKRLPPLEGVREWTAEAILHRALYELSVIERMGFP
GYFLIVQDYINWARGHGVSVGPGRGSAAGSLVAYAVGITNIDPLRFGLLFERFLNPERVSMPDIDTDFSDRERDRVIQYV
RERYGEDKVAQIGTFGSLASKAALKDVARVYGIPHKKAEELAKLIPVQFGKPKPLQEAIQVVPELRAEMEKDERIRQVIE
VAMRLEGLNRHASVHAAGVVIAAEPLTDLVPLMRDQEGRPVTQYDMGAVEALGLLKMDFLGLRTLTFLDEARRIVKESKG
VELDYDRLPLDDPKTFELLSRGETKGVFQLESGGMTATVRGLKPRRLEDIIALVSLYRPGPMEHIPTYIRRHHGQEPVSY
AEFPHAEKYLRPILDETYGIPVYQEQIMQIASQVAGYSLGEADLLRRAMGKKRVEEMQKHRERFVRGAKERGVPEEEANR
LFDMLEAFANYGFNKSHAAAYSLLSYQTAYVKAHYPVEFMAALLSVERHDSDKVAEYIRDARALGIPVLPPDVNRSGFDF
KVVGEEILFGLSAVKNVGEMAARAILEERERGGPFKSLGDFLKRLPEQVVNKRALESLVKAGALDAFGDRARLLASLEPL
LRWAAETRERGRSGLVGLFAEVEEPPLVEASPLDEITMLRYEKEALGIYVSGHPVLRYPGLREVASCTIEELSEFVRELP
GKPKVLLSGMVEEVVRKPTRSGGMMARFTLSDETGALEVVVFGRAYEGVSPKLKEDIPLLVLAEVEKGEELRVLAQAVWT
LEEVLEAPKALEVEVDHALLDEKGVARLKSLLDEHPGSLPVYLRVLGPFGEALFALREVRVGEEALGLLEAEGYRAYLVP
DREVFLQGNGGGPKEEVVPF
>P05649 ~~~dnaN~~~Beta sliding clamp~~~COG0592
MKFTIQKDRLVESVQDVLKAVSSRTTIPILTGIKIVASDDGVSFTGSDSDISIESFIPKEEGDKEIVTIEQPGSIVLQAR
FFSEIVKKLPMATVEIEVQNQYLTIIRSGKAEFNLNGLDADEYPHLPQIEEHHAIQIPTDLLKNLIRQTVFAVSTSETRP
ILTGVNWKVEQSELLCTATDSHRLALRKAKLDIPEDRSYNVVIPGKSLTELSKILDDNQELVDIVITETQVLFKAKNVLF
FSRLLDGNYPDTTSLIPQDSKTEIIVNTKEFLQAIDRASLLAREGRNNVVKLSAKPAESIEISSNSPEIGKVVEAIVADQ
IEGEELNISFSPKYMLDALKVLEGAEIRVSFTGAMRPFLIRTPNDETIVQLILPVRTY
>P33761 ~~~dnaN~~~Beta sliding clamp~~~
MLHNTFFICETNQIMNEIEKAKGIILNRNMNDIWSALLIEVKKSNLIIKSTDRNIFFESTISIVSETDFKVLINASNFYD
AVKAFNFYKKIKIVFNENNSKLEIMGELNDEKEEYEDHLKEPTFSYEEIENYNYDMVNEDYTFGIEIKQKSFKKVINRIA
FSAHLDESKNVLNGVYFSKDEDSKLLLVSTNGHRMSICKTEVIVEEDVNFIVPVKIFNFLKHLMSGEGMVKIKFSDKKFY
VEFDNYKIACSLINGNYPDYKSIIPKEQKNKSLVSLGILKDRLARVNLYVDKSRKLVLTFSELQLKLLGEDLITGRKGEF
FIKDPNYLYDGADEVMAINISYFVEAISVFETSKIEIQFNSGNVLKLSEPENFNFTHLIMPMSLG
>P0CAU5 ~~~dnaN~~~Beta sliding clamp~~~COG0592
MKLTIERAALLKALGHVQSVVERRNTIPILSNILLSAEGDRLSFSATDLDMEIIDEGFAQIDVPGQITAPAHTLYEIVRK
LPDGADVSLSFSGDDPRLVIQAGRSRFNLPVLPAGDFPVMSSDGLSSRIAVDTNELIRLIDKTRFAISTEETRYYLNGLY
VHTVNEGGETKLRAVATDGHRLALAEMPAPEGAVGIPGVIVPRKTIAEARRLMESAGETVDLQVSPQKVRFEFGAAALTS
KVIDGAFPDYMRVIPRDNAKILTLDNDLFAKAVDRVATISAEKSRSVKLAVEPGRITLTVRNMEAGQAVEEVEVDYDGEP
FEIGFNARYLLDVCGQIAGPQAEFRFADPASPTLVVDPVDPGVKYVLMPLRV
>B8GXP6 ~~~dnaN~~~Beta sliding clamp~~~
MKLTIERAALLKALGHVQSVVERRNTIPILSNILLSAEGDRLSFSATDLDMEIIDEGFAQIDVPGQITAPAHTLYEIVRK
LPDGADVSLSFSGDDPRLVIQAGRSRFNLPVLPAGDFPVMSSDGLSSRIAVDTNELIRLIDKTRFAISTEETRYYLNGLY
VHTVNEGGETKLRAVATDGHRLALAEMPAPEGAVGIPGVIVPRKTIAEARRLMESAGETVDLQVSPQKVRFEFGAAALTS
KVIDGAFPDYMRVIPRDNAKILTLDNDLFAKAVDRVATISAEKSRSVKLAVEPGRITLTVRNMEAGQAVEEVEVDYDGEP
FEIGFNARYLLDVCGQIAGPQAEFRFADPASPTLVVDPVDPGVKYVLMPLRV
>P0A988 ~~~dnaN~~~Beta sliding clamp~~~COG0592
MKFTVEREHLLKPLQQVSGPLGGRPTLPILGNLLLQVADGTLSLTGTDLEMEMVARVALVQPHEPGATTVPARKFFDICR
GLPEGAEIAVQLEGERMLVRSGRSRFSLSTLPAADFPNLDDWQSEVEFTLPQATMKRLIEATQFSMAHQDVRYYLNGMLF
ETEGEELRTVATDGHRLAVCSMPIGQSLPSHSVIVPRKGVIELMRMLDGGDNPLRVQIGSNNIRAHVGDFIFTSKLVDGR
FPDYRRVLPKNPDKHLEAGCDLLKQAFARAAILSNEKFRGVRLYVSENQLKITANNPEQEEAEEILDVTYSGAEMEIGFN
VSYVLDVLNALKCENVRMMLTDSVSSVQIEDAASQSAAYVVMPMRL
>O25242 ~~~dnaN~~~Beta sliding clamp~~~COG0592
MKISVSKNDLENALRYLQAFLDKKDASSIASHIHLEVIKEKLFLKASDSDIGLKSYIFTQSSDKEGVGTINGKKFLDIIS
CLKDSNIILETKDDSLAIKQNKSSFKLPMFDADEFPEFPVIDPKVSIEVNAPFLVDAFKKIAPVIEQTSHKRELAGILMQ
FDQKHQTLSVVGTDTKRLSYTQLEKISIHSTEEDISCILPKRALLEILKLFYENFSFKSDGMLAVIENEMHTFFTKLIDG
NYPDYQKILPKEYISSFTLGKEEFKESIKLCSSLSSTIKLTLEKNNALFESLDSEHSETAKTSVEIEKGLDIEKAFHLGV
NAKFFLEALNALGTTQFVLRCNEPSSPFLIQESLDEKQSHLNAKISTLMMPITL
>A0QND6 ~~~dnaN~~~Beta sliding clamp~~~COG0592
MATTTAGLTDLKFRVVREDFADAVAWVARSLPTRPTIPVLAGVLLTGTDEGLTISGFDYEVSAEVKVSAEIASAGSVLVS
GRLLSDITKALPAKPVEVSVEGTRVSLTCGSARFSLPTLAVEDYPALPALPEETGVIASDLFAEAIGQVAVAAGRDDTLP
MLTGIRVEISGESVVLAATDRFRLAVRELTWVTTAGDVEAAVLVPAKTLAEAAKAGTDGNQVHLALGSGASVGKDGLLGI
RSEGKRSTTRLLDAEFPKFRQLLPAEHTAVATIGVAELTEAIKRVALVADRGAQIRMEFSDDTLKLSAGADDVGRAEEDL
PVDFAGEPLTIAFNPTYLTDGLGSLHSERVTFGFTTPSRPAVLRPAGEDDGANGGSGPFPAAKTDYVYLLMPVRLPG
>P9WNU0 ~~~dnaN~~~Beta sliding clamp~~~
MDAATTRVGLTDLTFRLLRESFADAVSWVAKNLPARPAVPVLSGVLLTGSDNGLTISGFDYEVSAEAQVGAEIVSPGSVL
VSGRLLSDITRALPNKPVDVHVEGNRVALTCGNARFSLPTMPVEDYPTLPTLPEETGLLPAELFAEAISQVAIAAGRDDT
LPMLTGIRVEILGETVVLAATDRFRLAVRELKWSASSPDIEAAVLVPAKTLAEAAKAGIGGSDVRLSLGTGPGVGKDGLL
GISGNGKRSTTRLLDAEFPKFRQLLPTEHTAVATMDVAELIEAIKLVALVADRGAQVRMEFADGSVRLSAGADDVGRAEE
DLVVDYAGEPLTIAFNPTYLTDGLSSLRSERVSFGFTTAGKPALLRPVSGDDRPVAGLNGNGPFPAVSTDYVYLLMPVRL
PG
>P9WNU1 ~~~dnaN~~~Beta sliding clamp~~~COG0592
MDAATTRVGLTDLTFRLLRESFADAVSWVAKNLPARPAVPVLSGVLLTGSDNGLTISGFDYEVSAEAQVGAEIVSPGSVL
VSGRLLSDITRALPNKPVDVHVEGNRVALTCGNARFSLPTMPVEDYPTLPTLPEETGLLPAELFAEAISQVAIAAGRDDT
LPMLTGIRVEILGETVVLAATDRFRLAVRELKWSASSPDIEAAVLVPAKTLAEAAKAGIGGSDVRLSLGTGPGVGKDGLL
GISGNGKRSTTRLLDAEFPKFRQLLPTEHTAVATMDVAELIEAIKLVALVADRGAQVRMEFADGSVRLSAGADDVGRAEE
DLVVDYAGEPLTIAFNPTYLTDGLSSLRSERVSFGFTTAGKPALLRPVSGDDRPVAGLNGNGPFPAVSTDYVYLLMPVRL
PG
>Q9I7C4 ~~~dnaN~~~Beta sliding clamp~~~
MHFTIQREALLKPLQLVAGVVERRQTLPVLSNVLLVVEGQQLSLTGTDLEVELVGRVVLEDAAEPGEITVPARKLMDICK
SLPNDVLIDIRVEEQKLLVKAGRSRFTLSTLPANDFPTVEEGPGSLNFSIAQSKLRRLIDRTSFAMAQQDVRYYLNGMLL
EVNGGTLRSVATDGHRLAMCSLDAQIPSQDRHQVIVPRKGILELARLLTEQDGEVGIVLGQHHIRATTGEFTFTSKLVDG
KFPDYERVLPRGGDKLVVGDRQQLREAFSRTAILSNEKYRGIRLQLSNGLLKIQANNPEQEEAEEEVQVEYNGGNLEIGF
NVSYLLDVLGVIGTEQVRFILSDSNSSALVHEADNDDSAYVVMPMRL
>Q1RIS7 ~~~dnaN~~~Beta sliding clamp~~~COG0592
MLKVIVETKTLVQALGFASSVVEKRNIISELANIKLLAKDGLLELSSTNMDLYLSQKIGVQVVSEGELTVSTKTLNDIVK
KLPDSELTLTDLGTTGLEITGKNCRFNLFTLPVESFPVMDNINPEASFKISCAEFAKIIESTKFSVSLDETRYNLNGIYL
HVKDSEFYAASTDGHRLSVSSVVLAEKIEDFGVILPQKSAEEILKIVKDSKNANADIEILLSSNKIKFICNENVIMLSKL
IDGTFPDYSSFIPENSSSKLVINRKIFADTIERIAIITVEKFRAVKLSLSGEALEISAIGEARGNAKEVINSSKETENFY
EYSGETNLDIGFNPQYLEDVLKAIKSDLVELYFSSVSAPVLIKFPESPKDIFVVMPVKV
>Q92I37 ~~~dnaN~~~Beta sliding clamp~~~
MLKLIVETKTLVQSLGFASSVVEKRNVIPEYANIKLSAKDGNLELSSTNMDLYLSQKIAVQVVSEGECTVSTKTLNDIVR
KLPDSELTLTDLGTTGLEIKGKNCKFNLFTLPVSSFPAMDSINPEASFKISCTDFAKIIESTKFSISLDETRYNLNGVYL
HIKDKEFCSASTDGHRLSISWVTLEKQIKNFGVILPQKSAEEILKIVKDPKNINEDIEILLSSNKIKFICNENTSMLSKL
IDGTFPDYSTFIPESSSSKLVINRKMFADSIERIAIITVEKFRAVKLSLSRETLEISAVGEARGNAKEVINSSQDKESFY
EYNSDESLAIGFNPQYLEDVLKAVKSDVVELYFSDVSAPVLIKFPENPKDIFVVMPVKV
>Q68WW0 ~~~dnaN~~~Beta sliding clamp~~~COG0592
MLKLIVETKTLVQSLGFARSVVEKRNVIPEYANIKLSAKDGNLELSSTNMDLYLSQKIAVQVLSEGEITVSTQTLSDIVR
KLPDSELTLTELDIMKLEIKGQNCQFNLFTLPVSSFPAMDSIKPEVSFKISCADFAKIIESTKFSISLDETRYNLNGIYL
HIKDKEFFAASTDGHRLSISWITLEEKIKNFGVILPQKSAEEILKIVKDLKNIHEDIEILLSSNKIKFICNENTILLSKL
IDGTFPDYSAFIPKSSISKLVINRKIFADSIERIAIITVEKFRAIKLSLSRKTLEISAVGEARGNAKEIITASQDKESFY
EYNCDESLVIGFNPQYLEDVLKAVKSNLVELYFSDISASAPVLIKFPQNPKDIFVIMPVKV
>P99103 ~~~dnaN~~~Beta sliding clamp~~~
MMEFTIKRDYFITQLNDTLKAISPRTTLPILTGIKIDAKEHEVILTGSDSEISIEITIPKTVDGEDIVNISETGSVVLPG
RFFVDIIKKLPGKDVKLSTNEQFQTLITSGHSEFNLSGLDPDQYPLLPQVSRDDAIQLSVKVLKNVIAQTNFAVSTSETR
PVLTGVNWLIQENELICTATDSHRLAVRKLQLEDVSENKNVIIPGKALAELNKIMSDNEEDIDIFFASNQVLFKVGNVNF
ISRLLEGHYPDTTRLFPENYEIKLSIDNGEFYHAIDRASLLAREGGNNVIKLSTGDDVVELSSTSPEIGTVKEEVDANDV
EGGSLKISFNSKYMMDALKAIDNDEVEVEFFGTMKPFILKPKGDDSVTQLILPIRTY
>P0A024 ~~~dnaN~~~Beta sliding clamp~~~
MMEFTIKRDYFITQLNDTLKAISPRTTLPILTGIKIDAKEHEVILTGSDSEISIEITIPKTVDGEDIVNISETGSVVLPG
RFFVDIIKKLPGKDVKLSTNEQFQTLITSGHSEFNLSGLDPDQYPLLPQVSRDDAIQLSVKVLKNVIAQTNFAVSTSETR
PVLTGVNWLIQENELICTATDSHRLAVRKLQLEDVSENKNVIIPGKALAELNKIMSDNEEDIDIFFASNQVLFKVGNVNF
ISRLLEGHYPDTTRLFPENYEIKLSIDNGEFYHAIDRASLLAREGGNNVIKLSTGDDVVELSSTSPEIGTVKEEVDANDV
EGGSLKISFNSKYMMDALKAIDNDEVEVEFFGTMKPFILKPKGDDSVTQLILPIRTY
>O06672 ~~~dnaN~~~Beta sliding clamp~~~COG0592
MIHFSINKNLFLQALNTTKRAISSKNAIPILSTVKIDVTNEGITLIGSNGQISIENFISQKNEDAGLLITSLGSILLEAS
FFINVVSSLPDVTLDFKEIEQNQIVLTSGKSEITLKGKDSEQYPRIQEISASTPLILETKLLKKIINETAFAASTQESRP
ILTGVHFVLSQHKELKTVATDSHRLSQKKLTLEKNSDDFDVVIPSRSLREFSAVFTDDIETVEIFFANNQILFRSENISF
YTRLLEGNYPDTDRLIPTDFNTTITFNVVNLRQSMERARLLSSATQNGTVKLEIKDGVVSAHVHSPEVGKVNEEIDTDQV
TGEDLTISFNPTYLIDSLKALNSEKVTISFISAVRPFTLVPADTDEDFMQLITPVRTN
>P03007 2.7.7.7~~~dnaQ~~~DNA polymerase III subunit epsilon~~~COG0847
MSTAITRQIVLDTETTGMNQIGAHYEGHKIIEIGAVEVVNRRLTGNNFHVYLKPDRLVDPEAFGVHGIADEFLLDKPTFA
EVADEFMDYIRGAELVIHNAAFDIGFMDYEFSLLKRDIPKTNTFCKVTDSLAVARKMFPGKRNSLDALCARYEIDNSKRT
LHGALLDAQILAEVYLAMTGGQTSMAFAMEGETQQQQGEATIQRIVRQASKLRVVFATDEEIAAHEARLDLVQKKGGSCL
WRA
>P06710 2.7.7.7~~~dnaX~~~DNA polymerase III subunit tau~~~COG2812
MSYQVLARKWRPQTFADVVGQEHVLTALANGLSLGRIHHAYLFSGTRGVGKTSIARLLAKGLNCETGITATPCGVCDNCR
EIEQGRFVDLIEIDAASRTKVEDTRDLLDNVQYAPARGRFKVYLIDEVHMLSRHSFNALLKTLEEPPEHVKFLLATTDPQ
KLPVTILSRCLQFHLKALDVEQIRHQLEHILNEEHIAHEPRALQLLARAAEGSLRDALSLTDQAIASGDGQVSTQAVSAM
LGTLDDDQALSLVEAMVEANGERVMALINEAAARGIEWEALLVEMLGLLHRIAMVQLSPAALGNDMAAIELRMRELARTI
PPTDIQLYYQTLLIGRKELPYAPDRRMGVEMTLLRALAFHPRMPLPEPEVPRQSFAPVAPTAVMTPTQVPPQPQSAPQQA
PTVPLPETTSQVLAARQQLQRVQGATKAKKSEPAAATRARPVNNAALERLASVTDRVQARPVPSALEKAPAKKEAYRWKA
TTPVMQQKEVVATPKALKKALEHEKTPELAAKLAAEAIERDPWAAQVSQLSLPKLVEQVALNAWKEESDNAVCLHLRSSQ
RHLNNRGAQQKLAEALSMLKGSTVELTIVEDDNPAVRTPLEWRQAIYEEKLAQARESIIADNNIQTLRRFFDAELDEESI
RPI
>P9WNT9 2.7.7.7~~~dnaX~~~DNA polymerase III subunit gamma/tau~~~COG2812
MALYRKYRPASFAEVVGQEHVTAPLSVALDAGRINHAYLFSGPRGCGKTSSARILARSLNCAQGPTANPCGVCESCVSLA
PNAPGSIDVVELDAASHGGVDDTRELRDRAFYAPVQSRYRVFIVDEAHMVTTAGFNALLKIVEEPPEHLIFIFATTEPEK
VLPTIRSRTHHYPFRLLPPRTMRALLARICEQEGVVVDDAVYPLVIRAGGGSPRDTLSVLDQLLAGAADTHVTYTRALGL
LGVTDVALIDDAVDALAACDAAALFGAIESVIDGGHDPRRFATDLLERFRDLIVLQSVPDAASRGVVDAPEDALDRMREQ
AARIGRATLTRYAEVVQAGLGEMRGATAPRLLLEVVCARLLLPSASDAESALLQRVERIETRLDMSIPAPQAVPRPSAAA
AEPKHQPAREPRPVLAPTPASSEPTVAAVRSMWPTVRDKVRLRSRTTEVMLAGATVRALEDNTLVLTHESAPLARRLSEQ
RNADVLAEALKDALGVNWRVRCETGEPAAAASPVGGGANVATAKAVNPAPTANSTQRDEEEHMLAEAGRGDPSPRRDPEE
VALELLQNELGARRIDNA
>P13267 2.7.7.7~~~polC~~~DNA polymerase III PolC-type~~~COG2176
MEQLSVNRRQFQILLQQINMTDDTFMTYFEHGEIKKLTIHKASKSWHFHFQFKSLLPFQIYDTLTTRLTQSFAHIAKVTS
SIEVQDAEVSESIVQDYWSRCIEELQGISPPIISLLNQQKPKLKGNKLIVKTKTDTEAAALKNKYSSMIQAEYRQFGFPD
LQLDAEIFVSEQEVQKFREQKLAEDQERAMQALIEMEKKDKESDEDQAPSGPLVIGYQIKDNEEIRTLDSIMDEERRITV
QGYVFDVETRELKSGRTLCIFKITDYTNSILIKMFAREKEDAALMKSLKKGMWVKARGSIQNDTFVRDLVMIANDVNEIK
AKTREDSAPEGEKRVELHLHSPMSQMDAVTGIGKLVEQAKKWGHEAIALTDHAVVQSFPDAYSAAKKHGIKMIYGMEANL
VDDGVPIAYNAAHRLLEEETYVVFDVETTGLSAVYDTIIELAAVKVKGGEIIDKFEAFANPHRPLSATIIELTGITDDML
QDAPDVVDVIRDFREWIGDDILVAHNASFDMGFLNVAYKKLLEVEKAKNPVIDTLELGRFLYPEFKNHRLNTLCKKFDIE
LTQHHRAIYDTEATAYLLLKMLKDAAEKGIQYHDELNENMGQSNAYQRSRPYHATLLAVNSTGLKNLFKLVSLSHIHYFY
RVPRIPRSQLEKYREGLLIGSACDRGEVFEGMMQKSPEEVEDIASFYDYLEVQPPEVYRHLLELELVRDEKALKEIIANI
TKLGEKLNKPVVATGNVHYLNDEDKIYRKILISSQGGANPLNRHELPKVHFRTTDEMLEAFSFLGEEKAKEIVVTNTQKV
ASLVDDIKPIKDDLYTPKIEGADEEIREMSYQRARSIYGEELPEIVEARIEKELKSIIGHGFAVIYLISHKLVKRSLDDG
YLVGSRGSVGSSLVATLTEITEVNPLPPHYVCPECQHSEFFNDGSVGSGFDLPDKTCPHCGTPLKKDGHDIPFETFLGFK
GDKVPDIDLNFSGEYQPQAHNYTKVLFGEDNVYRAGTIGTVAEKTAYGYVKGYAGDNNLHMRGAEIDRLVQGCTGVKRTT
GQHPGGIIVVPDYMDIYDFSPIQFPADATGSEWKTTHFDFHSIHDNLLKLDILGHDDPTVIRMLQDLSGIDPKTIPTDDP
EVMKIFQGTESLGVTEEQIGCKTGTLGIPEFGTRFVRQMLEDTKPTTFSELVQISGLSHGTDVWLGNAQELIHNNICELS
EVIGCRDDIMVYLIYQGLEPSLAFKIMEFVRKGKGLTPEWEEEMKNNNVPDWYIDSCKKIKYMFPKAHAAAYVLMAVRIA
YFKVHHALLYYAAYFTVRADDFDIDTMIKGSTAIRAVMEDINAKGLDASPKEKNLLTVLELALEMCERGYSFQKVDLYRS
SATEFIIDGNSLIPPFNSIPGLGTNAALNIVKAREEGEFLSKEDLQKRGKVSKTILEYLDRHGCLESLPDQNQLSLF
>P75080 2.7.7.7~~~polC~~~DNA polymerase III PolC-type~~~
MVFDTETEKGKRIWALSQFLVKKNILDHNELHQLNNRIELVYLENDRENALFLVALTLKKPLTIDIWNALYEGFQDAEGA
ELRITFQEDATFFKDGSTKSSVTLAIIKDYFKSFFGKDKKYRILLEQELTHPNFLSYSNHELKANCQSQELDQWLIEQRQ
AFIKWMHQAGFTHFGFVSLFNPPAEKQLKVKSMKVSKYDKQFETEVFSTEFVPIHKINQQMDEIKLMGQIFELKDFPGYN
NLRNTLNIYVTDFQLGGSLILKWFYKDPKTIEGIKIGTWVKATVKVERDAKTQLLQGIIKEISPIETPAYYRRPDQDKQK
RVELVFHTKMSAFDGINSVQEYAQFAKERDWKTIAVTDKDNIHIYPTLYEVAKKYGLKAIYGLECNLIDDHIKIVSNPDK
TKLKDATFVIFDIETTGLHGRYDSVIEFAGIKVKHNREVERMQFFLKIDGPLPAAVTEITKITQAQLEDGMEQQAGLEKL
RAWLDGCVMVAHNGLSFDLPFLQTQFEKYNIAPLTNPLIDTLALSWALNPGFASHTLSNICAKLKFDFDDERLHRADYDT
QALKKVFDYFKEQVELMGITNLEQLDQELNQQCHFELLKRTFTNTGIIYIKSQSGFAKLYELLSIALTDNNATRPLVLTS
TLQKFAKSFVITDNPVQGDIFKAALTKPLKELEAAIKRVDFVLIAPPGAYAGYTIREGLKKEAIPNAIKLVVDTAQRLNK
LVAVASDAYFIHPWENEYYKAIVCAKGLGGRWHRHFNYKEREQRVPNVFVRTTGEMLNEMSFLGEQLAYELVVENTNKLA
KQLTADDLVPVQTKLQPPVIEGSNENLAAKTWSQAKAIYGDPLPKLIEQRIQEELKAIIDNGFGIIYWISHLLVKQSVQD
GYFVGPRGSIGSSLVANLIGISEINPLVPHYLCESCQYFEVNEEVDDGYDLMVRDCPKCGAKAAFKGDGHNIPFATFMGF
AGDKIPDIDLNFSSEYQAKAHAYVRELFGEQYTFRAGTIATVAEKTAYGYARNYFEIIKQTELATAPEIERFKQKLVGIK
RTTGQHPGGIMIFPNHKSVYEFTPCGYPADDTSSDWKTTHFEYDALGNTILKLDILGQDDPTMLKHLGDLTHVNPQNIPR
FDKKLTEMFWSVNPLKLKPHYLDEPTGAIGIPEFGTKFVRKILEQTKPKGFGDLIRVSGLSHGKNVWADNAQKILKDQNL
SLKDVIACRDDIMLYLIHKGMQAKDAFEIMEKVRKGIALNAKEVQLMQSNGVEQHWINSCLKISYLFPKAHAAAYVLMAW
RIAWFKLYHPLSYYACLLSFKLKEHDVSGFKSGVSFVKQKLEELNTLYRIKRIKPKEAELLTSYEVYLEMMARGIKLEQI
SLTHSHATRFVEHNGMLIAPFITIPGMGEAVANSIIEARNEKPFSSLDDFKKRTKITKKHIEAFTQMQLLDEFREQDNQK
KLF
>P63982 2.7.7.7~~~polC~~~DNA polymerase III PolC-type~~~
MAMTEQQKFKVLADQIKISNQLDAEILNSGELTRIDVSNKNRTWEFHITLPQFLAHEDYLLFINAIEQEFKDIANVTCRF
TVTNGTNQDEHAIKYFGHCIDQTALSPKVKGQLKQKKLIMSGKVLKVMVSNDIERNHFDKACNGSLIKAFRNCGFDIDKI
IFETNDNDQEQNLASLEAHIQEEDEQSARLATEKLEKMKAEKAKQQDNNESAVDKCQIGKPIQIENIKPIESIIEEEFKV
AIEGVIFDINLKELKSGRHIVEIKVTDYTDSLVLKMFTRKNKDDLEHFKALSVGKWVRAQGRIEEDTFIRDLVMMMSDIE
EIKKATKKDKAEEKRVEFHLHTAMSQMDGIPNIGAYVKQAADWGHPAIAVTDHNVVQAFPDAHAAAEKHGIKMIYGMEGM
LVDDGVPIAYKPQDVVLKDATYVVFDVETTGLSNQYDKIIELAAVKVHNGEIIDKFERFSNPHERLSETIINLTHITDDM
LVDAPEIEEVLTEFKEWVGDAIFVAHNASFDMGFIDTGYERLGFGPSTNGVIDTLELSRTINTEYGKHGLNFLAKKYGVE
LTQHHRAIYDTEATAYIFIKMVQQMKELGVLNHNEINKKLSNEDAYKRARPSHVTLIVQNQQGLKNLFKIVSASLVKYFY
RTPRIPRSLLDEYREGLLVGTACDEGELFTAVMQKDQSQVEKIAKYYDFIEIQPPALYQDLIDRELIRDTETLHEIYQRL
IHAGDTAGIPVIATGNAHYLFEHDGIARKILIASQPGNPLNRSTLPEAHFRTTDEMLNEFHFLGEEKAHEIVVKNTNELA
DRIERVVPIKDELYTPRMEGANEEIRELSYTNARKLYGEDLPQIVIDRLEKELKSIIGNGFAVIYLISQRLVKKSLDDGY
LVGSRGSVGSSFVATMTEITEVNPLPPHYICPNCKTSEFFNDGSVGSGFDLPDKTCETCGAPLIKEGQDIPFETFLGFKG
DKVPDIDLNFSGEYQPNAHNYTKVLFGEDKVFRAGTIGTVAEKTAFGYVKGYLNDQGIHKRGAEIDRLVKGCTGVKRTTG
QHPGGIIVVPDYMDIYDFTPIQYPADDQNSAWMTTHFDFHSIHDNVLKLDILGHDDPTMIRMLQDLSGIDPKTIPVDDKE
VMQIFSTPESLGVTEDEILCKTGTFGVPEFGTGFVRQMLEDTKPTTFSELVQISGLSHGTDVWLGNAQELIKTGICDLSS
VIGCRDDIMVYLMYAGLEPSMAFKIMESVRKGKGLTEEMIETMKENEVPDWYLDSCLKIKYMFPKAHAAAYVLMAVRIAY
FKVHHPLYYYASYFTIRASDFDLITMIKDKTSIRNTVKDMYSRYMDLGKKEKDVLTVLEIMNEMAHRGYRMQPISLEKSQ
AFEFIIEGDTLIPPFISVPGLGENVAKRIVEARDDGPFLSKEDLNKKAGLSQKIIEYLDELGSLPNLPDKAQLSIFDM
>Q9ZHF6 2.7.7.7~~~polC~~~DNA polymerase III PolC-type~~~COG2176
MKKIENLKWKNVSFKSLEIDPDAGVVLVSVEKFSEEIEDLVRLLEKKTRFRVIVNGVQKSNGDLRGKILSLLNGNVPYIK
DVVFEGNRLILKVLGDFARDRIASKLRSTKKQLDELLPPGTEIMLEVVEPPEDLLKKEVPQPEKREEPKGEELKIEDENH
IFGQKPRKIVFTPSKIFEYNKKTSVKGKIFKIEKIEGKRTVLLIYLTDGEDSLICKVFNDVEKVEGKVSVGDVIVATGDL
LLENGEPTLYVKGITKLPEAKRMDKSPVKRVELHAHTKFSDQDAITDVNEYVKRAKEWGFPAIALTDHGNVQAIPYFYDA
AKEAGIKPIFGIEAYLVSDVEPVIRNLSDDSTFGDATFVVLDFETTGLDPQVDEIIEIGAVKIQGGQIVDEYHTLIKPSR
EISRKSSEITGITQEMLENKRSIEEVLPEFLGFLEDSIIVAHNANFDYRFLRLWIKKVMGLDWERPYIDTLALAKSLLKL
RSYSLDSVVEKLGLGPFRHHRALDDARVTAQVFLRFVEMMKKIGITKLSEMEKLKDTIDYTALKPFHCTILVQNKKGLKN
LYKLVSDSYIKYFYGVPRILKSELIENREGLLVGSACISGELGRAALEGASDSELEEIAKFYDYIEVMPLDVIAEDEEDL
DRERLKEVYRKLYRIAKKLNKFVVMTGDVHFLDPEDARGRAALLAPQGNRNFENQPALYLRTTEEMLEKAIEIFEDEEIA
REVVIENPNRIADMIEEVQPLEKKLHPPIIENADEIVRNLTMKRAYEIYGDPLPEIVQKRVEKELNAIINHGYAVLYLIA
QELVQKSMSDGYVVGSRGSVGSSLVANLLGITEVNPLPPHYRCPECKYFEVVEDDRYGAGYDLPNKNCPRCGAPLRKDGH
GIPFETFMGFEGDKVPDIDLNFSGEYQERAHRFVEELFGKDHVYRAGTINTIAERSAVGYVRSYEEKTGKKLRKAEMERL
VSMITGVKRTTGQHPGGLMIIPKDKEVYDFTPIQYPANDRNAGVFTTHFAYETIHDDLVKIDALGHDDPTFIKMLKDLTG
IDPMTIPMDDPDTLAIFSSVKPLGVDPVELESDVGTYGIPEFGTEFVRGMLVETRPKSFAELVRISGLSHGTDVWLNNAR
DWINLGYAKLSEVISCRDDIMNFLIHKGMEPSLAFKIMENVRKGKGITEEMESEMRRLKVPEWFIESCKRIKYLFPKAHA
VAYVSMAFRIAYFKVHYPLQFYAAYFTIKGDQFDPVLVLRGKEAIKRRLRELKAMPAKDAQKKNEVSVLEVALEMILRGF
SFLPPDIFKSDAKKFLIEGNSLRIPFNKLPGLGDSVAESIIRAREEKPFTSVEDLMKRTKVNKNHIELMKSLGVLGDLPE
TEQFTLF
>P9WNT3 2.7.7.7~~~dinB1~~~DNA polymerase IV 1~~~COG0389
MESRWVLHLDMDAFFASVEQLTRPTLRGRPVLVGGLGGRGVVAGASYEARAYGARSAMPMHQARRLIGVTAVVLPPRGVV
YGIASRRVFDTVRGLVPVVEQLSFDEAFAEPPQLAGAVAEDVETFCERLRRRVRDETGLIASVGAGSGKQIAKIASGLAK
PDGIRVVRHAEEQALLSGLPVRRLWGIGPVAEEKLHRLGIETIGQLAALSDAEAANILGATIGPALHRLARGIDDRPVVE
RAEAKQISAESTFAVDLTTMEQLHEAIDSIAEHAHQRLLRDGRGARTITVKLKKSDMSTLTRSATMPYPTTDAGALFTVA
RRLLPDPLQIGPIRLLGVGFSGLSDIRQESLFADSDLTQETAAAHYVETPGAVVPAAHDATMWRVGDDVAHPELGHGWVQ
GAGHGVVTVRFETRGSGPGSARTFPVDTGDISNASPLDSLDWPDYIGQLSVEGSAGASAPTVDDVGDR
>Q47155 2.7.7.7~~~dinB~~~DNA polymerase IV~~~COG0389
MRKIIHVDMDCFFAAVEMRDNPALRDIPIAIGGSRERRGVISTANYPARKFGVRSAMPTGMALKLCPHLTLLPGRFDAYK
EASNHIREIFSRYTSRIEPLSLDEAYLDVTDSVHCHGSATLIAQEIRQTIFNELQLTASAGVAPVKFLAKIASDMNKPNG
QFVITPAEVPAFLQTLPLAKIPGVGKVSAAKLEAMGLRTCGDVQKCDLVMLLKRFGKFGRILWERSQGIDERDVNSERLR
KSVGVERTMAEDIHHWSECEAIIERLYPELERRLAKVKPDLLIARQGVKLKFDDFQQTTQEHVWPRLNKADLIATARKTW
DERRGGRGVRLVGLHVTLLDPQMERQLVLGL
>C2M741 3.4.14.-~~~~~~Asp/Glu-specific dipeptidyl-peptidase~~~COG4717
MKRFFKMALFLGVSALYGQQGGMWIPSLLEGMNAKEMKTLGMKMTVADIYSVNKSSLKDAAPHFNGGCSSEVISDKGLLL
TNHHCGYGQIQAHSTLQNDYLANGFWAKSLAEELPNKNLKVTFMVRIDDVTKQVLKGTESITDETEKAKLIEKNIAEVIK
TAPKEAWQENSVKAFYDGNQYILFVTETFKDVRLVGAPPSSIGKFGSDTDNWVWPRHTGDFSLFRIYADKNNRPAEYSKD
NVPYKPKHFFPISLKGVKEGDFVLVFGYPGTTQEYLPSAAVAQIENVINPARIGIRDIVLKVQDSYMRKDQGIKIKYAAK
YARVANYWKKWMGETKGLKKSGAVALKQQQEAKFQQAIQKANKQAQYGNLLSDFNRLYKEIEPYTLAANLNSEFIFRNID
LLSNGSRLLQLEKALEDKGEQSFNDRKKNLLNTFKEIFKDNDKQVDKDVFEKVVVFYAANMPKNLLINSLKNFDAKKLAD
NLYNNSFLTSLSGVESVLNLSAAEFKERMKNDVGIQFVRELKEMNDTQVFPVYDRLNTQIHALQRTYMKAILEFSKPSDR
IFPDANGTLRVTYGKVAGFSPADAVTYSAHTTLDGVMEKYVPGDYEFDVPQHLRDLQAKKDFGRYGDKNGKMPLCFLSTC
HTTGGNSGSPAIDANGNLIGLNFDRVWEGTMSDIHYDPKLCRNIMVDIRYVLFVIDKYAGAGHLVNEMKLIK
>A6GWM2 3.4.14.-~~~~~~Asp/Glu-specific dipeptidyl-peptidase~~~COG4717
MKYLKLFLLLFIIQTQAQQGGMWIPSLLSGMNETEMKNLGMKISADDIYSVNHSSLKDAVPHFNGGCTSEVISPKGLILT
NHHCGFDAIQNHSSVDHDYLTNGFWAMKMEDELPNENLVVTFIVSINDVTAQVLDGVASITSETEKQNKIQENITKVTAS
FAKEAWQENKVRTFFEGNQYILFVTEVFKDVRLVGAPPSLIGKFGSDTDNWVWPRHTGDFSMFRVYANKNNHPAAYSKDN
VPYIPKHFLPVSLDGVQEDDFTMVMGYPGKTQEYLPSFAVAQIVNETNPAKIEIREAALKVQDGFMRKDNAIKIQYASKY
AGVANYWKKWIGESQGLKKSNAIGLKQNFEKDFQQKVIAAGKQNEYGNLLADFQKYYTEITPYAVSRDYFNEVVVKNTEL
LSLGYKLYQLEQVFITKGEQAFNDRKENLIKSQADFFKDFNATVDEKVFEQLVALYATKAPKEFLPISLLNVEYKKFAPS
IYSKSKLVDYANFKALLSGDAKAVLKKISLDKGYAFVKSLADNYSKNIAPRYDEINLKINALQRIYMKAQLELYPNSRIF
PDANSTLRVTYGKVKGYSPKDAIYYNPTTYLDGAIEKYIPGDYEFDVPKKLIDLYNNKDYGQYGENGKLPVCFIGTNHTT
GGNSGSPAVDAQGNLIGLNFDRVWEGTMSDIHYDPSICRNVMVDMRYVLFIVDKFAGAKHLINEMKLVHPKKK
>F8WQK8 3.4.14.-~~~~~~Asp/Glu-specific dipeptidyl-peptidase~~~COG3591
MNKRFFPTLLLAFVCSTLAYADGGMWLMQQINGQVARMKSLGMQLEAADIYNPNGSSLKDAVVMFDGGCTGVLVSNQGLL
LTNHHCGYDQIQKHSSVQHNYLKDGFWSYSLAEELVNPGLEVEIVDEITDVTAAVKKELERIKKPSGLEFLSPRYLSSLA
PEIVGKKAASRPGYRYEIKAFYGGNRYYMFTKKVFRDVRLVAAPPSSIGKFGSDTDNWAWPRHTGDFSIFRLYADKNGNP
AEYSKDNVPYRPKRWVKVNAQGVKEGDFALIMGYPGTTYKFFTADEVTEWSEIDNNIRIEMRGILQDVMLREMLADPKIN
IMYAAKYASSQNGYKRAQGANWAIRRRSLREIKLAQQQEVLAWAKQKGIATTEEAVRAISKAIEGRQDLRMRQRYLLEGI
LMGIEMSNAPAADSDIADHWDDPARREAGLQSIRKQFEAFFNKDYSPEVEKDQLAIALLTRYAERIPAEKQPISIREGIA
EYGSAKAYVEMIFDKSIYASRERFEEFMKNPDRDRLLRDPMSRFAASVAYEHQKLAKEVAAFDAPLAAAQRSYVASVLDM
KGQPNLAPDANLTLRFTYGEIKGYQPRDVVTYGAKSTLEGVMEKEDPNNWEYVVDPKLKALYEAKNYGRYANSDGSMPVN
FCATTHTTGGNSGSPVMNARGELIGLNFDRNWEGVGGDIEYLPNYQRSIILDIRYLLFIIDKFAGCQRLIDEIQPQF
>B2RID1 3.4.14.-~~~~~~Asp/Glu-specific dipeptidyl-peptidase~~~COG3591
MKKRLLLPLFAALCLSQIAHADEGMWLMQQLGRKYAQMKERGLKMKEYDLYNPNGTSLKDAVVLFDGGCTGEVVSDRGLV
LTNHHCGYDMIQAHSTLEHNYLENGFWAMREADELPNKDISVVFIDKIEDVTDYVKKELKAIKDPNSMDYLSPKYLQKLA
DKKAGKNFSAKNPGLSVEIKAFYGGNLYLMFTKKTYTDVRLVGAPPSSIGKFGADTDNWIWPRHTGDFSIFRIYADKNGN
PAPYSEDNVPLKPKRFFNISLGGVQENDYAMIMGFPGTTHRYFTASEVDEWKSIDNDIRIRMRDIRQGVMLREMLADPQI
KIMYSAKYAASQNAYKRAIGANWAIKTRGLRQNKQAMQDRLIAWGAKQGTPRYEEAVHEIDATVAKRADLRRRYWMIEEG
IIRGIEFARSPIPTEDETKALQGNDASARKEAIDKIRTRYSKFANKDYSAEVDKKVAVAMLTEYLKEIPYENLPLHLRLV
KDRFAGDVQAYVDDIFARSVFGSEAQFDAFAAVPSVEKLAEDPMVLFASSVFDEYRKLYNELRPYDDPILRAQRTYIAGL
LEMDGDQDQFPDANLTLRFTYGQVKGYSPRDNVYYGHQTTLDGVMEKEDPDNWEFVVDPKLKAVYERKDFGRYADRSGRM
PVAFCATTHTTGGNSGSPVMNANGELIGLNFDRNWEGVGGDIQYLADYQRSIIVDIRYVLLVIDKVGGCQRLLDEMNIVP
>A4Y3F4 3.4.14.-~~~~~~Asp/Glu-specific dipeptidyl-peptidase~~~COG3591
MRIALVATLVLTSGIANADEGQWQPYQMPSIADKLSERGIDIPAEKLADLTSYPMNAVVGLGYCTASFVSPQGLVVTNHH
CAYKAIQYNTKQEHNYLEQGFLATSMDKEPSAGPNERLYITEAVTDVTKDVTKELSQDPLTRYEEIQNNSKALIKNCEVD
DNYRCNVRSFHNGLEYYLIKQLMIRDVRLVYAPPESVGGYGGDIDNYEYPRHSGDFAFLRAYVGKDGKPAAYAEDNIPYK
PKSYLKVNADGVKAGDGVFVAGYPGTTSRYNLTSELKFASDWLYPTQAKRYQLQIDTINAMGQEDADIAIKYAGNMASMA
NRMKKLNGLLAGFKATDIIGIKQSREDNFLAWLKQNPKLNQNLIAELEVLLAEQQQVFQSNYYFTNAQSSTLLTAANSLY
RLAKEKQKSDAEREIGYQERDLAMFSSRLKRIDSSFHVDVDKTLWQQDLRAYLAQPNRIAALDDALDLNNKETNLEAKLD
GLYSLTTLTDQAQRLAWMDADTTTFETSTDPFIRLAVALYDTNMAQEKAEKILDGKLSTARPDYMAAVIEYYKANNWPVY
PDANGTLRISYGMVDGYQSRDALYKQPFTRLDGIVAKHTGAEPYNAPQKLLDAISEQRFGDHLVKSVYQDPRGWICRLFS
CLDKPEEFNSVPVNFLSSVDTTGGNSGSPVFNGKGELVGLNFDSTYEAITKDWFFNPTITRAVHVDIRYILWMMDKVDHA
DNLIKELDLVRN
>C3J8X2 3.4.14.-~~~dpp5~~~Dipeptidyl-peptidase 5~~~COG0823
MKRTILSLLAAVSLAIPVYAAGYSDGNPTQQTSQSSMMTPEMLLTMARIGGFSLSPNGKQVVYSVSLPSIQDNKAKTQLF
FVNSDGSGRKALTDGTRTAVSPRWIEDGKRIAYLTVIEGEMQLVSILPDGTDQRQVTRIPGGITGYLYSQDGKQLVYTAD
IKLPNEAKDRNPDLDKISGRVITDLMYKHWDEWVETAPHTFVASLAQQPITQGKDLLEGELFEAPMKPHSDESDIAITPD
GKGIAYASRKKTGLEYSISTNSDIYYYDLTTGTTTNLTEGMMGYDTHPSFSPDGKYMTWCSMERDGYESDLIRLFLLDRT
TGEKTYLTEGFEYNVEQPTWSQDGKSIYFIACVEAESHLYELTLKNKKIRRITQGQMDYVGFDLQGTTLVAARQSMLAPT
DLYRIDLKKGTATAITKENESTLAQLGDIRCEKRWMNTTNGEKMLVWVLYPANFDASKKYPSILYCQGGPQSTISQFWSY
RWNPRIMAENGYIVILPNRHGVPGFGKAWNEQISGDYGGQNMRDYLTAADEMKKESYIDPNGMGCVGASYGGFSVYWLAG
HHEKRFNCFIAHAGIFNLEAQYLETEEKWFANWDMGGAPWEKSNATAQRTFATSPHLFVDKWDTPILIIHGERDYRILAS
QGMMAFDAARMHGVPTEMLLYPDENHWVLQPQNAVLWQRTFFRWLDRWLKK
>B2RIT0 3.4.14.-~~~dpp5~~~Dipeptidyl-peptidase 5~~~COG0823
MNKKIFSMMAASIIGSAAMTPSAGTNTGEHLTPELFMTLSRVSEMALSPDGKTAVYAVSFPDVKTNKATRELFTVNLDGS
GRKQITDTESNEYAPAWMADGKRIAFMSNEGGSMQLWVMNADGTERRQLSNIEGGITGFLFSPDEKQVLFTKDIKFGKRT
KDIYPDLDKATGRIITDLMYKHWDEWVETIPHPFIANATDGMITTGKDIMEGEPYEAPMKPWSGIEDFSWSPDGQNIAYA
SRKKTGMAYSLSTNSDIYIYNLTSGRTHNISEGMMGYDTYPKFSPDGKSIAWISMERDGYESDLKRLFVADLATGKRTHV
NPTFDYNVDMIQWAPDSKGIYFLACKEAETNLWEITLKTGKIRQITQGQHDYADFSVRNDVMLAKRHSFELPDDLYRVNP
KNGAAQAVTAENKAILDRLTPIACEKRWMKTTDGGNMLTWVVLPPDFDKNKKYPAILYCQGGPQNTVSQFWSFRWNLRLM
AEQGYIVIAPNRHGVPGFGQKWNEQISGDYGGQNMRDYLTAVDEMKKEPYVDGDRIGAVGASYGGFSVYWLAGHHDKRFA
AFIAHAGIFNLEMQYATTEEMWFANWDIGGPFWEKDNVVAQRTYATSPHKYVQNWDTPILMIHGELDFRILASQAMAAFD
AAQLRGVPSEMLIYPDENHWVLQPQNALLFHRTFFGWLDRWLKK
>P39043 3.4.22.-~~~~~~Dipeptidyl-peptidase 6~~~
MNAIVIAMMANLYAEPDLHAELVDEILYGMPVQIIEELENDWLYVRTAYRYEGYCQRNDVLFDDAITNTWIQKAQHVIGQ
RFADVLQEPKIQSTKIITLVKGSILYNVDSDTTSNTPWTAVQLATGEIGYLRSQWLHPKIAEHTFEEHAFRENVVQTALS
YIATPYRWGGKSPLGIDCSGLCSMAYLLNGVIIFRDARIVEGFPIKEITIDRMQKGDLLFFPGHVALYLGQTLYVHASLG
GNEVNVNSLDEQHPLYRQDLATTITAIGSLF
>Q5LB17 3.4.14.-~~~dpp7~~~Dipeptidyl-peptidase 7~~~COG3591
MNRLKLYLLALTALAVCSAKADEGMWLLQLMQQQHSIDMMKKQGLKLEAQDLYNPNGVSLKDAVGIFGGGCTGEIISPEG
LILTNHHCGYASIQQHSSVEHDYLTDGFWATSRDKELPTPGLKFTFIERIEDITDIVNLRIAAKEITESESFSSTFLNKL
AKELFEKSDLKGKKGIVPQALPFYAGNKFYMFYKKVYPDVRMVAAPPSSIGKFGGETDNWMWPRHTGDFSMFRIYADANG
EPAEYSASNVPLKTKKHLNISIKGLKEGDYAMIMGFPGSTSRYLTVSEVKERMEASNAPRIRIRGTRQDVLKEAMNASDK
VRIQYANKYAGSSNYWKNSIGMNKAIIDNNVLGTKAEQEAKFAKFAKEKNNTDYMNVVAKIDEAVAKTSPIKYQQTCLTE
TFFGGIEFGSPFMVMDKLKEALEQKNDSSIEANIKVLKEVFNDIHNKDYDHEVDRKVAKALLPLYAEMIPAGQRPAIYDV
IEKEYKGDYNAYVDAMYDTSILANQANFDKFIKKPTVKAIEKDIATQYSRAKFDKYTNLAEQMGKLPEELALLHKTYIRG
LGEMKLPVPSYPDANFTIRLTYGNVKPYSPKDGVYYKYYTTTDGILEKENPEDREFVVPAKLKELIEKKDFGRYALPNGE
MPVCFLSTNDITGGNSGSPVLNENGELIGCAFDGNWESLSGDINFDNNLQRCINLDIRYVLFILEKLGGCGHLINEMTIV
E
>C2M262 3.4.14.-~~~dpp7~~~Dipeptidyl-peptidase 7~~~COG3591
MRKLIFSLVTSFFLLLPSVIRADEGMWFLMFIKRLNERDMQKKGLQLTAEEIYSINNNSLKNAIVQFNGGCTASIISPDG
LVITNHHCGYGAIAGLSTPEHNYLKDGYWAKDRSQELPPKSLYVRFFVRMDNVTDRMLSVVNSSMSEKERQDALNREMEK
IQKENSEGGKYVVSVRPFFQGNEYYYFVYQDFKDVRFVGTPPENVGKFGGDTDNWEWPRHTGDFSVFRVYTDKDGNPAPY
SPNNIPMKAKKYLNVTLKGVQENDFAMILGYPGRTNRWVSSHWVDQQVKYGYPAWVEASKTAMDAMKAHMDKDKAVRLKY
ASRYASLANYWKNRQGMIDALTAHKTADLKRAAEKKFAVWANKPENKAEYGNVLSDLATYFEKTNQEAANHNYLLLFFRA
SRIVPQANGYVKQLNTYLNSSSDQEKQQIRERIAKELDAYYSESYLPAEIDLFADNLKLYADKATDIPQEIAQIKSQYNG
DFRKFAAEVFARSIFTTKENFENFMNNPSSDALQSDPIAQIARVMIDKYYNSQSEALKDGYEKAFRKYVKGMRDSKVSLI
LYPDANSTLRLTYGSVKSLPKDKRNHDVKRNYYTTFKTMLEKYKPGDAEFDMPKKFVEMYEKKDFGRYLDKDGTMHVCFL
TNNDITGGNSGSPVMNGKGELIGLAFDGNIEAMAGDVIFDKKLQRTIVVDIRYVLWCIDTFGGAKHIVDEMTIIQ
>A6L2J8 3.4.14.-~~~dpp7~~~Dipeptidyl-peptidase 7~~~COG3591
MKKFKLLLLALMCVAFLPSKADEGMWLLQLMQEQHLADRMKAQGLLLEADDIYNPNRVSLKDAVGIFGGGCTGEIISPDG
LILTNHHCGYGAIQQHSSVEHDYLTDGFWAKSRKEELPTPGLKFKFVERIVDVTDKVNNKVKSGEVKEEETFEYDFLKKL
ADEELKASDLNGKAGISAQALPFYAGNKFYLIYLKTYSDVRMVAAPPSSIGKFGGETDNWMWPRHTCDFSVFRIYADANG
EPAEYNENNVPLKAKKHLAISLKGINEGDYAMIMGFPGSTNRYLTQSEVKQRMHSTNEPRIRIRGVRQDVLKKEMAASDK
VRIQYASKYAGSSNYWKNSIGMNKAIIDNKVLETKAEQEAKFAAFAKAKGNTDYEKVVSEIDAAIEKSNPILYNYTCFRE
VFQGGIEFGTPYLILDKLKDAIKNKDKEAINKNIETLKKVYADIHNKDYDHEVDRKVAKALLPLYAEMVPADALPAFYTT
IQKDFKGNYDAYVDHCYDNSIFSNEANFNKFIKKPTVKAIEKDPMTAYVRAKYDLMDKLGNELAESMKGMDLLHKTYVRG
LCEMYSPEPKAPDANFTIRLTYGNVKSYNPKDGVHYKYYTTLKGVMEKEDPTNPEFVVPAKLKELYEAKDFGRYALPNGD
MPACFLTTNDITGGNSGSPVINGNGELIGAAFDGNWESLSGDINFDNNLQRCIAVDIRYVLFIIDKLGGCKHLIDEMTIV
E
>C3JAQ3 3.4.14.-~~~dpp7~~~Dipeptidyl-peptidase 7~~~COG3591
MKLKRILLSVALLCGIGTTAMADKGMWLLNELNQQNYERMKELGFKLSPEQLYSLGQPSVASAVVIFGGGCTGITVSNEG
LIFTNHHCGFGAIQSQSTVDHDYLRDGFRSNNHVEELPIPGLSVRYLREIVDVTPRIEAAVKGAKSEMERMQIIEELSQK
INAEYTKGSTVVGEVTPYYAGNKYYVVVYNVFQDVRLVMAPPSSVGKFGGDTDNWMWTRHTGDFSVFRVYADANNNPALY
SQNNKPYKPISYAPVSLNGYREGDYAMTIGFPGSTNRYLTSWGVEDVVNNENSPRIEVRGIKQAIWKEAMEADQATRIKY
ASKYAQSSNYWKNSIGMNRGLKNLDVVNRKRAEEKAFEAWIAKNNSQSTYGHILPGLKADYAKSAAISKDINYLYETLWG
GTEIVRLARDVNSVGRIQAADMPKYKGRLEELYKDYLPSLDVKVLPAMLDIVRQRVSADCQPDIFKFIDKKFKGSTEKYA
QYVFEKSIVPYADKVKDFLNLPADKQKKILDKDPAVALFNSVLPAIMQAQDKSEEMMLNIEKGKREYFAASRIMDPNRQM
PSDANFTMRMSYGSIKGYAPKDGAWYNYYTTEQGVFEKQDPTSSEFAVQPEILSLLRSKDFGQYGVGDHLRLCFLSDNDI
TGGNSGSPVFNGNGELIGLAFDGNWEAMSGDIEFEPDLQRTISVDIRYVLFMIDKWAKMPHLIKELNLVKGDQRDLMPAG
KGGNCSHKKAQTCAKKECSKGKKCAEKSATCISAMKDGKPCKTEKACAAGQKSAEKKANCCSTMKDGKPCTGDKDCAKSG
KACCGKNKEAAAKKASKK
>B2RKV3 3.4.14.-~~~dpp7~~~Dipeptidyl-peptidase 7~~~COG3591
MQMKLKSILLGAALLLGASGVAKADKGMWLLNELNQENLDRMRELGFTLPLDSLYSFDKPSIANAVVIFGGGCTGITVSD
QGLIFTNHHCGYGAIQSQSTVDHDYLRDGFVSRTMGEELPIPGLSVKYLRKIVKVTDKVEGQLKGITDEMERLRKAQEVC
QELAKKENADENQLCIVEPFYSNNEYFLIVYDVFKDVRMVFAPPSSVGKFGGDTDNWMWPRHTGDFSVFRVYAGADNRPA
EYSKDNKPYKPVYFAAVSMQGYKADDYAMTIGFPGSTDRYLTSWGVEDRIENENNPRIEVRGIKQGIWKEAMSADQATRI
KYASKYAQSANYWKNSIGMNRGLARLDVIGRKRAEERAFADWIRKNGKSAVYGDVLSSLEKAYKEGAKANREMTYLSETL
FGGTEVVRFAQFANALATNPDAHAGILKSLDDKYKDYLPSLDRKVLPAMLDIVRRRIPADKLPDIFKNVIDKKFKGDTKK
YADFVFDKSVVPYSDKFHAMLKSMDKEKFAKAIEKDPAVELSKSVIAAARAIQADAMANAYAIEKGKRLFFAGLREMYPG
RALPSDANFTMRMSYGSIKGYEPQDGAWYNYHTTGKGVLEKQDPKSDEFAVQENILDLFRTKNYGRYAENGQLHIAFLSN
NDITGGNSGSPVFDKNGRLIGLAFDGNWEAMSGDIEFEPDLQRTISVDIRYVLFMIDKWGQCPRLIQELKLI
>A0A0H2ZGV2 ~~~dppA1~~~Di/tripeptide-binding protein 1~~~
MRRNAVIRSAIMPSLLGAALVAAVPQAFASNLIFCSEGSPAGFDPAQYTTGTDFDAAAETVFNRLTQFERGGTKVLPGLA
ESWDVSDDGKTYTFHLRKGVKFHSTDYFKPTREFNADDVLFTFERMLDKDHPFRKAYPTEFPYFTDMGLDKNIARIEKLD
EHTVKFTLNEVDAAFIQNLAMPFPSIQSAEYAAQLLKQGKASDINQKPIGTGPFVFSRYQKDAQIRFKGNKDYWKPDEVK
VDNLIFAINTDASVRAQKLKAGECQITLNPRPADLDALKKDPNLNLPSQAGFNLGYIAYNVTHKPFDKLEVRQALDMAVN
KQAIIDAVYQGAGQLASNGMPPTQWSYDETIKDAPYDPAKARELLKKAGVAEGTEITLWAMPVQRPYNPNAKLMAEMLQN
DWAKIGIKAKIVTYEWGEYIKRAKGGEHDAMLIGWSGDNGDPDNWLGTLYGCDAVDGNNFSKWCDAGYDKLVKDAKRTTD
QGKRTELYKQAQHILKEQVPITPIAHSTVYQPMRKTVHDFRISPFGLNSFYEVSVGK
>A0A0H2ZGW2 ~~~dppA2~~~Di/tripeptide-binding protein 2~~~
MRPRSALRYSLLLLAFAASAAIQAQPKTLAVCTEAAPEGFDPARYTSGYTFDASAHPLYNALAAFAPGSATVIPALAESW
DVSADGLVYTFRLRQGVKFHSTDYFKPSREFNADDVLFSFQRMLDPQHPAHDLSPSGYPYADAMQLRDIIERIEKIDEHQ
VRFVLKHPEAPFLADLAMPFGSILSAEYAGQLIARGKGDELNSKPIGTGPFVFTRYRKDAQVRYAANPDYWKGKPAIDHL
VLAITLDPNVRVQRLRRNECQIALTPKPEDVAALRQDPQLTVLEEAAMITSHAAINTRHEPFDDPRVRRAIAMGFNKSSY
LKIVFGDQARPAIGPYPPMLLGYDDSIRDWPYDPERAKALLKEAGVAPDTPLNLYISTGSGPGGNPARVAQLIQSDLAAI
GIRVNIHQFEWGEMVKRTKAGEHDMMLYSWIGDNGDPDNFLTHNLGCASVESGENRARWCDKGFDEAIRKARMSNDESQR
VALYKEAQRIFHEQMPWLPLAHPLMFDAQRKNVSGYRMSPMSARDFSRVKLD
>A0A0H2ZGN2 ~~~dppA3~~~Di/tripeptide-binding protein 3~~~
MRKILPLRAWLAAGLILGSPFSHAASNLVFCSEGSPAGFDPAQYTTGTDYDATSVTLFNRLVQFERGGTRAIPALAESWD
IGDDGKTYTFHLRKGVKFHSTDYFKPTREFNADDVLFTFERMLDKNHPFRKAYPTEFPYFTDMGLDKNIARVEKLDEHRV
KFTLNEVDAAFIQNLAMDVASIQSAEYAGQLLEAGKPQQINQKPIGTGPFILSRYQKDAQIRFKGNKDYWKPEDVKIDNL
IFSINTDAAVRAQKLKAGECQITLNPRPADLKALQEAANLKVPSQPGFNLGYIAYNVTHKPFDQLEVRQALDMAVNKQAI
IDAVYQGAGQLAVNGMPPTQWSYDETIKDAPFDPAKARELLKKAGVAEGTEITLWAMPVQRPYNPNAKLMAEMIQADWAK
IGIKARIVSYEWGEYIKRAHAGEHDAMLFGWTGDNGDPDNWLATLYGCDSINGNNVSKWCDAAYDKLVKAAKRVSDQDKR
SELYKQAQHILKEQVPITPIAHSTVYQPMNKSVHDFKISPFSRNAFYGVTNQP
>A0A0H2ZGV7 ~~~dppA4~~~Di/tripeptide-binding protein 4~~~
MLHPLLRHLPLALALALCAAGAAQAKNLVVCTEASPEGFDIVQYTGAVTADASAETVFNRLLAFRPGTTEVIPGLAERWD
VSADGLSYTFHLRPGVKFHTTDYFKPTRSLNADDVLWTFQRALDPKHPWHASALRGYAYFDAMGMGELIKSVEKVDELTV
RFVLNRPEAPFLRDMAMPFASIYSAEYGDQLLAAGKQGQLNNQPIGTGPFVFKRYAKDAQVRYTANPDYYAGKPPIDNLV
FAITLDPNVRMQKVRAGECQVSLYPKPEDVPRLKQDPNLAVDEIDALLTTYIAINTQHKPLDDPRVRQAINLALDKKAML
DAVFGPGAASPAVGPYPPTLLGYNHSIQDWPHDPERARALLKEAGAENLRITLFIRNGTSPTIPNPALAAQMLQADLAKA
GIQLTIRSLEWGELLKRSKAGEHDLSLLGWAGDNGDPDNFLSPNLSCAAAESGENQARWCDKDFEALMRKAREVSDPAER
AKLYEQAQVVFHEQAPWIPLAYPKLFNVRRNTVQGYVINPLSNNNFATTSVKP
>A0A0H2ZI72 ~~~dppA5~~~Probable di/tripeptide-binding protein 5~~~
MRLAAFSLFLAPLLLAQPAAAATLSVCTEASPEGFDVVQYNSLTTTNASADVLMNRLVEFDAGKGTVVPSLAERWSVSDD
GLSYRFDLRQGVHFHSTAYFKPSRTLDADDVVFSFQRMLDPANPWHKVAQNGFPHAQSMQLPELIKRVEKSGDHQVLIVL
DHPDATFLPMLSMGFASIYSAEYADQLMKAGTPEKLNTAPIGSGPFVFKRFQKDAVVRYAANPEYFAGKPAVDALIFAIT
PDANVRLQKLRRGECQIALSPKPLDVESARKDASLKVEQTPAFMTAFVALNTQHPPLDDPKVRQAINLAFDRTSYLQAVF
EGSASAATGIYPPNTWSYARDIPAYPHDPEQARKLLAGKQLPELNIWTRPSGSLLNPNPSLGAQLLQADLAEAGIKANIR
VIEWGELIRRAKNGEHDLLFMGWAGDNGDPDNFLTPQFSCASVKSGLNFARYCDPGLDKLIADGKAASSQEQRTGLYHQA
QKLIHEQALWLPLAHPTAFALTRQEVQGYQVNPFGRQDFSRVAVKR
>P26902 3.4.11.-~~~dppA~~~D-aminopeptidase~~~COG2362
MKLYMSVDMEGISGLPDDTFVDSGKRNYERGRLIMTEEANYCIAEAFNSGCTEVLVNDSHSKMNNLMVEKLHPEADLISG
DVKPFSMVEGLDDTFRGALFLGYHARASTPGVMSHSMIFGVRHFYINDRPVGELGLNAYVAGYYDVPVLMVAGDDRAAKE
AEELIPNVTTAAVKQTISRSAVKCLSPAKAGRLLTEKTAFALQNKDKVKPLTPPDRPVLSIEFANYGQAEWANLMPGTEI
KTGTTTVQFQAKDMLEAYQAMLVMTELAMRTSFC
>P23847 ~~~dppA~~~Dipeptide-binding protein~~~COG0747
MRISLKKSGMLKLGLSLVAMTVAASVQAKTLVYCSEGSPEGFNPQLFTSGTTYDASSVPLYNRLVEFKIGTTEVIPGLAE
KWEVSEDGKTYTFHLRKGVKWHDNKEFKPTRELNADDVVFSFDRQKNAQNPYHKVSGGSYEYFEGMGLPELISEVKKVDD
NTVQFVLTRPEAPFLADLAMDFASILSKEYADAMMKAGTPEKLDLNPIGTGPFQLQQYQKDSRIRYKAFDGYWGTKPQID
TLVFSITPDASVRYAKLQKNECQVMPYPNPADIARMKQDKSINLMEMPGLNVGYLSYNVQKKPLDDVKVRQALTYAVNKD
AIIKAVYQGAGVSAKNLIPPTMWGYNDDVQDYTYDPEKAKALLKEAGLEKGFSIDLWAMPVQRPYNPNARRMAEMIQADW
AKVGVQAKIVTYEWGEYLKRAKDGEHQTVMMGWTGDNGDPDNFFATLFSCAASEQGSNYSKWCYKPFEDLIQPARATDDH
NKRVELYKQAQVVMHDQAPALIIAHSTVFEPVRKEVKGYVVDPLGKHHFENVSIE
>A2RI74 ~~~dppA~~~Dipeptide-binding protein~~~COG4166
MKQAKIIGLSTVIALSGIILVACGSKTSEQKNIQFSIPTDVASLDTTILTDQYSYDVAGNVEEGLTRVDSKGNAALALAK
SIDVSKDGLTYTVTLKDNLKWSNGDKLTAKDFVYSWKRAVDPKTGSEYAYLMGAVSGANDIISGKSSLDTLGIKAESDTE
FTVTLAQPTPYFKFLLSEPVYYPLDQKVVDKYGKQYGTSSDKTVYNGPFMFKSDKAWTGTNKNFSIYANPNYYDKSAVKS
KQIDFQVISNANTGAQLYKQGKLDFTLLSTTDLINANKKTEGYTVFKQARTDYIEYNQSGKNASSPDAQKALANQDIRQA
LNLATNRAEVVKTALPGSTAATSFTPVGMSKTSTGEDFATYAKQDYSYDPTKAKELWAKGLKELGLTKLSLSLEAAGDLA
PSEATANFLQTAYQQNLPGLTVNLKLVPFKQRLNDAQNGNFDMVLSGWGGDYAEPSTFLQLFTTGQSYNDGKFSSKTYDD
AFKAATTTPDVLEPAKVDEHYKAAEAALYEGSYINPIDFQANPALMNPKITGLEFHSTGLAYDLKSAYIK
>P26903 ~~~dppB~~~Dipeptide transport system permease protein DppB~~~COG0601
MARYMIKRFWAMAATILVITTLTFVLMKVIPGSPFNEERGTNEAVQKNLEAYYHLDDPLIFQYIFYLKSIITFDFGPSIK
KPSDSVNDMLERGFPVSFELGMTAIVIAVISGLVLGVIAALRRNGFLDYAAMSLAVLGISIPNFILATLLIQQFAVNLKL
FPAATWTSPIHMVLPTAALAVGPMAIIARLTRSSMVEVLTQDYIRTAKAKGLSPFKIIVKHALRNALMPVITVLGTLVAS
ILTGSFVIEKIFAIPGMGKYFVESINQRDYPVIMGTTVFYSVILIIMLFLVDLAYGLLDPRIKLHKKG
>P0AEF8 ~~~dppB~~~Dipeptide transport system permease protein DppB~~~COG0601
MLQFILRRLGLVIPTFIGITLLTFAFVHMIPGDPVMIMAGERGISPERHAQLLAELGLDKPMWQQYLHYIWGVMHGDLGI
SMKSRIPVWEEFVPRFQATLELGVCAMIFATAVGIPVGVLAAVKRGSIFDHTAVGLALTGYSMPIFWWGMMLIMLVSVHW
NLTPVSGRVSDMVFLDDSNPLTGFMLIDTAIWGEDGNFIDAVAHMILPAIVLGTIPLAVIVRMTRSSMLEVLGEDYIRTA
RAKGLTRMRVIIVHALRNAMLPVVTVIGLQVGTLLAGAILTETIFSWPGLGRWLIDALQRRDYPVVQGGVLLVATMIILV
NLLVDLLYGVVNPRIRHKK
>A2RI75 ~~~dppB~~~Dipeptide transport system permease protein DppB~~~COG0601
MVKYILKRLGLLLLTLFLIVTLTFFMMQVMPGTPFSNPKLTPDQLEILKHAYGLDKPLWQQYFIYVGHMFTGNFGTSFIY
TNQPVITMIAQRLPVSMQLGTQALILGTVLGALMGKASARRKNGLLDGIFGFLSVLGISVPSFVIGTLILLYLGFNLNLF
PISGWGTFSQTIMPTIALSFAPMAVVTRFVRSEMIESLSSDYILLARAKGLSEKEVVNKHALRNSLIPMLTLIGPMAAGL
LTGSVLIEKIFSIPGIGAQFVDSIPAKDFPVIMATTIVYAVILMVFILVTDILTAIVDPRVRL
>A0A0H2ZGW7 ~~~dppB~~~Di/tripeptide transport system permease protein DppB~~~
MLSFIARRLGLLIPTFFGVTLLTFALIRLIPGDPVEVMMGERRVDPQMHAEALHRLGLDKPLYQQYLDYVGNLAQGNLGE
SLTTREGVWHEFLTLFPATLELSLAAMLFAGTFGLLAGVIAALKRGSLFDHGVMTVSLAGYSMPIFWWGLILIMLFSVSL
GWTPVSGRLDLLYDIEPKTGFMLIDTLLSDEQGSFLDAVRHLILPAIVLGTIPLAVIARMTRSAMLEVLREDYVRTARAK
GLSPARVVFVHALRNALIPVLTVFGLQVGTLLAGAVLTETIFSWPGIGKWLIDAISRRDYPVVQNGILLVATLVILVNFV
VDILYGLANPRIRHQR
>P26904 ~~~dppC~~~Dipeptide transport system permease protein DppC~~~COG1173
MNLPVQTDERQPEQHNQVPDEWFVLNQEKNREADSVKRPSLSYTQDAWRRLKKNKLAMAGLFILLFLFVMAVIGPFLSPH
SVVRQSLTEQNLPPSADHWFGTDELGRDVFTRTWYGARISLFVGVMAALIDFLIGVIYGGVAGYKGGRIDSIMMRIIEVL
YGLPYLLVVILLMVLMGPGLGTIIVALTVTGWVGMARIVRGQVLQIKNYEYVLASKTFGAKTFRIIRKNLLPNTMGAIIV
QMTLTVPAAIFAESFLSFLGLGIQAPFASWGVMANDGLPTILSGHWWRLFFPAFFISLTMYAFNVLGDGLQDALDPKLRR
>P0AEG1 ~~~dppC~~~Dipeptide transport system permease protein DppC~~~COG1173
MSQVTENKVISAPVPMTPLQEFWHYFKRNKGAVVGLVYVVIVLFIAIFANWIAPYNPAEQFRDALLAPPAWQEGGSMAHL
LGTDDVGRDVLSRLMYGARLSLLVGCLVVVLSLIMGVILGLIAGYFGGLVDNIIMRVVDIMLALPSLLLALVLVAIFGPS
IGNAALALTFVALPHYVRLTRAAVLVEVNRDYVTASRVAGAGAMRQMFINIFPNCLAPLIVQASLGFSNAILDMAALGFL
GMGAQPPTPEWGTMLSDVLQFAQSAWWVVTFPGLAILLTVLAFNLMGDGLRDALDPKLKQ
>A2RI76 ~~~dppC~~~Dipeptide transport system permease protein DppC~~~COG1173
MENLNKDFTLVGSKGSDSTEKIAKPALSFFQDAWRRFKKNKIALVAMWIIAITLVFSVISAFVVPQSKANYFNPNKSQVY
GNLPPKLSGDLPFWNGDFKAPGSAEKTDVYKAQGVPEKDKYVFGTDKYGRSLAKRTVVGLRISLIIALAAALIDLVIGVT
YGIISGWMGGKVDMVMQRIIEIIQSVPNLVVVTMLALLLGQGISSIIIAIGLFAWTGMARQVRNMVLSYKERDFVLASKT
LGQSTWKIAVKHLLPNVSGVIVVQIMFDIPSMIMYEAVLSAINLGVKPPTSSLGTLINDGIASLQFYPFQLIIPAIVLSV
LSLTFIFFGDGLRDAFDPRASED
>A0A0H2ZFV0 ~~~dppC~~~Di/tripeptide transport system permease protein DppC~~~
MNAMHNAPASDPSLVYPSPLKEFWQSFAHNKGALGGLLFMLLIVFCALFAPWVAPYDPSEQFRDFLLTPPSWLEGGQARF
LLGTDELGRDLLSRLIHGARLSLLIGLSSVVISLIPGILLGLLAGFSPNRAGPLIMRLMDIMLALPSLLLAVAIVAILGP
GLINTVIAIAIVSLPAYVRLTRAAVMTELNRDYVTASRLAGAGTLRLMFVCVLPNCMAPLIVQATLSFSSAILDAAALGF
LGLGVQPPTPEWGTMLASARDYIERAWWVVSLPGLTILLSVLAINLMGDGLRDALDPKLKNAA
>Q8RDH4 7.4.2.9~~~dppD~~~Dipeptide transport ATP-binding protein DppD~~~COG0444
MSIIIRVEDLRAVYLVREGTIKAADGISLDILENSVTAIVGESASGKSTIIEAMTKTLPPNGRILSGRVLYKGKDLLTMR
EEELRKIRWKEIALVPQAAQQSLNPTMKVIEHFKDTVEAHGVRWSHSELIEKASEKLRMVRLNPEAVLNSYPLQLSGGMK
QRVLIALALLLDPVVLILDEPTSALDVLTQAHIIQLLKELKKMLKITLIFVTHDIAVAAELADKVAVIYGGNLVEYNSTF
QIFKNPLHPYTRGLINSIMAVNADMSKVKPIPGDPPSLLNPPSGCRFHPRCEYAMEICKKEKPKWIRLDGEAHVACHLYE
EGRPLK
>P0AAG0 7.4.2.9~~~dppD~~~Dipeptide transport ATP-binding protein DppD~~~COG0444
MALLNVDKLSVHFGDESAPFRAVDRISYSVKQGEVVGIVGESGSGKSVSSLAIMGLIDYPGRVMAEKLEFNGQDLQRISE
KERRNLVGAEVAMIFQDPMTSLNPCYTVGFQIMEAIKVHQGGNKSTRRQRAIDLLNQVGIPDPASRLDVYPHQLSGGMSQ
RVMIAMAIACRPKLLIADEPTTALDVTIQAQIIELLLELQQKENMALVLITHDLALVAEAAHKIIVMYAGQVVETGDAHA
IFHAPRHPYTQALLRALPEFAQDKERLASLPGVVPGKYDRPNGCLLNPRCPYATDRCRAEEPALNMLADGRQSKCHYPLD
DAGRPTL
>A2RI77 7.4.2.9~~~dppD~~~Dipeptide transport ATP-binding protein DppD~~~COG0444
MAEEKVLEVKNLHVNFHTYAGDVKAIRNVSFDLEKGQTLAIVGESGSGKSVTTKTLMGLNAKNAEIPEGELLFKGRNLLD
LKEEEWQKIRGNEISMIFQDPMTSLDPTMRIGKQIAEPLLKHNKGMSKADAMKRALELMQQVGIPDAEVHINDYPHQWSG
GMRQRAVIAIALAADPEILIADEPTTALDVTIQAQIMHMMAELQERINSSIVFITHDLGVVAGFAHKVAVMYAGEIVEYG
TVEEIFYNPQHPYTWGLLDSMPTVDSSVDRLVSIPGTPPDLLNPPKGDAFAARNKFALAIDFEEEPPYFEVSPTHFAKTW
LLDPRAPKVTPSDNILARWKRWEELKGDK
>A0A0H2ZGN6 7.4.2.9~~~dppD~~~Di/tripeptide transport ATP-binding protein DppD~~~
MSLLDIKNLSVRFGDTTAVPVVDGLDLSVDKGEVLAIVGESGSGKSVTMMALMGLIDAPGWVSADHLRFDGHDMLTLKGR
QRRRIVGKDMAMVFQDPMTALNPSYTVGYQIEEVLRLHLGLRGKALRQRALELLERVEIPAAASRLDAYPHQLSGGMSQR
VAIAMAIAAEPKLLIADEPTTALDVTIQAQIMELLLNLQRDQDMALILITHDLAVVAETAQRVCVMYAGEAVEIGGVPAL
FDRPTHPYTEALIKAIPEHCAGEARLATLPGIVPGRYDRPRGCLLSPRCPYAQEHCRQERPALEAHERGAVRCFYPLNLL
NEVA
>P26906 ~~~dppE~~~Dipeptide-binding protein DppE~~~COG4166
MKRVKKLWGMGLALGLSFALMGCTANEQAGKEGSHDKAKTSGEKVLYVNNENEPTSFDPPIGFNNVSWQPLNNIMEGLTR
LGKDHEPEPAMAEKWSVSKDNKTYTFTIRENAKWTNGDPVTAGDFEYAWKRMLDPKKGASSAFLGYFIEGGEAYNSGKGK
KDDVKVTAKDDRTLEVTLEAPQKYFLSVVSNPAYFPVNEKVDKDNPKWFAESDTFVGNGPFKLTEWKHDDSITMEKSDTY
WDKDTVKLDKVKWAMVSDRNTDYQMFQSGELDTAYVPAELSDQLLDQDNVNIVDQAGLYFYRFNVNMEPFQNENIRKAFA
MAVDQEEIVKYVTKNNEKPAHAFVSPGFTQPDGKDFREAGGDLIKPNESKAKQLLEKGMKEENYNKLPAITLTYSTKPEH
KKIAEAIQQKLKNSLGVDVKLANMEWNVFLEDQKALKFQFSQSSFLPDYADPISFLEAFQTGNSMNRTGWANKEYDQLIK
QAKNEADEKTRFSLMHQAEELLINEAPIIPVYFYNQVHLQNEQVKGIVRHPVGYIDLKWADKN
>P37313 7.4.2.9~~~dppF~~~Dipeptide transport ATP-binding protein DppF~~~COG4608
MSTQEATLQQPLLQAIDLKKHYPVKKGMFAPERLVKALDGVSFNLERGKTLAVVGESGCGKSTLGRLLTMIEMPTGGELY
YQGQDLLKHDPQAQKLRRQKIQIVFQNPYGSLNPRKKVGQILEEPLLINTSLSKEQRREKALSMMAKVGLKTEHYDRYPH
MFSGGQRQRIAIARGLMLDPDVVIADEPVSALDVSVRAQVLNLMMDLQQELGLSYVFISHDLSVVEHIADEVMVMYLGRC
VEKGTKDQIFNNPRHPYTQALLSATPRLNPDDRRERIKLSGELPSPLNPPPGCAFNARCRRRFGPCTQLQPQLKDYGGQL
VACFAVDQDENPQR
>A2RI78 7.4.2.9~~~dppF~~~Dipeptide transport ATP-binding protein DppF~~~COG4608
MTEPKKVVEIKNLDLTFNKGKKGANKAINNVSLDIYEGETFGLVGESGSGKTTIGRAILKLYDNFITGGEILFEGKDVRN
LKGSELREYRSEAQMIFQDPQASLNGRMRVKDIVAEGLDANGLVKTKAERDARVLELLRLVGLNDDHLTRYPHEFSGGQR
QRIGIARALAVKPKFVVADEPISALDVSIQAQVVNLMRDIQAKENLTYLFIAHDLSMVKYISDRIGVMHWGKILEVGTSE
QVYNHPIHPYTKSLLSSIPSPDPISERQRTPIVYDPTAELDGQEREMREITPGHFVFSTEAEAEVYKKNATL
>A0A0H2ZH52 7.4.2.9~~~dppF~~~Di/tripeptide transport ATP-binding protein DppF~~~
METVLTARDLTRHYEVSRGLFKGHAQVRALNGVSFELEAGKTLAVVGESGCGKSTLARALTLIEEPTSGSLKIAGQEVKG
ASKDQRRQLRRDVQMVFQNPYASLNPRQKIGDQLAEPLLINTALSREERREKVQQMMRQVGLRPEHYQRYPHMFSGGQRQ
RIALARAMMLQPKVLVADEPTSALDVSIQAQVLNLFMDLQQQFRTAYVFISHNLAVVRHVADDVLVMYLGRPAEMGPADK
LYENPLHPYTRALLSATPAIHPDPTKPKIRIQGELPNPLHPPEGCAFHKRCPYATERCRSEVPELRLLDQRQVACHHAEQ
FLG
>P9WFR5 2.4.2.45~~~~~~Decaprenyl-phosphate phosphoribosyltransferase~~~COG0382
MSEDVVTQPPANLVAGVVKAIRPRQWVKNVLVLAAPLAALGGGVRYDYVEVLSKVSMAFVVFSLAASAVYLVNDVRDVEA
DREHPTKRFRPIAAGVVPEWLAYTVAVVLGVTSLAGAWMLTPNLALVMVVYLAMQLAYCFGLKHQAVVEICVVSSAYLIR
AIAGGVATKIPLSKWFLLIMAFGSLFMVAGKRYAELHLAERTGAAIRKSLESYTSTYLRFVWTLSATAVVLCYGLWAFER
DGYSGSWFAVSMIPFTIAILRYAVDVDGGLAGEPEDIALRDRVLQLLALAWIATVGAAVAFG
>P39813 ~~~dprA~~~DNA processing protein DprA~~~COG0758
MDQAAVCLTICRINQLLSPSLLLKWWKADPSMSLTSPVLQTVTRDQIKAAALKNEIEQFYPKLPRVLAAYREQGINTIPI
SSKQYPFWLKSIYDPPAVLFAKGDMTLLSKGRKIGIVGTRNPTAYGKQVVNHLTKEICRKGWVIVSGLASGIDGMSHAAS
IKAKGRTIGVIAGGFQHIYPRENLQLADHMAKHHILLSEHPPETKPQKWHFPMRNRIISGLSEGVIVVQGKEKSGSLITA
YQALEQGREVFAVPGSLFDPYAGGPIKLIQQGAKAIWSAEDIFEELPERNVQYTEPF
>P9WL29 ~~~dprA~~~Putative DNA processing protein DprA~~~COG0758
MIDPTARAWAYLSRVAEPPCAQLAALVRCVGPVEAADRVRRGQVGNELAQHTGARREIDRAADDLELLMRRGGRLITPDD
DEWPVLAFAAFSGAGARARPCGHSPLVLWALGPARLDEVAPRAAAVVGTRAATAYGEHVAADLAAGLAERDVAVVSGGAY
GIDGAAHRAALDSEGITVAVLAGGFDIPYPAGHSALLHRIAQHGVLFTEYPPGVRPARHRFLTRNRLVAAVARAAVVVEA
GLRSGAANTAAWARALGRVVAAVPGPVTSSASAGCHTLLRHGAELVTRADDIVEFVGHIGELAGDEPRPGAALDVLSEAE
RQVYEALPGRGAATIDEIAVGSGLLPAQVLGPLAILEVAGLAECRDGRWRILRAGAGQAAAKGAAARLV
>Q8DPI7 ~~~dprA~~~DNA processing protein DprA~~~COG0758
MKITNYEIYKLKKSGLTNQQILKVLEYGENVDQELLLGDIADISGCRNPAVFMERYFQIDDAHLSKEFQKFPSFSILDDC
YPWDLSEIYDAPVLLFYKGNLDLLKFPKVAVVGSRACSKQGAKSVEKVIQGLENELVIVSGLAKGIDTAAHMAALQNGGK
TIAVIGTGLDVFYPKANKRLQDYIGNDHLVLSEYGPGEQPLKFHFPARNRIIAGLCRGVIVAEAKMRSGSLITCERAMEE
GRDVFAIPGSILDGLSDGCHHLIQEGAKLVTSGQDVLAEFEF
>A0R607 1.1.98.3~~~dprE1~~~Decaprenylphosphoryl-beta-D-ribose oxidase~~~COG0277
MSTTEFPTTTKRLMGWGRTAPTVASVLSTSDPEVIVRAVTRAAEEGGRGVIARGLGRSYGDNAQNGGGLVIDMPALNRIH
SIDSGTRLVDVDAGVSLDQLMKAALPHGLWVPVLPGTRQVTVGGAIGCDIHGKNHHSAGSFGNHVRSMELLTANGEVRHL
TPAGPDSDLFWATVGGNGLTGIILRATIEMTPTETAYFIADGDVTGSLDETIAFHSDGSEANYTYSSAWFDAISKPPKLG
RAAISRGSLAKLDQLPSKLQKDPLKFDAPQLLTLPDIFPNGLANKFTFMPIGELWYRKSGTYRNKVQNLTQFYHPLDMFG
EWNRAYGSAGFLQYQFVVPTEAVEEFKSIIVDIQRSGHYSFLNVFKLFGPGNQAPLSFPIPGWNVCVDFPIKAGLHEFVT
ELDRRVLEFGGRLYTAKDSRTTAETFHAMYPRIDEWIRIRRSVDPDGVFASDMARRLQLL
>P9WJF1 1.1.98.3~~~dprE1~~~Decaprenylphosphoryl-beta-D-ribose oxidase~~~COG0277
MLSVGATTTATRLTGWGRTAPSVANVLRTPDAEMIVKAVARVAESGGGRGAIARGLGRSYGDNAQNGGGLVIDMTPLNTI
HSIDADTKLVDIDAGVNLDQLMKAALPFGLWVPVLPGTRQVTVGGAIACDIHGKNHHSAGSFGNHVRSMDLLTADGEIRH
LTPTGEDAELFWATVGGNGLTGIIMRATIEMTPTSTAYFIADGDVTASLDETIALHSDGSEARYTYSSAWFDAISAPPKL
GRAAVSRGRLATVEQLPAKLRSEPLKFDAPQLLTLPDVFPNGLANKYTFGPIGELWYRKSGTYRGKVQNLTQFYHPLDMF
GEWNRAYGPAGFLQYQFVIPTEAVDEFKKIIGVIQASGHYSFLNVFKLFGPRNQAPLSFPIPGWNICVDFPIKDGLGKFV
SELDRRVLEFGGRLYTAKDSRTTAETFHAMYPRVDEWISVRRKVDPLRVFASDMARRLELL
>A0R610 1.1.1.333~~~dprE2~~~Decaprenylphosphoryl-2-keto-beta-D-erythro-pentose reductase~~~COG1028
MVFDAVGNPQTILLLGGTSEIGLAICERYLRNASARIVLAVMPGDPGRDAAVEQMRKAGASAVDVVDFDALDTESHPAVI
DQAFAGGDVDVAIVAFGLLGDAEELWQNQRKAVQIAGVNYTAAVSVGVLLGEKMRAQGSGQIIAMSSAAGERVRRSNFVY
GSTKAGLDGFYLGLGEALREFGVRVLVIRPGQVRTRMSAHVKEAPLTVDKEYVAELAVTASAKGKELVWAPGAFRYVMMV
LRHIPRPIFRKLPI
>P9WGS9 1.1.1.333~~~dprE2~~~Decaprenylphosphoryl-2-keto-beta-D-erythro-pentose reductase~~~COG1028
MVLDAVGNPQTVLLLGGTSEIGLAICERYLHNSAARIVLACLPDDPRREDAAAAMKQAGARSVELIDFDALDTDSHPKMI
EAAFSGGDVDVAIVAFGLLGDAEELWQNQRKAVQIAEINYTAAVSVGVLLAEKMRAQGFGQIIAMSSAAGERVRRANFVY
GSTKAGLDGFYLGLSEALREYGVRVLVIRPGQVRTRMSAHLKEAPLTVDKEYVANLAVTASAKGKELVWAPAAFRYVMMV
LRHIPRSIFRKLPI
>P9WI53 3.1.3.-~~~~~~Putative decaprenylphosphoryl-5-phosphoribose phosphatase Rv3807c~~~COG0671
MAERAPRGEVAVMVAVQSALVDRPGMLATARGLSHFGEHCIGWLILALLGAIALPRRRREWLVAGAGAFVAHAIAVLIKR
LVRRQRPDHPAIAVNVDTPSQLSFPSAHATSTTAAALLMGRATGLPLPVVLVPPMALSRILLGVHYPSDVAVGVALGATV
GAIVDSVGGGRQRARKR
>Q8RPQ1 1.16.-.-~~~dps1~~~DNA protection during starvation protein 1~~~COG0783
MSTKTNVVEVLNKQVANWNVLYVKLHNYHWYVTGPHFFTLHEKFEEFYNEAGTYIDELAERILALEGKPLATMKEYLATS
SVNEGTSKESAEEMVQTLVNDYSALIQELKEGMEVAGEAGDATSADMLLAIHTTLEQHVWMLSAFLK
>Q9RS64 1.16.-.-~~~dps1~~~DNA protection during starvation protein 1~~~COG0783
MTKKSTKSEAASKTKKSGVPETGAQGVRAGGADHADAAHLGTVNNALVNHHYLEEKEFQTVAETLQRNLATTISLYLKFK
KYHWDIRGRFFRDLHLAYDEFIAEIFPSIDEQAERLVALGGSPLAAPADLARYSTVQVPQETVRDARTQVADLVQDLSRV
GKGYRDDSQACDEANDPVTADMYNGYAATIDKIRWMLQAIMDDERLD
>Q8RPQ2 1.16.-.-~~~dps2~~~DNA protection during starvation protein 2~~~COG0783
MNKQVIEVLNKQVADWSVLFTKLHNFHWYVKGPQFFTLHEKFEELYTESATHIDEIAERILAIGGKPVATMKEYLEISSI
QEAAYGETAEGMVEAIMKDYEMMLVELKKGMEIAQNSDDEMTSDLLLGIYTELEKHAWMLRAFLNQ
>Q9RZN1 1.16.-.-~~~dps2~~~DNA protection during starvation protein 2~~~
MRHSVKTVVVVSSLLLGTALAGGAGAQSAGNGVPSTNVNTPAPNTGQSTAQNTNTASPLPYNRATTLPAAGTEDLKKSVQ
ALQNTLTELQALQLQTKQAHWNVSGTLWYTLHELLQDHYEGISKFADDVAERQLSVGASSDGRAITIVAASRLPEIPGGF
LDDAQVIQFFTYQYETVGQRIHQRVGDVEKVDPTTANLLQEVEHIIEKYQWQMRAFLQNTPTDPNTGFDINNGKPVPLRG
R
>Q8UCK6 1.16.-.-~~~dps~~~DNA protection during starvation protein~~~COG0783
MKTHKTKNDLPSNAKSTVIGILNESLASVIDLALVTKQAHWNLKGPQFIAVHELLDTFRTQLDNHGDTIAERVVQLGGTA
LGSLQAVSSTTKLKAYPTDIYKIHDHLDALIERYGEVANMIRKAIDDSDEAGDPTTADIFTAASRDLDKSLWFLEAHVQE
KS
>P83695 1.16.-.-~~~dps~~~DNA protection during starvation protein~~~
MKTSIQQLVAVLLNRQVANWVVLYVKLHNFHWNVNGPNFFTLHEKFEELYTEASGHIDTLAERVLSIGGSPIATLAASLE
EASIKEATGGESAAEMVSSVVNDFVDLVGELKVARDVADEADDEATADMLDAIEAGLEKHVWMLEAFLE
>Q0P891 1.16.-.-~~~dps~~~DNA protection during starvation protein~~~COG0783
MSVTKQLLQMQADAHHLWVKFHNYHWNVKGLQFFSIHEYTEKAYEEMAELFDSCAERVLQLGEKAITCQKVLMENAKSPK
VAKDCFTPLEVIELIKQDYEYLLAEFKKLNEAAEKESDTTTAAFAQENIAKYEKSLWMIGATLQGACKM
>P0ABT2 1.16.-.-~~~dps~~~DNA protection during starvation protein~~~COG0783
MSTAKLVKSKATNLLYTRNDVSDSEKKATVELLNRQVIQFIDLSLITKQAHWNMRGANFIAVHEMLDGFRTALIDHLDTM
AERAVQLGGVALGTTQVINSKTPLKSYPLDIHNVQDHLKELADRYAIVANDVRKAIGEAKDDDTADILTAASRDLDKFLW
FIESNIE
>P43313 1.16.-.-~~~dps~~~DNA protection during starvation protein~~~COG0783
MKTFEILKHLQADAIVLFMKVHNFHWNVKGTDFFNVHKATEEIYEEFADMFDDLAERIVQLGHHPLVTLSEAIKLTRVKE
ETKTSFHSKDIFKEILEDYKYLEKEFKELSNTAEKEGDKVTVTYADDQLAKLQKSIWMLQAHLA
>P80725 1.16.-.-~~~dps~~~DNA protection during starvation protein~~~COG0783
MKTINSVDTKEFLNHQVANLNVFTVKIHQIHWYMRGHNFFTLHEKMDDLYSEFGEQMDEVAERLLAIGGSPFSTLKEFLE
NASVEEAPYTKPKTMDQLMEDLVGTLELLRDEYKQGIELTDKEGDDVTNDMLIAFKASIDKHIWMFKAFLGKAPLE
>Q8Y8G1 1.16.-.-~~~dps~~~DNA protection during starvation protein~~~COG0783
MKTINSVDTKEFLNHQVANLNVFTVKIHQIHWYMRGHNFFTLHEKMDDLYSEFGEQMDEVAERLLAIGGSPFSTLKEFLE
NASVEEAPYTKPKTMDQLMEDLVGTLELLRDEYQQGIELTDKEGDNVTNDMLIAFKASIDKHIWMFKAFLGKAPLE
>A0R692 1.16.-.-~~~dps~~~DNA protection during starvation protein~~~COG0783
MTSFTIPGLSDKKASDVADLLQKQLSTYNDLHLTLKHVHWNVVGPNFIGVHEMIDPQVELVRGYADEVAERIATLGKSPK
GTPGAIIKDRTWDDYSVERDTVQAHLAALDLVYNGVIEDTRKSIEKLEDLDLVSQDLLIAHAGELEKFQWFVRAHLESAG
GQLTHEGQSTEKGAADKARRKSA
>P0C558 1.16.-.-~~~dps~~~DNA protection during starvation protein~~~
MTSFTIPGLSDKKASDVADLLQKQLSTYNDLHLTLKHVHWNVVGPNFIGVHEMIDPQVELVRGYADEVAERIATLGKSPK
GTPGAIIKDRTWDDYSVERDTVQAHLAALDLVYNGVIEDTRKSIEKLEDLDLVSQDLLIAHAGELEKFQWFVRAHLESAG
GQLTHEGQSTEKGAADKARRKSA
>B2RMG0 1.16.-.-~~~dps~~~DNA protection during starvation protein~~~COG0783
MKKILEVTGLKEQQVAPVVKGLSGLLADLQVYYSNLRGFHWNIRGAEFFVLHEQYEKMYDDLAGKIDEVAERILQLGGKP
ENRFSEYLKVAEVKEEHELVCAASTLKNVTDTLQIIMAKERAIAEVAGEAGDEVTVDLMIGFLSGQEKLVWMLSAYATK
>P0C935 1.16.-.-~~~dps~~~DNA protection during starvation protein~~~COG0783
MKKILEVTGLKEQQVAPVVKGLSGLLADLQVYYSNLRGFHWNIRGAEFFVLHEQYEKMYDDLAGKIDEVAERILQLGGKP
ENRFSEYLKVAEVKEEHELVCAASTLKNVTDTLQIIMAKERAIAEVAGEAGDEVTVDLMIGFLSEQEKLVWMLSAYAAK
>Q9KWH3 1.16.-.-~~~dps~~~DNA protection during starvation protein~~~COG0783
MTNTITENIYASIIHQVEKKENSGNEKTKAVLNQAVADLSKAASIVHQVHWYMRGSGFLYLHPKMDELMDALNGHLDEIS
ERLITIGGAPFSTLKEFDENSRLEETVGTWDKSITDHLKRLVQVYDYLSSLYQVGLDVTDEEDDAVSNDIFTAAQTEAQK
TIWMLQAELGQAPGL
>P0CB53 1.16.-.-~~~dps~~~DNA protection during starvation protein~~~COG0783
MMKQKYYQSPAEIASFSPRPSLADSKAVLNQAVADLSVAHSILHQVHWYMRGRGFMIWHPKMDEYMEEIDGYLDEMSERL
ITLGGAPFSTLKEFSENSQLKEVLGDYNVTIEEQLARVVEVFRYLAALFQKGFDVSDEEGDSVTNDIFNVAKASIEKHIW
MLQAELGQAPKL
>Q55024 1.16.-.-~~~dps~~~DNA protection during starvation protein~~~COG0783
MTNTGLVQSFSQIEPNVLGLETSVTSQICEGLNRALASFQVLYLQYQKHHFTVQGAEFYSLHEFFEDSYGSVKDHVHDLG
ERLNGLGGVPVAHPLKLAELTCFAIEEDGVFNCRTMLEHDLAAEQAILSLLRRLTAQVESLGDRATRYLLEGILLKTEER
AYHIAHFLAPDSLKLA
>Q7CJ65 1.16.-.-~~~dps~~~DNA protection during starvation protein~~~COG0783
MSTAKLVKTKPSELLYTRNDVEEHVKVATIKRLNQMVIQFIDLSLITKQAHWNMRGANFVAVHEMLDGFRTALTDHLDTF
AERAVQLGGVALGTAQVINDKTPLKSYPTNIHSVQEHLKALAERYAIVANDIRKAITEVEDENSADMFTAASRDLDKFLW
FIESNIE
>Q9RW95 ~~~~~~Probable type IV piliation system protein DR_0774~~~COG1450
MNKRHALLLTAVLGMATAYAQTAPTTTTVNTLQTVYRDPSLTSAPITANVGKYVGPLSTFLASIAKSAGYEVVFNFNIDA
LALINGEIVFGNSTASVTTSYATPLGRPQELPAKPVVHNFSNAPFNEAWPLLMDVYELDYQLVKVGSANVIRIGQRPKQL
ALPLKFISAESALTAIEKFFGEEKFETVISLDSNNKPFQTTRPTGKFGLPNSIKVIPDSSNKRLIIGSNSEDGIRIRSFV
ETIDVQSSGKVISTDSISEIYIVRGQKESVLQFLRDSFPELIVTDYASGGLAIEGPRTSVNRAIILLGQVDRAPEIPIVQ
RIYTVRGQAADITALLAAQYPTLRVTPVGQTGQLVLNGAQAQLDTALALLEQVDRPAPVAESRTVQRVFQLVNASAEEVK
ATLEGTLARDLTADSNNDVLPNVPVTATDANGNTTVVSVPNALGKTANQGTANAQAQTAQTPANTQQATLIADKRTNSLI
VRGTPEQVAQVAELVPQLDQVVPQINVQVRIQEVNERALQSLGLNWRATFGGFNVAVSGGTGLAATFNPTQSFLGFNIFP
TLTALETQGLTRRVYDGNVTMQSGQRSLSATGGAQNASSGAAASVKSGGRLEINIPSAAGNIVRQIDYGLNLDFFSPQVA
PDGTITLRIRGQVNQPATAITADSLPNLIDFTNSEAQSTITFKNGQTILMSGLLGSTETTNRSGVPFLSSLPGVGAAFGE
KRTEKTQSQLLVIITGTVVK
>Q7DF83 ~~~tnpA~~~ISDra2 transposase TnpA~~~COG1943
MTYVILPLEMKKGRGYVYQLEYHLIWCVKYRHQVLVGEVADGLKDILRDIAAQNGLEVITMEVMPDHVHLLLSATPQQAI
PDFVKALKGASARRMFVAYPQLKEKLWGGNLWNPSYCILTVSENTRAQIQKYIESQHDKE
>Q7DF80 3.1.21.-~~~tnpB~~~RNA-guided DNA endonuclease TnpB~~~COG0675
MIRNKAFVVRLYPNAAQTELINRTLGSARFVYNHFLARRIAAYKESGKGLTYGQTSSELTLLKQAEETSWLSEVDKFALQ
NSLKNLETAYKNFFRTVKQSGKKVGFPRFRKKRTGESYRTQFTNNNIQIGEGRLKLPKLGWVKTKGQQDIQGKILNVTVR
RIHEGHYEASVLCEVEIPYLPAAPKFAAGVDVGIKDFAIVTDGVRFKHEQNPKYYRSTLKRLRKAQQTLSRRKKGSARYG
KAKTKLARIHKRIVNKRQDFLHKLTTSLVREYEIIGTEHLKPDNMRKNRRLALSISDAGWGEFIRQLEYKAAWYGRLVSK
VSPYFPSSQLCHDCGFKNPEVKNLAVRTWTCPNCGETHDRDENAALNIRREALVAAGISDTLNAHGGYVRPASAGNGLRS
ENHATLVV
>P24093 ~~~draA~~~Dr hemagglutinin structural subunit~~~
MKKLAIMAAASMVFAVSSAHAGFTPSGTTGTTKLTVTEECQVRVGDLTVAKTRGQLTDAAPIGPVTVQALGCDARQVALK
ADTDNFEQGKFFLISDNNRDKLYVNIRPTDNSAWTTDNGVFYKNDVGSWGGIIGIYVDGQQTNTPPGNYTLTLTGGYWAK
>P14300 3.2.2.24~~~draG~~~ADP-ribosyl-[dinitrogen reductase] glycohydrolase~~~
MTGPSVHDRALGAFLGLAVGDALGATVEFMTKGEIAQQYGIHRKMTGGGWLRLKPGQITDDTEMSLALGRSLAAKGTLDV
ADICEEFALWLKSRPVDVGNTCRRGIRRYMHEGTTTAPYSEGDAGNGAAMRCLPAALATLGHPADLEPWVLAQARITHNH
PLSDAACLTLGRMVHHLIGGRGMKACREEANRLVHQHRDFHFEPYKGQSSAYIVDTMQTVLHYYFVTDTFKSCLIQTVNQ
GGDADTTGALAGMLAGATYGVDDIPSGWLSKLDMKVEREIRRQVDALLALAGLD
>P14299 2.4.2.37~~~draT~~~NAD(+)--dinitrogen-reductase ADP-D-ribosyltransferase~~~
MKDMGEDRPGIGHSTNLVGLPTDLLASAWFNQAAPEIHIAGVREMNRSLFEMLAEAPDLESAGEAFYKYMIAMFGLDPEQ
QDHRPGQGGAVRRFHASYLRLLKGWGYDTNAKEGAVLKGWVESRFGLFPTFHREPITKFASKAWITYIEEKMTSRFHNNS
IYVQLDLMYEFCQWALARFAAPGESALLLYRGVNDFTEHQMIERIDNRQVVVRMNNLVSFSSDRGVADCFGDTILETRVP
VSKIVFFNTLLTSHPLKGEGEYLVIGGDYLVKASYL
>P0DTQ0 4.1.2.-~~~drdA~~~5-deoxy-D-ribulose 1-phosphate aldolase~~~
MLLQKEREEIVAYGKKMISSGLTKGTGGNISIFNREQGLVAISPSGLEYYETKPEDVVILNLDGEVIEGERKPSSELDMH
LIYYRKREDINALVHTHSPYAKTIASLGWELPAVSYLIAFAGPNVRCAPYETFGTKQLADAAFEGMIDRRAVLLANHGLI
AGANNIKMAFTVAEEIEFCAQIYYQTKSIGEPKLLPEDEMENLAKKFEGYGQQ
>P0DTQ1 5.3.1.-~~~drdI~~~5-deoxyribose 1-phosphate isomerase~~~
MMEEQLIPIQWKDDALVLLDQTLLPNEIVYESFKTAESVWDAIQVMKVRGAPVIGVSAAYGVYLGVKEFAESTEEGFMDE
VRKVCTYLATSRPTAVNLFWALERMESVAADNIHLSISQLKDRLLEEAKEIHREDEEINRQIGEHALTLFHDGMGVLTHC
NAGALATTKYGTATAPMYLAKEKGWDLKIFSDETRPRLQGSTLTALELQRAGIDVTVITDNMAAMVMSQGKIDAVIVGCD
RVAANGDIANKIGTLGVSILAKYYNIPFYVAAPTPTIDLKTPTGKEIPIEERDASEVINRFGQYSAPKESKVYNPAFDVT
PHENVTAIITEKGIVKAPFTENLKKIFQ
>P0DTQ2 2.7.1.-~~~drdK~~~5-deoxyribose kinase~~~
MSKFTKYFLMEASDVIAYVKEKLSKFEHAKGLKCKEIGDGNLNYVFRVWDKKENMSVIVKQAGDTARISDEFKLSTNRIR
IESDVLQLENELAPGLVPKVYLFDSVMNCCVMEDLSNHTILRTALINHQIFPQLADDLTTFMVNTLLLTSDVVMNHKEKK
ELVKNYINPELCEITEDLVYSEPFTNHNKRNELFPLNEGWIREHIYSDKELRMEVAKRKFSFMTNAQALLHGDLHTGSVF
VRDDSTKVIDPEFAFYGPMGYDVGNVMANLMFAWVNADATMPPGAEKDTYMDWLQTTMVEVIDLFKKKFLDAWDIHVTEI
MAKEEGFNEVYLQSVLEDTAAVTGLELIRRIVGLAKVKDITCIENEEARARAEQICLKVAKSFILRANQYRTGRSFVETL
KEQSMHYAE
>Q55233 ~~~drgA~~~Protein DrgA~~~COG0778
MDTFDAIYQRRSVKHFDPDHRLTAEEERKLHEAAIQAPTSFNIQLWRFLIIRDPQLRQTIREKYGNQAQMTDASLLILVA
ADVNAWDKDPARYWRNAPREVANYLVGAIASFYGGKPQLQRDEAQRSIGMAMQNLMLAAKAMGYDSCPMIGFDLQKVAEL
VKLPADYAIGPMVAIGKRTEDAPGKRRSNSPGRIPLGKLLCLTKVWCLAI
>P08038 3.1.21.-~~~dns~~~Extracellular deoxyribonuclease~~~COG2356
MMIFRFVTTLAASLPLLTFAAPISFSHAKNEAVKIYRDHPVSFYCGCEIRWQGKKGIPDLESCGYQVRKNENRASRIEWE
HVVPAWQFGHQLQCWQQGGRKNCTRTSPEFNQMEADLHNLTPAIGEVNGDRSNFSFSQWNGVDGVTYGQCEMQVNFKERT
AMPPERARGAIARTYLYMSEQYGLRLSKAQSQLMQAWNNQYPVSEWECVRDQRIEKVQGNSNRFVREQCPN
>Q99QV3 3.1.1.-~~~~~~Lactonase drp35~~~
MMSQQDLPTLFYSGKSNSAVPIISESELQTITAEPWLEISKKGLQLEGLNFDRQGQLFLLDVFEGNIFKINPETKEIKRP
FVSHKANPAAIKIHKDGRLFVCYLGDFKSTGGIFAATENGDNLQDIIEDLSTAYCIDDMVFDSKGGFYFTDFRGYSTNPL
GGVYYVSPDFRTVTPIIQNISVANGIALSTDEKVLWVTETTANRLHRIALEDDGVTIQPFGATIPYYFTGHEGPDSCCID
SDDNLYVAMYGQGRVLVFNKRGYPIGQILIPGRDEGHMLRSTHPQFIPGTNQLIICSNDIEMGGGSMLYTVNGFAKGHQS
FQFQ
>Q7A338 3.1.1.-~~~~~~Lactonase drp35~~~
MMSQQDLPTLFYSGKSNSAVPIISESELQTITAEPWLEISKKGLQLEGLNFDRQGQLFLLDVFEGNIFKINPETKEIKRP
FVSHKANPAAIKIHKDGRLFVCYLGDFKSTGGIFAATENGDNLQDIIEDLSTAYCIDDMVFDSKGGFYFTDFRGYSTNPL
GGVYYVSPDFRTVTPIIQNISVANGIALSTDEKVLWVTETTANRLHRIALEDDGVTIQPFGATIPYYFTGHEGPDSCCID
SDDNLYVAMYGQGRVLVFNKRGYPIGQILIPGRDEGHMLRSTHPQFIPGTNQLIICSNDIEMGGGSMLYTVNGFAKGHQS
FQFQ
>Q9S0S3 3.1.1.-~~~~~~Lactonase drp35~~~
MMSQQDLPTLFYSGKSNSAVPIISESELQTITAEPWLEISKKGLQLEGLNFDRQGQLFLLDVFEGNIFKINPETKEIKRP
FVSHKANPAAIKIHKDGRLFVCYLGDFKSTGGIFAATENGDNLQDIIEDLSTAYCIDDMVFDSKGGFYFTDFRGYSTNPL
GGVYYVSPDFRTVTPIIQNISVANGIALSTDEKVLWVTETTAKRLHRIALEDDGVTIQPFGATIPYYFTGHEGPDSCCID
SDDNLYVAMYGQGRVLVFNKRGYPIGQILIPGRDEGHMLRSTHPQFIPGTNQLIICSNDIEMGGGSMLYTVNGFAKGHQS
FQFQ
>P76334 ~~~drpB~~~Cell division protein DrpB~~~
MEYGSTKMEERLSRSPGGKLALWAFYTWCGYFVWAMARYIWVMSRIPDAPVSGFESDLGSTAGKWLGALVGFLFMALVGA
LLGSIAWYTRPRPARSRRYE
>Q5ZSQ3 ~~~drrA~~~Multifunctional virulence effector protein DrrA~~~
MSIMGRIKMSVNEEQFGSLYSDERDKPLLSPTAQKKFEEYQNKLANLSKIIRENEGNEVSPWQEWENGLRQIYKEMIYDA
FDALGVEMPKDMEVHFAGSLAKAQATEYSDLDAFVIVKNDEDIKKVKPVFDALNNLCQRIFTASNQIYPDPIGINPSRLI
GTPDDLFGMLKDGMVADVEATAMSILTSKPVLPRYELGEELRDKIKQEPSFSNMVSAKKFYNKAIKDFTAPKEGAEVVSV
KTHIMRPIDFMLMGLREEFNLYSEDGAHLSAPGTIRLLREKNLLPEEQIARIESVYNQAMSKRFELHAEHKKEHDEMPYS
DAKAMLDEVAKIRELGVQRVTRIENLENAKKLWDNANSMLEKGNISGYLKAANELHKFMKEKNLKEDDLRPELSDKTISP
KGYAILQSLWGAASDYSRAAATLTESTVEPGLVSAVNKMSAFFMDCKLSPNERATPDPDFKVGKSKILVGIMQFIKDVAD
PTSKIWMHNTKALMNHKIAAIQKLERSNNVNDETLESVLSSKGENLSEYLSYKYATKDEGREHRYTASTENFKNVKEKYQ
QMRGDALKTEILADFKDKLAEATDEQSLKQIVAELKSKDEYRILAKGQGLTTQLLGLKTSSVSSFEKMVEETRESIKSQE
RQTIKIK
>Q29ST3 ~~~drrA~~~Multifunctional virulence effector protein DrrA~~~
MSIMGRIKMSVNEEQFGSLYSDERDKPLLSPTAQKKFEEYQNKLANLSKIIRENEGNEVSPWQEWENGLRQIYKEMIYDA
FDALGVEMPKDMEVHFAGSLAKAQATEYSDLDAFVIVKNDEDIKKVKPVFDALNNLCQRIFTASNQIYPDPIGINPSRLI
GTPDDLFGMLKDGMVADVEATAMSILTSKPVLPRYELGEELRDKIKQEPSFSNMVSAKKFYNKAIKDFTAPKEGAEVVSV
KTHIMRPIDFMLMGLREEFNLYSEDGAHLSAPGTIRLLREKNLLPEEQIARIESVYNQAMSKRFELHAEHKKEHDEMPYS
DAKAMLDEVAKIRELGVQRVTRIENLENAKKLWDNANSMLEKGNISGYLKAANELHKFMKEKNLKEDDLRPELSDKTISP
KGYAILQSLWGAASDYSRAAATLTESTVEPGLVSAVNKMSAFFMDCKLSPNERATPDPDFKVGKSKILVGIMQFIKDVAD
PTSKIWMHNTKALMNHKIAAIQKLERSNNVNDETLESVLSSKGENLSEYLSYKYATKDEGREHRYTASTENFKNVKEKYQ
QMRGDALKTEILADFKDKLAEATDEQSLKQIVAELKSKDEYRILAKGQGLTTQLLGLKTSSVSSFEKMVEETRESIKSQE
RQTIKIK
>P9WQL9 7.6.2.-~~~drrA~~~Doxorubicin resistance ATP-binding protein DrrA~~~COG1131
MRNDDMAVVVNGVRKTYGKGKIVALDDVSFKVRRGEVIGLLGPNGAGKTTMVDILSTLTRPDAGSAIIAGYDVVSEPAGV
RRSIMVTGQQVAVDDALSGEQNLVLFGRLWGLSKSAARKRAAELLEQFSLVHAGKRRVGTYSGGMRRRIDIACGLVVQPQ
VAFLDEPTTGLDPRSRQAIWDLVASFKKLGIATLLTTQYLEEADALSDRIILIDHGIIIAEGTANELKHRAGDTFCEIVP
RDLKDLDAIVAALGSLLPEHHRAMLTPDSDRITMPAPDGIRMLVEAARRIDEARIELADIALRRPSLDHVFLAMTTDPTE
SLTHLVSGSAR
>P32010 7.6.2.-~~~drrA~~~Daunorubicin/doxorubicin resistance ATP-binding protein DrrA~~~
MNTQPTRAIETSGLVKVYNGTRAVDGLDLNVPAGLVYGILGPNGAGKSTTIRMLATLLRPDGGTARVFGHDVTSEPDTVR
RRISVTGQYASVDEGLTGTENLVMMGRLQGYSWARARERAAELIDGFGLGDARDRLLKTYSGGMRRRLDIAASIVVTPDL
LFLDEPTTGLDPRSRNQVWDIVRALVDAGTTVLLTTQYLDEADQLADRIAVIDHGRVIAEGTTGELKSSLGSNVLRLRLH
DAQSRAEAERLLSAELGVTIHRDSDPTALSARIDDPRQGMRALAELSRTHLEVRSFSLGQSSLDEVFLALTGHPADDRST
EEAAEEEKVA
>P9WG23 ~~~drrB~~~Doxorubicin resistance ABC transporter permease protein DrrB~~~COG0842
MSGPAIDASPALTFNQSSASIQQRRLSTGRQMWVLYRRFAAPSLLNGEVLTTVGAPIIFMVGFYIPFAIPWNQFVGGASS
GVASNLGQYITPLVTLQAVSFAAIGSGFRAATDSLLGVNRRFQSMPMAPLTPLLARVWVAVDRCFTGLVISLVCGYVIGF
RFHRGALYIVGFCLLVIAIGAVLSFAADLVGTVTRNPDAMLPLLSLPILIFGLLSIGLMPLKLFPHWIHPFVRNQPISQF
VAALRALAGDTTKTASQVSWPVMAPTLTWLFAFVVILALSSTIVLARRP
>P32011 ~~~drrB~~~Daunorubicin/doxorubicin resistance ABC transporter permease protein DrrB~~~
MTTSPGTVESTTPVSGQLRTVLSAGERPARATAVSATLTHLWRAMMAFKHFPVQLIDIVLMPLIFLLMFTYLFGGAFADS
TEEYLQFYLPGVTVQAVVMMTVYTGTSLNTDIHKGVFDRFRTLPFWQPATLAGSLLGDVLRYVVALATTVSLGLLLGFRA
DGGFLGVVGAMLVLIVFGFSVSWIFAALGVVASEPERVSGTSMIVLYPLLFMSNIFVMPETMPGWMQAIVDANPMSHAAT
ASRELMHGTAGFWDVGLVLCVSAGLVAVFAPLTMRLYRNKNAH
>P9WG21 ~~~drrC~~~Probable doxorubicin resistance ABC transporter permease protein DrrC~~~COG0842
MITTTSQEIELAPTRLPGSQNAARLFVAQTLLQTNRLLTRWARDYITVIGAIVLPILFMVVLNIVLGNLAYVVTHDSGLY
SIVPLIALGAAITGSTFVAIDLMRERSFGLLARLWVLPVHRASGLISRILANAIRTLVTTLVMLGTGVVLGFRFRQGLIP
SLMWISVPVILGIAIAAMVTTVALYTAQTVVVEGVELVQAIAIFFSTGLVPLNSYPGWIQPFVAHQPVSYAIAAMRGFAM
GGPVLSPMIGMLVWTAGICVVCAVPLAIGYRRASTH
>Q9RTJ3 ~~~~~~Desiccation/radiation resistance protein DR_1769~~~COG1520
MPDPAARRFSLPPFPLAALALSVALLGAPASLAQTAAPAPAPAQTPAAASRAAAPRVSWTKPLKVSSPVSIGPRGELTYV
GNDSRVHRTDASGKELWSFALGDIGRAQPVLTPDGGVITAAYDDTVYALSPAGQLTWKAKLDGDVFASPALRPDGSVVVA
TAGGTVYALSSSGQTLWSYKVGAPVFSSPAVAADGTIYFGAQNGRLHALSPEGRLTWTYAARSSVFSSPALDAEGNLYFG
SGDRSIYSLSPAGTLRWVQPTGLFVNASPIVTRAGLVVVGSYDGQLYALNTNGQVAWTYAAGAAIAAPAAELSDGSVVVG
DLNGTLHAVTPTGQALWTLPGGAKIDTGAAVSDQGTLYFAVDGGNLNAVENLRPLATGPWPTFRASPLGWGRSATAAELT
AQTQARQAAAAASTSQQPRLPTLAQAPAPTPAPAQTTPRPQPTPAQPATPAAPVPPVASPAPATARLTPQVIGGVIYLPL
SPIAAGLGYQVQAGSPTRALLVAGSQRLTVPVRAVGGQSLIALRFVTGLPGVSVERQAGTLTLRREGLSAALPLDLAQLL
PWAPQPEFPGVVRR
>Q7W2Q0 ~~~dsbA~~~Thiol:disulfide interchange protein DsbA~~~
MQSTTFTRLLAAAALAATTLFAPATQAQGAQQYVNINPPMPSDTPGKIEVLEFFAYTCPHCAAIEPMVEDWAKTAPQDVV
LKQVPIAFNAGMKPLQQLYYTLQALERPDLHPKVFTAIHTERKRLFDKKAMGEWAASQGVDRAKFDSVFDSFSVQTQVQR
ASQLAEAAHIDGTPAFAVGGRYMTSPVLAGNDYAGALKVVDQLIVQSRK
>Q7W0K2 ~~~dsbA~~~Thiol:disulfide interchange protein DsbA~~~COG1651
MQSTTFTRLLAAAALGATTLFAPATQAQGAQQYVNINPPMPSDTPGKIEVLEFFAYTCPHCAAIEPMVEDWAKTAPQDVV
LKQVPIAFNAGMKPLQQLYYTLQALERPDLHPKVFTAIHTERKRLFDKKAMGEWAASQGVDRAKFDSVFDSFSVQTQVQH
ASQLAEAAHIDGTPAFAVGGRYMTSPVLAGNDYAGALKVVDQLIVQSRK
>P0AEG4 ~~~dsbA~~~Thiol:disulfide interchange protein DsbA~~~COG1651
MKKIWLALAGLVLAFSASAAQYEDGKQYTTLEKPVAGAPQVLEFFSFFCPHCYQFEEVLHISDNVKKKLPEGVKMTKYHV
NFMGGDLGKDLTQAWAVAMALGVEDKVTVPLFEGVQKTQTIRSASDIRDVFINAGIKGEEYDAAWNSFVVKSLVAQQEKA
AADVQLRGVPAMFVNGKYQLNPQGMDTSNMDVFVQQYADTVKYLSEKK
>P0C2B2 ~~~dsbA~~~Thiol:disulfide interchange protein DsbA~~~
MRNLILTAMLAMASLFGMAAQADDYTAGKEYVELSSPVPVSQPGKIEVVELFWYGCPHCYAFEPTIVPWSEKLPADVHFV
RLPALFGGIWNVHGQMFLTLESMGVEHDVHNAVFEAIHKEHKKLATPEEMADFLAGKGVDKEKFLSTYNSFAIKGQMEKA
KKLAMAYQVTGVPTMVVNGKYRFDIGSAGGPEETLKLADYLIEKERAAAKK
>P0A2H9 ~~~dsbA~~~Thiol:disulfide interchange protein DsbA~~~
MKKIWLALAGMVLAFSASAAQISDGKQYITLDKPVAGEPQVLEFFSFYCPHCYQFEEVLHVSDNVKKKLPEGTKMTKYHV
EFLGPLGKELTQAWAVAMALGVEDKVTVPLFEAVQKTQTVQSAADIRKVFVDAGVKGEDYDAAWNSFVVKSLVAQQEKAA
ADLQLQGVPAMFVNGKYQINPQGMDTSSMDVFVQQYADTVKYLVDKK
>P52235 ~~~dsbA~~~Thiol:disulfide interchange protein DsbA~~~
MKKIWLALAGLVLAFSASAAQYEDGKQYTTLEKPVAGAPQVLEFFSFFCPHCYQFEEVLHISDNVKKKLPEGVKMTKYHV
NFMGGDLGKDLTQAWAVAMALGVEDKVTVPLFEGVQKTQTIRSASDIRDVFINAGIKGEEYDAAWNSFVVKSLVAQQEKA
AADVQLRGVPAMFVNGKYQLNPQGMDTSNMDVFVQQYADTVKYLSEEK
>P32557 ~~~dsbA~~~Thiol:disulfide interchange protein DsbA~~~COG1651
MKKLFALVATLMLSVSAYAAQFKEGEHYQVLKTPASSSPVVNEFFSFYCPHCNTFEPIIAQLKQQLPEGAKFQKNHVSFM
GGNMGQAMSKAYATMIALEVEDKMVPVMFNRIHTLRKPPKDEQELRQIFLDEGIDAAKFDAAYNGFAVDSMVRRFDKQFQ
DSGLTGVPAVVVNNRYLVQGQSVKSLDEYFDLVNYLLTLK
>Q63RY4 ~~~dsbB~~~Disulfide bond formation protein B~~~COG1495
MNNLTLSLRRERRLLVLLALVCLALLAGALYLQYVKNEDPCPLCIIQRYFFVLIAVFAFIGAGMASGAGVAVTEALIVLS
AAAGVGTAARHLYVQLNPGFSCGFDALQPVVDSLPPARWLPGVFKVAGLCETVYPPIFGILLPGWALIAFVLIAVPVAVS
LLRHRGRLR
>P0A6M2 ~~~dsbB~~~Disulfide bond formation protein B~~~COG1495
MLRFLNQCSQGRGAWLLMAFTALALELTALWFQHVMLLKPCVLCIYERCALFGVLGAALIGAIAPKTPLRYVAMVIWLYS
AFRGVQLTYEHTMLQLYPSPFATCDFMVRFPEWLPLDKWVPQVFVASGDCAERQWDFLGLEMPQWLLGIFIAYLIVAVLV
VISQPFKAKKRDLFGR
>Q7VU58 ~~~dsbC~~~Probable thiol:disulfide interchange protein DsbC~~~COG1651
MSGPSFSGAGMNFRITVWCAAAAVWSSGALAQDGAGQAAPGTPDKVYSTTGSAPAKPGDKVYSTRSAQAPDPQADAVKER
FAQRFEGFDVTAVRRTPYGLFEVQIGTDLLYTDEKVTWVMEGPLIDALTRRDVTRERQEKLSSVPFEELPLDLAVKQVKG
DGSRVMAVFEDPNCGYCKQLHRTLEDMDNITVYTFLYPILSPDSTTKVRDIWCASDPAKVWKDWMVRGQRPPTAECDAPV
DQWLALGRQLMVRGTPAIFFKSGGRVSGALPRDELEARL
>P0AEG6 ~~~dsbC~~~Thiol:disulfide interchange protein DsbC~~~COG1651
MKKGFMLFTLLAAFSGFAQADDAAIQQTLAKMGIKSSDIQPAPVAGMKTVLTNSGVLYITDDGKHIIQGPMYDVSGTAPV
NVTNKMLLKQLNALEKEMIVYKAPQEKHVITVFTDITCGYCHKLHEQMADYNALGITVRYLAFPRQGLDSDAEKEMKAIW
CAKDKNKAFDDVMAGKSVAPASCDVDIADHYALGVQLGVSGTPAVVLSNGTLVPGYQPPKEMKEFLDEHQKMTSGK
>P45111 ~~~dsbC~~~Thiol:disulfide interchange protein DsbC~~~COG1651
MKKIFTALLCVAAANAMADDAAIKRKLQSFNISNIVIKSSPISGIKTAVTDQGILYVSEDGKYLFEGKLYELTNNGPVDV
AGKILVDKLNSYKDEMIVYPAKNEKHVVTVFMDITCHYCHLLHQQLKEYNDLGITVRYLAFPRAGMNNQTAKQMEAIWTA
KDPVFALNEAEKGNLPKEVKTPNIVKKHYELGIQFGVRGTPSIVTSTGELIGGYLKPADLLRALEETAQ
>P55890 ~~~dsbC~~~Thiol:disulfide interchange protein DsbC~~~
MKKRFMMFTLLAAVFSGVAHADDAAIRQSLAKLGVQSTEIQASPVAGMKTVLTHSGVLYVTDDGKHIIQGPMYDVSGAHP
VNVTNKLLMSQLNALEKEMIVYKAPDEKHVITVFTDITCGYCHKLHEEMKDYNALGITVRYLAFPRQGLESQAEQDMKSI
WCAKDKNKAFDDAMAGKGVKPASCDVNIADHYALGVQLGVSGTPAIVLSNGYVVPGYQGPKEMKAFLDEHQKQTSGK
>P58162 1.8.1.8~~~dsbD~~~Thiol:disulfide interchange protein DsbD~~~COG4232
MAQRIFTLILLLCSTSVFAGLFDAPGRSQFVPVDQAFAFDFQQNQHDLNLTWQIKDGYYLYRKQIRITPEHAKIADVQLP
QGVWHEDEFYGKSEIYRDRLTLPVTINQASAGATLTVTYQGCADAGFCYPPETKTVPLSEVVANNAASQPVSVPQQEQPT
AQLPFSALWALLIGIGIAFTPCVLPMYPLISGIVLGGKQRLSTARALLLTFIYVQGMALTYTALGLVVAAAGLQFQAALQ
HPYVLIGLTIVFTLLAMSMFGLLTLQLPSSLQTRLTLMSNRQQGGSPGGVFIMGTIAGLICSPCTTAPLSAILLYIAQSG
NMWLGGGTLYLYALGMGLPLMLITVFGNRLLPKSGPWMEQVKTAFGFVILALPVFLLERVIGDVWGLRLWSALGVAFFGW
AFITSLQAKRGWMRVVQIILLAAALVSVRPLQDWAFGATHTAQTQTHLNFTQIKTVDELNQALVEAKGKPVMLDLYADWC
VACKEFEKYTFSDPQVQKALADTVLLQANVTANDAQDVALLKHLNVLGLPTILFFDGQGQEHPQARVTGFMDAETFSAHL
RDRQP
>P36655 1.8.1.8~~~dsbD~~~Thiol:disulfide interchange protein DsbD~~~COG4232
MAQRIFTLILLLCSTSVFAGLFDAPGRSQFVPADQAFAFDFQQNQHDLNLTWQIKDGYYLYRKQIRITPEHAKIADVQLP
QGVWHEDEFYGKSEIYRDRLTLPVTINQASAGATLTVTYQGCADAGFCYPPETKTVPLSEVVANNAAPQPVSVPQQEQPT
AQLPFSALWALLIGIGIAFTPCVLPMYPLISGIVLGGKQRLSTARALLLTFIYVQGMALTYTALGLVVAAAGLQFQAALQ
HPYVLIGLAIVFTLLAMSMFGLFTLQLPSSLQTRLTLMSNRQQGGSPGGVFVMGAIAGLICSPCTTAPLSAILLYIAQSG
NMWLGGGTLYLYALGMGLPLMLITVFGNRLLPKSGPWMEQVKTAFGFVILALPVFLLERVIGDVWGLRLWSALGVAFFGW
AFITSLQAKRGWMRIVQIILLAAALVSVRPLQDWAFGATHTAQTQTHLNFTQIKTVDELNQALVEAKGKPVMLDLYADWC
VACKEFEKYTFSDPQVQKALADTVLLQANVTANDAQDVALLKHLNVLGLPTILFFDGQGQEHPQARVTGFMDAETFSAHL
RDRQP
>Q9JYM0 1.8.1.8~~~dsbD~~~Thiol:disulfide interchange protein DsbD~~~
MKKLICLFAVFLMLCGRAFALDANDLLPPEKAFVPELAVADDGVNVRFRIADGYYMYQAKIVGKTDPADLLGQPSFSKGE
EKEDEFFGRQTVYHHEAQVAFPYAKAVGEPYKLVLTYQGCAEAGVCYPPVDTEFDIFGNGTYHPQTDEPASAKDRFLQPS
SQNGSGALPPPKGDEGGDSRFKLSWDTLNANLLAFFLAGLGLSFTACMYPLLPIVSSIVVGDKKAGKARAFVLSVVYVQG
LALTYTLVGIVAGLTGALLTVWLQQAWVVLAASALMVVLALSMFGLFNIQLPNAVQSYFQNQSSRLSGGKIVSVFIMGIL
SALIVGPCVAPPLAFALGYIGQTGDAVLGGLALYTLALGTGVPLIAIGTFGGHILPKAGDWMNAVKYAFGFILLAVAVYL
ATPHLPYYLVVALYTLLMLVPAFMLLVNGRRQKRRPKAVAFALGGILLIGGAWFGWQGANGKTTALHHFLTLNPPAEAGK
SSEHGKMFADTAALKAAMDTALKEHPDKPVVLDFYADWCISCKEMAAYTLNQPEVHQAVDMERFFQIDVTANTPEHQALL
KEYGLFGPPGVFVVRSDGSRSEPLLGFVKADKFIEWYEQNR
>Q328D2 1.8.1.8~~~dsbD~~~Thiol:disulfide interchange protein DsbD~~~
MAQRIFTLILLLCSTSVFAGLFDAPGRSQFVPADQAFTFDFQQNQHDLNLTWQIKDGYYLYRKQIRITPEHAKIADVQLP
QGVWHEDEFYGKSEIYRDRLTLPVTINQASAGATLTVTYQGCADAGFCYPPETKTVPLSEVVANNAAPQPVSVPQQEQPT
AQLPFSALWALLIGIGIAFTPCVLPMYPLISGIVLGGKQRLSTARALLLTFIYVQGMALTYTALGLVVAAAGLQFQAALQ
HPYVLIGLAIVFTLLAMSMFGLFTLQLPSSLQTRLTLMSNRQQGGSPGGVFVMGAIAGLICSPCTTAPLSAILLYIAQSG
NMWLGGGTLYLYALGMGLPLMLITVFGNRLLPKSGPWMEQVKTAFGFVILALPVFLLERVIGDIWGLRLWSALGVAFFGG
AFITSLQAKRGWMRVVQIILLAAALVSVRPLQDWAFGATHTAQTQTHLNFTQIKTVDELNQALVEAKGKPVMLDLYADWC
VACKEFEKYTFSDPQVQKALADTVLLQANVTANDAQDVALLKHLNVLGLPTILFFDGQGQEHPQARVTGFMDAETFSAHL
RDRQP
>P0AA86 ~~~dsbE~~~Thiol:disulfide interchange protein DsbE~~~COG0526
MKRKVLLIPLIIFLAIAAALLWQLARNAEGDDPTNLESALIGKPVPKFRLESLDNPGQFYQADVLTQGKPVLLNVWATWC
PTCRAEHQYLNQLSAQGIRVVGMNYKDDRQKAISWLKELGNPYALSLFDGDGMLGLDLGVYGAPETFLIDGNGIIRYRHA
GDLNPRVWEEEIKPLWEKYSKEAAQ
>Q9I3N1 ~~~dsbE~~~Thiol:disulfide interchange protein DsbE~~~
MKRAILLLPLGIFLIVAVFLFRGLWLDPSELPSALIGKPFPAFDLPSVQDPARRLTEADLKGKPALVNVWGTWCPSCRVE
HPELTRLAEQGVVIYGINYKDDNAAAIKWLNELHNPYLLSISDADGTLGLDLGVYGAPETYLIDKQGIIRHKIVGVVDQK
VWREQLAPLYQQLLDEPEAR
>P77202 ~~~dsbG~~~Thiol:disulfide interchange protein DsbG~~~COG1651
MLKKILLLALLPAIAFAEELPAPVKAIEKQGITIIKTFDAPGGMKGYLGKYQDMGVTIYLTPDGKHAISGYMYNEKGENL
SNTLIEKEIYAPAGREMWQRMEQSHWLLDGKKDAPVIVYVFADPFCPYCKQFWQQARPWVDSGKVQLRTLLVGVIKPESP
ATAAAILASKDPAKTWQQYEASGGKLKLNVPANVSTEQMKVLSDNEKLMDDLGANVTPAIYYMSKENTLQQAVGLPDQKT
LNIIMGNK
>Q9Z6Y0 1.8.-.-~~~dsbH~~~Disulfide bond reductase DsbH~~~COG0526
MKFWLQGCAFVGCLLLTLPCCAARRRASGENLQQTRPIAAANLQWESYAEALEHSKQDHKPICLFFTGSDWCMWCIKMQD
QILQSSEFKHFAGVHLHMVEVDFPQKNHQPEEQRQKNQELKAQYKVTGFPELVFIDAEGKQLARMGFEPGGGAAYVSKVK
SALKLR
>Q8FDI3 ~~~dsbI~~~Protein-disulfide oxidoreductase DsbI~~~COG1495
MGIKGMWKDLRTSPVDTLVRWQEQRLLWLLMAVAMGALIILAHSFFQIYLYMAPCEQCVYIRYAMFVMVIGGLVAAINPK
NIILKLIGCVMAFYGSILGLKFSLKLNDIHHAVHNPDPDSLFGVQGCSTDPTFPFNLPLAQWAPNWFKPTGDCGYDAPIV
PDGVTLSSTQQWFVEMYQQSEGWYLLPPWHFMNMAQACMLAFGMCLVLLVIMSGAWALKIIRG
>P0A4L7 ~~~dsbL~~~Thiol:disulfide interchange protein DsbL~~~COG2761
MSKLGISSLFKTILLTAALAVSFTASAFTEGTDYMVLEKPIPNADKTLIKVFSYACPFCYKYDKAVTGPVSEKVKDIVAF
TPFHLETKGEYGKQASEVFAVLINKDKAAGISLFDANSQFKKAKFAYYAAYHDKKERWSDGKDPAAFIKTGLDAAGMSQA
DFEAALKEPAVQETLEKWKASYDVAKIQGVPAYVVNGKYLIYTKSIKSIDAMADLIRELASK
>P46068 ~~~dsdC~~~HTH-type transcriptional regulator DsdC~~~COG0583
MEPLREIRNRLLNGWQLSKMHTFEVAARHQSFALAAEELSLSPSAVSHRINQLEEELGIQLFVRSHRKVELTHEGKRVYW
ALKSSLDTLNQEILDIKNQELSGTLTLYSRPSIAQCWLVPALGDFTRRYPSISLTVLTGNDNVNLQRAGIDLAIYFDDAP
SAQLTHHFLMDEEILPVCSPEYAQRHALTNTVINLCHCTLLHDRQAWSNDSGTDEWHSWAQHYAVNLPTSSGIGFDRSDL
AVIAAMNHIGVAMGRKRLVQKRLASGELVAPFGDMTVKCHQHYYITTLPGRQWPKIEAFIIWLREQVKTTS
>A0A0H2VAP9 ~~~dsdX~~~D-serine transporter DsdX~~~COG2610
MHSQIWVVSTLLISIVLIVLTIVKFKFHPFLALLLASFFVGTMMGMGPLDMVNAIESGIGGTLGFLAAVIGLGTILGKMM
EVSGAAERIGLTLQRCRWLSADVIMVLVGLICGITLFVEVGVVLLIPLAFSIAKKTNTSLLKLAIPLCTALMAVHCVVPP
HPAALYVANKLGADIGSVIVYGLLVGLMASLIGGPLFLKFLGQRLPFKPVPTEFADLKVRDEKTLPSLGATLFTVLLPIA
LMLVKTIAELNMARESGLYTLLEFIGNPITATFIAVFVAYYVLGIRQHMSMGTMLTHTENGFGSIANILLIIGAGGAFNA
ILKSSSLADTLAVILSNMHMHPILLAWLVALILHAAVGSATVAMMGATAIVAPMLPLYPDISPEIIAIAIGSGAIGCTIV
TDSLFWLVKQYCGATLNETFKYYTTATFIASVIALAGTFLLSFII
>P08555 ~~~dsdX~~~D-serine transporter DsdX~~~COG2610
MHSQIWVVSTLLISIVLIVLTIVKFKFHPFLALLLASFFVGTMMGMGPLDMVNAIESGIGGTLGFLAAVIGLGTILGKMM
EVSGAAERIGLTLQRCRWLSVDVIMVLVGLICGITLFVEVGVVLLIPLAFSIAKKTNTSLLKLAIPLCTALMAVHCVVPP
HPAALYVANKLGADIGSVIVYGLLVGLMASLIGGPLFLKFLGQRLPFKPVPTEFADLKVRDEKTLPSLGATLFTILLPIA
LMLVKTIAELNMARESGLYILVEFIGNPITAMFIAVFVAYYVLGIRQHMSMGTMLTHTENGFGSIANILLIIGAGGAFNA
ILKSSSLADTLAVILSNMHMHPILLAWLVALILHAAVGSATVAMMGATAIVAPMLPLYPDISPEIIAIAIGSGAIGCTIV
TDSLFWLVKQYCGATLNETFKYYTTATFIASVVALAGTFLLSFII
>Q88JU3 4.2.1.118~~~quiC1~~~3-dehydroshikimate dehydratase~~~COG1082
MQRSIATVSLSGTLPEKLEAIAAAGFDGVEIFENDLLYYAGSPRQVRQMCADLGIAITLFQPFRDFEGCRRDRLQKNLDR
AERKFDLMQELGTDLVLVCSNVQADALGDEQLLVDDLRLLGEHAGKRGLRIGYEALAWGRHVNTYQQVWNLVRQADHPAL
GVILDSFHTLSLKGDPSAIRDIPGDKIFFVQMADAPILAMDVLEWSRHFRCFPGQGEMDMAGFLAPILATGYRGPLSLEI
FNDGFRAAPTRQNAADGLRSLLYLEEQTRLRLEQENTPIEPGVLFSPPPASAYDGVEFLEFAVDEAVGARLGNWLKRLGF
AEAGKHRSKEVQLLRQGDINIVLNAEPYSFGHNFFEAHGPSLCATALRVKDQQAALKRATAFRGQPFRGLVGPNECEVPA
VRAPDGSLLYLVEQGTAGHTLYDTDFSLDNNATATGGLRRIDHMALALPAESLDSWVLFYKSLFDFAADDEVVLPDPYGL
VKSRALRSQCGTLRLPLNISENRNTAIAHALSSYRGSGVHHIAFDCDDIFREVARAKLAGVPLLEIPLNYYDDLAARFDF
DDEFLSELAYYNVLYDRDAQGGELFHVYTEPFEERFFFEIIQRKAGYAGYGAANVAVRLAAMAKARSGAARKPVL
>P0AEG8 ~~~dsrB~~~Protein DsrB~~~
MKVNDRVTVKTDGGPRRPGVVLAVEEFSEGTMYLVSLEDYPLGIWFFNEAGHQDGIFVEKAE
>D3RPC1 ~~~dsrE2~~~Sulfur carrier protein DsrE2~~~COG2210
MEQKKLAIIATKGSLDWAYPPFILASTAAALGYEVQVFFTFYGLQLLKKKPNLEVTPLGNPGMPMPMGMDKWFPVLGLAL
PGMQGMMTAMMKQKMKSKGVASIEELRELCQEAEVKMIACQMTVDLFDMPKAEFIDGVEYAGAAAFFEFAGESDICLYI
>O87896 2.8.1.-~~~dsrE~~~Putative sulfurtransferase DsrE~~~COG1553
MKFALQINEGPYQHQASDSAYQFAKAALEKGHEIFRVFFYHDGVNNSTRLTTPPQDDRHIVNRWAELAEQYELDMVVCVA
AAQRRGIVDEGEASRNGKDATNIHPKFRISGLGQLVEAAIQADRLVVFGD
>O87897 ~~~dsrF~~~Intracellular sulfur oxidation protein DsrF~~~COG2923
MSEVVKKFMYLNRKAPYGTIYAWEALEVVLIGAAFDQDVCVLFLDDGVYQLTRGQDTKGIGMKNFSPTYRTLGDYEVRRI
YVDRDSLEARGLTQDDLVEIAFEDMETEEEFDNIVEVIDSARVSELMNESDAVFSF
>Q57366 1.7.2.3~~~dmsA~~~Dimethyl sulfoxide/trimethylamine N-oxide reductase~~~
MTKLSGQELHAELSRRAFLSYTAAVGALGLCGTSLLAQGARAEGLANGEVMSGCHWGVFKARVENGRAVAFEPWDKDPAP
SHQLPGVLDSIYSPTRIKYPMVRREFLEKGVNADRSTRGNGDFVRVTWDEALDLVARELKRVQESYGPTGTFGGSYGWKS
PGRLHNCQVLMRRALNLAGGFVNSSGDYSTAAAQIIMPHVMGTLEVYEQQTAWPVVVENTDLMVFWAADPMKTNEIGWVI
PDHGAYAGMKALKEKGTRVICINPVRTETADYFGADVVSPRPQTDVALMLGMAHTLYSEDLHDKDFLENCTTGFDLFAAY
LTGESDGTPKTAEWAAEICGLPAEQIRELARSFVAGRTMLAAGWSIQRMHHGEQAHWMLVTLASMIGQIGLPGGGFGLSY
HYSNGGSPTSDGPALGGISDGGKAVEGAAWLSESGATSIPCARVVDMLLNPGGEFQFNGATATYPDVKLAYWAGGNPFAH
HQDRNRMLKAWEKLETFIVQDFQWTATARHADIVLPATTSYERNDIESVGDYSNRAILAMKKVVDPLYEARSDYDIFAAL
AERLGKGAEFTEGRDEMGWISSFYEAAVKQAEFKNVAMPSFEDFWSEGIVEFPITEGANFVRYADFREDPLFNPLGTPSG
LIEIYSKNIEKMGYDDCPAHPTWMEPAERLGGAGAKYPLHVVASHPKSRLHSQLNGTSLRDLYAVAGHEPCLINPADAAA
RGIADGDVLRVFNDRGQILVGAKVSDAVMPGAIQIYEGGWYDPLDPSEEGTLDKYGDVNVLSLDVGTSKLAQGNCGQTIL
ADVEKYAGAPVTVTVFDTPKGA
>Q52675 1.7.2.3~~~dorA~~~Dimethyl sulfoxide/trimethylamine N-oxide reductase~~~
MTKFSGNELRAELYRRAFLSYSVAPGALGMFGRSLLAKGARAEALANGTVMSGSHWGVFTATVENGRATAFTPWEKDPHP
TPMLEGVLDSIYSPTRIKYPMVRREFLEKGVNADRSTRGNGDFVRVSWDQALDLVAAEVKRVEETYGPQGVFGGSYGWKS
PGRLHNCTTLLRRMLTLAGGYVNGAGDYSTGAAQVIMPHVVGTLEVYEQQTAWPVLAENTEVMVFWAADPIKTSQIGWVI
PEHGAYPGLEALKAKGTKVIVIDPVRTKTVEFFGADHVTPKPQTDVAIMLGMAHTLVAEDLYDKDFIANYTSGFDKFLPY
LMGETDSTPKTAEWASDISGVPAETIKELARLFISKRTMLAAGWSMQRMHHGEQAHWMLVTLASMLGQIGLPGGGFGLSY
HYSGGGTPSTSGPALSGITDGGAATKGPEWLAASGASVIPVARVVDMLENPGAEFDFNGTRSKFPDVKMAYWVGGNPFVH
HQDRNRMVKAWEKLETFIVHDFQWTPTARHADIVLPATTSYERNDIETIGDYSNTGILAMKKIVEPLYEARSDYDIFAAV
AERLGKGKEFTEGKDEMGWIKSFYDDAAKQGKAGGVEMPAFDAFWAEGIVEFPVTDGADFVRYASFREDPLLNPLGTPTG
LIEIYSKNIEKMGYDDCPAHPTWMEPLERLDGPGAKYPLHIAASHPFNRLHSQLNGTVLREGYAVQGHEPCLMHPDDAAA
RGIADGDVVRVHNDRGQILTGVKVTDAVMKGVIQIYEGGWYDPSDVTEPGTLDKYGDVNVLSADIGTSKLAQGNCGQTVL
AEVEKYTGPAVTLTGFVAPKAAE
>P45574 ~~~dsvA~~~Sulfite reductase, dissimilatory-type subunit alpha~~~COG2221
MAKHATPKLDQLESGPWPSFVSDIKQEAAYRAANPKGLDYQVPVDCPEDLLGVLELSYDEGETHWKHGGIVGVFGYGGGV
IGRYCDQPEKFPGVAHFHTVRVAQPSGKYYSADYLRQLCDIWDLRGSGLTNMHGSTGDIVLLGTQTPQLEEIFFELTHNL
NTDLGGSGSNLRTPESCLGKSRCEFACYDSQAACYELTMEYQDELHRPAFPYKFKFKFDACPNGCVASIARSDFSVIGTW
KDDIKIDAEAVKAYVAGEFKPNAGAHSGRDWGKFDIEAEVVNRCPSKCMKWDGSKLSIDNKECVRCMHCINTMPRALHIG
DERGASILCGAKAPILDGAQMGSLLVPFVAAEEPFDEIKEVVEKIWDWWMEEGKNRERLGETMKRLSFQKLLEVTEIAPV
PQHVKEPRTNPYIFFKEEEVPGGWDRDITEYRKRHLR
>P45575 1.8.99.5~~~dsvB~~~Sulfite reductase, dissimilatory-type subunit beta~~~COG2221
MAFISSGYNPEKPMANRITDIGPRKFDEFFPPVIAKNFGSWLYHEILEPGVLMHVAESGDKVYTVRVGAARLMSITHIRE
MCDIADKYCGGHLRFTTRNNVEFMVADEASLKALKEDLASRKFDGGSLKFPIGGTGAGVSNIVHTQGWVHCHTPATDASG
PVKAIMDEVFEDFQSMRLPAPVRISLACCINMCGAVHCSDIGVVGIHRKPPMIDHEWTDQLCEIPLAVASCPTAAVRPTK
LEIGDKKVNTIAIKNERCMYCGNCYTMCPALPISDGEGDGVVIMVGGKVSNRISMPKFSKVVVAYIPNEPPRWPSLTKTI
KHIIEVYSANAYKYERLGEWAERIGWERFFSLTGLEFSHHLIDDFRDPAYYTWRQSTQFKF
>P45573 1.8.99.5~~~dsvC~~~Sulfite reductase, dissimilatory-type subunit gamma~~~COG2920
MAEVTYKGKSFEVDEDGFLLRFDDWCPEWVEYVKESEGISDISPDHQKIIDFLQDYYKKNGIAPMVRILSKNTGFKLKEV
YELFPSGPGKGACKMAGLPKPTGCV
>Q46582 ~~~dsvD~~~Protein DsvD~~~
MEEAKQKVVDFLNSKSGSKSKFYFNDFTDLFPDMKQREVKKILTALVNDEVLEYWSSGSTTMYGLKGAGKQAAAEHED
>Q0ZIH7 1.14.14.22~~~dszA~~~Dibenzothiophene-sulfone monooxygenase~~~
MTQQRQMDLAGFFSAGNVTHAHGAWRHTDASNDFLSGKYYQHIARTLERGKFDLLFLPDGLAVEDSYGDNLDTGVGLGGQ
GAVALEPASVVATMAAVTEHLGLGATISATYYPPYHVARVFATLDQLSGGRVSWNVVTSLNDAEARNFGINQHLEHDARY
DRADEFLEAVKKLWNSWDEDALVLDKAAGVFADPAKVHYVDHHGEWLNVRGPLQVPRSPQGEPVILQAGLSPRGRRFAGK
WAEAVFSLAPNLEVMQATYQGIKAEVDAAGRDPDQTKIFTAVMPVLGESQAVAQERLEYLNSLVHPEVGLSTLSSHTGIN
LAAYPLDTPIKDILRDLQDRNVPTQLHMFAAATHSEELTLAEMGRRYGTNVGFVPQWAGTGEQIADELIRHFEGGAADGF
IISPAFLPGSYDEFVDQVVPVLQDRGYFRTEYQGNTLRDHLGLRVPQLQGQPS
>P54995 1.14.14.22~~~dszA~~~Dibenzothiophene-sulfone monooxygenase~~~
MTQQRQMHLAGFFSAGNVTHAHGAWRHTDASNDFLSGKYYQHIARTLERGKFDLLFLPDGLAVEDSYGDNLDTGVGLGGQ
GAVALEPASVVATMAAVTEHLGLGATISATYYPPYHVARVFATLDQLSGGRVSWNVVTSLNDAEARNFGINQHLEHDARY
DRADEFLEAVKKLWNSWDEDALVLDKAAGVFADPAKVHYVDHHGEWLNVRGPLQVPRSPQGEPVILQAGLSPRGRRFAGK
WAEAVFSLAPNLEVMQATYQGIKAEVDAAGRDPDQTKIFTAVMPVLGESQAVAQERLEYLNSLVHPEVGLSTLSSHTGIN
LAAYPLDTPIKDILRDLQDRNVPTQLHMFAAATHSEELTLAEMGRRYGTNVGFVPQWAGTGEQIADELIRHFEGGAADGF
IISPAFLPGSYDEFVDQVVPVLQDRGYFRTEYQGNTLRDHLGLRVPQLQGQPS
>Q6WNP3 1.14.14.22~~~dszA~~~Dibenzothiophene-sulfone monooxygenase~~~
MTQQRQMHLAGFFSAGNVTHAHGAWRHTDASNDFLSGKYYQHIARTLERGKFDLLFLPDGLAVEDSYGDNLDTGVGLGGQ
GAVALEPASVVATMAAVTEHLGLGATISATYYPPYHVARVFATLDQLSGGRVSWNVVTSLNDAEARNFGINQHLEHDARY
DRADEFLEAVKKLWNSWDEDALVLDKAAGVFADPAKVHYVDHHGEWLNVRGPLQVPRSPQGEPVILQAGLSPRGRRFAGK
WAEAVFSLAPNLEVMQATYQGIKAEVDAAGRDPDQTKIFTAVMPVLGESQAVAQERLEYLNSLVHPEVGLSTLSSHTGIN
LAAYPLDTPIKDILRDLQDRNVPTQLHMFAAATHSEELTLAEMGRRYGTNVGFVPQWAGTGEQIADELIRHFEGGAADGF
IISPAFLPGSYDEFVDQVVPVLQDRGYFRTEYQGNTLRDHLGLRVPQLQGQPS
>P0DW79 3.13.1.3~~~dszB~~~2'-hydroxybiphenyl-2-sulfinate desulfinase~~~
MTSRVDPANPGSELDSAIRDTLTYSNCPVPNALLTASESGFLDAAGIELDVLSGQQGTVHFTYDQPAYTRFGGEIPPLLS
EGLRAPGRTRLLGITPLLGRQGFFVRDDSPITAAADLAGRRIGVSASAIRILRGQLGDYLELDPWRQTLVALGSWEARAL
LHTLEHGELGVDDVELVPISSPGVDVPAEQLEESATVKGADLFPDVARGQAAVLASGDVDALYSWLPWAGELQATGARPV
VDLGLDERNAYASVWTVSSGLVRQRPGLVQRLVDAAVDAGLWARDHSDAVTSLHAANLGVSTGAVGQGFGADFQQRLVPR
LDHDALALLERTQQFLLTNNLLQEPVALDQWAAPEFLNNSLNRHR
>P54997 3.13.1.3~~~dszB~~~2'-hydroxybiphenyl-2-sulfinate desulfinase~~~
MTSRVDPANPGSELDSAIRDTLTYSNCPVPNALLTASESGFLDAAGIELDVLSGQQGTVHFTYDQPAYTRFGGEIPPLLS
EGLRAPGRTRLLGITPLLGRQGFFVRDDSPITAAADLAGRRIGVSASAIRILRGQLGDYLELDPWRQTLVALGSWEARAL
LHTLEHGELGVDDVELVPISSPGVDVPAEQLEESATVKGADLFPDVARGQAAVLASGDVDALYSWLPWAGELQATGARPV
VDLGLDERNAYASVWTVSSGLVRQRPGLVQRLVDAAVDAGLWARDHSDAVTSLHAANLGVSTGAVGQGFGADFQQRLVPR
LDHDALALLERTQQFLLTNNLLQEPVALDQWAAPEFLNNSLNRHR
>Q6WNP2 3.13.1.3~~~dszB~~~2'-hydroxybiphenyl-2-sulfinate desulfinase~~~
MTSRVDPANPGSELDSAIRDTLTYSNCPVPNALLTASESGFLDAAGIELDVLSGQQGTVHFTYDQPAYTRFGGEIPPLLS
EGLRAPGRTRLLGITPLLGRQGFFVRDDSPITAAADLAGRRIGVSASAIRILRGQLGDYLELDPWRQTLVALGSWEARAL
LHTLEHGELGVDDVELVPISSPGVDVPAEQLEESATVKGADLFPDVARGQAAVLASGDVDALYSWLPWAGELQATGARPV
VDLGLDERNAYASVWTVSSGLVRQRPGLVQRLVDAAVDAGLWARDHSDAVTSLHAANLGVSTGAVGQGFGADFQQRLVPR
LDHDALALLERTQQFLLTNNLLQEPVALDQWAAPEFLNNSLNRHR
>A0A0C6DRW4 1.14.14.21~~~dszC~~~Dibenzothiophene monooxygenase~~~
MTLSPEKQHVRPRDAADNDPVAVARGLAEKWRATAVERDRAGGSATAEREDLRASGLLSLLVPREYGGWGADWPTAIEVV
REIAAADGSLGHLFGYHLTNAPMIELIGSQEQEEHLYTQIAQNNWWTGNASSENNSHVLDWKVRATPTEDGGYVLNGTKH
FCSGAKGSDLLFVFGVVQDDSPQQGAIIAAAIPTSRAGVTPNDDWAAIGMRQTDSGSTDFHNVKVEPDEVLGAPNAFVLA
FIQSERGSLFAPIAQLIFANVYLGIAHGALDAAREYTRTQARPWTPAGIQQATEDPYTIRSYGEFTIALQGADAAAREAA
HLLQTVWDKGDALTPEDRGELMVKVSGVKALATNAALNISSGVFEVIGARGTHPRYGFDRFWRNVRTHSLHDPVSYKIAD
VGKHTLNGQYPIPGFTS
>Q0ZIH5 1.14.14.21~~~dszC~~~Dibenzothiophene monooxygenase~~~
MTLSPEKEHVRPRDAADNDPVAVARGLAEKWRATAVERDRAGGSATAEREDLRASALLSLLVPREYGGWGADWPTAIEVV
REIAAADGSLGHLFGYHLTNAPMIELIGSQEQEEHLYTQIAQNNWWTGNASSENNSHELDVKVSATPTEDGGYVLNGTKH
FCSGAKGSDLLFVFGVVQDDSPQQGAIIAAAIPTSRAGVTPNDDWAAIGMRQTDSGSTDFHNVKVEPDEVLGAPNAFVLA
FIQSERGSLFRPIAQLIFANVYLGIAHGALDAAREYTRTQARPWTPAGIQQATEDPYTIRSYGEFTIALQGADAAAREAA
HLVQTVWDKGDALTPEDRGELMAKVSGVKSLATNAALNISSGVFEVIGARGTHPRYGFDRFWRNVRTHSLHDPVSYKIAD
VGKHTLNGQYPIPGFTS
>B2CML6 1.14.14.21~~~dszC~~~Dibenzothiophene monooxygenase~~~
MTLTDDTTTAQNSRHGDPIEVARELTRKWQTTVVERDKAGGSATEEREDLRASGLLSVTVPRHLGGWGADWPTALEVVRE
IAKVDGSLGHLFGYHLSTPAVIDLWGSPEQKERLLRQLAENNWWTGNASSENNSHILDWKVTATPADDGGYFFNGIKHFS
SGAKGSDLLLVFGVIPEGFPQQGAIVAAAIPTTREGVQPNDDWQALGMRRTDSGTTEFHNVAVRPDEVLGKPNAILEAFL
ASGRGSLFGPIVQLVFSSVYLGIARGALETAREYTRTQARPWTPAGVTQAVEDPYTIRSYGEFGIQLQAADAAAREAAQL
LQAAWDKGDALTSQERGELMVQISGVKAIATQAALDVTSRIFEVIGARGTHPKYGFDRFWRNIRTHTLHDPVSYKIAEVG
NYVLNQRYPIPGFTS
>P54998 1.14.14.21~~~dszC~~~Dibenzothiophene monooxygenase~~~
MTLSPEKQHVRPRDAADNDPVAVARGLAEKWRATAVERDRAGGSATAEREDLRASGLLSLLVPREYGGWGADWPTAIEVV
REIAAADGSLGHLFGYHLTNAPMIELIGSQEQEEHLYTQIAQNNWWTGNASSENNSHVLDWKVSATPTEDGGYVLNGTKH
FCSGAKGSDLLFVFGVVQDDSPQQGAIIAAAIPTSRAGVTPNDDWAAIGMRQTDSGSTDFHNVKVEPDEVLGAPNAFVLA
FIQSERGSLFAPIAQLIFANVYLGIAHGALDAAREYTRTQARPWTPAGIQQATEDPYTIRSYGEFTIALQGADAAAREAA
HLLQTVWDKGDALTPEDRGELMVKVSGVKALATNAALNISSGVFEVIGARGTHPRYGFDRFWRNVRTHSLHDPVSYKIAD
VGKHTLNGQYPIPGFTS
>Q6WNP1 1.14.14.21~~~dszC~~~Dibenzothiophene monooxygenase~~~
MTLSPEKQHVRPRDAADNDPVAVARGLAEKWRATAVERDRAGGSATAEREDLRASGLLSLLVPREYGGWGADWPTAIEVV
REIAAADGSLGHLFGYHLTNAPMIELIGSQEQEEHLYTQIAQNNWWTGNASSENNSHVLDWKVSATPTEDGGYVLNGTKH
FCSGAKGSDLLFVFGVVQDDSPQQGAIIAAAIPTSRAGVTPNDDWAAIGMRQTDSGSTDFHNVKVEPDEVLGAPNAFVLA
FIQSERGSLFAPIAQLIFANVYLGIAHGALDAAREYTRTQARPWTPAGIQQATEDPYTIRSYGEFTIALQGADAAAREAA
HLLQTVWDKGDALTPEDRGELMVKVSGVKALATNAALNISSGVFEVIGARGTHPRYGFDRFWRNVRTHSLHDPVSYKIAD
VGKHTLNGQYPIPGFTS
>B6CDL6 1.5.1.37~~~dszD~~~NADH:FMN oxidoreductase~~~
MSATDLSPTSLREAFGHFPSGVIAIAAEVDGTRVGLAASTFVPVSLEPPLVAFCVQNSSTTWPKLKDLPSLGISVLGEAH
DTAARTLAAKTGDRFAGLETESRDSGAVFINGTSVWLESAIEQLVPAGDHTIVVLRVSDIVINEAVPPIVFHRSAFRKLG
A
>P0DW80 1.5.1.42~~~dszD~~~NADH:FMN oxidoreductase~~~
MSDKPNAVSSHTTPDVPEVAATPELSTGICAGDYRAALRRHPAGVTVVTLDSGTGPVGFTATSFSSVSLEPPLVSFNIAE
TSSSINALKAAESLVIHLLGEHQQHLAQRFARSADQRFADESLWAVLDTGEPVLHGTPSWMRVKVDQLIPVGDHTLVIGL
VTRVHAEEDDESAAAPLLYHEGKYYRPTPLGQ
>O68503 1.5.1.42~~~dszD~~~NADH:FMN oxidoreductase~~~
MSDKPNAVSSHTTPDVPEVAATPELSTGICAGDYRAALRRHPAGVTVVTLDSGTGPVGFTATSFSSVSLEPPLVSFNIAE
TSSSINALKAAESLVIHLLGEHQQHLAQRFARSADQRFADESLWAVLDTGEPVLHGTPSWMRVKVDQLIPVGDHTLVIGL
VTRVHAEEDDESAAAPLLYHEGKYYRPTPLGQ
>Q6Q0M6 1.5.1.42~~~dszD~~~NADH:FMN oxidoreductase~~~
MSDKPNAVSSHTTPDVPEVAATPELSTGICAGDYRAALRRHPAGVTVVTLDSGTGPVGFTATSFSSVSLEPPLVSFNIAE
TSSSINALKAAESLVIHLLGEHQQRLAQRFAGSADQRFADESLWAVLDTGEPVLHGTPSWMRVKVDQLIPVGDHTLVIGL
VTRVHAEEDDESAAAPLLYHEGKYYRPTPLGQ
>C1KKR1 5.1.3.31~~~~~~D-tagatose 3-epimerase~~~
MKNPVGIISMQFIRPFTSESLHFLKKSRALGFDFIELLVPEPEDGLDAAEVRRICEGEGLGLVLAARVNLQRSIASEEAA
ARAGGRDYLKYCIEAAEALGATIVGGPLYGEPLVFAGRPPFPWTAEQIATRAARTVEGLAEVAPLAASAGKVFGLEPLNR
FETDIVNTTAQAIEVVDAVGSPGLGVMLDTFHMNMEERSIPDAIRATGARLVHFQANENHRGFPGTGTMDWTAIARALGQ
AGYAGPVSLEPFRRDDERVALPIAHWRAPHEDEDEKLRAGLGLIRSAITLAEVTH
>O50580 5.1.3.31~~~~~~D-tagatose 3-epimerase~~~
MNKVGMFYTYWSTEWMVDFPATAKRIAGLGFDLMEISLGEFHNLSDAKKRELKAVADDLGLTVMCCIGLKSEYDFASPDK
SVRDAGTEYVKRLLDDCHLLGAPVFAGLTFCAWPQSPPLDMKDKRPYVDRAIESVRRVIKVAEDYGIIYALEVVNRFEQW
LCNDAKEAIAFADAVDSPACKVQLDTFHMNIEETSFRDAILACKGKMGHFHLGEANRLPPGEGRLPWDEIFGALKEIGYD
GTIVMEPFMRKGGSVSRAVGVWRDMSNGATDEEMDERARRSLQFVRDKLA
>O82872 4.1.2.42~~~~~~D-threonine aldolase~~~
MSQEVIRGIALPPPAQPGDPLARVDTPSLVLDLAPFEANLRAMQAWADRHDVALRPHAKAHKCPEIALRQLALGARGICC
QKVSEALPFVAAGIQDIHISNEVVGPAKLALLGQLARVAKISVCVDNAHNLSQVSQAMVQAGAQIDVLVEVDVGQGRCGV
SDDALVLALAQQARDLPGVNFAGLQAYHGSVQHYRTREERAEVCRQAARIAASYAQLLRESGIACDTITGGGTGSAEFDA
ASGVYTELQAGSYAFMDGDYGANEWDGPLAFENSLFVLATVMSKPAPDRVILDAGLKSTTAECGPPAIFGEPGLTYTAIN
DEHGVVRVEPGAQAPDLGAVLRLVPSHVDPTFNLHDGLVVVRDGVVEDIWEISARGFSR
>M1V9Q0 4.2.3.148~~~dtcycA~~~Diterpene cyclase DtcycA~~~
MTDPAVTPLAFSIPQLYCPFPTAIHPEVDTLTRAGMDFMTHHGFCNTEADRLVVANIDAGAIVARWYPNPDFPVDRLQMV
TDFLYLYFLIDDLRFEVINSDTGLAGPIALFAQHLDLWEYPQAHRREELDLFHQAIHDLASRMAELTTPTKAARMRRSIN
GWFLALLREIALFNDDHAVMAEEYLPIRVVTVASRLMIDVNGFICPAEVPGDEWYSLKVQAAAEAAMSVCLYDNELYSAG
KEQWLKSRATAHDRRPRNLVALIQAQTGGSTEHALQEVAEYRNRTVCLYLNLRSQLEKTASPALLAYLSVLDGVISGNLD
AHATSSRYHNPDGHHPHAIAFTPLRTTDECSARAHTPIAPPIAWWWEQLDQ
>M1VDX3 4.2.3.150~~~dtcycB~~~Diterpene cyclase DtcycB~~~
MDLPPALLSFYCPIASEVSPEHEAVAQEMYAWIHAMSLTSDNRQAKMLAQAGAGFNSYFTPRARGELARALSKYNVCAWI
ANGMVQEIRDPGTFGAMAARWARIMEEPATCPADGIPMDFALADAFSHIRRTLSPVKWQHFSAAQSHWMHGLAWENCLHQ
VKGLTVHDYLSFRYVMSGCFAAAAFAYAVPERHPSAEEWAHPKVRAAADAAMMVDALDNDRYSYLKESLTEADKKTIFAA
LRHENPALGREEVIVRGVQLRDRILTLYLTLRGELLCDASEGLRSYLTGLDLIIAGNLVFCADMGLRYGLPEGSVRTDAE
PLDRTVAPPGIGAIDHWWAQAGA
>P73335 3.1.1.96~~~dtd3~~~D-aminoacyl-tRNA deacylase~~~COG0084
MHLVDTHVHINFDVFAADLDQLQHRWRQAGVVQLVHSCVKPQEFDQIQSLADRFPELFFAVGLHPLDAEDWQDNTAGQIL
AYAKADDRVVAIGEMGLDFFKADNRDHQIEVFRAQLAIARELNKPVIIHCRDAAQTMRQVLTDFQAESGPVAGVMHCWGG
TPEETQWFLDLGFYISFSGTVTFKKAEGIQASAQMVPPDRLLVETDCPFLAPVPQRGKRNEPAFVRHVAEAIAALRHVPL
ETLAQQTTTNARNLFKLPVPA
>O66742 3.1.1.96~~~dtd~~~D-aminoacyl-tRNA deacylase~~~COG1490
MRAVIQRVKKSWVEVDGKVVGSINEGLNVFLGVRKGDTEEDIEKLVNKILNLRIFEDERGKFQYSVLDIKGEILVVSQFT
LYANVKKGRRPSFEEAEEPKRAKELYEKFVDKIKESGLKVETGIFGAMMDVFIENWGPVTIIIDSREI
>A0A023W421 3.1.1.96~~~dtd~~~D-aminoacyl-tRNA deacylase~~~COG1490
MKLVVQRVTDASVTVDGAVAGRIGPGIMALVGVTHEDTEEDAAYLADKIVNLRIFDDESGKMNLSLLDTGGEILSVSQFT
LYGETKKGRRPNFMNAAKPDQALLLYEKWNELLREKGVKVETGIFGAMMDVQLTNSGPVTLIMDSKQ
>P0A6M4 3.1.1.96~~~dtd~~~D-aminoacyl-tRNA deacylase~~~COG1490
MIALIQRVTRASVTVEGEVTGEIGAGLLVLLGVEKDDDEQKANRLCERVLGYRIFSDAEGKMNLNVQQAGGSVLVVSQFT
LAADTERGMRPSFSKGASPDRAEALYDYFVERCRQQEMNTQTGRFAADMQVSLVNDGPVTFWLQV
>P44814 3.1.1.96~~~dtd~~~D-aminoacyl-tRNA deacylase~~~COG1490
MIALIQRVSQAKVDVKGETIGKIGKGLLVLLGVEKEDNREKADKLAEKVLNYRIFSDENDKMNLNVQQAQGELLIVSQFT
LAADTQKGLRPSFSKGASPALANELYEYFIQKCAEKLPVSTGQFAADMQVSLTNDGPVTFWLNV
>P9WNS9 3.1.1.96~~~dtd~~~D-aminoacyl-tRNA deacylase~~~COG1490
MRVLVQRVSSAAVRVDGRVVGAIRPDGQGLVAFVGVTHGDDLDKARRLAEKLWNLRVLADEKSASDMHAPILVISQFTLY
ADTAKGRRPSWNAAAPGAVAQPLIAAFAAALRQLGAHVEAGVFGAHMQVELVNDGPVTVMLEG
>P0A026 3.1.1.96~~~dtd~~~D-aminoacyl-tRNA deacylase~~~
MKVVVQRVKEASVTNDTLNNQIKKGYCLLVGIGQNSTEQDADVIAKKIANARLFEDDNNKLNFNIQQMNGEILSVSQFTL
YADVKKGNRPGFSNSKNPDQAVKIYEYFNDALRAYGLTVKTGEFGTHMNVSINNDGPVTIIYESQDGKIQ
>B2DFG5 4.3.1.27~~~dthadh~~~D-threo-3-hydroxyaspartate dehydratase~~~
MQDTLLTLDTPAAVIDLDRMQRNIARMQQRMDAQGVRLRPHVKTSKSVPVAAAQRAAGASGITVSTLKEAEQFFAAGTTD
ILYAVSMAPHRLPQALQLRRRGCDLKLIVDSVAAAQAIAAFGREQGEAFEVWIEIDTDGHRSGVGADDTPLLLAIGRTLH
DGGMRLGGVLTHAGSSYELDTPEALQALAERERAGCVQAAEALRAAGLPCPVVSVGSTPTALAASRLDGVTEVRAGVYVF
FDLVMRNIGVCAAEDVALSVLATVIGHQADKGWAIVDAGWMAMSRDRGTARQKQDFGYGQVCDLQGRVMPGFVLTGANQE
HGILARADGAAEADIATRFPLGTRLRILPNHACATGAQFPAYQALAADGSVQTWERLHGW
>A0QYC2 1.1.1.403~~~dthD~~~D-threitol dehydrogenase~~~COG1028
MTQAQELSVDFDFRLDGKVALVTGAASGIGAAIASAYATKGARIAAVDLNAEGAEALAAQLGGDRGAHRGFACDVADAAS
VQAAADAVAAEFGRIDILVNSAGVARLAPAEELSLQDWDSTLAINLSGTFLMCQAVGKRMLEAGGGAIVNMASQAATVAL
DQHVAYCASKFGVVGVSKVLAAEWGGRGVRVNTISPTVVLTELGHKAWDGPRGDALKKLIPTGRFAYPDEIAAAAVFLAS
DAAAMINGADLVIDGGYTIK
>A0A0H3LX82 2.7.1.219~~~dtnK~~~D-threonate kinase~~~COG3395
MGGPYIGIVADDLTGSGDTAVQFVRAGWATQLSVGGAEQALADPAVRQAEVLAVTTHSRPLAAADAAAVVRGEVERLRAA
GVQRLYKKVDSTLRGAFKAEIDAARLAWGEDAIAVVCPAFPVTGRTVRQGVLYVGDRPVTETSAATDPVTPVTESHIPTL
LGCAQLAAQAGETPAELARRIAAAAPVVVVDALDDADVQRLARAIGVLGQRAVPVGSGGLAAPLARVWAGGQAAGPVLVV
VTSQHSAARQQAAALQQAGARTWAPTLAQLADDRNWAAWTAEVEAAEHGMPAVDALMLLAPEGRLAGLDADSVARRLGEL
AARLVLAHGAAGVVATGGDGASAVLAALQASGIALVDEVTGGVPLGTLTGGQAAGLPVVTKAGGFGEQDVLIRAAQAIRE
RRFTK
>Q0K4F6 2.7.1.219~~~dtnK~~~D-threonate kinase~~~COG3395
MSWLIIADDLSGAADCAIGYAMSGARTVVTLEAAPAGADLSQADVVACDVDSRRMAPQEAAARNLEAWHRGQGASRRLYK
KIDSTLRGNWAAETAALAPLAGLAIVAPAFPATGRTTAGGCMFVNGQPLEDSDIWRLEALTGRADLVALLAARGLRATLL
PLDTVRAGDATLRLTIAGLAREGVRAVVCDAQTEQDLAALAAATAQLDVPAFWVGSGGLARALAAPCLFEGGAPQPLPAP
EGGPVLTLVGSLSGISGRQAACLRERTGMQSLVVPPRILREGAGHADWDAAQQSITGCLRAGRDLLVSIGRDDAFDPGEG
PRLSAALAQLSLPGFQHTRGLIATGGETARAMLSAAGIGALMLRREVEPGVPLSDTPALPGVPARRVATKAGAFGSEAAL
WHAWQAMTESRAPSA
>Q6D0N7 2.7.1.219~~~dtnK~~~D-threonate kinase~~~COG3395
MPNVQQSAGQVLVVADDFTGANDAGVGLAQHGARVSVVFDVNTLHADLLGDAVVINTDSRAARDDVASQRTAAAVAAWQA
VGGKGWIIKKIDSTLRGNLGAEVAAALSAADVPVALIAAASPTLGRVTRQGEVWVNGRRLTDTEFASDPKTPVTSASIAA
RLAEQTALPVAEIHLDEVRQANLAHRLQQLADEGTRLIILDTDVQDDLTHIVNAARALPFRPLLVGSAGLSDALATAQDF
TRKTEKPLLAVVGSMSDIAQKQIAAARLRSDVTLVEIDINALFSPDSSTVMASQCEDALKALTNGHHCIIRTCHNENQRF
EIDARCRELGLSRQQLGETISHYLGELTRSIVQALDSLAADGTRRRLPGGLYLSGGDIAIAVATALGATGFQIKGQIASC
VPWGYLLNSIVGMTPVMTKAGGFGNETTLLDVLRFIEEKVSE
>Q8ZRS5 2.7.1.219~~~dtnK~~~D-threonate kinase~~~
MKMIVIADDFTGSNDTGVQLAKKGARTEVMLSASQKPSRRADVLVINTESRAMPADQAASAVYAALSPWCETSPAPLVYK
KIDSTFRGNIGAEVTAAMRASQRKLAVIAAAIPAAGRTTLEGKCLVNGVPLLETEFASDPKTPIVSSRIAEIVALQSEIP
VYEVFLQDVRRGGLSALLTAYAAEGEGIIVVDAVEERDLTLIAQAACEQPSMPLLVGAAGLANALPVELFMQDRQRLPVL
VVAGSMSEATRRQVDNALCRGRAEVVDIDAARMVSDSAEQEIASVVEQACALLSQHRHTILRTSRRAEDRQLIDALCEKS
AMSRQQLGERLSQRLGVVTLNIIEQARIGGLFLTGGDIATAVAGALGAEGYRIQSEVAPCIPCGTFVNSEIDDLPVITKA
GGFGSDSTLCDALYYIEEMYCGD
>P77304 ~~~dtpA~~~Dipeptide and tripeptide permease A~~~COG3104
MSTANQKPTESVSLNAFKQPKAFYLIFSIELWERFGYYGLQGIMAVYLVKQLGMSEADSITLFSSFSALVYGLVAIGGWL
GDKVLGTKRVIMLGAIVLAIGYALVAWSGHDAGIVYMGMAAIAVGNGLFKANPSSLLSTCYEKNDPRLDGAFTMYYMSVN
IGSFFSMIATPWLAAKYGWSVAFALSVVGLLITIVNFAFCQRWVKQYGSKPDFEPINYRNLLLTIIGVVALIAIATWLLH
NQEVARMALGVVAFGIVVIFGKEAFAMKGAARRKMIVAFILMLEAIIFFVLYSQMPTSLNFFAIRNVEHSILGLAVEPEQ
YQALNPFWIIIGSPILAAIYNKMGDTLPMPTKFAIGMVMCSGAFLILPLGAKFASDAGIVSVSWLVASYGLQSIGELMIS
GLGLAMVAQLVPQRLMGFIMGSWFLTTAGANLIGGYVAGMMAVPDNVTDPLMSLEVYGRVFLQIGVATAVIAVLMLLTAP
KLHRMTQDDAADKAAKAAVA
>Q8ZPM6 ~~~dtpA~~~Dipeptide and tripeptide permease A~~~
MSTANKKPTESVSLNAFKQPKAFYLIFSIELWERFGYYGLQGIMAVYLVKQLGMSEADSITLFSSFSALVYGLVAIGGWL
GDKILGTKRVIMLGAVVLAIGYALVAWSGHDAGIVYMGMAAIAVGNGLFKANPSSLLSTCYAKDDPRLDGAFTMYYMSVN
IGSFFSMLATPWLAARYGWSTAFALSVVGMLITVVNFAFCQRWVKSYGSKPDFEPINFRNLLLTIVGIVVLIAVATWLLH
NQDIARMVLGVIALGIVIIFGKEAFSMHGAARRKMIVAFILMLQAIIFFVLYSQMPTSLNFFAIRNVEHSILGIAFEPEQ
YQALNPFWIIIGSPILAAIYNRMGDTLPMPMKFAIGMVLCSGAFLILPLGAKFANDAGIVSVNWLIASYGLQSIGELMIS
GLGLAMVAQLVPQRLMGFIMGSWFLTTAGANIIGGYVANLMAVPSDVTDPLMSLEVYGRVFMQIGIATAVIAVLMLLTAP
KLNRMTQDDDTAEKGSKAATV
>P36837 ~~~dtpB~~~Dipeptide and tripeptide permease B~~~COG3104
MNTTTPMGMLQQPRPFFMIFFVELWERFGYYGVQGVLAVFFVKQLGFSQEQAFVTFGAFAALVYGLISIGGYVGDHLLGT
KRTIVLGALVLAIGYFMTGMSLLKPDLIFIALGTIAVGNGLFKANPASLLSKCYPPKDPRLDGAFTLFYMSINIGSLIAL
SLAPVIADRFGYSVTYNLCGAGLIIALLVYIACRGMVKDIGSEPDFRPMSFSKLLYVLLGSVVMIFVCAWLMHNVEVANL
VLIVLSIVVTIIFFRQAFKLDKTGRNKMFVAFVLMLEAVVFYILYAQMPTSLNFFAINNVHHEILGFSINPVSFQALNPF
WVVLASPILAGIYTHLGNKGKDLSMPMKFTLGMFMCSLGFLTAAAAGMWFADAQGLTSPWFIVLVYLFQSLGELFISALG
LAMIAALVPQHLMGFILGMWFLTQAAAFLLGGYVATFTAVPDNITDPLETLPVYTNVFGKIGLVTLGVAVVMLLMVPWLK
RMIATPESH
>P39276 ~~~dtpC~~~Dipeptide and tripeptide permease C~~~COG3104
MKTPSQPRAIYYIVAIQIWEYFSFYGMRALLILYLTHQLGFDDNHAISLFSAYASLVYVTPILGGWLADRLLGNRTAVIA
GALLMTLGHVVLGIDTNSTFSLYLALAIIICGYGLFKSNISCLLGELYDENDHRRDGGFSLLYAAGNIGSIAAPIACGLA
AQWYGWHVGFALAGGGMFIGLLIFLSGHRHFQSTRSMDKKALTSVKFALPVWSWLVVMLCLAPVFFTLLLENDWSGYLLA
IVCLIAAQIIARMMIKFPEHRRALWQIVLLMFVGTLFWVLAQQGGSTISLFIDRFVNRQAFNIEVPTALFQSVNAIAVML
AGVVLAWLASPESRGNSTLRVWLKFAFGLLLMACGFMLLAFDARHAAADGQASMGVMISGLALMGFAELFIDPVAIAQIT
RLKMSGVLTGIYMLATGAVANWLAGVVAQQTTESQISGMAIAAYQRFFSQMGEWTLACVAIIVVLAFATRFLFSTPTNMI
QESND
>Q8X9D3 ~~~dtpD~~~Dipeptide permease D~~~COG3104
MNKHASQPRAIYYVVALQIWEYFSFYGMRALLILYLTNQLKYNDTHAYELFSAYCSLVYVTPILGGFLADKVLGNRMAVM
LGALLMAIGHVVLGASEIHPSFLYLSLAIIVCGYGLFKSNVSCLLGELYEPTDPRRDGGFSLMYAAGNVGSIIAPIACGY
AQEEYSWAMGFGLAAVGMIAGLVIFLCGNRHFTHTRGVNKKVLRATNFLLPNWGWLLVLLVATPALITVLFWKEWSVYAL
IVATIIGLGVLAKIYRKAENQKQRKELRLIVTLTFFSMLFWAFAQQGGSSISLYIDRFVNRDMFGYTVPTAMFQSINAFA
VMLCGVFLAWVVKESVAGNRTVRIWGKFALGLGLMSAGFCILTLSARWSAMYGHSSLPLMVLGLAVMGFAELFIDPVAMS
QITRIEIPGVTGVLTGIYMLLSGAIANYLAGVIADQTSQASFDASGAINYSINAYIEVFDQITWGALACVGVVLMIWLYQ
ALKFRNRALALES
>P75742 ~~~dtpD~~~Dipeptide permease D~~~COG3104
MNKHASQPRAIYYVVALQIWEYFSFYGMRALLILYLTNQLKYNDTHAYELFSAYCSLVYVTPILGGFLADKVLGNRMAVM
LGALLMAIGHVVLGASEIHPSFLYLSLAIIVCGYGLFKSNVSCLLGELYEPTDPRRDGGFSLMYAAGNVGSIIAPIACGY
AQEEYSWAMGFGLAAVGMIAGLVIFLCGNRHFTHTRGVNKKVLRATNFLLPNWGWLLVLLVATPALITILFWKEWSVYAL
IVATIIGLGVLAKIYRKAENQKQRKELGLIVTLTFFSMLFWAFAQQGGSSISLYIDRFVNRDMFGYTVPTAMFQSINAFA
VMLCGVFLAWVVKESVAGNRTVRIWGKFALGLGLMSAGFCILTLSARWSAMYGHSSLPLMVLGLAVMGFAELFIDPVAMS
QITRIEIPGVTGVLTGIYMLLSGAIANYLAGVIADQTSQASFDASGAINYSINAYIEVFDQITWGALACVGLVLMIWLYQ
ALKFRNRALALES
>P0C2U3 ~~~dtpT~~~Di-/tripeptide transporter~~~
MQNLNKTEKTFFGQPRGLLTLFQTEFWERFSYYGMRAILVYYLYALTTADNAGLGLPKAQAMAIVSIYGALVYLSTIVGG
WVADRLLGASRTIFLGGILITLGHVALATPFGLSSLFVALFLIILGTGMLKPNISNMVGHLYSKDDSRRDTGFNIFVVGI
NMGSLIAPLIVGTVGQGVNYHLGFSLAAIGMIFALFAYWYGRLRHFPEIGREPSNPMDAKAKRNFIITLTIVLIVALIGF
FLIYQASPANFINNFINVLSIIGIVVPIIYFVMMFTSKKVESDERRKLTAYIPLFLSAIVFWAIEEQSSTIIAVWGESRS
NLNPTWFGFTFHIDPSWYQLLNPLFIVLLSPIFVRIWNKLGDRQPSTIVKFGLGLMLTGASYLIMTLPGLLNGTSGRASA
LWLVLMFAVQMAGELLVSPVGLSVSTKLAPVAFQSQMMAMWFLADSTSQAINAQITPIFKAATEVHFFAITGIIGIIVGI
ILLIIKKPILKLMGDVR
>P0DJL7 ~~~dtxR~~~Diphtheria toxin repressor~~~
MKDLVDTTEMYLRTIYELEEEGVTPLRARIAERLEQSGPTVSQTVARMERDGLVVVASDRSLQMTPTGRTLATAVMRKHR
LAERLLTDIIGLDINKVHDEACRWEHVMSDEVERRLVKVLKDVSRSPFGNPIPGLDELGVGNSDAAVPGTRVIDAATSMP
RKVRIVQINEIFQVETDQFTQLLDADIRVGSEVEIVDRDGHITLSHNGKDVELIDDLAHTIRIEEL
>Q5SMC7 1.3.1.-~~~dus~~~tRNA-dihydrouridine(20/20a) synthase~~~COG0042
MLDPRLSVAPMVDRTDRHFRFLVRQVSLGVRLYTEMTVDQAVLRGNRERLLAFRPEEHPIALQLAGSDPKSLAEAARIGE
AFGYDEINLNLGCPSEKAQEGGYGACLLLDLARVREILKAMGEAVRVPVTVKMRLGLEGKETYRGLAQSVEAMAEAGVKV
FVVHARSALLALSTKANREIPPLRHDWVHRLKGDFPQLTFVTNGGIRSLEEALFHLKRVDGVMLGRAVYEDPFVLEEADR
RVFGLPRRPSRLEVARRMRAYLEEEVLKGTPPWAVLRHMLNLFRGRPKGRLWRRLLSEGRSLQALDRALRLMEEEVGEEG
EKEKPGPRGQREAAPGPAREGV
>P32695 1.3.1.-~~~dusA~~~tRNA-dihydrouridine(20/20a) synthase~~~COG0042
MHGNSEMQKINQTSAMPEKTDVHWSGRFSVAPMLDWTDRHCRYFLRLLSRNTLLYTEMVTTGAIIHGKGDYLAYSEEEHP
VALQLGGSDPAALAQCAKLAEARGYDEINLNVGCPSDRVQNGMFGACLMGNAQLVADCVKAMRDVVSIPVTVKTRIGIDD
QDSYEFLCDFINTVSGKGECEMFIIHARKAWLSGLSPKENREIPPLDYPRVYQLKRDFPHLTMSINGGIKSLEEAKAHLQ
HMDGVMVGREAYQNPGILAAVDREIFGSSDTDADPVAVVRAMYPYIERELSQGTYLGHITRHMLGLFQGIPGARQWRRYL
SENAHKAGADINVLEHALKLVADKR
>P0ABT5 1.3.1.-~~~dusB~~~tRNA-dihydrouridine synthase B~~~COG0042
MRIGQYQLRNRLIAAPMAGITDRPFRTLCYEMGAGLTVSEMMSSNPQVWESDKSRLRMVHIDEPGIRTVQIAGSDPKEMA
DAARINVESGAQIIDINMGCPAKKVNRKLAGSALLQYPDVVKSILTEVVNAVDVPVTLKIRTGWAPEHRNCEEIAQLAED
CGIQALTIHGRTRACLFNGEAEYDSIRAVKQKVSIPVIANGDITDPLKARAVLDYTGADALMIGRAAQGRPWIFREIQHY
LDTGELLPPLPLAEVKRLLCAHVRELHDFYGPAKGYRIARKHVSWYLQEHAPNDQFRRTFNAIEDASEQLEALEAYFENF
A
>P33371 1.3.1.-~~~dusC~~~tRNA-dihydrouridine(16) synthase~~~COG0042
MRVLLAPMEGVLDSLVRELLTEVNDYDLCITEFVRVVDQLLPVKVFHRICPELQNASRTPSGTLVRVQLLGQFPQWLAEN
AARAVELGSWGVDLNCGCPSKTVNGSGGGATLLKDPELIYQGAKAMREAVPAHLPVSVKVRLGWDSGEKKFEIADAVQQA
GATELVVHGRTKEQGYRAEHIDWQAIGDIRQRLNIPVIANGEIWDWQSAQQCMAISGCDAVMIGRGALNIPNLSRVVKYN
EPRMPWPEVVALLQKYTRLEKQGDTGLYHVARIKQWLSYLRKEYDEATELFQHVRVLNNSPDIARAIQAIDIEKL
>P9WNS7 1.3.1.-~~~dus~~~Probable tRNA-dihydrouridine synthase~~~COG0042
MSRRRAIQPSPALRIGPIELASPVVLAPMAGVTNVAFRALCRQLEQSKVGTVSGLYVCEMVTARALIERHPVTMHMTTFS
ADESPRSLQLYTVDPDTTYAAARMIAGEGLADHIDMNFGCPVPKVTKRGGGAALPFKRRLFGQIVAAAVRATEGTDIPVT
VKFRIGIDDAHHTHLDAGRIAEAEGAAAVALHARTAAQRYSGTADWEQIARLKQHVRTIPVLGNGDIYDAGDALAMMSTT
GCDGVVIGRGCLGRPWLFAELSAAFTGSPAPTPPTLGEVADIIRRHGTLLAAHFGEDKGMRDIRKHIAWYLHGFPAGSAL
RRALAMVKTFDELDCLLDRLDGTVPFPDSATGARGRQGSPARVALPDGWLTDPDDCRVPEGADAMGSGG
>P67717 1.3.1.-~~~dus~~~Probable tRNA-dihydrouridine synthase~~~
MKENFWSELPRPFFILAPMEDVTDIVFRHVVSEAARPDVFFTEFTNTESFCHPEGIHSVRGRLTFSEDEHPMVAHIWGDK
PEQFRETSIQLAKMGFKGIDLNMGCPVANVAKKGKGSGLILRPDVAAEIIQATKAGGLPVSVKTRLGYYEIDEWKDWLKH
VFEQDIANLSIHLRTRKEMSKVDAHWELIEAIKNLRDEIAPNTLLTINGDIPDRKTGLELAEKYGIDGVMIGRGIFHNPF
AFEKEPREHTSKELLDLLRLHLSLFNKYEKDEIRQFKSLRRFFKIYVRGIRGASELRHQLMNTQSIAEARALLDEFEAQM
DEDVKIEL
>Q2YRG4 3.6.1.23~~~dut~~~Deoxyuridine 5'-triphosphate nucleotidohydrolase~~~
MTAASSSAPTLGIIRLEHAKGLDLPAYETAGSAGMDLRAAVAEDRQIVLLPGRRTLVPTGLILEIPQGYEVQIRPRSGLA
FKNGITCLNTPGTIDSDYRGEVKVLLINLGDDDFRIERGMRIAQAVFAPVIQPKIEERAKISETARGAGGFGSTGTA
>Q2T0H6 3.6.1.23~~~dut~~~Deoxyuridine 5'-triphosphate nucleotidohydrolase~~~
MKLDLKILDARMRDYLPKYATTGSAGLDLRACLDAPVTLKPGDTALVPTGLAIHLADPGYAALILPRSGLGHKHGIVLGN
LVGLIDSDYQGELMISTWNRGQTEFVLNPFERLAQLVIVPVVQATFNIVGDFAQSDRGAGGFGSTGRH
>Q45920 3.6.1.23~~~dut~~~Deoxyuridine 5'-triphosphate nucleotidohydrolase~~~COG0756
MTHSVQLKILDKRLGSEFPLPAYATTGSAGLDLRACLDEPLKIEPDETCLISTGLAIYLGHSNVAATILPRSGLGHKHGI
VLGNLVGLIDSDYQGPLMVSCWNRGKEPYTINPGDRIAQLVVLPILKAQFAVVEEFELTERGAGGFGSSGQN
>P06968 3.6.1.23~~~dut~~~Deoxyuridine 5'-triphosphate nucleotidohydrolase~~~COG0756
MMKKIDVKILDPRVGKEFPLPTYATSGSAGLDLRACLNDAVELAPGDTTLVPTGLAIHIADPSLAAMMLPRSGLGHKHGI
VLGNLVGLIDSDYQGQLMISVWNRGQDSFTIQPGERIAQMIFVPVVQAEFNLVEDFDATDRGEGGFGHSGRQ
>Q5ZSN0 3.6.1.23~~~dut~~~Deoxyuridine 5'-triphosphate nucleotidohydrolase~~~COG0756
MHQVIQLKILDSRIGDTIPLPAYATDGSAGLDLRVCISEPMQVAPQQTVLLPTGIAIYIADPKLAAVILPRSGLGHKNGI
VLGNLVGLIDSDYQGELKISCWNRSQEHFTVNPGDRIAQLVFIPVVQASFEVVNEFTESSRGEGGFGSSGRY
>A0QW08 3.6.1.23~~~dut~~~Deoxyuridine 5'-triphosphate nucleotidohydrolase~~~COG0756
MSTSLAVVRLDRELPMPTRAHDGDAGVDLYSAENVELAPGQRALVSTGIAVAIPHGMVGLVHPRSGLAARVGLSIVNSPG
TIDAGYRGEIKVSLINLDPQTPVVISRGDRIAQLLVQRVELPELVEVTSFDEAGLADTTRGDGGHGSSGGHASL
>P9WNS5 3.6.1.23~~~dut~~~Deoxyuridine 5'-triphosphate nucleotidohydrolase~~~COG0756
MSTTLAIVRLDPGLPLPSRAHDGDAGVDLYSAEDVELAPGRRALVRTGVAVAVPFGMVGLVHPRSGLATRVGLSIVNSPG
TIDAGYRGEIKVALINLDPAAPIVVHRGDRIAQLLVQRVELVELVEVSSFDEAGLASTSRGDGGHGSSGGHASL
>Q9ZDD2 3.6.1.23~~~dut~~~Deoxyuridine 5'-triphosphate nucleotidohydrolase~~~COG0756
MTIIEVKIKKLENFLGNLPEYATEHSAGMDLVAANEQSITIKVGSIQLIPTGIAIALPESFEAQIRPRSGLAVKHGITVA
NSPGTIDADYRGEIKVLLINLGNKDFIIEKGMRIAQMIIAKYERVLWAETSILTETMRGRGGFGSTGL
>B7H1U5 1.1.1.267~~~dxr~~~1-deoxy-D-xylulose 5-phosphate reductoisomerase~~~
MTQSVCILGVTGSIGRSTLKILGQHPDKYSVFAVSAHSRISELVEICKQFRPKVVVVPEQKIAELKTLFAQQNISDIDVL
AGQEGLVDIASHTDVDIVMAAIVGAAGLLPTLAAVKAGKRVLLANKEALVMSGEIMMQAARDHQALLLPVDSEHNAIFQS
LPHNYLQADRTGQPQLGVSKILLTASGGPFLNHSLEQLVHVTPQQACKHPNWSMGQKISVDSATLMNKGLELIEACHLFS
ISEHFVTVVVHPQSIIHSMVQYVDGSTLAQMGNPDMCTPIAHALAWPERLQTNVPALDLFEYSQLNFQAPDTQKFPALNL
ARQAMRAGGLAPTILNAANEIAVEAFLMERIGFTSIPQVVEHTLEKLENAAAESIECILDKDKVARSVAQQYISSIGG
>P45568 1.1.1.267~~~dxr~~~1-deoxy-D-xylulose 5-phosphate reductoisomerase~~~COG0743
MKQLTILGSTGSIGCSTLDVVRHNPEHFRVVALVAGKNVTRMVEQCLEFSPRYAVMDDEASAKLLKTMLQQQGSRTEVLS
GQQAACDMAALEDVDQVMAAIVGAAGLLPTLAAIRAGKTILLANKESLVTCGRLFMDAVKQSKAQLLPVDSEHNAIFQSL
PQPIQHNLGYADLEQNGVVSILLTGSGGPFRETPLRDLATMTPDQACRHPNWSMGRKISVDSATMMNKGLEYIEARWLFN
ASASQMEVLIHPQSVIHSMVRYQDGSVLAQLGEPDMRTPIAHTMAWPNRVNSGVKPLDFCKLSALTFAAPDYDRYPCLKL
AMEAFEQGQAATTALNAANEITVAAFLAQQIRFTDIAALNLSVLEKMDMREPQCVDDVLSVDANAREVARKEVMRLAS
>P9WNS1 1.1.1.267~~~dxr~~~1-deoxy-D-xylulose 5-phosphate reductoisomerase~~~COG0743
MTNSTDGRADGRLRVVVLGSTGSIGTQALQVIADNPDRFEVVGLAAGGAHLDTLLRQRAQTGVTNIAVADEHAAQRVGDI
PYHGSDAATRLVEQTEADVVLNALVGALGLRPTLAALKTGARLALANKESLVAGGSLVLRAARPGQIVPVDSEHSALAQC
LRGGTPDEVAKLVLTASGGPFRGWSAADLEHVTPEQAGAHPTWSMGPMNTLNSASLVNKGLEVIETHLLFGIPYDRIDVV
VHPQSIIHSMVTFIDGSTIAQASPPDMKLPISLALGWPRRVSGAAAACDFHTASSWEFEPLDTDVFPAVELARQAGVAGG
CMTAVYNAANEEAAAAFLAGRIGFPAIVGIIADVLHAADQWAVEPATVDDVLDAQRWARERAQRAVSGMASVAIASTAKP
GAAGRHASTLERS
>Q9KGU6 1.1.1.267~~~dxr~~~1-deoxy-D-xylulose 5-phosphate reductoisomerase~~~
MSRPQRISVLGATGSIGLSTLDVVQRHPDRYEAFALTGFSRLAELEALCLRHRPVYAVVPEQAAAIALQGSLAAAGIRTR
VLFGEQALCEVASAPEVDMVMAAIVGAAGLPSTLAAVEAGKRVLLANKEALVMSGALFMQAVKRSGAVLLPIDSEHNAIF
QSLPRNYADGLERVGVRRILLTASGGPFRETPLEQLASVTPEQACAHPNWSMGRKISVDSASMMNKGLELIEACWLFDAQ
PSQVEVVIHPQSVIHSMVDYVDGSVIAQLGNPDMRTPISYAMAWPERIDSGVSPLDMFAVGRLDFQRPDEQRFPCLRLAS
QAAETGGSAPAMLNAANEVAVAAFLERHIRFSDIAVIIEDVLNREAVTAVESLDQVLAADRRARSVAGQWLTRHAG
>Q9RCT1 1.1.1.267~~~dxr~~~1-deoxy-D-xylulose 5-phosphate reductoisomerase~~~COG0743
MKAVTLLGSTGSIGTQTLDILEQYPDRFRLVGLAAGRNVALLSEQIRRHRPEIVAIQDAAQLSELQAAIADLDNPPLILT
GEAGVTEVARYGDAEIVVTGIVGCAGLLPTIAAIEAGKDIALANKETLIAAGPVVLPLLQKHGVTITPADSEHSAIFQCI
QGLSTHADFRPAQVVAGLRRILLTASGGAFRDWPVERLSQVTVADALKHPNWSMGRKITVDSATLMNKGLEVIEAHYLFG
LDYDYIDIVIHPQSIIHSLIELEDTSVLAQLGWPDMRLPLLYALSWPDRLSTQWSALDLVKAGSLEFREPDHAKYPCMDL
AYAAGRKGGTMPAVLNAANEQAVALFLEEQIHFSDIPRLIERACDRHQTEWQQQPSLDDILAYDAWARQFVQASYQSLES
VV
>Q9WZZ1 1.1.1.267~~~dxr~~~1-deoxy-D-xylulose 5-phosphate reductoisomerase~~~COG0743
MEERTLVILGATGSIGTQTLDVLKKVKGIRLIGISFHSNLELAFKIVKEFNVKNVAITGDVEFEDSSINVWKGSHSIEEM
LEALKPDITMVAVSGFSGLRAVLASLEHSKRVCLANKESLVCGGFLVKKKLKEKGTELIPVDSEHSAIFQVMEPEVEKVV
LTASGGALRDWKISKIDRARPEDVLKHPVWNMGARITVDSATMVNKAFEVLEAMELFELPFEKIEVKIHREGLVHGAVVL
PDGNVKMVVSPPDMRIPISYALFYPRRVALEPFFLRTISLSFEDPDPEKYPAFFLLKEIKDSYALRTAFNAADEVAVEAF
LKGRIRFGGIHRVIEKTLEEFQGYPQPRTLDDVERIHFEAIKKAERVTEWLSSTSY
>Q8DBF5 1.1.1.267~~~dxr~~~1-deoxy-D-xylulose 5-phosphate reductoisomerase~~~
MQKLTILGATGSIGASTLKVIEQNPDKFSVVALAADSNVEKMQQLCQRWQPEYAVMANKEAALRLKMALAVLAPNTQVLG
GQEALCYVATLEQVDSVMAAIVGAAGLVPTMAAVKAGKRILLANKEALVMSGQLFIDEVEKSGAQLLPVDSEHNAIFQCL
PQTVQGNLGRCDLASQGVSHILLTGSGGPFRYTDVAELEAVTPEQAIAHPNWSMGPKISVDSATMMNKGLEYIEAKWLFN
ASRDQLKVIIHPQSVIHSMVQYLDGSVLAQMGEPDMATPIALTLSYPERVKAGVKPLDFTQVGELTFLQPDFERYPCLAL
AIEACYLGQHATTTLNAANEVAVAAFLARQIKFTDIARVNDSVLNQVCKQSLASGLDSLESLLELDRMARTLADEVVRER
AQ
>Q8ZH62 1.1.1.267~~~dxr~~~1-deoxy-D-xylulose 5-phosphate reductoisomerase~~~COG0743
MKQLTILGSTGSIGNSTLSVVRANPELFKVTALVAGRNVREMAQQCLEFSPRYAAMSDEHSAKSLRLLLAEQGSDTEVYS
GETAACELAALDDVDQVMAAIVGIAGLPSTLAAIRAGKQVLLANKESLITCGKLFMDEVKRSRAQLLPIDSEHNAIFQSL
PERIQRQLGYSSLNENGVSRIILTGSGGPFRETPLSQFSDVTPDQACAHPNWSMGRKISVDSATMMNKGLEYIEARWLFN
ASAEQIEVVLHPQSVIHSMVRYHDGSILAQMGTPDMRTPIAHAMAYPMRVSSGVAPLDFCKVGALTFTTPDYQRYPCLKL
AIDACNAGQAATTALNAANEISVMAFLDSKIRFTDIEVINRTVVEGLLLSEPTSVEEVLVIDRKARDVAAQVIAKLNN
>B1JQG4 1.1.1.267~~~dxr~~~1-deoxy-D-xylulose 5-phosphate reductoisomerase~~~
MKQLTILGSTGSIGNSTLSVVRANPELFKVTALVAGRNVREMAQQCLEFSPRYAAMSDEHSAKSLRLLLAEQGSDTEVYS
GETAACELAALDDVDQVMAAIVGIAGLPSTLAAIRAGKQVLLANKESLITCGKLFMDEVKRSRAQLLPIDSEHNAIFQSL
PERIQRQLGYSSLNENGVSRIILTGSGGPFRETPLSQFSDVTPDQACAHPNWSMGRKISVDSATMMNKGLEYIEARWLFN
ASAEQIEVVLHPQSVIHSMVRYHDGSILAQMGTPDMRTPIAHAMAYPMRVSSGVAPLDFCKVGALTFTTPDYQRYPCLKL
AIDACNAGQAATTALNAANEISVMAFLDSKIRFTDIEVINRTVVEGLLLSEPTSVEEVLVIDRKARDVAAQVIAKLNN
>Q9X5F2 1.1.1.267~~~dxr~~~1-deoxy-D-xylulose 5-phosphate reductoisomerase~~~COG0743
MSQPRTVTVLGATGSIGHSTLDLIERNLDRYQVIALTANRNVKDLADAAKRTNAKRAVIADPSLYNDLKEALAGSSVEAA
AGADALVEAAMMGADWTMAAIIGCAGLKATLAAIRKGKTVALANKESLVSAGGLMIDAVREHGTTLLPVDSEHNAIFQCF
PHHNRDYVRRIIITASGGPFRTTSLAEMATVTPERAVQHPNWSMGAKISIDSATMMNKGLELIEAYHLFQIPLEKFEILV
HPQSVIHSMVEYLDGSILAQIGSPDMRTPIGHTLAWPKRMETPAESLDFTKLRQMDFEAPDYERFPALTLAMESIKSGGA
RPAVMNAANEIAVAAFLDKKIGFLDIAKIVEKTLDHYTPATPSSLEDVFAIDNEARIQAAALMESLPA
>Q9RUB5 2.2.1.7~~~dxs~~~1-deoxy-D-xylulose-5-phosphate synthase~~~COG1154
MNELPGTSDTPLLDQIHGPKDLKRLSREQLPALTEELRGEIVRVCSRGGLHLASSLGAVDIITALHYVLDSPRDRILFDV
GHQAYAHKILTGRRDQMADIKKEGGISGFTKVSESEHDAITVGHASTSLANALGMALARDAQGKDFHVAAVIGDGSLTGG
MALAALNTIGDMGRKMLIVLNDNEMSISENVGAMNKFMRGLQVQKWFQEGEGAGKKAVEAVSKPLADFMSRAKNSTRHFF
DPASVNPFAAMGVRYVGPVDGHNVQELVWLLERLVDLDGPTILHIVTTKGKGLSYAEADPIYWHGPAKFDPATGEYVPSS
AYSWSAAFGEAVTEWAKTDPRTFVVTPAMREGSGLVEFSRVHPHRYLDVGIAEEVAVTTAAGMALQGMRPVVAIYSTFLQ
RAYDQVLHDVAIEHLNVTFCIDRAGIVGADGATHNGVFDLSFLRSIPGVRIGLPKDAAELRGMLKYAQTHDGPFAIRYPR
GNTAQVPAGTWPDLKWGEWERLKGGDDVVILAGGKALDYALKAAEDLPGVGVVNARFVKPLDEEMLREVGGRARALITVE
DNTVVGGFGGAVLEALNSMNLHPTVRVLGIPDEFQEHATAESVHARAGIDAPAIRTVLAELGVDVPIEV
>P77488 2.2.1.7~~~dxs~~~1-deoxy-D-xylulose-5-phosphate synthase~~~COG1154
MSFDIAKYPTLALVDSTQELRLLPKESLPKLCDELRRYLLDSVSRSSGHFASGLGTVELTVALHYVYNTPFDQLIWDVGH
QAYPHKILTGRRDKIGTIRQKGGLHPFPWRGESEYDVLSVGHSSTSISAGIGIAVAAEKEGKNRRTVCVIGDGAITAGMA
FEAMNHAGDIRPDMLVILNDNEMSISENVGALNNHLAQLLSGKLYSSLREGGKKVFSGVPPIKELLKRTEEHIKGMVVPG
TLFEELGFNYIGPVDGHDVLGLITTLKNMRDLKGPQFLHIMTKKGRGYEPAEKDPITFHAVPKFDPSSGCLPKSSGGLPS
YSKIFGDWLCETAAKDNKLMAITPAMREGSGMVEFSRKFPDRYFDVAIAEQHAVTFAAGLAIGGYKPIVAIYSTFLQRAY
DQVLHDVAIQKLPVLFAIDRAGIVGADGQTHQGAFDLSYLRCIPEMVIMTPSDENECRQMLYTGYHYNDGPSAVRYPRGN
AVGVELTPLEKLPIGKGIVKRRGEKLAILNFGTLMPEAAKVAESLNATLVDMRFVKPLDEALILEMAASHEALVTVEENA
IMGGAGSGVNEVLMAHRKPVPVLNIGLPDFFIPQGTQEEMRAELGLDAAGMEAKIKAWLA
>P9WNS3 2.2.1.7~~~dxs~~~1-deoxy-D-xylulose-5-phosphate synthase~~~COG1154
MLQQIRGPADLQHLSQAQLRELAAEIREFLIHKVAATGGHLGPNLGVVELTLALHRVFDSPHDPIIFDTGHQAYVHKMLT
GRSQDFATLRKKGGLSGYPSRAESEHDWVESSHASAALSYADGLAKAFELTGHRNRHVVAVVGDGALTGGMCWEALNNIA
ASRRPVIIVVNDNGRSYAPTIGGVADHLATLRLQPAYEQALETGRDLVRAVPLVGGLWFRFLHSVKAGIKDSLSPQLLFT
DLGLKYVGPVDGHDERAVEVALRSARRFGAPVIVHVVTRKGMGYPPAEADQAEQMHSTVPIDPATGQATKVAGPGWTATF
SDALIGYAQKRRDIVAITAAMPGPTGLTAFGQRFPDRLFDVGIAEQHAMTSAAGLAMGGLHPVVAIYSTFLNRAFDQIMM
DVALHKLPVTMVLDRAGITGSDGASHNGMWDLSMLGIVPGIRVAAPRDATRLREELGEALDVDDGPTALRFPKGDVGEDI
SALERRGGVDVLAAPADGLNHDVLLVAIGAFAPMALAVAKRLHNQGIGVTVIDPRWVLPVSDGVRELAVQHKLLVTLEDN
GVNGGAGSAVSAALRRAEIDVPCRDVGLPQEFYEHASRSEVLADLGLTDQDVARRITGWVAALGTGVCASDAIPEHLD
>Q9KGU7 2.2.1.7~~~dxs~~~1-deoxy-D-xylulose-5-phosphate synthase~~~
MPKTLHEIPRERPATPLLDRASSPAELRRLGEADLETLADELRQYLLYTVGQTGGHFGAGLGVVELTIALHYVFDTPDDR
LVWDVGHQAYPHKILTERRELMGTLRQKNGLAAFPRRAESEYDTFGVGHSSTSISAALGMAIAARLQGKERKSVAVIGDG
ALTAGMAFEALNHASEVDADMLVILNDNDMSISHNVGGLSNYLAKILSSRTYSSMREGSKKVLSRLPGAWEIARRTEEYA
KGMLVPGTLFEELGWNYIGPIDGHDLPTLVATLRNMRDMKGPQFLHVVTKKGKGFAPAELDPIGYHAITKLEAPGSAPKK
TGGPKYSSVFGQWLCDMAAQDARLLGITPAMKEGSDLVAFSERYPERYFDVAIAEQHAVTLAAGMACEGMKPVVAIYSTF
LQRAYDQLIHDVAVQHLDVLFAIDRAGLVGEDGPTHAGSFDISYLRCIPGMLVMTPSDEDELRKLLTTGYLFDGPAAVRY
PRGSGPNHPIDPDLQPVEIGKGVVRRRGGRVALLVFGVQLAEAMKVAESLDATVVDMRFVKPLDEALVRELAGSHELLVT
IEENAVMGGAGSAVGEFLASEGLEVPLLQLGLPDYYVEHAKPSEMLAECGLDAAGIEKAVRQRLDRQ
>Q9RBN6 2.2.1.7~~~dxs~~~1-deoxy-D-xylulose-5-phosphate synthase~~~
MTILENIRQPRDLKALPEEQLHELSEEIRQFLVHAVTRTGGHLGPNLGVVELTIALHRVFESPVDRILWDTGHQSYVHKL
LTGRQDFSKLRGKGGLSGYPSREESEHDVIENSHASTALGWADGLAKARRVQGEKGHVVAVIGGRALTGGMAWEALNNIA
AAKDQPLIIVVNDNERSYAPTIGGLANHLATLRTTDGYEKVLAWGKDVLLRTPIVGHPLYEALHGAKKGFKDAFAPQGMF
EDLGLKYVGPIDGHDIGAVESALRRAKRFHGPVLVHCLTVKGRGYEPALAHEEDHFHTVGVMDPLTCEPLSPTDGPSWTS
VFGDEIVRIGAEREDIVAITAAMLHPVGLARFADRFPDRVWDVGIAEQHAAVSAAGLATGGLHPVVAVYATFLNRAFDQL
LMDVALHRCGVTFVLDRAGVTGVDGASHNGMWDMSVLQVVPGLRIAAPRDADHVRAQLREAVAVDDAPTLIRFPKESVGP
RIPALDRVGGLDVLHRDERPEVLLVAVGVMAQVCLQTAELLRARGIGCTVVDPRWVKPVDPVLPPLAAEHRLVAVVEDNS
RAAGVGSAVALALGDADVDVPVRRFGIPEQFLAHARRGEVLADIGLTPVEIAGRIGASLPVREEPAEEQPA
>P54159 ~~~dynA~~~Dynamin-like protein A~~~COG0699
MTDQNRKELLHKTGELYKQFIENQDEQRAAKLAAVMKKAADEEVYIAFTGHYSAGKSSLLNCLLMENILPTSPIPTSANL
VVIRNGEKRVRLHTTDGACAELEGTYQKDKVQQYCKDGEQIESVEIFDRYTEIDSGVAYIDTPGIDSTDDAHFLSAASIL
HQADALFYVVHYNHVHAEENVKFLRSIKESIPNVYFIVNQIDRHDETETKFGDYQAQVEEMLCNEGISREALYFTSVTEP
DHPFNQMGALREELSRIEQQSKSNMQALTEQKVRNLLKEHTEMLKKDETGAPSFAEQLNIHTGLVQSLRDQLDEAEKQMT
EAEKRMQEEINRILKNANLTPFEMRELAAAFLESQEPSFKTGFFFSKAKTAQERDKRRNAFFSDVAKRTEAEADWHMIDT
LHKLAKVFDVYTAESEKLIQAYRTPLDISIIEHAVKHGAAFSSEYVLQYTKDLAELIRKEAKREAADIIKVLSAMVKERV
SKDVQTINDRLVQESEKLVFLQEQARLENNAREKTDRLWAIWEEESACPMHIDTEWFKSKKTRVAAPEQKQGRSQLTAQP
MPKSEIKMEQEMPLQDQIKRFYTLSDILGECSMLLKQTSAFRERVKRLEERKFTLALFGGFSSGKSSFANALVGERVLPS
SPTPTTATINKITKPINGNLNKTANVVFKTEDDLTAEILQLTGIPKEPAGRSFTEKWEKAVKKNRLQEEHVKLISNFLLA
YEKYQQYIQEQKKLTIPLSELKPYVAEETTACAVKEVTVYYTCPLTEKGITIVDTPGASSMNKRHTELAFQYIKDADAFF
YMTYYQHSFSKGDRSFLRKLGLVKESLSMDKMFFIINAADLAKDKTELETVTDYVSAELVKEGVYEPQLFTVSSKEELVG
KPESFYNQFSKVRKHLDRFIEVDVKKASAAQLSSEADKLCETVFQLHQSQHQSREEKEAQKQCLMLSFERTAADIEKRRN
SKTIIEKVKKDTREQLYHIAQRLSYFANDLLKSAFHPGLQNGDWKKNVSKAMTTALHEYLFEYIQEIKTLDVRMSGFIER
HINEEWLDHFQKTLNEDGYFSVYAGDQHSNGIQLKEVEPEIEERAFEQELKEIKSPKQFFEQKGKATFIEAVRMKLTKIT
EAWIKNEEESLISHYTAHLRRLQEDMGEKAIAQITDQKETYLRGYAEGEHAKEIEMAYQACISWKNSDNTIKM
>K7N5M8 1.11.1.16~~~dyp2~~~Multifunctional dye peroxidase DyP2~~~
MPVDLSTTLSWKSATGEAATMLDELQPNILKAHVRDRLTVLFLGFGDAAEARTFLNGLSGLMKSARTHLQEVEAHKLTKA
VGTPYLGVGLTAHGYATLGVTAPADPSFTAGAKAAVEKLADPAVTEWEGHYQQTIDAVLLLGDATAGPVRTLRRQVEALR
PASVTVVGEESGLGLANANGDGIEHFGYVDGRSQPLFLTEDVDAERDTTDGVNDWDPSAPLEQVLVPDPAAPDPTVHFGS
YFVFRKLEQNVRLFKEAERDLAHDLGLRGEDRERAGAMLVGRFEDGTPLTAQSAPGSHHPVGNDFSYDSDKLGQKCPFHA
HIRKTNPRGSGGAEAPEEERKHLMARRGQTYGRRHDDPNADLPPRLRPAKDVGLLFMAFNSNLGNQFEFTQQIWANNPAF
PFPPDGSQPGLDPVIGQGARAPQKYAPEWGHNNVAEATDPIPQAVTMKGGEYFFMPSLAFLRSL
>A0A3T0E4B9 1.11.1.-~~~~~~Dye-decolorizing peroxidase~~~
MTEAFPNGKTPQHVLGPPAPAAVFLVLTVRSGAEAEAKDFLGDIAGVVRSVGFRAREDHLSCVTGIGAELWDRMFDAPRP
AGLHPFIEQRGDVHTAPSTPGDLLFHIRARRMDLCFELARQLVGELGDAVSVVDEVHGFRYFDERDIMGFVDGTENPEDQ
EAVDSVFTPTGGDDPASSTYVIVQKYTHDMAAWEALSVEDQEAAFGRHKLSDMEFPDEDKAPNSHLILNTIEDEDGTEHK
IVRDNMVFGSVESGEFGTYFIGYAADVSVTEQMLENMFIGNPRGTYDRILDFSTAQTGGLFFVPSQDFLDDPDGELAAAE
PSDAQNDDPASASARIEETDPPNPASADDPAPADDSLGIGSLRRRDQ
>Q743F4 1.11.1.7~~~~~~Dye-decolorizing peroxidase~~~COG2837
MVNIVAVRRHGVHVRVIHVPPVQPQPILAPLTPAAIFLVLTVDDGGEATVHEALQDISGLVRAIGFREPQKRLSAIASIG
SDVWDRLFSGPRPAELHRFVELHGPRHTAPATPGDLLFHIRAESLDVCFELADRILKSMAGAVTVVDEVHGFRYFDNRDL
LGFVDGTENPDGALAVSSTAIGDEDPDFAGSCYVHVQKYLHDMSAWTALSVTEQENVIGRTKLDDIELDDDVKPADAHIA
LNVITDDDGTELKIVRHNMPFGELGKSEYGTYFIGYSRTPRVTEQMLRNMFLGDPPGNTDRILDFSTAVTGGLFFSPTVD
FLDDPPPLPAPGTPAAPPARNGSLSIGSLKGTTR
>I6Y4U9 1.11.1.7~~~dyp~~~Dye-decolorizing peroxidase~~~COG2837
MAVPAVSPQPILAPLTPAAIFLVATIGADGEATVHDALSKISGLVRAIGFRDPTKHLSVVVSIGSDAWDRLFAGPRPTEL
HPFVELTGPRHTAPATPGDLLFHIRAETMDVCFELAGRILKSMGDAVTVVDEVHGFRFFDNRDLLGFVDGTENPSGPIAI
KATTIGDEDRNFAGSCYVHVQKYVHDMASWESLSVTEQERVIGRTKLDDIELDDNAKPANSHVALNVITDDDGTERKIVR
HNMPFGEVGKGEYGTYFIGYSRTPTVTEQMLRNMFLGDPAGNTDRVLDFSTAVTGGLFFSPTIDFLDHPPPLPQAATPTL
AAGSLSIGSLKGSPR
>C0ZVK5 1.11.1.-~~~~~~Dye-decolorizing peroxidase~~~COG2837
MALPAIPQPLLTPLTEAAIFLVFTIDEGGEQAVHDVLADISGLQRSIGFRVPAGGLAAVVGIGSDAWDRLFEGPRPAELH
PFVELTGDKHHAPRTPGDLLFHIRARQMDLCFEFATVVTNRLAGAASVIDEVHGFKYFEQRDLMGFVDGTENPSGQAAYV
AVTVGDEDPDFAGSSYVIVQKYLHDMSEWNSLPVEEQENVIGRSKLEDLEMDDDTKPANSHTALTVIEDESGEQIQILRD
NMPFGHVGSAEMGTYFIGYSASPTVTEQMLTNMFIGNPVGNYDRILDFSTAVTGINFFVPTADFLDDPPDAPTRLVPEAT
FTAPISDGSLGIGSLKRSAQQ
>Q47KB1 1.11.1.19~~~~~~Dye-decolorizing peroxidase Tfu_3078~~~COG2837
MTEPDTERKGSSRRGFLAGLGAAALTGAGIGMAAGEVLRPLLPDSDPAASPEAEQRLRMAAQRADATAAPQPGISGPAPA
FVHVIALDLAEEARKNPDTARDSAAAALRSWTELAARLHEESPHDIAEGAASAGLLPASLMVTVGIGGSLLSAIDAEDRR
PDALADLPEFSTDDLHPRWCGGDFMLQVGAEDPMVLTAAVEELVAAAADATAVRWSLRGFRRTAAAARDPDATPRNLMGQ
IDGTANPAQDHPLFDRTITARPADNPAHAWMDGGSYLVVRRIRMLLTEWRKLDVAARERVIGRRLDTGAPLGSRNETDPV
VLSARDEEGEPLIPENAHVRLASPENNLGARMFRRGYSYDQGWRDDGVRDAGLLFMAWQGDPATGFIPVQRSLADQGDAL
NRYIRHEGSALFAVPAAREGRYLGQDLIEG
>P00382 1.5.1.3~~~dhfrI~~~Dihydrofolate reductase type 1~~~
MKLSLMVAISKNGVIGNGPDIPWSAKGEQLLFKAITYNQWLLVGRKTFESMGALPNRKYAVVTRSSFTSDNENVLIFPSI
KDALTNLKKITDHVIVSGGGEIYKSLIDQVDTLHISTIDIEPEGDVYFPEIPSNFRPVFTQDFASNINYSYQIWQKG
>P00383 1.5.1.3~~~~~~Dihydrofolate reductase type 2~~~
MERSSNEVSNPVAGNFVFPSNATFGMGDRVRKKSGAAWQGQIVGWYCTNLTPEGYAVESEAHPGSVQIYPVAALERIN
>P12833 1.5.1.3~~~dhfrIII~~~Dihydrofolate reductase type 3~~~
MLISLIAALAHNNLIGKDNLIPWHLPADLRHFKAVTLGKPVVMGRRTFESIGRPLPGRRNVVVSRNPQWQAEGVEVAPSL
DAALALLTDCEEAMIIGGGQLYAEALPRADRLYLTYIDAQLNGDTHFPDYLSLGWQELERSTHPADDKNSYACEFVTLSR
QR
>P11731 1.5.1.3~~~dhfrV~~~Dihydrofolate reductase type 5~~~
MKVSLMAAKAKNGVIGCGPHIPWSAKGEQLLFKALTYNQWLLVGRKTFESMGALPNRKYAVVTRSAWTADNDNVIVFPSI
EEAMYGLAELTDHVIVSGGGEIYRETLPMASTLHISTIDIEPEGDVFFPNIPNTFEVVFEQHFSSNINYCYQIWQKG
>P13955 1.5.1.3~~~dfrA~~~Dihydrofolate reductase type 1 from Tn4003~~~
MTLSIIVAHDKQRVIGYQNQLPWHLPNDLKHIKQLTTGNTLVMARKTFNSIGKPLPNRRNVVLTNQASFHHEGVDVINSL
DEIKELSGHVFIFGGQTLYEAMIDQVDDMYITVIDGKFQGDTFFPPYTFENWEVESSVEGQLDEKNTIPHTFLHLVRRKG
K
>P11045 1.5.1.3~~~dfrA~~~Dihydrofolate reductase~~~COG0262
MISFIFAMDANRLIGKDNDLPWHLPNDLAYFKKITSGHSIIMGRKTFESIGRPLPNRKNIVVTSAPDSEFQGCTVVSSLK
DVLDICSGPEECFVIGGAQLYTDLFPYADRLYMTKIHHEFEGDRHFPEFDESNWKLVSSEQGTKDEKNPYDYEFLMYEKK
NSSKAGGF
>P0ABQ4 1.5.1.3~~~folA~~~Dihydrofolate reductase~~~COG0262
MISLIAALAVDRVIGMENAMPWNLPADLAWFKRNTLNKPVIMGRHTWESIGRPLPGRKNIILSSQPGTDDRVTWVKSVDE
AIAACGDVPEIMVIGGGRVYEQFLPKAQKLYLTHIDAEVEGDTHFPDYEPDDWESVFSEFHDADAQNSHSYCFEILERR
>P00380 1.5.1.3~~~folA~~~Dihydrofolate reductase~~~
MFISMWAQDKNGLIGKDGLLPWRLPNDMRFFREHTMDKILVMGRKTYEGMGKLSLPYRHIIVLTTQKDFKVEKNAEVLHS
IDELLAYAKDIPEDIYVSGGSRIFQALLPETKIIWRTLIDAEFEGDTFIGEIDFTSFELVEEHEGIVNQENQYPHRFQKW
QKMSKVV
>P00381 1.5.1.3~~~folA~~~Dihydrofolate reductase~~~
MTAFLWAQDRDGLIGKDGHLPWHLPDDLHYFRAQTVGKIMVVGRRTYESFPKRPLPERTNVVLTHQEDYQAQGAVVVHDV
AAVFAYAKQHPDQELVIAGGAQIFTAFKDDVDTLLVTRLAGSFEGDTKMIPLNWDDFTKVSSRTVEDTNPALTHTYEVWQ
KKA
>P9WNX1 1.5.1.3~~~folA~~~Dihydrofolate reductase~~~COG0262
MTMVGLIWAQATSGVIGRGGDIPWRLPEDQAHFREITMGHTIVMGRRTWDSLPAKVRPLPGRRNVVLSRQADFMASGAEV
VGSLEEALTSPETWVIGGGQVYALALPYATRCEVTEVDIGLPREAGDALAPVLDETWRGETGEWRFSRSGLRYRLYSYHR
S
>P04174 1.5.1.3~~~folA~~~Dihydrofolate reductase~~~
MLKITIIAACAENLCIGAGNAMPWHIPEDFAFFKVYTLGKPVIMGRKTWESLPVKPLPGRRNIVISRQADYCAAGAETVA
SLEVALALCAGAEEAVIMGGAQIYGQAMPLATDLRITEVDLSVEGDAFFPEIDRTHWREAERTERRVSSKGVAYTFVHYL
GK
>P99079 1.5.1.3~~~folA~~~Dihydrofolate reductase~~~
MTLSILVAHDLQRVIGFENQLPWHLPNDLKHVKKLSTGHTLVMGRKTFESIGKPLPNRRNVVLTSDTSFNVEGVDVIHSI
EDIYQLPGHVFIFGGQTLFEEMIDKVDDMYITVIEGKFRGDTFFPPYTFEDWEVASSVEGKLDEKNTIPHTFLHLIRKK
>P0A017 1.5.1.3~~~folA~~~Dihydrofolate reductase~~~
MTLSILVAHDLQRVIGFENQLPWHLPNDLKHVKKLSTGHTLVMGRKTFESIGKPLPNRRNVVLTSDTSFNVEGVDVIHSI
EDIYQLPGHVFIFGGQTLFEEMIDKVDDMYITVIEGKFRGDTFFPPYTFEDWEVASSVEGKLDEKNTIPHTFLHLIRKK
>P0C0P0 1.5.1.3~~~folA~~~Dihydrofolate reductase~~~
MTLSIIVAHDKQRVIGYQNQLPWHLPNDLKHVKQLTTGNTLVMGRKTFNSIGKPLPNRRNVVLTNQASFHHEGVDVINSL
DEIKELSGHVFIFGGQTLFEAMIDQVDDMYITVIDGKFQGDTFFPPYTFENWEVESSVEGQLDEKNTIPHTFLHLVRRKG
K
>Q54277 1.5.1.3~~~dfrD~~~Dihydrofolate reductase~~~
MWSFLKISLIVAMDKKRVIGKDNDIPWRISSDWEYVKNTTKGHAIILGRKNLQSIGRALPDRRNIILTRDKNFNFKDCEI
AHSIEAAFKLCENEEEVFIFGGEQIYVMFLPYVEKMYVTKIHHEFEGDTFFPVVNFDDWKEVSVEKGIKDEKNPYDYYFH
IYERIR
>Q54801 1.5.1.3~~~dhfR~~~Dihydrofolate reductase~~~COG0262
MTKKIVAIWAQDEEGVIGKENRLPWHLPAELQHFKETTLNHAILMGRVTFDGMGRRLLPKRETLILTRNPEEKIDGVATF
QDVQSVLDWYQAQEKNLYIIGGKQIFQAFEPYLDEVIVTHIHARVEGDTYFPEELDLSLFETVSSKFYAKDEKNPYDFTI
QYRKRKEV
>Q60034 1.5.1.3~~~folA~~~Dihydrofolate reductase~~~COG0262
MAKVIFVLAMDVSGKIASSVESWSSFEDRKNFRKITTEIGNVVMGRITFEEIGRPLPERLNVVLTRRPKTSNNPSLVFFN
GSPADVVKFLEGKGYERVAVIGGKTVFTEFLREKLVDELFVTVEPYVFGKGIPFFDEFEGYFPLKLLEMRRLNERGTLFL
KYSVEKSHR
>P22222 3.2.1.39~~~~~~Glucan endo-1,3-beta-glucosidase~~~
MPHDRKNSSRRAWAALCAAVLAVSGALVGVAAPASAVPATIPLTITNDSGRGPIYLYVLGERDGVAGWADAGGTFHPWPG
GVGPVPVPAPDASIAGPGPGQSVTIRLPKLSGRVYYSYGQKMTFQIVLDGRLVQPAVQNDSDPNRNILFNWTEYTLNDGG
LWINSTQVDHWSAPYQVGVQRADGQVLSTGMLKPNGYEAFYTALEGAGWGGLVQRAPDGSRLRALNPSHGIDVGKISSAS
IDSYVTEVWNSYRTRDMVVTPFSHEPGTQFRGRVDGDWFRFRSGSGQEVAAFKKPDASSVYGCHKDLQAPNDHVVGPIAR
TLCAALVRTTALTNPNQPDANSAGFYQDARTNVYAKLAHQQMANGKAYAFAFDDVGAHESLVHDGNPQAAYIKLDPFTGT
ATPLGNGGSTEQPGTPGGLPAGTGALRIGSTLCLDVPWADPTDTNQVQLATCSGNAAQQWTRGTDGTVRALGKCLDVARS
GTADGTAVWIYTCNGTGAQKWTYDSATKALRNPQSGKCLDAQGGAPLRDGQKVQLWTCNQTEAQRWTL
>P23903 3.2.1.39~~~glcA~~~Glucan endo-1,3-beta-glucosidase A1~~~
MKPSHFTEKRFMKKVLGLFLVVVMLASVGVLPTSKVQAAGTTVTSMEYFSPADGPVISKSGVGKASYGFVMPKFNGGSAT
WNDVYSDVGVNVKVGNNWVDIDQAGGYIYNQNWGHWSDGGFNGYWFTLSATTEIQLYSKANGVKLEYQLVFQNINKTTIT
AMNPTQGPQITASFTGGAGFTYPTFNNDSAVTYEAVADDLKVYVKPVNSSSWIDIDNNAASGWIYDHNFGQFTDGGGGYW
FNVTESINVKLESKTSSANLVYTITFNEPTRNSYVITPYEGTTFTADANGSIGIPLPKIDGGAPIAKELGNFVYQINING
QWVDLSNSSQSKFAYSANGYNNMSDANQWGYWADYIYGLWFQPIQENMQIRIGYPLNGQAGGNIGNNFVNYTFIGNPNAP
RPDVSDQEDISIGTPTDPAIAGMNLIWQDEFNGTTLDTSKWNYETGYYLNNDPATWGWGNAELQHYTNSTQNVYVQDGKL
NIKAMNDSKSFPQDPNRYAQYSSGKINTKDKLSLKYGRVDFRAKLPTGDGVWPALWMLPKDSVYGTWAASGEIDVMEARG
RLPGSVSGTIHFGGQWPVNQSSGGDYHFPEGQTFANDYHVYSVVWEEDNIKWYVDGKFFYKVTNQQWYSTAAPNNPNAPF
DEPFYLIMNLAVGGNFDGGRTPNASDIPATMQVDYVRVYKEQ
>P43528 ~~~~~~Heat-labile enterotoxin IIB, A chain~~~
MAKVISFFISLFLISFPLYANDYFRADSRTPDEVRRSGGLIPRGQDEAYERGTPININLYDHARGTATGNTRYNDGYVST
TTTLRQAHFLGQNMLGGYNEYYIYVVAAAPNLFDVNGVLGRYSPYPSENEYAALGGIPLSQIIGWYRVSFGAIEGGMHRN
RDYRRDLFRGLSAAPNEDGYRIAGFPDGFPAWEEVPWREFAPNSCLPNNKASSDTTCASLTNKLSQHDLADFKKYIKRKF
TLMTLLSINNDGFFSNNGGKDEL
>P43529 ~~~~~~Heat-labile enterotoxin IIB, B chain~~~
MSFKKIIKAFVIMAALVSVQAHAGASQFFKDNCNRTTASLVEGVELTKYISDINNNTDGMYVVSSTGGVWRISRAKDYPD
NVMTAEMRKIAMAAVLSGMRVNMCASPASSPNVIWAIELEAE
>P0A9B6 1.2.1.72~~~epd~~~D-erythrose-4-phosphate dehydrogenase~~~COG0057
MTVRVAINGFGRIGRNVVRALYESGRRAEITVVAINELADAAGMAHLLKYDTSHGRFAWEVRQERDQLFVGDDAIRVLHE
RSLQSLPWRELGVDVVLDCTGVYGSREHGEAHIAAGAKKVLFSHPGSNDLDATVVYGVNQDQLRAEHRIVSNASCTTNCI
IPVIKLLDDAYGIESGTVTTIHSAMHDQQVIDAYHPDLRRTRAASQSIIPVDTKLAAGITRFFPQFNDRFEAIAVRVPTI
NVTAIDLSVTVKKPVKANEVNLLLQKAAQGAFHGIVDYTELPLVSVDFNHDPHSAIVDGTQTRVSGAHLIKTLVWCDNEW
GFANRMLDTTLAMATVAFR
>A5F9G1 1.2.1.72~~~epd~~~D-erythrose-4-phosphate dehydrogenase~~~COG0057
MLRVAINGFGRIGRNVLRAVYESGKRDRIQVVAVNELAKPDAMAHLLQYDTSHGRFGKKISHDQQHIYVHHQNGEYDSIR
ILHLSEIPLLPWRDLGVDLVLDCTGVYGCQEDGQQHIDAGAKLVLFSHPGASDLDNTIIYGVNHETLTAEHKIVSNGSCT
TNCIVPIIKVLDDAFGIDSGTITTIHSSMNDQQVIDAYHNDLRRTRAASQSIIPVDTKLHKGIERIFPKFSNKFEAISVR
VPTVNVTAMDLSVTIKSNVKVNDVNQTIVNASQCTLRGIVDYTEAPLVSIDFNHDPHSAIVDGTQTRVSNGQLVKMLVWC
DNEWGFANRMLDTALAMQATQ
>Q6RUF5 3.2.1.102~~~eabC~~~Blood-group-substance endo-1,4-beta-galactosidase~~~
MGGVTMKNNLKKYIKYILSVILVFFVGVNGMEVYALEESRDVYLSDLDWLNATHGDDTKSKIVQKNHPFTPGNNNQSTKI
SLKMEDGSISEFEKGLGTIAGSPSTITYDISGAGVTKFFSYLGIDRSANPINEQYAKVDKIEVVVDGKVIYSTINQFPNG
LTYETPAIKVDLNIPENAKRLQLKSYAGEKTWGDEVVYADAKFTAKGDFVNPNDWTPAEKRREISNEKPLLMIPLYANGS
KYEKGDYAFWGDDTLVGKWKEVPDDLKPYTVIQLHPDDLPKRDGVAADFYEHMLNEAQSYVNPKTNKNEPIPIVLTVYTA
GNVPGYTAAHWLTTEWIEDMYSKYSALQGVFSTENYWVWTDNVESNAAEYLKLSAKYGGYFIWSEQNNGGSIEKAFGSNG
KTVFKEAVEKYWENFIFMYKNTPQAEGNDAPTSSYMTGLWLTDYAYQWGGLMDTWKWYETGKWKLFESGNIGKTQGNRQW
LTEPEALLGIEAMNIYLNGGCVYNFEHPAYTYGVRNEESPLFSNVIKEFFRYVINNPSPSKNEMRAKTKSLLYGNFTQNG
NGNYFVGLNTEMSQSPAYTTGRYGNIPAVPSSIERNKIESRLSGSQIKLIDMNSSELSNITNRKEYFNKLYKEEYNGNIF
AQKLDNRWFIYNYKYNENINQKGSFDIANIKSEVTLEPHTYLIMEDNNQSINIKLNNYRTNKDSLWEGAKNADEAKKLPE
MSKVDALNWVYDSYIKNTNNGEKRTSVIKLMNIDKAPTITNVNGIEGSYDIPTVKYNSETRSAEITIKNNGNIDFDIVIK
>O54161 3.2.1.55~~~abfB~~~Extracellular exo-alpha-L-arabinofuranosidase~~~COG3693
MHRGSLSRGHTSAVLAAVVAALAALAALLVATTPAQAAGSGALRGAGSNRCLDVLGGSQDDGALLQLYDCWGGTNQQWTS
TDTGRLTVYGDKCLDVPGHATAPGTRVQIWSCSGGANQQWRVNSDGTVVGVESGLCLEAAGAGTANGTAVQLWTCNGGGN
QKWTGLTGTPPTDGTCALPSTYRWSSTGVLAQPKSGWVALKDFTTVTHNGRHLVYGSTSSGSSYGSMVFSPFTNWSDMAS
AGQNAMNQAAVAPTLFYFAPKNIWVLAYQWGSWPFIYRTSSDPTDPNGWSAPQPLFTGSISGSDTGPIDQTLIADGQNMY
LFFAGDNGKIYRASMPIGNFPGNFGSSYTTIMSDTKANLFEGVQVYKVQGQNQYLMIVEAMGANGRYFRSFTASSLSGSW
TPQAASEGNPFAGKANSGATWTNDISHGDLVRDNPDQTMTVDPCNLQFLYQGKSPNAGGDYNSLPWRPGVLTLRR
>P82593 3.2.1.55~~~~~~Extracellular exo-alpha-L-arabinofuranosidase~~~
MSRIRWRYGTAATALLVAAGLVPTATAHAEDVTDYSITVDPAAKGAAIDDTMYGVFFEDINRAADGGLYAELVQNRSFEY
STDDNRSYTPLTSWIVDGTGEVVNDAGRLNERNRNYLSLGAGSSVTNAGYNTGIRVEQGKRYDFSVWARAGSASTLTVAL
KDAAGTLATARQVAVEGGWAKYRATFTATRTSNRGRLAVAANDAAALDMVSLFPRDTYRNQQNGLRKDLAEKIAALHPGF
VRFPGGCLVNTGSMEDYSAASGWQRKRSYQWKDTVGPVEERATNANFWGYNQSYGLGYYEYFRFSEDIGAMPLPVVPALV
TGCGQNKAVDDEALLKRHIQDTLDLIEFANGPATSKWGKVRAEMGHPRPFRLTHLEVGNEENLPDEFFDRFKQFRAAIEA
EYPDITVVSNSGPDDAGTTFDTAWKLNREANVEMVDEHYYNSPNWFLQNNDRYDSYDRGGPKVFLGEYASQGNAWKNGLS
EAAFMTGLERNADVVKLASYAPLLANEDYVQWRPDLVWFNNRASWNSANYEVQKLFMNNVGDRVVPSKATTTPDVSGPIT
GAVGLSTWATGTAYDDVKVTAADGATLLSDDFSGDASKWTHTGAGSWSVQDGQYVQTDAAAENTMVQAGDPSWHDYDLHV
KATKKSGKEGFLVAFGVKDTGNYYWWNLGGWNNTQSAVEQAVDGGKGTLLTKAGSIETGRAYDIDVKVRGRQVTLYLDGQ
EWGGFTDDKPAEPFRQVVTKDARTGDLIVKVVNAQPAEARTAIDLGGARVASTARVTTLAADQDAVNTETDAPVTPATST
FSGVTDRFTYTFPANSVTFLRLKQR
>P96463 3.2.1.55~~~abfB~~~Extracellular exo-alpha-L-arabinofuranosidase~~~
MHRGSLSRGQHVRGTRRRGAALAALAALLVATAPAQAAGSGALRGAGSNRCLDVLGGSQDDGALLQLYDCWGGTNQQWTS
TDTGRLTVYGDKCLDVPGHATAPGTRVQIWSCSGGRNQQWRVNSDGTVVGVESGLCLEAAGAGTPNGTAVQLWTCNGGGN
QKWTGLTGTPPTDGTCALPSTYRWSSTGVLAQPKSGWVALKDFTTVTHNGRHLVYGSTSSGSSYGSMVFSPFTNWSDMAS
AGQNAMNQAAVAPTLFYFAPKNIWVLAYQWGSWPFIYRTSSDPTDPNGWSAPQPLFTGSISGSDTGPIDQTLIADGQNMY
LFFAGDNGKIYRASMPIGNFPGNFGSSYTTIMSDTKANLFEGVQVYKVQGQNQYLMIVEAMGANGRYFRSFTASSLSGSW
TPQAASEGNPFAGKANSGATWTNDISHGDLVRDNPDQTMTVDPCNLQFLYQGKAPNAGGHYNSLPWRPGVLTLRH
>P94522 3.2.1.99~~~abnA~~~Extracellular endo-alpha-(1->5)-L-arabinanase 1~~~COG3507
MKKKKTWKRFLHFSSAALAAGLIFTSAAPAEAAFWGASNELLHDPTMIKEGSSWYALGTGLTEERGLRVLKSSDAKNWTV
QKSIFTTPLSWWSNYVPNYGQNQWAPDIQYYNGKYWLYYSVSSFGSNTSAIGLASSTSISSGGWKDEGLVIRSTSSNNYN
AIDPELTFDKDGNPWLAFGSFWSGIKLTKLDKSTMKPTGSLYSIAARPNNGGALEAPTLTYQNGYYYLMVSFDKCCDGVN
STYKIAYGRSKSITGPYLDKSGKSMLEGGGTILDSGNDQWKGPGGQDIVNGNILVRHAYDANDNGIPKLLINDLNWSSGW
PSY
>P42293 3.2.1.99~~~abn2~~~Extracellular endo-alpha-(1->5)-L-arabinanase 2~~~COG3507
MFNRLFRVCFLAALIMAFTLPNSVYAQKPIFKEVSVHDPSIIETNGTFYVFGSHLASAKSNDLMQWQQLTTSVSNDNPLI
PNVYEELKETFEWAQSDTLWAADVTQLADGKYYMYYNACRGDSPRSAMGVAVADNIEGPYKNKGIFLKSGMEGTSSDGTP
YDATKHPNVVDPHTFFDKDGKLWMVYGSYSGGIFILEMNPKTGFPLPGQGYGKKLLGGNHSRIEGPYVLYNPDTQYYYLY
LSYGGLDATGGYNIRVARSKKPDGPYYDAEGNPMLDVRGKGGTFFDDRSIEPYGVKLMGSYTFETENEKGTGYVSPGHNS
AYYDEKTGRSYLIFHTRFPGRGEEHEVRVHQLFMNKDGWPVAAPYRYAGETLKEVKQKDITGTYKLIQHGKDISADIKQT
INIQLNKNHTISGEMTGTWRKTGKNTADITLAGKKYNGVFLRQWDSVREKNVMTFSVLNTSGEAVWGSK
>A5IKD4 3.2.1.99~~~~~~Extracellular endo-alpha-(1->5)-L-arabinanase~~~COG3507
MRFLFLMITLTALTGYILADEQPTFRWAVVHDPSIIKVGNMYYVFGTHLQVAKSKDLMHWEQINTSAHDKNPIIPNINEE
LKETLSWARTRNDIWAPQVIQLSDGRYYMYYCASTFGSPRSAIGIAVSDDIEGPYKHYAVIVKSGQVYSVDGPSEDGTPY
DSRKHPNALDPGVFYDKEGNLWMVYGSWFGGIYILKLDPNTGLPLPGQGYGKRLVGGNHSSMEGPYILYSPDTDYYYLFL
SFGGLDYRGGYNIRVARSKNPNGPYYDPEGKSMENCMGSKTVISNYGAKLVGNFILSESNTIDFKAFGYVSPGHNSAYYD
PETGKYFIFFHTRFPGRGETYQLRVHQLFLNEDGWFVMAPFPYGGETVSKLPNEEIVGEYQFINHGKEITDKIKQPVRIK
LNSDGSITGAVEGRWERKEHYITLKIIEGNTTVIYKGVLLKQWHYSEKKWVTVFTALSNQGVSVWGIRVEE
>D9XD61 4.2.3.169~~~~~~7-epi-alpha-eudesmol synthase ((2E,6E)-farnesyl diphosphate cyclizing)~~~COG2124
MPQDVRFDLPFETPVSKHLESARARHLRWVWEMRLVHSREGFEEYRSWDLPQAAARTYPHASADDMVVLMNWFSLAFLFD
DQFDASRPDRADRIAEVARELIVTPLRPAGTPPRVACPITLAWTEVWKHLSHGMSLTWQSRFAASWGRFLEAHCEEVDLA
ARGLEGTLGLVEFTEFRRRTVGIHHSIDAGERSRGFEVPAQAMAHPVMERMRDLAADTIGFMNDIHSFEREKRRGDGHNL
IAVLRRERGCSWQEATDEAYRMTIARLDEYLELQERVPQMCDELRLDEAQRDGVRLGVEAIQHWINGNYEWALTSGRYAA
AKEGAVATAELAGRGSVDDLLTV
>P19809 ~~~eae~~~Intimin~~~
MITHGFYARTRHKHKLKKTFIMLSAGLGLFFYVNQNSFANGENYFKLGSDSKLLTHNSYQNRLFYTLKTGETVADLSKSQ
DINLSTIWSLNKHLYSSESEMMKAAPGQQIILPLKKLPFEYSALPLLGSAPLVAAGGVAGHTNKLTKMSPDVTKSNMTDD
KALNYAAQQAASLGSQLQSRSLNGDYAKDTALGIAGNQASSQLQAWLQHYGTAEVNLQSGNNFDGSSLDFLLPFYDSEKM
LAFGQVGARYIDSRFTANLGAGQRFFLPENMLGYNVFIDQDFSGDNTRLGIGGEYWRDYFKSSVNGYFRMSGWHESYNKK
DYDERPANGFDIRFNGYLPSYPALGAKLMYEQYYGDNVALFNSDKLQSNPGAATVGVNYTPIPLVTMGIDYRHGTGNEND
LLYSMQFRYQFDKPWSQQIEPQYVNELRTLSGSRYDLVQRNNNIILEYKKQDILSLNIPHDINGTERSTQKIQLIVKSKY
GLDRIVWDDSALRSQGGQIQHSGSQSAQDYQAILPAYVQGGSNVYKVTARAYDRNGNSSNNVLLTITVLSNGQVVDQVGV
TDFTADKTSAKADGTEAITYTATVKKNGVAQANVPVSFNIVSGTAVLSANSANTNGSGKATVTLKSDKPGQVVVSAKTAE
MTSALNANAVIFVDQTKASITEIKADKTTAVANGQDAITYTVKVMKGDKPVSNQEVTFTTTLGKLSNSTEKTDTNGYAKV
TLTSTTPGKSLVSARVSDVAVDVKAPEVEFFTTLTIDDGNIEIVGTGVKGKLPTVWLQYGQVNLKASGGNGKYTWRSANP
AIASVDASSGQVTLKEKGTTTISVISSDNQTATYTIATPNSLIVPNMSKRVTYNDAVNTCKNFGGKLPSSQNELENVFKA
WGAANKYEYYKSSQTIISWVQQTAQDAKSGVASTYDLVKQNPLNNIKASESNAYATCVK
>P43261 ~~~eae~~~Intimin~~~COG5492
MITHGCYTRTRHKHKLKKTLIMLSAGLGLFFYVNQNSFANGENYFKLGSDSKLLTHDSYQNRLFYTLKTGETVADLSKSQ
DINLSTIWSLNKHLYSSESEMMKAAPGQQIILPLKKLPFEYSALPLLGSAPLVAAGGVAGHTNKLTKMSPDVTKSNMTDD
KALNYAAQQAASLGSQLQSRSLNGDYAKDTALGIAGNQASSQLQAWLQHYGTAEVNLQSGNNFDGSSLDFLLPFYDSEKM
LAFGQVGARYIDSRFTANLGAGQRFFLPANMLGYNVFIDQDFSGDNTRLGIGGEYWRDYFKSSVNGYFRMSGWHESYNKK
DYDERPANGFDIRFNGYLPSYPALGAKLIYEQYYGDNVALFNSDKLQSNPGAATVGVNYTPIPLVTMGIDYRHGTGNEND
LLYSMQFRYQFDKSWSQQIEPQYVNELRTLSGSRYDLVQRNNNIILEYKKQDILSLNIPHDINGTEHSTQKIQLIVKSKY
GLDRIVWDDSALRSQGGQIQHSGSQSAQDYQAILPAYVQGGSNIYKVTARAYDRNGNSSNNVQLTITVLSNGQVVDQVGV
TDFTADKTSAKADNADTITYTATVKKNGVAQANVPVSFNIVSGTATLGANSAKTDANGKATVTLKSSTPGQVVVSAKTAE
MTSALNASAVIFFDQTKASITEIKADKTTAVANGKDAIKYTVKVMKNGQPVNNQSVTFSTNFGMFNGKSQTQATTGNDGR
ATITLTSSSAGKATVSATVSDGAEVKATEVTFFDELKIDNKVDIIGNNVRGELPNIWLQYGQFKLKASGGDGTYSWYSEN
TSIATVDASGKVTLNGKGSVVIKATSGDKQTVSYTIKAPSYMIKVDKQAYYADAMSICKNLLPSTQTVLSDIYDSWGAAN
KYSHYSSMNSITAWIKQTSSEQRSGVSSTYNLITQNPLPGVNVNTPNVYAVCVE
>Q9I738 ~~~eagT6~~~Effector EagT6~~~
MTLYRLHEADLEIPDAWQDQSINIFKLPASGPAREASFVISRDASQGDAPFADYVARQLENAEKQLPGFKLHKRWDINIH
GHAAVLLDYQWQREGRDLMLRQVFIERRPAVLITTLTTTPADLPHHEPAWKQAMQTLVPRPTPS
>Q21FJ0 4.2.2.26~~~~~~Exo-oligoalginate lyase~~~
MLSVNTIKNTLLAAVLVSVPATAQVSGNGHPNLIVTEQDVANIAASWESYDAYAEQLNADKTNLDAFMAEGVVVPMPKDA
GGGYTHEQHKRNYKAIRNAGFLYQVTGDEKYLTFAKDLLLAYAKMYPSLGEHPNRKEQSPGRLFWQSLNEAVWLVYSIQG
YDAIIDGLAAEEKQEIESGVFLPMAKFLSVESPETFNKIHNHGTWAVAAVGMTGYVLGNDELVEISLMGLDKTGKAGFMK
QLDKLFSPDGYYTEGPYYQRYALMPFIWFAKAIETNEPERKIFEYRNNILLKAVYTTIDLSYAGYFFPINDALKDKGIDT
VELVHALAIVYSITGDNTLLDIAQEQGRISLTGDGLKVAKAVGEGLTQPYNYRSILLGDGADGDQGALSIHRLGEGHNHM
ALVAKNTSQGMGHGHFDKLNWLLYDNGNEIVTDYGAARYLNVEAKYGGHYLAENNTWAKQTIAHNTLVVNEQSHFYGDVT
TADLHHPEVLSFYSGEDYQLSSAKEANAYDGVEFVRSMLLVNVPSLEHPIVVDVLNVSADKASTFDLPLYFNGQIIDFSF
KVKDNKNVMKMLGKRNGYQHLWLRNTAPVGDASERATWILDDRFYSYAFVTSTPSKKQNVLIAELGANDPNYNLRQQQVL
IRRVEKAKQASFVSVLEPHGKYDGSLETTSGAYSNVKSVKHVSENGKDVVVVDLKDGSNVVVALSYNANSEQVHKVNAGE
EAIEWKGFSSVVVRRK
>B2FSW8 4.2.2.26~~~~~~Alginate lyase~~~
MRLQPLFVSLALAAPCALLPTASLSAAPAAAARQADTAPVLVTAAQWQQMASEGRRYPWFAKEQARTEATLKKMMKAGID
VPVPRDKGGGRTHEQHKRNYQALLAAGTLYRLTGDRAYVDYARDMLLQYAQLYPTLGPHPEGRGQIPGRVFWQVLNDSVW
LVNAIQGYDAIRDALSAEDRNTIESKVFRPMAEFLVSEPKNYDQIHNHATWAVAATGMTGYVLRDQELVEKSLRGSQKDD
KFGFLRQIDLLFSPDGYYEEGPYYQRYALAPFLLFANAIERNEPQRKIFARRDGVLLKAVDVLVQSSYGGLFFPINDAIL
DKGIDTEELVAGIGIAYARTGDDRLLSVAEQQKRLLLSPEGLQVAQALAANKAKPFDYHPMLLRDGPDGDRGGLAILRMN
GERGQALVQKDTMQGMGHGHFDKLNWLFYDNGNPVVTDYGAARFLNVEAKRGGIYLAENRSWAKQTVAHNTLVVDEQSHF
NGNWKRGEAHAPQVRFFQADADTQIASATMRDAYPGVAFTRTQALLRHPDLGLPVVLDLLQVHGDKAARYDLPLHFNGHI
VTTGFEAEHFPSQRPVLGKDNGYQHLWLDARSKPGSEPRSLAWLLDGRFYTYRFGSSAPAQALLVESGANDPEFNLRREP
ALLQRVDGQKDVTFFSVLEPHGEYNGTAEYVHGADSRIREIVRTRGSDAEVIELRLASGARIALGVADNSATTSEHSVTV
DGHVYRWNGSHARLDRSKGDGK
>P31125 ~~~eamA~~~Probable amino-acid metabolite efflux pump~~~COG0697
MSRKDGVLALLVVVVWGLNFVVIKVGLHNMPPLMLAGLRFMLVAFPAIFFVARPKVPLNLLLGYGLTISFAQFAFLFCAI
NFGMPAGLASLVLQAQAFFTIMLGAFTFGERLHGKQLAGIALAIFGVLVLIEDSLNGQHVAMLGFMLTLAAAFSWACGNI
FNKKIMSHSTRPAVMSLVIWSALIPIIPFFVASLILDGSATMIHSLVTIDMTTILSLMYLAFVATIVGYGIWGTLLGRYE
TWRVAPLSLLVPVVGLASAALLLDERLTGLQFLGAVLIMTGLYINVFGLRWRKAVKVGS
>P38101 ~~~eamB~~~Cysteine/O-acetylserine efflux protein~~~COG1280
MTPTLLSAFWTYTLITAMTPGPNNILALSSATSHGFRQSTRVLAGMSLGFLIVMLLCAGISFSLAVIDPAAVHLLSWAGA
AYIVWLAWKIATSPTKEDGLQAKPISFWASFALQFVNVKIILYGVTALSTFVLPQTQALSWVVGVSVLLAMIGTFGNVCW
ALAGHLFQRLFRQYGRQLNIVLALLLVYCAVRIFY
>Q185C5 5.4.3.9~~~eam~~~Glutamate 2,3-aminomutase~~~COG1509
MNEQTRISLERAAELKSKIDDYIQARKTINRGLEKEEEINKRKQKILSILNGTEEDWNNYKWQLSNRITDVDTLSKIITL
TKKEKEYIKEVGTQFRWAISPYYLSLIDPEDICDPIKLLSIPTHIELEDEQEDLDPMGEEYTNPAGCITRRYPDRLIINV
TNECAMYCRHCQRRRNIGQQDSHKSKAIIQESIDYIRENEEIRDVLVTGGDALTLKDDYLEWILSQLKEIPHVDYVRLGT
RTLVTMPQRITDEFCNMLKKYHPVYINTHFNHPMEITKESKEACEKLANAGVPLGNQAVLLNGINNDKFVMRCLNQELLK
IRVKPYYIFQSKHVKGTKHFNTSVDDGLEIMEYLRGYTSGMAIPTYIVNAPKGGGKTPLLPQYLVSKGTDYVMLRTWEGK
VIKMEDEPAVDIKKLIKEQAQD
>A0A0T7AQA7 2.4.1.-~~~earP~~~Protein-arginine rhamnosyltransferase~~~
MNTPPFVCWIFCKVIDNFGDIGVSLRLARVLHRELGWQVHLWTDDVSALRALCPDLPDVPCVHQDIHVRTWHSDAADIDT
APVPDAVIETFACDLPENVLHIIRRHKPLWLNWEYLSAEESNERLHLMPSPQEGVQKYFWFMGFSEKSGGLIRERDYRDA
VRFDTEALRQRLMLPEKNAPEWLLFGYRSDVWAKWLEMWQQAGSPMTLLLAGAQIIDSLKQSGIIPQNALQNDGDVFQTA
SVRLVKIPFVPQQDFDQLLHLADCAVIRGEDSFVRAQLAGKPFFWHIYPQDEHVHLDKLHAFWDKAHGFYTPETASAHRC
LSDDLNGGEALSATQRLECWQILQQHQNGWRQGAGAWSRYLFGQPSASEKLAAFVSKHQKIR
>E6MVV9 2.4.1.-~~~earP~~~Protein-arginine rhamnosyltransferase~~~
MNTPPFVCWIFCKVIDNFGDIGVSWRLARVLHRELGWQVHLWTDDVSALRALCPDLPDVPCVHQDIHVRTWHSDAADIDT
APVPDVVIETFACDLPENVLHIIRRHKPLWLNWEYLSAEESNERLHLMPSPQEGVQKYFWFMGFSEKSGGLIRERDYCEA
VRFDTEALRERLMLPEKNASEWLLFGYRSDVWAKWLEMWRQAGSPMTLLLAGTQIIDSLKQSGVIPQDALQNDGDVFQTA
SVRLVKIPFVPQQDFDQLLHLADCAVIRGEDSFVRAQLAGKPFFWHIYPQDENVHLDKLHAFWDKAHGFYTPETVSAHRR
LSDDLNGGEALSATQRLECWQTLQQHQNGWRQGAEDWSRYLFGQPSAPEKLAAFVSKHQKIR
>Q9HZZ1 2.4.1.-~~~earP~~~Protein-arginine rhamnosyltransferase~~~
MASWDIFCSVVDNYGDIGVTWRLARQLAAEHGQAVRLWVDEPQAFARICPRADPVAHVQCLDGVEVRAWGRPWAPVAAAD
VVIEAFACELPEAHRQAMRERKRPSLWLNLEYLSAEEWIGSCHALPSLQACGLSKYFFFPGFREPSGGLLREAGLLERRR
RFQASVSAQDEFLASLGVRRKVGERLISLFAYENPALPGWLEQLRDARQPSLLLVPEGRVLADVADWLRVATLAVGDVHV
RDALRVQVLPFMAQDDYDRLLWCCDLNAVRGEDSFVRAQWAGRPLLWHIYRQEEETHLAKLEAFLELYCAGLPADLAENL
RTFWLAWNAGGGLAGAWEGLERQLPEWRREAQRWADEQGMRPDLAARLVQFYADWL
>Q88LS1 2.4.1.-~~~earP~~~Protein-arginine rhamnosyltransferase~~~COG4394
MKATWDIFCSVVDNYGDIGVTWRLARQLVAEHGLAVRLWVDDLNAFTPMCPGADATAAQQWQHGVDVRHWPAAWLPVAPA
DVVIGAFACQLPAAYVEAMRARPQPPLWLNLEYLSAEDWVEGCHGLPSPQPNGLRKVFFFPGFTDKTGGLLREGSLLARR
DGFQQSAEARRAFLQGLGVDLVPGALLISLFAYENPQLGNWLDALATADQPCHLLVPQGRVVAGLSQWLGEGPLHVGDVR
TRGALTVQVLPFVSQDDFDRLLWSCDFNAVRGEDSFVRAQWAGQPMLWHIYVQDENAHWEKLEAFLAHYRCGLSDDADAA
LLGLWRAWNMDFDMGQAWRAARQHWPELQQHARLWGARQAAQPDLATALVHFYRNSL
>Q8EEP8 2.4.1.-~~~earP~~~Protein-arginine rhamnosyltransferase~~~COG4394
MSTSSNASHWDIFCTVVDNYGDIGVTWRLAKQLVNEYHIPIILWVDDLNSFSHILPTLNPKLVSQCFNGVIINHWTTPLA
VPYLPGKVLIEAFACELPDEVKLQLATLHKTAPQAVPLWLNLEYLSAEDWVDGCHGLPSMQVSGIKKYFYFPGFTPKTGG
LICERELFAERDAWQQDPANKLQLFESLGLKDIQAQDSVFSIFSYETDSLPALCELWQARAKNDAKIHALLPKGRSLNSL
QHLLPCPVDALMPGQQIKLGDLTLHILPMTDQQGFDRLLWSCDVNIVRGEDSFLRAQWAAKPFIWHIYPQEDDYHLIKLE
AFIRLYCDNLAPDIADTWSKLNFAFSQGQQSAVKTHWQNLNPVSLPLLQHAKEWPIDAINAADLATRLVQFVKKS
>Q84GK0 3.4.21.-~~~eatA~~~Serine protease EatA~~~
MNKVFSLKYSFLAKGFIAVSELARRVSVKGKLKSASSIIISPITIAIVSYAPPSLAATVNADISYQTFRDFAENKGAFIV
GASNINIYDKNGVLVGVLDKAPMPDFSSATMNTGTLPPGDHTLYSPQYVVTAKHVNGSDIMSFGHIQNNYTVVGENNHNS
LDIKIRRLNKIVTEVAPAEISSVGAVNGAYQEGGRFKAFYRLGGGLQYIKDKNGNLTPVYTNGGFLTGGTISALSSYNNG
QMITAPTGDIFNPANGPLANYLNKGDSGSPLFAYDSLDKKWVLVGVLSSGSEHGNNWVVTTQDFLHQQPKHDFDKTISYD
SEKGSLQWRYNKNSGVGTLSQESVVWDMHGKKGGDLNAGKNLQFTGNNGEIILHDSIDQGAGYLQFFDNYTVTSLTDQTW
TGGGIITEKGVNVLWQVNGVNDDNLHKVGEGTLTVNGKGVNNGGLKVGDGTVILNQRPDDNGHKQAFSSINISSGRATVI
LSDANQVNPDKISWGYRGGTLDLNGNNVNFTRLQAADYGAIVSNNNKNKSELTLKLQTLNENDISVDVKTYEVFGGHGSP
GDLYYVPASNTYFILKSKAYGPFFSDLDNTNVWQNVGHDRDKAIQIVKQQKIGESSQPYMFHGQLNGYMDVNIHPLSGKD
VLTLDGSVNLPEGVITKKSGTLIFQGHPVIHAGMTTSAGQSDWENRQFTMDKLRLDAATFHLSRNAHMQGDISAANGSTV
ILGSSRVFTDKNDGTGNAVSSVEGSSIATTAGDQSYYSGNVLLENHSSLEVRENFTGGIEAYDSSVSVTSQNAIFDHVGS
FVNSSLLLEKGAKLTAQSGIFTNNTMKIKENASLTLTGIPSVGKPGYYSPVTSTTEGIHLGERASLSVKNMGYLSSNITA
ENSAAIINLGDSNATIGKTDSPLFSTLMRGYNAVLQGNIMGPQSSVNMNNALWHSDRNSELKELKANDSQIELGVRGHFA
KLRVKELIASNSVFLVHANNSQADQLNVTDKLQGSNNTILVDFFNKAANGTNVTLITAPKGSDENTFKAGTQQIGFSNIT
PEIRTENTDTATQWVLTGYQSVADARASKIATDFMDSGYKSFLTEVNNLNKRMGDLRDSQGDAGGWARIMNGTGSGESGY
RDNYTHVQIGADRKHELNGIDLFTGALLTYTDNNASSQAFSGKTKSLGGGVYASGLFESGAYFDLIGKYLHHDNRYTLNF
ASLGERSYTSHSLYAGAEIGYRYHMSENTWVEPQMELVYGSVSGKSFNWKDQGMQLSMKDKDYHPLIGRTGVDVGRAFSG
DTWKVTVRAGLGYQFDLLANGETVLQDASGKKHFKGEKDSRMLMNVGTNVEVKDNMRFGLELEKSAFGRYNIDNSINANF
RYYF
>Q57231 ~~~~~~Antitoxin epsilon~~~
MAVTYEKTFEIEIINELSASVYNRVLNYVLNHELNKNDSQLLEVNLLNQLKLAKRVNLFDYSLEELQAVHEYWRSMNRYS
KQVLNKEKVA
>P36911 3.2.1.96~~~endOF1~~~Endo-beta-N-acetylglucosaminidase F1~~~
MKKFINQFSASLKNNILVFLAFPFVWTSCARDNPLSSENSNISPNAAARAAVTGTTKANIKLFSFTEVNDTNPLNNLNFT
LKNSGKPLVDMVVLFSANINYDAANDKVFVSNNPNVQHLLTNRAKYLKPLQDKGIKVILSILGNHDRSGIANLSTARAKA
FAQELKNTCDLYNLDGVFFDDEYSAYQTPPPSGFVTPSNNAAARLAYETKQAMPNKLVTVYVYSRTSSFPTAVDGVNAGS
YVDYAIHDYGGSYDLATNYPGLAKSGMVMSSQEFNQGRYATAQALRNIVTKGYGGHMIFAMDPNRSNFTSGQLPALKLIA
KELYGDELVYSNTPYSKDW
>P36912 3.2.1.96~~~endOF2~~~Endo-beta-N-acetylglucosaminidase F2~~~
MKTANFSFALCLSVVIMLFIKCTRSEQDLSVTKDAIAQKSGVTVSAVNLSNLIAYKNSDHQISAGYYRTWRDSATASGNL
PSMRWLPDSLDMVMVFPDYTPPENAYWNTLKTNYVPYLHKRGTKVIITLGDLNSATTTGGQDSIGYSSWAKGIYDKWVGE
YNLDGIDIDIESSPSGATLTKFVAATKALSKYFGPKSGTGKTFVYDTNQNPTNFFIQTAPRYNYVFLQAYGRSTTNLTTV
SGLYAPYISMKQFLPGFSFYEENGYPGNYWNDVRYPQNGTGRAYDYARWQPATGKKGGVFSYAIERDAPLTSSNDNTLRA
PNFRVTKDLIKIMNP
>P36913 3.2.1.96~~~endOF3~~~Endo-beta-N-acetylglucosaminidase F3~~~
MKKIFFAQCSILLLMLGSCSKMTEDMTPESVNKEASVKSATALAGSNGVCIAYYITDGRNPTFKLKDIPDKVDMVILFGL
KYWSLQDTTKLPGGTGMMGSFKSYKDLDTQIRSLQSRGIKVLQNIDDDVSWQSSKPGGFASAAAYGDAIKSIVIDKWKLD
GISLDIEHSGAKPNPIPTFPGYAATGYNGWYSGSMAATPAFLNVISELTKYFGTTAPNNKQLQIASGIDVYAWNKIMENF
RNNFNYIQLQSYGANVSRTQLMMNYATGTNKIPASKMVFGAYAEGGTNQANDVEVAKWTPTQGAKGGMMIYTYNSNVSYA
NAVRDAVKN
>P80036 3.2.1.96~~~~~~Endo-beta-N-acetylglucosaminidase~~~
MQFGIVAAIADGGRTARAGGSVRPPRRPPASHTAWGLPRGRPTGQPHATPTKSGPTSIAYVEVNNDQLANVGRYQLANGA
NAFDVAIIFAANINWNGSKAVLYNNENVQATLDDAATQIRPLQAKGIKVSLSILGNHQGAGIANFPTQAAAEDFAAQVSA
TVSKYGLDGVDLDDEYSDYGTNGTPQPNQQSIGWLISALRADVPGKLISFYDIGPASSALSSSSSTIGSKLDYAWNPYYG
TYSAPSIPGLDKSRLSAAAVDVQNTPQSTAVSLAQRTKADGYGVFMTYNLPDGDVSPYVSSMTKVLYGQAATYH
>P04067 3.2.1.96~~~~~~Endo-beta-N-acetylglucosaminidase H~~~
MFTPVRRRVRTAALALSAAAALVLGSTAASGASATPSPAPAPAPAPVKQGPTSVAYVEVNNNSMLNVGKYTLADGGGNAF
DVAVIFAANINYDTGTKTAYLHFNENVQRVLDNAVTQIRPLQQQGIKVLLSVLGNHQGAGFANFPSQQAASAFAKQLSDA
VAKYGLDGVDFDDEYAEYGNNGTAQPNDSSFVHLVTALRANMPDKIISLYNIGPAASRLSYGGVDVSDKFDYAWNPYYGT
WQVPGIALPKAQLSPAAVEIGRTSRSTVADLARRTVDEGYGVYLTYNLDGGDRTADVSAFTRELYGSEAVRTP
>Q56F26 3.2.1.165~~~csxA~~~Exo-beta-D-glucosaminidase~~~COG3250
MSFRQKRTRIPLLAMTVTALAAAVCGVTTAPAATGAEVAVPLSVGAAAGNATPIPGYVIQSSAQVSDDSAVSKPGFPTSG
WYPVSSRSTVYAGLLQNGKYADPFYSTNMQNVPAAQFSVPWWYRTDLNVDDTSSRTYLDFSGVLSKADVWVNGTKVATKD
QVNGAYTRHDLDITAQVHTGVNSVAFKVYPNDPNRDLSMGWIDWAQTPPDQNMGIVRDVLVRRSGAVALRSAHVIQKLNS
ALDHADLTVKADVRNDSANAVQTTVAGTVAGKPISQTVSLAAKERKTVTFPLVGLDRPNVWWPAGMGGQHRYDLDLTASV
GGTPSDAAKSKFGVRDVKATLNSSGGRQYSVNGKPLLIRGGGYTPDLFLRWNETAAADKLKYVLNLGLNTVRLEGHIEPD
EFFDIADDLGVLTMPGWECCDKWEGQVNGEEKGEPWVESDYPIAKASMFSEAERLRDHPSVISFHIGSDFAPDRRIEQGY
LDAMKAADFLLPVIPAASARPSPITGASGMKMNGPYDYVPPVYWYDKSQKDRGGAWSFNSETSAGVDIPTMDTLKRMMSA
SELDTMWKNPSAKQYHRSSSDTFGNLKLFGDALTKRYGASANLNDFVRKAQLSQYENVRAEFESHSRNYTDSTNPSTGLI
YWMLNSPWTSLHWQLFDAYMDQNGAYYGAKKANEPLHIQYSHDNRSVVVINQTSNAVSGLTATTKLYNLDGTEKYSNTKT
GLSVGALGAKATAVTVPAVSGLSTTYLAKWVLTDSSGKEVSRNVYWLSTKADTLNWGGSDWYYTPQSAFADLSGLNNLGQ
SAVGATANSVAGADGTTTTTVTLKNTSGGRLPAFYVDSKVVDSAGKPVLPVEWNDNAVSLWPGETTTLTAKYRTADLKGS
KPSVRISGWNTGTQTVPADGSGPGPSDPVDYQAEDATIVQGAVESNHAGYTGTGFVNYDNVAGSSVEWTVTVPSAGTYDV
VVRYANGTTTSRPLDFSVNGSISASGVAFGSTGTWPAWTTKTVRVTLAAGVNKIKAVATTANGGPNVDKITL
>Q82NR8 3.2.1.165~~~csxA~~~Exo-beta-D-glucosaminidase~~~COG3250
MFHRPASVRRFVTTAVALGLLSTLSTGARAGARTHEPPPRPTTVSSTAGSTTALTGYAIQSTAKVTDPAAAVSSPGYPAS
GWYPAGARSTVLAALLAGGKYADPFYSTNQQKIPKADFQVPWWYRSDFTVADTSARTYLDFSGVISAADVFVNGRQIARS
ADVAGAYTRHELDVTSLVREGANTVAFRIQPNNPNKNLTMGWIDWLEPPPDQNMGIVRDVLVRRGGPVALRDAHVITRLD
VPSLATADLTVKARARNDSDAAITATVSGSVGATSFRRSVALAAHETKTVTFTPADTPGLHLTSPRVWWPAGMGGQPLYA
LDLSASVSETVSDTVHESFGIRDVKAPLNSDGARQYSVNGRRLLIKGGGWSPDEFLRWDSTYVEDRLRYALDLGLNTIRL
EGHIEPDEFFDLADRYGILTLPGWECCNKWEGNVNGSGSGDEWTAADYPVAKASMAAEAARLRDHPSVVSFLIGSDFAPD
AKIEKTYLDALKAADWPTPVVAAASDKSSPVSGSSGMKMTGPYDWIPPNYWYAKREGGATGFNSETSAGPDIPTLDTLRR
MMTPAELDTLWKNPGAKQYHRSPSSVFGTLKIYDAALAGRYGAPTGLTDYVRKAQLAQYENVRAQFEAYGRGATDASKPA
TGVIYWMFNSGWTSLHWQLLDRYLDQGGAYFGAKKANEPLHVQYSYDDRSVVVVNNRPAAVSGLTARVTLFNTDGTQKYD
KSATGLSVAGDGAHSTALTLPSSVSGLSTTYLARLVLTDSAGKEVSRNVYWLSTRPDTLDWAHTDWYYTPTTSYADLKGL
GSMARVPVSATASTTAGTDGASTTTVTVRNTGSGRTPSLFTDVHLVDSKGKPVLPVQWSDNEVSLWPGESATLTVTYRTA
DLHGSAPRVRVSGWNTAEQTVPAA
>O51418 ~~~ebfC~~~Nucleoid-associated protein EbfC~~~
MAVNPLDFLKNMSSVKNNIDNIKKEISKITVCGKAGSNIVTIEMDGEFNVKKVSINKEFFDDLDNDAFEQMIKSALNDAV
SKVKEEIKLKTMGVLPFGM
>P0AC73 ~~~ebgC~~~Evolved beta-galactosidase subunit beta~~~COG2731
MRIIDNLEQFRQIYASGKKWQRCVEAIENIDNIQPGVAHSIGDSLTYRVETDSATDALFTGHRRYFEVHYYLQGQQKIEY
APKETLQVVEYYRDETDREYLKGCGETVEVHEGQIVICDIHEAYRFICNNAVKKVVLKVTIEDGYFHNK
>Q931R6 ~~~ebhA~~~Extracellular matrix-binding protein EbhA~~~
MGNLQTAINDKSGTLASQNFLDADEQKRNAYNQAISAAETILNKQTGPNTAKTAVEQALNNVNSAKHALNGTQNLNNAKQ
AAITAINGASDLNQKQKDALKAQANGAQRVSNANDVQRNATELNTAMGQLQHAIADKTNTLASSKYVNADSTKQNAYTTK
VTNAEHIISGTPTVVTTPSEVTAAANQVNSAKQELNGDERLRVAKQNANTAIDALTQLNTPQKAKLKEQVGQANRLEDVQ
SVQTNGQSLNNAMKGLRDSIANETTVKASQNYTDASPNNQSTYNSAVSNAKGIINQTNNPTMDTSAITQATTQVNNAKNG
LNGAENLRNAQNTAKQNLNTLSHLTNNQKSAISSQIDRAGHVSEVTAAKNAATELNAQMGNLEQAIHDQNTVKQGVNFTD
ADKAKRDAYTNAVSRAETILNKTQGANTSKQDVEAAIQNVTSAKNALNGDQNVTNAKNAAKNALNNLTSINNAQKRDLTT
KIDQATTVAGVEAVSNTGTQLNTAMANLQNGINDKANTLASENYHDADSDKKTAYTQAVTNAENILNKNSGSNLDKAAVE
NALSQVTNAKGALNGNHNLEQAKSNANTTINGLQHLTTAQKDKLKQQVQQAQNVAGVDTVKSSANTLNGAMGTLRNSIQD
NTATKNGQNYLDATERNKTNYNNAVDSANGVINATSNPNMDANAINQIATQVTSTKNALDGTHNLTQAKQTATNAIDGAT
NLNKAQKDALKAQVTSAQRVANVTSIQQTANELNTAMGQLQHGIDDENATKQTQKYRDAEQSKKTAYDQAVAAAKAILNK
QTGSNSDKAAVDRALQQVTSTKDALNGDAKLAEAKAAARQNLGTLNHITNAQRTALEGQINQATTVDGVNTVKTNANTLD
GAMNSLQGAINDKDATLRNQNYLDADESKRNAYTQAVTAAEGILNKQTGGNTSKADVDNALNAVTRAKAALNGAENLRNA
KTSATNTINGLPNLTQLQKDNLKHQVEQAQNVVGVNGVKDKGNTLNTAMGALRTSIQNDNTTKTSQNYLDASDSNKNNYN
TAVNNANGVINATNNPNMDANAINDMANQVNTTKAALNGAQNLAQAKTNATNTINNAQDLNQKQKDALKTQVNNAQRVSD
ANNVQHTATELNGAMTALKAAIADKERTKASGNYVNADQEKRQAYDSKVTNAENIINGTPNATLTVNDVNSAASQVNAAK
TALNGDNNLRVAKEHANNTIDGLAQLNNVQKAKLKEQVQSATTLDGVQTVKNSSQTLNTAMKGLRDSIANEATIKAGQNY
TDASPNNRNEYDSAVTAAKAIINQTSNPTMEPNTITQATSQVTTKEHALNGAQNLAQAKTTAKNNLNNLTSINNAQKDAL
TRNIDGATTVAGVNQETAKATELNNAMHSLQNGINDETQTKQTQKYLDAEPSKKSAYDQAVNAAKAILTKASGQNVDKAA
VEQALQNVNSTKTALNGDAKLNEAKAAAKQTLGTLTHINNAQRNALDNEITQATNVEGVNTVKAKAQQLDGAMGQLETSI
RDKDTTLQSQNYQDADDAKRTAYSQAVNAAATILNKTAGGNTPKADVERAMQAVTQANTALNGIQNLERAKQAANTAITN
ASDLNTKQKEALKAQVTSAGRVSAANGVEHTATELNTAMTALKRAIADKADTKASGNYVNADANKRQAYDEKVTAAEHIV
SGTPTPTLTPSDVTNAATQVTNAKTQLNGNHNLEVAKQNANTAIDGLTSLNGPQKAKLKEQVGQATTLPNVQTVRDNAQT
LNTAMKGLRDSIANEATIKAGQNYTDASQNKQNDYNNAVTAAKAIIGQTTSPSMIAQEINQAKDQVTAKQQALNGQENLR
TAQTNAKQHLNGLSDLTNAQKDAAKRQIEGATHVNEVTQAQNNADALNTAMTNLKNGIQDQNTIKQGVNFTDADEAKRNA
YTNAVTQAEQILNKAQGPNTAKDGVETALQNVQRAKNELNGNQNVANAKTTAKNALNNLTSINNAQKAALKSQIEGATTV
AGVNQVSTMASELNTAMSNLQRGINDEAATKAAQKYTEADRDKQTAYNDAVTAAKTLLDKTAGSNDNKVAVEQALQRVNT
AKTALNGDARLNEAKNTAKQQLATMSHLTNAQKANLTEQIERGTTVAGVQGIQANAGTLNQAMNQLRQSIASKDATKSSE
DYQDANADLQNAYNDAVTNAEGIISATNNPEMNPDTINQKASQVNSAKSALNGDEKLAAVKQTAKSDIGRLTDLNNAQRT
AANAEVDQAPNLAAVTAAKNKATSLNTAMGNLKHALAEKDNTKRSVNYTDADQPKQQAYDTAVTQAEAITNANGSNANET
QVQAALNQLNQAKNDLNGDNKVAQAKETAKRALASYSNLNNAQSTAATSQIDNATTVADVTAAQNTANELNTAMGQLQNG
INDQNTVKQQVNFTDADQGKKDAYTNAVTNAQGILDKANGQNMTKAQVEAALNQVTTAKNALNGDANVRQAKSDAKANLG
TLTHLNNAQKQDLTSQIEGATTVNGVNSVKTKAQDLDGAMQRLESAIANKDQTKASENYIDADPTKKTAFDNAITQAESY
LNKDHGTNKDKQAVEQAIQSVTSTENALNGDANLQCAKTEATQAIDNLTQLNTPQKTALKQQVNAAQRVSGVTDLKNSAT
SLNNAMDQLKQAIGDHDTIVAGGNYTNASPDKQGAYTDAYNAAKNIVNGSPNVITNAADVTAATQRVNNAETSLNGDTNL
ATAKQQAKDALRQMTHLSDAQKQSITGQIDSATQVTGVQSVKDNATNLDNAMNQLRNSIANKDEVKASQPYVDADTDKQN
AYNTAVTSAENIINATSQPTLDPSAVTQAANQVNTNKTALNGAQNLANKKQETTANINRLSHLNNAQKQDLNTQVTNAPN
ISTVNQVKTKAEQLDQAMERLINGIQDKDQVKQSVNFTDADPEKQTAYNNAVTAAENIINQANGTNANQSQVEAALSTVT
TTKQALNGDRKVTDAKNNANQTLSTLDNLNNAQKGAVTGNINQAHTVAEVTQAIQTAQELNTAMGNLKNSLNDKDTTLGS
QNFADADPEKKNAYNEAVRNAENILNKSTGTNVPKDQVEAAMNQVNTTKAALNGTQNLEKAKQHANTAIDGLSHLTNAQK
EALKQLVQQSTTVAEAQGNEQKANNVDAAMDKLRQSIADNATTKQNQNYTDASPNKKDAYNNAVTTAQGIIDQTTNPSLD
PTVINQAAGQVSTSKNALNGNENLEAAKQQATQSLGSLDNLNNAQKQAVTNQINGAHTVDEANQIKQNAQNLNTAMGNLK
QAIADKDATKATVNFTDADQAKQQAYNTAVTNAENIISKANGGNATQTEVEQAIQQVNAAKQALNGNANVQHAKDEATAL
INNSNDLNQAQKDALKQQVQNATTVAGVNNVKQTAQELNNAMTQLKQGIADKEQTKADGNFVNADSDKQNAYNQAVAKAE
ALISGTPDVVVTPSEITAALNKVTQAKNDLNGNTNLATAKQNVQHAIDQLPNLNQAQRDEYSKQITQATLVPNVNAIQQA
ATTLNDAMTQLKQGIANKAQIKGSENYHDADTDKQTAYDNAVTKAEELLKQTTNPTMDPNTIQQALTKVNDTNQALNGNQ
KLADAKQDAKTTLGTLDHLNDAQKQALTTQVEQAPDIATVNNVKQNAQNLNNAMTNLNNALQDKTETLNSINFTDADQAK
KDDYTNAVSHAEGILSKANGSNASQTEVEQAMQRVNEAKQALNGNDNVQRAKDAAKQVITNANDLNQAQKDALKQQVDAA
QTVANVNTIKQTAQDLNQAMTQLKQGIADKDQTKANGNFVNADTDKQNAYNNAVAHAEQIISGTPNANVDPQQVAQALQQ
VNQAKGDLNGNHNLQVAKDNANTAIDQLPNLNQPQKTALKDQVSHAELVTGVNAIKQNADALNNAMGTLKQQIQANSQVP
QSVDFTQADQDKQQAYNNAANQAQQIANGTPTPVLAPDTVTKAVTTMNQAKDALNGDEKLAQAKQDALANLDTLRDLNQP
QRDALRNQINQAQALATVEQTKQNAQNVNTAMGNLKQGIANKDTVKASENYHDADVDKQTAYTNAVSQAEGIINQTTNPT
LNPDDITRALTQVTDAKNSLNGEAKLATEKQNAKDAVSGMTHLNDAQKQALKGQIDQSPEIATVNQVKQTATSLDQAMDQ
LSQAINDKDQILADGNYLNADPDKQNAYKQAVAKAEALLNKQSGTNEVQAQVESITNEVNAAKQALNGNDNLANAKQQAK
QQLANLTHLNDAQKQSFESQITQAPLVTDVTTINQKAQTLDHAMELLRNSVADNQTTLASEDYHDATAQRQNDYNKAVTA
ANNIINQTTSPTMNPDDVNGATTQVNNTKVALDGDENLAAAKQQANNRLDQLDHLNNAQKQQLQSQITQSSDIAAVNGHK
QTAESLNTAMGNLINAIADHQAVEQRGNFINADTDKQTAYNTAVNEAAAMINKQTGQNANQTEVEQAITKVQTTLQALNG
DHNLQVAKTNATQAIDVLTSLNDPQKTALKDQVTAATLVTAVHQIEQNANTLNQAMHGLRQSIQDNAATKANSKYINEDQ
PEQQNYDQAVQAANNIINEQTATLDNNAINQVAATVNTTKAALHGDVKLQNDKDHAKQTVSQLAHLNNAQKHMEDTLIDS
ETTRTAVKQDLTEVQALDQLMDALQQSIADKDATRASSAYVNAEPNKKQAYDEAVQNAESIIAGLNNPTINKGNVSSATQ
AVISSKNALDGVERLAQDKQTAGNSLNHLDQLTPAQQQALENQINNATTCDKVAEIIAQAQALNEAMKALKESIKDQPQT
EASSKFINEDQAQKDAYTQAVQHAKDLINKTTDPTLAKSIIDQATQAVTDAKNNLHGDQKLAQDKQRATETLNNLSNLNT
PQRQALENQINNAATRGEVAQKLTEAQALNQAMEALRNSIQDQQQTESGSKFINEDKPQKDAYQAAVQNAKDLINQTGNP
TLDKAQVEQLTHAFKQAKDNLHGDQKLADDKQHAVTDLNQLNGLNNPQRQALESQINNAATRGEVAQKLAEAKALDQAMQ
ALRNSIQDQQQTEAGSKFINEDKPQKDAYQAAVQNAKDLINQTGNPTLDKSQVEQLTQAVTTAKDNLHGDQKLARDQQQA
VTTVNALPNLNHAQQQTLTDAINAAPTRTEVAQHVQTATELDHAMETLKNKVDQVNTDKAQPNYTEASTDKKEAVDQALQ
AAQSITDPTNGSNANKDAVEQALTKLQEKVNELNGNERVAEAKTQAKQTIDQLTHLNADQIATAKQNIDQATKLQPIAEL
VDQATQLNQSMDQLQQAVNEHANVEQTIDYTQADSDKQKAYKQAIADAENVLKQNANKQQVDQALQNILNAKQALNGDER
VALAKTNGKHDIDQLNALNNAQQDGFKGRIDQSNDLNQIQQIVDEAKALNRAMDQLSQEITGNEGRTKGSTNYVNADTQV
KQVYDEAVDKAKQALDKSSGQNLTAEQVIKLNDAVTAAKKALNGEERLNNRKAEALQRLDQLTHLNNAQRQLAIQQINNA
ETLNKASRAINRATKLDNAMGAVQQYIDEQHLGVISSTNYINADDNLKANYDNAIANAAHELDKVQGNAIAKAEAEQLKQ
NIIDAQNALNGDQNLANAKDKANAFVNSLNGLNQQQQDLAHKAINNADTVSDVTDIVNNQIDLNDAMETLKHLVDNEIPN
AEQTVNYQNADDNAKTNFDDAKRLANTLLNSDNTNVNDINGAIQAVNDAIHNLNGDQRLQDAKDKAIQSINQALANKLKE
IEASNATDQDKLIAKNKAEELANSIINNINKATSNQAVSQVQTAGNHAIEQVHANEIPKAKIDANKDVDKQVQALIDEID
RNPNLTDKEKQALKDRINQILQQGHNDINNALTKEEIEQAKAQLAQALQDIKDLVKAKEDAKQDVDKQVQALIDEIDQNP
NLTDKEKQALKDRINQILQQGHNGINNAMTKEEIEQAKAQLAQALKEIKDLVKAKENAKQDVDKQVQALIDEIDQNPNLT
DKEKQALKDRINQILQQGHNDINNAMTKEEIEQAKAQLAQALQDIKDLVKAKEDAKNAIKALANAKRDQINSNPDLTPEQ
KAKALKEIDEAEKRALQNVENAQTIDQLNRGLNLGLDDIRNTHVWEVDEQPAVNEIFEATPEQILVNGELIVHRDDIITE
QDILAHINLIDQLSAEVIDTPSTATISDSLTAKVEVTLLDGSKVIVNVPVKVVEKELSVVKQQAIESIENAAQQKIDEIN
NSVTLTLEQKEAAIAEVNKLKQQAIDHVNNAPDVHSVEEIQQQEQAYIEQFNPEQFTIEQAKSNAIKSIEDAIQHMIDEI
KARTDLTDKEKQEAIAKLNQLKEQAIQAIQRAQSISEITEQLEQFKAQMKAANPTAKELAKRKQEAISRIKDFSNEKINS
IRNSEIGTADEKQAAMNQINEIVLETIRDINNAHTLQQVEAALNNGIARISAVQIVISDRAKQSSSTGNESNSHLTIGYG
TANHPFNSSTIGHKKKLDEDDDIDPLHMRHFSNNFGNVIKNAIGVVGISGLLASFWFFIAKRRRKEDEEEELEIRDNNKD
SIKETLDDTKHLPLLFAKRRRKEDEEDVTVEEKDSLNNGESLDKVKHTPFFLPKRRRKEDEEDVEVTNENTDEKVLKDNE
HSPLLFAKRRKDKEEDVETTTSIESKDEDVPLLLAKKKNQKDNQSKDKKSASKNTSKKVAAKKKKKKSKKNKK
>Q2FYJ6 ~~~ebh~~~Extracellular matrix-binding protein ebh~~~
MNYRDKIQKFSIRKYTVGTFSTVIATLVFLGFNTSQAHAAETNQPASVVKQKQQSNNEQTENRESQVQNSQNSQNGQSLS
ATHENEQPNISQANLVDQKVAQSSTTNDEQPASQNVNTKKDSATAATTQPDKEQSKHKQNESQSANKNGNDNRAAHVENH
EANVVTASDSSDNGNVQHDRNELQAFFDANYHDYRFIDRENADSGTFNYVKGIFDKINTLLGSNDPINNKDLQLAYKELE
QAVALIRTMPQRQQTSRRSNRIQTRSVESRAAEPRSVSDYQNANSSYYVENANDGSGYPVGTYINASSKGAPYNLPTTPW
NTLKASDSKEIALMTAKQTGDGYQWVIKFNKGHAPHQNMIFWFALPADQVPVGRTDFVTVNSDGTNVQWSHGAGAGANKP
LQQMWEYGVNDPHRSHDFKIRNRSGQVIYDWPTVHIYSLEDLSRASDYFSEAGATPATKAFGRQNFEYINGQKPAESPGV
PKVYTFIGQGDASYTISFKTQGPTVNKLYYAAGGRALEYNQLFMYSQLYVESTQDHQQRLNGLRQVVNRTYRIGTTKRVE
VSQGNVQTKKVLESTNLNIDDFVDDPLSYVKTPSNKVLGFYSNNANTNAFRPGGAQQLNEYQLSQLFTDQKLQEAARTRN
PIRLMIGFDYPDAYGNSETLVPVNLTVLPEIQHNIKFFKNDDTQNIAEKPFSKQAGHPVFYVYAGNQGNASVNLGGSVTS
IQPLRINLTSNENFTDKDWQITGIPRTLHIENSTNRPNNARERNIELVGNLLPGDYFGTIRFGRKEQLFEIRVKPHTPTI
TTTAEQLRGTALQKVPVNISGIPLDPSALVYLVAPTNQTTNGGSEADQIPSGYTILATGTPDGVHNTITIRPQDYVVFIP
PVGKQIRAVVYYNKVVASNMSNAVTILPDDIPPTINNPVGINAKYYRGDEVNFTMGVSDRHSGIKNTTITTLPNGWTSNL
TKADKNNGSLSITGRVSMNQAFNSDITFKVSATDNVNNTTNDSQSKHVSIHVGKISEDAHPIVLGNTEKVVVVNPTAVSN
DEKQSIITAFMNKNQNIRGYLASTDPVTVDNNGNVTLHYRDGSSTTLDATNVMTYEPVVKPEYQTVNAAKTATVTIAKGQ
SFSIGDIKQYFTLSNGQPIPSGTFTNITSDRTIPTAQEVSQMNAGTQLYHITATNAYHKDSEDFYISLKIIDVKQPEGDQ
RVYRTSTYDLTTDEISKVKQAFINANRDVITLAEGDISVTNTPNGANVSTITVNINKGRLTKSFASNLANMNFLRWVNFP
QDYTVTWTNAKIANRPTDGGLSWSDDHKSLIYRYDATLGTQITTNDILTMLKATTTVPGLRNNITGNEKSQAEAGGRPNF
RTTGYSQSNATTDGQRQFTLNGQVIQVLDIINPSNGYGGQPVTNSNTRANHSNSTVVNVNEPAANGAGAFTIDHVVKSNS
THNASDAVYKAQLYLTPYGPKQYVEHLNQNTGNTTDAINIYFVPSDLVNPTISVGNYTNHQVFSGETFTNTITANDNFGV
QSVTVPNTSQITGTVDNNHQHVSATAPNVTSATNKTINLLATDTSGNTATTSFNVTVKPLRDKYRVGTSSTAANPVRIAN
ISNNATVSQADQTTIINSLTFTETVPNRSYARASANEITSKTVSNVSRTGNNANVTVTVTYQDGTTSTVTVPVKHVIPEI
VAHSHYTVQGQDFPAGNGSSASDYFKLSNGSDIADATITWVSGQAPNKDNTRIGEDITVTAHILIDGETTPITKTATYKV
VRTVPKHVFETARGVLYPGVSDMYDAKQYVKPVNNSWSTNAQHMNFQFVGTYGPNKDVVGISTRLIRVTYDNRQTEDLTI
LSKVKPDPPRIDANSVTYKAGLTNQEIKVNNVLNNSSVKLFKADNTPLNVTNITHGSGFSSVVTVSDALPNGGIKAKSSI
SMNNVTYTTQDEHGQVVTVTRNESVDSNDSATVTVTPQLQATTEGAVFIKGGDGFDFGHVERFIQNPPHGATVAWHDSPD
TWKNTVGNTHKTAVVTLPNGQGTRNVEVPVKVYPVANAKAPSRDVKGQNLTNGTDAMNYITFDPNTNTNGITAAWANRQQ
PNNQQAGVQHLNVDVTYPGISAAKRVPVTVNVYQFEFPQTTYTTTVGGTLASGTQASGYAHMQNATGLPTDGFTYKWNRD
TTGTNDANWSAMNKPNVAKVVNAKYDVIYNGHTFATSLPAKFVVKDVQPAKPTVTETAAGAITIAPGANQTVNTHAGNVT
TYADKLVIKRNGNVVTTFTRRNNTSPWVKEASAATVAGIAGTNNGITVAAGTFNPADTIQVVATQGSGETVSDEQRSDDF
TVVAPQPNQATTKIWQNGHIDITPNNPSGHLINPTQAMDIAYTEKVGNGAEHSKTINVVRGQNNQWTIANKPDYVTLDAQ
TGKVTFNANTIKPNSSITITPKAGTGHSVSSNPSTLTAPAAHTVNTTEIVKDYGSNVTAAEINNAVQVANKRTATIKNGT
AMPTNLAGGSTTTIPVTVTYNDGSTEEVQESIFTKADKRELITAKNHLDDPVSTEGKKPGTITQYNNAMHNAQQQINTAK
TEAQQVINNERATPQQVSDALTKVRAAQTKIDQAKALLQNKEDNSQLVTSKNNLQSSVNQVPSTAGMTQQSIDNYNAKKR
EAETEITAAQRVIDNGDATAQQISDEKHRVDNALTALNQAKHDLTADTHALEQAVQQLNRTGTTTGKKPASITAYNNSIR
ALQSDLTSAKNSANAIIQKPIRTVQEVQSALTNVNRVNERLTQAINQLVPLADNSALKTAKTKLDEEINKSVTTDGMTQS
SIQAYENAKRAGQTESTNAQNVINNGDATDQQIAAEKTKVEEKYNSLKQAIAGLTPDLAPLQTAKTQLQNDIDQPTSTTG
MTSASIAAFNEKLSAARTKIQEIDRVLASHPDVATIRQNVTAANAAKSALDQARNGLTVDKAPLENAKNQLQHSIDTQTS
TTGMTQDSINAYNAKLTAARNKIQQINQVLAGSPTVEQINTNTSTANQAKSDLDHARQALTPDKAPLQTAKTQLEQSINQ
PTDTTGMTTASLNAYNQKLQAARQKLTEINQVLNGNPTVQNINDKVTEANQAKDQLNTARQGLTLDRQPALTTLHGASNL
NQAQQNNFTQQINAAQNHAALETIKSNITALNTAMTKLKDSVADNNTIKSDQNYTDATPANKQAYDNAVNAAKGVIGETT
NPTMDVNTVNQKAASVKSTKDALDGQQNLQRAKTEATNAITHASDLNQAQKNALTQQVNSAQNVQAVNDIKQTTQSLNTA
MTGLKRGVANHNQVVQSDNYVNADTNKKNDYNNAYNHANDIINGNAQHPVITPSDVNNALSNVTSKEHALNGEAKLNAAK
QEANTALGHLNNLNNAQRQNLQSQINGAHQIDAVNTIKQNATNLNSAMGNLRQAVADKDQVKRTEDYADADTAKQNAYNS
AVSSAETIINQTTNPTMSVDDVNRATSAVTSNKNALNGYEKLAQSKTDAARAIDALPHLNNAQKADVKSKINAASNIAGV
NTVKQQGTDLNTAMGNLQGAINDEQTTLNSQNYQDATPSKKTAYTNAVQAAKDILNKSNGQNKTKDQVTEAMNQVNSAKN
NLDGTRLLDQAKQTAKQQLNNMTHLTTAQKTNLTNQINSGTTVAGVQTVQSNANTLDQAMNTLRQSIANKDATKASEDYV
DANNDKQTAYNNAVAAAETIINANSNPEMNPSTITQKAEQVNSSKTALNGDENLAAAKQNAKTYLNTLTSITDAQKNNLI
SQITSATRVSGVDTVKQNAQHLDQAMASLQNGINNESQVKSSEKYRDADTNKQQEYDNAITAAKAILNKSTGPNTAQNAV
EAALQRVNNAKDALNGDAKLIAAQNAAKQHLGTLTHITTAQRNDLTNQISQATNLAGVESVKQNANSLDGAMGNLQTAIN
DKSGTLASQNFLDADEQKRNAYNQAVSAAETILNKQTGPNTAKTAVEQALNNVNNAKHALNGTQNLNNAKQAAITAINGA
SDLNQKQKDALKAQANGAQRVSNAQDVQHNATELNTAMGTLKHAIADKTNTLASSKYVNADSTKQNAYTTKVTNAEHIIS
GTPTVVTTPSEVTAAANQVNSAKQELNGDERLREAKQNANTAIDALTQLNTPQKAKLKEQVGQANRLEDVQTVQTNGQAL
NNAMKGLRDSIANETTVKTSQNYTDASPNNQSTYNSAVSNAKGIINQTNNPTMDTSAITQATTQVNNAKNGLNGAENLRN
AQNTAKQNLNTLSHLTNNQKSAISSQIDRAGHVSEVTATKNAATELNTQMGNLEQAIHDQNTVKQSVKFTDADKAKRDAY
TNAVSRAEAILNKTQGANTSKQDVEAAIQNVSSAKNALNGDQNVTNAKNAAKNALNNLTSINNAQKRDLTTKIDQATTVA
GVEAVSNTSTQLNTAMANLQNGINDKTNTLASENYHDADSDKKTAYTQAVTNAENILNKNSGSNLDKTAVENALSQVANA
KGALNGNHNLEQAKSNANTTINGLQHLTTAQKDKLKQQVQQAQNVAGVDTVKSSANTLNGAMGTLRNSIQDNTATKNGQN
YLDATERNKTNYNNAVDSANGVINATSNPNMDANAINQIATQVTSTKNALDGTHNLTQAKQTATNAIDGATNLNKAQKDA
LKAQVTSAQRVANVTSIQQTANELNTAMGQLQHGIDDENATKQTQKYRDAEQSKKTAYDQAVAAAKAILNKQTGSNSDKA
AVDRALQQVTSTKDALNGDAKLAEAKAAAKQNLGTLNHITNAQRTDLEGQINQATTVDGVNTVKTNANTLDGAMNSLQGS
INDKDATLRNQNYLDADESKRNAYTQAVTAAEGILNKQTGGNTSKADVDNALNAVTRAKAALNGADNLRNAKTSATNTID
GLPNLTQLQKDNLKHQVEQAQNVAGVNGVKDKGNTLNTAMGALRTSIQNDNTTKTSQNYLDASDSNKNNYNTAVNNANGV
INATNNPNMDANAINGMANQVNTTKAALNGAQNLAQAKTNATNTINNAHDLNQKQKDALKTQVNNAQRVSDANNVQHTAT
ELNSAMTALKAAIADKERTKASGNYVNADQEKRQAYDSKVTNAENIISGTPNATLTVNDVNSAASQVNAAKTALNGDNNL
RVAKEHANNTIDGLAQLNNAQKAKLKEQVQSATTLDGVQTVKNSSQTLNTAMKGLRDSIANEATIKAGQNYTDASPNNRN
EYDSAVTAAKAIINQTSNPTMEPNTITQVTSQVTTKEQALNGARNLAQAKTTAKNNLNNLTSINNAQKDALTRSIDGATT
VAGVNQETAKATELNNAMHSLQNGINDETQTKQTQKYLDAEPSKKSAYDQAVNAAKAILTKASGQNVDKAAVEQALQNVN
STKTALNGDAKLNEAKAAAKQTLGTLTHINNAQRTALDNEITQATNVEGVNTVKAKAQQLDGAMGQLETSIRDKDTTLQS
QNYQDADDAKRTAYSQAVNAAATILNKTAGGNTPKADVERAMQAVTQANTALNGIQNLDRAKQAANTAITNASDLNTKQK
EALKAQVTSAGRVSAANGVEHTATELNTAMTALKRAIADKAETKASGNYVNADANKRQAYDEKVTAAENIVSGTPTPTLT
PADVTNAATQVTNAKTQLNGNHNLEVAKQNANTAIDGLTSLNGPQKAKLKEQVGQATTLPNVQTVRDNAQTLNTAMKGLR
DSIANEATIKAGQNYTDASQNKQTDYNSAVTAAKAIIGQTTSPSMNAQEINQAKDQVTAKQQALNGQENLRTAQTNAKQH
LNGLSDLTDAQKDAVKRQIEGATHVNEVTQAQNNADALNTAMTNLKNGIQDQNTIKQGVNFTDADEAKRNAYTNAVTQAE
QILNKAQGPNTSKDGVETALENVQRAKNELNGNQNVANAKTTAKNALNNLTSINNAQKEALKSQIEGATTVAGVNQVSTT
ASELNTAMSNLQNGINDEAATKAALNGTQNLEKAKQHANTAIDGLSHLTNAQKEALKQLVQQSTTVAEAQGNEQKANNVD
AAMDKLRQSIADNATTKQNQNYTDASQNKKDAYNNAVTTAQGIIDQTTSPTLDPTVINQAAGQVSTTKNALNGNENLEAA
KQQASQSLGSLDNLNNAQKQTVTDQINGAHTVDEANQIKQNAQNLNTAMGNLKQAIADKDATKATVNFTDADQAKQQAYN
TAVTNAENIISKANGGNATQAEVEQAIKQVNAAKQALNGNANVQHAKDEATALINSSNDLNQAQKDALKQQVQNATTVAG
VNNVKQTAQELNNAMTQLKQGIADKEQTKADGNFVNADPDKQNAYNQAVAKAEALISATPDVVVTPSEITAALNKVTQAK
NDLNGNTNLATAKQNVQHAIDQLPNLNQAQRDEYSKQITQATLVPNVNAIQQAATTLNDAMTQLKQGIANKAQIKGSENY
HDADTDKQTAYDNAVTKAEELLKQTTNPTMDPNTIQQALTKVNDTNQALNGNQKLADAKQDAKTTLGTLDHLNDAQKQAL
TTQVEQAPDIATVNNVKQNAQNLNNAMTNLNNALQDKTETLNSINFTDADQAKKDAYTNAVSHAEGILSKANGSNASQTE
VEQAMQRVNEAKQALNGNDNVQRAKDAAKQVITNANDLNQAMTQLKQGIADKDQTKANGNFVNADTDKQNAYNNAVAHAE
QIISGTPNANVDPQQVAQALQQVNQAKGDLNGNHNLQVAKDNANTAIDQLPNLNQPQKTALKDQVSHAELVTGVNAIKQN
ADALNNAMGTLKQQIQANSQVPQSVDFTQADQDKQQAYNNAANQAQQIANGIPTPVLTPDTVTQAVTTMNQAKDALNGDE
KLAQAKQEALANLDTLRDLNQPQRDALRNQINQAQALATVEQTKQNAQNVNTAMSNLKQGIANKDTVKASENYHDADADK
QTAYTNAVSQAEGIINQTTNPTLNPDEITRALTQVTDAKNGLNGEAKLATEKQNAKDAVSGMTHLNDAQKQALKGQIDQS
PEIATVNQVKQTATSLDQAMDQLSQAINDKAQTLADGNYLNADPDKQNAYKQAVAKAEALLNKQSGTNEVQAQVESITNE
VNAAKQALNGNDNLANAKQQAKQQLANLTHLNDAQKQSFESQITQAPLVTDVTTINQKAQTLDHAMELLRNSVADNQTTL
ASEDYHDATAQRQNDYNQAVTAANNIINQTTSPTMNPDDVNGATTQVNNTKVALDGDENLAAAKQQANNRLDQLDHLNNA
QKQQLQSQITQSSDIAAVNGHKQTAESLNTAMGNLINAIADHQAVEQRGNFINADTDKQTAYNTAVNEAAAMINKQTGQN
ANQTEVEQAITKVQTTLQALNGDHNLQVAKTNATQAIDALTSLNDPQKTALKDQVTAATLVTAVHQIEQNANTLNQAMHG
LRQSIQDNAATKANSKYINEDQPEQQNYDQAVQAANNIINEQTATLDNNAINQAATTVNTTKAALHGDVKLQNDKDHAKQ
TVSQLAHLNNAQKHMEDTLIDSETTRTAVKQDLTEAQALDQLMDALQQSIADKDATRASSAYVNAEPNKKQSYDEAVQNA
ESIIAGLNNPTINKGNVSSATQAVISSKNALDGVERLAQDKQTAGNSLNHLDQLTPAQQQALENQINNATTRDKVAEIIA
QAQALNEAMKALKESIKDQPQTEASSKFINEDQAQKDAYTQAVQHAKDLINKTTDPTLAKSIIDQATQAVTDAKNNLHGD
QKLAQDKQRATETLNNLSNLNTPQRQALENQINNAATRGEVAQKLTEAQALNQAMEALRNSIQDQQQTEAGSKFINEDKP
QKDAYQAAVQNAKDLINQTNNPTLDKAQVEQLTQAVNQAKDNLHGDQKLADDKQHAVTDLNQLNGLNNPQRQALESQINN
AATRGEVAQKLAEAKALDQAMQALRNSIQDQQQTESGSKFINEDKPQKDAYQAAVQNAKDLINQTGNPTLDKSQVEQLTQ
AVTTAKDNLHGDQKLARDQQQAVTTVNALPNLNHAQQQALTDAINAAPTRTEVAQHVQTATELDHAMETLKNKVDQVNTD
KAQPNYTEASTDKKEAVDQALQAAESITDPTNGSNANKDAVDQVLTKLQEKENELNGNERVAEAKTQAKQTIDQLTHLNA
DQIATAKQNIDQATKLQPIAELVDQATQLNQSMDQLQQAVNEHANVEQTVDYTQADSDKQNAYKQAIADAENVLKQNANK
QQVDQALQNILNAKQALNGDERVALAKTNGKHDIDQLNALNNAQQDGFKGRIDQSNDLNQIQQIVDEAKALNRAMDQLSQ
EITDNEGRTKGSTNYVNADTQVKQVYDETVDKAKQALDKSTGQNLTAKQVIKLNDAVTAAKKALNGEERLNNRKAEALQR
LDQLTHLNNAQRQLAIQQINNAETLNKASRAINRATKLDNAMGSVQQYIDEQHLGVISSTNYINADDNLKANYDNAIANA
AHELDKVQGNAIAKAEAEQLKQNIIDAQNALNGDQNLANAKDKANAFVNSLNGLNQQQQDLAHKAINNADTVSDVTDIVN
NQIDLNDAMETLKHLVDNEIPNAEQTVNYQNADDNAKTNFDDAKRLANTLLNSDNTNVNDINGAIQAVNDAIHNLNGDQR
LQDAKDKAIQSINQALANKLKEIEASNATDQDKLIAKNKAEELANSIINNINKATSNQAVSQVQTAGNHAIEQVHANEIP
KAKIDANKDVDKQVQALIDEIDRNPNLTDKEKQALKDRINQILQQGHNGINNAMTKEEIEQAKAQLAQALQDIKDLVKAK
EDAKQDVDKQVQALIDEIDQNPNLTDKEKQALKDRINQILQQGHNDINNAMTKEAIEQAKERLAQALQDIKDLVKAKEDA
KNDIDKRVQALIDEIDQNPNLTDKEKQALKDRINQILQQGHNDINNALTKEEIEQAKAQLAQALQDIKDLVKAKEDAKNA
IKALANAKRDQINSNPDLTPEQKAKALKEIDEAEKRALQNVENAQTIDQLNRGLNLGLDDIRNTHVWEVDEQPAVNEIFE
ATPEQILVNGELIVHRDDIITEQDILAHINLIDQLSAEVIDTPSTATISDSLTAKVEVTLLDGSKVIVNVPVKVVEKELS
VVKQQAIESIENAAQQKINEINNSVTLTLEQKEAAIAEVNKLKQQAIDHVNNAPDVHSVEEIQQQEQAHIEQFNPEQFTI
EQAKSNAIKSIEDAIQHMIDEIKARTDLTDKEKQEAIAKLNQLKEQAIQAIQRAQSIDEISEQLEQFKAQMKAANPTAKE
LAKRKQEAISRIKDFSNEKINSIRNSEIGTADEKQAAMNQINEIVLETIRDINNAHTLQQVEAALNNGIARISAVQIVTS
DRAKQSSSTGNESNSHLTIGYGTANHPFNSSTIGHKKKLDEDDDIDPLHMRHFSNNFGNVIKNAIGVVGISGLLASFWFF
IAKRRRKEDEEEELEIRDNNKDSIKETLDDTKHLPLLFAKRRRKEDEEDVTVEEKDSLNNGESLDKVKHTPFFLPKRRRK
EDEEDVEVTNENTDEKVLKDNEHSPLLFAKRRKDKEEDVETTTSIESKDEDVPLLLAKKKNQKDNQSKDKKSASKNTSKK
VAAKKKKKKAKKNKK
>Q8NWQ6 ~~~ebh~~~Extracellular matrix-binding protein ebh~~~
MNYRDKIQKFSIRKYTVGTFSTVIATLVFLGFNTSQAHAAETNQPASVVKQKQQSNNEQTENRESQVQNSQNSQNSQSLS
ATHENEQPNISQANLVDQKVAQSSTTNDEQPASQNVNTKKDSATAATTQPDKEEGKHKQNESQSANKNGNDNRAAHLENH
EANVVTASDSSDNGNVQHDRNELQAFFDANYHDYRFIDRENADSGTFNYVKGIFEKINTLLGSNDPINNKDLQLAYKELE
QAVALIRTMPQRQQTSRRSNRIQTRSVESRAAEPRSVSDYQNANSSYYVENANDGSGYPVGTYINASSKGAPYNLPTTPW
NTLKASDSKEIALMTAKQTGDGYQWVIKFNKGHAPHQNMIFWFALPADQVPVGRTDFVTVNSDGTNVQWSHGAGAGANKP
LQQMWEYGVNDPDRSHDFKIRNRSGQVIYSWPTVHVYSLEDLSRASDYFSEAGATAATKAFGRQNFEYINGQKPAESPGV
PKVYTFIGQGDASYTISFKTQGPTVNKLYYAAGGRALEYNQLFMYSQLYVESTQDHQQRLNGLRQVVNRTYRIGTTKRVE
VSQGNVQTKKVLESTNLNIDGFVDDPLSYVKTPSNKVLGFYPTNATTNAFRPGGVQELNEYQLSQLFTDQKLQEAARTRN
PIRLMIGFDYPDGYGNSETLVPVNLTVLPEIQHNIKFFKNDDTQNIAEKPFSKQAGHPVFYVYAGNQGNASVNLGGSVTS
IQPLRINLTSNENFTDKDWQITGIPRTLHIENSTNRTNNARERNIELVGNLLPGDYFGTIRFGRKEQLFEIRVKPHTPTI
TTTAEQLRGTALQKVPVNISGIPLDPSALVYLVAPTNQTTNGGSEADQIPSGYTILATGTPDGVHNTITIRPQDYVVFIP
PVGKQIRAVVYYNKVVASNMSNAVTILPDDIPPTINNPVGINAKYYRGDEVNFTMGVSDRHSGIKNTTITTLPNGWTSNL
TKSDNKNGSLAITGRVSMNQAFNSDITFKVSATDNVNNTTNDSQSKHVSIHVGKISEDAHPIVLGNTEKVVVVNPTAVSN
DEKQSIITAFMNKNQNIRGYLASTDPVTVDNNGNVTLHYRDGSSTTLDATNVMTYEPVVKSEYQTANAAKTATVTIAKGQ
SFNIGDIKQYFTLSNGQAIPSGTFTNITSDRTILTAQEVSQMNAGTQLYHIVASNAYHKDTEDFYISLKIVDVKQPEGDQ
RVYRTSTYDLTTDEISKVKQAFINANRDVITLAEGNISVTNTPNGANVSTITVNINKGRLTKSFASNLANMNFLRWVNFP
QDYTVTWTNAKIANRPTDGGLSWSDDHKSLIYRYDATLGTQITTNDILTMLKATTTVPGLRNNITGNEKAQAEAGGRPNF
RTTGYSQSNATTDGQRQFTLNGQVIQVLDIINPSNGYGGQPVTNSNTRANHSNSTVVNVNEPAANGASAFTIDHVVKSNS
THNASDAVYKAQLYLTPYGPKQYVEHLNQNTGNTTDAINIYFVPSDLVNPTISVGNYTNHQVFSGETFTNTITANDNFGV
QSVTVPNTSQITGTVDNNHQHVSATAPNVTSATNKTINLLATDTSGNTATTSFNVTVKPLRDKYRVGTSSTAANPVRIAN
ISNNATVSQADQTAIINSLTFTETVPNRSYARASANEITSKTVSNVSRTGNNANVTVTATYQDGTTSTVTVPVKHVIPEI
VAHSHYTVQGQDFPAGNGSSASDYFKLSNGSAIPDATITWVSGQAPNKDNTRIGEDITVTAHILIDGETTPITKTATYKV
VSTVPKHVFETNRGAVFPGVSDVYDAKQYVKPVNDSWTQNAQRMNFQFTNSYGPSKDVVGISTRDIRVTYDNHQTQIIKI
LAKVKPDPPRIDGNSVTYKAGLTNQQIKINNVLSSSSIKLFKADNTPLTITNTTYGSGNTAVVTVSDALPNGVIKARSSI
TMNNVTYTTQDEHGRAIDVTRNESVDSNDSATVTVTPQLQATTEGAVFIKGGDGFDFGHVERFIQNPPHGATVAWHDSPD
TWKNTVGNTHKTAVVTLPNGQGTRNVEVPVKVYPVANAKAPSRDVKGQNLTNGTDAMNYITFDPNTNTNGITAAWANRQQ
PNNQQAGVQHLNVDVTYPGISAAKRVPVTVNVYQFEFPQTTYTTTVGGTLASGTQASGYAHMQNATGLPTDGFTYKWNRD
TTGTNDANWSAMNKPNVAKVVNAKYDVIYNGHTFATSLPAKFVVKDVQPAKPTVTETAAGAITIAPGANQTVNTHAGNVT
TYADKLVIKRNGNVVTTFTRRNNTSPWVKEASAATVAGIAGTNNGITVAAGTFNPADTIQVVATQGSGETVSDEQRSDDF
TVVAPQPNQATTKIWQNGHIDITPNNPSGHLINPTQAMDIAYTEKVGNGAEHSKTINVVRGQNNQWTIANKPDYVTLDAQ
TGKVTFNANTIKPNSSITITPKAGTGHSVSSNPSTLTAPAAHTVNTTEIVKDYGSNVTAAEINNAVQVANKRTATIKNGT
AMPTNLAGGSTTTIPVTVTYNDGSTEEVQESIFTKADKRELITAKNHLDDPVSTEGKKPGTITQYNNAIHNAQQQINTAK
TEAQQVINNDRATPQQVSDALTKVRAAQTKIDQAKALLQNKEDNSQLVTSKNNLQSSVNQVPSTAGMTQQSIDNYNAKKR
EAETEITAAQRVIDNGDATPQQISEEKHRVDNALTALNQAKQNLTADTHTLEQAVQQLNRTGTTTGKKPASITAYNNSMH
ALQAELTSAKNSANAIIQKPIRSVQEVQTALTNVNRVNERLTQAINQLVPLADNSALRTAKTKLDEEINKSVTTDGMTQS
SIQAYENAKRAGQTESTNAQNVINNGDATDQQIAEEKTKVEEKYNSLKQAIAGLTPDLAPLQTAKTQLQNDIDQPTSTTG
MTSTSIAAFNEKLSAARTKIQEIDRVLASHPDVATIRQNVTAANAAKTALDQARNGLTVDKAPLENAKNQLQHSIDTQTS
TTGMTQDSINAYNAKLTAARNKIQQINQVLAGSPTVEQINTNTSAANQAKSDLDHARQALTPDKAPLQTAKTQLEQSINQ
PTDTTGMTTASLNAYNQKLQAARQKLTEINQVLNGNPTVQNINDKVTEANQAKDQLNTARQGLTLDRQPALTTLHGASNL
NQAQQNNFTQQINAAQNHAALETIKSNITALNTAMTKLKDSVADNNTIKSGQNYTDATPANKQAYDNAVNAAKGVIGETT
NPTMDVNTVNQKAASVKSTKDALDGQQNLQRAKTEATNAITHASDLNQAQKNALTQQVNSAQNVHAVNDIKQTTQSLNTA
MTGLKRGVANHNQVVQSDNYVNADTNKKNDYNNAYNHANDIINGNAQHPVITPSDVNNALSNVTSKEHALNGEAKLNAAK
QEANTALGQLNNLNNAQRQNLQSQINSAHQIETVNTIKQNATNLNSAMGNLRQAVADKDQVKRTEDYADADTAKQNAYNS
AVSSAETIINQTTNPTMSVDDVNRATSAVTSNKNALNGDEKLAQSKTDAARAIDALPHLNNAQKADVKSKINAASNIAGV
NTVKQQGTDLNTAMGNLQGAINDEQTTLNSQNYQDATPSKKTAYTNAVQAAKDILNKSNGQNKTKDQVTEAMNQLNSAKN
NLDGTRLLDQAKQTAKQQLNNMTHLTTAQKTNLTNQINSGTTVAGVHTVQSNANTLDQAMNTLRQSIANKDATKASEDYV
DANNDKQTAYNNAVAAAETIINANSNPEMNPSTITQKAEQVNSSKTALNGDENLATAKQNAKTYLNTLTSITDAQKNNLI
SQISSATRVSGVDTVKQNAQHLDQAMASLQSGINNESQVKSSEKYRDADTNKQQEYDNAITAAKAILNKQHGPNTAQNAV
EAALQRVNTAKDALNGDAKLIAAQNAAKQHLGTLTHITTAQRNDLTNQISQATNLAGVESVKQNANSLDGAMGNLQTAIN
DKSGTLASQNFLDADEQKRNAYNQAVSAAETILNKQTGPNTAKTAVEQALNNVNNAKHALNGTQNLNNAKQAAITAINGA
SDLNQHQKDALKAQANGAQRVSNAQDVQRNATELNTAMGTLKHAIADKTNTLASSKYVNADSTKQNAYTTKVTNAEHIIS
GTPTVVTTPSEVTAAANQVNSAKQELNGDERLRVAKQNANTAIDALTQLNTPQKAKLKEQVGQANTLDDAMNSLQGAIND
KDATLRNQNYLDADESKRNAYTQAVTAAEGILNKQTGGNTSKADVDNALNAVTRAKAALNGAENLRNAKTSATNTINGLP
NLTQLQKDNLKHQVEQAQNVAGVNGVKDKGNTLNTAMGALRTSIQNDNTTKTSQNYLDASDSNKNNYNTAVNNANGVINA
TNNPNMDANAINGMANQVNTTKAALNGVQNLAQAKTNATNTINNAHDLNQKQKDALKTQVNNAQRVSDANNVQHTATELN
GAMTALKAAIADKERTKASGNYVNADQEKRQAYDSKVTNAENIINGTPNATLTVNDVNSATSQVNAAKTALNGDNNLRVA
KENANNTIDGLAQLNNAQKAKLKEQVQSATTLEGVQTVKNSSQTLNTAMKGLRDSIANEATIKAGQNYTDASPTNRNEYD
SAVTAAKAIINQTSNPTMEPNTITQATSQVTTKEHALNGAQNLAQAKTTAKNNLNNLTSINNAQKDALTRSIDGATTVAG
VNQETAKATELNNAMHSLQNGINDETQTKQTQKYLDAEPNKKSAYDQAVNAAKAILTKASGQNVDKAAVEQALQNVNSTK
TALNGDAKLNEAKAAAKQTLGTLTHINNAQRNALDNEITQATNVEGVNTVKAKAQQLDGAMGQLETSIRDKDTTLQSQNY
QDADDAKRTAYSQAVNAAATILNKTAGGNTPKADVERAMQAVAQANTALNGIQNLERAKQAANTAITNASDLNTKQKEAL
KAQVTSAGRVSAANGVEHTATEINTAMTALKRAIADKADTKTSGNYVNADANKRQAYDEKVTAAESIVNGTPTPTLTPSD
VTNAATQVTNAKTQLNGNHNLEVAKQNANTAIDGLTSLNGPQKAKLKEQVGQATTLPNVQTVRDNAQTLNTAMKGLRDSI
ANEATIKAGQNYTDASPNNRSEYDSAVTAAKAIIGQTTSPSMNAQEINQAKDQVTAKQQALNGQENLRTAQTNAKQHLNG
LSDLTNAQKEAAKRQIEGATHVNEVTQAQNNADALNTAMTNLKNGIQDQNTIKQGVNFTDADEAKRNAYTNAVTQAEQIL
NKAQGPNTAKDNVESALQNVQRAKNELNGNQNVANAKTTAKNALNNLTSINNAQKEALKSQIEGATTVAGVNQVSTTASE
LNTAMSNLQRGINDEAATKAAQKYTDADRDKQTAYNDAVTAAKTLLDKTAGTNENKAAVEQALQRVNTAKTALNGDARLN
EAKNTAKQQVATMSHLTDAQKANLTSQIESGTTVAGVQGIQANAGTLDQAMNQLRQSIASKDATKSSEDYQDANADLQNA
YNDAVTNAEGIISATNNPEMNPDTINQKASQVNSAKSALNGDEKLAAAKQTAKSDIGRLTDLNNAQRTAANAEVDQAPNL
AAVTAAKNKATSLNTAMGNLKHALAEKDNTKRSVNYTDADQPKQQAYDTAVTQAEAITNANGSNANETTVQAALNQLNQA
KNDLNGDNKVAQAKESAKRALASYSNLNNAQSTAATSQIDNATTVAGVTAAQNTANELNTAMGQLQNGINDQNTVKQQVN
FTDADQGKKDAYTNAVTNAQGILDKANGQNMTKAQVEAALNQVTTAKNALNGDANVRQAKSDAKANLGTLTHLNNAQKQD
LTSQIEGATTVNGVNSVKTKAQDLDGAMQRLESAIANKDQTKASENYIDADPTKKTAFDNAITQAESYLNKDHGTNKDKQ
AVEQAIQSVTSTENALNGDANLQRAKTEATQAIDNLTHLNTPQKTALKQQVNAAQRVSGVTDLKNSATSLDNAMDQLKQG
IADHDTIVAGGNYTNASPDKQGAYTDAYNAAKNIVNGSPNVITNAADVTAATQRVNNAETSLNGDSNLATAKQQAKDALR
QMTHLSDAQKQSITGQIDSATQVTGVQSVKDNATNLDNAMNQLRNSIANKDEVKASQPYVDADTDKQNAYNTAVTSAENI
INATSQPTLDPSAVTQAANQVNTNKTALNGAQNLANKKQETTANINQLSHLNNAQKQDLNTQVTNAPNISTVNQVKTKAE
QLDQAMERLINGIQDKDQVKQSVNFTDADPEKQTAYNNAVTAAENIINQANGTNANQSQVEAALSTVTTTKQALNGDRKV
TDAKNNANQTLSTLDNLNNAQKGAVTGNINQAHTVAEVTQAIQTAQELNTAMGNLKNSLNDKDTTLGSQNFADADPEKKN
AYNEAVRNAENILNKSTGTNVPKDQVEAAMNQVNTTKAALNGTQNLEKAKQHANTAIDGLSHLTNAQKEALKQLVQQSTT
VAEAQGNEQKANNVDAAMDKLRQSIADNATTKQNQNYTDASPNKKDAYNNAVTTAQGIIDQTTSPTLDPTVINQAAGQVS
TTKNALNSNENLEAAKQQATQSLGSLDNLNNAQKQAVTDQINGAHTVDEANQIKQNAQNLNTAMGNLKQAIADKDATKAT
VNFTDADQAKQQAYNTAVTNAENIISKANGGNATQTEVEQAIQQVNAAKQALNGNANVQHAKDEATALINSSNDLNQAQK
NALKQQVQNATTVAGVNNVKQTAQELNNAMTQLKQGIADKEQTKADGNFVNADPDKQNAYNQAVAKAEALISGTPDVVVT
PSEITAALNKVTQAKNDLNGNTNLATAKQNVQHAIDQLPNLNQAQRDEYNKQITQATHVPNVNAIQQAATTLNDAMTQLK
QGIANKAQIKGSENYHDADTDKQTAYDNAVTKAEELLKQTTNPTMDPNTIQQALTKVNDTNQALNGNQKLADAKQAAKTN
LGTLDHLNDAQKQALTTQVEQAPDIATVNNVKQNAQNLNNAMTNLNNALQDKTETLNSINFTDADQAKKDAYTNAVSHAE
GILSKANGSNASQTEVEQAMQRVNEAKQALNGNDNVQRAKDAAKQVITNANDLNQAQKDALKQQVDAAQTVANVNTIKQT
AQDLNQAMTQLKQGIADKDQTKANGNFVNADTDKQNAYNNAVAHAEQIISGTPNANVDPQQVAQALQQVNQAKGDLNGNH
NLQVAKDNANTAIDQLPNLNQPQKTALKDQVSHAELVTGVNAIKQNADALNNAMGTLKQQIQANSQVPQSVDFTQADQDK
QQAYNNAANQAQQIANGTPTPVLAPDTVTQAVTTMNQAKDALNGDEKLAQAKQDALANLDTLRDLNQPQRDALRNQINQA
QALATVEQTKQNAQNVNTAMGNLKQGIANKDTVKASENYHDADVDKQTAYTNAVSQAEGIINQTTNPTLNPDDITRALTQ
VTDAKNSLNGEAKLATEKQNAKDAVNAMTHLNDAQKQALKGQIDQSPEIATVNQVKQTATSLDHAMDQLSQAINDKAQTL
ADGNYLNADPDKQNAYKQAVAKAEALLNKQSGTNEVQAQVESITNEVNAAKQALNGNDNLANAKQQAKQQLANLTHLNDA
QKQSFESQITQAPLVTDVTTINQKAQTLDHAMELLRNSVADNQTTLASEDYHDATAQRQNDYNQAVTAANNIINQTTSPT
MNPDDVNGATTQVNNTKVALDGDENLAAAKQQANNRLDQLDHLNNAQKQQLQSQITQSSDIAAVNGHKQTAETLNTAMGN
LINAIADHQAVEQRGNFINADTDKQTAYNTAVNEAAAMINKQTGQNANQTEVEQAITKVQTTLQALNGDHNLQVAKTNAT
QAIDALTSLNDPQKTALKDQVTAATLVTAVHQIEQNANTLNQAMHGLRQSIQDNAATKANSKYINEDQPEKQNYDQAVQA
ANNIINEQTATLDNNAINQAAATVNTTKAALHGDVKLQNDKDHAKQTVSQLAHLNNAQKHMEDTLIDSETTRTAVKQDLT
EAQALDQLMDALQQSIADKDATRASSAYVNAEPNKKQAYDEAVQNAESIIAGLNNPTINKGNVSSATQAVTSSKNALDGV
ERLAQDKQTAGNSLNHLDQLTPAQQQALENQINNATTRDKVAEIIAQAQALNEAIKALKESIKDQPQTEASSKFINEDQA
QKDAYTQAVQHAKDLINKTTDPTLAKSIIDQATQAVTDAKNNLHGDQKLAQDKQRATETLNNLSNLNTPQRQALENQINN
AATRVEVAQKLTEAQALNQAMEALRNSIQDQQQTEAGSKFINEDKPQKDAYQAAVQNAKDLINQTNNPTLDKAQVEQLTQ
AVNQAKDNLHGDQKLADDKQHAVTDLNQLNGLNNPQRQALESQINNAATRDEVAQKLAEAKALDQAMQALRNSIQDQQQT
ESGSKFINEDKPQKDAYQAAVQNAKDLINQTGNPTLDKAQVEQLTQAVTTAKDNLHGDQKLARDQQQAVTTVNALPNINH
AQQQALTDAINAAPTRTEVAQHVQTATELDHAMETLKNKVDQVNTDKAQPNYTEASTDKKEAVDQALQAAESITDPTNGS
NANKDAVEQALTKLQEKVNELNGDERVAEAKTQAKQNIDQLTHLNADQIATAKQNIDQATQLQPIAELVDQATQLNQSMD
QLQQAVNDHTNVEQTVDYTQADSDKQKAYKQAIADAENVLKQNANKQQVDQALQNILNAKQALNGDERVALAKTNGKHDI
DQLNALNNAQQDGFKGRIDQSNDLNQIQQIVDEAKALNRAMDQLSEEITGNEGRTKGSTNYVNADTQVKQVYDEAVDKAK
QALDKSTGQNLTAEQVIKLNDAVTAAKQALNGEERLNNRKSEALQRLDQLTHLNNAQRQLAIQQINNAETLNKASRAINR
ATKLDNAMGAVQQYIDEQHLGVISSTNYINADDNLKANYDNAIANAAHELDKVQGNAIAKAEAEQLKQNIIDAQNALNGD
QNLANAKDKANAFVNSLNGLNQQQQDLAHKAINNADTVSDVTDIVNNQIDLNDAMETLKHLVDNEIPNAEQTVNYQNADD
NAKTNFDDAKRLANALLNSDNTNVNDINGAIQAVNDAIHNLNGDQRLQDAKDKAIQSINQALANKLKEIEASNATDQDKL
IAKNKAEELANSIINNINKATSNQDVSQVQTAGNHAIEQVHANEIPKAKIDANKDVDKQVQALIDEIDRNPNLTDKEKQA
LKDRINQILQQGHNDINNALTKEEIEQAKAQLAQALQEIKDLVKAKENAKQDVDKQVQALIDEIDQNPNLTDKEKQALKD
RINQILQQGHNDINNAMTKEEIEQAKAQLAQALQDIKDLVKAKEDAKNAIKALANAKRDQINSNPDLTPEQKAKALKEID
EAEKRALQNVENAQTIDQLNRGLNLGLDDIRNTHVWEVDEQPAVNEIFEATPEQILVNGELIVHRDDIITEQDILAHINL
IDQLSAEVIDTPSTATISDSLTAKVEVTLLDGSKVIVNVPVKVVEKELSVVKQQAIESIENAAQQKINEINNSVTLTLEQ
KEAAIAEVNKLKQQAIDHINNAPDVHSVEEIQQQEQAHIEQFNPEQFTIEQAKSNAIKSIEDAIQHMIDEIKARTDLTDK
EKQEAIAKLNQLKEQAIQAIQRAQSIDEITEQLEQFKAQMKAANPTAKELAKRKQEAISRIKDFSNEKMNSIRNSEIGTA
DEKQAAMNQINEIVLETIRDINNAHTLQQVEAALNNGIARISAVQIVISDRAKQSSSTGNESNSHLTIGYGTANHPFNSS
TIGHKKKIDEDDDIDPLHMRHFSNNFGNVIKNAIGVVGISGLLASFWFFIAKRRRKEDEEEELEIRDNNKDSIKETLDDT
KHLPLLFAKRRRKEDEEDVTVEEKDSLNNGESLDKVKHTPFFLPKRRRKEDEEDVEVTNENTDEKVLKDNEHSPLLFAKR
RKDKEEDVETTTSIESKDEDVPLLLAKKKNQKDNQSKDKKSASKNTSKKVAAKKKKKKSKKNKK
>Q5HPA2 ~~~ebh~~~Extracellular matrix-binding protein ebh~~~COG1196
MSGTLHNTVGSGILPYQQEIRIKLTSNEPIKDSEWSITGYPNTLTLQNAVGRTNNATEKNLALVGHIDPGNYFITVKFGD
KVEQFEIRSKPTPPRIITTANELRGNSNHKPEIRVTDIPNDTTAKIKLVMGGTDGDHDPEINPYTVPENYTVVAEAYHDN
DPSKNGVLTFRSSDYLKDLPLSGELKAIVYYNQYVQSNFSNSVPFSSDTTPPTINEPAGLVHKYYRGDHVEITLPVTDNT
GGSGLRDVNVNLPQGWTKTFTINPNNNTEGTLKLIGNIPSNEAYNTTYHFNITATDNSGNTTNPAKTFILNVGKLADDLN
PVGLSRDQLQLVTDPSSLSNSEREEVKRKISEANANIRSYLLQNNPILAGVNGDVTFYYRDGSVDVIDAENVITYEPERK
SIFSENGNTNKKEAVITIARGQNYTIGPNLRKYFSLSNGSDLPNRDFTSISAIGSLPSSSEISRLNVGNYNYRVNAKNAY
HKTQQELNLKLKIVEVNAPTGNNRVYRVSTYNLTNDEINKIKQAFKAANSGLNLNDNDITVSNNFDHRNVSSVTVTIRKG
DLIKEFSSNLNNMNFLRWVNIRDDYTISWTSSKIQGRNTDGGLEWSPDHKSLIYKYDATLGRQINTNDVLTLLQATAKNS
NLRSNINSNEKQLAERGSNGYSKSIIRDDGEKSYLLNSNPIQVLDLVEPDNGYGGRQVSHSNVIYNEKNSSIVNGQVPEA
NGASAFNIDKVVKANAANNGIMGVIYKAQLYLAPYSPKGYIEKLGQNLSNTNNVINVYFVPSDKVNPSITVGNYDHHTVY
SGETFKNTINVNDNYGLNTVASTSDSAITMTRNNNELVGQAPNVTNSTNKIVKVKATDKSGNESIVSFTVNIKPLNEKYR
ITTSSSNQTPVRISNIQNNANLSIEDQNRVKSSLSMTKILGTRNYVNESNNDVRSQVVSKVNRSGNNATVNVTTTFSDGT
TNTITVPVKHVLLEVVPTTRTTVRGQQFPTGKGTSPNDFFSLRTGGPVDARIVWVNNQGPDINSNQIGRDLTLHAEIFFD
GETTPIRKDTTYKLSQSIPKQIYETTINGRFNSSGDAYPGNFVQAVNQYWPEHMDFRWAQGSGTPSSRNAGSFTKTVTVV
YQNGQTENVNVLFKVKPNKPVIDSNSVISKGQLNGQQILVRNVPQNAQVTLYQSNGTVIPNTNTTIDSNGIATVTIQGTL
PTGNITAKTSMTNNVTYTKQNSSGIASNTTEDISVFSENSDQVNVTAGMQAKNDGIKIIKGTNYNFNDFNSFISNIPAHS
TLTWNEEPNSWKNNIGTTTKTVTVTLPNHQGTRTVDIPITIYPTVTAKNPVRDQKGRNLTNGTDVYNYIIFENNNRLGGT
ASWKDNRQPDKNIAGVQNLIALVNYPGISTPLEVPVKVWVYNFDFTQPIYKIQVGDTFPKGTWAGYYKHLENGEGLPIDG
WKFYWNQQSTGTTSDQWQSLAYTRTPFVKTGTYDVVNPSNWGVWQTSQSAKFIVTNAKPNQPTITQSKTGDVTVTPGAVR
NILISGTNDYIQASADKIVINKNGNKLTTFVKNNDGRWTVETGSPDINGIGPTNNGTAISLSRLAVRPGDSIEAIATEGS
GETISTSATSEIYIVKAPQPEQVATHTYDNGTFDILPDNSRNSLNPTERVEINYTEKLNGNETQKSFTITKNNNGKWTIN
NKPNYVEFNQDNGKVVFSANTIKPNSQITITPKAGQGNTENTNPTVIQAPAQHTLTINEIVKEQGQNVTNDDINNAVQVP
NKNRVAIKQGNALPTNLAGGSTSHIPVVIYYSDGSSEEATETVRTKVNKTELINARRRLDEEISKENKTPSSIRNFDQAM
NRAQSQINTAKSDADQVIGTEFATPQQVNSALSKVQAAQNKINEAKALLQNKADNSQLVRAKEQLQQSIQPAASTDGMTQ
DSTRNYKNKRQAAEQAIQHANSVINNGDATSQQINDAKNTVEQAQRDYVEAKSNLRADKSQLQSAYDTLNRDVLTNDKKP
ASVRRYNEAISNIRKELDTAKADASSTLRNTNPSVEQVRDALNKINTVQPKVNQAIALLQPKENNSELVQAKKRLQDAVN
DIPQTQGMTQQTINNYNDKQREAERALTSAQRVIDNGDATTQEITSEKSKVEQAMQALTNAKSNLRADKNELQTAYNKLI
ENVSTNGKKPASIRQYETAKARIQNQINDAKNEAERILGNDNPQVSQVTQALNKIKAIQPKLTEAINMLQNKENNTELVN
AKNRLENAVNDTDPTHGMTQETINNYNAKKREAQNEIQKANMIINNGDATAQDISSEKSKVEQVLQALQNAKNDLRADKR
ELQTAYNKLIQNVNTNGKKPSSIQNYKSARRNIENQYNTAKNEAHNVLENTNPTVNAVEDALRKINAIQPEVTKAINILQ
DKEDNSELVRAKEKLDQAINSQPSLNGMTQESINNYTTKRREAQNIASSADTIINNGDASIEQITENKIRVEEATNALNE
AKQHLTADTTSLKTEVRKLSRRGDTNNKKPSSVSAYNNTIHSLQSEITQTENRANTIINKPIRSVEEVNNALHEVNQLNQ
RLTDTINLLQPLANKESLKEARNRLESKINETVQTDGMTQQSVENYKQAKIKAQNESSIAQTLINNGDASDQEVSTEIEK
LNQKLSELTNSINHLTVNKEPLETAKNQLQANIDQKPSTDGMTQQSVQSYERKLQEAKDKINSINNVLANNPDVNAIRTN
KVETEQINNELTQAKQGLTVDKQPLINAKTALQQSLDNQPSTTGMTEATIQNYNAKRQKAEQVIQNANKIIENAQPSVQQ
VSDEKSKVEQALSELNNAKSALRADKQELQQAYNQLIQPTDLNNKKPASITAYNQRYQQFSNELNSTKTNTDRILKEQNP
SVADVNNALNKVREVQQKLNEARALLQNKEDNSALVRAKEQLQQAVDQVPSTEGMTQQTKDDYNSKQQAAQQEISKAQQV
IDNGDATTQQISNAKTNVERALEALNNAKTGLRADKEELQNAYNQLTQNIDTSGKTPASIRKYNEAKSRIQTQIDSAKNE
ANSILTNDNPQVSQVTAALNKIKAVQPELDKAIAMLKNKENNNALVQAKQQLQQIVNEVDPTQGMTTDTANNYKSKKREA
EDEIQKAQQIINNGDATEQQITNETNRVNQAINAINKAKNDLRADKSQLENAYNQLIQNVDTNGKKPASIQQYQAARQAI
ETQYNNAKSEAHQILENSNPSVNEVAQALQKVEAVQLKVNDAIHILQNKENNSALVTAKNQLQQSVNDQPLTTGMTQDSI
NNYEAKRNEAQSAIRNAEAVINNGDATAKQISDEKSKVEQALAHLNDAKQQLTADTTELQTAVQQLNRRGDTNNKKPRSI
NAYNKAIQSLETQITSAKDNANAVIQKPIRTVQEVNNALQQVNQLNQQLTEAINQLQPLSNNDALKAARLNLENKINQTV
QTDGMTQQSIEAYQNAKRVAQNESNTALALINNGDADEQQITTETDRVNQQTTNLTQAINGLTVNKEPLETAKTALQNNI
DQVPSTDGMTQQSVANYNQKLQIAKNEINTINNVLANNPDVNAIKTNKAEAERISNDLTQAKNNLQVDTQPLEKIKRQLQ
DEIDQGTNTDGMTQDSVDNYNDSLSAAIIEKGKVNKLLKRNPTVEQVKESVANAQQVIQDLQNARTSLVPDKTQLQEAKN
RLENSINQQTDTDGMTQDSLNNYNDKLAKARQNLEKISKVLGGQPTVAEIRQNTDEANAHKQALDTARSQLTLNREPYIN
HINNESHLNNAQKDNFKAQVNSAPNHNTLETIKNKADTLNQSMTALSESIADYENQKQQENYLDASNNKRQDYDNAVNAA
KGILNQTQSPTMSADVIDQKAEDVKRTKTALDGNQRLEVAKQQALNHLNTLNDLNDAQRQTLTDTINHSPNINSVNQAKE
KANTVNTAMTQLKQTIANYDDELHDGNYINADKDKKDAYNNAVNNAKQLINQSDANQAQLDPAEINKVTQRVNTTKNDLN
GNDKLAEAKRDANTTIDGLTYLNEAQRNKAKENVGKASTKTNITSQLQDYNQLNIAMQALRNSVNDVNNVKANSNYINED
NGPKEAYNQAVTHAQTLINAQSNPEMSRDVVNQKTQAVNTAHQNLHGQQKLEQAQSSANTEIGNLPNLTNTQKAKEKELV
NSKQTRTEVQEQLNQAKSLDSSMGTLKSLVAKQPTVQKTSVYINEDQPEQSAYNDSITMGQTIINKTADPVLDKTLVDNA
ISNISTKENALHGEQKLTTAKTEAINALNTLADLNTPQKEAIKTAINTAHTRTDVTAEQSKANQINSAMHTLRQNISDNE
SVTNESNYINAEPEKQHAFTEALNNAKEIVNEQQATLDANSINQKAQAILTTKNALDGEEQLRRAKENADQEINTLNQLT
DAQRNSEKGLVNSSQTRTEVASQLAKAKELNKVMEQLNHLINGKNQMINSSKFINEDANQQQAYSNAIASAEALKNKSQN
PELDKVTIEQAINNINSAINNLNGEAKLTKAKEDAVASINNLSGLTNEQKPKENQAVNGAQTRDQVANKLRDAEALDQSM
QTLRDLVNNQNAIHSTSNYFNEDSTQKNTYDNAIDNGSTYITGQHNPELNKSTIDQTISRINTAKNDLHGVEKLQRDKGT
ANQEIGQLGYLNDPQKSGEESLVNGSNTRSEVEEHLNEAKSLNNAMKQLRDKVAEKTNVKQSSDYINDSTEHQRGYDQAL
QEAENIINEIGNPTLNKSEIEQKLQQLTDAQNALQGSHLLEEAKNNAITGINKLTALNDAQRQKAIENVQAQQTIPAVNQ
QLTLDREINTAMQALRDKVGQQNNVHQQSNYFNEDEQPKHNYDNSVQAGQTIIDKLQDPIMNKNEIEQAINQINTTQTAL
SGENKLHTDQESTNRQIEGLSSLNTAQINAEKDLVNQAKTRTDVAQKLAAAKEINSAMSNLRDGIQNKEDIKRSSAYINA
DPTKVTAYDQALQNAENIINATPNVELNKATIEQALSRVQQAQQDLDGVQQLANAKQQATQTVNGLNSLNDGQKRELNLL
INSANTRTKVQEELNKATELNHAMEALRNSVQNVDQVKQSSNYVNEDQPEQHNYDNAVNEAQATINNNAQPVLDKLAIER
LTQTVNTTKDALHGAQKLTQDQQAAETGIRGLTSLNEPQKNAEVAKVTAATTRDEVRNIRQEATTLDTAMLGLRKSIKDK
NDTKNSSKYINEDHDQQQAYDNAVNNAQQVIDETQATLSSDTINQLANAVTQAKSNLHGDTKLQHDKDSAKQTIAQLQNL
NSAQKHMEDSLIDNESTRTQVQHDLTEAQALDGLMGALKESIKDYTNIVSNGNYINAEPSKKQAYDAAVQNAQNIINGTN
QPTINKGNVTTATQTVKNTKDALDGDHRLEEAKNNANQTIRNLSNLNNAQKDAEKNLVNSASTLEQVQQNLQTAQQLDNA
MGELRQSIAKKDQVKADSKYLNEDPQIKQNYDDAVQRVETIINETQNPELLKANIDQATQSVQNAEQALHGAEKLNQDKQ
TSSTELDGLTDLTDAQREKLREQINTSNSRDDIKQKIEQAKALNDAMKKLKEQVAQKDGVHANSDYTNEDSAQKDAYNNA
LKQAEDIINNSSNPNLNAQDITNALNNIKQAQDNLHGAQKLQQDKNTTNQAIGNLNHLNQPQKDALIQAINGATSRDQVA
EKLKEAEALDEAMKQLEDQVNQDDQISNSSPFINEDSDKQKTYNDKIQAAKEIINQTSNPTLDKQKIADTLQNIKDAVNN
LHGDQKLAQSKQDANNQLNHLDDLTEEQKNHFKPLINNADTRDEVNKQLEIAKQLNGDMSTLHKVINDKDQIQHLSNYIN
ADNDKKQNYDNAIKEAEDLIHNHPDTLDHKALQDLLNKIDQAHNELNGESRFKQALDNALNDIDSLNSLNVPQRQTVKDN
INHVTTLESLAQELQKAKELNDAMKAMRDSIMNQEQIRKNSNYTNEDLAQQNAYNHAVDKINNIIGEDNATMDPQIIKQA
TQDINTAINGLNGDQKLQDAKTDAKQQITNFTGLTEPQKQALENIINQQTSRANVAKQLSHAKFLNGKMEELKVAVAKAS
LVRQNSNYINEDVSEKEAYEQAIAKGQEIINSENNPTISSTDINRTIQEINDAEQNLHGDNKLRQAQEIAKNEIQNLDGL
NSAQITKLIQDIGRTTTKPAVTQKLEEAKAINQAMQQLKQSIADKDATLNSSNYLNEDSEKKLAYDNAVSQAEQLINQLN
DPTMDISNIQAITQKVIQAKDSLHGANKLAQNQADSNLIINQSTNLNDKQKQALNDLINHAQTKQQVAEIIAQANKLNNE
MGTLKTLVEEQSNVHQQSKYINEDPQVQNIYNDSIQKGREILNGTTDDVLNNNKIADAIQNIHLTKNDLHGDQKLQKAQQ
DATNELNYLTNLNNSQRQSEHDEINSAPSRTEVSNDLNHAKALNEAMRQLENEVALENSVKKLSDFINEDEAAQNEYSNA
LQKAKDIINGVPSSTLDKATIEDALLELQNARESLHGEQKLQEAKNQAVAEIDNLQALNPGQVLAEKTLVNQASTKPEVQ
EALQKAKELNEAMKALKTEINKKEQIKADSRYVNADSGLQANYNSALNYGSQIIATTQPPELNKDVINRATQTIKTAENN
LNGQSKLAEAKSDGNQSIEHLQGLTQSQKDKQHDLINQAQTKQQVDDIVNNSKQLDNSMNQLQQIVNNDNTVKQNSDFIN
EDSSQQDAYNHAIQAAKDLITAHPTIMDKNQIDQAIENIKQALNDLHGSNKLSEDKKEASEQLQNLNSLTNGQKDTILNH
IFSAPTRSQVGEKIASAKQLNNTMKALRDSIADNNEILQSSKYFNEDSEQQNAYNQAVNKAKNIINDQPTPVMANDEIQS
VLNEVKQTKDNLHGDQKLANDKTDAQATLNALNYLNQAQRGNLETKVQNSNSRPEVQKVVQLANQLNDAMKKLDDALTGN
DAIKQTSNYINEDTSQQVNFDEYTDRGKNIVAEQTNPNMSPTNINTIADKITEAKNDLHGVQKLKQAQQQSINTINQMTG
LNQAQKEQLNQEIQQTQTRSEVHQVINKAQALNDSMNTLRQSITDEHEVKQTSNYINETVGNQTAYNNAVDRVKQIINQT
SNPTMNPLEVERATSNVKISKDALHGERELNDNKNSKTFAVNHLDNLNQAQKEALTHEIEQATIVSQVNNIYNKAKALNN
DMKKLKDIVAQQDNVRQSNNYINEDSTPQNMYNDTINHAQSIIDQVANPTMSHDEIENAINNIKHAINALDGEHKLQQAK
ENANLLINSLNDLNAPQRDAINRLVNEAQTREKVAEQLQSAQALNDAMKHLRNSIQNQSSVRQESKYINASDAKKEQYNH
AVREVENIINEQHPTLDKEIIKQLTDGVNQANNDLNGVELLDADKQNAHQSIPTLMHLNQAQQNALNEKINNAVTRTEVA
AIIGQAKLLDHAMENLEESIKDKEQVKQSSNYINEDSDVQETYDNAVDHVTEILNQTVNPTLSIEDIEHAINEVNQAKKQ
LRGKQKLYQTIDLADKELSKLDDLTSQQSSSISNQIYTAKTRTEVAQAIEKAKSLNHAMKALNKVYKNADKVLDSSRFIN
EDQPEKKAYQQAINHVDSIIHRQTNPEMDPTVINSITHELETAQNNLHGDQKLAHAQQDAANVINGLIHLNVAQREVMIN
TNTNATTREKVAKNLDNAQALDKAMETLQQVVAHKNNILNDSKYLNEDSKYQQQYDRVIADAEQLLNQTTNPTLEPYKVD
IVKDNVLANEKILFGAEKLSYDKSNANDEIKHMNYLNNAQKQSIKDMISHAALRTEVKQLLQQAKILDEAMKSLEDKTQV
VITDTTLPNYTEASEDKKEKVDQTVSHAQAIIDKINGSNVSLDQVRQALEQLTQASENLDGDQRVEEAKVHANQTIDQLT
HLNSLQQQTAKESVKNATKLEEIATVSNNAQALNKVMGKLEQFINHADSVENSDNYRQADDDKIIAYDEALEHGQDIQKT
NATQNETKQALQQLIYAETSLNGFERLNHARPRALEYIKSLEKINNAQKSALEDKVTQSHDLLELEHIVNEGTNLNDIMG
ELANAIVNNYAPTKASINYINADNLRKDNFTQAINNARDALNKTQGQNLDFNAIDTFKDDIFKTKDALNGIERLTAAKSK
AEKLIDSLKFINKAQFTHANDEIINTNSIAQLSRIVNQAFDLNDAMKSLRDELNNQAFPVQASSNYINSDEDLKQQFDHA
LSNARKVLAKENGKNLDEKQIQGLKQVIEDTKDALNGIQRLSKAKAKAIQYVQSLSYINDAQRHIAENNIHNSDDLSSLA
NTLSKASDLDNAMKDLRDTIESNSTSVPNSVNYINADKNLQIEFDEALQQASATSSKTSENPATIEEVLGLSQAIYDTKN
ALNGEQRLATEKSKDLKLIKGLKDLNKAQLEDVTNKVNSANTLTELSQLTQSTLELNDKMKLLRDKLKTLVNPVKASLNY
RNADYNLKRQFNKALKEAKGVLNKNSGTNVNINDIQHLLTQIDNAKDQLNGERRLKEHQQKSEVFIIKELDILNNAQKAA
IINQIRASKDIKIINQIVDNAIELNDAMQGLKEHVAQLTATTKDNIEYLNADEDHKLQYDYAINLANNVLDKENGTNKDA
NIIIGMIQNMDDARALLNGIERLKDAQTKAHNDIKDTLKRQLDEIEHANATSNSKAQAKQMVNEEARKALSNINDATSND
LVNQAKDEGQSAIEHIHADELPKAKLDANQMIDQKVEDINHLISQNPNLSNEEKNKLISQINKLVNGIKNEIQQAINKQQ
IENATTKLDEVIETTKKLIIAKAEAKQMIKELSQKKRDAINNNTDLTPSQKAHALADIDKTEKDALQHIENSNSIDDINN
NKEHAFNTLAHIIIWDTDQQPLVFELPELSLQNALVTSEVVVHRDETISLESIIGAMTLTDELKVNIVSLPNTDKVADHL
TAKVKVILADGSYVTVNVPVKVVEKELQIAKKDAIKTIDVLVKQKIKDIDSNNELTSTQREDAKAEIERLKKQAIDKVNH
SKSIKDIETVKRTDFEEIDQFDPKRFTLNKAKKDIITDVNTQIQNGFKEIETIKGLTSNEKTQFDKQLTALQKEFLEKVE
HAHNLVELNQLQQEFNNRYKHILNQAHLLGEKHIAEHKLGYVVVNKTQQILNNQSASYFIKQWALDRIKQIQLETMNSIR
GAHTVQDVHKALLQGIEQILKVNVSIINQSFNDSLHNFNYLHSKFDARLREKDVANHIVQTETFKEVLKGTGVEPGKINK
ETQQPKLHKNDNDSLFKHLVDNFGKTVGVITLTGLLSSFWLVLAKRRKKEEEEKQSIKNHHKDIRLSDTDKIDPIVITKR
KIDKEEQIQNDDKHSIPVAKHKKSKEKQLSEEDIHSIPVVKRKQNSDNKDTKQKKVTSKKKKTPQSTKKVVKTKKRSKK
>B7GPC7 3.2.1.96~~~~~~Endo-beta-N-acetylglucosaminidase~~~
MTFIKQMMPRYVASMTAGIVAAAMAATCAFAPVANADAVSPTQETIQSTGRHFMVYYRAWRDVTMKGVNTDLPDDNWISM
YDIPYGVDVVNIFSYVPSGQEEQAQPFYDKLKSDYAPYLHSRGIKLVRGIDYTGVAVNGFRTFMKEQNKTESEATEADYD
AYAKQVIDKYMISVGLDGLDIDMEAHPNDADVKISDNVIRALSKHIGPKSAKPDTTMFLYDTNGSYLNPFKNVAECFDYV
AYQQYGSSSDRTARAAADYQPYIGNEFVPGLTFPEEGDMNNRWYDATEPYEESHFYQVASYVREHNLGGMFVYALDRDGR
NYDEDLRRIVPSNLLWTKTAIAESEGMALDTAKTAANHYLDRMSLRQVIDDNAASADKARDMVGKAANLYETNKAVLGGD
YGEGFSNTYDPTLEAGLLGIDISVLQQQIDKSSEIIGADTAESDAKTALRMARDAAIDGLTGKIYTADQVSAWSQALKAA
LDATVPVPTPDSTDQNGNRDKVTNHKVQGQPKQLSATGISTDIIVAVGVTLAIAGVALSLSRKLS
>Q2FYF1 ~~~ebpS~~~Elastin-binding protein EbpS~~~COG1388
MSNNFKDDFEKNRQSIDTNSHQDHTEDVEKDQSELEHQDTIENTEQQFPPRNAQRRKRRRDLATNHNKQVHNESQTSEDN
VQNEAGTIDDRQVESSHSTESQEPSHQDSTPQHEEEYYNKNAFAMDKSHPEPIEDNDKHDTIKNAENNTEHSTVSDKSEA
EQSQQPKPYFTTGANQSETSKNEHDNDSVKQDQDEPKEHHNGKKAAAIGAGTAGVAGAAGAMAASKAKKHSNDAQNKSNS
GKANNSTEDKASQDKSKDHHNGKKGAAIGAGTAGLAGGAASKSASAASKPHASNNASQNHDEHDNHDRDKERKKGGMAKV
LLPLIAAVLIIGALAIFGGMALNNHNNGTKENKIANTNKNNADESKDKDTSKDASKDKSKSTDSDKSKEDQDKATKDESD
NDQNNANQANNQAQNNQNQQQANQNQQQQQQRQGGGQRHTVNGQENLYRIAIQYYGSGSPENVEKIRRANGLSGNNIRNG
QQIVIP
>A6QH29 ~~~ebpS~~~Elastin-binding protein EbpS~~~
MSNNFKDDFEKNRQSIDTNSHQDHTEDVEKDQSELEHQDTIENTEQQFPPRNAQRRKRRRDLATNHNKQVHNESQTSEDN
VQNEAGTIDDRQVESSHSTESQEPSHQDSTPQHEEEYYNKNAFAMDKSHPEPIEDNDKHDTIKNAENNTEHSTVSDKSEA
EQSQQPKPYFTTGANQSETSKNEHDNDSVKQDQDEPKEHHNGKKAAAIGAGTAGVAGAAGAMAASKAKKHSNDAQNKSNS
GKANNSTEDKASQDKSKDHHNGKKGAAIGAGTAGLAGGAASKSASAASKPHASNNASQNHDEHDNHDRDKERKKGGMAKV
LLPLIAAVLIIGALAIFGGMALNNHNNGTKENKIANTNKNNADESKDKDTSKDASKDKSKSTDSDKSKEDQDKATKDESD
NDQNNANQANNQAQNNQNQQQANQNQQQQQQRQGGGQRHTVNGQENLYRIAIQYYGSGSPENVEKIRRANGLSGNNIRNG
QQIVIP
>Q7A5I6 ~~~ebpS~~~Elastin-binding protein EbpS~~~
MSNNFKDDFEKNRQSIDTNSHQDHTEDVEKDQSELEHQDTIENTEQQFPPRNAQRRKRRRDLATNHNKQVHNESQTSEDN
VQNEAGTIDDRQVESSHSTESQEPSHQDSTPQHEEEYYNKNAFAMDKSHPEPIEDNDKHETIKDAENNTEHSTVSDKSIA
EQSQQPKPYFATGANQANTSKDKHDDVTVKQDKDESKDHHSGKKGAAIGAGTAGVAGAAGAMGVSKAKKHSNDAQNKSNS
DKSNNSTEDKASQDKSKDHHNGKKGAAIGAGTAGLAGGAASKSASAASKPHASNNASQNHDEHDNHDRDKERKKGGMAKV
LLPLIAAVLIIGALAIFGGMALNNHNNGTKENKIANTNKNNADESKDKDTSKDASKDKSKSTDSDKSKEDQDKATKDESD
NDQNNANQANNQAQNNQNQQQANQNQQQQQQRQGGGQRHTVNGQENLYRIAIQYYGSGSPENVEKIRRANGLSGNNIRNG
QQIVIP
>Q53630 ~~~ebpS~~~Elastin-binding protein EbpS~~~
MSNNFKDDFEKNRQSIDTNSHQDHTEDVEKDQSELEHQDTIENTEQQFPPRNAQRRKRRRDLATNHNKQVHNESQTSEDN
VQNEAGTIDDRQVESSHSTESQEPSHQDSTPQHEEGYYNKNAFAMDKSHPEPIEDNDKHETIKEAENNTEHSTVSDKSEA
EQSQQPKPYFATGANQANTSKDKHDDVTVKQDKDESKDHHSGKKGAAIGAGTAGVAGAAGAMGVSKAKKHSNDAQNKSNS
GKVNNSTEDKASEDKSKEHHNGKKGAAIGAGTAGLAGGAASNSASAASKPHASNNASQNNDEHDHHDRDKERKKGGMAKV
LLPLIAAVLIIGALAIFGGMALNNHNNGTKENKIANTNKNNADESKDKDTSKDASKDKSKSTDSDKSKDDQDKATKDESD
NDQNNANQANNQAQNNQNQQQANQNQQQQQQRQGGGQRHTVNGQENLYRIAIQYYGSGSPENVEKIRRANGLSGNNIRNG
QQIVIP
>P0CW81 ~~~ebrA~~~Multidrug resistance protein EbrA~~~
MLVGYIFLTIAICSESIGAAMLKVSDGFKKWKPSALVVIAYSLAFYMLSLTLNHIPLSLSYATWSGVGTVLTAVIGVKWF
KEELNAKGLIGILLLISGVVLLNWQ
>P0CW83 ~~~ebrB~~~Multidrug resistance protein EbrB~~~
MKGLLYLALAIVSEVFGSTMLKLSEGFTQAWPIGGVIAGFLSAFTFLSFSLKTIDLSSAYATWSGVGTALTAIVGFLLFG
ETISLKGVFGLTLVIAGVVVLNQSKAPAKEKKQTVCE
>P9WNB3 ~~~eccCa1~~~ESX-1 secretion system protein EccCa1~~~COG1674
MTTKKFTPTITRGPRLTPGEISLTPPDDLGIDIPPSGVQKILPYVMGGAMLGMIAIMVAGGTRQLSPYMLMMPLMMIVMM
VGGLAGSTGGGGKKVPEINADRKEYLRYLAGLRTRVTSSATSQVAFFSYHAPHPEDLLSIVGTQRQWSRPANADFYAATR
IGIGDQPAVDRLLKPAVGGELAAASAAPQPFLEPVSHMWVVKFLRTHGLIHDCPKLLQLRTFPTIAIGGDLAGAAGLMTA
MICHLAVFHPPDLLQIRVLTEEPDDPDWSWLKWLPHVQHQTETDAAGSTRLIFTRQEGLSDLAARGPHAPDSLPGGPYVV
VVDLTGGKAGFPPDGRAGVTVITLGNHRGSAYRIRVHEDGTADDRLPNQSFRQVTSVTDRMSPQQASRIARKLAGWSITG
TILDKTSRVQKKVATDWHQLVGAQSVEEITPSRWRMYTDTDRDRLKIPFGHELKTGNVMYLDIKEGAEFGAGPHGMLIGT
TGSGKSEFLRTLILSLVAMTHPDQVNLLLTDFKGGSTFLGMEKLPHTAAVVTNMAEEAELVSRMGEVLTGELDRRQSILR
QAGMKVGAAGALSGVAEYEKYRERGADLPPLPTLFVVVDEFAELLQSHPDFIGLFDRICRVGRSLRVHLLLATQSLQTGG
VRIDKLEPNLTYRIALRTTSSHESKAVIGTPEAQYITNKESGVGFLRVGMEDPVKFSTFYISGPYMPPAAGVETNGEAGG
PGQQTTRQAARIHRFTAAPVLEEAPTP
>P9WNB1 ~~~eccCb1~~~ESX-1 secretion system protein EccCb1~~~COG1674
MTAEPEVRTLREVVLDQLGTAESRAYKMWLPPLTNPVPLNELIARDRRQPLRFALGIMDEPRRHLQDVWGVDVSGAGGNI
GIGGAPQTGKSTLLQTMVMSAAATHSPRNVQFYCIDLGGGGLIYLENLPHVGGVANRSEPDKVNRVVAEMQAVMRQRETT
FKEHRVGSIGMYRQLRDDPSQPVASDPYGDVFLIIDGWPGFVGEFPDLEGQVQDLAAQGLAFGVHVIISTPRWTELKSRV
RDYLGTKIEFRLGDVNETQIDRITREIPANRPGRAVSMEKHHLMIGVPRFDGVHSADNLVEAITAGVTQIASQHTEQAPP
VRVLPERIHLHELDPNPPGPESDYRTRWEIPIGLRETDLTPAHCHMHTNPHLLIFGAAKSGKTTIAHAIARAICARNSPQ
QVRFMLADYRSGLLDAVPDTHLLGAGAINRNSASLDEAVQALAVNLKKRLPPTDLTTAQLRSRSWWSGFDVVLLVDDWHM
IVGAAGGMPPMAPLAPLLPAAADIGLHIIVTCQMSQAYKATMDKFVGAAFGSGAPTMFLSGEKQEFPSSEFKVKRRPPGQ
AFLVSPDGKEVIQAPYIEPPEEVFAAPPSAG
>P9WPH9 ~~~eccA1~~~ESX-1 secretion system protein EccA1~~~COG0464
MTDRLASLFESAVSMLPMSEARSLDLFTEITNYDESACDAWIGRIRCGDTDRVTLFRAWYSRRNFGQLSGSVQISMSTLN
ARIAIGGLYGDITYPVTSPLAITMGFAACEAAQGNYADAMEALEAAPVAGSEHLVAWMKAVVYGAAERWTDVIDQVKSAG
KWPDKFLAGAAGVAHGVAAANLALFTEAERRLTEANDSPAGEACARAIAWYLAMARRSQGNESAAVALLEWLQTTHPEPK
VAAALKDPSYRLKTTTAEQIASRADPWDPGSVVTDNSGRERLLAEAQAELDRQIGLTRVKNQIERYRAATLMARVRAAKG
MKVAQPSKHMIFTGPPGTGKTTIARVVANILAGLGVIAEPKLVETSRKDFVAEYEGQSAVKTAKTIDQALGGVLFIDEAY
ALVQERDGRTDPFGQEALDTLLARMENDRDRLVVIIAGYSSDIDRLLETNEGLRSRFATRIEFDTYSPEELLEIANVIAA
ADDSALTAEAAENFLQAAKQLEQRMLRGRRALDVAGNGRYARQLVEASEQCRDMRLAQVLDIDTLDEDRLREINGSDMAE
AIAAVHAHLNMRE
>P9WPH7 ~~~eccA2~~~ESX-2 secretion system protein EccA2~~~COG0464
MSRMVDTMGDLLTARRHFDRAMTIKNGQGCVAALPEFVAATEADPSMADAWLGRIACGDRDLASLKQLNAHSEWLHRETT
RIGRTLAAEVQLGPSIGITVTDASQVGLALSSALTIAGEYAKADALLANRELLDSWRNYQWHQLARAFLMYVTQRWPDVL
STAAEDLPPQAIVMPAVTASICALAAHAAAHLGQGRVALDWLDRVDVIGHSRSSERFGADVLTAAIGPADIPLLVADLAY
VRGMVYRQLHEEDKAQIWLSKATINGVLTDAAKEALADPNLRLIVTDERTIASRSDRWDASTAKSRDQLDDDNAAQRRGE
LLAEGRELLAKQVGLAAVKQAVSALEDQLEVRMMRLEHGLPVEGQTNHMLLVGPPGTGKTTTAEALGKIYAGMGIVRHPE
IREVRRSDFCGHYIGESGPKTNELIEKSLGRIIFMDEFYSLIERHQDGTPDMIGMEAVNQLLVQLETHRFDFCFIGAGYE
DQVDEFLTVNPGLAGRFNRKLRFESYSPVEIVEIGHRYATPRASQLDDAAREVFLDAVTTIRNYTTPSGQHGIDAMQNGR
FARNVIERAEGFRDTRVVAQKRAGQPVSVQDLQIITATDIDAAIRSVCSDNRDMAAIVW
>A0QQ38 ~~~eccA3~~~ESX-3 secretion system protein EccA3~~~COG0464
MGSDTLAAPPHGAPRVDRDVVSRFATCCRALGLTVNDRQRPADLTAARAGFAGLTHLAHDQCDAWIGLAAAGEVTPAVVD
AVWRTVASAGVLQREIGLAAGELGFTYDTGWYLQFRATEPDDFQLAYAARLYEAGEFGEADGLVGEILARRPGWFDARWL
QVAINHRAQRWSDVVRLLTPVVTLPSLDDVTSHAVRTALGISLARLGMFAPAMSYLEDPAGPIEVAAVDGALAKALTLRA
QGEDDEATEVLQDLFATHPENTQVEQALLDTSFGLVTTTSARIEARSDPWDPETEPSEAEFVDPGAKDRKAHLLLEAEAE
LAEFIGLEEVKFQVARLKSSVAMAIRRQERGLAVAQRTNHLVFAGPPGTGKTTIARVVAKIYCGLGLLKKETVREVHRAD
LIGQHIGETEAKTNAIIDSALDGVLFLDEAYALVSTGAKNDFGLVAIDTLLARMENDRDRLVVIVAGYRKDLDAFLDTNE
GLRSRFTRSIDFPSYTAPELVEIAVRMAEKRDSVFEKAAHDDMERLFTHLAQATTPDANGVERRSLDIAGNARFVRNLVE
RSEEEREYRLDHSDQEDFTDEEMMTITAGDVQRSAAPLLRGLGLSVPA
>P9WPI3 ~~~eccA3~~~ESX-3 secretion system protein EccA3~~~COG0464
MAGVGEGDSGGVERDDIGMVAASPVASRVNGKVDADVVGRFATCCRALGIAVYQRKRPPDLAAARSGFAALTRVAHDQCD
AWTGLAAAGDQSIGVLEAASRTATTAGVLQRQVELADNALGFLYDTGLYLRFRATGPDDFHLAYAAALASTGGPEEFAKA
NHVVSGITERRAGWRAARWLAVVINYRAERWSDVVKLLTPMVNDPDLDEAFSHAAKITLGTALARLGMFAPALSYLEEPD
GPVAVAAVDGALAKALVLRAHVDEESASEVLQDLYAAHPENEQVEQALSDTSFGIVTTTAGRIEARTDPWDPATEPGAED
FVDPAAHERKAALLHEAELQLAEFIGLDEVKRQVSRLKSSVAMELVRKQRGLTVAQRTHHLVFAGPPGTGKTTIARVVAK
IYCGLGLLKRENIREVHRADLIGQHIGETEAKTNAIIDSALDGVLFLDEAYALVATGAKNDFGLVAIDTLLARMENDRDR
LVVIIAGYRADLDKFLDTNEGLRSRFTRNIDFPSYTSHELVEIAHKMAEQRDSVFEQSALHDLEALFAKLAAESTPDTNG
ISRRSLDIAGNGRFVRNIVERSEEEREFRLDHSEHAGSGEFSDEELMTITADDVGRSVEPLLRGLGLSVRA
>P9WPI1 ~~~eccA5~~~ESX-5 secretion system protein EccA5~~~COG0457
MTRPQAAAEDARNAMVAGLLASGISVNGLQPSHNPQVAAQMFTTATRLDPKMCDAWLARLLAGDQSIEVLAGAWAAVRTF
GWETRRLGVTDLQFRPEVSDGLFLRLAITSVDSLACAYAAVLAEAKRYQEAAELLDATDPRHPFDAELVSYVRGVLYFRT
KRWPDVLAQFPEATQWRHPELKAAGAAMATTALASLGVFEEAFRRAQEAIEGDRVPGAANIALYTQGMCLRHVGREEEAV
ELLRRVYSRDAKFTPAREALDNPNFRLILTDPETIEARTDPWDPDSAPTRAQTEAARHAEMAAKYLAEGDAELNAMLGME
QAKKEIKLIKSTTKVNLARAKMGLPVPVTSRHTLLLGPPGTGKTSVARAFTKQLCGLTVLRKPLVVETSRTKLLGRYMAD
AEKNTEEMLEGALGGAVFFDEMHTLHEKGYSQGDPYGNAIINTLLLYMENHRDELVVFGAGYAKAMEKMLEVNQGLRRRF
STVIEFFSYTPQELIALTQLMGRENEDVITEEESQVLLPSYTKFYMEQSYSEDGDLIRGIDLLGNAGFVRNVVEKARDHR
SFRLDDEDLDAVLASDLTEFSEDQLRRFKELTREDLAEGLRAAVAEKKTK
>A0QNJ0 ~~~eccB1~~~ESX-1 secretion system ATPase EccB1~~~COG3266
MAGFRLTTKVQVSGWRFLLRRVEHAIVRRDTRMFDDPLQFYSRAVFAGVVVSVLICLGAALMAYFKPLGKQGSDQLLVDR
TTNQLYVMLPGSNQLRPVYNLTSARLVLGNASNPVAVKSEELNRISKGQSIGIPGAPYATPTGTPASQWTLCDTVAKPDS
SAPKVETSILIRTLAIDSGVGPIRADQGMLVSYEGANWLITEGGRHSIDLADRAVTSAVGIPVTAKPTPISQGLFNALPN
RGPWQLPQIPAAGAPNSVGLPENLVIGSVFRTATESDPQHYVVLPDGVARVNNTTAAALRATNSYGLMQPPAVEASVVAK
IPEQVYVSPLPDQPLDVLLRQDSPVLCWSWQREPGDQAPKTTVIAGRRLPLPANAIGTGIDQIGGDSTVYIEGGQFVRLQ
SPDPRVGESMYYIDPQGVRYGIANDDAAKNLGLAGPVNAPWQVVGLLVDGPVLSKEAALIEHDTLPADPNPRKVASGEG
>P9WNR7 3.6.-.-~~~eccB1~~~ESX-1 secretion system ATPase EccB1~~~COG3266
MGLRLTTKVQVSGWRFLLRRLEHAIVRRDTRMFDDPLQFYSRSIALGIVVAVLILAGAALLAYFKPQGKLGGTSLFTDRA
TNQLYVLLSGQLHPVYNLTSARLVLGNPANPATVKSSELSKLPMGQTVGIPGAPYATPVSAGSTSIWTLCDTVARADSTS
PVVQTAVIAMPLEIDASIDPLQSHEAVLVSYQGETWIVTTKGRHAIDLTDRALTSSMGIPVTARPTPISEGMFNALPDMG
PWQLPPIPAAGAPNSLGLPDDLVIGSVFQIHTDKGPQYYVVLPDGIAQVNATTAAALRATQAHGLVAPPAMVPSLVVRIA
ERVYPSPLPDEPLKIVSRPQDPALCWSWQRSAGDQSPQSTVLSGRHLPISPSAMNMGIKQIHGTATVYLDGGKFVALQSP
DPRYTESMYYIDPQGVRYGVPNAETAKSLGLSSPQNAPWEIVRLLVDGPVLSKDAALLEHDTLPADPSPRKVPAGASGAP
>P9WNR5 3.6.-.-~~~eccB2~~~ESX-2 secretion system ATPase EccB2~~~COG3266
MPLSLSNRDQNSGHLFYNRRLRAATTRFSVRMKHDDRKQTAALALSMVLVAIAAGWMMLLNVLKPTGIVGDSAIIGDRDS
GALYARIDGRLYPALNLTSARLATGTAGQPTWVKPAEIAKYPTGPLVGIPGAPAAMPVNRGAVSAWAVCDTAGRPRSADK
PVVTSIAGPITGGGRATHLRDDAGLLVTFDGSTYVIWGGKRSQIDPTNRAVTLSLGLDPGVTSPIQISRALFDGLPATEP
LRVPAVPEAGTPSTWVPGARVGSVLQAQTAGGGSQFYVLLPDGVQKISSFVADLLRSANSYGAAAPRVVTPDVLVHTPQV
TSLPVEYYPAGRLNFVDTAADPTTCVSWEKASTDPQARVAVYNGRGLPVPPSMDSRIVRLVRDDRAPASVVATQVLVLPG
AANFVTSTSGVITAESRESLFWVSGNGVRFGIANDEATLRALGLDPGAAVQAPWPLLRTFAAGPALSRDAALLARDTVPT
LGQVAIVTTTAKAGA
>A0QQ39 3.6.-.-~~~eccB3~~~ESX-3 secretion system ATPase EccB3~~~COG3266
MTGPVNPDDRRSFSSRTPVNENPDGVQYRRGFVTRHQVSGWRFVMRRIASGVALHDTRMLVDPLRTQSRAVLTGALILVT
GLVGCFIFSLFRPGGVPGNNAILADRSTSALYVRVGEQLHPVLNLTSARLISGSPDNPTMVKTSEIDKFPRGNLLGIPGA
PERMVQNAATDAEWTVCDAVGGANPGVTVIAGPLGADGERAAPLPPDHAVLVHSDAEPNPGDWLLWDGKRSPIDLADRAV
TDALGLGGQALAPRPIAAGLFNAVPAAPALTAPVIPDAGAAPQFELSLPVPVGAVVVAYDADNTARYYAVLSDGLQPISP
VLAAILRNTDSHGFAQPPRLGPDEVARTPMSRGLDTSAYPDNPVTLVEASAHPVTCAHWTKPSDAAESSLSVLSGAVLPL
AEGLHTVDLVGAGAGGAANRVALTPGTGYFVQTVGAEPGSPTAGSMFWVSDTGVRYGIDTAEDDKVVAALGLSTSPLPVP
WSVLSQFAAGPALSRGDALVAHDAVSTNPNSARMEASR
>P9WNR3 3.6.-.-~~~eccB3~~~ESX-3 secretion system ATPase EccB3~~~COG3266
MTNQQHDHDFDHDRRSFASRTPVNNNPDKVVYRRGFVTRHQVTGWRFVMRRIAAGIALHDTRMLVDPLRTQSRAVLMGVL
IVITGLIGSFVFSLIRPNGQAGSNAVLADRSTAALYVRVGEQLHPVLNLTSARLIVGRPVSPTTVKSTELDQFPRGNLIG
IPGAPERMVQNTSTDANWTVCDGLNAPSRGGADGVGVTVIAGPLEDTGARAAALGPGQAVLVDSGAGTWLLWDGKRSPID
LADHAVTSGLGLGADVPAPRIIASGLFNAIPEAPPLTAPIIPDAGNPASFGVPAPIGAVVSSYALKDSGKTISDTVQYYA
VLPDGLQQISPVLAAILRNNNSYGLQQPPRLGADEVAKLPVSRVLDTRRYPSEPVSLVDVTRDPVTCAYWSKPVGAATSS
LTLLAGSALPVPDAVHTVELVGAGNGGVATRVALAAGTGYFTQTVGGGPDAPGAGSLFWVSDTGVRYGIDNEPQGVAGGG
KAVEALGLNPPPVPIPWSVLSLFVPGPTLSRADALLAHDTLVPDSRPARPVSAEGGYR
>B2HST3 3.6.-.-~~~eccB5~~~ESX-5 secretion system ATPase EccB5~~~COG3266
MAEQGRGQRGSGYGLGLSTRTQVTGYQFLARRTAMALTRWRVRMEIEPGRRQTLAVVASVSAALVICLGSLLWSFISPSG
QINDSPIIADRDSGALYVRVGDRLYPALNLASARLITGRASNPHLVRGSQIDSMPHGPLVGIPGAPSDFNPASPATSSWL
VCDTVAGPSTMPQSPHGVTVTVIDGTPDLTGHRRVLKGSDAVVLRYGGDAWVIREGRRSRIDATDRSVLLPLGLTPEQVT
QARPMSHALYDALPVGPELVVPEVPDDGAAATFPGAPGPVGTVIVTPQISGPQQYSLVLTDGVQTLPPLVAQILQNAGRP
GNTKPVTVQPSSLAKMPVVNRLDLSAYPDDPLNVMDIRDNPSTCWWWERTAGENRSRVQVISGPNIPVAPHEMNKVVDLV
KADMSGREADQVYFGPNFANFVAVTGNNPAAQTTESLWWLTEAGARFGVEDTKEAREALGLGLTPSPAPWVVLRLLPQGP
TLSRADALVEHDTLPMDMSPAELVVPK
>P9WNQ9 3.6.-.-~~~eccB5~~~ESX-5 secretion system ATPase EccB5~~~COG3266
MAEESRGQRGSGYGLGLSTRTQVTGYQFLARRTAMALTRWRVRMEIEPGRRQTLAVVASVSAALVICLGALLWSFISPSG
QLNESPIIADRDSGALYVRVGDRLYPALNLASARLITGRPDNPHLVRSSQIATMPRGPLVGIPGAPSSFSPKSPPASSWL
VCDTVATSSSIGSLQGVTVTVIDGTPDLTGHRQILSGSDAVVLRYGGDAWVIREGRRSRIEPTNRAVLLPLGLTPEQVSQ
ARPMSRALFDALPVGPELLVPEVPNAGGPATFPGAPGPIGTVIVTPQISGPQQYSLVLGDGVQTLPPLVAQILQNAGSAG
NTKPLTVEPSTLAKMPVVNRLDLSAYPDNPLEVVDIREHPSTCWWWERTAGENRARVRVVSGPTIPVAATEMNKVVSLVK
ADTSGRQADQVYFGPDHANFVAVTGNNPGAQTSESLWWVTDAGARFGVEDSKEARDALGLTLTPSLAPWVALRLLPQGPT
LSRADALVEHDTLPMDMTPAELVVPK
>A0QQ40 ~~~eccC3~~~ESX-3 secretion system protein EccC3~~~COG1674
MSRLIFEHQRRLTPPTTRKGTITIEPPPQLPRVVPPSLLRRVLPFLIVILIVGMIVALFATGMRLISPTMLFFPFVLLLA
ATALYRGGDNKMRTEEVDAERADYLRYLSVVRDNVRAHAAEQRAALEWSHPEPEVLATIPGTRRQWERDPRDRDFLVLRA
GRHDVPLDAALKVKDTADEIDLEPVAHSALRGLLDVQRTVRDAPTGLDVAKLARITVIGEADEARAAIRAWIAQAVTWHD
PTMLGVALAAPDLESGDWSWLKWLPHVDVPNEADGVGPARYLTTSTAELRERLAPALADRPLFPAESGAALKHLLVVLDD
PDADPDDIARKPGLTGVTVIHRTTELPNREQYPDPERPILRVADGRIERWQVGGWQPCVDVADAMSAAEAAHIARRLSRW
DSNPGYIRSTSTGSATFTTLLGIPDASALDVASLWAPRPRDEELRVPIGVTSTGEPLYFDLKDEAEGGMGPHGLMIGMTG
SGKSQTLMSILLSLLTTHPADRLIVIYADFKGEAGADIFRHFPQVVAVISNMAEKRSLADRFADTLRGEVARREQILKEA
GRRVQGSAFNSVAEYESAIAAGHDLPPMPTLFVVADEFTLMLAEHPEYADLFDYVARKGRSFRIHLLFASQTLDVGRIKD
IDKNTSYRIGLKVASPSISRQIIGVEDAYHIESGREHKGEGFLVPAPGAVPIKFRSTYVDGIYDPPRAEKSIVVHALPQP
QVFTAGRVEPEPDTVIATGDVEVHTAPPRKLIATIGDQLAAYGPKAPQLWLPPLDEPIALADVLAGADVEPGQLRWPLGE
IDKPFEMRRDVLVYDAHTAAANVLIHGGPRSGKSTALQAFVLSAAALHSPRAITFYCLDYGGGKLADLADLAHVGSVATP
LEPERIRRTFGELEQLLRARQRQGAVNRTGSYTDGYGEVFLVIDNLYAFSRDNTDTFNTRNPLLAKVTELANSGLAYGIH
VVITTPNWLEVPLAMRDGLGLRLELKLHDSHDSIVRVAGALRRPADSVPADQPGRGLTMAAEHFLFAEPALSDIAVINAR
YPGVSAPPVRLLPTDLSPDALAPLYPAPETVVIGQREEDLAPVAVDFANHPLLMVFGDSKSGKTTLLRHIIRTVRENSTP
DQVAFTVIDRRLHLVDEPLFPDNEYTANIDRVLPAMLGLSALIEKRRPPAGLSAQELSRWTYTGHTHYLIVDDVDQIPDT
PAVSGPFVGQRPWTNIVGLLAEAADLGLRVIVTARATGSAHAVMTAPLLRRLNDLQATTLMLSGNPTDSGKIRGHRFARF
PAGRGLLLTDTDTPDHIQLVNPLGDAALSGNIGNNGNHNRGGEYR
>P9WNA9 ~~~eccC3~~~ESX-3 secretion system protein EccC3~~~COG1672
MSRLIFEARRRLAPPSSHQGTIIIEAPPELPRVIPPSLLRRALPYLIGILIVGMIVALVATGMRVISPQTLFFPFVLLLA
ATALYRGNDKKMRTEEVDAERADYLRYLSVVRDNIRAQAAEQRASALWSHPDPTALASVPGSRRQWERDPHDPDFLVLRA
GRHTVPLATTLRVNDTADEIDLEPVSHSALRSLLDTQRSIGDVPTGIDLTKVSPITVLGERAQVRAVLRAWIAQAVTWHD
PTVLGVALAARDLEGRDWNWLKWLPHVDIPGRLDALGPARNLSTDPDELIALLGPVLADRPAFTGQPTDALRHLLIVVDD
PDYDLGASPLAVGRAGVTVVHCSASAPHREQYSDPEKPILRVAHGAIERWQTGGWQPYIDAADQFSADEAAHLARRLSRW
DSNPTHAGLRSAATRGASFTTLLGIEDASRLDVPALWAPRRRDEELRVPIGVTGTGEPLMFDLKDEAEGGMGPHGLMIGM
TGSGKSQTLMSILLSLLTTHSAERLIVIYADFKGEAGADSFRDFPQVVAVISNMAEKKSLADRFADTLRGEVARREMLLR
EAGRKVQGSAFNSVLEYENAIAAGHSLPPIPTLFVVADEFTLMLADHPEYAELFDYVARKGRSFRIHILFASQTLDVGKI
KDIDKNTAYRIGLKVASPSVSRQIIGVEDAYHIESGKEHKGVGFLVPAPGATPIRFRSTYVDGIYEPPQTAKAVVVQSVP
EPKLFTAAAVEPDPGTVIADTDEQEPADPPRKLIATIGEQLARYGPRAPQLWLPPLDETIPLSAALARAGVGPRQWRWPL
GEIDRPFEMRRDPLVFDARSSAGNMVIHGGPKSGKSTALQTFILSAASLHSPHEVSFYCLDYGGGQLRALQDLAHVGSVA
SALEPERIRRTFGELEQLLLSRQQREVFRDRGANGSTPDDGFGEVFLVIDNLYGFGRDNTDQFNTRNPLLARVTELVNVG
LAYGIHVIITTPSWLEVPLAMRDGLGLRLELRLHDARDSNVRVVGALRRPADAVPHDQPGRGLTMAAEHFLFAAPELDAQ
TNPVAAINARYPGMAAPPVRLLPTNLAPHAVGELYRGPDQLVIGQREEDLAPVILDLAANPLLMVFGDARSGKTTLLRHI
IRTVREHSTADRVAFTVLDRRLHLVDEPLFPDNEYTANIDRIIPAMLGLANLIEARRPPAGMSAAELSRWTFAGHTHYLI
IDDVDQVPDSPAMTGPYIGQRPWTPLIGLLAQAGDLGLRVIVTGRATGSAHLLMTSPLLRRFNDLQATTLMLAGNPADSG
KIRGERFARLPAGRAILLTDSDSPTYVQLINPLVDAAAVSGETQQKGSQS
>B2HST4 ~~~eccC5~~~ESX-5 secretion system protein EccC5~~~COG1674
MKRGFARPTPEKAPVIKPENIVLPTPLSIPPPEGKPWWLIVVGVVVVGLLGGMVAMVFASGSHVFGGVGSIFPIFMMVGI
MMMMFRSVGAGGQQQMSRPKLDAMRAQFMLMLDMLRETAQESADSMDSNYRWFHPAPSTLAAAVGSPRMWERKPDGKDLN
FGVVRVGVGMTRPEVTWGEPQNMPTDIELEPVTGKALQEFGRYQSVVYNLPKMISLLVEPWYALVGEREQALGLMRAIIC
QLTFSHGPDHVQFIVVSSDLAEWEWVKWLPHFGDSRRYDAAGNARMVYSSVREFAAEQGELFAGRGSFTPRHASSSAQTP
TPHTVIICDVDDPQWEYVISAEGVDGVTFFDLTGSPMWTNVPERKLEFDKTGVIEALPRDRDTWMVIDDNAWFFALTDHV
SIAEAEEFGQKLAQWRLAEAYEEIGQRVAHIGARDILAYYGIDDPGNIDFDYLWGSRTDSMGRSRLRAPFGNRSDNGELL
FLDMKSLDEGGDGPHGVMSGTTGSGKSTLVRTVIESLMLGHPPEELQFVLADLKGGSAVKPFAGVPHVSRIITDLEEDQA
LMERFLDALWGEIARRKAICDSAGVDDAKEYNSVRGRMRARGQDMAPLPMLVVVIDEFYEWFRIMPTAVDVLDSIGRQGR
AYWIHLMMASQTIESRAEKLMENMGYRLVLKARTAGAAQAAGVPNAVNLPAQAGLGYFRKSLEDIIRFQAEFLWRDYFQP
GITVDGEEAPVLVHSIDYIRPQLFTNSFTPLEVTVGGPEIDKVVAHANGEVVEEVEAEAEEEGIRVPKVGTVIIDQLRRI
NFEPYRLWQPPLTQPVAIDDLVNRFLGHPWQKEYGSARNLVFPIGVIDRPFKHDQPPWTVDTSGPGSNVLILGAGGSGKT
TALQTLISSAALTHTPDQVQFYCLAYSSTALTTVSKLPHVGEVAGPTDPYGVRRTVAELLALVRERKRSFLEYGIASMEM
FRRRKFGGEAGPVPNDGFGDVYLVIDNYRALAEENEVLIEQVNLIINQGPSFGVHVVVTADRESELRPPVRSGFGSRVEL
RLAAVEDAKLVRSRFAKDVPVKPGRGMVAVNYVRLDSDPQAGLHTLVARPAMGSTPTNVFECDSVVAAVSRLTTSQAPPV
RRLPASFGVDQVRQLAARDTRQGVGVGGIAWAISELDLQPVYLNFAENSHLMVTGRRECGRTTTLATIMSEIGRLYAPGA
TSVPAPPPGQPSAQVWLIDPRRQLLTALGSNYVERFAYNLDGVQAMMGELAAVLAGREPPPGLSAEELLSRSWWSGPEIF
LIVDDIQQLPPGFDSPLHKAAPWVNRAADVGLHVIVTRSFGGWSSAGSDPMLRALHQANAPLLVMDADPDEGFIRGKMKG
GPLPRGRGLLMAEDTGVFVQVALTEVRK
>P9WNA5 ~~~eccC5~~~ESX-5 secretion system protein EccC5~~~COG1674
MKRGFARPTPEKPPVIKPENIVLSTPLSIPPPEGKPWWLIVVGVVVVGLLGGMVAMVFASGSHVFGGIGSIFPLFMMVGI
MMMMFRGMGGGQQQMSRPKLDAMRAQFMLMLDMLRETAQESADSMDANYRWFHPAPNTLAAAVGSPRMWERKPDGKDLNF
GVVRVGVGMTRPEVTWGEPQNMPTDIELEPVTGKALQEFGRYQSVVYNLPKMVSLLVEPWYALVGEREQVLGLMRAIICQ
LAFSHGPDHVQMIVVSSDLDQWDWVKWLPHFGDSRRHDAAGNARMVYTSVREFAAEQAELFAGRGSFTPRHASSSAQTPT
PHTVIIADVDDPQWEYVISAEGVDGVTFFDLTGSSMWTDIPERKLQFDKTGVIEALPRDRDTWMVIDDKAWFFALTDQVS
IAEAEEFAQKLAQWRLAEAYEEIGQRVAHIGARDILSYYGIDDPGNIDFDSLWASRTDTMGRSRLRAPFGNRSDNGELLF
LDMKSLDEGGDGPHGVMSGTTGSGKSTLVRTVIESLMLSHPPEELQFVLADLKGGSAVKPFAGVPHVSRIITDLEEDQAL
MERFLDALWGEIARRKAICDSAGVDDAKEYNSVRARMRARGQDMAPLPMLVVVIDEFYEWFRIMPTAVDVLDSIGRQGRA
YWIHLMMASQTIESRAEKLMENMGYRLVLKARTAGAAQAAGVPNAVNLPAQAGLGYFRKSLEDIIRFQAEFLWRDYFQPG
VSIDGEEAPALVHSIDYIRPQLFTNSFTPLEVSVGGPDIEPVVAQPNGEVLESDDIEGGEDEDEEGVRTPKVGTVIIDQL
RKIKFEPYRLWQPPLTQPVAIDDLVNRFLGRPWHKEYGSACNLVFPIGIIDRPYKHDQPPWTVDTSGPGANVLILGAGGS
GKTTALQTLICSAALTHTPQQVQFYCLAYSSTALTTVSRIPHVGEVAGPTDPYGVRRTVAELLALVRERKRSFLECGIAS
MEMFRRRKFGGEAGPVPDDGFGDVYLVIDNYRALAEENEVLIEQVNVIINQGPSFGVHVVVTADRESELRPPVRSGFGSR
IELRLAAVEDAKLVRSRFAKDVPVKPGRGMVAVNYVRLDSDPQAGLHTLVARPALGSTPDNVFECDSVVAAVSRLTSAQA
PPVRRLPARFGVEQVRELASRDTRQGVGAGGIAWAISELDLAPVYLNFAENSHLMVTGRRECGRTTTLATIMSEIGRLYA
PGASSAPPPAPGRPSAQVWLVDPRRQLLTALGSDYVERFAYNLDGVVAMMGELAAALAGREPPPGLSAEELLSRSWWSGP
EIFLIVDDIQQLPPGFDSPLHKAVPFVNRAADVGLHVIVTRTFGGWSSAGSDPMLRALHQANAPLLVMDADPDEGFIRGK
MKGGPLPRGRGLLMAEDTGVFVQVAATEVRR
>A4IKE7 ~~~eccC~~~ESX secretion system protein EccC~~~COG1674
MSQLWVLYETYCQLFSLTNEEKVIVIGNQLEHHVTVSSFSFRNGYIQIEKKSDGSTLAVLQGGRQIGELKPRCSITIDVD
GQQMTIAWSGEEQRKYVYYVGQQSEVLVSNDPQADIETTNARFSLRKHRGQWVVIPDDDAPLFLNGVQLSDAVSLRNGDV
LLCPYMQFVFIEEDLLAVTSSEEVVSSLTETMPPLSEMKKKYPMYRRTPRMIYELPSDKVSISFPSQEGDGDPRGLWLMV
LPPVMMLLVIGAVALIQPRGVFIMISIAMFATTIVTSTAQYMREKKARQMRKEKRRRIYTNYLEQKREELQALSEKQRNV
LYYHFPSFEQMKSFVMQVNSRIWERTAESADFLHVRIGTADVPATYEVSVSMGDLANREIDDLLEQAQHIAKVYQTVKHV
PLPIDVSHGAIGMVGKRSIVNGEIEQLVGQIAFFHSYHDVRFVAIFSEDDYKHWEWMKWLPHFQLPNSFAKGLIYNEQTR
DQLLSSIYEMLRERALDEEKDKKRFSPHFVFIVADRSLIAEHVILEYLEEKNEDIGISVIFASETKESLTENVHTLVQYI
NEREGEIVIQHRKAAHIPFQLDEHSTEGNESFARMLRSLNHQKGMSNSIPEKVTFLEMMQTRRANELQIVQNWLSCQTSR
SLAVPIGLKGRNDVVELNLHEKAHGPHGLVAGTTGSGKSELLQTYILSLAVHFHPHEVAFLIIDYKGGGMAQPFKNMPHL
LGTITNIHGSKNFSARALASINSELKKRQRLFDRYEVNHINDYMELYKQGKAEQPLPHLFLIADEFAELKSEEPDFIREL
VSAARIGRSLGVHLILATQKPRGVIDEQIWSNARFRISLKMQDVNDSKEILRNGDAAAITVPGRAYLQVGNNEVYELFQS
AWSGAPYVEEGVEAEDEIHIVTDLGLVPVSNVATDRKRSRQKPKTEIEMVVEQIIETQKQLNIEKLPSPWLPPLPPRLAR
PASVTAEANAFPIGLKDEPELQSQSDYFYQWLEDGNIGIFGSAGYGKSTTMMTLLLSFAGAYNPAQLHYYIFDFGNSALL
PLRQLPHTADYFRLDDEKKIEKFIKFMKEEMEQRKQRFMEKEVSTIKLYNALSEEKLPIIIVALDNFDVVKEEMPDFETQ
LIQYARDGQSLGIFFIMTATRVSGIRPPLMNNLKTKIVHYFIDSSEKFSLIGRTPYDVDPIPGRALIKKDNAALTQIYLP
ADGEDDIEVLENVKREMERLKEVYQHIPKPKPIPMLPPRLSMSVFTNTYVQHRASGFIPVGLDEQTVRPVAINMRTDPHC
LIVGQSRKGKTNVVKVILESLLVQEPESIGLLDGIDRGLAGYANRDDITYIEAKERLAQWLNEADAVLQQREREYIQAVN
ENRATTLAWPPVVFVVDSLLRLQQETDSIMQGRIANMMKQYSHLGFHVFVAGNANEFVKGFDALTAELKQIRQAILVTKK
SEQSLFALPFTRNEQEIEPGFGYFVVGGKDQKIQIPKVE
>D1A4G7 ~~~eccC~~~ESX secretion system protein EccC~~~COG1674
MSTVLVRRKERRQPPQMPRGEILLESPPELPEVVTNSFQNVLMYLPMAAGSAAMVFTFLNHRNTLQLVAGGMFALSMFGM
MFGQLSQQSGERKTKLNSARRDYLRYLGQVRQRVRKAAKQQREALEWNNPAPGRLWSMVMSPRLWERRSSDADFAQVRIG
AGPQRLAVQLIPPETKPVEDLEPMSAGALRRFLRAHSTVPDLPVAISLRSFARILPDGDPKAVYGMVRALIMQLAAFHSP
DDVRITVCASRERMPQWQWMKWLPHSLHPTEYDAAGQVRLLTHSLVELESMLGPEIKDRGMFGASRAPAEPFHLVIVDGG
QASYDSQIASDGIDGVCVIDLTGSVAETNEATMLRLRVTPERVYVVKRDRAGKEVLSSVGRPDQASIAEAEALARQLAPF
RTSAADEPEEDVLSANMTLTSLLHIDNPYNLDPAVLWRPRPQRNRLRVPIGLDADGRPLELDIKESAQGGMGPHGLCIGA
TGSGKSELLRTLVLALAMTHSPEVLNFVLVDFKGGATFLGMEGLRHVSAIITNLEEELPLVDRMYDALHGEMVRRQEHLR
HSGNYASLRDYEKARMEGAPLPPMPTLFIVLDEFSELLSAKPDFAELFVMIGRLGRSLGVHLLLASQRLEEGKLRGLDTH
LSYRIGLRTFSAMESRVVLGVPDAYELPPSPGNGYLKFATEPLVRFKAAYVSGPVDEEPQTRSEGPQIVRQVLPYLTDYI
RPQVVEQPQPEQRAEENKSSESLFDVVVRQLAGHGPEPHQIWLPPLDVPPTLDELLPPLSPSAAHGYTADGWEWRGRLHA
VVGLVDRPFDQRRDPYWLDLSGGAGHVGVAGGPQTGKSTMLRTLITSLALLHTPQEVQFYCLDFGGGTLAGLAELPHVGS
VATRLDADRIRRTVAEVSALLEQREQEFTERGIDSMATYRRLRATGEYAGDGFGDVFLVVDNWLTLRQDYEALEDSITQL
AARGLGYGIHVVLSSNKWSEFRTSIRDLLGTKLELRLGDPYESEVDRKKAANVPENRPGRGLTRDGYHFLTALPRIDGDT
SAETLTEGIATTVKTIREAWHGPTAPPVRMLPNVLPAAQLPSAAESGTRIPIGIDEDSLSPVYLDFNTDPHFLVFGDTEC
GKSNLLRLITAGIIERYTPQQARLIFIDYSRSLLDVATTEHQIGYAASSTAASSLVRDIKGAMEARLPPPDLTPEQLRSR
SWWTGAELFLVVDDYEMVATSDNPLRPLAELLPQARDIGLHLIIARSMGGAGRALYEPIIQRIKEMASPGLVMSGNKDEG
ILLGNVKPHKLPQGRGYFVERRSGTRLIQTAYRES
>P9WNQ7 ~~~eccD1~~~ESX-1 secretion system protein EccD1~~~
MSAPAVAAGPTAAGATAARPATTRVTILTGRRMTDLVLPAAVPMETYIDDTVAVLSEVLEDTPADVLGGFDFTAQGVWAF
ARPGSPPLKLDQSLDDAGVVDGSLLTLVSVSRTERYRPLVEDVIDAIAVLDESPEFDRTALNRFVGAAIPLLTAPVIGMA
MRAWWETGRSLWWPLAIGILGIAVLVGSFVANRFYQSGHLAECLLVTTYLLIATAAALAVPLPRGVNSLGAPQVAGAATA
VLFLTLMTRGGPRKRHELASFAVITAIAVIAAAAAFGYGYQDWVPAGGIAFGLFIVTNAAKLTVAVARIALPPIPVPGET
VDNEELLDPVATPEATSEETPTWQAIIASVPASAVRLTERSKLAKQLLIGYVTSGTLILAAGAIAVVVRGHFFVHSLVVA
GLITTVCGFRSRLYAERWCAWALLAATVAIPTGLTAKLIIWYPHYAWLLLSVYLTVALVALVVVGSMAHVRRVSPVVKRT
LELIDGAMIAAIIPMLLWITGVYDTVRNIRF
>P9WNQ5 ~~~eccD2~~~ESX-2 secretion system protein eccD2~~~
MTAPHKVAFPARCAVNICYDKHLCSQVFPAGIPVEGFFEGMVELFDADLKRKGFDGVALPAGSYELHKINGVRLDINKSL
DELGVQDGDTLVLVPRVAGESFEPQYESLSTGLAAMGKWLGRDGGDRMFAPVTSLTAAHTAMAIIAMAVGVVLALTLRTR
TITDSPVPAAMAGGIGVLLVIGALVVWWGWRERRDLFSGFGWLAVVLLAVAAACAPPGALGAAHALIGLVVVVLGAITIG
VATRKRWQTAVVTAVVTVCGILAAVAAVRMFRPVSMQVLAICVLVGLLVLIRMTPTVALWVARVRPPHFGSITGRDLFAR
RAGMPVDTVAPVSEADADDEDNELTDITARGTAIAASARLVNAVQVGMCVGVSLVLPAAVWGVLTPRQPWAWLALLVAGL
TVGLFITQGRGFAAKYQAVALVCGASAAVCAGVLKYALDTPKGVQTGLLWPAIFVAAFAALGLAVALVVPATRFRPIIRL
TVEWLEVLAMIALLPAAAALGGLFAWLRH
>A0QQ46 ~~~eccD3~~~ESX-3 secretion system protein EccD3~~~
MSENTVMPIVRVAVLAAGDDGGRLTEMALPSELPLREILPAVQRIVQPARENDGAADPAAAPNPVRLSLAPIGGAPFSLD
ATLDTVGVVDGDLLALQAVPSGPPAPRIVEDIADAAVIFSEARRRQWGPTHIARGAALALIGLILVGTGLSVAHRVITGD
LLGQFIVSGIALATVIAALAVRNRSAVLATSLAVTALVPVAAAFALGVPGDFGAPNVLLAAAGVAAWSLISMAGSPDDRG
IAVFTATAVTGVGVLLVAGAASLWVISSDVIGCALVLLGLIVTVQAAQLSAMWARFPLPVIPAPGDPTPAARPLSVLADL
PRRVRVSQAHQTGVIAAGVLLGVAGSVALVSSANASPWAWYIVVAAAAGAALRARVWDSAACKAWLLGHSYLLAVALLVA
FVIGDRYQAALWALAALAVLVLVWIVAALNPKIASPDTYSLPMRRMVGFLATGLDASLIPVMALLVGLFSLVLDR
>P9WNQ3 ~~~eccD3~~~ESX-3 secretion system protein EccD3~~~
MSGTVMQIVRVAILADSRLTEMALPAELPLREILPAVQRLVVPSAQNGDGGQADSGAAVQLSLAPVGGQPFSLDASLDTV
GVVDGDLLVLQPVPAGPAAPGIVEDIADAAMIFSTSRLKPWGIAHIQRGALAAVIAVALLATGLTVTYRVATGVLAGLLA
VAGIAVASALAGLLITIRSPRSGIALSIAALVPIGAALALAVPGKFGPAQVLLGAAGVAAWSLIALMIPSAERERVVAFF
TAAAVVGASVALAAGAQLLWQLPLLSIGCGLIVAALLVTIQAAQLSALWARFPLPVIPAPGDPTPSAPPLRLLEDLPRRV
RVSDAHQSGFIAAAVLLSVLGSVAIAVRPEALSVVGWYLVAATAAAATLRARVWDSAACKAWLLAQPYLVAGVLLVFYTA
TGRYVAAFGAVLVLAVLMLAWVVVALNPGIASPESYSLPLRRLLGLVAAGLDVSLIPVMAYLVGLFAWVLNR
>B2HSU6 ~~~eccD5~~~ESX-5 secretion system protein EccD5~~~
MTAVADAPQAELEGVSSPRAVVVGIMAGEGVQIGVLLDANAPVSVMTDPLLKVVNSRLRELGESTLEAAGRGRWALCLID
GSPLRATQSLTEQDVYDGDRLWIRFIPDTEHRSQVIEHISTAVASNLSKRFASIDPVVAVQVGAGMVGTGVILASGVLGW
WRWHHNTWLTTIFASVIAVLVLMVAMMLLMRATTDADRRVADIMLVSGLAPLTVAAASAPPGSVGSPQAVLGFGVLSIAA
ALALRFTGRRLAIYTAIVTICGLTTLASLSRMVAATSAVTLFATMLLICVVMYHASPALSRRLSGIRLPVFPSATSRWVF
EARPDLPTTVAVAAGGPPVLEGPASVRDVVLQAERARSFLSGLLVGLGVLMVVSLTSLCNPHTSERWLPLMLAGFTSGFL
MLRGRSYVDRWQSITLAVTAVIVVAAVSVRYALVLSSPLSVSIVASLLVLLPAAGMTAAAVVPNTIYSPLFRKFVEWTEY
LCLMPIFPLAFWLMNVYAAIRYR
>P9WNP9 ~~~eccD5~~~ESX-5 secretion system protein EccD5~~~
MTAVADAPQADIEGVASPQAVVVGVMAGEGVQIGVLLDANAPVSVMTDPLLKVVNSRLRELGEAPLEATGRGRWALCLVD
GAPLRATQSLTEQDVYDGDRLWIRFIADTERRSQVIEHISTAVASDLSKRFARIDPIVAVQVGASMVATGVVLATGVLGW
WRWHHNTWLTTIYTAVIGVLVLAVAMLLLMRAKTDADRRVADIMLMSAIMPVTVAAAAAPPGPVGSPQAVLGFGVLTVAA
ALALRFTGRRLGIYTTIVIIGALTMLAALARMVAATSAVTLLSSLLLICVVAYHAAPALSRRLAGIRLPVFPSATSRWVF
EARPDLPTTVVVSGGSAPVLEGPSSVRDVLLQAERARSFLSGLLTGLGVMVVVCMTSLCDPHTGQRWLPLILAGFTSGFL
LLRGRSYVDRWQSITLAGTAVIIAAAVCVRYALELSSPLAVSIVAAILVLLPAAGMAAAAHVPHTIYSPLFRKFVEWIEY
LCLMPIFPLALWLMNVYAAIRYR
>P9WJE9 ~~~eccE1~~~ESX-1 secretion system protein EccE1~~~
MRNPLGLRFSTGHALLASALAPPCIIAFLETRYWWAGIALASLGVIVATVTFYGRRITGWVAAVYAWLRRRRRPPDSSSE
PVVGATVKPGDHVAVRWQGEFLVAVIELIPRPFTPTVIVDGQAHTDDMLDTGLVEELLSVHCPDLEADIVSAGYRVGNTA
APDVVSLYQQVIGTDPAPANRRTWIVLRADPERTRKSAQRRDEGVAGLARYLVASATRIADRLASHGVDAVCGRSFDDYD
HATDIGFVREKWSMIKGRDAYTAAYAAPGGPDVWWSARADHTITRVRVAPGMAPQSTVLLTTADKPKTPRGFARLFGGQR
PALQGQHLVANRHCQLPIGSAGVLVGETVNRCPVYMPFDDVDIALNLGDAQTFTQFVVRAAAAGAMVTVGPQFEEFARLI
GAHIGQEVKVAWPNATTYLGPHPGIDRVILRHNVIGTPRHRQLPIRRVSPPEESRYQMALPK
>P9WJE7 ~~~eccE2~~~ESX-2 secretion system protein EccE2~~~
MTSKLTGFSPRSARRVAGVWTVFVLASAGWALGGQLGAVMAVVVGVALVFVQWWGQPAWSWAVLGLRGRRPVKWNDPITL
ANNRSGGGVRVQDGVAVVAVQLLGRAHRATTVTGSVTVESDNVIDVVELAPLLRHPLDLELDSISVVTFGSRTGTVGDYP
RVYDAEIGTPPYAGRRETWLIMRLPVIGNTQALRWRTSVGAAAISVAQRVASSLRCQGLRAKLATATDLAELDRRLGSDA
VAGSAQRWKAIRGEAGWMTTYAYPAEAISSRVLSQAWTLRADEVIQNVTVYPDATCTATITVRTPTPAPTPPSVILRRLN
GEQAAAAAANMCGPRPHLRGQRRCPLPAQLVTEIGPSGVLIGKLSNGDRLMIPVTDAGELSRVFVAADDTIAKRIVIRVV
GAGERVCVHTRDQERWASVRMPQLSIVGTPRPAPRTTVGVVEYVRRRKNGDDGKSEGSGVDVAISPTPRPASVITIARPG
TSLSESDRHGFEVTIEQIDRATVKVGAAGQNWLVEMEMFRAENRYVSLEPVTMSIGR
>A0QQ48 ~~~eccE3~~~ESX-3 secretion system protein EccE3~~~
MTARIALASLFVVAAVLAQPWQTTTQRWVLGVSIAAVIVLLAWWKGMFLTTRIGRALAMVRRNRAEDTVETDAHRATVVL
RVDPAAPAQLPVVVGYLDRYGITCDKVRITHRDAGGTRRSWISLTVDAVDNLAALQARSARIPLQDTTEVVGRRLADHLR
EQGWTVTVVEGVDTPLPVSGKETWRGVADDAGVVAAYRVKVDDRLDEVLAEIGHLPAEETWTALEFTGSPAEPLLTVCAA
VRTSDRPAAKAPLAGLTPARGRHRPALAALNPLSTERLDGTAVPLPAVVRTSVKGSVEHEAAQEAGHPA
>P9WJE5 ~~~eccE3~~~ESX-3 secretion system protein EccE3~~~
MNPIPSWPGRGRVTLVLLAVVPVALAYPWQSTRDYVLLGVAAAVVIGLFGFWRGLYFTTIARRGLAILRRRRRIAEPATC
TRTTVLVWVGPPASDTNVLPLTLIARYLDRYGIRADTIRITSRVTASGDCRTWVGLTVVADDNLAALQARSARIPLQETA
QVAARRLADHLREIGWEAGTAAPDEIPALVAADSRETWRGMRHTDSDYVAAYRVSANAELPDTLPAIRSRPAQETWIALE
IAYAAGSSTRYTVAAACALRTDWRPGGTAPVAGLLPQHGNHVPALTALDPRSTRRLDGHTDAPADLLTRLHWPTPTAGAH
RAPLTNAVSRT
>B2HSU8 ~~~eccE5~~~ESX-5 secretion system protein EccE5~~~
MKAQRRFGLALSWARLTTVFVIDLLILIVASHCPDSWQGENRIAWWVGVGIAVLVTLLSVVTYRGITVTSGITAWLWDWS
ADPGTALGAGCTPAVDHQRRFGRDTVGVREHHGRLVTVITVDDGEGDAAGRHRHRTTQSAVVPVATVAENLRQFDVQLDG
VDIVTVEVRGGAEAARASASLDEWGPEEWGMVGESPAANRRRTWLILRMNPQRNVAAIASRDSLASTLVTATERLAQVLD
GQSCAARPLAADELAEVDSAILAELEPTWSRPGWRHLKHFNGYATSFWVTPADINAETLDEVWLSDAPEVGATVLTLRLV
MRAGEPRLSAWVRYHSDERLPKELSVGLNRLTGRQLAAVRASLPVPSTRAQLVVSSRELLDHDELELPVGQTQEHATSAT
TGQ
>P9WJE3 ~~~eccE5~~~ESX-5 secretion system protein EccE5~~~
MKAQRSFGLALSWPRVTAVFLVDVLILAVASHCPDSWQADHHVAWWVGVGVAAVVTLLSVVSYHGITVISGLATWVRDWS
ADPGTTLGAGCTPAIDHQRRFGRDTVGVREYNGRLVSVIEVTCGESGPSGRHWHRKSPVPMLPVVAVADGLRQFDIHLDG
IDIVSVLVRGGVDAAKASASLQEWEPQGWKSEERAGDRTVADRRRTWLVLRMNPQRNVAAVACRDSLASTLVAATERLVQ
DLDGQSCAARPVTADELTEVDSAVLADLEPTWSRPGWRHLKHFNGYATSFWVTPSDITSETLDELCLPDSPEVGTTVVTV
RLTTRVGSPALSAWVRYHSDTRLPKEVAAGLNRLTGRQLAAVRASLPAPTHRPLLVIPSRNLRDHDELVLPVGQELEHAT
SSFVGQ
>Q5KSN5 5.5.1.13~~~ent-cdps~~~Ent-copalyl diphosphate synthase~~~
MNVTSFAALRAAAQDIVDEMIADPYGLTSPSVYETARMVVSAPWLEGHRQRVEFLLAQQHEDGTWGGPAAYGLLPTLSAV
DALLSVAGTQDARRVAGAVESGLAALAGRFPRNVELPDTIAVELLVPWLIEQVDQRLSRMDDRGDLPGRLDLQADTGTLS
GIRELLRQNTGIPEKTWHSLEALGAPAVRSGTVTPMGGAVGASPAATSAWLGDPPHTDAAKACLAYLHQTQARHGGPVSG
ITSISYFELAWVVTALSGSGLDVDIPAQVPDILRTALGANGLSAGPGLPADSDDTSAALHALDLLGKPESVDCLWEYDTG
LYFTCFPKERTPSTSTNAHILVALADRRGQGDTRYDHAAERVGGWLVEQQQPDGRWMDKWHASPYYATACGAAAMARLDG
PRTSAALDDAIRWVLDTQHADGSWGRWEGTGEETAYALQVLNHRAAPDRPALEAIRAGRAFLSGHVEDDRRNPPLWHDKD
LYTPVRVIRAEILGTLAATQRLAEAEKEARA
>Q1GBJ0 7.-.-.-~~~ecfA1~~~Energy-coupling factor transporter ATP-binding protein EcfA1~~~COG1122
MSDNIISFDHVTFTYPDSPRPALSDLSFAIERGSWTALIGHNGSGKSTVSKLINGLLAPDDLDKSSITVDGVKLGADTVW
EVREKVGIVFQNPDNQFVGATVSDDVAFGLENRAVPRPEMLKIVAQAVADVGMADYADSEPSNLSGGQKQRVAIAGILAV
KPQVIILDESTSMLDPEGKEQILDLVRKIKEDNNLTVISITHDLEEAAGADQVLVLDDGQLLDQGKPEEIFPKVEMLKRI
GLDIPFVYRLKQLLKERGIVLPDEIDDDEKLVQSLWQLNSKM
>A2RI01 7.-.-.-~~~ecfA1~~~Energy-coupling factor transporter ATP-binding protein EcfA1~~~COG1122
MNKILEVENLVFKYEKESDVNQLNGVSFSITKGEWVSIIGQNGSGKSTTARLIDGLFEEFEGIVKIDGERLTAENVWNLR
RKIGMVFQNPDNQFVGATVEDDVAFGMENQGIPREEMIKRVDEALLAVNMLDFKTREPARLSGGQKQRVAVAGIIALRPE
IIILDESTSMLDPTGRSEIMRVIHEIKDKYHLTVLSITHDLDEAASSDRILVMRAGEIIKEAAPSELFATSEDMVEIGLD
VPFSSNLMKDLRTNGFDLPEKYLSEDELVELLADKLG
>Q035B2 7.-.-.-~~~ecfA1~~~Energy-coupling factor transporter ATP-binding protein EcfA1~~~
MGNVIRVQHLNYTYPEAKQQALTDVSFDVAKGEWLAIIGHNGSGKSTLAKNLNGLLAPESGTVQVAGMTLSEETVWDIRA
KVGIVFQNPDNQFVGATVADDVAFGLENRGVPRPEMIKRVDEALDRVGMTAFADREPARLSGGQKQRVAIAGIVAQRPEI
IILDESTSMLDPAGRQEVLGVIRELKDELGLTVLSITHDIDEAAEAHRIILLNDGKINEIGTPSEIFSHGMELLRLGLDV
PYSEKLKDALAQRGIAMPKDYMDNERLVDYLWTLHSTM
>Q03ZL6 7.-.-.-~~~ecfA1~~~Energy-coupling factor transporter ATP-binding protein EcfA1~~~COG1122
MVKAIKIDNLKYSYDERSLFSDFNLDIDAGQWVALVGHNGSGKSTLAKLILGLLVAEQGDIDVFDERLTVETVHHVRSKI
GMVFQNPDNQFVGATVADDVAFGLENIQVESSEMPQKIDNALTIVGMQEFKNREPHTLSGGQKQRVALASVLALQPKIII
LDEATAMLDPDGRATVMETLQKLKKQFGKELTLVTITHDMDEATLADRVVVINDGQKILDGTPAEVFSQRKALHENGLEL
PFANELAFHLNEKPNKYMDERELIQWLSTLNK
>Q03PY5 7.-.-.-~~~ecfA1~~~Energy-coupling factor transporter ATP-binding protein EcfA1~~~COG1122
MTENIISVDHLTYQYDENQAPALTDVSFTVHAGEWLAIVGHNGSGKSTLAKSLDGLLPFTQGSVTVGGITLTPETVWQVR
EQIGMIFQNPDNQFVGATVEDDVAFGLENRQISRDEMVPRVQAALAQVGMTSFAQREPSSLSGGQKQRVALAGIVAIAPK
ILILDEATSMLDPQGRIEMLAIVRQLRQQQNLTVISITHDIDEAASADRVLVIDDGRLVDEAVPSQIFERGTQLVEMGLD
LPFTEKLKAALRQRGITPPTTYQTAAEMEEWLWQSLSNT
>Q5M243 7.-.-.-~~~ecfA1~~~Energy-coupling factor transporter ATP-binding protein EcfA1~~~COG1122
MIEIKNLKFKYNQDQTSYTLNDVSFHVKHGEWLSIVGHNGSGKSTTARLIGGLLVADSGQIIVDGQELTEETVWDIRDKI
GMVFQNPDNQFVGATVEDDVAFGLENKGLPYKEMVSRVQEALSFVGMMDFKDREPARLSGGQKQRVAIAGIIAMRPSILI
LDEATSMLDPEGRQELIQYIEDIRQQYGMTVLSITHDLDEVAMSNRVLVLKQGKVESISSPRELFSRGSELVDLGLDIPF
SALLTQKLKNQGLIDCEGYLTEKELVEQLWEYLSKM
>Q9X1Z1 7.-.-.-~~~ecfA1~~~Energy-coupling factor transporter ATP-binding protein EcfA1~~~COG1122
MKITLNSVSFRYNGDYVLKDVNAEFETGKIYVVVGKNGSGKTTLLKILAGLLEAEGEIFLDGSPADPFLLRKNVGYVFQN
PSSQIIGATVEEDVAFSLEIMGLDESEMRKRIKKVLELVGLSGLEKEDPLNLSGGQKQRLAIASMLARDTRFLALDEPVS
MLDPPSQREIFQVLESLKNEGKGIILVTHELEYLDDMDFILHISNGTIDFCGSWEEFVEREFDDVEIPFKWKLWKKCGKI
NLWEDRYENSGNQRRRDTV
>Q8R7Y5 3.6.3.-~~~ecfA2~~~Energy-coupling factor transporter ATP-binding protein EcfA2~~~COG1122
MPIKVENVSFIYNEGTPYATVALKDINFSIDDEEFVGIIGHTGSGKSTLIQQLNGLLKPSKGKIYINGIDITDKKVSLKD
IRKQVGLVFQYPEYQLFEETVFKDIAFGPSNLGLSEEEVKERVYEAMEIVGISKELADKSPFELSGGQKRRVAIAGILAM
RPKILILDEPTAGLDPKGKQEILNKIKEIHDKYKMITILVSHNMEDIARIADKIIVMNRGKIELIGTPREVFREAERLEK
IGLSVPQITSLARELRKRGVPIPPDVLTIEEAKEHILRYLRGTKNV
>Q1GBI9 3.6.3.-~~~ecfA2~~~Energy-coupling factor transporter ATP-binding protein EcfA2~~~COG1122
MAIKFENVSYVYSPGSPLEAIGLDQLNFSLEEGKFIALVGHTGSGKSTLMQHFNALLKPTSGKIEIAGYTITPETGNKGL
KDLRRKVSLAFQFSEAQLFENTVLKDVEYGPRNFGFSEDEAREAALKWLKKVGLKDDLIEHSPFDLSGGQMRRVALAGVL
AYEPEIICLDEPAAGLDPMGRLEMMQLFKDYQAAGHTVILVTHNMDDVADYADDVLALEHGRLIKHASPKEVFKDSEWLQ
KHHLAEPRSARFAAKLEAAGLKLPGQPLTMPELADAIKQSLKGGEHE
>A2RI02 3.6.3.-~~~ecfA2~~~Energy-coupling factor transporter ATP-binding protein EcfA2~~~COG1122
MIKFEKVNYTYQPNSPFASRALFDIDLEVKKGSYTALIGHTGSGKSTLLQHLNGLLQPTEGKVTVGDIVVSSTSKQKEIK
PVRKKVGVVFQFPESQLFEETVLKDVAFGPQNFGIPKEKAEKIAAEKLEMVGLADEFWEKSPFELSGGQMRRVAIAGILA
MEPEVLVLDEPTAGLDPKARIEMMQLFESIHQSGQTVVLVTHLMDDVADYADYVYLLEKGHIISCGTPSDVFQEVDFLKA
HELGVPKATHFADQLQKTGAVAFEKLPITRAELVTLLTSLSVNSGGEN
>Q035B3 3.6.3.-~~~ecfA2~~~Energy-coupling factor transporter ATP-binding protein EcfA2~~~
MDITFDHVSFTYQAGTPFAGDGIKDVSGVIRDGSYTAIIGHTGSGKSTILQHLNALLKPTSGTVTIGDKVITNETNNKNL
KPLRQKVGMVFQFAENQLFEQTVAKDIAFGPQNFGVSEKDALALADKMVKMVGLPHDVLEKSPFDLSGGQMRRVAIAGVL
AMQPEVLVLDEPTAGLDPSGRHEMMQMFEQLHREQGQTIVLVTHQMDDVADYADTVWVMAEGKLIKTGTPREIFADPAWL
KANQLGLPKTAQLAQQLAAKGFHFDPQPLTESELADQLVPQIGGGQRG
>Q03ZL5 3.6.3.-~~~ecfA2~~~Energy-coupling factor transporter ATP-binding protein EcfA2~~~COG1122
MAINFEQVNFSYGAGTTLAQPILHDINVTIPDGQVTAIIGQTGSGKSTFIQHLNGLLKPTTGRVVIDDFVLTSDLKEKNL
TSLRARVGMVFQFPENQLFANTVLEDVMYAPINFGYAKADAEFAAKTALKQVNVSEELWDKSPFELSGGQMRRVAMAGTL
ASNPDIIVLDEPAAGLDPKGQKELLAIVRGLKEAGKLVVFISHQMDHVIAVADHVIVMHDGGVVAEGTPVEIFNKDLVWF
KTVALDLPKAGQFAEQLRQKGHILRHRPLLLTELATMLNEEKRHE
>Q03PY6 3.6.3.-~~~ecfA2~~~Energy-coupling factor transporter ATP-binding protein EcfA2~~~COG1122
MAIAFEHVTYTYQAGTPMAHTALTDVSLTVPDRGYLAIIGHTGSGKSTLIQQLNALLKPTSGTIKIDEFTITPETTNAAL
KPLRQHVGMVFQFPENQLFEETVRQDIAFGPKNFGMADADALALADEMLTTVGLDQSYAERSPFELSGGQMRRVAIAGVL
AMQPKVLVLDEPTAGLDPQGRQEMMRLFARLHQEQGLTIVLVTHQMEDVAQYAEQVAVMHEGRLMKFGTPADVFSNREWL
QDHQLDVPQAAQFARRLRDRGLTFPKQPLTADQLADYLAQQWAQRGADHV
>Q7A471 3.6.3.-~~~ecfA2~~~Energy-coupling factor transporter ATP-binding protein EcfA2~~~
MTIRFDNVSYTYQKGTPYQHQAIHDVNTEFEQGKYYAIVGQTGSGKSTLIQNINALLKPTTGTVTVDDITITHKTKDKYI
RPVRKRIGMVFQFPESQLFEDTVEREMIFGPKNFKMNLDEAKNYAHRLLMDLGFSRDVMSQSPFQMSGGQMRKIAIVSIL
AMNPDIIVVDEPTAGLDPQSKRQVMRLLKSLQTDENKAIILISHDMNEVARYADEVIVMKEGSIVSQTSPKELFKDKKKL
ADWHIGLPEIVQLQYDFEQKYQTKLKDIALTEEAFVSLYKEWQHEK
>Q5M244 3.6.3.-~~~ecfA2~~~Energy-coupling factor transporter ATP-binding protein EcfA2~~~COG1122
MGISLENVSYTYQSGTPFERRALFDMTVTIKDGSYTAFIGHTGSGKSTIMQLLNGLYLPTSGQVKVDDTIINSQSKNKEI
KPIRKKVGLVFQFPESQLFAETVLEDIAFGPQNFGVSKEEAEQRALESLRLVGLSDELRDQNPFDLSGGQMRRVAIAGIL
AMQPDILVLDEPTAGLDPQGRKELMSLFKQLHLSGITIVLVTHLMDDVADYATAVNVMEKGRLVLSGTPKDVFQKVAFLK
EKQLGVPKITEFALQLQEKGYSFESLPITIEEFVEVLVHG
>Q9WY65 7.-.-.-~~~ecfA2~~~Energy-coupling factor transporter ATP-binding protein EcfA2~~~COG1122
MRIEVVNVSHIFHRGTPLEKKALENVSLVINEGECLLVAGNTGSGKSTLLQIVAGLIEPTSGDVLYDGERKKGYEIRRNI
GIAFQYPEDQFFAERVFDEVAFAVKNFYPDRDPVPLVKKAMEFVGLDFDSFKDRVPFFLSGGEKRRVAIASVIVHEPDIL
ILDEPLVGLDREGKTDLLRIVEKWKTLGKTVILISHDIETVINHVDRVVVLEKGKKVFDGTRMEFLEKYDPRFFTSKMLV
MRRLVLKGEDPFSMSDDELLERVCNS
>Q0TUN8 3.6.3.-~~~ecfA3~~~Energy-coupling factor transporter ATP-binding protein EcfA3~~~COG1122
MEDYILKVEELNYNYSDGTHALKGINMNIKRGEVTAILGGNGVGKSTLFQNFNGILKPSSGRILFDNKPIDYSRKGIMKL
RESIGIVFQDPDNQLFSASVYQDVSFGAVNMKLPEDEIRKRVDNALKRTGIEHLKDKPTHCLSFGQKKRVAIAGVLVMEP
KVLILDEPTAGLDPMGVSEIMKLLVEMQKELGITIIIATHDIDIVPLYCDNVFVMKEGRVILQGNPKEVFAEKEVIRKVN
LRLPRIGHLMEILKEKDGFVFDELDLTIGQARKTINSWKNKIFND
>B3Q6P8 ~~~ecfG~~~ECF RNA polymerase sigma factor EcfG~~~
MPLTDSLRDDILAAVPSLRAFAISLSGNADRADDLVQETLLRALANIDSFQPGSNLPAWLFTILRNLFRSDYRKRRREVE
DADGSYAKTLKSQPGQTAHLEFEEFRAALDKLPQDQREALILVGASGFSYEDAAAICGCAVGTIKSRVNRARSKLSALLY
VDGAEDFGPDDTVRAVIGGNG
>P70972 ~~~ecfT~~~Energy-coupling factor transporter transmembrane protein EcfT~~~COG0619
MMDSMIIGKYVPGTSLVHRLDPRTKLITIFLFVCIVFLANNVQTYALLGLFTIGVVSLTRVPFSFLMKGLKPIIWIVLFT
FLLHILMTHEGPIIFQIGFFKVYEGGLVQGIFISLRFVYLILITTLLTLTTTPIEITDGMEQLLNPLKKLKLPVHELALM
MSISLRFIPTLMEETDKIMKAQMARGVDFTSGPVKERVKAIVPLLVPLFVSAFKRAEELAVAMEARGYQGGEGRTKYRKL
VWTGKDTSVIVSLIVLAALLFFLRA
>A2RI03 ~~~ecfT~~~Energy-coupling factor transporter transmembrane protein EcfT~~~COG0619
MQNMLMGRYIPGDSIIHRMDPRSKLLVMIAFVVIIFLAHDWLGYLLLVLYTLAGVLLSKISVSYFLRGLRPMIGLILFTV
IFQMLFTNGQHVIFSLWFIKISTESLINAVYIFFRFVLIIFMSTILTLTTPPLTLADGIEKGLGPLKKIKVPVHELGLML
SISLRFIPTLMDDTTMIMNAQKARGMDFGEGNLLKKIKSVIPILIPLFVSSFRRADDLAVAMESRGYQGGDGRTKYRQLK
WQSRDSLLVVSIIIMTILLILWSKVS
>Q035B4 ~~~ecfT~~~Energy-coupling factor transporter transmembrane protein EcfT~~~
MDKLLLGRYIPGDSWVHRLDPRTKLIASFYYIGIVFLANNWQTYLMMFVATLFMIWLSGIKIGFFLKGVRPLIWLILFTV
VLQVLFVRGGTVYWHWGWLWITEFGLINGAFIFVRFVLIIFMSTLLTLSTQPLSLADAVESLLKPLRVIRVPVTELALVL
QIALRFVPTLMDQTTKIMNAQRARGVDFGEGNIFQQMKAVVPLLIPLFVSSFTTADELATAMEARGYQGGDDRTKYRILR
YHRRDWVAAGGMLVLTGLLLLLRA
>Q03ZL4 ~~~ecfT~~~Energy-coupling factor transporter transmembrane protein EcfT~~~COG0619
MNNIMIGRFVPGDSWIHRLDPRTKMIGTFIFIFVMLWSTSWATYAWSAAFVVLAIRLTKQPFRLYWDGLKPIFWLILFTV
VLQLFFTPGTPVLLHAGPLKVTIPGIINAIYVMIRFVLIILMSTILTLTTPPTSIANALESLLKPLKKIHVPVAELSLML
SIALRFVPLLMDETQKIMNAQKSRGMSFSTGGPIKRAKAIVPLLIPLFVGALQRALDLANAMEVRGFQDATQRTKYRVLS
YGSNDRSAFIGLIGFTIIFIGINFFIK
>Q03PY7 ~~~ecfT~~~Energy-coupling factor transporter transmembrane protein EcfT~~~COG0619
MSNFIFGRYLPLDSVVHRLDPRAKLMLSFCYIIVVFLANNIWSYAILIAFTVGAILSSKISLGFFLKGIRPLLWLIVFTV
VLQLLFSPAGGHTYFHWAFINVTQDGLINAGYIFVRFLLIIMMSTLLTLSTQPLDIATGLASLMKPLRWVKVPVDTLAMM
LSIALRFVPTLMDEATKIMNAQRARGVDFGEGGLFKQAKSLIPLMVPLFMSAFNRAEDLSTAMEARGYQDSEHRSQYRIL
TWQRRDTVTWLLFLLGFVAILIFRHW
>Q5M245 ~~~ecfT~~~Energy-coupling factor transporter transmembrane protein EcfT~~~COG0619
MDKLIIGRYIVGDSFIHRLDPRSKLLAMLIYIIVIFWANNPVTYAVITLFTLFLVFLSKIKLGFFLGGIKPMIWIILFST
LFQVFFNTRGNVLWSIGFFKITEVGLNQGWMIFLRFILIISFSTLLTLTTTPLSLSDAVESLLKPLTIFKVPAHEIGLML
SLSLRFVPTLMDDTTRIMNAQKARGVDFDEGNIIQKVRSIIPILIPLFASSFKRADALAIAMEARGYRGSEGRTKYRRLL
WNCRDTLSIIAILALGLILFYLKS
>Q9X2I1 ~~~ecfT~~~Energy-coupling factor transporter transmembrane protein EcfT~~~COG0619
MRLPTVLIGRYIPVDSIVHRLDPRAKLLGMIFLISAVLIVPNLLFYLVPGLAIFLLMFLSRTGFKIYLAGLRSLWFFLVF
AVLVQFFSSSEGEKIFWMITDRAIWSAVYIMLRLVLIILLAENFSATTPPLLSARAIESIFSTFGARKIGHEIGMVMTIA
MRFVPVLALEADRILKAQIARGANFERGKFFDRIRALVVIIVPLLISALRKAEELAVAMEARLYTGEPPKTRFKDIKWKP
MDTLYVLLTAGVLVLVLFGRYFVDGVFQYGS
>P9WNN7 4.2.1.17~~~~~~Probable enoyl-CoA hydratase EchA12~~~COG1024
MPHRCAAQVVAGYRSTVSLVLVEHPRPEIAQITLNRPERMNSMAFDVMVPLKEALAQVSYDNSVRVVVLTGAGRGFSPGA
DHKSAGVVPHVENLTRPTYALRSMELLDDVILMLRRLHQPVIAAVNGPAIGGGLCLALAADIRVASSSAYFRAAGINNGL
TASELGLSYLLPRAIGSSRAFEIMLTGRDVSAEEAERIGLVSRQVPDEQLLDACYAIAARMAGFSRPGIELTKRTLWSGL
DAASLEAHMQAEGLGQLFVRLLTANFEEAVAARAEQRAPVFTDDT
>P95279 ~~~~~~Putative enoyl-CoA hydratase EchA13~~~COG1024
MFVGRVGPVDRRSDGERSRRPREFEYIRYETIDDGRIAAITLDRPKQRNAQTRGMLVELGAAFELAEADDTVRVVILRAA
GPAFSAGHDLGSADDIRERSPGPDQHPSYRCNGATFGGVESRNRQEWHYYFENTKRWRNLRKITIAQVHGAVLSAGLMLA
WCCDLIVASEDTVFADVVGTRLGMCGVEYFGHPWEFGPRKTKELLLTGDCIGADEAHALGMVSKVFPADELATSTIEFAR
RIAKVPTMAALLIKESVNQTVDAMGFSAALDGCFKIHQLNHAHWGEVTGGKLSYGTVEYGLEDWRAAPQIRPAIKQRP
>P64019 4.2.1.17~~~~~~Probable enoyl-CoA hydratase echA14~~~
MAQYDPVLLSVDKHVALITVNDPDRRNAVTDEMSAQLRAAIQRAEGDPDVHAVVVTGAGKAFCAGADLSALGAGVGDPAE
PRLLRLYDGFMAVSSCNLPTIAAVNGAAVGAGLNLALAADVRIAGPAALFDARFQKLGLHPGGGATWMLQRAVGPQVARA
ALLFGMCFDAESAVRHGLALMVADDPVTAALELAAGPAAAPREVVLASKATMRATASPGSLDLEQHELAKRLELGPQAKS
VQSPEFAARLAAAQHR
>P9WNN5 4.2.1.17~~~~~~Probable enoyl-CoA hydratase EchA14~~~COG1024
MAQYDPVLLSVDKHVALITVNDPDRRNAVTDEMSAQLRAAIQRAEGDPDVHAVVVTGAGKAFCAGADLSALGAGVGDPAE
PRLLRLYDGFMAVSSCNLPTIAAVNGAAVGAGLNLALAADVRIAGPAALFDARFQKLGLHPGGGATWMLQRAVGPQVARA
ALLFGMCFDAESAVRHGLALMVADDPVTAALELAAGPAAAPREVVLASKATMRATASPGSLDLEQHELAKRLELGPQAKS
VQSPEFAARLAAAQHR
>Q7TXE1 4.2.1.17~~~~~~Probable enoyl-CoA hydratase EchA17~~~
MAAVTPTVPEFVNVVVSDGSQDAGLAMLLLSRPPTNAMTRQVYREVVAAANELGRRDDVAAVILYGGHEIFSAGDDMPEL
RTLSAQEADTAARIRQQAVDAVAAIPKPTVAAITGYALGAGLTLALAADWRVSGDNVKFGATEILAGLIPSGDGMARLTR
AAGPSRAKELVFSGRFFDAEEALALGLIDDMVAPDDVYDAAAAWARRFLDGPPHALAAAKAGISDVYELAPAERIAAERR
RYVEVFAAGQGGGSKGDRGGR
>P9WNN3 4.2.1.17~~~~~~Probable enoyl-CoA hydratase EchA17~~~COG1024
MAAVTPTVPEFVNVVVSDGSQDAGLAMLLLSRPPTNAMTRQVYREVVAAANELGRRDDVAAVILYGGHEIFSAGDDMPEL
RTLSAQEADTAARIRQQAVDAVAAIPKPTVAAITGYALGAGLTLALAADWRVSGDNVKFGATEILAGLIPSGDGMARLTR
AAGPSRAKELVFSGRFFDAEEALALGLIDDMVAPDDVYDAAAAWARRFLDGPPHALAAAKAGISDVYELAPAERIAAERR
RYVEVFAAGQGGGSKGDRGGR
>O53561 4.2.1.-~~~~~~Enoyl-CoA hydratase EchA19~~~COG1024
MATVESGPDALVERRGHTLIVTMNRPAARNALSTEMMRIMVQAWDRVDNDPDIRCCILTGAGGYFCAGMDLKAATQKPPG
DSFKDGSYGPSRIDALLKGRRLTKPLIAAVEGPAIAGGTEILQGTDIRVAGESAKFGISEAKWSLYPMGGSAVRLVRQIP
YTLACDLLLTGRHITAAEAKEMGLIGHVVPDGQALTKALELADAISANGPLAVQAILRSIRETECMPENEAFKIDTQIGI
KVFLSDDAKEGPRAFAEKRAPNFQNR
>I6Y3U6 4.1.99.-~~~~~~(7aS)-7a-methyl-1,5-dioxo-2,3,5,6,7,7a-hexahydro-1H-indene-carboxyl-CoA hydrolase~~~COG1024
MPITSTTPEPGIVAVTVDYPPVNAIPSKAWFDLADAVTAAGANSDTRAVILRAEGRGFNAGVDIKEMQRTEGFTALIDAN
RGCFAAFRAVYECAVPVIAAVNGFCVGGGIGLVGNSDVIVASEDATFGLPEVERGALGAATHLSRLVPQHLMRRLFFTAA
TVDAATLQHFGSVHEVVSRDQLDEAALRVARDIAAKDTRVIRAAKEALNFIDVQRVNASYRMEQGFTFELNLAGVADEHR
DAFVKKS
>Q0S7P8 4.1.99.-~~~~~~(7aS)-7a-methyl-1,5-dioxo-2,3,5,6,7,7a-hexahydro-1H-indene-carboxyl-CoA hydrolase~~~COG1024
MGITSTTDGDGITTVTVDYPPVNAIPSRGWFELADAVLDAGRNPDTHVVILRAEGRGFNAGVDIKEMQATDGYGALVDAN
RGCAAAFAAVYDCAVPVVVAVNGFCVGGGIGLVGNADVIVASDDAVFGLPEVDRGALGAATHLARLVPQHMMRTLYYTAQ
NVTAQQLQHFGSVYEVVPREKLDDTARDIAAKIAAKDTRVIRCAKEAINGIDPVDVKTSYRLEQGYTFELNLAGVSDEHR
DEFVETGKPRSHSNNRKG
>P9WNP1 4.2.1.17~~~echA6~~~Probable enoyl-CoA hydratase EchA6~~~COG1024
MIGITQAEAVLTIELQRPERRNALNSQLVEELTQAIRKAGDGSARAIVLTGQGTAFCAGADLSGDAFAADYPDRLIELHK
AMDASPMPVVGAINGPAIGAGLQLAMQCDLRVVAPDAFFQFPTSKYGLALDNWSIRRLSSLVGHGRARAMLLSAEKLTAE
IALHTGMANRIGTLADAQAWAAEIARLAPLAIQHAKRVLNDDGAIEEAWPAHKELFDKAWGSQDVIEAQVARMEKRPPKF
QGA
>P9WNN9 4.2.1.17~~~echA8~~~Probable enoyl-CoA hydratase EchA8~~~COG1024
MTYETILVERDQRVGIITLNRPQALNALNSQVMNEVTSAATELDDDPDIGAIIITGSAKAFAAGADIKEMADLTFADAFT
ADFFATWGKLAAVRTPTIAAVAGYALGGGCELAMMCDVLIAADTAKFGQPEIKLGVLPGMGGSQRLTRAIGKAKAMDLIL
TGRTMDAAEAERSGLVSRVVPADDLLTEARATATTISQMSASAARMAKEAVNRAFESSLSEGLLYERRLFHSAFATEDQS
EGMAAFIEKRAPQFTHR
>Q5SKU3 4.2.1.17~~~~~~Putative enoyl-CoA hydratase~~~COG1024
MVQVEKGHVAVVFLNDPERRNPLSPEMALSLLQALDDLEADPGVRAVVLTGRGKAFSAGADLAFLERVTELGAEENYRHS
LSLMRLFHRVYTYPKPTVAAVNGPAVAGGAGLALACDLVVMDEEARLGYTEVKIGFVAALVSVILVRAVGEKAAKDLLLT
GRLVEAREAKALGLVNRIAPPGKALEEAKALAEEVAKNAPTSLRLTKELLLALPGMGLEDGFRLAALANAWVRETGDLKE
GIRAFFEKRPPRF
>Q3IZ90 5.4.99.63~~~ecm~~~Ethylmalonyl-CoA mutase~~~COG1884
MTQKDSPWLFRTYAGHSTAKASNALYRTNLAKGQTGLSVAFDLPTQTGYDSDDALARGEVGKVGVPICHLGDMRMLFDQI
PLEQMNTSMTINATAPWLLALYIAVAEEQGADISKLQGTVQNDLMKEYLSRGTYICPPRPSLRMITDVAAYTRVHLPKWN
PMNVCSYHLQEAGATPEQELAFALATGIAVLDDLRTKVPAEHFPAMVGRISFFVNAGIRFVTEMCKMRAFVDLWDEICRD
RYGIEEEKYRRFRYGVQVNSLGLTEQQPENNVYRILIEMLAVTLSKKARARAVQLPAWNEALGLPRPWDQQWSLRMQQIL
AYESDLLEYEDLFDGNPAIERKVEALKDGAREELAHIEAMGGAIEAIDYMKARLVESNAERIARVETGETVVVGVNRWTS
GAPSPLTTGDGAIMVADPEAERDQIARLEAWRAGRDGAAVAAALAELRRAATSGENVMPASIAAAKAGATTGEWAAELRR
AFGEFRGPTGVARAPSNRTEGLDPIREAVQAVSARLGRPLKFVVGKPGLDGHSNGAEQIAARARDCGMDITYDGIRLTPA
EIVAKAADERAHVLGLSILSGSHMPLVTEVLAEMRRAGLDVPLIVGGIIPEEDAAELRASGVAAVYTPKDFELNRIMMDI
VGLVDRTALAAE
>Q49115 5.4.99.63~~~ecm~~~Ethylmalonyl-CoA mutase~~~COG1884
MSAQASVAEVKRDKPWIIRTYAGHSTAAESNKLYRGNLAKGQTGLSVAFDLPTQTGYDPDHELARGEVGKVGVSIAHLGD
MRALFDQIPLAQMNTSMTINATAPWLLSLYLAVAEEQGAPLAALQGTTQNDIIKEYLSRGTYVFPPAPSLRLTKDVILFT
TKNVPKWNPMNVCSYHLQEAGATPVQELSYALAIAIAVLDTVRDDPDFDEASFSDVFSRISFFVNAGMRFVTEICKMRAF
AELWDEIAQERYGITDAKKRIFRYGVQVNSLGLTEQQPENNVHRILIEMLAVTLSKRARARAVQLPAWNEALGLPRPWDQ
QWSMRMQQILAFETDLLEYDDIFDGSTVIEARVEALKEQTRAELTRIAEIGGAVTAVEAGELKRALVESNARRISAIEKG
EQIVVGVNKWQQGEPSPLTAGDGAIFTVSETVEMEAETRIREWRSKRDERAVGQALADLEQAARSGANIMPPSIAAAKAG
VTTGEWGQRLREVFGEYRAPTGVTLQTVTSGAAEDARLLIADLGERLGETPRLVVGKPGLDGHSNGAEQIALRARDVGFD
VTYDGIRQTPTEIVAKAKERGAHVIGLSVLSGSHVPLVREVKAKLREAGLDHVPVVVGGIISTEDELVLKNMGVTAVYTP
KDYELDKIMVGLAKVVERALDKRAADRADTEAGVPGAPKRNESGAQVF
>P23827 ~~~eco~~~Ecotin~~~COG4574
MKTILPAVLFAAFATTSAWAAESVQPLEKIAPYPQAEKGMKRQVIQLTPQEDESTLKVELLIGQTLEVDCNLHRLGGKLE
NKTLEGWGYDYYVFDKVSSPVSTMMACPDGKKEKKFVTAYLGDAGMLRYNSKLPIVVYTPDNVDVKYRVWKAEEKIDNAV
VR
>B1JSA0 ~~~eco~~~Ecotin~~~
MKKCSIILASVLLATSINAIADTPTPLNQQQPLEKIAPYPQAEKGMSRQVIFLEPQKDESRFKVELLIGKTLNVDCNRHM
LGGNLETRTLSGWGFDYLVMDKISQPASTMMACPEDSKPQVKFVTANLGDAAMQRYNSRLPIVVYVPQGVEVKYRIWEAG
EDIRSAQVK
>B7UJE1 ~~~ecpA~~~Common pilus major fimbrillin subunit EcpA~~~
MKKKVLAIALVTVFTGTGVAQAADVTAQAVATWSATAKKDTTSKLVVTPLGSLAFQYAEGIKGFNSQKGLFDVAIEGDST
ATAFKLTSRLITNTLTQLDTSGSTLNVGVDYNGAAVEKTGDTVMIDTANGVLGGNLSPLANGYNASNRTTAQDGFTFSII
SGTTNGTTAVTDYSTLPEGIWSGDVSVQFDATWTS
>P0AAA4 ~~~ecpA~~~Common pilus major fimbrillin subunit EcpA~~~
MKKKVLAIALVTVFTGMGVAQAADVTAQAVATWSATAKKDTTSKLVVTPLGSLAFQYAEGIKGFNSQKGLFDVAIEGDST
ATAFKLTSRLITNTLTQLDTSGSTLNVGVDYNGAAVEKTGDTVMIDTANGVLGGNLSPLANGYNASNRTTAQDGFTFSII
SGTTNGTTAVTDYSTLPEGIWSGDVSVQFDATWTS
>P0C8Z8 ~~~ecpA~~~Common pilus major fimbrillin subunit EcpA~~~
MKKKVLAIALVTVFTGTGVAQAADVTAQAVATWSATAKKDTTSKLVVTPLGSLAFQYAEGIKGFNSQKGLFDVAIEGDST
ATAFKLTSRLITNTLTQLDTSGSTLNVGVDYNGAAVEKTGDTVMIDTANGVLGGNLSPLANGYNASNRTTAQDGFTFSII
SGTTNGTTAVTDYSTLPEGIWSGDVSVQFDATWTS
>Q8CWB9 ~~~ecpA~~~Common pilus major fimbrillin subunit EcpA~~~
MKKKVLAIALVTVFTGTGVAQAADVTAQAVATWSATAKKDTTSKLVVTPLGSLAFQYAEGIKGFNSQKGLFDVAIEGDST
ATAFKLTSRLITNTLTQLDTSGSTLNVGVDYNGTAVEKTGDTVMIDTANGVLGGNLSPLANGYNASNRTTAQDGFTFSII
SGTTNGTTAVTDYSTLPEGIWSGDVSVQFDATWTS
>P0C0P9 3.4.22.-~~~ecpA~~~Extracellular cysteine protease~~~
MYAEYVNQLKNFRIRETQGYNSWCAGYTMSALLNATYNTNRYNAESVMRYLHPNLRGHDFQFTGLTSNEMLRFGRSQGRN
TQYLNRMTSYNEVDQLTTNNQGIAVLGKRVESSDGIHAGHAMAVAGNAKVNNGQKVILIWNPWDNGLMTQDAHSNIIPVS
NGDHYEWYASIYGY
>B7UJE0 ~~~ecpB~~~Probable fimbrial chaperone EcpB~~~
MKKHLLPLALLFSGISPAQALDVGDISSFMNSDSSTLSKTIQNSTDSGRLINIRLERLSSPLDDGQVIAMDKPDELLLTP
ASLLLPAQASEVIRFFYKGPADEKERYYRIVWFDQALSDAQRDNANRSAVATASARIGTILVVAPRQANYHFQYANGSLT
NTGNATLRILAYGPCLKAANGKECKENYYLMPGKSRRFTRVDTADNKGRVALWQGDKFIPVK
>Q8X6I2 ~~~ecpB~~~Probable fimbrial chaperone EcpB~~~COG3121
MKKHLLLLALLLSGISPAQALDVGDISSFMNSDSSTLSKTIKNSTDSGRLINIRLERLSSPLDDGQVISMDKPDELLLTP
ASLLLPAQASEVIRFFYKGPADEKERYYRIVWFDQALSDAQRDNANRSAVATASARIGTILVVAPRQANYHFQYANGTLT
NTGNATLRILAYGPCLKAANGKECKENYYLMPGKSRRFTRVDTADNKGRVALWQGDKFIPVK
>P77188 ~~~ecpB~~~Probable fimbrial chaperone EcpB~~~COG3121
MKKHLLPLALLFSGISPAQALDVGDISSFMNSDSSTLSKTIKNSTDSGRLINIRLERLSSPLDDGQVISMDKPDELLLTP
ASLLLPAQASEVIRFFYKGPADEKERYYRIVWFDQALSDAQRDNANRSAVATASARIGTILVVAPRQANYHFQYANGSLT
NTGNATLRILAYGPCLKAANGKECKENYYLMPGKSRRFTRVDTADNKGRVALWQGDKFIPVK
>B7UJD9 ~~~ecpC~~~Probable outer membrane usher protein EcpC~~~
MPLRRFSPGLKAQFAFGMVFLFVQPDASAADISAQQIGGVIIPQAFSQALQDGMSVPLYIHLAGSQGRQDDQRIGSAFIW
LDDGQLRIRKIQLEESEDNASVSEQTRQQLMTLANAPFNEALTIPLTDNAQLDLSLRQLLLQLVVKREALGTVLRSRSED
IGQSSVNTLSSNLSYNFGVYNNQLRNGGSNTSSYLSLNNVTALREHHVVLDGSLYGIGSGQQDSELYKAMYERDFAGHRF
AGGMLDTWNLQSLGPMTAISAGKIYGLSWGNQASSTIFDSSQSATPVIAFLPAAGEVHLTRDGRLLSVQNFTMGNHEVDT
RGLPYGIYDVEVEVIVNGRVISKRTQRVNKLFSRGRGVGAPLAWQIWGGSFHMDRWSENGKKTRPAKESWLAGASTSGSL
STFSWAATGYGYDNQAVGETRLTLPLGVAINVNLQNMLASDSSWSNIASISATLPGGFSSLWVNQEKTRIGNQLRRSDAD
NRAIGGTLNLNSLWSKLGTFSISYNDDRRYNSHYYTADYYQSVYSGTFGSLGLRAGIQRYNNGDSSANTGKYIALDLSLP
LGNWFSAGMTHQNGYTMANLSARKQFDEGTIRTVGANLSRAISGDTGDDKTLSGGAYAQFDARYASGTLNVNSAADGYIN
TNLTANGSVGWQGKNIAASGRTDGNAGVIFDTGLENDGQISAKINGRIFPLNGKRNYLPLSPYGRYEVELQNSKNSLDSY
DIVSGRKSHLTLYPGNVAVIEPEVKQMVTVSGRIRAEDGTLLANARINNHIGRTRTDENGEFVMDVDKKYPTIDFRYSGN
KTCEVALELNQARGAVWVGDVVCSGLSSWAAVTQTGEENES
>Q8X6I4 ~~~ecpC~~~Probable outer membrane usher protein EcpC~~~COG3188
MPLRRFSPGLKAQFAFGMVFLFVQPDASAADISAQQIGGVIIPQAFSQALQDGMSVPLYIHLAGSQGRQDDQRIGSAFIW
LDDGQLRIRKIQLEESEDNASVSEQTRQQLMALANAPFNEALTIPLTDNAQLDLSLRQLLLQLVVKREALGTVLRSRSED
IGQSSVNTLSSNLSYNFGIYNNQLRNGGSNTSSYLSLNNVTALREHHVVLDGSLYGIGSGQQDSELYKAMYERDFAGHRF
AGGTLDTWNLQSLGPMTAISAGKIYGLSWGNQASSTIFDSSQSATPVIAFLPAAGEVHLTRDGRLLSVQNFTMGNHEVDT
RGLPYGIYDVEVEVIVNGRVISKRTQRVNKLFSRGRGVGAPLAWQVWGGSFHMDRWSENGKKTRPAKESWLAGASTSGSL
STLSWAATGYGYDNQAVGETRLTLPLGGAINVNLQNMLASDSSWSSIGSISATLPGGFSSLWVNQEKTRIGNQLRRSDAD
NRAIGGTLNLNSLWSKLGTFSISYNDDRRYNSHYYTADYYQNVYSGTFGSLGLRAGIQRYNNGDSNANTGKYIALDLSLP
LGNWFSAGMTHQNGYTMANLSARKQFDEGTIRTVGANLSRAISGDTGDDKTLSGGAYAQFDARYASGTLNVNSAADGYVN
TNLTANGSVGWQGKNIAASGRTDGNAGVIFNTGLEDDGQISAKINGRIFPLNGKRNYLPLSPYGRYEVELQNSKNSLDSY
DIVSGRKSHLTLYPGNVAVIEPEVKQMVTVSGRIRAEDGTLLANARINNHIGRTRTDENGEFVMDVDKKYPTIDFRYSGN
KTCEVALELNQARGAVWVGDVVCSGLSSWAAVTQTGEENES
>P77802 ~~~ecpC~~~Probable outer membrane usher protein EcpC~~~COG3188
MPLRRFSPGLKAQFAFGMVFLFVQPDASAADISAQQIGGVIIPQAFSQALQDGMSVPLYIHLAGSQGRQDDQRIGSAFIW
LDDGQLRIRKIQLEESEDNASVSEQTRQQLMALANAPFNEALTIPLTDNAQLDLSLRQLLLQLVVKREALGTVLRSRSED
IGQSSVNTLSSNLSYNLGVYNNQLRNGGSNTSSYLSLNNVTALREHHVVLDGSLYGIGSGQQDSELYKAMYERDFAGHRF
AGGMLDTWNLQSLGPMTAISAGKIYGLSWGNQASSTIFDSSQSATPVIAFLPAAGEVHLTRDGRLLSVQNFTMGNHEVDT
RGLPYGIYDVEVEVIVNGRVISKRTQRVNKLFSRGRGVGAPLAWQVWGGSFHMDRWSENGKKTRPAKESWLAGASTSGSL
STLSWAATGYGYDNQAVGETRLTLPLGGAINVNLQNMLASDSSWSSIGSISATLPGGFSSLWVNQEKTRIGNQLRRSDAD
NRAIGGTLNLNSLWSKLGTFSISYNDDRRYNSHYYTADYYQNVYSGTFGSLGLRAGIQRYNNGDSNANTGKYIALDLSLP
LGNWFSAGMTHQNGYTMANLSARKQFDEGTIRTVGANLSRAISGDTGDDKTLSGGAYAQFDARYASGTLNVNSAADGYVN
TNLTANGSVGWQGKNIAASGRTDGNAGVIFNTGLEDDGQISAKINGRIFPLNGKRNYLPLSPYGRYEVELQNSKNSLDSY
DIVSGRKSRLTLYPGNVAVIEPEVKQMVTVSGRIRAEDGTLLANARINNHIGRTRTDENGEFVMDVDKKYPTIDFRYSGN
KTCEVALELNQARGAVWVGDVVCSGLSSWAAVTQTGEENES
>B7UJD8 ~~~ecpD~~~Fimbria adhesin EcpD~~~
MRVNLLIAMIIFALIWPATALRAAVSKTTWADAPAREFVFVENNSDDNFFVTPGGALDPRLTGANRWTGLKYNGSGTIYQ
QSLGYIDNGYNTGLYTNWKFDMWLENSPVSSPLTGLRCINWYAGCNMTTSLILPQTTDASGFYGATVTSGGAKWMHGMLS
DAFYQYLQQMPVGSSFTMTINACQTSVNYDASSGARCKDQASGNWYVRNVTHTKAANLRLINTHSLAEVFINSDGVPTLG
EGNADCRTQTIGSRSGLSCKMVNYTLQTNGLSNTSIHIFPAIANSSLASAVGAYDMQFSLNGSSWKPVSNTAYYYTFNEM
KSADSIYVFFSSNFFKQMVNLGISDINTKDLFNFRFQNTTSPESGWYEFSTSNTLIIKPRDFSISIISDEYTQTPSREGY
VASGESALDFGYIVTTSGKTAADEVLIKVTGPAQVIGGRSYCVFSSDDGKAKVPFPATLSFITRNGATKTYDAGCDDSWR
DMTDALWLTTPWTDISGEVGQMDKTTVKFSIPMDNAISLRTVDDNGWFGEVSASGEIHVQATWRNIN
>Q8X4N4 ~~~ecpD~~~Fimbria adhesin EcpD~~~
MRVNLLIAMIIFALIWPVTALRAAVSKTTWADAPAREFVFVENNSDDNFFVTPGGALDPRLTGANRWTGLKYNGSGTIYQ
QSLGYIDNGYNTGLYTNWKFDMWLENSPVSSPLTGLRCINWYAGCNMTTSLILPQTTDTSGFYGATVTSGGAKWMHGMLS
DAFYQYLQQMPVGSSFTMTINACQTSVNYDASSGARCKDQASGNWYVRNVTHTKAANLRLINTHSLAEVFINSDGVPTLG
EGNADCRTQTIGSRSGLSCKMVNYTLQTNGLSNTSIHIFPAIANSSLASAVGAYDMQFSLNGSSWKPVSNTAYYYTFNEM
KSADSIYVFFSSNFFKQMVNQGISDINTKDLFNFRFQNTTSPESGWYEFSTSNTLIIKPRDFSISIISDEYTQTPSREGY
VGSGESALDFGYIVTTSGKTAADEVLIKVTGPAQVIGGRSYCVFSSDDGKAKVPFPATLSFITRNGATKTYDAGCDDSWR
DMTDALWLTTPWTDISGEVGQMDKTTVKFSIPMDNAISLRTVDDNGWFGEVSASGEIHVQATWRNIN
>Q8FKL3 ~~~ecpD~~~Fimbria adhesin EcpD~~~
MRVNLLIAMIIFALIWPATALRAAVSKTTWADAPAREFVFVENNSDDNFFVTPGGALDPRLTGANRWTGLKYNGSGTIYQ
QSLGYIDNGYNTGLYTNWKFDMWLENSPVSSPLTGLRCINWYAGCNMTTSLILPQTTDASGFYGATVTSGGAKWMHGMLS
DAFYQYLQQMPVGSSFTMTINACQTSVNYDASSGARCKDQASGNWYVRNVTHTKAANLRLINTHSLAEVFINSDGVPTLG
EGNADCRTQTIGSRSGLSCKMVNYTLQTNGLSNTSIHIFPAIANSSLASAVGAYDMQFSLNGSSWKPVSNTAYYYTFNEM
KSADSIYVFFSSNFFKQMVNLGISDINTKDLFNFRFQNTTSPESGWYEFSTSNTLIIKPRDFSISIISDEYTQTPSREGY
VGSGESALDFGYIVTTSGKTAADEVLIKVTGPAQVIGGRSYCVFSSDDGKAKVPFPATLSFITRNGATKTYDAGCDDSWR
DMTDALWLTTPWTDISGEVGQMDKTTVKFSIPMDNAISLRTVDDNGWFGEVSASGEIHVQATWRNIN
>B7UJD7 ~~~ecpE~~~Probable fimbrial chaperone EcpE~~~
MFRRRGVTLTKALLTVVCMLAAPLTQAISVGNLTFSLPSETDFVSKRVVNNNKSARIYRIAISAIDSPGSSELRTRPVDG
ELLFAPRQLALQAGESEYFKFYYHGPRDNRERYYRVSFREVPTRNLTKRSPTGGEVSTEPVVVMDTILVVRPRQVQFKWS
FDQVTGTVSNTGNTWFKLLIKPGCDSTEEEGDAWYLRPGDVVHQPELRQPGNHYLVYNDKFIKISDSCPAKPPSAD
>Q8X6I6 ~~~ecpE~~~Probable fimbrial chaperone EcpE~~~COG3121
MFRRRGVTLTKALLTAVCMLAAPLTQAISVGNLTFSLPSETDFVSKRVVNNNKSARIYRIAISAIDSPGSSELRTRPVDG
ELLFAPRQLALQAGESEYFKFYYHGPQDNRERYYRVSFREVPTRNLTKRSPTGGEVSTEPVVVMDTILVVRPRQVQFKWS
FDQVTGTVSNTGNTWFKLLIKPGCDSTEEEGDAWYLRPEDVVHQPELRQPGNHYLVYNDKFIKISDSCPAKPPSAD
>B7UJE2 ~~~ecpR~~~HTH-type transcriptional regulator EcpR~~~
MTWQNDYSRDYEVKNHMECQNRSDKYIWSPHDAYFYKGLSELIVDIDRLIYLSLEKIRKDFVFINLNTDSLTEFINRDNE
WLSAVKGKQVVLIAARKSEALANYWYYNSNIRGVVYAGLSRDIRKELAYVINGRFLRKDIKKDKITDREMEIIRMTAQGM
LPKSIARIENCSVKTVYTHRRNAEAKLYSKLYKLVQ
>Q8X6I1 ~~~ecpR~~~HTH-type transcriptional regulator EcpR~~~COG2771
MTWQNDYSRDYEVENHMECQNRSDKYIWSPHDAYFYKGLSELIVDIDRLIYLSLEKIRKDFVFINLNTDSLTEFINRDNE
WLSAVKGKQVVLIAARKSEALANYWYYNSNIRGVVYAGLSRDIRKELAYVINGRFLRKDIKKDKITDREMEIIRMTAQGM
LPKSIARIENCSVKTVYTHRRNAEAKLYSKLYKLVQ
>Q9AME2 ~~~ecpR~~~HTH-type transcriptional regulator EcpR~~~
MTWQNDYSRDYEVKNHMECQNRSDKYIWSPHDAYFYKGLSELIVDIDRLIYLSLEKIRKDFVFINLNTDSLTEFINRDNE
WLSAVKGKQVVLIAARKSEALANYWYYNSNIRGVVYAGLSRDIRKELAYVINGRFLRKDIKKDKITDREMEIIRMTAQGM
LPKSIARIENCSVKTVYTHRRNAEAKLYSKLYKLVQ
>P71301 ~~~ecpR~~~HTH-type transcriptional regulator EcpR~~~COG2771
MTWQSDYSRDYEVKNHMECQNRSDKYIWSPHDAYFYKGLSELIVDIDRLIYLSLEKIRKDFVFINLSTDSLSEFINRDNE
WLSAVKGKQVVLIAARKSEALANYWYYNSNIRGVVYAGLSRDIRKELVYVINGRFLRKDIKKDKITDREMEIIRMTAQGM
QPKSIARIENCSVKTVYTHRRNAEAKLYSKIYKLVQ
>P55339 ~~~ecsA~~~ABC-type transporter ATP-binding protein EcsA~~~COG1131
MSLLSVKDLTGGYTRNPVLKNVSFTLEPNQIVGLIGLNGAGKSTTIRHIIGLMDPHKGSIELNGKTFAEDPEGYRSQFTY
IPETPVLYEELTLMEHLELTAMAYGLSKETMEKRLPPLLKEFRMEKRLKWFPAHFSKGMKQKVMIMCAFLAEPALYIIDE
PFLGLDPLAINALLERMNEAKKGGASVLMSTHILATAERYCDSFIILHNGEVRARGTLSELREQFGMKDAALDDLYLELT
KEDAGHE
>Q7W980 2.3.1.178~~~ectA~~~L-2,4-diaminobutyric acid acetyltransferase~~~
MRKDETSNTSPDISVAQPASALRYHLRPPRRNDGAAIHQLVSECPPLDLNSLYAYLLLCEHHAHTCVVAESPGGRIDGFV
SAYLLPTRPDVLFVWQVAVHSRARGHRLGRAMLGHILERQECRHVRHLETTVGPDNQASRRTFAGLAGERGAHVSEQPFF
DRQAFGGADHDDEMLLRIGPFTHPPH
>O52249 2.3.1.178~~~ectA~~~L-2,4-diaminobutyric acid acetyltransferase~~~COG0456
MNATTEPFTPSADLAKPSVADAVVGHEASPLFIRKPSPDDGWGIYELVKSCPPLDVNSAYAYLLLATQFRDSCAVATNEE
GEIVGFVSGYVKSNAPDTYFLWQVAVGEKARGTGLARRLVEAVMTRPEMAEVHHLETTITPDNQASWGLFRRLADRWQAP
LNSREYFSTDQLGGEHDPENLVRIGPFQTDQI
>Q4JQJ5 2.3.1.178~~~ectA~~~L-2,4-diaminobutyric acid acetyltransferase~~~
MLPDKTALPIITLSQPTAEVGAQVHRLISKCPPLDPNSMYCNLLQSSHFSETAVAAKIGDELVGFVSGYRIPQRPDTLFV
WQVAVGEKARGQGLATRMLKAILARPVNQDINRIETTITPNNKASWALFEGLAKKLDTQIGSAVMFDKTRHFADQHETEM
LVKVGPFKAVQA
>Q9AP35 2.3.1.178~~~ectA~~~L-2,4-diaminobutyric acid acetyltransferase~~~
MFWVISKQGSTAVAEQEETLVFRVPTEDDGKAIWNLINYPGVLDLLSSYSYFMWAKFFDQTSVVGETNEQIVGFYIGLHT
TEYGPDTLFYLASCSDETQRQKGLASRMLQAILHRYAWRNIRYLEATVGTSNEAPEALFQKLSRDLKTAYHVTEFFTEDQ
FPGKGHEDERLFKIGPFQQV
>Q9ZEU7 2.6.1.76~~~ectB~~~Diaminobutyrate--2-oxoglutarate transaminase~~~COG0160
MQTQILERMESEVRTYSRSFPTVFTEAKGARLHAEDGNQYIDFLAGAGTLNYGHNHPKLKQALADYIASDGIVHGLDMWS
AAKRDYLETLEEVILKPRGLDYKVHLPGPTGTNAVEAAIRLARNAKGRHNIVTFTNGFHGVTMGALATTGNRKFREATGG
IPTQGASFMPFDGYMGEGVDTLSYFEKLLGDNSGGLDVPAAVIIETVQGEGGINPAGIPWLQRLEKICRDHDMLLIVDDI
QAGCGRTGKFFSFEHAGITPDIVTNSKSLSGFGLPFAHVLMRPELDIWKPGQYNGTFRGFNLAFVTAAAAMRHFWSDDTF
ERDVQRKGRVVEDRFQKLASFMTEKGHPASERGRGLMRGLDVGDGDMADKITAQAFKNGLIIETSGHSGQVIKCLCPLTI
TDEDLVGGLDILEQSVKEVFGQA
>O52250 2.6.1.76~~~ectB~~~Diaminobutyrate--2-oxoglutarate transaminase~~~COG0160
MQTQILERMESDVRTYSRSFPVVFTKARNARLTDEEGREYIDFLAGAGTLNYGHNNPHLKQALLDYIDSDGIVHGLDFWT
AAKRDYLETLEEVILKPRGLDYKVHLPGPTGTNAVEAAIRLARVAKGRHNIVSFTNGFHGVTMGALATTGNRKFREATGG
VPTQAASFMPFDGYLGSSTDTLDYFEKLLGDKSGGLDVPAAVIVETVQGEGGINVAGLEWLKRLESICRANDILLIIDDI
QAGCGRTGKFFSFEHAGITPDIVTNSKSLSGYGLPFAHVLMRPELDKWKPGQYNGTFRGFNLAFATAAAAMRKYWSDDTF
ERDVQRKARIVEERFGKIAAWLSENGIEASERGRGLMRGIDVGSGDIADKITHQAFENGLIIETSGQDGEVVKCLCPLTI
PDEDLVEGLDILETSTKQAFS
>Q9AP34 2.6.1.76~~~ectB~~~Diaminobutyrate--2-oxoglutarate transaminase~~~
MLLTKEKNGMEIIEERESAVRSYSRSFPTVFEKAKDHLVWDVDGKEYIDFFAGAGSLNYGHNNEKMKTKIMDYVMNDGIS
HSLDMGTVARAEFLETFNEVILRPRNLDYKVMFPGPTGTNTVESALKIARKVTGRQNIISFTNAFHGMTLGSLSISGNSS
IRNGAGVPLTNTISMPYDTFFKNGNAIDYLEQYLEDTGSGVDLPAAMILETVQGEGGINAASFEWLRGIEKLCRRYDILL
IIDDVQAGCGRTGTFFSFEPAGIQPDIVCLSKSIGGYGLPLAITLIKPEHDIWEPGEHNGTFRGNNMAIVAATEALSYWK
TDDLAKSVQKKSKIIKLRFEQIVEDYPELKATTRGRGFMQGIACGKGKEAYATKICAKAFEKGVIMETSGPSGEVVKFLG
ALTIDETSLIKGLGILEEATEEVVRQ
>O52251 4.2.1.108~~~ectC~~~L-ectoine synthase~~~COG1917
MIVRNLEEARQTDRLVTAENGNWDSTRLSLAEDGGNCSFHITRIFEGTETHIHYKHHFEAVYCIEGEGEVETLADGKIWP
IKPGDIYILDQHDEHLLRASKTMHLACVFTPGLTGNEVHREDGSYAPADEADDQKPL
>Q1GNW6 4.2.1.108~~~ectC~~~L-ectoine synthase~~~COG1917
MIVRNLGDIRKTDRNVRSDGWASARMLLKDDGMGFSFHVTTLFAGSELRMHYQNHLEAVLVLKGTGTIEDLATGEVHALR
PGVMYALDDHDRHIVRPETDILTACVFNPPVTGREVHDESGAYPADPELAREPVAAD
>Q9AP33 4.2.1.108~~~ectC~~~L-ectoine synthase~~~
MIVRTIDEIIGTENEVESDTWTSRRLLLEKDGMGFSFHETIIYAGTETHIHYQNHLEAVYCVGGDGEIETVSDGKVYPIQ
DGTMYALDQHDEHYPRGGKTDMRLICTFNPPLVGTETHDENGVYPLLSKQPVGK
>E1VA04 1.14.11.55~~~ectD~~~Ectoine dioxygenase~~~COG5285
MSVQTSSNRPLPQANLHIATETPEADSRIRSAPRPGQDPYPTRLSEPLDLPWLNRREPVVKGEEADGPLSAAQLDTFERQ
GFIFEPDFLKGEELEALRHELNALLARDDFRGRDFAITEPQGNEIRSLFAVHYLSRVFSRLANDERLMGRARQILGGEPY
VHQSRINYKPGFEGKGFNWHSDFETWHAEDGMPAMHAVSASIVLTDNHTFNGPLMLVPGSHRVFVPCLGETPEDHHRQSL
KTQEFGVPSRQALRELIDRHGIEAPTGAAGGLLLFDCNTLHGSNANMSPDPRSNAFFVYNRRDNRCVEPYAASKRRPRFL
AHEPDEAWSPDG
>Q1GNW5 1.14.11.55~~~ectD~~~Ectoine dioxygenase~~~COG5285
MQDLYPSRQRADAEMRPRLDPVVHSEWTNDAPISARQAAAFDRDGYIVLEDIFSADEVAFLQKAAGNLLADPAALDADTI
VTEPQSNEIRSIFEIHAQSPVMARLAADARLADVARFLLGDEVYIHQSRLNYKPGFKGREFYWHSDFETWHVEDGMPRMR
ALSMSVLLAENTPHNGPLMVIPGSHRTYLTCVGETPDDHYLSSLKKQEYGVPDEESLAELAHRHGIVAPTGKPGTVILFD
CNLMHGSNGNITPFPRANAFLVYNAVSNRLEKPFGVEKPRPWFLARRGEPAALRVERGPLVETVPA
>Q93RV9 1.14.11.55~~~ectD~~~Ectoine dioxygenase~~~COG5285
MTTTTTNVTDLYPTRGATEVATPRQDPVVWGSPDAPGPVSAGDLQALDRDGFLAIDQLITPDEVGEYQRELERLTTDPAI
RADERSIVEPQSKEIRSVFEVHKISEVFAKLVRDERVVGRARQILGSDVYVHQSRINVKPGFGASGFYWHSDFETWHAED
GLPNMRTISVSIALTENYDTNGGLMIMPGSHKTFLGCAGATPKDNYKKSLQMQDAGTPSDEGLTKMASEYGIKLFTGKAG
SATWFDCNCMHGSGDNITPFPRSNVFIVFNSVENTAVEPFAAPIRRPEFIGARDFTPVK
>A4VFY4 1.14.11.55~~~ectD~~~Ectoine dioxygenase~~~COG5285
MQADLYPSRQEDQPSWQERLDPVVYRSDLENAPIAAELVERFERDGYLVIPNLFSADEVALFRAELERMRQDPAVAGSGK
TIKEPDSGAIRSVFAIHKDNELFARVAADERTAGIARFILGGDLYVHQSRMNFKPGFTGKEFYWHSDFETWHIEDGMPRM
RCLSCSILLTDNEPHNGPLMLMPGSHKHYVRCVGATPENHYEKSLRKQEIGIPDQNSLSELASRFGIDCATGPAGSVVFF
DCNTMHGSNGNITPSARSNLFYVYNHVDNAVQAPFCEQKPRPAFVAERENFKPLDIRPQQYL
>Q2TDY4 1.14.11.55~~~ectD~~~Ectoine dioxygenase~~~
MEDLYPSRQNNQPKILKRKDPVIYTDRSKDNQAPITKEQLDSYEKNGFLQIKNFFSEDEVIDMQKAIFELQDSIKDVASD
KVIREPESNDIRSIFHVHQDDNYFQDVANDKRILDIVRHLLGSDVYVHQSRINYKPGFKGKEFDWHSDFETWHVEDGMPR
MRAISVSIALSDNYSFNGPLMLIPGSHNYFVSCVGETPDNNYKESLKKQKLGVPDEESLRELTRIGGGISVPTGKAGSVT
LFESNTMHGSTSNITPYPRNNLFMVYNSVKNRLVEPFSGGEKRPEYIAVREKQPVYSAVN
>Q79VE0 ~~~ectP~~~Ectoine/glycine betaine/proline transporter EctP~~~COG1292
MSSNIAITTEPEGKNKKGLKSDPFIFSISVGFIVVFVIATIALGEKARTTFSAIAGWLLENLGWMYIGGVSLVFIFLMGI
FASRYGRVKLGDDDDDPEHTLIVWFCMLFAGGVGAVLMFWGVAEPINHAFNVPMANEESMSEAAIVQAFAYTFYHFGIHM
WVIMALPGLSLGYFIYKRKLPPRLSSVFSPILGKHIYSTPGKLIDVLAIVGTTFGIAVSVGLGVLQINAGMNKLWSTPQV
SWVQLLIILIITAVACISVASGLDKGIKLLSNINIAMAVALMFFILFTGPTLTLLRFLVESFGIYASWMPNLMFWTDSFQ
DNPGWQGKWTVFYWAWTICWSPYVGMFVARISRGRTVREFIGGVLALPAIFGVVWFSIFGRAGIEVELSNPGFLTQPTVV
EGDVPAALFNVLQEYPLTGIVSAFALVIIVIFFITSIDSAALVNDMFATGAENQTPTSYRVMWACTIGAVAGSLLIISPS
SGIATLQEVVIIVAFPFFLVQFVMMFSLLKGMSEDAAAVRRVQTRQWEKTDTPEKLEEHSSQPAPGYDDEGNPLPMPALE
HDEDGNIVIPGNVVIEGDLGVVGDVVDDPEEAQEMGSRFKIVEQTRPQSRDEYDI
>Q93AK1 ~~~ectT~~~Ectoine/hydroxyectoine transporter~~~
MNKSTLNNPVFYVSAFVVFLLVIIGATLPNRFGAVAEKLFHFTTIHFGWFYLLAVFVFVVFLITLSLSKFGKIKLGATLT
KPEYSFFTWIGMLFSAGFGAGLVFWGVAEPMSHFFKTPFPAVEAMSEEAARVAMGYAFFHWGVSQWSVFAIVGLVIAYLQ
FRKKRRGLISTSIQPIIGKNKFIADTVDSLAVIATVMGVATSLGLGILQMNGGLKSVFDVPTSIWVQMAIAGVMLITYLI
SSSTGLDRGIKWLSNINLGSLFIIIVFVFMAGPTVFILNTFVLGLGDYFSNFIGYSLRLTPYTGDTWVREWTIFYWAWST
AWSPFVGAFIARVSRGRSIRQYVLGVLVVSPAIACIWIAAFGGTAVYNDLMNGTSIAEAVNADIAVALFETYQHLPMTTI
LSILSIFLIFTFLVTSADSATYILGVMTSRGSLNPTLVTKIVWGLLITAIAVVLLLAGGLEALQTASLISALPFTVILLL
MMASFTRMLSKGEKKAEQDKE
>A9GK58 4.2.3.175~~~~~~10-epi-cubebol synthase~~~COG0664
MHRALLCPFPATTPHPQAAQLANDCLEWTRKCGLLPDESPRTLDKVRSYSALAAHCYPDAHFERLRAICDYYSWLFFFDD
VCENTSLNGAEPKVVSSLLFDVYGVLRGPTAAVGHAPFAQALADIWRRIGDGCPGFWRRRLIRHVENYIDGCVWEAQNRQ
LDRVPSRAVFEGMRMHTSTMYEFWDFIEYAGDLFLPDEVVEHPLVAEVRRAGNAIASFANDIYSLRKETSNRDVHNLVVV
LMHEERIELEAAYARAAGIHDAQVEHFLDLVKHLPTFSATIDRNLARYVEGIRIWIRANHDWSIVTPRYNEPDAR
>D2B747 4.2.3.170~~~~~~4-epi-cubebol synthase ((2E,6E)-farnesyl diphosphate cyclizing)~~~COG0664
MGTTTTHKFDRPLRLPPLPCPFPSEVNPYVEQVDKETLEWLIDSEMLDDAETVERYRQAKYGWLSARTYPYAEHHTLRLV
SDWCVWLFAFDDAFCESDRRAAEIARALPQLYAVLEDLDVGSEVDDVFAKSLLEIKGRIAAYGDDEQLDRWRNVTKDYLF
AQVWEAANREDEVVPSLEDYIFMRRRTGAMLTVFALIDVASGRSLSADEWRHPGMRAITESANDVVVWDNDLISYAKESN
SGNSRNNLVNVLAEHRHYSRQEAMEEIGEMRNQAIADMVAVRPSLEALGSDAVLAYVRGLEFWISGSVDYSLTSSRYTDA
WRTARQPSIR
>P0ADF6 4.2.1.12~~~edd~~~Phosphogluconate dehydratase~~~COG0129
MNPQLLRVTNRIIERSRETRSAYLARIEQAKTSTVHRSQLACGNLAHGFAACQPEDKASLKSMLRNNIAIITSYNDMLSA
HQPYEHYPEIIRKALHEANAVGQVAGGVPAMCDGVTQGQDGMELSLLSREVIAMSAAVGLSHNMFDGALFLGVCDKIVPG
LTMAALSFGHLPAVFVPSGPMASGLPNKEKVRIRQLYAEGKVDRMALLESEAASYHAPGTCTFYGTANTNQMVVEFMGMQ
LPGSSFVHPDSPLRDALTAAAARQVTRMTGNGNEWMPIGKMIDEKVVVNGIVALLATGGSTNHTMHLVAMARAAGIQINW
DDFSDLSDVVPLMARLYPNGPADINHFQAAGGVPVLVRELLKAGLLHEDVNTVAGFGLSRYTLEPWLNNGELDWREGAEK
SLDSNVIASFEQPFSHHGGTKVLSGNLGRAVMKTSAVPVENQVIEAPAVVFESQHDVMPAFEAGLLDRDCVVVVRHQGPK
ANGMPELHKLMPPLGVLLDRCFKIALVTDGRLSGASGKVPSAIHVTPEAYDGGLLAKVRDGDIIRVNGQTGELTLLVDEA
ELAAREPHIPDLSASRVGTGRELFSALREKLSGAEQGATCITF
>P21909 4.2.1.12~~~edd~~~Phosphogluconate dehydratase~~~COG0129
MTDLHSTVEKVTARVIERSRETRKAYLDLIQYEREKGVDRPNLSCSNLAHGFAAMNGDKPALRDFNRMNIGVVTSYNDML
SAHEPYYRYPEQMKVFAREVGATVQVAGGVPAMCDGVTQGQPGMEESLFSRDVIALATSVSLSHGMFEGAALLGICDKIV
PGLLMGALRFGHLPTILVPSGPMTTGIPNKEKIRIRQLYAQGKIGQKELLDMEAACYHAEGTCTFYGTANTNQMVMEVLG
LHMPGSAFVTPGTPLRQALTRAAVHRVAELGWKGDDYRPLGKIIDEKSIVNAIVGLLATGGSTNHTMHIPAIARAAGVIV
NWNDFHDLSEVVPLIARIYPNGPRDINEFQNAGGMAYVIKELLSANLLNRDVTTIAKGGIEEYAKAPALNDAGELVWKPA
GEPGDDTILRPVSNPFAKDGGLRLLEGNLGRAMYKASAVDPKFWTIEAPVRVFSDQDDVQKAFKAGELNKDVIVVVRFQG
PRANGMPELHKLTPALGVLQDNGYKVALVTDGRMSGATGKVPVALHVSPEALGGGAIGKLRDGDIVRISVEEGKLEALVP
ADEWNARPHAEKPAFRPGTGRELFDIFRQNAAKAEDGAVAIYAGAGI
>P24121 2.4.2.-~~~~~~Epidermal cell differentiation inhibitor~~~
MKNKLLFKIFLSLSLALSVYSINDKIIEVSNTSLAADVKNFTDLDEATKWGNKLIKQAKYSSDDKIALYEYTKDSSKING
PLRLAGGDINKLDSTTQDKVRRLDSSISKSTTPESVYVYRLLNLDYLTSIVGFTNEDLYKLQQTNNGQYDENLVRKLNNV
MNSRIYREDGYSSTQLVSGAAVGGRPIELRLELPKGTKAAYLNSKDLTAYYGQQEVLLPRGTEYAVGSVELSNDKKKIII
TAIVFKK
>E3FKM2 4.2.3.152~~~gacC~~~2-epi-5-epi-valiolone synthase~~~COG0337
MAHIKVVGGGTSVRGSTLQVSAQADFGYGVHNEEGLFDPGQPLLLELCRDARMLVCISPSVDRLHGERIRQYFRTHFAPE
QYRFLVLSTSESNKSIENVLRICEAAKAFWLDRHGLLVAIGGGIVLDMVGFAASIYRRGIRYIKVPTTLVGQVDVAVGVK
TGVNLSGSKNLIGTYYPAYATLNDRGFLSTLPAREMRCGLAEIIKMGLVCDPEIFTALEAHFLPPVSRHLEHPLPQEAVL
GAMVRMIEELQPNLFELNLERLVDFGHTFSMSLETVSGYAHAHGEAVAMDMALSACLSNELGILGEAEFERILALLEVVG
LPVFDEALCSVDAMWQALMEVNVHRGHRINLVIPARIGEGMFLHRLEDVPRDALARAIRRMSVLSQRHAVAALGSAGEPL
ERASGA
>B0B0T7 4.2.3.152~~~gacC~~~2-epi-5-epi-valiolone synthase~~~COG0337
MSGQALAQLEGVREDAGGFDLLAPDGTQYRVDVTDGVFDPHNPLLAGYVAGRRVVAFVGPTVDRIYGDRLRAYLDARLEP
GSWSVHTIDSGERNKTLASVERVCAIAKASGLDRHGVMLAVGGGIVADIVGFAASMYARGIRYIKVNTTLVGQVDVGVGV
KTGVNALNTKNMFGAYHPAHASLNDPALLATLPAREIRCGLAEIVKMAVILDAGLFEALEEHPDAFLRSSDGALETYVVR
TSMRLMMEELCPNLREHDLARLVDFGHTFSPVIETAGGHRLEHGEAVAVDMALSAHLARLLGLADAESCRRVVTLLRRIG
LPVFDPATCTPELMTQALHASWQRRGRELHLVVPTGIGKATFVERLEDVPAEVLRAALDALAREGRTS
>P39597 4.98.1.1~~~efeB~~~Deferrochelatase~~~COG2837
MSDEQKKPEQIHRRDILKWGAMAGAAVAIGASGLGGLAPLVQTAAKPSKKDEKEEEQIVPFYGKHQAGITTAHQTYVYFA
ALDVTAKDKSDIITLFRNWTSLTQMLTSGKKMSAEQRNQYLPPQDTGESADLSPSNLTVTFGFGPGFFEKDGKDRFGLKS
KKPKHLAALPAMPNDNLDEKQGGGDICIQVCADDEQVAFHALRNLLNQAVGTCEVRFVNKGFLSGGKNGETPRNLFGFKD
GTGNQSTKDDTLMNSIVWIQSGEPDWMTGGTYMAFRKIKMFLEVWDRSSLKDQEDTFGRRKSSGAPFGQKKETDPVKLNQ
IPSNSHVSLAKSTGKQILRRAFSYTEGLDPKTGYMDAGLLFISFQKNPDNQFIPMLKALSAKDALNEYTQTIGSALYACP
GGCKKGEYIAQRLLES
>Q8XAS4 4.98.1.1~~~efeB~~~Deferrochelatase~~~COG2837
MQYEDKNGVNEPSRRRLLKGIGALALAGSCPVAHAQKTQSAPGTLSPVARNEKQPFYGEHQAGILTPQQAAMMLVAFDVL
ASDKADLERLFRLLTQRFAFLTQGGAAPETPNPRLPPLDSGILGGYIAPDNLTITLSVGHSLFDERFGLAPQMPKKLQKM
TRFPNDSLDAALCHGDVLLQICANTQDTVIHALRDIIKHTPDLLSVRWKREGFISDHAARSKGKETPINLLGFKDGTANP
DSQNDKLMQKVVWVTADQQEPAWTIGGSYQAVRLIQFRVEFWDRTPLKEQQTIFGRDKQTGAPLGMQHEHDVPDYASDPE
GKGIALDSHIRLANPRTAESESSLMLRRGYSYSLGVTNSGQLDMGLLFVCYQHDLEKGFLTVQKRLNGEALEEYVKPIGG
GYFFALPGVKDANDYLGSALLRV
>Q8CW71 4.98.1.1~~~efeB~~~Deferrochelatase~~~COG2837
MQYEDENGVNEPSRRRLLKGIGALALAGSCPVAHAQKTQSAPGTLSPDARNEKQPFYGEHQAGILTPQQAAMMLVAFDVL
ASDKADLERLFRLLTQRFAFLTQGGAAPETPNPRLPPLDSGILGGYIAPDNLTITLSVGHSLFDERFGLAPQMPKKLQKM
TRFPNDSLDAALCHGDVLLQICANTQDTVIHALRDIIKHTPDLLSVRWKREGFISDHAARSKGKETPINLLGFKDGTANP
DSQNDKLMQKVVWVTADQQEPAWTIGGSYQAVRLIQFRVEFWDRTPLKEQQTIFGRDKQTGAPLGMLHEHDVPDYASDPE
GKVIALDSHIRLANPRTAESESSLMLRRGYSYSLGVTNSGQLDMGLLFVCYQHDLEKGFLTVQKRLNGEALEEYVKPIGG
GYFFALPGVKDANDYLGSALLRV
>P31545 4.98.1.1~~~efeB~~~Deferrochelatase~~~COG2837
MQYKDENGVNEPSRRRLLKVIGALALAGSCPVAHAQKTQSAPGTLSPDARNEKQPFYGEHQAGILTPQQAAMMLVAFDVL
ASDKADLERLFRLLTQRFAFLTQGGAAPETPNPRLPPLDSGILGGYIAPDNLTITLSVGHSLFDERFGLAPQMPKKLQKM
TRFPNDSLDAALCHGDVLLQICANTQDTVIHALRDIIKHTPDLLSVRWKREGFISDHAARSKGKETPINLLGFKDGTANP
DSQNDKLMQKVVWVTADQQEPAWTIGGSYQAVRLIQFRVEFWDRTPLKEQQTIFGRDKQTGAPLGMQHEHDVPDYASDPE
GKVIALDSHIRLANPRTAESESSLMLRRGYSYSLGVTNSGQLDMGLLFVCYQHDLEKGFLTVQKRLNGEALEEYVKPIGG
GYFFALPGVKDANDYFGSALLRV
>Q9K1P6 ~~~~~~Efem/EfeO family lipoprotein NMB0035~~~
MRKFNLTALSVMLALGLTACQPPEAEKAAPAASGEAQTANEGGSVSIAVNDNACEPMELTVPSGQVVFNIKNNSGRKLEW
EILKGVMVVDERENIAPGLSDKMTVTLLPGEYEMTCGLLTNPRGKLVVTDSGFKDTANEADLEKLSQPLADYKAYVQGEV
KELVAKTKTFTEAVKAGDIEKAKSLFADTRVHYERIEPIAELFSELDPVIDAREDDFKDGAKDAGFTGFHRIEYALWVEK
DVSGVKEIAAKLMTDVEALQKEIDALAFPPGKVVGGASELIEEVAGSKISGEEDRYSHTDLSDFQANVDGSKKIVDLFRP
LIEAKNKALLEKTDTNFKQVNEILAKYRTKDGFETYDKLGEADRKALQASINALAEDLAQLRGILGLK
>P39596 ~~~efeM~~~Iron uptake system component EfeM~~~COG2822
MNFTKIAVSAGCILALCAGCGANDTSSTKEKASSEKSGVTKEITASVNKMETIISKLNDSVEKGDQKEIEKKGKELNSYW
LSFENDIRSDYPFEYTEIEKHLQPIYTEAQKDKPDAGKIKTESESLKASLEDLTEAKKSGKKASDQLAKAADEYKGYVKE
QSDQLVKATEAFTGAVKSGDIEKSKTLYAKARVYYERIEPIAESLGDLDPKIDARENDVEEGDKWTGFHKLEKAIWKDQD
ISGEKATADQLLKDVKELDGSIQSLKLTPEQIVAGAMELLNEAGISKITGEEERYSRIDLVDLMANVEGSEAVYQTVKSA
LVKDHSDLTEKLDTEFSEFEVLMAKYKTNDQSYTSYDKLSEKQIRELSTKLTTLSETMSKIANVL
>Q4ZR20 ~~~efeM~~~Iron uptake system component EfeM~~~COG2822
MTYPLLTRKTLMKKTPLALLLTLGLLQTPLAAFAATAPLDLVGPVSDYKIYVTENIEELVSHTQKFTDAVKKGDIATAKK
LYAPTRVYYESVEPIAELFSDLDASIDSRVDDHEQGVAAEDFTGFHRLEYALFSQNTTKDQGPIADKLLSDVKDLEKRVA
DLTFPPEKVVGGAAALLEEVAATKISGEEDRYSHTDLYDFQGNIDGAKKIVDLFRPQIEQQDKAFSSKVDKNFATVDKIL
AKYKTKDGGFETYDKVKENDRKALVGPVNTLAEDLSTLRGKLGLN
>Q8XAS6 ~~~efeO~~~Iron uptake system component EfeO~~~COG2822
MTINFRRNALQLSVAALFSSAFMANAADVPQVKVTVTDKQCEPMTITVNAGKTQFIIQNHSQKALEWEILKGVMVVEERE
NIAPGFSQKMTANLQPGEYDMTCGLLTNPKGKLIVKGEATADAAQSDALLSLGGAITAYKAYVMAETTQLVTDTKAFTDA
IKAGDIEKAKALYAPTRQHYERIEPIAELFSDLDGSIDAREDDYEQKAADPKFTGFHRLEKALFGDNTTKGMDQYADQLY
TDVVDLQKRISELAFPPSKVVGGAAGLIEEVAASKISGEEDRYSHTDLWDFQANVEGSQKIVDLLRPQLQKANPELLAKV
DANFKKVDTILAKYRTKDGFETYDKLTDADRNALKGPITALAEDLAQLRGVLGLD
>Q8FJ35 ~~~efeO~~~Iron uptake system component EfeO~~~COG2822
MTINFRRNALQLSVAALFSSAFMANAADIPQVKVTVTDKQCEPMIITVNAGKTQFIIQNHSQKALEWEILKGVMVVEERE
NIAPGFSQKMTANLQPGEYDMTCGLLTNPKGKLIVKGEATADAAQSDALLSLGGAITAYKAYVMAETTQLVTDTKAFTDA
IKAGDIEKAKALYAPTRQHYERIEPIAELFSDLDGSIDAREDDYEQKAADPKFTGFHRLEKALFGDNTTKGMDKYADQLY
TDVVDLQKRISELAFPPSKVVGGAAGLIEEVAASKISGEEDRYSHTDLWDFQANVEGSQKIVDLLRPQLQKANPELLAKV
DANFKKVDTILAKYRTKDGFENYDKLTDADRNALKGPITALAEDLAQLRGVLGLD
>P0AB24 ~~~efeO~~~Iron uptake system component EfeO~~~COG2822
MTINFRRNALQLSVAALFSSAFMANAADVPQVKVTVTDKQCEPMTITVNAGKTQFIIQNHSQKALEWEILKGVMVVEERE
NIAPGFSQKMTANLQPGEYDMTCGLLTNPKGKLIVKGEATADAAQSDALLSLGGAITAYKAYVMAETTQLVTDTKAFTDA
IKAGDIEKAKALYAPTRQHYERIEPIAELFSDLDGSIDAREDDYEQKAADPKFTGFHRLEKALFGDNTTKGMDQYAEQLY
TDVVDLQKRISELAFPPSKVVGGAAGLIEEVAASKISGEEDRYSHTDLWDFQANVEGSQKIVDLLRPQLQKANPELLAKV
DANFKKVDTILAKYRTKDGFETYDKLTDADRNALKGPITALAEDLAQLRGVLGLD
>Q0WFT9 ~~~efeO~~~Iron uptake system component EfeO~~~COG2822
MSIWFFRRTALHAALLSLPVFAISAQAADIQQVKITVNDKQCEPMALTVPAGKTQFIVHNVSQKGLEWEILKGVMVVEER
ENIAPGFTQKMTANLEPGEYDMTCGLLSNPKGKLTVTVAAGEQAPVKPDAMALVGPIAEYKVYVTQEVAQLVSQTKAFTD
AVKAGDLALARKLYAPTRQHYERIEPIAELFSDLDGSIDAREDDFEQKSADPKFTGFHRLEKILFGDNTTKGADKFADLL
YQDTLELQKRIAGLTFAPNKVVGGAAGLIEEVAASKISGEEDRYSRTDLWDFQANVDGAQKIVNLLRPLLEKADKPLLQK
IDANFNTVDSVLAKYRTKEGYESYEKLTDADRNAMKGPITALAEDLAQLRGVLGLD
>P39595 ~~~efeU~~~Iron permease EfeU~~~COG0672
MARGLALILFSLLMVFGSAAHAEDDPIAALIQLNKQMIKSVKDGDMDSAQQTFDTFKAKWKKEEPSIKKENLSSHSEMDA
NIAMISLSFINQDARKLKTQLEELASHLETYQQAVVLKKTSSGQSRASLTAYIQSLKDTKQFIEKKQLDEASSAIDNLVT
SWLAVEGDVVSQSKEAYTTSEENLALMKAEIGSHPEKVSKQIDEMIQLLEPIASSSYSWWDAALIPVREGMEALLVIGAL
LTMTKKARVTRSSTWIWGGASAGMAVSLAAGIGVTVLFSSSVFGENNFLLGGVTGVLSAVMLLYVGVWLHRNASMDKWRE
KINIQKSQALKKRSLLSFALIAFLAVVREGLETVIFFIGLVGKLPLTELIGGTAAGLIVLVIVGVLMIKLGMRIPLKPFF
LLSMAVVLYMCVKFLGTGVHSLQLAGILPSDAESWLPSVSVLGIYPSVYSTIPQMLILLFLLIALVSEAAKHFTNGKELT
K
>Q8XAS8 ~~~efeU~~~Ferrous iron permease EfeU~~~COG0672
MFVPFLIMLREGLEAALIVSLIASYLKRTQRGRWIGVMWIGVLLAAALCLGLGIFINETTGEFPQKEQELFEGIVAVIAV
VILTWMVFWMRKVSRNVKVQLEQAVDSALQRGNHHGWALVMMVFFAVAREGLESVFFLLAAFQQDVGIWPPLGAMLGLAT
AVVLGFLLYWGGIRLNLGAFFKWTSLFILFVAAGLAAGAIRAFHEAGLWNHFQEIAFDMSAVLSTHSLFGTLMEGIFGYQ
EAPSVSEVAVWFIYLIPALVAFALPPRAGATASRSA
>Q8FJ36 ~~~efeU~~~Ferrous iron permease EfeU~~~COG0672
MFVPFLIILREGLEAALIVSLIASYLTRTQRGRWIGVMWIGVLLAAALCLGLGIFINETTGEFPQKEQELFEGIVAVIAV
VILTWMVFWMRKVSRNVKVQLEQAVDSALQRGNHHGWALVMMVFFAVAREGLESVFFLLAAFQQDVGIWPPLGAMLGLAT
AVVLGFLLYWGGIRLNLGAFFKWTSLFILFVAAGLAAGAIRAFHEAGLWNHFQEIAFDMSAVLSTHSLFGTLMEGIFGYQ
EAPSVSEVAVWFIYLIPALVAFVLPPRAGATASRSM
>Q7BS32 1.13.12.19~~~efe~~~2-oxoglutarate-dependent ethylene/succinate-forming enzyme~~~
MTNLQTFELPTEVTGCAADISLGRALIQAWQKDGIFQIKTDSEQDRKTQEAMAASKQFCKEPLTFKSSCVSDLTYSGYVA
SGEEVTAGKPDFPEIFTVCKDLSVGDQRVKAGWPCHGPVPWPNNTYQKSMKTFMEELGLAGERLLKLTALGFELPINTFT
DLTRDGWHHMRVLRFPPQTSTLSRGIGAHTDYGLLVIAAQDDVGGLYIRPPVEGEKRNRNWLPGESSAGMFEHDEPWTFV
TPTPGVWTVFPGDILQFMTGGQLLSTPHKVKLNTRERFACAYFHEPNFEASAYPLFEPSANERIHYGEHFTNMFMRCYPD
RITTQSINKENRLAHLEDLKKYSDTRATGS
>P32021 1.13.12.19~~~efe~~~2-oxoglutarate-dependent ethylene/succinate-forming enzyme~~~
MTNLQTFELPTEVTGCAADISLGRALIQAWQKDGIFQIKTDSEQDRKTQEAMAASKQFCKEPLTFKSSCVSDLTYSGYVA
SGEEVTAGKPDFPEIFTVCKDLSVGDQRVKAGWPCHGPVPWPNNTYQKSMKTFMEELGLAGERLLKLTALGFELPINTFT
DLTRDGWHHMRVLRFPPQTSTLSRGIGAHTDYGLLVIAAQDDVGGLYIRPPVEGEKRNRNWLPGESSAGMFEHDEPWTFV
TPTPGVWTVFPGDILQFMTGGQLLSTPHKVKLNTRERFACAYFHEPNFEASAYPLFEPSANERIHYGEHFTNMFMRCYPD
RITTQRINKENRLAHLEDLKKYSDTRATGS
>Q9HWD2 ~~~fusA~~~Elongation factor G 1~~~
MARTTPINRYRNIGICAHVDAGKTTTTERVLFYTGVNHKLGEVHDGAATTDWMVQEQERGITITSAAVTTFWKGSRGQYD
NYRVNVIDTPGHVDFTIEVERSLRVLDGAVVVFCGTSGVEPQSETVWRQANKYGVPRIVYVNKMDRQGANFLRVVEQIKK
RLGHTPVPVQLAIGAEENFVGQVDLIKMKAIYWNDDDKGMTYREEEIPAELKDLAEEWRSSMVEAAAEANEELMNKYLEE
GELSEAEIKEGLRLRTLACEIVPAVCGSSFKNKGVPLVLDAVIDYLPAPTEIPAIKGVSPDDETVEDERHADDNEPFSSL
AFKIATDPFVGTLTFARVYSGVLSSGDSVLNSVKGKKERVGRMVQMHANQREEIKEVRAGDIAALIGMKDVTTGDTLCSI
EKPIILERMDFPEPVISVAVEPKTKADQEKMGIALGKLAQEDPSFRVKTDEESGQTIISGMGELHLDIIVDRMKREFGVE
ANIGKPQVAYRETITKDNVEIEGKFVRQSGGRGQFGHCWIRFSAADVDEKGNITEGLVFENEVVGGVVPKEYIPAIQKGI
EEQMKNGVVAGYPLIGLKATVFDGSYHDVDSNEMAFKIAASMATKQLAQKGGGKVLEPIMKVEVVTPEDYMGDVMGDLNR
RRGLIQGMEDTVSGKVIRAEVPLGEMFGYATDVRSMSQGRASYSMEFSKYAEAPSNIVEALVKKQG
>P28371 ~~~fusA~~~Elongation factor G 1~~~COG0480
MEKDLTRYRNIGIFAHVDAGKTTTTERILKLTGRIHKLGEVHEGESTMDFMEQEAERGITIQSAATSCFWKDHQLNVIDT
PGHVDFTIEVYRSLKVLDGGIGVFCGSGGVEPQSETNWRYANDSKVARLIYINKLDRTGADFYRVVKQVETVLGAKPLVM
TLPIGTENDFVGVVDILTEKAYIWDDSGDPEKYEITDIPADMVDDVATYREMLIETAVEQDDDLMEKYLEGEEISIDDIK
RCIRTGTRKLDFFPTYGGSSFKNKGVQLVLDAVVDYLPNPKEVPPQPEVDLEGEETGNYAIVDPEAPLRALAFKIMDDRF
GALTFTRIYSGTLSKGDTILNTATGKTERIGRLVEMHADSREEIESAQAGDIVAIVGMKNVQTGHTLCDPKNPATLEPMV
FPDPVISIAIKPKKKGMDEKLGMALSKMVQEDPSFQVETDEESGETIIKGMGELHLDIKMDILKRTHGVEVEMGKPQVAY
RESITQQVSDTYVHKKQSGGSGQYAKIDYIVEPGEPGSGFQFESKVTGGNVPREYWPAVQKGFDQSVVKGVLAGYPVVDL
KVTLTDGGFHPVDSSAIAFEIAAKAGYRQSLPKAKPQILEPIMAVDVFTPEDHMGDVIGDLNRRRGMIKSQETGPMGVRV
KADVPLSEMFGYIGDLRTMTSGRGQFSMVFDHYAPCPTNVAEEVIKEAKERQAAA
>Q88FI4 ~~~fusB~~~Elongation factor G 2~~~COG0480
MARTTPIELYRNIGIVAHVDAGKTTTTERILFYTGVNHKMGEVHDGAATMDWMAQEQERGITITSAATTAFWQGSTKQFA
HKYRFNIIDTPGHVDFTIEVERSLRVLDGAVVVFSGADGVEPQSETVWRQANKYHVPRLAYINKMDRQGADFLRVVKQID
QRLGHHPVPIQLAIGSEENFMGQIDLVKMKAIYWNDADQGTSYREEEIPAELKALADEWRAHMIEAAAEANDELTMKFLD
GEELSIEEIKAGLRQRTIANEIVPTILGSSFKNKGVPLMLDAVIDYLPAPSEIPAIRGTDPDDEEKHLERHADDKEPFSA
LAFKIATDPFVGTLTFARVYSGVLSSGNAVLNSVKGKKERIGRMVQMHANQRAEIKDVCAGDIAALIGMKDVTTGDTLCD
MDKPIILERMDFPDPVISVAVEPKTKADQEKMGIALGKLAQEDPSFRVRTDEETGQTIISGMGELHLDIIVDRMRREFNV
EANIGKPQVAYREKIRNTCEIEGRFVRQSGGRGQYGHCWIRFAPGDEGKEGLEFINEIVGGVVPREYIPAIQKGIEEQMK
NGVLAGYPLINLKAAVFDGSYHDVDSNEMAYKIAASMATKQLSQKGGAVLLEPVMKVEVVTPEEYQGDILGDLSRRRGMI
QDGDETPAGKVIRAEVPLGEMFGYATSMRSMTQGRASFSMEFTRYAEAPASIADGIVKKSRGE
>P9WNM9 ~~~~~~Elongation factor G-like protein~~~COG0480
MADRVNASQGAAAAPTANGPGGVRNVVLVGPSGGGKTTLIEALLVAAKVLSRPGSVTEGTTVCDFDEAEIRQQRSVGLAV
ASLAYDGIKVNLVDTPGYADFVGELRAGLRAADCALFVIAANEGVDEPTKSLWQECSQVGMPRAVVITKLDHARANYREA
LTAAQDAFGDKVLPLYLPSGDGLIGLLSQALYEYADGKRTTRTPAESDTERIEEARGALIEGIIEESEDESLMERYLGGE
TIDESVLIQDLEKAVARGSFFPVIPVCSSTGVGTLELLEVATRGFPSPMEHPLPEVFTPQGVPHAELACDNDAPLLAEVV
KTTSDPYVGRVSLVRVFSGTIRPDTTVHVSGHFSSFFGGGTSNTHPDHDEDERIGVLSFPLGKQQRPAAAVVAGDICAIG
KLSRAETGDTLSDKAEPLVLKPWTMPEPLLPIAIAAHAKTDEDKLSVGLGRLAAEDPTLRIEQNQETHQVVLWCMGEAHA
GVVLDTLANRYGVSVDTIELRVPLRETFAGNAKGHGRHIKQSGGHGQYGVCDIEVEPLPEGSGFEFLDKVVGGAVPRQFI
PNVEKGVRAQMDKGVHAGYPVVDIRVTLLDGKAHSVDSSDFAFQMAGALALREAAAATKVILLEPIDEISVLVPDDFVGA
VLGDLSSRRGRVLGTETAGHDRTVIKAEVPQVELTRYAIDLRSLAHGAASFTRSFARYEPMPESAAARVKAGAG
>P80868 ~~~fusA~~~Elongation factor G~~~COG0480
MAREFSLEKTRNIGIMAHIDAGKTTTTERILFYTGRIHKIGETHEGASQMDWMEQEQERGITITSAATTAQWKGYRVNII
DTPGHVDFTVEVERSLRVLDGAVAVLDAQSGVEPQTETVWRQATTYGVPRIVFVNKMDKIGADFLYSVGTLRDRLQANAH
AIQLPIGAEDNFEGIIDLVENVAYFYEDDLGTRSDAKEIPEEYKEQAEELRNSLIEAVCELDEELMDKYLEGEEITIDEL
KAGIRKGTLNVEFYPVLVGSAFKNKGVQLVLDAVLDYLPAPTDVAAIKGTRPDTNEEIERHSSDEEPFSALAFKVMTDPY
VGKLTFFRVYSGTLDSGSYVKNSTKGKRERVGRILQMHANSREEISTVYAGDIAAAVGLKDTTTGDTLCDEKDLVILESM
EFPEPVIDVAIEPKSKADQDKMGIALAKLAEEDPTFRTQTNPETGQTIISGMGELHLDIIVDRMKREFKVEANVGAPQVA
YRETFRTGAKVEGKFVRQSGGRGQFGHVWIEFEPNEEGAGFEFENAIVGGVVPREYIPAVQAGLEDALENGVLAGFPLID
IKAKLFDGSYHDVDSNEMAFKVAASMALKNAVSKCNPVLLEPIMKVEVVIPEEYMGDIMGDITSRRGRVEGMEARGNAQV
VRAMVPLAEMFGYATALRSNTQGRGTFTMHMDHYEEVPKSVAEEIIKKNKGE
>P0A6M8 3.6.5.-~~~fusA~~~Elongation factor G~~~COG0480
MARTTPIARYRNIGISAHIDAGKTTTTERILFYTGVNHKIGEVHDGAATMDWMEQEQERGITITSAATTAFWSGMAKQYE
PHRINIIDTPGHVDFTIEVERSMRVLDGAVMVYCAVGGVQPQSETVWRQANKYKVPRIAFVNKMDRMGANFLKVVNQIKT
RLGANPVPLQLAIGAEEHFTGVVDLVKMKAINWNDADQGVTFEYEDIPADMVELANEWHQNLIESAAEASEELMEKYLGG
EELTEAEIKGALRQRVLNNEIILVTCGSAFKNKGVQAMLDAVIDYLPSPVDVPAINGILDDGKDTPAERHASDDEPFSAL
AFKIATDPFVGNLTFFRVYSGVVNSGDTVLNSVKAARERFGRIVQMHANKREEIKEVRAGDIAAAIGLKDVTTGDTLCDP
DAPIILERMEFPEPVISIAVEPKTKADQEKMGLALGRLAKEDPSFRVWTDEESNQTIIAGMGELHLDIIVDRMKREFNVE
ANVGKPQVAYRETIRQKVTDVEGKHAKQSGGRGQYGHVVIDMYPLEPGSNPKGYEFINDIKGGVIPGEYIPAVDKGIQEQ
LKAGPLAGYPVVDMGIRLHFGSYHDVDSSELAFKLAASIAFKEGFKKAKPVLLEPIMKVEVETPEENTGDVIGDLSRRRG
MLKGQESEVTGVKIHAEVPLSEMFGYATQLRSLTKGRASYTMEFLKYDEAPSNVAQAVIEARGK
>Q839G9 ~~~fusA~~~Elongation factor G~~~COG0480
MAREFSLEKTRNIGIMAHVDAGKTTTTERILYYTGKIHKIGETHEGASQMDWMEQEQERGITITSAATTAQWKGYRVNII
DTPGHVDFTIEVQRSLRVLDGAVTVLDSQSGVEPQTETVWRQATEYKVPRIVFCNKMDKIGADFFYSVESLHDRLQANAH
PIQIPIGAEEDFTGIIDLIKMKAEIYTNDLGTDIQETDIPEDYLEKAQEWREKLVEAVAETDEDLMMKYLEGEEITEEEL
VAGIRQATINVEFFPVLAGSAFKNKGVQLMLDAVLDYLPSPLDIDAIKGIDTKTDEETTRPADDEAPFASLAFKVMTDPF
VGRLTFFRVYSGVLESGSYVLNASKGKKERIGRILQMHANTRQEIDKVYSGDIAAAVGLKDTTTGDTLCALDAPVILESI
EFPDPVIQVAVEPKSKADQDKMGVALQKLAEEDPSFRVETNVETGETVISGMGELHLDVLVDRMKREFKVEANVGAPQVS
YRETFRAATKAEGKFVRQSGGKGQYGHVWVEFTPNEEGKGFEFENAIVGGVVPREYIPAVEKGLEDSMNNGVLAGYPLVD
IKAKLYDGSYHDVDSNETAFRVAASMALKAAAKNANPVILEPMMKVTITVPEDYLGDIMGHVTSRRGRVEGMEAHGNSQI
VNAMVPLAEMFGYATTLRSATQGRGTFMMVFDHYEDVPKSVQEEIIKKNGGNA
>P43925 ~~~fusA~~~Elongation factor G~~~COG0480
MARTTPIERYRNIGISAHIDAGKTTTTERILFYTGVSHKIGEVHDGAATMDWMEQEQERGITITSAATTAFWSGMSQQFP
QHRINVIDTPGHVDFTVEVERSMRVLDGAVMVYCAVGGVQPQSETVWRQANKYEVPRIAFVNKMDRTGANFLRVVEQLKT
RLGANAIPLQLPVGAEENFTGVVDLIKMKAINWNEADQGMTFTYEEVPANMQADCEEWRQNLVEAAAEASEELMEKYLGG
EDLTEEEIKSALRQRVLANEIILVTCGSAFKNKGVQAMLDAVVEYLPAPTDIPAIKGINPDETEGERHASDEEPFSSLAF
KIATDPFVGNLTFFRVYSGVINSGDTVLNSVRQKRERFGRIVQMHANKREEIKEVRAGDIAAAIGLKDVTTGDTLCAIDA
PIILERMEFPEPVISVAVEPKTKADQEKMGLALGRLAQEDPSFRVHTDEESGETIISGMGELHLDIIVDRMKREFKVEAN
IGKPQVSYRETIRTRVNDVEGKHAKQSGGRGQYGHVVIDLYPLDPEGPGYEFVNEIKGGVIPGEYIPAVDKGIQEQLKSG
PLAGYPVVDLGVRLHFGSYHDVDSSELAFKLAASLAFKAAFSKANPVLLEPIMKVEVETPPEYVGDVIGDLSRRRAMVNG
QEANEFVVKIYAEVPLSEMFGYATDLRSQTQGRASYSMEPLKYAEAPTSVAAAVIEARKK
>Q5ZYP6 ~~~fusA~~~Elongation factor G~~~COG0480
MATPLKLYRNIGIAAHVDAGKTTTTERVLYYTGMSHKIGEVHDGAATMDWMVQEQERGITITSAATTCYWSGMDKQFESH
RINIIDTPGHVDFMIEVERSLRVLDGAVVVFDSVAGVEPQSETVWRQANKYGVPRIVFVNKMDRMGANFLRVVSQIKQRL
GSTPVVLQLPIGAEEEFKGVIDLIKMKAIHWDEENKGMTFKYVDIPADLKSTCEEYRAHIIEAAAEYSEELMEKYLEGEE
FTEAEIKKALRHLTITNKVVPVFCGSAFKNKGVQAVLDGVIEYLPSPTDIPDIQGVDEHGDVIHRKTSYDEPFSALAFKI
ATDPFVGTLTYFRAYSGILKSGDTVYNSVKGKKERIGRLLQMHANSREEIKEVRAGDIAAAVGLKTVTTGDTLCDQDKVV
ILERMDFPDPVIAVAVEPKTKADQEKMGIALGKLAQEDPSFRVHTDEESGQTIIQGMGELHLEIIVDRMKREFNVEANVG
KPQVAYRETLKQAVEQEGKFVRQSGGRGQYGHVWLKIEPQEPGKGYEFINAIVGGVIPKEYIPAVDKGIQEQMQNGVIAG
YPVVDVKVTLFDGSFHEVDSSEMAFKIAGSQCFKQGALKAKPVLLEPIMSVEVVTPEDYMGDVMGDLNRRRGLVQGMEDS
PAGKIVRAEVPLAEMFGYSTDLRSATQGRATYTMEFCKYAEAPTNIAEAIIKKQ
>P75544 ~~~fusA~~~Elongation factor G~~~
MARTVDLINFRNFGIMAHIDAGKTTTSERILFHSGRIHKIGETHDGESVMDWMEQEKERGITITSAATSVSWKNCSLNLI
DTPGHVDFTVEVERSLRVLDGAIAVLDAQMGVEPQTETVWRQASRYEVPRIIFVNKMDKTGANFERSVQSIQQRLGVKAV
PIQLPIGAENDFNGIIDLIEEKVYFFDGGKEEKAEEKPIPDQFKDQVKQMRAHLVEEVANFDDQLMADYLEGKEISIADI
KRCIRKGVIGCQFFPVLCGSAFKNKGIKLLLDAVVDFLPSPVDVPQAKAYGEDGNEVLISASDDAPFVGLAFKVATDPFV
GRLTFVRVYSGVLKSGSYVKNVRKNKKERVSRLVKMHAQNRNEIEEIRAGDICAIIGLKDTTTGETLVDDKIDVQLEAMQ
FAQPVISLAVEPKTKADQEKMSIALSKLAEEDPTFKTFTDPETGQTIIAGMGELHLDILVDRMRREFKVEVNVGAPQVSF
RETFNKESEVEGKYIKQSGGRGQYGHVKIRFEPNKDKGFEFVDKIVGGRIPREYIKPVQAGLENAMASGPLAGYPMIDIK
ATLFDGSFHEVDSSEMAFKIAASLALKEAGKVCSPVLLEPIMAIEVTVPEQYFGDTMGDISSRRGLIEGTEQRDNVQVIK
AKVPLKEMFGYATDLRSFSQGRGNYVMQFSHYAETPKSVVNEIIATKK
>P9WNM7 ~~~fusA~~~Elongation factor G~~~COG0480
MAQKDVLTDLSRVRNFGIMAHIDAGKTTTTERILYYTGINYKIGEVHDGAATMDWMEQEQERGITITSAATTTFWKDNQL
NIIDTPGHVDFTVEVERNLRVLDGAVAVFDGKEGVEPQSEQVWRQADKYDVPRICFVNKMDKIGADFYFSVRTMGERLGA
NAVPIQLPVGAEADFEGVVDLVEMNAKVWRGETKLGETYDTVEIPADLAEQAEEYRTKLLEVVAESDEHLLEKYLGGEEL
TVDEIKGAIRKLTIASEIYPVLCGSAFKNKGVQPMLDAVVDYLPSPLDVPPAIGHAPAKEDEEVVRKATTDEPFAALAFK
IATHPFFGKLTYIRVYSGTVESGSQVINATKGKKERLGKLFQMHSNKENPVDRASAGHIYAVIGLKDTTTGDTLSDPNQQ
IVLESMTFPDPVIEVAIEPKTKSDQEKLSLSIQKLAEEDPTFKVHLDSETGQTVIGGMGELHLDILVDRMRREFKVEANV
GKPQVAYKETIKRLVQNVEYTHKKQTGGSGQFAKVIINLEPFTGEEGATYEFESKVTGGRIPREYIPSVDAGAQDAMQYG
VLAGYPLVNLKVTLLDGAYHEVDSSEMAFKIAGSQVLKKAAALAQPVILEPIMAVEVTTPEDYMGDVIGDLNSRRGQIQA
MEERAGARVVRAHVPLSEMFGYVGDLRSKTQGRANYSMVFDSYSEVPANVSKEIIAKATGE
>Q9K1I8 ~~~fusA~~~Elongation factor G~~~
MARKTPISLYRNIGISAHIDAGKTTTTERILFYTGLTHKLGEVHDGAATTDYMEQEQERGITITSAAVTSYWSGMAKQFP
EHRFNIIDTPGHVDFTVEVERSMRVLDGAVMVYCAVGGVQPQSETVWRQANKYQVPRLAFVNKMDRQGANFFRVVEQMKT
RLRANPVPIVIPVGAEDNFSGVVDLLKMKSIIWNEVDKGTTFTYGDIPAELVETAEEWRQNMIEAAAEASEELMDKYLGG
DELTEEEIVGALRQRTLAGEIQPMLCGSAFKNKGVQRMLDAVVELLPAPTDIPPVQGVNPNTEEADSRQASDEEKFSALA
FKMLNDKYVGQLTFIRVYSGVVKSGDTVLNSVKGTRERIGRLVQMTAADRTEIEEVRAGDIAAAIGLKDVTTGETLCAES
APIILERMEFPEPVIHIAVEPKTKADQEKMGIALNRLAKEDPSFRVRTDEESGQTIISGMGELHLEIIVDRMKREFGVEA
NIGAPQVAYRETIRKAVKAEYKHAKQSGGKGQYGHVVIEMEPMEPGGEGYEFIDEIKGGVIPREFIPSVDKGIRDTLPNG
IVAGYPVVDVRIRLVFGSYHDVDSSQLAFELAASQAFKEGMRQASPALLEPIMAVEVETPEEYMGDVMGDLNRRRGVVLG
MDDDGIGGKKVRAEVPLAEMFGYSTDLRSATQGRATYSMEFKKYSEAPAHIAAAVTEARKG
>P41084 ~~~fusA~~~Elongation factor G~~~COG0480
MSKINKLEQIRNIGICAHIDAGKTTTTERILYYTGKSHKIGEVHEGGATMDWMEQEQERGITITSAATTCRWQDKVINII
DTPGHVDFTIEVERSLRVLDGAVAVFDGVAGVEPQSETVWRQADKYNVPRMCFVNKMDRMGADFYRCVEMIKDRLGARSL
IIQLPIGIEENFKGIVNLIKMKAVIWKDESLGAEYFEEDIPADMQDKAAEYRARLLDMVVELDDTIMEQYLSGAEITEEQ
IKILIRKGTIEARFYPILCGSAFKNKGVQPLLDAIVDFLPSPIDIGIVKGIEVSTSEEKDFPISIVEPFSALAFKIMNDP
FVGSLTFIRIYSGKITSGATVINTVKNKREKIGRMLLMHANNREDIKEASAGDIVALAGLKDTSTGDTLSDIDKQVVLER
MEFPEPVIELAVEPKSTADQEKMGLALSRLAAEDPSFRVSTDHETGQTVIKGMGELHLEIIIDRMRREFKVEANIGAPQV
AYRETITTACEIDYTHKKQSGGAGQFARVKIIFEPLKDVIDLKDEDKNKTFVFESKIVGGAVPKEYIPGVEKGLNNIRET
GVIAGYPMIDFKATLVDGAFHDVDSSVLAFEIAAKGAFREGMQKGNPKLLEPIMKVEVITPDEYMGDIIGDLNSRRGQIQ
NMDPRGNAQVVTAHVPLAEMFGYVNTLRSLSQGRAQFSMIFSHYDQVPSQVADMIKAKK
>Q2G0N1 ~~~fusA~~~Elongation factor G~~~COG0480
MAREFSLEKTRNIGIMAHIDAGKTTTTERILYYTGRIHKIGETHEGASQMDWMEQEQDRGITITSAATTAAWEGHRVNII
DTPGHVDFTVEVERSLRVLDGAVTVLDAQSGVEPQTETVWRQATTYGVPRIVFVNKMDKLGANFEYSVSTLHDRLQANAA
PIQLPIGAEDEFEAIIDLVEMKCFKYTNDLGTEIEEIEIPEDHLDRAEEARASLIEAVAETSDELMEKYLGDEEISVSEL
KEAIRQATTNVEFYPVLCGTAFKNKGVQLMLDAVIDYLPSPLDVKPIIGHRASNPEEEVIAKADDSAEFAALAFKVMTDP
YVGKLTFFRVYSGTMTSGSYVKNSTKGKRERVGRLLQMHANSRQEIDTVYSGDIAAAVGLKDTGTGDTLCGEKNDIILES
MEFPEPVIHLSVEPKSKADQDKMTQALVKLQEEDPTFHAHTDEETGQVIIGGMGELHLDILVDRMKKEFNVECNVGAPMV
SYRETFKSSAQVQGKFSRQSGGRGQYGDVHIEFTPNETGAGFEFENAIVGGVVPREYIPSVEAGLKDAMENGVLAGYPLI
DVKAKLYDGSYHDVDSSEMAFKIAASLALKEAAKKCDPVILEPMMKVTIEMPEEYMGDIMGDVTSRRGRVDGMEPRGNAQ
VVNAYVPLSEMFGYATSLRSNTQGRGTYTMYFDHYAEVPKSIAEDIIKKNKGE
>P68789 ~~~fusA~~~Elongation factor G~~~
MAREFSLEKTRNIGIMAHIDAGKTTTTERILYYTGRIHKIGETHEGASQMDWMEQEQDRGITITSAATTAAWEGHRVNII
DTPGHVDFTVEVERSLRVLDGAVTVLDAQSGVEPQTETVWRQATTYGVPRIVFVNKMDKLGANFEYSVSTLHDRLQANAA
PIQLPIGAEDEFEAIIDLVEMKCFKYTNDLGTEIEEIEIPEDHLDRAEEARASLIEAVAETSDELMEKYLGDEEISVSEL
KEAIRQATTNVEFYPVLCGTAFKNKGVQLMLDAVIDYLPSPLDVKPIIGHRASNPEEEVIAKADDSAEFAALAFKVMTDP
YVGKLTFFRVYSGTMTSGSYVKNSTKGKRERVGRLLQMHANSRQEIDTVYSGDIAAAVGLKDTGTGDTLCGEKNDIILES
MEFPEPVIHLSVEPKSKADQDKMTQALVKLQEEDPTFHAHTDEETGQVIIGGMGELHLDILVDRMKKEFNVECNVGAPMV
SYRETFKSSAQVQGKFSRQSGGRGQYGDVHIEFTPNETGAGFEFENAIVGGVVPREYIPSVEAGLKDAMENGVLAGYPLI
DVKAKLYDGSYHDVDSSEMAFKIAASLALKEAAKKCDPVILEPMMKVTIEMPEEYMGDIMGDVTSRRGRVDGMEPRGNAQ
VVNAYVPLSEMFGYATSLRSNTQGRGTYTMYFDHYAEVPKSIAEDIIKKNKGE
>P68790 ~~~fusA~~~Elongation factor G~~~
MAREFSLEKTRNIGIMAHIDAGKTTTTERILYYTGRIHKIGETHEGASQMDWMEQEQDRGITITSAATTAAWEGHRVNII
DTPGHVDFTVEVERSLRVLDGAVTVLDAQSGVEPQTETVWRQATTYGVPRIVFVNKMDKLGANFEYSVSTLHDRLQANAA
PIQLPIGAEDEFEAIIDLVEMKCFKYTNDLGTEIEEIEIPEDHLDRAEEARASLIEAVAETSDELMEKYLGDEEISVSEL
KEAIRQATTNVEFYPVLCGTAFKNKGVQLMLDAVIDYLPSPLDVKPIIGHRASNPEEEVIAKADDSAEFAALAFKVMTDP
YVGKLTFFRVYSGTMTSGSYVKNSTKGKRERVGRLLQMHANSRQEIDTVYSGDIAAAVGLKDTGTGDTLCGEKNDIILES
MEFPEPVIHLSVEPKSKADQDKMTQALVKLQEEDPTFHAHTDEETGQVIIGGMGELHLDILVDRMKKEFNVECNVGAPMV
SYRETFKSSAQVQGKFSRQSGGRGQYGDVHIEFTPNETGAGFEFENAIVGGVVPREYIPSVEAGLKDAMENGVLAGYPLI
DVKAKLYDGSYHDVDSSEMAFKIAASLALKEAAKKCDPVILEPMMKVTIEMPEEYMGDIMGDVTSRRGRVDGMEPRGNAQ
VVNAYVPLSEMFGYATSLRSNTQGRGTYTMYFDHYAEVPKSIAEDIIKKNKGE
>Q5XDW4 ~~~fus~~~Elongation factor G~~~
MAREFSLAKTRNIGIMAHVDAGKTTTTERILYYTGKIHKIGETHEGASQMDWMEQEQERGITITSAATTAQWDGHRVNII
DTPGHVDFTIEVQRSLRVLDGAVTVLDSQSGVEPQTETVWRQATEYGVPRIVFANKMDKIGADFLYSVQTLHDRLQANAH
PIQLPIGAEDDFRGIIDLIKMKAEIYTNDLGTDILEEDIPEEYLEQAQEYREKLIEAVAETDEDLMMKYLEGEEITNDEL
IAGIRKATINVEFFPVLCGSAFKNKGVQLMLDAVIAYLPSPLDIPAIKGVNPDTDAEEERPASDEEPFAALAFKIMTDPF
VGRLTFFRVYSGVLNSGSYVMNTSKGKRERIGRILQMHANSRQEIETVYAGDIAAAVGLKDTTTGDSLTDEKAKVILESI
EVPEPVIQLMVEPKSKADQDKMGVALQKLAEEDPTFRVETNVETGETVIAGMGELHLDVLVDRMKREFKVEANVGAPQVS
YRETFRASTQARGFFKRQSGGKGQFGDVWIEFTPNEEGKGFEFENAIVGGVVPREFIPAVEKGLIESMANGVLAGYPMVD
VKAKLYDGSYHDVDSSETAFKIAASLALKEAAKSAQPAILEPMMLVTITAPEDNLGDVMGHVTARRGRVDGMEAHGNSQI
VRAYVPLAEMFGYATVLRSATQGRGTFMMVFDHYEDVPKSVQEEIIKKNKGE
>Q72I01 ~~~fusA~~~Elongation factor G~~~COG0480
MAVKVEYDLKRLRNIGIAAHIDAGKTTTTERILYYTGRIHKIGEVHEGAATMDFMEQERERGITITAAVTTCFWKDHRIN
IIDTPGHVDFTIEVERSMRVLDGAIVVFDSSQGVEPQSETVWRQAEKYHVPRIAFANKMDKTGADLWLVIRTMQERLGAR
PVVMQLPIGREDTFSGIIDVLRMKAYTYGNDLGTDIREIPIPEEYLDQAREYHEKLVEVAADFDEHIMLKYLEGEEPTEE
ELVAAIRKGTIDLKITPVFLGSALKNKGVQLLLDAVVDYLPSPLDIPPIKGTTPEGEVVEIHPDPNGPLAALAFKIMADP
YVGRLTFIRVYSGTLTSGSYVYNTTKGRKERVARLLRMHANHREEVEELKAGDLGAVVGLKETITGDTLVGEDAPRVILE
SIEVPEPVIDVAIEPKTKADQEKLSQALARLAEEDPTFRVSTHPETGQTIISGMGELHLEIIVDRLKREFKVDANVGKPQ
VAYRETITKPVDVEGKFIRQTGGRGQYGHVKIKVEPLPRGSGFEFVNAIVGGVIPKEYIPAVQKGIEEAMQSGPLIGFPV
VDIKVTLYDGSYHEVDSSEMAFKIAGSMAIKEAVQKGDPVILEPIMRVEVTTPEEYMGDVIGDLNARRGQILGMEPRGNA
QVIRAFVPLAEMFGYATDLRSKTQGRGSFVMFFDHYQEVPKQVQEKLIKGQ
>Q5SHN5 ~~~fusA~~~Elongation factor G~~~COG0480
MAVKVEYDLKRLRNIGIAAHIDAGKTTTTERILYYTGRIHKIGEVHEGAATMDFMEQERERGITITAAVTTCFWKDHRIN
IIDTPGHVDFTIEVERSMRVLDGAIVVFDSSQGVEPQSETVWRQAEKYKVPRIAFANKMDKTGADLWLVIRTMQERLGAR
PVVMQLPIGREDTFSGIIDVLRMKAYTYGNDLGTDIREIPIPEEYLDQAREYHEKLVEVAADFDENIMLKYLEGEEPTEE
ELVAAIRKGTIDLKITPVFLGSALKNKGVQLLLDAVVDYLPSPLDIPPIKGTTPEGEVVEIHPDPNGPLAALAFKIMADP
YVGRLTFIRVYSGTLTSGSYVYNTTKGRKERVARLLRMHANHREEVEELKAGDLGAVVGLKETITGDTLVGEDAPRVILE
SIEVPEPVIDVAIEPKTKADQEKLSQALARLAEEDPTFRVSTHPETGQTIISGMGELHLEIIVDRLKREFKVDANVGKPQ
VAYRETITKPVDVEGKFIRQTGGRGQYGHVKIKVEPLPRGSGFEFVNAIVGGVIPKEYIPAVQKGIEEAMQSGPLIGFPV
VDIKVTLYDGSYHEVDSSEMAFKIAGSMAIKEAVQKGDPVILEPIMRVEVTTPEEYMGDVIGDLNARRGQILGMEPRGNA
QVIRAFVPLAEMFGYATDLRSKTQGRGSFVMFFDHYQEVPKQVQEKLIKGQ
>P13551 ~~~fusA~~~Elongation factor G~~~
MAVKVEYDLKRLRNIGIAAHIDAGKTTTTERILYYTGRIHKIGEVHEGAATMDFMEQERERGITITAAVTTCFWKDHRIN
IIDTPGHVDFTIEVERSMRVLDGAIVVFDSSQGVEPQSETVWRQAEKYKVPRIAFANKMDKTGADLWLVIRTMQERLGAR
PVVMQLPIGREDTFSGIIDVLRMKAYTYGNDLGTDIREIPIPEEYLDQAREYHEKLVEVAADFDENIMLKYLEGEEPTEE
ELVAAIRKGTIDLKITPVFLGSALKNKGVQLLLDAVVDYLPSPLDIPPIKGTTPEGEVVEIHPDPNGPLAALAFKIMADP
YVGRLTFIRVYSGTLTSGSYVYNTTKGRKERVARLLRMHANHREEVEELKAGDLGAVVGLKETITGDTLVGEDAPRVILE
SIEVPEPVIDVAIEPKTKADQEKLSQALARLAEEDPTFRVSTHPETGQTIISGMGELHLEIIVDRLKREFKVDANVGKPQ
VAYRETITKPVDVEGKFIRQTGGRGQYGHVKIKVEPLPRGSGFEFVNAIVGGVIPKEYIPAVQKGIEEAMQSGPLIGFPV
VDIKVTLYDGSYHEVDSSEMAFKIAGSMAIKEAVQKGDPVILEPIMRVEVTTPEEYMGDVIGDLNARRGQILGMEPRGNA
QVIRAFVPLAEMFGYATDLRSKTQGRGSFVMFFDHYQEVPKQVQEKLIKGQ
>Q8DCQ8 ~~~fusA~~~Elongation factor G~~~
MARKTPIERYRNIGICAHVDAGKTTTTERILFYTGLSHKIGEVHDGAATMDWMEQEQERGITITSAATTTFWRGMEAQFQ
DHRVNIIDTPGHVDFTIEVERSLRVLDGAVVVFCGSSGVEPQSETVWRQADKYHVPRMVFVNKMDRAGADFLRVVDQIKN
RLGANPVPIQLNVGAEEDFKGVIDLIKMKMINWNEADQGMTFTYEEIPADMIELAEEWRNNLVEAAAEASEELMDKYLEE
GELTEAEIKQALRARTLNNEIVLATCGSAFKNKGVQAVLDAVIEYLPSPIDVPAIKGIDENDNEVERHADDNEPFSALAF
KIATDPFVGTLTFIRVYSGVVNTGDAVYNSVKQKKERFGRIVQMHANKREEIKEVRAGDIAAAIGLKDVTTGDTLCNSDH
KVILERMEFPEPVIQIAVEPRSKADQEKMGIALGKLAAEDPSFRVETDAETGQTLISGMGELHLDIIVDRMKREFSVDCN
VGKPQVAYRETIRGKSEVEGKFVRQSGGRGQYGHVWIKLEPSEPGAGFVFVDEVVGGVIPKEYISSVAKGIEEQMNSGVL
AGYPVLDIKATLFDGSYHDVDSSEMAFKIAGSMAFKKGALEAQPVILEPMMKVEVTTPEDWMGDVVGDLNRRRGIIEGMD
EGVAGLKIIRAQVPLSEMFGYATDLRSATQGRASYSMEFFEYAEVPKNIAEAIVAERGY
>P9WJY5 ~~~efpA~~~Uncharacterized MFS-type transporter EfpA~~~COG0477
MTALNDTERAVRNWTAGRPHRPAPMRPPRSEETASERPSRYYPTWLPSRSFIAAVIAIGGMQLLATMDSTVAIVALPKIQ
NELSLSDAGRSWVITAYVLTFGGLMLLGGRLGDTIGRKRTFIVGVALFTISSVLCAVAWDEATLVIARLSQGVGSAIASP
TGLALVATTFPKGPARNAATAVFAAMTAIGSVMGLVVGGALTEVSWRWAFLVNVPIGLVMIYLARTALRETNKERMKLDA
TGAILATLACTAAVFAFSIGPEKGWMSGITIGSGLVALAAAVAFVIVERTAENPVVPFHLFRDRNRLVTFSAILLAGGVM
FSLTVCIGLYVQDILGYSALRAGVGFIPFVIAMGIGLGVSSQLVSRFSPRVLTIGGGYLLFGAMLYGSFFMHRGVPYFPN
LVMPIVVGGIGIGMAVVPLTLSAIAGVGFDQIGPVSAIALMLQSLGGPLVLAVIQAVITSRTLYLGGTTGPVKFMNDVQL
AALDHAYTYGLLWVAGAAIIVGGMALFIGYTPQQVAHAQEVKEAIDAGEL
>P0A6N8 ~~~yeiP~~~Elongation factor P-like protein~~~COG0231
MPRANEIKKGMVLNYNGKLLLVKDIDIQSPTARGAATLYKMRFSDVRTGLKVEERFKGDDIVDTVTLTRRYVDFSYVDGN
EYVFMDKEDYTPYTFTKDQIEEELLFMPEGGMPDMQVLTWDGQLLALELPQTVDLEIVETAPGIKGASASARNKPATLST
GLVIQVPEYLSPGEKIRIHIEERRYMGRAD
>A3DDQ3 ~~~efp~~~Elongation factor P~~~COG0231
MISAGDFKNGVTFELDGQIFQVIEFQHVKPGKGAAFVRTKLKNIVTGATIEKTFNPTDKMPKAHIERKDMQYLYNDGDLY
YFMDTETFEQLPLGKDKIGDALKFVKENEIVKVLSHKGNVFGIEPPNFVELEVTDTEPGFKGDTATGATKPAIVETGASI
KVPLFVNKGDIIRIDTRTGEYMERV
>Q45288 ~~~efp~~~Elongation factor P~~~COG0231
MATTADFKNGLVLKNEGKLQQIIEFQHVKPGKGPAFVRTKLKDVVTGKTIDKTWNAGVKVETATVDRRDVTYLYNDGTSF
IVMDDKTFEQYELSPDAFGDAGRFLLENMRVQVSFHEGEALFGELPVSVDLRVEHTDPGLQGDRSTGGTKPATLETGAEI
QVPLFIETGNVLKVDTRDGSYLSRVNN
>Q83AR4 ~~~efp~~~Elongation factor P~~~COG0231
MATHSTNEFRGGLKVMVDGDPCSIIDNEFVKPGKGQAFNRVKFRNLKTGRVLERTFKSGETLPAADVVEVEMQYLYNDGE
FWHFMTSENYEQHAASKEAVAEAKQWLKEEALCMVTMWNGVPLSVEPPNFVELKITETEPGVRGDTATGGTKRAKLETGA
VVRVPLFLNEGEIIKVDTRRGEYVSRAK
>P0A6N4 ~~~efp~~~Elongation factor P~~~COG0231
MATYYSNDFRAGLKIMLDGEPYAVEASEFVKPGKGQAFARVKLRRLLTGTRVEKTFKSTDSAEGADVVDMNLTYLYNDGE
FWHFMNNETFEQLSADAKAIGDNAKWLLDQAECIVTLWNGQPISVTPPNFVELEIVDTDPGLKGDTAGTGGKPATLSTGA
VVKVPLFVQIGEVIKVDTRSGEYVSRVK
>P75085 ~~~efp~~~Elongation factor P~~~
MADMIEAKSLRSGQTIFGPNKEILLVLENTFNKTAMRQGIVKTKVKNLRTGAIVWIEFTGDKLEQVIIDKKKMTFLYKDG
ANYVFMDQQDYSQIEIPEKQLEWEKNFITEDSEVTIISYQSEILGVNLPELVPIEVEFAEEAVQGNTANMARKRARLVSG
YELDVPQFIRTGDKIVISTIDGSYRERYNK
>A0QWR4 ~~~efp~~~Elongation factor P~~~COG0231
MASTADFKNGLVLQIDGQLWQIVEFQHVKPGKGPAFVRTKLKNVVSGKVVDKTYNAGVKVETATVDRRDATYLYRDGSDF
VFMDSEDFEQHPLPESLVGRLADFLLESMPVQIAFHDGTPLYLELPVSVELEVTHTEPGLQGDRSSAGTKPATVETGAEI
QVPLFINTGDRLKVDTRDGSYLGRVNA
>P9WNM3 ~~~efp~~~Elongation factor P~~~COG0231
MATTADFKNGLVLVIDGQLWTITEFQHVKPGKGPAFVRTKLKNVLSGKVVDKTFNAGVKVDTATVDRRDTTYLYRDGSDF
VFMDSQDYEQHPLPEALVGDAARFLLEGMPVQVAFHNGVPLYIELPVTVELEVTHTEPGLQGDRSSAGTKPATLQTGAQI
NVPLFINTGDKLKVDSRDGSYLGRVNA
>P0DUK0 ~~~efp~~~Elongation factor P~~~
MKTAQELRAGNVFMVGNDPMVVQKTEYIKGGRSSAKVSMKLKNLLTGAASETIYKADDKFDVVILSRKNCTYSYFADPMY
VFMDEEFNQYEIEADNIGDALKFIVDGMEDQCEVTFYEGNPISVELPTIIVREVEYTEPAVKGDTSGKVMKTARLVGGTE
IQVMSYIENGDKIEIDTRTGEFRKRA
>E6MVW0 ~~~efp~~~Elongation factor P~~~
MKTAQELRAGNVFMVGNDPMVVQKTEYIKGGRSSAKVSMKLKNLLTGAASETIYKADDKFDVVILSRKNCTYSYFADPMY
VFMDEEFNQYEIEADNIGDALKFIVDGMEDQCEVTFYEGNPISVELPTIIVREVEYTEPAVKGDTSGKVMKTARLVGGTE
IQVMSYIENGDKVEIDTRTGEFRKRA
>Q9HZZ2 ~~~efp~~~Elongation factor P~~~
MKTAQEFRAGQVANINGAPWVIQKAEFNKSGRNAAVVKMKLKNLLTGAGTETVFKADDKLEPIILDRKEVTYSYFADPLY
VFMDSEFNQYEIEKDDLEGVLTFIEDGMTDICEAVFYNDKVISVELPTTIVRQIAYTEPAVRGDTSGKVMKTARLNNGAE
LQVSAFCEIGDSIEIDTRTGEYKSRVKA
>Q88LS0 ~~~efp~~~Elongation factor P~~~COG0231
MKTGKELKPGTVLRIDNDPWLVQKAEFTKSGRNSAIMKTKLKNLLTGYKTETVYGADDKLDDVILDRKEATLSFINGDEY
TFMDTTDYTMYELNAEDIEAVLPYIEEGMEDVCEAVFFEGRLVSVELPTTISRKVVYTENAARGDTSGKVMKPAKLANGT
EISVADFIQIDEWIDIDTRDNSFKGRSKK
>P64036 ~~~efp~~~Elongation factor P~~~
MATYYSNDFRSGLKIMLDGEPYAVESSEFVKPGKGQAFARVKLRRLLTGTRVEKTFKSTDSAEGADVVDMNLTYLYNDGE
FWHFMNNETFEQLSADAKAIGDNAKWLLDQAECIVTLWNGQPISVTPPNFVELEIVDTDPGLKGDTAGTGGKPATLSTGA
VVKVPLFVQIGEVIKVDTRSGEYVSRVK
>Q8EEP9 ~~~efp~~~Elongation factor P~~~COG0231
MKTAHEVRPGNVIMFEGSPWVVQKTETTRSGRNAAIVKLKLKNLLLNSGTETTFKGEDKIDDIILDRLDCTYSYFADPMY
VFMDAEYNQYDVEAENLGDAAAYIVDGMEETCQVTFYDGKAISVEMPTTIVREVIYTEPSARGDTSGKVMKPATITGGGT
ISVADFVKVGDKIEIDTRTGEFKKRV
>Q2FY41 ~~~efp~~~Elongation factor P~~~COG0231
MISVNDFKTGLTISVDNAIWKVIDFQHVKPGKGSAFVRSKLRNLRTGAIQEKTFRAGEKVEPAMIENRRMQYLYADGDNH
VFMDNESFEQTELSSDYLKEELNYLKEGMEVQIQTYEGETIGVELPKTVELTVTETEPGIKGDTATGATKSATVETGYTL
NVPLFVNEGDVLIINTGDGSYISRG
>P99066 ~~~efp~~~Elongation factor P~~~
MISVNDFKTGLTISVDNAIWKVIDFQHVKPGKGSAFVRSKLRNLRTGAIQEKTFRAGEKVEPAMIENRRMQYLYADGDNH
VFMDNESFEQTELSSDYLKEELNYLKEGMEVQIQTYEGETIGVELPKTVELTVTETEPGIKGDTATGATKSATVETGYTL
NVPLFVNEGDVLIINTGDGSYISRG
>Q5XA92 ~~~efp~~~Elongation factor P~~~
MIEASKLKAGMTFEAEGKLIRVLEASHHKPGKGNTIMRMKLRDVRTGSTFDTTYRPDEKFEQAIIETVPAQYLYKMDDTA
YFMNTDTYDQYEIPVANVEQELLYILENSDVKIQFYGSEVIGVTVPTTVELTVAETQPSIKGATVTGSGKPATLETGLVV
NVPDFIEAGQKLIINTAEGTYVSRA
>Q76G20 ~~~efp~~~Elongation factor P~~~COG0231
MISVTDLRPGTKVKMDGGLWECVEYQHQKLGRGGAKVVAKFKNLETGATVERTFNSGEKLEDIYVETRELQYLYPEGEEM
VFMDLETYEQFAVPRSRVVGAEFFKEGMTALGDMYEGQPIKVTPPTVVELKVVDTPPGVRGDTVSGGSKPATLETGAVVQ
VPLFVEPGEVIKVDTRTGEYVGRA
>P80700 ~~~tsf~~~Elongation factor Ts~~~COG0264
MAITAQQVKELREKTGAGMMDCKKALTETDGDMDKAIDLLREKGIAKAAKKADRIAAEGSTLIKTDGNKGVILEVNSETD
FVAKNEGFKELLNTLADHLLANTPADVEEAMGQKMENGSTVEEYITSAVAKIGEKITLRRFTVLTKDDSSAFGAYLHMGG
RIGVLTVLNGTTDEETAKDIAMHVAAVNPRYISRDQVSEEETNHERQILTQQALQEGKPENIVAKMVEGRLNKFFEEICL
LDQAFVKNPDEKVKQVIAAKNATVQTFVRYEVGEGIEKRQENFAEEVMNQVKK
>Q9X5U9 ~~~tsf~~~Elongation factor Ts~~~COG0264
MTTITPIMVKELRERTGAAVMACKKALQETNGDMEAAIDLLRKAGDAKAAKRAGKTAAEGVIVIAISKDQKKGFMAEVNS
ETDFVARDTNFMAFASKVAERGLAEGVSDVAATLALPIEPNSSSTIEDERKALVNRIGENIQIRRVASLSSDGVVGHYSH
GGRIGVLLALDVPNPELAKGLAMHVAAFNPQAVSANQVSTEFVEKEKEIFLARAQETGKPANIIEKMVKGQVEKLLKEVS
LEGQSFVKDPEKLVGDLLKAEKAKVLAFLRFEVGEGVEKESQNFADEVMAQVQGNR
>P0A6P1 ~~~tsf~~~Elongation factor Ts~~~COG0264
MAEITASLVKELRERTGAGMMDCKKALTEANGDIELAIENMRKSGAIKAAKKAGNVAADGVIKTKIDGNYGIILEVNCQT
DFVAKDAGFQAFADKVLDAAVAGKITDVEVLKAQFEEERVALVAKIGENINIRRVAALEGDVLGSYQHGARIGVLVAAKG
ADEELVKHIAMHVAASKPEFIKPEDVSAEVVEKEYQVQLDIAMQSGKPKEIAEKMVEGRMKKFTGEVSLTGQPFVMEPSK
TVGQLLKEHNAEVTGFIRFEVGEGIEKVETDFAAEVAAMSKQS
>A0QVB9 ~~~tsf~~~Elongation factor Ts~~~COG0264
MANYTAADVKRLRELTGAGMLDSKNALVEADGDFDKAVELLRIKGAKDVGKRAERATAEGLVAAKDGALIELNSETDFVA
KNAEFQALADQIVAAAVAAKANDIETLKAAKTGDTTVEQAIADLSAKIGEKLELRRAAYFDGTVEAYLHKRAADLPPAVG
VLVEYQAGDADKGKEAAHAVALQIAALKAKYLTREDVPEDIVANERRIAEETARNEGKPEQALPKIVEGRVTGFYKDVVL
LDQPSVSDNKKTVKALLDEAGVTVTRFVRFEVGQA
>P9WNM1 ~~~tsf~~~Elongation factor Ts~~~COG0264
MANFTAADVKRLRELTGAGMLACKNALAETDGDFDKAVEALRIKGAKDVGKRAERATAEGLVAAKDGALIELNCETDFVA
KNAEFQTLADQVVAAAAAAKPADVDALKGASIGDKTVEQAIAELSAKIGEKLELRRVAIFDGTVEAYLHRRSADLPPAVG
VLVEYRGDDAAAAHAVALQIAALRARYLSRDDVPEDIVASERRIAEETARAEGKPEQALPKIVEGRLNGFFKDAVLLEQA
SVSDNKKTVKALLDVAGVTVTRFVRFEVGQA
>P99171 ~~~tsf~~~Elongation factor Ts~~~
MATISAKLVKELRKKTGAGMMDCKKALTETDGDIDKAIDYLREKGIAKAAKKADRIAAEGLVHVETKGNDAVIVEINSET
DFVARNEGFQELVKEIANQVLDTKAETVEALMETTLPNGKSVDERIKEAISTIGEKLSVRRFAIRTKTDNDAFGAYLHMG
GRIGVLTVVEGSTDEEAARDVAMHIAAINPKYVSSEQVSEEEINHEREVLKQQALNEGKPENIVEKMVEGRLRKYLQEIC
AVDQDFVKNPDVTVEAFLKTKGGKLVDFVRYEVGEGMEKREENFADEVKGQMK
>P0A3B7 ~~~tsf~~~Elongation factor Ts~~~COG0264
MAEITAKLVKELREKSGAGVMDAKKALVETDGDIEKAIELLREKGMAKAAKKADRVAAEGLTGVYVNGNVAAVIEVNAET
DFVAKNAQFVELVNTTAKVIAEGKPANNEEALALIMPSGETLEAAYVSATATIGEKISFRRFALIEKTDAQHFGAYQHNG
GRIGVISVVEGGDEALAKQLSMHIAAMKPTVLSYKELDEQFVKDELAQLNHVIDQDNESRAMVNKPALPHLKYGSKAQLT
DDVIAQAEADIKAELAAEGKPEKIWDKIIPGKMDRFMLDNTKVDQAYTLLAQVYIMDDSKTVEAYLESVNASVVEFARFE
VGEGIEKAANDFEAEVAATMAAALNN
>P74070 ~~~tsf~~~Elongation factor Ts~~~COG0264
MAEITAQLVKELREKTGAGMMDCKKALKENEGDLEKSIEWLRQKGIASADKKSGRTAAEGLVHSYIHFGGRIGVLVEVNC
ETDFVARGDRFKDLVNDVAMQIAACPNVEYVSVADIPQEMVAKEKEIEMGRDDLGKKPANIKEKIVQGRIDKRLKELSLL
DQPYIKDQNLTIEELVKQAIAELGENIQVRRFIRFNLGEGIEKAETNFAEEVAAAAKG
>P43895 ~~~tsf~~~Elongation factor Ts~~~COG0264
MSQMELIKKLREATGAGMMDVKRALEDAGWDEEKAVQLLRERGAMKAAKKADREAREGIIGHYIHHNQRVGVLVELNCET
DFVARNELFQNLAKDLAMHIAMMNPRYVSAEEIPAEELEKERQIYIQAALNEGKPQQIAEKIAEGRLKKYLEEVVLLEQP
FVKDDKVKVKELIQQAIAKIGENIVVRRFCRFELGA
>P0CE47 ~~~tufA~~~Elongation factor Tu 1~~~COG0050
MSKEKFERTKPHVNVGTIGHVDHGKTTLTAAITTVLAKTYGGAARAFDQIDNAPEEKARGITINTSHVEYDTPTRHYAHV
DCPGHADYVKNMITGAAQMDGAILVVAATDGPMPQTREHILLGRQVGVPYIIVFLNKCDMVDDEELLELVEMEVRELLSQ
YDFPGDDTPIVRGSALKALEGDAEWEAKILELAGFLDSYIPEPERAIDKPFLLPIEDVFSISGRGTVVTGRVERGIIKVG
EEVEIVGIKETQKSTCTGVEMFRKLLDEGRAGENVGVLLRGIKREEIERGQVLAKPGTIKPHTKFESEVYILSKDEGGRH
TPFFKGYRPQFYFRTTDVTGTIELPEGVEMVMPGDNIKMVVTLIHPIAMDDGLRFAIREGGRTVGAGVVAKVLG
>Q1R5Y2 ~~~tuf1~~~Elongation factor Tu 1~~~
MSKEKFERTKPHVNVGTIGHVDHGKTTLTAAITTVLAKTYGGAARAFDQIDNAPEEKARGITINTSHVEYDTPTRHYAHV
DCPGHADYVKNMITGAAQMDGAILVVAATDGPMPQTREHILLGRQVGVPYIIVFLNKCDMVDDEELLELVEMEVRELLSQ
YDFPGDDTPIVRGSALKALEGDAEWEAKILELAGFLDSYIPEPERAIDKPFLLPIEDVFSISGRGTVVTGRVERGIIKVG
EEVEIVGIKETQKSTCTGVEMFRKLLDEGRAGENVGVLLRGIKREEIERGQVLAKPGTIKPHTKFESEVYILSKDEGGRH
TPFFKGYRPQFYFRTTDVTGTIELPEGVEMVMPGDNIKMVVTLIHPIAMDDGLRFAIREGGRTVGAGVVAKVLG
>Q88QP8 ~~~tufA~~~Elongation factor Tu-A~~~COG0050
MAKEKFDRSLPHVNVGTIGHVDHGKTTLTAALTRVCSEVFGSAIVEFDKIDSAPEEKARGITINTAHVEYNSTIRHYAHV
DCPGHADYVKNMITGAAQMDGAILVCSAADGPMPQTREHILLSRQVGVPYIVVFLNKADLVDDAELLELVEMEVRDLLST
YDFPGDDTPIIIGSARMALEGKDDNEMGTTAVKKLVETLDSYIPEPVRAIDQPFLMPIEDVFSISGRGTVVTGRIERGIV
RVQDPLEIVGLRDTTTTTCTGVEMFRKLLDEGRAGENCGVLLRGTKRDDVERGQVLVKPGSVKPHTKFTAEVYVLSKEEG
GRHTPFFKGYRPQFYFRTTDVTGNCELPEGVEMVMPGDNIQMTVTLIKTIAMEDGLRFAIREGGRTVGAGVVAKIIE
>Q5SHN6 ~~~tufA~~~Elongation factor Tu-A~~~COG0050
MAKGEFVRTKPHVNVGTIGHVDHGKTTLTAALTYVAAAENPNVEVKDYGDIDKAPEERARGITINTAHVEYETAKRHYSH
VDCPGHADYIKNMITGAAQMDGAILVVSAADGPMPQTREHILLARQVGVPYIVVFMNKVDMVDDPELLDLVEMEVRDLLN
QYEFPGDEVPVIRGSALLALEQMHRNPKTRRGENEWVDKIWELLDAIDEYIPTPVRDVDKPFLMPVEDVFTITGRGTVAT
GRIERGKVKVGDEVEIVGLAPETRRTVVTGVEMHRKTLQEGIAGDNVGVLLRGVSREEVERGQVLAKPGSITPHTKFEAS
VYVLKKEEGGRHTGFFSGYRPQFYFRTTDVTGVVQLPPGVEMVMPGDNVTFTVELIKPVALEEGLRFAIREGGRTVGAGV
VTKILE
>P60338 ~~~tufA~~~Elongation factor Tu-A~~~
MAKGEFVRTKPHVNVGTIGHVDHGKTTLTAALTYVAAAENPNVEVKDYGDIDKAPEERARGITINTAHVEYETAKRHYSH
VDCPGHADYIKNMITGAAQMDGAILVVSAADGPMPQTREHILLARQVGVPYIVVFMNKVDMVDDPELLDLVEMEVRDLLN
QYEFPGDEVPVIRGSALLALEQMHRNPKTRRGENEWVDKIWELLDAIDEYIPTPVRDVDKPFLMPVEDVFTITGRGTVAT
GRIERGKVKVGDEVEIVGLAPETRRTVVTGVEMHRKTLQEGIAGDNVGVLLRGVSREEVERGQVLAKPGSITPHTKFEAS
VYVLKKEEGGRHTGFFSGYRPQFYFRTTDVTGVVQLPPGVEMVMPGDNVTFTVELIKPVALEEGLRFAIREGGRTVGAGV
VTKILE
>P0CE48 ~~~tufB~~~Elongation factor Tu 2~~~COG0050
MSKEKFERTKPHVNVGTIGHVDHGKTTLTAAITTVLAKTYGGAARAFDQIDNAPEEKARGITINTSHVEYDTPTRHYAHV
DCPGHADYVKNMITGAAQMDGAILVVAATDGPMPQTREHILLGRQVGVPYIIVFLNKCDMVDDEELLELVEMEVRELLSQ
YDFPGDDTPIVRGSALKALEGDAEWEAKILELAGFLDSYIPEPERAIDKPFLLPIEDVFSISGRGTVVTGRVERGIIKVG
EEVEIVGIKETQKSTCTGVEMFRKLLDEGRAGENVGVLLRGIKREEIERGQVLAKPGTIKPHTKFESEVYILSKDEGGRH
TPFFKGYRPQFYFRTTDVTGTIELPEGVEMVMPGDNIKMVVTLIHPIAMDDGLRFAIREGGRTVGAGVVAKVLS
>P60339 ~~~tufB~~~Elongation factor Tu-B~~~COG0050
MAKGEFIRTKPHVNVGTIGHVDHGKTTLTAALTFVTAAENPNVEVKDYGDIDKAPEERARGITINTAHVEYETAKRHYSH
VDCPGHADYIKNMITGAAQMDGAILVVSAADGPMPQTREHILLARQVGVPYIVVFMNKVDMVDDPELLDLVEMEVRDLLN
QYEFPGDEVPVIRGSALLALEQMHRNPKTRRGENEWVDKIWELLDAIDEYIPTPVRDVDKPFLMPVEDVFTITGRGTVAT
GRIERGKVKVGDEVEIVGLAPETRKTVVTGVEMHRKTLQEGIAGDNVGVLLRGVSREEVERGQVLAKPGSITPHTKFEAS
VYVLKKEEGGRHTGFFSGYRPQFYFRTTDVTGVVQLPPGVEMVMPGDNVTFTVELIKPVALEEGLRFAIREGGRTVGAGV
VTKILE
>P33166 ~~~tuf~~~Elongation factor Tu~~~COG0050
MAKEKFDRSKSHANIGTIGHVDHGKTTLTAAITTVLHKKSGKGTAMAYDQIDGAPEERERGITISTAHVEYETETRHYAH
VDCPGHADYVKNMITGAAQMDGAILVVSAADGPMPQTREHILLSKNVGVPYIVVFLNKCDMVDDEELLELVEMEVRDLLS
EYDFPGDDVPVVKGSALKALEGDAEWEAKIFELMDAVDEYIPTPERDTEKPFMMPVEDVFSITGRGTVATGRVERGQVKV
GDEVEIIGLQEENKKTTVTGVEMFRKLLDYAEAGDNIGALLRGVSREEIQRGQVLAKPGTITPHSKFKAEVYVLSKEEGG
RHTPFFSNYRPQFYFRTTDVTGIIHLPEGVEMVMPGDNTEMNVELISTIAIEEGTRFSIREGGRTVGSGVVSTITE
>B0B7N8 ~~~tuf~~~Elongation factor Tu~~~
MSKETFQRNKPHINIGTIGHVDHGKTTLTAAITRTLSGDGLADFRDYSSIDNTPEEKARGITINASHVEYETANRHYAHV
DCPGHADYVKNMITGAAQMDGAILVVSATDGAMPQTKEHILLARQVGVPYIVVFLNKIDMISEEDAELVDLVEMELAELL
EEKGYKGCPIIRGSALKALEGDAAYIEKVRELMQAVDDNIPTPEREIDKPFLMPIEDVFSISGRGTVVTGRIERGIVKVS
DKVQLVGLRDTKETIVTGVEMFRKELPEGRAGENVGLLLRGIGKNDVERGMVVCLPNSVKPHTRFKCAVYVLQKEEGGRH
KPFFTGYRPQFFFRTTDVTGVVTLPEGVEMVMPGDNVEFEVQLISPVALEEGMRFAIREGGRTIGAGTISKIIA
>Q83ES6 ~~~tufA~~~Elongation factor Tu~~~COG0050
MSKEKFVREKPHVNVGTIGHVDHGKTTLTAALTKVLSEKYGGEKKAFDQIDNAPEERARGITIATSHVEYQSDKRHYAHV
DCPGHADYVKNMITGAAQMDGAILVVSAADGPMPQTREHIVLAKQVGVPNIVVYLNKADMVDDKELLELVEMEVRDLLNS
YDFPGDETPIIVGSALKALEGDKSEVGEPSIIKLVETMDTYFPQPERAIDKPFLMPIEDVFSISGRGTVVTGRVERGIIK
VGDEIEIVGIKDTTKTTCTGVEMFRKLLDEGQAGDNVGILLRGTKREEVERGQVLAKPGSITPHKKFEAEIYVLSKEEGG
RHTPFLQGYRPQFYFRTTDVTGQLLSLPEGIEMVMPGDNVKVTVELIAPVAMDEGLRFAVREGGRTVGAGVVTKIIE
>Q8Y422 ~~~tuf~~~Elongation factor Tu~~~COG0050
MAKEKFDRSKPHVNIGTIGHVDHGKTTLTAAITTVLAKKGYADAQAYDQIDGAPEERERGITISTAHVEYQTDSRHYAHV
DCPGHADYVKNMITGAAQMDGAILVVSAADGPMPQTREHILLSRQVGVPYIVVFMNKCDMVDDEELLELVEMEIRDLLTE
YEFPGDDIPVIKGSALKALQGEADWEAKIDELMEAVDSYIPTPERDTDKPFMMPVEDVFSITGRGTVATGRVERGQVKVG
DEVEVIGIEEESKKVVVTGVEMFRKLLDYAEAGDNIGALLRGVAREDIQRGQVLAKPGSITPHTNFKAETYVLTKEEGGR
HTPFFNNYRPQFYFRTTDVTGIVTLPEGTEMVMPGDNIELAVELIAPIAIEDGTKFSIREGGRTVGAGVVSNISK
>P22679 ~~~tuf~~~Elongation factor Tu~~~COG0050
MAKLDFDRSKPHVNIGTIGHVDHGKTTLTAAIATVLAKKGLAEARDYASIDNAPEEKARGITINTSHIEYQTEKRHYAHV
DCPGHADYVKNMITGAAQMDGAILVVAATDGPMPQTREHILLARQVGVPKIVVFLNKIDMFKDDEREEMVGLVEMDVRSL
LSEYGFDGDNAPIIAGSALKALQGDPEYEKGILELMDAVDTYIEEPKRETDKPFLMAVEDVFTITGRGTVATGRVERGVL
QLNEEVEIVGLKPTKKTVVTGIEMFRKNLKEAQAGDNAGLLLRGIDRSEVERGQVLAKPKTIVPHTQFEATVYVLKKEEG
GRHTPFFHNYKPQFYFRTTDVTGGIEFKPGREMVVPGDNVELTVTLIAPIAIEEGTKFSIREGGRTVGAGSVTKILK
>P23568 ~~~tuf~~~Elongation factor Tu~~~
MAREKFDRSKPHVNVGTIGHIDHGKTTLTAAICTVLAKEGKSAATRYDQIDKAPEEKARGITINSAHVEYSSDKRHYAHV
DCPGHADYIKNMITGAAQMDGAILVVSATDSVMPQTREHILLARQVGVPRMVVFLNKCDIATDEEVQELVAEEVRDLLTS
YGFDGKNTPIIYGSALKALEGDPKWEAKIHDLMNAVDEWIPTPEREVDKPFLLAIEDTMTITGRGTVVTGRVERGELKVG
QEIEIVGLRPIRKAVVTGIEMFKKELDSAMAGDNAGVLLRGVDRKEVERGQVLAKPGSIKPHKKFKAEIYALKKEEGGRH
TGFLNGYRPQFYFRTTDVTGSISLPENTEMVLPGDNTSITVELIAPIACEKGSKFSIREGGRTVGAGSVTEVLE
>A0QS98 ~~~tuf~~~Elongation factor Tu~~~COG0050
MAKAKFERTKPHVNIGTIGHVDHGKTTLTAAITKVLHDKFPDLNESRAFDQIDNAPEERQRGITINISHVEYQTDKRHYA
HVDAPGHADYIKNMITGAAQMDGAILVVAATDGPMPQTREHVLLARQVGVPYILVALNKSDAVDDEELIELVEMEVRELL
AAQDFDEEAPVVRVSALKALEGDPKWVKSVEELMEAVDASIPDPVRETDKPFLMPVEDVFTITGRGTVVTGRVERGVINV
NEEVEIVGIRPETTKTTVTGVEMFRKLLDQGQAGDNVGLLLRGIKREDVERGQVVVKPGTTTPHTEFEGQVYILSKDEGG
RHTPFFNNYRPQFYFRTTDVTGVVTLPEGTEMVMPGDNTDISVKLIQPVAMDEGLRFAIREGGRTVGAGRVTKIIK
>P9WNN1 ~~~tuf~~~Elongation factor Tu~~~COG0050
MAKAKFQRTKPHVNIGTIGHVDHGKTTLTAAITKVLHDKFPDLNETKAFDQIDNAPEERQRGITINIAHVEYQTDKRHYA
HVDAPGHADYIKNMITGAAQMDGAILVVAATDGPMPQTREHVLLARQVGVPYILVALNKADAVDDEELLELVEMEVRELL
AAQEFDEDAPVVRVSALKALEGDAKWVASVEELMNAVDESIPDPVRETDKPFLMPVEDVFTITGRGTVVTGRVERGVINV
NEEVEIVGIRPSTTKTTVTGVEMFRKLLDQGQAGDNVGLLLRGVKREDVERGQVVTKPGTTTPHTEFEGQVYILSKDEGG
RHTPFFNNYRPQFYFRTTDVTGVVTLPEGTEMVMPGDNTNISVKLIQPVAMDEGLRFAIREGGRTVGAGRVTKIIK
>P64027 ~~~tufA~~~Elongation factor Tu~~~
MAKEKFERSKPHVNVGTIGHVDHGKTTLTAALTTILAKKFGGAAKAYDQIDNAPEEKARGITINTSHVEYETETRHYAHV
DCPGHADYVKNMITGAAQMDGAILVCSAADGPMPQTREHILLARQVGVPYIIVFMNKCDMVDDAELLELVEMEIRDLLSS
YDFPGDDCPIVQGSALKALEGDAAYEEKIFELAAALDSYIPTPERAVDKPFLLPIEDVFSISGRGTVVTGRVERGIIHVG
DEIEIVGLKETQKTTCTGVEMFRKLLDEGQAGDNVGVLLRGTKREDVERGQVLAKPGTITPHTKFKAEVYVLSKEEGGRH
TPFFANYRPQFYFRTTDVTGAVTLEEGVEMVMPGENVTITVELIAPIAMEEGLRFAIREGGRTVGAGVVSSVIA
>Q8YP63 ~~~tuf~~~Elongation factor Tu~~~COG0050
MARAKFERTKPHVNIGTIGHVDHGKTTLTAAITMTLAALGQAVAKGYDQIDNAPEEKARGITINTAHVEYETANRHYAHV
DCPGHADYVKNMITGAAQMDGAILVVAATDGPMPQTREHILLAKQVGVPKLVVFLNKEDMMEDAELLELVELELRELLTE
YEFDGDDIPIVRGSGLQALEVMTKNPKTQRGENPWVDKIYELMDAVDSYIPDPERDIDKPFLMAVEDVFSITGRGTVATG
RIERGKVKVGDVVELVGIRDTRNTTVTGIEMFKKSLDEGMAGDNAGVLLRGIQKADIERGMVLAKPGSITPHTQFEGEVY
VLTEKEGGRKTPFFAGYRPQFYVRTTDVTGTIKAFTSDEGETVEMVMPGDRIKVTVELINPIAIEQGMRFAIREGGRTIG
AGVVSKIVK
>Q02T82 ~~~tuf1~~~Elongation factor Tu~~~
MAKEKFERNKPHVNVGTIGHVDHGKTTLTAALTKVCSDTWGGSARAFDQIDNAPEEKARGITINTSHVEYDSAVRHYAHV
DCPGHADYVKNMITGAAQMDGAILVCSAADGPMPQTREHILLSRQVGVPYIVVFLNKADMVDDAELLELVEMEVRDLLNT
YDFPGDDTPIIIGSALMALEGKDDNGIGVSAVQKLVETLDSYIPEPVRAIDQPFLMPIEDVFSISGRGTVVTGRVERGII
KVQEEVEIVGIKATTKTTCTGVEMFRKLLDEGRAGENVGILLRGTKREDVERGQVLAKPGTIKPHTKFECEVYVLSKEEG
GRHTPFFKGYRPQFYFRTTDVTGNCELPEGVEMVMPGDNIKMVVTLIAPIAMEDGLRFAIREGGRTVGAGVVAKIIE
>P09591 ~~~tufA~~~Elongation factor Tu~~~
MAKEKFERNKPHVNVGTIGHVDHGKTTLTAALTKVCSDTWGGSARAFDQIDNAPEEKARGITINTSHVEYDSAVRHYAHV
DCPGHADYVKNMITGAAQMDGAILVCSAADGPMPQTREHILLSRQVGVPYIVVFLNKADMVDDAELLELVEMEVRDLLNT
YDFPGDDTPIIIGSALMALEGKDDNGIGVSAVQKLVETLDSYIPEPVRAIDQPFLMPIEDVFSISGRGTVVTGRVERGII
KVQEEVEIVGIKATTKTTCTGVEMFRKLLDEGRAGENVGILLRGTKREDVERGQVLAKPGTIKPHTKFECEVYVLSKEEG
GRHTPFFKGYRPQFYFRTTDVTGNCELPEGVEMVMPGDNIKMVVTLIAPIAMEDGLRFAIREGGRTVGAGVVAKIIE
>P99152 ~~~tuf~~~Elongation factor Tu~~~
MAKEKFDRSKEHANIGTIGHVDHGKTTLTAAIATVLAKNGDSVAQSYDMIDNAPEEKERGITINTSHIEYQTDKRHYAHV
DCPGHADYVKNMITGAAQMDGGILVVSAADGPMPQTREHILLSRNVGVPALVVFLNKVDMVDDEELLELVEMEVRDLLSE
YDFPGDDVPVIAGSALKALEGDAQYEEKILELMEAVDTYIPTPERDSDKPFMMPVEDVFSITGRGTVATGRVERGQIKVG
EEVEIIGLHDTSKTTVTGVEMFRKLLDYAEAGDNIGALLRGVAREDVQRGQVLAAPGSITPHTEFKAEVYVLSKDEGGRH
TPFFSNYRPQFYFRTTDVTGVVHLPEGTEMVMPGDNVEMTVELIAPIAIEDGTRFSIREGGRTVGSGVVTEIIK
>Q5XD49 ~~~tuf~~~Elongation factor Tu~~~
MAKEKYDRSKPHVNIGTIGHVDHGKTTLTAAITTVLARRLPSSVNQPKDYASIDAAPEERERGITINTAHVEYETATRHY
AHIDAPGHADYVKNMITGAAQMDGAILVVASTDGPMPQTREHILLSRQVGVKHLIVFMNKVDLVDDEELLELVEMEIRDL
LSEYDFPGDDLPVIQGSALKALEGDTKFEDIIMELMDTVDSYIPEPERDTDKPLLLPVEDVFSITGRGTVASGRIDRGTV
RVNDEIEIVGIKEETKKAVVTGVEMFRKQLDEGLAGDNVGILLRGVQRDEIERGQVIAKPGSINPHTKFKGEVYILSKDE
GGRHTPFFNNYRPQFYFRTTDVTGSIELPAGTEMVMPGDNVTINVELIHPIAVEQGTTFSIREGGRTVGSGIVSEIEA
>P74227 ~~~tuf~~~Elongation factor Tu~~~COG0050
MARAKFERTKDHVNIGTIGHVDHGKTTLTAAITMTLAELGGAKARKYEDIDAAPEEKARGITINTAHVEYETDSRHYAHV
DCPGHADYVKNMITGAAQMDGAILVVSAADGPMPQTREHILLAKQVGVPKLVVFLNKKDMVDDEELLELVELEVRELLSD
YDFPGDDIPIVAGSALKAIEGEKEYKDAILELMKAVDDYIDTPEREVDKPFLMAVEDVFSITGRGTVATGRIERGKVKVG
EEISIVGIKDTRKATVTGVEMFQKTLEEGMAGDNVGLLLRGIQKEDIERGMVLAKPGSITPHTEFEGEVYVLKKEEGGRH
TPFFANYRPQFYVRTTDVTGTIKSYTADDGSAVEMVMPGDRIKMTVELINPIAIEQGMRFAIREGGRTIGAGVVSKILK
>Q01698 ~~~tuf~~~Elongation factor Tu~~~
MAKGEFIRTKPHVNVGTIGHVDHGKTTLTAALTYVAAAENPNVEVKDYGDIDKAPEERARGITINTAHVEYETAKRHYSH
VDCPGHADYIKNMITGAAQMDGAILVVSAADGPMPQTREHILLARQVGVPYIVVFMNKVDMVDDPELLDLVEMEVRDLLN
QYEFPGDEVPVIRGSALLALEEMHKNPKTKRGENEWVDKIWELLDAIDEYIPTPVRDVDKPFLMPVEDVFTITGRGTVAT
GRIERGKVKVGDEVEIVGLAPETRKTVVTGVEMHRKTLQEGIAGDNVGLLLRGVSREEVERGQVLAKPGSITPHTKFEAS
VYILKKEEGGRHTGFFTGYRPQFYFRTTDVTGVVRLPQGVEMVMPGDNVTFTVELIKPVALEEGLRFAIREGGRTVGAGV
VTKILE
>B9K884 ~~~tuf~~~Elongation factor Tu~~~COG0050
MAKEKFVRTKPHVNVGTIGHIDHGKSTLTAAITKYLSLKGLAQYVPYDQIDKAPEEKARGITINITHVEYETEKRHYAHI
DCPGHADYIKNMITGAAQMDGAILVVAATDGPMPQTREHVLLARQVEVPYMIVFINKTDMVDDPELIELVEMEVRDLLSQ
YEYPGDEVPVIKGSALKALEAPDDPNHEAYKPIQELLDAMDNYIPDPQRDVDKPFLMPIEDVFSITGRGTVVTGRIERGR
IRPGDEVEIIGLSYEIRKTVVTSVEMFRKELDEGIAGDNVGCLLRGIDKDEVERGQVLAAPGSIKPHKRFKAEVYVLKKE
EGGRHTPFTKGYKPQFYIRTADVTGEIVGLPEGVEMVMPGDHVEMEIELIYPVAIEKGQRFAIREGGRTVGAGVVTEVIE
>Q9P9Q9 ~~~tufA~~~Elongation factor Tu~~~COG0050
MAQDKFKRTKLHVNVGTIGHVDHGKTTLTAALTKVGAERFGGEFKAYDAIDAAPEEKARGITISTAHVEYETEVRHYAHV
DCPGHADYVKNMITGAAQMDGAILVCSAADGPMPQTREHILLARQVGVPYIVVFLNKADMVDDAELLELVEMEVRELLSK
YDFPGDDTPIVRGSALKALEGDQSEIGVPAIIRLAEALDTHIPNPERAIDRPFLMPVEDVFSISGRGTVVTGRVECGVIK
VGDEVEIVGIRPTSKTIVTGVEMFRKLLDQGQAGDNAGLLLRGTKRDEVERGQVLAKPGSIKAHKEFEAEVYVLSKEEGG
RHTPFFNGYTPQFYMRTTDITGKVCLPEGVEMVMPGDNVKVTVSLINPVAMGEGQRFAIREGGRTVGAGVVSKVIG
>Q838U8 2.4.2.31~~~~~~NAD(+)--arginine ADP-ribosyltransferase EFV~~~COG2369
MSQLNKWQKELQALQKANYQETDNQLFNVYRQSLIDIKKRLKVYTENAESLSFSTRLEVERLFSVADEINAILQLNSPKV
EKTIKGYSAKQAEQGYYGLWYTLEQSQNIALSMPLINHDYIMNLVNAPVAGKRLSKRLYKYRDELAQNVTNNIITGLFEG
KSYAEIARWINEETEASYKQALRIARTEAGRTQSVTTQKGYEEAKELGINIKKKWLATIDKHTRRTHQELDGKEVDVDEE
FTIRGHSAKGPRMFGVASEDVNCRCTTIEVVDGISPELRKDNESKEMSEFKSYDEWYADRIRQNESKPKPNFTELDFFGQ
SDLQDDSDKWVAGLKPEQVNAMKDYTSDAFAKMNKILRNEKYNPREKPYLVNIIQNLDDAISKFKLKHDIITYRGVSANE
YDAILNGNVFKEFKSTSINKKVAEDFLNFTSANKDGRVVKFLIPKGTQGAYIGTNSSMKKESEFLLNRNLKYTVEIVDNI
LEVTILG
>A0A3S5YBC7 3.2.1.123~~~~~~Endoglycoceramidase I~~~
MRKTVVAFAAAIAACSAVLSSTTTSAAPPATPITTLQADGTHLVDGYGRTVLLHGVNNVDKDAPYLPAGETLTPQDIDIL
VRHGFNTVRLGTSFDALMPQRGQIDEAYLDRLTGVVDALTARGMHVLLDNHQDGLSKAWGGNGFPEWAIESRPREWEPNP
GFPLYYLMPSLNAGWDEVWGNTHGALDHLGTALGALAERVEGKPGVMGIELLNEPWPGSRFLSCFPNGCPDFDRTYQAAM
QKLTDAVRAQNPTIPVYWEPNVTWNQMMPSNLFAPPVTPALTTADVVFAPHDYCIPSQLAIYLGLPQALRGLCVPQQDLT
WSNIDAITERANVPTVITEFGDGDPTVLKNTLARADERFIGWQYWHFGAGNATDPFLGEVGRQLVRTYPQATAGEPGRMI
FDADNGDFAYRFTPRAATRPTEIFVSDLHYPDGYAVQVDGGQVTSAPGARIVTVVADGSGPVTVKINRPGSAGAEVPDGP
IETSSSGSSGSS
>P9WPK6 6.3.2.2~~~egtA~~~Glutamate--cysteine ligase EgtA~~~
MTLAAMTAAASQLDNAAPDDVEITDSSAAAEYIADGCLVDGPLGRVGLEMEAHCFDPADPFRRPSWEEITEVLEWLSPLP
GGSVVSVEPGGAVELSGPPADGVLAAIGAMTRDQAVLRSALANAGLGLVFLGADPLRSPVRVNPGARYRAMEQFFAASHS
GVPGAAMMTSTAAIQVNLDAGPQEGWAERVRLAHALGPTMIAIAANSPMLGGRFSGWQSTRQRVWGQMDSARCGPILGAS
GDHPGIDWAKYALKAPVMMVRSPDTQDTRAVTDYVPFTDWVDGRVLLDGRRATVADLVYHLTTLFPPVRPRQWLEIRYLD
SVPDEVWPAVVFTLVTLLDDPVAADLAVDAVEPVATAWDTAARIGLADRRLYLAANRCLAIAARRVPTELIGAMQRLVDH
VDRGVCPADDFSDRVIAGGIASAVTGMMHGAS
>P9WPK7 6.3.2.2~~~egtA~~~Glutamate--cysteine ligase EgtA~~~COG3572
MTLAAMTAAASQLDNAAPDDVEITDSSAAAEYIADGCLVDGPLGRVGLEMEAHCFDPADPFRRPSWEEITEVLEWLSPLP
GGSVVSVEPGGAVELSGPPADGVLAAIGAMTRDQAVLRSALANAGLGLVFLGADPLRSPVRVNPGARYRAMEQFFAASHS
GVPGAAMMTSTAAIQVNLDAGPQEGWAERVRLAHALGPTMIAIAANSPMLGGRFSGWQSTRQRVWGQMDSARCGPILGAS
GDHPGIDWAKYALKAPVMMVRSPDTQDTRAVTDYVPFTDWVDGRVLLDGRRATVADLVYHLTTLFPPVRPRQWLEIRYLD
SVPDEVWPAVVFTLVTLLDDPVAADLAVDAVEPVATAWDTAARIGLADRRLYLAANRCLAIAARRVPTELIGAMQRLVDH
VDRGVCPADDFSDRVIAGGIASAVTGMMHGAS
>A0R5N0 1.14.99.50~~~egtB~~~Hercynine oxygenase~~~COG1262
MIARETLADELALARERTLRLVEFDDAELHRQYNPLMSPLVWDLAHIGQQEELWLLRDGNPDRPGMLAPEVDRLYDAFEH
SRASRVNLPLLPPSDARAYCATVRAKALDTLDTLPEDDPGFRFALVISHENQHDETMLQALNLREGPPLLDTGIPLPAGR
PGVAGTSVLVPGGPFVLGVDALTEPHSLDNERPAHVVDIPSFRIGRVPVTNAEWREFIDDGGYDQPRWWSPRGWAHRQEA
GLVAPQFWNPDGTRTRFGHIEEIPGDEPVQHVTFFEAEAYAAWAGARLPTEIEWEKACAWDPVAGARRRFPWGSAQPSAA
LANLGGDARRPAPVGAYPAGASAYGAEQMLGDVWEWTSSPLRPWPGFTPMIYERYSTPFFEGTTSGDYRVLRGGSWAVAP
GILRPSFRNWDHPIRRQIFSGVRLAWDV
>G7CFI3 1.14.99.50~~~egtB~~~Hercynine oxygenase~~~COG1262
MTGVAVPHRAELARQLIDARNRTLRLVDFDDAELRRQYDPLMSPLVWDLAHIGQQEELWLLRGGDPRRPGLLEPAVEQLY
DAFVHPRASRVHLPLLSPAQARRFCATVRSAVLDALDRLPEDADTFAFGMVVSHEHQHDETMLQALNLRSGEPLLGSGTA
LPPGRPGVAGTSVLVPGGPFVLGVDLADEPYALDNERPAHVVDVPAFRIGRVPVTNAEWRAFIDDGGYRQRRWWSDAGWA
YRCEAGLTAPQFWNPDGTRTRFGHVEDIPPDEPVQHVTYFEAEAYAAWAGARLPTEIEWEKACAWDPATGRRRRYPWGDA
APTAALANLGGDALRPAPVGAYPAGASACGAEQMLGDVWEWTSSPLRPWPGFTPMIYQRYSQPFFEGAGSGDYRVLRGGS
WAVAADILRPSFRNWDHPIRRQIFAGVRLAWDVDRQTARPGPVGGC
>Q7D513 1.14.99.50~~~egtB~~~Hercynine oxygenase~~~
MTSPEQLACHLARARARTLRLVDFDDAELCCQYDPLMSPLVWDLAHIGQQEELWLLRGGDPGQPGLLPPAVEGLYDAFEH
SRASRVELPLLSPARARSYCATVRSAALDALAALPEDGDSFVFAMVISHENQHDETMLQALNLRTGSPLLAATSALPAGR
PRMAGTSVLVAGGPFVLGVDAADEPCSLDNERQAHVVDVPAFRIGRVPVTNGEWQDFIDDGGYTQSRWWSERGWQHRQRA
GLTAPQFWRSGGRTRTRFGHVEDIPADEPVQHVSYFEAEAYAAWAGARLPTEVEWEKACAWDPATGSRRRYPWGTEEPTD
TYANLGGQTLRPAPVGAYPAGASACGAEQMLGDVWEWTTSPLRPWPGFVPMVYERYSQPFFGGDYRVLRGGSWAVEPAIL
RPSFRNWDHPYRRQIFAGVRLAWDI
>O69671 1.14.99.50~~~egtB~~~Hercynine oxygenase~~~COG1262
MTSPEQLACHLARARARTLRLVDFDDAELCCQYDPLMSPLVWDLAHIGQQEELWLLRGGDPGQPGLLPPAVEGLYDAFEH
SRASRVELPLLSPARARSYCATVRSAALDALAALPEDGDSFVFAMVISHENQHDETMLQALNLRTGSPLLAATSALPAGR
PRMAGTSVLVAGGPFVLGVDAADEPCSLDNERPAHVVDVPAFRIGRVPVTNGEWQDFIDDGGYTQSRWWSERGWQHRQRA
GLTAPQFWRSGGRTRTRFGHVEDIPADEPVQHVSYFEAEAYAAWAGARLPTEVEWEKACAWDPATGSRRRYPWGTEEPTD
TYANLGGQTLRPAPVGAYPAGASACGAEQMLGDVWEWTTSPLRPWPGFVPMVYERYSQPFFGGDYRVLRGGSWAVEPAIL
RPSFRNWDHPYRRQIFAGVRLAWDI
>A0R5M9 3.5.1.118~~~egtC~~~Gamma-glutamyl-hercynylcysteine sulfoxide hydrolase~~~COG0121
MCRHVAWLGAPRSLADLVLDPPQGLLVQSYAPRRQKHGLMNADGWGAGFFDDEGVARRWRSDKPLWGDASFASVAPALRS
RCVLAAVRSATIGMPIEPSASAPFSDGQWLLSHNGLVDRGVLPLTGAAESTVDSAIVAALIFSRGLDALGATIAEVGELD
PNARLNILAANGSRLLATTWGDTLSVLHRPDGVVLASEPYDDDPGWSDIPDRHLVDVRDAHVVVTPL
>Q8VIV2 3.5.1.118~~~egtC~~~Gamma-glutamyl-hercynylcysteine sulfoxide hydrolase~~~
MCRHLGWLGAQVAVSSLVLDPPQGLRVQSYAPRRQKHGLMNADGWGVGFFDGAIPRRWRSAAPLWGDTSFHSVAPALRSH
CILAAVRSATVGMPIEVSATPPFTDGHWLLAHNGVVDRAVLPAGPAAESVCDSAILAATIFAHGLDALGDTIVKVGAADP
NARLNILAANGSRLIATTWGDTLSILRRADGVVLASEPYDDDSGWGDVPDRHLVEVTQKGVTLTALDRAKGPR
>O69670 3.5.1.118~~~egtC~~~Gamma-glutamyl-hercynylcysteine sulfoxide hydrolase~~~COG0121
MCRHLGWLGAQVAVSSLVLDPPQGLRVQSYAPRRQKHGLMNADGWGVGFFDGAIPRRWRSPAPLWGDTSFHSVAPALRSH
CILAAVRSATVGMPIEVSATPPFTDGHWLLAHNGVVDRAVLPAGPAAESVCDSAILAATIFAHGLDALGDTIVKVGAADP
NARLNILAANGSRLIATTWGDTLSILRRADGVVLASEPYDDDSGWGDVPDRHLVEVTQKGVTLTALDRAKGPR
>A0R5M8 2.1.1.44~~~egtD~~~Histidine N-alpha-methyltransferase~~~COG4301
MTLSLANYLAADSAAEALRRDVRAGLTAAPKSLPPKWFYDAVGSDLFDQITRLPEYYPTRTEAQILRTRSAEIIAAAGAD
TLVELGSGTSEKTRMLLDAMRDAELLRRFIPFDVDAGVLRSAGAAIGAEYPGIEIDAVCGDFEEHLGKIPHVGRRLVVFL
GSTIGNLTPAPRAEFLSTLADTLQPGDSLLLGTDLVKDTGRLVRAYDDAAGVTAAFNRNVLAVVNRELSADFDLDAFEHV
AKWNSDEERIEMWLRARTAQHVRVAALDLEVDFAAGEEMLTEVSCKFRPENVVAELAEAGLRQTHWWTDPAGDFGLSLAV
R
>P9WN46 2.1.1.44~~~egtD~~~Histidine N-alpha-methyltransferase~~~
MRVSVANHLGEDAGHLALRRDVYSGLQKTPKSLPPKWFYDTVGSELFDQITRLPEYYPTRAEAEILRARSAEVASACRAD
TLVELGSGTSEKTRMLLDALRHRGSLRRFVPFDVDASVLSATATAIQREYSGVEINAVCGDFEEHLTEIPRGGRRLFVFL
GSTIGNLTPGPRAQFLTALAGVMRPGDSLLLGTDLVKDAARLVRAYDDPGGVTAQFNRNVLAVINRELEADFDVDAFQHV
ARWNSAEERIEMWLRADGRQRVRVGALDLTVDFDAGEEMLTEVSCKFRPQAVGAELAAAGLHRIRWWTDEAGDFGLSLAA
K
>P9WN47 2.1.1.44~~~egtD~~~Histidine N-alpha-methyltransferase~~~COG4301
MRVSVANHLGEDAGHLALRRDVYSGLQKTPKSLPPKWFYDTVGSELFDQITRLPEYYPTRAEAEILRARSAEVASACRAD
TLVELGSGTSEKTRMLLDALRHRGSLRRFVPFDVDASVLSATATAIQREYSGVEINAVCGDFEEHLTEIPRGGRRLFVFL
GSTIGNLTPGPRAQFLTALAGVMRPGDSLLLGTDLVKDAARLVRAYDDPGGVTAQFNRNVLAVINRELEADFDVDAFQHV
ARWNSAEERIEMWLRADGRQRVRVGALDLTVDFDAGEEMLTEVSCKFRPQAVGAELAAAGLHRIRWWTDEAGDFGLSLAA
K
>Q7D515 4.4.-.-~~~egtE~~~Probable hercynylcysteine sulfoxide lyase~~~
MQDEAMRRSGANSPAGDSLADRWRAARPPVAGLHLDSAACSRQSFAALDAAAQHARHEAEVGGYVAAEAAAAVLDAGRAA
VAALSGLPDAEVVFTTGSLHALDLLLGSWPGENRTLACLPGEYGPNLAVMAAHGFDVRPLPTLQDGRVALDDAAFMLADD
PPDLVHLTVVASHRGVAQPLAMVAQLCTELKLPLVVDAAQGLGHVDCAVGADVTYASSRKWIAGPRGVGVLAVRPELMER
LRARLPAPDWMPPLTVAQQLGFGEANVAARVGFSVALGEHLACGPQAIRARLAELGDIARTVLADVSGWRVVEAVDEPSA
ITTLAPIDGADPAAVRAWLLSQRRIVTTYAGVERAPLELPAPVLRISPHVDNTADDLDAFAEALVAATAATSGER
>O69668 4.4.-.-~~~egtE~~~Probable hercynylcysteine sulfoxide lyase~~~COG0520
MRRSGANSPAGDSLADRWRAARPPVAGLHLDSAACSRQSFAALDAAAQHARHEAEVGGYVAAEAAAAVLDAGRAAVAALS
GLPDAEVVFTTGSLHALDLLLGSWPGENRTLACLPGEYGPNLAVMAAHGFDVRPLPTLQDGRVALDDAAFMLADDPPDLV
HLTVVASHRGVAQPLAMVAQLCTELKLPLVVDAAQGLGHVDCAVGADVTYASSRKWIAGPRGVGVLAVRPELMERLRARL
PAPDWMPPLTVAQQLGFGEANVAARVGFSVALGEHLACGPQAIRARLAELGDIARTVLADVSGWRVVEAVDEPSAITTLA
PIDGADPAAVRAWLLSQRRIVTTYAGVERAPLELPAPVLRISPHVDNTADDLDAFAEALVAATAATSGER
>Q8Y775 ~~~egtUBC~~~Probable ergothioneine transporter EgtUBC~~~COG1174
MNTLIDTFTVRKDELFTALVQHIQISFVSLFIAVLIALPLGIYLTRHKRLAEPIIQVAAIFQTIPSLALLGLLIPLVGIG
IVPAIIALVIYALLPILRNTYTGIKEVDPALVEASRAMGMNKWKRLYKVQLPLAMPVIMAGIRTAMVLIIGTATLAALIG
AGGLGDLILLGIDRNDNSLILLGAIPAALLAILFDFLLRFLEKASFKSTIITISAGILLTAAIIVVPYFASDKKEITIAG
KLGAEPEILINMYKLVIEDETDLKVNVKPNMGKTSFVFNALKSGDIDIYPEFTGTVLETFLKENAKTHDPEEVYTQARDG
LAKDFDMTYLKPMKYNNTYALAVSPEFAKENNLEKISDLGPVSDQVKAGFTLEFKDRSDGYKGIQDKYGLTFSNLKTMEP
KLRYNAIKSGDINLLDAYSTDSELAQYKLKVLEDDQQLFPPYQGAPLMLTKTLDKYPELKKPLNKLAGKITDDEMRKMNY
EVNVNGKSAYTVAKDYLKDQGIIK
>A0A0H2ZQB9 ~~~egtUBC~~~Ergothioneine transporter EgtUBC~~~COG1174
MTNLIATFQDRFGDWLTALSQHLQLSLLTLLLAILLAIPLAVYLRYHEKLADWVLQIAGIFQTIPSLALLGLFIPLMGIG
TLPALTALVIYAIFPILQNTITGLKGIDPSLQEAGIAFGMTRWERLKKFEIPLAMPVIMSGIRTAAVLIIGTATLATLIG
AGGLGSFILLGIDRNNASLILIGALSSAVLAIAFNFLLKVMEKAKLRTIFSGFALMALLLGLSYSPALLAQKEKENLIIA
GKIGPEPEILANMYKLLIEENTSMTATVKPNFGTTSFLYEALKKGDIDIYPEFTGTVTESLLQPSPKVSHEPEQVYQVAR
DGIAKQDHLAYLKPMSYQNTYAVAVPKKIAQEYGLKTISDLKKVEGQLKAGFTLEFNDREDGNKGLQSMYGLNLNVATMQ
PALRYQAIHSGDIQITDAYSTDAELERYDLQVLEDDKQLFPPYQGAPLMKEALLKKHPELERVLNTLAGKITESQMSQLN
YQVGVEGKSAKQVAKEFLQEQGLLKK
>B5Z7I3 ~~~egtU~~~Ergothioneine transport permease/ergothioneine binding protein EgtU~~~
MLGMGVFKQLIKELYEWLLHSMDMATQHLVAIVLKISVVKYLIKEFHDRFIYFIDLLAQHFIIVALSGFLVLVFGVLIGV
FAFYNSKARAFLLPVVNFLYTIPSLALFALFIPVIGVGLKNALLVLVLYGLLPIVYSTYNALKEVREEVIKAAIGLGCNP
KELFFRVHFLLAIPQILAGLRIAVVMLVAMAGIGALIGAGGLGQAIFRGLNTQNTTLLVAGSLIIALFSVLADKFVSVFQ
HENALQRLFSQNATQKQKRRVYTNLAVFLFLLLASALWLIPRNAIEEKPLVVATKPSSEQYILGEILSLLLEKHHIPIKR
AFGIGGGTMNIHPALIRGDFDLYMEYTGTAWVNTLKNPLTQKVDFETIKKRYEKEFNLLWVGLLGFNNTYSLAISKEDAQ
KYAIETFSDLALHSQNFDFGAEFDFFEREDAFKGLMKAYRFHFRSLHEMDINLRYKSFESHKINALDVFTTDAQIKELDL
KVLKDDKGFFPNYQAGIVIRKEIIKKYPEALKILEKLDSKINDETMQDLNYQVEVLKKSPKIVAKDFLERLGL
>B5Z7I4 7.4.2.-~~~egtV~~~Ergothioneine transport ATP-binding protein EgtV~~~
MKEIVTIENVSFNYRNRAVFKDFNLSIEKGDFLCVLGESGSGKSTLLGLILGLLKPSLGSVKIFNETLSNNAFLRQKIGY
IAQGNSLFSHLNALQNMTFCLNLQGINKQAAQKEAKALALKMGLDESLMDKFPNELSGGQAQRVGIIRGIIHKPELILLD
EPFSALDSFNRKNLQDLIKEIHQNSCATFIMVTHDEEEAQKLATKTLEIKALK
>Q7DJ60 ~~~ehaG~~~Autotransporter adhesin EhaG~~~COG5295
MNKIFKVIWNPATGNYTVTSETAKSRGKKSGRSKLLISALVAGGMLSSFGALANAGNDNGQGVDYGSGSAGDGWVAIGKG
AKANTFMNTSGSSTAVGYDAIAEGQYSSAIGSKTHAIGGASMAFGVSAISEGDRSIALGASSYSLGQYSMALGRYSKALG
KLSIAMGDSSKAEGANAIALGNATKATEIMSIALGDTANASKAYSMALGASSVASEENAIAIGAETEAAENATAIGNNAK
AKGTNSMAMGFGSLADKVNTIALGNGSQALADNAIAIGQGNKADGVDAIALGNGSQSRGLNTIALGTASNATGDKSLALG
SNSSANGINSVALGADSIADLDNTVSVGNSSLKRKIVNVKNGAIKSDSYDAINGSQLYAISDSVAKRLGGGAAVDVDDGT
VTAPTYNLKNGSKNNVGAALAVLDENTLQWDQTKGKYSAAHGTSSPTASVITDVADGTISASSKDAVNGSQLKATNDDVE
ANTANIATNTSNIATNTANIATNTTNITNLTDSVGDLQADALLWNETKKAFSAAHGQDTTSKITNVKDADLTADSTDAVN
GSQLKTTNDAVATNTTNIANNTSNIATNTTNISNLTETVTNLGEDALKWDKDNGVFTAAHGTETTSKITNVKDGDLTTGS
TDAVNGSQLKTTNDAVATNTTNIATNTTNISNLTETVTNLGEDALKWDKDNGVFTAAHGNNTASKITNILDGTVTATSSD
AINGSQLYDLSSNIATYFGGNASVNTDGVFTGPTYKIGETNYYNVGDALAAINSSFSTSLGDALLWDATAGKFSAKHGTN
GDASVITDVADGEISDSSSDAVNGSQLHGVSSYVVDALGGGAEVNADGTITAPTYTIANADYDNVGDALNAIDTTLDDAL
LWDADAGENGAFSAAHGKDKTASVITNVANGAISAASSDAINGSQLYTTNKYIADALGGDAEVNADGTITAPTYTIANAE
YNNVGDALDALDDNALLWDETANGGAGAYNASHDGKASIITNVANGSISEDSTDAVNGSQLNATNMMIEQNTQIINQLAG
NTDATYIQENGAGINYVRTNDDGLAFNDASAQGVGATAIGYNSVAKGDSSVAIGQGSYSDVDTGIALGSSSVSSRVIAKG
SRDTSITENGVVIGYDTTDGELLGALSIGDDGKYRQIINVADGSEAHDAVTVRQLQNAIGAVATTPTKYFHANSTEEDSL
AVGTDSLAMGAKTIVNGDKGIGIGYGAYVDANALNGIAIGSNAQVIHVNSIAIGNGSTTTRGAQTNYTAYNMDAPQNSVG
EFSVGSADGQRQITNVAAGSADTDAVNVGQLKVTDAQVSQNTQSITNLDNRVTNLDSRVTNIENGIGDIVTTGSTKYFKT
NTDGVDASAQGKDSVAIGSGSIAAADNSVALGTGSVATEENTISVGSSTNQRRITNVAAGKNATDAVNVAQLKSSEAGGV
RYDTKADGSIDYSNITLGGGNGGTTRISNVSAGVNNNDVVNYAQLKQSVQETKQYTDQRMVEMDNKLSKTESKLSGGIAS
AMAMTGLPQAYTPGASMASIGGGTYNGESAVALGVSMVSANGRWVYKLQGSTNSQGEYSAALGAGIQW
>Q8GPH6 ~~~ehpR~~~Phenazine antibiotic resistance protein EhpR~~~
MTDLAGPTITPNLQLVYVSNVERSTDFYRFIFKKEPVFVTPRYVAFPSSGDALFAIWSGGEEPVAEIPRFSEIGIMLPTG
EDVDKLFNEWTKQKSHQIIVIKEPYTDVFGRTFLISDPDGHIIRVCPLD
>Q9LA60 ~~~eibA~~~Immunoglobulin-binding protein EibA~~~
MSKKFTKAVLSAAMAGVLFGVSFDIMAAEQSYSALNAQNGAGSIYKVYYNPDNKTAHIDWGGLGDVEKERNKPIPLLSKI
DGNGNVTITSADGSTTFTVYDKEVHDFMKAAASGKTDDIKTNLLTEQNIRDLYNRVSAIQQMETNVGLDEYGNVAVTPNE
IKERVSLQRYLAWESANSTIVANELEAQKGKLDAQKGELEAQKKNLGELTTRTDKIDAAAAATAAKVESRTLVGVSSDGT
LTRAEGAKNTISVNDGLVALSGRTDRIDAAVGAIDGRVTRNTQSIEKNSKAIAANTRTLQQHSARLDSQQRQINENHKEM
KRAAAQSAALTGLFQPYSVGKFNASAAVGGYSDEQALAVGVGYRFNEQTAAKAGVAFSDGDASWNVGVNFEF
>Q9LA56 ~~~eibC~~~Immunoglobulin-binding protein EibC~~~
MSKKFTMTLLSSSLAGLLVMSGGVSAQEEKYTVPYAIGEGKWGNTYEVVKTGGNGNFRYEVKEKNGKKRSLFTFDSKGDV
IINGSGITYTIHDGALNDFAQTAEKKKNGQSQSHRMTDSVVRDVYNKVYSLQRTKITGFSVEDGENGKVSLGSDAKASGE
FSVAVGTGARADKKFATAVGSWAAADGKQSTALGVGAYAYANASTAAGTAAYVDGSAIYGTAIGNYAKVDENATEGTALG
AKATVTNKNSVALGANSVTTRDNEVYIGYKTGTESDKTYGTRVLGGLSDGTRNSDAATVGQLNRKVGGVYDDVKARITVE
SEKQKKYTDQKTSEVNEKVEARTTVGVDSDGKLTRAEGATKTIAVNDGLVALSGRTDRIDYAVGAIDGRVTRNTQSIEKN
SKAIAANTRTLQQHSARLDSQQRQINENHKEMKRAAAQSAALTGLFQPYSVGKFNATAAVGGYSDQQALAVGVGYRFNEQ
TAAKAGVAFSDGDASWNVGVNFEF
>Q9MCI8 ~~~eibD~~~Immunoglobulin-binding protein EibD~~~
MSKKFTMTLLSSSLAGLLVMSGGVSAQNGTYSVLQDDSQKSGPVKYGSTYEVVKTVDNGNFRYEVKEKKNDKRTLFKFDS
EGNVTVKGKGITHTLHDPALKDFARTAEGKKNEQNGNTPPHKLTDSAVRGVYNKVYGLEKTEITGFSVEDGENGKVSLGS
DAKASGEFSVAVGNGARATEKASTAVGSWAAADGKQSTALGVGTYAYANASTALGSVAFVDNTATYGTAAGNRAKVDKDA
TEGTALGAKATVTNKNSVALGANSVTTRDNEVYIGYKTGTESDKTYGTRVLGGLSDGTRNSDAATVGQLNRKVGGVYDDV
KARITVESEKQKKYTDQKTSEVNEKVEARTTVGVDSDGKLTRAEGATKTIAVNDGLVALSGRTDRIDYAVGAIDGRVTRN
TQSIEKNSKAIAANTRTLQQHSARLDSQQRQINENHKEMKRAAAQSAALTGLFQPYSVGKFNATAAVGGYSDQQALAVGV
GYRFNEQTAAKAGVAFSDGDASWNVGVNFEF
>Q9LA53 ~~~eibE~~~Immunoglobulin-binding protein EibE~~~
MSKKFTMTLLSSSLAGLLVMSGGVSAQEEKYTVPYAIGEGKWGNTYEVVKTGGNGNFRYEVKEKNGKKRSLFTFDSKGDV
IINGSGITYTIHDGALNDFAQTAEKKKNGQSQSHRMTDSVVRDVYNKVYSLQRTKITGFSVEDGENGKVSLGSDAKASGE
FSVAVGNGAKATEKASTSVGSWSAALGRQSVALGVGTYAYANASTAAGTAAYVDGSAIYGTAIGNYAKVDKNATEGVALG
AKAISAHKNSVALGANSRTTRDNEVYIGYEEASGKAYKTRTLGGLTDGTRPSDAATVRQVDRVKDSVEQLAQDTNTRLVV
EAKKSREYTDSRTTVGVNPDGKLTRAEGATKTIAVNDGLVALSGRTDRIDYAVGSVDRRVTKNTQAIQSNTRQLQEHNAR
LNSQQRQIRENHEEMKRAAAQSAALAGLFQPYSVGKFNATAALGGYSDKQAVAVGVGYRFNEQTAAKAGIAASDGDVSYN
MGVNFEF
>Q2YRJ0 ~~~eipB~~~Cell envelope integrity protein EipB~~~
MRFVRIAAAASGATVFMWAGFAGAASAASAVRLVPHRAIYDLTLDRADEKSGISGLTGRMVYEFNGSACEGYTTNFRFVT
RVDMDEQPQRVTDQQTTTFEDADGKDFRFVNKTFVDKELVKEVRGDAKLEDGKTVVKLSKPKENTLDLKGTQFPTRHMEE
LIGKAEAGQKFYQTTLFDASEDADRVVATTVVVGKQQAVPDDETKVMGKFSKDQVWPVTIAYFDDKEQQDGMPIYRINFK
LYRNGITRDLTMDYGDFSMRGKLVKLDIYDTGKNKTGCSK
>A5U5B1 2.3.1.-~~~eis~~~N-acetyltransferase Eis~~~COG4552
MTVTLCSPTEDDWPGMFLLAAASFTDFIGPESATAWRTLVPTDGAVVVRDGAGPGSEVVGMALYMDLRLTVPGEVVLPTA
GLSFVAVAPTHRRRGLLRAMCAELHRRIADSGYPVAALHASEGGIYGRFGYGPATTLHELTVDRRFARFHADAPGGGLGG
SSVRLVRPTEHRGEFEAIYERWRQQVPGGLLRPQVLWDELLAECKAAPGGDRESFALLHPDGYALYRVDRTDLKLARVSE
LRAVTADAHCALWRALIGLDSMERISIITHPQDPLPHLLTDTRLARTTWRQDGLWLRIMNVPAALEARGYAHEVGEFSTV
LEVSDGGRFALKIGDGRARCTPTDAAAEIEMDRDVLGSLYLGAHRASTLAAANRLRTKDSQLLRRLDAAFASDVPVQTAF
EF
>P9WFK7 2.3.1.-~~~eis~~~N-acetyltransferase Eis~~~COG4552
MTVTLCSPTEDDWPGMFLLAAASFTDFIGPESATAWRTLVPTDGAVVVRDGAGPGSEVVGMALYMDLRLTVPGEVVLPTA
GLSFVAVAPTHRRRGLLRAMCAELHRRIADSGYPVAALHASEGGIYGRFGYGPATTLHELTVDRRFARFHADAPGGGLGG
SSVRLVRPTEHRGEFEAIYERWRQQVPGGLLRPQVLWDELLAECKAAPGGDRESFALLHPDGYALYRVDRTDLKLARVSE
LRAVTADAHCALWRALIGLDSMERISIITHPQDPLPHLLTDTRLARTTWRQDGLWLRIMNVPAALEARGYAHEVGEFSTV
LEVSDGGRFALKIGDGRARCTPTDAAAEIEMDRDVLGSLYLGAHRASTLAAANRLRTKDSQLLRRLDAAFASDVPVQTAF
EF
>Q9K498 ~~~~~~Bifunctional albaflavenone monooxygenase/terpene synthase~~~COG2124
MTVESVNPETRAPAAPGAPELREPPVAGGGVPLLGHGWRLARDPLAFMSQLRDHGDVVRIKLGPKTVYAVTNPELTGALA
LNPDYHIAGPLWESLEGLLGKEGVATANGPLHRRQRRTIQPAFRLDAIPAYGPIMEEEAHALTERWQPGKTVDATSESFR
VAVRVAARCLLRGQYMDERAERLCVALATVFRGMYRRMVVPLGPLYRLPLPANRRFNDALADLHLLVDEIIAERRASGQK
PDDLLTALLEAKDDNGDPIGEQEIHDQVVAILTPGSETIASTIMWLLQALADHPEHADRIRDEVEAVTGGRPVAFEDVRK
LRHTGNVIVEAMRLRPAVWVLTRRAVAESELGGYRIPAGADIIYSPYAIQRDPKSYDDNLEFDPDRWLPERAANVPKYAM
KPFSAGKRKCPSDHFSMAQLTLITAALATKYRFEQVAGSNDAVRVGITLRPHDLLVRPVAR
>P0AEH3 ~~~elaA~~~Protein ElaA~~~COG2153
MIEWQDLHHSELSVSQLYALLQLRCAVFVVEQNCPYQDIDGDDLTGDNRHILGWKNDELVAYARILKSDDDLEPVVIGRV
IVSEALRGEKVGQQLMSKTLETCTHHWPDKPVYLGAQAHLQNFYQSFGFIPVTEVYEEDGIPHIGMAREVIQA
>P0AEH5 ~~~elaB~~~Protein ElaB~~~COG4575
MSNQFGDTRIDDDLTLLSETLEEVLRSSGDPADQKYVELKARAEKALDDVKKRVSQASDSYYYRAKQAVYRADDYVHEKP
WQGIGVGAAVGLVLGLLLARR
>Q47013 3.4.22.-~~~elaD~~~Protease ElaD~~~COG5160
MMVTVVSNYCQLSQTQLSQTFAEKFTVTEELLQSLKKTALSGDEESIELLHNIALGYDKFGKEAEDILYHIVRTPTNETL
SIIRLIKNACLKLYNLAHIATNSPLKSHDSDDLLFKKLFSPSKLMTIIGDEIPLISEKQSLSKVLLNDENNELSDGTNFW
DKNRQLTTDEIACYLQKIAANAKNTQVNYPTGLYVPYSTRTHLEDALNENIKSDPSWPNEVQLFPINTGGHWILVSLQKI
VNKKNNKLQIKCVIFNSLRALGYDKENSLKRVINSFNSELMGEMSNNNIKVHLNEPEIIFLHADLQQYLSQSCGAFVCMA
AQEVIEQRESNSDSAPYTLLKNHADRFKKYSAEEQYEIDFQHRLANRNCYLDKYGDANINHYYRNLEIKHSQPKNRASGK
RVS
>P06717 ~~~eltA~~~Heat-labile enterotoxin A chain~~~
MKNITFIFFILLASPLYANGDRLYRADSRPPDEIKRSGGLMPRGHNEYFDRGTQMNINLYDHARGTQTGFVRYDDGYVST
SLSLRSAHLAGQSILSGYSTYYIYVIATAPNMFNVNDVLGVYSPHPYEQEVSALGGIPYSQIYGWYRVNFGVIDERLHRN
REYRDRYYRNLNIAPAEDGYRLAGFPPDHQAWREEPWIHHAPQGCGNSSRTITGDTCNEETQNLSTIYLREYQSKVKRQI
FSDYQSEVDIYNRIRDEL
>Q02RJ6 3.4.24.26~~~lasB~~~Elastase~~~
MKKVSTLDLLFVAIMGVSPAAFAADLIDVSKLPSKAAQGAPGPVTLQAAVGAGGADELKAIRSTTLPNGKQVTRYEQFHN
GVRVVGEAITEVKGPGKSVAARRSGHFVANIAADLPGSTTAAVSAEQVLAQAKSLKAQGRKTENDKVELVIRLGENNIAQ
LVYNVSYLIPGEGLSRPHFVIDAKTGEVLDQWEGLAHAEAGGPGGNQKIGKYTYGSDYGPLIVNDRCEMDDGNVITVDMN
GSTNDSKTTPFRFACPTNTYKQVNGAYSPLNDAHFFGGVVFNLYRDWFGTSPLTHKLYMKVHYGRSVENAYWDGTAMLFG
DGATMFYPLVSLDVAAHEVSHGFTEQNSGLIYRGQSGGMNEAFSDMAGEAAEFYMRGKNDFLIGYDIKKGSGALRYMDQP
SRDGRSIDNASQYYNGIDVHHSSGVYNRAFYLLANSPGWDTRKAFEVFVDANRYYWTATSNYNSGACGVISSAQNRNYSA
ADVTRAFSTVGVTCPSAL
>P14756 3.4.24.26~~~lasB~~~Elastase~~~
MKKVSTLDLLFVAIMGVSPAAFAADLIDVSKLPSKAAQGAPGPVTLQAAVGAGGADELKAIRSTTLPNGKQVTRYEQFHN
GVRVVGEAITEVKGPGKSVAAQRSGHFVANIAADLPGSTTAAVSAEQVLAQAKSLKAQGRKTENDKVELVIRLGENNIAQ
LVYNVSYLIPGEGLSRPHFVIDAKTGEVLDQWEGLAHAEAGGPGGNQKIGKYTYGSDYGPLIVNDRCEMDDGNVITVDMN
SSTDDSKTTPFRFACPTNTYKQVNGAYSPLNDAHFFGGVVFKLYRDWFGTSPLTHKLYMKVHYGRSVENAYWDGTAMLFG
DGATMFYPLVSLDVAAHEVSHGFTEQNSGLIYRGQSGGMNEAFSDMAGEAAEFYMRGKNDFLIGYDIKKGSGALRYMDQP
SRDGRSIDNASQYYNGIDVHHSSGVYNRAFYLLANSPGWDTRKAFEVFVDANRYYWTATSNYNSGACGVIRSAQNRNYSA
ADVTRAFSTVGVTCPSAL
>P0ABU5 4.2.1.-~~~elbB~~~Glyoxalase ElbB~~~COG3155
MKKIGVILSGCGVYDGSEIHEAVLTLLAISRSGAQAVCFAPDKQQVDVINHLTGEAMTETRNVLIEAARITRGEIRPLAQ
ADAAELDALIVPGGFGAAKNLSNFASLGSECTVDRELKALAQAMHQAGKPLGFMCIAPAMLPKIFDFPLRLTIGTDIDTA
EVLEEMGAEHVPCPVDDIVVDEDNKIVTTPAYMLAQNIAEAASGIDKLVSRVLVLAE
>P0CK94 ~~~eltB~~~Heat-labile enterotoxin B chain~~~
MNKVKFYVLFTALLSSLCAHGAPQSITELCSEYHNTQIYTINDKILSYTESMAGKREMVIITFKSGATFQVEVPGSQHID
SQKKAIERMKDTLRITYLTETKIDKLCVWNNKTPNSIAAISMEN
>P32890 ~~~eltB~~~Heat-labile enterotoxin B chain~~~
MNKVKCYVLFTALLSSLYAHGAPQTITELCSEYRNTQIYTINDKILSYTESMAGKREMVIITFKSGETFQVEVPGSQHID
SQKKAIERMKDTLRITYLTETKIDKLCVWNNKTPNSIAAISMKN
>Q8X582 ~~~elfA~~~Laminin-binding fimbrial subunit ElfA~~~COG3539
MITMKKSVLTAFITVVCATSSVMAADDNAITDGSVTFNGKVIAPACTLVAATKDSVVTLPDVSATKLQTNGQVSGVQTDV
PIELKDCDTTVTKNATFTFNGTADTTQITAFANQASSDAATNVALQMYMNDGTTAIKPDTETGNILLQDGDQTLTFKVDY
IATGKATSGNVNAVTNFHINYY
>P75855 ~~~elfA~~~Fimbrial subunit ElfA~~~COG3539
MKKSVLTAFITVVCATSSVMAADDNAITDGSVTFNGKVIAPACTLVAATKDSVVTLPDVSATKLQTNGQVSGVQIDVPIE
LKDCDTTVTKNATFTFNGTADTTQITAFANQASSDAATNVALQMYMNDGTTAITPDTETGNILLQDGDQTLTFKVDYIAT
GKATSGNVNAVTNFHINYY
>Q8XDD1 ~~~elfC~~~Probable outer membrane usher protein ElfC~~~COG3188
MYRTHRQHSLLSSGGVPSFIGGLVVFVSAAFNAQAETWFDPAFFKDDPSMVADLSRFEKGQKITPGVYRVDIVLNQTIVD
TRNVNFVEITPEKGIAACLTTESLDAMGVNTDAFPAFKQLDKQACAPLAEIIPDASVTFNVNKLRLEISVPQIAIKSNAR
GYVPPERWDEGINALLLGYSFSGANSIHSSADSDSGDSYFLNLNSGVNLGPWRLRNNSTWSRSSGQTAEWKNLSSYLQRA
VIPLKGELTVGDDYTAGDFFDSVSFRGVQLASDDNMLPDSLKGFAPVVRGIAKSNAQITIKQNGYTIYQTYVSPGAFEIS
DIYSTSSSGDLLVEIKEADGSVNSYSVPFSSVPLLQRQGRIKYAVTLAKYRTNSNEQQESKFAQATLQWGGPWGTTWYGG
GQYAEYYRAAMFGLGFNLGDFGAISFDVTQAKSTLADQSEHKGQSYRFLYAKTLNQLGTNFQLMGYRYSTSGFYTLSDTM
YKHMDGYEFNDGDDEDTPMWSRYYNLFYTKRGKLQVNISQQLSEYGSFYLSGSQQTYWHTDQQDRLLQFGYNTQIKDLSL
GISWNYSKSRGQPDADQVFALNFSLPLNLLLSRSNDSYTSKKNYAWMTSNTSIDNEGHTTQNLGLTETLLDDGNLSYSVQ
QGYNSEGKTANGSASMDYKGVFADARVGYNYSDNGSQQQLNYALSGSLVAHSQGITLGQSLGETNVLIAAPGAENTRVAN
STGLKTDWRGYTVVPYATSYRENRIALDAASLKRNVDLENAVVNVVPTKGALVLAEFNAHAGARVLMKTSKQGISLRFGA
IATLDGVQTNSGIIDDDGSLYMAGLPAKGTITVRWGEAPDQICHISYELTEQQINSAITRMDAICR
>P75857 ~~~elfC~~~Probable outer membrane usher protein ElfC~~~COG3188
MYRTHRQHSLLSSGGVPSFIGGLVVFVSAAFNAQAETWFDPAFFKDDPSMVADLSRFEKGQKITPGVYRVDIVLNQTIVD
TRNVNFVEITPEKGIAACLTTESLDAMGVNTDAFPAFKQLDKQACVPLAEIIPDASVTFNVNKLRLEISVPQIAIKSNAR
GYVPPERWDEGINALLLGYSFSGANSIHSSADSDSGDSYFLNLNSGVNLGPWRLRNNSTWSRSSGQTAEWKNLSSYLQRA
VIPLKGELTVGDDYTAGDFFDSVSFRGVQLASDDNMLPDSLKGFAPVVRGIAKSNAQITIKQNGYTIYQTYVSPGAFEIS
DLYSTSSSGDLLVEIKEADGSVNSYSVPFSSVPLLQRQGRIKYAVTLAKYRTNSNEQQESKFAQATLQWGGPWGTTWYGG
GQYAEYYRAAMFGLGFNLGDFGAISFDATQAKSTLADQSEHKGQSYRFLYAKTLNHLGTNFQLMGYRYSTSGFYTLSDTM
YKHMDGYEFNDGDDEDTPMWSRYYNLFYTKRGKLQVNISQQLGEYGSFYLSGSQQTYWHTDQQDRLLQFGYNTQIKDLSL
GISWNYSKSRGQPDADQVFALNFSLPLNLLLPRSNDSYTRKKNYAWMTSNTSIDNEGHTTQNLGLTETLLDDGNLSYSVQ
QGYNSEGKTANGSASMDYKGAFADARVGYNYSDNGSQQQLNYALSGSLVAHSQGITLGQSLGETNVLIAAPGAENTRVAN
STGLKTDWRGYTVVPYATSYRENRIALDAASLKRNVDLENAVVNVVPTKGALVLAEFNAHAGARVLMKTSKQGIPLRFGA
IATLDGVQANSGIIDDDGSLYMAGLPAKGTISVRWGEAPDQICHINYELTEQQINSAITRMDAICR
>Q8X5E4 ~~~elfD~~~Probable fimbrial chaperone protein ElfD~~~COG3121
MKTCITKGIVTVSLTAILLSCSSTWAAGKGGVGLAATRLVYSEGEEQISLGVRNTSPDVPYLIQSWVMTPDNKKSADFII
TPPLFVLNPANENLLRIMYIGAPLAKDRETLFFTSVRAVPSTTKREEGNTLKIATQSVIKLFWRPKGLAYPLGEAPAKLR
CTSSADMVTVSNPTPYFITLTDLKIGGKLVKNQMISPFDKYQFSLPKGAKNSSVTYRTINDYGAETPQLNCKS
>P75856 ~~~elfD~~~Probable fimbrial chaperone protein ElfD~~~COG3121
MKTCITKGIVTVSLTAILLSCSSAWAAGKGGIGLAATRLVYSEGEEQISLGVRNTSPDVPYLIQSWVMTPDNKKSADFII
TPPLFVLNPANENLLRIMYIGAPLAKDRETLFFTSVRAVPSTTKRKEGNTLKIATQSVIKLFWRPKGLAYPLGEAPAKLR
CTSSADMVTVSNPTPYFITLTDLKIGGKVVKNQMISPFDKYQFSLPKGAKNSSVTYRTINDYGAETPQLNCKS
>Q8XDC9 ~~~elfG~~~Uncharacterized fimbrial-like protein ElfG~~~COG3539
MQIIFGEKCVSLLRLFFAAVLMLWCAQTAAYSGQCHTTQGNPYIGVNFGVKTLEEEENTTGVVKDKFYQWNESNDYYVSC
DCDKDNVRSGRWAFAADSPLVYLGDNWYKINDYLAAKVLLQVKGSSPTAVPFENVGTGADTRWHICDPGGQRLGGQGASG
NSGSFSLKILQPFVGSVVIPPMALARLFECYNIPAGDSCTTTGTPVLVYYLSGTINSLGSCSVNAGETIEVDLGDVFAAN
FRVVGHKPLGARTAELAIPVRCNTGNAGLVNVNLSLTATTDPSYPQAIKTSRPGVGVVVTDSQNNIISPAGGTLPLSIPD
DADSIA
>P75858 ~~~elfG~~~Uncharacterized fimbrial-like protein ElfG~~~COG3539
MQIIFGEKCVSLLRLFFAAVLMLWCAQTAAYSGQCHTTQGNPYIGVNFGVKTLEEEANTAGVVKDKFYQWNESNDYYVSC
DCDKDNVRSGRWAFAADSPLVYLGDNWYKINDYLAAKVLLQVKGSSPTAVPFENVGTGGDTRWHICDPGGQRLGGQGASG
NSGSFSLKILQPFVGSVVIPPMALARLYECYNIPAGDSCTTTGTPVLVYYLSGTINSLGSCSVNAGETIEVDLGDVFAAN
FRVVGHKPLGARTAELAIPVRCNTGNAGLVNVNLSLTATTDPSYPQAIKTSRPGVGVVVTDSQNNIISPAGGTLPLSIPD
DADSIARMNVYPVSTTGVPPETGRFEATATVRINFD
>P00632 3.1.1.24~~~catD~~~3-oxoadipate enol-lactonase 2~~~COG2021
MPVFHFKDTLTAQDVALNYATFGQADRPALIFSNSLGTNLSMWQQQIAYFQDKYFVICYDTRGHGASSTPVGPYRIDQLG
TDVIALLDHLQIPQATFCGISMGGLTGQWLAIHFPERFNQVIVANTAAKIGEAQAWQARAQLVREQGLTPIAQTAATRWF
TPGFIEDSPEIVEKLSHDLAQGSAEGYASCCEALAEADVRPQLQRISIPVLVIAGAQDPVTTVADGQFLCEHIVHSTLEV
LEASHISNVEQPQAFNHAVEAVMKRFN
>P0C7B7 ~~~~~~Cys-loop ligand-gated ion channel~~~
APADNAADARPVDVSVSIFINKIYGVNTLEQTYKVDGYIVAQWTGKPRKTPGDKPLIVENTQIERWINNGLWVPALEFIN
VVGSPDTGNKRLMLFPDGRVIYNARFLGSFSNDMDFRLFPFDRQQFVLELEPFSYNNQQLRFSDIQVYTENIDNEEIDEW
WIRKASTHISDIRYDHLSSVQPNQNEFSRITVRIDAVRNPSYYLWSFILPLGLIIAASWSVFWLESFSERLQTSFTLMLT
VVAYAFYTSNILPRLPYTTVIDQMIIAGYGSIFAAILLIIFAHHRQAMGVEDDLLIQRCRLAFPLGFLAIGCVLVIRGIT
L
>Q9F2F9 2.4.1.331~~~elmGT~~~Elloramycin glycosyltransferase ElmGT~~~
MRVLAVATPALGHLFPAVPLLWALRARGDEVLVVTGGDALRVAEAGLPVVDALPGETLTTLFGAYQETDPAFFVALRRSP
MTTLRDLAPVLAYLAGRLLEPARRAAERWRPDAILATHGQAAGAVVAAEHGIPLVEHGFGFVRSDGAQEAVRQLLAERLG
PAGSEPPPERYFLDIAVPSMTSAIEGMSLRAVPYNGGAVLPLSGASVGGRPPRPRVLVTAGTQLLHTHGAGALAWLPEVA
AGHEAEFLLAAGGADLRDLGRLPPHVRVLDWTPLATVLPTCSAVVHHGGSGTTLAALAAGVPQLVSPALADNHINARAVA
DRGAGLETAVPDATTLTALLREPAFAKAAREVADELRSLPAPADVAARLHTAFGLPTTQGDA
>Q9L4Y1 1.14.13.200~~~elmG~~~Tetracenomycin B2 monooxygenase-dioxygenase~~~
MDRIEIPVLVVGGGLTGLAAAVFLRQQGVDCLLVERHRSTTFLTRASGINARTMELLRNAGLEETVIDRSLHLIEGKRWR
ELGQPADRIPWVVLRARDLADIERAVIVEEPSLDVADVSPTRAQWCGQDKLEPILRDEAVRRGADIRFHTRLDSFAQDAD
GVDAVIVDRGTGARTAVRSRYLIAADGVRSTVRQALGVTGTGHGSLGKAMSVLFQADFEPVLHGRRFVITYMANPQAPGV
LQTFDENRWIFGFFCDAYGGGDAAFDTGRCADIVRTSLGIPDIPLDVQLVQPWEMSHHVADSYRSGRVFLAGDAAHVHPP
AGAFGANGGIQDAHNLAWKLASVLHGRASDALLDTYHQERHPVGTEIAEQAWTRHTYRLDGDDELGRRLVDTKVVAAGYR
YTSSAVLGAAYPTAIPHELALTGLPGQRVPHVWLDHDGRRVSTVDLAVDGFVLLARADGTPWADAAARLAATTGIPLTAH
VVGKTLTDPADALAAATGLGEAGALLLRPDGFVAWRSDTSADDPEAVLDGVLARILART
>Q9AJU2 2.1.1.305~~~elmMI~~~8-demethyl-8-alpha-L-rhamnosyl tetracenomycin-C 2'-O-methyltransferase~~~
MDSPQVARTLVDSAGGTAAHTREAIRQIGVPETAAFLADELAGRTETVTIRHAAEVQFVFDDRYAPDAADPVPWTFRVGP
EGVTHRAGALPDPGAVVTQDLTELARSLYGPAADRSDATRTVWWRDHDDPRVYFDPPPVFPAVERLLAAADGRDVPGLAG
LALRHGSDKWGIHTYTAAYEQHFAPFRDRAVTVVEIGVGGYDDPAAGGGSLRMWKRYFRRGLVYGVDIADKSRHREPRVH
TVVADQSDPASLRDLADAIGPIDIVIDDGSHISAHVVTAFSTLFPRLNPGGLYVVEDLQTSYWPAFQGAYDDDTRTSVGF
LKRLVDGLHHAEYPSRAGRPAQPTDRTVGSLHFHPNLAFVEKRANSGHGGISRLREAT
>Q9AJU1 2.1.1.306~~~elmMII~~~8-demethyl-8-(2-methoxy-alpha-L-rhamnosyl)-tetracenomycin-C 3'-O-methyltransferase~~~
MTTPSPTPPLAAAEVAAAAGLQQRDLLAVLDRVGLEPAVAFLVHDLTARCDTPDNPDAARIGLAVEHSGHRTERVLAVRK
GEPVRVDEETAGPPPVRLTFDLADLVRGVYGPPPGPGAGLFRVERDDAWFVKNADDAEPFRIFESYMRAVDVLVRAATSR
PGDLGRLAARHASDKWGLWHWFTPLYEHHFARLRHQPVRVLELGIGGYQNPDEGGGSLKMWRSYFPQGRIFGVDYFPKHG
LDEDRIHTLQGSQDDAGFLRRVAEEHGPFDIVIDDGSHVAGHQQTAFRTLFPAVRNGGFYVIEDLWTAYCPGYGGAATAR
AEGRTSIGLLKSLLDDLHYEEWTAPEPAAPGFAAPSLVGVHVYRNLAVLEKGRNSEGTIPFFAPREIDYV
>Q9AJU0 2.1.1.307~~~elmMIII~~~8-demethyl-8-(2,3-dimethoxy-alpha-L-rhamnosyl)-tetracenomycin-C 4'-O-methyltransferase~~~
MTEDARDLYLDLMKKVLTNLIYRDAPIQTFVYDGEPDADPRLLGRDWPSVAHTMVGLKRLDNLQYCVETVLADGVPGDLV
ETGVWRGGSSIFMRAVLRAHGDTARRVWVADSFEGMPEVGADSHAVDREMRLHEHNGVLAVPLEQVRANFERYGLLDDQV
RFLPGWFKDTLPGAPTGRLAVIRLDGDLYESTTDALENLMPRLSPGGFVIIDDYAIDACRDAVHDYRGRYGISDPISEID
GTGVFWRHTAASARSLQPATV
>A0A1C7D1B7 2.3.1.-~~~~~~tRNA uridine(34) acetyltransferase~~~
MKKLSRTISGVTPVAVMTKPLPCPGKCIYCPTFAATPQSYTPESPAVLRAKSCEYQAYKQVALRLRIIQDMGHPTDKVEL
IIMGGTFLSADITYQYGFIKDCYDALNGVVAGSLEEAKTINETAQHRCVGLCIETRPDICGKAEIQRMIDFGTTRVELGV
QMLDDDIYKLVERGHRVSDVAEATCLLREYGLKVHYHWMPGLPGSSPEKDLALSRMVFEDPRFCPDGLKLYPTMVVEGTI
LEQWWKEGRYTPYPNGTMTGLIADIKALVPPYVRISRVLRDIPAVFISAGLKDSLRDGVRQILESRHQKCRCIRCREYGH
RQRKGQTSGEPTLRRLDYPASGGKEIFLSFEDASDTLYGLLRLRIPCASLPVLGQKYGAKTGLVRELHVYGTELSLGEQG
DQSAQHRGLGRKLLAEAECLARDEFGLDSLAILSGVGAREYYRSLGYELVAGYMCKHLD
>P01558 ~~~cpe~~~Heat-labile enterotoxin B chain~~~
MLSNNLNPMVFENAKEVFLISEDLKTPINITNSNSNLSDGLYVIDKGDGWILGEPSVVSSQILNPNETGTFSQSLTKSKE
VSINVNFSVGFTSEFIQASVEYGFGITIGEQNTIERSVSTTAGPNEYVYYKVYATYRKYQAIRISHGNISDDGSIYKLTG
IWLSKTSADSLGNIDQGSLIETGERCVLTVPSTDIEKEILDLAAATERLNLTDALNSNPAGNLYDWRSSNSYPWTQKLNL
HLTITATGQKYRILASKIVDFNIYSNNFNNLVKLEQSLGDGVKDHYVDISLDAGQYVLVMKANSSYSGNYPYSILFQKF
>A0QXD8 1.1.1.-~~~eltD~~~Erythritol/L-threitol dehydrogenase~~~COG1063
MSNQVPEKMQAVVCHGPHDYRLEEVAVPQRKPGEALIRVEAVGICASDLKCYHGAAKFWGDENRPAWAETMVIPGHEFVG
RVVELDDEAAQRWGIAVGDRVVSEQIVPCWECLFCKRGQYHMCQPHDLYGFKRRTPGAMASYMVYPAEALVHKVSPDIPA
QHAAFAEPLSCSLHAVERAQITFEDTVVVAGCGPIGLGMIAGAKAKSPMRVIALDMAPDKLKLAEKCGADLTINIAEQDA
EKIIKDLTGGYGADVYIEGTGHTSAVPQGLNLLRKLGRYVEYGVFGSDVTVDWSIISDDKELDVLGAHLGPYCWPAAIKM
IESGALPMDEICTHQFPLTEFQKGLDLVASGKESVKVSLIPA
>A0QXD9 ~~~eltP~~~Erythritol/L-threitol-binding protein~~~COG1653
MMSRESQPGLHRQLSRRNMLAAMGLAGAAAVSLPVLSACGVGGRTNAPNGASEVTGGFDWRKASGSTINILQTPHPYQQS
YQPLLKEFTELTGINVNVDLVPEADYFTKLNTELAGGTGKHDAFMLGAYFIWQYGPPGWIEDLNPWLQNSSATNAEYDFE
DIFEGLRTSTRWDFELGNPLGTGGQWAIPWGFENNVVAYNKAYFDQRGITKLPDNFDDFIQLAIDLTDRSENRYGIATRG
SKSWATIHPGFMTQYVREGAVDYTFDGTDLVAEMDSDKAVEFTRKWIEMQHKAGPTSWTTYDYPNATGDLGDGTAMMVYD
ADSATYPKNKPGASAQAGNLGWYPGPAGPDGNYKTNLWTWTWAMNANSRNKLPAWLFIQWATGKESMNKAVEGGIYADPV
RQSVFDTTFKRIAADQHGYLETFETVIGSSKIQFTPQKKFFDTTKDWAVALQDIYGGDDAASRLRSLAKTNTSKVNL
>P27693 3.4.21.-~~~~~~Alkaline protease~~~
MKKPLGKIVASTALLISVAFSSSIASAAEEAKEKYLIGFNEQEAVSEFVEQVEANDEVAILSEEEEVEIELLHEFETIPV
LSVELSPEDVDALELDPAISYIEEDAEVTTMAQSVPWGISRVQAPAAHNRGLTGSGVKVAVLDTGISTHPDLNIRGGASF
VPGEPSTQDGNGHGTHVAGTIAALNNSIGVLGVAPNAELYAVKVLGASGSGSVSSIAQGLEWAGNNGMHVANLSLGSPSP
SATLEQAVNSATSRGVLVVAASGNSGAGSISYPARYANAMAVGATDQNNNRASFSQYGAGLDIVAPGVNVQSTYPGSTYA
SLNGTSMATPHVAGAAALVKQKNPSWSNVQIRNHLKNTATSLGSTNLYGSGLVNAEAATR
>P41362 3.4.21.-~~~~~~Alkaline protease~~~
MKKPLGKIVASTALLISVAFSSSIASAAEEAKEKYLIGFNEQEAVSEFVEQVEANDEVAILSEEEEVEIELLHEFETIPV
LSVELSPEDVDALELDPAISYIEEDAEVTTMAQSVPWGISRVQAPAAHNRGLTGSGVKVAVLDTGISTHPDLNIRGGASF
VPGEPSTQDGNGHGTHVAGTIAALNNSIGVLGVAPSAELYAVKVLGASGSGSVSSIAQGLEWAGNNGMHVANLSLGSPSP
SATLEQAVNSATSRGVLVVAASGNSGAGSISYPARYANAMAVGATDQNNNRASFSQYGAGLDIVAPGVNVQSTYPGSTYA
SLNGTSMATPHVAGAAALVKQKNPSWSNVQIRNHLKNTATSLGSTNLYGSGLVNAEAATR
>P20724 3.4.21.-~~~ale~~~Alkaline elastase YaB~~~
MNKKMGKIVAGTALIISVAFSSSIAQAAEEAKEKYLIGFKEQEVMSQFVDQIDGDEYSISSQAEDVEIDLLHEFDFIPVL
SVELDPEDVDALELDPAIAYIEEDAEVTTMQTVPWGINRVQAPIAQSRGFTGTGVRVAVLDTGISNHADLRIRGGASFVP
GEPNISDGNGHGTQVAGTIAALNNSIGVLGVAPNVDLYGVKVLGASGSGSISGIAQGLQWAANNGMHIANMSLGSSAGSA
TMEQAVNQATASGVLVVAASGNSGAGNVGFPARYANAMAVGATDQNNNRATFSQYGAGLDIVAPGVGVQSTVPGNGYASF
NGTSMATPHVAGVAALVKQKNPSWSNVQIRNHLKNTATNLGNTTQFGSGLVNAEAATR
>P41363 3.4.21.-~~~~~~Thermostable alkaline protease~~~COG1404
MRQSLKVMVLSTVALLFMANPAAASEEKKEYLIVVEPEEVSAQSVEESYDVDVIHEFEEIPVIHAELTKKELKKLKKDPN
VKAIEKNAEVTISQTVPWGISFINTQQAHNRGIFGNGARVAVLDTGIASHPDLRIAGGASFISSEPSYHDNNGHGTHVAG
TIAALNNSIGVLGVAPSADLYAVKVLDRNGSGSLASVAQGIEWAINNNMHIINMSLGSTSGSSTLELAVNRANNAGILLV
GAAGNTGRQGVNYPARYSGVMAVAAVDQNGQRASFSTYGPEIEISAPGVNVNSTYTGNRYVSLSGTSMATPHVAGVAALV
KSRYPSYTNNQIRQRINQTATYLGSPSLYGNGLVHAGRATQ
>P9WNL9 2.4.2.-~~~embA~~~Probable arabinosyltransferase A~~~COG1807
MPHDGNERSHRIARLAAVVSGIAGLLLCGIVPLLPVNQTTATIFWPQGSTADGNITQITAPLVSGAPRALDISIPCSAIA
TLPANGGLVLSTLPAGGVDTGKAGLFVRANQDTVVVAFRDSVAAVAARSTIAAGGCSALHIWADTGGAGADFMGIPGGAG
TLPPEKKPQVGGIFTDLKVGAQPGLSARVDIDTRFITTPGALKKAVMLLGVLAVLVAMVGLAALDRLSRGRTLRDWLTRY
RPRVRVGFASRLADAAVIATLLLWHVIGATSSDDGYLLTVARVAPKAGYVANYYRYFGTTEAPFDWYTSVLAQLAAVSTA
GVWMRLPATLAGIACWLIVSRFVLRRLGPGPGGLASNRVAVFTAGAVFLSAWLPFNNGLRPEPLIALGVLVTWVLVERSI
ALGRLAPAAVAIIVATLTATLAPQGLIALAPLLTGARAIAQRIRRRRATDGLLAPLAVLAAALSLITVVVFRDQTLATVA
ESARIKYKVGPTIAWYQDFLRYYFLTVESNVEGSMSRRFAVLVLLFCLFGVLFVLLRRGRVAGLASGPAWRLIGTTAVGL
LLLTFTPTKWAVQFGAFAGLAGVLGAVTAFTFARIGLHSRRNLTLYVTALLFVLAWATSGINGWFYVGNYGVPWYDIQPV
IASHPVTSMFLTLSILTGLLAAWYHFRMDYAGHTEVKDNRRNRILASTPLLVVAVIMVAGEVGSMAKAAVFRYPLYTTAK
ANLTALSTGLSSCAMADDVLAEPDPNAGMLQPVPGQAFGPDGPLGGISPVGFKPEGVGEDLKSDPVVSKPGLVNSDASPN
KPNAAITDSAGTAGGKGPVGINGSHAALPFGLDPARTPVMGSYGENNLAATATSAWYQLPPRSPDRPLVVVSAAGAIWSY
KEDGDFIYGQSLKLQWGVTGPDGRIQPLGQVFPIDIGPQPAWRNLRFPLAWAPPEADVARIVAYDPNLSPEQWFAFTPPR
VPVLESLQRLIGSATPVLMDIATAANFPCQRPFSEHLGIAELPQYRILPDHKQTAASSNLWQSSSTGGPFLFTQALLRTS
TIATYLRGDWYRDWGSVEQYHRLVPADQAPDAVVEEGVITVPGWGRPGPIRALP
>P9WNL7 2.4.2.-~~~embB~~~Probable arabinosyltransferase B~~~COG1807
MTQCASRRKSTPNRAILGAFASARGTRWVATIAGLIGFVLSVATPLLPVVQTTAMLDWPQRGQLGSVTAPLISLTPVDFT
ATVPCDVVRAMPPAGGVVLGTAPKQGKDANLQALFVVVSAQRVDVTDRNVVILSVPREQVTSPQCQRIEVTSTHAGTFAN
FVGLKDPSGAPLRSGFPDPNLRPQIVGVFTDLTGPAPPGLAVSATIDTRFSTRPTTLKLLAIIGAIVATVVALIALWRLD
QLDGRGSIAQLLLRPFRPASSPGGMRRLIPASWRTFTLTDAVVIFGFLLWHVIGANSSDDGYILGMARVADHAGYMSNYF
RWFGSPEDPFGWYYNLLALMTHVSDASLWMRLPDLAAGLVCWLLLSREVLPRLGPAVEASKPAYWAAAMVLLTAWMPFNN
GLRPEGIIALGSLVTYVLIERSMRYSRLTPAALAVVTAAFTLGVQPTGLIAVAALVAGGRPMLRILVRRHRLVGTLPLVS
PMLAAGTVILTVVFADQTLSTVLEATRVRAKIGPSQAWYTENLRYYYLILPTVDGSLSRRFGFLITALCLFTAVFIMLRR
KRIPSVARGPAWRLMGVIFGTMFFLMFTPTKWVHHFGLFAAVGAAMAALTTVLVSPSVLRWSRNRMAFLAALFFLLALCW
ATTNGWWYVSSYGVPFNSAMPKIDGITVSTIFFALFAIAAGYAAWLHFAPRGAGEGRLIRALTTAPVPIVAGFMAAVFVA
SMVAGIVRQYPTYSNGWSNVRAFVGGCGLADDVLVEPDTNAGFMKPLDGDSGSWGPLGPLGGVNPVGFTPNGVPEHTVAE
AIVMKPNQPGTDYDWDAPTKLTSPGINGSTVPLPYGLDPARVPLAGTYTTGAQQQSTLVSAWYLLPKPDDGHPLVVVTAA
GKIAGNSVLHGYTPGQTVVLEYAMPGPGALVPAGRMVPDDLYGEQPKAWRNLRFARAKMPADAVAVRVVAEDLSLTPEDW
IAVTPPRVPDLRSLQEYVGSTQPVLLDWAVGLAFPCQQPMLHANGIAEIPKFRITPDYSAKKLDTDTWEDGTNGGLLGIT
DLLLRAHVMATYLSRDWARDWGSLRKFDTLVDAPPAQLELGTATRSGLWSPGKIRIGP
>P9WNL5 2.4.2.-~~~embC~~~Probable arabinosyltransferase C~~~COG1807
MATEAAPPRIAVRLPSTSVRDAGANYRIARYVAVVAGLLGAVLAIATPLLPVNQTTAQLNWPQNGTFASVEAPLIGYVAT
DLNITVPCQAAAGLAGSQNTGKTVLLSTVPKQAPKAVDRGLLLQRANDDLVLVVRNVPLVTAPLSQVLGPTCQRLTFTAH
ADRVAAEFVGLVQGPNAEHPGAPLRGERSGYDFRPQIVGVFTDLAGPAPPGLSFSASVDTRYSSSPTPLKMAAMILGVAL
TGAALVALHILDTADGMRHRRFLPARWWSTGGLDTLVIAVLVWWHFVGANTSDDGYILTMARVSEHAGYMANYYRWFGTP
EAPFGWYYDLLALWAHVSTASIWMRLPTLAMALTCWWVISREVIPRLGHAVKTSRAAAWTAAGMFLAVWLPLDNGLRPEP
IIALGILLTWCSVERAVATSRLLPVAIACIIGALTLFSGPTGIASIGALLVAIGPLRTILHRRSRRFGVLPLVAPILAAA
TVTAIPIFRDQTFAGEIQANLLKRAVGPSLKWFDEHIRYERLFMASPDGSIARRFAVLALVLALAVSVAMSLRKGRIPGT
AAGPSRRIIGITIISFLAMMFTPTKWTHHFGVFAGLAGSLGALAAVAVTGAAMRSRRNRTVFAAVVVFVLALSFASVNGW
WYVSNFGVPWSNSFPKWRWSLTTALLELTVLVLLLAAWFHFVANGDGRRTARPTRFRARLAGIVQSPLAIATWLLVLFEV
VSLTQAMISQYPAWSVGRSNLQALAGKTCGLAEDVLVELDPNAGMLAPVTAPLADALGAGLSEAFTPNGIPADVTADPVM
ERPGDRSFLNDDGLITGSEPGTEGGTTAAPGINGSRARLPYNLDPARTPVLGSWRAGVQVPAMLRSGWYRLPTNEQRDRA
PLLVVTAAGRFDSREVRLQWATDEQAAAGHHGGSMEFADVGAAPAWRNLRAPLSAIPSTATQVRLVADDQDLAPQHWIAL
TPPRIPRVRTLQNVVGAADPVFLDWLVGLAFPCQRPFGHQYGVDETPKWRILPDRFGAEANSPVMDHNGGGPLGITELLM
RATTVASYLKDDWFRDWGALQRLTPYYPDAQPADLNLGTVTRSGLWSPAPLRRG
>P9WGJ9 ~~~embR~~~Transcriptional regulatory protein EmbR~~~COG1716
MAGSATVEKRLDFGLLGPLQMTIDGTPVPSGTPKQRAVLAMLVINRNRPVGVDALITALWEEWPPSGARASIHSYVSNLR
KLLGGAGIDPRVVLAAAPPGYRLSIPDNTCDLGRFVAEKTAGVHAAAAGRFEQASRHLSAALREWRGPVLDDLRDFQFVE
PFATALVEDKVLAHTAKAEAEIACGRASAVIAELEALTFEHPYREPLWTQLITAYYLSDRQSDALGAYRRVKTTLADDLG
IDPGPTLRALNERILRQQPLDAKKSAKTTAAGTVTVLDQRTMASGQQAVAYLHDIASGRGYPLQAAATRIGRLHDNDIVL
DSANVSRHHAVIVDTGTNYVINDLRSSNGVHVQHERIRSAVTLNDGDHIRICDHEFTFQISAGTHGGT
>P43147 3.4.24.-~~~empA~~~Virulence metalloprotease~~~
MKKVQRQMKWLFLAASISAALPVSAAKMVQVDDPSLLEQALSMQARSIVPTQNGFQVVKSVTLPNGKVKVRYQQMYHGLP
VFNTSVVATQTEKGIGQVYGMLAQQIDSDVVSTSPQVEQKQAVSIALTHYQQQNPSLTSADLVTENERAQLMVRLDENQM
AQMVYLVDFFVATNEPARPFFFIDANSGDVLQTWEGLNHAEATGTGPGGNQKTGFYQYGTDFPGLVINKVGNTCSMMNSA
VKTVDMKHATSGGSTFSYSCTDASNYNDYKAINGAYSPLNDAHYFGKVVFDMYKDWMNTTPLTFQLTMRVHYDSNYENAF
WNGSSMTFGDGQNTFYPLVDINVSAHEVSHGFTEQNSGLVYQNMSGGINEAFSDIAGEAAEFYMKGSVDWVVGSDIFKSS
GGLRYFDQPSKDGRSIDHASQYYNGLNVHYSSGVFNRAYYLLANKANWSVRKGFEVFTVANQLYWTANSTFDQGGCGVAK
AAQDLGYNKADVVDAFNQVGVNASCGVVPPTENVLEKGKPVIGLQGTRSSEAFYTFTVASSTSAKVSISLGSGDADLYVK
AGSKPTTSSWDCRPYKSGNNEQCTISATPGTTYHVMLKGYSNYSGVTLRLD
>A6QF98 ~~~ssp~~~Extracellular matrix protein-binding protein emp~~~
MKKKLLVLTMSTLFATQIMNSNHAKASVTESVDKKFVVPESGINKIIPAYDEFKNSPKVNVSNLTDNKNFVASEDKLNKI
ADSSAASKIVDKNFVVPESKLGNIVPEYKEINNRVNVATNNPASQQVDKHFVAKGPEVNRFITQNKVNHHFITTQTHYKK
VITSYKSTHVHKHVNHAKDSINKHFIVKPSESPRYTHPSQSLIIKHHFAVPGYHAHKFVTPGHASIKINHFCVVPQINSF
KVIPPYGHNSHRMHVPSFQNNTTATHQNAKVNKAYDYKYFYSYKVVKGVKKYFSFSQSNGYKIGKPSLNIKNVNYQYAVP
SYSPTHYVPEFKGSLPAPRV
>Q7A6P4 ~~~emp~~~Extracellular matrix protein-binding protein emp~~~
MKKKLLVLTMSTLFATQLINSNHAKASVTESVDKKFVVPESGINKIIPAYDEFKNSPKVNVSNLTDNKNFVVSEDKLNKI
VDSSAASKIVDKNFAVPESKLGNIVPEYKEINNRVNVATNNPASQQVDKHFVAKGPEVNRFITQNKVNHHFITTQTHYKK
VITSYKSTHVHKHVNHAKDSINKHFIVKPSESPRYTHPSQSLIIKHHFAVPGYHAHKFVTPGHASIKINHFCVVPQINSF
KVIPPYGHNSHRMHVPSFQNNTTATHQNAKVNKAYDYKYFYSYKVVKGVKKYFSFSQSNGYKIGKPSLNIKNVNYQYAVP
SYSPTHYVPEFKGSLPAPRV
>P0DPR6 ~~~emrA~~~Colistin resistance protein EmrA~~~
MDNVAQLETDTNFQSRKKITWGVFSVLLLFLVAGILYYFFVYRFYQSTDNAYVQADVTWVMPKISGEVMELLINDNQVVK
KGETLAVLDHRDYQARYDQARSVVSLKEAALGVQQQNEKSARSSIIEANSGVVAAQADLARLKKEFERYQDLLKDGVITR
QNFEGIQSQYLTAQAQLSKAQAAVNAAEAQLGSLQASRAQLLADIQSSHANLNLYQVDLASSKVVSPVSGKIGSLAIQKG
SRVSPQTRLMAIIPENSLYVQANFKETQIEKMHIGQKVKLKLDAYPSLNFTGKIESFSPASGATFSLMPPDNATGNFNKV
VQRIPVRIAIDSSPHIDLVKPGMSVSATVDLRT
>P27303 ~~~emrA~~~Multidrug export protein EmrA~~~COG1566
MSANAETQTPQQPVKKSGKRKRLLLLLTLLFIIIAVAIGIYWFLVLRHFEETDDAYVAGNQIQIMSQVSGSVTKVWADNT
DFVKEGDVLVTLDPTDARQAFEKAKTALASSVRQTHQLMINSKQLQANIEVQKIALAKAQSDYNRRVPLGNANLIGREEL
QHARDAVTSAQAQLDVAIQQYNANQAMILGTKLEDQPAVQQAATEVRNAWLALERTRIISPMTGYVSRRAVQPGAQISPT
TPLMAVVPATNMWVDANFKETQIANMRIGQPVTITTDIYGDDVKYTGKVVGLDMGTGSAFSLLPAQNATGNWIKVVQRLP
VRIELDQKQLEQYPLRIGLSTLVSVNTTNRDGQVLANKVRSTPVAVSTAREISLAPVNKLIDDIVKANAG
>P0DPR7 ~~~emrB~~~Colistin resistance protein EmrB~~~
MNKHVEAEWRFPAKTAWAIFAAMIFGNFMAILDIQIVASSLNEVQAGMSASRYEVTWVQTVYLIAEIIAIPMSSIVSRVL
STRVYYTMCAIGFTVSSLLCALSWNLESLLVFRGIQGFMGGGMIPTSMTALYLLFPEPKRSLPLVMFGMISTLGPAIGPT
IGGWLTNNFSWHWMFLINIIPGIIIATVIYSGPNIDRANYSLIKSMDWFSLVGMAMFLGGLEYFLDEGARHDWLADTGVR
IAFMVCVVGGMIFFSRSFTQPKPLLDLSVFKNKNFTLSAITTFVIGMALYGLGYMIPVFLGQVREMNSSQIGHVMMVTGI
VMFCFAPFLAWLIPNFDTRKTVFVGMILAGFGVWLNSHLSIHSDYDFMFWPQIYRGIGLMICLIVVSHLAMSTLPLSKVA
DASGIYNLMRNIGGAVGLALINSSLDWLTAMHVTQINQSMTPQNWIFTERLDQLTAQYQEVGTNAQQIALSVIYRDIHFQ
ALTSSFNDLLRMLAIIMFVTAFLTIFMDRGKK
>P0AEJ0 ~~~emrB~~~Multidrug export protein EmrB~~~COG2814
MQQQKPLEGAQLVIMTIALSLATFMQVLDSTIANVAIPTIAGNLGSSLSQGTWVITSFGVANAISIPLTGWLAKRVGEVK
LFLWSTIAFAIASWACGVSSSLNMLIFFRVIQGIVAGPLIPLSQSLLLNNYPPAKRSIALALWSMTVIVAPICGPILGGY
ISDNYHWGWIFFINVPIGVAVVLMTLQTLRGRETRTERRRIDAVGLALLVIGIGSLQIMLDRGKELDWFSSQEIIILTVV
AVVAICFLIVWELTDDNPIVDLSLFKSRNFTIGCLCISLAYMLYFGAIVLLPQLLQEVYGYTATWAGLASAPVGIIPVIL
SPIIGRFAHKLDMRRLVTFSFIMYAVCFYWRAYTFEPGMDFGASAWPQFIQGFAVACFFMPLTTITLSGLPPERLAAASS
LSNFTRTLAGSIGTSITTTMWTNRESMHHAQLTESVNPFNPNAQAMYSQLEGLGMTQQQASGWIAQQITNQGLIISANEI
FWMSAGIFLVLLGLVWFAKPPFGAGGGGGGAH
>P31442 ~~~emrD~~~Multidrug resistance protein D~~~COG2814
MKRQRNVNLLLMLVLLVAVGQMAQTIYIPAIADMARDLNVREGAVQSVMGAYLLTYGVSQLFYGPISDRVGRRPVILVGM
SIFMLATLVAVTTSSLTVLIAASAMQGMGTGVGGVMARTLPRDLYERTQLRHANSLLNMGILVSPLLAPLIGGLLDTMWN
WRACYLFLLVLCAGVTFSMARWMPETRPVDAPRTRLLTSYKTLFGNSGFNCYLLMLIGGLAGIAAFEACSGVLMGAVLGL
SSMTVSILFILPIPAAFFGAWFAGRPNKRFSTLMWQSVICCLLAGLLMWIPDWFGVMNVWTLLVPAALFFFGAGMLFPLA
TSGAMEPFPFLAGTAGALVGGLQNIGSGVLASLSAMLPQTGQGSLGLLMTLMGLLIVLCWLPLATRMSHQGQPV
>P23895 ~~~emrE~~~Multidrug transporter EmrE~~~COG2076
MNPYIYLGGAILAEVIGTTLMKFSEGFTRLWPSVGTIICYCASFWLLAQTLAYIPTGIAYAIWSGVGIVLISLLSWGFFG
QRLDLPAIIGMMLICAGVLIINLLSRSTPH
>P52599 ~~~emrK~~~Probable multidrug resistance protein EmrK~~~COG1566
MEQINSNKKHSNRRKYFSLLAVVLFIAFSGAYAYWSMELEDMISTDDAYVTGNADPISAQVSGSVTVVNHKDTNYVRQGD
ILVSLDKTDATIALNKAKNNLANIVRQTNKLYLQDKQYSAEVASARIQYQQSLEDYNRRVPLAKQGVISKETLEHTKDTL
ISSKAALNAAIQAYKANKALVMNTPLNRQPQVVEAADATKEAWLALKRTDIKSPVTGYIAQRSVQVGETVSPGQSLMAVV
PARQMWVNANFKETQLTDVRIGQSVNIISDLYGENVVFHGRVTGINMGTGNAFSLLPAQNATGNWIKIVQRVPVEVSLDP
KELMEHPLRIGLSMTATIDTKNEDIAEMPELASTVTSMPAYTSKALVIDTSPIEKEISNIISHNGQL
>P52600 ~~~emrY~~~Probable multidrug resistance protein EmrY~~~COG0477
MAITKSTPAPLTGGTLWCVTIALSLATFMQMLDSTISNVAIPTISGFLGASTDEGTWVITSFGVANAIAIPVTGRLAQRI
GELRLFLLSVTFFSLSSLMCSLSTNLDVLIFFRVVQGLMAGPLIPLSQSLLLRNYPPEKRTFALALWSMTVIIAPICGPI
LGGYICDNFSWGWIFLINVPMGIIVLTLCLTLLKGRETETSPVKMNLPGLTLLVLGVGGLQIMLDKGRDLDWFNSSTIII
LTVVSVISLISLVIWESTSENPILDLSLFKSRNFTIGIVSITCAYLFYSGAIVLMPQLLQETMGYNAIWAGLAYAPIGIM
PLLISPLIGRYGNKIDMRLLVTFSFLMYAVCYYWRSVTFMPTIDFTGIILPQFFQGFAVACFFLPLTTISFSGLPDNKFA
NASSMSNFFRTLSGSVGTSLTMTLWGRRESLHHSQLTATIDQFNPVFNSSSQIMDKYYGSLSGVLNEINNEITQQSLSIS
ANEIFRMAAIAFILLTVLVWFAKPPFTAKGVG
>P0C960 4.2.2.n2~~~emtA~~~Endo-type membrane-bound lytic murein transglycosylase A~~~COG0741
MKLRWFAFLIVLLAGCSSKHDYTNPPWNAKVPVQRAMQWMPISQKAGAAWGVDPQLITAIIAIESGGNPNAVSKSNAIGL
MQLKASTSGRDVYRRMGWSGEPTTSELKNPERNISMGAAYLNILETGPLAGIEDPKVLQYALVVSYANGAGALLRTFSSD
RKKAISKINDLDADEFLEHVARNHPAPQAPRYIYKLEQALDAM
>Q0QLE9 3.5.2.18~~~Ena~~~Enamidase~~~
MSKTIIKNIGKIVSGDIKSPVLQADTIVVEDGLIAAIGGEELMKDAGDATIIDAAGSTVTPGLLDTHVHVSGGDYAPRQK
TMDFISSALHGGVTTMISAGSPHFPGRPKDAAGTKALAITLSKSYYNARPAGVKVHGGAVILEKGLTEEDFIEMKKEGVW
IVGEVGLGTIKNPEDAAPMVEWAHKHGFKVQMHTGGTSIPGSSTVTADDVIKTKPDVVSHINGGPTAISVQEVDRIMDET
DFAMEIVQCGNPKIADYVARRAAEKGQLGRVIFGNDAPSGTGLIPLGILRNMCQIASMSDIDPEVAVCMATGNSTAVYGL
NTGVIAPGKEADLIIMDTPLGSVAEDAMGAIAAGDIPGISVVLIDGEAVVTKSRNTPPAKRAAKIL
>A0A0F5HPP7 ~~~enc~~~Type 1 encapsulin shell protein~~~
MNKSQLYPDSPLTDQDFNQLDQTVIEAARRQLVGRRFIELYGPLGRGMQSVFNDIFMESHEAKMDFQGSFDTEVESSRRV
NYTIPMLYKDFVLYWRDLEQSKALDIPIDFSVAANAARDVAFLEDQMIFHGSKEFDIPGLMNVKGRLTHLIGNWYESGNA
FQDIVEARNKLLEMNHNGPYALVLSPELYSLLHRVHKDTNVLEIEHVRELITAGVFQSPVLKGKSGVIVNTGRNNLDLAI
SEDFETAYLGEEGMNHPFRVYETVVLRIKRPAAICTLIDPEE
>Q45296 ~~~enc~~~Type 1 encapsulin shell protein~~~
MNNLYRELAPIPGPAWAEIEEEARRTFKRNIAGRRIVDVAGPTGFETSAVTTGHIRDVQSETSGLQVKQRIVQEYIELRT
PFTVTRQAIDDVARGSGDSDWQPVKDAATTIAMAEDRAILHGLDAAGIGGIVPGSSNAAVAIPDAVEDFADAVAQALSVL
RTVGVDGPYSLLLSSAEYTKVSESTDHGYPIREHLSRQLGAGEIIWAPALEGALLVSTRGGDYELHLGQDLSIGYYSHDS
ETVELYLQETFGFLALTDESSVPLSL
>D0LZ74 ~~~~~~Type 1 encapsulin shell protein~~~COG1659
MDLLKRHLAPIVPDAWSAIDEEAKEIFQGHLAGRKLVDFRGPFGWEYAAVNTGELRPIDDTPEDVDMKLRQVQPLAEVRV
PFTLDVTELDSVARGATNPDLDDVARAAERMVEAEDSAIFHGWAQAGIKGIVDSTPHEALAVASVSDFPRAVLSAADTLR
KAGVTGPYALVLGPKAYDDLFAATQDGYPVAKQVQRLVVDGPLVRANALAGALVMSMRGGDYELTVGQDLSIGYAFHDRS
KVELFVAESFTFRVLEPGAAVHLRYA
>Q1Q6L7 ~~~enc~~~Diheme-cytochrome-encapsulin shell fusion protein~~~
MVMGILNTFKKVYAVTGFFALLAVFSLSQVGSSAFAACAKVDDCFSCHTTQELNAVHKNTPYQGQSCIVCHKAFAADDTC
SDAKDGRFAKISSEININKEDWNKIQRAVHETTEKHLVGRKFLNIYGPLGTGAQSVPLDTYGLPSWASIDMLGEGNEAIH
PLKREIAQIYLIYKDFWLFRRDIEFSKKCETPIDISAAIGAAVSVSRKEDDMVFNGLSEMGIPGLLTASGRNIMKLSDWS
VIGNGFQDVVLAVEKLTSRGFNGPFALVVSPKLYAYLHRVYERTGQLEIQGVKELVNGGVYQSYVFNKDVALVIATGSLN
MDLAVGSNYKVEYWGPQDLNHRFRVVGSSVLRIKCPQAICTLE
>I6WZG6 ~~~enc~~~Type 1 encapsulin shell protein~~~COG1659
MNNLYRDLAPVTEAAWAEIELEAARTFKRHIAGRRVVDVSDPGGPVTAAVSTGRLIDVKAPTNGVIAHLRASKPLVRLRV
PFTLSRNEIDDVERGSKDSDWEPVKEAAKKLAFVEDRTIFEGYSAASIEGIRSASSNPALTLPEDPREIPDVISQALSEL
RLAGVDGPYSVLLSADVYTKVSETSDHGYPIREHLNRLVDGDIIWAPAIDGAFVLTTRGGDFDLQLGTDVAIGYASHDTD
TVRLYLQETLTFLCYTAEASVALSH
>Q1D6H4 ~~~encA~~~Type 1 encapsulin shell protein EncA~~~COG1659
MPDFLGHAENPLREEEWARLNETVIQVARRSLVGRRILDIYGPLGAGVQTVPYDEFQGVSPGAVDIVGEQETAMVFTDAR
KFKTIPIIYKDFLLHWRDIEAARTHNMPLDVSAAAGAAALCAQQEDELIFYGDARLGYEGLMTANGRLTVPLGDWTSPGG
GFQAIVEATRKLNEQGHFGPYAVVLSPRLYSQLHRIYEKTGVLEIETIRQLASDGVYQSNRLRGESGVVVSTGRENMDLA
VSMDMVAAYLGASRMNHPFRVLEALLLRIKHPDAICTLEGAGATERR
>C0ZVK4 ~~~enc~~~Type 1 encapsulin shell protein~~~COG1659
MTNLHRDLAPISAAAWAEIEEEASRTFKRHVAGRRVVDVEGPSGDDLAAIPLGHQVPINPLADGVIAHARQSQPVIELRV
PFTVSRQAIDDVERGAKDSDWQPVKDAAKQIAFAEDRAIFEGYPAASITGVRASGSNPELKLPIDAKDYPEAISQAITSL
RLAGVNGPYSLLLNADAFTAINETSDHGYPIREHLRRVLDGEIIWAPAIDGAFLLSTRGGDYELHLGQDLSIGYLSHDAN
SVELYFQESMTFLMYTSEAVVSLA
>Q2RVS0 ~~~enc~~~Type 1 encapsulin shell protein~~~COG1659
MNDLMRDLAPISAKAWAEIETEARGTLTVTLAARKVVDFKGPLGWDASSVSLGRTEALAEEPKAAGSAAVVTVRKRAVQP
LIELCVPFTLKRAELEAIARGASDADLDPVIEAARAIAIAEDRAVFHGFAAGGITGIGEASAEHALDLPADLADFPGVLV
RALAVLRDRGVDGPYALVLGRTVYQQLMETTTPGGYPVLQHVRRLFEGPLIWAPGVDGAMLISQRGGDFELTVGRDFSIG
YHDHDAQSVHLYLQESMTFRCLGPEAAVPLRGLSQAATKA
>Q9WZP2 3.4.-.-~~~enc~~~Type 1 encapsulin shell protein~~~COG1659
MEFLKRSFAPLTEKQWQEIDNRAREIFKTQLYGRKFVDVEGPYGWEYAAHPLGEVEVLSDENEVVKWGLRKSLPLIELRA
TFTLDLWELDNLERGKPNVDLSSLEETVRKVAEFEDEVIFRGCEKSGVKGLLSFEERKIECGSTPKDLLEAIVRALSIFS
KDGIEGPYTLVINTDRWINFLKEEAGHYPLEKRVEECLRGGKIITTPRIEDALVVSERGGDFKLILGQDLSIGYEDREKD
AVRLFITETFTFQVVNPEALILLKF
>Q1D6H3 ~~~encB~~~Encapsulin nanocompartment cargo protein EncB~~~COG1633
MAGPPDSDLDDVARIRLVLARELETINEYEAYARASSNPEVRAFFQHLAAEEKEHVSEAVHMLRMLDSGQNDHFAKPFVP
GHFQAAEAPAPATVHVPTPDGPAFSVNGRNGRLPSEPPTSLPPQRLLYGLPAPPPAVESHPLTVGSLRRGGGGSGSGR
>Q1D3Y8 ~~~encC~~~Encapsulin nanocompartment cargo protein EncC~~~COG3461
MPQTNPFHSLVPRKMTDTELARSIRLNIEAELDAINLYAAHIDATDNEDAKAILQHVMDEEREHAALFWELIARLDPEQA
AHAKEAVEKYRLITSGASHEAVEAVGKEGAAPSPADVTPEKRLTVGSLRR
>Q1D9P3 ~~~encD~~~Encapsulin nanocompartment cargo protein EncD~~~
MAKNSNPSAFDRDFGYLMPFLDRVAAAASDLEDASARAELTRLMVEEKARWQRIQELLGGAGGRGAAAPTPAREAPAEAP
RLARGSADELHEAAPFATGLTVGSLRGSR
>Q743F5 ~~~enc1~~~Type 1 encapsulin shell protein~~~COG1659
MNNLYRDLAPVTEAAWGEIELEASRTFKRHVAGRRVVDVSEPGGPAAAAVSTGRLIDVEAPTNGVVAHLRASKPLVRLRV
PFTLSRYEIDNVERGANDSDWDPVKEAAKKLAFVEDRAIFEGYAAASIDGIRSASSNKPLALPADPREIPDVITQAISEL
RLAGVDGPYSVLLSADVYTKVSETTEHGYPILEHIDRLVPGDIIWAPAIDGAFVLTTRGGDFDLQLGTDVSIGYTSHDAD
TVQLYLQETLTFLCYTAEAAVPLTS
>Q48899 ~~~enc~~~Type 2A encapsulin shell protein~~~
MTSAQNESQALGDLAARQLANATKTVPQLSTITPRWLLHLLNWVPVEAGIYRVNRVVNPEQVAIKAEAGAGSEEPVPQTY
VDYETSPREYTLRSISTLVDIHTRVSDLYSSPHDQIAQQLRLTIETIKERQELELINSPEYGLLAQATGRQTIQTLAGAP
TPDDLDALITKVWKTPSFFLTHPLGIAAFGREATYRGVPPPVVSLFGPTQFITWHGIRLPSDKVPVEDGKTKFILVRTGE
ERQGVVGLFQPGLVGEQAPGLSVRFTGINQSAIATYLVTLYTSLAVLTDDALAVLDDVAVDQFHEYK
>P46841 ~~~enc~~~Type 2A encapsulin shell protein~~~COG0664
MTSAQNESQALGDLAAGQLANATKTVPQLSTITPRWLLHLLNWVPVEAGVYRVNRVVNPERVAVKAEAGAGTEAPLPETF
VDYETSPREYTLRTISTLLDIHTRVSDLYSSPHDQITQQLRLTIETIKERQECELVNSPEFGLLAQVTPEQTIRTFAGAP
TPDDLDALITKVWKMPSFFLTHPQGIAAFGREATYRGVPPVVVSLFGAQFITWRGIPLIPSDKVPVQDGETKFILVRTGE
ERQGVVGLFQPGLVGEQAPGLSVRFTGINQAAIATYLVTLYTSLAVLTDDALAVLDNVAVDQFHEYK
>I3NID5 ~~~enc2~~~Type 2A encapsulin shell protein~~~COG0664
MTSAQNESQALGDLAARQLANATKTVPQLSTITPRWLLHLLNWVPVEAGIYRVNRVVNPEQVAIKAEAGAGSEEPLPQTY
VDYETSPREYTLRSISTLVDIHTRVSDLYSSPHDQIAQQLRLTIETIKERQELELINSPEYGLLAQATPEQTIQTLAGAP
TPDDLDALITKVWKTPSFFLTHPLGIAAFGREATYRGVPPPVVSLFGAQFITWRGIPLIPSDKVPVEDGKTKFILVRTGE
ERQGVVGLFQPGLVGEQAPGLSVRFTGINQSAIATYLVTLYTSLAVLTDDALAVLDDVAVDQFHEYK
>Q55032 ~~~enc~~~Type 2A encapsulin shell protein SrpI~~~COG0664
MTDNAPQLALRDVAARQLANATKTVPQLRTITPRWLVRLLHWTPVEAGIYRVNQVKDASQITVACSERDESELPETFVDY
IDNPREYLLSAVNTVVDVHTRISDLYSNPHDQIREQLRLTIEIMKERQESELINSREYGLLNNVAPGQLVHTRNGAPTPD
DLDELLIRVWKEPAFFLAHPQAIAAFGRECTRRGVPPATVSLFGSSFITWRGVPLIPSDKVPLENGKTKILLLRVGESRQ
GVVGLYQPNLPGEQGMGLSVRFMGINRKALASYLVSLYCSLAVLTDDALAVLDNVDVTQYHTYRYN
>P25736 3.1.21.1~~~endA~~~Endonuclease-1~~~COG2356
MYRYLSIAAVVLSAAFSGPALAEGINSFSQAKAAAVKVHADAPGTFYCGCKINWQGKKGVVDLQSCGYQVRKNENRASRV
EWEHVVPAWQFGHQRQCWQDGGRKNCAKDPVYRKMESDMHNLQPSVGEVNGDRGNFMYSQWNGGEGQYGQCAMKVDFKEK
AAEPPARARGAIARTYFYMRDQYNLTLSRQQTQLFNAWNKMYPVTDWECERDERIAKVQGNHNPYVQRACQARKS
>P0AB83 4.2.99.18~~~nth~~~Endonuclease III~~~COG0177
MNKAKRLEILTRLRENNPHPTTELNFSSPFELLIAVLLSAQATDVSVNKATAKLYPVANTPAAMLELGVEGVKTYIKTIG
LYNSKAENIIKTCRILLEQHNGEVPEDRAALEALPGVGRKTANVVLNTAFGWPTIAVDTHIFRVCNRTQFAPGKNVEQVE
EKLLKVVPAEFKVDCHHWLILHGRYTCIARKPRCGSCIIEDLCEYKEKVDI
>P9WQ11 4.2.99.18~~~nth~~~Endonuclease III~~~COG0177
MPGRWSAETRLALVRRARRMNRALAQAFPHVYCELDFTTPLELAVATILSAQSTDKRVNLTTPALFARYRTARDYAQADR
TELESLIRPTGFYRNKAASLIGLGQALVERFGGEVPATMDKLVTLPGVGRKTANVILGNAFGIPGITVDTHFGRLVRRWR
WTTAEDPVKVEQAVGELIERKEWTLLSHRVIFHGRRVCHARRPACGVCVLAKDCPSFGLGPTEPLLAAPLVQGPETDHLL
ALAGL
>Q81LV1 3.1.21.2~~~nfo~~~Probable endonuclease 4~~~COG0648
MLKIGSHVSMSGKKMLLAASEEAVSYGATTFMIYTGAPQNTRRKPIEELNIEAGRKHMEQNGIEEIIVHAPYIINVGNTT
KPETFQLGVDFLRMEIERTSALGVAKQIVLHPGAHVGAGADAGIQQIIKGLNEVLTPDQTVNIALETMAGKGTECGRSFE
EIAKIIDGVKYNEKLSVCFDTCHTHDAGYDIVNNFDGVLNEFDKIVGIDRLQVLHINDSKNVRGAGKDRHENIGFGHIGY
KALHHIVHHPQLTHVPKILETPYVGEDKKDKKPPYKLEIEMLKNGTFDEGLLEKIKAQ
>P0A6C1 3.1.21.2~~~nfo~~~Endonuclease 4~~~COG0648
MKYIGAHVSAAGGLANAAIRAAEIDATAFALFTKNQRQWRAAPLTTQTIDEFKAACEKYHYTSAQILPHDSYLINLGHPV
TEALEKSRDAFIDEMQRCEQLGLSLLNFHPGSHLMQISEEDCLARIAESINIALDKTQGVTAVIENTAGQGSNLGFKFEH
LAAIIDGVEDKSRVGVCIDTCHAFAAGYDLRTPAECEKTFADFARTVGFKYLRGMHLNDAKSTFGSRVDRHHSLGEGNIG
HDAFRWIMQDDRFDGIPLILETINPDIWAEEIAWLKAQQTEKAVA
>Q5KX27 3.1.21.2~~~nfo~~~Probable endonuclease 4~~~COG0648
MLKIGSHVSMSGKKMLLAASEEAASYGANTFMIYTGAPQNTKRKSIEELNIEAGRQHMQAHGIEEIVVHAPYIINIGNTT
NLDTFSLGVDFLRAEIERTEAIGAKQLVLHPGAHVGAGVEAGLRQIIRGLNEVLTREQNVQIALETMAGKGSECGRTFEE
LAYIIDGVAYNDKLSVCFDTCHTHDAGYDIVNDFDGVLEEFDRIIGLGRLKVLHINDSKNPRGSRKDRHENIGFGHIGFA
ALNYIVHHPQLEDIPKILETPYVGEDKNNKKPPYKHEIAMLRAQSFDDQLLEKINAGAE
>P9WQ13 3.1.21.2~~~end~~~Probable endonuclease 4~~~COG0648
MLIGSHVSPTDPLAAAEAEGADVVQIFLGNPQSWKAPKPRDDAAALKAATLPIYVHAPYLINLASANNRVRIPSRKILQE
TCAAAADIGAAAVIVHGGHVADDNDIDKGFQRWRKALDRLETEVPVYLENTAGGDHAMARRFDTIARLWDVIGDTGIGFC
LDTCHTWAAGEALTDAVDRIKAITGRIDLVHCNDSRDEAGSGRDRHANLGSGQIDPDLLVAAVKAAGAPVICETADQGRK
DDIAFLRERTGS
>P63538 3.1.21.2~~~nfo~~~Probable endonuclease 4~~~
MLLGSHVSMSGKKMLEGSAIEAYEYGETTFMIYTGAPQNTRRKSIEDLNITKGHEVMEKYGLSNIVVHAPYIINIANTTK
PETFNLGVDFLQQEIERTQAIGAKDIVLHPGAHVGAGVDAGINKIIEGLNEVLTNDNNVRIALETMAGKGTEIGRSFEEL
ARIIDGVHNNERLSVCFDTCHTHDAGYNVKEDFDGVLNEFDKIIGVDRIKVVHVNDSKNDRGAQKDRHENIGFGYIGFDA
LNYIVHHDSFKDIPKILETPYVGEDKKNKKPPYKLEIEMLKQQQFDPELKNKVMQQ
>Q9WYJ7 3.1.21.2~~~nfo~~~Probable endonuclease 4~~~COG0648
MIKIGAHMPISKGFDRVPQDTVNIGGNSFQIFPHNARSWSAKLPSDEAATKFKREMKKHGIDWENAFCHSGYLINLASPK
DDIWQKSVELLKKEVEICRKLGIRYLNIHPGSHLGTGEEEGIDRIVRGLNEVLNNTEGVVILLENVSQKGGNIGYKLEQL
KKIRDLVDQRDRVAITYDTCHGFDSGYDITKKEGVEALLNEIESLFGLERLKMIHLNDSKYPLGAAKDRHERIGSGFIGE
EGFAVFFSFKEIQEVPWILETPGGNEEHAEDIKKVFEIIEKFGIEVD
>Q72KH8 3.1.21.2~~~nfo~~~Endonuclease 4~~~COG0648
MPRYGFHLSIAGKKGVAGAVEEATALGLTAFQIFAKSPRSWRPRALSPAEVEAFRALREASGGLPAVIHASYLVNLGAEG
ELWEKSVASLADDLEKAALLGVEYVVVHPGSGRPERVKEGALKALRLAGVRSRPVLLVENTAGGGEKVGARFEELAWLVA
DTPLQVCLDTCHAYAAGYDVAEDPLGVLDALDRAVGLERVPVVHLNDSVGGLGSRVDHHAHLLQGKIGEGLKRVFLDPRL
KDRVFILETPRGPEEDAWNLRVLRAWLEEA
>P9WNB9 3.2.2.-~~~nei1~~~Endonuclease 8 1~~~COG0266
MPEGHTLHRLARLHQRRFAGAPVSVSSPQGRFADSASALNGRVLRRASAWGKHLFHHYVGGPVVHVHLGLYGTFTEWARP
TDGWLPEPAGQVRMRMVGAEFGTDLRGPTVCESIDDGEVADVVARLGPDPLRSDANPSSAWSRITKSRRPIGALLMDQTV
IAGVGNVYRNELLFRHRIDPQRPGRGIGEPEFDAAWNDLVSLMKVGLRRGKIIVVRPEHDHGLPSYLPDRPRTYVYRRAG
EPCRVCGGVIRTALLEGRNVFWCPVCQT
>P50465 3.2.2.-~~~nei~~~Endonuclease 8~~~COG0266
MPEGPEIRRAADNLEAAIKGKPLTDVWFAFPQLKPYQSQLIGQHVTHVETRGKALLTHFSNDLTLYSHNQLYGVWRVVDT
GEEPQTTRVLRVKLQTADKTILLYSASDIEMLTPEQLTTHPFLQRVGPDVLDPNLTPEVVKERLLSPRFRNRQFAGLLLD
QAFLAGLGNYLRVEILWQVGLTGNHKAKDLNAAQLDALAHALLEIPRFSYATRGQVDENKHHGALFRFKVFHRDGEPCER
CGSIIEKTTLSSRPFYWCPGCQH
>P96621 ~~~ndoAI~~~Antitoxin EndoAI~~~COG0864
MSESSARTEMKISLPENLVAELDGVAMREKRSRNELISQAVRAYVSERTTRHNRDLMRRGYMEMAKINLNISSEAHFAEC
EAETTVERLVSGG
>P96622 3.1.27.-~~~ndoA~~~Endoribonuclease EndoA~~~COG2337
MIVKRGDVYFADLSPVVGSEQGGVRPVLVIQNDIGNRFSPTAIVAAITAQIQKAKLPTHVEIDAKRYGFERDSVILLEQI
RTIDKQRLTDKITHLDDEMMDKVDEALQISLALIDF
>Q99Y92 3.2.1.96~~~endoS~~~Endo-beta-N-acetylglucosaminidase EndoS~~~
MDKHLLVKRTLGCVCAATLMGAALATHHDSLNTVKAEEKTVQVQKGLPSIDSLHYLSENSKKEFKEELSKAGQESQKVKE
ILAKAQQADKQAQELAKMKIPEKIPMKPLHGSLYGGYFRTWHDKTSDPTEKDKVNSMGELPKEVDLAFIFHDWTKDYSLF
WKELATKHVPKLNKQGTRVIRTIPWRFLAGGDNSGIAEDTSKYPNTPEGNKALAKAIVDEYVYKYNLDGLDVDVEHDSIP
KVDKKEDTAGVERSIQVFEEIGKLIGPKGVDKSRLFIMDSTYMADKNPLIERGAPYINLLLVQVYGSQGEKGGWEPVSNR
PEKTMEERWQGYSKYIRPEQYMIGFSFYEENAQEGNLWYDINSRKDEDKANGINTDITGTRAERYARWQPKTGGVKGGIF
SYAIDRDGVAHQPKKYAKQKEFKDATDNIFHSDYSVSKALKTVMLKDKSYDLIDEKDFPDKALREAVMAQVGTRKGDLER
FNGTLRLDNPAIQSLEGLNKFKKLAQLDLIGLSRITKLDRSVLPANMKPGKDTLETVLETYKKDNKEEPATIPPVSLKVS
GLTGLKELDLSGFDRETLAGLDAATLTSLEKVDISGNKLDLAPGTENRQIFDTMLSTISNHVGSNEQTVKFDKQKPTGHY
PDTYGKTSLRLPVANEKVDLQSQLLFGTVTNQGTLINSEADYKAYQNHKIAGRSFVDSNYHYNNFKVSYENYTVKVTDST
LGTTTDKTLATDKEETYKVDFFSPADKTKAVHTAKVIVGDEKTMMVNLAEGATVIGGSADPVNARKVFDGQLGSETDNIS
LGWDSKQSIIFKLKEDGLIKHWRFFNDSARNPETTNKPIQEASLQIFNIKDYNLDNLLENPNKFDDEKYWITVDTYSAQG
ERATAFSNTLNNITSKYWRVVFDTKGDRYSSPVVPELQILGYPLPNADTIMKTVTTAKELSQQKDKFSQKMLDELKIKEM
ALETSLNSKIFDVTAINANAGVLKDCIEKRQLLKK
>T1WGN1 3.2.1.96~~~endoS2~~~Endo-beta-N-acetylglucosaminidase EndoS2~~~
MDKHLLVKRTLGCVCAATLMGAALATHHDSLNTVKAEEKTVQTGKTDQQVGAKLVQEIREGKRGPLYAGYFRTWHDRAST
GIDGKQQHPENTMAEVPKEVDILFVFHDHTASDSPFWSELKDSYVHKLHQQGTALVQTIGVNELNGRTGLSKDYPDTPEG
NKALAAAIVKAFVTDRGVDGLDIDIEHEFTNKRTPEEDARALNVFKEIAQLIGKNGSDKSKLLIMDTTLSVENNPIFKGI
AEDLDYLLRQYYGSQGGEAEVDTINSDWNQYQNYIDASQFMIGFSFFEESASKGNLWFDVNEYDPNNPEKGKDIEGTRAK
KYAEWQPSTGGLKAGIFSYAIDRDGVAHVPSTYKNRTSTNLQRHEVDNISHTDYTVSRKLKTLMTEDKRYDVIDQKDIPD
PALREQIIQQVGQYKGDLERYNKTLVLTGDKIQNLKGLEKLSKLQKLELRQLSNVKEITPELLPESMKKDAELVMVGMTG
LEKLNLSGLNRQTLDGIDVNSITHLTSFDISHNSLDLSEKSEDRKLLMTLMEQVSNHQKITVKNTAFENQKPKGYYPQTY
DTKEGHYDVDNAEHDILTDFVFGTVTKRNTFIGDEEAFAIYKEGAVDGRQYVSKDYTYEAFRKDYKGYKVHLTASNLGET
VTSKVTATTDETYLVDVSDGEKVVHHMKLNIGSGAIMMENLAKGAKVIGTSGDFEQAKKIFDGEKSDRFFTWGQTNWIAF
DLGEINLAKEWRLFNAETNTEIKTDSSLNVAKGRLQILKDTTIDLEKMDIKNRKEYLSNDENWTDVAQMDDAKAIFNSKL
SNVLSRYWRFCVDGGASSYYPQYTELQILGQRLSNDVANTLKD
>A0A191T6Q6 3.2.1.96~~~endoSd~~~Endo-beta-N-acetylglucosaminidase EndoSd~~~
MDKRLLVKRTLGCVCAATLMGAILATHHDSLISVKAEEKTVQTGKTDQQIGAKLVQEIREGKRGPLYAGYFRTWHDRAST
GADGKQQHPENTMAEVPKEVDILFVFHDHTASDSPFWSELKDSYVHKLHQQGTALVQTIGVNELNGRTGLSKDYPDIPEG
NKALAAAIVKTFVTDRGVDGLDIDIEHEFTNKRTPEEDARALNVFKEIAQLIGKNGSDKSKLLIMDTTLSVENNPIFKGI
AEDLDYLLRQYYGSQGGEAEVDTINSDWNQYQNYIDASQFMIGFSFFEESAPKGNLWFDVNEYDPKNPERGKDIEGTRAK
KYAEWQPSTGGLKAGIFSYAIDRDGVAHVGKEYSQRTYQELEAGLKKHPVVDNISHTDYTVSRKLKALMAEDKRYDVIDQ
KDIPDAALREQVIQQVGQYKGDLERYNKTLVLTVDKIHSLKGLEKLSHLQKLELCQLSNVKEVTPDILPESMKKDAELVM
TGMTGLEKLNLRGLNRQTLDGIDVNGLTHLTSFDISHNSLDLSEKSADRKLLMTLMEQVSNHQKITVKNTAFENQKPKGY
YPQTYDTKEGHYDVDNAEHDILTDFVFGTVTKRDTFIGDEEAFAMYKEGAIDGRQYVAKDYTYEAFRKDYQGYKVHLTAS
NLGESDTSKVTATTDETYLVDVFDGEKIVPHMTLHVGNGATIMENLAKGAKVIGTSGDISLAEKVVDGVVADSFWTWDTK
NWIAFDLNSQVIAKEWRLFNGETDPRFKDKELNIQKGRLQILKDKTIHLENMSKDERDSYLADEQNWITVSEITSNQKIY
NGSITDITSRYWRFCVDEGVSSKSPQYTELQILGQRLSSDIASTVQD
>Q8A2F6 3.1.6.-~~~~~~Endo-4-O-sulfatase~~~COG3119
MGGLTLFAAQGCKAPKQVAEQAEHPNIIYVFPDQYRNQAMGFWNQEGFRDKVNFRGDPVHTPNIDTFARESMVLTSAQSN
CPLSSPHRGMLLTGMYPNRSGVPLNCNSTRPISSLRDDAECIGDVFSKAGYDCAYFGKLHADFPTPNDPENPGQYVETQR
PVWDAYTPKEQRHGFNYWYSYGTFDEHKNPHYWDTDGKRHDPKEWSPLHESGKVVSYLKNEGNVRDTKKPFFIMVGMNPP
HSPYRSLNDCEEQDFNLYKDQPLDSLLIRPNVDLNMKKAESVRYYFASVTGVDRAFGQILEALKQLGLDKNTVVIFASDH
GETMCSQRTDDPKNSPYSESMNIPFLVRFPGKIQPRVDDLLLSAPDIMPTVLGLCGLGDSIPSEVQGRNFAPLFFDEKAE
IVRPAGALYIQNLDGEKDKDGLVQSYFPSSRGIKTARYTLALYIDRKTKQLKKSLLFDDVNDPYQLNNLPLDENKEVVEQ
LYREMGTMLKEIDDPWYTEKILSDRIPY
>A3DD66 3.2.1.39~~~~~~Glucan endo-1,3-beta-D-glucosidase~~~COG5498
MPPGAKVPQAEIYKTSNLQGAVPTNSWESSILWNQYSLPIYAHPLTFKFKAEGIEVGKPALGGSGIAYFGAHKNDFTVGH
SSVYTFPDARADKISDFAVDAVMASGSGSIKATLMKGSPYAYFVFTGGNPRIDFSGTPTVFYGDSGSQCLGVTINGVNYG
LFAPSGSKWQGIGTGTITCILPAGKNYFSIAVLPDNTVSTLTYYKDYAYCFVTDTKVEWSYNETESTLTTTFTAEVSVKE
GTNKGTILALYPHQWRNNPHILPLPYTYSTLRGIMKTIQGTSFKTVYRYHGILPNLPDKGTYDREALNRYINELALQADA
PVAVDTYWFGKHLGKLSCALPIAEQLGNISAKDRFISFMKSSLEDWFTAKEGETAKLFYYDSNWGTLIGYPSSYGSDEEL
NDHHFHYGYFLHAAAQIALRDPQWASRDNWGAMVELLIKDIANWDRNDTRFPFLRNFDPYEGHSWASGHAGFADGNNQES
SSEAINAWQAIILWGEATGNKTIRDLGIYLYTTEVEAVCNYWFDLYKDIFSPSYGHNYASMVWGGKYCHEIWWNGTNSEK
HGINFLPITAASLYLGKDPNYIKQNYEEMLRECGTSQPPNWKDIQYMYYALYDPAAAKNMWNESIVPEDGESKAHTYHWI
CNLDSLGLPDFSVTADTPLYSVFNKNNIRTYVVYNASSSAKKVTFSDGKVMTVGPHSMAVSTGSESEVLAGDLNGDGKIN
STDISLMKRYLLKQIVDLPVEDDIKAADINKDGKVNSTDMSILKRVILRNYPL
>Q9KG76 3.2.1.39~~~~~~Glucan endo-1,3-beta-D-glucosidase~~~COG5498
MKGKNVQLLFALVVIILLFPTGASASPHAVSVGKGSYATEFPEIDFGGINDPGFRDQQGEPPATIYRSDRVTGPMQTNSW
WGSLAVDRFSMNQYPHPFSVRHRAEGLHVFYDAPHNMVVHENREAGTWHIHGAIGTDFTIKHSGTANFEQAVVDDYNDWY
VRGLLENGAHQMAITYGVGSPYIFVEYEDGSAVLDFDIAPDVWEMNGHVIGFSTHDHKHYAAFAPPGQNWSGIGSKTLTN
NADYIAIAKLPEKDGNMLAKFEQYAYSVVRDAVADWTYDEATGTVTTTFEVTTEAKVQGAPDGTIFALYPHQYRHLASSS
ENQLLQNYQYEIIRGTMIGLEGKRFTTELTYPGVLPSLPDLGDYDRERLIGYLHDATSDYPTGSDTYELGKYIGKLATLA
PIADQMGEYELAEQFRGELKDILEDWLQATNASGQLKGKNLFYYNENWGTILGYHAAHSSATRINDHHFHYGYFVKAAAE
IARADQEWAKSENWGGMIDLLIRDFMADRDDDLFPYLRMFDPYSGNSWADGLATFDAGNNQESSSEAMHAWTNVILWAEA
TGNKALRDRAIYLYTTEMSAINEYFFDVHQEIFPEEYGPEIVTINWGGKMDHATWWNSGKVEKYAINWLPFHGGSLYLGH
HPDYVDRAYEELRRDIGSTDWNLWSNLVWMYRAFTNPDDALQQMEASIDDYGLFDPGNEKIIERGSTKAQTYHWIHNLAE
LGRVDPTVTANHPIYAVFNKNGNRTYIVYNFSDSPITVQFSDGHSIQVEPHSFNIGNGDGPTNPDPSEPDLKNPYERIQA
EAYDAMSGIQTEGTDDDGGGDNIGWINDGDWVKYERVHFERDASSIEVRVASDTPGGRIEIRTGSPTGTLLGDVQVPNTG
GWQQWQTVTGNVQIQPGTYDVYLVFKGSPEYDLMNVNWFVFRANGQGNGDSHTHPDYTAGIRGITGNEVTIFFAPTTEAR
YVDVHLKVNNGQQLNYRMTERNGEWERVVENLSSGDVLEYSFTYEKLGPQYTTEWFTYSR
>Q47N06 3.2.1.39~~~~~~Glucan endo-1,3-beta-D-glucosidase~~~COG5498
MSHASRRRWRRATTSAATAALLCGALLTFPSAPAAAQVRLGSGSYTTVLPPGASGPSDHTGAPVAPKVTADFTQPVVTND
WWSSLIFQRYPGNPYGENLYAHPLSFKAQAHGLEVGYPDTPELVADGLKYQYTHSPDFVLGIHGLNAPAAKVAGYSDWTV
TADLSDGTRQLRTTIGQGLPFVYADVSGGPIRVEFTAPPTVWRRSGNAVGVTVNGHHYALFAPSGTTWSESDTVFTADVG
GSGYASVALLPSPDDFDRYAPYAYSFVTSTTLTYDYDPASATLTSTYRVTTEAREGTAQGTLLALYPHQWKETTTALTDL
SYASPRGPMRVVEGDRFTTELTTHGILPSLPTVDSADHQRLRALIDAELHASDPWKGASDTYWTGKALGRLAQLVPIADS
IGYTAGRDALLDLLKNKMEDWLTADGPGDNAQFYYDDQWDTLIGFPASFGSNTELNDHDFHYGYFITAAATIARYDRSWI
SEERWGPMVTTVLRDANNPDRDDERFPWLRSFSPYAGHGWASGHAGFASGNNQESSSEAMHFAASAALLGSLIGDEELRD
LGVYLHTTQASAMRRYWQNADGDAFPAGYSHDVVGMVWSDGGDHRIWWDGTPEELYGINYLPITAGSLYLGHDPEHAAAM
HQSLVTRLGRQPQVWRDIHWAHQALSDPDAALAAFEAQWQSYEPESGSSKAHTYQWLSTLAEFGTVDTSVTADTPHYAVF
RDGDRRTYVAFNPTGQPLTVTFSDGTTLTVPPGQLATG
>P38424 ~~~engB~~~Probable GTP-binding protein EngB~~~COG0218
MKVTKSEIVISAVKPEQYPEGGLPEIALAGRSNVGKSSFINSLINRKNLARTSSKPGKTQTLNFYIINDELHFVDVPGYG
FAKVSKSEREAWGRMIETYITTREELKAVVQIVDLRHAPSNDDVQMYEFLKYYGIPVIVIATKADKIPKGKWDKHAKVVR
QTLNIDPEDELILFSSETKKGKDEAWGAIKKMINR
>Q2SU58 ~~~engB~~~Probable GTP-binding protein EngB~~~
MAFLLHQARFFTTVNHLRDLPPTVQPEIAFAGRSNAGKSTAINVLCNQKRLAFASKTPGRTQHINYFSVGPAAEPVAHLV
DLPGYGYAEVPGAAKAHWEQLLSSYLQTRPQLCGMILMMDARRPLTELDRRMIEWFAPTGKPIHSLLTKCDKLTRQESIN
ALRATQKSLDAYRDAGYAGKLTVQLFSALKRTGLDDAHALIESWLRPAAADEDHAAVAE
>P0A6P7 ~~~engB~~~Probable GTP-binding protein EngB~~~COG0218
MTNLNYQQTHFVMSAPDIRHLPSDTGIEVAFAGRSNAGKSSALNTLTNQKSLARTSKTPGRTQLINLFEVADGKRLVDLP
GYGYAEVPEEMKRKWQRALGEYLEKRQSLQGLVVLMDIRHPLKDLDQQMIEWAVDSNIAVLVLLTKADKLASGARKAQLN
MVREAVLAFNGDVQVETFSSLKKQGVDKLRQKLDTWFSEMQPVEETQDGE
>A6TG68 ~~~engB~~~Probable GTP-binding protein EngB~~~
MTNWNYQLTHFVTSAPDIRHLPADTGIEVAFAGRSNAGKSSALNTLTNQKNLARTSKTPGRTQLINLFEVAEGKRLVDLP
GYGYAQVPEEMKIKWQRALGEYLEKRLCLKGLVVLMDIRHPLKDLDQQMIEWAVESDIQVLVLLTKADKLASGARKAQVN
MVREAVLAFNGDVQVEPFSSLKKSGVDKLRQKLDSWFNEIPPQEAVEDAE
>B4RQ29 ~~~engB~~~Probable GTP-binding protein EngB~~~
MNLFQNAKFFTTVNHLKDLPDTPLEIAFVGRSNAGKSSAINTLTNHVRLAYVSKTPGRTQHINFFELQNGNFMVDLPGYG
YAQVPEAVRAHWVNLLGDYLRHRKQLIGLVLIMDARHPLKELDIRMLDFFHTTGRPVHILLSKADKLSKNEQIKTLSQVK
KLLKPYSDRQNISVQLFSSLKKQGIDEANRTVGSWFDAADAAASSPEEN
>P64071 ~~~engB~~~Probable GTP-binding protein EngB~~~
MKVNPNNIELIISAVKEEQYPETELSEVALSGRSNVGKSTFINSMIGRKNMARTSQQPGKTQTLNFYNIDEQLIFVDVPG
YGYAKVSKTQREKFGKMIEEYITKRENLQLVIQLVDLRHDPTQDDILMYNYLKHFDIPTLVICTKEDKIPKGKVQKHIKN
IKTQLDMDPDDTIVSYSSIQNNKQQQIWNLIEPYIS
>Q9X1H7 ~~~engB~~~Probable GTP-binding protein EngB~~~COG0218
MIIRDVELVKVARTPGDYPPPLKGEVAFVGRSNVGKSSLLNALFNRKIAFVSKTPGKTRSINFYLVNSKYYFVDLPGYGY
AKVSKKERMLWKRLVEDYFKNRWSLQMVFLLVDGRIPPQDSDLMMVEWMKSLNIPFTIVLTKMDKVKMSERAKKLEEHRK
VFSKYGEYTIIPTSSVTGEGISELLDLISTLLKEN
>Q042F4 4.2.1.11~~~eno2~~~Enolase 2~~~
MSVITDIHAREVLDSRGNPTVEAEVYTELGGFGRAIVPSGASTGEHEAVELRDGDKSRFGGQGVLTAVENVNGEIAKAVI
GLDVTDQRLIDQTMIDLDGTPNKGRLGANAILSVSLASARAAADELGLPLYEYLGGPNAHVLPTPMMNVINGGKHADNNV
DIQEFMIMPVGAKSLHEAVRMGAETFHTLKGLLQERGESTAVGDEGGFAPNLKNNEEPFEILVEAIQRAGYKPGQDIAIA
FDCAASEFYNKDTKKYVTVADGREYTAEEWTSLIEDLVDKYPVISVEDPLDENDWEGWKTFTERLGDKVQIVGDDLFVTN
TSYLEKGIKMGVANSILIKLNQIGTLTETFEAIEMAKEAGYTAVVSHRSGETEDTTIADLVVATNAGQIKTGSMSRTDRI
AKYNQLMRIEEALGSTAQYKGIHSFYNLHKQF
>P37869 4.2.1.11~~~eno~~~Enolase~~~COG0148
MPYIVDVYAREVLDSRGNPTVEVEVYTETGAFGRALVPSGASTGEYEAVELRDGDKDRYLGKGVLTAVNNVNEIIAPELL
GFDVTEQNAIDQLLIELDGTENKGKLGANAILGVSMACARAAADFLQIPLYQYLGGFNSKTLPVPMMNIVNGGEHADNNV
DIQEFMIMPVGAPNFREALRMGAQIFHSLKSVLSAKGLNTAVGDEGGFAPNLGSNEEALQTIVEAIEKAGFKPGEEVKLA
MDAASSEFYNKEDGKYHLSGEGVVKTSAEMVDWYEELVSKYPIISIEDGLDENDWEGHKLLTERLGKKVQLVGDDLFVTN
TKKLSEGIKNGVGNSILIKVNQIGTLTETFDAIEMAKRAGYTAVISHRSGETEDSTIADIAVATNAGQIKTGAPSRTDRV
AKYNQLLRIEDQLAETAQYHGINSFYNLNK
>P42448 4.2.1.11~~~eno~~~Enolase~~~COG0148
MLVIEDVRAYEVLDSRGNPTVKAEVTLSDGSVGAAIVPSGASTGSKEALELRDNDERFGGKGVLKAVANVNETIADEILG
LDAFNQTQLDDTLRELDGTNNYSNLGANATLGVSMATARAAAAALGMPLYRYLGGANASILPVPMCNIINGGAHANNNVD
FQEFMIMPFGFTSFKEALRSVCEIYAILKKELANSGHSTALGDEGGFAPNLANNTEPIDLLMTCIKKAGYENRVKIALDV
ASTEFFKDGKYHMEGKAFSSEALIERYVELCAKYPICSIEDGLAENDFEGWIKLTEKLGNKIQLVGDDLFVTNEDILREG
IIKKMANAVLIKPNQIGTITQTMRTVRLAQRNNYKCVMSHRSGESEDAFIADFAVALNTGQIKTGALARGERTAKYNRLL
EIEFESDEYLGEKL
>A9WCM4 4.2.1.11~~~eno~~~Enolase~~~COG0148
MSTLIEAIVAREVLDSRGNPTIEVDVRLESGDVGRAIVPSGASTGAHEALELRDGDKSRYNGKGVLKAVQAVNEDIAEAL
IGFDAADQIALDQELIALDGTPNKSKLGANAILGVSLAAAKAAAAAFGLPLYRYLGGVYAHVLPVPMMNIMNGGQHATNS
TDFQEFMIMPVGAESFREGLRWGAEIYHMLKKVIHDRGFSTTVGDEGGFAPSLPTNDAPLQLIMEAIEKAGYRPGEQIVI
ALDPATTEIFEDGKYHLKREGRSLSSAEMVDYWVDLVNRYPIISLEDGLAEDDWEGWALLRAKLGDRVQLVGDDFLVTNV
QRLQRAIEAKAANSILIKLNQIGSLTETLSAIQLAQRSGWTAVVSHRSGESEDVTIADLVVATNAGQIKTGAPARTDRIA
KYNQLLRIEEELGSAARYAGRSAFKV
>B0BA40 4.2.1.11~~~eno~~~Enolase~~~
MFDVVISDIEAREILDSRGYPTLCVKVITNTGTFGEACVPSGASTGIKEALELRDKDPKRYQGKGVLQAISNVEKVLVPA
LQGFSVFDQITADAIMIDADGTPNKEKLGANAILGVSLALAKAAANTLQRPLYRYLGGSFSHVLPCPMMNLINGGMHATN
GLQFQEFMIRPISAPSLKEAVRMGAEVFNALKKILQNRQLATGVGDEGGFAPNLASNAEALDLLLTAIETAGFTPREDIS
LALDCAASSFYNTQDKTYDGKSYADQVGILAELCEHYPIDSIEDGLAEEDFEGWKLLSETLGDRVQLVGDDLFVTNSALI
AEGIAQGLANAVLIKPNQIGTLTETAEAIRLATIQGYATILSHRSGETEDTTIADLAVAFNTGQIKTGSLSRSERIAKYN
RLMAIEEEMGPEALFQDSNPFSKA
>Q83B44 4.2.1.11~~~eno~~~Enolase~~~COG0148
MTATITDINAHEILDSRANPTLEVRVTLSSQAYGCAAVPSGASTGEREAVELRDNDLERYGGKGVLQAVENVNGPIRDAL
LGQDPRSQEEIDRIMIELDGTENKANLGANAILGVSLAVAYAAANNADLPLYRYLGGDGGPFSMPVPMMNIINGGAHATN
NLDFQEFMIVPVGAPTFAEALRYGAEVFHALKKRLVSRGLMSAVGDEGGFAPDLPNNEAAFELILEAIEDANYVPGKDIY
LALDAASSELYQNGRYDFENNQLTSEEMIDRLTEWTKKYPVISIEDGLSENDWAGWKLLTERLENKVQLVGDDIFVTNPD
ILEKGIKKNIANAILVKLNQIGTLTETLATVGLAKSNKYGVIISHRSGETEDTTIADLAVATDARQIKTGSLCRSDRVAK
YNRLLQIERELNDQAPYAGKEAFLFNRK
>Q72F92 4.2.1.11~~~eno~~~Enolase~~~COG0148
MSTIVSVWAREILDSRGNPTIEVEVSLESGHSGRAAVPSGASTGTREALELRDGDKGRYKGKGVEKAVDNVMGEIAEAVI
GLDALRQVQLDNTLLDLDGTDNKERLGANAMLGVSLATARAASSFLGLPLYQYLGGVNAKVLPVPLMNIINGGAHAPNNL
DIQEFMIMPIGAATFRDALRMGAETFHTLKALLAADGHVTSVGDEGGFAPNLKSHDEAFKYITRAIEESGYIPGAEIALA
IDAAASEFYRDGKYHLAGEGKTFSNSEMTEWLGEFTAKYPLISIEDGLAEGDWEGWGELTYKLGDTVQLVGDDIFVTNPD
ILAQGIDEGVANSILIKLNQIGTLTETLDTIEMAKQAAYTTVISHRSGETEDHFIADLAVGLNAGQIKTGSLCRSDRLAK
YNQLLRIEEDLDDAGIYFGPMIASHFGYEGDEEFEDA
>O32513 4.2.1.11~~~eno~~~Enolase~~~COG0148
MSTIVSVWAREILDSRGNPTVEVEVSLESGHTGRAAVPSGASTGSREALEMRDGDKGRYKGKGVEKAVDNVMGEIAEAIV
GLDSLRQVQVDNTLLDLDGTDNKSRLGANAMLGVSLATARAASSFLGLPLYQYLGGVNAKVLPVPLMNIINGGAHAPNNL
DIQEFMIMPIGAATFRDALRMGAETFHTLKALLAADGHVTSVGDEGGFAPNLKNHDEAFRYIMKAIEEAGYIPGAEIALA
IDAAASEFHKDGKYVLAGEGKNLSNSEMVEWLGEFTTRYPLISIEDGLAEADWDGWRELTYKLGDTIQLVGDDIFVTNPD
ILAEGIDEGVANSILIKLNQIGTLTETLDTIEMAKQAAYTTVISHRSGETEDHFISDLAVGLNAGQIKTGSLCRSDRLAK
YNQLLRIEEDLDDTGIYFGPMMSSHFGFEEEGEE
>P0A6P9 4.2.1.11~~~eno~~~Enolase~~~COG0148
MSKIVKIIGREIIDSRGNPTVEAEVHLEGGFVGMAAAPSGASTGSREALELRDGDKSRFLGKGVTKAVAAVNGPIAQALI
GKDAKDQAGIDKIMIDLDGTENKSKFGANAILAVSLANAKAAAAAKGMPLYEHIAELNGTPGKYSMPVPMMNIINGGEHA
DNNVDIQEFMIQPVGAKTVKEAIRMGSEVFHHLAKVLKAKGMNTAVGDEGGYAPNLGSNAEALAVIAEAVKAAGYELGKD
ITLAMDCAASEFYKDGKYVLAGEGNKAFTSEEFTHFLEELTKQYPIVSIEDGLDESDWDGFAYQTKVLGDKIQLVGDDLF
VTNTKILKEGIEKGIANSILIKFNQIGSLTETLAAIKMAKDAGYTAVISHRSGETEDATIADLAVGTAAGQIKTGSMSRS
DRVAKYNQLIRIEEALGEKAPYNGRKEIKGQA
>E6ER18 4.2.1.11~~~eno~~~Enolase~~~
MSIITDIYAREVLDSRGNPTIEVEVYTESGAFGRGMVPSGASTGEYEAVELRDGDKARYLGKGVTKAVDNVNNIIAEAII
GYDVRDQMAIDKAMIDLDGTPNKGKLGANAILGVSIAVARAAADYLEVPLYHYLGGFNTKVLPTPMMNIINGGSHADNSI
DFQEFMIMPVGAPTFKEALRMGAEVFHALASILKGRGLATSVGDEGGFAPNLGSNEEGFEVIIEAIEKAGYVPGKDVVLA
MDAASSEFYDKEKGVYVLADSGEGEKTTEEMIAFYEELVSKYPIISIEDGLDENDWDGFKKLTEVLGDKVQLVGDDLFVT
NTTKLAEGIEKGIANSILIKVNQIGTLTETFEAIEMAKEAGYTAVVSHRSGETEDSTISDIAVATNAGQIKTGSLSRTDR
IAKYNQLLRIEDQLGDVAEYKGLKSFYNLKNK
>Q8GR70 4.2.1.11~~~eno~~~Enolase~~~
MSIITDVYAREILDSRGNPTIEVEVYTESGAFGRGMVPSGASTGEYEAVELRDGDKARYGGKGVTKAVDNVNNIIAEAII
GYDVRDQMAIDKAMIALDGTPNKGKLGANAILGVSIAVARAAADYLEVPLYHYLGGFNTKVLPTPMMNIINGGSHADNSI
DFQEFMIMPVGAPTFKEALRMGAEVFHALAAILKSRGLATSVGDEGGFAPNLGSNEEGFEVIIEAIEKAGYVPGKDVVLA
MDAASSEFYDKEKGVYVLADSGEGEKTTDEMIKFYEELVSKYPIISIEDGLDENDWDGFKKLTDVLGDKVQLVGDDLFVT
NTQKLSEGIEKGIANSILIKVNQIGTLTETFEAIEMAKEAGYTAVVSHRSGETEDSTISDIAVATNAGQIKTGSLSRTDR
IAKYNQLLRIEDQLGEVAEYKGLKSFYNLKAA
>Q5ZTX1 4.2.1.11~~~eno~~~Enolase~~~COG0148
MHIHKIQAREILDSRGNPTIEADVTLTTGIIGRASVPSGASTGSREACELRDNDPKRYAGKGVQKAVKHVNNEINQALQG
LSVEDQENLDRILCQLDNTENKSHLGANAILATSLACARARALSLNQPLYMTLNQGDMMTMPVPMMNILNGGAHADNNVD
IQEFMIMPIGAPDFPVALQMGTEIFHVLKSVLKKQGLNTAVGDEGGFAPNIQSNRQALDLLSEAIEKAGFRLGEDIVFAL
DVAASELFNEGFYHMYSENQKFDSHQLIEYYANLISSYPIVSIEDGLDEKDWSGWKQLTTHLGNKVQLVGDDLFVTNPKI
LREGIAQGIANAILIKVNQIGTLSETRQAIKLAYDNGYRCVMSHRSGETEDTFIADLAVASGCGQIKTGSLCRTDRTAKY
NQLLRINELASLPYAGKNILKR
>P64074 4.2.1.11~~~eno~~~Enolase~~~COG0148
MSIITEVYAREVLDSRGNPTVEVEVYTEAGAFGRALVPSGASTGEYEAVELRDGDKARYLGKGVLKAVENVNDIIADKII
GFDVTDQIGIDKAMIELDGTPNKGKLGANAILGVSLAAARAAADELGVHLYEYLGGVNGKVLPVPMMNILNGGEHADNNV
DVQEFMVMPVGAPNFKEALRMGAEILHALKAVLKGKGLNTGVGDEGGFAPNLKSNEEALETIMQAIKDAGYKPGEEVKLA
MDAASSEFYNRETGKYELKGEGVTRTSEEMVTWYEEMITKYPIISIEDGLDENDWDGFKLLTERIGDRVQLVGDDLFVTN
TTKLKEGIEKGIANSILIKVNQIGTLTETLDAIEMAKRAGYTAVISHRSGETEDSTIADIAVATNAGQIKTGAPTRTDRV
AKYNQLLRIEDNLADLAEYHGNDTFYNLKK
>P75189 4.2.1.11~~~eno~~~Enolase~~~
MSAQTGTDLFKIADLFAYQVFDSRGFPTVACVVKLASGHTGEAMVPSGASTGEKEAIELRDGDPKAYFGKGVSQAVQNVN
QTIAPKLIGLNATDQAAIDALMIQLDGTPNKAKLGANAILAVSLAVAKAAASAQKTSLFKYLANQVMGLNKTEFILTVPM
LNVINGGAHADNNIDFQEFMIMPLGANSMHQALKMASETFHALQKLLKQRGLNTNKGDEGGFAPNLKLAEEALDLMVEAI
KAAGYQPGSDIAIALDVAASEFYDDTTKRYVFKKGIKAKILDEKEWSLTTAQMIAYLKKLTEQYPIISIEDGLSEHDWEG
METLTKTLGQHIQIVGDDLYCTNPAIAEKGVAHKATNSILIKLNQIGTLTETIKAINIAKDANWSQVISHRSGETEDTTI
ADLAVAACTGQIKTGSMSRSERIAKYNRLLQIELELGNNAKYLGWNTFKNIKPQKA
>A0R3B8 4.2.1.11~~~eno~~~Enolase~~~COG0148
MPIIEQVGAREILDSRGNPTVEVEVALTDGTFARAAVPSGASTGEHEAVELRDGGSRYGGKGVEKAVEAVLDEIAPQVIG
LSADDQRLVDQALLDLDGTPDKSRLGANAILGVSLAVSKAAAESAGLPLFRYIGGPNAHILPVPMMNILNGGAHADTGVD
VQEFMVAPIGAPSFKEALRWGAEVYHSLKSVLKNQGLATGLGDEGGFAPDVAGTKAALDLISSAIEATGLKLGSDVALAL
DVAATEFYTEGSGYAFEKETRTAEQMAEFYAGLLDSYPLVSIEDPLSEDDWDGWVSLTAAIGDRIQLVGDDLFVTNPERL
EDGIQRGAANALLVKVNQIGTLTETLDAVSLAHNSGYRTMMSHRSGETEDTTIADLAVAVGSGQIKTGAPARSERVAKYN
QLLRIEETLGDAARYAGDLAFPRLEAK
>P9WNL1 4.2.1.11~~~eno~~~Enolase~~~COG4948
MPIIEQVRAREILDSRGNPTVEVEVALIDGTFARAAVPSGASTGEHEAVELRDGGDRYGGKGVQKAVQAVLDEIGPAVIG
LNADDQRLVDQALVDLDGTPDKSRLGGNAILGVSLAVAKAAADSAELPLFRYVGGPNAHILPVPMMNILNGGAHADTAVD
IQEFMVAPIGAPSFVEALRWGAEVYHALKSVLKKEGLSTGLGDEGGFAPDVAGTTAALDLISRAIESAGLRPGADVALAL
DAAATEFFTDGTGYVFEGTTRTADQMTEFYAGLLGAYPLVSIEDPLSEDDWDGWAALTASIGDRVQIVGDDIFVTNPERL
EEGIERGVANALLVKVNQIGTLTETLDAVTLAHHGGYRTMISHRSGETEDTMIADLAVAIGSGQIKTGAPARSERVAKYN
QLLRIEEALGDAARYAGDLAFPRFACETK
>B4EUF7 4.2.1.11~~~eno~~~Enolase~~~COG0148
MSKIVKVLGREIIDSRGNPTVEAEVHLEGGFVGMAAAPSGASTGSREALELRDGDKSRFLGKGVLKAVEAVNGPIAKALL
GQDAKDQANIDKIMIDLDGTENKSNFGANAILAVSLANAKAAAAAKGMPLYEHISDLNGTHGQYSMPLPMMNIINGGEHA
DNNVDIQEFMIQPVGAPTLKEAVRMGSEIFHHLAKVLKAKGMNTAVGDEGGYAPNLESNAAALAAIKEAVEAAGYVLGKD
VTLAMDCAASEFYNNETGNYELKGEGKTFTSQEFTHYLEELTKQYLIVSIEDGLNESDWDGFAYQTKVLGDKIQLVGDDL
FVTNTKILKEGIDKGIANSILIKFNQIGSLTETLAAIKMAKDAGYTAIISHRSGETEDATIADLAVGTAAGQIKTGSMSR
SDRVAKYNQLIRIEEALGSKAPFNGLKEVKGQA
>Q02RA7 4.2.1.11~~~eno~~~Enolase~~~
MAKIVDIKGREVLDSRGNPTVEADVILDNGIVGSACAPSGASTGSREALELRDGDKSRYLGKGVLKAVANINGPIRDLLL
GKDAADQKALDHAMIELDGTENKAKLGANAILAVSLAAAKAAAQAKGVPLYAHIADLNGTPGQYSMPVPMMNIINGGEHA
DNNVDIQEFMVQPVGAKNFAEALRMGAEIFHHLKAVLKARGLNTAVGDEGGFAPNLSSNEDALAAIAEAVEKAGYKLGDD
VTLALDCASSEFFKDGKYDLEGEGKVFDAAGFADYLAGLTQRYPIISIEDGMDESDWAGWKGLTDKIGAKVQLVGDDLFV
TNTKILKEGIEKGIGNSILIKFNQIGSLTETLEAIQMAKAAGYTAVISHRSGETEDSTIADLAVGTAAGQIKTGSLCRSD
RVSKYNQLLRIEEQLGAKAPYRGRAEFRG
>Q2G028 4.2.1.11~~~eno~~~Enolase~~~COG0148
MPIITDVYAREVLDSRGNPTVEVEVLTESGAFGRALVPSGASTGEHEAVELRDGDKSRYLGKGVTKAVENVNEIIAPEII
EGEFSVLDQVSIDKMMIALDGTPNKGKLGANAILGVSIAVARAAADLLGQPLYKYLGGFNGKQLPVPMMNIVNGGSHSDA
PIAFQEFMILPVGATTFKESLRWGTEIFHNLKSILSKRGLETAVGDEGGFAPKFEGTEDAVETIIQAIEAAGYKPGEEVF
LGFDCASSEFYENGVYDYSKFEGEHGAKRTAAEQVDYLEQLVDKYPIITIEDGMDENDWDGWKQLTERIGDRVQLVGDDL
FVTNTEILAKGIENGIGNSILIKVNQIGTLTETFDAIEMAQKAGYTAVVSHRSGETEDTTIADIAVATNAGQIKTGSLSR
TDRIAKYNQLLRIEDELFETAKYDGIKSFYNLDK
>P99088 4.2.1.11~~~eno~~~Enolase~~~
MPIITDVYAREVLDSRGNPTVEVEVLTESGAFGRALVPSGASTGEHEAVELRDGDKSRYLGKGVTKAVENVNEIIAPEII
EGEFSVLDQVSIDKMMIALDGTPNKGKLGANAILGVSIAVARAAADLLGQPLYKYLGGFNGKQLPVPMMNIVNGGSHSDA
PIAFQEFMILPVGATTFKESLRWGTEIFHNLKSILSKRGLETAVGDEGGFAPKFEGTEDAVETIIQAIEAAGYKPGEEVF
LGFDCASSEFYENGVYDYSKFEGEHGAKRTAAEQVDYLEQLVDKYPIITIEDGMDENDWDGWKQLTERIGDRVQLVGDDL
FVTNTEILAKGIENGIGNSILIKVNQIGTLTETFDAIEMAQKAGYTAVVSHRSGETEDTTIADIAVATNAGQIKTGSLSR
TDRIAKYNQLLRIEDELFETAKYDGIKSFYNLDK
>O69174 4.2.1.11~~~eno~~~Enolase~~~
MPIITDVYAREVLDSRGNPTVEVEVLTESGAFGRALVPSGASTGEHEAVELRDGDKSRYLGKGVTKAVENVNEIIAPEII
EGEFSVLDQVSIDKMMIALDGTPNKGKLGANAILGVSIAVARAAADLLGQPLYKYLGGFNGKQLPVPMMNIVNGGSHSDA
PIAFQEFMILPVGATTFKESLRWGTEIFHNLKSILSQRGLETAVGDEGGFAPKFEGTEDAVETIIQAIEAAGYKPGEEVF
LGFDCASSEFYENGVYDYSKFEGEHGAKRTAAEQVDYLEQLVDKYPIITIEDGMDENDWDGWKQLTERIGDRVQLVGDDL
FVTNTEILAKGIENGIGNSILIKVNQIGTLTETFDAIEMAQKAGYTAVVSHRSGETEDTTIADIAVATNAGQIKTGSLSR
TDRIAKYNQLLRIEDELFETAKYDGIKSFYNLDK
>Q8RP81 4.2.1.11~~~eno~~~Enolase~~~
MAIITDVYAREVLDSRGNPTLEVEVYTESGAFGRGMVPSGASTGEHEAVELRDGDKSRYGGLGTQKAVDNVNNVIAEAII
GYDVRDQQAIDRAMIALDGTPNKGKLGANAILGVSIAVARAAADYLEVPLYSYLGGFNTKVLPTPMMNIINGGSHSDAPI
AFQEFMIMPVGAPTFKEALRWGAEVFHALKKILKERGLETAVGDEGGFAPKFEGTEDGVETILKAIEAAGYEAGENGIMI
GFDCASSEFYDAERKVYDYSKFEGEGGAVRTAAEQIDYLEELVNKYPIITIEDGMDENDWDGWKALTERLGGRVQLVGDD
FFVTNTDYLARGIKEEAANSILIKVNQIGTLTETFEAIEMAKEAGYTAVVSHRSGETEDSTIADIAVATNAGQIKTGSLS
RTDRIAKYNQLLRIEDQLGEVAQYKGIKSFYNLDKCGR
>Q8DTS9 4.2.1.11~~~eno~~~Enolase~~~COG0148
MSIITDVYAREVLDSRGNPTLEVEVYTESGAFGRGMVPSGASTGEHEAVELRDGDKSRYGGLGTQKAVDNVNNIIAEALI
GYDVRDQQAIDKAMIALDGTPNKGKLGANAILGVSIAVARAAADFLEIPLYSYLGGFNTKVLPTPMMNIINGGSHSDAPI
AFQEFMIVPAGAPTFKEALRWGAEIFHALKKILKERGLETAVGDEGGFAPKFDGTEDAVETIIKAIETAGYKPGEEVFLG
FDCASSEFYDNGVYDYTKFEGEKGAKRSAAEQIDYIEELVNKYPIITIEDAMDENDWDGWKALTARLGDRVQLVGDDFFV
TNTDYLARGIKEGAANSILIKVNQIGTLTETFEAIEMAKEAGYTAVVSHRSGETEDSTIADISVATNAGQIKTGSLSRTD
RIAKYNQLLRIEDQLGEVAEYRGLKSFYNLKK
>P69949 4.2.1.11~~~eno~~~Enolase~~~
MSIITDVYAREVLDSRGNPTLEVEVYTESGAFGRGMVPSGASTGEHEAVELRDGDKSRYLGLGTQKAVDNVNNIIAEAII
GYDVRDQQAIDRAMIALDGTPNKGKLGANAILGVSIAVARAAADYLEVPLYTYLGGFNTKVLPTPMMNIINGGSHSDAPI
AFQEFMIMPVGAPTFKEGLRWGAEVFHALKKILKERGLVTAVGDEGGFAPKFEGTEDGVETILKAIEAAGYEAGENGIMI
GFDCASSEFYDKERKVYDYTKFEGEGAAVRTSAEQVDYLEELVNKYPIITIEDGMDENDWDGWKVLTERLGKRVQLVGDD
FFVTNTEYLARGIKENAANSILIKVNQIGTLTETFEAIEMAKEAGYTAVVSHRSGETEDSTIADIAVATNAGQIKTGSLS
RTDRIAKYNQLLRIEDQLGEVAQYKGIKSFYNLKK
>Q5XD01 4.2.1.11~~~eno~~~Enolase~~~
MSIITDVYAREVLDSRGNPTLEVEVYTESGAFGRGMVPSGASTGEHEAVELRDGDKSRYLGLGTQKAVDNVNNIIAKAII
GYDVRDQQAIDRAMIALDGTPNKGKLGANAILGVSIAVARAAADYLEVPLYTYLGGFNTKVLPTPMMNIINGGSHSDAPI
AFQEFMIMPVGAPTFKEGLRWGAEVFHALKKILKERGLVTAVGDEGGFAPKFEGTEDGVETILKAIEAAGYEAGENGIMI
GFDCASSEFYDKERKVYDYTKFEGEGAAVRTSAEQVDYLEELVNKYPIITIEDGMDENDWDGWKVLTERLGKRVQLVGDD
FFVTNTEYLARGIKENAANSILIKVNQIGTLTETFEAIEMAKEAGYTAVVSHRSGETEDSTIADIAVATNAGQIKTGSLS
RTDRIAKYNQLLRIEDQLGEVAQYKGIKSFYNLKK
>Q97QS2 4.2.1.11~~~eno~~~Enolase~~~COG0148
MSIITDVYAREVLDSRGNPTLEVEVYTESGAFGRGMVPSGASTGEHEAVELRDGDKSRYGGLGTQKAVDNVNNIIAEAII
GYDVRDQQAIDRAMIALDGTPNKGKLGANAILGVSIAVARAAADYLEIPLYSYLGGFNTKVLPTPMMNIINGGSHSDAPI
AFQEFMILPVGAPTFKEALRYGAEIFHALKKILKSRGLETAVGDEGGFAPRFEGTEDGVETILAAIEAAGYVPGKDVFIG
FDCASSEFYDKERKVYDYTKFEGEGAAVRTSAEQIDYLEELVNKYPIITIEDGMDENDWDGWKALTERLGKKVQLVGDDF
FVTNTDYLARGIQEGAANSILIKVNQIGTLTETFEAIEMAKEAGYTAVVSHRSGETEDSTIADIAVATNAGQIKTGSLSR
TDRIAKYNQLLRIEDQLGEVAEYRGLKSFYNLKK
>Q8DPS0 4.2.1.11~~~eno~~~Enolase~~~COG0148
MSIITDVYAREVLDSRGNPTLEVEVYTESGAFGRGMVPSGASTGEHEAVELRDGDKSRYGGLGTQKAVDNVNNIIAEAII
GYDVRDQQAIDRAMIALDGTPNKGKLGANAILGVSIAVARAAADYLEIPLYSYLGGFNTKVLPTPMMNIINGGSHSDAPI
AFQEFMILPVGAPTFKEALRYGAEIFHALKKILKSRGLETAVGDEGGFAPRFEGTEDGVETILAAIEAAGYVPGKDVFLG
FDCASSEFYDKERKVYDYTKFEGEGAAVRTSAEQIDYLEELVNKYPIITIEDGMDENDWDGWKALTERLGKKVQLVGDDF
FVTNTDYLARGIQEGAANSILIKVNQIGTLTETFEAIEMAKEAGYTAVVSHRSGETEDSTIADIAVATNAGQIKTGSLSR
TDRIAKYNQLLRIEDQLGEVAEYRGLKSFYNLKK
>Q5N3P4 4.2.1.11~~~eno~~~Enolase~~~COG0148
MPDDYGTQIAEITAREILDSRGRPTVEAEVHLEDGSVGLAQVPSGASTGTFEAHELRDDDPSRYGGKGVQKAVENVSAIE
DALIGLSALDQEGLDKAMIALDGTPNKKNLGANAILAVSLATAHAAATSLNLPLYRYLGGPLANVLPVPMMNVINGGAHA
DNNVDFQEFMIMPVGAPSFKEALRWGAEVFHALAKVLKDKGLATGVGDEGGFAPNLGSNKEALELLLTAIEAAGYKPGEQ
VALAMDVASSEFYKNGLYTCDGVSHEPAGMIGILADLVSQYPIVSIEDGLQEDDWSNWKTLTQQLGSTVQLVGDDLFVTN
PDRLQSGIEQGVGNAVLIKLNQIGTLTETLRTIDLATRSGYRSVISHRSGETEDTTIADLAVATRAGQIKTGSLSRSERI
AKYNRLLRIEAALGENALYAGAIGLGPKGR
>P42848 4.2.1.11~~~eno~~~Enolase~~~COG0148
MYVEIVDVRAREVLDSRGNPTVEAEVVLEDGTMGRAIVPSGASTGKFEALEIRDKDKKRYLGKGVLKAVENVNETIAPAL
IGMNAFDQPLVDKTLIELDGTENKSKLGANAILAVSMAVARAAANYLGLPLYKYLGGVNAKVLPVPLMNVINGGQHADNN
LDLQEFMIVPAGFDSFREALRAGAEIFHTLKKILHEAGHVTAVGDEGGFAPNLSSNEEAIKVLIEAIEKAGYKPGEEVFI
ALDCAASSFYDEEKGVYYVDGEEKSSEVLMGYYEELVAKYPIISIEDPFAEEDWDAFVEFTKRVGNKVQIVGDDLYVTNV
KRLSKGIELKATNSILIKLNQIGTVTETLDAVEMAQKNNMTAIISHRSGESEDTFIADLAVATNAGFIKTGSLSRSERIA
KYNQLLRIEEELGKVAEFRGLKSFYSIKR
>P33675 4.2.1.11~~~eno~~~Enolase~~~COG0148
MTAIVSIHGRQVVDSRGNPTVEVDVTLEDGSFGRAAVPSGASTGVHEAVELRDGDKTRWGGKGVTKAVHAVNNEIANAII
GLEAEDQELIDQTMIKLDGTPNKGKFGANAILGVSLAVAKAAAEARGLPLYRYVGGTAAHVLPVPMMNIVNGGMHADNPI
DFQEFMIAPVGASSINEAVRIGTEVFHTLKKELSAKGMNTNVGDEGGFAPSLDSASSALDFIVDSISKAGYKPGEDVFIA
LDAASSEFYNKDQNIYDLKGEGRKLTSAQLVDYYVELCGKYPIYSIEDGLAEDDFEGWKILTEKLGDKVQLVGDDLFVTN
VKRLSDGIERGIANSLLVKFNQIGSLSETLAAVNMANDASYTAVMSHRSGETEDTTIADLAVATNCGQIKTGSLCRSERI
AKYNQLMRIEEELGSVAKYAGRSVLRKAK
>Q03415 3.4.19.11~~~~~~Gamma-D-glutamyl-L-diamino acid endopeptidase 1~~~
MDILIRPGDSLWYFSDLFKIPLQLLLDSNRNINPQLLQVGQRIQIPGYVTTSYTITQGDSLWQIAQNKNLPLNAILLVNP
EIQPSRLHIGQTIQVPQRLTWRLVNGQQNYDYSMMMNDIKKLQTAYPFLQGTPIGNSVLAQPIPEILIGNGSKRIHYKAS
FHANEWITTPIIMTFLNDYLLALTNQTTIRGLSMGPLYNQTTLSLVPMVNPDGVNLVINGPPANEALKNKLIAWNHNSQN
FSGWKANINGVDLNDQFPAKWELENARNPQTPGPRDYGGEAPLTQPEAIAMADLTRSRNFAWVLAFHTQGRVIYWGFENL
EPPESQTMVEEFSRVSGYEPIQSANSYAGYKDWFIQDWRRPGFTVELGSGTNPLPISEFDTIYQEALGIFLAGLYL
>P15047 1.3.1.28~~~entA~~~2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase~~~COG1028
MDFSGKNVWVTGAGKGIGYATALAFVEAGAKVTGFDQAFTQEQYPFATEVMDVADAAQVAQVCQRLLAETERLDALVNAA
GILRMGATDQLSKEDWQQTFAVNVGGAFNLFQQTMNQFRRQRGGAIVTVASDAAHTPRIGMSAYGASKAALKSLALSVGL
ELAGSGVRCNVVSPGSTDTDMQRTLWVSDDAEEQRIRGFGEQFKLGIPLGKIARPQEIANTILFLASDLASHITLQDIVV
DGGSTLGA
>P0ADI4 6.3.2.14~~~entB~~~Enterobactin synthase component B~~~COG1535
MAIPKLQAYALPESHDIPQNKVDWAFEPQRAALLIHDMQDYFVSFWGENCPMMEQVIANIAALRDYCKQHNIPVYYTAQP
KEQSDEDRALLNDMWGPGLTRSPEQQKVVDRLTPDADDTVLVKWRYSAFHRSPLEQMLKESGRNQLIITGVYAHIGCMTT
ATDAFMRDIKPFMVADALADFSRDEHLMSLKYVAGRSGRVVMTEELLPAPIPASKAALREVILPLLDESDEPFDDDNLID
YGLDSVRMMALAARWRKVHGDIDFVMLAKNPTIDAWWKLLSREVK
>P01553 ~~~entC1~~~Enterotoxin type C-1~~~
MNKSRFISCVILIFALILVLFTPNVLAESQPDPTPDELHKASKFTGLMENMKVLYDDHYVSATKVKSVDKFLAHDLIYNI
SDKKLKNYDKVKTELLNEGLAKKYKDEVVDVYGSNYYVNCYFSSKDNVGKVTGGKTCMYGGITKHEGNHFDNGNLQNVLI
RVYENKRNTISFEVQTDKKSVTAQELDIKARNFLINKKNLYEFNSSPYETGYIKFIENNGNTFWYDMMPAPGDKFDQSKY
LMMYNDNKTVDSKSVKIEVHLTTKNG
>P34071 ~~~entC2~~~Enterotoxin type C-2~~~
MNKSRFISCVILIFALILVLFTPNVLAESQPDPTPDELHKSSEFTGTMGNMKYLYDDHYVSATKVMSVDKFLAHDLIYNI
SDKKLKNYDKVKTELLNEDLAKKYKDEVVDVYGSNYYVNCYFSSKDNVGKVTGGKTCMYGGITKHEGNHFDNGNLQNVLI
RVYENKRNTISFEVQTDKKSVTAQELDIKARNFLINKKNLYEFNSSPYETGYIKFIENNGNTFWYDMMPAPGDKFDQSKY
LMMYNDNKTVDSKSVKIEVHLTTKNG
>P0A0L3 ~~~entC3~~~Enterotoxin type C-3~~~
MYKRLFISRVILIFALILVISTPNVLAESQPDPMPDDLHKSSEFTGTMGNMKYLYDDHYVSATKVKSVDKFLAHDLIYNI
SDKKLKNYDKVKTELLNEDLAKKYKDEVVDVYGSNYYVNCYFSSKDNVGKVTGGKTCMYGGITKHEGNHFDNGNLQNVLV
RVYENKRNTISFEVQTDKKSVTAQELDIKARNFLINKKNLYEFNSSPYETGYIKFIENNGNTFWYDMMPAPGDKFDQSKY
LMMYNDNKTVDSKSVKIEVHLTTKNG
>P0A0L5 ~~~entC3~~~Enterotoxin type C-3~~~
MYKRLFISRVILIFALILVISTPNVLAESQPDPMPDDLHKSSEFTGTMGNMKYLYDDHYVSATKVKSVDKFLAHDLIYNI
SDKKLKNYDKVKTELLNEDLAKKYKDEVVDVYGSNYYVNCYFSSKDNVGKVTGGKTCMYGGITKHEGNHFDNGNLQNVLV
RVYENKRNTISFEVQTDKKSVTAQELDIKARNFLINKKNLYEFNSSPYETGYIKFIENNGNTFWYDMMPAPGDKFDQSKY
LMMYNDNKTVDSKSVKIEVHLTTKNG
>P0AEJ2 5.4.4.2~~~entC~~~Isochorismate synthase EntC~~~COG1169
MDTSLAEEVQQTMATLAPNRFFFMSPYRSFTTSGCFARFDEPAVNGDSPDSPFQQKLAALFADAKAQGIKNPVMVGAIPF
DPRQPSSLYIPESWQSFSRQEKQASARRFTRSQSLNVVERQAIPEQTTFEQMVARAAALTATPQVDKVVLSRLIDITTDA
AIDSGVLLERLIAQNPVSYNFHVPLADGGVLLGASPELLLRKDGERFSSIPLAGSARRQPDEVLDREAGNRLLASEKDRH
EHELVTQAMKEVLRERSSELHVPSSPQLITTPTLWHLATPFEGKANSQENALTLACLLHPTPALSGFPHQAATQVIAELE
PFDRELFGGIVGWCDSEGNGEWVVTIRCAKLRENQVRLFAGAGIVPASSPLGEWRETGVKLSTMLNVFGLH
>P19925 2.7.8.-~~~entD~~~Enterobactin synthase component D~~~COG2977
MKTTHTSLPFAGHTLHFVEFDPANFCEQDLLWLPHYAQLQHAGRKRKTEHLAGRIAAVYALREYGYKCVPAIGELRQPVW
PAEVYGSISHCGTTALAVVSRQPIGIDIEEIFSVQTARELTDNIITPAEHERLADCGLAFSLALTLAFSAKESAFKASEI
QTDAGFLDYQIISWNKQQVIIHRENEMFAVHWQIKEKIVITLCQHD
>P10378 6.3.2.14~~~entE~~~Enterobactin synthase component E~~~COG1021
MSIPFTRWPEEFARRYREKGYWQDLPLTDILTRHAASDSIAVIDGERQLSYRELNQAADNLACSLRRQGIKPGETALVQL
GNVAELYITFFALLKLGVAPVLALFSHQRSELNAYASQIEPALLIADRQHALFSGDDFLNTFVTEHSSIRVVQLLNDSGE
HNLQDAINHPAEDFTATPSPADEVAYFQLSGGTTGTPKLIPRTHNDYYYSVRRSVEICQFTQQTRYLCAIPAAHNYAMSS
PGSLGVFLAGGTVVLAADPSATLCFPLIEKHQVNVTALVPPAVSLWLQALIEGESRAQLASLKLLQVGGARLSATLAARI
PAEIGCQLQQVFGMAEGLVNYTRLDDSAEKIIHTQGYPMCPDDEVWVADAEGNPLPQGEVGRLMTRGPYTFRGYYKSPQH
NASAFDANGFYCSGDLISIDPEGYITVQGREKDQINRGGEKIAAEEIENLLLRHPAVIYAALVSMEDELMGEKSCAYLVV
KEPLRAVQVRRFLREQGIAEFKLPDRVECVDSLPLTAVGKVDKKQLRQWLASRASA
>P11454 6.3.2.14~~~entF~~~Enterobactin synthase component F~~~COG1020
MSQHLPLVAAQPGIWMAEKLSELPSAWSVAHYVELTGEVDSPLLARAVVAGLAQADTLRMRFTEDNGEVWQWVDDALTFE
LPEIIDLRTNIDPHGTAQALMQADLQQDLRVDSGKPLVFHQLIQVADNRWYWYQRYHHLLVDGFSFPAITRQIANIYCTW
LRGEPTPASPFTPFADVVEEYQQYRESEAWQRDAAFWAEQRRQLPPPASLSPAPLPGRSASADILRLKLEFTDGEFRQLA
TQLSGVQRTDLALALAALWLGRLCNRMDYAAGFIFMRRLGSAALTATGPVLNVLPLGIHIAAQETLPELATRLAAQLKKM
RRHQRYDAEQIVRDSGRAAGDEPLFGPVLNIKVFDYQLDIPDVQAQTHTLATGPVNDLELALFPDVHGDLSIEILANKQR
YDEPTLIQHAERLKMLIAQFAADPALLCGDVDIMLPGEYAQLAQLNATQVEIPETTLSALVAEQAAKTPDAPALADARYL
FSYREMREQVVALANLLRERGVKPGDSVAVALPRSVFLTLALHAIVEAGAAWLPLDTGYPDDRLKMMLEDARPSLLITTD
DQLPRFSDVPNLTSLCYNAPLTPQGSAPLQLSQPHHTAYIIFTSGSTGRPKGVMVGQTAIVNRLLWMQNHYPLTGEDVVA
QKTPCSFDVSVWEFFWPFIAGAKLVMAEPEAHRDPLAMQQFFAEYGVTTTHFVPSMLAAFVASLTPQTARQSCATLKQVF
CSGEALPADLCREWQQLTGAPLHNLYGPTEAAVDVSWYPAFGEELAQVRGSSVPIGYPVWNTGLRILDAMMHPVPPGVAG
DLYLTGIQLAQGYLGRPDLTASRFIADPFAPGERMYRTGDVARWLDNGAVEYLGRSDDQLKIRGQRIELGEIDRVMQALP
DVEQAVTHACVINQAAATGGDARQLVGYLVSQSGLPLDTSALQAQLRETLPPHMVPVVLLQLPQLPLSANGKLDRKALPL
PELKAQAPGRAPKAGSETIIAAAFSSLLGCDVQDADADFFALGGHSLLAMKLAAQLSRQVARQVTPGQVMVASTVAKLAT
IIDAEEDSTRRMGFETILPLREGNGPTLFCFHPASGFAWQFSVLSRYLDPQWSIIGIQSPRPNGPMQTAANLDEVCEAHL
ATLLEQQPHGPYYLLGYSLGGTLAQGIAARLRARGEQVAFLGLLDTWPPETQNWQEKEANGLDPEVLAEINREREAFLAA
QQGSTSTELFTTIEGNYADAVRLLTTAHSVPFDGKATLFVAERTLQEGMSPERAWSPWIAELDIYRQDCAHVDIISPGTF
EKIGPIIRATLNR
>P0A8Y8 3.1.2.-~~~entH~~~Proofreading thioesterase EntH~~~COG2050
MIWKRHLTLDELNATSDNTMVAHLGIVYTRLGDDVLEAEMPVDTRTHQPFGLLHGGASAALAETLGSMAGFMMTRDGQCV
VGTELNATHHRPVSEGKVRGVCQPLHLGRQNQSWEIVVFDEQGRRCCTCRLGTAVLG
>P54355 3.4.24.74~~~btfP~~~Fragilysin~~~
MFILNFNKMKNVKLLLMLGTAALLAACSNEADSLTTSIDAPVTASIDLQSVSYTDLATQLNDVSDFGKMIILKDNGFNRQ
VHVSMDKRTKIQLDNENVRLFNGRDKDSTSFILGDEFAVLRFYRNGESISYIAYKEAQMMNEIAEFYAAPFKKTRAINEK
EAFECIYDSRTRSAGKDIVSVKINIDKAKKILNLPECDYINDYIKTPQVPHGITESQTRAVPSEPKTVYVICLRENGSTI
YPNEVSAQMQDAANSVYAVHGLKRYVNFHFVLYTTEYSCPSGDAKEGLEGFTASLKSNPKAEGYDDQIYFLIRWGTWDNK
ILGMSWFNSYNVNTASDFEASGMSTTQLMYPGVMAHELGHILGAEHTDNSKDLMYATFTGYLSHLSEKNMDIIAKNLGWE
AADGD
>P24077 ~~~entS~~~Enterobactin exporter EntS~~~COG0477
MNKQSWLLNLSLLKTHPAFRAVFLARFISIVSLGLLGVAVPVQIQMMTHSTWQVGLSVTLTGGAMFVGLMVGGVLADRYE
RKKVILLARGTCGIGFIGLCLNALLPEPSLLAIYLLGLWDGFFASLGVTALLAATPALVGRENLMQAGAITMLTVRLGSV
ISPMIGGLLLATGGVAWNYGLAAAGTFITLLPLLSLPALPPPPQPREHPLKSLLAGFRFLLASPLVGGIALLGGLLTMAS
AVRVLYPALADNWQMSAAQIGFLYAAIPLGAAIGALTSGKLAHSARPGLLMLLSTLGSFLAIGLFGLMPMWILGVVCLAL
FGWLSAVSSLLQYTMLQTQTPEAMLGRINGLWTAQNVTGDAIGAALLGGLGAMMTPVASASASGFGLLIIGVLLLLVLVE
LRHFRQTPPQVTASDS
>P37690 ~~~envC~~~Murein hydrolase activator EnvC~~~COG4942
MTRAVKPRRFAIRPIIYASVLSAGVLLCAFSAHADERDQLKSIQADIAAKERAVRQKQQQRASLLAQLKKQEEAISEATR
KLRETQNTLNQLNKQIDEMNASIAKLEQQKAAQERSLAAQLDAAFRQGEHTGIQLILSGEESQRGQRLQAYFGYLNQARQ
ETIAQLKQTREEVAMQRAELEEKQSEQQTLLYEQRAQQAKLTQALNERKKTLAGLESSIQQGQQQLSELRANESRLRNSI
ARAEAAAKARAEREAREAQAVRDRQKEATRKGTTYKPTESEKSLMSRTGGLGAPRGQAFWPVRGPTLHRYGEQLQGELRW
KGMVIGASEGTEVKAIADGRVILADWLQGYGLVVVVEHGKGDMSLYGYNQSALVSVGSQVRAGQPIALVGSSGGQGRPSL
YFEIRRQGQAVNPQPWLGR
>P0AEJ4 2.7.13.3~~~envZ~~~Sensor histidine kinase EnvZ~~~COG2205
MRRLRFSPRSSFARTLLLIVTLLFASLVTTYLVVLNFAILPSLQQFNKVLAYEVRMLMTDKLQLEDGTQLVVPPAFRREI
YRELGISLYSNEAAEEAGLRWAQHYEFLSHQMAQQLGGPTEVRVEVNKSSPVVWLKTWLSPNIWVRVPLTEIHQGDFSPL
FRYTLAIMLLAIGGAWLFIRIQNRPLVDLEHAALQVGKGIIPPPLREYGASEVRSVTRAFNHMAAGVKQLADDRTLLMAG
VSHDLRTPLTRIRLATEMMSEQDGYLAESINKDIEECNAIIEQFIDYLRTGQEMPMEMADLNAVLGEVIAAESGYEREIE
TALYPGSIEVKMHPLSIKRAVANMVVNAARYGNGWIKVSSGTEPNRAWFQVEDDGPGIAPEQRKHLFQPFVRGDSARTIS
GTGLGLAIVQRIVDNHNGMLELGTSERGGLSIRAWLPVPVTRAQGTTKEG
>P0AEJ5 2.7.13.3~~~envZ~~~Sensor histidine kinase EnvZ~~~
MRRLRFSPRSSFARTLLLIVTLLFASLVTTYLVVLNFAILPSLQQFNKVLAYEVRMLMTDKLQLEDGTQLVVPPAFRREI
YRELGISLYSNEAAEEAGLRWAQHYEFLSHQMAQQLGGPTEVRVEVNKSSPVVWLKTWLSPNIWVRVPLTEIHQGDFSPL
FRYTLAIMLLAIGGAWLFIRIQNRPLVDLEHAALQVGKGIIPPPLREYGASEVRSVTRAFNHMAAGVKQLADDRTLLMAG
VSHDLRTPLTRIRLATEMMSEQDGYLAESINKDIEECNAIIEQFIDYLRTGQEMPMEMADLNAVLGEVIAAESGYEREIE
TALYPGSIEVKMHPLSIKRAVANMVVNAARYGNGWIKVSSGTEPNRAWFQVEDDGPGIAPEQRKHLFQPFVRGDSARTIS
GTGLGLAIVQRIVDNHNGMLELGTSERGGLSIRAWLPVPVTRAQGTTKEG
>A0A4P7TSF2 2.7.13.3~~~envZ~~~Sensor histidine kinase EnvZ~~~
MRRLRFSPRSSFARTLLLIVTLLFASLVTTYLVVLNFAILPSLQQFNKVLAYEVRMLMTDKLQLEDGTQLVVPPAFRREI
YRELGISLYSNEAAEEAGLRWAQHYEFLSHQMAQQLGGPTEVRVEVNKSSPVVWLKTWLSPNIWVRVPLTEIHQGDFSPL
FRYTLAIMLLAIGGAWLFIRIQNRPLVDLEHAALQVGKGIIPPPLREYGASEVRSVTRAFNHMAAGVKQLADDRTLLMAG
VSHDLRTPLTRIRLATEMMSEQDGYLAESINKDIEECNAIIEQFIDYLRTGQEMPMEMADLNAVLGEVIAAESGYEREIE
TALYPGSIEVKMHPLSIKRAVANMVVNAARYGNGWIKVSSGTEPNRAWFQVEDDGPGIAPEQRKHLFQPFVRGDSARTIS
GTGLGLAIVQRIVDNHNGMLELGTSERGGLSIRAWLPVPVTRAQGTTKEG
>I6YGS0 3.3.2.10~~~ephA~~~Epoxide hydrolase A~~~COG2267
MGAPTERLVDTNGVRLRVVEAGEPGAPVVILAHGFPELAYSWRHQIPALADAGYHVLAPDQRGYGGSSRPEAIEAYDIHR
LTADLVGLLDDVGAERAVWVGHDWGAVVVWNAPLLHADRVAAVAALSVPALPRAQVPPTQAFRSRFGENFFYILYFQEPG
IADAELNGDPARTMRRMIGGLRPPGDQSAAMRMLAPGPDGFIDRLPEPAGLPAWISQEELDHYIGEFTRTGFTGGLNWYR
NFDRNWETTADLAGKTISVPSLFIAGTADPVLTFTRTDRAAEVISGPYREVLIDGAGHWLQQERPGEVTAALLEFLTGLE
LR
>P95276 3.3.2.10~~~~~~Epoxide hydrolase B~~~
MSQVHRILNCRGTRIHAVADSPPDQQGPLVVLLHGFPESWYSWRHQIPALAGAGYRVVAIDQRGYGRSSKYRVQKAYRIK
ELVGDVVGVLDSYGAEQAFVVGHDWGAPVAWTFAWLHPDRCAGVVGISVPFAGRGVIGLPGSPFGERRPSDYHLELAGPG
RVWYQDYFAVQDGIITEIEEDLRGWLLGLTYTVSGEGMMAATKAAVDAGVDLESMDPIDVIRAGPLCMAEGARLKDAFVY
PETMPAWFTEADLDFYTGEFERSGFGGPLSFYHNIDNDWHDLADQQGKPLTPPALFIGGQYDVGTIWGAQAIERAHEVMP
NYRGTHMIADVGHWIQQEAPEETNRLLLDFLGGLRP
>I6YC03 3.3.2.10~~~ephB~~~Epoxide hydrolase B~~~COG0596
MSQVHRILNCRGTRIHAVADSPPDQQGPLVVLLHGFPESWYSWRHQIPALAGAGYRVVAIDQRGYGRSSKYRVQKAYRIK
ELVGDVVGVLDSYGAEQAFVVGHDWGAPVAWTFAWLHPDRCAGVVGISVPFAGRGVIGLPGSPFGERRPSDYHLELAGPG
RVWYQDYFAVQDGIITEIEEDLRGWLLGLTYTVSGEGMMAATKAAVDAGVDLESMDPIDVIRAGPLCMAEGARLKDAFVY
PETMPAWFTEADLDFYTGEFERSGFGGPLSFYHNIDNDWHDLADQQGKPLTPPALFIGGQYDVGTIWGAQAIERAHEVMP
NYRGTHMIADVGHWIQQEAPEETNRLLLDFLGGLRP
>P9WGS3 1.-.-.-~~~ephD~~~Probable oxidoreductase EphD~~~COG1028
MPATQQMSRLVDSPDGVRIAVYHEGNPDGPTVVLVHGFPDSHVLWDGVVPLLAERFRIVRYDNRGVGRSSVPKPISAYTM
AHFADDFDAVIGELSPGEPVHVLAHDWGSVGVWEYLRRPGASDRVASFTSVSGPSQDHLVNYVYGGLRRPWRPRTFLRAI
SQTLRLSYMALFSVPVVAPLLLRVALSSAAVRRNMVGDIPVDQIHHSETLARDAAHSVKTYPANYFRSFSSSRRGRAIPI
VDVPVQLIVNSQDPYVRPYGYDQTARWVPRLWRRDIKAGHFSPMSHPQVMAAAVHDFADLADGKQPSRALLRAQVGRPRG
YFGDTLVSVTGAGSGIGRETALAFAREGAEIVISDIDEATVKDTAAEIAARGGIAYPYVLDVSDAEAVEAFAERVSAEHG
VPDIVVNNAGIGQAGRFLDTPAEQFDRVLAVNLGGVVNGCRAFGQRLVERGTGGHIVNVSSMAAYAPLQSLSAYCTSKAA
TYMFSDCLRAELDAAGVGLTTICPGVIDTNIVATTGFHAPGTDEEKIDGRRGQIDKMFALRSYGPDKVADAIVSAVKKKK
PIRPVAPEAYALYGISRVLPQALRSTARLRVI
>O33283 3.3.2.10~~~ephG~~~Epoxide hydrolase EphG~~~COG4308
MAELTETSPETPETTEAIRAVEAFLNALQNEDFDTVDAALGDDLVYENVGFSRIRGGRRTATLLRRMQGRVGFEVKIHRI
GADGAAVLTERTDALIIGPLRVQFWVCGVFEVDDGRITLWRDYFDVYDMFKGLLRGLVALVVPSLKATL
>O53388 3.3.2.10~~~ephH~~~Epoxide hydrolase EphH~~~COG2021
MSSAVLADHVERQLDELGWETSHIVGNSLGGWVAFELERRGRARSVTGIAPAGGWTRWSPVKFEVIAKFIAGAPILAVAH
ILGQRALRLPFSRLLATLPISATPDGVSERELSGIIDDAAHCPAYFQLLVKALVLPGLQELEHTAVPSHVVLCEQDRVVP
PSRFSRHFTDSLPAGHRLTVLDGVGHVPMFEAPGRITELITSFIEECCPHVRAS
>P30197 4.1.1.-~~~epiD~~~Epidermin decarboxylase~~~
MYGKLLICATASINVININHYIVELKQHFDEVNILFSPSSKNFINTDVLKLFCDNLYDEIKDPLLNHINIVENHEYILVL
PASANTINKIANGICDNLLTTVCLTGYQKLFIFPNMNIRMWGNPFLQKNIDLLKNNDVKVYSPDMNKSFEISSGRYKNNI
TMPNIENVLNFVLNNEKRPLD
>Q3IZP4 5.1.99.-~~~epi~~~Ethylmalonyl-CoA/methylmalonyl-CoA epimerase~~~COG0346
MIGRLNHVAIAVPDLEAAAAQYRNTLGAEVGAPQDEPDHGVTVIFITLPNTKIELLHPLGEGSPIAGFLEKNPAGGIHHI
CYEVEDILAARDRLKEAGARVLGSGEPKIGAHGKPVLFLHPKDFNGCLVELEQV
>P0A8N7 6.3.1.-~~~epmA~~~Elongation factor P--(R)-beta-lysine ligase~~~COG2269
MSETASWQPSASIPNLLKRAAIMAEIRRFFADRGVLEVETPCMSQATVTDIHLVPFETRFVGPGHSQGMNLWLMTSPEYH
MKRLLVAGCGPVFQLCRSFRNEEMGRYHNPEFTMLEWYRPHYDMYRLMNEVDDLLQQVLDCPAAESLSYQQAFLRYLEID
PLSADKTQLREVAAKLDLSNVADTEEDRDTLLQLLFTFGVEPNIGKEKPTFVYHFPASQASLAQISTEDHRVAERFEVYY
KGIELANGFHELTDAREQQQRFEQDNRKRAARGLPQHPIDQNLIEALKVGMPDCSGVALGVDRLVMLALGAETLAEVIAF
SVDRA
>Q9ZJ12 6.3.1.-~~~epmA~~~Elongation factor P--(R)-beta-lysine ligase~~~
MSETATWQPSASIPNLLKRAAIMAEIRRFFADRGVLEVETPCMSQATVTDIHLFPFETRFVGPGHSQGINLYLMTSPEYH
MKRLLAAGCGPVFQLCRSFRNEEMGRHHNPEFTMLEWYRPHYDMYRLMNEVDDLLQQVLDCQPAESLSYQQAFQRHLEID
PLSADKTQLREAAAKLDLSNIADTEEDRDTLLQLLFTMGVEPHIGKEKPTFIYHFPASQASLAQISTEDHRVAERFEVYY
KGIELANGFHELTDAREQQQRFEQDNRKRAARGLPQQPIDQNLLDALAAGLPDCSGVALGVDRLVMLALGAESLADVIAF
TVDRA
>P39280 5.4.3.-~~~epmB~~~L-lysine 2,3-aminomutase~~~COG1509
MAHIVTLNTPSREDWLTQLADVVTDPDELLRLLNIDAEEKLLAGRSAKKLFALRVPRSFIDRMEKGNPDDPLLRQVLTSQ
DEFVIAPGFSTDPLEEQHSVVPGLLHKYHNRALLLVKGGCAVNCRYCFRRHFPYAENQGNKRNWQTALEYVAAHPELDEM
IFSGGDPLMAKDHELDWLLTQLEAIPHIKRLRIHSRLPIVIPARITEALVECFARSTLQILLVNHINHANEVDETFRQAM
AKLRRVGVTLLNQSVLLRDVNDNAQTLANLSNALFDAGVMPYYLHVLDKVQGAAHFMVSDDEARQIMRELLTLVSGYLVP
KLAREIGGEPSKTPLDLQLRQQ
>P76938 1.14.-.-~~~epmC~~~Elongation factor P hydroxylase~~~COG3101
MNSTHHYEQLIEIFNSCFADDFNTRLIKGDDEPIYLPADAEVPYNRIVFAHGFYASAIHEISHWCIAGKARRELVDFGYW
YCPDGRDAQTQSQFEDVEVKPQALDWLFCVAAGYPFNVSCDNLEGDFEPDRVVFQRRVHAQVMDYLTNGIPERPARFIKA
LQNYYHTPELTAEQFPWPEALN
>P71053 2.4.-.-~~~epsD~~~Putative glycosyltransferase EpsD~~~COG0438
MTKKILFCATVDYHFKAFHLPYFKWFKQMGWEVHVAANGQTKLPYVDEKFSIPIRRSPFDPQNLAVYRQLKKVIDTYEYD
IVHCHTPVGGVLARLAARQARRHGTKVLYTAHGFHFCKGAPMKNWLLYYPVEKWLSAYTDCLITINEEDYIRAKGLQRPG
GRTQKIHGIGVNTERFRPVSPIEQQRLREKHGFREDDFILVYPAELNLNKNQKQLIEAAALLKEKIPSLRLVFAGEGAME
HTYQTLAEKLGASAHVCFYGFCSDIHELIQLADVSVASSIREGLGMNVLEGMAAEQPAIATDNRGHREIIRDGENGFLIK
IGDSAAFARRIEQLYHKPELCRKLGQEGRKTALRFSEARTVEEMADIYSAYMDMDTKEKSV
>P71054 2.4.-.-~~~epsE~~~Putative glycosyltransferase EpsE~~~COG1215
MNSGPKVSVIMGIYNCERTLAESIESILSQSYKNWELILCDDASTDGTLRIAKQYAAHYSDRIKLIQNKTNKRLAASLNH
CLSHATGDYIARQDGDDLSFPRRLEKQVAFLEKHRHYQVVGTGMLVFDEFGVRGARILPSVPEPGIMAKGTPFCHGTIMM
RASAYRTLKGYRSVRRTRRMEDIDLWLRFFEEGFRGYNLQEALYKVREDSDAFKRRSFTYSIDNAILVYQACRRLKLPLS
DYIYIAKPLIRAFMPAAVMNRYHKKRVMNQKEGLVKHE
>P71055 2.4.-.-~~~epsF~~~Putative glycosyltransferase EpsF~~~COG0438
MNSSQKRVLHVLSGMNRGGAETMVMNLYRKMDKSKVQFDFLTYRNDPCAYDEEILSLGGRLFYVPSIGQSNPLTFVRNVR
NAIKENGPFSAVHAHTDFQTGFIALAARLAGVPVRVCHSHNTSWKTGFNWKDRLQLLVFRRLILANATALCACGEDAGRF
LFGQSNMERERVHLLPNGIDLELFAPNGQAADEEKAARGIAADRLIIGHVARFHEVKNHAFLLKLAAHLKERGIRFQLVL
AGDGPLCGEIEEEARQQNLLSDVLFLGTEERIHELMRTFDVFVMPSLYEGLPVVLVEAQASGLPCIISDSITEKVDAGLG
LVTRLSLSEPISVWAETIARAAAAGRPKREFIKETLAQLGYDAQQNVGALLNVYNISTEKDHNR
>P71056 ~~~epsG~~~Transmembrane protein EpsG~~~
MIVYAVNMGIVFIWSWFAKMCGGRDDSLATGYRPNKLLIWIPLASLVLVSGLRYRVGTDFQTYTLLYELAGDYQNVWQIF
GFGTAKTATDPGFTALLWLMNFITEDPQIMYFTVAVVTYSFIMKTLADYGRPFELSVFLFLGTFHYYASFNGIRQYMVAA
VLFWAIRYIISGNWKRYFLIVLVSSLFHSSALIMIPVYFIVRRKAWSPAIFGLSALFLGMTFLYQKFISVFVVVLENSSY
SHYEKWLMTNTNGMNVIKIAVLVLPLFLAFCYKERLRSLWPQIDIVVNLCLLGFLFGLLATKDVIFARFNIYFGLYQMIL
VPYFVRIFDEKSNALIYIAIVVCYFLYSYLLMPVDSSVLPYRTIFSR
>P71057 2.4.-.-~~~epsH~~~Putative glycosyltransferase EpsH~~~COG1216
METPAVSLLVAVYNTETYIRTCLESLRNQTMDNIEIIIVNDGSADASPDIAEEYAKMDNRFKVIHQENQGLGAVRNKGIE
AARGEFIAFIDSDDWIEPDYCEQMLRTAGDETDLVICNYAAEFEDTGKTMDSDIAQTYQDQPKEHYIKALFEGKVRGFSW
NKLYRRSMIEAHRLSFPLRGELEHVEDQFFSFRAHFFARSVSYVKTPLYHYRIHLSSIVQRYQKKLFESGLALYETNAAF
LQENNKLEEYRKELDTFIVLHSSICMLNEWKTSGSRRLFEKLRNVGVICADPVFQESLSKTGTAPFDAKRSCLLLMAKYR
MIPFVAMASAVYQRVIEYKMRNRG
>P71058 2.-.-.-~~~epsI~~~Putative pyruvyl transferase EpsI~~~COG5039
MSLQSLKINFAEWLLLKVKYPSQYWLGAADQPVKAAAHQKKIILTLLPSHDNLGDHAIAYASKAFLEQEYPDFDIVEVDM
KDIYKSAKSLIRSRHPEDMVFIIGGGNMGDLYRYEEWTRRFIIKTFHDYRVVQLPATAHFSDTKKGRKELKRAQKIYNAH
PGLLLMARDETTYQFMKQHFQEKTILKQPDMVLYLDRSKAPAEREGVYMCLREDQESVLQEEQRNRVKAALCEEFGEIKS
FTTTIGRRVSRDTREHELEALWSKLQSAEAVVTDRLHGMIFCALTGTPCVVIRSFDHKVMEGYQWLKDIPFMKLIEHPEP
ERVTAAVNELLTKETSRAGFPRDVYFKGLRDKISGEAQ
>P71059 2.4.-.-~~~epsJ~~~Uncharacterized glycosyltransferase EpsJ~~~COG1216
MIPLVSIIVPMYNVEPFIEECIDSLLRQTLSDIEIILVNDGTPDRSGEIAEDYAKRDARIRVIHQANGGLSSARNTGIKA
ARGTYIGFVDGDDYVSSAMFQRLTEEAEQNQLDIVGCGFYKQSSDRRTYVPPQLEANRVLTKPEMTEQLKHAHETRFIWY
VWRYLYRRELFERANLLFDEDIRFAEDSPFNLSAFREAERVKMLDEGLYIYRENPNSLTEIPYKPAMDEHLQKQYQAKIA
FYNHYGLAGACKEDLNVYICRHQLPMLLANACASPNSPKDIKKKIRQILSYDMVRQAVRHTPFQHEKLLRGERLVLALCK
WRLTFLIKLFFEQRGTMKGSAKQA
>P71060 ~~~epsK~~~Uncharacterized membrane protein EpsK~~~COG2244
MKFTINFSANLTAFLLSVFLSVWMTPFIVKTLGVEAFGFVHLTQNVINYFSVITVALSSVVVRFFSVAAHRGEREKANAY
ISNYLAASVLISLLLLLPLAGSAFFIDRVMNVPQALLADVRLSILIGSVLFILTFLMAGFGAAPFYANRLYITSSIQAVQ
MLIRVLSVLLLFACFAPKIWQIQLAALAGAVIASVLSFYFFKKLIPWFSFRMKDLSFRTSKELFQAGAWSSVNQIGVLLF
LQIDLLTANLMLGASASGKYAAIIQFPLLLRSLAGTVASLFAPIMTSYYSKGDMEGLMNYANKAVRLNGLLLALPAALLG
GLAGPFLTIWLGPSFSTIAPLLFIHAGYLVVSLAFMPLFYIWTAFNQQKTPAIVTLLLGAVNVVLAVTLSGPAHLGLYGI
TLAGAISLILKNAIFTPLYVSRITGYKKHVFLKGIIGPLSAAVFAWTVCKAIQFIVKIDSWPSLIATGVTVSFCYAVFAF
MLVCTKEERQLVLKRFRKTKGAVNL
>P71062 2.-.-.-~~~epsL~~~Uncharacterized sugar transferase EpsL~~~COG2148
MILKRLFDLTAAIFLLCCTSVIILFTIAVVRLKIGSPVFFKQVRPGLHGKPFTLYKFRTMTDERDSKGNLLPDEVRLTKT
GRLIRKLSIDELPQLLNVLKGDLSLVGPRPLLMDYLPLYTEKQARRHEVKPGITGWAQINGRNAISWEKKFELDVWYVDN
WSFFLDLKILCLTVRKVLVSEGIQQTNHVTAERFTGSGDVSS
>P71063 2.3.1.203~~~epsM~~~UDP-N-acetylbacillosamine N-acetyltransferase~~~COG0110
MKNVAIVGDGGHGKVIRELINARSDTRLAAVLDDKFKTFEGGKEWYTGPPKAVTELRRLIPDVLFLIAVGNNSVRKQLAE
RLGLGKDDFITLIHPSAIVSKSAVIGEGTVIMAGAIIQADARIGAHCIINTGAVAEHDNQISDYVHLSPRATLSGAVSVQ
EGAHVGTGASVIPQIIIGAWSIVGAGSAVIRSIPDRVTAAGAPARIISSIQTSNKG
>Q795J3 2.6.1.-~~~epsN~~~Putative pyridoxal phosphate-dependent aminotransferase EpsN~~~COG0399
MHKKIYLSPPHMSGREQHYISEAFRSNWIAPLGPLVNSFEEQLAERVGVKAAAAVGSGTAAIHLALRLLEVKEGDSVFCQ
SFTFVATANPILYEKAVPVFIDSEPDTWNMSPTALERALEEAKRNGTLPKAVIAVNLYGQSAKMDEIVSLCDAYGVPVIE
DAAESLGTVYKGKQSGTFGRFGIFSFNGNKIITTSGGGMLVSNDEAAIEKARFLASQAREPAVHYQHSEIGHNYRLSNIL
AGVGIAQLEVLDERVEKRRTIFTRYKNALGHLDGVRFMPEYAAGVSNRWLTTLTLDNGLSPYDIVQRLAEENIEARPLWK
PLHTQPLFDPALFYSHEDTGSVCEDLFKRGICLPSGSNMTEDEQGRVIEVLLHLFHTVEVKKWTASIR
>P71065 2.-.-.-~~~epsO~~~Putative pyruvyl transferase EpsO~~~COG5039
MDSKHSMISLKQKLSGLLDVIPKQSEIIYADYPLYGNVGDLFIMKGTEAFFKEHGIRVRKRWNPDNFPIGRKLDPNLIIV
CQGGGNFGDLYPYYQGFREKIVQTYPNHKIVILPQSIYFQNKDNLKRTAEIFSKHANLHIMTREKASYATAQAYFTTNHI
QLLPDMAHQLFPVIPTQQPSNQKLRFIRTDHEANQALQEHAEAESYDWRTVLSASDRRTIAFLQTLNVLNKKAGNPLPIA
YIWEKYSDYIVQKAIRFFSRYESVETSRLHGHILSSLLQKENTVIDNSYGKNANYFHTWMEGVPSTRLIQHASKKENLPA
HM
>P30845 2.7.-.-~~~eptA~~~Phosphoethanolamine transferase EptA~~~COG2194
MLKRLLKRPSLNLLAWLLLAAFYISICLNIAFFKQVLQALPLDSLHNVLVFLSMPVVAFSVINIVLTLSSFLWLNRPLAC
LFILVGAAAQYFIMTYGIVIDRSMIANIIDTTPAESYALMTPQMLLTLGFSGVLAALIACWIKIKPATSRLRSVLFRGAN
ILVSVLLILLVAALFYKDYASLFRNNKELVKSLSPSNSIVASWSWYSHQRLANLPLVRIGEDAHRNPLMQNEKRKNLTIL
IVGETSRAENFSLNGYPRETNPRLAKDNVVYFPNTASCGTATAVSVPCMFSDMPREHYKEELAQHQEGVLDIIQRAGINV
LWNDNDGGCKGACDRVPHQNVTALNLPDQCINGECYDEVLFHGLEEYINNLQGDGVIVLHTIGSHGPTYYNRYPPQFRKF
TPTCDTNEIQTCTKEQLVNTYDNTLVYVDYIVDKAINLLKEHQDKFTTSLVYLSDHGESLGENGIYLHGLPYAIAPDSQK
QVPMLLWLSEDYQKRYQVDQNCLQKQAQTQHYSQDNLFSTLLGLTGVETKYYQAADDILQTCRRVSE
>P36555 2.7.-.-~~~eptA~~~Phosphoethanolamine transferase EptA~~~
MLKRFLKRPVLGQIAWLLLFSFYIAVCLNIAFYKQVLQDLPLNSLRNVLVFISMPVVAFSVVNSVLTLASFIWLNRLLAC
VFILVGAAAQYFILTYGIIIDRSMIANMMDTTPAETFALMTPQMVLTLGLSGVLAAVIAFWVKIRPATPRLRSGLYRLAS
VLISILLVILVAAFFYKDYASLFRNNKQLIKALSPSNSIVASWSWYSHQRLANLPLVRIGEDAHRNPLMLKGDRKNLTIL
IVGETSRGDDFSLGGYPRDTNPRLAKDDVIYFPHTTSCGTATAISVPCMFSDMPRKHYDEELAHHQEGLLDIIQRAGINV
LWNDNDGGCKGACDRVPHQNVTELNLPGQCIDGECYDEVLFHGLEDYIDHLKGDGVIVLHTIGSHGPTYYNRYPPQFKKF
TPTCDTNEIQNCSQEQLINTYDNTVLYVDYIVDKAINLLKSHQDKFTTSLVYLSDHGESLGENGVYLHGLPYSIAPDTQK
HVPMLIWLSKDYQQRYQVDQACLQKRASTLDYSQDNLFSTMLGLTGVQTTYYQAADDILQPCRRLSE
>P37661 2.7.8.42~~~eptB~~~Kdo(2)-lipid A phosphoethanolamine 7''-transferase~~~COG2194
MRYIKSITQQKLSFLLAIYIGLFMNGAVFYRRFGSYAHDFTVWKGISAVVELAATVLVTFFLLRLLSLFGRRSWRILASL
VVLFSAGASYYMTFLNVVIGYGIIASVMTTDIDLSKEVVGLNFILWLIAVSALPLILIWNNRCRYTLLRQLRTPGQRIRS
LAVVVLAGIMVWAPIRLLDIQQKKVERATGVDLPSYGGVVANSYLPSNWLSALGLYAWARVDESSDNNSLLNPAKKFTYQ
APQNVDDTYVVFIIGETTRWDHMGIFGYERNTTPKLAQEKNLAAFRGYSCDTATKLSLRCMFVRQGGAEDNPQRTLKEQN
IFAVLKQLGFSSDLYAMQSEMWFYSNTMADNIAYREQIGAEPRNRGKPVDDMLLVDEMQQSLGRNPDGKHLIILHTKGSH
FNYTQRYPRSFAQWKPECIGVDSGCTKAQMINSYDNSVTYVDHFISSVIDQVRDKKAIVFYAADHGESINEREHLHGTPR
ELAPPEQFRVPMMVWMSDKYLENPANAQAFAQLKKEADMKVPRRHVELYDTIMGCLGYTSPDGGINENNNWCHIPQAKEA
AAN
>P0CB39 2.7.-.-~~~eptC~~~Phosphoethanolamine transferase EptC~~~COG2194
MHSTEVQAKPLFSWKALGWALLYFWFFSTLLQAIIYISGYSGTNGIRDSLLFSSLWLIPVFLFPKRIKIIAAVIGVVLWA
ASLAALCYYVIYGQEFSQSVLFVMFETNTNEASEYLSQYFSLKIVLIALAYTAVAVLLWTRLRPVYIPKPWRYVVSFALL
YGLILHPIAMNTFIKNKPFEKTLDNLASRMEPAAPWQFLTGYYQYRQQLNSLTKLLNENNALPPLANFKDESGNEPRTLV
LVIGESTQRGRMSLYGYPRETTPELDALHKTDPNLTVFNNVVTSRPYTIEILQQALTFANEKNPDLYLTQPSLMNMMKQA
GYKTFWITNQQTMTARNTMLTVFSRQTDKQYYMNQQRTQSAREYDTNVLKPFQEVLNDPAPKKLIIVHLLGTHIKYKYRY
PENQGKFDGNTDHVPPGLNAEELESYNDYDNANLYNDHVVASLIKDFKAANPNGFLVYFSDHGEEVYDTPPHKTQGRNED
NPTRHMYTIPFLLWTSEKWQATHPRDFSQDVDRKYSLAELIHTWSDLAGLSYDGYDPTRSVVNPQFKETTRWIGNPYKKN
ALIDYDTLPYGDQVGNQ
>P0CB40 2.7.-.-~~~eptC~~~Phosphoethanolamine transferase EptC~~~COG2194
MHSTEVQAKPLFSWKALGWALLYFWFFSTLLQAIIYISGYSGTNGIRDSLLFSSLWLIPVFLFPKRIKIIAAVIGVVLWA
ASLAALCYYVIYGQEFSQSVLFVMFETNTNEASEYLSQYFSLKIVLIALAYTAVAVLLWTRLRPVYIPKPWRYVVSFALL
YGLILHPIAMNTFIKNKPFEKTLDNLASRMEPAAPWQFLTGYYQYRQQLNSLTKLLNENNALPPLANFKDESGNEPRTLV
LVIGESTQRGRMSLYGYPRETTPELDALHKTDPNLTVFNNVVTSRPYTIEILQQALTFANEKNPDLYLTQPSLMNMMKQA
GYKTFWITNQQTMTARNTMLTVFSRQTDKQYYMNQQRTQSAREYDTNVLKPFQEVLKDPAPKKLIIVHLLGTHIKYKYRY
PEDQGKFDGNTEHVPPGLNAEELESYNDYDNANLYNDHVVASLIKDFKATDPNGFLVYFSDHGEEVYDTPPHKTQGRNED
NPTRHMYTIPFLLWTSEKWQATHPRDFSQDVDRKYSLAELIHTWSDLAGLSYDGYDPTRSVVNPQFKETTRWIGNPYKKN
ALIDYDTLPYGDQVGNQ
>O67800 ~~~era~~~GTPase Era~~~COG1159
MKVGYVAIVGKPNVGKSTLLNNLLGTKVSIISPKAGTTRMRVLGVKNIPNEAQIIFLDTPGIYEPKKSDVLGHSMVEIAK
QSLEEADVILFMIDATEGWRPRDEEIYQNFIKPLNKPVIVVINKIDKIGPAKNVLPLIDEIHKKHPELTEIVPISALKGA
NLDELVKTILKYLPEGEPLFPEDMITDLPLRLLAAEIVREKAMMLTREEVPTSIAVKINEIKPGDANPNMLVIKGEIIVD
RENLKPIIIGKKGQRLKEIGKRARQELELILGRPVYLELWVKVVPDWRRRPEYVRLFGYAL
>P42182 ~~~era~~~GTPase Era~~~COG1159
MTNESFKSGFVSIIGRPNVGKSTFLNRVIGQKIAIMSDKPQTTRNKVQGVLTTGTSQTIFIDTPGIHKPKHKLGDFMMKV
AQNTLKEVDLILFMINAEEGYGKGDEFIIEKLQTMSTPVFLIVNKIDKIHPDQLLLLIDEYRKRYPFKEIVPISALEGNN
IETLLAQIEAYLPEGPQFYPSDQVTDHPERFIISELIREKVLHLTREEIPHSIAVAIESIKGQDNGSVHVAATIVVERDS
QKGIVIGKKGSLLKEVGKRARADIEALLGSRVYLELWVKVQKDWRNKMSQLRDFGFKEDEY
>P51836 ~~~era~~~GTPase Era~~~COG1159
MKPTYCGYAAIIGRPNVGKSTLLNQLLEQKISITSRKPQTTRYQILGVKTFKDIQVIYVDTPGLHAGTERTINRYMNRTA
RGALRDVDAIVFVIEPHWESQDAWVLDNLKEIETPVFLVINKVDKIKNRAELLPLIEKVSSLYAFQKITPLSAKTGDQVG
TLEQAVHQLMPESPFYFPPEQVTDRSDQFMASEIIREKLMRLLGQEIPYSLAVTLIEFRKEEKIIRISAVIWVEKKSQKG
IVIGKGGERLKRVGTNARLDMEKWFGKRVFLQLWVKVKSGWADNERLLRELGFEE
>P06616 ~~~era~~~GTPase Era~~~COG1159
MSIDKSYCGFIAIVGRPNVGKSTLLNKLLGQKISITSRKAQTTRHRIVGIHTEGAYQAIYVDTPGLHMEEKRAINRLMNK
AASSSIGDVELVIFVVEGTRWTPDDEMVLNKLREGKAPVILAVNKVDNVQEKADLLPHLQFLASQMNFLDIVPISAETGL
NVDTIAAIVRKHLPEATHHFPEDYITDRSQRFMASEIIREKLMRFLGAELPYSVTVEIERFVSNERGGYDINGLILVERE
GQKKMVIGNKGAKIKTIGIEARKDMQEMFEAPVHLELWVKVKSGWADDERALRSLGYVDDL
>A0R0S7 ~~~era~~~GTPase Era~~~COG1159
MTEFRSGFVCFVGRPNTGKSTLTNALVGQKVAITSNRPQTTRHTIRGIVHREDFQIILVDTPGLHRPRTLLGQRLNDLVK
DTYSEVDVIGMCIPADEAIGPGDRWIYQQIRAVAPRTTLIGIVTKIDKVPKDRVAAQLLAVSELMGPDAEIVPVSATSGE
QLDVLTNVLVSQLPPGPAYYPDGELTDEPEEVLMAELIREAALEGVRDELPHSLAVVIDEVSQREDRDDLIDVHAILYVE
RDSQKGIVIGKGGARLREVGTAARKQIEKLLGTKVYLDLRVKIAKNWQRDPKQLGKLGF
>A5U557 ~~~era~~~GTPase Era~~~COG1159
MTEFHSGFVCLVGRPNTGKSTLTNALVGAKVAITSTRPQTTRHAIRGIVHSDDFQIILVDTPGLHRPRTLLGKRLNDLVR
ETYAAVDVIGLCIPADEAIGPGDRWIVEQLRSTGPANTTLVVIVTKIDKVPKEKVVAQLVAVSELVTNAAEIVPVSAMTG
DRVDLLIDVLAAALPAGPAYYPDGELTDEPEEVLMAELIREAALQGVRDELPHSLAVVIDEVSPREGRDDLIDVHAALYV
ERDSQKGIVIGKGGARLREVGTAARSQIENLLGTKVYLDLRVKVAKNWQRDPKQLGRLGF
>P9WNK9 ~~~era~~~GTPase Era~~~COG1159
MTEFHSGFVCLVGRPNTGKSTLTNALVGAKVAITSTRPQTTRHAIRGIVHSDDFQIILVDTPGLHRPRTLLGKRLNDLVR
ETYAAVDVIGLCIPADEAIGPGDRWIVEQLRSTGPANTTLVVIVTKIDKVPKEKVVAQLVAVSELVTNAAEIVPVSAMTG
DRVDLLIDVLAAALPAGPAYYPDGELTDEPEEVLMAELIREAALQGVRDELPHSLAVVIDEVSPREGRDDLIDVHAALYV
ERDSQKGIVIGKGGARLREVGTAARSQIENLLGTKVYLDLRVKVAKNWQRDPKQLGRLGF
>P64085 ~~~era~~~GTPase Era~~~
MTEHKSGFVSIIGRPNVGKSTFVNRVIGHKIAIMSDKAQTTRNKIQGVMTRDDAQIIFIDTPGIHKPKHKLGDYMMKVAK
NTLSEIDAIMFMVNANEEIGRGDEYIIEMLKNVKTPVFLVLNKIDLVHPDELMPKIEEYQSYMDFTEIVPISALEGLNVD
HFIDVLKTYLPEGPKYYPDDQISDHPEQFVVGEIIREKILHLTSEEIPHAIGVNVDRMVKESEDRVHIEATIYVERDSQK
GIVIGKGGKKLKEVGKRARRDIEMLLGSKVYLELWVKVQRDWRNKVNFIRQIGYVEDQD
>P37214 ~~~era~~~GTPase Era~~~COG1159
MSFKSGFVAILGRPNVGKSTFLNHVMGQKIAIMSDKAQTTRNKIMGIYTTDKEQIVFIDTPGIHKPKTALGDFMVESAYS
TLREVDTVLFMVPADEKRGKGDNMIIERLKAAKVPVILVINKIDKVHPNQLLEQIDDFRNQMDFQEIVPISALQGNNVSH
LVDLLVDHLEEGFQYFPADQITDHPERFLVSEMIREKVLLLTREEIPHSVAVVIDSMARDEETHKIHIRATIMVERDSQK
GIIIGKKGAMLKKIGQMARRDIELMLGDKVYLETWVKVKKNWRDKKLDLADFGYNKKEY
>P0A3C4 ~~~era~~~GTPase Era~~~COG1159
MTFKSGFVAILGRPNVGKSTFLNHVMGQKIAIMSDKAQTTRNKIMGIYTTDKEQIVFIDTPGIHKPKTALGDFMVESAYS
TLREVDTVLFMVPADEARGKGDDMIIERLKAAKVPVILVVNKIDKVHPDQLLSQIDDFRNQMDFKEIVPISALQGNNVSR
LVDILSENLDEGFQYFPSDQITDHPERFLVSEMVREKVLHLTREEIPHSVAVVVDSMKRDEETDKVHIRATIMVERDSQK
GIIIGKGGAMLKKIGSMARRDIELMLGDKVFLETWVKVKKNWRDKKLDLADFGYNEREY
>Q5SM23 ~~~era~~~GTPase Era~~~COG1159
MAEKTYSGFVAIVGKPNVGKSTLLNNLLGVKVAPISPRPQTTRKRLRGILTEGRRQIVFVDTPGLHKPMDALGEFMDQEV
YEALADVNAVVWVVDLRHPPTPEDELVARALKPLVGKVPILLVGNKLDAAKYPEEAMKAYHELLPEAEPRMLSALDERQV
AELKADLLALMPEGPFFYPEDYAKSDQTFGEWVAEILREEAMKRLWHEVPYAVATKVEEVAERENGVLYIKAILYVERPS
QKAIVIGEGGRKIKEIGQATRKQLEALLGKKVYLDLEVKVYPDWRKDPEALRELGYRSSVG
>A9FZ87 4.2.3.164~~~geoA~~~(+)-eremophilene synthase~~~COG0664
MSSDRTSVVVSKRDAGGFEYPFAASCHPGREVTEQRTLAWVRRLRLVPDGRSLSRLKATNFSHLAAWLLPSASTQTLQLA
SDFTAVLFLLDDAYDEGQLSTDPESVEWLNEKYLGELFGYTEADMSDPLTRGMLDVRERIRRSHPHFFLNRWLSHFQYYY
EANLWEANNRKQMRVPHLEEYLMMRRYSGAVYTYCDLLELLLERPLPLEVVQHPLIQTVRDICNDILCWTNDYFSLGKEL
TNGETHNLIVVLRNECVSTLEEAIDRLKDMHDRRVAEYQGVKEKVLALWADDEIRLYLDAVEAMIAGNQRWALEAGRYSG
LESLIVRAG
>P39176 2.-.-.-~~~erfK~~~Probable L,D-transpeptidase ErfK/SrfK~~~COG1376
MRRVNILCSFALLFASHTSLAVTYPLPPEGSRLVGQSFTVTVPDHNTQPLETFAAQYGQGLSNMLEANPGADVFLPKSGS
QLTIPQQLILPDTVRKGIVVNVAEMRLYYYPPDSNTVEVFPIGIGQAGRETPRNWVTTVERKQEAPTWTPTPNTRREYAK
RGESLPAFVPAGPDNPMGLYAIYIGRLYAIHGTNANFGIGLRVSQGCIRLRNDDIKYLFDNVPVGTRVQIIDQPVKYTTE
PDGSNWLEVHEPLSRNRAEYESDRKVPLPVTPSLRAFINGQEVDVNRANAALQRRSGMPVQISSGSRQMF
>G4SW86 1.3.1.70~~~erg~~~Delta(14)-sterol reductase~~~
MSEQESRDNAAVDAVRQKYGFGFSWLVLMIALPPLVYYLWICVTYYQGELVFTSDAAAWRRFWSHVAPPTWHAAGLYAAW
FLGQAALQVWAPGPTVQGMKLPDGSRLDYRMNGIFSFLFTLAVVFGLVTMGWLDATVLYDQLGPLLTVVNIFTFVFAGFL
YFWGLNGKQWERPTGRPFYDYFMGTALNPRIGSLDLKLFCEARPGMIFWLLMNLSMAAKQYELHGTVTVPMLLVVGFQSF
YLIDYFIHEEAVLTTWDIKHEKFGWMLCWGDLVWLPFTYTLQAQYLVHHTHDLPVWGIIAIVALNLAGYAIFRGANIQKH
HFRRDPNRIVWGKPAKYIKTKQGSLLLTSGWWGIARHMNYFGDLMIALSWCLPAAFGSPIPYFHIVYFTILLLHREKRDD
AMCLAKYGEDWLQYRKKVPWRIVPKIY
>Q87WD2 ~~~eriC~~~Chloride/fluoride channel protein~~~COG0038
MAGQMSKFRRPEQLDLLPYIAKWLALAGLVALLAGSASALFLLSLDHATQWRETHPWVIWLLPVAGFAVGLAYHLIGKPV
DAGNNLIIDEIHDPKKIVPLRMVPMVLIGTVVSHLFGASVGREGTAVQMGGALADQLTHVFRLRREDRRVILMAGISAGF
ASVFGTPLAGALFGLEVLAIGRMRYDALFPCVVAAIVADQVGQAWGVVHTHYVIGEVVPVQLWSVMAVVAAGIVFGLTGL
LFATATHKLGAFVKRLITYSPLRPFAGGLLIAVAVWALGSNHYIDVDKYIGLGIPSIVQSFQMPMAPWDWLGKMVFTVVS
LGTGFKGGEVTPLFYIGATLGNALAPLLHLPFGMLAGIGFVAVFAGAANTPLATIVMAMELFGPEIAPLAAIACIASYLV
SGHTGIYHAQRVGHSKHHRPLPEEIRLSDIKQFHAQSESASERKVTLAGEEK
>P0A0H2 2.1.1.184~~~ermA1~~~rRNA adenine N-6-methyltransferase~~~
MNQKNPKDTQNFITSKKHVKEILNHTNISKQDNVIEIGSGKGHFTKELVKMSRSVTAIEIDGGLCQVTKEAVNPSENIKV
IQTDILKFSFPKHINYKIYGNIPYNISTDIVKRITFESQAKYSYLIVEKGFAKRLQNLQRALGLLLMVEMDIKMLKKVPP
LYFHPKPSVDSVLIVLERHQPLISKKDYKKYRSFVYKWVNREYRVLFTKNQFRQALKHANVTNINKLSKEQFLSIFNSYK
LFH
>P02979 2.1.1.184~~~ermC~~~rRNA adenine N-6-methyltransferase~~~
MNEKNIKHSQNFITSKHNIDKIMTNIRLNEHDNIFEIGSGKGHFTLELVKRCNFVTAIEIDHKLCKTTENKLVDHDNFQV
LNKDILQFKFPKNQSYKIYGNIPYNISTDIIRKIVFDSIANEIYLIVEYGFAKRLLNTKRSLALLLMAEVDISILSMVPR
EYFHPKPKVNSSLIRLSRKKSRISHKDKQKYNYFVMKWVNKEYKKIFTKNQFNNSLKHAGIDDLNNISFEQFLSLFNSYK
LFNK
>Q03986 2.1.1.-~~~ermD~~~rRNA adenine N-6-methyltransferase~~~
MKKKNHKYRGKKLNRGESPNFSGQHLMHNKKLIEEIVDRANISIDDTVLELGAGKGALTTVLSQKAGKVLAVENDSKFVD
ILTRKTAQHSNTKIIHQDIMKIHLPKEKFVVVSNIPYAITTPIMKMLLNNPASGFQKGIIVMEKGAAKRFTSKFIKNSYV
LAWRMWFDIGIVREISKEHFSPPPKVDSAMVRITRKKDAPLSHKHYIAFRGLAEYALKEPNIPLCVALRGIFTPRQMKHL
RKSLKINNEKTVGTLTENQWAVIFNTMTQYVMHHKWPRANKRKPGEI
>P07287 2.1.1.184~~~ermE~~~rRNA adenine N-6-methyltransferase~~~COG0030
MSSSDEQPRPRRRNQDRQHPNQNRPVLGRTERDRNRRQFGQNFLRDRKTIARIAETAELRPDLPVLEAGPGEGLLTRELA
DRARQVTSYEIDPRLAKSLREKLSGHPNIEVVNADFLTAEPPPEPFAFVGAIPYGITSAIVDWCLEAPTIETATMVTQLE
FARKRTGDYGRWSRLTVMTWPLFEWEFVEKVDRRLFKPVPKVDSAIMRLRRRAEPLLEGAALERYESMVELCFTGVGGNI
QASLLRKYPRRRVEAALDHAGVGGGAVVAYVRPEQWLRLFERLDQKNEPRGGQPQRGRRTGGRDHGDRRTGGQDRGDRRT
GGRDHRDRQASGHGDRRSSGRNRDDGRTGEREQGDQGGRRGPSGGGRTGGRPGRRGGPGQR
>P13956 2.1.1.184~~~ermC'~~~rRNA adenine N-6-methyltransferase~~~
MNEKNIKHSQNFITSKHNIDKIMTNIRLNEHDNIFEIGSGKGHFTLELVQRCNFVTAIEIDHKLCKTTENKLVDHDNFQV
LNKDILQFKFPKNQSYKIFGNIPYNISTDIIRKIVFDSIADEIYLIVEYGFAKRLLNTKRSLALFLMAEVDISILSMVPR
EYFHPKPKVNSSLIRLNRKKSRISHKDKQKYNYFVMKWVNKEYKKIFTKNQFNNSLKHAGIDDLNNISFEQFLSLFNSYK
LFNK
>P21236 2.1.1.184~~~erm~~~rRNA adenine N-6-methyltransferase~~~
MNKNIKYSQNFLTSEKVLNQIIKQLNLKETDTVYEIGTGKGHLTTKLAKISKQVTSIELDSHLFNLSSEKLKLNIRVTLI
HQDILQFQFPNKQRYKIVGSIPYHLSTQIIKKVVFESHASDIYLIVEEGFYKRTLDIHRTLGLLLHTQVSIQQLLKLPAE
CFHPKPKVNSVLIKLTRHTTDVPDKYWKLYTYFVSKWVNREYRQLFTKNQFHQAMKHAKVNNLSTITYEQVLSIFNSYLL
FNGRK
>P0ACC3 ~~~erpA~~~Iron-sulfur cluster insertion protein ErpA~~~COG0316
MSDDVALPLEFTDAAANKVKSLIADEDNPNLKLRVYITGGGCSGFQYGFTFDDQVNEGDMTIEKQGVGLVVDPMSLQYLV
GGSVDYTEGLEGSRFIVTNPNAKSTCGCGSSFSI
>P45344 ~~~erpA~~~Iron-sulfur cluster insertion protein ErpA~~~COG0316
MIDDMAVPLTFTDAAANKVKSLISEEENTDLKLRVYITGGGCSGFQYGFTFDEKVNDGDLTIEKSGVQLVIDPMSLQYLI
GGTVDYTEGLEGSRFTVNNPNATSTCGCGSSFSI
>P9WIQ7 ~~~erp~~~Exported repetitive protein~~~
MPNRRRRKLSTAMSAVAALAVASPCAYFLVYESTETTERPEHHEFKQAAVLTDLPGELMSALSQGLSQFGINIPPVPSLT
GSGDASTGLTGPGLTSPGLTSPGLTSPGLTDPALTSPGLTPTLPGSLAAPGTTLAPTPGVGANPALTNPALTSPTGATPG
LTSPTGLDPALGGANEIPITTPVGLDPGADGTYPILGDPTLGTIPSSPATTSTGGGGLVNDVMQVANELGASQAIDLLKG
VLMPSIMQAVQNGGAAAPAASPPVPPIPAAAAVPPTDPITVPVA
>Q03131 2.3.1.94~~~eryA~~~6-deoxyerythronolide-B synthase EryA1, modules 1 and 2~~~
MSGPRSRTTSRRTPVRIGAVVVASSTSELLDGLAAVADGRPHASVVRGVARPSAPVVFVFPGQGAQWAGMAGELLGESRV
FAAAMDACARAFEPVTDWTLAQVLDSPEQSRRVEVVQPALFAVQTSLAALWRSFGVTPDAVVGHSIGELAAAHVCGAAGA
ADAARAAALWSREMIPLVGNGDMAAVALSADEIEPRIARWDDDVVLAGVNGPRSVLLTGSPEPVARRVQELSAEGVRAQV
INVSMAAHSAQVDDIAEGMRSALAWFAPGGSEVPFYASLTGGAVDTRELVADYWRRSFRLPVRFDEAIRSALEVGPGTFV
EASPHPVLAAALQQTLDAEGSSAAVVPTLQRGQGGMRRFLLAAAQAFTGGVAVDWTAAYDDVGPNPALCRSSRRPRRKTS
RPSPASTGTRHRTCCERLLAVVNGETAALAGREADAEATFRELGLDSVLAAQLRAKVSAAIGREVNIALLYDHPTPRALA
EALAAGTEVAQRETRARTNEAAPGEPVAVVAMACRLPGGVSTPEEFWELLSEGRDAVAGLPTDRGWDLDSLFHPDPTRSG
TAHQRGGGFLTEATAFDPAFFGMSPREALAVDPQQRLMLELSWEVLERAGIPPTSLQASPTGVFVGLIPQEYGPRLAEGG
EGVEGYLMTGTTTSVASGRIAYTLGLEGPAISVDTACSSSLVAVHLACQSLRRGESSLAMAGGVTVMPTPGMLVDFSRMN
SLAPDGRCKAFSAGANGFGMAEGAGMLLLERLSDARRNGHPVLAVLRGTAVNSDGASNGLSAPNGRAQVRVIQQALAESG
LGPADIDAVEAHGTGTRLGDPIEARALFEAYGRDREQPLHLGSVKSNLGHTQAAAGVAGVIKMVLAMRAGTLPRTLHASE
RSKEIDWSSGAISLLDEPEPWPAGARPRRAGVSSFGISGTNAHAIIEEAPQVVEGERVEAGDVVAPWVLSASSAEGLRAQ
AARLAAHLREHPGQDPRDIAYSLATGRAALPHRAAFAPVDESAALRVLDGLATGNADGAAVGTSRAQQRAVFVFPGQGWQ
WAGMAVDLLDTSPVFAAALRECADALEPHLDFEVIPFLRAEAARREQDAALSTERVDVVQPVMFAVMVSLASMWRAHGVE
PAAVIGHSQGEIAAACVAGALSLDDAARVVALRSRVIATMPGNKGMASIAAPAGEVRARIGDRVEIAAVNGPRSVVVAGD
SDELDRLVASCTTECIRAKRLAVDYASHSSHVETIRDALHAELGEDFHPLPGFVPFFSTVTGRWTQPDELDAGYWYRNLR
RTVRFADAVRALAEQGYRTFLEVSAHPILTAAIEEIGDGSGADLSAIHSLRRGDGSLADFGEALSRAFAAGVAVDWESVH
LGTGARRVPLPTYPFQRERVWLEPKPVARRSTEVDEVSALRYRIEWRPTGAGEPARLDGTWLVAKYAGTADETSTAAREA
LESAGARVRELVVDARCGRDELAERLRSVGEVAGVLSLLAVDEAEPEEAPLALASLADTLSLVQAMVSAELGCPLWTVTE
SAVATGPFERVRNAAHGALWGVGRVIALENPAVWGGLVDVPAGSVAELARHLAAVVSGGAGEDQLALRADGVYGRRWVRA
AAPATDDEWKPTGTVLVTGGTGGVGGQIARWLARRGAPHLLLVSRSGPDADGAGELVAELEALGARTTVAACDVTDRESV
RELLGGIGDDVPLSAVFHAAATLDDGTVDTLTGERIERASRAKVLGARNLHELTRELDLTAFVLFSSFASAFGAPGLGGY
APGNAYLDGLAQQRRSDGLPATAVAWGTWAGSGMAEGAVADRFRRHGVIEMPPETACRALQNALDRAEVCPIVIDVRWDR
FLLAYTAQRPTRLFDEIDDARRAAPQAPAEPRVGALASLPAPEREEALFELVRSHAAAVLGHASAERVPADQAFAELGVD
SLSALELRNRLGAATGVRLPTTTVFDHPDVRTLAAHLAAELGGATGAEQAAPATTAPVDEPIAIVGMACRLPGEVDSPER
LWELITSGRDSAAEVPDDRGWVPDELMASDAAGTRAHGNFMAGAGDFDAAFFGISPREALAMDPQQRQALETTWEALESA
GIPPETLRGSDTGVFVGMSHQGYATGRPRPEDGVDGYLLTGNTASVASGRIAYVLGLEGPALTVDTACSSSLVALHTACG
SLRDGDCGLAVAGGVSVMAGPEVFTEFSRQGALSPDGRCKPFSDEADGFGLGEGSAFVVLQRLSDARREGRRVLGVVAGS
AVNQDGASNGLSAPSGVAQQRVIRRAWARAGITGADVAVVEAHGTGTRLGDPVEASALLATYGKSRGSSGPVLLGSVKSN
IGHAQAAAGVAGVIKVLLGLERGVVPPMLCRGERSGLIDWSSGEIELADGVREWSPAADGVRRAGVSAFGVSGTNAHVII
AEPPEPEPVPQPRRMLPATGVVPVVLSARTGAALRAQAGRLADHLAAHPGIAPADVSWTMARARQHFEERAAVLAADTAE
AVHRLRAVADGAVVPGVVTGSASDGGSVFVFPGQGAQWEGMARELLPVPVFAESIAECDAVLSEVAGFSVSEVLEPRPDA
PSLERVDVVQPVLFAVMVSLARLWRACGAVPSAVIGHSQGEIAAAVVAGALSLEDGMRVVARRSRAVRAVAGRGSMLSVR
GGRSDVEKLLADDSWTGRLEVAAVNGPDAVVVAGDAQAAREFLEYCEGVGIRARAIPVDYASHTAHVEPVRDELVQALAG
ITPRRAEVPFFSTLTGDFLDGTELDAGYWYRNLRHPVEFHSAVQALTDQGYATFIEVSPHPVLASSVQETLDDAESDAAV
LGTLERDAGDADRFLTALADAHTRGVAVDWEAVLGRAGLVDLPGYPFQGKRFWLLPDRTTPRDELDGWFYRVDWTEVPRS
EPAALRGRWLVVVPEGHEEDGWTVEVRSALAEAGAEPEVTRGVGGLVGDCAGVVSLLALEGDGAVQTLVLVRELDAEGID
APLWTVTFGAVDAGSPVARPDQAKLWGLGQVASLERGPRWTGLVDLPHMPDPELRGRLTAVLAGSEDQVAVRADAVRARR
LSPAHVTATSEYAVPGGTILVTGGTAGLGAEVARWLAGRGAEHLALVSRRGPDTEGVGDLTAELTRLGARVSVHACDVSS
REPVRELVHGLIEQGDVVRGVVHAAGLPQQVAINDMDEAAFDEVVAAKAGGAVHLDELCSDAELFLLFSSGAGVWGSARQ
GAYAAGNAFLDAFARHRRGRGLPATSVAWGLWAAGGMTGDEEAVSFLRERGVRAMPVPRALAALDRVLASGETAVVVTDV
DWPAFAESYTAARPRPLLDRIVTTAPSERAGEPETESLRDRLAGLPRAERTAELVRLVRTSTATVLGHDDPKAVRATTPF
KELGFDSLAAVRLRNLLNAATGLRLPSTLVFDHPNASAVAGFLDAELGTEVRGEAPSALAGLDALEGALPEVPATEREEL
VQRLERMLAALRPVAQAADASGTGANPSGDDLGEAGVDELLEALGRELDGD
>Q03132 2.3.1.94~~~eryA~~~6-deoxyerythronolide-B synthase EryA2, modules 3 and 4~~~
MTDSEKVAEYLRRATLDLRAARQRIRELESDPIAIVSMACRLPGGVNTPQRLWELLREGGETLSGFPTDRGWDLARLHHP
DPDNPGTSYVDKGGFLDDAAGFDAEFFGVSPREAAAMDPQQRLLLETSWELVENAGIDPHSLRGTATGVFLGVAKFGYGE
DTAAAEDVEGYSVTGVAPAVASGRISYTMGLEGPSISVDTACSSSLVALHLAVESLRKGESSMAVVGGAAVMATPGVFVD
FSRQRALAADGRSKAFGAGADGFGFSEGVTLVLLERLSEARRNGHEVLAVVRGSALNQDGASNGLSAPSGPAQRRVIRQA
LESCGLEPGDVDAVEAHGTGTALGDPIEANALLDTYGRDRDADRPLWLGSVKSNIGHTQAAAGVTGLLKVVLALRNGELP
ATLHVEEPTPHVDWSSGGVALLAGNQPWRRGERTRRARVSAFGISGTNAHVIVEEAPEREHRETTAHDGRPVPLVVSART
TAALRAQAAQIAELLERPDADLAGVGLGLATTRARHEHRAAVVASTREEAVRGLREIAAGAATADAVVEGVTEVDGRNVV
FLFPGQGSQWAGMGAELLSSSPVFAGKIRACDESMAPMQDWKVSDVLRQAPGAPGLDRVDVVQPVLFAVMVSLAELWRSY
GVEPAAVVGHSQGEIAAAHVAGALTLEDAAKLVVGRSRLMRSLSGEGGMAAVALGEAAVRERLRPWQDRLSVAAVNGPRS
VVVSGEPGALRAFSEDCAAEGIRVRDIDVDYASHSPQIERVREELLETTGDIAPRPARVTFHSTVESRSMDGTELDARYW
YRNLRETVRFADAVTRLAESGYDAFIEVSPHPVVVQAVEEAVEEADGAEDAVVVGSLHRDGGDLSAFLRSMATAHVSGVD
IRWDVALPGAAPFALPTYPFQRKRYWLQPAAPAAASDELAYRVSWTPIEKPESGNLDGDWLVVTPLISPEWTEMLCEAIN
ANGGRALRCEVDTSASRTEMAQAVAQAGTGFRGVLSLLSSDESACRPGVPAGAVGLLTLVQALGDAGVDAPVWCLTQGAV
RTPADDDLARPAQTTAHGFAQVAGLELPGRWGGVVDLPESVDDAALRLLVAVLRGGGRAEDHLAVRDGRLHGRRVVRASL
PQSGSRSWTPHGTVLVTGAASPVGDQLVRWLADRGAERLVLAGACPGDDLLAAVEEAGASAVVCAQDAAALREALGDEPV
TALVHAGTLTNFGSISEVAPEEFAETIAAKTALLAVLDEVLGDRAVEREVYCSSVAGIWGGAGMAAYAAGSAYLDALAEH
HRARGRSCTSVAWTPWALPGGAVDDGYLRERGLRSLSADRAMRTWERVLAAGPVSVAVADVDWPVLSEGFAATRPTALFA
ELAGRGGQAEAEPDSGPTGEPAQRLAGLSPDEQQENLLELVANAVAEVLGHESAAEINVRRAFSELGLDSLNAMALRKRL
SASTGLRLPASLVFDHPTVTALAQHLRARLVGDADQAAVRVVGAADESEPIAIVGIGCRFPGGIGSPEQLWRVLAEGANL
TTGFPADRGWDIGRLYHPDPDNPGTSYVDKGGFLTDAADFDPGFFGITPREALAMDPQQRLMLETAWEAVERAGIDPDAL
RGTDTGVFVGMNGQSYMQLLAGEAERVDGYQGLGNSASVLSGRIAYTFGWEGPALTVDTACSSSLVGIHLAMQALRRGEC
SLALAGGVTVMSDPYTFVDFSTQRGLASDGRCKAFSARADGFALSEGVAALVLEPLSRARANGHQVLAVLRGSAVNQDGA
SNGLAAPNGPSQERVIRQALAASGVPAADVDVVEAHGTGTELGDPIEAGALIATYGQDRDRPLRLGSVKTNIGHTQAAAG
AAGVIKVVLAMRHGMLPRSLHADELSPHIDWESGAVEVLREEVPWPAGERPRRAGVSSFGVSGTNAHVIVEEAPAEQEAA
RTERGPLPFVLSGRSEAVVAAQARALAEHLRDTPELGLTDAAWTLATGRARFDVRAAVLGDDRAGVCAELDALAEGRPSA
DAVAPVTSAPRKPVLVFPGQGAQWVGMARDLLESSEVFAESMSRCAEALSPHTDWKLLDVVRGDGGPDPHERVDVLQPVL
FSIMVSLAELWRAHGVTPAAVVGHSQGEIAAAHVAGALSLEAAAKVVALRSQVLRELDDQGGMVSVGASRDELETVLARW
DGRVAVAAVNGPGTSVVAGPTAELDEFFAEAEAREMKPRRIAVRYASHSPEVARIEDRLAAELGTITAVRGSVPLHSTVT
GEVIDTSAMDASYWYRNLRRPVLFEQAVRGLVEQGFDTFVEVSPHPVLLMAVEETAEHAGAEVTCVPTLRREQSGPHEFL
RNLLRAHVHGVGADLRPAVAGGRPAELPTYPFEHQRFWPRPHRPADVSALGVRGAEHPLLLAAVDVPGHGGAVFTGRLST
DEQPWLAEHVVGGRTLVPGSVLVDLALAAGEDVGLPVLEELVLQRPLVLAGAGALLRMSVGAPDESGRRTIDVHAAEDVA
DLADAQWSQHATGTLAQGVAAGPRDTEQWPPEDAVRIPLDDHYDGLAEQGYEYGPSFQALRAAWRKDDSVYAEVSIAADE
EGYAFHPVLLDAVAQTLSLGALGEPGGGKLPFAWNTVTLHASGATSVRVVATPAGADAMALRVTDPAGHLVATVDSLVVR
STGEKWEQPEPRGGEGELHALDWGRLAEPGSTGRVVAADASDLDAVLRSGEPEPDAVLVRYEPEGDDPRAAARHGVLWAA
ALVRRWLEQEELPGATLVIATSGAVTVSDDDSVPEPGAAAMWGVIRCAQAESPDRFVLLDTDAEPGMLPAVPDNPQLALR
GDDVFVPRLSPLAPSALTLPAGTQRLVPGDGAIDSVAFEPAPDVEQPLRAGEVRVDVRATGVNFRDVLLALGMYPQKADM
GTEAAGVVTAVGPDVDAFAPGDRVLGLFQGAFAPIAVTDHRLLARVPDGWSDADAAAVPIAYTTAHYALHDLAGLRAGQS
VLIHAAAGGVGMAAVALARRAGAEVLATAGPAKHGTLRALGLDDEHIASSRETGFARKFRERTGGRGVDVVLNSLTGELL
DESADLLAEDGVFVEMGKTDLRDAGDFRGRYAPFDLGEAGDDRLGEILREVVGLLGAGELDRLPVSAWELGSAPAALQHM
SRGRHVGKLVLTQPAPVDPDGTVLITGGTGTLGRLLARHLVTEHGVRHLLLVSRRGADAPGSDELRAEIEDLGASAEIAA
CDTADRDALSALLDGLPRPLTGVVHAAGVLADGLVTSIDEPAVEQVLRAKVDAAWNLHELTANTGLSFFVLFSSAASVLA
GPGQGVYAAANESLNALAALRRTRGLPAKALGWGLWAQASEMTSGLGDRIARTGVAALPTERALALFDSALRRGGEVVFP
LSINRSALRRAEFVPEVLRGMVRAKLRAAGQAEAAGPNVVDRLAGRSESDQVAGLAELVRSHAAAVSGYGSADQLPERKA
FKDLGFDSLAAVELRNRLGTATGVRLPSTLVFDHPTPLAVAEHLRDRLFAASPAVDIGDRLDELEKALEALSAEDGHDDV
GQRLESLLRRWNSRRADAPSTSAISEDASDDELFSMLDQRFGGGEDL
>Q03133 2.3.1.94~~~eryA~~~6-deoxyerythronolide-B synthase EryA3, modules 5 and 6~~~
MSGDNGMTEEKLRRYLKRTVTELDSVTARLREVEHRAGEPIAIVGMACRFPGDVDSPESFWEFVSGGGDAIAEAPADRGW
EPDPDARLGGMLAAAGDFDAGFFGISPREALAMDPQQRIMLEISWEALERAGHDPVSLRGSATGVFTGVGTVDYGPRPDE
APDEVLGYVGTGTASSVASGRVAYCLGLEGPAMTVDTACSSGLTALHLAMESLRRDECGLALAGGVTVMSSPGAFTEFRS
QGGLAADGRCKPFSKAADGFGLAEGAGVLVLQRLSAARREGRPVLAVLAGSAVNQDGASNGLTAPSGPAQQRVIRRALEN
AGVRAGDVDYVEAHGTGTRLGDPIEVHALLSTYGAERDPDDPLWIGSVKSNIGHTQAAAGVAGVMKAVLALRHGEMPRTL
HFDEPSPQIEWDLGAVSVVSQARSWPAGERPRRAGVSSFGISGTNAHVIVEEAPEADEPEPAPDSGPVPLVLSGRDEQAM
RAQAGRLADHLAPEPRNSLRDTGFTLATRASAMEHRAVVVGDRDEALAGLRAVADRRIADRTATGQGPNSPRRVAMVFPG
QGAQWQGMARDLLRESQVFADSIRDCERALAPHVDWSLTDLLSGARPLDRVDVVQPALFAVMVSLAALWRSHGVEPAAVV
GHSQGEIAAAHVAGALTLEDAAKLVAVRSRVLRRLGGQGGMASFGLGTEQAAERIGRFAGALSIASVNGPRSVVVVAGES
GPLDELIAECEAEAHKARRIPVDYASHSPQVESLREELLTELAGISPVSADVALYSTTTGQPIDTATMDTAYWYANLREQ
VRFQDATRQLAEAGFDAFVEVSPHPVLTVGIEATLDSALPADAGACVVGTLRRDRGGLADFHTALGEAYAQGVEVDWSPA
FADARPVELPVYPFQRYWLPIPTGGRARDEDDDWRYQVVWREAEWESASLAGRVLLVTGPGVPSELSDAIRSGLEQSGAT
VLTCDVESRSTIGTALEAADTDALSTVGVAAVPHGEAVDPSLDALALVQALGAAGVEAPLWVLTRNAVQVADGELVDPAQ
AMVGGLGRVVGIEQPGRWGGLVDLVDADAASIRSLAAVLADPRGEEQVAIRADGIKVARLVPAPARARTHPLEPLAGTVL
VTGGTGGIGAHLARWLARSGAEHLVLLGRRGADAPGASELREELTALGTGVTIAACDVADRARLEAVLAAEAAAEGRTVS
AVMHAAGVSTSTPLDDLTEAEFTEIADVKVRGTVNLDELCPDLDAFVLFSSNAGVWGSPGLASYAAANAFLDGFARAARS
EGAPVTSIAWGLWAGQNMAGDEGGEYLRSQGLRAMDPDRAVEELHITLDHGQTSVSVVDMDRRRFVELFTAARHRPLFDE
IAGARAEARQSEEGPALAQRLAALLCDGREREHLAHLIRAEVAAVLGHGDDAAIDRDRAFRDLGFDSMTAVDLRNRLAAV
TGVREAATVVFDHPTITRLADHYLERLVGAAEAEQAPALVREVPPKDADDPIAIVGMACRFPGGVHNPGELWEFIVGGGD
AVTEMPTDRGWDLDALFDPDPQRHGTSYSRHGAFLDGAADFDAAFFGISPREALAMDPQQRQVLETTWELFENAGIDPHS
VRGSDTGVFLGAAYQGYGQDAVVPEDSEGYLLTGNSSAVVSGRVAYVLGLEGPAVTVDTACSSSLVALHSACGSLRDGDC
GLAVAGGVSVMAGPEVFTEFSRQGGLAVDGRCKAFSAEADGFGLPEGVAVVQLQRLSDGPAEGGRQVLGVVAGSAINQDG
ATNGLAAPSGVAQQRVIRKAWARAGITGADVAVVEAHGTGTRLGDPVEASALLATYGKSRGSSGPVLLGSVKSNIGHAQA
AAGVAGVIKVVLGLNRGLVPPMLCRGERSPLIEWSSGGVELAEAVSPWPPAADGVRRAGVSAFGVSGTNAHVIIAEPPEP
EPLPEPGPVGVLAAANSVPVLLSARTETALAAQARLLESAVDDSVPLTALASALATGRAHLPRRAALLAGDHEQLRGQLR
AVAEGVAAPGATTGTASAGGVVFVFPGQGAQWEGMARGLLSVPVFAESIAECDAVLSEVAGFSASEVLEQRPDAPSLERV
DVVQPVLFSVMVSLARLWGACGVSPSAVIGHSQGEIAAAVVAGVLSLEDGVRVVALRAKALRALAGKGGMVSLAAPGERA
RALIAPWEDRISVAAVNSPSSVVVSGDPEALAELVARCEDEGVRAKTLPVDYASHSRHVEEIRETILADLDGISARRAAI
PLYSTLHGERRDMGPRYWYDNLRSQVRFDEAVSAQSPDGHATFVEMSPHPVLTAAVQEIAADAVAIGSLHRDTAEEHLIA
ELARAHVHGVAVDWRNVFPAAPPVALPNYPFEPQRYWLAPEVSDQLADSRYRVDWRPLATTPVDLEGGFLVHGSAPESLT
SAVEKAGGVVPVASADREALAAALREVPGEVAGVLSVHTGAANALALHQSLGEAGVRAPLWLVTSRAVALGESEPVDPEQ
AMVWGLGRVMGLETPERWGGLVDLPAEPAPGDGEAFVACLGADGHEDQVAIRDHARYGRRLVRAPLGTRESSWEPAGTAL
VTGGTGALGGHVARHLARCGVEDLVLVSRRGVDAPAAAELEAELVALGPKTTITACDVADREQLSKLLEELRGQGRPVRT
VVHTAGVPESRPLHEIGELESVCAAKVTGARLLDELCPDAETFVLFSSGAGVWGSANLGAYSAANAYLDALAHRRRAEGR
AATSVAWGAWAGEGMATGDLEGLTRRGLRPMAPDRAIRALHQALDNGDTCVSIADVDWEAFAVGFTAARPRPLLDELVTP
AVGAVPAVQAAPAREMTSQELLEFTHSHVAAILGHSSPDAVGQDQPFTELGFDSLTAVGLRNQLQQATGLALPATLVFEH
PTVRRLADHIGQQLDSGTPAREASSALRDGYRQAGVSGRVRSYLDLLAGLSDFREHFDGSDGFSLDLVDMADGPGEVTVI
CCAGTAAISGPHEFTRLAGALRGIAPVRAVPQPGYEEGEPLPSSMAAVAAVQADAVIRTQGDKPFVVAGHSAGALMAYAL
ATELLDRGHPPRGVVLIDVYPPGHQDAMNAWLEELTATLFDRETVRMDDTRLTALGAYDRLTGQWRPRETGLPTLLVSAG
EPMGPWPDDSWKPTWPFEHDTVAVPGDHFTMVQEHADAIARHIDAWLGGGNS
>Q2YIQ1 2.7.1.215~~~eryA~~~Erythritol kinase~~~
MSAMREKGDIIIGIDAGTSVLKAVAFDFSGRQIESAAVRNTYVTGDHGAVTQSLAQTWQDCARALRDLGAKLPGLAQRTA
AIAVTGQGDGTWLVGKDNRPVGDAWIWLDARAASTVTRLAAGPMNRARFEATGTGLNTCQQGAQMAHMDTIAPELLDNAE
AALHCKDWLYLNLTGVRATDPSEASFTFGNFRTRQYDDVVIEALGLQKRRNLLPEIIDGSQSQHPLSAEAAAATGLLAGT
PVSLGYVDMAMTALGAGVCGGTAGAGCSTIGSTGVHMRAKPVADIHLNKEGTGYVIALPIPGIVTQVQTNMGATINIDWI
LQVAADLMSTPEKPVSLGDLIPRLDDWFNASRPGAILYHPYISEAGERGPFVNANARAGFIGLSSRDRFPELVRSVVEGL
GMATRDCYAAMGEMPAELRITGGAARSKALRGTLSAAVNAPVRVSAREEAGAAGAAMMAAVAIGAYPAMDDCIAEWVEPL
LGASEAPDAARAHQYEELFVAYREARLALAPVWDKLASGK
>O33939 2.4.1.328~~~eryBV~~~Erythronolide mycarosyltransferase~~~
MRVLLTSFAHRTHFQGLVPLAWALRTAGHDVRVAAQPALTDAVIGAGLTAVPVGSDHRLFDIVPEVAAQVHRYSFYLDFY
HREQELHSWEFLLGMQEATSRWVYPVVNNDSFVAELVDFARDWRPDLVLWEPFTFAGAVAARACGAAHARLLWGSDLTGY
FRGRFQAQRLRRPPEDRPDPLGTWLTEVAGRFGVEFGEDLAVGQWSVDQLPPSFRLDTGMETVVARTLPYNGASVVPDWL
KKGSATRRICITGGFSGLGLAADADQFARTLAQLARFDGEIVVTGSGPDTSAVPDNIRLVDFVPMGVLLQNCAAIIHHGG
AGTWATALHHGIPQISVAHEWDCMLRGQQTAELGAGIYLRPDEVDADSLASALTQVVEDPTYTENAVKLREEALSDPTPQ
EIVPRLEELTRRHAG
>Q2YIQ2 1.1.1.402~~~eryB~~~D-erythritol 1-phosphate dehydrogenase~~~
MAEPETCDLFVIGGGINGAGVARDAAGRGLKVVLAEKDDLAQGTSSRSGKLVHGGLRYLEYYEFRLVREALIEREVLLNA
APHIIWPMRFVLPHSPQDRPAWLVRLGLFLYDHLGGRKKLPGTRTLDLKRDPEGTPILDQYTKGFEYSDCWVDDARLVAL
NAVGAAEKGATILTRTPVVSARRENGGWIVETRNSDTGETRTFRARCIVNCAGPWVTDVIHNVAASTSSRNVRLVKGSHI
IVPKFWSGANAYLVQNHDKRVIFINPYEGDKALIGTTDIAYEGRAEDVAADEKEIDYLITAVNRYFKEKLRREDVLHSFS
GVRPLFDDGKGNPSAVTRDYVFDLDETGGAPLLNVFGGKITTFRELAERGMHRLKHIFPQMGGDWTHDAPLPGGEIANAD
YETFANTLRDTYPWMPRTLVHHYGRLYGARTKDVVAGAQNLEGLGRHFGGDFHEAEVRYLVAREWAKTAEDILYRRTKHY
LHLTEAERAAFVEWFDNANLVA
>A4F7P2 ~~~eryCII~~~Cytochrome P450 family protein EryCII~~~COG2124
MTTTDRAGLGRQLQMIRGLHWGYGSNGDPYPMLLCGHDDDPQRRYRSMRESGVRRSRTETWVVADHATARQVLDDPAFTR
ATGRTPEWMRAAGAPPAEWAQPFRDVHAASWEGEVPDVGELAESFAGLLPGAGARLDLVGDFAWQVPVQGMTAVLGAAGV
LRGAAWDARVSLDAQLSPQQLAVTEAAVAALPADPALRALFAGAEMTANTVVDAVLAVSAEPGLAERIADDPAAAQRTVA
EVLRLHPALHLERRTATAEVRLGEHVIGEGEEVVVVVAAANRDPEVFAEPDRLDVDRPDADRALSAHRGHPGRLEELVTA
LATAALRAAAKALPGLTPSGPVVRRRRSPVLRGTNRCPVEL
>A4F7P3 2.4.1.278~~~eryCIII~~~3-alpha-mycarosylerythronolide B desosaminyl transferase~~~COG1819
MRVVFSSMASKSHLFGLVPLAWAFRAAGHEVRVVASPALTEDITAAGLTAVPVGTDVDLVDFMTHAGHDIIDYVRSLDFS
ERDPATLTWEHLLGMQTVLTPTFYALMSPDTLIEGMVSFCRKWRPDLVIWEPLTFAAPIAAAVTGTPHARLLWGPDITTR
ARQNFLGLLPDQPEEHREDPLAEWLTWTLEKYGGPAFDEEVVVGQWTIDPAPAAIRLDTGLKTVGMRYVDYNGPSVVPEW
LHDEPERRRVCLTLGISSRENSIGQVSIEELLGAVGDVDAEIIATFDAQQLEGVANIPDNVRTVGFVPMHALLPTCAATV
HHGGPGSWHTAAIHGVPQVILPDGWDTGVRAQRTQEFGAGIALPVPELTPDQLRESVKRVLDDPAHRAGAARMRDDMLAE
PSPAEVVGICEELAAGRREPR
>Q2YIQ3 5.1.3.38~~~eryC~~~D-erythrulose 1-phosphate 3-epimerase~~~
MALTLSLNTNPLVNRFAEPDDLIETVARDLRLRDLQLTHEFINPSWQASTIRRLTRDMDRALQRTGVRVTSGMTGPYGRL
NHFGHPDRDVRRYYVDWFKTFADIIGDLGGKSVGTQFAIFTYKDFDDPARREELIKIAIDCWAEVAEHAAGAGLDYVFWE
PMSIGREFGETIAECMKLQDRLTAANMAIPMWMMADIDHGDVTSANPDDYDPYAWARTVPKVSPIIHIKQSLMDKGGHRP
FTAAFNAKGRIQPEPLLKAFAEGGAVDNEICLELSFKEREPNDREVIPQIAESVAFWAPHIDTGAKDLKI
>Q2YIQ4 ~~~eryD~~~Erythritol catabolism regulatory protein EryD~~~
MADADDSLALRAAWLHFVAGMTQSAVAKRLGLPSVKAHRLIAKAVADGAVKVTIDGDITECIDLENRLADLYGLDYCEVA
PDIGEEGLPLMALGHAGANFMRREIEHGDHEVIGIGHGRTLSAAVGYMPRVMANDLRFVSLLGGLTRNFAANPHDVMHRI
AEKTGMPAYVMPVPFFANTAEDREVLLAQRGVTTVFDMGCRAELKIVGIGTVDAQAQLVTSGMIELGEVEEIANLGGVGE
MLGHFFNANGQWLETALTGRTIAASVENADMSRIVALAGGLSKVDAIRAVLKSGRLYGLITDERTAKALIGQPNGK
>A4F7P5 2.1.1.254~~~eryG~~~Erythromycin 3''-O-methyltransferase~~~COG2230
MSVKQKSALQDLVDFAKWHVWTRVRPSSRARLAYELFADDHEATTEGAYINLGYWKPGCAGLEEANQELANQLAEAAGIS
EGDEVLDVGFGLGAQDFFWLETRKPARIVGVDLTPSHVRIASERAERENVQDRLQFKEGSATDLPFGAETFDRVTSLESA
LHYEPRTDFFKGAFEVLKPGGVLAIGDIIPLDLREPGSDGPPKLAPQRSGSLSGGIPVENWVPRETYAKQLREAGFVDVE
VKSVRDNVMEPWLDYWLRKLQDESFKKSVSRLFYSQVKRSLTSDSGMKGELPALDFVIASARKPGA
>Q2YIQ6 5.3.1.33~~~eryH~~~L-erythrulose-1-phosphate isomerase~~~
MTKFWIGTSWKMNKTLAEARLFAEALKAADAGRSPDIQRFVIPPFTAVREVKEILSGTSVKVGAQNMHWADQGAWTGEIS
PLMLKDCNLDIVELGHSERREHFGETNETVGLKVEAAVRHGLIPLICIGETLEDRESGRAAAVLEEEVRGALSKLSEAQK
QAEILFAYEPVWAIGENGIPASADYADARQAEIIAVAQSVLARRVPCLYGGSVNPGNCEELIACPHIDGLFIGRSAWNVE
GYLDILARCATKVQAN
>Q9ZB26 5.3.1.34~~~eryI~~~D-erythrulose-4-phosphate isomerase~~~
MKVAVAGDSAGEGLAKVLADHLKDRFEVSEISRTDAGADAFYANLSDRVASAVLDGTYDRAILVCGTGIGVCIAANKVPG
IRAALTHDTYSAERAALSNNAQIITMGARVIGAEVAKTIADAFLAQTFDENGRSAGNVNAINEVDAKYNKF
>P48635 1.14.13.154~~~eryK~~~Erythromycin C-12 hydroxylase~~~COG2124
MTTIDEVPGMADETALLDWLGTMREKQPVWQDRYGVWHVFRHADVQTVLRDTATFSSDPTRVIEGASPTPGMIHEIDPPE
HRALRKVVSSAFTPRTISDLEPRIRDVTRSLLADAGESFDLVDVLAFPLPVTIVAELLGLPPMDHEQFGDWSGALVDIQM
DDPTDPALAERIADVLNPLTAYLKARCAERRADPGDDLISRLVLAEVDGRALDDEEAANFSTALLLAGHITTTVLLGNIV
RTLDEHPAHWDAAAEDPGRIPAIVEEVLRYRPPFPQMQRTTTKATEVAGVPIPADVMVNTWVLSANRDSDAHDDPDRFDP
SRKSGGAAQLSFGHGVHFCLGAPLARLENRVALEEIIARFGRLTVDRDDERLRHFEQIVLGTRHLPVLAGSSPRQSA
>A0A0H2XFP1 ~~~esaA~~~Type VII secretion system accessory factor EsaA~~~
MKKKNWIYALIVTLIIIIAIVSMIFFVQTKYGDQSEKGSQSVSNKNNKIHIAIVNEDQPTTYNGKKVELGQAFIKRLANE
KNYKFETVTRNVAESGLKNGGYQVMIVIPENFSKLAMQLDAKTPSKISLQYKTAVGQKEEVAKNTEKVVSNVLNDFNKNL
VEIYLTSIIDNLHNAQKNVGAIMTREHGVNSKFSNYLLNPINDFPELFTDTLVNSISANKDITKWFQTYNKSLLSANSDT
FRVNTDYNVSTLIEKQNSLFDEHNTAMDKMLQDYKSQKDSVELDNYINALKQMDSQIDQQSSMQDTGKEEYKQTVKENLD
KLREIIQSQESPFSKGMIEDYRKQLTESLQDELANNKDLQDALNSIKMNNAQFAENLEKQLHDDIVKEPDTDTTFIYNMS
KQDFIAAGLNEDEANKYEAIVKEAKRYKNEYNLKKPLAEHINLTDYDNQVAQDTSSLINDGVKVQRTETIKSNDINQLTV
ATDPHFNFEGDIKINGKKYDIKDQSVQLDTSNKEYKVEVNGVAKLKKDAEKDFLKDKTMHLQLLFGQANRQDEPNDKKAT
SVVDVTLNHNLDGRLSKDALSQQLSALSRFDAHYKMYTDTKGREDKPFDNKRLIDMMVDQVINDMESFKDDKVAVLHQID
SMEENSDKLIDDILNNKKNTTKNKEDISKLIDQLENVKKTFAEEPQEPKIDKGKNDEFNTMSSNLDKEISRISEKSTQLL
SDTQESKSIADSVSGQLNQVDNNVNKLHATGRALGVRANDLNRQMAKNDKDNELFAKEFKKVLQNSKDGDRQNQALKAFM
SNPVQKKNLENVLANNGNTDVISPTLFVLLMYLLSMITAYIFYSYERAKGQMNFIKDDYSSKNHLWNNVITSGVIGTTGL
VEGLIVGLIAMNKFHVLAGYRAKFILMVILTMMVFVLINTYLLRQVKSIGMFLMIAALGLYFVAMNNLKAAGQGVTNKIS
PLSYIDNMFFNYLNAEHPIGLVLVILTVLVIIGFVLNMFIKHFKKERLI
>Q2G188 ~~~esaA~~~Type VII secretion system accessory factor EsaA~~~COG0842
MKKKNWIYALIVTLIIIIAIVSMIFFVQTKYGDQSEKGSQSVSNKNNKIHIAIVNEDQPTTYNGKKVELGQAFIKRLANE
KNYKFETVTRNVAESGLKNGGYQVMIVIPENFSKLAMQLDAKTPSKISLQYKTAVGQKEEVAKNTEKVVSNVLNDFNKNL
VEIYLTSIIDNLHNAQKNVGAIMTREHGVNSKFSNYLLNPINDFPELFTDTLVNSISANKDITKWFQTYNKSLLSANSDT
FRVNTDYNVSTLIEKQNSLFDEHNTAMDKMLQDYKSQKDSVELDNYINALKQMDSQIDQQSSMQDTGKEEYKQTVKENLD
KLREIIQSQESPFSKGMIEDYRKQLTESLQDELANNKDLQDALNSIKMNNAQFAENLEKQLHDDIVKEPDTDTTFIYNMS
KQDFIAAGLNEDEANKYEAIVKEAKRYKNEYNLKKPLAEHINLTDYDNQVAQDTSSLINDGVKVQRTETIKSNDINQLTV
ATDPHFNFEGDIKINGKKYDIKDQSVQLDTSNKEYKVEVNGVAKLKKDAEKDFLKDKTMHLQLLFGQANRQDEPNDKKAT
SVVDVTLNHNLDGRLSKDALSQQLSALSRFDAHYKMYTDTKGREDKPFDNKRLIDMMVDQVINDMESFKDDKVAVLHQID
SMEENSDKLIDDILNNKKNTTKNKEDISKLIDQLENVKKTFAEEPQEPKIDKGKNDEFNTMSSNLDKEISRISEKSTQLL
SDTQESKSIADSVSGQLNQVDNNVNKLHATGRALGVRANDLNRQMAKNDKDNELFAKEFKKVLQNSKDGDRQNQALKAFM
SNPVQKKNLENVLANNGNTDVISPTLFVLLMYLLSMITAYIFYSYERAKGQMNFIKDDYSSKNHLWNNVITSGVIGTTGL
VEGLIVGLIAMNKFHVLAGYRAKFILMVILTMMVFVLINTYLLRQVKSIGMFLMIAALGLYFVAMNNLKAAGQGVTNKIS
PLSYIDNMFFNYLNAEHPIGLVLVILTVLVIIGFVLNMFIKHFKKERLI
>P0C049 ~~~esaA~~~Type VII secretion system accessory factor EsaA~~~
MKKKNWIYALIVTLIIIIAIVSMIFFVQTKYGDQSEKGSQSVSNKNNKIHIAIVNEDQPTTYNGKKVELGQAFIKRLANE
KNYKFETVTRNVAESGLKNGGYQVMIVIPENFSKLAMQLDAKTPSKISLQYKTAVGQKEEVAKNTEKVVSNVLNDFNKNL
VEIYLTSIIDNLHNAQKNVGAIMTREHGVNSKFSNYLLNPINDFPELFTDTLVNSISANKDITKWFQTYNKSLLSANSDT
FRVNTDYNVSTLIEKQNSLFDEHNTAMDKMLQDYKSQKDSVELDNYINALKQMDSQIDQQSSMQDTGKEEYKQTVKENLD
KLREIIQSQESPFSKGMIEDYRKQLTESLQDELANNKDLQDALNSIKMNNAQFAENLEKQLHDDIVKEPDTDTTFIYNMS
KQDFIAAGLNEGEANKYEAIVKEAKRYKNEYNLKKPLAEHINLTDYDNQVAQDTSSLINDGVKVQRTETIKSNDINQLTV
ATDPHFNFEGDIKINGKKYDIKDQSVQLDTSNKEYKVEVNGVAKLKKDAEKDFLKDKTMHLQLLFGQANRQDEPNDKKAT
SVVDVTLNHNLDGRLSKDALSQQLSALSRFDAHYKMYTDTKGREDKPFDNKRLIDMMVDQVINDMESFKDDKVAVLHQID
SMEENSDKLIDDILNNKKNTTKNKEDISKLIDQLENVKKTFAEEPQEPKIDKGKNDEFNTMSSNLDKEISRISEKSTQLL
SDTQESKSIADSVSGQLNQVDNNVNKLHATGRALGVRANDLNRQMAKNDKDNELFAKEFKKVLQNSKDGDRQNQALKAFM
SNPVQKKNLENVLANNGNTDVISPTLFVLLMYLLSMITAYIFYSYERAKGQMNFIKDDYSSKNHLWNNVITSGVIGTTGL
VEGLIVGLIAMNKFHVLAGYRAKFILMVILTMMVFVLINTYLLRQVKSIGMFLMIAALGLYFVAMNNLKAAGQGVTNKIS
PLSYIDNMFFNYLNAEHPIGLVLVILTVLVIIGFVLNMFIKHFKKERLI
>P0C050 ~~~esaB~~~Type VII secretion system accessory factor EsaB~~~
MNQHVKVTFDFTNYNYGTYDLAVPAYLPIKNLIALVLDSLDISIFDVNTQIKVMTKGQLLVENDRLIDYQIADGDILKLL
>P54656 2.3.1.184~~~esaI~~~Acyl-homoserine-lactone synthase~~~
MLELFDVSYEELQTTRSEELYKLRKKTFSDRLGWEVICSQGMESDEFDGPGTRYILGICEGQLVCSVRFTSLDRPNMITH
TFQHCFSDVTLPAYGTESSRFFVDKARARALLGEHYPISQVLFLAMVNWAQNNAYGNIYTIVSRAMLKILTRSGWQIKVI
KEAFLTEKERIYLLTLPAGQDDKQQLGGDVVSRTGCPPVAVTTWPLTLPV
>P54293 ~~~esaR~~~Transcriptional activator protein EsaR~~~
MFSFFLENQTITDTLQTYIQRKLSPLGSPDYAYTVVSKKNPSNVLIISSYPDEWIRLYRANNFQLTDPVILTAFKRTSPF
AWDENITLMSDLRFTKIFSLSKQYNIVNGFTYVLHDHMNNLALLSVIIKGNDQTALEQRLAAEQGTMQMLLIDFNEQMYR
LAGTEGERAPALNQSADKTIFSSRENEVLYWASMGKTYAEIAAITGISVSTVKFHIKNVVVKLGVSNARQAIRLGVELDL
IRPAASAAR
>P0DJ88 ~~~espF(U)~~~Secreted effector protein EspF(U)~~~
MINNVSSLFPTVNRNITAVYKKSSFSVSPQKITLNPVKISSPFSPSSSSISATTLFRAPNAHSASFHRQSTAESSLHQQL
PNVRQRLIQHLAEHGIKPARSMAEHIPPAPNWPAPPPPVQNEQSRPLPDVAQRLVQHLAEHGIQPARNMAEHIPPAPNWP
APPLPVQNEQSRPLPDVAQRLVQHLAEHGIQPARSMAEHIPPAPNWPAPPPPVQNEQSRPLPDVAQRLMQHLAEHGIQPA
RNMAEHIPPAPNWPAPTPPVQNEQSRPLPDVAQRLMQHLAEHGIQPARNMAEHIPPAPNWPAPTPPVQNEQSRPLPDVAQ
RLMQHLAEHGINTSKRS
>P0DJ89 ~~~espF(U)~~~Secreted effector protein EspF(U)~~~
MINNVSSLFPTVNRNITAVYKKSSFSVSPQKITLNPVKISSPFSPSSSSISATTLFRAPNAHSASFHRQSTAESSLHQQL
PNVRQRLIQHLAEHGIKPARSMAEHIPPAPNWPAPPPPVQNEQSRPLPDVAQRLVQHLAEHGIQPARNMAEHIPPAPNWP
APPLPVQNEQSRPLPDVAQRLVQHLAEHGIQPARSMAEHIPPAPNWPAPPPPVQNEQSRPLPDVAQRLMQHLAEHGIQPA
RNMAEHIPPAPNWPAPTPPVQNEQSRPLPDVAQRLMQHLAEHGIQPARNMAEHIPPAPNWPAPTPPVQNEQSRPLPDVAQ
RLMQHLAEHGINTSKRS
>A0A0H2VDN9 ~~~esiB~~~Secretory immunoglobulin A-binding protein EsiB~~~COG0790
MKKSLLAVMLTGLFALVSLPALGNVNLEQLKQKAESGEAKAQLELGYRYFQGNETTKDLTQAMDWFRRAAEQGYTPAEYV
LGLRYMNGEGVPQDYAQAVIWYKKAALKGLPQAQQNLGVMYHEGNGVKVDKAESVKWFRLAAEQGRDSGQQSMGDAYFEG
DGVTRDYVMAREWYSKAAEQGNVWSCNQLGYMYSRGLGVERNDAISAQWYRKSATSGDELGQLHLADMYYFGIGVTQDYT
QSRVLFSQSAEQGNSIAQFRLGYILEQGLAGAKEPLKALEWYRKSAEQGNSDGQYYLAHLYDKGAEGVAKNREQAISWYT
KSAEQGDATAQANLGAIYFRLGSEEEHKKAVEWFRKAAAKGEKAAQFNLGNALLQGKGVKKDEQQAAIWMRKAAEQGLSA
AQVQLGEIYYYGLGVERDYVQAWAWFDTASTNDMNLFGTENRNITEKKLTAKQLQQAELLSQQYIEKYAPEAWARMQKLK
AQSAVKTGNK
>P9WJE1 ~~~espA~~~ESX-1 secretion-associated protein EspA~~~
MSRAFIIDPTISAIDGLYDLLGIGIPNQGGILYSSLEYFEKALEELAAAFPGDGWLGSAADKYAGKNRNHVNFFQELADL
DRQLISLIHDQANAVQTTRDILEGAKKGLEFVRPVAVDLTYIPVVGHALSAAFQAPFCAGAMAVVGGALAYLVVKTLINA
TQLLKLLAKLAELVAAAIADIISDVADIIKGTLGEVWEFITNALNGLKELWDKLTGWVTGLFSRGWSNLESFFAGVPGLT
GATSGLSQVTGLFGAAGLSASSGLAHADSLASSASLPALAGIGGGSGFGGLPSLAQVHAASTRQALRPRADGPVGAAAEQ
VGGQSQLVSAQGSQGMGGPVGMGGMHPSSGASKGTTTKKYSEGAAAGTEDAERAPVEADAGGGQKVLVRNVV
>B2HNQ9 ~~~espB~~~ESX-1 secretion-associated protein EspB~~~
MSQPQTVTVDQQEILNRADEVEAPMATPPTDVPQAPSGLTAANNAAEQLAVSADNVRLYLQAGERERQRLATSLRNAAAA
YGEVEDESATALDNDGNGEVDAQSAGGAGAGQTESLEETPKVAAAGESDFTDLKTAATKLESGDQGTSMVNFADGWNNFN
LSLQRDIKRFRIFENWEGDAATACEASMDQQKEWILHMAKLSASLAKQANFMAQLQLWARRGHPTLADIVELERLAKDPD
YQEQAIKLYAEYQETSEKVLSEYNTKADLEPVNPPKPPAAIKIDPPPPAQPQGLIPGFLMPPGDGSTGLASGMTPPMIPP
TGGAGGTPDVNTAELTSAGREAASNLSKGLGVKPMSLGGGGGGLGGMPMGDAALAGGESVRPAAAGDIAGAGQGGGAAGR
GMAGGGMGMPMGGAGQGQGGAKSKGAQQDEEALYTEDREWTEAVIGNRRRQDNK
>P9WJD9 ~~~espB~~~ESX-1 secretion-associated protein EspB~~~
MTQSQTVTVDQQEILNRANEVEAPMADPPTDVPITPCELTAAKNAAQQLVLSADNMREYLAAGAKERQRLATSLRNAAKA
YGEVDEEAATALDNDGEGTVQAESAGAVGGDSSAELTDTPRVATAGEPNFMDLKEAARKLETGDQGASLAHFADGWNTFN
LTLQGDVKRFRGFDNWEGDAATACEASLDQQRQWILHMAKLSAAMAKQAQYVAQLHVWARREHPTYEDIVGLERLYAENP
SARDQILPVYAEYQQRSEKVLTEYNNKAALEPVNPPKPPPAIKIDPPPPPQEQGLIPGFLMPPSDGSGVTPGTGMPAAPM
VPPTGSPGGGLPADTAAQLTSAGREAAALSGDVAVKAASLGGGGGGGVPSAPLGSAIGGAESVRPAGAGDIAGLGQGRAG
GGAALGGGGMGMPMGAAHQGQGGAKSKGSQQEDEALYTEDRAWTEAVIGNRRRQDSKESK
>Q9EZE7 3.4.21.-~~~espC~~~Serine protease EspC~~~
MNKIYALKYCHATGGLIAVSELASRVMKKAARGSLLALFNLSLYGAFLSASQAAQLNIDNVWARDYLDLAQNKGVFKAGA
TNVSIQLKNGQTFNFPNVPIPDFSPASNKGATTSIGGAYSVTATHNGTTHHAISTQNWGQSSYKYIDRMTNGDFAVTRLD
KFVVETTGVKNSVDFSLNSHDALERYGVEINGEKKIIGFRVGAGTTYTVQNGNTYSTGQVYNPLLLSASMFQLNWDNKRP
YNNTTPFYNETTGGDSGSGFYLYDNVKKEWVMLGTLFGIASSGADVWSILNQYDENTVNGLKNKFTQKVQLNNNTMSLNS
DSFTLAGNNTAVEKNNNNYKDLSFSGGGSINFDNDVNIGSGGLIFDAGHHYTVTGNNKTFKGAGLDIGDNTTVDWNVKGV
VGDNLHKIGAGTLNVNVSQGNNLKTGDGLVVLNSANAFDNIYMASGHGVVKINHSAALNQNNDYRGIFFTENGGTLDLNG
YDQSFNKIAATDIGALITNSAVQKAVLSVNNQSNYMYHGSVSGNTEINHQFDTQKNNSRLILDGNVDITNDINIKNSQLT
MQGHATSHAVFREGGVTCMLPGVICEKDYVSGIQQQENSANKNNNTDYKTNNQVSSFEQPDWENRLFKFKTLNLINSDFI
VGRNAIVVGDISANNSTLSLSGKDTKVHIDMYDGKNITGDGFGFRQDIKDGVSVSPESSSYFGNVTLNNHSLLDIGNKFT
GGIEAYDSSVSVTSQNAVFDRVGSFVNSSLTLEKGAKLTAQGGIFSTGAVDVKENASLILTGTPSAQKQEYYSPVISTTE
GINLGDKASLSVKNMGYLSSDIHAGTTAATINLGDGDAETDSPLFSSLMKGYNAVLSGNITGEQSTVNMNNALWYSDGNS
TIGTLKSTGGRVELGGGKDFATLRVKELNANNATFLMHTNNSQADQLNVTNKLLGSNNTVLVDFLNKPASEMNVTLITAP
KGSDEKTFTAGTQQIGFSNVTPVISTEKTDDATKWMLTGYQTVSDAGASKTATDFMASGYKSFLTEVNNLNKRMGDLRDT
QGDAGVWARIMNGTGSADGGYSDNYTHVQIGADRKHELDGVDLFTGALLTYTDSNASSHAFSGKTKSVGGGLYASALFDS
GAYFDLIGKYLHHDNQYTASFASLGTKDYSSHSWYAGAEVGYRYHLSEESWVEPQMELVYGSVSGKSFSWEDRGMALSMK
DKDYNPLIGRTGVDVGRTFSGDDWKITARAGLGYQFDLLANGETVLRDASGEKRFEGEKDSRMLMNVGMNAEIKDNMRFG
LELEKSAFGKYNVDNAINANFRYSF
>P9WJD7 ~~~espC~~~ESX-1 secretion-associated protein EspC~~~
MTENLTVQPERLGVLASHHDNAAVDASSGVEAAAGLGESVAITHGPYCSQFNDTLNVYLTAHNALGSSLHTAGVDLAKSL
RIAAKIYSEADEAWRKAIDGLFT
>P9WJD5 ~~~espD~~~ESX-1 secretion-associated protein EspD~~~
MDLPGNDFDSNDFDAVDLWGADGAEGWTADPIIGVGSAATPDTGPDLDNAHGQAETDTEQEIALFTVTNPPRTVSVSTLM
DGRIDHVELSARVAWMSESQLASEILVIADLARQKAQSAQYAFILDRMSQQVDADEHRVALLRKTVGETWGLPSPEEAAA
AEAEVFATRYSDDCPAPDDESDPW
>P9WJD3 ~~~espE~~~ESX-1 secretion-associated protein EspE~~~COG3266
MASGSGLCKTTSNFIWGQLLLLGEGIPDPGDIFNTGSSLFKQISDKMGLAIPGTNWIGQAAEAYLNQNIAQQLRAQVMGD
LDKLTGNMISNQAKYVSDTRDVLRAMKKMIDGVYKVCKGLEKIPLLGHLWSWELAIPMSGIAMAVVGGALLYLTIMTLMN
ATNLRGILGRLIEMLTTLPKFPGLPGLPSLPDIIDGLWPPKLPDIPIPGLPDIPGLPDFKWPPTPGSPLFPDLPSFPGFP
GFPEFPAIPGFPALPGLPSIPNLFPGLPGLGDLLPGVGDLGKLPTWTELAALPDFLGGFAGLPSLGFGNLLSFASLPTVG
QVTATMGQLQQLVAAGGGPSQLASMGSQQAQLISSQAQQGGQQHATLVSDKKEDEEGVAEAERAPIDAGTAASQRGQEGT
VL
>Q8X482 ~~~espF(U)~~~Secreted effector protein EspF(U)~~~
MINNVSSLFPTVNRNITAVYKKSSFSVSPQKITLNPVKISSPFSPSSSSISATTLFRAPNAHSASFHRQSTAESSLHQQL
PNVRQRLIQHLAEHGIXPARSMAEHIPPAPKWPAPPPPVQNEQSRPLPDVAQRLMQHLAEHGIQPARNMAEHIPPAPNWP
APTPPVQNEQSRPLPDVAQRLMQHLAEHGIQPARNMAEHIPPAPXWXAPTPPVQNEQSRPLPDVAQRLMQHLAEHGIZPA
RSMAEHIPPAPNWPAPPPPVQNEQSRPLPDVAQRLXQHLAEHGIQPARNMAEHIPPAPNWPAPXXPVXNEQSRPLXDVAX
RLMQHLAEHGIQPARNMAEHIPPAPNWXAPTPPVQNEQSRPLPDVAQRLMQHLAEHGINTSKRS
>P9WJD1 ~~~espF~~~ESX-1 secretion-associated protein EspF~~~
MTGFLGVVPSFLKVLAGMHNEIVGDIKRATDTVAGISGRVQLTHGSFTSKFNDTLQEFETTRSSTGTGLQGVTSGLANNL
LAAAGAYLKADDGLAGVIDKIFG
>B2HMS9 ~~~espG1~~~ESX-1 secretion-associated protein EspG1~~~
MTGPLATGRAGTGDDVVGVEVTIDGMLVIADRLHLVDFPVTLGIRPNIPQEDLREIVWDQVARDLTAQGVLDHNGQPHPA
VAAMVDTLSRADRTLEGRWWRRDVGGVMVRFVVCRKGERHVIAVRDGDMLVLQLVAPRVGLAGMVTAVLGTAEPANVEPL
TGIASELGECTNAAQLTRYGLTPTTARLYTEIVTNPKSWVEIVASERHPGGTTTHTKAAAGVLDSAHGRLVSLPRQVGGE
LYGSFLPGTEQNLQRALDSLLELLPSGSWLDRADATARG
>P96210 ~~~espG1~~~ESX-1 secretion-associated protein EspG1~~~
MTGPSAAGRAGTADNVVGVEVTIDGMLVIADRLHLVDFPVTLGIRPNIPQEDLRDIVWEQVQRDLTAQGVLDLHGEPQPT
VAEMVETLGRPDRTLEGRWWRRDIGGVMVRFVVCRRGDRHVIAARDGDMLVLQLVAPQVGLAGMVTAVLGPAEPANVEPL
TGVATELAECTTASQLTQYGIAPASARVYAEIVGNPTGWVEIVASQRHPGGTTTQTDAAAGVLDSKLGRLVSLPRRVGGD
LYGSFLPGTQQNLERALDGLLELLPAGAWLDHTSDHAQASSRG
>A0QQ45 ~~~espG3~~~ESX-3 secretion-associated protein EspG3~~~
MGPNAVELTTDQAWCLADVLGAGSYPWVLAITPPYSDHSQRSAFLAAQSAELTRMGVVNSAGAVDPRVAQWITTVCRATQ
WLDLRFVSGPGDLLRGMVARRSEETVVALRNAQLVTFTAMDIGHQHALVPVLTAGLSGRKPARFDDFALPAAAGARADEQ
IRNGAPLAEVLEFLGVPPSARPLVESVFDGRRTYVEIVAGEHRDGHRVTTEVGVSIIDTPHGRILVHPTKAFDGEWISTF
TPGSADAIAMAVERLTASLPSGSWFPDQPLTRDFDEDAATHREPVLQRRTQKA
>P9WJC7 ~~~espG3~~~ESX-3 secretion-associated protein EspG3~~~
MDATPNAVELTVDNAWFIAETIGAGTFPWVLAITMPYSDAAQRGAFVDRQRDELTRMGLLSPQGVINPAVADWIKVVCFP
DRWLDLRYVGPASADGACELLRGIVALRTGTGKTSNKTGNGVVALRNAQLVTFTAMDIDDPRALVPILGVGLAHRPPARF
DEFSLPTRVGARADERLRSGVPLGEVVDYLGIPASARPVVESVFSGPRSYVEIVAGCNRDGRHTTTEVGLSIVDTSAGRV
LVSPSRAFDGEWVSTFSPGTPFAIAVAIQTLTACLPDGQWFPGQRVSRDFSTQSS
>B2HSU5 ~~~espG5~~~ESX-5 secretion-associated protein EspG5~~~
MDQQSTRTDITVNVDGFWMLQALLDIRHVAPELRCRPYVSTDSNDWLNEHPGMAVMREQGIVVGDTVNEQVAARMRVLAA
PDLEVVALLSRGKLLYGVVDNEDQPPGSRDIPDNEFRVVLARRGQHWVSAVRVGNDITVDDVSVSDSASIAALVIDGLES
IHHADPAAINAVNVPLEEMLEATKSWQESGFNVFSGGDLRRMGISASTVAALGQALSDPAAEVAVYARQYRDDAKGPSAS
VLSLKDGSGGRIALYQQARTAGSGEAWLAICPATPQLVQVGVKTVLDTLPYGEWKTHSRV
>O53943 ~~~espG5~~~ESX-5 secretion-associated protein EspG5~~~
MDQQSTRTDITVNVDGFWMLQALLDIRHVAPELRCRPYVSTDSNDWLNEHPGMAVMREQGIVVNDAVNEQVAARMKVLAA
PDLEVVALLSRGKLLYGVIDDENQPPGSRDIPDNEFRVVLARRGQHWVSAVRVGNDITVDDVTVSDSASIAALVMDGLES
IHHADPAAINAVNVPMEEMLEATKSWQESGFNVFSGGDLRRMGISAATVAALGQALSDPAAEVAVYARQYRDDAKGPSAS
VLSLKDGSGGRIALYQQARTAGSGEAWLAICPATPQLVQVGVKTVLDTLPYGEWKTHSRV
>O69732 ~~~espH~~~ESX-1 secretion-associated protein EspH~~~
MVDPPGNDDDHGDLDALDFSAAHTNEASPLDALDDYAPVQTDDAEGDLDALHALTERDEEPELELFTVTNPQGSVSVSTL
MDGRIQHVELTDKATSMSEAQLADEIFVIADLARQKARASQYTFMVENIGELTDEDAEGSALLREFVGMTLNLPTPEEAA
AAEAEVFATRYDVDYTSRYKADD
>P9WJC5 ~~~espI~~~ESX-1 secretion-associated protein EspI~~~COG0455
MAADYDKLFRPHEGMEAPDDMAAQPFFDPSASFPPAPASANLPKPNGQTPPPTSDDLSERFVSAPPPPPPPPPPPPPTPM
PIAAGEPPSPEPAASKPPTPPMPIAGPEPAPPKPPTPPMPIAGPEPAPPKPPTPPMPIAGPAPTPTESQLAPPRPPTPQT
PTGAPQQPESPAPHVPSHGPHQPRRTAPAPPWAKMPIGEPPPAPSRPSASPAEPPTRPAPQHSRRARRGHRYRTDTERNV
GKVATGPSIQARLRAEEASGAQLAPGTEPSPAPLGQPRSYLAPPTRPAPTEPPPSPSPQRNSGRRAERRVHPDLAAQHAA
AQPDSITAATTGGRRRKRAAPDLDATQKSLRPAAKGPKVKKVKPQKPKATKPPKVVSQRGWRHWVHALTRINLGLSPDEK
YELDLHARVRRNPRGSYQIAVVGLKGGAGKTTLTAALGSTLAQVRADRILALDADPGAGNLADRVGRQSGATIADVLAEK
ELSHYNDIRAHTSVNAVNLEVLPAPEYSSAQRALSDADWHFIADPASRFYNLVLADCGAGFFDPLTRGVLSTVSGVVVVA
SVSIDGAQQASVALDWLRNNGYQDLASRACVVINHIMPGEPNVAVKDLVRHFEQQVQPGRVVVMPWDRHIAAGTEISLDL
LDPIYKRKVLELAAALSDDFERAGRR
>P9WJC3 ~~~espJ~~~ESX-1 secretion-associated protein EspJ~~~
MAEPLAVDPTGLSAAAAKLAGLVFPQPPAPIAVSGTDSVVAAINETMPSIESLVSDGLPGVKAALTRTASNMNAAADVYA
KTDQSLGTSLSQYAFGSSGEGLAGVASVGGQPSQATQLLSTPVSQVTTQLGETAAELAPRVVATVPQLVQLAPHAVQMSQ
NASPIAQTISQTAQQAAQSAQGGSGPMPAQLASAEKPATEQAEPVHEVTNDDQGDQGDVQPAEVVAAARDEGAGASPGQQ
PGGGVPAQAMDTGAGARPAASPLAAPVDPSTPAPSTTTTL
>P9WJC1 ~~~espK~~~ESX-1 secretion-associated protein EspK~~~COG4932
MSITRPTGSYARQMLDPGGWVEADEDTFYDRAQEYSQVLQRVTDVLDTCRQQKGHVFEGGLWSGGAANAANGALGANINQ
LMTLQDYLATVITWHRHIAGLIEQAKSDIGNNVDGAQREIDILENDPSLDADERHTAINSLVTATHGANVSLVAETAERV
LESKNWKPPKNALEDLLQQKSPPPPDVPTLVVPSPGTPGTPGTPITPGTPITPGTPITPIPGAPVTPITPTPGTPVTPVT
PGKPVTPVTPVKPGTPGEPTPITPVTPPVAPATPATPATPVTPAPAPHPQPAPAPAPSPGPQPVTPATPGPSGPATPGTP
GGEPAPHVKPAALAEQPGVPGQHAGGGTQSGPAHADESAASVTPAAASGVPGARAAAAAPSGTAVGAGARSSVGTAAASG
AGSHAATGRAPVATSDKAAAPSTRAASARTAPPARPPSTDHIDKPDRSESADDGTPVSMIPVSAARAARDAATAAASARQ
RGRGDALRLARRIAAALNASDNNAGDYGFFWITAVTTDGSIVVANSYGLAYIPDGMELPNKVYLASADHAIPVDEIARCA
TYPVLAVQAWAAFHDMTLRAVIGTAEQLASSDPGVAKIVLEPDDIPESGKMTGRSRLEVVDPSAAAQLADTTDQRLLDLL
PPAPVDVNPPGDERHMLWFELMKPMTSTATGREAAHLRAFRAYAAHSQEIALHQAHTATDAAVQRVAVADWLYWQYVTGL
LDRALAAAC
>P9WJB9 ~~~espL~~~ESX-1 secretion-associated protein EspL~~~
MSMDELDPHVARALTLAARFQSALDGTLNQMNNGSFRATDEAETVEVTINGHQWLTGLRIEDGLLKKLGAEAVAQRVNEA
LHNAQAAASAYNDAAGEQLTAALSAMSRAMNEGMA
>Q7BSW5 3.4.21.-~~~espP~~~Serine protease EspP~~~COG3468
MNKIYSLKYSHITGGLIAVSELSGRVSSRATGKKKHKRILALCFLGLLQSSYSFASQMDISNFYIRDYMDFAQNKGIFQA
GATNIEIVKKDGSTLKLPEVPFPDFSPVANKGSTTSIGGAYSITATHNTKNHHSVATQNWGNSTYKQTDWNTSHPDFAVS
RLDKFVVETRGATEGADISLSKQQALERYGVNYKGEKKLIAFRAGSGVVSVKKNGRITPFNEVSYKPEMLNGSFVHIDDW
SGWLILTNNQFDEFNNIASQGDSGSALFVYDNQKKKWVVAGTVWGIYNYANGKNHAAYSKWNQTTIDNLKNKYSYNVDMS
GAQVATIENGKLTGTGSDTTDIKNKDLIFTGGGDILLKSSFDNGAGGLVFNDKKTYRVNGDDFTFKGAGVDTRNGSTVEW
NIRYDNKDNLHKIGDGTLDVRKTQNTNLKTGEGLVILGAEKTFNNIYITSGDGTVRLNAENALSGGEYNGIFFAKNGGTL
DLNGYNQSFNKIAATDSGAVITNTSTKKSILSLNNTADYIYHGNINGNLDVLQHHETKKENRRLILDGGVDTTNDISLRN
TQLSMQGHATEHAIYRDGAFSCSLPAPMRFLCGSDYVAGMQNTEADAVKQNGNAYKTNNAVSDLSQPDWETGTFRFGTLH
LENSDFSVGRNANVIGDIQASKSNITIGDTTAYIDLHAGKNITGDGFGFRQNIVRGNSQGETLFTGGITAEDSTIVIKDK
AKALFSNYVYLLNTKATIENGADVTTQSGMFSTSDISISGNLSMTGNPDKDNKFEPSIYLNDASYLLTDDSARLVAKNKA
SVVGDIHSTKSASIMFGHDESDLSQLSDRTSKGLALGLLGGFDVSYRGSVNAPSASATMNNTWWQLTGDSALKTLKSTNS
MVYFTDSANNKKFHTLTVDELATSNSAYAMRTNLSESDKLEVKKHLSGENNILLVDFLQKPTPEKQLNIELVSAPKDTNE
NVFKASKQTIGFSDVTPVITTRETDDKITWSLTGYNTVANKEATRNAAALFSVDYKAFLNEVNNLNKRMGDLRDINGEAG
AWARIMSGTGSASGGFSDNYTHVQVGVDKKHELDGLDLFTGFTVTHTDSSASADVFSGKTKSVGAGLYASAMFDSGAYID
LIGKYVHHDNEYTATFAGLGTRDYSTHSWYAGAEAGYRYHVTEDAWIEPQAELVYGSVSGKQFAWKDQGMHLSMKDKDYN
PLIGRTGVDVGKSFSGKDWKVTARAGLGYQFDLLANGETVLRDASGEKRIKGEKDSRMLMSVGLNAEIRDNVRFGLEFEK
SAFGKYNVDNAVNANFRYSF
>O32591 3.4.21.-~~~espP~~~Serine protease EspP~~~
MNKIYSLKYSHITGGLIAVSELSGRVSSRATGKKKHKRILALCFLGLLQSSYSFASQMDISNFYIRDYMDFAQNKGIFQA
GATNIEIVKKDGSTLKLPEVPFPDFSPVANKGSTTSIGGAYSITATHNTKNHHSVATQNWGNSTYKQTDWNTSHPDFAVS
RLDKFVVETRGATEGADISLSKQQALERYGVNYKGEKKLIAFRAGSGVVSVKKNGRITPFNEVSYKPEMLNGSFVHIDDW
SGWLILTNNQFDEFNNIASQGDSGSALFVYDNQKKKWVVAGTVWGIYNYANGKNHAAYSKWNQTTIDNLKNKYSYNVDMS
GAQVATIENGKLTGTGSDTTDIKNKDLIFTGGGDILLKSSFDNGAGGLVFNDKKTYRVNGDDFTFKGAGVDTRNGSTVEW
NIRYDNKDNLHKIGDGTLDVRKTQNTNLKTGEGLVILGAEKTFNNIYITSGDGTVRLNAENALSGGEYNGIFFAKNGGTL
DLNGYNQSFNKIAATDSGAVITNTSTKKSILSLNNTADYIYHGNINGNLDVLQHHETKKENRRLILDGGVDTTNDISLRN
TQLSMQGHATEHAIYRDGAFSCSLPAPMRFLCGSDYVAGMQNTEADAVKQNGNAYKTNNAVSDLSQPDWETGTFRFGTLH
LENSDFSVGRNANVIGDIQASKSNITIGDTTAYIDLHAGKNITGDGFGFRQNIVRGNSQGETLFTGGITAEDSTIVIKDK
AKALFSNYVYLLNTKATIENGADVTTQSGMFSTSDISISGNLSMTGNPDKDNKFEPSIYLNDASYLLTDDSARLVAKNKA
SVVGDIHSTKSASIMFGHDESDLSQLSDRTSKGLALGLLGGFDVSYRGSVNAPSASATMNNTWWQLTGDSALKTLKSTNS
MVYFTDSANNKKFHTLTVDELATSNSAYAMRTNLSESDKLEVKKHLSGENNILLVDFLQKPTPEKQLNIELVSAPKDTNE
NVFKASKQTIGFSDVTPVITTRETDDKITWSLTGYNTVANKEATRNAAALFSVDYKAFLNEVNNLNKRMGDLRDINGEAG
AWARIMSGTGSASGGFSDNYTHVQVGVDKKHELDGLDLFTGFTVTHTDSSASADVFSGKTKSVGAGLYASAMVDSGAYID
LIGKYVHHDNEYTATFAGLGTRDYSTHSWYAGAEAGYRYHVTEDAWIEPQAELVYGSVSGKQFAWKDQGMHLSMKDKDYN
PLIGRTGVDVGKSFSGKDWKVTARAGLGYQFDLLANGETVLRDASGEKRIKGEKDSRMLMSVGLNAEIRDNVRFGLEFEK
SAFGKYNVDNAVNANFRYSF
>P9WJB7 ~~~espR~~~Nucleoid-associated protein EspR~~~COG1476
MSTTFAARLNRLFDTVYPPGRGPHTSAEVIAALKAEGITMSAPYLSQLRSGNRTNPSGATMAALANFFRIKAAYFTDDEY
YEKLDKELQWLCTMRDDGVRRIAQRAHGLPSAAQQKVLDRIDELRRAEGIDA
>A0A0H2XG66 ~~~essB~~~Type VII secretion system protein EssB~~~
MVKNHNPKNEMQDMLTPLDAEEAAKTKLRLDMREIPKSSIKPEHFHLMYLLEQHSPYFIDAELTELRDSFQIHYDINDNH
TPFDNIKSFTKNEKLRYLLNIKNLEEVNRTRYTFVLAPDELFFTRDGLPIAKTRGLQNVVDPLPVSEAEFLTRYKALVIC
AFNEKQSFDALVEGNLELHKGTPFETKVIEAATLDLLTAFLDEQYQKQEQDYSQNYAYVRKVGHTVFKWVAIGMTTLSVL
LIAFLAFLYFSVMKHNERIEKGYQAFVKDDYTQVLNTYDDLDGKKLDKEALYIYAKSYIQTNKQGLEKDKKENLLNNVTP
NSNKDYLLYWMELGQGHLDEAINIATYLDDNDITKLALINKLNEIKNNGDLSNDKRSEETKKYNDKLQDILDKEKQVKDE
KAKSEEEKAKAKDEKLKQQEENEKKQKEQAQKDKEKRQEAERKK
>Q2G185 ~~~essB~~~Type VII secretion system protein EssB~~~COG4499
MVKNHNPKNEMQDMLTPLDAEEAAKTKLRLDMREIPKSSIKPEHFHLMYLLEQHSPYFIDAELTELRDSFQIHYDINDNH
TPFDNIKSFTKNEKLRYLLNIKNLEEVNRTRYTFVLAPDELFFTRDGLPIAKTRGLQNVVDPLPVSEAEFLTRYKALVIC
AFNEKQSFDALVEGNLELHKGTPFETKVIEAATLDLLTAFLDEQYQKQEQDYSQNYAYVRKVGHTVFKWVAIGMTTLSVL
LIAFLAFLYFSVMKHNERIEKGYQAFVKDDYTQVLNTYDDLDGKKLDKEALYIYAKSYIQTNKQGLEKDKKENLLNNVTP
NSNKDYLLYWMELGQGHLDEAINIATYLDDNDITKLALINKLNEIKNNGDLSNDKRSEETKKYNDKLQDILDKEKQVKDE
KAKSEEEKAKAKDEKLKQQEENEKKQKEQAQKDKEKRQEAERKK
>P0C053 ~~~essB~~~Type VII secretion system protein EssB~~~
MVKNHNPKNEMQDMLTPLDAEEAAKTKLRLDMREIPKSSIKPEHFHLMYLLEQHSPYFIDAELTELRDSFQIHYDINDNH
TPFDNIKSFTKNEKLRYLLNIKNLEEVNRTRYTFVLAPDELFFTRDGLPIAKTRGLQNVVDPLPVSEAEFLTRYKALVIC
AFNEKQSFDALVEGNLELHKGTPFETKVIEAATLDLLTAFLDEQYQKQEQDYSQNYAYVRKVGHTVFKWVAIGMTTLSVL
LIAFLAFLYFSVMKHNERIEKGYQAFVKDDYTQVLNTYDDLDGKKLDKEALYIYAKSYIQTNKQGLEKDKKENLLNNVTP
NSNKDYLLYWMELGQGHLDEAINIATYLDDNDITKLALINKLNEIKNNGDLSNDKRSEETKKYNDKLQDILDKEKQVKDE
KAKSEEEKAKAKDEKLKQQEENEKKQKEQAQKDKEKRQEAERKK
>Q2G184 ~~~essC~~~Type VII secretion system protein EssC~~~
MHKLIIKYNKQLKMLNLRDGKTYTISEDERADITLKSLGEVIHLEQNNQGTWQANHTSINKVLVRKGDLDDITLQLYTEA
DYASFAYPSIQDTMTIGPNAYDDMVIQSLMNAIIIKDFQSIQESQYVRIVHDKNTDVYINYELQEQLTNKAYIGDHIYVE
GIWLEVQADGLNVLSQNTVASSLIRLTQEMPHAQADDYNTYHRSPRIIHREPTDDIKIERPPQPIQKNNTVIWRSIIPPL
VMIALTVVIFLVRPIGIYILMMIGMSTVTIVFGITTYFSEKKKYNKDVEKREKDYKAYLDNKSKEINKAIKAQRFSLNYH
YPTVAEIKDIVETKAPRIYEKTSHHHDFLHYKLGIANVEKSFKLDYQEEEFNQRRDELFDDAKELYEFYTDVEQAPLIND
LNHGPIAYIGARHLILEELEKMLIQLSTFHSYHDLEFLFVTREDEVETLKWARWLPHMTLRGQNIRGFVYNQRTRDQILT
SIYSMIKERIQAVRERSRSNEQIIFTPQLVFVITDMSLIIDHVILEYVNQDLSEYGISLIFVEDVIESLPEHVDTIIDIK
SRTEGELITKEKELVQLKFTPENIDNVDKEYIARRLANLIHVEHLKNAIPDSITFLEMYNVKEVDQLDVVNRWRQNETYK
TMAVPLGVRGKDDILSLNLHEKAHGPHGLVAGTTGSGKSEIIQSYILSLAINFHPHEVAFLLIDYKGGGMANLFKDLVHL
VGTITNLDGDEAMRALTSIKAELRKRQRLFGEHDVNHINQYHKLFKEGIATEPMPHLFIISDEFAELKSEQPDFMKELVS
TARIGRSLGIHLILATQKPSGVVDDQIWSNSKFKLALKVQDRQDSNEILKTPDAADITLPGRAYLQVGNNEIYELFQSAW
SGATYDIEGDKLEVEDKTIYMINDYGQLQAINKDLSGLEDEETKENQTELEAVIDHIESITTRLEIEEVKRPWLPPLPEN
VYQEDLVETDFRKLWSDDAKEVELTLGLKDVPEEQYQGPMVLQLKKAGHIALIGSPGYGRTTFLHNIIFDVARHHRPDQA
HMYLFDFGTNGLMPVTDIPHVADYFTVDQEDKIAKAIRIFNDEIDRRKKILSQYRVTSISEYRKLTGETIPHVFILIDNF
DAVKDSPFQEVFENMMIKMTREGLALDMQVTLTASRANAMKTPMYINMKTRIAMFLYDKSEVSNVVGQQKFAVKDVVGRA
LLSSDDNVSFHIGQPFKHDETKSYNDQINDEVSAMTEFYKGETPNDIPMMPDEIKYEDYRESLNLPDIVANGALPIGLDY
EGVTLQKIKLTEPAMISSENPREIAHIAEIMMKEIDILNEKYAICIADSSGEFKAYRHQVANFAEEREDIKAIHQLMIED
LKQREMDGPFEKDSLYIINDFKTFIDCTYIPEDDVKKLITKGPELGLNILFVGIHKELIDAYDKQIDVARKMINQFSIGI
RISDQQFFKFRFIQREPVIKENEAYMVANQAYQKIRWFK
>Q932J9 ~~~essC~~~Type VII secretion system protein EssC~~~
MHKLIIKYNKQLKMLNLRDGKTYTISEDERADITLKSLGEVIHLEQNNQGTWQANHTSINKVLVRKGDLDDITLQLYTEA
DYASFAYPSIQDTMTIGPNAYDDMVIQSLMNAIIIKDFQSIQESQYVRIVHDKNTDVYINYELQEQLTNKAYIGDHIYVE
GIWLEVQADGLNVLSQNTVASSLIRLTQEMPHAQADDYNTYHRSPRIIHREPTDDIKIERPPQPIQKNNTVIWRSIIPPL
VMIALTVVIFLVRPIGIYILMMIGMSTVTIVFGITTYFSEKKKYNKDVEKREKDYKAYLDNKSKEINKAIKAQRFSLNYH
YPTVAEIKDIVETKAPRIYEKTSHHHDFLHYKLGIANVEKSFKLDYQEEEFNQRRDELFDDAKELYEFYTDVEQAPLIND
LNHGPIAYIGARHLILEELEKMLIQLSTFHSYHDLEFLFVTREDEVETLKWARWLPHMTLRGQNIRGFVYNQRTRDQILT
SIYSMIKERIQAVRERSRSNEQIIFTPQLVFVITDMSLIIDHVILEYVNQDLSEYGISLIFVEDVIESLPEHVDTIIDIK
SRTEGELITKEKELVQLKFTPENIDNVDKEYIARRLANLIHVEHLKNAIPDSITFLEMYNVKEVDQLDVVNRWRQNETYK
TMAVPLGVRGKDDILSLNLHEKAHGPHGLVAGTTGSGKSEIIQSYILSLAINFHPHEVAFLLIDYKGGGMANLFKDLVHL
VGTITNLDGDEAMRALTSIKAELRKRQRLFGEHDVNHINQYHKLFKEGVATEPMPHLFIISDEFAELKSEQPDFMKELVS
TARIGRSLGIHLILATQKPSGVVDDQIWSNSKFKLALKVQDRQDSNEILKTPDAADITLPGRAYLQVGNNEIYELFQSAW
SGATYDIEGDKLEVEDKTIYMINDYGQLQAINKDLSGLEDEETKENQTELEAVIDHIESITTRLEIEEVKRPWLPPLPEN
VYQEDLVETDFRKLWSDDAKEVELTLGLKDVPEEQYQGPMVLQLKKAGHIALIGSPGYGRTTFLHNIIFDVARHHRPDQA
HMYLFDFGTNGLMPVTDIPHVADYFTVDQEDKIAKAIRIFNDEIDRRKKILSQYRVTSISEYRKLTGETIPYVFILIDNF
DAVKDSPFQEVFENMMIKMTREGLALDMQVTLTASRANAMKTPMYINMKTRIAMFLYDKSEVSNVVGQQKFAVKDVVGRA
LLSSDDNVSFHIGQPFKHDETKSYNDQINDEVSAMTEFYKGETPNDIPMMPDEIKYEDYRESLSLPDIVANGALPIGLDY
EGVTLQKIKLTEPAMISSENPREIAHIAEIMMKEIDILNEKYAICIADSSGEFKAYRHQVANFAEEREDIKAIHQLMIED
LKQREMDGPFEKDSLYIINDFKTYIDCTYIPEDDVKKLITKGPELGLNILFVGIHKELIDAYDKQIDVARKMINQFSIGI
RISDQQFFKFRFIQREPVIKENEAYMVANQAYQKIRWFK
>A0A0H2XIV9 ~~~essD~~~Type VII secretion system protein EssD~~~
MTKDIEYLTADYDNEKSSIQSVIDAIEGQDFLDVDTTMDDAVSDVSSLDEDGAISLTSSVVGPQGSKLMGYYQNELYDYA
SQLDSKMKEIIDTPFIEDIDKAFKGITNVKLENILIKNGGGHGRDTYGASGKIAKGDAKKSDSDVYSIDEILKSDQEFVK
VIDQHYKEMKKEDKKLSKSDFEKMMTQGASCDYMTVAEAEELEEQKKKEEAIEIAALAGMVVLSCINPVAGAVAIGAYSA
YSAANAATGKNIVTGRKLSKEERIMEGLSLIPLPGMGFLKGAGKSLMKLGFKGGEKFAVKTGLQKTMQQAVSRISPKMGM
MKNSVLNQSRNFAQNTHVGQMLSNMRGQATHTVQQSRNWIGQQAQNVKRIVNNGLDKEIAHPFKQQLAPAGMGGIKFAET
TTLRNMGQNIKRAVTPQNHVTHGPKDSMVRSEGKHSISSHEMNSSKYVESPNYTKVEFGEHYARLRPKKLKANIEYTTPT
GHIYRTDHKGRIKEVYVDNLSLKDGDRNSHAQRTVGGEDRLPDDDGGHLIARMFGGSKDIDNLVAQSKFINRPFKEKGHW
YNLEKEWQEFLNSGKEVKNIKMEVKYSGNSQRPTIFKVEYEINGERNIRRILNK
>Q2G179 ~~~essD~~~Type VII secretion system protein EssD~~~
MTKDIEYLTADYDNEKSSIQSVIDAIEGQDFLDVDTTMDDAVSDVSSLDEDGAISLTSSVVGPQGSKLMGYYQNELYDYA
SQLDSKMKEIIDTPFIEDIDKAFKGITNVKLENILIKNGGGHGRDTYGASGKIAKGDAKKSDSDVYSIDEILKSDQEFVK
VIDQHYKEMKKEDKKLSKSDFEKMMTQGASCDYMTVAEAEELEEQKKKEEAIEIAALAGMVVLSCINPVAGAVAIGAYSA
YSAANAATGKNIVTGRKLSKEERIMEGLSLIPLPGMGFLKGAGKSLMKLGFKGGEKFAVKTGLQKTMQQAVSRISPKMGM
MKNSVLNQSRNFAQNTHVGQMLSNMRGQATHTVQQSRNWIGQQAQNVKRIVNNGLDKEIAHPFKQQLAPAGMGGIKFAET
TTLRNMGQNIKRAVTPQNHVTHGPKDSMVRSEGKHSISSHEMNSSKYVESPNYTKVEFGEHYARLRPKKLKANIEYTTPT
GHIYRTDHKGRIKEVYVDNLSLKDGDRNSHAQRTVGGEDRLPDDDGGHLIARMFGGSKDIDNLVAQSKFINRPFKEKGHW
YNLEKEWQEFLNSGKEVKNIKMEVKYSGNSQRPTIFKVEYEINGERNIRRILNK
>A0A0H3KDT7 ~~~essD~~~Type VII secretion systems protein EssD~~~
MHDMTKDIEYLTADYDNEKSSIQSVIDAIEGQDFLDVDTTMDDAVSDVSSLDEDGAISLTSSVVGPQGSKLMGYYQNELY
DYASQLDSKMKEIIDTPFIEDIDKAFKGITNVKLENILIKNGGGHGRDTYGASGKIAKGDAKKSDSDVYSIDEILKSDQE
FVKVIDQHYKEMKKEDKKLSKSDFEKMMTQGASCDYMTVAEAEELEEQKKKEEAIEIAALAGMVVLSCINPVAGAVAIGA
YSAYSAANAATGKNIVTGRKLSKEERIMEGLSLIPLPGMGFLKGAGKSLMKLGFKGGEKFAVKTGLQKTMQQAVSRISPK
MGMMKNSVLNQSRNFAQNTHVGQMLSNMRGQATHTVQQSRNWIGQQAQNVKRIVNNGLDKEIAHPFKQQLAPAGMGGIKF
AETTTLRNMGQNIKRAVTPQNHVTHGPKDSMVRSEGKHSISSHEMNSSKYVESPNYTKVEFGEHYARLRPKKLKANIEYT
TPTGHIYRTDHKGRIKEVYVDNLSLKDGDRNSHAQRTVGGEDRLPDDDGGHLIARMFGGSKDIDNLVAQSKFINRPFKEK
GHWYNLEKEWQEFLNSGKEVKNIKMEVKYSGNSQRPTIFKVEYEINGERNIRRILNK
>A0A0H2XFI6 ~~~~~~Type VII secretion system protein EsaE~~~
MKDVKRIDYFSYEELTILGGSKLPLVNFELFDPSNFEEAKAALIEKELVTENDKLTDAGFKVATLVREYISAIVNIRIND
MYFAPFSYEKDEYILLSRFKNNGFQIRIINKDIAWWSIVQSYPLLMRQEKSNDWDFKQIDDETLENLNNESIDTIGRVLE
IEIYNHQGDPQQSLYNIYEQNDLLFIRYPLKDKVLNVHIGVINTFIRELFGFDTDENHINKAEE
>Q2G181 ~~~essE~~~Type VII secretion system protein EsaE~~~
MKDVKRIDYFSYEELTILGGSKLPLVNFELFDPSNFEEAKAALIEKELVTENDKLTDAGFKVATLVREYISAIVNIRIND
MYFAPFSYEKDEYILLSRFKNNGFQIRIINKDIAWWSIVQSYPLLMRQEKSNDWDFKQIDDETLENLNNESIDTIGRVLE
IEIYNHQGDPQQSLYNIYEQNDLLFIRYPLKDKVLNVHIGVINTFIRELFGFDTDENHINKAEE
>Q2G178 ~~~essG~~~Type VII secretion system protein EsaG~~~
MTFEEKLSKIYNEIANEISSMIPVEWEKVYTMAYIDDGGGEVFFNYTKPGSDDLNYYTNIPKEYNISVQVFDDLWMDLYD
LFEELRDLFKEEDLEPWTSCEFDFTREGELKVSFDYIDWINSEFGQIGRQNYYKYRKFGILPETEYEINKVKEIEQYIKE
LEE
>P86325 3.1.1.1~~~~~~Carboxylesterase~~~
MEIVIRTGSGDVRGSKENGIAVFRGIPYAEPPVGAHRFTAPRPPRPWDGVRDATEFSATAPRPPYPEAIGALLIERFIPG
DDYLTLNVWTPDPNAVGLPVMVWIHGGAFTNGSGSEPVYDGAAFARDGVVFVSFNYRLGIIGFADLPDAPSNRGLLDQIA
ALEWVRDNIARFGGDPGNVTVFGESAGAMSVCTLMATPRARGLFRRAILQSGAGNMAVAAEDATTIAAVIAHRLGVEPTA
AALAHVPVAQLLDVQQQVAQEIQGAPDPAVWGERIAGGSVLLPFAPVIDGELLSQRPAEAIAGGAGHDVDLLFGTTTDEY
RLFLAPTGLLPFITSDYVTAHLAKSGLDADAAKAYTAEGRGEEPGDILASIITDQVFRIPALRIAESRVDAPARTFGYEF
AWRTPQLDGILGACHAVELPFVFRTLDRAASLVGTNPPEELAETVHNAWVRFATSGDPGWPAWNPETRSVMRFDHPVSEM
VTDPYPATRALWDGVPL
>Q53547 3.1.1.1~~~estB~~~Carboxylesterase 2~~~COG0400
MTEPLILQPAKPADACVIWLHGLGADRYDFMPVAEALQESLLTTRFVLPQAPTRPVTINGGYEMPSWYDIKAMSPARSIS
LEELEVSAKMVTDLIEAQKRTGIDASRIFLAGFSQGGAVVFHTAFINWQGPLGGVIALSTYAPTFGDELELSASQQRIPA
LCLHGQYDDVVQNAMGRSAFEHLKSRGVTVTWQEYPMGHEVLPQEIHDIGAWLAARLG
>I6XU97 3.1.1.1~~~~~~Esterase Rv0045c~~~COG0596
MLSDDELTGLDEFALLAENAEQAGVNGPLPEVERVQAGAISALRWGGSAPRVIFLHGGGQNAHTWDTVIVGLGEPALAVD
LPGHGHSAWREDGNYSPQLNSETLAPVLRELAPGAEFVVGMSLGGLTAIRLAAMAPDLVGELVLVDVTPSALQRHAELTA
EQRGTVALMHGEREFPSFQAMLDLTIAAAPHRDVKSLRRGVFHNSRRLDNGNWVWRYDAIRTFGDFAGLWDDVDALSAPI
TLVRGGSSGFVTDQDTAELHRRATHFRGVHIVEKSGHSVQSDQPRALIEIVRGVLDTR
>P37957 3.1.1.3~~~estA~~~Lipase EstA~~~COG1075
MKFVKRRIIALVTILMLSVTSLFALQPSAKAAEHNPVVMVHGIGGASFNFAGIKSYLVSQGWSRDKLYAVDFWDKTGTNY
NNGPVLSRFVQKVLDETGAKKVDIVAHSMGGANTLYYIKNLDGGNKVANVVTLGGANRLTTGKALPGTDPNQKILYTSIY
SSADMIVMNYLSRLDGARNVQIHGVGHIGLLYSSQVNSLIKEGLNGGGQNTN
>O33407 3.1.1.1~~~estA~~~Esterase EstA~~~
MIRMALKPLVAACLLASLSTAPQAAPSPYSTLVVFGDSLSDAGQFPDPAGPAGSTSRFTNRVGPTYQNGSGEIFGPTAPM
LLGNQLGIAPGDLAASTSPVNAQQGIADGNNWAVGGYRTDQIYDSITAANGSLIERDNTLLRSRDGYLVDRARQGLGADP
NALYYITGGGNDFLQGRILNDVQAQQAAGRLVDSVQALQQAGARYIVVWLLPDLGLTPATFGGPLQPFASQLSGTFNAEL
TAQLSQAGANVIPLNIPLLLKEGMANPASFGLAADQNLIGTCFSGNGCTMNPTYGINGSTPDPSKLLFNDSVHPTITGQR
LIADYTYSLLSAPWELTLLPEMAHGTLRAYQDELRSQWQADWENWQNVGQWRGFVGGGGQRLDFDSQDSAASGDGNGYNL
TLGGSYRIDEAWRAGVAAGFYRQKLEAGAKDSDYRMNSYMASAFVQYQENRWWADAALTGGYLDYDDLKRKFALGGGERS
EKGDTNGHLWAFSARLGYDIAQQADSPWHLSPFVSADYARVEVDGYSEKGASATALDYDDQKRSSKRLGAGLQGKYAFGS
DTQLFAEYAHEREYEDDTQDLTMSLNSLPGNRFTLEGYTPQDHLNRVSLGFSQKLAPELSLRGGYNWRKGEDDTQQSVSL
ALSLDF
>P22266 3.1.1.-~~~estA~~~Esterase~~~
MSSAMRKTTNSPVVRRLTAAAVALGSCLALAGPAGSAGAAPADPVPTVFFGDSYTANFGIAPVTNQDSERGWCFQAKENY
PAVATRSLADKGITLDVQADVSCGGALIHHFWEKQELPFGAGELPPQQDALKQDTQLTVGSLGGNTLGFNRILKQCSDEL
RKPSLLPGDPVDGDEPAAKCGEFFGTGDGKQWLDDQFERVGAELEELLDRIGYFAPDAKRVLVGYPRLVPEDTTKCLTAA
PGQTQLPFADIPQDALPVLDQIQKRLNDAMKKAAADGGADFVDLYAGTGANTACDGADRGIGGLLEDSQLELLGTKIPWY
AHPNDKGRDIQAKQVADKIEEILNR
>Q79F14 3.1.1.3~~~estB~~~Extracellular esterase EstB~~~COG1075
MKKVLMAFIICLSLILSVLAAPPSGAKAESVHNPVVLVHGISGASYNFFAIKNYLISQGWQSNKLYAIDFYDKTGNNLNN
GPQLASYVDRVLKETGAKKVDIVAHSMGGANTLYYIKYLGGGNKIQNVVTLGGANGLVSSTALPGTDPNQKILYTSIYSL
NDQIVINSLSRLQGARNIQLYGIGHIGLLSNSQVNGYIKEGLNGGGLNTN
>Q9KX40 3.1.1.-~~~estB~~~Esterase EstB~~~
MTAASLDPTAFSLDAASLAARLDAVFDQALRERRLVGAVAIVARHGEILYRRAQGLADREAGRPMREDTLFRLASVTKPI
VALAVLRLVARGELALDAPVTRWLPEFRPRLADGSEPLVTIHHLLTHTSGLGYWLLEGAGSVYDRLGISDGIDLRDFDLD
ENLRRLASAPLSFAPGSGWQYSLALDVLGAVVERATGQPLAAAVDALVAQPLGMRDCGFVSAEPERFAVPYHDGQPEPVR
MRDGIEVPLPEGHGAAVRFAPSRVFEPGAYPSGGAGMYGSADDVLRALEAIRANPGFLPETLADAARRDQAGVGAETRGP
GWGFGYLSAVLDDPAAAGTPQHAGTLQWGGVYGHSWFVDRALGLSVLLLTNTAYEGMSGPLTIALRDAVYAR
>Q9WYH1 3.1.1.1~~~estD~~~Esterase EstD~~~COG1073
MRLTVFLSLFLGVMVFGAFDQEAFLFVQHLTSENFESALNMCSNQVKAQLSVQSLSNIWNSLKAQLSDFREIAGYEKIIQ
AEYEIYNFTLKFDRGEISALVTMDREGKVAGLFFKQATKTEYELPDYVDPESFEEKDITVNGLPGKITIPKGSGPFPAVV
LVHGSGPNDMDETIGPNKIFKDIAYGLSSKGIIVLRYHKRTFVEKVDPTTLTVEKEVIEDALEAVKILKERKDVSRVYVL
GHSLGAMLTPEIAERSKADGVVMIAPPARPLEEVMEDQLKYLQSLGLASNVEETLNILEKLKRKEIPPDEFVLGAPAKYF
YDLRERDPASIAKRLTIPMLLIFGGRDYQVTEKDQEIWLKELSGRENVKILVFDDLNHLMISGEGKSTPVEYMKKGHVDK
RVIDEIARWMVK
>P22862 3.1.1.2~~~estF~~~Arylesterase~~~
MSTFVAKDGTQIYFKDWGSGKPVLFSHGWLLDADMWEYQMEYLSSRGYRTIAFDRRGFGRSDQPWTGNDYDTFADDIAQL
IEHLDLKEVTLVGFSMGGGDVARYIARHGSARVAGLVLLGAVTPLFGQKPDYPQGVPLDVFARFKTELLKDRAQFISDFN
APFYGINKGQVVSQGVQTQTLQIALLASLKATVDCVTAFAETDFRPDMAKIDVPTLVIHGDGDQIVPFETTGKVAAELIK
GAELKVYKDAPHGFAVTHAQQLNEDLLAFLKR
>Q07792 3.1.1.2~~~~~~Arylesterase~~~
MIRLLSLVLFFCLSAASQASEKLLVLGDSLSAGYQMPIEKSWPSLLPDALLEHGQDVTVINGSISGDTTGNGLARLPQLL
DQHTPDLVLIELGANDGLRGFPPKVITSNLSKMISLIKDSGANVVMMQIRVPPNYGKRYSDMFYDIYPKLAEHQQVQLMP
FFLEHVITKPEWMMDDGLHPKPEAQPWIAEFVAQELVKHL
>Q88QS0 3.1.1.1~~~estP~~~Esterase EstP~~~COG3240
MRKAPLLRFTLASLALACSQALAGPSPYSTLIVFGDSLADAGQFPDLVGGTPGARFTNRDADGNFAPVSPMILGGRLGVA
PGDLNPSTSVGIQPDGNNWAVGGYTTQQILDSITTTSETVIPPGNPNAGLVLRERPGYLANGLRADPNALYYLTGGGNDF
LQGLVNSPADAVAAGARLAASAQALQQGGARYIMVWLLPDLGQTPNFSGTPQQNPLSLLSAAFNQSLISQLGQIDAQIIP
LNIPLLLSEALASPSQFGLASDQNLVGTCYSGDSCVENPVYGINGTTPDPTKLLFNDSVHPTIAGQQLIADYAYSILAAP
WELTLLPEMAHASLRAHQDELRNQWQTPWQAVGQWQAFVASGAQDLDFDGQHSAASGDGRGYNLTVGGSYRLNDAWRLGL
AGGANRQKLEAGEQDSDYKLNSYMASAFAQYRQDRWWADAALTAGHLDYSDLKRTFALGVNDRSEKGDTDGEAWAMSGRL
GYNLAADTSNWQLAPFISADYARVKVDGYDEKSGRSTALGFDDQERTSRRLGVGLLGSVQVLPSTRLFAEVAQEHEFEDD
EQDVTMHLTSLPANDFTLTGYTPHSDLTRASLGVSHELVAGVHLRGNYNWRKSDELTQQGISVGVSVDF
>I6YF08 3.1.1.-~~~~~~Esterase Rv3036c~~~
MRYLIATAVLVAVVLVGWPAAGAPPSCAGLGGTVQAGQICHVHASGPKYMLDMTFPVDYPDQQALTDYITQNRDGFVNVA
QGSPLRDQPYQMDATSEQHSSGQPPQATRSVVLKFFQDLGGAHPSTWYKAFNYNLATSQPITFDTLFVPGTTPLDSIYPI
VQRELARQTGFGAAILPSTGLDPAHYQNFAITDDSLIFYFAQGELLPSFVGACQAQVPRSAIPPLAI
>P9WM39 3.1.1.-~~~~~~Esterase Rv1288~~~COG0627
MVSTHAVVAGETLSALALRFYGDAELYRLIAAASGIADPDVVNVGQRLIMPDFTRYTVVAGDTLSALALRFYGDAELNWL
IAAASGIADPDVVNVGQRLIMPDFTRYTVVAGDTLSALAARFYGDASLYPLIAAVNGIADPGVIDVGQVLVIFIGRSDGF
GLRIVDRNENDPRLWYYRFQTSAIGWNPGVNVLLPDDYRTSGRTYPVLYLFHGGGTDQDFRTFDFLGIRDLTAGKPIIIV
MPDGGHAGWYSNPVSSFVGPRNWETFHIAQLLPWIEANFRTYAEYDGRAVAGFSMGGFGALKYAAKYYGHFASASSHSGP
ASLRRDFGLVVHWANLSSAVLDLGGGTVYGAPLWDQARVSADNPVERIDSYRNKRIFLVAGTSPDPANWFDSVNETQVLA
GQREFRERLSNAGIPHESHEVPGGHVFRPDMFRLDLDGIVARLRPASIGAAAERAD
>Q8KQK1 3.1.1.113~~~estZ~~~Ethyl acetate hydrolase~~~
MSLNPDLAAYLQLVEAGRSAGKVLPMHALEADEARRQFEESSALIAGKADEPDCISDLSLTTRDGHTLPVRLYRPPQDDP
ALAGAALLYLHGGGYVVGSLDSHDTLCWNLAQDAGVPVIAVGYRLAPQWRFPTASDDALDAWRWLVEQAEALGIDAQRLA
VVGDSVGGSLATILANQLAAQRELAAPRLQVMIYPVTDASCRRPSVQRYGSGYLLEAQTLEWFYQQYATVPADRLDPRFS
PLLGSVASNSAPALMLIAECDPLHDQGVAYARHLEQAGVAVQLAVIPGVTHDFMRMGSIIEEADEGLVMVVEALQQHL
>O32232 3.1.1.1~~~est~~~Carboxylesterase~~~COG1647
MKVVTPKPFTFKGGDKAVLLLHGFTGNTADVRMLGRYLNERGYTCHAPQYEGHGVPPEELVHTGPEDWWKNVMDGYEYLK
SEGYESIAACGLSLGGVFSLKLGYTVPIKGIVPMCAPMHIKSEEVMYQGVLSYARNYKKFEGKSPEQIEEEMKEFEKTPM
NTLKALQDLIADVRNNVDMIYSPTFVVQARHDHMINTESANIIYNEVETDDKQLKWYEESGHVITLDKERDLVHQDVYEF
LEKLDW
>Q06174 3.1.1.1~~~est~~~Carboxylesterase~~~
MKIVPPKPFFFEAGERAVLLLHGFTGNSADVRMLGRFLESKGYTCHAPIYKGHGVPPEELVHTGPDDWWQDVMNGYEFLK
NKGYEKIAVAGLSLGGVFSLKLGYTVPIEGIVTMCAPMYIKSEETMYEGVLEYAREYKKREGKSEEQIEQEMEKFKQTPM
KTLKALQELIADVRDHLDLIYAPTFVVQARHDEMINPDSANIIYNEIESPVKQIKWYEQSGHVITLDQEKDQLHEDIYAF
LESLDW
>Q9HZY8 3.1.1.1~~~tesA~~~Esterase TesA~~~
MRALLLSGCLALVLLTQQAAAQTLLVVGDSISAALGLDTSQGWVALLQKRLADEGYDYRVVNASISGDTSAGGLARLPAL
LAEEKPALVVIELGGNDGLRGMAPAQLQQNLASMAQKARAEGAKVLLLGIQLPPNYGPRYIEAFSRVYGAVAAQEKTALV
PFFLEGVGGVQGMMQADGIHPALAAQPRLLENVWPTLKPLL
>Q8DZR0 ~~~~~~ESAT-6-like protein SAG1039~~~
MAQIKLTPEELRSSAQKYTAGSQQVTEVLNLLTQEQAVIDENWDGSTFDSFEAQFNELSPKITEFAQLLEDINQQLLKVA
DIIEQTDADIASQISG
>Q6NJ54 ~~~esxA~~~ESAT-6-like protein EsxA~~~
MEKIKYGFGEIEAAASDIQSTSGRINSLLEDLKAHIRPMAAAWEGESAQAYNEAQQQWDSSAAELNTILSTISNTVRQGN
DRMSEVNRMAAASWS
>Q50206 ~~~esxA~~~6 kDa early secretory antigenic target homolog~~~COG4842
MIQAWHFPALQGAVNELQGSQSRIDALLEQCQESLTKLQSSWHGSGNESYSSVQRRFNQNTEGINHALGDLVQAINHSAE
TMQQTEAGVMSMFTG
>A0QNJ6 ~~~esxA~~~ESAT-6-like protein EsxA~~~COG4842
MTEQVWNFAGIEGGASEIHGAVSTTAGLLDEGKASLTTLASAWGGTGSEAYQAVQARWDSTSNELNLALQNLAQTISEAG
QTMAQTEAGVTGMFA
>P9WNK7 ~~~esxA~~~6 kDa early secretory antigenic target~~~COG4842
MTEQQWNFAGIEAAASAIQGNVTSIHSLLDEGKQSLTKLAAAWGGSGSEAYQGVQQKWDATATELNNALQNLARTISEAG
QAMASTEGNVTGMFA
>A0A0H2XI99 ~~~esxA~~~Type VII secretion system extracellular protein A~~~
MAMIKMSPEEIRAKSQSYGQGSDQIRQILSDLTRAQGEIAANWEGQAFSRFEEQFQQLSPKVEKFAQLLEEIKQQLNSTA
DAVQEQDQQLSNNFGLQ
>Q2G189 ~~~esxA~~~Type VII secretion system extracellular protein A~~~COG4842
MAMIKMSPEEIRAKSQSYGQGSDQIRQILSDLTRAQGEIAANWEGQAFSRFEEQFQQLSPKVEKFAQLLEEIKQQLNSTA
DAVQEQDQQLSNNFGLQ
>P0C046 ~~~esxA~~~Type VII secretion system extracellular protein A~~~
MAMIKMSPEEIRAKSQSYGQGSDQIRQILSDLTRAQGEIAANWEGQAFSRFEEQFQQLSPKVEKFAQLLEEIKQQLNSTA
DAVQEQDQQLSNNFGLQ
>Q7A7S4 ~~~esxA~~~Type VII secretion system extracellular protein A~~~
MAMIKMSPEEIRAKSQSYGQGSDQIRQILSDLTRAQGEIAANWEGQAFSRFEEQFQQLSPKVEKFAQLLEEIKQQLNSTA
DAVQEQDQQLSNNFGLQ
>D1A4H1 ~~~esxA~~~ESAT-6-like protein EsxA~~~COG4842
MSDYTRANFGGLSEGEAQFSMTARALLDELTDLEGKLRAKLDRWDGDAQAAYWNYQKEWDAAAKDMQNVVAQLGVAIREA
HDNYQAAERANTSIWAG
>Q6NJ55 ~~~esxB~~~ESAT-6-like protein EsxB~~~
MSQGFKTEADVMRNTAHRVDDTNQEVSAELSRLRSIVDGVRASWEGTAQVSFDNLMQRWDASAKGLQDALQSISDNIRGN
ATSFENVEADNQSAFSAVGGQGLAL
>O33084 ~~~esxB~~~ESAT-6-like protein EsxB~~~COG4842
MAEMITEAAILTQQAAQFDQIASGLSQERNFVDSIGQSFQNTWEGQAASAALGALGRFDEAMQDQIRQLESIVDKLNRSG
GNYTKTDDEANQLLSSKMNF
>A0QNJ5 ~~~esxB~~~ESAT-6-like protein EsxB~~~COG4842
MAAMNTDAAVLAKEAANFERISGELKGVIAQVESTGSALAAQMVGQAGTAAQAALARFHEAAAKQVQELNEISANIHTSG
TQYTSTDEDQAGTLASSMNI
>P9WNK5 ~~~esxB~~~ESAT-6-like protein EsxB~~~COG4842
MAEMKTDAATLAQEAGNFERISGDLKTQIDQVESTAGSLQGQWRGAAGTAAQAAVVRFQEAANKQKQELDEISTNIRQAG
VQYSRADEEQQQALSSQMGF
>A0A0H2XIE9 ~~~esxB~~~Type VII secretion system extracellular protein B~~~
MGGYKGIKADGGKVDQAKQLAAKTAKDIEACQKQTQQLAEYIEGSDWEGQFANKVKDVLLIMAKFQEELVQPMADHQKAI
DNLSQNLAKYDTLSIKQGLDRVNP
>Q2G182 ~~~esxB~~~Type VII secretion system extracellular protein B~~~
MGGYKGIKADGGKVDQAKQLAAKTAKDIEACQKQTQQLAEYIEGSDWEGQFANKVKDVLLIMAKFQEELVQPMADHQKAI
DNLSQNLAKYDTLSIKQGLDRVNP
>P0C047 ~~~esxB~~~Type VII secretion system extracellular protein B~~~
MGGYKGIKADGGKVDQAKQLAAKTAKDIEACQKQTQQLAEYIEGSDWEGQFANKVKDVLLIMAKFQEELVQPMADHQKAI
DNLSQNLAKYDTLSIKQGLDRVNP
>D1A4H0 ~~~esxB~~~ESAT-6-like protein~~~COG4842
MAPQSAVDRAAMAQAAQDIEQSANAIRGMQNQLASAKDQLRSHWEGDASMAFEAVFNRFNEDFSRVLKALDGMHESLVQT
RITYEAREEAAQQSVNRVQALLNG
>P9WNI1 ~~~esxC~~~ESAT-6-like protein EsxC~~~
MSDQITYNPGAVSDFASDVGSRAGQLHMIYEDTASKTNALQEFFAGHGAQGFFDAQAQMLSGLQGLIETVGQHGTTTGHV
LDNAIGTDQAIAGLF
>A0A0H2XIK2 ~~~esxC~~~Type VII secretion system extracellular protein C~~~
MNFNDIETMVKSKFKDIKKHAEEIAHEIEVRSGYLRKAEQYKRLEFNLSFALDDIESTAKDVQTAKSSANKDSVTVKGKA
PNTLYIEKRNLMKQKLEMLGEDIDKNKESLQKAKEIAGEKASEYFNKAMN
>P0C051 ~~~esxC~~~Type VII secretion system extracellular protein C~~~
MNFNDIETMVKSKFKDIKKHAEEIAHEIEVRSGYLRKAEQYKRLEFNLSFALDDIESTAKDVQTAKSSANKDSVTVKGKA
PNTLYIEKRNLMKQKLEMLGEDIDKNKESLQKAKEIAGEKASEYFNKAMN
>O05453 ~~~esxD~~~ESAT-6-like protein EsxD~~~
MADTIQVTPQMLRSTANDIQANMEQAMGIAKGYLANQENVMNPATWSGTGVVASHMTATEITNELNKVLTGGTRLAEGLV
QAAALMEGHEADSQTAFQALFGASHGS
>A0A0H2XDF9 ~~~esxD~~~Type VII secretion system extracellular protein D~~~
MTLSGKISVKAETIAHVVKELESISQKYDEIAQNFGKIAQLNYYSSEKAAHSMENGYSSAATVISGLKGPLSTLGGGVMN
SAQKFFEADEHWGTEFAKLYYNIEG
>P9WNH8 ~~~esxE~~~ESAT-6-like protein EsxE~~~
MDPTVLADAVARMAEFGRHVEELVAEIESLVTRLHVTWTGEGAAAHAEAQRHWAAGEAMMRQALAQLTAAGQSAHANYTG
AMATNLGMWS
>P9WNH9 ~~~esxE~~~ESAT-6-like protein EsxE~~~COG4842
MDPTVLADAVARMAEFGRHVEELVAEIESLVTRLHVTWTGEGAAAHAEAQRHWAAGEAMMRQALAQLTAAGQSAHANYTG
AMATNLGMWS
>P9WNH6 ~~~esxF~~~ESAT-6-like protein EsxF~~~
MGADDTLRVEPAVMQGFAASLDGAAEHLAVQLAELDAQVGQMLGGWRGASGSAYGSAWELWHRGAGEVQLGLSMLAAAIA
HAGAGYQHNETASAQVLREVGGG
>P9WNH7 ~~~esxF~~~ESAT-6-like protein EsxF~~~COG4842
MGADDTLRVEPAVMQGFAASLDGAAEHLAVQLAELDAQVGQMLGGWRGASGSAYGSAWELWHRGAGEVQLGLSMLAAAIA
HAGAGYQHNETASAQVLREVGGG
>A0QQ43 ~~~esxG~~~ESAT-6-like protein EsxG~~~
MSLLDAHIPQLIASEANFGAKAALMRSTIAQAEQAAMSSQAFHMGEASAAFQAAHARFVEVSAKVNALLDIAQLNIGDAA
SSYVAQDAAAASTYTGI
>O53692 ~~~esxG~~~ESAT-6-like protein EsxG~~~
MSLLDAHIPQLVASQSAFAAKAGLMRHTIGQAEQAAMSAQAFHQGESSAAFQAAHARFVAAAAKVNTLLDVAQANLGEAA
GTYVAADAAAASTYTGF
>A0QQ44 ~~~esxH~~~ESAT-6-like protein EsxH~~~COG4842
MSQIMYNYPAMLAHAAEMNTYSGALHAVGADIAAEQHALASAWQGDTGMTYQAWQAQWNQAMEELVRAYRAMATTHEQNT
MAMSARDQAEGAKWG
>P9WNK3 ~~~esxH~~~ESAT-6-like protein EsxH~~~COG4842
MSQIMYNYPAMLGHAGDMAGYAGTLQSLGAEIAVEQAALQSAWQGDTGITYQAWQAQWNQAMEDLVRAYHAMSSTHEANT
MAMMARDTAEAAKWGG
>P0DOA6 ~~~esxI~~~ESAT-6-like protein EsxI~~~
MTINYQFGDVDAHGAMIRAQAGSLEAEHQAIISDVLTASDFWGGAGSAACQGFITQLGRNFQVIYEQANAHGQKVQAAGN
NMAQTDSAVGSSWA
>P9WNJ9 ~~~esxJ~~~ESAT-6-like protein EsxJ~~~COG4842
MASRFMTDPHAMRDMAGRFEVHAQTVEDEARRMWASAQNISGAGWSGMAEATSLDTMTQMNQAFRNIVNMLHGVRDGLVR
DANNYEQQEQASQQILSS
>P9WNJ7 ~~~esxK~~~ESAT-6-like protein EsxK~~~COG4842
MASRFMTDPHAMRDMAGRFEVHAQTVEDEARRMWASAQNISGAGWSGMAEATSLDTMAQMNQAFRNIVNMLHGVRDGLVR
DANNYEQQEQASQQILSS
>P9WNJ5 ~~~esxL~~~ESAT-6-like protein EsxL~~~
MTINYQFGDVDAHGAMIRAQAGLLEAEHQAIIRDVLTASDFWGGAGSAACQGFITQLGRNFQVIYEQANAHGQKVQAAGN
NMAQTDSAVGSSWA
>B2HSU3 ~~~esxM~~~ESAT-6-like protein EsxM~~~COG4842
MTARFMTDPHAMRDMAGRFEMHAQTVEDEARKMWASSQNIAGAGWSGMASATSLDTMGQMNTAFRNIVNMLHSVRDGLVR
DANNYEQQEQASQQVLRG
>P9WNJ3 ~~~esxN~~~ESAT-6-like protein EsxN~~~
MTINYQFGDVDAHGAMIRAQAASLEAEHQAIVRDVLAAGDFWGGAGSVACQEFITQLGRNFQVIYEQANAHGQKVQAAGN
NMAQTDSAVGSSWA
>P9WNI7 ~~~esxO~~~ESAT-6-like protein EsxO~~~
MTINYQFGDVDAHGAMIRAQAGLLEAEHQAIVRDVLAAGDFWGGAGSVACQEFITQLGRNFQVIYEQANAHGQKVQAAGN
NMAQTDSAVGSSWA
>P9WNI5 ~~~esxP~~~ESAT-6-like protein EsxP~~~COG4842
MATRFMTDPHAMRDMAGRFEVHAQTVEDEARRMWASAQNISGAGWSGMAEATSLDTMAQMNQAFRNIVNMLHGVRDGLVR
DANNYEQQEQASQQILSS
>P9WNI9 ~~~esxR~~~ESAT-6-like protein EsxR~~~COG4842
MSQIMYNYPAMMAHAGDMAGYAGTLQSLGADIASEQAVLSSAWQGDTGITYQGWQTQWNQALEDLVRAYQSMSGTHESNT
MAMLARDGAEAAKWGG
>Q6MX18 ~~~esxS~~~ESAT-6-like protein EsxS~~~
MSLLDAHIPQLIASHTAFAAKAGLMRHTIGQAEQQAMSAQAFHQGESAAAFQGAHARFVAAAAKVNTLLDIAQANLGEAA
GTYVAADAAAASSYTGF
>O06261 ~~~esxT~~~ESAT-6-like protein EsxT~~~
MNADPVLSYNFDAIEYSVRQEIHTTAARFNAALQELRSQIAPLQQLWTREAAAAYHAEQLKWHQAASALNEILIDLGNAV
RHGADDVAHADRRAAGAWAR
>I6YC53 ~~~esxT~~~ESAT-6-like protein EsxT~~~COG4842
MNADPVLSYNFDAIEYSVRQEIHTTAARFNAALQELRSQIAPLQQLWTREAAAAYHAEQLKWHQAASALNEILIDLGNAV
RHGADDVAHADRRAAGAWAR
>Q7D5J1 ~~~esxU~~~ESAT-6-like protein EsxU~~~
MSTPNTLNADFDLMRSVAGITDARNEEIRAMLQAFIGRMSGVPPSVWGGLAAARFQDVVDRWNAESTRLYHVLHAIADTI
RHNEAALREAGQIHARHIAAAGGDL
>I6Y3I6 ~~~esxU~~~ESAT-6-like protein EsxU~~~COG4842
MSTPNTLNADFDLMRSVAGITDARNEEIRAMLQAFIGRMSGVPPSVWGGLAAARFQDVVDRWNAESTRLYHVLHAIADTI
RHNEAALREAGQIHARHIAAAGGDL
>P0DOA7 ~~~esxV~~~ESAT-6-like protein EsxV~~~
MTINYQFGDVDAHGAMIRAQAGSLEAEHQAIISDVLTASDFWGGAGSAACQGFITQLGRNFQVIYEQANAHGQKVQAAGN
NMAQTDSAVGSSWA
>P9WNI3 ~~~esxW~~~ESAT-6-like protein EsxW~~~COG4842
MTSRFMTDPHAMRDMAGRFEVHAQTVEDEARRMWASAQNISGAGWSGMAEATSLDTMTQMNQAFRNIVNMLHGVRDGLVR
DANNYEQQEQASQQILSS
>P09331 3.4.21.-~~~eta~~~Exfoliative toxin A~~~
MNNSKIISKVLLSLSLFTVGASAFVIQDELMQKNHAKAEVSAEEIKKHEEKWNKYYGVNAFNLPKELFSKVDEKDRQKYP
YNTIGNVFVKGQTSATGVLIGKNTVLTNRHIAKFANGDPSKVSFRPSINTDDNGNTETPYGEYEVKEILQEPFGAGVDLA
LIRLKPDQNGVSLGDKISPAKIGTSNDLKDGDKLELIGYPFDHKVNQMHRSEIELTTLSRGLRYYGFTVPGNSGSGIFNS
NGELVGIHSSKVSHLDREHQINYGVGIGNYVKRIINEKNE
>P09332 3.4.21.-~~~etb~~~Exfoliative toxin B~~~
MDKNMFKKIILAASIFTISLPVIPFESTLQAKEYSAEEIRKLKQKFEVPPTDKELYTHITDNARSPYNSVGTVFVKGSTL
ATGVLIGKNTIVTNYHVAREAAKNPSNIIFTPAQNRDAEKNEFPTPYGKFEAEEIKESPYGQGLDLAIIKLKPNEKGESA
GDLIQPANIPDHIDIQKGDKYSLLGYPYNYSAYSLYQSQIEMFNDSQYFGYTEVGNSGSGIFNLKGELIGIHSGKGGQHN
LPIGVFFNRKISSLYSVDNTFGDTLGNDLKKRAKLDK
>P85148 ~~~~~~Bacteriocin E50-52~~~
TTKNYGNGVCNSVNWCQCGNVWASCNLATGCAAWLCKLA
>P85147 ~~~~~~Enterocin E-760~~~
NRWYCNSAAGGVGGAAVCGLAGYVGEAKENIAGEVRKGWGMAGGFTHNKACKSFPGSGWASG
>P86183 ~~~entHF~~~Enterocin-HF~~~
MEKLTVKEMSQVVGGKYYGNGVSCNKKGCSVDWGKAIGIIGNNAAANLTTGGKAGWKG
>O30434 ~~~entP~~~Bacteriocin enterocin-P~~~
MRKKLFSLALIGIFGLVVTNFGTKVDAATRSYGNGVYCNNSKCWVNWGEAKENIAGIVISGWASGLAGMGH
>G3KIM6 ~~~acrA~~~Acryloyl-CoA reductase electron transfer subunit beta~~~
MAFNSADINSFRDIWVFCEQREGKLINTDFELISEGRKLADERGSKLVGILLGHEVEEIAKELGGYGADKVIVCDHPELK
FYTTDAYAKVLCDVVMEEKPEVILIGATNIGRDLGPRCAARLHTGLTADCTHLDIDMNKYVDFLSTSSTLDISSMTFPME
DTNLKMTRPAFGGHLMATIICPRFRPCMSTVRPGVMKKAEFSQEMAQACQVVTRHVNLSDEDLKTKVINIVKETKKIVDL
IGAEIIVSVGRGISKDVQGGIALAEKLADAFGNGVVGGSRAVIDSGWLPADHQVGQTGKTVHPKVYVALGISGAIQHKAG
MQDSELIIAVNKDETAPIFDCADYGITGDLFKIVPMMIDAIKEGKNA
>P53571 ~~~etfA~~~Electron transfer flavoprotein subunit alpha~~~
MSKILVIAEHRRNDLRPVSLELIGAANGLKKSGEDKVVVAVIGSQADAFVPALSVNGVDELVVVKGSSIDFDPDVFEASV
SALIAAHNPSVVLLPHSVDSLGYASSLASKTGYGFATDVYIVEYQGDELVATRGGYNQKVNVEVDFPGKSTVVLTIRPSV
FKPLEGAGSPVVSNVDAPSVQSRSQNKDYVEVGGGNDIDITTVDFIMSIGRGIGEETNVEQFRELADEAGATLCCSRPIA
DAGWLPKSRQVGQSGKVVGSCKLYVAMGISGSIQHMAGMKHVPTIIAVNTDPGASIFTIAKYGIVADIFDIEEELKAQLA
A
>P9WNG9 ~~~etfA~~~Electron transfer flavoprotein subunit alpha~~~COG2025
MAEVLVLVEHAEGALKKVSAELITAARALGEPAAVVVGVPGTAAPLVDGLKAAGAAKIYVAESDLVDKYLITPAVDVLAG
LAESSAPAGVLIAATADGKEIAGRLAARIGSGLLVDVVDVREGGVGVHSIFGGAFTVEAQANGDTPVITVRAGAVEAEPA
AGAGEQVSVEVPAAAENAARITAREPAVAGDRPELTEATIVVAGGRGVGSAENFSVVEALADSLGAAVGASRAAVDSGYY
PGQFQVGQTGKTVSPQLYIALGISGAIQHRAGMQTSKTIVAVNKDEEAPIFEIADYGVVGDLFKVAPQLTEAIKARKG
>P38974 ~~~etfA~~~Electron transfer flavoprotein subunit alpha~~~
MAVLLLGEVTNGALNRDATAKAVAAVKALGDVTVLCAGASAKAAAEEAAKIAGVAKVLVAEDALYGHRLAEPTAALIVGL
AGDYSHIAAPATTDAKNVMPRVAALLDVMVLSDVSAILDADTFERPIYAGNAIQVVKSKDAKKVFTIRTASFDAAGEGGT
APVTETAAAADPGLSSWVADEVAESDRPELTSARRVVSGGRGLGSKESFAIIEELADKLGAAVGASRAAVDSGYAPNDWQ
VGQTGKVVAPELYVAVGISGAIQHLAGMKDSKVIVAINKDEEAPIFQIADYGLVGDLFSVVPELTGKL
>Q0AZ33 ~~~etfA~~~Electron transfer flavoprotein subunit alpha~~~COG2025
MAGTWIFVEQRDGNIRKVTFEMLSEAKKFGDEVAAVVFGKGVEALAPEFAKYGADKVYVVEDDVFANYNTGAYVAQMVAM
INEFKPNAVLFAHTFNGRDFASRLAQKLQLGLATDAIKVEVSAGKGVFTRAIYAGKALAKVEVAGEPVLGTIRPGVCEVG
NTAGAGAVVKPAVAATAADVYQTVKSFVPTVSARPELTEADVVVSGGRGCKGPDGIKLVEQLADLLGAAVGGSRASIDSG
WLGHELQVGQTGKVVNPNLYVAAGISGAIQHLAGMSSSKFIAAINTDTEAPIFNVSDFGVVADLFKVIPTLVSELKK
>G3KIM7 ~~~acrB~~~Acryloyl-CoA reductase electron transfer subunit gamma~~~
MRIYVCVKQVPDTSGKVAVNPDGTLNRASMAAIINPDDMSAIEQALKLKDETGCQVTALTMGPPPAEGMLREIIAMGADD
GVLISAREFGGSDTFATSQIISAAIHKLGLSNEDMIFCGRQAIDGDTAQVGPQIAEKLSIPQVTYGAGIKKSGDLVLVKR
MLEDGYMMIEVETPCLITCIQDKAVKPRYMTLNGIMECYSKPLLVLDYEALKDEPLIELDTIGLKGSPTNIFKSFTPPQK
GVGVMLQGTDKEKVEDLVDKLMQKHVI
>P53570 ~~~etfB~~~Electron transfer flavoprotein subunit beta~~~
MKILVAVKQTAALEEDFEIREDGMDVDEDFMMYDLNEWDDFSLEEAMKIKESSDTDVEVVVVSVGPDRVDESLRKCLAKG
ADRAVRVWDDAAEGSDAIVVGRILTEVIKKEAPDMVFAGVQSSDQAYASTGISVASYLNWPHAAVVADLQYKPGDNKAVI
RRELEGGMLQEVEINCPAVLTIQLGINKPRYASLRGIKQAATKPIEEVSLADIGLSANDVGAAQSMSRVRRMYIPEKGRA
TMIEGTISEQAAKIIQIINEFKGA
>P64098 ~~~etfB~~~Electron transfer flavoprotein subunit beta~~~
MTNIVVLIKQVPDTWSERKLTDGDFTLDREAADAVLDEINERAVEEALQIREKEAADGIEGSVTVLTAGPERATEAIRKA
LSMGADKAVHLKDDGMHGSDVIQTGWALARALGTIEGTELVIAGNESTDGVGGAVPAIIAEYLGLPQLTHLRKVSIEGGK
ITGERETDEGVFTLEATLPAVISVNEKINEPRFPSFKGIMAAKKKEVTVLTLAEIGVESDEVGLANAGSTVLASTPKPAK
TAGEKVTDEGEGGNQIVQYLVAQKII
>P9WNG7 ~~~etfB~~~Electron transfer flavoprotein subunit beta~~~COG2086
MTNIVVLIKQVPDTWSERKLTDGDFTLDREAADAVLDEINERAVEEALQIREKEAADGIEGSVTVLTAGPERATEAIRKA
LSMGADKAVHLKDDGMHGSDVIQTGWALARALGTIEGTELVIAGNESTDGVGGAVPAIIAEYLGLPQLTHLRKVSIEGGK
ITGERETDEGVFTLEATLPAVISVNEKINEPRFPSFKGIMAAKKKEVTVLTLAEIGVESDEVGLANAGSTVLASTPKPAK
TAGEKVTDEGEGGNQIVQYLVAQKII
>P38975 ~~~etfB~~~Electron transfer flavoprotein subunit beta~~~
MKVLVPVKRLIDYNVKARVKSDGSGVDLANVKMSMNPFDEIAVEEAIRLKEKGQAEEIIAVSIGVKQAAETLRTALAMGA
DRAILVVAADDVQQDIEPLAVAKILAAVARAEGTELIIAGKQAIDNDMNATGQMLAAILGWAQATFASKVEIEGAKAKVT
REVDGGLQTIAVSLPAVVTADLRLNEPRYASLPNIMKAKKKPLDEKTAADYGVDVAPRLEVVSVREPEGRKAGIKVGSVD
ELVGKLKEAGVI
>Q0AZ34 ~~~etfB~~~Electron transfer flavoprotein subunit beta~~~COG2086
MNLKVLVCVKQTFDTEAKIELKDGKIADAGINLIINPYDEVAVEGAIQLKEKGVAKEIVVVAAGSDKAMDAIRTALAMGA
DRGILVQQDTAADEFARAVALAEAIKGENPDIILAGHVAADDGSSQVPTRVAEILGLPHVNVITAVEIAGGKATCTSEAD
GGTQVTEVSLPAVISSQVSWNEPRYPSMKGIMAAKKKPVATAAAAAAESKVKILEFSLPPAKAAGIKIEDEPEVCATKLA
EWMKNTVKVEVK
>Q9HZP5 1.5.5.1~~~~~~Electron transfer flavoprotein-ubiquinone oxidoreductase~~~
MEREYMEFDVVIVGAGPAGLSAACRLKQKAAEAGQEISVCVVEKGSEVGAHILSGAVFEPRALNELFPDWKELGAPLNTP
VTGDDIYVLKSAESATKVPNFFVPKTMHNEGNYIISLGNLCRWLAQQAEGLGVEIYPGFAAQEALIDENGVVRGIVTGDL
GVDREGNPKEGYYTPGMELRAKYTLFAEGCRGHIGKQLIKKYNLDSEADAQHYGIGIKEIWDIDPSKHKPGLVVHTAGWP
LNDENTGGSFLYHLENNQVFVGLIIDLSYSNPHLSPFDEFQRYKHHPVVKQYLEGGKRVAYGARAICKGGLNSLPKMVFP
GGALIGCDLGTLNFAKIKGSHTAMKSGMLAADAIAEALAAGREGGDELSSYVDAFKASWLYDELFRSRNFGAAIHKFGAI
GGGAFNFIDQNIFGGKIPVTLHDDKPDYACLKKASEAPKIDYPKPDGKLSFDKLSSVFLSNTNHEEDQPIHLKLADASIP
IEKNLPLYDEPAQRYCPAGVYEVVANDDGSKRFQINAQNCVHCKTCDIKDPAQNITWVAPEGTGGPNYPNM
>Q0AZ32 1.18.-.-~~~~~~EtfAB:quinone oxidoreductase~~~COG0247
MIFGFHPLEGGYDVPMRVVGANIPYEWLIYVIMLIPVSVFLFGFWKKLEVWLLAKGEIHRNDKIAQRIWSWFVFSFAQAR
VIRKPLAGWMHAFLFWGFLVLFLAAGIDAMHNMISWPHLEGNFYIGFSWVVDVLGFLALIGVMVLGFVRYFQKPERLNDT
KSSDGWIILLIFAILLTGYFIEGLRIAAQIKLSTTMQQIAYERAASPFGWMFASFFGSMSVDAMLMWHRLLWWFHMAIAF
LFIALVPFTKLWHIFASMLNYTFRDLEPSANRMVYNIEEAETFGVENIEDFGWKDLLDLDSCIRCGRCQENCPAYNTGKH
LNPKITLIQNMKAHLDAKAPYLLAAKASGAEVEEMAMTEEAAAEEVNPMEQSLLYDVVGSETIWDCTNCRACMEHCPMFI
EHIPKIVEMRRNLVMWQGDMPGEAQMAFTNMERNYNPWGVGWAGRAGWLDERGVREMVNLLPEDGKEFEYLLYAGCAVSF
DDRYKRVGEALVRLLNKAGVSFGYLGTEEYCCGDSARRLGNEYLYQTLVSQNLESFNNYGVKKIIVVCPHGYTALKNEYP
QMGGNYEVYHYTEILAKLVAEGKLKPSKPLGVKMTYHDSCFLGRHNGVYDQPRNVLKAAGGQVIEIEKAKEFGFCCGAGG
GRMWLEEEAVLKDGIQYKRINDTRTDQLLVPNPEMIVTNCPFCLTMIADGVKAAEAEESTKVFDVAEVLWKAME
>Q7TVI2 1.14.13.-~~~ethA~~~FAD-containing monooxygenase EthA~~~
MTEHLDVVIVGAGISGVSAAWHLQDRCPTKSYAILEKRESMGGTWDLFRYPGIRSDSDMYTLGFRFRPWTGRQAIADGKP
ILEYVKSTAAMYGIDRHIRFHHKVISADWSTAENRWTVHIQSHGTLSALTCEFLFLCSGYYNYDEGYSPRFAGSEDFVGP
IIHPQHWPEDLDYDAKNIVVIGSGATAVTLVPALADSGAKHVTMLQRSPTYIVSQPDRDGIAEKLNRWLPETMAYTAVRW
KNVLRQAAVYSACQKWPRRMRKMFLSLIQRQLPEGYDVRKHFGPHYNPWDQRLCLVPNGDLFRAIRHGKVEVVTDTIERF
TATGIRLNSGRELPADIIITATGLNLQLFGGATATIDGQQVDITTTMAYKGMMLSGIPNMAYTVGYTNASWTLKADLVSE
FVCRLLNYMDDNGFDTVVVERPGSDVEERPFMEFTPGYVLRSLDELPKQGSRTPWRLNQNYLRDIRLIRRGKIDDEGLRF
AKRPAPVGV
>P9WNF9 1.14.13.-~~~ethA~~~FAD-containing monooxygenase EthA~~~COG2072
MTEHLDVVIVGAGISGVSAAWHLQDRCPTKSYAILEKRESMGGTWDLFRYPGIRSDSDMYTLGFRFRPWTGRQAIADGKP
ILEYVKSTAAMYGIDRHIRFHHKVISADWSTAENRWTVHIQSHGTLSALTCEFLFLCSGYYNYDEGYSPRFAGSEDFVGP
IIHPQHWPEDLDYDAKNIVVIGSGATAVTLVPALADSGAKHVTMLQRSPTYIVSQPDRDGIAEKLNRWLPETMAYTAVRW
KNVLRQAAVYSACQKWPRRMRKMFLSLIQRQLPEGYDVRKHFGPHYNPWDQRLCLVPNGDLFRAIRHGKVEVVTDTIERF
TATGIRLNSGRELPADIIITATGLNLQLFGGATATIDGQQVDITTTMAYKGMMLSGIPNMAYTVGYTNASWTLKADLVSE
FVCRLLNYMDDNGFDTVVVERPGSDVEERPFMEFTPGYVLRSLDELPKQGSRTPWRLNQNYLRDIRLIRRGKIDDEGLRF
AKRPAPVGV
>Q7TVI1 ~~~ethR~~~HTH-type transcriptional regulator EthR~~~
MTTSAASQASLPRGRRTARPSGDDRELAILATAENLLEDRPLADISVDDLAKGAGISRPTFYFYFPSKEAVLLTLLDRVV
NQADMALQTLAENPADTDRENMWRTGINVFFETFGSHKAVTRAGQAARATSVEVAELWSTFMQKWIAYTAAVIDAERDRG
AAPRTLPAHELATALNLMNERTLFASFAGEQPSVPEARVLDTLVHIWVTSIYGENR
>A0R666 ~~~ethR~~~HTH-type transcriptional regulator EthR~~~COG1309
MTTASQTRTPRGRRSARPSGDDREAAILATAQRLLETKKFAEISVDDLAKGAGISRPTFYFYFPSKEAVLLSLIDPLIKR
ADSGFDNAVESMPADPQRAIRRGIEIFFNSFGSHPATARAGTEALKSSPEFKEFWSGLMQKWIAATAALITAERERGAAP
DTIPALDLATSLNLMNERTMMAALADEQPGVAPEKVVATLTHIWLNSIYGTLPVGTA
>P9WMC0 ~~~ethR~~~HTH-type transcriptional regulator EthR~~~
MTTSAASQASLPRGRRTARPSGDDRELAILATAENLLEDRPLADISVDDLAKGAGISRPTFYFYFPSKEAVLLTLLDRVV
NQADMALQTLAENPADTDRENMWRTGINVFFETFGSHKAVTRAGQAARATSVEVAELWSTFMQKWIAYTAAVIDAERDRG
AAPRTLPAHELATALNLMNERTLFASFAGEQPSVPEARVLDTLVHIWVTSIYGENR
>P9WMC1 ~~~ethR~~~HTH-type transcriptional regulator EthR~~~COG1309
MTTSAASQASLPRGRRTARPSGDDRELAILATAENLLEDRPLADISVDDLAKGAGISRPTFYFYFPSKEAVLLTLLDRVV
NQADMALQTLAENPADTDRENMWRTGINVFFETFGSHKAVTRAGQAARATSVEVAELWSTFMQKWIAYTAAVIDAERDRG
AAPRTLPAHELATALNLMNERTLFASFAGEQPSVPEARVLDTLVHIWVTSIYGENR
>P58764 2.7.10.-~~~etk~~~Tyrosine-protein kinase etk~~~
MTTKNMNTPPGSTQENEIDLLRLVGELWDHRKFIISVTALFTLIAVAYSLLSTPIYQADTLVQVEQKQGNAILSGLSDMI
PNSSPESAPEIQLLQSRMILGKTIAELNLRDIVEQKYFPIVGRGWARLTKEKPGELAISWMHIPQLNGQDQQLTLTVGEN
GHYTLEGEGFTVNGMVGQRLEKDGVALTIADIKAKPGTQFVLSQRTELEAINALQGTFTVSERSKESGMLELTMTGDDPQ
LITRILNSIANNYLQQNIARQAAQDSQSLEFLQRQLPEVRSELDQAEEKLNVYRQQRDSVDLNLEAKAVLEQIVNVDNQL
NELTFREAEISQLYKKDHPTYRALLEKRQTLEQERKRLNKRVSAMPSTQQEVLRLSRDVEAGRAVYLQLLNRQQELSISK
SSAIGNVRIIDPAVTQPQPVKPKKALNVVLGFILGLFISVGAVLARAMLRRGVEAPEQLEEHGISVYATIPMSEWLDKRT
RLRKKNLFSNQQRHRTKNIPFLAVDNPADSAVEAVRALRTSLHFAMMETENNILMITGATPDSGKTFVSSTLAAVIAQSD
QKVLFIDADLRRGYSHNLFTVSNEHGLSEYLAGKDELNKVIQHFGKGGFDVITRGQVPPNPSELLMRDRMRQLLEWANDH
YDLVIVDTPPMLAVSDAAVVGRSVGTSLLVARFGLNTAKEVSLSMQRLEQAGVNIKGAILNGVIKRASTAYSYGYNYYGY
SYSEKE
>P38134 2.7.10.-~~~etk~~~Tyrosine-protein kinase etk~~~COG0489
MTTKNMNTPPGSTQENEIDLLRLVGELWDHRKFIISVTALFTLIAVAYSLLSTPIYQADTLVQVEQKQGNAILSGLSDMI
PNSSPESAPEIQLLQSRMILGKTIAELNLRDIVEQKYFPIVGRGWARLTKEKPGELAISWMHIPQLNGQDQQLTLTVGEN
GHYTLEGEEFTVNGMVGQRLEKDGVALTIADIKAKPGTQFVLSQRTELEAINALQETFTVSERSKESGMLELTMTGDDPQ
LITRILNSIANNYLQQNIARQAAQDSQSLEFLQRQLPEVRSELDQAEEKLNVYRQQRDSVDLNLEAKAVLEQIVNVDNQL
NELTFREAEISQLYKKDHPTYRALLEKRQTLEQERKRLNKRVSAMPSTQQEVLRLSRDVEAGRAVYLQLLNRQQELSISK
SSAIGNVRIIDPAVTQPQPVKPKKALNVVLGFILGLFISVGAVLARAMLRRGVEAPEQLEEHGISVYATIPMSEWLDKRT
RLRKKNLFSNQQRHRTKNIPFLAVDNPADSAVEAVRALRTSLHFAMMETENNILMITGATPDSGKTFVSSTLAAVIAQSD
QKVLFIDADLRRGYSHNLFTVSNEHGLSEYLAGKDELNKVIQHFGKGGFDVITRGQVPPNPSELLMRDRMRQLLEWANDH
YDLVIVDTPPMLAVSDAAVVGRSVGTSLLVARFGLNTAKEVSLSMQRLEQAGVNIKGAILNGVIKRASTAYSYGYNYYGY
SYSEKE
>P0ACZ2 3.1.3.48~~~etp~~~Low molecular weight protein-tyrosine-phosphatase Etp~~~COG0394
MAQLKFNSILVVCTGNICRSPIGERLLRKRLPGVKVKSAGVHGLVKHPADATAADVAANHGVSLEGHAGRKLTAEMARNY
DLILAMESEHIAQVTAIAPEVRGKTMLFGQWLEQKEIPDPYRKSQDAFEHVYGMLERASQEWAKRLSR
>A0A0H2VFI8 3.6.1.-~~~ettA~~~Energy-dependent translational throttle protein EttA~~~COG0488
MAQFVYTMHRVGKVVPPKRHILKNISLSFFPGAKIGVLGLNGAGKSTLLRIMAGIDKDIEGEARPQPDIKIGYLPQEPQL
NPEHTVRESIEEAVSEVVNALKRLDEVYALYADPDADFDKLAAEQGRLEEIIQAHDGHNLNVQLERAADALRLPDWDAKI
ANLSGGERRRVALCRLLLEKPDMLLLDEPTNHLDAESVAWLERFLHDFEGTVVAITHDRYFLDNVAGWILELDRGEGIPW
EGNYSSWLEQKDQRLAQEASQEAARRKSIEKELEWVRQGTKGRQSKGKARLARFEELNSTEYQKRNETNELFIPPGPRLG
DKVLEVSNLRKSYGDRLLIDSLSFSIPKGAIVGIIGPNGAGKSTLFRMISGQEQPDSGTITLGETVKLASVDQFRDSMDN
SKTVWEEVSGGLDIMKIGNTEMPSRAYVGRFNFKGVDQGKRVGELSGGERGRLHLAKLLQVGGNMLLLDEPTNDLDIETL
RALENALLEFPGCAMVISHDRWFLDRIATHILDYQDEGKVEFFEGNFTEYEEYKKRTLGADALEPKRIKYKRIAK
>P0A9W3 3.6.1.-~~~ettA~~~Energy-dependent translational throttle protein EttA~~~COG0488
MAQFVYTMHRVGKVVPPKRHILKNISLSFFPGAKIGVLGLNGAGKSTLLRIMAGIDKDIEGEARPQPDIKIGYLPQEPQL
NPEHTVRESIEEAVSEVVNALKRLDEVYALYADPDADFDKLAAEQGRLEEIIQAHDGHNLNVQLERAADALRLPDWDAKI
ANLSGGERRRVALCRLLLEKPDMLLLDEPTNHLDAESVAWLERFLHDFEGTVVAITHDRYFLDNVAGWILELDRGEGIPW
EGNYSSWLEQKDQRLAQEASQEAARRKSIEKELEWVRQGTKGRQSKGKARLARFEELNSTEYQKRNETNELFIPPGPRLG
DKVLEVSNLRKSYGDRLLIDDLSFSIPKGAIVGIIGPNGAGKSTLFRMISGQEQPDSGTITLGETVKLASVDQFRDSMDN
SKTVWEEVSGGLDIMKIGNTEMPSRAYVGRFNFKGVDQGKRVGELSGGERGRLHLAKLLQVGGNMLLLDEPTNDLDIETL
RALENALLEFPGCAMVISHDRWFLDRIATHILDYQDEGKVEFFEGNFTEYEEYKKRTLGADALEPKRIKYKRIAK
>P45127 3.6.1.-~~~ettA~~~Energy-dependent translational throttle protein EttA~~~COG0488
MSSQFVYTMHRVGKVVPPKRHILKDISLSFFPGAKIGVLGLNGAGKSTLLRIMAGVDKEFEGEARPQPGIKIGYLPQEPK
LEPQQTVREAVEEAVSEVKNALTRLDEVYALYADPDADFDKLAAEQANLEAIIQAHDGHNLDNQLERAADALRLPDWDAK
IEHLSGGERRRVALCRLLLEKPDMLLLDEPTNHLDAESVAWLERFLHDYEGTVVAITHDRYFLDNVAGWILELDRGEGIP
WEGNYSSWLEQKEKRLEQEQATENARQKSIAKELEWVRQNPKGRQAKSKARMARFDELNSGEYQKRNETNELFIPPGPRL
GDKVIEVQNLTKSYGDRTLIDDLSFSIPKGAIVGIIGANGAGKSTLFRMLSGQEQPDSGSVTMGETVVLASVDQFRDSMD
DKKTVWEEVSNGQDILTIGNFEIPSRAYVGRFNFKGVDQQKRVGELSGGERGRLHLAKLLQRGGNVLLLDEPTNDLDVET
LRALENAILEFPGCAMVISHDRWFLDRIATHILDYGDEGKVTFYEGNFSDYEEWKKKTLGDAATQPHRIKYKRIAK
>P9WQK3 3.6.1.-~~~ettA~~~Energy-dependent translational throttle protein EttA~~~COG0488
MAEFIYTMKKVRKAHGDKVILDDVTLSFYPGAKIGVVGPNGAGKSSVLRIMAGLDKPNNGDAFLATGATVGILQQEPPLN
EDKTVRGNVEEGMGDIKIKLDRFNEVAELMATDYTDELMEEMGRLQEELDHADAWDLDAQLEQAMDALRCPPADEPVTNL
SGGERRRVALCKLLLSKPDLLLLDEPTNHLDAESVQWLEQHLASYPGAILAVTHDRYFLDNVAEWILELDRGRAYPYEGN
YSTYLEKKAERLAVQGRKDAKLQKRLTEELAWVRSGAKARQAKSKARLQRYEEMAAEAEKTRKLDFEEIQIPVGPRLGNV
VVEVDHLDKGYDGRALIKDLSFSLPRNGIVGVIGPNGVGKTTLFKTIVGLETPDSGSVKVGETVKLSYVDQARAGIDPRK
TVWEVVSDGLDYIQVGQTEVPSRAYVSAFGFKGPDQQKPAGVLSGGERNRLNLALTLKQGGNLILLDEPTNDLDVETLGS
LENALLNFPGCAVVISHDRWFLDRTCTHILAWEGDDDNEAKWFWFEGNFGAYEENKVERLGVDAARPHRVTHRKLTRG
>P0A0L2 ~~~entA~~~Enterotoxin type A~~~
MKKTAFTLLLFIALTLTTSPLVNGSEKSEEINEKDLRKKSELQGTALGNLKQIYYYNEKAKTENKESHDQFLQHTILFKG
FFTDHSWYNDLLVDFDSKDIVDKYKGKKVDLYGAYYGYQCAGGTPNKTACMYGGVTLHDNNRLTEEKKVPINLWLDGKQN
TVPLETVKTNKKNVTVQELDLQARRYLQEKYNLYNSDVFDGKVQRGLIVFHTSTEPSVNYDLFGAQGQYSNTLLRIYRDN
KTINSENMHIDIYLYTS
>Q02307 ~~~etxB~~~Epsilon-toxin type B~~~
MKKNLVKSLAIASAVISIYSIVNIVSPTNVIAKEISNTVSNEMSKKASYDNVDTLIEKGRYNTKYNYLKRMEKYYPNAMA
YFDKVTINPQGNDFYINNPKVELDGEPSMNYLEDVYVGKALLTNDTQQEQKLKSQSFTCKNTDTVTATTTHTVGTSIQAT
AKFTVPFNETGVSLTTSYSFANTNTNTNSKEITHNVPSQDILVPANTTVEVIAYLKKVNVKGNVKLVGQVSGSEWGEIPS
YLAFPRDGYKFSLSDTVNKSDLNEDGTININGKGNYSAVMGDELIVKVRNLNTNNVQEYVIPVDKKEKSNDSNIVKYRSL
YIKAPGIK
>P01552 ~~~entB~~~Enterotoxin type B~~~
MYKRLFISHVILIFALILVISTPNVLAESQPDPKPDELHKSSKFTGLMENMKVLYDDNHVSAINVKSIDQFLYFDLIYSI
KDTKLGNYDNVRVEFKNKDLADKYKDKYVDVFGANYYYQCYFSKKTNDINSHQTDKRKTCMYGGVTEHNGNQLDKYRSIT
VRVFEDGKNLLSFDVQTNKKKVTAQELDYLTRHYLVKNKKLYEFNNSPYETGYIKFIENENSFWYDMMPAPGDKFDQSKY
LMMYNDNKMVDSKDVKIEVYLTTKKK
>P20723 ~~~entD~~~Enterotoxin type D~~~
MKKFNILIALLFFTSLVISPLNVKANENIDSVKEKELHKKSELSSTALNNMKHSYADKNPIIGENKSTGDQFLENTLLYK
KFFTDLINFEDLLINFNSKEMAQHFKSKNVDVYPIRYSINCYGGEIDRTACTYGGVTPHEGNKLKERKKIPINLWINGVQ
KEVSLDKVQTDKKNVTVQELDAQARRYLQKDLKLYNNDTLGGKIQRGKIEFDSSDGSKVSYDLFDVKGDFPEKQLRIYSD
NKTLSTEHLHIDIYLYEK
>P12993 ~~~entE~~~Enterotoxin type E~~~
MKKTAFILLLFIALTLTTSPLVNGSEKSEEINEKDLRKKSELQRNALSNLRQIYYYNEKAITENKESDDQFLENTLLFKG
FFTGHPWYNDLLVDLGSKDATNKYKGKKVDLYGAYYGYQCAGGTPNKTACMYGGVTLHDNNRLTEEKKVPINLWIDGKQT
TVPIDKVKTSKKEVTVQELDLQARHYLHGKFGLYNSDSFGGKVQRGLIVFHSSEGSTVSYDLFDAQGQYPDTLLRIYRDN
KTINSENLHIDLYLYTT
>P0A0L8 ~~~entG~~~Enterotoxin type G~~~
MKKLSTVIIILILEIVFHNMNYVNAQPDPKLDELNKVSDYKNNKGTMGNVMNLYTSPPVEGRGVINSRQFLSHDLIFPIE
YKSYNEVKTELENTELANNYKDKKVDIFGVPYFYTCIIPKSEPDINQNFGGCCMYGGLTFNSSENERDKLITVQVTIDNR
QSLGFTITTNKNMVTIQELDYKARHWLTKEKKLYEFDGSAFESGYIKFTEKNNTSFWFDLFPKKELVPFVPYKFLNIYGD
NKVVDSKSIKMEVFLNTH
>P0A0M0 ~~~entH~~~Enterotoxin type H~~~
MINKIKILFSFLALLLSFTSYAKAEDLHDKSELTDLALANAYGQYNHPFIKENIKSDEISGEKDLIFRNQGDSGNDLRVK
FATADLAQKFKNKNVDIYGASFYYKCEKISENISECLYGGTTLNSEKLAQERVIGANVWVDGIQKETELIRTNKKNVTLQ
ELDIKIRKILSDKYKIYYKDSEISKGLIEFDMKTPRDYSFDIYDLKGENDYEIDKIYEDNKTLKSDDISHIDVNLYTKKK
V
>Q06566 ~~~~~~Early upstream open reading frame~~~
MECIQHESCFDVDDREDAQQIKEQEGTEMVSITQAAKLHNVTRQAIYVAIKQKKLKASKTTRWEIDLKDLEDYKRNRYSR
KKSLYQGELLFDNEKGCYSVNQVADMLGIPVQKVYYATRTGTMRGERKGAAWVISQSEIDRYKSEYLNKQTAKKAKGVTV
VEHAIAKPEETVSSETLLFENN
>P76551 ~~~eutA~~~Ethanolamine ammonia-lyase reactivase EutA~~~COG4819
MNTRQLLSVGIDIGTTTTQVIFSRLELVNRAAVSQVPRYEFIKREISWQSPVFFTPVDKQGGLKEAELKTLILEQYHAAG
IEPESVDSGAIIITGESAKTRNARPAVMALSQSLGDFVVASAGPHLESVIAGHGAGAQTLSEQRLCRVLNIDIGGGTANY
ALFDAGKISGTACLNVGGRLLETDSHGRVVYAHKPGQMIVDECFGAGTDARSLTGAQLVQVTRRMAELIVEVIDGTLSPL
AQALMQTGLLPAGVTPEIITLSGGVGECYRHQPADPFCFADIGPLLATALHDHPRLREMNVQFPAQTVRATVIGAGAHTL
SLSGSTIWLEGVQLPLRNLPVAIPIDETDLVGAWQQALIQLDLDPKTDAYVLALPASLPVRYAAVLTVINALVDFVARFP
NPHPLLVVAGQDFGKALGMLLRPQLQQLPLAVIDEVIVRAGDYIDIGTPLFGGSVVPVTVKSLAFPS
>Q9ZFV2 ~~~eutA~~~Ethanolamine ammonia-lyase reactivase EutA~~~
MNTRQLLSVGIDIGTTTTQVIFSRLELVNRAAVSQVPRYEFIKRDISWQSPVFFTPVDKQGGLKEVELKALILAQYQAAG
IAPESVDSGAIIITGESAKTRNARPAVMALSQSLGDFVVASAGPHLESVIAGHGAGAQSLSEQRMCRVLNIDIGGGTSNY
ALFDAGKVSGTACLNVGGRLLETDAQGRVVYAHQPGQMIIDEVFGSGTDARALAAAQLGQVARRMADLIVEVITGALSPL
AQSLMQTGLLPADITPEVITLSGGVGECYRNQPADPFCFSDIGPLLATALHEHPRLREMNVQFPAQTVRATVIGAGAHTL
SLSGSTIWLEDVQLPLRNLPVAIPQDDADLVNAWRQALLQLDLDPQTDAYVLALPATLPVRYAALLTVINALTAFVARYP
NPHPLLVVAEQDFGKALGMLLRPQLPQLPLAVIDEVVVRAGDYIDIGTPLFGGSVVPVTVKSLAFPS
>P0AEJ6 4.3.1.7~~~eutB~~~Ethanolamine ammonia-lyase large subunit~~~COG4303
MKLKTTLFGNVYQFKDVKEVLAKANELRSGDVLAGVAAASSQERVAAKQVLSEMTVADIRNNPVIAYEDDCVTRLIQDDV
NETAYNQIKNWSISELREYVLSDETSVDDIAFTRKGLTSEVVAAVAKICSNADLIYGAKKMPVIKKANTTIGIPGTFSAR
LQPNDTRDDVQSIAAQIYEGLSFGVGDAVIGVNPVTDDVENLSRVLDTIYGVIDKFNIPTQGCVLAHVTTQIEAIRRGAP
GGLIFQSICGSEKGLKEFGVELAMLDEARAVGAEFNRIAGENCLYFETGQGSALSAGANFGADQVTMEARNYGLARHYDP
FIVNTVVGFIGPEYLYNDRQIIRAGLEDHFMGKLSGISMGCDCCYTNHADADQNLNENLMILLATAGCNYIMGMPLGDDI
MLNYQTTAFHDTATVRQLLNLRPSPEFERWLESMGIMANGRLTKRAGDPSLFF
>P19264 4.3.1.7~~~eutB~~~Ethanolamine ammonia-lyase large subunit~~~
MKLKTTLFGNVYQFKDVKEVLAKANELRSGDVLAGVAAASSQERVAAKQVLSEMTVADIRNNPVIAYEEDCVTRLIQDDV
NETAYNRIKNWSISELREYVLSDETSVDDIAFTRKGLTSEVVAAVAKICSNADLIYGGKKMPVIKKANTTIGIPGTFSCR
LQPNDTRDDVQSIAAQIYEGLSFGAGDAVIGVNPVTDDVENLTRVLDTVYGVIDKFNIPTQGCVLAHVTTQIEAIRRGAP
GGLIFQSICGSEKGLKEFGVELAMLDEARAVGAEFNRIAGENCLYFETGQGSALSAGANFGADQVTMEARNYGLARHYDP
FLVNTVVGFIGPEYLYNDRQIIRAGLEDHFMGKLSGISMGCDCCYTNHADADQNLNENLMILLATAGCNYIMGMPLGDDI
MLNYQTTAFHDTATVRQLLNLRPSPEFERWLETMGIMANGRLTKRAGDPSLFF
>P19636 4.3.1.7~~~eutC~~~Ethanolamine ammonia-lyase small subunit~~~COG4302
MDQKQIEEIVRSVMASMGQAAPAPSEAKCATTNCAAPVTSESCALDLGSAEAKAWIGVENPHRADVLTELRRSTVARVCT
GRAGPRPRTQALLRFLADHSRSKDTVLKEVPEEWVKAQGLLEVRSEISDKNLYLTRPDMGRRLCAEAVEALKAQCVANPD
VQVVISDGLSTDAITVNYEEILPPLMAGLKQAGLKVGTPFFVRYGRVKIEDQIGEILGAKVVILLVGERPGLGQSESLSC
YAVYSPRMATTVEADRTCISNIHQGGTPPVEAAAVIVDLAKRMLEQKASGINMTR
>P19265 4.3.1.7~~~eutC~~~Ethanolamine ammonia-lyase small subunit~~~
MDQKQIEEIVRSVMASMGQDVPQPAAPSTQEGAKPQCAAPTVTESCALDLGSAEAKAWIGVENPHRADVLTELRRSTAAR
VCTGRAGPRPRTQALLRFLADHSRSKDTVLKEVPEEWVKAQGLLEVRSEISDKNLYLTRPDMGRRLSPEAIDALKSQCVM
NPDVQVVVSDGLSTDAITANYEEILPPLLAGLKQAGLNVGTPFFVRYGRVKIEDQIGEILGAKVVILLVGERPGLGQSES
LSCYAVYSPRVATTVEADRTCISNIHQGGTPPVEAAAVIVDLAKRMLEQKASGINMTR
>P77218 2.3.1.8~~~eutD~~~Phosphate acetyltransferase EutD~~~COG0280
MIIERCRELALRAPARVVFPDALDQRVLKAAQYLHQQGLATPILVANPFELRQFALSHGVAMDGLQVIDPHGNLAMREEF
AHRWLARAGEKTPPDALEKLTDPLMFAAAMVSAGKADVCIAGNLSSTANVLRAGLRIIGLQPGCKTLSSIFLMLPQYSGP
ALGFADCSVVPQPTAAQLADIALASAETWRAITGEEPRVAMLSFSSNGSARHPCVANVQQATEIVRERAPKLVVDGELQF
DAAFVPEVAAQKAPASPLQGKANVMVFPSLEAGNIGYKIAQRLGGYRAVGPLIQGLAAPMHDLSRGCSVQEIIELALVAA
VPRQTEVNRESSLQTLVE
>P41790 2.3.1.8~~~eutD~~~Phosphate acetyltransferase EutD~~~
MIIERARELAVRAPARVVFPDALDERVLKAAHYLQQYGLARPVLVASPFALRQFALSHRMAMDGIQVIDPHSNLSMRQRF
AQRWLARAGEKTPPDAVEKLSDPLMFAAAMVSAGEADVCIAGNLSSTANVLRAGLRVIGLQPGCKTLSSIFLMLPQYAGP
ALGFADCSVVPQPTAAQLADIALASADTWRAITGEEPRVAMLSFSSNGSARHPNVANVQQATELVRERAPQLLVDGELQF
DAAFVPEVAAQKAPDSPLQGRANVMIFPSLEAGNIGYKITQRLGGYRAVGPLIQGLAAPLHDLSRGCSVQEIIELALVAA
VPRQADVSRERSLHTLVE
>P41793 1.2.1.10~~~eutE~~~Acetaldehyde dehydrogenase (acetylating) EutE~~~
MNQQDIEQVVKAVLLKMKDSSQPASTVHEMGVFASLDDAVAAAKRAQQGLKSVAMRQLAIHAIREAGEKHARELAELAVS
ETGMGRVDDKFAKNVAQARGTPGVECLSPQVLTGDNGLTLIENAPWGVVASVTPSTNPAATVINNAISLIAAGNSVVFAP
HPAAKKVSQRAITLLNQAVVAAGGPENLLVTVANPDIETAQRLFKYPGIGLLVVTGGEAVVDAARKHTNKRLIAAGAGNP
PVVVDETADLPRAAQSIVKGASFDNNIICADEKVLIVVDSVADELMRLMEGQHAVKLTAAQAEQLQPVLLKNIDERGKGT
VSRDWVGRDAGKIAAAIGLNVPDQTRLLFVETPANHPFAVTEMMMPVLPVVRVANVEEAIALAVQLEGGCHHTAAMHSRN
IDNMNQMANAIDTSIFVKNGPCIAGLGLGGEGWTTMTITTPTGEGVTSARTFVRLRRCVLVDAFRIV
>P41795 1.1.1.1~~~eutG~~~Probable alcohol dehydrogenase EutG~~~
MQAELQTALFQAFDTLNLQRVKTFSVPPVTLCGLGALGACGQEAQARGVSHLFVMVDSFLHQAGMTAPLARSLAMKGVAM
TVWPCPPGEPCITDVCAAVAQLREAACDGVVAFGGGSVLDAAKAVALLVTNPDQTLSAMTEHSTLRPRLPLIAVPTTAGT
GSETTNVTVIIDAVSGRKQVLAHASLMPDVAILDAAVTEGVPPNVTAMTGIDALTHAIEAYSALNATPFTDSLAIGAIAM
IGKSLPKAVGYGHDLAARENMLLASCMAGMAFSSAGLGLCHAMAHQPGAALHIPHGQANAMLLPTVMGFNRMVCRERFSQ
IGRALTNKKSDDRDAIAAVCELIAEVGQSKRLADAGAKPEHYSAWAQAALEDICLRSNPRTATQAQIIDLYAAAG
>P41796 ~~~eutH~~~Probable ethanolamine permease EutH~~~
MGINEIIMYIMMFFMLIAAVDRILSQFGGSARFLGKFGKSIEGSGGQFEEGFMAMGALGLAMVGMTALAPVLAHVLGPVI
IPVYEMLGANPSMFAGTLLACDMGGFFLAKELAGGDVAAWLYSGLILGSMMGPTIVFSIPVALGIIEPSDRRYLALGVLA
GIVTIPIGCIAGGLIAMYSGVQINGQPVEFTFALILMNMIPVLIVAVLVALGLKFIPEKMINGFQIFAKFLVALITIGLA
AAVVKFLLGWELIPGLDPIFMAPGDKPGEVMRAIEVIGSISCVLLGAYPMVLLLTRWFEKPLMNVGKLLNVNNIAAAGMV
ATLANNIPMFGMMKQMDTRGKVINCAFAVSAAFALGDHLGFAAANMNAMIFPMIVGKLIGGVTAIGVAMMLVPKDDAAQV
KTEAEAQS
>P0A206 ~~~eutJ~~~Ethanolamine utilization protein EutJ~~~
MAHDEQLWLTPRLQKAAALCNQTPAASDTPLWLGVDLGTCDVVSMVVDGNAQPVAVCLDWADVVRDGIVWDFFGAVTLVR
RHLDTLEQQLGCRFTHAATSFPPGTDPRISINVLESAGLEVSHVLDEPTAVADLLALDNAGVVDIGGGTTGIAIVKQGKV
TYSADEATGGHHISLTLAGNRRIPLEEAEQYKRSNAQEIWPVVKPVYEKMAEIVARHIEGQGIADLWLAGGSCMQPGVEA
LFRQRFPELQVHLPQHSLFMTPLAIANSGRAKAEGLYAS
>P76540 ~~~eutK~~~Bacterial microcompartment shell protein EutK~~~COG4577
MINALGLLEVDGMVAAIDAADAMLKAANVRLLSHEVLDPGRLTLVVEGDLAACRAALDAGCAAAMRTGRVISRKEIGRPD
DDTQWLVTGFNRQPKQPVREPDAPVIVAESADELLALLTSVRQGMTAGEVAAHFGWPLEKARNALEQLFSAGTLRKRSSR
YRLKPH
>Q9ZFU8 ~~~eutK~~~Bacterial microcompartment shell protein EutK~~~
MINALGLLEVDGMVAAVDAADAMLKAANVRLLSHQVLDPGRLTLVVEGDLAACRAALDAGSAAAQRTGRVISRKEIGRPE
EDTQWLIGGFARATTPTEKAPQVPATPEFAEALLALLASVRQGMTAGEVAAHFGWPLEQARNVLEQLFSDGALRKRSSRY
RIKN
>Q8XLZ0 ~~~eutL~~~Bacterial microcompartment shell protein EutL~~~
MKNDLIRPNVLSVKIISNVSPEMAKKLELEPHHKSLGLITADCDDVTYTALDEATKAAEVDVVYARSMYAGAGNASTKLA
GEVIGILAGPSPAEVRSGLNATLDFIDSGVGFVSANEDDSICYYAQCVSRTGSYLSKTAGIREGEALAYLVAPPLEAMYA
LDAALKAADVEMCEFFAPPTETNFAGALLTGSQSACKAACDAFAEAVQSVASNPLGF
>P76541 ~~~eutL~~~Bacterial microcompartment shell protein EutL~~~COG4816
MPALDLIRPSVTAMRVIASVNADFARELKLPPHIRSLGLISADSDDVTYIAADEATKQAMVEVVYGRSLYAGAAHGPSPT
AGEVLIMLGGPNPAEVRAGLDAMIAHIENGAAFQWANDAQDTAFLAHVVSRTGSYLSSTAGITLGDPMAYLVAPPLEATY
GIDAALKSADVQLATYVPPPSETNYSAAFLTGSQAACKAACNAFTDAVLEIARNPIQRA
>P0A1C9 ~~~eutL~~~Bacterial microcompartment shell protein EutL~~~
MPALDLIRPSVTAMRVIASVNDGFARELKLPPHIRSLGLITADSDDVTYIAADEATKQAMVEVVYGRSLYAGAAHGPSPT
AGEVLIMLGGPNPAEVRAGLDAMVASIENGAAFQWANDAENTAFLAHVVSRTGSYLSSTAGIALGDPMAYLVAPPLEATF
GIDAAMKSADVQLVTYVPPPSETNYSAAFLTGSQAACKAACNAFTDAVLDIARNPVQRA
>Q187N0 ~~~eutM~~~Bacterial microcompartment shell protein EutM~~~COG4577
MASANALGMIETKGLVGAIEAADAMVKAANVQLVGKEQVGGGLVTVMVRGDVGAVKAATDAGAAAAERVGELISVHVIPR
PHFEVDAILPKVSAE
>P0ABF4 ~~~eutM~~~Bacterial microcompartment shell protein EutM~~~COG4577
MEALGMIETRGLVALIEASDAMVKAARVKLVGVKQIGGGLCTAMVRGDVAACKAATDAGAAAAQRIGELVSVHVIPRPHG
DLEEVFPIGLKGDSSNL
>P41791 ~~~eutM~~~Bacterial microcompartment shell protein EutM~~~
MEALGMIETRGLVALIEASDAMVKAARVKLVGVKQIGGGLCTAMVRGDVAACKAATDAGAAAAQRIGELVSVHVIPRPHG
DLEEVFPISFKGDSNI
>P0AEJ8 ~~~eutN~~~Bacterial microcompartment shell vertex protein EutN~~~COG4576
MKLAVVTGQIVCTVRHHGLAHDKLLMVEMIDPQGNPDGQCAVAIDNIGAGTGEWVLLVSGSSARQAHKSETSPVDLCVIG
IVDEVVSGGQVIFHK
>P41792 ~~~eutN~~~Bacterial microcompartment shell vertex protein EutN~~~
MEADMKLAVVTGQIVCTVRHQGLAHDKLLMVEMIDAQGNPDGQCAVAIDSIGAGTGEWVLLVSGSSARQAHRSELSPVDL
CVIGIVDEVVAGGKVVFHK
>P0A208 2.7.2.1~~~eutP~~~Probable acetate kinase EutP~~~
MKRIAFVGAVGAGKTTLFNALRGNYSLARKTQAVEFNDHGDIDTPGEYFSHPRWYHALITTLQDVDTLIYVHAANDKESR
LPAGLLDVGTRKRHIAVISKTDMPDADVAATRQLLCEIGFREPIFELNGHDPQSVRQLVDYLAALSEQEEEAGEKTYHS
>Q187N7 2.7.2.1~~~eutQ~~~Acetate kinase EutQ~~~COG4766
MDISNIDKNLIETLVRQIIEEKISGTKDTVDFVRNKDISGITSIKLPTVKVSESDRLDTGNPSDVVYTKDLFTLEESPRL
GCGMMEMKETTFDWTLNYDEIDYVIDGTLDIIIDGRKVSASSGELIFIPKGSKIQFSVPDYARFIYVTYPADWASQN
>P76555 2.7.2.1~~~eutQ~~~Acetate kinase EutQ~~~COG4766
MKKLITANDIREAHARGEQAMSVVLRASIITPEAREVADLLGFTITECDESIPVTASVPASVPADKTESQRIRETIIAQL
PEGQFTESLVAQLMEKVMKEKQSLEQGAMQPSFKSVTGKGGIKVIDGSSVKFGRFDGAEPHCVGLTDLVTGDDGSSMAAG
FMQWENAFFPWTLNYDEIDMVLEGELHVRHEGQTMIAKAGDVMFIPKGSSIEFGTTSSVKFLYVAWPANWQSL
>Q9ZFV5 2.7.2.1~~~eutQ~~~Acetate kinase EutQ~~~
MKKLITANDIRAAHARGEQAMSVVLRASIITPEAREVAELLGFTITECDESVPASTSAQACKSESQRIREAIIAQLPEGQ
FTESLVAQLMEKVLKEKQSLELGTMQPSFTSVTGKGGVKVIDGSSVKFGRFDGAEPHCVGLTDLVTEQDGSSMAAGFMQW
DNAFFPWTLNYDEIDMVLEGELHVRHEGETMIAKAGDVMFIPKGSSIEFGTPTSVRFLYVAWPANWQSV
>Q9ZFU7 ~~~eutR~~~HTH-type DNA-binding transcriptional activator EutR~~~
MKKTRTANLHHLYHEALPEDVKLTPRVEVDNVHQRRTTDVYEHALTITAWQQIYDQLHPGKFHGEFTEILLDEIQVFREY
TGLALRQSCLVWPNSFWFGIPATRGEQGFIGAQGLGSAEIATRPGGTEFELSTPDDYTILGVVISEDVISRQATFLHNPE
RVLHMLRNQLALEVKEQHKAALWGFVQQALATFSESPETLHQPAVRKVLSDNLLLAMGTMLEEAKPIHSAESISHQGYRR
LLSRAREYVLENMSEPLTVLDLCNQLHVSRRTLQNAFHAILGIGPNAWLKRIRLNAVRRELISPWSQSATVKDAAMQWGF
WHLGQFATDYQQLFAEKPSLTLHQRMRQWA
>Q187M0 ~~~eutS~~~Bacterial microcompartment shell protein EutS~~~COG4810
MTEESKQRVIQEYVPGKQVTLAHIIANPNEDIYKKLGLVLDKKDAIGILTITPSEASIIAADVATKASNVSLGFIDRFSG
SVVISGDVSSVESALNDVLEVLGNMLNFSSTKITRT
>P63746 ~~~eutS~~~Bacterial microcompartment shell protein EutS~~~COG4810
MDKERIIQEFVPGKQVTLAHLIAHPGEELAKKIGVPDAGAIGIMTLTPGETAMIAGDLALKAADVHIGFLDRFSGALVIY
GSVGAVEEALSQTVSGLGRLLNYTLCEMTKS
>Q9ZFV7 ~~~eutS~~~Bacterial microcompartment shell protein EutS~~~
MNKERIIQEFVPGKQVTLAHLIAHPGEELAKKIGVPDAGAIGIMTLTPGETAMIAGDLAMKAADVHIGFLDRFSGALVIY
GTVGAVEEALLQTVSGLGRLLNFTLCELTKS
>Q9ZFV4 2.5.1.154~~~eutT~~~Corrinoid adenosyltransferase EutT~~~
MNDFITETWLRANHTLSEGSEIHLPADARLTPSARELLESRRLRIKFLDPQGRLFVDDDEQQPQPVHGLTSSDTHPQACC
ELCRQPVVKKPDTLTHLTADKMVAKSDPRLGFRAALDSAIALTVWLQIELAEPWQPWLFDIRSRLGNIMRADAIDEPLAA
QSIVGLNEDELHRLSHQPLRYLDHDHLVPEASHGRDAALLNLLRTKVRETETLAAQVFITRSFEVLRPDILQALNRLSST
VYVMMILSVAKHPLTVAQIQQRLGEKP
>O52793 4.2.1.159~~~evaA~~~dTDP-4-dehydro-6-deoxy-alpha-D-glucopyranose 2,3-dehydratase~~~
MSSFVVPSLTAVRPRDHHDYADRIALSAATTDGVQMRTEDVRAWIAERRDANVFHVERIPFADLDQWWFEGVTGNLVHRS
GRFFTIEGLHVIEHDGPHGDGPYREWQQPVIRQPEVGILGILAKEFDGVLHFLMQAKMEPGNPNLVQLSPTVQATRSNYT
KAHGGTNVKLIEYFAPPDPERVIVDVLQAEQGSWFFRKSNRNMIVETVDDVPLWDDFCWLTLGQIAELMHEDETINMNSR
SVLSCLPYQDITPRALFSDVQLLSWFTNERSRHDVRVRRIPLADVCGWKQGAEEIEHEDGRYFKVLAVAVKGSNREKISW
TQPLVESVDLGVVAFLVRKIDGVPHVLVQARVDGGFLDTVELAPTVQCTPLNYAHLPAEERPPFLDLVQNAPRSRIRYEA
IHSEEGGRFLGVRARYLVIDADEAIDPPPGYAWVTPAQLTALTRHGHYVNVEARTLLACINAAAAQPRGGA
>P0ACZ4 ~~~evgA~~~DNA-binding transcriptional activator EvgA~~~COG2197
MNAIIIDDHPLAIAAIRNLLIKNDIEILAELTEGGSAVQRVETLKPDIVIIDVDIPGVNGIQVLETLRKRQYSGIIIIVS
AKNDHFYGKHCADAGANGFVSKKEGMNNIIAAIEAAKNGYCYFPFSLNRFVGSLTSDQQKLDSLSKQEISVMRYILDGKD
NNDIAEKMFISNKTVSTYKSRLMEKLECKSLMDLYTFAQRNKIG
>P0ACZ7 ~~~evgA~~~DNA-binding transcriptional activator EvgA~~~
MNAIIIDDHPLAIAAIRNLLIKNDIEILAELTEGGSAVQRVETLKPDIVIIDVDIPGVNGIQVLETLRKRQYSGIIIIVS
AKNDHFYGKHCADAGANGFVSKKEGMNNIIAAIEAAKNGYCYFPFSLNRFVGSLTSDQQKLDSLSKQEISVMRYILDGKD
NNDIAEKMFISNKTVSTYKSRLMEKLECKSLMDLYTFAQRNKIG
>P30855 2.7.13.3~~~evgS~~~Sensor protein EvgS~~~COG0784
MKFLPYIFLLCCGLWSTISFADEDYIEYRGISSNNRVTLDPLRLSNKELRWLASKKNLVIAVHKSQTATLLHTDSQQRVR
GINADYLNLLKRALNIKLTLREYADHQKAMDALAEGEVDIVLSHLVTSPPLNNDIAATKPLIITFPALVTTLHDSMRPLT
SPKPVNIARVANYPPDEVIHQSFPKATIISFTNLYQALASVSAGHNDYFIGSNIITSSMISRYFTHSLNVVKYYNSPRQY
NFFLTRKESVILNEVLNRFVDALTNEVRYEVSQNWLDTGNLAFLNKPLELTEHEKQWIKQHPNLKVLENPYSPPYSMTDE
NGSVRGVMGDILNIITLQTGLNFSPITVSHNIHAGTQLSPGGWDIIPGAIYSEDRENNVLFAEAFITTPYVFVMQKAPDS
EQTLKKGMKVAIPYYYELHSQLKEMYPEVEWIQVDNASAAFHKVKEGELDALVATQLNSRYMIDHYYPNELYHFLIPGVP
NASLSFAFPRGEPELKDIINKALNAIPPSEVLRLTEKWIKMPNVTIDTWDLYSEQFYIVTTLSVLLVGSSLLWGFYLLRS
VRRRKVIQGDLENQISFRKALSDSLPNPTYVVNWQGNVISHNSAFEHYFTADYYKNAMLPLENSDSPFKDVFSNAHEVTA
ETKENRTIYTQVFEIDNGIEKRCINHWHTLCNLPASDNAVYICGWQDITETRDLINALEVEKNKAIKATVAKSQFLATMS
HEIRTPISSIMGFLELLSGSGLSKEQRVEAISLAYATGQSLLGLIGEILDVDKIESGNYQLQPQWVDIPTLVQNTCHSFG
AIAASKSIALSCSSTFPEHYLVKIDPQAFKQVLSNLLSNALKFTTEGAVKITTSLGHIDDNHAVIKMTIMDSGSGLSQEE
QQQLFKRYSQTSAGRQQTGSGLGLMICKELIKNMQGDLSLESHPGIGTTFTITIPVEISQQVATVEAKAEQPITLPEKLS
ILIADDHPTNRLLLKRQLNLLGYDVDEATDGVQALHKVSMQHYDLLITDVNMPNMDGFELTRKLREQNSSLPIWGLTANA
QANEREKGLSCGMNLCLFKPLTLDVLKTHLSQLHQVAHIAPQYRHLDIEALKNNTANDLQLMQEILMTFQHETHKDLPAA
FQALEAGDNRTFHQCIHRIHGAANILNLQKLINISHQLEITPVSDDSKPEILQLLNSVKEHIAELDQEIAVFCQKND
>C6WFL3 4.2.3.155~~~~~~2-epi-valiolone synthase~~~COG0337
MDSPAGYRIHDNIPLPGNLDQVDVHVTRDDDYRIHVLPDVDRAVDALLTELDGRRAVVITDDVVADLHEGRVSAELAARG
QLIGRTAIRAGEKSKSLTTAFELIDWLAEVNLARRDVVIALGGGVVVDTVGFVASAYMRGVPYVNMPTTLLAQVDAGIGG
KVAVDHSEAKNLVGAFYQPKAVISCLEHLRTLDTRQIRSGLAEVVKKAVIASPELFDYIEANADDLLACASPAIDVLVHA
AGAIKTKLVGRDPYEIDLRRPLNFGHTTGHAVETVTNYGPVLHGEAVAFGMVVAVDVARARGLVVPEVADRVTALIRRLG
LPVALEELGAVPRVDDVVAALLKIRQIRDGSLRFVLPVELGATVIAEDVTEEEVRAALVRLR
>Q08VU0 4.2.3.155~~~~~~2-epi-valiolone synthase~~~COG0337
MPSTGSTPILAHDVKSPHRGSLALDGGKTGYRVTVHREDRYEIIIGRGTLARLGELLRPVMAANEADSAVIITDNHVGPL
YAELVTKRISATGAPVQCIVIPAGEPSKSIAQAHRLWDELRSRSVRRRTFLVALGGGVLCDLVGFVATTYLRGIPYVNVA
TSLMGQVDGAIGGKVGVDHSTGKNLIGGFYHPDLVVIDPSCLATLPLAEVINGLAEAVKVALIGTPGLFEQLERLPMSTA
WPLDQAAPERLIEGLGPIIPAAIGKKLELLAPDPFEQDLRRLLNLGHSVGHGLEAATHFVRYRHGEAVAIGTATVTAIST
GLGLTSVDTLRRILRLLQKLRLPVTVPDDLREVVWQHLETARLVRNGRLLLVMPTAIDHSVIIDDITRGQYDAACQLVAQ
EAPACG
>P04995 3.1.11.1~~~sbcB~~~Exodeoxyribonuclease I~~~COG2925
MMNDGKQQSTFLFHDYETFGTHPALDRPAQFAAIRTDSEFNVIGEPEVFYCKPADDYLPQPGAVLITGITPQEARAKGEN
EAAFAARIHSLFTVPKTCILGYNNVRFDDEVTRNIFYRNFYDPYAWSWQHDNSRWDLLDVMRACYALRPEGINWPENDDG
LPSFRLEHLTKANGIEHSNAHDAMADVYATIAMAKLVKTRQPRLFDYLFTHRNKHKLMALIDVPQMKPLVHVSGMFGAWR
GNTSWVAPLAWHPENRNAVIMVDLAGDISPLLELDSDTLRERLYTAKTDLGDNAAVPVKLVHINKCPVLAQANTLRPEDA
DRLGINRQHCLDNLKILRENPQVREKVVAIFAEAEPFTPSDNVDAQLYNGFFSDADRAAMKIVLETEPRNLPALDITFVD
KRIEKLLFNYRARNFPGTLDYAEQQRWLEHRRQVFTPEFLQGYADELQMLVQQYADDKEKVALLKALWQYAEEIV
>P09030 3.1.11.2~~~xthA~~~Exodeoxyribonuclease III~~~COG0708
MKFVSFNINGLRARPHQLEAIVEKHQPDVIGLQETKVHDDMFPLEEVAKLGYNVFYHGQKGHYGVALLTKETPIAVRRGF
PGDDEEAQRRIIMAEIPSLLGNVTVINGYFPQGESRDHPIKFPAKAQFYQNLQNYLETELKRDNPVLIMGDMNISPTDLD
IGIGEENRKRWLRTGKCSFLPEEREWMDRLMSWGLVDTFRHANPQTADRFSWFDYRSKGFDDNRGLRIDLLLASQPLAEC
CVETGIDYEIRSMEKPSDHAPVWATFRR
>P54161 3.1.11.-~~~ypcP~~~5'-3' exonuclease~~~COG0258
MNNNKLLLVDGMALLFRAFFATAVHRNFMINDSGVPTNGVNGFLKHLITAVETFQPTHVVCCWDMGSKTYRNDLFQDYKA
NRSAPPVELIPQFDLAKEAAAELGIMNIGFAGYEADDCIGTLADLFANEADITVVTGDRDLLQLLTDKVSVALLQKGIGN
YKVYTKETFYEETGVMPKALIDIKALMGDSSDNYPGVKGIGEKTAYKLIREYETIDRLLENLSLLPKGQQGKIQQGLSDL
EMSRKLAEIHCSVPLACTLKDALFTLQMEQAADMLRRHQIKGIERMLEKLNAREIV
>P9WNU3 3.1.11.-~~~~~~5'-3' exonuclease~~~COG0258
MRSPLVLLDGASMWFRSFFGVPSSITAPDGRPVNAVRGFIDSMAVVITQQRPNRLAVCLDLDWRPQFRVDLIPSYKAHRV
AEPEPNGQPDVEEVPDELTPQVDMIMELLDAFGIAMAGAPGFEADDVLGTLATRERRDPVIVVSGDRDLLQVVADDPVPV
RVLYLGRGLAKATLFGPAEVAERYGLPAHRAGAAYAELALLRGDPSDGLPGVPGVGEKTAATLLARHGSLDQIMAAADDR
KTTMAKGLRTKLLAASAYIKAADRVVRVATDAPVTLSTPTDRFPLVAADPERTAELATRFGVESSIARLQKALDTLPG
>P04994 3.1.11.6~~~xseA~~~Exodeoxyribonuclease 7 large subunit~~~COG1570
MLPSQSPAIFTVSRLNQTVRLLLEHEMGQVWISGEISNFTQPASGHWYFTLKDDTAQVRCAMFRNSNRRVTFRPQHGQQV
LVRANITLYEPRGDYQIIVESMQPAGEGLLQQKYEQLKAKLQAEGLFDQQYKKPLPSPAHCVGVITSKTGAALHDILHVL
KRRDPSLPVIIYPAAVQGDDAPGQIVRAIELANQRNECDVLIVGRGGGSLEDLWSFNDERVARAIFTSRIPVVSAVGHET
DVTIADFVADLRAPTPSAAAEVVSRNQQELLRQVQSTRQRLEMAMDYYLANRTRRFTQIHHRLQQQHPQLRLARQQTMLE
RLQKRMSFALENQLKRTGQQQQRLTQRLNQQNPQPKIHRAQTRIQQLEYRLAETLRAQLSATRERFGNAVTHLEAVSPLS
TLARGYSVTTATDGNVLKKVKQVKAGEMLTTRLEDGWIESEVKNIQPVKKSRKKVH
>P9WF31 3.1.11.6~~~xseA~~~Exodeoxyribonuclease 7 large subunit~~~COG1570
MTQNSAENPFPVRAVAIRVAGWIDKLGAVWVEGQLAQITMRPDAKTVFMVLRDPAADMSLTVTCSRDLVLSAPVKLAEGV
QVVVCGKPSFYTGRGTFSLRLSEIRAVGIGELLARIDRLRRLLDAEGLFDPRLKRPIPYLPNMIGLITGRASAAERDVTT
VASARWPAARFAVRNVAVQGPNAVGQIVEALRELDRDPDVDVIVLARGGGSVEDLLPFSDETLCRAIAACRTPVVSAVGH
EPDNPLCDLVVDLRAATPTDAAKKVVPDTAAEQRLIDDLRRRSAQALRNWVSREQRAVAQLRSRPVLADPMTMVSVRAEE
VHRARSTLRRNLTLMVAAETERIGHLAARLATLGPAATLARGYAIVQTVAQTGPEGGSEPQVLRSVHDAPEGTKLRVRVA
DGALAAVSEGQTNGL
>Q7VV85 3.1.11.6~~~xseB~~~Exodeoxyribonuclease 7 small subunit~~~COG1722
MASSKQADPQTDARPLPQDFETALAELESLVSAMENGTLPLEQSLSAYRRGVELARVCQDRLAQAEQQVKVLEGDLLRPL
DPAALDDE
>P0A8G9 3.1.11.6~~~xseB~~~Exodeoxyribonuclease 7 small subunit~~~COG1722
MPKKNEAPASFEKALSELEQIVTRLESGDLPLEEALNEFERGVQLARQGQAKLQQAEQRVQILLSDNEDASLTPFTPDNE
>P9WF29 3.1.11.6~~~xseB~~~Exodeoxyribonuclease 7 small subunit~~~COG1722
MVCDPNGDDTGRTHATVPVSQLGYEACRDELMEVVRLLEQGGLDLDASLRLWERGEQLAKRCEEHLAGARQRVSDVLAGD
EAQNG
>A8R3S7 ~~~exaE~~~Transcriptional activator protein ExaE~~~
MGILLVDDHPMIRLGLAHFLGEGLNGLPVREAGSGEEALQQVQEELPGLVIMDFDLPGISGLETTRRLRQRLPQLRVLFF
SEHTELGLVRQALDAGACGFLSKAAAPAVVLEAVRRVLAGHAYIEQPLATQLACQPHPGQGGGNARLQGLTQREIEVFLM
LAKGTPTRLIAQQLCISAKTVSNYLTLLKSKLQVSSHAELVHLAIEAGLLRIAA
>P0ABU7 ~~~exbB~~~Biopolymer transport protein ExbB~~~COG0811
MGNNLMQTDLSVWGMYQHADIVVKCVMIGLILASVVTWAIFFSKSVEFFNQKRRLKREQQLLAEARSLNQANDIAADFGS
KSLSLHLLNEAQNELELSEGSDDNEGIKERTSFRLERRVAAVGRQMGRGNGYLATIGAISPFVGLFGTVWGIMNSFIGIA
QTQTTNLAVVAPGIAEALLATAIGLVAAIPAVVIYNVFARQIGGFKAMLGDVAAQVLLLQSRDLDLEASAAAHPVRVAQK
LRAG
>P0ABV2 ~~~exbD~~~Biopolymer transport protein ExbD~~~COG0848
MAMHLNENLDDNGEMHDINVTPFIDVMLVLLIIFMVAAPLATVDVKVNLPASTSTPQPRPEKPVYLSVKADNSMFIGNDP
VTDETMITALNALTEGKKDTTIFFRADKTVDYETLMKVMDTLHQAGYLKIGLVGEETAKAK
>A0A401ETL2 3.2.1.213~~~bl1,6Gal~~~Exo-beta-1,6-galactobiohydrolase~~~
MRVLSKSLAAMVAAATLVGGGAFAVAGTAYAADNDAITVTPNPWYANSFDGWGTSLAWFANATGSLGEESAITTNLGDDA
SKAKAVEYGKQLREQFYQSIFGDEGLDLNMARYNVGGGNASDVAYGYPFMRQGAAVPGTWKDDATGSGTYGNGVTTKQAD
KDKLAAAFDPTDDNQYDFSKSAAQDWWIERGATGDNPDITDVEAFANSAPWFLTNSGYATGGRNSGSNNLANPEKFAQYM
AKNVEHLESLGANVDTVEPFNESETSYWGTPGDMASKYTDESDDNTKLINNYWDKYYSDKDRSVTPYSNALKKPQEGMHV
SNAQQQQTITALAEALKDNDDTIIAATDATNSADFVKSYNQYPQAIKDLIGQYNVHAYSDSNQMQSRDIAQADGKKLSMS
EVDGSWQSGSYNPYGFDNALGMMSKISSNVTRLQSKDFTFWQVVEDLYNMQMGSNVNPAGENTNWGTVLIDFDCTVAGMD
GKLYSERRVNNNGGTTDGLEPCTVIANAKYNGVKAITHFIHAGDKVIANNDEDNNMTATSDDGKTQTVIHRNSGTSDQTF
VIDLSKYGEIADNAYGELYLTTETSAEDKNAGVDSATPEVFAKTSNVKQAEGSVMIDKAAKTATVTVPARSIASIQLTGV
TGYAKDAAVETGDTYQLVGKQSGKAVADTTSGDSALSLANVASDAENAKKQTWTFTQIEQPTDSERPDLKAYVITNAEGK
VLVSKDGTNALSNETVGAAKSDPAAKWILNTSDGSTYQLLNAATKTNLDVDNSGTTVGTKVGLWQSPSGTSPSANQTWTL
RNVTPTSQKTVNVQTAVNEKAVLPVEVTLYYTWGEGKATVANWDTSKVDVAKEGAYEATATATDVYGNEFNVTATVYVGA
LTVSDPVSATVLAGTSASEAKAALEAAPVYLHVKASPAFEGDAAKVTWNFDGLDTKLADAKAGDNIAVTGTYQLDDATTI
ALKGAIYVTAATPENVADTASNLTVTNQQTEYSKGDQWKKLTDGDTSAEAWVTWNSAGDYSASPTATIDFGSECELSSVT
ITYGDKAPASAKAEYTTDGETWMQFGSDVKPAAGQTVTFKADKGTVNATKMRIVNTVNNDYMNATEIQAFVTPVQGAAKN
IAAASGTNFSVNFQEGASASKAIDGDTTSKGWSTWASTASTVDPVATFTFDEAQTITEVKTFFYYDGRASWPKSQTLEYQ
DEAGEWHGVGTKDGWKIQAGDAGSGSDGITAADTPTVDFVLGTPVKAKAIRLTNTLQDTKVYINVAEIQVFAQDSTVLTP
QPASDATLGDLRLDGETVEGFDPSKTDYTVDLPVDAEANPVLQAFATDNAAAVKVTGDAVENGKLGGKAAITVTSADESE
TKTYTVTFNAFTLASLKVIGPTKTEYAIGDKLDTAGLKVTAVYQSGDKTKEVPVALDDPQLAIGSFDSTTAGKKAITVSY
RGVTATFNVTVKANAVAPGPEEQKPGNTNKPGATGNGNKNTVANTGSSVAAIAGAVALLAAAAGALFMLRKRA
>P37454 3.1.11.2~~~exoA~~~Exodeoxyribonuclease~~~COG0708
MKLISWNVNGLRAVMRKMDFLSYLKEEDADIICLQETKIQDGQVDLQPEDYHVYWNYAVKKGYSGTAVFSKQEPLQVIYG
IGVEEHDQEGRVITLEFENVFVMTVYTPNSRRGLERIDYRMQWEEALLSYILELDQKKPVILCGDLNVAHQEIDLKNPKA
NRNNAGFSDQEREAFTRFLEAGFVDSFRHVYPDLEGAYSWWSYRAGARDRNIGWRIDYFVVSESLKEQIEDASISADVMG
SDHCPVELIINI
>P33693 3.2.1.-~~~exoK~~~Endo-1,3-1,4-beta-glycanase ExoK~~~COG2273
MTIDRYRRFARLAFIATLPLAGLATAAAAQEGANGKSFKDDFDTLDTRVWFVSDGWNNGGHQNCTWSKKQVKTVDGILEL
TFEEKKVKERNFACGEIQTRKRFGYGTYEARIKAADGSGLNSAFFTYIGPADKKPHDEIDFEVLGKNTAKVQINQYVSAK
GGNEFLADVPGGANQGFNDYAFVWEKNRIRYYVNGELVHEVTDPAKIPVNAQKIFFSLWGTDTLTDWMGTFSYKEPTKLQ
VDRVAFTAAGDECQFAESVACQLERAQSE
>G3XDA1 ~~~exoS~~~Secreted exoenzyme S~~~
MHIQSLQQSPSFAVELHQAASGRLGQIEARQVATPSEAQQLAQRQDAPKGEGLLARLGAALVRPFVAIMDWLGKLLGSHA
RTGPQPSQDAQPAVMSSAVVFKQMVLQQALPMTLKGLDKASELATLTPEGLAREHSRLASGDGALRSLSTALAGIRAGSQ
VEESRIQAGRLLERSIGGIALQQWGTTGGAASQLVLDASPELRREITDQLHQVMSEVALLRQAVESEVSRVSADKALADG
LVKRFGADAEKYLGRQPGGIHSDAEVMALGLYTGIHYADLNRALRQGQELDAGQKLIDQGMSAAFEKSGQAEQVVKTFRG
TRGGDAFNAVEEGKVGHDDGYLSTSLNPGVARSFGQGTISTVFGRSGIDVSGISNYKNEKEILYNKETDMRVLLSASDEQ
GVTRRVLEEAALGEQSGHSQGLLDALDLASKPERSGEVQEQDVRLRMRGLDLA
>Q9I788 ~~~exoT~~~Exoenzyme T~~~
MHIQSSQQNPSFVAELSQAVAGRLGQVEARQVATPREAQQLAQRQEAPKGEGLLSRLGAALARPFVAIIEWLGKLLGSRA
HAATQAPLSRQDAPPAASLSAAEIKQMMLQKALPLTLGGLGKASELATLTAERLAKDHTRLASGDGALRSLATALVGIRD
GSRIEASRTQAARLLEQSVGGIALQQWGTAGGAASQHVLSASPEQLREIAVQLHAVMDKVALLRHAVESEVKGEPVDKAL
ADGLVEHFGLEAEQYLGEHPDGPYSDAEVMALGLYTNGEYQHLNRSLRQGRELDAGQALIDRGMSAAFEKSGPAEQVVKT
FRGTQGRDAFEAVKEGQVGHDAGYLSTSRDPGVARSFAGQGTITTLFGRSGIDVSEISIEGDEQEILYDKGTDMRVLLSA
KDGQGVTRRVLEEATLGERSGHGEGLLDALDLATGTDRSGKPQEQDLRLRMRGLDLA
>P0AEK0 3.1.11.-~~~exoX~~~Exodeoxyribonuclease 10~~~COG0847
MLRIIDTETCGLQGGIVEIASVDVIDGKIVNPMSHLVRPDRPISPQAMAIHRITEAMVADKPWIEDVIPHYYGSEWYVAH
NASFDRRVLPEMPGEWICTMKLARRLWPGIKYSNMALYKTRKLNVQTPPGLHHHRALYDCYITAALLIDIMNTSGWTAEQ
MADITGRPSLMTTFTFGKYRGKAVSDVAERDPGYLRWLFNNLDSMSPELRLTLKHYLENT
>P9WJ72 3.1.13.-~~~~~~3'-5' exoribonuclease MT2234.1~~~
MRYFYDTEFIEDGHTIELISIGVVAEDGREYYAVSTEFDPERAGSWVRTHVLPKLPPPASQLWRSRQQIRLDLEEFLRID
GTDSIELWAWVGAYDHVALCQLWGPMTALPPTVPRFTRELRQLWEDRGCPRMPPRPRDVHDALVDARDQLRRFRLITSTD
DAGRGAAR
>P9WJ73 3.1.13.-~~~~~~3'-5' exoribonuclease Rv2179c~~~
MRYFYDTEFIEDGHTIELISIGVVAEDGREYYAVSTEFDPERAGSWVRTHVLPKLPPPASQLWRSRQQIRLDLEEFLRID
GTDSIELWAWVGAYDHVALCQLWGPMTALPPTVPRFTRELRQLWEDRGCPRMPPRPRDVHDALVDARDQLRRFRLITSTD
DAGRGAAR
>Q6B4J5 ~~~exsA~~~Spore coat assembly protein ExsA~~~
MKIHIVQKGDTLWKIAKKYGVDFDTLKKTNTQLSNPDLIMPGMKIKVPSKSVHMKQQAGAGSAPPKQYVKEVQQKEFAAT
PTPLGIEDEEEVTYQSAPITQQPAMQQTQKEVQIKPQKEMQVKPQKEVQVKPQKEMQVKPQKEVQKEQPIQKEKPVEKPS
VIQKPPVIEKQKPAEKENTKFSVNVLPQPPQPPIKPKKEYKISDVIKKGSELIAPQISKMKPNNIISPQTKKNNIISPQV
KKENVGNIVSPQVKKENVGNIVSPQVKKENVGNIVSPQVKKENVGNIVSPQVKKENVGNIVSPQVKKENVGNIVSPQVKK
ENVGNIVSPNVSKENVVIPQVIPPNIQMPNIMPIMDNNQPPNIMPIMDNNQPPNIMPIMDNNQMPNMMPIMDNNQMPNMM
PIMDNNQMPNMMPIMDNNQMPNMMPIMDNNQMPNMMPIMDNNQMPNMMPIMDNNQMPNMMPIMDNNQMPNIMPIMDNNQM
PNMMPIMDNNQMPNIMPIMDNNQMPNMMPIMDNNQPPNMMPYQMPYQQPMMPPNPYYQQPNPYQMPYQQGAPFGPQHTSM
PNQNMMPMDNNMPPLVQGEEDCGCGGESRLYSPQPGGPQYANPLYYQPTQSAYAPQPGTMYYQPDPPNVFGEPVSEEEDE
EEV
>P26993 ~~~exsA~~~HTH-type transcriptional regulator ExsA~~~
MQGAKSLGRKQITSCHWNIPTFEYRVNKEEGVYVLLEGELTVQDIDSTFCLAPGELLFVRRGSYVVSTKGKDSRILWIPL
SAQFLQGFVQRFGALLSEVERCDEPVPGIIAFAATPLLAGCVKGLKELLVHEHPPMLACLKIEELLMLFAFSPQGPLLMS
VLRQLSNRHVERLQLFMEKHYLNEWKLSDFSREFGMGLTTFKELFGSVYGVSPRAWISERRILYAHQLLLNSDMSIVDIA
MEAGFSSQSYFTQSYRRRFGCTPSRSRQGKDECRAKNN
>P26994 ~~~exsB~~~Type 3 secretion system pilotin~~~
MLLPLALLLGGCVSQPAPMSPKVTVGGSVGGVSLQARQAQLRLRLYAVVQGRMQTIAERRYRVSGLPLRYAFDLEVDRLE
GEALYLRTELSWVGVAAVQASAWQQVAAGVDERVRLVRRDCFPNCTAARPEERSGND
>P26995 ~~~exsC~~~Transcriptional anti-antiactivator ExsC~~~
MDLTSKVNRLLAEFAGRIGLPSLSLDEEGMASLLFDEQVGVTLLLLAERERLLLEADVAGIDVLGEGIFRQLASFNRHWH
RFDLHFGFDELTGKVQLYAQILAAQLTLECFEATLANLLDHAEFWQRLLPCDSDREAVAAVGMRV
>Q9I321 ~~~exsD~~~Transcriptional antiactivator ExsD~~~
MEQEDDKQYSREAVFAGRRVSVVGSDARSRGRVPGYASSSLYRESGIISARQLALLQRMLPRLRLEQLFRCEWLQQRLAR
GLALGREEVRQILLCAAQDDDGWCSELGDRVNLAVPQSMIDWVLLPVYGWWESLLDQAIPGWRLSLVELETQSRQLRVKS
EFWSRVAELEPEQAREELARVAKCQARTQEQVAELAGKLETASALAKSAWPNWQRGMATLLASGGLAGFEPIPEVLECLW
QPLCRLDDDVGAADAVQAWLHERNLCQAQDHFYWQS
>Q9I322 ~~~exsE~~~Type III secretion regulatory protein ExsE~~~
MKIESISPVQPSQDAGAEAVGHFEGRSVTRAAVRGEDRSSVAGLARWLARNVAGDPRSEQALQRLADGDGTPLEARTVRR
R
>O33680 3.2.1.-~~~exsH~~~Endo-1,3-1,4-beta-glycanase ExsH~~~COG2273
MSKTVLNAVGTPLYYSGSSTAWFSATGSGPTLHGTAGNDSMWGDSSVNVTMIGGRGDDIYYLYSSINRAYEAAGEGVDTI
STWMSYTLPANFENLTVTGSGRFAFGNEADNIIKGGSGTQTIDGRGGNDVLIGAGGADTFVFARGNGSDLITDFNYDDIV
RLDGYGFTSFEQILSNVAQEGADLRLHLADGESLVFANTTADELQAHQFRLSLDRSVLSQTFSDEFNTLQLRNGTSGVWD
AKFWWAPEKGATLSSNGEQQWYINPSYEPTASVNPFSVNNGVLTITAAPASEAIQAEINGYDYTSGMLTTYSSFAQTYGY
FEMRADMPDDQGVWPAFWLLPADGSWPPELDVVEMRGQDSNTVIATVHSNETGSRTSIENSVKVADASGFHTYGVLWTEE
EIVWYFDDAAIARADTPSDMHDPMYMLVNLAVGGIAGTPRDGLADGSEMKIDYIKAYSLDADWQI
>Q9JMQ1 ~~~exuR~~~Probable HTH-type transcriptional repressor ExuR~~~COG1609
MVTIKDIAKLANVSHTTVSRALNNSPYIKEHTKKKILELAEQLNYTPNVNAKSLAMQKSHTIGLFFTSITNGTSHSFFAD
TIKGVNQAISEDYNLYVRGIDDLKNYDSVTPMRYDGIILMSQSDIDNSFIYHIREKNIPLVVLNRDIDDRTITNILSNDK
EGSQEAVEYFIQSGHQDIAIIEGIEGFKSSQQRKEGYLSALIQHHIPIKHEYSVKGQYDMESGFQAMERLLALPNPPTAV
FCSNDDMAIGAMNAIFAKGLRVPDDISVIGFDDIGFSQYITPRLSTVKRPVEKISVLGAQKLLSLISEPETKAEKILENT
EFMVRDSVRRLTT
>O34456 ~~~exuT~~~Hexuronate transporter~~~COG2271
MFSKDKLPVILFLFLAGVINYLDRSALSIAAPFIQDDLTLSATQMGLIFSSFSIGYAIFNFLGGVASDRYGAKLTLFVAM
VVWSLFSGAVALAFGFVSLLIIRILFGMGEGPLSATINKMVNNWFPPTQRASVIGVTNSGTPLGGAISGPIVGMIAVAFS
WKVSFVLIMIIGLIWAVLWFKFVKEKPQETIKEAPAIKAETSPGEKIPLTFYLKQKTVLFTAFAFFAYNYILFFFLTWFP
SYLVDERGLSVESMSVITVIPWILGFIGLAAGGFVSDYVYKKTARKGVLFSRKVVLVTCLFSSAVLIGFAGLVATTAGAV
TLVALSVFFLYLTGAIYWAVIQDVVDQNNVGSVGGFMHFLANTAGIIGPALTGFIVDQTGTFSGAFLLAGGLAVFASLAV
IRFVRPIIGKPAGTEAENPVSY
>P94774 ~~~exuT~~~Galacturonate transporter~~~
MFKIKGLRWYMIGLVTIGTVLGYLTRNAIAAAAPTLQEQLHISTQQYSYIIAAYSACYTIMQPVAGYVLDVLGTKVGYAM
FAILWALFCAGTALANSWGGLAVARGAVGMAEAAMIPAGLKASSEWFPAKERSVAVGYFNVGSSIGGMLAPPLVVWAIMA
HSWQMAFLITGALSLVWALCWLYFYKHPKDQKKLSTEEREYILSGQEAQHQAGNAKRMSAWQILRNRQFWGIALPRFLAE
PAWGTFNAWIPLFMFKAYGFNLKEIAMFAWMPMLFADLGCILGGYMPMLFQKYFKVNLIVSRKLVVTLGALLMIGPGTIG
LFTSPYVAIACCASAALPTSPCPVR
>P0AA78 ~~~exuT~~~Hexuronate transporter~~~COG2271
MRKIKGLRWYMIALVTLGTVLGYLTRNTVAAAAPTLMEELNISTQQYSYIIAAYSAAYTVMQPVAGYVLDVLGTKIGYAM
FAVLWAVFCGATALAGSWGGLAVARGAVGAAEAAMIPAGLKASSEWFPAKERSIAVGYFNVGSSIGAMIAPPLVVWAIVM
HSWQMAFIISGALSFIWAMAWLIFYKHPRDQKHLTDEERDYIINGQEAQHQVSTAKKMSVGQILRNRQFWGIALPRFLAE
PAWGTFNAWIPLFMFKVYGFNLKEIAMFAWMPMLFADLGCILGGYLPPLFQRWFGVNLIVSRKMVVTLGAVLMIGPGMIG
LFTNPYVAIMLLCIGGFAHQALSGALITLSSDVFGRNEVATANGLTGMSAWLASTLFALVVGALADTIGFSPLFAVLAVF
DLLGALVIWTVLQNKPAIEVAQETHNDPAPQH
>O34894 ~~~ezrA~~~Septation ring formation regulator EzrA~~~COG4477
MEFVIGLLIVLLALFAAGYFFRKKIYAEIDRLESWKIEILNRSIVEEMSKIKHLKMTGQTEEFFEKWREEWDEIVTAHMP
KVEELLYDAEENADKYRFKKANQVLVHIDDLLTAAESSIEKILREISDLVTSEEKSREEIEQVRERYSKSRKNLLAYSHL
YGELYDSLEKDLDEIWSGIKQFEEETEGGNYITARKVLLEQDRNLERLQSYIDDVPKLLADCKQTVPGQIAKLKDGYGEM
KEKGYKLEHIQLDKELENLSNQLKRAEHVLMTELDIDEASAILQLIDENIQSVYQQLEGEVEAGQSVLSKMPELIIAYDK
LKEEKEHTKAETELVKESYRLTAGELGKQQAFEKRLDEIGKLLSSVKDKLDAEHVAYSLLVEEVASIEKQIEEVKKEHAE
YRENLQALRKEELQARETLSNLKKTISETARLLKTSNIPGIPSHIQEMLENAHHHIQETVNQLNELPLNMEEAGAHLKQA
EDIVNRASRESEELVEQVILIEKIIQFGNRFRSQNHILSEQLKEAERRFYAFDYDDSYEIAAAAVEKAAPGAVEKIKADI
SA
>A7X3E7 ~~~ezrA~~~Septation ring formation regulator EzrA~~~
MVLYIILAIIVIILIAVGVLFYLRSNKRQIIEKAIERKNEIETLPFDQNLAQLSKLNLKGETKTKYDAMKKDNVESTNKY
LAPVEEKIHNAEALLDKFSFNASQSEIDDANELMDSYEQSYQQQLEDVNEIIALYKDNDELYDKCKVDYREMKRDVLANR
HQFGEAASLLETEIEKFEPRLEQYEVLKADGNYVQAHNHIAALNEQMKQLRSYMEEIPELIRETQKELPGQFQDLKYGCR
DLKVEGYDLDHVKVDSTLQSLKTELSFVEPLISRLELEEANDKLANINDKLDDMYDLIEHEVKAKNDVEETKDIITDNLF
KAKDMNYTLQTEIEYVRENYYINESDAQSVRQFENEIQSLISVYDDILKEMSKSAVRYSEVQDNLQYLEDHVTVINDKQE
KLQNHLIQLREDEAEAEDNLLRVQSKKEEVYRRLLASNLTSVPERFIIMKNEIDHEVRDVNEQFSERPIHVKQLKDKVSK
IVIQMNTFEDEANDVLVNAVYAEKLIQYGNRYRKDYSNVDKSLNEAERLFKNNRYKRAIEIAEQVLESVEPGVTKHIEEE
VIKQ
>P64003 ~~~ezrA~~~Septation ring formation regulator EzrA~~~
MVLYIILAIIVIILIAVGVLFYLRSNKRQIIEKAIERKNEIETLPFDQNLAQLSKLNLKGETKTKYDAMKKDNVESTNKY
LAPVEEKIHNAEALLDKFSFNASQSEIDDANELMDSYEQSYQQQLEDVNEIIALYKDNDELYDKCKVDYREMKRDVLANR
HQFGEAASLLETEIEKFEPRLEQYEVLKADGNYVQAHNHIAALNEQMKQLRSYMEEIPELIRETQKELPGQFQDLKYGCR
DLKVEGYDLDHVKVDSTLQSLKTELSFVEPLISRLELEEANDKLANINDKLDDMYDLIEHEVKAKNDVEETKDIITDNLF
KAKDMNYTLQTEIEYVRENYYINESDAQSVRQFENEIQSLISVYDDILKEMSKSAVRYSEVQDNLQYLEDHVTVINDKQE
KLQNHLIQLREDEAEAEDNLLRVQSKKEEVYRRLLASNLTSVPERFIIMKNEIDHEVRDVNEQFSERPIHVKQLKDKVSK
IVIQMNTFEDEANDVLVNAVYAEKLIQYGNRYRKDYSNVDKSLNEAERLFKNNRYKRAIEIAEQVLESVEPGVTKHIEEE
VIKQ
>P0A993 3.1.3.11~~~fbp~~~Fructose-1,6-bisphosphatase class 1~~~COG0158
MKTLGEFIVEKQHEFSHATGELTALLSAIKLGAKIIHRDINKAGLVDILGASGAENVQGEVQQKLDLFANEKLKAALKAR
DIVAGIASEEEDEIVVFEGCEHAKYVVLMDPLDGSSNIDVNVSVGTIFSIYRRVTPVGTPVTEEDFLQPGNKQVAAGYVV
YGSSTMLVYTTGCGVHAFTYDPSLGVFCLCQERMRFPEKGKTYSINEGNYIKFPNGVKKYIKFCQEEDKSTNRPYTSRYI
GSLVADFHRNLLKGGIYLYPSTASHPDGKLRLLYECNPMAFLAEQAGGKASDGKERILDIIPETLHQRRSFFVGNDHMVE
DVERFIREFPDA
>B2FU10 3.1.3.11~~~fbp~~~Fructose-1,6-bisphosphatase class 1~~~COG0158
MSRTSLTRFLIQEQHAGRINADLRQLIAVVARACTSISIAVSKGALGGVLGDAGTGNVQGEAQKKLDVISNEILLEANAW
GGHLAACASEEMDHSQPVPDIYPRGDFLLLFDPLDGSSNIDVNVSVGTIFSVLRCPTNVELPGDDAFLQPGSKQIAAGYC
IYGPSTQLVLTVGHGTHAFTLDREKGEFVLTTENMQIPAATQEFAINMSNQRHWEAPMQAYVGDLLAGKEGTRGKNFNMR
WIASMVADVHRILTRGGIFIYPWDKKDPSKAGKLRLMYEANPMGLLVEQAGGAAWTGRERILDIQPDQLHQRVPVFLGSR
EEVAEAVRYHHAHDNAQG
>Q59943 3.1.3.11~~~fbp~~~Fructose-1,6-bisphosphatase class 1~~~COG0158
MAQSTTSETHTRDLDRDCTTLSRHVLEQLQSFSPEAQDLAALMQRIGLAAKLIARRLSHAGLVDDALGFTGEINVQGEAV
KRMDVYANQVFISVFRQSGLVCRLASEEMEKPYYIPENCPIGRYTLLYDPLDGSANVDVDLNVGSIFAVRRQEFYDESHE
AKDLLQPGDRQIAAGYVLYGASTLLVYSMGQGVHVFVLDPSLGEFVLAQSDIQLPNSGQIYSVNEGNFWQWPEGYRQYIR
EMHRREGYSGRYSGALVADFHRILMQGGVFLYPETVKNPTGKLRLLYEAAPMAFLAEQAGGKASDGQKPILLRQPQALHE
RCPLIIGSAADVDFVEACLAESVP
>Q45597 3.1.3.11~~~fbp~~~Fructose-1,6-bisphosphatase class 3~~~COG3855
MESKYLDLLAQKYDCEEKVVTEIINLKAILNLPKGTEHFVSDLHGEYQAFQHVLRNGSGRVKEKIRDIFSGVIYDREIDE
LAALVYYPEDKLKLIKHDFDAKEALNEWYKETIHRMIKLVSYCSSKYTRSKLRKALPAQFAYITEELLYKTEQAGNKEQY
YSEIIDQIIELGQADKLITGLAYSVQRLVVDHLHVVGDIYDRGPQPDRIMEELINYHSVDIQWGNHDVLWIGAYSGSKVC
LANIIRICARYDNLDIIEDVYGINLRPLLNLAEKYYDDNPAFRPKADENRPEDEIKQITKIHQAIAMIQFKLESPIIKRR
PNFNMEERLLLEKIDYDKNEITLNGKTYQLENTCFATINPEQPDQLLEEEAEVIDKLLFSVQHSEKLGRHMNFMMKKGSL
YLKYNGNLLIHGCIPVDENGNMETMMIEDKPYAGRELLDVFERFLREAFAHPEETDDLATDMAWYLWTGEYSSLFGKRAM
TTFERYFIKEKETHKEKKNPYYYLREDEATCRNILAEFGLNPDHGHIINGHTPVKEIEGEDPIKANGKMIVIDGGFSKAY
QSTTGIAGYTLLYNSYGMQLVAHKHFNSKAEVLSTGTDVLTVKRLVDKELERKKVKETNVGEELLQEVAILESLREYRYM
K
>Q7A3I5 3.1.3.11~~~fbp~~~Fructose-1,6-bisphosphatase class 3~~~
MTQITEKELKKKYLDLLSQNFDTPEKLATEIINLESILELPKGTEHFVSDLHGEYEAFQHVLRNGSGNVRAKINDIFKER
LSTKELNDLTALVYYPEDKLKLIKSDFQSCGQLNVWYITTIEHLIELIKYCSSKYTRSKLRKALPKQYVYIIEELLYKSN
EYQNKKSYYETLVNQVIELKQADDLIIGLAYSVQRLVVDHLHVVGDIYDRGPQPDKIMDTLINYHSLDIQWGNHDVLWVG
AYAGSKVCLANLLRICARYDNLDIIEDAYGINLRPLLTLAEKYYDADNPAFKPKKRPDKHERLTQREESQITKIHQAIAM
IQFKLEIPIIKRRPNFEMEERLVLEKVNYDTNEITVYGNTYPLKDTCFQTINRNNPAELLPEEEEVMNKLLLSFQQSEKL
RRHMSFLMRKGSLYLPYNGNLLIHGCIPVDENGEMESFEIDGHTYSGQELLDVFEYHVRKSFDEKENTDDLSTDLVWYLW
TGKYSSLFGKRAMTTFERYFIADKASHKEEKNPYYHLREDVNMVRKMLSDFGLNPDEGRIINGHTPVKEINGEDPIKADG
KMLVIDGGFSKAYQSTTGIAGYTLLYNSFGMQLVAHQQFNAKEKILSEGIDELSIKRVVDKELQRKKIRDTNIGKELQAQ
IDILKMLMHDRYLD
>Q99003 ~~~~~~F17a-G fimbrial adhesin~~~
MTNFYKVFLAVFILVCCNISQAAVSFIGSTENDVGPSLGSYSRTHAMDNLPFVYDTRNKIGYQNANVWHISKGFCVGLDG
KVDLPVVGSLDGQSIYGLTEEVGLLIWMGDTKYSRGTAMSGNSWENVFSGWCVGANTASTQGLSVRVTPVILKRNSSARY
SVQKTSIGSIRMRPYNGSSAGSVQTTVNFSLNPFTLNDTVTSCRLLTPSAVNVSLAAISAGQLPSSGDEVVAGTTSLKLQ
CDAGVTVWATLTDATTPSNRSDILTLTGASTATGVGLRIYKNTDSTPLKFGPDSPVKGNENQWQLSTGTETSPSVRLYVK
YVNTGEGINPGTVNGISTFTFSYQ
>Q47200 ~~~~~~F17b-G fimbrial adhesin~~~
MTNFYKVFLAVFILVCCNISHAVVSFIGSTENDVGPSQGSYSSTHAMDNLPFVYNTGYNIGYQNANVWRIGGGFCVGLDG
KVDLPVVGSLDGQSIYGLTEEVGLLIWMGDTNYSRGTAMSGNSWENVFSGWCVGNYLSTQGLSVHVRPVILKRNSSAQYS
VQKTSIGSIRMRPYNGSSAGSVQTTVNFSLNPFTLNDTVTSCRLLTPSAVNVSLAAISAGQLPSSGDEVVAGTTSLKLQC
DAGVTVWATLTDATTPSNRSDILTLTGASTATGVGLRIYKNTDSTPLKFGPDSPVKGNENQWQLSTGTETSPSVRLYVKY
VNTGEGINPGTVNGISTFTFSYQ
>Q47033 ~~~~~~F17c-G fimbrial adhesin~~~
MTNFYKVFLAVFILVCCNISHAAVSFIGSTENDVGPSQGSYSSTHAMDNLPFVYNTGYNIGYQNANVWRISGGFCVGLDG
KVDLPVVGSLDGQSIYGLTEEVGLLIWMGDTNYSRGTAMSGNSWENVFSGWCVGNYVSTQGLSVHVRPVILKRNSSAQYS
VQKTSIGSIRMRPYNGSSAGSVQTTVNFSLNPFTLNDTVTSCRLLTPSAVNVSLAAISAGQLPSSGDEVVAGTTSLKLQC
DAGVTVWATLTDATTPSNRSDILTLTGASTATGVGLRIYKNTDSTPLKFGPDSPVKGNENQWQLSTGTETSPSVRLYVKY
VNTGEGINPGTVNGISTFTFSYQ
>Q47199 ~~~~~~F17d-G fimbrial adhesin~~~
MTNFYKVFLAVFILVCCNISQAAVSFIGSTENDVGPSPGSYSRTHAMDNLPFVYNTGNNIGYQNANVWRISKGFCVGLDG
KVDLPVVGSLDGQSIYGLTEEVGLLIWMGDTNYSRGTAMSGNSWENVFSGWCVGANTASTQGLSVRVTPVILKRNSSARY
SVQKTSIGSIRMRPYNGSSAGSVQTTVNFSLNPFTLNDTVTSCRLLTPSAVNVSLAAISAGQLPSSGDEVVAGTTSLKLQ
CDAGVTVWATLTDATTPSNRSDILTLTGASTATGVGLRIYKNTDSTPLKFGPDSPVKGNENQWQLSTGTETSPSVRLYVK
YVNTGEGINPGTVNGISTFTFSYQ
>Q9RH92 ~~~~~~F17e-G fimbrial adhesin~~~
MTNFYKVFLAVFILVCCNISHAAVSFIGSTENDVGPSQSSYSRTHAMDNLPFVYNTGYNIGYQNANVWRISGGFCVGLDG
KVDLPVVGSLDGQSIYGLTEEVGLLIWMGDTNYSRGTAMSGNSWENVFSGWCVGNYVSTQGLSVHVRPVILKRNSSAQYS
VQKTSIGSIRMRPYNGSSAGSVQTTVNFSLNPFTLNDTVTSCRLLTPSAVNVSLAAISAGQLPSSGDEVVAGTTSLKLQC
DAGVTVWATLTDATTPSNRSDILTLTGASTATGVGLRIYKNTDSTPLKFGPDSPVKGNENQWQLSTGTETSPSVRLYVKY
VNTGEGINPGTVNGISTFTFSYQ
>Q9RH91 ~~~~~~F17f-G fimbrial adhesin~~~
MTNFYKVFLAVFILVCCNISHAAVSFIGSTENDVGPSQGSYSSTHAMDNLPFVYNTGHNIGYQNANVWRISGGFCVGLDG
KVDLPVVGSLDGQSIYGLTEEVGLLIWMGDTNYSRGTAMSGNSWENVFSGWCVGNYVSTQGLSVHVRPVILKRNSSAQYS
VQKTSIGSIRMRPYNGSSAGSVQTTVNFSLNPFTLNDTVTSCRLLTPSAVNVSLAAISAGQLPSSGDEVVAGTTSLKLQC
DAGVTVWATLTDATTPSNRSDILTLTGASTATGVGLRIYKNTDSTPLKFGPDSPVKGNENQWQLSTGTETSPSVRLYVKY
VNTGEGINPGTVNGISTFTFSYQ
>Q47341 ~~~~~~F17g-G fimbrial adhesin~~~
MTNFYKVCLAVFILVCCNISHAAVSFIGSTENDVGPSQGSYSSTHAMDNLPFVYNTGYNIGYQNANVWRISGGFCVGLDG
KVDLPVVGSLDGQSIYGLTEEVGLLIWMGDTNYSRGTAMSGNSWENVFSGWCVGNYVSTQGLSVHVRPVILKRNSSAQYS
VQKTSIGSIRMRPYNGSSAGSVQTTVNFSLNPFTLNDTVTSCRLLTPSAVNVSLAAISAGQLPSSGDEVVAGTTSLKLQC
DAGVTVWATLTDATTPSNRSDILTLTGASTATGVGLRIYKNTDSTPLKFGPDSPVKGNENQWQLSTGTETSPSVRLYVKY
VNTGEGINPGTVNGISTFTFSYQ
>O06553 1.-.-.-~~~~~~F420H(2)-dependent reductase Rv1155~~~COG3467
MARQVFDDKLLAVISGNSIGVLATIKHDGRPQLSNVQYHFDPRKLLIQVSIAEPRAKTRNLRRDPRASILVDADDGWSYA
VAEGTAQLTPPAAAPDDDTVEALIALYRNIAGEHSDWDDYRQAMVTDRRVLLTLPISHVYGLPPGMR
>Q7TXK7 6.2.1.50~~~~~~p-hydroxybenzoic acid--AMP ligase FadD22~~~
MRNGNLAGLLAEQASEAGWYDRPAFYAADVVTHGQIHDGAARLGEVLRNRGLSSGDRVLLCLPDSPDLVQLLLACLARGV
MAFLANPELHRDDHALAARNTEPALVVTSDALRDRFQPSRVAEAAELMSEAARVAPGGYEPMGGDALAYATYTSGTTGPP
KAAIHRHADPLTFVDAMCRKALRLTPEDTGLCSARMYFAYGLGNSVWFPLATGGSAVINSAPVTPEAAAILSARFGPSVL
YGVPNFFARVIDSCSPDSFRSLRCVVSAGEALELGLAERLMEFFGGIPILDGIGSTEVGQTFVSNRVDEWRLGTLGRVLP
PYEIRVVAPDGTTAGPGVEGDLWVRGPAIAKGYWNRPDSPVANEGWLDTRDRVCIDSDGWVTYRCRADDTEVIGGVNVDP
REVERLIIEDEAVAEAAVVAVRESTGASTLQAFLVATSGATIDGSVMRDLHRGLLNRLSAFKVPHRFAVVDRLPRTPNGK
LVRGALRKQSPTKPIWELSLTEPGSGVRAQRDDLSASNMTIAGGNDGGATLRERLVALRQERQRLVVDAVCAEAAKMLGE
PDPWSVDQDLAFSELGFDSQMTVTLCKRLAAVTGLRLPETVGWDYGSISGLAQYLEAELAGGHGRLKSAGPVNSGATGLW
AIEEQLNKVEELVAVIADGEKQRVADRLRALLGTIAGSEAGLGKLIQAASTPDEIFQLIDSELGK
>B2HIL6 6.2.1.50~~~~~~p-hydroxybenzoic acid--AMP ligase FadD22~~~COG0236
MRNENVAGLLAERASEAGWTDQPAYYAPDVVTHGQIHDGAARLGAVLANRGLCRGDRVLLCMPDSPELVQVLLACLARGI
LAFLANPELHRDDHAFQERDTQAALVITSGPLCDRFAPSTVVDAADLFSEAARVGPADYEILGGDAAAYATYTSGTTGPP
KAAIHRHCDVFAFVEAMCRNALRLTPADIGLSSARMYFAYGLGNSVWFPLATGSSAVVNPLPVGAEVAATLSARFEPSVL
YGVPNFFARVVDACSADSFRSVRCVVSAGEALEVGLAERLTEFFGGIPILDGVGSTEVGQTFVSNTVDEWRPGSLGKVLP
PYQIRVVAPDGAAAGPGVEGDLWVRGPSIAESYWNWPEPLLTDEGWLDTRDRVCIDDDGWVTYACRADDTEIVGAVNINP
REIERLIVEEDAVAEVAVVGVKEATGASTLQAFLVPASAEGIDGSVMRDIHRRLLTRLSAFKVPHRFAVVERLPRTANGK
LLRSALRGQTPAKPIWELASAEHRSGAPGQLDDQSASALVSGSREVSLKERLAALQQERHRLVLDAVCGETAKMLGEPDP
RSVNRDLAFSELGFDSQMTVELCHRLAAATGLRVPETVGWDYGSISGLAQYLEAELSGADRRVTPQSARSGARALPLIEA
QLNKVEELTAAIADGEKPRVAERLRALLGTITEGQEHWGQRIAAASTPDEIFQLIDSEFGES
>P9WQ61 6.2.1.50~~~~~~p-hydroxybenzoic acid--AMP ligase FadD22~~~COG0236
MRNGNLAGLLAEQASEAGWYDRPAFYAADVVTHGQIHDGAARLGEVLRNRGLSSGDRVLLCLPDSPDLVQLLLACLARGV
MAFLANPELHRDDHALAARNTEPALVVTSDALRDRFQPSRVAEAAELMSEAARVAPGGYEPMGGDALAYATYTSGTTGPP
KAAIHRHADPLTFVDAMCRKALRLTPEDTGLCSARMYFAYGLGNSVWFPLATGGSAVINSAPVTPEAAAILSARFGPSVL
YGVPNFFARVIDSCSPDSFRSLRCVVSAGEALELGLAERLMEFFGGIPILDGIGSTEVGQTFVSNRVDEWRLGTLGRVLP
PYEIRVVAPDGTTAGPGVEGDLWVRGPAIAKGYWNRPDSPVANEGWLDTRDRVCIDSDGWVTYRCRADDTEVIGGVNVDP
REVERLIIEDEAVAEAAVVAVRESTGASTLQAFLVATSGATIDGSVMRDLHRGLLNRLSAFKVPHRFAVVDRLPRTPNGK
LVRGALRKQSPTKPIWELSLTEPGSGVRAQRDDLSASNMTIAGGNDGGATLRERLVALRQERQRLVVDAVCAEAAKMLGE
PDPWSVDQDLAFSELGFDSQMTVTLCKRLAAVTGLRLPETVGWDYGSISGLAQYLEAELAGGHGRLKSAGPVNSGATGLW
AIEEQLNKVEELVAVIADGEKQRVADRLRALLGTIAGSEAGLGKLIQAASTPDEIFQLIDSELGK
>P9WQ47 6.2.1.57~~~~~~Long-chain-fatty-acid--AMP ligase FadD23~~~COG0318
MVSLSIPSMLRQCVNLHPDGTAFTYIDYERDSEGISESLTWSQVYRRTLNVAAEVRRHAAIGDRAVILAPQGLDYIVAFL
GALQAGLIAVPLSAPLGGASDERVDAVVRDAKPNVVLTTSAIMGDVVPRVTPPPGIASPPTVAVDQLDLDSPIRSNIVDD
SLQTTAYLQYTSGSTRTPAGVMITYKNILANFQQMISAYFADTGAVPPLDLFIMSWLPFYHDMGLVLGVCAPIIVGCGAV
LTSPVAFLQRPARWLQLMAREGQAFSAAPNFAFELTAAKAIDDDLAGLDLGRIKTILCGSERVHPATLKRFVDRFSRFNL
REFAIRPAYGLAEATVYVATSQAGQPPEIRYFEPHELSAGQAKPCATGAGTALVSYPLPQSPIVRIVDPNTNTECPPGTI
GEIWVHGDNVAGGYWEKPDETERTFGGALVAPSAGTPVGPWLRTGDSGFVSEDKFFIIGRIKDLLIVYGRNHSPDDIEAT
IQEITRGRCAAIAVPSNGVEKLVAIVELNNRGNLDTERLSFVTREVTSAISTSHGLSVSDLVLVAPGSIPITTSGKVRRA
ECVKLYRHNEFTRLDAKPLQASDL
>Q7TXM1 6.2.1.59~~~~~~Long-chain-fatty-acid--AMP ligase FadD26~~~
MPVTDRSVPSLLQERADQQPDSTAYTYIDYGSDPKGFADSLTWSQVYSRACIIAEELKLCGLPGDRVAVLAPQGLEYVLA
FLGALQAGFIAVPLSTPQYGIHDDRVSAVLQDSKPVAILTTSSVVGDVTKYAASHDGQPAPVVVEVDLLDLDSPRQMPAF
SRQHTGAAYLQYTSGSTRTPAGVIVSHTNVIANVTQSMYGYFGDPAKIPTGTVVSWLPLYHDMGLILGICAPLVARRRAV
LMSPMSFLRRPARWMQLLATSGRCFSAAPNFAFELAVRRTSDQDMAGLDLRDVVGIVSGSERIHVATVRRFIERFAPYNL
SPTAIRPSYGLAEATLYVAAPEAGAAPKTVRFDYEQLTAGQARPCGTDGSVGTELISYGSPDPSSVRIVNPETMVENPPG
VVGEIWVHGDHVTMGYWQKPKQTAQVFDAKLVDPAPAAPEGPWLRTGDLGVISDGELFIMGRIKDLLIVDGRNHYPDDIE
ATIQEITGGRAAAIAVPDDITEQLVAIIEFKRRGSTAEEVMLKLRSVKREVTSAISKSHSLRVADLVLVSPGSIPITTSG
KIRRSACVERYRSDGFKRLDVAV
>B2HIN2 6.2.1.59~~~~~~Long-chain-fatty-acid--AMP ligase FadD26~~~COG0318
MPVTDRSIPSLLKEQADQRPNETAFTFLDYDLDPNGFAETLTWSQVYARACVVADELTMYGVPGDRVAILAPQGLEYIVA
FLGALQAGFIGVPLSTPQYGVHDERVSAVLRDSQPVAILTTSAVVGDVTKYASSQDGQPAPSVIEVDLLDLDTPRPQQAL
PQPASGSAYLQYTSGSTRTPAGVIVSHENVIANVTQSLYGYFGGPDKFPADTTVVSWLPLFHDMGLILGICAPLVTGCTA
VLLSPMSFLRRPARWMQLLASHPKCFSAAPNFAFELAVRRTTDEDLAGLDLGDVLGIVSGSERIHVATIKRFTERFAPFN
LSPAAVRPSYGLAEATLYVAAPEPGTTPRTVRFDYESLTAGHARPCRADGSVGTELISYGSPDPSAVRIVNPETMIENPS
GTVGEIWAHGEHVAMGYWQKPEQSDRTFNARIVNPAPGTPEGPWLRTGDLGVMSNGELFIMGRIKDLVIVDGRNHYPDDI
EATIQEITGGRVAAIAVPDNITEQLVAIIELKRRGASAEEAMVKLRSVKREITSAISKSHSLRVADVVLVPPGSIPITTS
GKIRRAACVERYRSDGFNRLDVTV
>P9WQ43 6.2.1.59~~~~~~Long-chain-fatty-acid--AMP ligase FadD26~~~COG0318
MPVTDRSVPSLLQERADQQPDSTAYTYIDYGSDPKGFADSLTWSQVYSRACIIAEELKLCGLPGDRVAVLAPQGLEYVLA
FLGALQAGFIAVPLSTPQYGIHDDRVSAVLQDSKPVAILTTSSVVGDVTKYAASHDGQPAPVVVEVDLLDLDSPRQMPAF
SRQHTGAAYLQYTSGSTRTPAGVIVSHTNVIANVTQSMYGYFGDPAKIPTGTVVSWLPLYHDMGLILGICAPLVARRRAM
LMSPMSFLRRPARWMQLLATSGRCFSAAPNFAFELAVRRTSDQDMAGLDLRDVVGIVSGSERIHVATVRRFIERFAPYNL
SPTAIRPSYGLAEATLYVAAPEAGAAPKTVRFDYEQLTAGQARPCGTDGSVGTELISYGSPDPSSVRIVNPETMVENPPG
VVGEIWVHGDHVTMGYWQKPKQTAQVFDAKLVDPAPAAPEGPWLRTGDLGVISDGELFIMGRIKDLLIVDGRNHYPDDIE
ATIQEITGGRAAAIAVPDDITEQLVAIIEFKRRGSTAEEVMLKLRSVKREVTSAISKSHSLRVADLVLVSPGSIPITTSG
KIRRSACVERYRSDGFKRLDVAV
>P9WQ59 6.2.1.49~~~~~~Long-chain-fatty-acid--AMP ligase FadD28~~~COG0318
MSVRSLPAALRACARLQPHDPAFTFMDYEQDWDGVAITLTWSQLYRRTLNVAQELSRCGSTGDRVVISAPQGLEYVVAFL
GALQAGRIAVPLSVPQGGVTDERSDSVLSDSSPVAILTTSSAVDDVVQHVARRPGESPPSIIEVDLLDLDAPNGYTFKED
EYPSTAYLQYTSGSTRTPAGVVMSHQNVRVNFEQLMSGYFADTDGIPPPNSALVSWLPFYHDMGLVIGICAPILGGYPAV
LTSPVSFLQRPARWMHLMASDFHAFSAAPNFAFELAARRTTDDDMAGRDLGNILTILSGSERVQAATIKRFADRFARFNL
QERVIRPSYGLAEATVYVATSKPGQPPETVDFDTESLSAGHAKPCAGGGATSLISYMLPRSPIVRIVDSDTCIECPDGTV
GEIWVHGDNVANGYWQKPDESERTFGGKIVTPSPGTPEGPWLRTGDSGFVTDGKMFIIGRIKDLLIVYGRNHSPDDIEAT
IQEITRGRCAAISVPGDRSTEKLVAIIELKKRGDSDQDAMARLGAIKREVTSALSSSHGLSVADLVLVAPGSIPITTSGK
VRRGACVEQYRQDQFARLDA
>B2HIL4 6.2.1.51~~~~~~4-hydroxyphenylalkanoate adenylyltransferase~~~COG0318
MIMDTNAVSFRARDEVTTQLAPGTGGQAVPTSNGMMTRFAMSESSLTDLLHKAATQYPNRAAYKFIDYDVNPDGFTETLT
WWQIYRRAKIVAEELRGYGASGDRVAVLAPQGLEYIIAFLGVLEAGLIAVPLPVPQFGIHDERISAALQDSTPSVILTTS
PVIDEVTKYAPHARAGQGGTPIVVAVDLLDLDSARELDLTPPAHSSTAYLQYTSGSTRSPAGVVLSHKNVITNCVQLMSD
YLGETEKVPSTAVSWLPFYHDMGLMLGIILPMINQDTAVLLNPMAFLQRPARWMQLMGKFRGQISCAPNFGFELAVRRTS
DEDMAGLDLGHVRGIGSGAERVNPATLQRFIDRFAPFNLRDTAIRPSYGLAEATVFVATAEPGRPPRSVNFDYQSLSVGR
VERCANEADDGAKLVSYGSSWTSEVRIVDPEARTECPAGTVGEIWVQGDNVAMGYWRNPQLTERTFDAKLTDPSPGTSIG
PWLRTGDLGVMFEGELFITGRIKDLLIVDGSNHYPDDIESTIQEITGGRVVAIAVPDADGEKLVTIVEFASWGHSGQEAI
DKLRSVKREITSAISRAHRVRVADVVLVATGSIPVTTSGKVRRSSCAERYRNDGFTRLDRSA
>P95141 6.2.1.51~~~~~~4-hydroxyphenylalkanoate adenylyltransferase~~~COG0318
MKTNSSFHAAGEVATQPAWGTGEQAAQPLNGSTSRFAMSESSLADLLQKAASQYPNRAAYKFIDYDTDPAGFTETVTWWQ
VHRRAMIVAEELWIYASSGDRVAILAPQGLEYIIAFMGVLQAGLIAVPLPVPQFGIHDERISSALRDSAPSIILTTSSVI
DEVTTYAPHACAAQGQSAPIVVAVDALDLSSSRALDPTRFERPSTAYLQYTSGSTRAPAGVVLSHKNVITNCVQLMSDYI
GDSEKVPSTPVSWLPFYHDMGLMLGIILPMINQDTAVLMSPMAFLQRPARWMQLLAKHRAQISSAPNFGFELAVRRTSDD
DMAGLDLGHVRTIVTGAERVNVATLRRFTERFAPFNLSETAIRPSYGLAEATVYVATAGPGRAPKSVCFDYQQLSVGQAK
RAENGSEGANLVSYGAPRASTVRIVDPETRMENPAGTVGEIWVQGDNVGLGYWRNPQQTEATFRARLVTPSPGTSEGPWL
RTGDLGVIFEGELFITGRIKELLVVDGANHYPEDIEATIQEITGGRVVAIAVPDDRTEKLVTIIELMKRGRTDEEEKNRL
RTVKREVASAISRSHRLRVADVVMVAPGSIPVTTSGKVRRSASVERYLHHEFSRLDAMA
>P9WQ57 6.2.1.-~~~~~~Probable long-chain-fatty-acid--AMP ligase FadD30~~~COG0318
MSVISTLRDRATTTPSDEAFVFMDYDTKTGDQIDRMTWSQLYSRVTAVSAYLISYGRHADRRRTAAISAPQGLDYVAGFL
GALCAGWTPVPLPEPLGSLRDKRTGLAVLDCAADVVLTTSQAETRVRATIATHGASVTTPVIALDTLDEPSGDNCDLDSQ
LSDWSSYLQYTSGSTANPRGVVLSMRNVTENVDQIIRNYFRHEGGAPRLPSSVVSWLPLYHDMGLMVGLFIPLFVGCPVI
LTSPEAFIRKPARWMQLLAKHQAPFSAAPNFAFDLAVAKTSEEDMAGLDLGHVNTIINGAEQVQPNTITKFLRRFRPYNL
MPAAVKPSYGMAEAVVYLATTKAGSPPTSTEFDADSLARGHAELSTFETERATRLIRYHSDDKEPLLRIVDPDSNIELGP
GRIGEIWIHGKNVSTGYHNADDALNRDKFQASIREASAGTPRSPWLRTGDLGFIVGDEFYIVGRMKDLIIQDGVNHYPDD
IETTVKEFTGGRVAAFSVSDDGVEHLVIAAEVRTEHGPDKVTIMDFSTIKRLVVSALSKLHGLHVTDFLLVPPGALPKTT
SGKISRAACAKQYGANKLQRVATFP
>B2HMK0 6.2.1.20~~~~~~Long-chain-fatty-acid--AMP ligase FadD32~~~COG0318
MAYHNPFIVNGKIRFPENTNLVRHVEKWARVRGDKLAYRFLDFSTERDGVERDILWSEFSARNRAVGARLQQVTQPGDRI
AILCPQNLDYLISFFGALYSGRIAVPLFDPAEPGHVGRLHAVLDDCTPSTILTTTDSAEGVRKFIRSRSAKERPRVIAVD
AVPTEVASTWQQPEANELTTAYLQYTSGSTRVPSGVQITHLNLPTNVLQVLNALEGQEGDRGVSWLPFFHDMGLITVLLA
SVLGHSFTFMTPAAFVRRPGRWIRELARKPGETGGTFSAAPNFAFEHAAMRGVPRDDEPPLDLSNVKGILNGSEPVSPAS
MRKFFKAFEPYGLRETAVKPSYGLAEATLFVSTTPMDEVPTVIHVDRDELNKQRFVEVAADAPNAVAQVSAGKVGVDEWA
VIVDTETASELPDGQIGEIWLHGNNLGIGYWGKEEESAQTFRNILKSRVPESHAEGAPDDGLWVRTGDYGTYFKGHLYIA
GRIKDLVIIDGRNHYPQDLEYTAQESTKALRVGYVAAFSVPANQLPQKVFDDPHAGLSFDPEDTSEQLVIVGERAAGTHK
LEYQPIADDIRAAIAVGHGVTVRDVLLVSAGTIPRTSSGKIGRRACRTAYIDGSLRSGVSSPTVFATGS
>A0R618 6.2.1.20~~~~~~Long-chain-fatty-acid--AMP ligase FadD32~~~COG0318
MPFHNPFIKDGQIKFPDGSSIVAHVERWAKVRGDKLAYRFLDFSTERDGVPRDLTWAQFSARNRAVAARLQQVTQPGDRV
AILCPQNLDYLVAFFGALYAGRIAVPLFDPSEPGHVGRLHAVLDNCHPSAILTTTEAAEGVRKFFRTRPANQRPRVIAVD
AVPDDVASTWVNPDEPDETTIAYLQYTSGSTRIPTGVQITHLNLATNVVQVIEALEGEEGDRGLSWLPFFHDMGLITALL
APMIGHYFTFMTPAAFVRRPERWIRELARKEGDTGGTISVAPNFAFDHAAARGVPKPGSPPLDLSNVKAVLNGSEPISAA
TVRRFNEAFGPFGFPPKAIKPSYGLAEATLFVSTTPSAEEPKIITVDRDQLNSGRIVEVDADSPKAVAQASAGKVGIAEW
AVIVDAESATELPDGQVGEIWISGQNMGTGYWGKPEESVATFQNILKSRTNPSHAEGATDDATWVRTGDYGAFYDGDLYI
TGRVKDLVIIDGRNHYPQDLEYSAQEASKAIRTGYVAAFSVPANQLPDEVFENAHSGIKRDPDDTSEQLVIVAERAPGAH
KLDIGPITDDIRAAIAVRHGVTVRDVLLTAAGAIPRTSSGKIGRRACRAAYLDGSLRAGKVANDFPDATD
>O53580 6.2.1.20~~~~~~Long-chain-fatty-acid--AMP ligase FadD32~~~COG0318
MFVTGESGMAYHNPFIVNGKIRFPANTNLVRHVEKWAKVRGDKLAYRFLDFSTERDGVARDILWSDFSARNRAVGARLQQ
VTQPGDRVAILCPQNLDYLISFFGALYSGRIAVPLFDPAEPGHVGRLHAVLDDCAPSTILTTTDSAEGVRKFIRARSAKE
RPRVIAVDAVPTEVAATWQQPEANEETVAYLQYTSGSTRIPSGVQITHLNLPTNVVQVLNALEGQEGDRGVSWLPFFHDM
GLITVLLASVLGHSFTFMTPAAFVRRPGRWIRELARKPGETGGTFSAAPNFAFEHAAVRGVPRDDEPPLDLSNVKGILNG
SEPVSPASMRKFFEAFAPYGLKQTAVKPSYGLAEATLFVSTTPMDEVPTVIHVDRDELNNQRFVEVAADAPNAVAQVSAG
KVGVSEWAVIVDADTASELPDGQIGEIWLHGNNLGTGYWGKEEESAQTFKNILKSRISESRAEGAPDDALWVRTGDYGTY
FKDHLYIAGRIKDLVIIDGRNHYPQDLECTAQESTKALRVGYAAAFSVPANQLPQTVFDDSHAGLKFDPEDTSEQLVIVG
ERAAGTHKLDHQPIVDDIRAAIAVGHGVTVRDVLLVSAGTIPRTSSGKIGRRACRAAYLDGSLRSGVGSPTVFATSD
>P0A6Q3 4.2.1.59~~~fabA~~~3-hydroxydecanoyl-[acyl-carrier-protein] dehydratase~~~COG0764
MVDKRESYTKEDLLASGRGELFGAKGPQLPAPNMLMMDRVVKMTETGGNFDKGYVEAELDINPDLWFFGCHFIGDPVMPG
CLGLDAMWQLVGFYLGWLGGEGKGRALGVGEVKFTGQVLPTAKKVTYRIHFKRIVNRRLIMGLADGEVLVDGRLIYTASD
LKVGLFQDTSAF
>O33877 4.2.1.59~~~fabA~~~3-hydroxydecanoyl-[acyl-carrier-protein] dehydratase~~~
MTKQHAFTREDLLRCSRGELFGPGNAQLPAPNMLMIDRIVHISDVGGKYGKGELVAELDINPDLWFFACHFEGDPVMPGC
LGLDAMWQLVGFYLGWQGNPGRGRALGSGEVKFFGQVLPTAKKVTYNIHIKRTINRSLVLAIADGTVSVDGREIYSAEGL
RVGLFTSTDSF
>A5F850 4.2.1.59~~~fabA~~~3-hydroxydecanoyl-[acyl-carrier-protein] dehydratase~~~COG0764
MQNKRDSYNREDLLASSQGELFGEGYPQLPAPNMLMMDRITKMSETEGEFGKGLILAELDITPDLWFFDCHFPGDPVMPG
CLGLDAMWQLVGFFLGWVGGKGKGRALGVGEVKFTGQILPTAKKVTYEINMKRVVNRKLVMGLADGRVLVDGKEIYVAKD
LKVGLFQDTSAF
>Q8ZG80 4.2.1.59~~~fabA~~~3-hydroxydecanoyl-[acyl-carrier-protein] dehydratase~~~COG0764
MVDKRESYTKEDLEASGRGELFGAGGPPLPAGNMLMMDRIVKMIEDGGSHNKGYVEAELDINPDLWFFGCHFIGDPVMPG
CLGLDAMWQLVGFYLGWLGGEGKGRALGVGEVKFTGQVLPDAKKVTYRINFKRVIMRKLIMGVADGEVLVDGKVIYTATD
LKVGLFKDTNAF
>Q66CF3 4.2.1.59~~~fabA~~~3-hydroxydecanoyl-[acyl-carrier-protein] dehydratase~~~
MVDKRESYTKEDLEASGRGELFGAGGPPLPAGNMLMMDRIVKMIEDGGSHNKGYVEAELDINPDLWFFGCHFIGDPVMPG
CLGLDAMWQLVGFYLGWLGGEGKGRALGVGEVKFTGQVLPDAKKVTYRINFKRVIMRKLIMGVADGEVLVDGKVIYTATD
LKVGLFKDTNAF
>P0A953 2.3.1.41~~~fabB~~~3-oxoacyl-[acyl-carrier-protein] synthase 1~~~COG0304
MKRAVITGLGIVSSIGNNQQEVLASLREGRSGITFSQELKDSGMRSHVWGNVKLDTTGLIDRKVVRFMSDASIYAFLSME
QAIADAGLSPEAYQNNPRVGLIAGSGGGSPRFQVFGADAMRGPRGLKAVGPYVVTKAMASGVSACLATPFKIHGVNYSIS
SACATSAHCIGNAVEQIQLGKQDIVFAGGGEELCWEMACEFDAMGALSTKYNDTPEKASRTYDAHRDGFVIAGGGGMVVV
EELEHALARGAHIYAEIVGYGATSDGADMVAPSGEGAVRCMKMAMHGVDTPIDYLNSHGTSTPVGDVKELAAIREVFGDK
SPAISATKAMTGHSLGAAGVQEAIYSLLMLEHGFIAPSINIEELDEQAAGLNIVTETTDRELTTVMSNSFGFGGTNATLV
MRKLKD
>Q02K94 2.3.1.41~~~fabB~~~3-oxoacyl-[acyl-carrier-protein] synthase 1~~~
MRRVVITGLGIVSCLGNDKDTVSANLRAGRPGIRFNPSYAEMGLRSHVSGSVDLNLEELIDRKVFRFMGDAAAYAYLAME
QAIKDSGLTPEQISNPRTGLIAGSGGASTLNQMEAIDTLREKGVKRIGPYRVTRTMGSTVSACLATPFQIKGVNYSISSA
CATSAHCIGQAMEQIQLGKQDVVFAGGGEEEHWSQSCLFDAMGALSTQYNETPEKASRAYDAKRDGFVIAGGGGMVVVEE
LEHALKRGAKIYAEIVGYGATSDGYDMVAPSGEGAIRCMQQALATVDAPIDYLNTHGTSTPVGDVAEIRGVREVFGDKAP
AISSTKSLSGHSLGAAGVHEAIYCLLMMEGGFIAGSANIDELDPEVADLPILRETRENAKLDTVMSNSFGFGGTNATLVL
KRWQG
>P0AAI9 2.3.1.39~~~fabD~~~Malonyl CoA-acyl carrier protein transacylase~~~COG0331
MTQFAFVFPGQGSQTVGMLADMAASYPIVEETFAEASAALGYDLWALTQQGPAEELNKTWQTQPALLTASVALYRVWQQQ
GGKAPAMMAGHSLGEYSALVCAGVIDFADAVRLVEMRGKFMQEAVPEGTGAMAAIIGLDDASIAKACEEAAEGQVVSPVN
FNSPGQVVIAGHKEAVERAGAACKAAGAKRALPLPVSVPSHCALMKPAADKLAVELAKITFNAPTVPVVNNVDVKCETNG
DAIRDALVRQLYNPVQWTKSVEYMAAQGVEHLYEVGPGKVLTGLTKRIVDTLTASALNEPSAMAAALEL
>A0R0B2 2.3.1.39~~~fabD~~~Malonyl CoA-acyl carrier protein transacylase~~~COG0331
MLTPWLELPGAADRLAAWSQISGLDLTTLGTTATAEEITDTAVTQPLVVAATLLAHEELTKRGHSAAETIVAGHSVGEIA
AYAIAGVISADDAVKLAATRGAEMAKACAVEPTGMAAVLGGDEAEVLARLEALDLVPANRNAAGQIVAAGAVAALDKLAE
DPPAKARVRKLATAGAFHTHYMASALDGYAAAAQSVTTSEPTATLLSNADGQPVASAADAMEKLVAQLTKPVRWDLCTAT
LRDRFQNAESAGIVEFPPAGTLVGIAKRELKGTPTRAIKSPEDLDGLDQL
>P9WNG5 2.3.1.39~~~fabD~~~Malonyl CoA-acyl carrier protein transacylase~~~COG0331
MIALLAPGQGSQTEGMLSPWLQLPGAADQIAAWSKAADLDLARLGTTASTEEITDTAVAQPLIVAATLLAHQELARRCVL
AGKDVIVAGHSVGEIAAYAIAGVIAADDAVALAATRGAEMAKACATEPTGMSAVLGGDETEVLSRLEQLDLVPANRNAAG
QIVAAGRLTALEKLAEDPPAKARVRALGVAGAFHTEFMAPALDGFAAAAANIATADPTATLLSNRDGKPVTSAAAAMDTL
VSQLTQPVRWDLCTATLREHTVTAIVEFPPAGTLSGIAKRELRGVPARAVKSPADLDELANL
>O85140 2.3.1.39~~~fabD~~~Malonyl CoA-acyl carrier protein transacylase~~~
MTQFAFVFPGQGSQSVGMLAEMAANYPIVEETFAEASAALGYDLWALTQQGPAEELNKTWQTQPALLTASVALWRVWQQQ
GGKMPALMAGHSLGEYSALVCAGVINFADAVRLVEMRGKFMQEAVPEGTGGMSAIIGLDDASIAKACEESAEGQVVSPVN
FNSPGQVVIAGHKEAVERAGAACKAAGAKRALPLPVSVPSHCALMKPAADKLAVELAKITFSAPTVPVVNNVDVKCETDA
AAIRDALVRQLYNPVQWTKSVEFIAAQGVEHLYEVGPGKVLTGLTKRIVDTLTASALNEPAALSAALTQ
>Q99UN8 2.3.1.39~~~fabD~~~Malonyl CoA-acyl carrier protein transacylase~~~
MSKTAIIFPGQGAQKVGMAQDLFNNNDQATEILTSAAKTLDFDILETMFTDEEGKLGETENTQPALLTHSSALLAALKNL
NPDFTMGHSLGEYSSLVAADVLSFEDAVKIVRKRGQLMAQAFPTGVGSMAAVLGLDFDKVDEICKSLSSDDKIIEPANIN
CPGQIVVSGHKALIDELVEKGKSLGAKRVMPLAVSGPFHSSLMKVIEEDFSSYINQFEWRDAKFPVVQNVNAQGETDKEV
IKSNMVKQLYSPVQFINSTEWLIDQGVDHFIEIGPGKVLSGLIKKINRDVKLTSIQTLEDVKGWNEND
>Q7A5Z3 2.3.1.39~~~fabD~~~Malonyl CoA-acyl carrier protein transacylase~~~
MSKTAIIFPGQGAQKVGMAQDLFNNNDQATEILTSAAKTLDFDILETMFTDEEGKLGETENTQPALLTHSSALLAALKNL
NPDFTMGHSLGEYSSLVAADVLSFEDAVKIVRKRGQLMAQAFPTGVGSMAAVLGLDFDKVDEICKSLSSDDKIIEPANIN
CPGQIVVSGHKALIDELVEKGKSLGAKRVMPLAVSGPFHSSLMKVIEEDFSSYINQFEWRDAKFPVVQNVNAQGETDKEV
IKSNMVKQLYSPVQFINSTEWLIDQGVDHFIEIGPGKVLSGLIKKINRDVKLTSIQTLEDVKGWNEND
>P73242 2.3.1.39~~~fabD~~~Malonyl CoA-acyl carrier protein transacylase~~~COG0331
MKTAWVFPGQGSQAVGMGVDLLSTAIAKEKYQQAEEILGWSVVEKCQGDEASLALTQNTQPCLYVIEAILADLLRDKGFQ
PDYVAGHSLGEYSALYAAGVFDFATGLQLVKQRSEVMASASGGMMAALMKFDQTQLQQALTDNTEVVLANDNSPEQVVIS
GTVAGVEAILANVKARRAVPLKVSGAFHSSFMAQPSQSFAQTLTACHFNDATVPVLSNVDPSPTQNGDRLKEKLIQQMTG
SVRWRETMVNLGEIGATDYWEVGPGKVLTGLCKRTCPDLNLKNIGQLDDLNSL
>O34340 2.3.1.179~~~fabF~~~3-oxoacyl-[acyl-carrier-protein] synthase 2~~~COG0304
MTKKRVVVTGLGALSPLGNDVDTSWNNAINGVSGIGPITRVDAEEYPAKVAAELKDFNVEDYMDKKEARKMDRFTQYAVV
AAKMAVEDADLNITDEIAPRVGVWVGSGIGGLETLESQFEIFLTKGPRRVSPFFVPMMIPDMATGQISIALGAKGVNSCT
VTACATGTNSIGDAFKVIQRGDADVMVTGGTEAPLTRMSFAGFSANKALSTNPDPKTASRPFDKNRDGFVMGEGAGIIVL
EELEHALARGAKIYGEIVGYGSTGDAYHITAPAQDGEGGARAMQEAIKDAGIAPEEIDYINAHGTSTYYNDKYETMAIKT
VFGEHAHKLAVSSTKSMTGHLLGAAGGIEAIFSILAIKEGVIPPTINIQTPDEECDLDYVPDEARRQELNYVLSNSLGFG
GHNATLIFKKYQS
>Q83E37 2.3.1.179~~~fabF~~~3-oxoacyl-[acyl-carrier-protein] synthase 2~~~COG0304
MEKRRVVITGLGVVSPLGNKVSDMWQALLAGKSGVKPITRFDASSFPTQIAAEVRDFDPALVLDLKSIRKTDVFVQFAME
SARQAWEDSGLEINETNAPRVGVAIGSGIGGMPWIEKNYDALLTSGPRKISPFFIPGAIINMASGMVSIKYDLKGPNISI
VTACTTGLHNIGHAARMIAHNDADAMIAGGTEMASTPLGIGGFAAVRALSTRNDEPEKASRPWDKGRDGFVLGEGAACVV
VEELEHAKKRNATIYAEIIGFGMSGDAYHMTRPDPEAEGFTTCMKNSLRDAGIAPERVDYINAHGTSTPAADPLEARAIK
KTFGDHAYKLAVSSTKSMTGHMLGAAGALETVISVLAIRDNTAPPTINLENPDEGCDLDFVPNEAREMKIDTVMSNSFGF
GGTNGTLVLSRVFD
>P0AAI5 2.3.1.179~~~fabF~~~3-oxoacyl-[acyl-carrier-protein] synthase 2~~~COG0304
MSKRRVVVTGLGMLSPVGNTVESTWKALLAGQSGISLIDHFDTSAYATKFAGLVKDFNCEDIISRKEQRKMDAFIQYGIV
AGVQAMQDSGLEITEENATRIGAAIGSGIGGLGLIEENHTSLMNGGPRKISPFFVPSTIVNMVAGHLTIMYGLRGPSISI
ATACTSGVHNIGHAARIIAYGDADVMVAGGAEKASTPLGVGGFGAARALSTRNDNPQAASRPWDKERDGFVLGDGAGMLV
LEEYEHAKKRGAKIYAELVGFGMSSDAYHMTSPPENGAGAALAMANALRDAGIEASQIGYVNAHGTSTPAGDKAEAQAVK
TIFGEAASRVLVSSTKSMTGHLLGAAGAVESIYSILALRDQAVPPTINLDNPDEGCDLDFVPHEARQVSGMEYTLCNSFG
FGGTNGSLIFKKI
>Q5HHA1 2.3.1.179~~~fabF~~~3-oxoacyl-[acyl-carrier-protein] synthase 2~~~
MSQNKRVVITGMGALSPIGNDVKTTWENALKGVNGIDKITRIDTEPYSVHLAGELKNFNIEDHIDKKEARRMDRFTQYAI
VAAREAVKDAQLDINENTADRIGVWIGSGIGGMETFEIAHKQLMDKGPRRVSPFFVPMLIPDMATGQVSIDLGAKGPNGA
TVTACATGTNSIGEAFKIVQRGDADAMITGGTEAPITHMAIAGFSASRALSTNDDIETACRPFQEGRDGFVMGEGAGILV
IESLESAQARGANIYAEIVGYGTTGDAYHITAPAPEGEGGSRAMQAAMDDAGIEPKDVQYLNAHGTSTPVGDLNEVKAIK
NTFGEAAKHLKVSSTKSMTGHLLGATGGIEAIFSALSIKDSKVAPTIHAVTPDPECDLDIVPNEAQDLDITYAMSNSLGF
GGHNAVLVFKKFEA
>Q7A6F8 2.3.1.179~~~fabF~~~3-oxoacyl-[acyl-carrier-protein] synthase 2~~~
MSQNKRVVITGMGALSPIGNDVKTTWENALKGVNGIDKITRIDTEPYSVHLAGELKNFNIEDHIDKKEARRMDRFTQYAI
VAAREAVKDAQLDINDNTADRIGVWIGSGIGGMETFEIAHKQLMDKGPRRVSPFFVPMLIPDMATGQVSIDLGAKGPNGA
TVTACATGTNSIGEAFKIVQRGDADAMITGGTEAPITHMAIAGFSASRALSTNDDIETACRPFQEGRDGFVMGEGAGILV
IESLESAQARGANIYAEIVGYGTTGDAYHITAPAPEGEGGSRAMQAAMDDAGIEPKDVQYLNAHGTSTPVGDLNEVKAIK
NTFGEAAKHLKVSSTKSMTGHLLGATGGIEAIFSALSIKDSKVAPTIHAVTPDPECDLDIVPNEAQDLDITYAMSNSLGF
GGHNAVLVFKKFEA
>Q8NXE1 2.3.1.179~~~fabF~~~3-oxoacyl-[acyl-carrier-protein] synthase 2~~~
MSQNKRVVITGMGALSPIGNDVKTTWENALKGVNGIDKITRIDTEPYSVHLAGELKNFNIEDHIDKKEARRMDRFTQYAI
VAAREAVKDAQLDINENTADRIGVWIGSGIGGMETFEIAHKQLMDKGPRRVSPFFVPMLIPDMATGQVSIDLGAKGPNGA
TVTACATGTNSIGEAFKIVQRGDADAMITGGTEAPITHMAIAGFSASRALSTNDDIETACRPFQEGRDGFVMGEGAGILV
IESLESAQARGANIYAEIVGYGTTGDAYHITAPAPEGEGGSRAMQAAMDDAGIEPKDVQYLNAHGTSTPVGDLNEVKAIK
NTFGEAAKHLKVSSTKSMTGHLLGATGGIEAIFSALSIKDSKVAPTIHAVTPDPECDLDIVPNEAQDLDITYAMSNSLGF
GGHNAVLVFKKFEA
>P73283 2.3.1.179~~~fabF~~~3-oxoacyl-[acyl-carrier-protein] synthase 2~~~COG0304
MANLEKKRVVVTGLGAITPIGNTLQDYWQGLMEGRNGIGPITRFDASDQACRFGGEVKDFDATQFLDRKEAKRMDRFCHF
AVCASQQAINDAKLVINELNADEIGVLIGTGIGGLKVLEDQQTILLDKGPSRCSPFMIPMMIANMASGLTAINLGAKGPN
NCTVTACAAGSNAIGDAFRLVQNGYAKAMICGGTEAAITPLSYAGFASARALSFRNDDPLHASRPFDKDRDGFVMGEGSG
ILILEELESALARGAKIYGEMVGYAMTCDAYHITAPVPDGRGATRAIAWALKDSGLKPEMVSYINAHGTSTPANDVTETR
AIKQALGNHAYNIAVSSTKSMTGHLLGGSGGIEAVATVMAIAEDKVPPTINLENPDPECDLDYVPGQSRALIVDVALSNS
FGFGGHNVTLAFKKYQ
>Q9KQH9 2.3.1.179~~~fabF~~~3-oxoacyl-[acyl-carrier-protein] synthase 2~~~COG0304
MSKRRVVVTGMGMLSPVGNTVESSWKALLAGQSGIVNIEHFDTTNFSTRFAGLVKGFDCEQYMSKKDARKMDLFIQYGIA
AGIQALEDSGLEVNEENAARIGVAIGSGIGGLELIETGHQALIEKGPRKVSPFFVPSTIVNMIAGNLSIMRGLRGPNIAI
STACTTGLHNIGHAARMIAYGDADAMVAGGAEKASTPLGMAGFGAAKALSTRNDEPQKASRPWDKDRDGFVLGDGAGIMV
LEEYEHAKARGAKIYAEVVGFGMSGDAYHMTSPSEDGSGGALAMEAAMRDAGVTGEQIGYVNAHGTSTPAGDVAEVKGIK
RALGEAGTKQVLVSSTKSMTGHLLGAAGSVEAIITVMSLVDQMVPPTINLDNPEEGLGVDLVPHVARKVESMEYAMCNSF
GFGGTNGSLIFKRM
>I6Y778 1.1.1.212~~~fabG4~~~3-oxoacyl-[acyl-carrier-protein] reductase [NADH]~~~COG1028
MAPKRSSDLFSQVVNSGPGSFLARQLGVPQPETLRRYRAGEPPLTGSLLIGGAGRVVEPLRAALEKDYDLVGNNLGGRWA
DSFGGLVFDATGITEPAGLKGLHEFFTPVLRNLGRCGRVVVVGGTPEAAASTNERIAQRALEGFTRSLGKELRRGATTAL
VYLSPDAKPAATGLESTMRFLLSAKSAYVDGQVFSVGADDSTPPADWEKPLDGKVAIVTGAARGIGATIAEVFARDGAHV
VAIDVESAAENLAETASKVGGTALWLDVTADDAVDKISEHLRDHHGGKADILVNNAGITRDKLLANMDDARWDAVLAVNL
LAPLRLTEGLVGNGSIGEGGRVIGLSSIAGIAGNRGQTNYATTKAGMIGITQALAPGLAAKGITINAVAPGFIETQMTAA
IPLATREVGRRLNSLLQGGQPVDVAEAIAYFASPASNAVTGNVIRVCGQAMIGA
>O67610 1.1.1.100~~~fabG~~~3-oxoacyl-[acyl-carrier-protein] reductase FabG~~~COG1028
MEIKLQGKVSLVTGSTRGIGRAIAEKLASAGSTVIITGTSGERAKAVAEEIANKYGVKAHGVEMNLLSEESINKAFEEIY
NLVDGIDILVNNAGITRDKLFLRMSLLDWEEVLKVNLTGTFLVTQNSLRKMIKQRWGRIVNISSVVGFTGNVGQVNYSTT
KAGLIGFTKSLAKELAPRNVLVNAVAPGFIETDMTAVLSEEIKQKYKEQIPLGRFGSPEEVANVVLFLCSELASYITGEV
IHVNGGMF
>P0AEK2 1.1.1.100~~~fabG~~~3-oxoacyl-[acyl-carrier-protein] reductase FabG~~~COG1028
MNFEGKIALVTGASRGIGRAIAETLAARGAKVIGTATSENGAQAISDYLGANGKGLMLNVTDPASIESVLEKIRAEFGEV
DILVNNAGITRDNLLMRMKDEEWNDIIETNLSSVFRLSKAVMRAMMKKRHGRIITIGSVVGTMGNGGQANYAAAKAGLIG
FSKSLAREVASRGITVNVVAPGFIETDMTRALSDDQRAGILAQVPAGRLGGAQEIANAVAFLASDEAAYITGETLHVNGG
MYMV
>O54438 1.1.1.100~~~fabG~~~3-oxoacyl-[acyl-carrier-protein] reductase FabG~~~
MSLQGKVALVTGASRGIGQAIALELGRLGAVVIGTATSASGAEKIAETLKANGVEGAGLVLDVSSDESVAATLEHIQQHL
GQPLIVVNNAGITRDNLLVRMKDDEWFDVVNTNLNSLYRLSKAVLRGMTKARWGRIINIGSVVGAMGNAGQTNYAAAKAG
LEGFTRALAREVGSRAITVNAVAPGFIDTDMTRELPEAQREALLGQIPLGRLGQAEEIAKVVGFLASDGAAYVTGATVPV
NGGMYMS
>P50941 1.1.1.100~~~fabG~~~3-oxoacyl-[acyl-carrier-protein] reductase FabG~~~COG1028
MIDLTGKTSLITGASSGIGSAIARLLHKLGSKVIISGSNEEKLKSLGNALKDNYTIEVCNLANKEECSNLISKTSNLDIL
VCNAGITSDTLAIRMKDQDFDKVIDINLKANFILNREAIKKMIQKRYGRIINISSIVGIAGNPGQANYCASKAGLIGMTK
SLSYEVATRGITVNAVAPGFIKSDMTDKLNEKQREAIVQKIPLGTYGIPEDVAYAVAFLASNNASYITGQTLHVNGGMLM
V
>P0A2C9 1.1.1.100~~~fabG~~~3-oxoacyl-[acyl-carrier-protein] reductase FabG~~~
MSFEGKIALVTGASRGIGRAIAETLVARGAKVIGTATSENGAKNISDYLGANGKGLMLNVTDPASIESVLENIRAEFGEV
DILVNNAGITRDNLLMRMKDDEWNDIIETNLSSVFRLSKAVMRAMMKKRCGRIITIGSVVGTMGNAGQANYAAAKAGLIG
FSKSLAREVASRGITVNVVAPGFIETDMTRALSDDQRAGILAQVPAGRLGGAQEIASAVAFLASDEASYITGETLHVNGG
MYMV
>P0A0H9 1.1.1.100~~~fabG~~~3-oxoacyl-[acyl-carrier-protein] reductase FabG~~~
MKMTKSALVTGASRGIGRSIALQLAEEGYNVAVNYAGSKEKAEAVVEEIKAKGVDSFAIQANVADADEVKAMIKEVVSQF
GSLDVLVNNAGITRDNLLMRMKEQEWDDVIDTNLKGVFNCIQKATPQMLRQRSGAIINLSSVVGAVGNPGQANYVATKAG
VIGLTKSAARELASRGITVNAVAPGFIVSDMTDALSDELKEQMLTQIPLARFGQDTDIANTVAFLASDKAKYITGQTIHV
NGGMYM
>P99093 1.1.1.100~~~fabG~~~3-oxoacyl-[acyl-carrier-protein] reductase FabG~~~
MKMTKSALVTGASRGIGRSIALQLAEEGYNVAVNYAGSKEKAEAVVEEIKAKGVDSFAIQANVADADEVKAMIKEVVSQF
GSLDVLVNNAGITRDNLLMRMKEQEWDDVIDTNLKGVFNCIQKATPQMLRQRSGAIINLSSVVGAVGNPGQANYVATKAG
VIGLTKSAARELASRGITVNAVAPGFIVSDMTDALSDELKEQMLTQIPLARFGQDTDIANTVAFLASDKAKYITGQTIHV
NGGMYM
>P73574 1.1.1.100~~~fabG~~~3-oxoacyl-[acyl-carrier-protein] reductase~~~COG1028
MTALTAQVALVTGASRGIGKATALALAATGMKVVVNYAQSSTAADAVVAEIIANGGEAIAVQANVANADEVDQLIKTTLD
KFSRIDVLVNNAGITRDTLLLRMKLEDWQAVIDLNLTGVFLCTKAVSKLMLKQKSGRIINITSVAGMMGNPGQANYSAAK
AGVIGFTKTVAKELASRGVTVNAVAPGFIATDMTENLNAEPILQFIPLARYGQPEEVAGTIRFLATDPAAAYITGQTFNV
DGGMVMF
>Q9KQH7 1.1.1.100~~~fabG~~~3-oxoacyl-[acyl-carrier-protein] reductase FabG~~~COG1028
MNLEGKVALVTGASRGIGKAIAELLAERGAKVIGTATSESGAQAISDYLGDNGKGMALNVTNPESIEAVLKAITDEFGGV
DILVNNAGITRDNLLMRMKEEEWSDIMETNLTSIFRLSKAVLRGMMKKRQGRIINVGSVVGTMGNAGQANYAAAKAGVIG
FTKSMAREVASRGVTVNTVAPGFIETDMTKALNDEQRTATLAQVPAGRLGDPREIASAVAFLASPEAAYITGETLHVNGG
MYMI
>O34746 2.3.1.180~~~fabHA~~~Beta-ketoacyl-[acyl-carrier-protein] synthase III 1~~~COG0332
MKAGILGVGRYIPEKVLTNHDLEKMVETSDEWIRTRTGIEERRIAADDVFSSHMAVAAAKNALEQAEVAAEDLDMILVAT
VTPDQSFPTVSCMIQEQLGAKKACAMDISAACAGFMYGVVTGKQFIESGTYKHVLVVGVEKLSSITDWEDRNTAVLFGDG
AGAAVVGPVSDDRGILSFELGADGTGGQHLYLNEKRHTIMNGREVFKFAVRQMGESCVNVIEKAGLSKEDVDFLIPHQAN
IRIMEAARERLELPVEKMSKTVHKYGNTSAASIPISLVEELEAGKIKDGDVVVMVGFGGGLTWGAIAIRWGR
>Q9KQH5 2.3.1.180~~~fabH1~~~Beta-ketoacyl-[acyl-carrier-protein] synthase III 1~~~COG0332
MYSKILGTGSYLPSQVRTNADLEKMVETSDEWIVARTGIRERRIAADNETVADMAFFAAQNAINMAGIDKHDIDMIIVAT
TSASHTFPSAACQVQGKLGIKGCPAFDLAAACSGFMYALSIADQHVKSGMCKHVLVIGADALSKTCDPTDRSTIILFGDG
AGAVVVGASNEPGILSTHIHADGEFGDLLSLEVPVRGGDSDKWLHMAGNEVFKVAVTQLSKLVVDTLKANNMHKSELDWL
VPHQANYRIISATAKKLSMSLDQVVITLDRHGNTSAATVPTALDEAVRDGRIQRGQMLLLEAFGGGFTWGSALVKF
>O07600 2.3.1.180~~~fabHB~~~Beta-ketoacyl-[acyl-carrier-protein] synthase III 2~~~COG0332
MSKAKITAIGTYAPSRRLTNADLEKIVDTSDEWIVQRTGMRERRIADEHQFTSDLCIEAVKNLKSRYKGTLDDVDMILVA
TTTSDYAFPSTACRVQEYFGWESTGALDINATCAGLTYGLHLANGLITSGLHQKILVIAGETLSKVTDYTDRTTCVLFGD
AAGALLVERDEETPGFLASVQGTSGNGGDILYRAGLRNEINGVQLVGSGKMVQNGREVYKWAARTVPGEFERLLHKAGLS
SDDLDWFVPHSANLRMIESICEKTPFPIEKTLTSVEHYGNTSSVSIVLALDLAVKAGKLKKDQIVLLFGFGGGLTYTGLL
IKWGM
>Q9KLJ3 2.3.1.180~~~fabH2~~~Beta-ketoacyl-[acyl-carrier-protein] synthase III 2~~~COG0332
MTQCYAEITGWGKCLPPATLSNHDLSTFLDTSDEWIQSRTGIEQRRISHVNTSDLATVAAQHAIACAGVSVEEIDLIIVA
TCSPDSLIPNIASRVQQNLGIPSAAAFDLNAACTGFLYGLETATRLMQASHYRHALVIGAERLSFYLDWTKRDTAVLFGD
GAGAVVLSKTEQKVGLQDAQIGCDAQGRDILAVPKFGTAMDRFDADNGYWAFDFVGKEIFKRAVRGMGAAAQQVLARSGL
STEEIDVVIPHQANIRIIQTLCDLAGIAQDKAFVNIHRYGNTSAATVPIALCEALEQGKIKPHDDLLVAAFGAGLTWGAG
HIRWGERITPLGKSDAQLPSCDHTALDLLSKAIEHCKRHQSE
>O67185 2.3.1.180~~~fabH~~~Beta-ketoacyl-[acyl-carrier-protein] synthase III~~~COG0332
MGTKIIGTGVYLPKNVLTNFDLEKIVDTSDEWITTRTGIKERRIAKEETITYMATQAAKEALREANLSPEELDLIILATL
TPQKRFPSTACLVQAQLKAKGVYAFDISAACSGFIYALDIADSFIKSGKAKNVLVIGAEKLSEAVDWEDRSTCVLFGDGA
GAVVVTRSEDKSDILATRMYAEGSLEELLHADNCGYIRMKGRELFKVAVRSMEEVCREVLEKAGVKPEEVSLVIPHQANV
RIINALAEKLNIPKEKVFVNIQKYGNTSAASIPIALHEAIKEGKVKRGDLILMTAMGGGLTWGAVLLRY
>P0DW99 2.3.1.180~~~fabH~~~Beta-ketoacyl-[acyl-carrier-protein] synthase III~~~
MTAIKTRPVHGYSKFLSTGSARGSRVVTNKEMCTLIDSTPEWIEQRTGITERRWATNSETVASMGTTAARTALERSGLEA
SQIDAIIVATVSHHRPSPSLAAYIARELGLGDAAAFDLNGACAGFCYSTALADSMIRTGSANYVLVIGVEKLSEMTNLDD
RSTAFLFSDGAGAAIIGASDEPGIGPVVWGSRSDQLKTIELEDWPTASADPNKIHPLIRMEGRAVFKWAMTDVAKRAAEA
IAEAGITPADLDVFIPHQANDRITDVVSRHLKLPESVTVCHDIADMGNTSAASVPIAIDRMLQRGQAHSGDLALIIGFGA
GLVYAGQVIRLP
>P0A6R0 2.3.1.180~~~fabH~~~Beta-ketoacyl-[acyl-carrier-protein] synthase III~~~COG0332
MYTKIIGTGSYLPEQVRTNADLEKMVDTSDEWIVTRTGIRERHIAAPNETVSTMGFEAATRAIEMAGIEKDQIGLIVVAT
TSATHAFPSAACQIQSMLGIKGCPAFDVAAACAGFTYALSVADQYVKSGAVKYALVVGSDVLARTCDPTDRGTIIIFGDG
AGAAVLAASEEPGIISTHLHADGSYGELLTLPNADRVNPENSIHLTMAGNEVFKVAVTELAHIVDETLAANNLDRSQLDW
LVPHQANLRIISATAKKLGMSMDNVVVTLDRHGNTSAASVPCALDEAVRDGRIKPGQLVLLEAFGGGFTWGSALVRF
>Q820T1 2.3.1.180~~~fabH~~~Beta-ketoacyl-[acyl-carrier-protein] synthase III~~~COG0332
MKNYARISCTSRYVPENCVTNHQLSEMMDTSDEWIHSRTGISERRIVTQENTSDLCHQVAKQLLEKSGKQASEIDFILVA
TVTPDFNMPSVACQVQGAIGATEAFAFDISAACSGFVYALSMAEKLVLSGRYQTGLVIGGETFSKMLDWTDRSTAVLFGD
GAAGVLIEAAETPHFLNEKLQADGQRWAALTSGYTINESPFYQGHKQASKTLQMEGRSIFDFAIKDVSQNILSLVTDETV
DYLLLHQANVRIIDKIARKTKISREKFLTNMDKYGNTSAASIPILLDEAVENGTLILGSQQRVVLTGFGGGLTWGSLLLT
L
>P43711 2.3.1.180~~~fabH~~~Beta-ketoacyl-[acyl-carrier-protein] synthase III~~~COG0332
MNSRILSTGSYLPSHIRTNADLEKMVDTSDEWIVTRSGIRERRIAAEDETVATMGFEAAKNAIEAAQINPQDIELIIVAT
TSHSHAYPSAACQVQGLLNIDDAISFDLAAACTGFVYALSVADQFIRAGKVKKALVIGSDLNSRKLDETDRSTVVLFGDG
AGAVILEASEQEGIISTHLHASADKNNALVLAQPERGIEKSGYIEMQGNETFKLAVRELSNVVEETLLANNLDKKDLDWL
VPHQANLRIITATAKKLEMDMSQVVVTLDKYANNSAATVPVALDEAIRDGRIQRGQLLLLEAFGGGWTWGSALVRF
>C0LNR0 2.3.1.180~~~fabH~~~Beta-ketoacyl-[acyl-carrier-protein] synthase III~~~
MNAGILGVGKYVPEKIVTNFDLEKIMDTSDEWIRTRTGIEERRIARDDEYTHDLAYEAAKVAIKNAGLTPDDIDLFIVAT
VTQEATFPSVANIIQDRLGAKNAAGMDVEAACAGFTFGVVTAAQFIKTGAYKNIVVVGADKLSKITNWDDRTTAVLFGDG
AGAVVMGPVSDDHGLLSFDLGSDGSGGKYLNLDENKKIYMNGREVFRFAVRQMGEASLRVLERAGLEKEDLDLLIPHQAN
IRIMEASRERLNLPEEKLMKTVHKYGNTSSSSIALALVDAVEEGRIKDNDNVLLVGFGGGLTWGALIIRWGK
>P9WNG3 2.3.1.301~~~fabH~~~Mycobacterial beta-ketoacyl-[acyl-carrier-protein] synthase III~~~COG0332
MTEIATTSGARSVGLLSVGAYRPERVVTNDEICQHIDSSDEWIYTRTGIKTRRFAADDESAASMATEACRRALSNAGLSA
ADIDGVIVTTNTHFLQTPPAAPMVAASLGAKGILGFDLSAGCAGFGYALGAAADMIRGGGAATMLVVGTEKLSPTIDMYD
RGNCFIFADGAAAVVVGETPFQGIGPTVAGSDGEQADAIRQDIDWITFAQNPSGPRPFVRLEGPAVFRWAAFKMGDVGRR
AMDAAGVRPDQIDVFVPHQANSRINELLVKNLQLRPDAVVANDIEHTGNTSAASIPLAMAELLTTGAAKPGDLALLIGYG
AGLSYAAQVVRMPKG
>A1KRY9 2.3.1.180~~~fabH~~~Beta-ketoacyl-[acyl-carrier-protein] synthase III~~~
MQYAKISGTGSYLPANRVSNDDLAQKVDTSDEWITARTGIKFRHIAAENEKTSDLAAEAARRALDAAGLDSGEIDLIIVA
TATPDMQFPSTATIVQQKLGITNGCPAFDVQAVCAGFMYALTTANAYIKSGMAKNALVIGAETFSRIVDWNDRTTCVLFG
DGAGAVVLSAADKPGIIHSKLKADGNYLKLLNVPGQIACGKVSGSPYISMDGPGVFKFAVKMLSKIADDVIEEAGYTAAQ
IDWIVPHQANRRIIESTAKHLGLSMDKVVLTVQDHGNTSAASIPLALDTGIRSGQIKRGQNLLLEGIGGGFAWGAVLLQY
>Q9HYR2 2.3.1.180~~~fabH~~~Beta-ketoacyl-[acyl-carrier-protein] synthase III~~~
MPRAAVVCGLGSYLPEAVLSNDMLAAELDTSDAWISSRTGVRQRHIAGDLGSGDLALRAASAALASAGLERVDAVVLATS
TGDFCCPATAPRVAARLGLVGALAFDLSAACTGFVYGLASVGSLISAGLADSALLVGVDTFSHTLDPADRSTRALFGDGA
GAVVLRAGDAEEEGALLAFDLGSDGHQFDLLMTPAVSRAERSSGQASNYFRMDGKAVFGQAVTQMSDSVRRVLDRVGWQA
SDLHHLVPHQANTRILAAVADQLDLPVERVVSNIAEVGNTVAASIPLALAHGLRQGILRDGGNMVLTGFGAGLTWGSVAL
RWPKIVPTMD
>Q2FZS0 2.3.1.180~~~fabH~~~Beta-ketoacyl-[acyl-carrier-protein] synthase III~~~COG0332
MNVGIKGFGAYAPEKIIDNAYFEQFLDTSDEWISKMTGIKERHWADDDQDTSDLAYEASLKAIADAGIQPEDIDMIIVAT
ATGDMPFPTVANMLQERLGTGKVASMDQLAACSGFMYSMITAKQYVQSGDYHNILVVGADKLSKITDLTDRSTAVLFGDG
AGAVIIGEVSDGRGIISYEMGSDGTGGKHLYLDKDTGKLKMNGREVFKFAVRIMGDASTRVVEKANLTSDDIDLFIPHQA
NIRIMESARERLGISKDKMSVSVNKYGNTSAASIPLSIDQELKNGKIKDDDTIVLVGFGGGLTWGAMTIKWGK
>P99159 2.3.1.180~~~fabH~~~Beta-ketoacyl-[acyl-carrier-protein] synthase III~~~
MNVGIKGFGAYAPEKIIDNAYFEQFLDTSDEWISKMTGIKERHWADDDQDTSDLAYEASVKAIADAGIQPEDIDMIIVAT
ATGDMPFPTVANMLQERLGTGKVASMDQLAACSGFMYSMITAKQYVQSGDYHNILVVGADKLSKITDLTDRSTAVLFGDG
AGAVIIGEVSEGRGIISYEMGSDGTGGKHLYLDKDTGKLKMNGREVFKFAVRIMGDASTRVVEKANLTSDDIDLFIPHQA
NIRIMESARERLGISKDKMSVSVNKYGNTSAASIPLSIDQELKNGKLKDDDTIVLVGFGGGLTWGAMTIKWGK
>Q6GIA4 2.3.1.180~~~fabH~~~Beta-ketoacyl-[acyl-carrier-protein] synthase III~~~
MNVGIKGFGAYAPEKIIDNAYFEQFLDTSDEWISKMTGIKERHWADDDQDTSDLAYEASVKAIADAGIQPEDIDMIIVAT
ATGDMPFPTVANMLQERLGTGKVASMDQLAACSGFMYSMITAKQYVQSGDYHNILVVGADKLSKITDLTDRSTAVLFGDG
AGAVIIGEVSEGRGIISYEMGSDGTGGKHLYLDKDTGKLKMNGREVFKFAVRIMGDASTRVVEKANLTSDDIDLFIPHQA
NIRIMESARERLGISKDKMSVSVNKYGNTSAASIPLSIDQELKNGKLKDDDTIVLVGFGGGLTWGAMTIKWGK
>Q8NXE2 2.3.1.180~~~fabH~~~Beta-ketoacyl-[acyl-carrier-protein] synthase III~~~
MNVGIKGFGAYAPEKIIDNAYFEQFLDTSDEWISKMTGIKERHWADDDQDTSDLAYEASLKAIADAGIQPEDIDMIIVAT
ATGDMPFPTVANMLQERLGTGKVASMDQLAACSGFMYSMITAKQYVQSGDYHNILVVGADKLSKITDLTDRSTAVLFGDG
AGAVIIGEVSDGRGIISYEMGSDGTGGKHLYLDKDTGKLKMNGREVFKFAVRIMGDASTRVVEKANLTSDDIDLFIPHQA
NIRIMESARERLGISKDKMSVSVNKYGNTSAASIPLSIDQELKNGKIKDDDTIVLVGFGGGLTWGAMTIKWGK
>Q54206 2.3.1.180~~~fabH~~~Beta-ketoacyl-[acyl-carrier-protein] synthase III~~~COG0332
MSKIKPAKGAPYARILGVGGYRPTRVVPNEVILETIDSSDEWIRSRSGIQTRHWANDEETVAAMSIEASGKAIADAGITA
AQVGAVIVSTVTHFKQTPAVATEIADKLGTNKAAAFDISAGCAGFGYGLTLAKGMIVEGSAEYVLVIGVERLSDLTDLED
RATAFLFGDGAGAVVVGPSNEPAIGPTIWGSEGDKAETIKQTVPWTDYREGGVERFPAITQEGQAVFRWAVFEMAKVAQQ
ALDAAGVAAADLDVFIPHQANERIIDSMVKTLKLPESVTVARDVRTTGNTSAASIPLAMERLLATGEAKSGDTALVIGFG
AGLVYAASVVTLP
>Q9F6D4 2.3.1.180~~~fabH~~~Beta-ketoacyl-[acyl-carrier-protein] synthase III~~~
MPGLRVPERRFSRVLGVGSYRPRREVSNKEVCTWIDSTEEWIETRTGIRSRRIAEPDETIQVMGVAASRRALEHAGVDPA
EIDLVVVSTMTNFVHTPPLSVAIAHELGADNAGGFDLSAACAGFCHALSIAADAVESGGSRHVLVVATERMTDVIDLADR
SLSFLFGDGAGAAVVGPSDVPGIGPVVRGIDGTGLGSLHMSSSWDQYVEDPSVGRPALVMDGKRVFRWAVADVVPAAREA
LEVAGLTVGDLVAFVPHQANLRIIDVLVDRLGVPEHVVVSRDAEDTGNTSSASVALALDRLVRSGAVPGGGPALMIGFGA
GLSYAGQALLLPDPPSTPA
>P0A3C5 2.3.1.180~~~fabH~~~Beta-ketoacyl-[acyl-carrier-protein] synthase III~~~COG0332
MAFAKISQVAHYVPEQVVTNHDLAQIMDTNDEWISSRTGIRQRHISRTESTSDLATEVAKKLMAKAGITGEELDFIILAT
ITPDSMMPSTAARVQANIGANKAFAFDLTAACSGFVFALSTAEKFIASGRFQKGLVIGSETLSKAVDWSDRSTAVLFGDG
AGGVLLEASEQEHFLAESLNSDGSRSECLTYGHSGLHSPFSDQESADSFLKMDGRTVFDFAIRDVAKSIKQTIDESPIEV
TDLDYLLLHQANDRILDKMARKIGVDRAKLPANMMEYGNTSAASIPILLSECVEQGLIPLDGSQTVLLSGFGGGLTWGTL
ILTI
>C1CIR8 2.3.1.180~~~fabH~~~Beta-ketoacyl-[acyl-carrier-protein] synthase III~~~
MAFAKISQVAHYVPEQVVTNHDLAQIMDTNDEWISSRTGIRQRHISRTESTSDLATEVAKKLMAKAGITGKELDFIILAT
ITPDSMMPSTAARVQANIGANKAFAFDLTAACSGFVFALSTAEKFIASGRFQKGLVIGSETLSKAVDWSDRSTAVLFGDG
AGGVLLEASEQEHFLAESLNSDGSRSECLTYGHSGLHSPFSDQESADSFLKMDGRTVFDFAIRDVAKSIKQTIDESPIEV
TDLDYLLLHQANDRILDKMARKIGVDRAKLPANMMEYGNTSAASIPILLSECVEQGLIPLDGSQTVLLSGFGGGLTWGTL
ILTI
>Q4URQ0 2.3.1.180~~~fabH~~~Beta-ketoacyl-[acyl-carrier-protein] synthase III~~~
MSKRIYSRIAGTGSYLPEKVLTNDDMSKIVDTSDEWIRSRTGIRERHIVADDQTTSDLAYFASLKAMEAAGVTADEIDLI
VVGTTTPDLIFPSTACLLQARLGNVGCGAFDVNAACSGFVYALSVADKFVRSGDAKTVLVVGAETLTRIVDWTDRTTCVL
FGDGAGAVVLKADEDTGILSTHLHADGSKKELLWDPVGVSVGFGEGKNGGGALLMKGNDVFKYAVKALDSVVDETLAANG
LDTHDLDWLIPHQANLRIIEATAKRLDLPMEQVVVTVDRHGNTSSASVLLALDEAVRSGRVQRGQLLLLEAFGGGFTWGS
ALLRY
>Q8ZFT7 2.3.1.180~~~fabH~~~Beta-ketoacyl-[acyl-carrier-protein] synthase III~~~COG0332
MYTKILGTGSYLPVQVRSNADLEKMVDTSDEWIVTRTGIRERRIAGLDETVATMGFQAAEKALEMAGIDKDDIGLIIVAT
TSSSHAFPSSACQVQRMLGIKDAASFDLAAACAGFTYALSVADQYVKSGAVKHAIVIGSDVLSRALDPEDRGTIILFGDG
AGAVVLGASEQPGIMSTHLHADGRYGELLALPYPDRQQDQPAYVTMAGNEVFKVAVTELAHIVDETLQANNLDRTALDWL
VPHQANLRIISATAKKLGMGMDKVVITLDRHGNTSAASVPSAFDEAVRDGRIQRGQLVLLEAFGGGFTWGSALVRF
>O67505 1.3.1.9~~~fabI~~~Enoyl-[acyl-carrier-protein] reductase [NADH] FabI~~~COG0623
MGLLEGKRALITGVANERSIAYGIAKSFHREGAQLAFTYATPKLEKRVREIAKGFGSDLVVKCDVSLDEDIKNLKKFLEE
NWGSLDIIVHSIAYAPKEEFKGGVIDTSREGFKIAMDISVYSLIALTRELLPLMEGRNGAIVTLSYYGAEKVVPHYNVMG
IAKAALESTVRYLAYDIAKHGHRINAISAGPVKTLAAYSITGFHLLMEHTTKVNPFGKPITIEDVGDTAVFLCSDWARAI
TGEVVHVDNGYHIMGVFGREEEIKKEVYGD
>Q81GI3 1.3.1.9~~~fabI~~~Enoyl-[acyl-carrier-protein] reductase [NADH] FabI~~~
MELLQGKTFVVMGVANQRSIAWGIARSLHNAGAKLIFTYAGERLERNVRELADTLEGQESLVLPCDVTNDEELTACFETI
KQEVGTIHGVAHCIAFANRDDLKGEFVDTSRDGFLLAQNISAFSLTAVAREAKKVMTEGGNILTLTYLGGERVVKNYNVM
GVAKASLEASVKYLANDLGQHGIRVNAISAGPIRTLSAKGVGDFNSILREIEERAPLRRTTTQEEVGDTAVFLFSDLARG
VTGENIHVDSGYHILG
>P54616 1.3.1.9~~~fabI~~~Enoyl-[acyl-carrier-protein] reductase [NADH] FabI~~~COG0623
MNFSLEGRNIVVMGVANKRSIAWGIARSLHEAGARLIFTYAGERLEKSVHELAGTLDRNDSIILPCDVTNDAEIETCFAS
IKEQVGVIHGIAHCIAFANKEELVGEYLNTNRDGFLLAHNISSYSLTAVVKAARPMMTEGGSIVTLTYLGGELVMPNYNV
MGVAKASLDASVKYLAADLGKENIRVNSISAGPIRTLSAKGISDFNSILKDIEERAPLRRTTTPEEVGDTAAFLFSDMSR
GITGENLHVDSGFHITAR
>P0AEK4 1.3.1.9~~~fabI~~~Enoyl-[acyl-carrier-protein] reductase [NADH] FabI~~~COG0623
MGFLSGKRILVTGVASKLSIAYGIAQAMHREGAELAFTYQNDKLKGRVEEFAAQLGSDIVLQCDVAEDASIDTMFAELGK
VWPKFDGFVHSIGFAPGDQLDGDYVNAVTREGFKIAHDISSYSFVAMAKACRSMLNPGSALLTLSYLGAERAIPNYNVMG
LAKASLEANVRYMANAMGPEGVRVNAISAGPIRTLAASGIKDFRKMLAHCEAVTPIRRTVTIEDVGNSAAFLCSDLSAGI
SGEVVHVDGGFSIAAMNELELK
>O24990 1.3.1.9~~~fabI~~~Enoyl-[acyl-carrier-protein] reductase [NADH] FabI~~~COG0623
MGFLKGKKGLIVGVANNKSIAYGIAQSCFNQGATLAFTYLNESLEKRVRPIAQELNSPYVYELDVSKEEHFKSLYNSVKK
DLGSLDFIVHSVAFAPKEALEGSLLETSKSAFNTAMEISVYSLIELTNTLKPLLNNGASVLTLSYLGSTKYMAHYNVMGL
AKAALESAVRYLAVDLGKHHIRVNALSAGPIRTLASSGIADFRMILKWNEINAPLRKNVSLEEVGNAGMYLLSSLSSGVS
GEVHFVDAGYHVMGMGAVEEKDNKATLLWDLHKEQ
>Q9K151 1.3.1.9~~~fabI~~~Enoyl-[acyl-carrier-protein] reductase [NADH] FabI~~~
MGFLQGKKILITGMISERSIAYGIAKACREQGAELAFTYVVDKLEERVRKMAAELDSELVFRCDVASDDEINQVFADLGK
HWDGLDGLVHSIGFAPKEALSGDFLDSISREAFNTAHEISAYSLPALAKAARPMMRGRNSAIVALSYLGAVRAIPNYNVM
GMAKASLEAGIRFTAACLGKEGIRCNGISAGPIKTLAASGIADFGKLLGHVAAHNPLRRNVTIEEVGNTAAFLLSDLSSG
ITGEITYVDGGYSINALSTEG
>Q9ZFE4 1.3.1.9~~~fabI~~~Enoyl-[acyl-carrier-protein] reductase [NADH] FabI~~~
MGFLTGKRALIVGVASKLSIASGIAAAMHREGAELAFTYQNDKLRGRVEEFASGWGSRPELCFPCDVADDSQIEAVFAAL
GKHWDGLDIIVHSVGFAPGDQLDGDFTAVTTREGFRIAHDISAYSFIALAKAGREMMKGRNGSLLTLSYLGAERTMPNYN
VMGMAKASLEAGVRYLAGSLGAEGTRVNAVSAGPIRTLAASGIKSFRKMLAANERQTPLRRNVTIEEVGNAGAFLCSDLA
SGISGEILYVDGGFNTTAMGPLDDD
>P16657 1.3.1.9~~~fabI~~~Enoyl-[acyl-carrier-protein] reductase [NADH] FabI~~~
MGFLSGKRILVTGVASKLSIAYGIAQAMHREGAELAFTYQNDKLKGRVEEFAAQLGSSIVLPCDVAEDASIDAMFAELGN
VWPKFDGFVHSIGFAPGDQLDGDYVNAVTREGFKVAHDISSYSFVAMAKACRTMLNPGSALLTLSYLGAERAIPNYNVMG
LAKASLEANVRYMANAMGPEGVRVNAISAGPIRTLAASGIKDFRKMLAHCEAVTPIRRTVTIEDVGNSAAFLCSDLSAGI
SGEVVHVDGGFSIAAMNELELK
>Q2FZQ3 1.3.1.39~~~fabI~~~Enoyl-[acyl-carrier-protein] reductase [NADPH] FabI~~~COG0623
MLNLENKTYVIMGIANKRSIAFGVAKVLDQLGAKLVFTYRKERSRKELEKLLEQLNQPEAHLYQIDVQSDEEVINGFEQI
GKDVGNIDGVYHSIAFANMEDLRGRFSETSREGFLLAQDISSYSLTIVAHEAKKLMPEGGSIVATTYLGGEFAVQNYNVM
GVAKASLEANVKYLALDLGPDNIRVNAISASPIRTLSAKGVGGFNTILKEIEERAPLKRNVDQVEVGKTAAYLLSDLSSG
VTGENIHVDSGFHAIK
>Q6GI75 1.3.1.39~~~fabI~~~Enoyl-[acyl-carrier-protein] reductase [NADPH] FabI~~~
MLNLENKTYVIMGIANKRSIAFGVAKVLDQLGAKLVFTYRKERSRKELEKLLEQLNQPEAHLYQIDVQSDEEVINGFEQI
GKDVGNIDGVYHSIAFANMEDLRGRFSETSREGFLLAQDISSYSLTIVAHEAKKLMPEGGSIVATTYLGGEFAVQNYNVM
GVAKASLEANVKYLALDLGPDNIRVNAISAGPIRTLSAKGVGGFNTILKEIEERAPLKRNVDQVEVGKTAAYLLSDLSSG
VTGENIHVDSGFHAIK
>P71079 1.3.1.104~~~fabL~~~Enoyl-[acyl-carrier-protein] reductase [NADPH] FabL~~~COG1028
MEQNKCALVTGSSRGVGKAAAIRLAENGYNIVINYARSKKAALETAEEIEKLGVKVLVVKANVGQPAKIKEMFQQIDETF
GRLDVFVNNAASGVLRPVMELEETHWDWTMNINAKALLFCAQEAAKLMEKNGGGHIVSISSLGSIRYLENYTTVGVSKAA
LEALTRYLAVELSPKQIIVNAVSGGAIDTDALKHFPNREDLLEDARQNTPAGRMVEIKDMVDTVEFLVSSKADMIRGQTI
IVDGGRSLLV
>Q8DR19 5.3.3.14~~~fabM~~~Trans-2-decenoyl-[acyl-carrier-protein] isomerase~~~COG1024
MEHIIYQLEEDLAILTLNRPEVANGFHIPMCEEILEALTLAEENPAVHFILINANGKVFSVGGDLVEMKRAVDEDDIPSL
TKIAELVNTISYKIKQIAKPVLMEVDGAVAGAAANMAVAADFCLATDKAKFIQAFVGVGLAPDAGGIHLLSRSIGVTRAA
QLAMTGEALTAEKALEWGLVYRVSEAEKLEKTREQLLKKLRRASSNSYAAIKKLVWESQFKDWQGYATLELNLQKSLAQT
EDFKEGVRAHSERRRPKFIGK
>P0ACU5 ~~~fabR~~~HTH-type transcriptional repressor FabR~~~COG1309
MFILWYSASSTFGKDSDIVMGVRAQQKEKTRRSLVEAAFSQLSAERSFASLSLREVAREAGIAPTSFYRHFRDVDELGLT
MVDESGLMLRQLMRQARQRIAKGGSVIRTSVSTFMEFIGNNPNAFRLLLRERSGTSAAFRAAVAREIQHFIAELADYLEL
ENHMPRAFTEAQAEAMVTIVFSAGAEALDVGVEQRRQLEERLVLQLRMISKGAYYWYRREQEKTAIIPGNVKDE
>Q9KRA3 1.3.1.9~~~fabV~~~Enoyl-[acyl-carrier-protein] reductase [NADH] 1~~~COG3007
MIIKPKIRGFICTTTHPVGCEANVKEQIAYTKAQGPIKNAPKRVLVVGSSSGYGLSSRIAAAFGGGAATIGVFFEKPGTD
KKPGTAGFYNAAAFDKLAHEAGLYAKSLNGDAFSNEAKQKAIELIKQDLGQIDLVVYSLASPVRKMPDTGELVRSALKPI
GETYTSTAVDTNKDVIIEASVEPATEQEIADTVTVMGGQDWELWIQALEEAGVLAEGCKTVAYSYIGTELTWPIYWDGAL
GRAKMDLDRAATALNEKLAAKGGTANVAVLKSVVTQASSAIPVMPLYIAMVFKKMREQGVHEGCMEQIYRMFSQRLYKED
GSAPEVDDHNRLRLDDWELRDDIQQHCRDLWPQITTENLRELTDYDMYKEEFIKLFGFGIEGIDYDADVNPEVEFDVIDI
E
>Q5E6G3 1.3.1.9~~~fabV~~~Enoyl-[acyl-carrier-protein] reductase [NADH]~~~COG3007
MIIKPRIRGFICTTTHPVGCEQNVKEQIALTKAQGPIANAPKRVLVVGSSSGYGLSSRITAAFGGGASTIGVFFEKAGTE
KKPGTAGWYNSAAFDKFAKEEGLYSKSLNGDAFSNEAKQKTIDLIKEDLGQIDMVVYSLASPVRKMPETGEVIRSSLKPI
GETYTATAVDTNKDAIIEASVEPATEQEIKDTVTVMGGEDWELWINALSEAGVLADGCKTVAYSYIGTELTWPIYWDGAL
GQAKMDLDRAATALNEKLSATGGTANVAVLKSVVTQASSAIPVMPLYIAMVFKKMREEGVHEGCQEQILRMFSQRLYKAD
GSAAEVDEKNRLRLDDWELREDIQQHCRDLWPQVTTENLKDLTDYVEYKEEFLKLFGFGVDGVDYDADVNPEVNFDVADI
>Q62L02 1.3.1.9~~~fabV~~~Enoyl-[acyl-carrier-protein] reductase [NADH]~~~COG3007
MIIKPRVRGFICVTTHPAGCAASVREQIAYVARRGPIERGPKKVLVIGASTGYGLAARIAAAFGVGAATLGVFFERAPAD
AKPGTAGWYNSAAFHDEAAARGLQATSVNGDAFSDEIKHKTIDAIRRDLGQVDLVVYSVAAPRRTHPKTGVTHQSTLKPI
GHAVRLRGIDTDNEAIKETLLQPATPDEIADTVAVMGGEDWRMWIDALDAAGVLADGAKTTAFTYLGEQVTHDIYWNGSI
GEAKKDLDRTVLALRGKLAARGGDARVSVLKAVVTQASSAIPMMPLYLSLLFKVMKARGTHEGCIEQVDGLLRDSLYSAQ
PHVDAEGRLRADRLELDPAVQARVLELWDQVTDDNLYTLTDFAGYKAEFLRLFGFGIDGVDYDAPVEPNVRIPNLIE
>Q97LU2 1.3.1.44~~~fabV~~~Trans-2-enoyl-CoA reductase [NADH]~~~COG3007
MIVKAKFVKGFIRDVHPYGCRREVLNQIDYCKKAIGFRGPKKVLIVGASSGFGLATRISVAFGGPEAHTIGVSYETGATD
RRIGTAGWYNNIFFKEFAKKKGLVAKNFIEDAFSNETKDKVIKYIKDEFGKIDLFVYSLAAPRRKDYKTGNVYTSRIKTI
LGDFEGPTIDVERDEITLKKVSSASIEEIEETRKVMGGEDWQEWCEELLYEDCFSDKATTIAYSYIGSPRTYKIYREGTI
GIAKKDLEDKAKLINEKLNRVIGGRAFVSVNKALVTKASAYIPTFPLYAAILYKVMKEKNIHENCIMQIERMFSEKIYSN
EKIQFDDKGRLRMDDLELRKDVQDEVDRIWSNITPENFKELSDYKGYKKEFMNLNGFDLDGVDYSKDLDIELLRKLEP
>Q9HZP8 1.3.1.9~~~fabV~~~Enoyl-[acyl-carrier-protein] reductase [NADH]~~~
MIIKPRVRGFICVTTHPAGCEANVKQQIDYVEAKGPVVNGPKKVLVIGSSTGYGLAARITAAFGSGADTLGVFFERPGSE
SKPGTAGWYNSAAFEKFAHEKGLYARSINGDAFSDEVKRLTIETIKRDLGKVDLVVYSLAAPRRTHPKSGEVFSSTLKPI
GKSVSFRGLDTDKEVIKDVVLEAASDQEVADTVAVMGGEDWQMWIDALLEADVLADGAKTTAFTYLGEKITHDIYWNGSI
GAAKKDLDQKVLGIRDKLAPLGGDARVSVLKAVVTQASSAIPMMPLYLSLLFKVMKEQGTHEGCIEQVDGLYRESLYGAE
PRLDEEGRLRADYKELQPEVQSRVEELWDKVTNENLYELTDFAGYKSEFLNLFGFEVAGVDYEQDVNPDVQIANLIQA
>Q73Q47 1.3.1.44~~~fabV~~~Trans-2-enoyl-CoA reductase [NADH]~~~COG3007
MIVKPMVRNNICLNAHPQGCKKGVEDQIEYTKKRITAEVKAGAKAPKNVLVLGCSNGYGLASRITAAFGYGAATIGVSFE
KAGSETKYGTPGWYNNLAFDEAAKREGLYSVTIDGDAFSDEIKAQVIEEAKKKGIKFDLIVYSLASPVRTDPDTGIMHKS
VLKPFGKTFTGKTVDPFTGELKEISAEPANDEEAAATVKVMGGEDWERWIKQLSKEGLLEEGCITLAYSYIGPEATQALY
RKGTIGKAKEHLEATAHRLNKENPSIRAFVSVNKGLVTRASAVIPVIPLYLASLFKVMKEKGNHEGCIEQITRLYAERLY
RKDGTIPVDEENRIRIDDWELEEDVQKAVSALMEKVTGENAESLTDLAGYRHDFLASNGFDVEGINYEAEVERFDRI
>Q2P9J6 1.3.1.9~~~fabV~~~Enoyl-[acyl-carrier-protein] reductase [NADH]~~~
MIIHPKVRGFICTTTHPLGCERNVLEQIAATRARGVRNDGPKKVLVIGASSGYGLASRITAAFGFGADTLGVFFEKPGTA
SKAGTAGWYNSAAFDKHAKAAGLYSKSINGDAFSDAARAQVIELIKTEMGGQVDLVVYSLASPVRKLPGSGEVKRSALKP
IGQTYTATAIDTNKDTIIQASIEPASAQEIEETITVMGGQDWELWIDALEGAGVLADGARSVAFSYIGTEITWPIYWHGA
LGKAKVDLDRTAQRLNARLAKHGGGANVAVLKSVVTQASAAIPVMPLYISMVYKIMKEKGLHEGTIEQLDRLFRERLYRQ
DGQPAEVDEQNRLRLDDWELRDDVQDACKALWPQVTTENLFELTDYAGYKHEFLKLFGFGRTDVDYDADVATDVAFDCIE
LA
>Q8Z9U1 1.3.1.9~~~fabV~~~Enoyl-[acyl-carrier-protein] reductase [NADH]~~~COG3007
MIIKPRVRGFICVTAHPTGCEANVKKQIDYVTTEGPIANGPKRVLVIGASTGYGLAARITAAFGCGADTLGVFFERPGEE
GKPGTSGWYNSAAFHKFAAQKGLYAKSINGDAFSDEIKQLTIDAIKQDLGQVDQVIYSLASPRRTHPKTGEVFNSALKPI
GNAVNLRGLDTDKEVIKESVLQPATQSEIDSTVAVMGGEDWQMWIDALLDAGVLAEGAQTTAFTYLGEKITHDIYWNGSI
GAAKKDLDQKVLAIRESLAAHGGGDARVSVLKAVVTQASSAIPMMPLYLSLLFKVMKEKGTHEGCIEQVYSLYKDSLCGD
SPHMDQEGRLRADYKELDPEVQNQVQQLWDQVTNDNIYQLTDFVGYKSEFLNLFGFGIDGVDYDADVNPDVKIPNLIQG
>P0ADQ2 2.3.1.-~~~fabY~~~Probable acyltransferase FabY~~~COG0456
MSQLPGLSRETRESIAMYHLRVPQTEEELERYYQFRWEMLRKPLHQPKGSERDAWDAMAHHQMVVDEQGNLVAVGRLYIN
ADNEASIRFMAVHPDVQDKGLGTLMAMTLESVARQEGVKRVTCSAREDAVEFFAKLGFVNQGEITTPTTTPIRHFLMIKP
VATLDDILHRGDWCAQLQQAWYEHIPLSEKMGVRIQQYTGQKFITTMPETGNQNPHHTLFAGSLFSLATLTGWGLIWLML
RERHLGGTIILADAHIRYSKPISGKPHAVADLGALSGDLDRLARGRKARVQMQVEIFGDETPGAVFEGTYIVLPAKPFGP
YEEGGNEEE
>Q9HU15 2.3.1.180~~~fabY~~~Beta-ketoacyl-[acyl-carrier-protein] synthase FabY~~~
MSRLPVIVGFGGYNAAGRSSFHHGFRRMVIESMDPQARQETLAGLAVMMKLVKAEGGRYLAEDGTPLSPEDIERRYAERI
FASTLVRRIEPQYLDPDAVHWHKVLELSPAEGQALTFKASPKQLPEPLPANWSIAPAEDGEVLVSIHERCEFKVDSYRAL
TVKSAGQLPTGFEPGELYNSRFHPRGLQMSVVAATDAIRSTGIDWKTIVDNVQPDEIAVFSGSIMSQLDDNGFGGLMQSR
LKGHRVSAKQLPLGFNSMPTDFINAYVLGSVGMTGSITGACATFLYNLQKGIDVITSGQARVVIVGNSEAPILPECIEGY
SAMGALATEEGLRLIEGRDDVDFRRASRPFGENCGFTLAESSQYVVLMDDELALRLGADIHGAVTDVFINADGFKKSISA
PGPGNYLTVAKAVASAVQIVGLDTVRHASFVHAHGSSTPANRVTESEILDRVASAFGIDGWPVTAVKAYVGHSLATASAD
QLISALGTFKYGILPGIKTIDKVADDVHQQRLSISNRDMRQDKPLEVCFINSKGFGGNNASGVVLSPRIAEKMLRKRHGQ
AAFAAYVEKREQTRAAARAYDQRALQGDLEIIYNFGQDLIDEHAIEVSAEQVTVPGFSQPLVYKKDARFSDMLD
>Q2SWY7 4.2.1.59~~~fabZ~~~3-hydroxyacyl-[acyl-carrier-protein] dehydratase FabZ~~~
MRRTIMSTEKINFDIHKILTLLPHRYPILLVDRVLELEPHKSIKALKNVTVNEPFFTGHFPKRPVMPGVLIIEALAQAAA
LLTFAEAEPKDPENTLYYFVGIDNARFKRVVEPGDQLILNVTFERYIRGIWKFKAVAEVDGKVAAEAELMCTVKTADAAP
>A1VXZ7 4.2.1.59~~~fabZ~~~3-hydroxyacyl-[acyl-carrier-protein] dehydratase FabZ~~~COG0764
MIDVMQIQEILPHRYPFLLVDKITELKVKEVVLGYKNISISDHVFMGHFPGHPIYPGVLILEGMAQTGGVLAFESMEDKV
DPKSKVVYFTGIDGAKFRNPVRPGDRLDYEMSVVKNRGNMWIFKGQAFVDGNLVAEAELKAMIVDK
>P0A6Q6 4.2.1.59~~~fabZ~~~3-hydroxyacyl-[acyl-carrier-protein] dehydratase FabZ~~~COG0764
MTTNTHTLQIEEILELLPHRFPFLLVDRVLDFEEGRFLRAVKNVSVNEPFFQGHFPGKPIFPGVLILEAMAQATGILAFK
SVGKLEPGELYYFAGIDEARFKRPVVPGDQMIMEVTFEKTRRGLTRFKGVALVDGKVVCEATMMCARSREA
>Q5NEQ0 4.2.1.59~~~fabZ~~~3-hydroxyacyl-[acyl-carrier-protein] dehydratase FabZ~~~COG0764
MSQFNQNNKQIDVMGIRKILPHRYPFALLDKIVDWSVEDRTIVAQKNVTINEDFFNGHFPDFPVMPGVLIVEAMAQATAI
LGELMAETLFAHVVEKAGGGRRTFMLAGIDKVRVKRPVVPGDVLVIESRMVKQKNIICTAESVAKVDGQIVCSAELMAAY
KDY
>O25928 4.2.1.59~~~fabZ~~~3-hydroxyacyl-[acyl-carrier-protein] dehydratase FabZ~~~COG0764
MEQSHQNLQSQFFIEHILQILPHRYPMLLVDRIIELQANKKIVAYKNITFNEDVFNGHFPNKPIFPGVLIVEGMAQTGGF
LAFTSLWGFDPEIAKTKIVYFMTIDKVKFRIPVTPGDRLEYHLEVLKHKGMIWQVGGTAQVDGKVVAEAELKAMIAERD
>A1KRL1 4.2.1.59~~~fabZ~~~3-hydroxyacyl-[acyl-carrier-protein] dehydratase FabZ~~~
MDVQLPIEAKDIQKLIPHRYPFLQLDRITAFEPMKTLTAIKNVSINEPQFQGHFPDLPVMPGVLIIEAMAQACGTLAILS
EGGRKENEFFFFAGIDEARFKRQVIPGDQLVFEVELLTSRRGIGKFNAVAKVDGQVAVEAIIMCAKRVV
>Q9HXY7 4.2.1.59~~~fabZ~~~3-hydroxyacyl-[acyl-carrier-protein] dehydratase FabZ~~~
MMDINEIREYLPHRYPFLLVDRVVELDIEGKRIRAYKNVSINEPFFNGHFPEHPIMPGVLIIEAMAQAAGILGFKMLDVK
PADGTLYYFVGSDKLRFRQPVLPGDQLQLHAKFISVKRSIWKFDCHATVDDKPVCSAEIICAERKL
>Q1CAM3 4.2.1.59~~~fabZ~~~3-hydroxyacyl-[acyl-carrier-protein] dehydratase FabZ~~~
MTTDTHTLHIEEILDLLPHRFPFLLVDRVLDFEEGKFLRAVKNVSFNEPFFQGHFPGKPIFPGVLILEAMAQATGILAFK
SRGKLEPGELYYFAGIDEARFKRPVVPGDQMIMEVEFVKERRGLTRFTGVAKVDGEIVCTATMMCARSKPAAPAESVVVK
PDVVKPDVVNPVVKES
>Q8ZH57 4.2.1.59~~~fabZ~~~3-hydroxyacyl-[acyl-carrier-protein] dehydratase FabZ~~~COG0764
MTTDTHTLHIEEILDLLPHRFPFLLVDRVLDFEEGKFLRAVKNVSFNEPFFQGHFPGKPIFPGVLILEAMAQATGILAFK
SRGKLEPGELYYFAGIDEARFKRPVVPGDQMIMEVEFVKERRGLTRFTGVAKVDGEIVCTATMMCARSKPAAPAESVVVK
PDVVKPDVVKPDVVNPVVKES
>P9WQ37 6.2.1.3~~~~~~Long-chain-fatty-acid--CoA ligase FadD13~~~COG0318
MKNIGWMLRQRATVSPRLQAYVEPSTDVRMTYAQMNALANRCADVLTALGIAKGDRVALLMPNSVEFCCLFYGAAKLGAV
AVPINTRLAAPEVSFILSDSGSKVVIYGAPSAPVIDAIRAQADPPGTVTDWIGADSLAERLRSAAADEPAVECGGDDNLF
IMYTSGTTGHPKGVVHTHESVHSAASSWASTIDVRYRDRLLLPLPMFHVAALTTVIFSAMRGVTLISMPQFDATKVWSLI
VEERVCIGGAVPAILNFMRQVPEFAELDAPDFRYFITGGAPMPEALIKIYAAKNIEVVQGYALTESCGGGTLLLSEDALR
KAGSAGRATMFTDVAVRGDDGVIREHGEGEVVIKSDILLKEYWNRPEATRDAFDNGWFRTGDIGEIDDEGYLYIKDRLKD
MIISGGENVYPAEIESVIIGVPGVSEVAVIGLPDEKWGEIAAAIVVADQNEVSEQQIVEYCGTRLARYKLPKKVIFAEAI
PRNPTGKILKTVLREQYSATVPK
>O53521 6.2.1.3~~~~~~Long-chain-fatty-acid--CoA ligase FadD15~~~COG1022
MREISVPAPFTVGEHDNVAAMVFEHERDDPDYVIYQRLIDGVWTDVTCAEAANQIRAAALGLISLGVQAGDRVVIFSATR
YEWAILDFAILAVGAVTVPTYETSSAEQVRWVLQDSEAVVLFAETDSHATMVAELSGSVPALREVLQIAGSGPNALDRLT
EAGASVDPAELTARLAALRSTDPATLIYTSGTTGRPKGCQLTQSNLVHEIKGARAYHPTLLRKGERLLVFLPLAHVLARA
ISMAAFHSKVTVGFTSDIKNLLPMLAVFKPTVVVSVPRVFEKVYNTAEQNAANAGKGRIFAIAAQTAVDWSEACDRGGPG
LLLRAKHAVFDRLVYRKLRAALGGNCRAAVSGGAPLGARLGHFYRGAGLTIYEGYGLSGTSGGVAISQFNDLKIGTVGKP
VPGNSLRIADDGELLVRGGVVFSGYWRNEQATTEAFTDGWFKTGDLGAVDEDGFLTITGRKKEIIVTAGGKNVAPAVLED
QLRAHPLISQAVVVGDAKPFIGALITIDPEAFEGWKQRNSKTAGASVGDLATDPDLIAEIDAAVKQANLAVSHAESIRKF
RILPVDFTEDTGELTPTMKVKRKVVAEKFASDIEAIYNKE
>O53551 6.2.1.2~~~~~~Medium/long-chain-fatty-acid--CoA ligase FadD17~~~COG0318
MTPTHPTVTELLLPLSEIDDRGVYFEDSFTSWRDHIRHGAAIAAALRERLDPARPPHVGVLLQNTPFFSATLVAGALSGI
VPVGLNPVRRGAALAGDIAKADCQLVLTGSGSAEVPADVEHINVDSPEWTDEVAAHRDTEVRFRSADLADLFMLIFTSGT
SGDPKAVKCSHRKVAIAGVTITQRFSLGRDDVCYVSMPLFHSNAVLVGWAVAAACQGSMALRRKFSASQFLADVRRYGAT
YANYVGKPLSYVLATPELPDDADNPLRAVYGNEGVPGDIDRFGRRFGCVVMDGFGSTEGGVAITRTLDTPAGALGPLPGG
IQIVDPDTGEPCPTGVVGELVNTAGPGGFEGYYNDEAAEAERMAGGVYHSGDLAYRDDAGYAYFAGRLGDWMRVDGENLG
TAPIERVLMRYPDATEVAVYPVPDPVVGDQVMAALVLAPGTKFDADKFRAFLTEQPDLGHKQWPSYVRVSAGLPRTMTFK
VIKRQLSAEGVACADPVWPIRR
>P9WQ51 6.2.1.2~~~~~~Medium/long-chain-fatty-acid--CoA/3-oxocholest-4-en-26-oate--CoA ligase~~~COG0318
MAVALNIADLAEHAIDAVPDRVAVICGDEQLTYAQLEDKANRLAHHLIDQGVQKDDKVGLYCRNRIEIVIAMLGIVKAGA
ILVNVNFRYVEGELRYLFDNSDMVALVHERRYADRVANVLPDTPHVRTILVVEDGSDQDYRRYGGVEFYSAIAAGSPERD
FGERSADAIYLLYTGGTTGFPKGVMWRHEDIYRVLFGGTDFATGEFVKDEYDLAKAAAANPPMIRYPIPPMIHGATQSAT
WMALFSGQTTVLAPEFNADEVWRTIHKHKVNLLFFTGDAMARPLVDALVKGNDYDLSSLFLLASTAALFSPSIKEKLLEL
LPNRVITDSIGSSETGFGGTSVVAAGQAHGGGPRVRIDHRTVVLDDDGNEVKPGSGMRGVIAKKGNIPVGYYKDEKKTAE
TFRTINGVRYAIPGDYAQVEEDGTVTMLGRGSVSINSGGEKVYPEEVEAALKGHPDVFDALVVGVPDPRYGQQVAAVVQA
RPGCRPSLAELDSFVRSEIAGYKVPRSLWFVDEVKRSPAGKPDYRWAKEQTEARPADDVHAGHVTSGG
>E3UUE6 6.2.1.42~~~~~~3-oxocholest-4-en-26-oate--CoA ligase~~~
MALNIADLVEHAIDLVPERVALASDGREVTYAQLEERANRLAHYLREQGVEPGDKVGIYSRNTIEAVEAMIAVFKIRAIM
INVNYRYVENELQYIFDNSDMVALIHERRYSDKVANVLPSTPLVKTVVVVEDGTDVDFSAYGGIEYEAALAQSSPERDFE
DRSADDIYILYTGGTTGHPKGVMWRHEDVWRVLGGGINFMTGEWVKDEWQLAKEGAENPGLVRYPIPPMIHGGAQWALFQ
SLFSGGKVIMHPEFSGHEVWRIIDDHKVNVIFITGDAMARPMLDALEEGNPKTGKPYDLSTLFAMASSAALFSPSIKDRF
LDLLPGKIITDSIGSSETGFGGIGIAEKGKTLGGGPTVKIDESTTVLDDDGNPIEPGSGKVGMVARTGNIPLGYYKDEAK
TKATFREYNGIRYSIPGDYARVEADGTVTMLGRGSVSINSGGEKVYPEEVEGALKQHPAVFDALVVGVPDERFGERVSAV
VALRDGEQVTLDELMTTARSKIAGYKVPRAVWFVDEIKRSPAGKPDYRWAKDQTGLRPADEVYNNGDGNGAAATG
>O05307 6.2.1.2~~~fadD6~~~Medium/long-chain-fatty-acid--CoA ligase FadD6~~~COG0318
MSDYYGGAHTTVRLIDLATRMPRVLADTPVIVRGAMTGLLARPNSKASIGTVFQDRAARYGDRVFLKFGDQQLTYRDANA
TANRYAAVLAARGVGPGDVVGIMLRNSPSTVLAMLATVKCGAIAGMLNYHQRGEVLAHSLGLLDAKVLIAESDLVSAVAE
CGASRGRVAGDVLTVEDVERFATTAPATNPASASAVQAKDTAFYIFTSGTTGFPKASVMTHHRWLRALAVFGGMGLRLKG
SDTLYSCLPLYHNNALTVAVSSVINSGATLALGKSFSASRFWDEVIANRATAFVYIGEICRYLLNQPAKPTDRAHQVRVI
CGNGLRPEIWDEFTTRFGVARVCEFYAASEGNSAFINIFNVPRTAGVSPMPLAFVEYDLDTGDPLRDASGRVRRVPDGEP
GLLLSRVNRLQPFDGYTDPVASEKKLVRNAFRDGDCWFNTGDVMSPQGMGHAAFVDRLGDTFRWKGENVATTQVEAALAS
DQTVEECTVYGVQIPRTGGRAGMAAITLRAGAEFDGQALARTVYGHLPGYALPLFVRVVGSLAHTTTFKSRKVELRNQAY
GADIEDPLYVLAGPDEGYVPYYAEYPEEVSLGRRPQG
>O06417 6.2.1.2~~~fadD8~~~Medium/long-chain-fatty-acid--CoA ligase FadD8~~~COG0318
MSTAGDDAVGVPPACGGRSDAVGVPQLARESGAMRDQDCSGELLRSPTHNGHLLVGALKRHQNKPVLFLGDTRLTGGQLA
DRISQYIQAFEALGAGTGVAVGLLSLNRPEVLMIIGAGQARGYRRTALHPLGSLADHAYVLNDAGISSLIIDPNPMFVER
ALALLEQVDSLQQILTIGPVPDALKHVAVDLSAEAAKYQPQPLVAADLPPDQVIGLTYTGGTTGKPKGVIGTAQSIATMT
SIQLAEWEWPANPRFLMCTPLSHAGAAFFTPTVIKGGEMIVLAKFDPAEVLRIIEEQRITATMLVPSMLYALLDHPDSHT
RDLSSLETVYYGASAINPVRLAEAIRRFGPIFAQYYGQSEAPMVITYLAKGDHDEKRLTSCGRPTLFARVALLDEHGKPV
KQGEVGEICVSGPLLAGGYWNLPDETSRTFKDGWLHTGDLAREDSDGFYYIVDRVKDMIVTGGFNVFPREVEDVVAEHPA
VAQVCVVGAPDEKWGEAVTAVVVLRSNAARDEPAIEAMTAEIQAAVKQRKGSVQAPKRVVVVDSLPLTGLGKPDKKAVRA
RFWEGAGRAVG
>P9WQ49 6.2.1.-~~~~~~Putative fatty-acid--CoA ligase FadD21~~~COG0318
MSDSSVLSLLRERAGLQPDDAAFTYIDYEQDWAGITETLTWSEVFRRTRIVAHEVRRHCTTGDRAVILAPQGLAYIAAFL
GSMQAGAIAVPLSVPQIGSHDERVSAVLADASPSVILTTSAVAEAVAEHIHRPNTNNVGPIIEIDSLDLTGNSPSFRVKD
LPSAAYLQYTSGSTRAPAGVMISHRNLQANFQQLMSNYFGDRNGVAPPDTTIVSWLPFYHDMGLVLGIIAPILGGYRSEL
TSPLAFLQRPARWLHSLANGSPSWSAAPNFAFELAVRKTTDADIEGLDLGNVLGITSGAERVHPNTLSRFCNRFAPYNFR
EDMIRPSYGLAEATLYVASRNSGDKPEVVYFEPDKLSTGSANRCEPKTGTPLLSYGMPTSPTVRIVDPDTCIECPAGTIG
EIWVKGDNVAEGYWNKPDETRHTFGAMLVHPSAGTPDGSWLRTGDLGFLSEDEMFIVGRMKDMLIVYGRNHYPEDIESTV
QEITGGRVAAISVPVDHTEKLVTVIELKLLGDSAGEAMDELDVIKNNVTAAISRSHGLNVADLVLVPPGSIPTTTSGKIR
RAACVEQYRLQQFTRLDG
>P9WQ45 6.2.1.-~~~~~~Putative fatty-acid--CoA ligase fadD25~~~COG0318
MSVVESSLPGVLRERASFQPNDKALTFIDYERSWDGVEETLTWSQLYRRTLNLAAQLREHGSTGDRALILAPQSLDYVVS
FIASLQAGIVAVPLSIPQGGAHDERTVSVFADTAPAIVLTASSVVDNVVEYVQPQPGQNAPAVIEVDRLDLDARPSSGSR
SAAHGHPDILYLQYTSGSTRTPAGVMVSNKNLFANFEQIMTSYYGVYGKVAPPGSTVVSWLPFYHDMGFVLGLILPILAG
IPAVLTSPIGFLQRPARWIQMLASNTLAFTAAPNFAFDLASRKTKDEDMEGLDLGGVHGILNGSERVQPVTLKRFIDRFA
PFNLDPKAIRPSYGMAEATVYVATRKAGQPPKIVQFDPQKLPDGQAERTESDGGTPLVSYGIVDTQLVRIVDPDTGIERP
AGTIGEIWVHGDNVAIGYWQKPEATERTFSATIVNPSEGTPAGPWLRTGDSGFLSEGELFIMGRIKDLLIVYGRNHSPDD
IEATIQTISPGRCAAIAVSEHGAEKLVAIIELKKKDESDDEAAERLGFVKREVTSAISKSHGLSVADLVLVSPGSIPITT
SGKIRRAQCVELYRQDEFTRLDA
>P96843 6.2.1.41~~~fadD3~~~3-[(3aS,4S,7aS)-7a-methyl-1,5-dioxo-octahydro-1H-inden-4-yl]propanoyl:CoA ligase~~~COG0318
MINDLRTVPAALDRLVRQLPDHTALIAEDRRFTSTELRDAVYGAAAALIALGVEPADRVAIWSPNTWHWVVACLAIHHAG
AAVVPLNTRYTATEATDILDRAGAPVLFAAGLFLGADRAAGLDRAALPALRHVVRVPVEADDGTWDEFIATGAGALDAVA
ARAAAVAPQDVSDILFTSGTTGRSKGVLCAHRQSLSASASWAANGKITSDDRYLCINPFFHNFGYKAGILACLQTGATLI
PHVTFDPLHALRAIERHRITVLPGPPTIYQSLLDHPARKDFDLSSLRFAVTGAATVPVVLVERMQSELDIDIVLTAYGLT
EANGMGTMCRPEDDAVTVATTCGRPFADFELRIADDGEVLLRGPNVMVGYLDDTEATAAAIDADGWLHTGDIGAVDQAGN
LRITDRLKDMYICGGFNVYPAEVEQVLARMDGVADAAVIGVPDQRLGEVGRAFVVARPGTGLDEASVIAYTREHLANFKT
PRSVRFVDVLPRNAAGKVSKPQLRELG
>Q0S7V5 6.2.1.41~~~fadD3~~~3-[(3aS,4S,7aS)-7a-methyl-1,5-dioxo-octahydro-1H-inden-4-yl]propanoyl:CoA ligase~~~COG0318
MTEQPTTTPSALKRAAREFGELTAVADGDVRLTFTQLHDRVRDFAAALSSQDVRPGDHVAVWSPNTYHWVVAALGIHYAG
ATLVPINTRYTATEALDILERTKTTALVVAGNFLGTDRYASLRDESSTFDLPTVVRVPVDGGDAELPGVFDFDDFLALAD
EDTRAEADARAAAVSPDDVSDVMFTSGTTGRSKGVMSAHRQSVGIAQAWGECAEVTSDDNYLIINPFFHTFGYKAGFLVC
LLNGATVVPMAVFDVPKVMATVHDEQITVLPGAPTIFQSILDHPDRPKYDLSSLRVAITGAAAVPVALVERMQSELSFDA
VLTAYGQTEAVVVTMCRTDDDPVTVSTTSGRAIPGMEVRIGDQGEILVRGENVMLGYLDDPESTAKTIDADGWLHTGDVG
TLDDRGYVDITDRLKDMYISGGFNVYPAEVENALARLDGVAESAVIGVPDERMGEVGRAYVVAKPGVTLAEDDVVAFCKE
RLANFKVPRSVRFVDSLPRNPSGKVMKNVLREEKK
>A0R1Y7 2.3.1.9~~~~~~Probable acetyl-CoA acetyltransferase~~~COG0183
MIVAGARTPVGKLMGSLKDFSGTDLGAIAIRAALEKANVPASMVEYVIMGQVLTAGAGQMPARQAAVAAGIPWDVAALSI
NKMCLSGIDAIALADQLIRAGEFDVIVAGGQESMSQAPHLLPKSREGYKYGDATLVDHLAYDGLHDVFTDQPMGALTEQR
NDVDKFTRAEQDEYAAQSHQKAAAAWKDGVFADEVVPVSIPQRKGDPIEFAEDEGIRANTTAESLAGLKPAFRKDGTITA
GSASQISDGAAAVIVMNKAKAEELGLTWLAEIGAHGVVAGPDSTLQSQPANAIKKAITREGITVDQLDVIEINEAFAAVA
LASTKELGVDPAKVNVNGGAIAIGHPIGMSGARIALHAALELARRGSGYAVAALCGAGGQGDALVLRR
>P9WG69 2.3.1.9~~~fadA4~~~Probable acetyl-CoA acetyltransferase~~~COG0183
MTTSVIVAGARTPIGKLMGSLKDFSASELGAIAIKGALEKANVPASLVEYVIMGQVLTAGAGQMPARQAAVAAGIGWDVP
ALTINKMCLSGIDAIALADQLIRAREFDVVVAGGQESMTKAPHLLMNSRSGYKYGDVTVLDHMAYDGLHDVFTDQPMGAL
TEQRNDVDMFTRSEQDEYAAASHQKAAAAWKDGVFADEVIPVNIPQRTGDPLQFTEDEGIRANTTAAALAGLKPAFRGDG
TITAGSASQISDGAAAVVVMNQEKAQELGLTWLAEIGAHGVVAGPDSTLQSQPANAINKALDREGISVDQLDVVEINEAF
AAVALASIRELGLNPQIVNVNGGAIAVGHPLGMSGTRITLHAALQLARRGSGVGVAALCGAGGQGDALILRAG
>I6XHI4 2.3.1.16~~~fadA5~~~Steroid 3-ketoacyl-CoA thiolase~~~COG0183
MGYPVIVEATRSPIGKRNGWLSGLHATELLGAVQKAVVDKAGIQSGLHAGDVEQVIGGCVTQFGEQSNNISRVAWLTAGL
PEHVGATTVDCQCGSGQQANHLIAGLIAAGAIDVGIACGIEAMSRVGLGANAGPDRSLIRAQSWDIDLPNQFEAAERIAK
RRGITREDVDVFGLESQRRAQRAWAEGRFDREISPIQAPVLDEQNQPTGERRLVFRDQGLRETTMAGLGELKPVLEGGIH
TAGTSSQISDGAAAVLWMDEAVARAHGLTPRARIVAQALVGAEPYYHLDGPVQSTAKVLEKAGMKIGDIDIVEINEAFAS
VVLSWARVHEPDMDRVNVNGGAIALGHPVGCTGSRLITTALHELERTDQSLALITMCAGGALSTGTIIERI
>I6XHJ3 2.3.1.16~~~fadA6~~~Steroid 3-ketoacyl-CoA thiolase FadA6~~~COG0183
MPRVDDDAVGVPLTGNGRGAVMTEAYVIDAVRTAVGKRGGALAGIHPVDLGALAWRGLLDRTDIDPAAVDDVIAGCVDAI
GGQAGNIARLSWLAAGYPEEVPGVTVDRQCGSSQQAISFGAQAIMSGTADVIVAGGVQNMSQIPISSAMTVGEQFGFTSP
TNESKQWLHRYGDQEISQFRGSELIAEKWNLSREEMERYSLTSHERAFAAIRAGHFENEIITVETESGPFRVDEGPRESS
LEKMAGLQPLVEGGRLTAAMASQISDGASAVLLASERAVKDHGLRPRARIHHISARAADPVFMLTGPIPATRYALDKTGL
AIDDIDTVEINEAFAPVVMAWLKEIKADPAKVNPNGGAIALGHPLGATGAKLFTTMLGELERIGGRYGLQTMCEGGGTAN
VTIIERL
>O32177 2.3.1.16~~~fadA~~~3-ketoacyl-CoA thiolase~~~COG0183
MKEAVIVSGARTPVGKAKKGSLATVRPDDLGAICVKETLKRAGGYEGNIDDLIIGCATPEAEQGLNMARNIGALAGLPYT
VPAITVNRYCSSGLQSIAYAAEKIMLGAYDTAIAGGAESMSQVPMMGHVTRPNLALAEKAPEYYMSMGHTAEQVAKKYGV
SREDQDAFAVRSHQNAAKALAEGKFKDEIVPVEVTVTEIGEDHKPMEKQFVFSQDEGVRPQTTADILSTLRPAFSVDGTV
TAGNSSQTSDGAAAVMLMDREKADALGLAPLVKFRSFAVGGVPPEVMGIGPVEAIPRALKLAGLQLQDIGLFELNEAFAS
QAIQVIRELGIDEEKVNVNGGAIALGHPLGCTGTKLTLSLIHEMKRRNEQFGVVTMCIGGGMGAAGVFELC
>P21151 2.3.1.16~~~fadA~~~3-ketoacyl-CoA thiolase FadA~~~COG0183
MEQVVIVDAIRTPMGRSKGGAFRNVRAEDLSAHLMRSLLARNPALEAAALDDIYWGCVQQTLEQGFNIARNAALLAEVPH
SVPAVTVNRLCGSSMQALHDAARMIMTGDAQACLVGGVEHMGHVPMSHGVDFHPGLSRNVAKAAGMMGLTAEMLARMHGI
SREMQDAFAARSHARAWAATQSAAFKNEIIPTGGHDADGVLKQFNYDEVIRPETTVEALATLRPAFDPVNGMVTAGTSSA
LSDGAAAMLVMSESRAHELGLKPRARVRSMAVVGCDPSIMGYGPVPASKLALKKAGLSASDIGVFEMNEAFAAQILPCIK
DLGLIEQIDEKINLNGGAIALGHPLGCSGARISTTLLNLMERKDVQFGLATMCIGLGQGIATVFERV
>P28790 2.3.1.16~~~fadA~~~3-ketoacyl-CoA thiolase~~~COG0183
MSLNPRDVVIVDFGRTPMGRSKGGMHRNTRAEDMSAHLISKVLERNSKVDPGEVEDVIWGCVNQTLEQGWNIARMASLMT
QIPHTSAAQTVSRLCGSSMSALHTAAQAIMTGNGDVFVVGGVEHMGHVSMMHGVDPNPHMSLYAAKASGMMGLTAEMLGK
MHGISREQQDAFAVRSHQLAHKATVEGKFKDEIIPMQGYDENGFLKIFDYDETIRPDTTLESLAALKPAFNPKGGTVTAG
TSSQITDGASCMIVMSAQRAKDLGLEPLAVIRSMAVAGVDPAIMGYGPVPATQKALKRAGLNMADIDFIELNEAFAAQAL
PVLKDLKVLDKMNEKVNLHGGAIALGHPFGCSGARISGTLLNVMKQNGGTFGLSTMCIGLGQGIATVFERV
>P0A2H7 2.3.1.16~~~fadA~~~3-ketoacyl-CoA thiolase~~~
MEQVVIVDAIRTPMGRSKGGAFRNVRAEDLSAHLMRSLLARNPSLTAATLDDIYWGCVQQTLEQGFNIARNAALLAEIPH
SVPAVTVNRLCGSSMQALHDAARMIMTGDAQVCLVGGVEHMGHVPMSHGVDFHPGLSRNVAKAAGMMGLTAEMLSRLHGI
SREMQDQFAARSHARAWAATQSGAFKTEIIPTGGHDADGVLKQFNYDEVIRPETTVEALSTLRPAFDPVSGTVTAGTSSA
LSDGAAAMLVMSESRARELGLKPRARIRSMAVVGCDPSIMGYGPVPASKLALKKAGLSASDIDVFEMNEAFAAQILPCIK
DLGLMEQIDEKINLNGGAIALGHPLGCSGARISTTLINLMERKDAQFGLATMCIGLGQGIATVFERV
>P9WNP7 1.1.1.157~~~fadB2~~~3-hydroxybutyryl-CoA dehydrogenase~~~COG1250
MSDAIQRVGVVGAGQMGSGIAEVSARAGVEVTVFEPAEALITAGRNRIVKSLERAVSAGKVTERERDRALGLLTFTTDLN
DLSDRQLVIEAVVEDEAVKSEIFAELDRVVTDPDAVLASNTSSIPIMKVAAATKQPQRVLGLHFFNPVPVLPLVELVRTL
VTDEAAAARTEEFASTVLGKQVVRCSDRSGFVVNALLVPYLLSAIRMVEAGFATVEDVDKAVVAGLSHPMGPLRLSDLVG
LDTLKLIADKMFEEFKEPHYGPPPLLLRMVEAGQLGKKSGRGFYTY
>P94549 4.2.1.17~~~fadB~~~Probable enoyl-CoA hydratase~~~COG1024
MNAISLAVDQFVAVLTIHNPPANALSSRILEELSSCLDQCETDAGVRSIIIHGEGRFFSAGADIKEFTSLKGNEDSSLLA
ERGQQLMERIESFPKPIIAAIHGAALGGGLELAMACHIRIAAEDAKLGLPELNLGIIPGFAGTQRLPRYVGTAKALELIG
SGEPISGKEALDLGLVSIGAKDEAEVIEKAKALAAKFAEKSPQTLASLLELLYSNKVYSYEGSLKLEAKRFGEAFESEDA
KEGIQAFLEKRKPQFKGE
>P21177 ~~~fadB~~~Fatty acid oxidation complex subunit alpha~~~COG1024
MLYKGDTLYLDWLEDGIAELVFDAPGSVNKLDTATVASLGEAIGVLEQQSDLKGLLLRSNKAAFIVGADITEFLSLFLVP
EEQLSQWLHFANSVFNRLEDLPVPTIAAVNGYALGGGCECVLATDYRLATPDLRIGLPETKLGIMPGFGGSVRMPRMLGA
DSALEIIAAGKDVGADQALKIGLVDGVVKAEKLVEGAKAVLRQAINGDLDWKAKRQPKLEPLKLSKIEATMSFTIAKGMV
AQTAGKHYPAPITAVKTIEAAARFGREEALNLENKSFVPLAHTNEARALVGIFLNDQYVKGKAKKLTKDVETPKQAAVLG
AGIMGGGIAYQSAWKGVPVVMKDINDKSLTLGMTEAAKLLNKQLERGKIDGLKLAGVISTIHPTLDYAGFDRVDIVVEAV
VENPKVKKAVLAETEQKVRQDTVLASNTSTIPISELANALERPENFCGMHFFNPVHRMPLVEIIRGEKSSDETIAKVVAW
ASKMGKTPIVVNDCPGFFVNRVLFPYFAGFSQLLRDGADFRKIDKVMEKQFGWPMGPAYLLDVVGIDTAHHAQAVMAAGF
PQRMQKDYRDAIDALFDANRFGQKNGLGFWRYKEDSKGKPKKEEDAAVEDLLAEVSQPKRDFSEEEIIARMMIPMVNEVV
RCLEEGIIATPAEADMALVYGLGFPPFHGGAFRWLDTLGSAKYLDMAQQYQHLGPLYEVPEGLRNKARHNEPYYPPVEPA
RPVGDLKTA
>P28793 ~~~fadB~~~Fatty acid oxidation complex subunit alpha~~~COG1024
MIYEGKAITVTALESGIVELKFDLKGESVNKFNRLTLNELRQAVDAIKADASVKGVIVSSGKDVFIVGADITEFVENFKL
PDAELIAGNLEANKIFSDFEDLNVPTVAAINGIALGGGLEMCLAADFRVMADSAKIGLPEVKLGIYPGFGGTVRLPRLIG
VDNAVEWIASGKENRAEDALKVSAVDAVVTADKLGAAALDLIKRAISGELDYKAKRQPKLEKLKLNAIEQMMAFETAKGF
VAGQAGPNYPAPVEAIKTIQKAANFGRDKALEVEAAGFAKLAKTSASNCLIGLFLNDQELKKKAKVYDKIAKDVKQAAVL
GAGIMGGGIAYQSASKGTPILMKDINEHGIEQGLAEAAKLLVGRVDKGRMTPAKMAEVLNGIRPTLSYGDFGNVDLVVEA
VVENPKVKQAVLAEVENHVREDAILASNTSTISISLLAKALKRPENFVGMHFFNPVHMMPLVEVIRGEKSSDLAVATTVA
YAKKMGKNPIVVNDCPGFLVNRVLFPYFGGFAKLVSAGVDFVRIDKVMEKFGWPMGPAYLMDVVGIDTGHHGRDVMAEGF
PDRMKDDRRSAIDALYEAKRLGQKNGKGFYAYEADKKGKQKKLVDSSVLEVLKPIVYEQRDVTDEDIINWMMIPLCLETV
RCLEDGIVETAAEADMGLVYGIGFPLFRGGALRYIDSIGVAEFVALADQYAELGALYHPTAKLREMAKNGQSFFG
>Q3L887 1.3.8.8~~~fadE5~~~Broad-specificity linear acyl-CoA dehydrogenase FadE5~~~COG1960
MSHYKSNVRDQVFNLFEVFGVDKVLGADKFSDLDADTAREMLTEIARLAEGPIAESFVEGDRNPPVFDPETHTVTLPEGF
KKSMRALFDGGWDKVGLAEHLGGIPMPRALQWALIEHILGANPAAYMYAMGPGMSEIFYNNGTDEQKKWATIAAERGWGA
TMVLTEPDAGSDVGAGRTKAVQQPDGTWHIEGVKRFITSADSDDLFENIMHLVLARPEGAGPGTKGLSLFFVPKFHFDHE
TGEIGERNGVFVTNVEHKMGLKVSATCELSLGQHGIPAVGWLVGEVHNGIAQMFDVIEQARMMVGTKAIATLSTGYLNAL
EYAKERVQGADMTQMTDKTAPRVTITHHPDVRRSLMTQKAYAEGLRAIYLYTATFQDAEVAQAVHGVDGDLAARVNDLLL
PIVKGFGSETAYAKLTESLQTLGGSGFLQDYPIEQYIRDSKIDSLYEGTTAIQAQDFFFRKIIRDKGQALAYVAGEIEQF
IKNENGNGRLKTERELLATALADVQGMAASLTGYLMAAQEDAASIYKVGLGSVRFLMAVGDLLSGWLLARQAAVAIEKLD
AGATGADKSFYEGKIAAASFFAKNMLPLLTSTRQIIENLDNDVMELDEAAF
>O53666 1.3.8.8~~~fadE5~~~Broad-specificity linear acyl-CoA dehydrogenase FadE5~~~COG1960
MSHYRSNVRDQVFNLFEVLGVDKALGHGEFSDVDVDTARDMLAEVSRLAEGPVAESFVEGDRNPPVFDPKTHSVMLPESF
KKSVNAMLEAGWDKVGIDEALGGMPMPKAVVWALHEHILGANPAVWMYAGGAGFAQILYHLGTEEQKKWAVLAAERGWGS
TMVLTEPDAGSDVGAARTKAVQQADGSWHIDGVKRFITSGDSGDLFENIFHLVLARPEGAGPGTKGLSLYFVPKFLFDVE
TGEPGERNGVFVTNVEHKMGLKVSATCELAFGQHGVPAKGWLVGEVHNGIAQMFEVIEQARMMVGTKAIATLSTGYLNAL
QYAKSRVQGADLTQMTDKTAPRVTITHHPDVRRSLMTQKAYAEGLRALYLYTATFQDAAVAEVVHGVDAKLAVKVNDLML
PVVKGVGSEQAYAKLTESLQTLGGSGFLQDYPIEQYIRDAKIDSLYEGTTAIQAQDFFFRKIVRDKGVALAHVSGQIQEF
VDSGAGNGRLKTERALLAKALTDVQGMAAALTGYLMAAQQDVTSLYKVGLGSVRFLMSVGDLIIGWLLQRQAAVAVAALD
AGATGDERSFYEGKVAVASFFAKNFLPLLTSTREVIETLDNDIMELDEAAF
>O32176 1.3.99.-~~~fadE~~~Probable acyl-CoA dehydrogenase~~~COG1960
MAKKAADVQKGGGFLIEDVTYDQMYTPEDFTDEHKMIAKTTEDYIEQDVLPHIDDIENHQFEHSVRLLKKAGELGLLGAD
VPEEYGGLGLDKISSALITEKFSRAGSFSLSYGAHVGIGSLPIVFFGSEEQKKKYLPGLASGEKIAAYALTEPGSGSDAL
GAKTTAVLNEAGTHYVLTGEKQWITNSAFADVFVVYAKVDGDKFSAFIVEKEFPGVSTGPEEKKMGIKGSSTRTLILDQA
EVPKENLLGEIGKGHVIAFNILNIGRYKLAVGTIGASKRVIELSAAYANQRRQFKTPIAGFSLTQEKIGTMASRLYAMES
SVYRTVGLFEDNMSQFTAEDLKDGRQIAKSIAEYAIECSLNKVFGSETLDYIVDEGVQIHGGYGFMQEYEVERAYRDSRI
NRIFEGTNEINRLIVPSTFLKKALKGELPLFEKAQSLQEELMMLMPEEPGSGVLEQEKYIVKQAKKIALFAAGLAAQKYG
KAIDREQEILVNVADIVSNVYAMESAVLRTEKAIAAQGAEKAAQKVLYTEIFVQEAFNEIEAHAKESLIAMEEGDSLRMM
LSALRKLTRVTPKNVIQKKREAAAGIFEAEKYTV
>Q47146 1.3.8.7~~~fadE~~~Acyl-coenzyme A dehydrogenase~~~COG1960
MMILSILATVVLLGALFYHRVSLFISSLILLAWTAALGVAGLWSAWVLVPLAIILVPFNFAPMRKSMISAPVFRGFRKVM
PPMSRTEKEAIDAGTTWWEGDLFQGKPDWKKLHNYPQPRLTAEEQAFLDGPVEEACRMANDFQITHELADLPPELWAYLK
EHRFFAMIIKKEYGGLEFSAYAQSRVLQKLSGVSGILAITVGVPNSLGPGELLQHYGTDEQKDHYLPRLARGQEIPCFAL
TSPEAGSDAGAIPDTGIVCMGEWQGQQVLGMRLTWNKRYITLAPIATVLGLAFKLSDPEKLLGGAEDLGITCALIPTTTP
GVEIGRRHFPLNVPFQNGPTRGKDVFVPIDYIIGGPKMAGQGWRMLVECLSVGRGITLPSNSTGGVKSVALATGAYAHIR
RQFKISIGKMEGIEEPLARIAGNAYVMDAAASLITYGIMLGEKPAVLSAIVKYHCTHRGQQSIIDAMDITGGKGIMLGQS
NFLARAYQGAPIAITVEGANILTRSMMIFGQGAIRCHPYVLEEMEAAKNNDVNAFDKLLFKHIGHVGSNKVRSFWLGLTR
GLTSSTPTGDATKRYYQHLNRLSANLALLSDVSMAVLGGSLKRRERISARLGDILSQLYLASAVLKRYDDEGRNEADLPL
VHWGVQDALYQAEQAMDDLLQNFPNRVVAGLLNVVIFPTGRHYLAPSDKLDHKVAKILQVPNATRSRIGRGQYLTPSEHN
PVGLLEEALVDVIAADPIHQRICKELGKNLPFTRLDELAHNALVKGLIDKDEAAILVKAEESRLRSINVDDFDPEELATK
PVKLPEKVRKVEAA
>Q8ZRJ7 1.3.8.7~~~fadE~~~Acyl-coenzyme A dehydrogenase~~~
MMILSIIATVVLLGALFYHRVSLFLSSLILLAWTAALGVAGLWSIWLLVPLAIILVPFNLTPMRKSMISAPVFRGFRKVM
PPMSRTEKEAIDAGTTWWEGDLFQGKPDWKKLHNYPQPQLTAEEQAFLDGPVEEACRMANDFQITHELADLPPELWAYLK
EHRFFAMIIKKEYGGLEFSAYAQSRVLQKLSGVSGILAITVGVPNSLGPGELLQHYGTEEQKNHYLPRLARGQEIPCFAL
TSPEAGSDAGAIPDTGVVCMGEWQGQQVLGMRLTWNKRYITLAPIATVLGLAFKLSDPDRLLGGEEELGITCALIPTSTP
GVEIGRRHFPLNVPFQNGPTRGNDIFVPIDYIIGGPKMAGQGWRMLVECLSVGRGITLPSNSTGGVKSVALATGAYAHIR
RQFKISIGKMEGIEEPLARIAGNAYVMDAAASLITYGIMLGEKPAVLSAIVKYHCTHRGQQSIIDAMDITGGKGIMLGES
NFLARAYQGAPIAITVEGANILTRSMMIFGQGAIRCHPYVLEEMAAAQNNDVNAFDKLLFKHIGHVGSNTVRSFWLGLTR
GLTSHTPTGDATKRYYQHLNRLSANLALLSDVSMAVLGGSLKRRERISARLGDVLSQLYLASAVLKRYDDEGRHEADLPL
VHWGVQDALYRAEQAMDDLLQNFPNRVVAGLLTAMIFPTGRHYLAPSDKLDHAVAKILQVPNATRSRIGRGQYLTPAEHN
PVGLLEEALRDVIAADPIHQRICKELGKNLPFTRLDELARNALAKGLIDKDEAAILAKAEESRLRSINVDDFEPEALATK
PVKLPEKVRKVEAA
>P45866 1.-.-.-~~~fadF~~~Probable iron-sulfur-binding oxidoreductase FadF~~~COG0247
MSGFLIANALLFLIVTAYAVYLFVYLVKTRLAYIKLGQKEQFDQRFKERLHAIWVNVFGQKKLLKDKKSGIIHVMFFYGF
ILVQFGAIDFIIKGLAPGRNLSLGPVYPAFTFFQEIVTFLILIAVGWAFYRRYIEKLVRLKRGFKAGLVLIFIGGLMLTV
LLGNGMNLIWHEHGLSWSEPIASGIAFMLSGVGKTGAAVIFYIAWWIHLLFLLSFLVYVPQSKHAHLIAGPANVFFNRME
SAGKLEKIDFTDETKESYGAGKIEDFRQSQLLDLYACVECGRCTNMCPATGTGKMLSPMDLILRLRDHLTEKGAAVTSRS
PWVPAAAFRHTRGNQLAAASAGSGSQEAAAALDYNPSLIGDVITEEEIWACTTCRNCEDQCPVMNEHVDKIIDLRRYLVL
TEGKMDSDAQRAMTSIERQGNPWGLNRKERENWRDEAPDAEIPTVKEMKKEGKEFEYLFWVGSMGSYDNRSQKIAISFAK
LLNHAGVSFAILGNKEKNSGDTPRRLGNEFLFQELAEKNISEFEKNDVKKIVTIDPHAYNLFKNEYPDFGFEGEVYHHTE
VLAELVKNGKLRPQHPLHETITFHDSCYLGRYNEVYDPPREILKAIPGVQLVEMERSRETGMCCGAGGGLMWMEEETGNR
INVARTEQALAVNPSVISSGCPYCLTMLGDGTKAKEAEDQVKTYDVVELLAQSVLGADLKMGEKQ
>O34320 ~~~fadG~~~Uncharacterized protein FadG~~~COG3409
MDEMVLLTQEWLNETYKGKSGYNSIEENGKTGWKTMYALTRALQLELGITQTSDSFGPTTLRKLKELGPISTSTNSKKNI
VKIIQGALYCKGYGPGGLTGTFGQGTKEAIAEMQLHMGLSKTDGVVTPKVFKALLNMDSYILLNGASEKVRSIQQWLNNK
YYNRENFYFMPCDGLYSRDTQKSLVYAIQYEEGLSDSIANGNFGPTTQRLIPVLRIGETDEKNSFIHLFQAALIFNGYNV
PFDGVYSESVRSKVKAFQSFAKLQQSGTADFQTWASLLVSTGDPNRKGVACDSITQITSDRAESLKRAGYKIVGRYLTNA
PGSTLNKKIQPGELETILKSGLNVFPIYQTYGGATNYFNKEQGKKDAFAAYKAAKEYGFKNNTVIYFAVDYDAYGNDLNN
NIIPHFEGINEIMNGFLGSTYKIGIYAPRNVCTIVSKKGLAFASFVSGMSTGFSGNLGYPLPYNWAFDQISTITVGNGSG
MIEIDNDICSGLDNGVNTINIVPSENKKFFDQIDVLYETAEKYAQMQSDLNNGVKKTQLANELVAQYLRKDDYKGWKWVP
TAGQIDPIYREWAVKR
>P80094 1.1.1.306~~~~~~S-(hydroxymethyl)mycothiol dehydrogenase~~~
PQTVRGVIARSKGAPVELTDIVIPDPGPSEVTALIATCAVCHTDLTYREGGINDEFPFLLGHEAAGTVESVGEGVDSVQP
GDYVVLNWRAVCGQCRACKRGRPQYCFSTFNATQKMTLTDGTELTPALGIGAFADKTLVHAGQCTKVDPAADPAVAGLLG
CGVMAGLGAAVNTGAVSRGDSVAVIGCGAVGDAVIAGARLAGANKIIAVDRDAKKLEWATELGATHTVNATETDVVEAVQ
ALTGGFGADVVIDAVGRPETWKQAFYARDLAGTVVLVGVPTPDMRLEMPLLDFFSRGGALKSSWYGDCLPERDFPVLIDL
HLQGRLPLDKFVTERISLDDVEKAFHTMHAGEVLRSVVVW
>O34717 1.3.1.34~~~fadH~~~Probable 2,4-dienoyl-CoA reductase [(2E)-enoyl-CoA-producing]~~~COG1028
MEKKAVIITGGSSGMGKAMAKKQAELGWHVMVTGRNHEALEETKKEIQTFEGQVACFQMDVRSDSAASDMIKEAVKAFGR
LDALINNAAGNFICPAEKLTPNGWKAVIEIVLNGTFFCSQAAARHWIDQKQQGVILNMAATYAWGAGAGVVHSAAAKAGV
LSLTRTLAVEWGSKYGIRTNAIAPGPIERTGGAEKLFESEKAMARTMNSVPLGRLGTPEEIAALAAFLLSDEASYINGDC
ITMDGGQWLNPYPF
>P42593 1.3.1.34~~~fadH~~~2,4-dienoyl-CoA reductase [(2E)-enoyl-CoA-producing]~~~COG0446
MSYPSLFAPLDLGFTTLKNRVLMGSMHTGLEEYPDGAERLAAFYAERARHGVALIVSGGIAPDLTGVGMEGGAMLNDASQ
IPHHRTITEAVHQEGGKIALQILHTGRYSYQPHLVAPSALQAPINRFVPHELSHEEILQLIDNFARCAQLAREAGYDGVE
VMGSEGYLINEFLTLRTNQRSDQWGGDYRNRMRFAVEVVRAVRERVGNDFIIIYRLSMLDLVEDGGTFAETVELAQAIEA
AGATIINTGIGWHEARIPTIATPVPRGAFSWVTRKLKGHVSLPLVTTNRINDPQVADDILSRGDADMVSMARPFLADAEL
LSKAQSGRADEINTCIGCNQACLDQIFVGKVTSCLVNPRACHETKMPILPAVQKKNLAVVGAGPAGLAFAINAAARGHQV
TLFDAHSEIGGQFNIAKQIPGKEEFYETLRYYRRMIEVTGVTLKLNHTVTADQLQAFDETILASGIVPRTPPIDGIDHPK
VLSYLDVLRDKAPVGNKVAIIGCGGIGFDTAMYLSQPGESTSQNIAGFCNEWGIDSSLQQAGGLSPQGMQIPRSPRQIVM
LQRKASKPGQGLGKTTGWIHRTTLLSRGVKMIPGVSYQKIDDDGLHVVINGETQVLAVDNVVICAGQEPNRALAQPLIDS
GKTVHLIGGCDVAMELDARRAIAQGTRLALEI
>P45382 1.1.1.284~~~flhA~~~S-(hydroxymethyl)glutathione dehydrogenase~~~
MRTRAAVALEAGKPLEVMEVNLEGPKAGEVMVEIKATGICHTDEFTLSGADPEGLFPSILGHEGAGVVVEVGPGVTSVKP
GNHVIPLYTPECRQCASCLSGKTNLCTAIRATQGQGLMPDGTSRFSMLDGTPIFHYMGCSTFSNYTVLPEIAVAKVREDA
PFDKICYIGCGVTTGIGAVINTAKVEIGAKAVVFGLGGIGLNVLQGLRLAGADMIIGVDLNDDKKPMAEHFGMTHFINPK
NCENVVQEIVNLTKTPFDQIGGADYSFDCTGNVKVMRDALECTHRGWGQSIIIGVAPAGAEISTRPFQLVTGRVWKGTAF
GGARGRTDVPQIVDWYMDGKIEIDPMITHTLSLDDINKGFDLMHAGESIRSVVLY
>P46154 1.2.1.46~~~fdhA~~~Glutathione-independent formaldehyde dehydrogenase~~~COG1063
MSGNRGVVYLGSGKVEVQKIDYPKMQDPRGKKIEHGVILKVVSTNICGSDQHMVRGRTTAQVGLVLGHEITGEVIEKGRD
VENLQIGDLVSVPFNVACGRCRSCKEMHTGVCLTVNPARAGGAYGYVDMGDWTGGQAEYLLVPYADFNLLKLPDRDKAME
KIRDLTCLSDILPTGYHGAVTAGVGPGSTVYVAGAGPVGLAAAASARLLGAAVVIVGDLNPARLAHAKAQGFEIADLSLD
TPLHEQIAALLGEPEVDCAVDAVGFEARGHGHEGAKHEAPATVLNSLMQVTRVAGKIGIPGLYVTEDPGAVDAAAKIGSL
SIRFGLGWAKSHSFHTGQTPVMKYNRALMQAIMWDRINIAEVVGVQVISLDDAPRGYGEFDAGVPKKFVIDPHKTFSAA
>P76503 2.3.1.16~~~fadI~~~3-ketoacyl-CoA thiolase FadI~~~COG0183
MGQVLPLVTRQGDRIAIVSGLRTPFARQATAFHGIPAVDLGKMVVGELLARSEIPAEVIEQLVFGQVVQMPEAPNIAREI
VLGTGMNVHTDAYSVSRACATSFQAVANVAESLMAGTIRAGIAGGADSSSVLPIGVSKKLARVLVDVNKARTMSQRLKLF
SRLRLRDLMPVPPAVAEYSTGLRMGDTAEQMAKTYGITREQQDALAHRSHQRAAQAWSDGKLKEEVMTAFIPPYKQPLVE
DNNIRGNSSLADYAKLRPAFDRKHGTVTAANSTPLTDGAAAVILMTESRAKELGLVPLGYLRSYAFTAIDVWQDMLLGPA
WSTPLALERAGLTMSDLTLIDMHEAFAAQTLANIQLLGSERFAREALGRAHATGEVDDSKFNVLGGSIAYGHPFAATGAR
MITQTLHELRRRGGGFGLVTACAAGGLGAAMVLEAE
>P77399 ~~~fadJ~~~Fatty acid oxidation complex subunit alpha~~~COG1024
MEMTSAFTLNVRLDNIAVITIDVPGEKMNTLKAEFASQVRAIIKQLRENKELRGVVFVSAKPDNFIAGADINMIGNCKTA
QEAEALARQGQQLMAEIHALPIQVIAAIHGACLGGGLELALACHGRVCTDDPKTVLGLPEVQLGLLPGSGGTQRLPRLIG
VSTALEMILTGKQLRAKQALKLGLVDDVVPHSILLEAAVELAKKERPSSRPLPVRERILAGPLGRALLFKMVGKKTEHKT
QGNYPATERILEVVETGLAQGTSSGYDAEARAFGELAMTPQSQALRSIFFASTDVKKDPGSDAPPAPLNSVGILGGGLMG
GGIAYVTACKAGIPVRIKDINPQGINHALKYSWDQLEGKVRRRHLKASERDKQLALISGTTDYRGFAHRDLIIEAVFENL
ELKQQMVAEVEQNCAAHTIFASNTSSLPIGDIAAHATRPEQVIGLHFFSPVEKMPLVEIIPHAGTSAQTIATTVKLAKKQ
GKTPIVVRDKAGFYVNRILAPYINEAIRMLTQGERVEHIDAALVKFGFPVGPIQLLDEVGIDTGTKIIPVLEAAYGERFS
APANVVSSILNDDRKGRKNGRGFYLYGQKGRKSKKQVDPAIYPLIGTQGQGRISAPQVAERCVMLMLNEAVRCVDEQVIR
SVRDGDIGAVFGIGFPPFLGGPFRYIDSLGAGEVVAIMQRLATQYGSRFTPCERLVEMGARGESFWKTTATDLQ
>P38135 6.2.1.-~~~fadK~~~Medium-chain fatty-acid--CoA ligase~~~COG0318
MHPTGPHLGPDVLFRESNMKVTLTFNEQRRAAYRQQGLWGDASLADYWQQTARAMPDKIAVVDNHGASYTYSALDHAASC
LANWMLAKGIESGDRIAFQLPGWCEFTVIYLACLKIGAVSVPLLPSWREAELVWVLNKCQAKMFFAPTLFKQTRPVDLIL
PLQNQLPQLQQIVGVDKLAPATSSLSLSQIIADNTSLTTAITTHGDELAAVLFTSGTEGLPKGVMLTHNNILASERAYCA
RLNLTWQDVFMMPAPLGHATGFLHGVTAPFLIGARSVLLDIFTPDACLALLEQQRCTCMLGATPFVYDLLNVLEKQPADL
SALRFFLCGGTTIPKKVARECQQRGIKLLSVYGSTESSPHAVVNLDDPLSRFMHTDGYAAAGVEIKVVDDARKTLPPGCE
GEEASRGPNVFMGYFDEPELTARALDEEGWYYSGDLCRMDEAGYIKITGRKKDIIVRGGENISSREVEDILLQHPKIHDA
CVVAMSDERLGERSCAYVVLKAPHHSLSLEEVVAFFSRKRVAKYKYPEHIVVIEKLPRTTSGKIQKFLLRKDIMRRLTQD
VCEEIE
>P10384 ~~~fadL~~~Long-chain fatty acid transport protein~~~COG2067
MSQKTLFTKSALAVAVALISTQAWSAGFQLNEFSSSGLGRAYSGEGAIADDAGNVSRNPALITMFDRPTFSAGAVYIDPD
VNISGTSPSGRSLKADNIAPTAWVPNMHFVAPINDQFGWGASITSNYGLATEFNDTYAGGSVGGTTDLETMNLNLSGAYR
LNNAWSFGLGFNAVYARAKIERFAGDLGQLVAGQIMQSPAGQTQQGQALAATANGIDSNTKIAHLNGNQWGFGWNAGILY
ELDKNNRYALTYRSEVKIDFKGNYSSDLNRAFNNYGLPIPTATGGATQSGYLTLNLPEMWEVSGYNRVDPQWAIHYSLAY
TSWSQFQQLKATSTSGDTLFQKHEGFKDAYRIALGTTYYYDDNWTFRTGIAFDDSPVPAQNRSISIPDQDRFWLSAGTTY
AFNKDASVDVGVSYMHGQSVKINEGPYQFESEGKAWLFGTNFNYAF
>P77712 3.1.2.-~~~fadM~~~Long-chain acyl-CoA thioesterase FadM~~~COG0824
MQTQIKVRGYHLDVYQHVNNARYLEFLEEARWDGLENSDSFQWMTAHNIAFVVVNININYRRPAVLSDLLTITSQLQQLN
GKSGILSQVITLEPEGQVVADALITFVCIDLKTQKALALEGELREKLEQMVK
>O32178 1.1.1.35~~~fadN~~~Probable 3-hydroxyacyl-CoA dehydrogenase~~~COG1024
MHKHIRKAAVLGSGVMGSGIAAHLANIGIPVLLLDIVPNDLTKEEEKKGLTKDSSEVRSRLSRQAMKKLLKQKPAPLTSA
KNTSYITPGNLEDDAEKLKEADWIIEVVVENLEVKKKIFALVDEHRKTGSIVSSNTSGISVQEMAEGRSDDFKAHFLGTH
FFNPARYLKLLEIIPIKETDPDILKFMTAFGENVLGKGVVTAKDTPNFIANRIGTYGLLVTVQEMLKGGYQVGEVDSITG
PLIGRPKSATFRTLDVVGLDTFAHVARNVYDKADGDEKEVFRIPSFMNDMLEKGWIGSKAGQGFYKKEGKTIYELDPVTL
TYGERTKMKSPALEAAKQAKGTKAKMKALIYSDDRAGRLLWNITSQTLLYSAELLGEIADDIHAIDQAMKWGFGWELGPF
EMWDAIGLKQSAEKLEQLGADMPGWIKEMLDKGNETFYIKENGTVFYYDRGEYRAVKENKKRIHLQALKETKGVIAKNSG
ASLIDLGDDVALLEFHSKSNAIGLDIIQMIHKGLEETERNYKGLVIGNQGKNFCVGANLAMILMEVQDDNFLEVDFVIRR
FQETMMKIKYSAKPVVAAPFGMTLGGGTEACLPAARIQAASEAYMGLVESGVGLIPGGGGNKELYINHLRRGHDPMNAAM
KTFETIAMAKVSASAQEAREMNILKETDQISVNQDHLLYDAKQLAASLYDTGWRPPVKEKVKVPGETGYAALLLGAEQMK
LSGYISEHDFKIAKKLAYVIAGGKVPFGTEVDEEYLLEIEREAFLSLSGEAKSQARMQHMLVKGKPLRN
>P94548 ~~~fadR~~~Fatty acid metabolism regulator protein~~~COG1309
MKQKRPKYMQIIDAAVEVIAENGYHQSQVSKIAKQAGVADGTIYLYFKNKEDILISLFKEKMGQFIERMEEDIKEKATAK
EKLALVISKHFSLLAGDHNLAIVTQLELRQSNLELRQKINEILKGYLNILDGILTEGIQSGEIKEGLDVRLARQMIFGTI
DETVTTWVMNDQKYDLVALSNSVLELLVSGIHNK
>P0A8V6 ~~~fadR~~~Fatty acid metabolism regulator protein~~~COG2186
MVIKAQSPAGFAEEYIIESIWNNRFPPGTILPAERELSELIGVTRTTLREVLQRLARDGWLTIQHGKPTKVNNFWETSGL
NILETLARLDHESVPQLIDNLLSVRTNISTIFIRTAFRQHPDKAQEVLATANEVADHADAFAELDYNIFRGLAFASGNPI
YGLILNGMKGLYTRIGRHYFANPEARSLALGFYHKLSALCSEGAHDQVYETVRRYGHESGEIWHRMQKNLPGDLAIQGR
>Q9KQU8 ~~~fadR~~~Fatty acid metabolism regulator protein~~~COG2186
MVIKAKSPAGFAEKYIIESIWNGRFPPGSILPAERELSELIGVTRTTLREVLQRLARDGWLTIQHGKPTKVNQFMETSGL
HILDTLMTLDAENATSIVEDLLAARTNISPIFMRYAFKLNKESAERIMINVIESCEALVNAPSWDAFIAASPYAEKIQQH
VKEDSEKDELKRQEILIAKTFNFYDYMLFQRLAFHSGNQIYGLIFNGLKKLYDRVGSYYFSNPQARELAMEFYRQLLAVC
QSGEREHLPQVIRQYGIASGHIWNQMKMTLPSNFTEDDC
>P25401 ~~~faeE~~~Chaperone protein FaeE~~~
MSKRNAVTTFFTNRVTKALGMTLALMMTCQSAMASLAVDQTRYIFRGDKDALTITVTNNDKERTFGGQAWVDNIVEKDTR
PTFVVTPSFFKVKPNGQQTLRIIMASDHLPKDKESVYWLNLQDIPPALEGSGIAVALRTKLKLFYRPKALLEGRKGAEEG
ISLQSRPDGRTMLVNTTPYIFAIGSLLDGNGKKIATDNGTTQKLLMFMPGDEVQVKGNVVKVDSLNDYGELQTWTINKKK
PAAPEAAKAEKADTAEQK
>P02970 ~~~faeG~~~K88 fimbrial protein AB~~~
MKKTLIALAIAASAASGMAHAWMTGDFNGSVDIGGSITADDYRQKWEWKVGTGLNGFGNVLNDLTNGGTKLTITVTGNKP
ILLGRTKEAFATPVSGGVDGIPQIAFTDYEGASVKLRNTDGETNKGLAYFVLPMKNAEGTKVGSVKVNASYAGVFGKGGV
TSADGELFSLFADGLRAIFYGGLTTTVSGAALTSGSAAAARTELFGSLSRNDILGQIQRVNANITSLVDVAGSYREDMEY
TDGTVVSAAYALGIANGQTIEATFNQAVTTSTQWSAPLNVAITYY
>P14190 ~~~faeG~~~K88 fimbrial protein AC~~~
MKKTLIALAIAASAASGMAHAWMTGDFNGSVDIGGSITADDYRQKWEWKVGTGLNGFGNVLNDLTNGGTKLTITVTGNKP
ILLGRTKEAFATPVTGGVDGIPHIAFTDYEGASVVLRNPDGETNKKGLAYFVLPMKNAEGTKVGSVKVNASYAGVLGRGG
VTSADGELLSLFADGLSSIFYGGLPRGSELSAGSAAAARTKLFGSLSRNDILGQIQRVNANITSLVDVAGSYRENMEYTD
GTVVSAAYALGIANGQTIEATFNQAVTTSTQWSAPLNVAITYY
>P14191 ~~~faeG~~~K88 fimbrial protein AD~~~
MKKTLIALAIAASAASGMAHAWMTGDFNGSVDIGGSITADDYRQKWEWKVGTGLNGFGNVLNDLTNGGTKLTITVTGNKP
ILLGRTKEAFATPVTSGVDGIPHIAFTDYEGASVELRNPDGETEKGLAYFVLPMKNAEGTKVGSVKVNASYAGALGRGGV
TSADGELMSLFAEGSHAIFYGGLPTNVKNSELKGGSAAAARTELFGSLSKNDILGQIQRVNANITSLVNVPGSFNENMAY
TDGSVVSVAYALGIANGQTIEATFNQAVTTSTQWSAPLNVAITYY
>Q9FA38 4.2.1.147~~~fae~~~5,6,7,8-tetrahydromethanopterin hydro-lyase~~~COG1795
MAKITKVQVGEALVGDGNEVAHIDLIIGPRGSPAETAFCNGLVNNKHGFTSLLAVIAPNLPCKPNTLMFNKVTINDARQA
VQMFGPAQHGVAMAVQDAVAEGIIPADEADDLYVLVGVFIHWEAADDAKIQKYNYEATKLSIQRAVNGEPKASVVTEQRK
SATHPFAANA
>O69687 2.1.1.-~~~~~~Probable fatty acid methyltransferase Rv3720~~~COG2230
MTTGRLSMAEILEIFTATGQHPLKFTAYDGSTAGQDDATLGLDLRTPRGATYLATAPGELGLARAYVSGDLQAHGVHPGD
PYELLKTLTERVDFKRPSARVLANVVRSIGVEHILPIAPPPQEARPRWRRMANGLLHSKTRDAEAIHHHYDVSNNFYEWV
LGPSMTYTCAVFPNAEASLEQAQENKYRLIFEKLRLEPGDRLLDVGCGWGGMVRYAARRGVRVIGATLSAEQAKWGQKAV
EDEGLSDLAQVRHSDYRDVAETGFDAVSSIGLTEHIGVKNYPFYFGFLKSKLRTGGLLLNHCITRHDNRSTSFAGGFTDR
YVFPDGELTGSGRITTEIQQVGLEVLHEENFRHHYAMTLRDWCGNLVEHWDDAVAEVGLPTAKVWGLYMAASRVAFERNN
LQLHHVLATKVDPRGDDSLPLRPWWQP
>Q9I2N0 3.1.4.1~~~fan1~~~Fanconi-associated nuclease 1 homolog~~~
MHEQYQAPLPVNSPALPEPFYYLHNFRAVLAWIGERYADLLDDQERAFIAAFAELPEASQALLVRMVMRKGTLFREGKLA
YAEIGDTRAAVQPLLALGWVDAQPTLELAQLFGLLKKDELSQLFRDHLGRANLRKDALLERLQPLFPEARRLAEWQADFA
EPVYELRCMALCDRLRLMYFGNLWQDWSEFVLADLGIYRYESVEFSADSRGFRLRADVDAYLHLFDCRQRFDLGEPLEEL
LAGLPGEPYANPWLEGRRVKLLFQFAQHCEKQRDFDLAQRLYRQSSHPGARLRAIRSLERGERFAEAHALAREASCAPES
DAERQGLARLLPRLQGKLGLPRQARAAAPEIDRLDLCLAFPSEPCSVEWAVREHLEEPGCAVHYVENGLINSLFGLLCWE
AIFAAIPGAFFHPFHSAPADLHSADFRQRRAALFEACLGRLEDGSYRDAIRCRYRDKFGLQSPFVYWELLGEELLEQALD
CLPAAHLRAWFERLLEDIPGNRAGLPDLIQFWPAQRRYRMVEVKGPGDRLQDNQLRWLQFCREREMPVAVCYVRWHVDD
>A1C3L3 ~~~fap1~~~Fap1 adhesin~~~
MGKYKRAGETSRKTRVKMHKSGKNWVRTLISQIGLMHFLGGSISEKKINVDVYEQKNISASTILKGAVALGALTGATVVS
GNVFADETVLAKETTLTTTDANEVKLSSENFDSEKAEEKISLSQSESASESVSESISESVSESVSTSESVSESVSESVSE
SISESVSESISESISESVSESTSTSIVLSESGAASGNKATSKGTEEKQDSVRENLDKMISEAEVLNDMAARKLITLDAEQ
QLELMKSLVATQSQLEATKNLIGDPNATVADLQIAYTTLGNNTQALGNELIKLNPNGQIYAVLNNTEASRAATLRSTTTG
TKTTFTISDFSNGGTQYYWAGGNANNLKNPISSISAVYDSATGKISWTVEYDPTTILKSPALKTLKTYTGIYIDTSSDSK
LSTPTNVLIDGAATNPVTNFYGNGSKGIEYVSKGTTKGVTKHTITFDTAFSGRANDLADLKIKMLAATTLSDPHFYEDGS
KGNYGRYNGQTAPYVIANDSGTAIGGYQVSGVNADSIPSDTTSQSESTSKSESTSKSISESVIESISESVIGSVSESVSE
SVSESVSESITESVSESVSESISESVSESVSESISESVSESVSESISESVSESVSESISESVSESVSESISESISESVSE
SVSESISESVSESVSESISESVSESVSESISESVSESVSESISESVSESVSESISESVSESVSESISESVSESVSESISE
SVSESVSESISESVSESVSESISESVSESVSESISESVSESVSESISESVSESVSESVSESVSESVSESISESVSESVSE
SVSESVSESVSESVSESISESVSESVSESISESVSESVSESISESVSESVSESISESVSESVSESISESVSESVSESISE
SVSESVSESISESVSESVSESISESVSESVSESISESVSESVSESISESVSESVSESVSESVSESVSESVSESVSESISE
SVSESVSESISESVSESVSESISESVSESVSESISESVSESVSESISESVSESVSESVSESVSESVSESVSESVSESISE
SVSESVSESVSESISESVSESVSESISESVSESVSESISESVSESVSESVSESVSESVSESISESVSESVSESISESVSE
SVSESISESVSESVSESISESVSESVSESISESVSESVSESISESVSESVSESVSESVSESVSESISESVSESVSESISE
SVSESVSESVSESISESVSESVSESISESVSESVSESISESVSESVSESISESVSESVSESVSESVSESVSESVSESVSE
SVSESVSESISESVSESVSESVSESISESVSESVSESISESVSESVSESISESVSESVSESISESVSESVSESISESVSE
SVSESISESVSESVSESVSESVSESVSESISESVSESVSESISESVSESVSESISESVSESVSESISESVSESVSESVSE
SISESVSESVSESISESVSESVSESISESVSESVSESVSESVSESVSESVSESVSESVSESISESVSESVSESISESVSE
SVSESISESVSESVSESISESVSESVSESISESVSESVSESISESVSESVSESISESVSESVSESVSESVSESVSESISE
SVSESVSESISESVSESVSESVSESVSESVSESVSESVSESVSESISESVSESVSESVSESVSESVSESISESVSESVSE
SISESVSESVSESISESVSESVSESISESVSESVSESISESVSESVSESISESVSESVSESISESVSESVSESVSESVSE
SVSESVSESVSESISESVSESVSESISESVSESVSESISESVSESVSESVSESVSESVSESISESVSESVSESISESVSE
SVSESISESVSESVSESISESVSESVSESVSESISESVSESVSESISESVSESVSESISESVSESVSESISESVSESVSE
SISESVSESVSESISESVSESVSESVSESVSESVSESISESVSESISESVSESVSESISESVSESVSESISESVSESVSE
SISESVSESVSESISESVSESVSESISESVSESVSESISESVSESVSESISESVSESVSESVSESISESVSESVSESISE
SVSESVSESISESVSESVSESVSESVSESVSESVSESVSESVSESISESVSESVSESISESVSESVSESISESVSESVSE
SISESVSESVSESISESVSESVSESISESVSESVSESISESVSESVSESVSESVSESVSESISESVSESVSESVSESISE
SVSESVSESISESVSESVSESVSESVSESVSESVSESVSESVSESISESVSESVSESVSESVSESVSESISESVSESVSE
SISESVSESVSESISESVSESVSESISESVSESVSESISESVSESVSESISESVSESVSESISESVSESVSESISESVSE
SVSESVSESISESVSESVSESISESVSESVSESISESVSESVSESISESVSESVSESVSESVSESVSESVSESVSESISE
SVSESVSESISESVSESVSESVSESVSESVSESISESVSESISESVSESVSESISESVSESVSESISERTLPNTGENVSS
SLGLVGLSGLLFGALLGRKKRKSEDAE
>O31724 3.5.1.-~~~ylmB~~~N-formyl-4-amino-5-aminomethyl-2-methylpyrimidine deformylase~~~COG0624
MDQQIYSLQKKVEEHKEELIQLAKTLISYQTPAPPARNTEGIQSWIAGYLNELGFSIDKWDVYPGDPNVVGKLKGTDSAD
YYSLIINGHVDVAEVKEDEEWKHDPFHPIEKNGLLIGRGASDMKGGMACVLFAVKLIREASIELPGDLILQSVIGEEVGE
AGTLECCKRGYHADFAIVADTSDMHIQGQGGVITGWIEIKSSQTFHDGTRRNMIHAGGGTFGASAIEKMAKIIAGLGELE
RHWSIMKSYPGFKPGTNTINPAVIEGGRHAAFIADECRLWITVHFYPNETHDQVAAEIEDYVNRLSDSDIWLRENRPVFK
WGGSSMIEDRGEIFPALEVDPGHPGVLALTASHQKVKRECPIIDVSQSVTDGGWLYDAGIPCVIYGPGDLHNAHSVNEKV
SIEQLVEYTKIILDFIISWCSRKKEQ
>Q9K9G9 3.5.1.-~~~ylmB~~~N-formyl-4-amino-5-aminomethyl-2-methylpyrimidine deformylase~~~COG0624
MFKLSVIEELLAHVDKNKEELLALVQTLVAYPTPSPPARNTADAQQYIRTYCEKLGCNVDMWDVYPNDPNVVAVLKGTYS
ESYRSLILNGHIDVAAVDESEEWKTPPFEATVNQGVIRGRGVADMKGGLAACLFAMKTLHAFNIQLPGDLIFQSVVGEEV
GEAGTKSCCERGYTADLAIVSDTSHCEIQGQGGVITGWITVKSPVTFHDGTRRNLIHAGGGEFGASAIEKMMKLIQGLQE
LERHWAVTKSSPGFPPGMNTINPAFIEGGRHPAFIADECKLWITIHYYPHESYEEIVREVEEHLLHVAKADPWMREHPPS
FSWGGTSMIEDKGEIFPAFQIDEQSDAVQLLKKIHYHLTGEEVKTSMSQTVTDGGWLAEAGIPTLLFGPGKLEDAHSVNE
ELEIAELVQYTKTLLTFIYEWCHLRKA
>O34835 ~~~fapR~~~Transcription factor FapR~~~COG1349
MRRNKRERQELLQQTIQATPFITDEELAGKFGVSIQTIRLDRLELSIPELRERIKNVAEKTLEDEVKSLSLDEVIGEIID
LELDDQAISILEIKQEHVFSRNQIARGHHLFAQANSLAVAVIDDELALTASADIRFTRQVKQGERVVAKAKVTAVEKEKG
RTVVEVNSYVGEEIVFSGRFDMYRSKHS
>Q9RQ30 ~~~farA~~~Fatty acid resistance protein FarA~~~
MKSGNSEPNLMETHTDETKLQNTQVKRKRRLTALTLLFALSAAAAGSAFFLWWQHEEETEDAYVAGRVVQVTPQKGGTVR
KVLHDDTDAVKKGDVLAVLDDDNDVLAYERAKNELVQAVRQNRRQNAATSQAGAQVALRRADLARAQDDLRRRSALAESG
AVSAEELAHARTAVSQAQAAVKAALAEESSARAALGGDVSLREQPEVQTAIGRLKDAWLNLRRTQVRAPADGQVAKRSVQ
VGQQVAAGAPLMAVVPLSDVWVDANFKETQLRHMKIGQPAELVSDLYGKQIVYRGRVAGFSAGTGSAFSLIPAQNATGNW
IKVVQRVPVRIVLNREDVDRHPLRIGLSMTVKVDTSAAGAPVSKTPGAALPEMESTDWSEVDRTVDEILGQSAP
>Q9RQ29 ~~~farB~~~Fatty acid resistance protein FarB~~~
MDYPPLKGAALAWVTLSLGLAVFMEVLDTTIANVAVPVIAGNLGAATTQGTWVITSFSVANAVSVPLTGFLAKRIGEVKL
FTAAAAGFVIASWLCGIAPNLQSLVVFRILQGFIAGPLIPLSQSLLMASYPPAKRMLALALWAMTVVVAPVLGPILGGWI
SGNWHWGWIFFINIPIGIISAWITWKHLKHRETATVRTPTDYVGLTLMMVGIGALQMMLDRGKELDWFASGEIITLGITA
LVCLSYFIVWELGEKYPIVDLSLFKDRNFTVGAIATSLGFMVYMGTLTLLPLVLQTNLGYTSAWAGLAAAPVGILPVFLS
PLIGRFGNKIDMRLLVTASFLTFAFTFYWRTDFYADMDIGNVIWPQFWQGVGVAMFFLPLTTITLSHMKGGQIAAAGSLS
NFLRVLMGGVGVSVVSTLWERREALHHTRFAEHITPYSATLHETAAHLSQQGISDGQTLGIINNTITQQGFIIGSNEIFL
AGSILFIVLIPIVWLAKPPFHSGGGGH
>P0DPR8 ~~~farR~~~HTH-type transcriptional regulator FarR~~~
MPTQSKHASINIGLIQAREALMTQFRPILNQANITDQQWRIIRLLAENGTLDFQDLANQACILRPSLTGILTRLEKAGLV
VRLKPSNDQRRVYLKLTSEGEKLYEEIGEEVDERYDAIEEVLGREKMLLLKDLLAELAKIEDALNS
>P46373 1.14.-.-~~~fas1~~~Cytochrome P450 FAS1~~~COG2124
MAGTADLPLEMRRNGLNPTEELAQVRDRDGVIPVGELYGAPAFLVCRYEDVRRIFADSNRFSNAHTPMFAIPSGGDVIED
ELAAMRAGNLIGLDPPDHTRLRHILAAEFSVHRLSRLQPRIAEIVDSALDGLEQAGQPADLMDRYALPVSLLVLCELLGV
PYADRDELRDRTARLLDLSASAEQRAVAQREDRRYMATLVTRAQEQPGDDLLGILARKIGDNLSTDELISIISLIMLGGH
ETTASMIGLSVLALLHHPEQAAMMIEDPNCVNSGIEELLRWLSVAHSQPPRMAVTEVQIAGVTIPAGSFVIPSLLAANRD
SNLTDRPDDLDITRGVAGHLAFGHGVHFCLGHSLARMTLRTAVPAVLRRFPDLALSPSHDVRLRSASIVLGLEELQLTW
>P46374 ~~~fas2~~~Ferredoxin fas2~~~COG3959
MKVVVNERRCFGSGQCVLVAPEVFEQSNDGTVTLLVDKPSPDNHSLVRAAARSCPATAIRFEENAMRQEPTEFSYDDLPA
LISRMRGDERHSFSSSSTMDVLWVLYDEIPNVSPESPDDDDRDRFLLSKGHGPMAYYAVLAAKGFLRPELLDTWATKNSP
LGFAPDRTKISGVEMSGGSLGHGLPLAVGVAMGLRIQNRHAPRVFVLIGDGEFDEGSNHEAMAFAGRARLNQLTVIVLDN
GTASMGWPHGIDKRFDGEGWDTININGADHEEIAAALNRDHNDRPLAVVATVTRQSARSSIQQR
>P46375 ~~~fas3~~~Uncharacterized 33.6 kDa protein in fasciation locus~~~COG3958
MNSADTQEPKSFNHTDMWTAFGTTMSGALETDPRAVVVLADIGAHLFKAAAIADPNRVINVGIREQLMMGVAGGLAMCGM
RPVVHTVAAFLVERPLEQIKLNFAQQDVGAVLVSWGASYDLSEFAFSHFTPGDITVIDSMPNWTVHVPGHPQEAADLLLE
SLPGDGRVYLRLSSQVNRYPHAVRGTSFTPIKYGTRGVVLAVGPCLDAVLSATSMLDVTILYAATIRPFDATGLCAAVQA
VNRPNVVLVEPYLAGTSAHQVSSSLVSHPHRLLSLGVRREMEDRHYGTPDDHDHIHGLDARSLSNSINSFLG
>P46377 ~~~fas5~~~Uncharacterized oxidoreductase ORF5 in fasciation locus~~~
MSGIWHTDDVHLTSAGADFGNCIHAKPPVVVVPRTVADVQEALRYTAARNLSLAVRGSGHSTYGQCQADGGVVLDMKRFN
TVHDVRSGQATIDAGVRWSDVVAATLSRQQTPPVLTDYLGTTVGGTLSVGGFGGSSHGFGLQTDNVDSLAVVTGSGDFRE
CSAVSNSELFDAVRGGLGQFGVIVNATIRLTAAHESVRQYKLQYSNLGVFLGDQLRAMSNRLFDHVQGRIRVDADGHLRY
RLDLAKYFTPPRRPDDDALLSSLQYDSCAEYNSDVDYGDFINRMADQELDLRHTGEWFYPHPWASLLIPADKIEQFIETT
SSSLTDDLGNSGLIMVYPIPTTPITAPFIPIPHCDTFFMLAVLRTASPGAEARMIASNRLLYEQARDVGGVAYAVNAVPM
SPGDWCTHFGSRWQAIARAKRRFDPYRILAPGYRMSFD
>P11461 ~~~fatA~~~Ferric-anguibactin receptor FatA~~~COG4774
MTHQVATCHKKQSFSGKPTLSRIALLVALQISASALPISITHAEEQADESITVYGQANEAYAAGKISKASSIGMLGDKDF
LDTPFNAIGYTDKHIQDQHAQDISDVISASDPSVFTSGETGLNKESFKIRGFSSDIGDVMFNGLYGIAPYYRSSPEMYQR
IDVLKGPASLLNGMPPNGSVGGSINLVTKRAQEAPITSFTGTYMSDSQFGGHIDIGRRFGENEQFGVRFNGVFRDGDASV
DGQSRKAQLASLSLDWRNDIALIEADLYFSTERVDGPNRGLSIASGVDVPSPPSSDTLLSPSWAYNDSEDKGMMIRAELD
LSNSVTAYGAVGASRTDFDSNVPQRVKIIDDSGTLEVSLGSVKLESKRTSGEVGIRSSFDTGPIEHYLVLNSTYFREDKN
DSPTGNNNPGSWNPNIYNPVWGPEDSTYDNYYELPVDSTQISFGVADTLSLANGKYQVTLGLRHQSIDYESGVTWNGNAF
PTTKLKESTYTPAIVALYKVSDSVSLYGNYTEGLTNGKTAGSGAANVGEAFEPQKTKQTEAGLKLDMNDFAHTFSLFEIK
KPNGYQDPDTNIYSFGGEQRNRGIEWGFYGTVLEDYTLTGGIAYTDAEITKATDVTTEGKQATKLPDLQAKLALEWNLPV
MRQLTLIGQANYMSEQYIDAQNTQSLSAQTIFDLGARYNSTIANQSVIWRLAVNNVTDEAYWTTTHYASLALGAPRTVML
SATADF
>P11460 ~~~fatB~~~Ferric-anguibactin-binding protein FatB~~~COG4607
MFKSTLNIAVAIVCSSLVTLTGCEPKVAQSQVIQPLETPIVIEHNLGQTVISNRPQRVAALDMNEVDFLDQLNVPIAGMV
KDFVPHFLEKYKNTPDISDLGAIVQPNMEKIYALKPDLVLMTPLHANQYEELSKLAPTVHFDIDFRNSHGHHVDIIKQHV
IDLGEIFNKQTLAQKKVAEIDAKVDEVQALTAERSEKALVVMHNNGSFSSFGIESRYGFVFDVLGVKPASTEIAASLHGQ
PISSEFINQANPDILYIIDRTAVMEGKPVIDAEHLANPLLRQTKAWKNGKVIFVDADAWYITSASITSLKIVIDDIIKGY
QS
>Q81XB2 ~~~fatC~~~Petrobactin import system permease protein FatC~~~
MITLDYRNKENVEVDSSLHNESRSASAFRSKKEARRYWIVLITLIALGLLSSYGLLVYNNPVPIDSPSFIPVVKRRIVAI
VAMIIAAVCHSLSTVAFQSITNNKIITPSLLGFESLYSAIQTSTVFFFGASALINFNGIGSFLFQVVVMVFMSLILYGWL
LSGKYGNLQLMLLVGIIIGTGLNSVSTFMRKLLAPSEFDILQARLFGSVNHADPAYFPIVIPMIIIVAVLIFAHSKNLNV
LSLGKDVATSFGVKYQPSVIYTLVLVAILMSISTALIGPLTFYGFLVATLSYQAAATYDHRYIFPMAFAIGFLIMTSAYF
LMYHVFHAQGVVSVIIELFGGIIFLTIVLRKRAL
>P37737 ~~~fatC~~~Ferric-anguibactin transport system permease protein FatC~~~COG4605
MTSLNLNFRVSVVLVILLSIAFIFINSGFDLEYIIPRRLIKLSAIIIGGSCVAISAVIFQALARNRILTPSIMGYESIYL
VWQALLLLFVGTSGSAVLGVVGNFVVSAVLILLYSFVIQFWVLKRFQHDMHQVLLIGFVLTMVLTTVAQFIQIRISPGEF
SIFQGLSYTSFERAKPSTLLFAGTVLSILALFANKWVSELDVIGLGRDQAMSLGLNDAHYIPKYFSVIAILVAISTSLIG
PTAFMGVFIANIAYSITGSPQYRHTLPVACTIAIVMFLTAQLMVEHFFNYKTTVSILVNVLCGGYFLIITMRARSQL
>Q81XB1 ~~~fatD~~~Petrobactin import system permease protein FatD~~~
MISRVENISQPQFYNHNKIWTKPFIIAIIVVIILGIISLFTGVYDIRGQEDGMEMFFITRVPRTVALMLTGAAMAMAGLV
MQLITQNRFVEPTTTGTIEWSSLGLLFVYLLFPAPTLVQRMTGAIIFSFIGTMIFFLFLRRVKLRSSLIVPIIGLMLGAV
ISAVSTFLGLLFQMTQSIETWFVGSFANIQVGRYEYLWLIVIVTLLIFMYANRLTLAGLGEDVATSLGVNYNRIVLFGTA
LISVAVGIVAAVIGNLPFLGLIVPNIVSMFRGDDLRSNLPWVCVIGMGTITACDIISRTIIKPFELPVSLILASVGAVVF
ITILLRKRKPRRLR
>P37738 ~~~fatD~~~Ferric-anguibactin transport system permease protein FatD~~~COG4606
MTFRMILAFFTLCATSLFFGANQIEWSLLPTFNEKAWLPIIASRLPRLVALILTGSGLAMCGVILQHIVRNRFVEPGTTG
SLDAAKLGILVSIVMLPSSDKLERMFFAVLFCFAAGLVYIAIIRKVKFSNTALVPVIGLMFGSVLSALAEFYAYQNNILQ
SMSGWLMGDFSKVVQEHYEIIFLILPITLLTYLYAHRFTVMGMGEDIASNLGISYAMTAALGLILVSITVAVTVVTVGAI
HFVGLVIPNLVALKYGDHLKNTLPIVALGGASLLIFCDVISRVVLFPFEVPVGLTASAVGGVMFLAFLLKGAKA
>Q81XB3 7.2.2.-~~~fatE~~~Petrobactin import ATP-binding protein FatE~~~
MIKIDNVKKFYTDKVKIGPLDIEIPKAGFTSLIGPNGAGKSTTLLMIGRLLDMDEGQIQVANMDVSESKSKDLAKVLTIL
RQENHFVTRLTVRQLVGFGRFPYSKGRLTKEDEVIISKYIDFLDLTNLENRYLDELSGGQRQRAYVAMVLCQETEYVLLD
EPLNNLDVARSVQMMEHLRRAANEFGRTILTVMHDINFAAKYSDKICAMKDGQIAAFGTVEEVMDSTLLTDIFETRIEII
KGPYGPIAVY
>P9WQJ3 7.6.2.-~~~~~~Fatty acid ABC transporter ATP-binding/permease protein~~~COG1132
MTAPPGARPRAASPPPNMRSRDFWGSAARLVKRLAPQRRLSIAVITLGIAGTTIGVIVPRILGHATDLLFNGVIGRGLPG
GITKAQAVASARARGDNTFADLLSGMNVVPGQGVDFAAVERTLALALALYLAAALMIWAQARLLNLTVQKTMVRLRTDVE
DKVHRLPLSYFDGQQRGELLSRVTNDIDNLQSSLSMTISQLVTSILTMVAVLAMMVSISGLLALITLLTVPLSLLVTRAI
TRRSQPLFVAHWTSTGRLNAHLEETYSGFTVVKTFGHQAAARERFHELNDDVYQAGFGAQFLSGLVQPATAFIGNLGYVA
VAVAGGLQVATGQITLGSIQAFIQYIRQFNMPLSQLAGMYNALQSGVASAERVFDVLDEPEESPEPEPELPNLTGRVEFE
HVNFAYLPGTPVIRDLSLVAEPGSTVAIVGPTGAGKTTLVNLLMRFYEIGSGRILIDGVDIASVSRQSLRSRIGMVLQDT
WLYDGTIAENIAYGRPEATTDEIVEAARAAHVDRFVNTLPAGYQTRVSGDGGSISVGEKQLITIARAFLARPQLLILDEA
TSSVDTRTELLIQRAMRELRRDRTSFIIAHRLSTIRDADHILVVQTGQIVERGNHAELLARRGVYYQMTRA
>O70022 ~~~fbe~~~Fibrinogen-binding protein~~~
MINKKNNLLTKKKPIANKSNKYAIRKFTVGTASIVIGATLLFGLGHNEAKAEENSVQDVKDSNTDDELSDSNDQSSDEEK
NDVINNNQSINTDDNNQIIKKEETNNYDGIEKRSEDRTESTTNVDENEATFLQKTPQDNTHLTEEEVKESSSVESSNSSI
DTAQQPSHTTINREESVQTSDNVEDSHVSDFANSKIKESNTESGKEENTIEQPNKVKEDSTTSQPSGYTNIDEKISNQDE
LLNLPINEYENKARPLSTTSAQPSIKRVTVNQLAAEQGSNVNHLIKVTDQSITEGYDDSEGVIKAHDAENLIYDVTFEVD
DKVKSGDTMTVDIDKNTVPSDLTDSFTIPKIKDNSGEIIATGTYDNKNKQITYTFTDYVDKYENIKAHLKLTSYIDKSKV
PNNNTKLDVEYKTALSSVNKTITVEYQRPNENRTANLQSMFTNIDTKNHTVEQTIYINPLRYSAKETNVNISGNGDEGST
IIDDSTIIKVYKVGDNQNLPDSNRIYDYSEYEDVTNDDYAQLGNNNDVNINFGNIDSPYIIKVISKYDPNKDDYTTIQQT
VTMQTTINEYTGEFRTASYDNTIAFSTSSGQGQGDLPPEKTYKIGDYVWEDVDKDGIQNTNDNEKPLSNVLVTLTYPDGT
SKSVRTDEDGKYQFDGLKNGLTYKITFETPEGYTPTLKHSGTNPALDSEGNSVWVTINGQDDMTIDSGFYQTPKYSLGNY
VWYDTNKDGIQGDDEKGISGVKVTLKDENGNIISTTTTDENGKYQFDNLNSGNYIVHFDKPSGMTQTTTDSGDDDEQDAD
GEEVHVTITDHDDFSIDNGYYDDESDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD
SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD
SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSVSDSDSDSDSDSGSDSDSDSDSDSDNDSD
LGNSSDKSTKDKLPDTGANEDYGSKGTLLGTLFAGLGALLLGKRRKNRKNKN
>A0QTG2 2.7.8.28~~~fbiA~~~Phosphoenolpyruvate transferase~~~COG0391
MKITVLVGGVGGARFLLGVQNLLGLGSFADGPSKHELTAVVNIGDDAWMHGVRICPDLDTCMYTLGGGIDPDRGWGHRNE
TWNAKEELAAYGVQPDWFGLGDRDLATHLVRSQMLRAGYPLSQVTEALCKRWQPGARLLPASDERSETHVVITDPTDGER
RAIHFQEWWVRYRAKVPTHSFAYVGADQATAGPGVVEAIGDADIVLLAPSNPVVSIGPILQIPGIRGALRSTSAPVIGYS
PIIAGKPLRGMADECLKVIGVESTSQAVGEFFGARAGTGLLDGWLVHEGDHAQIEGVKVKAVPLLMTDPEATAAMVRAGL
DLAGVSL
>P9WP81 2.7.8.28~~~fbiA~~~Phosphoenolpyruvate transferase~~~COG0391
MKVTVLAGGVGGARFLLGVQQLLGLGQFAANSAHSDADHQLSAVVNVGDDAWIHGLRVCPDLDTCMYTLGGGVDPQRGWG
QRDETWHAMQELVRYGVQPDWFELGDRDLATHLVRTQMLQAGYPLSQITEALCDRWQPGARLLPATDDRCETHVVITDPV
DESRKAIHFQEWWVRYRAQVPTHSFAFVGAEKSSAATEAIAALADADIIMLAPSNPVVSIGAILAVPGIRAALREATAPI
VGYSPIIGEKPLRGMADTCLSVIGVDSTAAAVGRHYGARCATGILDCWLVHDGDHAEIDGVTVRSVPLLMTDPNATAEMV
RAGCDLAGVVA
>A0QTG1 ~~~fbiB~~~Bifunctional F420 biosynthesis protein FbiB~~~COG0778
MSAAANAEHGAADRVEILPVPGLPEFRPGDDLVGSLAEAAPWLRDGDVLVVTSKVVSKCEGRIVAAPSDPEERDTLRRKL
IDDEAVRVLARKGRTLITENAIGLVQAAAGVDGSNVGSTELALLPVDPDRSAATLREGLRERLGVTVGVVITDTMGRAWR
TGQTDFAIGASGLTVLQGYAGSRDRHGNELVVTEVAVADEIAAAADLVKGKLTAIPVAVVRGLRLPDDGSTAHRLVRAGE
DDLFWLGTAEAIELGRRQAQLLRRSVRRFSAEPVPHDAIEAAVGEALTAPAPHHTRPVRFVWVQDSETRTRLLDRMKEQW
RADLTADGLDADAVDRRVARGQILYDAPELVIPFLVPDGAHSYPDDARTAAEHTMFTVAVGAAVQGLLVALAVRDIGSCW
IGSTIFAADLVRAELELPDDWEPLGAIAIGYPEQTPQPLGPRDPVPTDELLVRK
>P9WP79 ~~~fbiB~~~Bifunctional F420 biosynthesis protein FbiB~~~COG0778
MTGPEHGSASTIEILPVIGLPEFRPGDDLSAAVAAAAPWLRDGDVVVVTSKVVSKCEGRLVPAPEDPEQRDRLRRKLIED
EAVRVLARKDRTLITENRLGLVQAAAGVDGSNVGRSELALLPVDPDASAATLRAGLRERLGVTVAVVITDTMGRAWRNGQ
TDAAVGAAGLAVLRNYAGVRDPYGNELVVTEVAVADEIAAAADLVKGKLTATPVAVVRGFGVSDDGSTARQLLRPGANDL
FWLGTAEALELGRQQAQLLRRSVRRFSTDPVPGDLVEAAVAEALTAPAPHHTRPTRFVWLQTPAIRARLLDRMKDKWRSD
LTSDGLPADAIERRVARGQILYDAPEVVIPMLVPDGAHSYPDAARTDAEHTMFTVAVGAAVQALLVALAVRGLGSCWIGS
TIFAADLVRDELDLPVDWEPLGAIAIGYADEPSGLRDPVPAADLLILK
>P9WP77 ~~~fbiC~~~FO synthase~~~COG1060
MPQPVGRKSTALPSPVVPPQANASALRRVLRRARDGVTLNVDEAAIAMTARGDELADLCASAARVRDAGLVSAGRHGPSG
RLAISYSRKVFIPVTRLCRDNCHYCTFVTVPGKLRAQGSSTYMEPDEILDVARRGAEFGCKEALFTLGDRPEARWRQARE
WLGERGYDSTLSYVRAMAIRVLEQTGLLPHLNPGVMSWSEMSRLKPVAPSMGMMLETTSRRLFETKGLAHYGSPDKDPAV
RLRVLTDAGRLSIPFTTGLLVGIGETLSERADTLHAIRKSHKEFGHIQEVIVQNFRAKEHTAMAAFPDAGIEDYLATVAV
ARLVLGPGMRIQAPPNLVSGDECRALVGAGVDDWGGVSPLTPDHVNPERPWPALDELAAVTAEAGYDMVQRLTAQPKYVQ
AGAAWIDPRVRGHVVALADPATGLARDVNPVGMPWQEPDDVASWGRVDLGAAIDTQGRNTAVRSDLASAFGDWESIREQV
HELAVRAPERIDTDVLAALRSAERAPAGCTDGEYLALATADGPALEAVAALADSLRRDVVGDEVTFVVNRNINFTNICYT
GCRFCAFAQRKGDADAYSLSVGEVADRAWEAHVAGATEVCMQGGIDPELPVTGYADLVRAVKARVPSMHVHAFSPMEIAN
GVTKSGLSIREWLIGLREAGLDTIPGTAAEILDDEVRWVLTKGKLPTSLWIEIVTTAHEVGLRSSSTMMYGHVDSPRHWV
AHLNVLRDIQDRTGGFTEFVPLPFVHQNSPLYLAGAARPGPSHRDNRAVHALARIMLHGRISHIQTSWVKLGVRRTQVML
EGGANDLGGTLMEETISRMAGSEHGSAKTVAELVAIAEGIGRPARQRTTTYALLAA
>E5ASS2 2.7.7.106~~~cofC~~~3-phospho-D-glycerate guanylyltransferase~~~COG1920
MRCNGMSPVSVAQRNTGIWAVVPLKAPECAKTRLSGVLSHAARQALFFSMASHVIGTLRASPRIASLLVVTPSESTAEMA
RAAGAEILWGPPDEGMANACSRAMAHIAAAGGERVMFVPGDLPLLDGAAIDMLSRAPVDAIGMAPNRDGHGTNGLICRPG
AIPLFFSGPSFSAHQNAARCAGIDVWIVRSREWALDVDLPADLEEFESSIKDAKRRVLCQI
>A0QUZ4 2.7.7.105~~~fbiD~~~Phosphoenolpyruvate guanylyltransferase~~~COG1920
MSGRRTPGAASDGSAGGQAAVIIAVKRLTAAKTRLAPIFSAATREEVVLAMLVDTITAASTVAPVTVVTPDEVAADAARQ
LGAGVLADPTPEGHHNPLNNAIMAAEEHLRAGTPNIVVLQGDLPAMQPRELAEALAAARTYPRSFVGDRHGTGTSALFAF
GVPLQPRFGADSATHHRHSGAIELTGAWPGLRCDIDTAEDLRTARRLGVGPATAQAIAAHR
>P9WP83 2.7.7.105~~~fbiD~~~Phosphoenolpyruvate guanylyltransferase~~~COG1920
MSGTPDDGDIGLIIAVKRLAAAKTRLAPVFSAQTRENVVLAMLVDTLTAAAGVGSLRSITVITPDEAAAAAAAGLGADVL
ADPTPEDDPDPLNTAITAAERVVAEGASNIVVLQGDLPALQTQELAEAISAARHHRRSFVADRLGTGTAVLCAFGTALHP
RFGPDSSARHRRSGAVELTGAWPGLRCDVDTPADLTAARQLGVGPATARAVAHR
>Q2RG86 3.1.3.11~~~fbp~~~Fructose-1,6-bisphosphate aldolase/phosphatase~~~COG1980
MGERITLSVIKADIGGFVGHSSVHPRLLETAERYLAESKLLIDYRVAHVGDDIDLIMTHKYGVDCPEIHHLAWNVFLQCT
EVARELKLYGAGQDLLSDAFSGNVKGMGPGVAEMEFEERKSEPIIVFAADKTEPGAWNLPLYKMFADPFNTIGLVIDPKM
HQGFRFEVYDLIKNERVEFSLPEELYDLLVFIGAPGRYCIKSVYSKTTGEIAAVSSTQRLNLMAGRYVGKDDPVCIVRCQ
SGLPAVGEALEPFANPHLVAGWMRGSHIGPLMPVGLDQSAPTRFDGPPRVVAMGFQLSGGRLVGPQDFFGDVAFDKARQT
ANEIASYLRSLGPFEPHRLPLEDMEYTTMPEVMAKLKNRFVKVDHRDDREPAAEELGAK
>Q72K02 3.1.3.11~~~fbp~~~Fructose-1,6-bisphosphate aldolase/phosphatase~~~COG1980
MKVTLSVLKADIGSVGGHTLPSRKVLAKVEEVVREEVGRLLLDAYVFHIGDDIVLLLSHTRGVRNQEVHALAWKAFREGT
EVARAEGLYGAGQDLLKDAFTGNLHGLGPQVAEMEFTERPAEPFMVLAADKTEPGAFNLPLYLAFADPMYSSGLLLSPEL
RPGFRFRIMDLAQTERDSYIELDAPERLYDIATLLRDSHRFAIESIWSRKHGEQAAVVSTTRLRNIAGRYVGKDDPVAIV
RTQKIFPATEEFGPVFALAPYVAGDTRGSHHMPLMPVRANTPASTFFCVPMVCGLAFSLREGRLSEPVDLFADPVWEAVR
AKVVEKAQEMRRQGFYGPAMLPMEELEYTGIAERLKALEREFS
>P35755 ~~~fbpA~~~Iron-utilization periplasmic protein~~~COG1840
MQFKHFKLATLAAALAFSANSFADITVYNGQHKEAATAVAKAFEQETGIKVTLNSGKSEQLAGQLKEEGDKTPADVFYTE
QTATFADLSEAGLLAPISEQTIQQTAQKGVPLAPKKDWIALSGRSRVVVYDHTKLSEKDMEKSVLDYATPKWKGKIGYVS
TSGAFLEQVVALSKMKGDKVALNWLKGLKENGKLYAKNSVALQAVENGEVPAALINNYYWYNLAKEKGVENLKSRLYFVR
HQDPGALVSYSGAAVLKASKNQAEAQKFVDFLASKKGQEALVAARAEYPLRADVVSPFNLEPYEKLEAPVVSATTAQDKE
HAIKLIEEAGLK
>P0A0Y3 ~~~fbpA~~~Major ferric iron-binding protein~~~
MKTSIRYALLAAALTAATPALADITVYNGQHKEAAQAVADAFTRATGIKVKLNSAKGDQLAGQIKEEGSRSPADVFYSEQ
IPALATLSAANLLEPLPASTINETRGKGVPVAAKKDWVALSGRSRVVVYDTRKLSEKDLEKSVLNYATPKWKNRIGYAPT
SGAFLEQVVAIVKLKGEAAALKWLKGLKEYGKPYAKNSVALQAVENGEIDAALINNYYWHAFAREKGVQNVHTRLNFVRH
RDPGALITYSGAAVLKSSQNKDEAKKFVAFLASKEGQRALTAVRAEYPLNPHVVSTFNLEPIAKLEAPQVSATTVSEKEH
ATRLLEQAGMK
>P0A0Y4 ~~~fbpA~~~Major ferric iron-binding protein~~~
MKTSIRYALLAAALTAATPALADITVYNGQHKEAAQAVADAFTRATGIKVKLNSAKGDQLAGQIKEEGSRSPADVFYSEQ
IPALATLSAANLLEPLPASTINETRGKGVPVAAKKDWVALSGRSRVVVYDTRKLSEKDLEKSVLNYATPKWKNRIGYAPT
SGAFLEQVVAIVKLKGEAAALKWLKGLKEYGKPYAKNSVALQAVENGEIDAALINNYYWHAFAREKGVQNVHTRLNFVRH
RDPGALVTYSGAAVLKSSQNKDEAKKFVAFLASKEGQRALTAVRAEYPLNPHVVSTFNLEPIAKLEAPQVSATTVSEKEH
ATRLLEQAGMK
>P21408 ~~~fbpA~~~Fe(3+)-binding periplasmic protein~~~
MKLRISSLGPVALLASSMMLAFGAQAASADQGIVIYNAQHENLVKSWVDGFTKDTGIKVTLRNGGDSELGNQLVQEGSAS
PADVFLTENSPAMVLVDNAKLFAPLDAATLAQVEPQYRPSHGRWIGIAARSTVFVYNPAKLSDAQLPKSLLDLAKPEWKG
RWAASPSGADFQAIVSALLELKGEKATLAWLKAMKTNFTAYKGNSTVMKAVNAGQVDSGVIYHYYPFVDGAKTGENSNNI
KLYYFKHQDPGAFVSISGGGVLASSKHQQQAQAFIKWITGKQGQEILRTNNAFEYAVGVGAASNPKLVPLKDLDAPKVDA
AQLNSKKVVELMTEAGLL
>P44513 7.2.2.7~~~fbpC2~~~Fe(3+) ions import ATP-binding protein FbpC 2~~~COG3842
MRLNKMINNPLLTVKNLNKFFNEQQVLHDISFSLQRGEILFLLGSSGCGKTTLLRAIAGFEQPSNGEIWLKERLIFGENF
NLPTQQRHLGYVVQEGVLFPHLNVYRNIAYGLGNGKGKNSEEKTRIEQIMQLTGIFELADRFPHQLSGGQQQRVALARAL
APNPELILLDEPFSALDEHLRQQIRQEMLQALRQSGASAIFVTHDRDESLRYADKIAIIQQGKILQIDTPRTLYWSPNHL
ETAKFMGESIVLPANLLDENTAQCQLGNIPIKNKSISQNQGRILLRPEQFSLFKTSENPTALFNGQIKQIEFKGKITSIQ
IEINGYAIWIENVISPDLSIGDNLPVYLHRKGLFYS
>Q5FA19 7.2.2.7~~~fbpC~~~Fe(3+) ions import ATP-binding protein FbpC~~~
MTAALHIGHLSKSFQNTPVLNDISLSLDPGEILFIIGASGCGKTTLLRCLAGFEQPDSGEISLSGKTIFSKNTNLPVRER
RLGYLVQEGVLFPHLTVYRNIAYGLGNGKGRTAQERQRIEAMLELTGISELAGRYPHELSGGQQQRVALARALAPDPELI
LLDEPFSALDEQLRRQIREDMIAALRANGKSAVFVSHDREEALQYADRIAVMKQGRILQTASPHELYRQPADLDAALFIG
EGIVFPAALNADGTADCRLGRLPVQSGAPAGTRGTLLIRPEQFSLHPHSAPAASIHAVVLKTTPKARHTEISLRAGQTVL
TLNLPSAPTLSDGISAVLHLDGPALFFPGNTL
>P17259 ~~~fbp~~~Major ferric iron-binding protein~~~
MKTSIRYALLAAALTAATPALADITVYNGQHKEAAQAVADAFTRATGIKVKLNSAKGDQLAGQIKEEGSRSPADVFYSEQ
IPALATLSAANLLEPLPASTINETRGKGVPVAAKKDWVALSGRSRVVVYDTRKLSEKDLEKSVLNYATPKWKNRIGYVPT
SGAFLEQIVAIVKLKGEAAALKWLKGLKEYGKPYAKNSVALQAVENGEIDAALINNYYWHAFAREKGVQNVHTRLNFVRH
RDPGALVTYSGAVLKSSQNKDEAKKFVAFLAGKEGQRALTAVRAEYPLNPHVVSTFNLEPIAKLEAPQVSATTVSEKEHA
TRLLEQAGMK
>Q31QY2 3.1.3.11~~~~~~D-fructose 1,6-bisphosphatase class 2/sedoheptulose 1,7-bisphosphatase~~~COG1494
MEKTIGLEIIEVVEQAAIASARLMGKGEKNEADRVAVEAMRVRMNQVEMLGRIVIGEGERDEAPMLYIGEEVGIYRDADK
RAGVPAGKLVEIDIAVDPCEGTNLCAYGQPGSMAVLAISEKGGLFAAPDFYMKKLAAPPAAKGKVDINKSATENLKILSE
CLDRAIDELVVVVMDRPRHKELIQEIRQAGARVRLISDGDVSAAISCGFAGTNTHALMGIGAAPEGVISAAAMRCLGGHF
QGQLIYDPEVVKTGLIGESRESNIARLQEMGITDPDRVYDANELASGQEVLFAACGITPGLLMEGVRFFKGGARTQSLVI
SSQSRTARFVDTVHMFDDVKTVSLR
>P73922 3.1.3.11~~~~~~D-fructose 1,6-bisphosphatase class 2/sedoheptulose 1,7-bisphosphatase~~~COG1494
MDSTLGLEIIEVVEQAAIASAKWMGKGEKNTADQVAVEAMRERMNKIHMRGRIVIGEGERDDAPMLYIGEEVGICTREDA
KSFCNPDELVEIDIAVDPCEGTNLVAYGQNGSMAVLAISEKGGLFAAPDFYMKKLAAPPAAKGHVDIDKSATENLKILSD
CLNRSIEELVVVVMDRPRHKELIQEIRNAGARVRLISDGDVSAAISCAFSGTNIHALMGIGAAPEGVISAAAMRCLGGHF
QGQLIYDPEVVKTGLIGESREGNLERLASMGIKNPDQVYNCEELACGETVLFAACGITPGTLMEGVRFFHGGVRTQSLVI
SSQSSTARFVDTVHMKESPKVIQLH
>Q8DJE9 3.1.3.11~~~~~~D-fructose 1,6-bisphosphatase class 2/sedoheptulose 1,7-bisphosphatase~~~COG1494
MDNVIGLEIIEVVEQAAIASARWMGKGDKNMADQAAVDAMRNRMNQIHMRGRIVIGEGERDEAPMLYIGEEVGICTRPDA
AQYCNPEELIEIDIAVDPCEGTNLCAYGQPGSMAVLAISEKGGLFAAPDFYMKKLAAPPAAKGKVDIRNSATENLKILSE
CLDRAIDELVVVVMKRDRHNDLIQEIRDAGARVQLISDGDVSAALACAFSGTNIHALMGIGAAPEGVISAAAMRALGGHF
QGQLVYDPAVVMTKEWANRTREGNLEELKKAGITDPDKVYEAEELASGETVLFAACGITPGMLMKGVRFFKGGARTQSLV
ISTQSKTARFVDTIHMFDQQLKSLQLY
>Q55721 ~~~~~~Folate-biopterin transporter~~~COG2211
MLVAMSMTPIAILFSTPLKRFLREKVLLGNAPSWELLAILSIYFVQGVLGLSRLAVSFFLKDELGLSPAAMGALIGLGAA
PWILKPVLGLMSDTVPLFGYRRRSYLWLSGLMGSAGWLLFAAWVSSGTQAGLVLLFTSLSVAIGDVIVDSLVVERAQRES
LAQVGSLQSLTWGAAAVGGIITAYASGALLEWFSTRTVFAITAIFPLLTVGAAFLISEVSTAEEEEKPQPKAQIKLVWQA
VRQKTILLPTLFIFFWQATPSAESAFFYFTTNELGFEPKFLGRVRLVTSVAGLIGVGLYQRFLKTLPFRVIMGWSTVISS
LLGLTTLILITHANRAMGIDDHWFSLGDSIILTVTGQIAFMPVLVLAARLCPPGIEATLFALLMSVMNLAGVLSFEVGSL
LTHWLGVTETQFDNLALLVIITNLSTLLPLPFLGLLPAGDPQVKDKTEKEDNPDDPGDRLVLPPAEVFEHHTVGSLSDQN
FLPEFFPEKSSSRP
>P9WLL7 1.3.98.-~~~~~~F420H(2)-dependent biliverdin reductase~~~COG3871
MAMVNTTTRLSDDALAFLSERHLAMLTTLRADNSPHVVAVGFTFDPKTHIARVITTGGSQKAVNADRSGLAVLSQVDGAR
WLSLEGRAAVNSDIDAVRDAELRYAQRYRTPRPNPRRVVIEVQIERVLGSADLLDRA
>Q52472 1.1.1.122~~~fdh~~~D-threo-aldose 1-dehydrogenase~~~
MSSTEPAAAAAGLAIPALGYGAANVGNLFRALSDDEAWAVLEAAWDAGIRYYDTAPHYGLGLSEKRLGAFLQTKPRDEFV
VSTKAGRLLRPNPERRPSGLDTDNDFHVPDDLRREWDFTEQGIRASIAESQERLGLDRIDLLYLHDPERHDLDLALASAF
PALEKVRAEGVVKAIGIGSMVSDALTRAVREADLDLIMVAGRYTLLEQPAATEVLPACAENATGIVAASVFNSGLLAQSE
PKRDGRYEYGQLPDELWDRLVRIAAICRNHDVPLPAAAIQFPLQSALVRSVVVGGSRPAQLTQNAEYAALEIPAGLWAEL
AEARLIPTP
>Q6E7F2 1.1.1.266~~~fcf1~~~dTDP-4-dehydro-6-deoxyglucose reductase~~~
MDARKNGVLITGGAGFIGKALITEMVERQIPLVSFDISDKPDSLPELSEYFNWYKFSYLESSQRIKELHEIVSRHNIKTV
IHLATTMFPHESKKNIDKDCLENVYANVCFFKNLYENGCEKIIFASSGGTVYGKSDTPFSEDDALLPEISYGLSKVMTET
YLRFIAKELNGKSISLRISNPYGEGQRIDGKQGVIPIFLNKISNDIPIDIIGSIESKRDYIYISDLVQAFMCSLEYEGHE
DIFNIGSGESITLKKLIETIEFKLNKKAVIGFQDPIHTNANGIILDIKRAMAELGWRPTVVLDDGIDKLIKSIRCK
>Q6E7F1 5.4.99.59~~~fcf2~~~dTDP-fucopyranose mutase~~~
MNKVLIIGSGFSGATIARLLAEENIKVKIIDDRKHIGGNCYDERDEKTGINVHVYGPHIFHTDNEDVWNFVNKYGTFQPY
TTRLKANAKGQIYSLPVNLHTINQYYKTALSPTEARKLIASKGDQTINDPQSFEEQALKFVGEDLYKTFFYGYPKKQWGM
EPKEIPASVLKRLPVRFNYDDNYFFHKFQGIPRDGYTPLFQNLLNHPNIEFELGKKVNRATVEELITSEQYGHVFFSGAI
DHFYDYEFGMLQYRTLDFEKFYSEDDDYQGCVVMSYCDEDVPYTRVTEHKYFTPWEEHKGSVLYKEFSRSCDKEDIPYYP
VRLVSGNSIWNKYEQKAKEETNITFIGRLATYRYLDMDVCIKEAIECAQLYIKNNKE
>Q49135 3.5.4.9~~~fchA~~~Methenyltetrahydrofolate cyclohydrolase~~~COG3404
MAGNETIETFLDGLASSAPTPGGGGAAAISGAMGAALVSMVCNLTIGKKKYVEVEADLKQVLEKSEGLRRTLTGMIADDV
EAFDAVMGAYGLPKNTDEEKAARAAKIQEALKTATDVPLACCRVCREVIDLAEIVAEKGNLNVISDAGVAVLSAYAGLRS
AALNVYVNAKGLDDRAFAEERLKELEGLLAEAGALNERIYETVKSKVN
>P33217 1.1.1.271~~~fcl~~~GDP-L-fucose synthase~~~COG0451
MGKGKKLLITGGRGMVGRNLIACAARSGWEIIAPTSVDLDLRNAEAVEQYIRRQLPDVVVHAAGVVGGIHANIADPIHFL
ADNAAMALNVVMSSFRSEVVTLINLSSSCMYPACIEGPLKECDILRGPFEVTNEGYALAKTVGLKICEYIDKLPNFNYKT
LIACNLYGVGDNFDPRRSHLLPAIIEKIHKASQCGSESVSIWGDGTARREFMFAYDFAKIIIKALEVPELIPSSMNVGVG
KDLSVLEYYSLVARVIGWSGEFVYDLNRPVGMRSKLMDITHLTALGWVPERSLEGGIRSTYQYYITGNEVYE
>P32055 1.1.1.271~~~fcl~~~GDP-L-fucose synthase~~~COG0451
MSKQRVFIAGHRGMVGSAIRRQLEQRGDVELVLRTRDELNLLDSRAVHDFFASERIDQVYLAAAKVGGIVANNTYPADFI
YQNMMIESNIIHAAHQNDVNKLLFLGSSCIYPKLAKQPMAESELLQGTLEPTNEPYAIAKIAGIKLCESYNRQYGRDYRS
VMPTNLYGPHDNFHPSNSHVIPALLRRFHEATAQNAPDVVVWGSGTPMREFLHVDDMAAASIHVMELAHEVWLENTQPML
SHINVGTGVDCTIRELAQTIAKVVGYKGRVVFDASKPDGTPRKLLDVTRLHQLGWYHEISLEAGLASTYQWFLENQDRFR
G
>Q0BGE1 3.1.1.120~~~~~~L-fucono-1,5-lactonase~~~COG3618
MTFRIDAHQHFWQMASRDGYWPPQTLDAIYRDFGPQDLEPLLARSGVQRTVVVQSLPTQEDTRYLLDVASRTSFVAAVVG
WVDLKSTDAPADIASLARDPKFRGIRPMLQDLADDDWIDDPVLEPAIDAMLAHDLAFDALVTPRHLPALLAFARRYPRLR
IVIDHAAKPPIASGRSEAWHVAMSELAAHPNVHCKLSGLWTEAGPHPDLLRVEPYVRAVCDWFGASRLIWGSDWPVSRLA
GHFGDYGAWLAWCEQCCDRFLGPDARARVFGGNACHFYRIDRPSGDQHAQ
>A0A0H3KNC4 3.1.1.120~~~~~~L-fucono-1,5-lactonase~~~COG3618
MGALRIDSHQHFWRYRAADYPWIGAGMGVLARDYLPDALHPLMHAQALGASIAVQARAGRDETAFLLELACDEARIAAVV
GWEDLRAPQLAERVAEWRGTKLRGFRHQLQDEADVRAFVDDADFARGVAWLQANDYVYDVLVFERQLPDVQAFCARHDAH
WLVLDHAGKPALAEFDRDDTALARWRAALRELAALPHVVCKLSGLVTEADWRRGLRASDLRHIEQCLDAALDAFGPQRLM
FGSDWPVCLLAASYDEVASLVERWAESRLSAAERSALWGGTAARCYALPEPADARL
>P0DX22 3.1.1.120~~~~~~L-fucono-1,5-lactonase~~~
MRIDAHQHFWAYNPEEFDWIGEDEGVLKRDYFPPALEAEAGANGVDGTVVVQARQSIEETQWLLSLAQQFPLIKGVVGWV
DLMNPSLEQQLLSWQNEQVLKGFRHVLQGEPDPNFMLQPRFVEGLKLLHQFDYTYDLLIFAAQLPQARQLLDTLPQHRIV
IDHIAKPDIASGEGFAQWKAHMQAIAEHQNVYCKISGMVTEASHKDWQEADFIPYMDVVFSAFGPERVMFGSDWPVCQLA
ATYPEVIQIVERYVTRLYPEFSQHVFGLNAERFYRL
>A0A2P1BT06 6.2.1.34~~~FCS1~~~Trans-feruloyl-CoA synthase FCS1~~~
MGERRFSNQQIDRLLRPKSVAVIGASDRKGALGATLLNNLVQYEFSGDIYPVNPKRDELLGLKVYHEVAELPEGIDCAVL
AIPRPFVIDTVRQLAQRGCGAVVIYSAGFSEAGEEGMKDQLELAAIAAEYGMVIEGPNCLGCTNYVERVPLTFVETNMQT
PPKGTRAVGIASQSGALAAVLATALHPRGLYVSSSVSTGNEAASGVEDYVEWLVDDEDTHVIAMYVESLRRPKAFIAAAR
RAHAAGKPIVMLHPGKSNKAQESAATHTGAMAGDYALMKTKLAREGVIFADTLEELADITEIALRCRALPGANMAVLGES
GALRGLAFDIAEDIGLDLIHLDDDNSPALRAILPDFVPVSNPTDITALGLSEPEIYTKVLTALLEDERIGSVVASIIQSD
PITSGIKFPHIIKVLDGGTFAKPLVFAGVDEGATVPKEYIDGLRKVGIPWFPSTERAYRAIARLADLSKRDLADNSGDPI
VVPGLDAVSGVVPEYKAKELLRPLGIAFPPSQFAANAEAAAAAARAIGYPVVMKAQAAALGHKSDAGGVILNLKTDDEVR
DAFARIYGNVEAYDRSIALDGVLIEKMGKMGTEMIVGAKNDPQWGPVVLAGFGGVTAEILKDVKLFTPEMDAAAVQRGLL
ELKQAPILKGYRGAPALDVAALAELIVQIGRVMAGNPSIREIDLNPVIIHPAGEGVAALDALMLVER
>Q9EY88 6.2.1.34~~~fcs~~~Feruloyl-CoA synthase~~~
MRNQGLGSWPVRRARMSPHATAVRHGGTALTYAELSRRVARLANGLRAAGVRPGDRVAYLGPNHPAYLETLFACGQAGAV
FVPLNFRLGVPELDHALADSGASVLIHTPEHAETVAALAAGRLLRVPAGELDAADDEPPDLPVGLDDVCLLMYTSGSTGR
PKGAMLTHGNLTWNCVNVLVETDLASDERALVAAPLFHAAALGMVCLPTLLKGGTVILHSAFDPGAVLSAVEQERVTLVF
GVPTMYQAIAAHPRWRSADLSSLRTLLCGGAPVPADLASRYLDRGLAFVQGYGMTEAAPGVLVLDRAHVAEKIGSAGVPS
FFTDVRLAGPSGEPVPPGEKGEIVVSGPNVMKGYWGRPEATAEVLRDGWFHSGDVATVDGDGYFHVVDRLKDMIISGGEN
IYPAEVENELYGYPGVEACAVIGVPDPRWGEVGKAVVVPADGSRIDGDELLAWLRTRLAGYKVPKSVEFTDRLPTTGSGK
ILKGEVRRRFG
>S5M744 6.2.1.34~~~fcs~~~Feruloyl-CoA synthase~~~
MRNQGLGSWPVRRARMSPHATAVRHGGTALTYAELSRRVARLANGLRAAGVRPGDRVAYLGPNHPAYLETLFACGQAGAV
FVPLNFRLGVPELDHALADSGASVLIHTPEHAETVAALAAGRLLRVPAGELDAADDEPPDLPVGLDDVCLLMYTSGSTGR
PKGAMLTHGNLTWNCVNVLVETDLASDERALVAAPLFHAAALGMVCLPTLLKGGTVILHSAFDPGAVLSAVEQERVTLVF
GVPTMYQAIAAHPRWRSADLSSLRTLLCGGAPVPADLASRYLDRGLAFVQGYGMTEAAPGVLVLDRAHVAEKIGSAGVPS
FFTDVRLAGPSGEPVPPGEKGEIVVSGPNVMKGYWGRPEATAEVLRDGWFHSGDVATVDGDGYFHVVDRLKDMIISGGEN
IYPAEVENELYGYPGVEACAVIGVPDPRWGEVGKAVVVPADGSRIDGDELLAWLRTRLAGYKVPKSVEFTDRLPTTGSGK
ILKGEVRRRFG
>P69902 2.8.3.16~~~frc~~~Formyl-CoA:oxalate CoA-transferase~~~COG1804
MSTPLQGIKVLDFTGVQSGPSCTQMLAWFGADVIKIERPGVGDVTRHQLRDIPDIDALYFTMLNSNKRSIELNTKTAEGK
EVMEKLIREADILVENFHPGAIDHMGFTWEHIQEINPRLIFGSIKGFDECSPYVNVKAYENVAQAAGGAASTTGFWDGPP
LVSAAALGDSNTGMHLLIGLLAALLHREKTGRGQRVTMSMQDAVLNLCRVKLRDQQRLDKLGYLEEYPQYPNGTFGDAVP
RGGNAGGGGQPGWILKCKGWETDPNAYIYFTIQEQNWENTCKAIGKPEWITDPAYSTAHARQPHIFDIFAEIEKYTVTID
KHEAVAYLTQFDIPCAPVLSMKEISLDPSLRQSGSVVEVEQPLRGKYLTVGCPMKFSAFTPDIKAAPLLGEHTAAVLQEL
GYSDDEIAAMKQNHAI
>O06644 2.8.3.16~~~frc~~~Formyl-CoA:oxalate CoA-transferase~~~
MTKPLDGINVLDFTHVQAGPACTQMMGFLGANVIKIERRGSGDMTRGWLQDKPNVDSLYFTMFNCNKRSIELDMKTPEGK
ELLEQMIKKADVMVENFGPGALDRMGFTWEYIQELNPRVILASVKGYAEGHANEHLKVYENVAQCSGGAAATTGFWDGPP
TVSGAALGDSNSGMHLMIGILAALEMRHKTGRGQKVAVAMQDAVLNLVRIKLRDQQRLERTGILAEYPQAQPNFAFDRDG
NPLSFDNITSVPRGGNAGGGGQPGWMLKCKGWETDADSYVYFTIAANMWPQICDMIDKPEWKDDPAYNTFEGRVDKLMDI
FSFIETKFADKDKFEVTEWAAQYGIPCGPVMSMKELAHDPSLQKVGTVVEVVDEIRGNHLTVGAPFKFSGFQPEITRAPL
LGEHTDEVLKELGLDDAKIKELHAKQVV
>Q05202 ~~~fcuA~~~Ferrichrome receptor FcuA~~~
MNQTISSRAPQKRLAPRLLCVMIGAALGTLSASSWAAAATDSTAENAKKTSATAATAKAEDSKTNDTITVVGAQETFRAG
GNDLIPTYLDGQVANGGRIGFLGQQDARNVPFNVIGYTSKMIEDQQANSIADVVKNDASVQNVRGYGNPSQNYRIRGYNL
DGDDISFGGLFGVLPRQIVSTSMVERVEVFKGANAFINGISPSGSGVGGMINLEPKRAGDTPLTRVTVDYGSASQVGGAL
DVGRRYGDDDQFGVRVNVLHREGESAIHDQKERTTAVSTGLDYRGDRARTSLDVGYQKQTIHHMRTDVAIGGATVIPEPP
SSTLNYGQSWVYTDMETTFGMLRSEYDVSQNWTVYGSVGASRNEETGQYGAPMLTNNNGDATISRLYVPYVADSVAGLGG
IRGHFDTGPITHKVNLGYAANYRTTKSAWNMSGQEDTNIYNPGVIGFPQTVMGSDSQDPQLTSQVRASGLSLSDTLSMMD
DKVSLMLGVRRQEVTIRNFDSGVPNSAGSLDAMKVTPIYGIMVKPWEKVSLYANHIEALGPGKSAPYQYNGKPVVNAGQI
PGIIHSKQNEIGVKFDNQRYGGTLALFEITRPTGMVDPATNVYGFYGEQRNRGIELNVFGEPVFGTRLLASATWLDPKLT
KAADSANNGNDAVGVANYQLVFGGEYDIPVVEGLTATGTVVRSGSQYANEANTLKLKPWTRLDLGVRYTMPMKDTSLTWR
ANIENVTNERYWESVEDSGTYIYQGDPRALKLSVSMDF
>P09347 1.1.1.327~~~camD~~~5-exo-hydroxycamphor dehydrogenase~~~
MQYARAAVMVEQNRVETWEVPIFDPAPGGALVRVVLGGVCGSDVHIVSGEAGAMPFPIILGHEGIGRIEKLGTGVTTDYA
GVPVKQGDMVYWAPIALCHRCHSCTVLDETPWDNSTFFEHAQKPNWGSYADFACLPNGMAFYRLPDHAQPEALAALGCAL
PTVLRGYDRCGPVGLDDTVVVQGAGPVGLAAVLVAAASGAKDIIAIDHSPIRLDMARSLGATETISLADTTPEERQRIVQ
ERFGKRGASLVVEAAGALPAFPEGVNLTGNHGRYVILGLWGAIGTQPISPRDLTIKNMSIAGATFPKPKHYYQAMQLAAR
LQDRYPLADLITQRFSIDEASKALELVKAGALIKPVIDSTL
>Q934F5 1.17.1.9~~~fdhA~~~Formate dehydrogenase subunit alpha~~~
MLIKRRAFLKLTAAGATLSAFGGLGVDLAPAKAQAATMALKTVDAKQTTSVCCYCSVGCGLIVHTDKKTNRAINVEGDPD
HPINEGSLCAKGASTWQLAENERRPANPLYRAPGSDQWEEKSWDWMLDTIAERVAKTREATFVTKNAKGQVVNRCDGIAS
VGSAAMDNEECWIYQAWLRSLGLFYIEHQARIUHSATVAALAESYGRGAMTNHWIDLKNSDVILMMGSNPAENHPISFKW
VMRAKDKGATLIHVDPRYTRTSTKCDLYAPLRSGSDIAFLNGMTKYILEKELYFKDYVVNYTNASFIVGEGFAFEEGLFA
GYNKETRKYDKSKWGFERDENGNPKRDETLKHPRCVFQIMKKHYERYDLDKISAICGTPKELILKVYDAYCATGKPDKAG
TIMYAMGWTQHTVGVQNIRAMSINQLLLGNIGVAGGGVNALRGEANVQGSTDHGLLMHIYPGYLGTARASIPTYEEYTKK
FTPVSKDPQSANWWSNFPKYSASYIKSMWPDADLNEAYGYLPKGEDGKDYSWLTLFDDMFQGKIKGFFAWGQNPACSGAN
SNKTREALTKLDWMVNVNIFDNETGSFWRGPDMDPKKIKTEVFFLPCAVAIEKEGSISNSGRWMQWRYVGPEPRKNAIPD
GDLIVELAKRVQKLLAKTPGKLAAPVTKLKTDYWVNDHGHFDPHKIAKLINGFALKDFKVGDVEYKAGQQIATFGHLQAD
GSTTSGCWIYTGSYTEKGNMAARRDKTQTDMQAKIGLYPGWTWAWPVNRRIIYNRASVDLNGKPYAPEKAVVEWNAAEKK
WVGDVPDGPWPPQADKEKGKRAFIMKPEGYAYLYGPGREDGPLPEYYEPMECPVIEHPFSKTLHNPTALHFATEEKAVCD
PRYPFICSTYRVTEHWQTGLMTRNTPWLLEAEPQMFCEMSEELATLRGIKNGDKVILESVRGKLWAKAIITKRIKPFAIQ
GQQVHMVGIPWHYGWSFPKNGGDAANILTPSVGDPNTGIPETKAFMVNVTKA
>Q8GC87 ~~~fdhB~~~Formate dehydrogenase subunit beta~~~
MSKGFFVDTTRCTACRGCQVACKQWHGNPATPTENTGFHQNPPDFNFHTYKLVRMHEQEIDGRIDWLFFPDQCRHCIAPP
CKATADMEDESAIIHDDATGCVLFTPKTKDLEDYESVISACPYDVPRKVAESNQMAKCDMCIDRITNGLRPACVTSCPTG
AMNFGDLSEMEAMASARLAEIKAAYSDAKLCDPDDVRVIFLTAHNPKLYHEYAVA
>P27273 ~~~fdhB1~~~Formate dehydrogenase iron-sulfur subunit~~~COG0437
MESQARVKFYCDEARCIDCHGCDVACKEAHHLPVGVNRRRVVTLNEGLVGKEKSLSIACMHCSDAPCAQVCPVDCFYVRA
DGIVLHDKEKCIGCGYCLYACPFGAPQFPKSGIFGSRGPMDKCTFCAGGPEETHSEKEYKLYGQNRIAEGKVPVCAAMCS
TKALLAGDSDSISLIIRERVLKRGSGTASVPYTWSQAYKD
>M1V1V5 ~~~fdhC~~~Fructose dehydrogenase cytochrome subunit~~~
MRYFRPLSATAMTTVLLLAGTNVRAQPTEPTPASAHRPSISRGHYLAIAADCAACHTNGRDGQFLAGGYAISSPMGNIYS
TNITPSKTHGIGNYTLEQFSKALRHGIRADGAQLYPAMPYDAYNRLTDEDVKSLYAYIMTEVKPVDAPSPKTQLPFPFSI
RASLGIWKIAARIEGKPYVFDHTHNDDWNRGRYLVDELAHCGECHTPRNFLLAPNQSAYLAGADIGSWRAPNITNAPQSG
IGSWSDQDLFQYLKTGKTAHARAAGPMAEAIEHSLQYLPDADISAIVTYLRSVPAKAESGQTVANFEHAGRPSSYSVANA
NSRRSNSTLTKTTDGAALYEAVCASCHQSDGKGSKDGYYPSLVGNTTTGQLNPNDLIASILYGVDRTTDNHEILMPAFGP
DSLVQPLTDEQIATIADYVLSHFGNAQATVSADAVKQVRAGGKQVPLAKLASPGVMLLLGTGGILGAILVVAGLWWLISR
RKKRSA
>O87816 ~~~fdhD~~~Sulfur carrier protein FdhD~~~COG1526
MMRCMQSPEVHPAAAGDAEPPTHSTFAVSRWRRGELMLSPDEVAEEVPVALVYNGISHAVMLATPADLEDFALGFSLSEG
IVTRASDVYDIEIDTREHGIAVQLEIASEAFMRLKDRRRSLAGRTGCGLCGTESLEQVMRLPAPVRSDASFHTDVIQAAF
VQLQLRQELQQHTGATHAAAWLRADGHVSLVREDVGRHNALDKLAGALASSGEDISSGAVLVTSRASYEMVLKTAAIGAG
VLAAVSAPTALAVRLAEQASITLAGFVRAGAHVVYAHPQRLQHEASLA
>P32177 ~~~fdhD~~~Sulfur carrier protein FdhD~~~COG1526
MKKTQRKEIENVTNITGVRQIELWRRDDLQHPRLDEVAEEVPVALVYNGISHVVMMASPKDLEYFALGFSLSEGIIESPR
DIFGMDVVPSCNGLEVQIELSSRRFMGLKERRRALAGRTGCGVCGVEQLNDIGKPVQPLPFTQTFDLNKLDDALRHLNDF
QPVGQLTGCTHAAAWMLPSGELVGGHEDVGRHVALDKLLGRRSQEGESWQQGAVLVSSRASYEMVQKSAMCGVEILFAVS
AATTLAVEVAERCNLTLVGFCKPGRATVYTHPQRLSN
>P9WNF1 ~~~fdhD~~~Sulfur carrier protein FdhD~~~COG1526
MGYATAHRRVRHLSADQVITRPETLAVEEPLEIRVNGTPVTVTMRTPGSDFELVQGFLLAEGVVAHREDVLTVSYCGRRV
EGNATGASTYNVLDVALAPGVKPPDVDVTRTFYTTSSCGVCGKASLQAVSQVSRFAPGGDPATVAADTLKAMPDQLRRAQ
KVFARTGGLHAAALFGVDGAMLAVREDIGRHNAVDKVIGWAFERDRIPLGASVLLVSGRASFELTQKALMAGIPVLAAVS
APSSLAVSLADASGITLVAFLRGDSMNVYTRADRIT
>P13024 ~~~fdhE~~~Protein FdhE~~~COG3058
MSIRIIPQDELGSSEKRTADMIPPLLFPRLKNLYNRRAERLRELAENNPLGDYLRFAALIAHAQEVVLYDHPLEMDLTAR
IKEASAQGKPPLDIHVLPRDKHWQKLLMALIAELKPEMSGPALAVIENLEKASTQELEDMASALFASDFSSVSSDKAPFI
WAALSLYWAQMANLIPGKARAEYGEQRQYCPVCGSMPVSSMVQIGTTQGLRYLHCNLCETEWHVVRVKCSNCEQSGKLHY
WSLDDEQAAIKAESCDDCDTYLKILYQEKDPKIEAVADDLASLVLDARMEQEGYARSSINPFLFPGEGE
>Q9HV00 ~~~fdhE~~~Protein FdhE homolog~~~
MSRTILQPGQIEAAANIPPHLHQPSRDLFARRGERLLQLAEGHPMGDYLRLVAGLCRLQQALLDNPPALAPLDPERLRKS
REHGMPPLAYDLLVREGAWLPWLDALLAGYPAPANAAVGAALEQLREAEEGQRKAWAIALLSGQFDLLPAALVPFLGAAL
QVAWSHWLLGLEEGAVVETESRTLCPACGSPPMAGMIRQGGKETGLRYLSCSLCACEWHYVRIKCSHCEESKHLAYLSLE
HDGQPAEKAVLRAETCPSCQGYLKQFYLEFDRHADALADDLASLALDMRLAEDGYLRRSPNLLLAPGGE
>P07658 1.17.98.4~~~fdhF~~~Formate dehydrogenase H~~~COG3383
MKKVVTVCPYCASGCKINLVVDNGKIVRAEAAQGKTNQGTLCLKGYYGWDFINDTQILTPRLKTPMIRRQRGGKLEPVSW
DEALNYVAERLSAIKEKYGPDAIQTTGSSRGTGNETNYVMQKFARAVIGTNNVDCCARVUHGPSVAGLHQSVGNGAMSNA
INEIDNTDLVFVFGYNPADSHPIVANHVINAKRNGAKIIVCDPRKIETARIADMHIALKNGSNIALLNAMGHVIIEENLY
DKAFVASRTEGFEEYRKIVEGYTPESVEDITGVSASEIRQAARMYAQAKSAAILWGMGVTQFYQGVETVRSLTSLAMLTG
NLGKPHAGVNPVRGQNNVQGACDMGALPDTYPGYQYVKDPANREKFAKAWGVESLPAHTGYRISELPHRAAHGEVRAAYI
MGEDPLQTDAELSAVRKAFEDLELVIVQDIFMTKTASAADVILPSTSWGEHEGVFTAADRGFQRFFKAVEPKWDLKTDWQ
IISEIATRMGYPMHYNNTQEIWDELRHLCPDFYGATYEKMGELGFIQWPCRDTSDADQGTSYLFKEKFDTPNGLAQFFTC
DWVAPIDKLTDEYPMVLSTVREVGHYSCRSMTGNCAALAALADEPGYAQINTEDAKRLGIEDEALVWVHSRKGKIITRAQ
VSDRPNKGAIYMTYQWWIGACNELVTENLSPITKTPEYKYCAVRVEPIADQRAAEQYVIDEYNKLKTRLREAALA
>M1VMF7 1.1.5.14~~~fdhL~~~Fructose dehydrogenase large subunit~~~
MSNETLSADVVIIGAGICGSLLAHKLVRNGLSVLLLDAGPRRDRSQIVENWRNMPPDNKSQYDYATPYPSVPWAPHTNYF
PDNNYLIVKGPDRTAYKQGIIKGVGGTTWHWAASSWRYLPNDFKLHSTYGVGRDYAMSYDELEPYYYEAECEMGVMGPNG
EEITPSAPRQNPWPMTSMPYGYGDRTFTEIVSKLGFSNTPVPQARNSRPYDGRPQCCGNNNCMPICPIGAMYNGVYAAIK
AEKLGAKIIPNAVVYAMETDAKNRITAISFYDPDKQSHRVVAKTFVIAANGIETPKLLLLAANDRNPHGIANSSDLVGRN
MMDHPGIGMSFQSAEPIWAGGGSVQMSSITNFRDGDFRSEYAATQIGYNNTAQNSRAGMKALSMGLVGKKLDEEIRRRTA
HGVDIYANHEVLPDPNNRLVLSKDYKDALGIPHPEVTYDVGEYVRKSAAISRQRLMDIAKAMGGTEIEMTPYFTPNNHIT
GGTIMGHDPRDSVVDKWLRTHDHSNLFLATGATMAASGTVNSTLTMAALSLRAADAILNDLKQG
>Q99RW4 1.17.1.9~~~~~~Putative formate dehydrogenase SA2102~~~
MQEHLVVTLDGKDYLVEPGTNLLEFIKSQDTFVPSICYNESMGPIQTCDTCTVEIDGKIERSCSTVIDRPMTVNTVNNDV
KDAQKEALDRILEKHMLYCTVCDYNNGDCEIHNTMDAWGLQHQTYEYKEKPYEKDYGPFYRYDPNQCILCGRCVEACQDI
EVNETIRIDWDREHPRVIWDNDVPINESSCVSCGQCATVCPCNAMMEVNMEGNAGYMTDTEPGSLAAMIDLTKKAEPGYG
PLFAISDSEAEMRKERIKKTKTVCTYCGVGCSFEVWTKDREILKVQPSHDSPANKIATCVKGKFSWGHINSDQRLTKPLV
RKNGEFHEVEWDEALNVIADNFTSIKEKYGPDALSFISSSKATNEESYLMQKLARQVIGTNNVDNCSRYCQAPATKGLFR
TVGHGGDSGSIEDLEKAAMSVLIGTNTAEAHPVIASRMKRAQKLFGQKIHVFDIRKHEMAERADRFYQPKPGTDLAWLSA
VTKYIIDHDLHDKAFIDEWVDDFDEYYKSLETFTMAFAEEATGIPESELIKFAEECAKAESVVICWAMGITQQDIGSDSS
TAISNLLLVTGNYRRPGTGAYPLRGHNNVQGCSDMGSMPDKITGYQSIEADDIRAKFEKEYGVKLNPKAGKDNHEMVEGI
HDGEVHSLYLYGEDTGIVDSNINFVQAAFEKLDFMVVQDEFLTFTATYADVVLPASPSLEKDGTFTNTERRIQRLYQALE
PLGDSKPDWKIFQAIANRLGFDWNYKHPSEIMDEVARLTPLYAGVSYDRLEGFNSLQWPVQPDGTDEPILYLEGFNFDNG
KAKLFPLSFDNYFKQDEIYDIHVNNGRLLEHFHEGNMTYQTPMIKYKVPRAFVEISPELAEDRGIHEGAEVKLISETGEA
VLQVHVTDRVKGKEIYIPLNNDAMENGDLGAINLLTNSDVDQYTDTPSYKRTSCRLEVITKRGKSPLNPNNFRVNKKRHP
QYSVQVQKKWERSDYVFPGNQVDK
>M1VB40 ~~~fdhS~~~Fructose dehydrogenase small subunit~~~
MEKIADSGPVQIFLSRRKLLAFSGASLTVAAIGAPSKGSTQDVVASNRDSISDFMQLSAFATGHKNLDLNIGSALLLAFE
AQKHDFSTQIKALREHITKNNYQDVEALDAAMKDDPLHPTLIQIIRAWYSGVIEDETNAKVYAFEKALMYQPSRDVVVIP
TYAHNGPNYWVSEPASVDVMPAF
>P33160 1.17.1.9~~~~~~Formate dehydrogenase~~~
MAKVLCVLYDDPVDGYPKTYARDDLPKIDHYPGGQTLPTPKAIDFTPGQLLGSVSGELGLRKYLESNGHTLVVTSDKDGP
DSVFERELVDADVVISQPFWPAYLTPERIAKAKNLKLALTAGIGSDHVDLQSAIDRNVTVAEVTYCNSISVAEHVVMMIL
SLVRNYLPSHEWARKGGWNIADCVSHAYDLEAMHVGTVAAGRIGLAVLRRLAPFDVHLHYTDRHRLPESVEKELNLTWHA
TREDMYPVCDVVTLNCPLHPETEHMINDETLKLFKRGAYIVNTARGKLCDRDAVARALESGRLAGYAGDVWFPQPAPKDH
PWRTMPYNGMTPHISGTTLTAQARYAAGTREILECFFEGRPIRDEYLIVQGGALAGTGAHSYSKGNATGGSEEAAKFKKA
V
>Q52078 1.2.98.1~~~fdm~~~Formaldehyde dismutase~~~
MAGNKSVVYHGTRDLRVETVPYPKLEHNNRKLEHAVILKVVSTNICGSDQHIYRGRFIVPKGHVLGHEITGEVVEKGSDV
ELMDIGDLVSVPFNVACGRCRNCKEARSDVCENNLVNPDADLGAFGFDLKGWSGGQAEYVLVPYADYMLLKFGDKEQAME
KIKDLTLISDILPTGFHGCVSAGVKPGSHVYIAGAGPVGRCAAAGARLLGAACVIVGDQNPERLKLLSDAGFETIDLRNS
APLRDQIDQILGKPEVDCGVDAVGFEAHGLGDEANTETPNGALNSLFDVVRAGGAIGIPGIYVGSDPDPVNKDAGSGRLH
LDFGKMWTKSIRIMTGMAPVTNYNRHLTEAILWDQMPYLSKVMNIEVITLDQAPDGYAKFDKGSPAKFVIDPHGMLKNK
>Q727P3 1.17.2.3~~~fdnG-3~~~Formate dehydrogenase 2 subunit alpha (cytochrome c-553)~~~COG0243
MKTTRRSFLKLVGVSVVGLSLGQLGFDLEDAQAYAVKLKIEGAKEVGTVCPFCSVCCQVIAYVRNGKLVSTEGDPDFPVN
EGALCAKGAALFSMYTNPHRLTKPLYRAPHSDKWVEKDWDWTLNQIARRVKDARDKDMILKNDKGQTVNRLESIFMMGTS
HASNEECAVIHQAMRGLGVVHMDHQARVUHSPTVAALAESFGRGAMTNHWIDIKNTDAVLIIGSNAAEHHPVAFKWIMRA
RDNGAVLMHVDPKFSRTSARCDFHVPLRSGTDIAFLGGMVNHIIAKDLYFKDYVANYTNAAFVVGKDYAFEDGIFSGYDP
KTRTYDRSKWEFEKGPDGGPVMDPTLKNERCVFNLMKKHYERYTLKNVSDVTGVSEENLLRVYDAFCATGRPDKAGTILY
ALGWTQHTVGVQNIRTSTLIQLLLGNIGVAGGGINALRGEPNVQGSTDHALLYHILPGYNAMPVAQWQTLADYNKANTPV
TTLKNSANWWSNRPKYVASLLKGWFGDAATPENDFCYEYLPKLEKGEDYSYMYVMDRMYHGKLKGGFIFGVNPMNSFPNT
NKMRAALDKLDWLVCSELHNSETTDNWKRPGVDPKACKTEVFLLPSAHRVEKAGTISNSGRWLQWFDKAVEPGQARNFAD
IFVPLVNKIRALYKAEGGTLPDPVLKLHWTDKFDPEEWTRRINGFFWADTKVGDKEYKRGQLVPAFGQLKDDGSTSSLNW
LYTGSYTEEDGNKSKRRDARQTPMQANIGLFPNWSWCWPVNRRILYNRASVDVNGKPWNPKKAVIEWDGAKWVGDVPDGP
WPPMADKEKGKLPFIMNKDGFAQFYGTGRMDGPFPEHYEPAETPLDSHPFSKQLSSPVYKFHTSDMDQIAKAADPKYPIV
LTTYSLTEHWCGGGETRNVPNLLETEPQLYIEMSPELAEEKGIKNGDGVIVESIRGRAEAIAMVTVRIRPFTVMGKTVHL
VGMPFAYGWTTPKCGDSTNRLTVGAYDPNTTIPESKACLVNVRKADKLTEIA
>Q727P4 ~~~~~~Formate dehydrogenase 2 subunit beta (cytochrome c-553)~~~COG0437
MPKAFLIDTTRCTACRGCQLACKEWHDLPANVTKQRGSHQNPPDLNPNNLKIVRFNERMNEKGVVIWNFFPDQCRHCVTP
VCVDVADMAVPGAMIKDKKTGAVLATEKSAKLSPADAKAVAEACPYNIPRIDPKTKRITKCDMCFDRVSAGMQPICVKTC
PTGTMAFGERDEMLALAEKRLADAKTRFPKAHLVDVEDVSVIYLLAEEKEHYYEYAGFM
>P24183 1.17.5.3~~~fdnG~~~Formate dehydrogenase, nitrate-inducible, major subunit~~~COG0243
MDVSRRQFFKICAGGMAGTTVAALGFAPKQALAQARNYKLLRAKEIRNTCTYCSVGCGLLMYSLGDGAKNAREAIYHIEG
DPDHPVSRGALCPKGAGLLDYVNSENRLRYPEYRAPGSDKWQRISWEEAFSRIAKLMKADRDANFIEKNEQGVTVNRWLS
TGMLCASGASNETGMLTQKFARSLGMLAVDNQARVUHGPTVASLAPTFGRGAMTNHWVDIKNANVVMVMGGNAAEAHPVG
FRWAMEAKNNNDATLIVVDPRFTRTASVADIYAPIRSGTDITFLSGVLRYLIENNKINAEYVKHYTNASLLVRDDFAFED
GLFSGYDAEKRQYDKSSWNYQLDENGYAKRDETLTHPRCVWNLLKEHVSRYTPDVVENICGTPKADFLKVCEVLASTSAP
DRTTTFLYALGWTQHTVGAQNIRTMAMIQLLLGNMGMAGGGVNALRGHSNIQGLTDLGLLSTSLPGYLTLPSEKQVDLQS
YLEANTPKATLADQVNYWSNYPKFFVSLMKSFYGDAAQKENNWGYDWLPKWDQTYDVIKYFNMMDEGKVTGYFCQGFNPV
ASFPDKNKVVSCLSKLKYMVVIDPLVTETSTFWQNHGESNDVDPASIQTEVFRLPSTCFAEEDGSIANSGRWLQWHWKGQ
DAPGEARNDGEILAGIYHHLRELYQSEGGKGVEPLMKMSWNYKQPHEPQSDEVAKENNGYALEDLYDANGVLIAKKGQLL
SSFAHLRDDGTTASSCWIYTGSWTEQGNQMANRDNSDPSGLGNTLGWAWAWPLNRRVLYNRASADINGKPWDPKRMLIQW
NGSKWTGNDIPDFGNAAPGTPTGPFIMQPEGMGRLFAINKMAEGPFPEHYEPIETPLGTNPLHPNVVSNPVVRLYEQDAL
RMGKKEQFPYVGTTYRLTEHFHTWTKHALLNAIAQPEQFVEISETLAAAKGINNGDRVTVSSKRGFIRAVAVVTRRLKPL
NVNGQQVETVGIPIHWGFEGVARKGYIANTLTPNVGDANSQTPEYKAFLVNIEKA
>P0AAJ3 ~~~fdnH~~~Formate dehydrogenase, nitrate-inducible, iron-sulfur subunit~~~COG0437
MAMETQDIIKRSATNSITPPSQVRDYKAEVAKLIDVSTCIGCKACQVACSEWNDIRDEVGHCVGVYDNPADLSAKSWTVM
RFSETEQNGKLEWLIRKDGCMHCEDPGCLKACPSAGAIIQYANGIVDFQSENCIGCGYCIAGCPFNIPRLNKEDNRVYKC
TLCVDRVSVGQEPACVKTCPTGAIHFGTKKEMLELAEQRVAKLKARGYEHAGVYNPEGVGGTHVMYVLHHADQPELYHGL
PKDPKIDTSVSLWKGALKPLAAAGFIATFAGLIFHYIGIGPNKEVDDDEEDHHE
>P0AEK7 ~~~fdnI~~~Formate dehydrogenase, nitrate-inducible, cytochrome b556(Fdn) subunit~~~COG2864
MSKSKMIVRTKFIDRACHWTVVICFFLVALSGISFFFPTLQWLTQTFGTPQMGRILHPFFGIAIFVALMFMFVRFVHHNI
PDKKDIPWLLNIVEVLKGNEHKVADVGKYNAGQKMMFWSIMSMIFVLLVTGVIIWRPYFAQYFPMQVVRYSLLIHAAAGI
ILIHAILIHMYMAFWVKGSIKGMIEGKVSRRWAKKHHPRWYREIEKAEAKKESEEGI
>P32176 1.17.1.9~~~fdoG~~~Formate dehydrogenase-O major subunit~~~COG0243
MQVSRRQFFKICAGGMAGTTAAALGFAPSVALAETRQYKLLRTRETRNTCTYCSVGCGLLMYSLGDGAKNAKASIFHIEG
DPDHPVNRGALCPKGAGLVDFIHSESRLKFPEYRAPGSDKWQQISWEEAFDRIAKLMKEDRDANYIAQNAEGVTVNRWLS
TGMLCASASSNETGYLTQKFSRALGMLAVDNQARVUHGPTVASLAPTFGRGAMTNHWVDIKNANLVVVMGGNAAEAHPVG
FRWAMEAKIHNGAKLIVIDPRFTRTAAVADYYAPIRSGTDIAFLSGVLLYLLNNEKFNREYTEAYTNASLIVREDYGFED
GLFTGYDAEKRKYDKSSWTYELDENGFAKRDTTLQHPRCVWNLLKQHVSRYTPDVVENICGTPKDAFLKVCEYIAETSAH
DKTASFLYALGWTQHSVGAQNIRTMAMIQLLLGNMGMAGGGVNALRGHSNIQGLTDLGLLSQSLPGYMTLPSEKQTDLQT
YLTANTPKPLLEGQVNYWGNYPKFFVSMMKAFFGDKATAENSWGFDWLPKWDKGYDVLQYFEMMKEGKVNGYICQGFNPV
ASFPNKNKVIGCLSKLKFLVTIDPLNTETSNFWQNHGELNEVDSSKIQTEVFRLPSTCFAEENGSIVNSGRWLQWHWKGA
DAPGIALTDGEILSGIFLRLRKMYAEQGGANPDQVLNMTWNYAIPHEPSSEEVAMESNGKALADITDPATGAVIVKKGQQ
LSSFAQLRDDGTTSCGCWIFAGSWTPEGNQMARRDNADPSGLGNTLGWAWAWPLNRRILYNRASADPQGNPWDPKRQLLK
WDGTKWTGWDIPDYSAAPPGSGVGPFIMQQEGMGRLFALDKMAEGPFPEHYEPFETPLGTNPLHPNVISNPAARIFKDDA
EALGKADKFPYVGTTYRLTEHFHYWTKHALLNAILQPEQFVEIGESLANKLGIAQGDTVKVSSNRGYIKAKAVVTKRIRT
LKANGKDIDTIGIPIHWGYEGVAKKGFIANTLTPFVGDANTQTPEFKSFLVNVEKV
>P0AAJ5 ~~~fdoH~~~Formate dehydrogenase-O iron-sulfur subunit~~~COG0437
MAYQSQDIIRRSATNGLTPAPQARDFQEEVAKLIDVTTCIGCKACQVACSEWNDIRDTVGNNIGVYDNPNDLSAKSWTVM
RFSEVEQNDKLEWLIRKDGCMHCSDPGCLKACPAEGAIIQYANGIVDFQSEQCIGCGYCIAGCPFDIPRLNPEDNRVYKC
TLCVDRVVVGQEPACVKTCPTGAIHFGTKESMKTLASERVAELKTRGYDNAGLYDPAGVGGTHVMYVLHHADKPNLYHGL
PENPEISETVKFWKGIWKPLAAVGFAATFAASIFHYVGVGPNRADEEENNLHEEKDEERK
>P0AEL0 ~~~fdoI~~~Formate dehydrogenase, cytochrome b556(fdo) subunit~~~COG2864
MKRRDTIVRYTAPERINHWITAFCFILAAVSGLGFLFPSFNWLMQIMGTPQLARILHPFVGVVMFASFIIMFFRYWHHNL
INRDDIFWAKNIRKIVVNEEVGDTGRYNFGQKCVFWAAIIFLVLLLVSGVIIWRPYFAPAFSIPVIRFALMLHSFAAVAL
IVVIMVHIYAALWVKGTITAMVEGWVTSAWAKKHHPRWYREVRKTTEKKAE
>Q47208 ~~~fdrA~~~Protein FdrA~~~COG0074
MIHAFIKKGCFQDSVSLMIISRKLSESENVDDVSVMMGTPANKALLDTTGFWHDDFNNATPNDICVAIRSEAADAGIAQA
IMQQLEEALKQLAQGSGSSQALTQVRRWDSACQKLPDANLALISVAGEYAAELANQALDRNLNVMMFSDNVTLEDEIQLK
TRAREKGLLVMGPDCGTSMIAGTPLAFANVMPEGNIGVIGASGTGIQELCSQIALAGEGITHAIGLGGRDLSREVGGISA
LTALEMLSADEKSEVLAFVSKPPAEAVRLKIVNAMKATGKPTVALFLGYTPAVARDENVWFASSLDEAARLACLLSRVTA
RRNAIAPVSSGFICGLYTGGTLAAEAAGLLAGHLGVEADDTHQHGMMLDADSHQIIDLGDDFYTVGRPHPMIDPTLRNQL
IADLGAKPQVRVLLLDVVIGFGATADPAASLVSAWQKACAARLDNQPLYAIATVTGTERDPQCRSQQIATLEDAGIAVVS
SLPEATLLAAALIHPLSPAAQQHTPSLLENVAVINIGLRSFALELQSASKPVVHYQWSPVAGGNKKLARLLERLQ
>D5IGG6 1.18.1.2~~~fdr~~~Ferredoxin--NAD(P)(+) reductase fdr~~~
MTDTHYDVVIVGAGHGGAQTAIALRQNGFAGTIAIIGAEPDLPYERPPLSKEYLAAEKGFERILIRPASFWNDRHIAMHL
GCAVERVDPTQRLVFLADGRSMGYGDLVWCAGGSARRLDCTGHDLGGVHYVRTRADTDALAAELPGVSKVVIIGGGYIGL
EAAAVMAKFGKNVTLIEALDRVLARVAGEPLSRFFEEKHRSRGVDVRLRTKVGCLLGQDGRVTHVELNDADPIPADLVIV
GIGIIPAISPLVVAGAKASNGLLVDASGRTSIPHVYALGDCAAHVNSFAPNDIPIRLESVQNANDQAVVVARTICGTAAQ
YHAVPWFWSSQYDIRLQTVGLTAGYDQTFVRGDPATGSFTVVYGRDGRVIALDCVNATKDYVQGKRLVEAKALIEPGMTD
PQYPLKNFMTPSPA
>Q6T1W8 5.3.2.3~~~fdtA~~~TDP-4-oxo-6-deoxy-alpha-D-glucose-3,4-oxoisomerase~~~
MENKVINFKKIIDSRGSLVAIEENKNIPFSIKRVYYIFDTKGEEPRGFHAHKKLEQVLVCLNGSCRVILDDGNIIQEITL
DSPAVGLYVGPAVWHEMHDFSSDCVMMVLASDYYDETDYIRQYDNFKKYIAKINLEKEG
>Q6T1W6 2.6.1.90~~~fdtB~~~dTDP-3-amino-3,6-dideoxy-alpha-D-galactopyranose transaminase~~~
MIPFLDLRQINMRYQKEIQQAMNRVLESGWYILGGEVDDFERKFASYCGAKYCIGVANGLDALTLIIRAYDIGLGDEVIV
PSNTYIASILAISANGATPVLVEPDINTYNIDPLKIEEKITSRTKAIMVVHLYGQSCDMESINLIAKKYNLKVIEDCAQA
HGAIYNGKRVGSLGDAAGFSFYPGKNLGALGDGGAITTNDAELAERLNVLRNYGSHKKYENLFKGVNSRLDELQAAILSI
KLSYLDDDNQRRREIAAYYLEHIKNPFIHLPTVTDDKAHVWHLFVVRVKEREAFQYYLAEQNIQTLIHYPIPPHKQKAYS
EWQQESFPISEQIHSEVVSLPISPVMSREEVERVVEAVNRYGY
>Q6T1W7 2.3.1.197~~~fdtC~~~dTDP-3-amino-3,6-dideoxy-alpha-D-galactopyranose 3-N-acetyltransferase~~~
MSSSSETCFVHPNAIVETKKIGNNTRIWAFVHILPQAMIGDNCNICDHCFIENDVFIGNNVTVKSGIYIWDGVYIEDNVF
LGPNVVFTNDVFPRSKVYPESFGRTIVKKGASIGANSVIVAGNIIGEYAMVGAGSVVTRDIPDYALAYGNPARIKGYVCQ
CTSKLKFIDNQAVCQCGKRYKYADGIVSQLII
>O53937 ~~~fdxE~~~Ferredoxin FdxE~~~COG1141
MKVRLDPSRCVGHAQCYAVDPDLFPIDDSGNSILAEHEVRPEDMQLTRDGVAACPEMALILEEDDAD
>D5ARY6 ~~~fdxN~~~Ferredoxin-1~~~COG1145
MAMKIDPELCTSCGDCEPVCPTNAIAPKKGVYVINADTCTECEGEHDLPQCVNACMTDNCINPAA
>P80668 1.2.1.39~~~feaB~~~Phenylacetaldehyde dehydrogenase~~~COG1012
MTEPHVAVLSQVQQFLDRQHGLYIDGRPGPAQSEKRLAIFDPATGQEIASTADANEADVDNAVMSAWRAFVSRRWAGRLP
AERERILLRFADLVEQHSEELAQLETLEQGKSIAISRAFEVGCTLNWMRYTAGLTTKIAGKTLDLSIPLPQGARYQAWTR
KEPVGVVAGIVPWNFPLMIGMWKVMPALAAGCSIVIKPSETTPLTMLRVAELASEAGIPDGVFNVVTGSGAVCGAALTSH
PHVAKISFTGSTATGKGIARTAADHLTRVTLELGGKNPAIVLKDADPQWVIEGLMTGSFLNQGQVCAASSRIYIEAPLFD
TLVSGFEQAVKSLQVGPGMSPVAQINPLVSRAHCDKVCSFLDDAQAQQAELIRGSNGPAGEGYYVAPTLVVNPDAKLRLT
REEVFGPVVNLVRVADGEEALQLANDTEYGLTASVWTQNLSQALEYSDRLQAGTVWVNSHTLIDANLPFGGMKQSGTGRD
FGPDWLDGWCETKSVCVRY
>Q47129 ~~~feaR~~~Transcriptional activator FeaR~~~COG2207
MNPAVDNEFQQWLSQINQVCGNFTGRLLTERYTGVLDTHFAKGLKLSTVTTSGVNLSRTWQEVKGSDDAWFYTVFQLSGQ
AIMEQDERQVQIGAGDITLLDASRPCSLYWQESSKQISLLLPRTLLEQYFPHQKPICAERLDADLPMVQLSHRLLQESMN
NPALSETESEAALQAMVCLLRPVLHQRESVQPRRERQFQKVVTLIDDNIREEILRPEWIAGETGMSVRSLYRMFADKGLV
VAQYIRNRRLDFCADAIRHAADDEKLAGIGFHWGFSDQSHFSTVFKQRFGMTPGEYRRKFR
>P13036 ~~~fecA~~~Fe(3+) dicitrate transport protein FecA~~~COG4772
MTPLRVFRKTTPLVNTIRLSLLPLAGLSFSAFAAQVNIAPGSLDKALNQYAAHSGFTLSVDASLTRGKQSNGLHGDYDVE
SGLQQLLDGSGLQVKPLGNNSWTLEPAPAPKEDALTVVGDWLGDARENDVFEHAGARDVIRREDFAKTGATTMREVLNRI
PGVSAPENNGTGSHDLAMNFGIRGLNPRLASRSTVLMDGIPVPFAPYGQPQLSLAPVSLGNMDAIDVVRGGGAVRYGPQS
VGGVVNFVTRAIPQDFGIEAGVEGQLSPTSSQNNPKETHNLMVGGTADNGFGTALLYSGTRGSDWREHSATRIDDLMLKS
KYAPDEVHTFNSLLQYYDGEADMPGGLSRADYDADRWQSTRPYDRFWGRRKLASLGYQFQPDSQHKFNIQGFYTQTLRSG
YLEQGKRITLSPRNYWVRGIEPRYSQIFMIGPSAHEVGVGYRYLNESTHEMRYYTATSSGQLPSGSSPYDRDTRSGTEAH
AWYLDDKIDIGNWTITPGMRFEHIESYQNNAITGTHEEVSYNAPLPALNVLYHLTDSWNLYANTEGSFGTVQYSQIGKAV
QSGNVEPEKARTWELGTRYDDGALTAEMGLFLINFNNQYDSNQTNDTVTARGKTRHTGLETQARYDLGTLTPTLDNVSIY
ASYAYVNAEIREKGDTYGNLVPFSPKHKGTLGVDYKPGNWTFNLNSDFQSSQFADNANTVKESADGSTGRIPGFMLWGAR
VAYDFGPQMADLNLAFGVKNIFDQDYFIRSYDDNNKGIYAGQPRTLYMQGSLKF
>P15028 ~~~fecB~~~Fe(3+) dicitrate-binding periplasmic protein~~~COG4594
MLAFIRFLFAGLLLVISHAFAATVQDEHGTFTLEKTPQRIVVLELSFADALAAVDVIPIGIADDNDAKRILPEVRAHLKP
WQSVGTRAQPSLEAIAALKPDLIIADSSRHAGVYIALQQIAPVLLLKSRNETYAENLQSAAIIGEMVGKKREMQARLEQH
KERMAQWASQLPKGTRVAFGTSREQQFNLHTQETWTGSVLASLGLNVPAAMAGASMPSIGLEQLLAVNPAWLLVAHYREE
SIVKRWQQDPLWQMLTAAQKQQVASVDSNTWARMRGIFAAERIAADTVKIFHHQPLTVVK
>P15030 ~~~fecC~~~Fe(3+) dicitrate transport system permease protein FecC~~~COG0609
MTAIKHPVLLWGLPVAALIIIFWLSLFCYSAIPVSGADATRALLPGHTPTLPEALVQNLRLPRSLVAVLIGASLALAGTL
LQTLTHNPMASPSLLGINSGAALAMALTSALSPTPIAGYSLSFIAACGGGVSWLLVMTAGGGFRHTHDRNKLILAGIALS
AFCMGLTRITLLLAEDHAYGIFYWLAGGVSHARWQDVWQLLPVVVTAVPVVLLLANQLNLLNLSDSTAHTLGVNLTRLRL
VINMLVLLLVGACVSVAGPVAFIGLLVPHLARFWAGFDQRNVLPVSMLLGATLMLLADVLARALAFPGDLPAGAVLALIG
SPCFVWLVRRRG
>P15029 ~~~fecD~~~Fe(3+) dicitrate transport system permease protein FecD~~~COG0609
MKIALVIFITLALAGCALLSLHMGVIPVPWRALLTDWQAGHEHYYVLMEYRLPRLLLALFVGAALAVAGVLIQGIVRNPL
ASPDILGVNHAASLASVGALLLMPSLPVMVLPLLAFAGGMAGLILLKMLAKTHQPMKLALTGVALSACWASLTDYLMLSR
PQDVNNALLWLTGSLWGRDWSFVKIAIPLMILFLPLSLSFCRDLDLLALGDARATTLGVSVPHTRFWALLLAVAMTSTGV
AACGPISFIGLVVPHMMRSITGGRHRRLLPVSALTGALLLVVADLLARIIHPPLELPVGVLTAIIGAPWFVWLLVRMR
>P15031 ~~~fecE~~~Fe(3+) dicitrate transport ATP-binding protein FecE~~~COG1120
MTLRTENLTVSYGTDKVLNDVSLSLPTGKITALIGPNGCGKSTLLNCFSRLLMPQSGTVFLGDNPINMLSSRQLARRLSL
LPQHHLTPEGITVQELVSYGRNPWLSLWGRLSAEDNARVNVAMNQTRINHLAVRRLTELSGGQRQRAFLAMVLAQNTPVV
LLDEPTTYLDINHQVDLMRLMGELRTQGKTVVAVLHDLNQASRYCDQLVVMANGHVMAQGTPEEVMTPGLLRTVFSVEAE
IHPEPVSGRPMCLMR
>P23484 ~~~fecI~~~Ferric citrate uptake sigma factor FecI~~~COG1595
MSDRATTTASLTFESLYGTHHGWLKSWLTRKLQSAFDADDIAQDTFLRVMVSETLSTIRDPRSFLCTIAKRVMVDLFRRN
ALEKAYLEMLALMPEGGAPSPEERESQLETLQLLDSMLDGLNGKTREAFLLSQLDGLTYSEIAHKLGVSISSVKKYVAKA
VEHCLLFRLEYGL
>P23485 ~~~fecR~~~Ferric citrate uptake sigma factor regulator FecR~~~COG3712
MNPLLTDSRRQALRSASHWYAVLSGERVSPQQEARWQQWYEQDQDNQWAWQQVENLRNQLGGVPGDVASRALHDTRLTRR
HVMKGLLLLLGAGGGWQLWQSETGEGLRADYRTAKGTVSRQQLEDGSLLTLNTQSAADVRFDAHQRTVRLWYGEIAITTA
KDALQRPFRVLTRQGQLTALGTEFTVRQQDNFTQLDVQQHAVEVLLASAPAQKRIVNAGESLQFSASEFGAVKPLDDEST
SWTKDILSFSDKPLGEVIATLTRYRNGVLRCDPAVAGLRLSGTFPLKNTDAILNVIAQTLPVKIQSITRYWINISPL
>Q2FYR2 2.3.2.17~~~femA~~~Aminoacyltransferase FemA~~~COG2348
MKFTNLTAKEFGAFTDSMPYSHFTQTVGHYELKLAEGYETHLVGIKNNNNEVIAACLLTAVPVMKVFKYFYSNRGPVIDY
ENQELVHFFFNELSKYVKKHRCLYLHIDPYLPYQYLNHDGEITGNAGNDWFFDKMSNLGFEHTGFHKGFDPVLQIRYHSV
LDLKDKTADDIIKNMDGLRKRNTKKVKKNGVKVRFLSEEELPIFRSFMEDTSESKAFADRDDKFYYNRLKYYKDRVLVPL
AYINFDEYIKELNEERDILNKDLNKALKDIEKRPENKKAHNKRDNLQQQLDANEQKIEEGKRLQEEHGNELPISAGFFFI
NPFEVVYYAGGTSNAFRHFAGSYAVQWEMINYALNHGIDRYNFYGVSGKFTEDAEDAGVVKFKKGYNAEIIEYVGDFIKP
INKPVYAAYTALKKVKDRIF
>Q7A5R3 2.3.2.17~~~femA~~~Aminoacyltransferase FemA~~~
MKFTNLTAKEFGAFTDSMPYSHFTQTVGHYELKLAEGYETHLVGIKNNNNEVIAACLLTAVPVMKVFKYFYSNRGPVIDY
ENQELVHFFFNELSKYVKKHRCLYLHIDPYLPYQYLNHDGEITGNAGNDWFFDKMSNLGFEHTGFHKGFDPVLQIRYHSV
LDLKDKTADDIIKNMDGLRKRNTKKVKKNGVKVRYLSEEELPIFRSFMEDTSESKAFADRDDKFYYNRLKYYKDRVLVPL
AYINFDEYIKELNEERDILNKDLNKALKDIEKRPENKKAHNKRDNLQQQLDANEQKIEEGKRLQEEHGNELPISAGFFFI
NPFEVVYYAGGTSNAFRHFAGSYAVQWEMINYALNHGIDRYNFYGVSGKFTEDAEDAGVVKFKKGYNAEIIEYVGDFIKP
INKPVYAAYTALKKVKDRIF
>P0A0A5 2.3.2.17~~~femA~~~Aminoacyltransferase FemA~~~
MKFTNLTAKEFGAFTDSMPYSHFTQTVGHYELKLAEGYETHLVGIKNNNNEVIAACLLTAVPVMKVFKYFYSNRGPVIDY
ENQELVHFFFNELSKYVKKHRCLYLHIDPYLPYQYLNHDGEITGNAGNDWFFDKMSNLGFEHTGFHKGFDPVLQIRYHSV
LDLKDKTADDIIKNMDGLRKRNTKKVKKNGVKVRFLSEEELPIFRSFMEDTSESKAFADRDDKFYYNRLKYYKDRVLVPL
AYINFDEYIKELNEERDILNKDLNKALKDIEKRPENKKAHNKRDNLQQQLDANEQKIEEGKRLQEEHGNELPISAGFFFI
NPFEVVYYAGGTSNAFRHFAGSYAVQWEMINYALNHGIDRYNFYGVSGKFTEDAEDAGVVKFKKGYNAEIIEYVGDFIKP
INKPVYAAYTALKKVKDRIF
>Q2FYR1 2.3.2.18~~~femB~~~Aminoacyltransferase FemB~~~COG2348
MKFTELTVTEFDNFVQNPSLESHYFQVKENIVTRENDGFEVVLLGIKDDNNKVIAASLFSKIPTMGSYVYYSNRGPVMDF
SDLGLVDYYLKELDKYLQQHQCLYVKLDPYWLYHLYDKDIVPFEGREKNDALVNLFKSHGYEHHGFTTEYDTSSQVRWMG
VLNLEGKTPETLKKTFDSQRKRNINKAINYGVKVRFLERDEFNLFLDLYRETEERAGFVSKTDDYFYNFIDTYGDKVLVP
LAYIDLDEYVLKLQQELNDKENRRDQMMAKENKSDKQMKKIAELDKQIDHDQHELLNASELSKTDGPILNLASGVYFANA
YEVNYFSGGSSEKYNQFMGPYMMHWFMINYCFDNGYDRYNFYGLSGDFTENSEDYGVYRFKRGFNVQIEELIGDFYKPIH
KVKYWLFTTLDKLRKKLKK
>P0A0A7 2.3.2.18~~~femB~~~Aminoacyltransferase FemB~~~
MKFTELTVTEFDNFVQNPSLESHYFQVKENIVTRENDGFEVVLLGIKDDNNKVIAASLFSKIPTMGSYVYYSNRGPVMDF
SDLGLVDYYLKELDKYLQQHQCLYVKLDPYWLYHLYDKDIVPFEGREKNDALVNLFKSHGYEHHGFTTEYDTSSQVRWMG
VLNLEGKTPETLKKTFDSQRKRNINKAINYGVKVRFLERDEFNLFLDLYRETEERAGFVSKTDDYFYNFIDTYGDKVLVP
LAYIDLDEYVLKLQQELNDKENRRDQMMAKENKSDKQMKKIAELDKQIDHDQHELLNASELSKTDGPILNLASGVYFANA
YEVNYFSGGSSEKYNQFMGPYMMHWFMINYCFDNGYDRYNFYGLSGDFTENSEDYGVYRFKRGFNVQIEELIGDFYKPIH
KVKYWLFTTLDKLRKKLKK
>Q2FVZ4 2.3.2.16~~~femX~~~Lipid II:glycine glycyltransferase~~~COG2348
MEKMHITNQEHDAFVKSHPNGDLLQLTKWAETKKLTGWYARRIAVGRDGEVQGVAQLLFKKVPKLPYTLCYISRGFVVDY
SNKEALNALLDSAKEIAKAEKAYAIKIDPDVEVDKGTDALQNLKALGFKHKGFKEGLSKDYIQPRMTMITPIDKNDDELL
NSFERRNRSKVRLALKRGTTVERSDREGLKTFAELMKITGERDGFLTRDISYFENIYDALHEDGDAELFLVKLDPKENIA
KVNQELNELHAEIAKWQQKMKTSEKQAKKAQNMINDAQNKIAKNEDLKRDLEALEKEHPEGIYLSGALLMFAGSKSYYLY
GASSNEFRDFLPNHHMQYTMMKYAREHGATTYDFGGTDNDPDKDSEHYGLWAFKKVWGTYLSEKIGEFDYVLNQPLYQLI
EQVKPRLTKAKIKISRKLKRK
>Q7A447 2.3.2.16~~~femX~~~Lipid II:glycine glycyltransferase~~~
MEKMHITNQEHDAFVKSHPNGDLLQLTKWAETKKLTGWYARRIAVGRDGEVQGVAQLLFKKVPKLPYTLCYISRGFVVDY
SNKEALNALLDSAKEIAKAEKAYAIKIDPDVEVDKGTDALQNLKALGFKHKGFKEGLSKDYIQPRMTMITPIDKNDDELL
NSFERRNRSKVRLALKRGTTVERSDREGLKTFAELMKITGERDGFLTRDISYFENIYDALHEDGDAELFLVKLDPKENIA
KVNQELNELHAEIAKWQQKMETSEKQAKKAQNMINDAQNKIAKNEDLKRDLEALEKEHPEGIYLSGALLMFAGSKSYYLY
GASSNEFRDFLPNHHMQYTMMKYAREHGATTYDFGGTDNDPDKDSEHYGLWAFKKVWGTYLSEKIGEFDYILNQPLYQLI
EQVKPRLTKAKIKISRKLKRK
>Q9EY50 2.3.2.10~~~femX~~~UDP-N-acetylmuramoylpentapeptide-lysine N(6)-alanyltransferase~~~
MPVLNLNDPQAVERYEEFMRQSPYGQVTQDLGWAKVKNNWEPVDVYLEDDQGAIIAAMSMLLGDTPTDKKFAYASKGPVM
DVTDVDLLDRLVDEAVKALDGRAYVLRFDPEVAYSDEFNTTLQDHGYVTRNRNVADAGMHATIQPRLNMVLDLTKFPDAK
TTLDLYPSKTKSKIKRPFRDGVEVHSGNSATELDEFFKTYTTMAERHGITHRPIEYFQRMQAAFDADTMRIFVAEREGKL
LSTGIALKYGRKIWYMYAGSMDGNTYYAPYAVQSEMIQWALDTNTDLYDLGGIESESTDDSLYVFKHVFVKDAPREYIGE
IDKVLDPEVYAELVKD
>Q9R9J2 2.3.1.39~~~fenF~~~Malonyl CoA-acyl carrier protein transacylase~~~
MNNLAFLFPGQGSQFVGMGKSFWNDFVLAKRLFEEASDAISMDVKKLCFDGDMTELTRTMNAQPAILTVSVIAYQVYMQE
IGIKPHFLAGHSLGEYSALVCAGVLSFQEAVKLIRQRGILMQNADPEQLGTMAAITQVYIQPLQDLCTEISTEDFPVGVA
CMNSDQQHVISGHRQAVEFVIKKAERMGANHTYLNVSAPFHSSMMRSASEQFQTALNQYSFRDAEWPIISNVTAIPYNNG
HSVREHLQTHMTMPVRWAESMHYLLLHGVTEVIEMGPKNVLVGLLKKITNHIAAYPLGQTSDLHLLSDSAERNENIVNLR
KKQLNKMMIQSIIARNYNKDAKTYSNLTTPLFPQIQLLKERVERKEVELSAEELEHSIHLCQLICEAKQLPTWEQLRILK
>O31475 1.18.1.2~~~ycgT~~~Ferredoxin--NADP reductase 1~~~COG0492
MAENQEVYDVTIIGGGPIGLFTAFYCGMRELKTKVIEFLPKLGGKVSLFFPEKIIRDIGGIPGIAGKQLIEQLKEQAATF
DPDIVLNQRVTGFERLDDGTIVLTGSEGKKHYTRTVILACGMGTLEVNEFDSEDAARYAGKNLHYGVEKLDAFKGKRVVI
SGGGDTAVDWANELEPIAASVTVVHRREEFGGMESSVTKMKQSSVRVLTPYRLEQLNGDEEGIKSVTVCHTESGQRKDIE
IDELIINHGFKIDLGPMMEWGLEIEEGRVKADRHMRTNLPGVFVAGDAAFYESKLRLIAGGFTEGPTAVNSAKAYLDPKA
ENMAMYSTHHKKLVHK
>O05268 1.18.1.2~~~yumC~~~Ferredoxin--NADP reductase 2~~~COG0492
MREDTKVYDITIIGGGPVGLFTAFYGGMRQASVKIIESLPQLGGQLSALYPEKYIYDVAGFPKIRAQELINNLKEQMAKF
DQTICLEQAVESVEKQADGVFKLVTNEETHYSKTVIITAGNGAFKPRKLELENAEQYEGKNLHYFVDDLQKFAGRRVAIL
GGGDSAVDWALMLEPIAKEVSIIHRRDKFRAHEHSVENLHASKVNVLTPFVPAELIGEDKIEQLVLEEVKGDRKEILEID
DLIVNYGFVSSLGPIKNWGLDIEKNSIVVKSTMETNIEGFFAAGDICTYEGKVNLIASGFGEAPTAVNNAKAYMDPKARV
QPLHSTSLFENK
>Q44532 1.18.1.2~~~fpr~~~Ferredoxin--NADP reductase~~~
MSNLNVERVLSVHHWNDTLFSFKTTRNPSLRFENGQFVMIGLEVDGRPLMRAYSIASPNYEEHLEFFSIKVQNGPLTSRL
QHLKEGDELMVSRKPTGTLVTSDLLPGKHLYMLSTGTGLAPFMSLIQDPEVYERFEKVVLIHGVRQVNELAYQQFITEHL
PQSEYFGEAVKEKLIYYPTVTRESFHNQGRLTDLMRSGKLFEDIGLPPINPQDDRAMICGSPSMLDESCEVLDGFGLKIS
PRMGEPGDYLIERAFVEK
>Q816D9 1.18.1.2~~~~~~Ferredoxin--NADP reductase~~~
MKVAENQKVYDITIIGGGPTGLFTAFYGGMRQASVKIIESLPQLGGQLSALYPEKYIYDVAGFPKVRAQELVDNLKEQMK
KFDPTVCLEEAVDTLEKQADGIFKLVTNKQTHYSKSVIITAGNGAFQPRRLELEGTAKYEKKNLHYFVDDMNKFAGKRVV
VFGGGDSAVDWTMMLEPIAEKVTIVHRRDKFRAHEHSVENLMNSRAEVSTPYVPVELIGDDKIEQVVLQHVKTEEKVIID
VDDVIVNYGFVSSLGPIKNWGLDIQKNSILVNSKMETNIPGIYAAGDICTYEGKVKLIACGFGEAPTAVNNAKAYFDPNA
KLQPMHSSSMF
>Q8KCB2 1.18.1.2~~~~~~Ferredoxin--NADP reductase~~~COG0492
MLDIHNPATDHHDMRDLTIIGGGPTGIFAAFQCGMNNISCRIIESMPQLGGQLAALYPEKHIYDVAGFPEVPAIDLVESL
WAQAERYNPDVVLNETVTKYTKLDDGTFETRTNTGNVYRSRAVLIAAGLGAFEPRKLPQLGNIDHLTGSSVYYAVKSVED
FKGKRVVIVGGGDSALDWTVGLIKNAASVTLVHRGHEFQGHGKTAHEVERARANGTIDVYLETEVASIEESNGVLTRVHL
RSSDGSKWTVEADRLLILIGFKSNLGPLARWDLELYENALVVDSHMKTSVDGLYAAGDIAYYPGKLKIIQTGLSEATMAV
RHSLSYIKPGEKIRNVFSSVKMAKEKKAAEAGNATENKAE
>P28861 1.18.1.2~~~fpr~~~Flavodoxin/ferredoxin--NADP reductase~~~COG1018
MADWVTGKVTKVQNWTDALFSLTVHAPVLPFTAGQFTKLGLEIDGERVQRAYSYVNSPDNPDLEFYLVTVPDGKLSPRLA
ALKPGDEVQVVSEAAGFFVLDEVPHCETLWMLATGTAIGPYLSILQLGKDLDRFKNLVLVHAARYAADLSYLPLMQELEK
RYEGKLRIQTVVSRETAAGSLTGRIPALIESGELESTIGLPMNKETSHVMLCGNPQMVRDTQQLLKETRQMTKHLRRRPG
HMTAEHYW
>P21890 1.18.1.2~~~petH~~~Ferredoxin--NADP reductase~~~
MSNQGAFDGAANVESGSRVFVYEVVGMRQNEETDQTNYPIRKSGSVFIRVPYNRMNQEMQRITRLGGKIVTIQTVSALQQ
LNGRTTIATVTDASSEIAKSEGNGKATPVKTDSGAKAFAKPPAEEQLKKKDNKGNTMTQAKAKHADVPVNLYRPNAPFIG
KVISNEPLVKEGGIGIVQHIKFDLTGGNLKYIEGQSIGIIPPGVDKNGKPEKLRLYSIASTRHGDDVDDKTISLCVRQLE
YKHPESGETVYGVCSTYLTHIEPGSEVKITGPVGKEMLLPDDPEANVIMLATGTGIAPMRTYLWRMFKDAERAANPEYQF
KGFSWLVFGVPTTPNILYKEELEEIQQKYPDNFRLTYAISREQKNPQGGRMYIQDRVAEHADELWQLIKNQKTHTYICGL
RGMEEGIDAALSAAAAKEGVTWSDYQKDLKKAGRWHVETY
>Q9L6V3 1.18.1.2~~~fpr~~~Flavodoxin/ferredoxin--NADP reductase~~~
MNETTPIAPAKVLPDAQTVTSVRHWTDTLFSFRVTRPQTLRFRSGEFVMIGLLDDNGKPIMRAYSIASPAWDEELEFYSI
KVPDGPLTSRLQHIKVGEQIILRPKPVGTLVIDALLPGKRLWFLATGTGIAPFASLMREPEAYEKFDEVIMMHACRTVAE
LEYGRQLVEALQEDPLIGELVEGKLKYYPTTTREEFHHMGRITDNLASGKVFEDLGIAPMNPETDRAMVCGSLAFNVDVM
KVLESYGLREGANSEPREFVVEKAFVGEGI
>Q6N2U4 1.18.1.2~~~~~~Ferredoxin--NADP reductase~~~COG0492
MTETIKTDVLIVGAGPCGLFAVFELGLLDVKAHLVDILDKVGGQCAELYPEKPIYDIPGIPMVTGHGLTEALMEQIKPFN
PTFHLSEMVENVEKIGDPGFRVTTNAGKVFECTVLVVAAGGGSFLPKRPPVPGVEAYEGTSVHYAVRKMEDFRGKDILIV
GGGDSALDWTLNLNPIAKSMTLVHRRDDFRGAPHSVEQMRQLVASGKLDLKIGQITELQGDNGQLTGATVKLNDNTTSQI
KCDAMLPFFGLTMKLGPVANWGLDLENNLIPVDTGTFETNVPGIFAIGDINTYPGKLKLILSGFHEGALMAQKAVKYVYP
DKRVVFQYTTSSTNLQKKLGVN
>P00454 1.18.1.2~~~petH~~~Ferredoxin--NADP reductase~~~
AKTDIPVNIYKPKNPYIGKCLSNEELVREGGTGTVRHLIFDISGGDLRYLEGQSIGIIPPGTDNNGKPHKLRLYSIASTR
HGDHVDDKTVSLCVRQLEYKHPETGETVYGVCSTYLCNLEAGADVAITGPVGKEMLLPEDEDATIIMMATGTGIAPFRAF
LWRIFKEQHEDYKFKGLAWLFFGIPYSPNILYQQELEELQEEFPENFRLTLAISREQQNPEGGKMYIQDRIKENADQLWE
LIQKPNTHTYICGLKGMEGGIDEGMSAAAGKFDVDWSDYQKELKKKHRWHVETY
>Q2FEC4 1.18.1.2~~~~~~Ferredoxin--NADP reductase~~~
MKDVTIIGGGPSGLYASFYAGLRDMSVRLIDVQSELGGKMRIYPEKIIWDIGGIAPKPCHEILKDTIKQGLYFKPEVHLN
ERVVDIRKKAERHFEVETEAGEIYTSKAVIIAIGAGIINPKQLDVKGVERYQLTNLHYVVQSYRRFKDKDVLISGGGNTA
LDWAHDIAKIAKSVTVVYRKEDVSGHEAMKTLVTDLNVKLCPKTRIKYLVGNDDETHISEVVLEHVESGDRHTVKFDDVI
ISHGFDRCNTLLSETSSKLDMHDDCRVKGFGNTTTSIPGIYACGDIVYHDAKSHLIASAFSDGANAANLAKTYIQPDANA
EGYVSSHHEVFKEANKTIVNKHLY
>Q2FVP8 1.18.1.2~~~~~~Ferredoxin--NADP reductase~~~COG0492
MKDVTIIGGGPSGLYASFYAGLRDMSVRLIDVQSELGGKMRIYPEKIIWDIGGIAPKPCHEILKDTIKQGLYFKPEVHLN
ERVVDIRKKAERHFEVETEAGEIYTSKAVIIAIGAGIINPKQLDVKGVERYQLTNLHYVVQSYRRFKDKDVLISGGGNTA
LDWAHDIAKIAKSVTVVYRKEDVSGHEAMKTLVTDLNVKLCPKTRIKYLVGNDDETHISEVVLEHVESGDRHTVKFDDVI
ISHGFDRCNTLLSETSSKLDMHDDCRVKGFGNTTTSIPGIYACGDIVYHDAKSHLIASAFSDGANAANLAKTYIQPDANA
EGYVSSHHEVFKEANKTIVNKHLY
>Q7A3W1 1.18.1.2~~~~~~Ferredoxin--NADP reductase~~~
MLQSIGGGPSGLYYASFYAGLRDMSVRLIDVQSELGGKMRIYPEKIIWDIGGIAPKPCHEILKDTIKQGLYFKPEVHLNE
RVVDIRKKAERHFEVETEAGEIYTSKAVIIAIGAGIINPKQLDVKGVERYQLTNLHYVVQSYRRFKDKDVLISGGGNTAL
DWAHDIAKIAKSVTVVYRKEDVSGHEAMKTLVTDLNVKLCPKTRIKYLVGNDDETHISEVVLEHVESGDRHTVKFDDVII
SHGFDRCNTLLSETSSKLDMHDDCRVKGFGNTTTSIPGIYACGDIVYHDAKSHLIASAFSDGANAANLAKTYIQPDANAE
GYVSSHHEVFKEANKTIVNKHLY
>P31973 1.18.1.2~~~petH~~~Ferredoxin--NADP reductase~~~COG0369
MYGITSTANSTGNQSYANRLFIYEVVGLGGDGRNENSLVRKSGTTFITVPYARMNQEMQRITKLGGKIVSIRPAEDAAQI
VSEGQSSAQASAQSPMASSTKIVHPKTTDTSVPVNIYRPKTPFLGKCIENYELVDEGGSGTVRHVTFDISEGDLRYLEGQ
SIGIIPPGEDKNGKPHKLRLYSIASTRHGDMEDNKTVSLCVRQLEYQDPESGETVYGVCSTYLCNLPVGTDDVKITGPVG
KEMLLPDDEDATVVMLATGTGIAPFRAFLWRMFKEQHEDYKFKGKAWLIFGVPYTANILYKDDFEKMAAENPDNFRLTYA
ISREQKTADGGKVYVQSRVSEYADELFEMIQKPNTHVYMCGLKGMQPPIDETFTAEAEKRGLNWEEMRRSMKKEHRWHVE
VY
>Q55318 1.18.1.2~~~petH~~~Ferredoxin--NADP reductase~~~COG0369
MYSPGYVATSSRQSDAGNRLFVYEVIGLSQSTMTDGLDYPIRRSGSTFITVPLKRMNQEMRRITRMGGKIVSIKPLEGDS
PLPHTEGIAKPSQSEGSGSEAVANPAPESNKTMTTTPKEKKADDIPVNIYRPKTPYIGKVLENYPLVREGAIGTVQHLTF
DLSAGDLRYLEGQSIGIIPPGEDDKGKPHKLRLYSIASTRHGDFGDDKTVSLCVRQLEYQNEAGETVQGVCSTYLCNIKE
GDDIAITGPVGKEMLLPPDEDANIVMLATGTGIAPFRAFLWRMFKEQHEDYKFKGLAWLIFGIPKSENILYKDDLEKMAA
EFPDNFRLTYAISREQQNAEGGRMYIQHRVAENAEELWNLMQNPKTHTYMCGLKGMEPGIDEAFTALAEQNGKEWTTFQR
EMKKEHRWHVETY
>Q5SL28 1.18.1.2~~~~~~Ferredoxin--NADP reductase~~~COG0492
MAADHTDVLIVGAGPTGLFAGFYVGMRGLSFRFVDPLPEPGGQLTALYPEKYIYDVAGFPKVYAKDLVKGLVEQVAPFNP
VYSLGERAETLEREGDLFKVTTSQGNAYTAKAVIIAAGVGAFEPRRIGAPGEREFEGRGVYYAVKSKAEFQGKRVLIVGG
GDSAVDWALNLLDTARRITLIHRRPQFRAHEASVKELMKAHEEGRLEVLTPYELRRVEGDERVRWAVVFHNQTQEELALE
VDAVLILAGYITKLGPLANWGLALEKNKIKVDTTMATSIPGVYACGDIVTYPGKLPLIVLGFGEAAIAANHAAAYANPAL
KVNPGHSSEKAAPGT
>Q9PMR0 ~~~feoA~~~Putative Fe(2+) transport protein A~~~COG1918
MTLNELKDGQKAIIVNLNAHKELKNRLLSFGFIKNKNLKKIHSSLKNATIMVELDTSCVILRSDEAKTIEVNLI
>P0AEL3 ~~~feoA~~~Fe(2+) transport protein A~~~COG1918
MQYTPDTAWKITGFSREISPAYRQKLLSLGMLPGSSFNVVRVAPLGDPIHIETRRVSLVLRKKDLALLEVEAVSC
>Q9PMQ9 ~~~feoB~~~Fe(2+) transporter FeoB~~~COG0370
MKKIKIALVGQPNVGKSLLINALCKANMKVGNFSGVTIEKASAKTFYKNYEFEVIDLPGTYSLDGYSEEEKITRHFLNQN
DYDVIVNVLDATNLERNLILSAELLSLNKKMLLALNMCDEAKKEGIELDTSILSQEFQSQVVEISAKTKENLELLLQKII
ILFESKFIPRSQFYTPLCEKSPEKEDLLYFINELSKKIITHKKEERNLTKKIDALLIHKFFGLPIFLFLMWLLFQLTFSL
GQIPMDYIESGFNTLGEFVKNNISNTFIASALADGIIAGVGAVILFLPNIMILFLGIALLETTGYMSRVAFLLDGILHKF
GLHGKSFIPLITGFGCSVPAFMATRTLKNKRDRLLTLFVINFMSCGARLPVYVLFIGAFFPSEKAGNYLFGIYILGAILG
LCAAKFLRMTAFRGLDEPFVMEMPKYRMPNWHLVWFMVYNKAKMYLKKAGTFILLASLLIWFASNFPKSEENLNDFNAQE
RAIEQSYLGQFGKGIEPIFQPLELDWKLSVSLISGLAAKEVMISTMGVLYSLGKDVDETNNDLKGIIAKNIPIPSAVAFI
LFVMIYNPCFAATIVFSKESGKLKYTLFLFLFTCTSAYIVAFIGLHIAKILLN
>P33650 ~~~feoB~~~Fe(2+) transporter FeoB~~~COG0370
MKKLTIGLIGNPNSGKTTLFNQLTGSRQRVGNWAGVTVERKEGQFSTTDHQVTLVDLPGTYSLTTISSQTSLDEQIACHY
ILSGDADLLINVVDASNLERNLYLTLQLLELGIPCIVALNMLDIAEKQNIRIEIDALSARLGCPVIPLVSTRGRGIEALK
LAIDRYKANENVELVHYAQPLLNEADSLAKVMPSDIPLKQRRWLGLQMLEGDIYSRAYAGEASQHLDAALARLRNEMDDP
ALHIADARYQCIAAICDVVSNTLTAEPSRFTTAVDKIVLNRFLGLPIFLFVMYLMFLLAINIGGALQPLFDVGSVALFVH
GIQWIGYTLHFPDWLTIFLAQGLGGGINTVLPLVPQIGMMYLFLSFLEDSGYMARAAFVMDRLMQALGLPGKSFVPLIVG
FGCNVPSVMGARTLDAPRERLMTIMMAPFMSCGARLAIFAVFAAAFFGQNGALAVFSLYMLGIVMAVLTGLMLKYTIMRG
EATPFVMELPVYHVPHVKSLIIQTWQRLKGFVLRAGKVIIIVSIFLSAFNSFSLSGKIVDNINDSALASVSRVITPVFKP
IGVHEDNWQATVGLFTGAMAKEVVVGTLNTLYTAENIQDEEFNPAEFNLGEELFSAIDETWQSLKDTFSLSVLMNPIEAS
KGDGEMGTGAMGVMDQKFGSAAAAYSYLIFVLLYVPCISVMGAIARESSRGWMGFSILWGLNIAYSLATLFYQVASYSQH
PTYSLVCILAVILFNIVVIGLLRRARSRVDIELLATRKSVSSCCAASTTGDCH
>Q8GNS3 ~~~feoB~~~Fe(2+) transporter FeoB~~~COG0370
MTHALLIGNPNCGKTTLFNALTNANQRVGNWPGVTVEKKTGEFLLGEHLIEITDLPGVYSLVANAEGISQDEQIAAQSVI
DLEYDCIINVIDACHLERHLYLTSQLFELGKPVVVALNMMDIAEHRGISIDTEKLESLLGCSVIPIQAHKNIGIPALQQS
LLHCSQKIKPLKLSLSVAAQQILNDLENQLISKGYKNSFAYYFSRRLAEGDTLIGEKAFTESLLIKLQETEQNLDVLLAD
ARYQKIHEIVTLVQKKHSDASEHFTAKLDKLVLHRFLALPIFFAMMYLMFLFAINIGGAFQDFFDISTETIFVQGSGWLL
QQLHAPNWVIALVANGVGKGINTTITFIPVIAAMFFFLSLLETSGYMARAAFVVDKAMRAMGLPGKSFVPMIVGFGCNVP
AIMAARTLDSERDRLLTVMMSPFMSCSARLAIYAVFVAAFFPSGGHNVVFSLYLIGILMAVFTGYILRKTTLKGHASPLI
LELPAYHRPSLRRLLRETSLRLRFFVYRAGKLIIPICVILGGLNAITWGGGISSGEANTDSLLSIIGQWITPLFAPMGIH
QDNWPATVGLLTGMLAKEVVVGTLNSLYAQVGHVGEITAAHFDFWGGIKAAFGSIPANLSELGSALWNPVSASAADSELS
QSVYGIMSRRFDGAVGAYAYLLFVLLYIPCVSTMAVIRQEANKRFMWTSIVWSFVVAYATSVVFYQGAKFLDHPQQSMVW
ILAMSLSLLFVLAVFRYSQYGMGRQNAAANT
>P64638 ~~~feoC~~~Probable [Fe-S]-dependent transcriptional repressor FeoC~~~
MASLIQVRDLLALRGRMEAAQISQTLNTPQPMINAMLQQLESMGKAVRIQEEPDGCLSGSCKSCPEGKACLREWWALR
>B5XTS6 ~~~feoC~~~Probable [Fe-S]-dependent transcriptional repressor~~~
MASLMEVRDMLALQGRMEAKQLSARLRTPQPLIDAMLERMEAMGKVVRISETSEGCLSGSCKSCPEGKAACQQEWWALR
>A6TF33 ~~~feoC~~~Probable [Fe-S]-dependent transcriptional repressor~~~
MASLMEVRDMLALQGRMEAKQLSARLQTPQPLIDAMLERMEAMGKVVRISETSEGCLSGSCKSCPEGKAACRQEWWALR
>P05825 ~~~fepA~~~Ferrienterobactin receptor~~~COG4771
MNKKIHSLALLVNLGIYGVAQAQEPTDTPVSHDDTIVVTAAEQNLQAPGVSTITADEIRKNPVARDVSKIIRTMPGVNLT
GNSTSGQRGNNRQIDIRGMGPENTLILIDGKPVSSRNSVRQGWRGERDTRGDTSWVPPEMIERIEVLRGPAAARYGNGAA
GGVVNIITKKGSGEWHGSWDAYFNAPEHKEEGATKRTNFSLTGPLGDEFSFRLYGNLDKTQADAWDINQGHQSARAGTYA
TTLPAGREGVINKDINGVVRWDFAPLQSLELEAGYSRQGNLYAGDTQNTNSDSYTRSKYGDETNRLYRQNYALTWNGGWD
NGVTTSNWVQYEHTRNSRIPEGLAGGTEGKFNEKATQDFVDIDLDDVMLHSEVNLPIDFLVNQTLTLGTEWNQQRMKDLS
SNTQALTGTNTGGAIDGVSTTDRSPYSKAEIFSLFAENNMELTDSTIVTPGLRFDHHSIVGNNWSPALNISQGLGDDFTL
KMGIARAYKAPSLYQTNPNYILYSKGQGCYASAGGCYLQGNDDLKAETSINKEIGLEFKRDGWLAGVTWFRNDYRNKIEA
GYVAVGQNAVGTDLYQWDNVPKAVVEGLEGSLNVPVSETVMWTNNITYMLKSENKTTGDRLSIIPEYTLNSTLSWQARED
LSMQTTFTWYGKQQPKKYNYKGQPAVGPETKEISPYSIVGLSATWDVTKNVSLTGGVDNLFDKRLWRAGNAQTTGDLAGA
NYIAGAGAYTYNEPGRTWYMSVNTHF
>P0AEL6 ~~~fepB~~~Ferric enterobactin-binding periplasmic protein FepB~~~COG4592
MRLAPLYRNALLLTGLLLSGIAAVQAADWPRQITDSRGTHTLESQPQRIVSTSVTLTGSLLAIDAPVIASGATTPNNRVA
DDQGFLRQWSKVAKERKLQRLYIGEPSAEAVAAQMPDLILISATGGDSALALYDQLSTIAPTLIINYDDKSWQSLLTQLG
EITGHEKQAAERIAQFDKQLAAAKEQIKLPPQPVTAIVYTAAAHSANLWTPESAQGQMLEQLGFTLAKLPAGLNASQSQG
KRHDIIQLGGENLAAGLNGESLFLFAGDQKDADAIYANPLLAHLPAVQNKQVYALGTETFRLDYYSAMQVLDRLKALF
>P23878 7.2.2.17~~~fepC~~~Ferric enterobactin transport ATP-binding protein FepC~~~COG1120
MTESVARLRGEQLTLGYGKYTVAENLTVEIPDGHFTAIIGPNGCGKSTLLRTLSRLMTPAHGHVWLDGEHIQHYASKEVA
RRIGLLAQNATTPGDITVQELVARGRYPHQPLFTRWRKEDEEAVTKAMQATGITHLADQSVDTLSGGQRQRAWIAMVLAQ
ETAIMLLDEPTTWLDISHQIDLLELLSELNREKGYTLAAVLHDLNQACRYASHLIALREGKIVAQGAPKEIVTAELIERI
YGLRCMIIDDPVAGTPLVVPLGRTAPSTANS
>P23876 ~~~fepD~~~Ferric enterobactin transport system permease protein FepD~~~COG0609
MSGSVAVTRAIAVPGLLLLLIIATALSLLIGAKSLPASVVLEAFSGTCQSADCTIVLDARLPRTLAGLLAGGALGLAGAL
MQTLTRNPLADPGLLGVNAGASFAIVLGAALFGYSSAQEQLAMAFAGALVASLIVAFTGSQGGGQLSPVRLTLAGVALAA
VLEGLTSGIALLNPDVYDQLRFWQAGSLDIRNLHTLKVVLIPVLIAGATALLLSRALNSLSLGSDTATALGSRVARTQLI
GLLAITVLCGSATAIVGPIAFIGLMMPHMARWLVGADHRWSLPVTLLATPALLLFADIIGRVIVPGELRVSVVSAFIGAP
VLIFLVRRKTRGGA
>P26266 ~~~fepE~~~Ferric enterobactin transport protein FepE~~~COG3765
MSSLNIKQGSDAHFPDYPLASPSNNEIDLLNLISVLWRAKKTVMAVVFAFACAGLLISFILPQKWTSAAVVTPPEPVQWQ
ELEKSFTKLRVLDLDIKIDRTEAFNLFIKKFQSVSLLEEYLRSSPYVMDQLKEAKIDELDLHRAIVALSEKMKAVDDNAS
KKKDEPSLYTSWTLSFTAPTSEEAQTVLSGYIDYISTLVVKESLENVRNKLEIKTQFEKEKLAQDRIKTKNQLDANIQRL
NYSLDIANAAGIKKPVYSNGQAVKDDPDFSISLGADGIERKLEIEKAVTDVAELNGELRNRQYLVEQLTKAHVNDVNFTP
FKYQLSPSLPVKKDGPGKAIIVILSALIGGMVACGGVLLRYAMASRKQDAMMADHLV
>P23877 ~~~fepG~~~Ferric enterobactin transport system permease protein FepG~~~COG4779
MIYVSRRLLITCLLLVSACVVAGIWGLRSGAVTLETSQVFAALMGDAPRSMTMVVTEWRLPRVLMALLIGAALGVSGAIF
QSLMRNPLGSPDVMGFNTGAWSGVLVAMVLFGQDLTAIALSAMVGGIVTSLLVWLLAWRNGIDTFRLIIIGIGVRAMLVA
FNTWLLLKASLETALTAGLWNAGSLNGLTWAKTSPSAPIIILMLIAAALLVRRMRLLEMGDDTACALGVSVERSRLLMML
VAVVLTAAATALAGPISFIALVAPHIARRISGTARWGLTQAALCGALLLLAADLCAQQLFMPYQLPVGVVTVSLGGIYLI
VLLIQESRKK
>P00244 ~~~~~~Ferredoxin-1~~~
MATYKVTLIDAEGTTTTIDCPDDTYILDAAEEAGLDLPYSCRAGACSTCAGKLVTGTIDQSDQSFLDDDQVEAGYVLTCV
AYPTSDVTIETHKEEDLY
>O67065 ~~~fdx1~~~Ferredoxin-1~~~COG0633
MKVIINGKEFDIPKGVRFGELSHEIEKAGIEFGCTDGQCGVCVARVIKGMECLNEPSEEEEETLWRVGAVDEDQRLTCQL
VIEKEDCDEIVIESED
>P00214 ~~~fdxA~~~Ferredoxin-1~~~
MAFVVTDNCIKCKYTDCVEVCPVDCFYEGPNFLVIHPDECIDCALCEPECPAQAIFSEDEVPEDMQEFIQLNAELAEVWP
NITEKKDPLPDAEDWDGVKGKLQHLER
>P00204 ~~~~~~Ferredoxin-1~~~
ALYITEECTYCGACEPECPVTAISAGDDIYVIDANTCNECAGLDEQACVAVCPAECIVQG
>P00210 ~~~fd1~~~Ferredoxin-1~~~
MARKFYVDQDECIACESCVEIAPGAFAMDPEIEKAYVKDVEGASQEEVEEAMDTCPVQCIHWEDE
>P00252 ~~~~~~Ferredoxin-1~~~
MATVYKVTLVDQEGTETTIDVPDDEYILDIAEDQGLDLPYSCRAGACSTCAGKIVSGTVDQSDQSFLDDDQIEKGYVLTC
VAYPTSDLKIETHKEEDLY
>P07485 ~~~~~~Ferredoxin-1~~~
TIVIDHEECIGCESCVELCPEVFAMIDGEEKAMVTAPDSTAECAQDAIDACPVEAISKE
>P08813 ~~~~~~Ferredoxin-1~~~COG4231
MGWTVTVDTDKCTGDGECVDVCPVEVYELQDGKAVPVNEEECLGCESCVEVCEAGAITVEEN
>Q51577 ~~~petF1~~~Ferredoxin-1~~~
MPSFKVTLINETEGLNTTIEVPDDEYILDAAEEQGIDLPYSCRAGACSTCAGKITAGTVDQSDQSFLDDDQIQAGYVLTC
VAYPTSDCTILTHQEEDLY
>P0A3C7 ~~~petF~~~Ferredoxin-1~~~COG0633
MATFKVTLINEAEGTKHEIEVPDDEYILDAAEEQGYDLPFSCRAGACSTCAGKLVSGTVDQSDQSFLDDDQIEAGYVLTC
VAYPTSDVVIQTHKEEDLY
>P0A3C8 ~~~petF~~~Ferredoxin-1~~~
MATFKVTLINEAEGTKHEIEVPDDEYILDAAEEQGYDLPFSCRAGACSTCAGKLVSGTVDQSDQSFLDDDQIEAGYVLTC
VAYPTSDVVIQTHKEEDLY
>P0A123 ~~~fdxA~~~Ferredoxin 1~~~COG1146
MTFVVTDNCIKCKYTDCVEVCPVDCFYEGPNFLVIHPDECIDCALCEPECPAQAIFSEDEVPSGMENFIELNAELAEIWP
NITERKDALPDAEEWDGKPGKIADLER
>P00207 ~~~fer1~~~Ferredoxin-1~~~COG1145
AYKIVTSQCTVCGACEFECPNAAISMKRGTYVIDATKCTECEGQFDKPQCVSVCPVDNTCVPA
>P00194 ~~~~~~Ferredoxin-1~~~
AYKIEETCISCGACAAECPVNAIEQGDTIFVVNADTCIDCGNCANVCPVGAPVAE
>P18324 ~~~suaB~~~Ferredoxin-1~~~
MTMRVSADRTVCVGAGLCALTAPGVFDQDDDGIVTVLTAEPAADDDRRTAREAGHLCPSGAVRVVEDTE
>P0A3D3 ~~~petF1~~~Ferredoxin-1~~~COG0633
MATYKVTLVNAAEGLNTTIDVADDTYILDAAEEQGIDLPYSCRAGACSTCAGKVVSGTVDQSDQSFLDDDQIAAGFVLTC
VAYPTSDVTIETHKEEDLY
>P00254 ~~~petF1~~~Ferredoxin-1~~~COG0633
MATFKVTLINEAEGTSNTIDVPDDEYILDAAEEQGYDLPFSCRAGACSTCAGKLVSGTVDQSDQSFLDDDQIEAGYVLTC
VAYPTSDVTIQTHKEEDLY
>P00251 ~~~~~~Ferredoxin-2~~~
MATYKVTLINEEEGINAILEVADDQTILDAGEEAGLDLPSSCRAGSCSTCAGKLVSGAAPNQDDQAFLDDDQLAAGWVMT
CVAYPTGDCTIMTHQESEVL
>O66511 ~~~fdx4~~~Ferredoxin, 2Fe-2S~~~COG3411
MAEFKHVFVCVQDRPPGHPQGSCAQRGSREVFQAFMEKIQTDPQLFMTTVITPTGCMNACMMGPVVVVYPDGVWYGQVKP
EDVDEIVEKHLKGGEPVERLVISKGKPPGMF
>P82802 ~~~~~~Ferredoxin, 2Fe-2s~~~COG3411
MAKPEFHIFICAQNRPAGHPRGSCGAKGAEGVYNAFAQVLIQKNLTNRIALTTTGCLGPCQAGANVLIYPGAVMYSWVEP
ADAAIIVEQHLLGGEPYADKLTPAEIW
>P00206 ~~~~~~Ferredoxin-2~~~
AHRITEECTYCAACEPECPVNAISAGDEIYIVDESVCTDCEGYYDEPACVAVCPVDCIIKV
>P07324 ~~~~~~Ferredoxin, 2Fe-2S~~~
MVNPKHHIFVCTSCRLNGKQQGFCYSKNSVEIVETFMEELDSRDLSSEVMVNNTGCFGICSQGPIVVVYPEGVWYGNVTA
DDVEEIVESHIENGEVVKRLQI
>P00249 ~~~~~~Ferredoxin-2~~~
MATYKVRLFNAAEGLDETIEVPDDEYILDAAEEAGLDLPFSCRSGSCSSCNGILKKGTVDQSDQNFLDDDQIAAGNVLTC
VAYPTSNCEIETHREDAIA
>P00211 ~~~~~~Ferredoxin-2~~~
MGYSVIVDSDKCIGCGECVDVCPVEVYELQNGKAVPVNEEECLGCESCIEVCPQNAIVE
>P10624 ~~~~~~Ferredoxin-2~~~COG1141
MAKYLYLDQDECMACESCVELCPEAFRMSSAGEYAEVIDPNTTAECVEDAISTCPVECIEWREE
>D5AP15 ~~~fdxA~~~Ferredoxin-2~~~COG1146
MTYVVTDNCIACKYTDCVEVCPVDCFYEGENTLVIHPDECIDCGVCEPECPADAIRPDTEPGMEDWVEFNRTYASQWPVI
TIKKDPMPDHKKYDGETGKREKYFSPNPGTGD
>P80448 ~~~~~~Ferredoxin-2~~~
PYVVTENCIKCKYQDCVEVCPVDCFYEGENFLVINPDECIDCGVCNPECPAEAIAGKWLEINRKFADLWPNITRKGPALA
DADDWKDKPDKTGLLSENPGKGTVCH
>P18325 ~~~subB~~~Ferredoxin-2~~~
MRIHVDQDKCCGAGSCVLAAPDVFDQREEDGIVVLLDTAPPAALHDAVREAATICPAAAITVTD
>P08812 ~~~~~~Ferredoxin-3~~~
GYKITIDTDKCTGDGECVDVCPVEVYELQDGKAVAVNEDECLGCESCVEVCEQDALTVEEN
>P20624 ~~~fdxB~~~Ferredoxin-3~~~
MPTVAYTRGGAEYTPVYLMKIDEQKCIGCGRCFKVCGRDVMSLHGLTEDGQVVAPGTDEWDEVEDEIVKKVMALTGAENC
IGCGACARVCPSECQTHAALS
>P59799 ~~~fdx5~~~2Fe-2S ferredoxin-5~~~
MPKVIVANINAEFEGIENETIMQILYRNGIEIDSACGGHGQCTSCKVLIISGSENLYPAEFEEKDTLEENGMDPETERLS
CQAKLNGKGDVVIYLP
>P80306 ~~~fdxE~~~Ferredoxin-6~~~
MAKIIFIEHNGTRHEVEAKPGLTVMEAARDNGVPGIDADCGGACACSTCHAYVDPAWVDKLPKALPTETDMIDFAYEPNP
ATSRLTCQIKVTSLLDGLVVHLPEKQI
>P11053 ~~~fdxH~~~Ferredoxin, heterocyst~~~COG0633
MASYQVRLINKKQDIDTTIEIDEETTILDGAEENGIELPFSCHSGSCSSCVGKVVEGEVDQSDQIFLDDEQMGKGFALLC
VTYPRSNCTIKTHQEPYLA
>P46046 ~~~fdxH1~~~Ferredoxin, heterocyst~~~COG1018
MATYQVRLISKKENIDTTIEIDEETTILDGAEENGIELPFSCHSGSCSSCVGKVVEGEVDQSDQIFLDDEQVGKGFALLC
VTYPRSNCTIKTHQEPYLA
>P46047 ~~~fdxH2~~~Ferredoxin, vegetative~~~COG0633
MTTYQVRLINKKRAIDITIPVDENTTILDAAEQQDIELPFSCQSGSCSSCVAKVVEGEVDQSEQVFLDEEQMAKGFIVLC
VSYPRSDCTIRTHQEPYLV
>P71820 ~~~fdx~~~Ferredoxin Fdx~~~COG1141
MGYRVEADRDLCQGHAMCELEAPEYFRVPKRGQVEILDPEPPEEARGVIKHAVWACPTQALSIRETGE
>P80168 ~~~~~~Ferredoxin~~~COG2221
MAYVINDSCISCGACEPECPVNAITAGDDKYVIDAATCIDCGACAGVCPVDAPQPE
>P07508 ~~~~~~Ferredoxin~~~
AYFITDACISCGACESECPVSPISPGDSVYVIDADACIECGACANVCPVDAPQQK
>P03941 ~~~~~~Ferredoxin~~~
PFVITSPCIGEKAADCVETCPVDAIHEGPDQYYIDPDLCIDCAACEPVCPVNAIYQEEFVPEDEKEFIEKNRNFFRNR
>P00208 ~~~fdx~~~Ferredoxin~~~COG1145
MALMITDECINCDVCEPECPNGAISQGDETYVIEPSLCTECVGHYETSQCVEVCPVDCIIKDPSHEETEDELRAKYERIT
GEG
>P00250 ~~~~~~Ferredoxin-1~~~
MASYKVTLKTPDGDNVITVPDDEYILDVAEEEGLDLPYSCRAGACSTCAGKLVSGPAPDEDQSFLDDDQIQAGYILTCVA
YPTGDCVIETHKEEALY
>P00246 ~~~~~~Ferredoxin~~~
MATYKVTLINEAEGINETIDCDDDTYILDAAEEAGLDLPYSCRAGACSTCAGTITSGTIDQSDQSFLDDDQIEAGYVLTC
VAYPTSDCTIKTHQEEGLY
>P50727 ~~~fer~~~Ferredoxin~~~COG1141
MAKYTIVDKDTCIACGACGAAAPDIYDYDDEGIAFVTLDENKGVVEVPEVLEEDMIDAFEGCPTDSIKVADEPFEGDPLK
FE
>P10245 ~~~~~~Ferredoxin~~~
PKYTIVDKETCIACGACGAAAPDIYDYDEDGIAYVTLDDNQGIVEVPDILIDDMMDAFEGCPTDSIKVADEPFDGDPNKF
E
>P14073 ~~~~~~Ferredoxin~~~
AYKITDECIACGSCADQCPVEAISEGSIYEIDEALCTDCGACADQCPVEAIVPED
>P00247 ~~~~~~Ferredoxin~~~
MATYKVTLINDAEGLNQTIEVDDDTYILDAAEEAGLDLPYSCRAGACSTCAGKIKSGTVDQSDQSFLDDDQIEAGYVLTC
VAYPTSDCTIETHKEEELY
>P00205 ~~~~~~Ferredoxin~~~
ALYITEECTYCGACEPECPTNAISAGSEIYVIDAAGCTECVGFADAPACAAVCPAECIVQG
>P00196 ~~~~~~Ferredoxin~~~
AFVINDSCVSCGACAGECPVSAITQGDTQFVIDADTCIDCGNCANVCPVGAPNQE
>P00195 ~~~~~~Ferredoxin~~~
MAYKIADSCVSCGACASECPVNAISQGDSIFVIDADTCIDCGNCANVCPVGAPVQE
>P22846 ~~~fer~~~Ferredoxin~~~
MAYKILDTCVSCGACAAECPVDAISQGDTQFVIDADTCIDCGNCANVCPVGAPVQE
>P00197 ~~~~~~Ferredoxin~~~
AYKITDGCINCGACEPECPVEAISESDAVRVIDADKCIDCGACANTCPVDAIVEG
>P00253 ~~~~~~Ferredoxin~~~
MATFKVTLINEAEGTKHEIEVPDDEYILDAAEEEGYDLPFSCRAGACSTCAGKLVSGTVDQSDQSFLDDDQIEAGYVLTC
VAYPTSDVVIQTHKEEDLY
>P0A9R4 ~~~fdx~~~2Fe-2S ferredoxin~~~COG0633
MPKIVILPHQDLCPDGAVLEANSGETILDAALRNGIEIEHACEKSCACTTCHCIVREGFDSLPESSEQEDDMLDKAWGLE
PESRLSCQARVTDEDLVVEIPRYTINHAREH
>P00212 ~~~fer~~~Ferredoxin~~~
PKYTIVDKETCIACGACGAAAPDIYDYDEDGIAYVTLDDNQGIVEVPDILIDDMMDAFEGCPTESIKVADEPFDGDPNKF
D
>P00198 ~~~fdxA~~~4Fe-4S ferredoxin FdxA~~~COG2768
MAYVINEACISCGACEPECPVNAISSGDDRYVIDADTCIDCGACAGVCPVDAPVQA
>D0LZ73 1.16.3.1~~~fer~~~Encapsulated ferritin-like protein~~~COG3461
MSSEQLHEPAELLSEETKNMHRALVTLIEELEAVDWYQQRADACSEPGLHDVLIHNKNEEVEHAMMTLEWIRRRSPVFDA
HMRTYLFTERPILELEEEDTGSSSSVAASPTSAPSHGSLGIGSLRQEGKED
>P15788 ~~~~~~Ferredoxin~~~COG0633
MASYKVTLINEEMGLNETIEVPDDEYILDVAEEEGIDLPYSCRAGACSTCAGKIKEGEIDQSDQSFLDDDQIEAGYVLTC
VAYPASDCTIITHQEEELY
>Q45560 ~~~fdxA~~~Ferredoxin 7Fe~~~
MAYVITEPCIGTKDASCVEVCPVDCIHEGEDQYYIDPDVCIDCGACEAVCPVSAIYHEDFVPEEWKSYIQKNRDFFKK
>P00245 ~~~~~~Ferredoxin~~~
MATYKVTLISEAEGINETIDCDDDTYILDAAEEAGLDLPYSCRAGACSTCAGKITSGSIDQSDQSFLDDDQIEAGYVLTC
VAYPTSDCTIQTHQEEGLY
>P00248 ~~~petF~~~Ferredoxin~~~
MATYKVTLINEAEGLNKTIEVPDDQYILDAAEEAGIDLPYSCRAGACSTCAGKLISGTVDQSDQSFLDDDQIEAGYVLTC
VAYPTSDCVIETHKEEELY
>P00201 ~~~~~~Ferredoxin~~~
MHVISDECVKCGACASTCPTGAIEEGETKYVVTDSCIDCGACEAVCPTGAISAE
>P00209 ~~~~~~Ferredoxin-2~~~
PIEVNDDCMACEACVEICPDVFEMNEEGDKAVVINPDSDLDCVEEAIDSCPAEAIVRS
>P00203 ~~~~~~Ferredoxin~~~
MKVTVDQDLCIACGTCIDLCPSVFDWDDEGLSHVIVDEVPEGAEDSCARESVNECPTEAIKEV
>P00215 ~~~fdxA~~~Ferredoxin~~~COG1146
TYVIAEPCVDVKDKACIEECPVDCIYEGARMLYIHPDECVDCGACEPVCPVEAIYYEDDVPDQWSSYAQANADFFAELGS
PGGASKVGQTDNDPQAIKDLPPQGED
>P9WNE6 ~~~fdxA~~~Ferredoxin~~~
MTYVIGSECVDVMDKSCVQECPVDCIYEGARMLYINPDECVDCGACKPACRVEAIYWEGDLPDDQHQHLGDNAAFFHQVL
PGRVAPLGSPGGAAAVGPIGVDTPLVAAIPVECP
>P9WNE7 ~~~fdxA~~~Ferredoxin~~~COG1146
MTYVIGSECVDVMDKSCVQECPVDCIYEGARMLYINPDECVDCGACKPACRVEAIYWEGDLPDDQHQHLGDNAAFFHQVL
PGRVAPLGSPGGAAAVGPIGVDTPLVAAIPVECP
>P00193 ~~~~~~Ferredoxin~~~
AYVINDSCIACGACKPECPVNIQQGSIYAIDADSCIDCGSCASVCPVGAPNPED
>Q2RVS1 1.16.3.1~~~fer~~~Encapsulated ferritin-like protein~~~COG3461
MAQSSNSTHEPLEVLKEETVNRHRAIVSVMEELEAVDWYDQRVDASTDPELTAILAHNRDEEKEHAAMTLEWLRRNDAKW
AEHLRTYLFTEGPITAIEAADTAGEGSGGDAAKGATAQGDGSLGIGSLKGEAALARPPRL
>P24496 ~~~fdxA~~~Ferredoxin~~~
MTYVIAEPCVDVLDKACIEECPVDCIYEGGRMLYIHPDECVDCGACEPVCPVEAIYYEDDVPDEWAAYTKANVDFFDELG
SPGGAAKVGKVDRDVEPVSSLPPQGE
>P13279 ~~~~~~Ferredoxin~~~
TYVIAQPCVDVKDKACIEECPVDCIYEGQRSLYIHPDECVDCGACEPVCPVEAIFYEDDTPEEWKDYYKANVEFFDDLGS
PGGASKLGLIERDHPFVAGLPPQNA
>P08811 ~~~~~~Ferredoxin 1~~~COG1146
MTFVVTDNCIKCKYTDCVEVCPVDCFYEGPNFLVIHPDECIDCALCEPECPAQAIFSEDEVPEDQQEFIELNADLAEVWP
NITEKKDALADAEEWDGVKDKLQYLER
>P27320 ~~~petF~~~Ferredoxin-1~~~COG1018
MASYTVKLITPDGESSIECSDDTYILDAAEEAGLDLPYSCRAGACSTCAGKITAGSVDQSDQSFLDDDQIEAGYVLTCVA
YPTSDCTIETHKEEDLY
>P00243 ~~~~~~Ferredoxin~~~COG1018
MASYTVKLITPDGENSIECSDDTYILDAAEEAGLDLPYSCRAGACSTCAGKITAGSVDQSDQSFLDDDQIEAGYVLTCVA
YPTSDCTIETHKEEDLY
>P00255 ~~~~~~Ferredoxin~~~
MATYKVTLVRPDGETTIDVPEDEYILDVAEEQGLDLPFSCRAGACSTCAGKLLEGEVDQSDQSFLDDDQIEKGFVLTCVA
YPRSDCKILTHQEEELY
>P46797 ~~~fdx~~~Ferredoxin~~~COG1141
MKVRVDADACIGCGVCENLCPDVFQLGDDGKAKVLQPETDLPCAKDAADSCPTGAISVEE
>P03942 ~~~~~~Ferredoxin~~~COG1146
MPHVICEPCIGVKDQSCVEVCPVECIYDGGDQFYIHPEECIDCGACVPACPVNAIYPEEDVPEQWKSYIEKNRKLAGLE
>P00200 ~~~~~~Ferredoxin~~~
AHIITDECISCGACAAECPVEAIHEGTGKYEVDADTCIDCGACEAVCPTGAVKAE
>P0A3C9 ~~~petF1~~~Ferredoxin-1~~~COG1018
MATYKVTLVRPDGSETTIDVPEDEYILDVAEEQGLDLPFSCRAGACSTCAGKLLEGEVDQSDQSFLDDDQIEKGFVLTCV
AYPRSDCKILTNQEEELY
>Q44501 ~~~fesII~~~Protein FeSII~~~
MATIYFSSPLMPHNKKVQAVAGKRSTLLGVAQENGVKIPFECQDGNCGSCLVKITHLDGERIKGMLLTDKERNVLKSVGK
LPKSEEERAAVRDLPPTYRLACQTIVTDEDLLVEFTGEPGGA
>A0A0H2V760 3.1.1.108~~~fes~~~Iron(III) enterobactin esterase~~~COG2382
MTALKVGSESWWQSKHGPEWQRLNDEMFEVTFWWRDPQGSEEYSTIKRVWVYITGVTDHHQNSQPRSMQRIAGTDVWQWT
TQLNANWRGSYCFIPTERDDIFSAPSPDRLELREGWRKLLPQAIADPLNPQSWKGGRGHAVSALEMPQAPLQPGWDCPQA
PETPAKEIIWKSERLKNSRRVWIFTTGDATAEERPLAVLLDGEFWAQSMPVWPALTSLTHRRQLPPAVYVLIDAIDTTHR
AHELPCNADFWLAVQQELLPQVKAIAPFSDRADRTVVAGQSFGGLSALYAGLHWPERFGCVLSQSGSYWWPHRGGHQEGM
LLEQLNTGEVSAEGLRIVLEAGVREPMIMQANQALYAQLHPLKESIFWRQVDGGHDALCWRGGLMQGLIDLWQPLFHDRS
>P13039 3.1.1.108~~~fes~~~Iron(III) enterobactin esterase~~~COG2382
MTALKVGSESWWQSKHGPEWQRLNDEMFEVTFWWRDPQGSEEYSTIKRVWVYITGVTDHHQNSQPQSMQRIAGTNVWQWT
TQLNANWRGSYCFIPTERDDIFSVPSPDRLELREGWRKLLPQAIADPLNLQSWKGGRGHAVSALEMPQAPLQPGWDCPQA
PEIPAKEIIWKSERLKKSRRVWIFTTGDATAEERPLAVLLDGEFWAQSMPVWPVLTSLTHRQQLPPAVYVLIDAIDTTHR
AHELPCNADFWLAVQQELLPLVKAIAPFSDRADRTVVAGQSFGGLSALYAGLHWPERFGCVLSQSGSYWWPHRGGQQEGV
LLEKLKAGEVSAEGLRIVLEAGIREPMIMRANQALYAQLHPIKESIFWRQVDGGHDALCWRGGLMQGLIDLWQPLFHDRS
>P77279 7.2.2.-~~~fetA~~~Probable iron export ATP-binding protein FetA~~~COG4619
MQENSPLLQLQNVGYLAGDAKILNNINFSLRAGEFKLITGPSGCGKSTLLKIVASLISPTSGTLLFEGEDVSTLKPEIYR
QQVSYCAQTPTLFGDTVYDNLIFPWQIRNRQPDPAIFLDFLERFALPDSILTKNIAELSGGEKQRISLIRNLQFMPKVLL
LDEITSALDESNKHNVNEMIHRYVREQNIAVLWVTHDKDEINHADKVITLQPHAGEMQEARYELA
>P77307 ~~~fetB~~~Probable iron export permease protein FetB~~~COG0390
MNSHNITNESLALALMLVVVAILISHKEKLALEKDILWSVGRAIIQLIIVGYVLKYIFSVDDASLTLLMVLFICFNAAWN
AQKRSKYIAKAFISSFIAITVGAGITLAVLILSGSIEFIPMQVIPIAGMIAGNAMVAVGLCYNNLGQRVISEQQQIQEKL
SLGATPKQASAILIRDSIRAALIPTVDSAKTVGLVSLPGMMSGLIFAGIDPVKAIKYQIMVTFMLLSTASLSTIIACYLT
YRKFYNSRHQLVVTQLKKK
>Q3JQJ0 ~~~~~~Probable Fe(2+)-trafficking protein~~~
MARMIHCAKLGKEAEGLDFPPLPGELGKRLYESVSKQAWQDWLKQQTMLINENRLNMADPRARQYLMKQTEKYFFGEGAD
QASGYVPPAQG
>P0A8P3 ~~~yggX~~~Probable Fe(2+)-trafficking protein~~~COG2924
MSRTIFCTFLQREAEGQDFQLYPGELGKRIYNEISKEAWAQWQHKQTMLINEKKLNMMNAEHRKLLEQEMVNFLFEGKEV
HIEGYTPEDKK
>P44048 ~~~~~~Probable Fe(2+)-trafficking protein~~~COG2924
MARTVFCEYLKKEAEGLDFQLYPGELGKRIFDSVSKQAWGEWIKKQTMLVNEKKLNMMNAEHRKLLEQEMVNFLFEGKDV
HIEGYVPPSN
>Q9HU36 ~~~~~~Probable Fe(2+)-trafficking protein~~~
MSRTVMCRKYHEELPGLDRPPYPGAKGEDIYNNVSRKAWDEWQKHQTMLINERRLNMMNAEDRKFLQQEMDKFLSGEDYA
KADGYVPPSA
>P67617 ~~~yggX~~~Probable Fe(2+)-trafficking protein~~~
MSRTIFCTYLQRDAEGQDFQLYPGELGKRIYNEISKDAWAQWQHKQTMLINEKKLNMMNAEHRKLLEQEMVSFLFEGKDV
HIEGYTPEDKK
>P40409 ~~~feuA~~~Iron-uptake system-binding protein~~~COG0614
MKKISLTLLILLLALTAAACGSKNESTASKASGTASEKKKIEYLDKTYEVTVPTDKIAITGSVESMEDAKLLDVHPQGAI
SFSGKFPDMFKDITDKAEPTGEKMEPNIEKILEMKPDVILASTKFPEKTLQKISTAGTTIPVSHISSNWKENMMLLAQLT
GKEKKAKKIIADYEQDLKEIKTKINDKAKDSKALVIRIRQGNIYIYPEQVYFNSTLYGDLGLKAPNEVKAAKAQELSSLE
KLSEMNPDHIFVQFSDDENADKPDALKDLEKNPIWKSLKAVKEDHVYVNSVDPLAQGGTAWSKVRFLKAAAEKLTQN
>P40410 ~~~feuB~~~Iron-uptake system permease protein FeuB~~~COG0609
MYSKQWTRIILITSPFAIALSLLLSILYGAKHLSTDIVFTSLIHFDPGNTDHQIIWHSRIPRAAGALLIGAALAVSGALM
QGITRNYLASPSIMGVSDGSAFIITLCMVLLPQSSSIEMMIYSFIGSALGAVLVFGLAAMMPNGFTPVQLAIIGTVTSML
LSSLSAAMSIYFQISQDLSFWYSARLHQMSPDFLKLAAPFFLIGIIMAISLSKKVTAVSLGDDISKSLGQKKKTIKIMAM
LSVIILTGSAVALAGKIAFVGLVVPHITRFLVGSDYSRLIPCSCILGGIFLTLCDLASRFINYPFETPIEVVTSIIGVPF
FLYLIKRKGGEQNG
>P40411 ~~~feuC~~~Iron-uptake system permease protein FeuC~~~COG0609
MAKKYALFIALILVVSYFSLTSGSFSVRPAELLSTLFQIDPNPQYEILLFDLRLPRVVMAAIIGLGLGIAGAVIQAITRN
GLADPGILGINAGAGAGIVAFMLLFQGQKEVTSIAAAMGMPLFGLIGGLIAAILIYIFAWHRGNLDSGRIILVGIAINSG
FSALSLFLSLKMDPQDYQMAMVWKNGSIWSANWTYITAVLPWMLLFIPILIGKSRLLDTIRFDEDTVRSLGISSNKEKTI
LLVACVAIISACVSVAGSMAFVGLIAPHISRRLAGVEHRYILPLSGLIGMLLVISADFAGKLFFQPAEVPAGIILAILGV
PYFLYLLFKQKKGENA
>Q0RVH7 1.1.98.2~~~fgd1~~~F420-dependent glucose-6-phosphate dehydrogenase 1~~~COG2141
MVIKFGYKASAEQFGPRELVELGVLAEAHGMDSATVSDHFQPWRHEGGHAPFSLAWMTAVGERTSRLQLGTSVMTPTFRY
NPAVVAQAFATMGCLYPGRIMLGVGTGEALNEIATGFAGEWPEFKERFARLREAVALMRELWLGDRVDFEGNYYKTVGAS
IYDVPEGGIPVYIAAGGPVVARYAGRSGDGFICTSGKGMELYTEKLMPAVAEGAEKADRDVAEIDKMIEIKISYDTDPEL
ALENTRFWAPLSLTPEQKHSIDDPIEMERAADALPIEQVAKRWIVASDPDEAVAQIRPYLDAGLNHLVFHAPGHDQKRFL
ELFQRDLAPRLRGLA
>A0QQJ4 1.1.98.2~~~fgd~~~F420-dependent glucose-6-phosphate dehydrogenase~~~COG2141
MAELKLGYKASAEQFAPRELVELAVLAESAGMDSATVSDHFQPWRHEGGHAPFSLAWMTAVGERTKNLVLGTSVLTPTFR
YNPAVIAQAFATMGCLYPGRIFLGVGTGEALNEIATGYAGEWPEFKERFARLRESVRLMRELWLGDRVDFDGEYYRTKGA
SIYDVPEGGIPVYIAAGGPVVAKYAGRAGDGFICTSGKGEELYAEKLIPAVKEGAAAADRDADAIDRMIEIKISYDTDPE
LALENTRFWAPLSLTAEQKHSIDDPIEMEKAADALPIEQVAKRWIVASDPDEAVEKVGQYVKWGLNHLVFHAPGHDQRRF
LELFKRDLEPRLRKLA
>P9WNE1 1.1.98.2~~~fgd1~~~F420-dependent glucose-6-phosphate dehydrogenase~~~COG2141
MAELKLGYKASAEQFAPRELVELAVAAEAHGMDSATVSDHFQPWRHQGGHAPFSLSWMTAVGERTNRLLLGTSVLTPTFR
YNPAVIAQAFATMGCLYPNRVFLGVGTGEALNEIATGYEGAWPEFKERFARLRESVGLMRQLWSGDRVDFDGDYYRLKGA
SIYDVPDGGVPVYIAAGGPAVAKYAGRAGDGFICTSGKGEELYTEKLMPAVREGAAAADRSVDGIDKMIEIKISYDPDPE
LALNNTRFWAPLSLTAEQKHSIDDPIEMEKAADALPIEQIAKRWIVASDPDEAVEKVGQYVTWGLNHLVFHAPGHDQRRF
LELFQSDLAPRLRRLG
>I6Y8I5 1.8.3.7~~~~~~Formylglycine-generating enzyme~~~COG1262
MLTELVDLPGGSFRMGSTRFYPEEAPIHTVTVRAFAVERHPVTNAQFAEFVSATGYVTVAEQPLDPGLYPGVDAADLCPG
AMVFCPTAGPVDLRDWRQWWDWVPGACWRHPFGRDSDIADRAGHPVVQVAYPDAVAYARWAGRRLPTEAEWEYAARGGTT
ATYAWGDQEKPGGMLMANTWQGRFPYRNDGALGWVGTSPVGRFPANGFGLLDMIGNVWEWTTTEFYPHHRIDPPSTACCA
PVKLATAADPTISQTLKGGSHLCAPEYCHRYRPAARSPQSQDTATTHIGFRCVADPVSG
>Q9F3C7 1.8.3.7~~~~~~Formylglycine-generating enzyme~~~COG1262
MAVAAPSPAAAAEPGPAARPRSTRGQVRLPGGEFAMGDAFGEGYPADGETPVHTVRLRPFHIDETAVTNARFAAFVKATG
HVTDAERFGSSAVFHLVVAAPDADVLGSAAGAPWWINVRGAHWRRPEGARSDITGRPNHPVVHVSWNDATAYARWAGKRL
PTEAEWEYAARGGLAGRRYAWGDELTPGGRWRCNIWQGRFPHVNTAEDGHLSTAPVKSYRPNGHGLWNTAGNVWEWCSDW
FSPTYYAESPTVDPHGPGTGAARVLRGGSYLCHDSYCNRYRVAARSSNTPDSSSGNLGFRCANDADLTSGSAAE
>D1A7C3 1.8.3.7~~~~~~Formylglycine-generating enzyme~~~COG1262
MPSFDFDIPRRSPQEIAKGMVAIPGGTFRMGGEDPDAFPEDGEGPVRTVRLSPFLIDRYAVSNRQFAAFVKATGYVTDAE
RYGWSFVFHAHVAPGTPVMDAVVPEAPWWVAVPGAYWKAPEGPGSSITDRPNHPVVHVSWNDAVAYATWAGKRLPTEAEW
EMAARGGLDQARYPWGNELTPRGRHRCNIWQGTFPVHDTGEDGYTGTAPVNAFAPNGYGLYNVAGNVWEWCADWWSADWH
ATESPATRIDPRGPETGTARVTKGGSFLCHESYCNRYRVAARTCNTPDSSAAHTGFRCAADPL
>P71590 ~~~fhaA~~~FHA domain-containing protein FhaA~~~COG1716
MGSQKRLVQRVERKLEQTVGDAFARIFGGSIVPQEVEALLRREAADGIQSLQGNRLLAPNEYIITLGVHDFEKLGADPEL
KSTGFARDLADYIQEQGWQTYGDVVVRFEQSSNLHTGQFRARGTVNPDVETHPPVIDCARPQSNHAFGAEPGVAPMSDNS
SYRGGQGQGRPDEYYDDRYARPQEDPRGGPDPQGGSDPRGGYPPETGGYPPQPGYPRPRHPDQGDYPEQIGYPDQGGYPE
QRGYPEQRGYPDQRGYQDQGRGYPDQGQGGYPPPYEQRPPVSPGPAAGYGAPGYDQGYRQSGGYGPSPGGGQPGYGGYGE
YGRGPARHEEGSYVPSGPPGPPEQRPAYPDQGGYDQGYQQGATTYGRQDYGGGADYTRYTESPRVPGYAPQGGGYAEPAG
RDYDYGQSGAPDYGQPAPGGYSGYGQGGYGSAGTSVTLQLDDGSGRTYQLREGSNIIGRGQDAQFRLPDTGVSRRHLEIR
WDGQVALLADLNSTNGTTVNNAPVQEWQLADGDVIRLGHSEIIVRMH
>P12255 ~~~fhaB~~~Filamentous hemagglutinin~~~COG3064
MNTNLYRLVFSHVRGMLVPVSEHCTVGNTFCGRTRGQARSGARATSLSVAPNALAWALMLACTGLPLVTHAQGLVPQGQT
QVLQGGNKVPVVNIADPNSGGVSHNKFQQFNVANPGVVFNNGLTDGVSRIGGALTKNPNLTRQASAILAEVTDTSPSRLA
GTLEVYGKGADLIIANPNGISVNGLSTLNASNLTLTTGRPSVNGGRIGLDVQQGTVTIERGGVNATGLGYFDVVARLVKL
QGAVSSKQGKPLADIAVVAGANRYDHATRRATPIAAGARGAAAGAYAIDGTAAGAMYGKHITLVSSDSGLGVRQLGSLSS
PSAITVSSQGEIALGDATVQRGPLSLKGAGVVSAGKLASGGGAVNVAGGGAVKIASASSVGNLAVQGGGKVQATLLNAGG
TLLVSGRQAVQLGAASSRQALSVNAGGALKADKLSATRRVDVDGKQAVALGSASSNALSVRAGGALKAGKLSATGRLDVD
GKQAVTLGSVASDGALSVSAGGNLRAKQLVSSAQLEVRGQREVALDDASSARGMTVVAAGALAARNLQSKGAIGVQGGEA
VSVANANSDAELRVRGRGQVDLHDLSAARGADISGEGRVNIGRARSDSDVKVSAHGALSIDSMTALGAIGVQAGGSVSAK
DMRSRGAVTVSGGGAVNLGDVQSDGQVRATSAGAMTVRDVAAAADLALQAGDALQAGFLKSAGAMTVNGRDAVRLDGAHA
GGQLRVSSDGQAALGSLAAKGELTVSAARAATVAELKSLDNISVTGGERVSVQSVNSASRVAISAHGALDVGKVSAKSGI
GLEGWGAVGADSLGSDGAISVSGRDAVRVDQARSLADISLGAEGGATLGAVEAAGSIDVRGGSTVAANSLHANRDVRVSG
KDAVRVTAATSGGGLHVSSGRQLDLGAVQARGALALDGGAGVALQSAKASGTLHVQGGEHLDLGTLAAVGAVDVNGTGDV
RVAKLVSDAGADLQAGRSMTLGIVDTTGDLQARAQQKLELGSVKSDGGLQAAAGGALSLAAAEVAGALELSGQGVTVDRA
SASRARIDSTGSVGIGALKAGAVEAASPRRARRALRQDFFTPGSVVVRAQGNVTVGRGDPHQGVLAQGDIIMDAKGGTLL
LRNDALTENGTVTISADSAVLEHSTIESKISQSVLAAKGDKGKPAVSVKVAKKLFLNGTLRAVNDNNETMSGRQIDVVDG
RPQITDAVTGEARKDESVVSDAALVADGGPIVVEAGELVSHAGGIGNGRNKENGASVTVRTTGNLVNKGYISAGKQGVLE
VGGALTNEFLVGSDGTQRIEAQRIENRGTFQSQAPAGTAGALVVKAAEAIVHDGVMATKGEMQIAGKGGGSPTVTAGAKA
TTSANKLSVDVASWDNAGSLDIKKGGAQVTVAGRYAEHGEVSIQGDYTVSADAIALAAQVTQRGGAANLTSRHDTRFSNK
IRLMGPLQVNAGGAVSNTGNLKVREGVTVTAASFDNETGAEVMAKSATLTTSGAARNAGKMQVKEAATIVAASVSNPGTF
TAGKDITVTSRGGFDNEGKMESNKDIVIKTEQFSNGRVLDAKHDLTVTASGQADNRGSLKAGHDFTVQAQRIDNSGTMAA
GHDATLKAPHLRNTGQVVAGHDIHIINSAKLENTGRVDARNDIALDVADFTNTGSLYAEHDATLTLAQGTQRDLVVDQDH
ILPVAEGTLRVKAKSLTTEIETGNPGSLIAEVQENIDNKQAIVVGKDLTLSSAHGNVANEANALLWAAGELTVKAQNITN
KRAALIEAGGNARLTAAVALLNKLGRIRAGEDMHLDAPRIENTAKLSGEVQRKGVQDVGGGEHGRWSGIGYVNYWLRAGN
GKKAGTIAAPWYGGDLTAEQSLIEVGKDLYLNAGARKDEHRHLLNEGVIQAGGHGHIGGDVDNRSVVRTVSAMEYFKTPL
PVSLTALDNRAGLSPATWNFQSTYELLDYLLDQNRYEYIWGLYPTYTEWSVNTLKNLDLGYQAKPAPTAPPMPKAPELDL
RGHTLESAEGRKIFGEYKKLQGEYEKAKMAVQAVEAYGEATRRVHDQLGQRYGKALGGMDAETKEVDGIIQEFAADLRTV
YAKQADQATIDAETDKVAQRYKSQIDAVRLQAIQPGRVTLAKALSAALGADWRALGHSQLMQRWKDFKAGKRGAEIAFYP
KEQTVLAAGAGLTLSNGAIHNGENAAQNRGRPEGLKIGAHSATSVSGSFDALRDVGLEKRLDIDDALAAVLVNPHIFTRI
GAAQTSLADGAAGPALARQARQAPETDGMVDARGLGSADALASLASLDAAQGLEVSGRRNAQVADAGLAGPSAVAAPAVG
AADVGVEPVTGDQVDQPVVAVGLEQPVATVRVAPPAVALPRPLFETRIKFIDQSKFYGSRYFFEQIGYKPDRAARVAGDN
YFDTTLVREQVRRALGGYESRLPVRGVALVAKLMDSAGTVGKALGLKVGVAPTAQQLKQADRDFVWYVDTVIDGQKVLAP
RLYLTEATRQGITDQYAGGGALIASGGDVTVNTDGHDVSSVNGLIQGRSVKVDAGKGKVVVADSKGAGGGIEADDEVDVS
GRDIGIEGGKLRGKDVRLKADTVKVATSMRYDDKGRLAARGDGALDAQGGQLHIEAKRLETAGATLKGGKVKLDVDDVKL
GGVYEAGSSYENKSSTPLGSLFAILSSTTETNQSAHANHYGTRIEAGTLEGKMQNLEIEGGSVDAAHTDLSVARDARFKA
AADFAHAEHEKDVRQLSLGAKVGAGGYEAGFSLGSESGLEAHAGRGMTAGAEVKVGYRASHEQSSETEKSYRNANLNFGG
GSVEAGNVLDIGGADINRNRYGGAAKGNAGTEEALRMRAKKVESTKYVSEQTSQSSGWSVEVASTASARSSLLTAATRLG
DSVAQNVEDGREIRGELMAAQVAAEATQLVTADTAAVALSAGISADFDSSHSRSTSQNTQYLGGNLSIEATEGDATLVGA
KFGGGDQVSLKAAKSVNLMAAESTFESYSESHNFHASADANLGANAVQGAVGLGLTAGMGTSHQITNETGKTYAGTSVDA
ANVSIDAGKDLNLSGSRVRGKHVVLDVEGDINATSKQDERNYNSSGGGWDASAGVAIQNRTLVAPVGSAGFNFNTEHDNS
RLTNDGAAGVVASDGLTGHVKGDANLTGATIADLSGKGNLKVDGAVNAQNLKDYRDKDGGSGGLNVGISSTTLAPTVGVA
FGRVAGEDYQAEQRATIDVGQTKDPARLQVGGGVKGTLNQDAAQATVVQRNKHWAGGGSEFSVAGKSLKKKNQVRPVETP
TPDVVDGPPSRPTTPPASPQPIRATVEVSSPPPVSVATVEVVPRPKVETAQPLPPRPVAAQVVPVTPPKVEVAKVEVVPR
PKVETAQPLPPRPVVAEKVTTPAVQPQLAKVETVQPVKPETTKPLPKPLPVAKVTKAPPPVVETAQPLPPVKPQKATPGP
VAEVGKATVTTVQVQSAPPKPAPVAKQPAPAPKPKPKPKPKAERPKPGKTTPLSGRHVVQQQVQVLQRQASDINNTKSLP
GGKLPKPVTVKLTDENGKPQTYTINRREDLMKLNGKVLSTKTTLGLEQTFRLRVEDIGGKNYRVFYETNK
>P9WJB5 ~~~fhaB~~~FHA domain-containing protein FhaB~~~COG1716
MQGLVLQLTRAGFLMLLWVFIWSVLRILKTDIYAPTGAVMMRRGLALRGTLLGARQRRHAARYLVVTEGALTGARITLSE
QPVLIGRADDSTLVLTDDYASTRHARLSMRGSEWYVEDLGSTNGTYLDRAKVTTAVRVPIGTPVRIGKTAIELRP
>P35077 ~~~fhaC~~~Filamentous hemagglutinin transporter protein FhaC~~~COG2831
MTDATNRFRPGLVGRALVRAGLLFAVAACAQAQLLPGARDLNRIDDRQRKEQLQRDIERALTRPPVELNPQSEAAAPARK
PDATSGHTVTVHAVDLDFGVEGRLFDPAPLVQDYLNRPLDNEQLFLLVKALSAALYDRGYATSIVTFVPPGVVDGVLKLK
VEWGRIKGWLIDGKPLEGTRDRMMVFSAMPGWQDKVLNVFDIDQAIYNINNGGKTGNITIVPADEYGYSYLDLQLQRRAL
PRVSLGMDNSGPGTPENGRYKYNASVTANDLLGLNDTLGLYIGNRYYRDAGHDAERNYDLMYSVPLGRTRLDLQTGYSTY
RNLLKTRYGQYQSAGNSRSFGLKATRLLYRDTRSQFSVYGGLKLRQNKNYLAGTRLDVSSKHYSDVTVGMQYSTQRGANA
YFGDLSFTRGVGVNNGKYAAYDERGPQGNVSRFNGSLAWTRYMALAGQPIQWASQLGFQYSRQQLLNSYQITVGDEYTVR
GYNLRTSQSGDSGVYLSNTLTVPVQFSLLGKQASVAPFVGADVGALKSNHPDARTIRMAGLAAGVRFDLPYARMSFTYSK
PVGAQPGGAPRAPVWLYINAGLSF
>Q9JXV4 ~~~fhbP~~~Factor H binding protein~~~
MNRTAFCCLSLTTALILTACSSGGGGVAADIGAGLADALTAPLDHKDKGLQSLTLDQSVRKNEKLKLAAQGAEKTYGNGD
SLNTGKLKNDKVSRFDFIRQIEVDGQLITLESGEFQVYKQSHSALTAFQTEQIQDSEHSGKMVAKRQFRIGDIAGEHTSF
DKLPEGGRATYRGTAFGSDDAGGKLTYTIDFAAKQGNGKIEHLKSPELNVDLAAADIKPDGKRHAVISGSVLYNQAEKGS
YSLGIFGGKAQEVAGSAEVKTVNGIRHIGLAAKQ
>E6MV22 ~~~fhbP~~~Factor H binding protein~~~
MNRTAFCCLSLTTALILTACSSGGGGVAADIGAGLADALTAPLDHKDKGLQSLTLDQSVRKNEKLKLAAQGAEKTYGNGD
SLNTGKLKNDKVSRFDFIRQIEVDGQLITLESGEFQVYKQSHSALTAFQTEQIQDSEHSGKMVAKRQFRIGDIAGEHTSF
DKLPEGGRATYRGTAFGSDDAGGKLTYTIDFAAKQGNGKIEHLKSPELNVDLAAADIKPDGKRHAVISGSVLYNQAEKGS
YSLGIFGGKAQEVAGSAEVKTVNGIRHIGLAAKQ
>C5B137 3.5.1.-~~~fhcA~~~Formyltransferase/hydrolase complex Fhc subunit A~~~COG1229
MLTRIHGGRVVDPTAGRDAVGDVWIEDGRVVAPSERAPDQTIDATGCVVMAGGVEVHSHIAGGNVVMSRLLLPDLYVSES
APNGHPFAHAGGSGSWIGANYARMGYTTAVEPALPPSNALATHLELADIPLLDRGGLAVLGNDDHLLQLLRDGEGKQAVR
DLVQQTLAHSRGLGVKCINAGGASAFKDGVLKLSLDDEIPCYGLSTRKIMSALLDAVEEIGVPHPLHVHCNNLGLPGADD
SLVATLEAAEGRRIHFAHAQFYAYGVVDPENPMTGGFRSAAERINAAMEAHPNATYDVGQVVFGQTVTISLDILRQFGGR
KGAKPKKWVISAGDAEGGGVVPFLYRPRGPVSSLQWAIGLELMLLSSNPERTILTTDHPNGGVFTEYPRIIHLLMDAEER
AKEIATLPAIVGERSGLPKIEREYSFSEIAQLTRSGPAKLLGLTDRGHLREGAKADVAIYRDDKDRTAMFSRAKLVLKDG
QPIVEDGEVVAWFSGKTLSLDVEADAGMEKRAESYLQDRFGAGLDTFAVPDAAFPENTGTFEDVACRA
>C5B138 ~~~fhcB~~~Formyltransferase/hydrolase complex Fhc subunit B~~~COG1029
MAAWVKGGAADVDAAVEAAADLLAASRVPVLAGLSAEVSALRAAYRLAETLGASLDPVSGPSVYAELGALSAGGAMSTTR
AETIGRADVILIVGNRPWDGELIAEIAAAAPSRGRAAGSERALLSLGGPQNGAIRHVAYAADAGGLTISLGHLRAFAKGH
LAGEAAFADLAKRLFAAQYGVIVYDPEEVGELGAEMLQGLIRDLNESTRFFALTLADPFQGRAAVQLSAWTTGQAPRVGF
GRHQPEHDSWRFDSARQIAAGEADAALWLASLPAPRPAWLGSLPTIAIVGEGSQEAAGETAEVVITVGVPGQSVGGALWN
DRRGVIAYAEASDPAKTPAETETAAGVLTRIRDRLIEKGVSC
>C5B135 ~~~fhcC~~~Formyltransferase/hydrolase complex Fhc subunit C~~~COG2218
MSTLRLRGDLPERVDLLNITPLALSGLSEAEAGKLAIGTSRRGLTLGDVFEISLDGSDSLVIEGGSARLDRVGAALSQGS
IRVEGDVGQRLGEGMAAGSLTVTGSAGPYAGTGATGGTITIEGDAGDHAGGAVYAAKAGLDGATLVIKGAAGDHLGDRMR
RGMILAGSAGAFAASRMIAGTIVVSGALGDHPGYGMRRGTLIAGSHGTLLPTFVETGTPDLVFVRLLAQSLKHLGAAQAS
LLSGTLRRYSGDLATLGKGELFVPA
>Q49118 2.3.1.101~~~ffsA~~~Formyltransferase/hydrolase complex subunit D~~~COG2037
MSDFTLNGIKVEDTFAEAFDVAGTAIIVTNDTPKWAMIAATVMTGFATSVIGCGAEAGIDAELSPDETPDGRPGVRILLF
GFEPNGLKDQLLKRVGQCILTCPGTACFAGVEGPTKIKLGGAIRYFGDGFAVAKRLPDHEGKMRRYWRIPVMDGEFLCED
SVRAVDGAVGGGNLLFLGRKHADTLIVAEIAVEAAKAIPGAILPFPGGIVRSGSKVGGRTKGMMASTNDAYCPTLKGRAG
SALPPECGVVLEIVIDALTSAAVAESMRAALHAATEIGAQHGLVAVTAGNYGGNLGRHHYHLRDLLEKPAA
>P19323 ~~~fhlA~~~Formate hydrogenlyase transcriptional activator FhlA~~~COG3604
MSYTPMSDLGQQGLFDITRTLLQQPDLASLCEALSQLVKRSALADNAAIVLWQAQTQRASYYASREKDTPIKYEDETVLA
HGPVRSILSRPDTLHCSYEEFCETWPQLDAGGLYPKFGHYCLMPLAAEGHIFGGCEFIRYDDRPWSEKEFNRLQTFTQIV
SVVTEQIQSRVVNNVDYELLCRERDNFRILVAITNAVLSRLDMDELVSEVAKEIHYYFDIDDISIVLRSHRKNKLNIYST
HYLDKQHPAHEQSEVDEAGTLTERVFKSKEMLLINLHERDDLAPYERMLFDTWGNQIQTLCLLPLMSGDTMLGVLKLAQC
EEKVFTTTNLNLLRQIAERVAIAVDNALAYQEIHRLKERLVDENLALTEQLNNVDSEFGEIIGRSEAMYSVLKQVEMVAQ
SDSTVLILGETGTGKELIARAIHNLSGRNNRRMVKMNCAAMPAGLLESDLFGHERGAFTGASAQRIGRFELADKSSLFLD
EVGDMPLELQPKLLRVLQEQEFERLGSNKIIQTDVRLIAATNRDLKKMVADREFRSDLYYRLNVFPIHLPPLRERPEDIP
LLAKAFTFKIARRLGRNIDSIPAETLRTLSNMEWPGNVRELENVIERAVLLTRGNVLQLSLPDIVLPEPETPPAATVVAL
EGEDEYQLIVRVLKETNGVVAGPKGAAQRLGLKRTTLLSRMKRLGIDKSALI
>P0CL46 ~~~fhlA~~~Formate hydrogenlyase transcriptional activator~~~
MSYTPMSDLGQQGLFDITRTLLQQPDLASLSEALSQLVKRSALADSAGIVLWQAQSQRAQYYATRENGRPVEYEDETVLA
HGPVRRILSRPDALHCNFHEFTETWPQLAASGLYPEFGHYCLLPLAAEGRIFGGCEFIRQEDRPWSEKEYDRLHTFTQIV
GVVAEQIQNRVNNNVDYDLLCRERDNFRILVAITNAVLSRLDIDELVSEVAKEIHHYFNIDAISIVLRSHRKNKLNIYST
HYLDEHHPAHEQSEVDEAGTLTERVFKSKEMLLINLNERDPLAPYERMLFDTWGNQIQTLCLLPLMSGKTMLGVLKLAQC
EEKVFTTANLKLLRQIAERVAIAVDNALAYQEIHRLKERLVDENLALTEQLNNVDSEFGEIIGRSEAMYNVLKQVEMVAQ
SDSTVLILGETGTGKELIARAIHNLSGRSGRRMVKMNCAAMPAGLLESDLFGHERGAFTGASAQRIGRFELADKSSLFLD
EVGDMPLELQPKLLRVLQEQEFERLGSNKLIQTDVRLIAATNRDLKKMVADREFRNDLYYRLNVFPIQLPPLRERPEDIP
LLVKAFTFKIARRMGRNIDSIPAETLRTLSSMEWPGNVRELENVVERAVLLTRGNVLQLSLPDITAVTPDTSPVATESDK
EGEDEYQLIIRVLKETNGVVAGPKGAAQRLGLKRTTLLSRMKRLGIDKDALA
>P96809 1.1.98.-~~~fgd2~~~F420-dependent hydroxymycolic acid dehydrogenase~~~COG2141
MTGISRRTFGLAAGFGAIGAGGLGGGCSTRSGPTPTPEPASRGVGVVLSHEQFRTDRLVAHAQAAEQAGFRYVWASDHLQ
PWQDNEGHSMFPWLTLALVGNSTSSILFGTGVTCPIYRYHPATVAQAFASLAILNPGRVFLGLGTGERLNEQAATDTFGN
YRERHDRLIEAIVLIRQLWSGERISFTGHYFRTDELKLYDTPAMPPPIFVAASGPQSATLAGRYGDGWIAQARDINDAKL
LAAFAAGAQAAGRDPTTLGKRAELFAVVGDDKAAARAADLWRFTAGAVDQPNPVEIQRAAESNPIEKVLANWAVGTDPGV
HIGAVQAVLDAGAVPFLHFPQDDPITAIDFYRTNVLPELR
>Q988C8 1.2.1.100~~~fhmpcd1~~~5-formyl-3-hydroxy-2-methylpyridine 4-carboxylate 5-dehydrogenase~~~COG1250
MIRNIAIIGLGTMGPGMAARLARGGLQVVAYDVAPAAIERARSMLSVAETVLDALGIALPSAGVGTVRFTDDIGDAVSGA
DLVIENVPENISIKADVYRTIDGLIGQDTIVASDTSGIPITKLQAHISYPERMVGMHWSNPPHIIPMIEVIAGEKTAPQT
VATIRDLIRSIGLLPVVVKKDVPGFVENRVLYALLREAVDLVERGVIDPEDLDTCVSWGIGYKIAVIGPMALLDMAGLDI
YKSVSSFLNADLSNRDDVAPMVLEKTSASKFGIKSGEGMFCYTPEQTKALQAERARKLVAVRRILEGRE
>P06971 ~~~fhuA~~~Ferrichrome outer membrane transporter/phage receptor~~~COG4774
MARSKTAQPKHSLRKIAVVVATAVSGMSVYAQAAVEPKEDTITVTAAPAPQESAWGPAATIAARQSATGTKTDTPIQKVP
QSISVVTAEEMALHQPKSVKEALSYTPGVSVGTRGASNTYDHLIIRGFAAEGQSQNNYLNGLKLQGNFYNDAVIDPYMLE
RAEIMRGPVSVLYGKSSPGGLLNMVSKRPTTEPLKEVQFKAGTDSLFQTGFDFSDSLDDDGVYSYRLTGLARSANAQQKG
SEEQRYAIAPAFTWRPDDKTNFTFLSYFQNEPETGYYGWLPKEGTVEPLPNGKRLPTDFNEGAKNNTYSRNEKMVGYSFD
HEFNDTFTVRQNLRFAENKTSQNSVYGYGVCSDPANAYSKQCAALAPADKGHYLARKYVVDDEKLQNFSVDTQLQSKFAT
GDIDHTLLTGVDFMRMRNDINAWFGYDDSVPLLNLYNPVNTDFDFNAKDPANSGPYRILNKQKQTGVYVQDQAQWDKVLV
TLGGRYDWADQESLNRVAGTTDKRDDKQFTWRGGVNYLFDNGVTPYFSYSESFEPSSQVGKDGNIFAPSKGKQYEVGVKY
VPEDRPIVVTGAVYNLTKTNNLMADPEGSFFSVEGGEIRARGVEIEAKAALSASVNVVGSYTYTDAEYTTDTTYKGNTPA
QVPKHMASLWADYTFFDGPLSGLTLGTGGRYTGSSYGDPANSFKVGSYTVVDALVRYDLARVGMAGSNVALHVNNLFDRE
YVASCFNTYGCFWGAERQVVATATFRF
>P06972 ~~~fhuB~~~Iron(3+)-hydroxamate import system permease protein FhuB~~~COG0609
MSKRIALFPALLLALLVIVATALTWMNFSQALPRSQWAQAAWSPDIDVIEQMIFHYSLLPRLAISLLVGAGLGLVGVLFQ
QVLRNPLAEPTTLGVATGAQLGITVTTLWAIPGAMASQFAAQAGACVVGLIVFGVAWGKRLSPVTLILAGLVVSLYCGAI
NQLLVIFHHDQLQSMFLWSTGTLTQTDWGGVERLWPQLLGGVMLTLLLLRPLTLMGLDDGVARNLGLALSLARLAALSLA
IVISALLVNAVGIIGFIGLFAPLLAKMLGARRLLPRLMLASLIGALILWLSDQIILWLTRVWMEVSTGSVTALIGAPLLL
WLLPRLRSISAPDMKVNDRVAAERQHVLAFALAGGVLLLMAVVVALSFGRDAHGWTWASGALLEDLMPWRWPRIMAALFA
GVMLAVAGCIIQRLTGNPMASPEVLGISSGAAFGVVLMLFLVPGNAFGWLLPAGSLGAAVTLLIIMIAAGRGGFSPHRML
LAGMALSTAFTMLLMMLQASGDPRMAQVLTWISGSTYNATDAQVWRTGIVMVILLAITPLCRRWLTILPLGGDTARAVGM
ALTPTRIALLLLAACLTATATMTIGPLSFVGLMAPHIARMMGFRRTMPHIVISALVGGLLLVFADWCGRMVLFPFQIPAG
LLSTFIGAPYFIYLLRKQSR
>O87656 ~~~fhuB~~~Iron(3+)-hydroxamate import system permease protein FhuB~~~
MSRKREMPDGGAKSVLSDLRFGRFVGRIRRSRHPALLLLALFVAACWLTWVNFSVALPRSQWQQAIWSPDIDIIEQMIFH
YSQLPRLAISLLVGAGLGLVGVLFQQVLRNPLAEPTTLGVATGAQLGITVTTLWAIPGALTTQFAALTGACIVGALVFGV
AWGKRLSPVTLILAGLVVSLYCGAINQLLVIFHHDQLQSMFLWSTGTLTQTDWSGVQRLWPQLLGGVMLTLLLLRPMTLM
GLDDGVARNLGLALSLARLAALSLAIVLSALLVNAVGIIGFIGLFAPLLAKMLGARRLLARLMLAPLIGALILWLSDQII
LWLTRVWMEVSTGSVTALIGAPLLLWLLPRLKSMSAPDMNASDRVAAERRHVLAFAVAGGALLLLATWVALSFGRDAHGW
TWASGTLLEELMPWRWPRILAALMAGVMLAVAGCIIQRLTGNPMASPEVLGISSGAAFGVVLMLFLVPGNAFGWLLPAGS
LGAAATLLIIMIAAGRGGFSPQRMLLAGMALSTAFTMLLMMLQASGDPRMAEVLTWLSGSTYNATGGQVTRTAIVMVILL
AIVPLCRRWLTILPLGGDAARAVGMALTPSRIALLALAACLTATATMTIGPLSFVGLMAPHIARMLGFRRTMPHMVISAL
AGGVLLVFADWCGRMALFPYQIPAGLLSSFIGAPYFIYLLRKQSR
>P07821 7.2.2.16~~~fhuC~~~Iron(3+)-hydroxamate import ATP-binding protein FhuC~~~COG1120
MQEYTNHSDTTFALRNISFRVPGRTLLHPLSLTFPAGKVTGLIGHNGSGKSTLLKMLGRHQPPSEGEILLDAQPLESWSS
KAFARKVAYLPQQLPPAEGMTVRELVAIGRYPWHGALGRFGAADREKVEEAISLVGLKPLAHRLVDSLSGGERQRAWIAM
LVAQDSRCLLLDEPTSALDIAHQVDVLSLVHRLSQERGLTVIAVLHDINMAARYCDYLVALRGGEMIAQGTPAEIMRGET
LEMIYGIPMGILPHPAGAAPVSFVY
>P37580 ~~~fhuD~~~Iron(3+)-hydroxamate-binding protein FhuD~~~COG0614
MTHIYKKLGAAFFALLLIAALAACGNNSESKGSASDSKGAETFTYKAENGNVKIPKHPKRVVVMADGYYGYFKTLGINVV
GAPENVFKNPYYKGKTNGVENIGDGTSVEKVIDLNPDLIIVWTTQGADIKKLEKIAPTVAVKYDKLDNIEQLKEFAKMTG
TEDKAEKWLAKWDKKVAAAKTKIKKAVGDKTISIMQTNGKDIYVFGKDFGRGGSIIYKDLGLQATKLTKEKAIDQGPGYT
SISLEKLPDFAGDYIFAGPWQSGGDDGGVFESSIWKNLNAVKNGHVYKMDPIGFYFTDPISLEGQLEFITESLTK
>P07822 ~~~fhuD~~~Iron(3+)-hydroxamate-binding protein FhuD~~~COG0614
MSGLPLISRRRLLTAMALSPLLWQMNTAHAAAIDPNRIVALEWLPVELLLALGIVPYGVADTINYRLWVSEPPLPDSVID
VGLRTEPNLELLTEMKPSFMVWSAGYGPSPEMLARIAPGRGFNFSDGKQPLAMARKSLTEMADLLNLQSAAETHLAQYED
FIRSMKPRFVKRGARPLLLTTLIDPRHMLVFGPNSLFQEILDEYGIPNAWQGETNFWGSTAVSIDRLAAYKDVDVLCFDH
DNSKDMDALMATPLWQAMPFVRAGRFQRVPAVWFYGATLSAMHFVRVLDNAIGGKA
>P16869 ~~~fhuE~~~Hydroxamate siderophore receptor FhuE~~~COG4773
MLSTQFNRDNQYQAITKPSLLAGCIALALLPSAAFAAPATEETVIVEGSATAPDDGENDYSVTSTSAGTKMQMTQRDIPQ
SVTIVSQQRMEDQQLQTLGEVMENTLGISKSQADSDRALYYSRGFQIDNYMVDGIPTYFESRWNLGDALSDMALFERVEV
VRGATGLMTGTGNPSAAINMVRKHATSREFKGDVSAEYGSWNKERYVADLQSPLTEDGKIRARIVGGYQNNDSWLDRYNS
EKTFFSGIVDADLGDLTTLSAGYEYQRIDVNSPTWGGLPRWNTDGSSNSYDRARSTAPDWAYNDKEINKVFMTLKQQFAD
TWQATLNATHSEVEFDSKMMYVDAYVNKADGMLVGPYSNYGPGFDYVGGTGWNSGKRKVDALDLFADGSYELFGRQHNLM
FGGSYSKQNNRYFSSWANIFPDEIGSFYNFNGNFPQTDWSPQSLAQDDTTHMKSLYAATRVTLADPLHLILGARYTNWRV
DTLTYSMEKNHTTPYAGLVFDINDNWSTYASYTSIFQPQNDRDSSGKYLAPITGNNYELGLKSDWMNSRLTTTLAIFRIE
QDNVAQSTGTPIPGSNGETAYKAVDGTVSKGVEFELNGAITDNWQLTFGATRYIAEDNEGNAVNPNLPRTTVKMFTSYRL
PVMPELTVGGGVNWQNRVYTDTVTPYGTFRAEQGSYALVDLFTRYQVTKNFSLQGNVNNLFDKTYDTNVEGSIVYGTPRN
FSITGTYQF
>P39405 ~~~fhuF~~~Ferric siderophore reductase~~~COG4114
MAYRSAPLYEDVIWRTHLQPQDPTLAQAVRATIAKHREHLLEFIRLDEPAPLNAMTLAQWSSPNVLSSLLAVYSDHIYRN
QPMMIRENKPLISLWAQWYIGLMVPPLMLALLTQEKALDVSPEHFHAEFHETGRVACFWVDVCEDKNATPHSPQHRMETL
ISQALVPVVQALEATGEINGKLIWSNTGYLINWYLTEMKQLLGEATVESLRHALFFEKTLTNGEDNPLWRTVVLRDGLLV
RRTCCQRYRLPDVQQCGDCTLK
>P27711 ~~~~~~Fibril protein~~~
MIGVISTAYFTMKDKHSIKTVKKYWWKNCVIQHVKYHGKTFIIATVGYGKANAAMTITYLLEKYPGLQTILNVDLALSTN
DKHDTGDTTISTKFIYRDADLTVFKDIKYGQIVNEPESFQFDGEFAKVVKDFKLGLTEGVTGTADMLIYNSKQFKEMVDK
YGHTIDVIDTEAGAIAQVAKKSSINYIALKIIYNNALSPWDNDPIHKFKMYETVNTLKYLLRRLFNLLSSNYIIDLSQCS
QDDLDSINELFEIKHDQWIKLFKPNTHKVLSGFGPSLMLVDKQEKTPVALDIIQVIRSKTKEAEGPSKVILGEDEWKNAP
KKWLRKLLFLEQVRVNDDELLWNKSAKYDLNNEKLYKIETVETVANEIAAAIAEKCQDKSSYTYNGATVPEKYLLVNCDA
RISFYITHNQSHEFVEDKNFGTQLVSNEFLKYLNEALKDVDSPYQQIVIYMTIPALDYRKISVFIPSNKGANRGVKFVAL
NQKLQRDYTVVDITRNDYDPIKVGSFKVTIRLKSE
>A6QG59 ~~~fib~~~Fibrinogen-binding protein~~~
MKNKLIAKSLLTLAAIGITTTTIASTADASEGYGPREKKPVSINHNIVEYNDGTFKYQSRPKFNSTPKYIKFKHDYNILE
FNDGTFEYGARPQFNKPAAKTDATIKKEQKLIQAQNLVREFEKTHTVSAHRKAQKAVNLVSFEYKVKKMVLQERIDNVLK
QGLVK
>P68799 ~~~fib~~~Fibrinogen-binding protein~~~
MKNKLIAKSLLTIAAIGITTTTIASTADASEGYGPREKKPVSINHNIVEYNDGTFKYQSRPKFNSTPKYIKFKHDYNILE
FNDGTFEYGARPQFNKPAAKTDATIKKEQKLIQAQNLVREFEKTHTVSAHRKAQKAVNLVSFEYKVKKMVLQERIDNVLK
QGLVR
>P68800 ~~~fib~~~Fibrinogen-binding protein~~~
MKNKLIAKSLLTIAAIGITTTTIASTADASEGYGPREKKPVSINHNIVEYNDGTFKYQSRPKFNSTPKYIKFKHDYNILE
FNDGTFEYGARPQFNKPAAKTDATIKKEQKLIQAQNLVREFEKTHTVSAHRKAQKAVNLVSFEYKVKKMVLQERIDNVLK
QGLVR
>P0C6P2 ~~~fib~~~Fibrinogen-binding protein~~~
MKNKLIAKSLLTIAAIGITTTTIASTADASEGYGPREKKPVSINHNIVEYNDGTFKYQSRPKFNSTPKYIKFKHDYNILE
FNDGTFEYGARPQFNKPAAKTDATIKKEQKLIQAQNLVREFEKTHTVSAHRKAQKAVNLVSFEYKVKKMVLQERIDNVLK
QGLVR
>P20605 2.7.7.108~~~fic~~~Probable protein adenylyltransferase Fic~~~COG2184
MSDKFGEGRDPYLYPGLDIMRNRLNIRQQQRLEQAAYEMTALRAATIELGPLVRGLPHLRTIHRQLYQDIFDWAGQLREV
DIYQGDTPFCHFAYIEKEGNALMQDLEEEGYLVGLEKAKFVERLAHYYCEINVLHPFRVGSGLAQRIFFEQLAIHAGYQL
SWQGIEKEAWNQANQSGAMGDLTALQMIFSKVVSEAGESE
>P69380 ~~~fieF~~~Cation-efflux pump FieF~~~COG0053
MNQSYGRLVSRAAIAATAMASLLLLIKIFAWWYTGSVSILAALVDSLVDIGASLTNLLVVRYSLQPADDNHSFGHGKAES
LAALAQSMFISGSALFLFLTGIQHLISPTPMTDPGVGVIVTIVALICTIILVSFQRWVVRRTQSQAVRADMLHYQSDVMM
NGAILLALGLSWYGWHRADALFALGIGIYILYSALRMGYEAVQSLLDRALPDEERQEIIDIVTSWPGVSGAHDLRTRQSG
PTRFIQIHLEMEDSLPLVQAHMVADQVEQAILRRFPGSDVIIHQDPCSVVPREGKRSMLS
>A7M4E1 ~~~~~~Putative fimbrium tip subunit Fim1C~~~
MKQYKLMQVALLAILLFGWAGCSQNEEEVPGNVRNGIVLNVTDTGIISNEPSTRTEDTGFVTTFTQGDQIGLFAVKDGAI
LDEINNMPFTFNGSSWSGKPILYDDRLVGVNFYAYYPYQSEMTGKTDLIGDDFFAPLAAGWELTTEQSDQKAYAKQDLMT
SNATALIGENGNYSLSFQLTHRMSLVVVKLPSTRYIFTDAEGVAMPEETPYVAMSVDVAFYLDNVEEGTKISPYYDAKKD
EYRLLRKPSSENQIIGHYNDKQCTLDTAEKMKEGKYKRFVVDGGYKEVTHHLQVGDYYYADGSVVSGNEAEPAKDNCIGI
VCWVGNPMPSVLYKDVAGTPYTATNDALLRSHPNCVHGLVMSLYTETGKFSPALTQSIHDWFMTTSFTSSYVSVTGYYDA
NENNKNKPLRFLGYNNSEVLDLYYDTFKTDFECFQYQDDCESSFPSPSITTGWYVPSSGELVALQDKDNSLESKLNTKLI
KVSDKTMDISATYWSSTERNNKNMYIVTYSKTAGSAGTGGVKTNTYTYRFFLGF
>A7UZ95 ~~~fim1C~~~Putative fimbrium subunit Fim1C~~~
MKKQALICALLATVLLPGCSEDGENTPQPTDGRVALEATSGIRMNTRAYDKTWEAGDAIGIYMLNGDATDGNGNRKYTTA
QTAENGSFTAAEGQTIYFPVDASQRDFVAYYPYRETLADGNVYTVDVSVQTPQKDIDLMGAAKVEGKDKTDPKVAFVFTH
KLVKLDITIKADGTSLTDADLAGTTVSISNQQTAATYNVVTGGDATVTTGTTKEIVLHTDGLKAEGIVLPAASTAGMALT
FTVPGLEGQAFHWDVNSAAQSKAFVAGSKYLYTITISKAGVEVSSKVEDWTPGNGGGETGNAE
>A6LHQ6 ~~~~~~Putative fimbrium tip subunit Fim1C~~~
MKLLANIFLSGLAILACVSCSKDEDPVLPLEGAKLSVAVKASGTATKAYNPNDVNELEGEAYINNLAVVVFNETGTELLG
YKWEALSGAEHSAIIADVPTTKAVRARIIVLANVPRDLLSTVSTYDEFQTRLVDLSSQSQTNLTMSSQVIVTKSALSEED
NYLGYTDLGDQNVDGISDPILLTRVAARIDLVNISTRFAGTPFAGREVRIDAVGIYNMKTKSYYFSEADWGETEAPDAVR
NSEDTSFEDLLVNDGTSISNTPFVHYVMENMKSDDHTMIAVKATLRGNSSYQDHTKIFTAVINAGGLQNGYDHNFIRRNY
VYRLRIYFDGESFDNIPVTPDPGPGPDPEPEVDTNLNIAVQVVGWGPVMQHPVID
>A6L3B5 ~~~fim1C~~~Fimbrium subunit Fim1C~~~
MEVKSLLMVMATLTIAGCSQNEMTEMNPDTNRTIGLDVYTEVQTRGTETTTSTLKANAGFGIFAYQTSSAGWNSEKGNTT
PNFMYNEHATWTSDSWGYTNLRFWPIDDKKITFFAYAPYESKPEVGTDQKITLSGQNAKGAPTITFEVKTSNNWKDMIDL
VTDCHTAIQDQTNESNKGTVQFKFSHVLTQIANIKVKPDVNLGTDTKIFVTGLKLDPGSTTLYNKAVYKFDNDTWEAISP
DASYFSTEQDLSDFLNKTTTDQWGYNKSSINVSDDQNATALFSDTEALYFIPVNNKNGTTNAGDLKLKINYDIVTKVTDT
SNLTSTITNKEVSLPKNTFKKGTKHTYVLTIKMNAIKITVEDNMEGWTDDSDSDINVEK
>A6LHQ9 ~~~~~~Putative fimbrium tip subunit Fim1F~~~
MRFNVVLFMLIVALLGGLSTCSSEVPIGFDTDELSFDMSLVLLTGDMQTKASDPNYTYATTEELTIQNCHVAVFDKDGKR
IYFKNFYSKDLGEMKTIGNLSGYELQLEGVRTFGKEDKKVSVLVVANANNANNSPFDNLTTYDGVDNSYTAKTIAKGPVT
ASLLVKIGKSETTLKYNQDNAPVTVSLIQLSAKIEYTGVYKKENGELLEGFSLTKVAGLNASSKITIFNTSAVENGAFSD
LAYPTTKPVTFYTYEISDAFKEVILSVQSGVEPKEYPFPANKFIKGNYYRIKGLKSSTEIEWVLENVEDKEVTLDPFE
>P04128 ~~~fimA~~~Type-1 fimbrial protein, A chain~~~COG3539
MKIKTLAIVVLSALSLSSTAALAAATTVNGGTVHFKGEVVNAACAVDAGSVDQTVQLGQVRTASLAQEGATSSAVGFNIQ
LNDCDTNVASKAAVAFLGTAIDAGHTNVLALQSSAAGSATNVGVQILDRTGAALTLDGATFSSETTLNNGTNTIPFQARY
FATGAATPGAANADATFKVQYQ
>Q47223 ~~~fimA~~~Type-1 fimbrial protein, A chain~~~
MKIKTLAIVVLSALSLSSTAALADTTPTTVNGGTVHFKGEVVNAACAVDAGSVDQTVQLGQVRTATLKQAGATSSAVGFN
IQLNDCDTTVATKAAVAFLGTAIDSTHPKVLALQSSAAGSATNVGVQILDRTGNELTLDGATFSAETTLNNGTNTIPFQA
RYFATGAATPGAANADATFKVQYQ
>B2RH54 ~~~fimA~~~Major fimbrium subunit FimA type-1~~~
MKKTKFFLLGLAALAMTACNKDNEAEPVTEGNATISVVLKTSNSNRAFGVGDDESKVAKLTVMVYNGEQQEAIKSAENAT
KVEDIKCSAGQRTLVVMANTGAMELVGKTLAEVKALTTELTAENQEAAGLIMTAEPKTIVLKAGKNYIGYSGTGEGNHIE
NDPLKIKRVHARMAFTEIKVQMSAAYDNIYTFVPEKIYGLIAKKQSNLFGATLVNADANYLTGSLTTFNGAYTPANYANV
PWLSRNYVAPAADAPQGFYVLENDYSANGGTIHPTILCVYGKLQKNGADLAGADLAAAQAANWVDAEGKTYYPVLVNFNS
NNYTYDSNYTPKNKIERNHKYDIKLTITGPGTNNPENPITESAHLNVQCTVAEWVLVGQNATW
>P0C940 ~~~fimA~~~Major fimbrium subunit FimA type-1~~~
MKKTKFFLLGLAALAMTACNKDNEAEPVTEGNATISVVLKTSNSNRAFGVGDDESKVAKLTVMVYNGEQQEAIKSAENAT
KVEDIKCSAGQRTLVVMANTGAMELVGKTLAEVKALTTELTAENQEAAGLIMTAEPKTIVLKAGKNYIGYSGTGEGNHIE
NDPLKIKRVHARMAFTEIKVQMSAAYDNIYTFVPEKIYGLIAKKQSNLFGATLVNADANYLTGSLTTFNGAYTPANYANV
PWLSRNYVAPAADAPQGFYVLENDYSANGGTIHPTILCVYGKLQKNGADLAGADLAAAQAANWVDAEGKTYYPVLVNFNS
NNYTYDSNYTPKNKIERNHKYDIKLTITGPGTNNPENPITESAHLNVQCTVAEWVLVGQNATW
>P37921 ~~~fimA~~~Type-1 fimbrial protein, A chain~~~
MKHKLMTSTIASLMFVAGAAVAADPTPVSVSGGTIHFEGKLVNAACAVSTKSADQTVTLGQYRTASFTAIGNTTAQVPFS
IVLNDCDPKVAANAAVAFSGQADNTNPNLLAVSSADNSTTATGVGIEILDNTSSPLKPDGATFSAKQSLVEGTNTLRFTA
RYKATAAATTPGQANADATFIMKYE
>Q51822 ~~~fimA~~~Major fimbrium subunit FimA type-2~~~
MKKTKFFLLGLAALAMTACNKDNEAEPVTEGNATISVVLKTSNPNRAFGEDESKVAKLTVMVYNGEQQEAIKSAENATKV
EDIKCSAGQRTLVVMANTGEMKLAGKTLAEVKALTTELTAENQEAAGLIMTAEPVEVTLVAGNNYYGYDGSQGGNQISQD
TPLEIKRVHARMAFTEIKVQMSPSYVNKYNFAPENIYALVAKKESNLFGASLANSDDAYLTGSLTNFNGAYSPANYTHVD
WLGRDYTEPSNNAPQGFYVLESTYAQNAGLRPTILCVKGKLTKHDGTPLSSEEMTAAFNAGWIVADNNPTTYYPVLVNFN
SNNYTYDNGYTPKNKIERNHKYDIKLTITGPGTNNPENPITESAHLNVQCTVAEWVLVGQNATW
>Q51827 ~~~fimA~~~Major fimbrium subunit FimA type-4~~~
MKKTKFFLLGLAALAMTACNKDNEAEPIVETDATVSFIIKSGEGRAVGDGLADAKITKLTAMVYAGQIQEGIKTVEEADG
VLKVEGIPCKSGANRVLVVVANHNYELTGKSLNEVEALTTSLTAENQNAKNLIMTGKSAAFTIKPGSNHYGYPDGTTSDN
LVSAGTPLAVTRVHAGISFAGVEVNMATQYQNYYSFNPADAKIAALVAKKDSKIFGNSLVSNTNAYLYGVQTPAGLYTPD
AAGETYELEASLNTNYAVGAGFYVLESKYDASNELRPTILCIYGKLLDKDGNPLTEPALTDAINAGFCDGDGTTYYPVLV
NYDGNGYIYSGAITQGQNKIVRNNHYKITLNITGPGTNTPENPQPVQANLNVTCQVTPWVVVNQAATW
>A5EWR9 ~~~fimA~~~Type IV major fimbrial protein FimA~~~COG4969
MKSLQKGFTLIELMIVVAIIGILAAFAIPAYNDYIARTQVSEGVSLADGLKIRIADNLQDGKCTSEGDPASGEVGNTDMG
KYALATIEGTPDANLAGLTPKDPNGCKVKIEYGKGTAGDNISPLIKGQMLVLNQLVNGSYDKDSSSTVKPKFLPKALKEA
TP
>P59914 ~~~fimA~~~Major fimbrium subunit FimA type-4~~~
MKKTKFFLLGLAALAMTACNKDNEAEPVVETNATVSFIIKSGESRAVGDDLTDAKITKLTAMVYAGQVQEGIKTVEEDGG
VLKVEGIPCKSGANRVLVVVANHNYELTGKSLNEVEALTTSLTAENQNAKNLIMTGKSAAFTIKPGSNHYGYPGGTASDN
LVSAGTPLAVTRVHAGISFAGVEVNMATQYQNYYSFKPADAKIAALVAKKDSKIFGNSLVSNTNAYLYGVQTPAGLYTPD
AAGETYELEASLNTNYAVGAGFYVLESKYDASNELRPTILCIYGKLLDKDGNPLTEPALTDAINAGFCDGDGTTYYPVLV
NYDGNGYIYSGAITQGQNKIVRNNHYKISLNITGPGTNTPENPQPVQANLNVTCQVTPWVVVNQAATW
>A7LXW1 ~~~~~~Putative fimbrium anchoring subunit Fim4B~~~
MRNTRYGFLVLLSSLLMLTGCSRRDILDDYPVSGVDIKLDWDGVTDQLPEGVRVIFYPKNGDGRKVDKYLSVRGGEMKVP
PGRYSVVVYNYNTESIRIRGEESYETIEAYTGNCNGLGIEGTEKMVWSPDSLYVLNIDELKIEKSEEVLRLDWKLESVVK
KYSFAVEAKGLEYVATVVGSIDGLSDCYCIGKGRGVCSSQPIYFEVKKGDNKVTAFFTAFKQVKEMTMPTRMSTSERETS
SEKGAIILILKFIKTDNTVQEATIDVTEIIGTLENAGTGEDGKPTPPPEIELPPDDKIEVDKPETPPNPDGGGGMGGNVD
GWGPEDNVELPVN
>P31697 ~~~fimC~~~Chaperone protein FimC~~~COG3121
MSNKNVNVRKSQEITFCLLAGILMFMAMMVAGRAEAGVALGATRVIYPAGQKQEQLAVTNNDENSTYLIQSWVENADGVK
DGRFIVTPPLFAMKGKKENTLRILDATNNQLPQDRESLFWMNVKAIPSMDKSKLTENTLQLAIISRIKLYYRPAKLALPP
DQAAEKLRFRRSANSLTLINPTPYYLTVTELNAGTRVLENALVPPMGESTVKLPSDAGSNITYRTINDYGALTPKMTGVM
E
>B2RH57 ~~~fimC~~~Major fimbrium subunit FimC~~~
MKMKYFHHPSGLLPRLLLLLLLTMGAVACTKEDNPDQPTSDEVATVKMSLDDVEMRGGDLYSGEDLIKKVRIFVFREGLN
GLWVLDKQKLFASGQSDFQNPFTISAHAGPRQIYVIANEPDALTTKLDKILFKKELEDMQAPDVNEPIVRPFTMTGMATA
TLNPQGTVQANISLNRIAAKITLDIKQVTPGSDVIKITKVQILRNAKNSRLLEGTNKPTGYWNWANACDLPLTNNGSAQS
IIQASAPLYVYENIGSDSDSSGRATQLVVEALYNGIKTRYYAYVNDKTTTANHHYSIRRNHHYKLDGTITKMGEFSSLLL
TTTVLPWTVENLDYGFLVPYVAEINPHAVITQDNVVTFENSLSFTVRIKGRDGSRWKATLDNGLEFGFDSGSAIDGAADG
TTVYTIKVKALKPNGIGIQRRTNLFFTVDGKKVILDKNINPQPTDIKIIQQGL
>P30130 ~~~fimD~~~Outer membrane usher protein FimD~~~COG3188
MSYLNLRLYQRNTQCLHIRKHRLAGFFVRLVVACAFAAQAPLSSADLYFNPRFLADDPQAVADLSRFENGQELPPGTYRV
DIYLNNGYMATRDVTFNTGDSEQGIVPCLTRAQLASMGLNTASVAGMNLLADDACVPLTTMVQDATAHLDVGQQRLNLTI
PQAFMSNRARGYIPPELWDPGINAGLLNYNFSGNSVQNRIGGNSHYAYLNLQSGLNIGAWRLRDNTTWSYNSSDRSSGSK
NKWQHINTWLERDIIPLRSRLTLGDGYTQGDIFDGINFRGAQLASDDNMLPDSQRGFAPVIHGIARGTAQVTIKQNGYDI
YNSTVPPGPFTINDIYAAGNSGDLQVTIKEADGSTQIFTVPYSSVPLLQREGHTRYSITAGEYRSGNAQQEKTRFFQSTL
LHGLPAGWTIYGGTQLADRYRAFNFGIGKNMGALGALSVDMTQANSTLPDDSQHDGQSVRFLYNKSLNESGTNIQLVGYR
YSTSGYFNFADTTYSRMNGYNIETQDGVIQVKPKFTDYYNLAYNKRGKLQLTVTQQLGRTSTLYLSGSHQTYWGTSNVDE
QFQAGLNTAFEDINWTLSYSLTKNAWQKGRDQMLALNVNIPFSHWLRSDSKSQWRHASASYSMSHDLNGRMTNLAGVYGT
LLEDNNLSYSVQTGYAGGGDGNSGSTGYATLNYRGGYGNANIGYSHSDDIKQLYYGVSGGVLAHANGVTLGQPLNDTVVL
VKAPGAKDAKVENQTGVRTDWRGYAVLPYATEYRENRVALDTNTLADNVDLDNAVANVVPTRGAIVRAEFKARVGIKLLM
TLTHNNKPLPFGAMVTSESSQSSGIVADNGQVYLSGMPLAGKVQVKWGEEENAHCVANYQLPPESQQQLLTQLSAECR
>B2RH58 ~~~fimD~~~Major fimbrium tip subunit FimD~~~
MRTNRILNIICPPILFLLVGFLFGCVREDIESDMNETSSLFLQVQPYNQRSEEGGVAAYDENTIERLTLVFYKNGTKVWQ
AEPVETSPSSNSYYVPVPESMYGQFNGNNSFKIYLVANVNFSGSFEPNASETSFLKTLVPNSILLQNDGKPEDKFAMIGS
VEKQINMATSEGKQLGSIELKRVAAKLRLKKPVLNISDYELVGDPKAKFRNCMPKGFLSVEEKPEGVGYEAIDYRPMTEA
NSSVHFYSYYNEWALNNEGRPEFVMMLKLKKTGTDDNTAKPYYYRIPVDGSDKKIRSNHLYDMAVTIEVLGSLNEEDPVT
INGSLSVIEWTSHSDDQTLPDVQYLEVIPQETVMNMTTEIELDYFSSHSLLPPADVKATCTYVNSNGQQITDTYTGANVP
TVTIDANTKKIKVRSILPINNIPKDISFTIKNSIGFEKKIKIRQNPSQFIINTFGTKSSWQPEGNLAPNLNNKAIYQIVV
LSPPADGNMIIGFPPTKEVGFYKKSGSSYTLKHTDRITEQDEQTANMVSPSFELASQLGATLVQDHWEYYTLNPLRLIYH
SNQQNRYALMTCAFYWEERKKADGTIERLDDWRLPTRAEIQLVDKLQREQAGVVRDIMTGRYYWSGLPDKAIKILLPTAS
GNATEQRAHVRCVRDVKNDRFVKSAKRLKK
>B2RH59 ~~~fimE~~~Major fimbrium tip subunit FimE~~~
MKSKSIIAQLLYVLIAFMAVSCVADKSEPCPSGEPTRVSGSIVSLEHHGLRGASADKENSVERLELWVFDEDGHFLERAV
ADLSGSTFTAKIIPSEVERRIHFIANYELADPSVWVGRSEREMLPSISVADDLETIRMWARISYPSIAPNQNLGQIQLLR
NMAKFSLSVTPPAESKLYDASYALYNSWNKGTLAPFDPNTGSFPQGQITEPAGVVFANPTSEAAFKEADGAHFFYGFERD
QSNIGTGAGITCLILKARYNLPNADYTYYKLDFVDTNKVRYNITRNHFYKMILKKAKAPGRPTLQEALDGAAANNIFLSA
EVQALPAFSDGSGMLTVDHTYMVFVQGEPSGTFQATYIPQGQNNPDYSKLTVSVSTPTGQQAAVTSAQHEGNGKIKLTLA
QQENLTKRSDVVIGVQGNPDLKRSVTVLVREKYQYVFFKANTSSAENNQVTTQISAGQGNELLISAKLPDVLNAALLPIT
FKVYTEHFYPKTGGMILGIEGGKTLYKYVLTTMPQNKELQFRFKSNKVNSAENIAVKMDYFHDQTIHVTN
>P08189 ~~~fimF~~~Protein FimF~~~COG3539
MRNKPFYLLCAFLWLAVSHALAADSTITIRGYVRDNGCSVAAESTNFTVDLMENAAKQFNNIGATTPVVPFRILLSPCGN
AVSAVKVGFTGVADSHNANLLALENTVSAASGLGIQLLNEQQNQIPLNAPSSALSWTTLTPGKPNTLNFYARLMATQVPV
TAGHINATATFTLEYQ
>P08190 ~~~fimG~~~Protein FimG~~~COG3539
MKWCKRGYVLAAILALASATIQAADVTITVNGKVVAKPCTVSTTNATVDLGDLYSFSLMSAGAASAWHDVALELTNCPVG
TSRVTASFSGAADSTGYYKNQGTAQNIQLELQDDSGNTLNTGATKTVQVDDSSQSAHFPLQVRALTVNGGATQGTIQAVI
SITYTYS
>P08191 ~~~fimH~~~Type 1 fimbrin D-mannose specific adhesin~~~COG3539
MKRVITLFAVLLMGWSVNAWSFACKTANGTAIPIGGGSANVYVNLAPVVNVGQNLVVDLSTQIFCHNDYPETITDYVTLQ
RGSAYGGVLSNFSGTVKYSGSSYPFPTTSETPRVVYNSRTDKPWPVALYLTPVSSAGGVAIKAGSLIAVLILRQTNNYNS
DDFQFVWNIYANNDVVVPTGGCDVSARDVTVTLPDYPGSVPIPLTVYCAKSQNLGYYLSGTTADAGNSIFTNTASFSPAQ
GVGVQLTRNGTIIPANNTVSLGAVGTSAVSLGLTANYARTGGQVTAGNVQSIIGVTFVYQ
>P39264 ~~~fimI~~~Fimbrin-like protein FimI~~~COG3539
MKRKRLFLLASLLPMFALAGNKWNTTLPGGNMQFQGVIIAETCRIEAGDKQMTVNMGQISSNRFHAVGEDSAPVPFVIHL
RECSTVVSERVGVAFHGVADGKNPDVLSVGEGPGIATNIGVALFDDEGNLVPINRPPANWKRLYSGSTSLHFIAKYRATG
RRVTGGIANAQAWFSLTYQ
>Q9I2S3 ~~~fimL~~~Scaffold protein FimL~~~
MVTGATSLSLVRDELFATMEQAEQGLEQFIAERQNGSLLQHAVECLQQIRGTLNLIELAGAELLAQEALQLATDIPTGVS
EERDGQLAALGNALYVLRRYLENVEANRQEIPELLLPAINEVRCAAGQPALPESFFFSARLDIPRPPSTAIDHLPSEAEL
GEESRRMRHMYQIGLLGLIREQNLYPSLKLMGRALARLDSLHGGVARSRLCWIGAAAIESIVDGQLLPRKSRKQLFSRID
RELKQLLIGPAYEAPRHLLKELLYLVALSDGQGPRSREVRELHGLAPLPFTDHLLEEESQRLSGPGQSVMRSLSTAIREE
LAGVKDMLDLIERGVAQPDSLTNLHAQLGKLSKTLGMVGLNSAGTALQTQLPTVAAWAASGVADSPPALLRLADAVLYVE
SMVGNLERGERRIIRPTPAEPGQEADAFAVHQLAEARIVVIEEAKAGLALAKRAITAYLESNGDKLNLANVPASLQAVRG
GLWFLGQERAALLVGGCADYIQQRMIETAQMPSEQMLETLADALTSLEYYLEGGAVLRPQGQPDVLDLASASVKALGLPV
AA
>A0A0H2ZC68 ~~~fimV~~~Motility hub protein FimV~~~
MVRLRTLVRAIAAASVLTSGMAHGLGLGEITLKSALNQPLDAEIELLEVRDLGSGEVIPSLASPEEFSKAGVDRLYYLTD
LKFTPVVKPNGKSVIRVTSSKPVQEPYLNFLVQVLWPNGRLLREYTVLLDPPLYSPQAAASAPQAPVSAPRATGAPRTPQ
APAPVRTTAPAGSDTYRTVSNDTLWEIAQRNRTDRVSVPQAMLAFQELNPGAFVDGNINRLKSGQVLRIPTEQQMLERSP
REALSQVQAQNQSWRGSRNPAAGTAGARQLDATQRNAAGSAPSKVDATDNLRLVSGEGKASKGADKGGKGDSKAIADTLA
VTKESLDSTRRENEELQSRMQDLQSQLDKLQKLIQLKDAQLAKLQGQLGAEGQGAAQPNAALPDASQPNAAAQAPAQPGT
PAAAAPTPAPAGEAPAAPAQPPVAPPPAPVAEKPPAPAVPAPAPVQAAEQPAPSFLDELLANPLWLAVIGGSALLALLVL
LMILSRRNAQKEKEEAQAFAADAGEEQEDALDLGKDGFDDLTLDEPEPQVAAAAPQVEKTTAQTSDALGEADIYIAYGRF
NQAAELLQNAIYDEPQRTDLRLKLMEVYAEMGDREGFARQENELREIGGAQPQVEQLKSRYPAMVAVAAVAGLAGAKLAQ
DELDSFSLDDLSLDDSGHAAKPDAAGQDLDDAFDLSLDDLGGGDLGSDDVQADLKSDSGALDDLTLDSDLDLAASTAADK
PVDDLDFGLDFAELAETPSQPKHDDLGDFSLDLDAPEDKLSDDDFLLSLNDEVPAAAPANNEFTLDTEAAEEPALSLPDD
FDLSLADEPTEPAAPEKGEDSFAAQLDEVSAQLDELASNLDEPKSAAPSFSAEDAAVASALDGDADDDFDFLSGADEAAT
KLDLARAYIDMGDSEGARDILDEVLAEGNDSQQAEARELLERLA
>Q9HZA6 ~~~fimV~~~Motility hub protein FimV~~~
MVRLRTLVRAIAAASVLTSGMAHGLGLGEITLKSALNQPLDAEIELLEVRDLGSGEVIPSLASPEEFSKAGVDRLYYLTD
LKFTPVVKPNGKSVIRVTSSKPVQEPYLNFLVQVLWPNGRLLREYTVLLDPPLYSPQAAASAPQAPVSAPRATGAPRAPQ
APAPVRTTAPAGSDTYRTVSNDTLWEIAQRNRTDRVSVPQAMLAFQELNPGAFVDGNINRLKSGQVLRIPTEQQMLERSP
REALSQVQAQNQSWRGSRNPAAGSAGARQLDATQRNAAGSAPSKVDATDNLRLVSGEGKASKGADKGGKGDSKAIADTLA
VTKESLDSTRRENEELQSRMQDLQSQLDKLQKLIQLKDAQLAKLQGQLGAEGQGAAQPNAALPDASQPNAAAQAPAQPGT
PAAAAPTPAPAGEAPAAPAQPPVAPPPAPAAEKPPAPAVPAPAPVQAAEQPAPSFLDELLANPLWLAVIGGSALLALLVL
LMILSRRNAQKEKEEAQAFAADTGEEQEDALDLGKDGFDDLTLDEPEPQVAAVAPQVEKTTAQTSDALGEADIYIAYGRF
NQAAELLQNAIYDEPQRTDLRLKLMEVYAEMGDREGFARQENELREIGGAQPQVEQLKSRYPAMVAVAAVAGLAGAKLAQ
DELDSFSLDDLSLDDSGHAAKPDAAGQDLDDAFDLSLDDLGGDDVQADLKSDSGALDDLTLDSDLDLAASTPADKPVDDL
DFGLDFAELAETPSQPKHDDLGDFSLDLDAPEDKLSDDDFLLSLNDEVPAAAPADNEFTLDTEAAEEPALSLPDDFDLSL
ADEPTEPAAPEKGEDSFAAQLDEVSAQLDELASNLDEPKSATPSFSAEDAAVASALDGDADDDFDFLSGADEAATKLDLA
RAYIDMGDSEGARDILDEVLAEGNDSQQAEARELLERLA
>Q9HUK7 ~~~fimW~~~Cyclic-di-GMP receptor FimW~~~
MENQSPHLSLRVPTPTQQNLSFCDATPKDIKYWLAHLPKANLGETARQLYQGLIELNQLVLPVEARLQLLELFRPEVHFV
CAHLERHFLNQAIVLDERPRKIANLCQALQNHLAIGYKLIVVQEAPRNSRDRAQLFAVGIQRAIRSLCGPLIRASQLYCP
VPEGLWLELHQLYQLASQRGVHRLAVRDELAKHTPGLSVEQAYLIPLLLGCARCNQMRQNNIARLAEVLEPWSQLLSIQS
ATLPGSLFIAVPQIDGPPRYRSLYPETQLASALGIDTQPLVELIREYLLQPEAERAKARLPLIEGVTLDLLQHLSSAWGD
IAERTFQRTQGQGQLTLCIGMSALHYFLAGRRPFNEVLQIQEAPEAPRFKADVQDAWAGAFDAQKVTDWQPGMPLEEIEY
RPHQSPRSVQPGHPQAHAQADATEDYPTYALPIVNHSPGGYCLSWPKEVPAQLQAGELVGLQDLPGQAWSIAVVRWIRQV
RNGGTQMGIEMIAPAAQPCGLQLLRKTEQSSHYLRALLLPAIAAISRPATVITPRLPFQEGSRVQINLHGEERRAVLNRR
QASTGSFSQFEYRSAEPVNTPSDKPVTAPVARPPAGEEDFDSLWKSL
>P29367 ~~~finO~~~Fertility inhibition protein~~~
MTEQKRPVLTLKRKTEGETPTRSRKTIINVTTPPKWKVKKQKLAEKAAREAELTAKKAQARQALSIYLNLPTLDEAVNTL
KPWWPGLFDGDTPRLLACGIRDVLLEDVAQRNIPLSHKKLRRALKAITRSESYLCAMKAGACRYDTEGYVTEHISQEEEV
YAAERLDKIRRQNRIKAELQAVLDEQ
>P37553 ~~~fin~~~Anti-sigma-F factor Fin~~~
MALHYYCRHCGVKVGSLESSMVSTDSLGFQHLTNEERNDMISYKENGDVHVLTICEDCQEALDRNPHYHEYHTFIQ
>Q9HUW0 ~~~~~~Putative Fis-like DNA-binding protein~~~
MTTETLVSGTTPVSDNANLKQHLTTPTQEGQTLRDSVEKALHNYFAHLEGQPVTDVYNMVLCEVEAPLLETVMNHVKGNQ
TKASELLGLNRGTLRKKLKQYDLL
>P0A6R3 ~~~fis~~~DNA-binding protein Fis~~~COG2901
MFEQRVNSDVLTVSTVNSQDQVTQKPLRDSVKQALKNYFAQLNGQDVNDLYELVLAEVEQPLLDMVMQYTRGNQTRAALM
MGINRGTLRKKLKKYGMN
>Q5F881 ~~~fitA~~~Antitoxin FitA~~~
MASVVIRNLSEATHNAIKFRARAAGRSTEAEIRLILDNIAKAQQTVRLGSMLASIGQEIGGVELEDVRGRNTDNEVSL
>Q5F882 3.1.-.-~~~fitB~~~Toxin FitB~~~
MILLDTNVISEPLRPQPNERVVAWLDSLILEDVYLSAITVAELRLGVALLLNGKKKNVLHERLEQSILPLFAGRILPFDE
PVAAIYAQIRSYAKTHGKEIAAADGYIAATAKQHSLTVATRDTGSFFAADVAVFNPWHD
>P75780 ~~~fiu~~~Catecholate siderophore receptor Fiu~~~COG4774
MENNRNFPARQFHSLTFFAGLCIGITPVAQALAAEGQTNADDTLVVEASTPSLYAPQQSADPKFSRPVADTTRTMTVISE
QVIKDQGATNLTDALKNVPGVGAFFAGENGNSTTGDAIYMRGADTSNSIYIDGIRDIGSVSRDTFNTEQVEVIKGPSGTD
YGRSAPTGSINMISKQPRNDSGIDASASIGSAWFRRGTLDVNQVIGDTTAVRLNVMGEKTHDAGRDKVKNERYGVAPSVA
FGLGTANRLYLNYLHVTQHNTPDGGIPTIGLPGYSAPSAGTAALNHSGKVDTHNFYGTDSDYDDSTTDTATMRFEHDIND
NTTIRNTTRWSRVKQDYLMTAIMGGASNITQPTSDVNSWTWSRTANTKDVSNKILTNQTNLTSTFYTGSIGHDVSTGVEF
TRETQTNYGVNPVTLPAVNIYHPDSSIHPGGLTRNGANANGQTDTFAIYAFDTLQITRDFELNGGIRLDNYHTEYDSATA
CGGSGRGAITCPTGVAKGSPVTTVDTAKSGNLMNWKAGALYHLTENGNVYINYAVSQQPPGGNNFALAQSGSGNSANRTD
FKPQKANTSEIGTKWQVLDKRLLLTAALFRTDIENEVEQNDDGTYSQYGKKRVEGYEISVAGNITPAWQVIGGYTQQKAT
IKNGKDVAQDGSSSLPYTPEHAFTLWSQYQATDDISVGAGARYIGSMHKGSDGAVGTPAFTEGYWVADAKLGYRVNRNLD
FQLNVYNLFDTDYVASINKSGYRYHPGEPRTFLLTANMHF
>P60566 ~~~fixA~~~Protein FixA~~~COG2086
MKIITCYKCVPDEQDIAVNNADGSLDFSKADAKISQYDLNAIEAACQLKQQAAEAQVTALSVGGKALTNAKGRKDVLSRG
PDELIVVIDDQFEQALPQQTASALAAAAQKAGFDLILCGDGSSDLYAQQVGLLVGEILNIPAVNGVSKIISLTADTLTVE
RELEDETETLSIPLPAVVAVSTDINSPQIPSMKAILGAAKKPVQVWSAADIGFNAEAAWSEQQVAAPKQRERQRIVIEGD
GEEQIAAFAENLRKVI
>P31574 ~~~fixB~~~Protein FixB~~~COG2025
MNTFSQVWVFSDTPSRLPELMNGAQALANQINTFVLNDADGAQAIQLGANHVWKLNGKPDDRMIEDYAGVMADTIRQHGA
DGLVLLPNTRRGKLLAAKLGYRLKAAVSNDASTVSVQDGKATVKHMVYGGLAIGEERIATPYAVLTISSGTFDAAQPDAS
RTGETHTVEWQAPAVAITRTATQARQSNSVDLDKARLVVSVGRGIGSKENIALAEQLCKAIGAELACSRPVAENEKWMEH
ERYVGISNLMLKPELYLAVGISGQIQHMVGANASQTIFAINKDKNAPIFQYADYGIVGDAVKILPALTAALAR
>P10958 ~~~fixJ~~~Transcriptional regulatory protein FixJ~~~
MTDYTVHIVDDEEPVRKSLAFMLTMNGFAVKMHQSAEAFLAFAPDVRNGVLVTDLRMPDMSGVELLRNLGDLKINIPSIV
ITGHGDVPMAVEAMKAGAVDFIEKPFEDTVIIEAIERASEHLVAAEADVDDANDIRARLQTLSERERQVLSAVVAGLPNK
SIAYDLDISPRTVEVHRANVMAKMKAKSLPHLVRMALAGGFGPS
>P26488 ~~~fixK~~~Nitrogen fixation regulation protein FixK~~~COG0664
MSIAASVIAHIAPVPAQAYAHPMPSNRWSEMARGVAADESARPAQVVAALGTPAVFARNSEIFGDDQVAENVYVVVSGVV
RICKLMGDGRRQIEAFCLPGDAFGWETGERYRFSAEAVSECRLVRVKRSVLFARAGSDPELACALWALSFAELQRAQEHL
LLLGRKTAQERVGSFLLDLARRSGTTNASHVTEVTLAMSRQDIADFLGLTIETVSRTLTYLEEQGTISLPSSRRVLLRDR
SALRRLDS
>P29286 ~~~fixK~~~Nitrogen fixation regulation protein FixK~~~COG0664
MKPSVVMIEPNGHFCSDCAIRTSAVCSSLDAAELREFEHLGRRVHFSSGETVFSEEDITTSFYNVLEGVMRLYKLLPDGR
RQIVGFALPGDFLGMNLSGRHNFSADAIGAVTVCQFAKAPFGRFIEERPQLLRRINELAIRELSQARDHMVLLGRRSADE
KVAAFLLGWRERLLALKGASDTVPLPMSRQDIADYLGLTIETVSRTFTKLERHGAIAIIHGGISLLDPARVEALAAA
>P23222 2.7.13.3~~~fixL~~~Sensor protein FixL~~~COG4191
MAPTRVTHPPDDGRGEHFRVRIEGFGVGTWDLDLKTWALDWSDTARTLLGIGQDQPASYDLFLSRLEPDDRERVESAIKR
VSERGGGFDVSFRVAGTSNAGQWIRARAGLIRDEAGTARHLSGIFLDIDEEKQVEGALRTRETHLRSILHTIPDAMIVID
GHGIIQLFSTAAERLFGWSELEAIGQNVNILMPEPDRSRHDSYISRYRTTSDPHIIGIGRIVTGKRRDGTTFPMHLSIGE
MQSGGEPYFTGFVRDLTEHQQTQARLQELQSELVHVSRLSAMGEMASALAHELNQPLAAISNYMKGSRRLLAGSSDPNTP
KVESALDRAAEQALRAGQIIRRLRDFVARGESEKRVESLSKLIEEAGALGLAGAREQNVQLRFSLDPGADLVLADRVQIQ
QVLVNLFRNALEAMAQSQRRELVVTNTPAADDMIEVEVSDTGSGFQDDVIPNLFQTFFTTKDTGMGVGLSISRSIIEAHG
GRMWAESNASGGATFRFTLPAADEN
>P10955 2.7.13.3~~~fixL~~~Sensor protein FixL~~~
MLSKSGIERTQWGRRVVRWRGDGVAAYIVAAIVTSSVLAIRMIRAEPIGEGLLLFSFIPAILVVALIGGRNPILFAAGLS
LVAAVSHQQISSADGPSVVELLVFGSAVLLIVALGEVLEAARRAIDRTEDVVRARDAHLRSILDTVPDATVVSATDGTIV
SFNAAAVRQFGYAEEEVIGQNLRILMPEPYRHEHDGYLQRYMATGEKRIIGIDRVVSGQRKDGSTFPMKLAVGEMRSGGE
RFFTGFIRDLTEREESAARLEQIQAELARLARLNEMGEMASTLAHELNQPLSAIANYSHGCTRLLRDMDDAVATRIREAL
EEVASQSLRAGQIIKHLREFVTKGETEKAPEDIRKLVEESAALALVGSREQGVRTVFEYLPGAEMVLVDRIQVQQVLINL
MRNAIEAMRHVDRRELTIRTMPADPGEVAVVVEDTGGGIPEEVAGQLFKPFVTTKASGMGIGLSISKRIVEAHGGEMTVS
KNEAGGATFRFTLPAYLDERIVAND
>Q03073 7.1.1.9~~~fixN~~~Cytochrome c oxidase subunit 1 homolog, bacteroid~~~COG3278
MSQPSISKSMTIGESGLAVVFAATAFLCVIAAAKALDAPFAFHAALSAAASVAAVFCIVNRYFERPAALPPAEINGRPNY
NMGPIKFSSFMAMFWGIAGFLVGLIIASQLAWPALNFDLPWISFGRLRPLHTSAVIFAFGGNVLIATSFYVVQKSCRVRL
AGDLAPWFVVVGYNFFILVAGTGYLLGVTQSKEYAEPEWYADLWLTIVWVVYLLVFLATIIKRKEPHIFVANWFYLAFIV
TIAVLHLGNNPALPVSAFGSKSYVAWGGIQDAMFQWWYGHNAVGFFLTAGFLAIMYYFIPKRAERPIYSYRLSIIHFWAL
IFLYIWAGPHHLHYTALPDWTQTLGMTFSIMLWMPSWGGMINGLMTLSGAWDKLRTDPVLRMLVVSVAFYGMSTFEGPMM
SIKVVNSLSHYTDWTIGHVHSGALGWVGFVSFGALYCLVPWAWNRKGLYSLKLVNWHFWVATLGIVLYISAMWVSGILQG
LMWRAYTSLGFLEYSFIETVEAMHPFYIIRAAGGGLFLIGALIMAYNLWMTVRVGEAEVQMPVALQPAE
>Q05572 7.1.1.9~~~fixN~~~Cytochrome c oxidase subunit 1 homolog, bacteroid~~~
MKHTVEMVVLSVGAFLALVGAGLAQDRLFGAHMWVLFFALLAGTLVLMRRVDFRPAVAGHPGRRREYFDEVVKYGVVATV
FWGVVGFLVGVVVALQLAFPELNVEPWFNFGRVRPLHTSAVIFAFGGNALIATSFYVVQRTSRARLFGGDLGWFVFWGYQ
LFIVLAASGYLLGITQSREYAEPEWYVDLWLTIVWVAYLVAFLGTIMKRKEPHIYVANWFYLAFIVTIAMLHVVNNLAVP
VSFLGSKSYSAFSGVQDALTQWWYGHNAVGFFLTAGFLAMMYYFIPKQVNRPVYSYRLSIIHFWAIIFMYIWAGPHHLHY
TALPDWAQTLGMVFSIMLWMPSWGGMINGLMTLSGAWDKIRTDPVVRMMVMAVAFYGMATFEGPMMSIKTVNSLSHYTDW
TIGHVHSGALGWNGLITFGAIYYLVPKLWNRERLYSVRMVNWHFWLATLGIVVYAAVMWVAGIQQGLMWREYDDQGFLVY
SFAETVAAMFPYYVMRAAGGALFLAGALLMAFNVTMTILGRVRDEEPIFGAAPLPAPAE
>Q03075 ~~~fixP~~~Cbb3-type cytochrome c oxidase subunit FixP~~~COG2010
MTDHSEFDSVSGKTTTGHEWDGIKELNTPLPRWWVICFYLTIVWAIGYWIVYPAWPLISSNTTGLFGYSSRADVAVELAN
LEKIRGDKMAALGAASLADVEKDPALLALARAKGKTVFGDNCAPCHGSGGAGAKGFPNLNDDDWLWGGTLDQIMQTIQFG
ARSGHAKTHEGQMLAFGKDGVLKGDEIVTVANYVRSLSGLPTRKGYDAAKGEKIFVENCVACHGDGGKGNQEMGAPNLTD
KIWLYGSDEAALIETISQGRAGVMPAWEGRLDPSTIKAMAVYVHSLGGGK
>O86464 ~~~fixT~~~Transcriptional regulator protein FixT~~~
MLDGKTIIVVAADQGLRRSVAFALEVEGYYTESYDSVQKSEASCREALCAIVDDDILRTEPQAAAQFLSNRGGRAILLVD
GLSALQPPVDYATLTKPFTGADLLGVINSLVVAAK
>P68646 ~~~fixX~~~Ferredoxin-like protein FixX~~~COG2440
MTSPVNVDVKLGVNKFNVDEEHPHIVVKADADKQALELLVKACPAGLYKKQDDGSVRFDYAGCLECGTCRILGLGSALEQ
WEYPRGTFGVEFRYG
>P45523 5.2.1.8~~~fkpA~~~FKBP-type peptidyl-prolyl cis-trans isomerase FkpA~~~COG0545
MKSLFKVTLLATTMAVALHAPITFAAEAAKPATAADSKAAFKNDDQKSAYALGASLGRYMENSLKEQEKLGIKLDKDQLI
AGVQDAFADKSKLSDQEIEQTLQAFEARVKSSAQAKMEKDAADNEAKGKEYREKFAKEKGVKTSSTGLVYQVVEAGKGEA
PKDSDTVVVNYKGTLIDGKEFDNSYTRGEPLSFRLDGVIPGWTEGLKNIKKGGKIKLVIPPELAYGKAGVPGIPPNSTLV
FDVELLDVKPAPKADAKPEADAKAADSAKK
>Q9JYI8 5.2.1.8~~~fkpA~~~Probable FKBP-type peptidyl-prolyl cis-trans isomerase FkpA~~~
MNTIFKISALTLSAALALSACGKKEAAPASASEPAAASSAQGDTSSIGSTMQQASYAMGVDIGRSLKQMKEQGAEIDLKV
FTEAMQAVYDGKEIKMTEEQAQEVMMKFLQEQQAKAVEKHKADAKANKEKGEAFLKENAAKDGVKTTASGLQYKITKQGE
GKQPTKDDIVTVEYEGRLIDGTVFDSSKANGGPVTFPLSQVIPGWTEGVQLLKEGGEATFYIPSNLAYREQGAGDKIGPN
ATLVFDVKLVKIGAPENAPAKQPAQVDIKKVN
>P0A9L3 5.2.1.8~~~fklB~~~FKBP-type 22 kDa peptidyl-prolyl cis-trans isomerase~~~COG0545
MTTPTFDTIEAQASYGIGLQVGQQLSESGLEGLLPEALVAGIADALEGKHPAVPVDVVHRALREIHERADAVRRQRFQAM
AAEGVKYLEENAKKEGVNSTESGLQFRVINQGEGAIPARTDRVRVHYTGKLIDGTVFDSSVARGEPAEFPVNGVIPGWIE
ALTLMPVGSKWELTIPQELAYGERGAGASIPPFSTLVFEVELLEIL
>P74838 2.1.1.-~~~fkbM~~~31-O-demethyl-FK506 methyltransferase FkbM~~~
MSDVVETLRLPNGATVAHVNAGEAQFLYREIFTDRCYLRHGVELRPGDVVFDVGANIGMFMLFAHLEHPGVTVHAFEPAP
VPFAALRANAVRHRVAGRVDQCAVSDEAGVRRMTFYPDATLMSGFHPDAAARKELLRTLGLNGGYTAEDVDMMLAQLPDT
GEEIETSVVRLSDVIAERGIAAIGLLKIDVEKSERRVLAGVEDADWPRIRQVVAEVHDVDGALGEVVALLRGHGFTVVAE
QDPLFAGTEIHQVAARRTAG
>Q9KID9 3.3.2.13~~~fkbO~~~Chorismatase~~~
MTDAGRQGRVEALSISVTAPYCRFEKTGSPDLEGDETVLGLIEHGTGHTDVSLVDGAPRTAVHTTTRDDEAFTEVWHAQR
PVESGMDNGIAWARTDAYLFGVVRTGESGRYADATAALYTNVFQLTRSLGYPLLARTWNYVSGINTTNADGLEVYRDFCV
GRAQALDEGGIDPATMPAATGIGAHGGGITCVFLAARGGVRINIENPAVLTAHHYPTTYGPRPPVFARATWLGPPEGGRL
FISATAGILGHRTVHHGDVTGQCEVALDNMARVIGAENLRRHGVQRGHVLADVDHLKVYVRRREDLDTVRRVCAARLSST
AAVALLHTDIAREDLLVEIEGMVA
>P0A0W2 5.2.1.8~~~fbp~~~FK506-binding protein~~~
MGGLIIEDLQEGFGKEAVKGKEITVHYTGWLENGTKFDSSLDRRQPLTITLGVGQVIKGWDEGFGGMKEGGKRKLTIPSE
MGYGAHGAGGVIPPHATLIFEVELLKVYE
>P28725 5.2.1.8~~~fkbP~~~FK506-binding protein~~~
MSIEKPEVDFPGGEPPADLAIKDIWEGDGPVAQAGQTVSVHYVGVAFSTGEEFDASWNRGTPLQFQLGAGQVISGWDQGV
QGMKVGGRRELIIPAHLAYGDRGAGGGKIAPGETLIFVCDLVAV
>P0AEM0 5.2.1.8~~~fkpB~~~FKBP-type 16 kDa peptidyl-prolyl cis-trans isomerase~~~COG1047
MSESVQSNSAVLVHFTLKLDDGTTAESTRNNGKPALFRLGDASLSEGLEQHLLGLKVGDKTTFSLEPDAAFGVPSPDLIQ
YFSRREFMDAGEPEIGAIMLFTAMDGSEMPGVIREINGDSITVDFNHPLAGQTVHFDIEVLEIDPALEA
>P44760 5.2.1.8~~~~~~Probable FKBP-type peptidyl-prolyl cis-trans isomerase~~~COG0545
MLKIQKLSIAALMVSAVISGQVFAEDNTFDEKAASYAVGTLMGSQMKDLVDSHKEVIKYDNARILDGLKDALEGKVDVRK
DEKIQKTLESIEAKLVAASKAKAESIAKQAKEEGDKFRAEFAKGKDVKTTQSGLMYKIESAGKGDTIKSTDTVKVHYTGK
LPNGKVFDSSVERGQPVEFQLDQVIKGWTEGLQLVKKGGKIQFVIAPELGYGEQGAGASIPPNSTLIFDVEVLDVNPKSE
K
>P11089 ~~~fla~~~Flagellar filament 41 kDa core protein~~~
MIINHNTSAINASRNNGINAANLSKTQEKLSSGYRINRASDDAAGMGVSGKINAQIRGLSQASRNTSKAINFIQTTEGNL
NEVEKVLVRMKELAVQSGNGTYSDADRGSIQIEIEQLTDEINRIADQAQYNQMHMLSNKSASQNVRTAEELGMQPAKINT
PASLSGSQASWTLRVHVGANQDEAIAVNIYAANVANLFSGEGAQTAQAAPVQEGVQQEGAQQPAPATAPSQGGVNSPVNV
TTTVDANTSLAKIENAIRMISDQRANLGAFQNRLESIKDSTEYAIENLKASYAQIKDATMTDEVVAATTNSILTQSAMAM
IAQANQVPQYVLSLLR
>P56963 ~~~flaA~~~Flagellin A~~~COG1344
MGFRINTNVAALNAKANADLNSKSLDASLSRLSSGLRINSAADDASGMAIADSLRSQANTLGQAISNGNDALGILQTADK
AMDEQLKILDTIKTKATQAAQDGQSLKTRTMLQADINRLMEELDNIANTTSFNGKQLLSGNFINQEFQIGASSNQTVKAT
IGATQSSKIGLTRFETGGRISTSGEVQFTLKNYNGIDDFQFQKVVISTSVGTGLGALADEINKNADKTGVRATFTVETRG
IAAVRAGATSDTFAINGVKIGKVDYKDGDANGALVAAINSVKDTTGVEASIDANGQLLLTSREGRGIKIDGNIGGGAFIN
ADMKENYGRLSLVKNDGKDILISGSNLSSAGFGATQFISQASVSLRESKGQIDANIADAMGFGSANKGVVLGGYSSVSAY
MSSAGSGFSSGSGYSVGSGKNYSTGFANAIAISAASQLSTVYNVSAGSGFSSGSTLSQFATMKTTAFGVKDETAGVTTLK
GAMAVMDIAETAITNLDQIRADIGSVQNQVTSTINNITVTQVNVKAAESQIRDVDFAAESANYSKANILAQSGSYAMAQA
NSVQQNVLRLLQ
>P21989 ~~~~~~Flagellar filament 33 kDa core protein~~~
MIINHNMSAMFAQRTLGNTNLSVQKNMEKLSSGLRINRAGDDASGLAVSEKMRSQIRGLNQASTNAQNGISFIQVAESYL
QETTDVIQRIRELSVQSANGIYSAEDRMYIQVEVSQLVAEIDRIASHAQFNGMNMLTGRFARETGENTVTASMWFHIGAN
MDQRTRAYIGTMTAAALGVRDVGDESILNIDDPEKANRAIGTLDEAIKKINKQRADLGAYQNRLEYTVIGVNVAAENLQA
AESRIRDVDMAKEMVDYTKNQILVQSGTAMLAQANQATQSVLSLLR
>P32520 ~~~flaA1~~~Flagellar filament outer layer protein flaA1~~~
MKKLFVVLTSIFIAASAYGLTNSTLIDFALTGNADNLQAGEGDTNEVVPVAENLYNDNWVVWLNESARLTENRRNSYVTN
VDSKGNNGAWEAGKVLGVRVHFPLAAWNSYALVKPVYELEMYGGADGTKYTEGKGVIHNVGEIKSISSWVYGRNYLISYF
VNLQNEFGELKSYPMGTVYFNGWRQVRWENREYLPNVRDRVLVREPLYPRMIPSVKLDSLGFYRTKDTKGGDFITYVKDV
TLEYDVVVVDFEEDIDDEATWQLLKTENDRKQAIESARIREQAELRDLEQRRIGDGTAADQGAAANTGAADTGAAQEQAQ
>C0QWY9 ~~~flaAL~~~Putative flagellar filament outer layer-like protein~~~
MFAQDAAQTGEQTTQNQGENGNNFVTEAITNYLIDDFEFANTWQASMPRDYGVVSIIRREGGPADVVAEGAENNKYILGA
KVEYFRTGYPWFSVTPPRPVKIPGYTKELSVWVAGRNHNNRMSFYVYDVNGKPQAVGNEALNFMGWKNITVQIPANIRQE
EFRGQVEQGISFMGIHVKVDPRDSYGKYYIYFDQLMAKTDMYLETYREEDDPLDTW
>O67803 ~~~flaA~~~Flagellin~~~COG1344
MATRINYNYEAAVTYTSLKQNERLMNKSLLRLSTGLRILSAADDASGLFIADQLALVSAGLEQGNRNIQFGISALQIAEG
GVSQIYDKLKTMYQKAVSAANDINDPNARAALQRDIENLRDAILKIAQDTEYNGIRLLNGSFNNVRIHYGARSAQTLSVS
ISSVLPQQLGGYVAEDSPATATDTNNVLTNIGTTNTNYSVASGDSLAFTFTDGTSITFNSLNQLGYDFNNTGTYILDASA
IVNTINNNPTLQGKGIRAYAENVSEADLTFDTTNVNIDQGDEVTITFYSGGELVFTKTYTDTVTLDQFIADINNQAGGKL
IASKDPSGTKLVLSTPNGETISVEVTVNDADGDTVVSSINLGALLQGAAGTVVNTSGATASAVKVGTLIVMGSENFTVQG
TGIAYFTAATSGTFNSLNDVDVTTNKGAEIAQVLIQRAVRQVDTIRTQIGSTINNLQAIYDAQAVAKDNTDNAESIIRNV
DFAKEMTEFTKYQIRMQSGVAMLAQANALPQLVLQLLR
>Q06064 ~~~flaA~~~Flagellin~~~COG1344
MAAVINTNYLSLVAQNNLNKSQSALGSAIERLSSGLRINSAKDDAAGQAIANRFTANVKGLTQAARNANDGISIAQTTEG
ALNEINNNLQRIRELTVQASNGTNSASDIDSIQQEVNQRLEEINRIAEQTDFNGIKVLKSNATDMTLSIQVGAKDNETID
IKIDRNSNWNLYDAVGTVPGGTVNGEARTVNALGFDVLSAVTTTIASDTVTFDAAVAAAEQAAGAAVGDGSVVSYGDTAN
PQYAVVVDNAGTMTSYALTFDKDGKAALGDQLGAVASQAAEAAVGTNDVAAGANVTVSGGAADALSKLDDAMKAVDEQRS
SLGAIQNRFESTVANLNNTITNLSAARSRIEDSDYATEVSNMTKNQILQQAGTSVLAQANQVPQNVLSLLR
>C0XTM5 ~~~flaA~~~Lantibiotic flavucin~~~
MSDFTLDFAEGDAADTVSPQITSKSLCTPGCITGWMMCNTVTKGCSFTIGK
>P50612 ~~~flaA~~~Flagellin A~~~
AFQVNTNINALTTSAGATQLGLKNSLEKLSSGLRINKAADDASGMTISDSLRSQASALGQAISNANDGIGIIQVADKAMD
EQLKILDTIKVKATQAAQDGQSLESRKAIQSDIIRLIQGLDNIGNTTSYNGQSLLSGQWTNKEFQIGTYSNQSIKVSVGS
TTSDKIGQVRINTGAMITAASEATLTFKQINGGGTSPLEGVKISHSVGTGLGVLAEVINKNSDKTGIRAKASVETTSDKE
IMSGNLKNLTINDVNIGNIVDIKKGDADGRLVQAINALTSSTGVEASTDSKGRLNLRSVDGRGIVLKADASEDNGDGKSA
PMAIDAVNGGQSITDGEGAANYGRLSLVRLDARDIVLTSSDKPDENKFSAIGFGDNNVAMATVNLRDVLGKFDASVKSAS
GANYNAVIASGNSNLGAGVTTLVGAMLVMDIADSARKTLDKIRSDLGSVQGQMVSTVNNISVTQVNVKAAESRMREVDFA
AESAEFNKYNILAQ
>P0A0S1 ~~~flaA~~~Flagellin A~~~COG1344
MAFQVNTNINAMNAHVQSALTQNALKTSLERLSSGLRINKAADDASGMTVADSLRSQASSLGQAIANTNDGMGIIQVADK
AMDEQLKILDTVKVKATQAAQDGQTTESRKAIQSDIVRLIQGLDNIGNTTTYNGQALLSGQFTNKEFQVGAYSNQSIKAS
IGSTTSDKIGQVRIATGALITASGDISLTFKQVDGVNDVTLESVKVSSSAGTGIGVLAEVINKNSNRTGVKAYASVITTS
DVAVQSGSLSNLTLNGIHLGNIADIKKNDSDGRLVAAINAVTSETGVEAYTDQKGRLNLRSIDGRGIEIKTDSVSNGPSA
LTMVNGGQDLTKGSTNYGRLSLTRLDAKSINVVSASDSQHLGFTAIGFGESQVAETTVNLRDVTGNFNANVKSASGANYN
AVIASGNQSLGSGVTTLRGAMVVIDIAESAMKMLDKVRSDLGSVQNQMISTVNNISITQVNVKAAESQIRDVDFAEESAN
FNKNNILAQSGSYAMSQANTVQQNILRLLT
>P13118 ~~~flaA~~~Flagellin A~~~
MTSILTNNSAMAALSTLRSISSSMEDTQSRISSGLRVGSASDNAAYWSIATTMRSDNQALSAVQDALGLGAAKVDTAYSG
MESAIEVVKEIKAKLVAATEDGVDKAKIQEEITQLKDQLTSIADAASFSGENWLQADLSGGAVTKSVVGSFVRDGSGSVA
VKKVDYSLNANSVLFDTVGDTGILDKVYNVSQASVTLTVNTNGVESQHTVAAYSLESLTEAGAEFQGNYALQGGNSYVKV
ENVWVRAETAATGATGQEIAATTTAAGTITADSWVVDVGNAPAANVSAGQSVANINIVGMGAAALDALISGVDAALTDMT
SAAASLGSISSRIDLQSEFVNKLSDSIESGVGRLVDADMNEESTRLKALQTQQQLAIQALSIANSDSQNVLSLFR
>P21982 ~~~flaA~~~Flagellar filament outer layer protein~~~
MKRFFAILGAALFVGNSGAFAEQATLIDFSKLVGEGNTGLHAPTTIDYSRQAGSAYSAEDKAAMKISLAIPSWEIELASS
SQTVENQTLSLVTAAPVKQDAARYGGETVMGVRIHFPSFGINSFAVIKPPFTIPAYATLGDATAQNAVAGGQFDGFGVLK
NVGVIKSIQINILGRNYLNRLSLLLEDQNGDEREIVMGYLNFDGWKSLQWNNPNYQTEVRNRDLQIVPLYPRSAPLIKLK
GIKIHRDGSQEGGDIVSYIKDIKVIYDQAVVDRNSDVDDEAIWGILRQREEQYRNFELAKLGNLQVLRSLEKKKMAKEAD
FDQAAPAAAAARAPATN
>P21990 ~~~flaB1~~~Flagellin FlaB1~~~COG1344
MIINHNMSAMFAQRTLGHTNVQVGKGIEKLSSGYRINRAGDDASGLAVSEKMRSQIRGLNQASTNASNGVNFIQVTEAYL
QETTDIMQRIRELAIQAANGIYSAEDRMQIQVEVSQLVAEVDRIASSAQFNGMNLLTGRFSRTEGENVIGGSMWFHIGAN
MDQRMRVYIGTMTAVALGVRNGVDESIMSIETADSANKSIGTIDAALKRINKQRADLGGYQNRMEYTVVGLDIAAENLQA
AESRIRDANIAKQMVEYTKNQVLTQSGTAMLAQANTSAQSILSILR
>P80160 ~~~flaB2~~~Flagellar filament core protein flaB2~~~
MIINNNISAINAQRTLKFRNVDLSKDMAALSSGMRINRAGDDASGLAVSEKMRTQIRGLRQAERNNSSGISFIQTTEGYL
QESQDILQRIRELAVQSANGIYTDADRMLIQVEVSQLVDEVNRIASHAQFNTLNMLTGRFSNPNEGGAPVASMWFHIGAN
MDERRRVYIGTMTAAALGLQTAEGTGISISSIDKANSAIGIVDEALTKVSKQRSNLPAYQNRLELTAQGLMIAYENTAAS
ESRIRDTDMAETSVKFAKDQILSQANLAMLAQANTMNQGALRLIQ
>Q9KWX0 ~~~flaB2~~~Flagellar filament 31.3 kDa core protein~~~
MIINHNMSAMYSNRVLGVTNLAQAKDMEKLSSGMKINRAGDDASGLAVSEKMRSQIRGLNQASRNAQNGISFIQVSEGYL
QETTDIMHRIRELAVQSSNGIYSDEDRMQIQVEVSQLVAEVDRIASHAQFNGMNMLTGRFARATGENTVTGSMWLHIGAN
MDQRMQVFIGTMTAMAVGVREIGSEKVMSIAAPDDANRAIGTIDEGLKKINKQRADLGAYQNRLEMTVKGLDVAAENTQA
AESTIRDTDMAKQMVDFTKNRILAQAGTAMLAQANVTTQNVLTLLQ
>P21991 ~~~flaB2~~~Flagellin FlaB2~~~COG1344
MIINHNMSAMFSQRTLGHTNLSVQKNIEKLSSGLRINRSGDDASGLAVSEKMRSQIRGLNQASTNAQNGISFIQVAEAFL
QETTDVIQRIRELSVQAANGIYSAEDRLYIQVEVSQLVAEVDRIASHAQFNGMNMLTGRFARQGGENTVTASMWFHIGAN
MDQRTRAYIGTMTAVAMGIRDAGDESVMNIDSPEKANRAIGTLDQAIKRINKQRADLGAYQNRLDHTVAGINVAAENLQA
AESRIRDVDMAKEMVDYTKNQILVQSGTAMLAQANQATQSVLSLLR
>Q9KWW9 ~~~flaB3~~~Flagellar filament 30.7 kDa core protein~~~
MIINHNMSSMYANRMLGINNDQVQGNIEKLSSGQRINRAGDDASGLAVSEKMRMQIRGLNQAQKNIQNGVSFIQATEGYL
QETTDILGRIRELSIQSANGIYSDEDRMQIQVEVSQLIDEVDRIASSAQFNGMNMLTGAFAANSVSGRIMQFHIGANVDQ
NARVYIGTMTAQSLGLVGTQGDAFAKLSIASPESANMAIATLDSALTSVNKQRADLGAYQNRFEMAAKGIGIASENLQAA
ESIIRDTDMASEIVDYTKNQILTQSSVAMLAQANTQAQNVLPLLS
>P21992 ~~~flaB3~~~Flagellin FlaB3~~~COG1344
MIINHNMSAMFAQRQGGINGLAIAKNIEKLSSGYRINRAGDDASGLAVSEKMRSQIRGLNQAGQNIQNGISFIQATEGYL
AETTEIVQRLRELAIQAANGIYSAEDRMQIQVEVSQLVDEVDRIASQAQFNGMNLLTGRFSRESALGPMQLHVGANMDQN
EKIFINTMTASALGFFSDEGTDGSRSISIATVDGANKVIGTLDSALKEINKQRADLGAYQNRFETAYQGIAIAAENLQAA
ESRIRDADLAQQMVDYTKNQILEQSTMAMLAQANTQPQAVLRLMQ
>Q07911 ~~~flaB~~~Flagellin B~~~COG1344
MSFRINTNIAALTSHAVGVQNNRDLSSSLEKLSSGLRINKAADDSSGMAIADSLRSQSANLGQAIRNANDAIGMVQTADK
AMDEQIKILDTIKTKAVQAAQDGQTLESRRALQSDIQRLLEELDNIANTTSFNGQQMLSGSFSNKEFQIGAYSNATVKAS
IGSTSSDKIGHVRMETSSFSGAGMLASAAAQNLTEVGLNFKQVNGVNDYKIETVRISTSAGTGIGALSEIINRFSNTLGV
RASYNVMATGGTPVQSGTVRELTINGVEIGTVNDVHKNDADGRLTNAINSVKDRTGVEASLDIQGRINLHSIDGRAISVH
AASASGQVFGGGNFAGISGTQHAVIGRLTLTRTDARDIIVSGVNFSHVGFHSAQGVAEYTVNLRAVRGIFDANVASAAGA
NANGAQAETNSQGIGAGVTSLKGAMIVMDMADSARTQLDKIRSDMGSVQMELVTTINNISVTQVNVKAAESQIRDVDFAE
ESANFSKYNILAQSGSFAMAQANAVQQNVLRLLQ
>O51941 ~~~flaB~~~Flagellar filament 35 kDa core protein~~~
MIINHNLSAVNAHRSLKFNELAVDKTMKALSSGMRINSAADDASGLAVSEKLRTQINGLRQAERNTEDGMSFIQTAEGFL
EQTSNIIQRIRVLAIQTSNGIYSNEDRQLVQVEVSALVDEVDRIASQAEFNKFKLFEGQFARGSRVASMWFHMGPNQNQR
ERFYIGTMTSKALKLVKADGRPIAISSPGEANDVIGLADAALTKIMKQRADMGAYYNRLEYTAKGLMGAYENMQASESRI
RDADMAEEVVSLTTKQILVQSGTAMLAQANMKPNSVLKLLQQI
>Q56572 ~~~flaB~~~Flagellin B~~~
MAINVSTNVSAMTAQRYLNNAADGTQKSMERLSSGYKINSARDDAAGLQISNRLTSQSRGLDMAVRNANDGISIAQTAEG
AMNETTNILQRMRDLSLQSANGSNSSSERQAIQEEVSALNDELNRIAETTSFGGNKLLNGSFGNKSFQIGADSGEAVMLS
MSDMRSDTKAMGGKSYVATNGKAPDWSVTNATDLTLSYTDKQGEAREVTINAKAGDDLEEVATYINGQNGDIKASVGDEG
KLQLFAANQKVSSDVTIGGGLGTEIGFAAGKDVTVKDINVTTVGGSQEAVALIDGALKAVDSQRASLGAFQNRFGHAISN
LDNINENVNASRSRIKDTDYARETTQMTKSQILQQASTSVLAQAKQSPSAALSLLG
>P0DPD2 ~~~flaC~~~Secreted flagellin C~~~
MMISDATMMQQNYYLNNAQKASDKALENIAAVRAISGVDSANLAIADSLRSQSSTIDQGVANAYDAIGVLQIADASLTNI
SQSADRLNELSVKMNNAALNDSQKGMLRTEATRIQESINDSFNNATYNGKNVFQTMNFVVGSGTETTNLNPLATGGLSID
NQDSITNFMDQLGSLRSEIGSGINAITSNINASVQNSINSKAAENNLLNNDMAKNVNDFNANYLKENAAAFVAAQSNMQL
QSKIANLLQ
>Q56574 ~~~flaC~~~Flagellin C~~~
MAVNVNTNVSAMTAQRYLNSASNAQQLSMERLSSGFKINNAKDDAAGLQISNRLNVQSRGLDVAVRNANDGISIAQTAEG
AMNETTNILQRMRDLSLQSSNGSNSKADRVAIQEEITALNDELNRIAETTSFGGNKLLNGTFETKSFQIGADNGEAVMLS
LNNMRSDNAMMGGKSYQAANGQDKDWTVKAGANDLTITLTDKRTGEQTINLSAKDGDDIEELATYINGQTDMLKASVDDE
GKLQIFTDSNRIDGVATFGGSLAGELSFQAAKDVTVDTIDVTSVGGSQESVAIVDAALQFVDSHRAQLGAFQNRFNHAIN
NLDNINENVNASKSRIKDTDFAKETTALTKSQILSQASSSVLAQAKQAPNAALGLLG
>Q56571 ~~~flaD~~~Flagellin D~~~
MAVNVNTNVSAMTAQRYLNGASNAQQTSMERLSSGFKINSAKDDAAGLQISNRLNVQSRGLDVAVRNANDGISIAQTAEG
AMNETTNILQRMRDLSLQSANGSNSKADRVAIQEEVTALNDELNRIAETTSFGGNKLLNGTFETKSFQIGADNGEAVMLS
LNNMRSDNAMMGGTSYQAANGKDKDWSVQAGSNDLQITLKDTSGTDQTINISAKEGDDIEELATYINGQTDMVKASVDDE
GKLQVFAGSNKVEGPVTFAGGLAGELGMQAGQAVTVDVIDVTSVGGAQESVAIVDAALQFVDSHRAQLGAFQNRFSHAIS
NLDNINENVSASKSRIKDTDFAKETTALTKSQILSQASSSVLAQAKQAPQAALSLLG
>P0C6C6 ~~~flaD~~~Flagellin D~~~COG1344
MAVNVNTNVAAMTAQRYLTGATNAQQTSMERLSSGFKINSAKDDAAGLQISNRLNVQSRGLDVAVRNANDGISIAQTAEG
AMNETTNILQRMRDLSLQSANGSNSKSERVAIQEEITALNDELNRIAETTSFGGNKLLNGTFSTKSFQIGADNGEAVMLT
LKDMRSDNRMMGGTSYVAAEGKDKDWKVQAGANDITFTLKDIDGNDQTITVNAKEGDDIEEVATYINGQTDMVKASVNEK
GQLQIFAGNNKVTGDVAFSGGLAGALNMQAGTAETVDTIDVTSVGGAQQSVAVIDSALKYVDSHRAELGAFQNRFNHAIS
NLDNINENVNASKSRIKDTDFAKETTALTKSQILSQASSSVLAQAKQAPNAALSLLG
>O67866 ~~~fldA~~~Flavodoxin~~~COG0655
MGKVLVIYDTRTGNTKKMAELVAEGARSLEGTEVRLKHVDEATKEDVLWADGLAVGSPTNMGLVSWKMKRFFDDVLGDLW
GEIDGKIACAFSSSGGWGGGNEVACMSILTMLMNFGFLVFGVTDYVGKKFTLHYGAVVAGEPRSEEEKEACRRLGRRLAE
WVAIFVDGRKELLEKIRKDPARFVD
>P23001 ~~~nifF~~~Flavodoxin B~~~
MAKIGLFFGSNTGKTRKVAKSIKKRFDDETMSDAVNVNRVSAEDFAQYQFLILGTPTLGEGELPGLSSDCENESWEEFLP
KIEGLDFSGKTVALFGLGDQVGYPENFLDAMGELHSFFTERGAKVVGAWSTDGYEFEGSTAVVDGKFVGLALDLDNQSGK
TDERVAAWLAQIAPEFGLSL
>P00324 ~~~nifF~~~Flavodoxin 2~~~
MAKIGLFFGSNTGKTRKVAKSIKKRFDDETMSDALNVNRVSAEDFAQYQFLILGTPTLGEGELPGLSSDCENESWEEFLP
KIEGLDFSGKTVALFGLGDQVGYPENYLDALGELYSFFKDRGAKIVGSWSTDGYEFESSEAVVDGKFVGLALDLDNQSGK
TDERVAAWLAQIAPEFGLSL
>P00322 ~~~~~~Flavodoxin~~~
MKIVYWSGTGNTEKMAELIAKGIIESGKDVNTINVSDVNIDELLNEDILILGCSAMGDEVLEESEFEPFIEEISTKISGK
KVALFGSYGWGDGKWMRDFEERMNGYGCVVVETPLIVQNEPDEAEQDCIEFGKKIANI
>P26492 ~~~~~~Flavodoxin~~~COG0716
MSKVLIVFGSSTGNTESIAQKLEELIAAGGHEVTLLNAADASAENLADGYDAVLFGCSAWGMEDLEMQDDFLSLFEEFNR
IGLAGRKVAAFASGDQEYEHFCGAVPAIEERAKELGATIIAEGLKMEGDASNDPEAVASFAEDVLKQL
>P00323 ~~~~~~Flavodoxin~~~COG0716
MPKALIVYGSTTGNTEYTAETIARELADAGYEVDSRDAASVEAGGLFEGFDLVLLGCSTWGDDSIELQDDFIPLFDSLEE
TGAQGRKVACFGCGDSSYEYFCGAVDAIEEKLKNLGAEIVQDGLRIDGDPRAARDDIVGWAHDVRGAI
>P71165 ~~~~~~Flavodoxin~~~COG0716
MANVLIVYGSTTGNTAWVAETVGRDIAEAGHSVEIRDAGQVEAEGLCEGRDLVLFGCSTWGDDEIELQDDFIHLYESLEA
TGAGKGRAACFGCGDSSYTYFCGAVDAIEERLSGLGADIVADSLKIDGDPRTMRDDVSAWAGRVVTAL
>P61949 ~~~fldA~~~Flavodoxin 1~~~COG0716
MAITGIFFGSDTGNTENIAKMIQKQLGKDVADVHDIAKSSKEDLEAYDILLLGIPTWYYGEAQCDWDDFFPTLEEIDFNG
KLVALFGCGDQEDYAEYFCDALGTIRDIIEPRGATIVGHWPTAGYHFEASKGLADDDHFVGLAIDEDRQPELTAERVEKW
VKQISEELHLDEILNA
>Q9ZK53 ~~~fldA~~~Flavodoxin~~~COG0716
MGKIGIFFGTDSGNAEAIAEKISKAIGNAEVIDVAKASKEQFNGFTKVILVAPTAGAGDLQTDWEDFLGTLEASDFANKT
IGLVGLGDQDTYSETFAEGIFHIYEKAKAGKVVGQTPTDGYHFEASKAVEGGKFVGLVIDEDNQDDLTDERIAKWVEQVK
GSFA
>O25776 ~~~fldA~~~Flavodoxin~~~COG0716
MGKIGIFFGTDSGNAEAIAEKISKAIGNAEVVDVAKASKEQFNSFTKVILVAPTAGAGDLQTDWEDFLGTLEASDFATKT
IGLVGLGDQDTYSETFAEGIFHIYEKAKAGKVVGQTPTDGYHFEASKAVEGGKFVGLVIDEDNQDDLTDERISKWVEQVK
GSFA
>O07026 ~~~fldA~~~Flavodoxin~~~
MAIIGIFFGSDTGNTENIAKMIQKQLGKDVADVHDIAKSSKEDLEAHDILLLGIPTWYYGEAQCDWDDFFPTLEEIDFNG
KLVALFGCGDQEDYAEYFCDALGTIRDIIEPRGATIVGHWPTAGYHFEASKGLADDDHFVGLAIDEDRQPELTNERVEKW
VKQVAEELHLEEIKNA
>P00321 ~~~~~~Flavodoxin~~~
MVEIVYWSGTGNTEAMANEIEAAVKAAGADVESVRFEDTNVDDVASKDVILLGCPAMGSEELEDSVVEPFFTDLAPKLKG
KKVGLFGSYGWGSGEWMDAWKQRTEDTGATVIGTAIVNEMPDNAPECKELGEAAAKA
>Q01095 ~~~~~~Flavodoxin~~~
MPKALIVYGSTTGNTEGVAEAIAKTLNSEGMETTVVNVADVTAPGLAEGYDVVLLGCSTWGDDEIELQEDFVPLYEDLDR
AGLKDKKVGVFGCGDSSYTYFCGAVDVIEKKAEELGATLVASSLKIDGEPDSAEVLDWAREVLARV
>P0A3D9 ~~~isiB~~~Flavodoxin~~~COG0716
MSKKIGLFYGTQTGKTESVAEIIRDEFGNDVVTLHDVSQAEVTDLNDYQYLIIGCPTWNIGELQSDWEGLYSELDDVDFN
GKLVAYFGTGDQIGYADNFQDAIGILEEKISQRGGKTVGYWSTDGYDFNDSKALRNGKFVGLALDEDNQSDLTDDRIKSW
VAQLKSEFGL
>P0A3E0 ~~~isiB~~~Flavodoxin~~~
MSKKIGLFYGTQTGKTESVAEIIRDEFGNDVVTLHDVSQAEVTDLNDYQYLIIGCPTWNIGELQSDWEGLYSELDDVDFN
GKLVAYFGTGDQIGYADNFQDAIGILEEKISQRGGKTVGYWSTDGYDFNDSKALRNGKFVGLALDEDNQSDLTDDRIKSW
VAQLKSEFGL
>P52967 ~~~nifF~~~Flavodoxin~~~COG0716
MAKIGLFFGSDTGTTRKIAKQIKDMFDDEVMAKPLNVNRADVADFMAYDFLILGTPTLGDGQLPGLSANAASESWEEFLP
RIADQDFSGKTIALFGLGDQVTYPLEFVNALFFLHEFFSDRGAKLVGRWPAKGYGFEDSLAVVEGEFLGLALDQDNQAAL
TPERLKGWLSLIAADFGLVLPA
>Q8ZQX1 ~~~fldA~~~Flavodoxin 1~~~
MAITGIFFGSDTGNTENIAKMIQKQLGKDVADVHDIAKSSKEDLEGYDILLLGIPTWYYGEAQCDWDDFFPTLEEIDFNG
KLVALFGCGDQEDYAEYFCDALGTIRDIIEPRGATIVGHWPTAGYHFEASKGLADDDHFVGLAIDEDRQPELTAERVEKW
VKQVSAELHLDDILNA
>P10340 ~~~isiB~~~Flavodoxin~~~COG0716
MAKIGLFYGTQTGVTQTIAESIQQEFGGESIVDLNDIANADASDLNAYDYLIIGCPTWNVGELQSDWEGIYDDLDSVNFQ
GKKVAYFGAGDQVGYSDNFQDAMGILEEKISSLGSQTVGYWPIEGYDFNESKAVRNNQFVGLAIDEDNQPDLTKNRIKTW
VSQLKSEFGL
>P31158 ~~~isiB~~~Flavodoxin~~~COG0716
MSKIGLFFGTQTGNTEELAQAIQAAFGGSDIVELFDVAEVDIEALRDFDQLIIGCPTWNVGELQSDWEALYDDLDDVDFS
GKTIAYFGAGDQVGYADNFQDAMGVLEEKITSLGGKTVGQWPTAGYDHSESKAERDGKFVGLAIDEDNQPELTAERIQAW
VAQLKPAFGL
>P27319 ~~~isiB~~~Flavodoxin~~~COG0716
MTKIGLFYGTQTGNTETIAELIQKEMGGDSVVDMMDISQADVDDFRQYSCLIIGCPTWNVGELQSDWEGFYDQLDEIDFN
GKKVAYFGAGDQVGYADNFQDAMGILEEKISGLGGKTVGFWPTAGYDFDESKAVKNGKFVGLALDEDNQPELTELRVKTW
VSEIKPILQS
>P52964 ~~~~~~Flavodoxin 1~~~COG0716
MSRIGIFYGSSSGVTGKVAEKLAELLGEERCDLYNMEEDFVDFDDMLKYDHLLFGCSTWGSGEVQNDWRDPLLELDNEKP
DFSGKTIALFGAGDYVSHGEQFVSALGVLYDKFKARGAALVGSFPTDGYTYEYSFAVRDGKFVGLPFDKINEVDKTDERL
ERWIAVLQEEFLPA
>P80312 ~~~~~~Flavodoxin~~~COG0716
MSKVLILFGSSTGNTESIAQKLEELVAAGGHEVTLLNAAEASADNLADGYDAVLMGCSAWGMEDLELQDDFAPLFDEMEN
MGLKGKKLAAFASGDMEYEHYCGAVPAIEEKARGLGAEVICEGLKIEGDASSDPDAVSAFAEDVLKKL
>P0ABY4 ~~~fldB~~~Flavodoxin 2~~~COG0716
MNMGLFYGSSTCYTEMAAEKIRDIIGPELVTLHNLKDDSPKLMEQYDVLILGIPTWDFGEIQEDWEAVWDQLDDLNLEGK
IVALYGLGDQLGYGEWFLDALGMLHDKLSTKGVKFVGYWPTEGYEFTSPKPVIADGQLFVGLALDETNQYDLSDERIQSW
CEQILNEMAEHYA
>A0A1G9FQX8 2.5.1.63~~~~~~Fluorinase~~~
MSDSYSRPIIAFMSDLGTTDDSVAQCKGLMMSICQDVTVVDVCHSMEPWNVEEGARYIVDLPRFFPEGTVFATTTYPATG
TTARSVAVRIKYPAKGGARGQWAGSGEGFERSEGSYIYIAPNNGLLTTVLQEHGYTEAYEVSSTDVVPARPEPTFYSREM
VAIPSAHLAAGYPLEKVGRKLQDSEIVRFTPPQATVSPEGDLSGVVTAIDHPFGNIWTSIHRDNLESAGVGYGTNLKIVL
DDVFPFELPLSPTFADAGEVGDPVVYVNSRGYLSLARNAASLAYPYNLKEGMSVRVTRS
>R4LHX8 2.5.1.63~~~flA3~~~Fluorinase~~~COG1912
MPANGNPIIAFMSDLGTTDDSVAQCKGLMLSICPGVTIVDVNHSMTPWDVEEGARYIVDLPRFFPEGTVFATTTYPATGT
ATRSVALRIKQAAQGGARGQWAGSGAGFERAEGSYIYIAPNNGLLTTVIEEHGYIEAYEVSNTKVIPAEPEPTFYSREMV
AIPSAHLAAGFPLNEVGRALSDDEIVRFAKPKPSTVSGGVLSGVITNIDHPFGNLWTNIHRTDLEKAGIGYQTQLRLLLD
GVLTFDLPLVPTFADAGQIGDPVIYINSRGYLALARNAAPLAYPYNLKAGLTVTVTKA
>P02968 ~~~hag~~~Flagellin~~~COG1344
MRINHNIAALNTLNRLSSNNSASQKNMEKLSSGLRINRAGDDAAGLAISEKMRGQIRGLEMASKNSQDGISLIQTAEGAL
TETHAILQRVRELVVQAGNTGTQDKATDLQSIQDEISALTDEIDGISNRTEFNGKKLLDGTYKVDTATPANQKNLVFQIG
ANATQQISVNIEDMGADALGIKEADGSIAALHSVNDLDVTKFADNAADTADIGFDAQLKVVDEAINQVSSQRAKLGAVQN
RLEHTINNLSASGENLTAAESRIRDVDMAKEMSEFTKNNILSQASQAMLAQANQQPQNVLQLLR
>P22251 ~~~flaA~~~Flagellin A~~~
MGFRINTNVAALNAKANSDLNAKSLDASLSRLSSGLRINSAADDASGMAIADSLRSQANTLGQAISNGNDALGILQTADK
AMDEQLKILDTIKTKATQAAQDGQSLKTRTMLQADINKLMEELDNIANTTSFNGKQLLSGNFTNQEFQIGASSNQTVKAT
IGATQSSKIGVTRFETGAQSFTSGVVGLTIKNYNGIEDFKFDNVVISTSVGTGLGALAEEINKSADKTGVRATYDVKTTG
VYAIKEGTTSQDFAINGVTIGKIEYKDGDGNGSLISAINAVKDTTGVQASKDENGKLVLTSADGRGIKITGDIGVGSGIL
ANQKENYGRLSLVKNDGRDINISGTNLSAIGMGTTDMISQSSVSLRESKGQISATNADAMGFNSYKGGGKFVFTQNVSSI
SAFMSAQGSGFSRGSGFSVGSGKNLSVGLSQGIQIISSAASMSNTYVVSAGSGFSSGSGNSQFAALKTTAANTTDETAGV
TTLKGAMAVMDIAETAITNLDQIRADIGSIQNQVTSTINNITVTQVNVKAAESQIRDVDFASESANYSKANILAQSGSYA
MAQANSSQQNVLRLLQ
>Q05203 ~~~hag~~~Flagellin~~~COG1344
MIINHNLPAMNAHRNMGINLNQGQKAMEKLSSGLRINRAGDDAAGLAISEKMRAQIRGLDQASRNSQDGISLIQTAEGAL
DEVHSILQRMRELAVQSSNETNVEQDQAALNDEFQQLVEEIERIKDTTQFNTQKLLDDTVDTVQLQVGANSGELIELDLT
KVDLSAIHTALAAEDITDHTNAQSAIDAIDEQLKAVSEGRSYLGAMQNRLEHTIKNLDNASENLQAAESRIRDVDMAKEM
MEFTRTNILNQASQAMLAQANQQPQAVLQLLR
>W8JNL4 2.5.1.63~~~flA2~~~Fluorinase~~~COG1912
MTTTNGRRPIIAFMSDLGITDDSVAQCKGLMLSVCPDVTIVDICHTMQPWDVEEGARYIVDLPRLFPEGTVFATTTYPAT
GTTARSVALRIAHASKGGARGQWAGSGAGFERKEGSYIYIAPNNGLLTTVIKEHGYLEAYEVSSPEVIPEQPEPTFYSRE
MVALPSAHLAAGFPLEKVGRRLADDEIVRFERKDPELVADHDLVGYVTNIDHPFGNVWTNIHRTDLEKLGVGYGTKLRIT
LDGVLPFELPLSPTFADAGEIGAAVAYLSSRGYLALARNAASLAYPYNLKAGISVQVKVG
>Q70GK9 2.5.1.63~~~flA~~~Fluorinase~~~
MAANSTRRPIIAFMSDLGTTDDSVAQCKGLMYSICPDVTVVDVCHSMTPWDVEEGARYIVDLPRFFPEGTVFATTTYPAT
GTTTRSVAVRIKQAAKGGARGQWAGSGAGFERAEGSYIYIAPNNGLLTTVLEEHGYLEAYEVTSPKVIPEQPEPTFYSRE
MVAIPSAHLAAGFPLSEVGRPLEDHEIVRFNRPAVEQDGEALVGVVSAIDHPFGNVWTNIHRTDLEKAGIGYGARLRLTL
DGVLPFEAPLTPTFADAGEIGNIAIYLNSRGYLSIARNAASLAYPYHLKEGMSARVEAR
>W0W999 2.5.1.63~~~flA1~~~Fluorinase~~~
MAANGSQRPIIAFMSDLGTTDDSVAQCKGLMHSICPGVTVVDVCHSMTPWDVEEGARYIVDLPRFFPEGTVFATTTYPAT
GTTTRSVAVRIRQAAKGGARGQWAGSGDGFERADGSYIYIAPNNGLLTTVLEEHGYIEAYEVTSTKVIPANPEPTFYSRE
MVAIPSAHLAAGFPLAEVGRRLDDSEIVRFHRPAVEISGEALSGVVTAIDHPFGNIWTNIHRTDLEKAGIGQGKHLKIIL
DDVLPFEAPLTPTFADAGAIGNIAFYLNSRGYLSLARNAASLAYPYNLKAGLKVRVEAR
>P22252 ~~~flaB~~~Flagellin B~~~
MGFRINTNIGALNAHANSVVNSNELDKSLSRLSSGLRINSAADDASGMAIADSLRSQAATLGQAINNGNDAIGILQTADK
AMDEQLKILDTIKTKATQAAQDGQSLKTRTMLQADINRLMEELDNIANTTSFNGKQLLSGNFTNQEFQIGASSNQTIKAT
IGATQSSKIGVTRFETGAQSFTSGVVGLTIKNYNGIEDFKFDNVVISTSVGTGLGALAEEINKSADKTGVRATYDVKTTG
VYAIKEGTTSQDFAINGVVIGQINYKDGDNNGQLVSAINAVKDTTGVQASKDENGKLVLTSADGRGIKITGDIGVGSGIL
ANQKENYGRLSLVKNDGRDINISGTNLSAIGMGTTDMISQSSVSLRESKGQISATNADAMGFNSYKGGGKFVFTQNVSSI
SAFMSAQGSGFSRGSGFSVGSGKNLSVGLSQGIQIISSAASMSNTYVVSAGSGFSSGSGNSQFAALKTTAANTTDETAGV
TTLKGAMAVMDIAETAITNLDQIRADIGSVQNQLQVTINNITVTQVNVKAAESTIRDVDFASESANFSKYNILAQSGSYA
MSQANAVQQNVLKLLQ
>Q93AM1 2.8.3.17~~~fldA~~~Cinnamoyl-CoA:phenyllactate CoA-transferase~~~
MENNTNMFSGVKVIELANFIAAPAAGRFFADGGAEVIKIESPAGDPLRYTAPSEGRPLSQEENTTYDLENANKKAIVLNL
KSEKGKKILHEMLAEADILLTNWRTKALVKQGLDYETLKEKYPKLVFAQITGYGEKGPDKDLPGFDYTAFFARGGVSGTL
YEKGTVPPNVVPGLGDHQAGMFLAAGMAGALYKAKTTGQGDKVTVSLMHSAMYGLGIMIQAAQYKDHGLVYPINRNETPN
PFIVSYKSKDDYFVQVCMPPYDVFYDRFMTALGREDLVGDERYNKIENLKDGRAKEVYSIIEQQMVTKTKDEWDNIFRDA
DIPFAIAQTWEDLLEDEQAWANDYLYKMKYPTGNERALVRLPVFFKEAGLPEYNQSPQIAENTVEVLKEMGYTEQEIEEL
EKDKDIMVRKEK
>Q93AL9 4.2.1.175~~~fldB~~~(R)-phenyllactyl-CoA dehydratase alpha subunit~~~
MSDRNKEVKEKKAKHYLREITAKHYKEALEAKERGEKVGWCASNFPQEIATTLGVKVVYPENHAAAVAARGNGQNMCEHA
EAMGFSNDVCGYARVNLAVMDIGHSEDQPIPMPDFVLCCNNICNQMIKWYEHIAKTLDIPMILIDIPYNTENTVSQDRIK
YIRAQFDDAIKQLEEITGKKWDENKFEEVMKISQESAKQWLRAASYAKYKPSPFSGFDLFNHMAVAVCARGTQEAADAFK
MLADEYEENVKTGKSTYRGEEKQRILFEGIACWPYLRHKLTKLSEYGMNVTATVYAEAFGVIYENMDELMAAYNKVPNSI
SFENALKMRLNAVTSTNTEGAVIHINRSCKLWSGFLYELARRLEKETGIPVVSFDGDQADPRNFSEAQYDTRIQGLNEVM
VAKKEAE
>Q93AL8 4.2.1.175~~~fldC~~~(R)-phenyllactyl-CoA dehydratase beta subunit~~~
MSNSDKFFNDFKDIVENPKKYIMKHMEQTGQKAIGCMPLYTPEELVLAAGMFPVGVWGSNTELSKAKTYFPAFICSILQT
TLENALNGEYDMLSGMMITNYCDSLKCMGQNFKLTVENIEFIPVTVPQNRKMEAGKEFLKSQYKMNIEQLEKISGNKITD
ESLEKAIEIYDEHRKVMNDFSMLASKYPGIITPTKRNYVMKSAYYMDKKEHTEKVRQLMDEIKAIEPKPFEGKRVITTGI
IADSEDLLKILEENNIAIVGDDIAHESRQYRTLTPEANTPMDRLAEQFANRECSTLYDPEKKRGQYIVEMAKERKADGII
FFMTKFCDPEEYDYPQMKKDFEEAGIPHVLIETDMQMKNYEQARTAIQAFSETL
>J7SHB8 1.1.1.110~~~fldH~~~Aromatic 2-oxoacid reductase~~~
MKILAYCVRPDEIDSFKNFSEKYGHTVDLIPDSFGPNVAHLAKGYDGISILGNDTCNREALEKIKDCGIKYLATRTAGVN
NIDFDAAKEFGINVANVPAYSPNSVSEFTVGLALSLTRKIPFALKRVELNNFALGGLIGVELRNLTLGVIGTGRIGLKVI
EGFSGFGMKKMIGYDIFENEKAKEYIEYKSLDEVYKEADIITLHAPLTDDNYHMIGKESIAKMKDGVFIINAARGALIDS
EALIEGLKSGKIAGAALDSYEYEQGVFHNNKMNEIMKDDTLERLKSFPNVVITPHLGFYTDEAVSNMVEITLMNLQEFEL
KGTCKNQRVCK
>G9EZR6 1.1.1.110~~~fldH~~~Aromatic 2-oxoacid reductase~~~
MKILAYCVRPDEIDSFKNFSEKYGHTVDLIPDSFGPSVAHLAKGYDGISILGNDTCNREALEKIKDCGIKYLATRTAGVN
NIDFDAAKEFGINVANVPAYSPNSVSEFTVGLALSLTRKIPFALKRVELNNFALGGLIGVELRNLTLGVIGTGRIGLKVI
EGFSGFGMKKMIGYDIFENEKAKEYIEYKSLDEVYKEADIITLHAPLTDDNYHMIGKESIAKMKDGVFIINAARGALIDS
EALIEGLKSGKIAGAALDSYEYEQGVFHNNKMNEIMKDDTLARLKSFPNVVITPHLGFYTDEAVSNMVEITLMNLQEFEL
KGTCKNQRVCK
>Q93AM0 3.6.1.-~~~fldI~~~(R)-phenyllactate dehydratase activator~~~
MADIYTMGVDIGSTASKTVVLKNGKEIVSQAVISVGAGTSGPKRAIDSVLKDAKLSIEDLDYIVSTGYGRNSFDFANKQI
SELSCHAKGVYFDNNKARTVIDIGGQDIKVLKLADSGRLLNFIMNDKCAAGTGRFLDVMSRVIEVPVDELGKKALESKNP
CTISSTCTVFAESEVISQLARGVKTEDLIAGICKSVASRVASLAKRSGIEELVVMSGGVAKNIGVVKAMEAELGRDIYIS
KNSQLNGALGASLYAYESFQKERS
>A0A0H2ZDT7 ~~~fldP~~~Flavodoxin FldP~~~
MSKAVVVYFSGYGHTKRVAQAAAEGAQASLVEIDSEGNIPEAAWDLLDQAQSILFGAPTYMGSVPWQFKKFADATSRKWF
VRQWQDKVFGGFTNSASLNGDKQVSLILMHTLASQHGGIWVSLGLAPANTSASSRADINNLGASVGALVQSPSDADAQAI
PSGDLETVKLYAARVARIGQQLHA
>G9F1Y9 1.3.1.-~~~fldZ~~~Cinnamate reductase~~~
MKDQYKVLYDPIKIGKLEIKNRYVLAPMGPGGMCNADGSFNKRGIEFYVERAKGGTGLIMTGVTMVENNIEKCALPSMPC
PTINPLNFITTGNEMTERVHAYGAKIFLQLSAGFGRVSIPSIVGKVAVAPSKIPHRFLPGVTCRELTTEEVKEYVKAFGE
SAEIAKKAGFDGVEIHAVHEGYLLDQFAISFFNHRTDEYGGSLENRLRFACEVVQEIKKRCGQDFPVSLRYSIKSFIKDW
CKGGLPDEEFEEKGRDIPEGIEAAKILVAAGYDALNGDVGSYDSWYWSHPPMYQKKGLYLPYNEILKKVVDVPIITAGRM
EDPELSSDAILSGKTDMIALGRPLLADAEIPNKIFEDKYDKVRPCLSCQEGCMGRLQNFATVSCAVNPACGREKEYGLKK
AEQIKKVLIVGGGVAGMEAARVAAIRGHKVTLIEKNGYLGGNIVPGGVPDFKDDDRALVKWYEGILKDLGVEIKLNVAAS
KENIKEFGADEVLLATGSSPRTLTIEGADKVYSAEDVLMERKNVGEKVIIIGGGLVGCETALWLKQQGKEVTIVEMQNDI
LQVGGPLCHANHDMLIDLIKFNKIDVKASSYISKKTDEGFVLNTNGEESIINADSAVVAIGYLSEKDLYSEVRFDIPNAR
LIGDANKVQNIMYAIWSAYEVAKNI
>G3XD64 ~~~fleN~~~Antiactivator FleN~~~
MKQMGSMHPVQVIAVTGGKGGVGKTNVSVNLALALADLGRRVMLLDADLGLANVDVLLGLTPKRTLADVIEGRCELRDVL
LLGPGGVRIVPAASGTQSMVHLSPMQHAGLIQAFSDISDNLDVLVVDTAAGIGDSVVSFVRAAQEVLLVVCDEPTSITDA
YALIKLLNRDHGMTRFRVLANMAHSPQEGRNLFAKLTKVTDRFLDVALQYVGVIPYDESVRKAVQKQRAVYEAFPRSKAS
LAFKAVAQKVDSWPLPANPRGHLEFFVERLVQHPATGSAV
>G3XCV0 ~~~fleQ~~~Transcriptional regulator FleQ~~~
MWRETKLLLIDDNLDRSRDLAVILNFLGEDQLTCNSEDWREVAAGLSNSREALCVLLGSVESKGGAVELLKQLASWDEYL
PILLIGEPAPADWPEELRRRVLASLEMPPSYNKLLDSLHRAQVYREMYDQARERGRSREPNLFRSLVGTSRAIQQVRQMM
QQVADTDASVLILGESGTGKEVVARNLHYHSKRREGPFVPVNCGAIPAELLESELFGHEKGAFTGAITSRAGRFELANGG
TLFLDEIGDMPLPMQVKLLRVLQERTFERVGSNKTQNVDVRIIAATHKNLEKMIEDGTFREDLYYRLNVFPIEMAPLRER
VEDIALLLNELISRMEHEKRGSIRFNSAAIMSLCRHDWPGNVRELANLVERLAIMHPYGVIGVGELPKKFRHVDDEDEQL
ASSLREELEERAAINAGLPGMDAPAMLPAEGLDLKDYLANLEQGLIQQALDDAGGVVARAAERLRIRRTTLVEKMRKYGM
SRRDDDLSDD
>Q9I4N3 ~~~fleR~~~Response regulator protein FleR~~~
MAAKVLLVEDDRALREALSDTLLLGGHEFVAVDSAEAALPVLAREAFSLVISDVNMPGMDGHQLLGLIRTRYPHLPVLLM
TAYGAVDRAVEAMRQGAADYLVKPFEARALLDLVARHALGQLPGSEEDGPVALEPASRQLLELAARVARSDSTVLISGES
GTGKEVLANYIHQQSPRAGKPFIAINCAAIPDNMLEATLFGHEKGSFTGAIAAQPGKFELADGGTILLDEISEMPLGLQA
KLLRVLQEREVERVGARKPINLDIRVLATTNRDLAAEVAAGRFREDLYYRLSVFPLAWRPLRERPADILPLAERLLRKHS
RKMNLGAVALGPEAAQCLVRHAWPGNVRELDNAIQRALILQQGGLIQPADLCLTAPIGMPLAAPVPVPMPAMPPATPPSV
EIPSPAAGQDASGALGDDLRRREFQVIIDTLRTERGRRKEAAERLGISPRTLRYKLAQMRDAGMDVEAYLYAI
>P40131 ~~~flgA~~~Flagella basal body P-ring formation protein FlgA~~~
MQTLKRGFAVAALLFSPLTMAQDINAQLTTWFSQRLAGFSDEVVVTLRSSPNLLPSCEQPAFSMTGSAKLWGNVNVVARC
ANEKRYLQVNVQATGNYVAVAAPIARGGKLTPANVTLKRGRLDQLPPRTVLDIRQIQDAVSLRDLAPGQPVQLTMIRQAW
RVKAGQRVQVIANGEGFSVNAEGQAMNNAAVAQNARVRMTSGQIVSGTVDSDGNILINL
>P24500 ~~~flgB~~~Flagellar basal body rod protein FlgB~~~COG1815
MSLFSGTIQNLENALSRADIKQKVITNNIANIDTPNYKAKKVSFQNLLDQESSRLEAIKTDYRHVDFSDTDSNYSIVASG
DTSYQQNGNNVDVDKEMTELAQNQINYQALVERMNGKFNSLKTVLTGGK
>P16437 ~~~flgB~~~Flagellar basal body rod protein FlgB~~~
MLDRLDAALRFQQEALNLRAQRQEILAANIANADTPGYQARDIDFASELKKVMVRGREETGGVALTLTSSHHIPAQAVSS
PAVDLLYRVPDQPSLDGNTVDMDRERTQFADNSLKYQMGLTVLGSQLKGMMNVLQGGN
>P0A1I7 ~~~flgC~~~Flagellar basal-body rod protein FlgC~~~
MALLNIFDIAGSALAAQSKRLNVAASNLANADSVTGPDGQPYRAKQVVFQVDAAPGQATGGVKVASVIESQAPEKLVYEP
GNPLADANGYVKMPNVDVVGEMVNTMSASRSYQANIEVLNTVKSMMLKTLTLGQ
>P0A1I9 ~~~flgD~~~Basal-body rod modification protein FlgD~~~
MSIAVNMNDPTNTGVKTTTGSGSMTGSNAADLQSSFLTLLVAQLKNQDPTNPLQNNELTTQLAQISTVSGIEKLNTTLGA
ISGQIDNSQSLQATTLIGHGVMVPGTTILAGKGAEEGAVTSTTPFGVELQQPADKVTATITDKDGRVVRTLEIGELRAGV
HTFTWDGKQTDGTTVPNGSYNIAITASNGGTQLVAQPLQFALVQGVTKGSNGNLLDLGTYGTTTLDEVRQII
>Q8YDL6 ~~~flgE~~~Flagellar hook protein FlgE~~~COG1749
MSLYGMMRTGVSGMNAQANRLSTVADNIANASTVGYKRAETQFSSLVLPSTAGQYNSGSVLTDVRYGISDQGGIRSTSST
TDLAIDGNGYFVVQGPGGSTYLTRAGSFVPDKNGDLVNSAGYYLLGAGADEAAGGLTVAGLNIVNVNAAALPAEGSTAGD
FTVNLPSTDQAPAAGGYNHKTSLISYNDKGEKITLDVYFTKTGADEWNVSVKNAADGVEIGTTVLNFDPTTGDLVSGGNV
AVNLGAYGGQTLNLNLGGSTQRAGDYTISQAVINGQAPSSIKGVDVGNDGAVVAVYENGTQKVLYRIPLANVASPDRMTV
VSGNIFLPSAESGDVRLGFPQGDGMGKIMSGTLEESNADIAQELTDMIEAQRSYTANSKVFQTGFELMDVLVNLKR
>P35806 ~~~flgE~~~Flagellar hook protein FlgE~~~COG1749
MSINSAMLAGVSGLIANSSALAAISDNIANVNTVGFKRSTSNFSTLVTSGNKNQTYSAGGVKAQTHQFISQQGLTQSTTS
NLDISISGAGFFVTTEKPENLTATDTRSFTRAGSFQLDNLGYLRNDAGLYLQGWLADPVSGLITPDPSDLMQLASINVGS
VGGTAEKTTRVGVNANLRSEQPVAAAVSYKVGTAGSPSKTNVVDSATNSHNYDVVYSSTGIANPVSGNNEYLVDIKENGV
IVATGKVAYDAATNELVSSTIDYKGASPVTGSMTTTRINAAGTTVNLADLGIVNASGADDAEVVAGKLYDPSTWSMSDYA
KDNSKGVKPDFEVQIPLSDSKGGQRTVTLSMLKGPGPNQWYAELRAKPGDLANNGNGQISTGIIEFTTDGKLKNTGSLFG
TTSPTAITIKSSGYIAPTVTPPAVQPPTPPTWADALGIDEQEVQIDLASAAGGLTQYNSQSVVQSVNTNGTAFGNLTNIE
IDEGGYVSAIFDNGVTRRIAQVAIATFSNPNGLKGVNGNAYRVTNESGTYSLKAPSQGGAGALAPSTLEASTVDLSQEFT
GLITTQRAYSASSKIITTADQMLEELLNIKR
>P50610 ~~~flgE~~~Flagellar hook protein FlgE~~~COG1749
MLRSLWSGVNGMQAHQIALDIESNNIANVNTTGFKYSRASFVDMLSQVKLIATAPYKNGLAGQNDFSVGLGVGVDATTKI
FSQGNIQNTDVKTDLAIQGDGFFIISPDRGITRNFTRDGEFLFDSQGSLVTTGGLVVQGWVRNGSDTGNKGSDTDALKVD
NTGPLENIRIDPGMVMPARASNRISMRANLNAGRHADQTAAIFALDSSAKTPSDGINPVYDSGTNLAQVAEDMGSLCNED
GDALLLNENQGIWVSYKSAKMVKDILPSAENSTLELNGVKISFTNDSAVSRTSSLVAAKNAINAVKSQTGIEAYLDGKQL
RLENTNELDGDEKLKNIVVTQAGTGAFANFLDGDKDVTAFKYSYTHSISPNADIGQFRTTEDLRALIQHDANIVKDPSLA
DNYQDSAASIGVSVNQYGMFEINNKDNKNVIKENLNIFVSGYSSDSVTNNVLFKNAMKGLNTASLIEGGASASSSKFTHA
THATSIDVIDSLGTKHAMRIEFYRSGGAEWNFRVIVPEPGELVGGSAARPNVFEGGRLHFNNDGSLAGMNPPLLQFDPKN
GADAPQRINLAFGSSGSFDGLTSVDKISETYAIEQNGYQAGDLMDVRFDSDGVLLGAFSNGRTLALAQVALANFANDAGL
QALGGNVFSQTGNSGQALIGAANTGRRGSISGSKLESSNVDLSRSLTNLIVVQRGFQANSKAVTTSDQILNTLLNLKQ
>P0A1J1 ~~~flgE~~~Flagellar hook protein FlgE~~~
MSFSQAVSGLNAAATNLDVIGNNIANSATYGFKSGTASFADMFAGSKVGLGVKVAGITQDFTDGTTTNTGRGLDVAISQN
GFFRLVDSNGSVFYSRNGQFKLDENRNLVNMQGMQLTGYPATGTPPTIQQGANPAPITIPNTLMAAKSTTTASMQINLNS
TDPVPSKTPFSVSDADSYNKKGTVTVYDSQGNAHDMNVYFVKTKDNEWAVYTHDSSDPAATAPTTASTTLKFNENGILES
GGTVNITTGTINGATAATFSLSFLNSMQQNTGANNIVATNQNGYKPGDLVSYQINNDGTVVGNYSNEQEQVLGQIVLANF
ANNEGLASQGDNVWAATQASGVALLGTAGSGNFGKLTNGALEASNVDLSKELVNMIVAQRNYQSNAQTIKTQDQILNTLV
NLR
>P16323 ~~~flgF~~~Flagellar basal-body rod protein FlgF~~~
MDHAIYTAMGAASQTLNQQAVTASNLANASTPGFRAQLNALRAVPVDGLSLATRTLVTASTPGADMTPGQLDYTSRPLDV
ALQQDGWLVVQAADGAEGYTRNGNIQVGPTGQLTIQGHPVIGEGGPITVPEGSEITIAADGTISALNPGDPPNTVAPVGR
LKLVKAEGNEVQRSDDGLFRLTAEAQAERGAVLAADPSIRIMSGVLEGSNVKPVEAMTDMIANARRFEMQMKVITSVDEN
EGRANQLLSMS
>P0A1J3 ~~~flgG~~~Flagellar basal-body rod protein FlgG~~~
MISSLWIAKTGLDAQQTNMDVIANNLANVSTNGFKRQRAVFEDLLYQTIRQPGAQSSEQTTLPSGLQIGTGVRPVATERL
HSQGNLSQTNNSKDVAIKGQGFFQVMLPDGTSAYTRDGSFQVDQNGQLVTAGGFQVQPAITIPANALSITIGRDGVVSVT
QQGQAAPVQVGQLNLTTFMNDTGLESIGENLYIETQSSGAPNESTPGLNGAGLLYQGYVETSNVNVAEELVNMIQVQRAY
EINSKAVSTTDQMLQKLTQL
>P0A1N8 ~~~flgH~~~Flagellar L-ring protein~~~
MQKYALHAYPVMALMVATLTGCAWIPAKPLVQGATTAQPIPGPVPVANGSIFQSAQPINYGYQPLFEDRRPRNIGDTLTI
VLQENVSASKSSSANASRDGKTSFGFDTVPRYLQGLFGNSRADMEASGGNSFNGKGGANASNTFSGTLTVTVDQVLANGN
LHVVGEKQIAINQGTEFIRFSGVVNPRTISGSNSVPSTQVADARIEYVGNGYINEAQNMGWLQRFFLNLSPM
>P15930 ~~~flgI~~~Flagellar P-ring protein~~~
MFKALAGIVLALVATLAHAERIRDLTSVQGVRENSLIGYGLVVGLDGTGDQTTQTPFTTQTLNNMLSQLGITVPTGTNMQ
LKNVAAVMVTASYPPFARQGQTIDVVVSSMGNAKSLRGGTLLMTPLKGVDSQVYALAQGNILVGGAGASAGGSSVQVNQL
NGGRITNGAIIERELPTQFGAGNTINLQLNDEDFTMAQQITDAINRARGYGSATALDARTVQVRVPSGNSSQVRFLADIQ
NMEVNVTPQDAKVVINSRTGSVVMNREVTLDSCAVAQGNLSVTVNRQLNVNQPNTPFGGGQTVVTPQTQIDLRQSGGSLQ
SVRSSANLNSVVRALNALGATPMDLMSILQSMQSAGCLRAKLEII
>P15931 3.2.1.-~~~flgJ~~~Peptidoglycan hydrolase FlgJ~~~
MIGDGKLLASAAWDAQSLNELKAKAGQDPAANIRPVARQVEGMFVQMMLKSMREALPKDGLFSSDQTRLYTSMYDQQIAQ
QMTAGKGLGLADMMVKQMTSGQTMPADDAPQVPLKFSLETVNSYQNQALTQLVRKAIPKTPDSSDAPLSGDSKDFLARLS
LPARLASEQSGVPHHLILAQAALESGWGQRQILRENGEPSYNVFGVKATASWKGPVTEITTTEYENGEAKKVKAKFRVYS
SYLEALSDYVALLTRNPRYAAVTTAATAEQGAVALQNAGYATDPNYARKLTSMIQQLKAMSEKVSKTYSANLDNLF
>P33235 ~~~flgK~~~Flagellar hook-associated protein 1~~~COG1256
MSSLINNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTIMAQANSTLGAGGWVGNGVYVSGVQREYDAFITNQLRAAQT
QSSGLTARYEQMSKIDNMLSTSTSSLATQMQDFFTSLQTLVSNAEDPAARQALIGKSEGLVNQFKTTDQYLRDQDKQVNI
AIGASVDQINNYAKQIASLNDQISRLTGVGAGASPNNLLDQRDQLVSELNQIVGVEVSVQDGGTYNITMANGYSLVQGST
ARQLAAVPSSADPSRTTVAYVDGTAGNIEIPEKLLNTGSLGGILTFRSQDLDQTRNTLGQLALAFAEAFNTQHKAGFDAN
GDAGEDFFAIGKPAVLQNTKNKGDVAIGATVTDASAVLATDYKISFDNNQWQVTRLASNTTFTVTPDANGKVAFDGLELT
FTGTPAVNDSFTLKPVSDAIVNMDVLITDEAKIAMASEEDAGDSDNRNGQALLDLQSNSKTVGGAKSFNDAYASLVSDIG
NKTATLKTSSATQGNVVTQLSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALINIR
>P0A1J5 ~~~flgK~~~Flagellar hook-associated protein 1~~~
MSSLINHAMSGLNAAQAALNTVSNNINNYNVAGYTRQTTILAQANSTLGAGGWIGNGVYVSGVQREYDAFITNQLRGAQN
QSSGLTTRYEQMSKIDNLLADKSSSLSGSLQSFFTSLQTLVSNAEDPAARQALIGKAEGLVNQFKTTDQYLRDQDKQVNI
AIGSSVAQINNYAKQIANLNDQISRMTGVGAGASPNDLLDQRDQLVSELNKIVGVEVSVQDGGTYNLTMANGYTLVQGST
ARQLAAVPSSADPTRTTVAYVDEAAGNIEIPEKLLNTGSLGGLLTFRSQDLDQTRNTLGQLALAFADAFNAQHTKGYDAD
GNKGKDFFSIGSPVVYSNSNNADKTVSLTAKVVDSTKVQATDYKIVFDGTDWQVTRTADNTTFTATKDADGKLEIDGLKV
TVGTGAQKNDSFLLKPVSNAIVDMNVKVTNEAEIAMASESKLDPDVDTGDSDNRNGQALLDLQNSNVVGGNKTFNDAYAT
LVSDVGNKTSTLKTSSTTQANVVKQLYKQQQSVSGVNLDEEYGNLQRYQQYYLANAQVLQTANALFDALLNIR
>P16326 ~~~flgL~~~Flagellar hook-associated protein 3~~~
MRISTQMMYEQNMSGITNSQAEWMKLGEQMSTGKRVTNPSDDPIAASQAVVLSQAQAQNSQYALARTFATQKVSLEESVL
SQVTTAIQTAQEKIVYAGNGTLSDDDRASLATDLQGIRDQLMNLANSTDGNGRYIFAGYKTEAAPFDQATGGYHGGEKSV
TQQVDSARTMVIGHTGAQIFNSITSNAVPEPDGSDSEKNLFVMLDTAIAALKTPVEGNNVEKEKAAAAIDKTNRGLKNSL
NNVLTVRAELGTQLSELSTLDSLGSDRALGQKLQMSNLVDVDWNSVISSYVMQQAALQASYKTFTDMQGMSLFQLNR
>P0AEM4 ~~~flgM~~~Negative regulator of flagellin synthesis~~~COG2747
MSIDRTSPLKPVSTVQPRETTDAPVTNSRAAKTTASTSTSVTLSDAQAKLMQPGSSDINLERVEALKLAIRNGELKMDTG
KIADALINEAQQDLQSN
>P26477 ~~~flgM~~~Negative regulator of flagellin synthesis~~~
MSIDRTSPLKPVSTVQTRETSDTPVQKTRQEKTSAATSASVTLSDAQAKLMQPGVSDINMERVEALKTAIRNGELKMDTG
KIADSLIREAQSYLQSK
>P43533 ~~~flgN~~~Flagella synthesis protein FlgN~~~COG3418
MTRLAEILDQMSAVLNDLKTVMDQEQQHLSMGQINGSQLQWITEQKSSLLATLDYLEQLRRKEPNTANSVDISQRWQEIT
VKTQQLRQMNQHNGWLLEGQIERNQQALEMLKPHQEPTLYGANGQTSTTHRGGKKISI
>P0A1J7 ~~~flgN~~~Flagella synthesis protein FlgN~~~
MTRLSEILDQMTTVLNDLKTVMDAEQQQLSVGQINGSQLQRITEEKSSLLATLDYLEQQRRLEQNAQRSANDDIAERWQA
ITEKTQHLRDLNQHNGWLLEGQIERNQQALEVLKPHQEPTLYGADGQTSVSHRGGKKISI
>O25408 ~~~flgR~~~Transcriptional regulatory protein FlgR~~~COG2204
MKIAIVEDDINMRKSLELFFELQDDLEIVSFKNPKDALAKLDESFDLVITDINMPHMDGLEFLRLLEGKYESIVITGNAT
LNKAIDSIRLGVKDFFQKPFKPELLLESIYRTKKVLEFQKKHPLEKPLKKPHKHSFLAASKALEESKRQALKVASTDANV
MLLGESGVGKEVFAHFIHQHSQRSKHPFIAINMSAIPEHLLESELFGYQKGAFTDATAPKMGLFESANKGTIFLDEIAEM
PLQLQSKLLRVVQEKEITRLGDNKSVKIDVRFISATNANMKEKIAAKEFREDLFFRLQIVPITIAPLRERVEEILPIAEI
KLKEVCDAYHLGPKSFSKNAAKCLLEYSWHGNVRELLGVVERAAILSEETEIQEKDLFLER
>O25026 2.7.13.3~~~flgS~~~Sensor histidine kinase FlgS~~~COG0642
MKKSKHLKRPYLKRSHLKHSDKASSFKGLLKKEDNVISLENFKPKESEDLLENFSNKKDMQELLGLLNQFILQSYKVEKE
FKDYKALYEWVIEILPQAIWVVNENGSFFYKNSLANQSHEVFNKAKLENFNTEIEHENKSYLVQQNSIQGKQIITATDIS
AQKRQERLASMGKISAHLAHEIRNPVGSISLLASVLLKHANEKTKPIVVELQKALWRVERIIKATLLFSKGIQANRTKQS
LKTLESDLKEALNCYTYSKDIDFLFNFSDEEGFFDFDLMGIVLQNFLYNAIDAIEALEESEQGQVKIEAFIQNEFIVFTI
IDNGKEVENKSALFEPFETTKLKGNGLGLALSLQVVKAHEGSIALLENQEKTFEIKILNAS
>P35620 ~~~flhA~~~Flagellar biosynthesis protein FlhA~~~COG1298
MSTRDLSVLISVVLIVAMLVIPFPPWLLSILIIINISLALIVLLTTMNMQEALQFSIFPSLLLLLTLFRLGLNVSTTRSI
LSHGEGGKVVETFGNFVVGGNVLVGLVVFIILIIIQFIVITKGAERVSEVAARFTLDAMPGKQMSIDADLNAGMITEQEA
KHRREKVAREADFYGAMDGASKFVKGDAIAGIIIVMINIIFGIVIGMLQQGMSIQEAASHFTMLTVGDGIVSQIPALLIS
TATGIVVTRAASEGNLGHDITGQLFAYPKLLYVAAATIMLLGIFTPIGILLTGPLAGLLAFGAYTLSKSGKEKEEVDEIL
EEEAEVDELKSPESVVQLLHIDPIEFEFGYGLIPLADANQGGDLLDRIVMIRRQLALELGLVIPVVRIRDNIALQPNEYR
LKIKGNEVAKGELLLDHYLAMSPTPEDDLIEGIETVEPSFGLPAKWISEAVKDEADMLGYTVVDPASVVSTHITEKIKQH
AHELIGRQETKQLIDHLKESYPVLVEEVTPNPLSVGDIQKVLAKLLKEKVSIRNLVTIFETLADYGKLTTDSDLLTEYTR
QALAKQITAQFAKENEVLKVVTCSGRVEKAIADGVQQTEHGNYLSLEPDISESIVRSVAKEAEQLSLRQETAILLCSPPV
RMYVKQLLERYFPDLPVLSYNELEANVEVQSIGVVDI
>Q8YDK9 ~~~flhA~~~Flagellar biosynthesis protein FlhA~~~COG1298
MKQTLGAVPAARLTGTGPMSVAKQEADVQQAAAKPFIKGKFLTGSDIGLAVGIIIILTVLFLPVPAVVLDIGLAFSIAFS
VLILMVALWIQRPLDFSAFPTVLLIATMMRLSLNIATTRVILTHGNEGYLAAGHVIHGFSQFVMGGDFVIGLVVFAILII
VNFLVITKGATRIAEVGARFTLDAIPGKQMAIDADLSSGLIDEKEAQRRRRELEEESSFFGSMDGASKFVRGDAIAGLII
TAVNIFGGIVIGATRHGMDISQAADVFTKLSVGDGLVTQIPALIVSLAAGLLVSKGGTRGSADQAIFGQLGAYPKALLIA
ALLLFILGVMPGLPAFPFFLLGGAMAFVGIAVPRRQARQREADAAEAGKKQREAEEQERNSVKASLETNQIELCLGKQLS
ARLIASQEELAHRVNKMRRKFAQEYGFVIPEIKVTDDIALPPKSYRIKIHGTAVASHELRVGEILVVLGERPVPSVPGEE
VREPAFGMRAYSVPETFTADLRREGYMTVDNLSVLLTHLSEIVRNNLAQLLSYKDMRILLDRLGPEYRKLLEDICPAHIS
YSGLQAVLKLLLAERISIRNLHLILEAIAEIAPLVRRPEMIVEHVRMRMAQQICGDLSDNGVLNVLRLGNRWDLVFHQSL
KRDAKGEIVEFDIDPRLLEQFGTEASAAIRKHFDNGERFVLVSSPEARPYIRMIIERLFATLPVLSHVEIARGVEVKSLG
AIS
>O06758 ~~~flhA~~~Flagellar biosynthesis protein FlhA~~~COG1298
MANERSKLAFKKTFPVFKRFLQSKDLALVVFVIAILAIIIVPLPPFVLDFLLTISIALSVLIILIGLYIDKPTDFSAFPT
LLLIVTLYRLALNVATTRMILTQGYKGPSAVSDIITAFGEFSVSGNYVIGAIIFSILVLVNLLVVTNGSTRVTEVRARFA
LDAMPGKQMAIDADLNSGLIDDKEAKKRRAALSQEADFYGAMDGASKFVKGDAIASIIITLINIIGGFLVGVFQRDMSLS
FSASTFTILTIGDGLVGQIPALIIATATGIVATRTTQNEEEDFASKLITQLTNKSKTLVIVGAILLLFATIPGLPTFSLA
FVGTLFLFIAWLISREGKDGLLTKLENYLSQKFGLDLSEKPHSSKIKPHTPTTRAKTQEELKREEEQAIDEVLKIEFLEL
ALGYQLISLADMKQGGDLLERIRGIRKKIASDYGFLMPQIRIRDNLQLPPTHYEIKLKGIVIGEGMVMPDKFLAMNTGFV
NKEIEGIPTKEPAFGMDALWIETKNKEEAIIQGYTIIDPSTVIATHTSELVKKYAEDFITKDEVKSLLERLAKDYPTIVE
ESKKIPTGAIRSVLQALLHEKIPIKDMLTILETITDIAPLVQNDVNILTEQVRARLSRVITNAFKSEDGRLKFLTFSTDS
EQFLLNKLRENGTSKSLLLNVGELQKLIEVVSEEAMKVLQKGIAPVILIVEPNLRKALSNQMEQARIDVIVLSHAELDPN
SNFEALGTIHINF
>P40729 ~~~flhA~~~Flagellar biosynthesis protein FlhA~~~
MANLVAMLRLPSNLKSTQWQILAGPILILLILSMMVLPLPAFILDLLFTFNIALSIMVLLVAMFTQRTLDFAAFPTILLF
TTLLRLALNVASTRIILMEGHTGAAAAGKVVEAFGHFLVGGNFAIGIVVFIILVIINFMVITKGAGRIAEVGARFVLDGM
PGKQMAIDADLNAGLIGEDEAKKRRSEVTQEADFYGSMDGASKFVRGDAIAGILIMVINVVGGLLVGVLQHGMSIGSAAE
SYTLLTIGDGLVAQIPALVISTAAGVIVTRVSTDQDVGEQMVGQLFSNPRVMLLAAAVLGLLGMVPGMPNLVFLLFTAAL
LGLAWWLRGREEKAPEEPQPVKMPENNSVVEATWNDVQLEDSLGMEVGYRLIPMVDFQQDGELLGRIRSIRKKFAQDMGF
LPPVVHIRDNMDLQPARYRILMKGVEIGSGDAYPGRWLAINPGTAAGTLPGEKTVDPAFGLDAIWIESALKEQAQIQGFT
VVEASTVVATHLNHLIGQFSAELFGRQEAQQLLDRVSQEMPKLTEDLVPGVVTLTTLHKVLQNLLAEKVPIRDMRTILET
LAEHAPLQSDPHELTAVVRVALGRAITQQWFPGNEEVQVIGLDTALERLLLQALQGGGGLEPGLADRLLAQTQEALSRQE
MLGAPPVLLVNHALRPLLSRFLRRSLPQLVVLSNLELSDNRHIRMTATIGGK
>O67813 ~~~flhB~~~Flagellar biosynthetic protein FlhB~~~COG1377
MAEEHKTERATPYKRRKVREEGNVAKSHEIASSLVVLLSLLLLLFLGTYIAKEVILIFLAVTGYVHADISELGSLYENFY
ENIVKVLTPLFFLALLVVILSHVAQFGFIFTLKPLSFKWERINPFEGIKRLISLTTLFETVKNTLKAFLLIGIAVFVLKG
SLYFFLSSSTYPLAETLKSFIKTSAITLITLGVVALLIAFLDYAFKRWQYEKKIMMSRRELKEEYKQLEGHPEVKSRIKA
RMRELAKSRMMAEVPKATVVITNPTHIAIALKYNPEKDKAPVVVAKGKGTIAQKIVEIAENYSIPVVRKPELARALYPAV
EVGKEISPKFYKAVAEIIAYVMFKKKKVYA
>P76299 ~~~flhB~~~Flagellar biosynthetic protein FlhB~~~COG1377
MSDESDDKTEAPTPHRLEKAREEGQIPRSRELTSLLILLVGVSVIWFGGVSLARRLSGMLSAGLHFDHSIINDPNLILGQ
IILLIREAMLALLPLISGVVLVALISPVMLGGLVFSGKSLQPKFSKLNPLPGIKRMFSAQTGAELLKAILKTILVGSVTG
FFLWHHWPQMMRLMAESPITAMGNAMDLVGLCALLVVLGVIPMVGFDVFFQIFSHLKKLRMSRQDIRDEFKQSEGDPHVK
GRIRQMQRAAARRRMMADVPKADVIVNNPTHYSVALQYDENKMSAPKVVAKGAGLVALRIREIGAENNVPTLEAPPLARA
LYRHAEIGQQIPGQLYAAVAEVLAWVWQLKRWRLAGGQRPVQPTHLPVPEALDFINEKPTHE
>P40727 ~~~flhB~~~Flagellar biosynthetic protein FlhB~~~
MAEESDDDKTEAPTPHRLEKAREEGQIPRSRELTSLLILLVGVCIIWFGGESLARQLAGMLSAGLHFDHRMVNDPNLILG
QIILLIKAAMMALLPLIAGVVLVALISPVMLGGLIFSGKSLQPKFSKLNPLPGIKRMFSAQTGAELLKAVLKSTLVGCVT
GFYLWHHWPQMMRLMAESPIVAMGNALDLVGLCALLVVLGVIPMVGFDVFFQIFSHLKKLRMSRQDIRDEFKESEGDPHV
KGKIRQMQRAAAQRRMMEDVPKADVIVTNPTHYSVALQYDENKMSAPKVVAKGAGLIALRIREIGAEHRVPTLEAPPLAR
ALYRHAEIGQQIPGQLYAAVAEVLAWVWQLKRWRLAGGQRPPQPENLPVPEALDFMNEKNTDG
>P0ABY7 ~~~flhC~~~Flagellar transcriptional regulator FlhC~~~
MSEKSIVQEARDIQLAMELITLGARLQMLESETQLSRGRLIKLYKELRGSPPPKGMLPFSTDWFMTWEQNVHASMFCNAW
QFLLKTGLCNGVDAVIKAYRLYLEQCPQAEEGPLLALTRAWTLVRFVESGLLQLSSCNCCGGNFITHAHQPVGSFACSLC
QPPSRAVKRRKLSQNPADIIPQLLDEQRVQAV
>O34202 ~~~flhC~~~Flagellar transcriptional regulator FlhC~~~
MSEKSIVREAKDIRLAMELITLGARLQMLESETQLSRGRLIKLYKELRGSPPPKGMLPFSTDWFMTWEQNIHSSMFYNAY
RFLLKSGGSVGIEAVVKAYRLYLEQCPPVKDQEPILALTRAWTLVRFVESGMLQLSVCTKCNGSFITHAHQPASNYVCSL
CQPPSRAIKKRKLSANPADINLQLLDGLEQFRM
>O52222 ~~~flhC~~~Flagellar transcriptional regulator FlhC~~~
MSEKSIVQEARDIQLAMELINLGARLQMLESETQLSRGRLIRLYKELRGSPPPKGMLPFSTDWFMTWEQNIHASMFCNAW
QFLLKTGLCSGVDAVIKAYRLYLEQCPQPPEGPLLALTRAWTLVRFVESGLLELSSCNCCGGNFITHAHQPVGSFACSLC
QPPSRAVKRRKLSRDAADIIPQLLDEQIEQAV
>P0A8S9 ~~~flhD~~~Flagellar transcriptional regulator FlhD~~~
MHTSELLKHIYDINLSYLLLAQRLIVQDKASAMFRLGINEEMATTLAALTLPQMVKLAETNQLVCHFRFDSHQTITQLTQ
DSRVDDLQQIHTGIMLSTRLLNDVNQPEEALRKKRA
>O34201 ~~~flhD~~~Flagellar transcriptional regulator FlhD~~~
MSTVELLKHIYDINLSYLLLAQRLINQEKASAMFRLGISDSMADALKELTLPQLVKLAETNQLICNFRFEDSETIEQLTK
ESRVDDLQQIHTGILLSSNLFRQLSEHDTSATKKRA
>P0A2R2 ~~~flhD~~~Flagellar transcriptional regulator FlhD~~~
MHTSELLKHIYDINLSYLLLAQRLIVQDKASAMFRLGINEEMANTLGALTLPQMVKLAETNQLVCHFRFDDHQTITRLTQ
DSRVDDLQQIHTGIMLSTRLLNEVDDTARKKRA
>P0A1N4 ~~~flhE~~~Flagellar protein FlhE~~~
MRKWLALLLFPLTVQAAGEGAWQDSGMGVTLNYRGVSASSSPLSARQPVSGVMTLVAWRYELNGPTPAGLRVRLCSQSRC
VELDGQSGTTHGFAHVPAVEPLRFVWEVPGGGRLIPALKVRSNQVIVNYR
>Q01960 ~~~flhF~~~Flagellar biosynthesis protein FlhF~~~COG1419
MKIKKFTAASMQEAALLIRKELGNEAVILNSKKIKKRKWFGLVNKPAVEVIAVLDQDFLEKKTPQKAAEPKQTLKTPVSS
PKIEERTYPPQIPAQQELGDFSAYQSVLPEPLRKAEKLLQETGIKESTKTNTLKKLLRFSVEAGGLTEENVVGKLQEILC
DMLPSADKWQEPIHSKYIVLFGSTGAGKTTTLAKLAAISMLEKHKKIAFITTDTYRIAAVEQLKTYAELLQAPLEVCYTK
EEFQQAKELFSEYDHVFVDTAGRNFKDPQYIDELKETIPFESSIQSFLVLSATAKYEDMKHIVKRFSSVPVNQYIFTKID
ETTSLGSVFNILAESKIGVGFMTNGQNVPEDIQTVSPLGFVRMLCR
>P0AEM6 ~~~fliA~~~RNA polymerase sigma factor FliA~~~COG1191
MNSLYTAEGVMDKHSLWQRYVPLVRHEALRLQVRLPASVELDDLLQAGGIGLLNAVERYDALQGTAFTTYAVQRIRGAML
DELRSRDWVPRSVRRNAREVAQAIGQLEQELGRNATETEVAERLGIDIADYRQMLLDTNNSQLFSYDEWREEHGDSIELV
TDDHQRENPLQQLLDSNLRQRVMEAIETLPEREKLVLTLYYQEELNLKEIGAVLEVGESRVSQLHSQAIKRLRTKLGKL
>P0A2E8 ~~~fliA~~~RNA polymerase sigma factor FliA~~~
MNSLYTAEGVMDKHSLWQRYVPLVRHEALRLQVRLPASVELDDLLQAGGIGLLNAVDRYDALQGTAFTTYAVQRIRGAML
DELRSRDWVPRSVRRNAREVAQAMGQLEQELGRNATETEVAERLGIPVAEYRQMLLDTNNSQLFSYDEWREEHGDSIELV
TEEHQQENPLHQLLEGDLRQRVMDAIESLPEREQLVLTLYYQEELNLKEIGAVLEVGESRVSQLHSQAIKRLRTKLGKL
>P42272 ~~~fliC1~~~Flagellin 1~~~
MAQVINTNYLSLVTQNNLNKSQGTLGSAIERLSSGLRINSAKDDAAGQAIANRFTSNVNGLTQASRNANDGISIAQTTEG
ALNEINNNLQRIRELTVQAKNGTNSNSDITSIQNEVKNVLDEINRISEQTQFNGVKVLSGEKSEMVIQVGTNDNETIKFN
LDKVDNDTLGVASDKLFDTKTEKKGVTAAGAGVTDAKKINAAATLDMMVSLVKEFNLDGKPVTDKFIVTKGGKDYVATKS
DFELDATGTKLGLKASATTEFKVDAGKDVKTLNVKDDALATLDKAINTIDESRSKLGAIQNRFESTINNLNNTVNNLSAS
RSRILDADYATEVSNMSRGQILQQAGTSVLAQANQVPQTVLSLLR
>P42273 ~~~fliC2~~~Flagellin 2~~~
MAQVINTNYLSLVTQNNLNRSQSALGNAIERLSSGMRINSAKDDAAGQAIANRFTSNINGLTQASRNANDGISVSQTTEG
ALNEINNNLQRIRELTVQAKNGTNSNSDINSIQNEVNQRLDEINRVSEQTQFNGVKVLSGEKSKMTIQVGTNDNEVIEFN
LDKIDNDTLGVASDKLFDAKTEKKGVTAAGDAIDANALGISGSKKYVTGISVKEYKVDGKVSSDKVVLNDGSDDYIVSKS
DFTLKSGTTTGEVEFTGSKTTKFTADAGKDVKVLNVKDDALATLDNAISKVDESRSKLGAIQNRFQSTINNLNNTVNNLS
ASRSRILDADYATEVSNMSKNQILQQAGTAVLAQANQVPQTVLSLLR
>P21184 ~~~fliC~~~A-type flagellin~~~COG1344
MALTVNTNIASLNTQRNLNNSSASLNTSLQRLSTGSRINSAKDDAAGLQIANRLTSQVNGLNVATKNANDGISLAQTAEG
ALQQSTNILQRMRDLSLQSANGSNSDSERTALNGEVKQLQKELDRISNTTTFGGRKLLDGSFGVASFQVGSAANEIISVG
IDEMSAESLNGTYFKADGGGAVTAATASGTVDIAIGITGGSAVNVKVDMKGNETAEQAAAKIAAAVNDANVGIGAFSDGD
TISYVSKAGKDGSGAITSAVSGVVIADTGSTGVGTAAGVTPSATAFAKTNDTVAKIDISTAKGAQSAVLVIDEAIKQIDA
QRADLGAVQNRFDNTINNLKNIGENVSAARGRIEDTDFAAETANLTKNQVLQQAGTAILAQANQLPQSVLSLLR
>P72151 ~~~fliC~~~B-type flagellin~~~
MALTVNTNIASLNTQRNLNASSNDLNTSLQRLTTGYRINSAKDDAAGLQISNRLSNQISGLNVATRNANDGISLAQTAEG
ALQQSTNILQRIRDLALQSANGSNSDADRAALQKEVAAQQAELTRISDTTTFGGRKLLDGSFGTTSFQVGSNAYETIDIS
LQNASASAIGSYQVGSNGAGTVASVAGTATASGIASGTVNLVGGGQVKNIAIAAGDSAKAIAEKMDGAIPNLSARARTVF
TADVSGVTGGSLNFDVTVGSNTVSLAGVTSTQDLADQLNSNSSKLGITASINDKGVLTITSATGENVKFGAQTGTATAGQ
VAVKVQGSDGKFEAAAKNVVAAGTAATTTIVTGYVQLNSPTAYSVSGTGTQASQVFGNASAAQKSSVASVDISTADGAQN
AIAVVDNALAAIDAQRADLGAVQNRFKNTIDNLTNISENATNARSRIKDTDFAAETAALSKNQVLQQAGTAILAQANQLP
QAVLSLLR
>Q8YDM5 ~~~fliC~~~Flagellin~~~COG1344
MASILTNSSALTALQTLASTNKSLESTQNRISTGLRISEASDNASYWSIATSMKSDNKANSAVQDALGLGAGKVDTAYSA
INKIRESVDDIKTKLVSAMGASTEDKGKIETEIKSIVANINSALSNANYAGSNLLNGPTTDLNVVASYNRSGNAVAVDKI
TVKATDTDAKTMVKDIVDAGFFTSASDDTAIGTALNTVETALASLATGAATLGAAKSQIDSQKSFLSGLQDSIEKGVGTL
VDADMNKESARLSALQVQQQLGVQALSIANSSNQSILSLFRG
>B7USU2 ~~~fliC~~~Flagellin~~~
MAQVINTNSLSLITQNNINKNQSALSSSIERLSSGLRINSAKDDAAGQAIANRFTSNIKGLTQAARNANDGISVAQTTEG
ALSEINNNLQRIRELTVQASTGTNSDSDLDSIQDEIKSRLDEIDRVSGQTQFNGVNVLAKDGSMKIQVGANDGQTITIDL
KKIDSDTLGLNGFNVNGKGETANTAATLKDMSGFTAAAAPGGTVGVTQYTDKSAVASSVDILNAVAGADGNKVTTSADVG
FGTPAAAVTYTYNKDTNSYSAASDDISSANLAAFLNPQARDTTKATVTIGGKDQDVNIDKSGNLTAADDGAVLYMDATGN
LTKNNAGGDTQATLAKVATATGAKAATIQTDKGTFTSDGTAFDGASMSIDANTFANAVKNDTYTATVGAKTYSVTTGSAA
ADTAYMSNGVLSDTPPTYYAQADGSITTTEDAAAGKLVYKGSDGKLTTDTTSKAESTSDPLAALDDAISQIDKFRSSLGA
VQNRLDSAVTNLNNTTTNLSEAQSRIQDADYATEVSNMSKAQIIQQAGNSVLAKANQVPQQVLSLLQG
>P04949 ~~~fliC~~~Flagellin~~~COG1344
MAQVINTNSLSLITQNNINKNQSALSSSIERLSSGLRINSAKDDAAGQAIANRFTSNIKGLTQAARNANDGISVAQTTEG
ALSEINNNLQRVRELTVQATTGTNSESDLSSIQDEIKSRLDEIDRVSGQTQFNGVNVLAKNGSMKIQVGANDNQTITIDL
KQIDAKTLGLDGFSVKNNDTVTTSAPVTAFGATTTNNIKLTGITLSTEAATDTGGTNPASIEGVYTDNGNDYYAKITGGD
NDGKYYAVTVANDGTVTMATGATANATVTDANTTKATTITSGGTPVQIDNTAGSATANLGAVSLVKLQDSKGNDTDTYAL
KDTNGNLYAADVNETTGAVSVKTITYTDSSGAASSPTAVKLGGDDGKTEVVDIDGKTYDSADLNGGNLQTGLTAGGEALT
AVANGKTTDPLKALDDAIASVDKFRSSLGAVQNRLDSAVTNLNNTTTNLSEAQSRIQDADYATEVSNMSKAQIIQQAGNS
VLAKANQVPQQVLSLLQG
>Q06971 ~~~fliC~~~Flagellin~~~
MAQVINTNSLSLLTQNNLNKSQSSLSSAIERLSSGLRINSAKDDAAGQAIANRFTSNIKGLTQASRNANDGISIAQTTEG
ALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLEEIDRVSNQTQFNGVKVLSQDNQMKIQVGANDGETITIDL
QKIDVKSLGLDGFNVNGPKEATVGDLKSSFKNVTGYDTYAAGADKYRVDINSGAVVTDAVAPDKVYVNAANGQLTTDDAE
NNTAVDLFKTTKSTAGTAEAKAIAGAIKGGKEGDTFDYKGVTFTIDTKTGDDGNGKVSTTINGEKVTLTVADIAIGAADV
NAATLQSSKNVYTSVVNGQFTFDDKTKNESAKLSDLEANNAVKGESKITVNGAEYTANATGDKITLAGKTMFIDKTASGV
STLINEDAAAAKKSTANPLASIDSALSKVDAVRSSLGAIQNRFDSAITNLGNTVTNLNSARSRIEDADYATEVSNMSKAQ
ILQQAGTSVLAQANQVPQNVLSLLR
>P06179 ~~~fliC~~~Flagellin~~~
MAQVINTNSLSLLTQNNLNKSQSALGTAIERLSSGLRINSAKDDAAGQAIANRFTANIKGLTQASRNANDGISIAQTTEG
ALNEINNNLQRVRELAVQSANSTNSQSDLDSIQAEITQRLNEIDRVSGQTQFNGVKVLAQDNTLTIQVGANDGETIDIDL
KQINSQTLGLDTLNVQQKYKVSDTAATVTGYADTTIALDNSTFKASATGLGGTDQKIDGDLKFDDTTGKYYAKVTVTGGT
GKDGYYEVSVDKTNGEVTLAGGATSPLTGGLPATATEDVKNVQVANADLTEAKAALTAAGVTGTASVVKMSYTDNNGKTI
DGGLAVKVGDDYYSATQNKDGSISINTTKYTADDGTSKTALNKLGGADGKTEVVSIGGKTYAASKAEGHNFKAQPDLAEA
AATTTENPLQKIDAALAQVDTLRSDLGAVQNRFNSAITNLGNTVNNLTSARSRIEDSDYATEVSNMSRAQILQQAGTSVL
AQANQVPQNVLSLLR
>Q9K3C5 ~~~fliD~~~B-type flagellar hook-associated protein 2~~~
MAGISIGVGSTDYTDLVNKMVNLEGAAKTNQLATLEKTTTTRLTALGQFKSAISAFQTALTALNSNAVFMARTAKSSNED
ILKASATQSAVAGTYQIQVNSLATSSKIALQAIADPANAKFNSGTLNISVGDTKLPAITVDSSNNTLAGMRDAINQAGKE
AGVSATIITDNSGSRLVLSSTKTGDGKDIKVEVSDDGSGGNTSLSQLAFDPATAPKLSDGAAAGYVTKAANGEITVDGLK
RSIASNSVSDVIDGVSFDVKAVTEAGKPITLTVSRDDAGVKDNVKKFVEAYNTLTKFINEQTVVTKVGEDKNPVTGALLG
DASVRALVNTMRSELIASNENGSVRNLAALGITTTKDGTLEIDEKKLDKAISADFEGVASYFTGDTGLAKRLGDKMKPYT
DAQGILDQRTTTLQKTLSNVDTQKADLAKRLAALQEKLTTQFNLLSAMQDEMTKRQKSITDNLASLPYGSGKKT
>P39738 ~~~fliD~~~Flagellar hook-associated protein 2~~~COG1345
MVTRITGLASGMDIDDIVSKLMQTERAPLDKLTQKKQTLEWQRDSYREVNSKIKELQDYMSKNTLTYPSTYQSKTVTSSN
ESVLTATGSVSAPNSSSTVEVASLATAATYKANNYTGYTQGDYNLAFNVVAPGETTAKTVNISVTSADTIDNVISKLNSS
DLGVSAFKDKIWNGTEYVETIAFSSKATGAGGSIQAADSATADFMSGQLGFSLDADNKLTAYKEGTNAKVTINGFEMEKL
TNNFTVNGVTYSIKNTTAATGPVTTSVSTDVDGIYNQIKEFVDKYNELVDSLNEKLKEEKYRDYTPLTSEQKEAMSDKEV
ELWEEKAKSGLLRNDSSISTGTNQMRTDFYTQVNADGKTYQLTEFGITTSSAYQLRGHLEINEEKLKAKIAEDPQGVANL
FTSGTNDSNYSDKGIMKRITNTLRSTVKSIEAKAGNSTMGASSYSIGKNLNSISTEITDMQDRLNTIENRYYSKFSAMDS
AIQKMNEQASYLSQLLVQ
>P24216 ~~~fliD~~~Flagellar hook-associated protein 2~~~COG1345
MASISSLGVGSGLDLSSILDSLTAAQKATLTPISNQQSSFTAKLSAYGTLKSALTTFQTANTALSKADLFSATSTTSSTT
AFSATTAGNAIAGKYTISVTHLAQAQTLTTRTTRDDTKTAIATSDSKLTIQQGGDKDPITIDISAANSSLSGIRDAINNA
KAGVSASIINVGNGEYRLSVTSNDTGLDNAMTLSVSGDDALQSFMGYDASASSNGMEVSVAAQNAQLTVNNVAIENSSNT
ISDALENITLNLNDVTTGNQTLTITQDTSKAQTAIKDWVNAYNSLIDTFSSLTKYTAVDAGADSQSSSNGALLGDSTLRT
IQTQLKSMLSNTVSSSSYKTLAQIGITTDPSDGKLELDADKLTAALKKDASGVGALIVGDGKKTGITTTIGSNLTSWLST
TGIIKAATDGVSKTLNKLTKDYNAASDRIDAQVARYKEQFTQLDVLMTSLNSTSSYLTQQFENNSNSK
>P96786 ~~~fliD~~~Flagellar hook-associated protein 2~~~COG1345
MAIGSLSSLGLGSKVLNYDVIDKLKDADEKALIAPLDKKMEQNVEKQKALVEIKTLLSALKGPVKTLSDYSTYISRKSNV
TGDALSASVGVGVPIQDIKVDVQNLAQGDINELGAKFSSRDDIFSQVDTTLKFYTQNKDYAVNIKAGMTLGDVAQSITDA
TNGEVMGIVMKTGGNDPYQLMVNTKNTGEDNRVYFGSHLQSTLTNKNALSLGVDGSGKSEVSLNLKGADGNMHEVPIMLE
LPESASIKQKNTAIQKAMEQALENDPNFKNLIANGDISIDTLHGGESLIINDRRGGNIEVKGSKAKELGFLQTTTQESDL
LKSSRTIKEGKLEGVVSLNGQKLDLSALTKESNTSEENTDAIIQAINAKEGLSAFKNAEGKLVINSKTGMLTIKGEDALG
KASLKDLGLNAGMVQSYEASQNTLFMSKNLQKASDSAFTYNGVSITRPTNEVNDVISGVNITLEQTTEPNKPAIISVSRD
NQAIIDSLTEFVKAYNELIPKLDEDTRYDADTKIAGIFNGVGDIRAIRSSLNNVFSYSVHTDNGVESLMKYGLSLDDKGV
MSLDEAKLSSALNSNPKATQDFFYGSDSKDMGGREIHQEGIFSKFNQVIANLIDGGNAKLKIYEDSLDRDAKSLTKDKEN
AQELLKTRYNIMAERFAAYDSQISKANQKFNSVQMMIDQAAAKKN
>P16328 ~~~fliD~~~Flagellar hook-associated protein 2~~~
MASISSLGVGSNLPLDQLLTDLTKNEKGRLTPITKQQSANSAKLTAYGTLKSALEKFQTANTALNKADLFKSTVASSTTE
DLKVSTTAGAAAGTYKINVTQLAAAQSLATKTTFATTKEQLGDTSVTSRTIKIEQPGRKEPLEIKLDKGDTSMEAIRDAI
NDADSGIAASIVKVKENEFQLVLTANSGTDNTMKITVEGDTKLNDLLAYDSTTNTGNMQELVKAENAKLNVNGIDIERQS
NTVTDAPQGITLTLTKKVTDATVTVTKDDTKAKEAIKSWVDAYNSLVDTFSSLTKYTAVEPGEEASDKNGALLGDSVVRT
IQTGIRAQFANSGSNSAFKTMAEIGITQDGTSGKLKIDDDKLTKVLKDNTAAARELLVGDGKETGITTKIATEVKSYLAD
DGIIDNAQDNVNATLKSLTKQYLSVSNSIDETVARYKAQFTQLDTMMSKLNNTSSYLTQQFTAMNKS
>P26462 ~~~fliE~~~Flagellar hook-basal body complex protein FliE~~~
MAAIQGIEGVISQLQATAMAARGQDTHSQSTVSFAGQLHAALDRISDRQAAARVQAEKFTLGEPGIALNDVMADMQKASV
SMQMGIQVRNKLVAAYQEVMSMQV
>O67241 ~~~fliF~~~Flagellar M-ring protein~~~COG1766
MDKLREYLNLLKERFNALTPVQKALAVGIPLLLLSLGAVALIYLSQENYTVLYTGLSPDDLNAVVTELDKEGVKYKISPD
GRTIYVPENVARELRLKLAAKGVPRKGIVGYELFDKSGIVLSRFQQLVNFKRAIEGELAKTIMSLDCVEFARVHIVLPEK
SLFIREEEEAKASVFLKLKPGCELTPEQVKAIRNLVSGSVENLKPSQVVVVDDKGRDLTAYLDEEFKTNASQLKVKREFE
KSLERKLQKTLEEVFGYGKVKVNVSAELDFSSMKKREELYDPDLTAVVSEQKKKERTTSTRAQGIPGTQANIPPATGRQG
GGELITERKESITNYEVSKREIYFEDKTIKVKRISVGLVIDKDVKVNTEELKNLIIASAGLDPKRGDQVSIVSVPFVKPT
VVAEKPKVPTYVYVAVALVSLVILGLVAFGLVKLLRRRPPAPTPAPAVPGVPPTVEEVRKKTPYEELLEIAKQEPEKVAM
VLKKWLKEG
>Q8YDM4 ~~~fliF~~~Flagellar M-ring protein~~~COG1766
MAVVWMQQNFQQLIEQLKGTLGKLGARKLIALGLVGAALMGAILYTSIYLGRPSYETLYVGLSRDDVNRMGLALGEAGIP
FDVKSDGSSILVPIGKAENARMYLAEKGLPTSNNAGYELFDNMGSLGLTSFMQEITRVRALEGEIARTIQAIRGVKAARV
HIVLAEKGSFRRGDQKPSASVVIRAEGGFSAESAQSIRQLVAAAVPSLDASSVTVLDTNGHLLASAGEGANGAALMTASL
EQQVASHVDDSIRKALAPYLGLGHFQTSVQAALDTDRRQTKETTYDPESRVERSVRVVRESGDSRNNRNDNATGVEQNIP
QEQIQNRNGESSTEKTDRREELTNYEVNSMTVSTVSDGYSIKRLSIAVVIDQARLLQTAGTTPPPANFVDQQITKIRDLV
ATAAGLNTNRGDVINVTAVNFLDSAGADMEPVSAPWTDTLLRQSGSYANALAILAAVGLLIWFGLRPLLRDQNVKPAGTE
VAIREAGEVATPNFIGGAESVGEGVQAVIGGPAAYADQMKTSLSDLRQRMRMPAKLRLEQMIEMDEERVAAVLKQWIHET
ASGREADPAKASAMPELKAA
>P25798 ~~~fliF~~~Flagellar M-ring protein~~~COG1766
MNATAAQTKSLEWLNRLRANPKIPLIVAGSAAVAVMVALILWAKAPDYRTLFSNLSDQDGGAIVSQLTQMNIPYRFSEAS
GAIEVPADKVHELRLRLAQQGLPKGGAVGFELLDQEKFGISQFSEQVNYQRALEGELSRTIETIGPVKGARVHLAMPKPS
LFVREQKSPSASVTVNLLPGRALDEGQISAIVHLVSSAVAGLPPGNVTLVDQGGHLLTQSNTSGRDLNDAQLKYASDVEG
RIQRRIEAILSPIVGNGNIHAQVTAQLDFASKEQTEEQYRPNGDESHAALRSRQLNESEQSGSGYPGGVPGALSNQPAPA
NNAPISTPPANQNNRQQQASTTSNSGPRSTQRNETSNYEVDRTIRHTKMNVGDVQRLSVAVVVNYKTLPDGKPLPLSNEQ
MKQIEDLTREAMGFSEKRGDSLNVVNSPFNSSDESGGELPFWQQQAFIDQLLAAGRWLLVLLVAWLLWRKAVRPQLTRRA
EAMKAVQQQAQAREEVEDAVEVRLSKDEQLQQRRANQRLGAEVMSQRIREMSDNDPRVVALVIRQWINNDHE
>P15928 ~~~fliF~~~Flagellar M-ring protein~~~
MSATASTATQPKPLEWLNRLRANPRIPLIVAGSAAVAIVVAMVLWAKTPDYRTLFSNLSDQDGGAIVAQLTQMNIPYRFA
NGSGAIEVPADKVHELRLRLAQQGLPKGGAVGFELLDQEKFGISQFSEQVNYQRALEGELARTIETLGPVKSARVHLAMP
KPSLFVREQKSPSASVTVTLEPGRALDEGQISAVVHLVSSAVAGLPPGNVTLVDQSGHLLTQSNTSGRDLNDAQLKFAND
VESRIQRRIEAILSPIVGNGNVHAQVTAQLDFANKEQTEEHYSPNGDASKATLRSRQLNISEQVGAGYPGGVPGALSNQP
APPNEAPIATPPTNQQNAQNTPQTSTSTNSNSAGPRSTQRNETSNYEVDRTIRHTKMNVGDIERLSVAVVVNYKTLADGK
PLPLTADQMKQIEDLTREAMGFSDKRGDTLNVVNSPFSAVDNTGGELPFWQQQSFIDQLLAAGRWLLVLVVAWILWRKAV
RPQLTRRVEEAKAAQEQAQVRQETEEAVEVRLSKDEQLQQRRANQRLGAEVMSQRIREMSDNDPRVVALVIRQWMSNDHE
>O66891 ~~~fliG~~~Flagellar motor switch protein FliG~~~COG1536
MAQEKSALSKAQKAAVLLLSLPEEVSMNIVKELSEEELQKLFALAKDLESVPEEEIENIAEELLDEIKKAGIKIKKPEEF
IENIKKVIPPTLAEKFRGILELGDAEKILKEIEKVDSRILASLLKNEHPQTIALFLSQLSPKKSAEIIQNLPEELKKEVV
KRIATLENVNVQYVKELAQILLEEISSLGAKEALKLEGTAVAAELLNTLDKETRELILQSIGQEDPLLEERIREKMFTFE
DIRKLSDRDIIEILKVVDKNTLMIALLGAPEDIKQKFLSNMSKRAAKLFLEDMEALGPVKKSEIEKAQRQVVNIIRKMID
EGKIEIGD
>P0ABZ1 ~~~fliG~~~Flagellar motor switch protein FliG~~~COG1536
MSNLTGTDKSVILLMTIGEDRAAEVFKHLSQREVQTLSAAMANVTQISNKQLTDVLAEFEQEAEQFAALNINANDYLRSV
LVKALGEERAASLLEDILETRDTASGIETLNFMEPQSAADLIRDEHPQIIATILVHLKRAQAADILALFDERLRHDVMLR
IATFGGVQPAALAELTEVLNGLLDGQNLKRSKMGGVRTAAEIINLMKTQQEEAVITAVREFDGELAQKIIDEMFLFENLV
DVDDRSIQRLLQEVDSESLLIALKGAEQPLREKFLRNMSQRAADILRDDLANRGPVRLSQVENEQKAILLIVRRLAETGE
MVIGSGEDTYV
>O25119 ~~~fliG~~~Flagellar motor switch protein FliG~~~COG1536
MATKLTPKQKAQLDELSMSEKIAILLIQVGEDTTGEILRHLDIDSITEISKQIVQLNGTDKQIGAAVLEEFFAIFQSNQY
INTGGLEYARELLTRTLGSEEAKKVMDKLTKSLQTQKNFAYLGKIKPQQLADFIINEHPQTIALILAHMEAPNAAETLSY
FPDEMKAEISIRMANLGEISPQVVKRVSTVLENKLESLTSYKIEVGGLRAVAEIFNRLGQKSAKTTLARIESVDNKLAGA
IKEMMFTFEDIVKLDNFAIREILKVADKKDLSLALKTSTKDLTDKFLNNMSSRAAEQFVEEMQYLGAVKIKDVDVAQRKI
IEIVQSLQEKGVIQTGEEEDVIE
>P0A1J9 ~~~fliG~~~Flagellar motor switch protein FliG~~~
MSNLSGTDKSVILLMTIGEDRAAEVFKHLSTREVQALSTAMANVRQISNKQLTDVLSEFEQEAEQFAALNINANEYLRSV
LVKALGEERASSLLEDILETRDTTSGIETLNFMEPQSAADLIRDEHPQIIATILVHLKRSQAADILALFDERLRHDVMLR
IATFGGVQPAALAELTEVLNGLLDGQNLKRSKMGGVRTAAEIINLMKTQQEEAVITAVREFDGELAQKIIDEMFLFENLV
DVDDRSIQRLLQEVDSESLLIALKGAEPPLREKFLRNMSQRAADILRDDLANRGPVRLSQVENEQKAILLIVRRLAETGE
MVIGSGEDTYV
>Q9WY63 ~~~fliG~~~Flagellar motor switch protein FliG~~~COG1536
MPEKKIDGRRKAAVLLVALGPEKAAQVMKHLDEETVEQLVVEIANIGRVTPEEKKQVLEEFLSLAKAKEMISEGGIEYAK
KVLEKAFGPERARKIIERLTSSLQVKPFSFVRDTDPVQLVNFLQSEHPQTIAVVLSYLDPPVAAQILGALPEELQTEVLK
RIALLERTSPEVVKEIERNLEKKISGFVSRTFSKVGGIDTAAEIMNNLDRTTEKKIMDKLVQENPELADEIRRRMFVFED
ILKLDDRSIQLVLREVDTRDLALALKGASDELKEKIFKNMSKRAAALLKDELEYMGPVRLKDVEEAQQKIINIIRRLEEA
GEIVIARGGGEELIM
>P15934 ~~~fliH~~~Flagellar assembly protein FliH~~~
MSNELPWQVWTPDDLAPPPETFVPVEADNVTLTEDTPEPELTAEQQLEQELAQLKIQAHEQGYNAGLAEGRQKGHAQGYQ
EGLAQGLEQGQAQAQTQQAPIHARMQQLVSEFQNTLDALDSVIASRLMQMALEAARQVIGQTPAVDNSALIKQIQQLLQQ
EPLFSGKPQLRVHPDDLQRVEEMLGATLSLHGWRLRGDPTLHHGGCKVSADEGDLDASVATRWQELCRLAAPGVL
>P52612 7.1.2.2~~~fliI~~~Flagellum-specific ATP synthase~~~COG1157
MTTRLTRWLTTLDNFEAKMAQLPAVRRYGRLTRATGLVLEATGLQLPLGATCVIERQNGSETHEVESEVVGFNGQRLFLM
PLEEVEGVLPGARVYAKNISAEGLQSGKQLPLGPALLGRVLDGSGKPLDGLPSPDTTETGALITPPFNPLQRTPIEHVLD
TGVRPINALLTVGRGQRMGLFAGSGVGKSVLLGMMARYTRADVIVVGLIGERGREVKDFIENILGAEGRARSVVIAAPAD
VSPLLRMQGAAYATRIAEDFRDRGQHVLLIMDSLTRYAMAQREIALAIGEPPATKGYPPSVFAKLPALVERAGNGISGGG
SITAFYTVLTEGDDQQDPIADSARAILDGHIVLSRRLAEAGHYPAIDIEASISRAMTALISEQHYARVRTFKQLLSSFQR
NRDLVSVGAYAKGSDPMLDKAIALWPQLEGYLQQGIFERADWEASLQGLERIFPTVS
>O07025 7.1.2.2~~~fliI~~~Flagellum-specific ATP synthase~~~COG1157
MPLKSLKNRLNQHFDLSPRYGSVKKIMPNIVYADGFNPSVGDVVKIEKSDGSECVGMVVVAEKEQFGFTPFNFIEGARAG
DKVLFLKEGLNFPVGRNLLGRVLNPLGQVIDNKGALDYERLAPVITTPIAPLKRGLIDEIFSVGVKSIDGLLTCGKGQKL
GIFAGSGVGKSTLMGMITRGCLAPIKVIALIGERGREIPEFIEKNLKGDLSSCVLVVATSDDSPLMRKYGAFCAMSVAEY
FKNQGLDVLFIMDSVTRFAMAQREIGLALGEPPTSKGYPPSALSLLPQLMERAGKEENKGSITAFFSVLVEGDDLSDPIA
DQTRSILDGHIVLSRELTDYGIYPPINILNSASRVAKDIISESQNLCARKFRRLYALLKENEMLIRIGSYQMGNDKELDE
AIKKKALMEQFLAQDENALQPFETSFQQLEEILR
>P26465 7.1.2.2~~~fliI~~~Flagellum-specific ATP synthase~~~
MTTRLTRWLTALDNFEAKMALLPAVRRYGRLTRATGLVLEATGLQLPLGATCIIERQDGPETKEVESEVVGFNGQRLFLM
PLEEVEGILPGARVYARNGHGDGLQSGKQLPLGPALLGRVLDGGGKPLDGLPAPDTLETGALITPPFNPLQRTPIEHVLD
TGVRAINALLTVGRGQRMGLFAGSGVGKSVLLGMMARYTRADVIVVGLIGERGREVKDFIENILGPDGRARSVVIAAPAD
VSPLLRMQGAAYATRIAEDFRDRGQHVLLIMDSLTRYAMAQREIALAIGEPPATKGYPPSVFAKLPALVERAGNGIHGGG
SITAFYTVLTEGDDQQDPIADSARAILDGHIVLSRRLAEAGHYPAIDIEASISRAMTALITEQHYARVRLFKQLLSSFQR
NRDLVSVGAYAKGSDPMLDKAITLWPQLEAFLQQGIFERADWEDSLQALDLIFPTV
>P20487 ~~~fliJ~~~Flagellar FliJ protein~~~COG2882
MAYQFRFQKLLELKENEKDQSLSEYQQSVSEFENVAEKLYENMSKKELLEQNKEKKLKSGMSVQEMRHYQQFVSNLDNTI
YHYQKLVIMKRNQMNQKQEILTEKNIEVKKFEKMREKQFKMFALEDKAAEMKEMDDISIKQFMIQGH
>P52613 ~~~fliJ~~~Flagellar FliJ protein~~~COG2882
MAEHGALATLKDLAEKEVEDAARLLGEMRRGCQQAEEQLKMLIDYQNEYRNNLNSDMSAGITSNRWINYQQFIQTLEKAI
TQHRQQLNQWTQKVDIALNSWREKKQRLQAWQTLQERQSTAALLAENRLDQKKMDEFAQRAAMRKPE
>P0A1K1 ~~~fliJ~~~Flagellar FliJ protein~~~
MAQHGALETLKDLAEKEVDDAARLLGEMRRGCQQAEEQLKMLIDYQNEYRSNLNTDMGNGIASNRWINYQQFIQTLEKAI
EQHRLQLTQWTQKVDLALKSWREKKQRLQAWQTLQDRQTAAALLAENRMDQKKMDEFAQRAAMRKPE
>P26416 ~~~fliK~~~Flagellar hook-length control protein~~~
MITLPQLITTDTDMTAGLTSGKTTGSAEDFLALLAGALGADGAQGKDARITLADLQAAGGKLSKELLTQHGEPGQAVKLA
DLLAQKANATDETLTDLTQAQHLLSTLTPSLKTSALAALSKTAQHDEKTPALSDEDLASLSALFAMLPGQPVATPVAGET
PAENHIALPSLLRGDMPSAPQEETHTLSFSEHEKGKTEASLARASDDRATGPALTPLVVAAAATSAKVEVDSPPAPVTHG
AAMPTLSSATAQPLPVASAPVLSAPLGSHEWQQTFSQQVMLFTRQGQQSAQLRLHPEELGQVHISLKLDDNQAQLQMVSP
HSHVRAALEAALPMLRTQLAESGIQLGQSSISSESFAGQQQSSSQQQSSRAQHTDAFGAEDDIALAAPASLQAAARGNGA
VDIFA
>B8GXB6 ~~~fliL~~~Flagellar FliL protein~~~
MAKKPEKEAPAPEGEEGAEGEAPAKKKPPILIIAIAAGVLVLGGGGAAAFFLLKPKPAAEAGEHGEKKEEKKKEKKKEEK
GDKKDAEKGAEGAAGTPVIKEGPDGVVFYTLPDIVVNMQTADGKSTFLKLKLTFELPDEETADELTPNLPRLQDMFQTFL
RELRPEDLNGSQGTYQLRVELLRRVNLVAAPAKVNAVLIEEMLIN
>P23453 ~~~fliM~~~Flagellar motor switch protein FliM~~~COG1868
MSGEVLSQNEIDALLSAISTGEMDAEELKKEEKEKKVKVYDFKRALRFSKDQIRSLTRIHDNFARLLTTHFSAQLRTYIH
ISVSSVDQVPYEEFIRSIPNMTILNLFDVHPMEGRIMMEVNPTIAYTMMDRVMGGIGISHNKVDSLTEIETKIISNLFEN
ALGNYKEAWQSIADIEPEMTEFEVNPQFVQMVSPNETVVVISLNTQIGEISGVINLCIPHIVLEPLIPKLSVHYWMQSDR
NEPKPEETKSLEKRIMTAQIPVVAELGTSELTIEEFLSLEVGDCITLDKSVTDPLTVLVGDKPKFLGQAGRVNRKQAVQI
LDHDIRGEQDGE
>P06974 ~~~fliM~~~Flagellar motor switch protein FliM~~~COG1868
MGDSILSQAEIDALLNGDSEVKDEPTASVSGESDIRPYDPNTQRRVVRERLQALEIINERFARHFRMGLFNLLRRSPDIT
VGAIRIQPYHEFARNLPVPTNLNLIHLKPLRGTGLVVFSPSLVFIAVDNLFGGDGRFPTKVEGREFTHTEQRVINRMLKL
ALEGYSDAWKAINPLEVEYVRSEMQVKFTNITTSPNDIVVNTPFHVEIGNLTGEFNICLPFSMIEPLRELLVNPPLENSR
NEDQNWRDNLVRQVQHSQLELVANFADISLRLSQILKLNPGDVLPIEKPDRIIAHVDGVPVLTSQYGTLNGQYALRIEHL
INPILNSLNEEQPK
>P26418 ~~~fliM~~~Flagellar motor switch protein FliM~~~
MGDSILSQAEIDALLNGDSDTKDEPTPGIASDSDIRPYDPNTQRRVVRERLQALEIINERFARQFRMGLFNLLRRSPDIT
VGAIRIQPYHEFARNLPVPTNLNLIHLKPLRGTGLVVFSPSLVFIAVDNLFGGDGRFPTKVEGREFTHTEQRVINRMLKL
ALEGYSDAWKAINPLEVEYVRSEMQVKFTNITTSPNDIVVNTPFHVEIGNLTGEFNICLPFSMIEPLRELLVNPPLENSR
HEDQNWRDNLVRQVQHSELELVANFADIPLRLSQILKLKPGDVLPIEKPDRIIAHVDGVPVLTSQYGTVNGQYALRVEHL
INPILNSLNEEQPK
>Q9WZE6 ~~~fliM~~~Flagellar motor switch protein FliM~~~COG1868
MSDVLSQEEINQLIEALMKGELKEEDLLKEEEEKKVKPYDFKRPSKFSKEQLRTFQMIHENFGRALSTYLSGRLRTFVDV
EISIDQLTYEEFIRSVMIPSFIVIFTGDVFEGSAIFEMRLDLFYTMLDIIMGGPGENPPNRPPTEIETSIMRKEVTNMLT
LLAQAWSDFQYFIPSIENVETNPQFVQIVPPNEIVLLVTASVSWGEFTSFINVCWPFSLLEPLLEKLSDRFWMMGRKPEK
VEERMEELRKASQKIPVTVQAVIGETELRLKEILDLEVGDVIRLGTHYKDEIRIDVEGRPKFRGIPGVFKGKYAVKVTGE
FTNGGEYE
>P15070 ~~~fliN~~~Flagellar motor switch protein FliN~~~COG1886
MSDMNNPADDNNGAMDDLWAEALSEQKSTSSKSAAETVFQQFGGGDVSGTLQDIDLIMDIPVKLTVELGRTRMTIKELLR
LTQGSVVALDGLAGEPLDILINGYLIAQGEVVVVADKYGVRITDIITPSERMRRLSR
>P26419 ~~~fliN~~~Flagellar motor switch protein FliN~~~
MSDMNNPSDENTGALDDLWADALNEQKATTTKSAADAVFQQLGGGDVSGAMQDIDLIMDIPVKLTVELGRTRMTIKELLR
LTQGSVVALDGLAGEPLDILINGYLIAQGEVVVVADKYGVRITDIITPSERMRRLSR
>P0A1L1 ~~~fliO~~~Flagellar protein FliO~~~
MMKTEATVSQPTAPAGSPLMQVSGALIGIIALILAAAWVIKRMGFAPKGNSVRGLKVSASASLGPRERVVIVEVENARLV
LGVTASQINLLHTLPPAENDTEAPVAPPADFQNMMKSLLKRSGRS
>A6QG57 ~~~flr~~~FPRL1 inhibitory protein~~~
MKKNITKTIIASTVIAAGLLTQTNDAKAFFSYEWKGLEIAKNLADQAKKDDERIDKLMKESDKNLTPYKAETVNDLYLIV
KKLSQGDVKKAVVRIKDDGPRDYYTFDLTRPLEENRKNIKVVKNGEIDSITWY
>P54700 ~~~fliP~~~Flagellar biosynthetic protein FliP~~~
MRRLLFLSLAGLWLFSPAAAAQLPGLISQPLAGGGQSWSLSVQTLVFITSLTFLPAILLMMTSFTRIIIVFGLLRNALGT
PSAPPNQVLLGLALFLTFFIMSPVIDKIYVDAYQPFSEQKISMQEALDKGAQPLRAFMLRQTREADLALFARLANSGPLQ
GPEAVPMRILLPAYVTSELKTAFQIGFTIFIPFLIIDLVIASVLMALGMMMVPPATIALPFKLMLFVLVDGWQLLMGSLA
QSFYS
>P0A1L6 ~~~fliQ~~~Flagellar biosynthetic protein FliQ~~~COG1987
MTPESVMMMGTEAMKVALALAAPLLLVALITGLIISILQAATQINEMTLSFIPKIVAVFIAIIVAGPWMLNLLLDYVRTL
FSNLPYIIG
>P0A1L5 ~~~fliQ~~~Flagellar biosynthetic protein FliQ~~~
MTPESVMMMGTEAMKVALALAAPLLLVALITGLIISILQAATQINEMTLSFIPKIVAVFIAIIVAGPWMLNLLLDYVRTL
FSNLPYIIG
>P54702 ~~~fliR~~~Flagellar biosynthetic protein FliR~~~
MIQVTSEQWLYWLHLYFWPLLRVLALISTAPILSERAIPKRVKLGLGIMITLVIAPSLPANDTPLFSIAALWLAMQQILI
GIALGFTMQFAFAAVRTAGEFIGLQMGLSFATFVDPGSHLNMPVLARIMDMLAMLLFLTFNGHLWLISLLVDTFHTLPIG
SNPVNSNAFMALARAGGLIFLNGLMLALPVITLLLTLNLALGLLNRMAPQLSIFVIGFPLTLTVGIMLMAALMPLIAPFC
EHLFSEIFNLLADIVSEMPINNNP
>P39739 ~~~fliS~~~Flagellar secretion chaperone FliS~~~COG1516
MAIQNPYTAYQQNSVNTATPGELTLMLYNGCLKFIRLAAQAIENDDMERKNENLIKAQNIIQELNFTLNRNIELSASMGA
MYDYMYRRLVQANIKNDTGMLAEVEGYVTDFRDAWKQAIQSERKDRHGSGGIA
>P0DPD3 ~~~fliS~~~Flagellar secretion chaperone FliS~~~
MQNNLAYNAYSQNQAGIESPQKLIEMLYEGILRFCARAKVAIRNEDIEQRVYFVKRTTAIFIELINTLDYEKGGEVAHYL
SGLYTREIQLLSLANLENNEDRINEVINVTKGLLEAWREVHNNETVAQ
>P26609 ~~~fliS~~~Flagellar secretion chaperone FliS~~~
MYTASGIKAYAQVSVESAVMSASPHQLIEMLFDGANSALVRARLFLEQGDVVAKGEALSKAINIIDNGLKAGLDQEKGGE
IATNLSELYDYMIRRLLQANLRNDAQAIEEVEGLLSNIAEAWKQISPKASFQESR
>P39740 ~~~fliT~~~Flagellar protein FliT~~~
MNNIDQLYTETKSMLSHIQNTPESDELLKQIEDFVATRSELIQEISLPLSEEERKQMKLILTWDQLIVKEMERLKQSIAT
ELQQMKRKRVMHTTYLNPYNNITIDGTYYDKRK
>P0ABY2 ~~~fliT~~~Flagellar protein FliT~~~
MNHAPHLYFAWQQLVEKSQLMLRLATEEQWDELIASEMAYVNAVQEIAHLTEEVDPSTTMQEQLRPMLRLILDNESKVKQ
LLQIRMDELAKLVGQSSVQKSVLSAYGDQGGFVLAPQDNLF
>P0A1N3 ~~~fliT~~~Flagellar protein FliT~~~
MTSTVEFINRWQRIALLSQSLLELAQRGEWDLLLQQEVSYLQSIETVMEKQTPPGITRSIQDMVAGYIKQTLDNEQLLKG
LLQQRLDELSSLIGQSTRQKSLNNAYGRLSGMLLVPDAPGAS
>P0A1N2 ~~~fliT~~~Flagellar protein FliT~~~
MTSTVEFINRWQRIALLSQSLLELAQRGEWDLLLQQEVSYLQSIETVMEKQTPPGITRSIQDMVAGYIKQTLDNEQLLKG
LLQQRLDELSSLIGQSTRQKSLNNAYGRLSGMLLVPDAPGAS
>A1JSR8 ~~~fliT~~~Flagellar protein FliT~~~
MERHQHLLSEYQQILTLSEQMLVLATEGNWDALVDLEMTYLKAVESTANITISSCSSLMLQDLLREKLRAILDNEIEIKR
LLQLRLDRLSDLVGQSTKQQAVNNTYGQFPDHALLLGETQ
>P37587 ~~~fliU~~~Flagellar biosynthetic protein FliU~~~
MKEITVTEPAFVTHFSCSGSACSDHCCKGWKITLDKTTVKKYLASKDATIRTIAQDNIILLKKNNSHWGEIKLPSALGNC
PYLDEDRLCRVQKTLGAKALSHTCSSFPRAHHTYKNEVRNSLSLACPEVTSRILNDPDAMALGEKKSFSRHSILRRYFQR
SKSYSICFA
>P37588 ~~~fliV~~~Flagellar biosynthetic protein FliV~~~
MNIAPDSKVKTSLVLQMQNYFRSLPLNRGSVILDHYIQCLLRVLTAEEGVSMEQKVSDIESSLARCLQADEQQKNWAFRN
LILYKIWENNLGNQPNVDPLRALYIIVAEYAFIKLLTAASVHERGRLEWDDVTNIVYSFHSRSQHNSEVAKNFHRHIETV
RTGDDLSMIHLLT
>O25769 ~~~fliW1~~~Flagellar assembly factor FliW 1~~~COG1699
MNYFLKAPILGFEHINEVRLEKIDSLFSRLISQTNSPMALDMVLVNPYCLREYSFVIPKYIELLLELDSHSKVEVYCVVV
LQKNLEDSMVNFLAPLVFNSKNGFGAQVALSMMDYPDFGFRDPLKSFVIQERERA
>O25929 ~~~fliW2~~~Flagellar assembly factor FliW 2~~~COG1699
MIFDVKAPILGFETIHKMRLQKIDEIFLRLNSTEENSVVSFTLVNPFALRKYEFEVPTPLKILLELEGAKSVLVANIMVV
QTPIELSTVNYLAPLIFNLDKQLMGQVVLDSNKYPHYHLRENILSHTHE
>P96503 ~~~fliW~~~Flagellar assembly factor FliW~~~COG1699
MIIHTKYHGQMNIKEEQIILFESGIPGFLEEKQFVILPLSEDSPFVALQSVTSENLAFIVVSPFIFFKNYEFDLDESTAE
LLDIDNIQDVEVMTILTMAEPFEKSTANLLAPIIVNRKNMMAKQVVLHDSSYTTKHPIGGESC
>A8FMC8 ~~~fliW~~~Flagellar assembly factor FliW~~~
MTLAVKCPILGFEETKNMEFSTIDEVFVRLKSLDGKDFSFVLINPYLIRPDYEFDIPTYYQELLSLTPESNMKIFNIVAI
AKSIEESTVNFLAPVVINLDNNTMVQVILDTVNYPDFFQADQIANYIKK
>Q0P9H9 ~~~fliW~~~Flagellar assembly factor FliW~~~COG1699
MTLAVKCPILGFEETKNMEFSTIDEVFVRLKSLDGKDFSFVLINPYLIRPDYEFDIPTYYQELLSLTPESNMKIFNIVAI
AKSIEESTVNFLAPVVINLDNNTMVQVILDTVNYPDFFQADQIANYIKK
>Q72EP7 ~~~fliW~~~Flagellar assembly factor FliW~~~COG1699
MARQNEIEIQTRIGRQRITLDKIIHFPRGLAGFEGRHDFTLLQLREGAPFLVLQSLDDPGLGLLVADPYSFLTDYQIRVG
DPEQRLLKLENIRQVAVLVTVSIPAGQPEKTALNLTGPILINHRARIGLQVPQTDASLPPQFYLHMDDANGSTTVRRKAS
PPAAGEDKGDVQE
>A4ISV0 ~~~fliW~~~Flagellar assembly factor FliW~~~COG1699
MKIATKYHGDIEIHEKDIVRFEQGIPGFLEEKQFVLLPLEDTPFIILQSVNTPALGFVLIEPFSYFPTYEIDLDDNTLEQ
LQITGEQDVALYVILTVADPFDDTTANLQAPIVINVHKRLGKQVILTNTNYKTKHRLFPEKVAK
>Q9K6V7 ~~~fliW~~~Flagellar assembly factor FliW~~~COG1699
MKVIETKYNGKLEVAEDRLIAFDQGIPAFEDEKEFVLLPFEEGTPYYTLQSTKTVDLAFIIVNPFSFFPEYRVKLPEATI
VQLNITNENDVAIFSLLTVKEPFSETTVNLQAPIVINANKQMGKQLVLGDTAYDRKQPLFQKELVLAKEAK
>O83664 ~~~fliW~~~Flagellar assembly factor FliW~~~COG1699
MEIQTKTLGTQTVEAHQIITLERGLYGFEKYHRFALFDAVQVPFIHMQSLDDPALSFIAIDPFLFRPDYELDIDDVLLQP
LDISSPTDVLVFALVTIPPDGSAVTANLQGPLIVNKKNRKAMQVAMGGDRWRTKHDIVAEMAERRAQEQC
>O32348 ~~~fliX~~~Flagellar assembly protein FliX~~~
MKVSSTGGVSATGASRAKPAGGSSGFSLPSVNAASGAASTASVGGLTGVGSVDALLALQAAGSVGGPLERRKRAVRRADN
ILDILGEVRIALIDGDISHATLDRLSRAIREQREATDDPRLEGVLNEIETRAAVELAKLQARAG
>P52627 ~~~fliZ~~~Regulator of sigma S factor FliZ~~~
MMVQHLKRRPLSRYLKDFKHSQTHCAHCRKLLDRITLVRDGKIVNKIEISRLDTLLDENGWQTEQKSWAALCRFCGDLHC
KTQSDFFDIIGFKQFLFEQTEMSPGTVREYVVRLRRLGNHLHEQNISLDQLQDGFLDEILAPWLPTTSTNNYRIALRKYQ
HYQRQTCTRLVQKSSSLPSSDIY
>P52616 ~~~fljB~~~Phase 2 flagellin~~~
MAQVINTNSLSLLTQNNLNKSQSALGTAIERLSSGLRINSAKDDAAGQAIANRFTANIKGLTQASRNANDGISIAQTTEG
ALNEINNNLQRVRELAVQSANSTNSQSDLDSIQAEITQRLNEIDRVSGQTQFNGVKVLAQDNTLTIQVGANDGETIDIDL
KQINSQTLGLDSLNVQKAYDVKDTAVTTKAYANNGTTLDVSGLDDAAIKAATGGTNGTASVTGGAVKFDADNNKYFVTIG
GFTGADAAKNGDYEVNVATDGTVTLAAGATKTTMPAGATTKTEVQELKDTPAVVSADAKNALIAGGVDATDANGAELVKM
SYTDKNGKTIEGGYALKAGDKYYAADYDEATGAIKAKTTSYTAADGTTKTAANQLGGVDGKTEVVTIDGKTYNASKAAGH
DFKAQPELAEAAAKTTENPLQKIDAALAQVDALRSDLGAVQNRFNSAITNLGNTVNNLSEARSRIEDSDYATEVSNMSRA
QILQQAGTSVLAQANQVPQNVLSLLR
>P18913 ~~~fljK~~~Flagellin FljK~~~COG1344
MALNSINTNAGAMIALQNLNGTNSELTTVQQRINTGKKIASAKDNGAIWATAKNQSATAASMNAVKDSLQRGQSTIDVAL
AAGDTITDLLGKMKEKALAASDTSLNTASFNALKSDFDSLRDQIEKAATNAKFNGVSIADGSTTKLTFLANSDGSGFTVN
AKTISLAGIGLTTTSTFTTAAAAKTMIGTIDTALQTATNKLASLGTSSVGLDTHLTFVGKLQDSLDAGVGNLVDADLAKE
SAKLQSLQTKQQLGVQALSIANQSSSSILSLFR
>Q7CQ37 ~~~flk~~~Flagellar regulator flk~~~
MHPISGAPAQPPGEGRNPLSAASEQPLSMQQRTVLERLITRLISLTQQQSAEVWAGMKHDLGIKNDAPLLSRHFPAAEQN
LTQRLGVAQQNHANRQVLSQLTELLGVGNNRQAVSDFIRQQYGQTALSQLTPDQLKNVLTLLQQGQLSIPQPQQRPATDR
PLLPAEHNTLNQLVTKLAAATGESNKLIWQSMLELSGVKSGELIPAKQFTHLATWLQARQTLSLQHAPTLHTLQAALKQP
LEPDELTAIKEYAQHTYQIQPQTVLTTAQVQDLLNHIFLRRVEREADELEPLSIQPIYRPFAPMIETVKNLSARPGLLFI
ALIIVLALFWLVS
>Q1EMV2 3.1.2.29~~~flK~~~Fluoroacetyl-CoA thioesterase~~~
MKDGMRVGERFTHDFVVPPHKTVRHLYPESPEFAEFPEVFATGFMVGLMEWACVRAMAPYLEPGEGSLGTAICVTHTAAT
PPGLTVTVTAELRSVEGRRLSWRVSAHDGVDEIGSGTHERAVIHLEKFNAKVRQKTPAG
>P62670 ~~~flmA~~~Stable plasmid inheritance protein~~~
MKLPRSSLVWCVLIVCLTLLIFTYLTRKSLCEIRYRDGYREVAAFMAYESGK
>P11519 ~~~flmC~~~Protein FlmC~~~
MLRQHQDSLLLRFAQGEEGHETTTQLSCLVCVDRVSHTVDIHLSDTKIAVRDSLQRRIQGGGGFHGLRIR
>Q93UV4 1.1.1.256~~~flnB~~~Fluoren-9-ol dehydrogenase~~~
MSESGGGTVATARQRQLVERALGEWQGEVAGRVIVVTGGARGIGRSLCEGLLRAGAKVVAADLTWDDADDFRKQLESDGS
GMAVDMDITDDDALDAARDAVIDRFGTVDVLVNNASLVSETLFPPTGHRNTLDTTDRDWEVMFGVNVFGTLKAIRRFIEP
MRAQQRGSIVNVVSSGVLAVAAGGGYHGLRPWTVEMPYQATKAAVMALTFYLAEEVRGDGVAVNAIMPGHTRASWFDATA
RAFNEQGIAYFMRPAIPEHLLPISLFLAAQDSAGASGRLYYVPEWNYDHGYGDYAAWQDHELPPDMEEIYSRLEAATPSY
ERAGVAHLPFDAQGALYAAGMANLGAQNSWTSNDSAQ
>P54466 ~~~floA~~~Flotillin-like protein FloA~~~COG4864
MDPSTLMILAIVAVAIIVLAVFFTFVPVMLWISALAAGVKISIFTLVGMRLRRVIPNRVVNPLIKAHKAGLNVGTNQLES
HYLAGGNVDRVVNALIAAQRANIELTFERCAAIDLAGRDVLEAVQMSVNPKVIETPFIAGVAMDGIEVKAKARITVRANI
ERLVGGAGEETIVARVGEGIVSTIGSSDNHKKVLENPDMISQTVLGKGLDSGTAFEILSIDIADVDIGKNIGAILQTDQA
EADKNIAQAKAEERRAMAVAQEQEMRARVEEMRAKVVEAEAEVPLAMAEALREGNIGVMDYMNIKNIDADTEMRDSFGKL
TKDPSDEDRKS
>Q9ZEF9 ~~~floA~~~Flotillin-like protein FloA~~~
MEVGSVLFFVVIGLAIIALAVFFTFVPIMLWISALAAGVRISIFTLVGMRLRRVIPSRVVNPLIKASKAGLGITINQLES
HYLAGGNVDRVVNALIAAHRANIELTFERGAAIDLAGRDVLEAVQMSVNPKVIETPFIAGVAMDGIEVKAKARITVRANI
DRLVGGAGEETIIARVGEGIVSTIGSQTDHKKVLENPDMISQTVLGKGLDSGTAFEILSIDIADIDIGKNIGAVLQTDQA
EADKNIAQAKAEERRAMAVAQEQEMRAKVEEMRAKVVEAEAEVPLAMAEALRSGNIGVMDYMNIQNLTADTDMRDSIGKM
SKEDDEK
>Q7A5C5 ~~~floA~~~Flotillin-like protein FloA~~~
MFSLSFIVIAVIIIVALLILFSFVPIGLWISALAAGVHVGIGTLVGMRLRRVSPRKVIAPLIKAHKAGLALTTNQLESHY
LAGGNVDRVVDANIAAQRADIDLPFERAAAIDLAGRDVLEAVQMSVNPKVIETPFIAGVAMNGIEVKAKARITVRANIAR
LVGGAGEETIIARVGEGIVSTIGSSKHHTEVLENPDNISKTVLSKGLDSGTAFEILSIDIADVDISKNIGADLQTEQALA
DKNIAQAKAEERRAMAVATEQEMKARVQEMHAKVVEAESEVPLAMAEALRSGNISVKDYYNLKNIEADTGMRNAINKRTD
QSDDESPEH
>O32076 ~~~floT~~~Flotillin-like protein FloT~~~COG2268
MTMPIIMIIGVVFFLLIALIAVFITKYRTAGPDEALIVTGSYLGNKNVHVDEGGNRIKIVRGGGTFVLPVFQQAEPLSLL
SSKLDVSTPEVYTEQGVPVMADGTAIIKIGGSIGEIATAAEQFLGKSKDDREQEAREVLEGHLRSILGSMTVEEIYKNRE
KFSQEVQRVASQDLAKMGLVIVSFTIKDVRDKNGYLESLGKPRIAQVKRDADIATAEADKETRIKRAEADKDAKKSELER
ATEIAEAEKINQLKMAEFRREQDTAKANADQAYDLETARARQQVTEQEMQVKIIERQKQIELEEKEILRRERQYDSEVKK
KADADRYSVEQSAAAEKAKQLAEADAKKYSIEAMAKAEAEKVRIDGLAKAEAEKAKGETEAEVIRLKGLAEAEAKEKIAA
AFEQYGQAAIFDMIVKMLPEYAKQAAAPLSNIDKITVVDTGGSGESSGANKVTSYATNLMSSLQESLKASSGIDVKEMLE
NFSGKGNVKQSINELTNEIKEAKTIQKSE
>Q7A3Q5 ~~~flp~~~Protein flp~~~
MTTKKLYFLSISIIILVAISIAIYITLNSNTKTRLTNDSQQQIDKIIEHDLQKGHIPGASILIVKNGKVFLNKGYGYQDV
DKKVKASPTTKYEIASNTKAFTGLAILKLAQEGRLNLNDDVSKHVPHFKMNYNGQNETITIKQLLAQTSGIPSDITSEDA
VTNKNNRLNDVTRAIMGDELHHKPGEEFEYSNMNYDLLGLIIQNVTKQSYTKYITNSWLKPLHMTHTSFKQTNNKSKHDA
IGYELQGSTPVVSKPEFNLWDTPSAYMMTSTEDLEHWIKFQLNPPDKYKSLVQQSHKNLSSTIGEPNANAYASGWFTNND
EHLVFHSGTLDNFSSFILLNPKQNYGIVVLANLNSEYVPKLVEHLNTQIVNHKRYSTVASILNQYKDQFNIVTVLMTTLI
LLAFIFSAYRAWQMRHGQILLRRSKRIAVLSWLTLCLCIAIALILYALPYLILGSNNWSFVLTWLPIEIKLALITTLIAL
FSTLIVILLFLHTKITKT
>Q9WZP3 ~~~~~~Ferritin-like protein~~~
MADQYHEPVSELTGKDRDFVRALNSLKEEIEAVAWYHQRVVTTKDETVRKILEHNRDEEMEHAAMLLEWLRRNMPGWDEA
LRTYLFTDKPITEIEEETSGGSENTGGDLGIRKL
>P9WQL7 7.6.2.-~~~~~~Fluoroquinolones export ATP-binding protein Rv2688c~~~COG1131
MTALNRAVASARVGTEVIRVRGLTFRYPKAAEPAVRGMEFTVGRGEIFGLLGPSGAGKSTTQKLLIGLLRDHGGQATVWD
KEPAEWGPDYYERIGVSFELPNHYQKLTGYENLRFFASLYAGATADPMQLLAAVGLADDAHTLVGKYSKGMQMRLPFARS
LINDPELLFLDEPTSGLDPVNARKIKDIIVDLKARGRTIFLTTHDMATADELCDRVAFVVDGRIVALDSPTELKIARSRR
RVRVEYRGDGGGLETAEFGMDGLADDPAFHSVLRNHHVETIHSREASLDDVFVEVTGRQLT
>P9WJB3 ~~~~~~Fluoroquinolones export permease protein Rv2686c~~~
MRAISSLAGPRALAAFGRNDIRGTYRDPLLVMLVIAPVIWTTGVALLTPLFTEMLARRYGFDLVGYYPLILTAFLLLTSI
IVAGALAAFLVLDDVDAGTMTALRVTPVPLSVFFGYRAATVMVVTTIYVVATMSCSGILEPGLVSSLIPIGLVAGLSAVV
TLLLILAVANNKIQGLAMVRALGMLIAGLPCLPWFISSNWNLAFGVLPPYWAAKAFWVASDHGTWWPYLVGGAVYNLAIV
WVLFRRFRAKHA
>P9WJB1 ~~~~~~Fluoroquinolones export permease protein Rv2687c~~~COG1668
MTRLVPALRLELTLQVRQKFLHAAVFSGLIWLAVLLPMPVSLRPVAEPYVLVGDIAIIGFFFVGGTVFFEKQERTIGAIV
STPLRFWEYLAAKLTVLLAISLFVAVVVATIVHGLGYHLLPLVAGIVLGTLLMLLVGFSSSLPFASVTDWFLAAVIPLAI
MLAPPVVHYSGLWPNPVLYLIPTQGPLLLLGAAFDQVSLAPWQVGYAVVYPIVCAAGLCRAAKALFGRYVVQRSGVL
>Q9KIT1 ~~~flr~~~Flavoredoxin~~~
MKRSLGAKPLLFPTPVLVVGTYDDQGRPNAMTAAWGGICCSKPPCVTVSLRKATYTYASLMARKAYTLHVTDEPHLTASD
FLGMASGRDGDKLGTLGLTTVRSELVDAPIIQEYPLVLECKIVHVHDLGLHTMFVGEVQDIKADARVLDEKGHLLLDALK
PLGFMPEVRTYHGMGPALGRAFDAGKLHMPKKAE
>Q93LQ6 3.4.21.-~~~fls~~~Fervidolysin~~~
MRKVLLIASIVALILALFSCANPSFEPRSKAKDLASLPEIKSQGYHILFGELRDGEYTEGKILVGYNDRSEVDKIVKAVN
GKVVLELPQIKVVSIKLNGMTVKQAYDKIKALALKGIRYVEPSYKRELIKPTVVKPNPDMYKIRKPGLNSTARDYGEELS
NELWGLEAIGVTQQLWEEASGTNIIVAVVDTGVDGTHPDLEGQVIAGYRPAFDEELPAGTDSSYGGSHGTHVAGTIAAKK
DGKGIVGVAPGAKIMPIVIFDDPALVGGNGYVGDDYVAAGIIWATDHGAKVMNHSWGGWGYSYTMKEAFDYAMEHGVVMV
VSAGNNTSDSHHQYPAGYPGVIQVAALDYYGGTFRVAGFSSRSDGVSVGAPGVTILSTVPGEDSIGYEGHNENVPATNGG
TYDYYQGTSMAAPHVTGVVAVLLQKFPNAKPWQIRKLLENTAFDFNGNGWDHDTGYGLVKLDAALQGPLPTQGGVEEFQV
VVTDAKGNFGVPTVFVSMMRDNGSCYYAKTGPDGIARFPHIDSGTYDIFVGGPDHWDRALAPYDGESIPGGYAIALRMAE
ERQASFVGFGVSPDATQLNVNFNSTLQVKFSTNLSTLKDPQFVVVDPLLRGVYGRVAYARNQTYDLSLLSGQISFGIQTL
LPAATDITIQGTVTLNGEDIPVYGVLKAGTTWTIIDDFGGLNLGTDSQPIYVWWTIFGQ
>Q57CD7 ~~~fluC1~~~Fluoride-specific ion channel FluC 1~~~
MLDIIILVVIGGAFGAMTREFIMLMVPPLTDGFPLDILVANVVACFLLGTVTALYARKIHSRDVHTIIGTGMMGGVSTFS
SFAYGSVVLASASMSAFLIAAAYVTVSVVAGYVAVLAGMKFGEKSADILHRYPPMASIIDSGLVTVESRHSVAETIERVA
AKAKSMGMNVFTRVDHGAGAKEAGLGLPPTELIIFGNPQNGTVLMQDKRTIGLDLPIRALAWEDGSGKVWLTVNDPAWLA
QRHSLGLSSDVAIKAMVTGTGTVTKYAAGD
>Q5FKD2 ~~~fluC1~~~Fluoride-specific ion channel FluC 1~~~COG0239
MSTKTGNYISIAIFAFFGGIARAWLNSSFGFYGTFWGNIIGCFLLAFFTYLFMEFKDVRQWLTVGLGTGFVGAFTTFSTF
NLDVLKNIQANVPITALIYFLSSIIFGFCFAYLGMNAGKQVGNLLRKED
>P9WP63 ~~~fluC1~~~Fluoride-specific ion channel FluC 1~~~COG0239
MPNHDYRELAAVFAGGALGALARAALSALAIPDPARWPWPTFTVNVVGAFLVGYFTTRLLERLPLSSYRRPLLGTGLCGG
LTTFSTMQVETISMIEHGHWGLAAAYSVVSITLGLLAVHLATVLVRRVRIRR
>Q5FKD1 ~~~fluC2~~~Fluoride-specific ion channel FluC 2~~~COG0239
MNFLLAGIGASIGAMLRYAITNYGKKHWEWIGNKFSNLPTPTLFINLTGAFILGFIFGIKTNVFIYAIVGTGVLGGYTTF
STMNTELVELYKSKNYRGFIFYALSSYLGGLILVFVGYYLAILF
>Q7VYU0 ~~~fluC~~~Fluoride-specific ion channel FluC~~~COG0239
MLTYAPLNFIAIGIGATLGAWLRWVLGLRLNGAGWPWGTLTANLVGGYLIGVMVALIASHPEWPAWIRLAAVTGFLGGLT
TFSTFSAETVDMLERGVYATAAAYAGASLAGSLAMTGLGLATVRLLLR
>B7LI20 ~~~fluC~~~Fluoride-specific ion channel FluC~~~
MIKSLFAVIIGGSVGCTLRWLLSTRFNSLFPNLPPGTLVVNLLAGLIIGTALAYFLRQPHLDPFWKLMITTGLCGGLSTF
STFSVEVFALLQAGNYIWALTSVLVHVIGSLIMTALGFFIITILFA
>A0A0H2XLA2 ~~~fluC~~~Fluoride-specific ion channel FluC~~~
MIKSLFAVIIGGSVGCTLRWLLSTRFNSLFPNLPPGTLVVNLLAGLIIGTALAYFLRQPHLDPFWKLMITTGLCGGLSTF
STFSVEVFALLQAGNYIWALTSVLVHVIGSLIMTALGFFIITILFA
>P37002 ~~~fluC~~~Fluoride-specific ion channel FluC~~~COG0239
MLQLLLAVFIGGGTGSVARWLLSMRFNPLHQAIPLGTLTANLIGAFIIGIGFAWFSRMTNIDPVWKVLITTGFCGGLTTF
STFSAEVVFLLQEGRFGWALLNVFVNLLGSFAMTALAFWLFSASTAH
>F8JX40 1.2.1.69~~~~~~Fluoroacetaldehyde dehydrogenase~~~COG1012
MTVHQAPGTPGSVISLRPRYDNWIGGDWKAPAEGRYFANPTPVTGEEYTEIARSTAADIDLALDAAHAAAPAWGRTAPAE
RAAVLGRIADRIEQHLTELAVAEVWDNGKPIREALAADLPLAVDHFRYFAGVLRAQEGSISQLDEDTVAYHFHEPLGVVG
QIIPWNFPLLMAVWKLAPALAAGNAVVLKPAEQTPVSILVLMELIADILPPGVINVVNGFGIEAGKPLAINPRIAKVAFT
GETTTGRLIMQYASQNLIPVTLELGGKSPNLFFEDVAAARDDFYDKALEGFTMFALNQGEVCTCPSRALIAGGIYDGFLG
DALERTRAVKQGNPLDTETMIGAQASNDQLEKILSYIDIGTAEGAKVLTGGERVDLGGSLSGGYYVAPTIFEGDNRMRIF
QEEIFGPVVSVTRFDGYDDAISIANDTLYGLGAGVWTRDLSTAYRAGRAIQAGRVWTNCYHAYPAHAAFGGYKNSGIGRE
THKMMLDHYQQTKNLLISYSAKGPGLF
>Q47899 3.4.24.76~~~~~~Flavastacin~~~
MTRKLLILSGCLILALNSCKSDMETTPASSVDHTTTQLNGTTIHKLLINGAYTYVNEVNGEYFYADDITITAEQFNQLKR
MANPDISTVERSTIVSSFIKTWPNATVYYTLPSQGSLSTQAYNTFLTNINKAFDMISSKTSVKFVQRTNQTEYITFTYST
GNSSPLGWVKNRVNGIKIYNTTYPAIIAHEIMHSMGIMHEQCRPDRDQYIIVDTNRAQDGTRHNFNLYNDYAGHGEFDFG
SVMMYKSTDFAIDPNLPVMTKLDGSTFGKQRDGLSAGDYAGINHLYGPVNSTSATNGTYTLTTSLAGDKNIDITGSSTAD
GTDVILYSATTGNNQKFIFRKSEHGYFTIKSILDSTKVLTVRNNGTANGTAVELRTNADTDAQKWLLFNLGNEGFGFAPK
NAPSLRLEVKDGLTTNLTPIVIGSTDQTLQPYTKQRFTLTKVN
>P77609 ~~~flxA~~~Protein FlxA~~~
MSVTIQGNTSTVISNNSAPEGTSEIAKITRQIQVLTEKLGKISSEEGMTTQQKKEMAALVQKQIESLWAQLEQLLRQQAE
KKNEDATVQPDKKEEKKDDTNTAGTIDIYV
>P12266 ~~~~~~Fimbrial subunit type 1~~~
MKIKTLAIVVLSALSLSSAAALADTTTVNGGTVHFKGEVVNAACAVDAGSVDQTVQLGQVRTASLKQAGANSSAVVFNIQ
LNDCDTTVATKAAVAFLGTAIGPTHTDVLALQSSAAGSATNVGVQILDRTGAGLALDGATFSSETTLNNGTNTIPFQARY
FATGAATPGAANADATFKVQYQ
>P12903 ~~~fim~~~Fimbrial subunit type 1~~~
MKIKTLAMIVVSALALSSTAALADTTTVNGGTVHFKGEVVNAACAVDAGSIDQTVQLGQVRSAKLATAGSTSSAVGFNIQ
LDDCDTTVATKASVAFAGTAIDSSNTTVLALQNSAAGSATNVGVQILDNTGTPLALNGATFSAATTLNDDPNIIPFQARY
YATGAATAGIANADATFKVQYE
>P18774 ~~~pilA~~~Fimbrial protein~~~
MKAQKGFTLIELMIVVAIIGILAAIAIPQYQDYTARTQVTRAVSEVSALKTAAESAILEGKEIVSSATPKDTQYDIGFTE
STLLDGSGKSQIQVTDNQDGTVELVATLGKSSGSAIKGAVITVSRKNDGVWNCKITKTPTAWKPNYAPANCPKS
>P18477 ~~~~~~Fimbrial subunit type 1~~~
MHSLNTRRGLGLAAAMTLAAGALVAPTGAAAPADPNGSTIDPDAATTLTVHKCEQTDTNGVKEGTGNEDPQAECKPVSDV
EFTITKLNVDLTTYDGWKTLADLKGDVVKAGALKSTTVQKITTGANGLASFTDAQTEVGAYLVSETRTPDKVIPAEDFVV
TLPMTNPQDTAKWNYNVHVYPKNTLSGVDKQVTDKPAPGSGRDITYTITTSIPKVDYPGGARIKRYEVVDRLDKRIKKEA
LTPVVKIVGQNEVTLAETTDYTLITAEGKDHNWATIQLTEEGRRKASEARYNGNGETKLQVTLNAKFDAAVNLEGDLSNT
AGLIPNDSPNFTWDPNNPGTTTDIPGIPTTPVLSKYGKVVLTKTGTDDLADKTKYNGAQFQVYECTKTASGATLRDSDPS
TQTVDPLTIGGEKTFTTAGQGTVEINYLRANDYVNGAKKDQLTDEDYYCLVETKAPEGYNLQADPLPFRVLAEKAEKKAA
TEVTVTDIPKNAGFRLPLTGANGVIFLTIAGALLVAGGAVVAYANKRRHVAKH
>P17823 ~~~fimA~~~Type IV major fimbrial protein FimA~~~
MKSLQKGFTLIELMIVVAIIGILAAFAIPAYNDYIARTQVSEGVSLADGLKIRIADNLQDGDCTTKGDASTGEVGNEDKG
KYALATIEGTPAANLSELKAEEKNGCLVKIEYGKGTSGGSVSALINNTELVLAQLANGSYVKESATVKDKFLPKALKETK
>P02975 ~~~fimA~~~Type IV major fimbrial protein FimA~~~
MKSLQKGFTLIELMIVVAIIGILAAFAIPAYNDYIARSQAAEGLTLADGLKVRISDHLESGECKGDANPASGSLGNDDKG
KYALATIDGDYNKDAKTADEKNGCKVVITYGQGTAGEKISKLIVGKKLVLDQFVNGSYKYNEGETDLELKFIPNAVKN
>P04953 ~~~fimA~~~Type IV major fimbrial protein FimA~~~
MKSLQKGFTLIELMIVVAIIGILAAIAIPQYQNYIARSQVSRVMSETGQMRTAIETCLLDGKEGKDCFIGWTTSNLLAAA
GGSTTNNATAADPGQGGLNITYALESTAENKIEATFGQNAAATLHGKKLTWTRSPEATWSCSTDVDEKFKPTGCKK
>P13421 ~~~smfA~~~Fimbria A protein~~~
MKLNKIMLATVLAFGVSSLANAADQGHGKVTFTGSIIDAPCSIAPESADQTVEMGQISNVALKNGGKSAPRQFDIKLEQC
DTSTLKTVTTTFDGKASAANPDLLGIIGTASGASIAITDMASNPIKLGTATAPQTLNDGNNTLRFAAYLQGDGASATVVP
GDFTAVADFKLAYQ
>E3PPC4 ~~~cfaB~~~CFA/I fimbrial subunit B~~~
MKFKKTIGAMALTTMFVAVSASAVEKNITVTASVDPVIDLLQADGNALPSAVKLAYSPASKTFESYRVMTQVHTNDATKK
VIVKLADTPQLTDVLNSTVQMPISVSWGGQVLSTTAKEFEAAALGYSASGVNGVSSSQELVISAAPKTAGTAPTAGNYSG
VVSLVMTLGS
>P0CK93 ~~~cfaB~~~CFA/I fimbrial subunit B~~~
MKFKKTIGAMALTTMFVAVSASAVEKNITVTASVDPAIDLLQADGNALPSAVKLAYSPASKTFESYRVMTQVHTNDATKK
VIVKLADTPQLTDVLNSTVQMPISVSWGGQVLSTTAKEFEAAALGYSASGVNGVSSSQELVISAAPKTAGTAPTAGNYSG
VVSLVMTLGS
>Q50228 3.5.1.49~~~fmdA~~~Formamidase~~~
MKTIVKLDLDKKPWEQDGQIHNRWHPDLPMIAMVKPGDEFRVECMDWTGGQIGNNDSANDVRDVDLTQVHYLSGPIGVEG
AEPGDLMVVDILDVGTFDDSQWGFNGLFAKENGGGFLTDHFPEASKTIWDFHGVYTTSRHVPKVRYAGIMHPGLIGCLPS
KELLDTWNKREGDLIATDPDRVPPLACPPTSQSAVMGRLSGDAAKKAAAEGARTVPPRDHGGNCDIKNLTKGSRVYFPVY
VKDGGLSMGDLHFSQGDGEITFCGAIEMAGYLDIKVGLIKDGVKKYGIKNPVFQPSPITPTYRDYMIFEGISVDEAGKQH
YLDVHIAYRQACLNAIEYLKKFGYSGEQAVSILGTAPVEGHISGIVDIPNACATLWIPTEIFEFDIRPNADGPKIMVPPG
VDVSFTS
>P19369 ~~~aerA~~~Flexible pilin~~~
MPNFFRNGCIALVGSVAAMGAAHAEGGIAEAAGKALDSAQSDVTITAPKVMMVVATVVGVGILINMMRKA
>P20657 ~~~tfpI~~~Type IV major alpha-pilin~~~
MNAQKGFTLIELMIVIAIIGILAAIALPAYQDYISKSQTTRVSGELAAGKTAVDAALFEGKTPVLSEESSTSKENIGLTS
SETSTKPRSNLMASVELTGFADNGAGTISATLGNKANKDIAKTVITQERTTDGVWTCKIDGSQAAKYKEKFNPTGCVKK
>P02974 ~~~pilE1~~~Type IV major pilin protein PilE1~~~
MNTLQKGFTLIELMIVIAIVGILAAVALPAYQDYTARAQVSEAILLAEGQKSAVTEYYLNHGKWPENNTSAGVASPPSDI
KGKYVKEVEVKNGVVTATMLSSGVNNEIKGKKLSLWARRENGSVKWFCGQPVTRTDDDTVADAKDGKEIDTKHLPSTCRD
NFDAK
>P57039 ~~~pilE~~~Fimbrial protein~~~
MNTLQKGFTLIELMIVIAIVGILAAVALPAYQDYTARAQVSEAILLAEGQKSAVTEYYLNHGEWPSNNTSAGVASSTDIK
GKYVQSVEVKNGVVTATMASSNVNNEIKGKKLSLWAKRQDGSVKWFCGQPVKRNDTATTNDDVKADTAANGKQIDTKHLP
STCRDAASAG
>P05431 ~~~pilE~~~Fimbrial protein~~~
MNTLQKGFTLIELMIVIAIVGILAAVALPAYQDYTARAQVSEAILLAEGQKSAVTEYYLNHGEWPGNNTSAGVATSSEIK
GKYVKSVEVKNGVVTAQMASSNVNNEIKGKKLSLWAKRQNGSVKWFCGQPVTRDKAKAANDDVTAAAAANGKKIDTKHLP
STCRDASDAS
>P09829 ~~~tfpA~~~Fimbrial protein~~~
MNAQKGFTLIELMIVIAIIGILAAIALPAYQDYIARAQVSEAFTLADGLKTSISTNRQNGRCFADGKDTAADGVDIITGK
YGKATILEENPNTADGLICGIYYEFNTTGVSDKLIGKTIALKADEKAGKLVLETVNSKTTNVENKYLPSAFKKP
>Q46604 ~~~~~~FMN-binding protein~~~COG3576
MLPGTFFEVLKNEGVVAIATQGEDGPHLVNTWNSYLKVLDGNRIVVPVGGMHKTEANVARDERVLMTLGSRKVAGRNGPG
TGFLIRGSAAFRTDGPEFEAIARFKWARAALVITVVSAEQTL
>P50726 ~~~fmnP~~~Riboflavin transporter FmnP~~~COG3601
MKVKKLVVVSMLSSIAFVLMLLNFPFPGLPDYLKIDFSDVPAIIAILIYGPLAGIAVEAIKNVLQYIIQGSMAGVPVGQV
ANFIAGTLFILPTAFLFKKLNSAKGLAVSLLLGTAAMTILMSILNYVLILPAYTWFLHSPALSDSALKTAVVAGILPFNM
IKGIVITVVFSLIFIKLKPWIEQQRSAHIH
>Q9I4D4 1.-.-.-~~~~~~NAD(P)H-dependent FMN reductase PA1204~~~
MSDDIKVLGISGSLRSGSYNSAALQEAIGLVPPGMSIELADISGIPLYNEDVYALGFPPAVERFREQIRAADALLFATPE
YNYSMAGVLKNAIDWASRPPEQPFSGKPAAILGASAGRFGTARAQYHLRQTLVFLDVHPLNKPEVMISSAQNAFDAQGRL
LDDKARELIQQQLQALQLWVRRLRG
>Q99R54 1.-.-.-~~~~~~Putative flavoprotein monooxygenase~~~
MQHHKVAIIGAGAAGIGMAITLKDFGITDVIILEKGTVGHSFKHWPKSTRTITPSFTSNGFGMPDMNAISMDTSPAFTFN
EEHISGETYAEYLQVVANHYELNIFENTVVTNISADDAYYTIATTTETYHADYIFVATGDYNFPKKPFKYGIHYSEIEDF
DNFNKGQYVVIGGNESGFDAAYQLAKNGSDIALYTSTTGLNDPDADPSVRLSPYTRQRLGNVIKQGARIEMNVHYTVKDI
DFNNGQYHISFDSGQSVHTPHEPILATGFDATKNPIVQQLFVTTNQDIKLTTHDESTRYPNIFMIGATVENDNAKLCYIY
KFRARFAVLAHLLTQREGLPAKQEVIENYQKNQMYLDDYSCCEVSCTC
>P17838 ~~~pilA~~~Fimbrial protein~~~
MKAAQKGFTLIELMIVVAIIGILAAIAIPAYQDYTARAQLSERMTLASGLKTKVSDIFSQDGSCPANTAATAGIEKDTDI
NGKYVAKVTTGGTAAASGGCTIVATMKASDVATPLRGKTLTLTLGNADKGSYTWACTSNADNKYLPKTCQTATTTTP
>P02973 ~~~pilA~~~Fimbrial protein~~~
MKAQKGFTLIELMIVVAIIGILAAIAIPQYQNYVARSEGASALASVNPLKTTVEEALSRGWSVKSGTGTEDATKKEVPLG
VAADANKLGTIALKPDPADGTADITLTFTMGGAGPKNKGKIITLTRTAADGLWKCTSDQDEQFIPKGCSR
>P07640 ~~~tfpQ~~~Fimbrial protein Q~~~
MNAQKGFTLIELMIVIAIIGILAAIALPAYQDYISKSQTTRVVGELAAGKTAVDAALFEGKTPKLGKAANDTEEDIGLTT
TGGTARSNLMSSVNIGGGAFATGAGTLEATLGNRANKDIAGAVITQSRDAEGVWTCTINGSAAPGWKSKFVPTGCKE
>Q08325 2.1.1.179~~~fmrO~~~16S rRNA (guanine(1405)-N(7))-methyltransferase~~~
MLAAAKYRNLDPAFVERLAQEAAERFRDRGQAVKYAKRKLHQAFGAFVAGTPAQAVAACVAKIAAGAEPKEAGREAMRAH
ASSAERVDWLEPFYERVAQWCGPASSVIDLACGLNPLAVPWMALAPGATYACYDVDRTMAEALRALGTVYPVRVNAAAVD
LVAAVPAAGVDVALVLKTLTTVEQQRGGRRVAEYRRELTAVQHHSDGARSLSGRRGYADDPDAIVQRAVHGTGYEVVDEA
AFGTEALYHLVPLAGTAGRPAPAEGAAEPGATRPVVDVPATARPDADRVDPTG
>P15488 ~~~~~~CS3 fimbrial subunit A~~~
MLKIKYLLIGLSLSAMSSYSLAAAGPTLTKELALNVLSPAALDATWAPQDNLTLSNTGVSNTLVGVLTLSNTSIDTVSIA
STNVSDTSKNGTVTFAHETNNSASFATTISTDNANITLDKNAGNTIVKTTNGSQLPTNLPLKFITTEGNEHLVSGNYRAN
ITITSTIK
>P33781 ~~~~~~CS5 fimbrial subunit~~~
MKKNLLITSVLAMATVSGSVLAAVTNGQLTFNWQGVVPSAPVTQSSWAFVNGLDIPFTPGTEQLNITLDSNKDITARSVK
PYDFFIVPVSGNVTPGAPVTRDTSANINSVNAFLSSVPVSNGFVGNKQLTLSTAVEAAKGEVAITLNGQALKVGSASPTV
VTVASNKKESHISIDMNAKAAAADVAEGAAINFVAPVTFAVDI
>Q2FZK3 3.1.1.103~~~fmtA~~~Teichoic acid D-alanine hydrolase~~~COG1680
MKFNKVKLVIHACVLLFIIISIALIFHRLQTKTHSIDPIHKETKLSDNEKYLVDRNKEKVAPSKLKEVYNSKDPKYKKID
KYLQSSLFNGSVAIYENGKLKMSKGYGYQDFEKGIKNTPNTMFLIGSAQKFSTGLLLKQLEEEHKININDPVSKYLPWFK
TSKPIPLKDLMLHQSGLYKYKSSKDYKNLDQAVKAIQKRGIDPKKYKKHMYNDGNYLVLAKVIEEVTGKSYAENYYTKIG
DPLKLQHTAFYDEQPFKKYLAKGYAYNSTGLSFLRPNILDQYYGAGNLYMTPTDMGKLITQIQQYKLFSPKITNPLLHEF
GTKKYPDEYRYGFYAKPTLNRLNGGFFGQVFTVYYNDKYVVVLALNVKGNNEVRIKHIYNDILKQNKPYNTKGVIVQ
>Q5HH27 3.1.1.103~~~fmtA~~~Teichoic acid D-alanine hydrolase~~~
MKFNKVKLVIHACVLLFIIISIALIFHRLQTKTHSIDPIHKETKLSDNEKYLVDRNKEKVAPSKLKEVYNSKDPKYKKID
KYLQSSLFNGSVAIYENGKLKMSKGYGYQDFEKGIKNTPNTMFLIGSAQKFSTGLLLKQLEEEHKININDPVSKYLPWFK
TSKPIPLKDLMLHQSGLYKYKSSKDYKNLDQAVKAIQKRGIDPKKYKKHMYNDGNYLVLAKVIEEVTGKSYAENYYTKIG
DPLKLQHTAFYDEQPFKKYLAKGYAYNSTGLSFLRPNILDQYYGAGNLYMTPTDMGKLITQIQQYKLFSPKITNPLLHEF
GTKKYPDEYRYGFYAKPTLNRLNGGFFGQVFTVYYNDKYVVVLALNVKGNNEVRIKHIYNDILKQNKPYNTKGVIVQ
>Q7A2T0 3.1.1.103~~~fmtA~~~Teichoic acid D-alanine hydrolase~~~
MKFNKVKLVIHACVLLFIIISIALIFHRLQTKTHSIDPIHKETKLSDNEKYLVDRNKEKVAPSKLKEVYNSKDPKYKKID
KYLQSSLFNGSVAIYENGKLKMSKGYGYQDFEKGIKNTPNTMFLIGSAQKFSTGLLLKQLEEEHKININDPVSKYLPWFK
TSKPIPLKDLMLHQSGLYKYKSSKDYKNLDQAVKAIQKRGIDPKKYKKHMYNDGNYLVLAKVIEEVTGKSYAENYYTKIG
DPLKLQHTAFYDEQPFKKYLAKGYAYNSTGLSFLRPNILDQYYGAGNLYMTPTDMGKLITQIQQYKLFSPKITNPLLHEF
GTKQYPDEYRYGFYAKPTLNRLNGGFFGQVFTVYYNDKYVVVLALNVKGNNEVRIKHIYNDILKQNKPYNTKGVIVQ
>Q7A6A2 3.1.1.103~~~fmtA~~~Teichoic acid D-alanine hydrolase~~~
MKFNKVKLVIHACVLLFIIISIALIFHRLQTKTHSIDPIHKETKLSDNEKYLVDRNKEKVAPSKLKEVYNSKDPKYKKID
KYLQSSLFNGSVAIYENGKLKMSKGYGYQDFEKGIKNTPNTMFLIGSAQKFSTGLLLKQLEEEHKININDPVSKYLPWFK
TSKPIPLKDLMLHQSGLYKYKSSKDYKNLDQAVKAIQKRGIDPKKYKKHMYNDGNYLVLAKVIEEVTGKSYAENYYTKIG
DPLKLQHTAFYDEQPFKKYLAKGYAYNSTGLSFLRPNILDQYYGAGNLYMTPTDMGKLITQIQQYKLFSPKITNPLLHEF
GTKQYPDEYRYGFYAKPTLNRLNGGFFGQVFTVYYNDKYVVVLALNVKGNNEVRIKHIYNDILKQNKPYNTKGVIVQ
>O50608 3.1.1.103~~~fmtA~~~Teichoic acid D-alanine hydrolase~~~
MKFNKVKLVIHACVLLFIIISIALIFHRLQTKTHSIDPIHKETKLSDNEKYLVDRNKEKVAPSKLKEVYNSKDPKYKKID
KYLQSSLFNGSVAIYENGKLKMSKGYGYQDFEKGIKNTPNTMFLIGSAQKFSTGLLLKQLEEEHKININDPVSKYLPWFK
TSKPIPLKDLMLHQSGLYKYKSSKDYKNLDQAVKAIQKRGIDPKKYKKHMYNDGNYLVLAKVIEEVTGKSYAENYYTKIG
DPLKLQHTAFYDEQPFKKYLAKGYAYNSTGLSFLRPNILDQYYGAGNLYMTPTDMGKLITQIQQYKLFSPKITNPLLHEF
GTKQYPDEYRYGFYAKPTLNRLNGGFFGQVFTVYYNDKYVVVLALNVKGNNEVRIKHIYNDILKQNKPYNTKGVIVQ
>Q81WH2 2.1.2.9~~~fmt~~~Methionyl-tRNA formyltransferase~~~COG0223
MIKVVFMGTPDFSVPVLRRLIEDGYDVIGVVTQPDRPVGRKKVLTPTPVKVEAEKHGIPVLQPLRIREKDEYEKVLALEP
DLIVTAAFGQIVPNEILEAPKYGCINVHASLLPELRGGAPIHYAIMEGKEKTGITIMYMVEKLDAGDILTQVEVEIEERE
TTGSLFDKLSEAGAHLLSKTVPLLIQGKLEPIKQNEEEVTFAYNIKREQEKIDWTKTGEEVYNHIRGLNPWPVAYTTLAG
QVVKVWWGEKVPVTKSAEAGTIVAIEEDGFVVATGNETGVKITELQPSGKKRMSCSQFLRGTKPEIGTKLGENA
>Q83AA8 2.1.2.9~~~fmt~~~Methionyl-tRNA formyltransferase~~~COG0223
MSLKIVFAGTPQFAVPTLRALIDSSHRVLAVYTQPDRPSGRGQKIMESPVKEIARQNEIPIIQPFSLRDEVEQEKLIAMN
ADVMVVVAYGLILPKKALNAFRLGCVNVHASLLPRWRGAAPIQRAILAGDRETGISIMQMNEGLDTGDVLAKSACVISSE
DTAADLHDRLSLIGADLLLESLAKLEKGDIKLEKQDEASATYASKIQKQEALIDWRKSAVEIARQVRAFNPTPIAFTYFE
GQPMRIWRATVVDEKTDFEPGVLVDADKKGISIAAGSGILRLHQLQLPGKRVCSAGDFINAHGDKLIPGKTVFG
>P23882 2.1.2.9~~~fmt~~~Methionyl-tRNA formyltransferase~~~COG0223
MSESLRIIFAGTPDFAARHLDALLSSGHNVVGVFTQPDRPAGRGKKLMPSPVKVLAEEKGLPVFQPVSLRPQENQQLVAE
LQADVMVVVAYGLILPKAVLEMPRLGCINVHGSLLPRWRGAAPIQRSLWAGDAETGVTIMQMDVGLDTGDMLYKLSCPIT
AEDTSGTLYDKLAELGPQGLITTLKQLADGTAKPEVQDETLVTYAEKLSKEEARIDWSLSAAQLERCIRAFNPWPMSWLE
IEGQPVKVWKASVIDTATNAAPGTILEANKQGIQVATGDGILNLLSLQPAGKKAMSAQDLLNSRREWFVPGNRLV
>A0QWU2 2.1.2.9~~~fmt~~~Methionyl-tRNA formyltransferase~~~COG0223
MRLVFAGTPEPALPSLRRLIESPRHDVVAVLTRPDAAAGRRGKPRPSPVAQLALEHGIPLLRPDRPNSDEFVAELTELAP
DCCAVVAYGALLSQRLLAVPRHGWINLHFSLLPAWRGAAPVQAAIAAGDTVTGATTFQIEPALDSGPVYGVVTETVRDTD
TAGDLLERLSDSGAELLERTIDGIADGSLTAVPQPSEGITVAPKITVESARVRWDLPAHVVDRRIRAVTPNPGAWTMIGE
LRVKVGPVTVDQAAEADGPLAPGEIRVGRNSVHVGTGSHPVRLGQIQPPGKKLMNAADWARGARLEEPVSAS
>P9WND3 2.1.2.9~~~fmt~~~Methionyl-tRNA formyltransferase~~~COG0223
MRLVFAGTPEPALASLRRLIESPSHDVIAVLTRPDAASGRRGKPQPSPVAREAAERGIPVLRPSRPNSAEFVAELSDLAP
ECCAVVAYGALLGGPLLAVPPHGWVNLHFSLLPAWRGAAPVQAAIAAGDTITGATTFQIEPSLDSGPIYGVVTEVIQPTD
TAGDLLKRLAVSGAALLSTTLDGIADQRLTPRPQPADGVSVAPKITVANARVRWDLPAAVVERRIRAVTPNPGAWTLIGD
LRVKLGPVHLDAAHRPSKPLPPGGIHVERTSVWIGTGSEPVRLGQIQPPGKKLMNAADWARGARLDLAARAT
>O85732 2.1.2.9~~~fmt~~~Methionyl-tRNA formyltransferase~~~
MSQALRIVFAGTPEFAAEHLKALLDTPHRIVAVYTQPDRPAGRGQKLMPSAVKSLALEHGLPVMQPQSLRNAEAQAELAA
LRADLMVVVAYGLILPQAVLDIPRLGCINSHASLLPRWRGAAPIQRAVEAGDAESGVTVMQMEAGLDTGPMLLKVSTPIS
AADTGGSLHDRLAALGPKAVIEAIAGLAAGTLHGEIQDDALATYAHKLNKDEARLDWSRPAVELERQVRAFTPWPVCHTS
LADAPLKVLGASLGQGSGAPGTILEASRDGLLVACGEGALRLTRLQLPGGKPLAFADLYNSRREQFAAGQVLGQ
>P99127 2.1.2.9~~~fmt~~~Methionyl-tRNA formyltransferase~~~
MTKIIFMGTPDFSTTVLEMLIAEHDVIAVVTQPDRPVGRKRVMTPPPVKKVAMKYDLPVYQPEKLSGSEELEQLLQLDVD
LIVTAAFGQLLPESLLALPKLGAINVHASLLPKYRGGAPIHQAIIDGEQETGITIMYMVKKLDAGNIISQQAIKIEENDN
VGTMHDKLSVLGADLLKETLPSIIEGTNESVPQDDTQATFASNIRREDERISWNKPGRQVFNQIRGLSPWPVAYTTMDDT
NLKIYDAELVETNKINEPGTIIETTKKAIIVATNDNEAVAIKDMQLAGKKRMLAANYLSGAQNTLVGKKLI
>Q9KVU4 2.1.2.9~~~fmt~~~Methionyl-tRNA formyltransferase~~~COG0223
MSQSLRIVFAGTPDFAARHLAALLSSEHEIIAVYTQPERPAGRGKKLTASPVKTLALEHNVPVYQPENFKSDESKQQLAA
LNADLMVVVAYGLLLPKVVLDTPKLGCINVHGSILPRWRGAAPIQRSIWAGDSETGVTIMQMDVGLDTGDMLKIATLPIE
ASDTSASMYDKLAELGPQALLECLQDIAQGTAVAVKQDDGLANYAHKLSKEEARINWSDAATHIERCIRAFNPWPMSHFE
VAENSIKVWQARVETRAVTQTPGTIIQADKSGIYVATGQDVLVLESLQIPGKKALPVQDILNARADWFSVGSQLS
>Q8ZJ80 2.1.2.9~~~fmt~~~Methionyl-tRNA formyltransferase~~~COG0223
MSDSLRIIFAGTPDFAARHLGALLSSQHKIVGVFTQPDRPAGRGNKLTPSPVKILAEHHGIPVFQPKSLRPEENQHLVAD
LNADIMVVVAYGLILPAAVLAMPRLGCINVHGSLLPRWRGAAPIQRSVWAGDEKTGITIMQMDIGLDTGAMLHKIECAIQ
PEDTSATLYDKLAQLGPQGLLITLQQLAAGTALAEVQNETQATYAEKLSKEEAKLDWTLSATQLERCIRAFNPWPVSYFI
VDEQPIKVWQAQVLPAGEDAEPGTIIHADKHGIQVATADGVLNITQLQPAGKKAMSAADLLNSRREWFIPGSQLV
>P12061 ~~~sefA~~~Fimbrial protein~~~
MRKSASAVAVLALIACGSAHAAGFVGNKAVVQAAVTIAAQNTTSANWSQDPGFTGPAVAAGQKVGTLSITATGPHNSVSI
AGKGASVSGGVATVPFVDGQGQPVFRGRIQGANINDQANTGIDGLAGWRVASSQETLNVPVTTFGKSTLPAGTFTATFYV
QQYQN
>P14738 ~~~fnbA~~~Fibronectin-binding protein A~~~COG4932
MKNNLRYGIRKHKLGAASVFLGTMIVVGMGQDKEAAASEQKTTTVEENGNSATDNKTSETQTTATNVNHIEETQSYNATV
TEQPSNATQVTTEEAPKAVQAPQTAQPANIETVKEEVVKEEAKPQVKETTQSQDNSGDQRQVDLTPKKATQNQVAETQVE
VAQPRTASESKPRVTRSADVAEAKEASNAKVETGTDVTSKVTVEIGSIEGHNNTNKVEPHAGQRAVLKYKLKFENGLHQG
DYFDFTLSNNVNTHGVSTARKVPEIKNGSVVMATGEVLEGGKIRYTFTNDIEDKVDVTAELEINLFIDPKTVQTNGNQTI
TSTLNEEQTSKELDVKYKDGIGNYYANLNGSIETFNKANNRFSHVAFIKPNNGKTTSVTVTGTLMKGSNQNGNQPKVRIF
EYLGNNEDIAKSVYANTTDTSKFKEVTSNMSGNLNLQNNGSYSLNIENLDKTYVVHYDGEYLNGTDEVDFRTQMVGHPEQ
LYKYYYDRGYTLTWDNGLVLYSNKANGNEKNGPIIQNNKFEYKEDTIKETLTGQYDKNLVTTVEEEYDSSTLDIDYHTAI
DGGGGYVDGYIETIEETDSSAIDIDYHTAVDSEAGHVGGYTESSEESNPIDFEESTHENSKHHADVVEYEEDTNPGGGQV
TTESNLVEFDEESTKGIVTGAVSDHTTVEDTKEYTTESNLIELVDELPEEHGQAQGPVEEITKNNHHISHSGLGTENGHG
NYDVIEEIEENSHVDIKSELGYEGGQNSGNQSFEEDTEEDKPKYEQGGNIVDIDFDSVPQIHGQNKGNQSFEEDTEKDKP
KYEHGGNIIDIDFDSVPHIHGFNKHTEIIEEDTNKDKPSYQFGGHNSVDFEEDTLPKVSGQNEGQQTIEEDTTPPIVPPT
PPTPEVPSEPETPTPPTPEVPSEPETPTPPTPEVPSEPETPTPPTPEVPAEPGKPVPPAKEEPKKPSKPVEQGKVVTPVI
EINEKVKAVAPTKKPQSKKSELPETGGEESTNKGMLFGGLFSILGLALLRRNKKNHKA
>A0A0H2XKG3 ~~~fnbB~~~Fibronectin-binding protein B~~~
MKSNLRYGIRKHKLGAASVFLGTMIVVGMGQEKEAAASEQNNTTVEESGSSATESKASETQTTTNNVNTIDETQSYSATS
TEQPSQSTQVTTEEAPKTVQAPKVETSRVDLPSEKVADKETTGTQVDIAQPSNVSEIKPRMKRSTDVTAVAEKEVVEETK
ATGTDVTNKVEVEEGSEIVGHKQDTNVVNPHNAERVTLKYKWKFGEGIKAGDYFDFTLSDNVETHGISTLRKVPEIKSTD
GQVMATGEIIGERKVRYTFKEYVQEKKDLTAELSLNLFIDPTTVTQKGNQNVEVKLGETTVSKIFNIQYLGGVRDNWGVT
ANGRIDTLNKVDGKFSHFAYMKPNNQSLSSVTVTGQVTKGNKPGVNNPTVKVYKHIGSDDLAESVYAKLDDVSKFEDVTD
NMSLDFDTNGGYSLNFNNLDQSKNYVIKYEGYYDSNASNLEFQTHLFGYYNYYYTSNLTWKNGVAFYSNNAQGDGKDKLK
EPIIEHSTPIELEFKSEPPVEKHELTGTIEESNDSKPIDFEYHTAVEGAEGHAEGTIETEEDSIHVDFEESTHENSKHHA
DVVEYEEDTNPGGGQVTTESNLVEFDEDSTKGIVTGAVSDHTTIEDTKEYTTESNLIELVDELPEEHGQAQGPIEEITEN
NHHISHSGLGTENGHGNYGVIEEIEENSHVDIKSELGYEGGQNSGNQSFEEDTEEDKPKYEQGGNIVDIDFDSVPQIHGQ
NNGNQSFEEDTEKDKPKYEQGGNIIDIDFDSVPHIHGFNKHTEIIEEDTNKDKPNYQFGGHNSVDFEEDTLPQVSGHNEG
QQTIEEDTTPPIVPPTPPTPEVPSEPETPTPPTPEVPSEPETPTPPTPEVPTEPGKPIPPAKEEPKKPSKPVEQGKVVTP
VIEINEKVKAVVPTKKAQSKKSELPETGGEESTNNGMLFGGLFSILGLALLRRNKKNHKA
>A2AXG5 2.5.1.123~~~~~~Flaviolin linalyltransferase~~~
MMSGTADLAGVYAAVEESAGLLDVSCAREKVWPILAAFEDVLPTAVIAFRVATNARHEGEFDCRFTVPGSIDPYAVALDK
GLTHRSGHPIETLVADVQKHCAVDSYGVDFGVVGGFKKIWVYFPGGRHESLAHLGEIPSMPPGLAATEGFFARYGLADKV
DLIGVDYASKTMNVYFAASPEVVSAPTVLAMHREIGLPDPSEQMLDFCSRAFGVYTTLNWDSSKVERIAYSVKTEDPLEL
SARLGSKVEQFLKSVPYGIDTPKMVYAAVTAGGEEYYKLQSYYQWRTDSRLNLSYIGGRS
>Q9EXQ1 ~~~fnr~~~Anaerobic regulatory protein~~~COG0664
MKILATETKLGRRVQSGGCAIHCQNCSISQLCIPFTLNQHELDQLDNIIERKKPIQKSQVLFKAGDELTSLYAIRSGTIK
SYTISETGEEQITSFHLPGDLVGFDAIMNMQHPSFAQALETAMVCEIPFDILDDLSGKMPKLRQQIMRLMSNEIKSDQEM
ILLLSKMNAEERLAAFIYNLSQRYSARGFSAREFRLTMTRGDIGNYLGLTVETISRLLGRFQKLGVLSVQGKYITINNMA
ELIELSGTNKNKIQLII
>P46908 ~~~fnr~~~Anaerobic regulatory protein~~~COG0664
MNFLSVRPSDSDLISSDLYELLESISTKRKMEKHTYLFREGMDAEELYLIQSGLIEIGKLTSDGKDLTLRICQKHDIVGE
LTLFTEEPRYMLSAKVLEDGEVLVINKNKLEKELIQNGALTFEFMKWMSTHLRKIQSKIRDLLLHGKKGALYSTLIRLSN
SYGVERSDGILINIVLTNQDLAKFCAAARESVNRMLGDLRKKGVISIDESGKIILHKRDYLRCEIECENCPLEICNID
>P0A9E5 ~~~fnr~~~Fumarate and nitrate reduction regulatory protein~~~COG0664
MIPEKRIIRRIQSGGCAIHCQDCSISQLCIPFTLNEHELDQLDNIIERKKPIQKGQTLFKAGDELKSLYAIRSGTIKSYT
ITEQGDEQITGFHLAGDLVGFDAIGSGHHPSFAQALETSMVCEIPFETLDDLSGKMPNLRQQMMRLMSGEIKGDQDMILL
LSKKNAEERLAAFIYNLSRRFAQRGFSPREFRLTMTRGDIGNYLGLTVETISRLLGRFQKSGMLAVKGKYITIENNDALA
QLAGHTRNVA
>P0AC25 ~~~focA~~~Formate channel FocA~~~COG2116
MKADNPFDLLLPAAMAKVAEEAGVYKATKHPLKTFYLAITAGVFISIAFVFYITATTGTGTMPFGMAKLVGGICFSLGLI
LCVVCGADLFTSTVLIVVAKASGRITWGQLAKNWLNVYFGNLVGALLFVLLMWLSGEYMTANGQWGLNVLQTADHKVHHT
FIEAVCLGILANLMVCLAVWMSYSGRSLMDKAFIMVLPVAMFVASGFEHSIANMFMIPMGIVIRDFASPEFWTAVGSAPE
NFSHLTVMNFITDNLIPVTIGNIIGGGLLVGLTYWVIYLRENDHH
>P0AC23 ~~~focA~~~Formate channel FocA~~~COG2116
MKADNPFDLLLPAAMAKVAEEAGVYKATKHPLKTFYLAITAGVFISIAFVFYITATTGTGTMPFGMAKLVGGICFSLGLI
LCVVCGADLFTSTVLIVVAKASGRITWGQLAKNWLNVYFGNLVGALLFVLLMWLSGEYMTANGQWGLNVLQTADHKVHHT
FIEAVCLGILANLMVCLAVWMSYSGRSLMDKAFIMVLPVAMFVASGFEHSIANMFMIPMGIVIRDFASPEFWTAVGSAPE
NFSHLTVMNFITDNLIPVTIGNIIGGGLLVGLTYWVIYLRENDHH
>P77733 ~~~focB~~~Formate channel FocB~~~COG2116
MRNKLSFDLQLSARKAAIAERIAAHKIARSKVSVFLMAMSAGVFMAIGFTFYLSVIADAPSSQALTHLVGGLCFTLGFIL
LAVCGTSLFTSSVMTVMAKSRGVISWRTWLINALLVACGNLAGIACFSLLIWFSGLVMSENAMWGVAVLHCAEGKMHHTF
TESVSLGIMCNLMVCLALWMSYCGRSLCDKIVAMILPITLFVASGFEHCIANLFVIPFAIAIRHFAPPPFWQLAHSSADN
FPALTVSHFITANLLPVMLGNIIGGAVLVSMCYRAIYLRQEP
>P62609 ~~~focC~~~Chaperone protein FocC~~~
MRIWAVLASFLVFFYIPQSYAGVALGATRVIYPEGQKQVQLAVTNNDDKSSYLIQSWIENAEGKKDARFVITPPLFSMQG
KKENTLRIIDATNGQMPEDRESLFWVNVKAIPAMDKAKTGENYLQFAIVSRIKLLYRPQGLVIPPEQAPGKLEFTRENGG
LTLFNPTPYYLTVTDLKAGNKSLENTMVPPQGKVTVNIPGGYTGGDITYKTINDYGALTEQVKGVVK
>P0AC16 4.1.2.25~~~folB~~~Dihydroneopterin aldolase~~~COG1539
MDIVFIEQLSVITTIGVYDWEQTIEQKLVFDIEMAWDNRKAAKSDDVADCLSYADIAETVVSHVEGARFALVERVAEEVA
ELLLARFNSPWVRIKLSKPGAVARAANVGVIIERGNNLKENN
>P46362 4.1.2.25~~~folB~~~Dihydroneopterin aldolase~~~COG1539
MDRVFIEELTVFAQIGVYDWEQQIKQKLVFDLEMAWDCKQAAETDDVVYCLNYAEVSQAIIDYVESKPFLLIERVAYEVA
DLLESRYQLQGLKIKLSKPKAVAQARNVGVLIVRGCLK
>P9WNC5 4.1.2.25~~~folB~~~Dihydroneopterin aldolase~~~COG1539
MADRIELRGLTVHGRHGVYDHERVAGQRFVIDVTVWIDLAEAANSDDLADTYDYVRLASRAAEIVAGPPRKLIETVGAEI
ADHVMDDQRVHAVEVAVHKPQAPIPQTFDDVAVVIRRSRRGGRGWVVPAGGAV
>P56740 4.1.2.25~~~folB~~~Dihydroneopterin aldolase~~~
MQDTIFLKGMRFYGYHGALSAENEIGQIFKVDVTLKVDLSEAGRTDNVIDTVHYGEVFEEVKSIMEGKAVNLLEHLAERI
ANRINSQYNRVMETKVRITKENPPIPGHYDGVGIEIVRENK
>P08192 6.3.2.12~~~folC~~~Dihydrofolate synthase/folylpolyglutamate synthase~~~COG0285
MIIKRTPQAASPLASWLSYLENLHSKTIDLGLERVSLVAARLGVLKPAPFVFTVAGTNGKGTTCRTLESILMAAGYKVGV
YSSPHLVRYTERVRVQGQELPESAHTASFAEIESARGDISLTYFEYGTLSALWLFKQAQLDVVILEVGLGGRLDATNIVD
ADVAVVTSIALDHTDWLGPDRESIGREKAGIFRSEKPAIVGEPEMPSTIADVAQEKGALLQRRGVEWNYSVTDHDWAFSD
AHGTLENLPLPLVPQPNAATALAALRASGLEVSENAIRDGIASAILPGRFQIVSESPRVIFDVAHNPHAAEYLTGRMKAL
PKNGRVLAVIGMLHDKDIAGTLAWLKSVVDDWYCAPLEGPRGATAEQLLEHLGNGKSFDSVAQAWDAAMADAKAEDTVLV
CGSFHTVAHVMEVIDARRSGGK
>I6Y0R5 6.3.2.12~~~folC~~~Dihydrofolate synthase/folylpolyglutamate synthase~~~COG0285
MNSTNSGPPDSGSATGVVPTPDEIASLLQVEHLLDQRWPETRIDPSLTRISALMDLLGSPQRSYPSIHIAGTNGKTSVAR
MVDALVTALHRRTGRTTSPHLQSPVERISIDGKPISPAQYVATYREIEPLVALIDQQSQASAGKGGPAMSKFEVLTAMAF
AAFADAPVDVAVVEVGMGGRWDATNVINAPVAVITPISIDHVDYLGADIAGIAGEKAGIITRAPDGSPDTVAVIGRQVPK
VMEVLLAESVRADASVAREDSEFAVLRRQIAVGGQVLQLQGLGGVYSDIYLPLHGEHQAHNAVLALASVEAFFGAGAQRQ
LDGDAVRAGFAAVTSPGRLERMRSAPTVFIDAAHNPAGASALAQTLAHEFDFRFLVGVLSVLGDKDVDGILAALEPVFDS
VVVTHNGSPRALDVEALALAAGERFGPDRVRTAENLRDAIDVATSLVDDAAADPDVAGDAFSRTGIVITGSVVTAGAART
LFGRDPQ
>P54382 ~~~folD~~~Bifunctional protein FolD~~~COG0190
MTATIIDGKETAREKREQLAKEVEELKKQGVTPGLAVILIGDDPASHSYVRGKKKAAETMGMNFKLDQFDSSLTEAELLS
IIDQYNQDPEFHGILVQLPLPDHISEKAVIERISPDKDVDGFHPLNVGKMLLGEDTFLPCTPHGIVELLKKTNIDLSGKE
VVVVGRSNIVGKPVGQLLLNENATVTYCHSRTENITEHTKKADILVVAVGRANFISADQIKEGAVVIDVGVNRLENGKLC
GDVEFEGAKEKASFITPVPGGVGPMTITMLAHNTVKSAKRTLS
>Q0PA35 ~~~folD~~~Bifunctional protein FolD~~~COG0190
MTLLDGKALSAKIKEELKEKNQFLKSKGIESCLAVILVGDNPASQTYVKSKAKACEECGIKSLVYHLNENITQNELLALI
NTLNHDDSVHGILVQLPLPDHICKDLILESIISSKDVDGFHPINVGYLNLGLESGFLPCTPLGVMKLLKAYEIDLEGKDA
VIIGASNIVGRPMATMLLNAGATVSVCHIKTKDLSLYTRQADLIIVAAGCVNLLRSDMVKEGVIVVDVGINRLESGKIVG
DVDFEEVSKKSSYITPVPGGVGPMTIAMLLENTVKSAKNRLN
>P24186 ~~~folD~~~Bifunctional protein FolD~~~COG0190
MAAKIIDGKTIAQQVRSEVAQKVQARIAAGLRAPGLAVVLVGSNPASQIYVASKRKACEEVGFVSRSYDLPETTSEAELL
ELIDTLNADNTIDGILVQLPLPAGIDNVKVLERIHPDKDVDGFHPYNVGRLCQRAPRLRPCTPRGIVTLLERYNIDTFGL
NAVVIGASNIVGRPMSMELLLAGCTTTVTHRFTKNLRHHVENADLLIVAVGKPGFIPGDWIKEGAIVIDVGINRLENGKV
VGDVVFEDAAKRASYITPVPGGVGPMTVATLIENTLQACVEYHDPQDE
>Q5NGF3 ~~~folD~~~Bifunctional protein FolD~~~COG0190
MILIDGKSLSKDLKERLATQVQEYKHHTAITPKLVAIIVGNDPASKTYVASKEKACAQVGIDSQVITLPEHTTESELLEL
IDQLNNDSSVHAILVQLPLPAHINKNNVIYSIKPEKDVDGFHPTNVGRLQLRDKKCLESCTPKGIMTMLREYGIKTEGAY
AVVVGASNVVGKPVSQLLLNAKATVTTCHRFTTDLKSHTTKADILIVAVGKPNFITADMVKEGAVVIDVGINHVDGKIVG
DVDFAAVKDKVAAITPVPGGVGPMTITELLYNTFQCAQELNR
>A0QSY5 ~~~folD~~~Bifunctional protein FolD~~~COG0190
MGAISLDGKTTRDEIFVDLKERVAALTAAGRTPGLGTVLVGDDPGSQAYVRGKHADCAKVGINSIRRDLPADITTEQLNE
TIDELNANPDCTGYIVQLPLPKHLDENAALERIDPAKDADGLHPTNLGRLVLGKQAALPCTPRGIVHLLRRFDVPIAGAH
VVVIGRGVTVGRPMGLLLTRRSENATVTLCHTGTRDLPALTRQADIIIAAVGVPHMVTADMVKPGAAVVDVGVSRVDGKL
TGDVAPDVWEVAGHVSPNPGGVGPLTRAFLLTNVVEAEESKLA
>P9WG81 ~~~folD~~~Bifunctional protein FolD~~~COG0190
MGAIMLDGKATRDEIFGDLKQRVAALDAAGRTPGLGTILVGDDPGSQAYVRGKHADCAKVGITSIRRDLPADISTATLNE
TIDELNANPDCTGYIVQLPLPKHLDENAALERVDPAKDADGLHPTNLGRLVLGTPAPLPCTPRGIVHLLRRYDISIAGAH
VVVIGRGVTVGRPLGLLLTRRSENATVTLCHTGTRDLPALTRQADIVVAAVGVAHLLTADMVRPGAAVIDVGVSRTDDGL
VGDVHPDVWELAGHVSPNPGGVGPLTRAFLLTNVVELAERR
>P51696 ~~~folD~~~Bifunctional protein FolD~~~
MSAQIIDGKIISQTVRQEVAARVKARTDAGLRAPGLAVVLVGQDPASQIYVGSKRKACEEVGFISKSFDLPSSASEQQLL
DLIDELNQDPTMDGILVQLPLPAGMDCTRILERIDPEKDVDGFHPYNVGRLSQRIPKLRSCTPKGIITLLERYNIEVRGK
HAVIVGASNIVGRPMTLELLLAGATTTTCHRFTQDLEGHIRQADILVVAVGKPNFIPGGWIKEGATVIDVGINRLENGKL
CGDVEFDVACQRAKYITPVPGGVGPMTVASLIENTLLACEQYHSA
>Q9I2U6 ~~~folD~~~Bifunctional protein FolD~~~
MTAQLIDGKAIAANLRQQIAQRVTERRQQGLRVPGLAVILVGTDPASQVYVAHKRKDCEEVGFLSQAYDLPAETSQDDLL
ALIDRLNDDPAIDGILVQLPLPAHLDASLLLERIHPDKDVDGFHPYNIGRLAQRMPLLRPCTPKGIMTLLASTGADLYGM
DAVVVGASNIVGRPMALELLLGGCTVTVTHRFTRDLADHVSRADLVVVAAGKPGLVKGEWIKEGAIVIDVGINRQADGRL
VGDVEYEVAAQRASWITPVPGGVGPMTRACLLENTLHAAEHLHD
>Q7A697 ~~~folD~~~Bifunctional protein FolD~~~
MVAKILDGKQIAKDYRQGLQDQVEALKEKGFTPKLSVILVGNDGASQSYVRSKKKAAEKIGMISEIVHLEETATEEEVLN
ELNRLNNDDSVSGILVQVPLPKQVSEQKILEAINPEKDVDGFHPINIGKLYIDEQTFVPCTPLGIMEILKHADIDLEAKN
AVVIGRSHIVGQPVSKLLLQKNASVTILHSRSKDMASYLKDADVIVSAVGKPSLVTKDVVKEGAVIIDVGNTPDENGKLK
GDVDYDAVKEIAGAITPVPGGVGPLTITMVLNNTLLAEKMRRGIDS
>Q5SJ94 ~~~folD~~~Bifunctional protein FolD~~~COG0190
MAAQVLSGHEAAEAVYEEIRARLRSLSFTPSLRVIRLGEDPASVAYVRLKDKRARALGYRSQVEVYPEDLPEEALLERIA
ALNADEEVDGILVQLPLPPHIRTQRVLEAIHPLKDVDGFHPLNVGRLWSGGKGLFPCTPLGVVRLLKHYGVDLRGKEVVV
VGRSNIVGKPLAGLLLREDATVTLAHSKTQDLPEVTRRAQVLVVAVGRPHLVRKEWVREGAIVVDVGVNRVEGRLLGDVH
PEVAEVAFALTPVPGGVGPMTVAMLMGNTLEAALLRRHGASG
>P0AFS3 1.5.1.50~~~folM~~~Dihydromonapterin reductase~~~COG1028
MGKTQPLPILITGGGRRIGLALAWHFINQKQPVIVSYRTHYPAIDGLINAGAQCIQADFSTNDGVMAFADEVLKSTHGLR
AILHNASAWMAEKPGAPLADVLACMMQIHVNTPYLLNHALERLLRGHGHAASDIIHFTDYVVERGSDKHIAYAASKAALD
NMTRSFARKLAPEVKVNSIAPSLILFNEHDDAEYRQQALNKSLMKTAPGEKEVIDLVDYLLTSCFVTGRSFPLDGGRHLR
>D8KIT5 3.6.1.-~~~folQ~~~Probable DHNTP pyrophosphohydrolase~~~
MNEDLISQIKEVVTAENQEKLIKIIQLLESSNYELRGKINPDLQLSASALVFKEDKLFFIEHPYQKELLLPAGHVELKES
PLDTAIREFHEETGFFAKKMGKLVDVNLIDIPFNETKNEKKHQHIDFRYLLELEEQEAELAELPFFLLELEEAPEEFKKY
YRYKNI
>A0Q1J7 ~~~folT~~~Folate transporter FolT~~~COG3601
MKKVNVMIYMAFMITLEIVFTRFLSIQTPIIRIGFGFIPVAMSGMMFGPLLAGIVGATSDVLGMMIFPKGAYFPGFTLSA
FVGAVIYGVFFYNKKVSVKRVLLAVGIITVLVNLTMNTIWLQILTGKAVKVLFVTRLVKEAIMFPIHAIVIYGAWKMVDR
LEIMNKVAKFNK
>Q035X6 ~~~folT~~~Folate transporter FolT~~~
MQVLSFSSPKLSTRNMVYMAMLMAMQIVLGRFSFGTPWLKISPAFFATILMGYYFGPWLAAGAAALNDQLSIMIFSPGAN
FPGFTISAAIAAMLYGMFFHGKKVTVLRTLLAVGMVLLISNIILTTLWLNIMGTPWQGIIWPRTIKNVVMLPIQTALSYG
TLKAIERIRPHL
>Q03ZT0 ~~~folT~~~Folate transporter FolT~~~COG4720
MENTTRTWVFPKLDTRQFVLLAMLMALHMVLSRLTVGTNVLQVSFAFVTMSLIAKWYGPLWSMLIAAILDVIGATIINPG
AFFVGFTFTAMISALIYSLAYFKHDKTSWWRVSVAVGLVLLIANIGLNSIWLVMMYHTAHDWPSFLAFITPRVIKNLIMF
PIQVGISYFLLNNQVISHTTKKIFS
>Q8DV98 ~~~folT~~~Folate transporter FolT~~~
MNTMFKSPKLSPQRLVTLAMLIALAFAIGKLSIPIIPQQLIISPTFIVNVMIGMIGGPIWAFISLAILDIVDNLSSGAGN
FIIWWTLLEAVQGLFYGLFFYQKSLSWTNKKDWLHVTIATAIIMLIGSFIFTPLLVQIYYGVPFWAQFAAGRWLKIFEIP
IRILVTMAIMPQLQRIPELRKLANFK
>P0AC19 5.1.99.7~~~folX~~~Dihydroneopterin triphosphate 2'-epimerase~~~COG1539
MAQPAAIIRIKNLRLRTFIGIKEEEINNRQDIVINVTIHYPADKARTSEDINDALNYRTVTKNIIQHVENNRFSLLEKLT
QDVLDIAREHHWVTYAEVEIDKLHALRYADSVSMTLSWQR
>Q9HYG7 5.1.99.7~~~folX~~~Dihydroneopterin triphosphate 2'-epimerase~~~
MPRLEPGMARIRVKDLRLRTFIGIKEEEILNKQDVLINLTILYPAADAVEVNDIEHALNYRTITKAIIRHVEENRFALLE
RMTQEILDLVMENPAVRYAEVEVDKPHALRFAESVSITLAGHR
>P96074 ~~~fom1~~~Fosfomycin biosynthesis bifunctional protein Fom1~~~
MQRPIVYVGMSADLIHPGHINILSRAAELGDITIGLLTDAAIASYKRLPHMTYEQRKAVVENLKGVASVVPQRTLDYAEN
LRTVRPDFVVHGDDWQTGVQRHTRERVIEVLSEWGGKLVEIPYTPGISSTRLHSSVKEVGTTPNVRLSRLRRLLDSKDIV
RILEVHNGLTGLIIENSKVTVDNQAREFDGMWSSSLTDSLARGKPDTEAVDVSSRLQMVNELFEVTTKPLVFDGDTGGKP
EHFGFTVRSLERLGVSAVIVEDKEGLKRNSLFGTDVPQTQSSVEDFSERIRIGKRAQITDDFMVIARIESLILEKGMADA
VHRAEAYVDAGADGIMIHSRQSDPAEIFEFCRYFDKLPRRVPLVVVPTSYSSVRESELADAGVNMVIYANHLMRAVYPQV
TKVVQSILQHGRAHEAESMLASIKDALSIIPENAG
>Q56184 2.1.1.308~~~fom3~~~Cytidylyl-2-hydroxyethylphosphonate methyltransferase~~~
MTIGSLGSTEFALHGKPAIRWGDLPQRVGKPETRRYQKVLLLNPSATLFRHDLPRCTYPLGLGYIAAVLEKYGYEVKILD
VFAEGYYNAQPVDGDDQFLRYGLSDDDIVKVMKEFGPDVVGISSIFSNQADNVHHLLKLADLVTPEAVTAIGGAHARYFP
KACLDDPNLDAVFLGEGEMTFLLWMEHLNGNVSDDEVHGIAWRDRDGKVQIKPELPLISSMRPEGPETGKSSPMLSMAGE
LDHIPFPAWHHYNMEKYFEIKAYQSPYTVGSRVGQLYTSRGCTAHCTFCTTTHFWGQKLRRRSVQDVVDEVLRLRDEYGI
DEFHIQDDNITNDMDHARELFRAFKEVGLPWATPQGTALWRMDEELLDLMAESGAYQVTFAIESGVQRVLKELIKKPLNL
ERTSHLIKYARSLGMHVHGFFIIGMPPMCGNAGESIEEMQASYDYAEEAGFSSASFFAASPIVGSELLRECIRQGFVDPE
ESLYRMTYKQGIINVPGLWDGEEIAELAAKFNRDFNARRDRAYTPQKQWNANQY
>Q9I4K6 2.5.1.18~~~fosA~~~Glutathione transferase FosA~~~
MLTGLNHLTLAVADLPASIAFYRDLLGFRLEARWDQGAYLELGSLWLCLSREPQYGGPAADYTHYAFGIAAADFARFAAQ
LRAHGVREWKQNRSEGDSFYFLDPDGHRLEAHVGDLRSRLAACRQAPYAGMRFAD
>Q56415 2.5.1.18~~~fosA~~~Glutathione transferase FosA~~~
MLQSLNHLTLAVSDLQKSVTFWHELLGLTLHARWNTGAYLTCGDLWVCLSYDEARQYVPPQESDYTHYAFTVAEEDFEPL
SQRLEQAGVTIWKQNKSEGASFYFLDPDGHKLELHVGSLAARLAACREKPYAGMVFTSDEA
>Q81W73 2.5.1.-~~~fosB2~~~Metallothiol transferase FosB 2~~~COG0346
MLQGINHICFSVSNLEKSIEFYQKILQAKLLVKGRKLAYFDLNGLWIALNVEEDIPRNEIKQSYTHMAFTVTNEALDHLK
EVLIQNDVNILPGRERDERDQRSLYFTDPDGHKFEFHTGTLQNRLEYYKEDKKHMTFYI
>Q739M9 2.5.1.-~~~fosB~~~Metallothiol transferase FosB~~~
MLNGINHLCFSVSNLEDSIEFYEKVLEGELLVRGRKLAYFNICGVWVALNEEIHIPRNEIYQSYTHIAFSVEQKDFESLL
QRLEENDVHILKGRERDVRDCESIYFVDPDGHKFEFHSGTLQDRLNYYREDKPHMTFY
>O31817 2.5.1.-~~~fosB~~~Metallothiol transferase FosB~~~COG0346
MEIKGINHLLFSVSHLDTSIDFYQKVFGAKLLVKGRTTAYFDMNGIWLALNEEPDIPRNDIKLSYTHIAFTIEDHEFEEM
SAKLKRLHVNILPGRERDERDRKSIYFTDPDGHKFEFHTGTLQDRLRYYKQEKTHMHFYDETAF
>P60864 2.5.1.-~~~fosB~~~Metallothiol transferase FosB~~~
MLKSINHICFSVRNLNDSIHFYRDILLGKLLLTGKKTAYFELAGLWIALNEEKDIPRNEIHFSYTHIAFTIDDSEFKYWH
QRLKDNNVNILEGRVRDIRDRQSIYFTDPDGHKLELHTGTLENRLNYYKEAKPHMTFYK
>Q8Y6I2 ~~~fosX~~~Fosfomycin resistance protein FosX~~~COG0346
MISGLSHITLIVKDLNKTTTFLREIFNAEEIYSSGDQTFSLSKEKFFLIAGLWICIMEGDSLQEQTYNHIAFRIQSEEVD
EYIERIKSLGVEIKPERPRVEGEGRSIYFYDFDNHLFELHAGTLEERLKRYHE
>Q98GG1 ~~~fosX~~~Fosfomycin resistance protein FosX~~~COG0346
MIEGLSHMTFIVRDLERMTRILEGVFDAREVYASDTEQFSLSREKFFLIGDIWVAIMQGEKLAERSYNHIAFKIDDADFD
RYAERVGKLGLDMRPPRPRVEGEGRSIYFYDDDNHMFELHTGTLTERLARKAKGLEAAQ
>P9WNC3 3.2.2.23~~~fpg1~~~Formamidopyrimidine-DNA glycosylase 1~~~COG0266
MPELPEVEVVRRGLQAHVTGRTITEVRVHHPRAVRRHDAGPADLTARLRGARINGTDRRGKYLWLTLNTAGVHRPTDTAL
VVHLGMSGQMLLGAVPCAAHVRISALLDDGTVLSFADQRTFGGWLLADLVTVDGSVVPVPVAHLARDPLDPRFDCDAVVK
VLRRKHSELKRQLLDQRVVSGIGNIYADEALWRAKVNGAHVAATLRCRRLGAVLHAAADVMREALAKGGTSFDSLYVNVN
GESGYFERSLDAYGREGENCRRCGAVIRRERFMNRSSFYCPRCQPRPRK
>L0T864 ~~~fpg2~~~Uncharacterized formamidopyrimidine-DNA glycosylase-like protein~~~COG0266
MAGTPQPRALGPDALDVSTDDLAGLLAGNTGRIKTVITDQKVIAGIGNAYSDEILHVAKISPFATAGKLSGAQLTCLHEA
MASVLSDAVRRSVGQGAAMLKGEKRSGLRVHARTGLPCPVCGDTVREVSFADKSFQYCPTCQTGGKALADRRMSRLLK
>P15925 6.3.2.17~~~fpgS~~~Folylpolyglutamate synthase~~~
MNYTETVAYIHSFPRLAKTGDHRRILTLLHALGNPQQQGRYIHVTGTNGKGSAANAIAHVLEASGLTVGLYTSPFIMRFN
ERIMIDHEPIPDAALVNAVAFVRAALERLQQQQADFNVTEFEFITALGYWYFRQRQVDVAVIEVGIGGDTDSTNVITPVV
SVLTEVALDHQKLLGHTITAIAKHKAGIIKRGIPVVTGNLVPDAAAVVAAKVATTGSQWLRFDRDFSVPKAKLHGWGQRF
TYEDQDGRISDLEVPLVGDYQQRNMAIAIQTAKVYAKQTEWPLTPQNIRQGLAASHWPARLEKISDTPLIVIDGAHNPDG
INGLITALKQLFSQPITVIAGILADKDYAAMADRLTAAFSTVYLVPVPGTPRALPEAGYEALHEGRLKDSWQEALAASLN
DVPDQPIVITGSLYLASAVRQTLLGGKS
>P05523 3.2.2.23~~~mutM~~~Formamidopyrimidine-DNA glycosylase~~~COG0266
MPELPEVETSRRGIEPHLVGATILHAVVRNGRLRWPVSEEIYRLSDQPVLSVQRRAKYLLLELPEGWIIIHLGMSGSLRI
LPEELPPEKHDHVDLVMSNGKVLRYTDPRRFGAWLWTKELEGHNVLTHLGPEPLSDDFNGEYLHQKCAKKKTAIKPWLMD
NKLVVGVGNIYASESLFAAGIHPDRLASSLSLAECELLARVIKAVLLRSIEQGGTTLKDFLQSDGKPGYFAQELQVYGRK
GEPCRVCGTPIVATKHAQRATFYCRQCQK
>P42371 3.2.2.23~~~mutM~~~Formamidopyrimidine-DNA glycosylase~~~
MPELPEVETVRRELEKRIVGQKIISIEATYPRMVLTGFEQLKKELTGKTIQGISRRGKYLIFEIGDDFRLISHLRMEGKY
RLATLDAPREKHDHLTMKFADGQLIYADVRKFGTWELISTDQVLPYFLKKKIGPEPTYDEDFDEKLFREKLRKSTKKIKP
YLLEQTLVAGLGNIYVDEVLWLAKIHPEKETNQLIESSIHLLHDSIIEILQKAIKLGGSSIRTYSALGSTGKMQNELQVY
GKTGEKCSRCGAEIQKIKVAGRGTHFCPVCQQK
>O50606 3.2.2.23~~~mutM~~~Formamidopyrimidine-DNA glycosylase~~~COG0266
MPELPEVETTRRRLRPLVLGQTLRQVVHRDPARYRNTALAEGRRILEVDRRGKFLLFALEGGVELVAHLGMTGGFRLEPT
PHTRAALVLEGRTLYFHDPRRFGRLFGVRRGDYREIPLLLRLGPEPLSEAFAFPGFFRGLKESARPLKALLLDQRLAAGV
GNIYADEALFRARLSPFRPARSLTEEEARRLYRALREVLAEAVELGGSTLSDQSYRQPDGLPGGFQTRHAVYGREGLPCP
ACGRPVERRVVAGRGTHFCPTCQGEGP
>Q6MLJ0 ~~~~~~Ferreportin~~~
MKVQSLLRIETQLLLGRLLTRSGDQAWDFVVPFALLVIFPGKLQVAAFYYLIVKIGTFLLTPSSGKWIDTHPRIQVVKWG
VWLQFFAILAGMVFFGMLDGLVRAGGRESWLLSVLFIALALSGVMASLGSQITDISVGNDLAPSLVAPEKLTHFNSWLRR
IDLATEVGAPILAGALFAFHPEQLPLAGLFLIGLWNLVSFVPEYFLLRNVIQRSGLKIKVLTEAQSWKDTFHINLRGSFS
DPIFWLILSYALLWLSVLSPHGVLLAAYLKDEMRLPETEIGLFRGLGAVFGLISTVSFPYLVRRLGLISSSRWHLGFQGV
TLGIAVTAFAMGSTASVYVFLGCILLSRVGLYGFSNGEFELRQRLIPEGRRGELNSLSSLTTTSATLILFSAGSLLPQTE
DFKYLVYVSLAAVLLANVVFIKWSSRQGVVTSGAAEPVES
>P9WKH1 2.5.1.10~~~~~~(2E,6E)-farnesyl diphosphate synthase~~~COG0142
MRGTDEKYGLPPQPDSDRMTRRTLPVLGLAHELITPTLRQMADRLDPHMRPVVSYHLGWSDERGRPVNNNCGKAIRPALV
FVAAEAAGADPHSAIPGAVSVELVHNFSLVHDDLMDRDEHRRHRPTVWALWGDAMALLAGDAMLSLAHEVLLDCDSPHVG
AALRAISEATRELIRGQAADTAFESRTDVALDECLKMAEGKTAALMAASAEVGALLAGAPRSVREALVAYGRHIGLAFQL
VDDLLGIWGRPEITGKPVYSDLRSRKKTLPVTWTVAHGGSAGRRLAAWLVDETGSQTASDDELAAVAELIECGGGRRWAS
AEARRHVTQGIDMVARIGIPDRPAAELQDLAHYIVDRQA
>Q97K92 1.6.3.4~~~fprA1~~~Flavo-diiron protein FprA1~~~COG0426
MSAEKLCENVYWVGVKDQKLRVFDIIMNTKKGSTYNSYLINDDKVAIIDTVKDGFYDEFLKSIKSVIGDKKVDYIVVQHT
ELDHSGSMYRLIKEYPEAKVVSSKAANMYLKEIVNDEFNSLDAMEVKELNLGKNTLEFISAPNLHWPDTMFTYNKENNIL
FTCDVMGCHYCPDGSIKDEGGEDYLPEMRYYFDVIMSPFKKFVNMGLDKIKDLKLDMIAPSHGPVHINDIEESVKLYREW
AKEKEPKEKNVQIFYITAYGNTGIMAKHLCEDINKKGVKAEVHEITDMKMEDIVELIADANGVLVGSPTINQDAVRPVWD
VLSSVCPIVNRGKAAAAFGSYGWSGEGVPMMMDRLKSLKFKTPDNGLKFKFVPASKEFSEADKFVDDFIGLL
>Q97GC0 1.6.3.4~~~fprA2~~~Flavo-diiron protein FprA2~~~COG0426
MPAIKIKDNIFSVGVLNPSLRIFDIIMKTEYGTSYNAYLIKGKKNVLIDTVHGRFFDEYLENIKSVIDPSSIDYVIMNHC
EPDHSGSLARLYEVAPQIKVIASNAGKIYLKNITNKETLDVKAVKTNDTLDIGNGKVLKFAIAPFLHWPDSMFTILEEDK
IAFTCDFLGCHFCEPRMFDTKITYMPKYEKSFKEYYDAIFSPFKPYVVKGLDILDALDLDFIATSHGPILTREGLLAASK
QKYRDLSSEIQSTTKYIPIFYCSAYGNTEILANEIASGIKSVLNDANIEMLDIINYDYSDLKEKINICDAFMLGTPTINK
DALFPIWELIGGIDAVNCKNKPASAFGSFGWSGEAIPFVISRLKELKLKVFQDGFTCLFVPSEDDIKKAFKFGEDFAKSI
>Q9FDN7 1.-.-.-~~~fprA~~~Nitric oxide reductase~~~COG0426
MSQPVAITDGIYWVGAVDWNIRYFHGPAFSTHRGTTYNAYLIVDDKTALVDTVYEPFKEELIAKLKQIKDPVKLDYLVVN
HTESDHAGAFPAIMELCPDAHVLCTQRAFDSLKAHYSHIDFNYTIVKTGTSVSLGKRSLTFIEAPMLHWPDSMFTYVPEE
ALLLPNDAFGQHIATSVRFDDQVDAGLIMDEAAKYYANILMPFSNLITKKLDEIQKINLAIKTIAPSHGIIWRKDPGRII
EAYARWAEGQGKAKAVIAYDTMWLSTEKMAHALMDGLVAGGCEVKLFKLSVSDRNDVIKEILDARAVLVGSPTINNDILP
VVSPLLDDLVGLRPKNKVGLAFGAYGWGGGAQKILEERLKAAKIELIAEPGPTVQWVPRGEDLQRCYELGRKIAARIAD
>P9WIQ3 1.18.1.2~~~fprA~~~NADPH-ferredoxin reductase FprA~~~COG0493
MRPYYIAIVGSGPSAFFAAASLLKAADTTEDLDMAVDMLEMLPTPWGLVRSGVAPDHPKIKSISKQFEKTAEDPRFRFFG
NVVVGEHVQPGELSERYDAVIYAVGAQSDRMLNIPGEDLPGSIAAVDFVGWYNAHPHFEQVSPDLSGARAVVIGNGNVAL
DVARILLTDPDVLARTDIADHALESLRPRGIQEVVIVGRRGPLQAAFTTLELRELADLDGVDVVIDPAELDGITDEDAAA
VGKVCKQNIKVLRGYADREPRPGHRRMVFRFLTSPIEIKGKRKVERIVLGRNELVSDGSGRVAAKDTGEREELPAQLVVR
SVGYRGVPTPGLPFDDQSGTIPNVGGRINGSPNEYVVGWIKRGPTGVIGTNKKDAQDTVDTLIKNLGNAKEGAECKSFPE
DHADQVADWLAARQPKLVTSAHWQVIDAFERAAGEPHGRPRVKLASLAELLRIGLG
>P0CY93 1.-.-.-~~~fprA~~~Type A flavoprotein fprA~~~
MSVPPFTIRPAAPRLDGPTGPVAVAPGVHWVGALDPGLRNFDVILKTANGTTYNAYAVRGSEGVAVIDTVKAEFAGDFFA
RLEAVARYDEIRLIVLNHLEPDHTGAVPELLRRAPQAQVRLSPRGLPMLRALLKDDFERYDIKGVTTGQSVSLGDRICSF
FTTPFVHWPDTQCTWLAAERVLFTCDLFGSHYCDGRLFNDLVGDFRFSFEYYFDRIMRPFRSFVAQVLDLIEPLDFGIIA
PAHGPILRSHPRDYLTHTRRLISSWLAAETGSEKTLLIFYVSAYRATAQLAQAIHDGAAESPDVRVSLFDLEGGEITPFL
DLIEEADGIALGTPTINGDAVRTIWEMLAALVDIETRGKLGAAFGSYGWSGEAVRLVETRLQGLKMRLPEPGLRVKLHPS
AAELEEGRAFGRRLADHLTGRARPREVDFAEIAAR
>P9WJI1 1.18.1.2~~~fprB~~~Probable ferredoxin/ferredoxin--NADP reductase~~~COG0493
MPHVITQSCCNDASCVFACPVNCIHPTPDEPGFATSEMLYIDPVACVDCGACVTACPVSAIAPNTRLDFEQLPFVEINAS
YYPKRPAGVKLAPTSKLAPVTPAAEVRVRRQPLTVAVVGSGPAAMYAADELLVQQGVQVNVFEKLPTPYGLVRSGVAPDH
QNTKRVTRLFDRIAGHRRFRFYLNVEIGKHLGHAELLAHHHAVLYAVGAPDDRRLTIDGMGLPGTGTATELVAWLNGHPD
FNDLPVDLSHERVVIIGNGNVALDVARVLAADPHELAATDIADHALSALRNSAVREVVVAARRGPAHSAFTLPELIGLTA
GADVVLDPGDHQRVLDDLAIVADPLTRNKLEILSTLGDGSAPARRVGRPRIRLAYRLTPRRVLGQRRAGGVQFSVTGTDE
LRQLDAGLVLTSIGYRGKPIPDLPFDEQAALVPNDGGRVIDPGTGEPVPGAYVAGWIKRGPTGFIGTNKSCSMQTVQALV
ADFNDGRLTDPVATPTALDQLVQARQPQAIGCAGWRAIDAAEIARGSADGRVRNKFTDVAEMLAAATSAPKEPLRRRVLA
RLRDLGQPIVLTVPL
>P42512 ~~~fptA~~~Fe(3+)-pyochelin receptor~~~
MKTETKVIKGRQGIARNRHTPLCLGLLLALSPLAAAVADARKDGETELPDMVISGESTSATQPPGVTTLGKVPLKPRELP
QSASVIDHERLEQQNLFSLDEAMQQATGVTVQPFQLLTTAYYVRGFKVDSFELDGVPALLGNTASSPQDMAIYERVEILR
GSNGLLHGTGNPAATVNLVRKRPQREFAASTTLSAGRWDRYRAEVDVGGPLSASGNVRGRAVAAYEDRDYFYDVADQGTR
LLYGVTEFDLSPDTLLTVGAQYQHIDSITNMAGVPMAKDGSNLGLSRDTYLDVDWDRFKWDTYRAFGSLEQQLGGGWKGK
VSAEYQEADSRLRYAGSFGAIDPQTGDGGQLMGAAYKFKSIQRSLDANLNGPVRLFGLTHELLGGVTYAQGETRQDTARF
LNLPNTPVNVYRWDPHGVPRPQIGQYTSPGTTTTTQKGLYALGRIKLAEPLTLVVGGRESWWDQDTPATRFKPGRQFTPY
GGLIWDFARDWSWYVSYAEVYQPQADRQTWNSEPLSPVEGKTYETGIKGELADGRLNLSLAAFRIDLENNPQEDPDHPGP
PNNPFYISGGKVRSQGFELEGTGYLTPYWSLSAGYTYTSTEYLKDSQNDSGTRYSTFTPRHLLRLWSNYDLPWQDRRWSV
GGGLQAQSDYSVDYRGVSMRQGGYALVNMRLGYKIDEHWTAAVNVNNLFDRTYYQSLSNPNWNNRYGEPRSFNVSLRGAF
>Q81L65 ~~~fpuA~~~Petrobactin-binding protein FpuA~~~
MKKILSIFIVVFLFAVGCGQQKEEKKETKADNKNQAITIKHAEGETKLDKPAKKVVVLEWVYSEDLLALGVQPVGMADIK
NYNKWVNTKTKPSKDVVDVGTRQQPNLEEISRLKPDLIITASFRGKAIKNELEQIAPTVMFDPSTSNNDHFAEMTETFKQ
IAKAVGKEEEGKKVLADMDKAFADAKAKIEKADLKDKNIAMAQAFTAKNVPTFRILTDNSLALQVTKKLGLTNTFEAGKS
EPDGFKQTTVESLQSVQDSNFIYIVADEDNIFDTQLKGNPAWEELKFKKENKMYKLKGDTWIFGGPESATSLATQVADVM
TAKK
>Q81L64 ~~~fpuB~~~Petrobactin import system permease protein FpuB~~~
MNNLQHTLRASLVFGGGGALLLLLFFIHIGQGQANISYSMIIDALISPNQSLEHQTLIMLRLPRAVIAILAGGALAASGV
ILQTLTKNPLAESSTMGIHSGAYFFLVAATIFLPKGLQINSLLFTFIGGAITALFVYRISGEKKGTPLRMALAGMVVTLM
LSAFTGTMQLFYENETAGLFLWGAGSLIQNNWDGVQFSFPFIIISFLVLLGISRKLNILLLGDDVAVSLGEKTAVTRLIA
FIAAIFLTAVIVTVVGPIGFVGLVAPHLMRLIGYRQHFTLLLSSFLWGAVLLLGADVAGRLIDPTGAELPVGAVTAMIGS
PWLIYLVYRMMKSKQYMNDNGANTAGASSRYYSYKKVIIISITLCIVTIALGVTIGSNAYIESITNVISGQLTQFDKNMM
MNLRLPRMLVAAIAGACLAISGLVFQGILRNPLADPSIIGISSGAGVGALTIMYVFPTLPGFFLPIGAFIGGLLAVGIVL
FFSWKSGFSPTALALIGIGISALGSAIIQIFIVKANLNVAAALTWLSGSTYARGWNHLENIILYPSLILVLIIFFLIKQL
DVLVLGDDLATGLGQPVNKTRLALIVLATLLASVNIAAVGTIAFLGLVAPHLARIVVGMNHQRLFVCSALFGAILLSIAD
LLGRTIAYPKEIPSGLVVAVLGAPYFLWLMRKSGKKVN
>Q81LM1 7.2.2.-~~~fpuC~~~Petrobactin import ATP-binding protein FpuC~~~
MISVNKVFYAHSERFQMQNMNVHIKAGEVVSLIGPNGSGKSTLLRLMARLLKQSEGDIVLDGKSIHTMKSADVAKQLAML
PQMHDHQLDLTVKELIEFGRGPHKSWRGRLNKEDEEIVDWALSVTNLEGYEYRLLQSLSGGERQRAWIAMTLAQRTNVLL
LDEPTTFLDIVHQLEVMELVKRLNEEFGMTIIMVLHDINQAAQYSDRLLVLKRGKLQYDGVPEEVLCHEMFQHIFGIEVD
IFQGSEKPFFTPKRISKKGGAKCEQKNVLPLS
>Q81V82 7.2.2.-~~~fpuD~~~Petrobactin import ATP-binding protein FpuD~~~
MQKALETKRLTLSYGETIIIDELNLEIPKGEITIFIGSNGCGKSTLLRSLARLLKPTTGDILLDNQAIQSMQTKQIARQM
AILPQGPQAPEGLTVLQLVKQGRYPYQTWLKQWSEKDEEMVQNALAATGMTEFAERDVHALSGGQRQRAWIAMTLAQDTD
IILLDEPTTYLDMTHQIEVLDLLFELNETEQRTIVMVLHDLNLACRYADNIVAIQDKQIYAQGKPEEVVDEKLVRDVFRM
ECQISTDPLFGTPLCIPHGKGRRVRKEVAHAMR
>P48632 ~~~fpvA~~~Ferripyoverdine receptor~~~
MPAPHGLSPLSKAFLMRRAFQRRILPHSLAMALSLPLAGYVQAQEVEFDIPPQALGSALQEFGRQADIQVLYRPEEVRNK
RSSAIKGKLEPNQAITELLRGTGASVDFQGNAITISVAEAADSSVDLGATMITSNQLGTITEDSGSYTPGTIATATRLVL
TPRETPQSITVVTRQNMDDFGLNNIDDVMRHTPGITVSAYDTDRNNYYARGFSINNFQYDGIPSTARNVGYSAGNTLSDM
AIYDRVEVLKGATGLLTGAGSLGATINLIRKKPTHEFKGHVELGAGSWDNYRSELDVSGPLTESGNVRGRAVAAYQDKHS
FMDHYERKTSVYYGILEFDLNPDTMLTVGADYQDNDPKGSGWSGSFPLFDSQGNRNDVSRSFNNGAKWSSWEQYTRTVFA
NLEHNFANGWVGKVQLDHKINGYHAPLGAIMGDWPAPDNSAKIVAQKYTGETKSNSLDIYLTGPFQFLGREHELVVGTSA
SFSHWEGKSYWNLRNYDNTTDDFINWDGDIGKPDWGTPSQYIDDKTRQLGSYMTARFNVTDDLNLFLGGRVVDYRVTGLN
PTIRESGRFIPYVGAVYDLNDTYSVYASYTDIFMPQDSWYRDSSNKLLEPDEGQNYEIGIKGEYLDGRLNTSLAYFEIHE
ENRAEEDALYNSKPTNPAITYAYKGIKAKTKGYEAEISGELAPGWQVQAGYTHKIIRDDSGKKVSTWEPQDQLSLYTSYK
FKGALDKLTVGGGARWQGKSWQMVYNNPRSRWEKFSQEDYWLVDLMARYQITDKLSASVNVNNVFDKTYYTNIGFYTSAS
YGDPRNLMFSTRWDF
>P9WP11 1.1.98.-~~~~~~F420H(2)-dependent quinone reductase Rv1558~~~COG1846
MPLSGEYAPSPLDWSREQADTYMKSGGTEGTQLQGKPVILLTTVGAKTGKLRKTPLMRVEHDGQYAIVASLGGAPKNPVW
YHNVVKNPRVELQDGTVTGDYDAREVFGDEKAIWWQRAVAVWPDYASYQTKTDRQIPVFVLTPVRAGG
>P9WP13 1.1.98.-~~~~~~F420H(2)-dependent quinone reductase Rv1261c~~~COG3945
MDISRWLERHVGVQLLRLHDAIYRGTNGRIGHRIPGAPPSLLLHTTGAKTSQPRTTSLTYARDGDAYLIVASKGGDPRSP
GWYHNLKANPDVEINVGPKRFGVTAKPVQPHDPDYARLWQIVNENNANRYTNYQSRTSRPIPVVVLTRR
>P46072 1.6.99.-~~~~~~Major NAD(P)H-flavin oxidoreductase~~~
MTHPIIHDLENRYTSKKYDPSKKVSQEDLAVLLEALRLSASSINSQPWKFIVIESDAAKQRMHDSFANMHQFNQPHIKAC
SHVILFANKLSYTRDDYDVVLSKAVADKRITEEQKEAAFASFKFVELNCDENGEHKAWTKPQAYLALGNALHTLARLNID
STTMEGIDPELLSEIFADELKGYECHVALAIGYHHPSEDYNASLPKSRKAFEDVITIL
>Q797E6 ~~~fra~~~Intracellular iron chaperone frataxin~~~COG5646
MDVFSEYLAGIADPFHRERTEEVLTWIKNKYPNLHTEIKWNQPMFTDHGTFIIGFSVSKKHLAVAPEKVTIAHVEDDIVK
AGYDYTEQLIRIPWNGPVDYTLLEKMIEFNILDKADCSTFWRK
>Q0ZQ46 2.3.3.19~~~frbC~~~2-phosphonomethylmalate synthase~~~
MRNDLVLEDTTLRDGEQTPGVAFSKETKTAILNALIEAGVTSIEIGIPAMGGEELDFIKSVVDRQDEARLVVWHRGVRED
VERSLDLGFTSVHVGLPTSAGHLKASVRKDRTWLLATARDMVKMAKDRGAFVSISAEDIARTEISFLQEYAGVVAEAGAD
RLRLSDTVGLLGPEAYGERVAAVLSAADIDVQCHAHNDFGLATANTLAGLKAGARYFHVTVNAIGERAGMADLAQVVVAL
KKLYDRDLGIDLTKLKKVSRLVAEAAGHQVLPWQPITGDNVFAHESGIHANGMFRDTSSFEPFPPEHVGGERRYVLGKHS
GRALVAWALEQEGITPREELLPHCLEEVRALSIRIGGAVSHEQLVEIYNKAAA
>Q9F9B0 7.5.2.-~~~frcA~~~Fructose import ATP-binding protein FrcA~~~
MAQEPILTARGLVKRYGRVTALDRADFDLYPGEILAVIGDNGAGKSSMIKAISGAVTPDEGEIRLEGKPIQFRSPMEARQ
AGIETVYQNLALSPALSIADNMFLGREIRKPGIMGKWFRSLDRAAMEKQARAKLSELGLMTIQNINQAVETLSGGQRQGV
AVARAAAFGSKVVIMDEPTAALGVKESRRVLELILDVRRRGLPIVLISHNMPHVFEVADRIHIHRLGRRLCVINPKDYTM
SDAVAFMTGAKEPPREAIAA
>Q9F9B2 ~~~frcB~~~Fructose import binding protein FrcB~~~
MKKTVLSAAFGALAMGVAFASPSQAAEVSACLITKTDTNPFFVKMKEGAAAKAKELGVTLKSYAGKIDGDSESQVAAIET
CIADGAKGILIAASDTQGIVPQVQKARDAGLLVIALDTPLEPLDAADATFATDNLLAGKLIGQWAAATLGDAAKEAKVAF
LDLTPSQPSVDVLRDQGFMIGFGIDPKDPNKIGDEDDPRIVGHDITNGNEEGGRTAMENLLQKDPTINVVHTINEPAAAG
AYEALKSVGREKDVLIVSVDGGCPGVKNVAEGVIGATSQQYPLMMAALGIEAIKKFADTGEKPTPTEGKDFVDTGVSLVA
DKPVSGVESIDTKTGMEKCWG
>Q9F9B1 ~~~frcC~~~Fructose import permease protein FrcC~~~
MGETNTAAQPSQEFEKVLADSSTDVASFDAHDKTLLQKLQHFLHSSPAAVPLIVLVLSLIAFGVILGGKFFSAFTMTLIL
QQVAIVGIVGAAQTLVILTAGIDLSVGAIMVLSSVIMGQFTFRYGFPPALSVICGLGVGALCGYINGTLVARMKLPPFIV
TLGMWQIVLASNFLYSANETIRAQDISANASILQFFGQNFRIGNAVFTYGVVVMVLLVCLLWYVLNRTAWGRYVYAVGDD
PEAAKLAGVNVTRMLISIYTLSGLICALAGWALIGRIGSVSPTAGQFANIESITAVVIGGISLFGGRGSIMGMLFGALIV
GVFSLGLRLMGTDPQWTYLLIGLLIIIAVAIDQWIRKVAA
>Q9Z4P0 1.3.5.1~~~ifcA~~~Fumarate reductase flavoprotein subunit~~~COG1053
MKLKYLVSAMALVVLSSGTAMAKTPDMGSFHADMGSCQSCHAKPIKVTDSETHENAQCKSCHGEYAELANDKLQFDPHNS
HLGDINCTSCHKGHEEPKFYCNECHSFDIKPMPFSDAKKKKSWDDGWDQDKIQKAIAAGPSETTQVLVVGAGSAGFNASL
AAKKAGANVILVDKAPFSGGNSMISAGGMNAVGTKQQTAHGVEDKVEWFIEDAMKGGRQQNDIKLVTILAEQSADGVQWL
ESLGANLDDLKRSGGARVDRTHRPHGGKSSGPEIIDTLRKAAKEQGIDTRLNSRVVKLVVNDDHSVVGAVVHGKHTGYYM
IGAKSVVLATGGYGMNKEMIAYYRPTMKDMTSSNNITATGDGVLMAKEIGASMTDIDWVQAHPTVGKDSRILISETVRGV
GAVMVNKDGNRFISELTTRDKASDAILKQPGQFAWIIFDNQLYKKAKMVRGYDHLEMLYKGDTVEQLAKSTGMKVADLAK
TVSDYNGYVASGKDTAFGRADMPLNMTQSPYYAVKVAPGIHHTMGGVAINTTASVLDLQSKPIDGLFAAGEVTGGVHGYN
RLGGNAIADTVVFGRIAGDNAAKHALDK
>P00363 1.3.5.1~~~frdA~~~Fumarate reductase flavoprotein subunit~~~COG1053
MQTFQADLAIVGAGGAGLRAAIAAAQANPNAKIALISKVYPMRSHTVAAEGGSAAVAQDHDSFEYHFHDTVAGGDWLCEQ
DVVDYFVHHCPTEMTQLELWGCPWSRRPDGSVNVRRFGGMKIERTWFAADKTGFHMLHTLFQTSLQFPQIQRFDEHFVLD
ILVDDGHVRGLVAMNMMEGTLVQIRANAVVMATGGAGRVYRYNTNGGIVTGDGMGMALSHGVPLRDMEFVQYHPTGLPGS
GILMTEGCRGEGGILVNKNGYRYLQDYGMGPETPLGEPKNKYMELGPRDKVSQAFWHEWRKGNTISTPRGDVVYLDLRHL
GEKKLHERLPFICELAKAYVGVDPVKEPIPVRPTAHYTMGGIETDQNCETRIKGLFAVGECSSVGLHGANRLGSNSLAEL
VVFGRLAGEQATERAATAGNGNEAAIEAQAAGVEQRLKDLVNQDGGENWAKIRDEMGLAMEEGCGIYRTPELMQKTIDKL
AELQERFKRVRITDTSSVFNTDLLYTIELGHGLNVAECMAHSAMARKESRGAHQRLDEGCTERDDVNFLKHTLAFRDADG
TTRLEYSDVKITTLPPAKRVYGGEADAADKAEAANKKEKANG
>P9WN91 1.3.5.1~~~frdA~~~Fumarate reductase flavoprotein subunit~~~COG1053
MTAQHNIVVIGGGGAGLRAAIAIAETNPHLDVAIVSKVYPMRSHTVSAEGGAAAVTGDDDSLDEHAHDTVSGGDWLCDQD
AVEAFVAEAPKELVQLEHWGCPWSRKPDGRVAVRPFGGMKKLRTWFAADKTGFHLLHTLFQRLLTYSDVMRYDEWFATTL
LVDDGRVCGLVAIELATGRIETILADAVILCTGGCGRVFPFTTNANIKTGDGMALAFRAGAPLKDMEFVQYHPTGLPFTG
ILITEAARAEGGWLLNKDGYRYLQDYDLGKPTPEPRLRSMELGPRDRLSQAFVHEHNKGRTVDTPYGPVVYLDLRHLGAD
LIDAKLPFVRELCRDYQHIDPVVELVPVRPVVHYMMGGVHTDINGATTLPGLYAAGETACVSINGANRLGSNSLPELLVF
GARAGRAAADYAARHQKSDRGPSSAVRAQARTEALRLERELSRHGQGGERIADIRADMQATLESAAGIYRDGPTLTKAVE
EIRVLQERFATAGIDDHSRTFNTELTALLELSGMLDVALAIVESGLRREESRGAHQRTDFPNRDDEHFLAHTLVHRESDG
TLRVGYLPVTITRWPPGERVYGR
>V3TQ67 1.3.5.1~~~frdA~~~Fumarate reductase flavoprotein subunit~~~COG1053
MQTFNADLAIIGAGGAGLRAAIAAAEANPQLKIALISKVYPMRSHTVAAEGGSAAVTQDHDSFDFHFHDTVAGGDWLCEQ
DVVDQFVQSCPREMTQLEQWGCPWSRKPDGSVNVRRFGGMKIERTWFAADKTGFHMLHTLFQTSLKYPQIQRFDEHFVLD
ILVDDGQARGLVAINMMEGTLVQIRANAVIMATGGAGRVYRYNTNGGIVTGDGMGMAFRHGVPLRDMEFVQYHPTGLPGS
GILMTEGCRGEGGIMVNKDGYRYLQDYGMGPETPLGQPKNKYMELGPRDKVSQAFWHEWRAGRTISTPLGDVVYLDLRHL
GEKKLKERLPFICELAQAYVGVDPVKEPIPIRPTAHYTMGGIETDQQCETRIKGLFAAGECSSVGLHGANRLGSNSLAEL
VVFGRIAGEHATQRSLESAPANASALDAQARDVEQRLHTLMKQEGTESWAKIRDEMGISMEEGCGIYRTTELMQKTLDKL
AELKERFKRVKITDHSSVFNTDLLYTIELGHSLDVAQCMAHSAINRKESRGAHQRLDEGCTERDDVNFLKHTLAFYNPEG
APRLEYSDVKITKLPPAKRVYGGEADAQEKSDKEQANG
>Q07WU7 1.3.5.1~~~fccA~~~Fumarate reductase flavoprotein subunit~~~COG1053
MKKMNLAVCIATLMGTAGLMGTAVAADNLAEFHVQNQECDSCHTPDGELSNDSLTYENTQCVSCHGTLAEVAETTKHEHY
NAHASHFPGEVACTSCHSAHEKSMVYCDSCHSFDFNMPYAKKWLRDEPTIAELAKDKSERQAALASAPHDTVDVVVVGSG
GAGFSAAISATDSGAKVILIEKEPVIGGNAKLAAGGMNAAWTDQQKAKKITDSPELMFEDTMKGGQNINDPALVKVLSSH
SKDSVDWMTAMGADLTDVGMMGGASVNRAHRPTGGAGVGAHVVQVLYDNAVKRNIDLRMNTRGIEVLKDDKGTVKGILVK
GMYKGYYWVKADAVILATGGFAKNNERVAKLDPSLKGFISTNQPGAVGDGLDVAENAGGALKDMQYIQAHPTLSVKGGVM
VTEAVRGNGAILVNREGKRFVNEITTRDKASAAILAQTGKSAYLIFDDSVRKSLSKIDKYIGLGVAPTADSLVKLGKMEG
IDGKALTETVARYNSLVSSGKDTDFERPNLPRALNEGNYYAIEVTPGVHHTMGGVMIDTKAEVMNAKKQVIPGLYGAGEV
TGGVHGANRLGGNAISDIITFGRLAGEEAAKYSKKN
>P0C278 1.3.5.1~~~fccA~~~Fumarate reductase flavoprotein subunit~~~
ADNLAEFHVQNQECDSCHTPDGELSNDSLTYENTQCVSCHGTLEEVAETTKHEHYNAHASHFPGEVACTSCHSAHEKSMV
YCDSCHSFDFNMPYAKKWQRDEPTIAELAKDKSERQAALASAPHDTVDVVVVGSGGAGFSAAISATDSGAKVILIEKEPV
IGGNAKLAAGGMNAAWTDQQKAKKITDSPELMFEDTMKGGQNINDPALVKVLSSHSKDSVDWMTAMGADLTDVGMMGGAS
VNRAHRPTGGAGVGAHVVQVLYDNAVKRNIDLRMNTRGIEVLKDDKGTVKGILVKGMYKGYYWVKADAVILATGGFAKNN
ERVAKLDPSLKGFISTNQPGAVGDGLDVAENAGGALKDMQYIQAHPTLSVKGGVMVTEAVRGNGAILVNREGKRFVNEIT
TRDKASAAILAQTGKSAYLIFDDSVRKSLSKIDKYIGLGVAPTADSLVKLGKMEGIDGKALTETVARYNSLVSSGKDTDF
ERPNLPRALNEGNYYAIEVTPGVHHTMGGVMIDTKAEVMNAKKQVIPGLYGAGEVTGGVHGANRLGGNAISDIITFGRLA
GEEAAKYSKKN
>P83223 1.3.5.1~~~~~~Fumarate reductase flavoprotein subunit~~~COG1053
MFTRKIQKTALAMLISGAMAGTAYAAPEVLADFHGEMGGCDSCHVSDKGGVTNDNLTHENGQCVSCHGDLKELAAAAPKD
KVSPHKSHLIGEIACTSCHKGHEKSVAYCDACHSFGFDMPFGGKWERKFVPVDADKAAQDKAIAAGVKETTDVVIIGSGG
AGLAAAVSARDAGAKVILLEKEPIPGGNTKLAAGGMNAAETKPQAKLGIEDKKQIMIDDTMKGGRNINDPELVKVLANNS
SDSIDWLTSMGADMTDVGRMGGASVNRSHRPTGGAGVGAHVAQVLWDNAVKRGTDIRLNSRVVRILEDASGKVTGVLVKG
EYTGYYVIKADAVVIAAGGFAKNNERVSKYDPKLKGFKATNHPGATGDGLDVALQAGAATRDLEYIQAHPTYSPAGGVMI
TEAVRGNGAIVVNREGNRFMNEITTRDKASAAILQQKGESAYLVFDDSIRKSLKAIEGYVHLNIVKEGKTIEELAKQIDV
PAAELAKTVTAYNGFVKSGKDAQFERPDLPRELVVAPFYALEIAPAVHHTMGGLVIDTKAEVKSEKTGKPITGLYAAGEV
TGGVHGANRLGGNAISDIVTYGRIAGASAAKFAKDN
>P17412 1.3.5.1~~~frdA~~~Fumarate reductase flavoprotein subunit~~~COG1053
MKVQYCDSLVIGGGLAGLRAAVATQQKGLSTIVLSLIPVKRSHSAAAQGGMQASLGNSKMSDGDNEDLHFMDTVKGSDWG
CDQKVARMFVNTAPKAIRELAAWGVPWTRIHKGDRMAIINAQKTTITEEDFRHGLIHSRDFGGTKKWRTCYTADATGHTM
LFAVANECLKLGVSIQDRKEAIALIHQDGKCYGAVVRDLVTGDIIAYVAKGTLIATGGYGRIYKNTTNAVVCEGTGTAIA
LETGIAQLGNMEAVQFHPTPLFPSGILLTEGCRGDGGILRDVDGHRFMPDYEPEKKELASRDVVSRRMIEHIRKGKGVQS
PYGQHLWLDISILGRKHIETNLRDVQEICEYFAGIDPAEKWAPVLPMQHYSMGGIRTDYRGEAKLKGLFSAGEAACWDMH
GFNRLGGNSVSEAVVAGMIVGEYFAEHCANTQVDLETKTLEKFVKGQEAYMKSLVESKGTEDVFKIKNRMKDVMDDNVGI
FRDGPHLEKAVKELEELYKKSKNVGIKNKRLHANPELEEAYRVPMMLKVALCVAKGALDRTESRGAHNREDYPKRDDINW
LNRTLASWPNPEQTLPTLEYEALDVNEMEIAPGYRGYGAKGNYIENPLSVKRQEEIDKIQSELEAAGKDRHAIQEALMPY
ELPAKYKARNERLGDK
>P0AC47 1.3.5.1~~~frdB~~~Fumarate reductase iron-sulfur subunit~~~COG0479
MAEMKNLKIEVVRYNPEVDTAPHSAFYEVPYDATTSLLDALGYIKDNLAPDLSYRWSCRMAICGSCGMMVNNVPKLACKT
FLRDYTDGMKVEALANFPIERDLVVDMTHFIESLEAIKPYIIGNSRTADQGTNIQTPAQMAKYHQFSGCINCGLCYAACP
QFGLNPEFIGPAAITLAHRYNEDSRDHGKKERMAQLNSQNGVWSCTFVGYCSEVCPKHVDPAAAIQQGKVESSKDFLIAT
LKPR
>P9WN89 1.3.5.1~~~frdB~~~Fumarate reductase iron-sulfur subunit~~~COG0479
MMDRIVMEVSRYRPEIESAPTFQAYEVPLTREWAVLDGLTYIKDHLDGTLSFRWSCRMGICGSSGMTINGDPKLACATFL
ADYLPGPVRVEPMRNFPVIRDLVVDISDFMAKLPSVKPWLVRHDEPPVEDGEYRQTPAELDAFKQFSMCINCMLCYSACP
VYALDPDFLGPAAIALGQRYNLDSRDQGAADRRDVLAAADGAWACTLVGECSTACPKGVDPAGAIQRYKLTAATHALKKL
LFPWGGG
>P17596 1.3.5.1~~~frdB~~~Fumarate reductase iron-sulfur subunit~~~COG0479
MGRMLTIRVFKYDPQSAVSKPHFQEYKIEEAPSMTIFIVLNMIRETYDPDLNFDFVCRAGICGSCGMMINGRPSLACRTL
TKDFEDGVITLLPLPAFKLIKDLSVDTGNWFNGMSQRVESWIHAQKEHDISKLEERIEPEVAQEVFELDRCIECGCCIAA
CGTKIMREDFVGAAGLNRVVRFMIDPHDERTDEDYYELIGDDDGVFGCMTLLACHDVCPKNLPLQSKIAYLRRKMVSVN
>P0A8Q0 ~~~frdC~~~Fumarate reductase subunit C~~~COG3029
MTTKRKPYVRPMTSTWWKKLPFYRFYMLREGTAVPAVWFSIELIFGLFALKNGPEAWAGFVDFLQNPVIVIINLITLAAA
LLHTKTWFELAPKAANIIVKDEKMGPEPIIKSLWAVTVVATIVILFVALYW
>P17413 ~~~frdC~~~Fumarate reductase cytochrome b subunit~~~
MTNESILESYSGVTPERKKSRMPAKLDWWQSATGLFLGLFMIGHMFFVSTILLGDNVMLWVTKKFELDFIFEGGKPIVVS
FLAAFVFAVFIAHAFLAMRKFPINYRQYLTFKTHKDLMRHGDTTLWWIQAMTGFAMFFLGSVHLYIMMTQPQTIGPVSSS
FRMVSEWMWPLYLVLLFAVELHGSVGLYRLAVKWGWFDGETPDKTRANLKKLKTLMSAFLIVLGLLTFGAYVKKGLEQTD
PNIDYKYFDYKRTHHR
>P0A8Q3 ~~~frdD~~~Fumarate reductase subunit D~~~COG3080
MINPNPKRSDEPVFWGLFGAGGMWSAIIAPVMILLVGILLPLGLFPGDALSYERVLAFAQSFIGRVFLFLMIVLPLWCGL
HRMHHAMHDLKIHVPAGKWVFYGLAAILTVVTLIGVVTI
>B5XRB0 1.3.1.6~~~~~~NADH:fumarate oxidoreductase~~~
MTSNERILQPFTLPNGTELKNRLLMAPMTTCTGYFDGTVTSELVEYYRARAGSIGTIIVECCFIDDYGLAFPGAIGIDND
EKIAGLAKIAEAIKAQGSKAILQIYHGGRMVDPQLIGGRQPVAPSAIAAPREGAAMPRALSGEEVEGMIAKFGDGVRRAI
LAGFDGVEIHGANTYLIQQFYSPNSNQRDDEWGGSRDNRARFPLAVLDITHKMARQYADDAFIIGYRFSPEEMEVPGIRF
DDTMYLLEKLAARGVDYLHFSVGATLRPSIVDTSDPTPLIEKYCAMRSETLAQVPVMGVGGVVNVADAELGLDHGYDLIA
VGRACIAYPDWAARIAAGEELELFIDSTQREALHIPEPLWRFSLVEAMIRDMSMGDAKFKPGMFVETVQDDANELVINVS
LENDHIADIELAASPVQTVEFTTSFEEIRERILTANTPHVDAISGATSQSEAVKKAVAKAMLKSSKALAAEEGGNDAAPK
SYDVVVVGSGGAGLAAAIQAHDEGASVLIVEKMPTIGGNTIKASAGMNAAETRFQRVKGIQDSKELFYQETLKGGHNKNN
PQLLRRFVENAPQAIEWLADRGIMLNDITTTGGMSIDRTHRPRDGSAVGGYLISGLVRNITKRGIDVLLDTSVEEILMSG
DEVSGVRLVNDEKEVIEVQTKSIVVATGGFSANSAMVVKYRPDLDGFVTTNHKGATGSGIALLERIGAGTVDMGEIQIHP
TVEQQTSYLISESIRGGGAILVNQQGNRFFNEMETRDKVSAAIIALPEHYAYIVFDEHVRAKNKAADEYIAKGFVTSASS
PRELAEKLGMDYHAFLATLECYNGAVEKQHDEQFGRTTALRAPINEGPFHAIRIAPGVHHTMGGVTINTDGEVLNVDQQP
IRGAYAAGEVVGGIHGGNRIGGNAVADIIIFGTLAGHQAAKRARG
>M4YFG7 1.5.1.37~~~~~~Flavin reductase~~~
MTKVAAEIVRSAIDPQWFRAVLGQYPTGVCAVTAMDPDGKMSGMAVGSFTSVSLNPPLVAFLPDRSSTSWPKIERAGKFC
VNVLSDQQLGVCKRFASKDEDKFSGLVYRLSDNGSPIIEGVVAWIDCDLHSVQEAGDHYIVIGSVRELQVESEDSALLFY
RGGYGGFAAI
>P0AEN1 1.5.1.41~~~fre~~~NAD(P)H-flavin reductase~~~COG0543
MTTLSCKVTSVEAITDTVYRVRIVPDAAFSFRAGQYLMVVMDERDKRPFSMASTPDEKGFIELHIGASEINLYAKAVMDR
ILKDHQIVVDIPHGEAWLRDDEERPMILIAGGTGFSYARSILLTALARNPNRDITIYWGGREEQHLYDLCELEALSLKHP
GLQVVPVVEQPEAGWRGRTGTVLTAVLQDHGTLAEHDIYIAGRFEMAKIARDLFCSERNAREDRLFGDAFAFI
>Q9L6L9 1.5.1.41~~~fre~~~NAD(P)H-flavin reductase~~~
MTTLSCKVTSVEAITDTVYRVRLVPDAAFSFRAGQYLMVVMDERDKRPFSMASTPDEKGFIELHIGASELNLYAMAVMDR
ILKDREIVVDIPHGDAWLRDDEERPLILIAGGTGFSYVRSILLTALARNPARDVTIYWGGREEKHLYDLSELEALSVNHP
NLRIEPVVEQPEEGWRGRTGTVLTAVLQDYGTLAGHDIYIAGRFEMAKIARDLFCHERNAREDRLFGDAFAFI
>P43127 1.5.1.-~~~fre~~~NAD(P)H-flavin reductase~~~
MTIQCKVKSIQPLACNTYQILLHPESPVPFKAGQYLMVVMGEKDKRPFSIASSPCRHEGELELHIGAAEHNAYALEVVEA
MQAALETDGHIEIDAPHGDAWVQEESERPLLLIAGGTGFSYVRSILDHCVAQNKTNPIYLYWGARDNCQLYAKEELVEIA
DKFANVHFVPVVEEAPADWQGKVGNVLQAVSEDFESLENYDIYIAGRFEMAGAAREQFTQNKKAKSERMFADAYAFI
>P45539 ~~~frlA~~~Probable fructoselysine/psicoselysine transporter FrlA~~~COG0531
MGSQELQRKLGFWAVLAIAVGTTVGSGIFVSVGEVAKAAGTPWLTVLAFVIGGLIVIPQMCVYAELSTAYPENGADYVYL
KNAGSRPLAFLSGWASFWANDAPSLSIMALAIVSNLGFLTPIDPLLGKFIAAGLIIAFMLLHLRSVEGGAAFQTLITIAK
IIPFTIVIGLGIFWFKAENFAAPTTTAIGATGSFMALLAGISATSWSYTGMASICYMTGEIKNPGKTMPRALIGSCLLVL
VLYTLLALVISGLMPFDKLANSETPISDALTWIPALGSTAGIFVAITAMIVILGSLSSCVMYQPRLEYAMAKDNLFFKCF
GHVHPKYNTPDVSIILQGALGIFFIFVSDLTSLLGYFTLVMCFKNTLTFGSIIWCRKRDDYKPLWRTPAFGLMTTLAIAS
SLILVASTFVWAPIPGLICAVIVIATGLPAYAFWAKRSRQLNALS
>O32157 3.5.-.-~~~frlB~~~Fructosamine deglycase FrlB~~~COG2222
MSQATAKVNREVQAFLQDLKGKTIDHVFFVACGGSSAIMYPSKYVFDRESKSINSDLYSANEFIQRNPVQLGEKSLVILC
SHSGNTPETVKAAAFARGKGALTIAMTFKPESPLAQEAQYVAQYDWGDEALAINTNYGVLYQIVFGTLQVLENNTKFEQA
IEGLDQLQAVYEKALKQEADNAKQFAKAHEKESIIYTMASGANYGVAYSYSICILMEMQWIHSHAIHAGEYFHGPFEIID
ESVPFIILLGLDETRPLEERALTFSKKYGKKLTVLDAASYDFTAIDDSVKGYLAPLVLNRVLRSYADELAEERNHPLSHR
RYMWKVEY
>P0AC00 3.5.-.-~~~frlB~~~Fructoselysine 6-phosphate deglycase~~~COG2222
MLDIDKSTVDFLVTENMVQEVEKVLSHDVPLVHAIVEEMVKRDIDRIYFVACGSPLNAAQTAKHLADRFSDLQVYAISGW
EFCDNTPYRLDDRCAVIGVSDYGKTEEVIKALELGRACGALTAAFTKRADSPITSAAEFSIDYQADCIWEIHLLLCYSVV
LEMITRLAPNAEIGKIKNDLKQLPNALGHLVRTWEEKGRQLGELASQWPMIYTVAAGPLRPLGYKEGIVTLMEFTWTHGC
VIESGEFRHGPLEIVEPGVPFLFLLGNDESRHTTERAINFVKQRTDNVIVIDYAEISQGLHPWLAPFLMFVPMEWLCYYL
SIYKDHNPDERRYYGGLVEY
>P45541 5.1.3.41~~~frlC~~~Fructoselysine 3-epimerase~~~COG1082
MKTGMFTCGHQRLPIEHAFRDASELGYDGIEIWGGRPHAFAPDLKAGGIKQIKALAQTYQMPIIGYTPETNGYPYNMMLG
DEHMRRESLDMIKLAMDMAKEMNAGYTLISAAHAGYLTPPNVIWGRLAENLSELCEYAENIGMDLILEPLTPYESNVVCN
ANDVLHALALVPSPRLFSMVDICAPYVQAEPVMSYFDKLGDKLRHLHIVDSDGASDTHYIPGEGKMPLRELMRDIIERGY
EGYCTVELVTMYMNEPRLYARQALERFRALLPEDER
>O32153 2.7.1.-~~~frlD~~~Fructosamine kinase FrlD~~~COG0524
MKLIAVGDNVVDYYQDQETFYPGGNALNVAVLAKRLGHESSYIGIVGNDEAAAHLLNVLKLEQVNADYIRQAHGENGMAI
VTLDEQGDRIFVRSNKGGIQSRLRLAFQEKDVSFISGHDLLHTSVYSRLENDLPQLCGLVPVSFDFSTNREDDYLRRVCP
YVTYAFFSGSDLSESECGELAKTAHGYGAKMVCMTRGGQGAILSAGDRVYHQPIVEADIIDTLGAGDSFIAGFLTAFCVK
QDITYALRQAAETAAKTCGVYGAFGYGYPYRLEDGGSSEKTRIL
>P45543 2.7.1.218~~~frlD~~~Fructoselysine 6-kinase~~~COG0524
MKTLATIGDNCVDIYPQLNKAFSGGNAVNVAVYCTRYGIQPGCITWVGDDDYGTKLKQDLARMGVDISHVHTKHGVTAQT
QVELHDNDRVFGDYTEGVMADFALSEEDYAWLAQYDIVHAAIWGHAEDAFPQLHAAGKLTAFDFSDKWDSPLWQTLVPHL
DFAFASAPQEDETLRLKMKAIVARGAGTVIVTLGENGSIAWDGAQFWRQAPEPVTVIDTMGAGDSFIAGFLCGWSAGMTL
PQAIAQGTACAAKTIQYHGAW
>P45544 ~~~frlR~~~Probable fructoselysine utilization operon transcriptional repressor~~~COG2188
MSATDRYSHQLLYATVRQRLLDDIAQGVYQAGQQIPTENELCTQYNVSRITIRKAISDLVADGVLIRWQGKGTFVQSQKV
ENALLTVSGFTDFGVSQGKATKEKVIEQERVSAAPFCEKLNIPGNSEVFHLCRVMYLDKEPLFIDSSWIPLSRYPDFDEI
YVEGSSTYQLFQERFDTRVVSDKKTIDIFAATRPQAKWLKCELGEPLFRISKIAFDQNDKPVHVSELFCRANRITLTIDN
KRH
>P25437 1.1.1.284~~~frmA~~~S-(hydroxymethyl)glutathione dehydrogenase~~~COG1062
MKSRAAVAFAPGKPLEIVEIDVAPPKKGEVLIKVTHTGVCHTDAFTLSGDDPEGVFPVVLGHEGAGVVVEVGEGVTSVKP
GDHVIPLYTAECGECEFCRSGKTNLCVAVRETQGKGLMPDGTTRFSYNGQPLYHYMGCSTFSEYTVVAEVSLAKINPEAN
HEHVCLLGCGVTTGIGAVHNTAKVQPGDSVAVFGLGAIGLAVVQGARQAKAGRIIAIDTNPKKFDLARRFGATDCINPND
YDKPIKDVLLDINKWGIDHTFECIGNVNVMRAALESAHRGWGQSVIIGVAVAGQEISTRPFQLVTGRVWKGSAFGGVKGR
SQLPGMVEDAMKGDIDLEPFVTHTMSLDEINDAFDLMHEGKSIRTVIRY
>P0AAP3 ~~~frmR~~~Transcriptional repressor FrmR~~~COG1937
MPSTPEEKKKVLTRVRRIRGQIDALERSLEGDAECRAILQQIAAVRGAANGLMAEVLESHIRETFDRNDCYSREVSQSVD
DTIELVRAYLK
>P55127 ~~~frpC~~~Iron-regulated protein FrpC~~~
MNEGEVVLTPEQIQTLRGYASRGDTYGGWRYLANLGDRYADNAAAIVGKDTNLNGLNLWMKKGVENLWDDTVGKKTRLEK
FDRVALQHFSQYVDLINKNNGRLPNTSEIERSYYKAVTYHGVSSSAAIDLVINRSLPDMADGYWALGLGIEAERIHNEQA
VNNPNGSERDNRKQLISALDKGFDGSFKEKHFTFLQSVMMDLTKLGVEYTIDGWQKIGGWGNGIINDLYKSVVKREWTGI
FEIVNNNIKQGNEAFKNEINSLVHDMKAAGKEFGDDLNTQWNNLTQAAEIIYNDIVDNTSQGIEKGVKAIKELSEKMKNA
ASDLADGSAEKAKQVVEDLAQAAKEAYENAKSTAEKAAQAAREFFKGLPSFKDLAEKFRDLFPNPEGWIDDGHQCFAPWV
KETKKRNGKYHVYDPLALDLDGDGIETVATKGFSGSLFDHNRDGIRTATGWVAADDGLLVRDLNGNGIIDNGAELFGDNT
KLADGSFAKHGYAALAELDSNGDNIINAADAAFQTLRVWQDLNQDGISQANELRTLEELGIQSLDLAYKDVNKNLGNGNT
LAQQGSYTKTDGTTAKMGDLLLAADNLHSRFKDKVELTAEQAKAANLAGIGRLRDLREAAALSGDLANMLKAYSAAETKE
AQLALLDNLIHKWAETDSNWGKKSPMRLSTDWTQTANEGIALTPSQVAQLKKNALVSLSDKAKAAIDAARDRIAVLDAYT
GQDSSTLYYMSEEDALNIVKVTNDTYDHLAKNIYQNLLFQTRLQPYLNQISFKMENDTFTLDFSGLVQAFNHVKETNPQK
AFVDLAEMLAYGELRSWYEGRRLMADYVEEAKKAGKFEDYQKVLGQETVALLAKTSGTQADDILQNVGFGHNKNVSLYGN
DGNDTLIGGAGNDYLEGGSGSDTYVFGKGFGQDTVYNYDYATGRKDIIRFTDGITADMLTFTREGNHLLIKAKDDSGQVT
VQSYFQNDGSGAYRIDEIHFDNGKVLDVATVKELVQQSTDGSDRLYAYQSGNTLNGGLGDDYLYGADGDDLLNGDAGNDS
IYSGNGNDTLNGGEGNDALYGYNGNDALNGGEGNDHLNGEDGNDTLIGGAGNDYLEGGSGSDTYVFGKGFGQDTVYNYDY
ATGRKDIIRFTDGITADMLTFTREGNHLLIKAKDGSGQVTVQSYFQNDGSGAYRIDEIHFDNGKVLDVATVKELVQQSTD
GSDRLYAYQSGNTLNGGLGDDYLYGADGDDLLNGDAGNDSIYSGNGNDTLDGGEGNDALYGYNGNDALNGGEGNDHLNGE
DGNDTLIGGAGNDYLEGGSGSDTYVFGKGFGQDTVYNYDYATGRKDIIRFTDGITADMLTFTREGNHLLIKAKDDSGQVT
VQSYFQNDGSGAYRIDEIHFDNGKVLDVATVKELVQQSTDGSDRLYAYQSGSTLNGGLGDDYLYGADGDDLLNGDAGNDS
IYSGNGNDTLDGGEGNDALYGYNGNDALNGGEGNDHLNGEDGNDTLIGGAGNDYLEGGSGSDTYVFGKGFGQDTVYNYDY
ATGRKDIIRFTDGITADMLTFTREGNHLLIKAKDGSGQVTVQSYFQNDGSGAYRIDEIHFDNGKVLDVATVKKLVQQSTD
GSDRLYAYQSGNTLNGGLGDDYLYGADGDDLLNGDAGNDSIYSGNGNDTLNGGEGNDALYGYNGNDVLNGGEGNDHLNGE
DGNDTLIGGAGNDYLEGGSGSDTYVFGKGFGQDTVYNYHVDKNSDTMHFKGFKAADVHFIRSGSDLVLSASEQDNVRISG
FFYGENHRVDTFVFDDAAISNPDFAKYINAGNNLVQSMSVFGSNTAATGGNVDANTQSVQQPLLVTPSA
>P74103 ~~~frp~~~Fluorescence recovery protein~~~
MLQTAEAPWSQAETQSAHALFRKAYQRELDGLLATVQAQASQITQIDDLWKLHDFLSAKRHEIDGKYDDRQSVIIFVFAQ
LLKEGLVQAEELTFLAADKQSKIKALARL
>Q56691 1.5.1.38~~~frp~~~NADPH-flavin oxidoreductase~~~
MNNTIETILAHRSIRKFTAVPITDEQRQTIIQAGLAASSSSMLQVVSIVRVTDSEKRNELAQFAGNQAYVESAAEFLVFC
IDYQRHATINPDVQADFTELTLIGAVDSGIMAQNCLLAAESMGLGGVYIGGLRNSAAQVDELLGLPENSAVLFGMCLGHP
DQNPEVKPRLPAHVVVHENQYQELNLDDIQSYDQTMQAYYASRTSNQKLSTWSQEVTGKLAGESRPHILPYLNSKGLAKR
>P04335 3.1.1.1~~~frsA~~~Esterase FrsA~~~COG1073
MTQANLSETLFKPRFKHPETSTLVRRFNHGAQPPVQSALDGKTIPHWYRMINRLMWIWRGIDPREILDVQARIVMSDAER
TDDDLYDTVIGYRGGNWIYEWATQAMVWQQKACAEDDPQLSGRHWLHAATLYNIAAYPHLKGDDLAEQAQALSNRAYEEA
AQRLPGTMRQMEFTVPGGAPITGFLHMPKGDGPFPTVLMCGGLDAMQTDYYSLYERYFAPRGIAMLTIDMPSVGFSSKWK
LTQDSSLLHQHVLKALPNVPWVDHTRVAAFGFRFGANVAVRLAYLESPRLKAVACLGPVVHTLLSDFKCQQQVPEMYLDV
LASRLGMHDASDEALRVELNRYSLKVQGLLGRRCPTPMLSGYWKNDPFSPEEDSRLITSSSADGKLLEIPFNPVYRNFDK
GLQEITDWIEKRLC
>Q8DF91 3.1.1.1~~~frsA~~~Esterase FrsA~~~
MSEEVSKNLSETLFVKHKQAKETSALTQYMPTSQSLLDEIKEKNGFSWYRNLRRLQWVWQGVDPIEQEQVLARIASSKHS
RTDEQWLDTVMGYHSGNWAYEWTRLGMEHQKRAGEMTNEAASEALFSASLCYSIAGYPHLKSDNLAIQAQVLANSAYLEA
AKKSKYIIKQLEIPFEKGKITAHLHLTNTDKPHPVVIVSAGLDSLQTDMWRLFRDHLAKHDIAMLTVDMPSVGYSSKYPL
TEDYSRLHQAVLNELFSIPYVDHHRVGLIGFRFGGNAMVRLSFLEQEKIKACVILGAPIHDIFASPQKLQQMPKMYLDVL
ASRLGKSVVDIYSLSGQMAAWSLKVQGFLSSRKTKVPILAMSLEGDPVSPYSDNQMVAFFSTYGKAKKISSKTITQGYEQ
SLDLAIKWLEDELLR
>Q03174 3.2.1.80~~~fruA~~~Fructan beta-fructosidase~~~COG1621
MEEETVCKNWFMRKSGKSWIFGCAVFFVLGLATALPVAAEEISQTTAADTAVTEVRTEDSSQTSSQETAVTETTQSEGTA
SKQLTTPAVADQTTEPTDNEPISSSDGASSPYQVTDTTEPQQTLTPADSEPQAKADVQQAAAPKKEEINPVTNLEDMSHD
TNGTWEVREDGIHSNAIGKGDSFLYSQSSGKNFVYATDVTFKQNSGAAALVFRSNNDSNNKNMYAVNVDIGGHKAKFWRW
VDNKDIQLIDERDVVPTADNRYTLKVVAVNNWISYYVNDILMASTGDYVLQKADKGQNTVIPEGHFGLLNWNGDMVFQNT
KFALLDDTTAPLIDNITVRSDRGNVEKQGQFFSEEPLHIQYVSNDASQVSLDIAKHNPAATVTVEDKTGRVYTDPSHLPV
NVGANYFTVKSTVIDSFGRTVTLTYRINVHRRQNDEVYYNELYRDQYHYSVKDGWANDPNGLVYYNGVYHLFHQFYDDTK
WGPMHWAHATSTDLIHWKEEPIAFYPDSNGYMFSGCVVVDEHNSSGLFKTAKGGLVAIITANGNGQRMELAYSEDEGKTW
QKYDRIVADWSNDPLQNQDFRDPKVFHWNNQWFMVLAGGPLRIYSSNNLKDWKVESTYPDLHTECPDMYPIVANDGVLKW
VLSRGGRFYKVGDFKQVDGKWTFIADDAYKDKDQVMNFGKDSYAAMTYYVHDFGTETRPTIPKLTEVNWMNTWEDYCNLV
ADTVGQDFNGTFNLNLDLGLINENGQYILTQTPVKAYDSLRDVNTALHFKDVTVDANNTLLKDFKGDSYEIVSHFRPDEK
TTKVGFNLRVGNGQATKVIYDLQTETLSIDRSQSGTILSAAFAKVNSQHVTKNADGSIDLHIYVDRASVEVFSKNNTVAG
ANQIFPNPEAVGASIIVEGGKAQADISVYQMKTIWTDKKDTAKPVAMNTTTAKELALQVGQSQDLQVYLAPASVRQDVEW
TISDPSLVRTSQKGNVLHLTAVKKGKLTITAISKENPSLSKTFTISITLNNFKTNLKGLQSVTGKWYVDDETLYDSNTSS
NDYYMASQKPGFKEYDYDIDLKYQRGLINLFVASGNIDPSQAYSVQFGDSETVRLYRFAGDTIAEANMGKRINDDQYHHI
KVTKTKNSIIISVDGQEVMSHNFDQVDSYFNDAYVGLGLWDGAVEFQNFFVTDHATTPKPDSDPTPQPDAPEALAQEREL
IDPATGVRVILQKGELASIVRVKVSHIETNDAHTPAVLNAKDYDLFNITPIDKNEKVVAITKPATVLLPIDAGKVVDKVV
YLPNTDKEENLPFTIVSLTDSNGKKQSYVRFTAEHFSEYGLVYQAENQTNLKSKEKQDNVAISYPLNLEQEVKVSSISRK
YAANKTADVNSVQQTEPSVMSSSPKATLPDTGDHKTDLSQLGVLAMIGSFLVEIAGYFKKRKD
>Q8G848 ~~~fruE~~~Fructose import binding protein FruE~~~
MKNWKKAIALVASAAALVSVAACGSSNAGGSSDSGKKTVGFVAVGPEGGFRTANEKDIQKAFEDAGFDLTYSPTQNNDQQ
KQIQAFNKFVNDEVDAIILSSTEDSGWDDSLKKAAEAEIPVFTVDRNVDVKDAEAKKAIVAHIGPSNVWCGEQAAEFVNK
NFPDGANGFILEGPAGLSVVKDRGTGWGNKVASNVKVLESQSANWSTDEAKTVTAGLLDKYKSDNPQFIFAQNDEMGLGA
AQAVDAAGLKGKVKIITIDGTKNALQALVDGDLSYVIEYNPIFGKETAQAVKDYLDGKTVEKDIEIESKTFDAASAKEAL
DNNTRAY
>Q8G846 ~~~fruF~~~Fructose import permease protein FruF~~~
MAEKAKAEGNNFVKKLLSSNLTWSIVAFILLVIICTIFQHDFLALSWNSNTGGLAGPLITMLQESARYLMIATGMTLVIS
TAGIDLSVGSVMAVAGAAAMQTLSNGMNVWLSILIALAVGLAIGCVNGALVSFLGLQPFITTLIMMLAGRGMAKVITSGE
NTDASAVAGNEPLKWFANGFILGIPANFVIAVIIVILVGLLCRKTAMGMMIEAVGINQEASRMTGIKPKKILFLVYAISG
FLAAIAGLFATASVMRVDVVKTGQDLEMYAILAVVIGGTSLLGGKFSLAGSAVGAVIIAMIRKTIITLGVNAEATPAFFA
VVVIVICVMQAPKIHNLSANMKRKRALKAQAKAVAA
>Q8G845 ~~~fruG~~~Fructose import permease protein FruG~~~
MTTATANKVKAPKKGFKLDRQMIPTLAAVVIFILMIIMGQALFGTYIRLGFISSLFIDHAYLIILAVAMTLPILTGGIDL
SVGAIVAITAVVGLKLANAGVPAFLVMIIMLLIGAVFGLLAGTLIEEFNMQPFIATLSTMFLARGLASIISTDSLTFPQG
NDFSFISNVIKIIDNPKISNDLSFNVGVIIALVVVVFGYVFLHHTRTGRTIYAIGGSRSSAELMGLPVKRTQYIIYLTSA
TLAALASIVYTANIGSAKNTVGVGWELDAVASVVIGGTIITGGFGYVLGSVLGSLVRSILDPLTSDFGVPAEWTTIVIGL
MILVFVVLQRAVMAVGGDKK
>Q8G847 7.5.2.-~~~fruK~~~Fructose import ATP-binding protein FruK~~~
MTDKNPIVVMKGITIEFPGVKALDGVDLTLYPGEVHALMGENGAGKSTMIKALTGVYKINAGSIMVDGKPQQFNGTLDAQ
NAGIATVYQEVNLCTNLSVGENVMLGHEKRGPFGIDWKKTHEAAKKYLAQMGLESIDPHTPLSSISIAMQQLVAIARAMV
INAKVLILDEPTSSLDANEVRDLFAIMRKVRDSGVAILFVSHFLDQIYEITDRLTILRNGQFIKEVMTKDTPRDELIGMM
IGKSAAELSQIGAKKARREITPGEKPIVDVKGLGKKGTINPVDVDIYKGEVVGFAGLLGSGRTELGRLLYGADKPDSGTY
TLNGKKVNISDPYTALKNKIAYSTENRRDEGIIGDLTVRQNILIALQATRGMFKPIPKKEADAIVDKYMKELNVRPADPD
RPVKNLSGGNQQKVLIGRWLATHPELLILDEPTRGIDIGAKAEIQQVVLDLASQGMGVVFISSELEEVVRLSDDIEVLKD
RHKIAEIENDDTVSQATIVETIANTNVNTGKEA
>Q9KM69 ~~~fruR~~~Fructose operon regulatory protein~~~COG1609
MTLDEIAKLAGVSKTTASYVINGKAQKYRISEKTQHKVMAVVEQYNFRPDHAASALRAGNSRSFGLIIPDLENTSYARLA
KLLEQNSRQAGYQILIACSDDDPQIEMAAAEALVSRRIDALFVASGIPSASEYYLKLQQSGTPVIAIDRALDDEYFSCVI
SEDFGAAFELTRSVLTQDVHSVGLVGALPELNVSREREQGFAMAVKQRGLPTTLGYGEHFNREEGRKVFAKWVANDQLPD
AVVATSYTLLEGILDVLLEQPELMQKVRLATFGDNRLLDFLPIRVNSLPQQFELIADSALALALNASAKRYQTGIELIPR
QLKVRT
>P43500 ~~~frzCD~~~Frizzy aggregation protein FrzCD~~~
MSLDTPNEKPAGKARARKAPASKAGATNAASTSSSTKAITDTLLTVLSGNLQARVPKELVGESGVELAHLLNQVLDQFAA
SEHRKHVAAQEIDQALDALIGLVREGDLSRWNTTTEDPQLGPLLEGFGKVIETLRTFVREINEAALRLSSSANQVLAAST
QHETSSTEQAAAIHETTATMEELKHASAQIAENAGSVARVAEETLGAARAGRGAIGEFIQAMQQIRSDGVAVADSIAKLS
KRVERIGTVVEVIDEIADRSDLLALNAALEGSRAGEAGKGFSIVAAEMRRLAENVLDSTKEIKNLITEIREATAAAAGAA
EASKSATESGEKLGAVAAQAVEGILAGVQETSDAARVINLATQQQRTATEQVVASMAEIEDVTRQTTQASKQATGAAAEL
TQLAGRLAELIKRFKAD
>P18769 2.7.13.3~~~frzE~~~Gliding motility regulatory protein~~~
MDTEALKKSLLKKFQEVTADRLQKIQLGVLDLEKETADQAAEDVARELHTMKGEARMLGLAAIGQLAHAAEDVLRAEREG
KTATEVATDVLLRACDVLSDLNEDLSGANTGNPASEEMVRMLAEVSGQTPPAIAGARPVAPPPAPPPAPVAAPVVTPAAV
AAPPAPVQAPVAPPPTQAPVAEPGAHAAAAAPHPAAAHGRDEEAPSAAKSAVADRSIRVNVEVLDALGLLAGDLLVESAR
GRLRSSETEALFERFSRLGDRFLRLAEEIDISNEVREQLDRVESDLHMLRDDAFRFVRRNDDGINTLHGNLAKMADHVAE
ARLVPLSTVFDAFPRAVREMSRTQGKEVDLVIENADIGVDRSMLGDVRDALVHLLRNSVDHGVESPDTRQQLGKPLNGRI
RIRVRVDGDMLHIEVEDDGRGIDPERLRQAAISKRLINAVQAAALSEREAIELIFRPGFSTRDQVSELSGRGVGMDVVKR
KVETLGGSVGVSSRIGRGSTITLRLPQSLALMKVLLVRLGDDVYGMPAADVEAVMRVKPDDRLEIFGTLAVRHRGKPTAL
VALGPLLGLNGGNRFDKPPAVVVRHGEDHAALVVDGFVDEREVAVKPCGGEFLKAAPFIAGTAALEDGRIAVLLHVPDIM
AEVRRMARPVTQAPAAKRLRVLLVDDSPIARATEGALVKALGHSVEEAQDGEEAYVKVQNNTYDLILTDVQMPKLDGFSL
ARRLKSTPAVARIPVIILSSLASPEDKRRGLDAGADAYLVKGELGVEVLAQAIDRLT
>P78055 4.1.2.-~~~fsaA~~~Fructose-6-phosphate aldolase 1~~~COG0176
MELYLDTSDVVAVKALSRIFPLAGVTTNPSIIAAGKKPLDVVLPQLHEAMGGQGRLFAQVMATTAEGMVNDALKLRSIIA
DIVVKVPVTAEGLAAIKMLKAEGIPTLGTAVYGAAQGLLSALAGAEYVAPYVNRIDAQGGSGIQTVTDLHQLLKMHAPQA
KVLAASFKTPRQALDCLLAGCESITLPLDVAQQMISYPAVDAAVAKFEQDWQGAFGRTSI
>P32669 4.1.2.-~~~fsaB~~~Fructose-6-phosphate aldolase 2~~~COG0176
MELYLDTANVAEVERLARIFPIAGVTTNPSIIAASKESIWEVLPRLQKAIGDEGILFAQTMSRDAQGMVEEAKRLRDAIP
GIVVKIPVTSEGLAAIKILKKEGITTLGTAVYSAAQGLLAALAGAKYVAPYVNRVDAQGGDGIRTVQELQTLLEMHAPES
MVLAASFKTPRQALDCLLAGCESITLPLDVAQQMLNTPAVESAIEKFEHDWNAAFGTTHL
>Q10725 4.2.1.-~~~psdht~~~Phenylserine dehydratase~~~
MTQLDTTTLPDLSAIAGLRARLKQWVRTTPVFDKTDFEPVPGTAVNFKLELLQASGTFKARGAFSNLLALDDDQRAAGVT
CVSAGNHAVGVAYAAMRLGIPAKVVMIKTASPARVALCRQYGAEVVLAENGQTAFDTVHRIESEEGRFFVHPFNGYRTVL
GTATLGHEWLEQAGALDAVIVPIGGGGLMAGVSTAVKLLAPQCQVIGVEPEGADAMHRSFETGGPVKMGSMQSIADSLMA
PHTEQYSYELCRRNVDRLVKVSDDELRAAMRLLFDQLKLATEPACATATAALVGGLKAELAGKRVGVLLCGTNTDAATFA
RHLGLG
>P52067 ~~~fsr~~~Fosmidomycin resistance protein~~~COG2223
MAMSEQPQPVAGAAASTTKARTSFGILGAISLSHLLNDMIQSLILAIYPLLQSEFSLTFMQIGMITLTFQLASSLLQPVV
GYWTDKYPMPWSLPIGMCFTLSGLVLLALAGSFGAVLLAAALVGTGSSVFHPESSRVARMASGGRHGLAQSIFQVGGNFG
SSLGPLLAAVIIAPYGKGNVAWFVLAALLAIVVLAQISRWYSAQHRMNKGKPKATIINPLPRNKVVLAVSILLILIFSKY
FYMASISSYYTFYLMQKFGLSIQNAQLHLFAFLFAVAAGTVIGGPVGDKIGRKYVIWGSILGVAPFTLILPYASLHWTGV
LTVIIGFILASAFSAILVYAQELLPGRIGMVSGLFFGFAFGMGGLGAAVLGLIADHTSIELVYKICAFLPLLGMLTIFLP
DNRHKD
>Q8YDL7 ~~~ftcR~~~Flagellar transcriptional regulator FtcR~~~COG0745
MIVVVDDRDMVTEGYSSWFGREGITTTGFTPTDFDEWVESVPEQDIMAIEAFLIGECADQHRLPARIRERCKAPVIAVND
RPSLEHTLELFQSGVDDVVRKPVHVREILARINAIRRRAGASATSGADGTQLGPIRVFSDGRDPQINGIDFPLPRRERRI
LEYLIANRGRRLNKVQIFSAIYGIFDSEVEENVVESHISKLRKKLRGQLGFDPIDSKRFLGYCINIE
>Q5XC12 6.3.4.3~~~fhs1~~~Formate--tetrahydrofolate ligase 1~~~
MKSDIEIAQSVALQPITDIVKKVGIDGDDIELYGKYKAKLSFEKMKAVEANEPGKLILVTAINPTPAGEGKSTMSIGLAD
ALNQMGKKTMLALREPSLGPVMGIKGGAAGGGYAQVLPMEDINLHFTGDMHAITTANNALSALIDNHLQQGNDLGIDPRR
IIWKRVLDLNDRALRQVIVGLGSPVNGVPREDGFDITVASEIMAILCLATDLKDLKKRLADIVVAYTYDRKPVYVRDLKV
EGALTLILKDAIKPNLVQTIYGTPALIHGGPFANIAHGCNSVLATSTALRLADYTVTEAGFGADLGAEKFLNIKVPNLPK
APDAIVIVATLRALKMHGGVAKSDLAAENCEAVRLGFANLKRHVENMRQFKVPVVVAINEFVADTEAEIATLKALCEEIK
VPVELASVWANGAEGGLALAKTVVRVIDQEAADYKRLYSDEDTLEEKVINIVTQIYGGKAVQFGPKAKTQLKQFAEFGWD
KLPVCMAKTQYSFSDNPSLLGAPTDFDITIREFVPKTGAGFIVGLTGDVMTMPGLPKVPAAMAMDVAENGTALGLF
>B7L0A5 6.3.4.3~~~fhs~~~Formate--tetrahydrofolate ligase~~~
MPSDIEIARAATLKPIAQVAEKLGIPDEALHNYGKHIAKIDHDFIASLEGKPEGKLVLVTAISPTPAGEGKTTTTVGLGD
ALNRIGKRAVMCLREPSLGPCFGMKGGAAGGGKAQVVPMEQINLHFTGDFHAITSAHSLAAALIDNHIYWANELNIDVRR
IHWRRVVDMNDRALRAINQSLGGVANGFPREDGFDITVASEVMAVFCLAKNLADLEERLGRIVIAETRDRKPVTLADVKA
TGAMTVLLKDALQPNLVQTLEGNPALIHGGPFANIAHGCNSVIATRTGLRLADYTVTEAGFGADLGAEKFIDIKCRQTGL
KPSSVVIVATIRALKMHGGVNKKDLQAENLDALEKGFANLERHVNNVRSFGLPVVVGVNHFFQDTDAEHARLKELCRDRL
QVEAITCKHWAEGGAGAEALAQAVVKLAEGEQKPLTFAYETETKITDKIKAIATKLYGAADIQIESKAATKLAGFEKDGY
GKLPVCMAKTQYSFSTDPTLMGAPSGHLVSVRDVRLSAGAGFVVVICGEIMTMPGLPKVPAADTIRLDANGQIDGLF
>Q83WS0 6.3.4.3~~~fhs~~~Formate--tetrahydrofolate ligase~~~COG2759
MPSDIEIARAATLKPIAQVAEKLGIPDEALHNYGKHIAKIDHDFIASLEGKPEGKLVLVTAISPTPAGEGKTTTTVGLGD
ALNRIGKRAVMCLREPSLGPCFGMKGGAAGGGKAQVVPMEQINLHFTGDFHAITSAHSLAAALIDNHIYWANELNIDVRR
IHWRRVVDMNDRALRAINQSLGGVANGFPREDGFDITVASEVMAVFCLAKNLADLEERLGRIVIAETRDRKPVTLADVKA
TGAMTVLLKDALQPNLVQTLEGNPALIHGGPFANIAHGCNSVIATRTGLRLADYTVTEAGFGADLGAEKFIDIKCRQTGL
KPSAVVIVATIRALKMHGGVNKKDLQAENLDALEKGFANLERHVNNVRSFGLPVVVGVNHFFQDTDAEHARLKELCRDRL
QVEAITCKHWAEGGAGAEALAQAVVKLAEGEQKPLTFAYETETKITDKIKAIATKLYGAADIQIESKAATKLAGFEKDGY
GGLPVCMAKTQYSFSTDPTLMGAPSGHLVSVRDVRLSAGAGFVVVICGEIMTMPGLPKVPAADTIRLDANGQIDGLF
>Q2RM91 6.3.4.3~~~fhs~~~Formate--tetrahydrofolate ligase~~~COG2759
MSKVPSDIEIAQAAKMKPVMELARGLGIQEDEVELYGKYKAKISLDVYRRLKDKPDGKLILVTAITPTPAGEGKTTTSVG
LTDALARLGKRVMVCLREPSLGPSFGIKGGAAGGGYAQVVPMEDINLHFTGDIHAVTYAHNLLAAMVDNHLQQGNVLNID
PRTITWRRVIDLNDRALRNIVIGLGGKANGVPRETGFDISVASEVMACLCLASDLMDLKERFSRIVVGYTYDGKPVTAGD
LEAQGSMALLMKDAIKPNLVQTLENTPAFIHGGPFANIAHGCNSIIATKTALKLADYVVTEAGFGADLGAEKFYDVKCRY
AGFKPDATVIVATVRALKMHGGVPKSDLATENLEALREGFANLEKHIENIGKFGVPAVVAINAFPTDTEAELNLLYELCA
KAGAEVALSEVWAKGGEGGLELARKVLQTLESRPSNFHVLYNLDLSIKDKIAKIATEIYGADGVNYTAEADKAIQRYESL
GYGNLPVVMAKTQYSFSDDMTKLGRPRNFTITVREVRLSAGAGFIVPITGAIMTMPGLPKRPAACNIDIDADGVITGLF
>P21164 6.3.4.3~~~fhs~~~Formate--tetrahydrofolate ligase~~~
MSKVPSDIEIAQAAKMKPVMELARGLGIQEDEVELYGKYKAKISLDVYRRLKDKPDGKLILVTAITPTPAGEGKTTTSVG
LTDALARLGKRVMVCLREPSLGPSFGIKGGAAGGGYAQVVPMEDINLHFTGDIHAVTYAHNLLAAMVDNHLQQGNVLNID
PRTITWRRVIDLNDRALRNIVIGLGGKANGVPRETGFDISVASEVMACLCLASDLMDLKERFSRIVVGYTYDGKPVTAGD
LEAQGSMALLMKDAIKPNLVQTLENTPAFIHGGPFANIAHGCNSIIATKTALKLADYVVTEAGFGADLGAEKFYDVKCRY
AGFKPDATVIVATVRALKMHGGVPKSDLATENLEALREGFANLEKHIENIGKFGVPAVVAINAFPTDTEAELNLLYELCA
KAGAEVALSEVWAKGGEGGLELARKVLQTLESRPSNFHVLYNLDLSIKDKIAKIATEIYGADGVNYTAEADKAIQRYESL
GYGNLPVVMAKTQYSFSDDMTKLGRPRNFTITVREVRLSAGGRLIVPITGAIMTMPGLPKRPAACNIDIDADGVITGLF
>Q7A535 6.3.4.3~~~fhs~~~Formate--tetrahydrofolate ligase~~~
MTHLSDLDIANQSTLQPIKDIAASVGISEDALEPYGHYKAKIDINKITPRENKGKVVLVTAMSPTPAGEGKSTVTVGLAD
AFHELNKNVMVALREPALGPTFGIKGGATGGGYAQVLPMEDINLHFNGDFHAITTANNALSAFIDNHIHQGNELGIDQRR
IEWKRVLDMNDRALRHVNVGLGGPTNGVPREDGFNITVASEIMAILCLSRSIKDLKDKISRITIGYTRDRKPVTVADLKV
QGALAMILKDAIKPNLVQSIEGTPALVHGGPFANIAHGCNSILATETARDLADIVVTEAGFGSDLGAEKFMDIKAREAGF
DLAAVVVVATIRALKMHGGVAKDNLKEENVEAVKAGIVNLERHVNNIKKFGVEPVVAINAFIHDTDAEVEYVKSWAKENN
VRIALTEVWEKGGKGGVDLANEVLEVIDQPNSFKPLYELELPLEQKIEKIVTEIYGGSKVTFSSKAQKQLKQFKENGWDN
YPVCMAKTQYSFSDDQTLLGAPSGFEITIRELEAKTGAGFIVALTGAIMTMPGLPKKPAALNMDVTDDGHAIGLF
>Q9X287 6.3.4.3~~~fhs~~~Formate--tetrahydrofolate ligase~~~COG2759
MKPIKEIADQLELKDDILYPYGHYIAKIDHRFLKSLENHEDGKLILVTAVTPTPAGEGKTTTSIGLSMSLNRIGKKSIVT
LREPSLGPTLGLKGGATGGGRSRVLPSDEINLHFTGDMHAVASAHNLLAAVLDSHIKHGNELKIDITRVFWKRTMDMNDR
ALRSIVIGLGGSANGFPREDSFIITAASEVMAILALSENMKDLKERLGKIIVALDADRKIVRISDLGIQGAMAVLLKDAI
NPNLVQTTEGTPALIHCGPFANIAHGTNSIIATKMAMKLSEYTVTEAGFGADLGAEKFIDFVSRVGGFYPNAAVLVATVR
ALKYHGGANLKNIHEENLEALKEGFKNLRVHVENLRKFNLPVVVALNRFSTDTEKEIAYVVKECEKLGVRVAVSEVFKKG
SEGGVELAKAVAEAAKDVEPAYLYEMNDPVEKKIEILAKEIYRAGRVEFSDTAKNALKFIKKHGFDELPVIVAKTPKSIS
HDPSLRGAPEGYTFVVSDLFVSAGAGFVVALSGDINLMPGLPKKPNALNMDVDDSGNIVGVS
>P43707 1.16.3.2~~~ftnA~~~Probable bacterial non-heme ferritin~~~COG1528
MLNQIIINKLNDQINLEFYSSNVYLQMSAWCSKHGYEGAATFLLRHADEELEHMQKLFNYVSETSGMPILGKIDAPKHDY
SSLREVFEITLEHEKLVTSKINELVEVTFESKDYSTFNFLQWYVAEQHEEEKLFSGIIDRFNLVGEDGKGLFFIDRELAT
LE
>P0A998 1.16.3.2~~~ftnA~~~Bacterial non-heme ferritin~~~COG1528
MLKPEMIEKLNEQMNLELYSSLLYQQMSAWCSYHTFEGAAAFLRRHAQEEMTHMQRLFDYLTDTGNLPRINTVESPFAEY
SSLDELFQETYKHEQLITQKINELAHAAMTNQDYPTFNFLQWYVSEQHEEEKLFKSIIDKLSLAGKSGEGLYFIDKELST
LDTQN
>E1WS50 1.16.3.2~~~ftnA~~~Bacterial non-heme ferritin~~~
MISEKLQNAINEQISAEMWSSNLYLSMSFYFEREGFSGFAHWMKKQSQEEMGHAYAMADYIIKRGGIAKVDKIDVVPTGW
GTPLEVFEHVFEHERHVSKLVDALVDIAAAEKDKATQDFLWGFVREQVEEEATAQGIVDKIKRAGDAGIFFIDSQLGQR
>P0CJ83 1.16.3.2~~~ftnA~~~Bacterial non-heme ferritin~~~
MISEKLQNAINEQISAEMWSSNLYLSMSFYFEREGFSGFAHWMKKQSQEEMGHAYAMADYIIKRGGIAKVDKIDVVPTGW
GTPLEVFEHVFEHERHVSKLVDALVDIAAAEKDKATQDFLWGFVREQVEEEATAQGIVDKIKRAGDAGIFFIDSQLGQR
>Q46106 1.16.3.2~~~ftn~~~Bacterial non-heme ferritin~~~COG1528
MLSKEVVKLLNEQINKEMYAANLYLSMSSWCYENSLDGAGAFLFAHASEESDHAKKLITYLNETDSHVELQEVKQPEQNF
KSLLDVFEKTYEHEQFITKSINTLVEHMLTHKDYSTFNFLQWYVSEQHEEEALFRGIVDKIKLIGEHGNGLYLADQYIKN
IALSRKK
>Q9ZLI1 1.16.3.2~~~ftnA~~~Bacterial non-heme ferritin~~~COG1528
MLSKDIIKLLNEQVNKEMNSSNLYMSMSSWCYTHSLDGAGLFLFDHAAEEYEHAKKLIIFLNENNVPVQLTSISAPEHKF
EGLTQIFQKAYEHEQHISESINNIVDHAIKSKDHATFNFLQWYVAEQHEEEVLFKDILDKIELIGNENHGLYLADQYVKG
IAKSRKS
>P52093 1.16.3.2~~~ftnA~~~Bacterial non-heme ferritin~~~COG1528
MLSKDIIKLLNEQVNKEMNSSNLYMSMSSWCYTHSLDGAGLFLFDHAAEEYEHAKKLIVFLNENNVPVQLTSISAPEHKF
EGLTQIFQKAYEHEQHISESINNIVDHAIKGKDHATFNFLQWYVSEQHEEEVLFKDILDKIELIGNENHGLYLADQYVKG
IAKSRKS
>Q2FWZ8 1.16.3.2~~~ftnA~~~Bacterial non-heme ferritin~~~COG1528
MLSKNLLEALNDQMNHEYFAAHAYMAMAAYCDKESYEGFANFFIQQAKEERFHGQKIYNYINDRGAHAEFRAVSAPKIDF
SSILETFKDSLSQEQEVTRRFYNLSEIARQDKDYATISFLNWFLDEQVEEESMFETHINYLTRIGDDSNALYLYEKELGA
RTFDEE
>Q7A4R2 1.16.3.2~~~ftnA~~~Bacterial non-heme ferritin~~~
MLSKNLLEALNDQMNHEYFAAHAYMAMAAYCDKESYEGFANFFIQQAKEERFHGQKIYNYINDRGAHAEFRAVSAPKIDF
SSILETFKDSLSQEQEVTRRFYNLSEIARQDKDYATISFLNWFLDEQVEEESMFETHINYLTRIGDDSNALYLYEKELGA
RTFDEE
>Q47953 ~~~ftpA~~~Fine tangled pili major subunit~~~COG0783
MRSKTITFPVLKLTGQSQALTNDMHKNADHTVPGLTVATGHLIAEALQMRLQGLNELALILKHAHWNVVGPQFIAVHEML
DSQVDEVRDFIDEIAERMATLGVAPNGLSGNLVETRQSPEYPLGRATAQDHLKLIDLYYSHNIEAHRVVLEHNGHLDPIS
EDLLVAQTRSLEKLQWFIRAHLDNGNGNI
>Q55389 1.8.7.2~~~ftrC~~~Ferredoxin-thioredoxin reductase, catalytic chain~~~COG4802
MTSSDTQNNKTLAAMKNFAEQYAKRTDTYFCSDLSVTAVVIEGLARHKEELGSPLCPCRHYEDKEAEVKNTFWNCPCVPM
RERKECHCMLFLTPDNDFAGDAQDIPMETLEEVKASMA
>P24018 ~~~ftrV~~~Ferredoxin-thioredoxin reductase, variable chain~~~
MNVGDRVRVKESVVVYHHPDHRNQAFDLKDAEGEIAAILTEWNGKPISANFPYLVSFSNKFRAHLRDFELEVI
>Q55781 ~~~ftrV~~~Ferredoxin-thioredoxin reductase, variable chain~~~
MNVGDRVRVTSSVVVYHHPEHKKTAFDLQGMEGEVAAVLTEWQGRPISANLPVLVKFEQRFKAHFRPDEVTLIED
>P28264 ~~~ftsA~~~Cell division protein FtsA~~~COG0849
MNNNELYVSLDIGTSNTKVIVGEMTDDSLNIIGVGNVPSEGLKKGSIVDIDETVHSIRKAFDQAERMVGFPLRKAIVGVN
GNYINIQDTNGVVAVSSENKEIQVEDVRRVMEAAQVVSVPHEQLIVDVIPKQFIVDGRDEITDPKKMLGVRLEVEGTLIT
GSKTILHNLLRCVERAGIEITDICLQPLAAGSAALSKDEKNLGVALIDIGGGSTTIAVFQNGHLTSTRVIPLGGENITKD
ISIGLRTSTEEAERVKKQLGHAYYDEASEDEIFEVTVIGTNQKQTFTQQEAANIIEARVEEILEIVSEELRSMGITDLPG
GFVLTGGQAAMPGVMSLAQDVLQNNVRVASPNYIGVRDPQYMTGVGLIQFACRNARIQGRKIGFKMPEEAIQEIAVSSSE
EQEQHHHQNEVQQRPKGKQKTQAEHNKQSKMKKLLSMFWE
>P0ABH0 ~~~ftsA~~~Cell division protein FtsA~~~COG0849
MIKATDRKLVVGLEIGTAKVAALVGEVLPDGMVNIIGVGSCPSRGMDKGGVNDLESVVKCVQRAIDQAELMADCQISSVY
LALSGKHISCQNEIGMVPISEEEVTQEDVENVVHTAKSVRVRDEHRVLHVIPQEYAIDYQEGIKNPVGLSGVRMQAKVHL
ITCHNDMAKNIVKAVERCGLKVDQLIFAGLASSYSVLTEDERELGVCVVDIGGGTMDIAVYTGGALRHTKVIPYAGNVVT
SDIAYAFGTPPSDAEAIKVRHGCALGSIVGKDESVEVPSVGGRPPRSLQRQTLAEVIEPRYTELLNLVNEEILQLQEKLR
QQGVKHHLAAGIVLTGGAAQIEGLAACAQRVFHTQVRIGAPLNITGLTDYAQEPYYSTAVGLLHYGKESHLNGEAEVEKR
VTASVGSWIKRLNSWLRKEF
>Q9K0X8 ~~~ftsA~~~Cell division protein FtsA~~~
MEQQQRYISVLDIGTSKVLALIGEVQDDDKINIVGLGQAPSRGLRAGMVTNIDATVQAIRQAVNDAELMADTKITHVTTG
IAGNHIRSLNSQGVVKIKDGEVTQADIDRAIETAKAINIPPDQKILDAVVQDYIIDTQLGVREPIGMSGVRLDTRVHIIT
GASTAVQNVQKCIERCGLKSDQIMLQPLASGQAVLTEDEKDLGVCVIDIGGGTTDIAVYMNGAIRHTSVIPAGGNLITKD
LSKSLRTPLDAAEYIKIHYGVASCDTEGLGEMIEVPGVGDRTSRQVSSKVLAAIISARIQEIFGVVLGELQKSGFPKEVL
NAGIVLTGGVSMMTGIVEFAEKIFDLPVRTGAPQEMGGLSDRVRTPRFSTAIGLLHAACKLEGNLPQPENGAVQEREGGG
GLLARLKRWIENSF
>O07325 ~~~ftsA~~~Cell division protein FtsA~~~COG0849
MEEHYYVSIDIGSSSVKTIVGEKFHNGINVIGTGQTYTSGIKNGLIDDFDIARQAIKDTIKKASIASGVDIKEVFLKLPI
IGTEVYDESNEIDFYEDTEINGSHIEKVLEGIREKNDVQETEVINVFPIRFIVDKENEVSDPKELIARHSLKVEAGVIAI
QKSILINMIKCVEACGVDVLDVYSDAYNYGSILTATEKELGACVIDIGEDVTQVAFYERGELVDADSIEMAGRDITDDIA
QGLNTSYETAEKVKHQYGHAFYDSASDQDIFTVEQVDSDETVQYTQKDLSDFIEARVEEIFFEVFDVLQDLGLTKVNGGF
IVTGGSANLLGVKELLSDMVSEKVRIHTPSQMGIRKPEFSSAISTISSSIAFDELLDYVTINYHDNEETEEDVIDVKDKD
NESKLGGFDWFKRKTNKKDTHENEVESTDEEIYQSEDNHQEHKQNHEHVQDKDKDKEESKFKKLMKSLFE
>P63765 ~~~ftsA~~~Cell division protein FtsA~~~
MEEHYYVSIDIGSSSVKTIVGEKFHNGINVIGTGQTYTSGIKNGLIDDFDIARQAIKDTIKKASIASGVDIKEVFLKLPI
IGTEVYDESNEIDFYEDTEINGSHIEKVLEGIREKNDVQETEVINVFPIRFIVDKENEVSDPKELIARHSLKVEAGVIAI
QKSILINMIKCVEACGVDVLDVYSDAYNYGSILTATEKELGACVIDIGEDVTQVAFYERGELVDADSIEMAGRDITDDIA
QGLNTSYETAEKVKHQYGHAFYDSASDQDIFTVEQVDSDETVQYTQKDLSDFIEARVEEIFFEVFDVLQDLGLTKVNGGF
IVTGGSANLLGVKELLSDMVSEKVRIHTPSQMGIRKPEFSSAISTISSSIAFDELLDYVTINYHDNEETEEDVIDVKDKD
NESKLGGFDWFKRKTNKKDTHENEVESTDEEIYQSEDNHQEHKQNHEHVQDKDKDKEESKFKKLMKSLFE
>Q6GHQ0 ~~~ftsA~~~Cell division protein FtsA~~~
MEEHYYVSIDIGSSSVKTIVGEKFHNGINVIGTGQTYTSGIKNGLIDDFDIARQAIKDTIKKASIASGVDIKEVFLKLPI
IGTEVYDESNEIDFYEDTEINGSHIEKVLEGIREKNDVQETEVINVFPIRFIVDKENEVSDPKELIARHSLKVEAGVIAI
QKSILINMIKCVEACGVDVLDVYSDAYNYGSILTATEKELGACVIDIGEDVTQVAFYERGELVDADSIEMAGRDITDDIA
QGLNTSYETAEKVKHQYGHAFYDSASDQDIFTVEQVDSDETVQYTQKDLSDFIEARVEEIFFEVFDVLQDLGLTKVNGGF
IVTGGSANLLGVKELLSDMVSEKVRIHTPSQMGIRKPEFSSAISTISSSIAFDELLDYVTINYHDNEETEEDVIDVKDKD
NESKLGGFDWFKRKTNKKDTHENEVESSDEEIYQSEDNHQEHKQNHEHVQDKDKEESKFKKLMKSLFE
>Q8NX33 ~~~ftsA~~~Cell division protein FtsA~~~
MEEHYYVSIDIGSSSVKTIVGEKFHNGINVIGTGQTYTSGIKNGLIDDFDIARQAIKDTIKKASIASGVDIKEVFLKLPI
IGTEVYDESNEIDFYEDTEINGSHIEKVLEGIREKNDVQETEVINVFPIRFIVDKENEVSDPKELIARHSLKVEAGVIAI
QKSILINMIKCVEACGVDVLDVYSDAYNYGSILTATEKELGACVIDIGEDVTQVAFYERGELVDADSIEMAGRDITDDIA
QGLNTSYETAEKVKHQYGHAFYDSASDQDIFTVEQVDSDETVQYTQKDLSDFIEARVEEIFFEVFDVLQDLGLTKVNGGF
IVTGGSANLLGVKELLSDMVSEKVRIHTPSQMGIRKPEFSSAISTISSSIAFDELLDYVTINYHDSEETEEDVIDVKDKD
NESKLGGFDWFKRKTNKKDTHENEVESTDEEIYQSEDNHQEHKQNHEHVQDKDKDKEESKFKKLMKSLFE
>A0A0H2ZPT5 ~~~ftsA~~~Cell division protein FtsA~~~COG0849
MAREGFFTGLDIGTSSVKVLVAEQRNGELNVIGVSNAKSKGVKDGIIVDIDAAATAIKSAISQAEEKAGISIKSVNVGLP
GNLLQVEPTQGMIPVTSDTKEITDQDVENVVKSALTKSMTPDREVITFIPEEFIVDGFQGIRDPRGMMGVRLEMRGLLYT
GPRTILHNLRKTVERAGVQVENVIISPLAMVQSVLNEGEREFGATVIDMGAGQTTVATIRNQELQFTHILQEGGDYVTKD
ISKVLKTSRKLAEGLKLNYGEAYPPLASKETFQVEVIGEVEAVEVTEAYLSEIISARIKHILEQIKQELDRRRLLDLPGG
IVLIGGNAILPGMVELAQEVFGVRVKLYVPNQVGIRNPAFAHVISLSEFAGQLTEVNLLAQGAIKGENDLSHQPISFGGM
LQKTAQFVQSTPVQPAPAPEVEPVAPTEPMADFQQASQNKPKLADRFRGLIGSMFDE
>P0A6S5 ~~~ftsB~~~Cell division protein FtsB~~~COG2919
MGKLTLLLLAILVWLQYSLWFGKNGIHDYTRVNDDVAAQQATNAKLKARNDQLFAEIDDLNGGQEALEERARNELSMTRP
GETFYRLVPDASKRAQSAGQNNR
>Q9HXZ6 ~~~ftsB~~~Cell division protein FtsB~~~
MRLRSPYWLFVVLILALAGLQYRLWVGDGSLAQVRDLQKQIADQHGENERLLERNRILEAEVAELKKGTETVEERARHEL
GMVKDGETLYQLAK
>Q9KUJ3 ~~~ftsB~~~Cell division protein FtsB~~~COG2919
MRVFALTLSLLLVWLLYTLMWGKNGVMDFRAVQAEIEVQQQVNANLHLRNQEMFAEIDDLRQGLDAIEERARNELGMVKD
GETFYRIIGEESRQ
>O34814 ~~~ftsE~~~Cell division ATP-binding protein FtsE~~~COG2884
MIEMKEVYKAYPNGVKALNGISVTIHPGEFVYVVGPSGAGKSTFIKMIYREEKPTKGQILINHKDLATIKEKEIPFVRRK
IGVVFQDFKLLPKLTVFENVAFALEVIGEQPSVIKKRVLEVLDLVQLKHKARQFPDQLSGGEQQRVSIARSIVNNPDVVI
ADEPTGNLDPDTSWEVMKTLEEINNRGTTVVMATHNKEIVNTMKKRVIAIEDGIIVRDESRGEYGSYD
>P0A9R7 ~~~ftsE~~~Cell division ATP-binding protein FtsE~~~COG2884
MIRFEHVSKAYLGGRQALQGVTFHMQPGEMAFLTGHSGAGKSTLLKLICGIERPSAGKIWFSGHDITRLKNREVPFLRRQ
IGMIFQDHHLLMDRTVYDNVAIPLIIAGASGDDIRRRVSAALDKVGLLDKAKNFPIQLSGGEQQRVGIARAVVNKPAVLL
ADEPTGNLDDALSEGILRLFEEFNRVGVTVLMATHDINLISRRSYRMLTLSDGHLHGGVGHE
>A5U7B7 ~~~ftsE~~~Cell division ATP-binding protein FtsE~~~COG2884
MITLDHVTKQYKSSARPALDDINVKIDKGEFVFLIGPSGSGKSTFMRLLLAAETPTSGDVRVSKFHVNKLRGRHVPKLRQ
VIGCVFQDFRLLQQKTVYDNVAFALEVIGKRTDAINRVVPEVLETVGLSGKANRLPDELSGGEQQRVAIARAFVNRPLVL
LADEPTGNLDPETSRDIMDLLERINRTGTTVLMATHDHHIVDSMRQRVVELSLGRLVRDEQRGVYGMDR
>O05779 ~~~ftsE~~~Cell division ATP-binding protein FtsE~~~COG2884
MMITLDHVTKQYKSSARPALDDINVKIDKGEFVFLIGPSGSGKSTFMRLLLAAETPTSGDVRVSKFHVNKLRGRHVPKLR
QVIGCVFQDFRLLQQKTVYDNVAFALEVIGKRTDAINRVVPEVLETVGLSGKANRLPDELSGGEQQRVAIARAFVNRPLV
LLADEPTGNLDPETSRDIMDLLERINRTGTTVLMATHDHHIVDSMRQRVVELSLGRLVRDEQRGVYGMDR
>P73179 3.4.24.-~~~ftsH1~~~ATP-dependent zinc metalloprotease FtsH 1~~~COG0465
MSHRPRSDRHSFSSPSRFWHRLGMGLLVAGTLALPVSTLAQEGGEGAQPKASPSPIQSPNSSNGEATPRSFFNSGSPRSA
EPKMNYGQLIDAIKANQVAKVEVDTNRRQAIVTLKDAPPGSKPQTVQLLDNNPELLNLLRSRSETIDLDINRTPDNSALY
GLLTNLLVVAILIGLVVMVVRRSANASGQAMSFGKSKARFQMEAKTGVGFDDVAGIDEAKEELQEVVTFLKQPEKFTAIG
AKIPRGVLLIGPPGTGKTLLAKAIAGEAGVPFFSISGSEFVEMFVGVGASRVRDLFKKAKENAPCLVFIDEIDAVGRQRG
VGYGGGNDEREQTLNQLLTEMDGFEGNSGIIVIAATNRPDVLDLALLRPGRFDRQVTVDYPDVQGRELILAIHAQNKKLH
EEVQLAAIARRTPGFTGADLANVLNEAAIFTARRRKEAITMAEVNDAIDRVVAGMEGTPLVDSKSKRLIAYHEVGHALIG
TLCPGHDPVEKVTLIPRGQAQGLTWFTPDEDQSLMTRNQMIARIAGLLGGRVAEEVIFGDDEVTTGAGNDIEKITYLARQ
MVTKLGMSSLGLVALEEEGDRNFSGGDWGKRSEYSEDIAARIDREIQAIVTAAHQRATRIIEENRNLMDLLVDALIDQET
IEGEHFRQLVESYQQSQKQPALAGK
>Q55700 3.4.24.-~~~ftsH2~~~ATP-dependent zinc metalloprotease FtsH 2~~~COG0465
MKFSWRTALLWSLPLLVVGFFFWQGSFGGADANLGSNTANTRMTYGRFLEYVDAGRITSVDLYENGRTAIVQVSDPEVDR
TLRSRVDLPTNAPELIARLRDSNIRLDSHPVRNNGMVWGFVGNLIFPVLLIASLFFLFRRSSNMPGGPGQAMNFGKSKAR
FQMDAKTGVMFDDVAGIDEAKEELQEVVTFLKQPERFTAVGAKIPKGVLLVGPPGTGKTLLAKAIAGEAGVPFFSISGSE
FVEMFVGVGASRVRDLFKKAKENAPCLIFIDEIDAVGRQRGAGIGGGNDEREQTLNQLLTEMDGFEGNTGIIIIAATNRP
DVLDSALMRPGRFDRQVMVDAPDYSGRKEILEVHARNKKLAPEVSIDSIARRTPGFSGADLANLLNEAAILTARRRKSAI
TLLEIDDAVDRVVAGMEGTPLVDSKSKRLIAYHEVGHAIVGTLLKDHDPVQKVTLIPRGQAQGLTWFTPNEEQGLTTKAQ
LMARIAGAMGGRAAEEEVFGDDEVTTGAGGDLQQVTEMARQMVTRFGMSNLGPISLESSGGEVFLGGGLMNRSEYSEEVA
TRIDAQVRQLAEQGHQMARKIVQEQREVVDRLVDLLIEKETIDGEEFRQIVAEYAEVPVKEQLIPQL
>P72991 3.4.24.-~~~ftsH3~~~ATP-dependent zinc metalloprotease FtsH 3~~~COG0465
MSKNNKKWRNAGLYALLLIVVLALASAFFDRPTQTRETLSYSDFVNRVEANQIERVNLSADRTQAQVPNPSGGPPYLVNL
PNDPDLINILTQHNVDIAVQPQSDEGFWFRIASTLFLPILLLVGIFFLFRRAQSGPGSQAMNFGKSKARVQMEPQTQVTF
GDVAGIEQAKLELTEVVDFLKNADRFTELGAKIPKGVLLVGPPGTGKTLLAKAVAGEAGVPFFSISGSEFVEMFVGVGAS
RVRDLFEQAKANAPCIVFIDEIDAVGRQRGAGLGGGNDEREQTLNQLLTEMDGFEGNTGIIIVAATNRPDVLDSALMRPG
RFDRQVVVDRPDYAGRREILNVHARGKTLSQDVDLDKIARRTPGFTGADLSNLLNEAAILAARRNLTEISMDEVNDAIDR
VLAGPEKKNRVMSEKRKTLVAYHEAGHALVGALMPDYDPVQKISIIPRGRAGGLTWFTPSEDRMESGLYSRSYLQNQMAV
ALGGRIAEEIIFGEEEVTTGASNDLQQVARVARQMVTRFGMSDRLGPVALGRQGGGVFLGRDIASDRDFSDETAAAIDEE
VSQLVDQAYQRAKQVLVENRGILDQLAEILVEKETVDSEELQTLLANNNAKLALLV
>P73437 3.4.24.-~~~ftsH4~~~ATP-dependent zinc metalloprotease FtsH 4~~~COG0465
MAIKPQPQWQRRLASVLLWGSTIYLLVNLLAPALFRSQPPQVPYSLFIDQVEGDKVASVYVGQNEIRYQLKPEAEDEGKE
KAAEGQILRTTPIFDLELPKRLEAKGIEFAAAPPAKNSWFGTLLSWVIPPLIFVGIWSFFLNRNNNGAPGGALAFTKSKA
KVYVEGDSTKVTFDDVAGVEEAKTELSEVVDFLKFPQRYTALGAKIPKGVLLVGPPGTGKTLLAKAAAGEAGVPFFIISG
SEFVELFVGAGAARVRDLFEQAKKQAPCIVFIDELDAIGKSRASGAFMGGNDEREQTLNQLLTEMDGFSAAGATVIVLAA
TNRPETLDPALLRPGRFDRQVLVDRPDLAGRLKILEIYAKKIKLDKEVELKNIATRTPGFAGADLANLVNEAALLAARNK
QDSVTEADFREAIERVVAGLEKKSRVLSDKEKKIVAYHEVGHALVGAVMPGGGQVAKISIVPRGMAALGYTLQMPTEDRF
LLNESELRDQIATLLGGRAAEEIVFDSITTGAANDLQRATDLAEQMVTTYGMSKVLGPLAYDKGQQNNFLGQGMGNPRRM
VSDDTAKEIDLEVKEIVEQGHNQALAILEHNRDLLEAIAEKILEKEVIEGEELHHLLGQVQAPGTLVV
>O67077 3.4.24.-~~~ftsH~~~ATP-dependent zinc metalloprotease FtsH~~~COG0465
MNALKNFFIWAIIIGAAIVAFNLFEGKREFTTKVSLNEVVKLVEEGKVSYAEVRGNTAIIQTKDGQKLEVTLPPNTNLVD
KMVEKGVRVEVANPEPPGGWLVNVFLSWLPILFFIGIWIFLLRQMSGGGNVNRAFNFGKSRAKVYIEEKPKVTFKDVAGI
EEVKEEVKEIIEYLKDPVKFQKLGGRPPKGVLLYGEPGVGKTLLAKAIAGEAHVPFISVSGSDFVEMFVGVGAARVRDLF
ETAKKHAPCIIFIDEIDAVGRARGAIPVGGGHDEREQTLNQLLVEMDGFDTSDGIIVIAATNRPDILDPALLRPGRFDRQ
IFIPKPDVRGRYEILKVHARNKKLAKDVDLEFVARATPGFTGADLENLLNEAALLAARKGKEEITMEEIEEALDRITMGL
ERKGMTISPKEKEKIAIHEAGHALMGLVSDDDDKVHKISIIPRGMALGVTQQLPIEDKHIYDKKDLYNKILVLLGGRAAE
EVFFGKDGITTGAENDLQRATDLAYRMVSMWGMSDKVGPIAIRRVANPFLGGMTTAVDTSPDLLREIDEEVKRIITEQYE
KAKAIVEEYKEPLKAVVKKLLEKETITCEEFVEVFKLYGIELKDKCKKEELFDKDRKSEENKELKSEEVKEEVV
>P37476 3.4.24.-~~~ftsH~~~ATP-dependent zinc metalloprotease FtsH~~~COG0465
MNRVFRNTIFYLLILLVVIGVVSYFQTSNPKTENMSYSTFIKNLDDGKVDSVSVQPVRGVYEVKGQLKNYDKDQYFLTHV
PEGKGADQIFNALKKTDVKVEPAQETSGWVTFLTTIIPFVIIFILFFFLLNQAQGGGSRVMNFGKSKAKLYTEEKKRVKF
KDVAGADEEKQELVEVVEFLKDPRKFAELGARIPKGVLLVGPPGTGKTLLAKACAGEAGVPFFSISGSDFVEMFVGVGAS
RVRDLFENAKKNAPCLIFIDEIDAVGRQRGAGLGGGHDEREQTLNQLLVEMDGFSANEGIIIIAATNRADILDPALLRPG
RFDRQITVDRPDVIGREAVLKVHARNKPLDETVNLKSIAMRTPGFSGADLENLLNEAALVAARQNKKKIDARDIDEATDR
VIAGPAKKSRVISKKERNIVAYHEGGHTVIGLVLDEADMVHKVTIVPRGQAGGYAVMLPREDRYFQTKPELLDKIVGLLG
GRVAEEIIFGEVSTGAHNDFQRATNIARRMVTEFGMSEKLGPLQFGQSQGGQVFLGRDFNNEQNYSDQIAYEIDQEIQRI
IKECYERAKQILTENRDKLELIAQTLLKVETLDAEQIKHLIDHGTLPERNFSDDEKNDDVKVNILTKTEEKKDDTKE
>B8H444 3.4.24.-~~~ftsH~~~ATP-dependent zinc metalloprotease FtsH~~~
MNFRNLAIWLVIVAVLGGVFVVSQNSRTKSSSEISYSQLLKDVDAGKIKSAEIAGQTVLAKTADNKTLTVNAPMNSEELV
NRMVAKNADVKFKSGSISFLAILVQLLPILLVVGVWLFLMRQMQGGAKGAMGFGKSKARLLTENKNRITFEDVAGVDEAK
EELQEVVDFLKDPAKFQRLGGKIPKGALLVGPPGTGKTLIARAVAGEAGVPFFTISGSDFVEMFVGVGASRVRDMFEQAK
KNAPCIIFIDEIDAVGRHRGAGLGGGNDEREQTLNQLLVEMDGFEANEGIILIAATNRPDVLDPALLRPGRFDRQVVVPN
PDVAGREKIIRVHMKNVPLAADVDVKTLARGTPGFSGADLANLVNEAALMAARKNRRMVTMQDFEQAKDKVMMGAERRSM
AMNEEEKKLTAYHEGGHAIVALNVPLADPVHKATIVPRGRALGMVMQLPEGDRYSMKYQQMTSRLAIMMGGRVAEEIIFG
KENITSGASSDIKAATDLARNMVTRWGYSDILGTVAYGDNQDEVFLGHSVARTQNVSEETARLIDSEVKRLVQYGLDEAR
RILTDKIDDLHTLGKALLEYETLSGEEIADILKGIPPKREEEEAATAVIAPSLVPLSPGAGASVTA
>P0AAI3 3.4.24.-~~~ftsH~~~ATP-dependent zinc metalloprotease FtsH~~~COG0465
MAKNLILWLVIAVVLMSVFQSFGPSESNGRKVDYSTFLQEVNNDQVREARINGREINVTKKDSNRYTTYIPVQDPKLLDN
LLTKNVKVVGEPPEEPSLLASIFISWFPMLLLIGVWIFFMRQMQGGGGKGAMSFGKSKARMLTEDQIKTTFADVAGCDEA
KEEVAELVEYLREPSRFQKLGGKIPKGVLMVGPPGTGKTLLAKAIAGEAKVPFFTISGSDFVEMFVGVGASRVRDMFEQA
KKAAPCIIFIDEIDAVGRQRGAGLGGGHDEREQTLNQMLVEMDGFEGNEGIIVIAATNRPDVLDPALLRPGRFDRQVVVG
LPDVRGREQILKVHMRRVPLAPDIDAAIIARGTPGFSGADLANLVNEAALFAARGNKRVVSMVEFEKAKDKIMMGAERRS
MVMTEAQKESTAYHEAGHAIIGRLVPEHDPVHKVTIIPRGRALGVTFFLPEGDAISASRQKLESQISTLYGGRLAEEIIY
GPEHVSTGASNDIKVATNLARNMVTQWGFSEKLGPLLYAEEEGEVFLGRSVAKAKHMSDETARIIDQEVKALIERNYNRA
RQLLTDNMDILHAMKDALMKYETIDAPQIDDLMARRDVRPPAGWEEPGASNNSGDNGSPKAPRPVDEPRTPNPGNTMSEQ
LGDK
>P71408 3.4.24.-~~~ftsH~~~ATP-dependent zinc metalloprotease FtsH~~~COG0465
MKPTNEPKKPFFQSPIILAVLGGILLIFFLRSFNSDGSFSDNFLASSTKNVSYHEIKQLISNNEVENVSIGQTLIKASHK
EGNNRVIYIAKRVPDLTLVPLLDEKKINYSGFSESNFFTDMLGWLMPILVILGLWMFMANRMQKNMGGGIFGMGSAKKLI
NAEKPNVRFNDMAGNEEAKEEVVEIVDFLKYPERYANLGAKIPKGVLLVGPPGTGKTLLAKAVAGEAHVPFFSMGGSSFI
EMFVGLGASRVRDLFETAKKQAPSIIFIDEIDAIGKSRAAGGVVSGNDEREQTLNQLLAEMDGFGSENAPVIVLAATNRP
EILDPALMRPGRFDRQVLVDKPDFNGRVEILKVHIKGVKLANDVNLQEVAKLTAGLAGADLANIINEAALLAGRNNQKEV
RQQHLKEAVERGIAGLEKKSRRISPKEKKIVAYHESGHAVISEMTKGSARVNKVSIIPRGMAALGYTLNTPEENKYLMQK
HELIAEIDVLLGGRAAEDVFLEEISTGASNDLERATDIIKGMVSYYGMSSVSGLMVLEKQRNAFLGGGYGSSREFSEKTA
EEMDLFIKNLLEERYKHVKQTLSDYREAIEIMVKELFDKEVITGERVREIISEYEVANNLESRLIPLEEQAS
>Q88Z31 3.4.24.-~~~ftsH~~~ATP-dependent zinc metalloprotease FtsH~~~COG0465
MNNRRNGLFRNSLFYILMFLSLMGIIYFFFGGNSGSQTQNIRYSEFVKQLDKNNVKNVSIQPSGGVYKVTGSYRKARTTS
SANALGIKSASTKTTSFSTTMLENNSTVDQVSKLAAKHDVKVTAKAEESSGIWVTLLMYIAPVILMLFLFYMMMGQAGQG
GGNNRVMNFGKTKAKPADSKQNKVRFSDVAGEEEEKQELVEVVEFLKDPRKFVSLGARIPSGVLLEGPPGTGKTLLAKAV
AGEAGVPFFSISGSDFVEMFVGVGASRVRDLFEQAKKNAPSIIFIDEIDAVGRQRGNGMGGGHDEREQTLNQLLVEMDGF
TGNEGVIVMAATNRSDVLDPALLRPGRFDRKILVGRPDVKGREAILKVHAKNKPLAADVDLKEIAKQTPGFVGADLENLL
NEAALLAARRNKKQVDAADLDEAEDRVIAGPAKHDRVVNKHERETVAYHEAGHTIVGLVLNDARVVHKVTIVPRGRAGGY
AIMLPREDQMLMSKRDAKEQMAGLMGGRAAEEIIFGAQSSGASNDFEQATQIARAMVTQYGMSEKLGPVELENANQQAAY
QQGMGASAFSQHTAQLIDDEVRRLSQEAHQTATDIIESHREQHKLIAEALLKYETLDEKQILSLFKTGKMPEKDSNEFPS
EKAATFEESKRELERREAEKHAQNQSADDKQADSADTTTNVSVAEPSFPSESDASSEVSADSSVNSTANSATESATDSDV
ATSATGLPNAESATPSSQDDTNSQA
>P9WQN3 3.4.24.-~~~ftsH~~~ATP-dependent zinc metalloprotease FtsH~~~COG0465
MNRKNVTRTITAIAVVVLLGWSFFYFSDDTRGYKPVDTSVAITQINGDNVKSAQIDDREQQLRLILKKGNNETDGSEKVI
TKYPTGYAVDLFNALSAKNAKVSTVVNQGSILGELLVYVLPLLLLVGLFVMFSRMQGGARMGFGFGKSRAKQLSKDMPKT
TFADVAGVDEAVEELYEIKDFLQNPSRYQALGAKIPKGVLLYGPPGTGKTLLARAVAGEAGVPFFTISGSDFVEMFVGVG
ASRVRDLFEQAKQNSPCIIFVDEIDAVGRQRGAGLGGGHDEREQTLNQLLVEMDGFGDRAGVILIAATNRPDILDPALLR
PGRFDRQIPVSNPDLAGRRAVLRVHSKGKPMAADADLDGLAKRTVGMTGADLANVINEAALLTARENGTVITGPALEEAV
DRVIGGPRRKGRIISEQEKKITAYHEGGHTLAAWAMPDIEPIYKVTILARGRTGGHAVAVPEEDKGLRTRSEMIAQLVFA
MGGRAAEELVFREPTTGAVSDIEQATKIARSMVTEFGMSSKLGAVKYGSEHGDPFLGRTMGTQPDYSHEVAREIDEEVRK
LIEAAHTEAWEILTEYRDVLDTLAGELLEKETLHRPELESIFADVEKRPRLTMFDDFGGRIPSDKPPIKTPGELAIERGE
PWPQPVPEPAFKAAIAQATQAAEAARSDAGQTGHGANGSPAGTHRSGDRQYGSTQPDYGAPAGWHAPGWPPRSSHRPSYS
GEPAPTYPGQPYPTGQADPGSDESSAEQDDEVSRTKPAHG
>Q83XX3 3.4.24.-~~~ftsH~~~ATP-dependent zinc metalloprotease FtsH~~~
MKNKNRGFFRSSLSYAFVILAVIFLIYSFFGRSDGSVKHLSTTTFLKELKNNKIKDFTIQPGDSGVYTIAGDFKKAQKSS
SSSSSTTTLLSGYQSSVTKFTAYVLPNNSSLKQITTAAQKAGVAVNPKPAASNFWGSMLTLILPTLIMFALLYWMLIGSQ
RGQGGSGGPGGIMSFGRSKAKPADPKQNKIRFADVAGEEEEKQELVEVVEFLKDPKKFTKLGARIPKGVLLEGPPGTGKT
LLAKAVAGEAKTPFFSISGSDFVEMFVGVGASRVRDLFENAKKSAPSIIFIDEIDAVGRRRGAGMGGGNDEREQTLNQIL
IEMDGFEGSEGVIVLASTNRSDVLDPALLRSGRFDRKILVGAPDVKGREAILRVHAKNKPLAADVDLKVIAQQTPGFVGA
DLENLLNEAALLAARNDEKAVTAADIDEAEDRVIAGPAKKDRKTTQDERETVAYHEAGHAIVGLVLNDAQVVRKVTIVPR
GRAGGYALMMPKDERYLMSEKDAKEELAGLMGGRAAEILINHVASSGASNDFQQATQIAREMVTQYGMSDKLGMVQLEGS
SNVFVGDPNNPNPPYSQKTSELIDEEVRRLTNEAYKRAVDIIKSHPKQHKAIAEALLKYETLDEAQIRSLFETGEIPSDL
VKDSQRPARPLSYEESKAALKKNGAVDNKEAEDELKRTKMIRKMKMIPSRVPIQLQKRRQPRRLQLLDAVNNKFD
>Q9WZ49 3.4.24.-~~~ftsH~~~ATP-dependent zinc metalloprotease FtsH~~~COG0465
MNRSNIWNLLFTILIIVTLFWLARFFYVENSPVSKLSYTSFVQMVEDERSVVSEVVIRDDGVLRVYTKDGRVYEVDAPWA
VNDSQLIEKLVSKGIKVSGERSGSSSFWINVLGTLIPTILFIVVWLFIMRSLSGRNNQAFTFTKSRATMYKPSGNKRVTF
KDVGGAEEAIEELKEVVEFLKDPSKFNRIGARMPKGILLVGPPGTGKTLLARAVAGEANVPFFHISGSDFVELFVGVGAA
RVRDLFAQAKAHAPCIVFIDEIDAVGRHRGAGLGGGHDEREQTLNQLLVEMDGFDSKEGIIVMAATNRPDILDPALLRPG
RFDKKIVVDPPDMLGRKKILEIHTRNKPLAEDVNLEIIAKRTPGFVGADLENLVNEAALLAAREGRDKITMKDFEEAIDR
VIAGPARKSKLISPKEKRIIAYHEAGHAVVSTVVPNGEPVHRISIIPRGYKALGYTLHLPEEDKYLVSRNELLDKLTALL
GGRAAEEVVFGDVTSGAANDIERATEIARNMVCQLGMSEELGPLAWGKEEQEVFLGKEITRLRNYSEEVASKIDEEVKKI
VTNCYERAKEIIRKYRKQLDNIVEILLEKETIEGDELRRILSEEFEKVVE
>Q5SI82 3.4.24.-~~~ftsH~~~ATP-dependent zinc metalloprotease FtsH~~~COG0465
MPRAPFSLLALVLGLAFLAWAFSLAGTVGAPSGTVNYTTFLEDLKAGRVKEVVVRAGDTRIQGVLEDGSAFTTYAASPPD
NATLEGWMARGVSVRVEPPQGQNALGFLWPLLLVGLLIGALYYFSRNGRAGPSDSAFSFTKSRARVLTEAPKVTFKDVAG
AEEAKEELKEIVEFLKNPSRFHEMGARIPKGVLLVGPPGVGKTHLARAVAGEARVPFITASGSDFVEMFVGVGAARVRDL
FETAKRHAPCIVFIDEIDAVGRKRGSGVGGGNDEREQTLNQLLVEMDGFEKDTAIVVMAATNRPDILDPALLRPGRFDRQ
IAIDAPDVKGREQILRIHARGKPLAEDVDLALLAKRTPGFVGADLENLLNEAALLAAREGRRKITMKDLEEAADRVMMGP
AKKSLVLSPRDRRITAYHEAGHALAAHFLEHADGVHKVTIVPRGRALGFMMPRREDMLHWSRKRLLDQIAVALAGRAAEE
IVFDDVTTGAENDFRQATELARRMITEWGMHPEFGPVAYAVREDTYLGGYDVRQYSEETAKRIDEAVRRLIEEQYQRVKA
LLLEKREVLERVAETLLERETLTAEEFQRVVEGLPLEAPEEAREEREPPRVVPKVKPGGALGGA
>Q9I1K1 3.4.16.4~~~pbpC~~~Probable peptidoglycan D,D-transpeptidase PbpC~~~
MSSQRRNYRFILVVTLFVLASLAVSGRLVYLQVHDHEFLADQGDLRSIRDLPIPVTRGMITDRNGEPLAVSTEVASIWCN
PREMAAHLDEVPRLAGALHRPAAALLAQLQANPNKRFLYLERGLSPIEASEVMALGITGVHQIKEYKRFYPSSELTAQLI
GLVNIDGRGQEGTELGFNDWLSGKDGVREVAINPRGSLVNSIKVLKTPKASQDVALSIDLRLQFIAYKALEKAVLKFGAH
SGSAVLVNPKSGQILAMANFPSYNPNNRASFAPAFMRNRTLTDTFEPGSVIKPFSMSAALASGKFDENSQVSVAPGWMTI
DGHTIHDVARRDVLTMTGVLINSSNIGMSKVALQIGPKPILEQLGRVGFGAPLSLGFPGENPGYLPFHEKWSNIATASMS
FGYSLAVNTAELAQAYSVFANDGKLVPLSLLRDNPQNQVRQAMDPQIARRIRAMLQTVVEDPKGVVRARVPGYHVAGKSG
TARKASGRGYADKSYRSLFVGMAPASDPQLVLAVMIDSPTRIGYFGGLVSAPTFSDIMAGSLRALAIPPDNLQDSPAVAD
RQHHG
>B8H0A0 3.4.16.4~~~ftsI~~~Probable peptidoglycan D,D-transpeptidase FtsI~~~
MSLSNLGPGGVHSPLWRWVVERVWRLEHAFERSRAAARPEDDTRIRIFLVMGFFGFCFVGVSLGAGWSALFSRAGQGGGY
AQGVEGARGDVVDRNGKLLAVDLAHYALYVDPREVWDAKETRAALGRALPQVPAKRLDKAVFGDHRAFVLGGLTPDEKDA
IFNLGLPGVTFEEQERRMYPLGPTAAHLIGFVDSGGKGLAGAERALDDPIRKAAGGEGGPAQLSIDVRVQAALEDELRKA
AEEFTPKGAVGLVTNVHTGEILGMASWPDYDANKAGGATDDQRLNRAAASVYEMGSTFKAFTVAIGLDTGVATAASTFDA
REPYKLGYRTIHDYHATKAVLNLVEVFQHSSNIGTAMLAERVGGQRLSQYFTNLGLTKPAKVELQESARPLTPRKWDQDT
VASTSFGHGMNISPLALAQAMNALLNGGEMRPLTIRKLPPGVRPEGRRVLSEHTSAEMLKIMRANVVPGEGGSGGKADVP
GLSVGGKTGTGEKYDPAIRRYNHQRQVSSFAATFPTDGPLEADRYFVLILLDEPKGNANSFGFSTGGWVAAPAAGRVIER
IAPFLGVKRKTELVTIANSPKNAAPEAGL
>P0AD68 3.4.16.4~~~ftsI~~~Peptidoglycan D,D-transpeptidase FtsI~~~COG0768
MKAAAKTQKPKRQEEHANFISWRFALLCGCILLALAFLLGRVAWLQVISPDMLVKEGDMRSLRVQQVSTSRGMITDRSGR
PLAVSVPVKAIWADPKEVHDAGGISVGDRWKALANALNIPLDQLSARINANPKGRFIYLARQVNPDMADYIKKLKLPGIH
LREESRRYYPSGEVTAHLIGFTNVDSQGIEGVEKSFDKWLTGQPGERIVRKDRYGRVIEDISSTDSQAAHNLALSIDERL
QALVYRELNNAVAFNKAESGSAVLVDVNTGEVLAMANSPSYNPNNLSGTPKEAMRNRTITDVFEPGSTVKPMVVMTALQR
GVVRENSVLNTIPYRINGHEIKDVARYSELTLTGVLQKSSNVGVSKLALAMPSSALVDTYSRFGLGKATNLGLVGERSGL
YPQKQRWSDIERATFSFGYGLMVTPLQLARVYATIGSYGIYRPLSITKVDPPVPGERVFPESIVRTVVHMMESVALPGGG
GVKAAIKGYRIAIKTGTAKKVGPDGRYINKYIAYTAGVAPASQPRFALVVVINDPQAGKYYGGAVSAPVFGAIMGGVLRT
MNIEPDALTTGDKNEFVINQGEGTGGRS
>P45059 3.4.16.4~~~ftsI~~~Peptidoglycan D,D-transpeptidase FtsI~~~COG0768
MVKFNSSRKSGKSKKTIRKLTAPETVKQNKPQKVFEKCFMRGRYMLSTVLILLGLCALVARAAYVQSINADTLSNEADKR
SLRKDEVLSVRGSILDRNGQLLSVSVPMSAIVADPKTMLKENSLADKERIAALAEELGMTENDLVKKIEKNSKSGYLYLA
RQVELSKANYIRRLKIKGIILETEHRRFYPRVEEAAHVVGYTDIDGNGIEGIEKSFNSLLVGKDGSRTVRKDKRGNIVAH
ISDEKKYDAQDVTLSIDEKLQSMVYREIKKAVSENNAESGTAVLVDVRTGEVLAMATAPSYNPNNRVGVKSELMRNRAIT
DTFEPGSTVKPFVVLTALQRGVVKRDEIIDTTSFKLSGKEIVDVAPRAQQTLDEILMNSSNRGVSRLALRMPPSALMETY
QNAGLSKPTDLGLIGEQVGILNANRKRWADIERATVAYGYGITATPLQIARAYATLGSFGVYRPLSITKVDPPVIGKRVF
SEKITKDIVGILEKVAIKNKRAMVEGYRVGVKTGTARKIENGHYVNKYVAFTAGIAPISDPRYALVVLINDPKAGEYYGG
AVSAPVFSNIMGYALRANAIPQDAEAAENTTTKSAKRIVYIGEHKNQKVN
>G3XD46 3.4.16.4~~~ftsI~~~Peptidoglycan D,D-transpeptidase FtsI~~~
MKLNYFQGALYPWRFCVIVGLLLAMVGAIVWRIVDLHVIDHDFLKGQGDARSVRHIAIPAHRGLITDRNGEPLAVSTPVT
TLWANPKELMTAKERWPQLAAALGQDTKLFADRIEQNAEREFIYLVRGLTPEQGEGVIALKVPGVYSIEEFRRFYPAGEV
VAHAVGFTDVDDRGREGIELAFDEWLAGVPGKRQVLKDRRGRVIKDVQVTKNAKPGKTLALSIDLRLQYLAHRELRNALL
ENGAKAGSLVIMDVKTGEILAMTNQPTYNPNNRRNLQPAAMRNRAMIDVFEPGSTVKPFSMSAALASGRWKPSDIVDVYP
GTLQIGRYTIRDVSRNSRQLDLTGILIKSSNVGISKIAFDIGAESIYSVMQQVGLGQDTGLGFPGERVGNLPNHRKWPKA
ETATLAYGYGLSVTAIQLAHAYAALANDGKSVPLSMTRVDRVPDGVQVISPEVASTVQGMLQQVVEAQGGVFRAQVPGYH
AAGKSGTARKVSVGTKGYRENAYRSLFAGFAPATDPRIAMVVVIDEPSKAGYFGGLVSAPVFSKVMAGALRLMNVPPDNL
PTATEQQQVNAAPAKGGRG
>Q9Z726 ~~~ftsK~~~DNA translocase FtsK~~~COG1674
MIRERKKSRHPRLPTLPLAAKASLYLFFACFSGLSLWSFHRDQPCTQNWIGLLGWSFSSFLLYFFGAAAFFIPLYFLWLS
FLYFRRTPRPLFFYKAAAFLSLPFCSAILLSMLSPVGTLPALLDTRLPKFILGNNPPVSYVGGIPFYLFYEGQSFCLKHL
IGSVGTALIFGFVMLFSVLYLCGGIALLKKKTFQDGVKKAFCSFFQTCFKNLKKLINRRNYLPKPSVPFVSKNPFSCTKS
QPSPRRVSETIILDGSISPLPQEEIPGSKKESFFLTPHPCKRFLTKFVEPQENKAKEGKTIALSSTPTVVRESKGKERAA
LPKLKSLAVPENDLPQYHLLSKNREARPESLQAELERKALILKQTLTSFGIDADLGNICSGPTLAAFEVLPHSGVKVQKI
KSLENDIALKLQASSIRIIAPIPGKAAVGIEIPTPFPQAVNFRDLLEDYQKTNRKLQIPLLLGKKANGDNLWADLATMPH
LIIAGTTGSGKSVCINTIVMSMIMTTLPSEIKLVIIDPKKVELTGYSQLPHMLSPVITESREVYNALVWLVKEMESRYEI
LRYLGLRNIQAFNSRTRNKTIEASYDREIRETMPFMVGIIDELSDLLLSSSQDIETPIIRLAQMARAVGIHLILATQRPS
REVITGLIKANFPSRISFKVSNKVNSQIIIDEPGAENLMGNGDMLVLLPSVFGTIRAQGAYICDEDINKVIQDLCSRFPT
QYVIPSFHAFDDSDSDNSGEKDPLFAQAKTLILQTGNASTTFLQRKLKIGYARAASLIDQLEEARIIGPSEGAKPRQILI
QNPLEG
>O84744 ~~~ftsK~~~DNA translocase FtsK~~~
MGKERKKASVSLSPQTVFAVKTCVYLALACFSGLSLWSFQHNQPYTQNWIGLLGWSLSSFLLYNFGVAAFLIPLNFGWLS
FLNMKRTPAPLAFRKAAAFGAIPVCCAVLLSMISPAQNLPQFLATRVPMVVMDLQPPKAYLGGIPFYLLYDGNSFSLKLL
IGAVGTGLIFLAILLCAIFYLIPKSFVLKKKALLDDLLKFLKNKFYACWNACKKLLKNLVNNKSYVPKPSLRVPSSPSVA
KKEMLKLPTPVISLPLENKDLHDDSSVNRTIFLTPPHPTKRTLSPQKRTDLPNLLPKDSALAPAQTSYKPLPTPSPFVLA
GDAPDLPQYHLLSKRNVHRPESLLEELKKKAAILQQTLASFGIEAAIGNICSGPTLAAFEVLPNTGVKVQKIKALENDIA
LNLQASSIRIIAPIPGKAAVGIEIPNPDPQPVNFRDLLEDYQKGTQRLQVPLLLGKKANGDNFWTDLATMPHLIIAGTTG
SGKSVCINTIVMSLIMTSPPTDIKLVIVDPKKVELTGYSQLPHMLTPVITESKEAHSALIWLVREMELRYEILRFLGLRN
IQSFNSRTRNVDIEASYDKEISEKMPFIVGIIDELSDLLLSSSHDIETPIVRLAQMARAVGIHLILATQRPSRDVITGLI
KANFPSRIAFKVANKVNSQIIIDEPGAENLMGNGDMLVVSPGSFAPVRVQGAYICDDDINKVIKDLCSRFPCKYVIPSFN
TYDDPGSMDPEDLDPLFNQAKTLVLQTGNASTTFLQRKLKIGYARAASIIDQLEEARIVGPSEGAKPRQILVQLSNQDD
>P46889 ~~~ftsK~~~DNA translocase FtsK~~~COG1178
MSQEYIEDKEVTLTKLSSGRRLLEALLILIVLFAVWLMAALLSFNPSDPSWSQTAWHEPIHNLGGMPGAWLADTLFFIFG
VMAYTIPVIIVGGCWFAWRHQSSDEYIDYFAVSLRIIGVLALILTSCGLAAINADDIWYFASGGVIGSLLSTTLQPLLHS
SGGTIALLCVWAAGLTLFTGWSWVTIAEKLGGWILNILTFASNRTRRDDTWVDEDEYEDDEEYEDENHGKQHESRRARIL
RGALARRKRLAEKFINPMGRQTDAALFSGKRMDDDEEITYTARGVAADPDDVLFSGNRATQPEYDEYDPLLNGAPITEPV
AVAAAATTATQSWAAPVEPVTQTPPVASVDVPPAQPTVAWQPVPGPQTGEPVIAPAPEGYPQQSQYAQPAVQYNEPLQQP
VQPQQPYYAPAAEQPAQQPYYAPAPEQPVAGNAWQAEEQQSTFAPQSTYQTEQTYQQPAAQEPLYQQPQPVEQQPVVEPE
PVVEETKPARPPLYYFEEVEEKRAREREQLAAWYQPIPEPVKEPEPIKSSLKAPSVAAVPPVEAAAAVSPLASGVKKATL
ATGAAATVAAPVFSLANSGGPRPQVKEGIGPQLPRPKRIRVPTRRELASYGIKLPSQRAAEEKAREAQRNQYDSGDQYND
DEIDAMQQDELARQFAQTQQQRYGEQYQHDVPVNAEDADAAAEAELARQFAQTQQQRYSGEQPAGANPFSLDDFEFSPMK
ALLDDGPHEPLFTPIVEPVQQPQQPVAPQQQYQQPQQPVPPQPQYQQPQQPVAPQPQYQQPQQPVAPQQQYQQPQQPVAP
QQQYQQPQQPVAPQPQDTLLHPLLMRNGDSRPLHKPTTPLPSLDLLTPPPSEVEPVDTFALEQMARLVEARLADFRIKAD
VVNYSPGPVITRFELNLAPGVKAARISNLSRDLARSLSTVAVRVVEVIPGKPYVGLELPNKKRQTVYLREVLDNAKFRDN
PSPLTVVLGKDIAGEPVVADLAKMPHLLVAGTTGSGKSVGVNAMILSMLYKAQPEDVRFIMIDPKMLELSVYEGIPHLLT
EVVTDMKDAANALRWCVNEMERRYKLMSALGVRNLAGYNEKIAEADRMMRPIPDPYWKPGDSMDAQHPVLKKEPYIVVLV
DEFADLMMTVGKKVEELIARLAQKARAAGIHLVLATQRPSVDVITGLIKANIPTRIAFTVSSKIDSRTILDQAGAESLLG
MGDMLYSGPNSTLPVRVHGAFVRDQEVHAVVQDWKARGRPQYVDGITSDSESEGGAGGFDGAEELDPLFDQAVQFVTEKR
KASISGVQRQFRIGYNRAARIIEQMEAQGIVSEQGHNGNREVLAPPPFD
>A2RJB8 ~~~ftsK~~~DNA translocase FtsK~~~COG1674
MPAKKKTTRRNTKKELQKKAATRKMIAFFVGLLLILFALARLGIVGILLYNIVRLFIGSLAIILLLLVAAIMILSVFRKQ
FLKENKRIIPAIILTFIGLMFVFQIRLHQGLNETFHLIWSDLTAGRVIHFVGSGLIGAIITEPAKALFSVIGVYIIAAVL
WLVAIYLMIPGLFPKMREDLHQRLAKWKEKRAEKVEAKKAVKALKKLEEEKEIPEPQTILPEAENSLFTSAPVEIPINIP
EAPFEENENPVLEENPVDDEPVNFMNTNNYNGNYKLPTIDLLAEVPVKNQSGERENVRKNIGILEETFKSFGIGANVESA
VVGPSITKYEIKLATGTKVSRVVNLSDDLALALAAKDIRIEAPIPGKSLVGVEIPNAEVAMVGFREMWEAGKTNPSKLLE
IPLGKSLDGGIRTFDLTRMPHLLVAGSTGSGKSVAVNGIITSILMKALPSQVKFLMVDPKMVELSVYNDIPHLLIPVVTN
PRKASRALQKVVDQMEERYELFSRYGVRNIAGYNEKVQRYNAESDEKMLELPLIVVIVDELADLMMVASKEVEDAIIRLG
QKARAAGIHMILATQRPSVDVISGLIKANVPSRIAFAVSSGTDSRTILDTNGAEKLLGRGDMLFKPIDENHPVRLQGAFL
SDDDVEAVVTFIKDQSEAQYDESFDPGEVDENQVGTGASNTGSGDPLFEEARNMVIIAQKASTAQLQRALKVGFNRASDL
MNELEAQGIVGPAKGTTPRKVLVSPDGEFIGGVEE
>P9WNA3 ~~~ftsK~~~DNA translocase FtsK~~~COG1674
MSSKTVARSGTRTSRSKATSRGASRSARSAVPRKRSRPVKGVGRPSRRHHRSLLVSTGLACGRAMRAVWMMAAKGTGGAA
RSIGRARDIEPGHRRDGIALVLLGLAVVVAASSWFDAARPLGAWVDALLRTFIGSAVVMLPLVAAAVAVVLMRTSPNPDS
RPRLILGASLIGLSFLGLCHLWAGSPEAPESRLRAAGFIGFAIGGPLSDGLTAWIAAPLLFIGALFGLLLLAGITIREVP
DAMRAMFGTRLLPREYADDFEDFADFDGDDADTVEVARQDFSDGYYDEVPLCSDDGPPAWPSAEVPQDDTATIPEASAGR
GSGRRGRRKDTQVLDRIVEGPYTLPSLDLLISGDPPKKRSAANTHMAGAIGEVLTQFKVDAAVTGCTRGPTVTRYEVELG
PGVKVEKITALQRNIAYAVATESVRMLAPIPGKSAVGIEVPNTDREMVRLADVLTARETRRDHHPLVIGLGKDIEGDFIS
ANLAKMPHLLVAGSTGSGKSSFVNSMLVSLLTRATPEEVRMILIDPKMVELTPYEGIPHLITPIITQPKKAAAALAWLVD
EMEQRYQDMQASRVRHIDDFNDKVRSGAITAPLGSQREYRPYPYVVAIVDELADLMMTAPRDVEDAIVRITQKARAAGIH
LVLATQRPSVDVVTGLIKTNVPSRLAFATSSLTDSRVILDQAGAEKLIGMGDGLFLPMGASKPLRLQGAYVSDEEIHAVV
TACKEQAEPEYTEGVTTAKPTAERTDVDPDIGDDMDVFLQAVELVVSSQFGSTSMLQRKLRVGFAKAGRLMDLMETRGIV
GPSEGSKAREVLVKPDELAGTLAAIRGDGGE
>Q9I0M3 ~~~ftsK~~~DNA translocase FtsK~~~
MRRKNSDLKDSTTASHAAAWRQQLHSRLKEGVLIALGALCLYLWMALLTYDSADPSWSHSSQVDQVQNAAGRLGAVSADI
LFMTLGYFAYLFPLLLGIKTWQVFRRRNLPWEWNTWLFSWRLVGLIFLILAGSALAYIHFHASGHMPASASAGGAIGQSL
GRVAVDALNVQGSTLVFFALFLFGLTVFADLSWFKVMDVTGKITLDFFELIQNAFNRWMGARAERKQLVAQLREVDERVA
EVVAPSVPDRREQSKAKERLLEREEALAKHMSEREKRPPPKIDPPPSPKAPEPSKRVLKEKQAPLFVDTAVEGTLPPLSL
LDPAEVKQKSYSPESLEAMSRLLEIKLKEFGVEVSVDSVHPGPVITRFEIQPAAGVKVSRISNLAKDLARSLAVISVRVV
EVIPGKTTVGIEIPNEDRQMVRFSEVLSSPEYDEHKSTVPLALGHDIGGRPIITDLAKMPHLLVAGTTGSGKSVGVNAML
LSILFKSTPSEARLIMIDPKMLELSIYEGIPHLLCPVVTDMKEAANALRWSVAEMERRYRLMAAMGVRNLAGFNRKVKDA
EEAGTPLTDPLFRRESPDDEPPQLSTLPTIVVVVDEFADMMMIVGKKVEELIARIAQKARAAGIHLILATQRPSVDVITG
LIKANIPTRIAFQVSSKIDSRTILDQGGAEQLLGHGDMLYLPPGTGLPIRVHGAFVSDDEVHRVVEAWKLRGAPDYIEDI
LAGVDEGGGGGGSFDGGDGSGEGSEDDPLYDEAVRFVTESRRASISAVQRKLKIGYNRAARMIEAMEMAGVVTPMNTNGS
REVIAPAPVRD
>P64165 ~~~ftsK~~~DNA translocase FtsK~~~
MAQAKKKSTAKKKTASKKRTNSRKKKNDNPIRYVIAILVVVLMVLGVFQLGIIGRLIDSFFNYLFGYSRYLTYILVLLAT
GFITYSKRIPKTRRTAGSIVLQIALLFVSQLVFHFNSGIKAEREPVLSYVYQSYQHSHFPNFGGGVLGFYLLELSVPLIS
LFGVCIITILLLCSSVILLTNHQHRDVAKVALENIKAWFGSFNEKMSERNQEKQLKREEKARLKEEQKARQNEQPQIKDV
SDFTEVPQERDIPIYGHTENESKSQCQPSRKKRVFDAENSSNNIVNHQADQQEQLTEQTHNSVESENTIEEAGEVTNVSY
VVPPLTLLNQPAKQKATSKAEVQRKGQVLENTLKDFGVNAKVTQIKIGPAVTQYEIQPAQGVKVSKIVNLHNDIALALAA
KDVRIEAPIPGRSAVGIEVPNEKISLVSLKEVLDEKFPSNNKLEVGLGRDISGDPITVPLNEMPHLLVAGSTGSGKSVCI
NGIITSILLNAKPHEVKLMLIDPKMVELNVYNGIPHLLIPVVTNPHKAAQALEKIVAEMERRYDLFQHSSTRNIKGYNEL
IRKQNQELDEKQPELPYIVVIVDELADLMMVAGKEVENAIQRITQMARAAGIHLIVATQRPSVDVITGIIKNNIPSRIAF
AVSSQTDSRTIIGTGGAEKLLGKGDMLYVGNGDSSQTRIQGAFLSDQEVQDVVNYVVEQQQANYVKEMEPDAPVDKSEMK
SEDALYDEAYLFVVEQQKASTSLLQRQFRIGYNRASRLMDDLERNQVIGPQKGSKPRQVLIDLNNDEV
>P64167 ~~~ftsK~~~DNA translocase FtsK~~~COG0697
MANKNTSTTRRRPSKAELERKEAIQRMLISLGIAILLIFAAFKLGAAGITLYNLIRLLVGSLAYLAIFGLLIYLFFFKWI
RKQEGLLSGFFTIFAGLLLIFEAYLVWKYGLDKSVLKGTMAQVVTDLTGFRTTSFAGGGLIGVALYIPTAFLFSNIGTYF
IGSILILVGSLLVSPWSVYDIAEFFSRGFAKWWEGHERRKEERFVKQEEKARQKAEKEARLEQEETEKALLDLPPVDMET
GEILTEEAVQNLPPIPEEKWVEPEIILPQAELKFPEQEDDSDDEDVQVDFSAKEALEYKLPSLQLFAPDKPKDQSKEKKI
VRENIKILEATFASFGIKVTVERAEIGPSVTKYEVKPAVGVRVNRISNLSDDLALALAAKDVRIEAPIPGKSLIGIEVPN
SDIATVSFRELWEQSQTKAENFLEIPLGKAVNGTARAFDLSKMPHLLVAGSTGSGKSVAVNGIIASILMKARPDQVKFMM
VDPKMVELSVYNDIPHLLIPVVTNPRKASKALQKVVDEMENRYELFAKVGVRNIAGFNAKVEEFNSQSEYKQIPLPFIVV
IVDELADLMMVASKEVEDAIIRLGQKARAAGIHMILATQRPSVDVISGLIKANVPSRVAFAVSSGTDSRTILDENGAEKL
LGRGDMLFKPIDENHPVRLQGSFISDDDVERIVNFIKTQADADYDESFDPGEVSENEGEFSDGDAGGDPLFEEAKSLVIE
TQKASASMIQRRLSVGFNRATRLMEELEIAGVIGPAEGTKPRKVLQQ
>Q07867 ~~~ftsL~~~Cell division protein FtsL~~~COG4839
MSNLAYQPEKQQRHAISPEKKVIVKKRASITLGEKVLLVLFAAAVLSVSLLIVSKAYAAYQTNIEVQKLEEQISSENKQI
GDLEKSVADLSKPQRIMDIAKKNGLNLKDKKVKNIQE
>P0AEN4 ~~~ftsL~~~Cell division protein FtsL~~~COG3116
MISRVTEALSKVKGSMGSHERHALPGVIGDDLLRFGKLPLCLFICIILTAVTVVTTAHHTRLLTAQREQLVLERDALDIE
WRNLILEENALGDHSRVERIATEKLQMQHVDPSQENIVVQK
>Q9HVZ6 ~~~ftsL~~~Cell division protein FtsL~~~
MSRLFVKRLPTGSFLMLLLYIGLLLSAIAVAYSTYWNRQLLNSLYSELSVRDKAQAEWGRLILEQSTWTAHSRIESLAVE
QLRMRVPDPAEVRMVAP
>B8GX61 ~~~ftsN~~~Cell division protein FtsN~~~
MSDPHRGAYTPPTDAPLSFDARQPVRGARPLPMTLIISAVVLVTLVVAVVMIYRDGVRGPNDAPQAVGTEVAQMKTPPAE
SSQPKDPASGLQIYHNEEPQPSATFAAPPETPLARPVAPTATTPVETASLPAAKPAAPAPTIESLATAAAAQKPAPKPVQ
VAQAAPKPVAAAPATTATAPKPAAVSTGPASVQIGALSSPALADKAWAEAVRLAPGLAAGKGKKVETVDKNGTTLYRTSV
TGFATREAAKAFCEAIAASGKSCFVK
>P29131 ~~~ftsN~~~Cell division protein FtsN~~~COG3087
MAQRDYVRRSQPAPSRRKKSTSRKKQRNLPAVSPAMVAIAAAVLVTFIGGLYFITHHKKEESETLQSQKVTGNGLPPKPE
ERWRYIKELESRQPGVRAPTEPSAGGEVKTPEQLTPEQRQLLEQMQADMRQQPTQLVEVPWNEQTPEQRQQTLQRQRQAQ
QLAEQQRLAQQSRTTEQSWQQQTRTSQAAPVQAQPRQSKPASSQQPYQDLLQTPAHTTAQSKPQQAAPVARAADAPKPTA
EKKDERRWMVQCGSFRGAEQAETVRAQLAFEGFDSKITTNNGWNRVVIGPVKGKENADSTLNRLKMAGHTNCIRLAAGG
>P26648 ~~~ftsP~~~Cell division protein FtsP~~~COG2132
MSLSRRQFIQASGIALCAGAVPLKASAAGQQQPLPVPPLLESRRGQPLFMTVQRAHWSFTPGTRASVWGINGRYLGPTIR
VWKGDDVKLIYSNRLTENVSMTVAGLQVPGPLMGGPARMMSPNADWAPVLPIRQNAATLWYHANTPNRTAQQVYNGLAGM
WLVEDEVSKSLPIPNHYGVDDFPVIIQDKRLDNFGTPEYNEPGSGGFVGDTLLVNGVQSPYVEVSRGWVRLRLLNASNSR
RYQLQMNDGRPLHVISGDQGFLPAPVSVKQLSLAPGERREILVDMSNGDEVSITCGEAASIVDRIRGFFEPSSILVSTLV
LTLRPTGLLPLVTDSLPMRLLPTEIMAGSPIRSRDISLGDDPGINGQLWDVNRIDVTAQQGTWERWTVRADEPQAFHIEG
VMFQIRNVNGAMPFPEDRGWKDTVWVDGQVELLVYFGQPSWAHFPFYFNSQTLEMADRGSIGQLLVNPVP
>P06136 ~~~ftsQ~~~Cell division protein FtsQ~~~COG1589
MSQAALNTRNSEEEVSSRRNNGTRLAGILFLLTVLTTVLVSGWVVLGWMEDAQRLPLSKLVLTGERHYTRNDDIRQSILA
LGEPGTFMTQDVNIIQTQIEQRLPWIKQVSVRKQWPDELKIHLVEYVPIARWNDQHMVDAEGNTFSVPPERTSKQVLPML
YGPEGSANEVLQGYREMGQMLAKDRFTLKEAAMTARRSWQLTLNNDIKLNLGRGDTMKRLARFVELYPVLQQQAQTDGKR
ISYVDLRYDSGAAVGWAPLPPEESTQQQNQAQAEQQ
>P9WNA1 ~~~ftsQ~~~Cell division protein FtsQ~~~COG1589
MTEHNEDPQIERVADDAADEEAVTEPLATESKDEPAEHPEFEGPRRRARRERAERRAAQARATAIEQARRAAKRRARGQI
VSEQNPAKPAARGVVRGLKALLATVVLAVVGIGLGLALYFTPAMSAREIVIIGIGAVSREEVLDAARVRPATPLLQIDTQ
QVADRVATIRRVASARVQRQYPSALRITIVERVPVVVKDFSDGPHLFDRDGVDFATDPPPPALPYFDVDNPGPSDPTTKA
ALQVLTALHPEVASQVGRIAAPSVASITLTLADGRVVIWGTTDRCEEKAEKLAALLTQPGRTYDVSSPDLPTVK
>G3XDA7 ~~~ftsQ~~~Cell division protein FtsQ~~~
MNGVLLRHQQPGGLGRAPRKPMPRGASRLVAKEPLSVRLPKADFSFLKYLAWPLLLAVLGYGAYRGAEYILPYADRPIAK
VSVEGDLSYISQRAVQQRISPYLAASFFTIDLAGMRGQLEQMPWIAHAEVRRVWPDQVVIRLDEQLPIARWGDEALLNNQ
GQAFTPKELANYEHLPRLHGPQRAQQQVMQQYQLLSQLLRPLGFSIARLEMSDRGGWALTTAQGVEIQIGRDHVVDKIRR
FVSIYDKALKDQISNIARIDLRYPNGLAVAWREPVTPATVATASAVQ
>O07639 2.4.1.129~~~ftsW~~~Probable peptidoglycan glycosyltransferase FtsW~~~COG0772
MLKKMLKSYDYSLIFAIVLLCGFGLVMVYSSSMITAVSRYGVSSNFFFMRQLFALIAGGALFILMALFPYKALAHQKFQK
GILLVSVLALISLFVFGHVAGNAQSWFKIGGMSIQPGEFVKLVVILYLAAVYAKKQSYIDHLLTGVAPPVVMTLIICGLI
AMQPDFGTAMIIGLIATCMILCSGFSGKTLVRLVILGGIVFILVSPIIYLNQDKILTEGRLARFESLEDPFKYANSSGLQ
VINSYYAISSGGIFGLGLGESIQKYGYLPESHTDFIMAVIAEELGIFGVLFVIFLLGFVVIKGFYIARKCEDPFGSLLAI
GISSMIAIQSFINLGGVSGLIPITGVTLPFISYGGSSLVLLLGSMGILANISMFVKYSENKKKKEPLAPKGMKKKQLKKT
VYL
>B8H092 2.4.1.129~~~ftsW~~~Probable peptidoglycan glycosyltransferase FtsW~~~
MASNATHAFARTDRTALGLWWWTTDRWLLGATALLVTLGMLLSFASSPAAAQRIGIDDQFHFALRMCFFATASSVLMLIT
SMLSPRDIRRAAFFIYLGAIAVMIALPFIGHNAKGATRWLQFAGFTLQPSEFMKPALIVLVSWMFAEGQKGEGVPGVSIA
FLLYFIAVALLLIQPDVGQTVLITIAFGAAFWMAGVPISWIMGLGGVALAGLGSTYFLFDHVHARVQKFLSPDQADTHQI
TRAAEAIRAGGLFGRGPGEGVMKRHVPDLHTDFIYSVAAEEYGLIFSWSLIGLFAFVVVRGLYKAMKLNDPFEQVAAAGL
FVLVGQQALINIAVNLNMIPTKGMTLPFISYGGSSMLAMGLTLGMALALLRKRPGAYGASGEFGFGRADA
>P0ABG4 2.4.1.129~~~ftsW~~~Probable peptidoglycan glycosyltransferase FtsW~~~COG0772
MRLSLPRLKMPRLPGFSILVWISTALKGWVMGSREKDTDSLIMYDRTLLWLTFGLAAIGFIMVTSASMPIGQRLTNDPFF
FAKRDGVYLILAFILAIITLRLPMEFWQRYSATMLLGSIILLMIVLVVGSSVKGASRWIDLGLLRIQPAELTKLSLFCYI
ANYLVRKGDEVRNNLRGFLKPMGVILVLAVLLLAQPDLGTVVVLFVTTLAMLFLAGAKLWQFIAIIGMGISAVVLLILAE
PYRIRRVTAFWNPWEDPFGSGYQLTQSLMAFGRGELWGQGLGNSVQKLEYLPEAHTDFIFAIIGEELGYVGVVLALLMVF
FVAFRAMSIGRKALEIDHRFSGFLACSIGIWFSFQALVNVGAAAGMLPTKGLTLPLISYGGSSLLIMSTAIMMLLRIDYE
TRLEKAQAFVRGSR
>P9WN97 2.4.1.129~~~ftsW~~~Probable peptidoglycan glycosyltransferase FtsW~~~COG0772
MLTRLLRRGTSDTDGSQTRGAEPVEGQRTGPEEASNPGSARPRTRFGAWLGRPMTSFHLIIAVAALLTTLGLIMVLSASA
VRSYDDDGSAWVIFGKQVLWTLVGLIGGYVCLRMSVRFMRRIAFSGFAITIVMLVLVLVPGIGKEANGSRGWFVVAGFSM
QPSELAKMAFAIWGAHLLAARRMERASLREMLIPLVPAAVVALALIVAQPDLGQTVSMGIILLGLLWYAGLPLRVFLSSL
AAVVVSAAILAVSAGYRSDRVRSWLNPENDPQDSGYQARQAKFALAQGGIFGDGLGQGVAKWNYLPNAHNDFIFAIIGEE
LGLVGALGLLGLFGLFAYTGMRIASRSADPFLRLLTATTTLWVLGQAFINIGYVIGLLPVTGLQLPLISAGGTSTAATLS
LIGIIANAARHEPEAVAALRAGRDDKVNRLLRLPLPEPYLPPRLEAFRDRKRANPQPAQTQPARKTPRTAPGQPARQMGL
PPRPGSPRTADPPVRRSVHHGAGQRYAGQRRTRRVRALEGQRYG
>Q81X30 ~~~ftsX~~~Cell division protein FtsX~~~COG2177
MKAKTLSRHLREGVKNLSRNGWMTFASVSAVTVTLLLVGVFLTAIMNMNHFATKVEQDVEIRVHIDPAAKEADQKKLEDD
MSKIAKVESIKYSSKEEELKRLIKSLGDSGKTFELFEQDNPLKNVFVVKAKEPTDTATIAKKIEKMQFVSNVQYGKGQVE
RLFDTVKTGRNIGIVLIAGLLFTAMFLISNTIKITIYARSTEIEIMKLVGATNWFIRWPFLLEGLFLGVLGSIIPIGLIL
VTYNSLQGMFNEKLGGTIFELLPYSPFVFQLAGLLVLIGALIGMWGSVMSIRRFLKV
>O34876 ~~~ftsX~~~Cell division protein FtsX~~~COG2177
MIKILGRHLRESFKSLGRNTWMTFASISAVTVTLILVGVFLVIMLNLNNMATNAEKQVEIKVLIDLTADQKAQDKLQNDI
KELKGIQSVTFSSKEKELDQLVDSFGDSGKSLTMKDQENPLNDAFVVKTTDPHDTPNVAKKIEKMDHVYKVTYGKEEVSR
LFKVVGVSRNIGIALIIGLVFTAMFLISNTIKITIFARRKEIEIMKLVGATNWFIRWPFFLEGLLLGVFGSVIPIALVLS
TYQYVIGWVVPKVQGSFVSLLPYNPFVFQVSLVLIAIGAVIGVWGSLTSIRKFLRV
>P0AC30 ~~~ftsX~~~Cell division protein FtsX~~~COG2177
MNKRDAINHIRQFGGRLDRFRKSVGGSGDGGRNAPKRAKSSPKPVNRKTNVFNEQVRYAFHGALQDLKSKPFATFLTVMV
IAISLTLPSVCYMVYKNVNQAATQYYPSPQITVYLQKTLDDDAAAGVVAQLQAEQGVEKVNYLSREDALGEFRNWSGFGG
ALDMLEENPLPAVAVVIPKLDFQGTESLNTLRDRITQINGIDEVRMDDSWFARLAALTGLVGRVSAMIGVLMVAAVFLVI
GNSVRLSIFARRDSINVQKLIGATDGFILRPFLYGGALLGFSGALLSLILSEILVLRLSSAVAEVAQVFGTKFDINGLSF
DECLLLLLVCSMIGWVAAWLATVQHLRHFTPE
>A5U7B6 ~~~ftsX~~~Cell division protein FtsX~~~COG2177
MRFGFLLNEVLTGFRRNVTMTIAMILTTAISVGLFGGGMLVVRLADSSRAIYLDRVESQVFLTEDVSANDSSCDTTACKA
LREKIETRSDVKAVRFLNRQQAYDDAIRKFPQFKDVAGKDSFPASFIVKLENPEQHKDFDTAMKGQPGVLDVLNQKELID
RLFAVLDGLSNAAFAVALVQAIGAILLIANMVQVAAYTRRTEIGIMRLVGASRWYTQLPFLVEAMLAATMGVGIAVAGLM
VVRALFLENALNQFYQANLIAKVDYADILFITPWLLLLGVAMSGLTAYLTLRLYVRR
>P9WG19 ~~~ftsX~~~Cell division protein FtsX~~~COG2177
MRFGFLLNEVLTGFRRNVTMTIAMILTTAISVGLFGGGMLVVRLADSSRAIYLDRVESQVFLTEDVSANDSSCDTTACKA
LREKIETRSDVKAVRFLNRQQAYDDAIRKFPQFKDVAGKDSFPASFIVKLENPEQHKDFDTAMKGQPGVLDVLNQKELID
RLFAVLDGLSNAAFAVALVQAIGAILLIANMVQVAAYTRRTEIGIMRLVGASRWYTQLPFLVEAMLAATMGVGIAVAGLM
VVRALFLENALNQFYQANLIAKVDYADILFITPWLLLLGVAMSGLTAYLTLRLYVRR
>Q04LE4 ~~~ftsX~~~Cell division protein FtsX~~~COG2177
MISRFFRHLFEALKSLKRNGWMTVAAVSSVMITLTLVAIFASVIFNTAKLATDIENNVRVVVYIRKDVEDNSQTIEKEGQ
TVTNNDYHKVYDSLKNMSTVKSVTFSSKEEQYEKLTEIMGDNWKIFEGDANPLYDAYIVEANAPNDVKTIAEDAKKIEGV
SEVQDGGANTERLFKLASFIRVWGLGIAALLIFIAVFLISNTIRITIISRSREIQIMRLVGAKNSYIRGPFLLEGAFIGL
LGAIAPSVLVFIVYQIVYQSVNKSLVGQNLSMISPDLFSPLMIALLFVIGVFIGSLGSGISMRRFLKI
>P51835 3.6.5.4~~~ftsY~~~Signal recognition particle receptor FtsY~~~COG0552
MSFFKKLKEKITKQTDSVSEKFKDGLEKTRNSFQNKVNDLVSRYRKVDEDFFEELEEVLISADVGFTTVMELIDELKKEV
KRRNIQDPKEVQSVISEKLVEIYNSGDEQISELNIQDGRLNVILLVGVNGVGKTTTIGKLAHKMKQEGKSVVLAAGDTFR
AGAIEQLEVWGERTGVPVIKQTAGSDPAAVIYDAVHAAKARNADVLICDTAGRLQNKVNLMKELEKVKRVIEREVPEAPH
EVLLALDATTGQNAMAQAKEFSKATNVTGIALTKLDGTAKGGIVLAIRNELHIPVKLVGLGEKVDDLQEFDPESYVYGLF
SDLVEKADD
>P10121 3.6.5.4~~~ftsY~~~Signal recognition particle receptor FtsY~~~COG0552
MAKEKKRGFFSWLGFGQKEQTPEKETEVQNEQPVVEEIVQAQEPVKASEQAVEEQPQAHTEAEAETFAADVVEVTEQVAE
SEKAQPEAEVVAQPEPVVEETPEPVAIEREELPLPEDVNAEAVSPEEWQAEAETVEIVEAAEEEAAKEEITDEELETALA
AEAAEEAVMVVPPAEEEQPVEEIAQEQEKPTKEGFFARLKRSLLKTKENLGSGFISLFRGKKIDDDLFEELEEQLLIADV
GVETTRKIITNLTEGASRKQLRDAEALYGLLKEEMGEILAKVDEPLNVEGKAPFVILMVGVNGVGKTTTIGKLARQFEQQ
GKSVMLAAGDTFRAAAVEQLQVWGQRNNIPVIAQHTGADSASVIFDAIQAAKARNIDVLIADTAGRLQNKSHLMEELKKI
VRVMKKLDVEAPHEVMLTIDASTGQNAVSQAKLFHEAVGLTGITLTKLDGTAKGGVIFSVADQFGIPIRYIGVGERIEDL
RPFKADDFIEALFARED
>Q6MTB9 3.6.5.4~~~ftsY~~~Signal recognition particle receptor FtsY~~~COG0552
MGFWAKLKEKLTKKTNQVEQDEPILDQQDQQDQQEEQEQIIEKEIEQIKENKIKKTKTSETKKQEKPIETLKEKKKREKQ
KEKDKKVEKAMLKSAFNFSKDIKKLSKKYKQADDEFFEELEDVLIQTDMGMKMVLKVSNLVRKKTKRDTSFENIKDALVE
SLYQAYTDNDWTNKKYRIDFKENRLNIFMLVGVNGTGKTTSLAKMANYYAELGYKVLIAAADTFRAGATQQLEEWIKTRL
NNKVDLVKANKLNADPASVVFDAIKKAKEQNYDLLLIDTAGRLQNKVNLMAELEKMNKIIQQVEKSAPHEVLLVIDATTG
QNGVIQAEEFSKVADVSGIILTKMDSTSKGGIGLAIKELLNIPIKMIGVGEKVDDLLAFDIDQYIVHLSSGFMQGDEVEK
>P9WGD9 3.6.5.4~~~ftsY~~~Signal recognition particle receptor FtsY~~~COG0552
MWEGLWIATAVIAALVVIAALTLGLVLYRRRRISLSPRPERGVVDRSGGYTASSGITFSQTPTTQPAERIDTSGLPAVGD
DATVPRDAPKRTIADVHLPEFEPEPQAPEVPEADAIAPPEGRLERLRGRLARSQNALGRGLLGLIGGGDLDEDSWQDVED
TLLVADLGPAATASVVSQLRSRLASGNVRTEADARAVLRDVLINELQPGMDRSIRALPHAGHPSVLLVVGVNGTGKTTTV
GKLARVLVADGRRVVLGAADTFRAAAADQLQTWAARVGAAVVRGPEGADPASVAFDAVDKGIAAGADVVLIDTAGRLHTK
VGLMDELDKVKRVVTRRASVDEVLLVLDATIGQNGLAQARVFAEVVDISGAVLTKLDGTAKGGIVFRVQQELGVPVKLVG
LGEGPDDLAPFEPAAFVDALLG
>P14929 3.6.5.4~~~ftsY~~~Signal recognition particle receptor FtsY~~~
MFSFFRRKKKQETPALEEAQVQETAAKVESEVAQIVGNIKEDVESLAESVKGRAESAVETVSGAVEQVKETVAEMPSEAG
EAAERVESAKEAVAETVGEAVGQVQEAVATTEEHKLGWAARLKQGLAKSRDKMAKSLAGVFGGGQIGEDLYEELETVLIT
GDMGMEATEYLMKDVRGRVSLKGLKDGNELRGALKEALYDLIKPLEKPLVLPETKEPFVIMLAGINGAGKTTSIGKLAKY
FQAQGKSVLLAAGDTFRAAAREQLQAWGGRNNVTVISQTTGDSAAVCFDAVQAAKARIDIVLADTAGRLPTQLHLMEEIK
KVKRVLQKAIPGAPHEIIVVLDANIGQNAVNQVKAFDDALGLTGLIVTKLDGTAKGGILAALASDRPVPVRYIGVGEGID
DLRPFDARAFVDRLLD
>P83749 3.6.5.4~~~ftsY~~~Signal recognition particle receptor FtsY~~~
MGFFDRLKAGLAKTRERLLKAIPWGGNLEEVLEELEMALLAADVGLSATEEILQEVRASGRKDLKEAVKEKLVGMLEPDE
RRATLRKLGFNPQKPKPVEPKGRVVLVVGVNGVGKTTTIAKLGRYYQNLGKKVMFCAGDTFRAAGGTQLSEWGKRLSIPV
IQGPEGTDPAALAYDAVQAMKARGYDLLFVDTAGRLHTKHNLMEELKKVKRAIAKADPEEPKEVWLVLDAVTGQNGLEQA
KKFHEAVGLTGVIVTKLDGTAKGGVLIPIVRTLKVPIKFVGVGEGPDDLQPFDPEAFVEALLED
>Q3BK72 3.6.1.-~~~ftsZ-like~~~FtsZ-like protein~~~COG0206
MFGVEPISFCEDMTMIRPRIIVIGVGGAGGNAVNNMILSKIEGVEFIAANTDAQALGLSLADRRIPLGGYVTKGLGAGSR
PELGRSAAQESIDDILTAIDDANMVFITAGMGGGTGSGAAPVIAQAARERGILTIGVVTKPFHFEGGHRMGTAEAAIEEL
QHVVDTLIIIPNQNLFRIASERTTFIDAFKMADNVLNSGVRSVTDLVVKPGLINLDFADIRIVMSEMGKAIMGTGEAEGE
PRAVKAAEAAISNPLLGDTSIAGAKGVLINITGGMDMTLFEVDEAANRIRTEVAPDANIIFGSTFDEKLDGKMRVSVVAT
GIA
>O66809 ~~~ftsZ~~~Cell division protein FtsZ~~~COG0206
MEEFVNPCKIKVIGVGGGGSNAVNRMYEDGIEGVELYAINTDVQHLSTLKVPNKIQIGEKVTRGLGAGAKPEVGEEAALE
DIDKIKEILRDTDMVFISAGLGGGTGTGAAPVIAKTAKEMGILTVAVATLPFRFEGPRKMEKALKGLEKLKESSDAYIVI
HNDKIKELSNRTLTIKDAFKEVDSVLSKAVRGITSIVVTPAVINVDFADVRTTLEEGGLSIIGMGEGRGDEKADIAVEKA
VTSPLLEGNTIEGARRLLVTIWTSEDIPYDIVDEVMERIHSKVHPEAEIIFGAVLEPQEQDFIRVAIVATDFPEEKFQVG
EKEVKFKVIKKEEKEEPKEEPKPLSDTTYLEEEEIPAVIRRKNKRLL
>P77817 ~~~ftsZ~~~Cell division protein FtsZ~~~
MFELVDNVPQSAVIKVIGVGGGGGNAVNHMAATSIEGIEFICANTDAQALKNITARTVLQLGSGVTKGLGAGANPEVGRE
AAMEDRERIAEVLQGTDMVFITTGMGGGTGTGAAPVIAEVAKGLGILTVAVVTRPFPFEGRKRMQVAEEGIRLLAEHVDS
LITIPNEKLLTILGKDASLLSAFAKADDVLAGAVRGISDIIKLSGMINVDFADVKTVMSEMGMAMMGTGFASGPNRAREA
TEAAIRNPLLEDVHLQGARGILVNITAGPDLSLGEYSDVGNIIEQFASDQAMVKVGTVIDPDMRDELHVTVVATGLGTRA
DKPMKVVDNTLQPAGAAAAAPAVPRGDQTVNYKDYERPTVQRQSHAASATAAKINPQDDLDYLDIPAFLRRQAD
>P17865 ~~~ftsZ~~~Cell division protein FtsZ~~~COG0206
MLEFETNIDGLASIKVIGVGGGGNNAVNRMIENEVQGVEYIAVNTDAQALNLSKAEVKMQIGAKLTRGLGAGANPEVGKK
AAEESKEQIEEALKGADMVFVTAGMGGGTGTGAAPVIAQIAKDLGALTVGVVTRPFTFEGRKRQLQAAGGISAMKEAVDT
LIVIPNDRILEIVDKNTPMLEAFREADNVLRQGVQGISDLIATPGLINLDFADVKTIMSNKGSALMGIGIATGENRAAEA
AKKAISSPLLEAAIDGAQGVLMNITGGTNLSLYEVQEAADIVASASDQDVNMIFGSVINENLKDEIVVTVIATGFIEQEK
DVTKPQRPSLNQSIKTHNQSVPKREPKREEPQQQNTVSRHTSQPADDTLDIPTFLRNRNKRG
>P94337 ~~~ftsZ~~~Cell division protein FtsZ~~~COG0206
MTSPNNYLAKIKVVGVGGGGVNAVNRMIEEGLKGVEFIAVNTDSQALMFSDADVKLDIGREATRGLGAGANPEVGRASAE
DHKNEIEETIKGADMVFVTAGEGGGTGTGAAPVVAGIAKKMGALTIGVVTKPFEFEGRRRTRQAEEGIAALKEVCDTLIV
IPNDRLLELGDANLSIMEAFRAADEVLHNGVQGITNLITIPGVINVDFADVRSVMSEAGSALMGVGSARGDNRVVSATEQ
AINSPLLEATMDGATGVLLSFAGGSDLGLMEVNAAASMVRERSDEDVNLIFGTIIDDNLGDEVRVTVIATGFDAARASAA
ENRRAGISAAPAAEPVQQQVPTTNATLPPEKESIFGGAREENDPYLSRSAGARHRIEETRSGGGLFTTGNDRDYRRDERR
EDHRDERRDERRDDRSYDRRDDRRDDRRDDRGDDLDVPSFLQ
>Q83F12 ~~~ftsZ~~~Cell division protein FtsZ~~~COG0206
MFELGETSPQNAQIKVIGIGGGGGNAIEHMIAENIDGVEFVCANTDSQALGRSNARVVLQLGDEITKGLGAGADPSVGRQ
AAEEARDRIREILEGTDMVFLTAGMGGGTGTGAAPIFAEVAKELGILTVAVVTKPFVFEGKKRMDVAEEGIKALGNYVDS
LITIPNNKLLNVLGKNITLLNAFKAANNVLLGAVQGIADLITRPGLINVDFADVRTVMSEMGMAMMGTGVSSGENRAREA
AEAAIASPLLEDVDFTGARGVLVNITAGMDLSIGEFEQVGEAVKAFASETATVVIGTVIDPDMSDELRVTVVVTGLGSHA
GGGAGVPLKPVKNTKNDGTLDYHQLDRPTYMRNQEPSKRTVDLEEQRDRDFEYLDIPAFLRRLEED
>P0A9A6 ~~~ftsZ~~~Cell division protein FtsZ~~~COG0206
MFEPMELTNDAVIKVIGVGGGGGNAVEHMVRERIEGVEFFAVNTDAQALRKTAVGQTIQIGSGITKGLGAGANPEVGRNA
ADEDRDALRAALEGADMVFIAAGMGGGTGTGAAPVVAEVAKDLGILTVAVVTKPFNFEGKKRMAFAEQGITELSKHVDSL
ITIPNDKLLKVLGRGISLLDAFGAANDVLKGAVQGIAELITRPGLMNVDFADVRTVMSEMGYAMMGSGVASGEDRAEEAA
EMAISSPLLEDIDLSGARGVLVNITAGFDLRLDEFETVGNTIRAFASDNATVVIGTSLDPDMNDELRVTVVATGIGMDKR
PEITLVTNKQVQQPVMDRYQQHGMAPLTQEQKPVAKVVNDNAPQTAKEPDYLDIPAFLRKQAD
>V6F5E5 ~~~ftsZ~~~Cell division protein FtsZ~~~COG0206
MLNFLPGNPQDLKPKITVIGVGGAGGNAVNNMIASRLEGVEFIVANTDAQAINQSRTERRVQLGTTVAQGLGAGSRPEIG
RAAAEESLEEVIGQIAGANMVFITAGMGGGTGSGAAPVIARAARDHGILTVGVVTKPFHFEGAHRMRTAEGAIEELSQYV
DTLIIIPNQNLFRVATERTTFADAFKMADDVLYSGVRGVTDLMIMPGLINLDFADIRTVMSEMGKAMMGTGEAEGDKRAI
EAAEAAISNPLLDDTSMKGAKGVLINITGGMDMTLFEVDEAANRIRDEVDPEANIIFGSTFDEKLNGKMRVSVVATGIAS
EAAAQPKPTVVSLNTPQAQPQPRVAAGGTAGAGFRPAVVTAQAAPAAAVAVAQAQPQMEARTVAQPAPQPAHQPVVTAQV
RVQPAAARPAQQPMAETFRPDPQLRLDPVLERPVPATTSLQADFRADPDMGHLSQAVSHIAETAQAAPQPQRQPEIQRQQ
APQPQRQPEPEARRSGGLFGLLRRPAAAQPAPQPQRHEPAPMAQQPRQEPARMGNMATRSEPSVARAGEDLDIPAFLRRQ
AN
>A5U4H7 ~~~ftsZ~~~Cell division protein FtsZ~~~COG0206
MTPPHNYLAVIKVVGIGGGGVNAVNRMIEQGLKGVEFIAINTDAQALLMSDADVKLDVGRDSTRGLGAGADPEVGRKAAE
DAKDEIEELLRGADMVFVTAGEGGGTGTGGAPVVASIARKLGALTVGVVTRPFSFEGKRRSNQAENGIAALRESCDTLIV
IPNDRLLQMGDAAVSLMDAFRSADEVLLNGVQGITDLITTPGLINVDFADVKGIMSGAGTALMGIGSARGEGRSLKAAEI
AINSPLLEASMEGAQGVLMSIAGGSDLGLFEINEAASLVQDAAHPDANIIFGTVIDDSLGDEVRVTVIAAGFDVSGPGRK
PVMGETGGAHRIESAKAGKLTSTLFEPVDAVSVPLHTNGATLSIGGDDDDVDVPPFMRR
>P9WN95 ~~~ftsZ~~~Cell division protein FtsZ~~~COG0206
MTPPHNYLAVIKVVGIGGGGVNAVNRMIEQGLKGVEFIAINTDAQALLMSDADVKLDVGRDSTRGLGAGADPEVGRKAAE
DAKDEIEELLRGADMVFVTAGEGGGTGTGGAPVVASIARKLGALTVGVVTRPFSFEGKRRSNQAENGIAALRESCDTLIV
IPNDRLLQMGDAAVSLMDAFRSADEVLLNGVQGITDLITTPGLINVDFADVKGIMSGAGTALMGIGSARGEGRSLKAAEI
AINSPLLEASMEGAQGVLMSIAGGSDLGLFEINEAASLVQDAAHPDANIIFGTVIDDSLGDEVRVTVIAAGFDVSGPGRK
PVMGETGGAHRIESAKAGKLTSTLFEPVDAVSVPLHTNGATLSIGGDDDDVDVPPFMRR
>P72079 ~~~ftsZ~~~Cell division protein FtsZ~~~
MEFVYDVAESAVSPAVIKVIGLGGGGCNAINNMVANNVRSVEFISANTDAQSLAKNHAAKRIQLGTNLTRGLGAGANPDI
GRAAAQEDREAIEEAIRGANMLFITTGMGGGTGTGSAPVVAEIAKSLGILTVAVVTRPFSYEGKRVHVAQAGLEQLKEHV
DSLIIIPNDKLMTALGEDVTMREAFRAADNVLRDAVAGISEVVTCPSEIINLDFADVKTVMSNRGIAMMGSGYAQGIDRA
RMATDQAISSPLLDDVTLDGARGVLVNITTAPGCLKMSELSEVMKIVNQSAHPDLECKFGAAEDETMSEDAIRITIIATG
LKEKGAVDPTPEREVEAVAPSKQEQSHIVEGMIRTNRGIRTMNLTAADFDNQSVLDDFEIPAILRRQHNSDK
>P47204 ~~~ftsZ~~~Cell division protein FtsZ~~~
MFELVDNIAQTAVIKVIGVGGGGGNAVNHMAKNNVEGVEFICANTDAQALKNIAARTVLQLGPGVTKGLGAGANPEVGRQ
AALEDRERISEVLEGADMVFITTGMGGGTGTGAAPIIAEVAKEMGILTVAVVTRPFPFEGRKRMQIADEGIRALAESVDS
LITIPNEKLLTILGKDASLLAAFAKADDVLAGAVRGISDIIKRPGMINVDFADVKTVMSEMGMAMMGTGCASGPNRAREA
TEAAIRNPLLEDVNLQGARGILVNITAGPDLSLGEYSDVGNIIEQFASEHATVKVGTVIDADMRDELHVTVVATGLGARL
EKPVKVVDNTVQGSAAQAAAPAQREQQSVNYRDLDRPTVMRNQSHGSAATAAKLNPQDDLDYLDIPAFLRRQAD
>Q2FZ89 ~~~ftsZ~~~Cell division protein FtsZ~~~COG0206
MLEFEQGFNHLATLKVIGVGGGGNNAVNRMIDHGMNNVEFIAINTDGQALNLSKAESKIQIGEKLTRGLGAGANPEIGKK
AAEESREQIEDAIQGADMVFVTSGMGGGTGTGAAPVVAKIAKEMGALTVGVVTRPFSFEGRKRQTQAAAGVEAMKAAVDT
LIVIPNDRLLDIVDKSTPMMEAFKEADNVLRQGVQGISDLIAVSGEVNLDFADVKTIMSNQGSALMGIGVSSGENRAVEA
AKKAISSPLLETSIVGAQGVLMNITGGESLSLFEAQEAADIVQDAADEDVNMIFGTVINPELQDEIVVTVIATGFDDKPT
SHGRKSGSTGFGTSVNTSSNATSKDESFTSNSSNAQATDSVSERTHTTKEDDIPSFIRNREERRSRRTRR
>P0A029 ~~~ftsZ~~~Cell division protein FtsZ~~~
MLEFEQGFNHLATLKVIGVGGGGNNAVNRMIDHGMNNVEFIAINTDGQALNLSKAESKIQIGEKLTRGLGAGANPEIGKK
AAEESREQIEDAIQGADMVFVTSGMGGGTGTGAAPVVAKIAKEMGALTVGVVTRPFSFEGRKRQTQAAAGVEAMKAAVDT
LIVIPNDRLLDIVDKSTPMMEAFKEADNVLRQGVQGISDLIAVSGEVNLDFADVKTIMSNQGSALMGIGVSSGENRAVEA
AKKAISSPLLETSIVGAQGVLMNITGGESLSLFEAQEAADIVQDAADEDVNMIFGTVINPELQDEIVVTVIATGFDDKPT
SHGRKSGSTGFGTSVNTSSNATSKDESFTSNSSNAQATDSVSERTHTTKEDDIPSFIRNREERRSRRTRR
>P99108 ~~~ftsZ~~~Cell division protein FtsZ~~~
MLEFEQGFNHLATLKVIGVGGGGNNAVNRMIDHGMNNVEFIAINTDGQALNLSKAESKIQIGEKLTRGLGAGANPEIGKK
AAEESREQIEDAIQGADMVFVTSGMGGGTGTGAAPVVAKIAKEMGALTVGVVTRPFSFEGRKRQTQAAAGVEAMKAAVDT
LIVIPNDRLLDIVDKSTPMMEAFKEADNVLRQGVQGISDLIAVSGEVNLDFADVKTIMSNQGSALMGIGVSSGENRAVEA
AKKAISSPLLETSIVGAQGVLMNITGGESLSLFEAQEAADIVQDAADEDVNMIFGTVINPELQDEIVVTVIATGFDDKPT
SHGRKSGSTGFGTSVNTSSNATSKDESFTSNSSNAQATDSVSERTHTTKEDDIPSFIRNREERRSRRTRR
>Q6GHP9 ~~~ftsZ~~~Cell division protein FtsZ~~~
MLEFEQGFNHLATLKVIGVGGGGNNAVNRMIDHGMNNVEFIAINTDGQALNLSKAESKIQIGEKLTRGLGAGANPEIGKK
AAEESREQIEDAIQGADMVFVTSGMGGGTGTGAAPVVAKIAKEMGALTVGVVTRPFSFEGRKRQTQAAAGVEAMKAAVDT
LIVIPNDRLLDIVDKSTPMMEAFKEADNVLRQGVQGISDLIAVSGEVNLDFADVKTIMSNQGSALMGIGVSSGENRAVEA
AKKAISSPLLETSIVGAQGVLMNITGGESLSLFEAQEAADIVQDAADEDVNMIFGTVINPELQDEIVVTVIATGFDDKPT
SHGRKSGSTGFGTSVNTSSNATSKDESFTSNSSNAQATDSVSERTHTTKEDDIPSFIRNREERRSRRTRR
>P0A031 ~~~ftsZ~~~Cell division protein FtsZ~~~
MLEFEQGFNHLATLKVIGVGGGGNNAVNRMIDHGMNNVEFIAINTDGQALNLSKAESKIQIGEKLTRGLGAGANPEIGKK
AAEESREQIEDAIQGADMVFVTSGMGGGTGTGAAPVVAKIAKEMGALTVGVVTRPFSFEGRKRQTQAAAGVEAMKAAVDT
LIVIPNDRLLDIVDKSTPMMEAFKEADNVLRQGVQGISDLIAVSGEVNLDFADVKTIMSNQGSALMGIGVSSGENRAVEA
AKKAISSPLLETSIVGAQGVLMNITGGESLSLFEAQEAADIVQDAADEDVNMIFGTVINPELQDEIVVTVIATGFDDKPT
SHGRKSGSTGFGTSVNTSSNATSKDESFTSNSSNAQATDSVSERTHTTKEDDIPSFIRNREERRSRRTRR
>Q5HQ06 ~~~ftsZ~~~Cell division protein FtsZ~~~COG0206
MLEFEQGFNHLATLKVIGVGGGGNNAVNRMIDHGMNNVEFIAINTDGQALNLSKAESKIQIGEKLTRGLGAGANPEIGKK
AAEESREQIEDAIQGADMVFVTAGMGGGTGTGAAPVVAKIAKEMGALTVGVVTRPFGFEGRKRQTQAAAGVESMKAAVDT
LIVIPNDRLLDIVDKSTPMMEAFKEADNVLRQGVQGISDLIAVSGEVNLDFADVKTIMSNQGSALMGIGVSSGENRAVEA
AKKAISSPLLETSIVGAQGVLMNITGGESLSLFEAQEAADIVQDAADEDVNMIFGTVINPELQDEIVVTVIATGFEDKPS
SQGRKATSTGFGSSVNSSSNHQSGASAKEDSFSAHTSHSQSSESVSERSHTTKDDDIPSFIRNREERRSRRTRR
>A0A0H2ZNE0 ~~~ftsZ~~~Cell division protein FtsZ~~~COG0206
MTFSFDTAAAQGAVIKVIGVGGGGGNAINRMVDEGVTGVEFIAANTDVQALSSTKAETVIQLGPKLTRGLGAGGQPEVGR
KAAEESEETLTEAISGADMVFITAGMGGGSGTGAAPVIARIAKDLGALTVGVVTRPFGFEGSKRGQFAVEGINQLREHVD
TLLIISNNNLLEIVDKKTPLLEALSEADNVLRQGVQGITDLITNPGLINLDFADVKTVMANKGNALMGIGIGSGEERVVE
AARKAIYSPLLETTIDGAEDVIVNVTGGLDLTLIEAEEASQIVNQAAGQGVNIWLGTSIDESMRDEIRVTVVATGVRQDR
VEKVVAPQARSATNYRETVKPAHSHGFDRHFDMAETVELPKQNPRRLEPTQASAFGDWDLRRESIVRTTDSVVSPVERFE
APISQDEDELDTPPFFKNR
>P73456 ~~~ftsZ~~~Cell division protein FtsZ~~~COG0206
MTLNNDLPLNNIGFTGSGLNDGTEGLDDLFSSSIVDNEPLEALVETPTFASPSPNLKRDQIVPSNIAKIKVIGVGGGGCN
AVNRMIASGVTGIDFWAINTDSQALTNTNAPDCIQIGQKLTRGLGAGGNPAIGQKAAEESRDEIARSLEGTDLVFITAGM
GGGTGTGAAPIVAEVAKEMGCLTVGIVTRPFTFEGRRRAKQAEEGINALQSRVDTLIVIPNNQLLSVIPAETPLQEAFRV
ADDILRQGVQGISDIIIIPGLVNVDFADVRAVMADAGSALMGIGVGSGKSRAKEAATAAISSPLLESSIQGAKGVVFNVT
GGTDLTLHEVNVAAEIIYEVVDADANIIFGAVIDDRLQGEMRITVIATGFNGEKEKPQAKTSSKPVLSGPPAGVETVPST
TTPEDPLGEIPMAPELDIPDFLQKRRFPRR
>O08398 ~~~ftsZ~~~Cell division protein FtsZ~~~COG0206
MGFDLDVEKKKENRNIPQANNLKIKVIGVGGAGNNAINRMIEIGIHGVEFVAVNTDLQVLEASNADVKIQIGENITRGLG
AGGRPEIGEQAALESEEKIREVLQDTHMVFITAGFGGGTGTGASPVIAKIAKEMGILTVAIVTTPFYFEGPERLKKAIEG
LKKLRKHVDTLIKISNNKLMEELPRDVKIKDAFLKADETLHQGVKGISELITKRGYINLDFADIESVMKDAGAAILGIGV
GKGEHRAREAAKKAMESKLIEHPVENASSIVFNITAPSNIRMEEVHEAAMIIRQNSSEDADVKFGLIFDDEVPDDEIRVI
FIATRFPDEDKILFPEGDIPAIYRYGLEGLL
>P0AB87 4.1.2.17~~~fucA~~~L-fuculose phosphate aldolase~~~COG0235
MERNKLARQIIDTCLEMTRLGLNQGTAGNVSVRYQDGMLITPTGIPYEKLTESHIVFIDGNGKHEEGKLPSSEWRFHMAA
YQSRPDANAVVHNHAVHCTAVSILNRSIPAIHYMIAAAGGNSIPCAPYATFGTRELSEHVALALKNRKATLLQHHGLIAC
EVNLEKALWLAHEVEVLAQLYLTTLAITDPVPVLSDEEIAVVLEKFKTYGLRIEE
>Q8P3K4 1.1.1.-~~~~~~2-keto-3-deoxy-L-fuconate dehydrogenase~~~COG1028
MAADRTAAGTAVRRRQCRGGVRRQRTHVLSSATAGGPHPRNLIGMTVSIPTTPNTRLQGKRCLITAAGAGIGRESALACA
RAGAHVIATDIDAAALQALAAESDAITTQLLDVTDAAAITALVAAHGPFDVLFNCAGYVHQGSILDCDEPAWRRSFSINV
DAMYYTCKAVLPGMLERGRGSIINMSSVASSIKGVPNRFVYGVTKAAVIGLSKAIAADYVAQGVRCNAICPGTIKTPSLG
QRVQALGGDEQAVWKSFTDRQPMGRLGDPREIAQLVVYLASDESSFTTGQTHIIDGGWSN
>Q8P3K2 4.2.1.68~~~~~~L-fuconate dehydratase~~~COG4948
MRTIIALETHDVRFPTSRELDGSDAMNPDPDYSAAYVVLRTDGAEDLAGYGLVFTIGRGNDVQTAAVAALAEHVVGLSVD
KVIADLGAFARRLTNDSQLRWLGPEKGVMHMAIGAVINAAWDLAARAANKPLWRFIAELTPEQLVDTIDFRYLSDALTRD
EALAILRDAQPQRAARTATLIEQGYPAYTTSPGWLGYSDEKLVRLAKEAVADGFRTIKLKVGANVQDDIRRCRLARAAIG
PDIAMAVDANQRWDVGPAIDWMRQLAEFDIAWIEEPTSPDDVLGHAAIRQGITPVPVSTGEHTQNRVVFKQLLQAGAVDL
IQIDAARVGGVNENLAILLLAAKFGVRVFPHAGGVGLCELVQHLAMADFVAITGKMEDRAIEFVDHLHQHFLDPVRIQHG
RYLAPEVPGFSAEMHPASIAEFSYPDGRFWVEDLAASKAKA
>P69922 5.3.1.25~~~fucI~~~L-fucose isomerase~~~COG2407
MKKISLPKIGIRPVIDGRRMGVRESLEEQTMNMAKATAALLTEKLRHACGAAVECVISDTCIAGMAEAAACEEKFSSQNV
GLTITVTPCWCYGSETIDMDPTRPKAIWGFNGTERPGAVYLAAALAAHSQKGIPAFSIYGHDVQDADDTSIPADVEEKLL
RFARAGLAVASMKGKSYLSLGGVSMGIAGSIVDHNFFESWLGMKVQAVDMTELRRRIDQKIYDEAELEMALAWADKNFRY
GEDENNKQYQRNAEQSRAVLRESLLMAMCIRDMMQGNSKLADIGRVEESLGYNAIAAGFQGQRHWTDQYPNGDTAEAILN
SSFDWNGVREPFVVATENDSLNGVAMLMGHQLTGTAQVFADVRTYWSPEAIERVTGHKLDGLAEHGIIHLINSGSAALDG
SCKQRDSEGNPTMKPHWEISQQEADACLAATEWCPAIHEYFRGGGYSSRFLTEGGVPFTMTRVNIIKGLGPVLQIAEGWS
VELPKDVHDILNKRTNSTWPTTWFAPRLTGKGPFTDVYSVMANWGANHGVLTIGHVGADFITLASMLRIPVCMHNVEETK
VYRPSAWAAHGMDIEGQDYRACQNYGPLYKR
>Q97N97 5.3.1.25~~~fucI~~~L-fucose isomerase~~~COG2407
MIQHPRIGIRPTIDGRRQGVRESLEVQTMNMAKSVADLISSTLKYPDGEPVECVISPSTIGRVPEAAASHELFKKSNVCA
TITVTPCWCYGSETMDMSPDIPHAIWGFNGTERPGAVYLAAVLASHAQKGIPAFGIYGRDVQEASDTDIPEDVKEKLLRY
ARAALATGLMRDTAYLSMGSVSMGIGGSIVNPDFFQEYLGMRNESVDMTEFTRRMDRGIYDPEEFERALKWVKENVKEGF
DHNREDLVLSREEKDRQWEFVIKMFMIGRDLMVGNPRLAELGFEEEAVGHHALVAGFQGQRQWTDHFPNGDFMETFLNTQ
FDWNGIRKPFVFATENDSLNGVSMLFNYLLTNTPQIFADVRTYWSPEAVKRVTGHTLEGRAAAGFLHLINSGSCTLDGTG
QATRDGKPIMKPFWELEESEVQAMLENTDFPPANREYFRGGGFSTRFLTKGDMPVTMVRLNLLKGVGPVLQIAEGYTLEL
PEDVHHTLDNRTDPGWPTTWFAPRLTGKGAFKSVYDVMNNWGANHGAITYGHIGADLITLASMLRIPVNMHNVPEEDIFR
PKNWSLFGTEDLESADYRACQLLGPLHK
>P11553 2.7.1.51~~~fucK~~~L-fuculokinase~~~COG1070
MKQEVILVLDCGATNVRAIAVNRQGKIVARASTPNASDIAMENNTWHQWSLDAILQRFADCCRQINSELTECHIRGIAVT
TFGVDGALVDKQGNLLYPIISWKCPRTAAVMDNIERLISAQRLQAISGVGAFSFNTLYKLVWLKENHPQLLERAHAWLFI
SSLINHRLTGEFTTDITMAGTSQMLDIQQRDFSPQILQATGIPRRLFPRLVEAGEQIGTLQNSAAAMLGLPVGIPVISAG
HDTQFALFGAGAEQNEPVLSSGTWEILMVRSAQVDTSLLSQYAGSTCELDSQAGLYNPGMQWLASGVLEWVRKLFWTAET
PWQMLIEEARLIAPGADGVKMQCDLLSCQNAGWQGVTLNTTRGHFYRAALEGLTAQLQRNLQMLEKIGHFKASELLLVGG
GSRNTLWNQIKANMLDIPVKVLDDAETTVAGAALFGWYGVGEFNSPEEARAQIHYQYRYFYPQTEPEFIEEV
>P0AEN8 5.1.3.29~~~fucU~~~L-fucose mutarotase~~~COG4154
MLKTISPLISPELLKVLAEMGHGDEIIFSDAHFPAHSMGPQVIRADGLLVSDLLQAIIPLFELDSYAPPLVMMAAVEGDT
LDPEVERRYRNALSLQAPCPDIIRINRFAFYERAQKAFAIVITGERAKYGNILLKKGVTP
>Q8P3K1 5.1.3.29~~~~~~L-fucose mutarotase~~~COG3254
MSMQRLCYVLDLHDDAALIAQYERWHRPSEVWPEVVASLQQAGIAELEIFRSGDRLVMLMTVGEDYDPAAKAARDAGDPR
IQAWEALMWRFQKALPGSAPGEKWREAGRIFALSEAVSVQQGSAA
>P0A9S1 1.1.1.77~~~fucO~~~Lactaldehyde reductase~~~COG1454
MANRMILNETAWFGRGAVGALTDEVKRRGYQKALIVTDKTLVQCGVVAKVTDKMDAAGLAWAIYDGVVPNPTITVVKEGL
GVFQNSGADYLIAIGGGSPQDTCKAIGIISNNPEFADVRSLEGLSPTNKPSVPILAIPTTAGTAAEVTINYVITDEEKRR
KFVCVDPHDIPQVAFIDADMMDGMPPALKAATGVDALTHAIEGYITRGAWALTDALHIKAIEIIAGALRGSVAGDKDAGE
EMALGQYVAGMGFSNVGLGLVHGMAHPLGAFYNTPHGVANAILLPHVMRYNADFTGEKYRDIARVMGVKVEGMSLEEARN
AAVEAVFALNRDVGIPPHLRDVGVRKEDIPALAQAALDDVCTGGNPREATLEDIVELYHTAW
>P11551 ~~~fucP~~~L-fucose-proton symporter~~~COG0738
MGNTSIQTQSYRAVDKDAGQSRSYIIPFALLCSLFFLWAVANNLNDILLPQFQQAFTLTNFQAGLIQSAFYFGYFIIPIP
AGILMKKLSYKAGIITGLFLYALGAALFWPAAEIMNYTLFLVGLFIIAAGLGCLETAANPFVTVLGPESSGHFRLNLAQT
FNSFGAIIAVVFGQSLILSNVPHQSQDVLDKMSPEQLSAYKHSLVLSVQTPYMIIVAIVLLVALLIMLTKFPALQSDNHS
DAKQGSFSASLSRLARIRHWRWAVLAQFCYVGAQTACWSYLIRYAVEEIPGMTAGFAANYLTGTMVCFFIGRFTGTWLIS
RFAPHKVLAAYALIAMALCLISAFAGGHVGLIALTLCSAFMSIQYPTIFSLGIKNLGQDTKYGSSFIVMTIIGGGIVTPV
MGFVSDAAGNIPTAELIPALCFAVIFIFARFRSQTATN
>O30511 2.4.1.152~~~fucT~~~Alpha-(1,3)-fucosyltransferase FucT~~~
MFQPLLDAYVESASIEKMASKSPPPLKIAVANWWGDEEIKEFKNSVLYFILSQRYTITLHQNPNEFSDLVFGNPLGSARK
ILSYQNAKRVFYTGENESPNFNLFDYAIGFDELDFNDRYLRMPLYYDRLHHKAESVNDTTAPYKLKDNSLYALKKPSHCF
KEKHPNLCAVVNDESDPLKRGFASFVASNPNAPIRNAFYDALNSIEPVTGGGSVRNTLGYNVKNKNEFLSQYKFNLCFEN
TQGYGYVTEKIIDAYFSHTIPIYWGSPSVAKDFNPKSFVNVHDFKNFDEAIDYIKYLHTHKNAYLDMLYENPLNTLDGKA
YFYQNLSFKKILAFFKTILENDTIYHDNPFIFCRDLNEPLVTIDDLRVNYDDLRVNYDDLRINYDDLRVNYDDLRINYDD
LRVNYDDLRVNYDDLRINYDDLRVNYDDLRVNYERLLSKATPLLELSQNTTSKIYRKAYQKSLPLLRAIRRWVKKLGL
>P01547 ~~~~~~Bacteriocin fulvocin-C~~~
ANCSCSTASDYCPILTFCTTGTACSYTPTGCGTGWVYCACNGNFY
>P0AC33 4.2.1.2~~~fumA~~~Fumarate hydratase class I, aerobic~~~COG1838
MSNKPFHYQAPFPLKKDDTEYYLLTSEHVSVSEFEGQEILKVAPEALTLLARQAFHDASFMLRPAHQQQVADILRDPEAS
ENDKYVALQFLRNSDIAAKGVLPTCQDTGTAIIVGKKGQRVWTGGGDEAALARGVYNTYIEDNLRYSQNAPLDMYKEVNT
GTNLPAQIDLYAVDGDEYKFLCIAKGGGSANKTYLYQETKALLTPGKLKNYLVEKMRTLGTAACPPYHIAFVIGGTSAET
NLKTVKLASAKYYDELPTEGNEHGQAFRDVELEKELLIEAQNLGLGAQFGGKYFAHDIRVIRLPRHGASCPVGMGVSCSA
DRNIKAKINRQGIWIEKLEHNPGKYIPEELRKAGEGEAVRVDLNRPMKEILAQLSQYPVSTRLSLNGTIIVGRDIAHAKL
KERMDNGEGLPQYIKDHPIYYAGPAKTPEGYASGSLGPTTAGRMDSYVDQLQAQGGSMIMLAKGNRSQQVTDACKKHGGF
YLGSIGGPAAVLAQGSIKSLECVEYPELGMEAIWKIEVEDFPAFILVDDKGNDFFQQIQLTQCTRCVK
>P14407 4.2.1.2~~~fumB~~~Fumarate hydratase class I, anaerobic~~~COG1838
MSNKPFIYQAPFPMGKDNTEYYLLTSDYVSVADFDGETILKVEPEALTLLAQQAFHDASFMLRPAHQKQVAAILHDPEAS
ENDKYVALQFLRNSEIAAKGVLPTCQDTGTAIIVGKKGQRVWTGGGDEETLSKGVYNTYIEDNLRYSQNAALDMYKEVNT
GTNLPAQIDLYAVDGDEYKFLCVAKGGGSANKTYLYQETKALLTPGKLKNFLVEKMRTLGTAACPPYHIAFVIGGTSAET
NLKTVKLASAHYYDELPTEGNEHGQAFRDVQLEQELLEEAQKLGLGAQFGGKYFAHDIRVIRLPRHGASCPVGMGVSCSA
DRNIKAKINREGIWIEKLEHNPGQYIPQELRQAGEGEAVKVDLNRPMKEILAQLSQYPVSTRLSLTGTIIVGRDIAHAKL
KELIDAGKELPQYIKDHPIYYAGPAKTPAGYPSGSLGPTTAGRMDSYVDLLQSHGGSMIMLAKGNRSQQVTDACHKHGGF
YLGSIGGPAAVLAQQSIKHLECVAYPELGMEAIWKIEVEDFPAFILVDDKGNDFFQQIVNKQCANCTK
>Q51404 4.2.1.2~~~fumC2~~~Fumarate hydratase class II 2~~~
MTDTRIERDSMGELAVPATALYGAQTQRAVNNFPVSGQRMPQAFVRALLLAKAAAARANVSLQQLDAPMGEAIADTCLQL
LQEDFMQHFPVDVFQTGSGTSSNMNANEVVATLASRRLGGKVNPNDHVNCGQSSNDIIPSTIHISAALEISERLLPALRH
LEQTIQSKAGEVHAYVKTGRTHLMDAMPVRMSQVLGGWAQQVRQAGVHIESVLPALQQLAQGGTAVGTGINAHPRFAERF
SQELNDLTGLAFRPGDDFFALIGSQDTAVAASGQLKTLAVTLMKLANDLRWMNSGPLAGLGEIELEALQPGSSIMPGKVN
PVIPEATAMVAAQVIGNDAAIAVAGQSGNFELNVMLPLVADNLLHSIQLLANVSRLLADKAIASFKVNQGKLSEALARNP
ILVTALNPIIGYQKAAEIAKQAYREGRPIIDVALENTDLDRARLEVLLDPEKLTAGGL
>P05042 4.2.1.2~~~fumC~~~Fumarate hydratase class II~~~COG0114
MNTVRSEKDSMGAIDVPADKLWGAQTQRSLEHFRISTEKMPTSLIHALALTKRAAAKVNEDLGLLSEEKASAIRQAADEV
LAGQHDDEFPLAIWQTGSGTQSNMNMNEVLANRASELLGGVRGMERKVHPNDDVNKSQSSNDVFPTAMHVAALLALRKQL
IPQLKTLTQTLNEKSRAFADIVKIGRTHLQDATPLTLGQEISGWVAMLEHNLKHIEYSLPHVAELALGGTAVGTGLNTHP
EYARRVADELAVITCAPFVTAPNKFEALATCDALVQAHGALKGLAASLMKIANDVRWLASGPRCGIGEISIPENEPGSSI
MPGKVNPTQCEALTMLCCQVMGNDVAINMGGASGNFELNVFRPMVIHNFLQSVRLLADGMESFNKHCAVGIEPNRERINQ
LLNESLMLVTALNTHIGYDKAAEIAKKAHKEGLTLKAAALALGYLSEAEFDSWVRPEQMVGSMKAGR
>P9WN93 4.2.1.2~~~fumC~~~Fumarate hydratase class II~~~COG0114
MAVDADSANYRIEHDTMGEVRVPAKALWRAQTQRAVENFPISGRGLERTQIRALGLLKGACAQVNSDLGLLAPEKADAII
AAAAEIADGQHDDQFPIDVFQTGSGTSSNMNTNEVIASIAAKGGVTLHPNDDVNMSQSSNDTFPTATHIAATEAAVAHLI
PALQQLHDALAAKALDWHTVVKSGRTHLMDAVPVTLGQEFSGYARQIEAGIERVRACLPRLGELAIGGTAVGTGLNAPDD
FGVRVVAVLVAQTGLSELRTAANSFEAQAARDGLVEASGALRTIAVSLTKIANDIRWMGSGPLTGLAEIQLPDLQPGSSI
MPGKVNPVLPEAVTQVAAQVIGNDAAIAWGGANGAFELNVYIPMMARNILESFKLLTNVSRLFAQRCIAGLTANVEHLRR
LAESSPSIVTPLNSAIGYEEAAAVAKQALKERKTIRQTVIDRGLIGDRLSIEDLDRRLDVLAMAKAEQLDSDRL
>Q92PB6 4.2.1.2~~~fumC~~~Fumarate hydratase class II~~~COG0114
MTSTRTETDTFGPIEVASDRYWGAQAQRSLGNFKIGWEKQPLAIVRALGIVKQAAARANMALGRLDPAIGDAIVKAAQEV
IDGKLDEHFPLVVWQTGSGTQSNMNANEVVSNRAIELLGGVMGSKKPVHPNDHVNMSQSSNDTYPTAMHIACAERVIHDL
LPALKHLHKALEEKVKAFDHIIKIGRTHTQDATPLTLGQEFSGYAAQVASSIKRIEMTLPGLCELAQGGTAVGTGLNAPV
GFAEKVAEEIAAITGIGFTSAPNKFEALAAHDSMVFSHGAINATAAALFKIANDIRFLGSGPRSGLGELSLPENEPGSSI
MPGKVNPTQCEALTQVCVQVFGNHAALTFAGSQGHFELNVYNPLMAYNFLQSVQLLADAAISFTDNCVVGIEAREDNIKA
ALDRSLMLVTALAPKIGYDNAAKIAKTAHKNGTTLREEAVGGGYVTDEEFDAVVRPETMIGPA
>Q9ZCQ4 4.2.1.2~~~fumC~~~Fumarate hydratase class II~~~COG0114
MKNYRIESDSFGEIQIEEKFYWGAQTQRSLNNFKISKQKMPKILIRALAILKKCAAQVNYEFGDLEYKIATSIDKAIDRI
LAGEFEDNFPLVVWQTGSGTQTNMNMNEVIASIANEELTGKKGGKFPVHPNDHVNKGQSSNDSFPTAMHIATVLATKQQL
IPALNNLLTYLQDKSKDWDKIIKIGRTHLQDATPLTLKQEFSGYITQIEYALERIEDALKKVYLLAQGGTAVGTGINSKI
GFDIKFAQKVAEFTQQPFKTAPNKFESLAAHDALVEFSGTLNTIAVSLMKIANDIRLLGSGPRCGLGELHLPENEPGSSI
MPGKVNPTQVEALTMVCTQVMGNHVTVTIAGSNGHLELNVFKPVIIYNILQSIELLSDSVNSFVTHCVKGLEPNIARINT
LRDKSLMLVTVLNPHIGYDNAAKIAKEAHKYGITLKEAAKKLNFLSEEEFDKIVVPEKMIS
>P64173 4.2.1.2~~~fumC~~~Fumarate hydratase class II~~~
MSVRIEHDTFGEIEVPADKYWGAQTERSKRNFPVGKERMPIEVVYGFAQLKRAAALANFDLGKLSEAKKDAIVYACDQIL
SGELDEHFPLVVWQTGSGTQSNMNVNEVVSYVANMYLKDHQIDESIHPNDDVNESQSSNDTFPTAMHVALYQEVETKLEP
ALKLLRNTLKEKEDKFESIIKIGRTHLQDATPIKLGQEISGWRYMLDRCEIMLSESKKHILNLAIGGTAVGTGINAHPEF
GDKVAHYISENTGYPFVSSENKFHALTAHDEVVQLHGTLKALAGDLMKIANDVRWLASGPRAGLAEISIPENEPGSSIMP
GKVNPTQCEMLTMVAVQVMGNDTVVGFASSQGNFELNVYKPVIMHNTLQSIYLLADGMETFNNNCAVGIEPIEENIDNYL
NQSLMLVTALNPHIGYEKAAQIAKKAHKEGLTLKESAIQTGYVTEEQFEAWIKPEDMVDPH
>P0ACX5 4.2.1.2~~~fumD~~~Fumarase D~~~
MGNRTKEDELYREMCRVVGKVVLEMRDLGQEPKHIVIAGVLRTALANKRIQRSELEKQAMETVINALVK
>D2D3B6 3.1.1.87~~~fumD~~~Fumonisin B1 esterase~~~
MKEHQCRGGRASPAAPATWLARISVSRGASAIAWTFMLGATAIPVAAQTDDPKLVRHTQSGAVEGVEGDVETFLGIPFAA
PPVGDLRWRPPAPPRAWAGTRDGRRFAPDCIGNERLREGSRAAGTSEDCLYLNIWSPKQVGKGGLPVMIWVYGGGFSGGS
GAVPYYDGSALAQKGVVVVTFNYRAGILGFLAHPALSKESPNGVSGNYGLLDMLAAFKWVQNNIREFGGDPNRVTVFGES
AGASALGLLLTSPLSESAFNQAILQSPGLARPLATLSESEANGLELGADISALRRADAGELTKIAQSRIPMSRQFTKPRP
MGPILDGYVLRTLDVDAFAKGAFRKIPVLVGGNADEGRAFTDRLPVKTVLEYRAYLTEQFGDEADAWERCYPANSDADVP
AAVARLFGDSQFNNGIELLSAAFAKWRTPLWRYRFTGIPGAGRRPATHGDEIPYVFANLGPSSVSMFGSLEGGAGASDIK
LATEMSAAWVSFAVHGVPDQGTKSHWPRFERRGEIMTFGSQVGSGEGLGVSPSKACQPSK
>P11663 4.2.1.2~~~fumE~~~Fumarase E~~~COG3722
MATLTEDDVLEQLDAQDNLFSFMKTAHTILLQGIRQFLPSLFVDNDEEIVEYAVKPLLAQSGPLDDIDVALRLIYALGKM
DKWLYADITHFSQFWHYLNEQDETPGFADDMTWDFISNVNSITRNAMLYDALKAMKFADFSVWSEARFSGMVKTALTLAV
TTTLKELTP
>Q83Q96 4.2.1.2~~~fumE~~~Fumarase E~~~
MATLTEDDVLEQLDAQDNLFSFMKTAHSILLQGIRQFLPSLFVDNDEEIVEYAVKPLLAQSGPLDDIDVALRLIYALGKM
DKWLYADITHFSQYWHYLNEQDETPGFADDITWDFISNVNSITRNATLYDALKAMKFADFAVWSEARFSGMVKTALTLAV
TTTLKELTP
>D2D3B2 2.6.1.-~~~fumI~~~Aminopentol aminotransferase~~~
MANGTRQKDLRERAERVIPGGMYGHESTRLLPPEFPQFFRRALGARIWDADEQPYIDYMCAYGPNLLGYRQSEIEAAADA
QRLLGDTMTGPSEIMVNLAEAFVGMVRHADWAMFCKNGSDATSTAMVLARAHTGRKTILCAKGAYHGASPWNTPHTAGIL
ASDRVHVAYYTYNDAQSLSDAFKAHDGDIAAVFATPFRHEVFEDQALAQLEFARTARKCCDETGALLVVDDVRAGFRVAR
DCSWTHLGIEPDLSCWGKCFANGYPISALLGSNKARDAARDIFVTGSFWFSAVPMAAAIETLRIIRETPYLETLIASGAA
LRAGLEAQSQRHGLELKQTGPAQMPQIFFADDPDFRIGYAWAAACLKGGVYVHPYHNMFLSAAHTVDDVTETLEATDRAF
SAVLRDFASLQPHPILMQLAGA
>Q2L6E3 2.5.1.124~~~fur7~~~Furaquinocin biosynthesis prenyltransferase~~~
MPGTDDVAVDVASVYSAIEKSAGLLDVTAAREVVWPVLTAFEDVLEQAVIAFRVATNARHEGDFDVRFTVPEEVDPYAVA
LSRSLIAKTDHPVGSLLSDIQQLCSVDTYGVDLGVKSGFKKVWVYFPAGEHETLARLTGLTSMPGSLAGNVDFFTRYGLA
DKVDVIGIDYRSRTMNVYFAAPSECFERETVLAMHRDIGLPSPSEQMFKFCENSFGLYTTLNWDTMEIERISYGVKTENP
MTFFARLGTKVEHFVKNVPYGVDTQKMVYAAVTSSGEEYYKLQSYYRWRSVSRLNAAYIAARDKEST
>P9WN87 ~~~furA~~~Transcriptional regulator FurA~~~COG0735
MSSIPDYAEQLRTADLRVTRPRVAVLEAVNAHPHADTETIFGAVRFALPDVSRQAVYDVLHALTAAGLVRKIQPSGSVAR
YESRVGDNHHHIVCRSCGVIADVDCAVGEAPCLTASDHNGFLLDEAEVIYWGLCPDCSISDTSRSHP
>P54574 ~~~fur~~~Ferric uptake regulation protein~~~COG0735
MENRIDRIKKQLHSSSYKLTPQREATVRVLLENEEDHLSAEDVYLLVKEKSPEIGLATVYRTLELLTELKVVDKINFGDG
VSRYDLRKEGAAHFHHHLVCMECGAVDEIEEDLLEDVEEIIERDWKFKIKDHRLTFHGICHRCNGKETE
>P0C631 ~~~fur~~~Ferric uptake regulation protein~~~COG0735
MLIENVEYDVLLERFKKILRQGGLKYTKQREVLLKTLYHSDTHYTPESLYMEIKQAEPDLNVGIATVYRTLNLLEEAEMV
TSISFGSAGKKYELANKPHHDHMICKNCGKIIEFENPIIERQQALIAKEHGFKLTGHLMQLYGVCGDCNNQKAKVKI
>P0A9A9 ~~~fur~~~Ferric uptake regulation protein~~~COG0735
MTDNNTALKKAGLKVTLPRLKILEVLQEPDNHHVSAEDLYKRLIDMGEEIGLATVYRVLNQFDDAGIVTRHNFEGGKSVF
ELTQQHHHDHLICLDCGKVIEFSDDSIEARQREIAAKHGIRLTNHSLYLYGHCAEGDCREDEHAHEGK
>O25671 ~~~fur~~~Ferric uptake regulation protein~~~COG0735
MKRLETLESILERLRMSIKKNGLKNSKQREEVVSVLYRSGTHLSPEEITHSIRQKDKNTSISSVYRILNFLEKENFICVL
ETSKSGRRYEIAAKEHHDHIICLHCGKIIEFADPEIENRQNEVVKKYQAKLISHDMKMFVWCKECQESEC
>P45599 ~~~fur~~~Ferric uptake regulation protein~~~
MTDNNTALKKAGLKVTLPRLKILEVLQEPDNHHVSAEDLYKRLIDMGEEIGLATVYRVLNQFDDAGIVTRHNFEGGKSVF
ELTQQHHHDHLICLDCGKVIEFSDDSIELRQREIASRHGIRLTNHSLYLYGHCAEGDCRETNTPTTRWKNNSPFR
>Q03456 ~~~fur~~~Ferric uptake regulation protein~~~
MVENSELRKAGLKVTLPRVKILQMLDSAEQRHMSAEDVYKALMEAGEDVGLATVYRVLTQFEAAGLVVRHNFDGGHAVFE
LADSGHHDHMVCVDTGEVIEFMDAEIEKRQKEIVRERGFELVDHNLVLYVRKKK
>O07315 ~~~fur~~~Ferric uptake regulation protein~~~
MTDVAKTLEELCTERGMRMTEQRRVIARILEDSEDHPDVEELYRRSVKVDAKISISTVYRTVKLFEDAGIIARHDFRDGR
SRYETVPEEHHDHLIDLKTGTVIEFRSPEIEALQERIAREHGFRLVDHRLELYGVPLKKEDL
>P37736 ~~~fur~~~Ferric uptake regulation protein~~~COG0735
MSDNNQALKDAGLKVTLPRLKILEVLQQPECQHISAEELYKKLIDLGEEIGLATVYRVLNQFDDAGIVTRHHFEGGKSVF
ELSTQHHHDHLVCLDCGEVIEFSDEVIEQRQREIAEQYNVQLTNHSLYLYGKCADGSCKQNPNAHKSKR
>P0C6C8 ~~~fur~~~Ferric uptake regulation protein~~~COG0735
MSDNNQALKDAGLKVTLPRLKILEVLQQPECQHISAEELYKKLIDLGEEIGLATVYRVLNQFDDAGIVTRHHFEGGKSVF
ELSTQHHHDHLVCLDCGEVIEFSDDVIEQRQKEIAAKYNVQLTNHSLYLYGKCGSDGSCKDNPNAHKPKK
>A0A0H2URD6 ~~~fusA~~~Fructooligosaccharide ABC transporter substrate-binding protein FusA~~~COG1653
MKFKTFSKSAVLLTASLAVLAACGSKNTASSPDYKLEGVTFPLQEKKTLKFMTASSPLSPKDPNEKLILQRLEKETGVHI
DWTNYQSDFAEKRNLDISSGDLPDAIHNDGASDVDLMNWAKKGVIIPVEDLIDKYMPNLKKILDEKPEYKALMTAPDGHI
YSFPWIEELGDGKESIHSVNDMAWINKDWLKKLGLEMPKTTDDLIKVLEAFKNGDPNGNGEADEIPFSFISGNGNEDFKF
LFAAFGIGDNDDHLVVGNDGKVDFTADNDNYKEGVKFIRQLQEKGLIDKEAFEHDWNSYIAKGHDQKFGVYFTWDKNNVT
GSNESYDVLPVLAGPSGQKHVARTNGMGFARDKMVITSVNKNLELTAKWIDAQYAPLQSVQNNWGTYGDDKQQNIFELDQ
ASNSLKHLPLNGTAPAELRQKTEVGGPLAILDSYYGKVTTMPDDAKWRLDLIKEYYVPYMSNVNNYPRVFMTQEDLDKIA
HIEADMNDYIYRKRAEWIVNGNIDTEWDDYKKELEKYGLSDYLAIKQKYYDQYQANKN
>P72827 ~~~futA1~~~Iron uptake protein A1~~~COG1840
MVQKLSRRLFLSIGTAFTVVVGSQLLSSCGQSPDAPIADTPGEQQEINLYSSRHYNTDNELYAKFTAETGIKVNLIEGKA
DELLERIKSEGANSPADVLLTVDLARLWRAEEDGIFQPVQSEILETNVPEYLRSPDGMWFGFTKRARVIMYNKGKVKPEE
LSTYEELADPKWKGRVIIRSSSNEYNQSLVASLVVADGEESTLAWAKGFVSNFAREPQGNDTAQIEAVSSGEADLTLANT
YYMGRLLESEDPAQKAIAENVGVFFPNQEGRGTHVNVSGVGVVKTAPNREGAVKFIEFLVSEPAQAFLAQNNYEYPVLAG
VPLNKSVASFGEFKSDTTSLDKLGPALAPATKIMNEAGWK
>Q55835 ~~~futA2~~~Iron uptake protein A2~~~COG1840
MTTKISRRTFFVGGTALTALVVANLPRRASAQSRTINLYSSRHYNTDDALYDAFGEVNLIEASAEELIERIQSEGANSPG
DILFTVDAGMLWRAEQAGLFQPVRSGKLNERIPENLRHPDGLWYGFTQRARVLYYSRDRVNPADLSTYEALADPQWRGKI
LVRPSSNVYNLSLTASRIAIHGEPETRRWLQGLVGNFARQPEGNDTAQIRAIAAGIGDVAIANSYYYIRLQKSTDPADQE
VVEKVSLFFPNTGSGERGTHVNVSGAGVLKNAPNRDAAIAFLEYLASDDAQRYFAEGNNEYPVIPGVPIDPVLAAHGQLK
GDPLNVSNLGRYQPDSARLMNEVGWQ
>P37147 ~~~fxsA~~~UPF0716 protein FxsA~~~COG3030
MRWLPFIAIFLYVYIEISIFIQVAHVLGVLLTLVLVIFTSVIGMSLVRNQGFKNFVLMQQKMAAGENPAAEMIKSVSLII
AGLLLLLPGFFTDFLGLLLLLPPVQKHLTVKLMPHLRFSRMPGGGFSAGTGGGNTFDGEYQRKDDERDRLDHKDDRQD
>P0C2M9 ~~~fyuA~~~Pesticin receptor~~~
MKMTRLYPLALGGLLLPAIANAQTSQQDESTLEVTASKQSSRSASANNVSSTVVSAPELSDAGVTASDKLPRVLPGLNIE
NSGNMLFSTISLRGVSSAQDFYNPAVTLYVDGVPQLSTNTIQALTDVQSVELLRGPQGTLYGKSAQGGIINIVTQQPDST
PRGYIEGGVSSRDSYRSKFNLSGPIQDGLLYGSVTLLRQVDDGDMINPATGSDDLGGTRASIGNVKLRLAPDDQPWEMGF
AASRECTRATQDAYVGWNDIKGRKLSLSDGSPDPYMRRCTDSQTLSGKYTTDDWVFNLISAWQQQHYSRTFPSGSLIVNM
PQRWNQDVQELRAATLGDARTVDMVFGLYRQNTREKLNSAYNMPTMPYLSSTGYTTAETLAAYSDLTWHLTDRFDIGGGV
RFSHDKSSTQYHGSMLGNPFGDQGKSNDDQVLGQLSAGYMLTDDWRVYTRIAQGYKPSGYNIVPTAGLDAKPFVAEKSIN
YELGTRYETADVTLQAATFYTHTKDMQLYSGPVGMQTLSNAGKADATGVELEAKWRFAPGWSWDINGNVIRSEFTNDSEL
YHGNRVPFVPRYGAGSSVNGVIDTRYGALMPRLAVNLVGPHYFDGDNQLRQGTYATLDSSLGWQATERINISVHVDNLFD
RRYRTYGYMNGSSAVAQVNMGRTVGINTRIDFF
>P46359 ~~~fyuA~~~Pesticin receptor~~~COG4771
MKMTRLYPLALGGLLLPAIANAQTSQQDESTLVVTASKQSSRSASANNVSSTVVSAPELSDAGVTASDKLPRVLPGLNIE
NSGNMLFSTISLRGVSSAQDFYNPAVTLYVDGVPQLSTNTIQALTDVQSVELLRGPQGTLYGKSAQGGIINIVTQQPDST
PRGYIEGGVSSRDSYRSKFNLSGPIQDGLLYGSVTLLRQVDDGDMINPATGSDDLGGTRASIGNVKLRLAPDDQPWEMGF
AASRECTRATQDAYVGWNDIKGRKLSISDGSPDPYMRRCTDSQTLSGKYTTDDWVFNLISAWQQQHYSRTFPSGSLIVNM
PQRWNQDVQELRAATLGDARTVDMVFGLYRQNTREKLNSAYDMPTMPYLSSTGYTTAETLAAYSDLTWHLTDRFDIGGGV
RFSHDKSSTQYHGSMLGNPFGDQGKSNDDQVLGQLSAGYMLTDDWRVYTRVAQGYKPSGYNIVPTAGLDAKPFVAEKSIN
YELGTRYETADVTLQAATFYTHTKDMQLYSGPVRMQTLSNAGKADATGVELEAKWRFAPGWSWDINGNVIRSEFTNDSEL
YHGNRVPFVPRYGAGSSVNGVIDTRYGALMPRLAVNLVGPHYFDGDNQLRQGTYATLDSSLGWQATERMNISVYVDNLFD
RRYRTYGYMNGSSAVAQVNMGRTVGINTRIDFF
>A0A0H3CDY2 ~~~fzlA~~~FtsZ-localized protein A~~~
MSVERTLHHFPLDPASRQVRLALGEKRLPFVEMQVRYWEMPPEFTSLNPSGMPPVLVETKHQRNLVICETRAILEHIEET
ETEPPLLGRDPAERAEARRLLQWFDRKFDNEVNGFLLHEKMEKRLLRMGAPDLAALRQGREALRMHLGYIESLLQTRDWL
AGRRMSLADFAAAAHLSVIDYFGDVPWKDFQAAKTWYMKLKSRPCFRPILADRWPGLAPAAHYDDLDF
>A0A0H3C3S1 ~~~fzlC~~~FtsZ-localized protein C~~~
MAGKPPVLGLAFPARLRGGVLWKAIRAEAAGHVEREWFGSGPHRLKIALPRPEGLSARPHDPRPVDPAHGQKILSGALTL
DGGALRLGVDGDPFDTASPSRRFAVSLHRFDWLPDLVAVGPDGARRALRLIDDWRRVFGKWNAFSWGPECLERRVHHLAC
AAKTLAAEASDAEVADLVFDLARQGRHLLEITRAPERTLERAVAAGLAGCVLAGKPGEPLIDAALKALVPQLDAMVLGDG
GHATRSPEAGVELLFDLLTLDDALGQRGRPSPEALSRAIDRLSSATRFFILGDGHLAAFHGGETVGPARIAAALAHDDAG
PRSLNAAPHSGYHKMIGGSIEVIADCGPPPVGPLSVNACAQPAAFEIVCAKDRLITSCGWSPEAAGAHAFRLSDAASTVS
VADGSAGRPLSGFRAKALGPWLVDGAAKVEAKRHDDVGGVWLDIVHDGWRHLGLTHARRLFLDAVQDELRGEDSLSPLAL
DPKAAEGPRRYLPFAVRFHLHPDARASIARDGKSVLIRGPSNIGWWLRNDAVDVEIAPSAHFDHGLARKAGQIVLKSQVR
PEVGAKIRWKLTKAEG
>Q5LGZ0 3.2.1.-~~~~~~Glycosyl hydrolase family 109 protein 1~~~COG0673
MFKHLNALFIGLALFACTSGAVAQTIKPIETPVPVRPAGQKDVVGLTTPKLDVVRVGFIGLGMRGPGAVERFTHIPGTQI
VALCDLIPERVAGAQKILTKANLPEAASYSGSEDAWKKLCERKDIDLVYIATDWKHHAQMAIYAMEHGKHVAIEVPSAMT
LDEIWALINTSEKTRKHCMQLENCVYDFFELTTLNMAQQGVFGEVLHTEGAYIHNLEDFWPYYWNNWRMDYNQNHRGDVY
ATHGMGPACQLLDIHRGDKMNYLVSMDTKAVNGPAYIKKTTGKEVKDFQNGDQTSTLIRTEKGKTILIQHNVMTPRPYSR
MYQVVGADGYASKYPIEEYCMRPTQIASNDVPNHEKLNAHGSVPADVKKALMDKYKHPIHKELEETAKKVGGHGGMDYIM
DYRLVYCLRNGLPLDMDVYDLAEWCCMADLTKLSIENSSAPVAIPDFTRGAWNKVKGYRHAFAK
>B2UQL7 3.2.1.-~~~~~~Glycosyl hydrolase family 109 protein 2~~~COG0673
MSIFSSRRQFLKSLGLAAGAAAAGNALPGKAVEIPAGDHLWKSASPAAPRPSGSTYMGGFKAPRLGRIRLAFIGVGGRGF
SHLAQMCVMDGVEIVGICDLKEELTKRGVDRVLSRMGKSPLGYSGGDMEYLTMLKELKPDAVIISTDWSSHARIACDSMK
HGAHAFVEVPLAVSLEELWSLVDTSEATRKHCMMMENVNYGRDELMFLNMVRQGVIGDLLHGEAAYIHCLVTQLGDTRGE
GAWRPEYHTRINGNLYPTHGLGPVAQYMNLERGEDRFCRVAAFASPALGRNAYAKKHLPADHRWNNTPFICGDMNTAVVK
TQLGRTILVQLDETSPRPYSRANLIQGTEGTLAGFPTRVAGEKLGNGNYHEWIEGREKLAAIYEKYDHPLWKRIGELATK
MGGHGGMDFVMLSRIVECLRNGEPMDQNVYEGASWSSLLPLTARSIAQGGMPVEFPDFTRGDWKTTMPLAVVS
>P80872 ~~~yocK~~~General stress protein 16O~~~COG1734
MALTKEQTQHLYHKLLDMQKELSGEKKETESMTEEVGELSNGVDNHMADHGTLVTDRMTDQTVKEIDRELLEEVNRALQK
MKDGTYGVCEKTGQEIPYERLEAVPYARMTVEAQADVEDDLETDAPSYEREFHEQVKDLSNKETIDQKSSQTYEILDREQ
DSK
>P80875 ~~~yceD~~~General stress protein 16U~~~COG2310
MTISLAKGQKVDLTKTNPGLSKVVVGLGWDTNKYDGGHDFDLDSSVFLLDAAGKCASPNDFIFYNQLEGGNGSVVHSGDN
LTGAGEGDDENVKVNLSAVPANIDKISFVITIHDAEARSQNFGQVSNAFVRIVNEETNEELIRYDLAEDFSIETAIIAGE
LYRHNGEWKFSAIGSGYQGGLARIATDYGLQVG
>P80241 ~~~yflT~~~General stress protein 17M~~~
MKPVVKEYTNDEQLMKDVEELQKMGVAKEDVYVLAHDDDRTERLADNTNANTIGAKETGFKHAVGNIFNKKGDELRNKIH
EIGFSEDEAAQFEKRLDEGKVLLFVTDNEKVKAWA
>P94527 1.1.1.261~~~egsA~~~Glycerol-1-phosphate dehydrogenase [NAD(P)+]~~~COG0371
MNRIAADVQRAFENAGEKTLPIKVEEIVLGKQAADSLLDYVKRKNNQHIVLVCDANTHRIAGIDLENRLNQEGFQAECLI
IPENEAGDVTADERSLIHVLIHTKQPTDVMIAVGSGTIHDIVRFAAFQRDLPFISYPTAPSVDGFTSAGAPIILYGTKTT
IQTKAPSALFADLDLLKAAPQSMVAAGFGDMLGKITSLADWEISRHLAGEPYSPAGAKIVQEALAACIEHTEDIAMKTET
GIRVLMESLLVSGLVMLALDHSRPASGGEHHISHWIEMELMEKKRPQILHGAKVGCAAVLLTDTYRKLAQDDGLNEFSPS
RREAIQSAYQTLPRGEVLADWLRSAGGPAYFDEIGVGQDSVKNAFRHAHTLRDRCTGLRIINENKTLINHGLYE
>P80879 ~~~dps~~~General stress protein 20U~~~COG0783
MSEQLIQAVNKQVANWTVMYVKLHNYHWYVKGKDFFTLHEKFEELYNETATYIDDLAERLLALNGKPIATMKESLETASV
KEAAGNETAEQMVQSVYDDFTVIAEELKNGMDLADEVGDETTGDMLLAIHQNIEKHNWMLKAYLG
>Q4MQ58 1.2.1.12~~~gap1~~~Glyceraldehyde-3-phosphate dehydrogenase 1~~~COG0057
MTKIGINGFGRIGRNVFRAALNNSEVEVVAINDLTDAKTLAHLLKYDTVHGTLNAEVSANENSIVVNGKEIKVIAERDPA
QLPWSDYGVEVVVESTGRFTKKSDAEKHLGGSVKKVIISAPASDEDITVVMGVNHEQYDAANHNVVSNASCTTNCLAPFA
KVLNEKFGVKRGMMTTIHSYTNDQQILDLPHKDLRRARAAAENMIPTSTGAAKAVALVLPELKGKLNGGAVRVPTANVSL
VDLVVELDKEVTVEEVNAAFKAAAEGELKGILGYSEEPLVSIDYNGCTASSTIDALSTMVMEGNMVKVLSWYDNETGYSN
RVVDLAAYMTSKGL
>P09124 1.2.1.12~~~gapA~~~Glyceraldehyde-3-phosphate dehydrogenase 1~~~COG0057
MAVKVGINGFGRIGRNVFRAALNNPEVEVVAVNDLTDANMLAHLLQYDSVHGKLDAEVSVDGNNLVVNGKTIEVSAERDP
AKLSWGKQGVEIVVESTGFFTKRADAAKHLEAGAKKVIISAPANEEDITIVMGVNEDKYDAANHDVISNASCTTNCLAPF
AKVLNDKFGIKRGMMTTVHSYTNDQQILDLPHKDYRRARAAAENIIPTSTGAAKAVSLVLPELKGKLNGGAMRVPTPNVS
LVDLVAELNQEVTAEEVNAALKEAAEGDLKGILGYSEEPLVSGDYNGNKNSSTIDALSTMVMEGSMVKVISWYDNESGYS
NRVVDLAAYIAKKGL
>P0A9B2 1.2.1.12~~~gapA~~~Glyceraldehyde-3-phosphate dehydrogenase A~~~COG0057
MTIKVGINGFGRIGRIVFRAAQKRSDIEIVAINDLLDADYMAYMLKYDSTHGRFDGTVEVKDGHLIVNGKKIRVTAERDP
ANLKWDEVGVDVVAEATGLFLTDETARKHITAGAKKVVMTGPSKDNTPMFVKGANFDKYAGQDIVSNASCTTNCLAPLAK
VINDNFGIIEGLMTTVHATTATQKTVDGPSHKDWRGGRGASQNIIPSSTGAAKAVGKVLPELNGKLTGMAFRVPTPNVSV
VDLTVRLEKAATYEQIKAAVKAAAEGEMKGVLGYTEDDVVSTDFNGEVCTSVFDAKAGIALNDNFVKLVSWYDNETGYSN
KVLDLIAHISK
>P80506 1.2.1.12~~~gap1~~~Glyceraldehyde-3-phosphate dehydrogenase 1~~~COG0057
MAKLKVGINGFGRIGRLVLRAGINNPNIEFVGINDLVPPDNLAYLLKYDSTHGRLRSQVETKDDGIVIDGHFIPCVSVRN
PAELPWGKLGADYVVESTGLFTDSEGASKHLQAGARRVIISAPTKDPDRVRTLLVGVNHDLFDPSKDLIVSNASCTTNCL
APIAKVINDNFGLTEGLMTTVHAMTATQPTVDGPSKKDWRGGRGAAQNIIPSSTGAAKAVALVLPELKGKLTGMAFRVPT
PDVSVVDLTFKTAKATSYKEICAAMKQASEGSLAGILGYTDEEVVSTDFQGDTHSSIFDAGAGIELNSNFFKVVAWYDNE
WGYSNRVVDLMLSMVQKEQLAAV
>P99136 1.2.1.12~~~gapA1~~~Glyceraldehyde-3-phosphate dehydrogenase 1~~~
MAVKVAINGFGRIGRLAFRRIQEVEGLEVVAVNDLTDDDMLAHLLKYDTMQGRFTGEVEVVDGGFRVNGKEVKSFSEPDA
SKLPWKDLNIDVVLECTGFYTDKDKAQAHIEAGAKKVLISAPATGDLKTIVFNTNHQELDGSETVVSGASCTTNSLAPVA
KVLNDDFGLVEGLMTTIHAYTGDQNTQDAPHRKGDKRRARAAAENIIPNSTGAAKAIGKVIPEIDGKLDGGAQRVPVATG
SLTELTVVLEKQDVTVEQVNEAMKNASNESFGYTEDEIVSSDVVGMTYGSLFDATQTRVMSVGDRQLVKVAAWYDNEMSY
TAQLVRTLAYLAELSK
>Q6GIL8 1.2.1.12~~~gapA1~~~Glyceraldehyde-3-phosphate dehydrogenase 1~~~
MAVKVAINGFGRIGRLAFRRIQEVEGLEVVAVNDLTDDDMLAHLLKYDTMQGRFTGEVEVVDGGFRVNGKEVKSFSEPDA
SKLPWKDLNIDVVLECTGFYTDKDKAQAHIEAGAKKVLISAPATGDLKTIVFNTNHQELDGSETVVSGASCTTNSLAPVA
KVLNDDFGLVEGLMTTIHAYTGDQNTQDAPHRKGDKRRARAAAENIIPNSTGAAKAIGKVIPEIDGKLDGGAQRVPVATG
SLTELTVVLEKQDVTVEQVNEAMKNASNESFGYTEDEIVSSDVVGMTYGSLFDATQTRVMSVGDRQLVKVAAWYDNEMSY
TAQLVRTLAYLAELSK
>P54226 1.2.1.12~~~gap1~~~Glyceraldehyde-3-phosphate dehydrogenase 1~~~
MTVRIGINGFGRIGRNVFRAAAARSSELEIVAVNDLGDVPTMAHLLAYDSILGRFPEEVTAEPGAIRVGDRTIKVLAERD
PGALPWGDLGVDIVIESTGIFTDAAKARSHVDGGAKKVIIAAPASGEDFTVVLGVNDGDYDPERHTIISNASCTTNCLGV
LAKVLHDAVGIDSGMMTTVHAYTQDQNLQDAPHKDLRRARAAALNIVPTSSGAAKAIGLVLPELAGRLDAFALRVPVPTG
SVTDLTVTTRRGTSVEEVKEAYAAAASGPYKGLLSYVDAPLVSTDIVGDPASLFDAGLTRVCGPQVKVVGWYDNEWGYSN
RLIDLATLIGSSL
>Q82IZ2 1.2.1.12~~~gap1~~~Glyceraldehyde-3-phosphate dehydrogenase 1~~~COG0057
MTVRVGINGFGRIGRNVFRAAATRGADLEIVAVNDLGDVATMAHLLAYDSILGRFPEEVTAEPGAIRAGDTTVKVLAERD
PAALPWGDLGVDVVIESTGIFTDAAKARAHVDGGAKKVIIAAPASNEDVTVVLGVNQDAYDPERHTIISNASCTTNCLGV
LAKVLHDAVGIESGMMTTVHAYTQDQNLQDAPHKDLRRARAAGLNIVPTSSGAAKAIGLVLPELQGRLDAFALRVPVPTG
SVTDLTVTASRSTTVEEVKEAYAKAAAGAYKGLLSYTEAPIVSTDIAGDPASCVFDAELTRVLGSQVKVVGWYDNEWGYS
NRLIDLALLVGDTL
>O34425 1.2.1.59~~~gapB~~~Glyceraldehyde-3-phosphate dehydrogenase 2~~~COG0057
MKVKVAINGFGRIGRMVFRKAMLDDQIQVVAINASYSAETLAHLIKYDTIHGRYDKEVVAGEDSLIVNGKKVLLLNSRDP
KQLPWREYDIDIVVEATGKFNAKDKAMGHIEAGAKKVILTAPGKNEDVTIVMGVNEDQFDAERHVIISNASCTTNCLAPV
VKVLDEEFGIESGLMTTVHAYTNDQKNIDNPHKDLRRARACGESIIPTTTGAAKALSLVLPHLKGKLHGLALRVPVPNVS
LVDLVVDLKTDVTAEEVNEAFKRAAKTSMYGVLDYSDEPLVSTDYNTNPHSAVIDGLTTMVMEDRKVKVLAWYDNEWGYS
CRVVDLIRHVAARMKHPSAV
>P58554 1.2.1.59~~~gap2~~~Glyceraldehyde-3-phosphate dehydrogenase 2~~~COG0057
MIRVAINGFGRIGRNFARCWLGRENTNIELVAVNDTSDPRTNAHLLKYDSMLGKLKNVDITADDNSITVNGKTIKCVSDR
NPENLPWKEWEIDLIIEATGVFVSKEGATKHINAGAKKVLITAPGKNEDGTFVMGVNHHDYDHNLHNIISNASCTTNCLA
PIAKVLNDKFGIIKGSMTTTHSYTGDQRLLDASHRDLRRARAAAINIVPTSTGAAKAVALVIPELKGKLNGVALRVPTPN
VSMVDFVVQVEKRTITEEVNQALKDASEGPLKGILDYSELQLVSSDYQGTDASSIVDANLTLVMGNDLVKVMAWYDNEWG
YSQRVLDLAELVAEKWV
>P99067 1.2.1.12~~~gapA2~~~Glyceraldehyde-3-phosphate dehydrogenase 2~~~
MSTNIAINGMGRIGRMVLRIALQNKNLNVVAINASYPPETIAHLINYDTTHGKYNLKVEPIENGLQVGDHKIKLVADRNP
ENLPWKELDIDIAIDATGKFNHGDKAIAHIKAGAKKVLLTGPSKGGHVQMVVKGVNDNQLDIEAFDIFSNASCTTNCIGP
VAKVLNNQFGIVNGLMTTVHAITNDQKNIDNPHKDLRRARSCNESIIPTSTGAAKALKEVLPELEGKLHGMALRVPTKNV
SLVDLVVDLEKEVTAEEVNQAFENAGLEGIIEVEHQPLVSVDFNTNPNSAIIDAKSTMVMSGNKVKVIAWYDNEWGYSNR
VVDVAEQIGALLTSKETVSAS
>E3VWI2 1.2.1.12~~~gap2~~~Glyceraldehyde-3-phosphate dehydrogenase 2~~~
MTVRIGINGFGRIGRNVFRAAAARSSELEIVAVNDLGDVPTMAHLLAYDSILGRFPEEVTAEPGAIRVGDRTIKVLAERD
PGALPWGDLGVDIVIESTGIFTDAAKARSHVDGGAKKVIIAAPASGEDFTVVLGVNDGDYDPERHTIISNASCTTNCLGV
LAKVLHDAVGIDSGMMTTVHAYTQDQNLQDAPHKDLRRARAAALNIVPTSSGAAKAIGLVLPELAGRLDAFALRVPVPTG
SVTDLTVTTRRGTSVEEVKEAYAAAASGPYKGLLSYVDAPLVSTDIVGDPASCVFDAGLTRVSGPQVKVVGWYDNEWGYS
NRLIDLATLIGSSL
>Q829W3 1.2.1.12~~~gap2~~~Glyceraldehyde-3-phosphate dehydrogenase 2~~~COG0057
MTIRVGINGFGRIGRNYFRALLEQGADIEIVAVNDLGDTATTAHLLKYDTILGRLKAEVTHTADTITVDGKTIKVFSERN
PADIPWGELNVDIVIESTGIFTKKADAEKHIAGGAKKVLISAPASDEDITIVLGVNEDKYDPAKHNVISNASCTTNCVAP
MAKVLDENFGIVKGLMTTIHAYTNDQRILDFPHKDLRRARAAAENIIPTTTGAAKATALVLPQLKGKMDGISMRVPVPTG
SATDLVVEVSREVTKDEVNAAFKKAAEGELQGYLSYTEDPIVSSDIVGDPSSCTFDSAMTMVMEGTSVKILGWYDNEWGY
SNRLVDLTVFVGNQL
>P80505 1.2.1.59~~~gap2~~~Glyceraldehyde-3-phosphate dehydrogenase 2~~~COG0057
MTRVAINGFGRIGRNFLRCWLGRTDSQLEVVGINDTSDPRTNAHLLRYDSMLGKLDADISADENSITVNGKTIKCVSDRN
PLNLPWAEWNVDLVIEATGVFVTHEGATKHVQAGAKKVLITAPGKGPNIGTYVVGVNAHEYKHEEYEVISNASCTTNCLA
PFGKVINDNFGIIKGTMTTTHSYTGDQRILDASHRDLRRARAAAVNIVPTSTGAAKAVALVIPELQGKLNGIALRVPTPN
VSVVDLVVQVEKNTIAEQVNGVLKEAANTSLKGVLEYTDLELVSSDFRGTDCSSTVDGSLTMVMGGDMVKVIAWYDNEWG
YSQRVVDLAEIVAKNWK
>P58559 1.2.1.12~~~gap3~~~Glyceraldehyde-3-phosphate dehydrogenase 3~~~COG0057
MKVRVGINGFGRMGRLALRAAWDWPELEFVHINEIKGGAVAAAHLLKFDSVHGRWTPEVEAEGERVLIDGTPLSFSEYGK
PDDVPWEDFGVDLVLECSGKFRTPATLDPYFKRGVQKVIVAAPVKEEALNIVMGVNDYLYEPEKHHLLTAASCTTNCLAP
VVKVIHEGLGIKHGIITTIHDNTNTQTLVDAPHKDLRRARATSLSLIPTTTGSATAIALIYPELKGKLNGIAVRVPLLNA
SLTDCVFEVTRPTTVEEINALLKAASEQAPLQGILGYEERPLVSIDYKDDPRSSIIDALSTMVVDETQVKILAWYDNEWG
YVNRMVELARKVALSLK
>P34918 1.2.1.12~~~gap3~~~Glyceraldehyde-3-phosphate dehydrogenase 3~~~COG0057
MKIRVGINGFGRMGRLALRAAWGWPELEFVHINEIKGGAVAAAHLLKFDSVHGRWTPEVEAEGERVLIDSTPLSFSEYGK
PEDVPWEDFGVDLVLECSGKFRTPATLDPYFKRGVQKVIVAAPVKEEALNIVMGVNDYLYEPEKHHLLTAASCTTNCLAP
VVKVIHEGLGIKHGIITTIHDNTNTQTLVDAPHKDLRRARATSLSLIPTTTGSATAIALIYPELKGKLNGIAVRVPLLNA
SLTDCVFEVNRPTTVEEINALLKAASEQAPLQGILGYEERPLVSIDYKDDPRSSIIDALSTMVVDETQVKILAWYDNEWG
YVNRMVELARKVALSLK
>O33194 ~~~~~~D-glycerol 3-phosphate phosphatase~~~COG0647
MKSIAQEHDCLLIDLDGTVFCGRQPTGGAVQSLSQVRSRKLFVTNNASRSADEVAAHLCELGFTATGEDVVTSAQSAAHL
LAGQLAPGARVLIVGTEALANEVAAVGLRPVRRFEDRPDAVVQGLSMTTGWSDLAEAALAIRAGALWVAANVDPTLPTER
GLLPGNGSMVAALRTATGMDPRVAGKPAPALMTEAVARGDFRAALVVGDRLDTDIEGANAAGLPSLMVLTGVNSAWDAVY
AEPVRRPTYIGHDLRSLHQDSKLLAVAPQPGWQIDVGGGAVTVCANGDVDDLEFIDDGLSIVRAVASAVWEARAADLHQR
PLRIEAGDERARAALQRWSLMRSDHPVTSVGTQ
>O67161 1.2.1.12~~~gap~~~Glyceraldehyde-3-phosphate dehydrogenase~~~COG0057
MAIKVGINGFGRIGRSFFRASWGREEIEIVAINDLTDAKHLAHLLKYDSVHGIFKGSVEAKDDSIVVDGKEIKVFAQKDP
SQIPWGDLGVDVVIEATGVFRDRENASKHLQGGAKKVIITAPAKNPDITVVLGVNEEKYNPKEHNIISNASCTTNCLAPC
VKVLNEAFGVEKGYMVTVHAYTNDQRLLDLPHKDFRRARAAAINIVPTTTGAAKAIGEVIPELKGKLDGTARRVPVPDGS
LIDLTVVVNKAPSSVEEVNEKFREAAQKYRESGKVYLKEILQYCEDPIVSTDIVGNPHSAIFDAPLTQVIDNLVHIAAWY
DNEWGYSCRLRDLVIYLAERGL
>P46795 1.2.1.12~~~gap~~~Glyceraldehyde-3-phosphate dehydrogenase~~~
MKLAINGFGRIGRNVFKIAFERGIDIVAINDLTDPKTLAHLLKYDSTFGVYNKKVESRDGAIVVDGREIKIIAERDPKNL
PWAKLGIDVVIESTGVFSSATSDKGGYLDHVNHAGAKKVILTVPAKDEIKTIVLGVNDHDINSDLKAVSNASCTTNCLAP
LAKVLHESFGIEQGLMTTVHAYTNDQRILDLPHSDLRRARAAALSIIPTSTGAAKAVGLVLPELKGKLNGTSMRVPVPTG
SIVDLTVQLKKKDVTKEEINSVLRKASETPELKGILGYTEDPIVSSDIKGNSHSSIVDGLETMVLENGFAKILSWYDNEF
GYSTRVVDLAQKLVK
>P0CE13 1.2.1.12~~~gap~~~Glyceraldehyde-3-phosphate dehydrogenase~~~
MRIVINGFGRIGRLVLRQILKRNSPIEVVAINDLVAGDLLTYLFKYDSTHGSFAPQATFSDGCLVMGERKVHFLAEKDVQ
KLPWKDLDVDVVVESTGLFVNRDDVAKHLDSGAKRVLITAPAKGDVPTFVMGVNHQQFDPADVIISNASCTTNCLAPLAK
VLLDNFGIEEGLMTTVHAATATQSVVDGPSRKDWRGGRGAFQNIIPASTGAAKAVGLCLPELKGKLTGMAFRVPVADVSV
VDLTVKLSSATTYEAICEAVKHAANTSMKNIMYYTEEAVVSSDFIGCEYSSVFDAQAGVALNDRFFKLVAWYDNEIGYAT
RIVDLLEYVQENSK
>O52631 1.2.1.12~~~gap~~~Glyceraldehyde-3-phosphate dehydrogenase~~~COG0057
MAKIAINGFGRIGRLALRRILEVPGLEVVAINDLTDAKMLAHLFKYDSSQGRFNGEIEVKEGAFVVNGKEVKVFAEADPE
KLPWGDLGIDVVLECTGFFTKKEKAEAHVRAGAKKVVISAPAGNDLKTIVFNVNNEDLDGTETVISGASCTTNCLAPMAK
VLNDKFGIEKGFMTTIHAFTNDQNTLDGPHRKGDLRRARAAAVSIIPNSTGAAKAISQVIPDLAGKLDGNAQRVPVPTGS
ITELVSVLKKKVTVEEINAAMKEAADESFGYTEDPIVSADVVGINYGSLFDATLTKIVDVNGSQLVKTAAWYDNEMSYTS
QLVRTLAYFAKIAK
>Q59309 1.2.1.12~~~gap~~~Glyceraldehyde-3-phosphate dehydrogenase~~~
MTKVAINGFGRIGRLALRRILEVPGLEVVAINDLTDAKMLAHLFKYDSSQGRFNGEIEVKEGAFVVNGKEVKVFAEADPE
KLPWGELGIDVVLECTGFFTKKEKAEAHVRAGAKKVVISAPAGNDLKTIVFNVNNEDLDGTETVISGASCTTNCLAPMAK
VLNDKFGIEKGFMTTIHAYTNDQNTLDGPHRKGDFRRARAAAVSIIPNSTGAAKAIAQVIPELKGKLDGNAQRVPVPTGS
VTELISVLKKNVTVEEINAAMKEAANESFGYTEDEIVSADVVGISYGSLFDATLTKIVDVDGSQLVKTVSWYDNEMSYTS
QLVRTLEYFAKIAK
>P00362 1.2.1.12~~~gap~~~Glyceraldehyde-3-phosphate dehydrogenase~~~
MAVKVGINGFGRIGRNVFRAALKNPDIEVVAVNDLTDANTLAHLLKYDSVHGRLDAEVSVNGNNLVVNGKEIIVKAERDP
ENLAWGEIGVDIVVESTGRFTKREDAAKHLEAGAKKVIISAPAKNEDITIVMGVNQDKYDPKAHHVISNASCTTNCLAPF
AKVLHEQFGIVRGMMTTVHSYTNDQRILDLPHKDLRRARAAAESIIPTTTGAAKAVALVLPELKGKLNGMAMRVPTPNVS
VVDLVAELEKEVTVEEVNAALKAAAEGELKGILAYSEEPLVSRDYNGSTVSSTIDALSTMVIDGKMVKVVSWYDNETGYS
HRVVDLAAYIASKGL
>A0A0H3MAB5 1.2.1.12~~~gap~~~Glyceraldehyde-3-phosphate dehydrogenase~~~
MTVRVGINGFGRIGRNFYRALLAQQEQGTADVEVVAANDITDNSTLAHLLKFDSILGRLPCDVGLEGDDTIVVGRAKIKA
LAVREGPAALPWGDLGVDVVVESTGLFTNAAKAKGHLDAGAKKVIISAPATDEDITIVLGVNDDKYDGSQNIISNASCTT
NCLAPLAKVLDDEFGIVKGLMTTIHAYTQDQNLQDGPHKDLRRARAAALNIVPTSTGAAKAIGLVMPQLKGKLDGYALRV
PIPTGSVTDLTVDLSTRASVDEINAAFKAAAEGRLKGILKYYDAPIVSSDIVTDPHSSIFDSGLTKVIDDQAKVVSWYDN
EWGYSNRLVDLVTLVGKSL
>P47543 1.2.1.12~~~gapA~~~Glyceraldehyde-3-phosphate dehydrogenase~~~COG0057
MAAKNRTIKVAINGFGRIGRLVFRSLLSKANVEVVAINDLTQPEVLAHLLKYDSAHGELKRKITVKQNILQIDRKKVYVF
SEKDPQNLPWDEHDIDVVIESTGRFVSEEGASLHLKAGAKRVIISAPAKEKTIRTVVYNVNHKTISSDDKIISAASCTTN
CLAPLVHVLEKNFGIVYGTMLTVHAYTADQRLQDAPHNDLRRARAAAVNIVPTTTGAAKAIGLVVPEANGKLNGMSLRVP
VLTGSIVELSVVLEKSPSVEQVNQAMKRFASASFKYCEDPIVSSDVVSSEYGSIFDSKLTNIVEVDGMKLYKVYAWYDNE
SSYVHQLVRVVSYCAKL
>P75358 1.2.1.12~~~gapA~~~Glyceraldehyde-3-phosphate dehydrogenase~~~
MLAKSKTIRVAINGFGRIGRLVFRALLSQKNIEIVAVNDLTHPDTLAHLLKYDSAHGEFKKKVVAKDNTLMIDKKKVLVF
SEKDPANLPWAEHNIDIVVESTGRFVSEEGASLHLQAGAKRVIISAPAKQKTIKTVVYNVNHKIINAEDKIISAASCTTN
CLAPMVHVLEKNFGILHGTMVTVHAYTADQRLQDAPHSDLRRARAAACNIVPTTTGAAKAIGLVVPEATGKLNGMALRVP
VLTGSIVELCVALEKDATVEQINQAMKKAASASFRYCEDEIVSSDIVGSEHGSIFDSKLTNIIEVDGNKLYKVYAWYDNE
SSYVNQLVRVVNYCAKL
>A0QWW2 1.2.1.12~~~gapA~~~Glyceraldehyde-3-phosphate dehydrogenase~~~COG0057
MTIRVGVNGFGRIGRNFYRALATQKAEGKNTDIEIVAVNDLTDNATLAHLLKFDSILGRLPQDVSLEGDDTIVIGDTKIK
ALEVKEGPAALPWGDLGVDVVVESTGIFTNAAKAKGHLDAGAKKVIISAPATDEDITIVLGVNDDKYDGSQNIISNASCT
TNCLGPLAKVLNDEFGIVKGLMTTIHAYTQDQNLQDGPHKDLRRARAAALNIVPTSTGAAKAIGLVLPELKGKLDGYALR
VPIPTGSVTDLTAELAKSASVEDINAAMKAAAEGPLKGILKYYDAPIVSSDIVTDPHSSLYDAGLTKVIDNQAKVVSWYD
NEWGYSNRLADLVALVGKSL
>P9WN83 1.2.1.12~~~gap~~~Glyceraldehyde-3-phosphate dehydrogenase~~~COG0136
MTVRVGINGFGRIGRNFYRALLAQQEQGTADVEVVAANDITDNSTLAHLLKFDSILGRLPCDVGLEGDDTIVVGRAKIKA
LAVREGPAALPWGDLGVDVVVESTGLFTNAAKAKGHLDAGAKKVIISAPATDEDITIVLGVNDDKYDGSQNIISNASCTT
NCLAPLAKVLDDEFGIVKGLMTTIHAYTQDQNLQDGPHKDLRRARAAALNIVPTSTGAAKAIGLVMPQLKGKLDGYALRV
PIPTGSVTDLTVDLSTRASVDEINAAFKAAAEGRLKGILKYYDAPIVSSDIVTDPHSSIFDSGLTKVIDDQAKVVSWYDN
EWGYSNRLVDLVTLVGKSL
>Q59906 1.2.1.12~~~gap~~~Glyceraldehyde-3-phosphate dehydrogenase~~~
MVVKVGINGFGRIGRLAFRRIQNVEGVEVTRINDLTDPNMLAHLLKYDTTQGRFDGTVEVKEGGFEVNGNFIKVSAERDP
ENIDWATDGVEIVLEATGFFAKKEAAEKPLHANGAKKVVITAPGGNDVKQLFSTLTTSILDGTETVISGASCTTNCLAPM
AKALHDAFGIQKGLMTTIHAYTGDQMIVDGHRGGGDLRRARAGAANIVPNSTGARKAIGLVIPELNGKLDGAAQRVPVPT
GSVTELVVTLDKNVSVDEINAAMKAASNDSFGYTEDPIVSSDIVGVSYGSLFDATQTKVMEVDGSQLVKVVSWYDNEMSY
TAQLVRTLEYFAKIAK
>Q5XDW3 1.2.1.12~~~gap~~~Glyceraldehyde-3-phosphate dehydrogenase~~~
MVVKVGINGFGRIGRLAFRRIQNIEGVEVTRINDLTDPNMLAHLLKYDTTQGRFDGTVEVKEGGFEVNGNFIKVSAERDP
ENIDWATDGVEIVLEATGFFAKKEAAEKHLHANGAKKVVITAPGGNDVKTVVFNTNHDILDGTETVISGASCTTNCLAPM
AKALHDAFGIQKGLMTTIHAYTGDQMILDGPHRGGDLRRARAGAANIVPNSTGAAKAIGLVIPELNGKLDGAAQRVPVPT
GSVTELVVTLDKNVSVDEINAAMKAASNDSFGYTEDPIVSSDIVGVSYGSLFDATQTKVMEVDGSQLVKVVSWYDNEMSY
TAQLVRTLEYFAKIAK
>P68777 1.2.1.12~~~gap~~~Glyceraldehyde-3-phosphate dehydrogenase~~~
MVVKVGINGFGRIGRLAFRRIQNIEGVEVTRINDLTDPNMLAHLLKYDTTQGRFDGTVEVKEGGFEVNGNFIKVSAERDP
ENIDWATDGVEIVLEATGFFAKKEAAEKHLHANGAKKVVITAPGGNDVKTVVFNTNHDILDGTETVISGASCTTNCLAPM
AKALHDAFGIQKGLMTTIHAYTGDQMILDGPHRGGDLRRARAGAANIVPNSTGAAKAIGLVIPELNGKLDGAAQRVPVPT
GSVTELVVTLDKNVSVDEINAAMKAASNDSFGYTEDPIVSSDIVGVSYGSLFDATQTKVMEVDGSQLVKVVSWYDNEMSY
TAQLVRTLEYFAKIAK
>P0C0G6 1.2.1.12~~~gap~~~Glyceraldehyde-3-phosphate dehydrogenase~~~COG0057
MVVKVGINGFGRIGRLAFRRIQNIEGVEVTRINDLTDPNMLAHLLKYDTTQGRFDGTVEVKEGGFEVNGNFIKVSAERDP
ENIDWATDGVEIVLEATGFFAKKEAAEKHLHANGAKKVVITAPGGNDVKTVVFNTNHDILDGTETVISGASCTTNCLAPM
AKALHDAFGIQKGLMTTIHAYTGDQMILDGPHRGGDLRRARAGAANIVPNSTGAAKAIGLVIPELNGKLDGAAQRVPVPT
GSVTELVVTLDKNVSVDEINSAMKAASNDSFGYTEDPIVSSDIVGVSYGSLFDATQTKVMEVDGSQLVKVVSWYDNEMSY
TAQLVRTLEYFAKIAK
>P00361 1.2.1.12~~~gap~~~Glyceraldehyde-3-phosphate dehydrogenase~~~
MKVGINGFGRIGRQVFRILHSRGVEVALINDLTDNKTLAHLLKYDSIYHRFPGEVAYDDQYLYVDGKAIRATAVKDPKEI
PWAEAGVGVVIESTGVFTDADKAKAHLEGGAKKVIITAPAKGEDITIVMGVNHEAYDPSRHHIISNASCTTNSLAPVMKV
LEEAFGVEKALMTTVHSYTNDQRLLDLPHKDLRRARAAAINIIPTTTGAAKATALVLPSLKGRFDGMALRVPTATGSISD
ITALLKREVTAEEVNAALKAAAEGPLKGILAYTEDEIVLQDIVMDPHSSIVDAKLTKALGNMVKVFAWYDNEWGYANRVA
DLVELVLRKGV
>P17721 1.2.1.12~~~gap~~~Glyceraldehyde-3-phosphate dehydrogenase~~~COG0057
MARVAINGFGRIGRLVYRIIYERKNPDIEVVAINDLTDTKTLAHLLKYDSVHKKFPGKVEYTENSLIVDGKEIKVFAEPD
PSKLPWKDLGVDFVIESTGVFRNREKAELHLQAGAKKVIITAPAKGEDITVVIGCNEDQLKPEHTIISCASCTTNSIAPI
VKVLHEKFGIVSGMLTTVHSYTNDQRVLDLPHKDLRRARAAAVNIIPTTTGAAKAVALVVPEVKGKLDGMAIRVPTPDGS
ITDLTVLVEKETTVEEVNAVMKEATEGRLKGIIGYNDEPIVSSDIIGTTFSGIFDATITNVIGGKLVKVASWYDNEYGYS
NRVVDTLELLLKM
>P15115 1.2.1.12~~~gap~~~Glyceraldehyde-3-phosphate dehydrogenase~~~
MAVKVGINGFGRIGRNVFRAAVKNPDIEVVAVNDLTANADGLAHLLKYDSVHGRLDAEVVVNDGVSVNGKEIIVKAERNP
ENLAWGEIGVDIVVESTGRFTKREDAAKHLEAGAKKVIISAPAKVENITVVMGVNQDKYDADAHHVISNASCTTICLAAF
ARVLHQIFGEVSRMMTTAHSYTNIQRILDAATHADLRGARAAAESIIDTTNGAAMAVALVLPELKGKLNGMAMRVATANV
SVVDLVYELAKEVTVEEVNAALKAIAEGELKGILAYSIEPLVIRNYNGSTVSSTIDILSTMVIDGAMVKVVSWYDNETGY
SHRVVALAAYINAKGL
>P9WN71 1.1.1.49~~~zwf1~~~Glucose-6-phosphate 1-dehydrogenase 1~~~COG0364
MVDGGGGASDLLVIFGITGDLARKMTFRALYRLERHQLLDCPILGVASDDMSVGQLVKWARESIGRTEKIDDAVFDRLAG
RLSYLHGDVTDSQLYDSLAELIGSACRPLYYLEMPPALFAPIVENLANVRLLERARVAVEKPFGHDLASALELNARLRAV
LGEDQILRVDHFLGKQPVVELEYLRFANQALAELWDRNSISEIHITMAEDFGVEDRGKFYDAVGALRDVVQNHLLQVLAL
VTMEPPVGSSADDLNDKKAEVFRAMAPLDPDRCVRGQYLGYTEVAGVASDSATETYVALRTEIDNWRWAGVPIFVRAGKE
LPAKVTEVRLFLRRVPALAFLPNRRPAEPNQIVLRIDPDPGMRLQISAHTDDSWRDIHLDSSFAVDLGEPIRPYERLLYA
GLVGDHQLFAREDSIEQTWRIVQPLLDNPGEIHRYDRGSWGPEAAQSLLRGHRGWQSPWLPRGTDA
>P9WN73 1.1.1.49~~~zwf2~~~Glucose-6-phosphate 1-dehydrogenase 2~~~COG0364
MKPAHAAASWRNPLRDKRDKRLPRIAGPCGMVIFGVTGDLARKKVMPAVYDLANRGLLPPTFSLVGFARRDWSTQDFGQV
VYNAVQEHCRTPFRQQNWDRLAEGFRFVPGTFDDDDAFAQLAETLEKLDAERGTGGNHAFYLAIPPKSFPVVCEQLHKSG
LARPQGDRWSRVVIEKPFGHDLASARELNKAVNAVFPEEAVFRIDHYLGKETVQNILALRFANQLFDPIWNAHYVDHVQI
TMAEDIGLGGRAGYYDGIGAARDVIQNHLMQLLALTAMEEPVSFHPAALQAEKIKVLSATRLAEPLDQTTSRGQYAAGWQ
GGEKVVGLLDEEGFAEDSTTETFAAITLEVDTRRWAGVPFYLRTGKRLGRRVTEIALVFRRAPHLPFDATMTDELGTNAM
VIRVQPDEGVTLRFGSKVPGTAMEVRDVNMDFSYGSAFAEDSPEAYERLILDVLLGEPSLFPVNAEVELAWEILDPALEH
WAAHGTPDAYEAGTWGPESSLEMLRRTGREWRRP
>P54547 1.1.1.49~~~zwf~~~Glucose-6-phosphate 1-dehydrogenase~~~COG0364
MKTNQQPKAVIVIFGATGDLAKRKLYPSIHRLYQNGQIGEEFAVVGVGRRPWSNEDLRQTVKTSISSSADKHIDDFTSHF
YYHPFDVTNPGSYQELNVLLNQLEDTYQIPNNRMFYLAMAPEFFGTIAKTLKSEGVTATTGWSRLVIEKPFGHDLPSAQA
LNKEIREAFTEDQIYRIDHYLGKQMVQNIEVIRFANAIFEPLWTNRYISNIQITSSESLGVEDRARYYEKSGALRDMVQN
HIMQMVALLAMEPPIKLNTEEIRSEKVKVLRALRPIAKDEVDEYFVRGQYHAGEIDGVPVPAYTDEDNVAPDSNTETFVA
GKLLIDNFRWAGVPFYIRTGKRMKEKSTKIVVQFKDIPMNLYYGNENNMNPNLLVIHIQPDEGITLYLNAKKLGGAAHAQ
PIKLDYCSNCNDELNTPEAYEKLIHDCLLGDATNFAHWDEVALSWSFVDSISETWAANKTLSPNYESGSMGPKESDDLLV
KDGLHWWNI
>P0AC53 1.1.1.49~~~zwf~~~Glucose-6-phosphate 1-dehydrogenase~~~COG0364
MAVTQTAQACDLVIFGAKGDLARRKLLPSLYQLEKAGQLNPDTRIIGVGRADWDKAAYTKVVREALETFMKETIDEGLWD
TLSARLDFCNLDVNDTAAFSRLGAMLDQKNRITINYFAMPPSTFGAICKGLGEAKLNAKPARVVMEKPLGTSLATSQEIN
DQVGEYFEECQVYRIDHYLGKETVLNLLALRFANSLFVNNWDNRTIDHVEITVAEEVGIEGRWGYFDKAGQMRDMIQNHL
LQILCMIAMSPPSDLSADSIRDEKVKVLKSLRRIDRSNVREKTVRGQYTAGFAQGKKVPGYLEEEGANKSSNTETFVAIR
VDIDNWRWAGVPFYLRTGKRLPTKCSEVVVYFKTPELNLFKESWQDLPQNKLTIRLQPDEGVDIQVLNKVPGLDHKHNLQ
ITKLDLSYSETFNQTHLADAYERLLLETMRGIQALFVRRDEVEEAWKWVDSITEAWAMDNDAPKPYQAGTWGPVASVAMI
TRDGRSWNEFE
>Q5FUK8 1.1.1.49~~~zwf~~~Glucose-6-phosphate 1-dehydrogenase~~~COG0364
MEHFQQVEPFDYVIFGATGDLTMRKLLPALYNRLRMGQIPDDACIIGAARTELDREAYVARARDALERFLPSDILGPGLV
ERFLARLDYVTLDSSREGPQWDALKSLLAKAQPDRVRVYYFATAPQLYGSICENLNRYELITPTSRVVLEKPIGTNMATA
TAINDGVGQYFPEKQIYRIDHYLGKETVQNVLALRFANPLMNAAWSGEHIESVQITAVETVGVEGRAAYYDTSGALRDMI
QNHLLQVLCLVAMEAPDSLEADAVRNAKLAVLNALRPITDATAATETVRAQYTAGVVDGENVPGYLEELGKPSATETYAA
IRAWVDTPRWKNVPFYIRTAKRSGKKVSEIVVTFRPAATTMFGATPASNRLVLRIQPNEGVDLRLNVKNPALDVFNLRTA
DLDTSIRMEGGLPFPDSYERLLLDAVRGDPVLFIRRDEVEAAWRWVEPILEAWKHDKAPMQTYSAGSYGPEQATQLLASH
GDTWHEASE
>P11411 1.1.1.363~~~zwf~~~Glucose-6-phosphate 1-dehydrogenase~~~
MVSEIKTLVTFFGGTGDLAKRKLYPSVFNLYKKGYLQKHFAIVGTARQALNDDEFKQLVRDSIKDFTDDQAQAEAFIEHF
SYRAHDVTDAASYAVLKEAIEEAADKFDIDGNRIFYMSVAPRFFGTIAKYLKSEGLLADTGYNRLMIEKPFGTSYDTAAE
LQNDLENAFDDNQLFRIDHYLGKEMVQNIAALRFGNPIFDAAWNKDYIKNVQVTLSEVLGVEERAGYYDTAGALLDMIQN
HTMQIVGWLAMEKPESFTDKDIRAAKNAAFNALKIYDEAEVNKYFVRAQYGAGDSADFKPYLEELDVPADSKNNTFIAGE
LQFDLPRWEGVPFYVRSGKRLAAKQTRVDIVFKAGTFNFGSEQEAQEAVLSIIIDPKGAIELKLNAKSVEDAFNTRTIDL
GWTVSDEDKKNTPEPYERMIHDTMNGDGSNFADWNGVSIAWKFVDAISAVYTADKAPLETYKSGSMGPEASDKLLAANGD
AWVFKG
>A0QP90 1.1.1.49~~~zwf~~~Glucose-6-phosphate 1-dehydrogenase~~~COG0364
MNRTPSPVDPCDFVIFGGTGDLAARKLLPALYLRDRDGQLAGATRIIGVAKAGLDDAGYRNTVRAGLARHVEPDLLDSDV
VDRFLSRLRFVSVDLTEPSDYAAVGDVLTSPDGGSGHDIRVFYLACAPALFGPICGALGAQGLVTESSRVVLEKPIGRDL
ASAQQINEAVGAVFAEHQIFRIDHYLGKESVQQLLVTRFGNTWLEPLWNSSRIDHVQITAAESLGVGARGDYYDQSGALR
DMLQNHLLQVLCLVAMEPPTHVNRESVRDEKRKVLEALEPLTAEQTQRDTVTGQYGPGLVGDEVVGSYREEVADPHSRTE
TFVAVKAHIRNWRWAGVPFYLRTGKRMSQRFSEIVVQFKPVPLPMFPGIEGTSEPNRLIISLQPDEAIRLEMTAKEPGSG
GRLRPVSLALNYTEAFPERSPDAYERLLMDVVRGDPTLFMRRDEVEAAWAWAEPILRHWQDADRVPRTYPAGTDGPVDAA
TLIERDGRRWHGGAA
>O68282 1.1.1.363~~~zwf~~~Glucose-6-phosphate 1-dehydrogenase~~~
MPDVRVLPCTLALFGALGDLALRKLFPALYQLDRENLLHRDTRVLALARDEGAPAEHLATLEQRLRLAVPAKEWDDVVWQ
RFRERLDYLSMDFLDPQAYVGLREAVDDELPLVAYFATPASVFGGICENLAAAGLAERTRVVLEKPIGHDLESSREVNEA
VARFFPESRIYRIDHYLGKETVQNLIALRFANSLFETQWNQNHISHVEITVAEKVGIEGRWGYFDQAGQLRDMVQNHLLQ
LLCLIAMDPPSDLSADSIRDEKVKVLRALEPIPAEQLASRVVRGQYTAGFSDGKAVPGYLEEEHANRDSDAETFVALRVD
IRNWRWSGVPFYLRTGKRMPQKLSQIVIHFKEPPHYIFAPEQRSLISNRLIIRLQPDEGISLQVMTKDQGLGKGMQLRTG
PLQLSFSETYHAARIPDAYERLLLEVTQGNQYLFVRKDEVEFAWKWCDQLIAGWERLSEAPKPYPAGSWGPVASVALVAR
DGRSWYGDF
>P13376 5.3.1.9~~~pgi2~~~Glucose-6-phosphate isomerase 2~~~
MAISFDYSNALPFMQENELDYLSEFVKAAHHMLHERKGPGSDFLGWVDWPIRYDKNEFSRIKQAAERIRNHSDALVVIGI
GGSYLGARAAIEALSHTFHNQMNDTTQIYFAGQNISSTYISHLLDVLEGKDLSINVISKSGTTTEPAIAFRIFRDYMEKK
YGKEEARKRIYVTTDRTKGALKKLADQEGYETFVIPDNIGGRYSVLTAVGLLPIAVAGLNIDRMMEGAASAYHKYNNPDL
LTNESYQYAAVRNILYRKGKAIELLVNYEPSLHYVSEWWKQLFGESEGKDQKGLFPASVDFTTDLHSMGQYVQEGRRNLI
ETVLHVKKPQIELTIQEDPENIDGLNFLAGKTLDEVNKKAFQGTLLAHVDGGVPNLIVELDEMNEYTFGEMVYFFEKACG
ISGHLLGVNPFDQPGVEAYKKNMFALLGKPGFEDEKAALMKRLSK
>Q81K75 5.3.1.9~~~pgi~~~Glucose-6-phosphate isomerase~~~COG0166
MSTHVTFDYSKALSFIGEHEITYLRDAVKVTHHAIHEKTGAGNDFLGWVDLPLQYDKEEFARIQKCAEKIKNDSDILLVV
GIGGSYLGARAAIEMLNHSFYNTLSKEQRKTPQVLFVGQNISSTYMKDLMDVLEGKDFSINVISKSGTTTEPALAFRIFR
KLLEEKYGKEEARKRIYATTDKARGALKTLADNEGYETFVIPDDVGGRFSVLTPVGLLPIAVSGLNIEEMMKGAAAGRDD
FGTSELEENPAYQYAVVRNALYNKGKTIEMLINYEPALQYFAEWWKQLFGESEGKDQKGIFPSSANFSTDLHSLGQYVQE
GRRDLFETVLKVGKSTHELTIESEENDLDGLNYLAGETVDFVNTKAYEGTLLAHSDGGVPNLIVNIPELNEYTFGYLVYF
FEKACAMSGYLLGVNPFDQPGVEAYKKNMFALLGKPGFEELKAELEERLK
>P80860 5.3.1.9~~~pgi~~~Glucose-6-phosphate isomerase~~~COG0166
MTHVRFDYSKALTFFNEHELTYLRDFVKTAHHNIHEKTGAGSDFLGWVDLPEHYDKEEFARIKKSAEKIKSDSDVLLVVG
IGGSYLGARAAIEALNHAFYNTLPKAKRGNPQVIFIGNNISSSYMRDVMDLLEDVDFSINVISKSGTTTEPAIAFRIFRK
LLEEKYGKEEAKARIYATTDKERGALKTLSNEEGFESFVIPDDVGGRYSVLTAVGLLPIAVSGVNIDDMMKGALDASKDF
ATSELEDNPAYQYAVVRNVLYNKGKTIEMLINYEPALQYFAEWWKQLFGESEGKDEKGIYPSSANYSTDLHSLGQYVQEG
RRDLFETVLNVEKPKHELTIEEADNDLDGLNYLAGKTVDFVNKKAFQGTMLAHTDGNVPNLIVNIPELNAYTFGYLVYFF
EKACAMSGYLLGVNPFDQPGVEAYKVNMFALLGKPGFEEKKAELEKRLED
>Q8A5W2 5.3.1.9~~~pgi~~~Glucose-6-phosphate isomerase~~~COG0166
MISLNIEKTFGFISKEKVSAYEAEVKAAQEMLEKGTGEGNDFLGWLHLPSSISKEHLADLNATAKVLRDNCEVVIVAGIG
GSYLGARAVIEALSNSFTWLQDKKTAPVMIYAGHNISEDYLYELTEYLKDKKFGVINISKSGTTTETALAFRLLKKQCED
QRGKETAKKVIVAVTDAKKGAARVTADKEGYKTFIIPDNVGGRFSVLTPVGLLPIAVAGFDIDKLVAGAADMEKACGSDV
PFAENPAAIYAATRNELYRQGKKIEILVNFCPKLHYVSEWWKQLYGESEGKDNKGIFPASVDFSTDLHSMGQWIQEGERS
IFETVISLDKVDHKLEVPFDEANLDGLNFLAGKRVDEVNKMAELGTQLAHVDGGVPNMRIVLPELSEYNIGGLLYFFEKA
CGISGYLLGVNPFNQPGVEAYKKNMFALLNKPGYEEESKAIQAKL
>Q8YF86 5.3.1.9~~~pgi~~~Glucose-6-phosphate isomerase~~~COG0166
MARDATKLEATVAKLKKHWAESAPRDMRAAFSADPGRFGRYSLCLDDLLFDWSKCRVNDETMALLKELAVAADVEGRRAA
MFAGEHINNTEDRAVLHVALRDTSSKEVLVDGHNVLPDVKHVLDRMAAFADGIRSGALKGATGRKITDIVNIGIGGSDLG
PVMATLALAPYHDEPRAHFVSNIDGAHIADTLSPLDPASTLIIVASKTFTTIETMTNAQTARKWVADTLGEAAVGAHFAA
VSTALDKVAAFGIPEDRVFGFWDWVGGRYSVWSAIGLPVMIAVGPDNFRKFLAGAHAMDVHFRDAPLEKNLPVMLGLIGY
WHRAICGYGSRAIIPYDQRLSRLPAYLQQLDMESNGKSVTLDGKPVSGPTGPVVWGEPGTNGQHAFFQLLHQGTDTIPLE
FIVAAKGHEPTLDHQHEMLMANCLAQSEALMKGRTLDEARAQLQAKNLPASQVERIAPHRVFSGNRPSLTLIHDMLDPYT
LGRLIALYEHRVFVEAQIFGINAFDQWGVELGKELATELLPVVSGKEGASGRDASTQGLVAHLHARRKA
>P0A6T1 5.3.1.9~~~pgi~~~Glucose-6-phosphate isomerase~~~COG0166
MKNINPTQTAAWQALQKHFDEMKDVTIADLFAKDGDRFSKFSATFDDQMLVDYSKNRITEETLAKLQDLAKECDLAGAIK
SMFSGEKINRTENRAVLHVALRNRSNTPILVDGKDVMPEVNAVLEKMKTFSEAIISGEWKGYTGKAITDVVNIGIGGSDL
GPYMVTEALRPYKNHLNMHFVSNVDGTHIAEVLKKVNPETTLFLVASKTFTTQETMTNAHSARDWFLKAAGDEKHVAKHF
AALSTNAKAVGEFGIDTANMFEFWDWVGGRYSLWSAIGLSIVLSIGFDNFVELLSGAHAMDKHFSTTPAEKNLPVLLALI
GIWYNNFFGAETEAILPYDQYMHRFAAYFQQGNMESNGKYVDRNGNVVDYQTGPIIWGEPGTNGQHAFYQLIHQGTKMVP
CDFIAPAITHNPLSDHHQKLLSNFFAQTEALAFGKSREVVEQEYRDQGKDPATLDYVVPFKVFEGNRPTNSILLREITPF
SLGALIALYEHKIFTQGVILNIFTFDQWGVELGKQLANRILPELKDDKEISSHDSSTNGLINRYKAWRG
>Q5NFC4 5.3.1.9~~~pgi~~~Glucose-6-phosphate isomerase~~~COG0166
MLFCDDSKKYLKEQNINLKNEFDKDDKRVEKFSLKHQNIYFDYSKNLINDYILKSLLESAEKSSLKDKIKQMFNGAKINS
TEHRAVLHTALRDLSSTPLIVDGQDIRQEVTKEKQRVKELVEKVVSGRWRGFSGKKITDIVNIGIGGSDLGPKMVVRALQ
PYHCTDLKVHFVSNVDADSLLQALHVVDPETTLFIIASKSFSTEETLLNSISAREWLLDHYEDEKAVANHFVAISSKLDK
VKEFGIDLEHCYKMWDWVGGRYSLWSSIGMSIAFAIGYDNFEKLLAGAYSVDKHFKETEFSKNIPVIMALLASYYSCTYN
SQSQALLPYDERLCYFVDYLQQADMESNGKSVNIAGETVNYQTGVVLWGGVGTNGQHAFHQLLHQGNIFIPVDFIAIATS
HHNYDNHQQALLANCFAQSQALMFGQSYDMVYNELLKSGLNETQAKELAAHKVIPGNRPSTTILLDELSPYSLGALIALY
EHKIFVQGVLWDINSYDQWGVELGKKLGKNILKAMNDDSSDEYQNLDDSTRQLIAKVKNK
>P81181 5.3.1.9~~~pgi~~~Glucose-6-phosphate isomerase~~~COG0166
MAHIKFDYSKLTPFVAENELDEIQWQIDGAAKLLHEGKGAGSDYIGWLDLPEDYDKEEFARIQKAAKKIQSDSEVLIVIG
IGGSYLGARAAIDFLSNSFVNLQTAEERKAPRILYAGNSISSSYLADLVDYVADKDFSVNVISKSGTTTEPAIAFRVFEE
MLVKKYGREEANKRIYATTDKEKGAVKVNADANNWETFVVPDSVGGRFSVLTAVGLLPIAASGADITALMEGANAARKEY
TSTNVHENDAYAYAALRNILYRKGKFSEILINYEPSLQYFSEWWKQLAGESEGKDQKGIYPTSANFSTDLHSLGQWIQEG
TRTVFETAIRIEKPRKNINIPELDADLDGLGYLQGKDVDFVNKKAADGVLLAHTDGNVPNMIVTLPEQDEFTLGYAIYFF
ELAIGVSGYLNGINPFNQPGVEAYKKNMFALLGKPGFEELSKELNDRL
>A0R3N9 5.3.1.9~~~pgi~~~Glucose-6-phosphate isomerase~~~COG0166
MSADITETPAWQALSDHHAEIGDRHLTELFADDPARGTELALTVGDLYIDYSKHRVTRRTLDLLVDLARAAGLEERRDAM
FAGEHINTSEDRAVLHTALRLPRDAKLVVDGQDVVADVHDVLDRMGDFTDRLRSGEWTGATGERITTVVNIGIGGSDLGP
VMVYDALRHYADAGISARFVSNVDPADLVAKLDGLEPAKTLFIVASKTFSTLETLTNATAARRWLTDALGDAAVAKHFVA
VSTNKKLVDEFGINTDNMFGFWDWVGGRYSVDSAIGLSVMAVIGKERFAEFLAGFHIVDEHFRTAPLHQNAPALLGLIGL
WYSNFFGAQSRAVLPYSNDLSRFAAYLQQLTMESNGKSVRADGTPVSTDTGEIFWGEPGTNGQHAFYQLLHQGTRLVPAD
FIGFSQPTDDLPTADGTGSMHDLLMSNFFAQTQVLAFGKTADAIASEGTPADVVPHKVMPGNRPTTSILATKLTPSVVGQ
LIALYEHQVFTEGVIWGIDSFDQWGVELGKTQAKALLPVLTGDKSPAAQSDTSTDALVRRYRTERGRPA
>P9WN69 5.3.1.9~~~pgi~~~Glucose-6-phosphate isomerase~~~COG0166
MTSAPIPDITATPAWDALRRHHDQIGNTHLRQFFADDPGRGRELTVSVGDLYIDYSKHRVTRETLALLIDLARTAHLEER
RDQMFAGVHINTSEDRAVLHTALRLPRDAELVVDGQDVVTDVHAVLDAMGAFTDRLRSGEWTGATGKRISTVVNIGIGGS
DLGPVMVYQALRHYADAGISARFVSNVDPADLIATLADLDPATTLFIVASKTFSTLETLTNATAARRWLTDALGDAAVSR
HFVAVSTNKRLVDDFGINTDNMFGFWDWVGGRYSVDSAIGLSLMTVIGRDAFADFLAGFHIIDRHFATAPLESNAPVLLG
LIGLWYSNFFGAQSRTVLPYSNDLSRFPAYLQQLTMESNGKSTRADGSPVSADTGEIFWGEPGTNGQHAFYQLLHQGTRL
VPADFIGFAQPLDDLPTAEGTGSMHDLLMSNFFAQTQVLAFGKTAEEIAADGTPAHVVAHKVMPGNRPSTSILASRLTPS
VLGQLIALYEHQVFTEGVVWGIDSFDQWGVELGKTQAKALLPVITGAGSPPPQSDSSTDGLVRRYRTERGRAG
>Q5HHC2 5.3.1.9~~~pgi~~~Glucose-6-phosphate isomerase~~~
MTHIQLDFSKTLEFFGEHELKQQQEIVKSIHKTIHEGTGAGSDFLGWVDLPVDYDKEEFSRIVEASKRIKENSDVLVVIG
IGGSYLGARAAIEMLTSSFRNSNEYPEIVFVGNHLSSTYTKELVDYLADKDFSVNVISKSGTTTEPAVAFRLFKQLVEER
YGKEEAQKRIFATTDKEKGALKQLATNEGYETFIVPDDVGGRYSVLTAVGLLPIATAGINIEAMMIGAAKAREELSSDKL
EENIAYQYATIRNILYAKGYTTEMLINYEPSMQYFNEWWKQLFGESEGKDFKGIYPSSANYTTDLHSLGQYVQEGRRFLF
ETVVKVNHPKYDITIEKDSDDLDGLNYLAGKTIDEVNTKAFEGTLLAHTDGGVPNMVVNIPQLDEETFGYVVYFFELACA
MSGYQLGVNPFNQPGVEAYKQNMFALLGKPGFEDLKKELEERL
>P99078 5.3.1.9~~~pgi~~~Glucose-6-phosphate isomerase~~~
MTHIQLDFSKTLEFFGEHELKQQQEIVKSIHKTIHEGTGAGSDFLGWVDLPVDYDKEEFSRIVEASKRIKENSDVLVVIG
IGGSYLGARAAIEMLTSSFRNSNEYPEIVFVGNHLSSTYTKELVDYLADKDFSVNVISKSGTTTEPAVAFRLFKQLVEER
YGKEEAQKRIFATTDKEKGALKQLATNEGYETFIVPDDVGGRYSVLTAVGLLPIATAGINIEAMMIGAAKAREELSSDKL
EDNIAYQYATIRNILYAKGYTTEMLINYEPSMQYFNEWWKQLFGESEGKDFKGIYPSSANYTTDLHSLGQYVQEGRRFLF
ETVVKVNHPKYDITIEKDSDDLDGLNYLAGKTIDEVNTKAFEGTLLAHTDGGVPNMVVNIPQLDEETFGYVVYFFELACA
MSGYQLGVNPFNQPGVEAYKQNMFALLGKPGFEDLKKELEERL
>Q9X1A5 5.3.1.9~~~pgi~~~Glucose-6-phosphate isomerase~~~COG0166
MSLKFDFSNLFEPNISGGLTDEDVKSVEEKVTSAVRNFVENTPDFAKLDRSWIDSVKSLEDWIINFDTVVVLGIGGSGLG
NLALHYSLRPLNWNEMTREERNGYARVFVVDNVDPDLMSSVLDRIDPKTTLFNVISKSGSTAEVMATYSIARGILEAYGL
DPREHMLITTDPEKGFLRKLVKEEGFRSLEVPPGVGGRFSVLTPVGLLSAMAEGIDIDELHEGAKDAFEKSMKENILENP
AAMIALTHYLYLNKGKSISVMMAYSNRMIYLVDWYRQLWAESLGKRYNLKGEEVFTGQTPVKALGATDQHSQIQLYNEGP
NDKVITFLRVENFDREIVIPETGRAELSYLARKKLSELLLAEQTGTEEALRENNRPNMRVTFDGLTPYNVGQFFAYYEAA
TAFMGYLLEINPFDQPGVELGKKITFALMGREGYTYEIKERSKKVIIE
>Q5SLL6 5.3.1.9~~~pgi~~~Glucose-6-phosphate isomerase~~~COG0166
MLRLDTRFLPGFPEALSRHGPLLEEARRRLLAKRGEPGSMLGWMDLPEDTETLREVRRYREANPWVEDFVLIGIGGSALG
PKALEAAFNESGVRFHYLDHVEPEPILRLLRTLDPRKTLVNAVSKSGSTAETLAGLAVFLKWLKAHLGEDWRRHLVVTTD
PKEGPLRAFAEREGLKAFAIPKEVGGRFSALSPVGLLPLAFAGADLDALLMGARKANETALAPLEESLPLKTALLLHLHR
HLPVHVFMVYSERLSHLPSWFVQLHDESLGKVDRQGQRVGTTAVPALGPKDQHAQVQLFREGPLDKLLALVIPEAPLEDV
EIPEVEGLEAASYLFGKTLFQLLKAEAEATYEALAEAGQRVYALFLPEVSPYAVGWLMQHLMWQTAFLGELWEVNAFDQP
GVELGKVLTRKRLAG
>Q9KUY4 5.3.1.9~~~pgi~~~Glucose-6-phosphate isomerase~~~COG0166
MLKNINPTQTQAWKALTAHFESAQDMDLKALFAQDSERFAKYSARFGQDILVDYSKNLVNAETMQHLFALAKETDLQSAI
TAMFKGEAINQTEDRAVLHTALRNRSNSPVLVNGEDVMPAVNAVLAKMKAFSERVIGGEWKGFTGKAITDVVNIGIGGSD
LGPYMVTEALVPYKNHLTMHFVSNVDGTHMAETLKNVDPETTLFLVASKTFTTQETMTNAHTARDWFLKAAGDEAHVAKH
FAALSTNGKAVAEFGIDTDNMFEFWDWVGGRYSLWSAIGLSIILSIGYDNFVELLAGAHEMDQHFVNTPFESNIPVILAL
IGIWYNNFHGAESEAILPYDQYLHRFAAYFQQGNMESNGKYVDRNGNPVTYQTGPIIWGEPGTNGQHAFYQLIHQGTKLI
PCDFIAPAVSHNLVGDHHQKLMSNFFAQTEALAFGKSAQAVQAELEKAGKSAAEIAALVPFKVFEGNRPTNSILVKQITP
RTLGNLIAMYEHKIFVQGVIWNIFSFDQWGVELGKQLANQILPELADSAAVTSHDSSTNGLINAFKAFRA
>P0A0T1 5.3.1.9~~~pgi~~~Glucose-6-phosphate isomerase~~~
MTQTNGFDALHAHAQRLRGAAIPALLAAEPERPTQYARQVGPLYFNFARQKYDRAALDALFAIARERDLSGAFQRLFRGE
QVNVTEQRAALHTALRGDLTDAPVASEAYATAEEVRQRMGSLIQQLEATDVTDIVSVGIGGSDLGPRLVADALRAPSGAR
FRVHFVSNVDGAAMQRTLATLDPARTAGILISKTFGTQETLLNGSILHAWLGGSERLYAVSANPERAAKAFDIAPGRVLP
MWDWVGGRYSLWSAVGFPIALAIGFERFEQLLEGAAQFDAHVLNTPLEENVAVLHGLTAVWNRNLLGSATHAVMTYDQRL
ALLPAYLQQLVMESLGKRVKLDGSAVDSDTVSVWWGGAGTDVQHSFFQALHQGTSVVPADFIGTVHNDDPYAENHTALMA
NVLAQTEALANGQDSSDPHRSYPGGRPSTVILLDALTPQALGALISMYEHSVYVQSVMWGINAFDQFGVELGKQLASQLL
PALKGESVDVADPVTRELLNKLRG
>Q9L5D6 3.5.1.93~~~~~~Glutaryl-7-aminocephalosporanic-acid acylase~~~
MLRVLHRAASALVMATVIGLAPGVAFALAEPTSTPQAPIAAYKPRSNEILWDGYGVPHIYGVDAPSAFYGYGWAQARSHG
DNILRLYGEARGKGAEYWGPDYEQTTVWLLTNGVPERAQQWYAQQSPDFRANLDAFAAGINAYAQQNPDDISPEVRQVLP
VSGADVVAHAHRLMNFLYVASPGRTLGEGDPPDLADQGSNSWAVAPGKTANGNALLLQNPHLSWTTDYFTYYEAHLVTPD
FEIYGATQIGLPVIRFAFNQRMGITNTVNGMVGATNYRLTLQDGGYLYDGQVRPFERRQASYRLRQADGSTVDKPLEIRS
SVHGPVFERADGTAVAVRVAGLDRPGMLEQYFDMITAHSFDDYEAAMARMQVPTFNIVYADREGTINYSFNGVAPKRAEG
DIAFWQGNVPGDSSRYLWTETHPLDDLPRVTNPPGGFVQNSNDPPWTPTWPVTYTPRDHPSYLAPQTPHSLRAQQSVRLM
SENDDLTLERFMALQFSHRAVMADRTLPDLIPAALIDPDPEVQAAARLLAAWDREFTSDSRAALLFEEWARLFAGQNFAG
QAAFATPWSLDKPVSTPYGVRDPKAAVDQLRTAIANTKRKYGAIDRPFGDASRMILNDVNVPGAAGYGNLGSFRVFTWSD
PDENGIRTPVHGETWVAMIEFSTPVRAYGLMSYGNSRQPGTTHYSDQIERVSRADFRELLLRREQVEAAVQERTPFNFKP
>P07662 3.5.1.93~~~~~~Glutaryl-7-aminocephalosporanic-acid acylase~~~
MLRVLHRAASALVMATVIGLAPAVAFALAEPTSTPQAPIAAYKPRSNEILWDGYGVPHIYGVDAPSAFYGYGWAQARSHG
DNILRLYGEARGKGAEYWGPDYEQTTVWLLTNGVPERAQQWYAQQSPDFRANLDAFAAGINAYAQQNPDDISPEVRQVLP
VSGADVVAHAHRLMNFLYVASPGRTLGEGDPPDLADQGSNSWAVAPGKTANGNALLLQNPHLSWTTDYFTYYEAHLVTPD
FEIYGATQIGLPVIRFAFNQRMGITNTVNGMVGATNYRLTLQDGGYLYDGQVRPFERPQASYRLRQADGTTVDKPLEIRS
SVHGPVFERADGTAVAVRVAGLDRPGMLEQYFDMITADSFDDYEAALARMQVPTFNIVYADREGTINYSFNGVAPKRAEG
DIAFWQGLVPGDSSRYLWTETHPLDDLPRVTNPPGGFVQNSNDPPWTPTWPVTYTPKDFPSYLAPQTPHSLRAQQSVRLM
SENDDLTLERFMALQLSHRAVMADRTLPDLIPAALIDPDPEVQAAARLLAAWDREFTSDSRAALLFEEWARLFAGQNFAG
QAGFATPWSLDKPVSTPYGVRDPKAAVDQLRTAIANTKRKYGAIDRPFGDASRMILNDVNVPGAAGYGNLGSFRVFTWSD
PDENGVRTPVHGETWVAMIEFSTPVRAYGLMSYGNSRQPGTTHYSDQIERVSRADFRELLLRREQVEAAVQERTPFNFKP
>Q9I6R6 ~~~~~~Gamma-aminobutyric acid-binding protein~~~
MFKSLHQYAHVFSRLSLFGLAFAAAAQAQSQSLTVISFGGATKAAQEQAYFKPFERSGGGQVVAGEYNGEMAKVKAMVDV
GKVSWDVVEVESPELLRGCDEGLFERLDPARFGDPAQFVPGTFSECGVATYVWSMVMAYDSTKLARAPQSWADFWNVREF
PGKRGLRKGAKYTLEVALLADGVKAEDLYKVLATPEGVSRAFAKLDQLKPNIQWWEAGAQPPQWLAAGDVVMSAAYNGRI
AAAQKEGVKLAIVWPGSLYDPEYWAVVKGTPNKALAEKFIAFASQPQTQKVFSEQIPYGPVHKGTLALLPKTVQEALPTA
PANLEGARAVDAEFWVDHGEELEQRFNAWAAR
>P9WNX9 1.2.1.79~~~gabD1~~~Succinate-semialdehyde dehydrogenase [NADP(+)] 1~~~COG1012
MPIATINPATGETVKTFTAATDDEVDAAIARAHRRFADYRQTSFAQRARWANATADLLEAEADQAAAMMTLEMGKTLAAA
KAEALKCAKGFRYYAENAEALLADEPADAAKVGASAAYGRYQPLGVILAVMPWNFPLWQAVRFAAPALMAGNVGLLKHAS
NVPQCALYLADVIARGGFPDGCFQTLLVSSGAVEAILRDPRVAAATLTGSEPAGQSVGAIAGNEIKPTVLELGGSDPFIV
MPSADLDAAVSTAVTGRVQNNGQSCIAAKRFIVHADIYDDFVDKFVARMAALRVGDPTDPDTDVGPLATEQGRNEVAKQV
EDAAAAGAVIRCGGKRLDRPGWFYPPTVITDISKDMALYTEEVFGPVASVFRAANIDEAVEIANATTFGLGSNAWTRDET
EQRRFIDDIVAGQVFINGMTVSYPELPFGGVKRSGYGRELSAHGIREFCNIKTVWIA
>P9WNX7 1.2.1.79~~~gabD2~~~Putative succinate-semialdehyde dehydrogenase [NADP(+)] 2~~~COG1012
MPAPSAEVFDRLRNLAAIKDVAARPTRTIDEVFTGKPLTTIPVGTAADVEAAFAEARAAQTDWAKRPVIERAAVIRRYRD
LVIENREFLMDLLQAEAGKARWAAQEEIVDLIANANYYARVCVDLLKPRKAQPLLPGIGKTTVCYQPKGVVGVISPWNYP
MTLTVSDSVPALVAGNAVVLKPDSQTPYCALACAELLYRAGLPRALYAIVPGPGSVVGTAITDNCDYLMFTGSSATGSRL
AEHAGRRLIGFSAELGGKNPMIVARGANLDKVAKAATRACFSNAGQLCISIERIYVEKDIAEEFTRKFGDAVRNMKLGTA
YDFSVDMGSLISEAQLKTVSGHVDDATAKGAKVIAGGKARPDIGPLFYEPTVLTNVAPEMECAANETFGPVVSIYPVADV
DEAVEKANDTDYGLNASVWAGSTAEGQRIAARLRSGTVNVDEGYAFAWGSLSAPMGGMGLSGVGRRHGPEGLLKYTESQT
IATARVFNLDPPFGIPATVWQKSLLPIVRTVMKLPGRR
>P94428 1.2.1.79~~~gabD~~~Succinate-semialdehyde dehydrogenase [NADP(+)]~~~COG1012
MPDQLTVYNPATGEEIKTIPQQSATEVEEAIERSHQAFKTWSKTSANERTSLLKKWYELIVEHKEELADLITKENGKPYQ
EAVGEVLYGAGYIEWFAEEAKRVYGRTVPAPTTGKRIVVTRQPVGPVAAITPWNFPNAMITRKAAPALAAGCTFIIKPAP
DTPLSAYELARLAYEAGIPKDVLQVVIGDGEEIGNVFTSSPKIRKITFTGSTPVGKILMKNSADTVKHVSMELGGHAPLI
VDEDADIDLAVEQAMASKYRNAGQTCVCANRLIVHESIKDEFAAKLSEQVSKLKVGNGLEEGVNVGPIINKRGFEKIVSQ
IDDAVEKGAKVIAGGTYDRNDDKGCYFVNPTVLTDVDTSMNIMHEETFGPVAPIVTFSDIDEAIQLANDTPYGLAAYFFT
ENYRRGIYISENLEYGIIGWNDGGPSAVQAPFGGMKESGIGREGGSEGIEPYLETKYLSIGL
>P25526 1.2.1.79~~~gabD~~~Succinate-semialdehyde dehydrogenase [NADP(+)] GabD~~~COG1012
MKLNDSNLFRQQALINGEWLDANNGEAIDVTNPANGDKLGSVPKMGADETRAAIDAANRALPAWRALTAKERATILRNWF
NLMMEHQDDLARLMTLEQGKPLAEAKGEISYAASFIEWFAEEGKRIYGDTIPGHQADKRLIVIKQPIGVTAAITPWNFPA
AMITRKAGPALAAGCTMVLKPASQTPFSALALAELAIRAGVPAGVFNVVTGSAGAVGNELTSNPLVRKLSFTGSTEIGRQ
LMEQCAKDIKKVSLELGGNAPFIVFDDADLDKAVEGALASKFRNAGQTCVCANRLYVQDGVYDRFAEKLQQAVSKLHIGD
GLDNGVTIGPLIDEKAVAKVEEHIADALEKGARVVCGGKAHERGGNFFQPTILVDVPANAKVSKEETFGPLAPLFRFKDE
ADVIAQANDTEFGLAAYFYARDLSRVFRVGEALEYGIVGINTGIISNEVAPFGGIKASGLGREGSKYGIEDYLEIKYMCI
GL
>P46349 ~~~gabP~~~Gamma-aminobutyric acid permease~~~COG1113
MNQSQSGLKKELKTRHMTMISIAGVIGAGLFVGSGSVIHSTGPGAVVSYALAGLLVIFIMRMLGEMSAVNPTSGSFSQYA
HDAIGPWAGFTIGWLYWFFWVIVIAIEAIAGAGIIQYWFHDIPLWLTSLILTIVLTLTNVYSVKSFGEFEYWFSLIKVVT
IIAFLIVGFAFIFGFAPGSEPVGFSNLTGKGGFFPEGISSVLLGIVVVIFSFMGTEIVAIAAGETSNPIESVTKATRSVV
WRIIVFYVGSIAIVVALLPWNSANILESPFVAVLEHIGVPAAAQIMNFIVLTAVLSCLNSGLYTTSRMLYSLAERNEAPR
RFMKLSKKGVPVQAIVAGTFFSYIAVVMNYFSPDTVFLFLVNSSGAIALLVYLVIAVSQLKMRKKLEKTNPEALKIKMWL
FPFLTYLTIIAICGILVSMAFIDSMRDELLLTGVITGIVLISYLVFRKRKVSEKAAANPVTQQQPDILP
>P25527 ~~~gabP~~~Gamma-aminobutyric acid permease~~~COG1113
MGQSSQPHELGGGLKSRHVTMLSIAGVIGASLFVGSSVAIAEAGPAVLLAYLFAGLLVVMIMRMLAEMAVATPDTGSFST
YADKAIGRWAGYTIGWLYWWFWVLVIPLEANIAAMILHSWVPGIPIWLFSLVITLALTGSNLLSVKNYGEFEFWLALCKV
IAILAFIFLGAVAISGFYPYAEVSGISRLWDSGGFMPNGFGAVLSAMLITMFSFMGAEIVTIAAAESDTPEKHIVRATNS
VIWRISIFYLCSIFVVVALIPWNMPGLKAVGSYRSVLELLNIPHAKLIMDCVILLSVTSCLNSALYTASRMLYSLSRRGD
APAVMGKINRSKTPYVAVLLSTGAAFLTVVVNYYAPAKVFKFLIDSSGAIALLVYLVIAVSQLRMRKILRAEGSEIRLRM
WLYPWLTWLVIGFITFVLVVMLFRPAQQLEVISTGLLAIGIICTVPIMARWKKLVLWQKTPVHNTR
>P94426 ~~~gabR~~~HTH-type transcriptional regulatory protein GabR~~~COG1167
MDITITLDRSEQADYIYQQIYQKLKKEILSRNLLPHSKVPSKRELAENLKVSVNSVNSAYQQLLAEGYLYAIERKGFFVE
ELDMFSAEEHPPFALPDDLKEIHIDQSDWISFSHMSSDTDHFPIKSWFRCEQKAASRSYRTLGDMSHPQGIYEVRAAITR
LISLTRGVKCRPEQMIIGAGTQVLMQLLTELLPKEAVYAMEEPGYRRMYQLLKNAGKQVKTIMLDEKGMSIAEITRQQPD
VLVTTPSHQFPSGTIMPVSRRIQLLNWAAEEPRRYIIEDDYDSEFTYDVDSIPALQSLDRFQNVIYMGTFSKSLLPGLRI
SYMVLPPELLRAYKQRGYDLQTCSSLTQLTLQEFIESGEYQKHIKKMKQHYKEKRERLITALEAEFSGEVTVKGANAGLH
FVTEFDTRRTEQDILSHAAGLQLEIFGMSRFNLKENKRQTGRPALIIGFARLKEEDIQEGVQRLFKAVYGHKKIPVTGD
>P22256 2.6.1.19~~~gabT~~~4-aminobutyrate aminotransferase GabT~~~COG0160
MNSNKELMQRRSQAIPRGVGQIHPIFADRAENCRVWDVEGREYLDFAGGIAVLNTGHLHPKVVAAVEAQLKKLSHTCFQV
LAYEPYLELCEIMNQKVPGDFAKKTLLVTTGSEAVENAVKIARAATKRSGTIAFSGAYHGRTHYTLALTGKVNPYSAGMG
LMPGHVYRALYPCPLHGISEDDAIASIHRIFKNDAAPEDIAAIVIEPVQGEGGFYASSPAFMQRLRALCDEHGIMLIADE
VQSGAGRTGTLFAMEQMGVAPDLTTFAKSIAGGFPLAGVTGRAEVMDAVAPGGLGGTYAGNPIACVAALEVLKVFEQENL
LQKANDLGQKLKDGLLAIAEKHPEIGDVRGLGAMIAIELFEDGDHNKPDAKLTAEIVARARDKGLILLSCGPYYNVLRIL
VPLTIEDAQIRQGLEIISQCFDEAKQ
>P9WQ79 2.6.1.19~~~gabT~~~4-aminobutyrate aminotransferase~~~COG0160
MASLQQSRRLVTEIPGPASQALTHRRAAAVSSGVGVTLPVFVARAGGGIVEDVDGNRLIDLGSGIAVTTIGNSSPRVVDA
VRTQVAEFTHTCFMVTPYEGYVAVAEQLNRITPGSGPKRSVLFNSGAEAVENAVKIARSYTGKPAVVAFDHAYHGRTNLT
MALTAKSMPYKSGFGPFAPEIYRAPLSYPYRDGLLDKQLATNGELAAARAIGVIDKQVGANNLAALVIEPIQGEGGFIVP
AEGFLPALLDWCRKNHVVFIADEVQTGFARTGAMFACEHEGPDGLEPDLICTAKGIADGLPLSAVTGRAEIMNAPHVGGL
GGTFGGNPVACAAALATIATIESDGLIERARQIERLVTDRLTTLQAVDDRIGDVRGRGAMIAVELVKSGTTEPDAGLTER
LATAAHAAGVIILTCGMFGNIIRLLPPLTIGDELLSEGLDIVCAILADL
>P32967 ~~~gacA~~~Response regulator GacA~~~COG2197
MIRVLVVDDHDLVRTGITRMLADIDGLQVVGQAESGEESLLKARELKPYVVLMDVKMPGIGGLEATRKLLRSHPDIKVVA
VTVCEEDPFPTRLLQAGAAGYLTKGAGLNEMVQAIRLVFAGQRYISPQIAQQLVFKSFQPSSDSPFDALSEREIQIALMI
VGCQKVQIISDKLCLSPKTVNTYRYRIFEKLSISSDVELTLLAVRHGMVDASL
>Q5E5Y7 4.1.1.15~~~gadA~~~Glutamate decarboxylase~~~COG0076
MPLHSKNAVRDDLLDDIYSSADLSLSMPKYKMPEQEHDPRHAYQVIHDELMMDGNSRQNLATFCQTWVEDEVHKLMDECI
DKNMIDKDEYPQTAELESRCVHMLADLWNSPDAENTLGCSTTGSSEAAMLGGMALKWAWREKMKKLGKPTDKPNMICGPV
QVCWHKFARYWDIELREIPMEGDRLIMTPEEVIKRCDENTIGVVPTLGVTFTCQYEPVKAVHEALDKLQEETGLDIPMHI
DAASGGFLAPFCDPDLEWDFRLPRVKSINASGHKFGLSPLGVGWVIWRDASALHEDLIFNVNYLGGNMPTFALNFSRPGG
QIVAQYYNFLRLGKEGYRKIHQACYDTAVYLSSEIEKLGMFEIIYDGKGGIPAMSWSLKEGVDPGFNLFDLSDRIRSRGW
QIAAYAMPPKREDLVIMRILVRHGFSRDQADLLVADLKHCVEFFAKHPISHGSDELESSGFNHG
>P63235 ~~~gadC~~~Glutamate/gamma-aminobutyrate antiporter~~~COG0531
MATSVQTGKAKQLTLLGFFAITASMVMAVYEYPTFATSGFSLVFFLLLGGILWFIPVGLCAAEMATVDGWEEGGVFAWVS
NTLGPRWGFAAISFGYLQIAIGFIPMLYFVLGALSYILKWPALNEDPITKTIAALIILWALALTQFGGTKYTARIAKVGF
FAGILLPAFILIALAAIYLHSGAPVAIEMDSKTFFPDFSKVGTLVVFVAFILSYMGVEASATHVNEMSNPGRDYPLAMLL
LMVAAICLSSVGGLSIAMVIPGNEINLSAGVMQTFTVLMSHVAPEIEWTVRVISALLLLGVLAEIASWIVGPSRGMYVTA
QKNLLPAAFAKMNKNGVPVTLVISQLVITSIALIILTNTGGGNNMSFLIALALTVVIYLCAYFMLFIGYIVLVLKHPDLK
RTFNIPGGKGVKLVVAIVGLLTSIMAFIVSFLPPDNIQGDSTDMYVELLVVSFLVVLALPFILYAVHDRKGKANTGVTLE
PINSQNAPKGHFFLHPRARSPHYIVMNDKKH
>O30417 ~~~gadC~~~Glutamate/gamma-aminobutyrate antiporter~~~COG0531
MNQKKLSLFGFFALTASMVLTVYEYPTFATSKLHLVFFLLLGGLLWFLPVALCAAEMATVEGWKNGGIFSWVSQTLGERF
GFAAIFFQWFQITVGFVTMIYFILGALSYVLNFQALNTDPLIKFIGLLIIFWGLTFSQLGGTQRTAKLVKAGFVVGIVIP
SVILFGLAAAYFIGGNPIEIPINSHAFVPDFSQVSTLVVFVSFILAYMGVEASASHINELENPKRNYPLAMILLVILAIS
LDAIGGFSVAAVIPQKELSLSAGVIQTFQTLILHFNHHLGWLVKVIALMIAFGVMGEVSSWVVGPSRGMFAAAQRGLLPK
FLRKTNTHEVPVPLVMIQGIIVTLWGAVLTFGGGGNNLSFLVAISLTVVIYLVGYLLFFIVYFVLIYKKQNLKRTYNVPG
KIIGKTIIAGIGFLLSIFALFISFVPPASIAKNETHTYQMILLISFVVTAILPFIIYELHDKKGHDTIEEPTHFKAGDVN
PAIYPAARGEHHIIKKEEHILKH
>P63204 ~~~gadE~~~Transcriptional regulator GadE~~~COG2771
MIFLMTKDSFLLQGFWQLKDNHEMIKINSLSEIKKVGNKPFKVIIDTYHNHILDEEAIKFLEKLDAERIIVLAPYHISKL
KAKAPIYFVSRKESIKNLLEITYGKHLPHKNSQLCFSHNQFKIMQLILKNKNESNITSTLNISQQTLKIQKFNIMYKLKL
RRMSDIVTLGITSYF
>O34214 1.1.99.3~~~~~~Gluconate 2-dehydrogenase flavoprotein~~~
MERGERVSVPVSGYSRGEGVTVANELKKVDAVVVGFGWAGAIMAKELTEAGLNVVALERGPHRDTYPDGAYPQSIDELTY
NIRKKLFQDLSKSTVTIRHDASQTAVPYRQLAAFLPGTGTGGAGLHWSGVHFRVDPVELNLRSHYEARYGKNFIPEGMTI
QDFGVSYNELEPFFDQAEKVFGTSGSAWTIKGKMIGKEKGGNFYAPDRSSDFPLPAQKRTYSAQLFAQAAESVGYHPYDM
PSANTSGPYTNTYGAQMGPCNFCGYCSGYACYMYSKASPNVNILPALRQEPKFELRNNAYVLRVNLTGDKKRATGVTYLD
GQGREVVQPADLVILSAFQFHNVHLMLLSGIGQPYNPITNEGVVGRNFAYQNISTLKALFDKNTTTNPFIGAGGAGVAVD
DFNADNFDHGPYGFVGGSPFWVNQAGTKPVSGLPTPKGTPNWGSQWKAAVADTYNHHISMDAHGAHQSYRANYLDLDPNY
KNVYGQPLLRMTFDWQDNDIRMAQFMVGKMRKITEAMNPKMIIGGAKGPGTHFDTTVYQTTHMSGGAIMGEDPKTSAVNR
YLQSWDVPNVFVPGASAFPQGLGYNPTGMVAALTYWSAKAIREQYLKNPGPLVQA
>O34215 1.1.99.3~~~~~~Gluconate 2-dehydrogenase cytochrome c subunit~~~
MMKSILALVLGTLSFAALADDQANDALVKRGEYLARAGDCVACHSVKGGQPFAGGLPMATPIGTIYSTNITPDKTTGIGD
YSYDDFQKAVRHGVAKNGDTLYPAMPYPSYAVVSDEDMKALYAYFMHGVAPVAQANKDSDIPWPLSMRWPLAIWRGVFAP
DVKAFQPAAQEDPVLARGRYLVEGLGHCGACHTPRSITMQEKALSNDGAHDYLSGSSAPIDGWTASNLRGDNRDGLGRWS
EDDLRQFLRYGRNDHTAAFGGMTDVVEHSLQHLSDDDITAIARYLKSLGAKDASQTVFTQDDQVAKALWKGDDSQTGASV
YVDSCAACHKTDGSRLSALLPGAAWQPGGAGEPDPTSLIHIVLTGGTLPGVQGAPTAITMPAFGWRLNDQQVADVVNFIR
GSWGNGAKATVTAKDVASLRKDETVQAHQGNADIKVLEQQQ
>O34213 1.1.99.3~~~~~~Gluconate 2-dehydrogenase subunit 3~~~
MSEHKNGHTRRDFLLRTITLAPAMAVGSTAMGALVAPMAAGAAEQSSGSQTARDYQPTWFTAEEFAFITAAVARLIPNDE
RGPGALEAGVPEFIDRQMNTPYALGSNWYMQGPFNPDLPKELGYQLPLVPQQIYRLGLADADSWSKHQHGKVFAELSGDQ
QDALLSDFESGKAEFTQLPAKTFFSFLLQNTREGYFTRSDPRWQSGHGGLEADWLPRRTR
>P63201 ~~~gadW~~~HTH-type transcriptional regulator GadW~~~COG2207
MTHVCSVILIRRSFDIYHEQQKISLHNESILLLEKNLADDFAFCSPDTRRLDIDELTVCHYLQNIRQLPRNLGLHSKDRL
LINQSPPMPLVTAIFDSFNESGVNSPILSNMLYLSCLSMFSHKKELIPLLFNSISTVSGKVERLISFDIAKRWYLRDIAE
RMYTSESLIKKKLQDENTCFSKILLASRMSMARRLLELRQIPLHTIAEKCGYSSTSYFINTFRQYYGVTPHQFAQHSPGT
FS
>P37639 ~~~gadX~~~HTH-type transcriptional regulator GadX~~~COG2207
MQSLHGNCLIAYARHKYILTMVNGEYRYFNGGDLVFADASQIRVDKCVENFVFVSRDTLSLFLPMLKEEALNLHAHKKVS
SLLVHHCSRDIPVFQEVAQLSQNKNLRYAEMLRKRALIFALLSVFLEDEHFIPLLLNVLQPNMRTRVCTVINNNIAHEWT
LARIASELLMSPSLLKKKLREEETSYSQLLTECRMQRALQLIVIHGFSIKRVAVSCGYHSVSYFIYVFRNYYGMTPTEYQ
ERSAQRLSNRDSAASIVAQGNFYGTDRSAEGIRL
>A9KQ75 2.4.1.211~~~~~~1,3-beta-galactosyl-N-acetylhexosamine phosphorylase Cphy3030~~~COG5426
MSEKLTGRVTVPTDVDMIQETKEIAERWGADALRDCDGTDMPDELKKMPAKIYSTYYTTRKDNAWANANPDEVQQVYLMT
EFYTAMSQGELRIPLMKHLYKDQLKPNTIHDIKRWWEVVDRTTGEPLVLDAWEYDENNQEVIILNPDHFHDYTVSFLAFI
IWDPVHMYNFITNDWQDVEHQITYDVRQPKTQKYVIEKLKRWMKENPDSDVVRFTTFFHQFTLVFNEFAKEKFVDWFGYS
ASVSPYILEQFEKEVGYKFRPEYIIDQGYHNNTNRVPSKEFRDFQEFQQREVAKLMKVLVDICHDNDKEAMMFLGDHWIG
TEPFGEYFKHVGLDAVVGSVGNGTTLRLISDIPGVKYTEGRFLPYFFPDVFHEGGDPIKEAKVNWVTARRAILRKPVDRI
GYGGYLKLALDFPEFIQYIEEVCDEFRLLYDNMGGQSPYSHFKVGVLNSWGKIRSWGTHMVAHAIDYKQTYSYAGVLEAL
SGMPFDVEFISFEDVIKNPVILNECGVVINVGDAYTGPSGGAYWTNEKVSSAVKAFVAQGGGFIGVGEPSACEHQGRYFT
LANVLGVNKEIGFSMSTDKYNWDEHSHFITEDSNESINFGEGMKNIYALDGAQILRKDGQDVQMAVNQFGDGRSVYISGI
PYSFENSRMLYRAIFWAAGMEQEMKKWYSSNYNVEVNYYPATKKYCIVNNTYEPQETMIYDGLGREYSMKLKANDILWFT
FLED
>A9KIW5 2.4.1.211~~~~~~1,3-beta-galactosyl-N-acetylhexosamine phosphorylase Cphy0577~~~COG5426
MKKDTMLAGRVTIPTDVDVVPETMELLNRWGADAIRDCDGTDYPEELKAVQAKVYSTYYTTRKDNAWAKAHPEEVQQCYI
MTSFYTATETTLRIPLLKGIAKELMMVNNYDDKVRWWEVIDRSTAMVVSTDTWSYDKETGEVIITNCEPFHNYTVSFLAY
LIWDPVHMYNAVVNGWQGVEHQITFDVRQPKTREYSMVRLRKFIEEHPYVDVIRYTTFFHQFTLVFDEMMREKYVDWYGY
SASVSPYILEQFEKEVGYRFRPEFIIDQGYYNNQYRIPSKEFKDFQAFQRREVAKLAKEMVDITHEYGKEAMMFLGDHWI
GTEPFMEEFKTIGLDAVVGSVGNGSTLRLISDIPGVKYTEGRFLPYFFPDTFHEGGDPVKEAKVNWVTARRAILRKPIDR
IGYGGYLKLACQFPEFIDYVESVCNEFRELYENIKGTTPFCIKRVAVLNSWGKMRAWGAHMVHHALYQKQNYSYAGVIES
LSGTPFEVSFISFDDIKKDKNILKNIDVIINVGDGDTAHTGGLVWEDADISSAIHQFVYEGGGLIGIGEPTGHQYQGRYI
QLANVFGIEKETGFTLNYDKYNWDAVESHFITEDCTKEVDFGEGKKNMYALEGTTILVQMEKEVQMAVNEFGKGRSVYLS
GLPYSFENSRVLYRSILWSAHEEENLHKWYSSNFNVEVHAYVKNNKYCVVNNTYEPQNTTIYRGDSSSFDLELEANEIIW
YEI
>J8H9C1 3.1.-.-~~~gajA~~~Endonuclease GajA~~~
MKFSNITIKNFRNFEKVNINLDNKNVIFGMNDIGKTNFLYALRFLLDKEIRKFGFNKSDYHKHDTSKKIEIILTLDLSNY
EKDEDTKKLISVVKGARTSANADVFYIALESKYDDKELYGNIILKWGSELDNLIDIPGRGNINALDNVFKVIYINPLVDL
DKLFAQNKKYIFEESQGNESDEGILNNIKSLTDQVNQQIGEMTIIKGFQQEITSEYRSLKKEEVSIELKSEMAIKGFFSD
IIPYIKKDGDSNYYPTSGDGRRKMLSYSIYNYLAKKKYEDKIVIYLIEEPEISLHRSMQIALSKQLFEQSTYKYFFLSTH
SPELLYEMDNTRLIRVHSTEKVVCSSHMYNVEEAYGSVKKKLNKALSSALFAERVLLIEGPSEKILFEKVLDEVEPEYEL
NGGFLLEVGGTYFNHYVCTLNDLGITHIIKTDNDLKSKKGKKGVYELLGLNRCLNLLGRENLDEITIDIPEDIKGKKKKE
RLNERKKEIFKQYKNEVGEFLGERIYLSEIDLENDLYSAIGESMKRIFENEDPVHYLQKSKLFNMVELVNNLSTKDCFDV
FEHEKFACLKELVGSDRG
>J8HQ06 ~~~gajB~~~Gabija protein GajB~~~
MSREQIIKDGGNILVTAGAGSGKTTILVSKIEADLKENKTHYSIAAVTFTNKAAKEIEGRLGYSSRGNFIGTNDGFVESE
IIRPFIKDAFGNDYPDNFTAEYFDNQFASYDKGLQVLKYQNILGTYSNPKKNFKFQLALDILKKSLVARQYIFSKYFKIF
IDEYQDSDKDMHNLFMYLKDQLKIKLFIVGDPKQSIYIWRGAEPENFNGLIENSTDFNKYHLTSNFRCCQDIQNYSNLFN
EETRSLIKEKNEVQNVISIADDMPISDILLKLTEEKQVLNIEAELVILVRRRNQAIEIMKELNEEGFNFIFIPQTPLDRA
TPNATLLKEVIKYVKNDRYSIYDLAAEIVGNLSSREIKEIQKIINELLVPNINQVLINQVLINLFAKLEITLDTREITAF
TEVMMTNEFDIAFDTNEYLHKIFTVHSAKGLEFNQVIITASDYNVHYNRDTNEHYVATTRAKDKLIVIMDNKKYSDYIET
LMKELKIKNIIKSI
>P0A6T3 2.7.1.6~~~galK~~~Galactokinase~~~COG0153
MSLKEKTQSLFANAFGYPATHTIQAPGRVNLIGEHTDYNDGFVLPCAIDYQTVISCAPRDDRKVRVMAADYENQLDEFSL
DAPIVAHENYQWANYVRGVVKHLQLRNNSFGGVDMVISGNVPQGAGLSSSASLEVAVGTVLQQLYHLPLDGAQIALNGQE
AENQFVGCNCGIMDQLISALGKKDHALLIDCRSLGTKAVSMPKGVAVVIINSNFKRTLVGSEYNTRREQCETGARFFQQP
ALRDVTIEEFNAVAHELDPIVAKRVRHILTENARTVEAASALEQGDLKRMGELMAESHASMRDDFEITVPQIDTLVEIVK
AVIGDKGGVRMTGGGFGGCIVALIPEELVPAVQQAVAEQYEAKTGIKETFYVCKPSQGAGQC
>Q9R7D7 2.7.1.6~~~galK~~~Galactokinase~~~COG0153
MSIVVENSTVLSALTEKFAEVFGDTKEVEYFFSPGRINLIGEHTDYNGGYVFPASITIGTTGLARLREDKKVKLYSENFP
KLGVIEFDLDEVEKKDGELWSNYVKGMIVMLKGAGYEIDKGFELLIKGEIPTASGLSSSASLELLVGVVLDDLFNLNVPR
LELVQLGQKTENDYIGVNSGILDQFAIGFGEVKKAILLDCNTLKYEMVPVELRDYDIVIMNTNKPRALTESKYNERFAET
REALKRMQTRLDIQSLGELSNEEFDANTDLIGDETLIKRARHAVYENNRTKIAQKAFVAGNLTKFGELLNASHASLKDDY
EVTGLELDTLAETAQKQAGVLGARMTGAGFGGCAIALVAHDNVSAFEKAVGQVYEEVVGYPASFYVAQIGSGSTKLDVE
>P09148 2.7.7.12~~~galT~~~Galactose-1-phosphate uridylyltransferase~~~COG1085
MTQFNPVDHPHRRYNPLTGQWILVSPHRAKRPWQGAQETPAKQVLPAHDPDCFLCAGNVRVTGDKNPDYTGTYVFTNDFA
ALMSDTPDAPESHDPLMRCQSARGTSRVICFSPDHSKTLPELSVAALTEIVKTWQEQTAELGKTYPWVQVFENKGAAMGC
SNPHPHGQIWANSFLPNEAEREDRLQKEYFAEQKSPMLVDYVQRELADGSRTVVETEHWLAVVPYWAAWPFETLLLPKAH
VLRITDLTDAQRSDLALALKKLTSRYDNLFQCSFPYSMGWHGAPFNGEENQHWQLHAHFYPPLLRSATVRKFMVGYEMLA
ETQRDLTAEQAAERLRAVSDIHFRESGV
>Q88JX5 1.13.11.57~~~galA~~~Gallate dioxygenase~~~COG3384
MADEGGNPRDLPPVGGHAALSRHIGQSLMADEFDMSFFRDKPLDHGFFSPMSALLPCDESWPVQIVPLQVGVLQLPIPTA
RRCYKLGQALRRAIESYPEDLKVAIVATGGVSHQVHGERCGFNNPEWDAQFLDLLVNDPQRLTEMTLAEYATLGGMEGAE
VITWLIMRGTLSANVERKHQSYYLPSMTGIATLLLENRDQALPAPVNERHRQHMQHQLAGAEQLEGTYPYTLERSAKGYR
LNKFLHRMIEPQWRQRFLSEPEALYREAGLSEEESDLLRRRDWRGLIHYGVIFFVLEKLGAVLGVSNLDIYAAMRGQSIE
DFMKTRNQQVRYSVAGKAPN
>Q88JX8 4.2.1.83~~~galB~~~4-oxalmesaconate hydratase~~~COG2120
MTSCAHPHCRSQRNMNTPQKSALVVSAHSADFVWRAGGAIALHAEQGYAMHVVCLSFGERGESAKLWRKGEMTEAKVKDA
RREEAMAAAEILGASVEFFDIGDYPMRADKDTLFRLADVYRRVQPEFVLSHSLKDPYNYDHPLAMHLAQEARIIAQAEGY
KPGEKIVGAPPVYAFEPHQPEQCEWRPDTFLDITSVWDKKYAAIQCMAGQEHLWEYYTRVALQRGVQAKRNVGITSARNI
VYAEGLQSVFPRVTENLA
>Q88JX9 4.1.3.17~~~galC~~~4-carboxy-4-hydroxy-2-oxoadipic acid aldolase~~~COG0684
MSGLIGKTGIVVRNIPRVEPHMIDALGRLGVATVHEAQGRKGLLNTAVRPIQQGVAVAGSAVTVLVAPGDNWMFHVAVEQ
CRPGDVLVVAPSSPCSDGYFGDLLATSLQARGVLGLVIDAGVRDSQTLRDMGFAVWSRAINAQGTVKEVLGSVNLPLLCA
GQLVNAGDIVVADDDGVVVVRHGEAQAVLEAATQRADLEERKRLRLAAGELGLDIYEMRPRLAAKGLRYVDHLTDLEG
>Q88JY0 5.3.2.8~~~galD~~~4-oxalomesaconate tautomerase~~~COG2828
MGQTRIPCLLMRGGTSKGAYFLHDDLPAPGPLRDRVLLAVMGSPDARQIDGIGGADSLTSKVAIIRASQRDDADVDYLFA
QVVVDEARVDYGQNCGNILAGVGPFALERGLVAASGASTPVRIFMENTGQIAVAQVPTADGQVEYAGDTRIDGVPGRAAA
LVVTFADVAGASCGALLPTGNSRDCVEGVEVTCIDNGMPVVLLCAEDLGVTGYEPCETLEADSALKTRLEAIRLQLGPRM
NLGDVSQRNVPKMCLLSAPRNGGTVNTRSFIPHRCHASIGVFGAVSVATACLIEGSVAQGLASTSGGDRQRLAVEHPSGE
FTVEISLEHGVIKGCGLVRTARLLFDGVVCIGRDTWGGPEK
>E8MF10 5.1.3.2~~~lnpD~~~UDP-glucose 4-epimerase~~~
MTTVLVTGGAGFIATHTDIELLNKGYDVISVDNYGNSSPVALERVEQITGKPVKRYDGDVRDEALMERVFAENNIDWVIH
FAGLKAVGESVAKPIEYYDNNLYSTLVLLKVMKKHNVKKIIFSSSATVYGTPKELPITEETPTGGTTNPYGTSKLFQEQI
LRDVHVADPSWTIVLLRYFNPVGAHESGLLGEDPKGIPANLTPYVAKVAVGELKEVQVYGDDYDTPDGTGVRDYIHVVDL
AKGHVAVIDHIDKEGVFVYNLGTGHGYSVLEVIKAYEKAAGHPIPYAIKPRRPGDIAACYADASKAEKELGWKAELTIDD
MAASSLNWQTKNPNGFRDAE
>P09147 5.1.3.2~~~galE~~~UDP-glucose 4-epimerase~~~COG1087
MRVLVTGGSGYIGSHTCVQLLQNGHDVIILDNLCNSKRSVLPVIERLGGKHPTFVEGDIRNEALMTEILHDHAIDTVIHF
AGLKAVGESVQKPLEYYDNNVNGTLRLISAMRAANVKNFIFSSSATVYGDQPKIPYVESFPTGTPQSPYGKSKLMVEQIL
TDLQKAQPDWSIALLRYFNPVGAHPSGDMGEDPQGIPNNLMPYIAQVAVGRRDSLAIFGNDYPTEDGTGVRDYIHVMDLA
DGHVVAMEKLANKPGVHIYNLGAGVGNSVLDVVNAFSKACGKPVNYHFAPRREGDLPAYWADASKADRELNWRVTRTLDE
MAQDTWHWQSRHPQGYPD
>Q7WTB1 5.1.3.2~~~galE~~~UDP-glucose 4-epimerase~~~COG1087
MKVLVIGGAGYIGSHAVRELVKEGNDVLVLDALYTGHRKAVDPKAKFYQGDIEDTFLVSKILRDEKIDAVMHFAAYSLVP
ESVKKPLKYYDNNVTGMISLLQAMNDANVKYLVFSSSAATYGIPKKLPITEDTPLNPINPYGETKMMMEKIMAWADKADG
IKYTALRYFNVAGASSDGSIGEDHAPETHLIPNILKSAISGDGKFTIFGDDYDTKDGTNVRDYVQVEDLIDAHILALKHM
MKTNKSDVFNLGTAHGYSNLEILESAKKVTGIDIPYTMGPRRGGDPDSLVADSTKARTVLGWKPKHENVDDVIATAWKWH
KSHPKGYEDK
>A0R5C5 5.1.3.2~~~~~~UDP-glucose 4-epimerase~~~COG0451
MRTLVTGAAGFIGSTLVDRLLADGHGVVGLDDLSSGRAENLHSAENSDKFEFVKADIVDADLTGLLAEFKPEVIFHLAAQ
ISVKRSVDDPPFDATVNVVGTVRLAEAARLAGVRKVVHTSSGGSVYGTPPAYPTSEDMPVNPASPYAAGKVAGEVYLNMY
RNLYDLDCSHIAPANVYGPRQDPHGEAGVVAIFSEALLAGRTTKIFGDGSDTRDYVFVDDVVDAFVRAGGPAGGGQRFNV
GTGVETSTRELHTAIAGAVGAPDEPEFHPPRLGDLRRSRLDNTRAREVLGWQPQVALAEGIAKTVEFFRNKSQ
>P9WN67 5.1.3.2~~~galE1~~~UDP-glucose 4-epimerase~~~COG0451
MRALVTGAAGFIGSTLVDRLLADGHSVVGLDNFATGRATNLEHLADNSAHVFVEADIVTADLHAILEQHRPEVVFHLAAQ
IDVRRSVADPQFDAAVNVIGTVRLAEAARQTGVRKIVHTSSGGSIYGTPPEYPTPETAPTDPASPYAAGKVAGEIYLNTF
RHLYGLDCSHIAPANVYGPRQDPHGEAGVVAIFAQALLSGKPTRVFGDGTNTRDYVFVDDVVDAFVRVSADVGGGLRFNI
GTGKETSDRQLHSAVAAAVGGPDDPEFHPPRLGDLKRSCLDIGLAERVLGWRPQIELADGVRRTVEYFRHKHTD
>P0AAB6 2.7.7.9~~~galF~~~UTP--glucose-1-phosphate uridylyltransferase~~~COG1210
MTNLKAVIPVAGLGMHMLPATKAIPKEMLPIVDKPMIQYIVDEIVAAGIKEILLVTHASKNAVENHFDTSYELESLLEQR
VKRQLLAEVQSICPPGVTIMNVRQGEPLGLGHSILCARPAIGDNPFVVVLPDVVIDDASADPLRYNLAAMIARFNETGRS
QVLAKRMPGDLSEYSVIQTKEPLDREGKVSRIVEFIEKPDQPQTLDSDIMAVGRYVLSADIWPELERTQPGAWGRIQLTD
AIAELAKKQSVDAMLMTGDSYDCGKKMGYMQAFVKYGLRNLKEGAKFRKGIEKLLSE
>P05149 5.1.3.3~~~mro~~~Aldose 1-epimerase~~~
MKKLAILGVTVYSFAQLANAATLNVKSYGTTQNGQKVDLYTMSNNNGVSVSFISFGGVITQILTPDAQGKQNNIVLGFDD
LKGYEVTDTKEGIHFGGLIGRYANRIGNAKFSLDGKTYNLEKNNGPNSLHSGNPGFDKRVWQVKPLVSKGETVKASLKLT
SPNGDQGFPGKLDVEVIYSLSDQNEFKIEYKAKTDQPTVVNLTNHSYFNLSGAGNNPYGVLDHVVQLNAGRILVTDQNSL
PTGEIASVAGTPFDFRMPKAIVKDIRANNQQLAYGYGYDQTWVINQKSQGKLNLAAIVVDPKSKRTMQVLTTEPSVQMYT
ADHLLGNIVGANGVLYRQADALALETQHFPDSPNQPTFPSTRLNPNQTYNSVTVFKFGVQK
>P39840 5.1.3.3~~~galM~~~Aldose 1-epimerase~~~COG2017
MANFIEKITYLGTPAIKAGNEHLEMIVVPEWGSNVISLVDKTTNVQLLREPETAESFHDTPTLYGIPILFPPNRISDGTF
SFRGRTYHFDINEKDKHNHLHGFLYHEKWNVVTTKQTDEGVIVETEIDLSELPHVQKQFPHHAVVRMTYTIKENTLFKHA
TVMNKGKEAFPWGIGYHTTFIFPAESSLFSLTADQQWELDERLLPTGKLMDVPYKEALHEGMDLRHKQLDDVFLSSYQKR
GGENQAVIYHQHAHISIIYKADEQFKHWVVYNADGKQGYLCPEPYTWVTNAVNLDLPSSLTGLQVLEPGEETTAKSSITI
ELNHQ
>P0A9C3 5.1.3.3~~~galM~~~Aldose 1-epimerase~~~COG2017
MLNETPALAPDGQPYRLLTLRNNAGMVVTLMDWGATLLSARIPLSDGSVREALLGCASPECYQDQAAFLGASIGRYANRI
ANSRYTFDGETVTLSPSQGVNQLHGGPEGFDKRRWQIVNQNDRQVLFALSSDDGDQGFPGNLGATVQYRLTDDNRISITY
RATVDKPCPVNMTNHVYFNLDGEQSDVRNHKLQILADEYLPVDEGGIPHDGLKSVAGTSFDFRSAKIIASEFLADDDQRK
VKGYDHAFLLQAKGDGKKVAAHVWSADEKLQLKVYTTAPALQFYSGNFLGGTPSRGTEPYADWQGLALESEFLPDSPNHP
EWPQPDCFLRPGEEYSSLTEYQFIAE
>P0AEP1 ~~~galP~~~Galactose-proton symporter~~~COG2814
MPDAKKQGRSNKAMTFFVCFLAALAGLLFGLDIGVIAGALPFIADEFQITSHTQEWVVSSMMFGAAVGAVGSGWLSFKLG
RKKSLMIGAILFVAGSLFSAAAPNVEVLILSRVLLGLAVGVASYTAPLYLSEIAPEKIRGSMISMYQLMITIGILGAYLS
DTAFSYTGAWRWMLGVIIIPAILLLIGVFFLPDSPRWFAAKRRFVDAERVLLRLRDTSAEAKRELDEIRESLQVKQSGWA
LFKENSNFRRAVFLGVLLQVMQQFTGMNVIMYYAPKIFELAGYTNTTEQMWGTVIVGLTNVLATFIAIGLVDRWGRKPTL
TLGFLVMAAGMGVLGTMMHIGIHSPSAQYFAIAMLLMFIVGFAMSAGPLIWVLCSEIQPLKGRDFGITCSTATNWIANMI
VGATFLTMLNTLGNANTFWVYAALNVLFILLTLWLVPETKHVSLEHIERNLMKGRKLREIGAHD
>Q8EMJ9 4.2.1.158~~~~~~Galactarate dehydratase (D-threo-forming)~~~COG4948
MKITDLELHAVGIPRHTGFVNKHVIVKIHTDEGLTGIGEMSDFSHLPLYSVDLHDLKQGLLSILLGQNPFDLMKINKELT
DNFPETMYYYEKGSFIRNGIDNALHDLCAKYLDISVSDFLGGRVKEKIKVCYPIFRHRFSEEVESNLDVVRQKLEQGFDV
FRLYVGKNLDADEEFLSRVKEEFGSRVRIKSYDFSHLLNWKDAHRAIKRLTKYDLGLEMIESPAPRNDFDGLYQLRLKTD
YPISEHVWSFKQQQEMIKKDAIDIFNISPVFIGGLTSAKKAAYAAEVASKDVVLGTTQELSVGTAAMAHLGCSLTNINHT
SDPTGPELYVGDVVKNRVTYKDGYLYAPDRSVKGLGIELDESLLAKYQVPDLSWDNVTVHQLQDRTADTKS
>P03024 ~~~galR~~~HTH-type transcriptional regulator GalR~~~COG1609
MATIKDVARLAGVSVATVSRVINNSPKASEASRLAVHSAMESLSYHPNANARALAQQTTETVGLVVGDVSDPFFGAMVKA
VEQVAYHTGNFLLIGNGYHNEQKERQAIEQLIRHRCAALVVHAKMIPDADLASLMKQMPGMVLINRILPGFENRCIALDD
RYGAWLATRHLIQQGHTRIGYLCSNHSISDAEDRLQGYYDALAESGIAANDRLVTFGEPDESGGEQAMTELLGRGRNFTA
VACYNDSMAAGAMGVLNDNGIDVPGEISLIGFDDVLVSRYVRPRLTTVRYPIVTMATQAAELALALADNRPLPEITNVFS
PTLVRRHSVSTPSLEASHHATSD
>Q8A2H2 3.1.6.-~~~~~~N-acetylgalactosamine-6-O-sulfatase~~~COG3119
MKNVSRLLPLLPGIALLTGCNQKVQKDNGQNSQKPNIIYIFADDLGIGDLSCYGATKVSTPHIDRLAGQGVQFTNAYATS
ATSTPSRFGLLTGMYPWRQENTGIAPGNSELIIDTACVTMADMLKEAGYATGVVGKWHLGLGPKGGTDFNGHITPNAQSI
GFDYEFVIPATVDRVPCVFVENGHVVGLDPNDPITVNYEHKVGDWPTGEENPELVKLKPSQGHNNTIINGIPRIGWMTGG
KSALWKDEDIADIITNKAKSFIVSHKEEPFFLYMGTQDVHVPRVPHPRFAGKSGLGTRGDVILQLDWTIGEIMNTLDSLQ
LTDNTILIFTSDNGPVIDDGYQDQAFERLNGHTPMGIYRGGKYSAYEAGTRIPFIVRWPAKVKPNKQQALFSQIDIFASL
AALLKQPLPEDAAPDSQEHLNTLLGKDYTSREYIVQQNLNNTLAIVKGQWKYIEPSDAPAIEYWTKMELGNDRHPQLYDL
SADPSEKNNVAKQHPEVVRELSELLESVKTR
>P25748 ~~~galS~~~HTH-type transcriptional regulator GalS~~~COG1609
MITIRDVARQAGVSVATVSRVLNNSTLVSADTREAVMKAVSELDYRPNANAQALATQVSDTIGVVVMDVSDAFFGALVKA
VDLVAQQHQKYVLIGNSYHEAEKERHAIEVLIRQRCNALIVHSKALSDDELAQFMDNIPGMVLINRVVPGYAHRCVCLDN
LSGARMATRMLLNNGHQRIGYLSSSHGIEDDAMRKAGWMSALKEQDIIPPESWIGAGTPDMPGGEAAMVELLGRNLQLTA
VFAYNDNMAAGALTALKDNGIAIPLHLSIIGFDDIPIARYTDPQLTTVRYPIASMAKLATELALQGAAGNIDPRASHCFM
PTLVRRHSVATRQNAAAITNSTNQAM
>E8MF11 2.7.7.12~~~galT~~~Galactose-1-phosphate uridylyltransferase~~~
MNDQLTEVYASIDALIDYALAHLDLDPRNADWTRNQIFALFRLDSYPGPKTTTSAASVSDVVQDIVGSRSQAPYGEKTPD
PLLAAFRAAATTAGLFKPEEGPAYADTIMGILSANPADLDDRFLLVEHRDGGMAAMQWFYDYCVANNYVKRAQLDRNPRF
DSHGLTVTINLAKPEFKNMKKAAAGNAVAGGYPKCTICHENEGFAGRDKRTLRTLPVTLGGESWFWQFSPYGYFDQHGIC
VNTDHTPMHVDRDTFGHLLDFVDRFPGYFLGCNAALPRIGGSVLAHDHYQGGGELLPMHKAATWAAFTLADYPDAVVEIL
DWPGTAVRVVSKSRQSIIDVSDIIREAWVGYDDAANGIASHDADGNRQSALSPSAIITERGYEMSLIFRNNAISDEYPEG
IFHAHPEYWPVKQEPIGLIEAQGLFILPGRLVDQLGIVEEALAEGRDLPDEVSEFSLEWGELAETLAGNHDREAIRQAVH
DELGSVCYRILGNTAVFKQKATTQTFLESLGFAAR
>P0AEP3 2.7.7.9~~~galU~~~UTP--glucose-1-phosphate uridylyltransferase~~~COG1210
MAAINTKVKKAVIPVAGLGTRMLPATKAIPKEMLPLVDKPLIQYVVNECIAAGITEIVLVTHSSKNSIENHFDTSFELEA
MLEKRVKRQLLDEVQSICPPHVTIMQVRQGLAKGLGHAVLCAHPVVGDEPVAVILPDVILDEYESDLSQDNLAEMIRRFD
ETGHSQIMVEPVADVTAYGVVDCKGVELAPGESVPMVGVVEKPKADVAPSNLAIVGRYVLSADIWPLLAKTPPGAGDEIQ
LTDAIDMLIEKETVEAYHMKGKSHDCGNKLGYMQAFVEYGIRHNTLGTEFKAWLEEEMGIKK
>P0AEP6 2.7.7.9~~~galU~~~UTP--glucose-1-phosphate uridylyltransferase~~~
MAAINTKVKKAVIPVAGLGTRMLPATKAIPKEMLPLVDKPLIQYVVNECIAAGITEIVLVTHSSKNSIENHFDTSFELEA
MLEKRVKRQLLDEVQSICPPHVTIMQVRQGLAKGLGHAVLCAHPVVGDEPVAVILPDVILDEYESDLSQDNLAEMIRRFD
ETGHSQIMVEPVADVTAYGVVDCKGVELAPGESVPMVGVVEKPKADVAPSNLAIVGRYVLSADIWPLLAKTPPGAGDEIQ
LTDAIDMLIEKETVEAYHMKGKSHDCGNKLGYMQAFVEYGIRHNTLGTEFKAWLEEEMGIKK
>P11886 1.1.1.48~~~gal~~~Galactose 1-dehydrogenase~~~
MQPIRLGLVGYGKIAQDQHVPAINANPAFTLVSVATQGKPCPGVENFQSLGELLENGPPVDAIAFCTPPQGRFALVQQAL
AAGKHVLVEKPPCATLGKAALWIKREQASAPCSPCIAYAPAIAAARDWLATRTLQSVQIDWKEDVRKWHPGQAWIWQPGL
GVFDPGINALSIVTHLLPLPLFVESAELRVPSNCQSPIAASIKMSDPRLLDVRAEFDFDHGHDELWSIQIRCAEGTLRLD
NGGALLSIDGVRQTVAEEGEYAAVYRHFQQLIGDKTSDVDVQPLRLVADSFFVGSRVSVEAFYD
>O31458 3.5.99.6~~~gamA~~~Probable glucosamine-6-phosphate deaminase 2~~~COG0363
MKILIAEHYEELCKLSAAIIKEQIQAKKDAVLGLATGSTPVGLYKQLISDYQAGEIDFSKVTTFNLDEYAGLSPSHPQSY
NHFMHEHLFQHINMQPDHIHIPQGDNPQLEAACKVYEDLIRQAGGIDVQILGIGANGHIGFNEPGSDFEDRTRVVKLSES
TIQANARFFGGDPVLVPRLAISMGIKTIMEFSKHIVLLASGEEKADAIQKMAEGPVTTDVPASILQKHNHVTVIADYKAA
QKLKSASFS
>O31459 ~~~gamR~~~HTH-type transcriptional repressor GamR~~~COG2188
MTALYSVIKFKIIELIKSGKYQANDQLPTESEFCEQYDVSRTTVRLALQQLELEGYIKRIQGKGTFVSAAKIQTPIPHKI
TSFAEQMRGLRSESKVLELVVIPADHSIAELLKMKENEPVNKLVRVRYAEGEPLQYHTSYIPWKAAPGLAQEECTGSLFE
LLRTKYNIEISRGTESIEPILTDETISGHLLTNVGAPAFLSESLTYDKNEEVVEYAQIITRGDRTKFTVEQSYHS
>P48841 3.2.1.89~~~ganB~~~Arabinogalactan endo-beta-1,4-galactanase~~~COG3867
MKKKILAATAILLAAIANTGVADNTPFYVGADLSYVNEMESCGATYRDQGKKVDPFQLFADKGADLVRVRLWHNATWTKY
SDLKDVSKTLKRAKNAGMKTLLDFHYSDTWTDPEKQFIPKAWAHITDTKELAKALYDYTTDTLASLDQQQLLPNLVQVGN
ETNIEILQAEDTLVHGIPNWQRNATLLNSGVNAVRDYSKKTGKPIQVVLHIAQPENALWWFKQAKENGVIDYDVIGLSYY
PQWSEYSLPQLPDAIAELQNTYHKPVMIVETAYPWTLHNFDQAGNVLGEKAVQPEFPASPRGQLTYLLTLTQLVKSAGGM
GVIYWEPAWVSTRCRTLWGKGSHWENASFFDATRKNNALPAFLFFKADYQASAQAE
>Q65CX5 3.2.1.-~~~ganB~~~Endo-beta-1,4-galactanase~~~COG3867
MKNVLAVFVVLIFVLGAFGTSGPAEAARDSGTAKSGLYVEKVSGLRKDFIKGVDVSSIIALEESGVAFYNESGKKQDIFK
TLKEAGVNYVRVRIWNDPYDANGNGYGGGNNDLEKAIQIGKRATANGMKLLADFHYSDFWADPAKQKAPKAWANLNFEDK
KTALYQYTKQSLKAMKAAGIDIGMVQVGNETNGGLAGETDWAKMSQLFNAGSQAVRETDSNILVALHFTNPETSGRYAWI
AETLHRHHVDYDVFASSYYPFWHGTLKNLTSVLTSVADTYGKKVMVAETSYTYTAEDGDGHGNTAPKNGQTLNNPVTVQG
QANAVRDVIQAVSDVGEAGIGVFYWEPAWIPVGPAHRLEKNKALWETYGSGWATSYAAEYDPEDAGKWFGGSAVDNQALF
DFKGRPLPSLHVFQYVDTGTPFKN
>O07013 3.2.1.-~~~ganB~~~Endo-beta-1,4-galactanase~~~COG3867
MKSKVKMFFAAAIVWSACSSTGYAAAIEKEKHVSELRAEDLFVKKVEGMNKDFIKGADVSSVIALENSGVTFYNTNGKRQ
DIFTTLKQAGVNYVRVRIWNHPYDSNGNGYGGGNNDVQKAIEIGKRATANGMKVLADFHYSDFWADPAKQKVPKAWANLS
FEAKKAKLYEYTKQSLQKMIKEGVDIGMVQVGNETTGGFAGETDWTKMCQLFNEGSRAVRETNSNILVALHFTNPETAGR
YSFIAETLSKNKVDYDVFASSYYPFWHGTLQNLTSVLKAVANTYGKKVMVAETSYTYTAEDGDGHGNTAPKSGQTLPYPI
SVQGQATAVRDVMEAVANTGKAGLGVFYWEPAWIPVGPKTQIEKNKVLWETYGSGWASSYAAEYDPEDAGKWYGGSAVDN
QALFDFNGHPLPSLQVFQYAESGHIPKKR
>O32261 ~~~ganP~~~Galactooligosaccharides transport system permease protein GanP~~~COG1175
MQHRQVALLLSIIPGLGQFYNKQWIKGIVFLFLGASFFAVFGDLLNMGFWGIFTLGTEVPRDNSVFLLAEGIIAVIVTCF
GLAVYYVNLRDAFQSGKQRDENKPLSSLKEQYQHIISEGYPYVVSGPSLFILIFAVIFPILFSFALAFTNYDLYHSPPAK
LIDWVGFQTFANIFTVDIWRSTFFDVLAWTVVWTLAASTLQVTLGIFLAIIVNQKDLRFKRFFRTILILPWAVPGFVTIL
IFAGLFNDSFGAMNHDILAFFGIDPLPWMTDANWSRLALILMQGWLGFPYIFLVSTGVLQSIPDDLYEAATIDGASVFSK
LRYITLPMVFIAMAPIIITQFTFNFNNFNIIYLFNGGGPAVTGSTAGGTDILVSWIYKLTMQSSQYSLAAALTILLSVFV
ISIALWQFRQTKSFKEEA
>O07011 ~~~ganQ~~~Galactooligosaccharides transport system permease protein GanQ~~~COG3833
MLADMKVRRYIRLLFSYLLLAFMAVIIVYPLLWTAGASFNPGNSLISTSIIPKHPTFDHYKELFAGKESLQYVQWYVNSM
KISLFTMAGSLLCVTFTAYAFSRFRFKGRKYALTLFLLLQMIPQFSALIALFVLAQILGMINSHWLLILLYIGGLIPMNT
YLMKGYMDSIPMDLDESAKIDGASSTRIFFQIILPLSKPMAAVVAMNGFTGPLGDFVLSSTILRTPESYTLPVGLFNLVN
DVMGASYTTFAAGALLISIPVAVIFIMLQKNFVSGLTAGGTKG
>O07008 ~~~ganR~~~HTH-type transcriptional regulator GanR~~~COG1609
MATIKDIAQEAGFSISTVSRVLNNDESLSVPDETREKIYEAAEKLNYRKKTVRPLVKHIAFLYWLTDKEELEDVYFKTMR
LEVEKLAKAFNVDMTTYKIADGIESIPEHTEGFIAVGTFSDEELAFLRNLTENGVFIDSTPDPDHFDSVRPDLAQMTRKT
VNILTEKGHKSIGFIGGTYKNPNTNQDEMDIREQTFRSYMREKAMLDERYIFCHRGFSVENGYRLMSAAIDTLGDQLPTA
FMIAADPIAVGCLQALNEKGIAIPNRVSIVSINNISFAKYVSPPLTTFHIDIHELCKNAVQLLLEQVQDKRRTVKTLYVG
AELIVRKSMN
>O07009 ~~~ganS~~~Galactooligosaccharide-binding protein~~~COG2182
MKMAKKCSVFMLCAAVSLSLAACGPKESSSAKSSSKGSELVVWEDKEKSNGIKDAVAAFEKEHDVKVKVVEKPYAKQIED
LRMDGPAGTGPDVLTMPGDQIGTAVTEGLLKELHVKKDVQSLYTDASIQSQMVDQKLYGLPKAVETTVLFYNKDLITEKE
LPKTLEEWYDYSKKTADGSKFGFLALFDQIYYAESVMSGYGGYIFGKAKDGSYNPSDIGINNEGAVKGAALIQKFYKDGL
FPAGIIGEQGINVLESLFTEGKAAAIISGPWNVEAFSNAGINYGITKLPKLENGKNMSSFIGVKSYNVSAFSKNEELAQE
LAVFLANEKNSKTRYEETKEVPAVKSLANDPAIMKSEAARAVTEQSRFSEPTPNIPEMNEIWTPADSALQTVATGKADPK
QALDQAAETAKGQIKAKHSGK
>Q02PG5 1.2.1.-~~~gap2~~~Glyceraldehyde-3-phosphate dehydrogenase-like protein~~~
MIPLIGQLYRNNNVVTSIHGRGLINRSVIAIMKAHRFARHRMADDAELSVHETFPILKAMSELKLGAASVDLGKMVAKFK
AEGNGRSIEDFVKAELAEVAGKQNGDAREGTDVVLYGFGRIGRLLARILIEKTGGGDGLRLRAIVVRKGAENDLVKRASL
LRRDSVHGPFDGTITIDEENNTLTANGNLIQVIYSNDPASIDYTQYGIKNALLVDNTGKWRDAEGLGQHLKCPGIDRVVL
TAPGKGALKNIVHGINHTDIGADDKIISAASCTTNAIVPVLKAVNDQYGIVNGHVETVHSYTNDQNLIDNFHKGSRRGRS
APLNMVITKTGAATAAAKALPVLKGKLTGNAIRVPTPNVSMAILNLNLEKATTREEINEYLRQMAMHSDLQKQIDFVSSQ
EVVSTDFVGSRHAGVVDAEATICNDNRVVLYVWYDNEFGYSCQVVRVMEDMAGVNPPAFPR
>Q3C1A6 1.2.1.9~~~gapN~~~NADP-dependent glyceraldehyde-3-phosphate dehydrogenase~~~
MTKQYKNYVNGEWKLSKEEIKIYAPATGEELGSVPAMSQEEVDYVYASAKAAQKAWRALSYVERAEYLHKAADILMRDAE
KIGAVLSKEIAKGYKSAVGEVIRTAEIINYAAEEGVRLEGEVLEGGSFDPASKKKIAIVRREPVGLVLAISPFNYPINLA
GSKIAPALISGNVVALKPPTQGSISGLLLAEAFAEAGLPAGVFNTITGRGSVIGDYIVEHEAVNYINFTGSTPVGEHIGH
LAGMRPIMLELGGKDSAIILEDADLDLAAKNIVAGAYGYSGQRCTAVKRVLVMDSIADKLVEKVSALVNNLTVGMPEDNA
DITPLIDTKAADYVEGLIKDAQEKGAKEVISFKREGNLISPVLFDNVTTDMRLAWEEPFGPVLPFIRVNSVEEAIEISNK
SEYGLQASVFTNNFPLAFKIAEQLEVGTVHINNKTQRGTDNFPFLGAKKSGAGVQGVKYSIEAMTTVKSTVFDIAK
>Q59931 1.2.1.9~~~gapN~~~NADP-dependent glyceraldehyde-3-phosphate dehydrogenase~~~COG1012
MTKQYKNYVNGEWKLSENEIKIYEPASGAELGSVPAMSTEEVDYVYASAKKAQPAWRSLSYIERAAYLHKVADILMRDKE
KIGAVLSKEVAKGYKSAVSEVVRTAEIINYAAEEGLRMEGEVLEGGSFEAASKKKIAVVRREPVGLVLAISPFNYPVNLA
GSKIAPALIAGNVIAFKPPTQGSISGLLLAEAFAEAGLPAGVFNTITGRGSEIGDYIVEHQAVNFINFTGSTGIGERIGK
MAGMRPIMLELGGKDSAIVLEDADLELTAKNIIAGAFGYSGQRCTAVKRVLVMESVADELVEKIREKVLALTIGNPEDDA
DITPLIDTKSADYVEGLINDANDKGAAALTEIKREGNLICPILFDKVTTDMRLAWEEPFGPVLPIIRVTSVEEAIEISNK
SEYGLQASIFTNDFPRAFGIAEQLEVGTVHINNKTQRGTDNFPFLGAKKSGAGIQGVKYSIEAMTTVKSVVFDIK
>Q3L890 ~~~gap~~~Peptidoglycolipid exporter Gap~~~
MWSDILGLALFVSLNPLLLGFILLVLSRPRPVPNLVVFWVGCLIVNVPGFLIPLFVLRAVPSFAEFAEDLTTADPSSGIE
PFQLGTGIFALAVSAVIALRMWVKRRANQPVLVGSGAGDRGPTDDASTLVLDSGAREAREPGAIARMILRMRSALQRLVS
RLHQEWENGALWVALVFGLAYIPPPPLVLLVDTIIGGSGAPIGTQIIAVFVFIMAMLAVFEITLLSYVIAPRRTQAVLEP
LHEWSHRHRQMILLVLFGAVGIWELIVGLGVI
>A0QYG2 ~~~garA~~~Glycogen accumulation regulator GarA~~~COG1716
MTDKDSNLGADQSEDVTVETTSVFRADFLNELDAPAAAGTEGAVSGVEGLPSGSALLVVKRGPNAGSRFLLDQPTTSAGR
HPDSDIFLDDVTVSRRHAEFRLEGGEFQVVDVGSLNGTYVNREPVDSAVLANGDEVQIGKFRLVFLTGPKSDDSGSNA
>P9WJA9 ~~~garA~~~Glycogen accumulation regulator GarA~~~COG1716
MTDMNPDIEKDQTSDEVTVETTSVFRADFLSELDAPAQAGTESAVSGVEGLPPGSALLVVKRGPNAGSRFLLDQAITSAG
RHPDSDIFLDDVTVSRRHAEFRLENNEFNVVDVGSLNGTYVNREPVDSAVLANGDEVQIGKFRLVFLTGPKQGEDDGSTG
GP
>P39829 4.2.1.42~~~garD~~~Galactarate dehydratase (L-threo-forming)~~~COG2721
MANIEIRQETPTAFYIKVHDTDNVAIIVNDNGLKAGTRFPDGLELIEHIPQGHKVALLDIPANGEIIRYGEVIGYAVRAI
PRGSWIDESMVVLPEAPPLHTLPLATKVPEPLPPLEGYTFEGYRNADGSVGTKNLLGITTSVHCVAGVVDYVVKIIERDL
LPKYPNVDGVVGLNHLYGCGVAINAPAAVVPIRTIHNISLNPNFGGEVMVIGLGCEKLQPERLLTGTDDVQAIPVESASI
VSLQDEKHVGFQSMVEDILQIAERHLQKLNQRQRETCPASELVVGMQCGGSDAFSGVTANPAVGYASDLLVRCGATVMFS
EVTEVRDAIHLLTPRAVNEEVGKRLLEEMEWYDNYLNMGKTDRSANPSPGNKKGGLANVVEKALGSIAKSGKSAIVEVLS
PGQRPTKRGLIYAATPASDFVCGTQQVASGITVQVFTTGRGTPYGLMAVPVIKMATRTELANRWFDLMDINAGTIATGEE
TIEEVGWKLFHFILDVASGKKKTFSDQWGLHNQLAVFNPAPVT
>P23522 4.1.2.20~~~garL~~~5-keto-4-deoxy-D-glucarate aldolase~~~COG3836
MNNDVFPNKFKAALAAKQVQIGCWSALSNPISTEVLGLAGFDWLVLDGEHAPNDISTFIPQLMALKGSASAPVVRVPTNE
PVIIKRLLDIGFYNFLIPFVETKEEAELAVASTRYPPEGIRGVSVSHRANMFGTVADYFAQSNKNITILVQIESQQGVDN
VDAIAATEGVDGIFVGPSDLAAALGHLGNASHPDVQKAIQHIFNRASAHGKPSGILAPVEADARRYLEWGATFVAVGSDL
GVFRSATQKLADTFKK
>P0AA80 ~~~garP~~~Probable galactarate/D-glucarate transporter GarP~~~COG2271
MILDTVDEKKKGVHTRYLILLIIFIVTAVNYADRATLSIAGTEVAKELQLSAVSMGYIFSAFGWAYLLMQIPGGWLLDKF
GSKKVYTYSLFFWSLFTFLQGFVDMFPLAWAGISMFFMRFMLGFSEAPSFPANARIVAAWFPTKERGTASAIFNSAQYFS
LALFSPLLGWLTFAWGWEHVFTVMGVIGFVLTALWIKLIHNPTDHPRMSAEELKFISENGAVVDMDHKKPGSAAASGPKL
HYIKQLLSNRMMLGVFFGQYFINTITWFFLTWFPIYLVQEKGMSILKVGLVASIPALCGFAGGVLGGVFSDYLIKRGLSL
TLARKLPIVLGMLLASTIILCNYTNNTTLVVMLMALAFFGKGFGALGWPVISDTAPKEIVGLCGGVFNVFGNVASIVTPL
VIGYLVSELHSFNAALVFVGCSALMAMVCYLFVVGDIKRMELQK
>P0ABQ2 1.1.1.60~~~garR~~~2-hydroxy-3-oxopropionate reductase~~~COG2084
MKVGFIGLGIMGKPMSKNLLKAGYSLVVADRNPEAIADVIAAGAETASTAKAIAEQCDVIITMLPNSPHVKEVALGENGI
IEGAKPGTVLIDMSSIAPLASREISEALKAKGIDMLDAPVSGGEPKAIDGTLSVMVGGDKAIFDKYYDLMKAMAGSVVHT
GEIGAGNVTKLANQVIVALNIAAMSEALTLATKAGVNPDLVYQAIRGGLAGSTVLDAKAPMVMDRNFKPGFRIDLHIKDL
ANALDTSHGVGAQLPLTAAVMEMMQALRADGLGTADHSALACYYEKLAKVEVTR
>Q9I402 ~~~~~~L-glutamate/L-aspartate-binding protein~~~
MRIAPSLLSTAIVAALLSAPVVADELTGTLKKIKETGTITLGHRDASIPFSYLGTEPGKPIGYSHDLQLKVVEAVKKELN
LPELKVRYNLVTSQTRIPLVQNGTVDIECGSTTNNEERQKQVDFSVGIFEVGTRLLSKKTANIKDFDDLKGKNVVTTAGT
TSERLLKAMNADKKMGMNIISAKDHGESFMMLESGRAVAFMMDDALLYGEMAKAKKPDDWVVGGTPQSFEIYGCMVRKGD
AAFKKVVDKAITDTYASGEVNKIYDKWFTQPIPPKGLNLNFPMSEELKKLIASPTDKAAEQM
>D0VWY5 1.8.1.16~~~garB~~~Glutathione amide reductase~~~
MTQHFDLIAIGGGSGGLAVAEKAAAFGKRVALIESKALGGTCVNVGCVPKKVMWYASHLAEAVRDAPGFGVQASGGTLDW
PRLVAGRDRYIGAINSFWDGYVERLGITRVDGHARFVDAHTIEVEGQRLSADHIVIATGGRPIVPRLPGAELGITSDGFF
ALQQQPKRVAIIGAGYIGIELAGLLRSFGSEVTVVALEDRLLFQFDPLLSATLAENMHAQGIETHLEFAVAALERDAQGT
TLVAQDGTRLEGFDSVIWAVGRAPNTRDLGLEAAGIEVQSNGMVPTDAYQNTNVPGVYALGDITGRDQLTPVAIAAGRRL
AERLFDGQSERKLDYDNIPTVVFAHPPLSKVGLSEPEARERLGDVLTVYETSFTPMRYALNEHGPKTAMKLVCAGPEQRV
VGVHVIGDGADEMLQGFAVAVKMGATKADFDNTVAIHPGSAEELVTLKEPVRRPGDPLPEGAA
>O66610 6.3.5.7~~~gatA~~~Glutamyl-tRNA(Gln) amidotransferase subunit A~~~COG0154
MLWKKSLSELRELLKRGEVSPKEVVESFYDRYNQTEEKVKAYITPLYGKALKQAESLKERELPLFGIPIAVKDNILVEGE
KTTCASKILENFVAPYDATVIERLKKAGALIVGKTNLDEFAMGSSTEYSAFFPTKNPWDLERVPGGSSGGSAASVAVLSA
PVSLGSDTGGSIRQPASFCGVIGIKPTYGRVSRYGLVAFASSLDQIGVFGRRTEDVALVLEVISGWDEKDSTSAKVPVPE
WSEEVKKEVKGLKIGLPKEFFEYELQPQVKEAFENFIKELEKEGFEIKEVSLPHVKYSIPTYYIIAPSEASSNLARYDGV
RYGYRAKEYKDIFEMYARTRDEGFGPEVKRRIMLGTFALSAGYYDAYYLKAQKVRRLITNDFLKAFEEVDVIASPTTPTL
PFKFGERLENPIEMYLSDILTVPANLAGLPAISIPIAWKDGLPVGGQLIGKHWDETTLLQISYLWEQKFKHYEKIPLT
>O06491 6.3.5.7~~~gatA~~~Glutamyl-tRNA(Gln) amidotransferase subunit A~~~COG0154
MSLFDHKITELKQLIHKKEIKISDLVDESYKRIQAVDDKVQAFLALDEERARAYAKELDEAVDGRSEHGLLFGMPIGVKD
NIVTKGLRTTCSSKILENFDPIYDATVVQRLQDAEAVTIGKLNMDEFAMGSSTENSAYKLTKNPWNLDTVPGGSSGGSAA
AVAAGEVPFSLGSDTGGSIRQPASFCGVVGLKPTYGRVSRYGLVAFASSLDQIGPITRTVEDNAFLLQAISGVDKMDSTS
ANVDVPDFLSSLTGDIKGLKIAVPKEYLGEGVGKEARESVLAALKVLEGLGATWEEVSLPHSKYALATYYLLSSSEASAN
LARFDGIRYGYRTDNADNLIDLYKQTRAEGFGNEVKRRIMLGTFALSSGYYDAYYKKAQKVRTLIKKDFEDVFEKYDVIV
GPTTPTPAFKIGENTKDPLTMYANDILTIPVNLAGVPGISVPCGLADGLPLGLQIIGKHFDESTVYRVAHAFEQATDHHK
AKPEL
>Q72DX1 6.3.5.7~~~gatA~~~Glutamyl-tRNA(Gln) amidotransferase subunit A~~~COG0154
MSALHTLSLAAIRDALARREVRAEDAVLDCLARIETTEPRIDALLHLRAEAAIEEARALDAAGPDASRPLWGVPVTVKDA
LTTAGTPTTAGSRILEDFVPFYDAFAVQRLREAGAIILGKNNMDEFAMGSSTENSAYKPTRNPWDTARVPGGSSGGSAAS
VAAGQCFASLGTDTGGSIRQPASLCGCVGLKPTYGRVSRYGLIAYGSSLDQIGPMTRTVEDAAIVMGVIAGHDKRDSTCA
DRPVEDFAAALASRHDLAGVRIGVPAEFWGEGLSPEVATSCRAALDAARDLGATIVDVALPHTPQSIAAYYIVASAEASS
NLARYDGVRYGKRAHAPEDLMDLYVRSRSEGLGDEVQRRIMLGTYVLSSGYYDAYYRKAAQVRRRILEDYRNAFATCDVI
CGPVSPVTAWPLGALTADPLQMYLMDVFTLSLNLAGLPGLSLPVGLGTESGMPVGIQLLGRSFDEATLLSVGNVLSRALP
PLGSPAGLR
>P9WQA1 6.3.5.7~~~gatA~~~Glutamyl-tRNA(Gln) amidotransferase subunit A~~~COG0154
MTDIIRSDAATLAAKIAIKEVSSAEITRACLDQIEATDETYHAFLHVAADEALAAAAAIDKQVAAGEPLPSALAGVPLAL
KDVFTTSDMPTTCGSKILEGWRSPYDATLTARLRAAGIPILGKTNMDEFAMGSSTENSAYGPTRNPWNLDRVPGGSGGGS
AAALAAFQAPLAIGSDTGGSIRQPAALTATVGVKPTYGTVSRYGLVACASSLDQGGPCARTVLDTALLHQVIAGHDPRDS
TSVDAEVPDVVGAARAGAVGDLRGVRVGVVRQLHGGEGYQPGVLASFEAAVEQLTALGAEVSEVDCPHFDHALAAYYLIL
PSEVSSNLARFDAMRYGLRVGDDGTRSAEEVMAMTRAAGFGPEVKRRIMIGTYALSAGYYDAYYNQAQKVRTLIARDLDA
AYRSVDVLVSPTTPTTAFRLGEKVDDPLAMYLFDLCTLPLNLAGHCGMSVPSGLSPDDGLPVGLQIMAPALADDRLYRVG
AAYEAARGPLLSAI
>Q9HVT8 6.3.5.7~~~gatA~~~Glutamyl-tRNA(Gln) amidotransferase subunit A~~~
MLHQLTLAEIARALADKQFSAEELTRTLLGRIRQLDPQLNSFISITDDLAIAQAKAADERRANGENGALLGAPIAHKDLF
CTQGVRTSCGSKMLDNFVSPYDATVVEKLTAAGAVTLGKLNMDEFAMGSSNQSSHYGAVKNPWSLDRVPGGSSGGSAAAV
AARLLPAATGTDTGGSIRQPAALTNLTGIKPTYGRVSRWGMIAYASSLDQGGPLARTAEDCALMLGVMAGFDPKDSTSVE
QPVDDYLAALQKPLSGLRIGLPREYFGAGLDSRIADAVLAVVEELKTLGATVKDISLPNMQHAIPAYYVIAPAEASSNLS
RFDGVRYGYRCDAPQNLEDLYKRSRAEGFGSEVKNRIMVGTYALSAGYYDAYYLQAQKIRRLIKNDFVSAFAEVDVILGP
TTPNPAWKIGEKNDDPVSQYLEDIYTITANLAGLPGLSMPAGFVDGLPVGVQLLAPYFQEGRLLNVAHQYQQVSDWHTRT
PAGF
>P63488 6.3.5.7~~~gatA~~~Glutamyl-tRNA(Gln) amidotransferase subunit A~~~
MSIRYESVENLLTLIKDKKIKPSDVVKDIYDAIEETDPTIKSFLALDKENAIKKAQELDELQAKDQMDGKLFGIPMGIKD
NIITNGLETTCASKMLEGFVPIYESTVMEKLHKENAVLIGKLNMDEFAMGGSTETSYFKKTVNPFDHKAVPGGSSGGSAA
AVAAGLVPLSLGSDTGGSIRQPAAYCGVVGMKPTYGRVSRFGLVAFASSLDQIGPLTRNVKDNAIVLEAISGADVNDSTS
APVDDVDFTSEIGKDIKGLKVALPKEYLGEGVADDVKEAVQNAVETLKSLGAVVEEVSLPNTKFGIPSYYVIASSEASSN
LSRFDGIRYGYHSKEAHSLEELYKMSRSEGFGKEVKRRIFLGTFALSSGYYDAYYKKSQKVRTLIKNDFDKVFENYDVVV
GPTAPTTAFNLGEEIDDPLTMYANDLLTTPVNLAGLPGISVPCGQSNGRPIGLQFIGKPFDEKTLYRVAYQYETQYNLHD
VYEKL
>P63489 6.3.5.7~~~gatA~~~Glutamyl-tRNA(Gln) amidotransferase subunit A~~~
MSIRYESVENLLTLIKDKKIKPSDVVKDIYDAIEETDPTIKSFLALDKENAIKKAQELDELQAKDQMDGKLFGIPMGIKD
NIITNGLETTCASKMLEGFVPIYESTVMEKLHKENAVLIGKLNMDEFAMGGSTETSYFKKTVNPFDHKAVPGGSSGGSAA
AVAAGLVPLSLGSDTGGSIRQPAAYCGVVGMKPTYGRVSRFGLVAFASSLDQIGPLTRNVKDNAIVLEAISGADVNDSTS
APVDDVDFTSEIGKDIKGLKVALPKEYLGEGVADDVKEAVQNAVETLKSLGAVVEEVSLPNTKFGIPSYYVIASSEASSN
LSRFDGIRYGYHSKEAHSLEELYKMSRSEGFGKEVKRRIFLGTFALSSGYYDAYYKKSQKVRTLIKNDFDKVFENYDVVV
GPTAPTTAFNLGEEIDDPLTMYANDLLTTPVNLAGLPGISVPCGQSNGRPIGLQFIGKPFDEKTLYRVAYQYETQYNLHD
VYEKL
>Q97SE6 6.3.5.7~~~gatA~~~Glutamyl-tRNA(Gln) amidotransferase subunit A~~~COG0154
MTFNNKTIEELHNLLVSKEISATELTQATLENIKSREEALNSFVTIAEEQALVQAKAIDEAGIDADNVLSGIPLAVKDNI
STDGILTTAASKMLYNYEPIFDATAVANAKTKGMIVVGKTNMDEFAMGGSGETSHYGATKNAWNHSKVPGGSSSGSAAAV
ASGQVRLSLGSDTGGSIRQPAAFNGIVGLKPTYGTVSRFGLIAFGSSLDQIGPFAPTVKENALLLNAIASEDAKDSTSAP
VRIADFTSKIGQDIKGMKIALPKEYLGEGIDPEVKETILNAAKHFEKLGAIVEEVSLPHSKYGVAVYYIIASSEASSNLQ
RFDGIRYGYRAEDATNLDEIYVNSRSQGFGEEVKRRIMLGTFSLSSGYYDAYYKKAGQVRTLIIQDFEKVFADYDLILGP
TAPSVAYDLDSLNHDPVAMYLADLLTIPVNLAGLPGISIPAGFSQGLPVGLQLIGPKYSEETIYQAAAAFEATTDYHKQQ
PVIFGGDN
>Q9X0Z9 6.3.5.7~~~gatA~~~Glutamyl-tRNA(Gln) amidotransferase subunit A~~~COG0154
MIDLDFRKLTIEECLKLSEEEREKLPQLSLETIKRLDPHVKAFISVRENVSVEKKGKFWGIPVAIKDNILTLGMRTTCAS
RILENYESVFDATVVKKMKEAGFVVVGKANLDEFAMGSSTERSAFFPTRNPWDLERVPGGSSGGSAAAVSAGMVVAALGS
DTGGSVRQPASLCGVVGYKPTYGLVSRYGLVAFASSLDQIGPITKTVRDAAILMEIISGRDENDATTVNRKVDFLSEIEE
GVSGMKFAVPEEIYEHDIEEGVSERFEEALKLLERLGAKVERVKIPHIKYSVATYYVIAPAEASSNLARFDGVKYGLRIK
EKGLREMYMKTRNVGFGEEVRRRIMIGTFTLSAAYYEAYFNKAMKVRRKISDELNEVLSQYDAILTPTSPVTAFKIGEIK
DPLTYYLMDIFTIPANLAGLPAISVPFGFSNNLPVGVQVIGRRFADGKVFRIARAIEKNSPYNENGMFPLPEVKA
>Q9LCX3 6.3.5.7~~~gatA~~~Glutamyl-tRNA(Gln) amidotransferase subunit A~~~COG0154
MLAHEIRARVARGEVSPLEVAQAYLKRVQELDPGLGAFLSLNERLLEEAEAVDPGLPLAGLVVAVKDNIATRGLRTTAGS
RLLENFVPPYEATAVARLKALGALVLGKTNLDEFGMGSSTEHSAFFPTKNPFDPDRVPGGSSGGSAAALAADLAPLALGS
DTGGSVRQPAAFCGVYGLKPTYGRVSRFGLIAYASSLDQIGPMARSVRDLALLMDAAAGPDPLDATSLDLPPRFQEALEG
PLPPLRLGVVREALAGNSPGVERALEEALKVFRELGLSVREVSWPSLPQALAAYYILAPAEASSNLARYDGTLYGRRAAG
EEVEGMMEATRALFGLEVKRRVLVGTFVLSSGYYEAYYGRAQAFRRRLKAEAQALFREVDLLLLPTTPHPAFPFGARRDP
LAMYREDLYTVGANLTGLPALSFPAGFEGHLPVGLQLLAPWGEDERLLRAALAFEEATARAHLKAPLGEAL
>O66766 6.3.5.-~~~gatB~~~Aspartyl/glutamyl-tRNA(Asn/Gln) amidotransferase subunit B~~~COG0064
MNEKYEAVIGLEIHVQMDTKTKMFCGCKVEFGAEPNTNVCPVCLGMPGALPIVNKRAVEYAIRASLALNCEVHEESVFAR
KHYFYPDLPKGYQISQYEKPLATNGWVELNLPNGEKKKVRIRRLHIEEDAGKNIHEGDKTLVDLNRAGTPLMEIVTEPDI
RTPEEARLFLEKLRNIMRYAGVSKADMEKGQLRCDINVSIRPKGSKEFGTRVEIKNVNSFRFVQKALEYEIERQINVVEE
GGEVVQETRTFDPQTGKTYPMRTKEEAEDYRYFPDPDLVPLKVKKEWIEEIKKNMPELPDQRFERLIKEYGLSEYEAGIL
VNHKEVGDFFEEAVRHFKEPKGIVNWLINDLLGLLRDKGISIEESPVKPEHLAELVKLIKEKVISTKIGKEVIKEMVETG
KTPSQIVEEKGLKQITDENQIKELVKKIFEKHPKEVERLKQGEEKLIGFFVGQVMRETRGKANPQVVNKVIRELVKEV
>O30509 6.3.5.-~~~gatB~~~Aspartyl/glutamyl-tRNA(Asn/Gln) amidotransferase subunit B~~~COG0064
MNFETVIGLEVHVELKTKSKIFSSSPTPFGAEANTQTSVIDLGYPGVLPVLNKEAVEFAMKAAMALNCEIATDTKFDRKN
YFYPDNPKAYQISQFDKPIGENGWIEIEVGGKTKRIGITRLHLEEDAGKLTHTGDGYSLVDFNRQGTPLVEIVSEPDIRT
PEEAYAYLEKLKSIIQYTGVSDCKMEEGSLRCDANISLRPIGQEEFGTKTELKNLNSFAFVQKGLEHEEKRQEQVLLSGF
FIQQETRRYDEATKKTILMRVKEGSDDYRYFPEPDLVELYIDDEWKERVKASIPELPDERRKRYIEELGFAAYDAMVLTL
TKEMADFFEETVQKGAEAKQASNWLMGEVSAYLNAEQKELADVALTPEGLAGMIKLIEKGTISSKIAKKVFKELIEKGGD
AEKIVKEKGLVQISDEGVLLKLVTEALDNNPQSIEDFKNGKDRAIGFLVGQIMKASKGQANPPMVNKILLEEIKKR
>Q72AV5 6.3.5.-~~~gatB~~~Aspartyl/glutamyl-tRNA(Asn/Gln) amidotransferase subunit B~~~COG0064
MASYEAVIGLEVHAQLRTRSKLFCSCSTAFGADPNAHVCEVCAGMPGVLPVLNEKAVEFAARMGIAVGCTVNRTSVFARK
NYFYPDLPKGYQISQYEQPICEHGHLDISVGDAVKRIGITRIHLEDDAGKNIHSAGENVSYVDLNRTGVPLIEIVSEPDL
RSAEEAVAYLKALRAIVVHLGICDGNMEEGSFRCDANVSLRPRGAAEFGTRAELKNLNSFRHVQRAIEYEISRQADLLDD
GDKVVQETRLYDSVKNITVSMRGKEEAHDYRYFPDPDLIPIHIDEARLAEWQATLPELPQARLERFMSSFGLSAQDAEVL
TAERDHAEFFEAAVKLYDQPRKIANMMLGPLQRELNQRGTSLAVSAMRPEALAELVRIIDAGLISAKIGNDVFGELFENG
AMPEAFVRERGLVQISDTSAIEQAVDEVIAENPAEVEAYRGGKTKLVSFFVGQVMRKTRGKANPALVNELLASKLG
>P9WN61 6.3.5.-~~~gatB~~~Aspartyl/glutamyl-tRNA(Asn/Gln) amidotransferase subunit B~~~COG0064
MTVAAGAAKAAGAELLDYDEVVARFQPVLGLEVHVELSTATKMFCGCTTTFGGEPNTQVCPVCLGLPGSLPVLNRAAVES
AIRIGLALNCEIVPWCRFARKNYFYPDMPKNYQISQYDEPIAINGYLDAPLEDGTTWRVEIERAHMEEDTGKLTHIGSET
GRIHGATGSLIDYNRAGVPLIEIVTKPIVGAGARAPQIARSYVTALRDLLRALDVSDVRMDQGSMRCDANVSLKPAGTTE
FGTRTETKNVNSLKSVEVAVRYEMQRQGAILASGGRITQETRHFHEAGYTSAGRTKETAEDYRYFPEPDLEPVAPSRELV
ERLRQTIPELPWLSRRRIQQEWGVSDEVMRDLVNAGAVELVAATVEHGASSEAARAWWGNFLAQKANEAGIGLDELAITP
AQVAAVVALVDEGKLSNSLARQVVEGVLAGEGEPEQVMTARGLALVRDDSLTQAAVDEALAANPDVADKIRGGKVAAAGA
IVGAVMKATRGQADAARVRELVLEACGQG
>Q9HVT7 6.3.5.-~~~gatB~~~Aspartyl/glutamyl-tRNA(Asn/Gln) amidotransferase subunit B~~~
MQWETVIGLEIHAQLATQSKIFSGSSTAFGAAPNTQASLVDLAMPGTLPVLNEEAVRMACLFGLAIDARIDRQNVFARKN
YFYPDLPKGYQTSQMDHPIVGKGHLDITLEDGTTKRIGITRAHLEEDAGKSLHEDFQGMSGIDLNRAGTPLLEIVSEPDI
RSAKEAVAYVKAIHALVRYLGICDGNMAEGSLRCDCNVSVRPKGQAEFGTRAEIKNVNSFRFIEKAINHEIQRQIELIED
GGKVVQETRLYDPNKDETRSMRGKEEANDYRYFPCPDLLPVVIEPEYLAKLREQLPELPVQKRERFESQYGLSAYDASVL
SASREMADYFEKVQGICGDAKLAANWVMVELGSLLNKDGLEIEQSPVSAEQLGGMILRIKDNTISGKLAKMVFEAMANGE
GSADQIIEAKGLKQVTDSGAIEKMLDEVLAANAEQVEQYRAADEAKRGKMFGFFVGQAMKASKGKANPQQVNELLKKKLE
A
>P64201 6.3.5.-~~~gatB~~~Aspartyl/glutamyl-tRNA(Asn/Gln) amidotransferase subunit B~~~
MHFETVIGLEVHVELKTDSKMFSPSPAHFGAEPNSNTNVIDLAYPGVLPVVNKRAVDWAMRAAMALNMEIATESKFDRKN
YFYPDNPKAYQISQFDQPIGENGYIDIEVDGETKRIGITRLHMEEDAGKSTHKGEYSLVDLNRQGTPLIEIVSEPDIRSP
KEAYAYLEKLRSIIQYTGVSDVKMEEGSLRCDANISLRPYGQEKFGTKAELKNLNSFNYVRKGLEYEEKRQEEELLNGGE
IGQETRRFDESTGKTILMRVKEGSDDYRYFPEPDIVPLYIDDAWKERVRQTIPELPDERKAKYVNELGLPAYDAHVLTLT
KEMSDFFESTIEHGADVKLTSNWLMGGVNEYLNKNQVELLDTKLTPENLAGMIKLIEDGTMSSKIAKKVFPELAAKGGNA
KQIMEDNGLVQISDEATLLKFVNEALDNNEQSVEDYKNGKGKAMGFLVGQIMKASKGQANPQLVNQLLKQELDKR
>P99169 6.3.5.-~~~gatB~~~Aspartyl/glutamyl-tRNA(Asn/Gln) amidotransferase subunit B~~~
MHFETVIGLEVHVELKTDSKMFSPSPAHFGAEPNSNTNVIDLAYPGVLPVVNKRAVDWAMRAAMALNMEIATESKFDRKN
YFYPDNPKAYQISQFDQPIGENGYIDIEVDGETKRIGITRLHMEEDAGKSTHKGEYSLVDLNRQGTPLIEIVSEPDIRSP
KEAYAYLEKLRSIIQYTGVSDVKMEEGSLRCDANISLRPYGQEKFGTKAELKNLNSFNYVRKGLEYEEKRQEEELLNGGE
IGQETRRFDESTGKTILMRVKEGSDDYRYFPEPDIVPLYIDDAWKERVRQTIPELPDERKAKYVNELGLPAYDAHVLTLT
KEMSDFFESTIEHGADVKLTSNWLMGGVNEYLNKNQVELLDTKLTPENLAGMIKLIEDGTMSSKIAKKVFPELAAKGGNA
KQIMEDNGLVQISDEATLLKFVNEALDNNEQSVEDYKNGKGKAMGFLVGQIMKASKGQANPQLVNQLLKQELDKR
>Q9RF06 6.3.5.-~~~gatB~~~Aspartyl/glutamyl-tRNA(Asn/Gln) amidotransferase subunit B~~~
MHFETVIGLEVHVELKTDSKMFSPSPAHFGAEPNSNTNVIDLAYPGVLPVVNKRAVDWAMRAAMALNMEIATESKFDRKN
YFYPDNPKAYQISQFDQPIGENGYIDIEVDGETKRIGITRLHMEEDAGKSTHKGEYSLVDLNRQGTPLIEIVSEPDIRSP
KEAYAYLEKLRSIIQYTGVSDVKMEEGSLRCDANISLRPYGQEKFGTKAELKNLNSFNYVRKGLEYEEKRQEEELLSGGE
IGQETRRFDESTGKTILMRVKEGSDDYRYFPEPDIVPLYIDDAWKERVRQTIPELPDERKAKYVNELGLPAYDAHVLTLT
KEMSDFFESTIEHGADVKLTSNWLMGGVNEYLNKNQVELLDTKLTPENLAGMIKLIEDGTMSSKIAKKVFPELAAKGGNA
KQIMEDNGLVQISDEATLLKFVNEALDNNEQSVEDYKNGKGKAMGFLVGQIMKASKGQANPQLVNQLLKQELDKR
>Q97SE7 6.3.5.-~~~gatB~~~Aspartyl/glutamyl-tRNA(Asn/Gln) amidotransferase subunit B~~~COG0064
MNFETVIGLEVHVELNTNSKIFSPTSAHFGNDQNANTNVIDWSFPGVLPVLNKGVVDAGIKAALALNMDIHKKMHFDRKN
YFYPDNPKAYQISQFDEPIGYNGWIEVKLEDGTTKKIGIERAHLEEDAGKNTHGTDGYSYVDLNRQGVPLIEIVSEADMR
SPEEAYAYLTALKEVIQYAGISDVKMEEGSMRVDANISLRPYGQEKFGTKTELKNLNSFSNVRKGLEYEVQRQAEILRSG
GQIRQETRRYDEANKATILMRVKEGAADYRYFPEPDLPLFEISDEWIEEMRTELPEFPKERRARYVSDLGLSDYDASQLT
ANKVTSDFFEKAVALGGDAKQVSNWLQGEVAQFLNAEGKTLEQIELTPENLVEMIAIIEDGTISSKIAKKVFVHLAKNGG
GAREYVEKAGMVQISDPAILIPIIHQVFADNEAAVADFKSGKRNADKAFTGFLMKATKGQANPQVALKLLAQELAKLKEN
>Q9X100 6.3.5.-~~~gatB~~~Aspartyl/glutamyl-tRNA(Asn/Gln) amidotransferase subunit B~~~COG0064
MRYRPVIGLEIHVQLSTKTKAFCSCPADVFELPPNTAICPVCTGQPGALPVPNEEMIRFAVKTALALNCKIHKYSRFDRK
NYFYPDLPKGYQISQYFYPIATEGFLEIDGDEGRKKVRIRRLHLEEDAGKLVHEGDSITRASYSLVDMNRCGVPLIEIVT
EPDISSPREARVFMEKLRSIVRYLGVSTGDMEKGALRCDANISVVDTETGRQSNRVEVKNMNSFRFVERALEYEFERIVK
AMERGEDVERETRGWDMATKITVSMRGKEEESDYRYFPEPDIPPVVLSDEYLEEVKKELPELPDEKAERFMREYGLPEYD
AKVLTSSKELAEFFEECVKVVNRPKDLSNWIMTEVLRELNERNIEITESKLTPQHFADLFKLMDEGKISIKIAKEIFPEV
FETGKMPSQIVEEKGLTQINDEKLIEELVKKAMEQNPKAVQDYKSGKKKAAGFFVGYVMRETKGKANPELTNRIIQKLLE
GE
>Q9LCX2 6.3.5.-~~~gatB~~~Aspartyl/glutamyl-tRNA(Asn/Gln) amidotransferase subunit B~~~COG0064
MYEAVIGLEVHLHLKTRTKMFCGCRADYFGAEPNTHTCPVCLGLPGALPVPNRVAVEHGLRLALALGAEVPERLVFHRKN
YFYPDLPKNYQISQYDLPLGRGGSLPLGERRVRIKRLHLEEDAGKSLHLEGRTLLDLNRAGSPLIELVTEPDLKTPEEAR
LFLQRIQALVQTLGISDASPEEGKLRADVNVSVRRVGEPLGTKVEIKNLNSFKSVQRALEYEIRRQTEILRRGEKVKQAT
MGFEEGSGKTYPMRTKEEEADYRYFPEPDLPPVAIPRDWLEEVRRSLPELPWEKEARYRALGIKEKDAEVLAYTPSLARF
LDQALPLGLASPQALANWLLADVAGLLHERGLRLEETRLSPEGLARLVGLFERGEVTSRVAKSLLPEVLEGQDPEALVRE
RGLKVVADEGALKALVAEAIAAMPEAAESVRQGKVKALDALVGQVMRKTRGQARPDLVRRLLLEALGVG
>O67904 6.3.5.-~~~gatC~~~Glutamyl-tRNA(Gln) amidotransferase subunit C~~~COG0721
MVDREWVLKIAKLARLELKEEEIEVFQKQLSDILDFIDQLKELDTENVEPYIQEFEETPMREDEPHPSLDREKALMNAPE
RKDGFFVVPRVVEV
>P9WN59 6.3.5.-~~~gatC~~~Glutamyl-tRNA(Gln) amidotransferase subunit C~~~COG0721
MSQISRDEVAHLARLARLALTETELDSFAGQLDAILTHVSQIQAVDVTGVQATDNPLKDVNVTRPDETVPCLTQRQVLDQ
APDAVDGRFAVPQILGDEQ
>Q9HVT9 6.3.5.-~~~gatC~~~Glutamyl-tRNA(Gln) amidotransferase subunit C~~~
MALERSDVEKIAHLARLGLSEADLPRTTETLNNILGLIDQMQAVDTSGVEPLAHPLEATQRLRPDAVTETDHRDAYQTIA
PAVEEGLYLVPKVIES
>P68807 6.3.5.-~~~gatC~~~Aspartyl/glutamyl-tRNA(Asn/Gln) amidotransferase subunit C~~~
MTKVTREEVEHIANLARLQISPEETEEMANTLESILDFAKQNDSADTEGVEPTYHVLDLQNVLREDKAIKGIPQELALKN
AKETEDGQFKVPTIMNEEDA
>P68808 6.3.5.-~~~gatC~~~Aspartyl/glutamyl-tRNA(Asn/Gln) amidotransferase subunit C~~~
MTKVTREEVEHIANLARLQISPEETEEMANTLESILDFAKQNDSADTEGVEPTYHVLDLQNVLREDKAIKGIPQELALKN
AKETEDGQFKVPTIMNEEDA
>Q5XAC6 6.3.5.-~~~gatC~~~Glutamyl-tRNA(Gln) amidotransferase subunit C~~~
MKISEEEVRHVAKLSKLSFSESETTTFATTLSKIVDMVELLNEVDTEGVAITTTMADKKNVMRQDVAEEGTDRALLFKNV
PEKENHFIKVPAILDDGGDA
>Q97SE5 6.3.5.-~~~gatC~~~Glutamyl-tRNA(Gln) amidotransferase subunit C~~~COG0721
MKITQEEVTHVANLSKLRFSEEETAAFATTLSKIVDMVELLGEVDTTGVAPTTTMADRKTVLRPDVAEEGIDRDRLFKNV
PEKDNYYIKVPAILDNGGDA
>Q9WY94 6.3.5.-~~~gatC~~~Glutamyl-tRNA(Gln) amidotransferase subunit C~~~COG0721
MIKVTKDLVLHLENLARLELSEDQRESLMKDFQEILDYVELLNEVDVEGVEPMYTPVEDSAKLRKGDPRFFEMRDLIKKN
FPEEKDGHIKVPGIHR
>Q9LCX4 6.3.5.-~~~gatC~~~Glutamyl-tRNA(Gln) amidotransferase subunit C~~~COG0721
MELSPELLRKLETLAKIRLSPEEEALLLQDLKRILDFVDALPRVEEGGAEEALGRLREDEPRPSLPQAEALALAPEAEDG
FFRVPPVLE
>C0KTJ6 1.1.1.406~~~~~~Galactitol 2-dehydrogenase (L-tagatose-forming)~~~
MDYRTVFRLDGACAAVTGAGSGIGLEICRAFAASGARLILIDREAAALDRAAQELGAAVAARIVADVTDAEAMTAAAAEA
EAVAPVSILVNSAGIARLHDALETDDATWRQVMAVNVDGMFWASRAFGRAMVARGAGAIVNLGSMSGTIVNRPQFASSYM
ASKGAVHQLTRALAAEWAGRGVRVNALAPGYVATEMTLKMRERPELFETWLDMTPMGRCGEPSEIAAAALFLASPAASYV
TGAILAVDGGYTVW
>P0A9S3 1.1.1.251~~~gatD~~~Galactitol 1-phosphate 5-dehydrogenase~~~COG1063
MKSVVNDTDGIVRVAESVIPEIKHQDEVRVKIASSGLCGSDLPRIFKNGAHYYPITLGHEFSGYIDAVGSGVDDLHPGDA
VACVPLLPCFTCPECLKGFYSQCAKYDFIGSRRDGGFAEYIVVKRKNVFALPTDMPIEDGAFIEPITVGLHAFHLAQGCE
NKNVIIIGAGTIGLLAIQCAVALGAKSVTAIDISSEKLALAKSFGAMQTFNSSEMSAPQMQSVLRELRFNQLILETAGVP
QTVELAVEIAGPHAQLALVGTLHQDLHLTSATFGKILRKELTVIGSWMNYSSPWPGQEWETASRLLTERKLSLEPLIAHR
GSFESFAQAVRDIARNAMPGKVLLIP
>A0A0H2WZ38 6.3.5.13~~~gatD~~~Lipid II isoglutaminyl synthase (glutamine-hydrolyzing) subunit GatD~~~
MHELTIYHFMSDKLNLYSDIGNIIALRQRAKKRNIKVNVVEINETEGITFDECDIFFIGGGSDREQALATKELSKIKTPL
KEAIEDGMPGLTICGGYQFLGKKYITPDGTELEGLGILDFYTESKTNRLTGDIVIESDTFGTIVGFENHGGRTYHDFGTL
GHVTFGYGNNDEDKKEGIHYKNLLGTYLHGPILPKNYEITDYLLEKACERKGIPFEPKEIDNEAEIQAKQVLIDRANRQK
KSR
>A0A0H3JN63 6.3.5.13~~~gatD~~~Lipid II isoglutaminyl synthase (glutamine-hydrolyzing) subunit GatD~~~
MHELTIYHFMSDKLNLYSDIGNIIALRQRAKKRNIKVNVVEINETEGITFDECDIFFIGGGSDREQALATKELSKIKTPL
KEAIEDGMPGLTICGGYQFLGKKYITPDGTELEGLGILDFYTESKTNRLTGDIVIESDTFGTIVGFENHGGRTYHDFGTL
GHVTFGYGNNDEDKKEGIHYKNLLGTYLHGPILPKNYEITDYLLEKACERKGIPFEPKEIDNEAEIQAKQVLIDRANRQK
KSR
>Q8DNZ8 6.3.5.13~~~gatD~~~Lipid II isoglutaminyl synthase (glutamine-hydrolyzing) subunit GatD~~~COG3442
MVYTSLSSKDGNYPYQLNIAHLYGNLMNTYGDNGNILMLKYVAEKLGAHVTVDIVSLHDDFDENHYDIAFFGGGQDFEQS
IIADDLPAKKESIDNYIQNDGVVLAICGGFQLLGQYYVEASGKRIEGLGVMGHYTLNQTNNRFIGDIKIHNEDFDETYYG
FENHQGRTFLSDDQKPLGQVVYGNGNNEEKVGEGVHYKNVFGSYFHGPILSRNANLAYRLVTTALKKKYGQDIQLPAYED
ILSQEIAEEYSDVKSKADFS
>P0C8J6 4.1.2.40~~~gatY~~~D-tagatose-1,6-bisphosphate aldolase subunit GatY~~~COG0191
MYVVSTKQMLNNAQRGGYAVPAFNIHNLETMQVVVETAANLHAPVIIAGTPGTFTHAGTENLLALVSAMAKQYHHPLAIH
LDHHTKFDDIAQKVRSGVRSVMIDASHLPFAQNISRVKEVVDFCHRFDVSVEAELGQLGGQEDDVQVNEADALYTNPAQA
REFAEATGIDSLAVAIGTAHGMYASAPALDFSRLENIRQWVNLPLVLHGASGLSTKDIQQTIKLGICKINVATELKNAFS
QALKNYLTEHPEATDPRDYLQSAKSAMRDVVSKVIADCGCEGRA
>P0C8J7 4.1.2.40~~~gatY~~~D-tagatose-1,6-bisphosphate aldolase subunit GatY~~~COG0191
MYVVSTKQMLNNAQRGGYAVPAFNIHNLETMQVVVETAANLHAPVIIAGTPGTFTHAGTENLLALVSAMAKQYHHPLAIH
LDHHTKFDDIAQNLRSGVRSVMIDASHLPFAQNISRVKEVVDFCHRFDVSVEAELGQLGGQEDDVQVNEADAFYTNPAQA
REFAEATGIDSLAVAIGTAHGMYASAPVLDFSRLENIRQWVNLPLVLHGASGLSTKDIQQTIKLGICKINVATELKNAFS
QALKNYLTAHPEATDPRDYLQSAKSAMRDVVSKVIADCGCEGRA
>Q8VS16 4.1.2.40~~~gatY~~~D-tagatose-1,6-bisphosphate aldolase subunit GatY~~~COG0191
MFIISSKNMLLKAQRLGYAVPAFNIHNLETMQVVVETAAELRSPLILAGTPGTYSYAGTGNVVAIARDLAKIWDLPLAVH
LDHHEDLADITRKVQAGIRSVMIDGSHSPFEENVALVKSVVELSHRYDASVEAELGRLGGVEDDLGVDAKDALYTNPEQG
REFVARTGIDSLAVVIGTAHGLYAAEPKLGFAALPPISERVDVPLVLHGASKLPDSDIRRAISLGVCKVNVATELKIAFS
DALKHYFEENPDANEPRHYMKPAKAAMKDVVRKVIHVCGCEGQL
>Q8X7H4 ~~~gatZ~~~D-tagatose-1,6-bisphosphate aldolase subunit GatZ~~~COG4573
MKTLIARHKAGEHIGICSVCSAHPLVIEAALAFDRNSTRKVLIEATSNQVNQFGGYTGMTPADFREFVFAIADKVGFARE
RIILGGDHLGPNCWQQENVDAAMEKSVELVKAYVRAGFSKIHLDASMSCAGDPIPLAPETVAERAAVLCFAAESVATDCQ
REQLSYVIGTEVPVPGGEASAIQSVHITHVEDAANTLRTHQKAFIARGLTEALTRVIAIVVQPGVEFDHSNIIHYQPQEA
QALAQWIENTRMVYEAHSTDYQTRTAYWELVRDHFAILKVGPALTFALREAIFALAQIEQELIAPENRSGCLAVIEEVML
DEPQYWKKYYRTGFNDSLLDIRYSLSDRIRYYWPHSRIKNSVETMMVNLQGVDIPLGMISQYLPKQFERIQSGELSAIPH
QLIMDKIYDVLRAYRYGCAE
>P0C8J8 ~~~gatZ~~~D-tagatose-1,6-bisphosphate aldolase subunit GatZ~~~COG4573
MKTLIARHKAGEHIGICSVCSAHPLVIEAALAFDRNSTRKVLIEATSNQVNQFGGYTGMTPADFREFVFTIADKVGFARE
RIILGGDHLGPNCWQQENADAAMEKSVELVKEYVRAGFSKIHLDASMSCAGDPIPLAPETVAERAAVLCFAAESVATDCQ
REQLSYVIGTEVPVPGGEASAIQSVHITHVEDAANTLRTHQKAFIARGLTEALTRVIAIVVQPGVEFDHSNIIHYQPQEA
QPLAQWIENTRMVYEAHSTDYQTRTAYWELVRDHFAILKVGPALTFALREAIFALAQIEQELIAPENRSGCLAVIEEVML
DEPQYWKKYYRTGFNDSLLDIRYSLSDRIRYYWPHSRIKNSVETMMVNLEGVDIPLGMISQYLPKQFERIQSGELSAIPH
QLIMDKIYDVLRAYRYGCAE
>P0C8J9 ~~~gatZ~~~D-tagatose-1,6-bisphosphate aldolase subunit GatZ~~~COG4573
MKTLIARHKAGEHIGICSVCSAHPLVIEAALAFDRNSTRKVLIEATSNQVNQFGGYTGMTPADFREFVFTIADKVGFARE
RIILGGDHLGPNCWQQENADAAMEKSVELVKAYVRAGFSKIHLDASMSCAGDPIPLAPETVAERAAVLCFAAESVATDCQ
REQLSYVIGTEVPVPGGEASAIQSVHITRVEDAANTLRTHQKAFIARGLAEALTRVIAIVVQPGVEFDHSNIIHYQPQEA
QPLAQWIENTRMVYEAHSTDYQTRTAYWELVRDHFAILKVGPALTFALREAIFALAQIEQELIAPENRSGCLAVIEEVMF
DEPQYWKKYYRTGFNDSLLDIRYSLSDRIRYYWPHSRIKNSVETMMVNLEGMEIPLGMISQYLPKQFERIQSGELSAIPH
QLIMDKIYDVLRAYRYGCAE
>Q8VS12 ~~~gatZ~~~D-tagatose-1,6-bisphosphate aldolase subunit GatZ~~~COG4573
MKDIISRHKAGEHIGICSVCSAHPLVIEAALSFDLHTNNKVLIEATSNQVNQFGGYTGMSCDFRDFVNKIAREVGFPSER
IILGGDHLGPNCWQGEPAAEAMEKSVDLIKAYVAAGFSKIHLDASMSCADDPVPLDPAIVAERAARLCQAAEETATDEQK
RHLTYVIGTEVPVPGGEASTIGSVHVTRAQDAAATLETHEAAFRKLGLNAALERVIAIVVQPGVEFDHTQIIHYQPEAAK
ALSAWIEGTPMVYEAHSTDYQSRQAYWALVRDHYAILKVGPALTFALREAIFSLAQMENELVAPESRSRVMEVIDEVMLN
EPGYWKKYYRPTWSQAMADIHFSLSDRIRYYWPHPRIRQSVEKLIANLTETKLPLGLISQYIPVQFERLSLNELAAVPHD
LILDKIQDVLRAYRYGRAI
>A0A0H2ZIC3 ~~~gbdR~~~HTH-type transcriptional regulator GbdR~~~
MTTYAPGVPPQNRNPQSIGFLLLDNFTLISLASAVEPLRMANQLSGRELYRWHTLSLDGRQVWASDGLQITPDAGTDNAP
AVDCVIVCGGVGIQRSVTREHVTFLQAQARQGRRLGAVCTGSWALARAGLLDGYDCSVHWECLAAMQEAFPRVAMSTRLF
SIDRNRFTSSGGTAPMDMMLHLIGREHGRELSAAISEMFIYERIRNEQDHQRVPLKHMLGTNQPKLQEIVALMEANLEEP
IDLDELAVYVNVSRRQLERLFQKYLHCSPSRYYLKLRLIRARQLLKQTSMSIIEVASVCGFVSTPHFSKCYREYFGIPPR
DERQGQPLGQPVVLMPIPQDLALMPNSSALSALSQAQGESTFASVRI
>Q9HTI4 ~~~gbdR~~~HTH-type transcriptional regulator GbdR~~~
MTTYAPGVPPQNRNPQSIGFLLLDNFTLISLASAVEPLRMANQLSGRELYRWHTLSLDGRQVWASDGLQITPDAGTDNAP
AVDCVIVCGGVGIQRSVTREHVTFLQAQARQGRRLGAVCTGSWALARAGLLDGYDCSVHWECLAAMQEAFPRVAMSTRLF
SIDRNRFTSSGGTAPMDMMLHLIGREHGRELSAAISEMFIYERIRNEQDHQRVPLKHMLGTNQPKLQEIVALMEANLEEP
IDLDELAVYVNVSRRQLERLFQKYLHCSPSRYYLKLRLIRARQLLKQTSMSIIEVASVCGFVSTPHFSKCYREYFGIPPR
DERQGQPLGQPVVLMPIPQDLALMPNSSALSALSQAQGESTFASVRI
>Q8KZT5 3.5.3.7~~~gbh~~~Guanidinobutyrase~~~
MEELRIEANGNLGPIDSSRIPRYAGAATYARLPRLDQVSKADVTVVGVPFDSGVSYRPGARFGANHVREASRLLRPYNPA
WDVSPFENIQVADAGDMAVNPFNINEAIETIQQNALDLTANGSKLVTLGGDHTIALPLLRAAAERAGEPIAMLHFDAHLD
TWDTYFGAEYTHGTPFRRAVEEGILDTEAISHVGTRGPLYGKKDLDDDHRFGFGIVTSADVYYQGVLETVAKIRDRIGNR
PLYISVDIDVLDPAHAPGTGTPEAGGITSRELLEIIRGFRGMNLVGADVVEVAPAYDHAEITGVAGSHVAYELVTLMADN
AVEGDRHGAPNGYAQQALGARIQEVAQAIGGQR
>Q9KLD5 ~~~gbpA~~~GlcNAc-binding protein A~~~COG3397
MKKQPKMTAIALILSGISGLAYGHGYVSAVENGVAEGRVTLCKFAANGTGEKNTHCGAIQYEPQSVEGPDGFPVTGPRDG
KIASAESALAAALDEQTADRWVKRPIQAGPQTFEWTFTANHVTKDWKYYITKPNWNPNQPLSRDAFDLNPFCVVEGNMVQ
PPKRVSHECIVPEREGYQVILAVWDVGDTAASFYNVIDVKFDGNGPVLPDWNPAGQIIPSMDLSIGDTVYTRVFDNDGEN
PAYRTELKIDSETLTKANQWSYALATKINQTQKQQRAGQLNGDQFVPVYGTNPIYLKEGSGLKSVEIGYQIEAPQPEYSL
TVSGLAKEYEIGEQPIQLDLTLEAQGEMSAELTVYNHHQKPLASWSQAMTDGELKSITLELSEAKAGHHMLVSRIKDRDG
NLQDQQTLDFMLVEPQTPPTPGDYDFVFPNGLKEYVAGTKVLASDGAIYQCKPWPYSGYCQQWTSNATQYQPGTGSHWEM
AWDKR
>P52661 ~~~gbpR~~~HTH-type transcriptional regulator GbpR~~~
MTPHRFPANWFLKARLKLRHLQLFVALDEHRNLHRAAASLTMSQPAASKLLGDLEESLGVTLFERHGRGVEPNWYGGLMI
RHARTILSGLQEAGEELNDLLAGHSGSVSIGTVMAPAVELVVPVITTLTRDHPDLKIAVAVETSDVLAERVRQGVMDFAI
GRLPDHVDASCFDYQEISSEELCFVCRDGHPLLRLGRPLTAADLVDATWILQPLGSLLRSRVEALFRAEGVPPPRKVIES
ASPVISLAMVAENDSVTVFARALAQVFSPTGSCTIVPFHKRFSVEPYGIFWLKDRPLSPGARTALAALRAASDTKMRRAL
EMPQPSSSSDIEMCMDSANNRI
>P71016 1.2.1.8~~~gbsA~~~Betaine aldehyde dehydrogenase~~~COG1012
MSQTLFIDGEWISAEKEQIRSIINPFNQEEIATVSEGGREDAIKAIAAARRAFDKGEWSSLSGLERGKIVLKIAELIRRD
LEELAELESLDTGKTLEESKADMDDIANVFQYYAGLADKDGGEIISSPIPDSESKIIREPIGVCGQITPWNYPLLQASWK
IAPALAAGNTIVMKPSEITPLTTIKVFKLMEEAGVPKGVANLVLGPGATVGDELAVNKDVDLISFTGGIETGKKIMRAAS
GNVKKIALELGGKNPNIVFKDADLEVAVDQALNAVFFHAGQVCSAGSRLLVEDAIHDQFLAELVKRAKRIKLGNGFHAET
ESGPLISAEHRAKVEKYVEIGIEEGAKLETGGKRPEDPELQNGFFYEPTIFSNCNSDMRIVQEEVFGPVLTVETFSSEEE
VIELANDTIYGLAGAVWSKDIEKCERVAARLRMGTVWINDFHPYFAQAPWGGYKQSGFGRELGKIGLEEYTEVKHVYRNT
KPAAVNWFNS
>P71017 1.1.1.-~~~gbsB~~~Choline dehydrogenase~~~COG1454
MTLNMKVESMQKFHTFEIPTVIKHGIGAIKHTGEEVAALGVSKALLVTDPGIYKAGVADPVIESLKEAGIEVVLFNKVEP
NPPVRLVNEGSELYKKENCNGLVAVGGGSSMDTAKAIGVEATHEGSVLDYEAADGKKPLENRIPPLTTIPTTAGTGSEVT
QWAVITDEEREFKFNTGGPLIAAHLTIIDPELHVSMPPHVTAMTGIDALAHAIECYTMKFAQPITDAVALMAIEYAAHYI
KRAFADGEDLEARYGMAQAAMLAGLSYGSESAGAAHAMSQTLGGIIPVAHGQCVAAMMGPVMEYNWKGYPEKFARIAKAF
GIDTSKMTTEEAAKASVNWMYDLVEDLEVPTLEEQGVSPDMIERLSKEAMKDPQTFGNPRDLNEKAYNWIYKRCFNLTPK
TV
>Q6F754 ~~~~~~Glycine betaine transporter~~~COG1292
MPSKTSSRFANINPNVFVSTIMIIAIFLAIVILAPDAFELLTQQLKNWITESFSWFYVLSVAFFLIVLGYIACSSSGKIK
LGPDHSQPDYSNSSWFAMLFTAGMGIGLMFFGIAEPIMHYVSPPSGEPETILAAQQSMRVTFFHWGLHAWGIYAIVALSL
SYFAYRHDLPLKIRSSLYPLIGKKIYGPMGDAVDTFATIGTIFGVATTLGFGVTQISSGLNYLFGFEPTSFSKVVLIIIV
SAMAALSVGLGLDKGVKRLAELNLVLAVTLLAFVFFTSATVYLLQTTIQNTGQYISNLFEMTFNLYAYQPNGWIGGWTIM
YWAWWISWSPFVGMFIARVSRGRTIREFIIGVMLIPTGFTLIWMGFMGNAGLYSILHDGNLSLLNAVQRDSSVALFEFLH
SLPFSGVMSLLATVLVVLFFVTSADSGALVVDYLTAKSEDSPVWQRLFWIVVMAGLAIILLLAGGLTALQSATIMSALPF
TFIMLLICWGLIKALRIDSTKMQAIQEARTTPRAIQNPRSWQQRLGLIMHYPHSKVEVDAYIKKHVQRAFESLEREFKRR
HLTVAISETDDGLQLKVDHHDEINFIYHVVSRETMPPSFMLEQEHNADVEKYFQAEVFLREGGQNYDVMDWTEEDLIQDI
IDQYERHLYFLSVMRAQTGN
>Q24SP9 ~~~~~~Probable glycine betaine transporter~~~COG1292
MNMGVDKKQNTVLYISSAIALLFVLWGVFLPENMANVVNKVFALLTTNFGWLYLLAVAIFIIFVFGIAISRYGKIKLGAD
DDKPEFSNFQWFAMLFGGGMGIGLVFWSVAEPIMHFNSPPFGEPGTVEAMQTSMRVVFFHWGIHAWVNFAIAGLALAYFQ
FRKGLPFLISSAFYPLIGDRIYGPIGKAIDILAVFATIFGIATSLGLGSSQIATGIQYIWGIPAGPLTISLVIAVITVIF
TLATVSGLHKAMQSIANVKVWLSVAFMVFIFYFGGKVFILNTFTQSLGDYLQNFVGQTFWMANESWVGGWTIFYWAWWIA
WAPFVGQFVARVSKGRTIREFVFAVTLLPVGFSFIWLAIYGGAAFNLDQISGGFIQNAVNADYTTALFALLQQMPLYAIT
GPLAILLIVTCFVGAADSATYVLAMLTSNGDMDPSKKLRSFWGIMQGAMTIVLIVVGGTAALKALQTASIASAFPFMLIM
LVMCYSILKALRSDHP
>Q9RR46 7.6.2.9~~~gbuA~~~Glycine betaine/carnitine transport ATP-binding protein GbuA~~~
MSKIKVEELTKIFGKKASKASSLLSQGKSKTDILKETGATIGVNKASFSVEEGEIFVIMGLSGSGKSTLVRLLNRLIEPT
SGKIWLDGKELSSLNKKELLEVRRKSMSMVFQNFGLFPNRTINRNVEYGLEIQGMDKEEREKNAAESLALVGLAGYGDQY
PSQLSGGMQQRVGLARALANNPDILLMDEAFSALDPLNRKDMQDQLLDLQDKMKKTIIFITHDLDEALRIGDHIMIMRDG
SVVQTGSPEEILAHPANEYVEKFIEDVDRSKVYTASNVMIRPEIVNFEKDGPRVALKRMREAGTSSVFVVKRNRELVGIV
HAAEVSKLVKENITSLETALHRDVPTTGLDTPLAEIMDTISTTTIPIAVTEDGKLKGIIIRGSVLAALSGNEVNVNA
>Q9I3S3 3.5.3.7~~~gbuA~~~Guanidinobutyrase~~~
MDKNLHQPLGGNEMPRFGGIATMMRLPHVQSPAELDALDAAFVGVPLDIGTSLRSGTRFGPREIRAESVMIRPYNMATGA
APFDSLNVADIGDVAINTFNLLEAVRIIEQEYDRILGHGILPLTLGGDHTITLPILRAIKKKHGKVGLVHVDAHADVNDH
MFGEKIAHGTTFRRAVEEDLLDCDRVVQIGLRAQGYTAEDFNWSRKQGFRVVQAEECWHKSLEPLMAEVREKVGGGPVYL
SFDIDGIDPAWAPGTGTPEIGGLTTIQAMEIIRGCQGLDLIGCDLVEVSPPYDTTGNTSLLGANLLYEMLCVLPGVVRR
>Q9RR45 ~~~gbuB~~~Glycine betaine/carnitine transport permease protein GbuB~~~
MPNIPTIPLASWIDKLVDGLTQFEGFFNVITNIIGGIVDAFQWVFDLVPPWLFIILLVFGTFWVNRKGKKWGLIIFEVVG
LLLIWNLDFWRDMTQTLTLVLTSSLIALVIGVPLGIWMAKSNIVESIFKPVLDFMQTMPAFVYLIPAVAFFGIGMVPGVV
ASVIFAMPPTVRMTNLGIRQVSTELVEAADSFGSTPWQKLWKVQLPMAKSTMMAGINQSIMLALSMVVIASMIGAMGLGT
RVYFAVGRNDAGGGFVAGIAIVIVAIILDRLTQAFNKKAKSE
>Q9RR44 ~~~gbuC~~~Glycine betaine/carnitine transport binding protein GbuC~~~
MLKKLITTAVLAMLIFTLAACGTTLAPYDAKKDLGEQINYTITGIDAGAGIMLATQNAIKDYHLDDDNWQLQTSSTAAMT
STLQKAMKDKRPIVVTGWTPHWMFTKFDLKFLDDPKNVYGNAENIHTIVRKGLKEDKPSAYQVLDNFFWTAEDMSEVMLE
VNDGVDPEEAAKKWIKNNPDKVAKWTDGVEKVDGDEIKLTYVAWDSEIASTNVVAEALKQVGYKPTIQAMEIQPMWASVA
TDAADGMVAAWLPNTSGIYYKDYKGKFEDLGPNLKGAKIGLAVPKYMTNINSIEDLKTSK
>C7PLV2 4.2.3.62~~~~~~(-)-gamma-cadinene synthase ((2Z,6E)-farnesyl diphosphate cyclizing)~~~COG0664
MPTITLPRIIYPFPSLINQFVTAAHEQNRQWVADFGFITTPEAMARFDRSRFAWLAARAFPHAGFHELCTIANFNTWLFM
LDDQCDEAQLGKKAVYLEHVTDGFMNILKHNTPVDTVLGRSFTDIWERMQALGDTAWQTRFIRSMEEYFTSCHWEAGNRA
ADIVPTVAEYVTMRPYTGALFADVEAIEIIEKVYLPAHILQHFIVQRLVLACNNIVCWANDIFSCAKEARQGDVHNLVLV
LQHERNSTLQEAVNETARMHNEEVKLFTALEKLLPSFGAEMDRELERFMAVLRSWITANYDWSYHDTGRYQVKEVEVVIN
S
>P10480 2.3.1.43~~~~~~Phosphatidylcholine-sterol acyltransferase~~~COG3240
MKKWFVCLLGLVALTVQAADSRPAFSRIVMFGDSLSDTGKMYSKMRGYLPSSPPYYEGRFSNGPVWLEQLTNEFPGLTIA
NEAEGGPTAVAYNKISWNPKYQVINNLDYEVTQFLQKDSFKPDDLVILWVGANDYLAYGWNTEQDAKRVRDAISDAANRM
VLNGAKEILLFNLPDLGQNPSARSQKVVEAASHVSAYHNQLLLNLARQLAPTGMVKLFEIDKQFAEMLRDPQNFGLSDTE
NACYGGSYVWKPFASRSASTDSQLSAFNPQERLAIAGNPLLAQAVASPMAARSASTLNCEGKMFWDQVHPTTVVHAALSE
PAATFIESQYEFLAH
>Q5SI26 ~~~~~~Glutamine synthetase and cystathionine beta-lyase binding protein~~~COG4274
MPTFIVLSTLTDDGAETLVKNPERIKEVNQELERDFGVRVVAQYAVLGPYDFVNVVEAEDAATVARAMLHLASRGSVKTM
TLEAIPVADLIARLK
>E9K9Z1 ~~~gccF~~~Bacteriocin glycocin F~~~
MSKLVKTLTISEISKAQNNGGKPAWCWYTLAMCGAGYDSGTCDYMYSHCFGIKHHSSGSSSYHC
>Q06700 7.2.4.5~~~gcdA~~~Glutaconyl-CoA decarboxylase subunit alpha~~~COG4799
MGFYSMPRYFQNMPQVGKPLKKADAANEEQLKKIEEEIHQLIKEAQEAGKADADVNKRGELTALQRIEKLVEPGSWRPLN
TLFNPQGNKNGSVAIVKGLGRVNGKWCVVVASDNKKLAGAWVPGQAECLLRASDTAKTLHVPLVYVLNCSGVKFDEQEKV
YPNRRGGGTPFFRNAELNQLGIPVIVGIYGTNPAGGGYHSISPTVIIAHEKANMAVGGAGIMGGMNPKGHVDLEYANEIA
DMVDRTGKTEPPGAVDIHYTETGFMREVYASEEGVLEGIKKYVGMLPKYDPEFFRVDDPKAPAFPADDLYSMVPLNDKRA
YDIYNVIARLFDNSELHEYKKGYGPEMVTGLAKVNGLLVGVVANVQGLLMNYPEYKAAGSVGIGGKLYRQGLVKMNEFVT
LCARDRLPIVWIQDTTGIDVGNDAEKAELLGLGQSLIYSIQTSHIPQFEITLRKGTAAAHYVLGGPQGNDTNAFSIGTAA
TEIAVMNGETAATAMYSRRLAKDRKAGKDLQPTIDKMNNLIQAFYTKSRPKVCAELGLVDEIVDMNKIRGYVEAFTEAAY
QNPESICPFHQMILPRAIREFETFVKK
>Q9ZAA6 7.2.4.5~~~gcdB~~~Glutaconyl-CoA decarboxylase subunit beta~~~COG1883
MDAFVVALTSVIQDSGFVAFTWGNAVMMLVGCILLYLAIVKGFEPLLLSPIAFGCILANVPRTGFETDPGVMQLILGGIK
YEIFPPLIFMGVGAMTDFGPLIANPKTLLLGAAAQIGVFVALLGAMLLGFNVKEASAIGIIGGADGPTSIYLASKMAPHL
LGAIAVAAYSYMSLVPLIQPPVMKLFTSKEERKIKMAQLRTVTHFEKVVFPIVTTIFISLLLPSVCSLIGMLMLGNLFTE
SGCMDRLSDTAQNALMNSVTIMLATGTGLTMKAESFLTLQTIEIICLGLVAFIGGTAGGVLFGKLMSKLDGGKTNPLIGS
AGVSAVPMAARVSQVVGQQADPGNFLLMHAMGPNVAGVIGTAVAAGTMLAMVGGK
>Q9ZAA7 7.2.4.5~~~gcdC~~~Glutaconyl-CoA decarboxylase subunit gamma~~~COG4770
MRKFNVNVNGTVYTVEVEEVGGAVTAAPAAPAAPAAAPAAAPVAAAPAAAPAPAPAAAPAAAPAPAAKPAAAAPAGSVTV
SAPMPGKILSVNVKPGDKVEAGDVLLILEAMKMQNEIMAPEDGTVSEVRVNAGDTVATGDVMVIL
>Q0P8J7 3.5.1.129~~~~~~Gamma-glutamyl-CDP-amidate hydrolase~~~COG2071
MFIGITQRLICNDSYHEKRECLALDWGKLFNKDLFKNFTPLPLSYEIDFSYYKHLIKAVILSGGNDLSFYSPNVLSKKRD
LYEKQVIEICLEEKIPLLGICRGAQMIAHYFNSHISPCENHIGKHEVFFSKEKFISNSFHNFAIEKLGEDLVELCLAKDN
TIEAFKHKYENIFGIMWHIERENGLNNIQILKEWFSLIKE
>P0AFP6 ~~~ybgI~~~GTP cyclohydrolase 1 type 2 homolog~~~COG0327
MKNTELEQLINEKLNSAAISDYAPNGLQVEGKETVQKIVTGVTASQALLDEAVRLGADAVIVHHGYFWKGESPVIRGMKR
NRLKTLLANDINLYGWHLPLDAHPELGNNAQLAALLGITVMGEIEPLVPWGELTMPVPGLELASWIEARLGRKPLWCGDT
GPEVVQRVAWCTGGGQSFIDSAARFGVDAFITGEVSEQTIHSAREQGLHFYAAGHHATERGGIRALSEWLNENTDLDVTF
IDIPNPA
>Q57354 ~~~~~~GTP cyclohydrolase 1 type 2 homolog~~~COG0327
MNNLELEQLINQKLSSDKINDYAPNGLQVEGKTEIKKIITGVTASQALINYAISQNADAILVHHGYFWKSETPCIRGMKG
KRIKALLVNDINLYGYHLPLDVHPELGNNAQLAKLLDIENLQPLEKGSVSIPVWGELKEPMTGKDFAEKIEKVLNRKPLI
CIENGPHLIRKIGICTGGGQGYIDLAAEQGCDAFITGEVSEQTIHSAREQGLYFFSAGHHATERYGIKALGEWLAKEYGF
DVEFKDIDNPA
>P9WFM1 ~~~~~~GTP cyclohydrolase 1 type 2 homolog~~~COG0327
MSVRLADVIDVLDQAYPPRLAQSWDSVGLVCGDPDDVVDSVTVAVDATPAVVDQVPQAGLLLVHHPLLLRGVDTVAANTP
KGVLVHRLIRTGRSLFTAHTNADSASPGVSDALAHAVGLTVDAVLDPVPGAADLDKWVIYVPRENSEAVRAAVFEAGAGH
IGDYSHCSWSVAGTGQFLAHDGASPAIGSVGTVERVAEDRVEVVAPARARAEVLAAMRAAHPYEEPAFDIFALVPPPVGS
GLGRIGRLPKPEPLRTFVARLEAALPPTATGVRAAGDPDLLVSRVAVCGGAGDSLLATVAAADVQAYVTADLRHHPADEH
CRASQVALIDVAHWASEFPWCGQAAEVLRSHFGASLPVRVCTICTDPWNLDHETGRDQA
>P67272 ~~~~~~GTP cyclohydrolase 1 type 2 homolog~~~
MKIADLMTLLDHHVPFSTAESWDNVGLLIGDGDVEVTGVLTALDCTLEVVNEAIEKGYNTIISHHPLIFKGVTSLKANGY
GLIIRKLIQHDINLIAMHTNLDVNPYGVNMMLAKAMGLKNISIINNQQDVYYKVQTYIPKDNVGPFKDKLSENGLAQEGN
YEYCFFESEGRGQFKPVGEANPTIGQIDKIEDVDEVKIEFMIDAYQKSRAEQLIKQYHPYETPVFDFIEIKQTSLYGLGV
MAEVDNQMTLEDFAADIKSKLNIPSVRFVGESNQKIKRIAIIGGSGIGYEYQAVQQGADVFVTGDIKHHDALDAKIHGVN
LIDINHYSEYVMKEGLKTLLMNWFNIEKINIDVEASTINTDPFQYI
>Q97PK0 ~~~~~~GTP cyclohydrolase 1 type 2 homolog~~~COG0327
MLASEVIQAYEAFCPQEFSMEGDSRGLQIGTLDKGIQRVMVALDIREETVAEAIEKGVDLIIVKHAPIFRPIKDLLASRP
QNQIYIDLIKHDIAVYVSHTNIDIVENGLNDWFCQMLGIEETTYLQETGPERGIGRIGNIQPQTFWELAQQVKQVFDLDS
LRMVHYQEDDLQKPISRVAICGGSGQSFYKDALAKGADVYITGDIYYHTAQDMLSDGLLALDPGHYIEVIFVEKIAALLS
QWKEDKGWSIDILPSQASTNPFHHI
>P19465 3.5.4.16~~~folE~~~GTP cyclohydrolase 1~~~COG0302
MKEVNKEQIEQAVRQILEAIGEDPNREGLLDTPKRVAKMYAEVFSGLNEDPKEHFQTIFGENHEELVLVKDIAFHSMCEH
HLVPFYGKAHVAYIPRGGKVTGLSKLARAVEAVAKRPQLQERITSTIAESIVETLDPHGVMVVVEAEHMCMTMRGVRKPG
AKTVTSAVRGVFKDDAAARAEVLEHIKRQD
>P0A6T5 3.5.4.16~~~folE~~~GTP cyclohydrolase 1~~~COG0302
MPSLSKEAALVHEALVARGLETPLRPPVHEMDNETRKSLIAGHMTEIMQLLNLDLADDSLMETPHRIAKMYVDEIFSGLD
YANFPKITLIENKMKVDEMVTVRDITLTSTCEHHFVTIDGKATVAYIPKDSVIGLSKINRIVQFFAQRPQVQERLTQQIL
IALQTLLGTNNVAVSIDAVHYCVKARGIRDATSATTTTSLGGLFKSSQNTRHEFLRAVRHHN
>Q8Y5X1 3.5.4.16~~~folE~~~GTP cyclohydrolase 1~~~COG0302
MEQIDKQKIADAVKVILEAVGENPDREGLIDTPMRVARMYEEVFAGLKKDPSVHFDTIFEEQHEELVLVKDIRFSSMCEH
HLVPFFGVAHVAYLPQNGRVAGLSKLARVVDDVSRRPQLQERITTTVAEIMMEKLKPLGVMVIMEAEHMCMTIRGVNKPG
TKTITSAVRGAFKNDDKLRSEVLALIKHN
>P9WN57 3.5.4.16~~~folE~~~GTP cyclohydrolase 1~~~COG0302
MSQLDSRSASARIRVFDQQRAEAAVRELLYAIGEDPDRDGLVATPSRVARSYREMFAGLYTDPDSVLNTMFDEDHDELVL
VKEIPMYSTCEHHLVAFHGVAHVGYIPGDDGRVTGLSKIARLVDLYAKRPQVQERLTSQIADALMKKLDPRGVIVVIEAE
HLCMAMRGVRKPGSVTTTSAVRGLFKTNAASRAEALDLILRK
>Q8ZG15 3.5.4.16~~~folE~~~GTP cyclohydrolase 1~~~COG0302
MSSLSKEAELVHQALLARGLETPLRKPELDAETRKTRIQAHMTEVMHLLNLDLTDDSLADTPRRIAKMYVDEIFSGLDYE
NFPKITLIQNKMKVDEMVTVRDITLTSTCEHHFVTIDGKATVAYIPKDSVIGLSKINRIVQFFAQRPQVQERLTQQILLA
LQTLLGTNNVAVSIDAVHYCVKARGIRDATSATTTTSLGGLFKSSQNTRQEFLRAVRHHG
>P94398 3.5.4.16~~~folE2~~~GTP cyclohydrolase FolE2~~~COG1469
MNQHTLLPKKTERLQYFGSVSPIKGEKPVEKEKMKDLQNIRKDYFFDIQHVGVANVSHPVTITSAMMPAEQTTAANFTMT
CNLPRNQKGINMSRLTELLQVYHQNGWILSFSSLQQFTKELAENMDTSSATVEVRFPWFFERKSPKLEKAGLMHADIFMS
VTYRKDQPFKQRAGISAKVTTLCPCSKEISEYSAHNQRGTVSIWADIHPAASLPSDVKADLLHAAESNASARLHPVLKRP
DEKAVTETAYENPRFVEDLARLIAADLFELEWVSAFEIECRNEESIHLHDAYAKLCFSKEVDKI
>Q5F9K6 3.5.4.16~~~folE2~~~GTP cyclohydrolase FolE2~~~
MNAIADVQSSRDLRNLPINQVGIKDLRFPITLKTAEGTQSTVARLTMTVYLPAEQKGTHMSRFVALMEQHTEVLDFAQLH
RLTAEMVALLDSRAGKISVSFPFFRKKTAPVSGIRSLLDYDVSLTGEMKDGAYGHSMKVMIPVTSLCPCSKEISQYGAHN
QRSHVTVSLTSDAEVGIEEVIDYVETQASCQLYGLLKRPDEKYVTEKAYENPKFVEDMVRDVATSLIADKRIKSFVVESE
NFESIHNHSAYAYIAYP
>Q82VD1 3.5.4.16~~~folE2~~~GTP cyclohydrolase FolE2~~~COG1469
MNKQIDLPIADVQGSLDTRHIAIDRVGIKAIRHPVVVADKGGGSQHTVAQFNMYVNLPHNFKGTHMSRFVEILNSHEREI
SVESFEEILRSMVSRLESDSGHIEMAFPYFINKSAPVSGVKSLLDYEVTFIGEIKHGNQYSFTMKVIVPVTSLCPCSKKI
SDYGAHNQRSHVTISVRTNSFIWIEDIIRIAEEQASCELYGLLKRPDEKYVTERAYNNPKFVEDIVRDVAEVLNHDDRID
AYIVESENFESIHNHSAYALIERDKRIR
>Q7A777 3.5.4.16~~~folE2~~~GTP cyclohydrolase FolE2~~~
MTEFDLSTREGRWKHFGSVDPIEGTKPTTKNEMTDLQSTHKDFLFEIEEVGIKNLVYPVLVDQYQTAGTFSFSTSLTKDE
KGINMSRIIESVEKHYDNGIELEFNTLYQVLRTLQTNMKQNAAGVDVSGKWFFDRYSPTTNIKAVGNADVTYGLAIDGDK
VTRKELTIEATVTTLCPCSKEISEYSAHNQRGVVTVKTYINKDQNIVDDYKNKILDAMEANASSILYPILKRPDEKRVTE
RAYENPRFVEDLIRLIAADLVEFDWLDGFDIECRNEESIHQHDAFAKLKYRK
>Q9WXP6 3.5.4.16~~~folE2~~~GTP cyclohydrolase FolE2~~~COG1469
MKDVQNEKDPRMVPLKKVGIKDLHWPLKVILKEDGYQSTVAQISCSVDLHREKRGIHMSRFIEVLNKLEVITPQIFEEIL
DDLIEIMEAKRAHLEIHFPYFIWKESPVSRKKSPLKVDCFVEAEKEKNFSFKIGVRTPVHTLCPCSKEISDYGAHNQRAF
VEITVKTRKFIWFEDLVEIAEKNASSPLYTLLKRPDEKFVTEKAYENPRFVEDVARDVALELEKDPRITWYRVYVESMES
IHNHNAFACVEKGDFVLEG
>A7HD43 2.7.13.3~~~gchK~~~Globin-coupled histidine kinase~~~COG2205
MTGVPETVFEELKRYVGWGDGDERALRSLHGAAAPHFPRLAEEFYDRILGHEGARTALVGGESQVGHLKVTMIAWLDELL
GGPWDEAYWDRRYRIGRVHVRIGLPQHYMFGAMNVHRTGLARLAYERFHGDPPELERVRNALGKVLDLELAVMLHTYRED
LLAQQARVERLSTFGQLVGSIGHDLRNPLGVIETSLYILRTRTGEDERARKHLDRIGEQLGVANGIITNLLDMIRDRPLA
REPVELAAVVGGAAESVRRPTGVSLALEGLDALPPVEGDPGQLRQVFVNLLENAVFAASPEGVVAVRASRADGLVALDVE
DSGPGVDPATRRRLFEPLITTKDKGIGLGLALVKRIAERHGGTVEYSDRPGGGARFTVRLPA
>A9CEQ8 5.5.1.27~~~gci~~~D-galactarolactone cycloisomerase~~~COG4948
MKITAVRTHLLEHRLDTPFESASMRFDRRAHVLVEIECDDGTVGWGECLGPARPNAAVVQAYSGWLIGQDPRQTEKIWAV
LYNALRDQGQRGLSLTALSGIDIALWDIKGKHYGASISMLLGGRWRESVRAYATGSFKRDNVDRVSDNASEMAERRAEGF
HACKIKIGFGVEEDLRVIAAVREAIGPDMRLMIDANHGYTVTEAITLGDRAAGFGIDWFEEPVVPEQLDAYARVRAGQPI
PVAGGETWHGRYGMWQALSAGAVDILQPDLCGCGGFSEIQKIATLATLHGVRIVPHVWGTGVQIAAALQFMAAMTPDPVR
VNPIEPIMEFDRTHNPFRQAVLREPLEAVNGVVTIPDGPGLGIEINRDALTEFRMPDP
>Q9X1S1 2.7.1.165~~~~~~D-glycerate 2-kinase~~~COG2379
MFDPESLKKLAIEIVKKSIEAVFPDRAVKETLPKLNLDRVILVAVGKAAWRMAKAAYEVLGKKIRKGVVVTKYGHSEGPI
DDFEIYEAGHPVPDENTIKTTRRVLELVDQLNENDTVLFLLSGGGSSLFELPLEGVSLEEIQKLTSALLKSGASIEEINT
VRKHLSQVKGGRFAERVFPAKVVALVLSDVLGDRLDVIASGPAWPDSSTSEDALKVLEKYGIETSESVKRAILQETPKHL
SNVEIHLIGNVQKVCDEAKSLAKEKGFNAEIITTSLDCEAREAGRFIASIMKEVKFKDRPLKKPAALIFGGETVVHVKGN
GIGGRNQELALSAAIALEGIEGVILCSAGTDGTDGPTDAAGGIVDGSTAKTLKAMGEDPYQYLKNNDSYNALKKSGALLI
TGPTGTNVNDLIIGLIV
>P0AEP7 4.1.1.47~~~gcl~~~Glyoxylate carboligase~~~COG3960
MAKMRAVDAAMYVLEKEGITTAFGVPGAAINPFYSAMRKHGGIRHILARHVEGASHMAEGYTRATAGNIGVCLGTSGPAG
TDMITALYSASADSIPILCITGQAPRARLHKEDFQAVDIEAIAKPVSKMAVTVREAALVPRVLQQAFHLMRSGRPGPVLV
DLPFDVQVAEIEFDPDMYEPLPVYKPAASRMQIEKAVEMLIQAERPVIVAGGGVINADAAALLQQFAELTSVPVIPTLMG
WGCIPDDHELMAGMVGLQTAHRYGNATLLASDMVFGIGNRFANRHTGSVEKYTEGRKIVHIDIEPTQIGRVLCPDLGIVS
DAKAALTLLVEVAQEMQKAGRLPCRKEWVADCQQRKRTLLRKTHFDNVPVKPQRVYEEMNKAFGRDVCYVTTIGLSQIAA
AQMLHVFKDRHWINCGQAGPLGWTIPAALGVCAADPKRNVVAISGDFDFQFLIEELAVGAQFNIPYIHVLVNNAYLGLIR
QSQRAFDMDYCVQLAFENINSSEVNGYGVDHVKVAEGLGCKAIRVFKPEDIAPAFEQAKALMAQYRVPVVVEVILERVTN
ISMGSELDNVMEFEDIADNAADAPTETCFMHYE
>Q5FQ97 2.7.1.12~~~~~~Gluconokinase~~~COG3265
MTEHETQMGLKPRFLVVMGVSGTGKTTVATGLATRLGWHFQEGDALHPPANVEKMSTGQPLTDADRAPWLALCHDWLREQ
VKAGHGAVLTCSALKRSYREQLRGDDLPIEFVHIDTSTGELADRLQRREGHFMPASLLPSQLATLEVPGDDEPVIRVSGE
KHPDVVLEELIRHFQAED
>P0DPQ7 1.14.14.-~~~gcoA~~~Aromatic O-demethylase, cytochrome P450 subunit~~~
MTTTERPDLAWLDEVTMTQLERNPYEVYERLRAEAPLAFVPVLGSYVASTAEVCREVATSPDFEAVITPAGGRTFGHPAI
IGVNGDIHADLRSMVEPALQPAEVDRWIDDLVRPIARRYLERFENDGHAELVAQYCEPVSVRSLGDLLGLQEVDSDKLRE
WFAKLNRSFTNAAVDENGEFANPEGFAEGDQAKAEIRAVVDPLIDKWIEHPDDSAISHWLHDGMPPGQTRDREYIYPTIY
VYLLGAMQEPGHGMASTLVGLFSRPEQLEEVVDDPTLIPRAIAEGLRWTSPIWSATARISTKPVTIAGVDLPAGTPVMLS
YGSANHDTGKYEAPSQYDLHRPPLPHLAFGAGNHACAGIYFANHVMRIALEELFEAIPNLERDTREGVEFWGWGFRGPTS
LHVTWEV
>B1W019 4.2.1.138~~~gcoA~~~(+)-caryolan-1-ol synthase~~~
MSQITLPAFHMPFQSAGCHPGLAETREAAWEWAAAEGLDLSVPARRKMIRTRPELWISLIFPQATQAHLDLFCQWLFWAF
LVDDEFDDGPAGRDPLMCERAIARLVDVFDGAAPNGPMERALAGLRDRTCRGRSPQWNRQFRRDTAAWLWTYYAEAVERA
AGQVPSRAEFAKHRRDSVAMQPFLCLHEITAGIDLPDSARSLPAYIALRNAVTDHSGLCNDICSFEKEAALGYEHNAVRL
IQRDRGSTLQEAVDEAGIQLARIAERVQRAERELIEEIEAAGIDGPTRTALERCVRDYRGLVRGDFDYHARAERYTRPDL
VELDERDSLSRHFAA
>P0DPQ8 1.6.2.-~~~gcoB~~~Aromatic O-demethylase, reductase subunit~~~
MTFAVSVGGRRVDCEPGQTLLEAFLRGGVWMPNSCNQGTCGTCKLQVLSGEVDHGGAPEDTLSAEERASGLALACQARPL
ADTEVRSTADAGRVTHPLRDLTATVLEVADIARDTRRVLLGLAEPLAFEAGQYVELVVPGSGARRQYSLANTADEDKVLE
LHVRRVPGGVATDGWLFDGLAAGDRVEATGPLGDFHLPPPDEDDGGPMVLIGGGTGLAPLVGIARTALARHPSREVLLYH
GVRGAADLYDLGRFAEIAEEHPGFRFVPVLSDEPDPAYRGGFPTDAFVEDVPSGRGWSGWLCGPPAMVEAGVKAFKRRRM
SPRRIHREKFTPAS
>P77213 6.3.2.2~~~ybdK~~~Putative glutamate--cysteine ligase 2~~~COG2170
MPLPDFHVSEPFTLGIELEMQVVNPPGYDLSQDSSMLIDAVKNKITAGEVKHDITESMLELATDVCRDINQAAGQFSAMQ
KVVLQAATDHHLEICGGGTHPFQKWQRQEVCDNERYQRTLENFGYLIQQATVFGQHVHVGCASGDDAIYLLHGLSRFVPH
FIALSAASPYMQGTDTRFASSRPNIFSAFPDNGPMPWVSNWQQFEALFRCLSYTTMIDSIKDLHWDIRPSPHFGTVEVRV
MDTPLTLSHAVNMAGLIQATAHWLLTERPFKHQEKDYLLYKFNRFQACRYGLEGVITDPHTGDRRPLTEDTLRLLEKIAP
SAHKIGASSAIEALHRQVVSGLNEAQLMRDFVADGGSLIGLVKKHCEIWAGD
>P9WPK9 6.3.2.2~~~~~~Putative glutamate--cysteine ligase 2~~~COG2170
MPARRSAARIDFAGSPRPTLGVEWEFALVDSQTRDLSNEATAVIAEIGENPRVHKELLRNTVEIVSGICECTAEAMQDLR
DTLGPARQIVRDRGMELFCAGTHPFARWSAQKLTDAPRYAELIKRTQWWGRQMLIWGVHVHVGIRSAHKVMPIMTSLLNY
YPHLLALSASSPWWGGEDTGYASNRAMMFQQLPTAGLPFHFQRWAEFEGFVYDQKKTGIIDHMDEIRWDIRPSPHLGTLE
VRICDGVSNLRELGALVALTHCLIVDLDRRLDAGETLPTMPPWHVQENKWRAARYGLDAVIILDADSNERLVTDDLADVL
TRLEPVAKSLNCADELAAVSDIYRDGASYQRQLRVAQQHDGDLRAVVDALVAELVI
>Q8ZR41 6.3.2.2~~~ybdK~~~Putative glutamate--cysteine ligase 2~~~
MALNDFHVSEPYTLGIELEMQVINPPGYDLSQDSSTLIDAVKPQLTAGEIKHDITESMLEMATGVCRDIDQAAAQLSAMQ
HVILQAASEHHLGICGGGTHPFQKWQRQEVCDNERYQRTLENFGYLIQQATVFGQHVHVGCANGDDAIYLLHGLSHFVPH
FIALSAASPYMQGADTRFACARLNIFSAFPDNGPMPWVSNWQEFAGLFRRLSYTTMIDSIKDLHWDIRPSPAFGTVEVRV
MDTPLTLDHAINMAGLIQATAHWLLTERPFKPQEQDYLLYKFNRFQACRYGLEGVLTDAYTGDRRRLADDTLRLLDNVTP
SARKLGADSAIDALRLQVKKGGNEAQYMREFIADGGSLIGLVQKHCEIWAGQ
>O32174 ~~~gcvH~~~Glycine cleavage system H protein~~~COG0509
MSIPKDLRYSGEHEWVKVEGEKARIGITHFAQSELGDIVFVELPEVGAEIKADEPFGSVESVKTVSELYAPINGTVVEVN
EDLDDSPEFVNESPYEKAWMIVVEPSDASEIEKLMTAEQYEEMTQED
>Q6G2F0 ~~~gcvH~~~Glycine cleavage system H protein~~~COG0509
MSKTYFTQDHEWLSVEGQVVTVGITDYAQEQLGDLVFIDLPQNGTKLSKGDAAAVVESVKAASDVYAPLDGEVVEINAAL
AESPELVNQKAETEGWLWKMTVQDETQLERLLDEAAYKELIG
>P0A6T9 ~~~gcvH~~~Glycine cleavage system H protein~~~COG0509
MSNVPAELKYSKEHEWLRKEADGTYTVGITEHAQELLGDMVFVDLPEVGATVSAGDDCAVAESVKAASDIYAPVSGEIVA
VNDALSDSPELVNSEPYAGGWIFKIKASDESELESLLDATAYEALLEDE
>A0QYG3 ~~~gcvH~~~Glycine cleavage system H protein~~~COG0509
MSEIPADLYYTSEHEWVLRTGDDTVRVGITDYAQSALGDVVFVQLPDVGADVASGDAFGEVESTKSVSDLYAPVTAKVVA
VNGDLEGSPELVNSDPYGEGWLVDLRVEAGTLDEALGGLLDAEGYRAVVTE
>P9WN55 ~~~gcvH~~~Glycine cleavage system H protein~~~COG0509
MSDIPSDLHYTAEHEWIRRSGDDTVRVGITDYAQSALGDVVFVQLPVIGTAVTAGETFGEVESTKSVSDLYAPISGKVSE
VNSDLDGTPQLVNSDPYGAGWLLDIQVDSSDVAALESALTTLLDAEAYRGTLTE
>P64214 ~~~gcvH~~~Glycine cleavage system H protein~~~
MAVPNELKYSKEHEWVKVEGNVATIGITEYAQSELGDIVFVELPETDDEINEGDTFGSVESVKTVSELYAPISGKVVEVN
EELEDSPEFVNESPYEKAWMVKVEISDESQLEALLTAEKYSEMIGE
>Q9WY55 ~~~gcvH~~~Glycine cleavage system H protein~~~COG0509
MKMKKYTKTHEWVSIEDKVATVGITNHAQEQLGDVVYVDLPEVGREVKKGEVVASIESVKAAADVYAPLSGKIVEVNEKL
DTEPELINKDPEGEGWLFKMEISDEGELEDLLDEQAYQEFCAQE
>Q5SKW9 ~~~gcvH~~~Glycine cleavage system H protein~~~COG0509
MDIPKDRFYTKTHEWALPEGDTVLVGITDYAQDALGDVVYVELPEVGRVVEKGEAVAVVESVKTASDIYAPVAGEIVEVN
LALEKTPELVNQDPYGEGWIFRLKPRDMGDLDELLDAGGYQEVLESEA
>Q72C59 1.4.4.2~~~gcvPA~~~Probable glycine dehydrogenase (decarboxylating) subunit 1~~~COG0403
MPFVPHSPEDVSVMLDAIGVNTIEDLFADIPAEMRPKSFALPKGLSEMDVCSRLEALSARNRTDVVSFLGAGFYDHHIPK
AVDALSSRGEFYTAYTPYQPEAAQGTLQAIFEFQTAVCRLLDMDCANASVYDGGSALFEAMMMAVRATRRRKLVIDEALS
PIYRTMLASYTSNLQLELVTVPHRDGLSDMDALKASVDDTCAAVVVQNPNFFGAITDFTDLFTHARAHKALGVISVYPVM
QSVLKTPGEMGADIAVADGQSIGQPLSFGGPYLGIMTCTKPLVRQIPGRIVGRTQDVDGRTGYVLTLQAREQHIRRAKAT
SNICSNQALCALRSLIHLTLLGPEGLVRTAELSMERARYAAERLTALPGVELLHDAPFGNEFAVRLPVSAFEVVDRLTAR
GYVPGFPVGRYYPGMDNVLLVACTEKHSFEQVGILAEMLGGIL
>P64218 1.4.4.2~~~gcvPA~~~Probable glycine dehydrogenase (decarboxylating) subunit 1~~~
MSHRYIPLTEKDKQEMLQTIGAKSIGELFGDVPSDILLNRDLNIAEGEAETTLLRRLNRIASKNITKETHTSFLGAGVYD
HYAPSVVDAMISRSEFYTAYTPYQPEISQGELQAIFEFQTLICELTDMDVANSSMYDGMTSFAEACILAFSQTKKNKIVV
SKGLHYQALQVLHTYAKTRKEFEVVEIDLDGTVTDLKKLEAAVDDETAAVAVQYPNFYGSIEDLEKIQSFIEDKKALFIV
YANPLALGLLTPPGSFGADIVVGDTQPFGIPAQFGGPHCGYFATTKKLMRKVPGRLVGQTQDDEGNRGFVLTLQAREQHI
RRDKATSNICSNQALNALASSIAMSALGKQGIYDIAVQNIEHANYAKQQFIKKGFEVLDGTSFNEFVVKFDKPIQQVNEE
LVKYNIIGGFDLGVVSDDFKNHMLIAVTELRTKDEIDTFVEKAGELND
>P99168 1.4.4.2~~~gcvPB~~~Probable glycine dehydrogenase (decarboxylating) subunit 2~~~
MTSKSSPLIFERSREGRYAYSLPKSDIKTNSVESLLDDKFIRKNKAEFPEVAELDLVRHYTELSNKNFGVDNGFYPLGSC
TMKYNPKINEKVARIPGFSESHPLQDEDQVQGSLEIIYSLQEELKEITGMDEVTLQPAAGAHGEWTALMIFKAYHENNGE
GHRDEVIVPDSAHGTNPASASFAGFKSVTVKSNERGEVDIDDLKRVVNENTAAIMLTNPNTLGIFEKNIMEIREIVHNAG
GLLYYDGANLNAIMDKVRPGDMGFDAVHLNLHKTFTGPHGGGGPGSGPVGVVKELASYLPKPMVIKDGDKFKYDNDIKNS
IGRVKPFYGNFGIYLRAYTYIRTMGATGLKEVSEAAVLNANYIKARLSEHFEIPYKQYCKHEFVLSGVRQKEFGVRTLDM
AKRLLDFGVHPPTIYFPLNVEEGMMIEPTETESKETLDYFIDTLISIAEEAKNDPDKVLEAPHTTVIDRLDEATAARKPI
LKFENLKQEK
>Q5SKW7 1.4.4.2~~~gcvPB~~~Probable glycine dehydrogenase (decarboxylating) subunit 2~~~COG1003
MSFPLIFERSRKGRRGLKLVKAVPKAEDLIPKEHLREVPPRLPEVDELTLVRHYTGLSRRQVGVDTTFYPLGSCTMKYNP
KLHEEAARLFADLHPYQDPRTAQGALRLMWELGEYLKALTGMDAITLEPAAGAHGELTGILIIRAYHEDRGEGRTRRVVL
VPDSAHGSNPATASMAGYQVREIPSGPEGEVDLEALKRELGPHVAALMLTNPNTLGLFERRILEISRLCKEAGVQLYYDG
ANLNAIMGWARPGDMGFDVVHLNLHKTFTVPHGGGGPGSGPVGVKAHLAPYLPVPLVERGEEGFYLDFDRPKSIGRVRSF
YGNFLALVRAWAYIRTLGLEGLKKAAALAVLNARYLKELLKEKGYRVPYDGPSMHEFVAQPPEGFRALDLAKGLLELGFH
PPTVYFPLIVKEALMVEPTETEAKETLEAFAEAMGALLKKPKEWLENAPYSTPVRRLDELRANKHPKLTYFDEG
>P33195 1.4.4.2~~~gcvP~~~Glycine dehydrogenase (decarboxylating)~~~COG0403
MTQTLSQLENSGAFIERHIGPDAAQQQEMLNAVGAQSLNALTGQIVPKDIQLATPPQVGAPATEYAALAELKAIASRNKR
FTSYIGMGYTAVQLPPVILRNMLENPGWYTAYTPYQPEVSQGRLEALLNFQQVTLDLTGLDMASASLLDEATAAAEAMAM
AKRVSKLKNANRFFVASDVHPQTLDVVRTRAETFGFEVIVDDAQKVLDHQDVFGVLLQQVGTTGEIHDYTALISELKSRK
IVVSVAADIMALVLLTAPGKQGADIVFGSAQRFGVPMGYGGPHAAFFAAKDEYKRSMPGRIIGVSKDAAGNTALRMAMQT
REQHIRREKANSNICTSQVLLANIASLYAVYHGPVGLKRIANRIHRLTDILAAGLQQKGLKLRHAHYFDTLCVEVADKAG
VLTRAEAAEINLRSDILNAVGITLDETTTRENVMQLFNVLLGDNHGLDIDTLDKDVAHDSRSIQPAMLRDDEILTHPVFN
RYHSETEMMRYMHSLERKDLALNQAMIPLGSCTMKLNAAAEMIPITWPEFAELHPFCPPEQAEGYQQMIAQLADWLVKLT
GYDAVCMQPNSGAQGEYAGLLAIRHYHESRNEGHRDICLIPASAHGTNPASAHMAGMQVVVVACDKNGNIDLTDLRAKAE
QAGDNLSCIMVTYPSTHGVYEETIREVCEVVHQFGGQVYLDGANMNAQVGITSPGFIGADVSHLNLHKTFCIPHGGGGPG
MGPIGVKAHLAPFVPGHSVVQIEGMLTRQGAVSAAPFGSASILPISWMYIRMMGAEGLKKASQVAILNANYIASRLQDAF
PVLYTGRDGRVAHECILDIRPLKEETGISELDIAKRLIDYGFHAPTMSFPVAGTLMVEPTESESKVELDRFIDAMLAIRA
EIDQVKAGVWPLEDNPLVNAPHIQSELVAEWAHPYSREVAVFPAGVADKYWPTVKRLDDVYGDRNLFCSCVPISEYQ
>P9WN53 1.4.4.2~~~gcvP~~~Probable glycine dehydrogenase (decarboxylating)~~~COG0403
MSDHSTFADRHIGLDSQAVATMLAVIGVDSLDDLAVKAVPAGILDTLTDTGAAPGLDSLPPAASEAEALAELRALADANT
VAVSMIGQGYYDTHTPPVLLRNIIENPAWYTAYTPYQPEISQGRLEALLNFQTLVTDLTGLEIANASMLDEGTAAAEAMT
LMHRAARGPVKRVVVDADVFTQTAAVLATRAKPLGIEIVTADLRAGLPDGEFFGVIAQLPGASGRITDWSALVQQAHDRG
ALVAVGADLLALTLIAPPGEIGADVAFGTTQRFGVPMGFGGPHAGYLAVHAKHARQLPGRLVGVSVDSDGTPAYRLALQT
REQHIRRDKATSNICTAQVLLAVLAAMYASYHGAGGLTAIARRVHAHAEAIAGALGDALVHDKYFDTVLARVPGRADEVL
ARAKANGINLWRVDADHVSVACDEATTDTHVAVVLDAFGVAAAAPAHTDIATRTSEFLTHPAFTQYRTETSMMRYLRALA
DKDIALDRSMIPLGSCTMKLNAAAEMESITWPEFGRQHPFAPASDTAGLRQLVADLQSWLVLITGYDAVSLQPNAGSQGE
YAGLLAIHEYHASRGEPHRDICLIPSSAHGTNAASAALAGMRVVVVDCHDNGDVDLDDLRAKVGEHAERLSALMITYPST
HGVYEHDIAEICAAVHDAGGQVYVDGANLNALVGLARPGKFGGDVSHLNLHKTFCIPHGGGGPGVGPVAVRAHLAPFLPG
HPFAPELPKGYPVSSAPYGSASILPITWAYIRMMGAEGLRAASLTAITSANYIARRLDEYYPVLYTGENGMVAHECILDL
RGITKLTGITVDDVAKRLADYGFHAPTMSFPVAGTLMVEPTESESLAEVDAFCEAMIGIRAEIDKVGAGEWPVDDNPLRG
APHTAQCLLASDWDHPYTREQAAYPLGTAFRPKVWPAVRRIDGAYGDRNLVCSCPPVEAFA
>P74416 1.4.4.2~~~gcvP~~~Glycine dehydrogenase (decarboxylating)~~~COG0403
MPNLEPAVVVPTSEAIAVDLTKLEEKLAPADSFLDRHLGPGETEQRQMLQTLGFDTLGDLIDQAVPPAIRFPRSLQLPAS
QSEYGAIAQLKSIASKNQVFRSYIGMGYYDTITPPVIQRNILENPGWYTAYTPYQAEIAQGRLEALLNFQTMVMDLTGLE
IANASLLDEGTAAAEAMALSYGVSKSKANAFFVAQDCHPQTIEVIKTRANPLGIEVIVGDHHTFSFSTSIFGALLQYPAT
DGAVYDYRSFIDKAHQHQALVTLAADPLSLTLLTPPGELGADIAVGSTQRFGIPLGYGGPHAAYFATKAEYQRKMPGRIV
GVSKDAHGNPALRLALQTREQHIRRDKATSNICTAQVLLAVMASMYGVYHGSTGLKNIALRIHQLTVLLAIGLKRLNYSL
NNDYFFDTLRVGVGEQSAPAILKAAEGRGINLRPLVPGEVGISLDETVTVQDLLDLWQVFAGKDNLPFTPEELWSEVKTS
FPADLTRQSLYLQDAVFNQYHSETELLRYLHQLESKDLALNTSMIPLGSCTMKLNATAEMMPVTWPEFGKIHPFAPAGQT
EGYQILFAQLEAWLGEITGFDAISLQPNAGSQGEYAGLQVIRQYHLSRGEEQRNICLIPESAHGTNPASAVMCGMQVVPV
KCDGEGNIDVEDLTSKAEKYGDRLAALMVTYPSTHGVFEATIGTICDIVHRFGGEVYMDGANMNAQVGLCRPADFGADVC
HLNLHKTFCIPHGGGGPGMGPIGVKSHLQAFLPRTSLNSTAELQAEDQSIGMISAAPYGSASILVISWMYIAMMGPQGLT
KATEVAILSANYMAKRLENYYPILFRGNNELVAHECILDLRPLKKQAAIEVEDVAKRLMDFGFHAPTVSWPVLGTMMVEP
TESESLGELDRFCDAMIAIYQEAQAITHGEIDPADNPLKNAPHTAQSLICGEWNHPYSQEEAAYPAPWTKQFKFWPAVGR
INNTYGDRHLVCSCEGMEAYKEG
>P54378 2.1.2.10~~~gcvT~~~Aminomethyltransferase~~~COG0404
MLKRTPLFDLYKEYGGKTIDFGGWELPVQFSSIKKEHEAVRTAAGLFDVSHMGEVEVSGNDSLSFLQRLMTNDVSALTPG
RAQYTAMCYPDGGTVDDLLIYQKGENRYLLVINASNIDKDLAWMKEHAAGDVQIDNQSDQIALLAVQGPKAEAILKNLTD
ADVSALKPFAFIDEADISGRKALISRTGYTGEDGYEIYCRSDDAMHIWKKIIDAGDAYGLIPCGLGARDTLRFEAKLPLY
GQELTRDITPIEAGIGFAVKHKKESDFFGKSVLSEQKENGAKRKLVGLEMIEKGIPRHGYEVFQNGKSVGKVTTGTQSPT
LGKNVGLALIDSETSEIGTVVDVEIRKKLVKAKVVKTPFYKR
>P27248 2.1.2.10~~~gcvT~~~Aminomethyltransferase~~~COG0404
MAQQTPLYEQHTLCGARMVDFHGWMMPLHYGSQIDEHHAVRTDAGMFDVSHMTIVDLRGSRTREFLRYLLANDVAKLTKS
GKALYSGMLNASGGVIDDLIVYYFTEDFFRLVVNSATREKDLSWITQHAEPFGIEITVRDDLSMIAVQGPNAQAKAATLF
NDAQRQAVEGMKPFFGVQAGDLFIATTGYTGEAGYEIALPNEKAADFWRALVEAGVKPCGLGARDTLRLEAGMNLYGQEM
DETISPLAANMGWTIAWEPADRDFIGREALEVQREHGTEKLVGLVMTEKGVLRNELPVRFTDAQGNQHEGIITSGTFSPT
LGYSIALARVPEGIGETAIVQIRNREMPVKVTKPVFVRNGKAVA
>P9WN51 2.1.2.10~~~gcvT~~~Aminomethyltransferase~~~COG0404
MSDVPELIHGPLEDRHRELGASFAEFGGWLMPVSYAGTVSEHNATRTAVGLFDVSHLGKALVRGPGAAQFVNSALTNDLG
RIGPGKAQYTLCCTESGGVIDDLIAYYVSDDEIFLVPNAANTAAVVGALQAAAPGGLSITNLHRSYAVLAVQGPCSTDVL
TALGLPTEMDYMGYADASYSGVPVRVCRTGYTGEHGYELLPPWESAGVVFDALLAAVSAAGGEPAGLGARDTLRTEMGYP
LHGHELSLDISPLQARCGWAVGWRKDAFFGRAALLAEKAAGPRRLLRGLRMVGRGVLRPGLAVLVGDETVGVTTSGTFSP
TLQVGIGLALIDSDAGIEDGQQINVDVRGRAVECQVVCPPFVAVKTR
>P64225 2.1.2.10~~~gcvT~~~Aminomethyltransferase~~~
MSSDLKQTPLYQNYVDRGAKIVEFGGWAMPVQFSSIKEEHNAVRYEIGLFDVSHMGEIEVTGKDASQFVQYLLSNDTDNL
TTSKALYTALCNEEGGIIDDLVIYKLADDNYLLVVNAANTEKDFNWILKHKEKFDVEVQNVSNQYGQLAIQGPKARDLIN
QLVDEDVTEMKMFEFKQGVKLFGANVILSQSGYTGEDGFEIYCNIDDTEKIWDGLLEYNVMPCGLGARDTLRLEAGLPLH
GQDLTESITPYEGGIAFASKPLIDADFIGKSVLKDQKENGAPRRTVGLELLEKGIARTGYEVMDLDGNIIGEVTSGTQSP
SSGKSIALAMIKRDEFEMGRELLVQVRKRQLKAKIVKKNQIDK
>Q9WY54 2.1.2.10~~~gcvT~~~Aminomethyltransferase~~~COG0404
MKRTPLFEKHVELGAKMVDFAGWEMPLYYTSIFEEVMAVRKSVGMFDVSHMGEFLVKGPEAVSFIDFLITNDFSSLPDGK
AIYSVMCNENGGIIDDLVVYKVSPDEALMVVNAANIEKDFNWIKSHSKNFDVEVSNISDTTALIAFQGPKAQETLQELVE
DGLEEIAYYSFRKSIVAGVETLVSRTGYTGEDGFELMLEAKNAPKVWDALMNLLRKIDGRPAGLGARDVCRLEATYLLYG
QDMDENTNPFEVGLSWVVKLNKDFVGKEALLKAKEKVERKLVALELSGKRIARKGYEVLKNGERVGEITSGNFSPTLGKS
IALALVSKSVKIGDQLGVVFPGGKLVEALVVKKPFYRGSVRREV
>Q59111 2.8.3.12~~~gctA~~~Glutaconate CoA-transferase subunit A~~~COG1788
MSKVMTLKDAIAKYVHSGDHIALGGFTTDRKPYAAVFEILRQGITDLTGLGGAAGGDWDMLIGNGRVKAYINCYTANSGV
TNVSRRFRKWFEAGKLTMEDYSQDVIYMMWHAAALGLPFLPVTLMQGSGLTDEWGISKEVRKTLDKVPDDKFKYIDNPFK
PGEKVVAVPVPQVDVAIIHAQQASPDGTVRIWGGKFQDVDIAEAAKYTIVTCEEIISDEEIRRDPTKNDIPGMCVDAVVL
APYGAHPSQCYGLYDYDNPFLKVYDKVSKTQEDFDAFCKEWVFDLKDHDEYLNKLGATRLINLKVVPGLGYHIDMTKEDK
>Q59112 2.8.3.12~~~gctB~~~Glutaconate CoA-transferase subunit B~~~COG2057
MADYTNYTNKEMQAVTIAKQIKNGQVVTVGTGLPLIGASVAKRVYAPDCHIIVESGLMDCSPVEVPRSVGDLRFMAHCGC
IWPNVRFVGFEINEYLHKANRLIAFIGGAQIDPYGNVNSTSIGDYHHPKTRFTGSGGANGIATYSNTIIMMQHEKRRFMN
KIDYVTSPGWIDGPGGRERLGLPGDVGPQLVVTDKGILKFDEKTKRMYLAAYYPTSSPEDVLENTGFDLDVSKAVELEAP
DPAVIKLIREEIDPGQAFIQVPTEAK
>P0A9F6 ~~~gcvA~~~Glycine cleavage system transcriptional activator~~~COG0583
MSKRLPPLNALRVFDAAARHLSFTRAAEELFVTQAAVSHQIKSLEDFLGLKLFRRRNRSLLLTEEGQSYFLDIKEIFSQL
TEATRKLQARSAKGALTVSLLPSFAIHWLVPRLSSFNSAYPGIDVRIQAVDRQEDKLADDVDVAIFYGRGNWPGLRVEKL
YAEYLLPVCSPLLLTGEKPLKTPEDLAKHTLLHDASRRDWQTYTRQLGLNHINVQQGPIFSHSAMVLQAAIHGQGVALAN
NVMAQSEIEAGRLVCPFNDVLVSKNAFYLVCHDSQAELGKIAAFRQWILAKAAAEQEKFRFRYEQ
>A0A0H3JT43 ~~~~~~Glycine cleavage system H-like protein~~~
MKKLANYLWVEKVGDLYVFSMTPELQDDIGTVGYVEFVSPDEVKVDDEIVSIEASKTVIDVQTPLSGTIIERNTKAEEEP
TILNSEKPEENWLFKLDDVDKEAFLALPEA
>P0DN69 ~~~~~~Glycine cleavage system H-like protein~~~
MKKIANYLLIEKTDDRYTISMTPELQDDIGTIGYAEFTDNDHLAVDDIILNLEASKTVMSVLSPLAGAVVERNEAATLTP
TLLNSEKAEENWIVVLTDVDQAAFDALEDA
>P0A9I3 ~~~gcvR~~~Glycine cleavage system transcriptional repressor~~~COG2716
MTLSSQHYLVITALGADRPGIVNTITRHVSSCGCNIEDSRLAMLGEEFTFIMLLSGSWNAITLIESTLPLKGAELDLLIV
MKRTTARPRPPMPASVWVQVDVADSPHLIERFTALFDAHHMNIAELVSRTQPAENERAAQLHIQITAHSPASADAANIEQ
AFKALCTELNAQGSINVVNYSQHDEQDGVK
>A9CES4 1.1.1.16~~~~~~Galactitol 2-dehydrogenase~~~COG1028
MRLNNKVALITGAARGIGLGFAQAFAAEGAKVIIADIDIARATTSAAAIGPAAKAVKLDVTDLAQIDAVVKAVDEEFGGI
DILVNNAAIFDMAPINGITEESYERVFDINLKGPMFMMKAVSNVMIARARGGKIINMASQAGRRGEALVTLYCASKAAII
SATQSAALALVKHGINVNAIAPGVVDGEHWEVVDAHFAKWEGLKPGEKKAAVAKSVPIGRFATPDDIKGLAVFLASADSD
YILAQTYNVDGGNWMS
>Q1MLL4 1.1.1.16~~~gdh~~~Galactitol 2-dehydrogenase~~~COG1028
MSYQQKFRLDGERAVVTGGGRAIGLCCTEALAEAGAAVVVIERSEADAEQALALRNRGYDVEVRVGDVTDAARMDAIATE
LADGGRPATILVNNAGIGQSGIPAQDLTDADWLRMMDVNLNGVFWCSRAFGRSMISMKRGAIVNLGSMSGTICNRPQPQT
AYNVSKAAVHHLTRSLAAEWAHHGIRVNAVAPTYIETPMVVAVEANRERIPLWLADTPMARMGTPEEVASAVLFLASGAA
SLMTGAIVNVDAGFTCW
>P49856 ~~~gdnC~~~Probable guanidinium efflux system subunit GdnC~~~COG2076
MKWGLVVLAAVFEVVWVIGLKHADSALTWSGTAIGIIFSFYLLMKATHSLPVGTVYAVFTGLGTAGTVLSEIVLFHEPVG
WPKLLLIGVLLIGVIGLKLVTQDETEEKGGEA
>P49857 ~~~gdnD~~~Probable guanidinium efflux system subunit GdnD~~~COG2076
MLHWISLLCAGCLEMAGVALMNQYAKEKSVKWVLLIIVGFAASFSLLSYAMETTPMGTAYAVWTGIGTAGGALIGILFYK
EQKDAKRIFFIALILCSAVGLKILS
>Q9S3U6 1.13.11.4~~~xlnE~~~Gentisate 1,2 dioxygenase 1~~~
MSFTEKPAVTKERKEFYSKLESHDLAPLWEVLNEVVTTKPKSNCAPHLWEFEVAKEFLMEAGTLITAKEAERRVLILENP
GLKGLSRITTSLYAGLQLILPGEVAPTHRHSQSALRFVVDGGGACTSVDGERTTMQVGDFVITPPWAWHDHVNDSDKPMI
WMDGLDLPMVTLFDTSFAEGYGEDIQEITRPNGDSLARYGANMLPVDFKQKGLSSPIFNYPYERSREALEAMKKANEWDP
CHGLKMQYINPLDGMAAMPTISSFIQLLPKEFRTQTYRSTDATVFSVIEGQGKTRIGDKVFFWKAKDTFVVPSWYPVEHE
ASSDAVLFSYSDRVAQQKLGFWRESRN
>Q0QFQ2 1.13.11.4~~~hbzE~~~Gentisate 1,2 dioxygenase 2~~~
MSELNLGTVEDLPKDYYEQLVANNTLPLWPSLRSVLPHGKPARRTRPVIWRYADVRPDLLRAGDLTPIEKAEGRVLVLCN
PGLGLENMQVTGSIYIGLQLIQPGETAPNHKHSPSAVRFVIEGKGGYTLVNGEKLPMEKGDLILTPPGMWHQHGHEGDGP
VVWLDALDLPLIYGIDASYHIDGEEQTLDKAAGTCTSRYRQSGLLPYSALDRKTSFPLLRFPWQEVRESLKQFAAVTPAG
ELVQLAYVNPETGAECLPTLGFSAIMLRPGETIRLQRRSASGVLHVVEGAGSVVVDDTRHAFTEADTLAIPTHAAVTLSN
ASSTEPTYLFMVDDAPLHRKLGIYEVFS
>A0A8F4N283 3.1.4.46~~~bagdpd~~~Glycerophosphodiester phosphodiesterase~~~
MTKIFAHRGFKGNYPENTMIAFEHALHSGADGIELDVQLTKDGRLAVIHDEKLNRTTNMKGLVKDYTYEELKRGDASHSF
YEETGAVTIPLLEEVLELVTQRSSFMINIELKNSIYRYPGIEEKVKEQIEHFQIEDRVLVSSFHHGSLALFHKLMPHIEL
AVLTMDVIHQPDMYLKTIPAKGYHPNIKGAGVTKEVVSALHADQQVIRPFTVNSEKQIKNMLTLGVDGIFTDFPDRAVKI
REEMK
>Q8RB32 3.1.4.46~~~UgpQ~~~Glycerophosphodiester phosphodiesterase~~~COG0584
MKTLVIAHRGDSKNVPENTIAAFKRAMELGADGIELDVQLTKDGHLVVIHDETVDRTTNGEGFVKDFTLEEIKKLDAGIK
FGEKFAGERIPTLYEVFELIGDKDFLVNIEIKSGIVLYPGIEEKLIKAIKEYNFEERVIISSFNHYSLRDVKKMAPHLKI
GLLYQCGLVEPWHMALRMEAYSLHPFYFNIIPELVEGCKKNGVKLFPWTVDRKEDMERMIKAGVDGIITDDPETLINLVR
KGG
>Q9A9H3 2.6.1.102~~~per~~~GDP-perosamine synthase~~~COG0399
MSDLPRISVAAPRLDGNERDYVLECMDTTWISSVGRFIVEFEKAFADYCGVKHAIACNNGTTALHLALVAMGIGPGDEVI
VPSLTYIASANSVTYCGATPVLVDNDPRTFNLDAAKLEALITPRTKAIMPVHLYGQICDMDPILEVARRHNLLVIEDAAE
AVGATYRGKKSGSLGDCATFSFFGNKIITTGEGGMITTNDDDLAAKMRLLRGQGMDPNRRYWFPIVGFNYRMTNIQAAIG
LAQLERVDEHLAARERVVGWYEQKLARLGNRVTKPHVALTGRHVFWMYTVRLGEGLSTTRDQVIKDLDALGIESRPVFHP
MHIMPPYAHLATDDLKIAEACGVDGLNLPTHAGLTEADIDRVIAALDQVLV
>Q7DBF3 2.6.1.102~~~perA~~~GDP-perosamine synthase~~~COG0399
MKYIPVYQPSLTGKEKEYVNECLDSTWISSKGNYIQKFENKFAEQNHVQYATTVSNGTVALHLALLALGISEGDEVIVPT
LTYIASVNAIKYTGATPIFVDSDNETWQMSVSDIEQKITNKTKAIMCVHLYGHPCDMEQIVELAKSRNLFVIEDCAEAFG
SKYKGKYVGTFGDISTFSFFGNKTITTGEGGMVVTNDKTLYDRCLHFKGQGLAVHRQYWHDVIGYNYRMTNICAAIGLAQ
LEQADDFISRKREIADIYKKNINSLVQVHKESKDVFHTYWMVSILTRTAEEREELRNHLADKLIETRPVFYPVHTMPMYS
EKYQKHPIAEDLGWRGINLPSFPSLSNEQVIYICESINEFYSDK
>Q06953 2.6.1.102~~~rfbE~~~GDP-perosamine synthase~~~
MIPVYEPSLDGNERKYLNDCIDSGWVSSRGKYIDRFETEFAEFLKVKHATTVSNGTVALHLAMSALGITQGDEVIVPTFT
YVASVNTIVQCGALPVFAEIEGESLQVSVEDVKRKINKKTKAVMAVHIYGQACDIQSLRDLCDEHGLYLIEDCAEAIGTA
VNGKKVGTFGDVSTFSFFGNKTITSGEGGMVVSNSDIIIDKCLRLKNQGVVAGKRYWHDLVAYNYRMTNLCAAIGVAQLE
RVDKIIKAKRDIAEIYRSELAGLPMQVHKESNGTFHSYWLTSIILDQEFEVHRDGLMTFLENNDIESRPFFYPAHTLPMY
EHLAEKTAFPLSNSYSHRGINLPSWPGLCDDQVKEICNCIKNYFNCI
>P37484 3.1.4.59~~~gdpP~~~Cyclic-di-AMP phosphodiesterase GdpP~~~COG3887
MPSFYEKPLFRYPIYALIALSIITILISFYFNWILGTVEVLLLAVILFFIKRADSLIRQEIDAYISTLSYRLKKVGEEAL
MEMPIGIMLFNDQYYIEWANPFLSSCFNESTLVGRSLYDTCESVVPLIKQEVESETVTLNDRKFRVVIKRDERLLYFFDV
TEQIQIEKLYENERTVLAYIFLDNYDDVTQGLDDQTRSTMNSQVTSLLNAWAQEYGIFLKRTSSERFIAVLNEHILTELE
NSKFSILDEVREKTSFDGVALTLSVGVGASVSSLKELGDLAQSSLDLALGRGGDQVAIKLPNGKVKFYGGKTNPMEKRTR
VRARVISHALKEIVTESSNVIIMGHKFPDMDSIGAAIGILKVAQANNKDGFIVIDPNQIGSSVQRLIGEIKKYEELWSRF
ITPEEAMEISNDDTLLVIVDTHKPSLVMEERLVNKIEHIVVIDHHRRGEEFIRDPLLVYMEPYASSTAELVTELLEYQPK
RLKINMIEATALLAGIIVDTKSFSLRTGSRTFDAASYLRAKGADTVLVQKFLKETVDSYIKRAKLIQHTVLYKDNIAIAS
LPENEEEYFDQVLIAQAADSLLSMSEVEASFAVARRDEQTVCISARSLGEVNVQIIMEALEGGGHLTNAATQLSGISVSE
ALERLKHAIDEYFEGGVQR
>A4ITV2 3.1.4.59~~~gdpP~~~Cyclic-di-AMP phosphodiesterase GdpP~~~COG3887
MSHFYERKTYRYPSYALAALAVLMAVSLFYFQWMLGLVGLLGVGFLLYYVIWSQRSLHKELQQYISNLSYRVKKVSEEAL
MQMPIGILLLDEEDKIEWSNRFLAACFKEQTLIGRSLAELSEPLAAFVKKGKTDEEIIELNGKQLKVIVHRHERLLYFFD
VTEHMELRRRYEIERLVLAIIFLDNYDEITQGMDDQAKSQMNSLVTSVLNRWANDYGIFLKRTSSDRFIAVLNEHILTQL
EKSKFSILDEVREQTAKHQAQITLSIGIGAGVSSLPELGTLAQSSLDLALGRGGDQVAIKQGNGKVKFYGGKTNPMEKRT
RVRARVISHALRELIAESDKVLIMGHKYPDMDALGAAIGILKVVQSNQKEGFLVVDAMKTDAGAQRLLEEMKKQADLWAR
CIKPEQALELITEDTLLIVVDTHRPSLVIEERLLYRADHIVVIDHHRRGEEFIEAPILVYMEPYASSTSELVTELLEYQP
KRVKLSMLEATALLAGIVVDTKSFTLRTGSRTFDAASYLRAQGADTVLVQKLLRESVANYVKRAKLIERAAIDEHGIAIA
KGDENEVHDQVLIAQTADTLLTLSGVVASFVISKRGDGTVGISARSLGDVNVQVIMERLGGGGHLTNAAAQLSDVTVGEA
EQQLREAIHDYFEGGKPV
>O53423 3.1.-.-~~~~~~GDSL-like esterase Rv1075c~~~COG2755
MPRRSTIALATAGALASTGTAYLGARNLLVGQATHARTVIPKSFDAPPRADGVYTRGGGPVQRWRREVPFDVHLMIFGDS
TATGYGCASAEEVPGVLIARGLAEQTGKRIRLSTKAIVGATSKGVCGQVDAMFVVGPPPDAAVIMIGANDITALNGIGPS
AQRLADCVRRLRTRGAVVVVGTCPDLGVITAIPQPLRALAHTRGVRLARAQTAAVKAAGGVPVPLGHLLAPKFRAMPELM
FSADRYHPSAPAYALAADLLFLALRDALTEKLDIPIHETPSRPGTATLEPGHTRHSMMSRLRRPRPARAVPTGG
>O33363 3.1.1.-~~~~~~GDSL lipase Rv0518~~~COG2755
MSRPGTYVIGLTLLVGLVVGNPGCPRSYRPLTLDYRLNPVAVIGDSYTTGTDEGGLGSKSWTARTWQMLAARGVRIAADV
AAEGRAGYGVPGDHGNVFEDLTARAVQPDDALVVFFGSRNDQGMDPEDPEMLAEKVRDTFDLARHRAPSASLLVIAPPWP
TADVPGPMLRIRDVLGAQARAAGAVFVDPIADHWFVDRPELIGADGVHPNDAGHEYLADKIAPLISMELVG
>O69279 ~~~gdx~~~Guanidinium exporter~~~
MSWIVLLIAGLLEVVWAIGLKYTHGFTRLTPSIITIAAMIVSIAMLSWAMRTLPVGTAYAVWTGIGAVGAAITGILLLGE
SASPARLLSLGLIVAGIIGLKLSTH
>P69937 ~~~gdx~~~Guanidinium exporter~~~COG2076
MSWIILVIAGLLEVVWAVGLKYTHGFSRLTPSVITVTAMIVSMALLAWAMKSLPVGTAYAVWTGIGAVGAAITGIVLLGE
SANPMRLASLALIVLGIIGLKLSTH
>Q833V7 3.4.24.30~~~gelE~~~Gelatinase~~~COG3227
MMKGNKILYILGTGIFVGSSCLFSSLFVAAEEQVYSESEVSTVLSKLEKEAISEAAAEQYTVVDRKEDAWGMKHLKLEKQ
TEGVTVDSDNVIIHLDRNGAVTSVTGNPVDQVVKIQSVDAIGEEGVKKIIASDNPETKDLVFLAIDKRVNNEGQLFYKVR
VTSSPTGDPVSLVYKVNATDGTIMEKQDLTEHVGSEVTLKNSFQVAFNVPVEKSNTGIALHGTDNTGVYHAVVDGKNNYS
IIQAPSLVALNQNAVDAYTHGKFVKTYYEDHFQRHSIDDRGMPILSVVDEQHPDAYDNAFWDGKAMRYGETSTPTGKTYA
SSLDVVGHEMTHGVTEHTAGLEYLGQSGALNESYSDLMGYIISGASNPEIGADTQSVDRKTGIRNLQTPSKHGQPETMAQ
YDDRARYKGTPYYDQGGVHYNSGIINRIGYTIIQNLGIEKAQTIFYSSLVNYLTPKAQFSDARDAMLAAAKVQYGDEAAS
VVSAAFNSAGIGAKEDIQVNQPSESVLVNE
>O82833 4.2.2.25~~~~~~Gellan lyase~~~
MRFSWKKLVSAALVMALLVGIVYPAASGRGAVASAASGTTVELVPTDDAFTSAVAKDANANGTWMQLKGSIGGQRYIYMK
FDLTALAGVEADRIENAKVWLKKMGTNGTAMTVGLRAVDDTSWSESTLTWNNAPVYGSQVLSQQSVLSTPDVYYPFDLDE
YLKTQLAAGKSKLAIAFVPISTLNENMEFYARESTANTPKLVVELKDEPPAPTGLMQLVQSFGGHNKGHLRVVEFDATPA
STTGNGTVGITADGAAPAAAADFPIALRFGTDGTIKAANAAGFESKTPVNYTAGQKYHVKAMINLSLGTYDLWLTPPNAG
QPVLLAADYAFAASAPALNDIGGVHATADAQSTDVPAVANARLIADHFVSKAPFKDEQGQSLAIRLESDNSLANRSYAIK
FDMNLTGNPLETDALISYADRSVTLNGFPDLAYIVRSNFGNFDVRNDNVYASSHPSTAQSNRTYQVEVRINPASGTGQPT
RTYDVWIAPEGEQPVQLADQFKARNYANTGYALNNIGQAFVYSQADGLLSIDNHVVQDGQRLDEALARVNAASGEAAMTA
ALESNALGLPMERYRLMDAAKRAQVAQDVLAGRPAEGYAHALSVQAVFVSAVANRLDTENPTAPANVQVAISNTMQAHVS
WTASSDDTGILYYKVFRDGALVGTVTNATSFVDNGLAPATEYTYVVKAYDLVLKEAVSQPATATSPGEQAQVRIPFSAEA
IATAFGQPLLDYNLETHSGTLKWVMEWREEYEKSANALKLLTLLSASAPDYIGPDGVTTASAKALQHLRSVTAGGNEPGF
AGNGLSGQGYMPLLSAIVMAKKKAPAIWNALTAAEKEKLDLMILAGLYGAKFAYDDENDNKTGIDATGNFDKEWNPNHRS
GIAGAIMAMYYFEDAQWLNDQMRSFNYDDWLARLTAAGLTNVRTIYQNSGKTLTEREIRKDAAGDGFVYKGHPLSQPGKI
MAEFVNYTFSHPVSPVGGFDSGIGKYRGYIVDGQDDLPNLGADSMGFEFDTLDANGKRSSLVYVFMGWKPNVDAITPVLL
LDNIDSGLTSAETRDVVSRLSIGTTDMLYKNEHGYMTYAKGVNEGVKSLNGPILTINEEIWNRILNNPAAPMEAVNQASS
AGQMRTALEASALGMILYGYGALSETGKNAVAQHVLDARPAAGYANKAAAQNELYEGVRLQALLALSQAQTAEQMRSALE
SRALGLYKPKYETASQDKKQFVAQYLLDNKPADGFLTKTEVREQVESALEPQGNQLRNLPPLASGEKRINLADYDHWPQQ
HGDAEVALWADDKTGAFSLTIDDNFENEHDTWRSLAQQYGFKFSWFVITSLIKDPNKWRTLAAEGHEIGSHTVTHEDKGS
TLDPAHLHSEYADSQALLNTIEGVRATTLAYPFGSGREDIAAEYYIAARGTVGLPNPADSINYMNTQSLSVRPGSLELTN
QAANGNSVEAMVKTLVDPNHKVWSASYYRGWSNMLVHSLNESGKTPSDGVTRTSRDLTQYLLTLLDTYRDQIWVGRYGDI
VRYSQQRDTAHIVVTRKDDRKITFNLTDRMDDTLFDYPLTVKVRVDDAWSDIGATQAGEPIPFVETIRDGKRYLLVKAVP
DKGSVSIVPDAASPLNVVNGAVTSEQMLSAIAAPGLGLDLGEFNALGAGKKRMVGSRLLEVRPADGYADAAALQDALDAA
VEEANNAPSLSENASLSDLKVNGVTIAGFAPETYAYDIMLPEGTTALPVVSFKVADTGKATAVLQNAPALPGTAKVTVTA
EDNWTVATYTLRFQVRISALQRVNTAPDASAMRTAIENAALGLVLAAYNGLTSEQKNSVAASVLTHRPATGYADVQAVQA
ELNAALPKINAPLLAHAIVDQLNPDTVSTANWTNLYGGTSGRKGGVYMKFNIASLAGLEADAIGDAKVQFFTTREGTVIG
YAAPSSWEAPLTWNTQPLADLKNSNMAALAEIGRTAVQATGANYEMNITQYVKDAAAADKTELSLVLLGSNNTNITMQKI
PTAFALSVTLATYGEPNPEPSPLAAVNEAGDAAAMQGAIAAVELDLNLTAYNGLTAAQRIDVAQALLDNRPAAGYAHALA
VQVALDAAVAAAQPANQAPGGTLAASAEQLQPGQQLELTVGVSDASRFTGADILVHYDPQALTFATELYEGVRMLKAEAI
ASLQANYQVAAAMAEQPGTIKILLFTAGAGQPLSGTLPLFKLRASVKDDAQTGVSTAVSLSDFELTFEGEDSVWPDTTRA
AVSLQIAAHPVEADKTALIAKIAHAQALLTGATVGANPGQYPQAAYDALADAIGLAEEKRDLTGVSQAAVDEAVASLGTA
EQQFLNAVIPGVPADLTALNAAIAKAQRLHDNGPYGEKIGQYPQSAKVPLKSALDAAKAVGGSGASSQESVNAAAASLNG
AIQTFERSLVTLVGGGATKVGIRDLSIVAKYYGVTSSDPNWGKVSAAAIDGGNEITIEVLAAVARMILADWAAGQ
>Q8NLB7 ~~~genK~~~Gentisate transporter~~~COG2814
MTSHAPESGGLVTESTLGASNSSQTIENKGLTILGISGRRLAAVLIGWFFVIFDGYDLIVYGTVQSALAKEWNLSSATLG
TIGSTAFFGMAIGAVFIGRLSDRVGRKAAVIGSVLILSVFTMLCAFAPNPWVFGAFRFIAGLGLGGLVPSVNAMTSDLVP
RKTMSAWATVMMSGVPIGGSIAAVLALVVVPSSEEWGWRFMFLIALIPLVVGLPIAMKVIPSDKAIKADHDIREGHDEPA
GFKDLLVDRYRWISIWFALATFVTLLAWYGLGTWLPRLMETAGYEFGHALMFTLALNLGAVIGSVVTAWAGDRFGPIRSG
VIAAGIAGIALLLLLTYPPVTAVYVILILAGVGTHGTQILIIAAVANFYPSNLRGTALGWALGVGRIGAVVAPQLAGLLL
AWNLGVNSNFIMFGTAALLSALALSVLLRLQKTYSVTHKVEIQG
>H1ZV38 1.1.1.347~~~geoA~~~Geraniol dehydrogenase~~~
MNDTQDFISAQAAVLRQVGGPLAVEPVRISMPKGDEVLIRIAGVGVCHTDLVCRDGFPVPLPIVLGHEGSGTVEAVGEQV
RTLKPGDRVVLSFNSCGHCGNCHDGHPSNCLQMLPLNFGGAQRVDGGQVLDGAGHPVQSMFFGQSSFGTHAVAREINAVK
VGDDLPLELLGPLGCGIQTGAGAAINSLGIGPGQSLAIFGGGGVGLSALLGARAVGADRVVVIEPNAARRALALELGASH
ALDPHAEGDLVAAIKAATGGGATHSLDTTGLPPVIGSAIACTLPGGTVGMVGLPAPDAPVPATLLDLLSKSVTLRPITEG
DADPQRFIPRMLDFHRAGKFPFDRLITRYRFDQINEALHATEKGEAIKPVLVF
>H1ZV37 1.2.1.86~~~geoB~~~Geranial dehydrogenase~~~
MTIDHQHIFVGGQWIAPKSTQRSNILNASTEELVGSVPKCNNEDMDRAVAAAREAMRSLAWAGLDGKGRAQHLRRFADAV
ERRGQQLARSVSLQNGMPINVADQLESAFVVSLLRYYASLAENLVEEEARPSPTGSTTLVRRDPVGVVGAIIPWNFPVAL
SIFKIAPALAAGCAVVVKPSSGTVLDSYVLAEAAAEAGLPPGVINWVPGDRGIGSHLVSHPGVDKVAFTGSTSAGRIIAE
ACARLLRPVTLELGGKSAAIVLEDADLDALIRSLPMSSVLNNGQACFSCTRILAPAGRYDEVVDAIAGAVSAYSVGDALD
RATVVGPMASAAHRDSVQRYIELGTGEARLVVGGGRTSQDRGWFVQPTVFADVDNRSRIAREEIFGPVLSIIRYEGEDEA
VEIANDSEYGLGGTVWSTDHDHAVTIARRMETGTVGINGYMPDLNAPFGGVKSSGMGRELGPESIGAYQRYKSVYLLG
>P07868 ~~~gerAA~~~Spore germination protein A1~~~COG0697
MEQTEFKEYIHDNLALVLPKLKENDDLVKNKKMLANGLVFYYLYFSEMTDENKVSEAIKTLIKDEETLTLDQVKKRLDQL
DARPVETAKKTIESILNGNCAVFINGLDKAYILTTGKKKTRSLTEPTTEKVVRGPKVAFVEDIDTNLALIRQRTSHPKLI
TKKIMIGENKLKPAAIMYIEGKAKKSVIKEVKARLKNIQLEDIQDSGTLEELIEDNKYSPFPQIQNTERPDKVSSALFNG
RVAILVDSSPFVLLVPVSLGILMQSPDDYYERWISASLIRSLRFASIFITLFLSSIYITLVSFHQGLLPTALAVTISANR
ENVPFPPIFEALLMEVTIELLREAGLRLPNPLGQTIGLVGGVVIGQAAVEANLVSSILVIVVSVIALASFTVPQYGMGLS
FRVLRFISMFSAAILGLYGIILFMLVVYTHLTRQTSFGSPYFSPNGFFSLKNTDDSIIRLPIKNKPKEVNNPNEPKTDST
ET
>P07869 ~~~gerAB~~~Spore germination protein A2~~~COG0814
MSQKQTPLKLNTFQGISIVANTMLGAGLLTLPRALTTKANTPDGWITLILEGFIFIFFIYLNTLIQKKHQYPSLFEYLKE
GLGKWIGSIIGLLICGYFLGVASFETRAMAEMVKFFLLERTPIQVIILTFICCGIYLMVGGLSDVSRLFPFYLTVTIIIL
LIVFGISFKIFDINNLRPVLGEGLGPIANSLTVVSISFLGMEVMLFLPEHMKKKKYTFRYASLGFLIPIILYILTYIIVV
GALTAPEVKTLIWPTISLFQSFELKGIFIERFESFLLVVWIIQFFTTFVIYGYFAANGLKKTFGLSTKTSMVIIGITVFY
FSLWPDDANQVMMYSDYLGYIFVSLFLLPFILFFIVALKRRITTK
>P07870 ~~~gerAC~~~Spore germination protein A3~~~
MKIRILCMFICTLLLSGCWDSENIEELSLVIGIGLDKPDDENLELTQQILVPKIISAKEGSSSDPTQLSITKGKTVHQMM
RTSALKHKPTFSQHLRLILLSKSVIADQIGMDAIINQFVRDNGTRRSSYVFITNGRTKDIFNMNDEGEPASNVIYDLTEN
NKVTIRTMEPVTLGEISEHLTSDDSFLIPHVGKENGKLAINGASIIKNKLWHRDLTPIEVQNISLFSGTVEGGVIDLKRD
GHLFSYEVYSSNRKIKTAYKDGKFKFTVTRNIEGRLSEDWNPNEDSFKDSYIKSIEKTVEKRVHETVTSFITEKLQKEIK
ADVTGLGNEVRIHYPQKWKKISRKWDDDYFSNAEIDYRVNVIVRDFGTKGANK
>B2J4A4 4.2.3.90~~~~~~Germacrene A synthase~~~COG0664
MNQLLCPGLYCPFPSQTNKYVDVLEEYSLEWVLRFNLLANESAYKRFCKSKFFFLAASAYPDSKFEELKITHDWLSWVFI
WDDQCDLSELKKQPEVLNNFHQRYLEILNGAELTSQDTLFSHALIDLRKRTLQRASIKWFNYFISYLEDYFYGCVQEATN
RAKGIVPDLDTYIMIRRSSVGVYAVLALSEFCNQFIIPDVLRNHHLVKKLELITTDIIAWSNDIFSASREIASGDVHNLI
FVLHYHKKISLEKAIEQVVKIHNEEVHSLIKVESSLSFFSEELDVEITKYISGMHSWIRGNLDWCYESYRYHNLERLELT
EFK
>P39569 ~~~gerBA~~~Spore germination protein B1~~~COG0697
MQIDSDLQNNLDTLKKTLGQNDDMMFYTFAFGDSRQKACLLYIDGLTENKMLAQYVISPLQKEALAHKECSIEDLSAFFF
GFHHSVVSTMKEIEQLVFSGQAILLADGYRGGLAFDTKSVATRSLDEPSSEVVERGPKIGFIEKLRTNTALLRERTSDPN
LVIKEMTLGKRTKKKIAVAYIQDIAPDYVVKEVFKRLKSVNIDNLPESGTLEQLIEDEPFSIFPTILSTERPDRVESSLL
EGRVSILVDGTPFALIVPATVDEFIHSPDDYSQRWIPMSLVRLLRYSSILITIYLPGLYISLVSFHTGLLPTRMAISIAG
SRLNVPFPPFVEAFIMIFTIELIREAGLRLPKPIGQTIGLIGGVVIGQAAVQAQIVSALMVIVVSVTALASFTVPSYAYN
FPLRIIRIGVMISATALGMYGVIMVYLFVIGHLMRLKSFGQDYIIPIMAQPGQDLKDTVIRIPTMFLKRRPTRNDPEDNI
RQR
>P39570 ~~~gerBB~~~Spore germination protein B2~~~COG0814
MRKSEHKLTFMQTLIMISSTLIGAGVLTLPRSAAETGSPSGWLMILLQGVIFIIIVLLFLPFLQKNSGKTLFKLNSIVAG
KFIGFLLNLYICLYFIGIVCFQARILGEVVGFFLLKNTPMAVVVFIFLAVAIYHVGGGVYSIAKVYAYIFPITLIIFMML
LMFSFRLFQLDFIRPVFEGGYQSFFSLFPKTLLYFSGFEIIFYLVPFMRDPKQVKKAVALGIATSTLFYSITLLIVIGCM
TVAEAKTVTWPTISLIHALEVPGIFIERFDLFLQLTWTAQQFACMLGSFKGAHIGLTEIFHLKNKNNAWLLTAMLAATFF
ITMYPKDLNDVFYYGTLLGYAFLIVITIPFFVWFLSWIQKKIGRGQLQ
>P39571 ~~~gerBC~~~Spore germination protein B3~~~
MKTASKFSVMFFMLLALCGCWDVKDIEQLSFARGLAIDETNDHQYKLTYQNLLPQSEDSQASGKPEFVNVTSHGKTILEA
VSDVSIKDPPVYSDHLKVILLGEKLMRNQNVDQVLNHFIRDDELRRSSYLMAARGNAADVFTKGNPNQQQPMPSEKLIDL
TTHSGYNGKIMIPLRIGRASVYSQNGYSYLIQAVKNEKGKAKYDGAGIIKRGSNKLVGFLSADETQTLSWVMGTIQGGVM
PTTDKGHPITFEIKKSKTKIKPVIENGKPVFHISVKTKGILTEDQNPNENSFSKSYLHRLENIFEKKLERDVKQVMDKLQ
HEYKTDPVFLSDHIRIQHPDYWNKVKGHWDEIFSETDFKYDISFKIINFGTVGK
>P16450 ~~~gerD~~~Spore germination protein GerD~~~
MSKAKTLLMSCFLLLSVTACAPKDQAADMDYDQTKKMVVDILKTDDGKKAIKELLNDDAMNEALVIDQDAIKGTIEKTLT
SKKGEEFWKNIFEDTDFAEGFAKTLQTEHEKVIKKLMKDPDYQKMLMSVMQDPGMDKKYSQLAKSQEFRSYLEEVINETL
SSPLYKKQFEDELKKAAKDTAKESE
>P11470 ~~~gerE~~~Spore germination protein GerE~~~COG2197
MKEKEFQSKPLLTKREREVFELLVQDKTTKEIASELFISEKTVRNHISNAMQKLGVKGRSQAVVELLRMGELEL
>Q331R1 5.1.3.27~~~gerF~~~dTDP-4-dehydro-6-deoxyglucose 3-epimerase~~~
MHPLSIEGAWSQEPVIHSDHRGRSHEWFRGERFRQTFGHDFPVAQVNVAVSHRGALRGIHYTEIPPGQAKYSVCVRGAGL
DVIVDVRIGSPTFGRWEIVPMDAERNTAVYLAAGLGRAFLSLTDDATLVYLCSSGYAPEREHSVNPLDPDLGIVWPADIE
PLLSDRDKNAPTLATAERLGLLPTYQAWQEQQQAKA
>Q331Q7 1.1.1.364~~~gerKI~~~dTDP-4-dehydro-6-deoxy-D-allose reductase~~~
MTADRWAGRTVLVTGALGFIGSHFVRQLDARGAEVLALYRTERPEIQAELAALNRVRLVRTELRDESDVRGAFKYLAPSI
DTVVHCAAMDGNAQFKLERSAEILDSNQRTISNLLNCVRDFGVGEVVVMSSSELYSASPTVAAREEDDFRRSMRYTDNGY
VLSKTYGEILARLHREQFGTNVFLVRPGNVYGPGDGFDCSRGRVIPSMLAKADAGEEIEIWGDGSQTRSFVHVADLVRAS
LRLLETGKYPEMNVAGAEQVSILELAGMVMAVLGRPERIRLDPSRPVGAPSRLLDLSRMSEVIDFDPQPLRAGLEETARW
YRLHKR
>P39072 ~~~gerM~~~Spore germination protein GerM~~~COG5401
MLKKGPAVIGATCLTSALLLSGCGLFQSDKAAEEIDPPQDVTFVNDEAGANSNTTAAKKTESEKSDTAKADQASSTVMRE
LYLIDKNGYVVAQTLPLPKSESTAKQALEYLVQGGPVSEILPNGFRAVLPADTTVNVDIKKDGTAIADFSNEFKNYKKED
EQKIVQSVTWTLTQFSSIDKVKLRINGHELKEMPVGGTPISDDLSRKDGINLETAGVNDLTATHPLTVYYLAENEDSEYY
VPVTKRIDNSEKDDITAAINELAKGPSKVSGLLTDFSEDVKLVSKPKIKDGRVTLDFNQSIFGSADEKTKMISSEVLNSI
VLTLTEQPDVKSVSVKVNGKSELVNEKGEKLTEPVSRPSQVNTGSF
>Q9KI10 ~~~gerN~~~Na(+)/H(+)-K(+) antiporter GerN~~~COG0475
MEFEFFFQIALILLSTKLAGDLSVRLGQPSVLGKLIVGIVIGPAVLGWIENSELLTQLSNVGVILLMFMAGLETDLEELN
ANRNSSLAVALGGIILPFVGGYVSGLVMGMEQGNAVFLGLLLCATSVSISVQTLRDLGKMKTRESTTMLGAAVFDDILVV
ILLAFAMSFLGTDDVNLTMVILKKVVFFASIILIGWKGVPAIMRWLSPLRVSESIVSAALIICFSFAYFGELLGIAGIIG
AFAAGIAISQTNYKHEVEKKVEPIAYAMFVPVFFVSIGMNITFDGIGNQIWFILALTVIAVLTKLIGCGFGARMTGFDAK
SSAIIGAGMVSRGEVALIIAGTGLSSGLLAQDYFTAIVIVVILTTMITPPMLKYTFGAKDKAMKASK
>P62165 ~~~gerPA~~~Probable spore germination protein GerPA~~~
MPAMVGHIRIVNIGSSGIFHIGDVFAIRPISYSRAFAGAGSFNVGDNVSVYNYQSATTVNDSDVVDQAIIGST
>O06721 ~~~gerPA~~~Probable spore germination protein GerPA~~~
MPAIVGAFKINAIGTSGVVHIGDCITISPQAQVRTFAGAGSFNTGDSLKVMNYQNATNVYDNDAVDQPIVANA
>P0A3T7 ~~~gerPB~~~Probable spore germination protein GerPB~~~
MNFYVNQSIIINSIKIDSITTSSVFQIGTAGSIKALSKFSNTGGFTEPLRPLQAKGQIISIKPSTSSS
>O06720 ~~~gerPB~~~Probable spore germination protein GerPB~~~
MNFYINQTIQINYLRLESISNSSILQIGSAGSIKSLSNLYNTGSYVEPAPEVSGSGQPLQLQEPDTGSLVPLQPPGR
>O68685 ~~~gerPC~~~Probable spore germination protein GerPC~~~
MNQDIYTYLHQLQQALQTQQAAILNLEDQVRQLQEELNELKNRPSSSVGKVEYKFDQLKVENLNGTLNIGLNPFSAKGQQ
IEDLQVDTETLKVNPETETNPDFYQGILQEMHRYLDEEAYNRILHFEQEERTPLDEMYRQMMVDDIKKQMEHRLPYYLSQ
AQSYEGISTDPDYLRDIIIQAMKQDIDKAFLSFIQHIPGNFRKE
>O06719 ~~~gerPC~~~Probable spore germination protein GerPC~~~
MYDQSVSSYLQNLNSFVQQQAIHIQQLERQLKEIQTEMNTMKQRPATTIERVEYKFDQLKIERLDGTLNIGLNPTDPNSV
QNFDVSQSTPQIGMMQQEESAQLMQQIRQNVDMYLTEEIPDILEQLENQYDSRLDDTNRHHVIEDIRKQMDSRIHYYMSH
IKKEENTPPAQYAEHIAEHVKRDVIRAVEHFLEHIPSEMKGDEQA
>P0A3T9 ~~~gerPD~~~Probable spore germination protein GerPD~~~
MNLNVVNRELKVGQIKMNGVSSSALFLIGDANLLILSSILDTPFETVTEGPFVPLVTDVPPTPG
>O06718 ~~~gerPD~~~Probable spore germination protein GerPD~~~
MIFTVINRSLEVGDIRMNGVSSSSVFHIGDTESIYLSSIFDTPPESLIIGPFAPLAPE
>O68687 ~~~gerPE~~~Probable spore germination protein GerPE~~~
MLHHVSIVQNVSIISLGIAAVFQVGDANQMELKSRAIAVHREIPFYIRGEGRFDAFEIFTDEHITIPKRTTDVKLNIVNE
CPFIEVNNVELRTLLNSGCFQIGNVDYGFNNSRIIQIRQYITDEPSAQ
>O06717 ~~~gerPE~~~Probable spore germination protein GerPE~~~
MLKRISRIRLVKFNSLGIASVFQVGDTNEIDMSVKVFAVQRSLSTFYHNEGSFNKKEYQIFQQQAVKPLPETGVQSAFCH
EVPAIYVRSIKIQGVSASSVLHAGSASLIRGDARLKHIRQIQSPRSQSPAKNI
>P62183 ~~~gerPF~~~Probable spore germination protein GerPF~~~
MPSVVGNLVVQNSNGSFNLGDFYNVSPKENTKAYNGSGASNVGFVVNTFNGVSATNTFDSDVADQDQIGTA
>O06716 ~~~gerPF~~~Probable spore germination protein GerPF~~~
MPAIVGPIAINSISGGVVNFGDSFYLSPKSSSKSALGSGAGNTGDFLLLNNAVNATNYIDPDVNDQDMVGNG
>P39620 ~~~gerQ~~~Spore coat protein GerQ~~~
MKPKKNQYQQMQAFDNMQGYQPQFGANPYPQQGQGSQMQTMGMQPMMPMQQGQQGQQGQQGFGFPGQQQGGGFQIPSGPT
PSGPGQSVPGMLPVEESYIENILRLNRGKTATIYMTFENSKEWGSKIFRGVIEAAGRDHIIISDPKSGTRYLLLTIYLDY
ITFDEEIAYTYPYSMASYSPR
>E8W6C7 4.2.3.166~~~~~~(+)-(1(10)E,4E,6S,7R)-germacradien-6-ol synthase~~~COG0664
MTSQASAPKIPQLWVPLPSGIHPSWREIDQGSAAWLDRFGLYSDHAQRERLTRISVGEITGRGGPDGRLAALQWTADFLM
WLFAFDDEYCDEGPAAASPDATLLIITKLQRIVEVPWAAPADDNYSAALLELRLRLDDLTTPVQTARWAASFRAYLQGQI
WMAANSTYGRIPTLSDHLAVRLDSSGVKIFSTLSEIIHGYDLPAADYDRHDVRGFVEVFAAIIGWSNDLVSYHKERRRSQ
DSYGNVVDLIAHERQCSVEEAVSETATMHTRAMALYLRLRDQILRDAEPELRKWITDCDSWIRADYDWSLTTHRYVNPDD
PADLPVGSAEAPFRAREADQPLPIASVSWWWTLLKD
>Q7WY67 ~~~gerT~~~Spore germination protein GerT~~~
MFEWNKYFPFHNQFSKEALKKADPKEVETYVNRVMESVFGSDYAAQFPFRDPLPQKEHPAKPDAKPDVKPDIDIFETADH
VFVKVPISEEWLEQVRIKHTSHELWLENLPRADHPKKVNLPCLVKRKGTKAVYKDGLLEVMFQKQQDYNMSEVEIIR
>Q9ZFB4 ~~~gerXA~~~Spore germination protein XA~~~
MKRTVEVNESILRVWFEGCKDVKIMNRKWCADTTTTTILLVYCQHVIDHTKLKQAIAPEMCNDLLQSSFKDSNLLASNSQ
FSVTTLELENSNENVSRMLFEGKLLIIFQEYKRGYTIDIAKLPTRSIEQSNTEMTIRGSRDGFVEELSTNIGLIRKRLKT
SSLSYDEFIIGERTQTKVGLLYLKDVASQETISQVQFKLKEINIDGVVSSAQIEEFITGDQFSLFPLIEYTGRPDYAVNC
LLHGRFILLVDGSPTATIAPVSFPFFVNTAEDQNYFYLFGSFVRLLSLFGIAISIFLPGFWVALVTYHPDQIPYTLLATL
SLSREGIPFPAPLEGMIMITLFELLRQAGLRIPAAFGQTLSVVGGLIIGQAAISSGFVSPSMVVMIAISVVSTFTLVNQS
FTGTLSILRYGVFLMSSFLGIVGFICSILLIVIHVANLRSFGLPFLAPYSPPVFSSMLPSTFRIPFTRMKKRPKELHTYD
NTRQRTNNDENK
>Q51669 4.4.1.22~~~gfa~~~Glutathione-dependent formaldehyde-activating enzyme~~~
MVDTSGVKIHPAVDNGIKPAQPGFAGGTLHCKCSTNPVRVAVRAQTAHNHVCGCTKCWKPEGAIFSQVAVVGRDALEVLE
GAEKLEIVNAEAPIQRHRCRDCGVHMYGRIENRDHPFYGLDFVHTELSDEDGWSAPEFAAFVSSIIESGVDPSRMEAIRA
RLRELGLEPYDALSPPLMDAIATHIAKRSGALAA
>P75885 ~~~gfcA~~~Threonine-rich inner membrane protein GfcA~~~
MKHKLSAILMAFMLTTPAAFAAPEATNGTEATTGTTGTTTTTTGATTTATTTGGVAAGAVGTATVVGVATAVGVATLAVV
AANDSGDGGSHNTSTTTSTTR
>P75884 ~~~gfcB~~~Uncharacterized lipoprotein GfcB~~~
MRPLILSIFALFLAGCTHSQQSMVDTFRASLFDNQDITVADQQIQALPYSTMYLRLNEGQRIFVVLGYIEQEQSKWLSQD
NAMLVTHNGRLLKTVKLNNNLLEVTNSGQDPLRNALAIKDGSRWTRDILWSEDNHFRSATLSSTFSFAGLETLNIAGRNV
LCNVWQEEVTSTRPEKQWQNTFWVDSATGQVRQSRQMLGAGVIPVEMTFLKPAP
>Q8VQD7 ~~~gfh1~~~Transcription inhibitor protein Gfh1~~~
MAREVKLTKAGYERLMKQLEQERERLQEATKILQELMESSDDYDDSGLEAAKQEKARIEARIDSLEDVLSRAVILEEGTG
EVIGLGSVVELEDPATGERLSVQVVSPAEASVLENPMKISDASPMGKALLGHRVGDVLSLDTPKGKKEFRVVAIHGR
>Q72JT8 ~~~gfh1~~~Transcription inhibitor protein Gfh1~~~COG0782
MAREVKLTKAGYERLMQQLERERERLQEATKILQELMESSDDYDDSGLEAAKQEKARIEARIDSLEDILSRAVILEEGSG
EVIGLGSVVELEDPLSGERLSVQVVSPAEANVLDTPMKISDASPMGKALLGHRVGDVLSLDTPKGKREFRVVAIHG
>Q5SJG6 ~~~gfh1~~~Transcription inhibitor protein Gfh1~~~COG0782
MAREVKLTKAGYERLMQQLERERERLQEATKILQELMESSDDYDDSGLEAAKQEKARIEARIDSLEDILSRAVILEEGSG
EVIGLGSVVELEDPLSGERLSVQVVSPAEANVLDTPMKISDASPMGKALLGHRVGDVLSLDTPKGRREFRVVAIHG
>Q07982 1.1.99.28~~~gfo~~~Glucose--fructose oxidoreductase~~~COG0673
MTNKISSSDNLSNAVSATDDNASRTPNLTRRALVGGGVGLAAAGALASGLQAATLPAGASQVPTTPAGRPMPYAIRPMPE
DRRFGYAIVGLGKYALNQILPGFAGCQHSRIEALVSGNAEKAKIVAAEYGVDPRKIYDYSNFDKIAKDPKIDAVYIILPN
SLHAEFAIRAFKAGKHVMCEKPMATSVADCQRMIDAAKAANKKLMIGYRCHYDPMNRAAVKLIRENQLGKLGMVTTDNSD
VMDQNDPAQQWRLRRELAGGGSLMDIGIYGLNGTRYLLGEEPIEVRAYTYSDPNDERFVEVEDRIIWQMRFRSGALSHGA
SSYSTTTTSRFSVQGDKAVLLMDPATGYYQNLISVQTPGHANQSMMPQFIMPANNQFSAQLDHLAEAVINNKPVRSPGEE
GMQDVRLIQAIYEAARTGRPVNTDWGYVRQGGY
>O53507 2.5.1.10~~~idsA2~~~(2E,6E)-farnesyl diphosphate synthase~~~COG0142
MAGAITDQLRRYLHGRRRAAAHMGSDYDGLIADLEDFVLGGGKRLRPLFAYWGWHAVASREPDPDVLLLFSALELLHAWA
LVHDDLIDRSATRRGRPTAQLRYAALHRDRDWRGSPDQFGMSAAILLGDLAQVWADDIVSKVCQSALAPDAQRRVHRVWA
DIRNEVLGGQYLDIVAEASAAESIESAMNVATLKTACYTVSRPLQLGTAAAADRSDVAAIFEHFGADLGVAFQLRDDVLG
VFGDPAVTGKPSGDDLKSGKRTVLVAEAVELADRSDPLAAKLLRTSIGTRLTDAQVRELRTVIEAVGARAAAESRIAALT
QRALATLASAPINATAKAGLSELAMMAANRSA
>D7BAR0 2.4.1.352~~~~~~Glucosylglycerate phosphorylase~~~COG0366
MSSLTPELRQSILEHLGFLYGERAPAVLGRLEEICSGFPAQRREGGWSEKDALLITYGDQIHAEGEPPLQTLYDFLYERL
RGVFSGVHLLPFYPSTSDDGFSVVDFQRVDPELGTWTDIRIIAQDFRLMADLVCNHVSASSPWFQGFLQDDPQYQGFFIT
VDPGTDLSTVFRPRALPLLTPFQTPSGEKLVWTTFSPDQTDLNYANPEVLLEVIEALLCYVRNGAGLIRLDAVGFIWKEI
GTSCMHLEGAHRIVKLMRLVLDAVAPHVLLVSETNAPHRENISYFGNGHDEAQLVYQFPLPPLVMHTFRTGDASKLAGWA
AGLTLPSERTTFFNFLASHDGIGVVPAGGILQPEEIAALVRQALEHGGRVNHKDTPDGPVPYELCLTLFDALSNPNSDEA
EDLKIARFLAANVILLSLQGIPGVYIHSLFGSPSDHAGFEESGIPRRLNRHKFTKAELEERLADPASRAAKILAAYSHLL
RVRSMHPAFHPNAPQRILPSTEVLRIVRGEGDQAVGCYINVTDRPQVVSRIGKNLITGQWFTGVLKPYQAAWIID
>P76041 2.4.1.352~~~ycjM~~~Glucosylglycerate phosphorylase~~~COG0366
MKQKITDYLDEIYGGTFTATHLQKLVTRLESAKRLITQRRKKHWDESDVVLITYADQFHSNDLKPLPTFNQFYHQWLQSI
FSHVHLLPFYPWSSDDGFSVIDYHQVASEAGEWQDIQQLGECSHLMFDFVCNHMSAKSEWFKNYLQQHPGFEDFFIAVDP
QTDLSAVTRPRALPLLTPFQMRDHSTRHLWTTFSDDQIDLNYRSPEVLLAMVDVLLCYLAKGAEYVRLDAVGFMWKEPGT
SCIHLEKTHLIIKLLRSIIDNVAPGTVIITETNVPHKDNIAYFGAGDDEAHMVYQFSLPPLVLHAVQKQNVEALCAWAQN
LTLPSSNTTWFNFLASHDGIGLNPLRGLLPESEILELVEALQQEGALVNWKNNPDGTRSPYEINVTYMDALSRRESSDEE
RCARFILAHAILLSFPGVPAIYIQSILGSRNDYAGVEKLGYNRAINRKKYHSKEITRELNDEATLRHAVYHELSRLITLR
RSHNEFHPDNNFTIDTINSSVMRIPRSNADGNCLTGLFNVSKNIQHVNITNLHGRDLISEVDILGNEITLRPWQVMWIK
>G0GBS4 2.4.1.352~~~~~~Glucosylglycerate phosphorylase~~~
MEPVDRMRELLSFIYGPETGRDTHEALHALLDGWRGRLPSPDEEYASGRLPLDHTDAVLITYGDQFGRKGEAPLATLGEF
LREYLSGTMKGVHILPFFPYSSDDGFSVMDYRRVNPEWGTWDDVRRISEDFRLMVDLVLNHCSAKSEWFRRFLQGDPEYE
DFFITVEPGTDLSGVFRPRALPLVHEFESAKGPVLVWTTFSRDQVDLNYANPRVLLEMIDIFLFYVSQGAQIIRLDAIAY
LWKELGTPCIHHPKTHAVVKLFRAICEEVCPWVLIITETNVPHKENISYFGDMDEAHLVYQFALPPLVLDAFLRKDVSYL
REWARTIDTYGGKVSYFNFLASHDGIGVLPARGILPDEYIDAMIEAVKDRGGLISYKSTPQGEVPYELNINYLSAISESH
LDRPTRARKFLASQAVMLSLVGMPGIYVHSLLGSENWREGVEKTGMNRTINRQKLSYEGVLEELRDPESLRSMVFEGYLD
MLAARRKSRAFDPRGMQEVLEAPETVFALLRRSPDATEEVLCLINVSHIEQECVFPSSIFRTAPDAHLFTELTSGDTLVP
YREDEDRFSISLGGYEVLWLTPYRDKG
>C7PEQ0 2.5.1.41~~~~~~Geranylgeranylglyceryl phosphate synthase~~~COG1646
MHNKIYNSFIDRKAKGIKSFAVLIDPDKVNPADIADLAAKCTAAKVDYIFLGGSLVITNHLDECVQQFKTLCDIPVVLFP
GSPSQVSRYADALLYLSVISGRNPELLIGQHVLSAPAVKKSGLEVISTGYVLIDGGAPTTVSYISNTTPIPSDKDDIAMC
TAMAGEMLGMKVVFMDAGSGARKPITESMISRVASQVSAPIIVGGGIRDAEKAYLNCKAGADIIVVGNAIEKETSLIKEM
ADAVHAAAPVLK
>A5FJK8 2.5.1.41~~~~~~Geranylgeranylglyceryl phosphate synthase~~~COG1646
MEQKILTTIHQQILEAKKNGQKLLAILLDPDKIVWENLDHLLLKINQSPATHIFVGGSIVESTIIEDLIAQLKQKTRLPV
VIFPGDPSQISPKADAILFLSLLSGRNPDYLIEYQVQAAPILKKTNLEVISTGYILIESGNETAVARVSKTEPLNRENFD
LALATAQAGEMLGSKLIYLEAGSGAKKPVPLEMISVISQNVEIPIIVGGGIVDLHGIKKAYNAGADLVVIGTAFENDSHF
FDS
>D2QS27 2.5.1.41~~~~~~Geranylgeranylglyceryl phosphate synthase~~~COG1646
MTILRDYKLSGRKAFAVLLDPDKVEQDAFSTLLQRTADYPVDFFLVGGSLVTDYAHKEVIATIRRYSSTPVILFPGNPLH
IESSADAILLLSLISGRNADFLIGQHVIAAPLLKKSGLEILPTGYMVVDSGTQTTVSYISGTMPLPHDKPDVAACTALAG
EMLGLQLMYLDAGSGARRPVSAAMIAAVRKAVNVPIIVGGGITSGEKAYEALKAGADMIVVGNGVEQDPDLLPQLATVVR
EFNQSVVQA
>D5BCE4 2.5.1.41~~~~~~Geranylgeranylglyceryl phosphate synthase~~~COG1646
MPKILDAIIKASKINKKLLAVLIDPEKFATENYSYFIEKLPEAVTHIFVGGSTATTAQSEVCVDFIKTKTNLPVILFPGD
KEQITEKADGILLLSLISGRNPEYLIEQHIKAVPKLLNAGLEIIPTGYLLLDGGNQSAVARVSKTKPIQQDEIELIRNTA
LAGAMLGKQLVYLEAGSGALIPVSEKVIAEVKRDLNIPLIVGGGIRNATQLKKAYKAGADLVVIGTAFENGEFK
>K5BDL0 3.2.1.208~~~ggh~~~Glucosylglycerate hydrolase~~~COG1626
MPHDPSFTPTQLAARAAYLLRGNDLGTMTTAAPLLYPHMWSWDAAFVAIGLAPLSVERAVVELDTLLSAQWRNGMIPHIV
FANGVDGYFPGPARWATATLADNAPRNRLTSGITQPPVHAIAVQRILEHARTRGRSTRAVAEAFLDRRWGDLMRWHRWLA
ECRDRNERGRITLYHGWESGMDNSPRWDSAYANVVPGKLPEYQRADNVIITDPSQRPSDGEYDRYLWLLEEMKAVRYDDE
RLPSVMSFQVEDVFFSAIFSVACQVLAEIGEDYKRPHADVKDLYLWAERFRAGVVETTDQRTGAARDFDVLAEKWLVTET
AAQFAPLLCGGLPHDRERALLKLLEGPRFCGHPDLKYGLIPSTSPVSRDFRPREYWRGPVWPVLTWLFSWCFARRGWAER
ARLLRQEGLRQASDGSFAEYYEPFTGEPLGSMQQSWTAAAVLDWLG
>P11697 ~~~~~~Antibacterial protein 1~~~
MQKLAEAIAAAVSAGQDKDWGKMGTSIVGIVENGITVLGKIFGF
>P11698 ~~~~~~Antibacterial protein 2~~~
MEKIANAVKSAIEAGQNQDWTKLGTSILDIVSNGVTELSKIFGF
>P11699 ~~~~~~Antibacterial protein 3~~~
MSKLVQAISDAVQAQQNQDWAKLGTSIVGIVENGVGILGKLFGF
>E4PMA5 2.4.1.359~~~gtfA~~~Glucosylglycerol phosphorylase~~~COG0366
MLLKNAVQLICYPDRIGNNLKDLYTVVDTHLSEAIGGLHILPFFPSNADGGFSPLTHKEVDPKVGTWDDIEAFTAKYDLC
VDLTVNHISDESPEFTDFIANGFDSEYADLFVHVDKFGEISPDDMAKIHIRKEKEPFREVTLSDGTKTRVWCTFTEQQID
LNYESDLAYQLMESYIGFLTSKGVNLLRLDAFGYTTKRIGTSCFLVEPEVYQILDWVNQVALKHGAECLPEVHDHTSYQY
AISRRNMHPYGFALPPLLLYSLLDANSTYLKNWLRMCPRNMVTVLDTHDGICIPDVEGVLPDEKIKVLIDNIDARSADPI
MRRSAANIHSVGAIYQLTCTFYDALMQNDDAYIAARAIQFFTPGIPQVYYVGLLAGCNDHELMEQSGELRDINRHYYTLE
EVEQDIQKPVVQRLLSLMKFRSNYPAFDGHFELNYSNNSSVAMAWRHGDYYCHLFVDLNFKTVKVTYTDVETGETRHLEC
>O50410 2.5.1.29~~~idsB~~~Geranylgeranyl diphosphate synthase~~~COG0142
MGGVLTLDAAFLGSVPADLGKALLERARADCGPVLHRAIESMREPLATMAGYHLGWWNADRSTAAGSSGKYFRAALVYAA
AAACGGDVGDATPVSAAVELVHNFTLLHDDVMDGDATRRGRPTVWSVWGVGVAILLGDALHATAVRILTGLTDECVAVRA
IRRLQMSCLDLCIGQFEDCLLEGQPEVTVDDYLRMAAGKTAALTGCCCALGALVANADDATIAALERFGHELGLAFQCVD
DLIGIWGDPGVTGKPVGNDLARRKATLPVVAALNSRSEAATELAALYQAPAAMTASDVERATALVKVAGGGHVAQRCADE
RIQAAIAALPDAVRSPDLIALSQLICRREC
>O65979 2.4.1.213~~~ggpS~~~Glucosylglycerol-phosphate synthase~~~COG0380
MKSSLVILYHREPYDEVRENGKTFYRDKTSPNGIMPTLKSFFANAEQSTWVAWKQISGKQQENFQAKMAFPGQENSVVHR
IPLSADQVKNFYHITSKEAFWPILHSFPWQFTYDSSDWENFKQINEMFAEAACEDADDDALFWVHDYNLWLTPYFIRQKK
PNAKIAFFHHTPFPSVDIFNILPWREAIVDSLLCCDLCGFHLPRYVQNFVAVARSLRKVEITRQVPVDEHAFTAVGTALA
EPEITTQLKYKDHLVNLDAFPVGTNPTQIRAQVEKASTQERIRKIREELGSNKLILSAGRVDYVKGTKEMLVCYERLLER
RPELQTKVNLVVAAAKAASGMRVYKNAQSEIERLVGRINGRFAKLNWTPILLFTSALSYEELLGFFGAADIAWITPLRDG
LNLVAKEYVVAHGCDDGVLILSEFAGSAVELPDAILTNPYAAKRMDESIDQALAMPVEEQQRRMKSMYQAIQRYDVQQWA
NHMFREAKATAVLGKEPTPV
>P74258 2.4.1.213~~~ggpS~~~Glucosylglycerol-phosphate synthase~~~COG0380
MNSSLVILYHREPYDEVRENGKTVYREKKSPNGILPTLKSFFADAEQSTWVAWKQVSPKQKDDFQADMSIEGLGDRCTVR
RVPLTAEQVKNFYHITSKEAFWPILHSFPWQFTYDSSDWDNFQHINRLFAEAACADADDNALFWVHDYNLWLAPLYIRQL
KPNAKIAFFHHTPFPSVDIFNILPWREAIVESLLACDLCGFHIPRYVENFVAVARSLKPVEITRRVVVDQAFTPYGTALA
EPELTTQLRYGDRLINLDAFPVGTNPANIRAIVAKESVQQKVAEIKQDLGGKRLIVSAGRVDYVKGTKEMLMCYERLLER
RPELQGEISLVVPVAKAAEGMRIYRNAQNEIERLAGKINGRFAKLSWTPVMLFTSPLAYEELIALFCAADIAWITPLRDG
LNLVAKEYVVAKNGEEGVLILSEFAGCAVELPDAVLTNPYASSRMDESIDQALAMDKDEQKKRMGRMYAAIKRYDVQQWA
NHLLREAYADVVLGEPPQM
>D6XZ22 2.4.1.332~~~~~~1,2-alpha-glucosylglycerol phosphorylase~~~COG1554
MHEIGEHLTTNTGWDIIKNRYEAAQAITEGSNFMIGNGFMGYRGTFAEDGKDAYAACIVTDTWDKADGKWEELSTVPNAL
LTLLHVDGEPFIMSEEAASFERTLDLSQGVTSRKVSQRMKNGATITIHEEKFASYRKKHAVLMKYTVESDQDTDAVLDTG
IDYDVWSINGDHLQGHHYFSHPTGDGVTAKTVSYEDTVTVVETCSLDADASEEDYQNPDGSGRTFSLSLEAGKPVTLEKA
MIIYSSNDVDNPQDEALLEAKHMQSYEEEKAANRLEWDNLWSHYDVTIQNNIIDQVALRFNIYHAIIATPVHKSLPIGAR
GLSCQAYQGAAFWDQEIYNMPMYLYSNPEIARNILKYRHRTLDGARRKAKRLGYEGAYYAWISGKTGDELCPDFFFKDVL
SGRDIRNHFNDWQIHISPDIAYAVKKYHQVTGDDAFIRDYGAEMIFEIARFLASHAVYKPMRGRYEFMRVQGPDEYHENV
DNNAFTNHQAMFTLQAADELLQTLDEKTLSAVKEKIGLSDDEISLWRDMLANTYVPKPDKHGIIEQFDGYYDLETIIPAK
KVTERLIKEDEYYGYPNGVTVRTQCIKQADVIQLFVLHPHLYDRKTVELNYEFYEPRTLHFSSLSPSSYAIVAAQIDKVE
EAYRNFRKSVMIDLLNTNEAVSGGTFIGGIHTAANGASWQMVVNGFGGLSVHGDDIHLSPRLPDAWDGYTFKAIVKGQTL
EVDVTKEQITITNKSEDRKPLTLHIFGEKSVLDSERITKSR
>A9BEU2 2.4.1.268~~~ggs~~~Glucosylglycerate synthase~~~COG0463
MSIFFDDKILQNLPKKVKVVVGIPSYNNAETISFVSKTAAEGIVEYFDSDGIIVNADGGSKDGTKEVFMKTDTKSVPKIA
YDYIGLPGKGSAMLSVIELAKNLDAEAIVFLDSDLKSVRPWWIERLTGPIMKGLSDYVTPYYVRHKYDGTITNQVCYPLV
SSLFGQAIRQPIGGDFGVGKNMIDVYLKAASSVAKTEVARFGIDIWMTINAILNSNKKVYQAALGAKVHDPKDPGADLSP
MFKQVVGTLFDIIVDSASKWKDIGSIEEAPIYGEIPQIAVEPININIENLKMQLLEGLKNEESKILANDHLGFIMEKKKV
PLQIWVDILFNALIEYSKNKDKKLVESLVPLYFGRVADFAELTKDMNEVEAEKVIKDQINLFANKKDELIEKL
>Q79EE4 7.5.2.-~~~ggtA~~~Osmoprotective compounds uptake ATP-binding protein GgtA~~~COG3842
MASVSFEQVTKQFDDYVAVNNLNLEIEDGEFLVFVGPSGCGKTTSLRLLAGLETVSQGQICIGDRRVNELSPKDRDIAMV
FQSYALYPHMSVYENMAFSLDLQGKPKEEIRQRVCSAAELLGIEKLLHRKPKELSGGQRQRVAVGRAIVRKPSVFLMDEP
LSNLDAMLRVQARKEISKLHSDLATTFIYVTHDQVEAMTMGDRIAVMKDGILQQVDSPANLYNQPANLFVAGFIGSPAMN
FFQVERLSQEGKEKLSLDGVVLPMPDSVAKNGDRPLTLGIRPENIYHPQYLPLEIEPMELPATVNLVEMMGNELIVYAQT
PAGTEFVARIDPRVNIKQKDSVKFVVDTQRFYYFDREMETAIF
>Q55471 ~~~ggtB~~~Osmoprotective compounds-binding protein GgtB~~~COG1653
MKFFKITTLIISLIVLTSCQGPGVNGDEDRKQVTILGVMIGEQQEKIEQALAPFTEATGIEVVYEGVDTFATTLPIRVDS
GRAPDLAMFPQPGLMADFAREGKLVPLGEILTPEEMTEAYDQAWLDLAAVDGTVYGVWYRASVKSLVWFNPQEFAANGYE
VPGTWEEMMALSQRLIDKGKTPWCLGIESGNATGWVGTDWVEDIMLRTASPATYDQWVAHDIPFNDRRVENALDIFGEIT
QNEKMIYGGKVGALSTPFGDSILGLFTDPPHCYLHRQGNFIAAFLPADVDDDQVDIFPLPPIEEEYGLPILVAGDIFAMF
NDTPEARQLMAYLASSRPHEVAATLGAYISPHKNIDLNLYPDRLTRKQAEILNKAEVIRFDASDMMPGAVGTGTFWSGMV
DYIGGADGTQVLNTIERSWPR
>Q55472 ~~~ggtC~~~Osmoprotective compounds uptake permease protein GgtC~~~COG1175
MYVTPALLFLSAYLILPTLETVYLSFFDGRSRNFVGLKNYVFAFTDHTMLVAFRNNLLWLVLVTGISVSLGLIIAVLVDK
VRYEAIAKSIIFLPMAISFVGASVIWKFVYAYRPAGAEQIGLLNAIVTSLGFAPVGWLVERSVNNFALIAIMIWLYTGFC
MVILSAAVKGIPADVIEAARIDGANSWQIFWRITIPMIRSTLLVVSTTMVILVLKVFDIVFVMTGGNQGTEVIASLMIKE
MFNYRNFGRGSTIAVILLLLIVPVMITNIRRFKAQEKLR
>Q55473 ~~~ggtD~~~Osmoprotective compounds uptake permease protein GgtD~~~COG0395
MTKAVNKSNRTNNTNRKTEFWQKLPIHIAILTIAFIWTLPSLGLFISSLRPRGDMLSTGWWTVFWHPLEITQFYLGNYGD
VLRSSGMGEAFLNSLTIAVPATVIPIAIATFAAYAFAWMTFPGRQLLFILVVCLLVVPLQTTLIPVLRVYAQLGLAGTFL
GVWLAHTAYGLPLGIYLLRNYIGALPKDLIEAAAVDGASHLKIFTKLIVPLSMPAIASFAVFQFLWVWNDLLVALVYLGG
TADVAPVTIQLSNLVGSRGQDWYLLTAGAFISMIVPLMVFFGLQRYFVRGILAGSVKS
>P63186 3.4.19.13~~~ggt~~~Glutathione hydrolase proenzyme~~~
MKRTWNVCLTALLSVLLVAGSVPFHAEAKKPPKSYDEYKQVDVGKDGMVATAHALASEIGADVLKKGGNAIDAAVAIQFA
LNVTEPMMSGIGGGGFMMVYDGKTKDTTIIDSRERAPAGATPDMFLDENGKAIPFSERVTKGTAVGVPGTLKGLEEALDK
WGTRSMKLLITLTIKLAEKGFPIDSVLADAISDYQEKLSRTAAKDVFLPNGEPLKEGDTLIQKDLAKTFKLIRSKGTDAF
YKGKFAKTLSDTVQDFGGSMTEKDLENYDITIDEPIWGDYQGYQIATTPPPSSGGIFLLQMLKILDDFNLSQYDVRSWEK
YQLLAETMHLSYADRASYAGDPEFVNVPLKGLLHPDYIKERQQLINLDQVNKKPKAGDPWKYQEGSANYKQVEQPKDKVE
GQTTHFTVADRWGNVVSYTTTIEQLFGTGIMVPDYGVILNNELTDFDAIPGGANEVQPNKRPLSSMTPTILFKDDKPVLT
VGSPGGATIISSVLQTILYHIEYGMGLKAAVEEPRIYTTSMSSYRYEDGVPKDVLSKLNGMGHRFGTSPVDIGNVQSISI
DHENGTFKGVVISGSNDAAIGINLKRK
>P54422 3.4.19.13~~~ggt~~~Glutathione hydrolase proenzyme~~~COG0405
MKRTWNVCLTALLSVLLVAGSVPFHAEAKKPPKSYDEYKQVDVGKDGMVATAHPLASEIGADVLKKGGNAIDAAVAIQFA
LNVTEPMMSGIGGGGFMMVYDGKTKDTTIIDSRERAPAGATPDMFLDENGKAIPFSERVTKGTAVGVPGTLKGLEEALDK
WGTRSMKQLITPSIKLAEKGFPIDSVLAEAISDYQEKLSRTAAKDVFLPNGEPLKEGDTLIQKDLAKTFKLIRSKGTDAF
YKGKFAKTLSDTVQDFGGSMTEKDLENYDITIDEPIWGDYQGYQIATTPPPSSGGIFLLQMLKILDHFNLSQYDVRSWEK
YQLLAETMHLSYADRASYAGDPEFVNVPLKGLLHPDYIKERQQLINLDQVNKKPKAGDPWKYQEGSANYKQVEQPKDKVE
GQTTHFTVADRWGNVVSYTTTIEQLFGTGIMVPDYGVILNNELTDFDAIPGGANEVQPNKRPLSSMTPTILFKDDKPVLT
VGSPGGATIISSVLQTILYHIEYGMELKAAVEEPRIYTNSMSSYRYEDGVPKDVLSKLNGMGHKFGTSPVDIGNVQSISI
DHENGTFKGVADSSRNGAAIGINLKRK
>P18956 3.4.19.13~~~ggt~~~Glutathione hydrolase proenzyme~~~COG0405
MIKPTFLRRVAIAALLSGSCFSAAAAPPAPPVSYGVEEDVFHPVRAKQGMVASVDATATQVGVDILKEGGNAVDAAVAVG
YALAVTHPQAGNLGGGGFMLIRSKNGNTTAIDFREMAPAKATRDMFLDDQGNPDSKKSLTSHLASGTPGTVAGFSLALDK
YGTMPLNKVVQPAFKLARDGFIVNDALADDLKTYGSEVLPNHENSKAIFWKEGEPLKKGDTLVQANLAKSLEMIAENGPD
EFYKGTIAEQIAQEMQKNGGLITKEDLAAYKAVERTPISGDYRGYQVYSMPPPSSGGIHIVQILNILENFDMKKYGFGSA
DAMQIMAEAEKYAYADRSEYLGDPDFVKVPWQALTNKAYAKSIADQIDINKAKPSSEIRPGKLAPYESNQTTHYSVVDKD
GNAVAVTYTLNTTFGTGIVAGESGILLNNQMDDFSAKPGVPNVYGLVGGDANAVGPNKRPLSSMSPTIVVKDGKTWLVTG
SPGGSRIITTVLQMVVNSIDYGLNVAEATNAPRFHHQWLPDELRVEKGFSPDTLKLLEAKGQKVALKEAMGSTQSIMVGP
DGELYGASDPRSVDDLTAGY
>P36267 3.4.19.13~~~ggt~~~Glutathione hydrolase proenzyme~~~
MKNQTFSKALLATALSCALFNVHAASQAPVGAENGMVVTAQHIASKVGVEVLKSGGNAIDAAVAVGYALAVVYPAAGNIG
GGGFMTIQLADGRKTFLDFREKAPLAATANMYLDKDGNVIKGASTTGYLAVGVPGTVSGMEYAREKYGTKTRQQLISPAI
TLADKGFVLEQGDVDMLWTSTKDFEKDRANSGAIFMNKGQPFQPGERLVQKDLARTLRLISAKGTDGFYKGEVADKLVAS
MKAGGGIITQADLDQYKTRELAPVECDYRGYHVVSAPPPSSGGVVICEIMNILEGYPMKELGYHSAQGVHYTIEAMRHAY
VDRNSYLGDPDFVKNPLAHLLDKDYAAKIRAAINPQKAGISQEIKPGVPPHEGSNTTHYSIVDKDGNAVSVTYTLNDWFG
AKVMANGTGVLLNDEMDDFTSKVGVPNMYGLIQGEANAIGPGRRPLSSMSPTIVTKDGKTVMVVGTPGGSRIITATLLTM
LNMIDYGMNLQEAVDAPRFHQQWMPESTNIEAFALSPDTQKILESWGQKFAGPQPANHIAAILVGAPSLGGKPIGKNRFY
GANDPRRNTGLALGY
>Q2MGH6 3.2.1.97~~~~~~Endo-alpha-N-acetylgalactosaminidase~~~COG0366
MNKGLFEKRCKYSIRKFSLGVASVMIGAAFFGTSPVLADSVQSGSTANLPADLATALATAKENDGRDFEAPKVGEDQGSP
EVTDGPKTEEELLALEKEKPAEEKPKEDKPAAAKPETPKTVTPEWQTVANKEQQGTVTIREEKGVRYNQLSSTAQNDNAG
KPALFEKKGLTVDANGNATVDLTFKDDSEKGKSRFGVFLKFKDTKNNVFVGYDKDGWFWEYKSPTTSTWYRGSRVAAPET
GSTNRLSITLKSDGQLNASNNDVNLFDTVTLPAAVNDHLKNEKKILLKAGSYDDERTVVSVKTDNQEGVKTEDTPAEKET
GPEVDDSKVTYDTIQSKVLKAVIDQAFPRVKEYSLNGHTLPGQVQQFNQVFINNHRITPEVTYKKINETTAEYLMKLRDD
AHLINAEMTVRLQVVDNQLHFDVTKIVNHNQVTPGQKIDDESKLLSSISFLGNALVSVSSNQTGAKFDGATMSNNTHVSG
DDHIDVTNPMKDLAKGYMYGFVSTDKLAAGVWSNSQNSYGGGSNDWTRLTAYKETVGNANYVGIHSSEWQWEKAYKGIVF
PEYTKELPSAKVVITEDANADKNVDWQDGAIAYRSIMNNPQGWEKVKDITAYRIAMNFGSQAQNPFLMTLDGIKKINLHT
DGLGQGVLLKGYGSEGHDSGHLNYADIGKRIGGVEDFKTLIEKAKKYGAHLGIHVNASETYPESKYFNEKILRKNPDGSY
SYGWNWLDQGINIDAAYDLAHGRLARWEDLKKKLGDGLDFIYVDVWGNGQSGDNGAWATHVLAKEINKQGWRFAIEWGHG
GEYDSTFHHWAADLTYGGYTNKGINSAITRFIRNHQKDAWVGDYRSYGGAANYPLLGGYSMKDFEGWQGRSDYNGYVTNL
FAHDVMTKYFQHFTVSKWENGTPVTMTDNGSTYKWTPEMRVELVDADNNKVVVTRKSNDVNSPQYRERTVTLNGRVIQDG
SAYLTPWNWDANGKKLSTDKEKMYYFNTQAGATTWTLPSDWAKSKVYLYKLTDQGKTEEQELTVKDGKITLDLLANQPYV
LYRSKQTNPEMSWSEGMHIYDQGFNSGTLKHWTISGDASKAEIVKSQGANDMLRIQGNKEKVSLTQKLTGLKPNTKYAVY
VGVDNRSNAKASITVNTGEKEVTTYTNKSLALNYVKAYAHNTRRDNATVDDTSYFQNMYAFFTTGADVSNVTLTLSREAG
DQATYFDEIRTFENNSSMYGDKHDTGKGTFKQDFENVAQGIFPFVVGGVEGVEDNRTHLSEKHNPYTQRGWNGKKVDDVI
EGNWSLKTNGLVSRRNLVYQTIPQNFRFEAGKTYRVTFEYEAGSDNTYAFVVGKGEFQSGRRGTQASNLEMHELPNTWTD
SKKAKKATFLVTGAETGDTWVGIYSTGNASNTRGDSGGNANFRGYNDFMMDNLQIEEITLTGKMLTENALKNYLPTVAMT
NYTKESMDALKEAVFNLSQADDDISVEEARAEIAKIEALKNALVQKKTALVADDFASLTAPAQAQEGLANAFDGNVSSLW
HTSWNGGDVGKPATMVLKEPTEITGLRYVPRGSGSNGNLRDVKLVVTDESGKEHTFTATDWPNNNKPKDIDFGKTIKAKK
IVLTGTKTYGDGGDKYQSAAELIFTRPQVAETPLDLSGYEAALVKAQKLTDKDNQEEVASVQASMKYATDNHLLTERMVE
YFADYLNQLKDSATKPDAPTVEKPEFKLRSLASEQGKTPDYKQEIARPETPEQILPATGESQSDTALILASVSLALSALF
VVKTKKD
>Q8DR60 3.2.1.97~~~~~~Endo-alpha-N-acetylgalactosaminidase~~~COG0366
MNKGLFEKRCKYSIRKFSLGVASVMIGATFFGTSPVLADSVQSGSTANLPADLATALATAKENDGHDFEAPKVGEDQGSP
EVTDGPKTEEELLALEKEKPAEEKPKEDKPAAAKPETPKTVTPEWQTVEKKEQQGTVTIREEKGVRYNQLSSTAQNDNAG
KPALFEKKGLTVDANGNATVDLTFKDDSEKGKSRFGVFLKFKDTKNNVFVGYDKDGWFWEYKSPTTSTWYRGSRVAAPET
GSTNRLSITLKSDGQLNASNNDVNLFDTVTLPAAVNDHLKNEKKILLKAGSYDDERTVVSVKTDNQEGVKTEDTPAEKET
GPEVDDSKVTYDTIQSKVLKAVIDQAFPRVKEYSLNGHTLPGQVQQFNQVFINNHRITPEVTYKKINETTAEYLMKLRDD
AHLINAEMTVRLQVVDNQLHFDVTKIVNHNQVTPGQKIDDERKLLSSISFLGNALVSVSSDQTGAKFDGATMSNNTHVSG
DDHIDVTNPMKDLAKGYMYGFVSTDKLAAGVWSNSQNSYGGGSNDWTRLTAYKETVGNANYVGIHSSEWQWEKAYKGIVF
PEYTKELPSAKVVITEDANADKKVDWQDGAIAYRSIMNNPQGWKKVKDITAYRIAMNFGSQAQNPFLMTLDGIKKINLHT
DGLGQGVLLKGYGSEGHDSGHLNYADIGKRIGGVEDFKTLIEKAKKYGAHLGIHVNASETYPESKYFNEKILRKNPDGSY
SYGWNWLDQGINIDAAYDLAHGRLARWEDLKKKLGDGLDFIYVDVWGNGQSGDNGAWATHVLAKEINKQGWRFAIEWGHG
GEYDSTFHHWAADLTYGGYTNKGINSAITRFIRNHQKDAWVGDYRSYGGAANYPLLGGYSMKDFEGWQGRSDYNGYVTNL
FAHDVMTKYFQHFTVSKWENGTPVTMTDNGSTYKWTPEMRVELVDADNNKVVVTRKSNDVNSPQYRERTVTLNGRVIQDG
SAYLTPWNWDANGKKLSTDKEKMYYFNTQAGATTWTLPSDWAKSKVYLYKLTDQGKTEEQELTVKDGKITLDLLANQPYV
LYRSKQTNPEMSWSEGMHIYDQGFNSGTLKHWTISGDASKAEIVKSQGANDMLRIQGNKEKVSLTQKLTGLKPNTKYAVY
VGVDNRSNAKASITVNTGEKEVTTYTNKSLALNYVKAYAHNTRRNNATVDDTSYFQNMYAFFTTGSDVSNVTLTLSREAG
DEATYFDEIRTFENNSSMYGDKHDTGKGTFKQDFENVAQGIFPFVVGGVEGVEDNRTHLSEKHDPYTQRGWNGKKVDDVI
EGNWSLKTNGLVSRRNLVYQTIPQNFRFEAGKTYRVTFEYEAGSDNTYAFVVGKGEFQSGRRGTQASNLEMHELPNTWTD
SKKAKKATFLVTGAETGDTWVGIYSTGNASNTRGDSGGNANFRGYNDFMMDNLQIEEITLTGKMLTENALKNYLPTVAMT
NYTKESMDALKEAVFNLSQADDDISVEEARAEIAKIEALKNALVQKKTALVADDFASLTAPAQAQEGLANAFDGNLSSLW
HTSWGGGDVGKPATMVLKEATEITGLRYVPRGSGSNGNLRDVKLVVTDESGKEHTFTATDWPDNNKPKDIDFGKTIKAKK
IVLTGTKTYGDGGDKYQSAAELIFTRPQVAETPLDLSGYEAALAKAQKLTDKDNQEEVASVQASMKYATDNHLLTERMVE
YFADYLNQLKDSATKPDAPTVEKPEFKLSSVASDQGKTPDYKQEIARPETPEQILPATGESQFDTALFLASVSLALSALF
VVKTKKD
>A4Q8F7 3.2.1.49~~~nagA~~~Alpha-N-acetylgalactosaminidase~~~COG0673
MGALIPSSTLFNIFDFNPKKVRIAFIAVGLRGQTHVENMARRDDVEIVAFADPDPYMVGRAQEILKKNGKKPAKVFGNGN
DDYKNMLKDKNIDAVFVSSPWEWHHEHGVAAMKAGKIVGMEVSGAITLEECWDYVKVSEQTGVPLMALENVCYRRDVMAI
LNMVRKGMFGELVHGTGGYQHDLRPVLFNSGINGKNGDGVEFGEKAFSEAKWRTNHYKNRNGELYPTHGVGPLHTMMDIN
RGNRLLRLSSFASKARGLHKYIVDKGGESHPNAKVEWKQGDIVTTQIQCHNGETIVLTHDTSLQRPYNLGFKVQGTEGLW
EDFGWGEAAQGFIYFEKIMNHSHRWDSSEKWIKEYDHPMWKKHEQKAVGAGHGGMDYFLDNTFVECIKRNEAFPLDVYDL
ATWYSITPLSEKSIAENGAVQEIPDFTNGKWKNAKNTFAINDDY
>E4Q361 3.2.1.21~~~~~~Multifunctional glycoside hydrolase~~~COG2723
MSFPKGFLWGAATASYQIEGAWNEDGKGESIWDRFTHQKGNILYGHNGDVACDHYHRHEEDVSLMKELGIKAYRFSTAWA
RIFPDGFGNINQKGLEFYDKLINELVENGIEPVVTLYHWDLPQKLQDIGGWANPEIVNYYFEYAMLIINRYKDKVKKWIT
FNEPYCIAFLGHWHGIHAPGIKNFKVAMDVVHNIMLSHFKVVKAVKENNIDVEIGITLNLTPVYLQTERLGYKVSEIERE
MVNLSSQLDNELFLDPVLKGSYPQKLLDYLVQKDLLDSQKVNNMQQEVKENFIFPDFLGINYYTRSVRLYDENSGWIFPI
RWEHPAGEYTEMGWEVFPQGLFDLLIWIKESYPQIPIYITENGAAYNDKVEDGRVHDQKRVEYLKQHFEAARKAIKNGVD
LRGYFVWSLIDNFEWAMGYTKRFGIIYVDYETQKRIKKDSFYFYQQYIKENS
>A7LXT0 3.2.1.177~~~~~~Alpha-xylosidase BoGH31A~~~COG1501
MIMNMKNIFYCLLPGLLLGACSNKVYEKTGDSVIVKVQHKETGGPRLVRLQVMGDKLIHVSATADSKFADPQSLIVVPQK
KQTSFAVVQNGDTITVSTEEVKASVLASTGEVWFTDKNGELILQENKGGGKTFTPIEVEGTKGYTVCQVFESPEDEAFYG
LGQHQADEFNYKGKNEELFQYNTKVSVPFVVSNKNYGILLDSYSFCRFGNPNDYSQLNRIFKLYDKTGQEGALTGTYVPK
KGETLVRREDSIYFENLKTIENLPKKLPLMGAKVTYEGEIEPAQTGEFKFILYYAGYVKVYLNNEPVVPERWRTAWNPNS
YKFAAHLEAGKRVPLKIEWQPDGGQSYCGLRALTPVNPEEQGKQSWWSEMTKQLDYYFMAGENMDDVISGYRSLTGKSPV
MPKWAMGFWQSREKYNTQEEMLGALKGFRDRKIPLDNIVLDWNHWPENAWGSHEFDKARFPDPKAMVDSIHAMHARMMIS
VWPKFYVTTEHFKEFDENGWMYQQSVKDSLKDWVGPGYHYGFYDAYDPDARKLFWKQMYEHYYPLGIDAWWMDASEPNVR
DCTDLEYRKALCGPTALGSSTEFFNAYALMNAEAIYDGQRGVDNNKRVFLLTRSGFAGLQRYSTATWSGDIGTRWEDMKA
QISAGLNFAMSGIPYWTMDIGGFCVENRYVAGQKQWNATKTENADYKEWRELNTRWYQFGAFVPLYRAHGQYPFREIWEI
APEGHPAYQSVVYYTKLRYNMMPYIYSLAGMTWFDDYTIMRPLVMDFTADAEVNDIGDQFMFGPSFMVSPVYRYGDRSRE
IYFPQAEGWYDFYSGKFQAGGERKVIEAPYERIPLYVRAGAIIPFGDDIQYTDEKPAEHIRLYIYQGADGEFTLYEDEGV
NYNYEQGMYAMIPMKYDEATKTLVIGERQGEFPGMLKERTFTVVTVNKEKAQPFDLNAKGVTVKYNGSEQTLKL
>A7LXT8 3.2.1.55~~~~~~Non-reducing end alpha-L-arabinofuranosidase BoGH43A~~~COG3507
MRNALFLIFISLCSVCKSSAQGYSNPVIPGFHPDPSVCKAGDDYYLVNSSFQYFPGVPLFHSKDLVHWEQIGNCLTRPSQ
LDLTNANSGSGIFAPTIRYNDGVFYMITTNVSGKGNFLVHTTDPRSEWSEPVWLEQGGIDPSLYFEDGKCFMVSNPDGYI
NLCEIDPMTGKQLSSSKRIWNGTGGRYAEGPHIYKKDGWYYLLISEGGTELGHKVTIARSRYIDGPYQGNPANPILTHAN
ESGQSSPIQGTGHADLVEGTDGSWWMVCLAYRIMPGTHHTLGRETYLAPVRWDKDAWPVVNSNGTISLKMDVPTLPQQEM
KGRPERIDFKEGKLSPEWIHLQNPEAKNYIFTKDGKLRLIATPVTLSDWKSPTFVALRQEHFDMEASAPVVLQKAGVNDE
AGISVFMEFHSHYDLFVRQDKDRKRSVGLRYKLGEITHYAKEVSLPTDGEVELVVKSDINYYYFGYKVNGIYHDLGKMNT
RYLSTETAGGFTGVVLGLYITSASKDSKAYADFEYFKYKGKPGENK
>A7LXU0 3.2.1.55~~~~~~Non-reducing end alpha-L-arabinofuranosidase BoGH43B~~~COG3507
MMKNSCRLLLILIGLWMANVSLAQKTFRNPIITGMNPDPSICRVGDDFYLVTSTFEYFPGLPVYHSKDLVHWKLIGHALS
RPENNPLMGCNASTGGQYAPTLRYHDGTFYVIGTNYGGKGSQGVFYVTAKNPAGPWSDPVWVGNWYVDPSIEFIDGKMYF
LSPDNQGSFLLGVMDPETGTFVEALRKVASGLGGSSPEGPHFYKIGDYYYIMSAEGGTGYEHREVIQRSKSPWGPYEPSP
VNPVLSNMNCPDHPFQAIGHADLVQLKDGSWWAVCLGIRPVNGKYQHLGRETFLAPVTWDADGWPKVGKDGVVQETYLFP
NLPSHVWMEQPVRDDFDQETLGLDWTFIRNPAHSFWSLTEKPGSLRLKGTAINFTTNDSPSFIGRRQAAFNLTASAKVNF
IPKVENEEAGLVVRADDKNHYDLLITERNGQRVAMIRKTLKDKVVDTTCKELPATGEVILSITATETTYTFEIKAAHVSA
ILGTASTRDVSNEVVGGFTGVFIGMYASGNGQANTNPADFDWFDFRCLD
>P0AF61 3.1.-.-~~~ghoS~~~Endoribonuclease antitoxin GhoS~~~
MEGKNKFNTYVVSFDYPSSYSSVFLRLRSLMYDMNFSSIVADEYGIPRQLNENSFAITTSLAASEIEDLIRLKCLDLPDI
DFDLNIMTVDDYFRQFYK
>P64646 ~~~ghoT~~~Toxin GhoT~~~
MALFSKILIFYVIGVNISFVIIWFISHEKTHIRLLSAFLVGITWPMSLPVALLFSLF
>P84848 ~~~~~~Green heme protein~~~
EEAIDAQALVDQNCTGCHGSEVYTRDERRVESLDALHGQVRMCEQNLELTWFDDQVDAVTTLLNREYYNFEP
>P75913 1.1.1.79~~~ghrA~~~Glyoxylate/hydroxypyruvate reductase A~~~COG0111
MDIIFYHPTFDTQWWIEALRKAIPQARVRAWKSGDNDSADYALVWHPPVEMLAGRDLKAVFALGAGVDSILSKLQAHPEM
LNPSVPLFRLEDTGMGEQMQEYAVSQVLHWFRRFDDYRIQQNSSHWQPLPEYHREDFTIGILGAGVLGSKVAQSLQTWRF
PLRCWSRTRKSWPGVQSFAGREELSAFLSQCRVLINLLPNTPETVGIINQQLLEKLPDGAYLLNLARGVHVVEDDLLAAL
DSGKVKGAMLDVFNREPLPPESPLWQHPRVTITPHVAAITRPAEAVEYISRTIAQLEKGERVCGQVDRARGY
>Q8ZQ30 1.1.1.79~~~ghrA~~~Glyoxylate/hydroxypyruvate reductase A~~~
MEIIFYHPTFNAAWWVNALEKALPHARVREWKVGDNNPADYALVWQPPVEMLAGRRLKAVFVLGAGVDAILSKLNAHPEM
LDASIPLFRLEDTGMGLQMQEYAVSQVLHWFRRFDDYQALKNQALWKPLPEYTREEFSVGIMGAGVLGAKVAESLQAWGF
PLRCWSRSRKSWPGVESYVGREELRAFLNQTRVLINLLPNTAQTVGIINSELLDQLPDGAYVLNLARGVHVQEADLLAAL
DSGKLKGAMLDVFSQEPLPQESPLWRHPRVAMTPHIAAVTRPAEAIDYISRTITQLEKGEPVTGQVDRARGY
>P37666 1.1.1.79~~~ghrB~~~Glyoxylate/hydroxypyruvate reductase B~~~COG1052
MKPSVILYKALPDDLLQRLQEHFTVHQVANLSPQTVEQNAAIFAEAEGLLGSNENVNAALLEKMPKLRATSTISVGYDNF
DVDALTARKILLMHTPTVLTETVADTLMALVLSTARRVVEVAERVKAGEWTASIGPDWYGTDVHHKTLGIVGMGRIGMAL
AQRAHFGFNMPILYNARRHHKEAEERFNARYCDLDTLLQESDFVCLILPLTDETHHLFGAEQFAKMKSSAIFINAGRGPV
VDENALIAALQKGEIHAAGLDVFEQEPLSVDSPLLSMANVVAVPHIGSATHETRYGMAACAVDNLIDALQGKVEKNCVNP
HVAD
>P58000 1.1.1.79~~~tkrA~~~Glyoxylate/hydroxypyruvate reductase B~~~
MKPEVLLYKSLPDDLRARLDEHFTVTAINGLSPETIAEHGGAGARRRHDRLQQHGGSSAAGENAKLRAASTISVGYDNFD
VEALNQRGIVLIDTPTVLTETVADTMMALVLSSARRVVEVAERVKAGEWRRSIGPDWFGIDVHHKKMGILGMGRIGLALA
QRAHHGFGMPILYNARKHHEEAESRFNAQYCDLDTLLRESDFLCISLPLTEQTHHMIGREQLAKMKPSAILINAGRGPVV
DEQALIAALKDKTIHAAGLDVFEQEPLPVDSELLTLPNVVALPHIGSATHETRYGMARDAVDNLIAALAGKVEKNCVNPQ
VLR
>P0AF52 ~~~ghxP~~~Guanine/hypoxanthine permease GhxP~~~COG2252
MSTPSARTGGSLDAWFKISQRGSTVRQEVVAGLTTFLAMVYSVIVVPGMLGKAGFPPAAVFVATCLVAGLGSIVMGLWAN
LPLAIGCAISLTAFTAFSLVLGQHISVPVALGAVFLMGVLFTVISATGIRSWILRNLPHGVAHGTGIGIGLFLLLIAANG
VGLVIKNPLDGLPVALGDFATFPVIMSLVGLAVIIGLEKLKVPGGILLTIIGISIVGLIFDPNVHFSGVFAMPSLSDENG
NSLIGSLDIMGALNPVVLPSVLALVMTAVFDATGTIRAVAGQANLLDKDGQIIDGGKALTTDSMSSVFSGLVGAAPAAVY
IESAAGTAAGGKTGLTAITVGVLFLLILFLSPLSYLVPGYATAPALMYVGLLMLSNVAKIDFADFVDAMAGLVTAVFIVL
TCNIVTGIMIGFATLVIGRLVSGEWRKLNIGTVVIAVALVTFYAGGWAI
>Q46817 ~~~ghxQ~~~Guanine/hypoxanthine permease GhxQ~~~COG2252
MSGDILQTPDAPKPQGALDNYFKITARGSTVRQEVLAGLTTFLAMVYSVIVVPGMLGKAGFPPAAVFVATCLVAGFGSLL
MGLWANLPMAIGCAISLTAFTAFSLVLGQQISVPVALGAVFLMGVIFTAISVTGVRTWILRNLPMGIAHGTGIGIGLFLL
LIAANGVGMVIKNPIEGLPVALGAFTSFPVMMSLLGLAVIFGLEKCRVPGGILLVIIAISIIGLIFDPAVKYHGLVAMPS
LTGEDGKSLIFSLDIMGALQPTVLPSVLALVMTAVFDATGTIRAVAGQANLLDKDNQIINGGKALTSDSVSSIFSGLVGA
APAAVYIESAAGTAAGGKTGLTATVVGALFLLILFLSPLSFLIPGYATAPALMYVGLLMLSNVSKLDFNDFIDAMAGLVC
AVFIVLTCNIVTGIMLGFVTLVVGRVFAREWQKLNIGTVIITAALVAFYAGGWAI
>P37534 ~~~csfB~~~Anti-sigma-G factor Gin~~~
MDETVKLNHTCVICDQEKNRGIHLYTKFICLDCERKVISTSTSDPDYAFYVKKLKSIHTPPLYS
>B1V8K7 3.2.1.n1~~~glaA~~~Alpha-1,3-galactosidase A~~~
MGTATAQPALRPQTSTVIGGLHGAAVLDNTGRTVIDVTDFGADPSGKADSAAAVSAAMAHAKTVGGPTTLHFPTGTYHIW
PERTPKRELYVSNTVGSDQAFRTKNIGILVEDMRDVVVDGGGSRIVNHGFQTVFAAIRSSDVRFTNFSQTWVAPKTVDIT
VADAGVVSGQAYRIIDIPETYDYAVEGTSVRWNGERGPATGQPYWTGTNSFDYSQVHDPATNRTWRTSNPVFPERHEDHR
PRRRQVRITYGDSTAPGDRGYVYQMREVTRDTPGALFWESSRVTVDHLRLGYLHGFGIVGQLSEDIGIDSVTFKADRGSG
RVTSGFADHIQMSGVKGTVRITNSVFDNPQDDPINIHGTYLQATAAERETLQLRYMHNETSGFPQFYPGDTIELVDKRTM
LAAPGATAKVVSVTGPTGSGVPAGTDPDTYLRTMTVVLDRTLPAAVLAAPGDYVAENTTYTPTVEITGNTFQAVPTRGIL
VTTRRPVRIENNRFDGMSMASIYISSDARSWYESGPVRNVTIRGNVFDRPASPVIFFDPTNQDFVAGQPVHRNVLIEDND
FNLTGGTILSGRGVGGLTFRDNRVERYPHLRLTGPSRALRVGDTTTVTTDAPPPSHTSPLFTFDGADDITLANNTYGNGF
NKRVNTANMDVSEITVTADGLALNADSISSAPVAVSYSSSRPKVATVDSEGVVKALSGGTTSITARATIGGVRVTSNPVK
VVVATER
>Q5LGZ8 3.2.1.n1~~~glaB~~~Alpha-1,3-galactosidase B~~~COG5434
MKTILLFALSLLLSLSVSDVCAQERVYDISQFGLKANSKKNASPVVRKAIAKIKAECRDGEKVILRFPAGRYNFHEAGST
VREYYISNHDQDNPKKVGIALEDMKNLTIDGQGSEFVFYGRMIPVSLLRSENCVLKNFSIDFEQPHIAQVQVVENDPEKG
ITFEPAPWVDYRISKDSVFEGLGEGWVMRYSWGIAFDGKTKHVVYNTSDIGCPTKGAFEVAPRRICSPKWKDARLVPGTV
VAMRGWGRPTPGIFMSHDVNTSLLDVKVHYAEGMGLLAQLCEDITLDGFGVCLKGDNDPRYFTTQADATHFSGCKGKIVS
KNGLYEGMMDDAINVHGTYLKVIKRVDDHTLIGRYMHDQSWGFEWGRPGDDVQFVRSETMELIGKQNQITAIRPYDKGEI
RGAREFSITFKEAIDPAINEKSGFGIENLTWTPEVLFAGNTIRNNRARGTLFSTPKKTVVEDNLFDHTSGTAILLCGDCN
GWFETGACRDVTIRRNRFINALTNMFQFTNAVISIYPEIPNLKDQQKYFHGGKDGGIVIEDNEFDTFDAPILYAKSVDGL
IFRNNVIKTNTEFKPFHWNKDRFLLERVTNVKISE
>P76621 1.14.11.64~~~glaH~~~Glutarate 2-hydroxylase~~~
MNALTAVQNNAVDSGQDYSGFTLTPSAQSPRLLELTFTEQTTKQFLEQVAEWPVQALEYKSFLRFRVAKILDDLCANQLQ
PLLLKTLLNRAEGALLINAVGVDDVKQADEMVKLATAVAHLIGRSNFDAMSGQYYARFVVKNVDNSDSYLRQPHRVMELH
NDGTYVEEITDYVLMMKIDEQNMQGGNSLLLHLDDWEHLDNYFRHPLARRPMRFAAPPSKNVSKDVFHPVFDVDQQGRPV
MRYIDQFVQPKDFEEGVWLSELSDAIETSKGILSVPVPVGKFLLINNLFWLHGRDRFTPHPDLRRELMRQRGYFAYASNH
YQTHQ
>Q88IU0 1.14.11.64~~~glaH~~~Glutarate 2-hydroxylase~~~
MNAFTQIDELVMPLPLEPQGYTIAPSKQSPRLLELTFARETVEAFVQAVAQWPVQALEYKSFLRFRVGEILDELCQGTLR
PVLLNTILDRATGGMLITPIGLDDVSQAEDMVKFTTACAHLIGRSNYDAMSGQFYARFVVVNSDNSDSYLRQPHRVMELH
NDGTFVNQITDYVLMLKIDEKNMEGGNSLLLHLDDWEQCEAFFRHPLARREMRWTAPPSKKVAEDVFHSVFDTDAEGRPT
MRYIDQFVQPENYEEGIWLNALSESLEGSGKKVSVPVGVGSFLLINNLFWLHGRDRFTPHEGLRRELMRQRGYVAFPKPL
YQRGQ
>P37338 ~~~glaR~~~HTH-type transcriptional repressor GlaR~~~COG1802
MTITSLDGYRWLKNDIIRGNFQPDEKLRMSLLTSRYALGVGPLREALSQLVAERLVTVVNQKGYRVASMSEQELLDIFDA
RANMEAMLVSLAIARGGDEWEADVLAKAHLLSKLEACDASEKMLDEWDLRHQAFHTAIVAGCGSHYLLQMRERLFDLAAR
YRFIWLRRTVLSVEMLEDKHDQHQTLTAAVLARDTARASELMRQHLLTPIPIIQQAMAGN
>P22094 ~~~gla~~~Glycerol facilitator-aquaporin gla~~~
MDVTWTVKYITEFVGTALLIIMGNGAVANVELKGTKAHAQSWMIIGWGYGLGVMLPAVAFGNITSQINPAFTLGLAASGL
FPWAHVAQYIIAQVLGAMFGQLLIVMVYRPYYLKTQNPNAILGTFSTIDNVDDNSEKTRLGATINGFLNEFLGSFVLFFG
AVAATNIFFGSQSITWMTNYLKGQGADVSSSDVMNQIWVQASGASASKMIAHLFLGFLVMGLVVALGGPTGPGLNPARDF
GPRLVHSLLPKSVLGEAKGSSKWWYAWVPVLAPILASLAAVALFKMIYL
>Q9HXC4 ~~~~~~Glycine-betaine-binding protein~~~
MNRLIRSLCLACAGLFAAGLAQAETLRIGGKTFTEQRILTAITAQFLQKRGYDVTVTTGLGSTLARAAQESGQLDIVWEY
TGSSLIVYNHIDEKLDAAASYRRVKQLDEAQGLVWLKPTRFNNTYALAMPEEQAEHLGIQSVSDLARVLAEQQEAEPGST
HLFAMDPEFAGRPDGLGPMSELYGLHFTRNDIRQMDAGLVYTALKNRQVFLGLVYTTDGRLKDFKLRVLKDDKQYFPFYN
AAPVVRKEVMQRHPEFATLFDPIIERLDDATMQALNARVDIEQQTPQKVAADFLREHHLLDDGQAGQGGSQ
>Q93HX6 4.3.1.9~~~~~~Glucosaminate ammonia-lyase~~~COG0492
MVEVRHSRVIILGSGPAGYSAAVYAARANLKPLLITGMQAGGQLTTTTEVDNWPGDVHGLTGPALMERMREHAERFETEI
VFDHINAVDFAAKPYTLTGDSATYTCDALIIATGASARYLGLPSEEAFMGKGVSACATCDGFFYRNKPVAVVGGGNTAVE
EALYLANIASTVTLIHRRETFRAEKILIDKLNARVAEGKIILKLNANLDEVLGDNMGVTGARLKNNDGSFDELKVDGVFI
AIGHTPNTSLFEGQLTLKDGYLVVQGGRDGNATATSVEGIFAAGDVADHVYRQAITSAGAGCMAALDTERYLDGLQNASE
>Q46839 ~~~glcA~~~Glycolate permease GlcA~~~COG1620
MVTWTQMYMPMGGLGLSALVALIPIIFFFVALAVLRLKGHVAGAITLILSILIAIFAFKMPIDMAFAAAGYGFIYGLWPI
AWIIVAAVFLYKLTVASGQFDIIRSSVISITDDQRLQVLLIGFSFGALLEGAAGFGAPVAITGALLVGLGFKPLYAAGLC
LIANTAPVAFGALGVPILVAGQVTGIDPFHIGAMAGRQLPFLSVLVPFWLVAMMDGWKGVKETWPAALVAGGSFAVTQFF
TSNYIGPELPDITSALVSIVSLALFLKVWRPKNTETAISMGQSAGAMVVNKPSSGGPVPSEYSLGQIIRAWSPFLILTVL
VTIWTMKPFKALFAPGGAFYSLVINFQIPHLHQQVLKAAPIVAQPTPMDAVFKFDPLSAGGTAIFIAAIISIFILGVGIK
KGIGVFAETLISLKWPILSIGMVLAFAFVTNYSGMSTTLALVLAGTGVMFPFFSPFLGWLGVFLTGSDTSSNALFGSLQS
TTAQQINVSDTLLVAANTSGGVTGKMISPQSIAVACAATGMVGRESELFRYTVKHSLIFASVIGIITLLQAYVFTGMLVS
>P0ACL5 ~~~glcC~~~Glc operon transcriptional activator~~~COG2186
MKDERRPICEVVAESIERLIIDGVLKVGQPLPSERRLCEKLGFSRSALREGLTVLRGRGIIETAQGRDSRVARLNRVQDT
SPLIHLFSTQPRTLYDLLDVRALLEGESARLAATLGTQADFVVITRCYEKMLAASENNKEISLIEHAQLDHAFHLAICQA
SHNQVLVFTLQSLTDLMFNSVFASVNNLYHRPQQKKQIDRQHARIYNAVLQRLPHVAQRAARDHVRTVKKNLHDIELEGH
HLIRSAVPLEMNLS
>P0AEP9 1.1.99.14~~~glcD~~~Glycolate oxidase subunit GlcD~~~COG0277
MSILYEERLDGALPDVDRTSVLMALREHVPGLEILHTDEEIIPYECDGLSAYRTRPLLVVLPKQMEQVTAILAVCHRLRV
PVVTRGAGTGLSGGALPLEKGVLLVMARFKEILDINPVGRRARVQPGVRNLAISQAVAPHNLYYAPDPSSQIACSIGGNV
AENAGGVHCLKYGLTVHNLLKIEVQTLDGEALTLGSDALDSPGFDLLALFTGSEGMLGVTTEVTVKLLPKPPVARVLLAS
FDSVEKAGLAVGDIIANGIIPGGLEMMDNLSIRAAEDFIHAGYPVDAEAILLCELDGVESDVQEDCERVNDILLKAGATD
VRLAQDEAERVRFWAGRKNAFPAVGRISPDYYCMDGTIPRRALPGVLEGIARLSQQYDLRVANVFHAGDGNMHPLILFDA
NEPGEFARAEELGGKILELCVEVGGSISGEHGIGREKINQMCAQFNSDEITTFHAVKAAFDPDGLLNPGKNIPTLHRCAE
FGAMHVHHGHLPFPELERF
>P52073 1.1.99.14~~~glcE~~~Glycolate oxidase subunit GlcE~~~COG0277
MLRECDYSQALLEQVNQAISDKTPLVIQGSNSKAFLGRPVTGQTLDVRCHRGIVNYDPTELVITARVGTPLVTIEAALES
AGQMLPCEPPHYGEEATWGGMVACGLAGPRRPWSGSVRDFVLGTRIITGAGKHLRFGGEVMKNVAGYDLSRLMVGSYGCL
GVLTEISMKVLPRPRASLSLRREISLQEAMSEIAEWQLQPLPISGLCYFDNALWIRLEGGEGSVKAARELLGGEEVAGQF
WQQLREQQLPFFSLPGTLWRISLPSDAPMMDLPGEQLIDWGGALRWLKSTAEDNQIHRIARNAGGHATRFSAGDGGFAPL
SAPLFRYHQQLKQQLDPCGVFNPGRMYAEL
>P52074 1.1.99.14~~~glcF~~~Glycolate oxidase iron-sulfur subunit~~~COG0247
MQTQLTEEMRQNARALEADSILRACVHCGFCTATCPTYQLLGDELDGPRGRIYLIKQVLEGNEVTLKTQEHLDRCLTCRN
CETTCPSGVRYHNLLDIGRDIVEQKVKRPLPERILREGLRQVVPRPAVFRALTQVGLVLRPFLPEQVRAKLPAETVKAKP
RPPLRHKRRVLMLEGCAQPTLSPNTNAATARVLDRLGISVMPANEAGCCGAVDYHLNAQEKGLARARNNIDAWWPAIEAG
AEAILQTASGCGAFVKEYGQMLKNDALYADKARQVSELAVDLVELLREEPLEKLAIRGDKKLAFHCPCTLQHAQKLNGEV
EKVLLRLGFTLTDVPDSHLCCGSAGTYALTHPDLARQLRDNKMNALESGKPEMIVTANIGCQTHLASAGRTSVRHWIEIV
EQALEKE
>P0AEQ1 ~~~glcG~~~Protein GlcG~~~COG3193
MKTKVILSQQMASAIIAAGQEEAQKNNWSVSIAVADDGGHLLALSRMDDCAPIAAYISQEKARTAALGRRETKGYEEMVN
NGRTAFVTAPLLTSLEGGVPVVVDGQIIGAVGVSGLTGAQDAQVAKAAAAVLAK
>I7FJX8 2.7.1.8~~~~~~Glucosamine kinase~~~
MIELDRLDLGGGRRLVITSEPDAAVPQVRDADGHWRRAGPGDGVAEAMLDALNQNPGTTKHGNFTLLSWASQTARGERPI
TVDQTNESVIVGDAAVVKWATHLQEGPHPAPARIKALRGNGFRGMPMPWGLVTWQTADHPETLVVTVDEYLPDAVDGWTW
AVALVTDAAQDRAAVPALVDAVTAVGCVVAELHAAQADTARPATAADARSWREAALETVETAATLGTSVSGELLRARRED
VEAVVGTLGDLAGIPVLAGHGDLHVGQVLRAGGRYVVTDFDGNPVLPAEARVKPVPAALDVAGMAQSLAHVAIVACKYTE
LAPAALADVDRLARTTFVGAYTDRLETLGHRSVYDPAPLRALRLQQVLREIIYAARHLPRWMYVPDAALPALLDEGTST
>A0A1H7TQR5 2.7.1.8~~~~~~Glucosamine kinase~~~COG3281
MTPNWSELVAAADPALVLPSGERRAEVAVPGPLRLDALLDLGEGHAVGVVRSADAARWTVPLVRDGAGGVRRSRPGDGTA
EHLVAALARRGATPDAAFVLEAFTGAAPVTGERGIIVDQTNESVIVGECAVVKWAVRLPAEGEPGSPAAQRIAALARGGF
TEMPRPWGLLTLAEGAQPVLLASVVAYLPGALDGWDWAVDDVRRLARGELTMDQALLPAAQLGTLTARMHAALAARGRTP
ATAADVAAWGVRMREELDEAVASVPGAEGERLKAWAPRIADVYAELDALAGTPLIDVHGDFHVGQILRADGRYAVVDFDG
NPVLPADQRAARQPAALDVVGMTASLDHVGRVVVFRTPDVDPAPVRAWIAAAQRSFLDAYRTTLARLDADDLFDDRLLTP
LRYAQEVREYLYAVRHLPHWVYVPDLSLTDLLPERLKD
>O07563 ~~~glcP~~~Glucose/mannose transporter GlcP~~~COG0738
MLRGTYLFGYAFFFTVGIIHISTGSLTPFLLEAFNKTTDDISVIIFFQFTGFLSGVLIAPLMIKKYSHFRTLTLALTIML
VALSIFFLTKDWYYIIVMAFLLGYGAGTLETTVGSFVIANFESNAEKMSKLEVLFGLGALSFPLLINSFIDINNWFLPYY
CIFTFLFVLFVGWLIFLSKNREYAKNANQQVTFPDGGAFQYFIGDRKKSKQLGFFVFFAFLYAGIETNFANFLPSIMINQ
DNEQISLISVSFFWVGIIIGRILIGFVSRRLDFSKYLLFSCSCLIVLLIAFSYISNPILQLSGTFLIGLSIAGIFPIALT
LASIIIQKYVDEVTSLFIASASFGGAIISFLIGWSLNQDTILLTMGIFTTMAVILVGISVKIRRTKTEDPISLENKASKT
Q
>A0A0H2VG78 ~~~glcP~~~Glucose transporter GlcP~~~COG2814
MKANKYLIFILGALGGLLYGYDNGVISGALLFIHKDIPLNSTTEGIVVSSMLIGAIVGAGSSGPLADKLGRRRLVMLIAI
VFIIGALILAASTNLALLIIGRLIIGLAVGGSMSTVPVYLSEMAPTEYRGSLGSLNQLMITIGILAAYLVNYAFADIEGW
RWMLGLAVVPSVILLVGIYFMPESPRWLLENRNEEAARQVMKITYDDSEIDKELKEMKEINAISESTWTVIKSPWLGRIL
IVGCIFAIFQQFIGINAVIFYSSSIFAKAGLGEAASILGSVGIGTINVLVTIVAIFVVDKIDRKKLLVGGNIGMIASLLI
MAILIWTIGIASSAWIIIVCLSLFIVFFGISWGPVLWVMLPELFPMRARGAATGISALVLNIGTLIVSLFFPILSDALST
EWVFLIFAFIGVLAMIFVIKFLPETRGRSLEEIEYELRERTGARTE
>P94591 ~~~glcR~~~HTH-type transcriptional repressor GlcR~~~COG1349
MYQEERLVAILDFLKQHNRITTEQICTLLQVSRDTARRDLVKLEEQNAIIRTRGGAILPTVHQKIQSYSGRLKTVSEEKN
KIGRLAASLIHDGDRVILDASTTVQACAKHLNAVDCTVITNSINLADVLSDKEGIEIYLLGGKLEKEHRFLYGSSVIEKL
SSYHVDKALIGVVGISEHGITIAHEEDGMVKRKMIQQAKQVIALADHSKLGSTSFYQYAELNEIDLLITDRLPNQAFCDL
LDRNGVELLVTEQDEGKD
>Q89YS5 3.1.6.-~~~~~~N-acetylglucosamine-6-O-sulfatase~~~COG3119
MPATEKASAPHWSFLSSDVISIMKSNPSTLLLPLAALSLASCANPQKEETKRPNIIFMMTDDHTTQAMSCYGGNLIQTPN
MDRIANEGIRFDNCYAVNALSGPSRACILTGKFSHENGFTDNASTFNGDQQTFPKLLQQAGYQTAMIGKWHLISEPQGFD
HWSILSGQHEQGDYYDPDFWEDGKHIVEKGYATDIITDKAINFLENRDKNKPFCMMYHQKAPHRNWMPAPRHLGIFNNTI
FPEPANLFDDYEGRGKAAREQDMSIEHTLTNDWDLKLLTREEMLKDTTNRLYSVYKRMPSEVQDKWDSAYAQRIAEYRKG
DLKGKALISWKYQQYMRDYLATVLAVDENIGRLLNYLEKIGELDNTIIVYTSDQGFFLGEHGWFDKRFMYEECQRMPLII
RYPKAIKAGSTSSAISMNVDFAPTFLDFAGVEVPSDIQGASLKPVLENEGKTPADWRKAAYYHYYEYPAEHSVKRHYGIR
TQDFKLIHFYNDIDEWEMYDMKADPREMNNIFGKAEYAKKQKELMQLLEETQKQYKDNDPDEKETVLFKGDRRLMENR
>O31691 ~~~glcT~~~PtsGHI operon antiterminator~~~COG3711
MNGSFTVKKVLNNNVLIASHHKYSEVVLIGKGIGFGKKQDDVIEDKGYDKMFILKDEKEQKQFKKLLDYVDEKLVDISND
VIYHISNRTNHSLNEHIHIALTDHIAFAIKRQQQGFDMKNPFLMETQSLYPEEYQIAKEVIDMINEKAGLCLPEGEIGFI
ALHIHSALTNRPLSEVNQHSQLMAQLVEVIEDSFQMKVNKESVNYLRLIRHIRFTIERIKKEEPTKEPEKLMLLLKNEYP
LCYNTAWKLIKILQQTLKKPVHEAEAVYLTLHLYRLTNKIS
>A6M9B7 2.4.1.-~~~wclY~~~O-antigen biosynthesis glycosyltransferase WclY~~~
MKIAYVVSSKKKCGPNIVILNIVKELANKHEMEIFFLDESDDDVFECVNVKSTQIKKASDLKEHLKRFDIIHSSGIRPDA
LVVLCKVIYRVKCKIITTIHNYVFQDLYYSYGLVKSLIGGLLWCSIWLFFDKLVILSKNADNYYWFLPSAKKNIIYNGID
DNDCLQNKKCNYRKEFNIPDDGILAGSCANLTKRKGIDLVIQTLTKEHKIYYIVAGNGIEKHNLINLVKARKLHERVYFI
DFLDEPESFMSQLDVFLMPSRSEGFGLTVLESTKLGIPVITSNIPIFMELFDQMCLTFDIKNPSTLIDVITYAKKNRLHL
SQKFHAIFQDRFTSSKMATKYENVYNNLFREVL
>O33618 ~~~glcT~~~GlcA/glcB genes antiterminator~~~COG3711
MSNYVIEKTLNNNVIICTDENQHHEVVLIGKGIGFNKKKGMELSDSVMIDKVYKLEQKKDQDHYKALVEIADDNVLQTII
EAMDIITHADRTVVDKDLMVALTDHILFAYKRIKQHQFIKNPFLIETKQLYSESYQIAVSVIEHLNKLLDIEFPEDEIGF
IALHIASSKDDLSLHEVRLTNEIINKSILIMEHDLKYKIDTNSIQYQRFIRHIQFLIRRLQKGEIIQVNDEFGNMLKAHY
PLCYNIAVKIIKMMQQHLDVEVYEAELIYLTLHINHFTQQNEKNTNV
>P40420 ~~~glcU~~~Glucose uptake protein GlcU~~~COG4975
MDLLLALLPALFWGSIVLFNVKLGGGPYSQTLGTTIGALIVSIVIYFFVQPVLSLRIFIVGIVSGLFWSLGQANQLKSIQ
LMGVSKTMPISTGMQLVSTSLFGVIVFREWSTPIAITLGVLALIFIIVGIILTSLEDKNDKKEGEPSNLKKGILILLVST
LGYLVYVVVARLFNVSGWSALLPQAIGMVVGGLVLTYRHKPFNKYAIRNILPGLIWAGGNMFLFISQPRVGVATSFSLSQ
MGIVISTLGGIFILREKKTKRQLIAIAIGIILIIAAAVFLGIAKTNS
>P40419 ~~~glcU~~~Probable glucose uptake protein GlcU~~~
MDIFLAVLPAIFWGSIVLFNVKLGGGPYSQTLGTTLGALIFSIGIYIFVHPTFTPLIFGVGVVSGLFWAVGQSNQLKSID
LIGVSKTMPISTGLQLVSTSLFGVIVFHEWSTKTSIILGVLALIFIIVGIVLASLQSKEEKEAEEGKGNFKKGIVILLIS
TVGYLVYVVVARLFNVDGWSALLPQAIGMVIGGVLLTFKHKPFNKYAIRNIIPGLIWAAGNMFLFISQPKVGVATSFSLS
QMGIVISTLGGIIILGEKKTKRQLVGIIIGIILIIIAGVMLGLAKS
>P45511 1.1.1.6~~~dhaD~~~Glycerol dehydrogenase~~~
MLKVIQSPAKYLQGPDASTLFGQYAKNLADSFFVIADDFVMKLAGEKVLNGLHSHDISCHAERFNGECSHIEINRLIAIL
KQHGCRGVVGIGGGKTLDTAKAIGYYQKLPVVVIPTIASTDAPTSALSVIYTEAGEFEEYLIYPKNPDMVVMDTAIIAKA
PVRLLVAGMGDALSTWFEAKACYDARATSMAGGQSTVAALSLARLCYDTLLAEGEKARFAAQAGVVTDALERIVEANTYL
SGIGFESSGLAGAHAIHNGFTILEECHHLYHGEKVAFGTLAQLVLQNSPMEEIETVLNFCQKVGLPVTLAEMGVKDDIDG
KIMAVAKATCAEGETIHNMPFSVTPESVHAAILTADLLGQQWLAR
>P0A9S5 1.1.1.6~~~gldA~~~Glycerol dehydrogenase~~~COG0371
MDRIIQSPGKYIQGADVINRLGEYLKPLAERWLVVGDKFVLGFAQSTVEKSFKDAGLVVEIAPFGGECSQNEIDRLRGIA
ETAQCGAILGIGGGKTLDTAKALAHFMGVPVAIAPTIASTDAPCSALSVIYTDEGEFDRYLLLPNNPNMVIVDTKIVAGA
PARLLAAGIGDALATWFEARACSRSGATTMAGGKCTQAALALAELCYNTLLEEGEKAMLAAEQHVVTPALERVIEANTYL
SGVGFESGGLAAAHAVHNGLTAIPDAHHYYHGEKVAFGTLTQLVLENAPVEEIETVAALSHAVGLPITLAQLDIKEDVPA
KMRIVAEAACAEGETIHNMPGGATPDQVYAALLVADQYGQRFLQEWE
>P32816 1.1.1.6~~~gldA~~~Glycerol dehydrogenase~~~
MAAERVFISPAKYVQGKNVITKIANYLEGIGNKTVVIADEIVWKIAGHTIVNELKKGNIAAEEVVFSGEASRNEVERIAN
IARKAEAAIVIGVGGGKTLDTAKAVADELDAYIVIVPTAASTDAPTSALSVIYSDDGVFESYRFYKKNPDLVLVDTKIIA
NAPPRLLASGIADALATWVEARSVIKSGGKTMAGGIPTIAAEAIAEKCEQTLFKYGKLAYESVKAKVVTPALEAVVEANT
LLSGLGFESGGLAAAHAIHNGFTALEGEIHHLTHGEKVAFGTLVQLALEEHSQQEIERYIELYLSLDLPVTLEDIKLKDA
SREDILKVAKAATAEGETIHNAFNVTADDVADAIFAADQYAKAYKEKHRK
>Q9WYQ4 1.1.1.6~~~gldA~~~Glycerol dehydrogenase~~~COG0371
MITTTIFPGRYVQGAGAINILEEELSRFGERAFVVIDDFVDKNVLGENFFSSFTKVRVNKQIFGGECSDEEIERLSGLVE
EETDVVVGIGGGKTLDTAKAVAYKLKKPVVIVPTIASTDAPCSALSVIYTPNGEFKRYLFLPRNPDVVLVDTEIVAKAPA
RFLVAGMGDALATWFEAESCKQKYAPNMTGRLGSMTAYALARLCYETLLEYGVLAKRSVEEKSVTPALEKIVEANTLLSG
LGFESGGLAAAHAIHNGLTVLENTHKYLHGEKVAIGVLASLFLTDKPRKMIEEVYSFCEEVGLPTTLAEIGLDGVSDEDL
MKVAEKACDKNETIHNEPQPVTSKDVFFALKAADRYGRMRKNLT
>Q8KRP0 ~~~gldH~~~Gliding motility lipoprotein GldH~~~
MRIKNSGILLLAAILLFSCDKKRVFDEYKSVGSAWHKDSVVTFDLPVLDSTKKYNLFVNLRDNNNYPFNNLFLIVAIETP
SGFTKVDTLEYQMANPDGTLMGNGFTDIKESKLYYKEDVKFKGKYKVHIKQAVRESGKIPGVEALEGITDVGFRIEQKD
>Q8G8Y2 2.6.1.100~~~btrR~~~L-glutamine:2-deoxy-scyllo-inosose aminotransferase~~~
MTIPFDHWPEWPQHSDRTRRKIEEVFQSNRWAISGYWTGEESMERKFAKAFADFNGVPYCVPTTSGSTALMLALEALGIG
EGDEVIVPSLTWIATATAVLNVNALPVFVDVEADTYCIDPQLIKSAITDKTKAIIPVHLFGSMANMDEINEIAQEHNLFV
IEDCAQSHGSVWNNQRAGTIGDIGAFSCQQGKVLTAGEGGIIVTKNPRLFELIQQLRADSRVYCDDSSELMHGDMQLVKK
GDIQGSNYCLSEFQSAILLDQLQELDDKNAIREKNAMFLNDALSKIDGIKVMKRPPQVSRQTYYGYVFRFDPVKFGGLNA
DQFCEILREKLNMGTFYLHPPYLPVHKNPLFCPWTKNRYLKSVRKTEAYWRGLHYPVSERASGQSIVIHHAILLAEPSHL
SLLVDAVAELARKFCVTH
>Q53U20 2.6.1.100~~~neoB~~~L-glutamine:2-deoxy-scyllo-inosose aminotransferase~~~
MVSPLAVKGGEALRTRPWPAWPQPAPGVPAAVAEVLGSGRWSISGPYRGTDSHERRFARAFADYHGVPYCVPAASGTAGL
MLALEACGVGAGDEVIVPGLSWVASGSTVLGVNAVPVFCDVDPDTLCVSPEAVEALITERTRAVVVVHLYSAVADMDGLT
RVAERHGLPLVEDCAQAHGASYRGVKVGALATAGTFSMQHSKVLTSGEGGAVITRDADLARRVEHLRADGRCLSDGPPAP
GAMELVETGELMGSNRCLSEFQAAILTEQLTLLDEQNRTRRANAARLDGLLGELGLRPQATSEGTTSRTYYTYAARLPEG
ALEDVPLTDVTGALTAELGFPVQPCYAPIPANRLYAPQTRRRYTLGPDHEARIDPKRFALPVCEDTARRTVTLHHAALLG
DAEDMADIAAAFAKVLRHGADLAT
>Q6L739 2.6.1.100~~~kanB~~~L-glutamine:2-deoxy-scyllo-inosose aminotransferase~~~
MPLQSSRLAVDNGTPVRGKPWPVWPQPTDGTLDALSRVLRSGRWAISGPYRGVESAERRFARRFADYHRIAHCVPASSGT
ASLMLALEACGVGAGDEVILPGVTWVASASTVVGVNAVPVFADIDPDTLCLDPDAVEAAITPATKAIVVVHLYAAVADLT
RLKEVADRHGIVLIEDCAQAHGAEFEGHKVGTFGAVGTFSMQQSKVLTSGEGGAAITADPVLARRMEHLRADGRCYRDQA
PPSGHMELVETGELMGSNRCISEFQAAVLTEQLGELDRFNALRRHNAELLDALLTDVGYRPQRSTPGTTARTYYTYVAEL
PDAELPGADITKVTEALTAELGFPVAPAYSPLNANPLYDPASRSRFALGPQHEKLIDPARFVLPVSGRLTRRLVTFHHAA
LLGDESDMRDIAEAFTKVLQHRAVLAA
>Q4R0W2 2.6.1.100~~~rbmB~~~L-glutamine:2-deoxy-scyllo-inosose aminotransferase~~~
MVSQLAVKGGEALRTRPWPAWPQPAPGVPDAVADVLGSGRWSISGPYRGTESYERRFARAFAAYNGVPHCVPAASGTASL
MLALEACGIGAGDEVIVPGLSWVASGSTILGVNAVPIFCDVDPDTLCLSPEAVEAAITEHTRAIVVVHLYSALADMDALS
AIAERHGLPLIEDCAQAHGATYRGVKVGALATAGTFSMQHSKVLTSGEGGAVITRDEDFARRVEHLRADGRCLSAVPPAP
GAMELVETGELMGNNRCLSEFQAAILAEQLTILDEQNETRRANAAHLDGLLGELGLRPQTTSDGTTSRTYYTYAVRLPDG
VLEDVPVTDVSCALTAELGFPVLPSYAPIPANRLYTPHTRRRYTLGLDHERRIDPKRFALPVCEDAARRTVTLHHAALLG
DADDMGDIAAAFAKVLRHGAGLMH
>Q48485 5.4.99.9~~~rfbD~~~UDP-galactopyranose mutase~~~
MKSKKILIVGAGFSGAVIGRQLAEKGHQVHIIDQRDHIGGNSYDARDSETNVMVHVYGPHIFHTDNETVWNYVNKHAEMM
PYVNRVKATVNGQVFSLPINLHTINQFFSKTCSPDEARALIAEKGDSTIADPQTFEEQALRFIGKELYEAFFKGYTIKQW
GMQPSELPASILKRLPVRFNYDDNYFNHKFQGMPKCGYTQMIKSILNHENIKVDLQREFIVEERTHYDHVFYSGPLDAFY
GYQYGRLGYRTLDFKKFTYQGDYQGCAVMNYCSVDVPYTRITEHKYFSPWEQHDGSVCYKEYSRACEENDIPYYPIRQMG
EMALLEKYLSLAENETNITFVGRLGTYRYLDMDVTIAEALKTAEVYLNSLTENQPMPVFTVSVR
>A0R5Z2 2.4.1.287~~~glfT1~~~Galactofuranosyltransferase GlfT1~~~COG1216
MTHTEVVCAVVVTHRRRELLATSLDAVVSQDRKPDHLIVVDNDNDPQVRELVTGQPVPSTYLGSRRNLGGAGGFALGMLH
ALALGADWIWLADDDGRPADTTVLSTLLSCAHTHSLAEVSPMVCNLDDPQRLAFPLRRGLVWRRLTSELRTDSSSSSGDL
LPGIASLFNGALFRADTVDAVGVPDLRLFVRGDEVELHRRLVRSGLPFGTCLTASYLHPCGTDEFKPILGGRMHTQYPDD
ETKRFFTYRNRGYLLSQPGLRKLLPQEWLRFGWYFLVSRRDLAGLREWIRLRRLGRRERFQR
>P9WMX3 2.4.1.287~~~glfT1~~~Galactofuranosyltransferase GlfT1~~~COG1216
MTESVFAVVVTHRRPDELAKSLDVLTAQTRLPDHLIVVDNDGCGDSPVRELVAGQPIATTYLGSRRNLGGAGGFALGMLH
ALAQGADWVWLADDDGHAQDARVLATLLACAEKYSLAEVSPMVCNIDDPTRLAFPLRRGLVWRRRASELRTEAGQELLPG
IASLFNGALFRASTLAAIGVPDLRLFIRGDEVEMHRRLIRSGLPFGTCLDAAYLHPCGSDEFKPILCGRMHAQYPDDPGK
RFFTYRNRGYVLSQPGLRKLLAQEWLRFGWFFLVTRRDPKGLWEWIRLRRLGRREKFGKPGGSA
>O53585 2.4.1.288~~~glfT2~~~Galactofuranosyltransferase GlfT2~~~COG1216
MSELAASLLSRVILPRPGEPLDVRKLYLEESTTNARRAHAPTRTSLQIGAESEVSFATYFNAFPASYWRRWTTCKSVVLR
VQVTGAGRVDVYRTKATGARIFVEGHDFTGTEDQPAAVETEVVLQPFEDGGWVWFDITTDTAVTLHSGGWYATSPAPGTA
NIAVGIPTFNRPADCVNALRELTADPLVDQVIGAVIVPDQGERKVRDHPDFPAAAARLGSRLSIHDQPNLGGSGGYSRVM
YEALKNTDCQQILFMDDDIRLEPDSILRVLAMHRFAKAPMLVGGQMLNLQEPSHLHIMGEVVDRSIFMWTAAPHAEYDHD
FAEYPLNDNNSRSKLLHRRIDVDYNGWWTCMIPRQVAEELGQPLPLFIKWDDADYGLRAAEHGYPTVTLPGAAIWHMAWS
DKDDAIDWQAYFHLRNRLVVAAMHWDGPKAQVIGLVRSHLKATLKHLACLEYSTVAIQNKAIDDFLAGPEHIFSILESAL
PQVHRIRKSYPDAVVLPAASELPPPLHKNKAMKPPVNPLVIGYRLARGIMHNLTAANPQHHRRPEFNVPTQDARWFLLCT
VDGATVTTADGCGVVYRQRDRAKMFALLWQSLRRQRQLLKRFEEMRRIYRDALPTLSSKQKWETALLPAANQEPEHG
>Q99XR4 2.1.2.5~~~~~~Glutamate formimidoyltransferase~~~
MAKIVECIPNFSEGQNQAVIDGLVATAKSIPGVTLLDYSSDASHNRSVFTLVGDDQSIQEAAFQLVKYASENIDMTKHHG
EHPRMGATDVCPFVPIKDITTQECVEISKQVAERINRELGIPIFLYEDSATRPERQNLAKVRKGQFEGMPEKLLEEDWAP
DYGDRKIHPTAGVTAVGARMPLVAFNVNLDTDNIDIAHKIAKIIRGSGGGYKYCKAIGVMLEDRHIAQVSMNMVNFEKCS
LYRTFETIKFEARRYGVNVIGSEVIGLAPAKALIDVAEYYLQVEDFDYHKQILENHLLG
>P37747 5.4.99.9~~~glf~~~UDP-galactopyranose mutase~~~COG0562
MYDYIIVGSGLFGAVCANELKKLNKKVLVIEKRNHIGGNAYTEDCEGIQIHKYGAHIFHTNDKYIWDYVNDLVEFNRFTN
SPLAIYKDKLFNLPFNMNTFHQMWGVKDPQEAQNIINAQKKKYGDKVPENLEEQAISLVGEDLYQALIKGYTEKQWGRSA
KELPAFIIKRIPVRFTFDNNYFSDRYQGIPVGGYTKLIEKMLEGVDVKLGIDFLKDKDSLASKAHRIIYTGPIDQYFDYR
FGALEYRSLKFETERHEFPNFQGNAVINFTDANVPYTRIIEHKHFDYVETKHTVVTKEYPLEWKVGDEPYYPVNDNKNME
LFKKYRELASREDKVIFGGRLAEYKYYDMHQVISAALYQVKNIMSTD
>P9WIQ1 5.4.99.9~~~glf~~~UDP-galactopyranose mutase~~~COG0562
MQPMTARFDLFVVGSGFFGLTIAERVATQLDKRVLVLERRPHIGGNAYSEAEPQTGIEVHKYGAHLFHTSNKRVWDYVRQ
FTDFTDYRHRVFAMHNGQAYQFPMGLGLVSQFFGKYFTPEQARQLIAEQAAEIDTADAQNLEEKAISLIGRPLYEAFVKG
YTAKQWQTDPKELPAANITRLPVRYTFDNRYFSDTYEGLPTDGYTAWLQNMAADHRIEVRLNTDWFDVRGQLRPGSPAAP
VVYTGPLDRYFDYAEGRLGWRTLDFEVEVLPIGDFQGTAVMNYNDLDVPYTRIHEFRHFHPERDYPTDKTVIMREYSRFA
EDDDEPYYPINTEADRALLATYRARAKSETASSKVLFGGRLGTYQYLDMHMAIASALNMYDNVLAPHLRDGVPLLQDGA
>P21906 ~~~glf~~~Glucose facilitated diffusion protein~~~COG0477
MSSESSQGLVTRLALIAAIGGLLFGYDSAVIAAIGTPVDIHFIAPRHLSATAAASLSGMVVVAVLVGCVTGSLLSGWIGI
RFGRRGGLLMSSICFVAAGFGAALTEKLFGTGGSALQIFCFFRFLAGLGIGVVSTLTPTYIAEIAPPDKRGQMVSGQQMA
IVTGALTGYIFTWLLAHFGSIDWVNASGWCWSPASEGLIGIAFLLLLLTAPDTPHWLVMKGRHSEASKILARLEPQADPN
LTIQKIKAGFDKAMDKSSAGLFAFGITVVFAGVSVAAFQQLVGINAVLYYAPQMFQNLGFGADTALLQTISIGVVNFIFT
MIASRVVDRFGRKPLLIWGALGMAAMMAVLGCCFWFKVGGVLPLASVLLYIAVFGMSWGPVCWVVLSEMFPSSIKGAAMP
IAVTGQWLANILVNFLFKVADGSPALNQTFNHGFSYLVFAALSILGGLIVARFVPETKGRSLDEIEEMWRSQK
>P0A3F3 2.4.1.21~~~glgA1~~~Glycogen synthase 1~~~COG0297
MNVLSVSSEIYPLIKTGGLADVVGALPIALEAHGVRTRTLIPGYPAVKAAVTDPVKCFEFTDLLGEKADLLEVQHERLDL
LILDAPAYYERSGGPYLGQTGKDYPDNWKRFAALSLAAARIGAGVLPGWRPDMVHAHDWQAAMTPVYMRYAETPEIPSLL
TIHNIAFQGQFGANIFSKLALPAHAFGMEGIEYYNDVSFLKGGLQTATALSTVSPSYAEEILTAEFGMGLEGVIGSRAHV
LHGIVNGIDADVWNPATDHLIHDNYSAANLKNRALNKKAVAEHFRIDDDGSPLFCVISRLTWQKGIDLMAEAVDEIVSLG
GRLVVLGAGDVALEGALLAAASRHHGRVGVAIGYNEPLSHLMQAGCDAIIIPSRFEPCGLTQLYALRYGCIPVVARTGGL
ADTVIDANHAALASKAATGVQFSPVTLDGLKQAIRRTVRYYHDPKLWTQMQKLGMKSDVSWEKSAGLYAALYSQLISKGH
>P39125 2.4.1.21~~~glgA~~~Glycogen synthase~~~COG0297
MKILFAVSECTPFVKSGGLADVAGALPKALARLGNEVAVMLPKYSQIPEPWKKRMKKQAECTVAVGWRQQYCGIEHMAEN
DVNYYFIDNEYYFNRDSLYGHYDDGERFAFFSRAVLEAAKVVNVQADIVHTHDWHTAMVNYLLKEEYRKHPFYERMKSVL
TIHNLQFQGIFPPDVTHDLLGLEMDHFHYERLECNGFVNFMKAGIIAADHVTTVSPTYRNEIMTPYYGEQLEQVLQYRED
DVTGILNGIDDTFYQPKSDPYIEAQYDSGDLACKLENKTKLQQRMGLPEKNDIPLISMVTRLTKQKGLDLVRRIMHELLE
EQDIQLVVLGTGEREFEDYFRYAEFAFHEKCRAYIGFDEPLAHQIYAGSDMFLMPSKFEPCGLGQLIALQYGAIPIVRET
GGLYDTVRAYQEEEGTGNGFTFSAFNAHDLKFTIERALSFYCQQDVWKSIVKTAMNADYSWGKSAKEYQRIFEQVTRSGR
DVLE
>P0A6U8 2.4.1.21~~~glgA~~~Glycogen synthase~~~COG0297
MQVLHVCSEMFPLLKTGGLADVIGALPAAQIADGVDARVLLPAFPDIRRGVTDAQVVSRRDTFAGHITLLFGHYNGVGIY
LIDAPHLYDRPGSPYHDTNLFAYTDNVLRFALLGWVGAEMASGLDPFWRPDVVHAHDWHAGLAPAYLAARGRPAKSVFTV
HNLAYQGMFYAHHMNDIQLPWSFFNIHGLEFNGQISFLKAGLYYADHITAVSPTYAREITEPQFAYGMEGLLQQRHREGR
LSGVLNGVDEKIWSPETDLLLASRYTRDTLEDKAENKRQLQIAMGLKVDDKVPLFAVVSRLTSQKGLDLVLEALPGLLEQ
GGQLALLGAGDPVLQEGFLAAAAEYPGQVGVQIGYHEAFSHRIMGGADVILVPSRFEPCGLTQLYGLKYGTLPLVRRTGG
LADTVSDCSLENLADGVASGFVFEDSNAWSLLRAIRRAFVLWSRPSLWRFVQRQAMAMDFSWQVAAKSYRELYYRLK
>P30537 2.4.1.18~~~glgB~~~1,4-alpha-glucan branching enzyme GlgB~~~
MIAANPTDLEVYLFHEGRLYQSYELFGAHVIRGGGAVGTRFCVWAPHAREVRLVGSFNDWNGTNSPLTKVNDEGVWTIVV
PENLEGHLYKYEIITPDGRVLLKADPYAFYSELRPHTASIVYDLKGYEWNDSPWQRKKRRKRIYDQPMVIYELHFGSWKK
KPDGRFYTYREMADELIPYVLERGFTHIELLPLVEHPLDRSWGYQGTGYYSVTSRYGTPHDFMYFVDRCHQAGLGVIIDW
VPGHFCKDAHGLYMFDGAPTYEYANEKDRENYVWGTANFDLGKPEVRSFLISNALFWLEYYHVDGFRVDAVANMLYWPNN
DRLYENPYAVEFLRQLNEAVFAYDPNVWMIAEDSTDWPRVTAPTYDGGLGFNYKWNMGWMNDMLKYMETPPHERKYAHNQ
VSFSLLYAYSENFILPFSHDEVVHGKKSLLNKMPGSYEEKFAQLRLLYGYMMAHPGKKLLFMGSEFAQFDEWKFAEELDW
VLFDFELHRKMDEYVKQLIACYKRYKPFYELDHDPRGFEWIDVHNAEQSIFSFIRRGKKEGDVLVIVCNFTNQAYDDYKV
SVPLLAPYREVLNSDAAEFGGSGHVNGKRLPAFSEPFHGKPYHVRMTIPPFGISILRPVQKRGERKQNEEEVHRHVIGRR
ARKPASLADEKHRETSRAVWGEVPDH
>P39118 2.4.1.18~~~glgB~~~1,4-alpha-glucan branching enzyme GlgB~~~COG0296
MAAASPTAHDVYLFHEGSLFKSYQLFGSHYRELNGKSGYEFCVWAPHASEVRVAGDFNSWSGEEHVMHRVNDNGIWTLFI
PGIGEKERYKYEIVTNNGEIRLKADPYAIYSEVRPNTASLTYDLEGYSWQDQKWQKKQKAKTLYEKPVFIYELHLGSWKK
HSDGRHYSYKELSQTLIPYIKKHGFTHIELLPVYEHPYDRSWGYQGTGYYSPTSRFGPPHDLMKFVDECHQQNIGVILDW
VPGHFCKDAHGLYMFDGEPLYEYKEERDRENWLWGTANFDLGKPEVHSFLISNALYWAEFYHIDGFRVDAVANILYWPNQ
DERHTNPYAVDFLKKLNQTMREAYPHVMMIAEDSTEWPQVTGAVEEGGLGFHYKWNMGWMNDVLKYMETPPEERRHCHQL
ISFSLLYAFSEHFVLPFSHDEVVYGKKSLLNKMPGDYWQKFAQYRLLLGYMTVHPGKKLIFMGSEFAQFDEWKDTEQLDW
FLDSFPMHQKASVFTQDLLRFYQKSKILYEHDHRAQSFEWIDVHNDEQSIFSFIRYGQKHGEALVIICNFTPVVYHQYDV
GVPFFTQYIEVLNSDSETYGGSGQINKKPLSAKKGALHHKPCYITMTIPPYGISILRAVKKRGEIKR
>P30539 2.4.1.18~~~glgB~~~1,4-alpha-glucan branching enzyme GlgB~~~
MSQKVFISEDDEYLFGQGTHYDIYDKLGAHPSEEKGKKGFFFAVWAPNAADVHVVGDFNGWDENAHQMKRSKTGNIWTLF
IPGVAIGALYKFLITAQDGRKLYKADPYANYAELRPGNASRTTDLSGFKWSDSKWYESLKGKDMNRQPIAIYECHIGSWM
KHPDGTEDGFYTYRQFADRIVEYLKEMKYTHIELIGIAEHPFDGSWGYQVTGYYAPTARYGEPTDFMYLINQLHKHGIGV
ILDWVPAHFCPDEFGLACFDGTCIYEDPDPRKGEHPDWGTKIFNLAKPEVKNFLIANALYWIRKFHIDGLRVDAVASMLY
LDYGKKDGQWVPNKYGDNKNLDAIEFFKHFNSVVRGTYPNILTIAEESTAWPKVTAPPEEDGLGFAFKWNMGWMHDFCEY
MKLDPYFRQGAHYMMTFAMSYNDSENYILPLSHDEVVHLKCSMVEKMPGYKVDKYANLRVGYTYMFGHSGKKLLFMGQDF
GQEREWSEKRELDWFLLENDLNRGMKDYVGKLLEIYRKYPALYEVDNDWGGFEWINADDKERSTYSFYRRASNGKDNILF
VLNMTPMERKGFKVGVPFDGTYTKILDSAKECYGGSGSSVPDKIKAVKGLCDYKDYSIEFDLPPYGAEVFVFQTKKTKN
>Q8GQC5 2.4.1.18~~~glgB~~~1,4-alpha-glucan branching enzyme GlgB~~~
MFVAAMTESDQNIINLLFSGHYADPFAVLGMHDTASGLEVRALLPDAIDVWVVDAHSGRKVANLQCRDPRGFFASAIPRR
KKPFSYRLAVTWPQDTQVIDDPYRFGTLLQELDIWLLAEGRHLRPFETLGAHPSTLDGVVGTCFAVWAPNAQRVSVVGDF
NFWDGRRHPMRRRRENGVWELFVPGVGPGQLYKFEIIDCYGNVLVKSDPYAFESQMRPDTASVVSRLPPALPVDEARQHA
NELQSPISIYEVHLGSWRRHTHNNFWLSYRELADQLVPYVKEMGFTHVELMPVHKHPFDGSWGYQPLGLYAPTRRFGSPD
DFRYLVSAFHEAGINVLLDWVSGHFPADSYGLARFDGPALYEYADPKEGYHQDWNTLIYNFDRHEVRNYLAGNALYWTER
FGVDGLRVDAVASMIYRDYSRRDGEWVPNYFGGKENLEAIGFLRYTNQMLGQHHAGAVTIAEESTDYAGVTLPPEHGGLG
FHYKWNMGWMHDSLAYMQLDPVHRKYHHDLLTFGMLYAYSENFVLPLSHDEVVHGKRSLLDRMPGDVWQKFANLRAYYGF
MWAYPGKKLLFMGGEFAQGREWNHDTSLDWHLLDEPEGWHAGVQQLVRDLNHCYRQHPPLYQCDYLHQGFEWVVVDDREN
SVFAFIRRDADGNEMLIISNFTPVPRDSYRVGINQPGAWREVLNTDSWHYHGGNLGNQGLVYSETVGSHSRPQSLVLALP
PLATLYLVKEA
>P07762 2.4.1.18~~~glgB~~~1,4-alpha-glucan branching enzyme GlgB~~~COG0296
MSDRIDRDVINALIAGHFADPFSVLGMHKTTAGLEVRALLPDATDVWVIEPKTGRKLAKLECLDSRGFFSGVIPRRKNFF
RYQLAVVWHGQQNLIDDPYRFGPLIQEMDAWLLSEGTHLRPYETLGAHADTMDGVTGTRFSVWAPNARRVSVVGQFNYWD
GRRHPMRLRKESGIWELFIPGAHNGQLYKYEMIDANGNLRLKSDPYAFEAQMRPETASLICGLPEKVVQTEERKKANQFD
APISIYEVHLGSWRRHTDNNFWLSYRELADQLVPYAKWMGFTHLELLPINEHPFDGSWGYQPTGLYAPTRRFGTRDDFRY
FIDAAHAAGLNVILDWVPGHFPTDDFALAEFDGTNLYEHSDPREGYHQDWNTLIYNYGRREVSNFLVGNALYWIERFGID
ALRVDAVASMIYRDYSRKEGEWIPNEFGGRENLEAIEFLRNTNRILGEQVSGAVTMAEESTDFPGVSRPQDMGGLGFWYK
WNLGWMHDTLDYMKLDPVYRQYHHDKLTFGILYNYTENFVLPLSHDEVVHGKKSILDRMPGDAWQKFANLRAYYGWMWAF
PGKKLLFMGNEFAQGREWNHDASLDWHLLEGGDNWHHGVQRLVRDLNLTYRHHKAMHELDFDPYGFEWLVVDDKERSVLI
FVRRDKEGNEIIVASNFTPVPRHDYRFGINQPGKWREILNTDSMHYHGSNAGNGGTVHSDEIASHGRQHSLSLTLPPLAT
IWLVREAE
>P30538 2.4.1.18~~~glgB~~~1,4-alpha-glucan branching enzyme GlgB~~~
MIAVGPTDLEIYLFHEGSLYKSYELFGAHVIKKNGMVGTRFCVWAPHAREVRLVGSFNEWNGTNFNLMKVSNQGVWMIFI
PENLEGHLYKYEITTNDGNVLLKSDPYAFYSELRPHTASIVYNIKGYQWNDQTWRRKKQRKRIYDQPLFIYELHFGSWKK
KEDGSFYTYQEMAEELIPYVLEHGFTHIELLPLVEHPFDRSWGYQGIGYYSATSRYGTPHDLMYFIDRCHQAGIGVILDW
VPGHFCKDSHGLYMFDGAPAYEYANMQDRENYVWGTANFDLGKPEVRSFLISNALFWMEYFHVDGFRVDAVANMLYWPNS
DVLYKNTYAVEFLQKLNETVFAYDPNILMIAEDSTDWPRVTAPTYDGGLGFNYKWNMGWMNDILTYMETPPEHRKYVHNK
VTFSLLYAYSENFILPFSHDEVVHGKKSLLSKMPGTYEEKFAQLRLLYGYLLTHPGKKLLFMGGEFGQFDEWKDLEQLDW
MLFDFDMHRNMNMYVKELLKCYKRYKPLYELDHSPDGFEWIDVHNAEQSIFSFIRRGKKEDDLLIVVCNFTNKVYHGYKV
GVPLFTRYREVINSDAIQFGGFGNINPKPIAAMEGPFHGKPYHIQMTIPPFGISILRPVKKGSVKSFMKTPHPPSHGAS
>P9WN45 2.4.1.18~~~glgB~~~1,4-alpha-glucan branching enzyme GlgB~~~COG0296
MSRSEKLTGEHLAPEPAEMARLVAGTHHNPHGILGAHEYDDHTVIRAFRPHAVEVVALVGKDRFSLQHLDSGLFAVALPF
VDLIDYRLQVTYEGCEPHTVADAYRFLPTLGEVDLHLFAEGRHERLWEVLGAHPRSFTTADGVVSGVSFAVWAPNAKGVS
LIGEFNGWNGHEAPMRVLGPSGVWELFWPDFPCDGLYKFRVHGADGVVTDRADPFAFGTEVPPQTASRVTSSDYTWGDDD
WMAGRALRNPVNEAMSTYEVHLGSWRPGLSYRQLARELTDYIVDQGFTHVELLPVAEHPFAGSWGYQVTSYYAPTSRFGT
PDDFRALVDALHQAGIGVIVDWVPAHFPKDAWALGRFDGTPLYEHSDPKRGEQLDWGTYVFDFGRPEVRNFLVANALYWL
QEFHIDGLRVDAVASMLYLDYSRPEGGWTPNVHGGRENLEAVQFLQEMNATAHKVAPGIVTIAEESTPWSGVTRPTNIGG
LGFSMKWNMGWMHDTLDYVSRDPVYRSYHHHEMTFSMLYAFSENYVLPLSHDEVVHGKGTLWGRMPGNNHVKAAGLRSLL
AYQWAHPGKQLLFMGQEFGQRAEWSEQRGLDWFQLDENGFSNGIQRLVRDINDIYRCHPALWSLDTTPEGYSWIDANDSA
NNVLSFMRYGSDGSVLACVFNFAGAEHRDYRLGLPRAGRWREVLNTDATIYHGSGIGNLGGVDATDDPWHGRPASAVLVL
PPTSALWLTPA
>Q93HU3 2.4.1.18~~~glgB~~~1,4-alpha-glucan branching enzyme GlgB~~~
MSWLTEEDIRRWESGTFYDSYRKLGAHPDDEGTWFCVWAPHADGVSVLGAFNDWNPEANPLERYGGGLWAGYVPGARPGH
TYKYRIRHGFYQADKTDPYAFAMEPPTGSPIEGLASIITRLDYTWHDDEWMRRRKGPASLYEPVSIYEVHLGSWRHKRPG
ESFSYREIAEPLADYVQEMGFTHVELLPVMEHPYYGSWGYQVVGYYAPTFRYGSPQDLMYLIDYLHQRGIGVILDWVPSH
FAADPQGLVFFDGTTLFEYDDPKMRYHPDWGTYVFDYNKPGVRNFLISNALFWLEKYHVDGLRVDAVASMLYRDYSRKEW
TPNIFGGRENLEAIDFIKKFNETVYLHFPEAMTIAEESTAWPGVSAPTYNNGLGFLYKWNMGWMHDTLDYIQRDPIYRKY
HHDELTFSLWYAFSEHYVLPLSHDEVVHGKGSLWGKMPGDDWQKAANLRLLFGHMWGHPGKKLLFMGGEFGQHHEWNHDT
QLEWHLLDQPYHRGIQLWVCDLNHLYRTNPALWHDGPEGFEWIDFSDRDQSVICYLRKNAGRMLLFVLNFTPVPREHYRV
GVPIGGPWHEVLNSDAVAYGGSGMGNFGRVEAVPESWHGRPFHLELTLPPLAALILEPEHG
>P16954 2.4.1.18~~~glgB~~~1,4-alpha-glucan branching enzyme GlgB~~~COG0296
MTGTTPLPSSSLSVEQVNRIASNQEQNPFDILGPHPYEHEGQAGWVIRAYLPEAQEAAVICPALRREFAMHPVHHPHFFE
TWVPEETLEIYQLRITEGERERIIYDPYAFRSPLLTDYDIHLFAEGNHHRIYEKLGAHPCELENVAGVNFAVWAPSARNV
SILGDFNSWDGRKHQMARRSNGIWELFIPELTVGAAYKYEIKNYDGHIYEKSDPYGFQQEVRPKTASIVADLDRYTWGDA
DWLERRRHQEPLRQPISVYEVHLGSWMHASSDAIATDAQGKPLPPVPVADLKPGARFLTYRELADRLIPYVLDLGYSHIE
LLPIAEHPFDGSWGYQVTGYYAATSRYGSPEDFMYFVDRCHQNGIGVILDWVPGHFPKDGHGLAFFDGTHLYEHADSRQG
EHREWGTLVFNYGRHEVRNFLAANALFWFDKYHIDGIRVDAVASMLYLDYNRKEGEWIPNEYGGRENIEAADFLRQVNHL
IFSYFPGALSIAEESTSWPMVSWPTYVGGLGFNLKWNMGWMHDMLDYFSMDPWFRQFHQNNVTFSIWYAFSENFMLALSH
DEVVHGKSNLIGKMPGDEWQKFANLRCLLGYMFTHPGKKTLFMGMEFGQWAEWNVWGDLEWHLLQYEPHQGLKQFVKDLN
HLYRNAPALYSEDCNQAGFEWIDCSDNRHSIVSFIRRAHESDRFLVVVCNFTPQPHAHYRIGVPVAGFYREIFNSDARSY
GGSNMGNLGGKWTDEWSCHNRPYSLDLCLPPLTTLVLELASGPESLSEAANSPL
>Q8U8L5 2.7.7.27~~~glgC~~~Glucose-1-phosphate adenylyltransferase~~~COG0448
MSEKRVQPLARDAMAYVLAGGRGSRLKELTDRRAKPAVYFGGKARIIDFALSNALNSGIRRIGVATQYKAHSLIRHLQRG
WDFFRPERNESFDILPASQRVSETQWYEGTADAVYQNIDIIEPYAPEYMVILAGDHIYKMDYEYMLQQHVDSGADVTIGC
LEVPRMEATGFGVMHVNEKDEIIDFIEKPADPPGIPGNEGFALASMGIYVFHTKFLMEALRRDAADPTSSRDFGKDIIPY
IVEHGKAVAHRFADSCVRSDFEHEPYWRDVGTIDAYWQANIDLTDVVPDLDIYDKSWPIWTYAEITPPAKFVHDDEDRRG
SAVSSVVSGDCIISGAALNRSLLFTGVRANSYSRLENAVVLPSVKIGRHAQLSNVVIDHGVVIPEGLIVGEDPELDAKRF
RRTESGICLITQSMIDKLDL
>P39122 2.7.7.27~~~glgC~~~Glucose-1-phosphate adenylyltransferase~~~COG0448
MKKQCVAMLLAGGKGSRLSGLTKNMAKPAVSFGGKYRIIDFTLSNCSNSGIDTVGILTQYQPLELNSYIGIGSAWDLDRY
NGGVTVLPPYAESSEVKWYKGTASSIYENLNYLNQYDPEYVLILSGDHIYKMDYGKMLDYHIEKKADVTISVIEVGWEEA
SRFGIMKANPDGTITHFDEKPKFPKSNLASMGIYIFNWPLLKQYLEMDDQNPYSSHDFGKDIIPLLLEEKKKLSAYPFKG
YWKDVGTVQSLWEANMDLLKEDSELKLFERKWKIYSVNPNQPPQFISSDAQVQDSLVNEGCVVYGNVSHSVLFQGVTVGK
HTTVTSSVIMPDVTIGEHVVIENAIVPNGMVLPDGAVIRSEKDIEEVLLVSEEFVEKELI
>P0A6V1 2.7.7.27~~~glgC~~~Glucose-1-phosphate adenylyltransferase~~~COG0448
MVSLEKNDHLMLARQLPLKSVALILAGGRGTRLKDLTNKRAKPAVHFGGKFRIIDFALSNCINSGIRRMGVITQYQSHTL
VQHIQRGWSFFNEEMNEFVDLLPAQQRMKGENWYRGTADAVTQNLDIIRRYKAEYVVILAGDHIYKQDYSRMLIDHVEKG
ARCTVACMPVPIEEASAFGVMAVDENDKIIEFVEKPANPPSMPNDPSKSLASMGIYVFDADYLYELLEEDDRDENSSHDF
GKDLIPKITEAGLAYAHPFPLSCVQSDPDAEPYWRDVGTLEAYWKANLDLASVVPELDMYDRNWPIRTYNESLPPAKFVQ
DRSGSHGMTLNSLVSGGCVISGSVVVQSVLFSRVRVNSFCNIDSAVLLPEVWVGRSCRLRRCVIDRACVIPEGMVIGENA
EEDARRFYRSEEGIVLVTREMLRKLGHKQER
>P9WN43 2.7.7.27~~~glgC~~~Glucose-1-phosphate adenylyltransferase~~~COG0448
MREVPHVLGIVLAGGEGKRLYPLTADRAKPAVPFGGAYRLIDFVLSNLVNARYLRICVLTQYKSHSLDRHISQNWRLSGL
AGEYITPVPAQQRLGPRWYTGSADAIYQSLNLIYDEDPDYIVVFGADHVYRMDPEQMVRFHIDSGAGATVAGIRVPRENA
TAFGCIDADDSGRIRSFVEKPLEPPGTPDDPDTTFVSMGNYIFTTKVLIDAIRADADDDHSDHDMGGDIVPRLVADGMAA
VYDFSDNEVPGATDRDRAYWRDVGTLDAFYDAHMDLVSVHPVFNLYNKRWPIRGESENLAPAKFVNGGSAQESVVGAGSI
ISAASVRNSVLSSNVVVDDGAIVEGSVIMPGTRVGRGAVVRHAILDKNVVVGPGEMVGVDLEKDRERFAISAGGVVAVGK
GVWI
>P39669 2.7.7.27~~~glgC~~~Glucose-1-phosphate adenylyltransferase~~~COG0448
MSEKRVQPLARDAMAYVLAGGRGSRLKELTDRRAKPAVYFGGKARIIDFALSNALNSGIRRIGVATQYKAHSLIRHLQRG
WDFFRPERNESFDILPASQRVSETQWYEGTADAVYQNIDIIEPYAPEYMVILAGDHIYKMDYEYMLQQHVDSGADVTIGC
LEVPRMEATGFGVMHVNEKDEIIDFIEKPADPPGIPGNEGFALASMGIYVFHTKFLMEAVRRDAADPTSSRDFGKDIIPY
IVEHGKAVAHRFADSCVRSDFEHEPYWRDVGTIDAYWQANIDLTDVVPDLDIYDKSWPIWTYAEITPPAKFVHDDEDRRG
SAVSSVVSGDCIISGAALNRSLLFTGVRANSYSRLENAVVLPSVKIGRHAQLSNVVIDHGVVIPEGLIVGEDPELDAKRF
RRTESGICLITQSMIDKLDL
>P39124 ~~~glgD~~~Glycogen biosynthesis protein GlgD~~~COG0448
MFNNQMLGVIDETTYKHSLQDLTAQRSLGAIPFAGRYRLIDFMLSNMVNADIRSVAIFPKYRYRSLMDHLGAGKEWDLHR
KKDGLFFFPSPHLHHEYDEFGSFRQFSDHLDYFHRSTQQYAVISNSHTVCNIQFQYVLKRHQEVGCDVTEVFQDGQSLQI
YIMSTTLLKDLIYGHSEKGYKTIQEAVEKESSALTICPYEYSGYAAVIDSVEKYYTHSMELIQPRFWQQVFLPQQPIYTK
VKDEPPTKYGKHSTVKNSLVANGCVLEGEVENCILFRAVHVGKGTKLKNCIIMQKTQIGEDCLLEQVISDKDVKIGKATE
AAGTAEQPLVLRKGLVQGELMNS
>Q9L1K2 2.4.99.16~~~glgE1~~~Alpha-1,4-glucan:maltose-1-phosphate maltosyltransferase 1~~~COG0366
MPATHHSSATSAERPTVVGRIPVLDVRPVVQRGRRPAKAVTGESFEVSATVFREGHDAVGANVVLRDPRGRPGPWTPMRE
LAPGTDRWGATVTAGETGTWSYTVEAWGDPVTTWRHHARIKIPAGLDTDLVLEEGARLYERAAADVPGREDRRELLAAVD
ALRDESRPAASRLAAALTPQVDAVLARHPLRDLVTSSDPLPLLVERERALYGAWYEFFPRSEGTPHTPHGTFRTAARRLP
AIAAMGFDVVYLPPIHPIGTTHRKGRNNTLSATGDDVGVPWAIGSPEGGHDSIHPALGTLDDFDHFVTEAGKLGLEIALD
FALQCSPDHPWVHKHPEWFHHRPDGTIAHAENPPKKYQDIYPIAFDADPDGLATETVRILRHWMDHGVRIFRVDNPHTKP
VAFWERVIADINGTDPDVIFLAEAFTRPAMMATLAQIGFQQSYTYFTWRNTKQELTEYLTELSGEAASYMRPNFFANTPD
ILHAYLQHGGRPAFEVRAVLAATLSPTWGIYSGYELCENTPLREGSEEYLDSEKYQLKPRDWTRAAREGTTIAPLVTRLN
TIRRENPALRQLRDLHFHPTDKEEVIAYSKRQGSNTVLVVVNLDPRHTQEATVSLDMPQLGLDWHESVPVRDELTGETYH
WGRANYVRLEPGRTPAHVCTVLRPSHPQIGGSHTT
>Q9KY04 2.4.99.16~~~glgE2~~~Alpha-1,4-glucan:maltose-1-phosphate maltosyltransferase 2~~~COG0366
MRMSATVGIGRIPVRDVQPVVEYGRRPAKAVTGETFEVTATVFREGHDAVAANVVLKDPEGRPGPWTPMRELAPGSDRWG
ATVTPGAPGNWTYRVEAWSDPVATWRHAARIKVPAGIDAGLVLEEGSELYRRAAAGVPKDSGRDVLLAAATALLDDTLPV
ATRLAAALTPQVDAVLARHPLRDLVTSSDPLPLLVERERALYGAWYEFFPRSEGTPHTPHGTFRTAARRLPAIAAMGFDV
VYLPPIHPIGTTHRKGRNNTLSATGDDVGVPWAIGSPEGGHDSIHPALGTLDDFDHFVTEAARHGLEIALDFALQCSPDH
PWVHKHPEWFHHRPDGTIAHAENPPKKYQDIYPIAFDADPDGLATETVRILRHWMDHGVRIFRVDNPHTKPVAFWERVIA
DINGTDPDVIFLAEAFTRPAMMATLAQIGFQQSYTYFTWRNTKQELTEYLEELSGEAAAYMRPNFFANTPDILHAYLQHG
GRPAFEVRAVLAATLSPTWGIYSGYELCENTPLREGSEEYLDSEKYQLKPRDWTRAAREGTTIAPLVTRLNTIRREHPAL
HRLRNLRFHHTDNDALIAYSKRVGSDVVLVVANLDPHRTQEATISLDMPQLGLDWHDSVPVHDELTGRTYHWGRANYVRL
EPGRAPAHVFHVRRPSAAAAPQNGGSGAS
>Q9RP48 2.4.99.16~~~glgE~~~Alpha-1,4-glucan:maltose-1-phosphate maltosyltransferase~~~COG0366
MRSGWVAGRIGIDDVAPVVSCGRYPAKAVVGEVVPVRATVWREGHDAVSATLVVRYLGTEFPRLASGPGTTPPAVPLGTV
VQPGKRVKPQILQMSKGRTPDVFHGEFTPDAVGLWTFRVDGWGDPIATWRHAVEAKLEAGQSETELNNDLLVGARLLMRA
AEGVPRKLRDPLLEAAQQLRTPGDPYQRAGGALSPEVADLLLQYPLREFVTRGEVHGVWVDRPLARFSSWYEMFPRSTGG
WDENGHPVHGTFATAAAALPRIARMGFNVVYLPPIHPIGKVHRKGRNNSVTAAPGDVGSPWAIGSDEGGHDAVHPDLGTI
DDFDAFVAAARDAGLEVALDLALQCAPDHPWAKEHPEWFTVLPDGTIAYAENPPKKYQDIYPLNFDNDPDGLFHEVLRVV
KFWISHGVKVFRVDNPHTKPPNFWAWLIAEVKNEDPDILFLSEAFTRPARLYGLAKLGFTQSYTYFTWRTAKWELTEFGE
EIAKYADHARPNLWVNTPDILHESLQHGGPGMFAIRAVLASTMSSSWGVYSGYELFEHRSVREGSEEYLDSEKYELRPRD
FDGALARGESLEPFLTRLNEIRRLHPALRQLRTIKFHHLDNDALLAYSKFDPVTGDTVLVVVTLNPFGPEESTLWLDMEA
LGMEPYDRFWVRDEITGEEYQWGQSNYVRIEPAKAVAHVLNMPLIPYEKRLDLLRRE
>P9WQ16 2.4.99.16~~~glgE~~~Alpha-1,4-glucan:maltose-1-phosphate maltosyltransferase~~~
MSGRAIGTETEWWVPGRVEIDDVAPVVSCGVYPAKAVVGEVVPVSAAVWREGHEAVAATLVVRYLGVRYPHLTDRPRARV
LPTPSEPQQRVKPLLIPMTSGQEPFVFHGQFTPDRVGLWTFRVDGWGDPIHTWRHGLIAKLDAGQGETELSNDLLVGAVL
LERAATGVPRGLRDPLLAAAAALRTPGDPVTRTALALTPEIEELLADYPLRDLVTRGEQFGVWVDRPLARFGAWYEMFPR
STGGWDDDGNPVHGTFATAAAELPRIAGMGFDVVYLPPIHPIGKVHRKGRNNSPTAAPTDVGSPWAIGSDEGGHDTVHPS
LGTIDDFDDFVSAARDLGMEVALDLALQCAPDHPWAREHRQWFTELPDGTIAYAENPPKKYQDIYPLNFDNDPEGLYDEV
LRVVQHWVNHGVKFFRVDNPHTKPPNFWAWLIAQVKTVDPDVLFLSEAFTPPARQYGLAKLGFTQSYSYFTWRTTKWELT
EFGNQIAELADYRRPNLFVNTPDILHAVLQHNGPGMFAIRAVLAATMSPAWGMYCGYELFEHRAVREGSEEYLDSEKYEL
RPRDFASALDQGRSLQPFITRLNIIRRLHPAFQQLRTIHFHHVDNDALLAYSKFDPATGDCVLVVVTLNAFGPEEATLWL
DMAALGMEDYDRFWVRDEITGEEYQWGQANYIRIDPARAVAHIINMPAVPYESRNTLLRRR
>P9WQ17 2.4.99.16~~~glgE~~~Alpha-1,4-glucan:maltose-1-phosphate maltosyltransferase~~~COG0366
MSGRAIGTETEWWVPGRVEIDDVAPVVSCGVYPAKAVVGEVVPVSAAVWREGHEAVAATLVVRYLGVRYPHLTDRPRARV
LPTPSEPQQRVKPLLIPMTSGQEPFVFHGQFTPDRVGLWTFRVDGWGDPIHTWRHGLIAKLDAGQGETELSNDLLVGAVL
LERAATGVPRGLRDPLLAAAAALRTPGDPVTRTALALTPEIEELLADYPLRDLVTRGEQFGVWVDRPLARFGAWYEMFPR
STGGWDDDGNPVHGTFATAAAELPRIAGMGFDVVYLPPIHPIGKVHRKGRNNSPTAAPTDVGSPWAIGSDEGGHDTVHPS
LGTIDDFDDFVSAARDLGMEVALDLALQCAPDHPWAREHRQWFTELPDGTIAYAENPPKKYQDIYPLNFDNDPEGLYDEV
LRVVQHWVNHGVKFFRVDNPHTKPPNFWAWLIAQVKTVDPDVLFLSEAFTPPARQYGLAKLGFTQSYSYFTWRTTKWELT
EFGNQIAELADYRRPNLFVNTPDILHAVLQHNGPGMFAIRAVLAATMSPAWGMYCGYELFEHRAVREGSEEYLDSEKYEL
RPRDFASALDQGRSLQPFITRLNIIRRLHPAFQQLRTIHFHHVDNDALLAYSKFDPATGDCVLVVVTLNAFGPEEATLWL
DMAALGMEDYDRFWVRDEITGEEYQWGQANYIRIDPARAVAHIINMPAVPYESRNTLLRRR
>A0R2E2 2.4.1.342~~~glgM~~~Alpha-maltose-1-phosphate synthase~~~COG0297
MRVAMMTREYPPEVYGGAGVHVTELVAQLRKLCDVDVHCMGAPRDGAYVAHPDPTLRGANAALTMLSADLNMVNNAEAAT
VVHSHTWYTGLAGHLASLLYGVPHVLTAHSLEPLRPWKAEQLGGGYQVSSWVERTAVEAADAVIAVSSGMRDDVLRTYPA
LDPDRVHVVRNGIDTTVWYPAEPGPDESVLAELGVDLNRPIVAFVGRITRQKGVAHLVAAAHRFAPDVQLVLCAGAPDTP
QIAEEVSSAVQQLAQARTGVFWVREMLPTHKIREILSAATVFVCPSVYEPLGIVNLEAMACATAVVASDVGGIPEVVADG
RTGLLVHYDANDTEAYEARLAEAVNSLVADPDRAREYGVAGRERCIEEFSWAHIAEQTLEIYRKVSA
>P9WMZ1 2.4.1.342~~~glgM~~~Alpha-maltose-1-phosphate synthase~~~COG0297
MRVAMLTREYPPEVYGGAGVHVTELVAYLRRLCAVDVHCMGAPRPGAFAYRPDPRLGSANAALSTLSADLVMANAASAAT
VVHSHTWYTALAGHLAAILYDIPHVLTAHSLEPLRPWKKEQLGGGYQVSTWVEQTAVLAANAVIAVSSAMRNDMLRVYPS
LDPNLVHVIRNGIDTETWYPAGPARTGSVLAELGVDPNRPMAVFVGRITRQKGVVHLVTAAHRFRSDVQLVLCAGAADTP
EVADEVRVAVAELARNRTGVFWIQDRLTIGQLREILSAATVFVCPSVYEPLGIVNLEAMACATAVVASDVGGIPEVVADG
ITGSLVHYDADDATGYQARLAEAVNALVADPATAERYGHAGRQRCIQEFSWAYIAEQTLDIYRKVCA
>P9WMY9 2.4.1.11~~~~~~Glycogen synthase~~~COG0297
MRILMVSWEYPPVVIGGLGRHVHHLSTALAAAGHDVVVLSRCPSGTDPSTHPSSDEVTEGVRVIAAAQDPHEFTFGNDMM
AWTLAMGHAMIRAGLRLKKLGTDRSWRPDVVHAHDWLVAHPAIALAQFYDVPMVSTIHATEAGRHSGWVSGALSRQVHAV
ESWLVRESDSLITCSASMNDEITELFGPGLAEITVIRNGIDAARWPFAARRPRTGPAELLYVGRLEYEKGVHDAIAALPR
LRRTHPGTTLTIAGEGTQQDWLIDQARKHRVLRATRFVGHLDHTELLALLHRADAAVLPSHYEPFGLVALEAAAAGTPLV
TSNIGGLGEAVINGQTGVSCAPRDVAGLAAAVRSVLDDPAAAQRRARAARQRLTSDFDWQTVATATAQVYLAAKRGERQP
QPRLPIVEHALPDR
>P26649 ~~~glgS~~~Surface composition regulator~~~
MDHSLNSLNNFDFLARSFARMHAEGRPVDILAVTGNMDEEHRTWFCARYAWYCQQMMQARELELEH
>Q8KR69 3.2.1.196~~~glgX~~~Glycogen debranching enzyme~~~
MGELLAGRPRPLGSHFDGEGVNFALFSSGASRVELCIFDGLREQRLPLTARTGDIWHGYLPDAQPGLCYGYRVDGAFDPS
RGQRFNANKLLLDPCARQMDGWVVDDQRLHGGYHQPDPSDSAEVMPPSVVVDEHYDWQGDRLPRTPWSQTVLYEAHVRGL
TRRHPGIPAAIRGTYAALAHPVMLDYLTQLGVTALELMPVQQHADEPRLQSMGLRNYWGYNTLLPFAVDNSLAASDDPLN
EFRDTVRALHQAGIEVILDVVFNHSAELDVDGPTLTLRGIDNASYYWLTENGDYHNWAGCGNVLRLEHPAVLHWVIECLT
FWHEVCHVDGFRFDLATILGRLPDFSSSAPFFTALRNHRSLRDCKLIAEPWDISPGGYQLGQFPAPFAEWNDRFRDDMRR
FWLHGDLPIGVLARRFAASSEVFERGSRQPWASVNMLTSHDGFTLRDLVCFNHKHNDANGEQNRDGTNSNFSFNHGTEGL
EADETTQARRRVSQQALLTTLLLSQGTPMLLAGDEFGNSQQGNNNAYCQDNALAWLHWDQADDALLAFTSGLIRLRRSIP
ALQRGRWWRDDDEDDVRWLNAQGEALTPYEWEQGTHQLQIQLSERWLLLVNATPQVSDFSLPEGEWRVAPPFSATDHLLD
GQTWRGQANAVCVLVKQ
>P15067 3.2.1.196~~~glgX~~~Glycogen debranching enzyme~~~COG1523
MTQLAIGKPAPLGAHYDGQGVNFTLFSAHAERVELCVFDANGQEHRYDLPGHSGDIWHGYLPDARPGLRYGYRVHGPWQP
AEGHRFNPAKLLIDPCARQIDGEFKDNPLLHAGHNEPDYRDNAAIAPKCVVVVDHYDWEDDAPPRTPWGSTIIYEAHVKG
LTYLHPEIPVEIRGTYKALGHPVMINYLKQLGITALELLPVAQFASEPRLQRMGLSNYWGYNPVAMFALHPAYACSPETA
LDEFRDAIKALHKAGIEVILDIVLNHSAELDLDGPLFSLRGIDNRSYYWIREDGDYHNWTGCGNTLNLSHPAVVDYASAC
LRYWVETCHVDGFRFDLAAVMGRTPEFRQDAPLFTAIQNCPVLSQVKLIAEPWDIAPGGYQVGNFPPLFAEWNDHFRDAA
RRFWLHYDLPLGAFAGRFAASSDVFKRNGRLPSAAINLVTAHDGFTLRDCVCFNHKHNEANGEENRDGTNNNYSNNHGKE
GLGGSLDLVERRRDSIHALLTTLLLSQGTPMLLAGDEHGHSQHGNNNAYCQDNQLTWLDWSQASSGLTAFTAALIHLRKR
IPALVENRWWEEGDGNVRWLNRYAQPLSTDEWQNGPKQLQILLSDRFLIAINATLEVTEIVLPAGEWHAIPPFAGEDNPV
ITAVWQGPAHGLCVFQR
>P9WQ25 3.2.1.-~~~glgX~~~Glycogen operon protein GlgX homolog~~~COG1523
MSSNNAGESDGTGPALPTVWPGNAYPLGATYDGAGTNFSLFSEIAEKVELCLIDEDGVESRIPLDEVDGYVWHAYLPNIT
PGQRYGFRVHGPFDPAAGHRCDPSKLLLDPYGKSFHGDFTFGQALYSYDVNAVDPDSTPPMVDSLGHTMTSVVINPFFDW
AYDRSPRTPYHETVIYEAHVKGMTQTHPSIPPELRGTYAGLAHPVIIDHLNELNVTAVELMPVHQFLHDSRLLDLGLRNY
WGYNTFGFFAPHHQYASTRQAGSAVAEFKTMVRSLHEAGIEVILDVVYNHTAEGNHLGPTINFRGIDNTAYYRLMDHDLR
FYKDFTGTGNSLNARHPHTLQLIMDSLRYWVIEMHVDGFRFDLASTLARELHDVDRLSAFFDLVQQDPVVSQVKLIAEPW
DVGEGGYQVGNFPGLWTEWNGKYRDTVRDYWRGEPATLGEFASRLTGSSDLYEATGRRPSASINFVTAHDGFTLNDLVSY
NDKHNEANGENNRDGESYNRSWNCGVEGPTDDPDILALRARQMRNMWATLMVSQGTPMIAHGDEIGRTQYGNNNVYCQDS
ELSWMDWSLVDKNADLLAFARKATTLRKNHKVFRRRRFFEGEPIRSGDEVRDIAWLTPSGREMTHEDWGRGFDRCVAVFL
NGEAITAPDARGERVVDDSFLLCFNAHDHDVEFVMPHDGYAQQWTGELDTNDPVGDIDLTVTATDTFSVPARSLLVLRKT
L
>Q7NDN8 ~~~glvI~~~Proton-gated ion channel~~~COG5361
MFPTGWRPKLSESIAASRMLWQPMAAVAVVQIGLLWFSPPVWGQDMVSPPPPIADEPLTVNTGIYLIECYSLDDKAETFK
VNAFLSLSWKDRRLAFDPVRSGVRVKTYEPEAIWIPEIRFVNVENARDADVVDISVSPDGTVQYLERFSARVLSPLDFRR
YPFDSQTLHIYLIVRSVDTRNIVLAVDLEKVGKNDDVFLTGWDIESFTAVVKPANFALEDRLESKLDYQLRISRQYFSYI
PNIILPMLFILFISWTAFWSTSYEANVTLVVSTLIAHIAFNILVETNLPKTPYMTYTGAIIFMIYLFYFVAVIEVTVQHY
LKVESQPARAASITRASRIAFPVVFLLANIILAFLFFGF
>A9CEQ7 5.4.1.4~~~~~~D-galactarolactone isomerase~~~COG3618
MSELVRKLSGTAPNPAFPRGAVDTQMHMYLPGYPALPGGPGLPPGALPGPEDYRRLMQWLGIDRVIITQGNAHQRDNGNT
LACVAEMGEAAHAVVIIDATTTEKDMEKLTAAGTVGARIMDLPGGAVNLSELDAVDERAHAADWMVAVQFDGNGLLDHLP
RLQKIRSRWVFDHHGKFFKGIRTDGPEMAALLKLIDRGNLWFKFAGVYESSRKSWPYADVAAFSRVIAAHAPERIVWGTN
WPHNSVRETAAYPDDARLAELTLGWLPDEAARHRALVENPEALFKLSPVKAT
>Q0P8J6 2.7.3.13~~~~~~L-glutamine kinase~~~COG0574
MAELKFKTKAQNLKNLQTKLKKAKVLPLVLTSLEELISNEDKVLQDIQTLKANRLIIRSSSLSEDSMKNSNAGAFLSLAN
IKADSKDELLKALYEVANSMPSKSDEILVQPMLENITLCGVGFSVDKDNFSPYFCLQYDENGSNSSITDGSSKSAKTYYH
YRNYLEFKDIRLQKIIELIKELEVLYDCHFLDVEFAFAIQDDKEELFCLQVRPLVMHEKNNLFHSLPKEALYRFYKRFES
LKESRSRVLGDKAIFGVMPDWNPAEIIGLRPKRLAFSLYKEIITDNIWAYQRDNYGYRDLRSHPLIHSFLGIPYVDVRLS
FNSFIPKKLDENIAQKLVNFYLDKLNKNHELHDKIEFNIVYSCYDFNSSKKLEELLNHGFNENEIKRLEFSLLELTNKII
NPRSGFYLKDIQKAYKLKERYDGIINSNFSLIDKIYWLIEECKRYGTLPFAGVARAAFVAMQLLNSLVEIDFITKEEKDD
FLNSLNTVSKNLSKQTNHLNFHNKDQFLKDFGHLRAGTYNILSPRYDEDFELYFDADQKDSKVYLQDKAFVFSKEKTRAL
NALLKEHGLEINACEFFDFLKQAIEGRELVKFEFTRLLSKAIVYIEELGKYYDIEKEDLAHLDIKSILNLYSSLYSINPK
EQFIEEINRNKKEYELTQAIKLPSLLCNADEIFSFYNHSIIPNFITQKSITAFTAKENDKDLEGKIVLIYAADPGYDYLF
TKNIAGLITCYGGANSHMAIRASELGMPAVIGVGEENFEKYLKAKKINIECESEQIFCL
>P54495 2.7.1.2~~~glcK~~~Glucokinase~~~COG1940
MDEIWFAGIDLGGTTIKLAFINQYGEIQHKWEVPTDKTGDTITVTIAKTIDSKLDELQKPKHIIKYIGMGAPGPVDMAAG
VVYETVNLGWKNYALKNHLETETGIPAVIENDANIAALGEMWKGAGDGAKDVILVTLGTGVGGGIIANGEIVHGINGAGG
EIGHICSIPEGGAPCNCGKTGCIETIASATGIVRIAKEKIANAKKTTRLKATEQLSARDVFEAAGENDEIALEVVDYVAK
HLGLVLGNLASSLNPSKIVLGGGVSRAGELLRSKVEKTFRKCAFPRAAQAADISIAALGNDAGVIGGAWIAKNEWLKHQN
C
>P0A6V9 2.7.1.2~~~glk~~~Glucokinase~~~COG0837
MTKYALVGDVGGTNARLALCDIASGEISQAKTYSGLDYPSLEAVIRVYLEEHKVEVKDGCIAIACPITGDWVAMTNHTWA
FSIAEMKKNLGFSHLEIINDFTAVSMAIPMLKKEHLIQFGGAEPVEGKPIAVYGAGTGLGVAHLVHVDKRWVSLPGEGGH
VDFAPNSEEEAIILEILRAEIGHVSAERVLSGPGLVNLYRAIVKADNRLPENLKPKDITERALADSCTDCRRALSLFCVI
MGRFGGNLALNLGTFGGVFIAGGIVPRFLEFFKASGFRAAFEDKGRFKEYVHDIPVYLIVHDNPGLLGSGAHLRQTLGHI
L
>P0A6V8 2.7.1.2~~~glk~~~Glucokinase~~~COG0837
MTKYALVGDVGGTNARLALCDIASGEISQAKTYSGLDYPSLEAVIRVYLEEHKVEVKDGCIAIACPITGDWVAMTNHTWA
FSIAEMKKNLGFSHLEIINDFTAVSMAIPMLKKEHLIQFGGAEPVEGKPIAVYGAGTGLGVAHLVHVDKRWVSLPGEGGH
VDFAPNSEEEAIILEILRAEIGHVSAERVLSGPGLVNLYRAIVKADNRLPENLKPKDITERALADSCTDCRRALSLFCVI
MGRFGGNLALNLGTFGGVFIAGGIVPRFLEFFKASGFRAAFEDKGRFKEYVHDIPVYLIVHDNPGLLGSGAHLRQTLGHI
L
>P0A4E1 2.7.1.2~~~glkA~~~Glucokinase~~~COG1940
MGLTIGVDIGGTKIAAGVVDEEGNILSTHKVPTPTTPEAIVDAIASAVEGARVGHEIVAVGIGAAGYVNRQRSTVYFAPN
IDWRQEPLKEKVEARVGLPVVVENDANAAAWGEYKFGGGKGHRNVICITLGTGLGGGIIIGNKLRRGHFGVAAEFGHIRM
VPDGLLCGCGSQGCWEQYASGRALVRYAKQRANATPERAEVLLALGDGTPDGIEGKHISVAARQGCPVAVDSYRELARWA
GAGLADLASLFDPSAFIVGGGLSDEGDLVLDPIRKSYKRWLVGGNWRPVADVIAAQLGNKAGLVGAADLAREPDPIM
>P0A4E2 2.7.1.2~~~glkA~~~Glucokinase~~~
MGLTIGVDIGGTKIAAGVVDEEGNILSTHKVPTPTTPEAIVDAIASAVEGARVGHEIVAVGIGAAGYVNRQRSTVYFAPN
IDWRQEPLKEKVEARVGLPVVVENDANAAAWGEYKFGGGKGHRNVICITLGTGLGGGIIIGNKLRRGHFGVAAEFGHIRM
VPDGLLCGCGSQGCWEQYASGRALVRYAKQRANATPERAEVLLALGDGTPDGIEGKHISVAARQGCPVAVDSYRELARWA
GAGLADLASLFDPSAFIVGGGLSDEGDLVLDPIRKSYKRWLVGGNWRPVADVIAAQLGNKAGLVGAADLAREPDPIM
>Q9X1I0 2.7.1.2~~~glk~~~Glucokinase~~~
MPKLKLIGVDLGGTTFSVGLVSEDGKILKKVTRDTLVENGKEDVIRRIAETILEVSDGEEAPYVGIGSPGSIDRENGIVR
FSPNFPDWHNVPLTDELAKRTGKKVFLENDANAFVLGEKWFGAGRGHDHIVALTLGTGIGGGVVTHGYLLTGRDGIGAEL
GHVVVEPNGPMCNCGTRGCLEAVASATAIRRFLREGYKKYHSSLVYKLAGSPEKADAKHLFDAARQGDRFALMIRDRVVD
ALARAVAGYIHIFNPEIVIIGGGISRAGEILFGPLREKVVDYIMPSFVGTYEVVASPLVEDAGILGAASIIKERIGG
>P80077 5.4.99.1~~~glmE~~~Glutamate mutase epsilon subunit~~~
MELKNKKWTDEEFHKQREEVLQQWPTGKEVDLQEAVDYLKKIPAEKNFAEKLVLAKKKGITMAQPRAGVALLDEHIELLR
YLQDEGGADFLPSTIDAYTRQNRYDECENGIKESEKAGRSLLNGFPGVNYGVKGCRKVLEAVNLPLQARHGTPDSRLLAE
IIHAGGWTSNEGGGISYNVPYAKNVTIEKSLLDWQYCDRLVGFYEEQGVHINREPFGPLTGTLVPPSMSNAVGITEALLA
AEQGVKNITVGYGECGNMIQDIAALRCLEEQTNEYLKAYGYNDVFVTTVFHQWMGGFPQDESKAFGVIVTATTIAALAGA
TKVIVKTPHEAIGIPTKEANAAGIKATKMALNMLEGQRMPMSKELETEMAVIKAETKCILDKMFELGKGDLAIGTVKAFE
TGVMDIPFGPSKYNAGKMMPVRDNLGCVRYLEFGNVPFTEEIKNYNRERLQERAKFEGRDVSFQMVIDDIFAVGKGRLIG
RPE
>Q05509 5.4.99.1~~~glmE~~~Glutamate mutase epsilon subunit~~~
MELKNKKWTDEEFFKQREEVLKQWPTGKEVDLQEAVDYLKKVPTEKNFADKLVRAKEAGITLAQPRAGVALLDEHINLLR
YLQDEGGADLLPSTIDAYTRQNRYEECEIGIKESEKAGRSLLNGFPGVNHGVKGCRKVLESVNLPLQARHGTPDSRLLAE
IIHAGGWTSNEGGGISYNIPYAKSVPIDKCLKDWQYCDRLVGFYEEQGVHINREPFGPLTGTLVPPSMSNAVGITEALLA
AEQGVKNITVGYGECGNMLQDIAALRCLEEQTNEYLKAYGYNDVFVTTVFHQWMGGFPQDESKAFGVIVTATTIASLAGA
TKVIVKTPHEAIGIPTKEANASGIKATKMALNMLEGQRMPMSKELETEMAIIKAETKCILDKMFELGKGDLAVGTVKAFE
TGVMDIPFGPSKYNAGKMMPVRDNLGCVRYLEFGNVPFTEELKNYNRERLAERAKFEGREVSFQMVIDDIFAVGKGRLIG
RPENK
>Q81VN7 5.4.2.10~~~glmM~~~Phosphoglucosamine mutase~~~COG1109
MGKYFGTDGVRGVANKELTPELAFKIGRFGGYVLTKDTDRPKVIIGRDTRISGHMLEGALVAGLLSTGAEVMRLGVISTP
GVAYLTKALDAQAGVMISASHNPVQDNGIKFFGSDGFKLTDEQEAEIEALLDKEVDELPRPTGTNLGQVSDYFEGGQKYL
QYIKQTVEEDFSGLHIALDCAHGATSSLAPYLFADLEADISTMGTSPNGMNINDGVGSTHPEVLAELVKEKGADIGLAFD
GDGDRLIAVDEKGNIVDGDQIMFICAKYMKETGQLKHNTVVSTVMSNLGFYKALEANGITSDKTAVGDRYVMEEMKRGGY
NLGGEQSGHIILLDYITTGDGMLSALQLVNIMKMTKKPLSELAGEMTKFPQLLVNVRVTDKKLALENEKIKEIIRVVEEE
MNGDGRILVRPSGTEPLIRVMAEAPTQEVCDAYVHRIVEVVKAEVGAE
>O34824 5.4.2.10~~~glmM~~~Phosphoglucosamine mutase~~~COG1109
MGKYFGTDGVRGVANSELTPELAFKVGRFGGYVLTKDKQRPKVLIGRDTRISGHMLEGALVAGLLSIGAEVMRLGVISTP
GVSYLTKAMDAEAGVMISASHNPVQDNGIKFFGGDGFKLSDEQEAEIERLMDEPEDKLPRPVGADLGLVNDYFEGGQKYL
QFLKQTADEDFTGIHVALDCANGATSSLATHLFADLDADVSTMGTSPNGLNINDGVGSTHPEALSAFVKEKNADLGLAFD
GDGDRLIAVDEKGNIVDGDQIMYICSKHLKSEGRLKDDTVVSTVMSNLGFYKALEKEGIKSVQTAVGDRYVVEAMKKDGY
NVGGEQSGHLIFLDYNTTGDGLLSAIMLMNTLKATGKPLSELAAEMQKFPQLLVNVRVTDKYKVEENEKVKAVISEVEKE
MNGDGRILVRPSGTEPLVRVMAEAKTKELCDEYVNRIVEVVRSEMGLE
>P31120 5.4.2.10~~~glmM~~~Phosphoglucosamine mutase~~~COG1109
MSNRKYFGTDGIRGRVGDAPITPDFVLKLGWAAGKVLARHGSRKIIIGKDTRISGYMLESALEAGLAAAGLSALFTGPMP
TPAVAYLTRTFRAEAGIVISASHNPFYDNGIKFFSIDGTKLPDAVEEAIEAEMEKEISCVDSAELGKASRIVDAAGRYIE
FCKATFPNELSLSELKIVVDCANGATYHIAPNVLRELGANVIAIGCEPNGVNINAEVGATDVRALQARVLAEKADLGIAF
DGDGDRVIMVDHEGNKVDGDQIMYIIAREGLRQGQLRGGAVGTLMSNMGLELALKQLGIPFARAKVGDRYVLEKMQEKGW
RIGAENSGHVILLDKTTTGDGIVAGLQVLAAMARNHMSLHDLCSGMKMFPQILVNVRYTAGSGDPLEHESVKAVTAEVEA
ALGNRGRVLLRKSGTEPLIRVMVEGEDEAQVTEFAHRIADAVKAV
>Q5NII8 5.4.2.10~~~glmM~~~Phosphoglucosamine mutase~~~COG1109
MAKYFGTDGIRGEVANSTITVEFTQKLGNAVGSLINQKNYPKFVIVGQDTRSSGGFLKFALVSGLNAAGIDVLDLGVVPT
PVVAFMTVKHRAAAGFVITASHNKFTDNGIKLFSSNGFKLDDALEEEVEDMIDGDFIYQPQFKFGSYKILANAIDEYIES
IYSRFAKFVNYKGKVVVDCAHGAASHNFEALLDKFGINYVSIASNPDGLNINVGCGATCVSNIKKAVKEQKADLGISLDG
DADRIIIVDENGQEIDGDGILNILAQYSDICGGTNGIVGTQMTNMSYENHYRANKIPFIRSKVGDRYVLEDLVKYGYKIG
GESSGHVINLNFGTTGDGLFTAIQLLAIFSQADKPVSEFKLQGELMQQTLINVPLTKKVAREDLQKVASDVNDVEKRLGN
RGRVLLRPSGTEPVLRVMVEADDKSLATNEAEYLVEKVKQKLV
>P25177 5.4.2.10~~~glmM~~~Phosphoglucosamine mutase~~~COG1109
MKIFGTDGVRGKAGVKLTPMFVMRLGIAAGLYFKKHSQTNKILIGKDTRKSGYMVENALVSALTSIGYNVIQIGPMPTPA
IAFLTEDMRCDAGIMISASHNPFEDNGIKFFNSYGYKLKEEEEKAIEEIFHDEELLHSSYKVGESVGSAKRIDDVIGRYI
AHLKHSFPKHLNLQSLRIVLDTANGAAYKVAPVVFSELGADVLVINDEPNGCNINDQCGALHPNQLSQEVKKYRADLGFA
FDGDADRLVVVDNLGNIVHGDKLLGVLGVYQKSKNALSSQAVVATNMSNLALKEYLKSQDLELKHCAIGDKFVSECMQLN
KANFGGEQSGHIIFSDYAKTGDGLVCALQVSALVLESKQVSSVALNPFELYPQSLVNLNVQKKPPLESLKGYSALLKELD
KLEIRHLIRYSGTENKLRILLEAKDEKLLESKMQELKEFFEGHLC
>P9WN41 5.4.2.10~~~glmM~~~Phosphoglucosamine mutase~~~COG1109
MGRLFGTDGVRGVANRELTAELALALGAAAARRLSRSGAPGRRVAVLGRDPRASGEMLEAAVIAGLTSEGVDALRVGVLP
TPAVAYLTGAYDADFGVMISASHNPMPDNGIKIFGPGGHKLDDDTEDQIEDLVLGVSRGPGLRPAGAGIGRVIDAEDATE
RYLRHVAKAATARLDDLAVVVDCAHGAASSAAPRAYRAAGARVIAINAEPNGRNINDGCGSTHLDPLRAAVLAHRADLGL
AHDGDADRCLAVDANGDLVDGDAIMVVLALAMKEAGELACNTLVATVMSNLGLHLAMRSAGVTVRTTAVGDRYVLEELRA
GDYSLGGEQSGHIVMPALGSTGDGIVTGLRLMTRMVQTGSSLSDLASAMRTLPQVLINVEVVDKATAAAAPSVRTAVEQA
AAELGDTGRILLRPSGTEPMIRVMVEAADEGVAQRLAATVADAVSTAR
>P0C0V7 5.4.2.10~~~glmM~~~Phosphoglucosamine mutase~~~COG1109
MGKYFGTDGVRGVANQELTPELAFKLGRYGGYVLAHNKGEKHPRVLVGRDTRVSGEMLESALIAGLISIGAEVMRLGIIS
TPGVAYLTRDMGAELGVMISASHNPVADNGIKFFGSDGFKLSDEQENEIEALLDQENPELPRPVGNDIVHYSDYFEGAQK
YLSYLKSTVDVNFEGLKIALDGANGSTSSLAPFLFGDLEADTETIGCSPDGYNINEKCGSTHPEKLAEKVVETESDFGLA
FDGDGDRIIAVDENGQIVDGDQIMFIIGQEMHKNQELNNDMIVSTVMSNLGFYKALEQEGIKSNKTKVGDRYVVEEMRRG
NYNLGGEQSGHIVMMDYNTTGDGLLTGIQLASVIKMTGKSLSELAGQMKKYPQSLINVRVTDKYRVEENVDVKEVMTKVE
VEMNGEGRILVRPSGTEPLVRVMVEAATDEDAERFAQQIADVVQDKMGLDK
>Q5HE43 5.4.2.10~~~glmM~~~Phosphoglucosamine mutase~~~
MGKYFGTDGVRGVANQELTPELAFKLGRYGGYVLAHNKGEKHPRVLVGRDTRVSGEMLESALIAGLISIGAEVMRLGIIS
TPGVAYLTRDMGAELGVMISASHNPVADNGIKFFGSDGFKLSDEQENEIEALLDQENPELPRPVGNDIVHYSDYFEGAQK
YLSYLKSTVDVNFEGLKIALDGANGSTSSLAPFLFGDLEADTETIGCSPDGYNINEKCGSTHPEKLAEKVVETESDFGLA
FDGDGDRIIAVDENGQIVDGDQIMFIIGQEMHKNQELNNDMIVSTVMSNLGFYKALEQEGIKSNKTKVGDRYVVEEMRRG
NYNLGGEQSGHIVMMDYNTTGDGLLTGIQLASVIKMTGKSLSELAGQMKKYPQSLINVRVTDKYRVEENVDVKEVMTKVE
VEMNGEGRILVRPSGTEPLVRVMVEAATDEDAERFAQQIADVVQDKMGLDK
>P99087 5.4.2.10~~~glmM~~~Phosphoglucosamine mutase~~~
MGKYFGTDGVRGVANQELTPELAFKLGRYGGYVLAHNKGEKHPRVLVGRDTRVSGEMLESALIAGLISIGAEVMRLGIIS
TPGVAYLTRDMGAELGVMISASHNPVADNGIKFFGSDGFKLSDEQENEIEALLDQENPELPRPVGNDIVHYSDYFEGAQK
YLSYLKSTVDVNFEGLKIVLDGANGSTSSLAPFLFGDLEADTETIGCSPDGYNINEKCGSTHPEKLAEKVVETESDFGLA
FDGDGDRIIAVDENGQIVDGDQIMFIIGQEMHKNQELNNDMIVSTVMSNLGFYKALEQEGIKSNKTKVGDRYVVEEMRRG
NYNLGGEQSGHIVMMDYNTTGDGLLTGIQLASVIKMTGKSLSELAGQMKKYPQSLINVRVTDKYRVEENVDVKEVMTKVE
VEMNGEGRILVRPSGTEPLVRVMVEAATDEDAERFAQQIADVVQDKMGLDK
>Q8DP16 5.4.2.10~~~glmM~~~Phosphoglucosamine mutase~~~COG1109
MGKYFGTDGVRGEANLELTPELAFKLGRFGGYVLSQHETEAPKVFVGRDTRISGEMLESALVAGLLSVGIHVYKLGVLAT
PAVAYLVETEGASAGVMISASHNPALDNGIKFFGGDGFKLDDEKEAEIEALLDAEEDTLPRPSAEGLGILVDYPEGLRKY
EGYLVSTGTPLDGMKVALDTANGAASTSARQIFADLGAQLTVIGETPDGLNINLNVGSTHPEALQEVVKESGSAIGLAFD
GDSDRLIAVDENGDIVDGDKIMYIIGKYLSKKGQLAQNTIVTTVMSNLGFHKALNREGINKAVTAVGDRYVVEEMRKSGY
NLGGEQSGHVILMDYNTTGDGQLSAVQLTKIMKETGKSLSELAAEVTIYPQKLVNIRVENVMKEKAMEVPAIKAIIEKME
EEMAGNGRILVRPSGTEPLLRVMAEAPTTEEVDYYVDTITDVVRAEIGID
>P0CI73 2.6.1.16~~~glmS~~~Glutamine--fructose-6-phosphate aminotransferase [isomerizing]~~~COG0449
MCGIVGYIGQLDAKEILLKGLEKLEYRGYDSAGIAVANEQGIHVFKEKGRIADLREVVDANVEAKAGIGHTRWATHGEPS
YLNAHPHQSALGRFTLVHNGVIENYVQLKQEYLQDVELKSDTDTEVVVQVIEQFVNGGLETEEAFRKTLTLLKGSYAIAL
FDNDNRETIFVAKNKSPLLVGLGDTFNVVASDAMAMLQVTNEYVELMDKEMVIVTDDQVVIKNLDGDVITRASYIAELDA
SDIEKGTYPHYMLKETDEQPVVMRKIIQTYQDENGKLSVPGDIAAAVAEADRIYIIGCGTSYHAGLVGKQYIEMWANVPV
EVHVASEFSYNMPLLSKKPLFIFLSQSGETADSRAVLVQVKALGHKALTITNVPGSTLSREADYTLLLHAGPEIAVASTK
AYTAQIAVLAVLASVAADKNGINIGFDLVKELGIAANAMEALCDQKDEMEMIAREYLTVSRNAFFIGRGLDYFVCVEGAL
KLKEISYIQAEGFAGGELKHGTIALIEQGTPVFALATQEHVNLSIRGNVKEVAARGANTCIISLKGLDDADDRFVLPEVN
PALAPLVSVVPLQLIAYYAALHRGCDVDKPRNLAKSVTVE
>P17169 2.6.1.16~~~glmS~~~Glutamine--fructose-6-phosphate aminotransferase [isomerizing]~~~COG0449
MCGIVGAIAQRDVAEILLEGLRRLEYRGYDSAGLAVVDAEGHMTRLRRLGKVQMLAQAAEEHPLHGGTGIAHTRWATHGE
PSEVNAHPHVSEHIVVVHNGIIENHEPLREELKARGYTFVSETDTEVIAHLVNWELKQGGTLREAVLRAIPQLRGAYGTV
IMDSRHPDTLLAARSGSPLVIGLGMGENFIASDQLALLPVTRRFIFLEEGDIAEITRRSVNIFDKTGAEVKRQDIESNLQ
YDAGDKGIYRHYMQKEIYEQPNAIKNTLTGRISHGQVDLSELGPNADELLSKVEHIQILACGTSYNSGMVSRYWFESLAG
IPCDVEIASEFRYRKSAVRRNSLMITLSQSGETADTLAGLRLSKELGYLGSLAICNVPGSSLVRESDLALMTNAGTEIGV
ASTKAFTTQLTVLLMLVAKLSRLKGLDASIEHDIVHGLQALPSRIEQMLSQDKRIEALAEDFSDKHHALFLGRGDQYPIA
LEGALKLKEISYIHAEAYAAGELKHGPLALIDADMPVIVVAPNNELLEKLKSNIEEVRARGGQLYVFADQDAGFVSSDNM
HIIEMPHVEEVIAPIFYTVPLQLLAYHVALIKGTDVDQPRNLAKSVTVE
>Q5NHQ9 2.6.1.16~~~glmS~~~Glutamine--fructose-6-phosphate aminotransferase [isomerizing]~~~COG0449
MCGIVGANSTRNVTNILIEGLKKLEYRGYDSAGLAIIDDKNNIDICKEVGKVIELEKSVHNLANFKGDIGIAHTRWATHG
KPSKNNSHPHASESFCIVHNGVIENFAELKKVLINDGYKFKSDTDTEVIAHLLQKEWRDNFSIVDNIKYIMAMLKGAYAV
AIISQKFSDKIVAVRSGSPLVIGVGIDENFISSDALSLLPVTNKFSYLDEGDIAIISKDNVEVFDNNGAAKNLEVEEYNY
SSSSASKDGYKHYMLKEIYEQPEAVSNTILASLADGEISLDSFDKRAKELFEKTKHICIVACGTSYNAGMTAKYWIEKYA
KVPCSVEIASEIRYRDNVVVDGSLFVSISQSGETADTLESLRKSKKQNYVGSMCICNVPNSSLVRESDIAFMTKAGVEIG
VASTKAFTTQLVALAIFTLVIAKLKNSLTDQQIAKYTEELKNIRALVMGALKLDTEIDQISEYFSDKEHTIFLGRGLYYP
IAIEGALKLKEISYIHAEAYPSGELKHGPLALVDKNMPIVAVVPNDELLDKTLSNLQEVHARGGKLILFVDKAVKERVNF
DNSIVLELDAGHDFSAPVVFTIPLQLLSYHVAIIKGTDVDQPRNLAKSVTVE
>P9WN49 2.6.1.16~~~glmS~~~Glutamine--fructose-6-phosphate aminotransferase [isomerizing]~~~COG0449
MCGIVGYVGRRPAYVVVMDALRRMEYRGYDSSGIALVDGGTLTVRRRAGRLANLEEAVAEMPSTALSGTTGLGHTRWATH
GRPTDRNAHPHRDAAGKIAVVHNGIIENFAVLRRELETAGVEFASDTDTEVAAHLVARAYRHGETADDFVGSVLAVLRRL
EGHFTLVFANADDPGTLVAARRSTPLVLGIGDNEMFVGSDVAAFIEHTREAVELGQDQAVVITADGYRISDFDGNDGLQA
GRDFRPFHIDWDLAAAEKGGYEYFMLKEIAEQPAAVADTLLGHFVGGRIVLDEQRLSDQELREIDKVFVVACGTAYHSGL
LAKYAIEHWTRLPVEVELASEFRYRDPVLDRSTLVVAISQSGETADTLEAVRHAKEQKAKVLAICNTNGSQIPRECDAVL
YTRAGPEIGVASTKTFLAQIAANYLLGLALAQARGTKYPDEVEREYHELEAMPDLVARVIAATGPVAELAHRFAQSSTVL
FLGRHVGYPVALEGALKLKELAYMHAEGFAAGELKHGPIALIEDGLPVIVVMPSPKGSATLHAKLLSNIREIQTRGAVTI
VIAEEGDETVRPYADHLIEIPAVSTLLQPLLSTIPLQVFAASVARARGYDVDKPRNLAKSVTVE
>P64228 2.6.1.16~~~glmS~~~Glutamine--fructose-6-phosphate aminotransferase [isomerizing]~~~
MCGIVGYIGYDNAKELLLKGLEKLEYRGYDSAGIAVVNDDNTTVFKEKGRIAELRKVADSSDFDGPVGIGHTRWATHGVP
NHENSHPHQSSNGRFTLVHNGVIENYEELKGEYLQGVSFISETDTEVIVQLVEYFSNQGLSTEEAFTKVVSLLHGSYALG
LLDAEDKDTIYVAKNKSPLLLGVGEGFNVIASDALAMLQVTSEYKEIHDHEIVIVKKDEVIIKDADGNVVERDSYIAEID
ASDAEKGVYAHYMLKEIHEQPAVMRRIIQEYQDAEGNLKIDQDIINDVKEADRIYVIAAGTSYHAGLVGKEFLEKWAGVP
TEVHVASEFVYNMPLLSEKPLFVYISQSGETADSRAVLVETNKLGHKSLTITNVAGSTLSREADHTLLLHAGPEIAVAST
KAYTAQIAVLSILSQIVAKEHGREADIDLLRELAKVTTAIEAIVDDAPIMEQIATDFLETTRNAFFIGRTIDYNVSLEGA
LKLKEISYIQAEGFAGGELKHGTIALIEDGTPVVALATQENVNLSIRGNVKEVVARGAHPCIISMEGLEKEGDTYVIPHV
HELLTPLVSVVALQLISYYAALHRDLDVDKPRNLAKSVTVE
>B0VPT6 ~~~glmU~~~Bifunctional protein GlmU~~~
MSTTVIILAAGKGTRMRSQLPKVLQPLAGRPLLGHVIKTAKQLLAENIITIYGHGGDHVKKTFAQENIQWVEQAEQLGTG
HAVQMTLPVLPKDGISLILYGDVPLARQTTLEQLIEASNKTGIGMITLHVDNPTGYGRIVRQDGKIQAIVEHKDATEAQR
QIQEINTGIYCVSNAKLHEWLPKLSNENAQGEYYLTDIVAMAVADGLEIASIQPELAFEVEGVNDRLQLAALEREFQKQQ
AKELMQQGVTFADPARFDLRGTVKVGHDVRIDVNVIIEGDCELGDFVEIGAGCILKNTTIAAGTKVQAYSVFDGAVVGEN
TQIGPFARLRPGAKLANEVHIGNFVEVKNTTIGLGSKANHFTYLGDAEIGAESNIGAGTITCNYDGANKHKTTIGDAVFI
GSNSSLVAPVTIGNGATVGAGSVITKDVAEQSLSFERAQQISKANYQRPQKLKK
>P0ACC7 ~~~glmU~~~Bifunctional protein GlmU~~~COG1207
MLNNAMSVVILAAGKGTRMYSDLPKVLHTLAGKAMVQHVIDAANELGAAHVHLVYGHGGDLLKQALKDDNLNWVLQAEQL
GTGHAMQQAAPFFADDEDILMLYGDVPLISVETLQRLRDAKPQGGIGLLTVKLDDPTGYGRITRENGKVTGIVEHKDATD
EQRQIQEINTGILIANGADMKRWLAKLTNNNAQGEYYITDIIALAYQEGREIVAVHPQRLSEVEGVNNRLQLSRLERVYQ
SEQAEKLLLAGVMLRDPARFDLRGTLTHGRDVEIDTNVIIEGNVTLGHRVKIGTGCVIKNSVIGDDCEISPYTVVEDANL
AAACTIGPFARLRPGAELLEGAHVGNFVEMKKARLGKGSKAGHLTYLGDAEIGDNVNIGAGTITCNYDGANKFKTIIGDD
VFVGSDTQLVAPVTVGKGATIAAGTTVTRNVGENALAISRVPQTQKEGWRRPVKKK
>P43889 ~~~glmU~~~Bifunctional protein GlmU~~~COG1207
MTKKALSAVILAAGKGTRMYSDLPKVLHTIAGKPMVKHVIDTAHQLGSENIHLIYGHGGDLMRTHLANEQVNWVLQTEQL
GTAHAVQQAAPFFKDNENIVVLYGDAPLITKETLEKLIEAKPENGIALLTVNLDNPTGYGRIIRENGNVVAIVEQKDANA
EQLNIKEVNTGVMVSDGASFKKWLARVGNNNAQGEYYLTDLIALANQDNCQVVAVQATDVMEVEGANNRLQLAALERYFQ
NKQASKLLLEGVMIYDPARFDLRGTLEHGKDVEIDVNVIIEGNVKLGDRVKIGTGCVLKNVVIGNDVEIKPYSVLEDSIV
GEKAAIGPFSRLRPGAELAAETHVGNFVEIKKSTVGKGSKVNHLTYVGDSEIGSNCNIGAGVITCNYDGANKFKTIIGDD
VFVGSDTQLVAPVKVANGATIGAGTTITRDVGENELVITRVAQRHIQGWQRPIKKK
>A6TG34 ~~~glmU~~~Bifunctional protein GlmU~~~
MSNSAMSVVILAAGKGTRMYSDLPKVLHTLAGKPMVQHVIDAANDLGACAVHLVYGHGGDLLRQTLHEDNLNWVLQAEQL
GTGHAMQQAAPFFNDDEDILMLYGDVPLISVETLQRLRAAKPQGGIGLLTVKLDDPTGYGRITRENGQVTGIVEHKDASE
AQRQIQEINTGILIAGGADLKRWLAKLTNNNAQGEYYITDIIAMAHQEGHQIVAVHPQRLSEVEGVNNRLQLARLERVYQ
AEQAEKLLLAGVMLRDPARFDLRGTLQHGRDVEIDTNVILEGNVVLGDRVKIGAGCVIKNSTIGDDCEISPYSVVEDAQL
QAACTIGPFARLRPGAELLEGAHVGNFVEMKKARLGKGSKAGHLTYLGDAEIGDNVNIGAGTITCNYDGANKHKTIIGDD
VFVGSDTQLVAPVTVGNGVTIAAGTTVTRNIADNELVLSRVPQVHKQGWQRPVKKK
>A5U161 ~~~glmU~~~Bifunctional protein GlmU~~~COG1207
MTFPGDTAVLVLAAGPGTRMRSDTPKVLHTLAGRSMLSHVLHAIAKLAPQRLIVVLGHDHQRIAPLVGELADTLGRTIDV
ALQDRPLGTGHAVLCGLSALPDDYAGNVVVTSGDTPLLDADTLADLIATHRAVSAAVTVLTTTLDDPFGYGRILRTQDHE
VMAIVEQTDATPSQREIREVNAGVYAFDIAALRSALSRLSSNNAQQELYLTDVIAILRSDGQTVHASHVDDSALVAGVNN
RVQLAELASELNRRVVAAHQLAGVTVVDPATTWIDVDVTIGRDTVIHPGTQLLGRTQIGGRCVVGPDTTLTDVAVGDGAS
VVRTHGSSSSIGDGAAVGPFTYLRPGTALGADGKLGAFVEVKNSTIGTGTKVPHLTYVGDADIGEYSNIGASSVFVNYDG
TSKRRTTVGSHVRTGSDTMFVAPVTIGDGAYTGAGTVVREDVPPGALAVSAGPQRNIENWVQRKRPGSPAAQASKRASEM
ACQQPTQPPDADQTP
>P9WMN2 ~~~glmU~~~Bifunctional protein GlmU~~~
MTFPGDTAVLVLAAGPGTRMRSDTPKVLHTLAGRSMLSHVLHAIAKLAPQRLIVVLGHDHQRIAPLVGELADTLGRTIDV
ALQDRPLGTGHAVLCGLSALPDDYAGNVVVTSGDTPLLDADTLADLIATHRAVSAAVTVLTTTLDDPFGYGRILRTQDHE
VMAIVEQTDATPSQREIREVNAGVYAFDIAALRSALSRLSSNNAQQELYLTDVIAILRSDGQTVHASHVDDSALVAGVNN
RVQLAELASELNRRVVAAHQLAGVTVVDPATTWIDVDVTIGRDTVIHPGTQLLGRTQIGGRCVVGPDTTLTDVAVGDGAS
VVRTHGSSSSIGDGAAVGPFTYLRPGTALGADGKLGAFVEVKNSTIGTGTKVPHLTYVGDADIGEYSNIGASSVFVNYDG
TSKRRTTVGSHVRTGSDTMFVAPVTIGDGAYTGAGTVVREDVPPGALAVSAGPQRNIENWVQRKRPGSPAAQASKRASEM
ACQQPTQPPDADQTP
>P9WMN3 ~~~glmU~~~Bifunctional protein GlmU~~~COG1207
MTFPGDTAVLVLAAGPGTRMRSDTPKVLHTLAGRSMLSHVLHAIAKLAPQRLIVVLGHDHQRIAPLVGELADTLGRTIDV
ALQDRPLGTGHAVLCGLSALPDDYAGNVVVTSGDTPLLDADTLADLIATHRAVSAAVTVLTTTLDDPFGYGRILRTQDHE
VMAIVEQTDATPSQREIREVNAGVYAFDIAALRSALSRLSSNNAQQELYLTDVIAILRSDGQTVHASHVDDSALVAGVNN
RVQLAELASELNRRVVAAHQLAGVTVVDPATTWIDVDVTIGRDTVIHPGTQLLGRTQIGGRCVVGPDTTLTDVAVGDGAS
VVRTHGSSSSIGDGAAVGPFTYLRPGTALGADGKLGAFVEVKNSTIGTGTKVPHLTYVGDADIGEYSNIGASSVFVNYDG
TSKRRTTVGSHVRTGSDTMFVAPVTIGDGAYTGAGTVVREDVPPGALAVSAGPQRNIENWVQRKRPGSPAAQASKRASEM
ACQQPTQPPDADQTP
>Q7A7B4 ~~~glmU~~~Bifunctional protein GlmU~~~
MRRHAIILAAGKGTRMKSKKYKVLHEVAGKPMVEHVLESVKGSGVDQVVTIVGHGAESVKGHLGERSLYSFQEKQLGTAH
AVQMAKSHLEDKEGTTIVVCGDTPLITKETLETLIAHHEDANAQATVLSASIQQPYGYGRIVRNASGRLERIVEEKDATQ
AEKDINEISSGIFAFNNKTLFEKLTQVKNDNAQGEYYLPDVLSLILNDGGIVEVYRTNDVEEIMGVNDRVMLSQAEKAMQ
RRTNHYHMLNGVTIIDPDSTFIGPDVTIGSDTVIEPGVRINGRTEIGEDVVIGQYSEINNSTIENGACIQQSVVNDASVG
ANTKVGPFAQLRPGAQLGADVKVGNFVEIKKADLKDGAKVSHLSYIGDAVIGERTNIGCGTITVNYDGENKFKTIVGKDS
FVGCNVNLVAPVTIGDDVLVAAGSTITDDVPNDSLAVARARQTTKEGYRK
>B2FHY5 ~~~glmU~~~Bifunctional protein GlmU~~~COG1207
MTQPLHVIILAAGAGKRMKSVLPKVLQPIAGQPMLAHVIDAARELQPAAIHVVHGHGGEAVRQYFAGQPDLQWAEQAQQL
GTGHAVAQAMPQVPDLAQVLVLYGDVPLIRAQTLRDLLAQPGRLAVLVADVDDPTGYGRVLRDAEGKVGAIIEQKDATDD
QLRVRTINTGIIAAESTALRRWLSQLSNSNAQGEYYLTDVFAFAAHEYTPAEMALVADAQEAEGANDPWQLSQLERAWQR
RAVRALCAQGARVRDPARLDIRGTVTVGSDVLIDVDVVLEGKVVLGDGVTVGPFNRLKDVNLGPGTDVRAHCDLEGVVTE
GAAQIGPFARLRPGTVLADGVHVGNFVETKKVTLGVGSKANHLTYLGDAVIGSKVNIGAGTITCNYDGVNKSTTTIGDNA
FIGSNSSLVAPVTIGDGATIAAGSVITRNAPDGKLTLARARQETIDGWKRPLKKS
>Q04KU2 ~~~glmU~~~Bifunctional protein GlmU~~~COG1207
MSNFAIILAAGKGTRMKSDLPKVLHKVAGISMLEHVFRSVGAIQPEKTVTVVGHKAELVEEVLAEQTEFVTQSEQLGTGH
AVMMTEPILEGLSGHTLVIAGDTPLITGESLKNLIDFHINHKNVATILTAETDNPFGYGRIVRNDNAEVLRIVEQKDATD
FEKQIKEINTGTYVFDNERLFEALKNINTNNAQGEYYITDVIGIFRETGEKVGAYTLKDFDESLGVNDRVALATAESVMR
RRINHKHMVNGVSFVNPEATYIDIDVEIAPEVQIEANVILKGQTKIGAETVLTNGTYVVDSTIGAGAVITNSMIEESSVA
DGVTVGPYAHIRPNSSLGAQVHIGNFVEVKGSSIGENTKAGHLTYIGNCEVGSNVNFGAGTITVNYDGKNKYKTVIGDNV
FVGSNSTIIAPVELGDNSLVGAGSTITKDVPADAIAIGRGRQINKDEYATRLPHHPKNQ
>Q97R46 ~~~glmU~~~Bifunctional protein GlmU~~~COG1207
MSNFAIILAAGKGTRMKSDLPKVLHKVAGISMLEHVFRSVGAIQPEKTVTVVGHKAELVEEVLAGQTEFVTQSEQLGTGH
AVMMTEPILEGLSGHTLVIAGDTPLITGESLKNLIDFHINHKNVATILTAETDNPFGYGRIVRNDNAEVLRIVEQKDATD
FEKQIKEINTGTYVFDNERLFEALKNINTNNAQGEYYITDVIGIFRETGEKVGAYTLKDFDESLGVNDRVALATAESVMR
RRINHKHMVNGVSFVNPEATYIDIDVEIASEVQIEANVTLKGQTKIGAETVLTNGTYVVDSTIGAGAVITNSMIEESSVA
DGVIVGPYAHIRPNSSLGAQVHIGNFVEVKGSSIGENTKAGHLTYIGNCEVGSNVNFGAGTITVNYDGKNKYKTVIGNNV
FVGSNSTIIAPVELGDNSLVGAGSTITKDVPADAIAIGRGRQINKDEYATRLPHHPKNQ
>Q8DQ18 ~~~glmU~~~Bifunctional protein GlmU~~~COG1207
MSNFAIILAAGKGTRMKSDLPKVLHKVAGISMLEHVFRSVGAIQPEKTVTVVGHKAELVEEVLAEQTEFVTQSEQLGTGH
AVMMTEPILEGLSGHTLVIAGDTPLITGESLKNLIDFHINHKNVATILTAETDNPFGYGRIVRNDNAEVLRIVEQKDATD
FEKQIKEINTGTYVFDNERLFEALKNINTNNAQGEYYITDVIGIFRETGEKVGAYTLKDFDESLGVNDRVALATAESVMR
RRINHKHMVNGVSFVNPEATYIDIDVEIAPEVQIEANVILKGQTKIGAETVLTNGTYVVDSTIGAGAVITNSMIEESSVA
DGVTVGPYAHIRPNSSLGAQVHIGNFVEVKGSSIGENTKAGHLTYIGNCEVGSNVNFGAGTITVNYDGKNKYKTVIGDNV
FVGSNSTIIAPVELGDNSLVGAGSTITKDVPADAIAIGRGRQINKDEYATRLPHHPKNQ
>Q8Z9S7 ~~~glmU~~~Bifunctional protein GlmU~~~COG1207
MSNSSMSVVILAAGKGTRMYSDLPKVLHPLAGKPMVQHVIDAAMKLGAQHVHLVYGHGGELLKKTLADPSLNWVLQAEQL
GTGHAMQQAAPHFADDEDILMLYGDVPLISVDTLQRLLAAKPEGGIGLLTVKLDNPSGYGRIVRENGDVVGIVEHKDASD
AQREINEINTGILVANGRDLKRWLSLLDNNNAQGEFYITDIIALAHADGKKIATVHPTRLSEVEGVNNRLQLSALERVFQ
TEQAEKLLLAGVMLLDPSRFDLRGELTHGRDITIDTNVIIEGHVILGDRVRIGTGCVLKNCVIGDDSEISPYTVLEDARL
DANCTVGPFARLRPGAELAEGAHVGNFVEIKKARLGKGSKAGHLSYLGDAEIGAGVNIGAGTITCNYDGANKFKTIIGDD
VFVGSDTQLVAPVTVANGATIGAGTTVTRDVAENELVISRVKQVHIQGWKRPVKKK
>A4TSJ5 ~~~glmU~~~Bifunctional protein GlmU~~~
MSNSSMSVVILAAGKGTRMYSDLPKVLHPLAGKPMVQHVIDAAMKLGAQHVHLVYGHGGELLKKTLADPSLNWVLQAEQL
GTGHAMQQAAPHFADDEDILMLYGDVPLISVDTLQRLLAAKPEGGIGLLTVKLDNPSGYGRIVRENGDVVGIVEHKDASD
AQREINEINTGILVANGRDLKRWLSLLDNNNAQGEFYITDIIALAHADGKKIATVHPTRLSEVEGVNNRLQLSALERVFQ
TEQAEKLLLAGVMLLDPSRFDLRGELTHGRDITIDTNVIIEGHVILGDRVRIGTGCVLKNCVIGDDSEISPYTVLEDARL
DANCTVGPFARLRPGAELAEGAHVGNFVEIKKARLGKGSKAGHLSYLGDAEIGAGVNIGAGTITCNYDGANKFKTIIGDD
VFVGSDTQLVAPVTVANGATIGAGTTVTRDVAENELVISRVKQVHIQGWKRPVKKK
>P19064 6.3.1.2~~~glnA~~~Glutamine synthetase~~~COG0174
MARYTKEDIFRLAKEENVKYIRLQFTDLLGVIKNVEIPVSQLTKALDNKMMFDGSSIEGFVRIEESDMYLYPDLDTWVIF
PWTAEKGKVARLICDIYNADGTPFEGDPRNNLKRVLKEMEALGFSDFNLGPEPEFFLFKVDEKGNPTLELNDNGGYFDLA
PMDLGENCRRDIVLELEEMGFEIEASHHEVAPGQHEIDFKYANAIRSCDDIQTFKLVVKTIARKHGLHATFMPKPLYGVN
GSGMHCNLSLFKNGENVFYDQNGDLQLSDDARHFIAGILKHAPAFTAVANPTVNSYKRLVPGYEAPCYVAWSAQNRSPLV
RIPASRGISTRVEVRSVDPAANPYLVMATLLAAGLDGIKNKLTPPAAVDRNIYVMTKEEREEAGIVDLPATLAQALVTLQ
SNEVISNALGDHLLEHFIEAKEFEWDIFRTQVHQWERDQYMSLY
>P12425 6.3.1.2~~~glnA~~~Glutamine synthetase~~~COG0174
MAKYTREDIEKLVKEENVKYIRLQFTDILGTIKNVEIPVSQLGKALDNKVMFDGSSIEGFVRIEESDMYLYPDLNTFVIF
PWTAEKGKVARFICDIYNPDGTPFEGDPRNNLKRILKEMEDLGFSDFNLGPEPEFFLFKLDEKGEPTLELNDKGGYFDLA
PTDLGENCRRDIVLELEEMGFEIEASHHEVAPGQHEIDFKYAGAVRSCDDIQTFKLVVKTIARKHGLHATFMPKPLFGVN
GSGMHCNLSLFKNGVNAFFDENADLQLSETAKHFIAGIVKHATSFTAVTNPTVNSYKRLVPGYEAPCYVAWSAQNRSPLI
RIPASRGISTRVEVRSVDPAANPYLALSVLLAAGLDGIKNKLEAPAPIDRNIYVMSKEERMENGIVDLPATLAEALEEFK
SNEVMVKALGEHLFEHFIEAKEIEWDMFRTQVHPWEREQYMSQY
>A0R083 6.3.1.2~~~glnA~~~Glutamine synthetase~~~COG0174
MDRQKEFVLRTLEERDIRFVRLWFTDVLGYLKSVAIAPAELEGAFEEGIGFDGSSIEGFARVFESDTVARPDPSTFQVLP
WKTSDGNHYSARMFCDITMPDGSPSWADSRHVLRRQLAKASDLGFTCYVHPEIEFFLLKPGPNDGTPPEPADNGGYFDQA
VHDAAPNFRRHAIEALEQMGISVEFSHHEGAPGQQEIDLRYADALSMADNVMTFRYLVKEVALADGVRASFMPKPFAEHP
GSAMHTHMSLFEGDTNAFHSPDDPLQLSDVAKSFIAGILEHANEISAVTNQWVNSYKRLVHGGEAPTAASWGAANRSALV
RVPMYTPHKVSSRRVEVRSPDSACNPYLTFAVLLAAGLRGVEKGYVLGPQAEDNVWSLTQEERRAMGYRELPTSLGNALE
SMENSELVAEALGEHVFDYFLRNKRSEWENYRSHVTPYELKNYLSL
>P9WN37 6.3.1.2~~~glnA2~~~Glutamine synthetase~~~COG0174
MDRQKEFVLRTLEERDIRFVRLWFTDVLGFLKSVAIAPAELEGAFEEGIGFDGSSIEGFARVSESDTVAHPDPSTFQVLP
WATSSGHHHSARMFCDITMPDGSPSWADPRHVLRRQLTKAGELGFSCYVHPEIEFFLLKPGPEDGSVPVPVDNAGYFDQA
VHDSALNFRRHAIDALEFMGISVEFSHHEGAPGQQEIDLRFADALSMADNVMTFRYVIKEVALEEGARASFMPKPFGQHP
GSAMHTHMSLFEGDVNAFHSADDPLQLSEVGKSFIAGILEHACEISAVTNQWVNSYKRLVQGGEAPTAASWGAANRSALV
RVPMYTPHKTSSRRVEVRSPDSACNPYLTFAVLLAAGLRGVEKGYVLGPQAEDNVWDLTPEERRAMGYRELPSSLDSALR
AMEASELVAEALGEHVFDFFLRNKRTEWANYRSHVTPYELRTYLSL
>P99095 6.3.1.2~~~glnA~~~Glutamine synthetase~~~
MPKRTFTKEDIRKFAEEENVRYLRLQFTDILGTIKNVEVPVSQLEKVLDNEMMFDGSSIEGFVRIEESDMYLHPDLDTWV
IFPWTAGQGKVARLICDVYKTDGTPFEGDPRANLKRVLKEMEDLGFTDFNLGPEPEFFLFKLDEKGEPTLELNDDGGYFD
LAPTDLGENCRRDIVLELEDMGFDIEASHHEVAPGQHEIDFKYADAVTACDNIQTFKLVVKTIARKHNLHATFMPKPLFG
VNGSGMHFNVSLFKGKENAFFDPNTEMGLTETAYQFTAGVLKNARGFTAVCNPLVNSYKRLVPGYEAPCYIAWSGKNRSP
LIRVPSSRGLSTRIEVRSVDPAANPYMALAAILEAGLDGIKNKLKVPEPVNQNIYEMNREEREAVGIQDLPSTLYTALKA
MRENEVIKKALGNHIYNQFINSKSIEWDYYRTQVSEWERDQYMKQY
>P0A9C5 6.3.1.2~~~glnA~~~Glutamine synthetase~~~COG0174
MSAEHVLTMLNEHEVKFVDLRFTDTKGKEQHVTIPAHQVNAEFFEEGKMFDGSSIGGWKGINESDMVLMPDASTAVIDPF
FADSTLIIRCDILEPGTLQGYDRDPRSIAKRAEDYLRSTGIADTVLFGPEPEFFLFDDIRFGSSISGSHVAIDDIEGAWN
SSTQYEGGNKGHRPAVKGGYFPVPPVDSAQDIRSEMCLVMEQMGLVVEAHHHEVATAGQNEVATRFNTMTKKADEIQIYK
YVVHNVAHRFGKTATFMPKPMFGDNGSGMHCHMSLSKNGVNLFAGDKYAGLSEQALYYIGGVIKHAKAINALANPTTNSY
KRLVPGYEAPVMLAYSARNRSASIRIPVVSSPKARRIEVRFPDPAANPYLCFAALLMAGLDGIKNKIHPGEAMDKNLYDL
PPEEAKEIPQVAGSLEEALNELDLDREFLKAGGVFTDEAIDAYIALRREEDDRVRMTPHPVEFELYYSV
>P94845 6.3.1.2~~~glnA~~~Glutamine synthetase~~~COG0174
MIVRTQNSESKIKEFFEFCKENEVEFVDFRFSDIKGTWNHIAYSFGALTHGMLKEGIPFDASCFKGWQGIEHSDMILTPD
LVRYFIDPFSADVSVVVFCDVYDVYKNQPYEKCPRSIAKKALQHLKDSGLGDVAYFGAENEFFIFDSIKIKDASNSQYYE
VDSEEGEWNRDRSFENGVNFGHRPGKQGGYMPVPPTDTMMDIRTEIVKVLNQVGLETFVVHHEVAQAQGEVGVKFGDLVE
AADNVQKLKYVVKMVAHLNGKTATFMPKPLYGDNGSGMHTHVSVWKNNENLFSGETYKGLSEFALHFLGGVLRHARGLAA
FTNASTNSYKRLIPGYEAPSILTYSANNRSASVRIPYGISKNSARFEFRFPDSSSNPYLAFAAILMAGMDGVKNKIDPGE
AMDINLFKLTLDEIREKGIKQMPHTLRRSLEEMLADKQYLKESQVFSEEFIQAYQSLKFNAEVFPWESKPHPFEFITTYS
C
>P0A591 6.3.1.2~~~glnA1~~~Glutamine synthetase~~~
MTEKTPDDVFKLAKDEKVEYVDVRFCDLPGIMQHFTIPASAFDKSVFDDGLAFDGSSIRGFQSIHESDMLLLPDPETARI
DPFRAAKTLNINFFVHDPFTLEPYSRDPRNIARKAENYLISTGIADTAYFGAEAEFYIFDSVSFDSRANGSFYEVDAISG
WWNTGAATEADGSPNRGYKVRHKGGYFPVAPNDQYVDLRDKMLTNLINSGFILEKGHHEVGSGGQAEINYQFNSLLHAAD
DMQLYKYIIKNTAWQNGKTVTFMPKPLFGDNGSGMHCHQSLWKDGAPLMYDETGYAGLSDTARHYIGGLLHHAPSLLAFT
NPTVNSYKRLVPGYEAPINLVYSQRNRSACVRIPITGSNPKAKRLEFRSPDSSGNPYLAFSAMLMAGLDGIKNKIEPQAP
VDKDLYELPPEEAASIPQTPTQLSDVIDRLEADHEYLTEGGVFTNDLIETWISFKRENEIEPVNIRPHPYEFALYYDV
>A0R079 6.3.1.2~~~glnA~~~Glutamine synthetase~~~COG0174
MAEKTSDDIFKLIKDENVEYVDIRFCDLPGVVQHFSIPASAFDESVFEDGLAFDGSSVRGFQSIHESDMMLLPDPNTARI
DPFRAAKTLNMNFFVHDPFTREAYSRDPRNVARKAENYLASTGIADTAFFGAEAEFYIFDSVSFDSKINGTFYEVDSESG
WWNTGEPFESDGSANRGYKVRPKGGYFPVAPYDHYVDLRDQMATNLQNAGFTLERGHHEVGTAGQAEINYKFNTLLAAAD
DVLLFKYIIKNTAWQAGKTVTFMPKPLFGDNGSGMHAHQSLWKDGQPLFHDESGYAGLSDIARHYIGGILHHAPSLLAFT
NPTVNSYKRLVPGYEAPINLVYSQRNRSACVRIPITGNNPKAKRLEFRCPDSSGNPYLAFAAMLMAGIDGIKKKIEPLQP
VDKDLYELPPDEAAAIPQAPTSLSAVIDKLEEDHEYLTEGGVFTEDLIETWISYKRENEIMPIQIRPHPYEFSLYYDV
>P9WN39 6.3.1.2~~~glnA1~~~Glutamine synthetase~~~COG0174
MTEKTPDDVFKLAKDEKVEYVDVRFCDLPGIMQHFTIPASAFDKSVFDDGLAFDGSSIRGFQSIHESDMLLLPDPETARI
DPFRAAKTLNINFFVHDPFTLEPYSRDPRNIARKAENYLISTGIADTAYFGAEAEFYIFDSVSFDSRANGSFYEVDAISG
WWNTGAATEADGSPNRGYKVRHKGGYFPVAPNDQYVDLRDKMLTNLINSGFILEKGHHEVGSGGQAEINYQFNSLLHAAD
DMQLYKYIIKNTAWQNGKTVTFMPKPLFGDNGSGMHCHQSLWKDGAPLMYDETGYAGLSDTARHYIGGLLHHAPSLLAFT
NPTVNSYKRLVPGYEAPINLVYSQRNRSACVRIPITGSNPKAKRLEFRSPDSSGNPYLAFSAMLMAGLDGIKNKIEPQAP
VDKDLYELPPEEAASIPQTPTQLSDVIDRLEADHEYLTEGGVFTNDLIETWISFKRENEIEPVNIRPHPYEFALYYDV
>P00964 6.3.1.2~~~glnA~~~Glutamine synthetase~~~COG0174
MTTPQEVLKRIQDEKIELIDLKFIDTVGTWQHLTLYQNQIDESSFSDGVPFDGSSIRGWKAINESDMTMVLDPNTAWIDP
FMEVPTLSIVCSIKEPRTGEWYNRCPRVIAQKAIDYLVSTGIGDTAFFGPEAEFFIFDSARFAQNANEGYYFLDSVEGAW
NSGKEGTADKPNLAYKPRFKEGYFPVSPTDSFQDIRTEMLLTMAKLGVPIEKHHHEVATGGQCELGFRFGKLIEAADWLM
IYKYVIKNVAKKYGKTVTFMPKPIFGDNGSGMHCHQSIWKDGKPLFAGDQYAGLSEMGLYYIGGLLKHAPALLAITNPST
NSYKRLVPGYEAPVNLAYSQGNRSASIRIPLSGTNPKAKRLEFRCPDATSNPYLAFAAMLCAGIDGIKNKIHPGEPLDKN
IYELSPEELAKVPSTPGSLELALEALENDHAFLTDTGVFTEDFIQNWIDYKLANEVKQMQLRPHPYEFSIYYDV
>Q9HU65 6.3.1.2~~~glnA~~~Glutamine synthetase~~~
MSYKSHQLIKDHDVKWVDLRFTDTKGKQQHVTMPARDALDDEFFEAGKMFDGSSIAGWKGIEASDMILMPDDSTAVLDPF
TEEPTLILVCDIIEPSTMQGYERDPRNIAKRAEEYLKSTGIGDTVFVGPEPEFFIFDEVKFKSDISGSMFKIFSEQASWN
TDADIESGNKGHRPGVKGGYFPVPPVDHDHEIRTAMCNALEEMGLVVEVHHHEVATAGQNEIGVKFNTLVAKADEVQTLK
YCVHNVADAYGKTVTFMPKPLYGDNGSGMHVHMSISKDGKNTFAGEGYAGLSETALYFIGGIIKHGKALNGFTNPSTNSY
KRLVPGFEAPVMLAYSARNRSASIRIPYVSSPKARRIEARFPDPAANPYLAFAALLMAGLDGIQNKIHPGDAADKNLYDL
PPEEAKEIPQVCGSLKEALEELDKGRAFLTKGGVFTDEFIDAYIELKSEEEIKVRTFVHPLEYDLYYSV
>Q3V5W6 6.3.1.2~~~glnA~~~Glutamine synthetase~~~
MSKSVQLIKDHDVKWIDLRFTDTKGTQHHVTMPARDALEDDFFEVGKMFDGSSIAGWKGIEASDMILLPDDDTAVLDPFT
EDATLILVCDIIEPSTMQGYDRDPRAIAHRAEEYLKTTGIGDTVFAGPEPEFFIFDEVKFKSDISGSMFKIYSEQGSWMS
DQDIEGGNKGHRPGVKGGYFPVPPFDHDHEIRTAMCNALEEMGQTVEVHHHEVATAGQNEIGVKFNTLVKKADEVQTLKY
VVHNVADAYGRTATFMPKPLYGDNGSGMHVHMSIAKDGKNTFAGEGYAGLSETALYFIGGIIKHGKALNGFTNPATNSYK
RLVPGFEAPVMLAYSARNRSASIRIPYVNSPRGRRIEARFPDPAANPYLAFAALLMAGLDGIQNKIHPGDAADKNLYDLP
PEEAKEIPQVCGSLKEALEELDKGRAFLTKGGVFSDDFIDAYIALKSEEEIKVRTFVHPLEYELYYSC
>P0A1P7 6.3.1.2~~~glnA~~~Glutamine synthetase~~~COG0174
MSAEHVLTMLNEHEVKFVDLRFTDTKGKEQHVTIPAHQVNAEFFEEGKMFDGSSIGGWKGINESDMVLMPDASTAVIDPF
FADSTLIIRCDILEPGTLQGYDRDPRSIAKRAEDYLRATGIADTVLFGPEPEFFLFDDIRFGASISGSHVAIDDIEGAWN
SSTKYEGGNKGHRPGVKGGYFPVPPVDSAQDIRSEMCLVMEQMGLVVEAHHHEVATAGQNEVATRFNTMTKKADEIQIYK
YVVHNVAHRFGKTATFMPKPMFGDNGSGMHCHMSLAKNGTNLFSGDKYAGLSEQALYYIGGVIKHAKAINALANPTTNSY
KRLVPGYEAPVMLAYSARNRSASIRIPVVASPKARRIEVRFPDPAANPYLCFAALLMAGLDGIKNKIHPGEAMDKNLYDL
PPEEAKEIPQVAGSLEEALNALDLDREFLKAGGVFTDEAIDAYIALRREEDDRVRMTPHPVEFELYYSV
>P0A1P6 6.3.1.2~~~glnA~~~Glutamine synthetase~~~
MSAEHVLTMLNEHEVKFVDLRFTDTKGKEQHVTIPAHQVNAEFFEEGKMFDGSSIGGWKGINESDMVLMPDASTAVIDPF
FADSTLIIRCDILEPGTLQGYDRDPRSIAKRAEDYLRATGIADTVLFGPEPEFFLFDDIRFGASISGSHVAIDDIEGAWN
SSTKYEGGNKGHRPGVKGGYFPVPPVDSAQDIRSEMCLVMEQMGLVVEAHHHEVATAGQNEVATRFNTMTKKADEIQIYK
YVVHNVAHRFGKTATFMPKPMFGDNGSGMHCHMSLAKNGTNLFSGDKYAGLSEQALYYIGGVIKHAKAINALANPTTNSY
KRLVPGYEAPVMLAYSARNRSASIRIPVVASPKARRIEVRFPDPAANPYLCFAALLMAGLDGIKNKIHPGEAMDKNLYDL
PPEEAKEIPQVAGSLEEALNALDLDREFLKAGGVFTDEAIDAYIALRREEDDRVRMTPHPVEFELYYSV
>P77961 6.3.1.2~~~glnA~~~Glutamine synthetase~~~COG0174
MARTPQEVLKWIQDENIKIIDLKFIDTPGIWQHCSFYYDQLDENSFTEGIPFDGSSIRGWKAINESDMCMVPDPNTATID
PFCKEPTLSMICSIKEPRTGEWYNRDPRTIAAKAVEYLRGTGIADTVYFGPEAEFFLFDDIRFGQTENSSYYFADSVEGR
WNTGREEEGGNLGYKPGYKQGYFPVAPTDTAQDIRTEMLLTMAGLCVPIEKHHHEVASGGQNELGIKFDKLVNSADNLMI
YKYVIKNVAKKYGKTVTFMPKPIFNDNGSGMHVHQSLWKDGQPLFAGDKYAGFSQMGLWYIGGILKHAPALLAFTNPTTN
SYKRLVPGFEAPVNLAYSQGNRSASVRIPLSGGNPKAKRLEFRCPDATSNPYLAFAAMLCAGIDGIKNQIDPGEPLDVDI
YDLSPEELAKIPSTPGSLEAALEALEKDHEFLTGTGVFSPDFVESWIEYKLDNEVNPMRLRPHPYEFSLYYDC
>P04772 6.3.1.2~~~glnII~~~Glutamine synthetase~~~COG0174
MTKYKLEYIWLDGYTPTPNLRGKTQIKEFASFPTLEQLPLWGFDGSSTQQAEGHSSDCVLKPVAVFPDAARTNGVLVMCE
VMMPDGKTPHASNKRATILDDAGAWFGFEQEYFFYKDGRPLGFPTSGYPAPQGPYYTGVGFSNVGDVARKIVEEHLDLCL
AAGINHEGINAEVAKGQWEFQIFGKGSKKAADEMWMARYLMLRLTEKYGIDIEFHCKPLGDTDWNGSGMHANFSTEYMRT
VGGKEYFEALMAAFDKNLMDHIAVYGPDNDKRLTGKHETAPWNKFSYGVADRGASIRVPHSFVNNGYKGYLEDRRPNSQG
DPYQIASQILKTISSVPTEKKAVA
>P20805 6.3.1.2~~~glnII~~~Glutamine synthetase~~~
MSYQAEYIWIDGTEPEPLMRSKTRIIKDGKEPEIWGFDGSSTNQAPGSNSDCVLRPVFETPDPIRGGDNRLVLCEVQLTD
FTPPTNTRAAALGVAERYADMSPMFGIEQEYTFFKDGRPYGWPEVGYPAPQGPYYCGVGGSKMPGRQIVERHTQACLDAG
LAIEGTNAEVMMGQWEFQIGVLPAPAIGDQIWLGRWLLHRIAEDYGVEVSFAAKPIPGDWNGAGAHTNFSTKQTMEGWDA
IVTCCEALGTRVTEHVTHYGKGIEDRLTGKHETAPWNKYSWGASDRGASVRIPWAVEKAKKGWLEDRRPNANMDPYLVTA
LMIDTCCSALAGDKPTLFVPSQTTPAPAEASV
>Q02154 6.3.1.2~~~glnII~~~Glutamine synthetase~~~
MTKFKLEYIWLDGYTPVPNLRGKTQIKEFDEFPTLEQLPLWGFDGSSTMQAEGSSDCVLKPVAIYPDPARTNGALVMCEV
MMPDGHAHASNARATILDDEDAWFGFEQEYFFYQNGRPLGFPEQGYPAPQPYYTGVGYSNVGDVAREIVEEHLDLCLAAG
INHEGINAEVAKGQWEFQIFGKGSKKAADQIWMARYLLQRLTEKYGIDIEYHCKPLGDTDWNGSGMHCNFSTKYLREVGG
KEYFEALMASSDKNLMDHIAVYGPDNDKRLTGKHETAPWNKFSYGVADRGASIRVPHSFIKNDYKGYLEDRRPNSQGDPY
QIVRRF
>Q9RDS6 6.3.1.-~~~glnA2~~~Gamma-glutamylpolyamine synthetase GlnA2~~~COG0174
MDKQQEFVIRTLEERDIRFVRLWFTDVLGFLKSVAVAPAELEQAFDEGIGFDGSAIEGFARVYESDMIAKPDPSTFQVLP
WRAEAPGTARMFCDILMPDGSPSFADPRYVLKRALARTSDLGFTFYTHPEIEFFLLKDKPVDGSVPTPADNSGYFDHTPQ
NIGMDFRRQAITMLESMGISVEFSHHEGAPGQQEIDLRYADALSTADNVMTFRLVMKQVALEQGLQATFMPKPFSEYPGS
GMHTHLSLFEGDRNAFYESGAEYQLSKVGRSFIAGLLRHAAEISAVTNQWVNSYKRIWGGTERTAGAGGEAPSYICWGHN
NRSALVRVPMYKPGKTGSARVEVRSIDSGANPYLTYAVLLAAGLKGIEEGYELPPGAEDDVWALSDAERRALGIEPLPQN
LGEALALMERSDLVAETLGEHVFDFFLRNKRQEWEEYRSQVTAFELRKSLPVL
>P22878 6.3.1.2~~~glnB~~~Glutamine synthetase~~~
MSIKAEYIWIDGTQPTAKLRSKTKILSDGSRLPRWGFDGSSTNQAEGHASDLVLEPVFSCPDPIRGGDHLLVLCEVLHTD
LTPHPSNTRALLRPVAERFAGQEPIFGIEQEYTFLKGDRPLGFPEGGGYPAPQADYYCGVGADAIFGREIVEKHLDLCLA
AGLGLSGINAEVMPGQWEFQVGALPPLEVSDHMWVARWLLHRVAEEFGVTASLDAKPAKGDWNGAGAHTNFSTRAMREGY
DPIITACEALGQDDKPLEHVRQYGTGIEDRLTGAHETAPWDAYSYGASDRGASVRIPWQVEVEKKGYIEDRRPNANVDPY
VVTRLMVDTCCTELARREQI
>P15623 6.3.1.2~~~glnA~~~Glutamine synthetase~~~
MSKMRFFALQELSNRKPLEITTPSNKLSDYYASHVFDRKKMQEYLPKEAYKAVVDATEKGTPISREMADLIANGMKSWAK
SLNVTHYTHWFQPLTDGTAEKHDGFIEFGEDGEVIERFSGKLLIQQEPDASSFPNGGIRNTFEARGYTAWDVSSPAFVVD
TTLCIPTIFISYTGEALDYKTPLLKALAAVDKAATEVCQLFDKNITRVFTNLGWEQEYFLVDTSLYNARPDLRLTGRTLM
GHSSAKDQQLEDHYFGSIPPRVTAFMKELEIECHKLGIPVKTRHNEVAPNQFELAPIFENCNLANDHNQLVMDLMKRIAR
KHHFAVLFHEKPYNGVNGSGKHNNWSLCTDTGINLFAPGKNPKGNMLFLTFLVNVLMMVHKNQDLLRASIMSAGNSHRLG
ANEAPPAILSIFLGSQLSATLDEIVRQVTNSKMTPEEKTTLKLGIGRIPEILLDTTDRNRTSPFAFTGNRFEFRAAGSSA
NCAAAMIAINAAMANQLNEFKASVDKLMEEGIGKDEAIFRILKENIIASEPIRFEGDGYSEEWKQEAARRGLTNICHVPE
ALMHYTDNQSRAVLIGERIFNETELACRLEVELEKYTMKVQIESRVLGDLAINHIVPIAVSYQNRLLENLCRMKEIFSEE
EYEVMSADRKELIKEISHRVSAIKVLVRDMTEARKVANHKENFKEKAFAYEETVRPYLESIRDHIDHLEMEIDDEIWPLP
KYRELLFTK
>P31592 6.3.1.2~~~glnT~~~Glutamine synthetase~~~
MTLDLAAFARDKSIKYFMISYTDLFGGQRAKLVPAEAIADMQKDGAGFAGFATWLDLTPAHPDLFAVPDASSVIQLPWKK
DVAWVAADCVMDDRPVEQAPRVVLKRLVAEAAKEGLRVKTGVEPEFFLISADGSVISDQFDTAEKPCYDQQAVMRRYDVI
AEICDYMLELGWKPYQNDHEDANGQFEMNWEYDDVLKTADKHSFFKFMVKSVAEKHGLRATFMPKPFKGLTGNGCHAHIS
VWDVDGRVNAFADKEMAFGLSAQGKTFLGGIMKHAPALAAITNPTVNSYKRINAPRTTSGATWSPNTVTWTGNNRTHMVR
VPGPGRFELRLPDGAVNPYLLQAIIIAAGLEGIRSQADPGQHYDIDMYAEGHLVKDAPRLPLNLLDALRAFDADEGLKAA
IGAEFSSAYLKLKHLEWNAYCSHFTQWERDSTLDI
>O87393 6.3.1.2~~~glnT~~~Glutamine synthetase~~~COG0174
MTLDLSTFAREKGVKYFMISYTDLFGGQRAKLVPAEAIADMQKGGAGFAGFATWFDLTPAHPDLFALPDASAVIQLPWKK
DVAWVAADCIMDDAPVEQAPRVVLKKLVAEAAQEGLRVKTGVEPEFFLISPDGSKISDTFDTAEKPCYDQQAIMRRYDVI
AEICDYMLELGWKPYQNDHEDANGQFEMNWEYDDALRTADKHSFFKFMVKSIAEKHGLRATFMPKPFKGLTGNGCHCHIS
VWDLAGEVNAFADNKAEFGLSAEGRHFLGGIMKHASALAAVTNPTVNSYKRINAPRTISGATWAPNSVTWTGNNRTHMVR
VPGPGRFELRLPDGAVNPYLLQAIIIAAGLSGVRSKADPGRHYDIDMYKDGHKVTDAPKLPLNLLDALREYNRDEELQEA
LGREFSAAYLKLKQGEWNTYCSQFTEWEHQTTLDV
>Q9KZC7 6.3.1.-~~~glnA3~~~Gamma-glutamylpolyamine synthetase GlnA3~~~COG0174
MSESDPVPGGRPGEVERATALSGELTGQGVHGVVLAYVDTAGIARVKTVPTAKLAAAAAWGVGMSPVFDTFLADDSIVGT
DVLGSPDGDLRLYPDLDRLTMLAAQPGWAWAPVDRITQEGAPHPACGRTVLRRIVAGAAERHGITFRAAVEVEWVVGRGD
AGGDAFVPAVSGPAYGAARQVELSDCAADLLAALAAQGVDVEQFHPEYAAGQFEVSVGALGPVAAADHSVLVRQTIRAVS
ARHGLRVSFAPAVLGQGVGNGGHLHLSAWRDGTNLHAGGTARCGMTAEAESFVAGVLGHLPALTALTAPSPASRLRLRPS
QWAGVFTAWGRETREAALRIVTGTAGIRDRAANLEVKPVDLAANPYLALASVIAAGLDGLASSAPLPEEITGDPARLDPA
AAAARGVRRLPVTLTESVAAFRTDGVLREALGPVLADAVIAVRLGEAGSVEGLDDDGVAAAYRWKY
>O88070 6.3.1.-~~~glnA4~~~Gamma-glutamylethanolamide synthetase GlnA4~~~COG0174
MSPGQKEALPVADRTPPLGVEELHALVAAGDIDTVVLAFPDMQGRLQGKRFAARFFLDEVLEHGTEGCNYLLAVDADMNT
VDGYAMSSWDRGYGDFAMRADPATLRRLPWNEGTAMAVADLAWEDGSPVLAAPRQILRRQLERLAGHGYTAQVGTELEFI
VFRDTYEHAWDANYRGLTPANQYNVDYSVLGTGRVEPLLRRIRNEMAGAGLTVESAKGECNPGQHEIAFRYDEALVTCDQ
HAVYKTGAKEIAAQEGMSLTFMAKYNELEGNSCHIHLSLADADGRNAMAEGGGMSDVMRHFLAGQLVALREFSLLYAPHI
NSYKRFQPGSFAPTAVAWGHDNRTCALRVVGHGRSLRFENRLPGGDVNPYLAVAGLVAAGLHGIEQRLELPEPCPGNAYT
ADFAHVPTTLREAAELWENSTLAKAAFGDEVVAHYRNMARVELDAFDAAVTDWELRRSFERM
>Q5SIP0 6.3.1.2~~~~~~Glutamine synthetase~~~COG0174
MGYTKAEILKALKGENVKFLRLQITDILGVVKNVEVPESQFEKALDGEIMFDGSSIEGFTRIEESDMLLRPDYNTFVILP
DLVEDPKRGRVARLICDVYYPDGRPFEGDPRYVLKRQIERLKKLGFDNLYAGPEPEFFLFLRTPEGLPTTETHDRAGYFD
LAPIDKGEEARRDMVNALVAMGFEIEAAHHEVAPGQHEIDFKYADALTTADNIATFKWVVKRIALNHGLHATFLPKPIRG
INGSGMHTHLSLFKDGENAFYDPNAEYQLSQTALHFIAGLLEHAAGMVAVTNPLVNSYKRLTPGYEAPTNIAWSASNRSA
MIRIPARRGVGTRAELRMPDPSCNPYLALAVMAAAGADGIERKLLPPPPIQRNIYQMTVRERRKHKIRELPGTLREALEA
LRKDPVIREALGEHVYTHFLQAKQMEWDDYRVTVHQWELDRYLATY
>O66513 ~~~glnB~~~Nitrogen regulatory protein P-II~~~COG0347
MKKIEAIIKPFKLDEVKDALVEIGIGGMTVTEVKGFGQQKGHTEIYRGTEYVIDFLPKVKIEVVVRDEDVEKVVETIVKT
AQTGRVGDGKIFIIPVEDVIRIRTGERGEQAI
>P0A9Z1 ~~~glnB~~~Nitrogen regulatory protein P-II 1~~~COG0347
MKKIDAIIKPFKLDDVREALAEVGITGMTVTEVKGFGRQKGHTELYRGAEYMVDFLPKVKIEIVVPDDIVDTCVDTIIRT
AQTGKIGDGKIFVFDVARVIRIRTGEEDDAAI
>P11671 ~~~glnB~~~Nitrogen regulatory protein P-II~~~COG0347
MKKIDAIIKPFKLDDVREALAEVGITGMTVTEVKGFGRQKGHTELYRGAEYMVDFLPKVKIEIVVTDDIVDTCVDTIIRT
AQTGKIGDGKIFVFDVARVIRIRTGEEDDAAI
>P9WN31 ~~~glnB~~~Nitrogen regulatory protein P-II~~~COG0347
MKLITAIVKPFTLDDVKTSLEDAGVLGMTVSEIQGYGRQKGHTEVYRGAEYSVDFVPKVRIEVVVDDSIVDKVVDSIVRA
ARTGKIGDGKVWVSPVDTIVRVRTGERGHDAL
>P0A3F4 ~~~glnB~~~Nitrogen regulatory protein P-II~~~COG0347
MKKIEAIIRPFKLDEVKIALVNAGIVGMTVSEVRGFGRQKGQTERYRGSEYTVEFLQKLKLEIVVEDAQVDTVIDKIVAA
ARTGEIGDGKIFVSPVDQTIRIRTGEKNADAI
>P0A3F5 ~~~glnB~~~Nitrogen regulatory protein P-II~~~COG0347
MKKIEAIIRPFKLDEVKIALVNAGIVGMTVSEVRGFGRQKGQTERYRGSEYTVEFLQKLKLEIVVEDAQVDTVIDKIVAA
ARTGEIGDGKIFVSPVDQTIRIRTGEKNADAI
>Q55247 ~~~glnB~~~Nitrogen regulatory protein P-II~~~COG0347
MKKVEAIIRPFKLDEVKIALVNAGIVGMTVSEVRGFGRQKGQTERYRGSEYTVEFLQKLKIEIVVDEGQVDMVVDKLVSA
ARTGEIGDGKIFISPVDSVVRIRTGEKDTEAI
>Q8RQD1 ~~~glnD~~~Bifunctional uridylyltransferase/uridylyl-removing enzyme~~~
MLSTRAASADASDAKDAGTANIPNKRAILSRRKLAEDLETLVAEHGTGDKLRPALIARLRGALNDGRAEVRARFEAKGSG
EDCVRQNCYLADGVVRSLADLTVTHIFPTPNPTSGEVFDIVATGGYGRGELAPFSDIDLLFLLPYKRTPRVEQVVEYMLY
ILWDLGLKVGHAVRSVDDCIRQSKADVTIRTAILESRYLWGPRKLFHRLRRRFDREVVAGTGPEFVEAKLAERDNRHLKL
GDSAYVLEPNLKDGKGGLRDLQTLFWIAKYLYRVEDVDDLVGKKVLLPEEAHGFAKAQNFLWTARCHLHYLTGRMEDRMT
FDVQTSIGNRMGYTDHAGTKGVERFMKHYFLVAKDVGDLTRIFCAALEAESKRPPKFNILRLAALARRKDVDGFVVDGER
LNVRSDRQFKDEPLDMIRLFHTAQQNDIDIHPNALRAITRSLSVVGPKLRADPEANRLFLEILTGRKDPEITLRRMNEAG
VLARFIPDFGRVVAQMQYDMYHVYTVDEHTLFALGILHKIEMGELTDELPLSSEVIHKVVSRRALYVAVLLHDIAKGRGG
DHSILGARVAEKLCPRLGLTAEETETVAWLVRWHLAMSYTAFKRDLEDDKTVRDFVSLVQSPERLRLLLVLTVADIRAVG
PQRWNNWKATLLRELYNRSEEVMSGGLSVEGRGRRIQAAQAALRDELSDFDAADFERHLALGYPAYWLAFDAETLGRQAR
LVRGRLRDERPLTVNTRIDRGRAITEVTIFATDHHGLFSRLAGALAAAGADIVDARIFTMTNGMALDVFTVQDAAGGGAF
ESGDKLAKLSVMIEKVLSGQLKPLHDLTKRKAPHASRTRVFHVPPRVLIDNNASTTHTVIEVNGRDRPGLLYDLTRALTN
LTLQISSAKISTYGEKAIDVFYVKDVFGLKVTHENKLAQIRERLLHALADPSA
>P36223 ~~~glnD~~~Bifunctional uridylyltransferase/uridylyl-removing enzyme~~~
MPQVDPDLFDPGQFQAELALKSSPIPAYKKALRCAREVLDARFQEGRDIRRLIEDRAWFVDQILALAWNRFDWSEDADIA
LIAVGGYGRGELHPYSDIDLLILMDGADHEVFREPIEGFLTLLWDIGLEVGQSVRSLAECAEEAQADLTVITNLMESRTI
AGPEHLRQRMQEVTSAQRMWPSRAFFLAKRDEQKTRHARYNDTEYNLEPNVKGSPGGLRDIQTLLWIARRQFGTINLHAM
VGQGFLLESEYTLLASSQEFLWKVRYALHMLAGRAEDRLLFDLQRQIAGLLGYEDSDAKLAVERFMQKYYRVVLGIAELT
ELVFQHFEEVILPGDAAGRVEPLNERFQVRDGYLEVTHAGVFQETPSALLEIFVLLARRPEIRGVRADTIRLLRDHRYLI
DDAFRRDPHNTGLFIELFKSRQGIHRNLRRMNRYGILGRYLPEFGHIVGQMQHDLFHIYTVDAHTLNLIKNLRKLFWPEL
AEKYPLASKLIEKLPKPELIYLAGLYHDIGKGRGGDHSELGAADALAFCQRHDLPAMDTQLIVWLVRNHLLMSTTAQRKD
LSDPQVIFDFAQKVRDQTYLDYLYVLTVADINATNPTLWNSWRASLLRQLYTETKHALRRGLEQPVGREEQIRQTQKAAL
DILVRSGTDPDDAEHLWTQLGDDYFLRHTSSDIAWHTEAILQHPSSGGPLVLIKETTQREFEGATQIFIYAPDQHDFFAV
TVAAMDQLNLSIHDARVITSTSQFTLDTYIVLDADGGSIGNNPARIQEIRQGLVEALRNPADYPTIIQRRVPRQLKHFAF
APQVTIQNDALRPVTILEIIAPDRPGLLARIGKIFLDFDLSLQNAKIATLGERVEDVFFVTDAHNQPLSDPELCARLQLA
IAEQLADGDSYIQPSRISI
>P27249 ~~~glnD~~~Bifunctional uridylyltransferase/uridylyl-removing enzyme~~~COG2844
MNTLPEQYANTALPTLPGQPQNPCVWPRDELTVGGIKAHIDTFQRWLGDAFDNGISAEQLIEARTEFIDQLLQRLWIEAG
FSQIADLALVAVGGYGRGELHPLSDVDLLILSRKKLPDDQAQKVGELLTLLWDVKLEVGHSVRTLEECMLEGLSDLTVAT
NLIESRLLIGDVALFLELQKHIFSEGFWPSDKFYAAKVEEQNQRHQRYHGTSYNLEPDIKSSPGGLRDIHTLQWVARRHF
GATSLDEMVGFGFLTSAERAELNECLHILWRIRFALHLVVSRYDNRLLFDRQLSVAQRLNYSGEGNEPVERMMKDYFRVT
RRVSELNQMLLQLFDEAILALPADEKPRPIDDEFQLRGTLIDLRDETLFMRQPEAILRMFYTMVHNSAITGIYSTTLRQL
RHARRHLQQPLCNIPEARKLFLSILRHPGAVRRGLLPMHRHSVLGAYMPQWSHIVGQMQFDLFHAYTVDEHTIRVMLKLE
SFASEETRQRHPLCVDVWPRLPSTELIFIAALFHDIAKGRGGDHSILGAQDVVHFAELHGLNSRETQLVAWLVRQHLLMS
VTAQRRDIQDPEVIKQFAEEVQTENRLRYLVCLTVADICATNETLWNSWKQSLLRELYFATEKQLRRGMQNTPDMRERVR
HHQLQALALLRMDNIDEEALHQIWSRCRANYFVRHSPNQLAWHARHLLQHDLSKPLVLLSPQATRGGTEIFIWSPDRPYL
FAAVCAELDRRNLSVHDAQIFTTRDGMAMDTFIVLEPDGNPLSADRHEVIRFGLEQVLTQSSWQPPQPRRQPAKLRHFTV
ETEVTFLPTHTDRKSFLELIALDQPGLLARVGKIFADLGISLHGARITTIGERVEDLFIIATADRRALNNELQQEVHQRL
TEALNPNDKG
>P9WN29 ~~~glnD~~~Bifunctional uridylyltransferase/uridylyl-removing enzyme~~~COG2844
MEAESPCAASDLAVARRELLSGNHRELDPVGLRQTWLDLHESWLIDKADEIGIADASGFAIVGVGGLGRRELLPYSDLDV
LLLHDGKPADILRPVADRLWYPLWDANIRLDHSVRTVSEALTIANSDLMAALGMLEARHIAGDQQLSFALIDGVRRQWRN
GIRSRMGELVEMTYARWRRCGRIAQRAEPDLKLGRGGLRDVQLLDALALAQLIDRHGIGHTDLPAGSLDGAYRTLLDVRT
ELHRVSGRGRDHLLAQFADEISAALGFGDRFDLARTLSSAGRTIGYHAEAGLRTAANALPRRGISALVRRPKRRPLDEGV
VEYAGEIVLARDAEPEHDPGLVLRVAAASADTGLPIGAATLSRLAASVPDLPTPWPQEALDDLLVVLSAGPTTVATIEAL
DRTGLWGRLLPEWEPIRDLPPRDVAHKWTVDRHVVETAVHAAPLATRVARPDLLALGALLHDIGKGRGTDHSVLGAELVI
PVCTRLGLSPPDVRTLSKLVRHHLLLPITATRRDLNDPKTIEAVSEALGGDPQLLEVLHALSEADSKATGPGVWSDWKAS
LVDDLVRRCRMVMAGESLPQAEPTAPHYLSLAADHGVHVEISPRDGERIDAVIVAPDERGLVSKAAAVLALNSLRVHSAS
VNVHQGVAITEFVVSPLFGSPPAAELVRQQFVGALNGDVDVLGMLQKRDSDAASLVSARAGDVQAGVPVTRTAAPPRILW
LDTAAPAKLILEVRAMDRAGLLALLAGALEGAGAGIVWAKVNTFGSTAADVFCVTVPAELDARAAVEQHLLEVLGASVDV
VVDEPVGD
>Q9RAE4 ~~~glnD~~~Bifunctional uridylyltransferase/uridylyl-removing enzyme~~~
MRDLDFTNILDVELLQKQCDAVAEANRNRPDVLRADLLAVLKKASTEGRQKAREALMADGGGLNCAYRISWLQDQITTVL
YNFATAHIFPQQKDKFAVTAVGGYGRDTLAPGSDIDLLFLFLPRPAEETHKAVEFMLYVLWDMGFKVGHATRTVEECIAL
SKSDMTIRTAILEMRYICGLQRLETELETRFDKEIVTGTGPEFIAAKLAERDERHRKAGDTRYLVEPNVKEGKGGLRDLH
TLFWISKYYYHVRDQAELVKLGVLSKHEYRLLEKADDFLWAVRCHMHFLTGKAEERLSFDIQREIAEAFGYHTRPGLSAV
ERFMKHYFLVAKDVGDLTRILCAALEDQQAKSIPGLTGVISRFTHRNRKIAGSVEFVEDRGRIALADPEVFKRDPVNIIR
LFHVADINGLEFHPDALKRVTRSLALIDNALRENDEANRLFMSILTSKRDPALILRRMNEAGVLGRFIPEFGKIVAMMQF
NMYHHYTVDEHLIRTVDILSEIDKGRAEDLHPLANKLMPGIEDREALYVAVLLHDIAKGRQEDHSIAGARVARKLCVRFG
LSQKQTEIVVWLIEEHLTMSMVAQTRDLTDRKTITDFADRVQSLDRLKMLLILTICDIRAVGPGVWNGWKGQLLRTLYYE
TELLLAGGFSEVSRKERANAAAEALHSALADWSQKDRNTYTKLHYQPYLLSVPLEDQIRHAHFIRQADKAGQALATMVRT
DSFHAITEITVLSPDHPRLLAVIAGACAAAGANIVDAQIFTTSDGRALDTIHVSREFTDDADELRRAATIGRMIEDVLSG
RKRLPEVIATRARNRKKSKAFVIPPSVNITNSLSNKFTVIEVECLDRPGLLSEITAVLSDLSLDIQSARITTFGEKVIDT
FYVTDLVGQKISGDSKRANITARMKAVMAEEEDELRERMPSGIIAPAATARTPPASEKKAGSPI
>P30870 ~~~glnE~~~Bifunctional glutamine synthetase adenylyltransferase/adenylyl-removing enzyme~~~COG1391
MKPLSSPLQQYWQTVVERLPEPLAEESLSAQAKSVLTFSDFVQDSVIAHPEWLTELESQPPQADEWQHYAAWLQEALCNV
SDEAGLMRELRLFRRRIMVRIAWAQTLALVTEESILQQLSYLAETLIVAARDWLYDACCREWGTPCNAQGEAQPLLILGM
GKLGGGELNFSSDIDLIFAWPEHGCTQGGRRELDNAQFFTRMGQRLIKVLDQPTQDGFVYRVDMRLRPFGESGPLVLSFA
ALEDYYQEQGRDWERYAMVKARIMGDSEGVYANELRAMLRPFVFRRYIDFSVIQSLRNMKGMIAREVRRRGLTDNIKLGA
GGIREIEFIVQVFQLIRGGREPSLQSRSLLPTLSAIAELHLLSENDAEQLRVAYLFLRRLENLLQSINDEQTQTLPSDEL
NRARLAWAMDFADWPQLTGALTAHMTNVRRVFNELIGDDESETQEESLSEQWRELWQDALQEDDTTPVLAHLSEDDRKQV
LTLIADFRKELDKRTIGPRGRQVLDHLMPHLLSDVCAREDAAVTLSRITALLVGIVTRTTYLELLSEFPAALKHLISLCA
ASPMIASQLARYPLLLDELLDPNTLYQPTATDAYRDELRQYLLRVPEDDEEQQLEALRQFKQAQLLRIAAADIAGTLPVM
KVSDHLTWLAEAMIDAVVQQAWVQMVARYGKPNHLNEREGRGFAVVGYGKLGGWELGYSSDLDLIFLHDCPMDAMTDGER
EIDGRQFYLRLAQRIMHLFSTRTSSGILYEVDARLRPSGAAGMLVTSAEAFADYQKNEAWTWEHQALVRARVVYGDPQLT
AHFDAVRREIMTLPREGKTLQTEVREMREKMRAHLGNKHRDRFDIKADEGGITDIEFITQYLVLRYAHEKPKLTRWSDNV
RILELLAQNDIMEEQEAMALTRAYTTLRDELHHLALQELPGHVSEDCFTAERELVRASWQKWLVEE
>P9WN27 ~~~glnE~~~Bifunctional glutamine synthetase adenylyltransferase/adenylyl-removing enzyme~~~COG1391
MVVTKLATQRPKLPSVGRLGLVDPPAGERLAQLGWDRHEDQAHVDLLWSLSRAPDADAALRALIRLSENPDTGWDELNAA
LLRERSLRGRLFSVLGSSLALGDHLVAHPQSWKLLRGKVTLPSHDQLQRSFVECVEESEGMPGSLVHRLRTQYRDYVLML
AALDLAATVEDEPVLPFTVVAARLADAADAALAAALRVAEASVCGEHPPPRLAVIAMGKCGARELNYVSDVDVIFVAERS
DPRNARVASEMMRVASAAFFEVDAALRPEGRNGELVRTLESHIAYYQRWAKTWEFQALLKARPVVGDAELGERYLTALMP
MVWRACEREDFVVEVQAMRRRVEQLVPADVRGRELKLGSGGLRDVEFAVQLLQLVHARSDESLRVASTVDALAALGEGGY
IGREDAANMTASYEFLRLLEHRLQLQRLKRTHLLPDPEDEEAVRWLARAAHIRPDGRNDAAGVLREELKKQNVRVSKLHT
KLFYQPLLESIGPTGLEIAHGMTLEAAGRRLAALGYEGPQTALKHMSALVNQSGRRGRVQSVLLPRLLDWMSYAPDPDGG
LLAYRRLSEALATESWYLATLRDKPAVAKRLMHVLGTSAYVPDLLMRAPRVIQQYEDGPAGPKLLETEPAAVARALIASA
SRYPDPERAIAGARTLRRRELARIGSADLLGLLEVTEVCRALTSVWVAVLQAALDVMIRASLPDDDRAPAAIAVIGMGRL
GGAELGYGSDADVMFVCEPATGVDDARAVKWSTSIAERVRALLGTPSVDPPLELDANLRPEGRNGPLVRTLGSYAAYYEQ
WAQPWEIQALLRAHAVAGDAELGQRFLRMVDKTRYPPDGVSADSVREIRRIKARIESERLPRGADPNTHTKLGRGGLADI
EWTVQLLQLQHAHQVPALHNTSTLQSLDVIAAADLVPAADVELLRQAWLTATRARNALVLVRGKPTDQLPGPGRQLNAVA
VAAGWRNDDGGEFLDNYLRVTRRAKAVVRKVFGS
>O34563 ~~~glnH~~~ABC transporter glutamine-binding protein GlnH~~~COG0834
MKKIFSLALISLFAVILLAACGSKGSNGEASKESKKDTLAAIKDNDKIVFGVKTDTRLFGLKNPSSGEIEGFDIDIAKQI
AKDILGDEKKAQFKEVTSKTRIPMLQNGDIDAIVATMTITEERKKEVDFSDVYFEAGQSLLVKKGSKIKSVENLGKGSKV
LAVKGSTSSQNIREKAPEASVLEFENYAEAFTALKSGQGDALTTDNAILYGMADENKNYQLTGKPFTDEPYGIAVKKGQS
ALAKEINASLKKMKSDGRYDEIYKKWIKEDPAE
>P0AEQ3 ~~~glnH~~~Glutamine-binding periplasmic protein~~~COG0834
MKSVLKVSLAALTLAFAVSSHAADKKLVVATDTAFVPFEFKQGDKYVGFDVDLWAAIAKELKLDYELKPMDFSGIIPALQ
TKNVDLALAGITITDERKKAIDFSDGYYKSGLLVMVKANNNDVKSVKDLDGKVVAVKSGTGSVDYAKANIKTKDLRQFPN
IDNAYMELGTNRADAVLHDTPNILYFIKTAGNGQFKAVGDSLEAQQYGIAFPKGSDELRDKVNGALKTLRENGTYNEIYK
KWFGTEPK
>P40758 2.7.13.3~~~glnK~~~Sensor histidine kinase GlnK~~~COG4191
MLITVPLAGELKFYPLNEEFRVSFGAPVFFFFLSLLRHVPAVLPGFLTGAAVFIFRVFLELWGGGHNGLTPILYDQASGF
FFYMTYACLFSILKANRFRERPIMLGFIGFMIEVVSDCVELTVQFLIFHTVVTPEKITDIAVIAISHTFIVMSFYSVLKL
YETQSREKQTRQQHEHMLMIVSNLYEETVHLKKTLKTTEKVTNDSYQLYREMKGKDVQLSGRILRLAGEIHEVKKDNQRI
FAGLSKLISNESLRDYMRASDLLQLVIRMNEKYAEALGKQIDFYCSIEGEHDEYHVFIVLSIINNLTANAVEAMDEEGMV
SLRLRKPNESMVEFQVEDNGPGISEKIGDIVFDPGFTSKYDEFGTPSTGIGLSYVKEIVTELEGDITFDNQQRGVVFAIR
LPVRHLIQKG
>P0AC55 ~~~glnK~~~Nitrogen regulatory protein GlnK~~~COG0347
MKLVTVIIKPFKLEDVREALSSIGIQGLTVTEVKGFGRQKGHAELYRGAEYSVNFLPKVKIDVAIADDQLDEVIDIVSKA
AYTGKIGDGKIFVAELQRVIRIRTGEADEAAL
>O34671 ~~~glnM~~~Probable glutamine ABC transporter permease protein GlnM~~~COG0765
MNVSILFDNFSMYMDGFYHTLLASVIALAGSFVLGVAVAVMRITVFKPLQWLGTAYVEFIRNIPLLLITFVFYFGLPNAG
LRLDGFQAGTVALTIYTSAFIAEAIRAGIQSVSKGQMEAARSSGFTYSQAMLHIILPQAIKIVIPPLGNQFLNLVKNSSI
LGVVAGLDLMYQADLVSSSTLVVFDVYIFVALFYLVLTIPLSIGVNYLEKRLEKSY
>O34606 ~~~glnP~~~Probable glutamine ABC transporter permease protein GlnP~~~COG0765
MDFIGAYSQEHLAFLWDGFLVTLYVAFISIILSFFFGLIAGTLRYAKVPVLSQLIAVLVETIRNLPLLLIIFFTFFALPE
IGIKLEITAAAITALTIFESAMLSEIIRSGLKSIDKGQIEAARSSGLSYTQTLFFIVMPQALRRMVPPIVSQFISLLKDT
SLAVVIALPELIHNAQIINGQSADGSYFFPIFLLAALMYFAVNYSLSLAARRLEVRQT
>P0AEQ6 ~~~glnP~~~Glutamine transport system permease protein GlnP~~~COG0765
MQFDWSAIWPAIPLLIEGAKMTLWISVLGLAGGLVIGLLAGFARTFGGWIANHVALVFIEVIRGTPIVVQVMFIYFALPM
AFNDLRIDPFTAAVVTIMINSGAYIAEITRGAVLSIHKGFREAGLALGLSRWETIRYVILPLALRRMLPPLGNQWIISIK
DTSLFIVIGVAELTRQGQEIIAGNFRALEIWSAVAVFYLIITLVLSFILRRLERRMKIL
>O34677 7.4.2.-~~~glnQ~~~Glutamine transport ATP-binding protein GlnQ~~~COG1126
MITFQNVNKHYGDFHVLKQINLQIEKGEVVVIIGPSGSGKSTLLRCINRLESINEGVLTVNGTAINDRKTDINQVRQNIG
MVFQHFHLYPHKTVLQNIMLAPVKVLRQSPEQAKETARYYLEKVGIPDKADAYPSQLSGGQQQRVAIARGLAMKPEVMLF
DEPTSALDPEMIGEVLDVMKTLAKEGMTMVVVTHEMGFAKEVADRIVFIDEGKILEEAVPAEFYANPKEERARLFLSRIL
NH
>P10346 ~~~glnQ~~~Glutamine transport ATP-binding protein GlnQ~~~COG1126
MIEFKNVSKHFGPTQVLHNIDLNIAQGEVVVIIGPSGSGKSTLLRCINKLEEITSGDLIVDGLKVNDPKVDERLIRQEAG
MVFQQFYLFPHLTALENVMFGPLRVRGANKEEAEKLARELLAKVGLAERAHHYPSELSGGQQQRVAIARALAVKPKMMLF
DEPTSALDPELRHEVLKVMQDLAEEGMTMVIVTHEIGFAEKVASRLIFIDKGRIAEDGNPQVLIKNPPSQRLQEFLQHVS
>P27675 ~~~glnQ~~~Glutamine transport ATP-binding protein GlnQ~~~
MIYFHQVNKYYGDFHVLKDINLTIHQGEVVVIIGPSGSGKSTLVRCINRLETISSGELIVDNVKVNDKHIDINQLRRNIG
MVFQHFNLYPHMTVLQNITLAPMKVLRIPEKEAKETAMYYLEKVGIPDKANAYPSELSGGQQQRVAIARGLAMKPKIMLF
DEPTSALDPETIGEVLDVMKQLAKEGMTMVVVTHEMGFAREVADRIVFMDQGRILEEAPPEEFFSNPKEERAKVFLSRIL
NH
>P62173 ~~~glnR~~~HTH-type transcriptional regulator GlnR~~~COG0789
MKEDRRSAPLFPIGIVMDLTQLSARQIRYYEEHNLVSPTRTKGNRRLFSFNDVDKLLEIKDLLDQGLNMAGIKQVLLMKE
NQTEAVKVKEETKEISKTELRKILRDELQHTGRFNRTSLRQGDISRFFH
>P37582 ~~~glnR~~~HTH-type transcriptional regulator GlnR~~~COG0789
MSDNIRRSMPLFPIGIVMQLTELSARQIRYYEENGLIFPARSEGNRRLFSFHDVDKLLEIKHLIEQGVNMAGIKQILAKA
EAEPEQKQNEKTKKPMKHDLSDDELRQLLKNELMQAGRFQRGNTFRQGDMSRFFH
>P75849 3.1.2.6~~~gloC~~~Hydroxyacylglutathione hydrolase GloC~~~COG0491
MNYRIIPVTAFSQNCSLIWCEQTRLAALVDPGGDAEKIKQEVDDSGLTLMQILLTHGHLDHVGAAAELAQHYGVPVFGPE
KEDEFWLQGLPAQSRMFGLEECQPLTPDRWLNEGDTISIGNVTLQVLHCPGHTPGHVVFFDDRAKLLISGDVIFKGGVGR
SDFPRGDHNQLISSIKDKLLPLGDDVIFIPGHGPLSTLGYERLHNPFLQDEMPVW
>P0AC84 3.1.2.6~~~gloB~~~Hydroxyacylglutathione hydrolase GloB~~~COG0491
MNLNSIPAFDDNYIWVLNDEAGRCLIVDPGDAEPVLNAIAANNWQPEAIFLTHHHHDHVGGVKELVEKFPQIVVYGPQET
QDKGTTQVVKDGETAFVLGHEFSVIATPGHTLGHICYFSKPYLFCGDTLFSGGCGRLFEGTASQMYQSLKKLSALPDDTL
VCCAHEYTLSNMKFALSILPHDLSINDYYRKVKELRAKNQITLPVILKNERQINVFLRTEDIDLINVINEETLLQQPEER
FAWLRSKKDRF
>Q9I2T1 3.1.2.6~~~gloB~~~Hydroxyacylglutathione hydrolase~~~
MIQIDALPAFNDNYIWLLQDATSRRCAVVDPGDAKPVEAWLAAHPDWRLSDILVTHHHHDHVGGVAALKELTGARVLGPA
NEKIPARDLALEDGERVEVLGLVFEIFHVPGHTLGHIAYYHPAETPLLFCGDTLFAAGCGRLFEGTPAQMHHSLARLAAL
PANTRVYCTHEYTLSNLRFALAVEPDNAALRERFEEATRLRERDRITLPSEISLELSTNPFLRVSENSVKKKADQRSGQQ
NRTPEEVFAVLRAWKDQF
>Q8ZRM2 3.1.2.6~~~gloB~~~Hydroxyacylglutathione hydrolase~~~
MNLNSIPAFQDNYIWVLTNDEGRCVIVDPGEAAPVLKAIAEHKWMPEAIFLTHHHHDHVGGVKELLQHFPQMTVYGPAET
QDKGATHLVGDGDTIRVLGEKFTLFATPGHTLGHVCYFSRPYLFCGDTLFSGGCGRLFEGTPSQMYQSLMKINSLPDDTL
ICCAHEYTLANIKFALSILPHDSFINEYYRKVKELRVKKQMTLPVILKNERKINLFLRTEDIDLINEINKETILQQPEAR
FAWLRSKKDTF
>P0A9C0 1.1.5.3~~~glpA~~~Anaerobic glycerol-3-phosphate dehydrogenase subunit A~~~COG0578
MKTRDSQSSDVIIIGGGATGAGIARDCALRGLRVILVERHDIATGATGRNHGLLHSGARYAVTDAESARECISENQILKR
IARHCVEPTNGLFITLPEDDLSFQATFIRACEEAGISAEAIDPQQARIIEPAVNPALIGAVKVPDGTVDPFRLTAANMLD
AKEHGAVILTAHEVTGLIREGATVCGVRVRNHLTGETQALHAPVVVNAAGIWGQHIAEYADLRIRMFPAKGSLLIMDHRI
NQHVINRCRKPSDADILVPGDTISLIGTTSLRIDYNEIDDNRVTAEEVDILLREGEKLAPVMAKTRILRAYSGVRPLVAS
DDDPSGRNVSRGIVLLDHAERDGLDGFITITGGKLMTYRLMAEWATDAVCRKLGNTRPCTTADLALPGSQEPAEVTLRKV
ISLPAPLRGSAVYRHGDRTPAWLSEGRLHRSLVCECEAVTAGEVQYAVENLNVNSLLDLRRRTRVGMGTCQGELCACRAA
GLLQRFNVTTSAQSIEQLSTFLNERWKGVQPIAWGDALRESEFTRWVYQGLCGLEKEQKDAL
>P13033 1.1.5.3~~~glpB~~~Anaerobic glycerol-3-phosphate dehydrogenase subunit B~~~COG3075
MRFDTVIMGGGLAGLLCGLQLQKHGLRCAIVTRGQSALHFSSGSLDLLSHLPDGQPVTDIHSGLESLRQQAPAHPYSLLE
PQRVLDLACQAQALIAESGAQLQGSVELAHQRVTPLGTLRSTWLSSPEVPVWPLPAKKICVVGISGLMDFQAHLAAASLR
ELGLAVETAEIELPELDVLRNNATEFRAVNIARFLDNEENWPLLLDALIPVANTCEMILMPACFGLADDKLWRWLNEKLP
CSLMLLPTLPPSVLGIRLQNQLQRQFVRQGGVWMPGDEVKKVTCKNGVVNEIWTRNHADIPLRPRFAVLASGSFFSGGLV
AERNGIREPILGLDVLQTATRGEWYKGDFFAPQPWQQFGVTTDETLRPSQAGQTIENLFAIGSVLGGFDPIAQGCGGGVC
AVSALHAAQQIAQRAGGQQ
>P0A996 ~~~glpC~~~Anaerobic glycerol-3-phosphate dehydrogenase subunit C~~~COG0247
MNDTSFENCIKCTVCTTACPVSRVNPGYPGPKQAGPDGERLRLKDGALYDEALKYCINCKRCEVACPSDVKIGDIIQRAR
AKYDTTRPSLRNFVLSHTDLMGSVSTPFAPIVNTATSLKPVRQLLDAALKIDHRRTLPKYSFGTFRRWYRSVAAQQAQYK
DQVAFFHGCFVNYNHPQLGKDLIKVLNAMGTGVQLLSKEKCCGVPLIANGFTDKARKQAITNVESIREAVGVKGIPVIAT
SSTCTFALRDEYPEVLNVDNKGLRDHIELATRWLWRKLDEGKTLPLKPLPLKVVYHTPCHMEKMGWTLYTLELLRNIPGL
ELTVLDSQCCGIAGTYGFKKENYPTSQAIGAPLFRQIEESGADLVVTDCETCKWQIEMSTSLRCEHPITLLAQALA
>P9WN81 1.1.5.3~~~glpD1~~~Glycerol-3-phosphate dehydrogenase 1~~~COG0578
MLMPHSAALNAARRSADLTALADGGALDVIVIGGGITGVGIALDAATRGLTVALVEKHDLAFGTSRWSSKLVHGGLRYLA
SGNVGIARRSAVERGILMTRNAPHLVHAMPQLVPLLPSMGHTKRALVRAGFLAGDALRVLAGTPAATLPRSRRIPASRVV
EIAPTVRRDGLDGGLLAYDGQLIDDARLVMAVARTAAQHGARILTYVGASNVTGTSVELTDRRTRQSFALSARAVINAAG
VWAGEIDPSLRLRPSRGTHLVFDAKSFANPTAALTIPIPGELNRFVFAMPEQLGRIYLGLTDEDAPGPIPDVPQPSSEEI
TFLLDTVNTALGTAVGTKDVIGAYAGLRPLIDTGGAGVQGRTADVSRDHAVFESPSGVISVVGGKLTEYRYMAEDVLNRA
ITLRHLRAAKCRTRNLPLIGAPANPGPAPGSGAGLPESLVARYGAEAANVAAAATCERPTEPVADGIDVTRAEFEYAVTH
EGALDVDDILDRRTRIGLVPRDRERVVAVAKEFLSR
>P9WN79 1.1.5.3~~~glpD2~~~Glycerol-3-phosphate dehydrogenase 2~~~COG0578
MSNPIQAPDGGQGWPAAALGPAQRAVAWKRLGTEQFDVVVIGGGVVGSGCALDAATRGLKVALVEARDLASGTSSRSSKM
FHGGLRYLEQLEFGLVREALYERELSLTTLAPHLVKPLPFLFPLTKRWWERPYIAAGIFLYDRLGGAKSVPAQRHFTRAG
ALRLSPGLKRSSLIGGIRYYDTVVDDARHTMTVARTAAHYGAVVRCSTQVVALLREGDRVIGVGVRDSENGAVAEVRGHV
VVNATGVWTDEIQALSKQRGRFQVRASKGVHVVVPRDRIVSDVAMILRTEKSVMFVIPWGSHWIIGTTDTDWNLDLAHPA
ATKADIDYILGTVNAVLATPLTHADIDGVYAGLRPLLAGESDDTSKLSREHAVAVPAAGLVAIAGGKYTTYRVMAADAID
AAVQFIPARVAPSITEKVSLLGADGYFALVNQAEHVGALQGLHPYRVRHLLDRYGSLISDVLAMAASDPSLLSPITEAPG
YLKVEAAYAAAAEGALHLEDILARRMRISIEYPHRGVDCAREVAEVVAPVLGWTAADIDREVANYMARVEAEVLSQAQPD
DVSADMLRASAPEARAEILEPVPLD
>A0A0R3K2G2 1.1.99.-~~~lhgO_1~~~Glycerol 3-phosphate dehydrogenase~~~
MFDVAIIGAGVIGCSIARELSKYNLNVALIEKENDVGNVTTKANSAIIHAGYDAKPGTLKGKLNAKGNLMFDELCRELEV
PFKRVGSLVLAFDDDEMKTLGKLYEQGIQNGVPELYILSKEKVLEMDPNISDNIKGALYAKTGGIIGPWEFTIALAENAV
ENGVNIFLSNEVVDIEKKDFGYRIITNKDTYDTKYVVNCAGLYADKINNMVSNNKMEIIPRRGQYYLLDKTVGNLVKYVI
FQCPSKLGKGVLVTPTVHGNLLIGPDAEDLIDKTALNTTSEGLNFIVEVARRSVKTLPLNMAITNFAGLRARTERDDFII
EEAVDAKGFINVAGIESPGLSSAPAISLYVIDILKNIAKKIEKKENFNPYRRAIPKFIELSEDEKNELVKKDKRFGKIIC
RCESITEGEIVSAIHRNVGARTVDAVKRRVRAGMGRCQGGFCSPRVIEILARELGVEMTEIEKDHEGSYILTGPTKSEVQ
>P18158 1.1.5.3~~~glpD~~~Aerobic glycerol-3-phosphate dehydrogenase~~~COG0578
MMNHQFSSLERDRMLTDMTKKTYDLFIIGGGITGAGTALDAASRGMKVALSEMQDFAAGTSSRSTKLVHGGLRYLKQFEV
KMVAEVGKERAIVYENGPHVTTPEWMLLPFHKGGTFGSFTTSIGLRVYDFLAGVKKSERRSMLSAKETLQKEPLVKKDGL
KGGGYYVEYRTDDARLTIEVMKEAVKFGAEPVNYSKVKELLYEKGKAVGVLIEDVLTKKEYKVYAKKIVNATGPWVDQLR
EKDHSKNGKHLQHTKGIHLVFDQSVFPLKQAVYFDTPDGRMVFAIPREGKTYVGTTDTVYKEALEHPRMTTEDRDYVIKS
INYMFPELNITANDIESSWAGLRPLIHEEGKDPSEISRKDEIWTSDSGLITIAGGKLTGYRKMAEHIVDLVRDRLKEEGE
KDFGPCKTKNMPISGGHVGGSKNLMSFVTAKTKEGIAAGLSEKDAKQLAIRYGSNVDRVFDRVEALKDEAAKRNIPVHIL
AEAEYSIEEEMTATPADFFVRRTGRLFFDINWVRTYKDAVIDFMSERFQWDEQAKNKHTENLNKLLHDAVVPLEQ
>P13035 1.1.5.3~~~glpD~~~Aerobic glycerol-3-phosphate dehydrogenase~~~COG0578
METKDLIVIGGGINGAGIAADAAGRGLSVLMLEAQDLACATSSASSKLIHGGLRYLEHYEFRLVSEALAEREVLLKMAPH
IAFPMRFRLPHRPHLRPAWMIRIGLFMYDHLGKRTSLPGSTGLRFGANSVLKPEIKRGFEYSDCWVDDARLVLANAQMVV
RKGGEVLTRTRATSARRENGLWIVEAEDIDTGKKYSWQARGLVNATGPWVKQFFDDGMHLPSPYGIRLIKGSHIVVPRVH
TQKQAYILQNEDKRIVFVIPWMDEFSIIGTTDVEYKGDPKAVKIEESEINYLLNVYNTHFKKQLSRDDIVWTYSGVRPLC
DDESDSPQAITRDYTLDIHDENGKAPLLSVFGGKLTTYRKLAEHALEKLTPYYQGIGPAWTKESVLPGGAIEGDRDDYAA
RLRRRYPFLTESLARHYARTYGSNSELLLGNAGTVSDLGEDFGHEFYEAELKYLVDHEWVRRADDALWRRTKQGMWLNAD
QQSRVSQWLVEYTQQRLSLAS
>Q7A5V7 1.1.5.3~~~glpD~~~Aerobic glycerol-3-phosphate dehydrogenase~~~
MALSTFKREHIKKNLRNDEYDLVIIGGGITGAGIALDASERGMKVALVEMQDFAQGTSSRSTKLVHGGLRYLKQFQIGVV
AETGKERAIVYENGPHVTTPEWMLLPMHKGGTFGKFSTSIGLGMYDRLAGVKKSERKKMLSKKETLAKEPLVKKEGLKGG
GYYVEYRTDDARLTIEVMKRAAEKGAEIINYTKSEHFTYDKNQQVNGVKVIDKLTNENYTIKAKKVVNAAGPWVDDVRSG
DYARNNKKLRLTKGVHVVIDQSKFPLGQAVYFDTEKDGRMIFAIPREGKAYVGTTDTFYDNIKSSPLTTQEDRDYLIDAI
NYMFPSVNVTDEDIESTWAGIRPLIYEEGKDPSEISRKDEIWEGKSGLLTIAGGKLTGYRHMAQDIVDLVSKRLKKDYGL
TFSPCNTKGLAISGGDVGGSKNFDAFVEQKVDVAKGFGIDEDVARRLASKYGSNVDELFNIAQTSQYHDSKLPLEIYVEL
VYSIQQEMVYKPNDFLVRRSGKMYFNIKDVLDYKDAVIDIMADMLDYSPAQIEAYTEEVEQAIKEAQHGNNQPAVKE
>P0A6V5 2.8.1.1~~~glpE~~~Thiosulfate sulfurtransferase GlpE~~~COG0607
MDQFECINVADAHQKLQEKEAVLVDIRDPQSFAMGHAVQAFHLTNDTLGAFMRDNDFDTPVMVMCYHGNSSKGAAQYLLQ
QGYDVVYSIDGGFEAWQRQFPAEVAYGA
>P18156 ~~~glpF~~~Glycerol uptake facilitator protein~~~COG0580
MTAFWGEVIGTMLLIIFGAGVCAGVNLKKSLSFQSGWIVVVFGWGLGVAMAAYAVGGISGAHLNPALTIALAFVGDFPWK
EVPVYIAAQMIGAIIGAVIIYLHYLPHWKSTDDPAAKLGVFSTGPSIPHTFANVLSEVIGTFVLVLGILAIGANQFTEGL
NPLIVGFLIVAIGISLGGTTGYAINPARDLGPRIAHAFLPIPGKGSSNWKYAWVPVVGPILGGSFGGVFYNAAFKGHITS
SFWIVSVILVVVLLGLYVYTKSHSAKTLSNSKYI
>P0AER0 ~~~glpF~~~Glycerol uptake facilitator protein~~~COG0580
MSQTSTLKGQCIAEFLGTGLLIFFGVGCVAALKVAGASFGQWEISVIWGLGVAMAIYLTAGVSGAHLNPAVTIALWLFAC
FDKRKVIPFIVSQVAGAFCAAALVYGLYYNLFFDFEQTHHIVRGSVESVDLAGTFSTYPNPHINFVQAFAVEMVITAILM
GLILALTDDGNGVPRGPLAPLLIGLLIAVIGASMGPLTGFAMNPARDFGPKVFAWLAGWGNVAFTGGRDIPYFLVPLFGP
IVGAIVGAFAYRKLIGRHLPCDICVVEEKETTTPSEQKASL
>P19255 ~~~glpF~~~Probable glycerol uptake facilitator protein~~~COG0580
MSSSDIFIGETIGTALLILLGGGVCAAVTLKASKARNAGWLAIAFGWGFAVMTAAYISGPLSGAHLNPAVTVGIAIKDGD
WSNTPTYFAGQLLGAMIGAVLVWVAYYGQFQAHLTDREIVGGPGAQDTTAKSVEAQEKGAGPVLGVFSTGPEIRHTVQNL
ATEIIGTFVLLLAILTQGLNDEGNGLGILGALITGFVVVSIGLSLGGPTGYAINPVRDLGPRIVHALLPLPNKGGSDWSY
AWIPVVGPLIGGAIAGGVYNVAFA
>P09391 3.4.21.105~~~glpG~~~Rhomboid protease GlpG~~~COG0705
MLMITSFANPRVAQAFVDYMATQGVILTIQQHNQSDVWLADESQAERVRAELARFLENPADPRYLAASWQAGHTGSGLHY
RRYPFFAALRERAGPVTWVMMIACVVVFIAMQILGDQEVMLWLAWPFDPTLKFEFWRYFTHALMHFSLMHILFNLLWWWY
LGGAVEKRLGSGKLIVITLISALLSGYVQQKFSGPWFGGLSGVVYALMGYVWLRGERDPQSGIYLQRGLIIFALIWIVAG
WFDLFGMSMANGAHIAGLAVGLAMAFVDSLNARKRK
>P44783 3.4.21.105~~~glpG~~~Rhomboid protease GlpG~~~COG0705
MKNFLAQQGKITLILTALCVLIYLAQQLGFEDDIMYLMHYPAYEEQDSEVWRYISHTLVHLSNLHILFNLSWFFIFGGMI
ERTFGSVKLLMLYVVASAITGYVQNYVSGPAFFGLSGVVYAVLGYVFIRDKLNHHLFDLPEGFFTMLLVGIALGFISPLF
GVEMGNAAHISGLIVGLIWGFIDSKLRKNSLE
>P18157 2.7.1.30~~~glpK~~~Glycerol kinase~~~COG0554
METYILSLDQGTTSSRAILFNKEGKIVHSAQKEFTQYFPHPGWVEHNANEIWGSVLAVIASVISESGISASQIAGIGITN
QRETTVVWDKDTGSPVYNAIVWQSRQTSGICEELREKGYNDKFREKTGLLIDPYFSGTKVKWILDNVEGAREKAEKGELL
FGTIDTWLIWKMSGGKAHVTDYSNASRTLMFNIYDLKWDDELLDILGVPKSMLPEVKPSSHVYAETVDYHFFGKNIPIAG
AAGDQQSALFGQACFEEGMGKNTYGTGCFMLMNTGEKAIKSEHGLLTTIAWGIDGKVNYALEGSIFVAGSAIQWLRDGLR
MFQDSSLSESYAEKVDSTDGVYVVPAFVGLGTPYWDSDVRGSVFGLTRGTTKEHFIRATLESLAYQTKDVLDAMEADSNI
SLKTLRVDGGAVKNNFLMQFQGDLLNVPVERPEINETTALGAAYLAGIAVGFWKDRSEIANQWNLDKRFEPELEEEKRNE
LYKGWQKAVKAAMAFK
>P0A6F3 2.7.1.30~~~glpK~~~Glycerol kinase~~~COG0554
MTEKKYIVALDQGTTSSRAVVMDHDANIISVSQREFEQIYPKPGWVEHDPMEIWATQSSTLVEVLAKADISSDQIAAIGI
TNQRETTIVWEKETGKPIYNAIVWQCRRTAEICEHLKRDGLEDYIRSNTGLVIDPYFSGTKVKWILDHVEGSRERARRGE
LLFGTVDTWLIWKMTQGRVHVTDYTNASRTMLFNIHTLDWDDKMLEVLDIPREMLPEVRRSSEVYGQTNIGGKGGTRIPI
SGIAGDQQAALFGQLCVKEGMAKNTYGTGCFMLMNTGEKAVKSENGLLTTIACGPTGEVNYALEGAVFMAGASIQWLRDE
MKLINDAYDSEYFATKVQNTNGVYVVPAFTGLGAPYWDPYARGAIFGLTRGVNANHIIRATLESIAYQTRDVLEAMQADS
GIRLHALRVDGGAVANNFLMQFQSDILGTRVERPEVREVTALGAAYLAGLAVGFWQNLDELQEKAVIEREFRPGIETTER
NYRYAGWKKAVKRAMAWEEHDE
>O34153 2.7.1.30~~~glpK~~~Glycerol kinase~~~
MAEKNYVMAIDQGTTSSRAIIFDRNGKKIGSSQKEFPQYFPKSGWVEHNANEIWNSVQSVIAGAFIESGIRPEAIAGIGI
TNQRETTVVWDKTTGQPIANAIVWQSRQSSPIADQLKVDGHTEMIHEKTGLVIDAYFSATKVRWLLDNIEGAQEKADNGE
LLFGTIDSWLVWKLTDGQVHVTDYSNASRTMLYNIHKLEWDQEILDLLNIPSSMLPEVKSNSEVYGHTRSYHFYGSEVPI
AGMAGDQQAALFGQMAFEKGMIKNTYGTGAFIVMNTGEEPQLSDNDLLTTIGYGINGKVYYALEGSIFVAGSAIQWLRDG
LRMIETSPQSEELAAKAKGDNEVYVVPAFTGLGAPYWDSEARGAVFGLTRGTTKEDFVRATLQAVAYQSKDVIDTMKKDS
GIDIPLLKVDGGAAKNDLLMQFQADILDIDVQRAANLETTALGAAYLAGLAVGFWKDLDELKSMAEEGQMFTPEMPAEER
DNLYEGWKQAVAATQTFKFKAKKEGE
>O34154 2.7.1.30~~~glpK~~~Glycerol kinase~~~COG0554
MAEEKYIMAIDQGTTSSRAIIFDKKGNKIGSSQKEFTQYFPNAGWVEHNANEIWNSVQSVIAGSLIESGVKPTDIAGIGI
TNQRETTVVWDKATGLPIYNAIVWQSRQTTPIADQLKEDGYSEMIHEKTGLIIDAYFSATKVRWILDHVEGAQERAENGE
LMFGTIDTWLVWKLTGDTHVTDYSNASRTMLFNIHDLDWDQEILDLLNIPRVMLPKVVSNSEVYGLTKNYHFYGSEVPIA
GMAGDQQAALFGQMAFEPGMVKNTYGTGSFIVMNTGEEPQLSKNNLLTTIGYGINGKVYYALEGSIFVAGSAIQWLRDGL
KMLQTAAESEAVAKASTGHNEVYVVPAFTGLGAPYWDSQARGAVFGLTRGTTREDFVKATLQAVAYQVRDIIDTMKEDTG
IDIPVLKVDGGAANNDFLMQFQADILNTAVQRAHNLETTALGAAFLAGLAVGFWKDLEEIKAFQEEGQQFEPIMAEEERE
DLYEGWQQAVAATQQFKRKNK
>P9WPK1 2.7.1.30~~~glpK~~~Glycerol kinase~~~COG0554
MSDAILGEQLAESSDFIAAIDQGTTSTRCMIFDHHGAEVARHQLEHEQILPRAGWVEHNPVEIWERTASVLISVLNATNL
SPKDIAALGITNQRETTLVWNRHTGRPYYNAIVWQDTRTDRIASALDRDGRGNLIRRKAGLPPATYFSGGKLQWILENVD
GVRAAAENGDALFGTPDTWVLWNLTGGPRGGVHVTDVTNASRTMLMDLETLDWDDELLSLFSIPRAMLPEIASSAPSEPY
GVTLATGPVGGEVPITGVLGDQHAAMVGQVCLAPGEAKNTYGTGNFLLLNTGETIVRSNNGLLTTVCYQFGNAKPVYALE
GSIAVTGSAVQWLRDQLGIISGAAQSEALARQVPDNGGMYFVPAFSGLFAPYWRSDARGAIVGLSRFNTNAHLARATLEA
ICYQSRDVVDAMEADSGVRLQVLKVDGGITGNDLCMQIQADVLGVDVVRPVVAETTALGVAYAAGLAVGFWAAPSDLRAN
WREDKRWTPTWDDDERAAGYAGWRKAVQRTLDWVDVS
>O86033 2.7.1.30~~~glpK~~~Glycerol kinase~~~COG0554
MGGYILAIDQGTTSTRAIVFDGNQKIAGVGQKEFKQHFPKSGWVEHDPEEIWQTVVSTVKEAIEKSGITANDIAAIGITN
QRETVVVWDRETGKPIHNAIVWQDRRTAAFCDKLKKKGLEKTFVKKTGLLLDPYFSGTKLNWLLSNVKGAQVRAAKGELC
FGTIDTFLIWRLTGGECFCTDATNASRTLLYNIAENAWDDELTEVLRVPKEMLPEVKDCAADFGVTDPSLFGAAIPILGV
AGDQQAATIGQACFKPGMLKSTYGTGCFALLNTGKDMVRSKNRLLTTIAYRLDGETTYALEGSIFVAGAAVQWLRDGLKV
IKAAPDTGSLAESADPSQEVYLVPAFTGLGAPHWDPDARGAIFGMTRNTGPAEFARAALEAVCYQTRDLLEAMHKDWRRN
GNDTVLRVDGGMVASDWTMQRLSDLLDAPVDRPVILETTALGVAWLAGSRAGVWPNQEAFAKSWARDRRFEPHMDEATRK
VKLKGWRSAVKRTLIAA
>Q5HGD2 2.7.1.30~~~glpK~~~Glycerol kinase~~~
MEKYILSIDQGTTSSRAILFNQKGEIAGVAQREFKQYFPQSGWVEHDANEIWTSVLAVMTEVINENDVRADQIAGIGITN
QRETTVVWDKHTGRPIYHAIVWQSRQTQSICSELKQQGYEQTFRDKTGLLLDPYFAGTKVKWILDNVEGAREKAENGDLL
FGTIDTWLVWKLSGKAAHITDYSNASRTLMFNIHDLEWDDELLELLTVPKNMLPEVKASSEVYGKTIDYHFYGQEVPIAG
VAGDQQAALFGQACFERGDVKNTYGTGGFMLMNTGDKAVKSESGLLTTIAYGIDGKVNYALEGSIFVSGSAIQWLRDGLR
MINSAPQSESYATRVDSTEGVYVVPAFVGLGTPYWDSEARGAIFGLTRGTEKEHFIRATLESLCYQTRDVMEAMSKDSGI
DVQSLRVDGGAVKNNFIMQFQADIVNTSVERPEIQETTALGAAFLAGLAVGFWESKDDIAKNWKLEEKFDPKMDEGEREK
LYRGWKKAVEATQVFKTE
>P99113 2.7.1.30~~~glpK~~~Glycerol kinase~~~
MEKYILSIDQGTTSSRAILFNQKGEIAGVAQREFKQYFPQSGWVEHDANEIWTSVLAVMTEVINENDVRADQIAGIGITN
QRETTVVWDKHTGRPIYHAIVWQSRQTQSICSELKQQGYEQTFRDKTGLLLDPYFAGTKVKWILDNVEGAREKAENGDLL
FGTIDTWLVWKLSGKAAHITDYSNASRTLMFNIHDLEWDDELLELLTVPKNMLPEVKPSSEIYGKTIDYHFYGQEVPIAG
VAGDQQAALFGQACFECGDVKNTYGTGGFMLMNTGDKAVKSESGLLTTIAYGIDGKVNYALEGSIFVSGSAIQWLRDGLR
MINSAPQSESYATRVDSTEGVYVVPAFVGLGTPYWDSEARGAIFGLTRGTEKEHFIRATLESLCYQTRDVMEAMSKDSGI
DVQSLRVDGGAVKNNFIMQFQADIVNTSVERPEIQETTALGAAFLAGLAVGFWESKDDIAKNWKLEEKFDPKMDEGEREK
LYRGWKKAVEATQVFKTE
>Q9WX53 2.7.1.30~~~glpK~~~Glycerol kinase~~~
MNQYILAIDQGTTSSRAILFNQKGEIVHMAQKEFTQYFPQPGWVEHNANEIWGSVLAVIASVLSEAQVKPEQVAGIGITN
QRETTVVWVKDTGNPIYNAIVWQSRQTAGICDELKAKGYDPLFREKTGLLIDAYFSGTKVKWILDHVEGARERAERGELL
FGTIDTWLIWKLSGGRAHVTDYSNASRTLMFNIHTLEWDDELLAILNVPKAMLPEVRPSSEVYAKTVPHHFFGVEVPIAG
AAGDQQAALFGQACFTEGMAKNTYGTGCFMLMNTGEKAVQSKHGLLTTIAWGIDGKVEYALEGSIFVAGSAIQWLRDGLR
MIKTAADSEAYAEKVESTDGVYVVPAFVGLGTPYWDSEVRGAVFGLTRGTTKEHFIRATLESLAYQTKDVLAAMEADSGI
SLTTLRVDGGAVKNNFLMQFQSDLLAVPVERPVINETTALGAPYLAGLAVGYWNSRDDIAAQWQLERRFEPKMDDDKRTM
LYDGWKKAVRAAMAFK
>O66131 2.7.1.30~~~glpK~~~Glycerol kinase~~~
MNQYMLAIDQGTTSSRAILFNQKGEIVHMAQKEFTQYFPQPGWVEHNANEIWGSVLAVIASVLSEAQVKPEQVAGIGITN
QRETTVVWEKDTGNPIYNAIVWQSRQTAGICDELKAKGYDPLFRKKTGLLIDAYFSGTKVKWILDHVDGARERAERGELL
FGTIDTWLIWKLSGGRVHVTDYSNASRTLMFNIHTLEWDDELLDILGVPKAMLPEVRPSSEVYAKTAPYHFFGVEVPIAG
AAGDQQAALFGQACFTEGMAKNTYGTGCFMLMNTGEKAVASKHGLLTTIAWGIDGKVEYALEGSIFVAGSAIQWLRDGLR
MIKTAADSETYAEKVESTDGVYVVPAFIGLGTPYWDSEVRGAVFGLTRGTTKEHFIRATLESLAYQTKDVLAVMEADSGI
SLTTLRVDGGAVKNNFLMQFQSDLLAVPVERPVVNETTALGAAYLAGLAVGYWNSRDDIAAQWQLERRFEPKMDDDKRTM
LYDGWKKAVRAAMAFK
>O86963 1.1.3.21~~~glpO~~~Alpha-glycerophosphate oxidase~~~
MTFSQKDRKETIQETAKTTYDVLIIGGGITGAGVAVQTAAAGMKTVLLEMQDFAEGTSSRSTKLVHGGIRYLKTFDVEVV
ADTVRERAIVQQIAPHIPKPDPMLLPIYDEPGATFSLFSVKVAMDLYDRLANVTGSKYENYLLTKEEVLAREPQLQAENL
VGGGVYLDFRNNDARLVIENIKRAQADGAAMISKAKVVGILHDEQGIINGVEVEDQLTNERFEVHAKVVINTTGPWSDIV
RQLDKNDELPPQMRPTKGVHLVVDREKLKVPQPTYFDTGKNDGRMVFVVPRENKTYFGTTDTDYTGDFAHPTVTQEDVDY
LLTIVNERFPHAQITLDDIEASWAGLRPLITNNGGSDYNGGGKGKLSDESFEQIVESVKEYLADERQRPVVEKAVKQAQE
RVEASKVDPSQVSRGSSLERSKDGLLTLAGGKITDYRLMAEGAVKRINELLQESGASFELVDSTTYPVSGGELDAANVEE
ELAKLADQAQTAGFNEAAATYLAHLYGSNLPQVLNYKTKFEGLDEKESTALNYSLHEEMVLTPVDYLLRRTNHILFMRDT
LDDVKAGVVAAMTDFFGWSEEEKAAHVLELNQVIAESDLTALKGGKKDE
>P75063 1.1.3.21~~~glpD~~~Glycerol 3-phosphate oxidase~~~
METRDVLIVGGGVIGCATAYELSQYKLKVTLVEKHHYLAQETSHANSGVIHTGIDPNPHKLTAKYNILGKKLWLNTYFKR
LGFPRQKIRTLIVAFNEMEREQLEVLKQRGIANQINLEDIQMLSKEETLKLEPYVNPEIVAGLKIEGSWAIDPVLASKCL
ALAAQQNKVQICTNTEVTNISKQVDGTYLVWTNNETTPSFKVKKIIDAAGHYADYLAHLAKADDFEQTTRRGQYVVVTNQ
GELHLNSMVFMVPTIHGKGVIVSPMLDGNFLVGPTALDGVDKEATRYITKDAPCMLTKIGKHMVPSLNINNALISFAGSR
PIDKATNDFIIRVAHNDPDFVILGGMKSPGLTAAPAIVREAVRLLNWKLTKKPNWNGKYNLPWI
>P30300 ~~~glpP~~~Glycerol uptake operon antiterminator regulatory protein~~~COG1954
MMSFHNQPILPAIRNMKQFDEFLNSSFSYGVILDIHLGQLKGVIKEAQKHGKNMMVHVDLIQGIKHDEYGAEFICQDIKP
AGIISTRSNVIAKAKQKKIYAIQRLFLLDTSAMEKSMEFIGKHKPDFIEVLPGIVPSLIQEIKEKTGIPIFAGGFIRTEE
DVEQALKAGAVAVTTSNTKLWKKYENFLTESD
>P9WMU3 3.1.4.46~~~glpQ1~~~Probable glycerophosphodiester phosphodiesterase 1~~~COG0584
MTWADEVLAGHPFVVAHRGASAARPEHTLAAYDLALKEGADGVECDVRLTRDGHLVCVHDRRLDRTSTGAGLVSTMTLAQ
LRELEYGAWHDSWRPDGSHGDTSLLTLDALVSLVLDWHRPVKIFVETKHPVRYGSLVENKLLALLHRFGIAAPASADRSR
AVVMSFSAAAVWRIRRAAPLLPTVLLGKTPRYLTSSAATAVGATAVGPSLPALKEYPQLVDRSAAQGRAVYCWNVDEYED
IDFCREVGVAWIGTHHPGRTKAWLEDGRANGTTR
>O07244 3.1.4.46~~~glpQ2~~~Probable glycerophosphodiester phosphodiesterase 2~~~COG0584
MSDGGAPTVEFLRHGGRIAMAHRGFTSFRLPMNSMGAFQEAAKLGFRYIETDVRATRDGVAVILHDRRLAPGVGLSGAVD
RLDWRDVRKAQLGAGQSIPTLEDLLTALPDMRVNIDIKAASAIEPTVNVIERCNAHNRVLIGSFSERRRRRALRLLTKRV
ASSAGTGALLAWLTARPLGSRAYAWRMMRDIDCVQLPSRLGGVPVITPARVRGFHAAGRQVHAWTVDEPDVMHTLLDMDV
DGIITDRADLLRDVLIARGEWDGA
>P37965 3.1.4.46~~~glpQ~~~Glycerophosphodiester phosphodiesterase~~~COG0584
MRKNRILALFVLSLGLLSFMVTPVSAASKGNLLSPDRILTVAHRGASGYVPEHTILSYETAQKMKADFIELDLQMTKDGK
LIVMHDEKLDRTTNGMGWVKDHTLADIKKLDAGSWFNEAYPEKAKPQYVGLKVPTLEEVLDRFGKHANYYIETKSPDTYP
GMEEKLIASLQKHKLLGKHSKPGQVIIQSFSKESLVKVHQLQPNLPTVQLLEAKQMASMTDAALEEIKTYAVGAGPDYKA
LNQENVRMIRSHGLLLHPYTVNNEADMHRLLDWGVTGVFTNYPDLFHKVKKGY
>P09394 3.1.4.46~~~glpQ~~~Glycerophosphodiester phosphodiesterase, periplasmic~~~COG0584
MKLTLKNLSMAIMMSTIVMGSSAMAADSNEKIVIAHRGASGYLPEHTLPAKAMAYAQGADYLEQDLVMTKDDNLVVLHDH
YLDRVTDVADRFPDRARKDGRYYAIDFTLDEIKSLKFTEGFDIENGKKVQTYPGRFPMGKSDFRVHTFEEEIEFVQGLNH
STGKNIGIYPEIKAPWFHHQEGKDIAAKTLEVLKKYGYTGKDDKVYLQCFDADELKRIKNELEPKMGMELNLVQLIAYTD
WNETQQKQPDGSWVNYNYDWMFKPGAMKQVAEYADGIGPDYHMLIEETSQPGNIKLTGMVQDAQQNKLVVHPYTVRSDKL
PEYTPDVNQLYDALYNKAGVNGLFTDFPDKAVKFLNKE
>Q06282 3.1.4.46~~~glpQ~~~Glycerophosphodiester phosphodiesterase~~~COG0584
MKLKTLALSLLAAGVLAGCSSHSSNMANTQMKSDKIIIAHRGASGYLPEHTLESKALAFAQHSDYLEQDLAMTKDGRLVV
IHDHFLDGLTDVAKKFPYRHRKDGRYYVIDFTLKEIQSLEMTENFETKDGKQAQVYPNRFPLWKSHFRIHTFEDEIEFIQ
GLEKSTGKKVGIYPEIKAPWFHHQNGKDIATETLKVLKKYGYDKKTDMVYLQTFDFNELKRIKTELLPQMGMDLKLVQLI
AYTDWKETQEKDPKGYWVNYNYDWMFKPGAMAEVVKYADGVGPGWYMLVNKEESKPDNIVYTPLVKELAQYNVEVHPYTV
RKDALPEFFTDVNQMYDALLNKSGATGVFTDFPDTGVEFLKGIK
>O30405 3.1.4.46~~~glpQ~~~Glycerophosphodiester phosphodiesterase~~~COG0584
MRGTYCVTLWGGVFAALVAGCASERMIVAYRGAAGYVPEHTFASKVLAFAQGADYLQQDVVLSKDNQLIVAQSHILDNMT
DVAEKFPRRQRADGHFYVIDFTVEELSLLRATNSFYTRGKRHTPVYGQRFPLWKPGFRLHTFEEELQFIRGLEQTTGKKI
GIYSEIKVPWFHHQEGKDIAALTLALLKKYGYQSRSDLVYVQTYDFNELKRIKRELLPKYEMNVKLIQRVAYTDQRETQE
KDSRGKWINYNYNWMFEPGGMQKIAKYADGVGPDWRMLIENEWSKVGAVRLSPMVSAIQDAKLECHVHTVRKETLPSYAR
TMDEMFSILFKQTGANVVLTDFPDLGVKFLGKPARY
>P0ACL0 ~~~glpR~~~Glycerol-3-phosphate regulon repressor~~~COG1349
MKQTQRHNGIIELVKQQGYVSTEELVEHFSVSPQTIRRDLNELAEQNLILRHHGGAALPSSSVNTPWHDRKATQTEEKER
IARKVAEQIPNGSTLFIDIGTTPEAVAHALLNHSNLRIVTNNLNVANTLMVKEDFRIILAGGELRSRDGGIIGEATLDFI
SQFRLDFGILGISGIDSDGSLLEFDYHEVRTKRAIIENSRHVMLVVDHSKFGRNAMVNMGSISMVDAVYTDAPPPVSVMQ
VLTDHHIQLELC
>P08194 ~~~glpT~~~Glycerol-3-phosphate transporter~~~COG2271
MLSIFKPAPHKARLPAAEIDPTYRRLRWQIFLGIFFGYAAYYLVRKNFALAMPYLVEQGFSRGDLGFALSGISIAYGFSK
FIMGSVSDRSNPRVFLPAGLILAAAVMLFMGFVPWATSSIAVMFVLLFLCGWFQGMGWPPCGRTMVHWWSQKERGGIVSV
WNCAHNVGGGIPPLLFLLGMAWFNDWHAALYMPAFCAILVALFAFAMMRDTPQSCGLPPIEEYKNDYPDDYNEKAEQELT
AKQIFMQYVLPNKLLWYIAIANVFVYLLRYGILDWSPTYLKEVKHFALDKSSWAYFLYEYAGIPGTLLCGWMSDKVFRGN
RGATGVFFMTLVTIATIVYWMNPAGNPTVDMICMIVIGFLIYGPVMLIGLHALELAPKKAAGTAAGFTGLFGYLGGSVAA
SAIVGYTVDFFGWDGGFMVMIGGSILAVILLIVVMIGEKRRHEQLLQERNGG
>P21437 3.1.3.11~~~yggF~~~Fructose-1,6-bisphosphatase 2 class 2~~~COG1494
MMSLAWPLFRVTEQAALAAWPQTGCGDKNKIDGLAVTAMRQALNDVAFRGRVVIGEGEIDHAPMLWIGEEVGKGDGPEVD
IAVDPIEGTRMVAMGQSNALAVMAFAPRDSLLHAPDMYMKKLVVNRLAAGAIDLSLPLTDNLRNVAKALGKPLDKLRMVT
LDKPRLSAAIEEATQLGVKVFALPDGDVAASVLTCWQDNPYDVMYTIGGAPEGVISACAVKALGGDMQAELIDFCQAKGD
YTENRQIAEQERKRCKAMGVDVNRVYSLDELVRGNDILFSATGVTGGELVNGIQQTANGVRTQTLLIGGADQTCNIIDSL
H
>Q03224 3.1.3.11~~~glpX~~~Fructose-1,6-bisphosphatase class 2~~~COG1494
MERSLSMELVRVTEAAALASARWMGRGKKDEADEAATSAMRDVFDTVPMKGTVVIGEGEMDEAPMLYIGEKLGNGYGPRV
DVAVDPLEGTNILASGGWNALTVIAVADHGTLLNAPDMYMQKIAVGPEAVGCIDIEAPVIDNLKAVAKAKNKDVEDVVAT
ILNRERHAKIISELREAGARIKLINDGDVAGAINTAFDHTGVDILFGSGGAPEGVLSAVALKALGGEIIGKLLPQSEEEI
TRCHKMGLDLSKVLRMEDLVKGDDAIFAATGVTDGELLKGVQFKGSVGTTESLVIRAKSGTVRFVDGRHSLKKKPNLVIR
P
>Q6M6E7 3.1.3.11~~~glpX~~~Fructose-1,6-bisphosphatase class 2~~~COG1494
MNLKNPETPDRNLAMELVRVTEAAALASGRWVGRGMKNEGDGAAVDAMRQLINSVTMKGVVVIGEGEKDEAPMLYNGEEV
GTGFGPEVDIAVDPVDGTTLMAEGRPNAISILAAAERGTMYDPSSVFYMKKIAVGPEAAGKIDIEAPVAHNINAVAKSKG
INPSDVTVVVLDRPRHIELIADIRRAGAKVRLISDGDVAGAVAAAQDSNSVDIMMGTGGTPEGIITACAMKCMGGEIQGI
LAPMNDFERQKAHDAGLVLDQVLHTNDLVSSDNCYFVATGVTNGDMLRGVSYRANGATTRSLVMRAKSGTIRHIESVHQL
SKLQEYSVVDYTTAT
>P0A9C9 3.1.3.11~~~glpX~~~Fructose-1,6-bisphosphatase 1 class 2~~~COG1494
MRRELAIEFSRVTESAALAGYKWLGRGDKNTADGAAVNAMRIMLNQVNIDGTIVIGEGEIDEAPMLYIGEKVGTGRGDAV
DIAVDPIEGTRMTAMGQANALAVLAVGDKGCFLNAPDMYMEKLIVGPGAKGTIDLNLPLADNLRNVAAALGKPLSELTVT
ILAKPRHDAVIAEMQQLGVRVFAIPDGDVAASILTCMPDSEVDVLYGIGGAPEGVVSAAVIRALDGDMNGRLLARHDVKG
DNEENRRIGEQELARCKAMGIEAGKVLRLGDMARSDNVIFSATGITKGDLLEGISRKGNIATTETLLIRGKSRTIRRIQS
IHYLDRKDPEMQVHIL
>P9WN21 3.1.3.11~~~glpX~~~Fructose-1,6-bisphosphatase class 2~~~COG1494
MTAEGSGSSTAAVASHDPSHTRPSRREAPDRNLAMELVRVTEAGAMAAGRWVGRGDKEGGDGAAVDAMRELVNSVSMRGV
VVIGEGEKDHAPMLYNGEEVGNGDGPECDFAVDPIDGTTLMSKGMTNAISVLAVADRGTMFDPSAVFYMNKIAVGPDAAH
VLDITAPISENIRAVAKVKDLSVRDMTVCILDRPRHAQLIHDVRATGARIRLITDGDVAGAISACRPHSGTDLLAGIGGT
PEGIIAAAAIRCMGGAIQAQLAPRDDAERRKALEAGYDLNQVLTTEDLVSGENVFFCATGVTDGDLLKGVRYYPGGCTTH
SIVMRSKSGTVRMIEAYHRLSKLNEYSAIDFTGDSSAVYPLP
>P52101 2.7.13.3~~~glrK~~~Sensor histidine kinase GlrK~~~COG2205
MKRWPVFPRSLRQLVMLAFLLILLPLLVLAWQAWQSLNALSDQAALVNRTTLIDARRSEAMTNAALEMERSYRQYCVLDD
PTLAKVYQSQRKRYSEMLDAHAGVLPDDKLYQALRQDLHNLAQLQCNNSGPDAAAAARLEAFASANTEMVQATRTVVFSR
GQQLQREIAERGQYFGWQSLVLFLVSLVMVLLFTRMIIGPVKNIERMINRLGEGRSLGNSVSFSGPSELRSVGQRILWLS
ERLSWLESQRHQFLRHLSHELKTPLASMREGTELLADQVVGPLTPEQKEVVSILDSSSRNLQKLIEQLLDYNRKQADSAV
ELENVELAPLVETVVSAHSLPARAKMMHTDVDLKATACLAEPMLLMSVLDNLYSNAVHYGAESGNICLRSSLHGARVYID
VINTGTPIPQEERAMIFEPFFQGSHQRKGAVKGSGLGLSIARDCIRRMQGELYLVDESGQDVCFRIELPSSKNTK
>A9KHK4 2.4.1.247~~~~~~D-galactosyl-beta-1->4-L-rhamnose phosphorylase~~~COG5426
MEQQKEITKGGFTLPGEAGFEKLTLELANRWGADVIRDSDGTELSDDILNAGYGIYSTICLIRDHNAWAKANIDKLQQTF
LVTSPVVANSETLTIDLMEGFFKEQFLVNDSEEALEYWQVYDRTTETLLQKESWSYHPQNQTVVLTGICPWHKYTVSFMA
YRIWEEISMYNHTTNNWNKEHLMQIDPIHKETQEYLLTWMDDWCKKHEQTTVVRFTSMFYNFVWMWGSNEKNRYLFSDWA
SYDFTVSPHALKLFEEEYGYVLTAEDFIHQGKFHVTHMPADKHKLDWMEFINNFVVDFGKKLIDIVHNYGKLAYVFYDDS
WVGVEPYHKNFEKFGFDGLIKCVFSGFEVRLCAGVKVNTHELRLHPYLFPVGLGGAPTFMEGGNPTLDAKNYWISVRRAL
LREPIDRIGLGGYLHLVEDFPDFTDYIEKIANEFRRIKELHNAGKPMALKPRIAVLHSWGSLRSWTLSGHFHETYMHDLI
HINESLSGLPFDVKFINFEDINQGALEEVDVVINAGIMGSAWTGGQAWEDQEIIERLTRFVYEGKAFIGVNEPSALTGYD
TLYRMAHVLGVDMDLGDRVSHGRYSFTEEPVEELEFAECGPKAKRNIYLTDGLAKVLKEENGIPVMTSYEFGRGRGIYLA
SYEHSIKNARTLLNIILYAAGESFHQEGITNNVYTECAYYEKDKILVMINNSNTLQESSVTIKGRTYTKDIPAFDTVILP
LE
>P0AFU4 ~~~glrR~~~Transcriptional regulatory protein GlrR~~~COG2204
MSHKPAHLLLVDDDPGLLKLLGLRLTSEGYSVVTAESGAEGLRVLNREKVDLVISDLRMDEMDGMQLFAEIQKVQPGMPV
IILTAHGSIPDAVAATQQGVFSFLTKPVDKDALYQAIDDALEQSAPATDERWREAIVTRSPLMLRLLEQARLVAQSDVSV
LINGQSGTGKEIFAQAIHNASPRNSKPFIAINCGALPEQLLESELFGHARGAFTGAVSNREGLFQAAEGGTLFLDEIGDM
PAPLQVKLLRVLQERKVRPLGSNRDIDINVRIISATHRDLPKAMARGEFREDLYYRLNVVSLKIPALAERTEDIPLLANH
LLRQAAERHKPFVRAFSTDAMKRLMTASWPGNVRQLVNVIEQCVALTSSPVISDALVEQALEGENTALPTFVEARNQFEL
NYLRKLLQITKGNVTHAARMAGRNRTEFYKLLSRHELDANDFKE
>P68688 ~~~grxA~~~Glutaredoxin 1~~~COG0695
MQTVIFGRSGCPYCVRAKDLAEKLSNERDDFQYQYVDIRAEGITKEDLQQKAGKPVETVPQIFVDQQHIGGYTDFAAWVK
ENLDA
>P0AC59 ~~~grxB~~~Glutaredoxin 2~~~COG2999
MKLYIYDHCPYCLKARMIFGLKNIPVELHVLLNDDAETPTRMVGQKQVPILQKDDSRYMPESMDIVHYVDKLDGKPLLTG
KRSPAIEEWLRKVNGYANKLLLPRFAKSAFDEFSTPAARKYFVDKKEASAGNFADLLAHSDGLIKNISDDLRALDKLIVK
PNAVNGELSEDDIQLFPLLRNLTLVAGINWPSRVADYRDNMAKQTQINLLSSMAI
>P73492 ~~~~~~Probable glutaredoxin ssr2061~~~COG0695
MAVSAKIEIYTWSTCPFCMRALALLKRKGVEFQEYCIDGDNEAREAMAARANGKRSLPQIFIDDQHIGGCDDIYALDGAG
KLDPLLHS
>P0AC62 ~~~grxC~~~Glutaredoxin 3~~~COG0695
MANVEIYTKETCPYCHRAKALLSSKGVSFQELPIDGNAAKREEMIKRSGRTTVPQIFIDAQHIGGCDDLYALDARGGLDP
LLK
>P0AC69 ~~~grxD~~~Glutaredoxin 4~~~COG0278
MSTTIEKIQRQIAENPILLYMKGSPKLPSCGFSAQAVQALAACGERFAYVDILQNPDIRAELPKYANWPTFPQLWVDGEL
VGGCDIVIEMYQRGELQQLIKETAAKYKSEEPDAE
>Q5XBW6 ~~~~~~Stress response regulator gls24 homolog~~~
MTETYIKNTSKDLTSAIRGQLTYDDKVIEKIVGLALENVDGLLGVNGGFFANLKDKLVNTESVRDGVNVEVGKKQVAVDL
DIVAEYQKHVPTIYDSIKSIVEEEVKRMTDLDVIEVNVKVVDIKTKEQFEAEKVSLQDKVSDMARSTSEFTSHQVENVKA
SVDNGVEKLQDQKAEPRVK
>O31465 3.5.1.2~~~glsA1~~~Glutaminase 1~~~COG2066
MKELIKEHQKDINPALQLHDWVEYYRPFAANGQSANYIPALGKVNDSQLGICVLEPDGTMIHAGDWNVSFTMQSISKVIS
FIAACMSRGIPYVLDRVDVEPTGDAFNSIIRLEINKPGKPFNPMINAGALTIASILPGESAYEKLEFLYSVMETLIGKRP
RIHEEVFRSEWETAHRNRALAYYLKETNFLEAEVEETLEVYLKQCAMESTTEDIALIGLILAHDGYHPIRHEQVIPKDVA
KLAKALMLTCGMYNASGKYAAFVGVPAKSGVSGGIMALVPPSARREQPFQSGCGIGIYGPAIDEYGNSLTGGMLLKHMAQ
EWELSIF
>P77454 3.5.1.2~~~glsA1~~~Glutaminase 1~~~COG2066
MLDANKLQQAVDQAYTQFHSLNGGQNADYIPFLANVPGQLAAVAIVTCDGNVYSAGDSDYRFALESISKVCTLALALEDV
GPQAVQDKIGADPTGLPFNSVIALELHGGKPLSPLVNAGAIATTSLINAENVEQRWQRILHIQQQLAGEQVALSDEVNQS
EQTTNFHNRAIAWLLYSAGYLYCDAMEACDVYTRQCSTLLNTIELATLGATLAAGGVNPLTHKRVLQADNVPYILAEMMM
EGLYGRSGDWAYRVGLPGKSGVGGGILAVVPGVMGIAAFSPPLDEDGNSVRGQKMVASVAKQLGYNVFKG
>O07637 3.5.1.2~~~glsA2~~~Glutaminase 2~~~COG2066
MVCQHNDELEALVKKAKKVTDKGEVASYIPALAKADKHDLSVAIYYSNNVCLSAGDVEKTFTLQSISKVLSLALVLMEYG
KDKVFSYVGQEPTGDPFNSIIKLETVNPSKPLNPMINAGALVVTSLIRGRTVKERLDYLLSFIRRLTNNQEITYCREVAE
SEYSTSMINRAMCYYMKQYGIFEDDVEAVMDLYTKQCAIEMNSLDLAKIGSVFALNGRHPETGEQVISKDVARICKTFMV
TCGMYNASGEFAIKVGIPAKSGVSGGIMGISPYDFGIGIFGPALDEKGNSIAGVKLLEIMSEMYRLSIF
>P0A6W0 3.5.1.2~~~glsA2~~~Glutaminase 2~~~COG2066
MAVAMDNAILENILRQVRPLIGQGKVADYIPALATVDGSRLGIAICTVDGQLFQAGDAQERFSIQSISKVLSLVVAMRHY
SEEEIWQRVGKDPSGSPFNSLVQLEMEQGIPRNPFINAGALVVCDMLQGRLSAPRQRMLEVVRGLSGVSDISYDTVVARS
EFEHSARNAAIAWLMKSFGNFHHDVTTVLQNYFHYCALKMSCVELARTFVFLANQGKAIHIDEPVVTPMQARQINALMAT
SGMYQNAGEFAWRVGLPAKSGVGGGIVAIVPHEMAIAVWSPELDDAGNSLAGIAVLEQLTKQLGRSVY
>Q5KY26 3.5.1.2~~~glsA~~~Glutaminase~~~COG2066
MLVYNQEELVRFVEEAKQYARYGKVADYIPALGKANPNELSIAIYTPDDEVVSAGDVTVKVTLQSISKIIALALVLIDRG
EDEVFHKVGMEPTDYPFHSIAKLEEKPAKPLNPMINAGALVVTSMIQGGSVSERLERLLAFVRRLAGNERISYSDEVARS
EFETAFLNRSLCYFLKQHRIIDEDVEELMELYTKQCAIEMTCIDLARIGLVLALDGRDPHSSEPLMPLDVARICKTFMVT
CGMYNSSGEFAIKVGIPAKSGVSGGILAAVPGRCGIGVFGPALDDKGNSLTGVKLLERLSKTYSLSIF
>O87405 3.5.1.2~~~glsA~~~Thermolabile glutaminase~~~COG2066
MADLQATLDSIYTDILPRIGEGKVADYIPELAKIDPRQFGMAIVTVDGQVFRVGDADIAFSIQSISKVFMLTLALGKVGE
GLWKRVGREPSGSAFNSIVQLEHESGIPRNPFINAGAIAVTDVVMAGHAPREAIGELLRFVRYLADDESITIDDKVARSE
TQTGYRNVALANFMRAYRNLDHPVDHVLGVYFHQCALAMSCEQLARAGLFLAARGSNPMTGHSVVSPKRARRINALMLTC
GHYDGSGDFAYHVGLPGKSGVGGGIFAVAPGIASIAVWSPGLNKVGNSQLGAVALEMLAARTGWSVFGD
>P77952 2.6.1.50~~~stsC~~~L-glutamine:scyllo-inosose aminotransferase~~~
MDSSLAISGGPRLSNREWPRWPQPGDRALKSLEDVLTSGRWTISCAYQGRDSYERQFASAFADYCGSAMCVPISTGTASL
AIALEACGVGAGDEVIVPGLSWVASASAVLGINAVPVLVDVDPATYCLDPAATEAAITERTRAITVVHAYSAVADLDALL
DIARRHGLPLIEDCAHAHGAGFRGRPVGAHGAAGVFSMQGSKLLTCGEGGALVTDDADVALRAEHLRADGRVVRREPVGV
GEMELEETGRMMGSNACLSEFHAAVLLDQLELLDGQNARRTRAADHLTDRLSELGMTAQATAPGTTARAYYRYLVRLPDE
VLAVAPVERFAHALTAELGFAVTQTHRPLNDNPLNRPSSRRRFATDARYLERVDPSRFDLPAAKRAHESVVSFSHEVLLA
PLDAIDDIARAFRKVLDNVREVSR
>P9WGB9 2.8.2.-~~~~~~Glycolipid sulfotransferase Rv1373~~~
MNSEHPMTDRVVYRSLMADNLRWDALQLRDGDIIISAPSKSGLTWTQRLVSLLVFDGPDLPGPLSTVSPWLDQTIRPIEE
VVATLDAQQHRRFIKTHTPLDGLVLDDRVSYICVGRDPRDAAVSMLYQSANMNEDRMRILHEAVVPFHERIAPPFAELGH
ARSPTEEFRDWMEGPNQPPPGIGFTHLKGIGTLANILHQLGTVWVRRHLPNVALFHYADYQADLAGELLRPARVLGIAAT
RDRARDLAQYATLDAMRSRASEIAPNTTDGIWHSDERFFRRGGSGDWQQFFTEAEHLRYYHRINQLAPPDLLAWAHEGRR
GYDPAN
>P39812 1.4.1.13~~~gltA~~~Glutamate synthase [NADPH] large chain~~~COG0067
MTYNQMPKAQGLYRPEFEHDACGIGLYAHLKGKQTHDIVKQGLKMLCQLDHRGGQGSDPDTGDGAGLLVQIPDAFFRKEC
KNINLPEKERYGVGMVFFSQKEDERKKIEKQINALIEQEGQVVLGWRTVPVNVGKIGTVAQKSCPFVRQVFIGASSDLKD
NLSFERKLYVIRKQAENWGVTEGLDFYFASLSSQTIVYKGLLTPEQVDAFYSDLQDEAFVSAFALVHSRFSTNTFPTWER
AHPNRYLVHNGEINTLRGNINWMRAREQQFVSESFGEDLNKILPILNADGSDSSILDNAFEFFVMAGRKPAHTAMMLIPE
PWTENTHMSKEKRAFYEYHSSLMEPWDGPTAISFTDGKQIGAILDRNGLRPARYYVTKDDYIIFSSEVGVIEVEQENVLY
KNRLEPGKMLLIDLEEGRIISDEEVKTQIATEYPYQKWLEEELVQVNPDPESREEEQFSDLLTRQKAFGYTYEDIQKYLI
PVIKEGKDPLGSMGNDAPLAVLSDRAQSLFNYFKQLFAQVTNPPIDAIREQLVTSTMTWLGAEGDLLHPSERNVRRIKLY
TPVLSNEQFYALKTIVHPDLKSQKIDVLFSEDLERGLKDMFTQAEKAISQGVSLLILSDKKMNERLTPIPPLLAVSALHQ
HLIRKGLRTKVSIIVESGEAREVHHFAALIGYGADAINPYLAYATYKQEIDEGRLDISYEEAVSKYGKSITEGVVKVMSK
MGISTVQSYRGAQIFEAVGISRDVIDRYFSGTASQLGGIDLQTIAEEAQRRHREAYQDDYSKTLEPGSDFQWRNGGEHHA
FNPKTIHTLQWACRRNDYNLFKQYTKAADEERIGFLRNLFAFDGNRKPLKLEEVESAESIVKRFKTGAMSFGSLSKEAHE
ALAIAMNRLGGKSNSGEGGEDPKRFVPDENGDDRRSAIKQIASGRFGVKSHYLVNADELQIKMAQGAKPGEGGQLPGNKV
YPWVADVRGSTPGVGLISPPPHHDIYSIEDLAQLIHDLKNANRDARISVKLVSKAGVGTIAAGVAKATADVIVISGYDGG
TGASPKTSIKHTGLPWELGLAEAHQTLMLNGLRDRVVLETDGKLMTGRDVVMAALLGAEEFGFATAPLVVLGCVMMRACH
LDTCPVGVATQNPELRKKFMGDPDHIVNYMLFIAEEVREYMAALGFKTFDEMIGRTDVLHASERAKEHWKASQLDLSTLL
YQPEGVRTFQSPQNHKIDQSLDITTILPAVQEAIESGKEADISIEINNTNRVAGTITGSEISKRYGEEGLPEDTIKLHFT
GSAGQSFGAFVPKGMTLYLDGDSNDYVGKGLSGGKIIVKSSEGFNSASDDNVIIGNVAFYGATSGEAYINGRAGERFAVR
NSGVNVVVEGIGDHGCEYMTGGSVVVLGDVGKNFAAGMSGGIAYVLTEDVKAFKRKCNLEMILFESLEDEKEIQQIKAML
ERHTAYTNSQKAEDLLDQWEDSVKKFVKVIPKNYKQMLASIEEQKAAGLSDEEAIMFAFEANTKPKQNTAASGQKQAVVQ
>Q05755 1.4.1.13~~~gltB~~~Glutamate synthase [NADPH] large chain~~~
MTTELNQGEQFVADFRANAAALTTANAYNPEDEHDACGVGFIAAIDGKPRRSVVEKGIEALKAVWHRGAVDADGKTGDGA
GIHVAVPQKFFKDHVKVIGHRAPDNKLAVGQVFLPRISLDAQEACRCIVETEILAFGYYIYGWRQVPINVDIIGEKANAT
RPEIEQIIVGNNKGVSDEQFELDLYIIRRRIEKAVKGEQINDFYICSLSARSIIYKGMFLAEQLTTFYPDLLDERFESDF
AIYHQRYSTNTFPTWPLAQPFRMLAHNGEINTVKGNVNWMKAHETRMEHPAFGTHMQDLKPVIGVGLSDSGSLDTVFEVM
VRAGRTAPMVKMMLVPQALTSSQTTPDNHKALIQYCNSVMEPWDGPAALAMTDGRWVVGGMDRNGLRPMRYTITTDGLII
GGSETGMVKIDETQVIEKGRLGPGEMIAVDLQSGKLYRDRELKDHLATLKPWDKWVQNTTHLDELVKTASLKGEPSDMDK
AELRRRQQAFGLTMEDMELILHPMVEDGKEAIGSMGDDSPIAVLSDKYRGLHHFFRQNFSQVTNPPIDSLRERRVMSLKT
RLGNLGNILDEDETQTRLLQLESPVLTTAEFRAMRDYMGDTAAEIDATFPVDGGPEALRDALRRIRQETEDAVRGGATHV
ILTDEAMGPARAAIPAILATGAVHTHLIRSNLRTFTSLNVRTAEGLDTHYFAVLIGVGATTVNAYLAQEAIAERHRRGLF
GSMPLEKGMANYKKAIDDGLLKIMSKMGISVISSYRGGGNFEAIGLSRALVAEHFPAMVSRISGIGLNGIQKKVLEQHAT
AYNEEVVALPVGGFYRFRKSGDRHGWEGGVIHTLQQAVTNDSYTTFKKYSEQVNKRPPMQLRDLLELRSTKAPVPVDEVE
SITAIRKRFITPGMSMGALSPEAHGTLNVAMNRIGAKSDSGEGGEDPARFRPDKNGDNWNSAIKQVASGRFGVTAEYLNQ
CRELEIKVAQGAKPGEGGQLPGFKVTEMIARLRHSTPGVMLISPPPHHDIYSIEDLAQLIYDLKQINPDAKVTVKLVSRS
GIGTIAAGVAKANADIILISGNSGGTGASPQTSIKFAGLPWEMGLSEVHQVLTLNRLRHRVRLRTDGGLKTGRDIVIAAM
LGAEEFGIGTASLIAMGCIMVRQCHSNTCPVGVCVQDDKLRQKFVGTPEKVVNLFTFLAEEVREILAGLGFRSLNEVIGR
TDLLHQVSRGAEHLDDLDLNPRLAQVDPGENARYCTLQGRNEVPDTLDARIVADARPLFEEGEKMQLAYNARNTQRAIGT
RLSSMVTRKFGMFGLQPGHITIRLRGTAGQSLGAFAVQGIKLEVMGDANDYVGKGLSGGTIVVRPTTSSPLETNKNTIIG
NTVLYGATAGKLFAAGQAGERFAVRNSGATVVVEGCGSNGCEYMTGGTAVILGRVGDNFAAGMTGGMAYVYDLDDSLPLY
INDESVIFQRIEVGHYESQLKHLIEEHVTETQSRFAAEILNDWAREVTKFWQVVPKEMLNRLEVPVHLPKAISAE
>O34399 1.4.1.13~~~gltB~~~Glutamate synthase [NADPH] small chain~~~COG0493
MGKPTGFMEIKREKPAERDPLTRLKDWKEYSAPFSEEASKRQGARCMDCGTPFCQIGADINGFTSGCPIYNLIPEWNDLV
YRGRWKEALERLLKTNNFPEFTGRVCPAPCEGSCTLAISDPAVSIKNIERTIIDKGFENGWIQPRIPKKRTGKKVAIVGS
GPAGLASADQLNQAGHSVTVFERADRAGGLLTYGIPNMKLEKGIVERRIKLLTQEGIDFVTNTEIGVDITADELKEQFDA
VILCTGAQKQRDLLIEGRDSKGVHYAMDYLTLATKSYLDSNFKDKQFIDAKGKDVIVIGGGDTGADCVATALRQKAKSVH
QFGKHPKLPPARTNDNMWPEQPHVFTLEYAYEEAEAKFGRDPREYSIQTTKMVADKNGKLKELHTIQMEKVKNEHGKYEF
RELPGTEKVWPAQLVFIAIGFEGTEQPLLKQFGVNSVNNKISAAYGDYQTNIDGVFAAGDARRGQSLIVWAINEGREVAR
EVDRYLMGSSVLP
>P09831 1.4.1.13~~~gltB~~~Glutamate synthase [NADPH] large chain~~~COG0067
MLYDKSLERDNCGFGLIAHIEGEPSHKVVRTAIHALARMQHRGAILADGKTGDGCGLLLQKPDRFFRIVAQERGWRLAKN
YAVGMLFLNKDPELAAAARRIVEEELQRETLSIVGWRDVPTNEGVLGEIALSSLPRIEQIFVNAPAGWRPRDMERRLFIA
RRRIEKRLEADKDFYVCSLSNLVNIYKGLCMPTDLPRFYLDLADLRLESAICLFHQRFSTNTVPRWPLAQPFRYLAHNGE
INTITGNRQWARARTYKFQTPLIPDLHDAAPFVNETGSDSSSMDNMLELLLAGGMDIIRAMRLLVPPAWQNNPDMDPELR
AFFDFNSMHMEPWDGPAGIVMSDGRFAACNLDRNGLRPARYVITKDKLITCASEVGIWDYQPDEVVEKGRVGPGELMVID
TRSGRILHSAETDDDLKSRHPYKEWMEKNVRRLVPFEDLPDEEVGSRELDDDTLASYQKQFNYSAEELDSVIRVLGENGQ
EAVGSMGDDTPFAVLSSQPRIIYDYFRQQFAQVTNPPIDPLREAHVMSLATSIGREMNVFCEAEGQAHRLSFKSPILLYS
DFKQLTTMKEEHYRADTLDITFDVTKTTLEATVKELCDKAEKMVRSGTVLLVLSDRNIAKDRLPVPAPMAVGAIQTRLVD
QSLRCDANIIVETASARDPHHFAVLLGFGATAIYPYLAYETLGRLVDTHAIAKDYRTVMLNYRNGINKGLYKIMSKMGIS
TIASYRCSKLFEAVGLHDDVVGLCFQGAVSRIGGASFEDFQQDLLNLSKRAWLARKPISQGGLLKYVHGGEYHAYNPDVV
RTLQQAVQSGEYSDYQEYAKLVNERPATTLRDLLAITPGENAVNIADVEPASELFKRFDTAAMSIGALSPEAHEALAEAM
NSIGGNSNSGEGGEDPARYGTNKVSRIKQVASGRFGVTPAYLVNADVIQIKVAQGAKPGEGGQLPGDKVTPYIAKLRYSV
PGVTLISPPPHHDIYSIEDLAQLIFDLKQVNPKAMISVKLVSEPGVGTIATGVAKAYADLITIAGYDGGTGASPLSSVKY
AGCPWELGLVETQQALVANGLRHKIRLQVDGGLKTGVDIIKAAILGAESFGFGTGPMVALGCKYLRICHLNNCATGVATQ
DDKLRKNHYHGLPFKVTNYFEFIARETRELMAQLGVTRLVDLIGRTDLLKELDGFTAKQQKLALSKLLETAEPHPGKALY
CTENNPPFDNGLLNAQLLQQAKPFVDERQSKTFWFDIRNTDRSVGASLSGYIAQTHGDQGLAADPIKAYFNGTAGQSFGV
WNAGGVELYLTGDANDYVGKGMAGGLIAIRPPVGSAFRSHEASIIGNTCLYGATGGRLYAAGRAGERFGVRNSGAITVVE
GIGDNGCEYMTGGIVCILGKTGVNFGAGMTGGFAYVLDESGDFRKRVNPELVEVLSVDALAIHEEHLRGLITEHVQHTGS
QRGEEILANWSTFATKFALVKPKSSDVKALLGHRSRSAAELRVQAQ
>E1V8I1 1.4.1.13~~~gltB~~~Glutamate synthase [NADPH] large chain~~~COG0067
MNRGLHQPDEFRDNCGFGLIAHMEGQASHDLLKTAIESLTCMTHRGGIAADGKTGDGCGLLLKMPESFMREVAREALGVE
LGERFAVGSIFLPDDDAREAQGRETLEAELTKRGLNVLGWREVPVDPSVCGPMALDCLPRIRQLFVEPGEETGDTFDVDL
FMARRHAEQALRDEEDFYVCSLSPEVVSYKGLVMPVDLPAFYHDLGDTRLETAICVFHQRFSTNTAPRWPLAQPFRLLAH
NGEINTIQANRGWANSRKENFTSERLPDIAELDEIVNTTGSDSSSMDNMLEVLLTGGMDLHRAVRMMVPPAWQNVEIMDG
DLRAFYEYNSMHAEPWDGPAGVVMTDGRQAVCMLDRNGLRPARWVITRNGYITLASEIGTYDYRPEDVVAKGRVGPGQIL
AVDTETGEVLHTEDIDSRLKSAYPYKRWLKQEASYLESALTELARFQNMDADTLAVQQKMFQISFEERDQILRPLAESGQ
EGVGSMGDDTPMAVLSTKQRLLTDYFRQKFAQVTNPAIDPLREAIVMSLESCIGAELNVFKATPEHAHRLILTTPVLSPR
KFTALVTQEDPAFASQTLSLAYDPETTGLKDALQALCAEAEKAARGDKVLLVLSDANLEKGQLPIHAALAVGAVHHHLGR
LALRPRVNLIVETGYARDAHQMAVLFGVGATAVYPWLAYQVMADMHRTGELVGNPADARENYRKGLQKGLFKILSKMGIS
TLASYRGSQLFEAVGLASEVMDMCFTGMASRIEGTGFAELQLQQELLAKDAWKPRKSISHGGLMKYVHGHEYHAYNPDVI
KALQEAVQEGDYTKWKKFAALVNERDPATIRDLLRLKPAETPVPLEEVEPVENLLPRFDSAGMSLGALSPEAHEALAQAM
NETGGRSNSGEGGEDPVRYGTIRSSKIKQIASGRFGVTPAYLANAEVLQIKVAQGAKPGEGGQLPGGKVNELIARLRYAV
PGVTLISPPPHHDIYSIEDLAQLIFDLKQVNPDAQVSVKLVSEPGIGTIATGVAKAYADLITVSGYDGGTAASPLTSIKH
AGSPWELGLPEVHQALRINGLRDKIRLQTDGGLKTGLDVIKAAILGAESFGFGTAPMVALGCKYLRICHLNNCATGVATQ
HQVLRDEHFRGTVDMVKHYFRFIAEEVRELMAMLGVRQLTDLIGRTDLLEAIEGVTPSQRRLDLSPLMTNDFVPAEAPQF
CQVDRNVPHDPGAKNQEVLAAMKTAIEQKSGGEFEFAITNCDRSVGALASGTIAKRYGEAGLEDAPVTARFRGVAGQSFG
VWNARGLHLYLEGDANDYVGKGMNGGRVVIVPPRESRFESHKTAIIGNTCLYGATGGKLFASGTAGERFGVRNSGAQAVI
EGAGDHCCEYMTGGLVAVLGETGVNFGAGMTGGFAYVLDEDRTFVDKYNHELVEIHRVNTEAMEAHRRHLREVIEEFVAE
TGSQRGRDILEDFSDFIRHFWLVKPKAASLASLLDQSRRQPE
>P96218 1.4.1.13~~~gltB~~~Glutamate synthase [NADPH] large chain~~~COG0067
MTPKRVGLYNPAFEHDSCGVAMVVDMHGRRSRDIVDKAITALLNLEHRGAQGAEPRSGDGAGILIQVPDEFLREAVDFEL
PAPGSYATGIAFLPQSSKDAAAACAAVQKIAEAEGLQVLGWRSVPTDDSSLGALSRDAMPTFRQVFLAGASGMALERRCY
VVRKRAEHELGTKGPGQDGPGRETVYFPSLSGQTLVYKGMLTTPQLKAFYLDLQDERLTSALGIVHSRFSTNTFPSWPLA
HPFRRIAHNGEINTVTGNENWMRAREALIKTDIFGSAADVEKLFPICTPGASDTARFDEVLELLHLGGRSLAHAVLMMIP
EAWERHESMDPARRAFYQYHASLMEPWDGPASMTFTDGTVVGAVLDRNGLRPSRIWVTDDGLVVMASEAGVLDLHPSTVV
RRMRLQPGRMFLVDTAQGRIVSDEEIKADLAAEHPYQEWLDNGLVPLDELPEGKDVRMPHHRIVMRQLAFGYTYEELNLL
VAPMARLGAEPIGSMGTDTPVAVLSQRPRMLYDYFHQLFAQVTNPPLDAIREEVVTSLQGTTGGERDLLNPDQNSCHQIV
LPQPILRNHELAKLVSLDPNDKVNGRPHGLRSKVIRCLYRVSEGGAGLAAALEEVRGAAAAAIADGARIIILSDRESDEE
MAPIPSLLAVAGVHHHLVRERTRTQVGLVVESGDAREVHHMAALVGFGAAAINPYLVFESIEDMLDRGVIEGIDRTAALN
NYIKAAGKGVLKVMSKMGISTLASYTGAQLFQAVGISEQVLDEYFTGLTCPTGGITLDDIAADVAARHRLAYLDRPDERA
HRELEVGGEYQWRREGEYHLFNPETVFKLQHSTRTGQYKIFKEYTRLVDDQSERMASLRGLLKFRTGVRPPVPLDEVEPA
SEIVKRFSTGAMSYGSISAEAHETLAIAMNRLGARSNCGEGGEDVKRFDRDPNGDWRRSAIKQVASARFGVTSHYLTNCT
DLQIKMAQGAKPGEGGQLPGHKVYPWVAEVRHSTPGVGLISPPPHHDIYSIEDLAQLIHDLKNANPSARVHVKLVSENGV
GTVAAGVSKAHADVVLISGHDGGTGATPLTSMKHAGAPWELGLAETQQTLLLNGLRDRIVVQVDGQLKTGRDVMIATLLG
AEEFGFATAPLVVAGCIMMRVCHLDTCPVGVATQNPLLRERFTGKPEFVENFFMFIAEEVREYLAQLGFRTVNEAVGQAG
ALDTTLARAHWKAHKLDLAPVLHEPESAFMNQDLYCSSRQDHGLDKALDQQLIVMSREALDSGKPVRFSTTIGNVNRTVG
TMLGHELTKAYGGQGLPDGTIDITFDGSAGNSFGAFVPKGITLRVYGDANDYVGKGLSGGRIVVRPSDDAPQDYVAEDNI
IGGNVILFGATSGEVYLRGVVGERFAVRNSGAHAVVEGVGDHGCEYMTGGRVVILGRTGRNFAAGMSGGVAYVYDPDGEL
PANLNSEMVELETLDEDDADWLHGTIQVHVDATDSAVGQRILSDWSGQQRHFVKVMPRDYKRVLQAIALAERDGVDVDKA
IMAAAHG
>P55037 1.4.7.1~~~gltB~~~Ferredoxin-dependent glutamate synthase 1~~~COG0067
MPCHEGLHPLVPNFCTVTSPMNSSHLAPQVQGLYDPQNEHDACGVGFIVQMKGKVSHDIVEQGLQMLVNLEHRGACGCEP
NTGDGAGILIQVPHKFIQKIAGAEGITIPAPGQYAVGNIYGSPDPLARAEARQKFNDIVAQEGLKVLGWRDIPTQNEPLG
ETAIASEPFMQQVYIARPEGLTDDLDFERKLYVIRKLTHGAIRSPKIDTYWYVASLSARTLVYKGMLTTAQVGQYYPELH
DPDMESALALVHSRFSTNTFPSWERSHPYRYIAHNGEINTMRGNVNWMQARQALFESSLFGEDMAKVQPVINIDGSDSTI
FDNALELLYLAGRSLPHAVMMMIPEPWSAHESMSQEKKAFYKYHSCLMEPWDGPASIAFTNGKMMGAVLDRNGLRPSRYY
VTKDDLVIMASEAGVLPIEPERVAKKGRLQPGRMFLVDMEQGRIIADEEIKQEIVSQHPYGEWLAANLKSLEQLPSPGNV
PGTDAESLRQRQMAFGYTFEELRILLAPMGRDGVEAIGSMGADTPLAVLSDKPKLLYNYFQQLFAQVTNPPIDSIREEII
TSAETTIGGEGNLLDPRPESCRLIELKTPILTNEDLAKLKALDDDEFKSVTLDILFDPNQGEAGLKTALDNLFTEADQAI
SQGANLIILSDRQVSAEKAAIPALLAVSGLHHHLIRNGSRTKVGLVLESGEPREVHHFAVLLGYGCGAINPYLAFETLDG
MIAEGLLVNVDHKTACKNYIKAATKGVIKVASKIGISTIQSYRGAQIFEAVGLNQSVIDEYFCRTSSRIQGSDLGVIAQE
AILRHQHAFAPRPGDLHTLDVGGEYQWRKDGEEHLFSPQTIHLLQRAVREGNYELYKQYAALVNEQNQKFFTLRGLLDFQ
DRESIPLEEVEPIEAIMKRFKTGAMSYGSISKEAHESLAIAMNRIGGKSNTGEGGEDPERFTWTNDQGDSKNSAIKQVAS
GRFGVTSLYLSQAKEIQIKMAQGAKPGEGGQLPGKKVYPWIAKVRHSTPGVGLISPPPHHDIYSIEDLAELIHDLKNANR
EARINVKLVSEVGVGTIAAGVAKAHADVVLVSGYDGGTGASPQTSIKHAGLPWELGLAETHQTLVLNNLRSRIVVETDGQ
MKTGRDVAIAALLGAEEFGFSTAPLVSLGCIMMRACHLNTCPVGIATQNPELRAKFTGDPAHAVNFMTFIATELREVMAQ
LGFRTINEMVGRTDILEPKKAVAHWKAKGIDLSTILHQPEVGDDVGRYCQIPQDHGLQHSLDITQLLDLCQPAIAKGEKV
TATLPITNINRVVGTIVGNEITKRHWEGLPEDTVHLHFQGSAGQSFGAFIPKGMTLELEGDANDYLGKGLSGGKIIVYPP
KGSSFIASENIIAGNVCLYGATAGEVYISGMVGERFCVRNSGVNTVVEAVGDHGCEYMTGGKVVVLGQTGRNFAAGMSGG
VAYIFDETGDFATRCNSAMVGLEKLEDPEEIKDLKELIQNHVNYTDSAKGKAVLADWEASIPKFVKVMPRDYKRVLQAIK
KALEAGLSGDDALNAAFEENAKDVARIGGS
>P20668 ~~~gltC~~~Transcriptional dual regulator GltC~~~COG0583
MELRQLRYFMEVAEREHVSEAADHLHVAQSAISRQIANLEEELNVTLFEREGRNIKLTPIGKEFLIHVKTAMKAIDYAKE
QIDEYLDPHRGTVKIGFPTSLASQLLPTVISAFKEEYPHVEFLLRQGSYKFLIEAVRNRDIDLALLGPVPTNFSDITGKI
LFTEKIYALVPLNHPLAKQKTVHLIDLRNDQFVLFPEGFVLREMAIDTCKQAGFAPLVSTEGEDLDAIKGLVSAGMGVTL
LPESTFAETTPRFTVKIPIEFPQVKRTVGIIKPKNRELAPSANDFYEFVIQFFSKLEQYQ
>Q05756 1.4.1.13~~~gltD~~~Glutamate synthase [NADPH] small chain~~~
MANQRMLGFVHTAQRMPDKRPAAERRQDFAEIYARFSDERANEQANRCSQCGVPFCQVHCPVSNNIPDWLKLTSEGRLEE
AYEVSQATNNFPEICGRICPQDRLCEGNCVIEQSTHGAVTIGSVEKYINDTAWDQGWVKPRTPSRELGLSVGVIGAGPAG
LAAAEELRAKGYEVHVYDRYDRMGGLLVYGIPGFKLEKSVVERRVKLLADAGVIYHPNFEVGRDASLPELRRKHVAVLVA
TGVYKARDIKAPGSGLGNIVAALDYLTTSNKVSLGDTVEAYENGSLNAAGKHVVVLGGGDTAMDCVRTAIRQGATSVKCL
YRRDRKNMPGSQREVAHAEEEGVEFIWQAAPEGFTGDTVVTGVRAVRIHLGVADATGRQTPQVIEGSEFTVQADLVIKAL
GFEPEDLPNAFDEPELKVTRWGTLLVDHRTKMTNMDGVFAAGDIVRGASLVVWAIRDGRDAAEGIHAYAKAKAEAPVAVA
AE
>P09832 1.4.1.13~~~gltD~~~Glutamate synthase [NADPH] small chain~~~COG0493
MSQNVYQFIDLQRVDPPKKPLKIRKIEFVEIYEPFSEGQAKAQADRCLSCGNPYCEWKCPVHNYIPNWLKLANEGRIFEA
AELSHQTNTLPEVCGRVCPQDRLCEGSCTLNDEFGAVTIGNIERYINDKAFEMGWRPDMSGVKQTGKKVAIIGAGPAGLA
CADVLTRNGVKAVVFDRHPEIGGLLTFGIPAFKLEKEVMTRRREIFTGMGIEFKLNTEVGRDVQLDDLLSDYDAVFLGVG
TYQSMRGGLENEDADGVYAALPFLIANTKQLMGFGETRDEPFVSMEGKRVVVLGGGDTAMDCVRTSVRQGAKHVTCAYRR
DEENMPGSRREVKNAREEGVEFKFNVQPLGIEVNGNGKVSGVKMVRTEMGEPDAKGRRRAEIVAGSEHIVPADAVIMAFG
FRPHNMEWLAKHSVELDSQGRIIAPEGSDNAFQTSNPKIFAGGDIVRGSDLVVTAIAEGRKAADGIMNWLEV
>E1V8I0 1.4.1.13~~~gltD~~~Glutamate synthase [NADPH] small chain~~~COG0493
MANRLNNDFQFIDVGRQDPEKKPARTRAKQFAEIYEPYKPQDAAAQAHRCLHCGNPYCEWKCPVHNYIPNWLQLVSEGNI
LEAAELSHRTNSLPEVCGRVCPQDRLCEGDCTLNDGFGAVTIGSVEKYITDTAFAMGWRPDMSKVTWTDKKVAIIGAGPA
GLGCADILVRNGVKPVVFDKYPEIGGLLTFGIPEFKLEKTVMERRRAVFEEMGVEFCLGVEIGRDMPFEQLLEEYDAVFL
GMGTYKYMEGGFPGEDLPGVHKALDYLVANVNHCLGFETDPADYVSLEGQRVVVLGGGDTAMDCNRTAIRQGAASVTCAY
RRDEDNMPGSRKEVANAREEGVDFLFNRQPVAVIGEDRVEGIKVVRTRLGEPDENGRQRPEVVPGSEEVVPADAVVIAFG
FQPSPAPWFETVGIELDEKGRVKAPEEGAYAFQTTNEKIFAGGDMVRGSDLVVTAVFEGRQAGEGILDYLDV
>P9WN19 1.4.1.13~~~gltD~~~Glutamate synthase [NADPH] small chain~~~COG0493
MADPGGFLKYTHRKLPKRRPVPLRLRDWREVYEEFDNESLRQQATRCMDCGIPFCHNGCPLGNLIPEWNDLVRRGRWRDA
IERLHATNNFPDFTGRLCPAPCEPACVLGINQDPVTIKQIELEIIDKAFDEGWVQPRPPRKLTGQTVAVVGSGPAGLAAA
QQLTRAGHTVTVFEREDRIGGLLRYGIPEFKMEKRHLDRRLDQMRSEGTEFRPGVNVGVDISAEKLRADFDAVVLAGGAT
AWRELPIPGRELEGVHQAMEFLPWANRVQEGDDVLDEDGQPPITAKGKKVVIIGGGDTGADCLGTVHRQGAIAVHQFEIM
PRPPDARAESTPWPTYPLMYRVSAAHEEGGERVFSVNTEAFVGTDGRVSALRAHEVTMLDGKFVKVEGSDFELEADLVLL
AMGFVGPERAGLLTDLGVKFTERGNVARGDDFDTSVPGVFVAGDMGRGQSLIVWAIAEGRAAAAAVDRYLMGSSALPAPV
KPTAAPLQ
>P28721 ~~~gltF~~~Protein GltF~~~COG3539
MFFKKNLTTAAICAALSVAAFSAMATDSTDTELTIIGEYTPGACTPVVTGGGIVDYGKHHNSALNPTGKSNKLVQLGRKN
STLNITCTAPTLIAVTSKDNRQSTIVALNDTSYIEKAYDTLVDMKGTKNAFGLGSAPNGQKIGAASIGIDRSNGGIHAAD
DTGEIPVDLIQTDHWSAATPTWKASSNGAFCSLTSCSAIERGYSVAKTGELTPVAITAVTFPLLIDAAVNDNTILGSDET
IKLDGNVTISVQYL
>P37902 ~~~gltI~~~Glutamate/aspartate import solute-binding protein~~~COG0834
MQLRKPATAILALALSAGLAQADDAAPAAGSTLDKIAKNGVIVVGHRESSVPFSYYDNQQKVVGYSQDYSNAIVEAVKKK
LNKPDLQVKLIPITSQNRIPLLQNGTFDFECGSTTNNVERQKQAAFSDTIFVVGTRLLTKKGGDIKDFANLKDKAVVVTS
GTTSEVLLNKLNEEQKMNMRIISAKDHGDSFRTLESGRAVAFMMDDALLAGERAKAKKPDNWEIVGKPQSQEAYGCMLRK
DDPQFKKLMDDTIAQVQTSGEAEKWFDKWFKNPIPPKNLNMNFELSDEMKALFKEPNDKALN
>P0AER3 ~~~gltJ~~~Glutamate/aspartate import permease protein GltJ~~~COG0765
MSIDWNWGIFLQQAPFGNTTYLGWIWSGFQVTIALSICAWIIAFLVGSFFGILRTVPNRFLSGLGTLYVELFRNVPLIVQ
FFTWYLVIPELLPEKIGMWFKAELDPNIQFFLSSMLCLGLFTAARVCEQVRAAIQSLPRGQKNAALAMGLTLPQAYRYVL
LPNAYRVIVPPMTSEMMNLVKNSAIASTIGLVDMAAQAGKLLDYSAHAWESFTAITLAYVLINAFIMLVMTLVERKVRLP
GNMGGK
>P0AER5 ~~~gltK~~~Glutamate/aspartate import permease protein GltK~~~COG0765
MYEFDWSSIVPSLPYLLDGLVITLKITVTAVVIGILWGTMLAVMRLSSFAPVAWFAKAYVNVFRSIPLVMVLLWFYLIVP
GFLQNVLGLSPKNDIRLISAMVAFSMFEAAYYSEIIRAGIQSISRGQSSAALALGMTHWQSMKLIILPQAFRAMVPLLLT
QGIVLFQDTSLVYVLSLADFFRTASTIGERDGTQVEMILFAGFVYFVISLSASLLVSYLKRRTA
>P39817 ~~~gltP~~~Proton/glutamate-aspartate symporter~~~COG1301
MKKLIAFQILIALAVGAVIGHFFPDFGMALRPVGDGFIRLIKMIVVPIVFSTIVIGAAGSGSMKKMGSLGIKTIIWFEVI
TTLVLGLGLLLANVLKPGVGLDLSHLAKKDIHELSGYTDKVVDFKQMILDIIPTNIIDVMARNDLLAVIFFAILFGVAAA
GIGKASEPVMKFFESTAQIMFKLTQIVMVTAPIGVLALMAASVGQYGIELLLPMFKLVGTVFLGLFLILFVLFPLVGLIF
QIKYFEVLKMIWDLFLIAFSTTSTETILPQLMDRMEKYGCPKRVVSFVVPSGLSLNCDGSSLYLSVSCIFLAQAFQVDMT
LSQQLLMMLVLVMTSKGIAAVPSGSLVVLLATANAVGLPAEGVAIIAGVDRVMDMARTGVNVPGHAIACIVVSKWEKAFR
QKEWVSANSQTESI
>P21345 ~~~gltP~~~Proton/glutamate-aspartate symporter~~~COG1301
MKNIKFSLAWQILFAMVLGILLGSYLHYHSDSRDWLVVNLLSPAGDIFIHLIKMIVVPIVISTLVVGIAGVGDAKQLGRI
GAKTIIYFEVITTVAIILGITLANVFQPGAGVDMSQLATVDISKYQSTTEAVQSSSHGIMGTILSLVPTNIVASMAKGEM
LPIIFFSVLFGLGLSSLPATHREPLVTVFRSISETMFKVTHMVMRYAPVGVFALIAVTVANFGFSSLWPLAKLVLLVHFA
ILFFALVVLGIVARLCGLSVWILIRILKDELILAYSTASSESVLPRIIEKMEAYGAPVSITSFVVPTGYSFNLDGSTLYQ
SIAAIFIAQLYGIDLSIWQEIILVLTLMVTSKGIAGVPGVSFVVLLATLGSVGIPLEGLAFIAGVDRILDMARTALNVVG
NALAVLVIAKWEHKFDRKKALAYEREVLGKFDKTADQ
>P9WMX7 2.4.1.-~~~~~~PGL/p-HBAD biosynthesis glycosyltransferase Rv2957~~~COG0463
MAAPMFSIIIPTLNVAAVLPACLDSIARQTCGDFELVLVDGGSTDETLDIANIFAPNLGERLIIHRDTDQGVYDAMNRGV
DLATGTWLLFLGADDSLYEADTLARVAAFIGEHEPSDLVYGDVIMRSTNFRWGGAFDLDRLLFKRNICHQAIFYRRGLFG
TIGPYNLRYRVLADWDFNIRCFSNPALVTRYMHVVVASYNEFGGLSNTIVDKEFLKRLPMSTRLGIRLVIVLVRRWPKVI
SRAMVMRTVISWRRRR
>P9WFR1 2.4.1.-~~~~~~PGL/p-HBAD biosynthesis glycosyltransferase Rv2958c~~~COG1819
MEETSVAGDPGPDAGTSTAPNAAPEPVARRQRILFVGEAATLAHVVRPFVLARSLDPSRYEVHFACDPRFNKLLGPLPFP
HHPIHTVPSEEVLLKIAQGRLFYNTRTLRKYIAADRKILNEIAPDVVVGDNRLSLSVSARLAGIPYIAIANAYWSPQARR
RFPLPDVPWTRFFGVRPVSILYRLYRPLIFALYCLPLNWLRRKHGLSSLGWDLCRIFTDGDYTLYADVPELVPTYNLPAN
HRYLGPVLWSPDVKPPTWWHSLPTDRPIIYATLGSSGGKNLLQVVLNALADLPVTVIAATAGRNHLKNVPANAFVADYLP
GEAAAARSAVVLCNGGSPTTQQALAAGVPVIGLPSNMDQHLNMEALERAGAGVLLRTERLNTEGVAAAVKQVLSGAEFRQ
AARRLAEAFGPDFAGFPQHIESALRLVC
>P94501 ~~~gltR~~~HTH-type transcriptional regulator GltR~~~COG0583
MNIQLLQVFLTTAREGSISKAALTLNYAQSNVTNKIQQLENDLQTKLFYRHSRGITLTPPGQILVSYSEKILHTIEEARA
AMGESSAPSGPLRIGSMETTAAVWLPQLLAHYNNLYPNVDLNLVTGPTEQQIQAVLHYELNGAFISGPIEHPDLVQEKVL
DEEMVLVTSASHPVISSIQDVQTQTMLVFRKGCSYRAKLNHILQEEGLLPIKLMEFGILEAIIGCVSAGLGISLLPRSII
ASHEKEGRIRSHTISDKYSFVSTMFIRRKDTLITPALSAFLTHMRDHFQIKRPDQS
>G3XCY6 ~~~gltR~~~Transcriptional regulatory protein GltR~~~
MSANGRSILLVDDDQEIRELLETYLSRAGFQVRSVSRGADFRQALCEEEASLAILDVMLPDEDGFSLCRWIRSHQRLACM
PIIMLTASSDEADRVIGLELGADDYLGKPFSPRELLARIKALLRRAQFTQVRGGDVLAFEDWRLDTVSHRLFHEDGEEFF
LSGADFALLKLFLDHPQQILDRDTIANATRGREVLPLERIVDMAVSRLRQRLRDTGKAPRLIQTVRGSGYLLAAQVRPHL
QP
>P0AER8 ~~~gltS~~~Sodium/glutamate symporter~~~COG0786
MFHLDTLATLVAATLTLLLGRKLVHSVSFLKKYTIPEPVAGGLLVALALLVLKKSMGWEVNFDMSLRDPLMLAFFATIGL
NANIASLRAGGRVVGIFLIVVVGLLVMQNAIGIGMASLLGLDPLMGLLAGSITLSGGHGTGAAWSKLFIERYGFTNATEV
AMACATFGLVLGGLIGGPVARYLVKHSTTPNGIPDDQEVPTAFEKPDVGRMITSLVLIETIALIAICLTVGKIVAQLLAG
TAFELPTFVCVLFVGVILSNGLSIMGFYRVFERAVSVLGNVSLSLFLAMALMGLKLWELASLALPMLAILVVQTIFMALY
AIFVTWRMMGKNYDAAVLAAGHCGFGLGATPTAIANMQAITERFGPSHMAFLVVPMVGAFFIDIVNALVIKLYLMLPIFA
G
>P55038 1.4.7.1~~~gltS~~~Ferredoxin-dependent glutamate synthase 2~~~COG0067
MSFQYPLLAPMTNSSVATNSNQPFLGQPWLVEERDACGVGFIANLRGKPDHTLVEQALKALGCMEHRGGCSADNDSGDGA
GVMTAIPRELLAQWFNTRNLPMPDGDRLGVGMVFLPQEPSAREVARAYVEEVVRLEKLTVLGWREVPVNSDVLGIQAKNN
QPHIEQILVTCPEGCAGDELDRRLYIARSIIGKKLAEDFYVCSFSCRTIVYKGMVRSIILGEFYLDLKNPGYTSNFAVYH
RRFSTNTMPKWPLAQPMRLLGHNGEINTLLGNINWMAAREKELEVSGWTKAELEALTPIVNQANSDSYNLDSALELLVRT
GRSPLEAAMILVPEAYKNQPALKDYPEISDFHDYYSGLQEPWDGPALLVFSDGKIVGAGLDRNGLRPARYCITKDDYIVL
GSEAGVVDLPEVDIVEKGRLAPGQMIAVDLAEQKILKNYQIKQQAAQKYPYGEWIKIQRQTVASDSFAEKTLFNDAQTVL
QQQAAFGYTAEDVEMVVVPMASQGKEPTFCMGDDTPLAVLSHKPRLLYDYFKQRFAQVTNPPIDPLRENLVMSLAMFLGK
RGNLLEPKAESARTIKLRSPLVNEVELQAIKTGQLQVAEVSTLYDLDGVNSLETALDNLVKTAIATVQAGAEILVLTDRP
NGAILTENQSFIPPLLAVGAVHHHLIRAGLRLKASLIVDTAQCWSTHHFACLVGYGASAICPYLALESVRQWWLDEKTQK
LMENGRLDRIDLPTALKNYRQSVEAGLFKILSKMGISLLASYHGAQIFEAIGLGAELVEYAFAGTTSRVGGLTIADVAGE
VMVFHGMAFPEMAKKLENFGFVNYRPGGEYHMNSPEMSKSLHKAVAAYKVGGNGNNGEAYDHYELYRQYLKDRPVTALRD
LLDFNADQPAISLEEVESVESIVKRFCTGGMSLGALSREAHETLAIAMNRLGAKSNSGEGGEDVVRYLTLDDVDSEGNSP
TLPHLHGLQNGDTANSAIKQIASGRFGVTPEYLMSGKQLEIKMAQGAKPGEGGQLPGKKVSEYIAMLRRSKPGVTLISPP
PHHDIYSIEDLAQLIYDLHQINPEAQVSVKLVAEIGIGTIAAGVAKANADIIQISGHDGGTGASPLSSIKHAGSPWELGV
TEVHRVLMENQLRDRVLLRADGGLKTGWDVVMAALMGAEEYGFGSIAMIAEGCIMARVCHTNNCPVGVATQQERLRQRFK
GVPGQVVNFFYFIAEEVRSLLAHLGYRSLDDIIGRTDLLKVRSDVQLSKTQNLTLDCLLNLPDTKQNRQWLNHEPVHSNG
PVLDDDILADPDIQEAINHQTTATKTYRLVNTDRTVGTRLSGAIAKKYGNNGFEGNITLNFQGAAGQSFGAFNLDGMTLH
LQGEANDYVGKGMNGGEIVIVPHPQASFAPEDNVIIGNTCLYGATGGNLYANGRAGERFAVRNSVGKAVIEGAGDHCCEY
MTGGVIVVLGPVGRNVGAGMTGGLAYFLDEVGDLPEKINPEIITLQRITASKGEEQLKSLITAHVEHTGSPKGKAILANW
SDYLGKFWQAVPPSEKDSPEANGDVSLTGEKTLTSV
>P24944 ~~~gltT~~~Proton/sodium-glutamate symport protein~~~
MRKIGLAWQIFIGLILGIIVGAIFYGNPKVAAYLQPIGDIFLRLIKMIVIPIVISSLVVGVASVGDLKKLGKLGGKTIIY
FEIITTIAIVVGLLAANIFQPGAGVNMKSLEKTDIQSYVDTTNEVQHHSMVETFVNIVPKNIFESLSTGDMLPIIFFSVM
FGLGVAAIGEKGKPVLQFFQGTAEAMFYVTNQIMKFAPFGVFALIGVTVSKFGVESLIPLSKLVIVVYATMLFFIFAVLG
GVAKLFGINIFHIIKILKDELILAYSTASSETVLPRIMDKMEKFGCPKAITSFVIPTGYSFNLDGSTLYQALAAIFIAQL
YGIDMSVSQQISLLLVLMVTSKGIAGVPGVSFVVLLATLGTVGIPVEGLAFIAGIDRILDMARTAVNVIGNSLAAIIMSK
WEGQYNEEKGKQYLAELQQSA
>O07605 ~~~gltT~~~Proton/sodium-glutamate symport protein~~~COG1301
MKRIKFGLATQIFVGLILGVIVGVIWYGNPALPTYLQPIGDLFLRLIKMIVIPIVVSSLIIGVAGAGNGKQVGKLGFRTI
LYFEIITTFAIILGLALANIFHPGTGVNIHEAQKSDISQYVETEKEQSNKSVAETFLHIVPTNFFQSLVEGDLLAIICFT
VLFALGISAIGERGKPVLAFFEGVSHAMFHVVNLVMKVAPFGVFALIGVTVSKFGLGSLISLGKLVGLVYVALAFFLIVI
FGIVAKIAGISIFKFLAYMKDEILLAFSTSSSETVLPRIMEKMEKIGCPKGIVSFVIPIGYTFNLDGSVLYQSIAALFLA
QVYGIDLTIWHQITLVLVLMVTSKGMAAVPGTSFVVLLATLGTIGVPAEGLAFIAGVDRIMDMARTVVNLTGNALAAVVM
SKWEGMFNPAKAETVMSQSKTEQNATISG
>P24943 ~~~gltT~~~Proton/sodium-glutamate symport protein~~~
MRKIGLAWQIFIGLILGIIVGAIFYGNPKVATYLQPIGDIFLRLIKMIVIPIVISSLVVGVASVGDLKKLGKLGGKTIIY
FEIITTIAIVVGLLAANIFQPGTGVNMKSLEKTDIQSYVDTTNEVQHHSMVETFVNIVPKNIFESLTKGDMLPIIFFSVM
FGLGVAAIGEKGKPVLQFFQGTAEAMFYVTNQIMKFAPFGVFALIGVTVSKFGVESLIPLSKLVIVVYATMVFFIFVVLG
GVAKLFGINIFHIIKILKDELILAYSTASSETVLPKIMEKMENFGCPKAITSFVIPTGYSFNLDGSTLYQALAAIFIAQL
YGIDMPISQQISLLLVLMVTSKGIAGVPGVSFVVLLATLGTVGIPIEGLAFIAGIDRILDMARTAVNVIGNSLAAIIMSK
WEGQYNEEKGKQYIAQLQQSA
>P48243 7.4.2.1~~~gluA~~~Glutamate transport ATP-binding protein GluA~~~COG1126
MIKMTGVQKYFGDFHALTDIDLEIPRGQVVVVLGPSGSGKSTLCRTINRLETIEEGTIEIDGKVLPEEGKGLANLRADVG
MVFQSFNLFPHLTIKDNVTLAPIKVRKMKKSEAEKLAMSLLERVGIANQADKYPAQLSGGQQQRVAIARALAMNPKIMLF
DEPTSALDPEMVNEVLDVMASLAKEGMTMVCVTHEMGFARKAADRVLFMADGLIVEDTEPDSFFTNPKSDRAKDFLGKIL
AH
>P48242 ~~~gluB~~~Glutamate-binding protein GluB~~~COG0834
MSAKRTFTRIGAILGATALAGVTLTACGDSSGGDGFLAAIENGSVNVGTKYDQPGLGLRNPDNSMSGLDVDVAEYVVNSI
ADDKGWDHPTIEWRESPSAQRETLIQNGEVDMIAATYSINAGRSESVNFGGPYLLTHQALLVRQDDDRIETLEDLDNGLI
LCSVSGSTPAQKVKDVLPGVQLQEYDTYSSCVEALSQGNVDALTTDATILFGYSQQYEGDFRVVEMEKDGEPFTDEYYGI
GLKKDDQEGTDAINAALERMYADGTFQRLLTENLGEDSVVVEEGTPGDLSFLDAS
>P48244 ~~~gluC~~~Glutamate transport system permease protein GluC~~~COG0765
MSTLWADLGPSLLPAFWVTIKLTIYSAIGAMIFGTILTTMRVSPVKILRTLSTAYINTVRNTPLTLVVLFCSFGLYQNLG
LTLAGRESSTFLVDNNFRLAVLGFILYTSTFVAESLRSGINTVHFGQAEAARSLGLGFGATFRSIIFPQAVRAAIVPLGN
TLIALTKNTTIASVIGVGEASLLMKATIENHANMLFVVFAIFAVGFMILTLPMGLGLGKLSERLAVKK
>P48245 ~~~gluD~~~Glutamate transport system permease protein GluD~~~COG0765
MLSGNGQLDANKWTPFINSQTWTTYILPGLWGTLKSAVFSVILALVMGTALGLGRISEIRILRWFCAVIIETFRAIPVLI
LMIFAYQMFAQYNIVPSSQLAFAAVVFGLTMYNGSVIAEILRSGIASLPKGQKEAAIALGMSSRQTTWSILLPQAVAAML
PALISQMVIALKDSALGYQIGYIEVVRSGIQSASVNRNYLAALFVVALIMIVLNFSLTALASRIERQLRAGRARKNIVAK
VPEQPDQGLETKDNVNVDWQDPDYKDLKTPGVQ
>P54493 3.4.21.105~~~gluP~~~Rhomboid protease GluP~~~COG0457
MFLLEYTYWKIAAHLVNSGYGVIQAGESDEIWLEAPDKSSHDLVRLYKHDLDFRQEMVRDIEEQAERVERVRHQLGRRRM
KLLNVFFSTEAPVDDWEEIAKKTFEKGTVSVEPAIVRGTMLRDDLQAVFPSFRTEDCSEEHASFENAQMARERFLSLVLK
QEEQRKTEAAVFQNGKPTFTYLFIALQILMFFLLEINGGSTNTETLVAFGAKENSLIAQGEWWRLLTPIVLHIGIAHLAF
NTLALWSVGTAVERMYGSGRFLLIYLAAGITGSIASFVFSPYPSAGASGAIFGCLGALLYVALSNRKMFLRTIGTNIIVI
IIINLGFGFAVSNIDNSGHIGGLIGGFFAAAALGLPKAGAFGKRLLSAVLLIALAVGFLYYGLHSPSHQESALIQQASEL
YQEGKYEEVTELLNGEAAQKDASADLLKILAVSDIQIGEYDQAVSLLERAVKKEPKDHASYYNLALLYAEKNELAQAEKA
IQTAVKLKPKEQRYKELQRQIENNKES
>Q07006 3.4.21.82~~~sprE~~~Glutamyl endopeptidase 2~~~
VLGGGAIYGGGSRCSAAFNVTKGGARYFVTAGHCTNISANWSASSGGSVVGVREGTSFPTNDYGIVRYTDGSSPAGTVDL
YNGSTQDISSAANAVVGQAIKKSGSTTKVTSGTVTAVNVTVNYGDGPVYNMGRTTACSAGGDSGGAHFAGSVALGIHSGS
SGCSGTAGSAIHQPVTKALSAYGVTVYL
>P27305 6.1.1.-~~~gluQ~~~Glutamyl-Q tRNA(Asp) synthetase~~~COG0008
MLPPYFLFKEMTDTQYIGRFAPSPSGELHFGSLIAALGSYLQARARQGRWLVRIEDIDPPREVPGAAETILRQLEHYGLH
WDGDVLWQSQRHDAYREALAWLHEQGLSYYCTCTRARIQSIGGIYDGHCRVLHHGPDNAAVRIRQQHPVTQFTDQLRGII
HADEKLAREDFIIHRRDGLFAYNLAVVVDDHFQGVTEIVRGADLIEPTVRQISLYQLFGWKVPDYIHLPLALNPQGAKLS
KQNHAPALPKGDPRPVLIAALQFLGQQAEAHWQDFSVEQILQSAVKNWRLTAVPESAIVNSTFSNASC
>P54716 3.2.1.122~~~glvA~~~Maltose-6'-phosphate glucosidase~~~COG1486
MKKKSFSIVIAGGGSTFTPGIVLMLLDHLEEFPIRKLKLYDNDKERQDRIAGACDVFIREKAPDIEFAATTDPEEAFTDV
DFVMAHIRVGKYAMRALDEQIPLKYGVVGQETCGPGGIAYGMRSIGGVLEILDYMEKYSPDAWMLNYSNPAAIVAEATRR
LRPNSKILNICDMPVGIEDRMAQILGLSSRKEMKVRYYGLNHFGWWTSIQDQEGNDLMPKLKEHVSQYGYIPKTEAEAVE
ASWNDTFAKARDVQAADPDTLPNTYLQYYLFPDDMVKKSNPNHTRANEVMEGREAFIFSQCDMITREQSSENSEIKIDDH
ASYIVDLARAIAYNTGERMLLIVENNGAIANFDPTAMVEVPCIVGSNGPEPITVGTIPQFQKGLMEQQVSVEKLTVEAWA
EKSFQKLWQALILSKTVPNARVARLILEDLVEANKDFWPELDQSPTRIS
>P54717 ~~~glvR~~~HTH-type transcriptional regulator GlvR~~~COG1737
MQLEELINQHYSKLNDNDFHILKYILNHKHTCYHLGIDALAKACSVSRSSILRLAQKLGFSGYSEFRVFLKWEDQPEEGE
SMSFEKLLDDIEANLKFLRTKDMTDMCQLIDAADRIFVYGSGNAQKICARDLQRMFIPRHRYLILIEDTNEFNLMRDDFK
VNDLFIIISLSGETPELIPQARMLSAKGIPFISITNLKNNVLAQLTPHNLYATSKPVTLSDRTEIVAFAPFFLVGEALFR
AYVDYKEAEKNDNE
>P23524 2.7.1.165~~~garK~~~Glycerate 2-kinase~~~COG1929
MKIVIAPDSYKESLSASEVAQAIEKGFREIFPDAQYVSVPVADGGEGTVEAMIAATQGAERHAWVTGPLGEKVNASWGIS
GDGKTAFIEMAAASGLELVPAEKRDPLVTTSRGTGELILQALESGATNIIIGIGGSATNDGGAGMVQALGAKLCDANGNE
IGFGGGSLNTLNDIDISGLDPRLKDCVIRVACDVTNPLVGDNGASRIFGPQKGASEAMIVELDNNLSHYAEVIKKALHVD
VKDVPGAGAAGGMGAALMAFLGAELKSGIEIVTTALNLEEHIHDCTLVITGEGRIDSQSIHGKVPIGVANVAKKYHKPVI
GIAGSLTDDVGVVHQHGIDAVFSVLTSIGTLDEAFRGAYDNICRASRNIAATLAIGMRNAG
>P77364 2.7.1.31~~~glxK~~~Glycerate 3-kinase~~~COG1929
MKIVIAPDSFKESLSAEKCCQAIKAGFSTLFPDANYICLPIADGGEGTVDAMVAATGGNIVTLEVCGPMGEKVNAFYGLT
GDGKTAVIEMAAASGLMLVAPEKRNPLLASSFGTGELIRHALDNDIRHIILGIGGSATVDGGMGMAQALGVRFLDADGQA
LAANGGNLARVASIEMDECDPRLANCHIEVACDVDNPLVGARGAAAVFGPQKGATPEMVEELEQGLQNYARVLQQQTEIN
VCQMAGGGAAGGMGIAAAVFLNADIKPGIEIVLNAVNLAQAVQGAALVITGEGRIDSQTAGGKAPLGVASVAKQFNVPVI
GIAGVLGDGVEVVHQYGIDAVFSILPRLAPLAEVLASGETNLFNSARNIACAIKIGQGIKN
>P57098 2.7.1.31~~~glxK~~~Glycerate kinase~~~
MKIVIAPDSFKESLTAQQVAEAIKRGFQQSIADVECLLCPVGDGGEGTVDAIRHSLDLEEKCLQVTGSFGQKEVMRYFQK
EQLALFEVADLVGLGKIPLEKRNPLQIQTRGIGELIRHLISQEIKEIYIGVGGTASNDGGIGIAAGLGYQFYDEDGNALP
ACGQSLLNLASVSTENRYKIPEDVHIRILADVVSPLCGHQGATYTFGKQKGLDSTMFEVVDQAIQDFYEKVSPATLKLKG
AGAGGGIAGGLCAFAQASIVSGIDTCLDLIDFDKKVSDVDLVIVGEGRLDRQSLAGKAPIGVAKRTPVGVPVVAICGSLV
EDLPSLPFENIQAAFSILEKSEPLEDSLKNASLYLEHTASNIGHLLNMPKI
>P77161 1.1.1.60~~~glxR~~~2-hydroxy-3-oxopropionate reductase~~~COG2084
MKLGFIGLGIMGTPMAINLARAGHQLHVTTIGPVADELLSLGAVSVETARQVTEASDIIFIMVPDTPQVEEVLFGENGCT
KASLKGKTIVDMSSISPIETKRFARQVNELGGDYLDAPVSGGEIGAREGTLSIMVGGDEAVFERVKPLFELLGKNITLVG
GNGDGQTCKVANQIIVALNIEAVSEALLFASKAGADPVRVRQALMGGFASSRILEVHGERMIKRTFNPGFKIALHQKDLN
LALQSAKALALNLPNTATCQELFNTCAANGGSQLDHSALVQALELMANHKLA
>P9WGI9 2.1.2.1~~~glyA1~~~Serine hydroxymethyltransferase 1~~~COG0112
MTAAPDARTTAVMSAPLAEVDPDIAELLAKELGRQRDTLEMIASENFVPRAVLQAQGSVLTNKYAEGLPGRRYYGGCEHV
DVVENLARDRAKALFGAEFANVQPHSGAQANAAVLHALMSPGERLLGLDLANGGHLTHGMRLNFSGKLYENGFYGVDPAT
HLIDMDAVRATALEFRPKVIIAGWSAYPRVLDFAAFRSIADEVGAKLLVDMAHFAGLVAAGLHPSPVPHADVVSTTVHKT
LGGGRSGLIVGKQQYAKAINSAVFPGQQGGPLMHVIAGKAVALKIAATPEFADRQRRTLSGARIIADRLMAPDVAKAGVS
VVSGGTDVHLVLVDLRDSPLDGQAAEDLLHEVGITVNRNAVPNDPRPPMVTSGLRIGTPALATRGFGDTEFTEVADIIAT
ALATGSSVDVSALKDRATRLARAFPLYDGLEEWSLVGR
>Q3JGP5 2.1.2.1~~~glyA2~~~Serine hydroxymethyltransferase 2~~~
MSNANPFFSQSLAERDASVRGAILKELERQQSQVELIASENIVSRAVLDAQGSVLTNKYAEGYPGKRYYGGCEFADEVEA
LAIERVKRLFNAGHANVQPHSGAQANGAVMLALAKPGDTVLGMSLDAGGHLTHGAKPALSGKWFNALQYGVSRDTMLIDY
DQVEALAQQHKPSLIIAGFSAYPRKLDFARFRAIADSVGAKLMVDMAHIAGVIAAGRHANPVEHAHVVTSTTHKTLRGPR
GGFVLTNDEEIAKKINSAVFPGLQGGPLMHVIAGKAVAFGEALTDDFKTYIDRVLANAQALGDVLKAGGVDLVTGGTDNH
LLLVDLRPKGLKGAQVEQALERAGITCNKNGIPFDPEKPTITSGIRLGTPAGTTRGFGAAEFREVGRLILEVFEALRTNP
EGDHATEQRVRREIFALCERFPIY
>P9WGI7 2.1.2.1~~~glyA2~~~Serine hydroxymethyltransferase 2~~~COG0112
MNTLNDSLTAFDPDIAALIDGELRRQESGLEMIASENYAPLAVMQAQGSVLTNKYAEGYPGRRYYGGCEFVDGVEQLAID
RVKALFGAEYANVQPHSGATANAATMHALLNPGDTILGLSLAHGGHLTHGMRINFSGKLYHATAYEVSKEDYLVDMDAVA
EAARTHRPKMIIAGWSAYPRQLDFARFRAIADEVDAVLMVDMAHFAGLVAAGVHPSPVPHAHVVTSTTHKTLGGPRGGII
LCNDPAIAKKINSAVFPGQQGGPLEHVIAAKATAFKMAAQPEFAQRQQRCLDGARILAGRLTQPDVAERGIAVLTGGTDV
HLVLVDLRDAELDGQQAEDRLAAVDITVNRNAVPFDPRPPMITSGLRIGTPALAARGFSHNDFRAVADLIAAALTATNDD
QLGPLRAQVQRLAARYPLYPELHRT
>P39148 2.1.2.1~~~glyA~~~Serine hydroxymethyltransferase~~~COG0112
MKHLPAQDEQVFNAIKNERERQQTKIELIASENFVSEAVMEAQGSVLTNKYAEGYPGKRYYGGCEHVDVVEDIARDRAKE
IFGAEHVNVQPHSGAQANMAVYFTILEQGDTVLGMNLSHGGHLTHGSPVNFSGVQYNFVEYGVDKETQYIDYDDVREKAL
AHKPKLIVAGASAYPRTIDFKKFREIADEVGAYFMVDMAHIAGLVAAGLHPNPVPYADFVTTTTHKTLRGPRGGMILCRE
EFGKKIDKSIFPGIQGGPLMHVIAAKAVSFGEVLQDDFKTYAQNVISNAKRLAEALTKEGIQLVSGGTDNHLILVDLRSL
GLTGKVAEHVLDEIGITSNKNAIPYDPEKPFVTSGIRLGTAAVTSRGFDGDALEEVGAIIALALKNHEDEGKLEEARQRV
AALTDKFPLYKELDY
>P24531 2.1.2.1~~~glyA~~~Serine hydroxymethyltransferase~~~COG0112
MSLEMFDKEIFDLTNKELERQCEGLEMIASENFTLPEVMEVMGSILTNKYAEGYPGKRYYGGCEFVDEIETLAIERCKKL
FNCKFANVQPNSGSQANQGVYAALINPGDKILGMDLSHGGHLTHGAKVSSSGKMYESCFYGVELDGRIDYEKVREIAKKE
KPKLIVCGASAYARVIDFAKFREIADEIGAYLFADIAHIAGLVVAGEHPSPFPYAHVVSSTTHKTLRGPRGGIIMTNDEE
LAKKINSAIFPGIQGGPLMHVIAAKAVGFKFNLSDEWKVYAKQVRTNAQVLANVLMDRKFKLVSDGTDNHLVLMSFLDRE
FSGKDADLALGNAGITANKNTVPGEIRSPFITSGLRLGTPALTARGFKEKEMEIVSNYIADILDDVNNEKLQENIKQELK
KLASNFIIYERAMF
>Q72CT0 2.1.2.1~~~glyA~~~Serine hydroxymethyltransferase~~~COG0112
MDELLLQDPEVGKAIILEIERQTGKLELIASENFVSAAVRQAQGSVLTHKYAEGYPGKRYYGGCEFVDIAENIAIERART
IFGCEYANVQPHSGSQANMGVYFACLKPGDTILGMNLSHGGHLTHGSPVNFSGRLFNVVFYGVEKETGRIDYEQVAALAR
EHKPSLIVAGASAYPRTIDFARFRAIADEVGAKLMVDMAHIAGLVAAGYHPSPVQHAHYTTTTTHKTLRGPRGGMILSTE
DNGKTLNSQIFPGIQGGPLMHVIAAKAVAFGEALRPAFKEYQKQVVDNAAALAGVLTAAGFDLVSGGTDNHLMLVDLTSK
DVTGKDAEIALDKAGITVNKNTVPFETRSPFVTSGVRLGTPALTTRGMKAAEMEKVGGWIVDAIANTTNETRLAEISREV
ERFARQFPLFAW
>P0A825 2.1.2.1~~~glyA~~~Serine hydroxymethyltransferase~~~COG0112
MLKREMNIADYDAELWQAMEQEKVRQEEHIELIASENYTSPRVMQAQGSQLTNKYAEGYPGKRYYGGCEYVDIVEQLAID
RAKELFGADYANVQPHSGSQANFAVYTALLEPGDTVLGMNLAHGGHLTHGSPVNFSGKLYNIVPYGIDATGHIDYADLEK
QAKEHKPKMIIGGFSAYSGVVDWAKMREIADSIGAYLFVDMAHVAGLVAAGVYPNPVPHAHVVTTTTHKTLAGPRGGLIL
AKGGSEELYKKLNSAVFPGGQGGPLMHVIAGKAVALKEAMEPEFKTYQQQVAKNAKAMVEVFLERGYKVVSGGTDNHLFL
VDLVDKNLTGKEADAALGRANITVNKNSVPNDPKSPFVTSGIRVGTPAITRRGFKEAEAKELAGWMCDVLDSINDEAVIE
RIKGKVLDICARYPVYA
>B5Z9V7 2.1.2.1~~~glyA~~~Serine hydroxymethyltransferase~~~
MAYFLEQTDSEIFELIFEEYKRQNEHLEMIASENYTFPSVMEAMGSILTNKYAEGYPNKRYYGGCEVVDKIESLAIERAK
KLFNCQFANVQAHSGSQANNAVYHALLKPYDKILGMDLSCGGHLTHGAKVSLTGKHYQSFSYGVNLDGYIDYEEALKIAQ
SVKPEIIVCGFSAYPREIDFKKFREIADEVGALLLGDIAHVAGLVVTNEHAHPFPHCHVVSSTTHKTLRGPRGGIILTND
EEIAAKIDKAIFPGTQGGPLMHVIAAKAVGFKENLKPEFKAYAKLVKSNMQVLAKALKEKNHKLVSGGTSNHLLLMDFLD
KPYSGKDADIALGNAGITVNKNTIPGETRSPFVTSGIRIGSAALSARGMGAKEFEIIGNKISDILNDINNVSLQLHVKEE
LKAMANQFPVYQQPIF
>P56089 2.1.2.1~~~glyA~~~Serine hydroxymethyltransferase~~~COG0112
MAYFLEQTDSEIFELIFEEYKRQNEHLEMIASENYTFASVMEAMGSVLTNKYAEGYPNKRYYGGCEVVDKIESLAIERAK
KLFNCQFANVQAHSGSQANNAVYHALLKPYDKILGMDLSCGGHLTHGAKVSLTGKHYQSFSYGVNLDGYIDYEEALKIAQ
SVKPEIIVCGFSAYPREIDFKKFREIADEVGALLLGDIAHVAGLVVTGEHAHPFPHCHVVSSTTHKTLRGPRGGIILTND
EEIAAKIDKAIFPGTQGGPLMHVIAAKAVGFKENLKPEFKAYAQLVKSNMQVLAKALKEKNHKLVSGGTSNHLLLMDFLD
KPYSGKDADIALGNAGITVNKNTIPGETRSPFVTSGIRIGSAALSARGMGAKEFEIIGNKISDILNDINNVSLQLHVKEE
LKAMVNQFPVYHQPIF
>D3DKC4 2.1.2.1~~~glyA~~~Serine hydroxymethyltransferase~~~COG0112
MRHLFNTDAEIYEAIVKEYERQFYHLELIASENFTSLAVMEAQGSVMTNKYAEGLPHKRYYGGCEFVDIAEDLAIERAKA
LFDAEHANVQPHSGTQANMAVYMAVLKPGDTIMGMDLSHGGHLTHGAKVNFSGKIYNAVYYGVHPETHLIDYDQLYRLAK
EHKPKLIVGGASAYPRVIDWAKLREIADSVGAYLMVDMAHYAGLIAGGVYPNPVPYAHFVTSTTHKTLRGPRSGFILCKK
EFAKDIDKSVFPGIQGGPLMHVIAAKAVAFKEAMSQEFKEYARQVVANARVLAEEFIKEGFKVVSGGTDSHIVLLDLRDT
GLTGREVEEALGKANITVNKNAVPFDPLPPVKTSGIRLGTPAMTTRGMKEDQMRIIARLISKVIKNIGDEKVIEYVRQEV
IEMCEQFPLYPELREEINHLAKIKATY
>A0R2V7 2.1.2.1~~~glyA~~~Serine hydroxymethyltransferase~~~COG0112
MAADPSSNSSSVPAANGADYADTASAAYQAALQVIESVEPRVAAATRKELADQRDSLKLIASENYASPAVLLTMGTWFSD
KYAEGTIGHRFYAGCQNVDTVESVAAEHARELFGAPYAYVQPHSGIDANLVAFWAILATRVEAPELANFGAKHINDLSEA
DWETLRNKLGNQRLLGMSLDAGGHLTHGFRPNISGKMFHQRSYGTNPETGFLDYDAVAAAAREFKPLVLVAGYSAYPRRV
NFAKMREIADEVGATLMVDMAHFAGLVAGKVFTGDEDPVPHAHVTTTTTHKSLRGPRGGMVLATEEYAPAVDKGCPMVLG
GPLSHVMAAKAVALAEARQPAFQQYAQQVADNAQALADGFVKRDAGLVTGGTDNHIVLLDVTSFGLTGRQAESALLDAGI
VTNRNSIPADPNGAWYTSGVRLGTPALTSRGFGADDFDRVAELIVEVLANTQPEGTSKAKYKLADGTAERVHAASSELLS
ANPLYPGLTL
>A1SUU0 2.1.2.1~~~glyA~~~Serine hydroxymethyltransferase~~~COG0112
MFNRDMNIADYDPELWQSITDEVQRQEDHIELIASENYTSPRVMEAQGSQLTNKYAEGYPGKRYYGGCEYVDVAESLAIE
RAKSLFGADYANVQPHSGSQANAAVYQALCAPGDTILGMSLAHGGHLTHGSHVSFSGKMYNAVQYGITPETGILDYAEIE
RLAVEHKPTMIIAGFSAYSGIVDWAKFREIADKVGAYLFVDMAHVAGLVAAGLYPNPVPFADVVTTTTHKTLGGPRGGLI
LAKANEAIEKKLNSAVFPGQQGGPLMHVIAAKAVAFKECAEPEFAVYQQQVLDNAKAMVKSFLARGYKIVSGGTENHLFL
VDLIAQDITGKEADAALGNAHITVNKNSVPNDPRSPFVTSGLRIGTPALARRGVNAQQSAELALWMCDVLDAIKDEAKLA
TTITAVKVKVAALCKACPVYG
>A8GTI9 2.1.2.1~~~glyA~~~Serine hydroxymethyltransferase~~~
MNIFNNNLHETDKEINEIIKHEKLRQSSVIELIASENFVSPAVLEAQGALLTNKYAEGYPSKRFYNGCEEVDKAENLAIE
RVKKLFNCKYANVQPHSGSQANQAVYLALLQPGDTVLGMSLDSGGHLTHGAAPNMSGKWFNAVSYSVNKETYLIDYDEIE
RLADLHKPKLLIAGFSAYPRNIDFAKFREIVDKVGAYFMADIAHIAGLVATGEHQSPIPYAHAVTSTTHKTLRGPRGGLI
LSNDEEIGHKINSALFPGLQGGPLMHIIAAKAVAFLENLQPEYKSYIQQVISNAKALASSLQERGYDILTGGTDNHIVLV
DLRKDGITGKLAANSLDRAGITCNKNAIPFDETSPFITSGIRLGTPACTTRGFKEKDFVLVGHMVADILDGLKNNEDNSA
LEQQVLNEVTKLIELFPFYG
>P0A2E1 2.1.2.1~~~glyA~~~Serine hydroxymethyltransferase~~~
MLKREMNIADYDAELWQAMEQEKVRQEEHIELIASENYTSPRVMQAQGSQLTNKYAEGYPGKRYYGGCEYVDVVEQLAID
RAKELFGADYANVQPHSGSQANFAVYTALLQPGDTVLGMNLAQGGHLTHGSPVNFSGKLYNIVPYGIDESGKIDYDEMAK
LAKEHKPKMIIGGFSAYSGVVDWAKMREIADSIGAYLFVDMAHVAGLIAAGVYPNPVPHAHVVTTTTHKTLAGPRGGLIL
AKGGDEELYKKLNSAVFPSAQGGPLMHVIAGKAVALKEAMEPEFKVYQQQVAKNAKAMVEVFLNRGYKVVSGGTENHLFL
LDLVDKNLTGKEADAALGRANITVNKNSVPNDPKSPFVTSGIRIGSPAVTRRGFKEAEVKELAGWMCDVLDNINDEATIE
RVKAKVLDICARFPVYA
>Q5HE87 2.1.2.1~~~glyA~~~Serine hydroxymethyltransferase~~~
MSYITKQDKVIAEAIEREFQRQNSNIELIASENFVSEAVMEAQGSVLTNKYAEGYPGRRYYGGCEFVDVTESIAIDRAKA
LFGAEHVNVQPHSGSQANMAVYLVALEMGDTVLGMNLSHGGHLTHGAPVNFSGKFYNFVEYGVDKDTERINYDEVRKLAL
EHKPKLIVAGASAYSRTIDFKKFKEIADEVNAKLMVDMAHIAGLVAAGLHPNPVEYADFVTTTTHKTLRGPRGGMILCKE
EYKKDIDKTIFPGIQGGPLEHVIAAKAVAFGEALENNFKTYQQQVVKNAKVLAEALINEGFRIVSGGTDNHLVAVDVKGS
IGLTGKEAEETLDSVGITCNKNTIPFDQEKPFVTSGIRLGTPAATTRGFDEKAFEEVAKIISLALKNSKDEEKLQQAKER
VAKLTAEYPLYQ
>P99091 2.1.2.1~~~glyA~~~Serine hydroxymethyltransferase~~~
MSYITKQDKVIAEAIEREFQRQNSNIELIASENFVSEAVMEAQGSVLTNKYAEGYPGRRYYGGCEFVDVTESIAIDRAKA
LFGAEHVNVQPHSGSQANMAVYLVALEMGDTVLGMNLSHGGHLTHGAPVNFSGKFYNFVEYGVDKDTERINYDEVRKLAL
EHKPKLIVAGASAYSRTIDFKKFKEIADEVNAKLMVDMAHIAGLVAAGLHPNPVEYADFVTTTTHKTLRGPRGGMILCKE
EYKKDIDKTIFPGIQGGPLEHVIAAKAVAFGEALENNFKTYQQQVVKNAKVLAEALINEGFRIVSGGTDNHLVAVDVKGS
IGLTGKEAEETLDSVGITCNKNTIPFDQEKPFVTSGIRLGTPAATTRGFDEKAFEEVAKIISLALKNSKDEEKLQQAKER
VAKLTAEYPLYQ
>Q5M0B4 2.1.2.1~~~glyA~~~Serine hydroxymethyltransferase~~~
MIFDKEDYKAFDPELWNAIDAEAERQQNNIELIASENVVSKAVMAAQGTLLTNKYAEGYPGKRYYGGTAVIDVVETLAIE
RAKKLFGAKFANVQPHSGSQANAAVYMSLIQPGDTVMGMDLSAGGHLTHGAPVSFSGKTYNFVSYNVDKESELLDYDAIL
AQAKEVRPKLIVAGASAYSRIIDFAKFREIADAVGAYLMVDMAHIAGLVASGHHPSPVPYAHVTTTTTHKTLRGPRGGLI
LTDDEDIAKKLNSAVFPGLQGGPLEHVIAAKAVALKEALDPAFKEYGENVIKNAAAMADVFNQHPDFRVISGGTNNHLFL
VDVTKVVENGKVAQNVLEEVNITLNKNSIPYEQLSPFKTSGIRVGSPAITSRGMGEAESRQIAEWMVEALENHDKPEVLE
RIRGDVKVLTDAFPLY
>Q5M4W1 2.1.2.1~~~glyA~~~Serine hydroxymethyltransferase~~~COG0112
MIFDKEDYKAFDPELWNAIDAEAERQQNNIELIASENVVSKAVMAAQGTLLTNKYAEGYPGKRYYGGTAVIDVVETLAIE
RAKKLFGVKFANVQPHSGSQANAAVYMSLIQPGDTVMGMDLSAGGHLTHGAPVSFSGKTYNFVSYNVDKESELLDYDAIL
AQAKEVRPKLIVAGASAYSRIIDFAKFREIADAVGAYLMVDMAHIAGLVASGHHPSPVPYAHVTTTTTHKTLRGPRGGLI
LTDDEDIAKKLNSAVFPGLQGGPLEHVIAAKAVALKEALDPAFKEYGENVIKNAAAMADVFNQHPDFRVISGGTNNHLFL
VDVTKVVENGKVAQNVLEEVNITLNKNSIPYEQLSPFKTSGIRVGSPAITSRGMGEAESRQIAEWMVEALENHDKPEVLE
RIRGDVKVLTDAFPLY
>Q5SI56 2.1.2.1~~~glyA~~~Serine hydroxymethyltransferase~~~COG0112
MVSTLKRDEALFELIALEEKRQREGLELIASENFVSKQVREAVGSVLTNKYAEGYPGARYYGGCEVIDRVESLAIERAKA
LFGAAWANVQPHSGSQANMAVYMALMEPGDTLMGMDLAAGGHLTHGSRVNFSGKLYKVVSYGVRPDTELIDLEEVRRLAL
EHRPKVIVAGASAYPRFWDFKAFREIADEVGAYLVVDMAHFAGLVAAGLHPNPLPYAHVVTSTTHKTLRGPRGGLILSND
PELGKRIDKLIFPGIQGGPLEHVIAGKAVAFFEALQPEFKEYSRLVVENAKRLAEELARRGYRIVTGGTDNHLFLVDLRP
KGLTGKEAEERLDAVGITVNKNAIPFDPKPPRVTSGIRIGTPAITTRGFTPEEMPLVAELIDRALLEGPSEALREEVRRL
ALAHPMP
>A0A0H2URB1 ~~~glyD~~~Glycosyltransferase GlyD~~~COG1442
MNKTIVLAGDRNYTRQLETTIKSILYHNRDVKIYILNQDIMPDWFRKPRKIARMLGSEIIDVKLPEQTVFQDWEKQDHIS
SITYARYFIADYIQEDKVLYLDSDLIVNTSLEKLFSICLEEKSLAAVKDTDGITFNAGVLLINNKKWRQEKLKERLIEQS
IVTMKEVEEGRFEHFNGDQTIFNQVLQDDWLELGRAYNLQVGHDIVALYNNWQEHLAFNDKPVVIHFTTYRKPWTTLTAN
RYRDLWWEFHDLEWSQILQHHMGEFELISPLDKEFSCLTLTNSQDLEGIEELVTALPEVVFHIAAWTDMGDKLKKLAVYN
NVRLHPQIVPPVLDKLKKSTNLYLDINHGSADENFLKSLQEQEKTLLAFQSTQHGELGQIVFENGKVSFMIDTIKDFKKN
GHLTCFRQLPSLTCLTFTASQYIEQLDYLAGQLPNVVFQIAAWTAMGPKLYDLSNRYPNIQLYPAISRDKLDELKEKMDA
YLDINLLTSTSDIVAEMAHLSKPILAFYKSQNGNNGQRLYSSEHPERMLADLQKLITKDMLEKPLDIIQVKGIDETLDYI
IEHNSSLVRFGDGEINMLAGHSIPYQDYDEELVSIMRDIIGQESREDLVVCLPDAFTDRFRFTSWAIPFWKDHMDHYMDF
YRELCSDSWYGSTFVSRPYIDFEDKSQAKAQFEKLKSIWENRDLLIVEGATSRSGVGNDLFDEANSIKRIICPSHSAFSR
VHELEQEIEKYAGGRLILCMLGPTAKVLSYNLCQMGYQVLDVGHIDSEYEWMKMGAKTKVKFSHKHTAEHNFDQDIEFID
DETYNSQIVARILN
>A0A0H2URJ6 ~~~glyE~~~Glycosyltransferase GlyE~~~COG1442
MRNTKRAVVFAGDYAYIRQIETAMKSLCRHNSHLKIYLLNQDIPQEWFSQIRIYLQEMGGDLIDCKLIGSQFQMNWSNKL
PHINHMTFARYFIPDFVTEDKVLYLDSDLIVTGDLTDLFELDLGENYLAAARSCFGAGVGFNAGVLLINNKKWGSETIRQ
KLIDLTEKEHENVEEGDQSILNMLFKDQYSSLEDQYNFQIGYDYGAATFKHQFIFDIPLEPLPLILHYISQDKPWNQFSV
GRLREVWWEYSLMDWSVILNEWFSKSVKYPSKSQIFKLQCVNLTNSWCVEKIDYLAEQLPEVHFHIVAYTNMANELLALT
RFPNVTVYPNSLPMLLEQIVIASDLYLDLNHDRKLEDAYEFVLKYKKPMIAFDNTCSENLSEISYEGIYPSSIPKKMVAA
IRSYMR
>A0A0H2UR96 ~~~glyG~~~Glycosyltransferase GlyG~~~COG1215
MSELISVVVPIYNTGKYLVECVEHILKQTYQNIEIILVDDGSTDNSGEICDAFMMQDNRVRVLHQENKGGAAQAKNMGIS
VAKGEYITIVDSDDIVKENMIETLYQQVQEKDADVVIGNYYNYDESDGNFYFYVTGQDFCVEELAIQEIMNRQAGDWKFN
SSAFILPTFKLIKKELFNEVHFSNGRRFDDEATMHRFYLLASKIVFINDNLYLYRRRSGSIMRTEFDLSWARDIVEVFSK
KISDCVLAGLDVSVLRIRFVNLLKDYKQTLEYHQLTDTEEYKDICFRLKLFFDAEQRNGKS
>S5FMM4 1.4.3.19~~~thiO~~~Glycine oxidase~~~
MRKRYDTIVIGGGIIGTSIAYHLAKAGKKTAVFESGEVGKKATSAAAGMLGAHAECDKPGTFFEFARASQKAYKRLTGEL
KDISGIDIRRHDGGILKLAFSESDREHLMQMGALDSVEWLEADEVYKLEPNAGKGILGANFIRDDVHVEPAAVCRAFARG
ARMLGADVFEYTPVLSIESEAGAVRVTSASGTAEAEHAVIASGVWSGALFKQIGLDKRFYPVKGECLSVWNDGISLTRTL
YHDHCYIVPRHSGRLVVGATMKPGDWNEQPELGGIEELIRKAKSMLPGIESMKIDQCWAGLRPETGDGNPYIGRHPENDR
ILFAAGHFRNGILLAPATGEMMADMILGNPVKTEWIEAFKAERKEAVHR
>O31616 1.4.3.19~~~thiO~~~Glycine oxidase~~~COG0665
MKRHYEAVVIGGGIIGSAIAYYLAKENKNTALFESGTMGGRTTSAAAGMLGAHAECEERDAFFDFAMHSQRLYKGLGEEL
YALSGVDIRQHNGGMFKLAFSEEDVLQLRQMDDLDSVSWYSKEEVLEKEPYASGDIFGASFIQDDVHVEPYFVCKAYVKA
AKMLGAEIFEHTPVLHVERDGEALFIKTPSGDVWANHVVVASGVWSGMFFKQLGLNNAFLPVKGECLSVWNDDIPLTKTL
YHDHCYIVPRKSGRLVVGATMKPGDWSETPDLGGLESVMKKAKTMLPAIQNMKVDRFWAGLRPGTKDGKPYIGRHPEDSR
ILFAAGHFRNGILLAPATGALISDLIMNKEVNQDWLHAFRIDRKEAVQI
>Q5L2C2 1.4.3.19~~~thiO~~~Glycine oxidase~~~COG0665
MTHRYDVAIVGGGVIGAAIGFELAKRRHRVAIFEKGTMGSGASSAAAGMLGAQSEFSTSSPLVPLALQSRALMPALAEEL
RERTGIDIGLVEKGLIKLATTEEEADDLYRHYTFWRGIGEPVQWLTKGEALEMEPRLAAEALAGAMYIPGDGQVSAPDLA
AALAYAAASAGACLYEYTEVFDIRSDSSGHVLDTTGGTFAAEAVVIASGAWAARLGARVGLSLSVYPVKGECVMVRAPVP
LLQTTVFAKNGCYIVPKSGNRLLIGATSTPGTFDRRVSAGGVMNLLHRAAHLVPDIEQAEWVASWSGIRPQTEDGLPYLG
EHPERRGLFVAAGHYRNGILLSPLTGLLVADLVERKETAFDLAPFSLTRHIGKVGVE
>Q88Q83 1.4.3.19~~~thiO~~~Glycine oxidase~~~COG0665
MSKQVVVVGGGVIGLLTAFNLAAKVGQVVVCDQGEVGRESSWAGGGIVSPLYPWRYSPAVTALAHWSQDFYPQLGERLFA
STGVDPEVHTTGLYWLDLDDEAEALAWAAREQRPLSAVDISAAYDAVPVLGPGFKHAIYMAGVANVRNPRLVKSLKAALL
ALPNVSLREHCQITGFVQEKGRVTGVQTADGVLAADEVVLSAGAWSGDLLRTLGLELPVEPVKGQMILFKCAEDFLPSMV
LAKGRYAIPRRDGHILVGSTLEHAGYDKTPTADALESLKASAVELLPELEGATVVAHWAGLRPGSPEGIPYIGPVPGHEG
LWLNCGHYRNGLVLAPASCQLFTDLLTGAEPIIDPAPYAPEGRLG
>P9WMX5 2.4.-.-~~~pimF~~~Putative glycosyltransferases~~~COG1216
MRLSIVTTMYMSEPYVLEFYRRARAAADKITPDVEIIFVDDGSPDAALQQAVSLLDSDPCVRVIQLSRNFGHHKAMMTGL
AHATGDLVFLIDSDLEEDPALLEPFYEKLISTGADVVFGCHARRPGGWLRNFGPKIHYRASALLCDPPLHENTLTVRLMT
ADYVRSLVQHQERELSIAGLWQITGFYQVPMSVNKAWKGTTTYTFRRKVATLVDNVTSFSNKPLVFIFYLGAAIFIISSS
AAGYLIIDRIFFRALQAGWASVIVSIWMLGGVTIFCIGLVGIYVSKVFIETKQRPYTIIRRIYGSDLTTREPSSLKTAFP
AAHLSNGKRVTSEPEGLATGNR
>Q9JRN5 4.2.1.47~~~gmd~~~GDP-mannose 4,6-dehydratase~~~COG1089
MKTAIVTGASGQDGAYLSQLLLDKGYKVYATYRRSSSVNLWRIDELNIRNHPNLHLFEFDLTDMSSCISLVTKAQPGEVY
NLAAQSFVGVSFSQPVTTAEITAIGVLNLLEAIRIINPKIKFYQASTSEMFGKVQQIPQTEKTPFYPRSPYGVAKLYGHW
ITLNYRESYDIFGCSGILFNHESPLRGREFVTRKITDTVAKIALNKQSCLELGNLDAKRDWGFAKEYVEGMWRMLQEDQP
DTYVLATNRTETVRDFVAMAFQAVNIPLEFNGKGENEIGVNTDTGDVLVRVNKEYYRPAEVDLLIGDYSKAKRILGWEPK
TSLEELCKMMIEADIERNKLGFSF
>P0AC88 4.2.1.47~~~gmd~~~GDP-mannose 4,6-dehydratase~~~COG1089
MSKVALITGVTGQDGSYLAEFLLEKGYEVHGIKRRASSFNTERVDHIYQDPHTCNPKFHLHYGDLSDTSNLTRILREVQP
DEVYNLGAMSHVAVSFESPEYTADVDAMGTLRLLEAIRFLGLEKKTRFYQASTSELYGLVQEIPQKETTPFYPRSPYAVA
KLYAYWITVNYRESYGMYACNGILFNHESPRRGETFVTRKITRAIANIAQGLESCLYLGNMDSLRDWGHAKDYVKMQWMM
LQQEQPEDFVIATGVQYSVRQFVEMAAAQLGIKLRFEGTGVEEKGIVVSVTGHDAPGVKPGDVIIAVDPRYFRPAEVETL
LGDPTKAHEKLGWKPEITLREMVSEMVANDLEAAKKHSLLKSHGYDVAIALES
>Q51366 4.2.1.47~~~gmd~~~GDP-mannose 4,6-dehydratase~~~
MTRSALVTGITGQDGAYLAKLLLEKGYRVHGLVARRSSDTRWRLRELGIEGDIQYEDGDMADACSVQRAVIKAQPQEVYN
LAAQSFVGASWNQPVTTGVVDGLGVTHLLEAIRQFSPETRFYQASTSEMFGLIQAERQDENTPFYPRSPYGVAKLYGHWI
TVNYRESFGLHASSGILFNHESPLRGIEFVTRKVTDAVARIKLGKQQELRLGNVDAKRDWGFAGDYVEAMWLMLQQDKAD
DYVVATGVTTTVRDMCQIAFEHVGLDYRDFLKIDPAFFRPAEVDVLLGNPAKAQRVLGWKPRTSLDELIRMMVEADLRRV
SRE
>A9ZPH9 6.3.4.12~~~~~~Glutamate--methylamine ligase~~~
MKSLEEAQKFLEDHHVKYVLAQFVDIHGVAKVKSVPASHLNDILTTGAGFAGGAIWGTGIAPNGPDYMAIGELSTLSLIP
WQPGYARLVCDGHVNGKPYEFDTRVVLKQQIARLAEKGWTLYTGLEPEFSLLKKDEHGAVHPFDDSDTLQKPCYDYKGIT
RHSPFLEKLTESLVEVGLDIYQIDHEDANGQFEINYTYADCLKSADDYIMFKMAASEIANELGIICSFMPKPFSNRPGNG
MHMHMSIGDGKKSLFQDDSDPSGLGLSKLAYHFLGGILAHAPALAAVCAPTVNSYKRLVVGRSLSGATWAPAYIAYGNNN
RSTLVRIPYGRLELRLPDGSCNPYLATAAVIAAGLDGVARELDPGTGRDDNLYDYSLEQLAEFGIGILPQNLGEALDALE
ADQVIMDAMGPGLSKEFVELKRMEWVDYMRHVSDWEINRYVQFY
>F5RH07 6.3.4.12~~~gms~~~Glutamate--methylamine ligase~~~COG0174
MSPSEAQQFLKENQVKYILAQFVDIHGSAKTKSVPAEHYKTVVTDGAGFAGFAIWGMGMTPNVDADYMAVGDASTLSLVP
WQPGYARIACDGHTHGKPHEYDTRVVLKKQLEQITARGWTFFTGMEPEFSLLRKVEGKLLPADPGDTLSKPCYDYKGLSR
ARVFLERLSESLRSVGIDVYQIDHEDANGQFEINYTFTDALTSCDHYTFFKMGAAEIAAELGLICSFMPKPFSNRPGNGL
HMHMSIGDGKRNLFEDKSDKHGLALSKLAYHWAAGLLKHAPALAALCCPTVNSYKRLVVGRSLTGATWAPAYICYGGNNR
SGMIRSPGGRLELRLPDASCNAYLATAAVIAAGMDGVINELDPGAPQNDNLYEYSQAQLDAAGIKVLPQNLHEALLALEK
DEVIRSALGPVVDEFLRLKHMEWVEYMRHVSDWEVNSYLEFF
>Q5FPE5 1.1.1.119~~~~~~Glucose 1-dehydrogenase~~~COG1028
MPAPYKDRFAGKKVLVTGASQGIGEATALRFAEEGAQVALNGRKEDKLIAVREKLPKVSGGEHPIATGDISKEDDVKRLV
AESIKAMGGLDVLVCNAGYQIPSPSEDIKLEDFEGVMAVNVTGVMLPCREVIRYWLENGIKGTIIVNSSVHQIIPKPHYL
GYSASKGAVGNIVRTLALEYATRGIRVNAVAPGAIVTPINMSWIDDPEQYKAVSSHIPMKRPGESREIADAITFLAAEDS
TYITGQTLYVDGGLTLYGDFENNWSS
>Q9PNE6 5.3.1.28~~~gmhA1~~~Phosphoheptose isomerase 1~~~COG0279
MINLVEKEWQEHQKIVQASEILKGQIAKVGELLCECLKKGGKILICGNGGSAADAQHFAAELSGRYKKERKALAGIALTT
DTSALSAIGNDYGFEFVFSRQVEALGNEKDVLIGISTSGKSPNVLEALKKAKELNMLCLGLSGKGGGMMNKLCDHNLVVP
SDDTARIQEMHILIIHTLCQIIDESF
>Q9PMN3 5.3.1.28~~~gmhA2~~~Phosphoheptose isomerase 2~~~COG0279
MENLNSYIKGHFADSILVKEQILKDENLITLIKNASLEVIKAYKNGNKTLLAGNGGSAADAQHIAGEFVSRFYFDRPGIA
SIALTTDTSILTAIGNDYGYENLFARQVQAQGVKGDVFIGISTSGNSKNILKALEFCKQKEIISIGLSGASGGAMNELCD
YCIKVPSTCTPRIQEAHILIGHIICAIVEEELFGKGFSCKQ
>Q93UJ2 5.3.1.28~~~gmhA~~~Phosphoheptose isomerase~~~COG0279
MENRELTYITNSIAEAQRVMAAMLADERLLATVRKVADACIASIAQGGKVLLAGNGGSAADAQHIAGEFVSRFAFDRPGL
PAVALTTDTSILTAIGNDYGYEKLFSRQVQALGNEGDVLIGYSTSGKSPNILAAFREAKAKGMTCVGFTGNRGGEMRELC
DLLLEVPSADTPKIQEGHLVLGHIVCGLVEHSIFGKQ
>Q47VU0 5.3.1.28~~~gmhA~~~Phosphoheptose isomerase~~~COG0279
MLEQIKNNFTESIQTQIAASELLGPSIEHAGMMMVQCLLGGNKIISCGNGGSAGHAQHFCAQLLNKYETERPSLPAISLN
SDISTITSIANDYQYDEVFSKQIRALGHNGDVLLAISTSGNSRNVVKAIESAVSRDIPIIALTGFDGGDISGLLGEGDVE
IRVPSARTSRIQEVHLVVLHSLCEIIDTTLFPQGDS
>P63224 5.3.1.28~~~gmhA~~~Phosphoheptose isomerase~~~COG0279
MYQDLIRNELNEAAETLANFLKDDANIHAIQRAAVLLADSFKAGGKVLSCGNGGSHCDAMHFAEELTGRYRENRPGYPAI
AISDVSHISCVGNDFGFNDIFSRYVEAVGREGDVLLGISTSGNSANVIKAIAAAREKGMKVITLTGKDGGKMAGTADIEI
RVPHFGYADRIQEIHIKVIHILIQLIEKEMVK
>P9WGG1 5.3.1.28~~~gmhA~~~Phosphoheptose isomerase~~~COG0279
MCTARTAEEIFVETIAVKTRILNDRVLLEAARAIGDRLIAGYRAGARVFMCGNGGSAADAQHFAAELTGHLIFDRPPLGA
EALHANSSHLTAVANDYDYDTVFARALEGSARPGDTLFAISTSGNSMSVLRAAKTARELGVTVVAMTGESGGQLAEFADF
LINVPSRDTGRIQESHIVFIHAISEHVEHALFAPRQ
>Q5F5E3 5.3.1.28~~~gmhA~~~Phosphoheptose isomerase~~~
MTTLQERVAAHFAESIRAKQEAEKILVEPTVQAAELMLQCLMNDGKILACGNGGSAADAQHFAAEMTGRFEKERMELAAV
ALTTDTSALTAIGNDYGFDHVFSKQVRALGRAGDVLVGISTSGNSANVIEAVKAAHERDMHVIALTGRDGGKIAAMLKDT
DVLLNVPHPRTARIQENHILLIHAMCDCIDSVLLEGM
>Q02H15 5.3.1.28~~~gmhA~~~Phosphoheptose isomerase~~~
MDMQHRIRQLFQASIETKQQALEVLPPYIEQASLVMVNALLNEGKILSCGNGGSAGDAQHFSSELLNRFERERPSLPAVA
LTTDSSTITSIANDYSYNEVFSKQIRALGQPGDVLLAISTSGNSANVIQAIQAAHDREMLVVALTGRDGGGMASLLLPED
VEIRVPSKITARIQEVHLLAIHCLCDLIDRQLFGSEE
>Q9HVZ0 5.3.1.28~~~gmhA~~~Phosphoheptose isomerase~~~
MDMQHRIRQLFQASIETKQQALEVLPPYIEQASLVMVNALLNEGKILSCGNGGSAGDAQHFSSELLNRFERERPSLPAVA
LTTDSSTITSIANDYSYNEVFSKQIRALGQPGDVLLAISTSGNSANVIQAIQAAHDREMLVVALTGRDGGGMASLLLPED
VEIRVPSKITARIQEVHLLAIHCLCDLIDRQLFGSEE
>Q9KPY2 5.3.1.28~~~gmhA~~~Phosphoheptose isomerase~~~COG0279
MYQDLIRSELTEAADVLQKFLSDDHNIAQIEAAAKLIADSFKQGGKVLSCGNGGSHCDAMHFAEELTGRYRENRPGYPGI
AISDPSHLSCVSNDFGYDYVFSRYVEAVGAKGDVLFGLSTSGNSGNILKAIEAAKAKGMKTIALTGKDGGKMAGLADVEI
RVPHFGYADRIQEVHIKIIHIIIQLIEKEMA
>Q9AGY5 3.1.3.83~~~gmhB~~~D-glycero-alpha-D-manno-heptose-1,7-bisphosphate 7-phosphatase~~~
MKNKALFLDRDGVINVEKNYVHKIEDFEFMDGIFETLRYFQEKGYLLIIITNQAGIGRGYYTEEQFHILNDWMLSEFEKE
GIYITKVYYCPYHPEHGIGKYKRDSFDRKPNPGMILKSQKEFNIDLSKSILVGDKESDIQAGKRAGVNVNIIFSNNKNGD
ELDCCKKINSLSELVSLIL
>Q8AAI7 3.1.3.83~~~gmhB~~~D-glycero-alpha-D-manno-heptose-1,7-bisphosphate 7-phosphatase~~~COG0241
MRLQDIDVTGFETLLLDRDGVVNRLRPDDYVKKWEEFEFLPGVLEILKAWNTHFKYIFIVTNQRGVGKEIMSEEDLKHIH
ERMISEVKNYGGRIDRIYYCTALTDSDINRKPGIGMFLQILRDYPDIDKAKCLMIGDSDSDIKFAKNCGIVGIKVI
>Q6TG07 3.1.3.83~~~gmhB~~~D-glycero-alpha-D-manno-heptose-1,7-bisphosphate 7-phosphatase~~~COG0241
MKTKALFLDRDGVINIDKKYVYKIEDFEFCDGIFELCRYFLARNYLLFIATNQSGIARGYYKESDFFKLCDYMLKEFAKQ
DIKIDKIYHCPHLEGCECRKPKAGMLLKAKDEFDLDMKNSIFIGDNLSDMQAGLNADIGTLILVNEEKKEGDFFRQFKNL
KEILNFFKEKDI
>Q7WG29 3.1.3.82~~~~~~D-glycero-beta-D-manno-heptose-1,7-bisphosphate 7-phosphatase~~~COG0241
MKLIILDRDGVVNQDSDAFVKSPDEWIALPGSLQAIARLTQADWTVVLATNQSGLARGLFDTATLNAIHDKMHRALAQMG
GVVDAIFMCPHGPDDGCACRKPLPGMYRDIARRYDVDLAGVPAVGDSLRDLQAAAQAGCAPWLVQTGNGRKTLAQGGLPE
GTRVCEDLAAVAEQLLQEA
>P63228 3.1.3.82~~~gmhB~~~D-glycero-beta-D-manno-heptose-1,7-bisphosphate 7-phosphatase~~~COG0241
MAKSVPAIFLDRDGTINVDHGYVHEIDNFEFIDGVIDAMRELKKMGFALVVVTNQSGIARGKFTEAQFETLTEWMDWSLA
DRDVDLDGIYYCPHHPQGSVEEFRQVCDCRKPHPGMLLSARDYLHIDMAASYMVGDKLEDMQAAVAANVGTKVLVRTGKP
ITPEAENAADWVLNSLADLPQAIKKQQKPAQ
>P46452 3.1.3.82~~~gmhB~~~D-glycero-beta-D-manno-heptose-1,7-bisphosphate 7-phosphatase~~~COG0241
MNKAIFLDRDGTLNIDYGYVHEIDNFKFIDGVIDALRELKKMGYMLVLVTNQSGIARGYFSEDQFLQLTEWMDWSLAEQD
VDLDGIYYCPHHSEGKGEYKEDCDCRKPKSGMLLQAIKELKIDPTQSIMVGDKVEDLKAGIGAKVKMNVLVRTGKPVTGE
GEGIADYVLDSIVDLPRILKRLKK
>Q88RS0 3.1.3.82~~~gmhB~~~D-glycero-beta-D-manno-heptose-1,7-bisphosphate 7-phosphatase~~~COG0241
MKLLILDRDGVINYDSDAYIKTLEEWVPIPGSVDAIAQLSKAGWTVAVATNQSGIARGYYPLATLEAMHARLRALVAEQG
GEVGHIVYCPHGPDEGCDCRKPKPGMLRAIAEHYQIGLEGVWFVGDSKGDLEAALAVGAQPVLVKTGKGERTLEKGVPET
TLIFDDLAAIARELI
>Q98I56 3.1.3.82~~~gmhB~~~D-glycero-beta-D-manno-heptose-1,7-bisphosphate 7-phosphatase~~~COG0241
MADKTGTPHPLTEPGVWIERIGGRVFPPHLPALFLDRDGTINVDTDYPSDPAEIVLRPQMLPAIATANRAGIPVVVVTNQ
SGIARGYFGWSAFAAVNGRVLELLREEGVFVDMVLACAYHEAGVGPLAIPDHPMRKPNPGMLVEAGKRLALDLQRSLIVG
DKLADMQAGKRAGLAQGWLVDGEAAVQPGFAIRPLRDSSELGDLLAAIETLGRDNRS
>Q6N2R1 3.1.3.82~~~gmhB~~~D-glycero-beta-D-manno-heptose-1,7-bisphosphate 7-phosphatase~~~COG0241
MTASAPRRPAAFLDRDGVINYNDHYVGTRERLRWMPGIAAAIRQLNAAGYYVFIITNQSGVARGMFSEDDVRALHRWMLD
ELNTQGARIDDVRFCPHHVEGTLDAYRVACEHRKPGPGMILDLAKTWPVDMTRSFVIGDSASDVEAAKAAGIPGFRFEGE
DIDVFVKQVLIEMQRAAVSN
>Q8Z989 3.1.3.82~~~gmhB~~~D-glycero-beta-D-manno-heptose-1,7-bisphosphate 7-phosphatase~~~COG0241
MAKSVPAIFLDRDGTINVDHGYVHEIDAFEFIDGVIDAMRELKKMGYALVVVTNQSGIARGKFTEAQFETLTEWMDWSLA
DRDVDLDGIYYCPHHPQGSIEEFRQVCDCRKPHPGMLISARDFLHIDMAASYMVGDKLEDMQAAAAANVGTKVLVRTGKP
VTAEAENAADWVLNSLADLPSAIKKQQK
>Q9KTJ4 3.1.3.82~~~gmhB~~~D-glycero-beta-D-manno-heptose-1,7-bisphosphate 7-phosphatase~~~COG0241
MAKPAVFLDRDGVINVDHGYVHDEHDFQFIEGVFEATAALQRMGYLLVLVTNQSGIARGKFSEERFISLTQWMDWNFADN
GVEFDGIYYCPHHAEHGIGQYKEECDCRKPKPGMFLSARDFLNIDMANSVMVGDKAEDMMAAEAAGVGTKILVRTGKPIT
EQGEALATVVLDSIRDVPHYLLRVKK
>Q0P8I7 1.1.98.-~~~~~~GDP-D-glycero-alpha-D-manno-heptose dehydrogenase~~~COG0451
MSKKVLITGGAGYIGSVLTPILLEKGYEVCVIDNLMFDQISLLSCFHNKNFTFINGDAMDENLIRQEVAKADIIIPLAAL
VGAPLCKRNPKLAKMINYEAVKMISDFASPSQIFIYPNTNSGYGIGEKDAMCTEESPLRPISEYGIDKVHAEQYLLDKGN
CVTFRLATVFGISPRMRLDLLVNDFTYRAYRDKFIVLFEEHFRRNYIHVRDVVKGFIHGIENYDKMKGQAYNMGLSSANL
TKRQLAETIKKYIPDFYIHSANIGEDPDKRDYLVSNTKLEATGWKPDNTLEDGIKELLRAFKMMKVNRFANFN
>P32056 3.6.1.-~~~gmm~~~GDP-mannose mannosyl hydrolase~~~COG1051
MFLRQEDFATVVRSTPLVSLDFIVENSRGEFLLGKRTNRPAQGYWFVPGGRVQKDETLEAAFERLTMAELGLRLPITAGQ
FYGVWQHFYDDNFSGTDFTTHYVVLGFRFRVSEEELLLPDEQHDDYRWLTSDALLASDNVHANSRAYFLAEKRTGVPGL
>L7N6A5 2.7.7.13~~~manB~~~Mannose-1-phosphate guanylyltransferase~~~COG1208
MATHQVDAVVLVGGKGTRLRPLTLSAPKPMLPTAGLPFLTHLLSRIAAAGIEHVILGTSYKPAVFEAEFGDGSALGLQIE
YVTEEHPLGTGGGIANVAGKLRNDTAMVFNGDVLSGADLAQLLDFHRSNRADVTLQLVRVGDPRAFGCVPTDEEDRVVAF
LEKTEDPPTDQINAGCYVFERNVIDRIPQGREVSVEREVFPALLADGDCKIYGYVDASYWRDMGTPEDFVRGSADLVRGI
APSPALRGHRGEQLVHDGAAVSPGALLIGGTVVGRGAEIGPGTRLDGAVIFDGVRVEAGCVIERSIIGFGARIGPRALIR
DGVIGDGADIGARCELLSGARVWPGVFLPDGGIRYSSDV
>P80078 5.4.99.1~~~glmS~~~Glutamate mutase sigma subunit~~~
MEKKTIVLGVIGSDCHAVGNKILDHAFTNAGFNVVNIGVLSPQEVFIKAAIETKADAILLSSLYGQGEIDCKGLRQKCDE
AGLEGILLYVGGNIVVGKQHWPDVEKRFKDMGYDRVYAPGTPPEVGIADLKKDLNIE
>Q05488 5.4.99.1~~~glmS~~~Glutamate mutase sigma subunit~~~
MEKKTIVLGVIGSDCHAVGNKILDHSFTNAGFNVVNIGVLSSQEDFINAAIETKADLICVSSLYGQGEIDCKGLREKCDE
AGLKGIKLFVGGNIVVGKQNWPDVEQRFKAMGFDRVYPPGTSPETTIADMKEVLGVE
>O05508 3.2.1.86~~~gmuD~~~6-phospho-beta-glucosidase GmuD~~~COG2723
MAHTEQYRFPKDFWWGSSASATQMEGAADRDGKGQNIWDYWFEKEPHRFFDHVGPADTSQFYDNYKEDIRLMKELGHNSF
RMSISWSRLIPNGTGEINDKAADFYNNVIDELIANGIEPFVNLFHFDMPMALQKIGGWVNRETVDAYENYARTCFRLFGG
RVKKWFTHNEPIVPVEGGYLYDFHYPNKVDFKEAVQVGFHTMLSSARAIQAYREMKQDGKIGIILNLTPSYPRSSHPADV
KAGEIADAFFNRSFLDPSVKGEFPKELVDILKHEGFMPDYNAEDLDIIKKNTVDLLGVNYYQPRRVKAKEHLPNPDAPFL
PDRYFDPYVMPGRKMNPHRGWEIYEKGVYDILINLKENYGNIECFISENGMGVEGEERFRDEQGIIQDDYRIEFIKEHLK
WIHRAIQEGSNVKGYHLWTFMDNWSWTNAYKNRYGFVSVNLEKDGERTVKKSGKWFKEVAEHSGF
>O05509 ~~~gmuR~~~HTH-type transcriptional regulator GmuR~~~COG2188
MNKYEIIANEMRNRIKNNVYPIDQPIPDEVSLAKEFNSSRMTMKRALDNLVAEGLLFRKRGHGTFIIQSAIQDDHVHVVS
NEILGLTNLLKDKKIKSKVIQFEVQFPTEEVAAHLSIDQKTPVYYVVRLRIVEGEPYVLEKTYMPTHLIPGINDDVLHDS
IYNHITNVLQLKIAGTHRKIRACKSDHIDQQHLGCKQDDPILEVEHVGFLDTGIPFEYSFSRHRHDKFVVTSVNIRR
>B8QSK0 2.4.1.-~~~wclY~~~O-antigen biosynthesis glycosyltransferase WclY~~~
MKIAYVVSSKKKCGPNIVILNIVKELANKHEMEIFFLDESDDDVFECVNVKSTQIKKASDLKEHLKRFDIIHSSGIRPDA
LVVLCKVIYRVKCKIITTIHNYVFQDLYYSYGLVKSLIWGLLWCSIWLFFDKLVILSKNADNYYWFLPSAKKNIIYNGID
DNDCLQNKKCNYRKEFNIPDDGILAGSCANLTKCKGIDLVIQTLTKEHKIYYIVAGDGIEKHNLINLVKARKLHERVYFI
DFLDEPESFMSQLDVFLMPSRSEGFGLTVLESTKLGIPVITSNIPIFMELFDQMCLTFDIKNPSTLIDVITYAKKNRLHL
SQKFHAIFQDRFTSSKMATKYENVYNNLFREVL
>P0DJQ5 2.3.1.35~~~oat2~~~Glutamate N-acetyltransferase 2~~~
MSDSTPKTPRGFVVHTAPVGLADDGRDDFTVLASTAPATVSAVFTRSRFAGPSVVLCREAVADGQARGVVVLARNANVAT
GLEGEENAREVREAVARALGLPEGEMLIASTGVIGRQYPMESIREHLKTLEWPAGEGGFDRAARAIMTTDTRPKEVRVSV
GGATLVGIAKGVGMLEPDMATLLTFFATDARLDPAEQDRLFRRVMDRTFNAVSIDTDTSTSDTAVLFANGLAGEVDAGEF
EEALHTAALALVKDIASDGEGAAKLIEVQVTGARDDAQAKRVGKTVVNSPLVKTAVHGCDPNWGRVAMAIGKCSDDTDID
QERVTIRFGEVEVYPPKARGDQADDALRAAVAEHLRGDEVVIGIDLAIADGAFTVYGCDLTEGYVRLNSEYTT
>P0DQD7 ~~~gndA~~~Protein GndA~~~
MLLLIPSNHISIKETSSLMVVTPSSRTLFVVIVSFQQRALTSSVPVFLAVKRGR
>Q7BJX9 5.1.3.7~~~wbgU~~~UDP-N-acetylglucosamine 4-epimerase~~~
MDIYMSRYEEITQQLIFSPKTWLITGVAGFIGSNLLEKLLKLNQVVIGLDNFSTGHQYNLDEVKTLVSTEQWSRFCFIEG
DIRDLTTCEQVMKGVDHVLHQAALGSVPRSIVDPITTNATNITGFLNILHAAKNAQVQSFTYAASSSTYGDHPALPKVEE
NIGNPLSPYAVTKYVNEIYAQVYARTYGFKTIGLRYFNVFGRRQDPNGAYAAVIPKWTAAMLKGDDVYINGDGETSRDFC
YIDNVIQMNILSALAKDSAKDNIYNVAVGDRTTLNELSGYIYDELNLIHHIDKLSIKYREFRSGDVRHSQADVTKAIDLL
KYRPNIKIREGLRLSMPWYVRFLKG
>P75767 ~~~ybhK~~~Putative gluconeogenesis factor~~~COG0391
MRNRTLADLDRVVALGGGHGLGRVLSSLSSLGSRLTGIVTTTDNGGSTGRIRRSEGGIAWGDMRNCLNQLITEPSVASAM
FEYRFGGNGELSGHNLGNLMLKALDHLSVRPLEAINLIRNLLKVDTHLIPMSEHPVDLMAIDDQGHEVYGEVNIDQLTTP
IQELLLTPNVPATREAVHAINEADLIIIGPGSFYTSLMPILLLKEIAQALRRTPAPMVYIGNLGRELSLPAANLKLESKL
AIMEQYVGKKVIDAVIVGPKVDVSAVKERIVIQEVLEASDIPYRHDRQLLHNALEKALQALG
>Q9K706 ~~~~~~Gluconeogenesis factor~~~COG0391
MKKKNVVVFGGGTGLSVLLRGLKTFPVSITAIVTVADDGGSSGRLRKELDIPPPGDVRNVLVALSEVEPLLEQLFQHRFE
NGNGLSGHSLGNLLLAGMTSITGDFARGISEMSKVLNVRGKVLPASNRSIILHGEMEDGTIVTGESSIPKAGKKIKRVFL
TPKDTKPLREGLEAIRKADVIVIGPGSLYTSVLPNLLVPGICEAIKQSTARKVYICNVMTQNGETDGYTASDHLQAIMDH
CGVGIVDDILVHGEPISDTVKAKYAKEKAEPVIVDEHKLKALGVGTISDYFVLEQDDVLRHNASKVSEAILEGKPRTSSS
IQ
>P9WMU5 ~~~~~~Putative gluconeogenesis factor~~~COG0391
MTDGIVALGGGHGLYATLSAARRLTPYVTAVVTVADDGGSSGRLRSELDVVPPGDLRMALAALASDSPHGRLWATILQHR
FGGSGALAGHPIGNLMLAGLSEVLADPVAALDELGRILGVKGRVLPMCPVALQIEADVSGLEADPRMFRLIRGQVAIATT
PGKVRRVRLLPTDPPATRQAVDAIMAADLVVLGPGSWFTSVIPHVLVPGLAAALRATSARRALVLNLVAEPGETAGFSVE
RHLHVLAQHAPGFTVHDIIIDAERVPSEREREQLRRTATMLQAEVHFADVARPGTPLHDPGKLAAVLDGVCARDVGASEP
PVAATQEIPIDGGRPRGDDAWR
>Q97PN8 ~~~~~~Putative gluconeogenesis factor~~~COG0391
MRKPKITVIGGGTGSPVILKSLREKDVEIAAIVTVADDGGSSGELRKNMQQLTPPGDLRNVLVAMSDMPKFYEKVFQYRF
SEDAGAFAGHPLGNLIIAGLSEMQGSTYNAMQLLSKFFHTTGKIYPSSDHPLTLHAVFQDGTEVAGESHIVDHRGIIDNV
YVTNALNDDTPLASRRVVQTILESDMIVLGPGSLFTSILPNIVIKEIGRALLETKAEIAYVCNIMTQRGETEHFTDSDHV
EVLHRHLGRPFIDTVLVNIEKVPQEYMNSNRFDEYLVQVEHDFVGLCKQVSRVISSNFLRLENGGAFHDGDLIVDELMRI
IQVKK
>Q01578 3.1.1.17~~~gnl~~~Gluconolactonase~~~COG3386
MTTGRMSRRECLSAAVMVPIAAMTATATITGSAQAAKNNMNGSTIGKITKFSPRLDAILDVSTPIEVIASDIQWSEGPVW
VKNGNFLLFSDPPANIMRKWTPDAGVSIFLKPSGHAEPIPAGQFREPGSNGMKVGPDGKIWVADSGTRAIMKVDPVTRQR
SVVVDNYKGKRFNSPNDLFFSKSGAVYFTDPPYGLTNLDESDIKEMNYNGVFRLSPDGRLDLIEAGLSRPNGLALSPDET
KLYVSNSDRASPNIWVYSLDSNGLPTSRTLLRNFRKEYFDQGLAGLPDGMNIDKQGNLFASAPGGIYIFAPDGECLGLIS
GNPGQPLSNCCFGEKGQTLFISASHNVVRVRTKTFG
>P50199 1.1.1.-~~~gno~~~Gluconate 5-dehydrogenase~~~COG1028
MSHPDLFSLSGARALVTGASRGIGLTLAKGLARYGAEVVLNGRNAESLDSAQSGFEAEGLKASTAVFDVTDQDAVIDGVA
AIERDMGPIDILINNAGIQRRAPLEEFSRKDWDDLMSTNVNAVFFVGQAVARHMIPRGRGKIVNICSVQSELARPGIAPY
TATKGAVKNLTKGMATDWGRHGLQINGLAPGYFATEMTERLVADEEFTDWLCKRTPAGRWGQVEELVGAAVFLSSRASSF
VNGQVLMVDGGITVSL
>P0AC92 ~~~gnsA~~~Protein GnsA~~~
MNIEELKKQAETEIADFIAQKIAELNKNTGKEVSEIRFTAREKMTGLESYDVKIKIM
>O86041 1.13.11.4~~~nagI~~~Gentisate 1,2-dioxygenase~~~
MLDEEERITMSHELGRLEDLPQDYRDELKQLNLVPLWPSLRAVLPPNVPTRQTQPTYWSYQTLKPLLLKAGELTPIEKAE
RRVLVLANPGHGLEKMQASAAIYLGMQLLLPGEWAPSHRHTPNAVRMIVEGEGAYTTVDGEKCPMSRGDLILTPTGLWHE
HGHDGNEPVVWLDVLDLPLVYYMEASYHIDGERQQVDPGRGDCAWTRAGVVPTPVFQRSDKRYPLLRYPWADTRAALLSL
AADQPEQECVQVTYVNPETGDDAENILGFYALMLKPGQTLRLPVRSPAVVFHQIEGRSEARIAESTFALREADTCCAPGY
TEVTLKNLSADQPSFIFMADESPLHRKLGVFENRG
>P46859 2.7.1.12~~~gntK~~~Thermoresistant gluconokinase~~~COG3265
MSTTNHDHHIYVLMGVSGSGKSAVASEVAHQLHAAFLDGDFLHPRRNIEKMASGEPLNDDDRKPWLQALNDAAFAMQRTN
KVSLIVCSALKKHYRDLLREGNPNLSFIYLKGDFDVIESRLKARKGHFFKTQMLVTQFETLQEPGADETDVLVVDIDQPL
EGVVASTIEVIKKGK
>P0AC94 ~~~gntP~~~High-affinity gluconate transporter~~~COG2610
MHVLNILWVVFGIGLMLVLNLKFKINSMVALLVAALSVGMLAGMDLMSLLHTMKAGFGNTLGELAIIVVFGAVIGKLMVD
SGAAHQIAHTLLARLGLRYVQLSVIIIGLIFGLAMFYEVAFIMLAPLVIVIAAEAKIPFLKLAIPAVAAATTAHSLFPPQ
PGPVALVNAYGADMGMVYIYGVLVTIPSVICAGLILPKFLGNLERPTPSFLKADQPVDMNNLPSFGVSILVPLIPAIIMI
STTIANIWLVKDTPAWEVVNFIGSSPIAMFIAMVVAFVLFGTARGHDMQWVMNAFESAVKSIAMVILIIGAGGVLKQTII
DTGIGDTIGMLMSHGNISPYIMAWLITVLIRLATGQGVVSAMTAAGIISAAILDPATGQLVGVNPALLVLATAAGSNTLT
HINDASFWLFKGYFDLSVKDTLKTWGLLELVNSVVGLIIVLIISMVA
>P0ACP5 ~~~gntR~~~HTH-type transcriptional regulator GntR~~~COG1609
MKKKRPVLQDVADRVGVTKMTVSRFLRNPEQVSVALRGKIAAALDELGYIPNRAPDILSNATSRAIGVLLPSLTNQVFAE
VLRGIESVTDAHGYQTMLAHYGYKPEMEQERLESMLSWNIDGLILTERTHTPRTLKMIEVAGIPVVELMDSKSPCLDIAV
GFDNFEAARQMTTAIIARGHRHIAYLGARLDERTIIKQKGYEQAMLDAGLVPYSVMVEQSSSYSSGIELIRQARREYPQL
DGVFCTNDDLAVGAAFECQRLGLKVPDDMAIAGFHGHDIGQVMEPRLASVLTPRERMGSIGAERLLARIRGESVTPKMLD
LGFTLSPGGSI
>Q8GAL4 ~~~gntR~~~Probable D-xylose utilization operon transcriptional repressor~~~
MDATSKRLTRTTVASQVRDFIVMEIAQGRLPLGAPVREMEIAAQLGTSQTPVREAFRELAALGLLESRIHVGTRVRQLAE
KDLVEAVPIRSALEGIAGRLAANNYHKHAEEVRGSFEAMKEVAEGGDRRVFAAASTTFHRSVVRAAENASLLRAWNALGI
EVMTILAMASSDIPLDDAAESHRPIVDALEAGDPELAEHALTHHVAAYLPATAHSNGGVDAAVQAS
>Q9I1F6 ~~~gntR~~~HTH-type transcriptional regulator GntR~~~
MSITKNDKNTRTTGRPTLNEVARRAGVSPITASRALRGVASVAEELAQKVRDAARELGYVANPAARALASAQSHSVAVLV
PSLANLLFIETLEAIHAVLRPQGLEVLIGNFHYSRNEEEDLIRNYLAYQPRGLLLTGFERTESARRMIEASGIPCVYMMD
LDSGSGLNCVGFSQLRAGEAAAEHLLARGRRRLAYIGAQLDQRTLLRGEGFRRALQKAGCYDPGLEILTPRPSSVALGGE
LFVQLLASQPQVDGVFFCNDDLAQGALLEALRRGVKVPEQIAVLGFNDLPGSDCTVPRLSSIRTPREAIGRRAAEQLLAL
IAGKEVRDSALDMGFELMAREST
>P39835 ~~~gntT~~~High-affinity gluconate transporter~~~COG2610
MPLVIVAIGVILLLLLMIRFKMNGFIALVLVALAVGLMQGMPLDKVIGSIKAGVGGTLGSLALIMGFGAMLGKMLADCGG
AQRIATTLIAKFGKKHIQWAVVLTGFTVGFALFYEVGFVLMLPLVFTIAASANIPLLYVGVPMAAALSVTHGFLPPHPGP
TAIATIFNADMGKTLLYGTILAIPTVILAGPVYARVLKGIDKPIPEGLYSAKTFSEEEMPSFGVSVWTSLVPVVLMAMRA
IAEMILPKGHAFLPVAEFLGDPVMATLIAVLIAMFTFGLNRGRSMDQINDTLVSSIKIIAMMLLIIGGGGAFKQVLVDSG
VDKYIASMMHETNISPLLMAWSIAAVLRIALGSATVAAITAGGIAAPLIATTGVSPELMVIAVGSGSVIFSHVNDPGFWL
FKEYFNLTIGETIKSWSMLETIISVCGLVGCLLLNMVI
>P0AC96 ~~~gntU~~~Low-affinity gluconate transporter~~~COG2610
MTTLTLVLTAVGSVLLLLFLVMKARMHAFLALMVVSMGAGLFSGMPLDKIAATMEKGMGGTLGFLAVVVALGAMFGKILH
ETGAVDQIAVKMLKSFGHSRAHYAIGLAGLVCALPLFFEVAIVLLISVAFSMARHTGTNLVKLVIPLFAGVAAAAAFLVP
GPAPMLLASQMNADFGWMILIGLCAAIPGMIIAGPLWGNFISRYVELHIPDDISEPHLGEGKMPSFGFSLSLILLPLVLV
GLKTIAARFVPEGSTAYEWFEFIGHPFTAILVACLVAIYGLAMRQGMPKDKVMEICGHALQPAGIILLVIGAGGVFKQVL
VDSGVGPALGEALTGMGLPIAITCFVLAAAVRIIQGSATVACLTAVGLVMPVIEQLNYSGAQMAALSICIAGGSIVVSHV
NDAGFWLFGKFTGATEAETLKTWTMMETILGTVGAIVGMIAFQLLS
>Q8X7P7 5.1.3.26~~~gnu~~~N-acetyl-alpha-D-glucosaminyl-diphospho-ditrans,octacis-undecaprenol 4-epimerase~~~COG0451
MNDNVLLIGASGFVGTRLLETAIADFNIKNLDKQQSHFYPEITQIGDVRDQQALDQALAGFDTVVLLAAEHRDDVSPTSL
YYDVNVQGTRNVLAAMEKNGVKNIIFTSSVAVYGLNKHNPDENHPHDPFNHYGKSKWQAEEVLREWYNKAPTERSLTIIR
PTVIFGERNRGNVYNLLKQIAGGKFMMVGAGTNYKSMAYVGNIVEFIKYKLKNVAAGYEVYNYVDKPDLNMNQLVAEVEQ
SLNKKIPSMHLPYPLGMLGGYCFDILSKITGKKYAVSSVRVKKFCATTQFDATKVHSSGFVAPYTLSQGLDRTLQYEFVH
AKKDDITFVSE
>Q92EU6 1.1.1.6~~~golD~~~NAD-dependent glycerol dehydrogenase~~~COG1028
MTFKGFDKDFNITDKVAVVTGAASGIGKAMAELFSEKGAYVVLLDIKEDVKDVAAQINPSRTLALQVDITKKENIEKVVA
EIKKVYPKIDILANSAGVALLEKAEDLPEEYWDKTMELNLKGSFLMAQIIGREMIATGGGKIVNMASQASVIALDKHVAY
CASKAAIVSMTQVLAMEWAPYNINVNAISPTVILTELGKKAWAGQVGEDMKKLIPAGRFGYPEEVAACALFLVSDAASLI
TGENLIIDGGYTIK
>B0BCM1 ~~~~~~Virulence plasmid protein pGP3-D~~~
MGNSGFYLYNTQNCVFADNIKVGQMTEPLKDQQIILGTTSTPVAAKMTASDGISLTVSNNPSTNASITIGLDAEKAYQLI
LEKLGDQILGGIADTIVDSTVQDILDKITTDPSLGLLKAFNNFPITNKIQCNGLFTPRNIETLLGGTEIGKFTVTPKSSG
SMFLVSADIIASRMEGGVVLALVREGDSKPYAISYGYSSGVPNLCSLRTRIINTGLTPTTYSLRVGGLESGVVWVNALSN
GNDILGITNTSNVSFLEVIPQTNA
>P0CE18 ~~~~~~Virulence plasmid protein pGP3-D~~~
MGNSGFYLYNTQNCVFADNIKVGQMTEPLKDQQIILGTTSTPVAAKMTASDGISLTVSNNPSTNASITIGLDAEKAYQLI
LEKLGDQILGGIADTIVDSTVQDILDKITTDPSLGLLKAFNNFPITNKIQCNGLFTPRNIETLLGGTEIGKFTVTPKSSG
SMFLVSADIIASRMEGGVVLALVREGDSKPYAISYGYSSGVPNLCSLRTRIINTGLTPTTYSLRVGGLESGVVWVNALSN
GNDILGITNTSNVSFLEVIPQTNA
>P9WN75 1.1.1.94~~~gpsA2~~~Probable glycerol-3-phosphate dehydrogenase 2 [NAD(P)+]~~~COG0240
MAANKREPKVVVLGGGSWGTTVASICARRGPTLQWVRSAVTAQDINDNHRNSRYLGNDVVLSDTLRATTDFTEAANCADV
VVMGVPSHGFRGVLVELSKELRPWVPVVSLVKGLEQGTNMRMSQIIEEVLPGHPAGILAGPNIAREVAEGYAAAAVLAMP
DQHLATRLSAMFRTRRFRVYTTDDVVGVETAGALKNVFAIAVGMGYSLGIGENTRALVIARALREMTKLGVAMGGKSETF
PGLAGLGDLIVTCTSQRSRNRHVGEQLGAGKPIDEIIASMSQVAEGVKAAGVVMEFANEFGLNMPIAREVDAVINHGSTV
EQAYRGLIAEVPGHEVHGSGF
>Q83BJ0 1.1.1.94~~~gpsA~~~Glycerol-3-phosphate dehydrogenase [NAD(P)+]~~~COG0240
MEPFKHPIAILGAGSWGTALALVLARKGQKVRLWSYESDHVDEMQAEGVNNRYLPNYPFPETLKAYCDLKASLEGVTDIL
IVVPSFAFHEVITRMKPLIDAKTRIAWGTKGLAKGSRLLHEVVATELGQVPMAVISGPSLATEVAANLPTAVSLASNNSQ
FSKDLIERLHGQRFRVYKNDDMIGVELCGSVKNILAIATGISDGLKLGSNARAALITRGLTEMGRLVSVFGGKQETLTGL
AGLGDLVLTCTDNQSRNRRFGLALGEGVDKKEAQQAIGQAIEGLYNTDQVHALAQKHAIEMPLTFQVHRILHEDLDPQQA
VQELLERSPKAE
>P9WN77 1.1.1.94~~~gpsA~~~Glycerol-3-phosphate dehydrogenase [NAD(P)+]~~~COG0240
MAGIASTVAVMGAGAWGTALAKVLADAGGEVTLWARRAEVADQINTTRYNPDYLPGALLPPSIHATADAEEALGGASTVL
LGVPAQTMRANLERWAPLLPEGATLVSLAKGIELGTLMRMSQVIISVTGAEPPQVAVISGPNLASEIAECQPAATVVACS
DSGRAVALQRALNSGYFRPYTNADVVGTEIGGACKNIIALACGMAVGIGLGENTAAAIITRGLAEIIRLGTALGANGATL
AGLAGVGDLVATCTSPRSRNRSFGERLGRGETLQSAGKACHVVEGVTSCESVLALASSYDVEMPLTDAVHRVCHKGLSVD
EAITLLLGRRTKPE
>P64191 1.1.1.94~~~gpsA~~~Glycerol-3-phosphate dehydrogenase [NAD(P)+]~~~
MTKITVFGMGSFGTALANVLAENGHDVLMWGKNQDAVDELNTCHTNKKYLKYAKLDVNIIATSDMTKAIQFADIYLMALP
TKAMREVATQINDKLTSKKTFIHVAKGIENGTFKRVSEMIEDSISPEYNAGIGVLSGPSHAEEVVVKQPTTVAASSKDKS
VSKLTQDLFMNDYLRVYTNDDLIGVELGGALKNIIAVASGIVAGIGYGDNAKAALMTRGLAEISRLGEKLGADPMTFLGL
GGIGDLIVTCISTHSRNFTLGYKLGQGESMDQALSEMNMVVEGIYTTKSVYHLAKEKNVDMPITNALYRVLFENISVKEC
VKDLMERDKKSE
>Q5XE03 1.1.1.94~~~gpsA~~~Glycerol-3-phosphate dehydrogenase [NAD(P)+]~~~
MTKQKVAILGPGSWGTALSQVLNDNGHDVRLWGNIPDQIEEINTKHTNRHYFKDIVLDKNITATLDLGQALSDVDAVLFV
VPTKVTRLVARQVAAILDHKVVVMHASKGLEPETHERLSTILEEVIPAHFRSEVVVVSGPSHAEETIVRDITLITAASKD
IEAAKYVQSLFSNHYFRLYTNTDVIGVETAGALKNIIAVGAGALHGLGYGDNAKAAVITRGLAEITRLGVKLGADPLTYS
GLSGVGDLIVTGTSVHSRNWRAGAALGRGEKLEDIERNMGMVIEGIATTKVAYEIAQDLGVYMPITTAIYKSIYEGADIK
ESILGMMSNEFRSENEWH
>Q97NF1 1.1.1.94~~~gpsA~~~Glycerol-3-phosphate dehydrogenase [NAD(P)+]~~~COG0240
MEKQTVAVLGPGSWGTALSQVLNDNGHEVRIWGNLPEQINEINTHHTNKHYFKDVVLDENIIAYTDLAETLKDVDAILFV
VPTKVTRLVAQQVAQTLDHKVIIMHASKGLEPDSHKRLSTILEEEIPEHLRSDIVVVSGPSHAEETIVRDLTLITAASKD
LQTAQYVQKLFSNHYFRLYTNTDVIGVETAGALKNIIAVGAGALHGLGFGDNAKAAIIARGLAEITRLGVALGASPLTYS
GLSGVGDLIVTGTSIHSRNWRAGDALGRGESLADIEANMGMVIEGISTTRAAYELAQELGVYMPITQAIYQVIYHGTNIK
DAIYDIMNNEFKAENEWS
>Q6XBH1 3.1.4.46~~~gpdQ~~~Glycerophosphodiester phosphodiesterase GpdQ~~~
MLLAHISDTHFRSRGEKLYGFIDVNAANADVVSQLNALRERPDAVVVSGDIVNCGRPEEYQVARQILGSLNYPLYLIPGN
HDDKALFLEYLQPLCPQLGSDANNMRCAVDDFATRLLFIDSSRAGTSKGWLTDETISWLEAQLFEGGDKPATIFMHHPPL
PLGNAQMDPIACENGHRLLALVERFPSLTRIFCGHNHSLTMTQYRQALISTLPGTVHQVPYCHEDTRPYYDLSPASCLMH
RQVGEQWVSYQHSLAHYAGPWLYDENISCPTEER
>P9WIC7 3.1.3.85~~~gpgP~~~Glucosyl-3-phosphoglycerate phosphatase~~~COG0406
MRARRLVMLRHGQTDYNVGSRMQGQLDTELSELGRTQAVAAAEVLGKRQPLLIVSSDLRRAYDTAVKLGERTGLVVRVDT
RLRETHLGDWQGLTHAQIDADAPGARLAWREDATWAPHGGESRVDVAARSRPLVAELVASEPEWGGADEPDRPVVLVAHG
GLIAALSAALLKLPVANWPALGGMGNASWTQLSGHWAPGSDFESIRWRLDVWNASAQVSSDVL
>A1TC01 3.1.3.85~~~gpgP~~~Glucosyl-3-phosphoglycerate phosphatase~~~COG0406
MRVRRLVMLRHGQTEYNAGSRMQGQLDTDLSDLGREQAVAAAEVLAKRQPLLIVSSDLRRALDTAVALGDRSGQPVSIDT
RLRETHLGDWQGMTHLEVDAAAPGARLAWRDDARWAPHGGESRVDVADRSLPLVHELVTQQTDWGAAGSDRPVVLVAHGG
LIAALTAALLGLPVDNWPVLGGMGNASWVQLAGHTRADGDPGAFADIRWRLDVWNASAQVANDVL
>Q7U0E1 2.4.1.266~~~gpgS~~~Glucosyl-3-phosphoglycerate synthase~~~
MTASELVAGDLAGGRAPGALPLDTTWHRPGWTIGELEAAKAGRTISVVLPALNEEATIESVIDSISPLVDGLVDELIVLD
SGSTDDTEIRAIASGARVVSREQALPEVPVRPGKGEALWRSLAATSGDIVVFIDSDLINPHPLFVPWLVGPLLTGEGIQL
VKSFYRRPLQVSDVTSGVCATGGGRVTELVARPLLAALRPELGCVLQPLSGEYAASRELLTSLPFAPGYGVEIGLLIDTF
DRLGLDAIAQVNLGVRAHRNRPLDELGAMSRQVIATLLSRCGIPDSGVGLTQFLPGGPDDSDYTRHTWPVSLVDRPPMKV
MRPR
>Q73WU1 2.4.1.266~~~~~~Glucosyl-3-phosphoglycerate synthase~~~COG0463
MTTSDLVAGELAGDGLRDTRPGDTWLADRSWNRPGWTVAELEAAKAGRTISVVLPALDEEDTIGSVIDSISPLVDGLVDE
LIVLDSGSTDDTEIRAVAAGARVVSREQALPEVPIRPGKGEALWRSLAASRGDIVVFVDSDLINPHPMFVPWLVGPLLTG
DGVHLVKSFYRRPLNVGDAGGGAGATGGGRVTELVARPLLAALRPELGCILQPLGGEYAATRELLTSVPFAPGYGVEIGL
LVDTFDRLGLDAIAQVNLGVREHRNRPLAELGAMSRQVIATLLSRCGIPDSGVGLTQFVADGPEGQSYTQHTWPVSLADR
PPMQAIRPR
>A0R2E6 2.4.1.266~~~gpgS~~~Glucosyl-3-phosphoglycerate synthase~~~COG0463
MGHRWLTDHSWNRPSWTVADLEAAKAGRTVSVVLPALNEEETVGSVVETIKPLLGGLVDELIVLDSGSTDETEIRAVAAG
AKVVSREAALPEVPPQPGKGEVLWRSLAATTGDIIAFVDSDLIDPDPMFVPKLLGPLLTCDGVHLVKGFYRRPLKVSGAE
DANGGGRVTELVARPLLASLRPELNCVLQPLGGEYAGTRELLTSVPFAPGYGVEIGLLVDTYDRLGLDGIAQVNLGVRAH
RNRPLTELASMSRQVIATLLSRCGISDSGVGLTQFFADGDDFTPRVSSVSLADRPPMTTLRPR
>P9WMW9 2.4.1.266~~~gpgS~~~Glucosyl-3-phosphoglycerate synthase~~~COG0463
MTASELVAGDLAGGRAPGALPLDTTWHRPGWTIGELEAAKAGRTISVVLPALNEEATIESVIDSISPLVDGLVDELIVLD
SGSTDDTEIRAIASGARVVSREQALPEVPVRPGKGEALWRSLAATSGDIVVFIDSDLINPHPLFVPWLVGPLLTGEGIQL
VKSFYRRPLQVSDVTSGVCATGGGRVTELVARPLLAALRPELGCVLQPLSGEYAASRELLTSLPFAPGYGVEIGLLIDTF
DRLGLDAIAQVNLGVRAHRNRPLDELGAMSRQVIATLLSRCGIPDSGVGLTQFLPGGPDDSDYTRHTWPVSLVDRPPMKV
MRPR
>C0QRQ2 2.4.1.266~~~gpgS~~~Glucosyl-3-phosphoglycerate synthase~~~COG1215
MADFFQNGVITTLQNFRNRSLEELEYELELFSKRRNMVLLLPALYSEFEGPAMPKIIQELKDIRYLYKIVLSLDRATEEE
FKKVKKIMSEINTEVKVIWHDGPRMQRLYRELEEAGFNVSIPGKGRSVWMSLGYILSDADAYAIALHDCDIVNYSRELPA
RLLYPVVHPALDFEFSKGYYARVTHKLYGRVTRIFYTPLIRALIRILGCNRFLVYLDSFRYALSGEFAFIRTLARGIRIS
PTWGLEVSMLSEVYQNTSFNRICQVEVMDTYEHKHQKLVKSTSEGLVKMASDIAKTLFRVLAHDGFVFSEAFFRTLLTTY
LQEARYAIEKYNALSLINGLTYDRHAEIEAIEVFVDALKKAEKEFIEDPIGVPLMSAWVRVRAALPEISDKLIRAVEEDN
SDD
>A9BHI9 2.4.1.266~~~gpgS~~~Glucosyl-3-phosphoglycerate synthase~~~COG1215
MKDNILKRSFHHSKFENIKELVKLKEKQDVKISLAFPSLNEEKTIGKEIIIMKSELMEKYPLLDEIAVIDSGSEDETVSI
AKEYGAKVFYSSDILPEYGFYKGKGENLWKSLYALDGDIIVWVDSDIENIHPKFVYGLVGALLNYPEIGYVKAFYDRPIV
GKSAMQPTGGGRVTELVARPLFSLFYPELSTIIQPLSGEYAGRREILEKLPFFVGYGVEIAHLIDIAEKFGSEIIAQVDL
ELRIHDNQPLHSLSKMAFELTKVVLKRLEKYGKLDLNTELTDKHIMIQKKENEKVLVPTEILSVERPPMITIPEYKEKFS
KEEKV
>P40852 3.1.3.18~~~cbbZC~~~Phosphoglycolate phosphatase, chromosomal~~~COG0546
MATVSMPCTAVLIDLDGTLVDSAPDIVEAANRMLADFGSPALPFDTVAGFIGRGVPNLVRRVLETAGLTPRVEAAEAVAM
FHRHYAETNGRLGSVFPGVEAGLEALRRQGYRLACVTNKPRALAVPLLALTGLSQYLEVLVAGDSIAQMKPDPEPLRHAC
NLLDVDTAQGVLVGDSAVDVAAARAAGIPVCLVRYGYAGPGGPAALGADALLDSLEALPALLTPARLAPAA
>P40853 3.1.3.18~~~cbbZP~~~Phosphoglycolate phosphatase, plasmid~~~COG0546
MATVSLPCTAVLIDLDGTLVDCAPDIVEAANRMLADLGSPALPFGTVAGFIGRGVPNLVRRVLETAQLAPRVDATDAVAM
FHRHYADTNGRLGSVFPGVEAGLAALRRQGYRLACVTNKPRALAVPLLALTGLSQYLEVLVAGDSIAQMKPDPEPLRHAC
NLLDVDAAQGVLVGDSAVDVAAARAAGIPVCLVRYGYAGPGGPAALGADALVDSLEALPALLTPARLAPAA
>O67359 3.1.3.18~~~gph~~~Phosphoglycolate phosphatase~~~COG0546
MRVILFDLDGTLIDSAKDIALALEKTLKELGLEEYYPDNVTKYIGGGVRALLEKVLKDKFREEYVEVFRKHYLENPVVYT
KPYPEIPYTLEALKSKGFKLAVVSNKLEELSKKILDILNLSGYFDLIVGGDTFGEKKPSPTPVLKTLEILGEEPEKALIV
GDTDADIEAGKRAGTKTALALWGYVKLNSQIPDFTLSRPSDLVKLMDNHIVEF
>P32662 3.1.3.18~~~gph~~~Phosphoglycolate phosphatase~~~COG0546
MNKFEDIRGVAFDLDGTLVDSAPGLAAAVDMALYALELPVAGEERVITWIGNGADVLMERALTWARQERATQRKTMGKPP
VDDDIPAEEQVRILRKLFDRYYGEVAEEGTFLFPHVADTLGALQAKGLPLGLVTNKPTPFVAPLLEALDIAKYFSVVIGG
DDVQNKKPHPDPLLLVAERMGIAPQQMLFVGDSRNDIQAAKAAGCPSVGLTYGYNYGEAIDLSQPDVIYQSINDLLPALG
LPHSENQESKND
>O51602 5.4.2.11~~~gpmA~~~2,3-bisphosphoglycerate-dependent phosphoglycerate mutase~~~
MYKLVLVRHGESEWNKENLFTGWTDVKLSDKGIDEAVEAGLLLKQEGYSFDIAFSSLLSRANDTLNIILRELGQSYISVK
KTWRLNERHYGALQGLNKSETAAKYGEDKVLIWRRSYDVPPMSLDESDDRHPIKDPRYKHIPKRELPSTECLKDTVARVI
PYWTDEIAKEVLEGKKVIVAAHGNSLRALVKYFDNLSEEDVLKLNIPTGIPLVYELDKDLNPIKHYYLGDESKIKKAMES
VASQGKLK
>Q3JWH7 5.4.2.11~~~gpmA~~~2,3-bisphosphoglycerate-dependent phosphoglycerate mutase~~~
MYKLVLIRHGESTWNKENRFTGWVDVDLTEQGNREARQAGQLLKEAGYTFDIAYTSVLKRAIRTLWHVQDQMDLMYVPVV
HSWRLNERHYGALSGLNKAETAAKYGDEQVLVWRRSYDTPPPALEPGDERAPYADPRYAKVPREQLPLTECLKDTVARVL
PLWNESIAPAVKAGKQVLIAAHGNSLRALIKYLDGISDADIVGLNIPNGVPLVYELDESLTPIRHYYLGDQEAIAKAQAA
VAQQGKSAA
>P62707 5.4.2.11~~~gpmA~~~2,3-bisphosphoglycerate-dependent phosphoglycerate mutase~~~COG0588
MAVTKLVLVRHGESQWNKENRFTGWYDVDLSEKGVSEAKAAGKLLKEEGYSFDFAYTSVLKRAIHTLWNVLDELDQAWLP
VEKSWKLNERHYGALQGLNKAETAEKYGDEQVKQWRRGFAVTPPELTKDDERYPGHDPRYAKLSEKELPLTESLALTIDR
VIPYWNETILPRMKSGERVIIAAHGNSLRALVKYLDNMSEEEILELNIPTGVPLVYEFDENFKPLKRYYLGNADEIAAKA
AAVANQGKAK
>B8ZT86 5.4.2.11~~~gpmA~~~2,3-bisphosphoglycerate-dependent phosphoglycerate mutase~~~
MQQGNTATLILLRHGESDWNARNLFTGWVDVGLTDKGRAEAVRSGELLAEHNLLPDVLYTSLLRRAITTAHLALDTADWL
WIPVRRSWRLNERHYGALQGLDKAVTKARYGEERFMAWRRSYDTPPPPIEKGSEFSQDADPRYTDIGGGPLTECLADVVT
RFLPYFTDVIVPDLRTGRTVLIVAHGNSLRALVKHLDEMSDDEVVGLNVPTGIPLRYDLDADLRPVVPGGTYLDPEAAAA
VISQARP
>P9WIC9 5.4.2.11~~~gpmA~~~2,3-bisphosphoglycerate-dependent phosphoglycerate mutase~~~COG0588
MANTGSLVLLRHGESDWNALNLFTGWVDVGLTDKGQAEAVRSGELIAEHDLLPDVLYTSLLRRAITTAHLALDSADRLWI
PVRRSWRLNERHYGALQGLDKAETKARYGEEQFMAWRRSYDTPPPPIERGSQFSQDADPRYADIGGGPLTECLADVVARF
LPYFTDVIVGDLRVGKTVLIVAHGNSLRALVKHLDQMSDDEIVGLNIPTGIPLRYDLDSAMRPLVRGGTYLDPEAAAAGA
AAVAGQGRG
>B4RIY7 5.4.2.11~~~gpmA~~~2,3-bisphosphoglycerate-dependent phosphoglycerate mutase~~~
MELVFIRHGQSEWNAKNLFTGWRDVKLSEQGLAEAAAAGKKLKENGYEFDIAFTSVLTRAIKTCNIVLEESDQLFVPQIK
TWRLNERHYGRLQGLDKKQTAEKYGDEQVRIWRRSYDTLPPLLDKDDAFSAHKDRRYAHLPADVVPDGENLKVTLERVLP
FWEDQIAPAILSGKRVLVAAHGNSLRALAKHIEGISDEDIMGLEIPTGQPLVYKLDDNLKVIEKFYL
>P99153 5.4.2.11~~~gpmA~~~2,3-bisphosphoglycerate-dependent phosphoglycerate mutase~~~
MPKLILCRHGQSEWNAKNLFTGWEDVNLSEQGINEATRAGEKVRENNIAIDVAFTSLLTRALDTTHYILTESKQQWIPVY
KSWRLNERHYGGLQGLNKDDARKEFGEEQVHIWRRSYDVKPPAETEEQREAYLADRRYNHLDKRMMPYSESLKDTLVRVI
PFWTDHISQYLLDGQTVLVSAHGNSIRALIKYLEDVSDEDIINYEIKTGAPLVYELTDDLEVIDKYYL
>P33158 5.4.2.11~~~gpmA~~~2,3-bisphosphoglycerate-dependent phosphoglycerate mutase~~~COG0588
MADAPYKLILLRHGESEWNEKNLFTGWVDVNLTPKGEKEATRGGELLKDAGLLPDVVHTSVQKRAIRTAQLALEAADRHW
IPVHRHWRLNERHYGALQGKDKAQTLAEFGEEQFMLWRRSYDTPPPALDRDAEYSQFSDPRYAMLPPELRPQTECLKDVV
GRMLPYWFDAIVPDLLTGRTVLVAAHGNSLRALVKHLDGISDADIAGLNIPTGIPLSYELNAEFKPLNPGGTYLDPDAAA
AAIEAVKNQGKKK
>Q5XB88 5.4.2.11~~~gpmA~~~2,3-bisphosphoglycerate-dependent phosphoglycerate mutase~~~
MVKLVFARHGESEWNKANLFTGWADVDLSEKGTQQAIDAGKLIKEAGIEFDLAFTSVLTRAIKTTNLALENAGQLWVPTE
KSWRLNERHYGALTGKNKAEAAEQFGDEQVHIWRRSYDVLPPAMAKDDEYSAHKDRRYADLDPALIPDAENLKVTLERAM
PYWEEKIAPALLDGKNVFVGAHGNSIRALVKHIKGLSDDEIMDVEIPNFPPLVFELDEKLNIVKEYYLGGE
>P30798 5.4.2.11~~~gpmA~~~2,3-bisphosphoglycerate-dependent phosphoglycerate mutase~~~COG0588
MPTLVLSRHGQSEWNLENRFTGWWDVNLTEQGVQEATAGGKALAEKGFEFDIAFTSVLTRAIKTTNLILEAGKTLWVPTE
KDWRLNERHYGGLTGLNKAETAAKHGEEQVHIWRRSYDVPPPPMEKGSKFDLSGDRRYDGVKIPETESLKDTVARVLPYW
EERIAPELKAGKRVLIGAHGNSLRALVKHLSKLSDEEIVKFELPTGQPLVYELNDDLTPKDRYFLNER
>Q81X77 5.4.2.12~~~gpmI~~~2,3-bisphosphoglycerate-independent phosphoglycerate mutase~~~COG0696
MRKPTALIILDGFGLREETYGNAVAQAKKPNFDGYWNKFPHTTLTACGEAVGLPEGQMGNSEVGHLNIGAGRIVYQSLTR
VNVAIREGEFDKNETFQSAIKSVKEKGTALHLFGLLSDGGVHSHMNHMFALLRLAAKEGVEKVYIHAFLDGRDVGPKTAQ
SYIDATNEVIKETGVGQFATISGRYYSMDRDKRWDRVEKCYRAMVNGEGPTYKSAEECVEDSYANGIYDEFVLPSVIVNE
DNTPVATINDDDAVIFYNFRPDRAIQIARVFTNGDFREFDRGEKVPHIPEFVCMTHFSETVDGYVAFKPMNLDNTLGEVV
AQAGLKQLRIAETEKYPHVTFFFSGGREAEFPGEERILINSPKVATYDLKPEMSIYEVTDALVNEIENDKHDVIILNFAN
CDMVGHSGMMEPTIKAVEATDECLGKVVEAILAKDGVALITADHGNADEELTSEGEPMTAHTTNPVPFIVTKNDVELRED
GILGDIAPTMLTLLGVEQPKEMTGKTIIK
>P39773 5.4.2.12~~~gpmI~~~2,3-bisphosphoglycerate-independent phosphoglycerate mutase~~~COG0696
MSKKPAALIILDGFGLRNETVGNAVALAKKPNFDRYWNQYPHQTLTASGEAVGLPEGQMGNSEVGHLNIGAGRIVYQSLT
RVNVAIREGEFERNQTFLDAISNAKENNKALHLFGLLSDGGVHSHINHLFALLKLAKKEGLTKVYIHGFLDGRDVGPQTA
KTYINQLNDQIKEIGVGEIASISGRYYSMDRDKRWDRVEKAYRAMAYGEGPSYRSALDVVDDSYANGIYDEFVIPSVITK
ENGEPVAKIQDGDSVIFYNFRPDRAIQISNTFTNKDFRDFDRGENYPKNLYFVCLTHFSETVDGYVAFKPINLDNTVGEV
LSQHGLKQLRIAETEKYPHVTFFMSGGREAEFPGEERILINSPKVATYDLKPEMSAYEVKDALVKEIEADKHDAIILNFA
NPDMVGHSGMVEPTIKAIEAVDECLGEVVDAILAKGGHAIITADHGNADILITESGEPHTAHTTNPVPVIVTKEGITLRE
GGILGDLAPTLLDLLGVEKPKEMTGTSLIQK
>P37689 5.4.2.12~~~gpmI~~~2,3-bisphosphoglycerate-independent phosphoglycerate mutase~~~COG0696
MLVSKKPMVLVILDGYGYREEQQDNAIFSAKTPVMDALWANRPHTLIDASGLEVGLPDRQMGNSEVGHVNLGAGRIVYQD
LTRLDVEIKDRAFFANPVLTGAVDKAKNAGKAVHIMGLLSAGGVHSHEDHIMAMVELAAERGAEKIYLHAFLDGRDTPPR
SAESSLKKFEEKFAALGKGRVASIIGRYYAMDRDNRWDRVEKAYDLLTLAQGEFQADTAVAGLQAAYARDENDEFVKATV
IRAEGQPDAAMEDGDALIFMNFRADRAREITRAFVNADFDGFARKKVVNVDFVMLTEYAADIKTAVAYPPASLVNTFGEW
MAKNDKTQLRISETEKYAHVTFFFNGGVEESFKGEDRILINSPKVATYDLQPEMSSAELTEKLVAAIKSGKYDTIICNYP
NGDMVGHTGVMEAAVKAVEALDHCVEEVAKAVESVGGQLLITADHGNAEQMRDPATGQAHTAHTNLPVPLIYVGDKNVKA
VEGGKLSDIAPTMLSLMGMEIPQEMTGKPLFIVE
>Q9X519 5.4.2.12~~~gpmI~~~2,3-bisphosphoglycerate-independent phosphoglycerate mutase~~~
MSKKPVALIILDGFALRDETYGNAVAQANKPNFDRYWNEYPHTTLKACGEAVGLPEGQMGNSEVGHLNIGAGRIVYQSLT
RINIAIREGEFDRNETFLAAMNHVKQHGTSLHLFGLLSDGGVHSHIHHLYALLRLAAKEGVKRVYIHGFLDGRDVGPQTA
PQYIKELQEKIKEYGVGEIATLSGRYYSMDRDKRWDRVEKAYRAMVYGEGPTYRDPLECIEDSYKHGIYDEFVLPSVIVR
EDGRPVATIQDNDAIIFYNFRPDRAIQISNTFTNEDFREFDRGPKHPKHLFFVCLTHFSETVKGYVAFKPTNLDNTIGEV
LSQHGLRQLRIAETEKYPHVTFFMSGGREEKFPGEDRILINSPKVPTYDLKPEMSAYEVTDALLKEIEADKYDAIILNYA
NPDMVGHSGKLEPTIKAVEAVDECLGKVVDAILAKGGIAIITADHGNADEVLTPDGKPQTAHTTNPVPVIVTKKGIKLRD
GGILGDLAPTMLDLLGLPQPKEMTGKSLIVK
>P75167 5.4.2.12~~~gpmI~~~2,3-bisphosphoglycerate-independent phosphoglycerate mutase~~~
MHKKVLLAILDGYGISNKQHGNAVYHAKTPALDSLIKDYPCVMLEASGEAVGLPQGQIGNSEVGHLNIGAGRIVYTGLSL
INQNIKTGAFHHNQVLLEAIARAKANNAKLHLIGLFSHGGVHSHMDHLYALIKLAAPQVKMVLHLFGDGRDVAPCTMKSD
LEAFMVFLKDYHNVIIGTLGGRYYGMDRDQRWDREEIAYNAILGNSKASFTDPVAYVQSAYDQKVTDEFLYPAVNGNVDK
EQFALKDHDSVIFFNFRPDRARQMSHMLFQTDYYDYTPKAGRKYNLFFVTMMNYEGIKPSAVVFPPETIPNTFGEVIAHN
KLKQLRIAETEKYAHVTFFFDGGVEVDLPNETKCMVPSLKVATYDLAPEMACKGITDQLLNQINQFDLTVLNFANPDMVG
HTGNYAACVQGLEALDVQIQRIIDFCKANHITLFLTADHGNAEEMIDSNNNPVTKHTVNKVPFVCTDTNIDLQQDSASLA
NIAPTILAYLGLKQPAEMTANSLLISKK
>P52832 5.4.2.12~~~gpmI~~~2,3-bisphosphoglycerate-independent phosphoglycerate mutase~~~COG0696
MTATPKPLVLIILDGFGHSESHEGNAILAAKMPVMDRLYKTMPNGLISGSGMDVGLPDGQMGNSEVGHMNLGAGRVVYQD
FTRVTKAIRDGEFFENPTICAAVDKAVSAGKAVHIMGLLSDGGVHSHQDHLVAMAELAVRRGADKIYLHAFLDGRDTPPR
SAKKSLELMDETFARLGKGRIATIIGRYFAMDRDNRWDRVSTAYNLIVDSSAEFHAATGVAGLEAAYARDENDEFVKATR
IGEPANVEDGDAVVFMNFRADRARELTRVFVEDDFKDFERARQPKVNYVMLTQYAASIPAPSAFAAGSLKNVLGEYLADN
GKTQLRIAETEKYAHVTFFFSGGREEPFPGEERILIPSPKVATYDLQPEMSAPEVTDKIVDAIEHQRYDVIIVNYANGDM
VGHSGIMEAAIKAVEYLDVCVGRITDALEKVGGEALITADHGNVEQMTDDATGQAHTAHTSEPVPFVYVGKRQLKVREGG
VLADVAPTMLQLLGMEKPQEMTGHSILVEE
>Q5HHP2 5.4.2.12~~~gpmI~~~2,3-bisphosphoglycerate-independent phosphoglycerate mutase~~~
MAKKPTALIILDGFANRESEHGNAVKLANKPNFDRYYNKYPTTQIEASGLDVGLPEGQMGNSEVGHMNIGAGRIVYQSLT
RINKSIEDGDFFENDVLNNAIAHVNSHDSALHIFGLLSDGGVHSHYKHLFALLELAKKQGVEKVYVHAFLDGRDVDQKSA
LKYIEETEAKFNELGIGQFASVSGRYYAMDRDKRWEREEKAYNAIRNFDAPTYATAKEGVEASYNEGLTDEFVVPFIVEN
QNDGVNDGDAVIFYNFRPDRAAQLSEIFANRAFEGFKVEQVKDLFYATFTKYNDNIDAAIVFEKVDLNNTIGEIAQNNNL
TQLRIAETEKYPHVTYFMSGGRNEEFKGERRRLIDSPKVATYDLKPEMSAYEVKDALLEELNKGDLDLIILNFANPDMVG
HSGMLEPTIKAIEAVDECLGEVVDKILDMDGYAIITADHGNSDQVLTDDDQPMTTHTTNPVPVIVTKEGVTLRETGRLGD
LAPTLLDLLNVEQPEDMTGESLIKH
>P64270 5.4.2.12~~~gpmI~~~2,3-bisphosphoglycerate-independent phosphoglycerate mutase~~~
MAKKPTALIILDGFANRESEHGNAVKLANKPNFDRYYNKYPTTQIEASGLDVGLPEGQMGNSEVGHMNIGAGRIVYQSLT
RINKSIEDGDFFENDVLNNAIAHVNSHDSALHIFGLLSDGGVHSHYKHLFALLELAKKQGVEKVYVHAFLDGRDVDQKSA
LKYIEETEAKFNELGIGQFASVSGRYYAMDRDKRWEREEKAYNAIRNFDAPTYATAKEGVEASYNEGLTDEFVVPFIVEN
QNDGVNDGDTVIFYNFRPDRAAQLSEIFANRAFEGFKVEQVKDLFYATFTKYNDNIDAAIVFEKVDLNNTIGEIAQNNNL
TQLRIAETEKYPHVTYFMSGGRNEEFKGERRRLIDSPKVATYDLKPEMSAYEVKDALLEELNKGDLDLIILNFANPDMVG
HSGMLEPTIKAIEAVDECLGEVVDKILDMDGYAIITADHGNSDQVLTDDDQPMTTHTTNPVPVIVTKEGVTLRETGRLGD
LAPTLLDLLNVEQPEDMTGESLIKH
>C0QRP9 3.1.3.70~~~gpgP~~~Glucosyl-3-phosphoglycerate/mannosyl-3-phosphoglycerate phosphatase~~~COG3769
MVIFTDLDGTLLNHEDYSFKDAIPSLERIKKKGIPLVIVTSKTKKEVELIQKELGIEEPFIVENGAAVFFPKGYRGFNIR
CDQENRYCIIKLGRDYREIRDFIEKIKDKFKIKGFGDMTVEEIVRLTDLPYDRAELAKERDFTEPFIIEDEKDIKDLEEI
AEKEGFKITKGGRFYHLIGKGQDKGRAVQIVKKVFEENYGEVPLTVGLGDSRNDIPMLREVDIPILIPHINKKYESVNLP
GIIKAEYPGSKGWNESIWRILNEIERGCC
>P25552 3.6.1.40~~~gppA~~~Guanosine-5'-triphosphate,3'-diphosphate pyrophosphatase~~~COG0248
MGSTSSLYAAIDLGSNSFHMLVVREVAGSIQTLTRIKRKVRLAAGLNSENALSNEAMERGWQCLRLFAERLQDIPPSQIR
VVATATLRLAVNAGDFIAKAQEILGCPVQVISGEEEARLIYQGVAHTTGGADQRLVVDIGGASTELVTGTGAQTTSLFSL
SMGCVTWLERYFADRNLGQENFDAAEKAAREVLRPVADELRYHGWKVCVGASGTVQALQEIMMAQGMDERITLEKLQQLK
QRAIHCGRLEELEIDGLTLERALVFPSGLAILIAIFTELNIQCMTLAGGALREGLVYGMLHLAVEQDIRSRTLRNIQRRF
MIDIDQAQRVAKVAANFFDQVENEWHLEAISRDLLISACQLHEIGLSVDFKQAPQHAAYLVRNLDLPGFTPAQKKLLATL
LLNQTNPVDLSSLHQQNAVPPRVAEQLCRLLRLAIIFASRRRDDLVPEMTLQANHELLTLTLPQGWLTQHPLGKEIIAQE
SQWQSYVHWPLEVH
>A4FG18 2.1.1.255~~~~~~Geranyl diphosphate 2-C-methyltransferase~~~COG2230
MTKSIHENGTAASVYQGSIAEYWNQEANPVNLELGEVDGYFHHHYGIGEPDWSVVEGDAATSHERTTRELHRLETWQAEF
LLDHLGGVEPEHRIMDAGCGRGGSSFMAHERFGCSVEGVSLSRKQVDFANAQARERGVADKVAFHQLNMLDTGFDTASMR
AIWNNESTMYVDLHDLFAEHSRLLARGGRYVTITGCYNDVYGLPSRAVSTINAHYICDIHPRSGYFRAMAANRLVPCAVV
DLTEATVPYWRLRAKSPLATGIEETFIEAYTSGSFQYLLIAADRV
>A3KI18 2.1.1.255~~~~~~Geranyl diphosphate 2-C-methyltransferase~~~
MTTESTSTVTAKIPAPATPYQGDIARYWNNEARPVNLRLGDVDGLYHHHYGIGAVDHAALGDPADSEYEKKLIAELHRLE
SAQAEFLMDHLGPIGSDDTLVDAGCGRGGSMVMAHRRFGCKVEGVTLSASQADFGNARARELRIEDHVRSRVCNMLDTPF
DKGSIAASWNNESTMYVDLHDLFAEHSRFLEVGGRYVTITGCWNPRYGQPSKWVSQINAHFECNIHSRREYLRAMADNRL
VPHTIVDLTPDTLPYWELRATSSLVTGIEKAFIESYRDGSFQYVLIAADRV
>Q9F1Y5 2.1.1.255~~~~~~Geranyl diphosphate 2-C-methyltransferase~~~COG2230
MTTETTTATATAKIPAPATPYQEDIARYWNNEARPVNLRLGDVDGLYHHHYGIGPVDRAALGDPEHSEYEKKVIAELHRL
ESAQAEFLMDHLGQAGPDDTLVDAGCGRGGSMVMAHRRFGSRVEGVTLSAAQADFGNRRARELRIDDHVRSRVCNMLDTP
FDKGAVTASWNNESTMYVDLHDLFSEHSRFLKVGGRYVTITGCWNPRYGQPSKWVSQINAHFECNIHSRREYLRAMADNR
LVPHTIVDLTPDTLPYWELRATSSLVTGIEKAFIESYRDGSFQYVLIAADRV
>D3KYU3 2.1.1.255~~~gdpmt~~~Geranyl diphosphate 2-C-methyltransferase~~~
MAAASAPVPGPGGASSTARGRIPAPATPYQEDIARYWNNEARPVNLRLGDVDGLYHHHYGIGAVDHAALGDPGDGGYEAR
LIAELHRLESAQAEFLLDHLGPVGPGDTLVDAGCGRGGSMVMAHQRFGCKVEGVTLSAAQAEFGNRRARELGIDDHVRSR
VCNMLDTPFEKGTVAASWNNESSMYVDLHDVFAEHSRFLRVGGRYVTVTGCWNPRYGQPSKWVSQINAHFECNIHSRREY
LRAMADNRLVPQTVVDLTPETLPYWELRATSSLVTGIEEAFIESYRDGSFQYVLIAADRV
>O05572 2.5.1.1~~~grcC2~~~Dimethylallyltranstransferase~~~COG0142
MIPAVSLGDPQFTANVHDGIARITELINSELSQADEVMRDTVAHLVDAGGTPFRPLFTVLAAQLGSDPDGWEVTVAGAAI
ELMHLGTLCHDRVVDESDMSRKTPSDNTRWTNNFAILAGDYRFATASQLASRLDPEAFAVVAEAFAELITGQMRATRGPA
SHIDTIEHYLRVVHEKTGSLIAASGQLGAALSGAAEEQIRRVARLGRMIGAAFEISRDIIAISGDSATLSGADLGQAVHT
LPMLYALREQTPDTSRLRELLAGPIHDDHVAEALTLLRCSPGIGKAKNVVAAYAAQAREELPYLPDRQPRRALATLIDHA
ISACD
>P22322 3.4.24.78~~~gpr~~~Germination protease~~~COG0680
MKKSELDVNQYLIRTDLAVETKEAMANQQAVPTKEIKGFIEKERDHGGIKIRTVDVTKEGAELSGKKEGRYLTLEAQGIR
ENDSEMQEKVSAVFAEEFSAFLENLNISKDASCLIVGLGNWNVTPDALGPMAVENLLVTRHLFKLQPENVQEGYRPVSAF
APGVMGITGIETSDIIKGVIEQSKPDFVIAIDALAARAVERVNTTIQISDTGIHPGSGVGNKRKDLSKDTLGVPVIAIGV
PTVVDAVTIASDTVDYILKHFGREMKDNRPSRSLVPAGMTFGKKKVLTEDDLPDQKQRQSFLGIVGTLQEDEKRQLIHEV
LSPLGHNLMVTPKEVDSFIDDMANVLANGLNTALHEKVSQENKGSYNH
>Q46851 1.1.1.-~~~gpr~~~L-glyceraldehyde 3-phosphate reductase~~~COG0667
MVWLANPERYGQMQYRYCGKSGLRLPALSLGLWHNFGHVNALESQRAILRKAFDLGITHFDLANNYGPPPGSAEENFGRL
LREDFAAYRDELIISTKAGYDMWPGPYGSGGSRKYLLASLDQSLKRMGLEYVDIFYSHRVDENTPMEETASALAHAVQSG
KALYVGISSYSPERTQKMVELLREWKIPLLIHQPSYNLLNRWVDKSGLLDTLQNNGVGCIAFTPLAQGLLTGKYLNGIPQ
DSRMHREGNKVRGLTPKMLTEANLNSLRLLNEMAQQRGQSMAQMALSWLLKDDRVTSVLIGASRAEQLEENVQALNNLTF
STKELAQIDQHIADGELNLWQASSDK
>P22321 3.4.24.78~~~gpr~~~Germination protease~~~COG0680
MEKELDLSQYSVRTDLAVEAKDIALENQPKPNNQSEIKGVIVKEKEEQGVKISMVEITEEGAEAIGKKKGRYVTLESVGI
REQDTEKQEAMEEVFAKELNFFIKSLNIPDDASCLVVGLGNLSVTPDALGPKAVDNLLITRHLFELQPESVQDGFRPVSA
IVPGVMGMTGIETSDIIFGVVKKVNPDFIIAIDALAARSIERVNATIQISDSGIHPGSGVGNKRKEISYETLGIPVIAIG
IPTVVDAVSITSDTIDFILKHFGREMKEQGKPSKSLLPSGMTFGEKKKLTEDDLPNEEQRQTYLGMIGTLPDEEKRRLIH
EVLAPLGHNLMVTPKEVDMFIEDMANVVAGGLNAALHHEVDQENFGAYTH
>P0CI74 ~~~gpsB~~~Cell cycle protein GpsB~~~COG3599
MLADKVKLSAKEILEKEFKTGVRGYKQEDVDKFLDMIIKDYETFHQEIEELQQENLQLKKQLEEASKKQPVQSNTTNFDI
LKRLSNLEKHVFGSKLYD
>Q8Y614 ~~~gpsB~~~Cell cycle protein GpsB~~~COG3599
MTSEQFEYHLTGKEILEKEFKTGLRGYSPEDVDEFLDMVIKDYSTFTQEIEALQAENIRLVQELDNAPLRTSTQPAPTFQ
AAAQPAGTTNFDILKRLSNLEKHVFGNKLDDNE
>Q7A5L1 ~~~gpsB~~~Cell cycle protein GpsB~~~
MSDVSLKLSAKDIYEKDFEKTMARGYRREEVDAFLDDIIADYQKMADMNNEVVKLSEENHKLKKELEELRLRVATSRPQD
NKSFSSNNTTTNTSSNNVDILKRISNLEKAVFGK
>Q8DR57 ~~~gpsB~~~Cell cycle protein GpsB~~~COG3599
MASIIFSAKDIFEQEFGREVRGYNKVEVDEFLDDVIKDYETYAALVKSLRQEIADLKEELTRKPKPSPVQAEPLEAAITS
SMTNFDILKRLNRLEKEVFGKQILDNSDF
>Q9I6K2 3.5.3.17~~~gpuA~~~Guanidinopropionase~~~
MSNDHPQPLDAAEIPRFAGIPTFMRLPAFTDPAALQVGLIGVPWDGGTTNRAGARHGPREVRNLSSLMRKVHHVSRIAPY
DLVRVGDLGDAPVNPIDLLDSLRRIEGFYRQVHAAGTLPLSVGGDHLVTLPIFRALGRERPLGMVHFDAHSDTNDRYFGD
NPYTHGTPFRRAIEEGLLDPLRTVQIGIRGSVYSPDDDAFARECGIRVIHMEEFVELGVEATLAEARRVVGAGPTYVSFD
VDVLDPAFAPGTGTPEIGGMTSLQAQQLVRGLRGLDLVGADVVEVSPPFDVGGATALVGATMMFELLCLLAESAARSA
>P74250 1.11.1.22~~~gpx1~~~Hydroperoxy fatty acid reductase gpx1~~~COG0386
MTAQANNTIYGFSANALDGSPVALRDFEGKVLLIVNTASQCGFTPQYQGLQALYNRFGDRGFTVLGFPCNQFGQQEPGGS
GEIKNFCETRYGVTFPLFEKVEVNGPNAHPLFKFLTAASPGMAIPFLGGAEDIKWNFTKFLVDRQGKVVKRYGSIAKPDE
IAADIEKLL
>P73824 1.11.1.22~~~gpx2~~~Hydroperoxy fatty acid reductase Gpx2~~~COG0386
MPLPTSLTTLDGTPLAPEVIADKVVLFVNVASKCGLTPQYSGLVALDKAYGEKGLVIIGVPCNQFGAQEPGSPEEIKDFT
KTKYDVDFTLLEKQDVNGPNRSPLYQFLVGDGEDISWNFGKFLIGRDGQVVARFDPQTKPDDTNLKAAIEKALG
>Q9ZA33 1.1.1.384~~~~~~dTDP-3,4-didehydro-2,6-dideoxy-alpha-D-glucose 3-reductase~~~
MNAGHTETVRALRIGVAGCADIALRRMLPAFAASPHTEPTAVASRSSEKARAAAETFGCAAVEGYDALLERRDVDAVYIP
LPVALHAPWTERALRAGKHVLAEKPLTARAADTARLLDLARERGLVLAENYLFVHHSAYTAVRDLVDAGAIGDVRALSAS
FTIPPRSADDIRYRADLDGGALLDIGVYPLRLASLLLGSELRVRGAVLRHDTVRGVDLGGSALLGDPGTGVSAQLVFGME
HAYTAGWRLLGSEGSLTLDRAYSPPAGHRPVLRIERPDGTEERILPAHDQATAAVAAFAEAVRRAGQGAGQDKGRSSGDA
AAVLRQAELVDAVRQAAHLVKI
>Q9ZA32 4.2.1.159~~~~~~dTDP-4-dehydro-6-deoxy-alpha-D-glucopyranose 2,3-dehydratase~~~
MRITDTAGFHAWFAERGAAHRYRITRTPLHDLEGWYTDPASGDVRHRSGRFFSIEGLRYGRQEPDGPAWTQPIIRQPETG
VLGVLIKWFDGVPHLLMQAKMEPGNINTLQVSPTVQATFSNYTRVHHGSPVRYIDHFLTPGAGDRVHYDALQSEQGSWFL
GKRNRNIVVETTGEIPVHEDFCWVPRPVMAELLRVDNLVNMDSRTVLAGLPDDPGEGSVPRRAVEKPLHDTAALLHWFTG
AKVRHRPERTTIPLSRVGGWRRDDDRGEIVHETGRYFRIIGVDVEADSREVTSWSQPMLAPVGRGVVAFVSKEIHGERHL
LVQARAEAGTFDAVELGPTVQCNPGNLPDGAPRPPYLDTVLTARPEQVLFDTVHSEEGGRFYHAENRYLVLDGDDVPVDV
PEDYTWMTVRQLTRAGRIGNLVDVEARTLLACVRTLPDHGASR
>A1IIX2 1.14.14.27~~~graA~~~FADH(2)-dependent resorcinol hydroxylase, oxygenase component~~~
MNDMSHAPQPAQTKPHVRLVGRVAGVADLFRSSARQTEEARRVPASHIAALRGIGYFDIVKPRAFGGQGGEFAELVEANI
ELSAACASTGWVAGLLSAHQWLLAMFPEEAQADVWDENPDALLCGSYAPVKMAEAADGGYRLSGKWAFASGCENAQWSLC
AAILPPQAKGRPVPAFLLVPASQYAIEDTWHVVGLAGTVSKTLVLDDVFVPKHRVLTFPDATSGHTPGGRFYAQEGLFNM
PLLTGIPSCLASTGVGAAKGALAAYVDHVGGRVTRGAVAGGNNRMAEFPTIQLRVAEAAASVDAACEILLRDVARAQALS
QARLEGRAEFSVDDRLLSRRGQSFSVSFSLRAVQALNDSTGGVGLDLSNPVQRAWRDANAVGRHISMNWDAVGTMIGQSM
LGLEPKGQY
>A1IIX3 1.13.11.37~~~graB~~~Hydroxyquinol 1,2-dioxygenase~~~
MDMKTTGDDGYFVEERSAETVIARMRDCDDPRLKEIMAVVTRKLHEAVKEIEPTEEEWMKAIHFLTEVGQICNEWRQEWI
LFSDILGVSMLVDAINHRKPSGASESTVLGPFHVADAPEMPMGANICLDGKGEDMLVTGRILDTDGVPVAGARIDVWQAN
DEGFYDVQQKGIQPDFNLRGVFVTGEDGRYWFRAAKPKYYPIPDDGPVGQLLRAMGRHPYRPAHLHYIVSAEGFTTLVTH
IFDPDDPYIRSDAVFGVKESLLADFQRVEDAQRAQELGFANGWFWSVDHDFVLAR
>A1IIX4 1.3.1.32~~~graC~~~Maleylacetate reductase~~~
MQPFVYTTAPARIVFGTGSSVGVAEEIRRLGLSRALVLSTPHQKGDAEALAARLGPLAAGVFSDAAMHTPVEVTKRAVEA
YRAAGADCVVSLGGGSTTGLGKAIALRTDAPQIVIPTTYAGSEVTPILGQTENGVKTTLRGPEILPEVVIYDAELTLGLP
VGISMTSGLNAMAHAAEALYARDRNPIASMMAVEGLRAMIEALPGVRMEPQDTKARETALYGAWLCGTVLGAVGMSLHHK
LCHTLGGSLDLPHAETHAVLLPYTIAYVEQAVPDQLAPLAALVGGRAGTGLYDFAARLGAPASLAALGVGGEDLDAMAEL
ATANPYWCPRPVEKTAIRALLQRAFEGARPE
>A1IIX5 1.5.1.37~~~graD~~~FADH(2)-dependent resorcinol hydroxylase, reductase component~~~
MTSALFGLNNLAPEGVGQNFRTTMRRFPATVTVITACATGDQRDHGMTVTAVTSVSMEPPSLLVCLNNRTFLHELLLCRP
DFIVNVLTQDQIALSDAFSGKVSPEERFRNGEWQRHDNGVLYLPTAHAAIACRRVAAMPYGTHTVFIGQVVSADVSETTR
PLLYENAQYCAASPAGLSA
>Q2G0E0 ~~~graR~~~Response regulator protein GraR~~~COG0745
MQILLVEDDNTLFQELKKELEQWDFNVAGIEDFGKVMDTFESFNPEIVILDVQLPKYDGFYWCRKMREVSNVPILFLSSR
DNPMDQVMSMELGADDYMQKPFYTNVLIAKLQAIYRRVYEFTAEEKRTLTWQDAVVDLSKDSIQKGDQTIFLSKTEMIIL
EILITKKNQIVSRDTIITALWDDEAFVSDNTLTVNVNRLRKKLSEISMDSAIETKVGKGYMAHE
>Q5HI09 ~~~graR~~~Response regulator protein GraR~~~
MQILLVEDDNTLFQELKKELEQWDFNVAGIEDFGKVMDTFESFNPEIVILDVQLPKYDGFYWCRKMREVSNVPILFLSSR
DNPMDQVMSMELGADDYMQKPFYTNVLIAKLQAIYRRVYEFTAEEKRTLTWQDAVVDLSKDSIQKGDQTIFLSKTEMIIL
EILITKKNQIVSRDTIITALWDDEAFVSDNTLTVNVNRLRKKLSEISMDSAIETKVGKGYMAHE
>Q932F1 ~~~graR~~~Response regulator protein GraR~~~
MQILLVEDDNTLFQELKKELEQWDFNVAGIEDFGKVMDTFESFNPEIVILDVQLPKYDGFYWCRKMREVSNVPILFLSSR
DNPMDQVMSMELGADDYMQKPFYTNVLIAKLQAIYRRVYEFTAEEKRTLTWQDAVVDLSKDSIQKGDDTIFLSKTEMIIL
EILITKKNQIVSRDTIITALWDDEAFVSDNTLTVNVSRLRKKLSEISMDSAIETKVGKGYMAHE
>Q2G0D9 2.7.13.3~~~graS~~~Sensor histidine kinase GraS~~~COG2205
MNNLKWVAYFLKSRMNWIFWILFLNFLMLGISLIDYDFPIDSLFYIVSLNLSLTMIFLLLTYFKEVKLYKHFDKDKEIEE
IKHKDLAETPFQRHTVDYLYRQISAHKEKVVEQQLQLNMHEQTITEFVHDIKTPVTAMKLLIDQEKNQERKQALLYEWSR
INSMLDTQLYITRLESQRKDMYFDYVSLKRMVIDEIQLTRHISQVKGIGFDVDFKVDDYVYTDIKWCRMIIRQILSNALK
YSENFNIEIGTELNDQHVSLYIKDYGRGISKKDMPRIFERGFTSTANRNETTSSGMGLYLVNSVKDQLGIHLQVTSTVGK
GTTVRLIFPLQNEIVERMSEVTNLSF
>Q5HI08 2.7.13.3~~~graS~~~Sensor histidine kinase GraS~~~
MNNLKWVAYFLKSRMNWIFWILFLNFLMLGISLIDYDFPIDSLFYIVSLNLSLTMIFLLLTYFKEVKLYKHFDKDKEIEE
IKHKDLAETPFQRHTVDYLYRQISAHKEKVVEQQLQLNMHEQTITEFVHDIKTPVTAMKLLIDQEKNQERKQALLYEWSR
INSMLDTQLYITRLESQRKDMYFDYVSLKRMVIDEIQLTRHISQVKGIGFDVDFKVDDYVYTDIKWCRMIIRQILSNALK
YSENFNIEIGTELNDQHVSLYIKDYGRGISKKDMPRIFERGFTSTANRNETTSSGMGLYLVNSVKDQLGIHLQVTSTVGK
GTTVRLIFPLQNEIVERMSEVTNLSF
>Q99VW1 2.7.13.3~~~graS~~~Sensor histidine kinase GraS~~~
MNNLKWVAYFLKSRMNWIFWILFLNLLMLGISLIDYDFPIDSLFYIVSLNLSLTMIFLILTYFKEVKLYKHFDKDKEIEE
IKHKDLAETPFQRHTVDYLYRQISAHKEKVVEQQLQLNMHEQTITEFVHDIKTPVTAMKLLIDQEKNQERKQALLYEWSR
INSMLDTQLYITRLESQRKDMYFDYVSLKRMVIDEIQLTRHISQVKGIGFDVDFKVDDYVYTDTKWCRMIIRQILSNALK
YSENFNIEIGTELNDQHVSLYIKDYGRGISKKDMPRIFERGFTSTANRNETTSSGMGLYLVNSVKDQLGIHLQVTSTVGK
GTTVRLIFPLQNEIVERMSEVTNLSF
>Q7A6Z3 2.7.13.3~~~graS~~~Sensor protein kinase GraS~~~
MNNLKWVAYFLKSRMNWIFWILFLNLLMLGISLIDYDFPIDSLFYIVSLNLSLTMIFLILTYFKEVKLYKHFDKDKEIEE
IKHKDLAETPFQRHTVDYLYRQISAHKEKVVEQQLQLNMHEQTITEFVHDIKTPVTAMKLLIDQEKNQERKQALLYEWSR
INSMLDTQLYITRLESQRKDMYFDYVSLKRMVIDEIQLTRHISQVKGIGFDVDFKVDDYVYTDTKWCRMIIRQILSNALK
YSENFNIEIGTELNDQHVSLYIKDYGRGISKKDMPRIFERGFTSTANRNETTSSGMGLYLVNSVKDQLGIHLQVTSTVGK
GTTVRLIFPLQNEIVERMSEVTNLSF
>Q8NXR5 2.7.13.3~~~graS~~~Sensor protein kinase GraS~~~
MNNLKWVAYFLKSRMNWIFWILFLNLLMLGISLIDYDFPIDSLFYIVSLNLSLTMIFLILTYFKEVKLYKHFDKDKEIEE
IKHKDLAETPFQRHTVDYLYRQISAHKEKVVEQQLQLNMHEQTITEFVHDIKTPVTAMKLLIDQEKNQERKQALLYEWSR
INSMLDTQLYITRLESQRKDMYFDYVSLKRMVIDEIQLTRHISQVKGIGFDVDFKVDDYVYTDIKWCRMIIRQILSNALK
YSENFNIEIGTELNDQHVSLYIKDYGRGISKKDMPRIFERGFTSTANRNETTSSGMGLYLVNSVKDQLGIHLQVTSTVGK
GTTVRLIFPLQNEIVERMSEVTNLSF
>Q2G0E1 ~~~graX~~~Auxiliary protein GraX~~~COG0451
MKPKVLLAGGTGYIGKYLSEVIENDAELFAISKYPDNKKTDDVEMTWIQCDIFHYEQVVAAMNQIDIAVFFIDPTKNSAK
ITQSSARDLTLIAADNFGRAAAINQVKKVIYIPGSRYDNETIERLGAYGTPVETTNLVFKRSLVNVELQVSKYDDVRSTM
KVVLPKGWTLKNVVNHFIAWMGYTKGTFVKTEKSHDQFKIYIKNKVRPLAVFKIEETADGIITLILLSGSLVKKYTVNQG
KLEFRLIKESAVVYIHLYDYIPRLFWPIYYFIQAPMQKMMIHGFEVDCRIKDFQSRLKSGENMKYTK
>P68066 ~~~grcA~~~Autonomous glycyl radical cofactor~~~COG3445
MITGIQITKAANDDLLNSFWLLDSEKGEARCIVAKAGYAEDEVVAVSKLGDIEYREVPVEVKPEVRVEGGQHLNVNVLRR
ETLEDAVKHPEKYPQLTIRVSGYAVRFNSLTPEQQRDVIARTFTESL
>P44455 ~~~grcA~~~Autonomous glycyl radical cofactor~~~COG3445
MIKGIQITQAANDNLLNSFWLLDSEKNEARCLCAKGEFAEDQVVAVSELGQIEYRELPVNVAPTVKVEGGQHLNVNVLRR
ETLEDAVNNPDKYPQLTIRVSGYAVRFNSLTPEQQRDVITRTFTESL
>P18953 ~~~grcA~~~Autonomous glycyl radical cofactor~~~
MITGIQITKANDQALVNSFWLLDDEKAEARCVCANGQYAEDQVVAVSDLGQIEYREVPLEMQPTVRVEGGQHLNVNVLRR
ETLEDAVKHPEKYPQLTIRVSGYAVRFNSLTPEQQRDVIARTFTESL
>P50972 1.21.4.2~~~grdA1~~~Glycine/sarcosine/betaine reductase complex component A1~~~
MSLFDGKKVIIIGDRDGIPGPAIAECLKGTAAEVVYSATECFVUTAAGAMDLENQNRVKGFADQFGAENLVVLVGAAEAE
SAGLAAETVTAGDPTFAGPLAGVQLGLRVFHAVEPEFKDAVDSAVYDEQIGMMEMVLDVDSIIAEMKSIREQFGKFND
>P26971 1.21.4.2~~~grdA~~~Glycine/sarcosine/betaine reductase complex component A~~~
MSRFTGKKIVIIGDRDGIPGPAIEECLKPIDCEVIFSSTECFVUTAAGAMDLENQKRIKEATEKFGAENLVVLIGAAEAE
AAGLAAETVTAGDPTFAGPLAGVELGLRVYHAVEPEFKDEVDAQIFDDQVGMMEMVLNVDEIIEEMQSIRSQFCKFND
>P26970 1.21.4.2~~~grdA~~~Glycine/sarcosine/betaine reductase complex component A~~~
MILQGKKVIAIGDRDGIPGPAIEECVKSAGAEIAFSSTECFVUTAAGAMDLEIQQKVKDAAESIGADNLVVVLGGAEAES
SGLSAETVTTGDPTYAGPLAGVELGLKVYHVVEDELKAEFDEAIYEDQCGMMEMVLDVDGIKEEMNRVRG
>P52216 1.21.4.2~~~grdA~~~Glycine/sarcosine/betaine reductase complex component A~~~
MSLFDGKKVIIIGDRDGIPGPAMAECLKGINVEVVYSATECFVUTAAGAMDLENQNWVKNFTDQYGAENIIVLVGAAEAE
SAGLAAETVTAGDPTFAGPLAGVQLGLRVFHAVEPEFKGAVDSAIYDEQIGMMEMVLDVDSIIEEMKSIRADYCKFND
>Q9R4G8 1.21.4.2~~~grdB~~~Glycine reductase complex component B subunit gamma~~~
MSKIRVVHYINQFFAGVGGEEKADIEPFIAESLPPVSQSLSNLIKDEAEVVGTVVCGDSYFGENLVEAKNRILEMIKSFN
PDIVVAGPAFNAGRYGVAAATVTKAVQDELGIPAVTGMYIENPGADMFKKYAYIISTGNSAAAMRTALPAMAKFAMKLAK
GEEIGGPVAEGYIERGIRFNMFKEDRGAKRAVAMLVKKLKGEEYETEYPMPSFDKVEPGKAIKDMSKAKIAIVTSGGIVP
KGNPDRIESSSASKYGKYDIQGIDDLTSEGWETAHGGHDPIYANEDADRVIPVDVLRDMEKEGVIGELHRYFYSTTGNGT
AVASSKKFAEEFTKELVADGVDAVILTSTUGTCTRCGASMVKEIERSGIPVVHIATVTPISLTVGANRIVPAIAIPHPLG
NPALSHEEEKALRRKIVEKALEALQTEVEEQTVFERNY
>P54935 1.21.4.2~~~grdC~~~Glycine/sarcosine/betaine reductase complex component C subunit beta~~~
MNFPVLKGAGYVLVHTPDMIMHNGTTQTTEKIVNPESEYLKKLPEHLRSFEDVVAYAPNQTYIGSMTPEALGEIAMPWWT
EDKKVAGADRYGKLGEIMPQDEFLALMSASDVFDLVLFEKEFIEGAKAKLAAHPVVGNLAESVNAGVELAEIEKQLSEFH
AEGLYNNGKLVGCVKRAHDVDVNLNSHTMLENLAVKASGVLALANLIAKNNVNPAEVDYIIECSEEACGDMNQRGGGNFA
KALAEMTGCVNATGSDMRGFCAGPTHALIAAAALVKSGVYKNVIIAAGGATAKLGMNGKDHVKKEMPILEDCLGGFAVLV
SENDGVNPILRTDLVGRHTVATGSAPQAVIGSLVLSPLKAGGLKITDVDKYSVEMQNPDITKPAGAGDVPEANYKMIAAL
AVMGKEIERADIAAFVEKHGMVGWAPTQGHIPSGVPYIGFAISDLTEGSVNRTMIVGKGSLFLGRMTNLFDGVSIVAERN
TGKVESGSSVSTEEIRKMIAESMKDFAAHLLAE
>Q12BV1 4.1.1.103~~~~~~Gamma-resorcylate decarboxylase~~~COG2159
MNGKIALEEHFATEETLMDSAGFVPDKDWPELRSRLLDIQDRRVRLMDEHGIETMILSLNAPAVQAIADSTRANETARRA
NDFLAEQVAKQPTRFRGFAALPMQDPELAARELERCVKELGFVGALVNGFSQDNRSAVPLYYDMAQYWPFWETVQALDVP
FYLHPRNPLPSDARIYDGHAWLLGPTWAFGQETAVHALRLMGSGLFDKYPALKIILGHMGEGLPYSMWRIDHRNAWIKTT
PKYPAKRKIVDYFNENFYLTTSGNFRTQTLIDAILEIGADRILFSTDWPFENIDHAADWFENTSISEADRKKIGWGNAQN
LFKLNR
>Q60FX6 4.1.1.103~~~rdc~~~Gamma-resorcylate decarboxylase~~~
MQGKVALEEHFAIPETLQDSAGFVPGDYWKELQHRLLDIQDTRLKLMDAHGIETMILSLNAPAVQAIPDRKKAIEIARRA
NDVLAEECARRPDRFLAFAALPLQDPDAATQELQRCVNDLGFVGALVNGFSQEGDGQTPLYYDLPQYRPFWGEVEKLDVP
FYLHPRNPLPQDSRIYDGHPWLLGPTWAFAQETAVHALRLMASGLFDAHPRLNIILGHMGEGLPYMMWRIDHRNAWVKLP
PRYPAKRRFVDYFNENFHITTSGNFRTQTLIDAILEIGADRILFSTDWPFENIDHASDWFNATTIAEADRVKIGRTNARR
LFKLDGR
>Q60GU1 4.1.1.103~~~graF~~~Gamma-resorcylate decarboxylase~~~
MQGKVALEEHFAIPETLQDSAGFVPGDYWKELQHRLLDIQDTRLKLMDAHGIETMILSLNAPAVQAIPDRRKAIEIARRA
NDVLAEECAKRPDRFLAFAALPLQDPDAATEELQRCVNDLGFVGALVNGFSQEGDGQTPLYYDLPQYRPFWGEVEKLDVP
FYLHPRNPLPQDSRIYDGHPWLLGPTWAFAQETAVHALRLMASGLFDEHPRLNIILGHMGEGLPYMMWRIDHRNAWVKLP
PRYPAKRRFMDYFNENFHITTSGNFRTQTLIDAILEIGADRILFSTDWPFENIDHASDWFNATSIAEADRVKIGRTNARR
LFKLDGA
>Q0SFL6 4.1.1.103~~~tsdA~~~Gamma-resorcylate decarboxylase~~~COG2159
MQGKIALEEHFAIPETLNDSAGFVPGTYWDELQARLLDIQDVRLKLMDEHNIETMILSLNAPAVQAIPERERAIDIARRA
NDVLAEECAKRPDRFRGFAALPLQDPDAAAEELRRCVTELGFVGALVNGFSQSATVDGGSTPLYYDLPRYRPFWAEVERL
DVPFYLHPRNPLNQDARIYEGHPWLLGPTWAFAQETAVHALRLMASGLFDEHPGLRIVLGHMGEGIPAMLWRIDHRNAWV
DVPPAYPAKRRMVDYFTENFFVTTSGNFRTQTLIDLLLELGSERVMFSTDWPFENINHAAEWFDAASISEADRLKIGRTN
AATLFKLDR
>Q47878 1.21.4.2~~~grdD~~~Glycine/sarcosine/betaine reductase complex component C subunit alpha~~~
MSDIKQMIGKTFMEIADAIETGSFAGKVKVGITTLGSEHGVENLVKGAELAAKDAAGFDIVLIGPKVETSLEVVEVATEE
EAHKKMEELLDSGYIHSCVTVHYNFPIGVSTVGRVVTPGMGKEMFIATTTGTSAAQRVEAMVRNALYGIITAKSMGIENP
TVGILNLDGARAVERALKELAGNGYPITFAESLRADGGSVMRGNDLLGGAADVMVTDSLTGNIMMKVFSSYTTGGSYEGL
GYGYGPGIGDGYNRTILILSRASGVPVAANAIKYAAKLAQNNVKAIAAAEFKAAKAAGLESILAGLSKDTKKASTEEEVK
MPPKEVVTGTISGVDVMDLEDAQKVLWKAGIYAESGMGCTGPIVMVNEAKVEEAAKILKDAGIVA
>Q9EV94 1.21.4.2~~~grdE~~~Glycine reductase complex component B subunits alpha and beta~~~
MRLEVGNIFIKDIQFGDSTKVENGVLYVNKQELISELSSDEHIKSIDMEIVRPGESVRIAPVKDVIEPRVKVEGNGGIFP
GFLSKVDTVGEGKTNVLKGAAVVTTGKVVGFQEGIIDMTGPGADYTPFSKTCNVVIIAEPVDGLKQHDHEAALRMVGLKA
GKYLGEAGRNITPDEVKVYETKPIFESVKEYPNLPKVAYVYMLQTQGLLHDTYVYGVDAKKIIPTLIYPTEVMDGAILSG
NCVSACDKNPTYVHMNNPVIHDLYELHGKEYNFVGVIITNENVYLADKERSSNWTAKMAEYLGLDGVIISEEGFGNPDTD
LIMNCKKITKKGIKTVILTDEYAGRDGASQSLADADAAADACVTGGNANMTIVLPKLDKIIGHVSKDVIDVIAGGFDGSL
RADGSIEVEIQAITGATSEVGFNKMTAKTY
>Q9R4G7 1.21.4.2~~~grdE~~~Glycine reductase complex component B subunits alpha and beta~~~
MRLEIGNIFIKDIQFGEQTKVENGVLYVNKDEMIKKLSVIEHIKSVDLDIARPGESVRITPVKDVIEPRVKVEGPGGIFP
GVISKVETDGSGRTHVLKGAAVVTTGKVVGFQEGIVDMSGVGAEYTPFSKTLNLVVIAEPEDGIEQHRHEEVLRMVGLNA
GVYIGEAGRSVTPDEVKVYETDTIFEGAAKYPNLPKVGYVYMLQTQGLLHDTYVYGVDAKKIVPTILYPTEVMDGAILSG
NCVSSCDKNPTYVHCNNPMVEELYAMHGKEINFVGVIITNENVYLADKERSSDWTAKLCKFLGLDGAIVSQEGFGNPDTD
LIMNCKKIEMEGVKTVISTDEYAGRDGASQSLADADVRANAVVSNGNANMVIVLPPMDKTIGHIQYIDTIAGGFDGSLRA
DGSIEVEIQAITGATNELGFGYLSAKGY
>O69407 1.21.4.4~~~grdH~~~Betaine reductase complex component B subunit beta~~~
MKKAILYLNQFFGQVGGEDKADYEPEIINGQVGAAMMLNGVLEGAEVTHTIICGDNFMGTYKDEAVSRIMGFLEDKEFDI
FLAGPAFQAGRYGVACGEICKVVKEKYNVPVVTSMHVENPGVQMFKKDMYVMIGGNNAGRMRQDMSAMAKVANKIIAGEK
IGPADEEGFFPRGKRHQHWREDGKPASERVVDMLLKKLSGEEFQTELPIPKSDRVEIAAPIKDLSKATIAVVTTGGIVPV
DNPDRIQSASATRWGMYDVTGLERLEGGVYKTIHAGFDPAAADADPNVIVPLDALRAYEKEGKIGKVHEYFYSTVGTGTT
EAEAARMAKEIVVKLKQGGVDGVIMTSTUGTCTRCGATMVKEIERAGFPIVQMCNLIPVASTVGANKIVPTISIPYPLGD
PSTSKEQQWKLRYHRVGTALDALTVDVQEQTIFKVKI
>O69406 1.21.4.4~~~grdI~~~Betaine reductase complex component B subunit alpha~~~
MKLELGNFYVEEIVFGEKTSFKDGVLTINKQEALDYVMEDENITHAELHIVKPGDMVRLCPVKEAIEPRIKLDGRTYFPG
VTDEELTRCGEGRTHALKGCSVLVVGKHWGGFQDGLIDMGGEGAKYTYYSTLKNIVLVGDTNEDFEKNEQQKKNKALRWA
GHKLAEYIGKTVKDMEPQEVETYELEPVTQRSEEVTKLPGVVFVMQPQSQMEELGYNDMVYGWDMNRMVPTYMHPNEVLD
GAIISGSFMPCSSKWSTYDFQNFPALKRLYAEHGKTVNFLGVIMSNLNVALQQKQRSALFVAQMAKSLGAQGAIVAEEGY
GNPDADFIACIVALENEGIKTVGLTNECTGRDGFSQPLVTLDEKANAIVSCGNVSELVELPPMPVVLGELEALARDGLSG
GWAGDEILGSSVKADGSVIMENNAMFCGDQVVGWSTKTMKEF
>P80240 ~~~greA~~~Transcription elongation factor GreA~~~COG0782
MAQEKVFPMTAEGKQKLEQELEYLKTVKRKEVVERIKIARSFGDLSENSEYDSAKEEQAFVEGRVTTLENMIRNAKIIED
DGGSNVVGLGKTVTFVELPDGDEESYTIVGSAEADPFEGKISNDSPIAKSLLGKKVDEEVTVQTPGGEMLVKIVKIS
>P0A6W5 ~~~greA~~~Transcription elongation factor GreA~~~COG0782
MQAIPMTLRGAEKLREELDFLKSVRRPEIIAAIAEAREHGDLKENAEYHAAREQQGFCEGRIKDIEAKLSNAQVIDVTKM
PNNGRVIFGATVTVLNLDSDEEQTYRIVGDDEADFKQNLISVNSPIARGLIGKEEDDVVVIKTPGGEVEFEVIKVEYL
>A0R2X1 ~~~greA~~~Transcription elongation factor GreA~~~COG0782
MTDTQVTWLTQEAFDRLKAELDQLIANRPVIAAEINDRREEGDLRENGGYHAAREEQGQQEARIRQLQELLNNAKVGEAP
KQSGVALPGSVVKVYYDDDENDTETFLIATRQEGISDGKLEVYSPNSPLGGALLDAKVGESRTYTVPSGNVVKVTLVSAE
PYQG
>P9WMT9 ~~~greA~~~Transcription elongation factor GreA~~~COG0782
MTDTQVTWLTQESHDRLKAELDQLIANRPVIAAEINDRREEGDLRENGGYHAAREEQGQQEARIRQLQDLLSNAKVGEAP
KQSGVALPGSVVKVYYNGDKSDSETFLIATRQEGVSDGKLEVYSPNSPLGGALIDAKVGETRSYTVPNGSTVSVTLVSAE
PYHS
>P99156 ~~~greA~~~Transcription elongation factor GreA~~~
MENQKQYPMTQEGFEKLERELEELKTVKRPEVVEKIKVARSFGDLSENSEYDAAKDEQGFIEQDIQRIEHMLRNALIIED
TGDNNVVKIGKTVTFVELPGDEEESYQIVGSAESDAFNGKISNESPMAKALIGKGLDDEVRVPLPNGGEMNVKIVNIQ
>Q5XDQ7 ~~~greA~~~Transcription elongation factor GreA~~~
MAEKTYPMTLTEKEQLEKELEELKLVRRPEIVERIKIARSYGDLSENSEYDAAKDEQAFVEGQISTLETKIRYAEIIDSD
AVAKDEVAIGKTVIVQEVGTTDKDTYHIVGAAGADIFSGKISNESPIAQALIGKKTGDKVRIESPAATYDVEIISVEKTN
>P30128 ~~~greB~~~Transcription elongation factor GreB~~~COG0782
MKTPLVTREGYEKLKQELNYLWREERPEVTKKVTWAASLGDRSENADYQYNKKRLREIDRRVRYLTKCLENLKIVDYSPQ
QEGKVFFGAWVEIENDDGVTHRFRIVGYDEIFGRKDYISIDSPMARALLKKEVGDLAVVNTPAGEASWYVNAIEYVKP
>B1VTI5 1.10.3.15~~~griF~~~Grixazone synthase~~~COG2304
MVHVRKNHLTMTAEEKRRFVHAVLEIKRRGIYDRFVKLHIQINSTDYLDKETGKRLGHVNPGFLPWHRQYLLKFEQALQK
VDPRVTLPYWDWTTDHGENSPLWSDTFMGGNGRPGDRRVMTGPFARRNGWKLNISVIPEGPEDPALNGNYTHDDRDYLVR
DFGTLTPDLPTPQELEQTLDLTVYDCPPWNHTSGGTPPYESFRNHLEGYTKFAWEPRLGKLHGAAHVWTGGHMMYIGSPN
DPVFFLNHCMIDRCWALWQARHPDVPHYLPTVPTQDVPDLNTPLGPWHTKTPADLLDHTRFYTYDQ
>A0JC76 4.1.99.20~~~griH~~~3-amino-4-hydroxybenzoic acid synthase~~~
MSSSPSPSPSSSSSSSASSSASSSPSSSSKLTWLDIRSVGEARAAIVQEALHHRVEALVADDPAHLADLPPTVAKVLLVV
GKQIPEEFGEATVVVVDPSKHGVTPAELALKHPEIEFGRFVEIIDAPTLEDACESSRTEKWSVLLFRDPTKIPLEIVIAA
AARASGSMVTIAQDLEEAEILFGVLEHGSDGVMMAPKTVGDAAELKRIAEAGIPNLNLTELRVVETSHIGMGERACVDTT
THFGEDEGILVGSHSKGMILCVSETHPLPYMPTRPFRVNAGAIHSYTLGRDERTNYLSELKTGSKLTAVDIKGNTRLVTV
GRVKIETRPLISIDAEAPDGRRVNLILQDDWHVRVLGPGGTVLNSTELKPGDTVLGYLPVEDRHVGYPINEFCLEK
>A0JC77 4.1.2.56~~~griI~~~2-amino-4,5-dihydroxy-6-oxo-7-(phosphonooxy)heptanoate synthase~~~
MAPNAPFARSLRLQRLHHHDPDRLFIVPLDHSITDGPLSRAHRLDPLVGELASHHVDGIVLHKGSLRHVDPEWFTRTSLI
VHLSASTVHAPDPNAKYLVSSVEESLRMGADAVSVHVNLGSEGERHQIADMAAVAEACDRWNVPLLAMMYPRGPKIDDPR
DPALVAHAVQVAVDLGADLVKTLYVGSVAAMAEITAASPVPVVVVGGPRDSDESRILAYVDDALRGGAAGVAMGRNVFQA
PDPGAMADKLSDLIHNSGTRGAARAPAGAAAGAA
>A0A0E3URH8 ~~~griR~~~Beta sliding clamp homolog GriR~~~
MRFQVEREVLAEGIGWVARGLAVRPSVPILSGVVVNAEGDTLTLSGFDYEVSTRVELKANVEESGTVLIPGRRLADIAKV
LPDVPIEFNVDQTKVYVQCDSNSFVLNALPLDEYPTLPKLPTVCGSVEGDQFARAVSQVAVVASRDDALPVLTGIGVNFD
GEIMKLNATDRYRFAIRELAWKPEGTPSSSSVLVPARTLLDFAKSLNKGDLVKIALSDEGNLLGLHAGTRQMTCRLLEGT
LPDYEKLFPKEFTSFGAVEVSRLVEALKRVSLVLERNSSVALDFTDGELVLQAGGADDDRATSRMAASLEGESIDIAFNP
SFLLDGLTNLDASWAQFSFTSSNGKAVIMGKSSVDAEADTSARYLVMPVRFHR
>P24618 2.1.1.179~~~grm~~~16S rRNA (guanine(1405)-N(7))-methyltransferase~~~
MTTSAPEDRIDQVEQAITKSRRYQTVAPATVRRLARAALVAARGDVPDAVKRTKRGLHEIYGAFLPPSPPNYAALLRQLD
SAVDAGDDEAVRAALRRAMSVHVSTRERLPHLAEFYQEIFRHVPQPNTLRDLACGLNPLAAPWMGLSDQTVYVASDIDAR
LIGFVDAALTRLGVAHRTSVVDLLEDRLDEPTDVTLLLKTLPCLETQRRGSGWEVIDIVNSPIIVVTFPTKSLGQRSKGM
FQNYSQSFESQARERSCRIQRLEIGNELIYVIQK
>P24619 2.1.1.179~~~grm~~~16S rRNA (guanine(1405)-N(7))-methyltransferase~~~
MTTSTGDDRIDQLQQAITKSRRYQTVAPATVRRLARAALVASRGDVPDAVKRTKRGLHEIYGAFLPPSAPNYTALLRHLD
SAVEAGDDEAVVRWDRRAMSVHMSTRERVPHLDEFYREIFRHVPRPNTLRDLACGLNPLAVPWMGLSDETVYVASDIDAR
LMDFVGAALTRLGVAHRTSVVDLLEARLDEPADVTLLLKTLPCLETQQRGSGWEVIDIVNSPIIVVTFPTKSLGQRSKGM
FQNYSQSFESQASERSCRIQRLEIGNELIYVIHK
>P15874 ~~~grpE~~~Protein GrpE~~~COG0576
MSEEKQTVEQNETEEQEIIEEQAAADEQQEETNESELLQNQINELQGLLEEKENKLLRVQADFENYKRRSRLEMEASQKY
RSQNIVTDLLPALDSFERALQVEADNEQTKSLLQGMEMVHRQLVEALKKEGVEAIEAVGQEFDPNLHQAVMQAEDENYGS
NIVVEEMQKGYKLKDRVIRPSMVKVNQ
>C4ZYN1 ~~~grpE~~~Protein GrpE~~~
MSSKEQKTPEGQAPEEIIMDQHEEIEAVEPEASAEQVDPRDEKVANLEAQLAEAQTRERDGILRVKAEMENLRRRTELDI
EKAHKFALEKFINELLPVIDSLDRALEVADKANPDMSAMVEGIELTLKSMLDVVRKFGVEVIAETNVPLDPNVHQAIAMV
ESDDVAPGNVLGIMQKGYTLNGRTIRAAMVTVAKAKA
>P09372 ~~~grpE~~~Protein GrpE~~~COG0576
MSSKEQKTPEGQAPEEIIMDQHEEIEAVEPEASAEQVDPRDEKVANLEAQLAEAQTRERDGILRVKAEMENLRRRTELDI
EKAHKFALEKFINELLPVIDSLDRALEVADKANPDMSAMVEGIELTLKSMLDVVRKFGVEVIAETNVPLDPNVHQAIAMV
ESDDVAPGNVLGIMQKGYTLNGRTIRAAMVTVAKAKA
>Q5KWZ6 ~~~grpE~~~Protein GrpE~~~COG0576
MEQGEKQVMEQATYDEPEREQPIEEEAAPQPEEESGGVPLEEAGGEEAAEPAEKAPTAEELAAAKAQIAELEAKLSEMEH
RYLRLYADFENFRRRTRQEMEAAEKYRAQSLASDLLPVLDNFERALKIETDNEQAKSILQGMEMVYRSLVDALKKEGVEA
IEAVGKPFDPYLHQAVMQAEAEGYEPNTVVEELQKGYKLKDRVLRPAMVKVSQ
>P78017 ~~~grpE~~~Protein GrpE~~~
MSENSLTITEILSSIRTLLVKHNKAKVTQIEKELLQAVAELEKKFKQQVQNFNELQQKIPNLQKVNEEFRLKVEKIQEEA
QKKIQEKVAELTIKSKEELENAKKYVIEKSIDQPLIIIDQFEIALSYAQKDPQVKNYTTGFNMVLDAFSRWLEGFGVTKI
AIEPGAQFDEKVMAALEVVPSDQPANTVVKVSKSGYKLHDKVIRFASVVVSQGNKTE
>P9WMT5 ~~~grpE~~~Protein GrpE~~~COG0576
MTDGNQKPDGNSGEQVTVTDKRRIDPETGEVRHVPPGDMPGGTAAADAAHTEDKVAELTADLQRVQADFANYRKRALRDQ
QAAADRAKASVVSQLLGVLDDLERARKHGDLESGPLKSVADKLDSALTGLGLVAFGAEGEDFDPVLHEAVQHEGDGGQGS
KPVIGTVMRQGYQLGEQVLRHALVGVVDTVVVDAAELESVDDGTAVADTAENDQADQGNSADTSGEQAESEPSGS
>P99086 ~~~grpE~~~Protein GrpE~~~
MTNKDESVEKNTESTVEETNIKQNIDDSVEQAEESKGHLQDEAIEETSDENVIEEIDPKDQKINELQQLADENEEKYLRL
YAEFENYKRRIQKENEINKTYQAQRVLTDILPAIDNIERALQIEGDDETFKSLQKGVQMVHESLINALKDNGLEVIKTEG
EAFDPNIHQAVVQDDNPDFESGEITQELQKGYKLKDRVLRPSMVKVNQ
>Q5XAD5 ~~~grpE~~~Protein GrpE~~~
MAVFNKLFKRRHSVSEEIKKDDLQEEVEATETEETVEEVIEETPEKSELELANERADEFENKYLRAHAEMQNIQRRSSEE
RQQLQRYRSQDLAKAILPSLDNLERALAVEGLTDDVKKGLEMTRDSLIQALKEEGVEEVEVDSFDHNFHMAVQTLPADDE
HPADSIAEVFQKGYKLHERLLRPAMVVVYN
>Q97S73 ~~~grpE~~~Protein GrpE~~~COG0576
MAQDIKNEEVEEVQEEEVVKTAEETTPEKSELDLANERADEFENKYLRAHAEMQNIQRRANEERQNLQRYRSQDLAKAIL
PSLDNLERALAVEGLTDDVKKGLGMVQESLIHALKEEGIEEIAADGEFDHNYHMAIQTLPADDEHPVDTIAQVFQKGYKL
HDRILRPAMVVVYN
>Q56236 ~~~grpE~~~Protein GrpE~~~COG0576
MEERNHENTLEKDLEAVGQEAQALEERLKAAEEELKGLKDKYLRLLADFDNYRKRMEEELKAREREGVLKALRALLPVLD
DLDRALEFAEASPESIRQGVRAIRDGFFRILAGLGVEEVPGEGEAFDPRYHEAVGLLPGEPGKVAKVFQRGFRMGEALVR
PARVAVGEEKREEADLE
>P0DUM1 ~~~grpN~~~Bacterial microcompartment shell vertex protein GrpN~~~
MYLGKVIGTVVSTSKNESLSGTKLLVVARLTEKLIPDGSTQVVVDTVGAGNGEIVIVSCGSSARQSTGKDHSVIDAAVVG
IVDTVETVN
>P0C061 ~~~grsA~~~Gramicidin S synthase 1~~~
MLNSSKSILIHAQNKNGTHEEEQYLFAVNNTKAEYPRDKTIHQLFEEQVSKRPNNVAIVCENEQLTYHELNVKANQLARI
FIEKGIGKDTLVGIMMEKSIDLFIGILAVLKAGGAYVPIDIEYPKERIQYILDDSQARMLLTQKHLVHLIHNIQFNGQVE
IFEEDTIKIREGTNLHVPSKSTDLAYVIYTSGTTGNPKGTMLEHKGISNLKVFFENSLNVTEKDRIGQFASISFDASVWE
MFMALLTGASLYIILKDTINDFVKFEQYINQKEITVITLPPTYVVHLDPERILSIQTLITAGSATSPSLVNKWKEKVTYI
NAYGPTETTICATTWVATKETIGHSVPIGAPIQNTQIYIVDENLQLKSVGEAGELCIGGEGLARGYWKRPELTSQKFVDN
PFVPGEKLYKTGDQARWLSDGNIEYLGRIDNQVKIRGHRVELEEVESILLKHMYISETAVSVHKDHQEQPYLCAYFVSEK
HIPLEQLRQFSSEELPTYMIPSYFIQLDKMPLTSNGKIDRKQLPEPDLTFGMRVDYEAPRNEIEETLVTIWQDVLGIEKI
GIKDNFYALGGDSIKAIQVAARLHSYQLKLETKDLLKYPTIDQLVHYIKDSKRRSEQGIVEGEIGLTPIQHWFFEQQFTN
MHHWNQSYMLYRPNGFDKEILLRVFNKIVEHHDALRMIYKHHNGKIVQINRGLEGTLFDFYTFDLTANDNEQQVICEESA
RLQNSINLEVGPLVKIALFHTQNGDHLFMAIHHLVVDGISWRILFEDLATAYEQAMHQQTIALPEKTDSFKDWSIELEKY
ANSELFLEEAEYWHHLNYYTENVQIKKDYVTMNNKQKNIRYVGMELTIEETEKLLKNVNKAYRTEINDILLTALGFALKE
WADIDKIVINLEGHGREEILEQMNIARTVGWFTSQYPVVLDMQKSDDLSYQIKLMKENLRRIPNKGIGYEIFKYLTTEYL
RPVLPFTLKPEINFNYLGQFDTDVKTELFTRSPYSMGNSLGPDGKNNLSPEGESYFVLNINGFIEEGKLHITFSYNEQQY
KEDTIQQLSRSYKQHLLAIIEHCVQKEDTELTPSDFSFKELELEEMDDIFDLLADSLT
>P0C062 ~~~grsA~~~Gramicidin S synthase 1~~~
MLNSSKSILIHAQNKNGTHEEEQYLFAVNNTKAEYPRDKTIHQLFEEQVSKRPNNVAIVCENEQLTYHELNVKANQLARI
FIEKGIGKDTLVGIMMEKSIDLFIGILAVLKAGGAYVPIDIEYPKERIQYILDDSQARMLLTQKHLVHLIHNIQFNGQVE
IFEEDTIKIREGTNLHVPSKSTDLAYVIYTSGTTGNPKGTMLEHKGISNLKVFFENSLNVTEKDRIGQFASISFDASVWE
MFMALLTGASLYIILKDTINDFVKFEQYINQKEITVITLPPTYVVHLDPERILSIQTLITAGSATSPSLVNKWKEKVTYI
NAYGPTETTICATTWVATKETTGHSVPIGAPIQNTQIYIVDENLQLKSVGEAGELCIGGEGLARGYWKRPELTSQKFVDN
PFVPGEKLYKTGDQARWLPDGNIEYLGRIDNQVKIRGHRVELEEVESILLKHMYISETAVSVHKDHQEQPYLCAIFVSEK
HIPLEQLRQFSSEELPTYMIPSYFIQLDKMPLTSNGKIDRKQLPEPDLTFGMRVDYEAPRNEIEETLVTIWQDVLGIEKI
GIKDNFYALGGDSIKAIQVAARLHSYQLKLETKDLLKYPTIDQLVHYIKDSKRRSEQGIVEGEIGLTPIQHWFFEQQFTN
MHHWNQSYMLYRPNGFDKEILLRVFNKIVEHHDALRMIYKHHNGKIVQINRGLEGTLFDFYTFDLTANDNEQQVICEESA
RLQNSINLEVGPLVKIALFHTQNGDHLFMAIHHLVVDGISWRILFEDLATAYEQAMHQQTIALPEKTDSFKDWSIELEKY
ANSELFLEEAEYWHHLNYYTDNVQIKKDYVTMNNKQKNIRYVGMELTIEETEKLLKNVNKAYRTEINDILLTALGFALKE
WADIDKIVINLEGHGREEILEQMNIARTVGWFTSQYPVVLDMQKSDDLSYQIKLMKENLRRIPNKGIGYEIFKYLTTEYL
RPVLPFTLKPEINFNYLGQFDTDVKTELFTRSPYSMGNSLGPDGKNNLSPEGESYFVLNINGFIEEGKLHITFSYNEQQY
KEDTIQQLSRSYKQHLLAIIEHCVQKEDTELTPSDFSFKELELEEMDDIFDLLADSLT
>P0C064 ~~~grsB~~~Gramicidin S synthase 2~~~
MSTFKKEHVQDMYRLSPMQEGMLFHALLDKDKNAHLVQMSIAIEGIVDVELLSESLNILIDRYDVFRTTFLHEKIKQPLQ
VVLKERPVQLQFKDISSLDEEKREQAIEQYKYQDGETVFDLTRDPLMRVAIFQTGKVNYQMIWSFHHILMDGWCFNIIFN
DLFNIYLSLKEKKPLQLEAVQPYKQFIKWLEKQDKQEALRYWKEHLMNYDQSVTLPKKKAAINNTTYEPAQFRFAFDKVL
TQQLLRIANQSQVTLNIVFQTIWGIVLQKYNSTNDVVYGSVVSGRPSEISGIEKMVGLFINTLPLRIQTQKDQSFIELVK
TVHQNVLFSQQHEYFPLYEIQNHTELKQNLIDHIMVIENYPLVEELQKNSIMQKVGFTVRDVKMFEPTNYDMTVMVLPRD
EISVRLDYNAAVYDIDFIRKIEGHMKEVALCVANNPHVLVQDVPLLTKQEKQHLLVELHDSITEYPDKTIHQLFTEQVEK
TPEHVAVVFEDEKVTYRELHERSNQLARFLREKGVKKESIIGIMMERSVEMIVGILGILKAGGAFVPIDPEYPKERIGYM
LDSVRLVLTQRHLKDKFAFTKETIVIEDPSISHELTEEIDYINESEDLFYIIYTSGTTGKPKGVMLEHKNIVNLLHFTFE
KTNINFSDKVLQYTTCSFDVCYQEIFSTLLSGGQLYLIRKETQRDVEQLFDLVKRENIEVLSFPVAFLKFIFNEREFINR
FPTCVKHIITAGEQLVVNNEFKRYLHEHNVHLHNHYGPSETHVVTTYTINPEAEIPELPPIGKPISNTWIYILDQEQQLQ
PQGIVGELYISGANVGRGYLNNQELTAEKFFADPFRPNERMYRTGDLARWLPDGNIEFLGRADHQVKIRGHRIELGEIEA
QLLNCKGVKEAVVIDKADDKGGKYLCAYVVMEVEVNDSELREYLGKALPDYMIPSFFVPLDQLPLTPNGKIDRKSLPNLE
GIVNTNAKYVVPTNELEEKLAKIWEEVLGISQIGIQDNFFSLGGHSLKAITLISRMNKECNVDIPLRLLFEAPTIQEISN
YINGAKKESYVAIQPVPEQEYYPVSSVQKRMFILNEFDRSGTAYNLPGVMFLDGKLNYRQLEAAVKKLVERHEALRTSFH
SINGEPVQRVHQNVELQIAYSESTEDQVERIIAEFMQPFALEVAPLLRVGLVKLEAERHLFIMDMHHIISDGVSMQIMIQ
EIADLYKEKELPTLGIQYKDFTVWHNRLLQSDVIEKQEAYWLNVFTEEIPVLNLPTDYPRPTIQSFDGKRFTFSTGKQLM
DDLYKVATETGTTLYMVLLAAYNVFLSKYSGQDDIVVGTPIAGRSHADVENMLGMFVNTLAIRSRLNNEDTFKDFLANVK
QTALHAYENPDYPFDTLVEKLGIQRDLSRNPLFDTMFVLQNTDRKSFEVEQITITPYVPNSRHSKFDLTLEVSEEQNEIL
LCLEYCTKLFTDKTVERMAGHFLQILHAIVGNPTIIISEIEILSEEEKQHILFEFNDTKTTYPHMQTIQGLFEEQVEKTP
DHVAVGWKDQALTYRELNERANQVARVLRQKGVQPDNIVGLLVERSPEMLVGIMGILKAGGAYLPLDPEYPADRISYMIQ
DCGVRIMLTQQHLLSLVHDEFDCVILDEDSLYKGDSSNLAPVNQAGDLAYIMYTSGSTGKPKGVMVEHRNVIRLVKNTNY
VQVREDDRIIQTGAIGFDALTFEVFGSLLHGAELYPVTKDVLLDAEKLHKFLQANQITIMWLTSPLFNQLSQGTEEMFAG
LRSLIVGGDALSPKHINNVKRKCPNLTMWNGYGPTENTTFSTCFLIDKEYDDNIPIGKAISNSTVYIMDRYGQLQPVGVP
GELCVGGDGVARGYMNQPALTEEKFVPNPFAPGERMYRTGDLARWLPDGTIEYLGRIDQQVKIRGYRIEPGEIETLLVKH
KKVKESVIMVVEDNNGQKALCAYYVPEEEVTVSELREYIAKELPVYMVPAYFVQIEQMPLTQNGKVNRSALPKPDGEFGT
ATEYVAPSSDIEMKLAEIWHNVLGVNKIGVLDNFFELGGHSLRAMTMISQVHKEFDVELPLKVLFETPTISALAQYIADG
EKGMYLAIQPVTPQDYYPVSSAQKRMYILYEFEGAGITYNVPNVMFIEGKLDYQRFEYAIKSLINRHEALRTSFYSLNGE
PVQRVHQNVELQIAYSEAKEDEIEQIVESFVQPFDLEIAPALRVGLVKLASDRHLFLMDMHHIISDGVSMQIITKEIADL
YKGKELAELHIQYKDFAVWQNEWFQSAALEKQKTYWLNTFAEDIPVLNLSTDYPRPTIQSFEGDIVTFSAGKQLAEELKR
LATETGTTLYMLLLAAYNVLLHKYSGQEEIVVGTPIAGRSHADVENIVGMFVNTLALKNTPIAVRTFHEFLLEVKQNALE
AFENQDYPFENLIEKLQVRRDLSRNPLFDTMFSLSNIDEQVEIGIEGLSFSPYEMQYWIAKFDISFDILEKQDDIQFYFN
YCTNLFKKETIERLATHFMHILQEIVINPEIKLCEINMLSEEEQQRVLYDFNGTDATYATNKIFHELFEEQVEKTPDHIA
VIDEREKLSYQELNAKANQLARVLRQKGVQPNSMVGIMVDRSLDMIVGMLGVLKAGGAYVPIDIDYPQERISYMMEDSGA
ALLLTQQKLTQQIAFSGDILYLDQEEWLHEEASNLEPIARPQDIAYIIYTSGTTGKPKGVMIEHQSYVNVAMAWKDAYRL
DTFPVRLLQMASFAFDVSAGDFARALLTGGQLIVCPNEVKMDPASLYAIIKKYDITIFEATPALVIPLMEYIYEQKLDIS
QLQILIVGSDSCSMEDFKTLVSRFGSTIRIVNSYGVTEACIDSSYYEQPLSSLHVTGTVPIGKPYANMKMYIMNQYLQIQ
PVGVIGELCIGGAGVARGYLNRPDLTAEKFVPNPFVPGEKLYRTGDLARWMPDGNVEFLGRNDHQVKIRGIRIELGEIEA
QLRKHDSIKEATVIAREDHMKEKYLCAYMVTEGEVNVAELRAYLATDLPAAMIPSYFVSLEAMPLTANGKIDKRSLPEPD
GSISIGTEYVAPRTMLEGKLEEIWKDVLGLQRVGIHDDFFTIGGHSLKAMAVISQVHKECQTEVPLRVLFETPTIQGLAK
YIEETDTEQYMAIQPVSGQDYYPVSSAQKRMFIVNQFDGVGISYNMPSIMLIEGKLERTRLESAFKRLIERHESLRTSFE
IINGKPVQKIHEEADFNMSYQVASNEQVEKMIDEFIQPFDLSVAPLLRVELLKLEEDRHVLIFDMHHIISDGISSNILMK
ELGELYQGNALPELRIQYKDFAVWQNEWFQSEAFKKQEEYWVNVFADERPILDIPTDYPRPMQQSFDGAQLTFGTGKQLM
DGLYRVATETGTTLYMVLLAAYNVLLSKYSGQEDIIVGTPIVGRSHTDLENIVGMFVNTLAMRNKPEGEKTFKAFVSEIK
QNALAAFENQDYPFEELIEKLEIQRDLSRNPLFDTLFSLQNIGEESFELAELTCKPFDLVSKLEHAKFDLSLVAVEKEEE
IAFGLQYCTKLYKEKTVEQLAQHFIQIVKAIVENPDVKLSDIDMLSEEEKKQIMLEFNDTKIQYPQNQTIQELFEEQVKK
TPEHIAIVWEGQALTYHELNIKANQLARVLREKGVTPNHPVAIMTERSLEMIVGIFSILKAGGAYVPIDPAYPQERIQYL
LEDSGATLLLTQSHVLNKLPVDIEWLDLTDEQNYVEDGTNLPFMNQSTDLAYIIYTSGTTGKPKGVMIEHQSIINCLQWR
KEEYEFGPGDTALQVFSFAFDGFVASLFAPILAGATSVLPKEEEAKDPVALKKLIASEEITHYYGVPSLFSAILDVSSSK
DLQNLRCVTLGGEKLPAQIVKKIKEKNKEIEVNNEYGPTENSVVTTIMRDIQVEQEITIGCPLSNVDVYIVNCNHQLQPV
GVVGELCIGGQGLARGYLNKPELTADKFVVNPFVPGERMYKTGDLAKWRSDGMIEYVGRVDEQVKVRGYRIELGEIESAI
LEYEKIKEAVVIVSEHTASEQMLCAYIVGEEDVLTLDLRSYLAKLLPSYMIPNYFIQLDSIPLTPNGKVDRKALPEPQTI
GLMAREYVAPRNEIEAQLVLIWQEVLGIELIGITDNFFELGGHSLKATLLVAKIYEYMQIEMPLNVVFKHSTIMKIAEYI
THQESENNVHQPILVNVEADREALSLNGEKQRKNIELPILLNEETDRNVFLFAPIGAQGVFYKKLAEQIPTASLYGFDFI
EDDDRIQQYIESMIQTQSDGQYVLIGYSSGGNLAFEVAKEMERQGYSVSDLVLFDVYWKGKVFEQTKEEEEENIKIIMEE
LRENPGMFNMTREDFELYFANEFVKQSFTRKMRKYMSFYTQLVNYGEVEATIHLIQAEFEEEKIDENEKADEEEKTYLEE
KWNEKAWNKAAKRFVKYNGYGAHSNMLGGDGLERNSSILKQILQGTFVVK
>O32210 1.1.1.-~~~yvgN~~~Glyoxal reductase~~~COG0656
MPTSLKDTVKLHNGVEMPWFGLGVFKVENGNEATESVKAAIKNGYRSIDTAAIYKNEEGVGIGIKESGVAREELFITSKV
WNEDQGYETTLAAFEKSLERLQLDYLDLYLIHWPGKDKYKDTWRALEKLYKDGKIRAIGVSNFQVHHLEELLKDAEIKPM
VNQVEFHPRLTQKELRDYCKGQGIQLEAWSPLMQGQLLDNEVLTQIAEKHNKSVAQVILRWDLQHGVVTIPKSIKEHRII
ENADIFDFELSQEDMDKIDALNKDERVGPNPDELLF
>P80870 ~~~yugI~~~General stress protein 13~~~COG1098
MAAKFEVGSVYTGKVTGLQAYGAFVALDEETQGLVHISEVTHGFVKDINEHLSVGDEVQVKVLAVDEEKGKISLSIRATQ
AAPEKKESKPRKPKAAQVSEEASTPQGFNTLKDKLEEWIEMSNRKDLIKK
>P80871 1.6.99.-~~~ywrO~~~General stress protein 14~~~COG2249
MKILVLAVHPHMETSVVNKAWAEELSKHDNITVRDLYKEYPDEAIDVAKEQQLCEEYDRIVFQFPLYWYSSPPLLKKWQD
LVLTYGWAFGSEGNALHGKELMLAVSTGSEAEKYQAGGANHYSISELLKPFQATSNLIGMKYLPPYVFYGVNYAAAEDIS
HSAKRLAEYIQQPFV
>P80876 3.2.-.-~~~yfkM~~~General stress protein 18~~~COG0693
MGKKIAVVLTYYFEDSEYTEPAKAFKEAGHELTVIEKEKGKTVKGKQGTAEVTVDASIDDVNSSDFDALLIPGGFSPDQL
RADDRFVQFTKAFMTDKKPVFAICHGPQLLINAKALDGRKATGYTSIRVDMENAGADVVDKEVVVCQDQLVTSRTPDDIP
AFNRESLALLEK
>P80238 ~~~ydaG~~~General stress protein 26~~~COG3871
MNQQDIKQKVLDVLDHHKVGSLATVQKGKPHSRYMTFFHDGLTIYTPTSKETHKAEEIENNPNVHILLGYDCEGFGDAYV
EVAGKAKINNSAELKDKIWSSKLERWFDGKDDPNLVILEIEPEDIRLMNAGEKTPVSLEL
>P42101 2.-.-.-~~~yxaB~~~General stress protein 30~~~COG5039
MTVQEIKGKKLVKGIAPNVEPEALLNDKRKVFLFGSPSYTNIGDQAIAYAEEKFIKNHFPYYEYIEIMDYATDEGIELVK
EIIREDDIVCFTGGGNLGNLYLDIEEDRRKVFSAFKDYKSISLPQSVYFEDTEEGQKEKKKTQDAYHQNTNLTIAARETQ
TLDVVKETFNSNVIFTPDMVLSLDIVPRELERDGVLFILRADKEKVTDEDFISQMKQWAEKTTYTERTDTVLDTVDTIDY
ADREKHFMEMLDRIGSSKLVITDRLHAMIFSIITKTPCLVFGNSYGKAKHSYRDWLESLNFIEYTDKNDVEELERMIDRL
LQAEPNDVDLSKDFQPLIDFFAS
>P80873 1.-.-.-~~~ydaD~~~General stress protein 39~~~COG1028
MANYPKELPAQTQSRQPGIESEMNPSPVYEYEDYKGADKLKGKVALITGGDSGIGRAVSVAYAKEGADIAIVYKDEHEDA
EETKKRVEQEGVKCLLIAGDVGEEEFCNEAVEKTVKELGGLDILVNNAGEQHPKESIKDITSEQLHRTFKTNFYSQFYLT
KKAIDYLKPGSAIINTTSINPYVGNPTLIDYTATKGAINAFTRTMAQALVKDGIRVNAVAPGPIWTPLIPATFPEETVAQ
FGQDTPMGRPGQPVEHVGCYVLLASDESSYMTGQTLHVNGGNFVTT
>P80874 1.1.1.-~~~yhdN~~~Aldo-keto reductase YhdN~~~COG0667
MEYTSIADTGIEASRIGLGTWAIGGTMWGGTDEKTSIETIRAALDQGITLIDTAPAYGFGQSEEIVGKAIKEYGKRDQVI
LATKTALDWKNNQLFRHANRARIVEEVENSLKRLQTDYIDLYQVHWPDPLVPIEETAEVMKELYDAGKIRAIGVSNFSIE
QMDTFRAVAPLHTIQPPYNLFEREMEESVLPYAKDNKITTLLYGSLCRGLLTGKMTEEYTFEGDDLRNHDPKFQKPRFKE
YLSAVNQLDKLAKTRYGKSVIHLAVRWILDQPGADIALWGARKPGQLEALSEITGWTLNSEDQKDINTILENTISDPVGP
EFMAPPTREEI
>Q81YV0 5.4.3.8~~~hemL1~~~Glutamate-1-semialdehyde 2,1-aminomutase 1~~~COG0001
MVVKFTKSEALHKEALEHIVGGVNSPSRSFKAVGGGAPIAMERGKGAYFWDVDGNKYIDYLAAYGPIITGHAHPHITKAI
TTAAENGVLYGTPTALEVKFAKMLKEAMPALDKVRFVNSGTEAVMTTIRVARAYTGRTKIMKFAGCYHGHSDLVLVAAGS
GPSTLGTPDSAGVPQSIAQEVITVPFNNVETLKEALDKWGHEVAAILVEPIVGNFGIVEPKPGFLEKVNELVHEAGALVI
YDEVITAFRFMYGGAQDLLGVTPDLTALGKVIGGGLPIGAYGGKKEIMEQVAPLGPAYQAGTMAGNPASMASGIACLEVL
QQEGLYEKLDELGAMLEKGILEQAAKHNIDITLNRLKGALTVYFTTNTIEDYDAAQDTDGEMFGKFFKLMLQEGVNLAPS
KYEAWFLTTEHTKEDIEYTIEAVGRAFAALADNK
>P99096 5.4.3.8~~~hemL1~~~Glutamate-1-semialdehyde 2,1-aminomutase 1~~~
MRYTKSEEAMKVAETLMPGGVNSPVRAFKSVDTPAIFMDHGKGSKIYDIDGNEYIDYVLSWGPLILGHRDPQVISHLHEA
IDKGTSFGASTLLENKLAQLVIDRVPSIEKVRMVSSGTEATLDTLRLARGYTGRNKIVKFEGCYHGHSDSLLIKAGSGVA
TLGLPDSPGVPEGIAKNTITVPYNDLDALKIAFEKFGDDIAGVIVEPVAGNMGVVPPIEGFLQGLRDITTEYGALLIFDE
VMTGFRVGYHCAQGYFGVTPDLTCLGKVIGGGLPVGAFGGKKEIMDHIAPLGNIYQAGTLSGNPLAMTSGYETLSQLTPE
TYEYFNMLGDILEDGLKRVFAKHNVPITVNRAGSMIGYFLNEGPVTNFEQANKSDLKLFAEMYREMAKEGVFLPPSQFEG
TFLSTAHTKEDIEKTIQAFDTALSRIVK
>Q81LD0 5.4.3.8~~~hemL2~~~Glutamate-1-semialdehyde 2,1-aminomutase 2~~~COG0001
MRKFDKSIAAFEEAQDLMPGGVNSPVRAFKSVGMNPLFMERGKGSKVYDIDGNEYIDYVLSWGPLIHGHANDRVVEALKA
VAERGTSFGAPTEIENKLAKLVIERVPSIEIVRMVNSGTEATMSALRLARGYTGRNKILKFIGCYHGHGDSLLIKAGSGV
ATLGLPDSPGVPEGVAKNTITVAYNDLESVKYAFEQFGDDIACVIVEPVAGNMGVVPPQPGFLEGLREVTEQNGALLIFD
EVMTGFRVAYNCGQGYYGVTPDLTCLGKVIGGGLPVGAYGGKAEIMRQVAPSGPIYQAGTLSGNPLAMAAGYETLVQLTP
ESYVEFERKAEMLEAGLRKAAEKHGIPHHINRAGSMIGIFFTDEPVINYDAAKSSNLQFFAAYYREMVEQGVFLPPSQFE
GLFLSTVHSDADIEATIAAAEIAMSKLKA
>Q7A4T5 5.4.3.8~~~hemL2~~~Glutamate-1-semialdehyde 2,1-aminomutase 2~~~
MNFSESERLQQLSNEYILGGVNSPSRSYKAVGGGAPVVMKEGHGAYLYDVDGNKFIDYLQAYGPIIAGHAHPHITKAIQE
QAAKGVLFGTPTELEIEFSKKLRDAIPSLEKIRFVNSGTEAVMTTIRVARAYTKRNKIIKFAGSYHGHSDLVLVAAGSGP
SQLGSPDSAGVPESVAREVITVPFNDINAYKEAIEFWGDEIAAVLVEPIVGNFGMVMPQPGFLEEVNEISHNNGTLVIYD
EVITAFRFHYGAAQDLLGVIPDLTAFGKIVGGGLPIGGYGGRQDIMEQVAPLGPAYQAGTMAGNPLSMKAGIALLEVLEQ
DGVYEKLDSLGQQLEEGLLKLIEKHNITATINRIYGSLTLYFTDEKVTHYDQVEHSDGEAFGKFFKLMLNQGINLAPSKF
EAWFLTTEHTEEDIQQTLKAADYAFSQMK
>P30949 5.4.3.8~~~hemL~~~Glutamate-1-semialdehyde 2,1-aminomutase~~~COG0001
MRSYEKSKTAFKEAQKLMPGGVNSPVRAFKSVDMDPIFMERGKGSKIFDIDGNEYIDYVLSWGPLILGHTNDRVVESLKK
VAEYGTSFGAPTEVENELAKLVIDRVPSVEIVRMVSSGTEATMSALRLARGYTGRNKILKFEGCYHGHGDSLLIKAGSGV
ATLGLPDSPGVPEGIAKNTITVPYNDLESVKLAFQQFGEDIAGVIVEPVAGNMGVVPPQEGFLQGLRDITEQYGSLLIFD
EVMTGFRVDYNCAQGYFGVTPDLTCLGKVIGGGLPVGAYGGKAEIMEQIAPSGPIYQAGTLSGNPLAMTAGLETLKQLTP
ESYKNFIKKGDRLEEGISKTAGAHGIPHTFNRAGSMIGFFFTNEPVINYETAKSSDLKLFASYYKGMANEGVFLPPSQFE
GLFLSTAHTDEDIENTIQAAEKVFAEISRR
>Q725I1 5.4.3.8~~~hemL~~~Glutamate-1-semialdehyde 2,1-aminomutase~~~COG0001
MSQRSSELFERAQQLIPGGVNSPVRACLGVDSEPLFIARAAGSRLHTVDGETFIDFVESWGPMLLGHTHPEVTAAVHAAV
DRGTSYGAPCEDEVVLAAKVVDALPGVDMVRMVNSGTEATMSALRLARGYTGRTKLVKFVGCYHGHADPFLASAGSGVAT
LSIPGTPGVPESTVRDTLLAPYNDLAAVKDLFALHGKDIAAIIVEAVAGNMGLVPPKAGFLEGLRELCDQHGALLIFDEV
ITGFRVSFGGAQQRFGITPDLTTLGKIIGGGLPVGAYGGKREIMQRIAPCGEVYQAGTLSGNPLAMAAGIATLDVLSRSD
YAGLEARVAAFVKELEAILKGKGVPVRINTLASMFTVFFTNDPVTDFASAKTADGALYTSFYKQMRAQGIYLAPSPFEAA
MVSFAHTDDDLAAMLDAARKVTF
>P23893 5.4.3.8~~~hemL~~~Glutamate-1-semialdehyde 2,1-aminomutase~~~COG0001
MSKSENLYSAARELIPGGVNSPVRAFTGVGGTPLFIEKADGAYLYDVDGKAYIDYVGSWGPMVLGHNHPAIRNAVIEAAE
RGLSFGAPTEMEVKMAQLVTELVPTMDMVRMVNSGTEATMSAIRLARGFTGRDKIIKFEGCYHGHADCLLVKAGSGALTL
GQPNSPGVPADFAKYTLTCTYNDLASVRAAFEQYPQEIACIIVEPVAGNMNCVPPLPEFLPGLRALCDEFGALLIIDEVM
TGFRVALAGAQDYYGVVPDLTCLGKIIGGGMPVGAFGGRRDVMDALAPTGPVYQAGTLSGNPIAMAAGFACLNEVAQPGV
HETLDELTTRLAEGLLEAAEEAGIPLVVNHVGGMFGIFFTDAESVTCYQDVMACDVERFKRFFHMMLDEGVYLAPSAFEA
GFMSVAHSMEDINNTIDAARRVFAKL
>P9WMN9 5.4.3.8~~~hemL~~~Glutamate-1-semialdehyde 2,1-aminomutase~~~COG0001
MGSTEQATSRVRGAARTSAQLFEAACSVIPGGVNSPVRAFTAVGGTPRFITEAHGCWLIDADGNRYVDLVCSWGPMILGH
AHPAVVEAVAKAAARGLSFGAPTPAETQLAGEIIGRVAPVERIRLVNSGTEATMSAVRLARGFTGRAKIVKFSGCYHGHV
DALLADAGSGVATLGLCDDPQRPASPRSQSSRGLPSSPGVTGAAAADTIVLPYNDIDAVQQTFARFGEQIAAVITEASPG
NMGVVPPGPGFNAALRAITAEHGALLILDEVMTGFRVSRSGWYGIDPVPADLFAFGKVMSGGMPAAAFGGRAEVMQRLAP
LGPVYQAGTLSGNPVAVAAGLATLRAADDAVYTALDANADRLAGLLSEALTDAVVPHQISRAGNMLSVFFGETPVTDFAS
ARASQTWRYPAFFHAMLDAGVYPPCSAFEAWFVSAALDDAAFGRIANALPAAARAAAQERPA
>P48247 5.4.3.8~~~hemL~~~Glutamate-1-semialdehyde 2,1-aminomutase~~~
MSRSETLFNNAQKHIPGGVNSPVRAFKSVGGTPLFFKHAEGAYVLDEDDKRYVDYVGSWGPMILGHSHPDVLDAVRRQLD
HGLSYGAPTALEVEMADLVCSMVPSMEMVRMVSSGTEATMSAIRLARGYTGRDSIIKFEGCYHGHSDSLLVKAGSGALTF
GVPNSPGVPAAFAKHTLTLPFNDIEAVRKTLGEVGKEVACIIVEPVAGNMNCVPPAPGFLEGLREACDEHGVVLIFDEVM
TGFRVALGGAQAYYGVTPDLSTFGKIIGGGMPVGAFGGKREIMQQISPLGPVYQAGTLSGNPLAMAAGLTTLRLISRPGF
HDELTAYTTRMLDGLQQRADAAGIPFVTTQAGGMFGLYFSGADAIVTFEDVMASDVERFKRFFHLMLDGGVYLAPSAFEA
GFTSIAHGDKELEITLNAAEKAFAALK
>B2FT35 5.4.3.8~~~hemL~~~Glutamate-1-semialdehyde 2,1-aminomutase~~~COG0001
MNHDQSHALFSRAQQLLPGGVNSPVRAFKSVGGEPFFVERADGAYLYDVDGNRYIDYVGSWGPMIVGHNHPAVRQAVKKA
IDNGLSFGAPCAGEVTMAETITRLVPSCEMVRMVNSGTEATLSAIRLARGATGRNRIVKFEGCYHGHGDSFLVKAGSGML
TLGVPTSPGVPAGLSELTLTLPYNDFEAATALFEQQGDDIAGLIIEPVVGNANCIPPREGYLQHLRALCTKHGALLIFDE
VMTGFRVALGGAQAHYGITPDLTTFGKIIGGGMPVGAYGGRRELMQQIAPAGPIYQAGTLSGNPVAMAAGLAMLELVQQP
GFHADLAERTARLCAGLEAAAADAGVAVTTTRVGAMFGLFFTSEKVETYAQATACDIPAFNRFFHAMLEQGVFLAPSAYE
AGFLSSAHDDAVIEATLAAARVAFRAAKG
>Q31QJ2 5.4.3.8~~~hemL~~~Glutamate-1-semialdehyde 2,1-aminomutase~~~COG0001
MVTSSPFKTIKSDEIFAAAQKLMPGGVSSPVRAFKSVGGQPIVFDRVKDAYAWDVDGNRYIDYVGTWGPAICGHAHPEVI
EALKVAMEKGTSFGAPCALENVLAEMVIDAVPSIEMVRFVNSGTEACMAVLRLMRAYTGRDKIIKFEGCYHGHADMFLVK
AGSGVATLGLPDSPGVPKSTTANTLTAPYNDLEAVKALFAENPGEIAGVILEPIVGNSGFIVPDAGFLEGLREITLEHDA
LLVFDEVMTGFRIAYGGVQEKFGVTPDLTTLGKIIGGGLPVGAYGGKREIMQLVAPAGPMYQAGTLSGNPLAMTAGIKTL
ELLRQPGTYEYLDQITKRLSDGLLAIAQETGHAACGGQVSGMFGFFFTEGPVHNYEDAKKSDLQKFSRFHRGMLEQGIYL
APSQFEAGFTSLAHTEEDIDATLAAARTVMSAL
>P24630 5.4.3.8~~~hemL~~~Glutamate-1-semialdehyde 2,1-aminomutase~~~COG0001
MVTSSPFKTIKSDEIFAAAQKLMPGGVSSPVRAFKSVGGQPIVFDRVKDAYAWDVDGNRYIDYVGTWGPAICGHAHPEVI
EALKVAMEKGTSFGAPCALENVLAEMVIDAVPSIEMVRFVNSGTEACMAVLRLMRAYTGRDKIIKFEGCYHGHADMFLVK
AGSGVATLGLPDSPGVPKSTTANTLTAPYNDLEAVKALFAENPGEIAGVILEPIVGNSGFIVPDAGFLEGLREITLEHDA
LLVFDEVMTGFRIAYGGVQEKFGVTPDLTTLGKIIGGGLPVGAYGGKREIMQLVAPAGPMYQAGTLSGNPLAMTAGIKTL
ELLRQPATYEYLDQITKRLSDGLLAIAQETGHAACGGQVSGMFGFFFTEGPVHNYEDAKKSDLQKFSRFHRGMLEQGIYL
APSQFEAGFTSLAHTEEDIDATLAAARTVMSAL
>Q5SJS4 5.4.3.8~~~hemL~~~Glutamate-1-semialdehyde 2,1-aminomutase~~~COG0001
MERPISEAYFQEAKRHIPGGVSSPVRAFKAVGGTPPFLVRGEGAYVWDADGNRYLDYVMSWGPLILGHAHPKVLARVRET
LERGLTFGAPSPLEVALAKKVKRAYPFVDLVRFVNSGTEATMSALRLARGYTGRPYIVKFRGNYHGHADGLLVEAGSGAL
TLGVPSSAGVPEEYAKLTLVLEYNDPEGLREVLKRRGEEIAAIIFEPVVGNAGVLVPTEDFLKALHEAKAYGVLLIADEV
MTGFRLAFGGATELLGLKPDLVTLGKILGGGLPAAAYAGRREIMEKVAPLGPVYQAGTLSGNPLAMAAGLATLELLEENP
GYYAYLEDLGARLEAGLKEVLKEKGLPHTVNRVGSMITVFFTEGPVVTFQDARRTDTELFKRFFHGLLDRGIYWPPSNFE
AAFLSVAHREEDVEKTLEALRKAL
>Q8DLK8 5.4.3.8~~~hemL~~~Glutamate-1-semialdehyde 2,1-aminomutase~~~COG0001
MRELTLTTTVFQTTKSQEIFAAAQKLMPGGVSSPVRAFKSVGGQPIVFDHVKGAHIWDVDGNQYIDYVGSWGPAIVGHAH
PEVIDALHAALEKGTSFGAPCLLENILAEMVIAAVPSVEMVRFVNSGTEACMAVLRLMRAYTQREKVIKFEGCYHGHADM
FLVKAGSGVATLGLPDSPGVPKATTAATLTAPYNDLEAVSRLFEQYPNDIAGVILEPVVGNAGFIPPDAGFLEGLRELTK
QYGALLVFDEVMTGFRIAYGGAQEKFGVTPDLTTLGKVIGGGLPVGAYGGRAEIMKMVAPAGPVYQAGTLSGNPLAMTAG
IKTLEILSRPGSYEHLDRITGKLVQGLLDAAREFGHEVCGGHISGMFGLFFTAGPVTNYEQAKQSDLKKFAAFHRGMLEQ
GIYLAPSQFEAGFTSLAHTEADIERTIAAARTVLSQL
>Q8ZBL9 5.4.3.8~~~hemL~~~Glutamate-1-semialdehyde 2,1-aminomutase~~~COG0001
MSKSENLYAQAQQLIPGGVNSPVRAFTGVGGIPLFIERADGAYLFDVDGKAYIDYVGSWGPMILGHNHPAIRQAVIEAVE
RGLSFGAPTEMEVKMAQLVTDLVPTMDMVRMVNSGTEATMSAIRLARGYTGRDKIIKFEGCYHGHADCLLVKAGSGALTL
GQPNSPGVPTDFAKHTLTCTYNDLASVRQAFEQYPQEVACIIVEPVAGNMNCIPPLPEFLPGLRALCDEFGALLIIDEVM
TGFRVALAGAQDYYHVIPDLTCLGKIIGGGMPVGAFGGRREVMNALAPTGPVYQAGTLSGNPIAMAAGFACLTEISQVGV
YETLTELTDSLATGLRHAAKEENIPLVVNHVGGMFGLFFTNADTVTCYQDVMNCDVERFKRFFHLMLEEGVYLAPSAFEA
GFMSLAHSNEDIQKTVNAARRCFAKL
>Q9KJ20 2.1.1.156~~~~~~Glycine/sarcosine/dimethylglycine N-methyltransferase~~~
MTKSVDDLARGDQAGDEQDPVHREQQTFGDNPLEVRDTDHYMHEYVGGFVDKWDDLIDWKKRYESEGSFFIDQLRARGVE
TVLDAAAGTGFHSVRLLEEGFETVSADGSPQMLAKAFSNGLAYNGHILRVVNADWRWLNRDVHGEYDAIICLGNSFTHLF
SERDRRKTLAEFYAMLKHDGVLIIDQRNYDSILDTGFSSKHTYYYAGEDVSAEPDHIDDGLARFKYTFPDKSEFFLNMYP
LRKDYMRRLMREVGFQRIDTYGDFQETYGEDEPDFYIHVAEKSYRTEDEFVDMYSNAVHTARDYYNSEDADNFYYHVWGG
NDIHVGLYQTPQEDIATASERTVQRMAGKVDISPETRILDLGAGYGGAARYLARTYGCHVTCLNLSEVENQRNREITRAE
GLEHLIEVTDGSFEDLPYQDNAFDVVWSQDSFLHSGDRSRVMEEVTRVLKPKGSVLFTDPMASDSAKKNELGPILDRLHL
DSLGSPGFYRKELTRLGLQNIEFEDLSEYLPVHYGRVLEVLESRENELAGFIGEEYRAHMKTGLRNWVQAGNGGSLAWGI
IHARA
>P0DV46 ~~~~~~Gasdermin bGSDM~~~
MNCSRDTGDELMAALLAEGINLILPPRDNIAPGDLIIADPQGGARLGGWHEVFNLQLSPEVATDPGFKSFQFRASSILQV
GVAASVMGRVLQALGLGSGSFSSAFSSSNADTIQLSIVAPANKELTNFDAVLVQMNEAKAEPAQGYTDRNFFVVTKVWRA
RGIRISVADKSKKQVDLSAKAVEELTAKAKMELKREDTGSYAFLAASQLIFGLTLREVTYKDGAIVDVAPTGPLKFRGKG
PGDPFAFIGDDAFVDLPES
>P0DV50 ~~~~~~Gasdermin bGSDM~~~
MWCKDPFLTYLKEFGYNVIRLPKADVKPLQVLARCGKDLNRLGEINNLLVAGDSIPLPRLKENTRAASISGQRTGDLSVG
VGLSILGTVLGAMGGSKLGLDTKYQNAKTTMFEFQDVFEDTVEIIELDKFLGDADINPFSRHVAELLEADDLYITTSTIK
STKFTIEAKKSDGTALELTIPDIQGIVGGNVKVSGAASVTSKICYESPIPLVFGFQAVRLFYDNGRYTAIEPLDSGSAGM
KALGKAPSDGAARLMTDGPFLRLTGV
>H2BXL6 ~~~~~~Gasdermin bGSDM~~~
MSKVLKNALAGYGYNLVALPKEGIAPLLLLYKNKRDVSSSGNNIDKLFALADSPPPIVSKNNATLNLQQNSTVSFDGKAG
VDILDWLLQKLKMGKLRGNINADHINSLQISYQNVFEDNVSLLQLDNFISGSEPKVDQFNTFKEKLKDNELFVINSVLKS
NSFSVSAQNKNGQNIDLEATIKGIVDADVNVGRSKKDEVLMEYKNATPIVFAFKAQKIIYDHKKWWQFFKKGDAKFRIKD
EHGVVLKDESGFPTQSLEETNELINI
>A0A0S2DNG5 ~~~~~~Gasdermin bGSDM~~~
MSILPGCKDPSLSALKSKGYNVVQLPRADLRPTQLLVEKSKRLQRLGELLSVFDAAADGPPAPPVSADRPGPNIAGTQSA
DLDVDLGLSVLRGIISALGGSTLGVDAAFARAATVQFEFSSTLENNSELALIDRFLAASRVNPHARAVAEMLEQDQVYVV
TSTLKAQRINVAAKDSNKQSLGLNLPVIQDAIGANVKIAAAAASGSTVSFEGAVPLVFGFQAVRLIFEQGRYRTMRLVDA
GGVVAEAVRPDGAADGEPPCYLDVEAMLLDR
>P0DV48 ~~~~~~Gasdermin bGSDM~~~
MECNDPFVVALKDKGYSLVAYPKTSIRPLHIYEHTIKNAFKRIWIQSEAQPTSGFIKSLFSDKIHGAIGLSDGQGIDIDL
RKTNSLSSAVAAKILESYFQDSAPSFDLAFENSSSVIFHIEEIITTDADEISLRNWLNDNQNELREIYKEEIKKGNFFVA
TSLLRAKKMRMQFERKNKGELGVDVSKIKNLPVDAKLESKIEGSTYDRLVFETPDEGIVFGVKLVRLFFSDNGILTIDKK
QDFNRVLGENMALNLFTEIQDAGFIEVT
>P0DV52 ~~~~~~Gasdermin bGSDM~~~
MIKYLQSHLEEQGYLFVTLPKPDLAPLQLLTEYKGHLEEYDGSLLDLFEPDGSPFPIRDRQLPNFSGQQLLQTDWSAGAD
LLHGLFKLFQQKEDKLKASLSGMKGLVLSFAYENIEEERVSEQALDNFLAGAMPKKEGFQRSVERLQDGELYVLTSVMRS
NQFTVTIDCQREDQGKLEAAVAEIVDAHASIERKQSNSFSLQTEGEQAFVFACRAAQVLYNKKQWFQFWKKDKDGFRIEK
REGMVVRGEEDFSVQPLQAPSGLLKL
>A0A2T4VDM4 ~~~~~~Gasdermin bGSDM~~~
MGLCSDPAITYLKRLGYNVVRLPREGIQPLHLLGQQRGTVEYLGSLEKLITQPPSEPPAITRDQAAAGINGQKTENLSFS
IGINILKSVLAQFGAGAGIEAQYNQARKVRFEFSNVLADSVEPLAVGQFLKMAEVDADNPVLKQYVLGNGRLYVITQVIK
SNEFTVAAEKSGGGSIQLDVPEIQKVVGGKLKVEASVSSQSTVTYKGEKQLVFGFKCFEIGVKNGEITLFASQPGAIAMA
LDAAGGVMPSDSALLDEGGLLDLEGF
>P0C0Q1 3.4.21.19~~~gseA~~~Glutamyl endopeptidase~~~
MKKRFLSICTMTIAALATTTMVNTSYAKTDTESHNHSSLGTENKNVLDINSSSHNIKPSQNKSYPSVILPNNNRHQIFNT
TQGHYDAVSFIYIPIDGGYMSGSGVVVGENEILTNKHVVNGAKGNPRNISVHPSAKNENDYPNGKFVGQEIIPYPGNSDL
AILRVSPNEHNQHIGQVVKPATISSNTDTRINENITVTGYPGDKPLATMWESVGKVVYIGGEELRYDLSTVGGNSGSPVF
NGKNQVIGIHYGGVDNKYNSSVYINDFVQQFLRNNIPDINIQ
>P0C0Q2 3.4.21.19~~~gseA~~~Glutamyl endopeptidase~~~COG3591
MKKRFLSICTMTIAALATTTMVNTSYAKTDTESHNHSSLGTENKNVLDINSSSHNIKPSQNKSYPSVILPNNNRHQIFNT
TQGHYDAVSFIYIPIDGGYMSGSGVVVGENEILTNKHVVNGAKGNPRNISVHPSAKNENDYPNGKFVGQEIIPYPGNSDL
AILRVSPNEHNQHIGQVVKPATISSNTDTRINENITVTGYPGDKPLATMWESVGKVVYIGGEELRYDLSTVGGNSGSPVF
NGKNQVIGIHYGGVDNKYNSSVYINDFVQQFLRNNIPDINIQ
>P80057 3.4.21.19~~~blaSE~~~Glutamyl endopeptidase~~~COG3591
MVSKKSVKRGLITGLIGISIYSLGMHPAQAAPSPHTPVSSDPSYKAETSVTYDPNIKSDQYGLYSKAFTGTGKVNETKEK
AEKKSPAKAPYSIKSVIGSDDRTRVTNTTAYPYRAIVHISSSIGSCTGWMIGPKTVATAGHCIYDTSSGSFAGTATVSPG
RNGTSYPYGSVKSTRYFIPSGWRSGNTNYDYGAIELSEPIGNTVGYFGYSYTTSSLVGTTVTISGYPGDKTAGTQWQHSG
PIAISETYKLQYAMDTYGGQSGSPVFEQSSSRTNCSGPCSLAVHTNGVYGGSSYNRGTRITKEVFDNLTNWKNSAQ
>P0A6W9 6.3.2.2~~~gshA~~~Glutamate--cysteine ligase~~~COG2918
MIPDVSQALAWLEKHPQALKGIQRGLERETLRVNADGTLATTGHPEALGSALTHKWITTDFAEALLEFITPVDGDIEHML
TFMRDLHRYTARNMGDERMWPLSMPCYIAEGQDIELAQYGTSNTGRFKTLYREGLKNRYGALMQTISGVHYNFSLPMAFW
QAKCGDISGADAKEKISAGYFRVIRNYYRFGWVIPYLFGASPAICSSFLQGKPTSLPFEKTECGMYYLPYATSLRLSDLG
YTNKSQSNLGITFNDLYEYVAGLKQAIKTPSEEYAKIGIEKDGKRLQINSNVLQIENELYAPIRPKRVTRSGESPSDALL
RGGIEYIEVRSLDINPFSPIGVDEQQVRFLDLFMVWCALADAPEMSSSELACTRVNWNRVILEGRKPGLTLGIGCETAQF
PLPQVGKDLFRDLKRVAQTLDSINGGEAYQKVCDELVACFDNPDLTFSARILRSMIDTGIGGTGKAFAEAYRNLLREEPL
EILREEDFVAEREASERRQQEMEAADTEPFAVWLEKHA
>Q8Y3R3 ~~~gshAB~~~Glutathione biosynthesis bifunctional protein GshAB~~~COG0189
MLDSFKEDPNLRKLLFSGHFGLEKENIRVTSDGKLALTPHPAIFGPKEDNPYIKTDFSESQIEMITPVTDSIDSVYEWLE
NLHNIVSLRSENELLWPSSNPPILPAEEDIPIAEYKTPDSPDRKYREHLAKGYGKKIQLLSGIHYNFSFPEALIDGLYAN
ISLPEESKQDFKNRLYLKVAKYFMKNRWLLIYLTGASPVYLADFSKTKHEESLPDGSSALRDGISLRNSNAGYKNKEALF
VDYNSFDAYISSISNYIEAGKIESMREFYNPIRLKNAHTDQTVESLAEHGVEYLEIRSIDLNPLEPNGISKDELDFIHLF
LIKGLLSEDRELCANNQQLADENENNIALNGLAQPSIKNCDNEDIPLADAGLLELDKMSDFIKSLRPEDTKLRAIIEKQK
ERLLHPEKTIAAQVKQQVTKEGYVDFHLNQAKTYMEETEALAYKLIGAEDMELSTQIIWKDAIARGIKVDVLDRAENFLR
FQKGDHIEYVKQASKTSKDNYVSVLMMENKVVTKLVLAEHDIRVPFGDSFSDQALALEAFSLFEDKQIVVKPKSTNYGWG
ISIFKNKFTLEDYQEALNIAFSYDSSVIIEEFIPGDEFRFLVINDKVEAVLKRVPANVTGDGIHTVRELVEEKNTDPLRG
TDHLKPLEKIRTGPEETLMLSMQNLSWDSIPKAEEIIYLRENSNVSTGGDSIDYTEEMDDYFKEIAIRATQVLDAKICGV
DIIVPRETIDRDKHAIIELNFNPAMHMHCFPYQGEQKKIGDKILDFLFD
>Q9CM00 ~~~gshAB~~~Glutathione biosynthesis bifunctional protein GshAB~~~
MKIQHIIHENQLGLLFQQGSFGLEKESQRVTADGAIVTTPHPAVFGNRRYHPYIQTDFAESQLELITPPTKKLEDTFRWL
SVIHEVVQRSLPEEEYIFPLSMPAGLPAEEQIRVAQLDNPEDVAYREYLVKIYGKNKQMVSGIHYNFQLSPDLITRLFRL
QNEYQSAVDFQNDLYLKMAKNFLRYQWILLYLLAATPTVESAYFKDGSPLAKGQFVRSLRSSQYGYVNDPEINVSFDSVE
KYVESLEHWVSTGKLIAEKEFYSNVRLRGAKKAREFLTTGIQYLEFRLFDLNPFEIYGISLKDAKFIHVFALFMIWMDHT
ADQEEVELGKARLAEVAFEHPLEKTAYAVEGELVLLELLSMLEQIGAEPELFEIVKEKLTQFTDPSKTVAGRLVRAIEQA
GSDQQLGAQLAQQYKAQAFERFYALSAFDNMELSTQALLFDVIQKGIHTEILDENDQFLCLKYGDHIEYVKNGNMTSHDS
YISPLIMENKVVTKKVLQKAGFNVPQSVEFTSLEKAVASYALFENRAVVIKPKSTNYGLGITIFQQGVQNREDFAKALEI
AFREDKEVMVEDYLVGTEYRFFVLGDETLAVLLRVPANVVGDSVHSVAELVAMKNDHPLRGDGSRTPLKKIALGEIEQLQ
LKEQGLTIDSIPAKDQLVQLRANSNISTGGDSIDMTDEMHESYKQLAVGITKAMGAAVCGVDLIIPDLKQPATPNLTSWG
VIEANFNPMMMMHIFPYAGKSRRLTQNVIKMLFPELE
>Q8DXM9 ~~~gshAB~~~Glutathione biosynthesis bifunctional protein GshAB~~~
MIIDRLLQRSHSHLPILQATFGLERESLRIHQPTQRVAQTPHPKTLGSRNYHPYIQTDYSEPQLELITPIAKDSQEAIRF
LKAISDVAGRSINHDEYLWPLSMPPKVREEDIQIAQLEDAFEYDYRKYLEKTYGKLIQSISGIHYNLGLGQELLTSLFEL
SQADNAIDFQNQLYMKLSQNFLRYRWLLTYLYGASPVAEEDFLDQKLNNPVRSLRNSHLGYVNHKDIRISYTSLKDYVND
LENAVKSGQLIAEKEFYSPVRLRGSKACRNYLEKGITYLEFRTFDLNPFSPIGITQETVDTVHLFLLALLWIDSSSHIDQ
DIKEANRLNDLIALSHPLEKLPNQAPVSDLVDAMQSVIQHFNLSPYYQDLLESVKRQIQSPELTVAGQLLEMIEGLSLET
FGQRQGQIYHDYAWEAPYALKGYETMELSTQLLLFDVIQKGVNFEVLDEQDQFLKLWHNSHIEYVKNGNMTSKDNYIVPL
AMANKVVTKKILDEKHFPTPFGDEFTDRKEALNYFSQIQDKPIVVKPKSTNFGLGISIFKTSANLASYEKAIDIAFTEDS
AILVEEYIEGTEYRFFVLEGDCIAVLLRVAANVVGDGIHTISQLVKLKNQNPLRGYDHRSPLEVIELGEVEQLMLEQQGY
TVNSIPPEGTKIELRRNSNISTGGDSIDVTNTMDPTYKQLAAEMAEAMGAWVCGVDLIIPNATQAYSKDKKNATCIELNF
NPLMYMHTYCQEGPGQSITPRILAKLFPEL
>A0A482PU20 6.3.2.3~~~gshB~~~Glutathione synthetase~~~
MIKLGIVMDPIASINIKKDSSFAMLLEAQRRGYELHYMEMADLYLINGEARARTRTLSVEQNYDKWYDFTGEQDLALDSL
DAILMRKDPPFDTEFIYATYILERAEEKGTLIVNKPQSLRDCNEKLFTAWFSDLTPETLVTRNKAQLKAFWQKHSDIILK
PLDGMGGASIFRVKEGDPNLGVIAETLTEHGTRYCMAQNYLPAIVDGDKRVLVVDGEPVPYCLARIPQGGETRGNLAAGG
RGEPRPLTDSDWAIARRIGPTLKAKGLIFVGLDIIGDRLTEINVTSPTCIREIEAEFPISITGMLMDAIEARLQK
>B7UHZ4 6.3.2.3~~~gshB~~~Glutathione synthetase~~~
MIKLGIVMDPIANINIKKDSSFAMLLEAQRRGYELHYMEMADLYLINGEARARTRTLSVEQNYDKWYEFTGEQDLPLADL
DVILMRKDPPFDTEFIYATYILERAEEKGTLIVNKPQSLRDCNEKLFTAWFSDLTPETLVTRNKAQLKAFWEKHSDIILK
PLDGMGGASIFRVKEGDPNLGVIAETLTEHGTRYCMAQNYLPAIKDGDKRVLVVDGEPVPYCLARIPQGGETRGNLAAGG
RGEPRPLTESDWKIARQIGPTLKEKGLIFVGLDIIGDRLTEINVTSPTCIREIEAEFPVSITGMLMDAIEARLQQQ
>P04425 6.3.2.3~~~gshB~~~Glutathione synthetase~~~COG0189
MIKLGIVMDPIANINIKKDSSFAMLLEAQRRGYELHYMEMGDLYLINGEARAHTRTLNVKQNYEEWFSFVGEQDLPLADL
DVILMRKDPPFDTEFIYATYILERAEEKGTLIVNKPQSLRDCNEKLFTAWFSDLTPETLVTRNKAQLKAFWEKHSDIILK
PLDGMGGASIFRVKEGDPNLGVIAETLTEHGTRYCMAQNYLPAIKDGDKRVLVVDGEPVPYCLARIPQGGETRGNLAAGG
RGEPRPLTESDWKIARQIGPTLKEKGLIFVGLDIIGDRLTEINVTSPTCIREIEAEFPVSITGMLMDAIEARLQQQ
>P06715 1.8.1.7~~~gor~~~Glutathione reductase~~~COG1249
MTKHYDYIAIGGGSGGIASINRAAMYGQKCALIEAKELGGTCVNVGCVPKKVMWHAAQIREAIHMYGPDYGFDTTINKFN
WETLIASRTAYIDRIHTSYENVLGKNNVDVIKGFARFVDAKTLEVNGETITADHILIATGGRPSHPDIPGVEYGIDSDGF
FALPALPERVAVVGAGYIAVELAGVINGLGAKTHLFVRKHAPLRSFDPMISETLVEVMNAEGPQLHTNAIPKAVVKNTDG
SLTLELEDGRSETVDCLIWAIGREPANDNINLEAAGVKTNEKGYIVVDKYQNTNIEGIYAVGDNTGAVELTPVAVAAGRR
LSERLFNNKPDEHLDYSNIPTVVFSHPPIGTVGLTEPQAREQYGDDQVKVYKSSFTAMYTAVTTHRQPCRMKLVCVGSEE
KIVGIHGIGFGMDEMLQGFAVALKMGATKKDFDNTVAIHPTAAEEFVTMR
>P75796 7.4.2.10~~~gsiA~~~Glutathione import ATP-binding protein GsiA~~~COG4172
MPHSDELDAGNVLAVENLNIAFMQDQQKIAAVRNLSFSLQRGETLAIVGESGSGKSVTALALMRLLEQAGGLVQCDKMLL
QRRSREVIELSEQNAAQMRHVRGADMAMIFQEPMTSLNPVFTVGEQIAESIRLHQNASREEAMVEAKRMLDQVRIPEAQT
ILSRYPHQLSGGMRQRVMIAMALSCRPAVLIADEPTTALDVTIQAQILQLIKVLQKEMSMGVIFITHDMGVVAEIADRVL
VMYQGEAVETGTVEQIFHAPQHPYTRALLAAVPQLGAMKGLDYPRRFPLISLEHPAKQAPPIEQKTVVDGEPVLRVRNLV
TRFPLRSGLLNRVTREVHAVEKVSFDLWPGETLSLVGESGSGKSTTGRALLRLVESQGGEIIFNGQRIDTLSPGKLQALR
RDIQFIFQDPYASLDPRQTIGDSIIEPLRVHGLLPGKDAAARVAWLLERVGLLPEHAWRYPHEFSGGQRQRICIARALAL
NPKVIIADEAVSALDVSIRGQIINLLLDLQRDFGIAYLFISHDMAVVERISHRVAVMYLGQIVEIGPRRAVFENPQHPYT
RKLLAAVPVAEPSRQRPQRVLLSDDLPSNIHLRGEEVAAVSLQCVGPGHYVAQPQSEYAFMRR
>P26907 ~~~gsiB~~~Glucose starvation-inducible protein B~~~COG3729
MADNNKMSREEAGRKGGETTSKNHDKEFYQEIGQKGGEATSKNHDKEFYQEIGEKGGEATSKNHDKEFYQEIGEKGGEAT
SENHDKEFYQEIGRKGGEATSKNHDKEFYQEIGSKGGNARNND
>P75797 ~~~gsiB~~~Glutathione-binding protein GsiB~~~COG0747
MARAVHRSGLVALGIATALMASCAFAAKDVVVAVGSNFTTLDPYDANDTLSQAVAKSFYQGLFGLDKEMKLKNVLAESYT
VSDDGITYTVKLREGIKFQDGTDFNAAAVKANLDRASDPANHLKRYNLYKNIAKTEAIDPTTVKITLKQPFSAFINILAH
PATAMISPAALEKYGKEIGFYPVGTGPYELDTWNQTDFVKVKKFAGYWQPGLPKLDSITWRPVADNNTRAAMLQTGEAQF
AFPIPYEQATLLEKNKNIELMASPSIMQRYISMNVTQKPFDNPKVREALNYAINRPALVKVAFAGYATPATGVVPPSIAY
AQSYKPWPYDPVKARELLKEAGYPNGFSTTLWSSHNHSTAQKVLQFTQQQLAQVGIKAQVTAMDAGQRAAEVEGKGQKES
GVRMFYTGWSASTGEADWALSPLFASQNWPPTLFNTAFYSNKQVDDFLAQALKTNDPAEKTRLYKAAQDIIWQESPWIPL
VVEKLVSAHSKNLTGFWIMPDTGFSFEDADLQ
>P75798 ~~~gsiC~~~Glutathione transport system permease protein GsiC~~~COG0601
MLNYVIKRLLGLIPTLFIVSVLVFLFVHMLPGDPARLIAGPEADAQVIELVRQQLGLDQPLYHQFWHYISNAVQGDFGLS
MVSRRPVADEIASRFMPTLWLTITSMVWAVIFGMAAGIIAAVWRNRWPDRLSMTIAVSGISFPAFALGMLLIQVFSVELG
WLPTVGADSWQHYILPSLTLGAAVAAVMARFTRASFVDVLSEDYMRTARAKGVSETWVVLKHGLRNAMIPVVTMMGLQFG
FLLGGSIVVEKVFNWPGLGRLLVDSVEMRDYPVIQAEILLFSLEFILINLVVDVLYAAINPAIRYK
>P75799 ~~~gsiD~~~Glutathione transport system permease protein GsiD~~~COG1173
MRLFNWRRQAVLNAMPLVKPDQVRTPWHEFWRRFRRQHMAMTAALFVILLIVVAIFARWIAPYDAENYFDYDNLNNGPSL
QHWFGVDSLGRDIFSRVLVGAQISLAAGVFAVFIGAAIGTLLGLLAGYYEGWWDRLIMRICDVLFAFPGILLAIAVVAVL
GSGIANVIIAVAIFSIPAFARLVRGNTLVLKQQTFIESARSIGASDMTVLLRHILPGTVSSIVVFFTMRIGTSIISAASL
SFLGLGAQPPTPEWGAMLNEARADMVIAPHVAVFPALAIFLTVLAFNLLGDGLRDALDPKIKG
>Q83WC4 2.1.1.156~~~~~~Glycine/sarcosine N-methyltransferase~~~
MAIKEKQVQDYGENPIEVRDSDHYQNEYIEGFVEKWDELINWHARSSSEGEFFIKTLKEHGAKRVLDAATGTGFHSIRLI
EAGFDVASVDGSVEMLVKAFENATRKDQILRTVHSDWRQVTRHIQERFDAVICLGNSFTHLFSEEDRRKTLAEFYSVLKH
DGILILDQRNYDLILDEGFKSKHTYYYCGDNVKAEPEYVDDGLARFRYEFPDQSVYHLNMFPLRKDYVRRLLHEVGFQDI
TTYGDFQETYHQDDPDFYIHVAKKD
>Q9KJ22 2.1.1.156~~~~~~Glycine/sarcosine N-methyltransferase~~~
MNTTTEQDFGADPTKVRDTDHYTEEYVDGFVDKWDDLIDWDSRAKSEGDFFIQELKKRGATRILDAATGTGFHSVRLLEA
GFDVVSADGSAEMLAKAFENGRKRGHILRTVQVDWRWLNRDIHGRYDAIICLGNSFTHLFNEKDRRKTLAEFYSALNPEG
VLILDQRNYDGILDHGYDSSHSYYYCGEGVSVYPEHVDDGLARFKYEFNDGSTYFLNMFPLRKDYTRRLMHEVGFQKIDT
YGDFKATYRDADPDFFIHVAEKEYREED
>Q7U4Z8 2.1.1.156~~~bsmA~~~Glycine/sarcosine N-methyltransferase~~~COG2226
MTSTQNHPLQTQDDQQRFGQSPESVRETDHYQQEYIEDFTDRWDRLIDWNARAEAEGDFFIRLLKEHGARSVLDVATGTG
FHSIRLLEEGFDVVSADGSPNMLARAFRNARNRDQLLRTSQADWRFLNRDIHGEFDAVICLGNSFTHLFKERDRRKALAE
YYAVLKHNGILILDHRNYDRLLEGGSAVRQGKGNVYCGKDVEVGPEHVDEGLARFRYSFSDGGVYHLNMFPLRYGYVRRL
MSEVGFQQITSFGDYQRDFENPDFYVHVAEKEYRFDVDTTMH
>P25148 ~~~gspA~~~General stress protein A~~~COG1442
MRKDEIMHIVSCADDNYARHLGGMFVSLLTNMDQEREVKLYVIDGGIKPDNKKRLEETTLKFGVPIEFLEVDTNMYEHAV
ESSHITKAAYYRISIPDLIKDESIKRMIYIDCDALVLEDISKLWDLDIAPYTVAAVEDAGQHERLKEMNVTDTGKYFNSG
IMIIDFESWRKQNITEKVINFINEHPDEDFLVLHDQDALNAILYDQWYELHPRWNAQTYIMLKLKTPSTLLGRKQYNETR
ENPAIVHFCGGEKPWNSNTKHPYRDEYFHYMSYTKWNTIGNPAINQ
>P45756 ~~~gspA~~~Putative general secretion pathway protein A~~~COG3267
MSTRREVILSWLCEKRQTWRLCYLLGEAGSGKTWLAQQLQKDKHRRVITLSLVVSWQGKAAWIVTDDNAAEQGCRDSAWT
RDEMAGQLLHALHRTDSRCPLIIIENAHLNHRRILDDLQRAISLIPDGQFLLIGRPDRKVERDFKKQGIELVSIGRLTEH
ELKASILEGQNIDQPDLLLTARVLKRIALLCRGDRRKLALAGETIRLLQQAEQTSVFTAKQWRMIYRILGDNRPRKMQLA
VVMSGTIIALTCGWLLLSSFTATLPVPAWLIPVTPVVKQDMTKDIAHVVMRDSEALSVLYGVWGYEVPADSAWCDQAVRA
GLACKSGNASLQTLVDQNLPWIASLKVGDKKLPVVVVRVGEASVDVLVGQQTWTLTHKWFESVWTGDYLLLWKMSPEGES
TITRDSSEEEILWLETMLNRALHISTEPSAEWRPLLVEKIKQFQKSHHLKTDGVVGFSTLVHLWQVAGESAYLYRDEANI
SPETTVKGK
>Q01563 ~~~outB~~~General secretion pathway protein B~~~COG3266
MKNTPEVKASPQTGYRIPGYLLVVYALLLFTLGWFGHQRWADISPIPLSTSAAAIAAPPTKVGMVPASTAVTADENHPGS
LHAAAENSATQTAASGSQTSASSQEATPDTKPAKLVTGWQTAKPGELPYIAFSAHVYTSAPDKRSVTLNGERYREGDSPY
QGLVIEQIEQDMVIFSFNGEPFILDSLQDWPGGKPGDDAAQGNEQEPTSKPEQTVRTTKK
>P03825 ~~~gspB~~~Putative general secretion pathway protein B~~~
MFEFYIAAREQKETGHPGIFSRQKHSTIIYVICLLLICLWFAGMVLVGGYARQLWVLWIVKAEVTVEAETPAFKQSTQHY
FFKKQPLPVVESVEEEDDPGVAVENAPSSSEDEENTVEESEEKAGLRERVKNALNELER
>Q939N5 ~~~gspB~~~Platelet binding protein GspB~~~
MFFKRQKGQYHEVERVTRFKLIKSGKHWLRAATSQFGLLRLMKGSDVSSTEVKVVEEQSVEKSGLNYLKGIIATGAVLGG
AVVTSSSVYAEEEQAHEKVIDTRDVLATRGEAVLSEEAATTLSSTEANPVESLSDTLSASESTSASSSVSTSISVSESFS
VSGSLSYSTSLSQSVSASASASESLSVSSSASDSVSASTSTSASASQSVSASQKSTISTSESTRSESSQQSTEASSQTGR
RRTRRAVTESAPNVEYHDVKGDMIQSVTTSFDDTSRLLTWTINLTPRQVKSNLGALVSISGNQETRTVTINGKNAANGGV
YNSGGAWNLYTGESVNNNVLRITTQVNDTGGEVKLGLRLVTSDKKITKTNLPLEFSQVAATTNGSWDKAGYNTTIVEKDT
ERPVVNVPSEITVYRGESFEYFATVTDNSNAFDLAKTVVRWLYNNQPGRGTEWLQYSVTQVGNQLKVRIFGNVPIDTTIG
DYTRYVVATDAAGNVNATQTEMGNAAVDKTSVNGQFKLIIRFRIKTPENTVFVNNPNQLTEVEKNLVREAVKKSNPDLRA
QDVLNSNYVTGITVSNNGTTTITYRDGRKDIIDGSKFIDTRAGSISKSQSTSNSISVSLSKSESASASLVTSKLNSISSS
ASVSASTSISTSGSVSASESASTSSSVSASESASTSASVSASESASTSASVSASTSASTSASVSASTSASTSASTSASKS
ASTSASVSASTSASTSASVSASESASTSASVSASTSASTSASVSASTSASTSASVSASESASTSASVSASTSASTSASVS
ASESASTSASVSASTSASTSASVSASASASTSASVSASTSASTSASVSASASASTSASVSASTSASTSASVSASESASTS
ASVSASESASTSASVSASESASTSASVSASESASTSASVSASTSASTSASVSASESASTSASVSASESASTSASVSASES
ASTSASVSASESASTSASVSASTSASTSASVSASTSASTSASVSASTSASTSASVSASTSASTSASVSASESASTSASVS
ASESASTSASVSASTSASTSASVSASESASTSASVSASESASTSASESASESASTSASVSASESASTSASVSASESSSTS
ASVSASESSSTSASVSASESASTSASVSASESASTSASESASESASTSASVSASESASTSASVSASESASTSASVSASES
VSTSASVSASESASTSASVSASESASTSASESASESASTSASVSASESASTSASVSASESASTSASVSASTSASTSASVS
ASESASTSASVSASESASTSASVSASESASTSASVSASESVSTSASVSASESASTSASVSASESASTSASESASESASTS
ASVSASESASTSASVSASESASTSASVSASTSASTSASVSASESASTSASVSASESASTSASVSASESASTSASVSASTS
ASTSASVSASESASTSTSVSTSTSASTSASVSASESASTSASVSASESASTSASVSASTSASTSASVSASESASTSASVS
ASESASTSASVSASESASTSASVSASESASTSASVSASTSASTSASVSASESASTSASVSASESASTSASVSASESASTS
ASVSASESASTSASVSASESASTSASVSASESASTSASVSASESASTSASVSASESASTSASVSASESASTSASVSASES
ASTSASVSASESASTSASVSASESASTSASVSASESASTSASVSASESASTSASVSASESASTSASVSASESASTSASVS
ASESASTSASVSASESASTSASVSASESASTSASVSASESASTSASVSASESASTSASVSASESASTSASVSASESASTS
ASVSASESASTSASVSASESASTSASVSASTSTSTSASVSASESASTSASVSASESASTSASVSASESASTSASVSASES
ASTSASVSASESASTSASVSASESASTSASVSASESASTSASVSASESASTSASVSASESASTSASVSASTSASTSASVS
ASESASTSASVSASESASTSASVSASESASTSASVSASESASTSASVSASESASTSASVSASESASTSASVSASESASTS
ASVSASESASTSASVSASKSASTSESASTSASVSASESASTSASVSASESASTSASVSASESVSTSASVSASDSASISAS
VLASESASTSASVSASESASTSASVSASESASTSASVSASESASTSSSVSASESASTSASVSASESASTSASVSASTSAS
TSASVSASESASTSASVSASESASTSASVSASESASTSASVSASESASTSASVSASESASTSASVSASESASTSASVSAS
ESASTSASVSASTSASTSASVSASESASTSASVSSSESASTSASVSASESASTSASVSASESASTSASVSASESASTSAS
VSASESASTSASVSASTSASTSASVSASESASTSASVSASESASTSASVSASESASTSASVSASESASTSASVSASTSAS
TSASVSASESASTSASVSASESASTSASVSASTSASTSASVSASESASTSASVSASESASTSASVSASESASTSASVSAS
ESASTSASVSASESASTSASVSASESASTSASVSASESASTSASVSASMSASTSASVSVSESTSTSASVSANESASTSAS
VSASESASTSASVSASESASTSASVSASESASTSASVSASESASTSASVSASESASTSASVSASESASTSASVSASESAS
TSASVSASESASTSASVSASESASTSASVSASTSASTSASVSANESASTSASVSASESASTSASVSASESASTSASVSAS
ESASTSASVSASESASTSASVSASTSASTSASVSANESASTSASVSASESASTSASVSASESASTSASVSASESASTSAS
VSASESASTSASVSASTSASTSASVSASESASTSASASASESASTSASVSASESASTSASVSASESASTSASVSASESAS
TNASVSVSESMSVSESLSLSISTSVLHSQLNDIYESELYSLSLSESLSASQSLSQSLSESQSSSASQSMHDRISKGQLPR
TGESENKASILALGLGALGLAFKKRKKNESED
>Q01564 ~~~outC~~~Type II secretion system protein C~~~COG3031
MNISKLPPLSPSVIRRILFYLLMLLFCQQLAMIFWRIGLPDNAPVSSVQITPAQARQQPVTLNDFTLFGVSPEKNKAGAL
DASQMSNLPPSTLNLSLTGVMAGDDDSRSIAIISKDNEQFSRGVNEEVPGYNAKIVSIRPDRVVLQYQGRYEVLGLYSQE
DSGSDGVPGAQVNEQLQQRASTTMSDYVSFSPIMNDNKLQGYRLNPGPKSDSFYRVGLQDNDMAVALNGLDLRDAEQAKK
AMERMADVHNFTLTVERDGQRQDIYMEFGGDE
>E3PJ87 ~~~gspC2~~~Type II secretion system protein C 2~~~
MARVVFRDARIYLIQWLTKIRHTLNQRQSLNTDKEHLRKIVRGMFWLMLLIISAKVAHSLWRYFSFSAEYTAVSPSANKP
PRADAKTFDKNDVQLISQQNWFGKYQPVATPVKQPEPASVAETRLNVVLRGIAFGARPGAVIEEGGKQQVYLQGERLDSH
NAVIEEINRDHVMLRYQGKIERLSLAEEGHSTVAVTNKKAVSDEAKQAVAEPAASAPVEIPTAVRQALTKDPQKIFNYIQ
LTPVRKEGIVGYAVKPGADRSLFDASGFKEGDIAIALNQQDFTDPRAMIALMRQLPSMDSIQLTVLRKGARHDISIALR
>P45757 ~~~gspC~~~Putative type II secretion system protein C~~~COG3031
MPTLRFPFHLANHNKDAAINILIIFISIGSIIFNVNYFHTTIVKNGQIINQPTNAFQSDFSLAALWRNENHAGVKDANPV
AVNQETPKLSIALNGIVLTSNDETSFVLINEGSEQKRYSLNEALESAPGTFIRKINKTSVVFETHGHYEKVTLHPGLPDI
IKQPDSESQNVLADYIIATPIRDGEQIYGLRLNPRKGLNAFTTSLLQPGDIALRINNLSLTHPDEVSQALSLLLTQQSAQ
FTIRRNGVPRLINVSVGELTGMNGLRHERTQ
>P45777 ~~~epsC~~~Type II secretion system protein C~~~COG3031
MEFKQLPPLAAWPRLLSQNTLRWQKPISEGLTLLLLVASAWTLGKMVWVVSAEQTPVPTWSPTLSGLKAERQPLDISVLQ
KGELFGVFTEPKEAPVVEQPVVVDAPKTRLSLVLSGVVASNDAQKSLAVIANRGVQATYGINEVIEGTQAKLKAVMPDRV
IISNSGRDETLMLEGLDYTAPATASVSNPPRPRPNQPNAVPQFEDKVDAIREAIARNPQEIFQYVRLSQVKRDDKVLGYR
VSPGKDPVLFESIGLQDGDMAVALNGLDLTDPNVMNTLFQSMNEMTEMSLTVERDGQQHDVYIQF
>Q01565 ~~~outD~~~Secretin OutD~~~COG1450
MLGKGIKKSWGWLGLTVLLLGSPCGWAAEFSASFKGTDIQEFINTVSKNLNKTVIIDPTVRGTISVRSYDMMNEGQYYQF
FLSVLDVYGFSVVPMDNGVLKVIRSKDAKSSSIPLANNEQPGIGDELVTRVVPLNNVAARDLAPLLRQLNDNAGAGTVVH
YEPSNVLLMTGRAAVIKRLVDIVNTVDKTGDREMVTVPLTYASAEDVAKLVNDLNKSDEKNALPSTMLANVVADGRTNSV
VVSGEENARQRAVEMIRQLDRKQVVQGGTKVIYLKYAKALDLIEVLAGNGTSGNRNSSSSNASRPSSPRSGSSSNSNSSS
GSSGSSSGSSSSSSSSSSMGFGSAFGSTSSSGGRTITIQGKEVTVRAHDQTNSLIITAPPDIMRDLEQVINQLDIRRPQV
LVEAIIAEIQDADGLNLGIQWANKRAGMTQFTNTGIPISTAVIGTDQFRSNGTLTTAYASALSSFNGVTAGFYRGNWSML
LTALSSDSKNDVLATPSIVTLDNMEATFNVGQEVPVLTGSQTTSADNIFNTVERKTVGIKLRVKPQINEGDSVLLQIEQE
VSSVADSNSSTNSSLGVTFNTRTVNNAVMVTNGETVVVGGLLDKTSVESNDKVPLLGDIPWLGSLFRSKSQEVRKRNLML
FLRPTIIRDPGQFQEASINKYRSFNNEQQQQRGEGNGVLDNNTLRLSGGNTYTFRQVQSSISDFYKPEGR
>E3PJ86 ~~~gspD2~~~Secretin GspD 2~~~
MFWRDITLSVWRKKTTGLKTKKRLLPLVLAAALCSSPVWAEEATFTANFKDTDLKSFIETVGANLNKTIIMGPGVQGKVS
IRTMTPLNERQYYQLFLNLLEAQGYAVVPMENDVLKVVKSSAAKVEPLPLVGEGSDNYAGDEMVTKVVPVRNVSVRELAP
ILRQMIDSAGSGNVVNYDPSNVIMLTGRASVVERLTEVIQRVDHAGNRTEEVIPLDNASASEIARVLESLTKNSGENQPA
TLKSQIVADERTNSVIVSGDPATRDKMRRLIRRLDSEMERSGNSQVFYLKYSKAEDLVDVLKQVSGTLTAAKEEAEGTVG
SGREIVSIAASKHSNALIVTAPQDIMQSLQSVIEQLDIRRAQVHVEALIVEVAEGSNINFGVQWASKDAGLMQFANGTQI
PIGTLGAAISQAKPQKGSTVISENGATTINPDTNGDLSTLAQLLSGFSGTAVGVVKGDWMALVQAVKNDSSSNVLSTPSI
TTLDNQEAFFMVGQDVPVLTGSTVGSNNSNPFNTVERKKVGIMLKVTPQINEGNAVQMVIEQEVSKVEGQTSLDVVFGER
KLKTTVLANDGELIVLGGLMDDQAGESVAKVPLLGDIPLIGNLFKSTADKKEKRNLMVFIRPTILRDGMAADGVSQRKYN
YMRAEQIYRDEQGLSLMPHTAQPVLPAQNQALPPEVRAFLNAGRTR
>P31780 ~~~exeD~~~Secretin ExeD~~~COG1450
MINKGKGWRLATVAAALMMAGSAWATEYSASFKNADIEEFINTVGKNLSKTIIIEPSVRGKINVRSYDLLNEEQYYQFFL
SVLDVYGFAVVPMDNGVLKVVRSKDAKTSAIPVVDETNPGIGDEMVTRVVPVRNVSVRELAPLLRQLNDNAGGGNVVHYD
PSNVLLITGRAAVVNRLVEVVRRVDKAGDQEVDIIKLKYASAGEMVRLVTNLNKDGNSQGGNTSLLLAPKVVADERTNSV
VVSGEPKARARIIQMVRQLDRDLQSQGNTRVFYLKYGKAKDMVEVLKGVSSSIEADKKGGGTATTAGGGASIGGGKLAIS
ADETTNALVITAQPDVMAELEQVVAKLDIRRAQVLVEAIIVEIADGDGLNLGVQWANTNGGGTQFTNAGPGIGSVAIAAK
DYKDNGTTTGLAKLAENFNGMAAGFYQGNWAMLVTALSTNTKSDILSTPSIVTMDNKEASFNVGQEVPVQTGTQNSTSGD
TTFSTIERKTVGTKLVVTPQINEGDSVLLTIEQEVSSVGKQATGTDGLGPTFDTRTVKNAVLVKSGETVVLGGLMDEQTK
EEVSKVPLLGDIPVLGYLFRSTSNNTSKRNLMVFIRPTILRDANVYSGISSNKYTLFRAQQLDAVAQEGYATSPDRQVLP
EYGQDVTMSPEAQKQIELMKTHQQATADGVQPFVQGNK
>P45758 ~~~gspD~~~Putative secretin GspD~~~COG1450
MKGLNKITCCLLAALLMPCAGHAENEQYGANFNNADIRQFVEIVGQHLGKTILIDPSVQGTISVRSNDTFSQQEYYQFFL
SILDLYGYSVITLDNGFLKVVRSANVKTSPGMIADSSRPGVGDELVTRIVPLENVPARDLAPLLRQMMDAGSVGNVVHYE
PSNVLILTGRASTINKLIEVIKRVDVIGTEKQQIIHLEYASAEDLAEILNQLISESHGKSQMPALLSAKIVADKRTNSLI
ISGPEKARQRITSLLKSLDVEESEEGNTRVYYLKYAKATNLVEVLTGVSEKLKDEKGNARKPSSSGAMDNVAITADEQTN
SLVITADQSVQEKLATVIARLDIRRAQVLVEAIIVEVQDGNGLNLGVQWANKNVGAQQFTNTGLPIFNAAQGVADYKKNG
GITSANPAWDMFSAYNGMAAGFFNGDWGVLLTALASNNKNDILATPSIVTLDNKLASFNVGQDVPVLSGSQTTSGDNVFN
TVERKTVGTKLKVTPQVNEGDAVLLEIEQEVSSVDSSSNSTLGPTFNTRTIQNAVLVKTGETVVLGGLLDDFSKEQVSKV
PLLGDIPLVGQLFRYTSTERAKRNLMVFIRPTIIRDDDVYRSLSKEKYTRYRQEQQQRIDGKSKALVGSEDLPVLDENTF
NSHAPAPSSR
>P15644 ~~~pulD~~~Secretin PulD~~~
MIIANVIRSFSLTLLIFAALLFRPAAAEEFSASFKGTDIQEFINTVSKNLNKTVIIDPSVRGTITVRSYDMLNEEQYYQF
FLSVLDVYGFAVINMNNGVLKVVRSKDAKTAAVPVASDAAPGIGDEVVTRVVPLTNVAARDLAPLLRQLNDNAGVGSVVH
YEPSNVLLMTGRAAVIKRLLTIVERVDNAGDRSVVTVPLSWASAADVVKLVTELNKDTSKSALPGSMVANVVADERTNAV
LVSGEPNSRQRIIAMIKQLDRQQATQGNTKVIYLKYAKASDLVEVLTGISSTMQSEKQAAKPVAALDKNIIIKAHGQTNA
LIVTAAPDVMNDLERVIAQLDIRRPQVLVEAIIAEVQDADGLNLGIQWANKNAGMTQFTNSGLPISTAIAGANQYNKDGT
VSSSLASALSSFNGIAAGFYQGNWAMLLTALSSSTKNDILATPSIVTLDNMEATFNVGQEVPVLTGSQTTSGDNIFNTVE
RKTVGIKLKVKPQINEGDSVLLEIEQEVSSVADAASSTSSDLGATFNTRTVNNAVLVGSGETVVVGGLLDKSVSDTADKV
PLLGDIPVIGALFRSTSKKVSKRNLMLFIRPTVIRDRDEYRQASSGQYTAFNDAQSKQRGKENNDAMLNQDLLEIYPRQD
TAAFRQVSAAIDAFNLGGNL
>P35818 ~~~xcpQ~~~Secretin XcpQ~~~
MSQPLLRALFAPSSRSYVPAVLLSLALGIQAAHAENSGGNAFVPAGNQQEAHWTINLKDADIREFIDQISEITGETFVVD
PRVKGQVSVVSKAQLSLSEVYQLFLSVMSTHGFTVVAQGDQARIVPNAEAKTEAGGGQSAPDRLETRVIQVQQSPVSELI
PLIRPLVPQYGHLAAVPSANALIISDRSANIARIEDVIRQLDQKGSHDYSVINLRYGWVMDAAEVLNNAMSRGQAKGAAG
AQVIADARTNRLIILGPPQARAKLVQLAQSLDTPTARSANTRVIRLRHNDAKTLAETLGQISEGMKNNGGQGGEQTGGGR
PSNILIRADESTNALVLLADPDTVNALEDIVRQLDVPRAQVLVEAAIVEISGDIQDAVGVQWAINKGGMGGTKTNFANTG
LSIGTLLQSLESNKAPESIPDGAIVGIGSSSFGALVTALSANTKSNLLSTPSLLTLDNQKAEILVGQNVPFQTGSYTTNS
EGSSNPFTTVERKDIGVSLKVTPHINDGAALRLEIEQEISALLPNAQQRNNTDLITSKRSIKSTILAENGQVIVIGGLIQ
DDVSQAESKVPLLGDIPLLGRLFRSTKDTHTKRNLMVFLRPTVVRDSAGLAALSGKKYSDIRVIDGTRGPEGRPSILPTN
ANQLFDGQAVDLRELMTE
>P45779 ~~~epsD~~~Secretin GspD~~~
MKYWLKKSSWLLAGSLLSTPLAMANEFSASFKGTDIQEFINIVGRNLEKTIIVDPSVRGKVDVRSFDTLNEEQYYSFFLS
VLEVYGFAVVEMDNGVLKVIKSKDAKTSAIPVLSGEERANGDEVITQVVAVKNVSVRELSPLLRQLIDNAGAGNVVHYDP
ANIILITGRAAVVNRLAEIIRRVDQAGDKEIEVVELNNASAAEMVRIVEALNKTTDAQNTPEFLKPKFVADERTNSILIS
GDPKVRERLKRLIKQLDVEMAAKGNNRVVYLKYAKAEDLVEVLKGVSENLQAEKGTGQPTTSKRNEVMIAAHADTNSLVL
TAPQDIMNAMLEVIGQLDIRRAQVLIEALIVEMAEGDGINLGVQWGSLESGSVIQYGNTGASIGNVMIGLEEAKDTTQTK
AVYDTNNNFLRNETTTTKGDYTKLASALSSIQGAAVSIAMGDWTALINAVSNDSSSNILSSPSITVMDNGEASFIVGEEV
PVITGSTAGSNNDNPFQTVDRKEVGIKLKVVPQINEGNSVQLNIEQEVSNVLGANGAVDVRFAKRQLNTSVMVQDGQMLV
LGGLIDERALESESKVPLLGDIPLLGQLFRSTSSQVEKKNLMVFIKPTIIRDGVTADGITQRKYNYIRAEQLFRAEKGLR
LLDDASVPVLPKFGDDRRHSPEIQAFIEQMEAKQ
>P29041 ~~~xpsD~~~Secretin XpsD~~~COG1450
MSERMTPRLFPVSLLIGLLAGCATTPPPDVRRDARLDPQVGAAGATQTTAEQRADGNASAKPTPVIRRGSGTMINQSAAA
APSPTLGMASSGSATFNFEGESVQAVVKAILGDMLGQNYVIAPGVQGTVTLATPNPVSPAQALNLLEMVLGWNNARMVFS
GGRYNIVPADQALAGTVAPSTASPSAARGFEVRVVPLKYISASEMKKVLEPYARPNAIVGTDASRNVITLGGTRAELENY
LRTVQIFDVDWLSGMSVGVFPIQSGKAEKISADLEKVFGEQSKTPSAGMFRFMPLENANAVLVITPQPRYLDQIQQWLDR
IDSAGGGVRLFSYELKYIKAKDLADRLSEVFGGRGNGGNSGPSLVPGGVVNMLGNNSGGADRDESLGSSSGATGGDIGGT
SNGSSQSGTSGSFGGSSGSGMLQLPPSTNQNGSVTLEVEGDKVGVSAVAETNTLLVRTSAQAWKSIRDVIEKLDVMPMQV
HIEAQIAEVTLTGRLQYGVNWYFENAVTTPSNADGSGGPNLPSAAGRGIWGDVSGSVTSNGVAWTFLGKNAAAIISALDQ
VTNLRLLQTPSVFVRNNAEATLNVGSRIPINSTSINTGLGSDSSFSSVQYIDTGVILKVRPRVTKDGMVFLDIVQEVSTP
GARPAACTAAATTTVNSAACNVDINTRRVKTEAAVQNGDTIMLAGLIDDSTTDGSNGIPFLSKLPVVGALFGRKTQNSDR
REVIVLITPSIVRNPQDARDLTDEYGSKFKSMRPMDVHK
>P31702 7.4.2.8~~~outE~~~Type II secretion system protein E~~~
MSDQPVHTSELRPVLPFAFARAQQILLLQDESASAAEVVCVPETPALALLEVRRVAGVALTVSQVSPEEFERQLVMRYQR
DSEEARRLMEDIGNDIDFYTLAEELPDSDDLLDGEDDAPIIRLINAMLTEAIKHKASDIHIETFERHLLIRFRIDGVLRE
ILRPQRQLASLLVSRIKVMAKLDIAEKRVPQDGRMALRIGGRAIDVRVSTLPSNYGERVVLRLLDKNSVRLDLETLGMAE
HHRRQLDTLIHRPHGIILVTGPTASGKSTTLYAALSPLNSAERNIMTVEDPIEYELEGIGQTQVNPKVDMTFARGLRAIL
RQDPDVVLVGEIRDGETAQIAVQASLTGHLVLSTLHTNSALGALSRLQDMGIEPFLLSTSLLGVLAQRLVRTLCPSCRQP
YTIDHEQAEQTGLAAGTTLYHPGGCEKCNYSGYRGRTGIHELLLIDDTVRAAIHRGESELGIARMLGAKRVTIRQDGLDK
VLAGITTWEEVVRVTKEE
>P45759 7.4.2.8~~~gspE~~~Putative type II secretion system protein E~~~COG2804
MRIHSPYPASWALAQRIGYLYSEGEIIYLADTPFERLLDIQRQVGQCQTMTSLSQADFEARLEAVFHQNTGESQQIAQDI
DQSVDLLSLSEEMPANEDLLNEDSAAPVIRLINAILSEAIKETASDIHIETYEKTMSIRFRIDGVLRTILQPNKKLAALL
ISRIKVMARLDIAEKRIPQDGRISLRIGRRNIDVRVSTLPSIYGERAVLRLLDKNSLQLSLNNLGMTAADKQDLENLIQL
PHGIILVTGPTGSGKSTTLYAILSALNTPGRNILTVEDPVEYELEGIGQTQVNTRVDMSFARGLRAILRQDPDVVMVGEI
RDTETAQIAVQASLTGHLVLSTLHTNSASGAVTRLRDMGVESFLLSSSLAGIIAQRLVRRLCPQCRQFTPVSPQQAQMFK
YHQLAVTTIGTPVGCPHCHQSGYQGRMAIHEMMVVTPELRAAIHENVDEQALERLVRQQHKALIKNGLQKVISGDTSWDE
VMRVASATLESEA
>Q00512 7.4.2.8~~~xcpR~~~Type II secretion system protein E~~~
MMTAPLPDIAAPAAPPRRLPFSFAKRQGLLFLCLEEQYWLACRPQVELAAIAEAQRFAGRRLPLKALGEDAFNQALAASY
QHDSSAAMQLAEDLGGSLDLAALADQVPETEDLMEQEDDAPIIRLINAILGEAIRENASDIHLETFEKRLVVRFRVDGVL
REVLEPKRELAALLVSRIKVMARLDIAEKRIPQDGRISLRVGGREVDIRVSTLPSANGERVVLRLLDKQAGRLNLQHLGM
SERDRKLMDETVRKPHGILLVTGPTGSGKTTTLYASLTTLNDRTRNILTVEDPIEYHLEGIGQTQVNAKVDMTFARGLRA
ILRQDPDVVMVGEIRDRETAEIAVQASLTGHLVLSTLHTNSAIGAITRLVDMGIEPFLLSSSMLGVLAQRLVRVLCPACK
EPYRADEAECALLGVDPAAPPTLHRARGCGECHQHGYRGRTGIYELVVFDDHMRSLIHNESSEQEMTRHARTSGPSIRDD
GRRKVLEGVTTVEEVLRVTREE
>P37093 7.4.2.8~~~epsE~~~Type II secretion system ATPase E~~~COG2804
MTEMVISPAERQSIRRLPFSFANRFKLVLDWNEDFSQASIYYLAPLSMEALVETKRVVKHAFQLIELSQAEFESKLTQVY
QRDSSEARQLMEDIGADSDDFFSLAEELPQNEDLLESEDDAPIIKLINAMLGEAIKEGASDIHIETFEKTLSIRFRVDGV
LREVLAPSRKLSSLLVSRVKVMAKLDIAEKRVPQDGRISLRIGGRAVDVRVSTMPSSHGERVVMRLLDKNATRLDLHSLG
MTAHNHDNFRRLIKRPHGIILVTGPTGSGKSTTLYAGLQELNSSERNILTVEDPIEFDIDGIGQTQVNPRVDMTFARGLR
AILRQDPDVVMVGEIRDLETAQIAVQASLTGHLVMSTLHTNTAVGAVTRLRDMGIEPFLISSSLLGVLAQRLVRTLCPDC
KEPYEADKEQRKLFDSKKKEPLILYRATGCPKCNHKGYRGRTGIHELLLVDDALQELIHSEAGEQAMEKHIRATTPSIRD
DGLDKVRQGITSLEEVMRVTKES
>P31742 7.4.2.8~~~xpsE~~~Type II secretion system protein E~~~COG2804
MEQRSAETRIVEALLERRRLKDTDLVRARQLQAESGMGLLALLGRLGLVSERDHAETCAEVLGLPLVDARQLGDTPPEML
PEVQGLSLRFLKQFHLCPVGERDGRLDLWIADPYDDYAIDAVRLATGLPLLLQVGLRSEIDDLIERWYGQGRSAMGTIVE
TADGDASSTDDIEALRDLASEAPVIRLVNLVIQHAVELRASDIHIEPFESRLKVRYRVDGVLVEGESPPAKLTAAVISRI
KIMAKLNIAERRLPQDGRIMLRVQGKELDLRVSTVPTAHGESVVMRLLDRETVVFDFYKLGFTEDFLPQFRKVLEQPHGI
MLVTGPTGSGKTTTLYTALSQLNTSDVKIITVEDPVEYQIEGINQIQAKPQIGLDFANALRSIVRQDPDIIMIGEMRDLE
TARIAIQSALTGHLVLSTLHTNNAAGGITRLLDMGVEDYLLTSTINGILAQRLVRKLDLANAERYAASPEEIERFDLRRL
QPDGEIFLYRPRATAAAPTGYLGRTTIVEFLVMNDELRRAVMRRAGMGEIEQLARKSGMRTMYEDGLSKALRGETTIEEV
LRVTEDA
>P31704 ~~~outF~~~Type II secretion system protein F~~~
MALFQYQALNAQGKKSQGMQEADSARHARQLLREKGLVPVKIEEQRGEAAPRSGFSLSFGRSHRIASDLALLTRQLATLV
AALPLEEALDAVAKQSEKPKLSALMAAVRAKVVEGHSLAEAMGNFPGSFERLYCAMVAAGEASGHLDAVLNRLADYTEQR
HEMRSRIQQAMIYPCVLTLVAISVVSILLSAVVPKVVEQFIHMKQALPLSTRLLMSASDAVRTYGPWVVLLLVLAIMGFR
VLLRQEKHRLVFHRRLLFLPVVGRVARGLNTARYARTLSILNSSAVPLLQAMRISGDVLTNDYARFRLGQATDAVREGVT
LHKALEQTALFPPMMRHMIASERRRARRHVNPRGDNQDREFSAQMTLVLGLFEPLLVVSMAGIVLFIVLAILQPILQLNT
LMSM
>P41441 ~~~gspF~~~Putative type II secretion system protein F~~~COG1459
MNYRYRAMTQDGQKLQGIIDANDERQARLRLREEGLFLLDIRPQKSSGVKTRRPRISHSELTLFTRQLATLSAAALPLEE
SLAVIGQQSSNKRLGDVLNQVRSAILEGHPLSDALQHFPTLFDSLYRTLVKAGEKSGLLAPVLEKLADYNENRQKIRSKL
IQSLIYPCMLTTVAIGVVIILLTAVVPKITEQFVHMKQQLPLSTRILLGLSDTLQRTGPTLLATVFIVAVGFWLWLKRGN
NRHRFHAMLLRVALIGPLICAINSARYLRTLSILQSSGVPLLDGMNLSTESLNNLEIRQRLANAAENVRQGNSIHLSLEQ
TAIFPPMMLYMVASGEKSGQLGTLMVRAADNQETLQQNRIALTLSIFEPALIITMALIVLFIVVSVLQPLLQLNSMIN
>Q00513 ~~~xcpS~~~Type II secretion system protein F~~~
MAAFEYLALDPSGRQQKGVLEADSARQVRQLLRERQLAPLDVKPTRTREQSGQGGRLTFARGLSARDLALVTRQLATLVQ
AALPIEEALRAAAAQSTSQRIQSMLLAVRAKVLEGHSLAGSLREFPTAFPELYRATVAAGEHAGHLGPVLEQLADYTEQR
QQSRQKIQLALLYPVILMVASLAIVGFLLGYVVPDVVRVFIDSGQTLPLLTRVLIGVSDWVKAWGALAFVAAIGGVIGFR
YALRKDAFRERWHGFLLRVPLVGRLVRSTDTARFASTLAILTRSGVPLVEALAIAAEVIANRIIRNEVVKAAQKVREGAS
LTRSLEATGQFPPMMLHMIASGERSGELDQMLARTARNQENDLAAQIGLMVGLFEPFMLIFMGAVVLVIVLAILLPILSL
NQLVG
>P45780 ~~~epsF~~~Type II secretion system protein F~~~COG1459
MAAFEYKALDAKGRHKKGVIEGDNARQVRQRLKEQSLVPMEVVETQVKAARSRSQGFAFKRGISTPDLALITRQLATLVQ
SGMPLEECLRAVAEQSEKPRIRTMLVAVRAKVTEGYTLSDSLGDYPHVFDELFRSMVAAGEKSGHLDSVLERLADYAENR
QKMRSKLQQAMIYPVVLVVFAVGIVAFLLAAVVPKIVGQFVQMGQALPASTQFLLDASDFLQHWGISLLVGLLMLIYLVR
WLLTKPDIRLRWDRRVISLPVIGKIARGLNTARFARTLSICTSSAIPILDGMRVAVDVMTNQFVKQQVLAAAENVREGSS
LRKALEQTKLFPPMMLHMIASGEQSGELEGMLTRAADNQDNSFESTVNIALGIFTPALIALMAGMVLFIVMATLMPILEM
NNLMSR
>P41442 ~~~gspG~~~Type II secretion system core protein G~~~COG2165
MRATDKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENALDMYKLDNHHYPTTNQGLESLVEAP
TLPPLAANYNKEGYIKRLPADPWGNDYVLVNPGEHGAYDLLSAGPDGEMGTEDDITNWGLSKKKK
>A0A0H3HDD6 ~~~~~~Type II secretion system core protein G~~~
MRRQSQRGFTLLEIMVVIVIMGILASLVVPNLMGNKDKADRQKVVSDIVALESALDMYKLDNSRYPTTEQGLQALITKPS
VPPEARYYPQDGYIRRLPQDPWGGDYQLVSPGQHGQIDIFSSGQDGVPGTDDDIGNWTLSKK
>P15746 ~~~pulG~~~Type II secretion system core protein G~~~
MQRQRGFTLLEIMVVIVILGVLASLVVPNLMGNKEKADRQKVVSDLVALEGALDMYKLDNSRYPTTEQGLQALVSAPSAE
PHARNYPEGGYIRRLPQDPWGSDYQLLSPGQHGQVDIFSLGPDGVPESNDDIGNWTIGKK
>Q00514 ~~~xcpT~~~Type II secretion system core protein G~~~
MQRRQQSGFTLIEIMVVVVILGILAALVVPQVMSRPDQAKVTVAKGDIKAIAAALDMYKLDNFAYPSTQQGLEALVKKPT
GNPQPKNWNKDGYLKKLPVDPWGNPYQYLAPGTKGPFDLYSLGADGKEGGSDNDADIGNWDN
>P45773 ~~~epsG~~~Type II secretion system core protein G~~~COG2165
MKKMRKQTGFTLLEVMVVVVILGILASFVVPNLLGNKEKADQQKAVTDIVALENALDMYKLDNSVYPTTDQGLEALVTKP
TNPEPRNYREGGYIKRLPKDPWGNDYQYLSPGDKGTIDVFTLGADGQEGGEGTGADIGNWNIQDFQ
>P31734 ~~~xpsG~~~Type II secretion system core protein G~~~COG2165
MIKRSITRSPSRAGQAGMSLLEIIIVIVLIGAVLTLVGSRVLGGADRGKANLAKSQIQTLAGKIENFQLDTGKLPSKLDD
LVTQPGGSSGWLGPYAKPVELNDPWGHTIEYRVPGDGQAFDLISLGKDGRPGGSSYDSDIKYQ
>P41443 ~~~gspH~~~Type II secretion system protein H~~~COG2165
MNQQRGFTLLEMMLVLALVAITASVVLFTYGREDVASTRARETAARFTAALELAIDRATLSGQPVGIHFSDSAWRIMVPG
KTPSAWRWVPLQEDAADESQNDWDEELSIHLQPFKPDDSNQPQVVILADGQITPFSLLMANAGTGEPLLTLVCSGSWPLD
QTLARDTRP
>A0A0H3H546 ~~~pulH~~~Type II secretion system protein H~~~
MRQRGFTVLEMMLVVLLMGSAASLVIMSFPAMQQDTAERQLQRFQAQLEFAMDSGMQNDRLLGIQIRPNGWQFQVLQSQA
AETRSSVAHSDRWQGYVWQIWQPRQAALGGQVPDNQPLTLRLPPPQEWPPTAEPAADPDILLLPGGEITPFTLIFGEKDD
RSEVWLRVDESGAIATSAKGGAP
>Q00515 ~~~xcpU~~~Type II secretion system protein H~~~
MRASRGFTLIELMVVMVIISVLIGLAVLSTGFASTSRELDSEAERLAGLIGVLTDEAVLDNREYGLRLERDAYQVLRYDE
AKARWLPVARDSHRLPEWAELTFELDGQPLVLAGSKGEKEQKKGTDQPQLLILSSGELSPFRLRLAERGPEGRALSLSSD
GFRLPRVEVARR
>P45774 ~~~epsH~~~Type II secretion system protein H~~~COG2165
MTATRGFTLLEILLVLVLVSASAVAVIATFPVSVKDEAKISAQSFYQRLLLLNEEAILSGQDFGVRIDVDTRRLTFLQLT
ADKGWQKWQNDKMTNQTTLKEGLQLDFELGGGAWQKDDRLFNPGSLFDEEMFADEKKEQKQEPAPQLFVLSSGEVTPFTL
SIFPKGQEPDEQWRVTAQENGTLRLLAPGESDEE
>P31736 ~~~xpsH~~~Type II secretion system protein H~~~COG4970
MRVARLPLLHPHRAAPVVRRQLRGSSLLEMLLVIALIALAGVLAAAALTGGIDGMRLRSAGKAIAAQLRYTRTQAIATGT
PQRFLIDPQQRRWEAPGGHHGDLPAALEVRFTGARQVQSRQDQGAIQFFADGASTGGRIDLTIKDARWRVDVGWITGEVR
SGPLRTPAP
>P45760 ~~~gspI~~~Putative type II secretion system protein I~~~COG2165
MNKQSGMTLLEVLLAMSIFTAVALTLMSSMQGQRNAIERMRNETLALWIADNQLQSQDSFGEENTSSSGKELINGEEWNW
RSDIHSSKDGTLLERTITVTLPSGQTTSLTRYQSIDNKSGQAQDD
>Q8VPC3 ~~~gspI~~~Type II secretion system protein I~~~
MKRGFTLLEVMLALAIFALSATAVLQIASGALSNQHVLEEKTVAGWVAENQTALLYLMTRGQRAVRQQGESDMAGSRWYW
RTTPLSTGNALLQAVDIEVSLHEDFSSVIQSRRAWFSAVGGQQ
>A0A0H3HA88 ~~~pulI~~~Type II secretion system protein I~~~
MNKQKGMTLLEVLVALAIFSLAGLTLLQTTAQQARNAGMMKEKMLASWLADNQQVRLHLNKLWPEKSATGALVTYAGEEW
YLSWQGVDTEFSQLRALDIEVRRHKQDTAAIFSLRSYVVHE
>Q00516 ~~~xcpV~~~Type II secretion system protein I~~~
MKRARGFTLLEVLVALAIFAMVAASVLSASARSLQNASRLEDKTLAMWIADNRLNELQLEQTPPSSGRNQGELEFAGRRW
EWRTQVDSTAEQDMRRVIVWVAAKPLGRERGSIEERAAARLVGFLGSQP
>P45761 ~~~gspJ~~~Type II secretion system protein J~~~COG4795
MINRQQGFTLLEVMAALAIFSMLSVLAFMIFSQASELHQRSQKEIQQFNQLQRTITILDNDLLQLVARRNRSTDKIMVLG
EEAIFTTQSRDPLAPLSEAQTLLTVHWYLRNHTLYRAVRTSVDGRKDQPAQAMLEHVESFLLESNSGESQELPLSVTLHL
QTQQYGGLQRRFALPEQLAREESPAQTQAGNNNHE
>A0A4C3GMC1 ~~~gspJ~~~Type II secretion system protein J~~~
MKRTRAGFTLLEMLVAIAIFASLALMAQQVTNGVTRVNSAVADHDQKLNLMQQTMSFLTHDLTQMMPRPVRGDQGQREPA
LLAGAGVLASESEGMRFVRGGVVNPLMRLPRSNLLTVGYRIHDGYLERLAWPLTDAAGSVKPTMQKLIPADSLRLQFYDG
TRWQESWSSVQAIPVAVRMTLHSPQWGEIERIWLLRGPQ
>A0A0H3H7Y9 ~~~pulJ~~~Type II secretion system protein J~~~
MSRCRERGFTLLEMLLALAIFAALSLSAFQILQGVMRNDEMAQRQVQRLTELQRAFVYLEGDFGQIIPRPPRGDERLFYA
ARYQRQSADWSISFMRNGWQNPMGILPRSELQRVGYRLRHQQLERLSYVHTDPQAGEEPIVKVLLKDVSAFRLRFFANGM
WRDSWNDTTRLPEGIEVSLVVADVGEVSRLFFVTTGEQA
>Q00517 ~~~xcpW~~~Type II secretion system protein J~~~
MRLQRGFTLLELLIAIAIFALLALATYRMFDSVMQTDQATRVQEQRMRELVRAMGALERDLTQAVERPVRDELGDNRGAF
LSEGENDQIVEFTRGGWRNPLGQARSRLQRVRWSLSGETLERRYWLVLDRAQDSKPRVQQVLDGVTALSWRFLDKEHNWQ
GHWPTDEGSEEERLESLPLAVEMTLEHRHYGKLVRVWRLLDPPLKQDQPQGQPGGENGENGEGGVPQPPEGMPGAPE
>P45762 ~~~gspK~~~Putative type II secretion system protein K~~~COG3156
MNNEQRGVALLIVLMLLALMAALAADMTLSFHSQLQRTRQVNHHLQRQYDIELAEKLALASLTQDVKDNDRQTTLQQYWA
QPQQLQLEDGNTVKWQLRDAQHCFNLNALAKISDDPLASPDFPAQVFSALLINAGIDRGNTDEIVQSIADYIDVDDSPRF
HGAEDSFYQSQTPPRHSANQMLFLTGELRQIKGITENIYQRLIPYVCVLPTTELSINLNMLTENDIPLFRALFLNNITDA
DARVLLQKRPREGWLTTDAFLYWAQQDFSGVKPLVAQVKRHLFPYSRYFTLSTESISDEQSQGWQSHIFFNRKQQSAQIY
RRTLQLY
>Q00518 ~~~xcpX~~~Type II secretion system protein K~~~
MRRGQNGVALITVLLVVAVVTIVCAGLIIRQQLAIRSSANQLHVRQAWHYALGGERLAEAVLRRDLRQGGENTREPVDHL
GEAWARPMTPFKLDDGGELRVRIEDPSGRFNLNGLVRKRKVKPDSVKQFRRLLATLGMKEEIVQGLPDRLADWLDADQNP
QGEQGAEDNQYLLEAPAYRAANRSFKDVSELRLLKLSEADYRRLLPFVSALPEDAPLNVNTASVPVLAAMFEIDPGQAEN
IVDARGREGFQSKDDFTKHLTQLGSKTGNVSYAVGTRYFQVISEVSLGDRRQVLVSTLQRGKDGKIRVMARDMGQGGLPI
PSTGGDDWKKDER
>Q9KUA9 2.7.1.8~~~gspK~~~Glucosamine kinase GspK~~~COG2971
MNYYVGIDGGGTSCRARIRNQQGEWVGEAKSGSANIMLGVEVALRSVVDAITQAAEQGGLSPDDFPSMHVGLALAGAEQK
EAWHAFMQQAHPFASITLNTDAYGACLGAHLGEEGAIMIAGTGSCGILLKGGKQYVVGGREFPISDQGSGAVMGLRLIQQ
VLLAQDGIRPHTPLCDVVMNHFNHDIDSIVAWSKTALPRDYGQFSPQIFSHAYCGDPLAIELLKQTAADIEMFLIALHHK
GAERICLMGSIAERIQDWLSPPVQQWIVKPQSDAIEGALMFAGKPEHNLYKDGL
>P31707 ~~~outL~~~Type II secretion system protein L~~~
MSKAENTSGKQQLILRLSADTSDSLEWLIWSVSRHQTLTTGSGTLESLQAVLADYPVISARVLVPSTDVTFHTLSLPRQS
RRQLLQAIPFMLEEQVASDIDQLHFAVMDMHGDNATVAVVQKSRLRAWLNQCETLGVPVETVVPDVMALPRADSAWSAIS
HRNLWLFRLDSGIGMAAEENWYQSLLAFQPLPAVHCYSPVPASALTWQPQPVTDLLTLAAQVNLSMSMDLRQGEYAPVKP
WKQALLPWRNVLIALSAWLLLVLGESVWTHYQWYRQADYWRQESVRVYRKLFPDEKQVVNPRAQMQRHLQEVRAGVSGFA
LTEQMNRLQQLVAQNEGVSLQSLSYDRSRDELRLSLRATSYAQMEQFRQQAQAYFQIPPGEMKQEKDHVEGQLTLRSQP
>P45763 ~~~gspL~~~Putative type II secretion system protein L~~~COG3297
MPESLMVIRSSSTLRKHWEWMTFSADSVSSVHTLTDDLPLESLADQPGAGNVHLLIPPEGLLYRSLTLPNAKYKLTAQTL
QWLAEETLPDNTQDWHWTVVDKQNESVEVIGIQSEKLSRYLERLHTAGLNVTRVLPDGCYLPWEVDSWTLVNQQTSWLIR
SAAHAFNELDEHWLQHLAAQFPPENMLCYGVVPHGVAAANPLIQHPEIPSLSLYSADIAFQRYDMLHGIFRKQKTVSKSG
KWLARLAVSCLVLAILSFVGSRSIALWHTLKIEDQLQQQQQETWQRYFPQIKRTHNFRFYFKQQLAQQYPEAVPLLYHLQ
TLLLEHPELQLMEANYSQKQKSLTLKMSAKSEANIDRFCELTQSWLPMEKTEKDPVSGVWTVRNSGK
>P25060 ~~~xcpY~~~Type II secretion system protein L~~~
MSGVSALFLPPASTAGADGELAVWWVQDGECRRAPFAQALAEIRAPWRLYLPVEAVTACAVNLPTQKARWLRQSLPFAVE
EQLADDVEQMHLALGPALADGRHRVFAVQRTWLAAWLALAEGAGKAPASLHVDADCLPGEGSCLFWLEERWLLGGSGAVR
LACGSEDWPVLRDSCPPPQRAFAAQEVAPLEGVEVQALAGNPHVWLSEQPLGTDLAQAEFAARQQSSQWRRWRPLLGLVG
LWLVLQWGFTLVQAWQLQREGDRYAAQSAELYRQLFPEDRKLINLRAQFDQHLADSASSGGEGQLLGLLGQAATVIGGEP
TVSVEQLDFSAARGDVALQVRAPGFDVLERLRSRLSESGLAVQLGSASRDGSTVSARLVIGG
>P45782 ~~~epsL~~~Type II secretion system protein L~~~COG3297
MEGSVSEFLTVRLSSQKEADIPWLVWSAEQQEVIASGQVAGWEALHEIESYADQRSVVVLLAASDLILTSVEIPPGASRQ
LENMLPYLLEDEIAQDVEDVHFCVLSKGRETADVVGVDRLWLRACLDHLKACGFDVKRVLPDVLAIPRPEHGLAALQLGD
EWLVRKSTTQGMAVDAQWLSLLAASDWVQNEGEYLPLQALTPLPELSLAETQEWRYEPSGLVMQLLTQEALTSKFNLLTG
SFKLKSSWLRYWQIWRKVAIAAGLFVAVSISYSLFQAHQYEAQADAYRAESERIFRSIFPDKQKIPTVTYLKRQMSDEMA
RLSGGASVGSVLKWLSPLPEALKGVNLQLQSIKFDSNRSEIRLEATSRDFQSFEQARTQLEQYFAVEQGQLNKNGEQVFG
VFVVKPK
>Q47422 ~~~outM~~~Type II secretion system protein M~~~
MNELRRRWQVMSQRERLMALACGGLVVLCLLYYLIWAPWQESVRQWQMTVERERQTVRWMQQQPPRFRRRKVRGGRXPVA
ISANGIGAQSAVRYGITVLRMQPQESQVSVTLARSDFNNLLHWLAELEQKNGVITQGIDVTAVPNSPGIVEVTRLSLERV
L
>P36678 ~~~gspM~~~Putative type II secretion system protein M~~~COG3149
MIKSWWAEKSTSEKQIVAALAVLSLGVFCWLGVIKPIDTYIAEHQSHAQKIKKDIKWMQDQASTHGLLGHPALTQPIKNI
LLEEAKRENLAITLENGPDNTLTIHPVTAPLENVSRWLTTAQVTYGIVIEDLQFTLAGNEEITLRHLSFREQQ
>P25061 ~~~xcpZ~~~Type II secretion system protein M~~~
MKVMTQFHERLRAQAETSQLAIRWRGLPARDRLALLWLGAFLLLVVLYLALWRPAERHLQSARQYFTEQRALHAYIQQQA
PNVRQADAAAPQAQIDPAALQGMVTASAAQAGLSVERLDNEGEGAVQVALQPAPFAKLLPWLEQLNGQGVQVAEAGLDRQ
VDGRVSARLSLRVE
>P41851 ~~~epsM~~~Type II secretion system protein M~~~COG3149
MKELLAPVQAWWRSVTPREQKMVMGMGALTVLAIAYWGIWQPLSERTAQAQARLQTEKQLLSWVSENANDIVTLRAQGGS
DAPSDQPLNQVITNSTRQFNIELIRVQPRGEMMQVWIQPLPFSQLVSWIAYLQERQGVSVDAIDIDRGKVNGVVEVKRLQ
LKRGG
>P31710 ~~~outN~~~Type II secretion system protein N~~~
MKLKSGIVTGVALVLAYGLFLASYAPARLLTAVPLPAGMVVAEAAGTLWQGSLQRFSWRTLTLDDVHWNITFSDFMPALD
IAFKNPEGIAGRGIIRGWQRAQFYQWQLSVPAGYLFSHMRFIVPIGAEGNVQLNLQEATVDRSGCQSLDANVTWPGARVK
TPLGGLVLATPQATLRCQQGALEANLRQTSSHLQLSGKGSVTPKGEYRFTGQLSSGNDLPATMKKLLATTGKANEQGART
LNFQGRLL
>Q51575 ~~~xcpP~~~Type II secretion system protein N~~~
MIPRRSSDITIKTRSDVLPFSGASSRWLQRYAPALLAVALIIAMSISLAWQAAGWLRLQRSPVAVAASPVSHESIRSDPT
RLARLFGTSAQDPNAPPPATNLDLVLKGSFVQSDPKLSSAIIQRQGDKPHRYAVGGEISDGVKLHAVYRDRVELQRGGRL
ESLPFPHRSGGLLASADDITSENDSIEQLQSLQDENAAALRERLDALRQQMEATPIAEPAEEDSSEPTTTPTESD
>B3EWI1 1.11.1.17~~~garA~~~Glutathione amide-dependent peroxidase~~~
MLQDRTGSRVPQVTFHTRSGHEWVDLTTDEIFAGKTVVVFSLPGAFTPTCSSSHVPRYNQLVPMFKEHGVDTVACVSVSV
NDTFVMNEWQKTQHADDLLFIPDGNGEFTEGMGMLVEKDDLGFGKRSWRYSMLVRDGVVEKMFIEPEVEGDPYEVSDADT
MLAHLAPNAPKPMDVSVFTRDGCPFCVMAKEALRNAGIDFEELVLNEDYTEQTLRAVANAVPQVEVNGELIGGSEAVEGW
LKERASA
>P0AES0 ~~~gss~~~Bifunctional glutathionylspermidine synthetase/amidase~~~COG0754
MSKGTTSQDAPFGTLLGYAPGGVAIYSSDYSSLDPQEYEDDAVFRSYIDDEYMGHKWQCVEFARRFLFLNYGVVFTDVGM
AWEIFSLRFLREVVNDNILPLQAFPNGSPRAPVAGALLIWDKGGEFKDTGHVAIITQLHGNKVRIAEQNVIHSPLPQGQQ
WTRELEMVVENGCYTLKDTFDDTTILGWMIQTEDTEYSLPQPEIAGELLKISGARLENKGQFDGKWLDEKDPLQNAYVQA
NGQVINQDPYHYYTITESAEQELIKATNELHLMYLHATDKVLKDDNLLALFDIPKILWPRLRLSWQRRRHHMITGRMDFC
MDERGLKVYEYNADSASCHTEAGLILERWAEQGYKGNGFNPAEGLINELAGAWKHSRARPFVHIMQDKDIEENYHAQFME
QALHQAGFETRILRGLDELGWDAAGQLIDGEGRLVNCVWKTWAWETAFDQIREVSDREFAAVPIRTGHPQNEVRLIDVLL
RPEVLVFEPLWTVIPGNKAILPILWSLFPHHRYLLDTDFTVNDELVKTGYAVKPIAGRCGSNIDLVSHHEEVLDKTSGKF
AEQKNIYQQLWCLPKVDGKYIQVCTFTVGGNYGGTCLRGDESLVIKKESDIEPLIVVKK
>P0A9D2 2.5.1.18~~~gstA~~~Glutathione S-transferase GstA~~~COG0625
MKLFYKPGACSLASHITLRESGKDFTLVSVDLMKKRLENGDDYFAVNPKGQVPALLLDDGTLLTEGVAIMQYLADSVPDR
QLLAPVNSISRYKTIEWLNYIATELHKGFTPLFRPDTPEEYKPTVRAQLEKKLQYVNEALKDEHWICGQRFTIADAYLFT
VLRWAYAVKLNLEGLEHIAAFMQRMAERPEVQDALSAEGLK
>P0ACA7 2.5.1.18~~~gstB~~~Glutathione S-transferase GstB~~~COG0625
MITLWGRNNSTNVKKVLLTLEELELPYEQILAGREFGINHDADFLAMNPNGLVPLLRDDESDLILWESNAIVRYLAAQYG
QKRLWIDSPARRAEAEKWMDWANQTLSNAHRGILMGLVRTPPEERDQAAIDASCKECDALFALLDAELAKVKWFSGDEFG
VGDIAIAPFIYNLFNVGLTWTPRPNLQRWYQQLTERPAVRKVVMIPVS
>P82998 2.5.1.18~~~~~~Glutathione S-transferase~~~COG0625
MLKLHGFSVSNYYNMVKLALLEKGLPFEEVTFYGGQAPQALEVSPRGKVPVLETEHGFLSETSVILDYIEQTQSGKALLP
ADPFEQAKVRELLKEIELYIELPARTCYAESFFGMSVEPLIKEKARADLLAGFATLKRNGRFAPYVAGEQLTLADLMFCF
SVDLANAVGKKVLSIDFLADFPQAKALLQLMGENPHMARIMADKEASMPAFMEMIRSGKR
>P81065 2.5.1.18~~~gst~~~Glutathione S-transferase~~~
MKLYYKVGACSLAPHIILSEAGLPYELEAVDLKAKKTADGGDYFAVNPRGAVPALEVKPGTVITQNAAILQYIGDHSDVA
AFKPAYGSIERARLQEALGFCSDLHAAFSGLFAPNLSEEARAGVIANINRRLGQLEAMLSDKNAYWLGDDFTQPDAYASV
IIGWGVGQKLDLSAYPKALKLRERVLARPNVQKAFKEEGLN
>P15214 2.5.1.18~~~gstB~~~Glutathione S-transferase GST-6.0~~~
MKLYYTPGSCSLSPHIVLRETGLDFSIERIDLRTKKTESGKDFLAINPKGQVPVLQLDNGDILTEGVAIVQYLADLKPDR
NLIAPPKALERYHQIEWLNFLASEVHKGYSPLFSSDTPESYLPVVKNKLKSKFVYINDVLSKQKCVCGDHFTVADAYLFT
LSQWAPHVALDLTDLSHLQDYLARIAQRPNVHSALVTEGLIKE
>P45875 2.5.1.18~~~gst~~~Glutathione S-transferase GST-4.5~~~COG0625
MKLYTKPGACSLADHIVLRWSCLPFELTVVDAATMKSPDYLRLNPAGAVPLLVVDQWALTQNAAILNYIADTAPLTGLGG
DGTARSRAEINRWIAFVNADLHPTFKPLFGSTAYLQEDALIQRSHEDARTKLRTLYTRVDAHLQGRNWLAGDTHTGADAY
LFVTLRWAHKAGVDLSGLSALDAFFQRMLADADVQAALQAEGLN
>Q05852 2.7.7.9~~~gtaB~~~UTP--glucose-1-phosphate uridylyltransferase~~~COG1210
MKKVRKAIIPAAGLGTRFLPATKAMPKEMLPIVDKPTIQYIIEEAVEAGIEDIIIVTGKSKRAIEDHFDYSPELERNLEE
KGKTELLEKVKKASNLADIHYIRQKEPKGLGHAVWCARNFIGDEPFAVLLGDDIVQAETPGLRQLMDEYEKTLSSIIGVQ
QVPEEETHRYGIIDPLTSEGRRYQVKNFVEKPPKGTAPSNLAILGRYVFTPEIFMYLEEQQVGAGGEIQLTDAIQKLNEI
QRVFAYDFEGKRYDVGEKLGFITTTLEFAMQDKELRDQLVPFMEGLLNKEEI
>Q2G1T6 2.7.7.9~~~gtaB~~~UTP--glucose-1-phosphate uridylyltransferase~~~COG1210
MKKIKKAIIPAAGLGTRFLPATKAMPKEMLPILDKPTIQYIVEEAARAGIEDIIIVTGRHKRAIEDHFDSQKELEMVLKE
KGKSELLEKVQYSTELANIFYVRQKEQKGLGHAISSARQFIGNEPFAVLLGDDIVESEVPAVKQLIDVYEETGHSVIGVQ
EVPEADTHRYGIIDPLTKNGRQYEVKKFVEKPAQGTAPSNLAIMGRYVLTPEIFDYLKTQKEGAGNEIQLTDAIERMNND
NQVYAYDFEGERYDVGEKLGFVKTTIEYALKDDSMREELTRFIKALGL
>Q7A3J9 2.7.7.9~~~gtaB~~~UTP--glucose-1-phosphate uridylyltransferase~~~
MKKIKKAIIPAAGLGTRFLPATKAMPKEMLPILDKPTIQYIVEEAARAGIEDIIIVTGRHKRAIEDHFDSQKELEMVLKE
KGKSELLEKVQYSTELANIFYVRQKEQKGLGHAISSARQFIGNEPFAVLLGDDIVESEVPAVKQLIDVYEETGHSVIGVQ
EVPEADTHRYGIIDPLTKNGRQYEVKKFVEKPAQGTAPSNLAIMGRYVLTPEIFDYLKTQKEGAGNEIQLTDAIERMNND
NQVYAYDFEGERYDVGEKLGFVKTTIEYALKDDSMREELTRFIKALGL
>F8KEJ1 2.4.1.-~~~gtf3~~~N-acetylglucosaminyltransferase~~~
MEGLSLTVHITNLYGQSFQSTAQIAQNQIAKIGRELGFNELGIYNYNWPDEPSVALDTRFDGIIASVSNNDTVIFQSPTW
NSIEWDQAFIDHLAPYNVKKIIFIHDIIPLMFESNRYLLPQFIDYYNKADLIIAPSQPMVDFLRANGLTVEKVVLQHMWD
HCASVDFTVTLQNTGVINFAGNLEKFQLVGHWHYPDNPLYAFAKVIDVEPTDNIKFMGWQSDPVLLSKLRHNGGFGLVWS
NEPYCKNYMHLNANHKLSTYLAAGLPVIVNENIAESETILRKGLGIVADNLDEAIEKVQGMDDQSYNEMVQRVDDFARLI
REGYFAKKALTEAVFKLYYQ
>A0A0M3KKZ0 2.4.1.-~~~gtf3~~~Glucosyltransferase 3~~~
MRTYITNLNGHSITSTAQIAQNMVTDIAVSLGFRELGIHSYPIDTDSPEEMSKRLDGICSGLRKNDIVIFQTPTWNTTTF
DEKLFHKLKIFGVKIVIFIHDVVPLMFDGNFYLMDRTIAYYNEADVLIAPSQAMVDKLQSYGLTVKKILVQGMWDHPTNI
TLQAVNHKKLVHFPGNPERFNFIKNWRIPTELHVYTDHNMQLPTTVVKEPYQSDEQLIMKMSEGGYGLVWMDDRDKQYQS
LYCPYKLGAYIAAGIPVIIQKGIANQDIIEKNNLGFIIEKIDDISNIVESTTEEEYMEIVSDVRRFNPLVRQGYFTRKLL
TDAVFSALNSM
>B5A7L9 2.4.1.-~~~gtf3~~~Glucosyltransferase 3~~~
MRVYITNINGQSIQSTAQLCQNTVTDVAVSLGYRELGIYCYQIHTDSESELSKRLDGIVAGLRHGDVVIFQTPTWNTTEF
DEKLMNKLKLYDIKIVLFIHDVVPLMFSGNFYLMDRTIAYYNKADVVVAPSQKMIDKLRDFGMNVSKTVVQGMWDHPTQA
PMFPAGLKREIHFPGNPERFSFVKEWKYDIPLKVYTWQNVELPQNVHKINYRPDEQLLMEMSQGGFGLVWMDDKDKEYQS
LYCSYKLGSFLAAGIPVIVQEGIANQELIENNGLGWIVKDVEEAIMKVKNVNEDEYIELVKNVRSFNPILRKGFFTRRLL
TESVFQAICD
>A0A0H2UR93 2.4.1.-~~~gtf3~~~Glucosyltransferase 3~~~COG0438
MKLHLTNLYGMAGDSTVILAQNAVQKIASQLGFREVGIYFYNIASDSPSEMNKRLDGIMASISIGDILVFQSPTWNGFEF
DRLLFDKLKDMQVKIICFIHDVVPLMFDSNYYLMKDYLYMYNLSDVLIVPSERMKTRLMEEGLTTKKILVQGMWDHPHDL
SLYTPAFKKELFFAGSLERFPDLQNWSQDTPLRVFSNKGEASSSARSLSIEGWKKDEELLLELSKGGFGLVWGTHQNEGE
SNQYYTLNISHKVSTYLTAGIPVIVPSSLSTAKFIVDQGLGFMADSLEEVHEIVDKMNLQEYQEMTNRIKTFSYLLKEGY
FTKKLLVDAIYHLGID
>P96558 2.4.1.311~~~gtfA~~~dTDP-epi-vancosaminyltransferase~~~
MRVLITGCGSRGDTEPLVALAARLRELGADARMCLPPDYVERCAEVGVPMVPVGRAVRAGAREPGELPPGAAEVVTEVVA
EWFDKVPAAIEGCDAVVTTGLLPAAVAVRSMAEKLGIPYRYTVLSPDHLPSEQSQAERDMYNQGADRLFGDAVNSHRASI
GLPPVEHLYDYGYTDQPWLAADPVLSPLRPTDLGTVQTGAWILPDERPLSAELEAFLAAGSTPVYVGFGSSSRPATADAA
KMAIKAVRASGRRIVLSRGWADLVLPDDGADCFVVGEVNLQELFGRVAAAIHHDSAGTTLLAMRAGIPQIVVRRVVDNVV
EQAYHADRVAELGVGVAVDGPVPTIDSLSAALDTALAPEIRARATTVADTIRADGTTVAAQLLFDAVSLEKPTVPA
>Q3S2Y2 2.4.1.-~~~gtfA~~~UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase GtfA subunit~~~
MVVYNLNRGIGWASSGVEYAQAYRSEVFRKLGVEAKFIFTDMFQNENIEHLTRNIGFEDNEIIWLYTFFTDLTIAATSYS
LQQLKESFSLPIDRTEKNGKIISFFFKGSSIVVTVMLNDESSNIVQRVEYLMGGKLVRKDYYSYTKMFSEYYAPEDIGPC
LYQRTFYNEDGSVAYEENVDGENSIFKFKETILYSKEELVGYMLEKLQLTNSDLILLDRSTGIGQAVLRNKGNAKVAVVV
HAEHYNVSATDETTILWNNYYDYQFSNADSIDAFITSTETQTKTLIDQFKKYLNIEPVVYTIPVGSLSKLQRKEWHERKA
FSLLTCSRLASEKHIDWLINAVVEANKVIPELTFDIYGEGGERQKLQEIIAKNKANNYIRLMGHKNLSSVYKDYQVYLSG
STSEGFGLTLMEAIGSGLPIIGLDVPYGNQTFIENNLNGYLIPRETPDNPQQISTAFAQYIVALFNSKDICKKHEYSYRI
ASRFLNDKIIENWSFFLRRLLNDYTI
>Q9AET5 2.4.1.-~~~gtfA~~~UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase GtfA subunit~~~
MTVYNINLGIGWASSGVEYAQAYRAQILRRIQQPAKFIFMDMILADNIQHLTENIGFLDEEIIWLYNYFTDIKIAPTTVT
LDQVLAQVAGQPERSEKEGKIVRYFYPQDDQFITCYLRQEDQDSVEHVEYVSRGRLIRKDYFSYVRYASEYFAPHNDAAT
LYQRRFYHEDGSVAYDMLIEDGQEKLYRFPDRIFYSKAELVRYFLQCLQLQADDVVILDRETGIGQVVFEESQKAKLGVV
VHAEHFSENASSDDYILWNNFYDYQFTNADKVDFFIVATEAQKRILEQQFQHYSDKQPQIATIPVGSLDQLTYPKEPRKP
YSMITASRLATEKHIDWLVAATVQAHAQLPELTLDIYGKGSEEDKLRRRIEEAGAQDYIRLKGHADLSQIYAGYELYLTA
STSEGFGLTLMEAVGSGLPLIGFDVRYGNQTFIDDGKNGYLLPVSSNHVEDQIIAAFVEKIIALFSQGRQQEMSQHSYQV
AENYLTSRVEAAWTQLLKEVRDDSAL
>A1C3L9 2.4.1.-~~~gtfA~~~UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase GtfA subunit~~~
MTIYNINLGIGWASSGVEYAQAYRAQILRSLGMPAKFIFTNMFQSENLEHFTKNIGFEDNEIIWLYGYFTDVKISGTTYK
KDDLEATFSQCPTKKEASSDRKLIRYYFENQELYINASLYGENQEYVQRVEYVVKGKLIRKDYYSYTKVFSEFYSPGENG
VQLCNRSFYNEDGSIAYEEILSNEKSTFVFSNKICYGLEELLEFMLEDLSLTKSDLILLDRATGIGQVVFENIGAAKLAV
VIHAEHFNEKNTDEHNILWNNYYEYQFTNADKVNAFITSTERQKILLEEQFTQYTSLHPKIVAIPVGSLDQLKFPEQSRK
SFSMMTGSRLAIEKHIDWLIEGVALAQKRLPELTFDIYGEGGERRKLTELLTKLHAGEFIELKGHKQLDEIYQNYELYLT
ASTSEGFGLTLMEAVGSGLPIIGFDVPYGNQTFVCSGENGLLIERPKGDDRSRIVQAFADSIYEYFTKFKMADAQQYSYN
IAENYKHEKLVERWKDFIEEMLND
>A0A0H2URG7 2.4.1.-~~~gtfA~~~UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase GtfA subunit~~~COG0438
MTIYNINLGIGWASSGVEYAQAYRAGVFRKLNLSSKFIFTDMILADNIQHLTANIGFDDNQVIWLYNHFTDIKIAPTSVT
VDDVLAYFGGEESHREKNGKVLRVFFFDQDKFVTCYLVDENKDLVQHAEYVFKGNLIRKDYFSYTRYCSEYFAPKDNVAV
LYQRTFYNEDGTPVYDILMNQGKEEVYHFKDKIFYGKQAFVRAFMKSLNLNKSDLVILDRETGIGQVVFEEAQTAHLAVV
VHAEHYSENATNEDYILWNNYYDYQFTNADKVDFFIVSTDRQNEVLQEQFAKYTQHQPKIVTIPVGSIDSLTDSSQGRKP
FSLITASRLAKEKHIDWLVKAVIEAHKELPELTFDIYGSGGEDSLLREIIANHQAEDYIQLKGHAELSQIYSQYEVYLTA
STSEGFGLTLMEAIGSGLPLIGFDVPYGNQTFIEDGQNGYLIPSSSDHVEDQIKQAYAAKICQLYQENRLEAMRAYSYQI
AEGFLTKEILEKWKKTVEEVLHD
>P96559 2.4.1.310~~~gtfB~~~Vancomycin aglycone glucosyltransferase~~~
MRVLLATCGSRGDTEPLVALAVRVRDLGADVRMCAPPDCAERLAEVGVPHVPVGPSARAPIQRAKPLTAEDVRRFTTEAI
ATQFDEIPAAAEGCAAVVTTGLLAAAIGVRSVAEKLGIPYFYAFHCPSYVPSPYYPPPPLGEPSTQDTIDIPAQWERNNQ
SAYQRYGGLLNSHRDAIGLPPVEDIFTFGYTDHPWVAADPVLAPLQPTDLDAVQTGAWILPDERPLSPELAAFLDAGPPP
VYLGFGSLGAPADAVRVAIDAIRAHGRRVILSRGWADLVLPDDGADCFAIGEVNHQVLFGRVAAVIHHGGAGTTHVAARA
GAPQILLPQMADQPYYAGRVAELGVGVAHDGPIPTFDSLSAALATALTPETHARATAVAGTIRTDGAAVAARLLLDAVSR
EKPTVSA
>Q3S2Y1 ~~~gtfB~~~UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase stabilizing protein GtfB~~~
MIILFDFFDKKSKDLYYSLITSGLHGNAVVINDDGFLPQNINSPYSFFCNMEGKNGNPLYFNQVPLPDLWEIKGNNIEAE
IWDFSIKRAKIFYQEPKYKRQVKNIDWFDNNKKVRYTDHYNRFGWCFARTHFDKNQNVTTKSYFDKDGKEVIVENFRTGV
IILNWLNKDYFFDNRVAFLNFYFSLMGWNLSRIWYNSLSTPFFVSYRMTYPGEDILFWQEDIEDTIPANMRVLLESTNTR
TQKVIVQKKNTYHKIKSMLPKEQQEKIGYLGFIYPNKKNNKGRKDIFILTNSDQIEHLEVLVHHLSDYHFHIAAYTEMSF
KLMSFSQEQNVTLYPNISRTDLDNLFEICDIYFDINHGNEVDDVIRRAFEYNHLIFAFDNTCHNRELVLDSNIISHTTCE
QLINLMKNLSGSIMYLLEQQREQTSNETKERYKEILGGYGNA
>Q79T00 ~~~gtfB~~~UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase stabilizing protein GtfB~~~
MIQLFDYYNQETQDLHDSLLAAGYACPTIVIEANGFLPDDMISPYTYFLGDEEGVDHPLFFNQVPVPPFWEITGDHQVAR
VSDMGEERARIHYASQARGRLVKQVDWLDKKGQLRLSERYNKQGRCFAKTAYKSGQEAFNTTYYSTDGQERIVENHVTGD
IILTLDQEPLRIFKSRVDFIRFFLERLDLDLDHILFNSLAYSFLVSHSLTGRAGQDILFWQEPLYDELPGNMQLILDNSQ
LRTQTIVIPDLATYEKAMSLAAADQQQKFLHLGYHYDFKRDNYLRKDALILTHSDQIEGLDTLVQSLPQLVFRIAALTEM
SPKLLSMLSYKNVVLYQNASLKQIEQLYLESDIYLDINHGGQVLQAVRKAFENNLLILGFEQTLHDRHYIAQQHIFDSSQ
PAQLASILEEALCGVEQMRSALQAQGRHANDVPVSLYQETLQSLLGGQHG
>P08987 2.4.1.5~~~gtfB~~~Glucosyltransferase-I~~~COG0366
MDKKVRYKLRKVKKRWVTVSVASAVMTLTTLSGGLVKADSNESKSQISNDSNTSVVTANEESNVTTEVTSKQEAASSQTN
HTVTTISSSTSVVNPKEVVSNPYTVGETASNGEKLQNQTTTVDKTSEAAANNISKQTTEADTDVIDDSNAANLQILEKLP
NVKEIDGKYYYYDNNGKVRTNFTLIADGKILHFDETGAYTDTSIDTVNKDIVTTRSNLYKKYNQVYDRSAQSFEHVDHYL
TAESWYRPKYILKDGKTWTQSTEKDFRPLLMTWWPSQETQRQYVNYMNAQLGINKTYDDTSNQLQLNIAAATIQAKIEAK
ITTLKNTDWLRQTISAFVKTQSAWNSDSEKPFDDHLQNGAVLYDNEGKLTPYANSNYRILNRTPTNQTGKKDPRYTADNT
IGGYEFLLANDVDNSNPVVQAEQLNWLHFLMNFGNIYANDPDANFDSIRVDAVDNVDADLLQIAGDYLKAAKGIHKNDKA
ANDHLSILEAWSDNDTPYLHDDGDNMINMDNKLRLSLLFSLAKPLNQRSGMNPLITNSLVNRTDDNAETAAVPSYSFIRA
HDSEVQDLIRDIIKAEINPNVVGYSFTMEEIKKAFEIYNKDLLATEKKYTHYNTALSYALLLTNKSSVPRVYYGDMFTDD
GQYMAHKTINYEAIETLLKARIKYVSGGQAMRNQQVGNSEIITSVRYGKGALKATDTGDRTTRTSGVAVIEGNNPSLRLK
ASDRVVVNMGAAHKNQAYRPLLLTTDNGIKAYHSDQEAAGLVRYTNDRGELIFTAADIKGYANPQVSGYLGVWVPVGAAA
DQDVRVAASTAPSTDGKSVHQNAALDSRVMFEGFSNFQAFATKKEEYTNVVIAKNVDKFAEWGVTDFEMAPQYVSSTDGS
FLDSVIQNGYAFTDRYDLGISKPNKYGTADDLVKAIKALHSKGIKVMADWVPDQMYAFPEKEVVTATRVDKFGKPVEGSQ
IKSVLYVADSKSSGKDQQAKYGGAFLEELQAKYPELFARKQISTGVPMDPSVKIKQWSAKYFNGTNILGRGAGYVLKDQA
TNTYFNISDNKEINFLPKTLLNQDSQVGFSYDGKGYVYYSTSGYQAKNTFISEGDKWYYFDNNGYMVTGAQSINGVNYYF
LSNGLQLRDAILKNEDGTYAYYGNDGRRYENGYYQFMSGVWRHFNNGEMSVGLTVIDGQVQYFDEMGYQAKGKFVTTADG
KIRYFDKQSGNMYRNRFIENEEGKWLYLGEDGAAVTGSQTINGQHLYFRANGVQVKGEFVTDRYGRISYYDSNSGDQIRN
RFVRNAQGQWFYFDNNGYAVTGARTINGQHLYFRANGVQVKGEFVTDRHGRISYYDGNSGDQIRNRFVRNAQGQWFYFDN
NGYAVTGARTINGQHLYFRANGVQVKGEFVTDRYGRISYYDSNSGDQIRNRFVRNAQGQWFYFDNNGYAVTGARTINGQH
LYFRANGVQVKGEFVTDRYGRISYYDANSGERVRIN
>A1C3M0 ~~~gtfB~~~UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase stabilizing protein GtfB~~~
MIRLFEWLTQESLDLHYSLEESGIHGTSIVLNDDGFLPEGIISPYTFFCEVEMDGSPLYFNQLEVPYLWQITGTNIEGEI
WNRSSKRGVIHYHEPKYLRFVQSVDWLYPDGSIYMTDHYNKYGWAFARTYFFSDQQVSHKKYYTKSGQEVLSENILTGDI
LLNWKGKVYHFTKKVDFFLFYFKKSGLDLSSIWYNSLGMPFLISYYLGGEGRDILFWQENLADQLPGNMQIIFSGRTSRT
KKVIVQDRSVYKKLLHLVEEKNKEMISFLNIIYPKLRENYSRKEILIVTNSDQIEGIETLTDNLSAYTFHIGALTSMSDK
LQNIGQKENVLLYPNMSPKTMLDLLEQCDIYLDINHGNEVLSIVRLAFERSLLILAYDNTVHSPIFHHESGIFNHSKPQT
LSDWLLNLDDYSQTVSCWRSDLFPMTYRDYKQVLVSNVD
>A0A0H2UR90 ~~~gtfB~~~UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase stabilizing protein GtfB~~~COG0438
MIELYDSYSQESRDLHESLGATGLSQLGVVIDADGFLPDGLLSPFTYYLGYEDGKPLYFNQVPVSDFWEILGDNQSACIE
DVTQERAVIHYADGMQARLVKQVDWKDLEGRVRQVDHYNRFGACFATTTYSADSEPIMTVYQDVNGQQVLLENHVTGDIL
LTLPGQSMRYFANKVEFITFFLQDLEIDTSQLIFNTLATPFLVSFHHPDKSGSDVLVWQEPLYDAIPGNMQLILESDNVR
TKKIIIPNKATYERALELTDEKYHDQFVHLGYHYQFKRDNFLRRDALILTNSDQIEQVEAIAGALPDVTFRIAAVTEMSS
KLLDMLCYPNVALYQNASPQKIQELYQLSDIYLDINHSNELLQAVRQAFEHNLLILGFNQTVHNRLYIAPDHLFESSEVA
ALVETIKLALSDVDQMRQALGKQGQHANYVDLVRYQETMQTVLGG
>P96560 2.4.1.-~~~gtfC~~~Glycosyltransferase GtfC~~~
MRVLLSTAGSRGDVEPLVALAVRLQGLGVEARMCASPASAERLAEVGVPHVPVGLQLEGMLLQEGMPPPSPEEERRLAAK
AIDMQFDEVPAAAEGCAAVVAAGELAAAAAVRSVAEMLGIPYFYAAYSPNYLPSPHHAPPEDERTTPGVTDNKVLWDERG
QRFAKRYGDTLNSRRASVGLPPVEDVFGYGYSERPWLATDPILAPLPPDFDAVQTGTWILPDERPLSAELEAFLAAGSPP
VYLGFGSASGPGIDDAARVAIEAIRAHGRRIVLLSGWADLVRPDDGADCFSVDEVNLQVLFSRAAAAIHHGSAGTEHLAT
LAGIPQIVIPRHTDQPYYAERVADLGIGVALEGPVPTFDAMSAAVATALAPETRARATAVAGTIRTDGAAVAARLLLDAV
SREKSAVLA
>P13470 2.4.1.5~~~gtfC~~~Glucosyltransferase-SI~~~COG0366
MEKKVRFKLRKVKKRWVTVSVASAVVTLTSLSGSLVKADSTDDRQQAVTESQASLVTTSEAAKETLTATDTSTATSATSQ
PTATVTDNVSTTNQSTNTTANTANFDVKPTTTSEQSKTDNSDKIIATSKAVNRLTATGKFVPANNNTAHSRTVTDKIVPI
KPKIGKLKQPSSLSQDDIAALGNVKNIRKVNGKYYYYKEDGTLQKNYALNINGKTFFFDETGALSNNTLPSKKGNITNND
NTNSFAQYNQVYSTDAANFEHVDHYLTAESWYRPKYILKDGKTWTQSTEKDFRPLLMTWWPDQETQRQYVNYMNAQLGIH
QTYNTATSPLQLNLAAQTIQTKIEEKITAEKNTNWLRQTISAFVKTQSAWNSDSEKPFDDHLQKGALLYSNNSKLTSQAN
SNYRILNRTPTNQTGKKDPRYTADRTIGGYEFLLANDVDNSNPVVQAEQLNWLHFLMNFGNIYANDPDANFDSIRVDAVD
NVDADLLQIAGDYLKAAKGIHKNDKAANDHLSILEAWSYNDTPYLHDDGDNMINMDNRLRLSLLYSLAKPLNQRSGMNPL
ITNSLVNRTDDNAETAAVPSYSFIRAHDSEVQDLIRNIIRAEINPNVVGYSFTMEEIKKAFEIYNKDLLATEKKYTHYNT
ALSYALLLTNKSSVPRVYYGDMFTDDGQYMAHKTINYEAIETLLKARIKYVSGGQAMRNQQVGNSEIITSVRYGKGALKA
TDTGDRTTRTSGVAVIEGNNPSLRLKASDRVVVNMGAAHKNQAYRPLLLTTDNGIKAYHSDQEAAGLVRYTNDRGELIFT
AADIKGYANPQVSGYLGVWVPVGAAADQDVRVAASTAPSTDGKSVHQNAALDSRVMFEGFSNFQAFATKKEEYTNVVIAK
NVDKFAEWGVTDFEMAPQYVSSTDGSFLDSVIQNGYAFTDRYDLGISKPNKYGTADDLVKAIKALHSKGIKVMADWVPDQ
MYALPEKEVVTATRVDKYGTPVAGSQIKNTLYVVDGKSSGKDQQAKYGGAFLEELQAKYPELFARKQISTGVPMDPSVKI
KQWSAKYFNGTNILGRGAGYVLKDQATNTYFSLVSDNTFLPKSLVNPNHGTSSSVTGLVFDGKGYVYYSTSGNQAKNAFI
SLGNNWYYFDNNGYMVTGAQSINGANYYFLSNGIQLRNAIYDNGNKVLSYYGNDGRRYENGYYLFGQQWRYFQNGIMAVG
LTRIHGAVQYFDASGFQAKGQFITTADGKLRYFDRDSGNQISNRFVRNSKGEWFLFDHNGVAVTGTVTFNGQRLYFKPNG
VQAKGEFIRDADGHLRYYDPNSGNEVRNRFVRNSKGEWFLFDHNGIAVTGTRVVNGQRLYFKSNGVQAKGELITERKGRI
KYYDPNSGNEVRNRYVRTSSGNWYYFGNDGYALIGWHVVEGRRVYFDENGVYRYASHDQRNHWDYDYRRDFGRGSSSAVR
FRHSRNGFFDNFFRF
>Q9AFC7 2.4.1.322~~~gtfD~~~Devancosaminyl-vancomycin vancosaminetransferase~~~COG1819
MRVLLSVCGTRGDVEIGVALADRLKALGVQTRMCAPPAAEERLAEVGVPHVPVGLPQHMMLQEGMPPPPPEEEQRLAAMT
VEMQFDAVPGAAEGCAAVVAVGDLAAATGVRSVAEKLGLPFFYSVPSPVYLASPHLPPAYDEPTTPGVTDIRVLWEERAA
RFADRYGPTLNRRRAEIGLPPVEDVFGYGHGERPCWPADPVLAPLQPDVDAVQTGAWLLSDERPLPPELEAFLAAGSPPV
HIGFGSSSGRGIADAAKVAVEAIRAQGRRVILSRGWTELVLPDDRDDCFAIDEVNFQALFRRVAAVIHHGSAGTEHVATR
AGVPQLVIPRNTDQPYFAGRVAALGIGVAHDGPTPTFESLSAALTTVLAPETRARAEAVAGMVLTDGAAAAADLVLAAVG
REKPAVPA
>P49331 2.4.1.5~~~gtfD~~~Glucosyltransferase-S~~~COG0366
METKRRYKMYKVKKHWVTIAVASGLITLGTTTLGSSVSAETEQQTSDKVVTQKSEDDKAASESSQTDAPKTKQAQTEQTQ
AQSQANVADTSTSITKETPSQNITTQANSDDKTVTNTKSEEAQTSEERTKQAEEAQATASSQALTQAKAELTKQRQTAAQ
ENKNPVDLAAIPNVKQIDGKYYYIGSDGQPKKNFALTVNNKVLYFDKNTGALTDTSQYQFKQGLTKLNNDYTPHNQIVNF
ENTSLETIDNYVTADSWYRPKDILKNGKTWTASSESDLRPLLMSWWPDKQTQIAYLNYMNQQGLGTGENYTADSSQESLN
LAAQTVQVKIETKISQTQQTQWLRDIINSFVKTQPNWNSQTESDTSAGEKDHLQGGALLYSNSDKTAYANSDYRLLNRTP
TSQTGKPKYFEDNSSGGYDFLLANDIDNSNPVVQAEQLNWLHYLMNYGSIVANDPEANFDGVRVDAVDNVNADLLQIASD
YLKAHYGVDKSEKNAINHLSILEAWSDNDPQYNKDTKGAQLPIDNKLRLSLLYALTRPLEKDASNKNEIRSGLEPVITNS
LNNRSAEGKNSERMANYIFIRAHDSEVQTVIAKIIKAQINPKTDGLTFTLDELKQAFKIYNEDMRQAKKKYTQSNIPTAY
ALMLSNKDSITRLYYGDMYSDDGQYMATKSPYYDAIDTLLKARIKYAAGGQDMKITYVEGDKSHMDWDYTGVLTSVRYGT
GANEATDQGSEATKTQGMAVITSNNPSLKLNQNDKVIVNMGTAHKNQEYRPLLLTTKDGLTSYTSDAAAKSLYRKTNDKG
ELVFDASDIQGYLNPQVSGYLAVWVPVGASDNQDVRVAASNKANATGQVYESSSALDSQLIYEGFSNFQDFVTKDSDYTN
KKIAQNVQLFKSWGVTSFEMAPQYVSSEDGSFLDSIIQNGYAFEDRYDLAMSKNNKYGSQQDMINAVKALHKSGIQVIAD
WVPDQIYNLPGKEVVTATRVNDYGEYRKDSEIKNTLYAANTKSNGKDYQAKYGGAFLSELAAKYPSIFNRTQISNGKKID
PSEKITAWKAKYFNGTNILGRGVGYVLKDNASDKYFELKGNQTYLPKQMTNKEASTGFVNDGNGMTFYSTSGYQAKNSFV
QDAKGNWYYFDNNGHMVYGLQHLNGEVQYFLSNGVQLRESFLENADGSKNYFGHLGNRYSNGYYSFDNDSKWRYFDASGV
MAVGLKTINGNTQYFDQDGYQVKGAWITGSDGKKRYFDDGSGNMAVNRFANDKNGDWYYLNSDGIALVGVQTINGKTYYF
GQDGKQIKGKIITDNGKLKYFLANSGELARNIFATDSQNNWYYFGSDGVAVTGSQTIAGKKLYFASDGKQVKGSFVTYNG
KVHYYHADSGELQVNRFEADKDGNWYYLDSNGEALTGSQRINGQRVFFTREGKQVKGDVAYDERGLLRYYDKNSGNMVYN
KVVTLANGRRIGIDRWGIARYY
>B3H2N1 2.4.1.-~~~~~~Alpha-1,6-glucosyltransferase~~~
MENNIDLNVYFCFVNRPCTGGDFVNLDHVRTLRKLGINASILLAGNQSEEIVNSFGSLPVVILNEEIEFSSQDIFIVPEV
MQVLYDLASKMTVFPRMIMHNQNPFYTGYGFLSAQHINEHRLERIIVPSSYTKYKLQEIGVTKPIDIIHPYIPDYFKPAE
KQREVIQIAFSRRKRSAEFDIFKFYFLSLYSHKHSVNFVNIQGLTREEVAKVMSEAAIFISFAERESLGLMTLEAMASGC
HVIGFSGYTDIYNNEVIDDSVGDWIGEGEYTLFAQKVCQAIDDFVNGKMNPKIENGLRLIEQRFRIRHFEQEVKRVYGNI
FDYDLENSRS
>O25613 3.5.4.16~~~~~~GTP cyclohydrolase 1 type 2~~~COG0327
MALVKEVLVVLNRLSPFELQESWDNSGLNVGSENSEFSEIVACLEITLKIALNAPQNALIITHHPLIFKPLKTLNDEIYP
GNILKILIQKNVSVISMHTNFDKTHLNKHFAHALLEFDGLVEKGLMLVKENANIEFDALVKKIKSSLGVGSLACVKSSQT
IKDLAFVCGSGASMFSSLKAQSCLITGDVKYHDAMIAQSLGISLIDATHYYSERGFALIVAEILHSFNYLVTIENFKNPL
QII
>P77682 ~~~yfdG~~~Prophage bactoprenol-linked glucose translocase homolog~~~COG2246
MLKLFAKYTSIGVLNTLIHWVVFGVCIYVAHTNQALANFAGFVVAVSFSFFANAKFTFKASTTTMRYMLYVGFMGTLSAT
VGWAADRCALPPMITLVTFSAISLVCGFVYSKFIVFRDAK
>P77293 2.4.1.-~~~yfdH~~~Prophage bactoprenol glucosyl transferase homolog~~~COG0463
MKISLVVPVFNEEEAIPIFYKTVREFEELKSYEVEIVFINDGSKDATESIINALAVSDPLVVPLSFTRNFGKEPALFAGL
DHATGDAIIPIDVDLQDPIEVIPHLIEKWQAGADMVLAKRSDRSTDGRLKRKTAEWFYKLHNKISNPKIEENVGDFRLMS
RDVVENIKLMPERNLFMKGILSWVGGKTDIVEYVRAERIAGDTKFNGWKLWNLALEGITSFSTFPLRIWTYIGLVVASVA
FIYGAWMILDTIIFGNAVRGYPSLLVSILFLGGIQMIGIGVLGEYIGRTYIETKKRPKYIIKRVKK
>Q9HZ47 2.7.13.3~~~gtrS~~~Sensor histidine kinase GtrS~~~
MPRSLLGRMLLLTLLAVLVAQGLSSLFWLSHLRSSQREGLLTSSRSLAYSMAASVSYFRSLPLGYRPLVLDQLRSMGGTR
FFVSLNDRPLEMRALPDTPNKQAVLEIVQDVLHQRLGKEVELQVEFVSPDELRLFNGALKLDELPRSWAHYALTLEPVNP
PVLVTQIRIGESEWLYIASLMPAPYVSLEPEGLQPQQVLSIVFTSLLLLLFTGLLMHWQSRPLKRLARAARDLALGSPSA
ALEERGASELVEVARAFNTMHERIDRYLNERGQLFSAISHDLRTPITRLRLRVELLEDERLQEKFGRDLDELELLVKGAL
QCVKDTDIHENVESVDLNLLLQHIAEPYLADGRVEVVGRAAEPYPGKPLALKRCIGNLLDNALKYGERARLSLEDGPEAV
VLHVDDDGPGVPEQRLEQIFEPRFRLSPRGQGYGLGLGIARNIAHTHGGEVSLQNRREGGLRVSLRLPRLGLE
>Q83BZ6 6.3.5.2~~~guaA~~~GMP synthase [glutamine-hydrolyzing]~~~COG0518
MLKDIHQHRILILDFGSQYAQLIARRVREIGVYCELMPCDIDEETIRDFNPHGIILSGGPETVTLSHTLRAPAFIFEIGC
PVLGICYGMQTMAYQLGGKVNRTAKAEFGHAQLRVLNPAFLFDGIEDQVSPQGEPLLDVWMSHGDIVSELPPGFEATACT
DNSPLAAMADFKRRFFGLQFHPEVTHTPQGHRILAHFVIHICQCIPNWTTKHIIEDSIRDIQEKVGKEQVIVGLSGGVDS
AVTATLVHKAIGDQLVCVLVDTGLLRLNEVDEVLNVFQKHLGAKVICVDAKDRFMKALKGISDPEEKRKIAGEQFIRVFE
EQAKKLNVKWLGQGTIYPDVIESAKTKTGKGHIIKTHHNVGGLPLNMELKLIEPLRELFKDEVRKLGLELGLPADLIYRH
PFPGPGLAIRILGEVSAEYINILKQADAIFIEELKKSDYYHQVSQAFAVFMPLKSVGVKGDARHYGYIIALRAVKTVDFM
TAQWADLPHEFLSKVSHRIVNEIKEVSRVVYDMTNKPPATIEWE
>P04079 6.3.5.2~~~guaA~~~GMP synthase [glutamine-hydrolyzing]~~~COG0518
MTENIHKHRILILDFGSQYTQLVARRVRELGVYCELWAWDVTEAQIRDFNPSGIILSGGPESTTEENSPRAPQYVFEAGV
PVFGVCYGMQTMAMQLGGHVEASNEREFGYAQVEVVNDSALVRGIEDALTADGKPLLDVWMSHGDKVTAIPSDFITVAST
ESCPFAIMANEEKRFYGVQFHPEVTHTRQGMRMLERFVRDICQCEALWTPAKIIDDAVARIREQVGDDKVILGLSGGVDS
SVTAMLLHRAIGKNLTCVFVDNGLLRLNEAEQVLDMFGDHFGLNIVHVPAEDRFLSALAGENDPEAKRKIIGRVFVEVFD
EEALKLEDVKWLAQGTIYPDVIESAASATGKAHVIKSHHNVGGLPKEMKMGLVEPLKELFKDEVRKIGLELGLPYDMLYR
HPFPGPGLGVRVLGEVKKEYCDLLRRADAIFIEELRKADLYDKVSQAFTVFLPVRSVGVMGDGRKYDWVVSLRAVETIDF
MTAHWAHLPYDFLGRVSNRIINEVNGISRVVYDISGKPPATIEWE
>P9WMS7 6.3.5.2~~~guaA~~~GMP synthase [glutamine-hydrolyzing]~~~COG0518
MVQPADIDVPETPARPVLVVDFGAQYAQLIARRVREARVFSEVIPHTASIEEIRARQPVALVLSGGPASVYADGAPKLDP
ALLDLGVPVLGICYGFQAMAQALGGIVAHTGTREYGRTELKVLGGKLHSDLPEVQPVWMSHGDAVTAAPDGFDVVASSAG
APVAAFEAFDRRLAGVQYHPEVMHTPHGQQVLSRFLHDFAGLGAQWTPANIANALIEQVRTQIGDGHAICGLSGGVDSAV
AAALVQRAIGDRLTCVFVDHGLLRAGERAQVQRDFVAATGANLVTVDAAETFLEALSGVSAPEGKRKIIGRQFIRAFEGA
VRDVLDGKTAEFLVQGTLYPDVVESGGGSGTANIKSHHNVGGLPDDLKFTLVEPLRLLFKDEVRAVGRELGLPEEIVARQ
PFPGPGLGIRIVGEVTAKRLDTLRHADSIVREELTAAGLDNQIWQCPVVLLADVRSVGVQGDGRTYGHPIVLRPVSSEDA
MTADWTRVPYEVLERISTRITNEVAEVNRVVLDITSKPPATIEWE
>B4RJH7 6.3.5.2~~~guaA~~~GMP synthase [glutamine-hydrolyzing]~~~
MTQDKILILDFGSQVTRLIARRVREAHVYCELHSFDMPLDEIKAFNPKGIILSGGPNSVYESDYQADTGIFDLGIPVLGI
CYGMQFMAHHLGGEVQPGNQREFGYAQVKTIDSGLTRGIQDDAPNTLDVWMSHGDKVSKLPDGFAVIGDTPSCPIAMMEN
TEKQFYGIQFHPEVTHTKQGRALLNRFVLDICGAQPGWTMPNYIEEAVAKIREQVGSDEVILGLSGGVDSSVAAALIHRA
IGDQLTCVFVDHGLLRLNEGKMVMDMFARNLGVKVIHVDAEGQFMAKLAGVTDPEKKRKIIGAEFIEVFDAEEKKLTNAK
WLAQGTIYPDVIESAGAKTKKAHAIKSHHNVGGLPENMKLKLLEPLRDLFKDEVRELGVALGLPREMVYRHPFPGPGLGV
RILGEVKKEYADLLRQADDIFIQELRNTTDENGTSWYDLTSQAFAVFLPVKSVGVMGDGRTYDYVVALRAVITSDFMTAH
WAELPYSLLGRVSNRIINEVKGINRVVYDVSGKPPATIEWE
>P99105 6.3.5.2~~~guaA~~~GMP synthase [glutamine-hydrolyzing]~~~
MEMAKEQELILVLDFGSQYNQLITRRIREMGVYSELHDHEISIEEIKKMNPKGIILSGGPNSVYEEGSFTIDPEIYNLGI
PVLGICYGMQLTTKLLGGKVERANEREYGKAIINAKSDELFAGLPAEQTVWMSHSDKVIEIPEGFEVIADSPSTDYAAIE
DKKRRIYGVQFHPEVRHTEYGNDLLNNFVRRVCDCKGQWTMENFIEIEIEKIRQRVGDRRVLCAMSGGVDSSVVAVLLHK
AIGDQLTCIFVDHGLLRKGEGDMVMEQFGEGFNMNIIRVNAKDRFMNKLKGVSDPEQKRKIIGNEFVYVFDDEASKLKGV
DFLAQGTLYTDVIESGTKTAQTIKSHHNVGGLPEDMEFELIEPINTLFKDEVRKLGIELGIPEHLVWRQPFPGPGLGIRV
LGEITEDKLEIVRESDAILRQVIREEGLEREIWQYFTVLPNIQSVGVMGDYRTYDHTVGIRAVTSIDGMTSDFARIDWEV
LQKISSRIVNEVDHVNRVVYDITSKPPSTIEWE
>Q5SI28 6.3.5.2~~~guaA~~~GMP synthase [glutamine-hydrolyzing]~~~COG0518
MVLVLDFGSQYTRLIARRLRELRAFSLILPGDAPLEEVLKHRPQALILSGGPRSVFDPDAPRPDPRLFSSGLPLLGICYG
MQLLAQELGGRVERAGRAEYGKALLTRHEGPLFRGLEGEVQVWMSHQDAVTAPPPGWRVVAETEENPVAAIASPDGRAYG
VQFHPEVAHTPKGMQILENFLELAGVKRDWTPEHVLEELLREVRERAGKDRVLLAVSGGVDSSTLALLLAKAGVDHLAVF
VDHGLLRLGEREEVEGALRALGVNLLVVDAKERFLKALKGVEDPEEKRKIIGREFVAAFSQVARERGPFRFLAQGTLYPD
VIESAGGHGAAKIKSHHNVGGLPEDLEFELLEPFRLLFKDEVRELALLLGLPDTLRLRHPFPGPGLAVRVLGEVTEERLE
ILRRADDIFTSLLREWGLYEKVAQALAVLTPVRSVGVAGDERKYGYVLALRAVTTEDFMTADWARLPLEFLDEAARRITR
RVPEIGRVVYDLTSKPPATIEWE
>A0QYE8 1.7.1.7~~~guaB1~~~GMP reductase~~~COG0516
MRFLDGHTPAYDLTYNDVFVVPGRSDVASRFDVDLSTVDGSGTTIPVVVANMTAVAGRRMAETVARRGGIVVLPQDLPIT
AVSETVDFVKSRDLVVDTPVTLSPEDSVSDANALLHKRAHGAAVVVFEGRPIGLVTEANCAGVDRFARVRDIALSDFVTA
PVGTDPREVFDLLEHAPIDVAVMTAPDGTLAGVLTRTGAIRAGIYTPAVDAKGRLRIAAAVGINGDVGAKAQALAEAGAD
LLVIDTAHGHQAKMLDAIKAVASLDLGLPLVAGNVVSAEGTRDLIEAGASIVKVGVGPGAMCTTRMMTGVGRPQFSAVVE
CAAAARQLGGHVWADGGVRHPRDVALALAAGASNVMIGSWFAGTYESPGDLLFDRDDRPYKESYGMASKRAVAARTAGDS
SFDRARKGLFEEGISTSRMSLDPARGGVEDLLDHITSGVRSTCTYVGAANLPELHEKVVLGVQSAAGFAEGHPLPAGW
>P9WKI3 1.7.1.7~~~guaB1~~~GMP reductase~~~COG0516
MMRFLDGHPPGYDLTYNDVFIVPNRSEVASRFDVDLSTADGSGTTIPVVVANMTAVAGRRMAETVARRGGIVILPQDLPI
PAVKQTVAFVKSRDLVLDTPVTLAPDDSVSDAMALIHKRAHGVAVVILEGRPIGLVRESSCLGVDRFTRVRDIAVTDYVT
APAGTEPRKIFDLLEHAPVDVAVLTDADGTLAGVLSRTGAIRAGIYTPATDSAGRLRIGAAVGINGDVGAKARALAEAGV
DVLVIDTAHGHQVKTLDAIKAVSALDLGLPLAAGNVVSAEGTRDLLKAGANVVKVGVGPGAMCTTRMMTGVGRPQFSAVL
ECASAARQLGGHIWADGGIRHPRDVALALAAGASNVMIGSWFAGTYESPGDLMRDRDDQPYKESYGMASKRAVVARTGAD
NPFDRARKALFEEGISTSRMGLDPDRGGVEDLIDHITSGVRSTCTYVGASNLAELHERAVVGVQSGAGFAEGHPLPAGW
>Q81JJ9 1.7.1.7~~~guaC~~~GMP reductase~~~COG0516
MGNVFDYEDIQLIPAKCIVNSRSECDTTVTLGKHKFKLPVVPANMQTIIDERIATYLAENNYFYIMHRFQPEKRISFIRD
MQSRGLIASISVGVKEDEYEFVQQLAAEHLTPEYITIDIAHGHSNAVINMIQHIKKHLPESFVIAGNVGTPEAVRELENA
GADATKVGIGPGKVCITKIKTGFGTGGWQLAALRWCAKAASKPIIADGGIRTNGDVAKSIRFGATMVMIGSLFAGHEESP
GETIEKDGKLYKEYFGSASEFQKGEKKNVEGKKMFVEHKGSLEDTLIEMEQDLQSSISYAGGTKLDSIRTVDYVVVKNSI
FNGDKVY
>O05269 1.7.1.7~~~guaC~~~GMP reductase~~~COG0516
MENVFDYEDIQLIPAKCIVNSRSECDTSVRLGGHTFKLPVVPANMQTIIDEKLAISLAENGYFYVMHRFEPETRIDFIKD
MNARGLFSSISVGVKDEEYEFVRQLAEENLTPEYVTIDIAHGHSNAVIEMIQHLKKHLPDSFVIAGNVGTPEAVRELENA
GADATKVGIGPGKVCITKIKTGFGTGGWQLAALRWCAKAASKPIIADGGIRTHGDIAKSIRFGATMVMIGSLFAGHEESP
GQTIEKDGKLYKEYFGSASEFQKGEKKNVEGKKMHVAHKGSIKDTLIEMEQDLQSSISYAGGTKLNAIRNVDYVIVKNSI
FNGDKY
>P60560 1.7.1.7~~~guaC~~~GMP reductase~~~COG0516
MRIEEDLKLGFKDVLIRPKRSTLKSRSDVELERQFTFKHSGQSWSGVPIIAANMDTVGTFSMASALASFDILTAVHKHYS
VEEWQAFINNSSADVLKHVMVSTGTSDADFEKTKQILDLNPALNFVCIDVANGYSEHFVQFVAKAREAWPTKTICAGNVV
TGEMCEELILSGADIVKVGIGPGSVCTTRVKTGVGYPQLSAVIECADAAHGLGGMIVSDGGCTTPGDVAKAFGGGADFVM
LGGMLAGHEESGGRIVEENGEKFMLFYGMSSESAMKRHVGGVAEYRAAEGKTVKLPLRGPVENTARDILGGLRSACTYVG
ASRLKELTKRTTFIRVQEQENRIFNNL
>P60563 1.7.1.7~~~guaC~~~GMP reductase~~~
MKIFDYEDIQLIPNKCIVESRSECDTTIQFGPKKFKLPVVPANMQTVMNEKLAKWFAENDYFYIMHRFDEEARIPFIKHM
QNSGLFASISVGVKKAEFDFIEKLAQEKLIPEYITIDIAHGHSDSVINMIKHIKNHIPDSFVIAGNVGTPEGVRELENAG
ADATKVGIGPGRVCITKIKTGFGTGGWQLAALNICSKAARKPLIADGGIRTHGDIAKSIRFGASMVMIGSLFAAHEESPG
ETVELDGKQYKEYFGSASEFQKGEHKNVEGKKMFVEHKGSLMDTLKEMQQDLQSSISYAGGKDLKSLRTVDYVIVRNSIF
NGDRD
>O34598 3.5.4.3~~~guaD~~~Guanine deaminase~~~COG0590
MNHETFLKRAVTLACEGVNAGIGGPFGAVIVKDGAIIAEGQNNVTTSNDPTAHAEVTAIRKACKVLGAYQLDDCILYTSC
EPCPMCLGAIYWARPKAVFYAAEHTDAAEAGFDDSFIYKEIDKPAEERTIPFYQVTLTEHLSPFQAWRNFANKKEY
>P76641 3.5.4.3~~~guaD~~~Guanine deaminase~~~COG0402
MMSGEHTLKAVRGSFIDVTRTIDNPEEIASALRFIEDGLLLIKQGKVEWFGEWENGKHQIPDTIRVRDYRGKLIVPGFVD
THIHYPQSEMVGAYGEQLLEWLNKHTFPTERRYEDLEYAREMSAFFIKQLLRNGTTTALVFGTVHPQSVDALFEAASHIN
MRMIAGKVMMDRNAPDYLLDTAESSYHQSKELIERWHKNGRLLYAITPRFAPTSSPEQMAMAQRLKEEYPDTWVHTHLCE
NKDEIAWVKSLYPDHDGYLDVYHQYGLTGKNCVFAHCVHLEEKEWDRLSETKSSIAFCPTSNLYLGSGLFNLKKAWQKKV
KVGMGTDIGAGTTFNMLQTLNEAYKVLQLQGYRLSAYEAFYLATLGGAKSLGLDDLIGNFLPGKEADFVVMEPTATPLQQ
LRYDNSVSLVDKLFVMMTLGDDRSIYRTYVDGRLVYERN
>A3DBX3 3.2.1.73~~~licB~~~Beta-glucanase~~~COG2273
MKNRVISLLMASLLLVLSVIVAPFYKAEAATVVNTPFVAVFSNFDSSQWEKADWANGSVFNCVWKPSQVTFSNGKMILTL
DREYGGSYPYKSGEYRTKSFFGYGYYEVRMKAAKNVGIVSSFFTYTGPSDNNPWDEIDIEFLGKDTTKVQFNWYKNGVGG
NEYLHNLGFDASQDFHTYGFEWRPDYIDFYVDGKKVYRGTRNIPVTPGKIMMNLWPGIGVDEWLGRYDGRTPLQAEYEYV
KYYPNGVPQDNPTPTPTIAPSTPTNPNLPLKGDVNGDGHVNSSDYSLFKRYLLRVIDRFPVGDQSVADVNRDGRIDSTDL
TMLKRYLIRAIPSL
>Q84C00 3.2.1.73~~~licB~~~Beta-glucanase~~~
MKNRVISLLMASLLLVLSVIVAPFYKAEAATVVNTPFVAVFSNFDSSQWEKADWANGSVFNCVWKPSQVTFSNGKMILTL
DREYGGSYPYKSGEYRSKSFFGYGYYEVRMKAAKNVGIVSSFFTYTGPSDNNPWDEIDIEFLGKDTTKAQFNWYKNEVGG
NEYLHNLGFDASQDFHTYGFEWRPDYIDFYVDGKKVYRGTRNIPVTPGKIMMNLWPGKGVDEWLGRYDGRTPLQAEYEYV
KYYPNGVPQDNPTPTPTIAPSTPTNPNLPLKGDVNGDGHVNSSDYSLFKRYLLRVIDRFPVGDQSVADVNRDGRIDSTDL
TMLKRYLIRAIPSL
>P27051 3.2.1.73~~~bg1~~~Beta-glucanase~~~
MSYRVKRMLMLLVTGLFLSLSTFAASASAQTGGSFYEPFNNYNTGLWQKADGYSNGNMFNCTWRANNVSMTSLGEMRLSL
TSPSYNKFDCGENRSVQTYGYGLYEVNMKPAKNVGIVSSFFTYTGPTDGTPWDEIDIEFLGKDTTKVQFNYYTNGVGNHE
KIVNLGFDAANSYHTYAFDWQPNSIKWYVDGQLKHTATTQIPQTPGKIMMNLWNGAGVDEWLGSYNGVTPLSRSLHWVRY
TKR
>P04957 3.2.1.73~~~bglS~~~Beta-glucanase~~~COG2273
MPYLKRVLLLLVTGLFMSLFAVTATASAQTGGSFFDPFNGYNSGFWQKADGYSNGNMFNCTWRANNVSMTSLGEMRLALT
SPAYNKFDCGENRSVQTYGYGLYEVRMKPAKNTGIVSSFFTYTGPTDGTPWDEIDIEFLGKDTTKVQFNYYTNGAGNHEK
IVDLGFDAANAYHTYAFDWQPNSIKWYVDGQLKHTATNQIPTTPGKIMMNLWNGTGVDEWLGSYNGVNPLYAHYDWVRYT
KK
>P37073 3.2.1.73~~~bglBB~~~Beta-glucanase~~~
MVKSKYLVFISVFSLLFGVFVVGFSHQGVKAEEERPMGTAFYESFDAFDDERWSKAGVWTNGQMFNATWYPEQVTADGLM
RLTIAKKTTSARNYKAGELRTNDFYHYGLFEVSMKPAKVEGTVSSFFTYTGEWDWDGDPWDEIDIEFLGKDTTRIQFNYF
TNGVGGNEFYYDLGFDASESFNTYAFEWREDSITWYVNGEAVHTATENIPQTPQKIMMNLWPGVGVDGWTGVFDGDNTPV
YSYYDWVRYTPLQNYQIHQ
>P17989 3.2.1.73~~~~~~Beta-glucanase~~~COG2273
MNIKKTAVKSALAVAAAAAALTTNVSAKDFSGAELYTLEEVQYGKFEARMKMAAASGTVSSMFLYQNGSEIADGRPWVEV
DIEVLGKNPGSFQSNIITGKAGAQKTSEKHHAVSPAADQAFHTYGLEWTPNYVRWTVDGQEVRKTEGGQVSNLTGTQGLR
FNLWSSESAAWVGQFDESKLPLFQFINWVKVYKYTPGQGEGGSDFTLDWTDNFDTFDGSRWGKGDWTFDGNRVDLTDKNI
YSRDGMLILALTRKGQESFNGQVPRDDEPAPQSSSSAPASSSSVPASSSSVPASSSSAFVPPSSSSATNAIHGMRTTPAV
AKEHRNLVNAKGAKVNPNGHKRYRVNFEH
>P23904 3.2.1.73~~~~~~Beta-glucanase~~~
MKKKSCFTLVTTFAFSLIFSVSALAGSVFWEPLSYFNRSTWEKADGYSNGGVFNCTWRANNVNFTNDGKLKLGLTSSAYN
KFDCAEYRSTNIYGYGLYEVSMKPAKNTGIVSSFFTYTGPAHGTQWDEIDIEFLGKDTTKVQFNYYTNGVGGHEKVISLG
FDASKGFHTYAFDWQPGYIKWYVDGVLKHTATANIPSTPGKIMMNLWNGTGVDDWLGSYNGANPLYAEYDWVKYTSN
>P45798 3.2.1.73~~~bglA~~~Beta-glucanase~~~
MCTMPLMKLKKMMRRTAFLLSVLIGCSMLGSDRSDKAPHWELVWSDEFDYSGLPDPEKWDYDVGGHGWGNQELQYYTRAR
IENARVGGGVLIIEARHEPYEGREYTSARLVTRGKASWTYGRFEIRARLPSGRGTWPAIWMLPDRQTYGSAYWPDNGEID
IMEHVGFNPDVVHGTVHTKAYNHLLGTQRGGSIRVPTARTDFHVYAIEWTPEEIRWFVDDSLYYRFPNERLTDPEADWRH
WPFDQPFHLIMNIAVGGAWGGQQGVDPEAFPAQLVVDYVRVYRWVE
>P50735 1.4.1.2~~~gudB~~~Cryptic catabolic NAD-specific glutamate dehydrogenase GudB~~~COG0334
MAADRNTGHTEEDKLDVLKSTQTVIHKALEKLGYPEEVYELLKEPMRLLTVKIPVRMDDGSVKIFTGYRAQHNDSVGPTK
GGIRFHPNVTEKEVKAVKALSIWMSLKCGIIDLPYGGGKGGIVCDPRDMSFRELERLSRGYVRAISQIVGPTKDVPAPDV
FTNSQIMAWMMDEYSRIDEFNSPGFITGKPLVLGGSHGRESATAKGVTICIKEAAKKRGIDIKGARVVVQGFGNAGSYLA
KFMHDAGAKVVGISDAYGGLYDPEGLDIDYLLDRRDSFGTVTKLFNDTITNQELLELDCDILVPAAIENQITEENAHNIR
AKIVVEAANGPTTLEGTKILSDRDILLVPDVLASAGGVTVSYFEWVQNNQGFYWSEEEVEEKLEKMMVKSFNNIYEMANN
RRIDMRLAAYMVGVRKMAEASRFRGWI
>Q6FFQ2 4.2.1.40~~~gudD~~~Glucarate dehydratase~~~COG4948
MAASTPIIQSVRAIPVAGHDSMLLNLSGAHAPYFTRNLLVIEDNSGNIGVGEIPGGEKILATLNDAKSLILGQPIGEYKN
LLKKIHQTFADRDSGGRGNQTFDLRTTVHVVTAYESALLDLLGKHLNVNVASLLGDGQQRDEVEVLGYLFFIGDRKQTSL
DYATSTHLNHDWYQVRHEKALTPEAIQRLAEASYDRYGFKDFKLKGGVLHGEQEAEAVTAIARRFPDARVTLDPNGAWYL
DEAIGLGKHLKGVLAYAEDPCGAEQGYSSREIMAEFKRATGLPTATNMIATDWREMSHSIQLQAVDIPLADPHFWTLEGS
VRVSQLCNMYNLTWGSHSNNHFDVSLAMFTHVAAAAVGNVTAIDTHWIWQEGTDHLTKQPLEIKGGKIQVPSVPGLGVEL
DWDNINRAHELYKAKGLGARNDADAMQFMVPNWKFDHKKPCLVR
>P0AES2 4.2.1.40~~~gudD~~~Glucarate dehydratase~~~COG4948
MSSQFTTPVVTEMQVIPVAGHDSMLMNLSGAHAPFFTRNIVIIKDNSGHTGVGEIPGGEKIRKTLEDAIPLVVGKTLGEY
KNVLTLVRNTFADRDAGGRGLQTFDLRTTIHVVTGIEAAMLDLLGQHLGVNVASLLGDGQQRSEVEMLGYLFFVGNRKAT
PLPYQSQPDDSCDWYRLRHEEAMTPDAVVRLAEAAYEKYGFNDFKLKGGVLAGEEEAESIVALAQRFPQARITLDPNGAW
SLNEAIKIGKYLKGSLAYAEDPCGAEQGFSGREVMAEFRRATGLPTATNMIATDWRQMGHTLSLQSVDIPLADPHFWTMQ
GSVRVAQMCHEFGLTWGSHSNNHFDISLAMFTHVAAAAPGKITAIDTHWIWQEGNQRLTKEPFEIKGGLVQVPEKPGLGV
EIDMDQVMKAHELYQKHGLGARDDAMGMQYLIPGWTFDNKRPCMVR
>P42206 4.2.1.40~~~gudD~~~Glucarate dehydratase~~~
MEALNQSQAATGAPVITDLKVVPVAGHDSMLLNLSGAHGPLFTRNILILTDSSGHVGVGEVPGGEGIRKTLEDARHLLIN
QSIGNYQSLLNKVRNAFADRDVGGRGLQTFDLRIAVHAVTAVESALLDLLGQHLQVPVAALLGEGQQRDAVEMLGYLFYV
GDRNKTDLGYRSEHEADNEWFRLRNKEALTPESVVALAEAAYDRYGFKDFKLKGGVLRGEDEIAAVTALSERFPDARITL
DPNGAWSLKEAVALCRDQHHVLAYAEDPCGAENGYSGREVMAEFRRSTGLRTATNMIATDWRQMGHAIQLQSVDIPLADP
HFWTMQGSVRVAQMCNEWGLTWGSHSNNHFDISLAMFTHVAAAAPGNITAIDTHWIWQDGQRLTKEPLQIKGGLVEVPKK
PGLGVELDWDALMKAHEVYKSMGLGARDDATAMRYLVSGWEFNNKRPCMVR
>P42237 ~~~gudP~~~Probable galactarate/D-glucarate transporter GudP~~~COG2271
MKKDFASVTPAGKKTSVRWFIVFMLFLVTSINYADRATLSITGDSVQHDLGLDSVAMGYVFSAFGWAYVIGQLPGGWLLD
RFGSKTIIALSIFFWSFFTLLQGAIGFFSAGTAIILLFALRFLVGLSEAPSFPGNGRVVASWFPSSERGTASAFFNSAQY
FAIVIFSPLMGWLTHSFGWHSVFVVMGIAGILLAVIWLKTVYEPKKHPKVNEAELAYIEQGGGLISMDDSKSKQETESKW
PYIKQLLTNRMLIGVYIAQYCITTLTYFFLTWFPVYLVQARGMSILEAGFVASLPALCGFAGGVLGGIVSDILLKKGRSL
TFARKVPIIAGMLLSCSMIVCNYTDSAWLVVVIMSLAFFGKGFGALGWAVVSDTSPKECAGLSGGLFNTFGNIASITTPI
IIGYIVNATGSFNGALVFVGANAIAAILSYLLLVGPIKRVVLKKQEQDPDQSLPV
>Q46916 ~~~gudP~~~Probable galactarate/D-glucarate transporter GudP~~~COG2271
MSSLSQAASSVEKRTNARYWIVVMLFIVTSFNYGDRATLSIAGSEMAKDIGLDPVGMGYVFSAFSWAYVIGQIPGGWLLD
RFGSKRVYFWSIFIWSMFTLLQGFVDIFSGFGIIVALFTLRFLVGLAEAPSFPGNSRIVAAWFPAQERGTAVSIFNSAQY
FATVIFAPIMGWLTHEVGWSHVFFFMGGLGIVISFIWLKVIHEPNQHPGVNKKELEYIAAGGALINMDQQNTKVKVPFSV
KWGQIKQLLGSRMMIGVYIGQYCINALTYFFITWFPVYLVQARGMSILKAGFVASVPAVCGFIGGVLGGIISDWLMRRTG
SLNIARKTPIVMGMLLSMVMVFCNYVNVEWMIIGFMALAFFGKGIGALGWAVMADTAPKEISGLSGGLFNMFGNISGIVT
PIAIGYIVGTTGSFNGALIYVGVHALIAVLSYLVLVGDIKRIELKPVAGQ
>Q46915 4.2.1.-~~~gudX~~~Glucarate dehydratase-related protein~~~COG4948
MATQSSPVITDMKVIPVAGHDSMLLNIGGAHNAYFTRNIVVLTDNAGHTGIGEAPGGDVIYQTLVDAIPMVLGQEVARLN
KVVQQVHKGNQAADFDTFGKGAWTFELRVNAVAALEAALLDLLGKALNVPVCELLGPGKQREAITVLGYLFYIGDRTKTD
LPYVENTPGNHEWYQLRHQKAMNSEAVVRLAEASQDRYGFKDFKLKGGVLPGEQEIDTVRALKKRFPDARITVDPNGAWL
LDEAISLCKGLNDVLTYAEDPCGAEQGFSGREVMAEFRRATGLPVATNMIATNWREMGHAVMLNAVDIPLADPHFWTLSG
AVRVAQLCDDWGLTWGCHSNNHFDISLAMFTHVGAAAPGNPTAIDTHWIWQEGDCRLTQNPLEIKNGKIAVPDAPGLGVE
LDWEQVQKAHEAYKRLPGGARNDAGPMQYLIPGWTFDRKRPVFGRH
>P9WIT3 1.1.2.-~~~~~~L-gulono-1,4-lactone dehydrogenase~~~COG0277
MSPIWSNWPGEQVCAPSAIVRPTSEAELADVIAQAAKRGERVRAVGSGHSFTDIACTDGVMIDMTGLQRVLDVDQPTGLV
TVEGGAKLRALGPQLAQRRLGLENQGDVDPQSITGATATATHGTGVRFQNLSARIVSLRLVTAGGEVLSLSEGDDYLAAR
VSLGALGVISQVTLQTVPLFTLHRHDQRRSLAQTLERLDEFVDGNDHFEFFVFPYADKALTRTMHRSDEQPKPTPGWQRM
VGENFENGGLSLICQTGRRFPSVAPRLNRLMTNMMSSSTVQDRAYKVFATQRKVRFTEMEYAIPRENGREALQRVIDLVR
RRSLPIMFPIEVRFSAPDDSFLSTAYGRDTCYIAVHQYAGMEFESYFRAVEEIMDDYAGRPHWGKRHYQTAATLRERYPQ
WDRFAAVRDRLDPDRVFLNDYTRRVLGP
>Q56770 2.7.8.31~~~gumD~~~UDP-glucose:undecaprenyl-phosphate glucose-1-phosphate transferase~~~
MLLADLSSATYTTSSPRLLSKYSAAADLVLRVFDLTMVVASGLIAYRIVFGTWVPAAPYRVAIATTLLYSVICFALFPLY
RSWRGRGLLSELVVLGGAFGGVFALFAVHALIVQVGEQVSRGWVGLWFVGGLVSLVAARTLLRGFLNHLRTQGVDVQRVV
VVGLRHPVMKISHYLSRNPWVGMNMVGYFRTPYDLAVAEQRQGLPCLGDPDELIEYLKNNQVEQVWISLPLGERDHIKQL
LQRLDRYPINVKLVPDLFDFGLLNQSAEQIGSVPVINLRQGGVDRDNYFVVAKALQDKILAVIALMGLWPLMLAIAVGVK
MSSPGPVFFRQRRHGLGGREFYMFKFRSMRVHDDHGTTIQQATKNDTRITRFGSFLRRSSLDELPQIFNVLGGSMSIVGP
RPHAAQHNTHYEKLINHYMQRHYVKPGITGWAQVNGFRGETPELRTMKKRIQYDLDYIRRWSLWLDIRIIVLTAVRVLGQ
KTAY
>Q44571 2.4.1.252~~~aceC~~~GDP-mannose:cellobiosyl-diphosphopolyprenol alpha-mannosyltransferase~~~
MNSKKRGDETLKVLHICRQFSPSVGGLEDSLLNLARSQRQRLGIDAEVLTLDTVFGRPGKLPHRDVVDGIPVTRLAWRGS
TKYPLAPQVLRHIGGFDLLHVHAIDFFFDFLAWTWPLHRKTMIASTHGGFFHTGALRRIKEIWFRTITPISVRAYKKIVA
CSYSDADLFRHVAAGRLITIENGINQTRFRDAASRTPNRTILAFGRFAVHKRLKLLFQLVALLRAYNSGWNIIVAGQDSN
LTADDLRAQARACGIEDSLRIVSGPSDAELRGLMGEASFFGCLSAHEGFGLAAVEAMSAGLVPILSNITPFARLMQQGAA
GVMVNPDNLAPGAREAEDMAAALPETADALRARNMDVASRYDWHSVAHEYARLYQQVLGRALPEANMAAAGAE
>Q56774 2.4.1.252~~~gumH~~~GDP-mannose:cellobiosyl-diphosphopolyprenol alpha-mannosyltransferase~~~COG0438
MKVVHVVRQFHPSIGGMEEVVLNVARQHQANSADTVEIVTLDRVFTDPSAQLAQHELHQGLSITRIGYRGSSRYPIAPSV
LGAIRSADVVHLHGIDFFYDYLALTKPLHGKPMVVSTHGGFFHTAYASRMKQIWFQTLTRTSALAYARVIATSENDGDLF
AKVVAPSRLRVIENGVDVEKYAGQGARAPGRTMLYFGRWSVNKGLIETLELLQAALTRDPQWRLIIAGREYDLNEADLRK
AIAERGLQDKVQLSMSPSQQQLCALMQQAQFFVCLSRHEGFGIAAVEAMSAGLIPILSDIPPFVRLATESGQGVIVNRDR
IQAAADSVQALALQANADFDARRTATMAYVARYDWRHVVGRYIDEYHAALGTPRTQEAVR
>Q56775 2.4.1.251~~~gumI~~~GDP-mannose:glycolipid 4-beta-D-mannosyltransferase~~~
MSASASLPVTRAAAAPRITVLFSTEKPNANTNPYLTQLYDALPDAVQPRFFSMREALLSRYDVLHLHWPEYLLRHPSKMG
TLAKQACAALLLMKLQLTGTPVVRTLHNLAPHEDRGWRERALLRWIDQLTRRWIRINATTPVRPPFTDTILHGHYRDWFA
TMEQSTTLPGRLLHFGLIRPYKGVEVLLDVMRDVQDPRLSLRIVGNPATPXMRTLVETACAQDARISALLAYVEEPVLAR
EVSACELVVLPYKQMHNSGTLLLALSLARPVLAPWSESNAAIADEVGPGWVFLYEGEFDAALLSGMLDQVRAAPRGPAPD
LSQRDWPRIGQLHYRTYLEALGKDGDAAL
>Q8GCH2 2.4.1.264~~~gumK~~~UDP-glucuronate:glycolipid 2-beta-glucuronosyltransferase~~~
MSVSPAAPASGIRRPCYLVLSAHDFRTPRRANIHFITDQLALRGTTRFFSLRYSRLSRMKGDMRLPLDDTANTVVSHNGV
DCYLWRTTVHPFNTRRSWLRPVEDAMFRWYAAHPPKQLLDWMRESDVIVFESGIAVAFIELAKRVNPAAKLVYRASDGLS
TINVASYIEREFDRVAPTLDVIALVSPAMAAEVASRDNVFHVGHGVDHNLDQLGDPSPYAEGIHAVAVGSMLFDPEFFVV
ASKAFPQVTFHVIGSGMGRHPGYGDNVIVYGEMKHAQTIGYIKHARFGIAPYASEQVPVYLADSSMKLLQYDFFGLPAVC
PNAVVGPYKSRFGYTPGNADSVIAAITQALEAPRVRYRQCLNWSDTTDRVLDPRAYPETRLYPHPPTAAPQLSSEAALSH
>P54583 3.2.1.4~~~~~~Endoglucanase E1~~~COG2730
MPRALRRVPGSRVMLRVGVVVAVLALVAALANLAVPRPARAAGGGYWHTSGREILDANNVPVRIAGINWFGFETCNYVVH
GLWSRDYRSMLDQIKSLGYNTIRLPYSDDILKPGTMPNSINFYQMNQDLQGLTSLQVMDKIVAYAGQIGLRIILDRHRPD
CSGQSALWYTSSVSEATWISDLQALAQRYKGNPTVVGFDLHNEPHDPACWGCGDPSIDWRLAAERAGNAVLSVNPNLLIF
VEGVQSYNGDSYWWGGNLQGAGQYPVVLNVPNRLVYSAHDYATSVYPQTWFSDPTFPNNMPGIWNKNWGYLFNQNIAPVW
LGEFGTTLQSTTDQTWLKTLVQYLRPTAQYGADSFQWTFWSWNPDSGDTGGILKDDWQTVDTVKDGYLAPIKSSIFDPVG
ASASPSSQPSPSVSPSPSPSPSASRTPTPTPTPTASPTPTLTPTATPTPTASPTPSPTAASGARCTASYQVNSDWGNGFT
VTVAVTNSGSVATKTWTVSWTFGGNQTITNSWNAAVTQNGQSVTARNMSYNNVIQPGQNTTFGFQASYTGSNAAPTVACA
AS
>P16216 3.2.1.4~~~Eg I~~~Endoglucanase 1~~~
MNSKKIGAMIAAAVLSLIVMTPAATRKIVQRQTRNSSTAVENSAADESETENVPVSQTHTNDTMTVTSAKDLVAKMTNGW
NLGNTMDATAQGLGSEVSWLPLKVTTNKYMIDMLPEAGFNVLRIPVSWGNHIIDDKYTSDPAWMDRVQEIVNYGIDNGLY
VILNTHHEEWYMPKPSEKDGDIEEIKAVWAQIADRFKGYDEHLIFEGLNEPRLRGEGAEWTGTSEAREIINEYEKAFVET
VRASGGNNGDRCLMITGYAASSAYNNLSAIELPEDSDKLIISVHAYLPYSFALDTKGTDKYDPEDTAIPELFEHLNELFI
SKGIPVIVGEFGTMNKENTEDRVKCLEDYLAAAAKYDIPCVWWDNYARIGNGENFGLMNRADLEWYFPDLIETFKTYAEK
DPASAE
>P33682 3.2.1.4~~~celA1~~~Endoglucanase 1~~~
MSRKLRTLMAALCALPLAFAAAPPAHAADPTTMTNGFYADPDSSASRWAAANPGDGRAAAINASIANTPMARWFGSWSGA
IGTAAGAYAGAADGRDKLPILVAYNIYNRDYCGGHSAGGAASPSAYADWIARFAGGIAARPAVVILEPDSLGDYGCMNPA
QIDEREAMLTNALVQFNRQAPNTWVYMDAGNPRWADAATMARRLHEAGLRQAHGFSLNVSNYITTAENTAYGNAVNNELA
ARYGYTKPFVVDTSRNGNGSNGEWCNPSGRRIGTPTRTGGGAEMLLWIKTPGESDGNCGVGSGSTAGQFLPEVAYKMIYG
Y
>Q05156 3.2.1.4~~~cel1~~~Cellulase 1~~~
MKRRTTAVLTLTALLGTALTALPVQQAGAEEVEQVRNGTFDTTTDPWWTSNVTAGLSDGRLCADVPGGTTNRWDSAIGQN
DITLVKGETYRFSFHASGIPEGHVVRAVVGLAVSPYDTWQEASPVLTEADGSYSYTFTAPVDTTQGQVAFQVGGSTDAWR
FCVDDVSLLGGVPPEVYEPDTGPRVRVNQVAYLPAGPKNATLVTDATARLPWQLRNAQGTTVARGLTVPRGVDASSGQNV
HSIDFGSYRGRGTGYTLVADGETSHPFDIDAAAYRPLRLDSVKYYYTQRSGIAIRDDLRPGYGRAAGHLNVAPNQGDANV
PCQPGVCDYTLDVTGGWYDAGDHGKYVVNGGIATWELLSTYERSLTARTGHPAALGDGTLALPESGNKVPDVLDEARWEL
EFLLKMQVPAGQPLAGMAHHKLHDEQWTGLPLLPDQDPQKRELHPPTTAATLNLAATAAQAARLYRPFDKAFAARALTAA
RTAWQAALAHPDLLADPNDGTGGGAYNDDDVTDEFYWAAAELYLTTGERQFADHVLDSPVHTADIFGPTGFDWGHTAAAG
RLDLALVPSRLPGRDQVRRSVIKAADTYLATLTAHPYGMPYAPAGNRYDWGSSHQVLNNGVVLASAYDLTGAAKYRDGAL
QGMDYVLGRNALNMSYVTGYGEVSSHNQHSRWYAHQLDPTLPNPPSGTLAGGPNSSIQDPYAQSKLTGCVGQFCYIDDIQ
SWSTNETAINWNAALARMASFAADQG
>P13933 3.2.1.4~~~casA~~~Endoglucanase 1~~~
MENPRTTPTPTPLRRRRSERRARGGRVLTALTGVTLLAGLAIAPAATGASPSPAPPASPAPSADSGTADAGTTALPSMEL
YRAEAGVHAWLDANPGDHRAPLIAERIGSQPQAVWFAGAYNPGTITQQVAEVTSAAAAAGQLPVVVPYMIPFRDCGNHSG
GGAPSFAAYAEWSGLFAAGLGSEPVVVVLEPDAIPLIDCLDNQQRAERLAALAGLAEAVTDANPEARVYYDVGHSAWHAP
AAIAPTLVEAGILEHGAGIATNISNYRTTTDETAYASAVIAELGGGLGAVVDTSRNGNGPLGSEWCDPPGRLVGNNPTVN
PGVPGVDAFLWIKLPGELDGCDGPVGSFSPAKAYELAGG
>P10475 3.2.1.4~~~eglS~~~Endoglucanase~~~COG2730
MKRSISIFITCLLITLLTMGGMIASPASAAGTKTPVAKNGQLSIKGTQLVNRDGKAVQLKGISSHGLQWYGEYVNKDSLK
WLRDDWGITVFRAAMYTADGGYIDNPSVKNKVKEAVEAAKELGIYVIIDWHILNDGNPNQNKEKAKEFFKEMSSLYGNTP
NVIYEIANEPNGDVNWKRDIKPYAEEVISVIRKNDPDNIIIVGTGTWSQDVNDAADDQLKDANVMYALHFYAGTHGQFLR
DKANYALSKGAPIFVTEWGTSDASGNGGVFLDQSREWLKYLDSKTISWVNWNLSDKQESSSALKPGASKTGGWRLSDLSA
SGTFVRENILGTKDSTKDIPETPSKDKPTQENGISVQYRAGDGSMNSNQIRPQLQIKNNGNTTVDLKDVTARYWYKAKNK
GQNFDCDYAQIGCGNVTHKFVTLHKPKQGADTYLELGFKNGTLAPGASTGNIQLRLHNDDWSNYAQSGDYSFFKSNTFKT
TKKITLYDQGKLIWGTEPN
>P26222 3.2.1.4~~~celB~~~Endoglucanase E-2~~~
MSPRPLRALLGAAAAALVSAAALAFPSQAAANDSPFYVNPNMSSAEWVRNNPNDPRTPVIRDRIASVPQGTWFAHHNPGQ
ITGQVDALMSAAQAAGKIPILVVYNAPGRDCGNHSSGGAPSHSAYRSWIDEFAAGLKNRPAYIIVEPDLISLMSSCMQHV
QQEVLETMAYAGKALKAGSSQARIYFDAGHSAWHSPAQMASWLQQADISNSAHGIATNTSNYRWTADEVAYAKAVLSAIG
NPSLRAVIDTSRNGNGPAGNEWCDPSGRAIGTPSTTNTGDPMIDAFLWIKLPGEADGCIAGAGQFVPQAAYEMAIAAGGT
NPNPNPNPTPTPTPTPTPPPGSSGACTATYTIANEWNDGFQATVTVTANQNITGWTVTWTFTDGQTITNAWNADVSTSGS
SVTARNVGHNGTLSQGASTEFGFVGSKGNSNSVPTLTCAAS
>P14250 3.2.1.4~~~cel-3~~~Endoglucanase 3~~~COG2730
MQLKNFYPKMSVLGIATVMALTACGDENTQALFANNPVPGAENQVPVSSSDMSPTSSDAVIDPTSSSAAVVDPSTLPAEG
PITMPEGLGTLVDDFEDGDNLSKIGDYWYTYNDNDNGGASIITTPLNEEENIIPGRVNNGSNYALQVNYTLDRGDYEYDP
YVGWGVQVAPDEANGHFGGLTYWYKGGAHEVHIEITDVEDYDVHLAKFPASRTWKQAVVRFKDLVQGGWGKEIPFDAKHI
MAISFQAKGNKSKLVTDSLFIDNIYLQDSSEVEKDQPDMEIKDPVIPVVEFTEAEITVTNPLQEKAMKYLNKGVNFTNWL
ENADGKFKSFELGESDVKILADNGFKSLRLPIDLDLYATNRDAFIAGTDTELKFDDDTLFLVLDSFVEWTAKYNMSFVID
YHEYDNSYNTTSAKDPNYIKMMAETWKHVAAHYAESPREDLFFELLNEPDMSDGKVTAATWTTAAQAMIDAIRTVDTKHT
ILFGDAQWYSITLLAKRTPFTDDNIIYVIHTYEPFAFTHQGGSWTDYATIHDIPFPYDPAKWSTVSGDFGVNKSTKSYVK
TNIKNYYKTGSKEAILEQILKAKKWAATNNVPVIINEFGALNLRSTAESRLNYLTAMREICDTLQIPWTHWGYTGNFSVI
ENGKLIEGLDKALGVGSK
>Q07940 3.2.1.4~~~Eg IV~~~Endoglucanase 4~~~
MLDKLKVINGKLTAGEKPVRLFGLSTHGIAWYPEYICEESFNALKKDWRTNCIRIAMYTDEFRGYCKDGNKQHLKELIEK
GVVIAEKLDMYVIVDWHVLCDQDPMKYIDEAEEFFSDMSKRFANKTNVIYEICNEPNCSGTWDKITEYADRIIPIIRSNS
PDALIVTGTSTWSQDIHCALEKPLKWDNVMYSLHFYAATHKGTLRSRLERCIEAGLPVFINEFNLCEASGKGDIDIDEAN
AWYEVIDRLGLSCISWCLSNSGDTCGVFTQNCTKLSGWTDEDIKTSGKIIKGWFEAFADEENTNEQCFRIDK
>P26221 3.2.1.4~~~celD~~~Endoglucanase E-4~~~
MSVTEPPPRRRGRHSRARRFLTSLGATAALTAGMLGVPLATGTAHAEPAFNYAEALQKSMFFYEAQRSGKLPENNRVSWR
GDSGLNDGADVGLDLTGGWYDAGDHVKFGFPMAFTATMLAWGAIESPEGYIRSGQMPYLKDNLRWVNDYFIKAHPSPNVL
YVQVGDGDADHKWWGPAEVMPMERPSFKVDPSCPGSDVAAETAAAMAASSIVFADDDPAYAATLVQHAKQLYTFADTYRG
VYSDCVPAGAFYNSWSGYQDELVWGAYWLYKATGDDSYLAKAEYEYDFLSTEQQTDLRSYRWTIAWDDKSYGTYVLLAKE
TGKQKYIDDANRWLDYWTVGVNGQRVPYSPGGMAVLDTWGALRYAANTAFVALVYAKVIDDPVRKQRYHDFAVRQINYAL
GDNPRNSSYVVGFGNNPPRNPHHRTAHGSWTDSIASPAENRHVLYGALVGGPGSPNDAYTDDRQDYVANEVATDYNAGFS
SALAMLVEEYGGTPLADFPPTEEPDGPEIFVEAQINTPGTTFTEIKAMIRNQSGWPARMLDKGTFRYWFTLDEGVDPADI
TVSSAYNQCATPEDVHHVSGDLYYVEIDCTGEKIFPGGQSEHRREVQFRIAGGPGWDPSNDWSFQGIGNELAPAPYIVLY
DDGVPVWGTAPEEGEEPGGGEGPGGGEEPGEDVTPPSAPGSPAVRDVTSTSAVLTWSASSDTGGSGVAGYDVFLRAGTGQ
EQKVGSTTRTSFTLTGLEPDTTYIAAVVARDNAGNVSQRSTVSFTTLAENGGGPDASCTVGYSTNDWDSGFTASIRITYH
GTAPLSSWELSFTFPAGQQVTHGWNATWRQDGAAVTATPMSWNSSLAPGATVEVGFNGSWSGSNTPPTDFTLNGEPCALA
>A0A8A1G1R1 3.2.1.4~~~MaCel5A~~~Endoglucanase MaCel5A~~~
MKRILFTAGGCLFYLLLAVKAYAYDCSATPAYQDGVNYQSGDLVSNTGAAYRCNVAGWCSTGGAYAPGTGWAWTEAWDEL
GSCDGGAGSSGSSSSSSSSSSSSSGSSSSSSGSGSSSGGTSCDGIEAWQVASIYTEGNVVQQNGERYIANWWNQGQSPED
NSGAYEVWSAAGSCSGAGSSSSSSGGTSSSGSSSSGVSSSGGSSGGDSPIARHGKLHVCGNGLCNADNVPVQLRGMSTHG
LQWYGWGNCITGNSLDTLAEDWNADILRVSLYVQEGGYETDPAGYTAQVSHIIDEVTARGMYVLVDWHQLDPGDPNANLD
NARQFFTDIAQAHGDKTNIIYDVANEPNNVSWDAIQRYAMEVIPVIRQYAPDAVVLVGTHGWASLGISDGGSAQDIFNNP
VTIDNIMYTFHFYAASHGQVYRDELRSALERGMPVFVTEWGSQTYTGDDGNDFVSTQAYLDLLDQYQISWTNWNYSDDFR
TGAVWNTGTCSADSWGVGNLKEAGAWVRDKIRNR
>O85465 3.2.1.4~~~cel5A~~~Endoglucanase 5A~~~
MKKITTIFVVLLMTVALFSIGNTTAADNDSVVEEHGQLSISNGELVNERGEQVQLKGMSSHGLQWYGQFVNYESMKWLRD
DWGINVFRAAMYTSSGGYIDDPSVKEKVKEAVEAAIDLDIYVIIDWHILSDNDPNIYKEEAKDFFDEMSELYGDYPNVIY
EIANEPNGSDVTWGNQIKPYAEEVIPIIRNNDPNNIIIVGTGTWSQDVHHAADNQLADPNVMYAFHFYAGTHGQNLRDQV
DYALDQGAAIFVSEWGTSAATGDGGVFLDEAQVWIDFMDERNLSWANWSLTHKDESSAALMPGANPTGGWTEAELSPSGT
FVREKIRESASIPPSDPTPPSDPGEPDPTPPSDPGEYPAWDPNQIYTNEIVYHNGQLWQAKWWTQNQEPGDPYGPWEPLN
>Q01786 3.2.1.4~~~celE~~~Endoglucanase E-5~~~
MAKSPAARKGXPPVAVAVTAALALLIALLSPGVAQAAGLTATVTKESSWDNGYSASVTVRNDTSSTVSQWEVVLTLPGGT
TVAQVWNAQHTSSGNSHTFTGVSWNSTIPPGGTASSGFIASGSGEPTHCTINGAPCDEGSEPGGPGGPGTPSPDPGTQPG
TGTPVERYGKVQVCGTQLCDEHGNPVQLRGMSTHGIQWFDHCLTDSSLDALAYDWKADIIRLSMYIQEDGYETNPRGFTD
RMHQLIDMATARGLYVIVDWHILTPGDPHYNLDRAKTFFAEIAQRHASKTNVLYEIANEPNGVSWASIKSYAEEVIPVIR
QRDPDSVIIVGTRGWSSLGVSEGSGPAEIAANPVNASNIMYAFHFYAASHRDNYLNALREASELFPVFVTEFGTETYTGD
GANDFQMADRYIDLMAERKIGWTKWNYSDDFRSGAVFQPGTCASGGPWSGSSLKASGQWVRSKLQS
>A3DC29 3.2.1.4~~~celA~~~Endoglucanase A~~~COG3405
MKNVKKRVGVVLLILAVLGVYMLAMPANTVSAAGVPFNTKYPYGPTSIADNQSEVTAMLKAEWEDWKSKRITSNGAGGYK
RVQRDASTNYDTVSEGMGYGLLLAVCFNEQALFDDLYRYVKSHFNGNGLMHWHIDANNNVTSHDGGDGAATDADEDIALA
LIFADKLWGSSGAINYGQEARTLINNLYNHCVEHGSYVLKPGDRWGGSSVTNPSYFAPAWYKVYAQYTGDTRWNQVADKC
YQIVEEVKKYNNGTGLVPDWCTASGTPASGQSYDYKYDATRYGWRTAVDYSWFGDQRAKANCDMLTKFFARDGAKGIVDG
YTIQGSKISNNHNASFIGPVAAASMTGYDLNFAKELYRETVAVKDSEYYGYYGNSLRLLTLLYITGNFPNPLSDLSGQPT
PPSNPTPSLPPQVVYGDVNGDGNVNSTDLTMLKRYLLKSVTNINREAADVNRDGAINSSDMTILKRYLIKSIPHLPY
>Q5YLG1 3.2.1.4~~~eglA~~~Endoglucanase A~~~
MLIFETYLILFKTVQITKRRIERRRLRLLNQCFTKKEGVSNREMASYNYVEVLQKSMLFYEAQRSGRLPESNRLNWRGDS
GLKDGKDVGHDLTGGWYDAGDHVKFGLPMAYSAAVLAWTVYEYREAYEEAELLDEILDQIKWATDYFLKAHTGPNEFWAQ
VGDGNADHAWWGPAEVMPMNRPAFKIDEHCPGTEVAAQTAAALAAGSIIFKETDASYAAKLLTHAKQLYAFADRYRGKYT
DCVTNAQPFYNSWSGYVDELIWGGIWLYLATNEETYLNKALKAVEEWPQDWDYTFTMSWDNTFFASQILLARITKENRFI
ESTERNLDYWTTGLVQNGKVERITYTPGGLAWLDQWGSLRYAANAAFLAFVYADWVSDQEKKNRYQSFAIKQTHYMLGDN
PLNRSYVVGFGQNSPKHPHHRTAHGSWSNQLTNPPSHRHTLYGALVGGPNAQDQYDDDISDYISNEVATDYNAAFTGNIA
KMVQLFGEGQSKLPNFPPKEQVEDEFFVEAAVMHNDTTSTQVKAVLYNRSGWPARSSQTLSFRYYVNLSEVFAKGFTEKD
IQVTAAYNEGASLSPLKVYDASSRVYFAEIDFTGVVISPRGESEHKKEIQFRLSAPNGSNIWDASNDYSYQGLTSNMQKT
TKIPVFEDGVLVFGTLPDK
>P22541 3.2.1.4~~~celA~~~Endoglucanase A~~~
MVSKKQKFLTVILVIVLAIVIVGGVFGISFVKGRVTFPWQLQNSEAKTEQVKEPAKEEPKLVIKEKKQDESAKKEQELKK
AKEEAEAAVEKETEKTEEEPVDNLLNDMKLKYYGKLAVEGSHLVDADGHEVLLMGVSTHGINWYPEYASAETIKSLRDTW
GINVIRLAMYTSDYNGYCVAGKENQEKLKDIIDDAVEAATDNDMYVIIDWHTLNDADPNEYKADAIQFFGEMVRKYKDNE
NVIYEICNEPNGDTTWNDVRRYANEVIPVIRNVDAIILVGTPKWATDLDSVLDKPLDFDNIMYTYHFYAGTHHKAERNAL
RDALDEGLPVFISEYGLVDADGDGNLNEKEADYWYDMIRKEYGVSSCMWNLSNKDEGSAMINADCDKLSDFTEEDLSESA
MWLIDQISQLKHSDLEQGVDWITPENNNR
>P07984 3.2.1.4~~~cenA~~~Endoglucanase A~~~
MSTRRTAAALLAAAAVAVGGLTALTTTAAQAAPGCRVDYAVTNQWPGGFGANVTITNLGDPVSSWKLDWTYTAGQRIQQL
WNGTASTNGGQVSVTSLPWNGSIPTGGTASFGFNGSWAGSNPTPASFSLNGTTCTGTVPTTSPTPTPTPTTPTPTPTPTP
TPTPTVTPQPTSGFYVDPTTQGYRAWQAASGTDKALLEKIALTPQAYWVGNWADASHAQAEVADYTGRAVAAGKTPMLVV
YAIPGRDCGSHSGGGVSESEYARWVDTVAQGIKGNPIVILEPDALAQLGDCSGQGDRVGFLKYAAKSLTLKGARVYIDAG
HAKWLSVDTPVNRLNQVGFEYAVGFALNTSNYQTTADSKAYGQQISQRLGGKKFVIDTSRNGNGSNGEWCNPRGRALGER
PVAVNDGSGLDALLWVKLPGESDGACNGGPAAGQWWQEIALEMARNARW
>P54937 3.2.1.4~~~celA~~~Endoglucanase A~~~
MKRSLLKTCSIIAGATIIFSSLSISRNPLEVQAASMRSASEIVQEMGVGWNLGNTLDAKITNLSYNTSPISFETGWGNPV
TTKAMIDKIKNAGFKTIRIPTTWGEHLDGNNKLNEEWVKRVKEVVDYCIADDLYVILNTHHEGNWVIPTYAKESSVTPKL
KTLWTQISEAFKDYDDHLIFETLNEPRLEGTPYEWTGGTSESRDVVNKYNAAALESIRKTGGNNLSRAVMMPTYAASGSS
TTMNDFKVPDDKNVIASVHAYSPYFFAMDTSSNSVNTWGSSYDKYSLDVELDSYLNTFKSKGVPVVIGEFGSINKNNTSS
RAELAEYYVTAAQKRGIPCVWWDNNYAETNKGETFGLLNRSTLNWYFSDIKDALIRGYKNVHPEATEDDKPSTDVTNPDS
GNTKPDSGNTNPGTETTTPTDNEKISITSKINDWGGAYQADFTLKNNTSSDINNWSFKIKKNDIVFTNYWDVKITEENGY
YVVTPQAWKTTILANSSIVISIQGTGKVISNFEYKFD
>P23665 3.2.1.4~~~endA~~~Endoglucanase A~~~
MNCRKYLLSGLAVFGLAATSAVAALSTDDYVEAAWMTTRFFGAQRSGQGPNWILDGTSNPTSFTKDSYNGKDVSGGWFDC
GDHVMYGQSQGYASYVLALAYAEFTEVSTTFILVTTPTTRKPTTTPMKSGKPNKVRDLLEELRYEADFWVKAAIDGNNFV
TVKGDGNADHQKWVTAGAMSKLGSGEGGEPRCITGNANDGFTSGLAAAMLAVMARVDPDTANQAKYLKAAKTAYSYAKSH
KGVTNSQGFYESSWWDGRWEDGPFLAELELYRTTGENSYKTAAIDRYDNLKFSLGEGTHFMYSNVVPLSAVMAEAVFEET
PHGMRKEAIGVLDLIYEEKAKDKIFQNPNGMGSGKFPVRVPSGGAFLYALSDKFNNTNEHMEMIEKNVSYLLGDNGSKKS
YVVGFSKNGANAPSRPHHRGYYANEKRWRRSRRCSESSRKEQALGRYDCWRLY
>P37696 3.2.1.4~~~cmcAX~~~Probable endoglucanase~~~
MSVMAAMGGAQVLSSTGAFADPAPDAVAQQWAIFRAKYLRPSGRVVDTGNGGESHSEGQGYGMLFAASAGDLASFQSMWM
WARTNLQHTNDKLFSWRFLKGHQPPVPDKNNATDGDLLIALALGRAGKRFQRPDYIQDAMAIYGDVLNLMTMKAGPYVVL
MPGAVGFTKKDSVILNLSYYVMPSLLQAFDLTADPRWRQVMEDGIRLVSAGRFGQWRLPPDWLAVNRATGALSIASGWPP
RFSYDAIRVPLYFYWAHMLAPNVLADFTRFWNNFGANALPGWVDLTTGARSPYNAPPGYLAVAECTGLDSAGELPTLDHA
PDYYSAALTLLVYIARAEETIK
>O08342 3.2.1.4~~~celA~~~Endoglucanase A~~~
MTKTFKKFSIAGLALLFMATAAFAGWSTKASAADMRSLTAAQITAEMGAGWNLGNQLEATVNGTPNETSWGNPTITPELI
KKVKAAGFKTIRIPVSYLNYIGSAPNYTVNASWLNRIQQVVDYAYNEGLYVVINMHGDGFHSIPGSWLHVNSSNQNVIRD
KYQKVWQQVATRFSAYNERLIFESMNEVFDGNYNNPNTSYYGNLNAYNQIFVDTVRKTGGNNNARWLLVPGWNTNIDYTV
GNYGFVVPTDNFRSSAIPSSQKRIMISAHYYSPWDFAGEENGNITQWGATATNPAKRSTWGQEDYLDSQFKSMYDKFVTQ
GYPVVMGEFGSIDKSSYDSSNNNYRAVYAKAVTATAKKYKLVPVYWDNGFNGQHGFALFNRFNNTVTQQNIINAIMQGMQ
>P23660 3.2.1.4~~~celA~~~Endoglucanase A~~~COG2730
MRKPDKDADRLTTLDLARSGEVRDISAMELVGEMKTGWNLGNSLDATGAPGNASEVNWGNPKTTKEMIDAVYNKGFDVIR
IPVTWGGHVGDAPDYKIDDEWIARVQEVVNYAYDDGAYVIINSHHEEDWRIPDNEHIDAVDEKTAAIWKQVAERFKDYGD
HLIFEGLNEPRVKGSPQEWNGGTEEGRRCVDRLNKTFLDTVRATGGNNEKRLLLMTTYASSSMSNVIKDTAIPEDDHIGF
SIHAYTPYAFTYNANADWELFHWDDSHDGELVSLMTNLKENYLDKDIPVIITEYGAVNKDNNDEDRAKWVSSYIEYAELL
GGIPCVWWDNGYYSSGNELFGIFDRNTCTWFTDTVTDAIIENAK
>P17901 3.2.1.4~~~celCCA~~~Endoglucanase A~~~COG2730
MKKTTAFLLCFLMIFTALLPMQNANAYDASLIPNLQIPQKNIPNNDGMNFVKGLRLGWNLGNTFDAFNGTNITNELDYET
SWSGIKTTKQMIDAIKQKGFNTVRIPVSWHPHVSGSDYKISDVWMNRVQEVVNYCIDNKMYVILNTHHDVDKVKGYFPSS
QYMASSKKYITSVWAQIAARFANYDEHLIFEGMNEPRLVGHANEWWPELTNSDVVDSINCINQLNQDFVNTVRATGGKNA
SRYLMCPGYVASPDGATNDYFRMPNDISGNNNKIIVSVHAYCPWNFAGLAMADGGTNAWNINDSKDQSEVTWFMDNIYNK
YTSRGIPVIIGECGAVDKNNLKTRVEYMSYYVAQAKARGILCILWDNNNFSGTGELFGFFDRRSCQFKFPEIIDGMVKYA
FEAKTDPDPVIVYGDYNNDGNVDALDFAGLKKYIMAADHAYVKNLDVNLDNEVNAFDLAILKKYLLGMVSKLPSN
>P27035 3.2.1.4~~~celA~~~Endoglucanase CelA~~~
MKRLLALLATGVSIVGLTALAGPPAQAATGCKAEYTITSQWEGGFQAGVKITNLGDPVSGWTLGFTMPDAGQRLVQGWNA
TWSQSGSAVTAGGVDWNRTLATGASADLGFVGSFTGANPAPTSFTLNGATCSGSVTDPPTDPPTDPPATGTPAAVNGQLH
VCGVHLCNQYDRPIQLRGMSTHGIQWFGPCYGDASLDRLAQDWKSDLLRVAMYVQEDGYETDPAGFTSRVNGLVDMAEDR
GMYAVIDFHTLTPGDPNYNLDRARTFFSSVAARNDKKNVIYEIANEPNGVSWTAVKSYAEQVIPVIRAADPDAVVIVGTR
GWSSLGVSDGANESEVVNNPVNATNIMYAFHFYAASHKDDYRAAVRPAATRLPLFVSEFGTVSATAWSVDRSSSVAWLDL
LDQLKISYANWTYSDADEGSAAFRPGTCEGTDYSSSGVLTESGALVKSRISTTDDFPTS
>P19487 3.2.1.4~~~engXCA~~~Major extracellular endoglucanase~~~COG2730
MSIFRTASTLALATALALAAGPAFSYSINNSRQIVDDSGKVVQLKGVNVFGFETGNHVMHGLWARNWKDMIVQMQGLGFN
AVRLPFCPATLRSDTMPASIDYSRNADLQGLTSLQILDKVIAEFNARGMYVLLDHHTPDCAGISELWYTGSYTEAQWLAD
LRFVANRYKNVPYVLGLDLKNEPHGAATWGTGNAATDWNKAAERGSAAVLAVAPKWLIAVEGITDNPVCSTNGGIFWGGN
LQPLACTPLNIPANRLLLAPHVYGPDVFVQSYFNDSNFPNNMPAIWERHFGQFAGTHALLLGEFGGKYGEGDARDKTWQD
ALVKYLRSKGINQGFYWSWNPNSGDTGGILRDDWTSVRQDKMTLLRTLWGTAGNTTPTPTPTPTPTPTPTPTPTPTPTPG
TSTFSTKVIVDNSWNGGYCNRVQVTNTGTASGTWSIAVPVTGTVNNAWNATWSQSGSTLRASGVDFNRTLAAGATAEFGF
CAAS
>P18126 3.2.1.4~~~celB~~~Endoglucanase B~~~COG2730
MNLLSGWVRPLMLGCGLLGAALSAGSIQAAVCEYRVTNEWGSGFTASIRITNNGSSTINGWSVSWNYTDGSRVTSSWNAG
LSGANPYSATPVGWNTSIPIGSSVEFGVQGNNGSSRAQVPAVTGAICGGQGSSAPSSVASSSSSSSVVSSTPRSSSSSVS
SSVPGTSSSSSSSVLTGAQACNWYGTLTPLCNNTSNGWGYEDGRSCVARTTCSAQPAPYGIVSTSSSTPLSSSSSSRSSV
ASSSSLSSATSSSASSVSSVPPIDGGCNGYATRYWDCCKPHCGWSANVPSLVSPLQSCSANNTRLSDVSVGSSCDGGGGY
MCWDKIPFAVSPTLAYGYAATSSGDVCGRCYQLQFTGSSYNAPGDPGSAALAGKTMIVQATNIGYDVSGGQFDILVPGGG
VGAFNACSAQWGVSNAELGAQYGGFLAACKQQLGYNASLSQYKSCVLNRCDSVFGSRGLTQLQQGCTWFAEWFEAADNPS
LKYKEVPCPAELTTRSGMNRSILNDIRNTCP
>P0C2S3 3.2.1.4~~~celC~~~Endoglucanase C~~~
MVSFKAGINLGGWISQYQVFSKEHFDTFITEKDIETIAEAGFDHVRLPFDYPIIESDDNVGEYKEDGLSYIDRCLEWCKK
YNLGLVLDMHHAPGYRFQDFKTSTLFEDPNQQKRFVDIWRFLAKRYINEREHIAFELLNEVVEPDSTRWNKLMLEYIKAI
REIDSTMWLYIGGNNYNSPDELKNLADIDDDYIVYNFHFYNPFFFTHQKAHWSESAMAYNRTVKYPGQYEGIEEFVKNNP
KYSFMMELNNLKLNKELLRKDLKPAIEFREKKKCKLYCGEFGVIAIADLESRIKWHEDYISLLEEYDIGGAVWNYKKMDF
EIYNEDRKPVSQELVNILARRKT
>P23658 3.2.1.4~~~ced1~~~Cellodextrinase~~~
MKKVLVNQVGFLCNAPKKAVLNFQANEFSVVDGNGKKAFDGKVEHFGTDEISGEDTYVADFSALTEEGKYKIVADGQESV
LFSISNDAYDKLMKDICKCFYYLRCGDALSKEFAGEYYHKPCHMTKATVYGEDVEPVDVTGGWHDAGDYGRYSTAGAVAV
AHLLYGVRFFKGLLDVHYDIPKVAGDKGNLPEILAEVKVELDFLMKMQRENGSVWHKVTTFNHAPFLMPEDDREELFLFS
VSSLATADIAAVFALAYTVYKEYDAEYADKLMQKSLLAYKWLLDNPDELLFFNPDGSNTGQYDEAEDISNRFWAACALYE
ATSDGKYYSDAQELKNRLEEFDKNAQKKGYQGNVFTCLGWAEVAGLGSLSLLLKREENALCSLARNSFVAEADRLVKVSK
ENGFGLCMGENDFIWGSNMELLKYMMVLSTAIRIDNKPEYKLALEAGLDYILGCNSMDISYVTGNGEKAFKNPHLRPTAV
DDIEEPWPGLVSGGPNSGLHDERAQTLRGKGLPPMKCYIDHIDCYSLNEITIYWNSPLVFALSGILE
>P14090 3.2.1.4~~~cenC~~~Endoglucanase C~~~COG3250
MVSRRSSQARGALTAVVATLALALAGSGTALAASPIGEGTFDDGPEGWVAYGTDGPLDTSTGALCVAVPAGSAQYGVGVV
LNGVAIEEGTTYTLRYTATASTDVTVRALVGQNGAPYGTVLDTSPALTSEPRQVTETFTASATYPATPAADDPEGQIAFQ
LGGFSADAWTFCLDDVALDSEVELLPHTSFAESLGPWSLYGTSEPVFADGRMCVDLPGGQGNPWDAGLVYNGVPVGEGES
YVLSFTASATPDMPVRVLVGEGGGAYRTAFEQGSAPLTGEPATREYAFTSNLTFPPDGDAPGQVAFHLGKAGAYEFCISQ
VSLTTSATPPPGYEPDTGPRVRVNQVGYLPFGPKRATLVTDAAEPVAWELRDADGVVVADGTSEPRGVEPSAAQAVHVLD
FSDVTTQGAGYTLVADGETSRPFDIDGDLYQQLRYDALNYFYLARSGTEIEADVVGEEYAREAGHVGVAPNQGDTDVPCI
GPRDYYDGWTCDYRLDVSGGWYDAGDHGKYVVNGGIAVGQLLQTYERALHAGTADALADGTLDVPEHGNDVPDVLDEARW
ELEWMLSMIVPEGEYAGMVHHKVHDEGWTGLPLLPADDPQARSLHRPSTAATLNLSAVAAQGARLLEPYDPQLAQTLLEA
ARTTWAAAQEHPALYAPGEAGADGGGAYNDSQVADEFYWAAAELYLTTGEDAFATAVTTSPLHTADVFTADGFGWGSVAA
LGRLDLATVPNELPGLDAVQSSVVEGAQEYLAAQAGQGFGSLYSPPGGEYVWGSSSQVANNLVVVATAYDLTGDERFRAA
TLEGLDYLFGRNALNQSYVTGWGEVASHQQHSRWFAHQLDPSLPSPPPGSLAGGPNSQAATWDPTTKAAFPDGCAPSACY
VDEIQAWSTNELTVNWNSALSWVASWVADQGSAEPVPTAPVVTRQPVDATVALGADATFTAEASGVPAPTVRWQVRAGRG
WKDVAGATGTTLTVRATARTDGTRYRAVFTNAAGSVESAVVRLTVERAAPVVTQHPADVRARVGTRAVFRAAADGYPTPC
VVWQVRWGGGSWRPIPWATSTTLSVPVTVLAAGTEYRAVFTNAVGTAATEPAELAVQRPRS
>P27033 3.2.1.4~~~celC~~~Endoglucanase C~~~COG2730
MGHVTSPSKRYPASFKRAGSILGVSIALAAFSNVAAAGCEYVVTNSWGSGFTAAIRITNSTSSVINGWNVSWQYNSNRVT
NLWNANLSGSNPYSASNLSWNGTIQPGQTVEFGFQGVTNSGTVESPTVNGAACTGGTSSSVSSSSVVSSSSSSRSSVSSS
SVVSSSSSVVSSSSSSVVSGGQCNWYGTLYPLCVSTTSGWGYENNRSCISPSTCSAQPAPYGIVGGSSSPSSISSSSVRS
SSSSSVVPPSSSSSSSVPSSSSSSVSSSSVVSSSSSSVSVPGTGVFRVNTQGNLTKDGQLLPARCGNWFGLEGRHEPSND
ADNPSGAPMELYAGNMWWVNNSQGSGRTIQQTMTELKQQGITMLRLPIAPQTLDANDPQGRSPNLKNHQSIRQSNARQAL
EDFIKLADQNDIQIFIDIHSCSNYVGWRAGRLDARPPYVDANRVGYDFTREEYSCSATNNPSSVTRIHAYDKQKWLANLR
EIAGLSAKLGVSNLIGIDVFNEPYDYTWAEWKGMVEEAYQAINEVNPNMLIIVEGISANANTQDGTPDTSVPVPHGSTDL
NPNWGENLYEAGANPPNIPKDRLLFSPHTYGPSVFVQRQFMDPAQTECAGLEGDEAAQARCRIVINPTVLEQGWEEHFGY
LRELGYGILIGEFGGNMDWPGAKSSQADRNAWSHITTNVDQQWQQAAASYFKRKGINACYWSMNPESADTMGWYLTPWDP
VTANDMWGQWTGFDPRKTQLLHNMWGL
>P23340 3.2.1.4~~~~~~Endoglucanase C307~~~
MVSFKAGINLGGWISQYQVFSKEHFDTFITEKDIETIAEAGFDHVRLPFDYPIIESDDNVGEYKEDGLSYIDRCLEWCKK
YNLGLVLDMHHAPGYRFQDFKTSTLFEDPNQQKRFVDIWRFLAKRYINEREHIAFELLNEVVEPDSTRWNKLMLECVKAI
REIDSTRWLYIGGNNYNSPDELKNLADIDDDYIVYNFHFYNPFFFTHQKAHWSESAMAYNRTVKYPGQYEGIEEFVKNNP
KYSFMMELNNLKLNKELLRKDLKPAIEFREKKKCKLYCGEFGVIAIADLESRIKWHEDYISLLEEYDIGGAVWNYKKMDF
EIYNEDRKPVSQELVNILARRKT
>P37699 3.2.1.4~~~celCCC~~~Endoglucanase C~~~COG3405
MIKGSSLKRFKSLVMAAIFSVSIISTAIASSAADQIPFPYDAKYPNGAYSCLADSQSIGNNLVRSEWEQWKSAHITSNGA
RGYKRVQRDASTNYDTVSEGLGYGLLLSVYFGEQQLFDDLYRYVKVFLNSNGLMSWRIDSSGNIMGKDSIGAATDADEDI
AVSLVFAHKKWGTSGGFNYQTEAKNYINNIYNKMVEPGTYVIKAGDTWGGSNVTNPSYFAPAWYRIFADFTGNSGWINVA
NKCYEIADKARNSNTGLVPDWCTANGTPASGQGFDFYYDAIRYQWRAAIDYSWYGTAKAKTHCDAISNFFKNIGYANIKD
GYTISGSQISSNHTATFVSCAAAAAMTGTDTTYAKNIYNECVKVKDSGNYTYFGNTLRMMVLLYTTGNFPNLYTYNSQPK
PDLKGDVNNDGAIDALDIAALKKAILTQTTSNISLTNADMNNDGNIDAIDFAQLKVKLLN
>A3DDN1 3.2.1.4~~~celD~~~Endoglucanase D~~~COG3291
MSRMTLKSSMKKRVLSLLIAVVFLSLTGVFPSGLIETKVSAAKITENYQFDSRIRLNSIGFIPNHSKKATIAANCSTFYV
VKEDGTIVYTGTATSMFDNDTKETVYIADFSSVNEEGTYYLAVPGVGKSVNFKIAMNVYEDAFKTAMLGMYLLRCGTSVS
ATYNGIHYSHGPCHTNDAYLDYINGQHTKKDSTKGWHDAGDYNKYVVNAGITVGSMFLAWEHFKDQLEPVALEIPEKNNS
IPDFLDELKYEIDWILTMQYPDGSGRVAHKVSTRNFGGFIMPENEHDERFFVPWSSAATADFVAMTAMAARIFRPYDPQY
AEKCINAAKVSYEFLKNNPANVFANQSGFSTGEYATVSDADDRLWAAAEMWETLGDEEYLRDFENRAAQFSKKIEADFDW
DNVANLGMFTYLLSERPGKNPALVQSIKDSLLSTADSIVRTSQNHGYGRTLGTTYYWGCNGTVVRQTMILQVANKISPNN
DYVNAALDAISHVFGRNYYNRSYVTGLGINPPMNPHDRRSGADGIWEPWPGYLVGGGWPGPKDWVDIQDSYQTNEIAINW
NAALIYALAGFVNYNSAQNEVLYGDVNDDGKVNSTDLTLLKRYVLKAVSTLPSSKAEKNADVNRDGRVNSSDVTILSRYL
IRVIEKLPI
>P28623 3.2.1.4~~~engD~~~Endoglucanase D~~~COG2730
MIKHLLSRGKLLLFVSVMATSSIIAGGNAYGSTAFTGVRDVPAQQIVNEMKVGWNLGNTMDAIGGETNWGNPMTTHAMIN
KIKEAGFNTLRLPVTWDGHMGAAPEYTIDQTWMKRVEEIANYAFDNDMYVIINLHHENEWLKPFYANEAQVKAQLTKVWT
QIANNFKKYGDHLIFETMNEPRPVGASNEWTGGSYENREVVNRYNLTAVNAIRATGGNNATRYIMVPTLAASAMSTTIND
LVIPNNDSKVIVSLHMYSPYFFAMDINGTSSWGSDYDKSSLDSEFDAVYNKFVKNGRAVVIGEMGSINKNNTAARVTHAE
YYAKSAKARGLTPIWWDNGYSVAGKAETFGIFNRSNLTWDAPEVMKAFIKGIGGSSTTTPTTPTTPTTPTTPTTPTTPTT
PTTPTTPQSAVEVTYAITNSWGSGASVNVTIKNNGTTPINGWTLKWTMPINQTITNMWSASFVASGTTLSVTNAGYNGTI
AANGGTQSFGFNINYSGVLSKPTGFTVNGTECTVK
>P37698 3.2.1.4~~~celCCF~~~Endoglucanase F~~~COG5297
MSKNFKRVGAVAVAAAMSLSIMATTSINAASSPANKVYQDRFESMYSKIKDPANGYFSEQGIPYHSIETLMVEAPDYGHV
TTSEAMSYYMWLEAMHGRFSGDFTGFDKSWSVTEQYLIPTEKDQPNTSMSRYDANKPATYAPEFQDPSKYPSPLDTSQPV
GRDPINSQLTSAYGTSMLYGMHWILDVDNWYGFGARADGTSKPSYINTFQRGEQESTWETIPQPCWDEHKFGGQYGFLDL
FTKDTGTPAKQFKYTNAPDADARAVQATYWADQWAKEQGKSVSTSVGKATKMGDYLRYSFFDKYFRKIGQPSQAGTGYDA
AHYLLSWYYAWGGGIDSTWSWIIGSSHNHFGYQNPFAAWVLSTDANFKPKSSNGASDWAKSLDRQLEFYQWLQSAEGAIA
GGATNSWNGRYEAVPSGTSTFYGMGYVENPVYADPGSNTWFGMQVWSMQRVAELYYKTGDARAKKLLDKWAKWINGEIKF
NADGTFQIPSTIDWEGQPDTWNPTQGYTGNANLHVKVVNYGTDLGCASSLANTLTYYAAKSGDETSRQNAQKLLDAMWNN
YSDSKGISTVEQRGDYHRFLDQEVFVPAGWTGKMPNGDVIKSGVKFIDIRSKYKQDPEWQTMVAALQAGQVPTQRLHRFW
AQSEFAVANGVYAILFPDQGPEKLLGDVNGDETVDAIDLAILKKYLLNSSTTINTANADMNSDNAIDAIDYALLKKALLS
IQ
>P37700 3.2.1.4~~~celCCG~~~Endoglucanase G~~~COG4733
MLKTKRKLTKAIGVALSISILSSLVSFIPQTNTYAAGTYNYGEALQKSIMFYEFQRSGDLPADKRDNWRDDSGMKDGSDV
GVDLTGGWYDAGDHVKFNLPMSYTSAMLAWSLYEDKDAYDKSGQTKYIMDGIKWANDYFIKCNPTPGVYYYQVGDGGKDH
SWWGPAEVMQMERPSFKVDASKPGSAVCASTAASLASAAVVFKSSDPTYAEKCISHAKNLFDMADKAKSDAGYTAASGYY
SSSSFYDDLSWAAVWLYLATNDSTYLDKAESYVPNWGKEQQTDIIAYKWGQCWDDVHYGAELLLAKLTNKQLYKDSIEMN
LDFWTTGVNGTRVSYTPKGLAWLFQWGSLRHATTQAFLAGVYAEWEGCTPSKVSVYKDFLKSQIDYALGSTGRSFVVGYG
VNPPQHPHHRTAHGSWTDQMTSPTYHRHTIYGALVGGPDNADGYTDEINNYVNNEIACDYNAGFTGALAKMYKHSGGDPI
PNFKAIEKITNDEVIIKAGLNSTGPNYTEIKAVVYNQTGWPARVTDKISFKYFMDLSEIVAAGIDPLSLVTSSNYSEGKN
TKVSGVLPWDVSNNVYYVNVDLTGENIYPGGQSACRREVQFRIAAPQGTTYWNPKNDFSYDGLPTTSTVNTVTNIPVYDN
GVKVFGNEPAGGSENPDPEILYGDVNSDKNVDALDFAALKKYLLGGTSSIDVKAADTYKDGNIDAIDMATLKKYLLGTIT
QLPQG
>P16218 3.2.1.4~~~celH~~~Endoglucanase H~~~COG2730
MKKRLLVSFLVLSIIVGLLSFQSLGNYNSGLKIGAWVGTQPSESAIKSFQELQGRKLDIVHQFINWSTDFSWVRPYADAV
YNNGSILMITWEPWEYNTVDIKNGKADAYITRMAQDMKAYGKEIWLRPLHEANGDWYPWAIGYSSRVNTNETYIAAFRHI
VDIFRANGATNVKWVFNVNCDNVGNGTSYLGHYPGDNYVDYTSIDGYNWGTTQSWGSQWQSFDQVFSRAYQALASINKPI
IIAEFASAEIGGNKARWITEAYNSIRTSYNKVIAAVWFHENKETDWRINSSPEALAAYREAIGAGSSNPTPTPTWTSTPP
SSSPKAVDPFEMVRKMGMGTNLGNTLEAPYEGSWSKSAMEYYFDDFKAAGYKNVRIPVRWDNHTMRTYPYTIDKAFLDRV
EQVVDWSLSRGFVTIINSHHDDWIKEDYNGNIERFEKIWEQIAERFKNKSENLLFEIMNEPFGNITDEQIDDMNSRILKI
IRKTNPTRIVIIGGGYWNSYNTLVNIKIPDDPYLIGTFHYYDPYEFTHKWRGTWGTQEDMDTVVRVFDFVKSWSDRNNIP
VYFGEFAVMAYADRTSRVKWYDFISDAALERGFACSVWDNGVFGSLDNDMAIYNRDTRTFDTEILNALFNPGTYPSYSPK
PSPTPRPTKPPVTPAVGEKMLDDFEGVLNWGSYSGEGAKVSTKIVSGKTGNGMEVSYTGTTDGYWGTVYSLPDGDWSKWL
KISFDIKSVDGSANEIRFMIAEKSINGVGDGEHWVYSITPDSSWKTIEIPFSSFRRRLDYQPPGQDMSGTLDLDNIDSIH
FMYANNKSGKFVVDNIKLIGATSDPTPSIKHGDLNFDNAVNSTDLLMLKRYILKSLELGTSEQEEKFKKAADLNRDNKVD
STDLTILKRYLLKAISEIPI
>Q02934 3.2.1.4~~~celI~~~Endoglucanase 1~~~COG4447
MRLVNSLGRRKILLILAVIVAFSTVLLFAKLWGRKTSSTLDEVGSKTHGDLTAENKNGGYLPEEEIPDQPPATGAFNYGE
ALQKAIFFYECQRSGKLDPSTLRLNWRGDSGLDDGKDAGIDLTGGWYDAGDHVKFNLPMSYSAAMLGWAVYEYEDAFKQS
GQYNHILNNIKWACDYFIKCHPEKDVYYYQVGDGHADHAWWGPAEVMPMERPSYKVDRSSPGSTVVAETSAALAIASIIF
KKVDGEYSKECLKHAKELFEFADTTKSDDGYTAANGFYNSWSGFYDELSWAAVWLYLATNDSSYLDKAESYSDKWGYEPQ
TNIPKYKWAQCWDDVTYGTYLLLARIKNDNGKYKEAIERHLDWWTTGYNGERITYTPKGLAWLDQWGSLRYATTTAFLAC
VYSDWENGDKEKAKTYLEFARSQADYALGSTGRSFVVGFGENPPKRPHHRTAHGSWADSQMEPPEHRHVLYGALVGGPDS
TDNYTDDISNYTCNEVACDYNAGFVGLLAKMYKLYGGSPDPKFNGIEEVPEDEIFVEAGVNASGNNFIEIKAIVNNKSGW
PARVCENLSFRYFINIEEIVNAGKSASDLQVSSSYNQGAKLSDVKHYKDNIYYVEVDLSGTKIYPGGQSAYKKEVQFRIS
APEGTVFNPENDYSYQGLSAGTVVKSEYIPVYDAGVLVFGREPGSASKSTSKDNGLSKATPTVKTESQPTAKHTQNPASD
FKTPANQNSVKKDQGIKGEVVLQYANGNAGATSNSINPRFKIINNGTKAINLSDVKIRYYYTKEGGASQNFWCDWSSAGN
SNVTGNFFNLSSPKEGADTCLEVGFGSGAGTLDPGGSVEVQIRFSKEDWSNYNQSNDYSFNPSASDYTDWNRVTLYISNK
LVYGKEP
>A3DH67 3.2.1.176~~~celS~~~Cellulose 1,4-beta-cellobiosidase (reducing end) CelS~~~COG5297
MVKSRKISILLAVAMLVSIMIPTTAFAGPTKAPTKDGTSYKDLFLELYGKIKDPKNGYFSPDEGIPYHSIETLIVEAPDY
GHVTTSEAFSYYVWLEAMYGNLTGNWSGVETAWKVMEDWIIPDSTEQPGMSSYNPNSPATYADEYEDPSYYPSELKFDTV
RVGSDPVHNDLVSAYGPNMYLMHWLMDVDNWYGFGTGTRATFINTFQRGEQESTWETIPHPSIEEFKYGGPNGFLDLFTK
DRSYAKQWRYTNAPDAEGRAIQAVYWANKWAKEQGKGSAVASVVSKAAKMGDFLRNDMFDKYFMKIGAQDKTPATGYDSA
HYLMAWYTAWGGGIGASWAWKIGCSHAHFGYQNPFQGWVSATQSDFAPKSSNGKRDWTTSYKRQLEFYQWLQSAEGGIAG
GATNSWNGRYEKYPAGTSTFYGMAYVPHPVYADPGSNQWFGFQAWSMQRVMEYYLETGDSSVKNLIKKWVDWVMSEIKLY
DDGTFAIPSDLEWSGQPDTWTGTYTGNPNLHVRVTSYGTDLGVAGSLANALATYAAATERWEGKLDTKARDMAAELVNRA
WYNFYCSEGKGVVTEEARADYKRFFEQEVYVPAGWSGTMPNGDKIQPGIKFIDIRTKYRQDPYYDIVYQAYLRGEAPVLN
YHRFWHEVDLAVAMGVLATYFPDMTYKVPGTPSTKLYGDVNDDGKVNSTDAVALKRYVLRSGISINTDNADLNEDGRVNS
TDLGILKRYILKEIDTLPYKN
>P0C2S5 3.2.1.176~~~celS~~~Cellulose 1,4-beta-cellobiosidase (reducing end) CelS~~~
MVKSRKISILLAVAMLVSIMIPTTAFAGPTKAPTKDGTSYKDLFLELYGKIKDPKNGYFSPDEGIPYHSIETLIVEAPDY
GHVTTSEAFSYYVWLEAMYGNLTGNWSGVETAWKVMEDWIIPDSTEQPGMSSYNPNSPATYADEYEDPSYYPSELKFDTV
RVGSDPVHNDLVSAYGPNMYLMHWLMDVDNWYGFGTGTRATFINTFQRGEQESTWETIPHPSIEEFKYGGPNGFLDLFTK
DRSYAKQWRYTNAPDAEGRAIQAVYWANKWAKEQGKGSAVASVVSKAAKMGDFLRNDMFDKYFMKIGAQDKTPATGYDSA
HYLMAWYTAWGGGIGASWAWKIGCSHAHFGYQNPFQGWVSATQSDFAPKSSNGKRDWTTSYKRQLEFYQWLQSAEGGIAG
GATNSWNGRYEKYPAGTSTFYGMAYVPHPVYADPGSNQWFGFQAWSMQRVMEYYLETGDSSVKNLIKKWVDWVMSEIKLY
DDGTFAIPSDLEWSGQPDTWTGTYTGNPNLHVRVTSYGTDLGVAGSLANALATYAAATERWEGKLDTKARDMAAELVNRA
WYNFYCSEGKGVVTEEARADYKRFFEQEVYVPAGWSGTMPNGDKIQPGIKFIDIRTKYRQDPYYDIVYQAYLRGEAPVLN
YHRFWHEVDLAVAMGVLATYFPDMTYKVPGTPSTKLYGDVNDDGKVNSTDAVALKRYVLRSGISINTDNADLNEDGRVNS
TDLGILKRYILKEIDTLPYKN
>P16630 3.2.1.4~~~celS~~~Endoglucanase S~~~COG5297
MQTVNTQPHRIFRVLLPAVFSSLLLSSLTVSAASSSNDADKLYFGNNKYYLFNNVWGKDEIKGWQQTIFYNSPISMGWNW
HWPSSTHSVKAYPSLVSGWHWTAGYTENSGLPIQLSSNKSITSNVTYSIKATGTYNAAYDIWFHTTDKANWDSSPTDELM
IWLNDTNAGPAGDYIETVFLGDSSWNVFKGWINADNGGGWNVFSFVHTSGTNSASLNIRHFTDYLVQTKQWMSDEKYISS
VEFGTEIFGGDGQIDITEWRVDVK
>Q47096 3.2.1.4~~~celV~~~Endoglucanase 5~~~
MWMRRNQIVRKLTLGVVTTVLGMSLSFSALSATPVETHGQLSIENGRLVDEQGKRVQLRGISSHGLQWFGDYVNKDSMKW
LRDDWGINVFRVAMYTAADGYISNPSLANKVKEAVAAAQSLGVYIIIDWHILSDNDPNIYKAQAKTFFAEMAGLYGSSPN
VIYEIANEPNGGVTWNGQIRPYALEVTDTIRSKDPDNLIIVGTGTWSQDIHDAADNQLPDPNTMYALHFYAGTHGQFLRD
RIDYAQSRGAAIFVSEWGTSDASGNGGPFLPESQTWIDFLNNRGVSWVNWSLTDKSEASAALAPGASKSGGWTEQNLSTS
GKFVREQIRAGANLGGGDTPTTPTEPTNPGNGTTGDVVLQYRNVDNNPSDDAIRMAVNIKNTGSTPIKLSDLQVRYYFHD
DGKPGANLFVDWANVGPNNIVTSTGTPAASTDKANRYVLVTFSSGAGSLQPGAETGEVQVRIHAGDWSNVNETNDYSYGA
NVTSYANWDKITVHDKGTLVWGVEP
>P27032 3.2.1.4~~~celY~~~Minor endoglucanase Y~~~COG3405
MGKPMWRCWALMLMVWFSASATAANGWEIYKSRFMTTDGRIQDTGNKNVSHTEGQGFAMLMAVHYDDRIAFDNLWNWTQS
HLRNTTSGLFYWRYDPSAANPVVDKNNASDGDVLIAWALLKAGNKWQDNRYLQASDSIQKAIIASNIIQFAGRTVMLPGA
YGFNKNSYVILNPSYFLFPAWRDFANRSHLQVWRQLIDDSLSLVGEMRFGQVGLPTDWAALNADGSMAPATAWPSRFSYD
AIRIPLYLYWYDAKTTALVPFQLYWRNYPRLTTPAWVDVLSSNTATYNMQGGLLAVRDLTMGNLDGLSDLPGASEDYYSS
SLRLLVMLARGK
>P07103 3.2.1.4~~~celZ~~~Endoglucanase Z~~~COG2730
MPLSYLDKNPVIDSKKHALRKKLFLSCAYFGLSLACLSSNAWASVEPLSVNGNKIYAGEKAKSFAGNSLFWSNNGWGGEK
FYTADTVASLKKDWKSSIVRAAMGVQESGGYLQDPAGNKAKVERVVDAAIANDMYAIIGWHSHSAENNRSEAIRFFQEMA
RKYGNKPNVIYEIYNEPLQVSWSNTIKPYAEAVISAIRAIDPDNLIIVGTPSWSQNVDEASRDPINAKNIAYTLHFYAGT
HGESLRNKARQALNNGIALFVTEWGTVNADGNGGVNQTETDAWVTFMRDNNISNANWALNDKNEGASTYYPDSKNLTESG
KKVKSIIQSWPYKAGSAASATTDPSTDTTTDTTVDEPTTTDTPATADCANANVYPNWVSKDWAGGQPTHNEAGQSIVYKG
NLYTANWYTASVPGSDSSWTQVGSCN
>P23659 3.2.1.4~~~celZ~~~Endoglucanase Z~~~
MRKFYSFAIIISLLVTGLFIHTPKAEAAGYNYGEALQKAIMFYEFQRSGKLPENKRDNWRGDSGLNDGADVGLDLTGGWY
DAGDHVKFNLPMAYSQTMLAWAAYEAEEALERSGQMGYLLDAIKWVSDYLIKCHPSPNVFYYQVGDGHLDHSWWGPAEVM
QMDRPAYKVDLANPGSTVVAEAAAALASAAVVFADRDPAYAATCIQHAKELYNFAEITKSDSGYTAASGFYDSHSGFYDE
LSWAGVWLYLATGDETYLNKAEQYVAYWGTEPQTNIISYKWAHCWDDVHYGACLLLAKITGKQIYKEAIERHLDYWSVGY
NGERVHYTPKGLAWLDSWGSLRYATTTAFLASVYADWEGCSREKAAIYNDFAKQQIDYALGSSGRSYVVGFGVNPPKRPH
HRTAHSSWADSMSVPDYHRHVLIGALVGGPGKDDSYTDDINNYINNEVACDYNAGFVGALAKMYEDYGGSPIPDLNAFEE
ITNDEFFVMAGINASGQNFIEIKALLHNQSGWPARVADKLSFRYFVDLTELIEAGYSASDVTITTNYNAGAKVTGLHPWN
EAENIYYVNVDFTGTKIYPGGQSAYRKEVQFRIAAPQNTNFWNNDNDYSFRDIKGVTSGNTVKTVYIPVYDDGVLVFGVE
PEGGSGENNSSISITNATFDKNPAKQENIQVVMNLNGNTLNGIKYGNTYLREGTDYTVSGDTVTILKSFLNSFDTSTVQL
IFDFSAGRDPVLTVNIIDTTTSASIVPTTADFDKNPDASRDVKVKLVPNGNTLLAVKKDGEALVLGRDYSIDGDEVTIFR
EYLADQPVGRVTLTFDFDRGTDPVLTINITDSRQVETGVIQIQMFNGNTSDKTNGIMPRYRLTNTGTTPIRLSDVKIRYY
YTIDGEKDQNFWCDWSSVGSNNITGTFVKMAEPKEGADYYLETGFTDGAGYLQPNQSIEVQNRFSKADWTDYIQTNDYSF
STNTSYGSNDRITVYISGVLVSGIEP
>P19424 3.2.1.4~~~~~~Endoglucanase~~~
MKIKQIKQSLSLLLIITLIMSLFVPMASANTNESKSNAFPFSDVKKTSWSFPYIKDLYEQEVITGTSATTFSPTDSVTRA
QFTVMLTRGLGLEASSKDYPFKDRKNWAYKEIQAAYEAGIVTGKTNGEFAPNENITREQMAAMAVRAYEYLENELSLPEE
QREYNDSSSISTFAQDAVQKAYVLELMEGNTDGYFQPKRNSTREQSAKVISTLLWKVASHDYLYHTEAVKSPSEAGALQL
VELNGQLTLAGEDGTPVQLRGMSTHGLQWFGEIVNENAFVALSNDWGSNMIRLAMYIGENGYATNPEVKDLVYEGIELAF
EHDMYVIVDWHVHAPGDPRADVYSGAYDFFEEIADHYKDHPKNHYIIWELANEPSPNNNGGPGLTNDEKGWEAVKEYAEP
IVEMLREKGDNMILVGNPNWSQRPDLSADNPIDAENIMYSVHFYTGSHGASHIGYPEGTPSSERSNVMANVRYALDNGVA
VFATEWGTSQANGDGGPYFDEADVWLNFLNKHNISWANWSLTNKNEISGAFTPFELGRTDATDLDPGANQVWAPEELSLS
GEYVRARIKGIEYTPIDRTKFTKLVWDFNDGTTQGFQVNGDSPNKESITLSNNNDALQIEGLNVSNDISEGNYWDNVRLS
ADGWSENVDILGATELTIDVIVEEPTTVSIAAIPQGPAAGWANPTRAIKVTEDDFESFGDGYKALVTITSEDSPSLETIA
TSPEDNTMSNIILFVGTEDADVISLDNITVSGTEIEIEVIHDEKGTATLPSTFEDGTRQGWDWHTESGVKTALTIEEANG
SNALSWEYAYPEVKPSDGWATAPRLDFWKDELVRGTSDYISFDFYIDAVRASEGAISINAVFQPPANGYWQEVPTTFEID
LTELDSATVTSDELYHYEVKINIRDIEAITDDTELRNLLLIFADEDSDFAGRVFVDNVRFE
>P29019 3.2.1.4~~~~~~Endoglucanase~~~
MVEKRKIFTVLCACGIGFTSYTSCISAAAIDNDTLINNGHKINSSIITNSSQVSAVAKEMKPFPQQVNYSGILKPNHVSQ
ESLNNAVKNYYNDWKKKYLKNDLSSLPGGYYVKGEITGNPDGFRPLGTSEGQGYGMIITVLMAGHDSNAQTIYDGLFKTA
RAFKSSINPNLMGWVVADDKKAQGHFDSATDGDLDIAYSLLLAHKQWGSSGKINYLKEAQNMITKGIKASNVTKNNGLNL
GDWGDKSTFDTRPSDWMMSHLRAFYEFTGDKTWLNVIDNLYNTYTNFTNKYSPKTGLISDFVVKNPPQPAPKDFLDESKY
TDSYYYNASRVPLRIVMDYAMYGEKRGKVISDKVATWIKSKTKGNPSKIVDGYKLDGTNIGDYPTAVYVSPFIAAGTTNS
KNQEWVNSGWDWMKNKKESYFSDSYNLLTMLFLTGNWWKPIPDEKKIQSPINLEVQSELKEQD
>P18336 3.2.1.4~~~~~~Endoglucanase~~~
MPLRALVAVIVTTAVMLVPRAWAQTAWERYKARFMMPDARIIDTANGNVSHTEGQGFAMLLAVANNDRPAFDKLWQWTDS
TLRDKSNGLFYWRYNPVAPDPIADKNNATDGDTLIAWALLRAQKQWQDKRYATASDAITASLLKYTVVTFAGRQVMLPGV
KGFNRNDHLNLNPSYFIFPAWRAFAERTHLTAWRTLQSDGQALLGQMGWGKSHLPSDWVALRADGKMLPAKEWPPRMSFD
AIRIPLYISWVDPHSALLAPWKAWMQSYPRLQTPAWINVSTNEVAPWNMAGGLLAVRDLTLGEPLERRRLTTRMIITPPA
SSCWSGWRNRISASAVMALQVSQPVCLRAERKEQERLTM
>P37651 3.2.1.4~~~bcsZ~~~Endoglucanase~~~COG3405
MNVLRSGIVTMLLLAAFSVQAACTWPAWEQFKKDYISQEGRVIDPSDARKITTSEGQSYGMFSALAANDRAAFDNILDWT
QNNLAQGSLKERLPAWLWGKKENSKWEVLDSNSASDGDVWMAWSLLEAGRLWKEQRYTDIGSALLKRIAREEVVTVPGLG
SMLLPGKVGFAEDNSWRFNPSYLPPTLAQYFTRFGAPWTTLRETNQRLLLETAPKGFSPDWVRYEKDKGWQLKAEKTLIS
SYDAIRVYMWVGMMPDSDPQKARMLNRFKPMATFTEKNGYPPEKVDVATGKAQGKGPVGFSAAMLPFLQNRDAQAVQRQR
VADNFPGSDAYYNYVLTLFGQGWDQHRFRFSTKGELLPDWGQECANSH
>P06564 3.2.1.4~~~~~~Endoglucanase~~~COG2730
MMLRKKTKQLISSILILVLLLSLFPTALAAEGNTREDNFKHLLGNDNVKRPSEAGALQLQEVDGQMTLVDQHGEKIQLRG
MSTHGLQWFPEILNDNAYKALANDWESNMIRLAMYVGENGYASNPELIKSRVIKGIDLAIENDMYVIVDWHVHAPGDPRD
PVYAGAEDFFRDIAALYPNNPHIIYELANEPSSNNNGGAGIPNNEEGWNAVKEYADPIVEMLRDSGNADDNIIIVGSPNW
SQRPDLAADNPIDDHHTMYTVHFYTGSHAASTESYPPETPNSERGNVMSNTRYALENGVAVFATEWGTSQANGDGGPYFD
EADVWIEFLNENNISWANWSLTNKNEVSGAFTPFELGKSNATSLDPGPDQVWVPEELSLSGEYVRARIKGVNYEPIDRTK
YTKVLWDFNDGTKQGFGVNGDSPVEDVVIENEAGALKLSGLDASNDVSEGNYWANARLSADGWGKSVDILGAEKLTMDVI
VDEPTTVSIAAIPQGPSANWVNPNRAIKVEPTNFVPLEDKFKAELTITSADSPSLEAIAMHAENNNINNIILFVGTEGAD
VIYLDNIKVIGTEVEIPVVHDPKGEAVLPSVFEDGTRQGWDWAGESGVKTALTIEEANGSNALSWEFGYPEVKPSDNWAT
APRLDFWKSDLVRGENDYVTFDFYLDPVRATEGAMNINLVFQPPTNGYWVQAPKTYTINFDELEEPNQVNGLYHYEVKIN
VRDITNIQDDTLLRNMMIIFADVESDFAGRVFVDNVRFEGAATTEPVEPEPVDPGEETPPVDEKEAKTEQKEAEKEEKEE
>A0A023VXA2 3.2.1.4~~~cel5A~~~Endoglucanase~~~
MPRMLAASAAIIATTLAPLSAQAAGCEMTLHGINLSGAEFGQPGDPYGQGYIYPSESTIKAFADDGFNAVRLPFLWERLQ
PTLNGTLDATELSRIKDTVETLRDNGMVVILDVHNYARYHGEMIGTPNVPVAAFADFWKRLSAVFANDDDVIFGLMNEPH
DISAPAWLAAANAAIDAIRTIGAGNLVLVPGTAWTGAHSWSQTFYGPSNASVMAQVVDSSNNFAYEVHQYTDDDFSGKNA
DCSKIDDAVSALNDFTSWLNANDVQGFLGEFGTTEQIQCLRGLKQMVDVVQQNPRAWLGWAYWAGGDWWPKDSPMIIHSN
PRDGGTQQLRTLQPVLGSNTTRASCNTKS
>P17974 3.2.1.4~~~egl~~~Endoglucanase~~~
MRRCMPLVAASVAALMLAGCGGGDGDPSLSTASVSATDTTTLKPAATSTTSSVWLTLAKDSAAFTVSGTRTVRYGAGSAW
VEKSVSGSGRCTSTFFGKDPAAGVAKVCQLLQGTGTLLWRGVSLAGAEFGEGSLPGTYGSNYIYPSADSVTYYKNKGMNL
VRLPFRWERLQPTLNQVFDANELSRLTGFVNAVTATGQTVLLDPHNYARYYGNVIGSSAVPNSAYADFWRRLATQFKSNP
RVILGLMNEPNSMPTEQWLSGANAELAAIRSANASNVVFVPGNAWTGAHSWNQNWYGTPNGTVMKGINDPGHNLVFEVHQ
YLDGDSSGQSANCVSATIGAQRLQDFTTWLRSNGYRGFLGEFGAASNDTCNQAVSNMLTFVKNNADVWTGWAWWAGGPWW
GGYMYSIEPSNGVDKPQMSVLAPYLK
>P17115 5.3.1.13~~~gutQ~~~Arabinose 5-phosphate isomerase GutQ~~~COG0517
MSEALLNAGRQTLMLELQEASRLPERLGDDFVRAANIILHCEGKVVVSGIGKSGHIGKKIAATLASTGTPAFFVHPAEAL
HGDLGMIESRDVMLFISYSGGAKELDLIIPRLEDKSIALLAMTGKPTSPLGLAAKAVLDISVEREACPMHLAPTSSTVNT
LMMGDALAMAVMQARGFNEEDFARSHPAGALGARLLNKVHHLMRRDDAIPQVALTASVMDAMLELSRTGLGLVAVCDAQQ
QVQGVFTDGDLRRWLVGGGALTTPVNEAMTVGGTTLQSQSRAIDAKEILMKRKITAAPVVDENGKLTGAINLQDFYQAGI
I
>P50900 3.2.1.91~~~celY~~~Exoglucanase-2~~~COG4447
MKRRLMKGISLLTLVFLIGIMLQLSLKSELTAYASSDDPYKQRFLELWEELHDPSNGYFSSHGIPYHAVETLIVEAPDYG
HLTTSEAMSYYLWLEALYGKFTGDFSYFMKAWETIEKYMIPTEQDQPNRSMAGYNPAKPATYAPEWEEPSMYPSQLDFSA
PVGIDPIYNELVSTYGTNTIYGMHWLLDVDNWYGFGRRADRISSPAYINTFQRGSQESVWETIPQPCWDDLTIGGRNGFL
DLFVGDSQYSAQFKYTNAPDADARAIQATYWANQWAKEHGVNLSQYVKKASRMGDYLRYAMFDKYFRKIGDSKQAGTGYD
AAHYLLSWYYAWGGGITADWAWIIGCSHVHAGYQNPMTAWILANDPEFKPESPNGANDWAKSLERQLEFYQWLQSAEGAI
AGGATNSYKGRYETLPAGISTFYGMAYEEHPVYLDPGSNTWFGFQAWTMQRVAEYYYLTGDTRAEQLLDKWVDWIKSVVR
LNSDGTFEIPGNLEWSGQPDTWTGTYTGNPNLHVSVVSYGTDLGAAGSLANALLYYAKTSGDDEARNLAKELLDRMWNLY
RDDKGLSAPETREDYVRFFEQEVYVPQGWSGTMPNGDRIEPGVTFLDIRSKYLNDPDYPKLQQAYNEGKAPVFNYHRFWA
QCDIAIANGLYSILFGSEQANDSFITPTSATFDKNNQEDISVTVTYNGNTLLGIKSGSSYLIEGVDYIVNGDVIIIKKEF
LAGQATGSISLLFDFSAGLDRTLTIDIIDTGGGEEPVEPVEPVEGVLIIQSFNANTQEISNSIMPRFRIYNSGNTSIPLS
EVKLRYYYTVDGDKPQNFWCDWASIGSSNVTGTFVKMDGATTGADYYLEIGFTPQAGTLEPGASIEVQGRFSKIDWTDYT
QTNDYSFNPTASSYVDFNKITAYISGNLVYGIEP
>P50401 3.2.1.91~~~cbhA~~~Exoglucanase A~~~COG5297
MSTLGKRAGVRRRVRAVATAATATALVAVPLTTLATSASAAPVHVDNPYAGAVQYVNPTWAASVNAAAGRQSADPALAAK
MRTVAGQPTAVWMDRISAITGNADGNGLKFHLDNAVAQQKAAGVPLVFNLVIYDLPGRDCFALASNGELPATDAGLARYK
SEYIDPIADLLDNPEYESIRIAATIEPDSLPNLTTNISEPACQQAAPYYRQGVKYALDKLHAIPNVYNYIDIGHSGWLGW
DSNAGPSATLFAEVAKSTTAGFASIDGFVSDVANTTPLEEPLLSDSSLTINNTPIRSSKFYEWNFDFDEIDYTAHMHRLL
VAAGFPSSIGMLVDTSRNGWGGPNRPTSITASTDVNAYVDANRVDRRVHRGAWCNPLGAGIGRFPEATPSGYAASHLDAF
VWIKPPGESDGASTDIPNDQGKRFDRMCDPTFVSPKLNNQLTGATPNAPLAGQWFEEQFVTLVKNAYPVIGGTTPVEDLV
APTVPTGLTAGTTTATSVPLSWTASTDNVAVTGYDVYRGTTLVGTTAATSYTVTGLTPATAYSFTVRAKDAAGNVSAASA
AAAATTQSGTVTDTTAPSVPAGLTAGTTTTTTVPLSWTASTDNAGGSGVAGYEVLRGTTVVGTTTATSYTVTGLTAGTTY
SFSVRAKDVAGNTSAASAAVSATTQTGTVVDTTAPSVPTGLTAGTTTTSSVPLTWTASTDNAGGSGVAGYEVFNGTTRVA
TVTSTSYTVTGLAADTAYSFTVKAKDVAGNVSAASAAVSARTQAATSGGCTVKYSASSWNTGFTGTVEVKNNGTAALNGW
TLGFSFADGQKVSQGWSAEWSQSGTAVTAKNAPWNGTLAAGSSVSIGFNGTHNGTNTAPTAFTLNGVACTLG
>P50899 3.2.1.91~~~cbhB~~~Exoglucanase B~~~COG4733
MSSTTRRRSAWVAAATVGVSSFLAVAGITPAIAAAGAGQPATVTVPAASPVRAAVDGEYAQRFLAQYDKIKDPANGYFSA
QGIPYHAVETLMVEAPDYGHETTSEAYSYWLWLEALYGQVTQDWAPLNHAWDTMEKYMIPQSVDQPTNSFYNPNSPATYA
PEFNHPSSYPSQLNSGISGGTDPIGAELKATYGNADVYQMHWLADVDNIYGFGATPGAGCTLGPTATGTSFINTFQRGPQ
ESVWETVPQPSCEEFKYGGKNGYLDLFTKDASYAKQWKYTSASDADARAVEAVYWANQWATEQGKAADVAATVAKAAKMG
DYLRYTLFDKYFKKIGCTSPTCAAGQGREAAHYLLSWYMAWGGATDTSSGWAWRIGSSHAHFGYQNPLAAWALSTDPKLT
PKSPTAKADWAASMQRQLEFYTWLQASNGGIAGGATNSWDGAYAQPPAGTPTFYGMGYTEAPVYVDPPSNRWFGMQAWGV
QRVAELYYASGNAQAKKILDKWVPWVVANISTDGASWKVPSELKWTGKPDTWNAAAPTGNPGLTVEVTSYGQDVGVAADT
ARALLFYAAKSGDTASRDKAKALLDAIWANNQDPLGVSAVETRGDYKRFDDTYVANGDGIYIPSGWTGTMPNGDVIKPGV
SFLDIRSFYKKDPNWSKVQTFLDGGAEPQFRYHRFWAQTAVAGALADYARLFDDGTTTPDTTAPTVPTGLQAGVVTSTEA
TISWTASTDDTRVTGYDVYRGATKVGTATTTSFTDTGLTASTAYAYTVRAFDAAGNVSAPSAALTVTTKATPSDTTAPSV
PAITSSSSTANSVTIGWSASTDNAGGSGLAGYDVYRGATRVAQTTALTFTDTGLTASTAYEYTVRARDVAGNVSAPSTAV
SVTTKSDTTPDTTAPSVPAGLAAMTVTETSVALTWNASTDTGGSGLKGYDVYRGATRVGSTTTASYTDTGLTAATAYQYT
VRATDNAGNVSAASAALSVTTKTPQTGGSCSVAYNASSWNSGFTASVRITNTGTTTINGWSLGFDLTAGQKVQQGWSATW
TQSGSTVTATNAPWNGTLAPGQTVDVGFNGSHTGQNPNPASFTLNGASCT
>P07986 ~~~cex~~~Exoglucanase/xylanase~~~
MPRTTPAPGHPARGARTALRTTRRRAATLVVGATVVLPAQAATTLKEAADGAGRDFGFALDPNRLSEAQYKAIADSEFNL
VVAENAMKWDATEPSQNSFSFGAGDRVASYAADTGKELYGHTLVWHSQLPDWAKNLNGSAFESAMVNHVTKVADHFEGKV
ASWDVVNEAFADGDGPPQDSAFQQKLGNGYIETAFRAARAADPTAKLCINDYNVEGINAKSNSLYDLVKDFKARGVPLDC
VGFQSHLIVGQVPGDFRQNLQRFADLGVDVRITELDIRMRTPSDATKLATQAADYKKVVQACMQVTRCQGVTVWGITDKY
SWVPDVFPGEGAALVWDASYAKKPAYAAVMEAFGASPTPTPTTPTPTPTTPTPTPTSGPAGCQVLWGVNQWNTGFTANVT
VKNTSSAPVDGWTLTFSFPSGQQVTQAWSSTVTQSGSAVTVRNAPWNGSIPAGGTAQFGFNGSHTGTNAAPTAFSLNGTP
CTVG
>P81002 ~~~~~~17 kDa gas vesicle protein~~~
MVSLREQWNEQARERQREISARKTETVALLQEANQERVRVAQQQKALAIELKNQLAQFHEQLETSVGNWRQETQEQLINL
EETRTANAQQQREALFNFRRQLTADVWGETESDSLKVEENTPFVA
>P81003 ~~~~~~35 kDa gas vesicle protein~~~
MVSLREQWNEQARERQREISARKTEIEEARSSIQKELQAQYEARLAATQQLQSELRQFYNNLANDTAGFLERTRTTREQM
AQKLEQELTQLITDLENNTAALLQEANQERVRVSQQQKALAIELKNQLAQFHEQLETSVGNWRQETQEQQQQTAAQLRED
LNAFSANLMAQVEKLLINLEETRTANAQQQREALFNFRRQLTVDVWGETESDSLKVEENTPSDVWGETESDFLEVEENTP
FVT
>Q9ZC13 ~~~gvpA1~~~Gas vesicle protein A1~~~
MTVVPAQQTGGGGSSGLYDVLELVLDRGLVIDAFVRVSLVGIEILKIDVRVVVASVDTYLRFAEACNRLDLEAGPRKDPG
LPDLVGEMTESGARGKSKGALSGAAETISDAFKQARDDGGSERETSSRPRARKAAPSRRKEEQE
>O68677 ~~~gvpA2~~~Gas vesicle protein A2~~~
MSIQKSTNSSSLAEVIDRILDKGIVIDAFARVSVVGIEILTIEARVVIASVDTWLRYAEAVGLLRDDVEENGLPERSNSS
EGQPRFSI
>Q9RJB4 ~~~gvpA2~~~Gas vesicle protein A2~~~
MITYDDEVVCAPRAGTLYDVLELILDRGMVIDVFVRVSLVGIEILKVDARIVVASVDTYLRFAEACNRLDLEHDVRSKTV
PEMFGSPMAKTVGRAGARRTARSLTDKVRDVLTPEHEHEEEPEEAEDRPRAGAERGRSTQRPRSRPAARPRDEDDRPRSR
PRRRTEEEDR
>P10397 ~~~gvpA~~~Gas vesicle protein A~~~
MAVEKTNSSSSLAEVIDRILDKGIVIDAWVRVSLVGIELLAIEARIVIASVETYLKYAEAVGLTQSAAVPA
>P08412 ~~~gvpA~~~Gas vesicle protein A~~~
MAVEKTNSSSSLAEVIDRILDKGIVIDAWARVSLVGIELLAIEARVVIASVETYLKYAEAVGLTQXAXXAX
>P07060 ~~~gvpA1~~~Gas vesicle protein A~~~
MAVEKTNSSSSLAEVIDRILDKGIVVDAWVRVSLVGIELLAIEARIVIASVETYLKYAEAVGLTQSAAVPA
>Q8XFU1 ~~~gvpA~~~Gas vesicle protein A~~~
MAVEKTNSSSSLAEVIDRILDKGIVVDAWVRVSLVGIELLAIEARIVIASVETYLKYAEAVGLTQSAAMPA
>P0A3G0 ~~~gvpA~~~Gas vesicle protein A~~~
MAVEKVNSSSSLAEVIDRILDKGIVIDAWVRVSLVGIELLSIEARIVIASVETYLKYAEAVGLTAQAAVPSV
>P0A3G1 ~~~gvpA~~~Gas vesicle protein A~~~
MAVEKVNSSSSLAEVIDRILDKGIVIDAWVRVSLVGIELLSIEARIVIASVETYLKYAEAVGLTAQAAVPSV
>P22453 ~~~gvpA~~~Gas vesicle protein A~~~
MAVEKVNSSSSLAEVIDRILDKGIVIDAWVRVSLVGIELLSIEARVVIASVETYLKYAEAVGLTASAAVPAA
>A0A1Q5LR04 ~~~gvpA~~~Gas vesicle protein A~~~
MTVVPAQQGGGGARGTSGLYDVLELVLDRGLVIDAFVRVSLVGIEILKIDVRVVVASVDTYLRFAEACNRLDLEAGPRKD
PGLPDLVGEMTESGARGKSKGALSGAAETISDALKGSSSGSSSGSSSRSTSRKKEEQE
>P09413 ~~~gvpC~~~Gas vesicle protein C~~~
MISLMAKIRQEHQSIAEKVAELSLETREFLSVTTAKRQEQAEKQAQELQAFYKDLQETSQQFLSETAQARIAQAEKQAQE
LLAFHKELQETSQQFLSATAQARIAQAEKQAQELLAFYQEVRETSQQFLSATAQARIAQAEKQAQELLAFHKELQETSQQ
FLSATADARTAQAKEQKESLLKFRQDLFVSIFG
>P08041 ~~~gvpC~~~Gas vesicle protein C~~~
MTPLMIRIRQEHRGIAEEVTQLFKDTQEFLSVTTAQRQAQAKEQAENLHQFHKDLEKDTEEFLTDTAKERMAKAKQQAED
LFQFHKEMAENTQEFLSETAKERMAQAQEQARQLREFHQNLEQTTNEFLADTAKERMAQAQEQKQQLHQFRQDLFASIFG
TF
>Q8YUS9 ~~~gvpC~~~Gas vesicle protein C~~~
MTALMVRIRQEHRSIAEEVTQLFRETHEFLSATTAHRQEQAKQQAQQLHQFHQNLEQTTHEFLTETTTQRVAQAEAQANF
LHKFHQNLEQTTQEFLAETAKNRTEQAKAQSQYLQQFRKDLFASIFGTF
>P80999 ~~~gvpC~~~Gas vesicle protein C~~~
MALKDKWQQDRIGRQQGVQERQQQVQTTLSLWQQERQNQALDDQESRQGFVTGVQQQTQELLTNISTERLWVAQQQREQL
ENFIQQLSQEVGEFLQQTIEERSQVAAQLHQQLSEFREDLEYRVTDLLANYQKQRLEARETLLEDLAIFRQTLYREVEEY
LGELDILHQQMAAQLQQQLQQSRTERKDAVQKLFEDLGVFRAELQDYHLKLQQTVWGSSHRKPRKAITPQRSIPSRLYSC
>A8Y9T3 ~~~gvpF~~~Gas vesicle protein F~~~
MTVGLYLYGIFPEPIPDGLVLQGIDNEPVHSEMIDGFSFLYSAAHKEKYLASRRYLICHEKVLETVMEAGFTTLLPLRFG
LVIKTWESVTEQLITPYKTQLKELFAKLSGQREVSIKIFWDNQWELQAALESNPKLKQERDAMMGKNLNMEEIIHIGQLI
EATVLRRKQDIIQVFRDQLNHRAQEVIESDPMTDDMIYNAAYLIPWEQEPEFSQNVEAIDQQFGDRLRIRYNNLTAPYTF
AQLI
>Q8YUT3 ~~~gvpF~~~Gas vesicle protein F~~~COG0154
MSSGLYLYGIFPDPIPETVTLQGLDSQLVYSQIIDGFTFLYSEAKQEKYLASRRNLISHEKVLEQAMHAGFRTLLPLRFG
LVVKNWETVVTQLLQPYKAQLRELFQKLAGRREVSVKIFWDSKAELQAMMDSHQDLKQKRDQMEGKALSMEEVIHIGQLI
ESNLLSRKESIIQVFFDELKPLADEVIESDPMTEDMIYNAAFLIPWENESIFSQQVESIDHKFDERLRIRYNNFTAPYTF
AQIS
>A0A1Q5LR02 ~~~gvpF~~~Gas vesicle protein F~~~
MSTYVYGIARSSHPSLPEKMGGIGDPPQPVRILVQGALAALVSDAPEDLRPKRRDLMAHQNVLAEAGAGGAVLPMRFGGI
SPDDDAVLAVLDEREEHYLERLRALDDKVEYNVKASHDEEAVLHRVLADNPELRGLSEANRAAGGGTYEQKLALGERVAA
AVQQREASDAVLIQEALQAEATDVRPGPESGAWLANISFLVERDRADGFVAAIDKLQQANHHLVLQVNGPLPPYSFVE
>A0A1Q5LQZ4 ~~~gvpG~~~Gas vesicle protein G~~~
MGLLGELLLLPAAPLRGTAWVLRQVVAEAERQYYDPAAVQRELARLNELLEAGEIDEEEFDRREDELLDRLEKGPRQS
>Q9ZC08 ~~~gvpJ1~~~Probable gas vesicle protein J1~~~
MTTPSRLPDPYGQGQSANLADILERVLDKGVVIAGDIKINLLDIELLTIKLRLVVASVDKAKEMGIDWWESDPALSSRAR
HDELTRENAALRERLRELDPGRVPREEAP
>Q9RJA9 ~~~gvpJ2~~~Probable gas vesicle protein J2~~~
MTDLDHRYPGEETEPYGPPSGSLADLLERVLDKGIVIAGDIKIDLLDIELLTIRLRLFIASVDTAKKAGIDWWETDPALS
SRAARDALAEENARLRERLDALEGAAGETTGAVR
>Q8YUT1 ~~~gvpJ~~~Gas vesicle protein J~~~
MTTTPIHPTRPQTNSNRVIPTSTQGSTLADILERVLDKGIVIAGDISISIASTELIHIRIRLLISSVDKAREMGINWWEN
DPYLSSKSQRLVEENQQLQQRLESLETQLRLLTSAAKEETTLTANNPEDLQPMYEVNSQEGDNSQLEA
>O68669 ~~~gvpJ~~~Gas vesicle protein J~~~
MAVEHNMQSSTIVDVLEKILDKGVVIAGDITVGIADVELLTIKIRLIVASVDKAKEIGMDWWENDPYLSSKGANNKALEE
ENKMLHERLKTLEEKIETKR
>A0A1Q5LRA3 ~~~gvpJ~~~Gas vesicle protein J~~~
MTTPRPSGLPSPYASDGSSANLADILERVLDKGVVIAGDIKINLLDIELLTIKLRLVVASVDRAKEMGIDWWESDPALSS
RARGSELARENADLQRRIAELEGRTV
>Q8YUT2 ~~~gvpK~~~Gas vesicle protein K~~~
MVCTPVEKSPNLLPTTSKANSKAGLAPLLLTVVELIRQLMEAQVIRRMEQDCLSESELEQASESLQKLEEQVLNLCHIFE
IEPADLNINLGDVGTLLPSPGSYYPGEIGNKPSVLELLDRLLNTGIVVDGEIDLGLAQLNLIHAKLRLVLTSRPL
>A0A1Q5LQZ5 ~~~gvpK~~~Gas vesicle protein K~~~
MTTQRRKVELDPDTVERDLARLVLTVVELLRQLMERQALRRVEGGDLTEEQEERIGLTLMLLEDRMELLRTRFGLEPEDL
NLDLGPLGPLL
>A0A1Q5LR34 ~~~gvpL~~~Gas vesicle protein L~~~
MSTDRLQYVYAVTRPFDGVLPEGAHGIGGEPPRLLRHGDLVAVTGAVPAGDFDEAPLRARLEDLDWLADAARAHDAVISA
LSTVTCPLPLRLATVCRDDSGVRRLLEDGHDRFVRALERLDGRVEWGVKVYAEPGAAQQQEEEPAAHAREASGRDYLRRR
LHARRSRDGDWQRADALCRRLHTELSRCAEAGTVHRPQDARLSGVPGVNVLNAAYLVDRARSQQFVELVDGASEPGVRVE
LTGPWAPYSFAGIAEEDVHETQEAGR
>A0A1Q5LR10 ~~~gvpO~~~Gas vesicle protein O~~~
MANTPEDTQNTQNDSQNDSQNDSQKDTSARATSARAHQQPQEQPPSPMRVLRGACAQLAELTGMEAESVSSFERTEDGWT
LNVEVLELARVPDTMSLLASYEVELDAHGELSGYRRVRRYERGRSDRS
>O68671 ~~~gvpS~~~Gas vesicle protein S~~~
MSLKQSMENKDIALIDILDVILDKGVAIKGDLIISIAGVDLVYLDLRVLISSVETLVQAKEGNHKPITSEQFDKQKEELM
DATGQPSKWTNPLGS
>A0A1Q5LQW7 ~~~gvpS~~~Gas vesicle protein S~~~
MTAPVTRAAPEPLAERRIALVDLLDRLLAGGVVLTGDLTLSIADVDLVRVDLKALISSVGEDVPSPWEPLREVRP
>P22866 ~~~gylR~~~Glycerol operon regulatory protein~~~
MARNIQSLERAAAMLRLLAGGERRLGLSDIASTLGLAKGTAHGILRSLQAEGFVEQEPASGRYQLGAELLALGNSYLDVH
ELRARALVWTDDLARSSGEAAYLGVLHQQGVLIVHHVFRPDDSRQVLEVGAMHPLHSTAHGKVISAFDPVAHSEVLEGDR
ATLTGRTVTEAAAFEEVLDLTRARGWALDLEETWEGVASLAAPVHDRRRMAVGAVGVTGPVERLCPDGAPATELVTAVRD
CAAAVSRDLGAGRF
>P05653 5.6.2.2~~~gyrA~~~DNA gyrase subunit A~~~COG0188
MSEQNTPQVREINISQEMRTSFLDYAMSVIVSRALPDVRDGLKPVHRRILYAMNDLGMTSDKPYKKSARIVGEVIGKYHP
HGDSAVYESMVRMAQDFNYRYMLVDGHGNFGSVDGDSAAAMRYTEARMSKISMEILRDITKDTIDYQDNYDGSEREPVVM
PSRFPNLLVNGAAGIAVGMATNIPPHQLGEIIDGVLAVSENPDITIPELMEVIPGPDFPTAGQILGRSGIRKAYESGRGS
ITIRAKAEIEQTSSGKERIIVTELPYQVNKAKLIEKIADLVRDKKIEGITDLRDESDRTGMRIVIEIRRDANANVILNNL
YKQTALQTSFGINLLALVDGQPKVLTLKQCLEHYLDHQKVVIRRRTAYELRKAEARAHILEGLRVALDHLDAVISLIRNS
QTAEIARTGLIEQFSLTEKQAQAILDMRLQRLTGLEREKIEEEYQSLVKLIAELKDILANEYKVLEIIREELTEIKERFN
DERRTEIVTSGLETIEDEDLIERENIVVTLTHNGYVKRLPASTYRSQKRGGKGVQGMGTNEDDFVEHLISTSTHDTILFF
SNKGKVYRAKGYEIPEYGRTAKGIPIINLLEVEKGEWINAIIPVTEFNAELYLFFTTKHGVSKRTSLSQFANIRNNGLIA
LSLREDDELMGVRLTDGTKQIIIGTKNGLLIRFPETDVREMGRTAAGVKGITLTDDDVVVGMEILEEESHVLIVTEKGYG
KRTPAEEYRTQSRGGKGLKTAKITENNGQLVAVKATKGEEDLMIITASGVLIRMDINDISITGRVTQGVRLIRMAEEEHV
ATVALVEKNEEDENEEEQEEV
>O51396 5.6.2.2~~~gyrA~~~DNA gyrase subunit A~~~
MAVGENKEQILNVRIEDEIKTSYLNYAMSVIVSRALPDVRDGLKPVHRRILYSMYEMGLRSDKAFKKAGRIVGDVLGKYH
PHGDQSIYDALVRLAQDFSLRYPVIRGQGNFGSIDGDPPAAMRYTEAKMEKITEYIVKDIDKETVNFKSNYDDSLSEPEI
MPSSFPFLLVNGSSGIAVGMATNMAPHNLREICDAIVYMLDNENASIFDLLKIVKGPDFPTFGEIVYNDNLIKAYKTGKG
SVVIRARYHIEERAEDRNAIIVTEIPYTVNKSALLMKVALLAKEEKLEGLLDIRDESDREGIRIVLEVKRGFDPHVIMNL
LYEYTEFKKHFSINNLALVNGIPKQLNLEELLFEFIEHRKNIIERRIEFDLRKAKEKAHVLEGLNIALNNIDEVIKIIKS
SKLAKDARERLVSNFGLSEIQANSVLDMRLQKLTALEIFKLEEELNILLSLIKDYEDILLNPVRIINIIREETINLGLKF
GDERRTKIIYDEEVLKTSMSDLMQKENIVVMLTKKGFLKRLSQNEYKLQGTGGKGLSSFDLNDGDEIVIALCVNTHDYLF
MISNEGKLYLINAYEIKDSSRASKGQNISELINLGDQEEILTIKNSKDLTDDAYLLLTTASGKIARFESTDFKAVKSRGV
IVIKLNDKDFVTSAEIVFKDEKVICLSKKGSAFIFNSRDVRLTNRGTQGVCGMKLKEGDLFVKVLSVKENPYLLIVSENG
YGKRLNMSKISELKRGATGYTSYKKSDKKAGSVVDAIAVSEDDEILLVSKRSKALRTVAGKVSEQGKDARGIQVLFLDND
SLVSVSKFIK
>P0AES4 5.6.2.2~~~gyrA~~~DNA gyrase subunit A~~~COG0188
MSDLAREITPVNIEEELKSSYLDYAMSVIVGRALPDVRDGLKPVHRRVLYAMNVLGNDWNKAYKKSARVVGDVIGKYHPH
GDSAVYDTIVRMAQPFSLRYMLVDGQGNFGSIDGDSAAAMRYTEIRLAKIAHELMADLEKETVDFVDNYDGTEKIPDVMP
TKIPNLLVNGSSGIAVGMATNIPPHNLTEVINGCLAYIDDEDISIEGLMEHIPGPDFPTAAIINGRRGIEEAYRTGRGKV
YIRARAEVEVDAKTGRETIIVHEIPYQVNKARLIEKIAELVKEKRVEGISALRDESDKDGMRIVIEVKRDAVGEVVLNNL
YSQTQLQVSFGINMVALHHGQPKIMNLKDIIAAFVRHRREVVTRRTIFELRKARDRAHILEALAVALANIDPIIELIRHA
PTPAEAKTALVANPWQLGNVAAMLERAGDDAARPEWLEPEFGVRDGLYYLTEQQAQAILDLRLQKLTGLEHEKLLDEYKE
LLDQIAELLRILGSADRLMEVIREELELVREQFGDKRRTEITANSADINLEDLITQEDVVVTLSHQGYVKYQPLSEYEAQ
RRGGKGKSAARIKEEDFIDRLLVANTHDHILCFSSRGRVYSMKVYQLPEATRGARGRPIVNLLPLEQDERITAILPVTEF
EEGVKVFMATANGTVKKTVLTEFNRLRTAGKVAIKLVDGDELIGVDLTSGEDEVMLFSAEGKVVRFKESSVRAMGCNTTG
VRGIRLGEGDKVVSLIVPRGDGAILTATQNGYGKRTAVAEYPTKSRATKGVISIKVTERNGLVVGAVQVDDCDQIMMITD
AGTLVRTRVSEISIVGRNTQGVILIRTAEDENVVGLQRVAEPVDEEDLDTIDGSAAEGDDEIAPEVDVDDEPEEE
>C5C7X9 5.6.2.2~~~gyrA~~~DNA gyrase subunit A~~~COG0188
MSDETTPQTPDEPVEGAPIPGTPQTGLDIEVEHYVLPEGAADKVEPVDLESEMKRSYLDYAMAVIVGRALPDVRDGLKPV
HRRVLYAMYDGGYRPDRAFNKSARVVGDVMGNYHPHGDTAIYDALVRLIQDWVQRYPLALGQGNFGSPGNDGAAAQRYTE
TKMAPLAMEMVRDIDEDTVDMQDNYDGKQQEPVVLPARYPNLLVNGSSGIAVGMATNIPPHNMREVAAGVQWYLEHPEAT
REELLEALLARVHGPDFPTGAQILGRKGIEEVYRTGRGPITMRAVVNVEEIQGRTCLVVTELPYMTNPDNLAAKIAEMVR
DGKISGIADMRDETSGRTGQRLVIVLKRDAVAKVVLNNLYKHTELQSNFSANMLALVDGVPRTLSLDGFVHHWVKHQIDV
IVRRTAFRKRKAEERAHILRGLLKALDMLDEVIATIRRSASADVAREALKELLDIDDVQAQAILQMQLRQLAALESQKIQ
DEYDDLMAKIAEYNRILESPQRQREVISEELAEIVAKHGDDRRTEIMAGFDGDMSIEDLIPEEEMVVSITRGGYVKRTRI
DQYRSQARGGKGVRGATLRGDDVVEHFLTVSTHHWLLFFTNFGRVYRIKTYELLEAGRDAKGQHVANLLAFQPDERIAQI
QPLVDYGRAPYLVLATRGGLVKKTPLLDYDTNRTAGLIAIKLREGDELVSARVVSPDDDLILISHKGQSLRFTATDEALR
PMGRATSGVTGMKFRDDDSLLTMDVVEEDGYVFTVTDGGFAKRTHVDEYRLQNRGGLGIKVAKLVDDRGELAGGLVVRED
QEVLVVMASGKVVRSAVAGVPAKGRDTMGVIFAKPDKRDRIVAVTLNNEQEMEAKADAEAEAGPDVPLDADIDPTDPVAA
PEDALTQDAGEGADGGEQ
>A0A0G2Q9F8 5.6.2.2~~~gyrA1~~~DNA gyrase subunit A~~~
MTDTTLPPDDSLDRIEPVDIQQEMQRSYIDYAMSVIVGRALPEVRDGLKPVHRRVLYAMFDSGFRPDRSHAKSARSVAET
MGNYHPHGDASIYDTLVRMAQPWSLRYPLVDGQGNFGSPGNDPPAAMRYTEARLTPLAMEMLREIDEETVDFIPNYDGRV
QEPTVLPSRFPNLLANGSGGIAVGMATNIPPHNLRELADAVFWALENHDADEEETLAAVMGRVKGPDFPTAGLIVGSQGT
ADAYKTGRGSIRMRGVVEVEEDSRGRTSLVITELPYQVNHDNFITSIAEQVRDGKLAGISNIEDQSSDRVGLRIVIEIKR
DAVAKVVINNLYKHTQLQTSFGANMLAIVDGVPRTLRLDQLIRYYVDHQLDVIVRRTTYRLRKANERAHILRGLVKALDA
LDEVIALIRASETVDIARAGLIELLDIDEIQAQAILDMQLRRLAALERQRIIDDLAKIEAEIADLEDILAKPERQRGIVR
DELAEIVDRHGDDRRTRIIAADGDVSDEDLIAREDVVVTITETGYAKRTKTDLYRSQKRGGKGVQGAGLKQDDIVAHFFV
CSTHDLILFFTTQGRVYRAKAYDLPEASRTARGQHVANLLAFQPEERIAQVIQIRGYTDAPYLVLATRNGLVKKSKLTDF
DSNRSGGIVAVNLRDNDELVGAVLCSADDDLLLVSANGQSIRFSATDEALRPMGRATSGVQGMRFNIDDRLLSLNVVREG
TYLLVATSGGYAKRTAIEEYPVQGRGGKGVLTVMYDRRRGRLVGALIVDDDSELYAVTSGGGVIRTAARQVRKAGRQTKG
VRLMNLGEGDTLLAIARNAEESGDDNAVDANGADQTGN
>Q57532 5.6.2.2~~~gyrA~~~DNA gyrase subunit A~~~COG0188
MTDITLPPGDGSIQRVEPVDIQQEMQRSYIDYAMSVIVGRALPEVRDGLKPVHRRVLYAMLDSGFRPDRSHAKSARSVAE
TMGNYHPHGDASIYDTLVRMAQPWSLRYPLVDGQGNFGSPGNDPPAAMRYCVSGNSLVRLLFGKSIRIGDIVTGAQFNSD
NPIDLKVLDRHGNPVVADYLFHSGEHQTYTVRTTEGYEITGTSNHPLLCLVNVGGIPTLLWKLIGEIRSGDYVVLQRIPP
VEFGPADWYSTMEALLFGAFISGGFVFQDHAGFNSLDRDYFTMVVNAYDTVVGGLRCISSRITVSGSTLLELDVYNLIEF
KKTRLSGLCGQRSADKLVPDWLWHSPSTVKRAFLQALFEGEGFSSILSRNIIEISYSTLSERLAADVQQMLLEFGVVSER
YCHTVNEYKVVIANRAQVEMFFTQVGFGVTKQAKLIRDVVSMSPCVGMDINCVPGLATFIRKHCDNRWVEEDSFNQHNVD
CVQHWHHHSAEIVGHIADPDIRAIVTDLTDGRFYYARVASVTDTGIQPVFSLHVDTEDHSFLTNGFISHNTEARLTPLAM
EMLREIDEETVDFISNYDGRVQEPMVLPSRFPNLLANGSGGIAVGMATNIPPHNLYELADAVFWCLENHDADEETMLVAV
MERVKGPDFPTAGLIVGSQGIADAYKTGRGSIRIRGVVEVEEDSRGRTSLVITELPYQVNHDNFITSIAEQVRTGRLAGI
SNVEDQGSDRVGVRIVIEIKRDAVAKVVLNNLYKHTQLQTSFGANMLSIVDGVPRTLRLDQMICYYVEHQLDVIVRRTTY
RLRKANERAHILRGLVKALDALDEVITLIRASQTVDIARVGVVELLDIDDIQAQAILDMQLRRLAALERQRIIDDLAKIE
VEIADLGDILAKPERRRGIIRNELTEIAEKYGDDRRTRIIAVDGDVNDEDLIAREEVVVTITETGYAKRTKTDLYRSQKR
GGKGVQGAGLKQDDIVRHFFVCSTHDWILFFTTQGRVYRAKAYELPEASRTARGQHVANLLAFQPEERIAQVIQIRSYED
APYLVLATRAGLVKKSKLTDFDSNRSGGIVAINLRDNDELVGAVLCAADGDLLLVSANGQSIRFSATDEALRPMGRATSG
VQGMRFNADDRLLSLNVVREDTYLLVATSGGYAKRTSIEEYPMQGRGGKGVLTVMYDRRRGSLVGAIVVDEDSELYAITS
GGGVIRTTARQVRQAGRQTKGVRLMNLGEGDTLLAIARNAEESADGVSVKVMISRSRVLSFFGSDSNTSPDRT
>P22446 5.6.2.2~~~gyrA~~~DNA gyrase subunit A~~~
MAKQQDQIDKIRQELAQSAIKNISLSSELERSFMEYAMSVIVARALPDARDGLKPVHRRVLYGAYTGGMHHDRPFKKSAR
IVGDVMSKFHPHGDMAIYDTMSRMAQDFSLRYLLIDGHGNFGSIDGDRPAAQRYTEARLSKLAGELLRDIDKDTVDFVAN
YDGEEQEPTVLPAAFPNLLANGSSGIAVGMSTSIPSHNLSELIQGLILLIDNPDCTINDLLGVIKGPDFPTGANIIYTKG
IESYFETGKGNVVIRSKVSIEQLPTRAALVVTEIPYMVNKTSLIEKIVELVKAEEITGIADIRDESSREGIRLVIEVKRD
TVPEVLLNQLFKSTRLQVRFPVNMLALVKGAPKLLNMKQALTVYLEHQLDVLIRKTQFNLKKYQERFHILSGLLIAALNI
DEVIAIIKKSANNQVAMEALHERFGLDEIQARAVLDMRLRSLSVLEVNKLQTEQQELKALIEFCQQVLADKQLQLKLIKE
QLTKINEQFGDPRRSEILYGISEDIDDEDLITQENVVITMSTNGYLKRIGVDAYNLQHRGGVGVKGLTTYTDDSISQLLV
CSTHSDLLFFTDKGKVYRIRAHQIPPGFRTNKGIPAVNLIKIDKDEKICALISVNDYQNGYFFFCTKNGTIKRTSLSEFA
NILSIGKRAILFKENDVLFSVIRTSGQDDIFIGSTAGFVVRFHEDTVRPLSRAAMGVLGINLNQCEFVNGLSTSSNGSLL
LSVGQNGIGKLTSIDKYRLTKRNAKGVKTLRVTAKTGPVVTTTTVFGNEDLLMISSAGKIVRISLEQLSEQRKNTSGVKL
IKLKEKERLETVTIFKKEEAIKTTTATETDDVGSKQITQ
>P48354 5.6.2.2~~~gyrA~~~DNA gyrase subunit A~~~COG0188
MTDTTLPPEGEAHDRIEPVDIQQEMQRSYIDYAMSVIVGRALPEVRDGLKPVHRRVLYAMYDSGFRPDRSHAKSARSVAE
TMGNYHPHGDASIYDTLVRMAQPWSLRYPLVDGQGNFGSPGNDPPAAMRYTEARLTPLAMEMLREIDEETVDFIPNYDGR
VQEPTVLPSRFPNLLANGSGGIAVGMATNIPPHNLGELAEAVYWCLENYEADEEATCEAVMERVKGPDFPTSGLIVGTQG
IEDTYKTGRGSIKMRGVVEIEEDSRGRTSIVITELPYQVNHDNFITSIAEQVRDGKLAGISNIEDQSSDRVGLRIVVELK
RDAVAKVVLNNLYKHTQLQTSFGANMLSIVDGVPRTLRLDQLIRLYVDHQLDVIVRRTRYRLRKANERAHILRGLVKALD
ALDEVIALIRASQTVDIARAGLIELLDIDDIQAQAILDMQLRRLAALERQKIVDDLAKIEAEIADLEDILAKPERQRGIV
RDELKEIVDKHGDARRTRIVPADGEVSDEDLIAREDVVVTITETGYAKRTKTDLYRSQKRGGKGVQGAGLKQDDMVNHFF
VCSTHDWILFFTTQGRVYRAKAYELPEASRTARGQHVANLLAFQPEERIAQVIQIKSYEDAPYLVLATRNGLVKKSKLSD
FDSNRSGGIVAINLREGDELVGAVLCSAEDDLLLVSANGQSIRFSATDEALRPMGRATSGVQGMRFNEDDRLLSLNVVRP
DTYLLVATSGGYAKRTSIDEYSVQGRGGKGILTIQYDRKRGSLVGALIVDDDTELYAITSTGGVIRTAARQVRKAGRQTK
GVRLMNLAEGDTLIAIARNADEDEAAESISESDADTAESPEA
>Q59556 5.6.2.2~~~gyrA~~~DNA gyrase subunit A~~~COG0188
MTDTTLPPEGEAHDRIEPVDIQQEMQRSYIDYAMSVIVGRALPEVRDGLKPVHRRVLYAMYDSASSDRSHAKSARSVAET
MGNYHPHGDASIYDTLVRMAQPWSLRYPLVDGQGNFGSPGNDPPAAMRYTEARLTPLAMEMLREIDEETVDFIPNYDGRV
QEPTVLPSRFPNLLANGSRGIAVGMATNNPPHNLGELAEAVYWCLENYEADEEATCEAVMERVKGPDFPTSGLIVGTQGI
EDTYKTGRGSIKMRGVVEIEEDSRGRTSIVITELPYQVNHDNFITSIAEQVRDGKLAGISNIEDQSSDRVGLRIVVELKR
DAVAKVVLNNLYKHTQLQTSFGANMLSIVDGVPRTLRLDQLIRLYVDHQLDVIVRRTRYRLRKANERAHILRGLVKALDA
LDEVIALIRASQTVDIARAGLIELLDIDDIQAQAILDMQLRRLAALERQKIVDDLAKIEAEIADLEDILAKPERQRGIVR
DELKEIVDKHGDARRTRIVPADGQVSDEDLIAREDVVVTITETGYAKRTKTDLYHSQKRGGKGVQGAGLKQDDMVNHFFV
CSTHDWILFFTTQGRVYRAKAYELPEASRTARGQHVANLLAFQPEERIAQVIQIKSYEDAPYLVLATRNGLVKKSKLSDF
DSNRSGGIVAINLREGDELVGAVLCSAEDDLLLVSANRQSIRFSATDEALRPMGRATSGVQGMRFNEDDRLLSLNVVRPD
TYLLVATSGGYAKRTSIDEYSVQGRGGKGILTIQYDRKRGSLVGALIVDDDTELYAITSTGGVIRTAARQVRKAGRQTKG
VRLMNLAEGDTLIAIARNGRGRGGRVDQRIRRGHRRVTRGVMETLRSRKVLGPS
>P9WG47 5.6.2.2~~~gyrA~~~DNA gyrase subunit A~~~COG0188
MTDTTLPPDDSLDRIEPVDIEQEMQRSYIDYAMSVIVGRALPEVRDGLKPVHRRVLYAMFDSGFRPDRSHAKSARSVAET
MGNYHPHGDASIYDSLVRMAQPWSLRYPLVDGQGNFGSPGNDPPAAMRYTEARLTPLAMEMLREIDEETVDFIPNYDGRV
QEPTVLPSRFPNLLANGSGGIAVGMATNIPPHNLRELADAVFWALENHDADEEETLAAVMGRVKGPDFPTAGLIVGSQGT
ADAYKTGRGSIRMRGVVEVEEDSRGRTSLVITELPYQVNHDNFITSIAEQVRDGKLAGISNIEDQSSDRVGLRIVIEIKR
DAVAKVVINNLYKHTQLQTSFGANMLAIVDGVPRTLRLDQLIRYYVDHQLDVIVRRTTYRLRKANERAHILRGLVKALDA
LDEVIALIRASETVDIARAGLIELLDIDEIQAQAILDMQLRRLAALERQRIIDDLAKIEAEIADLEDILAKPERQRGIVR
DELAEIVDRHGDDRRTRIIAADGDVSDEDLIAREDVVVTITETGYAKRTKTDLYRSQKRGGKGVQGAGLKQDDIVAHFFV
CSTHDLILFFTTQGRVYRAKAYDLPEASRTARGQHVANLLAFQPEERIAQVIQIRGYTDAPYLVLATRNGLVKKSKLTDF
DSNRSGGIVAVNLRDNDELVGAVLCSAGDDLLLVSANGQSIRFSATDEALRPMGRATSGVQGMRFNIDDRLLSLNVVREG
TYLLVATSGGYAKRTAIEEYPVQGRGGKGVLTVMYDRRRGRLVGALIVDDDSELYAVTSGGGVIRTAARQVRKAGRQTKG
VRLMNLGEGDTLLAIARNAEESGDDNAVDANGADQTGN
>P37411 5.6.2.2~~~gyrA~~~DNA gyrase subunit A~~~
MSDLAREITPVNIEEELKSSYLDYAMSVIVGRALPDVRDGLKPVHRRVLYAMNVLGNDWNKAYKKSARVVGDVIGKYHPH
GDSAVYDTIVRMAQPFSLRYMLVDGQGNFGSIDGDSAAAMRYTEIRLAKIAHELMADLEKETVDFVDNYDGTEKIPDVMP
TKIPNLLVNGSSGIAVGMATNIPPHNLTEVINGCLAYIDNEDISIEGLMEHIPGPDFPTAAIINGRRGIEEAYRTGRGKV
YIRARAEVEADAKTGRETIIVHEIPYQVNKARLIEKIAELVKDKRVEGISALRDESDKDGMRIVIEVKRDAVGEVVLNNL
YSQTQLQVSFGINMVALHHGQPKIMNLKDIISAFVRHRREVVTRRTIFELRKARDRAHILEALAIALANIDPIIELIRRA
PTPAEAKAALISRPWDLGNVAAMLERAGDDAARPEWLEPEFGVRDGQYYLTEQQAQAILDLRLQKLTGLEHEKLLDEYKE
LLEQIAELLHILGSADRLMEVIREEMELIRDQFGDERRTEITANSADINIEDLISQEDVVVTLSHQGYVKYQPLTDYEAQ
RRGGKGKSAARIKEEDFIDRLLVANTHDTILCFSSRGRLYWMKVYQLPEASRGARGRPIVNLLPLEANERITAILPVREY
EEGVNVFMATASGTVKKTALTEFSRPRSAGIIAVNLNDGDELIGVDLTSGSDEVMLFSAAGKVVRFKEDAVRAMGRTATG
VRGIKLAGDDKVVSLIIPRGEGAILTVTQNGYGKRTAADEYPTKSRATQGVISIKVTERNGSVVGAVQVDDCDQIMMITD
AGTLVRTRVSEISVVGRNTQGVILIRTAEDENVVGLQRVAEPVDDEELDAIDGSVAEGDEDIAPEAESDDDVADDADE
>P0AES5 5.6.2.2~~~gyrA~~~DNA gyrase subunit A~~~
MSDLAREITPVNIEEELKSSYLDYAMSVIVGRALPDVRDGLKPVHRRVLYAMNVLGNDWNKAYKKSARVVGDVIGKYHPH
GDSAVYDTIVRMAQPFSLRYMLVDGQGNFGSIDGDSAAAMRYTEIRLAKIAHELMADLEKETVDFVDNYDGTEKIPDVMP
TKIPNLLVNGSSGIAVGMATNIPPHNLTEVINGCLAYIDDEDISIEGLMEHIPGPDFPTAAIINGRRGIEEAYRTGRGKV
YIRARAEVEVDAKTGRETIIVHEIPYQVNKARLIEKIAELVKEKRVEGISALRDESDKDGMRIVIEVKRDAVGEVVLNNL
YSQTQLQVSFGINMVALHHGQPKIMNLKDIIAAFVRHRREVVTRRTIFELRKARDRAHILEALAVALANIDPIIELIRHA
PTPAEAKTALVANPWQLGNVAAMLERAGDDAARPEWLEPEFGVRDGLYYLTEQQAQAILDLRLQKLTGLEHEKLLDEYKE
LLDQIAELLRILGSADRLMEVIREELELVREQFGDKRRTEITANSADINLEDLITQEDVVVTLSHQGYVKYQPLSEYEAQ
RRGGKGKSAARIKEEDFIDRLLVANTHDHILCFSSRGRVYSMKVYQLPEATRGARGRPIVNLLPLEQDERITAILPVTEF
EEGVKVFMATANGTVKKTVLTEFNRLRTAGKVAIKLVDGDELIGVDLTSGEDEVMLFSAEGKVVRFKESSVRAMGCNTTG
VRGIRLGEGDKVVSLIVPRGDGAILTATQNGYGKRTAVAEYPTKSRATKGVISIKVTERNGLVVGAVQVDDCDQIMMITD
AGTLVRTRVSEISIVGRNTQGVILIRTAEDENVVGLQRVAEPVDEEDLDTIDGSAAEGDDEIAPEVDVDDEPEEE
>Q2G2Q0 5.6.2.2~~~gyrA~~~DNA gyrase subunit A~~~COG0188
MAELPQSRINERNITSEMRESFLDYAMSVIVARALPDVRDGLKPVHRRILYGLNEQGMTPDKSYKKSARIVGDVMGKYHP
HGDSSIYEAMVRMAQDFSYRYPLVDGQGNFGSMDGDGAAAMRYTEARMTKITLELLRDINKDTIDFIDNYDGNEREPSVL
PARFPNLLANGASGIAVGMATNIPPHNLTELINGVLSLSKNPDISIAELMEDIEGPDFPTAGLILGKSGIRRAYETGRGS
IQMRSRAVIEERGGGRQRIVVTEIPFQVNKARMIEKIAELVRDKKIDGITDLRDETSLRTGVRVVIDVRKDANASVILNN
LYKQTPLQTSFGVNMIALVNGRPKLINLKEALVHYLEHQKTVVRRRTQYNLRKAKDRAHILEGLRIALDHIDEIISTIRE
SDTDKVAMESLQQRFKLSEKQAQAILDMRLRRLTGLERDKIEAEYNELLNYISELEAILADEEVLLQLVRDELTEIRDRF
GDDRRTEIQLGGFEDLEDEDLIPEEQIVITLSHNNYIKRLPVSTYRAQNRGGRGVQGMNTLEEDFVSQLVTLSTHDHVLF
FTNKGRVYKLKGYEVPELSRQSKGIPVVNAIELENDEVISTMIAVKDLESEDNFLVFATKRGVVKRSALSNFSRINRNGK
IAISFREDDELIAVRLTSGQEDILIGTSHASLIRFPESTLRPLGRTATGVKGITLREGDEVVGLDVAHANSVDEVLVVTE
NGYGKRTPVNDYRLSNRGGKGIKTATITERNGNVVCITTVTGEEDLMIVTNAGVIIRLDVADISQNGRAAQGVRLIRLGD
DQFVSTVAKVKEDAEDETNEDEQSTSTVSEDGTEQQREAVVNDETPGNAIHTEVIDSEENDEDGRIEVRQDFMDRVEEDI
QQSSDEE
>Q99XG5 5.6.2.2~~~gyrA~~~DNA gyrase subunit A~~~
MAELPQSRINERNITSEMRESFLDYAMSVIVARALPDVRDGLKPVHRRILYGLNEQGMTPDKSYKKSARIVGDVMGKYHP
HGDSSIYEAMVRMAQDFSYRYPLVDGQGNFGSMDGDGAAAMRYTEARMTKITLELLRDINKDTIDFIDNYDGNEREPSVL
PARFPNLLANGASGIAVGMATNIPPHNLTELINGVLSLSKNPDISIAELMEDIEGPDFPTAGLILGKSGIRRAYETGRGS
IQMRSRAVIEERGGGRQRIVVTEIPFQVNKARMIEKIAELVRDKKIDGITDLRDETSLRTGVRVVIDVRKDANASVILNN
LYKQTPLQTSFGVNMIALVNGRPKLINLKEALVHYLEHQKTVVRRRTQYNLRKAKDRAHILEGLRIALDHIDEIISTIRE
SDTDKVAMESLQQRFKLSEKQAQAILDMRLRRLTGLERDKIEAEYNELLNYISELETILADEEVLLQLVRDELTEIRDRF
GDDRRTEIQLGGFEDLEDEDLIPEEQIVITLSHNNYIKRLPVSTYRAQNRGGRGVQGMNTLEEDFVSQLVTLSTHDHVLF
FTNKGRVYKLKGYEVPELSRQSKGIPVVNAIELENDEVISTMIAVKDLESEDNFLVFATKRGVVKRSALSNFSRINRNGK
IAISFREDDELIAVRLTSGQEDILIGTSHASLIRFPESTLRPLGRTATGVKGITLREGDEVVGLDVAHANSVDEVLVVTE
NGYGKRTPVNDYRLSNRGGKGIKTATITERNGNVVCITTVTGEEDLMIVTNAGVIIRLDVADISQNGRAAQGVRLIRLGD
DQFVSTVAKVKEDAEDETNEDEQSTSTVSEDGTEQQREAVVNDETPGNAIHTEVIDSEENDEDGRIEVRQDFMDRVEEDI
QQSSDEDEE
>P20831 5.6.2.2~~~gyrA~~~DNA gyrase subunit A~~~
MAELPQSRINERNITSEMRESFLDYAMSVIVARALPDVRDGLKPVHRRILYGLNEQGMTPDKSYKKSARIVGDVMGKYHP
HGDSSIYEAMVRMAQDFSYRYPLVDGQGNFGSMDGDGAAAMRYTEARMTKITLELLRDINKDTIDFIDNYDGNEREPSVL
PARFPNLLANGASGIAVGMATNIPPHNLTELINGVLSLSKNPDISIAELMEDIEGPDFPTAGLILGKSGIRRAYETGRGS
IQMRSRAVIEERGGGRQRIVVTEIPFQVNKARMIEKIAELVRDKKIDGITDLRDETSLRTGVRVVIDVRKDANASVILNN
LYKQTPLQTSFGVNMIALVNGRPKLINLKEALVHYLEHQKTVVRRRTQYNLRKAKDRAHILEGLRIALDHIDEIISTIRE
SDTDKVAMESLQQRFKLSEKQAQAILDMRLRRLTGLERDKIEAEYNELLNYISELEAILADEEVLLQLVRDELTEIRDRF
GDDRRTEIQLGGFEDLEDEDLIPEEQIVITLSHNNYIKRLPVSTYRAQNRGGRGVQGMNTLEEDFVSQLVTLSTHDHVLF
FTNKGRVYKLKGYEVPELSRQSKGIPVVNAIELENDEVISTMIAVKDLESEDNFLVFATKRGVVKRSALSNFSRINRNGK
IAISFREDDELIAVRLTSGQEDILIGTSHASLIRFPESTLRPLGRTATGVKGITLREGDEVVGLDVAHANSVDEVLVVTE
NGYGKRTPVNDYRLSNRGGKGIKTATITERNGNVVCITTVTGEEDLMIVTNAGVIIRLDVADISQNGRAAQGVRLIRLGD
DQFVSTVAKVKEDAEDETNEDEQSTSTVSEDGTEQQREAVVNDETPGNAIHTEVIDSEENDEDGRIEVRQDFMDRVEEDI
QQSLDEDEE
>Q8DPM2 5.6.2.2~~~gyrA~~~DNA gyrase subunit A~~~COG0188
MQDKNLVNVNLTKEMKASFIDYAMSVIVARALPDVRDGLKPVHRRILYGMNELGVTPDKPHKKSARITGDVMGKYHPHGD
SSIYEAMVRMAQWWSYRYMLVDGHGNFGSMDGDSAAAQRYTEARMSKIALEMLRDINKNTVDFVDNYDANEREPLVLPAR
FPNLLVNGATGIAVGMATNIPPHNLGETIDAVKLVMDNPEVTTKDLMEVLPGPDFPTGALVMGKSGIHKAYETGKGSIVL
RSRTEIETTKTGRERIVVTEFPYMVNKTKVHEHIVRLVQEKRIEGITAVRDESNREGVRFVIEVKRDASANVILNNLFKM
TQMQTNFGFNMLAIQNGIPKILSLRQILDAYIEHQKEVVVRRTRFDKEKAEARAHILEGLLIALDHIDEVIRIIRASETD
AEAQAELMSKFKLSERQSQAILDMRLRRLTGLERDKIQSEYDDLLALIADLADILAKPERVSQIIKDELDEVKRKFSDKR
RTELMVGQVLSLEDEDLIEESDVLITLSNRGYIKRLDQDEFTAQKRGGRGVQGTGVKDDDFVRELVSTSTHDHLLFFTNK
GRVYRLKGYEIPEYGRTAKGLPVVNLLKLDEDESIQTVINVESDRSDDAYLFFTTRHGIVKRTSVKEFANIRQNGLKALN
LKDEDELINVLLAEGDMDIIIGTKFGYAVRFNQSAVRGMSRIATGVKGVNLREGDTVVGASLITDQDEVLIITEKGYGKR
TVATEYPTKGRGGKGMQTAKITEKNGLLAGLMTVQGDEDLMIITDTGVMIRTNLANISQTGRATMGVKVMRLDQDAQIVT
FTTVAVAEKEEVGTENETEGEA
>Q5SIL4 5.6.2.2~~~gyrA~~~DNA gyrase subunit A~~~COG0188
MAQVLPVEITEELKQSFINYAMSVIVDRALPDVRDGLKPVQRRILFGAYQEGVLPGRKHVKSAKIVGEVMGKYHPHGDAA
IYDALVRMAQPWNLRYPLIDGQGNFGSIDGDPPAAQRYTEARLSPIGAEMLLDIDKDTVDFRPNYDGSLKEPEVLPAAIP
NLLVNGASGIAVGMATSLPPHNLSEVVDALVAMIENPAITLEEVMRHLPGPDFPTGGKLSKKGIKEAYATGRGSLKVRAK
VRVEEKGQRPVLVVTEIPYQVNKASLIAQIAALVKAKKIEDIVGLRDESDRQGLRIAIELKRGANPQVVLNQLYKHTALQ
TSFTVNLLAIVDGEPKVLSLLDLMRHYLDHRKEVVRRRSLFELRKAEERAHVLEGLLIALDHIDEVIALIRGSEDAPKAR
IALMERFGLSEAQAQAILDMRLQRLVALEREKLLEEYRGLMEEIARLKAILEDEARLLAEVKADLLRVKEKYGDARRTLI
TEFEETFNPEDLIEDEPMVITLTAQGFLKRLPLESYRAQGRGGKGLLAGRTKEEDEATHVFVADAHDDLLLFTNRGRVYR
LKVYELPEMGRQARGVHVKSLLPLAEDEEVAALLSVRGLDQEGYLVFATERGLVKRTALKEYQNLGQAGLIAIRLQEGDR
LVGVALSDPEDEAILATQEGQAIRFPLEEVRATGRDTQGVIGVRFKKPEDRVVSLVVVKPGEMVDLLSVSTRGYGKRTPL
SEYPLQGRGGMGVITYAVSTKVGRLAALLKVRGGEDLLVLSRRGLAIRTPVAEIRQYSRATAGVRVMNLPEDDEVASAFV
VEEEK
>P50074 5.6.2.2~~~gyrBR~~~DNA gyrase subunit B, novobiocin-resistant~~~
MTTYDTRTATDTRGSEQPGHVGTASYDANAITVLDGLDAVRKRPGMYIGSTGERGLHHLVQELVDNSVDEALAGVADRID
VTVLADGGVRVVDNGRGIPVGMHPVEKRPAVEVVLTVLHAGGKFGGGGYGVSGGLHGVGLSVVNALSTRLSAEIWTDGHR
WTQDYRDGAPTAPLARHEATSRTGTSLTFWADGDIFETTEYSFETLARRHQEMAFLNGGLTLTLTDERSSARATAAVDEA
DSDPTAKTVSYRYDGGITDFVVHLNARKGEPAHPSVITIAAEDTERLLSAEIALQWNGQYTDSVYSYANAIHTHEGGTHE
EGFRTALTTVVNRYAREKRLLRDKDANLSGEDIREGLTAIISVNVGEPQFEGQTKTKLGNTEVRTLLQKIVHEHLADWFD
RNPNEAVDIVRKAVQAATARVAARKARDLTRRKGLLETAALPGKLSDCQSNDPATSEIFIVEGDSAGGSAKAGRNPQYQA
ILPIRGKILNVEKARIDKVLQNQENQALISAFGTGVHEDFDIAKLRYHKIILMADADVDGQHISTLLLTFLFRFMRPLVE
EGHVHLSRPPLYKIKWSREHVEYAYSDRERNTLLERGRRDGRRIRDDSIQRFKGLGEMNAEELRVTTMDPDHRVLGQVTL
DDAAFADDLFSVLMGEDVEARRHFIQRNAQDVRFLDI
>B8GXQ0 5.6.2.2~~~gyrB~~~DNA gyrase subunit B~~~
MTTEEAAAQYGADSIKVLKGLDAVRKRPGMYIGDTDDGSGLHHMVYEVVDNAIDEALAGHATKVQVILNADGSVTVTDDG
RGIPVDMHEGEGVSAAEVIMTQLHAGGKFDQNSYKVSGGLHGVGVSVVNALSDWLELLIHRNGKVHQMRFERGDAVTSLK
VTGDSPVRTEGPKAGETLTGTEVTFFPSKDTFAFIEFDRKTLEHRLRELAFLNSGVTIWFKDHRDVEPWEEKLFYEGGIE
AFVRHLDKAKTPLLKAPIAVKGVKDKVEIDLALWWNDSYHEQMLCFTNNIPQRDGGTHLSAFRAALTRIITSYAESSGIL
KKEKVSLGGEDSREGLTCVLSVKVPDPKFSSQTKDKLVSSEVRPAVEGLVSEGLSTWFEEHPNEAKAIVTKIAEAAAARE
AARKARELTRRKSALDITSLPGKLADCSERDPAKSEIFIVEGDSAGGSAKQARNRDNQAVLPLRGKILNVERARFDKMLS
SDQIGTLITALGAGIGRDDFNPDKVRYHKIVLMTDADVDGAHIRTLLLTFFYRQMPELIERGYIYIAQPPLYKASKGKSS
RYLKDDAEMDAFLVDEGVDGAELDLASGERMTGQDLLALVQTCRSAKANIDRLAARAPATAIEQAALSGLLGESPNAAAA
ATRLDLYAEEGDGPWSGERGDTGFVFSRVRRGVSERVVLDDVLLHAADARRLAERAVKLTEIFSGRAIFRRKDKSTTVRG
PLDLVNAVLDAGRKGLTIQRYKGLGEMNPDQLWETTLDAEARTLLQVRVNHADDADDMFSRLMGDLVEPRREFIQENALD
AEVDV
>P0AES6 5.6.2.2~~~gyrB~~~DNA gyrase subunit B~~~COG0187
MSNSYDSSSIKVLKGLDAVRKRPGMYIGDTDDGTGLHHMVFEVVDNAIDEALAGHCKEIIVTIHADNSVSVQDDGRGIPT
GIHPEEGVSAAEVIMTVLHAGGKFDDNSYKVSGGLHGVGVSVVNALSQKLELVIQREGKIHRQIYEHGVPQAPLAVTGET
EKTGTMVRFWPSLETFTNVTEFEYEILAKRLRELSFLNSGVSIRLRDKRDGKEDHFHYEGGIKAFVEYLNKNKTPIHPNI
FYFSTEKDGIGVEVALQWNDGFQENIYCFTNNIPQRDGGTHLAGFRAAMTRTLNAYMDKEGYSKKAKVSATGDDAREGLI
AVVSVKVPDPKFSSQTKDKLVSSEVKSAVEQQMNELLAEYLLENPTDAKIVVGKIIDAARAREAARRAREMTRRKGALDL
AGLPGKLADCQERDPALSELYLVEGDSAGGSAKQGRNRKNQAILPLKGKILNVEKARFDKMLSSQEVATLITALGCGIGR
DEYNPDKLRYHSIIIMTDADVDGSHIRTLLLTFFYRQMPEIVERGHVYIAQPPLYKVKKGKQEQYIKDDEAMDQYQISIA
LDGATLHTNASAPALAGEALEKLVSEYNATQKMINRMERRYPKAMLKELIYQPTLTEADLSDEQTVTRWVNALVSELNDK
EQHGSQWKFDVHTNAEQNLFEPIVRVRTHGVDTDYPLDHEFITGGEYRRICTLGEKLRGLLEEDAFIERGERRQPVASFE
QALDWLVKESRRGLSIQRYKGLGEMNPEQLWETTMDPESRRMLRVTVKDAIAADQLFTTLMGDAVEPRRAFIEENALKAA
NIDI
>Q839Z1 5.6.2.2~~~gyrB~~~DNA gyrase subunit B~~~COG0187
MRERAQEYDASQIQVLEGLEAVRKRPGMYIGSTSGEGLHHLVWEIVDNSIDEALAGFAKSIQVIIEPDDSITVIDDGRGI
PVGIQAKTGRPAVETVFTVLHAGGKFGGGGYKVSGGLHGVGSSVVNALSTSLDVRVYKDGKVYYQEYRRGAVVDDLKVIE
ETDRHGTTVHFIPDPEIFTETTVYDFDKLATRVRELAFLNRGLHISIEDRREGQEDKKEYHYEGGIKSYVEHLNANKDVI
FPEPIFIEGEQQDITVEVSMQYTDGYHSNILSFANNIHTYEGGTHESGFKTSLTRVINDYARKQKLMKENDEKLTGEDVR
EGLTAVVSIKHPDPQFEGQTKTKLGNSEVRTVTDRLFSEYFTKFLMENPTVGKQIVEKGMLASKARLAAKRAREVTRRKG
ALEISNLPGKLADCSSKDPEKCELFIVEGDSAGGSAKQGRSREFQAILPIRGKILNVEKASMDKILANEEIRSLFTAMGT
GFGEDFDVSKARYHKLVIMTDADVDGAHIRTLLLTLFYRFMRPIVEAGYVYIAQPPLYGVKQGKNITYVQPGKHAEEELA
KVLEELPASPKPSVQRYKGLGEMDDHQLWETTMDPEKRLMARVSVDDAIEADQIFEMLMGDRVEPRRAFIEENAHYVKNL
DI
>C5C7X8 5.6.2.2~~~gyrB~~~DNA gyrase subunit B~~~COG0187
MVDAMPENPAEEPTAASAAPNPEAVPDAVGQPEAPVKDRKVPGEYGASAITVLEGLEAVRKRPGMYIGSTGPRGLHHLVY
EVVDNSVDEALAGYATGIDVTLQADGGVRVADDGRGIPVDLHPTEGRPTVEVVMTILHAGGKFGGGGYAVSGGLHGVGIS
VVNALSRRVDTEVRRQGHVWRMSFADGGVPQGELVKGEATDATGTVQTFYPDAEIFDSIEFDYETLRARFQQMAFLNKGL
RITLTDERVQESNEVVDDEIAGEGAAGEDVAENGLAEDAEQEPQRRSVTYLYENGLLDYVQHLNSAKKVEYVHDDVIAFE
AEDFSDGRSMAVEVAMQWTSAYSESVHTYANTINTHEGGTHEEGFRAALTSLVNRYAREKEILKPKEDNLSGEDIREGLT
AVISVKLSEPQFEGQTKTKLGNSEARGFVSKAVTDHLGDWFERNPGPAKEIIRKAIMASHARLAARKARDNARRKSPLES
FGMPGKLADCSSKDPERCEVYIVEGDSAGGSAKQGRNPETQAILPLRGKILNVERARLDKALGNAEIQSMITAFGTNIGE
EFDISKLRYHKIVLMADADVDGQHITTLLLTVLFRYMRPLIEAGHVFLAQPPLYRIKWSNAPHDYVFSDEERDAAVEAGL
AKGWRYPKDNGVQRYKGLGEMNYQELWDTTMDPEHRTLLQVTMEDAAAADAVFSMLMGEDVESRRTFIQQNAKDIRFLDV
>A0A0G2Q9D6 5.6.2.2~~~gyrB1~~~DNA gyrase subunit B~~~
MGKNEARRSALAPDHGTVVCDPLRRLNRMHATPEESIRIVAAQKKKAQDEYGAASITILEGLEAVRKRPGMYIGSTGERG
LHHLIWEVVDNAVDEAMAGYATTVNVVLLEDGGVEVADDGRGIPVATHASGIPTVDVVMTQLHAGGKFDSDAYAISGGLH
GVGVSVVNALSTRLEVEIKRDGYEWSQVYEKSEPLGLKQGAPTKKTGSTVRFWADPAVFETTEYDFETVARRLQEMAFLN
KGLTINLTDERVTQDEVVDEVVSDVAEAPKSASERAAESTAPHKVKSRTFHYPGGLVDFVKHINRTKNAIHSSIVDFSGK
GTGHEVEIAMQWNAGYSESVHTFANTINTHEGGTHEEGFRSALTSVVNKYAKDRKLLKDKDPNLTGDDIREGLAAVISVK
VSEPQFEGQTKTKLGNTEVKSFVQKVCNEQLTHWFEANPTDSKVVVNKAVSSAQARIAARKARELVRRKSATDIGGLPGK
LADCRSTDPRKSELYVVEGDSAGGSAKSGRDSMFQAILPLRGKIINVEKARIDRVLKNTEVQAIITALGTGIHDEFDIGK
LRYHKIVLMADADVDGQHISTLLLTLLFRFMRPLIENGHVFLAQPPLYKLKWQRSDPEFAYSDRERDGLLEAGLKAGKKI
NKEDGIQRYKGLGEMDAKELWETTMDPSVRVLRQVTLDDAAAADELFSILMGEDVDARRSFITRNAKDVRFLDV
>Q59533 5.6.2.2~~~gyrB~~~DNA gyrase subunit B~~~COG0187
MAAQRKAQDEYGAASITILEGLEAVRKRPGMYVGSTGERGLHHLIWEVVDNSVDEAMAGYATQVDVRLFDDGSVEVADNG
RGIPVAVHATGVPTVDVVMTQLHAGGKFGGKDSGYNVSGGLHGVGVSVVNALSTRVEVDIKRDGYEWSQFYDKAVPGILK
QGEATEATGTTIRFWADPDIFETTKYDFGTVARRIQEVAFLNKGLTINLVDERVKQDEVVDDVVSDTAEAPVAMTVEEKS
TESSAPHKVRHRTFHYPGGLVDFVKHINRTKTPIQQSIIDFDGKGAGHEVEVAMQWNGGYSESVHTFANTINTHEGGTHE
EGFRSALTSVVNKYAKDKKLLKDKDPNLTGDDIREGLAAVISVKVSEPQFEGQTKTKLGNTEVKSFVQRVCNEQLIHWFE
ANPVDAKAVVNKAISSAQARIAARKARELVRRKSATDLGGLPGKLADCRSTDPRSSELYVVEGDSAGGSAKSGRDSMFQA
ILPLRGKIINVEKARIDRVLKNTEVQAIITALGTGIHDEFDISRLRYHKIVLMADADVDGQHISTLLLTLLFRFMRPLIE
HGYVFLAQPPLYKLKWQRMDPEFAYSDSERDGLLETGLKLGKKINKEDGIQRYKGLGEMDAKELWETTMDPSVRVLRQVT
LDDAAAADELFSILMGEDVDARRSFITRNAKDVRFLDV
>P22447 5.6.2.2~~~gyrB~~~DNA gyrase subunit B~~~
MEDNNKTQAYDSSSIKILEGLEAVRKRPGMYIGSTGEEGLHHMIWEIIDNSIDEAMGGFASTVKLTLKDNFVTIVEDDGR
GIPVDIHPKTNRSTVETVFTVLHAGGKFDNDSYKVSGGLHGVGASVVNALSSSFKVWVAREHQQYFLAFHNGGEVIGDLV
NEGKCDKEHGTKVEFVPDFTVMEKSDYKQTVIASRLQQLAFLNKGIQIDFVDERRQNPQSFSWKYDGGLVQYIHHLNNEK
EPLFEDIIFGEKTDTVKSVSRDESYTIKVEVAFQYNKTYNQSIFSFCNNINTTEGGTHVEGFRNALVKIINRFAVENKFL
KETDEKITRDDICEGLTAIISIKHPNPQYEGQTKKKLGNTEVRPLVNSIVSEIFERFMLENPQEANAIIRKTLLAQEARR
RSQEARELTRRKSPFDSGSLPGKLADCTTRDPSISELYIVEGDSAGGTAKTGRDRYFQAILPLRGKILNVEKSHFEQIFN
NVEISALVMAVGCGIKPDFELEKLRYNKIIIMTDADVDGAHIRTLLLTFFFRFMYPLVEQGNIYIAQPPLYKVSYSNKDL
YMQTDVQLEEWKQQHPNLKYNLQRYKGLGEMDAIQLWETTMDPKVRTLLKVTVEDASIADKAFSLLMGDEVPPRREFIEQ
NARNVKNIDI
>A0QNE0 5.6.2.2~~~gyrB~~~DNA gyrase subunit B~~~COG0187
MAAQKNNAPKEYGADSITILEGLEAVRKRPGMYIGSTGERGLHHLIWEVVDNAVDEAMAGFATRVDVKIHADGSVEVRDD
GRGIPVEMHATGMPTIDVVMTQLHAGGKFDGETYAVSGGLHGVGVSVVNALSTRLEATVLRDGYEWFQYYDRSVPGKLKQ
GGETKETGTTIRFWADPEIFETTDYNFETVARRLQEMAFLNKGLTIELTDERVTAEEVVDDVVKDTAEAPKTADEKAAEA
TGPSKVKHRVFHYPGGLVDYVKHINRTKTPIQQSIIDFDGKGPGHEVEIAMQWNAGYSESVHTFANTINTHEGGTHEEGF
RAALTSVVNRYAKDKKLLKDKDPNLTGDDIREGLAAVISVKVAEPQFEGQTKTKLGNTEVKSFVQKICNEQLQHWFEANP
AEAKTVVNKAVSSAQARIAARKARELVRRKSATDIGGLPGKLADCRSTDPSKSELYVVEGDSAGGSAKSGRDSMFQAILP
LRGKIINVEKARIDRVLKNTEVQSIITALGTGIHDEFDISKLRYHKIVLMADADVDGQHISTLLLTLLFRFMKPLVENGH
IFLAQPPLYKLKWQRSEPEFAYSDRERDGLLEAGRAAGKKINVDDGIQRYKGLGEMDAKELWETTMDPSVRVLRQVTLDD
AAAADELFSILMGEDVEARRSFITRNAKDVRFLDV
>P0C559 5.6.2.2~~~gyrB~~~DNA gyrase subunit B~~~
MAAQKNNAPKEYGADSITILEGLEAVRKRPGMYIGSTGERGLHHLIWEVVDNAVDEAMAGFATRVDVKIHADGSVEVRDD
GRGIPVEMHATGMPTIDVVMTQLHAGGKFDGETYAVSGGLHGVGVSVVNALSTRLEATVLRDGYEWFQYYDRSVPGKLKQ
GGETKETGTTIRFWADPEIFETTDYNFETVARRLQEMAFLNKGLTIELTDERVTAEEVVDDVVKDTAEAPKTADEKAAEA
TGPSKVKHRVFHYPGGLVDYVKHINRTKTPIQQSIIDFDGKGPGHEVEIAMQWNAGYSESVHTFANTINTHEGGTHEEGF
RAALTSVVNRYAKDKKLLKDKDPNLTGDDIREGLAAVISVKVAEPQFEGQTKTKLGNTEVKSFVQKICNEQLQHWFEANP
AEAKTVVNKAVSSAQARIAARKARELVRRKSATDIGGLPGKLADCRSTDPSKSELYVVEGDSAGGSAKSGRDSMFQAILP
LRGKIINVEKARIDRVLKNTEVQSIITALGTGIHDEFDISKLRYHKIVLMADADVDGQHISTLLLTLLFRFMKPLVENGH
IFLAQPPLYKLKWQRSEPEFAYSDRERDGLLEAGRAAGKKINVDDGIQRYKGLGEMDAKELWETTMDPSVRVLRQVTLDD
AAAADELFSILMGEDVEARRSFITRNAKDVRFLDV
>P9WG45 5.6.2.2~~~gyrB~~~DNA gyrase subunit B~~~COG0187
MAAQKKKAQDEYGAASITILEGLEAVRKRPGMYIGSTGERGLHHLIWEVVDNAVDEAMAGYATTVNVVLLEDGGVEVADD
GRGIPVATHASGIPTVDVVMTQLHAGGKFDSDAYAISGGLHGVGVSVVNALSTRLEVEIKRDGYEWSQVYEKSEPLGLKQ
GAPTKKTGSTVRFWADPAVFETTEYDFETVARRLQEMAFLNKGLTINLTDERVTQDEVVDEVVSDVAEAPKSASERAAES
TAPHKVKSRTFHYPGGLVDFVKHINRTKNAIHSSIVDFSGKGTGHEVEIAMQWNAGYSESVHTFANTINTHEGGTHEEGF
RSALTSVVNKYAKDRKLLKDKDPNLTGDDIREGLAAVISVKVSEPQFEGQTKTKLGNTEVKSFVQKVCNEQLTHWFEANP
TDAKVVVNKAVSSAQARIAARKARELVRRKSATDIGGLPGKLADCRSTDPRKSELYVVEGDSAGGSAKSGRDSMFQAILP
LRGKIINVEKARIDRVLKNTEVQAIITALGTGIHDEFDIGKLRYHKIVLMADADVDGQHISTLLLTLLFRFMRPLIENGH
VFLAQPPLYKLKWQRSDPEFAYSDRERDGLLEAGLKAGKKINKEDGIQRYKGLGEMDAKELWETTMDPSVRVLRQVTLDD
AAAADELFSILMGEDVDARRSFITRNAKDVRFLDV
>Q9I7C2 5.6.2.2~~~gyrB~~~DNA gyrase subunit B~~~
MSENNTYDSSSIKVLKGLDAVRKRPGMYIGDTDDGTGLHHMVFEVVDNSIDEALAGYCSEISITIHTDESITVRDNGRGI
PVDIHKEEGVSAAEVIMTVLHAGGKFDDNTYKVSGGLHGVGVSVVNALSHELRLTIRRHNKVWEQVYHHGVPQFPLREVG
ETDGSGTEVHFKPSPETFSNIHFSWDILAKRIRELSFLNSGVGILLRDERTGKEELFKYEGGLKAFVEYLNTNKTAVNEV
FHFNVQREEDGVGVEVALQWNDSFNENLLCFTNNIPQRDGGTHLAGFRSALTRNLNNYIEAEGLAKKFKIATTGDDAREG
LTAIISVKVPDPKFSSQTKDKLVSSEVKTAVEQEMGKYFADFLLENPNEAKAVVGKMIDAARAREAARKAREMTRRKGAL
DIAGLPGKLADCQEKDPALSELYIVEGDSAGGSAKQGRNRRTQAILPLKGKILNVEKARFDKMLSSQEVGTLITALGCGI
GREEYNIDKLRYHNIIIMTDADVDGSHIRTLLLTFFFRQMPELIERGYIYIAQPPLYKVKRGKQEQYIKDDQAMEEYMTQ
SALEDASLHVNEHAPGLSGAALEKLVNEYRGVIATLKRLSRLYPQELTEHFIYLPTVSVDDLANESAMQGWLEKFQARLT
AAEKSGLTYKASLREDRERHLWLPEVELVAHGLSSYVTFNRDFFASNDYRSVSLLGDQLNSLLEDGAYVQKGERKRPISA
FKDGLDWLMAEGTKRHSIQRYKGLGEMNPEQLWETTMDPNVRRMLKVTIEDAIAADQIFNTLMGDAVEPRRDFIESNALA
VSNLDV
>P0A2I4 5.6.2.2~~~gyrB~~~DNA gyrase subunit B~~~COG0187
MSNSYDSSSIKVLKGLDAVRKRPGMYIGDTDDGTGLHHMVFEVVDNAIDEALAGHCKDIVVTIHADNSVSVTDDGRGIPT
GIHPEEGVSAAEVIMTVLHAGGKFDDNSYKVSGGLHGVGVSVVNALSQKLELVIQRDGKIHRQIYEHGVPQAPLAVTGDT
DKTGTMVRFWPSHETFTNVTEFEYEILAKRLRELSFLNSGVSIRLRDKRDGKEDHFHYEGGIKAFVEYLNKNKTPIHPNI
FYFSTEKDGIGVEVALQWNDGFQENIYCFTNNIPQRDGGTHLAGFRAAMTRTLNAYMDKEGYSKKAKVSATGDDAREGLI
AVVSVKVPDPKFSSQTKDKLVSSEVKSAVEQQMNELLSEYLLENPSDAKIVVGKIIDAARAREAARRAREMTRRKGALDL
AGLPGKLADCQERDPALSELYLVEGDSAGGSAKQGRNRKNQAILPLKGKILNVEKARFDKMLSSQEVATLITALGCGIGR
DEYNPDKLRYHSIIIMTDADVDGSHIRTLLLTFFYRQMPEIVERGHVYIAQPPLYKVKKGKQEQYIKDDEAMDQYQISIA
LDGATLHANAHAPALSGEALEKLVSEYNATQKMIGRMERRFPKALLKELVYQPTLTEADLSDEQTVTRWVNALITELNEK
EQHGSQWKFDVHTNTEQNLFEPIVRVRTHGVDTDYPLDHEFVTGAEYRRICTLGEKLRGLIEEDAFIERGERRQPVTSFE
QALEWLVKESRRGLAIQRYKGLGEMNPDQLWETTMDPESRRMLRVTVKDAIAADQLFTTLMGDAVEPRRAFIEENALKAA
NIDI
>P0A2I3 5.6.2.2~~~gyrB~~~DNA gyrase subunit B~~~
MSNSYDSSSIKVLKGLDAVRKRPGMYIGDTDDGTGLHHMVFEVVDNAIDEALAGHCKDIVVTIHADNSVSVTDDGRGIPT
GIHPEEGVSAAEVIMTVLHAGGKFDDNSYKVSGGLHGVGVSVVNALSQKLELVIQRDGKIHRQIYEHGVPQAPLAVTGDT
DKTGTMVRFWPSHETFTNVTEFEYEILAKRLRELSFLNSGVSIRLRDKRDGKEDHFHYEGGIKAFVEYLNKNKTPIHPNI
FYFSTEKDGIGVEVALQWNDGFQENIYCFTNNIPQRDGGTHLAGFRAAMTRTLNAYMDKEGYSKKAKVSATGDDAREGLI
AVVSVKVPDPKFSSQTKDKLVSSEVKSAVEQQMNELLSEYLLENPSDAKIVVGKIIDAARAREAARRAREMTRRKGALDL
AGLPGKLADCQERDPALSELYLVEGDSAGGSAKQGRNRKNQAILPLKGKILNVEKARFDKMLSSQEVATLITALGCGIGR
DEYNPDKLRYHSIIIMTDADVDGSHIRTLLLTFFYRQMPEIVERGHVYIAQPPLYKVKKGKQEQYIKDDEAMDQYQISIA
LDGATLHANAHAPALSGEALEKLVSEYNATQKMIGRMERRFPKALLKELVYQPTLTEADLSDEQTVTRWVNALITELNEK
EQHGSQWKFDVHTNTEQNLFEPIVRVRTHGVDTDYPLDHEFVTGAEYRRICTLGEKLRGLIEEDAFIERGERRQPVTSFE
QALEWLVKESRRGLAIQRYKGLGEMNPDQLWETTMDPESRRMLRVTVKDAIAADQLFTTLMGDAVEPRRAFIEENALKAA
NIDI
>P66937 5.6.2.2~~~gyrB~~~DNA gyrase subunit B~~~
MVTALSDVNNTDNYGAGQIQVLEGLEAVRKRPGMYIGSTSERGLHHLVWEIVDNSIDEALAGYANKIEVVIEKDNWIKVT
DNGRGIPVDIQEKMGRPAVEVILTVLHAGGKFGGGGYKVSGGLHGVGSSVVNALSQDLEVYVHRNETIYHQAYKKGVPQF
DLKEVGTTDKTGTVIRFKADGEIFTETTVYNYETLQQRIRELAFLNKGIQITLRDERDEENVREDSYHYEGGIKSYVELL
NENKEPIHDEPIYIHQSKDDIEVEIAIQYNSGYATNLLTYANNIHTYEGGTHEDGFKRALTRVLNSYGLSSKIMKEEKDR
LSGEDTREGMTAIISIKHGDPQFEGQTKTKLGNSEVRQVVDKLFSEHFERFLYENPQVARTVVEKGIMAARARVAAKKAR
EVTRRKSALDVASLPGKLADCSSKSPEECEIFLVEGDSAGGSTKSGRDSRTQAILPLRGKILNVEKARLDRILNNNEIRQ
MITAFGTGIGGDFDLAKARYHKIVIMTDADVDGAHIRTLLLTFFYRFMRPLIEAGYVYIAQPPLYKLTQGKQKYYVYNDR
ELDKLKSELNPTPKWSIARYKGLGEMNADQLWETTMNPEHRALLQVKLEDAIEADQTFEMLMGDVVENRRQFIEDNAVYA
NLDF
>Q6GKU0 5.6.2.2~~~gyrB~~~DNA gyrase subunit B~~~
MVTALSDVNNTDNYGAGQIQVLEGLEAVRKRPGMYIGSTSERGLHHLVWEIVDNSIDEALAGYANQIEVVIEKDNWIKVT
DNGRGIPVDIQEKMGRPAVEVILTVLHAGGKFGGGGYKVSGGLHGVGSSVVNALSQDLEVYVHRNETIYHQAYKKGVPQF
DLKEVGTTDKTGTVIRFKADGEIFTETTVYNYETLQQRIRELAFLNKGIQITLRDERDEENVREDSYHYEGGIKSYVELL
NENKEPIHDEPIYIHQSKDDIEVEIAIQYNSGYATNLLTYANNIHTYEGGTHEDGFKRALTRVLNSYGLSSKIMKEDKDR
LSGEDTREGMTAIISIKHGDPQFEGQTKTKLGNSEVRQVVDKLFSEHFERFLYENPQVARTVVEKGIMAARARVAAKKAR
EVTRRKSALDVASLPGKLADCSSKSPEECEIFLVEGDSAGGSTKSGRDSRTQAILPLRGKILNVEKARLDRILNNNEIRQ
MITAFGTGIGGDFDLAKARYHKIVIMTDADVDGAHIRTLLLTFFYRFMRPLIEAGYVYIAQPPLYKLTQGKQKYYVYNDR
ELDKLKSELNPTPKWSIARYKGLGEMNADQLWETTMNPEHRALLQVKLEDAIEADQTFEMLMGDVVENRRQFIEDNAVYA
NLDF
>P0A0K8 5.6.2.2~~~gyrB~~~DNA gyrase subunit B~~~
MVTALSDVNNTDNYGAGQIQVLEGLEAVRKRPGMYIGSTSERGLHHLVWEIVDNSIDEALAGYANQIEVVIEKDNWIKVT
DNGRGIPVDIQEKMGRPAVEVILTVLHAGGKFGGGGYKVSGGLHGVGSSVVNALSQDLEVYVHRNETIYHQAYKKGVPQF
DLKEVGTTDKTGTVIRFKADGEIFTETTVYNYETLQQRIRELAFLNKGIQITLRDERDEENVREDSYHYEGGIKSYVELL
NENKEPIHDEPIYIHQSKDDIEVEIAIQYNSGYATNLLTYANNIHTYEGGTHEDGFKRALTRVLNSYGLSSKIMKEEKDR
LSGEDTREGMTAIISIKHGDPQFEGQTKTKLGNSEVRQVVDKLFSEHFERFLYENPQVARTVVEKGIMAARARVAAKKAR
EVTRRKSALDVASLPGKLADCSSKSPEECEIFLVEGDSAGGSTKSGRDSRTQAILPLRGKILNVEKARLDRILNNNEIRQ
MITAFGTGIGGDFDLAKARYHKIVIMTDADVDGAHIRTLLLTFFYRFMRPLIEAGYVYIAQPPLYKLTQGKQKYYVYNDR
ELDKLKSELNPTPKWSIARYKGLGEMNADQLWETTMNPEHRALLQVKLEDAIEADQTFEMLMGDVVENRRQFIEDNAVYA
NLDF
>P0A4M0 5.6.2.2~~~gyrB~~~DNA gyrase subunit B~~~COG0187
MTEEIKNLQAQDYDASQIQVLEGLEAVRMRPGMYIGSTSKEGLHHLVWEIVDNSIDEALAGFASHIQVFIEPDDSITVVD
DGRGIPVDIQEKTGRPAVETVFTVLHAGGKFGGGGYKVSGGLHGVGSSVVNALSTQLDVHVHKNGKIHYQEYRRGHVVAD
LEIVGDTDKTGTTVHFTPDPKIFTETTIFDFDKLNKRIQELAFLNRGLQISITDKRQGLEQTKHYHYEGGIASYVEYINE
NKDVIFDTPIYTDGEMDDITVEVAMQYTTGYHENVMSFANNIHTHEGGTHEQGFRTALTRVINDYARKNKLLKDNEDNLT
GEDVREGLTAVISVKHPNPQFEGQTKTKLGNSEVVKITNRLFSEAFSDFLMENPQIAKRIVEKGILAAKARVAAKRAREV
TRKKSGLEISNLPGKLADCSSNNPAETELFIVEGDSAGGSAKSGRNREFQAILPIRGKILNVEKASMDKILANEEIRSLF
TAMGTGFGAEFDVSKARYQKLVLMTDADVDGAHIRTLLLTLIYRYMKPILEAGYVYIAQPPIYGVKVGSEIKEYIQPGAD
QEIKLQEALARYSEGRTKPTIQRYKGLGEMDDHQLWETTMDPEHRLMARVSVDDAAEADKIFDMLMGDRVEPRREFIEEN
AVYSTLDV
>Q5SHZ4 5.6.2.2~~~gyrB~~~DNA gyrase subunit B~~~COG0187
MSYDASAIRVLKGLEGVRHRPAMYIGGTGVEGYHHLFKEILDNAVDEALAGYATEILVRLNEDGSLTVEDNGRGIPVDLM
PEEGKPAVEVIYTTLHSGGKFEQGAYKVSGGLHGVGASVVNALSEWTVVEVFREGKHHRIAFSRGEVTEPLRVVGEAPRG
KTGTRVTFKPDPEIFGNLRFDPSKIRARLREVAYLVAGLKLVFQDRQHGKEEVFLDKGGVASFAKALAEGEDLLYEKPFL
IRGTHGEVEVEVGFLHTQGYNAEILTYANMIPTRDGGTHLTAFKSAYSRALNQYAKKAGLNKEKGPQPTGDDLLEGLYAV
VSVKLPNPQFEGQTKGKLLNPEAGTAVGQVVYERLLEILEENPRIAKAVYEKALRAAQAREAARKARELVRRQNPLESDE
LPGKLADCQTENPEEAELFIVEGDSAGGSAKQGRDRRFQAILPLRGKILNVEKAGLSKALKNAEVRAMVSAIGVGIGGDG
EAHFDLEGLRYHKIIIMTDADVDGSHIRTLLLTFFYRYMRPLIERGHVYIAQPPLYRLQVGKKVEYLYSDEELQARLKEL
EGKHYEVQRFKGLGEMNPEQLWETTMNPEKRVLKRVELQDALEASELFEKLMGQEVAPRREFIEEHARYAELDI
>O08399 5.6.2.2~~~gyrB~~~DNA gyrase subunit B~~~COG0187
MGIEYSASSITVLEGLEAVRKRPGMYIGSTGPNGLHHLVYEVVDNCIDEAMAGYCDRITVVLEQGNVVRVEDNGRGIPVD
VHPHEGVSALEVVLTKLHAGGKFDKKSYKVSGGLHGVGVSVVNALSLWVEVTVYRDGAEYYQKFNVGMPLAPVEKRGVSE
KRGTIIRWQADPSIFKETVAYDFDVLLTRLRELAFLNSTVVIQLRDERLATAKQVEFAFEGGIRHFVSYLNRGKSVVPER
PLYIEGSKSDVLVEVALQYHDGYTENVQSFVNDINTREGGTHLEGFKSALTRVANDFLKKSPKLAKKIEREEKLVGEDVR
AGLTVVLSVKIPEPQFEGQTKTKLGNSEVRGIVDSLVGERLTLYFEQNPGVLTKILEKSIAEAQARLAARRAKEAARRKS
GMDSFGLPGKLADCSLKDPAKCEVYIVEGDSAGGSAKKGRDSKTQAILPLWGKMLNVEKTRLDKVLHNEKLQPIIATLGT
GVGKDFDLTRIRYHKVIIMADADVDGSHIRTLLLTFFFRYLPQIIEAGYVYLAMPPLYRIAWSKKELYVYSDTERDEALE
SIGKKSGVAVQRYKGLGEMDGTQLWETTMNPVRRKMMQVVLSDAVEADRVFSTLMGEDVEPRRKFIEENAIYARLDV
>B2UQG6 3.2.1.52~~~~~~Beta-hexosaminidase Amuc_0868~~~COG3525
MISKCTFSATVFSLFSLCWGAPSSPVLEAPHTIPLPAAMRVQTGESGFSLKNGVRLPEKNPLSRQAERIFRDNGINTALV
KNNADIIFTEDASLGREGYRLAVTPDSISIASGSVNGTLYALQSLVQSIAADKNGAPALPRMDVKDQPRFSWRGLMVDSC
RHMMPVRDIKKVLDLMERYKFNTLHWHLTDDQGWRLPIAKYPRLTTVGGARAQSPVIGNRNKGDGIPYSGHYTADEIRDV
VRYARDRGITVIPEVEMPGHASAAIAAYPELGNTDIPGYEPRVQETWGVHSYTFSPTEKTFRFLEDVIDEICALFPDSPY
IHIGGDEAPKNQWKQSPTAQRVMKDNGLANEHELQSYFIRRVEKMINNRGKRLIGWDEIQEGGLSPTATMMVWRSQMPHI
AAQALAQGNDIVMTPNSHLYFDYDQGPGKPAAPEYETINNNQLTWQHVYGLEPVPQGTPREREKQVLGCQANIWTEYIPN
LPKWEYHVFPRALALAEVAWTPQELKNEKDFRKRLDRQLPFLDARGVNYKRPDNGAPAQPKAVITRERR
>B2UP57 3.2.1.52~~~~~~Beta-hexosaminidase Amuc_2018~~~COG3525
MARPLPILGGILLSFSPPAEATAQYSIIPEPSRTELRQETAKTLQLLSDQEVPTLETDAYRLTVTPQGAHLASGGREGRI
YGLATLRQLRDQLAGQPEGIPCGVITDKPRYPWRGLMVDPARHFIPAADLKKFVDMMAYYKFNRLHLHLTDNQGWRLPVP
GYPKLKSVASRREESFGDGIPHEGMYTKQELKELVAYCAARGIDVIPEIDMPGHNQALHAAYPEFFCFPKPDMNVRTTAG
NSKELVCPQKPEVWKFYASVFNELKDIFPSGIVHLGGDEAPTELWEKCPLCREARTRAAMKDEQEQMKAFFAKTAALLAK
NGQTPQFWYEGNAGIYHPGETVYAWRQGQALQSIEKTKKAGLNLIMASSEYCYLDFPQIQGQRNWGWMKTTTLQKCYDLD
PAFGKPEKEAGHIRGVHAPVWAERLPDLNHLLYRAYPRACAIAEAGWSPMGVRSWENFRRKLADHRQFILKRFNYDMERT
QGNEPAFRWENNK
>B2UPR7 3.2.1.52~~~~~~Beta-hexosaminidase Amuc_2136~~~COG3525
MKKNLFLMIAVLAASPVMGQDAKQIADSLSIPPVKAGAKQLPMPSVSGAQIKLLGADYEQLVNSKGKIAPVISDTPVNVS
FKVTKDGKEAVSKDYEIMLQAPQAAQGNPKPRIIPEILQWKGGQGEYKLGNTVTIACPDKELGKLFAADMEDVLGKKVKL
VAPGAKADISLSLLKGGNLGREGYRLQIARDGVRLGAAAPTGLFWGTRTLLQMLRQTPGSVPCGTAVDFPRYQLRGFMLD
VARTPYPLSYLKDVIRTMAWYKMNDLHLVINNNYIFHEHYVDNGHDPFKESYAAFRLESKMKGKDGTPLTARDLFYTKKE
FADLVSYARKYGVNIVPEFDTPGHALSFTRLRPDLIYKGPMNHEKRRCEMLDAANPETIDLVSKVFDEYMLKDPKLGRPV
FADCGVVHVGADEFYGDKEDYRHFANAVLTHALKRGYTPRIWGSLSAKPGKTPVVSKGVQMNLWSTGWMKAWEAVNQGYD
VINTNDGALYIVPFAGYYRMDRNHKGLYNNWIPNRIGNETLPSGHPQLLGGTFAVWNDETDIMHTGYAPYDIWGIISGSM
DVLSQKLWGTAKAPDTFEQHRELVSSIGNAPRTNPLHKWKDSQPLTVKPSSLPQKLDKPALGPNYRLTMELELTAAPEGK
EQVLLAAPEGELLAVMKDGTVGFRRDDSLEFSFGAKLPVGKKVKVEIVGEPEKTSLLLDGEPAGTAVLKNFSDKSKDFSD
KFKHRPKVHRSTFILPLKELGSSFQGKVFHMNVQPL
>P07211 ~~~~~~Outer membrane protein H.8~~~
MKAYLALISAAVIGLAACSQEPAAPAAEATPAGEAPASEAPAAEAAPADAAEAPAAGNCAATVESNDNMQFNTKDIQVSK
ACKEFTITLKHTGTQPKASMGHNLVIAKAEDMDGVFKDGVGAADTDYVKPDDARVVAHTKLIGGGEESSLTLDPAKLADG
DYKFACTFPGHGALMNGKVTLVD
>P11910 ~~~~~~Outer membrane protein H.8~~~
MKKSLFAAALLSLALAACGGEKAAEAPAAEASSTEAPAAEAPAAEAPAAEAAAAEAPAAEAPAAEAPAAEAAATEAPAAE
APAAEAAK
>O52214 5.4.4.1~~~habA~~~Hydroxylaminobenzene mutase HabA~~~
MQTYLFASGLVLFLLGLVTGLLVPVSKNPRMGVAGHLQGMTNGPLLIIAGLLWPYLELPDAWQLATFWLLIYGTYANWLG
VQLAALWGAGAKLAPIAAGEHRSTPLKERVVTFLLFSLIPAMFAAPIILLIGILR
>O52216 5.4.4.1~~~habB~~~Hydroxylaminobenzene mutase HabB~~~
MTLHTPSTDAPLARRLLQLGIALFLLGLLTGFLLPMMANPRVGLSSHLEGVLNGMFLLALGLMWPQLSLGTGARKAAFGF
AVYGTYANWLATLLAGFWGAGGRMMPIAAGGHTGTAAQEGLIAFALISLSLSMLVVCALALWGLRSAPARRNTDAPAAGP
QPAA
>Q9ZNE0 4.2.1.36~~~hacA~~~Homoaconitase large subunit~~~COG0065
MGQTLAEKILSHKVGRPVRAGELVVVEVDQVMVVDSIAGSFFKRLEYLEATPRYPERVSIVIDHVAPAANLEVAKAQKEI
REWGKRHGIRVFDVGRGVCHQVLIEEGLAQPGWVVVGSDSHSTTYGAVGAFGTGMGATDIALAAASGRTWLRVPESVKVV
FRGRLPKGVTAKDAALEMVRLLTAEGATYMAVEIHLLDGAEALTRGERMTLANLTVEAGAKAGLVVPSGEILEMYRVPDW
LYPDPDARYAKEVEIDLSALTPRVSVPFYVDNVHEVAQVKGKRVDQVFIGTCTNGRIEDLRAAAEVLRGRKVAPWVRLLV
VPASSQVLEEAARDGTLLTLLEAGATIGTPGCGPCMGRHMGVLAPGEVCVSTSNRNFRGRMGAPDAEIYLASPRVAAASA
VAGYLTTPEELEEEEVHA
>Q9ZND9 4.2.1.36~~~hacB~~~Homoaconitase small subunit~~~COG0066
MPRVWKFGDQINTDDILPGKYAPFMVGEDRFHLYAFAHLRPEFAKEVRPGDILVFGRNAGLGSSREYAPEALKRLGVRAI
IAKSYARIFFRNLVNLGIVPFESEEVVDALEDGDEVELDLESGVLTRGEERFALRPPPPFLLEALKEGSLLDYYKKHGRF
PGE
>P0DUV9 4.1.-.-~~~~~~2-hydroxyacyl-CoA lyase~~~
MADRQDAERSGAGPARQSVPVASLVAEFLQEHGVDRVFGLQGGHIQPIWDQLARRGVRIVDVRDEGSAVHMAHAHTELTG
QTAVAMVTAGPGVTNTVTAVANASVSRIPLLVIGGCPPIPQSNMGPLQDIPHTAILEPITRLARTLRSADQVLREFDEAW
ARASGDRGEPGPVYLEIPTDVLRRDVPPALQMREHLRAKPKRRPQPHPDDVAAVADLIRAAEKPAIISGRGARTTDGTDL
VRLLDASGAAYLDTQESRGLVPDSHPAAVGSARSAVMRDTDLLITVGRQLDYQLGMGSPAVFPHAKVVRIADTASELIDN
RRGEVEILAEPGAALAAIADALKDHTPDTSWRDELKAKHRKRAEDYRQALHSTENGADGHIHPNRIFGALDALDGDVLDL
GETIMIADGGDLLSFGRLGITKARRYLDAGAFGCLGVATPFAIGAALAYPDRPVVAVTGDGAFGITATEIDTAVRHDAKI
VVIVSNNRAWNIERYDQAENYGLVVGTDLADSDYAGVARAFGAHGERVTDPAELEGAIRRALANAPALVDVVTTQDAASP
DSGKGLGFVPDYQALTPWNDAEVARRQEGI
>Q51645 3.8.1.2~~~hdl IVa~~~(S)-2-haloacid dehalogenase 4A~~~
MVDSLRACVFDAYGTLLDVHSAVMRNADEVGASAEALSMLWRQRQLEYSWTRTLMHQYADFWQLTDEALTFALRTYHLED
RKGLKDRLMSAYKELSAYPDAAETLEKLKSAGYIVAILSNGNDEMLQAALKASKLDRVLDSCLSADDLKIYKPDPRIYQF
ACDRLGVNPNEVCFVSSNAWDLGGAGKFGFNTVRINRQGNPPEYEFAPLKHQVNSLSELWPLLAKNVTKAA
>Q5U924 4.2.1.157~~~hadB~~~(R)-2-hydroxyisocaproyl-CoA dehydratase alpha subunit~~~
MSEKKEARVVINDLLAEQYANAFKAKEEGRPVGWSTSVFPQELAEVFDLNVLYPENQAAGVAAKKGSLELCEIAESKGYS
IDLCAYARTNFGLLENGGCEALDMPAPDFLLCCNNICNQVIKWYENISRELDIPLIMIDTTFNNEDEVTQSRIDYIKAQF
EEAIKQLEIISGKKFDPKKFEEVMKISAENGRLWKYSMSLPADSSPSPMNGFDLFTYMAVIVCARGKKETTEAFKLLIEE
LEDNMKTGKSSFRGEEKYRIMMEGIPCWPYIGYKMKTLAKFGVNMTGSVYPHAWALQYEVNDLDGMAVAYSTMFNNVNLD
RMTKYRVDSLVEGKCDGAFYHMNRSCKLMSLIQYEMQRRAAEETGLPYAGFDGDQADPRAFTNAQFETRIQGLVEVMEER
KKLNRGEI
>Q5U923 4.2.1.157~~~hadC~~~(R)-2-hydroxyisocaproyl-CoA dehydratase beta subunit~~~
MEAILSKMKEVVENPNAAVKKYKSETGKKAIGCFPVYCPEEIIHAAGMLPVGIWGGQTELDLAKQYFPAFACSIMQSCLE
YGLKGAYDELSGVIIPGMCDTLICLGQNWKSAVPHIKYISLVHPQNRKLEAGVKYLISEYKGVKRELEEICGYEIEEAKI
HESIEVYNEHRKTMRDFVEVAYKHSNTIKPSIRSLVIKSGFFMRKEEHTELVKDLIAKLNAMPEEVCSGKKVLLTGILAD
SKDILDILEDNNISVVADDLAQETRQFRTDVPAGDDALERLARQWSNIEGCSLAYDPKKKRGSLIVDEVKKKDIDGVIFC
MMKFCDPEEYDYPLVRKDIEDSGIPTLYVEIDQQTQNNEQARTRIQTFAEMMSLA
>O06652 3.8.1.10~~~~~~2-haloacid dehalogenase, configuration-inverting~~~
MSHRPILKNFPQVDHHQASGKLGDLYNDIHDTLRVPWVAFGIRVMSQFEHFVPAAWEALKPQISTRYAEEGADKVREAAI
IPGSAPANPTPALLANGWSEEEIAKLKATLDGLNYGNPKYLILISAWNEAWHGRDAGGGAGKRLDSVQSERLPYGLPQGV
EKFHLIDPEAADDQVQCLLRDIRDAFLHHGPASDYRVLAAWPDYLEIAFRDTLKPVALTTEFELTTSRIRKIAREHVRGF
DGAGGVAWRDMADRMTPEEIAGLTGVLFMYNRFIADITVAIIRLKQAFGSAEDATENKFRVWPTEKG
>Q52086 3.8.1.9~~~hadD~~~(R)-2-haloacid dehalogenase~~~
MNLPDNSIHLQLPRPVCEAIIRPVPEHRADQELSEIYRDLKATFGVPWVGVITQAVAYYRPFFAEAWRRFAPSAKTHFFE
RASDDIRIRSWELMGQSFVIEGQTDRLREMGYSVREIGQIRAVLDIFDYGNPKYLIFATAIKEGLLSGRTFGGAAGDARC
HFPRSPICQIDPIPVMVEEHHAGGTLSQVYADIKQTLQLPFINSDYKAMARWPSYLEQAWGALKPCIDTPAYQAGRFDIN
ARALAALDALPTAYRMSRDDALQAGLSEAQTDELIQVISLFQWMLSGLVLNVTHFKQQALK
>Q8KLS9 3.8.1.9~~~dehI~~~(R)-2-haloacid dehalogenase~~~
MIDLPRHPPSMLPVIRTVPEHAATGELKRRYDAVKSAFDVPWMGVVAMAHTQYPRFFDALWEGFEPIAGTRAFQDACRAM
RAATEAGVERSLGISPLAHRLQDLGYDPREIGEIRTIIEVFSHGNYPYILLATVSRYLLSGGDLSGEPQVFETSPRSPHI
FHQPILMEPHHADEHTRGIFADIQATLALPILNTDYRALARWPSYFHLAWAELRPLIRTPSHAALSQQLHEQAIAVLRTL
PNPARLKGDMVTRGCGR
>Q5U925 3.-.-.-~~~hadI~~~2-hydroxyisocaproyl-CoA dehydratase activator~~~
MYTMGLDIGSTASKGVILKNGEDIVASETISSGTGTTGPSRVLEKLYGKTGLAREDIKKVVVTGYGRMNYSDADKQISEL
SCHARGVNFIIPETRTIIDIGGQDAKVLKLDNNGRLLNFLMNDKCAAGTGRFLDVMAKIIEVDVSELGSISMNSQNEVSI
SSTCTVFAESEVISHLSENAKIEDIVAGIHTSVAKRVSSLVKRIGVQRNVVMVGGVARNSGIVRAMAREINTEIIVPDIP
QLTGALGAALYAFDEAKESQKEVKNI
>Q52087 3.8.1.2~~~hadL~~~(S)-2-haloacid dehalogenase~~~
MKNIQGIVFDLYGTLYDVHSVVQACEEVYPGQGDAISRLWRQKQLEYTWLRSLMGRYVNFEKATEDALRFTCTHLGLSLD
DETHQRLSDAYLHLTPYADTADAVRRLKAAGLPLGIISNGSHCSIEQVVTNSEMNWAFDQLISVEDVQVFKPDSRVYSLA
EKRMGFPKENILFVSSNAWDASAASNFGFPVCWINRQNGAFDELDAKPTHVVRNLAEMSNWLVNSLD
>P60527 3.8.1.2~~~~~~(S)-2-haloacid dehalogenase~~~
MKHIKAIAFDLYGTLFDVHSVIDQCEKRFPGRGREVSTLWRQKQLEYTWLRSLMNRYVTFEQATEDALRYTCRHLGFALD
DAACTVLCDAYLRLQAFPEVPPRLRELRNRGLQLAVLSNGSPHSIGAVVGNAGIRDEFDHLISVDPVRVYKPHDRAYGLA
EEAFGLARTSILFVSSNGWDATGARYFGFPTCWINRGGNVFEEMGQTPDWDLLGIDEIVRLFDSAEGSAPSV
>Q53464 3.8.1.2~~~~~~(S)-2-haloacid dehalogenase~~~
MDYIKGIAFDLYGTLFDVHSVVGRCDEAFPGRGREISALWRQKQLEYTWLRSLMNRYVNFQQATEDALRFTCRHLGLDLD
ARTRSTLCDAYLRLAPFSEVPDSLRELKRRGLKLAILSNGSPQSIDAVVSHAGLRDGFDHLLSVDPVQVYKPDNRVYELA
EQALGLDRSAILFVSSNAWDATGARYFGFPTCWINRTGNVFEEMGQTPDWEVTSLRAVVELFETAAGKAEKG
>O87871 1.1.1.368~~~had~~~6-hydroxycyclohex-1-ene-1-carbonyl-CoA dehydrogenase~~~
MAAKSSVSSWRSEMSSNPHRWMMTSPGAPMVRAEFEIGELSADQVVVAVAGCGVCHTDLGYYYDSVRTNHALPLALGHEI
SGRVVQAGANAAQWLGRAVIVPAVMPCGTCELCTSGHGTICRDQVMPGNDIQGGFASHVVVPARGLCPVDEARLAAAGLQ
LADVSVVADAVTTPYQAVLQAGVEPGDVAVVIGVGGVGGYAVQIANAFGASVVAIDVDPAKLEMMSKHGAALTLNAREIS
GRDLKKAIEAHAKANGLRLTRWKIFECSGTGAGQTSAYGLLTHGATLAVVGFTMDKVEVRLSNLMAFHARALGNWGCLPE
YYPAALDLVLDKKIDLASFIERHPLDQIGEVFAAAHAHKLTRRAILTP
>Q60099 3.8.1.2~~~dhlB~~~(S)-2-haloacid dehalogenase~~~
MIKAVVFDAYGTLFDVQSVADATERAYPGRGEYITQVWRQKQLEYSWLRALMGRYADFWGVTREALAYTLGTLGLEPDES
FLADMAQAYNRLTPYPDAAQCLAELAPLKRAILSNGAPDMLQALVANAGLTDSFDAVISVDAKRVFKPHPDSYALVEEVL
GVTPAEVLFVSSNGFDVGGAKNFGFSVARVARLSQEALARELVSGTIAPLTMFKALRMREETYAEAPDFVVPALGDLPRL
VRGMAGAHLAPAV
>P59915 ~~~hagA~~~Hemagglutinin A~~~COG1974
MRKLNSLFSLAVLLSLLCWGQTAAAQGGPKTAPSVTHQAVQKGIRTSKAKDLRDPIPAGMARIILEAHDVWEDGTGYQML
WDADHNQYGASIPEESFWFANGTIPAGLYDPFEYKVPVNADASFSPTNFVLDGTASADIPAGTYDYVIINPNPGIIYIVG
EGVSKGNDYVVEAGKTYHFTVQRQGPGDAASVVVTGEGGNEFAPVQNLQWSVSGQTVTLTWQAPASDKRTYVLNESFDTQ
TLPNGWTMIDADGDGHNWLSTINVYNTATHTGDGAMFSKSWTASSGAKIDLSPDNYLVTPKFTVPENGKLSYWVSSQEPW
TNEHYGVFLSTTGNEAANFTIKLLEETLGSGKPAPMNLVKSEGVKAPAPYQERTIDLSAYAGQQVYLAFRHFGCTGIFRL
YLDDVAVSGEGSSNDYTYTVYRDNVVIAQNLTATTFNQENVAPGQYNYCVEVKYTAGVSPKVCKDVTVEGSNEFAPVQNL
TGSAVGQKVTLKWDAPNGTPNPNPGTTTLSESFENGIPASWKTIDADGDGNNWTTTPPPGGSSFAGHNSAICVSSASYIN
FEGPQNPDNYLVTPELSLPNGGTLTFWVCAQDANYASEHYAVYASSTGNDASNFANALLEEVLTAKTVVTAPEAIRGTRV
QGTWYQKTVQLPAGTKYVAFRHFGCTDFFWINLDDVEIKANGKRADFTETFESSTHGEAPAEWTTIDADGDGQGWLCLSS
GQLGWLTAHGGTNVVASFSWNGMALNPDNYLISKDVTGATKVKYYYAVNDGFPGDHYAVMISKTGTNAGDFTVVFEETPN
GINKGGARFGLSTEANGAKPQSVWIERTVDLPAGTKYVAFRHYNCSDLNYILLDDIQFTMGGSPTPTDYTYTVYRDGTKI
KEGLTETTFEEDGVATGNHEYCVEVKYTAGVSPKECVNVTVDPVQFNPVQNLTGSAVGQKVTLKWDAPNGTPNPNPGTTT
LSESFENGIPASWKTIDADGDGNNWTTTPPPGGTSFAGHNSAICVSSASYINFEGPQNPDNYLVTPELSLPNGGTLTFWV
CAQDANYASEHYAVYASSTGNDASNFANALLEEVLTAKTVVTAPEAIRGTRVQGTWYQKTVQLPAGTKYVAFRHFGCTDF
FWINLDDVEIKANGKRADFTETFESSTHGEAPAEWTTIDADGDGQGWLCLSSGQLDWLTAHGGTNVVASFSWNGMALNPD
NYLISKDVTGATKVKYYYAVNDGFPGDHYAVMISKTGTNAGDFTVVFEETPNGINKGGARFGLSTEANGAKPQSVWIERT
VDLPAGTKYVAFRHYNCSDLNYILLDDIQFTMGGSPTPTDYTYTVYRDGTKIKEGLTETTFEEDGVATGNHEYCVEVKYT
AGVSPKECVNVTVDPVQFNPVQNLTGSAVGQKVTLKWDAPNGTPNPNPGTTTLSESFENGIPASWKTIDADGDGNNWTTT
PPPGGTSFAGHNSAICVSSASYINFEGPQNPDNYLVTPELSLPNGGTLTFWVCAQDANYASEHYAVYASSTGNDASNFAN
ALLEEVLTAKTVVTAPEAIRGTRVQGTWYQKTVQLPAGTKYVAFRHFGCTDFFWINLDDVEIKANGKRADFTETFESSTH
GEAPAEWTTIDADGDGQGWLCLSSGQLGWLTAHGGTNVVASFSWNGMALNPDNYLISKDVTGATKVKYYYAVNDGFPGDH
YAVMISKTGTNAGDFTVVFEETPNGINKGGARFGLSTEANGAKPQSVWIERTVDLPAGTKYVAFRHYNCSDLNYILLDDI
QFTMGGSPTPTDYTYTVYRDGTKIKEGLTETTFEEDGVATGNHEYCVEVKYTAGVSPKECVNVTINPTQFNPVQNLTAEQ
APNSMDAILKWNAPASKRAEVLNEDFENGIPASWKTIDADGDGNNWTTTPPPGGSSFAGHNSAICVSSASYINFEGPQNP
DNYLVTPELSLPGGGTLTFWVCAQDANYASEHYAVYASSTGNDASNFANALLEEVLTAKTVVTAPEAIRGTRVQGTWYQK
TVQLPAGTKYVAFRHFGCTDFFWINLDDVVITSGNAPSYTYTIYRNNTQIASGVTETTYRDPDLATGFYTYGVKVVYPNG
ESAIETATLNITSLADVTAQKPYTLTVVGKTITVTCQGEAMIYDMNGRRLAAGRNTVVYTAQGGHYAVMVVVDGKSYVEK
LAVK
>Q50925 1.7.2.6~~~hao1~~~Hydroxylamine oxidoreductase~~~COG3303
MRIGEWMRGLLLCAGLMMCGVVHADISTVPDETYDALKLDRGKATPKETYEALVKRYKDPAHGAGKGTMGDYWEPIAISI
YMDPNTFYKPPVSPKEVAERKDCVECHSDETPVWVRAWKRSTHANLDKIRNLKSDDPLYYKKGKLEEVENNLRSMGKLGE
KETLKEVGCIDCHVDVNKKDKADHTKDIRMPTADTCGTCHLREFAERESERDTMVWPNGQWPAGRPSHALDYTANIETTV
WAAMPQREVAEGCTMCHTNQNKCDNCHTRHEFSAAESRKPEACATCHSGVDHNNWEAYTMSKHGKLAEMNRDKWNWEVRL
KDAFSKGGQNAPTCAACHMEYEGEYTHNITRKTRWANYPFVPGIAENITSDWSEARLDSWVLTCTQCHSERFARSYLDLM
DKGTLEGLAKYQEANAIVHKMYEDGTLTGQKTNRPNPPEPEKPGFGIFTQLFWSKGNNPASLELKVLEMAENNLAKMHVG
LAHVNPGGWTYTEGWGPMNRAYVEIQDEYTKMQELSALQARVNKLEGKQTSLLDLKGTGEKISLGGLGGGMLLAGALALI
GWRKRKQTRA
>P45387 3.4.21.-~~~hap~~~Adhesion and penetration protein autotransporter~~~
MKKTVFRLNFLTACISLGIVSQAWAGHTYFGIDYQYYRDFAENKGKFTVGAQNIKVYNKQGQLVGTSMTKAPMIDFSVVS
RNGVAALVENQYIVSVAHNVGYTDVDFGAEGNNPDQHRFTYKIVKRNNYKKDNLHPYEDDYHNPRLHKFVTEAAPIDMTS
NMNGSTYSDRTKYPERVRIGSGRQFWRNDQDKGDQVAGAYHYLTAGNTHNQRGAGNGYSYLGGDVRKAGEYGPLPIAGSK
GDSGSPMFIYDAEKQKWLINGILREGNPFEGKENGFQLVRKSYFDEIFERDLHTSLYTRAGNGVYTISGNDNGQGSITQK
SGIPSEIKITLANMSLPLKEKDKVHNPRYDGPNIYSPRLNNGETLYFMDQKQGSLIFASDINQGAGGLYFEGNFTVSPNS
NQTWQGAGIHVSENSTVTWKVNGVEHDRLSKIGKGTLHVQAKGENKGSISVGDGKVILEQQADDQGNKQAFSEIGLVSGR
GTVQLNDDKQFDTDKFYFGFRGGRLDLNGHSLTFKRIQNTDEGAMIVNHNTTQAANVTITGNESIVLPNGNNINKLDYRK
EIAYNGWFGETDKNKHNGRLNLIYKPTTEDRTLLLSGGTNLKGDITQTKGKLFFSGRPTPHAYNHLNKRWSEMEGIPQGE
IVWDHDWINRTFKAENFQIKGGSAVVSRNVSSIEGNWTVSNNANATFGVVPNQQNTICTRSDWTGLTTCQKVDLTDTKVI
NSIPKTQINGSINLTDNATANVKGLAKLNGNVTLTNHSQFTLSNNATQIGNIRLSDNSTATVDNANLNGNVHLTDSAQFS
LKNSHFSHQIQGDKGTTVTLENATWTMPSDTTLQNLTLNNSTITLNSAYSASSNNTPRRRSLETETTPTSAEHRFNTLTV
NGKLSGQGTFQFTSSLFGYKSDKLKLSNDAEGDYILSVRNTGKEPETLEQLTLVESKDNQPLSDKLKFTLENDHVDAGAL
RYKLVKNDGEFRLHNPIKEQELHNDLVRAEQAERTLEAKQVEPTAKTQTGEPKVRSRRAARAAFPDTLPDQSLLNALEAK
QAELTAETQKSKAKTKKVRSKRAVFSDPLLDQSLFALEAALEVIDAPQQSEKDRLAQEEAEKQRKQKDLISRYSNSALSE
LSATVNSMLSVQDELDRLFVDQAQSAVWTNIAQDKRRYDSDAFRAYQQQKTNLRQIGVQKALANGRIGAVFSHSRSDNTF
DEQVKNHATLTMMSGFAQYQWGDLQFGVNVGTGISASKMAEEQSRKIHRKAINYGVNASYQFRLGQLGIQPYFGVNRYFI
ERENYQSEEVRVKTPSLAFNRYNAGIRVDYTFTPTDNISVKPYFFVNYVDVSNANVQTTVNLTVLQQPFGRYWQKEVGLK
AEILHFQISAFISKSQGSQLGKQQNVGVKLGYRW
>Q93TJ5 1.14.13.84~~~hapE~~~4-hydroxyacetophenone monooxygenase~~~
MSAFNTTLPSLDYDDDTLREHLQGADIPTLLLTVAHLTGDLQILKPNWKPSIAMGVARSGMDLETEAQVREFCLQRLIDF
RDSGQPAPGRPTSDQLHILGTWLMGPVIEPYLPLIAEEAVTAEEDLRAPRWHKDHVASGRDFKVVIIGAGESGMIAALRF
KQAGVPFVIYEKGNDVGGTWRENTYPGCRVDINSFWYSFSFARGIWDDCFAPAPQVFAYMQAVAREHGLYEHIRFNTEVS
DAHWDESTQRWQLLYRDSEGQTQVDSNVVVFAVGQLNRPMIPAIPGIETFKGPMFHSAQWDHDVDWSGKRVGVIGTGASA
TQFIPQLAQTAAELKVFARTTNWLLPTPDLHEKISDSCKWLLAHVPHYSLWYRVAMAMPQSVGFLEDVMVDVGYPPTELA
VSARNDRLRQDISAWMEPQFADRPDLREVLIPDSPVGGKRIVRDNGTWISTLKRDNVSMIRQPIEVITPKGICCVDGTEH
EFDLIVYGTGFHASKFLMPINVTGRDGVALHDVWKGDDARAYLGMTVPQFPNMFCMYGPNTGLVVYSTVIQFSEMTASYI
VDAVRLLLEGGHQSMEVKTPVFESYNQRVDEGNALRAWGFSKVNSWYKNSKGRVTQNFPFTAVEFWQRTHSVEPTDYQLG
>P24153 3.4.24.-~~~hap~~~Hemagglutinin/proteinase~~~COG3227
MKMIQRPLNWLVLAGAATGFPLYAAQMVTIDDASMVEQALAQQQYSMMPAASGFKAVNTVQLPNGKVKVRYQQMYNGVPV
YGTVVVATESSKGISQVYGQMAQQLEADLPTVTPDIESQQAIALAVSHFGEQHAGESLPVENESVQLMVRLDDNQQAQLV
YLVDFFVASETPSRPFYFISAETGEVLDQWDGINHAQATGTGPGGNQKTGRYEYGSNGLPGFTIDKTGTTCTMNNSAVKT
VNLNGGTSGSTAFSYACNNSTNYNSVKTVNGAYSPLNDAHFFGKVVFDMYQQWLNTSPLTFQLTMRVHYGNNYENAFWDG
RAMTFGDGYTRFYPLVDINVSAHEVSHGFTEQNSGLVYRDMSGGINEAFSDIAGEAAEYFMRGNVDWIVGADIFKSSGGL
RYFDQPSRDGRSIDHASQYYSGIDVHHSSGVFNRAFYLLANKSGWNVRKGFEVFAVANQLYWTPNSTFDQGGCGVVKAAQ
DLNYNTADVVAAFNTVGVNASCGTTPPPVGKVLEKGKPITGLSGSRGGEDFYTFTVTNSGSVVVSISGGTGDADLYVKAG
SKPTTSSWDCRPYRSGNAEQCSISAVVGTTYHVMLRGYSNYSGVTLRLD
>Q54450 ~~~hasA~~~Hemophore HasA~~~
MAFSVNYDSSFGGYSIHDYLGQWASTFGDVNHTNGNVTDANSGGFYGGSLSGSQYAISSTANQVTAFVAGGNLTYTLFNE
PAHTLYGQLDSLSFGDGLSGGDTSPYSIQVPDVSFGGLNLSSLQAQGHDGVVHQVVYGLMSGDTGALETALNGILDDYGL
SVNSTFDQVAAATAVGVQHADSPELLAA
>Q7BLV3 2.4.1.212~~~hyaD~~~Hyaluronan synthase~~~
MNTLSQAIKAYNSNDYQLALKLFEKSAEIYGRKIVEFQITKCKEKLSAHPSVNSAHLSVNKEEKVNVCDSPLDIATQLLL
SNVKKLVLSDSEKNTLKNKWKLLTEKKSENAEVRAVALVPKDFPKDLVLAPLPDHVNDFTWYKKRKKRLGIKPEHQHVGL
SIIVTTFNRPAILSITLACLVNQKTHYPFEVIVTDDGSQEDLSPIIRQYENKLDIRYVRQKDNGFQASAARNMGLRLAKY
DFIGLLDCDMAPNPLWVHSYVAELLEDDDLTIIGPRKYIDTQHIDPKDFLNNASLLESLPEVKTNNSVAAKGEGTVSLDW
RLEQFEKTENLRLSDSPFRFFAAGNVAFAKKWLNKSGFFDEEFNHWGGEDVEFGYRLFRYGSFFKTIDGIMAYHQEPPGK
ENETDREAGKNITLDIMREKVPYIYRKLLPIEDSHINRVPLVSIYIPAYNCANYIQRCVDSALNQTVVDLEVCICNDGST
DNTLEVINKLYGNNPRVRIMSKPNGGIASASNAAVSFAKGYYIGQLDSDDYLEPDAVELCLKEFLKDKTLACVYTTNRNV
NPDGSLIANGYNWPEFSREKLTTAMIAHHFRMFTIRAWHLTDGFNEKIENAVDYDMFLKLSEVGKFKHLNKICYNRVLHG
DNTSIKKLGIQKKNHFVVVNQSLNRQGITYYNYDEFDDLDESRKYIFNKTAEYQEEIDILKDIKIIQNKDAKIAVSIFYP
NTLNGLVKKLNNIIEYNKNIFVIVLHVDKNHLTPDIKKEILAFYHKHQVNILLNNDISYYTSNRLIKTEAHLSNINKLSQ
LNLNCEYIIFDNHDSLFVKNDSYAYMKKYDVGMNFSALTHDWIEKINAHPPFKKLIKTYFNDNDLKSMNVKGASQGMFMT
YALAHELLTIIKEVITSCQSIDSVPEYNTEDIWFQFALLILEKKTGHVFNKTSTLTYMPWERKLQWTNEQIESAKRGENI
PVNKFIINSITL
>P96274 2.3.1.48~~~~~~Probable histone acetyltransferase Rv0428c~~~COG0456
MVSWPGLGTRVTVRYRRPAGSMPPLTDAVGRLLAVDPTVRVQTKTGTIVEFSPVDVVALRVLTDAPVRTAAIRALEHAAA
AAWPGVERTWLDGWLLRAGHGAVLAANSAVPLDISAHTNTITEISAWYASRDLQPWLAVPDRLLPLPADLAGERREQVLV
RDVSTGEPDRSVTLLDHPDDTWLRLYHQRLPLDMATPVIDGELAFGSYLGVAVARAAVTDAPDGTRWVGLSAMRAADEQS
ATGSAGRQLWEALLGWGAGRGATRGYVRVHDTATSVLAESLGFRLHHHCRYLPAQSVGWDTF
>P9WKY5 2.3.1.48~~~~~~Histone acetyltransferase~~~
MSFGFPTFSQNRFTEQYSGLCPIAPGRGAGLQPCRRDCPVARWLVADHPVFGSDCRCRMMVGVNRVRIGRHELTGA
>P45856 1.1.1.157~~~mmgB~~~Probable 3-hydroxybutyryl-CoA dehydrogenase~~~COG1250
MEIKQIMVAGAGQMGSGIAQTAADAGFYVRMYDVNPEAAEAGLKRLKKQLARDAEKGKRTETEVKSVINRISISQTLEEA
EHADIVIEAIAENMAAKTEMFKTLDRICPPHTILASNTSSLPITEIAAVTNRPQRVIGMHFMNPVPVMKLVEVIRGLATS
EETALDVMALAEKMGKTAVEVNDFPGFVSNRVLLPMINEAIYCVYEGVAKPEAIDEVMKLGMNHPMGPLALADFIGLDTC
LSIMEVLHSGLGDSKYRPCPLLRKYVKAGWLGKKSGRGFYDYEEKTS
>P52041 1.1.1.157~~~hbd~~~3-hydroxybutyryl-CoA dehydrogenase~~~COG1250
MKKVCVIGAGTMGSGIAQAFAAKGFEVVLRDIKDEFVDRGLDFINKNLSKLVKKGKIEEATKVEILTRISGTVDLNMAAD
CDLVIEAAVERMDIKKQIFADLDNICKPETILASNTSSLSITEVASATKRPDKVIGMHFFNPAPVMKLVEVIRGIATSQE
TFDAVKETSIAIGKDPVEVAEAPGFVVNRILIPMINEAVGILAEGIASVEDIDKAMKLGANHPMGPLELGDFIGLDICLA
IMDVLYSETGDSKYRPHTLLKKYVRAGWLGRKSGKGFYDYSK
>Q0AVM2 1.1.1.35~~~~~~3-hydroxybutyryl-CoA dehydrogenase~~~COG1250
MKIMVLGAGTMGAGIVQTAAQAGFEVVVRDIKQEFVDRGIAGIDKLLSKNVDKGRMTAEDKAAVMGRISGTVDMGAAADC
DLVIEAALEVMDIKKAIFKELDSICKPECILASNTSALSVTEIAAATGRADKVIGMHFFNPVPAMKLVEVIRGASTSQAT
YDAIKDLSVKMGKSPVEINEAPGFVVNRLLIPMLNEGMYCLMEGVANAADIDTSMKFGAGHPMGPLALADMIGLDICLKI
METLYKEFGDPKYRPCPLLAKMVRANKLGRKTGEGFFAY
>A1KFU9 ~~~hbhA~~~Heparin-binding hemagglutinin~~~
MAENSNIDDIKAPLLAALGAADLALATVNELITNLRERAEETRTDTRSRVEESRARLTKLQEDLPEQLTELREKFTAEEL
RKAAEGYLEAATSRYNELVERGEAALERLRSQQSFEEVSARAEGYVDQAVELTQEALGTVASQTRAVGERAAKLVGIELP
KKAAPAKKAAPAKKAAPAKKAAAKKAPAKKAAAKKVTQK
>A5TZK3 ~~~hbhA~~~Heparin-binding hemagglutinin~~~
MAENSNIDDIKAPLLAALGAADLALATVNELITNLRERAEETRTDTRSRVEESRARLTKLQEDLPEQLTELREKFTAEEL
RKAAEGYLEAATSRYNELVERGEAALERLRSQQSFEEVSARAEGYVDQAVELTQEALGTVASQTRAVGERAAKLVGIELP
KKAAPAKKAAPAKKAAPAKKAAAKKAPAKKAAAKKVTQK
>P9WIP9 ~~~hbhA~~~Heparin-binding hemagglutinin~~~
MAENSNIDDIKAPLLAALGAADLALATVNELITNLRERAEETRTDTRSRVEESRARLTKLQEDLPEQLTELREKFTAEEL
RKAAEGYLEAATSRYNELVERGEAALERLRSQQSFEEVSARAEGYVDQAVELTQEALGTVASQTRAVGERAAKLVGIELP
KKAAPAKKAAPAKKAAPAKKAAAKKAPAKKAAAKKVTQK
>P80172 ~~~hblA~~~Hemolysin BL-binding component~~~
MIKKIPYKLLAVSTLLTITTANVVSPVATFASEIEQTNNGDTALSANEAKMKETLQKAGLFAKSMNAYSYMLIKNPDVNF
EGITINGYVDLPGRIVQDQKNARAHAVTWDTKVKKQLLDTLTGIVEYDTTFDNYYETMVEAINTGDGETLKEGITDLRGE
IQQNQKYAQQLIEELTKLRDSIGHDVRAFGSNKELLQSILKNQGADVDADQKRLEEVLGSVNYYKQLESDGFNVMKGAIL
GLPIIGGIIVGVARDNLGKLEPLLAELRQTVDYKVTLNRVVGVAYSNINEIDKALDDAINALTYMSTQWHDLDSQYSGVL
GHIENAAQKADQNKFKFLKPNLNAAKDSWKTLRTDAVTLKEGIKELKVETVTPQK
>O05690 3.1.1.22~~~~~~D-(-)-3-hydroxybutyrate oligomer hydrolase~~~
MKTMQGKGSGRRLRGALLVTMAASGAIGLAGCGGSNDNTTTTTPTNVKPSFVGTVTVTHFDGVSDDLLTAGLGAAGLASA
TAPTVANATAPTAAELRRLAIYNNYRALVDTNAKGGYGTLYGPNVDASGNVTSGSGMVPGVEYVAYSDDGSGQQNVVLLV
QIPDAFDAANPCIITATSSGSRGIYGAISTGEWGLKRKCAVAYTDKGTRAGPHDLATDTVPLQDGTRTTRAAAGSKAQFA
APLTDTQLAAFNLATPNRLAFKHAHSQRNPEKDWGRFTLQAVQFAFWAINDKLSGGSAPNGSALAVRPDNTIVIASSVSN
GGGAAIAAAEQDTTHLIDGVAVGEPGLNLPASANVQVQRGGVTLPVTGKPLFDYVSYANAFRLCAALSSSVSGAPTQSFF
AGNIGWPASVQANRCAALHANGLLSSTTTAAQADEALQKMRTYGWEPESDLVHASMAYFEIDPSVATTFGNALARASVLD
NLCNFSFAAVDTSFHPTTVNATALAQLASTGNGIPPTTGVQLINNLAQGGATQSKQSVDSSGTQAANLDGALCLRKLLTG
ADAASQALQLGISQTLRTGNLGGRPALIVQGRNDALLPVNHGARPYLGLNAQVDTSSKLSYIEVTNAQHFDGFIDLVPGY
DTLFVPLVLYEQRALDAVYANLKNGTPLPPSQVVRTTPRGGTAGSAPAIAATNVPNFTNTPAVADRISVSVSGGVATVSV
PN
>Q9X6X9 3.1.1.22~~~~~~D-(-)-3-hydroxybutyrate oligomer hydrolase~~~
MKTIQGKSPGRWYSRGMLLAAMAASGVIGLAACGGGNDGNSAGNNGNAGGNGNNNGNNNGNTVSNTKPSFVGTVTVRRFD
GVSDDLLTAGLGASGLASATAPAVANAVAPTAAELRRLTIYNNYRALIDTSAKGGYGTLYGPNVDADGNVTSGNGMVAGA
EYVAYPDDGSGQQNVVLLVQIPDAFDAAHPCIITATSSGSRGIYGAISTGEWGLKRKCAVAYTDKGTGAGPHDLATDTVP
LQDGTRTTRTLAGNTAQFAAPLAASRLAAFNVATPNRLAFKHAHSQRNPEKDWGLFTLQAVQFAFWAINDKLGISSGQTV
SQLPVRPGNTIVIASSVSNGGGAAIAAAEQDTGNLIDGVAVGEPALSLPSSINVQVKRGGASLPINGKPLFDYVSYANEF
RLCAALSASVASAPTQAYFGAALGWPASVQANRCAALHAKGLLSSTTTAAQADEALQKMRDYGWEPESDLLHASMAYFEI
DPSVATTFGNALARASVFDNLCDLSFAAVDGSFHPATMNATVLAQLAATGNGVPPTTGVQLINNIAQGGAAQSRQSIDSS
GTQAANLDGALCLRNLLSGSDAASQALQLGLSQTLRSGNLRGKPALIVQGRNDALLPVNHGARPYLGLNAQVDGSSKLSY
IEVTNAQHFDGFIDLLPGYDSLFVPLAVYEQRALDAVYANLRSGTPLPPSQVVRTTPRGGAAGAAPPITAANVPNFTMTP
AAGDRIQVSVSGGVATVSVPN
>Q0K9H3 3.1.1.22~~~phaZ2~~~D-(-)-3-hydroxybutyrate oligomer hydrolase~~~
MHSTQIPPQQKQKRRLRLTVLAAAASMLAAACVSGDDNNNGNGSNPNTKPANIGTVTINSYNGTTDDLLTAGLGKDGLAS
ATAPLPANPTAPTAAELRRYAIHTNYRAIVDTTASGGYGSLYGPNVDAQGNVTGSDGKVAGVEYLAFSDDGSGQQNVTML
VQIPASFNTSKPCMITATSSGSRGVYGAIATGEWGLKRGCAVAYTDKGTGAAPHDLDTDTVPLIDGTRATRAAAGKNAQF
AAPAGATSLADFTAANPHRLAFKHAHSQRNPEKDWGKFTLQAVEFAIWAINDRFGAVSANGTRQRTLDKDRIVVIASSVS
NGGGAAVAAAEQDAGGLIDGVAVGEPNLNMPPNTGIVVQRGATPVAASGRTLYDYTTTANLLQHCAARATALTQAPFYTN
PATATFFANRCQTLAEKGLVSGANTDEQSASALQALHDAGWEAESDDLHPSLAVFDVAAAISVNYANAYAQASVTDRLCG
YSFASTLTDLKPAAIAPAALASMFATGNGVPPQPPVQLINDLDPQHGPYLNLASVSPSTLREDLNYDGANCLRSLLAGSD
AAARALQAGQALTLRNGNLRGKPAVIVHGRSDGLLPVNHTSRPYLGLNRQQEGVTSKLSYVEVENAQHFDAFIGLVPGYS
NRYVPLHVYLNRALDAVYDNLTAGKALPPSQVLRTTPRGGTLNTPAPALLPSNVPPFAASPAAGNAITVNANAVQVPD
>Q8Y585 ~~~hbp1~~~Hemin/hemoglobin-binding protein 1~~~COG5386
MKKVLVFAAFIVLFSFSFLSTGLTAQAALKDGTYSVDYTVIQGDSDSASMANDYFDKPATVTVNGGKSTVSLQVNHSKWI
TGLWVEGNAVSVTSKNASSDTRKVSFPVSTLSNPVNAKIKVDIDDDDLNYHHEYQIKLRFDEGSAKALAGAVKSSDNNTT
TPATKSDSSNKVTNPKSSDSSQMFLYGIIFVATGAGLILLKRRAIFK
>Q7AP54 ~~~hbp2~~~Hemin/hemoglobin-binding protein 2~~~COG5386
MKKLWKKGLVAFLALTLIFQLIPGFASAADSRLKDGGEYQVQVNFYKDNTGKTTKESSEADKYIDHTATIKVENGQPYMY
LTITNSTWWQTMAVSKNGTRPEKPAQADVYQDRYEDVQTVSTDAAKDTRVEKFKLSSLDDVIFSYMHIKVDAISYDHWYQ
VDLTIDPSTFKVISEPAVTTPVTLSDGIYTIPFVAKKANDDSNSSMQNYFNNPAWLKVKNGKKMVAMTVNDNKTVTALKT
TLAGTLQDVKVVSEDKDANTRIVEFEVEDLNQPLAAHVNYEAPFNGSVYKGQADFRYVFDTAKATAASSYPGSDETPPVV
NPGETNPPVTKPDPGTTNPPVTTPPTTPSKPAVVDPKNLLNNHTYSIDFDVFKDGTTETSMMESYVMKPALIKVENNQPY
VYLTLTNSSWIKTFQYKVNGVWKDMEVVSGDINKNTRTVKYPVKDGTANTDVKTHVLIEDMPGFSYDHEYTVQVKLNAAT
IKDITGKDVTLKEPVKKDILNTGNVASNNNAGPKLAKPDFDDTNSVQKTASKTEKNAKTNDSSSMVWYITLFGASFLYLA
YRLKRKRLS
>P33950 ~~~hbpA~~~Heme-binding protein A~~~COG0747
MKLKATLTLAAATLVLAACDQSSSANKSTAQTEAKSSSNNTFVYCTAKAPLGFSPALIIEGTSYNASSQQVYNRLVEFKK
GSTDIEPALAESWEISDDGLSYTFHLRKGVKFHTTKEFTPTRDFNADDVVFSFQRQLDPNHPYHNVSKGTYPYFKAMKFP
ELLKSVEKVDDNTIRITLNKTDATFLASLGMDFISIYSAEYADSMLKAGKPETLDSRPVGTGPFVFVDYKTDQAIQYVAH
ENYWKGRTPLDRLVISIVPDATTRYAKLQAGTCDLILFPNVADLAKMKTDPKVQLLEQKGLNVAYIAFNTEKAPFDNVKV
RQALNYAVDKKAIIEAVYQGAGTSAKNPLPPTIWSYNDEIQDYPYDPEKAKQLLAEAGYPNGFETDFWIQPVIRASNPNP
KRMAELIMADWAKIGVKTNPVTYEWADYRKRAKEGELTAGIFGWSGDNGDPDNFLSPLLGSSNIGNSNMARFNNSEFDAL
LNEAIGLTNKEERAKLYKQAQVIVHNQAPWIPVAHSVGFAPLSPRVKGYVQSPFGYDAFYGVSVDGK
>O88093 3.4.21.-~~~hbp~~~Hemoglobin-binding protease hbp autotransporter~~~
MNRIYSLRYSAVARGFIAVSEFARKCVHKSVRRLCFPVLLLIPVLFSAGSLAGTVNNELGYQLFRDFAENKGMFRPGATN
IAIYNKQGEFVGTLDKAAMPDFSAVDSEIGVATLINPQYIASVKHNGGYTNVSFGDGENRYNIVDRNNAPSLDFHAPRLD
KLVTEVAPTAVTAQGAVAGAYLDKERYPVFYRLGSGTQYIKDSNGQLTKMGGAYSWLTGGTVGSLSSYQNGEMISTSSGL
VFDYKLNGAMPIYGEAGDSGSPLFAFDTVQNKWVLVGVLTAGNGAGGRGNNWAVIPLDFIGQKFNEDNDAPVTFRTSEGG
ALEWSFNSSTGAGALTQGTTTYAMHGQQGNDLNAGKNLIFQGQNGQINLKDSVSQGAGSLTFRDNYTVTTSNGSTWTGAG
IVVDNGVSVNWQVNGVKGDNLHKIGEGTLTVQGTGINEGGLKVGDGKVVLNQQADNKGQVQAFSSVNIASGRPTVVLTDE
RQVNPDTVSWGYRGGTLDVNGNSLTFHQLKAADYGAVLANNVDKRATITLDYALRADKVALNGWSESGKGTAGNLYKYNN
PYTNTTDYFILKQSTYGYFPTDQSSNATWEFVGHSQGDAQKLVADRFNTAGYLFHGQLKGNLNVDNRLPEGVTGALVMDG
AADISGTFTQENGRLTLQGHPVIHAYNTQSVADKLAASGDHSVLTQPTSFSQEDWENRSFTFDRLSLKNTDFGLGRNATL
NTTIQADNSSVTLGDSRVFIDKNDGQGTAFTLEEGTSVATKDADKSVFNGTVNLDNQSVLNINDIFNGGIQANNSTVNIS
SDSAVLGNSTLTSTALNLNKGANALASQSFVSDGPVNISDATLSLNSRPDEVSHTLLPVYDYAGSWNLKGDDARLNVGPY
SMLSGNINVQDKGTVTLGGEGELSPDLTLQNQMLYSLFNGYRNIWSGSLNAPDATVSMTDTQWSMNGNSTAGNMKLNRTI
VGFNGGTSPFTTLTTDNLDAVQSAFVMRTDLNKADKLVINKSATGHDNSIWVNFLKKPSNKDTLDIPLVSAPEATADNLF
RASTRVVGFSDVTPILSVRKEDGKKEWVLDGYQVARNDGQGKAAATFMHISYNNFITEVNNLNKRMGDLRDINGEAGTWV
RLLNGSGSADGGFTDHYTLLQMGADRKHELGSMDLFTGVMATYTDTDASADLYSGKTKSWGGGFYASGLFRSGAYFDVIA
KYIHNENKYDLNFAGAGKQNFRSHSLYAGAEVGYRYHLTDTTFVEPQAELVWGRLQGQTFNWNDSGMDVSMRRNSVNPLV
GRTGVVSGKTFSGKDWSLTARAGLHYEFDLTDSADVHLKDAAGEHQINGRKDSRMLYGVGLNARFGDNTRLGLEVERSAF
GKYNTDDAINANIRYSF
>D3KVM6 1.6.5.7~~~~~~2-hydroxy-1,4-benzoquinone reductase~~~
MALKIAVVVGSLRRDSFNKQLAHALASLAPSDFSFDFVDIGGLPLYSQDYDSDFPEVARAFKQQIADADGLLIVTPEYNR
SMPGVLKNALDWASRPWGQSVWGGKPGAIIGTSVGAIGTAIAQSHLRGVCAYLDIVLMNQPEMYIKHDESRIDANGNIVS
EDTRKYLQTFMDKYAAWVRDRRV
>Q0QFQ3 3.7.1.23~~~hbzF~~~Maleylpyruvate hydrolase~~~
MKICRFNENRLGVVDGDEVLDVTEALSVLPSYEYPLPGYDPLIKHLDALLARIETLIKDAPRILLADVRLLSPVANPGKI
IAAPINYTRHLQEVLADSAINNGVASFTQHIKKSGLFLKANSSLAGAGEGVALSHQDRRNDHEVELAIVIGKTARNVPRE
KSLEYVAGYCIGIDMTVRGPEERSFRKSPDSYTILGPWLVTRDEIDSPGELQMSLKVNGEVRQNANTSDLILGVEELVEF
ASSFYTLHPGDVIISGTPEGVGPVNPGDAMLAEIERIGTMTIAIRSV
>Q06281 ~~~hctB~~~Histone H1-like protein HC2~~~
MLGVQKKRSTRKTAARKTVVRKPAAKKTAAKKAPVRKVAAKKTVARKTVAKKTVAARKPVAKKATAKKAPVRKVAAKKTV
ARKTVAKKTVAARKPVAKRVASTKKSSVAVKAGVCMKKHKHTAACGRVAASGVKVCASAAKRKTNPNRSRTAHSWRQQLM
KLVAR
>Q06280 ~~~hctB~~~Histone-like protein HC2~~~
MLGVQKKCSTRKTAARKTVVRKPAAKKTAAKKAPVRKVAAKKTVARKTVAKKTVAARKPVAKKATAKKAPVRKVAAKKTV
ARKTVAKKTVAARKPVAKKATAKKAPVRKAVAKKTVARKTVAKKTVAARKPVAKRVASTKKSSIAVKAGVCMKKHKHTAA
CGRVAASGVKVCASAAKRKTNPNRSRTAHSWRQQLMKLVAR
>P0CI32 1.3.1.87~~~hcaB~~~3-phenylpropionate-dihydrodiol/cinnamic acid-dihydrodiol dehydrogenase~~~COG1028
MSDLHNESIFITGGGSGLGLALVERFIEKGAQVATLELSAAKVASLRQRFGEHILAVEGNVTCYADYQRAVDQILTRSGK
LDCFIGNAGIWDHNASLVNTPAETLETGFHELFNVNVLGYLLGAKACAPALIASEGSMIFTLSNAAWYPGGGGPLYTASK
HAATGLIRQLAYELAPKVRVNGVGPCGMASDLRGPQALGQSETSIMQSLTPEKIAAILPLQFFPQPADFTGPYVMLTSRR
NNRALSGVMINADAGLAIRGIRHVAAGLDL
>P0ABW0 ~~~hcaC~~~3-phenylpropionate/cinnamic acid dioxygenase ferredoxin subunit~~~COG2146
MNRIYACPVADVPEGEALRIDTSPVIALFNVGGEFYAINDRCSHGNASMSEGYLEDDATVECPLHAASFCLKTGKALCLP
ATDPLTTYPVHVEGGDIFIDLPEAQP
>P77650 1.18.1.3~~~hcaD~~~3-phenylpropionate/cinnamic acid dioxygenase ferredoxin--NAD(+) reductase component~~~COG0446
MKEKTIIIVGGGQAAAMAAASLRQQGFTGELHLFSDERHLPYERPPLSKSMLLEDSPQLQQVLPANWWQENNVHLHSGVT
IKTLGRDTRELVLTNGESWHWDQLFIATGAAARPLPLLDALGERCFTLRHAGDAARLREVLQPERSVVIIGAGTIGLELA
ASATQRRCKVTVIELAATVMGRNAPPPVQRYLLQRHQQAGVRILLNNAIEHVVDGEKVELTLQSGETLQADVVIYGIGIS
ANEQLAREANLDTANGIVIDEACRTCDPAIFAGGDVAITRLDNGALHRCESWENANNQAQIAAAAMLGLPLPLLPPPWFW
SDQYSDNLQFIGDMRGDDWLCRGNPETQKAIWFNLQNGVLIGAVTLNQGREIRPIRKWIQSGKTFDAKLLIDENIALKSL
>P0ABR5 1.14.12.19~~~hcaE~~~3-phenylpropionate/cinnamic acid dioxygenase subunit alpha~~~COG4638
MTTPSDLNIYQLIDTQNGRVTPRIYTDPDIYQLELERIFGRCWLFLAHESQIPKPGDFFNTYMGEDAVVVVRQKDGSIKA
FLNQCRHRAMRVSYADCGNTRAFTCPYHGWSYGINGELIDVPLEPRAYPQGLCKSHWGLNEVPCVESYKGLIFGNWDTSA
PGLRDYLGDIAWYLDGMLDRREGGTEIVGGVQKWVINCNWKFPAEQFASDQYHALFSHASAVQVLGAKDDGSDKRLGDGQ
TARPVWETAKDALQFGQDGHGSGFFFTEKPDANVWVDGAVSSYYRETYAEAEQRLGEVRALRLAGHNNIFPTLSWLNGTA
TLRVWHPRGPDQVEVWAFCITDKAASDEVKAAFENSATRAFGPAGFLEQDDSENWCEIQKLLKGHRARNSKLCLEMGLGQ
EKRRDDGIPGITNYIFSETAARGMYQRWADLLSSESWQEVLDKTAAYQQEVMK
>Q47140 1.14.12.19~~~hcaF~~~3-phenylpropionate/cinnamic acid dioxygenase subunit beta~~~COG5517
MSAQVSLELHHRISQFLFHEASLLDDWKFRDWLAQLDEEIRYTMRTTVNAQTRDRRKGVQPPTTWIFNDTKDQLERRIAR
LETGMAWAEEPPSRTRHLISNCQISETDIPNVFAVRVNYLLYRAQKERDETFYVGTRFDKVRRLEDDNWRLLERDIVLDQ
AVITSHNLSVLF
>Q47141 ~~~hcaR~~~Hca operon transcriptional activator HcaR~~~COG0583
MELRHLRYFVAVAQALNFTRAAEKLHTSQPSLSSQIRDLENCVGVPLLVRDKRKVALTAAGECFLQDALAILEQAENAKL
RARKIVQEDRQLTIGFVPSAEVNLLPKVLPMFRLRQPDTLIELVSLITTQQEEKIRRGELDVGLMRHPVYSPEIDYLELF
DEPLVVVLPVDHPLAHEKEITAAQLDGVNFVSTDPVYSGSLAPIVKAWFAQENSQPNIVQVATNILVTMNLVGMGLGVTL
IPGYMNNFNTGQVVFRPIAGNVPSIALLMAWKKGEMKPALRDFIAIVQERLASVTA
>Q47142 ~~~hcaT~~~Probable 3-phenylpropionic acid transporter~~~COG2814
MVLQSTRWLALGYFTYFFSYGIFLPFWSVWLKGIGLTPETIGLLLGAGLVARFLGSLLIAPRVSDPSRLISALRVLALLT
LLFAVAFWAGAHVAWLMLVMIGFNLFFSPLVPLTDALANTWQKQFPLDYGKVRLWGSVAFVIGSALTGKLVTMFDYRVIL
ALLTLGVASMLLGFLIRPTIQPQGASRQQESTGWSAWLALVRQNWRFLACVCLLQGAHAAYYGFSAIYWQAAGYSASAVG
YLWSLGVVAEVIIFALSNKLFRRCSARDMLLISAICGVVRWGIMGATTALPWLIVVQILHCGTFTVCHLAAMRYIAARQG
SEVIRLQAVYSAVAMGGSIAIMTVFAGFLYQYLGHGVFWVMALVALPAMFLRPKVVPSC
>P83589 1.1.1.35~~~~~~Probable 3-hydroxyacyl-CoA dehydrogenase~~~
MKIKKAAVIGAGVMGAAIAAQLANAGIPVLLLDIVLPDKPDRNFLAKAGVERALKARPAAFMDNDRARLIEVGNLEDDLK
KLKDVDWVLEAIIEKLDAKHDLWEKVEKVVKKTAIISSNSSGIPMHLQIEGRSEDFQRRFVGAHFFNPPRYLHLLEVIPT
DKTDPQVVKDFSEFAEHTLGKGVVVANDVPGFVANRIGVYGIVRAMQHMEKYGLTPAEVDQLTGPALGRASSATFRTADL
SGLDIISHVATDIGGVTPDDEDFTLTESFKNMVAGGILGDKSGSGFYKKTKDEKARPRFSTTCKPANTKTRARCACPPWT
PRANLSPSATPCTPWKARKATSCAPP
>Q56840 1.1.1.268~~~xecD1~~~2-(R)-hydroxypropyl-CoM dehydrogenase~~~COG1028
MSRVAIVTGASSGNGLAIATRFLARGDRVAALDLSAETLEETARTHWHAYADKVLRVRADVADEGDVNAAIAATMEQFGA
IDVLVNNAGITGNSEAGVLHTTPVEQFDKVMAVNVRGIFLGCRAVLPHMLLQGAGVIVNIASVASLVAFPGRSAYTTSKG
AVLQLTKSVAVDYAGSGIRCNAVCPGMIETPMTQWRLDQPELRDQVLARIPQKEIGTAAQVADAVMFLAGEDATYVNGAA
LVMDGAYTAI
>Q56841 1.1.1.269~~~xecE1~~~2-(S)-hydroxypropyl-CoM dehydrogenase 1~~~COG1028
MLDAEVIAITGGAAGIGLAVAHAAIRAGARVALIDRDGACAQRAAAEFGAAAWGVGADVTDEAAITAAMAGAQRALGPLT
GLVNNAGIAGFGSVHATEVETWSRIMAVNVTGTFLASKAALFGMLERGRGAIVNFGSVAGLVGIPTMAAYCAAKGAVVNL
TRQMAADYSGRGIRVNVVCPGTVAGTDMGRQLLGTDCDPELEARRLAKYPMGRFGTPEDIAEAAVFLLSTKAAFVTGSVL
AVDGGMTAI
>A7IQH5 1.1.1.269~~~xecE3~~~2-(S)-hydroxypropyl-CoM dehydrogenase 3~~~COG1028
MSNRLKNEVIAITGGGAGIGLAIASAALREGAKVALIDLDQGLAERSAAMLSTGGAVAKGFGADVTKAADITAAITSAEQ
TIGSLTGLVNNAGIAGFGSVHDADAAAWDRIMAVNVTGTFLASKAALAGMLERHKGTIVNFGSVAGLVGIPTMAAYCAAK
GAIVNLTRQMAADYSGRGVRVNAVCPGTVTSTGMGQQLLGSDTSPEVQARRLAKYPIGRFGTPEDIAEAVIFLLSDQAAF
VTGAAFAVDGGMTAI
>P31658 3.1.2.-~~~hchA~~~Protein/nucleic acid deglycase 1~~~COG0693
MTVQTSKNPQVDIAEDNAFFPSEYSLSQYTSPVSDLDGVDYPKPYRGKHKILVIAADERYLPTDNGKLFSTGNHPIETLL
PLYHLHAAGFEFEVATISGLMTKFEYWAMPHKDEKVMPFFEQHKSLFRNPKKLADVVASLNADSEYAAIFVPGGHGALIG
LPESQDVAAALQWAIKNDRFVISLCHGPAAFLALRHGDNPLNGYSICAFPDAADKQTPEIGYMPGHLTWYFGEELKKMGM
NIINDDITGRVHKDRKLLTGDSPFAANALGKLAAQEMLAAYAG
>Q2G0M7 3.1.2.-~~~hchA~~~Protein/nucleic acid deglycase HchA~~~COG0693
MSQDVNELSKQPTPDKAEDNAFFPSPYSLSQYTAPKTDFDGVEHKGAYKDGKWKVLMIAAEERYVLLENGKMFSTGNHPV
EMLLPLHHLMEAGFDVDVATLSGYPVKLELWAMPTEDEAVISTYNKLKEKLKQPKKLADVIKNELGPDSDYLSVFIPGGH
AAVVGISESEDVQQTLDWALDNDRFIVTLCHGPAALLSAGLNREKSPLEGYSVCVFPDSLDEGANIEIGYLPGRLKWLVA
DLLTKQGLKVVNDDMTGRTLKDRKLLTGDSPLASNELGKLAVNEMLNAIQNK
>P64313 3.1.2.-~~~hchA~~~Protein/nucleic acid deglycase HchA~~~
MSQDVNELSKQPTPDKAEDNAFFPSPYSLSQYTAPKTDFDGVEHKGAYKDGKWKVLMIAAEERYVLLENGKMFSTGNHPV
EMLLPLHHLMEAGFDVDVATLSGYPVKLELWAMPTEDEAVISTYNKLKEKLKQPKKLADVIKNELGPDSDYLSVFIPGGH
AAVVGISESEDVQQTLDWALDNDRFIVTLCHGPAALLSAGLNREKSPLEGYSVCVFPDSLDEGANIEIGYLPGRLKWLVA
DLLTKQGLKVVNDDMTGRTLKDRKLLTGDSPLASNELGKLAVNEMLNAIQNK
>O69762 4.1.2.61~~~~~~Hydroxycinnamoyl-CoA hydratase-lyase~~~COG1024
MSTYEGRWKTVKVEIEDGIAFVILNRPEKRNAMSPTLNREMIDVLETLEQDPAAGVLVLTGAGEAWTAGMDLKEYFREVD
AGPEILQEKIRREASQWQWKLLRMYAKPTIAMVNGWCFGGGFSPLVACDLAICADEATFGLSEINWGIPPGNLVSKAMAD
TVGHRQSLYYIMTGKTFGGQKAAEMGLVNESVPLAQLREVTIELARNLLEKNPVVLRAAKHGFKRCRELTWEQNEDYLYA
KLDQSRLLDTEGGREQGMKQFLDDKSIKPGLQAYKR
>I3VE77 5.4.99.64~~~hcmA~~~2-hydroxyisobutanoyl-CoA mutase large subunit~~~
MTWLEPQIKSQLQSERKDWEANEVGAFLKKAPERKEQFHTIGDFPVQRTYTAADIADTPLEDIGLPGRYPFTRGPYPTMY
RSRTWTMRQIAGFGTGEDTNKRFKYLIAQGQTGISTDFDMPTLMGYDSDHPMSDGEVGREGVAIDTLADMEALLADIDLE
KISVSFTINPSAWILLAMYVALGEKRGYDLNKLSGTVQADILKEYMAQKEYIYPIAPSVRIVRDIITYSAKNLKRYNPIN
ISGYHISEAGSSPLQEAAFTLANLITYVNEVTKTGMHVDEFAPRLAFFFVSQGDFFEEVAKFRALRRCYAKIMKERFGAR
NPESMRLRFHCQTAAATLTKPQYMVNVVRTSLQALSAVLGGAQSLHTNGYDEAFAIPTEDAMKMALRTQQIIAEESGVAD
VIDPLGGSYYVEALTTEYEKKIFEILEEVEKRGGTIKLIEQGWFQKQIADFAYETALRKQSGQKPVIGVNRFVENEEDVK
IEIHPYDNTTAERQISRTRRVRAERDEAKVQAMLDQLVAVAKDESQNLMPLTIELVKAGATMGDIVEKLKGIWGTYRETP
VF
>I3VE74 5.4.99.64~~~hcmB~~~2-hydroxyisobutanoyl-CoA mutase small subunit~~~
MDQTPIRVLLAKVGLDGHDRGVKVVARALRDAGMDVIYSGLHRTPEEVVNTAIQEDVDVLGVSLLSGVQLTVFPKIFKLL
DERGAGDLIVIAGGVMPDEDAAAIRKLGVREVLLQDTPPQAIIDSIRSLVAARGAR
>G3XD67 1.4.99.5~~~hcnA~~~Hydrogen cyanide synthase subunit HcnA~~~
MHLLERQHDIQPLSRADMTIHLNGQPVAAAAGETVLNVLNAVGLRRLARNDHGQASGAFCGMGVCHCCLVAIDGRPKRRA
CQTVVRPGMRVETESNRFDQEERP
>O85226 1.4.99.5~~~hcnA~~~Hydrogen cyanide synthase subunit HcnA~~~COG3383
MRQIDRNFDIQPLQHADMTISLNGQPVTAALGETVLSVIQATGLRQVARNDHGQLVGAYCGMGVCHCCLVQIDGRHKRRA
CQTLVKPGMQVQTLSNRITETEPTL
>Q9I1S2 1.4.99.5~~~hcnB~~~Hydrogen cyanide synthase subunit HcnB~~~
MNLRPVIVGGGSAGMAAAIELARRGVPCVLFDEASRPGGVVYRGPLRAGVDPAYLGARYTRMLEKLRRDFSACAGHIDLR
LNSRVVGGDGQRLMVLDEAERLHEVEYSHLLLATGCHERSVPFPGWTLPGVMLLGGLQLQIKSGVVKPLGDTLIAGSGPL
LPLVACQLHAAGVRVAGVYEACAFGRMARESLALLNKPQLFLDGLSMLGYLKLNGIPLHYGWGVVEASGDGELTEVTVAP
YDEEWRPDLENARPVKASTLAVGYGFIPRTQLSQQLGLEHGFSDDGYLRAECNVWQQSSQPHIHLAGDMAGIRGGEAAMI
GGRIAALSILLQREAIAPAEAIERRESHLARLEAIKRFRAGVERYTQRGARQVELARADTVICRCEQVTRGDIERALEQG
VQDIAGLKMRTRAGMGDCQGRMCIGYCSDRLRRATGRHDVGWLRPRFPIDPIPFSAFQNLGTEA
>O85227 1.4.99.5~~~hcnB~~~Hydrogen cyanide synthase subunit HcnB~~~COG0446
MSLNPVIVGGGPAGMAAAIELAEHGVRSTLIEEASRLGGVVYRGPLRDGVQLDYLGPRYCEMLAKLHGDFADHEQMIDVR
LNSRVVGAEGTQSLVLLDGEEQVQQVSYEQLILAAGCHERSVPFPGWTLPGVKLLGGLQLQIKSGVVKPQSPVVIAGTGP
LLPLVACQLHASGVRVAGVYEACALGKIAKQSLAMLNKPQLFLDGLSMLAYLKLHGIALRYGWGVVEAQGQDALSVVTVA
PYSSDWQPDMAKAQRIAAQTLAVGYGFIPRTQLSQQMGLEHNFSDDGYLRASANAWQQSSEPHVHLAGDMGGIRGGEAAM
LSGRIAALSILMQRGVLSNEAALQRRQGYERKLASILRFRGAVDRYTARGAGQVELPKGDTVICRCEHTTRNDIERALSQ
GVQDMASLKMRTRVSMGDCQGRMCVGYCSDRLRQATGRKDVGWIRPRFPLDPIPFSAFPPSDQEVSQHD
>G3XD12 1.4.99.5~~~hcnC~~~Hydrogen cyanide synthase subunit HcnC~~~
MNRTYDIVIAGGGVIGASCAYQLSRRGNLRIAVVDDKRPGNATRASAGGLWAIGESVGLGCGVIFFRMMSSRNRREAQGA
AVAVDASTPHILPPAFFDLALQSNALYPELHRELIERHGMDFKFERTGLKYVIQDDEDRQYAEHIVAQIPHLAEQVRWLD
REELRRAEPAVSHAAHGALEFLCDHQVSPFRLADAYLEAARQNGVELLPGTNVTGVLRQGRRISGVRTDNAGVLHCRTLI
NAAGAWAAELSEMATGRRIPVKPVKGQIVLTERMPRLLNGCLTTSDCYMAQKDNGEILIGSTTEDKGFDVSNTFPEIAGL
VQGAVRCVPELQQVNLKRTWAGLRPGSPDELPILGPVAEVEGYLNACGHFRTGILTSAITGVLLDRLVHEETLPLDIAPF
LAARFQPEPAAVAVAAC
>O85228 1.4.99.5~~~hcnC~~~Hydrogen cyanide synthase subunit HcnC~~~COG0665
MIKHYDVVIAGGGVIGASCAYQLSKRKDLKVALIDAKRPGNASRASAGGLWAIGESVGLGCGVIFFRMMSANRKREAQGS
AVVVDSSTPHILPQSFFDFALQSNELYPRLHRELMGLHNMDFKFEQTGLKFVIYDEEDRLYAEHIVGCIPHLSDQVRWLD
QAALRASEPNVSHEAQGALEFLCDHQVNPFRLTDAYTEGARQNGVDVYFNTNVTGVLHQGNRVSGVKTDVAGLFRCTTLI
NAAGAWAAELSLQATGIEIPVKPVKGQILLTERMPKLLNGCLTTSDCYMAQKDNGEILIGSTTEDKGFDVTTTYPEINGL
VQGAVRCVPELAHVNLKRCWAGLRPGSPDELPILGPMDGVEGYLNACGHFRTGILTSAITGVLLDKLVNEEALPLDITPF
LARRFATAPVKKQPEPA
>Q9I747 ~~~hcp1~~~Protein hcp1~~~
MAVDMFIKIGDVKGESKDKTHAEEIDVLAWSWGMSQSGSMHMGGGGGAGKVNVQDLSFTKYIDKSTPNLMMACSSGKHYP
QAKLTIRKAGGENQVEYLIITLKEVLVSSVSTGGSGGEDRLTENVTLNFAQVQVDYQPQKADGAKDGGPVKYGWNIRQNV
QA
>O25001 3.5.2.6~~~hcpA~~~Beta-lactamase HcpA~~~COG0790
MLGNVKKTLFGVLCLGTLCLRGLMAEPDAKELVNLGIESAKKQDFAQAKTHFEKACELKNGFGCVFLGAFYEEGKGVGKD
LKKAIQFYTKGCELNDGYGCNLLGNLYYNGQGVSKDAKKASQYYSKACDLNHAEGCMVLGSLHHYGVGTPKDLRKALDLY
EKACDLKDSPGCINAGYIYSVTKNFKEAIVRYSKACELKDGRGCYNLGVMQYNAQGTAKDEKQAVENFKKGCKSSVKEAC
DALKELKIEL
>O25103 3.5.2.6~~~hcpB~~~Beta-lactamase HcpB~~~
MVGGGTVKKDLKKAIQYYVKACELNEMFGCLSLVSNSQINKQKLFQYLSKACELNSGNGCRFLGDFYENGKYVKKDLRKA
AQYYSKACGLNDQDGCLILGYKQYAGKGVVKNEKQAVKTFEKACRLGSEDACGILNNY
>O25728 3.5.2.6~~~hcpC~~~Putative beta-lactamase HcpC~~~COG0790
MLENVKKSFFRVLCLGALCLGGLMAEQDPKELVGLGAKSYKEKDFTQAKKYFEKACDLKENSGCFNLGVLYYQGQGVEKN
LKKAASFYAKACDLNYSNGCHLLGNLYYSGQGVSQNTNKALQYYSKACDLKYAEGCASLGGIYHDGKVVTRDFKKAVEYF
TKACDLNDGDGCTILGSLYDAGRGTPKDLKKALASYDKACDLKDSPGCFNAGNMYHHGEGATKNFKEALARYSKACELEN
GGGCFNLGAMQYNGEGVTRNEKQAIENFKKGCKLGAKGACDILKQLKIKV
>O24968 3.5.2.6~~~hcpD~~~Putative beta-lactamase HcpD~~~COG0790
MIKSWTKKWFLILFLMASCSSYLVATTGEKYFKMATQAFKRGDYHKAVAFYKRSCNLRVGVGCTSLGSMYEDGDGVDQNI
TKAVFYYRRGCNLRNHLACASLGSMYEDGDGVQKNLPKAIYYYRRGCHLKGGVSCGSLGFMYFNGTGVKQNYAKALFLSK
YACSLNYGISCNFVGYMYRNAKGVQKDLKKALANFKRGCHLKDGASCVSLGYMYEVGMDVKQNGEQALNLYKKGCYLKRG
SGCHNVAVMYYTGKGVPKDLDKAISYYKKGCTLGFSGSCKVLEEVIGKKSDDLQDDAQNDTQDDMQ
>Q01770 1.7.99.1~~~hcp~~~Hydroxylamine reductase~~~COG1151
MSNAMFCYQCQETVGNKGCTQVGVCGKKPETAALQDALIYVTKGLGQIATRLRAEGKAVDHRIDRLVTGNLFATITNANF
DDDILAERVRMTCAAKKELAASLTDKSGLSDAALWEASEKSAMLAKAGTVGVMATTDDDVRSLRWLITFGLKGMAAYAKH
ADVLGKHENSLDAFMQEALAKTLDDSLSVADLVALTLETGKFGVSAMALLDAANTGTYGHPEITKVNIGVGSNPGILISG
HDLRDLEMLLKQTEGTGVDVYTHSEMLPAHYYPAFKKYAHFKGNYGNAWWKQKEEFESFNGPVLLTTNCLVPPKDSYKDR
VYTTGIVGFTGCKHIPGEIGEHKDFSAIIAHAKTCPAPTEIESGEIIGGFAHNQVLALADKVIDAVKSGAIKKFVVMAGC
DGRAKSRSYYTDFAEGLPKDTVILTAGCAKYRYNKLNLGDIGGIPRVLDAGQCNDSYSLAVIALKLKEVFGLEDVNDLPI
VYNIAWYEQKAVIVLLALLSLGVKNIHLGPTLPAFLSPNVAKVLVEQFNIGGITSPQDDLKAFFG
>P31101 1.7.99.1~~~hcp~~~Hydroxylamine reductase~~~COG1151
MFCFQCQETAKNTGCTVKGMCGKPEETANLQDLLIFVLRGIAIYGEKLKELGQPDRSNDDFVLQGLFATITNANWDDARF
EAMISEGLARRDKLRNAFLAVYKAKNGKDFSEPLPEAATWTGDSTAFAEKAKSVGILATENEDVRSLRELLIIGLKGVAA
YAEHAAVLGFRKTEIDEFMLEALASTTKDLSVDEMVALVMKAGGMAVTTMALLDEANTTTYGNPEITQVNIGVGKNPGIL
ISGHDLKDMAELLKQTEGTGVDVYTHGEMLPANYYPAFKKYPHFVGNYGGSWWQQNPEFESFNGPILLTTNCLVPLKKEN
TYLDRLYTTGVVGYEGAKHIADRPAGGAKDFSALIAQAKKCPPPVEIETGSIVGGFAHHQVLALADKVVEAVKSGAIKRF
VVMAGCDGRQKSRSYYTEVAENLPKDTVILTAGCAKYRYNKLNLGDIGGIPRVLDAGQCNDSYSLAVIALKLKEVFGLDD
INDLPVSYDIAWYEQKAVAVLLALLFLGVKGIRLGPTLPAFLSPNVAKVLVENFNIKPIGTVQDDIAAMMAGK
>P75825 1.7.99.1~~~hcp~~~Hydroxylamine reductase~~~COG1151
MFCVQCEQTIRTPAGNGCSYAQGMCGKTAETSDLQDLLIAALQGLSAWAVKAREYGIINHDVDSFAPRAFFSTLTNVNFD
SPRIVGYAREAIALREALKAQCLAVDANARVDNPMADLQLVSDDLGELQRQAAEFTPNKDKAAIGENILGLRLLCLYGLK
GAAAYMEHAHVLGQYDNDIYAQYHKIMAWLGTWPADMNALLECSMEIGQMNFKVMSILDAGETGKYGHPTPTQVNVKATA
GKCILISGHDLKDLYNLLEQTEGTGVNVYTHGEMLPAHGYPELRKFKHLVGNYGSGWQNQQVEFARFPGPIVMTSNCIID
PTVGAYDDRIWTRSIVGWPGVRHLDGDDFSAVITQAQQMAGFPYSEIPHLITVGFGRQTLLGAADTLIDLVSREKLRHIF
LLGGCDGARGERHYFTDFATSVPDDCLILTLACGKYRFNKLEFGDIEGLPRLVDAGQCNDAYSAIILAVTLAEKLGCGVN
DLPLSLVLSWFEQKAIVILLTLLSLGVKNIVTGPTAPGFLTPDLLAVLNEKFGLRSITTVEEDMKQLLSA
>Q6WRT6 1.7.99.1~~~hcp~~~Hydroxylamine reductase~~~
MYCIQCEQTLHTATGTGCRFARGDCGKTAAISDQQDALVAALLAVSSHADAARKVGLIDAEVDAFVPQALFATLTNVNFD
PERLAGYIRKAQELRNRLQLALAGKPLALPALADADWPFAAAQQAEAGKIVALNRDAARIGEDVLGLRLLCLYGLKGIAA
YMEHARVLGQTDTQVAAGFHAHMAYLASEPTDAKGLFAEALAIGTLNFRVMEMLDAGATGTFGDPQPTPVNRRPVAGKAI
LVSGHDLHDLLRILEQTAGRGINVYTHGEMLPAHGYPAFHAHPHLIGNYGSAWQNQQAEFAAFPGAIVMTSNCLIDPRTG
AYQDRIFTRSIVGWPGVRHIEGEDFAEVIACAEALPGFAATEAPVTQLTGFGRNALMTAAPAVIERVKVGKIRHFYLIGG
CDGARAERAYYADLARMLPQDTVVLTLGCGKFRLDGIDFGAVDGLPRLLDVGQCNDAYAAIRLALALAEAFDCGVNDLPL
TLVLSWFEQKAIVILLTLLALGVKDIRVGPTAPGFLTPNLIATLNAQFGLRLISTPEETMAETLSA
>O33819 1.1.7.1~~~hcrA~~~4-hydroxybenzoyl-CoA reductase subunit alpha~~~
MSPKLPQHGTVGVRTPLVDGVEKVTGKAKYTADIAAPDALVGRILRSPHAHARILAIDTSAAEALEGVIAVCTGAETPVP
FGVLPIAENEYPLARDKVRYRGDPVAAVAAIDEVTAEKALALIKVDYEVLPAYMTPKAAMKAGAIALHDDKPNNILREVH
AEFGDVAAAFAEADLIREKTYTFAEVNHVHMELNATLAEYDPVRDMLTLNTTTQVPYYVHLKVAACLQMDSARIRVIKPF
LGGGFGARTEGLHFEIIAGLLARKAKGTVRLLQTREETFIAHRGRPWTEVKMKIGLKKDGKIAALALEATQAGGAYAGYG
IITILYTGALMHGLYHIPAIKHDAWRVYTNTPPCGAMRGHGTVDTRAAFEALLTEMGEELGIDSLKIRQINMLPQIPYVT
MYAQRVMSYGVPECLEKVKAASGWEERKGKLPKGRGLGIALSHFVSGTSTPKHWTGEPHATVNLKLDFDGGITLLTGAAD
IGQGSNTMASQVAAEVLGVRLSRIRVISADSALTPKDNGSYSSRVTFMVGNASISAAEELKGVLVKAAAKKLDAREEDIE
VIDEMFMVSGSQDPGLSFQEVVKAAMVDSGTITVKGTYTCPTEFQGDKKIRGSAIGATMGFCYAAQVVEASVDEITGKVT
AHKVWVAVDVGKALNPLAVEGQTQGGVWMGMGQALSEETVYDNGRMVHGNILDYRVPTIVESPDIEVIIVESMDPNGPFG
AKEASEGMLAGFLPAIHEAVYEAVGVRATDFPLSPDRITELLDAKEAAA
>O33820 1.1.7.1~~~hcrB~~~4-hydroxybenzoyl-CoA reductase subunit beta~~~
MNILTDFRTHRPATLADAVNALAAEATLPLGAGTDLLPNLRRGLGHPAALVDLTGIDGLATISTLADGSLRIGAGATLEA
IAEHDAIRTTWPALAQAAESVAGPTHRAAATLGGNLCQDTRCTFYNQSEWWRSGNGYCLKYKGDKCHVIVKSDRCYATYH
GDVAPALMVLDARAEIVGPAGKRTVPVAQLFRESGAEHLTLEKGELLAAIEVPPTGAWSAAYSKVRIRDAVDFPLAGVAA
ALQRDGDRIAGLRVAITGSNSAPLMVPVDALLGGNWDDAAAETLAQLVRKTSNVLRTTITGVKYRRRVLLAISRKVVDQL
WEAR
>O33818 1.1.7.1~~~hcrC~~~4-hydroxybenzoyl-CoA reductase subunit gamma~~~
MKNILRLTLNGRAREDLVPDNMLLLDYLRETVGLTGTKQGCDGGECGACTVLVDDRPRLACSTLAHQVAGKKVETVESLA
TQGTLSKLQAAFHEKLGTQCGFCTPGMIMASEALLRKNPSPSRDEIKAALAGNLCRCTGYVRSSKSVETAAAARLCEEGA
R
>P75824 1.-.-.-~~~hcr~~~NADH oxidoreductase HCR~~~COG1018
MTMPTNQCPWRMQVHHITQETPDVWTISLICHDYYPYRAGQYALVSVRNSAETLRAYTISSTPGVSEYITLTVRRIDDGV
GSQWLTRDVKRGDYLWLSDAMGEFTCDDKAEDKFLLLAAGCGVTPIMSMRRWLAKNRPQADVRVIYNVRTPQDVIFADEW
RNYPVTLVAENNVTEGFIAGRLTRELLAGVPDLASRTVMTCGPAPYMDWVEQEVKALGVTRFFKEKFFTPVAEAATSGLK
FTKLQPAREFYAPVGTTLLEALESNNVPVVAACRAGVCGCCKTKVVSGEYTVSSTMTLTDAEIAEGYVLACSCHPQGDLV
LA
>E4MYY0 4.2.3.187~~~~~~(2Z,6E)-hedycaryol synthase~~~
MAEFEIPDFYVPFPLECNPHLEEASRAMWEWIDANGLAPTERARDRMRRTGADLSGAYVWPRADLDTLTIGLKWIALTFR
IDDQIDEDDTAERLPARMTAIDELRGTLHGLPVSGRSPTARALGALWQETALGRPATWCDAFIGHFEAFLQTYTTEAGLN
AHGAGLRLDDYLDRRMYSVGMPWLWDLDELRLPIFLPGSVRTCGPMNKLRRAGALHIALVNDVFSVERETLVGYQHNAVT
IIREAQGCSLQEAVDQVAVLVEAQLHTVLQARQELLEELDRQALPSRAREAAVDYAANVAANLSGQLVWHSSVERYAVDD
LQSAADPRATPTTSSLGI
>Q9PLI1 ~~~hctA~~~Histone H1-like protein HC1~~~
MALKDTAKKMTDLLESIQQNLLKAEKGNKAAAQRVRTESIKLEKIAKVYRKESIKAEKMGLMKRSKVAAKKAKAAAKKPA
KATKVVTKKACTKKTCATKAKAKPVKKAATKTKAKVTKKVRSTKK
>Q9Z720 ~~~hctA~~~Histone H1-like protein HC1~~~
MALKDTAKKMKDLLDSIQHDLAKAEKGNKAAAQRVRTDSIKLEKVAKLYRKESIKAEKSGLLKRKPSTKAPAKVKKTAEK
KAPKKSSAAAAKTSKAVKASKPASKKTAAKKVKKPSKARGFRK
>Q46204 ~~~hctA~~~Histone H1-like protein HC1~~~
MALKDTAKKMRDLLESIQRDLDKAERGNKAAAQRVRTDSIKLEKVAKVYRKESIKAEKSGLMTRKPATKAKKAAATKKAA
PKPKIQAKAAPKAKATTKKTPAKAKAKKSSKSRYLRK
>B0B8W9 ~~~hctA~~~Histone H1-like protein Hc1~~~
MALKDTAKKMTDLLESIQQNLLKAEKGNKAAAQRVRTESIKLEKTAKVYRKESIKAEKMGLMKKSKAAAKKAKAAAKKPV
RATKTVAKKACTKRTCATKAKVKPTKKAAPKTKVKTAKKTRSTKK
>P0CE15 ~~~hctA~~~Histone H1-like protein Hc1~~~
MALKDTAKKMTDLLESIQQNLLKAEKGNKAAAQRVRTESIKLEKIAKVYRKESIKAEKMGLMKKSKAAAKKAKAAAKKPV
RAAKTVAKKACTKRTCATKAKVKPTKKAAPKTKVKTAKKTRSTKK
>P38020 ~~~hctB~~~Histone H1-like protein HC2~~~
MLGVQKKRSTRKTAARKTVVRKPAAKKTAAKKASVRKVAAKKTVARKTVAKKAVAARKPAAKKTAAKKAPVRKVAAKKTV
ARKTVAKKAVAARKTVAKKSVAARKTAAKKAPVRKVAAKKTVARKTVAKKAVAARKPAAKRTVSTKKTAVAAKAGVCMKK
HKHTAACGRVAASGVKVCASSAKRRTHHNRSRTAHSWRQQLMKLVAK
>Q9Z8F9 ~~~hctB~~~Histone H1-like protein HC2~~~
MIGAQKKQSGKKTASRAVRKPAKKVAAKRTVKKATVRKTAVKKPAVRKTAAKKTVAKKTTAKRTVRKTVAKKPAVKKVAA
KRVVKKTVAKKTTAKRAVRKTVAKKPVARKTTVAKGSPKKAAACALACHKNHKHTSSCKRVCSSTATRKHGSKSRVRTAH
GWRHQLIKMMSR
>Q46397 ~~~hctB~~~Histone H1-like protein HC2~~~
MLGVQKKRSTRKTAARKTVVRKPAAKKTAAKKAPVRKVAAKKTVARKTVAKKTVAARKPVAKKATAKKAPVRKAVAKKTV
ARKTVAKKTVAARKPVAKKATAKKAPVRKVAAKKTVARKTVAKKTVAARKPVAKKATAKKAPVRKAVAKKTVAKRVASTK
KSSVAVKAGVCMKKHKHTAACGRVAASGVKVCASAAKRKMNPNRSRTAHSWRQQLMKLVAR
>P45579 1.1.1.-~~~hcxA~~~Hydroxycarboxylate dehydrogenase A~~~COG0371
MPHNPIRVVVGPANYFSHPGSFNHLHDFFTDEQLSRAVWIYGKRAIAAAQTKLPPAFGLPGAKHILFRGHCSESDVQQLA
AESGDDRSVVIGVGGGALLDTAKALARRLGLPFVAVPTIAATCAAWTPLSVWYNDAGQALHYEIFDDANFMVLVEPEIIL
NAPQQYLLAGIGDTLAKWYEAVVLAPQPETLPLTVRLGINNAQAIRDVLLNSSEQALSDQQNQQLTQSFCDVVDAIIAGG
GMVGGLGDRFTRVAAAHAVHNGLTVLPQTEKFLHGTKVAYGILVQSALLGQDDVLAQLTGAYQRFHLPTTLAELEVDINN
QAEIDKVIAHTLRPVESIHYLPVTLTPDTLRAAFKKVESFKA
>P30178 1.1.1.-~~~hcxB~~~Hydroxycarboxylate dehydrogenase B~~~COG2055
MESGHRFDAQTLHSFIQAVFRQMGSEEQEAKLVADHLIAANLAGHDSHGIGMIPSYVRSWSQGHLQINHHAKTVKEAGAA
VTLDGDRAFGQVAAHEAMALGIEKAHQHGIAAVALHNSHHIGRIGYWAEQCAAAGFVSIHFVSVVGIPMVAPFHGRDSRF
GTNPFCVVFPRKDNFPLLLDYATSAIAFGKTRVAWHKGVPVPPGCLIDVNGVPTTNPAVMQESPLGSLLTFAEHKGYALA
AMCEILGGALSGGKTTHQETLQTSPDAILNCMTTIIINPELFGAPDCNAQTEAFAEWVKASPHDDDKPILLPGEWEVNTR
RERQKQGIPLDAGSWQAICDAARQIGMPEETLQAFCQQLAS
>Q70I53 3.5.1.-~~~hdaH~~~Histone deacetylase-like amidohydrolase~~~
MAIGYVWNTLYGWVDTGTGSLAAANLTARMQPISHHLAHPDTKRRFHELVCASGQIEHLTPIAAVAATDADILRAHSAAH
LENMKRVSNLPTGGDTGDGITMMGNGGLEIARLSAGGAVELTRRVATGELSAGYALVNPPGHHAPHNAAMGFCIFNNTSV
AAGYARAVLGMERVAILDWDVHHGNGTQDIWWNDPSVLTISLHQHLCFPPDSGYSTERGAGNGHGYNINVPLPPGSGNAA
YLHAMDQVVLHALRAYRPQLIIVGSGFDASMLDPLARMMVTADGFRQMARRTIDCAADICDGRIVFVQEGGYSPHYLPFC
GLAVIEELTGVRSLPDPYHEFLAGMGGNTLLDAERAAIEEIVPLLADIR
>Q9HXM1 3.5.1.-~~~~~~Histone deacetylase-like amidohydrolase~~~
MTRRTAFFFDELCLWHAAGPHALTLPVGGWVQPPAAAGHAESPETKRRLKSLLDVSGLTARLQLRSAPPASDEDLLRVHP
AHYLERFKALSDAGGGSLGQDAPIGPGSYEIARLSAGLAIAALDAVLAGEADNAYSLSRPPGHHCLPDQAMGFCFFANIA
VAIEAAKARHGVERVAVLDWDVHHGNGTQAIYYRRDDVLSISLHQDGCFPPGYSGAEDIGEDRGRGFNLNVPLLPGGGHD
AYMQAMQRIVLPALERFRPQLIVVASGFDANAVDPLARMQLHSDSFRAMTAMVRDAAERHAGGRLVVVHEGGYSEAYVPF
CGLAVIEELSGVRSAVRDPLRDFIELQQPNAAFRDFQRQRLEELAAQFGLCPAQPLQAAR
>P69931 ~~~hda~~~DnaA regulatory inactivator Hda~~~COG0593
MNTPAQLSLPLYLPDDETFASFWPGDNSSLLAALQNVLRQEHSGYIYLWAREGAGRSHLLHAACAELSQRGDAVGYVPLD
KRTWFVPEVLDGMEHLSLVCIDNIECIAGDELWEMAIFDLYNRILESGKTRLLITGDRPPRQLNLGLPDLASRLDWGQIY
KLQPLSDEDKLQALQLRARLRGFELPEDVGRFLLKRLDREMRTLFMTLDQLDRASITAQRKLTIPFVKEILKL
>Q9AGY8 2.7.1.168~~~hddA~~~D-glycero-alpha-D-manno-heptose 7-phosphate kinase~~~
MIFRSKAPLRLGFAGGGTDVSPYSDEYGGYVLNATVDMYAYCTIEVTNDNRVCFYAADREEIFEGNSLEEFELDGNLDLH
KGIYNRVVKQFNHGRPLSFRMTTYSDAPAGSGLGSSSTMVVAILKGFVEWLNLPLGEYDVAHLAYEIERIDVGLSGGKQD
QYAATFGGFNFIEFYKEDKVIVNPLRIKNWIINELENSMILYYTGVSRESAKIIDEQTKNTKEKNSRSLEAMHELKADAL
IMKEAILKGDLKTFAEYLGKSWEAKKRMASSISNSYLDKIYEVAIETGAYAGKVSGAGGGGFMMFIVDPTKKITVSRELN
KMGGHTMNFHFVKHGTQGWRV
>Q0P8I9 2.7.1.168~~~hddA~~~D-glycero-alpha-D-manno-heptose 7-phosphate kinase~~~COG2605
MKTIRTQTPLRLGLAGGGTDINLYCDKYTGYVLNATISLYIHCTLIKREDGKIIFDSPDTNSYCEYESKEFLGNDGKLDI
FKSIYNRIVKDFTKKPLSFSLHTYSDVPSGSGLGGSSTLVVGVIKAFAEWLNLPLGEYEIAKLAYEIEREDLGIVGGAQD
QYAATFGGFNFMEFYNNKRVIVNPLRIKNWIASELEARTVLYFTNITREAKDIEEHKKGKLGDEKSLEAMHAIKQDAIKM
KEALFRADFGTLAQILGKSWRSKKIISEIVSNDELERIYKLAIDNGAYSGKTSGAGAGGFMFFFVDPTKKYNLIKALRKE
QGYVQDFSFTKEGVKSWRI
>O53637 2.7.1.168~~~hddA~~~D-glycero-alpha-D-manno-heptose 7-phosphate kinase~~~COG2605
MAILRGRAPLRLGLGGGGTDVEPYSSQFGGRILSVTIDKYAYAFAERGTGDEIAFRSPDRDRAGQASIDDLASLEEDFPL
HVAVYRRVIAEFNGGTPFPLQLATQVDAPPGSGLGSSSALVVAMLLTTCALIGSSPGPYELARLAWEIERVDLGMAGGWQ
DHYAAAFGGFNFMESRPNGEVVVNPLRIRREVIAELEASLLLYFGGVSRLSSEVIADQQRNVVERDADALAATHSICAEA
LEMKDLLVVGDIPGFADSLLRGWQAKKRTSTRISNPAIEHAYQVAQSSGMVAGKVSGAGGGGFLMMIVDPRRRIEVARSL
ERECGGSVAPCLFTKGGAVTWHIPESTAPVRRGVADAVASALGNAGILLCAGCVLATSHSTWRVPV
>Q9AGY6 2.7.7.71~~~hddC~~~D-glycero-alpha-D-manno-heptose 1-phosphate guanylyltransferase~~~
MEAIILVGGLGKRLRSVVSELPKPMAPIDNKPFLHYIFWYLNKQGIDQVILSTGYKHEMIETYFGNRYHGISINYSIEQE
PLGTGGAIKKAFRKTTEENVVIINGDTLFLVDLRKMFERHISFKADLTLALKPMKEFERYGTVITRDSRVIAFKEKGYHS
EGNINGGVYIANKAIFECESLSEKFSFEQDFLEKEFLQKKFYGFISDAYFIDIGIPDDYRKAQKELQHYI
>Q0P8J1 2.7.7.71~~~hddC~~~D-glycero-alpha-D-manno-heptose 1-phosphate guanylyltransferase~~~COG1208
MQAIILCGGLGTRLKSIIKDIPKPMAPINDKPFLEFIFEYLKKQGIKEVILAVSYKYEVIKEYFKDEFLGIKIKYSIEKE
PLGTGGAIKETLKFVKNEAYVLNGDTFFDIDLSKLKLNESKICLALKQMNDFDRYGTVNVDEQDLVISFEEKVFKKQGLI
NGGIYLLTKDIFNDFALQEKFSFEEFLQENYKKLKARACIFDDYFIDIGVPEDYYHFLINN
>Q2YK18 ~~~hdeA~~~Probable acid stress chaperone HdeA~~~
MIKALFNKNTALAAVAILALSGGAMAESAKTHKTDMAKKKVSELTCEDFNGLEESFKPTVVGWVVGFNKKGKEEDAVIDV
DGIETVTPAIIEACKQEPKASFWKKAEAELKKVF
>P0AES9 ~~~hdeA~~~Acid stress chaperone HdeA~~~
MKKVLGVILGGLLLLPVVSNAADAQKAADNKKPVNSWTCEDFLAVDESFQPTAVGFAEALNNKDKPEDAVLDVQGIATVT
PAIVQACTQDKQANFKDKVKGEWDKIKKDM
>P0AET2 ~~~hdeB~~~Acid stress chaperone HdeB~~~
MNISSLRKAFIFMGAVAALSLVNAQSALAANESAKDMTCQEFIDLNPKAMTPVAWWMLHEETVYKGGDTVTLNETDLTQI
PKVIEYCKKNPQKNLYTFKNQASNDLPN
>P0AET5 ~~~hdeD~~~Protein HdeD~~~COG3247
MLYIDKATILKFDLEMLKKHRRAIQFIAVLLFIVGLLCISFPFVSGDILSTVVGALLICSGIALIVGLFSNRSHNFWPVL
SGFLVAVAYLLIGYFFIRAPELGIFAIAAFIAGLFCVAGVIRLMSWYRQRSMKGSWLQLVIGVLDIVIAWIFLGATPMVS
VTLVSTLVGIELIFSAASLFSFASLFVKQQ
>Q5LA59 1.1.1.159~~~hdhA~~~7alpha-hydroxysteroid dehydrogenase~~~COG1028
MNRFENKIIIITGAAGGIGASTTRRIVSEGGKVVIADYSREKADQFAAELSNSGADVRPVYFSATELKSCKELITFTMKE
YGQIDVLVNNVGGTNPRRDTNIETLDMDYFDEAFHLNLSCTMYLSQLVIPIMSTQGGGNIVNVASISGITADSNGTLYGA
SKAGVINLTKYIATQTGKKNIRCNAVAPGLILTPAALNNLNEEVRKIFLGQCATPYLGEPQDVAATIAFLASEDARYITG
QTIVVDGGLTIHNPTINLV
>G9FRD7 1.1.1.-~~~hdha~~~7alpha-hydroxysteroid dehydrogenase~~~
MKRLEGKVAIVTSSTRGIGRASAEALAKEGALVYLAARSEELANEVIADIKKQGGVAKFVYFNAREEETYTSMVEKVAEA
EGRIDILVNNYGGTNVNLDKNLTAGDTDEFFRILKDNVQSVYLPAKAAIPHMEKVGGGSIVNISTIGSVVPDISRIAYCV
SKSAINSLTQNIALQYARKNIRCNAVLPGLIGTRAALENMTDEFRDSFLGHVPLNRVGRPEDIANAVLYYASDDSGYVTG
MIHEVAGGFALGTPQYSEYCPR
>P0AET8 1.1.1.159~~~hdhA~~~7alpha-hydroxysteroid dehydrogenase~~~COG1028
MFNSDNLRLDGKCAIITGAGAGIGKEIAITFATAGASVVVSDINADAANHVVDEIQQLGGQAFACRCDITSEQELSALAD
FAISKLGKVDILVNNAGGGGPKPFDMPMADFRRAYELNVFSFFHLSQLVAPEMEKNGGGVILTITSMAAENKNINMTSYA
SSKAAASHLVRNMAFDLGEKNIRVNGIAPGAILTDALKSVITPEIEQKMLQHTPIRRLGQPQDIANAALFLCSPAASWVS
GQILTVSGGGVQELN
>P50200 1.1.1.-~~~~~~7alpha-hydroxysteroid dehydrogenase~~~COG1028
MNKLENKVALVTSATRGIGLASAIKLAQNGAIVYMGVRRLEATQEICDKYKEEGLILKPVFFDAYNIDIYKEMIDTIIKN
EGKIDILVNNFGTGRPEKDLDLVNGDEDTFFELFNYNVGSVYRLSKLIIPHMIENKGGSIVNISSVGGSIPDISRIGYGV
SKSGVNNITKQIAIQYAKYGIRCNAVLPGLIATDAAMNSMPDEFRKSFLSHVPLNRIGNPEDIANSVLFFVPSEDSSYIT
GSILEVSGGYNLGTPQYAEFVGSKVVE
>Q1PW30 1.7.2.8~~~~~~Hydrazine dehydrogenase~~~
MRKFLKVTLASALIGCGVIGTVSSLMVKEAKAVEIITHWVPHEVYGMPGEPDNSGKVFFSGLKAKYMGYPKDAQRSPYPG
KYSKFWKTLPAYRYYIPDYMYNRDEVRPSNPIKGTFKLEQCVACHSVMTPGIVRDYNKSAHSKAEPAPTGCDTCHGNNHQ
KLTMPSSKACGTAECHETQYNEQGQGGIGSHASCSSFAQVECAWSIERPPGDTAGCTFCHTSPEERCSTCHQRHQFDPAV
ARRSEQCKTCHWGKDHRDWEAYDIGLHGTVYQVNKWDTEQFDFSKKLSDADYVGPTCQYCHMRGGHHNVQRASIVYTSMG
MSMADRGAPLWKEKRDRWVSICDDCHSPRFARENLQAMDESVKDASLKYRETFKVAEDLLIDGVLDPMPKDLCPDWSGQH
IWSLKIGAYHDGEAYGGTTGESGEFRMSNCTDVERLCFESVGYFQTYIYKGMAHGSWNDATYSDGSFGMDRWLVNVKQNA
SRARRLAALEKKVGISWQPEQFWKTGEWLDQLTGPYIVKNHPGKTIFDLCPDPGWLDTHHAPAEEVEYIERKLKELGITA
GSHSAHHHESGHDPAARSMKEH
>P08159 1.5.3.6~~~6-hdno~~~(R)-6-hydroxynicotine oxidase~~~
MSSKLATPLSIQGEVIYPDDSGFDAIANIWDGRHLQRPSLIARCLSAGDVAKSVRYACDNGLEISVRSGGHNPNGYATND
GGIVLDLRLMNSIHIDTAGSRARIGGGVISGDLVKEAAKFGLAAVTGMHPKVGFCGLALNGGVGFLTPKYGLASDNILGA
TLVTATGDVIYCSDDERPELFWAVRGAGPNFGVVTEVEVQLYELPRKMLAGFITWAPSVSELAGLLTSLLDALNEMADHI
YPSVFVGVDENRAPSVTVCVGHLGGLDIAERDIARLRGLGRTVSDSIAVRSYDEVVALNAEVGSFEDGMSNLWIDREIAM
PNARFAEAIAGNLDKFVSEPASGGSVKLEIEGMPFGNPKRTPARHRDAMGVLALAEWSGAAPGSEKYPELARELDAALLR
AGVTTSGFGLLNNNSEVTAEMVAEVYKPEVYCRLAAVKREYDPENRFRHNYNIDPEGS
>Q2FZE2 1.14.99.48~~~isdG~~~Heme oxygenase (staphylobilin-producing) 1~~~COG2329
MKFMAENRLTLTKGTAKDIIERFYTRHGIETLEGFDGMFVTQTLEQEDFDEVKILTVWKSKQAFTDWLKSDVFKAAHKHV
RSKNEDESSPIINNKVITYDIGYSYMK
>A6QG37 1.14.99.48~~~isdG~~~Heme oxygenase (staphylobilin-producing) 1~~~
MKFMAENRLTLTKGTAKDIIERFYTRHGIETLEGFDGMFVTQTLEQEDFDEVKILTVWKSKQAFTDWLKSDVFKAAHKHV
RSKNEDESSPIINNKVITYDIGYSYMK
>Q7A649 1.14.99.48~~~isdG~~~Heme oxygenase (staphylobilin-producing) 1~~~
MKFMAENRLTLTKGTAKDIIERFYTRHGIETLEGFDGMFVTQTLEQEDFDEVKILTVWKSKQAFTDWLKSDVFKAAHKHV
RSKNEDESSPIINNKVITYDIGYSYMK
>Q8NX62 1.14.99.48~~~isdG~~~Heme oxygenase (staphylobilin-producing) 1~~~
MKFMAENRLTLTKGTAKDIIERFYTRHGIETLEGFDGMFVTQTLEQEDFDEVKILTVWKSKQAFTDWLKSDVFKAAHKHV
RSKNEDESSPIINNKVITYDIGYSYMK
>Q2G1J2 1.14.99.48~~~isdI~~~Heme oxygenase (staphylobilin-producing) 2~~~COG2329
MFMAENRLQLQKGSAEETIERFYNRQGIETIEGFQQMFVTKTLNTEDTDEVKILTIWESEDSFNNWLNSDVFKEAHKNVR
LKSDDDGQQSPILSNKVFKYDIGYHYQK
>A6QDF1 1.14.99.48~~~isdI~~~Heme oxygenase (staphylobilin-producing) 2~~~
MFMAENRLQLQKGSAEETIERFYNRQGIETIEGFQQMFVTKTLNTEDTDEVKILTIWESEDSFNNWLNSDVFKEAHKNVR
LKSDDDGQQSPILSNKVFKYDIGYHYQK
>Q99X56 1.14.99.48~~~isdI~~~Heme oxygenase (staphylobilin-producing) 2~~~
MFMAENRLQLQKGSAEETIERFYNRQGIETIEGFQQMFVTKTLNTEDTDEVKILTIWESEDSFNNWLNSDVFKEAHKNVR
LKSDDDGQQSPILSNKVFKYDIGYHYQK
>Q7A827 1.14.99.48~~~isdI~~~Heme oxygenase (staphylobilin-producing) 2~~~
MFMAENRLQLQKGSAEETIERFYNRQGIETIEGFQQMFVTKTLNTEDTDEVKILTIWESEDSFNNWLNSDVFKEAHKNVR
LKSDDDGQQSPILSNKVFKYDIGYHYQK
>Q81L50 1.14.14.18~~~isdG~~~Heme-degrading monooxygenase~~~COG2329
MIIVTNTAKITKGNGHKLIDRFNKVGQVETMPGFLGLEVLLTQNTVDYDEVTISTRWNAKEDFQGWTKSPAFKAAHSHQG
GMPDYILDNKISYYDVKVVRMPMAAAQ
>A4QFW4 3.1.3.-~~~hdpA~~~Dihydroxyacetone phosphatase~~~
MTVNISYLTDMDGVLIKEGEMIPGADRFLQSLTDNNVEFMVLTNNSIFTPRDLSARLKTSGLDIPPERIWTSATATAHFL
KSQVKEGTAYVVGESGLTTALHTAGWILTDANPEFVVLGETRTYSFEAITTAINLILGGARFICTNPDVTGPSPSGILPA
TGSVAALITAATGAEPYYIGKPNPVMMRSALNTIGAHSEHTVMIGDRMDTDVKSGLEAGLSTVLVRSGISDDAEIRRYPF
RPTHVINSIADLADCWDDPFGDGAFHVPDEQQFTD
>P55792 4.2.1.120~~~abfD~~~4-hydroxybutyryl-CoA dehydratase/vinylacetyl-CoA-Delta-isomerase~~~
MLMTAEQYIESLRKLNTRVYMFGEKIENWVDHPMIRPSINCVAMTYELAQDPQYADLMTTKSNLIGKTINRFANLHQSTD
DLRKKVKMQRLLGQKTASCFQRCVGMDAFNAVFSTTYEIDQKYGTNYHKNFTEYLKYIQENDLIVDGAMTDPKGDRGLAP
SAQKDPDLFLRIVEKREDGIVVRGAKAHQTGSINSHEHIIMPTIAMTEADKDYAVSFACPSDADGLFMIYGRQSCDTRKM
EEGADIDLGNKQFGGQEALVVFDNVFIPNDRIFLCQEYDFAGMMVERFAGYHRQSYGGCKVGVGDVVIGAAALAADYNGA
QKASHVKDKLIEMTHLNETLYCCGIACSAEGYPTAAGNYQIDLLLANVCKQNITRFPYEIVRLAEDIAGGLMVTMPSEAD
FKSETVVGRDGETIGDFCNKFFAAAPTCTTEERMRVLRFLENICLGASAVGYRTESMHGAGSPQAQRIMIARQGNINAKK
ELAKAIAGIK
>O32215 3.6.4.12~~~helD~~~DNA helicase IV~~~COG3973
MNQQDKEWKEEQSRIDEVLKELEKKERFLETSAGGLKHDIIGLRKSFWEDVKVNFDDAHEAIETMASIKQQAELLSDREH
NHRRMDQQLKRIHQLKKSPYFGRIDFIENGEEQAERIYIGLASCLDEKEEHFLIYDWRAPISSLYYNYSPGKAEYEVPGE
TIEGEMVLKRQFMIKNGTLKAMFNTDMTIGDEMLQEVLSHHSDTQMKNIVSTIQKEQNQIIRNEKSKILIVQGAAGSGKT
SAALQRVAYLLYRHRGVIDAGQIVLFSPNFLFNSYVSSVLPELGEENMEQATFQEYIEHRLGRKFKCESPFDQLEYCLTE
TKGGDFPTRLAGITWKAGLSFQQFINEYVTRLSSEGMIFKNIIFRGQKLITKEQIQSYFYSLDQNHSIPNRMEQTAKWLL
SELNKLEKKERRKDWVVHEAELLDKEDYLDVYKKLQERKRFSESTFNDYQREQQLLAAIIVKKAFKPLKQAVRLLAFLDV
TQLYLQLFSGWGGKFQHEKMDAIGELTRSAFTDNKLLYEDAAPFLYMQDLIEGRKKNTKIKHLFIDEAQDYSPFQMAYMR
SIFPAASMTVLGDINQSIYAHTINGDQRMDACFEDEPAEYVRLKRTYRSTRQIVEFTKAMLQDGADIEPFNRSGEMPLVV
KTEGHESLCQKLAQEIGRLKKKGHETIAVICKTAHQCIQAHAHMSEYTDVRLIHKENQPFQKGVCVIPVYLAKGIEFDAV
LVYDASEEHYHTEHDRRLLYTACTRAMHMLAVFYTGEASPFVTAVPPHLYQIAE
>P15038 5.6.2.4~~~helD~~~DNA helicase IV~~~COG0210
MELKATTLGKRLAQHPYDRAVILNAGIKVSGDRHEYLIPFNQLLAIHCKRGLVWGELEFVLPDEKVVRLHGTEWGETQRF
YHHLDAHWRRWSGEMSEIASGVLRQQLDLIATRTGENKWLTREQTSGVQQQIRQALSALPLPVNRLEEFDNCREAWRKCQ
AWLKDIESARLQHNQAYTEAMLTEYADFFRQVESSPLNPAQARAVVNGEHSLLVLAGAGSGKTSVLVARAGWLLARGEAS
PEQILLLAFGRKAAEEMDERIRERLHTEDITARTFHALALHIIQQGSKKVPIVSKLENDTAARHELFIAEWRKQCSEKKA
QAKGWRQWLTEEMQWSVPEGNFWDDEKLQRRLASRLDRWVSLMRMHGGAQAEMIASAPEEIRDLFSKRIKLMAPLLKAWK
GALKAENAVDFSGLIHQAIVILEKGRFISPWKHILVDEFQDISPQRAALLAALRKQNSQTTLFAVGDDWQAIYRFSGAQM
SLTTAFHENFGEGERCDLDTTYRFNSRIGEVANRFIQQNPGQLKKPLNSLTNGDKKAVTLLDESQLDALLDKLSGYAKPE
ERILILARYHHMRPASLEKAATRWPKLQIDFMTIHASKGQQADYVIIVGLQEGSDGFPAAARESIMEEALLPPVEDFPDA
EERRLMYVALTRARHRVWALFNKENPSPFVEILKNLDVPVARKP
>P9WMR1 3.6.4.-~~~helY~~~Probable helicase HelY~~~COG4581
MTELAELDRFTAELPFSLDDFQQRACSALERGHGVLVCAPTGAGKTVVGEFAVHLALAAGSKCFYTTPLKALSNQKHTDL
TARYGRDQIGLLTGDLSVNGNAPVVVMTTEVLRNMLYADSPALQGLSYVVMDEVHFLADRMRGPVWEEVILQLPDDVRVV
SLSATVSNAEEFGGWIQTVRGDTTVVVDEHRPVPLWQHVLVGKRMFDLFDYRIGEAEGQPQVNRELLRHIAHRREADRMA
DWQPRRRGSGRPGFYRPPGRPEVIAKLDAEGLLPAITFVFSRAGCDAAVTQCLRSPLRLTSEEERARIAEVIDHRCGDLA
DSDLAVLGYYEWREGLLRGLAAHHAGMLPAFRHTVEELFTAGLVKAVFATETLALGINMPARTVVLERLVKFNGEQHMPL
TPGEYTQLTGRAGRRGIDVEGHAVVIWHPEIEPSEVAGLASTRTFPLRSSFAPSYNMTINLVHRMGPQQAHRLLEQSFAQ
YQADRSVVGLVRGIERGNRILGEIAAELGGSDAPILEYARLRARVSELERAQARASRLQRRQAATDALAALRRGDIITIT
HGRRGGLAVVLESARDRDDPRPLVLTEHRWAGRISSADYSGTTPVGSMTLPKRVEHRQPRVRRDLASALRSAAAGLVIPA
ARRVSEAGGFHDPELESSREQLRRHPVHTSPGLEDQIRQAERYLRIERDNAQLERKVAAATNSLARTFDRFVGLLTEREF
IDGPATDPVVTDDGRLLARIYSESDLLVAECLRTGAWEGLKPAELAGVVSAVVYETRGGDGQGAPFGADVPTPRLRQALT
QTSRLSTTLRADEQAHRITPSREPDDGFVRVIYRWSRTGDLAAALAAADVNGSGSPLLAGDFVRWCRQVLDLLDQVRNAA
PNPELRATAKRAIGDIRRGVVAVDAG
>P26093 ~~~hel~~~Lipoprotein E~~~COG2503
MKTTLKMTALAALSAFVLAGCGSHQMKSEGHANMQLQQQAVLGLNWMQDSGEYKALAYQAYNAAKVAFDHAKVAKGKKKA
VVADLDETMLDNSPYAGWQVQNNKPFDGKDWTRWVDARQSRAVPGAVEFNNYVNSHNGKVFYVTNRKDSTEKSGTIDDMK
RLGFNGVEESAFYLKKDKSAKAARFAEIEKQGYEIVLYVGDNLDDFGNTVYGKLNADRRAFVDQNQGKFGKTFIMLPNAN
YGGWEGGLAEGYFKKDTQGQIKARLDAVQAWDGK
>P16618 1.2.1.70~~~hemA~~~Glutamyl-tRNA reductase~~~COG0373
MHILVVGVDYKSAPIEIREKVSFQPNELAEAMVQLKEEKSILENIIVSTCNRTEIYAVVDQLHTGRYYIKKFLADWFQLS
KEELSPFLTFYESDAAVEHLFRVACGLDSMVIGETQILGQVRDSFKTAQQEKTIGTIFNELFKQAVTVGKRTHAETDIGS
NAVSVSYAAVELAKKIFGNLSSKHILILGAGKMGELAAENLHGQGIGKVTVINRTYLKAKELADRFSGEARSLNQLESAL
AEADILISSTGASEFVVSKEMMENANKLRKGRPLFMVDIAVPRDLDPALNDLEGVFLYDIDDLEGIVEANMKERRETAEK
VELLIEETIVEFKQWMNTLGVVPVISALREKALAIQSETMDSIERKLPHLSTREKKLLNKHTKSIINQMLRDPILKVKEL
AADADSEEKLALFMQIFDIEEAAGRQMMKTVESSQKVHSFKKAESKAGFSPLVSE
>P28462 1.2.1.70~~~hemA~~~Glutamyl-tRNA reductase~~~COG0373
MNIISVGVNHKTAPIEIRERIALSEVQNKEFVTDLVSSGLASEAMVVSTCNRTELYVVPGMPEVNCDYLKDYIISYKDAR
NAVRPEHFFNRFYCGTARHLFEVSSAIDSLVLGEGQILGQVKDAYRIAAEVGTAGILLTRLCHTAFSVAKKVKTRTKLME
GAVSVSYAAVELAQKIFSNLSMKKVLLIGAGETGELAAKHMYAKNARNIVITNRTQSKAEALAEELGTNRVLPYESYKEH
LHEFDIIITAVSTKEYILNAAEMQQSMAKRRLKPVIILDLGLPRNVDPEVGALQNMFLKDIDALKHIIDKNLERRRAELP
KVKSIIDEELIGFGQWINTLKVRPTIVDLQSKFIEIKEKELERYRHKVSEEELKRMEHLTDRILKKILHHPIKMLKAPVD
TADNIPSKVNLVRNIFDLEEPNQSLQ
>P0A6X1 1.2.1.70~~~hemA~~~Glutamyl-tRNA reductase~~~COG0373
MTLLALGINHKTAPVSLRERVSFSPDKLDQALDSLLAQPMVQGGVVLSTCNRTELYLSVEEQDNLQEALIRWLCDYHNLN
EEDLRKSLYWHQDNDAVSHLMRVASGLDSLVLGEPQILGQVKKAFADSQKGHMKASELERMFQKSFSVAKRVRTETDIGA
SAVSVAFAACTLARQIFESLSTVTVLLVGAGETIELVARHLREHKVQKMIIANRTRERAQILADEVGAEVIALSDIDERL
READIIISSTASPLPIIGKGMVERALKSRRNQPMLLVDIAVPRDVEPEVGKLANAYLYSVDDLQSIISHNLAQRKAAAVE
AETIVAQETSEFMAWLRAQSASETIREYRSQAEQVRDELTAKALAALEQGGDAQAIMQDLAWKLTNRLIHAPTKSLQQAA
RDGDNERLNILRDSLGLE
>P9WMP7 1.2.1.70~~~hemA~~~Glutamyl-tRNA reductase~~~COG0373
MSVLLFGVSHRSAPVVVLEQLSIDESDQVKIIDRVLASPLVTEAMVLSTCNRVEVYAVVDAFHGGLSVIGQVLAEHSGMS
MGELTKYAYVRYSEAAVEHLFAVASGLDSAVIGEQQVLGQVRRAYAVAESNRTVGRVLHELAQRALSVGKRVHSETAIDA
AGASVVSVALGMAERKLGSLAGTTAVVIGAGAMGALSAVHLTRAGVGHIQVLNRSLSRAQRLARRIRESGVPAEALALDR
LANVLADADVVVSCTGAVRPVVSLADVHHALAAARRDEATRPLVICDLGMPRDVDPAVARLPCVWVVDVDSVQHEPSAHA
AAADVEAARHIVAAEVASYLVGQRMAEVTPTVTALRQRAAEVVEAELLRLDNRLPGLQSVQREEVARTVRRVVDKLLHAP
TVRIKQLASAPGGDSYAEALRELFELDQTAVDAVATAGELPVVPSGFDAESRRGGGDMQSSPKRSPSN
>P18079 2.3.1.37~~~hemA~~~5-aminolevulinate synthase~~~COG0156
MDYNLALDKAIQKLHDEGRYRTFIDIEREKGAFPKAQWNRPDGGKQDITVWCGNDYLGMGQHPVVLAAMHEALEAVGAGS
GGTRNISGTTAYHRRLEAEIADLHGKEAALVFSSAYIANDATLSTLRVLFPGLIIYSDSLNHASMIEGIKRNAGPKRIFR
HNDVAHLRELIAADDPAAPKLIAFESVYSMDGDFGPIKEICDIADEFGALTYIDEVHAVGMYGPRGAGVAERDGLMHRID
IFNGTLAKAYGVFGGYIAASAKMVDAVRSYAPGFIFSTSLPPAIAAGAQASIAFLKTAEGQKLRDAQQMHAKVLKMRLKA
LGMPIIDHGSHIVPVVIGDPVHTKAVSDMLLSDYGVYVQPINFPTVPRGTERLRFTPSPVHDLKQIDGLVHAMDLLWARC
ALNRAEASA
>P45622 4.2.1.24~~~hemB~~~Delta-aminolevulinic acid dehydratase~~~COG0113
MAIKYGRPIELREVSRRDGAAASPALDLAIRPRRNRKAEWARRMVRENVLTTDDLIWPLFLIDGNNKREQIASMPGVERL
SVDQAVREAERAMKLTIPCIALFPYTDPSLRDEEGSEACNPNNLVCQAVRAIKKEFPEIGVLCDVALDPFTSHGHDGLIA
DGAILNDETVAVLVRQALVQAEAGCDIIAPSDMMDGRVAAIREGLDQAGLIDVQIMAYAAKYASAFYGPFRDAIGSAKTL
TGDKRTYQMDSANTDEALREVELDISEGADMVMVKPGMPYLDVVRRVKDTFAMPTFAYQVSGEYAMIAAAAGNGWLDGDR
AMMESLLAFKRAGADGVLSYFAPKAAEKLRTQG
>Q59334 4.2.1.24~~~hemB~~~Delta-aminolevulinic acid dehydratase~~~COG0113
MSQLDLLNIVHRPRRLRRTAALRNLVQENTLTVNDLVFPLFVMPGTNAVEEVSSMPGSFRFTIDRAVEECKELYDLGIQG
IDLFGIPEQKTEDGSEAYNDNGILQQAIRAIKKAVPELCIMTDVALDPFTPFGHDGLVKDGIILNDETVEVLQKMAVSHA
EAGADFVSPSDMMDGRIGAIREALDETDHSDVGILSYAAKYASSFYGPFRDALHSAPQFGDKSTYQMNPANTEEAMKEVE
LDIVEGADIVMVKPGLAYLDIVWRTKERFDVPVAIYHVSGEYAMVKAAAAKGWIDEDRVMMESLLCMKRAGADIIFTYYA
KEAAKKLR
>P0ACB2 4.2.1.24~~~hemB~~~Delta-aminolevulinic acid dehydratase~~~COG0113
MTDLIQRPRRLRKSPALRAMFEETTLSLNDLVLPIFVEEEIDDYKAVEAMPGVMRIPEKHLAREIERIANAGIRSVMTFG
ISHHTDETGSDAWREDGLVARMSRICKQTVPEMIVMSDTCFCEYTSHGHCGVLCEHGVDNDATLENLGKQAVVAAAAGAD
FIAPSAAMDGQVQAIRQALDAAGFKDTAIMSYSTKFASSFYGPFREAAGSALKGDRKSYQMNPMNRREAIRESLLDEAQG
ADCLMVKPAGAYLDIVRELRERTELPIGAYQVSGEYAMIKFAALAGAIDEEKVVLESLGSIKRAGADLIFSYFALDLAEK
KILR
>P9WMP5 4.2.1.24~~~hemB~~~Delta-aminolevulinic acid dehydratase~~~COG0113
MSMSSYPRQRPRRLRSTVAMRRLVAQTSLEPRHLVLPMFVADGIDEPRPITSMPGVVQHTRDSLRRAAAAAVAAGVGGLM
LFGVPRDQDKDGVGSAGIDPDGILNVALRDLAKDLGEATVLMADTCLDEFTDHGHCGVLDDRGRVDNDATVARYVELAVA
QAESGAHVVGPSGMMDGQVAAIRDGLDAAGYIDVVILAYAAKFASAFYGPFREAVSSSLSGDRRTYQQEPGNAAEALREI
ELDLDEGADIVMVKPAMGYLDVVAAAADVSPVPVAAYQVSGEYAMIRAAAANNWIDERAAVLESLTGIRRAGADIVLTYW
AVDAAGWLT
>Q59643 4.2.1.24~~~hemB~~~Delta-aminolevulinic acid dehydratase~~~
MSFTPANRAYPYTRLRRNRRDDFSRRLVRENVLTVDDLILPVFVLDGVNQRESIPSMPGVERLSIDQLLIEAEEWVALGI
PALALFPVTPVEKKSLDAAEAYNPEGIAQRATRALRERFPELGIITDVALDPFTTHGQDGILDDDGYVLNDVSIDVLVRQ
ALSHAEAGAQVVAPSDMMDGRIGAIREALESAGHTNVRIMAYSAKYASAYYGPFRDAVGSASNLGKGNKATYQMDPANSD
EALHEVAADLAEGADMVMVKPGMPYLDIVRRVKDEFRAPTFVYQVSGEYAMHMGAIQNGWLAESVILESLTAFKRAGADG
ILTYFAKQAAEQLRRGR
>P42504 4.2.1.24~~~hemB~~~Delta-aminolevulinic acid dehydratase~~~
MTLITPPFPTNRLRRMRRTEALRDLAQENRLSVKDLIWPIFITDVPGADVEISSMPGVVRRTMDGALKAAEGSRDAGHSR
DLPVPLTDPAVKTETCEMAWQPDNFTNRVIAAMKQAVPEVAIMTDIALDPYNANGHDGLVRDGILLNDETTEALVKMALA
QAAAGADILGPSDMMDGRVGAIRQAMEAAGHKDIAILSYAAKYASAFYGPFRDAVGASSALKGDKKTYQMNPANSAEALR
NVARDIAEGADMVMVKPGMPYLDIVRQVKDAFGMPTYAYQVSGEYAMLMAAVQNGWLNHDKVMLESLMAFRRAGCDGVLT
YFAPAAAKLIGA
>P64334 4.2.1.24~~~hemB~~~Delta-aminolevulinic acid dehydratase~~~
MKFDRHRRLRSSATMRDMVRENHVRKEDLIYPIFVVEKDDVKKEIKSLPGVYQISLNLLESELKEAYDLGIRAIMFFGVP
NSKDDIGTGAYIHDGVIQQATRIAKKMYDDLLIVADTCLCEYTDHGHCGVIDDHTHDVDNDKSLPLLVKTAISQVEAGAD
IIAPSNMMDGFVAEIRRGLDEAGYYNIPIMSYGVKYASSFFGPFRDAADSAPSFGDRKTYQMDPANRLEALRELESDLKE
GCDMMIVKPALSYLDIVRDVKNHTNVPVVAYNVSGEYSMTKAAAQNGWIDEERVVMEQMVSMKRAGADMIITYFAKDICR
YLDK
>P06983 2.5.1.61~~~hemC~~~Porphobilinogen deaminase~~~COG0181
MLDNVLRIATRQSPLALWQAHYVKDKLMASHPGLVVELVPMVTRGDVILDTPLAKVGGKGLFVKELEVALLENRADIAVH
SMKDVPVEFPQGLGLVTICEREDPRDAFVSNNYDSLDALPAGSIVGTSSLRRQCQLAERRPDLIIRSLRGNVGTRLSKLD
NGEYDAIILAVAGLKRLGLESRIRAALPPEISLPAVGQGAVGIECRLDDSRTRELLAALNHHETALRVTAERAMNTRLEG
GCQVPIGSYAELIDGEIWLRALVGAPDGSQIIRGERRGAPQDAEQMGISLAEELLNNGAREILAEVYNGDAPA
>P9WMP3 2.5.1.61~~~hemC~~~Porphobilinogen deaminase~~~COG0181
MIRIGTRGSLLATTQAATVRDALIAGGHSAELVTISTEGDRSMAPIASLGVGVFTTALREAMEAGLVDAAVHSYKDLPTA
ADPRFTVAAIPPRNDPRDAVVARDGLTLGELPVGSLVGTSSPRRAAQLRALGLGLEIRPLRGNLDTRLNKVSSGDLDAIV
VARAGLARLGRLDDVTETLEPVQMLPAPAQGALAVECRAGDSRLVAVLAELDDADTRAAVTAERALLADLEAGCSAPVGA
IAEVVESIDEDGRVFEELSLRGCVAALDGSDVIRASGIGSCGRARELGLSVAAELFELGARELMWGVRH
>P64341 2.5.1.61~~~hemC~~~Porphobilinogen deaminase~~~
MRKLVVGSRRSKLALTQSQQFINKLKAVEPNLEIEIKEIVTKGDRIVDKQLSKVGGKGLFVKEIQHELFEKNIDMAIHSL
KDVPSVIPEGLTLGCIPDRELPFDAYISKTHTPLSQLPEGSIIGTSSLRRGAQILSKYPNLEIKWIRGNIDTRLEKLQTE
DYDAIILAAAGLRRMGWSDDIVTSYLDRDTLLPAIGQGALGIECRSDDEELLTLLSKVHNDEVAKCVTAERTFLAEMDGS
CQVPIAGYATISDQKEIEFTGLIMTPDGKERFEYTMNGTDPVELGKTVSNKLKEQGAYEIIKRLNEQH
>Q9KVM1 2.5.1.61~~~hemC~~~Porphobilinogen deaminase~~~COG0181
MTETPIRIATRQSPLALWQANYVKDALMAAHPGLQVELVTMVTRGDVILDTPLAKVGGKGLFVKELEIAMLEGRADLAVH
SMKDVPVDFPDGLGLVTICEREDPRDAFVSNTYAKIEDLPSGAIVGTCSLRRQCQLKAARPDLVIKELRGNVGTRLSKLD
AGEYDAIILAAAGLKRLELESRIRSFIEPEQSLPAVGQGAVGIECRVNDQRVRALLAPLNHADTADRVRCERAMNLTLQG
GCQVPIGSYALLEGDTIWLRALVGEPDGSQIVRGEIRGPRTQAEQLGITLAEQLLSQGAKEILERLYCDHE
>P48246 4.2.1.75~~~hemD~~~Uroporphyrinogen-III synthase~~~
MSGWRLLLTRPDEECAALAASLGEAGVHSSSLPLLAIDPLEETPEQRTLMLDLDRYCAVVVVSKPAARLGLERLDRYWPQ
PPQQTWCSVGAATAAILEAYGLDVTYPEQGDDSEALLALPAFQDSLRVHDPKVLIMRGEGGREFLAERLRGQGVQVDYLP
LYRRRAPDYPAGELLARVRAERLNGLVVSSGQGLQNLYQLAAADWPEIGRLPLFVPSPRVAEMARELGAQRVIDCRGASA
PALLAALTSAA
>P36553 1.3.3.3~~~hemF~~~Oxygen-dependent coproporphyrinogen-III oxidase~~~COG0408
MKPDAHQVKQFLLNLQDTICQQLTAVDGAEFVEDSWQREAGGGGRSRVLRNGGVFEQAGVNFSHVHGEAMPASATAHRPE
LAGRSFEAMGVSLVVHPHNPYVPTSHANVRFFIAEKPGADPVWWFGGGFDLTPFYGFEEDAIHWHRTARDLCLPFGEDVY
PRYKKWCDEYFYLKHRNEQRGIGGLFFDDLNTPDFDRCFAFMQAVGKGYTDAYLPIVERRKAMAYGERERNFQLYRRGRY
VEFNLVWDRGTLFGLQTGGRTESILMSMPPLVRWEYDYQPKDGSPEAALSEFIKVRDWV
>P72848 1.3.3.3~~~hemF~~~Oxygen-dependent coproporphyrinogen-III oxidase~~~COG0408
MTVSPTTQPQTNHSLPPADAKQRVSQFMQTLQDEICQGLEALDGKGKFQEDSWQREEGGGGRSRVLADGDFLEQGGVNFS
EVWGKSLPPSILKQRPEAEGHEFYATGTSMVLHPKNPYIPTVHLNYRYFEAGPVWWFGGGADLTPYYPFAEDAAHFHHTL
KNACDQTHGEFYPVFKRWCDEYFYLKHRQEMRGIGGIFFDYQDGNAPLYRGPDPNGPAAQYSNQLAPIEPLGWEDLFSFA
QRCGRAFLPAYSPIVEKRRNTEYGDRQRQFQLYRRGRYVEFNLVYDRGTIFGLQTNGRTESILMSLPPLVRWQYCYSPEA
GSPEAELTEKFLVPQDWVNS
>O07621 ~~~hemAT~~~Heme-based aerotactic transducer HemAT~~~COG0840
MLFKKDRKQETAYFSDSNGQQKNRIQLTNKHADVKKQLKMVRLGDAELYVLEQLQPLIQENIVNIVDAFYKNLDHESSLM
DIINDHSSVDRLKQTLKRHIQEMFAGVIDDEFIEKRNRIASIHLRIGLLPKWYMGAFQELLLSMIDIYEASITNQQELLK
AIKATTKILNLEQQLVLEAFQSEYNQTRDEQEEKKNLLHQKIQETSGSIANLFSETSRSVQELVDKSEGISQASKAGTVT
SSTVEEKSIGGKKELEVQQKQMNKIDTSLVQIEKEMVKLDEIAQQIEKIFGIVTGIAEQTNLLSLNASIESARAGEHGKG
FAVVANEVRKLSEDTKKTVSTVSELVNNTNTQINIVSKHIKDVNELVSESKEKMTQINRLFDEIVHSMKISKEQSGKIDV
DLQAFLGGLQEVSRAVSHVAASVDSLVILTEE
>P0ACB4 1.3.5.3~~~hemG~~~Protoporphyrinogen IX dehydrogenase [quinone]~~~COG4635
MKTLILFSTRDGQTREIASYLASELKELGIQADVANVHRIEEPQWENYDRVVIGASIRYGHYHSAFQEFVKKHATRLNSM
PSAFYSVNLVARKPEKRTPQTNSYARKFLMNSQWRPDRCAVIAGALRYPRYRWYDRFMIKLIMKMSGGETDTRKEVVYTD
WEQVANFAREIAHLTDKPTLK
>P57777 4.98.1.1~~~hemH~~~Ferrochelatase~~~COG0276
MTQKLAVVLFNLGGPDGPDAVRPFLFNLFRDPAIIGAPALIRYPLAALISTTREKSAKANYAIMGGGSPLLPETEKQARA
LEAALALAMPGVEAKCFIAMRYWHPLTDETARQVAAFAPDQVVLLPLYPQFSTTTTGSSLKAWKKTYKGSGVQTTVGCYP
TEGGLIEAHARMIRESWEKAGSPTNIRLLFSAHGLPEKVILAGDPYQKQVEATAAAVAAHLPPQIEWTVCYQSRVGPLKW
IGPSTDDEIRRAGGEDKGVMITPIAFVSEHVETLVELDHEYAELAEEVGAAPYLRVSALGTAPEFIDGLAKAVRDSVGKA
PGTVSSACGWRCGADWSKCPCREGASA
>P23871 4.98.1.1~~~hemH~~~Ferrochelatase~~~COG0276
MRQTKTGILLANLGTPDAPTPEAVKRYLKQFLSDRRVVDTSRLLWWPLLRGVILPLRSPRVAKLYASVWMEGGSPLMVYS
RQQQQALAQRLPEMPVALGMSYGSPSLESAVDELLAEHVDHIVVLPLYPQFSCSTVGAVWDELARILARKRSIPGISFIR
DYADNHDYINALANSVRASFAKHGEPDLLLLSYHGIPQRYADEGDDYPQRCRTTTRELASALGMAPEKVMMTFQSRFGRE
PWLMPYTDETLKMLGEKGVGHIQVMCPGFAADCLETLEEIAEQNREVFLGAGGKKYEYIPALNATPEHIEMMANLVAAYR
>P72793 1.3.99.-~~~hemJ~~~Protoporphyrinogen IX oxidase~~~COG1981
MPKREYFSLPCPLSTFTMAYYWFKAFHLIGIVVWFAGLFYLVRLFVYHAEADQEPEPAKTILKKQYELMEKRLYNIITTP
GMVVTVAMAIGLIFTEPEILKSGWLHIKLTFVALLLLYHFYCGRVMKKLAQGESQWSGQQFRALNEAPTILLVVIVLLAV
FKNNLPLDATTWLIVALVIAMAASIQLYAKKRRRDQALLTEQQKAASAQN
>O34162 1.3.98.3~~~hemN~~~Oxygen-independent coproporphyrinogen III oxidase~~~COG0635
MIPSAITSPAPDRRALSDFRALAGRIDGNGPRYTSYPTADRFHNGPDLSLYHDALAACRADAPAPLSLYLHIPFCENICY
YCGCNKIITRDHGRSARYVNYLGREMALVADRLGPRRQVLQSHWGGGTPTFLDPGEMRRVMALLHEHFELAAEGEHSIEI
DPRRVDHARMALLAELGFNRVSLGVQDFDPEVQQAIHRIQPFEETRAVVDAARTLGFRSVSLDLIYGLPHQTAARFGRTI
DQVLALRPDRLSVYSYAHLPHVFKPQRRIDENALPPAGEKLDILVSTIERLSAEGYVYIGMDHFALPDDDLAVAQREGRL
QRNFQGYSTHAGYDQVGLGISAIGAIAGRYVQNARTLDEYYGALDHGRLPLARGVAMSADDHLRREIIGALMCNGVLDIP
ALEARHGIRFGTAFAPELADLAALGADGLVQCAPDRITVTPLGRLLVRRVAMVFDRYLREDAARPASTGAQAVAANDGAQ
PVRFVPRARYSRVV
>P32131 1.3.98.3~~~hemN~~~Oxygen-independent coproporphyrinogen III oxidase~~~COG0635
MSVQQIDWDLALIQKYNYSGPRYTSYPTALEFSEDFGEQAFLQAVARYPERPLSLYVHIPFCHKLCYFCGCNKIVTRQQH
KADQYLDALEQEIVHRAPLFAGRHVSQLHWGGGTPTYLNKAQISRLMKLLRENFQFNADAEISIEVDPREIELDVLDHLR
AEGFNRLSMGVQDFNKEVQRLVNREQDEEFIFALLNHAREIGFTSTNIDLIYGLPKQTPESFAFTLKRVAELNPDRLSVF
NYAHLPTIFAAQRKIKDADLPSPQQKLDILQETIAFLTQSGYQFIGMDHFARPDDELAVAQREGVLHRNFQGYTTQGDTD
LLGMGVSAISMIGDCYAQNQKELKQYYQQVDEQGNALWRGIALTRDDCIRRDVIKSLICNFRLDYAPIEKQWDLHFADYF
AEDLKLLAPLAKDGLVDVDEKGIQVTAKGRLLIRNICMCFDTYLRQKARMQQFSRVI
>P77915 1.3.98.3~~~hemN~~~Oxygen-independent coproporphyrinogen III oxidase~~~
MLDTIRWDADLIRRYDLSGPRYTSYPTAVQFHEGIGPFDQLHALRDSRKAGHPLSLYVHIPFCANICYYCACNKVITKDR
GRSAPYLARLVREIEIVSRHLSREQVVEQLHFGGGTPTFLSPGQLRELMSQLRTHLNLLDDDSGDYGIEIDPREADWSTM
GLLRELGFNRVSLGVQDFDMEVQKAVNRMQTPEETRTIVEAARTLQYRSINLDLIYGLPKQTPDSFARTVDEVIALQPDR
LSVFNYAHLPERFMPQRRINADDLPSPGQKLEMLQRTTEQLAAAGYRYIGMDHFALPDDELASAQEDGTLQRNFQGYTTH
GHCDLVGLGVSAISQIGDLYSQNSSDINDYQTSLDNGQLAIRRGLHCNSDDRVRRAVIQQLICHFELAFEDIETEFGIDF
RSYFAELWPDLERFAADGLIRLDAKGIDITSSGRLLVRSICMLFDRYLPSLNRQRFSRVI
>P74132 1.3.98.3~~~hemN~~~Oxygen-independent coproporphyrinogen III oxidase~~~COG0635
MTTTFPTVEFSAELLNKYNQGIPRYTSYPPATELNKEFDPSDFQTAINLGNYKKTPLSLYCHIPFCAKACYFCGCNTIIT
QHKPAVDPYLKAVAKQIALVAPLVDQQRPVQQLHWGGGTPNYLTLEQAEFLFNTITDAFPLAENAEISIEINPCYVDKDY
IFALRQLGFNRISFGIQDFNSQVQQAVNRIQPEAMLFQVMDWIRQANFDSVNVDLIYGLPHQNLATFRETLRKTAQLNPD
RIAVFNFAYVPWLKPVQKKMPESALPPAEEKLKIMQATIADLTEQGYVFIGMDHFAKPDDELAIAQRRGELHRNFQGYTT
QPESDLLGFGITSISMLQDVYAQNHKTLKAFYNALDREVMPIEKGFKLSQDDLIRRTVIKELMCQFKLSAQELESKYNLG
FDCDFNDYFAKELSALDVLEADGLLRRLGDGLEVTPRGRILIRNIAAVFDTYLQNKSKQQMFSRAI
>P31499 ~~~hemR~~~Hemin receptor~~~
MPRSTSDRFRWSPLSLAIACTLSLAVQAADTSSTQTNSKKRIADTMVVTATGNERSSFEAPMMVTVVEADTPTSETATSA
TDMLRNIPGLTVTGSGRVNGQDVTLRGYGKQGVLTLVDGIRQGTDTGHLNSTFLDPALVKRVEIVRGPSALLYGSGALGG
VISYETVDAADLLLPGQNSGYRVYSAAATGDHSFGLGASAFGRTDDVDGILSFGTRDIGNIRQSDGFNAPNDETISNVLA
KGTWRIDQIQSLSANLRYYNNSALEPKNPQTSAASSTNLMTDRSTIQRDAQLKYNIKPLDQEWLNATAQVYYSEVEINAR
PQGTPEEGRKQTTKGGKLENRTRLFTDSFASHLLTYGTEAYKQEQTPSGATESFPQADIRFGSGWLQDEITLRDLPVSIL
AGTRYDNYRGSSEGYADVDADKWSSRGAVSVTPTDWLMLFGSYAQAFRAPTMGEMYNDSKHFSMNIMGNTLTNYWVPNPN
LKPETNETQEYGFGLRFNDLMMAEDDLQFKASYFDTNAKDYISTGVTMDFGFGPGGLYCKNCSTYSTNIDRAKIWGWDAT
MTYQTQWFNLGLAYNRTRGKNQNTNEWLDTINPDTVTSTLDVPVANSGFAVGWIGTFADRSSRVSSSGTPQAGYGVNDFY
VSYKGQEQFKGMTTTVVLGNAFDKGYYGPQGVPQDGRNAKFFVSYQW
>P31517 ~~~hemS~~~Hemin transport protein HemS~~~COG3720
MSKSIYEQYLQAKADNPGKYARDLATLMGISEAELTHSRVSHDAKRLKGDARALLAALEAVGEVKAITRNTYAVHEQMGR
YENQHLNGHAGLILNPRNLDLRLFLNQWASAFTLTEETRHGVRHSIQFFDHQGDALHKVYVTEQTDMPAWEALLAQFITT
EIPELQLEPLSAPEVTEPTATDEAVDAEWRAMTDVHQFFQLLKRNNLTRQQAFRAVGNDLAYQVDNSSLTQLLNIAQQEQ
NEIMIFVGNRGCVQIFTGMIEKVTPHQDWINVFNQRFTLHLIETTIAESWITRKPTKDGFVTSLELFAADGTQIAQLYGQ
RTEGQPEQTQWRDEIARLNNKDIAA
>Q60AX2 ~~~~~~Bacteriohemerythrin~~~COG2703
MALMTWTAAEFGTNVGFADDQHKTIFDMVNKLHDTAATGNRSEIGKQLDALIDYVVMHFKSEETEMQKKGYADFAAHKAE
HDKLVGVCADLQKKFHAGEAEVNQDTTRFVRDWLVNHIPKVDKLYGPCLSA
>P54304 ~~~hemW~~~Heme chaperone HemW~~~COG0635
MKSAYIHIPFCEHICHYCDFNKYFIQSQPVDEYLNALEQEMINTIAKTGQPDLKTIFIGGGTPTSLSEEQLKKLMDMINR
VLKPSSDLSEFAVEANPDDLSAEKLKILKEAGVNRLSFGVQTFEDDLLEKIGRVHKQKDVFTSFERAREIGFENISLDLM
FGLPGQTLKHLEHSINTALSLDAEHYSVYSLIVEPKTVFYNLMQKGRLHLPPQEQEAEMYEIVMSKMEAHGIHQYEISNF
AKAGMESKHNLTYWSNEQYFGFGAGAHGYIGGTRTVNVGPVKHYIDLIAEKGFPYRDTHEVTTEEQIEEEMFLGLRKTAG
VSKKRFAEKYGRSLDGLFPSVLKDLAEKGLIHNSESAVYLTHQGKLLGNEVFGAFLGEL
>P52062 ~~~hemW~~~Heme chaperone HemW~~~COG0635
MVKLPPLSLYIHIPWCVQKCPYCDFNSHALKGEVPHDDYVQHLLNDLDNDVAYAQGREVKTIFIGGGTPSLLSGPAMQTL
LDGVRARLPLAADAEITMEANPGTVEADRFVDYQRAGVNRISIGVQSFSEEKLKRLGRIHGPQEAKRAAKLASGLGLRSF
NLDLMHGLPDQSLEEALGDLRQAIELNPPHLSWYQLTIEPNTLFGSRPPVLPDDDALWDIFEQGHQLLTAAGYQQYETSA
YAKPGYQCQHNLNYWRFGDYIGIGCGAHGKVTFPDGRILRTTKTRHPRGFMQGRYLESQRDVEATDKPFEFFMNRFRLLE
AAPRVEFIAYTGLCEDVIRPQLDEAIAQGYLTECADYWQITEHGKLFLNSLLELFLAE
>Q9CGF7 ~~~hemW~~~Heme chaperone HemW~~~COG0635
MLQKPNSAYFHIPFCSHICYYCDFAKVLMTGQPIDAYIESLIEEFQSFEIEKLRTIYIGGGTPSVLSAQQLERLLTAIAE
QLDLEVLEEFTVEANPGDLSDEVIKVLADSAVNRISLGVQTFNNALLKKIGRTHTEVQVYDSVERLKKAGFENITIDLIY
ALPGQTMEMVKSDVEKFLELKLPHVALYSLILEDHTVFMNRQRRGLLRLPSEDKNADMYEYIMDILAKNGYNHYEVSNFG
LPGFESKHNITYWDNEEYYGIGAGASGYLAGIRYKNLGPVHHYLKAAPTEKRINEEVLSKKSQIEEEMFLGLRKKSGVLV
EKFENKFKCSFEKLYGEQITELINQKLLYNDRQRIHMTDKGFELGNNVFEKFLLDDINF
>P9WP73 ~~~hemW~~~Heme chaperone HemW~~~COG0635
MVFRQAPVELPGLAPMPGQPFGVYLHVPFCLTRCGYCDFNTYTPAQLGGVSPDRWLLALRAELELAAAKLDAPTVHTVYV
GGGTPSLLGGERLATLLDMVRDHFVLAPDAEVSTEANPESTWPEFFATIRAAGYTRVSLGMQSVAPRVLATLDRVHSPGR
AAAAATEAIAEGFTHVNLDLIYGTPGESDDDLVRSVDAAVQAGVDHVSAYALVVEHGTALARRVRRGELAAPDDDVLAHR
YELVDARLSAAGFAWYEVSNWCRPGGECRHNLGYWDGGQWWGAGPGAHGYIGVTRWWNVKHPNTYAEILAGATLPVAGFE
QLGADALHTEDVLLKVRLRQGLPLARLGAAERERAEAVLADGLLDYHGDRLVLTGRGRLLADAVVRTLLG
>P73245 ~~~hemW~~~Heme chaperone HemW~~~COG0635
MNTGTYLMPTAAYIHIPFCRQRCFYCDFPIAVTGFQSLTLDGWVGEYVEAVCREIAGQQHQGQPLQTVFFGGGTPSLLPI
TGLEKILLAVDQYLGIAPDAEISIEIDPGTFDQVQLQGYKNLGINRFSLGVQAFQDNLLALCGRHHRRRDIDQALTAIAK
ENIENWSLDLITGLPEQTAADWHSSLTLALAAGPKHISCYDLVLEPQTVFDKWEQRGKLAVPPPERSADFYRHGQEVLTQ
AGFHHYEISNYGRPGHQCRHNQIYWRNLPYYGLGMGATSYIDGKRFGRPRTRNGYYQWLESWLNQGCPIPGERVSPLENL
LESLMLGLRLTAGVTWAQLPSVNQTEKAKILATLTSFGDRRWLEFYGEDNQMLAPNQTTTETVQRFCFTDPEGILYSNQI
LSALFAALEEDF
>P09127 ~~~hemX~~~Protein HemX~~~COG2959
MTEQEKTSAVVEETREAVDTTSQPVATEKKSKNNTALILSAVAIAIALAAGIGLYGWGKQQAVNQTATSDALANQLTALQ
KAQESQKAELEGIIKQQAAQLKQANRQQETLAKQLDEVQQKVATISGSDAKTWLLAQADFLVKLAGRKLWSDQDVTTAAA
LLKSADASLADMNDPSLITVRRAITDDIASLSAVSQVDYDGIILKLNQLSNQVDNLRLADNDSDGSPMDSDGEELSSSIS
EWRINLQKSWQNFMDNFITIRRRDDTAVPLLAPNQDIYLRENIRSRLLVAAQAVPRHQEETYRQALENVSTWVRAYYDTD
DATTKAFLDEVDQLSQQNISMDLPETLQSQAMLEKLMQTRVRNLLAQPAAGTTEAKPAPAPQADTPAAAPQGE
>P0ACB7 ~~~hemY~~~Protein HemY~~~COG3071
MLKVLLLFVLLIAGIVVGPMIAGHQGYVLIQTDNYNIETSVTGLAIILILAMVVLFAIEWLLRRIFRTGAHTRGWFVGRK
RRRARKQTEQALLKLAEGDYQQVEKLMAKNADHAEQPVVNYLLAAEAAQQRGDEARANQHLERAAELAGNDTIPVEITRV
RLQLARNENHAARHGVDKLLEVTPRHPEVLRLAEQAYIRTGAWSSLLDIIPSMAKAHVGDEEHRAMLEQQAWIGLMDQAR
ADNGSEGLRNWWKNQSRKTRHQVALQVAMAEHLIECDDHDTAQQIIIDGLKRQYDDRLLLPIPRLKTNNPEQLEKVLRQQ
IKNVGDRPLLWSTLGQSLMKHGEWQEASLAFRAALKQRPDAYDYAWLADALDRLHKPEEAAAMRRDGLMLTLQNNPPQ
>Q796V8 1.3.99.-~~~hemZ~~~Oxygen-independent coproporphyrinogen-III oxidase-like protein HemZ~~~COG0635
MQIKIEGIHDDRLHRPLQNIANLFYEECELAYGGEEPADFVISLALSQTDEHVTVSGEVKGTGIKEQHTKFFSPDMTEKE
AFKQVKNTISYVYLNLLQAHTGITQKWGILTGIRPTKLLHKKLQSGMSKEQAHAELKKDYLIHDEKIMLMQEIVDRQLAA
VPDLYRVKDEVSIYIGIPFCPTKCAYCTFPAYAIQGQAGRVGSFLWGLHYEMQKIGEWLKEHDVKVTTIYFGGGTPTSIT
AEEMDLLYEEMVRSFPDVKNIREITVEAGRPDTITEEKLAVLNKYDIDRISINPQSYENETLKAIGRHHTVEETIEKYHL
SRQHGMNNINMDLIIGLPGEGVKEFRHSLSETEKLMPESLTVHTLSFKRASEMTRNKHKYKVAGREEVSQMMEDAVAWTK
EHGYVPYYLYRQKNILGNLENVGYSLPGQESIYNIMIMEEVQTIIGIGCGAASKFIDRDTGKITHFANPKDPKSYNERFE
HYTDEKIKYLEQIFEKTTKQH
>Q05819 4.2.2.7~~~~~~Heparin lyase I~~~
MKKQILYLIVLQQLFLCSAYAQQKKSGNIPYRVNVQADSAKQKAIIDNKWVAVGINKPYALQYDDKLRFNGKPSYRFELK
AEDNSLEGYAAGETKGRTELSYSYATTNDFKKFPPSVYQNAQKLKTVYHYGKGICEQGSSRSYTFSVYIPSSFPDNATTI
FAQWHGAPSRTLVATPEGEIKTLSIEEFLALYDRMIFKKNIAHDKVEKKDKDGKITYVAGKPNGWKVEQGGYPTLAFGFS
KGYFYIKANSDRQWLTDKADRNNANPENSEVMKPYSSEYKTSTIAYKMPFAQFPKDCWITFDVAIDWTKYGKEANTILKP
GKLDVMMTYTKNKKPQKAHIVNQQEILIGRNDDDGYYFKFGIYRVGNSTVPVTYNLSGYSETAR
>P22638 ~~~hepA~~~Heterocyst differentiation ATP-binding protein HepA~~~COG1132
MPKSPHKLFKANSFWKENNLILREIKHFRKIAILAVIFSFLAASFEGVSIGFLLSFLQKLTSPNDPIQTGISWVDMILAA
DAWPIPPIYRISLLILLSTWMRATFNYFGGVYTESAQLNLADRLHKQIFEQLQALRLSYFAQTRSGELINTITTEIERIK
QGFSGLAFVLTRIMTVCVYFVVMFSISWQLSIISVLIFLLLAVGLSTLNKRVRETSFGISHANAQFTAVAVEFINGIRTI
QAFGTQEFERQRFYKASTNQLNAAIKVVLAWTLVKPIAEGIATTVLISLIVISFATFTLPVASLLTFFFVLVRVIPNIQD
INGTVAFLSTLQGSSENIKNILQTNNKPYLKNGKLHFQGLKRSIDLVSVDFGYTADNLVLNNITLTIERGKTTALVGASG
AGKTTLADLIPRFYDPTEGQILVDGLDVQYFEINSLRRKMAVVSQDTFIFNTSIRDNIAYGTSGASEAEIREVARLANAL
QFIEEMPEGFDTKLGDRGVRLSGGQRQRIAIARALLRDPEILILDEATSALDSVSERLIQESIEKLSVGRTVIAIAHRLS
TIAKADKVVVMEQGRIVEQGNYQELLEQRGKLWKYHQMQHESGQTNS
>C6XZB6 4.2.2.7~~~hepB~~~Heparin and heparin-sulfate lyase~~~COG5652
MKRQLYLYVIFVVVELMVFTTKGYSQTKADVVWKDVDGVSMPIPPKTHPRLYLREQQVPDLKNRMNDPKLKKVWADMIKM
QEDWKPADIPEVKDFRFYFNQKGLTVRVELMALNYLMTKDPKVGREAITSIIDTLETATFKPAGDISRGIGLFMVTGAIV
YDWCYDQLKPEEKTRFVKAFVRLAKMLECGYPPVKDKSIVGHASEWMIMRDLLSVGIAIYDEFPEMYNLAAGRFFKEHLV
ARNWFYPSHNYHQGMSYLNVRFTNDLFALWILDRMGAGNVFNPGQQFILYDAIYKRRPDGQILAGGDVDYSRKKPKYYTM
PALLAGSYYKDEYLNYEFLKDPNVEPHCKLFEFLWRDTQLGSRKPDDLPLSRYSGSPFGWMIARTGWGPESVIAEMKVNE
YSFLNHQHQDAGAFQIYYKGPLAIDAGSYTGSSGGYNSPHNKNFFKRTIAHNSLLIYDPKETFSSSGYGGSDHTDFAAND
GGQRLPGKGWIAPRDLKEMLAGDFRTGKILAQGFGPDNQTPDYTYLKGDITAAYSAKVKEVKRSFLFLNLKDAKVPAAMI
VFDKVVASNPDFKKFWLLHSIEQPEIKGNQITIKRTKNGDSGMLVNTALLPDAANSNITSIGGKGKDFWVFGTNYTNDPK
PGTDEALERGEWRVEITPKKAAAEDYYLNVIQIADNTQQKLHEVKRIDGDKVVGVQLADRIVTFSKTSETVDRPFGFSVV
GKGTFKFVMTDLLPGTWQVLKDGKILYPALSAKGDDGALYFEGTEGTYRFLR
>C7EXL6 4.2.2.8~~~hepC~~~Heparin-sulfate lyase~~~
MNKTFKYIVLLALACFVGKANAQELKTEVFSLLNLDYPGLEKVKALHQEGKDADAAKALLDYYRARTNVKTPDINLKKVT
IGKDEQKMADEALQHTFFAHKGYQPSFNYGEDIDWRYWPVKDNELRWQLHRHKWFTPMGKAYRVSGDEKYAVEWTKQYID
WIKKNPLVKVDKKEYEMTGDNQLKGDVENARFAWRPLEVSNRLQDQTSQFQLFLPSPSFTPEFLTEFLVNYHKHAIHILG
IYSAQGNHLLFEAQRMIYAGAFFPEFKEAAAWRKSGIDIMNREINVQVYNDGGQFELDPHYHLAAINIFCKALNIADLYG
FRNEFPQEYLDTIEKMIVFYANVSFPDYTNPCFSDAKLTNKKEMLKNYRNWSKMFPKNQFIKYLATDGKEGALPEYLSKG
FLKSGFFVFRNSWGTDATQMVVKAGPKAFWHCQPDNGTFELWFNGKNLFPDSGSYVYAGEGEVMEQRNWHRQTCVHNTVT
LNNKNLDQTESVTKLWQPEGNVQILVTENPSYKNLKHRRSVFFVDNSYFVIVDEMVGSQKGSINLHYQMPKGEIANSRED
MTFVTQFEEGSNMKLQCFGPEGMTMKKEPGWCSTAYRKRYKRMNVSFNVKKDSEDAVRYITVICPIKNSADAPKLSAKFK
NKTFNENGLEVEVKVNGKKQSLNYKL
>Q89YR9 4.2.2.8~~~hepC~~~Heparin-sulfate lyase~~~COG5434
MKNIFFICFCALFAFSGCADDDDDLLTGGNVDIDLLPDAKPNDVVDPQVFEAINLNYPGLEKVKEFYEAGEHYYAANALL
EYYRTRTNVTNPNLSLINVTISEAEQAKADYALVDYRFHVNNFYEDKETLKPYSVKQDGGINWEYSPKDASDEYQKQLHR
HQWFIPQAKAYRVSGDEKYIQSWIEVYKNWIENNPKPTTGPNTTSWWQLQVSTRIGDQVQLLEYFKNSVNFTPEWLSTFL
VEFAEQADFLVDYPYESGGNILISQANALATAGTLMPEFKNAEKWMNTGYQILSEEVQNQIMSDGWHKEMSLHYHIGIVA
DFYEAMKLAEANQLSSKLPSDFTEPLRKAAEVVMYFTYPNYFIKGSDNVVPMFNDSWSRTRNVLKNTNFKQYVEMFPDSE
ELKYMQTAGNGGTAQGRTPNNDMKLFDQAGYYVLRNGWTPASTVMILSNNKSNDASNSLSAYSHNQPDNGTFELYHNGRN
FFPDSGVCTYYTSGGDNDLRYWFRGIDKHNTLSIGKQNIKKAAGKLLKSEEGATELVVFENQGYDNLKHRRAVFYVNKKF
FVLVDEGIGNAEGTINLSFNLCEGTASEVVMDTDKNGVHTAFSNNNNIIVRTFANKAVTCSPFTGRIAYLVDGAYNTRQS
YTIDMNKSADETARYITVILPVNGSTDTSSISAKFIDSGYSENSASVEVSVNGETHTLSYTL
>Q59289 4.2.2.8~~~hepC~~~Heparin-sulfate lyase~~~COG5434
MTTKIFKRIIVFAVIALSSGNILAQSSSITRKDFDHINLEYSGLEKVNKAVAAGNYDDAAKALLAYYREKSKAREPDFSN
AEKPADIRQPIDKVTREMADKALVHQFQPHKGYGYFDYGKDINWQMWPVKDNEVRWQLHRVKWWQAMALVYHATGDEKYA
REWVYQYSDWARKNPLGLSQDNDKFVWRPLEVSDRVQSLPPTFSLFVNSPAFTPAFLMEFLNSYHQQADYLSTHYAEQGN
HRLFEAQRNLFAGVSFPEFKDSPRWRQTGISVLNTEIKKQVYADGMQFELSPIYHVAAIDIFLKAYGSAKRVNLEKEFPQ
SYVQTVENMIMALISISLPDYNTPMFGDSWITDKNFRMAQFASWARVFPANQAIKYFATDGKQGKAPNFLSKALSNAGFY
TFRSGWDKNATVMVLKASPPGEFHAQPDNGTFELFIKGRNFTPDAGVFVYSGDEAIMKLRNWYRQTRIHSTLTLDNQNMV
ITKARQNKWETGNNLDVLTYTNPSYPNLDHQRSVLFINKKYFLVIDRAIGEATGNLGVHWQLKEDSNPVFDKTKNRVYTT
YRDGNNLMIQSLNADRTSLNEEEGKVSYVYNKELKRPAFVFEKPKKNAGTQNFVSIVYPYDGQKAPEISIRENKGNDFEK
GKLNLTLTINGKQQLVLVP
>Q5IW40 1.13.11.72~~~hepD~~~2-hydroxyethylphosphonate dioxygenase~~~
MRIDPFKLAHWMNARKYTAAQTADLAGLPLDDLRRLLGDEANEPDPAAATALAEALSVEPSQLAADAHRNLTVVHKSAEE
MHASRRPIQRDGIHFYNYYTLAAPEGRVAPVVLDILCPSDRLPALNNGHLEPAITVNLGPGDINGRWGEEITPQTWRVLH
ANHGGDRWITGDSYVEPSYCPHSYSLAGDAPARIVSYTAQSNISPLMTEANNWSTGAFEEALKALSGKVSAGSVLDLFLA
RRAHTRTSAAEAAGVPPADLEAALRSPASETGLTVLRTLGRALGFDYRVLLPADDQHDGVGKTWTTIEDSRRSRRTFGTY
EAASMASAAHLPDLVGSFLRVDADGRGADLIDHAENHYVVTEGRLTLEWDGPDGPASVELEPDGSAWTGPFVRHRWHGTG
TVLKFGSGAHLGYQDWLELTNTFEPAATLRRGRRDLAGWGYDN
>Q7WF17 2.7.7.70~~~~~~D-beta-D-heptose 1-phosphate adenylyltransferase~~~COG0615
MSSARFESKILSRAELVAAVAAGRLPRPLVFTNGVFDILHRGHVTYLDQAAQLGATLVVAVNTDESVRRLGKGSDRPLNQ
VQDRAALLAALGCVDAVTSFHEDTPQELIGELRPDLIVKGGDYDMDTLPETALVKSWGGRAVAIPFDFERSTTALLGKIR
QG
>Q7WGU8 2.7.1.167~~~rfaE~~~D-beta-D-heptose 7-phosphate kinase~~~COG2870
MNQYPAERIARARVLVVGDVMLDRYWFGEVDRISPEAPVPVVRVARREDRLGGAANVARNVAALGAQVTLIGVVGADEVG
HRIERMAAEEGVRTDLVSDTEHPTTLKMRVLGRQQQLLRVDFEQHPEPAALDGISAAVARQLAQHDIVVLSDYAKGVLDR
VESIIAAAVGHSLPVLVDPKGDHYERYRGATLVTPNRAEMREAVGRWKTEDELAERAQRLRLDLDLEALLVTRSEQGMTL
FTDAGRDHADAAAHEVYDVSGAGDTVLATLAVMRAVGLSWGDAMRWANRAGGIVVGKLGTSVVTAAELAGEST
>P31112 2.5.1.30~~~hepS~~~Heptaprenyl diphosphate synthase component 1~~~COG0142
MQDIYGTLANLNTKLKQKLSHPYLAKHISAPKIDEDKLLLFHALFEEADIKNNDRENYIVTAMLVQSALDTHDEVTTARV
IKRDENKNRQLTVLAGDYFSGLYYSLLSEMKDIYMIRTLATAIKEINEHKIRLYDRSFKDENDFFESVGIVESALFHRVA
EHFNLPRWKKLSSDFFVFKRLMNGNDAFLDVIGSFIQLGKTKEEILEDCFKKAKNSIESLLPLNSPIQNILINRLKTISQ
DQTYHQKVEEG
>P31114 2.5.1.30~~~hepT~~~Heptaprenyl diphosphate synthase component 2~~~COG0142
MLNIIRLLAESLPRISDGNENTDVWVNDMKFKMAYSFLNDDIDVIERELEQTVRSDYPLLSEAGLHLLQAGGKRIRPVFV
LLSGMFGDYDINKIKYVAVTLEMIHMASLVHDDVIDDAELRRGKPTIKAKWDNRIAMYTGDYMLAGSLEMMTRINEPKAH
RILSQTIVEVCLGEIEQIKDKYNMEQNLRTYLRRIKRKTALLIAVSCQLGAIASGADEKIHKALYWFGYYVGMSYQIIDD
ILDFTSTEEELGKPVGGDLLQGNVTLPVLYALKNPALKNQLKLINSETTQEQLEPIIEEIKKTDAIEASMAVSEMYLQKA
FQKLNTLPRGRARSSLAAIAKYIGKRKF
>A0A0B0QJR1 3.1.27.-~~~hepT~~~tRNA nuclease HepT~~~
MTNIEPVIIETRLELIGRYLDHLKKFENISLDDYLSSFEQQLITERLLQLITQAAIDINDHILSKLKSGKSYTNFEAFIE
LGKYQILTPELAKQIAPSSGLRNRLVHEYDDIDPNQVFMAISFALQQYPLYVRQINSYLITLEEEND
>P43934 3.1.-.-~~~hepT~~~Probable ribonuclease HepT~~~COG1708
MMTDKLNLNVLDAAFYSLEQTVVQISDRNWFDMQPSIVQDTLIAGAIQKFEFVYELSLKMMKRQLQQDAINTDDIGAYGF
KDILREALRFGLIGDMSKWVAYRDMRNITSHTYDQEKAMAVYAQIDDFLIESSFLLEQLRQRNQYD
>Q8ECH6 3.1.-.-~~~hepT~~~mRNA nuclease HepT~~~COG2445
MNDIIINKIATIKRCIKRIQQVYGDGSQFKQDFTLQDSVILNLQRCCEACIDIANHINRQQQLGIPQSSRDSFTLLAQNN
LITQPLSDNLKKMVGLRNIAVHDYQELNLDIVVHVVQHHLEDFEQFIDVIKAE
>P46049 ~~~hesA1~~~Protein HesA, heterocyst~~~COG0476
MINLTPTELERYSRQMMLPNFGEAAQKRLKSATVLVTGVGGLGGTAALYLAVAGVGRLILVRGGDLRLDDMNRQVLMTDD
WVGKPRVFKAKETLQAINPDIQIETIHDYVTSDNVDSLVQSADMALDCAHNFTERDLLNSACVRWRKPMVEAAMDGMEAY
LTTIIPGVTPCLSCIFPEKPEWDRRGFSVLGAVSGTLACLTALEAIKLITGFSQPLLSQLLTIDLNRMEFAKRRLYRDRS
CPVCGNDAPWRYAQSNSMETSSNCTHS
>P46048 ~~~hesA2~~~Protein HesA, vegetative~~~COG0476
MVNLTSTELERYRRQIILPGFGQEAQQRLKSATVLVTGVGGLGGTAALYLAIAGVGRLILVRGGELRLDDMNRQILMSDD
WVGKPRVFKAKKRLEDINPDVEVEAIFDYVTPDNVDSLIQSADVALDCAHNFGERDLLNAACVRWRKPMVEAAMDGMDAY
LTTIIPGVTPCLSCLFPEKPEWDRRGFGVLGAVSGTLACLTALEAMKLITGFSQPLSSELLTMNLHQLTFAKRRSYRDRN
CPVCGTHSQHYPHPQQLSRVLVNSQ
>P46051 ~~~hesB1~~~Protein HesB, heterocyst~~~COG0316
MTVTLTEKAEFRLRAFLRGSAKDANETTKGIRISVKDGGCSGYEYLMDVTSQPQPDDLVSQQGSVLVYVDAKSAPLLEGI
VIDFVEGLVESGFKFTNPNATSTCGCGKSFKAGDCSPEGVPCS
>P46052 ~~~hesB2~~~Protein HesB, vegetative~~~COG0316
MTLTLTEAAEFRLRTFLLSFSKDENSTQRGIRVAVEDGGCSGYQYSIKIINAPQADDMVLQQGKLRIYVDSQSAPLLEGV
VVDFVDGLLESGFKFSNPNATDTCGCGKSFQAGNCSPAGVPCS
>Q9AH77 3.4.21.-~~~hetR~~~DNA-binding transcriptional activator HetR~~~
MSNDIDLIKRLDPSAMDQIMLYLAFSAMRTSGHRHGAFLDAAATAAKCAIYMTYLEQGQNLRMTGHLHHLEPKRVKIIVE
EVRQALTEGKLLKMLGSQEPRYLIQLPYVWLEKYPWQPGRSRVPGTSLTSEEKRQIEQKLPSNLPDAQLVSSFEFLDLIE
FLHRRSQEDLPTEHQMPLSEALGEHIKRRLLYSGTVTRIDSPWGMPFYALTRPFYAPADDQERTYIMVEDTARYFRMMKN
WAERRRNAMRLLEELDILPEKMEQAMEELDEIIRAWADKYHQDGGIAVVLQTVFGEKED
>P27709 3.4.21.-~~~hetR~~~DNA-binding transcriptional activator HetR~~~
MSNDIDLIKRLGPSAMDQIMLYLAFSAMRTSGHRHGAFLDAAATAAKCAIYMTYLEQGQNLRMTGHLHHLEPKRVKIIVE
EVRQALMEGKLLKTLGSQEPRYLIQFPYVWMEQYPWIPGRSRIPGTSLTSEEKRQIEHKLPSNLPDAQLVTSFEFLELIE
FLHKRSQEDLPPEHRMELSEALAEHIKRRLLYSGTVTRIDSPWGMPFYALTRPFYAPADDQERTYIMVEDTARYFRMMKD
WAEKRPNAMRALEELDVPPERWDEAMQELDEIIRTWADKYHQVGGIPMILQMVFGRKED
>P96155 3.2.1.52~~~exoI~~~Beta-hexosaminidase~~~
MNYRIDFAVLSEHPQFCRFGLTLHNLSDQDLKAWSLHFTIDRYIQPDSISHSQIHQVGSFCSLTPEQDVINSNSHFYCEF
SIKTAPFPFHYYTDGIKAAFVQINDVEPRVRHDVIVTPIALASPYRERSEIPATDAATLSLLPKPNHIERLDGEFALTAG
SQISLQSSCAETAATWLKQELTHLYQWQPHDIGSADIVLRTNPTLDEGAYLLSVDRKPIRLEASSHIGFVHASATLLQLV
RPDGDNLLVPHIVIKDAPRFKYRGMMLDCARHFHPLERVKRLINQLAHYKFNTFHWHLTDDEGWRIEIKSLPQLTDIGAW
RGVDEVLEPQYSLLTEKHGGFYTQEEIREVIAYAAERGITVIPEIDIPGHSRAAIKALPEWLFDEDDQSQYRSIQYYNDN
VLSPALPGTYRFLDCVLEEVAALFPSHFIHIGADEVPDGVWVNSPKCQALMAEEGYTDAKELQGHLLRYAEKKLKSLGKR
MVGWEEAQHGDKVSKDTVIYSWLSEQAALNCARQGFDVILQPGQFTYLDIAQDYAPEEPGVDWAGVTPLERAYRYEPLVE
VPEHDPLRKRILGIQCALWCELVNNQDRMDYMIYPRLTALAGSGLDTKIPA
>Q7WUL4 3.2.1.52~~~~~~Beta-N-acetylhexosaminidase~~~
MPDVAVIPRPVLLETTDGPPFVLTAATILVVDSAPELVAVGVLAADLLGRLSGRPVEVRYTEGGAPSVVRLRLSEDLPAG
DEAYRLVVSEHRVDIDARSAAGLVRAVVTLRQTVSSLGDGTLTVPALRVEDHPRYAWRGLSIDVARHFFTVDDLKAIIGL
LAHYKLNVLHLHLTDDQGWRVHLPSRPHLTRASAGTSVGGGPGGFYNPAQLAEIVVARAARGIRVVPEIDVPGHVNAATH
AYGDLTPSGEPTDVYTGIEVGFSRLHDDLPATRPFLRDVFTDLAAMTPGEYVHIGGDEVLTMDHDKYARLVGYAASVVRD
AGKKVVGWQEISSTPLEPGTVVQYWDINADPAPFVAAAQAGAHVLMSPGSRAYLDMKYDATTELGLEWAGHIELRDAYDW
EPSTLIPGVPPESVIGVEAAVWTETLTDLGELTSMLLPRLAAVAEVAWTAPQDRDWDDFSGRVAQHAPFWDRVGFRWHAS
PQVSWPGPGSAPGAAF
>O66127 2.5.1.83~~~hexs-a~~~Hexaprenyl-diphosphate synthase small subunit ((2E,6E)-farnesyl-diphosphate specific)~~~
MRYLHKIELELNRLTSRYPFFKKIAFDAEIIKLVDDLNVDENVKCAIVAIDTSMRMQDFINEDNKDSFVLSTDVLSALFY
KYLSQPFYQHDFLVLTDCVSRINELKSIRATITDEIALHNINKQIHYMFIQPYMNNEKVVSYE
>P48823 3.2.1.52~~~~~~Beta-hexosaminidase A~~~
MSFITSAHATAAQVPLTTSQMLGQKLMLDFRYYCGESKKPSGDCRAAMTTLPPELSELISRYDIGGAILFAENVQNTAQI
ISLTNALQSAAQQSKSQLPLFIAIDQEGGRVARINREQATSFTGNMSIGATYPKQGDIYATKVASAIGKELNSLGINVNF
APTVDVNSNPNNPVINVRSFSENPTVVTKLGLAQVKAFEAAGVLSALKHFPGHGDTHVDSHTGLPRVDHDRDKINQQDLL
PFAEIIKASPPGMIMTAHIQYPALDNSKVVNSQGESMIRPATMSYQIMTQLLRHELGYQGVTVTDALDMAGISDFFNPVD
ATIETFNAGVDIALMPIAIRNRADIKRFEQYMAQLADALETNKLNQEQLSSSMARIAKLKTKLPQSSASLAIANSTLGNP
SHRRLEAELALAAITEVKNDGVLPLRDNAQVVHLIMPDRQKCFALEQALQTYSKNSLTLSCTSLQAYDPDIAHDAIKQAD
MIIAAHASPPQSAVEIGGMDDVKKLREHGVARNVQPAALKALLQYGQQQGKKQLFISLRAPYEISTFGPLSNAVLASYAY
NVDVNHDKKVAGPAYTALAKVILGIAKAEGSLPVTVNH
>O66129 2.5.1.83~~~hexs-b~~~Hexaprenyl-diphosphate synthase large subunit ((2E,6E)-farnesyl-diphosphate specific)~~~
MIALSYKAFLNPYIIEVEKRLYECIQSDSETINKAAHHILSSGGKRVRPMFVLLSGFLNDTQKDDLIRTAVSLELVHMAS
LVHDDYIDNSDMRRGNTSVHIAFDKDTAIRTGHFLLARALQNIATINNSKFHQIFSKTILEVCFGEFDQMADRFNYPVSF
TAYLRRINRKTAILIEASCHLGALSSQLDEQSTYHIKQFGHCIGMSYQIIDDILDYTSDEATLGKPVGSDIRNGHITYPL
MAAIANLKEQDDDKLEAVVKHLTSTSDDEVYQYIVSQVKQYGIEPAELLSRKYGDKAKYHLSQLQDSNIKDYLEEIHEKM
LKRVY
>A3RXB7 1.1.3.29~~~~~~N-acetyl-D-hexosamine oxidase~~~
MTLDVSRQDPRYNTLKHGFNLRWPSTDAQAAGRIALCEKADDVAPALQHIIDTGMRPTVRSGGHCYEDFVSNNPDGAIVD
LSLLNAPEVRADGTVRIPAGTQNWNGYLELYKRHNLTLPGGSCYSVGAGGHICGGGYGLLSRLQGLTVDWLSAVDIVTVD
RQGRAAPRTVDATRDPELFRACRGAGGGNFGIITAYTFARLPEAPREVALATVAFDWAAMTPERFAELLRLYGEYWETRG
KDPDTWGMFSLLKLTHRSAGQIVMLTQFCNPDGTCRDLSVLNDFLARFRACAPVPLKGRPPGYGPAHRQGVGQLLCSKPH
TVVRYDWLTATQTVNGSGPNQRGKYKSAYMKRGFTAREAQRIYTHLTRTVPGIDLSQSLLQVDSYGGAVNKTERIADTAV
PQRASVMKLQYQTYWTSAADDAGHLRWIGDFYRDVYGTPDVSAPHAGTPYPGDRYEGCYINYPDVDMLAYPFWPQLYYGD
GDLYAFLQRVKRRYDPNNIFHHAMSVRP
>Q88P32 ~~~hexR~~~HTH-type transcriptional regulator HexR~~~COG1737
MRNLLEQIQGRLDELNKAERKVAEVILLNPQQATRFSIAALAQAAKVSEPTVNRFCRSFGVSGYPELKLQLAQSLASGAA
YVSRAVEADDDPAAYTQKIFASAIASLDSACQQLDPQQVSRAVDMMIQARQIHFFGLGASAPVALDAQHKFFRFNLAVSA
HADVLMQRMLASVAHTGDLFVIISYTGRTRELVEVARLARENGASVLGLTAAGSPLANACSLSLHIPLPEDTDIYMPMTS
RIIQLTVLDVLATGMTLRRGVDFQPHLRKIKESLNASRYPIEDDDLN
>Q8A7C8 3.1.6.-~~~~~~Delta 4,5-hexuronate-2-O-sulfatase~~~COG3119
MGLALCGAAAQAQEKPNFLIIQCDHLTQRVVGAYGQTQGCTLPIDEVASRGVIFSNAYVGCPLSQPSRAALWSGMMPHQT
NVRSNSSEPVNTRLPENVPTLGSLFSESGYEAVHFGKTHDMGSLRGFKHKEPVAKPFTDPEFPVNNDSFLDVGTCEDAVA
YLSNPPKEPFICIADFQNPHNICGFIGENAGVHTDRPISGPLPELPDNFDVEDWSNIPTPVQYICCSHRRMTQAAHWNEE
NYRHYIAAFQHYTKMVSKQVDSVLKALYSTPAGRNTIVVIMADHGDGMASHRMVTKHISFYDEMTNVPFIFAGPGIKQQK
KPVDHLLTQPTLDLLPTLCDLAGIAVPAEKAGISLAPTLRGEKQKKSHPYVVSEWHSEYEYVTTPGRMVRGPRYKYTHYL
EGNGEELYDMKKDPGERKNLAKDPKYSKILAEHRALLDDYITRSKDDYRSLKVDADPRCRNHTPGYPSHEGPGAREILKR
K
>P0ABC3 ~~~hflC~~~Modulator of FtsH protease HflC~~~COG0330
MRKSVIAIIIIVLVVLYMSVFVVKEGERGITLRFGKVLRDDDNKPLVYEPGLHFKIPFIETVKMLDARIQTMDNQADRFV
TKEKKDLIVDSYIKWRISDFSRYYLATGGGDISQAEVLLKRKFSDRLRSEIGRLDVKDIVTDSRGRLTLEVRDALNSGSA
GTEDEVTTPAADNAIAEAAERVTAETKGKVPVINPNSMAALGIEVVDVRIKQINLPTEVSEAIYNRMRAEREAVARRHRS
QGQEEAEKLRATADYEVTRTLAEAERQGRIMRGEGDAEAAKLFADAFSKDPDFYAFIRSLRAYENSFSGNQDVMVMSPDS
DFFRYMKTPTSATR
>P25746 ~~~hflD~~~High frequency lysogenization protein HflD~~~COG2915
MAKNYYDITLALAGICQSARLVQQLAHQGHCDADALHVSLNSIIDMNPSSTLAVFGGSEANLRVGLETLLGVLNASSRQG
LNAELTRYTLSLMVLERKLSSAKGALDTLGNRINGLQRQLEHFDLQSETLMSAMAAIYVDVISPLGPRIQVTGSPAVLQS
PQVQAKVRATLLAGIRAAVLWHQVGGGRLQLMFSRNRLTTQAKQILAHLTPEL
>P0ABC7 ~~~hflK~~~Modulator of FtsH protease HflK~~~COG0330
MAWNQPGNNGQDRDPWGSSKPGGNSEGNGNKGGRDQGPPDLDDIFRKLSKKLGGLGGGKGTGSGGGSSSQGPRPQLGGRV
VTIAAAAIVIIWAASGFYTIKEAERGVVTRFGKFSHLVEPGLNWKPTFIDEVKPVNVEAVRELAASGVMLTSDENVVRVE
MNVQYRVTNPEKYLYSVTSPDDSLRQATDSALRGVIGKYTMDRILTEGRTVIRSDTQRELEETIRPYDMGITLLDVNFQA
ARPPEEVKAAFDDAIAARENEQQYIREAEAYTNEVQPRANGQAQRILEEARAYKAQTILEAQGEVARFAKLLPEYKAAPE
ITRERLYIETMEKVLGNTRKVLVNDKGGNLMVLPLDQMLKGGNAPAAKSDNGASNLLRLPPASSSTTSGASNTSSTSQGD
IMDQRRANAQRNDYQRQGE
>Q9Z873 ~~~hflX~~~GTPase HflX~~~COG2262
MDTIDTPGEQGSQSFGNSLGARFDLPRKEQDPSQALAVASYQNKTDSQVVEEHLDELISLADSCGISVLETRSWILKTPS
ASTYINVGKLEEIEEILKEFPSIGTLIIDEEITPSQQRNLEKRLGLVVLDRTELILEIFSSRALTAEANIQVQLAQARYL
LPRLKRLWGHLSRQKSGGGSGGFVKGEGEKQIELDRRMVRERIHKLSAQLKAVIKQRAERRKVKSRRGIPTFALIGYTNS
GKSTLLNLLTAADTYVEDKLFATLDPKTRKCVLPGGRHVLLTDTVGFIRKLPHTLVAAFKSTLEAAFHEDVLLHVVDASH
PLALEHVQTTYDLFQELKIEKPRIITVLNKVDRLPQGSIPMKLRLLSPLPVLISAKTGEGIQNLLSLMTEIIQEKSLHVT
LNFPYTEYGKFTELCDAGVVASSRYQEDFLVVEAYLPKELQKKFRPFISYVFPEDCGDDEGRGPVLESSFGD
>P25519 ~~~hflX~~~GTPase HflX~~~COG2262
MFDRYDAGEQAVLVHIYFTQDKDMEDLQEFESLVSSAGVEALQVITGSRKAPHPKYFVGEGKAVEIAEAVKATGASVVLF
DHALSPAQERNLERLCECRVIDRTGLILDIFAQRARTHEGKLQVELAQLRHLATRLVRGWTHLERQKGGIGLRGPGETQL
ETDRRLLRNRIVQIQSRLERVEKQREQGRQSRIKADVPTVSLVGYTNAGKSTLFNRITEARVYAADQLFATLDPTLRRID
VADVGETVLADTVGFIRHLPHDLVAAFKATLQETRQATLLLHVIDAADVRVQENIEAVNTVLEEIDAHEIPTLLVMNKID
MLEDFEPRIDRDEENKPNRVWLSAQTGAGIPQLFQALTERLSGEVAQHTLRLPPQEGRLRSRFYQLQAIEKEWMEEDGSV
SLQVRMPIVDWRRLCKQEPALIDYLI
>O66512 ~~~hfq~~~RNA-binding protein Hfq~~~COG1923
MPYKLQESFLNTARKKRVKVSVYLVNGVRLQGRIRSFDLFTILLEDGKQQTLVYKHAITTIVPHERLEIEFEEAGVPGQG
>O31796 ~~~hfq~~~RNA-binding protein Hfq~~~COG1923
MKPINIQDQFLNQIRKENTYVTVFLLNGFQLRGQVKGFDNFTVLLESEGKQQLIYKHAISTFAPQKNVQLELE
>Q2YPW9 ~~~hfq~~~RNA-binding protein Hfq~~~
MAERSQNLQDLFLNSVRKQKISLTIFLINGVKLTGIVTSFDNFCVLLRRDGHSQLVYKHAISTIMPSQPVQMFEGEEA
>Q9A7H8 ~~~hfq~~~RNA-binding protein Hfq~~~COG1923
MSAEKKQNLQDTFLNSVRKSKTPLTIFLVNGVKLQGVVSWFDNFCVLLRRDGQSQLVYKHAISTIMPAQPVQLYEPSADA
DD
>P0A6X3 ~~~hfq~~~RNA-binding protein Hfq~~~COG1923
MAKGQSLQDPFLNALRRERVPVSIYLVNGIKLQGQIESFDQFVILLKNTVSQMVYKHAISTVVPSRPVSHHSNNAGGGTS
SNYHHGSSAQNTSAQQDSEETE
>P64345 ~~~hfq~~~RNA-binding protein Hfq~~~
MTAKGQMLQDPFLNALRKEHVPVSIYLVNGIKLQGQVESFDQYVVLLRNTSVTQMVYKHAISTIVPARSVNLQHENRPQA
APTSTLVQVETVQQPAE
>A1KT11 ~~~hfq~~~RNA-binding protein Hfq~~~
MTAKGQMLQDPFLNALRKEHVPVSIYLVNGIKLQGQVESFDQYVVLLRNTSVTQMVYKHAISTIVPARSVNLQHENRPQA
APASTLVQVETVQQPAE
>A6VD57 ~~~hfq~~~RNA-binding protein Hfq~~~
MSKGHSLQDPYLNTLRKERVPVSIYLVNGIKLQGQIESFDQFVILLKNTVSQMVYKHAISTVVPSRPVRLPSGDQPAEPG
NA
>Q9HUM0 ~~~hfq~~~RNA-binding protein Hfq~~~
MSKGHSLQDPYLNTLRKERVPVSIYLVNGIKLQGQIESFDQFVILLKNTVSQMVYKHAISTVVPSRPVRLPSGDQPAEPG
NA
>B3EWP0 ~~~hfq~~~RNA-binding protein Hfq~~~COG1923
MSKGHSLQDPYLNTLRKERVPVSIYLVNGIKLQGQIESFDQFVILLKNTVSQMVYKTAISTVVPSRPVRLPSGDQPAEPG
NA
>P0A1R0 ~~~hfq~~~RNA-binding protein Hfq~~~
MAKGQSLQDPFLNALRRERVPVSIYLVNGIKLQGQIESFDQFVILLKNTVSQMVYKHAISTVVPSRPVSHHSNNAGGGAS
NNYHHGSNAQGSTAQQDSEETE
>Q9WYZ6 ~~~hfq~~~RNA-binding protein Hfq~~~COG1923
MALAEKFNLQDRFLNHLRVNKIEVKVYLVNGFQTKGFIRSFDSYTVLLESGNQQSLIYKHAISTIIPSSYVMLMPKKQET
AQEAETSENEGS
>Q9KV11 ~~~hfq~~~RNA-binding protein Hfq~~~COG1923
MAKGQSLQDPFLNALRRERIPVSIYLVNGIKLQGQIESFDQFVILLKNTVNQMVYKHAISTVVPARPVSHHSGDRPASDR
PAEKSEE
>Q47952 ~~~hgbA~~~Hemoglobin and hemoglobin-haptoglobin-binding protein~~~COG1629
MKANKLSAITLCILGYAHTVYAESNMQTEKLETIVVSSEDDSVHNKNVGEIKKNAKALSKQQVQDSRDLVRYETGVTVVE
KGRFGSSGYAIRGVDENRVAVVVDGLHQAETISSQGFKELFEGYGNFNNTRNGVEVENLKQAVIQKGADAIRTGSGSLGG
TVSFESKDARDYLIDKNYHFGYKTGYSSADNQKLHSVTAAGRYSDFDLLAVHTQRHGNELRNYGYRHYDGSVVRKEREKA
DPYKITKQSSLIKIGYQLNDTNRFTLGYDDSRNTSRGTDWSNAFTSYNGGPFLKDVRHTNDQSNRKNISFVYENFDTNDF
WDTLKITHNHQKIKLKARLDEYCDVNGEIDCPAIANPSGLYINDKGIFLDKHDGEITHKKEGEFNNYFDSKGKEVRVKGF
NVDSILINCDQYDCSKPMQLLSSTNNGYGGSPNKYIYKTYELFEKTMNNGNGKYAVLEIRSSGHEKFSRVYLPSEKGYVE
NQWKDRDLNTDTQQYNIDLTKSFKLKSVEHNATYGGLYSEVKKSMTNRAGYEAYNRQWWANIFFGKENNKPNKCQPYNGN
SFTTLCSHEDRLFSFLIPVKTKTGALYVTDKIKLNDKVNLDVAYRYDRIKHDPKYIPGTTPKLPTDLILGRFIEFKPKNT
YATQDEKNENAEKNAVYLASKKTKFSANSYSATFSFDPMDFLKIQAKYATGFRAPTSDEIYFVFQHPSFSIYPNLYLKAE
RSKNKEVAITLHKQKSFLTVNLFQTDYKDFLDLAYLKKGSLPYGNGGSQLETLLYQNVNRDKARVKGLEVNSKLHLGDVW
RTLDGFNLSYKLSLQKGRMSSKVGEEGKQRDTNKLDTPMNAIQPQTHVVGVGYEHPQEKFGVDMYLTHASAKKEKDTFNM
FYDGKDQKDQHIKWRSDRYTLVDLIAYVKPVKNVTLRAGVYNLTNREYGTWDSIRSIRPFGTTNLINQETGKGIKRFNAP
GRNFRVNAEITF
>P11569 4.2.1.167~~~hgdA~~~(R)-2-hydroxyglutaryl-CoA dehydratase, subunit alpha~~~COG1775
MPKTVSPGVQALRDVVEKVYRELREAKERGEKVGWSSSKFPCELAESFGLHVGYPENQAAGIAANRDGEVMCQAAEDIGY
DNDICGYARISLAYAAGFRGANKMDKDGNYVINPHSGKQMKDANGKKVFDADGKPVIDPKTLKPFATTDNIYEIAALPEG
EEKTRRQNALHKYRQMTMPMPDFVLCCNNICNCMTKWYEDIARRHNIPLIMIDVPYNEFDHVNEANVKYIRSQLDTAIRQ
MEEITGKKFDEDKFEQCCQNANRTAKAWLKVCDYLQYKPAPFNGFDLFNHMADVVTARGRVEAAEAFELLAKELEQHVKE
GTTTAPFKEQHRIMFEGIPCWPKLPNLFKPLKANGLNITGVVYAPAFGFVYNNLDELVKAYCKAPNSVSIEQGVAWREGL
IRDNKVDGVLVHYNRSCKPWSGYMPEMQRRFTKDMGIPTAGFDGDQADPRNFNAAQYETRVQGLVEAMEANDEKKGK
>P11570 4.2.1.167~~~hgdB~~~(R)-2-hydroxyglutaryl-CoA dehydratase, subunit beta~~~COG1775
MAISALIEEFQKVSASPKTMLAKYKAQGKKAIGCLPYYVPEELVYAAGMVPMGVWGCNGKQEVRSKEYCASFYCTIAQQS
LEMLLDGTLDGLDGIITPVLCDTLRPMSQNFKVAMKDKMPVIFLAHPQVRQNAAGKQFTYDAYSEVKGHLEEICGHEITN
DAILDAIKVYNKSRAARREFCKLANEHPDLIPASVRATVLRAAYFMLKDEYTEKLEELNKELAAAPAGKFDGHKVVVSGI
IYNMPGILKAMDDNKLAIAADDCAYESRSFAVDAPEDLDNGLQALAVQFSKQKNDVLLYDPEFAKNTRSEHVCNLVKESG
AEGLIVFMMQFCDPEEMEYPDLKKALDAHHIPHVKIGVDQMTRDFGQAQTALEAFAESL
>P11568 3.6.1.-~~~hgdC~~~(R)-2-hydroxyglutaryl-CoA dehydratase activating ATPase~~~COG1924
MSIYTLGIDVGSTASKCIILKDGKEIVAKSLVAVGTGTSGPARSISEVLENAHMKKEDMAFTLATGYGRNSLEGIADKQM
SELSCHAMGASFIWPNVHTVIDIGGQDVKVIHVENGTMTNFQMNDKCAAGTGRFLDVMANILEVKVSDLAELGAKSTKRV
AISSTCTVFAESEVISQLSKGTDKIDIIAGIHRSVASRVIGLANRVGIVKDVVMTGGVAQNYGVRGALEEGLGVEIKTSP
LAQYNGALGAALYAYKKAAK
>D2RJU7 1.1.1.399~~~hgdH~~~(R)-2-hydroxyglutarate dehydrogenase~~~COG1052
MKVLCYGVRDVELPIFEACNKEFGYDIKCVPDYLNTKETAEMAAGFDAVILRGNCFANKQNLDIYKKLGVKYILTRTAGT
DHIDKEYAKELGFPMAFVPRYSPNAIAELAVTQAMMLLRHTAYTTSRTAKKNFKVDAFMFSKEVRNCTVGVVGLGRIGRV
AAQIFHGMGATVIGEDVFEIKGIEDYCTQVSLDEVLEKSDIITIHAPYIKENGAVVTRDFLKKMKDGAILVNCARGQLVD
TEAVIEAVESGKLGGYGCDVLDGEASVFGKDLEGQKLENPLFEKLVDLYPRVLITPHLGSYTDEAVKNMVEVSYQNLKDL
AETGDCPNKIK
>Q88E47 1.13.11.5~~~hmgA~~~Homogentisate 1,2-dioxygenase~~~COG3508
MNRDTSPDLHYLSGFGNEFASEALPGALPVGQNSPQKAPYGLYAELLSGTAFTMARSELRRTWLYRIRPSALHPRFERLA
RQPLGGPLGGINPNRLRWSPQPIPAEPTDFIEGWLPMAANAGAEKPAGVSIYIYRANRSMERVFFNADGELLLVPEQGRL
RIATELGVMEVEPLEIAVIPRGMKFRVELLDGQARGYIAENHGAPLRLPDLGPIGSNGLANPRDFLTPVAHYEEAEGPVQ
LVQKFLGEHWACELQHSPLDVVAWHGSNVPYKYDLRRFNTIGTVSFDHPDPSIFTVLTSPTSVHGMANMDFVIFPPRWMV
AENTFRPPWFHRNLMNEFMGLINGAYDAKAEGFLPGGASLHGVMSAHGPDAETCEKAIAADLAPHKIDNTMAFMFETSQV
LRPSLQALECPQLQADYDSCWATLPSTFNPNRR
>Q6EMI9 1.13.11.5~~~hmgA~~~Homogentisate 1,2-dioxygenase~~~COG3508
MTRDTSPDLHYLSGFGNEFASEALPGALPVGQNSPQRAPYGLYAELLSGTAFTMARSELRRTWLYRIRPSALHPRFERLA
RQPLTAPLGAINPNRLRWSPQPIPAEPTDFIEGWLPMVANAPAQKPAGVSIYIYCANRSMERVFFNADGELLLVPEQGRL
RIATELGVMEVGPLEIAVIPRGMKFRVELLDGQARGYIAENHGAPLRIPELGPIGSNGLANPRDFLTPVAHYEEAEGPVQ
LVQKFLGEHWACELQHSPLDVVAWHGSNVPYKYDLRRFNTIGTVSFDHPDPSIFTVLTSPTSVHGLANMDFVIFPPRWMV
AENTFRPPWFHRNLMNEFMGLISGAYDAKAEGFLPGGASLHGVMSAHGPDAETCEKAIAADLAPHKIDNTMAFMFETSQV
LRPSQQALECPQLQADYDSCWATLPSTFNPNRR
>Q9X4F5 1.13.11.5~~~hmgA~~~Homogentisate 1,2-dioxygenase~~~COG3508
MLEKAEKQRRAGSGQQRAAGYMPGFGNDFETESLPGALPQGQNSPQKCNYGLYAEQLSGSPFTAPRGTNERSWLYRIRPS
VRHTGRFRRVDYPHWKTAPHVGEHSLALGQLRWSPLPAPSEALDFLQGIRTMTTAGDALTQAGMAAHAYAFNADMVDDYF
FNADGELLIVPETGAIQVFTELGRMDVEPSEICLIPRGMMFKVTRLGEEKVWRGYICENYGAKFTLPDRGPIGANCLANP
RDFKTPVAAYEDKETPCRVQVKWCGSFHMVEIGHSPLDVVAWHGNYAPYKYDLKTFSPVGAILFDHPDPSIFTVLTAPSG
EEGTANVDFVIFPPRWLVAEHTFRPPWYHRNIMSEFMGLIYGRYDAKEEGFVPGGMSLHNMMLAHGPDFSGFEKASNGEL
KPVKLDNTMAFMFETRFPQQLTTFAAELDTLQDDYMDCWSGLERKFDGTPGIK
>P73726 2.5.1.115~~~~~~Homogentisate phytyltransferase~~~COG0382
MATIQAFWRFSRPHTIIGTTLSVWAVYLLTILGDGNSVNSPASLDLVFGAWLACLLGNVYIVGLNQLWDVDIDRINKPNL
PLANGDFSIAQGRWIVGLCGVASLAIAWGLGLWLGLTVGISLIIGTAYSVPPVRLKRFSLLAALCILTVRGIVVNLGLFL
FFRIGLGYPPTLITPIWVLTLFILVFTVAIAIFKDVPDMEGDRQFKIQTLTLQIGKQNVFRGTLILLTGCYLAMAIWGLW
AAMPLNTAFLIVSHLCLLALLWWRSRDVHLESKTEIASFYQFIWKLFFLEYLLYPLALWLPNFSNTIF
>P76097 1.13.11.93~~~ydcJ~~~2-oxoadipate dioxygenase/decarboxylase~~~COG5383
MANSITADEIREQFSQAMSAMYQQEVPQYGTLLELVADVNLAVLENNPQLHEKMVNADELARLNVERHGAIRVGTAQELA
TLRRMFAIMGMYPVSYYDLSQAGVPVHSTAFRPIDDASLARNPFRVFTSLLRLELIENEILRQKAAEILRQRDIFTPRCR
QLLEEYEQQGGFNETQAQEFVQEALETFRWHQSATVDEETYRALHNEHRLIADVVCFPGCHINHLTPRTLDIDRVQSMMP
ECGIEPKILIEGPPRREVPILLRQTSFKALEETVLFAGQKQGTHTARFGEIEQRGVALTPKGRQLYDDLLRNAGTGQDNL
THQMHLQETFRTFPDSEFLMRQQGLAWFRYRLTPSGEAHRQAIHPGDDPQPLIERGWVVAQPITYEDFLPVSAAGIFQSN
LGNETQTRSHGNASREAFEQALGCPVLDEFQLYQEAEERSKRRCGLL
>P9WL01 1.13.11.93~~~~~~2-oxoadipate dioxygenase/decarboxylase~~~COG5383
MSRSKRLQTGQLRARFAAGLSAMYAAEVPAYGTLVEVCAQVNSDYLTRHRRAERLGSLQRVTAERHGAIRVGNPAELAAV
ADLFAAFGMLPVGYYDLRTAESPIPVVSTAFRPIDANELAHNPFRVFTSMLAIEDRRYFDADLRTRVQTFLARRQLFDPA
LLAQARAIAADGGCDADDAPAFVAAAVAAFALSREPVEKSWYDELSRVSAVAADIAGVGSTHINHLTPRVLDIDDLYRRM
TERGITMIDTIQGPPRTDGPDVLLRQTSFRALAEPRMFRDEDGTVTPGILRVRFGEVEARGVALTPRGRERYEAAMAAAD
PAAVWATHFPSTDAEMAAQGLAYYRGGDPSAPIVYEDFLPASAAGIFRSNLDRDSQTGDGPDDAGYNVDWLAGAIGRHIH
DPYALYDALAQEERR
>Q88CC1 1.13.11.93~~~hglS~~~2-oxoadipate dioxygenase/decarboxylase~~~COG5383
MPANDFVSPDSIRAQFSAAMSLMYKQEVPLYGTLLELVSEINQQVMAQQPEVAEALRWTGEIERLDQERHGAIRVGTAEE
LATIARLFAVMGMQPVGYYDLSSAGVPVHSTAFRAVHEQSLHVSPFRVFTSLLRLELIDNPQLRELAQSILAKRQIFTSR
ALELIAQCEREGGLDAADAETFVQEALHTFRWHQDATVTAEQYQQLHDQHRLIADVVAFKGPHINHLTPRTLDIDAIQLG
MPAKGIPPKAVVEGPPTRRHPILLRQTSFKALQETVAFRDQQGREGSHTARFGEIEQRGAALTPKGRQLYDKLLDATRVA
LGGAPAEANAERYMALLQANFAEFPDDLAQMREQGLAYFRYFATEKGLAARDQEGRPTTLQGLIDAGHVHFEALVYEDFL
PVSAAGIFQSNLGDDAQAEYGSNANREAFEAALGLQVQDELALYAQSERRSLQACAQALNLGSM
>A0A0H2UZX2 1.13.11.93~~~ydcJ~~~2-oxoadipate dioxygenase/decarboxylase~~~
MANSITADEIREQFSQAMSAMYQQEVPQYGTLLELVADVNLAVLENNPQLHEKMVNADELARLNVERHGAIRVGTAQELA
TLRRMFAIMGMYPVSYYDLSQAGVPVHSTAFRPIDDASLARNPFRVFTSLLRLELIENEILRQKAAEILRQRDIFTPRCR
QLLEEYEQQGGFNETQAQEFVQEALETFRWHQLATVDEETYRALHNEHRLIADVVCFPGCHINHLTPRTLDIDRVQSMMP
ECGIEPKILIEGPPRREVPILLRQTSFKALEETVLFAGQKQGTHTARFGEIEQRGVALTPKGRQLYDDLLRNAGTGQDNL
THQMHLQETFRTFPDSEFLMRQQGLAWFRYRLTPSGEAHRQAIHPGDDPQPLIERGWVVAQPITYEDFLPVSAAGIFQSN
LGNETQTRSHGNASREAFEQALGCPVLDEFQLYQEAEERSKRRCGLL
>P44795 ~~~~~~Probable hemoglobin and hemoglobin-haptoglobin-binding protein 1~~~COG1629
MTNFKFSLLACSIAFALNASIAYAAQPTNQPTNQPTNQPTNQPTNQPTNQPTNQNSNVSEQLEQINVSGSSENINVKEKK
VGETQISAKKLAKQQASDSRDLVRYETGITVVETGRTGASGYAVRGVDENRVGIMVDGLRQAETLSSQGFKELFEGYGNF
NNTRNSIEIENVKTATITKGADSLKSGSGALGGSVIFETKDARDYLIDKDYYLSYKRGYQTMNNQNLKTLTLAGRSKKFD
ILIIDTTRDGHEIENYDYKIYPNKQADLRAVGPTREKADPYQITRQSTLIKLGFQPNENHRLSVALDDSTLETKGIDLSY
ALRPYSTANNEKYGERIINDQSKRKNIQFSYENFSQTPFWDHIKLSYSSQKITNKARSDEYCHQSTCNGVSNPQGLHLVE
EKGVYKIKDKYGGELESKEIGWSHEFKNSKGEDADKDISQRSSLDSVLINCEKLDCSKKFRIYQEYDENSSEKYTYDDRE
IEVGTLPNGKKYGKIPLKKGKTPSWNGFPQETARFLFPKSYGYSTDFVNDRDLNTHTQQIKLDLDKEFHLWHTQHQLKYG
GLYEKTLKSMVNHQYNTAANVQWWADYFFCARAKGGNLGEKKTPHPNVSVAGCVNGTPLHSDIGKDTYLIPVTTKNNVLY
FGDNVQLTSWLGLDLNYRYDHVKYLPGYDEKTPVPGGLIAGIFVPFNEKDVVYGAYVPSGYKDCRYNTECYKKNFEENLA
LLLRKTDYKHHSYNLGLNLDPTDWLRVQLKYANAFRAPTSDEIYMTFKHPDFSIGPNTNLKAETAKTKEVAFTFYKENSY
LTLSAFQSDYRNFIDLVFEKNKQIDKGSAIEYPFYQNQNRDQARVRGIEIASRLEMGDLFEKLQGFHLGYKLTYQKGRIK
DNKLRSGYAEFLKLNPQYTAIASQDQPMNALQPTTSVYNIGYDAPSKKWGMDVYITDVAAKKAKDSFNSQWTSMVKRKEN
IYGTERTVPATQANGKDVKDSRGLWRNNRYTVIDTIAYWKPIKNLTFTAGVYNLTNKKYLTWDSARSVRHLGTINRVETA
TGKGLNRFYAPGRNYRMSVQFEF
>P44809 ~~~~~~Probable hemoglobin and hemoglobin-haptoglobin-binding protein 2~~~COG1629
MTNFRLNVLAYSVMLGLTAGVAYAAQPTNQPTNQPTNQPTNQPTNQPTNQPTNQNSNVSEQLEQINVSGSTENSDTKTPP
KIAETVKTAKTLEREQANNIKDIVKYETGVTVVEAGRFGQSGFAIRGVDENRVAINIDGLRQAETLSSQGFKELFEGYGN
FNNTRNGAEIETLKEVNITKGADSIKNGSGSLGGSVIYKTKDARDYLINKDYYVSYKKGYATENNQSFDTLTLAGRYKKF
DVLVVTTSRNGHELENYGYKNYNDKIQGKKREKADPYKIEQDSTLLKLSFNPTENHRFTFAADLYEHRSRGQDLSYTLKY
QRSGNETPEVDSRHTNDKTKRRNISFSYENFSQTPFWDTLKLTYSDQRIKTRARTDEYCDAGVRHCEGTDNPTGLKVTNG
KITRRDGSDLQFEEKNNTAKSSDKTYDFKKFIDTDKRVIDDKLVLNNPSDTWYDCSIFNCENNAKIKVFKGNNYYGYDGK
WKEVDLEIKELNGKKFAKIKDNDRKIKSILPSSPGYLERLWQERDLDTNTQQLNLDLTKDFKIWHIEHNLQYGGSYNTAM
KRMVNRAGNDASDVQWWATPTLGEDSWTGKPHTCATTYEWNANLCPRVDPEFSYLLPIKTTGKSVYLFDNFVITDYLSFD
LGYRYDNIHYQPKYKHGITPKLPDDIVKGLFIPLPNNSNSDPNKVKENVQQNIDYIAKQNKKYKAHSYSFVSTIDPTSFL
RLQLKYSKGFRTPTSDEMYFTFKHPDFTILPNTDLKPEIAKTKEIAFTLHNDDWGFISTSLFKTNYKNFIDLIFKKQETF
KVGGSGRGETLPFSLYQNINRDNASLKGIEINSKVFLGKMAKFMDGFNLSYKYTYQKGRMNGNIPMNAIQPRTMVYGLGY
DHPNHKFGFDFYTTHVASKNPEDTYNMFYKEENKKDSTIKWRSKSYTILDLIGYVQPIKNLTIRAGVYNLTNRKYITWDS
ARSIRSFGTSNVIDQSTGLGINRFYAPGRNYKMSVQFEF
>P44836 ~~~~~~Probable hemoglobin and hemoglobin-haptoglobin-binding protein 3~~~COG1629
MTNFKFSLLACSIAFALNASTVYAAQPTNQPTNQPTNQPTNQPTNQPTNQPTNQPTNQPTNQPTNQPTNQPTNQNSNVSE
QLEQINVSGSSENINIKEKKVGETQISAKKLAKQQASDSRDLVRYETGITVVETGRTGASGYAVRGVDENRVGIMVDGLR
QAETLSSQGFKELFEGYGNFNNTRNSIEIENVKTATITKGADSLKSGSGALGGSVIFETKDARDYLIDKDYYLSYKRGYQ
TMNNQNLKTLTLAGRSKKFDILIIDTTRDGHEIENYDYKIYPNKQADLRAVGPTREKADPYQITRQSTLIKLGFQPNENH
RLSVALDDSTLETKGIDLSYALRPYSTAGNEKYGERIINDQSKRKNIQFSYENFSQTPFWDHIKLSYSSQKITNKARSDE
YCHQSTCNGVSNPQGLHLVEEGGVYKIVDKNGDKLTYNKNAGWYGQFQNKNGENVDNDIDSTGGSLDSVLIDCERLNCKN
KFQVFVEKDEEGKDKYEYEERDIIVETLPNGKKYGKITLKKGKTPLWDDVYQEESARFLFPKSYGYSTDFVNDRDLNTNT
QQIKLDLDKEFSLWHTQHSLKYGGFYEKTLKSMVNHQYNTVANVQWWAGNFFCNKLENGKRTPAPDYSHRCSLMNTDKGK
ETYLIPVTTKNNVLYFGDNVQLTSWLGLDLNYRYDHVKYLPSYDEKIPVPNGLITGLFKKFGPKDYVYGSKYSKPADYTD
CTYNSDCYKKNFKDNLALLLRKTDYKHHSYNLGLNLDPTDWLRVQLKYANGFRAPTSDEIYMTFKHPQFSIQPNTDLKAE
TSKTKEVAFTFYKNSSYITLNAFQNDYRNFIDLVEVGPRPIEEGSTIAYPFHQNQNRDRARVRGIEIASRLEMGDLFEKL
QGFHLGYKFTYQKGRIKDNGLNPKYKEFLELNKDKHPEYEAIARKPQPMNALQPTTSVYNIGYDAPSQKWGVDMYITNVA
AKKAKDSFNSQWTSMVKRKEKIYGNEKDAEASTANGKEVKDSRGLWRNNRYTVIDTIAYWKPIKNLTFTAGVYNLTNKKY
LTWDSARSIRHLGTINRVETATGKGLNRFYAPGRNYRMSVQFEF
>Q9ZA21 ~~~hgpA~~~Hemoglobin and hemoglobin-haptoglobin-binding protein A~~~
MTNFRLNVLAYSVMLGLTASVAYAEPTNQPTNQPTNQPTNQPTNQPTNQPTNQPTNQPTNQPTNQPTNQNSNASEQLEQI
NVSGSTENTDTKAPPKIAETVKTAKKLEKEQAQDVKDLVRYETGITVVEAGRFGNSGFAVRGVEENRVAVQIDGLHQAET
ISSQGFKELFEGYGNFNNTRNSAEIETLKQVTIRKGADSLKSGSGALGGSVSLDTKDARDYLLNKNYYASYKRGYNTADN
QNLNTLTLGGRYKYFDAIAVLTSRKGHELENFGYKNYNDKIQGKTREKADPYRRTQDSALLKIGFQPTENHRFSVVADLY
KQTSKGHDFSYTLKPNTQYMTYDEKELRHTNDKVERKNIAFVYENFTETPFWDTLKITYSHQKITTSARTDDYCDGNDKC
ALAGNPLGMKYNQDNQLVGKDGKSAKYQDINKTQVIKERLPFTKPNGRWRFHKVDWDALKKKYPGVPIYASCLEEDNDPS
EFCTYEVKTTKKENTFEINGKRYDLLSEADKNVISDEQRLPTNVSYLFSCDGLNCDKKTILGFKKRRNLLKIFLFEVIEK
RCQKYGKTKVKANDQLSGPYLFMPNKKGYQANLWSQRDLTSETKQINLDLTKHLELGKTQHDLSYGGLWSEMEKSMTNLA
GDTPLNVKWWAQYPHNCATFLPPSTMTPNAKPTLNPERTSTLCNNVNVFSFLIPVKTKTGALYFINDFRVNNYVAFNLGY
RYDRVKYEPEYIPGKTPKIPDDMVTNLYIKTPEFDASKADSDPDELSKKEANAAANIKEIAQPKKFSASSYSFGTTLDPL
NWLRLQAKYSKGFRAPTSDEIYFTFKHPDFSIQPNRDLQPETAKTKELSLTVHNDMGYITTSVFDTRYQNFIDLSYQGRR
DVHGHSKLIPFHFYQNVNRPNAKVTGFEIASQISLGNITKLFNGFSLSYKYTYQKGRINGNIPMNAIQPRTAVYGVSYVH
PDDKYGLDLYISHASAKNAEDTYNMFYKEEGKTDSTIKWRSKSYTTIDLLGYIKPIKNLTLRAGVYNLTNRKYITWDSAR
SIRPFGTSNMINQDTGLGINRFYAPERNYRMSVQFEF
>P9WHQ9 2.4.2.8~~~hpt~~~Hypoxanthine-guanine phosphoribosyltransferase~~~COG0634
MHVTQSSSAITPGQTAELYPGDIKSVLLTAEQIQARIAELGEQIGNDYRELSATTGQDLLLITVLKGAVLFVTDLARAIP
VPTQFEFMAVSSYGSSTSSSGVVRILKDLDRDIHGRDVLIVEDVVDSGLTLSWLSRNLTSRNPRSLRVCTLLRKPDAVHA
NVEIAYVGFDIPNDFVVGYGLDYDERYRDLSYIGTLDPRVYQ
>P99085 2.4.2.8~~~hpt~~~Hypoxanthine-guanine phosphoribosyltransferase~~~
MHNDLKEVLLTEEDIQNICKELGAQLTKDYQGKPLVCVGILKGSAMFMSDLIKRIDTHLSIDFMDVSSYHGGTESTGEVQ
IIKDLGSSIENKDVLIIEDILETGTTLKSITELLQSRKVNSLEIVTLLDKPNRRKADIEAKYVGKKIPDEFVVGYGLDYR
ELYRNLPYIGTLKPEVYSN
>Q5XEL6 2.4.2.8~~~hpt~~~Hypoxanthine-guanine phosphoribosyltransferase~~~
MLEQDIQKILYSENDIIRKTKKLGEQLTKDYQEKNPLMIGVLKGSVPFMAELMKHIDTHVEIDFMVVSSYHGGTSSSGEV
KILKDVDTNIEGRDIIIVEDIIDTGRTLKYLRDMFKYRKANTIKIATLFDKPEGRVVKIEADYVCYNIPNEFIVGFGLDY
AENYRNLPYVGVLKEEVYSK
>P0ACE3 ~~~hha~~~Hemolysin expression-modulating protein Hha~~~
MSEKPLTKTDYLMRLRRCQTIDTLERVIEKNKYELSDNELAVFYSAADHRLAELTMNKLYDKIPSSVWKFIR
>Q7CR17 ~~~hha~~~Hemolysin expression-modulating protein Hha~~~
MSDKPLTKTDYLMRLRRCQTIDTLERVIEKNKYELSDNELAVFYSAADHRLAELTMNKLYDKIPSSVWKFIR
>P72780 ~~~hhoA~~~Putative serine protease HhoA~~~COG0265
MKYPTWLRRIGGYLLAFAVGTAFGIANLPHAVAAADDLPPAPVITAQASVPLTSESFVAAAVSRSGPAVVRIDTETVVTR
RTDPILDDPFFQEFFGRSFPVPPRERRIAGQGSGFIIDNSGIILTNAHVVDGASKVVVTLRDGRTFDGQVRGTDEVTDLA
VVKIEPQGSALPVAPLGTSSNLQVGDWAIAVGNPVGLDNTVTLGIISTLGRSAAQAGIPDKRVEFIQTDAAINPGNSGGP
LLNARGEVIGINTAIRADATGIGFAIPIDQAKAIQNTLAAGGTVPHPYIGVQMMNITVDQAQQNNRNPNSPFIIPEVDGI
LVMRVLPGTPAERAGIRRGDVIVAVDGTPISDGARLQRIVEQAGLNKALKLDLLRGDRRLSLTVQTAQLRNPTS
>P73940 ~~~hhoB~~~Putative serine protease HhoB~~~COG0265
MAIHLKASHLGVAVLLLLFGGAIGAAGGGYLLSSGQNHSSPDSPVNTSPQSLTPAPVESNYRSALPLTLPRSAQDDQELN
FIARAVQKIGPAVVRIDSERTAVSQGGPMGDQPFFRRFFGEEMPPNPDPREQGTGSGFILSSDGEVLTNAHVVEGASTVK
VTLKDGSVLEGKVMGIDTMTDVAVVKVEAENLPVVEIGQSDRLQPGEWAIAIGNPLGLDNTVTVGIISALGRSSSEVGVP
DKRVRFIQTDAAINPGNSGGPLLNAKGEVIGVNTAIRADAQGLGFAIPIQTAQNVAENLFTKGKMEHPYLGIHMVTLTPE
MTKQLRTSGELPAGVTADTGVLIIQVSPGSPAAQAGLAPGDIILEVGGMGVKTATDVQERVEVSQIGEPLAIAVKRGQKP
QMMAVRPGPFPEDLGQ
>P12779 ~~~~~~Host-inducible protein A~~~
MHLDRSDSNGGSSRYTLDHEPPVVPIDLKTFRREIRKFHGKEITDIADNPQEYSDFVSAKARRTADVAQQYGIRRDSENA
RYFSYQLGNQCVGLMRTEGGFSMEEEFESKSWRDQFPGHQEITSTVDLQVAHPLVENAGDILLEAPTSEGRRTTVAELAR
GKPRGESRAAMMGFVEVDDCDMVLDPKQHPDKWTQTSAAEWRRKDKPPLYLRKFEDAETAQCSTKAALTRLTKMTSCDRI
LARFGRDGRAPSAKTGPHVMDRERKSVTNCHALTAIPIRCLGAKELERRVSLRPS
>P76106 3.1.-.-~~~hicA~~~Probable mRNA interferase toxin HicA~~~COG1724
MKQSEFRRWLESQGVDVANGSNHLKLRFHGRRSVMPRHPCDEIKEPLRKAILKQLGLS
>P67697 ~~~hicB~~~Antitoxin HicB~~~COG1598
MRYPVTLTPAPEGGYMVSFVDIPEALTQGETVAEAMEAAKDALLTAFDFYFEDNELIPLPSPLNSHDHFIEVPLSVASKV
LLLNAFLQSEITQQELARRIGKPKQEITRLFNLHHATKIDAVQLAAKALGKELSLVMV
>Q72IW9 1.1.1.286~~~hicd~~~Isocitrate/homoisocitrate dehydrogenase~~~COG0473
MAYRICLIEGDGIGHEVIPAARRVLEATGLPLEFVEAEAGWETFERRGTSVPEETVEKILSCHATLFGAATSPTRKVPGF
FGAIRYLRRRLDLYANVRPAKSRPVPGSRPGVDLVIVRENTEGLYVEQERRYLDVAIADAVISKKASERIGRAALRIAEG
RPRKTLHIAHKANVLPLTQGLFLDTVKEVAKDFPLVNVQDIIVDNCAMQLVMRPERFDVIVTTNLLGDILSDLAAGLVGG
LGLAPSGNIGDTTAVFEPVHGSAPDIAGKGIANPTAAILSAAMMLDYLGEKEAAKRVEKAVDLVLERGPRTPDLGGDATT
EAFTEAVVEALKSL
>Q5SIJ1 1.1.1.286~~~hicd~~~Isocitrate/homoisocitrate dehydrogenase~~~COG0473
MAYRICLIEGDGIGHEVIPAARRVLEATGLPLEFVEAEAGWETFERRGTSVPEETVEKILSCHATLFGAATSPTRKVPGF
FGAIRYLRRRLDLYANVRPAKSRPVPGSRPGVDLVIVRENTEGLYVEQERRYLDVAIADAVISKKASERIGRAALRIAEG
RPRKTLHIAHKANVLPLTQGLFLDTVKEVAKDFPLVNVQDIIVDNCAMQLVMRPERFDVIVTTNLLGDILSDLAAGLVGG
LGLAPSGNIGDTTAVFEPVHGSAPDIAGKGIANPTAAILSAAMMLDYLGEKEAAKRVEKAVDLVLERGPRTPDLGGDATT
EAFTEAVVEALKSL
>P14212 ~~~hifA~~~Major fimbrial subunit~~~
MKKTLLGSLILLAFAGNVQADINTETSGKVTFFGKVVENTCKVKTEHKNLSVVLNDVGKNSLSTKVNTAMPTPFTITLQN
CDPTTANGTANKANKVGLYFYSWKNVDKENNFTLKNEQTTADYATNVNIQLMESNGTKAISVVGKETEDFMHTNNNGVAL
NQTHPNNAHISGSTQLTTGTNELPLHFIAQYYATNKATAGKVQSSVDFQIAYE
>P9WJA7 ~~~higA1~~~Antitoxin HigA1~~~COG1396
MSIDFPLGDDLAGYIAEAIAADPSFKGTLEDAEEARRLVDALIALRKHCQLSQVEVAKRMGVRQPTVSGFEKEPSDPKLS
TLQRYARALDARLRLVLEVPTLREVPTWHRLSSYRGSARDHQVRVGADKEILMQTNWARHISVRQVEVA
>Q9KMG4 ~~~higA-1~~~Antitoxin HigA-1~~~COG3093
MRKTKRRPVSVGEMLKVEFLEPMGITSKALAEAMGVHRNTVSNLINGGVLTAPVAIKLAAALGNTPEFWLNIQHAVDLWD
TRNRYQEEAKFVKPLFVSLEQSART
>O53467 ~~~higA2~~~Putative antitoxin HigA2~~~COG1396
MAMTLRDMDAVRPVNREAVDRHKARMRDEVRAFRLRELRAAQSLTQVQVAALAHIRQSRVSSIENGDIGSAQVNTLRKYV
SALGGELDITVRLGDETFTLA
>Q9KMA5 ~~~higA-2~~~Antitoxin HigA-2~~~COG2944
MSNRDLFAELSSALVEAKQHSEGKLTLKTHHVNDVGELNISPDEIVSIREQFNMSRGVFARLLHTSSRTLENWEQGRSVP
NGQAVTLLKLVQRHPETLSHIAEL
>O53333 ~~~higA3~~~Putative antitoxin HigA3~~~COG1396
MTMARNWRDIRADAVAQGRVDLQRAAVAREEMRDAVLAHRLAEIRKALGHARQADVAALMGVSQARVSKLESGDLSHTEL
GTLQAYVAALGGHLRIVAEFGENTVELTA
>P67701 ~~~higA~~~Antitoxin HigA~~~COG5499
MIAIADILQAGEKLTAVAPFLAGIQNEEQYTQALELVDHLLLNDPENPLLDLVCAKITAWEESAPEFAEFNAMAQAMPGG
IAVIRTLMDQYGLTLSDLPEIGSKSMVSRVLSGKRKLTLEHAKKLATRFGISPALFID
>Q7A224 ~~~higA~~~Antitoxin HigA~~~
MRQFKVSHPGEMIARDLEDMGVSGRRFAHNIGVTPATVSRLLAGKTALTPSLSIRIAAALGSTPEFWLRLQSNYDLRQLE
NQIDTSGIVLYGESNEQQQNAQEH
>P67703 ~~~higA~~~Antitoxin HigA~~~
MIAIADILQAGEKLTAVAPFLAGIQNEEQYTQALELVDHLLLNDPENPLLDLVCAKITAWEESAPEFAEFNAMAQAMPGG
IAVIRTLMDQYGLTLSDLPEIGSKSMVSRVLSGKRKLTLEHAKKLATRFGISPALFID
>P9WJA5 3.1.-.-~~~higB1~~~Probable endoribonuclease HigB1~~~COG4679
MPPPDPAAMGTWKFFRASVDGRPVFKKEFDKLPDQARAALIVLMQRYLVGDLAAGSIKPIRGDILELRWHEANNHFRVLF
FRWGQHPVALTAFYKNQQKTPKTKIETALDRQKIWKRAFGDTPPI
>Q9KMG5 ~~~higB-1~~~Toxin HigB-1~~~COG3549
MALEFKDKWLEQFYEDDKRHRLIPSSIENALFRKLEILDAAQAESDLRIPPGNRFEHLEGNLKGWCSIRVNKQYRLIFQW
VDGVALNTYLDPHKY
>O53468 ~~~higB2~~~Putative toxin HigB2~~~COG4683
MNVPWENAHGGALYCLIRGDEFSAWHRLLFQRPGCAESVLACRHFLDGSPVARCSYPEEYHPCVISRIALLCDSVGWTAD
VERISAWLNGLDRETYELVFAAIEVLEEEGPALGCPLVDTVRGSRHKNMKELRPGSQGRSEVRILFAFDPARQAIMLAAG
NKAGRWTQWYDEKIKAADEMFAEHLAQFEDTKPKRRKRKKG
>Q9KMA6 ~~~higB-2~~~Toxin HigB-2~~~COG4737
MKSVFVESTIFEKYRDEYLSDEEYRLFQAELMLNPKLGDVIQGTGGLRKIRVASKGKGKRGGSRIIYYFLDEKRRFYLLT
IYGKNEMSDLNANQRKQLMAFMEAWRNEQS
>P64578 3.1.-.-~~~higB~~~mRNA interferase toxin HigB~~~COG4680
MHLITQKALKDAAEKYPQHKTELVALGNTIAKGYFKKPESLKAVFPSLDNFKYLDKHYVFNVGGNELRVVAMVFFESQKC
YIREVMTHKEYDFFTAVHRTKGKK
>Q7A225 3.1.-.-~~~higB~~~Endoribonuclease HigB~~~
MIKSFKHKGLKLLFEKGVTSGVPAQDVDRINDRLQAIDTATEIGELNRQIYKLHPLKGDREGYWSITVRANWRITFQFIN
GDAYILNYEDYH
>P64580 3.1.-.-~~~higB~~~mRNA interferase HigB~~~
MHLITQKALKDAAEKYPQHKTELVALGNTIAKGYFKKPESLKAVFPSLDNFKYLDKHYVFNVGGNELRVVAMVFFESQKC
YIREVMTHKEYDFFTAVHRTKGKK
>P73276 2.7.13.3~~~hik2~~~Sensor histidine kinase Hik2~~~COG0642
MAGSISSMYSPSAGLISLCQSQVRLLQQGLRVDWCGVYLNQEETEQGLVPLVVSHGSTLVESESYGLISLPQGEVSPPMD
DFSLPAVPVGVGQLSRRSRLEPPPFDADKRLVLPLVYGEEMVGLLVIHRSQGQWHGEEMMQLEAIAKSLAVACLLDQQQD
WYRQAWEEQNQQYQWERQHWADLLHQLRNPLTALKTFSKLLLKRWHGDNKSQQVVEGIVRQGEHLQELLQSFEASQSQGP
EAVPLLSSSPVTTIQVLPPADRVETMPLANFSLGEVLPPILLAHQAIAAERNITLTAQIALIDTVVMANRLALREVVNNL
LDNGIKYTPNGGLVEVSLALEKVSSSGMDWATLAIADTGYGIPPEDQQKIFERNYRGVQGRGSINGTGLGLAIVADLVAQ
MGGKITVTSPNGLSRDPDQPGSTFTLWLRSGEQV
>Q8DMC5 2.7.13.3~~~hik2~~~Sensor histidine kinase Hik2~~~COG2205
MLWPASEEFAALCRTQLELVVNSLGASSLAVYLSETLNDSPSWSPVAVYPEAASLLSLAIPPTLPPPTQVPETSLSHYPQ
QVVSSLANQLILPLMYQNWVLGVLVAQRQHRPWLAAEQAQLQQVAQTLAIACVLDQRQQWLSHSPAQPLDQRQQRFDDLL
HQLRNPVAAIRTFVKLLLKRLEPDHKGRPLAEGIAKETERLMALLEDYRQQRNDIPALTGSQPLPLAGKPLDLAETLLPL
ISAAQARAEMEGKTFVVEIPPQLPPIWLEERVLQEVVGNLLDNAFKYTPKGGTIGLRLMLSSPALELTVWDTGCGIPKEA
QPRLFERGYRGVQADSGIEGSGLGLAIAQDLLRPYGLSLRVTSPYAGDRGTAFTLAIPWQMKVEP
>P74199 2.7.13.3~~~~~~Sensor histidine kinase Hik34~~~COG0642
MNEVCLKLSDLFVSSGWGGYDRGRAPQWAHPRAQQQWFGAIAALEPFLRQTLPNVGGELPGICLTGPAPVLKDAVLVRNF
YQGIATPWEEFSPWPCLAGEESEWSAVPPMREIPLFPQDPLAEEQFCWLMTPQFGLLLLLGKNEQGLAQFYWTFDPEILQ
QAWLSLQARLKYGLSPDLSLLQKTIAAFNFPQPDFRLVTYFGQLMLDYQPNPYNLPPCQEQESAEPSPDVELLQALTHEV
RTPLTSIRTLTKLLLRRKDLSPEVLKRIESIDRECSDQISRMDLIFRATELESTPLPELVVPLTVTSLEAVFQAGIPRWQ
KQAQRYNVNLQAQIPHSLPQVWSNPSLLDQVLGGMIEKFVRNFNGGGEINLQITTAGDQLKVQFHTQSVHQANPVRALGE
LLMFQPQTGCLSLNWDVTKNLFQLLGGKLIVRRRSPSEEILTIYLKCEQRTVPVANYDRQFTMV
>A0A0H3L1B8 1.14.11.74~~~hilA~~~L-isoleucine 3(1)-dioxygenase~~~
MTDLLTLEPTQTILTGSKKTNFGYLESTDGVINFSIVKNIILNGHHHGNVLYVIRNYASKAVCEKLAKNFDYRVTQSGGN
RADDGFVLTNQIGATQFSRNGEQYIHEVNRVNQSVADLMKATSAEDSESLFLNLTLEKEFLERGIHFGPARFKNGYACFA
TFRRWLDNGVMSLMPHEDMAQVDFAKEDGFEIANTQTVTAYNVCLEAAQGGGQLKIWNLIPDQVCRETLGVTRTGYPYPP
HLLNETESLSVQLNAGDLYFMNACHLHGVSSVSEGSRLTAGRFIGKLNDRKVVYWT
>P43015 ~~~hilA~~~Transcriptional regulator HilA~~~
MPHFNPVPVSNKKFVFDDFILNMDGSLLRSEKKVNIPPKEYAVLVILLEAAGEIVSKNTLLDQVWGDAEVNEESLTRCIY
ALRRILSEDKEHRYIETLYGQGYRFNRPVVVVSPPAPQPTTHTLAILPFQMQDQVQSESLHYSIVKGLSQYAPFGLSVLP
VTITKNCRSVKDILELMDQLRPDYYISGQMIPDGNDNIVQIEIVRVKGYHLLHQESIKLIEHQPASLLQNKIANLLLRCI
PGLRWDTKQISELNSIDSTMVYLRGKHELNQYTPYSLQQALKLLTQCVNMSPNSIAPYCALAECYLSMAQMGIFDKQNAM
IKAKEHAIKATELDHNNPQALGLLGLINTIHSEYIVGSLLFKQANLLSPISADIKYYYGWNLFMAGQLEEALQTINECLK
LDPTRAAAGITKLWITYYHTGIDDAIRLGDELRSQHLQDNPILLSMQVMFLSLKGKHELARKLTKEISTQEITGLIAVNL
LYAEYCQNSERALPTIREFLESEQRIDNNPGLLPLVLVAHGEAIAEKMWNKFKNEDNIWFKRWKQDPRLIKLR
>A0A0H3L116 1.14.11.75~~~hilB~~~3(1)-hydroxy-L-isoleucine 4-dioxygenase~~~COG4340
MMEYATHLSRQGYAFIPGDYYRSTEAMQFSNKEDFLDELEELKKGYENLLLDPYSPGNRWRGYAQCKKNEKGELTFGKFN
PYKQTKAFNPDTGDIIRDYPLLPEAITRNRLFQTLLHDDLSLVDAYESIGPVDSLTIGIHFFRYQATENEPAYSSPVWLH
KDDEDVVFVHMINASPNMLGGDSLIASHPRSIDRVLRLEQLFDTLVVNHDKLHAVTPVGARENSGPAQRDIILITFQKNE
EKTACPV
>P0ACE7 3.9.1.-~~~hinT~~~Purine nucleoside phosphoramidase~~~COG0537
MAEETIFSKIIRREIPSDIVYQDDLVTAFRDISPQAPTHILIIPNILIPTVNDVSAEHEQALGRMITVAAKIAEQEGIAE
DGYRLIMNTNRHGGQEVYHIHMHLLGGRPLGPMLAHKGL
>P44956 3.9.1.-~~~~~~Purine nucleoside phosphoramidase~~~COG0537
MAEETIFSKIIRKEIPANIVYQDELVTAFRDISPQAKTHILIIPNKVIPTVNDVTEQDEVALGRLFSVAAKLAKEEGVAE
DGYRLIVNCNKHGGQEVFHLHMHLVGGEPLGRMLAK
>P03013 ~~~hin~~~DNA-invertase hin~~~
MATIGYIRVSTIDQNIDLQRNALTSANCDRIFEDRISGKIANRPGLKRALKYVNKGDTLVVWKLDRLGRSVKNLVALISE
LHERGAHFHSLTDSIDTSSAMGRFFFHVMSALAEMERELIVERTLAGLAAARAQGRLGGRPRAINKHEQEQISRLLEKGH
PRQQLAIIFGIGVSTLYRYFPASRIKKRMN
>P83341 ~~~~~~High-potential iron-sulfur protein isozyme 1~~~
AEKLEESSAEAKALSYVHDATTSGHDSYQEGQKCINCLLYTDPSQEEWGGCAVFPGKLVNANGWCTAYVARG
>P38941 ~~~hip1~~~High-potential iron-sulfur protein isozyme 1~~~
AERLDENSPEALALNYKHDGASVDHPSHAAGQKCINCLLYTDPSATEWGGCAVFPNKLVNANGWCTAYVARG
>P04168 ~~~hip1~~~High-potential iron-sulfur protein isozyme 1~~~
EPRAEDGHAHDYVNEAADASGHPRYQEGQLCENCAFWGEAVQDGWGRCTHPDFDEVLVKAEGWCSVYAPAS
>P9WHR2 3.4.21.-~~~hip1~~~Serine protease Hip1~~~
MGMRLSRRDKIARMLLIWAALAAVALVLVGCIRVVGGRARMAEPKLGQPVEWTPCRSSNPQVKIPGGALCGKLAVPVDYD
RPDGDVAALALIRFPATGDKIGSLVINPGGPGESGIEAALGVFQTLPKRVHERFDLVGFDPRGVASSRPAIWCNSDADND
RLRAEPQVDYSREGVAHIENETKQFVGRCVDKMGKNFLAHVGTVNVAKDLDAIRAALGDDKLTYLGYSYGTRIGSAYAEE
FPQRVRAMILDGAVDPNADPIEAELRQAKGFQDAFNNYAADCAKNAGCPLGADPAKAVEVYHSLVDPLVDPDNPRISRPA
RTKDPRGLSYSDAIVGTIMALYSPNLWQHLTDGLSELVDNRGDTLLALADMYMRRDSHGRYNNSGDARVAINCVDQPPVT
DRDKVIDEDRRAREIAPFMSYGKFTGDAPLGTCAFWPVPPTSQPHAVSAPGLVPTVVVSTTHDPATPYKAGVDLANQLRG
SLLTFDGTQHTVVFQGDSCIDEYVTAYLIGGTTPPSGAKC
>P9WHR3 3.4.21.-~~~hip1~~~Serine protease Hip1~~~COG0596
MGMRLSRRDKIARMLLIWAALAAVALVLVGCIRVVGGRARMAEPKLGQPVEWTPCRSSNPQVKIPGGALCGKLAVPVDYD
RPDGDVAALALIRFPATGDKIGSLVINPGGPGESGIEAALGVFQTLPKRVHERFDLVGFDPRGVASSRPAIWCNSDADND
RLRAEPQVDYSREGVAHIENETKQFVGRCVDKMGKNFLAHVGTVNVAKDLDAIRAALGDDKLTYLGYSYGTRIGSAYAEE
FPQRVRAMILDGAVDPNADPIEAELRQAKGFQDAFNNYAADCAKNAGCPLGADPAKAVEVYHSLVDPLVDPDNPRISRPA
RTKDPRGLSYSDAIVGTIMALYSPNLWQHLTDGLSELVDNRGDTLLALADMYMRRDSHGRYNNSGDARVAINCVDQPPVT
DRDKVIDEDRRAREIAPFMSYGKFTGDAPLGTCAFWPVPPTSQPHAVSAPGLVPTVVVSTTHDPATPYKAGVDLANQLRG
SLLTFDGTQHTVVFQGDSCIDEYVTAYLIGGTTPPSGAKC
>P00266 ~~~hip~~~High-potential iron-sulfur protein~~~
GTNASMRKAFNYQEVSKTAGKNCANCAQFIPGASASAAGACKVIPGDSQIQPTGYCDAYIVKK
>P83342 ~~~~~~High-potential iron-sulfur protein isozyme 2~~~
AELERLSEDDATAQALSYTHDASGVTHDSYQEGSRCSNCLLYSNPDAKDWGPCSVFPKHLVAEGGWCTAWVGRG
>P38524 ~~~hip2~~~High-potential iron-sulfur protein isozyme 2~~~
MERLSEDDPAAQALEYRHDASSVQHPAYEEGQTCLNCLLYTDASAQDWGPCSVFPGKLVSANGWCTAWVAR
>P04169 ~~~hip2~~~High-potential iron-sulfur protein isozyme 2~~~
GLPDGVEDLPKAEDDHAHDYVNDAADTDHARFQEGQLCENCQFWVDYVNGWGYCQHPDFTDVLVRGEGWCSVYAPA
>P33678 ~~~hip~~~High-potential iron-sulfur protein~~~
GTNAAMRKAFNYQDTAKNGKKCSGCAQFVPGASPTAAGGCKVIPGDNQIAPGGYCDAFIVKK
>P23874 2.7.11.1~~~hipA~~~Serine/threonine-protein kinase toxin HipA~~~COG3550
MPKLVTWMNNQRVGELTKLANGAHTFKYAPEWLASRYARPLSLSLPLQRGNITSDAVFNFFDNLLPDSPIVRDRIVKRYH
AKSRQPFDLLSEIGRDSVGAVTLIPEDETVTHPIMAWEKLTEARLEEVLTAYKADIPLGMIREENDFRISVAGAQEKTAL
LRIGNDWCIPKGITPTTHIIKLPIGEIRQPNATLDLSQSVDNEYYCLLLAKELGLNVPDAEIIKAGNVRALAVERFDRRW
NAERTVLLRLPQEDMCQTFGLPSSVKYESDGGPGIARIMAFLMGSSEALKDRYDFMKFQVFQWLIGATDGHAKNFSVFIQ
AGGSYRLTPFYDIISAFPVLGGTGIHISDLKLAMGLNASKGKKTAIDKIYPRHFLATAKVLRFPEVQMHEILSDFARMIP
AALDNVKTSLPTDFPENVVTAVESNVLRLHGRLSREYGSK
>Q8EIX3 2.7.11.1~~~hipA~~~Serine/threonine-protein kinase toxin HipA~~~COG3550
MSTAKTLTLEMHLGDLMIGELSFDATADTFAVHYTKDWQQSGFPLSPTIPLDGTGTSNQISMFLVNLLPENKGLDYLIES
LGVSKGNTFALIRAIGLDTAGAIAFVPKGALLPETQLRPIKAEEVIQRIEDPTMWPMEIWDGKPRLSVAGVQPKLNLFYN
GKEFAFAEGTLSSTHIVKFEKYHHLVINEFITMRLAKVLGMNVANVDIVHFGRYKALCVERFDRRNIPGEQRVLRRHIVD
SCQALGFSVSKKYERNFGTGRDVKDIREGVSFNRLFSLAAKCRNPVAAKQDMLQWALFNLLTGNADAHGKNYSFFMTPSG
MEPTPWYDLVSVDMYEDFEQQLAMAIDDEFDPNSIYAYQLAAFMDGLGLPRNLLISNLTRIARRIPQAIAEVILMLPPLD
EDEASFVAHYKTQLLARCERYLGFVDEVRDVEV
>P23873 ~~~hipB~~~Antitoxin HipB~~~COG1396
MMSFQKIYSPTQLANAMKLVRQQNGWTQSELAKKIGIKQATISNFENNPDNTTLTTFFKILQSLELSMTLCDAKNASPES
TEQQNLEW
>Q8EIX4 ~~~hipB~~~Antitoxin HipB~~~COG1396
MASPLNQQSLGLLIKERRKSAALTQDVAAMLCGVTKKTLIRVEKGEDVYISTVFKILDGLGIDIVSAQTSDTETNGWY
>P45493 3.5.1.32~~~hipO~~~Hippurate hydrolase~~~COG1473
MNLIPEILDLQGEFEKIRHQIHENPELGFDELCTAKLVAQKLKEFGYEVYEEIGKTGVVGVLKKGNSDKKIGLRADMDAL
PLQECTNLPYKSKKENVMHACGHDGHTTSLLLAAKYLASQNFNGALNLYFQPAEEGLGGAKAMIEDGLFEKFDSDYVFGW
HNMPFGSDKKFYLKKGAMMASSDSYSIEVIGRGGHGSAPEKAKDPIYAASLLIVALQSIVSRNVDPQNSAVVSIGAFNAG
HAFNIIPDIATIKMSVRALDNETRKLTEEKIYKICKGIAQANDIEIKINKNVVAPVTMNNDEAVDFASEVAKELFGEKNC
EFNHRPLMASEDFGFFCEMKKCAYAFLENENDIYLHNSSYVFNDKLLARAASYYAKLALKYLK
>P00260 ~~~hip~~~High-potential iron-sulfur protein~~~
MSDKPISKSRRDAVKVMLGTAAAIPMINLVGFGTARASAPANAVAADDATAIALKYNQDATKSERVAAARPGLPPEEQHC
ANCQFMQADAAGATDEWKGCQLFPGKLINVNGWCASWTLKAG
>B3EBZ3 ~~~~~~High-potential iron-sulfur protein~~~
SAPANAVSADDATAIALKYNQDATKSERVSAARPGLPPEEQHCANCQFMQADAAGATDEWKGCQLFPGKLINVNGWCASW
TLKAG
>P00264 ~~~hip~~~High-potential iron-sulfur protein~~~
QDLPPLDPSAEQAQALNYVKDTAEAADHPAHQEGEQCDNCMFFQADSQGCQLFPQNSVEPAGWCQSWTAQN
>B3EBZ6 ~~~~~~High-potential iron-sulfur protein~~~
QDLPHVDPATDPTAQALKYSEDAANADRAAAARPGKPPEEQFCHNCQFVLADSGEWRPCSLFPGKAVHETGWCASWTLKA
G
>B3EBZ5 ~~~~~~High-potential iron-sulfur protein~~~
EVPADAVTESDPTAVALKYHRNAAESERVAAARPGLPPEEQHCENCQFMLPDQGADEWRGCSLFPGKLINLNGWCASWTL
RAG
>P00262 ~~~hip~~~High-potential iron-sulfur protein~~~
EVPANAVTESDPTAVALKYHRNAEASERVAAARPGLPPEEQHCENCQFMLPDQGADEWRGCSLFPGKLINLDGWCASWTL
RAG
>P59860 ~~~hip~~~High-potential iron-sulfur protein~~~
VPANAVTESDPAAVALKYHRDAASSERVAAARPGLPPEEQHCENCQFMNPDSAAADWKGCQLFPGKLINLSGWCASWTLR
AG
>P80882 ~~~hip~~~High-potential iron-sulfur protein~~~
AAPLVAETDANAKSLGYVADTTKADKTKYPKHTKDQSCSTCALYQGKTAPQGACPLFAGKEVVAKGWCSAWAKKA
>P38589 ~~~hip~~~High-potential iron-sulfur protein~~~
QDKIDPKMVQYQDSPKDGNKCSTCVNFEAPSSCKIVAGKISPNGWCIAYAPMEDKKG
>P00265 ~~~hip~~~High-potential iron-sulfur protein~~~
APVDEKNPQAVALGYVSDAAKADKAKYKQFVAGSHCGNCALFQGKATDAVGGCPLFAGKQVANKGWCSAWAKKA
>P80176 ~~~hip~~~High-potential iron-sulfur protein~~~
AAPANAVTADDPTAIALKYNQDATKSERVAAARPGLPPEEQHCANCQFMQANVGEGDWKGCQLFPGKLINVNGWCASWTL
KAG
>P00263 ~~~hip~~~High-potential iron-sulfur protein~~~
EDLPHVDAATNPIAQSLHYIEDANASERNPVTKTELPGSEQFCHNCSFIQADSGAWRPCTLYPGYTVSEDGWCLSWAHKT
A
>P00261 ~~~hip~~~High-potential iron-sulfur protein~~~
EAPANAVAANDPTAVALKYNADATKSDRLAAARPGLPPAEQHCANCQFHLDDVAGATEEWHGCSLFPGKLINVDGWCASW
TLKAG
>B3EBZ4 ~~~~~~High-potential iron-sulfur protein~~~
EAPANAVTMDDPTAQALKYHPSAADSDRVAAARPGLPPEEQHCANCNFMQADVGEGDYKGCQLFPGKLINVNGWCASWTL
KAG
>O34520 2.4.2.17~~~hisG~~~ATP phosphoribosyltransferase~~~COG0040
MGKLLTMAMPKGRIFEEAAGLLRQAGYRLPEEFEDSRKLIIDVPEENLRFILAKPMDVTTYVEHGVADVGIAGKDVMLEE
ERDVYEVLDLNISKCHLAVAGLPNTDWSGVAPRIATKYPNVASSYFREQGEQVEIIKLNGSIELAPLIGLADRIVDIVST
GQTLKENGLVETEHICDITSRFIVNPVSYRMKDDVIDEMASRLSLVVEGETAK
>Q5HSJ4 2.4.2.17~~~hisG~~~ATP phosphoribosyltransferase~~~
MQENTRLRIAIQKSGRLSKESIELLSECGVKMHIHEQSLIAFSTNLPIDILRVRDDDIPGLIFDGVVDLGIIGENVLEEN
ELERQSLGENPSYKLLKKLDFGYCRLSLALPQENKFQNLKDFEGLRIATSYPQLLKRFMKENGINYKNCTLTGSVEVAPR
ANLADAICDLVSSGATLQANNLKEVKVIYESRACLIQKENALSKEKQALVDKIMLRVAGVMQARESKYIMLHAPKEKLDK
IQALLPGVERPTILPLAHDEKNVALHMVSKENLFWETMEALKEEGASSILVLPIEKMLK
>P60757 2.4.2.17~~~hisG~~~ATP phosphoribosyltransferase~~~COG0040
MTDNTRLRIAMQKSGRLSDDSRELLARCGIKINLHTQRLIAMAENMPIDILRVRDDDIPGLVMDGVVDLGIIGENVLEEE
LLNRRAQGEDPRYFTLRRLDFGGCRLSLATPVDEAWDGPLSLNGKRIATSYPHLLKRYLDQKGISFKSCLLNGSVEVAPR
AGLADAICDLVSTGATLEANGLREVEVIYRSKACLIQRDGEMEESKQQLIDKLLTRIQGVIQARESKYIMMHAPTERLDE
VIALLPGAERPTILPLAGDQQRVAMHMVSSETLFWETMEKLKALGASSILVLPIEKMME
>Q02129 2.4.2.17~~~hisG~~~ATP phosphoribosyltransferase~~~COG0040
MIKIAITKGRIQKQVTKLLENADYDVEPILNLGRELQIKTKDDLQIIFGKPNDVITFLEHGIVDIGFVGKDTLDENDFDD
YYELLYLKIGQCIFALASYPDFSNKNFQRHKRIASKYPRVTKKYFAQKQEDIEIIKLEGSVELGPVVGLADAIVDIVETG
NTLSANGLEVIEKISDISTRMIVNKSSFKFKKDKIIEMVERLEDAQTN
>P9WMN1 2.4.2.17~~~hisG~~~ATP phosphoribosyltransferase~~~COG0040
MLRVAVPNKGALSEPATEILAEAGYRRRTDSKDLTVIDPVNNVEFFFLRPKDIAIYVGSGELDFGITGRDLVCDSGAQVR
ERLALGFGSSSFRYAAPAGRNWTTADLAGMRIATAYPNLVRKDLATKGIEATVIRLDGAVEISVQLGVADAIADVVGSGR
TLSQHDLVAFGEPLCDSEAVLIERAGTDGQDQTEARDQLVARVQGVVFGQQYLMLDYDCPRSALKKATAITPGLESPTIA
PLADPDWVAIRALVPRRDVNGIMDELAAIGAKAILASDIRFCRF
>Q4FQF7 2.4.2.17~~~hisG~~~ATP phosphoribosyltransferase~~~COG0040
MTEVTNSLPTSGLLNEANDEFLGLTLALSKGRILEETMPLLRAAGVELLEDPEASRKLIFPTSNPNVRVLILRASDVPTY
VEHGAADFGVAGKDVLLEHGANHVYELLDLKIAQCKLMTAGVKDAPLPNRRLRIATKYVNVARAYFASQGQQVDVIKLYG
SMELAPLVGLGDLIVDVVDTGNTLRANGLEARDHICDVSSRLIVNQVSYKRKFALLEPILDSFKNSINSTS
>P00499 2.4.2.17~~~hisG~~~ATP phosphoribosyltransferase~~~
MLDNTRLRIAIQKSGRLSDDSRELLARCGIKINLHTQRLIAMAENMPIDILRVRDDDIPGLVMDGVVDLGIIGENVLEEE
LLNRRAQGEDPRYLTLRRLDFGGCRLSLATPVDEAWDGPAALDGKRIATSYPHLLKRYLDQKGVSFKSCLLNGSVEVAPR
AGLADAICDLVSTGATLEANGLREVEVIYRSKACLIQRDGEMAQSKQELIDKLLTRIQGVIQARESKYIMMHAPSERLEE
VIALLPGAERPTILPLAGEQQRVAMHMVSSETLFWETMEKLKALGASSILVLPIEKMME
>Q9X0D2 2.4.2.17~~~hisG~~~ATP phosphoribosyltransferase~~~COG0040
MLKLAIPKGRLEEKVMTYLKKTGVIFERESSILREGKDIVCFMVRPFDVPTYLVHGVADIGFCGTDVLLEKETSLIQPFF
IPTNISRMVLAGPKGRGIPEGEKRIATKFPNVTQRYCESKGWHCRIIPLKGSVELAPIAGLSDLIVDITETGRTLKENNL
EILDEIFVIRTHVVVNPVSYRTKREEVVSFLEKLQEVIEHDSNEQSRG
>P62381 2.4.2.17~~~hisG~~~ATP phosphoribosyltransferase~~~COG0040
MRRFALTVALPKGRMFREAYEVLKRAGLDLPEVEGERTLLHGKEGGVALLELRNKDVPIYVDLGIAEIGVVGKDVLLDSG
RDLFEPVDLGFGACRLSLIRRPGDTGPIRRVATKYPNFTARLLKERGWAADVVELSGNIELAAVTGLADAVVDVVQTGAT
LRAAGLVEVEVLAHSTARLVVNRQALKLKRAVLKPLIQRLRELSGS
>Q9KSX4 2.4.2.17~~~hisG~~~ATP phosphoribosyltransferase~~~COG0040
MQTQRLRIAIQKKGRLSQECQELLKKCGVKFNIMGERLVVHSLNMPIDLLLVRDDDIPGLIMDGVVDLGFVGENVLEETR
LDRLALNQRNEFTTLRRMDFGGCRLSIAIEKDAEYRGPQDLNGKRIATTYPQLLKAYMDRQGVDFSTCMLTGSVEVAPRA
GLADAIADLVSTGATLEANGLKEVEVIFESKATLIQRPGAFAADKAALIDKLLTRMHGVQQAKESKYIMLHAPVEKLAQI
KTLLPGAEDPTVLPLSADKSKVAVHMVSSENLFWETMEQLKALGASSILVLPIEKMME
>Q81G00 3.6.1.31~~~hisE~~~Phosphoribosyl-ATP pyrophosphatase~~~
MENAFKLLYKTIEERKGSPLPESYTNYLFSKGEDKILKKIGEECAEVIIACKNNDKEEVVKEMVDVFYHCFVLLAEKNIA
LEDVMREVKERNGKLSRVGDRREIDTL
>Q7P0E6 3.6.1.31~~~hisE~~~Phosphoribosyl-ATP pyrophosphatase~~~COG0140
MTPDVLKNIADTLEARREAAPQSSYVASLFHKGEDAILKKVAEEAAETLMASKDKDKLHLVREVADLWFHTMVLLTYHGL
RPEDVVMELHRREGISGLDEKASRKPTA
>P9WMM9 3.6.1.31~~~hisE~~~Phosphoribosyl-ATP pyrophosphatase~~~COG0140
MQQSLAVKTFEDLFAELGDRARTRPADSTTVAALDGGVHALGKKLLEEAGEVWLAAEHESNDALAEEISQLLYWTQVLMI
SRGLSLDDVYRKL
>P37793 ~~~hisI~~~Histidine biosynthesis bifunctional protein HisIE~~~
MLTEQQRRELDWEKTDGLMPVIVQHAVSGEVLMLGYMNPEALDKTIESGKVTFFSRTKQRLWIKGETSGNFLNVVSIAPD
CDNDTLLVLANPIGPTCHKGTSSCFGNTAHQWLFLYQLEQLLAERKYADPETSYTAKLYASGTKRIAQKVGEEGVETALA
ATVHDRFELTNEASDLMYHLLVLLQDQDLDLTTVIENLHKRHQ
>Q9EWK0 3.6.1.31~~~hisE~~~Phosphoribosyl-ATP pyrophosphatase~~~COG0140
MSKKTFEELFTELQHKAANGDPATSRTAELVDKGVHAIGKKVVEEAAEVWMAAEYEGKDAAAEEISQLLYHVQVMMVARG
ISLDDVYAHL
>P9WMM7 3.5.4.19~~~hisI~~~Phosphoribosyl-AMP cyclohydrolase~~~COG0139
MTLDPKIAARLKRNADGLVTAVVQERGSGDVLMVAWMNDEALARTLQTREATYYSRSRAEQWVKGATSGHTQHVHSVRLD
CDGDAVLLTVDQVGGACHTGDHSCFDAAVLLEPDD
>Q9PM74 5.3.1.16~~~hisA~~~1-(5-phosphoribosyl)-5-[(5-phosphoribosylamino)methylideneamino] imidazole-4-carboxamide isomerase~~~COG0106
MTQIIPALDLIDGEVVRLVKGDYEQKKVYKYNPLKKFKEYEKAGAKELHLVDLTGAKDPSKRQFALIEKLAKEVSVNLQV
GGGIRSKEEVKALLDCGVKRVVIGSMAIKDATLCLEILKEFGSEAIVLALDTILKEDYVVAVNAWQEASDKKLMEVLDFY
SNKGLKHILCTDISKDGTMQGVNVRLYKLIHEIFPNICIQASGGVASLKDLENLKGICSGVIVGKALLDGVFSVEEGIRC
LAN
>Q8FNZ7 5.3.1.16~~~hisA~~~1-(5-phosphoribosyl)-5-[(5-phosphoribosylamino)methylideneamino] imidazole-4-carboxamide isomerase~~~COG0106
MTFTILPAVDVVNGQAVRLDQGEAGTEKSYGTPLESALRWQEQGAEWLHFVDLDAAFNRGSNHELMAEITRQLDIKVELT
GGIRDDASLERALATGATRVNIGTAALEKPEWIADVIRRHGEKIAVDIAVRLENGEWRTKGNGWVSDGGDLWEVLERLDS
QGCSRFVVTDVSKDGTLTGPNVDLLRDVAAATDAPIVASGGISTLEDVLGLAKYQDEGIDSVIIGKALYEHRFTLAEALE
AVEKLG
>P9WMM5 5.3.1.16~~~priA~~~Phosphoribosyl isomerase A~~~COG0106
MPLILLPAVDVVEGRAVRLVQGKAGSQTEYGSAVDAALGWQRDGAEWIHLVDLDAAFGRGSNHELLAEVVGKLDVQVELS
GGIRDDESLAAALATGCARVNVGTAALENPQWCARVIGEHGDQVAVGLDVQIIDGEHRLRGRGWETDGGDLWDVLERLDS
EGCSRFVVTDITKDGTLGGPNLDLLAGVADRTDAPVIASGGVSSLDDLRAIATLTHRGVEGAIVGKALYARRFTLPQALA
AVRD
>A1R562 5.3.1.16~~~hisA~~~1-(5-phosphoribosyl)-5-[(5-phosphoribosylamino)methylideneamino] imidazole-4-carboxamide isomerase~~~COG0106
MTTSAQSVLELLPAVDIVDGQAVRLLQGEAGSETSYGTPLEAALNWQNDGAEWVHMVDLDAAFGRGNNAALISDVVSQLN
VKVELSGGLRDDESLERALELGVARVNLGTAALENPEWTRKAIDRFGDKIAVGLDVRGTTLAGRGWTKEGGDLWEVLARL
EDAGCARYVVTDVTKDGTLQGPNVELLRQMVEKTGKPVVASGGISSLEDLRVLRELVPLGVEGAIVGKALYAGAFTLPEA
LDVAGRR
>P10372 5.3.1.16~~~hisA~~~1-(5-phosphoribosyl)-5-[(5-phosphoribosylamino)methylideneamino] imidazole-4-carboxamide isomerase~~~
MIIPALDLIDGTVVRLHQGDYARQRDYGNDPLPRLQDYAAQGAGVLHLVDLTGAKDPAKRQIPLIKTLVAGVNVPVQVGG
GVRTEEDVAALLKAGVARVVIGSTAVKSPDVVKGWFERFGAQALVLALDVRIDEHGTKQVAVSGWQENSGVSLEQLVETY
LPVGLKHVLCTDISRDGTLAGSNVSLYEEVCARYPQIAFQSSGGIGDIDDIAALRGTGVRGVIVGRALLEGKFTVKEAIQ
CWQNV
>P16250 5.3.1.16~~~priA~~~Phosphoribosyl isomerase A~~~COG0106
MSKLELLPAVDVRDGQAVRLVHGESGTETSYGSPLEAALAWQRSGAEWLHLVDLDAAFGTGDNRALIAEVAQAMDIKVEL
SGGIRDDDTLAAALATGCTRVNLGTAALETPEWVAKVIAEHGDKIAVGLDVRGTTLRGRGWTRDGGDLYETLDRLNKEGC
ARYVVTDIAKDGTLQGPNLELLKNVCAATDRPVVASGGVSSLDDLRAIAGLVPAGVEGAIVGKALYAKAFTLEEALEATS
>Q9X0C7 5.3.1.16~~~hisA~~~1-(5-phosphoribosyl)-5-[(5-phosphoribosylamino)methylideneamino] imidazole-4-carboxamide isomerase~~~COG0106
MLVVPAIDLFRGKVARMIKGRKENTIFYEKDPVELVEKLIEEGFTLIHVVDLSNAIENSGENLPVLEKLSEFAEHIQIGG
GIRSLDYAEKLRKLGYRRQIVSSKVLEDPSFLKSLREIDVEPVFSLDTRGGRVAFKGWLAEEEIDPVSLLKRLKEYGLEE
IVHTEIEKDGTLQEHDFSLTKKIAIEAEVKVLAAGGISSENSLKTAQKVHTETNGLLKGVIVGRAFLEGILTVEVMKRYA
R
>P61779 4.3.2.10~~~hisH~~~Imidazole glycerol phosphate synthase subunit HisH~~~COG0118
MLAILDYKAGNQTSVRRALDHLGIPCVITADPEVIQGAAGVIFPGVGAAGQAMNELVTTGLDEVLRRQVQAGRPLLGICV
GCQIMLDYSQENDTKALGIIPGECRLFNPAWTDEDGAPIRVPHMGWNHIVQRRPCELLKGIEPEAEFYFVHSYYPAPPEE
YVIATCTYGAEFCAIHGGPGLWAVQFHPEKSGRPGLRLLANFHRYCTEAADAQ
>P60595 4.3.2.10~~~hisH~~~Imidazole glycerol phosphate synthase subunit HisH~~~COG0118
MNVVILDTGCANLNSVKSAIARHGYEPKVSRDPDVVLLADKLFLPGVGTAQAAMDQVRERELFDLIKACTQPVLGICLGM
QLLGRRSEESNGVDLLGIIDEDVPKMTDFGLPLPHMGWNRVYPQAGNRLFQGIEDGAYFYFVHSYAMPVNPWTIAQCNYG
EPFTAAVQKDNFYGVQFHPERSGAAGAKLLKNFLEM
>P59957 4.3.2.10~~~hisH~~~Imidazole glycerol phosphate synthase subunit HisH~~~
MTAKSVVVLDYGSGNLRSAQRALQRVGAEVEVTADTDAAMTADGLVVPGVGAFAACMAGLRKISGERIIAERVAAGRPVL
GVCVGMQILFACGVEFGVQTPGCGHWPGAVIRLEAPVIPHMGWNVVDSAAGSALFKGLDVDARFYFVHSYAAQRWEGSPD
ALLTWATYRAPFLAAVEDGALAATQFHPEKSGDAGAAVLSNWVDGL
>P9WMM1 4.3.2.10~~~hisH~~~Imidazole glycerol phosphate synthase subunit HisH~~~COG0118
MTAKSVVVLDYGSGNLRSAQRALQRVGAEVEVTADTDAAMTADGLVVPGVGAFAACMAGLRKISGERIIAERVAAGRPVL
GVCVGMQILFACGVEFGVQTPGCGHWPGAVIRLEAPVIPHMGWNVVDSAAGSALFKGLDVDARFYFVHSYAAQRWEGSPD
ALLTWATYRAPFLAAVEDGALAATQFHPEKSGDAGAAVLSSWVDGL
>Q9X0C8 4.3.2.10~~~hisH~~~Imidazole glycerol phosphate synthase subunit HisH~~~COG0118
MRIGIISVGPGNIMNLYRGVKRASENFEDVSIELVESPRNDLYDLLFIPGVGHFGEGMRRLRENDLIDFVRKHVEDERYV
VGVCLGMQLLFEESEEAPGVKGLSLIEGNVVKLRSRRLPHMGWNEVIFKDTFPNGYYYFVHTYRAVCEEEHVLGTTEYDG
EIFPSAVRKGRILGFQFHPEKSSKIGRKLLEKVIECSLSRR
>Q7SIC0 4.3.2.10~~~hisH~~~Imidazole glycerol phosphate synthase subunit HisH~~~COG0118
MRMKALLIDYGSGNLRSAAKALEAAGFSVAVAQDPKAHEEADLLVLPGQGHFGQVMRAFQESGFVERVRRHLERGLPFLG
ICVGMQVLYEGSEEAPGVRGLGLVPGEVRRFRAGRVPQMGWNALEFGGAFAPLTGRHFYFANSYYGPLTPYSLGKGEYEG
TPFTALLAKENLLAPQFHPEKSGKAGLAFLALARRYFEVL
>Q9KSX0 4.3.2.10~~~hisH~~~Imidazole glycerol phosphate synthase subunit HisH~~~COG0118
MTQNVVIIDTGCANISSVKFAIERLGYAVTISRDPQVVLAADKLFLPGVGTASEAMKNLTERDLIELVKRVEKPLLGICL
GMQLLGKLSEEKGQKADEIVQCLGLVDGEVRLLQTGDLPLPHMGWNTVQVKEGHPLFNGIEPDAYFYFVHSFAMPVGDYT
IAQCEYGQPFSAAIQAGNYYGVQFHPERSSKAGARLIQNFLEL
>P62450 4.3.2.10~~~hisF~~~Imidazole glycerol phosphate synthase subunit HisF~~~COG0107
MLSKRIIPCLDVRAGRLTKGVKFEGNVDIGDPVATARRYYEEGADEIVFYDITASHEDRGIFLDVVERVASEIFIPFSVG
GGINTVDDMRAVLMAGAEKVSVNSGAVKTPDIISQGAAAFGSQAIVVGMDVKQVEKSATIPSGYEIVIHGGRKYMGMDAI
EWAKTCESLGAGELCVNSIDADGTKDGYELTLTRMISDAVTIPVIASGGAGSPEHMYDALTRGGASAALIASIVHYGTYT
IPDLKRRISGMGAKMRMVW
>P60664 4.3.2.10~~~hisF~~~Imidazole glycerol phosphate synthase subunit HisF~~~COG0107
MLAKRIIPCLDVRDGQVVKGVQFRNHEIIGDIVPLAKRYAEEGADELVFYDITASSDGRVVDKSWVSRVAEVIDIPFCVA
GGIKSLEDAAKILSFGADKISINSPALADPTLITRLADRFGVQCIVVGIDTWYDAETGKYHVNQYTGDESRTRVTQWETL
DWVQEVQKRGAGEIVLNMMNQDGVRNGYDLEQLKKVREVCHVPLIASGGAGTMEHFLEAFRDADVDGALAASVFHKQIIN
IGELKAYLATQGVEIRIC
>P9WMM3 4.3.2.10~~~hisF~~~Imidazole glycerol phosphate synthase subunit HisF~~~COG0107
MYADRDLPGAGGLAVRVIPCLDVDDGRVVKGVNFENLRDAGDPVELAAVYDAEGADELTFLDVTASSSGRATMLEVVRRT
AEQVFIPLTVGGGVRTVADVDSLLRAGADKVAVNTAAIACPDLLADMARQFGSQCIVLSVDARTVPVGSAPTPSGWEVTT
HGGRRGTGMDAVQWAARGADLGVGEILLNSMDADGTKAGFDLALLRAVRAAVTVPVIASGGAGAVEHFAPAVAAGADAVL
AASVFHFRELTIGQVKAALAAEGITVR
>Q9X0C6 4.3.2.10~~~hisF~~~Imidazole glycerol phosphate synthase subunit HisF~~~COG0107
MLAKRIIACLDVKDGRVVKGTNFENLRDSGDPVELGKFYSEIGIDELVFLDITASVEKRKTMLELVEKVAEQIDIPFTVG
GGIHDFETASELILRGADKVSINTAAVENPSLITQIAQTFGSQAVVVAIDAKRVDGEFMVFTYSGKKNTGILLRDWVVEV
EKRGAGEILLTSIDRDGTKSGYDTEMIRFVRPLTTLPIIASGGAGKMEHFLEAFLAGADAALAASVFHFREIDVRELKEY
LKKHGVNVRLEGL
>Q7SIB9 4.3.2.10~~~hisF~~~Imidazole glycerol phosphate synthase subunit HisF~~~COG0107
MSLAKRIVPCLDVHAGRVVKGVNFVNLRDAGDPVEAARAYDEAGADELVFLDISATHEERAILLDVVARVAERVFIPLTV
GGGVRSLEDARKLLLSGADKVSVNSAAVRRPELIRELADHFGAQAVVLAIDARWRGDFPEVHVAGGRVPTGLHAVEWAVK
GVELGAGEILLTSMDRDGTKEGYDLRLTRMVAEAVGVPVIASGGAGRMEHFLEAFQAGAEAALAASVFHFGEIPIPKLKR
YLAEKGVHVRLD
>Q8ABA7 ~~~hisB~~~Histidine biosynthesis bifunctional protein HisB~~~COG0131
MKKKILFIDRDGTLVIEPPIDYQLDSLEKLEFYPRVFRNLGFIRSKLDFEFVMVTNQDGLGTSSFPEDTFWPAHNLMLKT
LAGEGITFDDILIDRSFPEDNAPTRKPRTGMLTKYIDNPEYDLAESFVIGDRPTDVELAKNLGCRAIYLQEATDDLKEKG
LEEVCALATTDWDQVAEFLFAGERKAEVRRTTKETDIYVSLNLDGNGGCDISTGLGFFDHMLEQIGKHSGMDLTIRVKGD
LEVDEHHTIEDTAIALGECIYQALGSKRGIERYGYALPMDDCLCQVCLDFGGRPWLVWDAEFNREKIGEMPTEMFLHFFK
SLSDAAKMNLNIKAEGQNEHHKIEGIFKALARALKMALKRDIYHFELPSSKGVL
>Q9S5G5 ~~~hisB~~~Histidine biosynthesis bifunctional protein HisB~~~COG0131
MSQKYLFIDRDGTLISEPPSDFQVDRFDKLAFEPGVIPQLLKLQKAGYKLVMITNQDGLGTQSFPQADFDGPHNLMMQIF
TSQGVQFDEVLICPHLPADECDCRKPKVKLVERYLAEQAMDRANSYVIGDRATDIQLAENMGINGLRYDRETLNWPMIGE
QLTRRDRYAHVVRNTKETQIDVQVWLDREGGSKINTGVGFFDHMLDQIATHGGFRMEINVKGDLYIDDHHTVEDTGLALG
EALKIALGDKRGICRFGFVLPMDECLARCALDISGRPHLEYKAEFTYQRVGDLSTEMIEHFFRSLSYTMGVTLHLKTKGK
NDHHRVESLFKAFGRTLRQAIRVEGDTLPSSKGVL
>P06987 ~~~hisB~~~Histidine biosynthesis bifunctional protein HisB~~~COG0131
MSQKYLFIDRDGTLISEPPSDFQVDRFDKLAFEPGVIPELLKLQKAGYKLVMITNQDGLGTQSFPQADFDGPHNLMMQIF
TSQGVQFDEVLICPHLPADECDCRKPKVKLVERYLAEQAMDRANSYVIGDRATDIQLAENMGITGLRYDRETLNWPMIGE
QLTRRDRYAHVVRNTKETQIDVQVWLDREGGSKINTGVGFFDHMLDQIATHGGFRMEINVKGDLYIDDHHTVEDTGLALG
EALKIALGDKRGICRFGFVLPMDECLARCALDISGRPHLEYKAEFTYQRVGDLSTEMIEHFFRSLSYTMGVTLHLKTKGK
NDHHRVESLFKAFGRTLRQAIRVEGDTLPSSKGVL
>A0QX83 4.2.1.19~~~hisB~~~Imidazoleglycerol-phosphate dehydratase~~~COG0131
MSALANRRARVERKTKESEIVVDLDLDGTGVVDIDTGVPFFDHMLTSLGSHASFDLTVHAKGDIEIEGHHTVEDTAIVLG
QALGQALGDKKGIRRFGDAFIPMDESLAHAAVDVSGRPYFVHTGEPESMVSFTIAGTGAPYHTVINRHVFESLAFNARIA
LHVRTLYGRDPHHITEAQYKAVARALRQAVEYDARVTGVPSTKGTL
>P9WML9 4.2.1.19~~~hisB~~~Imidazoleglycerol-phosphate dehydratase~~~COG0131
MTTTQTAKASRRARIERRTRESDIVIELDLDGTGQVAVDTGVPFYDHMLTALGSHASFDLTVRATGDVEIEAHHTIEDTA
IALGTALGQALGDKRGIRRFGDAFIPMDETLAHAAVDLSGRPYCVHTGEPDHLQHTTIAGSSVPYHTVINRHVFESLAAN
ARIALHVRVLYGRDPHHITEAQYKAVARALRQAVEPDPRVSGVPSTKGAL
>D2QPE6 ~~~hisB~~~Histidine biosynthesis bifunctional protein HisB~~~COG0131
MQKIVFIDRDGTLIAEPQPDQQVDSLAKLDFIPKAISAMRKIAEDTTYELVMVTNQDGLGTGSFPEDTFWPAHNKMMSTF
AGENVNFAAVHIDRHFPHDNSSTRKPGVGMLTQYFEASYDLTNSFVIGDRLTDVQLAVNLGAKAILFMPPNGLAAVQSAD
VSGLTEAMKQAIVLQTGDWDEIYEFLRLPARTALVERNTKETQIRVELNLDGRGRADMHTGLGFFDHMLDQVAKHSGADL
AIHVNGDLHIDEHHTIEDTALALGEAYRRALGDKRGISRYGFLLPMDEALAQVGIDFSGRPWLVWDAEFKREKIGDMPTE
MFYHFFKSFSDTALCNLNIKVEGDNEHHKIEAIFKAFAKAIKMAVRRDINELDNLPSTKGVL
>P64373 4.2.1.19~~~hisB~~~Imidazoleglycerol-phosphate dehydratase~~~
MIYQKQRNTAETQLNISISDDQSPSHINTGVGFLNHMLTLFTFHSGLSLNIEAQGDIDVDDHHVTEDIGIVIGQLLLEMI
KDKKHFVRYGTMYIPMDETLARVVVDISGRPYLSFNASLSKEKVGTFDTELVEEFFRAVVINARLTTHIDLIRGGNTHHE
IEAIFKAFSRALGIALTATDDQRVPSSKGVIE
>Q46WL3 2.6.1.9~~~hisC2~~~Histidinol-phosphate aminotransferase 2~~~COG0079
MSVVDPSLIERIIRDDVRAMGAYHVPDSHGLVKLDAMENPYRLPPALRSELAARLGEVALNRYPVPSSEALRAKLKEVMQ
VPAGMEVLLGNGSDEIISMLALAAARPGAKVMAPVPGFVMYAMSAQFAGLEFVGVPLRADFTLDRGAMLAAMAEHQPAIV
YLAYPNNPTGNLFDAADMEAIVRAAQGSVCRSLVVVDEAYQPFAQESWMSRLTDFGNLLVMRTVSKLGLAGIRLGYVAGD
PQWLEQLDKVRPPYNVNVLTEATALFALEHVAVLDEQAAQLRAERSRVAEGMAAHGGVTVFPSAANFLLARVPDAAQTFD
RLLARKVLIKNVSKMHPLLANCLRVTVSTPEENAQFLEAFAASLQD
>Q8R5Q4 2.6.1.9~~~hisC~~~Histidinol-phosphate aminotransferase~~~COG0079
MIENLLREEIKGFKNYEVENVPYKYKMDANETPFELPEEVMKNIGDIVKSIHVNIYPDPTAEKLREELARYCSVTPKNIF
VGNGSDEIIHLIMLAFVDKGDTVLYPHPSFAMYSIYSKIAGANEIAVNLNEDYTYNVERFAEAVERYKPKLVFLCNPNNP
TGSVIDEEDIIRIIEKARGIVIVDEAYFEFYGKTLVPYIDRFENLIVLRTLSKAFGIAGLRVGYALSNGEIVKYLNLVKS
PYNLNSLSQRIALEVLKSGVLKERVNYIINEREKLVKELNKINGIKVYPSHANFVLCKFENANDVHKRLVERGILVRNFS
NVKGLEGTLRITVSSSDANDYLINALREILS
>Q9PII2 2.6.1.9~~~hisC~~~Histidinol-phosphate aminotransferase~~~COG0079
MKFNEFLNNLSNYEPGKDIEVIAKEYGVKEVIKLASNENPFGTPPKAIECLRQNANKAHLYPDDSMIELKSTLAQKYKVQ
NENIIIGAGSDQVIEFAIHSKLNSKNAFLQAGVTFAMYEIYAKQCGAKCYKTQSITHNLDEFKKLYETHKDEIKLIFLCL
PNNPLGECLDASEATEFIKGVNEDCLVVIDAAYNEFASFKDSKKHLEPCELIKEFDNVLYLGTFSKLYGLGGLRIGYGIA
NANIISAFYKLRAPFNVSNLALKAAVAAMDDDEFTEKTLENNFSQMELYKEFAKKHNIKIIDSYTNFITYFFDEKNSTDL
SEKLLKKGIIIRNLKSYGLNAIRITIGTSYENEKFFTEFDKILR
>Q9KJU4 2.6.1.9~~~hisC~~~Histidinol-phosphate aminotransferase~~~COG0079
MTKITLSDLPLREELRGEHAYGAPQLNVDIRLNTNENPYPPSEALVADLVATVDKIATELNRYPERDAVELRDELAAYIT
KQTGVAVTRDNLWAANGSNEILQQLLQAFGGPGRTALGFQPSYSMHPILAKGTHTEFIAVSRGADFRIDMDVALEEIRAK
QPDIVFVTTPNNPTGDVTSLDDVERIINVAPGIVIVDEAYAEFSPSPSATTLLEKYPTKLVVSRTMSKAFDFAGGRLGYF
VANPAFIDAVMLVRLPYHLSALSQAAAIVALRHSADTLGTVEKLSVERVRVAARLEELGYAVVPSESNFVFFGDFSDQHA
AWQAFLDRGVLIRDVGIAGHLRTTIGVPEENDAFLDAAAEIIKLNL
>Q72DA0 2.6.1.9~~~hisC~~~Histidinol-phosphate aminotransferase~~~COG0079
MTAPSMSRPDDVRPEVLDFKPYVPGLSIDEIRDRFGLADVVKLASNENPLGTSPVVQRTLKTKADLAFRYAQSGNPRLTR
AIAAHHGVAPERVVAGNGSDEIIDLLIRVRATPGKHNIVAFRPCFSIYELQAKFCGLEFRQADLRPDFTFDWDAFLAATD
ENTAIAFVTTPDNPSGWCPPVSELEHVARTLPPSCLFVIDEAYMDFCGDEAAHSLLSRLDAFPNIAVLRTFSKSFGLAGL
RLGYGILPERLADYLHRVRLPFSVNILAEEAGLAALEDTVFRSETLRVTAEGRAYIAEGLTALGCEVMPSWANFIMFRPP
TDATDLFEALLRRGIIIRPLKSYGLPQHLRVSVGNADENRRFIEACKEILPHA
>P06986 2.6.1.9~~~hisC~~~Histidinol-phosphate aminotransferase~~~COG0079
MSTVTITDLARENVRNLTPYQSARRLGGNGDVWLNANEYPTAVEFQLTQQTLNRYPECQPKAVIENYAQYAGVKPEQVLV
SRGADEGIELLIRAFCEPGKDAILYCPPTYGMYSVSAETIGVECRTVPTLDNWQLDLQGISDKLDGVKVVYVCSPNNPTG
QLINPQDFRTLLELTRGKAIVVADEAYIEFCPQASLAGWLAEYPHLAILRTLSKAFALAGLRCGFTLANEEVINLLMKVI
APYPLSTPVADIAAQALSPQGIVAMRERVAQIIAEREYLIAALKEIPCVEQVFDSETNYILARFKASSAVFKSLWDQGII
LRDQNKQPSLSGCLRITVGTREESQRVIDALRAEQV
>Q39YP6 2.6.1.9~~~hisC~~~Histidinol-phosphate aminotransferase~~~COG0079
MIPLRQNIASMKGYIPGYQPPDIASWIKLNTNENPYPPSPEVVKAILEELGPDGAALRIYPSASSQKLREVAGELYGFDP
SWIIMANGSDEVLNNLIRAFAAEGEEIGYVHPSYSYYGTLAEVQGARVRTFGLTGDFRIAGFPERYEGKVFFLTTPNAPL
GPSFPLEYIDELARRCAGMLVLDETYAEFAESNALELVRRHENVVVTRTLSKSYSLAGMRIGLAIARPEVIAALDKIRDH
YNLDRLAQAACVAALRDQAYLSECCRRIRETREWFTTELRSIGYDVIPSQGNYLFATPPDRDGKRVYDGLYARKVLVRHF
SDPLLAHGMRISIGTREEMEQTLAALKEIG
>A6TBC4 2.6.1.9~~~hisC~~~Histidinol-phosphate aminotransferase~~~
MSIEDLARANVRALTPYQSARRLGGKGDVWLNANEFPTAVAFQLTEQTLNRYPEPQPKAVIESYARYAEVKPEQVLVSRG
ADEGIELLIRAFCEPGEDAVLYCPPTYGMYSVSAETIGVECRTVPTLADWQLDLPGIEARLDGVKVVFVCSPNNPTGQII
DPQSMRDLLEMTRGKAIVVADEAYIEFCPQATLAGWLSDYPHLVVLRTLSKAFALAGLRCGFTLANAEVINVLLKVIAPY
PLSTPVADIAAQALSPEGIAAMRQRVAQILDERRYLVEQLRGIACVEQVFDSETNYVLARITASSAVFKSLWDQGIILRD
QNKQPSLSGCLRITIGTRAESQRVIDALTAENV
>Q92A83 2.6.1.9~~~hisC~~~Histidinol-phosphate aminotransferase~~~COG0079
MKWKKSLAGLSSYKPGKREEEVMAELGLTKITKLSSNENPLGTSKKVAAIQANSSVETEIYPDGWASSLRKEVADFYQLE
EEELIFTAGVDELIELLTRVLLDTTTNTVMATPTFVQYRQNALIEGAEVREIPLLQDGEHDLEGMLNAIDEKTTIVWICN
PNNPTGNYIELADIQAFLDRVPSDVLVVLDEAYIEYVTPQPEKHEKLVRTYKNLIITRTFSKIYGLASARVGYGIADKEI
IRQLNIVRPPFNTTSIGQKLAIEAIKDQAFIGECRTSNANGIKQYEAFAKRFEKVKLYPANGNFVLIDLGIEAGTIFSYL
EKNGYITRSGAALGFPTAVRITIGKEEDNSAVIALLEKLL
>P9WML7 2.6.1.9~~~hisC~~~Histidinol-phosphate aminotransferase~~~COG0079
MTRSGHPVTLDDLPLRADLRGKAPYGAPQLAVPVRLNTNENPHPPTRALVDDVVRSVREAAIDLHRYPDRDAVALRADLA
GYLTAQTGIQLGVENIWAANGSNEILQQLLQAFGGPGRSAIGFVPSYSMHPIISDGTHTEWIEASRANDFGLDVDVAVAA
VVDRKPDVVFIASPNNPSGQSVSLPDLCKLLDVAPGIAIVDEAYGEFSSQPSAVSLVEEYPSKLVVTRTMSKAFAFAGGR
LGYLIATPAVIDAMLLVRLPYHLSSVTQAAARAALRHSDDTLSSVAALIAERERVTTSLNDMGFRVIPSDANFVLFGEFA
DAPAAWRRYLEAGILIRDVGIPGYLRATTGLAEENDAFLRASARIATDLVPVTRSPVGAP
>P67725 2.6.1.9~~~hisC~~~Histidinol-phosphate aminotransferase~~~
MKEQLNQLSAYQPGLSPRALKEKYGIEGDLYKLASNENLYGPSPKVKEAISAHLDELYYYPETGSPTLKAAISKHLNVDQ
SRILFGAGLDEVILMISRAVLTPGDTIVTSEATFGQYYHNAIVESANVIQVPLKDGGFDLEGILKEVNEDTSLVWLCNPN
NPTGTYFNHESLDSFLSQVPPHVPVIIDEAYFEFVTAEDYPDTLALQQKYDNAFLLRTFSKAYGLAGLRVGYVVASEHAI
EKWNIIRPPFNVTRISEYAAVAALEDQQYLKEVTHKNSVERERFYQLPQSEYFLPSQTNFIFVKTKRVNELYEALLNVGC
ITRPFPTGVRITIGFKEQNDKMLEVLSNFKYE
>Q9X0D0 2.6.1.9~~~hisC~~~Histidinol-phosphate aminotransferase~~~COG0079
MNPLDLIAKRAYPYETEKRDKTYLALNENPFPFPEDLVDEVFRRLNSDALRIYYDSPDEELIEKILSYLDTDFLSKNNVS
VGNGADEIIYVMMLMFDRSVFFPPTYSCYRIFAKAVGAKFLEVPLTKDLRIPEVNVGEGDVVFIPNPNNPTGHVFEREEI
ERILKTGAFVALDEAYYEFHGESYVDFLKKYENLAVIRTFSKAFSLAAQRVGYVVASEKFIDAYNRVRLPFNVSYVSQMF
AKVALDHREIFEERTKFIVEERERMKSALREMGYRITDSRGNFVFVFMEKEEKERLLEHLRTKNVAVRSFREGVRITIGK
REENDMILRELEVFK
>P34037 2.6.1.9~~~hisC~~~Histidinol-phosphate aminotransferase~~~COG0079
MTAAPELRPKSWIDSIAPYIPGSSKTLDGRPAVKLSSNENPLGTSLKAKEAYREAIDSLSLYPDSGATALREAIGACYNL
DPARIIHGTGSDEILHLAAGAYAGQDDEVLYPRYSFSVYPLAARRVGATPVEAPDDDYRCSVDALLKAVTPRTRVVFIAN
PNNPTGTWITRAEVEKLHNGLPRNCLLVIDQAYAEYLDPECDDGALALAKNTKNVLVTRTFSKIYGLAAERIGWAYACPE
IIDALNRIRAPFNVTIAGQKAAVAALEDQAFIQNSFKHNKKWRGWFENQMALLSNAGIRVIPSSANFTLLLFEGSLTAET
AYKALMDHGYTTRWLPGQRLPHALRITIGSEKHMQDVAGILTSLVRQAL
>O34411 3.1.3.15~~~hisK~~~Histidinol-phosphatase~~~COG1387
MQKRDGHIHTPFCPHGSNDTLRQYAEEALKKGFESITFTEHAPLPPSFTDPTPLKDSAMAQASLERYIHEISGLKKEYRG
QLSIRTGLEVDYIAEFEDEITLFLDTYGPYLDDSILSVHFLRTDSSYLCLDYDEHTFKELISACGSIEAVYEQYYRSIYS
SIVASLGVYKPKRVGHITLVQKFIKLFPYSMSEHIRGLVSLCLNAIEENGMELDFNTSGLRKTYAGGIYIEDWMLNEAKQ
KKIPLVFGSDAHQAGDVGYAYEAFLERC
>Q02150 3.1.3.15~~~hisK~~~Histidinol-phosphatase~~~COG1387
MKKLDYHFHSHFSADSEELPRKHVTEAIAHGLEEICFTEHRDFYFPGMDFSLNLPEYFQEINRLQAEFKDKIKIKIGLEM
GIDLRFKSEINQFIDSAPFDFVIASVHEIGDIEVYDGTEFYLQKIKEEAQREYLLACLDVVQNFENYNSFGHLDYVARYG
PYTDKSIKFAENREILFEILRALASKEKALEINTRLFDDPKTEQFYSDLLINFKKLGGKFITLGTDSHIAKRDWLSIHKA
RTLIKKAGFHELATFSGMKIDKNKKSIKE
>P0DV34 3.1.3.15~~~~~~Histidinol-phosphatase~~~
MKNLAIFDLDNTLINTDSDHAWPQYLIKKGLVDAAETEAQNEKFYRDYQNGCLDIDAFLKFHLAPLARYSKEELAEFHRE
FMAEYIIPHISPMQRMLVQSHQMAGDETLVISSTNEFIITPVCHLFGITNIIGTQLETGSDGRYTGNYIGTPSLKEGKIT
RLNQWLAERGETLQSYGKTYFYSDSKNDLPLLRLVSEPVAVNPDAELEKEAKEKGWPVLNFK
>Q9I6F6 3.1.3.15~~~~~~Histidinol-phosphatase~~~
MRLALFDLDNTLLAGDSDHSWGEWLCQRGLVDAAEYQARNDAFYADYVAGKLDVLAYQAFTQAILGRTEMAQLETWHRQF
MQEVIEPIVLAKGEALLAEHRAAGDRLVIITATNRFVTGPIAERLGVETLIATECEMRDGRYTGQTFDVPCFQGGKVVRL
QRWLDENGLDLEGASFYSDSLNDLPLLEKVSRPVAVDPDPRLRAEAEKRGWPIISLR
>Q46125 ~~~hisJ~~~Probable histidine-binding protein~~~
MKKFLTAFLVAFTGLFLVACQNTKTENNASNEANTTLTLKVGTAPNYKPFNFKQDSKLTGFDTDLIEEIAKKNGIEIVWV
ETNFDGLIPALKSGKIDMIASAMSATDERRQSVDFTKPYYMSKNLYLKLKNNDSLQTKNDLEGKKIGVQLGTLQENTAKA
IKNAQVQSNKDLNIAVLALKNNKIDAIVADQDTAKGFLAENPELVSFYQETDGGEGFSFAFDKNKQKDIIEIFNKGIDEA
KTDGFYDTLIKKYELE
>P0AEU0 ~~~hisJ~~~Histidine-binding periplasmic protein~~~COG0834
MKKLVLSLSLVLAFSSATAAFAAIPQNIRIGTDPTYAPFESKNSQGELVGFDIDLAKELCKRINTQCTFVENPLDALIPS
LKAKKIDAIMSSLSITEKRQQEIAFTDKLYAADSRLVVAKNSDIQPTVESLKGKRVGVLQGTTQETFGNEHWAPKGIEIV
SYQGQDNIYSDLTAGRIDAAFQDEVAASEGFLKQPVGKDYKFGGPSVKDEKLFGVGTGMGLRKEDNELREALNKAFAEMR
ADGTYEKLAKKYFDFDVYGG
>P02910 ~~~hisJ~~~Histidine-binding periplasmic protein~~~
MKKLALSLSLVLAFSSATAAFAAIPQKIRIGTDPTYAPFESKNAQGELVGFDIDLAKELCKRINTQCTFVENPLDALIPS
LKAKKIDAIMSSLSITEKRQQEIAFTDKLYAADSRLVVAKNSDIQPTVASLKGKRVGVLQGTTQETFGNEHWAPKGIEIV
SYQGQDNIYSDLTAGRIDAAFQDEVAASEGFLKQPVGKDYKFGGPAVKDEKLFGVGTGMGLRKEDNELREALNKAFAEMR
ADGTYEKLAKKYFDFDVYGG
>P0AEU3 ~~~hisM~~~Histidine transport system permease protein HisM~~~COG4160
MIEILHEYWKPLLWTDGYRFTGVAITLWLLILSVVIGGVLALFLAIGRVSSNKYIQFPIWLFTYIFRGTPLYVQLLVFYS
GMYTLEIVKGTEFLNAFFRSGLNCTVLALTLNTCAYTTEIFAGAIRSVPHGEIEAARAYGFSTFKMYRCIILPSALRIAL
PAYSNEVILMLHSTALAFTATVPDLLKIARDINAATYQPFTAFGIAAVLYLIISYVLISLFRRAEKRWLQHVKPSSTH
>P0A2I7 ~~~hisM~~~Histidine transport system permease protein HisM~~~
MIEIIQEYWKSLLWTDGYRFTGVAITLWLLISSVVMGGLLAVILAVGRVSSNKFIRFPIWLFTYIFRGTPLYVQLLVFYS
GMYTLEIVKGTDLLNAFFRSGLNCTVLALTLNTCAYTTEIFAGAIRSVPHGEIEAARAYGFSSFKMYRCIILPSALRIAL
PAYSNEVILMLHSTALAFTATVPDLLKIARDINSATYQPFTAFGIAAVLYLLISYVLISLFRRAERRWLQHVSSK
>Q8NS80 3.1.3.15~~~hisN~~~Histidinol-phosphatase~~~COG0483
MSKYADDLALALELAELADSITLDRFEASDLEVSSKPDMTPVSDADLATEEALREKIATARPADSILGEEFGGDVEFSGR
QWIIDPIDGTKNYVRGVPVWATLIALLDNGKPVAGVISAPALARRWWASEGAGAWRTFNGSSPRKLSVSQVSKLDDASLS
FSSLSGWAERDLRDQFVSLTDTTWRLRGYGDFFSYCLVAEGAVDIAAEPEVSLWDLAPLSILVTEAGGKFTSLAGVDGPH
GGDAVATNGILHDETLDRLK
>P95189 3.1.3.15~~~hisN~~~Histidinol-phosphatase~~~COG0483
MSHDDLMLALALADRADELTRVRFGALDLRIDTKPDLTPVTDADRAVESDVRQTLGRDRPGDGVLGEEFGGSTTFTGRQW
IVDPIDGTKNFVRGVPVWASLIALLEDGVPSVGVVSAPALQRRWWAARGRGAFASVDGARPHRLSVSSVAELHSASLSFS
SLSGWARPGLRERFIGLTDTVWRVRAYGDFLSYCLVAEGAVDIAAEPQVSVWDLAALDIVVREAGGRLTSLDGVAGPHGG
SAVATNGLLHDEVLTRLNAG
>Q9K4B1 3.1.3.15~~~hisN~~~Histidinol-phosphatase~~~COG0483
MPDYLDDLRLAHVLADAADAATMDRFKALDLKVETKPDMTPVSEADKAAEELIRGHLSRARPRDSVHGEEFGVAGTGPRR
WVIDPIDGTKNYVRGVPVWATLIALMEAKEGGYQPVVGLVSAPALGRRWWAVEDHGAFTGRSLTSAHRLHVSQVSTLSDA
SFAYSSLSGWEEQGRLDGFLDLTREVWRTRAYGDFWPYMMVAEGSVDLCAEPELSLWDMAANAIIVTEAGGTFTGLDGRP
GPHSGNAAASNGRLHDELLGYLNQRY
>P07109 ~~~hisP~~~Histidine transport ATP-binding protein HisP~~~COG4598
MSENKLNVIDLHKRYGEHEVLKGVSLQANAGDVISIIGSSGSGKSTFLRCINFLEKPSEGSIVVNGQTINLVRDKDGQLK
VADKNQLRLLRTRLTMVFQHFNLWSHMTVLENVMEAPIQVLGLSKQEARERAVKYLAKVGIDERAQGKYPVHLSGGQQQR
VSIARALAMEPEVLLFDEPTSALDPELVGEVLRIMQQLAEEGKTMVVVTHEMGFARHVSTHVIFLHQGKIEEEGAPEQLF
GNPQSPRLQRFLKGSLK
>P02915 ~~~hisP~~~Histidine transport ATP-binding protein HisP~~~
MMSENKLHVIDLHKRYGGHEVLKGVSLQARAGDVISIIGSSGSGKSTFLRCINFLEKPSEGAIIVNGQNINLVRDKDGQL
KVADKNQLRLLRTRLTMVFQHFNLWSHMTVLENVMEAPIQVLGLSKHDARERALKYLAKVGIDERAQGKYPVHLSGGQQQ
RVSIARALAMEPDVLLFDEPTSALDPELVGEVLRIMQQLAEEGKTMVVVTHEMGFARHVSSHVIFLHQGKIEEEGDPEQV
FGNPQSPRLQQFLKGSLK
>P52094 ~~~hisQ~~~Histidine transport system permease protein HisQ~~~COG4215
MLYGFSGVILQGALVTLELAISSVVLAVIIGLIGAGGKLSQNRLSGLIFEGYTTLIRGVPDLVLMLLIFYGLQIALNTVT
EAMGVGQIDIDPMVAGIITLGFIYGAYFTETFRGAFMAVPKGHIEAATAFGFTRGQVFRRIMFPSMMRYALPGIGNNWQV
ILKSTALVSLLGLEDVVKATQLAGKSTWEPFYFAIVCGVIYLVFTTVSNGVLLFLERRYSVGVKRADL
>P0A2I9 ~~~hisQ~~~Histidine transport system permease protein HisQ~~~
MLYGFSGVILQGAIVTLELALSSVVLAVLIGLVGAGAKLSQNRVTGLIFEGYTTLIRGVPDLVLMLLIFYGLQIALNVVT
DSLGIDQIDIDPMVAGIITLGFIYGAYFTETFRGAFMAVPKGHIEAATAFGFTHGQTFRRIMFPAMMRYALPGIGNNWQV
ILKATALVSLLGLEDVVKATQLAGKSTWEPFYFAVVCGLIYLVFTTVSNGVLLLLERRYSVGVKRADL
>Q8G2R2 1.1.1.23~~~hisD~~~Histidinol dehydrogenase~~~
MVTTLRQTDPDFEQKFAAFLSGKREVSEDVDRAVREIVDRVRREGDSALLDYSRRFDRIDLEKTGIAVTEAEIDAAFDAA
PASTVEALKLARDRIEKHHARQLPKDDRYTDALGVELGSRWTAIEAVGLYVPGGTASYPSSVLMNAMPAKVAGVDRIVMV
VPAPDGNLNPLVLVAARLAGVSEIYRVGGAQAIAALAYGTETIRPVAKIVGPGNAYVAAAKRIVFGTVGIDMIAGPSEVL
IVADKDNNPDWIAADLLAQAEHDTAAQSILMTNDEAFAHAVEEAVERQLHTLARTETASASWRDFGAVILVKDFEDAIPL
ANRIAAEHLEIAVADAEAFVPRIRNAGSIFIGGYTPEVIGDYVGGCNHVLPTARSARFSSGLSVLDYMKRTSLLKLGSEQ
LRALGPAAIEIARAEGLDAHAQSVAIRLNL
>P06988 1.1.1.23~~~hisD~~~Histidinol dehydrogenase~~~COG0141
MSFNTIIDWNSCTAEQQRQLLMRPAISASESITRTVNDILDNVKARGDEALREYSAKFDKTTVTALKVSAEEIAAASERL
SDELKQAMAVAVKNIETFHTAQKLPPVDVETQPGVRCQQVTRPVASVGLYIPGGSAPLFSTVLMLATPASIAGCKKVVLC
SPPPIADEILYAAQLCGVQDVFNVGGAQAIAALAFGTESVPKVDKIFGPGNAFVTEAKRQVSQRLDGAAIDMPAGPSEVL
VIADSGATPDFVASDLLSQAEHGPDSQVILLTPAADMARRVAEAVERQLAELPRAETARQALNASRLIVTKDLAQCVEIS
NQYGPEHLIIQTRNARELVDSITSAGSVFLGDWSPESAGDYASGTNHVLPTYGYTATCSSLGLADFQKRMTVQELSKEGF
SALASTIETLAAAERLTAHKNAVTLRVNALKEQA
>Q606Q2 1.1.1.23~~~hisD~~~Histidinol dehydrogenase~~~COG0141
MTEVKIKRLYTGDADFASQLDRLLAWSESEDTDIHQRVTEIIGCIRRDGDAALVELTARFDHFVVDTAAALELPRDVLEA
AWQALPAEQAKALREAAERIRAYAERQKLDSWDYREADGTLLGQKITPLDRVGLYVPGGKAAYPSSVLMNAVPAKVAGVP
ELIMAVPAPRGELNALVLAAAYISGVDRVFRIGGAQAVAALAYGTETVPRVDKIVGPGNIYVATAKKLVFGQVGIDMVAG
PSEILVISDGRTDPDWIAMDLFSQAEHDEDAQAILISPDAAHLEAVQASIERLLPGMERAEVIRTSLERRGGMILVDDLE
QAAAVANRIAPEHLELSVESPEVLVESIRNAGAIFMGRYTAEALGDYCAGPNHVLPTSGTARFSSPLGVYDFQKRSSLIY
CSPDGADQLGRTASLLAWGEGLGAHARSAEYRIRHH
>P9WNW9 1.1.1.23~~~hisD~~~Histidinol dehydrogenase~~~COG0141
MTAPPPVLTRIDLRGAELTAAELRAALPRGGADVEAVLPTVRPIVAAVAERGAEAALDFGASFDGVRPHAIRVPDAALDA
ALAGLDCDVCEALQVMVERTRAVHSGQRRTDVTTTLGPGATVTERWVPVERVGLYVPGGNAVYPSSVVMNVVPAQAAGVD
SLVVASPPQAQWDGMPHPTILAAARLLGVDEVWAVGGAQAVALLAYGGTDTDGAALTPVDMITGPGNIYVTAAKRLCRSR
VGIDAEAGPTEIAILADHTADPVHVAADLISQAEHDELAASVLVTPSEDLADATDAELAGQLQTTVHRERVTAALTGRQS
AIVLVDDVDAAVLVVNAYAAEHLEIQTADAPQVASRIRSAGAIFVGPWSPVSLGDYCAGSNHVLPTAGCARHSSGLSVQT
FLRGIHVVEYTEAALKDVSGHVITLATAEDLPAHGEAVRRRFER
>P10370 1.1.1.23~~~hisD~~~Histidinol dehydrogenase~~~
MSFNTLIDWNSCSPEQQRALLTRPAISASDSITRTVSDILDNVKTRGDDALREYSAKFDKTEVTALRVTPEEIAAAGARL
SDELKQAMTAAVKNIETFHSAQTLPPVDVETQPGVRCQQVTRPVSSVGLYIPGGSAPLFSTVLMLATPARIAGCQKVVLC
SPPPIADEILYAAQLCGVQEIFNVGGAQAIAALAFGSESVPKVDKIFGPGNAFVTEAKRQVSQRLDGAAIDMPAGPSEVL
VIADSGATPDFVASDLLSQAEHGPDSQVILLTPDADIARKVAEAVERQLAELPRADTARQALSASRLIVTKDLAQCVAIS
NQYGPEHLIIQTRNARDLVDAITSAGSVFLGDWSPESAGDYASGTNHVLPTYGYTATCSSLGLADFQKRMTVQELSKAGF
SALASTIETLAAAERLTAHKNAVTLRVNALKEQA
>Q9K6Z0 ~~~hisZ~~~ATP phosphoribosyltransferase regulatory subunit~~~COG3705
MSKPFMFEKPFGMRDTLPEWYKTKKNICDQMTEEINLWGYDMIETPTLEYYETVGVVSAILDQQLFKLLDQQGNTLVLRP
DMTAPIARLVASSLKDRAYPLRLAYQSNVYRAQQNEGGKPAEFEQLGVELIGDGTASADGEVIALMIAALKRAGLSEFKV
AIGHVGYVNALLMDVVGNEQRADRLRRFLYEKNYVGYREHVKSLNLSTIDKSRLMNLLSLRGGRAAIEEARGLIQTEKGK
TALAEMTKLYEVLESYGASEYVKFDLTLVLHMSYYTGVVFEGYGNRLGVPLCSGGRYDELLSKFHRPAQATGFGVRIDLL
VEALNGEIISNGHEQTCILFSNERRFEAIELARKKRANGEAVVLQDLAGVTDVDAMSSNYQDVIYCIGTAGRGGEDA
>Q02147 ~~~hisZ~~~ATP phosphoribosyltransferase regulatory subunit~~~COG3705
MEKINYLLPEESAEMTLNQVKSLRQIEGRLRKLFSLKNYQEVMPPSFEYTQLYTALESNGKTFNQEKMFQFIKHEGQSIT
LRYDFTLPLVRLYSQIKDSTSARYSYFGKIFRKEKRHKGRSTENYQIGIELFGESADKSELEILSLALQVIEQLGLNKTV
FEIGSAKFFQRLCQLADGSTELLTELLLKKDLSGLNAFIEKNNFSKELRGLLKEIFITNELSRLENLVTNTKDDVLISSF
DQLKEFSEKLSMIKPIIIDLGMVPKMDYYTDLMFKAYSSAANQPILSGGRYDQLLSNFQEEAFAIGFCCHMDTILKALER
QELEEDND
>Q4FTX3 ~~~hisZ~~~ATP phosphoribosyltransferase regulatory subunit~~~COG3705
MLPDGVADVLFEDAHKQEVLRHQLTQQLITHGYQLVSPPMIEFTESLLSGASEDLKRQTFKIIDQLTGRLMGIRADITPQ
ILRIDAHHGGDGIARYCYAGDVIHTLPSGLFGSRTPLQLGAEIFGCESIAADIELIDVLFSMINSLDMSAVLHVDLGHVT
IFKRLAELAALSASDTEQLMQLYANKNLPELKQVCQVLPMGSDFYTLARFGHDIANLLGRLSENAQQDTKIVTAIDELQR
LKAHLQVQWQCAVSIDVTELSGYHYHTGIVFNGYINSETQPLVRGGRFDGMKSNQLATNQPRQATGFSMDVSRLLAHTQL
DAPFIVLIDYDAFNNLDSAQRQLLLQQVASLRQQGYRVTMPLTAEDMPVGLTHRLSLADNQWRLHAV
>Q9X0D3 ~~~hisZ~~~ATP phosphoribosyltransferase regulatory subunit~~~COG3705
MDFLDFEKVFSFYSKATKKGFSPFFVPALEKAEEPAGNFFLDRKGNLFSIREDFTKTVLNHRKRYSPDSQIKVWYADFVY
RYSGSDLVAEYQLGLEKVPRNSLDDSLEVLEIIVESASEFFEGPVIVEIGHTGVYEDLLKEIPKDLHEKVLNLIDTKNLA
EIEFLSHMKKIDLSRVEKIIEDSIYRRSPEHLKTMDLPLSVREDLLSASSFLQEKFPTVSVEIDLTLARTIEEYCGLIFT
IYDTSSSRLVAAGGEYTVNGEKGVGGSIFLEGKTC
>O07513 ~~~hit~~~Protein hit~~~COG0537
MHCAENCIFCKIIAGDIPSAKVYEDEHVLAFLDISQVTKGHTLVIPKTHIENVYEFTDELAKQYFHAVPKIARAIRDEFE
PIGLNTLNNNGEKAGQSVFHYHMHIIPRYGKGDGFGAVWKTHADDYKPEDLQNISSSIAKRLASS
>O32142 3.5.2.17~~~pucM~~~5-hydroxyisourate hydrolase~~~COG2351
MGKLTTHILDLTCGKPAANVKIGLKRLGESIMKEVYTNNDGRVDVPLLAGEELMSGEYVMEFHAGDYFASKNMNAADQPF
LTIVTVRFQLADPDAHYHIPLLLSPFGYQVYRGS
>P76341 3.5.2.17~~~hiuH~~~5-hydroxyisourate hydrolase~~~COG2351
MLKRYLVLSVATAAFSLPSLVNAAQQNILSVHILNQQTGKPAADVTVTLEKKADNGWLQLNTAKTDKDGRIKALWPEQTA
TTGDYRVVFKTGDYFKKQNLESFFPEIPVEFHINKVNEHYHVPLLLSQYGYSTYRGS
>Q4VYA5 3.5.2.17~~~hiuH~~~5-hydroxyisourate hydrolase~~~
MKRHILATVIASLVAAPAMALAAGNNILSVHILDQQTGKPAPGVEVVLEQKKDNGWTQLNTGHTDQDGRIKALWPEKAAA
PGDYRVIFKTGQYFESKKLDTFFPEIPVEFHISKTNEHYHVPLLLSQYGYSTYRGS
>Q0Z8B6 ~~~~~~Bacteriocin hiracin-JM79~~~
MKKKVLKHCVILGILGTCLAGIGTGIKVDAATYYGNGLYCNKEKCWVDWNQAKGEIGKIIVNGWVNHGPWAPRR
>O07778 2.7.13.3~~~~~~Sensor histidine kinase component HK1~~~COG2205
MPITPLLHESVARFAATGADITTRAEPDLFVSIDPDHLRRILTAVLDNAITHGDGEIAVTAHARDGAVDIGVRDHGPGFA
DHFLPVAFDRFTRADTARGGRGSGLGLAIVAALTTTHGGHANATNHPDGGAELRITLPTPRPPFHEELPRITSSDTKDPN
REHDTSDQ
>O07777 2.7.13.3~~~~~~Sensor histidine kinase component HK2~~~COG2972
MALVLAAAGAVTVVQFRDAAHEADPDGALRGLTDDITADLVRELVTILPIVLVIAAVAAYLLSRAALRPVDRIRAAAQTL
TTTPHPDTDAPLPVPPTDDEIAWLATTLNTMLTRLQRALAHEQQFVADASHELRTPLALLTTELELRCAGPDPPTS
>Q2G1X0 ~~~hly~~~Alpha-hemolysin~~~
MKTRIVSSVTTTLLLGSILMNPVANAADSDINIKTGTTDIGSNTTVKTGDLVTYDKENGMHKKVFYSFIDDKNHNKKLLV
IRTKGTIAGQYRVYSEEGANKSGLAWPSAFKVQLQLPDNEVAQISDYYPRNSIDTKEYMSTLTYGFNGNVTGDDTGKIGG
LIGANVSIGHTLKYVQPDFKTILESPTDKKVGWKVIFNNMVNQNWGPYDRDSWNPVYGNQLFMKTRNGSMKAADNFLDPN
KASSLLSSGFSPDFATVITMDRKASKQQTNIDVIYERVRDDYQLHWTSTNWKGTNTKDKWIDRSSERYKIDWEKEEMTN
>P09616 ~~~hly~~~Alpha-hemolysin~~~
MKTRIVSSVTTTLLLGSILMNPVAGAADSDINIKTGTTDIGSNTTVKTGDLVTYDKENGMHKKVFYSFIDDKNHNKKLLV
IRTKGTIAGQYRVYSEEGANKSGLAWPSAFKVQLQLPDNEVAQISDYYPRNSIDTKEYMSTLTYGFNGNVTGDDTGKIGG
LIGANVSIGHTLKYVQPDFKTILESPTDKKVGWKVIFNNMVNQNWGPYDRDSWNPVYGNQLFMKTRNGSMKAADNFLDPN
KASSLLSSGFSPDFATVITMDRKASKQQTNIDVIYERVRDDYQLHWTSTNWKGTNTKDKWTDRSSERYKIDWEKEEMTN
>Q2SY18 5.1.3.20~~~hldD~~~ADP-L-glycero-D-manno-heptose-6-epimerase~~~
MTLIVTGAAGFIGANLVKALNERGETRIIAVDNLTRADKFKNLVDCEIDDYLDKTEFVERFARGDFGKVRAVFHEGACSD
TMETDGRYMMDNNFRYSRAVLDACLAQGAQFLYASSAAIYGGSSRFVEEREVEAPLNVYGYSKFLFDQVIRRVMPGAKSQ
IAGFRYFNVYGPRESHKGRMASVAFHNFNQFRAEGKVKLFGEYSGYGPGEQTRDFVSVEDVAKVNLYFFDHPEKSGIFNL
GTGRAQPFNDIAATVVNTLRALEGQPALTLAEQVEQGLVEYVPFPDALRGKYQCFTQADQTKLRAAGYDAPFLTVQEGVD
RYVRWLFGQL
>P67911 5.1.3.20~~~hldD~~~ADP-L-glycero-D-manno-heptose-6-epimerase~~~COG0451
MIIVTGGAGFIGSNIVKALNDKGITDILVVDNLKDGTKFVNLVDLNIADYMDKEDFLIQIMAGEEFGDVEAIFHEGACSS
TTEWDGKYMMDNNYQYSKELLHYCLEREIPFLYASSAATYGGRTSDFIESREYEKPLNVYGYSKFLFDEYVRQILPEANS
QIVGFRYFNVYGPREGHKGSMASVAFHLNTQLNNGESPKLFEGSENFKRDFVYVGDVADVNLWFLENGVSGIFNLGTGRA
ESFQAVADATLAYHKKGQIEYIPFPDKLKGRYQAFTQADLTNLRAAGYDKPFKTVAEGVTEYMAWLNRDA
>P67910 5.1.3.20~~~hldD~~~ADP-L-glycero-D-manno-heptose-6-epimerase~~~COG0451
MIIVTGGAGFIGSNIVKALNDKGITDILVVDNLKDGTKFVNLVDLNIADYMDKEDFLIQIMAGEEFGDVEAIFHEGACSS
TTEWDGKYMMDNNYQYSKELLHYCLEREIPFLYASSAATYGGRTSDFIESREYEKPLNVYGYSKFLFDEYVRQILPEANS
QIVGFRYFNVYGPREGHKGSMASVAFHLNTQLNNGESPKLFEGSENFKRDFVYVGDVADVNLWFLENGVSGIFNLGTGRA
ESFQAVADATLAYHKKGQIEYIPFPDKLKGRYQAFTQADLTNLRAAGYDKPFKTVAEGVTEYMAWLNRDA
>P76658 ~~~hldE~~~Bifunctional protein HldE~~~COG0615
MKVTLPEFERAGVMVVGDVMLDRYWYGPTSRISPEAPVPVVKVNTIEERPGGAANVAMNIASLGANARLVGLTGIDDAAR
ALSKSLADVNVKCDFVSVPTHPTITKLRVLSRNQQLIRLDFEEGFEGVDPQPLHERINQALSSIGALVLSDYAKGALASV
QQMIQLARKAGVPVLIDPKGTDFERYRGATLLTPNLSEFEAVVGKCKTEEEIVERGMKLIADYELSALLVTRSEQGMSLL
QPGKAPLHMPTQAQEVYDVTGAGDTVIGVLAATLAAGNSLEEACFFANAAAGVVVGKLGTSTVSPIELENAVRGRADTGF
GVMTEEELKLAVAAARKRGEKVVMTNGVFDILHAGHVSYLANARKLGDRLIVAVNSDASTKRLKGDSRPVNPLEQRMIVL
GALEAVDWVVSFEEDTPQRLIAGILPDLLVKGGDYKPEEIAGSKEVWANGGEVLVLNFEDGCSTTNIIKKIQQDKKG
>P0A0M2 ~~~hld~~~Delta-hemolysin~~~
MAQDIISTIGDLVKWIIDTVNKFTKK
>P0C1V1 ~~~hld~~~Delta-hemolysin~~~
MAQDIISTIGDLVKWIIDTVNKFTKK
>P0A071 ~~~hlgA~~~Gamma-hemolysin component A~~~
MIKNKILTATLAVGLIAPLANPFIEISKAENKIEDIGQGAEIIKRTQDITSKRLAITQNIQFDFVKDKKYNKDALVVKMQ
GFISSRTTYSDLKKYPYIKRMIWPFQYNISLKTKDSNVDLINYLPKNKIDSADVSQKLGYNIGGNFQSAPSIGGSGSFNY
SKTISYNQKNYVTEVESQNSKGVKWGVKANSFVTPNGQVSAYDQYLFAQDPTGPAARDYFVPDNQLPPLIQSGFNPSFIT
TLSHERGKGDKSEFEITYGRNMDATYAYVTRHRLAVDRKHDAFKNRNVTVKYEVNWKTHEVKIKSITPK
>P0A072 ~~~hlgA~~~Gamma-hemolysin component A~~~
MIKNKILTATLAVGLIAPLANPFIEISKAENKIEDIGQGAEIIKRTQDITSKRLAITQNIQFDFVKDKKYNKDALVVKMQ
GFISSRTTYSDLKKYPYIKRMIWPFQYNISLKTKDSNVDLINYLPKNKIDSADVSQKLGYNIGGNFQSAPSIGGSGSFNY
SKTISYNQKNYVTEVESQNSKGVKWGVKANSFVTPNGQVSAYDQYLFAQDPTGPAARDYFVPDNQLPPLIQSGFNPSFIT
TLSHERGKGDKSEFEITYGRNMDATYAYVTRHRLAVDRKHDAFKNRNVTVKYEVNWKTHEVKIKSITPK
>P0A074 ~~~hlgA~~~Gamma-hemolysin component A~~~
MIKNKILTATLAVGLIAPLANPFIEISKAENKIEDIGQGAEIIKRTQDITSKRLAITQNIQFDFVKDKKYNKDALVVKMQ
GFISSRTTYSDLKKYPYIKRMIWPFQYNISLKTKDSNVDLINYLPKNKIDSADVSQKLGYNIGGNFQSAPSIGGSGSFNY
SKTISYNQKNYVTEVESQNSKGVKWGVKANSFVTPNGQVSAYDQYLFAQDPTGPAARDYFVPDNQLPPLIQSGFNPSFIT
TLSHERGKGDKSEFEITYGRNMDATYAYVTRHRLAVDRKHDAFKNRNVTVKYEVNWKTHEVKIKSITPK
>Q2FVK1 ~~~hlgB~~~Gamma-hemolysin component B~~~
MKMNKLVKSSVATSMALLLLSGTANAEGKITPVSVKKVDDKVTLYKTTATADSDKFKISQILTFNFIKDKSYDKDTLVLK
ATGNINSGFVKPNPNDYDFSKLYWGAKYNVSISSQSNDSVNVVDYAPKNQNEEFQVQNTLGYTFGGDISISNGLSGGLNG
NTAFSETINYKQESYRTTLSRNTNYKNVGWGVEAHKIMNNGWGPYGRDSFHPTYGNELFLAGRQSSAYAGQNFIAQHQMP
LLSRSNFNPEFLSVLSHRQDGAKKSKITVTYQREMDLYQIRWNGFYWAGANYKNFKTRTFKSTYEIDWENHKVKLLDTKE
TENNK
>P0A075 ~~~hlgB~~~Gamma-hemolysin component B~~~
MKMNKLVKSSVATSMALLLLSGTANAEGKITPVSVKKVDDKVTLYKTTATADSDKFKISQILTFNFIKDKSYDKDTLVLK
ATGNINSGFVKPNPNDYDFSKLYWGAKYNVSISSQSNDSVNVVDYAPKNQNEEFQVQNTLGYTFGGDISISNGLSGGLNG
NTAFSETINYKQESYRTTLSRNTNYKNVGWGVEAHKIMNNGWGPYGRDSFHPTYGNELFLAGRQSSAYAGQNFIAQHQMP
LLSRSNFNPEFLSVLSHRQDGAKKSKITVTYQREMDLYQIRWNGFYWAGANYKNFKTRTFKSTYEIDWENHKVKLLDTKE
TENNK
>P0A077 ~~~hlgB~~~Gamma-hemolysin component B~~~
MKMNKLVKSSVATSMALLLLSGTANAEGKITPVSVKKVDDKVTLYKTTATADSDKFKISQILTFNFIKDKSYDKDTLVLK
ATGNINSGFVKPNPNDYDFSKLYWGAKYNVSISSQSNDSVNVVDYAPKNQNEEFQVQNTLGYTFGGDISISNGLSGGLNG
NTAFSETINYKQESYRTTLSRNTNYKNVGWGVEAHKIMNNGWGPYGRDSFHPTYGNELFLAGRQSSAYAGQNFIAQHQMP
LLSRSNFNPEFLSVLSHRQDGAKKSKITVTYQREMDLYQIRWNGFYWAGANYKNFKTRTFKSTYEIDWENHKVKLLDTKE
TENNK
>Q2FVK2 ~~~hlgC~~~Gamma-hemolysin component C~~~
MLKNKILTTTLSVSLLAPLANPLLENAKAANDTEDIGKGSDIEIIKRTEDKTSNKWGVTQNIQFDFVKDKKYNKDALILK
MQGFISSRTTYYNYKKTNHVKAMRWPFQYNIGLKTNDKYVSLINYLPKNKIESTNVSQTLGYNIGGNFQSAPSLGGNGSF
NYSKSISYTQQNYVSEVEQQNSKSVLWGVKANSFATESGQKSAFDSDLFVGYKPHSKDPRDYFVPDSELPPLVQSGFNPS
FIATVSHEKGSSDTSEFEITYGRNMDVTHAIKRSTHYGNSYLDGHRVHNAFVNRNYTVKYEVNWKTHEIKVKGQN
>Q99RL1 ~~~hlgC~~~Gamma-hemolysin component C~~~
MLKNKILATTLSVSLLAPLANPLLENAKAANDTEDIGKGSDIEIIKRTEDKTSNKWGVTQNIQFDFVKDKKYNKDALILK
MQGFISSRTTYYNYKKTNHVKAMRWPFQYNIGLKTNDKYVSLINYLPKNKIESTNVSQTLGYNIGGNFQSAPSLGGNGSF
NYSKSISYTQQNYVSEVEQQNSKSVLWGVKANSFATESGQKSAFDSDLFVGYKPHSKDPRDYFVPDSELPPLVQSGFNPS
FIATVSHEKGSSDTSEFEITYGRNMDVTHAIKRSTHYGNSYLDGHRVHNAFVNRNYTVKYEVNWKTHEIKVKGQN
>Q7A3S2 ~~~hlgC~~~Gamma-hemolysin component C~~~
MLKNKILATTLSVSLLAPLANPLLENAKAANDTEDIGKGSDIEIIKRTEDKTSNKWGVTQNIQFDFVKDKKYNKDALILK
MQGFISSRTTYYNYKKTNHVKAMRWPFQYNIGLKTNDKYVSLINYLPKNKIESTNVSQTLGYNIGGNFQSAPSLGGNGSF
NYSKSISYTQQNYVSEVEQQNSKSVLWGVKANSFATESGQKSAFDSDLFVGYKPHSKDPRDYFVPDSELPPLVQSGFNPS
FIATVSHEKGSSDTSEFEITYGRNMDVTHAIKRSTHYGNSYLDGHRVHNAFVNRNYTVKYEVNWKTHEIKVKGQN
>Q93NH4 1.5.3.5~~~6-hlno~~~(S)-6-hydroxynicotine oxidase~~~
MYDAIVVGGGFSGLKAARDLTNAGKKVLLLEGGERLGGRAYSRESRNVPGLRVEIGGAYLHRKHHPRLAAELDRYGIPTA
AASEFTSFRHRLGPTAVDQAFPIPGSEAVAVEAATYTLLRDAHRIDLEKGLENQDLEDLDIPLNEYVDKLDLPPVSRQFL
LAWAWNMLGQPADQASALWMLQLVAAHHYSILGVVLSLDEVFSNGSADLVDAMSQEIPEIRLQTVVTGIDQSGDVVNVTV
KDGHAFQAHSVIVATPMNTWRRIVFTPALPERRRSVIEEGHGGQGLKILIHVRGAEAGIECVGDGIFPTLYDYCEVSESE
RLLVAFTDSGSFDPTDIGAVKDAVLYYLPEVEVLGIDYHDWIADPLFEGPWVAPRVGQFSRVHKELGEPAGRIHFVGSDV
SLEFPGYIEGALETAECAVNAILHS
>A0A075BSX9 1.5.3.5~~~nctB~~~(S)-6-hydroxynicotine oxidase~~~
MTEKIYDAIVVGAGFSGLVAARELSAQGRSVLIIEARHRLGGRTHVVNFLGRPVEIGGAGVHWCQPHVFAEMQRYGFGFK
EAPLADLDKAYMVFADGQKIDVPPATFDEEYTTAFEKFCSRSRELFPRPYSPLDNHEVSNLDGVSARDHLESLGLNELQL
ASMNAELTLYGGAPTTELSYPSFVKFHALASWDTITFTDSEKRYHVQGGTNALCQAIFDDCRADSEFGVPVEAVAQTDNG
VTVTLADKRVFRALTCVLTLPTKVYADVRFEPPLPPEKRAFIEHAEMADGAELYVHVRQNLGNTFTFCDDPNPFNAVQTY
AYDDELGTILKITIGRQSLINLENFDAIAAEIRKIHGDVEVLEALPYNWAMDEYARTSYPAMRKGWFSRYKDMAKPENRL
FFAGSATADGWHEYIDGAIESGIRVGREIRHFMKATA
>Q99289 ~~~~~~Thermolabile hemolysin~~~COG3240
MMKKTITLLTALLPLASAVAEEPTLSPEMVSASEVISTQENQTYTYVRCWYRTSYSKDDPATDWEWAKNEDGSYFTIDGY
WWSSVSFKNMFYTNTSQNVIRQRCEATLDLANENADITFFAADNRFSYNHTIWSNDAAMQPDQINKVVALGDSLSDTGNI
FNASQWRFPNPNSWFLGHFSNGFVWTEYIAKAKNLPLYNWAVGGAAGENQYIALTGVGEQVSSYLTYAKLAKNYKPANTL
FTLEFGLNDFMNYNRGVPEVKADYAEALIRLTDAGAKNFMLMTLPDATKAPQFKYSTQEEIDKIRAKVLEMNEFIKAQAM
YYKAQGYNITLFDTHALFETLTSAPEEHGFVNASDPCLDINRSSSVDYMYTHALRSECAASGAEKFVFWDVTHPTTATHR
YVAEKMLESSNNLAEYRF
>P85219 ~~~~~~Hemolysin H1C~~~
MSGIVEAISNAVKSGLDHDWVNMGTSIADVVAKGADFIAGFFS
>P85222 ~~~~~~Hemolysin H1U~~~
MSGIVEAISNAVKSGLDHDWVNMGTSIADVVAKGADFIAGFFS
>P19249 ~~~tdh1~~~Thermostable direct hemolysin 1~~~
MKHQYFAKKSFLFISMLAAFKTSAFELPSVPFPAPGSDEILFVVRDTTFNTQAPVNVKVSDFWTNRNVKRKPYEDVYGQS
VFTTSGTKWLTSYMTVNINDKDYTMAAVSGYKSGHSAVFVKSGQVQLQHSYNSVANFVGEDEGSIPSKMYLDETPEYFVN
VEAYESGSGNILVMCISNKESFFECKHQQ
>P19250 ~~~tdh2~~~Thermostable direct hemolysin 2~~~
MKYRYFAKKSFLFISMLAAFKTFAFELPSVPFPAPGSDEILFVVRDTTFNTNAPVNVEVSDFWTNRNVKRKPYKDVYGQS
VFTTSGTKWLTSYMTVNINDKDYTMAAVSGYKHGHSAVFVKSDQVQLQHSYDSVANFVGEDEDSIPSKMYLDETPEYFVN
VEAYESGSGNILVMCISNKESFFECKHQQ
>P85221 ~~~~~~Hemolysin H3C~~~
MSDFVNAISEAVKAGLSADWVTMGTSIADALAKGADFILGFFN
>P85224 ~~~~~~Hemolysin H3U~~~
MSDFVNAISEAVKAGLSADWVTMGTSIADALAKGADFILGFFN
>P28029 ~~~tdh3~~~Thermostable direct hemolysin-related~~~
MKYRYFAKKSFLFISMLAAFKTFAFELPSVPFPAPGSDEILFVVRDATFNTNAPVNVKVSDFWTNRNVKRKPYKDVYGQS
VFTTSGTKWLTSYMTVNINDKDYTMAAVSGYKRGHSAVFVKSDQVQLQHSYNSVANFVGEDEDSIPSKMYLDETPEYFVN
VEAYESGSGNILVMCISNKESFFECEHQK
>P09983 ~~~hlyA~~~Hemolysin, chromosomal~~~
MPTITAAQIKSTLQSAKQSAANKLHSAGQSTKDALKKAAEQTRNAGNRLILLIPKDYKGQGSSLNDLVRTADELGIEVQY
DEKNGTAITKQVFGTAEKLIGLTERGVTIFAPQLDKLLQKYQKAGNKLGGSAENIGDNLGKAGSVLSTFQNFLGTALSSM
KIDELIKKQKSGGNVSSSELAKASIELINQLVDTAASLNNVNSFSQQLNKLGSVLSNTKHLNGVGNKLQNLPNLDNIGAG
LDTVSGILSAISASFILSNADADTGTKAAAGVELTTKVLGNVGKGISQYIIAQRAAQGLSTSAAAAGLIASVVTLAISPL
SFLSIADKFKRANKIEEYSQRFKKLGYDGDSLLAAFHKETGAIDASLTRISTVLASVSSGISAAATTSLVGAPVSALVGA
VTGIISGILEASKQAMFEHVASKMADVIAEWEKKHGKNYFENGYDARHAAFLEDNFKILSQYNKEYSVERSVLITQQHWD
TLIGELAGVTRNGDKTLSGKSYIDYYEEGKRLEKKPDEFQKQVFDPLKGNIDLSDSKSSTLLKFVTPLLTPGEEIRERRQ
SGKYEYITELLVKGVDKWTVKGVQDKGSVYDYSNLIQHASVGNNQYREIRIESHLGDGDDKVFLSAGSANIYAGKGHDVV
YYDKTDTGYLTIDGTKATEAGNYTVTRVLGGDVKVLQEVVKEQEVSVGKRTEKTQYRSYEFTHINGKNLTETDNLYSVEE
LIGTTRADKFFGSKFADIFHGADGDDHIEGNDGNDRLYGDKGNDTLSGGNGDDQLYGGDGNDKLIGGAGNNYLNGGDGDD
ELQVQGNSLAKNVLSGGKGNDKLYGSEGADLLDGGEGNDLLKGGYGNDIYRYLSGYGHHIIDDDGGKDDKLSLADIDFRD
VAFRREGNDLIMYKAEGNVLSIGHKNGITFKNWFEKESGDISNHQIEQIFDKDGRVITPDSLKKALEYQQSNNKASYVYG
NDALAYGSQGNLNPLINEISKIISAAGNFDVKEERAAASLLQLSGNASDFSYGRNSITLTASA
>P08715 ~~~hlyA~~~Hemolysin, plasmid~~~
MTTITTAQIKSTLQSAKQSAANKLHSAGQSTKDALKKAAEQTRNAGNRLILLIPKDYKGQGSSLNDLVRTADELGIEVQY
DEKNGTAITKQVFGTAEKLIGLTERGVTIFAPQLDKLLQKYQKAGNILGGGAENIGDNLGKAGGILSTFQNFLGTALSSM
KIDELIKKQKSGGNVSSSELAKASIELINQLVDTVASLNNNVNSFSQQLNTLGSVLSNTKHLNGVGNKLQNLPNLDNIGA
GLDTVSGILSAISASFILSNADADTRTKAAAGVELTTKVLGNVGKGISQYIIAQRAAQGLSTSAAAAGLIASAVTLAISP
LSFLSIADKFKRANKIEEYSQRFKKLGYDGDSLLAAFHKETGAIDASLTTISTVLASVSSGISAAATTSLVGAPVSALVG
AVTGIISGILEASKQAMFEHVASKMADVIAEWEKKHGKNYFENGYDARHAAFLEDNFKILSQYNKEYSVERSVLITQQHW
DTLIGELAGVTRNGDKTLSGKSYIDYYEEGKRLEKKXDEFQKQVFDPLKGNIDLSDSKSSTLLKFVTPLLTPGEEIRERR
QSGKYEYITELLVKGVDKWTVKGVQDKGAVYDYSNLIQHASVGNNQYREIRIESHLGDGDDKVFLSAGSANIYAGKGHDV
VYYDKTDTGYLTIDGTKATEAGNYTVTRVLGGDVKVLQEVVKEQEVSVGKRTEKTQYRSYEFTHINGKNLTETDNLYSVE
ELIGTTRADKFFGSKFTDIFHGADGDDLIEGNDGNDRLYGDKGNDTLSGGNGDDQLYGGDGNDKLIGVAGNNYLNGGDGD
DEFQVQGNSLAKNVLFGGKGNDKLYGSEGADLLDGGEGDDLLKGGYGNDIYRYLSGYGHHIIDDDGGKEDKLSLADIDFR
DVAFKREGNDLIMYKGEGNVLSIGHKNGITFRNWFEKESGDISNHEIEQIFDKSGRIITPDSLKKALEYQQRNNKASYVY
GNDALAYGSQGDLNPLINEISKIISAAGSFDVKEERTAASLLQLSGNASDFSYGRNSITLTTSA
>Q06803 ~~~tlyA~~~Hemolysin A~~~
MRLDEYVHSEGYTESRSKAQDIILAGCVFVNGVKVTSKAHKIKDTDNIEVVQNIKYVSRAGEKLEKAFVEFGISVENKIC
LDIGASTGGFTDCLLKHGAKKVYALDVGHNQLVYKLRNDNRVVSIEDFNAKDINKEMFNDEIPSVIVSDVSFISITKIAP
IIFKELNNLEFWVTLIKPQFEAERGDVSKGGIIRDDILREKILNNAISKIIDCGFKEVNRTISPIKGAKGNIEYLAHFII
>P16466 ~~~hpmA~~~Hemolysin~~~
MKSKNFKLSPSGRLAASLAIIFVSLNAYGNGIVPDAGHQGPDVSAVNGGTQVINIVTPNNEGISHNQYQDFNVGKPGAVF
NNALEAGQSQLAGHLNANSNLNGQAASLILNEVVSRNPSFLLGQQEVFGIAAEYVLSNPNGITCDGCGFINTSRSSLVVG
NPLFENGQLKGYSTLNNTNLLSLGKNGLNTTGLLDLIAPRIDSRGKITAAEISAFTGQNTFSQHFDILSSQKPVSALDSY
FFGSMQSGRIRIINTAEGSGVKLAGKFTADNDLSVKADNIQTDSQVRYDSYDKDGSENYQNYRGGITVNNSGSSQTLTKT
ELKGKNITLVASSHNQIKASDLMGDDITLQGADLTIDGKQLQQKETDIDNRWFYSWKYDVTKEKEQIQQIGSQIDAKNNA
TLTATKGDVTLDAAKINAGNNLAINANKDIHINGLVEKESRSENGNKRNHTSRLESGSWSNSHQTETLKASELTAGKDLG
LDAQGSITAQGAKLHANENVLVNAKDNINLNVQKTNNDKTVTDNHVMWGGIGGGQNKNNNNQQQVSHATQLTADGQLLLA
ADNNVNITGSQVKGNQGAFVKTTQGDVVIDNALSETISKIDERTGTAFNITKSSHKNETNKQTSTGSELISDAQLTVVSG
NDVNVIGSLIKSADKLGIHSLGDINVKSAQQVTKIDDEKTSLAITGHAKEVEDKQYSAGFHITHTTNKNTSTETEQANST
ISGANVDLQANKDVTFAGSDLKTTAGNASITGDNVAFVSTENKKQTDNTDTTISGGFSYTGGVDKVGSKADFQYDKQHTQ
TEVTKNRGSQTEVAGDLTITANKDLLHEGASHHVEGRYQESGENIQHLAVNDSETSKTDSLNVGIDVGVNLDYSGVTKPV
KKAIEDGVNTTKPGNNTDLTKKVTARDAIANLANLSNLETPNVGVEVGIKGGGSQQSQTDSQAVSTSINAGKIDIDSNNK
LHDQGTHYQSTQEGISLTANTHTSEATLDKHQTTFHETKGGGQIGVSTKTGSDITVAIKGEGQTTDNALMETKAKGSQFT
SNGDISINVGENAHYEGAQFDAQKGKTVINAGGDLTLAQATDTHSESQSNVNGSANLKVGTTPESKDYGGGFNAGTTHHS
KEQTTAKVGTITGSQGIELNAGHNLTLQGTHLSSEQDIALNATNKVDLQSASSEHTEKGNNLSGGVQAGFGKKMTDDASS
VNGLGSAQFAIGKQDEKSVSREGGTINNSGNLTINGNSVHLQGAQVNSKDTQLTSQSGDIEITSAQSTDYKNNWGTDIGF
NGKKTNNTPKEVTEEKPATSIHNIGGKLLVNVEDQQKTSHQNATLETGTLTINSNKDLTLSGANVTADSVTGNVGGSLNI
ASQKESDRHVTVGVNVGYNHTNDPKSSQVNKTAKAGGSLLEKTIKDTIDSGIKSSTDAISDKYNSLSSTIADKTGISDET
KAKIDQGFGKVGNGIKNIVTGAEGHTANADIKVTHVDNDAVTKTTSLTSNNDLSLNVNGSTKLTGAEIVSQQGQVDLGGS
SVKLENIEGHHYEAGADLDLKSSVVDLAKQLVGGDISFKSPVKTNETVNTKASISEK
>P15320 ~~~shlA~~~Hemolysin~~~
MKNNNFRLSAAGKLAAALAIILAASAGAYAAEIVAANGANGPGVSTAATGAQVVDIVAPNGNGLSHNQYQDFNVNQPGAV
LNNSREAGLSQLAGQLGANPNLGGREASVILNEVIGRNPSLLHGQQEIFGMAADYVLANPNGISCQSCGFINTSHSSLVV
GNPLVENGVLQGYSTFGNRNTLSLNGTLNAGGVLDLIAPKIDSRGEVIVQDFKQSNGKVTSAAINAISGLNRVARDGTVQ
ASQQMPTALDSYYLGSMQAGRINIINTAQGSGVKLAGSLNAGDELKVKAYDIRSESRVDDASSNKNGGDNYQNYRGGIYV
NDRSSSQTLTRTELKGKNISLVADNHAHLTATDIRGEDITLQGGKLTLDGQQLKQTQGHTDDRWFYSWQYDVTREREQLQ
QAGSTVAASGSAKLISTQEDVKLLGANVSADRALSVKAARDVHLAGLVEKDKSSERGYQRNHTSSLRTGRWSNSDESESL
KASELRSEGELTLKAGRNVSTQGAKVHAQRDLTIDADNQIQVGVQKTANAKAVRDDKTSWGGIGGGDNKNNSNRREISHA
SELTSGGTLRLNGQQGVTITGSKARGQKGGEVTATHGGLRIDNALSTTVDKIDARTGTAFNITSSSHKADNSYQSSTASE
LKSDTNLTLVSHKDADVIGSQVASGGELSVESKTGNINVKAAERQQNIDEQKTALTVNGYAKEAGDKQYRAGLRIEHTRD
SEKTTRTENSASSLSGGSVKLKAEKDVTFSGSKLVADKGDASVSGNKVSFLAADDKTASNTEQTKIGGGFYYTGGIDKLG
SGVEAGYENNKTQAQSSKAITSGSDVKGNLTINARDKLTQQGAQHSVGGAYQENAAGVDHLAAADTASTTTTKTDVGVNI
GANVDYSAVTRPVERAVGKAAKLDATGVINDIGGIGAPNVGLDIGAQGGSSEKRSSSSQAVVSSVQAGSIDINAKGEVRD
QGTQYQASKGAVNLTADSHRSEAAANRQDEQSRDTRGSAGVRVYTTTGSDLTVDAKGEGGTQRSNSSASQAVTGSIDAAN
GINVNVKKDAIYQGTALNGGRGKTAVNAGGDIRLDQASDKQSESRSGFNVKASAKGGFTADSKNFGAGFGGGTHNGESSS
STAQVGNISGQQGVELKAGRDLTLQGTDVKSQGDVSLSAGNKVALQAAESTQTRKESKLSGNIDLGAGSSDSKEKTGGNL
SAGGAFDIAKVNESATERQGATIASDGKVTLSANGKGDDALHLQGAKVSGGSAALEAKNGGILLESAKNEQHKDNWSLGI
KANAKGGQTFNKDAGGKVDPNTGKDTHTLGAGLKVGVEQQDKTTHANTGITAGDVTLNSGKDTRLAGARVDADSVQGKVG
GDLHVESRKDVENGVKVDVDAGLSHSNDPGSSITSKLSKVGTPRYAGKVKEKLEAGVNKVADATTDKYNSVARRLDPQQD
TTGAVSFSKAEGKVTLPATPAGEKPQGPLWDRGARTVGGAVKDSITGPAGRQGHLKVNADVVNNNAVGEQSAIAGKNGVA
LQVGGQTQLTGGEIRSQQGKVELGGSQVSQQDVNGQRYQGGGRVDAAATVGGLLGGAAKQSVAGNVPFASGHASTQQADA
KAGVFSGK
>P09545 ~~~hlyA~~~Hemolysin~~~
MPKLNRCAIAIFTILSAISSPTLLANINEPSGEAADIISQVADSHAIKYYNAADWQAEDNALPSLAELRDLVINQQKRVL
VDFSQISDAEGQAEMQAQFRKAYGVGFANQFIVITEHKGELLFTPFDQAEEVDPQLLEAPRTARLLARSGFASPAPANSE
TNTLPHVAFYISVNRAISDEECTFNNSWLWKNEKGSRPFCKDANISLIYRVNLERSLQYGIVGSATPDAKIVRISLDDDS
TGAGIHLNDQLGYRQFGASYTTLDAYFREWSTDAIAQDYRFVFNASNNKAQILKTFPVDNINEKFERKEVSGFELGVTGG
VEVSGDGPKAKLEARASYTQSRWLTYNTQDYRIERNAKNAQAVSFTWNRQQYATAESLLNRSTDALWVNTYPVDVNRISP
LSYASFVPKMDVIYKASATETGSTDFIIDSSVNIRPIYNGAYKHYYVVGAHQSYHGFEDTPRRRITKSASFTVDWDHPVF
TGGRPVNLQLASFNNRCIQVDAQGRLAANTCDSQQSAQSFIYDQLGRYVSASNTKLCLDGEALDALQPCNQNLTQRWEWR
KGTDELTNVYSGESLGHDKQTGELGLYASSNDAVSLRTITAYTDVFNAQESSPILGYTQGKMNQQRVGQDHRLYVRAGAA
IDALGSASDLLVGGNGGSLSSVDLSGVKSITATSGDFQYGGQQLVALTFTYQDGRQQTVGSKAYVTNAHEDRFDLPAAAK
ITQLKIWSDDWLVKGVQFDLN
>Q47258 ~~~hlyB~~~Alpha-hemolysin translocation ATP-binding protein HlyB~~~
MDSCHKIDYGLYALEILAQYHNVSVNPEEIKHRFDTDGTGLGLTSWLLAAKSLELKVKQVKKTIDRLNFISLPALVWRED
GRHFILTKVSKEANRYLIFDLEQRNPRVLEQSEFEALYQGHIILIASRSSVAGKLAKFDFTWFIPAIIKYRRIFIETLVV
SVFLQLFALITPLFFQVVMDKVLVHRGFSTLNVITVALSVVVVFEIILSGLRTYIFAHSTSRIDVELGAKLFRHLLALPI
SYFESRRVGDTVARVRELDQIRNFLTGQALTSVLDLLFSFIFFAVMWYYSPKLTLVILFSLPCYAAWSVFISPILRRRLD
DKFSRNADNQSFLVESVTAINTIKAMAVSPQMTNIWDKQLAGYVAAGFKVTVLATIGQQGIQLIQKTVMIINLWLGAHLV
ISGDLSIGQLIAFNMLAGQIVAPVIRLAQIWQDFQQVGISVTRLGDVLNSPTESYHGKLALPEINGDITFRNIRFRYKPD
SPVILDNINLSIKQGEVIGIVGRSGSGKSTLTKLIQRFYIPENGQVLIDGHDLALADPNWLRRQVGVVLQDNVLLNRSII
DNISLANPGMSVEKVIYAAKLAGAHDFISELREGYNTIVGEQGAGLSGGQRQRIAIARALVNNPKILIFDEATSALDYES
EHIIMRNMHKICKGRTVIIIAHRLSTVKNADRIIVMEKGKIVEQGKHKELLSEPESLYSYLYQLQSD
>P08716 ~~~hlyB~~~Alpha-hemolysin translocation ATP-binding protein HlyB~~~
MDSCHKIDYGLYALEILAQYHNVSVNPEEIKHRFDTDGTGLGLTSWLLAAKSLELKVKQVKKTIDRLNFIFLPALVWRED
GRHFILTKISKEVNRYLIFDLEQRNPRVLEQSEFEALYQGHIILITSRSSVTGKLAKFDFTWFIPAIIKYRRIFIETLVV
SVFLQLFALITPLFFQVVMDKVLVHRGFSTLNVITVALSVVVVFEIILSGLRTYIFAHSTSRIDVELGAKLFRHLLALPI
SYFESRRVGDTVARVRELDQIRNFLTGQALTSVLDLLFSLIFFAVMWYYSPKLTLVILFSLPCYAAWSVFISPILRRRLD
DKFSRNADNQSFLVESVTAINTIKAMAVSPQMTNIWDKQLAGYVAAGFKVTVLATIGQQGIQLIQKTVMIINLWLGAHLV
ISGDLSIGQLIAFNMLAGQIVAPVIRLAQIWQDFQQVGISVTRLGDVLNSPTESYHGKLTLPEINGDITFRNIRFRYKPD
SPVILDNINLSIKQGEVIGIVGRSGSGKSTLTKLIQRFYIPENGQVLIDGHDLALADPNWLRRQVGVVLQDNVLLNRSII
DNISLANPGMSVEKVIYAAKLAGAHDFISELREGYNTIVGEQGAGLSGGQRQRIAIARALVNNPKILIFDEATSALDYES
EHVIMRNMHKICKGRTVIIIAHRLSTVKNADRIIVMEKGKIVEQGKHKELLSEPESLYSYLYQLQSD
>Q8FDZ8 ~~~hlyB~~~Alpha-hemolysin translocation ATP-binding protein HlyB~~~COG2274
MDSCHKIDYGLYALEILAQYHNVSVNPEEIKHRFDTDGTGLGLTSWLLAAKSLELKVKQVKKTIDRLNFISLPALVWRED
GRHFILTKVSKEANRYLIFDLEQRNPRVLEQSEFEALYQGHIILIASRSSVTGKLAKFDFTWFIPAIIKYRKIFIETLVV
SVFLQLFALITPLFFQVVMDKVLVHRGFSTLNVITVALSVVVVFEIILSGLRTYIFAHSTSRIDVELGAKLFRHLLALPI
SYFESRRVGDTVARVRELDQIRNFLTGQALTSVLDLLFSFIFFAVMWYYSPKLTLVILFSLPCYAAWSVFISPILRRRLD
DKFSRNADNQSFLVESVTAINTIKAMAVSPQMTNIWDKQLAGYVAAGFKVTVLATIGQQGIQLIQKTVMIINLWLGAHLV
ISGDLSIGQLIAFNMLAGQIVAPVIRLAQIWQDFQQVGISVTRLGDVLNSPTESYHGKLALPEINGNITFRNIRFRYKPD
SPVILDNINLSIKQGEVIGIVGRSGSGKSTLTKLIQRFYIPENGQVLIDGHDLALADPNWLRRQVGVVLQDNVLLNRSII
DNISLANPGMSVEKVIYAAKLAGAHDFISELREGYNTIVGEQGAGLSGGQRQRIAIARALVNNPKILIFDEATSALDYES
EHIIMRNMHKICKGRTVIIIAHRLSTVKNADRIIVMEKGKIVEQGKHKELLSEPESLYSYLYQLQSD
>P15321 ~~~shlB~~~Hemolysin transporter protein ShlB~~~
MIKKITALTLLVSTALSAETLPDSHMMQDMSMGESRRALQDSTREVNQLIEQRRYQQLKQQRLLAEPAAPALPQSAQCLP
IAGVYLQGVTLLSPADLSALSGLPEQCISSNDINRLTRELTRLYVQKGYITARVQIVRPNSQGELGLSVTEGFIEKIEGG
DRWVNSRLLFPGLEGKPLKLTELDQGLDQANRLQSNTTKLDILPGRQVGGSVIRLRNQHAKPWLITAGTDNYGQKSTGRW
LARATATLDSPFGLSDFVSLNANSTLENPAHRYNRAYTLLYSLPYGAFTFSGFASFSSYENHQQLPHNVVKLHGQTQQYG
LRSDYVFYRDHDQIDSLSGQLTYKRIDNYFESVRLEVSSPTLTLAELSASHLQILPNGVFSANLSVEQGMPWLGAGRHPS
SVHLDSQFTKGKLFANLSQRLRLGDATYQLNNLFYGQYSRDPLPGVEWLSLTDRSAVRGFSRSTQSGDNGWYLQNTLSRS
FNLGATTLTPRLGADVGRILPRQDNSGWRSSAGISTGATLRYQRALVDLEVSRGWILSNHATPEDPVQVLARFSYTF
>P06736 2.3.1.-~~~hlyC~~~Protein-lysine myristoyltransferase HlyC~~~
MNINKPLEILGHVSWLWASSPLHRNWPVSLFAINVLPAIQANQYVLLTRDDYPVAYCSWANLSLENEIKYLNDVTSLVAE
DWTSGDRKWFIDWIAPFGDNGALYKYMRKKFPDELFRAIRVDPKTHVGKVSEFHGGKIDKQLANKIFKQYHHELITEVKR
KSDFNFSLTG
>O05961 ~~~tlyC~~~Hemolysin C~~~COG1253
MLKSSKHEDSSSKKNQNNKLIFIVRQLFYLIKHFFSKTKTPDNFFGIIKRLKINSQKMSLDEFNILANLLKLEDKIVEDI
MVPRSDIIAIKLTTNLEELSESIKIAVPHTRTLIYDGTLDNVVGFIHIKDLFKALATKQNSPLKRLIRKHIIAAPSMKLL
DLLAKMRRERTHIAIVVDEYGGTDGLVTIEDLIEEIVGRIDDEHDQQLDSANFKVINNSTIIANARIEVELLEEIIGEKL
KNDDDEFDTIGGLVLTRVSSVPAIGTRIDISENIEIEVTDATPRSLKQVKIRLKNGLNSDNLT
>Q68W10 ~~~tlyC~~~Hemolysin C~~~COG1253
MLKSSKKEDSSSKKNQDNKLIFIVRQLFYLIKHLFSKTKTPDNFFGIIKRLKINSKKMSLDECNILANLLQLENKTVEDI
MVPRSDIVAIKLTTNLAELSESIKIEVPHTRTLIYDGTLDNVVGFIHIKDLFKALATKQNSTLKRLIRKHIIAAPSMKLL
DLLAKMRRERTHIAIVVDEYGGTDGLVTIEDLIEEIVGRIDDEHDQQLDSTNFKVINNSTIIANARIEVELLEEIIKEKI
KNDDDEFDTIGGLVLTRVSSVPAIGTRIDISENIEIEVIDATPRSLKQVKIRLKNGLNRDSFNLT
>P09986 ~~~hlyD~~~Hemolysin secretion protein D, chromosomal~~~
MKTWLMGFSEFLLRYKLVWSETWKIRKQLDTPVREKDENEFLPAHLELIETPVSRRPRLVAYFIMGFLVIAFILSVLGQV
EIVATANGKLTLSGRSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQIRYQILSRSI
ELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTILARINRYENVSRVEKSRLDDF
RSLLHKQAIAKHAVLEQENKYVEAANELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDSIELLTLELEK
NEERQQASVIRAPVSGKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGY
LVGKVKNINLDAIEDQKLGLVFNVIVSVEENDLSTGNKHIPLSSGMAVTAEIKTGMRSVISYLLSPLEESVTESLHER
>P06739 ~~~hlyD~~~Hemolysin secretion protein D, plasmid~~~
MKTWLMGFSEFLLRYKLVWSETWKIRKQLDTPVREKDENEFLPAHLELIETPVSRRPRLVAYFIMGFLVIAFILSVLGQV
EIVATANGKLTHSGRSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSI
ELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDF
SSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAK
NEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGY
LVGKVKNINLDAIEDQRLGLVFNVIISIEENCLSTGNKNIPLSSGMAVTAEIKTGMRSVISYLLSPLEESVTESLRER
>P77335 ~~~hlyE~~~Hemolysin E, chromosomal~~~
MTEIVADKTVEVVKNAIETADGALDLYNKYLDQVIPWQTFDETIKELSRFKQEYSQAASVLVGDIKTLLMDSQDKYFEAT
QTVYEWCGVATQLLAAYILLFDEYNEKKASAQKDILIKVLDDGITKLNEAQKSLLVSSQSFNNASGKLLALDSQLTNDFS
EKSSYFQSQVDKIRKEAYAGAAAGVVAGPFGLIISYSIAAGVVEGKLIPELKNKLKSVQNFFTTLSNTVKQANKDIDAAK
LKLTTEIAAIGEIKTETETTRFYVDYDDLMLSLLKEAAKKMINTCNEYQKRHGKKTLFEVPEV
>P14711 ~~~~~~Hemolysin, heat labile~~~
FELPSIPFPSPGSDEILFVVRDTTFNTKEPVNVKVSDFWTNRNVKRKPYEDVYGQSVFTTSGSKWLTSYMTVSINNKDYT
MAAVSGYKDGFSSVFVKSGQIQLQHYYNSVADFVGGDENSIPSKTYLDETPSYFVNVEAYESGSGNILVMCISNKESYFE
CEEQQ
>P52695 ~~~hlyU~~~Transcriptional activator HlyU~~~COG0640
MPYLKGAPMNLQEMEKNSAKAVVLLKAMANERRLQILCMLLDNELSVGELSSRLELSQSALSQHLAWLRRDGLVNTRKEA
QTVFYTLSSTEVKAMIELLHRLYCQANQ
>O52791 1.13.11.46~~~~~~4-hydroxymandelate synthase~~~
MQNFEIDYVEMYVENLEVAAFSWVDKYAFAVAGTSRSADHRSIALRQGQVTLVLTEPTSDRHPAAAYLQTHGDGVADIAM
ATSDVAAAYEAAVRAGAEAVRAPGQHSEAAVTTATIGGFGDVVHTLIQRDGTSAELPPGFTGSMDVTNHGKGDVDLLGID
HFAICLNAGDLGPTVEYYERALGFRQIFDEHIVVGAQAMNSTVVQSASGAVTLTLIEPDRNADPGQIDEFLKDHQGAGVQ
HIAFNSNDAVRAVKALSERGVEFLKTPGAYYDLLGERITLQTHSLDDLRATNVLADEDHGGQLFQIFTASTHPRHTIFFE
VIERQGAGTFGSSNIKALYEAVELERTGQSEFGAARR
>Q59465 7.2.2.12~~~cadA~~~Cadmium, zinc and cobalt-transporting ATPase~~~COG2217
MQEYHIHNLDCPDCASKLERDLNELDYVKKAQINFSTSKLFLDTSDFEKVKAFIKQNEPHLSLSFKEATEKPLSFTPLII
TIMVFLGAILILHLNPSPLIEKAMFFVLALVYLVSGKDVILGAFRGLRKGQFFDENALMLIATIAAFFVGAYEESVSIMV
FYSAGEFLQKLAVSRSKKSLKALVDVAPNLAYLKKGDELVSVAPEDLRVNDIVVVKVGEKVPVDGVVVKGESLLDERALS
GESMPVNVSENSKVLGGSLNLKAVLEIQVEKMYKDSSIAKVVDLVQQATNEKSETEKFITKFSRYYTPSVLFIALMIAVL
PPLFSMGSFDEWIYRGLVALMVSCPCALVISVPLGYFGGVGAASRKGILMKGVHVLEVLTQAKSIAFDKTGTLTKGVFKV
TDIVPQNGHSKEEVLHYASCSQLLSTHPIALSIQKACEEMLKDDKHQHDIKNYEEVSGMGVKAQCHTDLIIAGNEKMLDQ
FHIAHSPSKENGTIVHVAFNQTYVGYIVISDEIKDDAIECLRDLKVQGIENFCILSGDRKSATESIAQTLGCEYHASLLP
EEKTSVFKTFKERYKAPAIFVGDGINDAPTLASADVGIGMGKGSELSKQSADIVITNDSLNSLVKVLAIAKKTKSIIWQN
ILFALGIKAVFIVLGLMGVASLWEAVFGDVGVTLLALANSMRAMRA
>E4QP00 1.1.3.47~~~~~~5-(hydroxymethyl)furfural oxidase~~~
MTDTIFDYVIVGGGTAGSVLANRLSARPENRVLLIEAGIDTPENNIPPEIHDGLRPWLPRLSGDKFFWPNLTIHRAAEHP
GITREPQFYEQGRLLGGGSSVNMVVSNRGLPRDYDEWQALGADGWDWQGVLPYFIKTERDADYGDDPLHGNAGPIPIGRV
DSRHWSDFTVAATQALEAAGLPNIHDQNARFDDGYFPPAFTLKGEERFSAARGYLDASVRVRPNLSLWTESRVLKLLTTG
NAITGVSVLRGRETLQVQAREVILTAGALQSPAILLRTGIGPAADLHALGIPVLADRPGVGRNLWEHSSIGVVAPLTEQA
RADASTGKAGSRHQLGIRASSGVDPATPSDLFLHIGADPVSGLASAVFWVNKPSSTGWLKLKDADPFSYPDVDFNLLSDP
RDLGRLKAGLRLITHYFAAPSLAKYGLALALSRFAAPQPGGPLLNDLLQDEAALERYLRTNVGGVWHASGTARIGRADDS
QAVVDKAGRVYGVTGLRVADASIMPTVPTANTNLPTLMLAEKIADAILTQA
>Q9AQI0 4.1.3.16~~~proA~~~4-hydroxy-4-methyl-2-oxoglutarate aldolase/4-carboxy-4-hydroxy-2-oxoadipate aldolase~~~
MYELGVVYRNIQRADRAAADGLAALGSATVHEAMGRVGLLKPYMRPIYAGKQVSGTAVTVLLQPGDNWMMHVAAEQIQPG
DIVVAAVTAECTDGYFGDLLATSFQARGARALIIDAGVRDVKTLQEMDFPVWSKAISSKGTIKATLGSVNIPIVCAGMLV
TPGDVIVADDDGVVCVPAARAVEVLAAAQKRESFEGEKRAKLASGVLGLDMYKMREPLEKAGLKYID
>A5W059 4.1.3.17~~~~~~4-hydroxy-4-methyl-2-oxoglutarate aldolase/4-carboxy-4-hydroxy-2-oxoadipate aldolase~~~COG0684
MNTLIGKTGIVVRNIQRAELDSIDALGRLGVATVHEAQNRKGLLSSKMRPIQQGTSLAGSAVTVLVAPGDNWMFHVAVEQ
CRPGDVLVVSPSSPCTDGYFGDLLATSLQARGVRALIVDAGVRDTQTLRDMGFAVWARAINAQGTVKETLGSVNLPVICG
GQLINPGDIVVADDDGVVVVRRDECESTLVAAAERAGLEEEKRLRLAAGELGLDIYKMRERLEAKGLRYVDNIEDLEG
>O34873 4.1.3.4~~~yngG~~~Hydroxymethylglutaryl-CoA lyase YngG~~~COG0119
MPYPKKVTIKEVGPRDGLQNEPVWIATEDKITWINQLSRTGLSYIEITSFVHPKWIPALRDAIDVAKGIDREKGVTYAAL
VPNQRGLENALEGGINEACVFMSASETHNRKNINKSTSESLHILKQVNNDAQKANLTTRAYLSTVFGCPYEKDVPIEQVI
RLSEALFEFGISELSLGDTIGAANPAQVETVLEALLARFPANQIALHFHDTRGTALANMVTALQMGITVFDGSAGGLGGC
PYAPGSSGNAATEDIVYMLEQMDIKTNVKLEKLLSAAKWIEEKMGKPLPSRNLQVFKSS
>Q9FD71 2.3.3.10~~~mvaS~~~Hydroxymethylglutaryl-CoA synthase~~~COG3425
MTIGIDKISFFVPPYYIDMTALAEARNVDPGKFHIGIGQDQMAVNPISQDIVTFAANAAEAILTKEDKEAIDMVIVGTES
SIDESKAAAVVLHRLMGIQPFARSFEIKEACYGATAGLQLAKNHVALHPDKKVLVVAADIAKYGLNSGGEPTQGAGAVAM
LVASEPRILALKEDNVMLTQDIYDFWRPTGHPYPMVDGPLSNETYIQSFAQVWDEHKKRTGLDFADYDALAFHIPYTKMG
KKALLAKISDQTEAEQERILARYEESIIYSRRVGNLYTGSLYLGLISLLENATTLTAGNQIGLFSYGSGAVAEFFTGELV
AGYQNHLQKETHLALLDNRTELSIAEYEAMFAETLDTDIDQTLEDELKYSISAINNTVRSYRN
>Q0QLF5 1.1.1.291~~~Hgd~~~2-(hydroxymethyl)glutarate dehydrogenase~~~
MEKSIKIGFIGLGAMGKPMAINLLKEGVTVYAFDLMEANVAAVVAQGAQACENNQKVAAASDIIFTSLPNAGIVETVMNG
PGGVLSACKAGTVIVDMSSVSPSSTLKMAKVAAEKGIDYVDAPVSGGTKGAEAGTLTIMVGASEAVFEKIQPVLSVIGKD
IYHVGDTGAGDAVKIVNNLLLGCNMASLAEALVLGVKCGLKPETMQEIIGKSSGRSYAMEAKMEKFIMSGDFAGGFAMDL
QHKDLGLALEAGKEGNVPLPMTAMATQIFEGGRAMGLGREDMSAVIKVWEQMTGVSVSGGQ
>O31534 1.14.14.18~~~hmoA~~~Heme-degrading monooxygenase HmoA~~~COG2329
MFVQLRKMTVKEGFADKVIERFSAEGIIEKQEGLIDVTVLEKNVRRGDEEVVVMIRWESEDHWKQWEKSDAHIAGHKANK
GKPKPDYLISTEVSMYHVRAVKQGTYNQ
>P38049 1.14.14.18~~~hmoB~~~Heme-degrading monooxygenase HmoB~~~COG2329
MKVYITYGTADFLKTIVQKHPSENILLMQGQENAILIHETNGDTVFQAPHAYEVIDQVGEIKHPGFAVLNNIAVTQEGRP
LFENRFKNRAGKVENEPGFEAIRVLRPLDSDTYVILTLWETESAFQDWQQSGSYKEAHKKRDTSAGIDTTSIFSRPSYVT
TYFAVE
>O52792 1.1.3.46~~~hmo~~~4-hydroxymandelate oxidase~~~
MTYVSLADLERAARDVLPGEIFDFLAGGSGTEASLVANRTALERVFVIPRMLRDLTDVTTEIDIFGRRAALPMAVAPVAY
QRLFHPEGELAVARAARDAGVPYTICTLSSVSLEEIAAVGGRPWFQLYWLRDEKRSLDLVRRAEDAGCEAIVFTVDVPWM
GRRLRDMRNGFALPEWVTAANFDAGTAAHRRTQGVSAVADHTAREFAPATWESVEAVRAHTDLPVVLKGILAVEDARRAV
DAGAGGIVVSNHGGRQLDGAVPGIEMLGEIVAAVSGGCEVLVDGGIRSGGDVLKATALGASAVLVGRPVMWALAAAGQDG
VRQLLELLAEEVRDAMGLAGCESVGAARRLNTKLGVV
>Q8Y563 1.14.-.-~~~~~~Heme-degrading monooxygenase~~~COG2329
MKKVFITTGTEHYLRQLMANYTGGNVTLLQNFSQSLLYQESTGEKLFQEGAEYRVLQSSGSIKGFGVVVFEYIHLRDEEI
PIFLQMYQRASLHFSETPGLQSTKLTKAMNMNKFLIISFWDSEVFFHDWKKSPLSKEITNIMRKNNTQSGFSHEDIYHYP
EFSHDAK
>Q988D0 4.1.1.51~~~~~~3-hydroxy-2-methylpyridine-4,5-dicarboxylate 4-decarboxylase~~~COG0235
MRRKVFEELVTATKILLNEGIMDTFGHISARDPEDPASFFLAQKLAPSLITVDDIQRFNLDGETSDNRPSYLERYIHSEI
YKTRPDVQCVLHTHSPAVLPYCFVDTPLRPVTHMGAFIGESVPVYEIRDKHGDETDLFGGSPDVCADIAESLGSQTVVLM
ARHGVVNVGKSVREVVFRAFYLEQEAAALTAGLKIGNVKYLSPGEIKTAGKLVGAQIDRGWNHWSQRLRQAGLA
>A2RIH3 ~~~hmpT~~~Thiamine precursor transporter HmpT~~~COG4720
MKLMDNKNIKKLTLLAIWTALTFVLGRLFTFPIPGSAGNILTLLDVGIYTAVFLFGKREAAIIGGFAAFLLDLTAGFSNY
MFFSLIIHGGQGYLAGLTRYKWLNFLLSLLVMVGGYFIVGGLMYGWGSAIAGLWVNIVQVIVGFVLAKVLSPLIERTGIL
NGFRKA
>P49852 1.14.12.17~~~hmp~~~Flavohemoprotein~~~COG1017
MLDNKTIEIIKSTVPVLQQHGETITGRFYDRMFQDHPELLNIFNQTNQKKKTQRTALANAVIAAAANIDQLGNIIPVVKQ
IGHKHRSIGIKPEHYPIVGKYLLIAIKDVLGDAATPDIMQAWEKAYGVIADAFIGIEKDMYEQAEEQAGGWKEYKPFVIA
KKERESKEITSFYLKPEDSKPLPEFQAGQYISIKVRIPDSEYTHIRQYSLSDMPGKDYYRISVKKDGVVSSYLHDGLQEG
DSIEISAPAGDFVLDHASQKDLVLISAGVGITPMISMLKTSVSKQPERQILFIHAAKNSEYHALRHEVEEAAKHSAVKTA
FVYREPTEEDRAGDLHFHEGQIDQQFLKELIANTDADYYICGSPSFITAMHKLVSELGSAPESIHYELFGPQLSLAQSV
>P39662 1.14.12.17~~~hmp~~~Flavohemoprotein~~~COG1017
MLTQKTKDIVKATAPVLAEHGYDIIKCFYQRMFEAHPELKNVFNMAHQEQGQQQQALARAVYAYAENIEDPNSLMAVLKN
IANKHASLGVKPEQYPIVGEHLLAAIKEVLGNAATDDIISAWAQAYGNLADVLMGMESELYERSAEQPGGWKGWRTFVIR
EKRPESDVITSFILEPADGGPVVNFEPGQYTSVAIDVPALGLQQIRQYSLSDMPNGRSYRISVKREGGGPQPPGYVSNLL
HDHVNVGDQVKLAAPYGSFHIDVDAKTPIVLISGGVGLTPMVSMLKVALQAPPRQVVFVHGARNSAVHAMRDRLREAAKT
YENLDLFVFYDQPLPEDVQGRDYDYPGLVDVKQIEKSILLPDADYYICGPIPFMRMQHDALKNLGIHEARIHYEVFGPDL
FAE
>P24232 1.14.12.17~~~hmp~~~Flavohemoprotein~~~COG1017
MLDAQTIATVKATIPLLVETGPKLTAHFYDRMFTHNPELKEIFNMSNQRNGDQREALFNAIAAYASNIENLPALLPAVEK
IAQKHTSFQIKPEQYNIVGEHLLATLDEMFSPGQEVLDAWGKAYGVLANVFINREAEIYNENASKAGGWEGTRDFRIVAK
TPRSALITSFELEPVDGGAVAEYRPGQYLGVWLKPEGFPHQEIRQYSLTRKPDGKGYRIAVKREEGGQVSNWLHNHANVG
DVVKLVAPAGDFFMAVADDTPVTLISAGVGQTPMLAMLDTLAKAGHTAQVNWFHAAENGDVHAFADEVKELGQSLPRFTA
HTWYRQPSEADRAKGQFDSEGLMDLSKLEGAFSDPTMQFYLCGPVGFMQFTAKQLVDLGVKQENIHYECFGPHKVL
>P26353 1.14.12.17~~~hmp~~~Flavohemoprotein~~~
MLDAQTIATVKATIPLLVETGPKLTAHFYDRMFTHNPELKEIFNMSNQRNGDQREALFNAIAAYASNIENLPALLPAVEK
IAQKHTSFQIKPEQYNIVGTHLLATLDEMFNPGQEVLDAWGKAYGVLANVFIHREAEIYHENASKDGGWEGTRPFRIVAK
TPRSALITSFEFEPVDGGTVAEYRPGQYLGVWLKPEGFAHQEIRQYSLTRKPDGKGYRIAVKREDGGQVSNWLHHHASVG
DGVHLAAPAGDFFMNVAADTPVSLISAGVGQTPMLAMLDTLAKEQHTAQVNWFHAAENGDVHAFADEVSELGRTLPRFTA
HTWYREPTESDRAQRLFDSEGLMDLSKLEAAISDPAMQFYLCGPVGFMQFAAKQLVSLGVNNENIHYECFGPHKVL
>Q9KMY3 1.14.12.17~~~hmp~~~Flavohemoprotein~~~COG1017
MLTQEHINIIKSTIPLLESAGPALTQHFYQRMFSHNPELKHIFNMTHQKTGRQSVALFEAIAAYAKHIDNLAALTSAVER
IAHKHTSFNIQPEHYQIVGHHLLETLRELAPDAFTQPVEEAWTAAYFFLAQVFIDREGALYLERKQALGGWRDGRTFVVR
EKQVESAYVTSFVLVPADGGAVLDYQPGQYIGIEVTPEGSDYREIRQYSLSHASNGREYRISVKREGVGSDNPGLVSHYL
HNNVKVGDSVKLYAPAGDFFYVERERPVVLISAGVGATPMQAILHTLAKQNKSGVTYLYACNSAKEHTFAQETAQLIAQQ
GWMQQVWYRDESADDVLQGEMQLAELILPIEDGDFYLCGPIGFMQYVVKQLLALGVDKARIHYEVFGPHAQLAA
>Q9X5X4 ~~~hmrR~~~HTH-type transcriptional regulator HmrR~~~
MNIGEASKVSGVSSKMIRYYEQIGLISPAVRTASSYRTYGDNDVHTLRFIRRARDLGFSVEQIKELLALWRDRSRASSDV
KAVALEHIAELERKIAAIQDMTRTLKHLASHCHGDGRPDCPIIEEMAKGGGAAKTEINPRFGVASLK
>P71119 1.14.14.18~~~hmuO~~~Heme oxygenase~~~
MTTATAGLAVELKQSTAQAHEKAEHSTFMSDLLEGRLGVAEFTRLQEQAWLFYTALEQAADAVRASGFAESLLDPALNRA
EVLARDLDKLNDGSEWRSRITASPAVIDYVNRLEEIRDNVDGPALVAHHYVRYLGDLSGGQVIARMMQRHYGVDPEALGF
YHFEGIAKLKVYKDEYREKLNNLELSDEQRENLLKEATDAFVFNHQVFADLGKGL
>Q56991 ~~~hmuT~~~Hemin-binding periplasmic protein HmuT~~~COG4558
MRLRLLSLPFILSLCAPLLPLNTLAAERIVTIGGDVTEIAYALGAGDEIVARDSTSQQPQAAQKLPDVGYMRTLNAEGIL
AMKPTMLLVSELAQPSLVLTQIASSGVNVVTVPGQTTPESVAMKINAVATALHQTEKGQKLIEDYQQRLAAVNKTPLPVK
VLFVMSHGGLTPMAAGQNTAADAMIRAAGGSNAMQGFSRYRPLSQEGVIASAPDLLLITTDGVKALGSSENIWKLPGMAL
TPAGKHKRLLVVDDMALLGFGLETPQVLAQLREKMEQMQ
>Q56992 ~~~hmuU~~~Hemin transport system permease protein HmuU~~~COG0609
MNGRVQPRLMLGFLLILLVILALGSANMGALSLSFRTLWNTSTNDAMWHIWLNIRLPRVLLAVVVGCALAVSGTIMQGLF
RNPLADPGLLGISSGAALCVGLIIVMPFSLPPLLALYSHMVGAFIGSLAISTIIFTLSRWGHGNLARLLLAGIAINALCG
AAVGVLTYISDDQQLRQFSLWSMGSLGQAQWSTLLVASSLILPTCILGLLQARQLNLLQLGDEEAHYLGVNVRQAKLRLL
LLSAILIGAAVAVSGVIGFIGLVVPHLIRMRIGADHRWLLPGAALGGACLLLTADTLARTLVAPAEMPVGLLTSLLGGPY
FLWLILRQREQRSG
>Q8L1U3 7.6.2.-~~~hmuV~~~Hemin import ATP-binding protein HmuV~~~
MTLQAQDLSVDRGAKRILTQVSLTLEPGRMLGLLGANGAGKSTLLACLSGELEPVCGHIEINGKPLRSLASAKQARLRAV
LPQKPSLSFDLGVREVVGMGAYPYAELSPADVDALCEKALRQAGVSHLAGRRYLELSGGEQQRVQFARVLMQCQAAPAGQ
PRYLMLDEPISNLDPRHQIDVLRTAHDLAREAGVGVLVIVHDVNLSARWCDRLLLLAQGSVVADGAPAEVLTPANLRRVY
GVEADVLPHPREAGTLLVLMR
>O70014 7.6.2.-~~~hmuV~~~Hemin import ATP-binding protein HmuV~~~
MISAQNLVYSLQGRRLTDNVSLTFPGGEIVAILGPNGAGKSTLLRQLTGYLQPDSGECRLFNKPLNEWSITELAKHRAVM
RQNSHMAFPFSVQEVIQMGRHPHRTGNQDNETAQIMALCDCQALANRDYRQLSGGEQQRVQLARLLVQLWEPTPSPKWLF
LDEPTSALDIHHQQHLFRLLRQLVHERQFNVCCVLHDLNLAARYADRIVLMQKGKVIANGKPQDVLTQQELTMLYGADIT
VLEDPANHSPLIVLDH
>Q70YG7 7.6.2.-~~~hmuV~~~Hemin import ATP-binding protein HmuV~~~COG4559
MSTTAAIQASNISVTFGHRTILDKIDIEIFSGQVTALLGPNGAGKSTLLKILSGEISSTGKMAYFGVPQALWQPNELAKH
LAILPQQSTLSFPFIAQEVVELGALPLNLSHQQVSEVALHYMQQTDISDRANNLYPALSGGEKQRLHLARVLTQLHHSGD
KKILMLDEPTSALDLAHQHNTLRIARSLAHQEQCAVVVVLHDLNLAAQYADRMVMLHNGKLVCDAPPWEALNAERIEQVY
GYSSLVAAHPTMDFPMVYPI
>P74981 7.6.2.-~~~hmuV~~~Hemin import ATP-binding protein HmuV~~~COG4559
MVDTAVVDTALLEANQLSYHVQGQKLINNVSLQIASGEMVAIIGPNGAGKSTLLRLLTGYLAPSEGHCQLLGKNLNSWQP
QALARTRAVMRQYSDLAFPFSVSEVIQMGRAPYGAAQNRQALQEVMAQTDCLALAQRDYRALSGGEQQRVQLARVLAQLW
QPEPTSRWLFLDEPTSALDLYHQQHTLRLLRQLTLEEPLAVCCVLHDLNLAALYADRILLLAQGELVACGTPEEVLNAET
LTRWYQADLGISRHPESALPQIYLRQ
>Q56993 7.6.2.-~~~hmuV~~~Hemin import ATP-binding protein HmuV~~~COG4559
MVDMAVTPVALLEASHLHYHVQQQALINDVSLHIASGEMVAIIGPNGAGKSTLLRLLTGYLSPSHGECHLLGQNLNSWQP
KALARTRAVMRQYSELAFPFSVSEVIQMGRAPYGGSQDRQALQQVMAQTDCLALAQRDYRVLSGGEQQRVQLARVLAQLW
QPQPTPRWLFLDEPTSALDLYHQQHTLRLLRQLTRQEPLAVCCVLHDLNLAALYADRIMLLAQGKLVACGTPEEVLNAET
LTQWYQADLGVSRHPESALPQIYLRQ
>Q50365 ~~~hmw1~~~Cytadherence high molecular weight protein 1~~~
MKKSKEAVFEDKDYTEENPEQIFGNLYDGKLTVQDGKVKIAYDGDGNGYYIAFNSETGVYYDPYGDTEYDISVLFDANGN
SFVFADAPTVEVLAGEQEQTEAEPDYLQYVGNEAYGYYDEAGEWVWSGYFEGDQWISTLPQTEAEEKQFGFEDNIETTPT
ASEDFGLEADVPAPEVAEPSYEVQPEVAAEPVYDVQPEVAVEPVGETTATVEPQAVEIQPEVVVEPIVESQLEQPVEVQA
EMVQPEVAVEPQLEVSLDPIGETAPILEQVEPQAVQTQPEIPAEQSAVELQPEPVAEVQSEMVQPEAAAEPVTEAQQTEP
TPVVETIAEITPQVVTEPVVAVVEHQPEAVAEPLPVEPAVAGVSELIPTEQVQPEVVVESTPVAEVQSEMVQPEVAVEPI
VEPQPEQPVEVQPEVITTPEVASVLEVQPENPVVEVEQVVEPQPETPVEVQPEPVVETVQEAVAEPTQVVEPQPQAAPQP
AVYEWNLTPEAAPVEQPEVIPVTVVESQATATAEPQPAVAPVADMDYVLHLTDTVKNQPQTAPVQPTTPIKIEVAESTPT
VTTSPVEPTIAPPLFEIELNNTTSSDLPLVEVVDFKHNQHGAVGTHSFDDFTPPEVGMESKTHCHSNSEVVWRVSEPKTV
PVPPAVSSINIQTVNRVVEPTISTPTTPVVESAPAIEIFVDTPPVETKEASSNVDVVQQPVKPLMPVMVEQLRTTELQPT
TEINLFANSDINSIIAELKQGRSNPAINFDDIFKMSSYQMVVKKSFVQISDFITNSKTDITNRFLLIKKELQAELTRLIE
ENEQLKAEFLNAKDLSVYQKDELLRSLSNDFTIAHRPSDSYEQLQKSGELVRNIQKAILENESKIKNIQITLKELKAVYK
LCSDTVLNGMAKLDSVLRFNKKEKDPLLLNSMETLSSFETEPQAIIEDLLDFSSSFDKMSNEQLDEFVYQNLDSGLNLDL
DGFDHQLSSMNIHGLEPLDPMKLDDFDFETLTPDKTSNLSSILDDELMENGGDFNLDY
>P75471 ~~~hmw2~~~Cytadherence high molecular weight protein 2~~~
MNDTDKKFPLQPVYDTGFDDGYLQRDYEKCLESAAANDAQTVELQTQLLAEIKNLENEIKALKAQESRQPDPHNNARIQS
LEASLNRLVNEYNNFEFQKNYMVDRVAELNNKARFFKDELKRLQQENAAFVNSRYANWADFQSNYQLKLDQFQALIDQQN
QTIKQLNEQIAANQGLIDQNVQRLQQNHSLDQQERDALLYEVDHLYNELYELENQKRLVGIEYEATYQDLVSADAELQNV
YETIAQNQANFQKQCDAYWAQLKQVEQQIQTTKQELVDEESTLKVRLNDADFYINSRLAELDDLTSKINERDFVSKEQAQ
DVKASLANLTKEKERLSAEKDSFERLRNTALNDINRMEQENALFAKHLEQQQYEFERKQQESLLKLETEHKQLQKRIGEF
KIESEAKSEALLIQERELLEKRREIDDLLTQASLEYEQQRRTNQVLKEKHRQVQQHFQNLVHAKKKLDQKRHYLAEQKRI
DEEQIFKLKEKIATERRELEKLYLVKKQKQDQKENDLLIFEKQLRQYQADFENEIEEKQNELFASQKSLQKSFTQLKNKE
AELNQKAQKIAEDWAHLKQNKHHHADLEIFLEGEFNHLQQEKHKLLEARTQFDNRVSLLSARFKQKQAELVKQKQSLEQL
TAAFNKEQEAVERDWKDRLANLEKQKEMLGDKVHQFDENSLNISKKLAERELAIKFKEKELEAAQKQLSLDNNNNAGLKL
QLDKLSESLKTERLELEASKERILDFYDESSRRIADYESDLQARLAEVKTLEKNQQETAAKSERELKVALEKLNQAKKAF
LQIRKQQLLEIASVKQQLAQKANLLKNQQAELDKQTEELEAAFLEQDTDKKELEKALHSVKSKQELLERERSFLLQKQRE
FAEHVAGFKRQVHFKTTQMQRLSEFNKQQQSEQIKRETELKIAFADLKKDYQLFELQKNQEFQQIEQKHKELELLAQKQA
ELKQELEQKATALASQDQDTVQAKLDLARQQHELELRQNAFNQASLSLNKQREQLTNQVKVLHGELKKRHEKLTLKDRLL
AEKEKDQHKKDAEINQRFKQFENEYADFDQAKKRELQELNQIRRNLEQSNASLLKKRNQLTLDFALLRKVQHNTQTNRVQ
LNTQIKEFLLEKKNFQKASDEAALQKALLIKRLRSFASKLQLQREALAIQKLEFDKRDEQQKSEINNAKLQLEQFKLEKQ
NFDEAKQKQLIEFKDQCQRLDVEKRLLKQKLVQLKNLSKSYLTYKNRADLSQQQLQHKYANLLELKEKLQTAKRALDKKH
RAIYGKMAQFVSELRQEKKQLLSAQKQVDDKSRLLEQNQRHLQNLSSETKKKRQSLEHDINKFDQRRKEAVSSILNSHKK
LKQKEGELQGILQKLSLKKTQIEQEFSKLYQQREKLDRQRTTLSKLHRELKAQNEATAHKNREVLEIENYYKKELQRLTT
EKSEFDNNKNRLFEYFRKIRNEIEKKEAHIKTVLEETQKKRHLVETEAVKLHLQKQSIISKGQELKEIKERVSRDISHTN
KQREELNSLLHQNKLLQKNLAEREREINNKDSLLTQKIQTAKQKLSEKEARILKLLEKMRAVEQQYQAEITRLKTRNADL
EKNDNKHLFPPLFKINGNDMNYPYPYPWFYPQQKQEDSSNQIRHLFEQQLQFMQQRYENELTELRRQRALLEKKLDQIQL
ESQLSAKKNDFEKVEQMMQKLLEKTEQKLSAFDQKINALAEQINTQKAEHADSEKQQLLLRIEQLEKQNLAQAVQTPQPV
QPVVQAPAVVPQVIQPQVVQSQPAFLATQQSISKQQQIAQLNAEINSIKKLIAQKAAK
>Q7NBT3 ~~~hlp3~~~Cytadherence high molecular weight protein 3~~~
MIMNPKIHNKILKNLAKLKKKVFTKYAAYDFNFAYDKNGNVYLVGVDNVTNQTFNLIKPVFKFLKKPLPAELYGMDQQPF
YFVNNHHYIDALNSDTGEQELLRYNVIDQSLVNAQTNDLVDPAFYTDLEGYELDLSQYTGSLLDLSNEVISVEQQPVEQE
VNLTPEQVEEAEQVEQQPVDQQQVQQVDPNLNEQPVEGDNQNFTQQYYDQQLGYADQNVDYGYDPQQYTQEQDYVDNTQQ
YDQVQDYVDPNQQYYDDQQQYDQQGYDQGYDQQYDQQGYDQQGYDQGYDQQYDQQYYDDQQQYDEQPDQQVKAVVEQVVD
EVVEEQQPVEVAKPAPTKPVGPKPQPGKKATKYVIKKPEPKPKVVKEEPIEPAVEKEEVVTVVEQVVDQPVQVAEVQPEP
VVVADDEIKLASEQPVKKKINLDDLQQIPVVIKLPKFETPKLPEPKADSEQKEEIAVKVVEQPVENPQVQETKHHHALPK
VKIEKRQEVELVPSKLDDHYDLIEEEDDFFVDKFKFEDIKLSDLLVEQKPIEVNQPVQQPVVLEQSTPSVQAQPQSVEPK
LEITKLEELVEIKTDNTESLNKLETLIDENKKIIDQFKQLKEEAKKSNSNINLEKVAKQLVDYLTNKLNEKTAALNKPEP
STVELNKVEQAKQKAVEKLVHEQVVFQPREKVVQQPKEVVAKPYFEESDDLLTSVSNKPKQPTSELLDFLVQQVVDGEED
DLPPPTNFDKWPNQNVRQKLDEINQVEAQRFNQTQFVPPQSLNQVETPNQRLFLEPEIQVQPQALYTASREHEQVQPKAQ
HQQPTTRIEREEVVNKFQREPLVSPNRLAYHSNKEFDDLYQNHYEQRTARINPQDSYYDQGYEQPDPYQEQQPYPQEQYL
DPRYQQQVDPRYQKETYQEYNRPFPPNQEYDYYPPAYESRRDYQPYQPRRVNYEVRKPLAYEFSKQPAPRRYQQLPNRYN
ESDQSRQLAYPVHKGTLRTEADFLRFREGYGYDYDRPSTQYYRSNYDTYVREVRRPIRQLGMIEPVAEFRSRTLAPRRVA
RPTYGLRRVSRIPSLAPRGYNQQPRVRRVPVSRGYW
>Q50360 ~~~hmw3~~~Cytadherence high molecular weight protein 3~~~
MTDKERAKLAKAYGKLAQKIQKSYPDINVVYGRDAKNKLHALYQDPETGNIFSLEKRKQLPADYPLFELDSDEPISFAPK
IIPLTAFDGNNNEVIVQYDQVNNTFYDQDGNVLDVSGYRDGENIPLVDYLNYGGSTASADTTTSEPLSGEGYPDIDAGLP
VVDPDATPEQQADQLFGLDPLPQAPDEYQDTTAPPAYDQTFDQATYDQQAYDQNYDPNAYYDQQAYDQSFDQQAYDQAYD
ANAYNTQNYDQAHDPNAYYDSQAYSDPDQASAVAPIEVAPLQPEPVAPVVEPTAVPIVESAPIVEVTPTVEPTPTPVVET
APVVEAPKVVEPTPTPVVEATPAPKVEPKVVEQPQPTPVTVEVDSPKVEIPKVVTAKVALQVAQPTPVPAVPKVAPQPTP
APVVVQPTAVVQPVVKAEPKVVTPTPAPQVVVTPQVATPKVTPKVVQTTPAVPPVVVQPEVVVQPIIRPTQPEPEWKPSP
ASVVEPQPCQSACVNNESGAITIHTTNRSLLLEKLASLGHLHDASTRTPLPHERYQLAPPSEYVATKYNEPLFNLPAIRN
SWARFTRPTVESTPIASRFTGVTPMAVNYRNPASLNFDSLNSFGAYRSPSSFYPLRRPLELSSLRRNRSSFFNTHRFDLG
SNYTSFTPRYRSPLRGGLSQRFPLRSSWSKEF
>P24092 ~~~hmcA~~~High-molecular-weight cytochrome c~~~COG0484
MRNGRTLLRWAGVLAATAIIGVGGFWSQGTTKALPEGPGEKRADLIEIGAMERFGKLDLPKVAFRHDQHTTAVTGMGKDC
AACHKSKDGKMSLKFMRLDDNSAAELKEIYHANCIGCHTDLAKAGKKTGPQDGECRSCHNPKPSAASSWKEIGFDKSLHY
RHVASKAIKPVGDPQKNCGACHHVYDEASKKLVWGKNKEDSCRACHGEKPVDKRPALDTAAHTACISCHMDVAKTKAETG
PVNCAGCHAPEAQAKFKVVREVPRLDRGQPDAALILPVPGKDAPREMKGTMKPVAFDHKAHEAKANDCRTCHHVRIDTCT
ACHTVNGTADSKFVQLEKAMHQPDSMRSCVGCHNTRVQQPTCAGCHGFIKPTKSDAQCGVCHVAAPGFDAKQVEAGALLN
LKAEQRSQVAASMLSARPQPKGTFDLNDIPEKVVIGSIAKEYQPSEFPHRKIVKTLIAGIGEDKLAATFHIEKGTLCQGC
HHNSPASLTPPKCASCHGKPFDADRGDRPGLKAAYHQQCMGCHDRMKIEKPANTACVDCHKERAK
>Q46505 1.12.1.3~~~hndA~~~NADP-reducing hydrogenase subunit HndA~~~
MQNSTCQAVGECRVPEHAVLPQPLYREVVQFIESLPQKEGHLVTVLHKAQSVFGYLPIEVQQFVADHMEVPLAQVYGVVS
FYTFFTMVPKGKYPISVCMGTACFVKGADKVVHAFKEQLKIDIGDVTPDGRFSIDTLRCVGGCALAPIVMVGEKVYGNVT
PGQVKKILAEY
>Q46506 1.12.1.3~~~hndB~~~NADP-reducing hydrogenase subunit HndB~~~
MSTIRSFEDLKAKRQEILDRKAARNGKTIINVSLATCSIAAGGKVAMEAMQDEVAKNGLTGVEFMQSSCMTYCYAEPTVE
ITLPGKDPVVFGGVDENRARELVTEYVMKGEPVEGIIPVNYERVVL
>Q46507 1.12.1.3~~~hndC~~~NADP-reducing hydrogenase subunit HndC~~~
MAATTTEKKQLRIATRNCGFIDPESIDDYIALRGYEGLAKVLTMTPAEVVDLVKRSGLRGRGGAGFPTGIKWGIALGNKA
DQKYMVCNADEGDPEFMDRAVLEGDPHSVVEAMAIGGYAIGATRGTVYIRAEYPLAIKRLKKAIDDAREYGLLGENIFGS
GFDFDIELKYGAGAFVCGEETALIRSMEGKRGEPVTKPPFPAQSGYWEKPTIVNNVETFANIPAIIINGADWFSGIGTAT
SKGTKVFALAGKIQNVGLIEVPMGISLREVIFDIGGGCPDGKAFKAVQTGGPSGGALANKDLDVAIDYESLAACKSIMGS
GGMVVMDEDDCMVSVAKFFLDFTMDETCGKCTPCRIGSKRLYEILDRITKGKGTRADLDRLKSLSEIIKDTALCGLGQTM
PNPILSTMDTFANEYEAHVDDKKCPAHVCTALLTYTIDPAKCTGCGLCTRVCPVECISGTKKQPHTIDTTRCIKCGACYD
KCKFDSIIKQ
>Q46508 1.12.1.3~~~hndD~~~NADP-reducing hydrogenase subunit HndD~~~
MSMLTITIDGKTTSVPEGSTILDAAKTLDIDIPTLCYLNLEALSINNKAASCRVCVVEVEGRRNLAPSCATPVTDNMVVK
TNSLRVLNARRTVLELLLSDHPKDCLVCAKSGECELQTLAERFGIRESPYDGGEMSHYRKDISASIIRDMDKCIMCRRCE
TMCNTVQTCGVLSGVNRGFTAVVAPAFEMNLADTVCTNCGQCVAVCPTGALVEHEYIWEVVEALANPDKVVIVQTAPAVR
AALGEDLGVAPGTSVTGKMAAALRRLGFDHVFDTDFAADLTIMEEGSEFLDRLGKHLAGDTNVKLPILTSCCPGWVKFFE
HQFPDMLDVPSTAKSPQQMFGAIAKTYYADLLGIPREKLVVVSVMPCLAKKYECARPEFSVNGNPDVDIVITTRELAKLV
KRMNIDFAGLPDEDFDAPLGASTGAAPIFGVTGGVIEAALRTAYELATGETLKKVDFEDVRGMDGVKKAKVKVGDNELVI
GVAHGLGNARELLKPCGAGETFHAIEVMACPGGCIGGGGQPYHHGDVELLKKRTQVLYAEDAGKPLRKSHENPYIIELYE
KFLGKPLSERSHQLLHTHYFKRQRL
>P0DV92 ~~~~~~Retron Ec78 putative HNH endonuclease~~~
MKELARLESPEILDQYTAGQNDWMEIDQSAVWPKLTEMQGEFCAYCECRLNRRHIEHFRPRGKFPALTFIWSNLFGSCGD
SKKSGGWSRCGIYKDNGAGAYNADDLIKPDEENPDDYLLFLTTGEVVPAIGLTGRALKKAQETIRVFNLNGDIKLLGSRR
TAVQAIMPNVEYLYSLLEEFEEDDWNEMLRDELEKIESDEFKTALKHAWTSNQEFA
>P0DV93 ~~~~~~Retron Ec83 putative HNH endonuclease~~~
MKRINKTAEDQFLINFKAQNPNGTWDEFRNHEQGILYKRLKQHICNDQMYLCAYCEIDLDRENEHEIKVEHFKSKSGSLP
GGSNWHLEWSNLLAVCLGGTNTGDDFELPANLSCDSYKSHYEDKNKINDKDWTGKILLPLTLPDAHNFFTFEKVTGKLLP
NESYCNTISIDGKPAAETLSIVTKTIEVLNLNCSRLNNARRKLLFHFNNCARERNLRKLHNLLLQWNQGEPKFFQTTRDI
IIRDDRICQGLLNGTIRY
>P0DV99 ~~~~~~Retron Vc95 putative HNH endonuclease~~~
MIPLSHNNTPTELENYVKLKGQSLTIQDFSAHDFQGVKKIVRDRLHTLQGELCVYCEKKYSVDEMQVEHIKPKSGRNAQP
NLCFTYSNYAVSCIQENRKTQTCGQKKKDNILFIEPTSPSCNSHFSLDTDGFINPRGFKNRKEKHSIQTTIDMLGLNKPH
LQLERKKQIERLIYILKATKHNRHELTNKFIKSGNFKYILRELTM
>Q0QLF7 1.3.7.1~~~Hnr~~~6-hydroxynicotinate reductase~~~
MFKIDEEKCKKCRMCVKECPVHAVYYEKKDKGAIVEITEKCVECGICKRVCKFGAIENDAPLESVITCSSCPIQCKVPLG
ETGACTRYRNVGGKLVRDRELVVEALEQKEAADNIKKPIITAVGAGTNYPCSKPAPHIVSECRDGVDVVTVVTEAPLSYS
GLVIKLDTNTYIGEEGDPVYRDGKVVGMVNTEEYGSKMIAIGGANRLTGDNGFATARTIVELANGEEVELKVNKKIVLKL
KAGVAPVIDGVEESIMRIGCGSATVGLFAKRMKDAVDECIVIDHHVIGLCSEHLAGEAVGMTWSGIIPNATKSSRGRYFG
GHGSGIGGTSLETPRDAIKGADMSIAKAGMQVMVVNTTGEIYALFELKADGSFDEIPMTEAALGVALAIQDNCQRSMTSI
LYTGGTGGSARGGVCTHPVKITEAVHEQKAVLTIGGAPAFVYPGGGINFMVDTQKVVNKAFTWVPTPATVAPVEYTMTVA
DYEAMGGHMDQIKDVSEYK
>Q9L5H8 ~~~hns~~~DNA-binding protein H-NS, plasmid~~~
MSEALKSLNNIRTLRAQGRELPLEILEELLEKLSVVVEERRQEESSKEAELKARLEKIESLRQLMLEDGIDPEELLSSFS
AKSGAPKKVREPRPAKYKYTDVNGETKTWTGQGRTPKALAEQLEAGKKLDDFLI
>P0ACF8 ~~~hns~~~DNA-binding protein H-NS~~~COG2916
MSEALKILNNIRTLRAQARECTLETLEEMLEKLEVVVNERREEESAAAAEVEERTRKLQQYREMLIADGIDPNELLNSLA
AVKSGTKAKRAQRPAKYSYVDENGETKTWTGQGRTPAVIKKAMDEQGKSLDDFLIKQ
>A0A0F6B244 ~~~hns~~~DNA-binding protein H-NS~~~
MSEALKILNNIRTLRAQARECTLETLEEMLEKLEVVVNERREEESAAAAEVEERTRKLQQYREMLIADGIDPNELLNSMA
AAKSGTKAKRAARPAKYSYVDENGETKTWTGQGRTPAVIKKAMEEQGKQLEDFLIKE
>A0A0H3NBY9 ~~~hns~~~DNA-binding protein H-NS~~~
MSEALKILNNIRTLRAQARECTLETLEEMLEKLEVVVNERREEESAAAAEVEERTRKLQQYREMLIADGIDPNELLNSMA
AAKSGTKAKRAARPAKYSYVDENGETKTWTGQGRTPAVIKKAMEEQGKQLEDFLIKE
>P0A1S2 ~~~hns~~~DNA-binding protein H-NS~~~
MSEALKILNNIRTLRAQARECTLETLEEMLEKLEVVVNERREEESAAAAEVEERTRKLQQYREMLIADGIDPNELLNSMA
AAKSGTKAKRAARPAKYSYVDENGETKTWTGQGRTPAVIKKAMEEQGKQLEDFLIKE
>P09120 ~~~hns~~~DNA-binding protein H-NS~~~
MSEALKILNNIRTLRAQARECTLETLEEMLEKLEVVVNERREEESAAAAEVEERTRKLQQYREMLIADGIDPNELLNSLA
AVKSGTKAKRAQRPAKYSYVDENGETKTWTGQGRTPAVIKKAMDEQGKSLDDFLIKQ
>P0DOA5 ~~~hns~~~DNA-binding protein H-NS~~~COG2916
MSEALKILNNIRTLRAQARECTLETLEEMLEKLEVVVNERREEDSQAQAEIEERTRKLQQYREMLIADGIDPNELLNAMA
VTKAAATKSKRAARPAKYKYIDENGETKTWTGQGRTPAVIKKAIEEQGKSLDDFLL
>P72849 1.14.14.18~~~pbsA1~~~Heme oxygenase 1~~~COG5398
MSVNLASQLREGTKKSHSMAENVGFVKCFLKGVVEKNSYRKLVGNLYFVYSAMEEEMAKFKDHPILSHIYFPELNRKQSL
EQDLQFYYGSNWRQEVKISAAGQAYVDRVRQVAATAPELLVAHSYTRYLGDLSGGQILKKIAQNAMNLHDGGTAFYEFAD
IDDEKAFKNTYRQAMNDLPIDQATAERIVDEANDAFAMNMKMFNELEGNLIKAIGIMVFNSLTRRRSQGSTEVGLATSEG
>P74133 1.14.14.18~~~pbsA2~~~Heme oxygenase 2~~~COG5398
MTNLAQKLRYGTQQSHTLAENTAYMKCFLKGIVEREPFRQLLANLYYLYSALEAALRQHRDNEIISAIYFPELNRTDKLA
EDLTYYYGPNWQQIIQPTPCAKIYVDRLKTIAASEPELLIAHCYTRYLGDLSGGQSLKNIIRSALQLPEGEGTAMYEFDS
LPTPGDRRQFKEIYRDVLNSLPLDEATINRIVEEANYAFSLNREVMHDLEDLIKAAIGEHTFDLLTRQDRPGSTEARSTA
GHPITLMVGE
>P51015 4.1.3.39~~~bphI~~~4-hydroxy-2-oxovalerate aldolase 4~~~COG0119
MKLEGKKVTVHDMTLRDGMHPKRHQMTLEQMKSIACGLDAAGIPLIEVTHGDGLGGSSVNYGFPAHSDEEYLGAVIPLMK
QAKVSALLLPGIGTVEHLKMAKDLGVNTIRVATHCTEADVSEQHITQSRKLGLDTVGFLMMAHMASPEKLVSQALLMQGY
GANCIYVTDSAGYMLPDDVKARLSAVRAALKPETELGFHGHHNLAMGVANSIAAIEAGATRIDAAAAGLGAGAGNTPMEV
FIAVCARMGIETGVDVFKIQDVAEDLVVPIMDHVIRIDRDSLTLGYAGVYSSFLLFAKRASAKYGVPARDILVELGRRGM
VGGQEDMIEDTAMTMARERGLTLTAA
>A9D857 ~~~~~~Hoefavidin~~~
MNKVLAIVLTITVAGFAQTAFADDHAMSPDMKLLAGASNWVNQSGSVAQFVFTPSPTQPQTYEVSGNYINNAQGTGCKGT
PYPLSGAYYSGNQIISFSVVWSNASANCQSATGWTGYFDFSGSQAVLKTDWNLAFYSGSTPAIQQGQDDFMQSVATVSES
LLTE
>P51020 4.1.3.39~~~mhpE~~~4-hydroxy-2-oxovalerate aldolase~~~COG0119
MNGKKLYISDVTLRDGMHAIRHQYSLENVRQIAKALDDARVDSIEVAHGDGLQGSSFNYGFGAHSDLEWIEAAADVVKHA
KIATLLLPGIGTIHDLKNAWQAGARVVRVATHCTEADVSAQHIQYARELGMDTVGFLMMSHMTTPENLAKQAKLMEGYGA
TCIYVVDSGGAMNMSDIRDRFRALKAELKPETQTGMHAHHNLSLGVANSIAAVEEGCDRIDASLAGMGAGAGNAPLEVFI
AAADKLGWQHGTDLYALMDAADDLVRPLQDRPVRVDRETLALGYAGVYSSFLRHCETAAARYGLSAVDILVELGKRRMVG
GQEDMIVDVALDLRNNK
>P9WMK5 4.1.3.43~~~hsaF~~~4-hydroxy-2-oxohexanoate aldolase~~~COG0119
MTDMWDVRITDTSLRDGSHHKRHQFTKDEVGAIVAALDAAGVPVIEVTHGDGLGGSSFNYGFSKTPEQELIKLAAATAKE
ARIAFLMLPGVGTKDDIKEARDNGGSICRIATHCTEADVSIQHFGLARELGLETVGFLMMAHTIAPEKLAAQARIMADAG
CQCVYVVDSAGALVLDGVADRVSALVAELGEDAQVGFHGHENLGLGVANSVAAVRAGAKQIDGSCRRFGAGAGNAPVEAL
IGVFDKIGVKTGIDFFDIADAAEDVVRPAMPAECLLDRNALIMGYSGVYSSFLKHAVRQAERYGVPASALLHRAGQRKLI
GGQEDQLIDIALEIKRELDSGAAVTH
>P51016 4.1.3.39~~~dmpG~~~4-hydroxy-2-oxovalerate aldolase~~~
MTFNPSKKLYISDVTLRDGSHAIRHQYTLDDVRAIARALDKAKVDSIEVAHGDGLQGSSFNYGFGRHTDLEYIEAVAGEI
SHAQIATLLLPGIGSVHDLKNAYQAGARVVRVATHCTEADVSKQHIEYARNLGMDTVGFLMMSHMIPAEKLAEQGKLMES
YGATCIYMADSGGAMSMNDIRDRMRAFKAVLKPETQVGMHAHHNLSLGVANSIVAVEEGCDRVDASLAGMGAGAGNAPLE
VFIAVAERLGWNHGTDLYTLMDAADDIVRPLQDRPVRVDRETLGLGYAGVYSSFLRHAEIAAAKYNLKTLDILVELGHRR
MVGGQEDMIVDVALDLLAAHKENRA
>Q53WI0 4.1.3.39~~~~~~4-hydroxy-2-oxovalerate aldolase~~~
MSWDLSTAKPPVVVDTTLRDGSHAHRHQYTVEEARAIAQALDEAGVYAIEVSHGDGLGGSSLQYGFSRTDEMELIRAVRE
TVRRAKVAALLLPGIGTRKELKEAVEAGIQMVRIATQCTEADISEQHFGMAKEMGLEAVGFLMMSHMRPPEFLAEQARLM
EGYGADVVYIVDSAGAMLPEDAYARVKALKEALSRAKVGFHAHNNLGLAIGNTLAALAAGADWVDATLRGYGAGAGNAPL
EVLAAVLDKAGLNPGLDVFKLLDAAEYVMGPILHFQPYPDRDSVAIGYAGVYSTFLLHAKRIGKELGVDPLAILLELGRR
QAVAGQEDWILRVALELKEKEAGALAD
>O31266 1.13.11.48~~~hod~~~1H-3-hydroxy-4-oxoquinaldine 2,4-dioxygenase~~~
MTDTYLHETLVFDNKLSYIDNQRDTDGPAILLLPGWCHDHRVYKYLIQELDADFRVIVPNWRGHGLSPCEVPDFGYQEQV
KDALEILDQLGVETFLPVSHSHGGWVLVELLEQAGPERAPRGIIMDWLMWAPKPDFAKSLTLLKDPERWREGTHGLFDVW
LDGHDEKRVRHHLLEEMADYGYDCWGRSGRVIEDAYGRNGSPMQMMANLTKTRPIRHIFSQPTEPEYEKINSDFAEQHPW
FSYAKLGGPTHFPAIDVPDRAAVHIREFATAIRQGQ
>P37305 ~~~hokA~~~Protein HokA~~~
MPQKYRLLSLIVICFTLLFFTWMIRDSLCELHIKQESYELAAFLACKLKE
>P77494 ~~~hokB~~~Toxic protein HokB~~~
MKHNPLVVCLLIICITILTFTLLTRQTLYELRFRDGDKEVAALMACTSR
>P0ACG4 ~~~hokC~~~Toxic protein HokC~~~
MKQHKAMIVALIVICITAVVAALVTRKDLCEVHIRTGQTEVAVFTAYESE
>P77091 ~~~hokE~~~Toxic protein HokE~~~
MLTKYALAAVIVLCLTVLGFTLLVGDSLCEFTVKERNIEFKAVLAYEPKK
>P28630 2.7.7.7~~~holA~~~DNA polymerase III subunit delta~~~COG1466
MIRLYPEQLRAQLNEGLRAAYLLLGNDPLLLQESQDAVRQVAAAQGFEEHHTFSIDPNTDWNAIFSLCQAMSLFASRQTL
LLLLPENGPNAAINEQLLTLTGLLHDDLLLIVRGNKLSKAQENAAWFTALANRSVQVTCQTPEQAQLPRWVAARAKQLNL
ELDDAANQVLCYCYEGNLLALAQALERLSLLWPDGKLTLPRVEQAVNDAAHFTPFHWVDALLMGKSKRALHILQQLRLEG
SEPVILLRTLQRELLLLVNLKRQSAHTPLRALFDKHRVWQNRRGMMGEALNRLSQTQLRQAVQLLTRTELTLKQDYGQSV
WAELEGLSLLLCHKPLADVFIDG
>P71730 2.7.7.7~~~holA~~~Probable DNA polymerase III subunit delta~~~COG1466
MSEAKPLHLVLGDEELLVERAVADVLRSARQRAGTADVPVSRMRAGDVGAYELAELLSPSLFAEERIVVLGAAAEAGKDA
AAVIESAAADLPAGTVLVVVHSGGGRAKSLANQLRSMGAQVHPCARITKVSERADFIRSEFASLRVKVDDETVTALLDAV
GSDVRELASACSQLVADTGGAVDAAAVRRYHSGKAEVRGFDIADKAVAGDVAGAAEALRWAMMRGEPLVVLADALAEAVH
TIGRVGPQSGDPYRLAAQLGMPPWRVQKAQKQARRWSRDTVATAMRLVAELNANVKGAVADADYALESAVRQVAELVADR
GR
>P28631 2.7.7.7~~~holB~~~DNA polymerase III subunit delta'~~~COG0470
MRWYPWLRPDFEKLVASYQAGRGHHALLIQALPGMGDDALIYALSRYLLCQQPQGHKSCGHCRGCQLMQAGTHPDYYTLA
PEKGKNTLGVDAVREVTEKLNEHARLGGAKVVWVTDAALLTDAAANALLKTLEEPPAETWFFLATREPERLLATLRSRCR
LHYLAPPPEQYAVTWLSREVTMSQDALLAALRLSAGSPGAALALFQGDNWQARETLCQALAYSVPSGDWYSLLAALNHEQ
APARLHWLATLLMDALKRHHGAAQVTNVDVPGLVAELANHLSPSRLQAILGDVCHIREQLMSVTGINRELLITDLLLRIE
HYLQPGVVLPVPHL
>P28905 2.7.7.7~~~holC~~~DNA polymerase III subunit chi~~~COG2927
MKNATFYLLDNDTTVDGLSAVEQLVCEIAAERWRSGKRVLIACEDEKQAYRLDEALWARPAESFVPHNLAGEGPRGGAPV
EIAWPQKRSSSRRDILISLRTSFADFATAFTEVVDFVPYEDSLKQLARERYKAYRVAGFNLNTATWK
>P28632 2.7.7.7~~~holD~~~DNA polymerase III subunit psi~~~COG3050
MTSRRDWQLQQLGITQWSLRRPGALQGEIAIAIPAHVRLVMVANDLPALTDPLVSDVLRALTVSPDQVLQLTPEKIAMLP
QGSHCNSWRLGTDEPLSLEGAQVASPALTDLRANPTARAALWQQICTYEHDFFPRND
>P0ABS8 2.7.7.7~~~holE~~~DNA polymerase III subunit theta~~~
MLKNLAKLDQTEMDKVNVDLAAAGVAFKERYNMPVIAEAVEREQPEHLRSWFRERLIAHRLASVNLSRLPYEPKLK
>F8J3H2 2.3.1.-~~~holE~~~Holothin acyltransferase~~~
MSEKLDSYKLMQEHQTWTSKPASLEEWQIVNEWAIAEKWDLGLGDTERFFNIDEEGFYLGYVNDEPVASVSVVNYTDEYA
YAGFYLVAPGARGKGYGLRLSYDAFRHCDKRSVGLDGMPEQEENYKKGGFVTHYETSRLVGIHNQQVDAPDGVQNITADN
IDEVIKFDEKITGYPRAALLKDWFSGEGRHGFVINSGDGVIGVVGIRRSTDGYRLGPLYSENQAVCDKLFAMALAQVPQG
TQVTIDAPTLDLGFINGLKKMGFEEIFHTFRMYRGKEPQGEKHKIQAIASLELG
>Q877R9 3.2.2.-~~~hopAM1-1~~~3' cyclic ADP-D-ribose synthase HopAM1~~~
MHANPLSSFNRAQHGNLTNVEASQVKSAGTSSTTNIDSKNIEEHVADRLSDLGRPDGGWFFEKSLGTLKNLNLEQLAGIH
DVLKLTDGVKNIVSFGAREGGFELAMQFRHDLYRSQHPDENSPHDAATHYLDAISLQSNKFTKLEKLQHVDVFKMQNPFW
DVGYKNGIAHAKKMAFFITPEWLGSDFCKQEFQWLSETKNKDIKSAFVIFKDVDLKSKNMTSIFNFADFHKSRVMMASTP
PESGLNNVKIENSVDLNFKRLLTDRESWELNNFLGD
>Q79LY0 3.1.3.48~~~hopD2~~~Effector protein hopD2~~~COG5599
MNPLQPIQHSITNSQMSGGQQLEAEGSQAHNSYSHPDRISLSQLSQSAHLALDHLSTQPNTDHQRVASLVRNAVQDGKFQ
LQSSNDTQVTYKTSVCPPANADTMGAAHLINNELTVQARLNDQLEYDIVSAHLYGPSEAISIDASSPPSANDLASSGLSE
RTHLGMNRVLLRYAVPPRETEDQCVMVIDKMPPPKHGKMSFFRTTNDLSKLPLGMETGGLSDLKLAGCERISSVEQVKSI
RAALGGGPLTVLDLREESHAIVNGLPITLRGPMDWANAGLSQVDGAARESAMITELKRTKSLTLVDANYVKGKKSNPQTT
ELKNLNVRSEREVVTEAGATYRRVAITDHNRPSPEATDELVDIMRHCLQANESLVVHCNGGRGRTTTAMIMVDMLKNARN
HSAETLITRMAKLSYDYNMTDLGSISALKRPFLEDRLKFLQAFHDYARNNPSGLSLNWTQWRAKIALE
>Q887D0 ~~~hopM1~~~Effector protein HopM1~~~
MISSRIGGAGGVELSRVNQQHDTVPAQTAHPNAVTAGMNPPLTPDQSGSHATESSSAGAARLNVAARHTQLLQAFKAEHG
TAPVSGAPMISSRAALLIGSLLQAEPLPFEVMAEKLSPERYQLKQFQGSDLQQRLEKFAQPGQIPDKAEVGQLIKGFAQS
VADQLEHFQLMHDASPATVGQHAKADKATLAVSQTALGEYAGRASKAIGEGLSNSIASLDEHISALDLTLQDAEQGNKES
LHADRQALVDAKTTLVGLHADFVKSPEAKRLASVAAHTQLDNVVSDLVTARNTVGGWKGAGPIVAAAVPQFLSSMTHLGY
VRLSTSDKLRDTIPETSSDANMLKASIIGMVAGIAHETVNSVVKPMFQAALQKTGLNERLNMVPMKAVDTNTVIPDPFEL
KSEHGELVKKTPEEVAQDKAFVKSERALLNQKKVQGSSTHPVGELMAYSAFGGSQAVRQMLNDVHQINGQTLSARALASG
FGGAVSASSQTLLQLKSNYVDPQGRKIPVFTPDRAESDLKKDLLKGMDLREPSVRTTFYSKALSGIQSSALTSALPPVTA
QAEGASGTLSAGAILRNMALAATGSVSYLSTLYTNQSVTAEAKALKAAGMGGATPMLDRTETALNNIRHPNRESLPHTFQ
KSTLSGIPRVAENAYHMGRGALQLPTQMAVDTVRVVDEGVLNAVASAREALKQPTKDDDALRALEEGLLDPR
>E1V825 ~~~hopP~~~Major outer membrane protein~~~COG3203
MKKTLLATAIAGAMAASGAQAATVYNQDGTKLDIYGNVQIGFRNIEAENDNGNIETQNDVFDNGSTIGFAAEHVIYDGLT
GYMKIEFDDFKADEMKTAGRDAGDTAYVGLKGNFGDVKLGSYDTLMDDWIQDPITNNEYFDVSDTSGSGSSVVAVGGEVE
TDQLTYVSPSFNGLELAIGTQYKGDMEEENVTSRGNASVFGGAKYTAGNFSVAATYDNLDNYEVTQTGVDNKQEFGDRYG
VTGQYQWNSLRVALKYERFDSDLDNVDSVNFYGLGARYGYGYGDIYGAYQYVDVGGDTFGNVVDDATSGDSPSDTASDRG
DDTYNEFIIGGTYNISDAMYTWVEAAFYDREDDEGDGVAAGVTYMF
>A9ZM27 ~~~hopP~~~Monosaccharide porin~~~
MKKTLLATAIAGAMAASGAQAATVYNQDGTKLDIYGNVQIGFRNIEAENDNGNIETQNDVFDNGSTIGFAAEHVIYDGLT
GYMKIEFDDFKADEMKTAGRDAGDTAYVGLKGNFGDVKLGSYDTLMDDWIQDPITNNEYFDVSDTSGSGSSVVAVGGEVE
TDQLTYVSPSFNGLELAIGTQYKGDMEEENVTSRGNASVFGGAKYTAGNFSVAATYDNLDNYEVTQTGVDNKQEFGDRYG
VTGQYQWNSLRVALKYERFDSDLDNVDSVNFYGLGARYGYGYGDIYGAYQYVDVGGDTFGNVVDDATSGDSPSDTASDRG
DDTYNEFIIGGTYNISDAMYTWVEAAFYDREDDEGDGVAAGVTYMF
>Q8RP17 ~~~hopW1-1~~~Effector protein hopW1-1~~~
MSPAQIIRTPHSFPPSFTGTSSSAENSHAQSPQQVLTRAFVASGELNAAFGRTSTASEQDFTSLLGTLQRELERKTLSFP
DIAELANQLAEAAKGDQGGHWLGRDEQQTLKGMIDRCKSQLAHTHASDASYDPLAQVCENLKTARLHQSISQMTGEAHAK
VRGVPDLLALIQLDPDVLAEKPVGMTSYVNFGSFICMAKARTAELSEDLRSDPNEVALLLHPHADTILELERLPDALAAL
TENCPDTPTRDDLRSLAKETGELLQQLRANDLLPRSEEVSSYQGETSVRSREVVEPKLTLCQAGGNGQGQLEASSARPES
LRYAPTRAASSGSEARVPGQAVGGKIADDAQKVAGLYAEKKRTNWTQANGVAGKISHKIQSLLGMRDAGSRVQAFVAFMA
DGKGRPGATMLDLGDGWMRATRVIKGEAALIDFQCDSDGKVVDARHPGRFPVLPQGNEREAFKTVLQELKFRGAETLSKV
PVYYVNRNTRGYVIPTHGYVVAGHPNRGRKSGAVLYGVGGDPKRGPVALDEKLLGHLVGRSDSKTSSKLSAPVKAAISAL
AGASFATREDFYDAYCAVRGDAVDPLERHNEISSIYRLLPLSTMEMWPKKADDYRVARPAAPERDLRAFENLPKDIGRKA
QLKKVSNVDSIDLLEAKRQFTLHQLYQDEMLGRNGTGVPSADFKPKVDAQRRDQLVASTPKFQRLPPHTTDKVGNCNTGA
SSLLQRAVDTYTEKNNLPPEKVTAASIFGIGSSHRLAIWDPLDGSSSNKSSKDR
>P69782 ~~~hosA~~~Transcriptional regulator HosA~~~
MALRNKAFHQLRQLFQQHTARWQHELPDLTKPQYAVMRAIADKPGIEQVALIEAAVSTKATLAEMLARMENRGLVRREHD
AADKRRRFVWLTAEGEKVLAAAIPIGDSVDEEFLGRLSAEEQELFMQLVRKMMNT
>O87198 2.3.3.14~~~~~~Homocitrate synthase~~~COG0119
MREWKIIDSTLREGEQFEKANFSTQDKVEIAKALDEFGIEYIEVTTPVASPQSRKDAEVLASLGLKAKVVTHIQCRLDAA
KVAVETGVQGIDLLFGTSKYLRAAHGRDIPRIIEEAKEVIAYIREAAPHVEVRFSAEDTFRSEEQDLLAVYEAVAPYVDR
VGLADTVGVATPRQVYALVREVRRVVGPRVDIEFHGHNDTGCAIANAYEAIEAGATHVDTTILGIGERNGITPLGGFLAR
MYTLQPEYVRRKYKLEMLPELDRMVARMVGVEIPFNNYITGETAFSHKAGMHLKAIYINPEAYEPYPPEVFGVKRKLIIA
SRLTGRHAIKARAEELGLHYGEEELHRVTQHIKALADRGQLTLEELDRILREWITA
>P22317 1.12.1.2~~~hoxF~~~NAD-reducing hydrogenase HoxS subunit alpha~~~COG1894
MDSRITTILERYRSDRTRLIDILWDVQHEYGHIPDAVLPQLGAGLKLSPLDIRETASFYHFFLDKPSGKYRIYLCNSVIA
KINGYQAVREALERETGIRFGETDPNGMFGLFDTPCIGLSDQEPAMLIDKVVFTRLRPGKITDIIAQLKQGRSPAEIANP
AGLPSQDIAYVDAMVESNVRTKGPVFFRGRTDLRSLLDQCLLLKPEQVIETIVDSRLRGRGGAGFSTGLKWRLCRDAESE
QKYVICNADEGEPGTFKDRVLLTRAPKKVFVGMVIAAYAIGCRKGIVYLRGEYFYLKDYLERQLQELREDGLLGRAIGGR
AGFDFDIRIQMGAGAYICGDESALIESCEGKRGTPRVKPPFPVQQGYLGKPTSVNNVETFAAVSRIMEEGADWFRAMGTP
DSAGTRLLSVAGDCSKPGIYEVEWGVTLNEVLAMVGARDARAVQISGPSGECVSVAKDGERKLAYEDLSCNGAFTIFNCK
RDLLEIVRDHMQFFVEESCGICVPCRAGNVDLHRKVEWVIAGKACQKDLDDMVSWGALVRRTSRCGLGATSPKPILTTLE
KFPEIYQNKLVRHEGPLLPSFDLDTALGGYEKALKDLEEVTR
>P22320 1.12.1.2~~~hoxH~~~NAD-reducing hydrogenase HoxS subunit beta~~~COG3259
MSRKLVIDPVTRIEGHGKVVVHLDDDNKVVDAKLHVVEFRGFEKFVQGHPFWEAPMFLQRICGICFVSHHLCGAKALDDM
VGVGLKSGIHVTPTAEKMRRLGHYAQMLQSHTTAYFYLIVPEMLFGMDAPPAQRNVLGLIEANPDLVKRVVMLRKWGQEV
IKAVFGKKMHGINSVPGGVNNNLSIAERDRFLNGEEGLLSVDQVIDYAQDGLRLFYDFHQKHRAQVDSFADVPALSMCLV
GDDDNVDYYHGRLRIIDDDKHIVREFDYHDYLDHFSEAVEEWSYMKFPYLKELGREQGSVRVGPLGRMNVTKSLPTPLAQ
EALERFHAYTKGRTNNMTLHTNWARAIEILHAAEVVKELLHDPDLQKDQLVLTPPPNAWTGEGVGVVEAPRGTLLHHYRA
DERGNITFANLVVATTQNNQVMNRTVRSVAEDYLGGHGEITEGMMNAIEVGIRAYDPCLSCATHALGQMPLVVSVFDAAG
RLIDERAR
>P23516 ~~~hoxN~~~High-affinity nickel transport protein~~~COG3376
MFQLLAGVRMNSTGRPRAKIILLYALLIAFNIGAWLCALAAFRDHPVLLGTALLAYGLGLRHAVDADHLAAIDNVTRKLM
QDGRRPITAGLWFSLGHSSVVVLASVLIAVMATTLQERLDAFHEVGSVIGTLASALFLFAIAAINLVILRSAYRAFRRVR
RGGIYVEEDFDLLFGNRGFLARIFRPLFRFITRSWHMYPLGMLFALGFDTATEVALLGISTMEASRGVPIWSILVFPALF
TAGMALIDTIDSILMCGAYAWAYAKPVRKLYYNMTITFVSAIVALIVGGIETLGLLADKFMLKGVFWNAVGALNENFCQL
GFVIIGIFTVCWVVSIVVYRLRRYDDSEVRA
>P22318 1.12.1.2~~~hoxU~~~NAD-reducing hydrogenase HoxS subunit gamma~~~COG3383
MSIQITIDGKTLTTEEGRTLVDVAAENGVYIPTLCYLKDKPCLGTCRVCSVKVNGNVAAACTVRVSKGLNVEVNDPELVD
MRKALVEFLFAEGNHNCPSCEKSGRCQLQAVGYEVDMMVSRFPYRFPVRVVDHASEKIWLERDRCIFCQRCVEFIRDKAS
GRKIFSISHRGPESRIEIDAELANAMPPEQVKEAVAICPVGTILEKRVGYDDPIGRRKYEIQSVRARALEGEDK
>P22319 1.12.1.2~~~hoxY~~~NAD-reducing hydrogenase HoxS subunit delta~~~COG1941
MRAPHKDEIASHELPATPMDPALAANREGKIKVATIGLCGCWGCTLSFLDMDERLLPLLEKVTLLRSSLTDIKRIPERCA
IGFVEGGVSSEENIETLEHFRENCDILISVGACAVWGGVPAMRNVFELKDCLAEAYVNSATAVPGAKAVVPFHPDIPRIT
TKVYPCHEVVKMDYFIPGCPPDGDAIFKVLDDLVNGRPFDLPSSINRYD
>Q6VE93 2.3.1.-~~~hopZ1a~~~Serine/threonine-protein acetyltransferase HopZ1a~~~
MGNVCVGGSRMSHQVYSPDRADTPPRSERNTPDRRQRAAGDAERTQSMRLQQKINDLKPYVRHARGPIKAYGQAALDRAS
GKKTSVSFAELDATHLDAMVYIENQRNPGLNLKHFRDHKELIQALQSDGPSAFRAIFPQTCPETGQTLKHHVMADVRLHQ
GGAPTIIITEPAVIVGARYQQLQRHNLTLEDLSESGVPLSQVAIIETQAQKTSDDCVMYSLNYAIKAHKNAAQFDDIHHG
LQHGTLSTESESRARTTLGALEASSSYSVMHEGAHAAFGADVLPVDFYKHGASLTQAKQLMKRPDGRMAGRVNSEGHSEA
ENLVQRNQAFRVKRRELLDDETPSNTQFSASIDGFRLQEIKRVLAEEQR
>A0A0H2XEA6 1.14.14.18~~~bphO~~~Heme oxygenase~~~
MMQALGQGHIDADTYAQVLRRHHRLLAGFEEQLSDWLVTLVGSGWQYRRRVPALREDLRVLGQPVDAAVPPPASSEAARW
GMLYVIEGSQLGGRVIARMLRKRQPGLAHALHYFELADEDPAGWRRFQAVLEQRLQSAAARADAIAGAQAMFAHFHTCLA
AEARP
>Q48B61 ~~~hopAB1~~~Effector protein hopAB1~~~
MPGINGAGPSNFFWQWRTDGEPVTEREHDSSRSASSANSPELPPPASPAESGRQRLLRSSALSRQTREWLEATPARVQGA
TPPAEARQSPEAQQAERIVQELVRGGADLNNVRTMLRNVMDNNAVAFSRVERDILLQHFPNMPMTGISSDSVLANELRQR
LRQTVRQQRIQSSTPARLADSSSGSSQRSLIGRSTMLMTPGRSSSSSAAASRTSVDRHPQGLDLESARLASAARHNHSAN
QTNEALRRLTQEGVDMERLRTSLGRYIMSLEPLPPDLRRALESVGINPFIPEELSLVDHPVLNFSAALNRMLASRQTTTN
SPELPPLASSAESGRRRLLRSPPLLSGQREWIEQSMRQEAEPQSSRLNRAVRLAVMPPQNENEDNVAYAIRLRRLNPGAD
VSRVVASFITDPAARQQVVNDIRAALDIAPQFSQLRTISKADAESEELGFRDAADHPDNATSCLFGEELSLSNPDQQVIG
LAVNPTDKPQPYSQEVNKALTFMDMKKLAQYLADKPEHPLNRQRLDAKNIAKYAFKIVP
>Q9RBW3 ~~~hopAB1~~~Effector protein hopAB1~~~
MPGINGAGPSNFFWQWRTDGEPVTEREHDSSRSASSANSPELPPPASPAESGRQRLLRSSALSRQTREWLEATPARVQGA
TPPAEARQSPEAQQAERIVQELVRGGADLNNVRTMLRNVMDNNAVAFSRVERDILLQHFPNMPMTGISSDSVLANELRQR
LRQTVRQQRIQSSTPARLADSSSGSSQRSLIGRSTMLMTPGRSSSSSAAASRTSVDRHPQGLDLESARLASAARHNHSAN
QTNEALRRLTQEGVDMERLRTSLGRYIMSLEPLPPDLRRALESVGINPFIPEELSLVDHPVLNFSAALNRMLASRQTTTN
SPELPPLASSAESGRRRLLRSPPLLSGQREWIEQSMRQEAEPQSSRLNRAVRLAVMPPQNENEDNVAYAIRLRRLNPGAD
VSRVVASFITDPAARQQVVNDIRAALDIAPQFSQLRTISKADAESEELGFRDAADHPDNATSCLFGEELSLSNPDQQVIG
LAVNPTDKPQPYSQEVNKALTFMDMKKLAQYLADKPEHPLNRQRLDAKNIAKYAFKIVP
>Q8RSY1 ~~~hopAB2~~~Effector protein HopAB2~~~
MAGINRAGPSGAYFVGHTDPEPVSGQAHGSGSGASSSNSPQVQPRPSNTPPSNAPAPPPTGRERLSRSTALSRQTREWLE
QGMPTAEDASVRRRPQVTADAATPRAEARRTPEATADASAPRRGAVAHANSIVQQLVSEGADISHTRNMLRNAMNGDAVA
FSRVEQNIFRQHFPNMPMHGISRDSELAIELRGALRRAVHQQAASAPVRSPTPTPASPAASSSGSSQRSLFGRFARLMAP
NQGRSSNTAASQTPVDRSPPRVNQRPIRVDRAAMRNRGNDEADAALRGLVQQGVNLEHLRTALERHVMQRLPIPLDIGSA
LQNVGINPSIDLGESLVQHPLLNLNVALNRMLGLRPSAERAPRPAVPVAPATASRRPDGTRATRLRVMPEREDYENNVAY
GVRLLNLNPGVGVRQAVAAFVTDRAERPAVVANIRAALDPIASQFSQLRTISKADAESEELGFKDAADHHTDDVTHCLFG
GELSLSNPDQQVIGLAGNPTDTSQPYSQEGNKDLAFMDMKKLAQFLAGKPEHPMTRETLNAENIAKYAFRIVP
>Q2QCI9 ~~~hopAB3~~~Effector protein HopAB3~~~
MAGINGAGPSGAYFVGHTDPEPASGGAHGSSSGASSSNSPRLPAPPDAPASQARDRREMLLRARPLSRQTREWVAQGMPP
TAEAGVPIRPQESAEAAAPQARAEERHTPEADAAASHVRTEGGRTPQALAGTSPRHTGAVPHANRIVQQLVDAGADLAGI
NTMIDNAMRRHAIALPSRTVQSILIEHFPHLLAGELISGSELATAFRAALRREVRQQEASAPPRTAARSSVRTPERSTVP
PTSTESSSGSNQRTLLGRFAGLMTPNQRRPSSASNASASQRPVDRSPPRVNQVPTGANRVVMRNHGNNEADAALQGLAQQ
GVDMEDLRAALERHILHRRPIPMDIAYALQGVGIAPSIDTGESLMENPLMNLSVALHRALGPRPARAQAPRPAVPVAPAT
VSRRPDSARATRLQVIPAREDYENNVAYGVRLLSLNPGAGVRETVAAFVNNRYERQAVVADIRAALNLSKQFNKLRTVSK
ADAASNKPGFKDLADHPDDATQCLFGEELSLTSSVQQVIGLAGKATDMSESYSREANKDLVFMDMKKLAQFLAGKPEHPM
TRETLNAENIAKYAFRIVP
>Q8RP04 ~~~hopAB3~~~Effector protein hopAB3~~~
MVGISGRAGPSGSYNYSGHTDNPEPVSGRARDSNSEANSSNSPQVPPPLNAPASPMPAGRPRFLRSMALSSQTREWLEKG
MPTEAEAGVPIRLQERAANTAPQARAEERHTQPADAAAPHARAERGRTLQAPASTSPLYTGAVPRANRIVQQLVEAGADL
ANIRTMFRNMLRGEEMILSRAEQNVFLQHFPDMLPCGIDRNSELAIALREALRRADSQQAARAPARTPPRSSVRTPERSP
APRTATESSSGSNQRSLLGRFAGLMTSNQRRPSSASNASTSQRPVDRNPPRINLMPTGANRVAMRNRGNNEADAALQALA
QNGINMEDLRAALEAYIVWLRPIPLDIANALEGVGITPRFDNPEEAKVDNPLMNLSSALKRRLDA
>Q57160 1.14.14.9~~~hpaB~~~4-hydroxyphenylacetate 3-monooxygenase oxygenase component~~~COG2368
MKPEDFRASTQRPFTGEEYLKSLQDGREIYIYGERVKDVTTHPAFRNAAASVAQLYDALHKPEMQDSLCWNTDTGSGGYT
HKFFRVAKSADDLRHERDAIAEWSRLSYGWMGRTPDYKAAFGCALGGTPGFYGQFEQNARNWYTRIQETGLYFNHAIVNP
PIDRHLPTDKVKDVYIKLEKETDAGIIVSGAKVVATNSALTHYNMIGFGSAQVMGENPDFALMFVAPMDADGVKLISRAS
YEMVAGATGSPYDYPLSSRFDENDAILVMDNVLIPWENVLLYRDFDRCRRWTMEGGFARMYPLQACVRLAVKLDFITALL
KKSLECTGTLEFRGVQADLGEVVAWRNTFWALSDSMCSEATPWVNGAYLPDHAALQTYRVLAPMAYAKIKNIIERNVTSG
LIYLPSSARDLNNPQIDQYLAKYVRGSNGMDHVQRIKILKLMWDAIGSEFGGRHELYEINYSGSQDEIRLQCLRQAQSSG
NMDKMMAMVDRCLSEYDQNGWTVPHLHNNDDINMLDKLLK
>Q48440 1.14.14.9~~~hpaB~~~4-hydroxyphenylacetate 3-monooxygenase oxygenase component~~~COG2368
MKPENFRADTKRPLTGEEYLKSLQDGREIYIYGERVKDVTTHPAFRNAAASVAQLYDALHNPELQNTLCWGTDTGSGGYT
HKFFRVAKSADDLRQQRDAIAEWSRLSYGWMGRTPDYKAAFGGGLGANPGFYGQFEQNARDWYTRIQETGLYFNHAIVNP
PIDRHKPADEVKDVYIKLEKETDAGIIVSGAKVVATNSALTHYNMIGFGSAQVMGENPDFALMFVAPMDAEGDKLISRAS
YELVAGATGSPYDYPLSSRFDENDAILVMDNVLIPWENVLIYRDFDRCRRWTMEGGFARMYPLQACVRLAVKLDFITALL
KRSLECTGTLEFRGVQAELGEVVAWRNMFWALSDSMCAEATPWVNGAYLPDHAALQTYRVMAPMPYAKIKNIIERSVTSG
LIYLPSSARDLNNPQINDTLAKYVRGSNGMDHVERIKILKLMWDAIGSEFGGCHELYEINYSGSQDEIRLQCLRQAQSSG
NMDKMMAMVDRCLSEYDQNGWTVPHLHNNTDINMLDKLLK
>Q9HWT7 1.14.14.9~~~hpaB~~~4-hydroxyphenylacetate 3-monooxygenase oxygenase component~~~
MKPEDFRASATRPFTGEEYLASLRDDREIYIYGDRVKDVTSHPAFRNAAASMARLYDALHDPQSKEKLCWETDTGNGGYT
HKFFRYARSADELRQQRDAIAEWSRLTYGWMGRTPDYKAAFGSALGANPGFYGRFEDNAKTWYKRIQEACLYLNHAIVNP
PIDRDKPVDQVKDVFISVDEEVDGGIVVSGAKVVATNSALTHYNFVGQGSAQLLGDNTDFALMFIAPMNTPGMKLICRPS
YELVAGIAGSPFDYPLSSRFDENDAILVMDKVFIPWENVLIYRDFERCKQWFPQGGFGRLFPMQGCTRLAVKLDFITGAL
YKALQCTGSLEFRGVQAQVGEVVAWRNLFWSLTDAMYGNASEWHGGAFLPSAEALQAYRVLAPQAYPEIKKTIEQVVASG
LIYLPSGVRDLHNPQLDKYLSTYCRGSGGMGHRERIKILKLLWDAIGSEFGGRHELYEINYAGSQDEIRMQALRQAIGSG
AMKGMLGMVEQCMGDYDENGWTVPHLHNPDDINVLDRIRQ
>Q5SJP8 1.14.14.9~~~~~~4-hydroxyphenylacetate 3-monooxygenase oxygenase component~~~COG2368
MARTGAEYIEALKTRPPNLWYKGEKVEDPTTHPVFRGIVRTMAALYDLQHDPRYREVLTYEEEGKRHGMSFLIPKTKEDL
KRRGQAYKLWADQNLGMMGRSPDYLNAVVMAYAASADYFGEFAENVRNYYRYLRDQDLATTHALTNPQVNRARPPSGQPD
PYIPVGVVKQTEKGIVVRGARMTATFPLADEVLIFPSTLLQAGSEKYALAFALPTSTPGLHFVCREALVGGDSPFDHPLS
SRVEEMDCLVIFDDVLVPWERVFILGNVELCNNAYAATGALNHMAHQVVALKTAKTEAFLGVAALMAEGIGADVYGHVQE
KIAEIIVYLEAMRAFWTRAEEEAKENAYGLLVPDRGALDGARNLYPRLYPRIREILEQIGASGLITLPSEKDFKGPLGPF
LEKFLQGAALEAKERVALFRLAWDMTLSGFGARQELYERFFFGDPVRMYQTLYNVYNKEPYKERIRAFLKESLKVFEEVQ
A
>Q57501 1.5.1.36~~~hpaC~~~4-hydroxyphenylacetate 3-monooxygenase reductase component~~~COG1853
MQLDEQRLRFRDAMASLSAAVNIITTEGDAGQCGITATAVCSVTDTPPSLMVCINANSAMNPVFQGNGKLCVNVLNHEQE
LMARHFAGMTGMAMEERFSLSCWQKGPLAQPVLKGSLASLEGEIRDVQAIGTHLVYLVEIKNIILSAEGHGLIYFKRRFH
PVMLEMEAAI
>Q48441 1.5.1.36~~~hpaC~~~4-hydroxyphenylacetate 3-monooxygenase reductase component~~~COG1853
MQLDEQRLRFRDAMASLSPAVNVITTEAEAGAAVSPHRPSCSVTDTPPSVMVCINANSAMNPVFQGNGKLCINVLNHEQE
EMARHFAGMTGMTMDDRFGLSGWQKGALGQPVLKGALASLEGEISQVQTIGSHLVYLVEIRNITLSQQGHGLIYFKRRFH
PVMMEMDVVA
>Q9HWT6 1.5.1.37~~~hpaC~~~4-hydroxyphenylacetate 3-monooxygenase reductase component~~~
MSQLEPRQQAFRNAMAHLSAAVNVITSNGPAGRCGITATAVCSVTDSPPTLMLCINRNSEMNTVFKANGRLCVNVLSGEH
EEVARHFAGMTEVPMERRFALHDWREGLAGLPVLHGALANLQGRIAEVQEIGTHSVLLLELEDIQVLEQGDGLVYFSRSF
HRLQCPRRAA
>Q5SJP7 1.5.1.36~~~~~~4-hydroxyphenylacetate 3-monooxygenase, reductase component~~~COG1853
MKEAFKEALARFASGVTVVAARLGEEERGMTATAFMSLSLEPPLVALAVSERAKLLPVLEGAGAFTVSLLREGQEAVSEH
FAGRPKEGIALEEGRVKGALAVLRCRLHALYPGGDHRIVVGLVEEVELGEEGPPLVYFQRGYRRLVWPS
>Q6Q271 1.5.1.36~~~C1-hpah~~~p-hydroxyphenylacetate 3-hydroxylase, reductase component~~~COG1853
MNQLNTAIVEKEVIDPMAFRRALGNFATGVTIMTAQTSSGERVGVTANSFNSVSLDPALVLWSIDKKSSSYRIFEEATHF
GVNILSAAQIELSNRFARRSEDKFANIEFDLGVGNIPLFKNCSAAFECERYNIVEGGDHWIIIGRVVKFHDHGRSPLLYH
QGAYSAVLPHPSLNMKSETAEGVFPGRLYDNMYYLLTQAVRAYQNDYQPKQLASGFRTSEARLLLVLESKTASSKCDLQR
EVAMPIREIEEATKILSEKGLLIDNGQHYELTEQGNACAHMLYKIAESHQEEVFAKYTVDERKLFKNMLKDLIGI
>Q6Q272 1.14.14.9~~~C2-hpah~~~p-hydroxyphenylacetate 3-hydroxylase, oxygenase component~~~COG1960
MENTVLNLDSDVIHACEAIFQPIRLVYTHAQTPDVSGVSMLEKIQQILPQIAKNAESAEQLRRVPDENIKLLKEIGLHRA
FQPKVYGGLEMSLPDFANCIVTLAGACAGTAWAFSLLCTHSHQIAMFSKQLQDEIWLKDPDATASSSIAPFGKVEEVEGG
IILNGDYGWSSGCDHAEYAIVGFNRFDADGNKIYSFGVIPRSDYEIVDNWYAQAIKSSGSKMLKLVNVFIPEYRISKAKD
MMEGKSAGFGLYPDSKIFYTPYRPYFASGFSAVSLGIAERMIEAFKEKQRNRVRAYTGANVGLATPALMRIAESTHQVAA
ARALLEKTWEDHRIHGLNHQYPNKETLAFWRTNQAYAVKMCIEAVDRLMAAAGATSFMDNSELQRLFRDAHMTGAHAYTD
YDVCAQILGRELMGMEPDPTMV
>A4IT51 1.14.14.8~~~hpaH~~~Anthranilate 3-monooxygenase oxygenase component~~~COG2368
MRMGIRTGAQYISGLKSRKPEIWLSGRRVINVCEEPVFKQPIREIARLYDMQHDPEYQDKITHICTETGERVSNAFLVPK
SREDLLARRALFEVWARATFGLMGRTPDFLNVVLTSLYSNASFLEKYNPQWAENIRAYYRYVRDNDLFLTHAIINPQNDR
SKPSHEQQDTFTHLGVVRETPEGLIVRGAKMLATLAPITDEVIIYTFPGYKPGDERYAVSFAIPIDTPGLRILCREPMQD
GTRPLFDHPLASRFEEMDALLVFNDVLVPWDRVFIYNNVEAANLLYPKTGIAQQPAHQTGVRGLIKLQFATEVAIRLADS
IGVDVYLNVQNDLGELLQSVEAIRALLHLAEHELEVLPSGEVMPGWVPLETIRGLLPKLYPRAVEVLQIIGAGGLLMSPT
GADFANPELAADMEKYYAGRIGVGGEERVRLFKLAWDLCGEAFGQRLLQYERFYTGDPIRKRAIFYNNIKRERTLVMVDE
ALRMPNQQEKVVNA
>A1B198 5.1.1.22~~~hpbD~~~4-hydroxyproline betaine 2-epimerase~~~COG4948
MKIAEIHVYAHDLPVKDGPYTIASSTVWSLQTTLVKIVADSGLAGWGETCPVGPTYAPSHALGARAALAEMAPGLIGANP
LQPLVLRRRMDGLLCGHNYAKAAIDIAAYDLMGKHYGVRVADLLGGVAAERVPSYYATGIGQPDEIARIAAEKVAEGFPR
LQIKIGGRPVEIDIETVRKVWERIRGTGTRLAVDGNRSLPSRDALRLSRECPEIPFVLEQPCNTLEEIAAIRGRVQHGIY
LDESGEDLSTVIRAAGQGLCDGFGMKLTRIGGLQQMAAFRDICEARALPHSCDDAWGGDIIAAACTHIGATVQPRLNEGV
WVAQPYIAQPYDEENGIRIAGGHIDLPKGPGLGITPDESLFGPPVASFS
>Q0FPQ4 5.1.1.22~~~hpbD~~~4-hydroxyproline betaine 2-epimerase~~~COG4948
MKIAEIQLFQHDLPVVNGPYRIASGDVWSLTTTIVKIIAEDGTIGWGETCPVGPTYAEAHAGGALAALEVLASGLAGAEA
LPLPLHTRMDSLLCGHNYAKSALDIAVHDLWGKRLGVPVHELLGGALTDSVSSYYSLGVMEPDEAARQALEKQREGYSRL
QVKLGARPIEIDIEAIRKVWEAVRGTGIALAADGNRGWTTRDALRFSRECPDIPFVMEQPCNSFEDLEAIRPLCHHALYM
DEDGTSLNTVITAAATSLVDGFGMKVSRIGGLQHMRAFRDFCAARNLPHTCDDAWGGDIVSAACTHIASTVLPRLMEGAW
LAQPYVAEHYDAENGVRIEGGRIRVPQGPGLGLTIDPERFGPPLFSA
>Q05353 1.13.11.15~~~hpcB~~~3,4-dihydroxyphenylacetate 2,3-dioxygenase~~~COG3384
MGKLALAAKITHVPSMYLSELPGKNHGCRQGAIDGHKEISKRCREMGVDTIIVFDTHWLVNSAYHINCADHFEGVYTSNE
LPHFIRDMTYNYEGNPELGQLIADEALKLGVRAKAHNIPSLKLEYGSVVPMRYMNEDKRFKVVSISAFCTVHDFADSRKL
GERIVKAIEQYDGTVAVLASGSLSHRFIDDQRAEEGMNSYTREFDRQMDERVVKLWREGQFKEFCNMLPEYADYCYGEGN
MHDTVMLLGMLGWDKYDGKVWSLSPSYSQASWHRSG
>Q05354 5.3.3.10~~~hpcD~~~5-carboxymethyl-2-hydroxymuconate Delta-isomerase~~~
MPHFIVECSDNIREEADLPGLFAKVNPTLAATGIFPLAGIRSRVHWVDTWQMADGQHDYASVHMTLKIGAGRSLESRQQA
GEMLFELIKTHFAALMESRLLALSFEIEELHPTLNFKQNNVHALFK
>P37352 ~~~hpcE~~~Homoprotocatechuate catabolism bifunctional isomerase/decarboxylase~~~
MKGTIFAVALNHRSQLDAWQEAFQQSPIKAPPKTAVWFIKPRNTVIGCGEPIPFPQGENLLSGATVALIVGKTATKVREE
DAAEYIAGYALANDVSLPEESFYRPAIKAKCRDGFCPIGETVALSNVDNLTIYTEINGRPADHWNTSDLQRNAAQLLSAL
SEFATLNPGDAILLGTPQARVEIQPGDRVRVLAEGFPPLENPVVDEREVTTRKSFPTLPHPHGTLFALGLNYADHASELE
FKPPEEPLVFLKAPNTLTGDNQTSVRPNNIEYMHYEAELVVVIGKQARNVSEADAMDYVAGYTVCNDYAIRDYLENYYRP
NLRVKSRDGLTPMLSTIVPKEAIPDPHNLTLRTFVNGELRQQGTTADLIFSVPFLIAYLSEFMTLNPGDMIATGTPKGLS
DVGDEVVVEVEGVGRLVNRIVSEETAK
>P42270 4.2.1.163~~~hpcG~~~2-oxo-hept-4-ene-1,7-dioate hydratase~~~COG3971
MFDKHTHTLIAQRLDQAEKQREQIRAISLDYPEITIEDAYAVQREWVRLKIAEGRTLKGHKIGLTSKAMQASSQISEPDY
GALLDDMFFHDGSDIPTDRFIVPRIEVELAFVLAKPLRGPNCTLFDVYNATDYVIPALELIDARCHNIDPETQRPRKVFD
TISDNAANAGVILGGRPIKPDELDLRWISALMYRNGVIEETGVAAGVLNHPANGVAWLANKLAPYDVQLEAGQIILGGSF
TRPVPARKGDTFHVDYGNMGSISCRFV
>B1IS70 4.1.2.52~~~hpcH~~~4-hydroxy-2-oxo-heptane-1,7-dioate aldolase~~~
MENSFKAALKAGRPQIGLWLGLSSSYSAELLAGAGFDWLLIDGEHAPNNVQTVLTQLQAIAPYPSQPVVRPSWNDPVQIK
QLLDVGTQTLLVPMVQNADEAREAVRATRYPPAGIRGVGSALARASRWNRIPDYLQKANDQMCVLVQIETREAMKNLPQI
LDVEGVDGVFIGPADLSADMGYAGNPQHPEVQAAIEQAIVQIRESGKAPGILIANEQLAKRYLELGALFVAVGVDTTLLA
RAAEALAARFGAQATAVKPGVY
>Q47098 4.1.2.52~~~hpcH~~~4-hydroxy-2-oxo-heptane-1,7-dioate aldolase~~~COG3836
MENSFKAALKAGRPQIGLWLGLSSSYSAELLAGAGFDWLLIDGEHAPNNVQTVLTQLQAIAPYPSQPVVRPSWNDPVQIK
QLLDVGTQTLLVPMVQNADEAREAVRATRYPPAGIRGVGSALARASRWNRIPDYLQKANDQMCVLVQIETREAMKNLPQI
LDVEGVDGVFIGPADLSADMGYAGNPQHPEVQAAIEQAIVQIRESGKAPGILIANEQLAKRYLELGALFVAVGVDTTLLA
RAAEALAARFGAQATAVKPGVY
>Q84F14 1.97.1.-~~~hpdA~~~4-hydroxyphenylacetate decarboxylase activating enzyme~~~
MSSQKQLEGMIFDVQSFSVHDGPGCRTTVFLNGCPLSCKWCANPESWTVRPHMMFSELSCQYENGCTVCHGKCKNGALSF
NLDNKPVIDWNICKDCESFECVNSCYYNAFKLCAKPYTVDELVQVIKRDSNNWRSNGGVTFSGGEPLLQHEFLHEVLLKC
HEVNVHTAIETSACVSNEVFNKIFNDIDFAFIDIKHMDREKHKEQTGVYNDLILENISNLANSDWNGRLVLRVPVISGFN
DSDENISDIISFMHKNNLVEINLLPFHRLGESKWTQLGKEYEYSDKGDVDEGHLEELQDIFLDNGIACYVGHVTAF
>Q38HX2 1.97.1.-~~~csdA~~~4-hydroxyphenylacetate decarboxylase activating enzyme~~~
MKEKGLIFDIQSFSVHDGPGCRTSVFFIGCPLQCKWCANPESWTKKKHIMVAENVCKWKNGCRSCINACSHDSIKFSEDG
KLKISWDTCEKCETFDCVNMCPNNALKQCVKEYTVDELMTILKRDFNNWGSDGGVTFTGGDPLMHHEFLVEVLKKCYDSQ
IHKAIETSGYAKQEVFLEVLKYIDFAFIDVKNMDREKHKQGTGVYNDLILSNIEALKKSNWNGRLVLRQPTIAGYNDSDE
NAYKLIEFMNKNSLYEINLLKFHRLGETKWNQLGKEYEYSKYGDMTNEKMEHLQQLYLDNNIACYIGDNTPF
>Q84F16 4.1.1.83~~~hpdB~~~4-hydroxyphenylacetate decarboxylase glycyl radical subunit~~~
MSQSKEDKIRSILEAKNIKSNFQNKENLSEFNEKKASKRAEDLLDVYYNTLSTADMEFPYWYNREYRKSDGDIPVVRRAK
ALKAAFSHMTPNIIPGEKIVMQKTRHYRGSFPMPWVSESFFVAQGEQMREEAKKLASNTADELTKFGSGGGNVTESFGNV
VSIAGKFGMRKEEVPVLVKMAKEWVGKSVEDLGFHYEKMMPDYDLKENLMSTLICMFDSGYTLPQGREVINYFYPLNYGL
DGIIEMAKECKKAVAGNASGDGLIGMDRLYFYEAVIQVIEGLQTWILNYAKHAKYLESIETDLEAKKEYSDLVEILEHIA
HKQPRTFREALQLTYTIHIASVNEDAISGMSIGRFGQILYPWYEQDIEKGLITKEEVIELLELYRIKITCIDCFASAGVN
GGVLSGNTFNTLSIGGLKEDGSTGANELEELLLEASMRCRTPQPSLTMLYDEKLPEDFLMKAAECTKLGSGYPAWVNNSN
GTTFMMKQFADEGMTVEEARAFALGGCLETSPGCWKQLTLNGKTYSIAGGAGQSAGSGVHFIANPKILELVLMNGKDYRM
NIQVFEPHNKPLDTYEEVIEVFKDYYKQAINVLERANNIELDIWRKFDTSIINSLLKPDCLDKGQHIGNMGYRYNATLNV
ETCGTVTMVNSFAALKKLVYDDKAFTIEEIKDAILNNFGFKDALEVGNYSMADQVKVDKTGKYDAIYKACLDAPKYGNND
LYADNILKNYEVWLSKVCEEAQSLYAKKMYPCQISVSTHGPQGAATLATPDGRLSGTTYSDGSVSAYAGTDKNGVYALFE
SATIWDQAVVQNSQMNLKLHPTTIKGQQGTKKLLDLTRSYLRKGGFHIQYNVVDSETLKDAQKNPDNYRQLMVRVAGFTQ
YWCELGKPIQDEVIARTEYEGV
>Q38HX4 4.1.1.83~~~csdB~~~4-hydroxyphenylacetate decarboxylase glycyl radical subunit~~~
MNVKETKLEDVLKSRGIDMKDAYNISEADIPEAKESTQKLMDIYYTLKVTADMEAAYWYNRTWWENDGEVIEVRRAKAVA
ASLSHMTPTILPYEKLVMNKTKNVRGAFPFPWVCASFFNAQAEALMNEVDAPAENEADSVSVVGAGGGNVTESYGNVISI
AKKFGMRKEEIPVLVKTSKPWEGISVEELSNKYSKMTPGYDQFKNIMESVICMFDSFAIPQGREVINYYMPLQYGFDGII
KLCDEKIAEVMGEAGDDGDFGMSRGYYYAAMKEITKGLSAWCENYSKRAKYLASIETDSEIKANYEKIEEVMGNIAHKKP
ANFWEAIQMTLCCHFGVVNEDPQSGLSIGRLGQVLQPFYEKDVEDGIMTDEEVIELLELYRIKITCIECFASAGVSGGVL
SGNTFNNLSLGGQNYDGLSAVTPLEYLIVEAGMRNQTPQPTLSVLYDEKTPEDFLMKAASCTKLGLGYPAWMNNQTGMNF
MMRNYGPEGMDLHDARAWCLGGCLESAPGCFLPLEYNGKVTMIPGGASPTCGTGVHFIGMPKVLELVLTNGLDKRTGKQV
YPPHNKKLDSYETMVNQWKEYMELTTDVVNRCNNIQMDIWRKYNMPAVNSLLKPDCFKKGKHIGTMGARYNSCINFESCG
TITFVNSLSSIKKNVFDDSKFTIEEMTDAMLNNFGFKTAYETEVFSPDFRESTDKSTKYEKIFAACVNAPKYGNADKYAD
EIFKAYHYYIYDMTHKFRSYYGKPLYLCQISVSTHGPQGFVTLATADGRLAGTTYSDGSVSAAAGTDKNGIYAIFESATV
YDHSMHQNAQMNLKLHPTAVKGINGTRKLLDLVRAYMRKGGFHVQFNVVDSKTLRDAQLTPEKYRELMVRVAGFTQYWCE
IGKPIQDEVIYRTEYDK
>Q84F15 4.1.1.83~~~hpdC~~~4-hydroxyphenylacetate decarboxylase small subunit~~~
MRKHSDCMNFCAVDATKGICRLSKQMINLDDAACPEIKVMPKCKNCKNFVEANDEGIGKCVGLEKEDWVYSTLNAITCEG
HVFNE
>Q38HX3 4.1.1.83~~~csdC~~~4-hydroxyphenylacetate decarboxylase small subunit~~~
MRHYDCKNYINLDCEKGLCALTKGMVPIDGEGSEACPNFKPAEKCGNCKNFCNPDKYGLGTCTGLEKENWAYATCGASAC
PSYKAE
>A4NBN9 ~~~pe~~~Surface-adhesin protein E~~~
MKKIILTLSLGLLTACSAQIQKAEQNDMKLAPPTDVRSGYIRLVKNVNYYIDSESIWVDNQEPQIVHFDAVVNLDKGLYV
YPEPKRYARSVRQYKILNCANYHLTQVRTDFYDEFWGQGLRAAPKKQKKHTLSLTPDTTLYNAAQIICANYGKAFSVDKK
>P43961 ~~~pe~~~Surface-adhesin protein E~~~
MKKIILTLSLGLLTACSAQIQKAEQNDVKLAPPTDVRSGYIRLVKNVNYYIDSESIWVDNQEPQIVHFDAVVNLDRGLYV
YPEPKRYARSVRQYKILNCANYHLTQIRTDFYDEFWGQGLRAAPKKQKKHTLSLTPDTTLYNAAQIICANYGKAFSVDKK
>A0A318FL05 4.2.1.177~~~hpfG~~~(2S)-3-sulfopropanediol dehydratase~~~
MKVNHTTACGTQPFDKTYSLGYQVHHEDWSPYPRVNRLRQAFLDRPYDIDVERLRLVTEAYQKHEDAPRKLKCARAFENI
LLNTKLYIYDEDLILGEIAAPAKASPIYPEFSVNWIINEILHSPFEERANDQFYIRNDEERKEIVELCRYWEGKTVDDLI
NSRLEIDQTKGSEVGEKIFQTNLYHYAGAGHLAIDYARLMAVGYNGLIDNAQAGLEKLSKRDPEYGDKRDFYTAMIIELE
AAKKYIARYAKLAQESAEKEENPQRKQELETMALNCQQIAGGVPQTFWQALQLFNFATTLIQIESNGHSISYGRMDQWLY
PWFAADMKNNTITKEFALELIEVQYVKMNNPTKLKDKGTVAVRNGRGFGGESLTLGGVDREGNDATNDLTMLMLEGSAHT
RMMNPWVCVRMHENTPYELKIKTVECIRAGYGHPKLFNDAPSIKGMMRKGMTLEEARDYCVVGCVELDLAGKEYGWHDAA
YVNTPKMMEMVVNGGRSLSTGEQLGPDTGSLDTYKSFDEVLASVDQQFEYWTDQMCSSLNIIDNAHRELKPVPYVSAFYE
DCMISGKDLTEGGAKYNGIAPQAAGMATCADSLATIKQLVFDEKRYSGAEMLQAVKDNWVGHEKLYALVNSSKVRHYGND
DDYADDLFKFMFECYCRHISGRKTPRGGEFSPGVYSVNANVGMGLNTNASIDGRKKFEPISDNMGPVHTDGGSHDICGPT
ALVNSLTKVDHSLATNGTLMNLRFPQEAVAGVEGRDNLLSFIDEYIAKQAMHVQFNIMSSATMRAAQKKPEDYKDMLVRV
AGYSAYFVELGKPLQKDLIQRTELHF
>P28368 ~~~yvyD~~~Ribosome hibernation promotion factor~~~COG1544
MNYNIRGENIEVTPALKDHVERKIGKLERYFDHSVDADVNVNLKFYNDKESKVEVTIPMTDLALRSEVHNEDMYNAIDLA
TNKLERQIRKHKTKVNRKFREQGSPKYLLANGLGSDTDIAVQDDIEEEESLDIVRQKRFNLKPMDSEEAILQMNMLGHNF
FVFTNAETNLTNVVYRRNDGKYGLIEPTE
>Q9RVE7 ~~~hpf~~~Ribosome hibernation promotion factor~~~COG1544
MQIYQLSGRNVEVTEPMREYVEEKLSRLDRYTDQITDARVTLTVRDVRNNERRNRVEVQLNVPGGIIRAEEHHADMYAAI
DKASDVLERQLRKFKTRYMKQRQEGRPEPLPGPAEAEVNAQGSGAAMDDVSEFHPEIVRQKRFELRPMSAEDAVVQMEAL
GHDFYVFQDLQGQTGVVYRRRDGHYGLIGSS
>P0AFX0 ~~~hpf~~~Ribosome hibernation promoting factor~~~COG1544
MQLNITGNNVEITEALREFVTAKFAKLEQYFDRINQVYVVLKVEKVTHTSDATLHVNGGEIHASAEGQDMYAAIDGLIDK
LARQLTKHKDKLKQH
>A4N8V8 ~~~hpf~~~Putative metal ABC transporter substrate-binding protein Hpf~~~
MRNSFKIMTALALGLFAMQANAKFKVVTTFTVIQDIAQNVAGNAATVESITKPGAEIHEYEPTPKDIVKAQSADLILWNG
LNLERWFERFFQNVKDKPAVVVTEGIQPLSIYEGPYKDAPNPHAWMSPSNALIYIENIKNALVKYDPQNAAVYEKNAADY
AQKIKQLDEPLRAKLAQIPEAQRWLVTSEGAFSYLAKDYNLKEGYLWPINAEQQGTPQQVRKVIDLVRKNNIPVVFSEST
ISAKPAQQVAKESGAKYGGVLYVDSLSAKNGPVPTYIDLLNVTVSTIVKGFGK
>A2RIX0 ~~~hpf~~~Ribosome hibernation promotion factor~~~COG1544
MIKFNIRGENVEVTDAIRAYVEDKIGKLDKYFNDGHEVTAYVNLKVYTEKRAKVEVTLPAKNVTLRAEDTSQDMYSSIDF
VEEKLERQIRKYKTRMNRKPRNAVPTGQVFGDEFAPLDTTDEVAEDHVDIVRTKHVALKPMDAEEAVLQMDMLGHDFYVF
TDADSNGTHVVYRRTDGRYGLIETE
>A0A0H3GEZ8 ~~~hpf~~~Ribosome hibernation promotion factor~~~
MLKYNIRGENIEVTEPIRDYVEKKIDKLERYFTETPDANVHVNLKVYSDKNAKVEVTIPLPNLVLRAEETSGDLYASIDL
IVDKLERQIRKHKTKVNRKFRDKGAERDYFAYSDVNGSTPPEENEGDFDLEIVRTKQFSLKPMDSEEAVLQMNLLGHSFY
VYTDAETNGTNIVYSRKDGKYGLIETN
>O05886 ~~~hpf~~~Ribosome hibernation promotion factor~~~COG1544
MSRLAVDSGQVLAEPKSNAEIVFKGRNVEIPDHFRIYVSQKLARLERFDRTIYLFDVELDHERNRRQRKSCQRVEITARG
RGPVVRGEACADSFYAALESAVVKLESRLRRGKDRRKVHYGDKTPVSLAEATAVVPAPENGFNTRPAEAHDHDGAVVERE
PGRIVRTKEHPAKPMSVDDALYQMELVGHDFFLFYDKDTERPSVVYRRHAYDYGLIRLA
>Q2FIN9 ~~~hpf~~~Ribosome hibernation promotion factor~~~
MIRFEIHGDNLTITDAIRNYIEEKIGKLERYFNDVPNAVAHVKVKTYSNSATKIEVTIPLKNVTLRAEERNDDLYAGIDL
INNKLERQVRKYKTRINRKSRDRGDQEVFVAELQEMQETQVDNDAYDDNEIEIIRSKEFSLKPMDSEEAVLQMNLLGHDF
FVFTDRETDGTSIVYRRKDGKYGLIQTSEQ
>Q2G055 ~~~hpf~~~Ribosome hibernation promotion factor~~~COG1544
MIRFEIHGDNLTITDAIRNYIEEKIGKLERYFNDVPNAVAHVKVKTYSNSATKIEVTIPLKNVTLRAEERNDDLYAGIDL
INNKLERQVRKYKTRINRKSRDRGDQEVFVAELQEMQETQVDNDAYDDNEIEIIRSKEFSLKPMDSEEAVLQMNLLGHDF
FVFTDRETDGTSIVYRRKDGKYGLIQTSEQ
>Q2YSH7 ~~~hpf~~~Ribosome hibernation promotion factor~~~
MIRFEIHGDNLTITDAIRNYIEEKIGKLERYFNDVPNAVAHVKVKTYSNSATKIEVTIPLKNVTLRAEERNDDLYAGIDL
INNKLERQVRKYKTRINRKSRDRGDQEVFVAELQEMQETQVDNDAYDDNEIEIIRSKEFSLKPMDSEEAVLQMNLLGHDF
FVFTDRETDGTSIVYRRKDGKYGLIQTSEQ
>Q7A6R6 ~~~hpf~~~Ribosome hibernation promotion factor~~~
MIRFEIHGDNLTITDAIRNYIEEKIGKLERYFNDVPNAVAHVKVKTYSNSATKIEVTIPLKNVTLRAEERNDDLYAGIDL
INNKLERQVRKYKTRINRKSRDRGDQEVFVAELQEMQETQVDNDAYDDNEIEIIRSKEFSLKPMDSEEAVLQMNLLGHDF
FVFTDRETDGTSIVYRRKDGKYGLIQTSEQ
>Q5XAQ7 ~~~hpf~~~Ribosome hibernation promotion factor~~~
MIKFSIRGENIEVTEAIRDYVESKLTKIEKYFAKDQEIDARVNLKVYRERSSKVEVTIPLDSVTLRAEDVSQDMYGSIDL
VVDKIERQIRKNKTKIAKKHREKVPTGQVFTTEFEAEEVDEIPEVQVVRTKNVTLKPMDVEEARLQMELLGHDFFIYTDS
EDGATNILYRREDGNLGLIEAK
>P47908 ~~~hpf~~~Ribosome hibernation promotion factor~~~COG1544
MKLLIQGNNIAVTESIHDYVESKLEKATKHFQTFATKVDVHLSVANNARITDKHKAEVTVYANGTVIRAQEGSENLYASI
DLVSDKIARQLRKYKEKNFGKKTHVQEKTSEVLPEDPVPDNLIGDRAPELPSEVVRMKYFAMPPMTIDEALEQLQLVDHD
FYMFLNKDTNAINVIYIRNHGGYGVIQPRLGKE
>O52815 2.6.1.103~~~hpgT~~~(S)-3,5-dihydroxyphenylglycine transaminase~~~
MEILVFMDSYGLSTQLSMETLHGSLTDPAISSMNLLNELIDEYPVAISMAAGRPYEEFFDVRLIHEYIDAYCDHLRHDRK
LAEAVVTRTLFQYGTTKGIIADLIARNLAEDENIDAAAESVVVTVGAQEAMFLILRTLRADERDVLLAPAPTYVGLTGAA
LLTDTPVWPVQSTANGVDPEDLVLQLKRADEQGKRVRACYVTPNFANPTGTSMDLPARHRLLEVAEANGILLLEDNAYGL
FGSERLPSLKSLDRSGNVVYIGSFAKTGMPGARVGYVVADQRVAGGGLLADQLSKLKGMLTVNTSPIAQAVIAGKLLLND
FSLTKANAREIAIYQRNLQLTLSELERTLGGLPEVGWNTPTGGFFVTVTVPFVVDDELLAHAARDHGVLFTPMHHFYGGK
DGFNQLRLSISLLTPELIKEGVTRLAALITARLRWPRA
>P56867 ~~~hpi~~~Hexagonally packed intermediate-layer surface protein~~~
MKKNIALMALTGILTLASCGQNGTGTTPTADACATANTCSVTVNISGVSSADFDVTMDGKTTSMTLSNGQKLPVAKTGTV
TLTPKAKDGYTTPAAQSTTISSTNLTPSVNFAYTTVPSTGNGNGNGGTTPTQPFTLNITSPTNGAAATTGTPIRVVFTSS
VALSSATCKIGNSAAVNAQVSSTGGYCDVTPTTAGGGLITVTGTANGQTVSSTVTVDVKAPVVDNRYGTVTPAGDQELTL
TNEGIVKDADNGWRRLGQGVSTPSDPNGNVDIYVKGTVNFSVNAAAGSKVEVFLARTTGSDVPTNDDVQAGDVLRSVAST
SGTETFSLDSRRLAEFDGVRKWIVVRINGTQVTYQPVIADNKGPQQPDPELNGVQNAYSNILNNYNNSGLTYVRGDVNVF
TGNPSLQDREFGQAPLGSSFVQRRPSGFESIRYYLVPETAFGNKALQESDEMLRAKAIKSVATVVSAPVLEPGTVKATSF
SRVIGSGATSTVTPKAQDNVTYRVYAISRDQLGNETASATYELVRFDNVGPTITGSVIRDTSDLPFASQEPERCLSDIAT
ITLGGITDNAGGVGLNPGQGLTFTLGGRQIQAGQFDTNQLADGEYTIGFNSLTDALGNPVVSAPTNAKVYIDNTDPTVNF
NRAVMQGTFASGERVSVESDASDGGCGVYETRLFWDTDNGVVDDATTTPAIGHPVQFARQRVTDGAKADSLNAGWNALQL
PNGAGAVYLRALVVDRAGNATISTTPIVVNAKITNQARPLLGGFDAFKRNASAQFMSNSNAISGVNGTAVTPNTTANSAL
DNILSLDSVGTLTTNAYLPRGATETAITEKIRNVGAYGRFDATQWNRIRDYQLNTDPTLRSAYVNAGNLANQRGNNWRIR
TPWVELGSSDTANTQQKFDFNSDLLNDFYFGRTFGNNDNVNLFSYDQFNGIVSGTAGAYSFYGETVQK
>P13126 ~~~hpi~~~Hexagonally packed intermediate-layer surface protein~~~
MKKNIALMALTGVLTLASCGQNGNTPTADTTAPTVSLSVNNANLPSGVGSVVLSGTVNEASTVVVKNNAGTTVCTVEVAA
SGTFTCPATTIAGNTSTTSTSTSYTATATDAAKNVGTSSVVTVNVAGVSNPAPTTAVLTIDLAGVSSAPITIKDANGNVV
QGYDNVTVNDNATITVARGVYTVTAGNVSGFNGPTTNFRVDLSGGNQTVTLNYTQAGTTTPTPVGSINILTPAVGTSVTG
GSTVRVTFDKANEVQCMVGGAAAVTAQVDSTSGYCDVVVPNSTGNVVITVMGKGVNGQTVTATRNISVTQAAVSYGVVTP
AGDQELTLTSEGIVRDADSGWRRLGQGVSTPSDPNLNLDIYIKGTVNFSVNAPAGQKVELFLARTTGSDVPTNDDIQAGD
VLRSVASTSGTETFSLDSRRLAEFDGVRKWIVVRINGTQVTYQPVIADNKGPQQPDPELNGVQNAYSNILNNYNNSGLTY
VRGPVNVFTSNPSLQDREFGQAPVGSSFVQRRPAGFESIRYYLVPESAFNNKALQESDEMLRAKAVKSVATVVSAPVLEP
GTVKATSFSRVIGSGATSTVAPKALDNVTYRVYAISRDQVGNETASATYDLVRFDNVGPTITGSVIRDTSDLPFPSQEPE
RCLSDIATISLGGIADNVGGVGLNPGQGLTFTLGGRQIQAGQFDTNQLADPEYTIGFNSLTDALGNPVVTAPTNAKVYID
NTDPTVNFNRAVMQGTYASGGRVSVESDASDGGCGVYETRLFWDTANGVVDDATTTPAIGHPVQFARQRVTDGAKADSLN
AGWNALQLPNGAGAVYLRSLVVDRAGNATISTTSIVVNAKITNQARPLLGGFDAFKRNASAQFVGDDNVIAGVNGTAATP
NVTGNSALDNILSLDSVGTLTTNAYLPRGATETAITEKIRNVGAYGRFDATQWNLIRDYQLNTDPTLRSAYVNAGNLANQ
RGNNWRIRTPWVELGSSDTANTQQKFDFNSDLLNDFYYGRTFGNNHSVNLFSYDQFNGVVSDTAGAYSFYGETVRK
>Q6N3F1 4.2.3.156~~~hpnC~~~Hydroxysqualene synthase~~~COG1562
MTSASELRSGKTHRDENFPVASWIIHPRHRDLILAFYNFVRTADDIADHEMLDGDTKLEYLDLLEAELLGRGETQPEAVH
LRRALAERGMPPRHALDLLTAFRMDVTKLRYEDWDEVIHYCRYSAMPVGRFMLDVHGESTTTWQASDALCAGLQINNHLQ
DCGKDYRTLNRVYLPRDVLDAAGAKVEDLGLQKSSPALLKCLQGLAVRTASLLGDGRPLAAEIKDYRLGLEVSVIQAYAD
RIVRMLQTRDPLSERVHLKPIEFVIASFGAMSSEIVRRSFGKGPVSHPAPRA
>Q5NP67 4.2.3.156~~~hpnC~~~Hydroxysqualene synthase~~~COG1562
MEGACASTYRSVSIKTKNKLNAAALVSGKGHQDENFPVASFLINPEYRPIIMAFYQFARQADDVADNVIASKKDRLAILE
DMRSSLTGESQSEPNAVVLRQTLITHGLDHTIVHGLDLLEAFRRDVSVNRYENWDALMDYCRYSASPVGRFVLDVHKESR
NLWPMNDALCTALQVINHLQDCGKDYRMMNRIYIPSDIMEAVGATAGDLGRFHASLPLRQAIETAALKTKSLLKRSSGFS
AAIHDKRLGVEVAVIQRLAESLTECLTKHDPLSERVHHNKAETLGLAFVAAAGRLFS
>Q6N3F2 2.5.1.103~~~hpnD~~~Presqualene diphosphate synthase~~~COG1562
MTVHATPEPAAHQGVALGSSFYAAMRILPRPQREAMFQVYSFCRFVDDIADSDRPREQRVAELQQWRDDIAALYRGAPPP
RLADYQESLRTFGLKREDFEAIIDGMEMDVDADIRAPDEATLDLYCDRVASAVGRLSVRIFGLPEADGIELSHHLGRALQ
LTNILRDIDEDAGIGRLYLPSELLHKVGITATDPRVVAADSALPSVCAPLVERALAHFAAADKVMNRNPRRVVKAPRIMG
KYYYSILQLLIARGFAAPRAPVKLGKASKIAILLQYAIV
>H2VFR7 2.5.1.103~~~hpnD~~~Presqualene diphosphate synthase~~~COG1562
MTSAMKKIQPEAFSEKSSDSQASVSGAKSSSFYIGMRVLPPAEREAMYAIYNFCRQVDDIADDLEGSQEERKQALDAWRH
DINALYAGEPCGQAAFLKEPVARFHLRQEDFIAVIDGMAMDLKGPIVFPDEATLDLYCDRVASAVGRLSVYVFGMDPNIG
ESLAYHLGRALQLTNILRDIDEDAEIGRCYLPREPLEKAGIPLDIEKALADPRLDKVCRDLAWQAEGHYAASDHIIHNRP
KGYLIAPRLMAAAYSALLRKMLAQGWKNPRKKVKHNKLALLWTLLRLKVTS
>Q6N3F3 1.17.8.1~~~hpnE~~~Hydroxysqualene dehydroxylase~~~COG1232
MSKTVHVIGAGISGLAAAIRLARAGLTVHVHEAMQQAGGRCRSYFDAQTGLVIDNGNHLLLSGNHAACEYARTIGTEAGL
VGPERAEFDFIDLPANARWRLKLGGGKLPLWLFDANSRVPDTSIGDYLGLMPLLWAPTTKLIGDTINCSGPLYDRLVAPL
LLAALNVDPPEGSAGLAGAVVRETLLAGGKACRPLIARDGLSAVLVEPAVAQLAARGPGVQFGHELRALTPAGDRVGALQ
FGGEDVVTLGPDDAVVLAVPPRPAASLLPGLKTPQEYRAIVNAHFNYAPPPGMPALTGVIGGVVEWLFAFPNRLSVTISN
GDRLVDAPREQLAAEIWGEICKIAGISANLPPWQIVRERRATFAATPAQNALRPGPVTQWRNLYLAGDWTDTGLPATIEG
SVRSGNRAADLVLAAGRA
>Q5NP65 1.17.8.1~~~hpnE~~~Hydroxysqualene dehydroxylase~~~COG1232
MSVTHIIGAGLAGLSAAVAITHAGGRVKIYEASAMAGGRARSYHDKKLGIEIDNGNHMLLSGNHSAKTYLKRIGAEHRFK
SPKEAAFSFCDLSDKERFTIKLSNGPLPWWVLCAKSRVPHSKAKDYLALLSLLLADHNTKIGDLVPDNTALWRKLLDPFF
VSVLNTPAREGAACLAAAVIRETLMKGGKACIPRIAYPNLASSFIDPALDYLKARGVEVDFRNRLRQIHFSGQDVASLEF
AHQDVKLGKGDKVIIALPAWVVQSLIPDIETPDKYQAIINAHFLMKPTAAMPHIMGVVGGTADWIFTFENRISVTISAAN
HLLALEKEELVKRIWDDIQTVYAFKQDMPEWQVVTEKRATFEATVEQNNRRPPAVTAWNNLFLAGNWVRTGLPATIESAI
RSGQTAADLALSHS
>A0A0H3KP92 ~~~hpnN~~~Hopanoid transporter HpnN~~~COG4258
MVTSLIVRLVAWSVRRPVWVVVLSLLIAAFSGVYVARHFKINTDISKLVDAEPQWAALSQAVDRAFPQRNGTILAVVEAP
APEFATAAAHALTESLQKQAAAGRIGPVAEPGGGPFFEHNGLLFLSPQQVADTTSQLASARPLVNELAKNPSLTGLATTL
STTLGQPLLTGQVKLPSMAKLLSRSAATVDDVLAGKPAAFSWRALVDNDAARQPARAFVTVQPVVNYGALKAGAQTSDVI
RETARALDLEKRYGAVVRLTGEQPLADDEFSSVEDGAALNGVVTLLVVFVILWLALRSKRMIASVLVTLFVGLVVTAALG
LAMVGSLNMISVAFMVLFVGLGVDFSIQYGVKYREERFRDERIDHALIGAAHSMGMPLALATTAVAASFFSFIPTAYRGV
SELGLIAGVGMFVALLTTLTLLPALLRLFAPPGESKTPGFPWLAPVDDYLDRHRKPILIGTLAVVIGALPLLAFLHFDFN
PLHLKDPHSESMSTLLALKDSPEAAVNDVTLLAPSLADADAAAKRLDALPEVGRTTTLSTFIPADQPEKRAAIATAASTL
LPALTQPPAPPATDAQRVAALKRASDLLGYAAEDHPGPGAAAAQHLSQSLAKLAAADSATRDRAERAFADTLRIALNQLA
ALLQPQEITRDTLPPPLVRDWVAPDGKALVQISPKVPKGVDPNDDTMLRHFATAVKAAEPGAIGGPISILHSANTIISAF
LHAALWSIISITILLWITLRRFGDVLRTLVPLLVSGIVTLEMCVVLGMSLNFANIIALPLMLGVGVAFKVYFVMAWRAGQ
TGLLHSSLTHAVLFSAATTATAFGSLWLSHHPGTSSMGKLLALALTCTLIGAVVFQPVLMGKPRVKRAKNQSQGINE
>Q2T2R2 ~~~hpnN~~~Hopanoid transporter HpnN~~~
MLTSVLVRLVAWSVRRPIWVVVLSLVAAALSGVYVAHHFKINTDISKLVENDPKWAALGHAIDDAFPQRSQTILAVVEAP
APEFAAAAANALADGLRREAEAGRIGQVSEPGGGPLFEHDGLLFLPEQDVATTTAQLASARPLINVLAKDPSIAGLATTL
STTLGVPLQSGQVKLSGMAKLLSRSAATVDDVLAGKPAAFSWRALVDADAAREPARAFVTVQPVVNYGALQAGEQASRTI
RATAQALKLDERFGAAVRLTGEQPLADEEFASVQDGALVNGIATLAIVLVILWIALRSKRMIAAVFVTLFVGLVVTAALG
LMMVGSLNMISVAFMVLFVGLGVDFAIQYGVKYREERHRDPNLEHALVGAAHAMGMPLTLATAAVAASFFSFLPTAYRGV
SELGLIAGVGMFVALFTTLTLLPALLRLLAPPGERKPPGFPRLAPVDDYLDRHRKPILIGTLAVVIGALPLLAHLRFDFN
PLHLKDPHSESMATLLALKDSPEASVNDVSLLAPSLAAANAAAQRLGALPEVGRTTTLSTFIPDAQPQKLATIAAAAREL
LPALTQPAAAPVSDAQRVAALKRASNLLEYAAEDYPGPGAAAAKHLSESLAKLAAADAATRERAEHAFSAPLKIALNQLA
MLLQPSEITRKNLPPQIVRDWVAPDGRALVQISPKVPKGADPGDDAMLRRFAKAVKAAEPGTIGGPISILHSADTIIRAF
LQAAALSVVSITVLLWITLRRFGDVLRTLVPLLVSGVVTLELCVLLGMPLNFANIIALPLMLGVGVAFKVYFVMAWRAGQ
TGLLQSSLTHAVLFSAATTATAFGSLWLSHHPGTASMGRLLALALSCTLIGAVVFQPVLMGKPRTKRVTNQSQGIDE
>B3QHB6 ~~~hpnN~~~Hopanoid transporter HpnN~~~
MLKSAIVSIVRASTRFAAFTVLIGVFLAVAAGFYTYQHFGINTDINHLISSDLDWRKRDIAFEKAFDQERLILAVVEAPT
PEFANAAAAKLTAELSKNNINFDSVKRLGGGPFFDRSGLLFLPKDEVAKATGQFQQAVPLIEIMAGDPSIRGLTAALETG
LVGLKRGELTLDATAKPFNTVAATVEDVLGKQQAFFSWRGLVNPEPLTDGDKRAFIEVKPILDFKALEPGKAATDAIRQA
AVDLKIEQDFGARVRLTGPVPIANEEFATVKDGAVVNGIGTVVVVLLILWMALHSSKIIFAVAANLVIGLSITTAVGLML
VDSLNLLSIAFAVLFVGLGVDFGIQFSVRYRSERHKTGDLEKALVQAAEYSAVPLSLAAMSTTAGFLSFLPTSYKGISEL
GEIAGAGMAIAFFTSITVLPALLKLLNPAGEKEPLGYAFLAPVDHFLEKHRIAIIVGTIGVALAGLPLLYFMHFDFNPIN
LRSPKVESIATFLDLRKDPNTGANAVNVMAPNEQAAREIEAKLAKLPQVSRTISLDTFVPPDQPEKLKLIQAGAKVLEPA
LNPEQIDPPPSDQDNIASLKSSAEALRRAAGEATGPGADASRRLATALTKLAGADQAMREKAQDVFVRPLLLDFELLRNM
LKAQPVTLDNLPADIVSSWKTKDGQIRVEVLPSGDPNDNDTLRKFAAAVLQAEPLATGGPVSILKSGDTIVASFIQAGLW
ALLSISILLWITLRRISDVALTLVPLLVAGAVTLEICVLIDLPLNFANIVALPLLLGVGVAFKIYYVTAWRSGRTNLLQS
ALTRAIFFSALTTATAFGSLWLSSHPGTASMGKLLALSLLTTLGAVLLFQPALMGKPRHIDESGDTDL
>B3QHD1 2.1.1.-~~~hpnP~~~Hopanoid C-2 methylase~~~
MKAESGQTSRRILCVFPRYTKSFGTFQHSYPLMDDVAAFMPPQGLLVIAAYLPDEWSVRFVDENIRAATADDFAWADAVF
VSGMHIQRQQMNDICRRAHDFDLPVALGGPSVSACPDYYPNFDYLHVGELGDATDQLIAKLTHDVTRPKRQVVFTTEDRL
DMTLFPIPAYELAECSKYLLGSIQYSSGCPYQCEFCDIPGLYGRNPRLKTPEQIITELDRMIECGIRGSVYFVDDNFIGN
RKAALDLLPHLVEWQKRTGFQLQLACEATLNIAKRPEILELMREAYFCTIFVGIETPDPTALKAMHKDHNMMVPILEGVR
TISSYGIEVVSGIILGLDTDTPETGEFLMQFIEQSQIPLLTINLLQALPKTPLWDRLQREGRLVHDDNRESNVDFLLPHD
QVVAMWKDCMARAYQPEALLKRYDYQIAHAYATRLHPSTPQRASKANIKRGMIMLRNIIWQIGIRGDYKLAFWKFALRRL
IRGDIENLLLVMVVAHHLIIYAREASRGHANASNYSIRLREAAVPAE
>Q60AV6 2.1.1.-~~~hpnR~~~Hopanoid C-3 methylase~~~COG1032
MKVFCVHPSPLMYTKVFLRLEPLGLELVAESLRRAGHDIRLMDLQVESHADFLRELDTWRPDVVCFSLNYLANVPEVIDL
AKTAKSRLPECFTFVGGHSASFVAKDLLDHGEGLLDCVLRGEGEAGAPKLLETLARRGNIDEVPGVVSLTGEGPPPGFTD
NLDEHLPARDLLKYRRKYFLGTLDPCASIEFSRGCPWDCSFCSAWTFYGRSYRVMSTERIMEDLRRIKEPGIFIVDDVAF
IQAQHGMEIGEAIAREGIRKQYYLETRGDVLLRNKEVFKLWKKLGMEYMFLGVEAIDAEGLQKFRKRVSLGKNFEALEFA
RSLGITVAINLIADPDWDRERFEVIRQWCMEIPEIVNISVNTPYPGTESWHTESRQLTTRDYRLFDIQHAVLPTRLPLPE
FYGELVKTQQVLYKKHMGWAAARDTLKILGGHLLRGQTNFLRSLWKFNSVFNPELQLADHRQPVKYPMTLPPAPTEQKIE
AKTLYVHRSQGRKSRALDDATEKFVDEGRMGAATG
>P0A0V6 ~~~hpn~~~Histidine-rich metal-binding polypeptide~~~
MAHHEEQHGGHHHHHHHTHHHHYHGGEHHHHHHSSHHEEGCCSTSDSHHQEEGCCHGHHE
>Q2RIS7 7.2.3.1~~~hppA1~~~Putative K(+)-stimulated pyrophosphate-energized sodium pump 1~~~COG3808
MELLAPLTGIVALLFAFYLTNKINRSDPGNPRMQEIAVAIHEGAMAFLMREYRTLIFFVLGMTALIVVAGFMTRGAESMQ
PATAIAYVAGTLCSIGAGYIGMQVATRANVRTANAARHSSNAALDIAFSGGSVMGMAVVGLGLLGLGIINYVFKNPSIVN
GFALGASSIALFARVGGGIYTKAADVGADLVGKVEAGIPEDDPRNPAVIADNVGDNVGDVAGMGADLFESYVGSIISGIA
LAAALNIPNGTLVPLMIAAIGIVSSILGAFFVKTGEGANAQKALNTGTMVASILAIVGTFLATRLLPAHFTAGSMSYTST
GVFAATIAGLIAGVLIGRITEYYTSGDYEPVKEIAKASQTGTATNIIEGLSTGMLSTVLPILVIVIAIIASYRFAGLYGI
AMAAVGMLSTTGTTVAVDAYGPIADNAGGIAEMAELDPKVRKITDALDSVGNTTAAIGKGFAIGSAALTALALFSAYTAA
ARITAIDLTDPKVVGGLFIGGMLPFLFAALTMKAVGRAAFQMIEEVRRQFKSIPGLMEGKARPDYARCVAISTGAAIKEM
IVPGLLAVLVPLAVGLIPGLGKEALGGLLAGATVTGFLMAVMMANAGGAWDNAKKYIEGGQYGGKGSPAHAAAVNGDTVG
DPFKDTSGPAMNILIKLMTIVSLVFAPLFMQL
>Q8UG67 7.1.3.1~~~hppA~~~K(+)-insensitive pyrophosphate-energized proton pump~~~COG3808
MRMTVIPIVILCGVLSVVYAVWTTKSVLDADQGNERMREIAGYIREGAQAYLTRQYLTIAIVGLIVAVLAWYLLSAIAAI
GFVIGAVLSGVAGFVGMHVSVRANLRTAQAASHSLGAGLDIAFKSGAITGMLVAGLALLGVSIYYFVLTSVLGHPPGSRA
VIDALVSLGFGASLISIFARLGGGIFTKGADVGGDLVGKVEAGIPEDDPRNPATIADNVGDNVGDCAGMAADLFETYAVS
VVATMVLAAIFFAGTPILESAMVYPLAICGACILTSIAGTFFVKLGTNNSIMGALYKGLIATGVFSVAGLAVATYATVGW
GTIGTVAGMEITGTNLFFCGLVGLVVTALIVVITEYYTGTNKRPVNSIAQASVTGHGTNVIQGLAVSLESTALPAIVIVG
GIIGTYQLGGLFGTGIAVTAMLGLAGMIVALDAFGPVTDNAGGIAEMAGLDPDVRKATDALDAVGNTTKAVTKGYAIGSA
GLGALVLFAAYANDLSYFAANGDTYPYFKDIGEISFSLANPYVVAGLLFGGLIPYLFGGIAMTAVGKAASAIVEEVRRQF
REKPGIMAGTEKPDYGRAVDLLTKAAIREMVIPSLLPVLAPLVVYFGVLLISGSKASAFAALGASLLGVIINGLFVAISM
TSGGGAWDNAKKSFEDGFIDKDGVRHVKGSEAHKASVTGDTVGDPYKDTAGPAVNPAIKITNIVALLLLAVLAH
>Q3AFC6 7.1.3.1~~~hppA~~~K(+)-stimulated pyrophosphate-energized proton pump~~~COG3808
MENGMTLAYYGLGAGILAILFALYLFSSVLKEDMGNEKMREISQAIFEGAMAYLNRQYKTLIPFALVVFVLLVVGFGYKE
GDFGYGLKVGVSFLVGAIASALAGYAGMTSTTKANARTTQAARKSLNAALNVAFRAGGVMGMSVAGLGLLGVSALYIIFK
DVHVIDSFAFGASAIAFFARVGGGIYTKAADVGADLVGKVEAGIPEDDPRNPAVIADNVGDNVGDTAGMGADLFESYGAT
TMAAMLLGLTFAKNHGFSEVLGATFPLLLGAAGIVAAIISTFFVRTSEDGNPQMALNIGLWSTNFITAIFTYIIAQYVFG
SEWAPKIFIAVVSGLVVNVAIGSLTEYYTSNLKPPAQKIAEASTTGPATNIISGIAVGMRSTYLPIIVIVAAIMVGYWAA
GFYGIALAAMGMLATAAMVVAVDSFGPVADNAGGIAEMAELGPEIRNKTDKLDAVGNTTAAVAKGFAIGSAALTALALFS
AYTDLAKTNPNLQKYLVNGKFDLNITDPWVLVGLFLGGTVAFLVAALTMESVGKAAFDMIEEVRRQFREIPGLMEGKARP
DYARCVSISTAAAIRQMIAPGLLAVGAPLAIGFILGFKALTGYLAGVTATGVLLAIYMANAGGAWDNAKKYIEAGNLGGK
GSDTHKAAVVGDTVGDPFKDTSGPAMNPLMKVAGTFALIIVPLLLF
>O68460 7.1.3.1~~~hppA~~~K(+)-insensitive pyrophosphate-energized proton pump~~~COG3808
MAGIYLFVVAAALAALGYGALTIKTIMAADAGTARMQEISGAVQEGASAFLNRQYKTIAVVGAVVFVILTALLGISVGFG
FLIGAVCSGIAGYVGMYISVRANVRVAAGAQQGLARGLELAFQSGAVTGMLVAGLALLSVAFYYILLVGIGATGRALIDP
LVALGFGASLISIFARLGGGIFTKGADVGADLVGKVEAGIPEDDPRNPAVIADNVGDNVGDCAGMAADLFETYAVTVVAT
MVLASIFFAGVPAMTSMMAYPLAIGGVCILASILGTKFVKLGPKNNIMGALYRGFLVSAGASFVGIILATAIVPGFGDIQ
GANGVLYSGFDLFLCAVIGLLVTGLLIWVTEYYTGTNFRPVRSVAKASTTGHGTNVIQGLAISMEATALPALIICAAIIT
TYQLSGLFGIAITVTSMLALAGMVVALDAYGPVTDNAGGIAEMANLPEDVRKTTDALDAVGNTTKAVTKGYAIGSAGLGA
LVLFAAYTEDLAFFKANVDAYPAFAGVDVNFSLSSPYVVVGLFIGGLLPYLFGSMGMTAVGRAAGSVVEEVRRQFREIPG
IMEGTAKPEYGRCVDMLTKAAIKEMIIPSLLPVLAPIVLYFVILGIADKSAAFSALGAMLLGVIVTGLFVAISMTAGGGA
WDNAKKYIEDGHYGGKGSEAHKAAVTGDTVGDPYKDTAGPAVNPMIKITNIVALLLLAVLAH
>Q9X913 7.1.3.1~~~hppA~~~K(+)-insensitive pyrophosphate-energized proton pump~~~COG3808
MAELPTSHLAAAVLTDGNRALVAVIAVVALAALVLAGVLVRQVLAAGEGTDSMKKIAAAVQEGANAYLARQLRTLGVFAV
VVFFLLMLLPADDWNQRAGRSIFFLIGAAFSAATGYIGMWLAVRSNVRVAAAAREATPAPGEPEKDLALVSHKATKIAFR
TGGVVGMFTVGLGLLGACCVVLVYAADAPKVLEGFGLGAALIAMFMRVGGGIFTKAADVGADLVGKVEQGIPEDDPRNAA
TIADNVGDNVGDCAGMAADLFESYAVTLVAALILGKVAFGDFGLAFPLLVPAIGVLTAMIGIFAVAPRRSDRSGMSAINR
GFFISAVISLVLVAVAVFVYLPGKYADLDGVTDAAIAGKSGDPRILALVAVAIGIVLAALIQQLTGYFTETTRRPVKDIG
KSSLTGPATVVLAGISLGLESAVYTALLIGLGVYGAFLLGGTSIMLALFAVALAGTGLLTTVGVIVAMDTFGPVSDNAQG
IAEMSGDVEGAGAQVLTDLDAVGNTTKAITKGIAIATAVLAAAALFGSYRDAITTGAADVGEKLSGEGAPMTLMMDISQP
NNLVGLIAGAAVVFLFSGLAINAVSRSAGAVVYEVRRQFRERPGIMDYSEKPEYGKVVDICTRDALRELATPGLLAVMAP
IFIGFTLGVGALGAFLAGAIGAGTLMAVFLANSGGSWDNAKKLVEDGHHGGKGSEAHAATVIGDTVGDPFKDTAGPAINP
LLKVMNLVALLIAPAVIKFSYGADKSVVVRVLIAVVAFAVIAAAVYVSKRRGIAMGDEDDADPEPKSADPAVVS
>Q9S5X0 7.2.3.1~~~hppA~~~K(+)-stimulated pyrophosphate-energized sodium pump~~~COG3808
MYVAALFFLIPLVALGFAAANFAAVVRKPEGTERMKEISSYIRSGADSFLAHETKAIFKVAIVIAILLMIFTTWQTGVAF
LLGAVMSASAGIVGMKMATRANVRVAEAARTTKKIGPALKVAYQGGSVMGLSVGGFALLGLVLVYLIFGKWMGQVDNLNI
YTNWLGINFVPFAMTVSGYALGCSIIAMFDRVGGGVYTKAADMAADLVGKTELNLPEDDPRNPATIADNVGDNVGDVAGL
GADLLESFVGAIVSSIILASYMFPIYVQKIGENLVHQVPKETIQALISYPIFFALVGLGCSMLGILYVIVKKPSDNPQRE
LNISLWTSALLTVVLTAFLTYFYLKDLQGLDVVGFRFGAISPWFSAIIGIFSGILIGFWAEYYTSYRYKPTQFLSKSSIE
GTGMVISNGLSLGMKSVFPPTLTLVLGILFADYFAGLYGVAIAALGMLSFVATSVSVDSYGPIADNAGGISEMCELDPEV
RKITDHLDAVGNTTAAIGKGFAIGSAIFAALSLFASYMFSQISPSDIGKPPSLVLLLNMLDARVIAGALLGAAITYYFSG
YLISAVTKAAMKMVDEIRRQAREIPGLLEGKAKPDYNRCIEITSDNALKQMGYPAFIAILTPLVTGFLLGAEFVGGVLIG
TVLSGAMLAILTANSGGAWDNAKKYLEAGNLEGYGKGSEPHKALVIGDTVGDPLKDTVGPSLDILIKIMSVVSVIAVSIF
KHVHLF
>P80064 1.13.11.27~~~hpd~~~4-hydroxyphenylpyruvate dioxygenase~~~
ADLYENPMGLMGFEFIELASPTPNTLEPIFEIMGFTKVATHRSKDVHLYRQGAINLILNNEPHSVASYFAAEHGPSVCGM
AFRVKDSQKAYKRALELGAQPIHIETGPMELNLPAIKGIGGAPLYLIDRFGEGSSIYDIDFVFLEGVDRHPVGAGLKIID
HLTHNVYRGRMAYWANFYEKLFNFREIRYFDIKGEYTGLTSKAMTAPDGMIRIPLNEESSKGAGQIEEFLMQFNGEGIQH
VAFLSDDLIKTWDHLKSIGMRFMTAPPDTYYEMLEGRLPNHGEPVGELQARGILLDGSSESGDKRLLLQIFSETLMGPVF
FEFIQRKGDDGFGEGNFKALFESIERDQVRRGVLSTD
>Q53586 1.13.11.27~~~hpd~~~4-hydroxyphenylpyruvate dioxygenase~~~COG3185
MTQTTHHTPDTARQADPFPVKGMDAVVFAVGNAKQAAHYYSTAFGMQLVAYSGPENGSRETASYVLTNGSARFVLTSVIK
PATPWGHFLADHVAEHGDGVVDLAIEVPDARAAHAYAIEHGARSVAEPYELKDEHGTVVLAAIATYGKTRHTLVDRTGYD
GPYLPGYVAAAPIVEPPAHRTFQAIDHCVGNVELGRMNEWVGFYNKVMGFTNMKEFVGDDIATEYSALMSKVVADGTLKV
KFPINEPALAKKKSQIDEYLEFYGGAGVQHIALNTGDIVETVRTMRAAGVQFLDTPDSYYDTLGEWVGDTRVPVDTLREL
KILADRDEDGYLLQIFTKPVQDRPTVFFEIIERHGSMGFGKGNFKALFEAIEREQEKRGNL
>Q9JN69 1.11.1.23~~~hppE~~~(S)-2-hydroxypropylphosphonic acid epoxidase~~~
MDVRTLAVGKAHLEALLATRKMTLEHLQDVRHDATQVYFDGLEHLQNVAQYLAIPLSEFFVGQTQSDLDDGVKIARRNGG
FKREEIRGGVHYYTYEHLVTTNQDPGLMALRLDLHSDDEQPLRLNGGHGSREIVYVTRGAVRVRWVGDNDELKEDVLNEG
DSIFILPNVPHSFTNHVGGAKSEIIAINYG
>Q56185 1.11.1.23~~~hppE~~~(S)-2-hydroxypropylphosphonic acid epoxidase~~~
MSNTKTASTGFAELLKDRREQVKMDHAALASLLGETPETVAAWENGEGGELTLTQLGRIAHVLGTSIGALTPPAGNDLDD
GVIIQMPDERPILKGVRDNVDYYVYNCLVRTKRAPSLVPLVVDVLTDNPDDAKFNSGHAGNEFLFVLEGEIHMKWGDKEN
PKEALLPTGASMFVEEHVPHAFTAAKGTGSAKLIAVNF
>P26281 2.7.6.3~~~folK~~~2-amino-4-hydroxy-6-hydroxymethyldihydropteridine pyrophosphokinase~~~COG0801
MTVAYIAIGSNLASPLEQVNAALKALGDIPESHILTVSSFYRTPPLGPQDQPDYLNAAVALETSLAPEELLNHTQRIELQ
QGRVRKAERWGPRTLDLDIMLFGNEVINTERLTVPHYDMKNRGFMLWPLFEIAPELVFPDGEMLRQILHTRAFDKLNKW
>P43777 2.7.6.3~~~folK~~~2-amino-4-hydroxy-6-hydroxymethyldihydropteridine pyrophosphokinase~~~COG0801
MITAYIALGSNLNTPVEQLHAALKAISQLSNTHLVTTSSFYKSKPLGPQDQPDYVNAVAKIETELSPLKLLDELQRIENE
QGRVRLRRWGERTLDLDILLYGNEIIQNERLTIPHYDMHNREFVIVPLFEIASDLVLPNSQIITELVKQFADHKMIKLNP
>P9WNC7 2.7.6.3~~~folK~~~2-amino-4-hydroxy-6-hydroxymethyldihydropteridine pyrophosphokinase~~~COG0801
MTRVVLSVGSNLGDRLARLRSVADGLGDALIAASPIYEADPWGGVEQGQFLNAVLIADDPTCEPREWLRRAQEFERAAGR
VRGQRWGPRNLDVDLIACYQTSATEALVEVTARENHLTLPHPLAHLRAFVLIPWIAVDPTAQLTVAGCPRPVTRLLAELE
PADRDSVRLFRPSFDLNSRHPVSRAPES
>O34483 2.7.11.-~~~hprK~~~HPr kinase/phosphorylase~~~COG1493
MAKVRTKDVMEQFNLELISGEEGINRPITMSDLSRPGIEIAGYFTYYPRERVQLLGKTELSFFEQLPEEEKKQRMDSLCT
DVTPAIILSRDMPIPQELIDASEKNGVPVLRSPLKTTRLSSRLTNFLESRLAPTTAIHGVLVDIYGVGVLITGKSGVGKS
ETALELVKRGHRLVADDCVEIRQEDQDTLVGNAPELIEHLLEIRGLGIINVMTLFGAGAVRSNKRITIVMNLELWEQGKQ
YDRLGLEEETMKIIDTEITKLTIPVRPGRNLAVIIEVAAMNFRLKRMGLNAAEQFTNKLADVIEDGEQEE
>O07664 2.7.11.-~~~hprK~~~HPr kinase/phosphorylase~~~COG1493
MTEVVKIYQLVENLSLEVVYGDEESLNRTIKTGEISRPGLELTGYFNYYSHDRLQLFGSKEITFAERMMPEERLLVMRRL
CAKDTPAFIVSRGLEIPEELITAAKENGVSVLRSPISTSRLLGELSSYLDGRLAVRTSVHGVLVDVYGLGVLIQGDSGIG
KSETALELIKRGHRLIADDRVDVYQQDELTVVGEPPKILQHLIEIRGIGIIDVMNLFGASAVRGFMQVQLVVYLEAWEKD
KKYDRLGSDDAMVEIANVDVPQIRIPVKTGRNVAIIIEVAAMNFRAKTMGYDATKTFEERLTRLIEENSGE
>Q9RE09 2.7.11.-~~~hprK~~~HPr kinase/phosphorylase~~~COG1493
MADSVTVRQLVKATKLEVYSGEEYLDQRQVVLSDISRPGLELTGYFNYYPHERIQLFGRTEISFARNMSSEERLLILKRM
ATEDTPAFLVSRGLEAPAEMITAATAAHIPVLGSRLPTTRLSSLITEYLDSQLAERRSMHGVLVDIYGLGVLITGDSGVG
KSETALELVQRGHRLIADDRVDVYQQDEQTIVGAAPPILSHLLEIRGLGIIDVMNLFGAGAVREDTTISLIVHLENWTPD
KTFDRLGSGEQTQLIFDVPVPKITVPVKVGRNLAIIIEVAAMNFRAKSMGYDATKTFEKNLNHLIEHNEETDQNSSGDK
>P75548 2.7.11.-~~~hprK~~~HPr kinase/phosphorylase~~~
MKKLLVKELIEQFQDCVNLIDGHTNTSNVIRVPGLKRVVFEMLGLFSSQIGSVAILGKREFGFLSQKTLVEQQQILHNLL
KLNPPAIILTKSFTDPTVLLQVNQTYQVPILKTDFFSTELSFTVETYINEQFATVAQIHGVLLEVFGVGVLLTGRSGIGK
SECALDLINKNHLFVGDDAIEIYRLGNRLFGRAQEVAKKFMEIRGLGIINVERFYGLQITKQRTEIQLMVNLLSLEKQTT
VTFERLGTELKKQRLLGVDLSFYEIPISPGRKTSEIIESAVIDFKLKHSGYNSALDFIENQKAILKRKKDES
>P60701 2.7.11.-~~~hprK~~~HPr kinase/phosphorylase~~~
MLTTEKLVETLKLDLIAGEEGLSKPIKNADISRPGLEMAGYFSHYASDRIQLLGTTELSFYNLLPDKDRAGRMRKLCRPE
TPAIIVTRGLQPPEELVEAAKELNTPLIVAKDATTSLMSRLTTFLEHALAKTTSLHGVLVDVYGVGVLITGDSGIGKSET
ALELVKRGHRLVADDNVEIRQINKDELIGKPPKLIEHLLEIRGLGIINVMTLFGAGSILTEKRIRLNINLENWNKQKLYD
RVGLNEETLSILDTEITKKTIPVRPGRNVAVIIEVAAMNYRLNIMGINTAEEFSERLNEEIIKNSHKSEE
>Q9S1H5 2.7.11.-~~~hprK~~~HPr kinase/phosphorylase~~~COG1493
MLTTKSLVERFELEMIAGEAGLNKQIKNTDISRPGLEMAGYFSHYASDRIQLLGTTELSFYNLLPDEERKGRMRKLCRPE
TPAIIVTRDLEPPEELIEAAKEHETPLITSKIATTQLMSRLTTFLEHELARTTSLHGVLVDVYGVGVLITGDSGIGKSET
ALELIKRGHRLVADDNVEIREISKDELIGRAPKLIEHLLEIRGLGIINVMTLFGAGSILTEKRLRLNIHLENWHKEKLYD
RVGLNEETLRILDTEITKKTIPVRPGRNVAVIIEVAAMNYRLNIMGINTAEEFNDRLNAEILRNGNNGNNGEEK
>Q9WXK7 2.7.11.-~~~hprK~~~HPr kinase/phosphorylase~~~
MSVTVKMLVDKVKLDVIYGDDDLLSKEITTSDISRPGLEMTGYFDYYSPERLQLLGMKEWSYLTKMTSHNRRHVLREMIK
PETPAIIVARNLAIPEEMISAAKEKGIAILQSHVPTSRLSGEMSWYLDSCLAERTSVHGVLMDIYGMGVLIQGDSGIGKS
ETGLELVKRGHRLVADDRVDVFAKDEETLWGEPAEILRHLLEIRGVGIIDVMSLYGASAVKDSSQVQLAIYLENYESGKV
FDRLGNGNEELELSGVKIPRLRIPVQTGRNMSVVIEAAAMNYRAKQMGFDATKTFEERLTQLITKNEGNQ
>Q5XD71 2.7.11.-~~~hprK~~~HPr kinase/phosphorylase~~~
MTVTVKMLVQKVKLDVVYATDNLLSKEITTSDISRPGLEMTGYFDYYAPERLQLFGMKEWSYLTQMTSHNRYSVLKEMFK
KDTPAVVVSRNLAIPKEMVQAAKEEGISLLSSRVSTSRLAGEMSYFLDASLAERTSVHGVLMDIYGMGVLIQGDSGIGKS
ETGLELVKRGHRLVADDRVDVYAKDEETLWGEPAEILRHLLEIRGVGIIDVMSLYGASAVKDSSQVQLAIYLENFEAGKV
FDRLGNGNEEITFSGVRIPRIRIPVKTGRNVSVVIEAAAMNHRAKEMGFDATKTFEDRLTQLITKNEVSQ
>Q9ZA98 2.7.11.-~~~hprK~~~HPr kinase/phosphorylase~~~
MTVTVKMLVDKLKLKVIYGNEKLLSKPITTADISRPGLEMTGYFDFYSPERIQLVGMKEWSYLKTLTDHNRYSVFSNMFK
EETPAVIVARGLDIPEEMYRAAKENGVAVLQGRNGTSSLSGDMSWYLNAQLAERTSVHGVLVDIYGMGVLIQGDSGIGKS
ETGLELVKRGHRLVADDRVDVYAKDEETLWGEPAEILHHLLEIRGVGIIDVMSLYGASAVRNSSQVQLAIYLENFEDGKV
FDRLGNGNEEIELQGVKIPRVRIPVKTGRNVSVVIEAAAMNYRAKQMGFDATKTFEERLTNLISKNGED
>P76340 ~~~hprR~~~Transcriptional regulatory protein HprR~~~COG0745
MKILLIEDNQRTQEWVTQGLSEAGYVIDAVSDGRDGLYLALKDDYALIILDIMLPGMDGWQILQTLRTAKQTPVICLTAR
DSVDDRVRGLDSGANDYLVKPFSFSELLARVRAQLRQHHALNSTLEISGLRMDSVSHSVSRDNISITLTRKEFQLLWLLA
SRAGEIIPRTVIASEIWGINFDSDTNTVDVAIRRLRAKVDDPFPEKLIATIRGMGYSFVAVKK
>P47666 1.11.1.-~~~~~~Hydroperoxide reductase~~~COG1765
MDKKYDITAVLNDDSSINAVSDNFQITLDARPKEKSKGINPLSAFLAGLAACELATANAMAAAKMITLNKALINIKGYRL
TNPSDGYFGLRELNIHWEIHSPNEEEEIKEFIDFVSKRCPAHNTLHGTSNFKINISVTLVH
>P75170 1.11.1.-~~~~~~Hydroperoxide reductase~~~
MDKKYDITAVLNEDSSMTAISDQFQITLDARPKHTAKGFGPLAALLSGLAACELATANLMAPAKMITINKLLMNVTGSRS
TNPTDGYFGLREINLHWEIHSPNSETEIKEFIDFVSKRCPAHNTLQGVSQLKINVNVTLVH
>P76339 2.7.13.3~~~hprS~~~Sensor histidine kinase HprS~~~COG2205
MKRLSITVRLTLLFILLLSVAGAGIVWTLYNGLASELKWRDDTTLINRTAQIKQLLIDGVNPDTLPVYFNRMMDVSQDIL
IIHGDSINKIVNRTNVSDGMLNNIPASETISAAGIYRSIINDTEIDALRINIDEVSPSLTVTVAKLASARHNMLEQYKIN
SIIICIVAIVLCSVLSPLLIRTGLREIKKLSGVTEALNYNDSREPVEVSALPRELKPLGQALNKMHHALVKDFERLSQFA
DDLAHELRTPINALLGQNQVTLSQTRSIAEYQKTIAGNIEELENISRLTENILFLARADKNNVLVKLDSLSLNKEVENLL
DYLEYLSDEKEICFKVECNQQIFADKILLQRMLSNLIVNAIRYSPEKSRIHITSFLDTNSYLNIDIASPGTKINEPEKLF
RRFWRGDNSRHSVGQGLGLSLVKAIAELHGGSATYHYLNKHNVFRITLPQRN
>P0A9M2 2.4.2.8~~~hpt~~~Hypoxanthine phosphoribosyltransferase~~~COG0634
MKHTVEVMIPEAEIKARIAELGRQITERYKDSGSDMVLVGLLRGSFMFMADLCREVQVSHEVDFMTASSYGSGMSTTRDV
KILKDLDEDIRGKDVLIVEDIIDSGNTLSKVREILSLREPKSLAICTLLDKPSRREVNVPVEFIGFSIPDEFVVGYGIDY
AQRYRHLPYIGKVILLDE
>O33799 2.4.2.8~~~hpt~~~Hypoxanthine phosphoribosyltransferase~~~
MKHTVEVMIPEAEIKARIAELGRQITERYKDSGSEMVLVGLLRGSFMFMADLCREVQVPHEVDFMTASSYGSGMSTTRDV
KILKDLDEDIRGKDVLIVEDIIDSGNTLSKVREILGLREPKSLAICTLLDKPSRREVDVPVEFVGFSIPDEFVVGYGIDY
AQRYRHLPYVGKVVLLDE
>P11065 ~~~hpr~~~DNA-binding transcriptional repressor ScoC~~~COG1846
MNRVEPPYDVKEALVFTQKMAQLSKALWKSIEKDWQQWLKPYDLNINEHHILWIAYQLNGASISEIAKFGVMHVSTAFNF
SKKLEERGYLRFSKRLNDKRNTYVQLTEEGTEVFWSLLEEFDPTRNAVFKGSQPLYHLFGKFPEVAEMMCMIRHIYGDDF
MEIFETSLTNIDNDFESVNGKLKKKAKDSAADEPAEELEPVNS
>C0CMQ8 1.1.1.81~~~hpr~~~Hydroxypyruvate reductase~~~COG1052
MKIVVLDGYCLNPGDLDWKGLEALGECIVYDRTSLTDMEEVISRIGDADIVYTNKTPMPREVFEKCPNIRFVGVLATGYN
VVDVNTAKEKGIPVANIPTYGTASVGQFAIALLLEICHHVGHHNQVVHEGKWESNPDWCFWDYPLIELDGKNMGIIGYGR
IGQATGKIAQALGMKVLAYDAYKNPALENENCRYVELDELLSQSDVIALHCPLFPETEGIVNKENIAKMKDGVIILNNSR
GPLIVEQDLVDALNSGKVAAAGLDVVSTEPIKGDNPLLGAKNCIITPHISWAPKESRKRLMDIAVNNLEEFLKGSPVNVV
NK
>Q9X1C1 1.1.1.81~~~~~~Hydroxypyruvate reductase~~~COG1052
MARYRVHVNDPLDKEATQLLMNKEELEVTSEHLEKDELMKIIPEVDVLVVRSATKVTADIIEAGKNLKIIARAGIGLDNI
DVQKAKEKGIKVLNTPGASAPSVAELAMGLMLACARHIARATVSLKEGKWEKKALKGKELLGKTLGLIGFGNIGQEVAKR
ALAFGMKIIAYDPAKPETDLPVEYVDLDTLFKESDFISLHVPLTESTRHIINRESIAKMKDGVIIVNTARGGTIDEEALY
EEVVSGKVYAAGLDVFEVEPPTDEIRRKLLSLDNVVATPHIGASTAEAQRRVGIELVEKIFKELGI
>E5Y7I4 4.4.1.41~~~hpsG~~~(2S)-3-sulfopropanediol sulfolyase~~~COG1882
MSQCCCLSPQEERLQGTKKVNRQGRERVYKILDRIQFTVPHVDIERARYFTESMRQTEGELLTLRWAKALKNVAEKMTVY
ITPDQLLAGRVGQLGRYGILYPEIDGDFYIEVMKDLPNREKSPFQIDPTDMQILMEEIAPYWEGKTYHEHLNKVLPAEIR
GVTYHDERGLKSKFVVSETSSYRSALQWVPDYEKAMKRGFIDIQNEAKAKLAGLDLTNSVDIWEKKPFLEAMIIVCDAIM
IWAKRHAQLARDTAAATSDPVRKQELLRMADICEHVPAYPARNFREAVQCQWFVQMFSRIEQKASAIISNGRMDQYLYPY
YKKDIEEGTLTSEEAKELLECMWVDMAQFIDLYINPTGNEFQEGYAHWEAVTVGGQTPEGEDATNELSYLFLESKREFPM
TYPDLAVRIHSRTPDRFLYEIALTVQDGSGFPKLINDEEVVPLNAIKGCPINEALDYAISGCTETRMPNRDTYTSGCVYI
NFATALEMLMNNGRLHYYGDELIGLETGDPTRFQTWEEFYEAYKAQHINLLQKAFQQQHIVDRLRPQHFAAPLSSVLHNL
CMKNMQDLHSEKIEGGVDYSYFEFLGYATVVDSLAAIKKLVFEEKRLTMREVLDAMNANFVGYEPIQEMLKNAPCYGNND
PYADSIAKDVDRFTQVEAEKSSRDRGIHVDVRYVPITSHVPFGKIIAATPNGRVAGFPLADGSSASHGADHNGPTAVLLS
NYHSKNYGMINRASRLLNIKLSPKCVAGEQGAKKIMSIIRTWCDLKLWHLQFNIVNRDTLLAAQKDPNSYRNLIVRVAGY
SAYFCDMSPDLQNDIIDRTEHADL
>Q46N53 1.1.1.308~~~hpsN~~~Sulfopropanediol 3-dehydrogenase~~~COG0141
MISYLKKAEKTPQTETATAQKVVTEMLAEIQARGKDAVRQYAKQLDGWSGDIVLTPDQIREQTKDVPAGVRADIDFAIRQ
VTDFALAQRESLKEFSVELHPGVTAGQRVLPVNVVGCYAPAGRYAHIASAYMGVATAKAAGVKTVVACSSPFRGQGIHPH
VLYAFQAAGADVIMALGGVQAIASMAYGLFTGKPADVVVGPGNKFVAEAKRSLYGQVGIDVFAGPSEVAVIADETADPAI
VASDLVGQAEHGHESPAWLFTTSRDLADRVMALVPELIAKLPPTARDAATAAWRDYGEVILCGTREEVVEISDRYASEHL
EVHTADLDWWLANLTCYGSLFLGEETTVAFGDKTSGPNHVLPTKGAARYSGGLSVHKFMKTLTWQQMTREATRQIGQVTA
RISRLEGMEAHARTADDRMAKYFPNASFEMGTPVEV
>P42405 4.1.2.43~~~hxlA~~~3-hexulose-6-phosphate synthase~~~COG0269
MELQLALDLVNIPEAIELVKEVEQYIDVVEIGTPVVINEGLRAVKEIKEAFPQLKVLADLKIMDAGGYEIMKASEAGADI
ITVLGATDDATIKGAVEEAKKQKKKILVDMINVKDIESRAKEIDALGVDYICVHTGYDLQAEGKNSFEELTTIKNTVKNA
KTAIAGGIKLDTLPEVIQQKPDLVIVGGGITSAADKAETASKMKQLIVQG
>Q48907 4.1.2.43~~~rmpA~~~3-hexulose-6-phosphate synthase~~~
MALTQMALDSLDFDATVALAEKVAPHVDILEIGTPCIKHNGIKLLETLRAKFPNNKILVDLKTMDAGFYEAEPFYKAGAD
ITTVLGVADLGTIKGVIDAANKYGKKAQIDLINVGDKAARTKEVAKLGAHIIGVHTGLDQQAAGQTPFADLATVTGLNLG
LEVSVAGGVKPATVAQVKDAGATIIVAGAAIYGAADPAAAAAEITGLAK
>Q9LBW4 4.1.2.43~~~rmpA~~~3-hexulose-6-phosphate synthase~~~
MKLQVAIDLLSTEAALELAGKVAEYVDIIELGTPLIEAEGLSVITAVKKAHPDKIVFADMKTMDAGELEADIAFKAGADL
VTVLGSADDSTIAGAVKAAQAHNKGVVVDLIGIEDKATRAQEVRALGAKFVEMHAGLDEQAKPGFDLNGLLAAGEKARVP
FSVAGGVKVATIPAVQKAGAEVAVAGGAIYGAADPAAAAKELRAAIA
>Q7A774 4.1.2.43~~~~~~3-hexulose-6-phosphate synthase~~~
MELQLAIDLLNKEDAAELANKVKDYVDIVEIGTPIIYNEGLPAVKHMADNISNVKVLADMKIMDAADYEVSQAIKFGADV
ITILGVAEDASIKAAIEEAHKNNKQLLVDMIAVQDLEKRAKELDEMGADYIAVHTGYDLQAEGQSPLESLRTVKSVIKNS
KVAVAGGIKPDTIKDIVAESPDLVIVGGGIANADDPVEAAKQCRAAIEGK
>Q2G1E1 ~~~hptR~~~Transcriptional regulatory protein HptR~~~COG2207
MFKVVICDDERIIREGLKQIIPWGDYHFNTIYTAKDGVEALSLIQQHQPELVITDIRMPRKNGVDLLNDIAHLDCNVIIL
SSYDDFEYMKAGIQHHVLDYLLKPVDHAQLEVILGRLVRTLLEQQSQNGRSLASCHDAFQPLLKVEYDDYYVNQIVDQIK
QSYQTKVTVSDLIQHIDVSESYAMRTFKDHVGITIVDYLNRYRILQSLQLLDRHYKHYEIADKVGFSEYKMFSYHFKKYL
QMSPSDYCKQAK
>Q5HJF7 ~~~hptR~~~Transcriptional regulatory protein HptR~~~
MFKVVICDDERIIREGLKQIIPWGDYHFNTIYTAKDGVEALSLIQQHQPELVITDIRMPRKNGVDLLNDIAHLDCNVIIL
SSYDDFEYMKAGIQHHVLDYLLKPVDHAQLEVILGRLVRTLLEQQSQNGRSLASCHDAFQPLLKVEYDDYYVNQIVDQIK
QSYQTKVTVSDLIQHIDVSESYAMRTFKDHVGITIVDYLNRYRILQSLQLLDRHYKHYEIADKVGFSEYKMFSYHFKKYL
QMSPSDYCKQAK
>Q7A7X9 ~~~hptR~~~Transcriptional regulatory protein HptR~~~
MFKVVICDDERIIREGLKQIIPWGDYHFNTIYTAKDGVEALSLIQQHQPELVITDIRMPRKNGVDLLNDIALLDCNVIIL
SSYDDFEYMKAGIQHHVLDYLLKPVDHAQLEVILGRLVRTLLEQQSQNGRSLASCHDAFQPLLKVEYDDYYVNQIVDQIK
QSYQTKVTVSDLIQHIDVSESYAMRTFKDHVGITIVDYLNRYRILQSLQLLDRHYKHYEIADKVGFSEYKMFSYHFKKYL
QMSPSDYCKQAK
>Q2G1E0 2.7.13.3~~~hptS~~~Sensor protein kinase HptS~~~COG2972
MTAYKPYRHQLRRSLFASTIFPVFLVIIIGLVSFYAIYIWIEHRTIHQHVDESQSSLHHTEKQIQTFITQHNNSFQELDL
TNHHDVTATKRELLKLIHQQPATLYYELSGPNQFITNNYEHLNTKNMYLFSTHQLKFKNSTYMLKIYMANTPRLSEIKKD
NRQFALIVDQYDNILYANDDRFTIGEKYRPQQFGFMNESVKLNHADHRLIIYKDIHENIEDGITLLIVMAVVLVLLVIFG
FISADNMAKRQTKDIETIIQKIYYAKNRHLGTYTPLKNNSELEEINNYIYDLFESNEQLIHSIEHTERRLRDIQLKEIER
QFQPHFLFNTMQTIQYLITLSPKLAQTVVQQLSQMLRYSLRTNSHTVELNEELNYIEQYVAIQNIRFDDMIKLHIESSEE
ARHQTIGKMMLQPLIENAIKHGRDTESLDITIRLTLARQNLHVLVCDNGIGMSSSRLQYVRQSLNNDVFDTKHLGLNHLH
NKAMIQYGSHARLHIFSKRNQGTLICYKIPLSRGNVDV
>Q5HJF6 2.7.13.3~~~hptS~~~Sensor protein kinase HptS~~~
MTAYKPYRHQLRRSLFASTIFPVFLVIIIGLVSFYAIYIWIEHRTIHQHVDESQSSLHHTEKQIQTFITQHNNSFQELDL
TNHHDVTATKRELLKLIHQQPATLYYELSGPNQFITNNYEHLNTKNMYLFSTHQLKFKNSTYMLKIYMANTPRLSEIKKD
NRQFALIVDQYDNILYANDDRFTIGEKYRPQQFGFMNESVKLNHADHRLIIYKDIHENIEDGITLLIVMAVVLVLLVIFG
FISADNMAKRQTKDIETIIQKIYYAKNRHLGTYTPLKNNSELEEINNYIYDLFESNEQLIHSIEHTERRLRDIQLKEIER
QFQPHFLFNTMQTIQYLITLSPKLAQTVVQQLSQMLRYSLRTNSHTVELNEELNYIEQYVAIQNIRFDDMIKLHIESSEE
ARHQTIGKMMLQPLIENAIKHGRDTESLDITIRLTLARQNLHVLVCDNGIGMSSSRLQYVRQSLNNDVFDTKHLGLNHLH
NKAMIQYGSHARLHIFSKRNQGTLICYKIPLSRGNVDV
>Q6F6Y2 1.14.13.113~~~hpxO~~~FAD-dependent urate hydroxylase~~~COG0654
MNVVIIGAGMGGLTTGIALKKFGHQVTIFEQAEQILPVGAAISLWSNGVKCLNYLGLNEQIAKLGGQMDNLAYVDGLTGD
VMTEFSLQPLIEEVGQRPYPVSRAELQNMLMDEFGREDIHLGKRMVALQQKDDQVEIEFADGSSILADVLVGADGTHSIT
RTYVLGEKVERRYAGYVNWNGLVDISSDLAPADQWTTYVGEGKRASLMPVADNRFYFFLDVPLEAGLENDKCKYKETLQS
YFKGWCPQVQTLIERLDPQKTNRVEICDIEPFAQFYKGRVVLVGDAAHSTTPDIGQGGCQAMEDAIYLARSLQINTLSVE
DALRRYQEKRNQRANELVLRARKRCDVTHMKDEAVTTAWYAELRQEKGLHIMNGIISNIVGNPLD
>A6T923 1.14.13.113~~~hpxO~~~FAD-dependent urate hydroxylase~~~
MKAIVIGAGIGGLSAAVALKQSGIDCDVYEAVKEIKPVGAAISVWPNGVKCMAHLGMGDIMETFGGPLRRMAYRDFRSGE
NMTQFSLAPLIERTGSRPCPVSRAELQREMLDYWGRDSVQFGKRVTRCEEDADGVTVWFTDGSSASGDLLIAADGSHSAL
RPWVLGFTPQRRYAGYVNWNGLVEIDEALAPGDQWTTFVGEGKRVSLMPVSAGRFYFFFDVPLPAGLAEDRDTLRADLSR
YFAGWAPPVQKLIAALDPQTTNRIEIHDIEPFSRLVRGRVALLGDAGHSTTPDIGQGGCAAMEDAVVLGAVFRQTRDIAA
ALREYEAQRCDRVRDLVLKARKRCDITHGKDMQLTEAWYQELREETGERIINGMCDTILSGPLG
>A1TFU9 1.14.13.113~~~hpxO~~~FAD-dependent urate hydroxylase~~~COG0654
MKVVIVGAGMGGMSAAIALRQIGIDTVVYERVTENKPVGAAISVWSNGVKCLNYLGLQEETAELGGKVETMSYVDGHTGD
TMCRFSMHPLIEQVGQRPYPIARAELQLMLMKAYGIDDINFGMKMVGVENDTAGSAAKATFADGTTVSADVIIGADGAGS
ITREYVLGGPVSRRYAGYVNYNGLVSTDDAIGPATEWTTYVGDGKRVSVMPVSDDRFYFFFDVVEPQGSPYEEGRVREVL
RAHFAGWTPGVQTLIDTLDPLATNRVEILDLDPFHTWVKGRVAVLGDAAHNTTPDIGQGGCSAMEDAIALQWAFKDHPDD
VHAALAAYQSARTERAADLVLRARKRCDVTHAKDPQVTSRWYDELRNEDGTNIIRGIVGNIVGGPLTPVTAATEG
>A6T9C8 3.5.1.126~~~hpxW~~~Oxamate amidohydrolase proenzyme~~~
MHSSNVSTHGMAVAPHHLASQSALAILREGGSAIEAMVAAAAAIAVVYPHMNGLGGDGFWLIVPPEGDPIAIDASGAAGS
LATLEAYAGQRHIPNRGPQAALTVAGTVSGWVEALRISRDLTGRALPVARLLADAIGYAEDGIPVTASQAHATASKLEEL
RHQPGFSETWLVAGEAPRPGSRFRQPALAGTLRMLASDGLDSFYRGPLAERLAQGMAALGMPITLGDLQAHRARRPGPLT
LQHQQGTLWNLAPPTQGLVSLAILGITDRLKMADADDAQTVHRIVEATKRAFALRDAHITDPRHLDVDVQQLLTPEALQP
LADSIDDASASPWGGGKGPGDTVWMGVVDNSGLAVSFIQSIYHEFGSGVVLPDTGIVWQNRGAAFSLDPQHLLALAPGKQ
PFHTLNPAAARLNDGRVMVYGSMGGDGQPQTQAALFTRYILQGVPLQESISRPRWLLGRTWGQSSDSLKLEGRFAPACIA
RLRELGHDVEVLADFSEAMGHAGAIVRHPNGLLEGATDPRSNGAAAGY
>Q8PDQ6 1.14.13.-~~~hpyO~~~FAD-dependent urate hydroxylase~~~COG2072
MSLAQLEHALQHDLQRLAHGGEPWVRPRVHPAGHVYDVVIVGAGQSGLGAAFALQRERVHNVLVIDENPPGQEGPWVTYA
RMQTLRTPKQITSIDLGVPTLTFRAWWEAQHGAAGWDALDKIPRGTWMDYLRWYRAALRLPVRNATQLVRIEPDAAPGIH
RLHLAMGAPLMARKIILATGIQGGGQWQVPEWITQALPAQRYAHTSGPIDYAALAGKRVGILGGGASAFDNACFALDQGV
ARAEVFVRRAALPRVNPIRHMEQAGIIPRFAALPDADKYRMMASFFGRNQPPTNDTFQRACAHAGFALHLDAPWLGVEEH
NDVVVVRTPQGEHRFDFLAIATGLVTDPRLRPELAALSGRIACWADRYQAPPGQANPVLDAHPYLGPGFELLPRTPDDAA
AVDGLFAFNYSALINHGLSAAALSGLKVALPRLARAVADQLFLDDRQAMVEAYLGYDQAEFVGQWPQPTQAVA
>Q45881 ~~~hcbA~~~Histone-like protein Hq1~~~
MPAKKRKTTRQRRRSKARSASANTAALRKVSKERDQARRKLRAAQKKLAKAKKDASRKLAKLRKEAARKVAAAKKTRAPS
KKGRKKATRKKGGGRSRKTARKVSTMKRGRGRPRKKA
>Q9FDN6 ~~~hrb~~~High molecular weight rubredoxin~~~COG1773
MDTKALHTLTYGLYIITAKKGDRFNGQVANTVFQITSDPPTIAVSINKQNLTHEFIQAGQGFVISVLAREVPLSLIGQFG
FKSGREMDKFAGINYKLSEGGLPYLADHTLAYLEASLNQTVDAGTHSIFIGTVTDAAVLLQGEPMTYAYYHQVKRGTTPK
TAPTFTVGREKDKTALASPKYQCTICNYVYDPVQGDPEHGIAPGTPFADLPEDWTCPICGAGKDAFEQI
>Q5EF74 ~~~hrcA~~~Heat-inducible transcription repressor HrcA~~~
MQVQLTNRQQHILWATVRHYIATAEPVGSKALIEEYDLGVSSATIRNVMGVLEKSGLLYQPHTSAGRVPSDSGYRIYVDK
LITPSEVLAKEVESALQQRLQWEDWSLEILLQGAAQILASLSGCISLITMPQTNTATVRHLQLMQIEAGRIMLILVTDNY
ETHSKLMDLPPGRSEKPDPEVIDRELQIVSNFLNSHLRGRSLLEITTLDWSQLDREFQLYGEFLKTSVAGLANRTAAPAA
TQIMVRGVAEVLRQPEFSQLQQVQTIIQLLEEEQEQLWRLIFEEPELEDTNKSKVTVRIGAENPLEPIRTCSLISSTYRR
GAVPLGSVGSSRPSRLDYENAIAVVAAAADYLSEAFS
>P30727 ~~~hrcA~~~Heat-inducible transcription repressor HrcA~~~COG1420
MEMEERKLKILQAIINDYINNGEPVGSRTIAKKYNLGISSATIRNEMADLEEMGYIEQLHTSSGRKPSDKGYRLYVDRLM
EIPSMSVEEEMLIKAKIIDSALYEIDKLVKQAMSLVSEMTKLTCVVKSLSARKSYIKSISLINIEPNMILCVFITDSGMI
KNSIIRVKSNIENSSLERIANILNSKLKGLTIEQINLEVINNIKKDLREYGHIFDCIMPNLYDILREADSTEVYKEGTMN
IFNYPEFKDIEKAKEFLSVIDDRRILDTLFNASGGVTVNIGNENSIKEARDFSVVSSVYKYNGRPLGTIGIIGPTRIPYS
KVIKVIMEVVDQINNNLDKMNNS
>A0R0T9 ~~~hrcA~~~Heat-inducible transcription repressor HrcA~~~COG1420
MGSADDRRFEVLRAIVADFVATKEPIGSKTLVERHNLGVSSATVRNDMAVLEAEGYITQPHTSSGRVPTEKGYREFVDRI
DNVKPLSSSERRAILNFLESGVDLDDVLRRAVRLLAQLTRQVAIVQYPTLSTSSVRHLEVVALTPARLLLVVITDTGRVD
QRIVELGDAIDEHELSKLRDMLGQAMEGKPLAQASIAVSDLASHLNGSDRLGDAVGRAATVLVETLVEHTEERLLLGGTA
NLTRNTADFGGSLRSVLEALEEQVVVLRLLAAQQEAGKVTVRIGHETEAEQMAGASVVSTAYGSSGKVYGGMGVVGPTRM
DYPGTIANVAAVALYIGEVLGSR
>P9WMK3 ~~~hrcA~~~Heat-inducible transcription repressor HrcA~~~COG1420
MGSADERRFEVLRAIVADFVATQEPIGSKSLVERHNLGVSSATVRNDMAVLEAEGYITQPHTSSGRVPTEKGYREFVDRL
EDVKPLSSAERRAIQSFLESGVDLDDVLRRAVRLLAQLTRQVAVVQYPTLSTSTVRHLEVIALTPARLLMVVITDSGRVD
QRIVELGDVIDDHQLAQLREILGQALEGKKLSAASVAVADLASQLGGAGGLGDAVGRAATVLLESLVEHTEERLLLGGTA
NLTRNAADFGGSLRSILEALEEQVVVLRLLAAQQEAGKVTVRIGHETASEQMVGTSMVSTAYGTAHTVYGGMGVVGPTRM
DYPGTIASVAAVALYIGDVLGAR
>P68792 ~~~hrcA~~~Heat-inducible transcription repressor HrcA~~~
MITDRQLSILNAIVEDYVDFGQPVGSKTLIERHNLNVSPATIRNEMKQLEDLNYIEKTHSSSGRSPSQLGFRYYVNRLLE
QTSHQKTNKLRRLNQLLVENQYDVSSALTYFADELSNISQYTTLVVHPNHKQDIINNVHLIRANPNLVIMVIVFSSGHVE
HVHLASDIPFSNDKLNTISNFVTNKLTEFNQNLQDDIVSFVQSEQEEIFINKLINTMNNHISNQSNSIYMGGKVKLIDAL
NESNVSSIQPILQYIESNRIAELLQDISSPNINVKIGNEIDDSLSDISIVTSQYHFDETLKGQIAVIGPTAMHYQNVIQL
LNRIW
>P68793 ~~~hrcA~~~Heat-inducible transcription repressor HrcA~~~
MITDRQLSILNAIVEDYVDFGQPVGSKTLIERHNLNVSPATIRNEMKQLEDLNYIEKTHSSSGRSPSQLGFRYYVNRLLE
QTSHQKTNKLRRLNQLLVENQYDVSSALTYFADELSNISQYTTLVVHPNHKQDIINNVHLIRANPNLVIMVIVFSSGHVE
HVHLASDIPFSNDKLNTISNFVTNKLTEFNQNLQDDIVSFVQSEQEEIFINKLINTMNNHISNQSNSIYMGGKVKLIDAL
NESNVSSIQPILQYIESNRIAELLQDISSPNINVKIGNEIDDSLSDISIVTSQYHFDETLKGQIAVIGPTAMHYQNVIQL
LNRIW
>P68794 ~~~hrcA~~~Heat-inducible transcription repressor HrcA~~~
MITDRQLSILNAIVEDYVDFGQPVGSKTLIERHNLNVSPATIRNEMKQLEDLNYIEKTHSSSGRSPSQLGFRYYVNRLLE
QTSHQKTNKLRRLNQLLVENQYDVSSALTYFADELSNISQYTTLVVHPNHKQDIINNVHLIRANPNLVIMVIVFSSGHVE
HVHLASDIPFSNDKLNTISNFVTNKLTEFNQNLQDDIVSFVQSEQEEIFINKLINTMNNHISNQSNSIYMGGKVKLIDAL
NESNVSSIQPILQYIESNRIAELLQDISSPNINVKIGNEIDDSLSDISIVTSQYHFDETLKGQIAVIGPTAMHYQNVIQL
LNRIW
>Q9WZV5 ~~~hrcA~~~Heat-inducible transcription repressor HrcA~~~COG1420
MRRLNRKNNEALKKLNDRQRKVLYCIVREYIENKKPVSSQRVLEVSNIEFSSATIRNDMKKLEYLGYIYQPHTSAGRIPT
DKGLRFYYEEMLKISKETSEADLAVETFKSMPLADPEKVLFLAGNLLARLTEGYVLIERPNTRDLKILRVMLIPVSEDYL
IFSILTEFGVSKVTPIKTQERLNWEEIERQLNFLLRGRTVGEVLMGKIESLKGSGFLRLIESLIGETVERYLDAGLENLL
KDETLTLEDIRNLLEEVKDQKFLESLVGEGITVRIGREIGRKKLEKFAVFSGKYFKGESPIGSVYLFTSKVTKYDRNHRV
FEYILNRLSEYFTSTSRR
>O85093 ~~~hrcQa~~~Type III secretion protein hrcQa~~~
MSALRLRKVDALLAQATRELGAGQSLGFSAAGQDAELTLLPLLADAGEPAGAVWLSTAIGPLLLSDAEALLSLLGDIPLT
LGGEQQAWYWQLFNQRLSPTVARLLAPVEPLHNKPQAPTLGCRVQIRRGGEQLHAHMHATPDTLLRLLRSASWQARTRTV
DESWSVASPLIIGEMSLTREQIASLRPGDVVLPAHCQFDSAGQGFLSLAGRQWAAQTDQHAQRLFLRLSHEEHRHHEY
>O85094 ~~~hrcQb~~~Type III secretion protein HrcQb~~~
MSTEDLYQEDVEMLDDYEDPSTEQHWSEEDGEPSGYATAEPDDHAAQEEQDEPPALDSLALDLTLRCGELRLTLAELRRL
DAGTILEVTGISPGHATLCHGEQVVAEGELVDVEGRLGLQITRLVTRS
>Q60235 ~~~hrcQb~~~Type III secretion protein HrcQb~~~
MSTEDLYQDDVEMLDDYEEPVPEQADQQQRDDEYAEHAFGYADSDAEHEEQSGDHHESPMLDSLELDLTLRCGDLRLTLA
ELRRLDAGSILEVSGIAPGHATLCHGEQVVAEGELVDVEGRLGLQITRLVARS
>P18182 ~~~hrdA~~~RNA polymerase principal sigma factor HrdA~~~COG0568
MRGGQRRASRLRPPTYRRRPPPAASILEVAPVQTQTLTQTDTAAGGAEPDAERGVLLAMPAQPGAGAALPHPGAPVDVPE
HPEPPPPTRTESGGPSSDLFRQYLREIGRIPLLSAAEEVDLARRVEAGLFAEEKLRCSPGLDDRLALDLDRLVVLGRLAK
RRLIEANLRLVVSVAKRYVGRGLTMLDLVQEGNLGLIRAVEKFDYARGYKFSTYATWWIRQAMSRALADQARTIRVPVHV
VELINRVVRVQRRMLQERGCEPTPQEVAAHLDLAPERVGEVLRLAQEPVSLHAPVGEEDDVALGDLIEDGDAASPVESAA
FLLLRQHLEAVLSTLGERERKVVQLRYGLADGRPRTLEEIGRLFGVTRERIRQIESKTLSKLRDHAYADQLRGYLD
>P9WJA2 ~~~hrp1~~~Hypoxic response protein 1~~~
MTTARDIMNAGVTCVGEHETLTAAAQYMREHDIGALPICGDDDRLHGMLTDRDIVIKGLAAGLDPNTATAGELARDSIYY
VDANASIQEMLNVMEEHQVRRVPVISEHRLVGIVTEADIARHLPEHAIVQFVKAICSPMALAS
>P9WJA3 ~~~hrp1~~~Hypoxic response protein 1~~~COG0517
MTTARDIMNAGVTCVGEHETLTAAAQYMREHDIGALPICGDDDRLHGMLTDRDIVIKGLAAGLDPNTATAGELARDSIYY
VDANASIQEMLNVMEEHQVRRVPVISEHRLVGIVTEADIARHLPEHAIVQFVKAICSPMALAS
>O51767 3.6.4.13~~~hrpA~~~ATP-dependent RNA helicase HrpA~~~
MNDFKLPIYKYKDELIKVLKNHNVLIVESPTGSGKTTQLPRIIYEAGFAKLGKIGVTQPRRIATVSIAEYIAKHIGVNVG
EEVGYKIRFEEITSPKTKIKLMTDGVLLQELKKDTLLYEYDVIIIDEAHERSLNIDFILGLIKDISRKRDDFKIIVSSAT
INTKIFSKYFNNAPVVSIETITYPVQIIYNPPLLNTSKGMILKIKEIVLNVIKEKKAGDILIFLSGEKEIKETIKELQEL
NSKKNLIIFPLYGRMPKEAQEQIFMTTPKNKRKIIVSTNIAETSITIENIKIVIDSGKVKTNKFQTKTHTYSLQEVPISK
SSATQRAGRAGRLSKGTCYRLYKREDYQLREDYQKEEIYRTDLSEVVLRMADIGIRDFTHFDFISKPSTHSIQTASKILK
SLDAINNKNELTEIGKYMILFPLIPAHSRALVEAMINYPQAIYQTTIGLSFLSTSGIFLLPQNEEMEARQAHLKYKNPMG
DLIGFVNIFEDFKKALNKEAFTKENYLDLQGLEEIANVQMQLENIISKLNIPIIQKGVFDNEGYLKSIMRGMRDYICFKT
SKKKYKTIKAQNVIIHPGSLISTDSVKYFVAGEIIETTKMYARSIGVLKKEWIDDIILNEEFKHNDISSKENQITNTGQT
KIINEIKIGKKIFKAEYKNNIYVIKINLETLKEIIFKNELNNQNNEDLKKIKIQLMHKNITVFNNKKFLETIEIVKNMGK
DWHCIKKYETKNVNIDEPEKMKNLLECTMQFISFPPKKNALFLSLETDYSGNFRLKPKQNFIMAIEESIESIKSLIENKE
YIQKLHFIKKLINKVYKKLNYFF
>P43329 3.6.4.13~~~hrpA~~~ATP-dependent RNA helicase HrpA~~~COG1643
MTEQQKLTFTALQQRLDSLMLRDRLRFSRRLHGVKKVKNPDAQQAIFQEMAKEIDQAAGKVLLREAARPEITYPDNLPVS
QKKQDILEAIRDHQVVIVAGETGSGKTTQLPKICMELGRGIKGLIGHTQPRRLAARTVANRIAEELKTEPGGCIGYKVRF
SDHVSDNTMVKLMTDGILLAEIQQDRLLMQYDTIIIDEAHERSLNIDFLLGYLKELLPRRPDLKIIITSATIDPERFSRH
FNNAPIIEVSGRTYPVEVRYRPIVEEADDTERDQLQAIFDAVDELSQESHGDILIFMSGEREIRDTADALNKLNLRHTEI
LPLYARLSNSEQNRVFQSHSGRRIVLATNVAETSLTVPGIKYVIDPGTARISRYSYRTKVQRLPIEPISQASANQRKGRC
GRVSEGICIRLYSEDDFLSRPEFTDPEILRTNLASVILQMTALGLGDIAAFPFVEAPDKRNIQDGVRLLEELGAITTDEQ
ASAYKLTPLGRQLSQLPVDPRLARMVLEAQKHGCVREAMIITSALSIQDPRERPMDKQQASDEKHRRFHDKESDFLAFVN
LWNYLGEQQKALSSNAFRRLCRTDYLNYLRVREWQDIYTQLRQVVKELGIPVNSEPAEYREIHIALLTGLLSHIGMKDAD
KQEYTGARNARFSIFPGSGLFKKPPKWVMVAELVETSRLWGRIAARIDPEWVEPVAQHLIKRTYSEPHWERAQGAVMATE
KVTVYGLPIVAARKVNYSQIDPALCRELFIRHALVEGDWQTRHAFFRENLKLRAEVEELEHKSRRRDILVDDETLFEFYD
QRISHDVISARHFDSWWKKVSRETPDLLNFEKSMLIKEGAEKISKLDYPNFWHQGNLKLRLSYQFEPGADADGVTVHIPL
PLLNQVEESGFEWQIPGLRRELVIALIKSLPKPVRRNFVPAPNYAEAFLGRVKPLELPLLDSLERELRRMTGVTVDREDW
HWDQVPDHLKITFRVVDDKNKKLKEGRSLQDLKDALKGKVQETLSAVADDGIEQSGLHIWSFGQLPESYEQKRGNYKVKA
WPALVDERDSVAIKLFDNPLEQKQAMWNGLRRLLLLNIPSPIKYLHEKLPNKAKLGLYFNPYGKVLELIDDCISCGVDKL
IDANGGPVWTEEGFAALHEKVRAELNDTVVDIAKQVEQILTAVFNINKRLKGRVDMTMALGLSDIKAQMGGLVYRGFVTG
NGFKRLGDTLRYLQAIEKRLEKLAVDPHRDRAQMLKVENVQQAWQQWINKLPPARREDEDVKEIRWMIEELRVSYFAQQL
GTPYPISDKRILQAMEQISG
>Q9F0B1 ~~~hrpA~~~Hrp pili protein HrpA~~~
MNIMSSLTNAGRGVVNTVGGAAQGINSVKSSADRNIALTKNTGSTDSIDATRSSISKGDAKSAELDGTANEENGLLRETS
MLAGFEDKKEALSNQIVASKIRNSVVQF
>Q52473 ~~~hrpA~~~Hrp pili protein HrpA~~~
MVAFAGLTSKLTNLGNSAVGGVGGALQGVNTVASNATLQKNILLGTGDSLSVDAQAKASKESDANGAKLIAMQAQETMKK
QTMDVLNAIQAGKEDSTNKKISATATNAKGISY
>P37024 3.6.4.13~~~hrpB~~~ATP-dependent RNA helicase HrpB~~~COG1643
MSSLPVAAVLPELLTALDCAPQVLLSAPTGAGKSTWLPLQLLAHPGINGKIILLEPRRLAARNVAQRLAELLNEKPGDTV
GYRMRAQNCVGPNTRLEVVTEGVLTRMIQRDPELSGVGLVILDEFHERSLQADLALALLLDVQQGLRDDLKLLIMSATLD
NDRLQQMLPEAPVVISEGRSFPVERRYLPLPAHQRFDDAVAVATAEMLRQESGSLLLFLPGVGEIQRVQEQLASRIGSDV
LLCPLYGALSLNDQRKAILPAPQGMRKVVLATNIAETSLTIEGIRLVVDCAQERVARFDPRTGLTRLITQRVSQASMTQR
AGRAGRLEPGISLHLIAKEQAERAAAQSEPEILQSDLSGLLMELLQWGCSDPAQMSWLDQPPVVNLLAAKRLLQMLGALE
GERLSAQGQKMAALGNDPRLAAMLVSAKNDDEAATAAKIAAILEEPPRMGNSDLGVAFSRNQPAWQQRSQQLLKRLNVRG
GEADSSLIAPLLAGAFADRIARRRGQDGRYQLANGMGAMLDANDALSRHEWLIAPLLLQGSASPDARILLALLVDIDELV
QRCPQLVQQSDTVEWDDAQGTLKAWRRLQIGQLTVKVQPLAKPSEDELHQAMLNGIRDKGLSVLNWTAEAEQLRLRLLCA
AKWLPEYDWPAVDDESLLAALETWLLPHMTGVHSLRGLKSLDIYQALRGLLDWGMQQRLDSELPAHYTVPTGSRIAIRYH
EDNPPALAVRMQEMFGEATNPTIAQGRVPLVLELLSPAQRPLQITRDLSDFWKGAYREVQKEMKGRYPKHVWPDDPANTA
PTRRTKKYS
>Q01099 ~~~hrpN~~~Harpin HrpN~~~
MSLNTSGLGASTMQISIGGAGGNNGLLGTSRQNAGLGGNSALGLGGGNQNDTVNQLAGLLTGMMMMMSMMGGGGLMGGGL
GGGLGNGLGGSGGLGEGLSNALNDMLGGSLNTLGSKGGNNTTSTTNSPLDQALGINSTSQNDDSTSGTDSTSDSSDPMQQ
LLKMFSEIMQSLFGDGQDGTQGSSSGGKQPTEGEQNAYKKGVTDALSGLMGNGLSQLLGNGGLGGGQGGNAGTGLDGSSL
GGKGLQNLSGPVDYQQLGNAVGTGIGMKAGIQALNDIGTHSDSSTRSFVNKGDRAMAKEIGQFMDQYPEVFGKPQYQKGP
GQEVKTDDKSWAKALSKPDDDGMTPASMEQFNKAKGMIKSAMAGDTGNGNLQARGAGGSSLGIDAMMAGDAINNMALGKL
GAA
>O87653 ~~~hrpZ~~~Harpin HrpZ~~~
MQSLSLNSSSLQTPAMALVLVRPETETTGASTSSRALQEVVVKLAEELMRNGQLDDSSPLGKLLAKSMAADGKAGGGIED
VIAALDKLIHEKLGDNFGASADNASGTGQQDLMTQVLSGLAKSMLDDLLTKQDGGSSFSEDDMPMLSKIAQFMDDNPAQF
PKPDSGSWVNELKEDNFLDGDKTAAFRSALDIIGQQLGNQQSDAGGLAGTGGGLGTPSSFSNNSSVTGDPLIGANTGPGD
SGNSSSEAGQLIGEFIDRGLQSVLAGGGLGTPVNTPQTGTAANGGQSAQDLDQLLGGLLLKGLEATLKDAGQTATDVQSS
AAQIATLLVSTLLQGTRNQAAA
>Q52481 ~~~hrpZ~~~Harpin HrpZ~~~
MQSLSLNSSTLQSPSMALVLIRPETETTGPSTSSRALQEVIAQLAQELTHNGQLDESSPLGKLLGKAMAASGKAGGGLED
IKAALDTLIHEKLGDNFGASADNASDTGQHDLMTQVLNGLAKSMLNDLLTKQDDGTRFSEDDMPMLKKIAEFMDDNPAQF
PKPDSGSWVNELKEDNFLDGDETAQFRSALDIIGQQLGSQQNAAGGLAGDSSGGGLGSPVSNTENSPGSLGDPLIDANTG
PASNSNSNGDVGQLIGELIDRGLQSVLAGGGLGTPVSTANTALVPGGEQPNQDLGQLLGGLLQKGLEATLQDAGQTGTGV
QSSTAQVALLLVNMLLQSTKNQAAA
>Q9F0B0 ~~~hrpZ~~~Harpin HrpZ~~~
MQSLSLNSSTLQSPSMALVLIRPETETTGSSTSSRALQEVIAQLAQELTHNGQLDESSPLGKLLGKAMAASGKAGGGLED
IKAALDTLIHEKLGDNFGASADNASDTGQHDLMTQVLNGLAKSMLNDLLTKQDDGTRFSEDDMPMLKKIAEFMDDNPAQF
PKPDSGSWVNELKEDNFLDGDETAQFRSALDIIGQQLGSQQNAAGGLAGDSSGGGLGSPVSNTENSPGSLGDPLIDANTG
PASNSNSNGDVGQLIGELIDRGLQSVLAGGGLGTPVSTANTALVPGGEQPNQDLGQLLGGLLQKGLEATLQDAGQTGTGV
QSSAAQVALLLVNMLLQSTKNQAAA
>Q887C6 ~~~hrpZ~~~Harpin HrpZ~~~
MQALNSISSLQTSASLFPVSLNSDVSANTSTSSKELKAVIDQLVQALTQSGQLDETSPLGKMLAKAMAADGKSANSIDDI
TASLDKLIHEKLGDNFGASAGIGAGGGGGGIGGAGSGSGVGGGLSSDAGAGQSDLMSQVLNGLGKAVLDDLLTPSGEGGT
TFSSDDMPTLEKVAQFMDDNKAQFPTRDGGSWMNELKEDNGLDAQETAQFRSALDVIGQQLGQQQGDASGVTSGGGLGSP
VSDSSLGNPAIDANTGPAANGNASVDVGQLIGQLIDRGLQSVSSGGGLGTPVDNSTQPTGGTPAANPTGNVSNQDLGQLL
SGLLQRGLEATLQDAGNTGADLQSSAAQVAAQLINALLQGTNNQTNQAVA
>P35674 ~~~hrpZ~~~Harpin HrpZ~~~
MQSLSLNSSSLQTPAMALVLVRPEAETTGSTSSKALQEVVVKLAEELMRNGQLDDSSPLGKLLAKSMAADGKAGGGIEDV
IAALDKLIHEKLGDNFGASADSASGTGQQDLMTQVLNGLAKSMLDDLLTKQDGGTSFSEDDMPMLNKIAQFMDDNPAQFP
KPDSGSWVNELKEDNFLDGDETAAFRSALDIIGQQLGNQQSDAGSLAGTGGGLGTPSSFSNNSSVMGDPLIDANTGPGDS
GNTRGEAGQLIGELIDRGLQSVLAGGGLGTPVNTPQTGTSANGGQSAQDLDQLLGGLLLKGLEATLKDAGQTGTDVQSSA
AQIATLLVSTLLQGTRNQAAA
>A6QJK1 7.6.2.-~~~hrtA~~~Putative hemin import ATP-binding protein HrtA~~~
MALVVEDIVKNFGEGLSETKVLKGINFEVEQGEFVILNGASGSGKTTLLTILGGLLSQTSGTVLYNDAPLFDKQHRPSDL
RLEDIGFIFQSSHLVPYLKVIEQLTLVGQEAGMTKQQSSTRAIQLLKNIGLEDRLNVYPHQLSGGEKQRVAIMRAFMNNP
KIILADEPTASLDADRATKVVEMIRQQIKEQQMIGIMITHDRRLFEYADRVIELEDGKITD
>Q99RR8 7.6.2.-~~~hrtA~~~Putative hemin import ATP-binding protein HrtA~~~
MALVVKDIVKNFGEGLSETKVLKGINFEVEQGEFVILNGASGSGKTTLLTILGGLLSQTSGTVLYNDAPLFDKQHRPSDL
RLEDIGFIFQSSHLVPYLKVIEQLTLVGQEAGMTKQQSSTRAIQLLKNIGLEDRLNVYPHQLSGGEKQRVAIMRAFMNNP
KIILADEPTASLDADRATKVVEMIRQQIKEQQMIGIMITHDRRLFEYADRVIELEDGKITD
>P9WJA1 1.14.14.12~~~hsaA~~~Flavin-dependent monooxygenase, oxygenase subunit HsaA~~~COG1960
MTSIQQRDAQSVLAAIDNLLPEIRDRAQATEDLRRLPDETVKALDDVGFFTLLQPQQWGGLQCDPALFFEATRRLASVCG
STGWVSSIVGVHNWHLALFDQRAQEEVWGEDPSTRISSSYAPMGAGVVVDGGYLVNGSWNWSSGCDHASWTFVGGPVIKD
GRPVDFGSFLIPRSEYEIKDVWYVVGLRGTGSNTLVVKDVFVPRHRFLSYKAMNDHTAGGLATNSAPVYKMPWGTMHPTT
ISAPIVGMAYGAYAAHVEHQGKRVRAAFAGEKAKDDPFAKVRIAEAASDIDAAWRQLIGNVSDEYALLAAGKEIPFELRA
RARRDQVRATGRSIASIDRLFEASGATALSNEAPIQRFWRDAHAGRVHAANDPERAYVIFGNHEFGLPPGDTMV
>Q0S811 1.14.14.12~~~hsaA~~~Flavin-dependent monooxygenase, oxygenase subunit HsaA~~~COG1960
MQRLDALLPTLRERAQETEDLRRIPDDSMKALQETGFFRLLQPEQWGGYQADPVLFYSAVRKIASACGSTGWVSSIIGVH
NWHLALFSQQAQEDVWGNDTDVRISSSYAPMGAGQVVDGGYTVNGAWAWSSGCDHASWAVLGGPVIKDGRPVDFVSFLIP
REDYRIDDVWNVVGLRGTGSNTVVVEDVFVPTHRVLSFKAMSNLTAPGLERNTAPVYKMPWGTIHPTTISAPIVGMAYGA
YDAHVEHQGKRVRAAFAGEKAKDDPFAKVRIAEASSDIDAAWRQLSGNVADEYALLVAGEEVPFELRLRARRDQVRATGR
AISSIDKLFESSGATALANGTPLQRFWRDAHAGRVHAANDPERAYVMYGTGEFGLPITDTMV
>P9WND9 1.5.1.36~~~hsaB~~~Flavin-dependent monooxygenase, reductase subunit HsaB~~~COG1853
MSAQIDPRTFRSVLGQFCTGITVITTVHDDVPVGFACQSFAALSLEPPLVLFCPTKVSRSWQAIEASGRFCVNVLTEKQK
DVSARFGSKEPDKFAGIDWRPSELGSPIIEGSLAYIDCTVASVHDGGDHFVVFGAVESLSEVPAVKPRPLLFYRGDYTGI
EPEKTTPAHWRDDLEAFLITTTQDTWL
>Q0S808 1.5.1.36~~~hsaB~~~Flavin-dependent monooxygenase, reductase subunit HsaB~~~COG1853
MSEVTGDGAVAAEAIDPRRFRTVLGQFCTGVTIITTIDDGVPVGFACQSFAALSLEPPLVLFCPTKTSRSWAAIERSGIF
CVNVLAEEQQSTCARFGSRDPDKFAGIDWTESPLGSPILTGSLAHIDCSLESVHDGGDHWVAFGRVSSLSEIREERPLLF
YRGQYTGIEPDKTVPAPWRDDLEAFLTTSSEDTWL
>P9WNW7 1.13.11.25~~~hsaC~~~Iron-dependent extradiol dioxygenase~~~COG0346
MSIRSLGYLRIEATDMAAWREYGLKVLGMVEGKGAPEGALYLRMDDFPARLVVVPGEHDRLLEAGWECANAEGLQEIRNR
LDLEGTPYKEATAAELADRRVDEMIRFADPSGNCLEVFHGTALEHRRVVSPYGHRFVTGEQGMGHVVLSTRDDAEALHFY
RDVLGFRLRDSMRLPPQMVGRPADGPPAWLRFFGCNPRHHSLAFLPMPTSSGIVHLMVEVEQADDVGLCLDRALRRKVPM
SATLGRHVNDLMLSFYMKTPGGFDIEFGCEGRQVDDRDWIARESTAVSLWGHDFTVGARG
>Q9KWQ5 1.13.11.25~~~hsaC~~~Iron-dependent extradiol dioxygenase~~~COG0346
MSIRSLAYMRIEATDMSAWREYGLKVLGMVEGKGSDPDALYLRMDDFPARLVIFPGEHDRLSVSGWETANAAELQEVRDN
LSAAGVAFKEGTAEQLQDRRVDELITFEDPSGNTLEAFHGAALEHRRVVSPYGHKFVTGEQGLGHVVLSTTDDEASLRFY
RDVLGFRLRDSMRLPPQLVGRPADGKPAWLRFFGCNPRHHSLAFLPMPTPSGIVHLMIEVENSDDVGLCLDRALRKKVKM
SATLGRHVNDLMLSFYMKTPGGFDIEFGCEGRQVEDESWIARESTAVSLWGHDFSVGMQP
>P9WNH5 3.7.1.17~~~hsaD~~~4,5:9,10-diseco-3-hydroxy-5,9,17-trioxoandrosta-1(10),2-diene-4-oate hydrolase~~~COG2267
MTATEELTFESTSRFAEVDVDGPLKLHYHEAGVGNDQTVVLLHGGGPGAASWTNFSRNIAVLARHFHVLAVDQPGYGHSD
KRAEHGQFNRYAAMALKGLFDQLGLGRVPLVGNSLGGGTAVRFALDYPARAGRLVLMGPGGLSINLFAPDPTEGVKRLSK
FSVAPTRENLEAFLRVMVYDKNLITPELVDQRFALASTPESLTATRAMGKSFAGADFEAGMMWREVYRLRQPVLLIWGRE
DRVNPLDGALVALKTIPRAQLHVFGQCGHWVQVEKFDEFNKLTIEFLGGGR
>Q9KWQ6 3.7.1.17~~~hsaD~~~4,5:9,10-diseco-3-hydroxy-5,9,17-trioxoandrosta-1(10),2-diene-4-oate hydrolase~~~COG2267
MTTTEEALTFESTSKFAQVRPHLKLHYHEAGVGNDTTIVLLHGGGPGASSWSNFARNIPVLAEKFHVLAVDQPGYGLSDK
PTEHPQYFVHSASALKDLLDTLGVGGRVHLLGNSLGGGAAVRFALDYPDRAGRLVLMGPGGLSVNLFAPDPTEGVKNLGK
FGYQPTRENLEAFLRIMVFDQKLITDELIDERFAAASTPESLAAAKAMGKSFSSADFELGMLWRDAYKLRQRVLLIWGRE
DRVNPLDGALVALKMIPRAQLHVFGGCGHWAQLEKFDEFNRLATDFLLDGGK
>A8AWU7 ~~~hsa~~~Streptococcal hemagglutinin~~~COG3147
MFFKRQKGQYHEVERVTRFKLIKSGKHWLRAATSQFGLLRLMKGADISSVEVKVAEEQSVEKGGLNYLKGIIATGAVLGG
AVVTSSSVYAEEEQALEKVIDTRDVLATRGEAVLSEEAATTLSSEGANPVESLSDTLSASESASANSVSTSISISESFSV
SASASLSSSSSLSQSSSESASASESLSVSASTSQSFSSTTSSTQSSNNESLISSDSSNSLNTNQSVSARNQNARVRTRRA
VAANDTEAPQVKSGDYVVYRGESFEYYAEITDNSGQVNRVVIRNVEGGANSTYLSPNWVKYSTENLGRPGNATVQNPLRT
RIFGEVPLNEIVNEKSYYTRYIVAWDPSGNATQMVDNANRNGLERFVLTVKSQNEKYDPADPSVTYVNNLSNLSTSEREA
VAAAVRAANPNIPPTAKITVSQNGTVTITYPDKSTDTIPANRVVKDLQISKSNSASQSSSVSASQSASTSVSASISASMS
ASVSVSTSASTSASVSASESASTSASVSASESASTSASVSASKSSSTSASVSASESASTSASVSASESASTSASVSASES
ASTSASVSASTSASTSASVSASESASTSASVSASESASTSASVSASESASTSASVSASESASTSASVSASESSSTSASVS
ASESASTSASVSASESASTSASVSASTSASTSASVSASTSASTSASVSASTSASTSASVSASESASTSASVSASESASTS
ASVSASTSASTSASVSASTSASTSASVSASESASTSASVSASTSASTSASVSASESASTSASVSASTSASTSASVSASES
ASTSASVSASESASTSASVSASTSASTSASVSASESASTSASVSASESASTSASVSASESASTSASVSASTSASTSASVS
ASESASTSASVSASESASTSASVSASESASTSASVSASESASTSASVSASTSASTSASVSASESASTSASVSASESASTS
ASVSASESASTSASVSASESASTSASISASESASTSASVSASESASTSASVSASTSASTSASVSASESASTSASVSASES
ASTSASVSASESASTSASVSASESASTSASVSASESASTSASVSASTSASTSASVSASESSSTSASVSASESASTSSSVS
ASESASTSASVSASESASTSASVSASESASTSASVSASESASTSASVSASESASTSASVSASESASTSASVSASESASTS
ASVSASTSASTSASVSASESASTSASVSASESASTSASVSASTSASTSASVSASESASTSASVSASESASTSASVSASES
ASTSASVSASTSASTSASVSASESASTSASVSASESASMSASVSASESASTSASVSASESASTSASVSASESASTSASVS
ASESASTSASVSASESASTSASVSASESASTSASVSASESAYTSASASASESASTSASISASESASTSASVSASESAYTS
ASVSASESGSTSASVSASESASTSASVSASESASTSASVSASTSASTSASVSASESSSTSASVSASESASTSASVSASES
ASTSASVSASTSASTSASVSASESASTSASVSASESASTSASVSASESASTSASVSTSESASTSASVSASESASTSASVS
ASESASTSASVSASESSSTSASVSASESASTSASVSASESASTSASVSASESASTSASVSASESASTSVSVSASESASTS
ASVSASESASSSASVSASKSASMSASVLASESASTSASVSASESASTSASVSASESASTSASVSASESASTSASVSASES
ASTSASVSASESASTSASVSASESASTSASVSASTSASTSASVSASESASTSASVSASESASTSASVSASESASTSASVS
ASESVSANESASTSASVSASTSASTSASVSSSESASTSASVSASESASTSASVSASESASTSASVSASESASISASISAS
ESSSTSASVSASESASTSASVSASTSTSTSASVSASESASTSASVFASESASTSASVSASESASTSASVSASTSASTSAS
VSASESASTSASISASESASTSASISASESSSTSASVSASTSASTSASVSASESTSTSVSISASESVSISTSVSQSMSVS
ESLSLSVSTSTLHSQLNGIYESELNSLSLSESLSMSQSLSQSLSDSQSTSATQSMHDRISKGQLPRTGESESKASILALG
IGALGLAFKKRKKNESED
>P0A6Z1 ~~~hscA~~~Chaperone protein HscA~~~COG0443
MALLQISEPGLSAAPHQRRLAAGIDLGTTNSLVATVRSGQAETLADHEGRHLLPSVVHYQQQGHSVGYDARTNAALDTAN
TISSVKRLMGRSLADIQQRYPHLPYQFQASENGLPMIETAAGLLNPVRVSADILKALAARATEALAGELDGVVITVPAYF
DDAQRQGTKDAARLAGLHVLRLLNEPTAAAIAYGLDSGQEGVIAVYDLGGGTFDISILRLSRGVFEVLATGGDSALGGDD
FDHLLADYIREQAGIPDRSDNRVQRELLDAAIAAKIALSDADSVTVNVAGWQGEISREQFNELIAPLVKRTLLACRRALK
DAGVEADEVLEVVMVGGSTRVPLVRERVGEFFGRPPLTSIDPDKVVAIGAAIQADILVGNKPDSEMLLLDVIPLSLGLET
MGGLVEKVIPRNTTIPVARAQDFTTFKDGQTAMSIHVMQGERELVQDCRSLARFALRGIPALPAGGAHIRVTFQVDADGL
LSVTAMEKSTGVEASIQVKPSYGLTDSEIASMIKDSMSYAEQDVKARMLAEQKVEAARVLESLHGALAADAALLSAAERQ
VIDDAAAHLSEVAQGDDVDAIEQAIKNVDKQTQDFAARRMDQSVRRALKGHSVDEV
>P0A6L9 ~~~hscB~~~Co-chaperone protein HscB~~~COG1076
MDYFTLFGLPARYQLDTQALSLRFQDLQRQYHPDKFASGSQAEQLAAVQQSATINQAWQTLRHPLMRAEYLLSLHGFDLA
SEQHTVRDTAFLMEQLELREELDEIEQAKDEARLESFIKRVKKMFDTRHQLMVEQLDNETWDAAADTVRKLRFLDKLRSS
AEQLEEKLLDF
>Q9KTX9 ~~~hscB~~~Co-chaperone protein HscB homolog~~~COG1076
MNYFELFGLPIQFELDGSLLSSQFRALQKRFHPDNFATASERDRLMAVQQAAQINDAYQTLKDPLRRAEYLLSLQGIEMN
AEQQTLQDPMFLMEQMELREELESVTACADPEAALVAFDTKVTAMQRHYLAQLQGQLAQSEWLAAADQIRKLKFIAKLKN
EVERVEDQLLG
>P77319 ~~~hscC~~~Chaperone protein HscC~~~COG0443
MDNAELAIGIDLGTTNSLIAVWKDGAAQLIPNKFGEYLTPSIISMDENNHILVGKPAVSRRTSHPDKTAALFKRAMGSNT
NWRLGSDTFNAPELSSLVLRSLKEDAEEFLQRPIKDVVISVPAYFSDEQRKHTRLAAELAGLNAVRLINEPTAAAMAYGL
HTQQNTRSLVFDLGGGTFDVTVLEYATPVIEVHASAGDNFLGGEDFTHMLVDEVLKRADVARTTLNESELAALYACVEAA
KCSNQSPLHIRWQYQEETRECEFYENELEDLWLPLLNRLRVPIEQALRDARLKPSQIDSLVLVGGASQMPLVQRIAVRLF
GKLPYQSYDPSTIVALGAAIQAACRLRSEDIEEVILTDICPYSLGVEVNRQGVSGIFSPIIERNTTVPVSRVETYSTMHP
EQDSITVNVYQGENHKVKNNILVESFDVPLKKTGAYQSIDIRFSYDINGLLEVDVLLEDGSVKSRVINHSPVTLSAQQIE
ESRTRLSALKIYPRDMLINRTFKAKLEELWARALGDEREEIGRVITDFDAALQSNDMARVDEVRRRASDYLAIEIP
>G9FRD6 1.1.1.201~~~hdhb~~~7beta-hydroxysteroid dehydrogenase~~~
MNFREKYGQWGIVLGATEGIGKASAFELAKRGMDVILVGRRKEALEELAKAIHEETGKEIRVLPQDLSEYDAAERLIEAT
KDLDMGVIEYVACLHAMGQYNKVDYAKYEQMYRVNIRTFSKLLHHYIGEFKERDRGAFITIGSLSGWTSLPFCAEYAAEK
AYMMTVTEGVAYECANTNVDVMLLSAGSTITPTWLKNKPSDPKAVAAAMYPEDVIKDGFEQLGKKFTYLAGELNREKMKE
NNAMDRNDLIAKLGKMFDHMA
>A4ECA9 1.1.1.201~~~~~~7beta-hydroxysteroid dehydrogenase~~~
MNLREKYGEWGLILGATEGVGKAFCEKIAAGGMNVVMVGRREEKLNVLAGEIRETYGVETKVVRADFSQPGAAETVFAAT
EGLDMGFMSYVACLHSFGKIQDTPWEKHEAMINVNVVTFLKCFHHYMRIFAAQDRGAVINVSSMTGISSSPWNGQYGAGK
AFILKMTEAVACECEGTGVDVEVITLGTTLTPSLLSNLPGGPQGEAVMKIALTPEECVDEAFEKLGKELSVIAGQRNKDS
VHDWKANHTEDEYIRYMGSFYRD
>R9UAM1 1.1.1.201~~~~~~7beta-hydroxysteroid dehydrogenase~~~
MTLREKYGEWGIILGATEGVGKAFCERLAKEGMNVVMVGRREEKLKELGEELKNTYEIDYKVVKADFSLPDATDKIFAAT
ENLDMGFMAYVACLHSFGKIQDTPWEKHEAMINVNVVTFMKCFYHYMKIFAAQDRGAVINVSSMTGISSSPWNGQYGAGK
AFILKMTEAVACETEKTNVDVEVITLGTTLTPSLLSNLPGGPQGEAVMKTAQTPEEVVDEAFEKLGKELSVISGERNKAS
VHDWKANHTEDDYIRYMGSFYQE
>A7B4V1 1.1.1.201~~~~~~7beta-hydroxysteroid dehydrogenase~~~COG0300
MTLREKYGEWGIILGATEGVGKAFCERLAKEGMNVVMVGRREEKLKELGEELKNTYEIDYKVVKADFSLPDATDKIFAAT
ENLDMGFMAYVACLHSFGKIQDTPWEKHEAMINVNVVTFMKCFYHYMKIFAAQDRGAVINVSSMTGISSSPWNGQYGAGK
AFILKMTEAVACETEKTNVDVEVITLGTTLTPSLLSNLPGGPQGEAVMKTAQTPEEVVDEAFEKLGKELSVISGERNKAS
VHDWKANHTEDDYIRYMGSFYQE
>Q7A801 3.1.21.3~~~hsdR~~~Type I restriction enzyme SauN315I endonuclease subunit~~~
MAYQSEYALENEMMNQLEQLGYERVTIRDNKQLLDNFRTILNERHADKLEGNPLTDKEFQRLLTMIDGKSIFESARILRD
KLPLRRDDESEIYLSFLDKKSWCKNKFQVTNQVSVEDTYKARYDVTILINGLPLVQVELKRRGIDINEAFNQVKRYRKQN
YTGLFRYIQMFIISNGVETRYFSNNDSELLKSHMFYWSDKQNNRINTLQSFAESFMRPCQLAKMISRYMIINETDRILMA
MRPYQVYAVEALIQQATETGNNGYVWHTTGSGKTLTSFKASQILSQQDDIKKVIFLVDRKDLDSQTEEEFNKFAKGAVDK
TFNTSQLVRQLNDKSLPLIVTTIQKMAKAIQGNAHLLEQYKTNKVVFIIDECHRSQFGDMHRLVKQHFKNAQYFGFTGTP
RFPENSSQDGRTTADIFGRCLHTYLIRDAIHDGNVLGFSVDYINTFKNKALKAEDNSMVEAIDTEEVWLADKRVELVTRH
IINNHDKYTRNRQYSSIFTVQSIHALIKYYETFKRLNKKLEQPLTIAGIFTFKPNEDDRDGEVPYHSREKLEIMISDYNK
KFETNFSTDTTNEYFNHISKNVKKGVKDSKIDILIVVNMFLTGFDSKVLNTLYVDKNLMYHDLIQAYSRTNRVEKESKPF
GKIVNYRDLKKETDDALRVFSQTNDTDTILMRSYEEYKKEFMDAYRELKMIVPTPHMVDDIQDEEELKRFVEAYRLLAKI
ILRLKAFDEFEFTIDEIGMDEQENEDYKSKYLAVYDQVKRATAEKNKVSILNDIDFEIEMMRNDTINVNYIMNILRQIDL
EDKAEQRRNQEQIRRILDHADDPTLRLKRDLIREFIDNVVPSLNKDDDIDQEYVNFESIKKEAEFKGFAGERSIDEQALK
TISNDYQYSGVVNPHHLKKMIGDLPLKEKRKARKAIESFVAETTEKYGV
>P9WGT1 1.1.1.53~~~fabG3~~~3-alpha-(or 20-beta)-hydroxysteroid dehydrogenase~~~COG1028
MSGRLIGKVALVSGGARGMGASHVRAMVAEGAKVVFGDILDEEGKAVAAELADAARYVHLDVTQPAQWTAAVDTAVTAFG
GLHVLVNNAGILNIGTIEDYALTEWQRILDVNLTGVFLGIRAVVKPMKEAGRGSIINISSIEGLAGTVACHGYTATKFAV
RGLTKSTALELGPSGIRVNSIHPGLVKTPMTDWVPEDIFQTALGRAAEPVEVSNLVVYLASDESSYSTGAEFVVDGGTVA
GLAHNDFGAVEVSSQPEWVT
>P19992 1.1.1.53~~~~~~3-alpha-(or 20-beta)-hydroxysteroid dehydrogenase~~~
MNDLSGKTVIITGGARGLGAEAARQAVAAGARVVLADVLDEEGAATARELGDAARYQHLDVTIEEDWQRVVAYAREEFGS
VDGLVNNAGISTGMFLETESVERFRKVVDINLTGVFIGMKTVIPAMKDAGGGSIVNISSAAGLMGLALTSSYGASKWGVR
GLSKLAAVELGTDRIRVNSVHPGMTYTPMTAETGIRQGEGNYPNTPMGRVGNEPGEIAGAVVKLLSDTSSYVTGAELAVD
GGWTTGPTVKYVMGQ
>P52644 ~~~hslJ~~~Heat shock protein HslJ~~~COG3187
MKKVAAFVALSLLMAGCVSNDKIAVTPEQLQHHRFVLESVNGKPVTSDKNPPEISFGEKMMISGSMCNRFSGEGKLSNGE
LTAKGLAMTRMMCANPQLNELDNTISEMLKEGAQVDLTANQLTLATAKQTLTYKLADLMN
>P37565 ~~~hslO~~~33 kDa chaperonin~~~COG1281
MDYLVKALAYDGKVRAYAARTTDMVNEGQRRHGTWPTASAALGRTMTASLMLGAMLKGDDKLTVKIEGGGPIGAIVADAN
AKGEVRAYVSNPQVHFDLNEQGKLDVRRAVGTNGTLSVVKDLGLREFFTGQVEIVSGELGDDFTYYLVSSEQVPSSVGVG
VLVNPDNTILAAGGFIIQLMPGTDDETITKIEQRLSQVEPISKLIQKGLTPEEILEEVLGEKPEILETMPVRFHCPCSKE
RFETAILGLGKKEIQDMIEEDGQAEAVCHFCNEKYLFTKEELEGLRDQTTR
>P0A6Y5 ~~~hslO~~~33 kDa chaperonin~~~COG1281
MPQHDQLHRYLFENFAVRGELVTVSETLQQILENHDYPQPVKNVLAELLVATSLLTATLKFDGDITVQLQGDGPMNLAVI
NGNNNQQMRGVARVQGEIPENADLKTLVGNGYVVITITPSEGERYQGVVGLEGDTLAACLEDYFMRSEQLPTRLFIRTGD
VDGKPAAGGMLLQVMPAQNAQQDDFDHLATLTETIKTEELLTLPANEVLWRLYHEEEVTVYDPQDVEFKCTCSRERCADA
LKTLPDEEVDSILAEDGEIDMHCDYCGNHYLFNAMDIAEIRNNASPADPQVH
>P99082 ~~~hslO~~~33 kDa chaperonin~~~
MTHDYIVKALAFDGEIRAYAALTTETVQEAQTRHYTWPTASAAMGRTMTATAMMGAMLKGDQKLTVTVDGQGPIGRIIAD
ANAKGEVRAYVDHPQTHFPLNEQGKLDVRRAVGTNGSIIVVKDVGMKDYFSGASPIVSGELGEDFTYYYATSEQTPSSVG
LGVLVNPDNTIKAAGGFIIQVMPGAKDETISKLEKAISEMTPVSKLIEQGLTPEGLLNEILGEDHVQILEKMPVQFECNC
SHEKFLNAIKGLGEAEIQNMIKEDHGAEAVCHFCGNKYKYTEEELNVLLESLA
>Q9X1B4 ~~~hslO~~~33 kDa chaperonin~~~COG1281
MIYYGTMFDHKVRFSIVRMREVVEEARNRHALSYLATVVLGRALIGAALVTPWLAEKERWTLDIEGNGPIRRVVAQSTSE
FTVRGYVANPKVELPLNEKGKFDVAGAIGQGVLRVVRDLGLKTPFVSQVPLVSGEIAEDLAYYFAVSEQIPSAFSIGVLV
DSDGVKIAGGFAVQIIDRTLEQEKVEMIEKNIKNLPSISKLFQEAEPLDVLERIFGEKVGFVETAEIKYKCDCNREKAKN
ALLVLDKKELEDMRKEGKGEVVCKWCNTRYVFSEEELEELLKFKVDDSGS
>P0ACG8 ~~~hslR~~~Heat shock protein 15~~~COG1188
MKEKPAVEVRLDKWLWAARFYKTRALAREMIEGGKVHYNGQRSKPSKIVELNATLTLRQGNDERTVIVKAITEQRRPASE
AALLYEETAESVEKREKMALARKLNALTMPHPDRRPDKKERRDLLRFKHGDSE
>Q2YQZ4 ~~~hslU~~~ATP-dependent protease ATPase subunit HslU~~~
MSNFSPREIVSELDRFIIGQKDAKRAVAIALRNRWRRQQLEGQMREEVMPKNILMIGPTGVGKTEISRRLAKLAGAPFVK
VEATKFTEVGYVGRDVEQIIRDLVEIAITLVREKRREDVKAKAHLNAEERVLDALVGKTASPATRDSFRKKLRNGEMDDK
EIEIEVSDSGASPNFEIPGMPGANIGVLNISDMLGKAMGGRTKTRKTTVKDSYPILINDESDKLLDQDQIVQEALRVSED
EGIVFIDEIDKIAAREGGSGAGVSREGVQRDLLPLVEGTTVATKYGPVKTDHILFITSGAFHVSKPSDLLPELQGRLPIR
VELSALTREDFRRILTETEASLIKQYIALMETEEVKLEFSDDAIDALADIAVDLNATVENIGARRLQTVIEKVLDEISFT
APDKAGATFIIDAAYVKEKIGGLAKNTDLSRFIL
>P0A6H6 ~~~hslU~~~ATP-dependent protease ATPase subunit HslU~~~COG1220
MSEMTPREIVSELDKHIIGQDNAKRSVAIALRNRWRRMQLNEELRHEVTPKNILMIGPTGVGKTEIARRLAKLANAPFIK
VEATKFTEVGYVGKEVDSIIRDLTDAAVKMVRVQAIEKNRYRAEELAEERILDVLIPPAKNNWGQTEQQQEPSAARQAFR
KKLREGQLDDKEIEIDLAAAPMGVEIMAPPGMEEMTSQLQSMFQNLGGQKQKARKLKIKDAMKLLIEEEAAKLVNPEELK
QDAIDAVEQHGIVFIDEIDKICKRGESSGPDVSREGVQRDLLPLVEGCTVSTKHGMVKTDHILFIASGAFQIAKPSDLIP
ELQGRLPIRVELQALTTSDFERILTEPNASITVQYKALMATEGVNIEFTDSGIKRIAEAAWQVNESTENIGARRLHTVLE
RLMEEISYDASDLSGQNITIDADYVSKHLDALVADEDLSRFIL
>P0A6H5 ~~~hslU~~~ATP-dependent protease ATPase subunit HslU~~~COG1220
MSEMTPREIVSELDKHIIGQDNAKRSVAIALRNRWRRMQLNEELRHEVTPKNILMIGPTGVGKTEIARRLAKLANAPFIK
VEATKFTEVGYVGKEVDSIIRDLTDAAVKMVRVQAIEKNRYRAEELAEERILDVLIPPAKNNWGQTEQQQEPSAARQAFR
KKLREGQLDDKEIEIDLAAAPMGVEIMAPPGMEEMTSQLQSMFQNLGGQKQKARKLKIKDAMKLLIEEEAAKLVNPEELK
QDAIDAVEQHGIVFIDEIDKICKRGESSGPDVSREGVQRDLLPLVEGCTVSTKHGMVKTDHILFIASGAFQIAKPSDLIP
ELQGRLPIRVELQALTTSDFERILTEPNASITVQYKALMATEGVNIEFTDSGIKRIAEAAWQVNESTENIGARRLHTVLE
RLMEEISYDASDLSGQNITIDADYVSKHLDALVADEDLSRFIL
>P43773 ~~~hslU~~~ATP-dependent protease ATPase subunit HslU~~~COG1220
MSEMTPREIVSELDQHIIGQADAKRAVAIALRNRWRRMQLQEPLRHEVTPKNILMIGPTGVGKTEIARRLAKLANAPFIK
VEATKFTEVGYVGKEVDSIIRDLTDSAMKLVRQQEIAKNRARAEDVAEERILDALLPPAKNQWGEVENHDSHSSTRQAFR
KKLREGQLDDKEIEIDVSAGVSMGVEIMAPPGMEEMTNQLQSLFQNLGSDKTKKRKMKIKDALKALIDDEAAKLINPEEL
KQKAIDAVEQNGIVFIDEIDKICKKGEYSGADVSREGVQRDLLPLVEGSTVSTKHGMVKTDHILFIASGAFQVARPSDLI
PELQGRLPIRVELTALSAADFERILTEPHASLTEQYKALMATEGVNIAFTTDAVKKIAEAAFRVNEKTENIGARRLHTVM
ERLMDKISFSASDMNGQTVNIDAAYVADALGEVVENEDLSRFIL
>P63796 ~~~hslU~~~ATP-dependent protease ATPase subunit HslU~~~
MDTAGIRLTPKEIVSKLNEYIVGQNDAKRKVAIALRNRYRRSLLDEESKQEISPKNILMIGPTGVGKTEIARRMAKVVGA
PFIKVEATKFTELGYVGRDVESMVRDLVDVSVRLVKAQKKSLVQDEATAKANEKLVKLLVPSMKKKASQTNNPLESLFGG
AIPNFGQNNEDEEEPPTEEIKTKRSEIKRQLEEGKLEKEKVRIKVEQDPGALGMLGTNQNQQMQEMMNQLMPKKKVEREV
AVETARKILADSYADELIDQESANQEALELAEQMGIIFIDEIDKVATNNHNSGQDVSRQGVQRDILPILEGSVIQTKYGT
VNTEHMLFIGAGAFHVSKPSDLIPELQGRFPIRVELDSLSVEDFVRILTEPKLSLIKQYEALLQTEEVTVNFTDEAITRL
AEIAYQVNQDTDNIGARRLHTILEKMLEDLSFEAPSMPNAVVDITPQYVDDKLKSISTNKDLSAFIL
>P63797 ~~~hslU~~~ATP-dependent protease ATPase subunit HslU~~~
MDTAGIRLTPKEIVSKLNEYIVGQNDAKRKVAIALRNRYRRSLLDEESKQEISPKNILMIGPTGVGKTEIARRMAKVVGA
PFIKVEATKFTELGYVGRDVESMVRDLVDVSVRLVKAQKKSLVQDEATAKANEKLVKLLVPSMKKKASQTNNPLESLFGG
AIPNFGQNNEDEEEPPTEEIKTKRSEIKRQLEEGKLEKEKVRIKVEQDPGALGMLGTNQNQQMQEMMNQLMPKKKVEREV
AVETARKILADSYADELIDQESANQEALELAEQMGIIFIDEIDKVATNNHNSGQDVSRQGVQRDILPILEGSVIQTKYGT
VNTEHMLFIGAGAFHVSKPSDLIPELQGRFPIRVELDSLSVEDFVRILTEPKLSLIKQYEALLQTEEVTVNFTDEAITRL
AEIAYQVNQDTDNIGARRLHTILEKMLEDLSFEAPSMPNAVVDITPQYVDDKLKSISTNKDLSAFIL
>Q9WYZ2 ~~~hslU~~~ATP-dependent protease ATPase subunit HslU~~~COG1220
MKSFDEMTPKEIVQELDKYIVGQYEAKKAVAIAVRNRIRRQKLPEEWRKEVLPKNILMIGPTGVGKTEIARRLAQLSGSP
FLKVEATRFTEVGYVGKNVDSMIRDLVEISVNMVKQEKIKEVERQAEELVEERILDALVPESKAMPVVTNPFINLITGGQ
QQQYTPEDRRRFRAKREEMREKLRKGELEDEEIEIELEETVSPFMGIFGPGMEDLGIEITNMFSGMLPKRKKKRKMKVSE
ARKVLLPLEAEKLIDMDKVVQEALDRAQNRGIIFIDEIDKIAGKESAVGPDVSRQGVQRDLLPIVEGTTIMTKYGPVRTD
FILFIAAGAFHVSRPSDLIPELQGRFPIRVELSPLTEEDFVRILKEPENAIIKQYQALLSTEGVELVFTEDGIREMARIA
YQLNQRLENIGARRLYTVAEKVLEEISFEAPDIPEKRVVVDAEYVRRRLEKIVQDEDLSAYIL
>Q81WK5 3.4.25.2~~~hslV~~~ATP-dependent protease subunit HslV~~~COG5405
MGNFHATTIFAVHHNGECAMAGDGQVTMGNAVVMKHTARKVRKLFQGKVLAGFAGSVADAFTLFEMFEGKLEEYNGNLQR
AAVEMAKQWRGDKMLRQLEAMLIVMDKTTMLLVSGTGEVIEPDDGILAIGSGGNYALSAGRALKQYASEHLTAKQIAKAS
LEIAGDICVYTNHNIIVEEL
>B7LA29 3.4.25.2~~~hslV~~~ATP-dependent protease subunit HslV~~~
MTTIVSVRRNGHVVIAGDGQATLGNTVMKGNVKKVRRLYNDKVIAGFAGGTADAFTLFELFERKLEMHQGHLVKAAVELA
KDWRTDRMLRKLEALLAVADETASLIITGNGDVVQPENDLIAIGSGGPYAQAAARALLENTELSAREIAEKALDIAGDIC
IYTNHFHTIEELSYKA
>P0A7B8 3.4.25.2~~~hslV~~~ATP-dependent protease subunit HslV~~~COG5405
MTTIVSVRRNGHVVIAGDGQATLGNTVMKGNVKKVRRLYNDKVIAGFAGGTADAFTLFELFERKLEMHQGHLVKAAVELA
KDWRTDRMLRKLEALLAVADETASLIITGNGDVVQPENDLIAIGSGGPYAQAAARALLENTELSAREIAEKALDIAGDIC
IYTNHFHTIEELSYKA
>P43772 3.4.25.2~~~hslV~~~ATP-dependent protease subunit HslV~~~COG5405
MTTIVSVRRNGQVVVGGDGQVSLGNTVMKGNARKVRRLYNGKVLAGFAGGTADAFTLFELFERKLEMHQGHLLKSAVELA
KDWRTDRALRKLEAMLIVADEKESLIITGIGDVVQPEEDQILAIGSGGNYALSAARALVENTELSAHEIVEKSLRIAGDI
CVFTNTNFTIEELPN
>P65796 3.4.25.2~~~hslV~~~ATP-dependent protease subunit HslV~~~
MSNTTLHATTIYAVRHNGKAAMAGDGQVTLGQQVIMKQTARKVRRLYEGKVLAGFAGSVADAFTLFEKFETKLQQFSGNL
ERAAVELAQEWRGDKQLRQLEAMLIVMDKDAILVVSGTGEVIAPDDDLIAIGSGGNYALSAGRALKRHASHLSAEEMAYE
SLKVAADICVFTNDNIVVETL
>P65797 3.4.25.2~~~hslV~~~ATP-dependent protease subunit HslV~~~
MSNTTLHATTIYAVRHNGKAAMAGDGQVTLGQQVIMKQTARKVRRLYEGKVLAGFAGSVADAFTLFEKFETKLQQFSGNL
ERAAVELAQEWRGDKQLRQLEAMLIVMDKDAILVVSGTGEVIAPDDDLIAIGSGGNYALSAGRALKRHASHLSAEEMAYE
SLKVAADICVFTNDNIVVETL
>Q9WYZ1 3.4.25.2~~~hslV~~~ATP-dependent protease subunit HslV~~~COG5405
MKFHGTTILVVRRNGQTVMGGDGQVTFGSTVLKGNARKVRKLGEGKVLAGFAGSVADAMTLFDRFEAKLREWGGNLTKAA
VELAKDWRTDRVLRRLEALLLVADKENIFIISGNGEVIQPDDDAAAIGSGGPYALAAAKALLRNTDLSAREIVEKAMTIA
GEICIYTNQNIVIEEV
>P81958 ~~~~~~Probable Hsp20 family chaperone~~~COG0071
MLFSLINQNQDLLENLFEDFKTNSLTNNNNIMKTDIQEQDNQYFITIELPGFKKEDVKVALEEGYLVVEAKNSKKNQIKE
ANFIRKERFQGFLRRSFYLGDDFLLEDIKGSLEQGLLKLSVPKKEVKPKEKHYIKLN
>Q03928 ~~~~~~18 kDa heat shock protein~~~COG0071
MFGMVPFRRNNNGLMRREDFFDKMFDNFFSDDFFPTTTFNGNAGFKVDIKEDDDKYTVAADLPGVKKDNIELQYENNYLT
INAKRDDIVETKDDNNNFVRRERSYGELRRSFYVDNIDDSKIDASFLDGVLRITLPKKVKGKDNGRRIDIH
>Q53595 ~~~~~~18 kDa heat shock protein~~~
MLMRTDPFREFDRITRELTAPGTWSRPTAMPMDACREGDTYVVSFDLPGVDPEAIEIDIERNMLTVKAERGPAGNAEHVR
MEVAERPLGVFSRQLVLADTLDTEQVRADYDAGVLTLRIPIAERAKRRRVKVGQGESHRQITG
>B1N1A2 1.14.13.163~~~nicB~~~6-hydroxy-3-succinoylpyridine 3-monooxygenase HspA~~~
MQRKLDSEPLRTRIYIDGYNFYYGCLRGTPYKWLDLLPLFEKHILPSILVTDNHGQIRAWRLLESPSIKYFTAKIIESVA
RAGDSVSSQARYHTALRKLHDGRIELIEGYYAVNKMKVKIVDPENPDKAPRECREIQAWKVEEKQSDVNLALQAYHDSIT
GQVDHAVIVTNDTDIAPALQMIRAHTDVRIGVVVPTSGQNRSANTDLIKFAHWKREHINSGELAACQLPRVIPGRKPTIK
PESWYGQPELLQEILDLAIPVRGSRAAAFKWMEQPNQFLSGERPIELVETAEGATRVLQYIHSWIAQQEELP
>F8G0M4 1.14.13.163~~~hspB~~~6-hydroxy-3-succinoylpyridine 3-monooxygenase HspB~~~COG0654
MSMKQRVIIVGGGPVGLLTALGLAKAGTNVVVLEAESQPSDSPRALVYHFPVLPHLKRLGVLDDCVAAGLMRQNFAWRVH
STSEMIFWDLSCLEGDVELPYALHLGQDKLSRILIEHLKALPNVEVRYSSPVVDCEVGPRSVRVVLGGESPGVIVEGDWL
IGADGANSFVRREVLNQNFFGITWPQRYVATNTRFDFDKLGFGKTTMQVDDVYGSVICNIDADSLWRVTFMEDPNLPMEG
IRGRIDQVFKELLPTNDPYEVVAFSPYRMHQRVTDRMRNGRVILIGDAAHVTNPTGGLGLTGGMFDAFALTSVLNQVIHD
GRSEDILDVFEADRRRKFIELVSPRASDNLRNLYHQKPGEGKNDWVNNTRSISKDIDRMRDALRFPETMETFL
>P0AB20 ~~~hspQ~~~Heat shock protein HspQ~~~COG3785
MIASKFGIGQQVRHSLLGYLGVVVDIDPVYSLSEPSPDELAVNDELRAAPWYHVVMEDDNGLPVHTYLAEAQLSSELQDE
HPEQPSMDELAQTIRKQLQAPRLRN
>P40183 ~~~hspR~~~Putative heat shock protein HspR~~~COG0789
MDGRRRNPYELTEDTPVYVISVAAQLSGLHPQTLRQYDRLGLVSPDRTAGRGRRYSARDIELLRQVQQLSQDEGINLAGI
KRIIELENQVAELQARAAELAAALDGAATAMRQREAAVHASYRRDLVPYQEVQQTSALVVWRPSRRGQSSD
>P31474 ~~~hsrA~~~Probable transport protein HsrA~~~COG0477
MSDKKKRSMAGLPWIAAMAFFMQALDATILNTALPAIAHSLNRSPLAMQSAIISYTLTVAMLIPVSGWLADRFGTRRIFT
LAVSLFTLGSLACALSNSLPQLVVFRVIQGIGGAMMMPVARLALLRAYPRNELLPVLNFVAMPGLVGPILGPVLGGVLVT
WATWHWIFLINIPIGIAGLLYARKHMPNFTTARRRFDITGFLLFGLSLVLFSSGIELFGEKIVASWIALTVIVTSIGLLL
LYILHARRTPNPLISLDLFKTRTFSIGIVGNIATRLGTGCVPFLMPLMLQVGFGYQAFIAGCMMAPTALGSIIAKSMVTQ
VLRRLGYRHTLVGITVIIGLMIAQFSLQSPAMAIWMLILPLFILGMAMSTQFTAMNTITLADLTDDNASSGNSVLAVTQQ
LSISLGVAVSAAVLRVYEGMEGTTTVEQFHYTFITMGIITVASAAMFMLLKTTDGNNLIKRQRKSKPNRVPSESE
>A6QJK3 ~~~hssR~~~Heme response regulator HssR~~~
MVQCLVVDDDPRILNYIASHLQIEHIDAYTQPSGEAALKLLEKQRVDIAVVDIMMDGMDGFQLCNTLKNDYDIPVIMLTA
RDALSDKERAFISGTDDYVTKPFEVKELIFRIRAVLRRYNINSNSEMTIGNLTLNQSYLELQVSNKTMTLPNKEFQLLFM
LAARPKQIFTREQIIEKIWGYDYEGDERTVDVHIKRLRQRLKKLNATLTIETVRGQGYKVENHV
>A6QJK4 2.7.13.3~~~hssS~~~Heme sensor protein HssS~~~
MFKTLYARIAIYSITVILFSALISFVLTNVYYHYNLKASNDAKIMKTLKEARQYEQSAKPTHIQQYFKHLGQMNYQIMTI
DQKGHKTFYGEPFREDTLSQNAINNVLNNQDYHGIKDKPFALFVTGFFDNVTDNTVGINFKTKDGSIAVFMRPDIGETFS
EFRTFLAVLLMLLLFISISLVIASTYSIIRPVKKLKLATERLIDGDFETPIKQTRKDEIGTLQYHFNKMRESLGQVDQMR
QHFVQNVSHEIKTPLTHIHHLLSELQQTSDKTLRQQYINDIYTITTQLSGLTTELLLLSELDNHQHLLFDDKIQVNQLIK
DIIRHEQFAADEKSLIILADLESINFLGNQRLLHQALSNLLINAIKYTDVGGAIDIALQHSHNNIIFTISNDGSPISPQA
EARLFERFYKVSKHDNSNGLGLAITKSIIELHHGTIQFTQSNEYVTTFTITLPNNSL
>O32323 2.5.1.44~~~hss~~~Homospermidine synthase~~~
MTDWPVYHRIDGPIVMIGFGSIGRGTLPLIERHFAFDRSKLVVIDPSDEARKLAEARGVRFIQQAVTRDNYRELLVPLLT
AGPGQGFCVNLSVDTSSLDIMELARENGALYIDTVVEPWLGFYFDPDLKPEARSNYALRETVLAARRNKPGGTTAVSCCG
ANPGMVSWFVKQALVNLAADLGVTGEEPTTREEWARLAMDLGVKGIHIAERDTQRASFPKPFDVFVNTWSVEGFVSEGLQ
PAELGWGTFERWMPDNARGHDSGCGAGIYLLQPGANTRVRSWTPTAMAQYGFLVTHNESISIADFLTVRDAAGQAVYRPT
CHYAYHPCNDAVLSLHEMFGSGKRQSDWRILDETEIVDGIDELGVLLYGHGKNAYWYGSQLSIEETRRIAPDQNATGLQV
SSAVLAGMVWALENPNAGIVEADDLDFRRCLEVQTPYLGPVVGVYTDWTPLAGRPGLFPEDIDTSDPWQFRNVLVRD
>P01559 ~~~sta1~~~Heat-stable enterotoxin ST-IA/ST-P~~~
MKKLMLAIFISVLSFPSFSQSTESLDSSKEKITLETKKCDVVKNNSEKKSENMNNTFYCCELCCNPACAGCY
>P07965 ~~~sta3~~~Heat-stable enterotoxin A3/A4~~~
MKKSILFIFLSVLSFSPFAQDAKPVESSKEKITLESKKCNIAKKSNKSGPESMNSSNYCCELCCNPACTGCY
>P07593 ~~~ystA~~~Heat-stable enterotoxin A~~~
MKKIVFVLVLMLSSFGAFGQETVSGQFSDALSTPITAEVYKQACDPPLPPAEVSSDWDCCDVCCNPACAGC
>P74977 ~~~ystB~~~Heat-stable enterotoxin B~~~
MKKIILALVLMLFSFCTLGQETASMHLDDTLSAPIAAEINRKACDTQTPSPSEENDDWCCEVCCNPACAGC
>O50319 ~~~ystC~~~Heat-stable enterotoxin C~~~
MKKIVFVLTLMLFSFGTLGQETASGQVGDVSSSTIATEVSEAECGTQSATTQGENDWDWCCELCCNPACFGC
>P22542 ~~~stiI~~~Heat-stable enterotoxin II~~~
MKKNIAFLLASMFVFSIATNAYASTQSNKKDLCEHYRQIAKESCKKGFLGVRDGTAGACFGAQIMVAAKGC
>P0A4M3 ~~~stn~~~Heat-stable enterotoxin ST~~~
MRNLFIALMLLFSSIAFSQTVENNKKTVQQPQQIESKVNIKKLSENEECPFIKQVDENGNLIDCCEICCNPACFGCLN
>P0A4M4 ~~~stn~~~Heat-stable enterotoxin ST~~~
IDCCEICCNPACFGCLNDANGLINGDRPIRAQHVC
>O53664 4.2.1.-~~~htdX~~~3-hydroxyacyl-thioester dehydratase X~~~COG2030
MTQPSGLKNLLRAAAGALPVVPRTDQLPNRTVTVEELPIDPANVAAYAAVTGLRYGNQVPLTYPFALTFPSVMSLVTGFD
FPFAAMGAIHTENHITQYRPIAVTDAVGVRVRAENLREHRRGLLVDLVTNVSVGNDVAWHQVTTFLHQQRTSLSGEPKPP
PQKKPKLPPPAAVLRITPAKIRRYAAVGGDHNPIHTNPIAAKLFGFPTVIAHGMFTAAAVLANIEARFPDAVRYSVRFAK
PVLLPATAGLYVAEGDGGWDLTLRNMAKGYPHLTATVRGL
>I6YBZ8 4.2.1.-~~~htdY~~~3-hydroxyacyl-thioester dehydratase Y~~~COG2030
MAIDPNSIGAVTEPMLFEWTDRDTLLYAIGVGAGTGDLAFTTENSHGIDQQVLPTYAVICCPAFGAAAKVGTFNPAALLH
GSQGIRLHAPLPAAGKLSVVTEVADIQDKGEGKNAIVVLRGRGCDPESGSLVAETLTTLVLRGQGGFGGARGERPAAPEF
PDRHPDARIDMPTREDQALIYRLSGDRNPLHSDPWFATQLAGFPKPILHGLCTYGVAGRALVAELGGGVAANITSIAARF
TKPVFPGETLSTVIWRTEPGRAVFRTEVAGSDGAEARVVLDDGAVEYVAG
>P9WNP3 4.2.1.-~~~htdZ~~~3-hydroxyacyl-thioester dehydratase Z~~~COG2030
MRTFESVADLAAAAGEKVGQSDWVTITQEEVNLFADATGDHQWIHVDPERAAAGPFGTTIAHGFMTLALLPRLQHQMYTV
KGVKLAINYGLNKVRFPAPVPVGSRVRATSSLVGVEDLGNGTVQATVSTTVEVEGSAKPACVAESIVRYVA
>O53478 ~~~~~~HTH-type transcriptional regulator Rv2034~~~COG0640
MSTYRSPDRAWQALADGTRRAIVERLAHGPLAVGELARDLPVSRPAVSQHLKVLKTARLVCDRPAGTRRVYQLDPTGLAA
LRTDLDRFWTRALTGYAQLIDSEGDDT
>P9WMC3 ~~~~~~HTH-type transcriptional repressor Rv3405c~~~COG1309
MTTRPATDRRKMPTGREEVAAAILQAATDLFAERGPAATSIRDIAARSKVNHGLVFRHFGTKDQLVGAVLDHLGTKLTRL
LHSEAPADIIERALDRHGRVLARALLDGYPVGQLQQRFPNVAELLDAVRPRYDSDLGARLAVAHALALQFGWRLFAPMLR
SATGIDELTGDELRLSVNDAVARILEPH
>Q9ADP7 ~~~~~~HTH-type transcriptional repressor SCO4008~~~COG1309
MAARDPEATKARIFEAAVAEFARHGIAGARIDRIAAEARANKQLIYAYYGNKGELFASVLEKKMLDLAISVPVDPDDIEG
WIDRLLDYHAAHPELLRLLFWEGMEYGTAELPHEAERQEHYARKVAAVRDGQERGVITDAIPAPDLLFLLVAMANWAVVV
PQMKRILVGGGDAGTDGLRDSIKKAARRIVDR
>P9WME9 ~~~~~~HTH-type transcriptional repressor Rv2887~~~COG1846
MGLADDAPLGYLLYRVGAVLRPEVSAALSPLGLTLPEFVCLRMLSQSPGLSSAELARHASVTPQAMNTVLRKLEDAGAVA
RPASVSSGRSLPATLTARGRALAKRAEAVVRAADARVLARLTAPQQREFKRMLEKLGSD
>B1MCE2 2.1.1.374~~~htm~~~2-heptyl-1-hydroxyquinolin-4(1H)-one methyltransferase~~~
MTENLQDMFESSYRGEAPEQLAARPPWSIGQPQPEILKLIEQGKVHGDVLDAGCGEAATALYLAERGHTAVGLDAAPTAI
QLAKGYAAERGLTNVTFDVADISNFTGYDGRFGTIIDSTLFHSMPVELREGYQQSIVRAAAPGANYIVLVFDKAAFPPDI
DGPHPVSEPELREIVSKYWTVDDISPARLYANGDGFQDGGAQRFAEFREESNGWVSMAGWLLQAHRD
>A1KG37 2.1.1.374~~~htm~~~2-heptyl-1-hydroxyquinolin-4(1H)-one methyltransferase~~~
MSTVLTYIRAVDIYEHMTESLDLEFESAYRGESVAFGEGVRPPWSIGEPQPELAALIVQGKFRGDVLDVGCGEAAISLAL
AERGHTTVGLDLSPAAVELARHEAAKRGLANASFEVADASSFTGYDGRFDTIVDSTLFHSMPVESREGYLQSIVRAAAPG
ASYFVLVFDRAAIPEGPINAVTEDELRAAVSKYWIIDEIKPARLYARFPAGFAGMPALLDIREEPNGLQSIGGWLLSAHL
G
>A5TZU0 2.1.1.374~~~htm~~~2-heptyl-1-hydroxyquinolin-4(1H)-one methyltransferase~~~COG2226
MSTVLTYIRAVDIYEHMTESLDLEFESAYRGESVAFGEGVRPPWSIGEPQPELAALIVQGKFRGDVLDVGCGEAAISLAL
AERGHTTVGLDLSPAAVELARHEAAKRGLANASFEVADASSFTGYDGRFDTIVDSTLFHSMPVESREGYLQSIVRAAAPG
ASYFVLVFDRAAIPEGPINAVTEDELRAAVSKYWIIDEIKPARLYARFPAGFAGMPALLDIREEPNGLQSIGGWLLSAHL
G
>P9WKL5 2.1.1.374~~~htm~~~2-heptyl-1-hydroxyquinolin-4(1H)-one methyltransferase~~~COG2226
MSTVLTYIRAVDIYEHMTESLDLEFESAYRGESVAFGEGVRPPWSIGEPQPELAALIVQGKFRGDVLDVGCGEAAISLAL
AERGHTTVGLDLSPAAVELARHEAAKRGLANASFEVADASSFTGYDGRFDTIVDSTLFHSMPVESREGYLQSIVRAAAPG
ASYFVLVFDRAAIPEGPINAVTEDELRAAVSKYWIIDEIKPARLYARFPAGFAGMPALLDIREEPNGLQSIGGWLLSAHL
G
>A1B9Z3 2.6.1.77~~~hpa/tpa~~~Hypotaurine/taurine--pyruvate aminotransferase~~~COG0161
MTLDLNPNDMSHVVAADRAHVWHHLSQHKQYETIDPRVFVEGKGMRLWDATGREFLDAVSGGVWTVNVGYGRESIADAIR
DQLVKLNYYAGAAGTVPGAIFAQKLIEKMPGMTRVYYSNSGSEANEKVYKMVRQIAARHHGGKKWKILYRDRDYHGTTIA
TLATSGQDQRAIAYGPFPDGFVRVPHCLEYRKQWDVENYGERAADAIEEVILREGPDTVGAIVLEPVTAGGGVITPPEGY
WQRVQEICRKYDILLHIDEVVCGLGRTGTWFGYQQYGIEPDFVTMAKGVASGYAAISCTVTTERVFEMFKDAPEDGMSFF
RDISTFGGCTSGPVAAIENMRIIEDEGLLDNTVAMGERTLANLNALMEKHKVIGDVRGKGLFCGAELVADRASKEPMDEK
KVQAVVADCLAQGVIIGATNRSLPGFNNTLCLSPALIATADNIDRITDAIDNALTKVFA
>P42555 ~~~htpG~~~Chaperone protein HtpG~~~
MKKQFDTEVNDLLYLIIHSLYSHKEIFLRELISNASDAIDKLKFLSLTNEKFKNIALEPKIEISFDDKSILIKDNGIGMD
EQDLTNHLGVIAKSGTKEFINNLKQDEKKSASLIGQFGVGFYSAFIVSEKVEVTSKKALESDAYIWSSDGKTGYEIEKAK
KEESGTEIKLYLNKEGLEYANKWKIQEIIKKYSNHINYPIYIKYSEPIMKDGKQEGIEEKEEKLNETTALWTKNKSEIKA
EEYNEFYKNTTFDYENPLMHIHTKAEGNLEYTNLFYVPSKAPYDLYYPNTKPGVKLFINRIFITDSEGSLLPNYLRFIKG
IIDCQDLPLNVSREILQQNKILSKIKSSSVKKILSELEKLSKKNPEKFSEFSKEFGRCIKEGVYSDFENREKLISLIRFK
SSSVDGFVSFKEYKERMNESQKSIYYITGGKENILKENPIVAAYKEKGFEILIMDDELDEAILNLIPEYEGLKLKAINKN
ETSNELKDENFKKIEEEFKDTLTKVKEILKDHIKEVNLSATLIKEPSAIIIDSNDPTYQMQKIMLSMGQEVKEIKPILEL
NPNNKIVQNLKNLEPEKLEKISILLFEEAMLTSGMPSKNPGKFINIINEFIEKDFL
>Q83EL0 ~~~htpG~~~Chaperone protein HtpG~~~COG0326
MSLQPQAETLSFEAEVKQLLHLVAHSLYSNKEIFLRELISNSSDAADKLRYQALSDAALYENDADLKIWIDFDKDNRTIT
IRDNGIGMSREEVIENLGTIAKSGTRAFRELLAEKKAEDSQLIGQFGVGFYSAFIVADRVVVRTRRAGMKADQGVEWEST
GEGEYTLKNIDKPTRGTEVVLHLKESEEEFLDPLRLRAIITKYSDHILLPIVMKKIKTSGADDEDKNETPEEEVVNRANA
LWVLPKDKIKDEEYKELYKHIAHDFEDPLAWVHNKVEGKLEYTTLLYIPARAPFDLWNREGQRGLKLYVKRIFIMDDAEH
FMPMYLRFVKGIVDSNDLPLNISRELLQSNEVINKIKAGCVKRILSLLEDLAKNDKEKYASFWKAFGQVLKEGPAEDFAN
RDRIANLLRFASTHNDTDEQNVSLQDYISRMKPEQNKIYYIVADTYTSAKNSPLLEVFRKKDIEVLLMSDRVDEWLVAHL
NEFEGKSLQSIAKGTLDLGDLEKEEKVETEKFEKDFDELLKQFKEVLGEKIKDVRITHRLTDSPTCVVFDENEMSGHLQR
LLIQTGQDFMQAKPILEINPSHPLILRVKNESDKTRFNRWADLLLNQALLAEGEQLKDPASFVKGLNELLLDS
>Q728G0 ~~~htpG~~~Chaperone protein HtpG~~~COG0326
MATAPASHAFRTEVRKMLHIITHSLYTNREIFLRELVSNASDALDKLRFIRSRGDAVVAPDLAPGIDISVDKEARILTIA
DTGVGMTRQELMDNLGTIARSGSEQFVADLAAAENAKDADAASIIGRFGVGFYAVFMVADRVEVTSRSYIEGEAAHTWTS
DGLGEFTVEEATGDIPQRGTVIKAHLREDAAEFLEKYRIEGILRKHSQFISFPIRVDGEQVNTTPALWREPKFSITDEQY
ADFYKHLTFDTEAPLRTLHVSVDAPVQFTGLVFVPPHGQEVFSMGRDRWGLDLYVRRVLIQRENKDLLPEYLGFLKGIVD
TEDLPLNISRETLQENVVVRKIGQTLTKQVLADLARLAADDAEAYATFWRQHGKVFKLGYSDYANREKFAPLLRFNSSHH
DDAQGLTSLDDYISRAREGQKEIWYIAAPGREAARLDPRVEVFRRKGLEVLYLLEPIDEFVLETLDSYSDFSFKAVEHAD
GEKLAQFEDTGPARDVTPLTEDEDAAFARLIERMKALLGDAVEDVRISHRLADSPACLVQPGGASTSSMDRLLRVLHKDE
SVPRKVFEVNRDHPILRNLLKVFTSDASDPLVEDTTRQLFATSLMLDGYLKDPHELAAMMHRLMEKSGDWYKAVRGL
>P0A6Z3 ~~~htpG~~~Chaperone protein HtpG~~~COG0326
MKGQETRGFQSEVKQLLHLMIHSLYSNKEIFLRELISNASDAADKLRFRALSNPDLYEGDGELRVRVSFDKDKRTLTISD
NGVGMTRDEVIDHLGTIAKSGTKSFLESLGSDQAKDSQLIGQFGVGFYSAFIVADKVTVRTRAAGEKPENGVFWESAGEG
EYTVADITKEDRGTEITLHLREGEDEFLDDWRVRSIISKYSDHIALPVEIEKREEKDGETVISWEKINKAQALWTRNKSE
ITDEEYKEFYKHIAHDFNDPLTWSHNRVEGKQEYTSLLYIPSQAPWDMWNRDHKHGLKLYVQRVFIMDDAEQFMPNYLRF
VRGLIDSSDLPLNVSREILQDSTVTRNLRNALTKRVLQMLEKLAKDDAEKYQTFWQQFGLVLKEGPAEDFANQEAIAKLL
RFASTHTDSSAQTVSLEDYVSRMKEGQEKIYYITADSYAAAKSSPHLELLRKKGIEVLLLSDRIDEWMMNYLTEFDGKPF
QSVSKVDESLEKLADEVDESAKEAEKALTPFIDRVKALLGERVKDVRLTHRLTDTPAIVSTDADEMSTQMAKLFAAAGQK
VPEVKYIFELNPDHVLVKRAADTEDEAKFSEWVELLLDQALLAERGTLEDPNLFIRRMNQLLVS
>P9WMJ7 ~~~htpG~~~Chaperone protein HtpG~~~COG0326
MNAHVEQLEFQAEARQLLDLMVHSVYSNKDAFLRELISNASDALDKLRIEALRNKDLEVDTSDLHIEIDADKAARTLTVR
DNGIGMAREEVVDLIGTLAKSGTAELRAQLREAKNAAASEELIGQFGIGFYSSFMVADKVQLLTRKAGESAATRWESSGE
GTYTIESVEDAPQGTSVTLHLKPEDAEDDLHDYTSEWKIRNLVKKYSDFIAWPIRMDVERRTPASQEEGGEGGEETVTIE
TETLNSMKALWARPKEEVSEQEYKEFYKHVAHAWDDPLEIIAMKAEGTFEYQALLFIPSHAPFDLFDRDAHVGIQLYVKR
VFIMGDCDQLMPEYLRFVKGVVDAQDMSLNVSREILQQDRQIKAIRRRLTKKVLSTIKDVQSSRPEDYRTFWTQFGRVLK
EGLLSDIDNRETLLGISSFVSTYSEEEPTTLAEYVERMKDGQQQIFYATGETRQQLLKSPHLEAFKAKGYEVLLLTDPVD
EVWVGMVPEFDGKPLQSVAKGEVDLSSEEDTSEAEREERQKEFADLLTWLQETLSDHVKEVRLSTRLTESPACLITDAFG
MTPALARIYRASGQEVPVGKRILELNPSHPLVTGLRQAHQDRADDAEKSLAETAELLYGTALLAEGGALEDPARFAELLA
ERLARTL
>P22359 ~~~htpG~~~Chaperone protein HtpG~~~COG0326
MSETATTNKETRGFQSEVKQLLHLMIHSLYSNKEIFLRELISNASDAVDKLRFQALSHPDLYQGDAELGVKLSFDKDKNT
LTISDNGIGMTRDEVIENLGTIAKSGTAEFFSKLSQEQSKNSQLIGQFGVGFYSAFIVADAVTVRTRAAGSAPADAVQWY
SKGEGEYTVETINKESRGTDIILHLREEGKEFLSEWRLRDVISKYSDHIGIPVYIQTSVMDEEGKATEETKWEQINKAQA
LWTRAKSEVTDEEYKEFYKHVSHDFADPLVWSHNKVEGKNDYTSLLYIPAKAPWDLFNREHKHGLKLYVQRVFIMDDAAQ
FMPSYLRFVRGLIDSNDLPLNVSREILQDNKITQSLRQACTKRVLTMLERMASNDADNYQKFWKEFGLVMKEGPAEDFAN
REKIASLLRFASTHIDSAEQTISLASYVERMKEGQDKIYYLTADSYTAAKNSPHLEQFKSKGIEVILMFDRIDEWLMNYL
PEFEGKAFQSITKAGLDLSQFEDEAEKEKHKETEEQFKSVVERLKGYLGSRVKEVRTTFKLANTPAVVVTDDYEMGTQMA
KLLAAAGQPVPEVKYILEVNPEHALVKRMADEADEQTFGRWAEVLLGQAMLAERGSMEDPSQFLGAVNQLLAPSH
>P65813 3.4.24.-~~~htpX~~~Protease HtpX~~~COG0501
MMRIALFLLTNLAVMVVFGLVLSLTGIQSSSVQGLMIMALLFGFGGSFVSLLMSKWMALRSVGGEVIEQPRNERERWLVN
TVATQARQAGIAMPQVAIYHAPDINAFATGARRDASLVAVSTGLLQNMSPDEAEAVIAHEISHIANGDMVTMTLIQGVVN
TFVIFISRILAQLAAGFMGGNRDEGEESNGNPLIYFAVATVLELVFGILASIITMWFSRHREFHADAGSAKLVGREKMIA
ALQRLKTSYEPQEATSMMAFCINGKSKSLSELFMTHPPLDKRIEALRTGEYLK
>P23894 3.4.24.-~~~htpX~~~Protease HtpX~~~COG0501
MMRIALFLLTNLAVMVVFGLVLSLTGIQSSSVQGLMIMALLFGFGGSFVSLLMSKWMALRSVGGEVIEQPRNERERWLVN
TVATQARQAGIAMPQVAIYHAPDINAFATGARRDASLVAVSTGLLQNMSPDEAEAVIAHEISHIANGDMVTMTLIQGVVN
TFVIFISRILAQLAAGFMGGNRDEGEESNGNPLIYFAVATVLELVFGILASIITMWFSRHREFHADAGSAKLVGREKMIA
ALQRLKTSYEPQEATSMMALCINGKSKSLSELFMTHPPLDKRIEALRTGEYLK
>P9WHS5 3.4.24.-~~~htpX~~~Protease HtpX homolog~~~COG0501
MTWHPHANRLKTFLLLVGMSALIVAVGALFGRTALMLAALFAVGMNVYVYFNSDKLALRAMHAQPVSELQAPAMYRIVRE
LATSAHQPMPRLYISDTAAPNAFATGRNPRNAAVCCTTGILRILNERELRAVLGHELSHVYNRDILISCVAGALAAVITA
LANMAMWAGMFGGNRDNANPFALLLVALLGPIAATVIRMAVSRSREYQADESGAVLTGDPLALASALRKISGGVQAAPLP
PEPQLASQAHLMIANPFRAGERIGSLFSTHPPIEDRIRRLEAMARG
>O30795 3.4.24.-~~~htpX~~~Protease HtpX homolog~~~COG0501
MLFEQIAANKRRTWFLLVAFFALLALIGAAAGYLWMNSPLGGVIIAFIIGLIYAITMIFQSTEVVMSMNGARQVSEQEAP
ELYHIVQDMAMVAQIPMPRVYIVEDDSPNAFATGSNPENAAVAATTGLLRLMNREELEGVIGHEVSHIRNYDIRISTIAV
ALASAITMISSVAGRMMWYGGGRRRNDRDDDSGLGLLMLVFSLIAIILAPLAATLVQLAISRQREFLADASSVELTRNPQ
GMIRALQKLDNSEPMHRHVDDASAALYISDPKKKGGLQKLFYTHPPISERVERLRKM
>Q87QN1 3.4.24.-~~~htpX~~~Protease HtpX~~~COG0501
MKRIMLFLATNLAVVLVLSVVLNIVYATTGMQPGSLSGLLVMAAVFGFGGALISLMMSKGMALRSVGGMVIESPRNETEH
WLLETVGRQAQQAGIGMPTVAIYDSADINAFATGAKRDDSLVAVSTGLLHNMTRDEAEAVLAHEVSHIANGDMVTMTLMQ
GVVNTFVIFLSRFIANIVASNDDEEGQGTNMMVYFGVSMVLELVFGFLASFITMWYSRHREFHADAGAARLVGKEKMIAA
LERLKMSQESKLDGTMMAFGINGKQSLTELLMSHPPLDKRIAALRNQ
>Q9PA93 3.4.24.-~~~htpX~~~Protease HtpX~~~COG0501
MLTRIVLFAITNLAVLILASIVMSLLGVNPTQMSGLLVMALIFGFAGSFISLLMSKAIAKRTTGAYVIDQPRNLSERWLL
DTVSRQAEIVGIGRPEIAIYEGVEINAFATGADRNNALVAVSTGLLQNMSQDEVEAVLGHEIAHVANGDMVTMALLQGVL
NTFVIVLARVVGGFIDSLLSGNRGGARGVAYYAIVLVLELLFGLFATMITMWFSRRREFRADEGGAYLAGRNKMIAALER
LGINHGQSTLPTQVQAFGIYGGIGEGLRKLFLSHPPLSERIAALRVARQ
>O06291 3.4.21.107~~~htrA1~~~Probable serine protease HtrA1~~~COG0265
MDTRVDTDNAMPARFSAQIQNEDEVTSDQGNNGGPNGGGRLAPRPVFRPPVDPASRQAFGRPSGVQGSFVAERVRPQKYQ
DQSDFTPNDQLADPVLQEAFGRPFAGAESLQRHPIDAGALAAEKDGAGPDEPDDPWRDPAAAAALGTPALAAPAPHGALA
GSGKLGVRDVLFGGKVSYLALGILVAIALVIGGIGGVIGRKTAEVVDAFTTSKVTLSTTGNAQEPAGRFTKVAAAVADSV
VTIESVSDQEGMQGSGVIVDGRGYIVTNNHVISEAANNPSQFKTTVVFNDGKEVPANLVGRDPKTDLAVLKVDNVDNLTV
ARLGDSSKVRVGDEVLAVGAPLGLRSTVTQGIVSALHRPVPLSGEGSDTDTVIDAIQTDASINHGNSGGPLIDMDAQVIG
INTAGKSLSDSASGLGFAIPVNEMKLVANSLIKDGKIVHPTLGISTRSVSNAIASGAQVANVKAGSPAQKGGILENDVIV
KVGNRAVADSDEFVVAVRQLAIGQDAPIEVVREGRHVTLTVKPDPDST
>Q7A6C9 3.4.21.-~~~~~~Serine protease HtrA-like~~~
MDIGKKHVIPKSQYRRKRREFFHNEDREENLNQHQDKQNIDNTTSKKADKQIHKDSIDKHERFKNSLSSHLEQRNRDVNE
NKAEESKSNQDSKSAYNRDHYLTDDVSKKQNSLDSVDQDTEKSKYYEQNSEATLSTKSTDKVESTEMRKLSSDKNKVGHE
EQHVLSKPSEHDKETRIDSESSRTDSDSSMQTEKIKKDSSDGNKSSNLKSEVISDKSNTVPKLSESDDEVNNQKPLTLPE
EQKLKRQQSQNEQTKTYTYGDSEQNDKSNHENDLSHHTPSISDDKDNVMRENHIVDDNPDNDINTLSLSKIDDDRKLDEK
IHVEDKHKQNADSSETVGYQSQSTASHRSTEKRNISINDHDKLNGQKTNTKTSANNNQKKATSKLNKGRATNNNYSDILK
KFWMMYWPKLVILMGIIILIVILNAIFNNVNKNDRMNDNNDADAQKYTTTMKNANNTVKSVVTVENETSKDSSLPKDKAS
QDEVGSGVVYKKSGDTLYIVTNAHVVGDKENQKITFSNNKSVVGKVLGKDKWSDLAVVKATSSDSSVKEIAIGDSNNLVL
GEPILVVGNPLGVDFKGTVTEGIISGLNRNVPIDFDKDNKYDMLMKAFQIDASVNPGNSGGAVVNREGKLIGVVAAKISM
PNVENMSFAIPVNEVQKIVKDLETKGKIDYPDVGVKMKNIASLNSFERQAVKLPGKVKNGVVVDQVDNNGLADQSGLKKG
DVITELDGKLLEDDLRFRQIIFSHKDDLKSITAKIYRDGKEKEINIKLK
>O34358 3.4.21.107~~~htrA~~~Serine protease Do-like HtrA~~~COG0265
MDNYRDENRTKGNENEVFLTKENDQSASYSARNVIHDQEKKKRGFGWFRPLLGGVIGGSLALGIYTFTPLGDHDSQDTAK
QSSSQQQTQSVTATSTSSESKKSSSSSSAFKSEDSSKISDMVEDLSPAIVGITNLQAQSNSSLFGSSSSDSSEDTESGSG
SGVIFKKENGKAYIITNNHVVEGASSLKVSLYDGTEVTAKLVGSDSLTDLAVLQISDDHVTKVANFGDSSDLRTGETVIA
IGDPLGKDLSRTVTQGIVSGVDRTVSMSTSAGETSINVIQTDAAINPGNSGGPLLNTDGKIVGINSMKISEDDVEGIGFA
IPSNDVKPIAEELLSKGQIERPYIGVSMLDLEQVPQNYQEGTLGLFGSQLNKGVYIREVASGSPAEKAGLKAEDIIIGLK
GKEIDTGSELRNILYKDAKIGDTVEVKILRNGKEMTKKIKLDQKEEKTS
>Q9LA06 3.4.21.107~~~htrA~~~Serine protease Do-like HtrA~~~COG0265
MAKANIGKLLLTGVVGGAIALGGSAIYQSTTNQSANNSRSNTTSTKVSNVSVNVNTDVTSAIKKVSNSVVSVMNYQKDNS
QSSDFSSIFGGNSGSSSSTDGLQLSSEGSGVIYKKSGGDAYVVTNYHVIAGNSSLDVLLSGGQKVKASVVGYDEYTDLAV
LKISSEHVKDVATFADSSKLTIGEPAIAVGSPLGSQFANTATEGILSATSRQVTLTQENGQTTNINAIQTDAAINPGNSG
GALINIEGQVIGITQSKITTTEDGSTSVEGLGFAIPSNDVVNIINKLEADGKISRPALGIRMVDLSQLSTNDSSQLKLPS
SVTGGVVVYSVQSGLPAASAGLKAGDVITKVGDTAVTSSTDLQSALYSHNINDTVKVTYYRDGKSNTADVKLSKSTSDLE
TSSPSSSN
>P73354 ~~~htrA~~~Putative serine protease HtrA~~~COG0265
MSAQAVFPIAPHRADFFPRFVLSNSSANKCHQAMKDVSLHSPKQTPSKISLAYLGLVLVGMGIGAGGTFVLTNPQWADHL
TNNSVISPLVTNQSIAPANESLATNLQSRLSPREPSNFVVDVVESTGPAVVRINAQKTVKSQVPQAFNDPFLQRFFGSQM
PPMPNERVQRGTGSGFIVSNDGKIFTNAHVVDGADEVTVTLKDGRSFPGRVMGSDPSTDVAVVKIEAGDLPTVALGDSDH
LQVGEWAIAIGNPLGLDNTVTTGILSATGRRSADIGVPDKRVEFIQTDAAINPGNSGGPLLNADGQVIGMNTAIIQNAQG
IGFAIPINKAQEIAQQLIATGKVEHAYLGIQMVTMTPELQSQIRQETGMNIPVDKGVVIMQVMPNSPAAIAKLEQGDVLQ
SLQGQPVENAEQVQSLVGKLAVGDEVELGILRNGQQQNLTVTIGALPSAPPQ
>Q9R9I1 3.4.21.107~~~htrB~~~Serine protease Do-like HtrB~~~COG0265
MDYRRDGQNDQHQTEPSHTEQQNTENQKLIGHSEQELLDAPVSYEAGRQETASALEMEKQETAVKKEKKRRAAWLSPILG
GIIGGGLMLGIAPYLPSDQNQATETASANKQVQSDNFTTAPITNASNIADMVEDLEPTIVGISNIQTSQNNTFGTGGGSS
SESESGTGSGVIFKKDSDKAYIITNNHVVEGANKLTVTLYNGETETAKLVGSDTITDLAVLEISGKNVKKVASFGDSSQL
RTGEKVIAIGNPLGQQFSGTVTQGIISGLNRTIDVDTTQGTVEMNVLQTDAAINPGNSGGPLINASGQVIGINSLKVSES
GVESLGFAIPSNDVEPIVDQLLQNGKVDRPFLGVQMIDMSQVPETYQENTLGLFGDQLGKGVYVKEVQANSPAEKAGIKS
EDVIVKLNGKDVESSADIRQILYKDLKVGDKTTIQVLRKGKTKTLNATLTKQTESSSS
>P33129 ~~~htrE~~~Outer membrane usher protein HtrE~~~COG3188
MTIEYTKNYHHLTRIATFCALLYCNTAFSAELVEYDHTFLMGQNASNIDLSRYSEGNPAIPGVYDVSVYVNDQPIINQSI
TFVAIEGKKNAQACITLKNLLQFHINSPDINNEKAVLLARDETLGNCLNLTEIIPQASVRYDVNDQRLDIDVPQAWVMKN
YQNYVDPSLWENGINAAMLSYNLNGYHSETPGRKNESIYAAFNGGMNLGAWRLRASGNYNWMTDSGSNYDFKNRYVQRDI
ASLRSQLILGESYTTGETFDSVSIRGIRLYSDSRMLPPTLASFAPIIHGVANTNAKVTITQGGYKIYETTVPPGAFVIDD
LSPSGYGSDLIVTIEESDGSKRTFSQPFSSVVQMLRPGVGRWDISGGQVLKDDIQDEPNLFQASYYYGLNNYLTGYTGIQ
ITDNNYTAGLLGLGLNTSVGAFSFDVTHSNVRIPDDKTYQGQSYRVSWNKLFEETSTSLNIAAYRYSTQNYLGLNDALTL
IDEVKHPEQDLEPKSMRNYSRMKNQVTVSINQPLKFEKKDYGSFYLSGSWSDYWASGQNRSNYSIGYSNSTSWGSYSVSA
QRSWNEDGDTDDSVYLSFTIPIEKLLGTEQRTSGFQSIDTQISSDFKGNNQLNVSSSGYSDNARVSYSVNTGYTMNKASK
DLSYVGGYASYESPWGTLAGSISANSDNSRQVSLSTDGGFVLHSGGLTFSNDSFSDSDTLAVVQAPGAQGARINYGNSTI
DRWGYGVTSALSPYHENRIALDINDLENDVELKSTSAVAVPRQGSVVFADFETVQGQSAIMNITRSDGKNIPFAADIYDE
QGNVIGNVGQGGQAFVRGIEQQGNISIKWLEQSKPVSCLAHYQQSPEAEKIAQSIILNGIRCQIQ
>P25666 ~~~htrL~~~Protein HtrL~~~
MKSSTTIITAYFDIGRGDWTANKGFREKLARSVDVYFSYFERLAALENEMIIFTSPDLKPRVEAIRNGKPTTVIVIDIKK
KFRYIRSRIEKIQKDESFTNRLEPRQLKNPEYWSPEYVLVCNLKAYFVNKAINMGLVKTPLVAWIDFGYCHKPNVTRGLK
IWDFPFDESKMHLFTIKKGLTVTSQQQVFDFMIGNHVYIIGGAIVGSQHKWKEFYKLVLESQKITLNNNIVDDDQGIFVM
CYYKRPDLFNLNYLGRGKWFDLFRCFRSNTLGAKMQALRIFLSRK
>O69061 ~~~htxB~~~Probable phosphite transport system-binding protein HtxB~~~
MQVFTLFSKFKKALTRAILAFIATIIVCTPAQAAEVVNGKLHLRFAIAPMRPTPSQTIKEFEPIFKYLADQLGATYEIVS
PESWAAISVAMTNGHVDVGWLGPWGYVLSNKKAGTEVLATVKYRGEPFYKALIVGRADLPIKKWPEDAKGLKLSLSDQGN
TSGWLIPMAYFKSIGIDPASYFEYREGATFGQNESQIQHGLIDLGSDMDRGRNGMIEAGQIDPSKSKIVWESSKLPNDAI
SVPKDFDPALKARITEILTSLSEEKAQSLMGSGYNGFVKAKHSDYKVIEDAGRILGKL
>O25087 1.14.99.-~~~hugZ~~~Heme oxygenase HugZ~~~COG0748
MLNRIIEHMNAHHVEDMKGLLKKFGQVHHAENVAFKSVDPQGIVIGYNHNQTLRIEFNHEVKDPKDYKNAIIELCQSVEK
THDLKGVEEEVKAFKESFDSVCLATLHPNGHVVCSYAPLMSDGKQYYIYVSEVAEHFAGLKNNPHNVEVMFLEDESKAKS
AILRKRLRYKTNARFIERGAEFDKAFDSFIEKTGGAGGIKTIRAMQDFHLIALDFKEGRFVKGFGQAYDILGDKIAYVGD
KGNPHNFAHKK
>P26408 ~~~hupR1~~~Hydrogenase transcriptional regulatory protein HupR1~~~
MAASAPAILLVDDEPHSLAAMKLALEDDFDVLTAQGAEAAIAILEEEWVQVIICDQRMPGRTGVDFLTEVRERWPETVRI
IITGYTDSASMMAAINDAGIHQFLTKPWHPEQLLSSARNAARMFTLARENERLSLEMRLLNSTSESRVEKRRRALREGMG
FETILRTPNSAMTGAIALARQFASFDVPVLLRGEPGSGRAQLARAMHYVSLRSDKPFYEINLAGLPEDLAMIELFGARRG
VLPGGVAKIGLAQKADRGTLFVAGVEAASPALQLALLRMLADGAITPLGGQETASTNLRLITGAAADLRAMVAEGRFRAD
LYYALSAGEIALPPLRARRGDVALLAQSMLAEAAVRHGKQALGFDAAALEFLENYDWPGNLRELHNEVTRMLIFAQDNVL
GAELISRHILQAAPSESGADRSAEEVMTADGTLKDRIELIEMRILRETLTRNRWNKSRAAAELGLSRVGLRAKLDRYGIE
HPAGRVQEEEED
>Q9HU77 3.5.3.13~~~hutF~~~Formimidoylglutamate deiminase~~~
MSAIFAERALLPEGWARNVRFEISADGVLAEIRPDANADGAERLGGAVLPGMPNLHSHAFQRAMAGLAEVAGNPNDSFWT
WRELMYRMVARLSPEQIEVIACQLYIEMLKAGYTAVAEFHYVHHDLDGRSYADPAELSLRISRAASAAGIGLTLLPVLYS
HAGFGGQPASEGQRRFINGSEAYLELLQRLRAPLEAAGHSLGLCFHSLRAVTPQQIATVLAAGHDDLPVHIHIAEQQKEV
DDCQAWSGRRPLQWLYENVAVDQRWCLVHATHADPAEVAAMARSGAVAGLCLSTEANLGDGIFPATDFLAQGGRLGIGSD
SHVSLSVVEELRWLEYGQRLRDRKRNRLYRDDQPMIGRTLYDAALAGGAQALGQPIGSLAVGRRADLLVLDGNDPYLASA
EGDALLNRWLFAGGDRQVRDVMVAGRWVVRDGRHAGEERSARAFVQVLGELLD
>Q9HZ59 3.5.3.8~~~~~~Formimidoylglutamase~~~
MYPAPDMSLWQGRIDSQEGADARRWHQWMRPYADDAEAASVLLGFASDEGVRRNQGRQGARHGPPALRRALANLAWHGEQ
AIYDAGDIVAGDDLEAAQECYAQRVADLLACGHRVVGLGGGHEIAYASFAGLARHLSRHERLPRIGILNFDAHFDLRHAE
RASSGTPFRQIAELCQASDWPFAYCCLGISRLSNTAALFDQAQRLGVRYLLDRQLQPWNLERSEAFLDGFLQSVDHLYLT
VCLDVLPAAQAPGVSAPSAHGVEMPVVEHLVRRAKASGKLRLADIAELNPQLDSDQRTARIAARLVDSLVN
>P42068 3.5.3.8~~~hutG~~~Formimidoylglutamase~~~COG0010
MDKYPFLREAGSSFKDRDVTKMSDLIATWDGQDIKGPALIGVPLSKSSISHSGASFAPGTIRQALKHSSAYSAELGEHVV
SELLYDLGDIDIHVTDIVKSHHHIFQTMHALLSDHPDWVPLILGGDNSISYSTIKAIAQTKGTTAVIQFDAHHDVRNTED
GGPTNGTPFRRLLDEEIIEGQHLIQLGIREFSNSQAYEAYAKKHNVNIHTMDMIREKGLIPTIKEILPVVQDKTDFIFIS
VDMDVLDQSHAPGCPAIGPGGLYTDELLEAVKYIAQQPNVAGIEIVEVDPTLDFRDMTSRAAAHVLLHALKGMKLSPFK
>Q9HU92 3.5.1.68~~~hutG~~~N-formylglutamate deformylase~~~
MDEVLSFKRGRVPLLISMPHPGTRLTPAVDAGLVEEARALTDTDWHIPRLYDFAEELGASTLAAHYSRYVVDLNRPSDDK
PLYSTATTGLYPDTLFDGRPLYREGMAPSAEERMRYLAEVWTPYHRTIAEELARLKAEFGYALLWDAHSIRSHVPHLFDG
RLPDFNLGTNAGASCDPALAARLEAVCAAAEGYSHVLNGRFKGGHITRHYGQPEQHVHAVQLELAQCTYMDEQAPFAYRA
DLAEATRAVIRELLESLLAWGRERYA
>O31201 3.5.1.68~~~hutG~~~N-formylglutamate deformylase~~~
MDKVLSFHQGRLPLLISMPHAGLRLSDAVRDGLVEEARSLPDTDWHIPQLYDFARDLGASVVAAEYSRFVIDLNRPDDDK
PLYAGATTGLYPATLFEGEPLFKEGLAPSGEERKRYLEQIWRPYHGTLRRELDRLREQFGYALLWDAHSIRSHIPHLFDG
KLPDFNLGTFNGASCDPVLAERLQGVCAEATGYSHVLNGRFKGGHITRHYGDPAKHIHAVQLELAQSTYMEETEPFTYRE
DLAQPTQVVLKQLLQALLAWGAERYQR
>P99158 3.5.3.8~~~hutG~~~Formimidoylglutamase~~~
MYKQGEPNLWTGRLDSETDPKKFRHFQTVTFEDLSKLEKSSMPSGVGILGYAVDKGVALNKGRIGAKEGPDAIKQAFAGL
PDLNQCETLVDYGNVYHDHEELIDTQKEFAMLAAKSIANHRQTFLLGGGHDIAYAQYLATRKVYPTQSIGVINIDAHFDT
RAEQQSTSGTSFRQILEEDENTDYLVLGIAQGGNTQSLFDYAKEKKIDYVFADELLSHVSPTIKDMIERFVHEHDVIMFT
ICMDVIDSAFAPGVSAPAVLGLYPHTVLELAKRIIPSDKVSSVSIAEMNPTYDADNRTAKLVANLVHHFLK
>Q9KSQ2 3.5.3.8~~~hutG~~~Formimidoylglutamase~~~COG0010
MNPNFTTEHTWQGRHDPEDGQAGRRVHHIACPIQVGELANQEPGVALIGFECDAGVERNKGRTGAKHAPSLIKQALANLA
WHHPIPIYDLGNIRCEGDELEQAQQECAQVIQQALPHARAIVLGGGHEIAWATFQGLAQHFLATGVKQPRIGIINFDAHF
DLRTFESELAPVRPSSGTPFNQIHHFCQQQGWDFHYACLGVSRASNTPALFERADKLGVWYVEDKAFSPLSLKDHLTQLQ
HFIDDCDYLYLTIDLDVFPAASAPGVSAPAARGVSLEALAPYFDRILHYKNKLMIADIAEYNPSFDIDQHTARLAARLCW
DIANAMAEQVQSIRHP
>P10944 4.3.1.3~~~hutH~~~Histidine ammonia-lyase~~~COG2986
MVTLDGSSLTTADVARVLFDFEEAAASEESMERVKKSRAAVERIVRDEKTIYGINTGFGKFSDVLIQKEDSAALQLNLIL
SHACGVGDPFPECVSRAMLLLRANALLKGFSGVRAELIEQLLAFLNKRVHPVIPQQGSLGASGDLAPLSHLALALIGQGE
VFFEGERMPAMTGLKKAGIQPVTLTSKEGLALINGTQAMTAMGVVAYIEAEKLAYQTERIASLTIEGLQGIIDAFDEDIH
LARGYQEQIDVAERIRFYLSDSGLTTSQGELRVQDAYSLRCIPQVHGATWQTLGYVKEKLEIEMNAATDNPLIFNDGDKV
ISGGNFHGQPIAFAMDFLKIAISELANIAERRIERLVNPQLNDLPPFLSPHPGLQSGAMIMQYAAASLVSENKTLAHPAS
VDSIPSSANQEDHVSMGTIAARHAYQVIANTRRVIAIEAICALQAVEYRGIEHAASYTKQLFQEMRKVVPSIQQDRVFSY
DIERLTDWLKKESLIPDHQNKELRGMNI
>P21310 4.3.1.3~~~hutH~~~Histidine ammonia-lyase~~~COG2986
MTELTLKPGTLTLAQLRAIHAAPVRLQLDASAAPAIDASVACVEQIIAEDRTAYGINTGFGLLASTRIASHDLENLQRSL
VLSHAAGIGAPLDDDLVRLIMVLKINSLSRGFSGIRRKVIDALIALVNAEVYPHIPLKGSVGASGDLAPLAHMSLVLLGE
GKARYKGQWLSATEALAVAGLEPLTLAAKEGLALLNGTQASTAYALRGLFYAEDLYAAAIACGGLSVEAVLGSRSPFDAR
IHEARGQRGQIDTAACFRDLLGDSSEVSLSHKNCDKVQDPYSLRCQPQVMGACLTQLRQAAEVLGIEANAVSDNPLVFAA
EGDVISGGNFHAEPVAMAADNLALAIAEIGSLSERRISLMMDKHMSQLPPFLVENGGVNSGFMIAQVTAAALASENKALS
HPHSVDSLPTSANQEDHVSMAPAAGKRLWEMAENTRGVLAIEWLGACQGLDLRKGLKTSAKLEKARQALRSEVAHYDRDR
FFAPDIEKAVELLAKGSLTGLLPAGVLPSL
>O31197 4.3.1.3~~~hutH~~~Histidine ammonia-lyase~~~COG2986
MTVILRPGSVPLSDLETIYWTGAPARLDAAFDAGIAKAAARIAEIVAGNAPVYGINTGFGKLASIKIDSSDVATLQRNLI
LSHCCGVGQPLTEDIVRLIMALKLISLGRGASGVRLELVRLIEAMLDKGVIPLIPEKGSVGASGDLAPLAHMAAVMMGHG
EAFFAGERMKGDAALKAAGLSPVTLAAKEGLALINGTQVSTALALAGLFRAHRAGQAALITGALSTDAAMGSSAPFHPDI
HTLRGHKGQIDTAAALRQLLTGSPIRQSHIEGDERVQDPYCIRCQPQVDGACLDLLRSVAATLTIEANAVTDNPLVLSDN
SVVSGGNFHAEPVAFAADQIALAVCEIGAISQRRIALLVDPALSYGLPAFLAKKPGLNSGLMIAEVTSAALMSENKQLSH
PASVDSTPTSANQEDHVSMACHGARRLLQMTENLFSIIGIEALAAVQGIEFRAPLTTSPELQKAAAAVRGVSSSIEEDRY
MADDLKAAGDLVASGRLAAAVSAGILPKLEN
>P64416 4.3.1.3~~~hutH~~~Histidine ammonia-lyase~~~
MTLYLDGETLTIEDIKSFLQQQSKIEIIDDALERVKKSRAVVERIIENEETVYGITTGFGLFSDVRIDPTQYNELQVNLI
RSHACGLGEPFSKEVALVMMILRLNTLLKGHSGATLELVRQLQFFINERIIPIIPQQGSLGASGDLAPLSHLALALIGEG
KVLYRGEEKDSDDVLRELNRQPLNLQAKEGLALINGTQAMTAQGVISYIEAEDLGYQSEWIAALTHQSLNGIIDAYRHDV
HSVRNFQEQINVAARMRDWLEGSTLTTRQAEIRVQDAYTLRCIPQIHGASFQVFNYVKQQLEFEMNAANDNPLIFEEANE
TFVISGGNFHGQPIAFALDHLKLGVSELANVSERRLERLVNPQLNGDLPAFLSPEPGLQSGAMIMQYAAASLVSENKTLA
HPASVDSITSSANQEDHVSMGTTAARHGYQIIENARRVLAIECVIALQAAELKGVEGLSPKTRRKYEEFRSIVPSITHDR
QFHKDIEAVAQYLKQSIYQTTACH
>P24221 4.3.1.3~~~hutH~~~Histidine ammonia-lyase~~~
MHTVVVGTSGTTAEDVVAVARHGARVELSAAAVEALAAARLIVDALAAKPEPVYGVSTGFGALASRHIGTELRAQLQRNI
VRSHAAGMGPRVEREVVRALMFLRLKTVASGHTGVRPEVAQTMADVLNAGITPVVHEYGSLGCSGDLAPLSHCALTLMGE
GEAEGPDGTVRPAGELLAAHGIAPVELREKEGLALLNGTDGMLGMLVMALADLRNLYTSADITAALSLEALLGTDKVLAP
ELHAIRPHPGQGVSADNMSRVLAGSGLTGHHQDDAPRVQDAYSVRCAPQVNGAGRDTLDHAALVAGRELASSVDNPVVLP
DGRVESNGNFHGAPVAYVLDFLAIVAADLGSICERRTDRLLDKNRSHGLPPFLADDAGVDSGLMIAQYTQAALVSEMKRL
AVPASADSIPSSAMQEDHVSMGWSAARKLRTAVDNLARIVAVELYAATRAIELRAAEGLTPAPASEAVVAALRAAGAEGP
GPDRFLAPDLAAADTFVREGRLVAAVEPVTGPLA
>A0KF84 3.5.2.7~~~hutI~~~Imidazolonepropionase~~~COG1228
MNKELLNCERVWLNVTPATLRSDLADYGLLEPHALGVHEGRIHALVPMQDLKGPYPAHWQDMKGKLVTPGLIDCHTHLIF
AGSRAEEFELRQKGVPYAEIARKGGGIISTVRATRAACEEQLFELALPRVKSLIREGVTTVEIKSGYGLTLEDELKMLRV
ARRLGEALPIRVKTTLLAAHAVPPEYRDDPDSWVETICQEIIPAAAEAGLADAVDVFCEHIGFSLAQTEQVYLAADQYGL
AVKGHMDQLSNLGGSTLAANFGALSVDHLEYLDPEGIQALAHRGVVATLLPTAFYFLKETKLPPVAALRKAGVPMAVSSD
INPGTAPIVSLRMAMNMACTLFGLTPVEAMAGVTRHAARALGEQEQLGQLRVGMLADFLVWNCGHPAELSYLIGVDQLVS
RVINGEETLHG
>Q8U8Z6 3.5.2.7~~~hutI~~~Imidazolonepropionase~~~COG1228
MPGNNSAKGTATGNATALWRNAQLATLNPAMDGIGAVENAVIAVRNGRIAFAGPESDLPDDLSTADETTDCGGRWITPAL
IDCHTHLVFGGNRAMEFEMRLNGATYEEIAKAGGGIVSSVRDTRALSDEVLVAQALPRLDTLLSEGVSTIEIKSGYGLDI
ETELKMLRVARRLETLRPVRIVTSYLAAHATPADYKGRNADYITDVVLPGLEKAHAEGLADAVDGFCEGIAFSVKEIDRV
FAAAQQRGLPVKLHAEQLSNLGGAELAASYNALSADHLEYLDETGAKALAKAGTVAVLLPGAFYALREKQLPPVQALRDA
GAEIALATDCNPGTSPLTSLLLTMNMGATLFRMTVEECLTATTRNAAKALGLLAETGTLEAGKSADFAIWDIERPAELVY
RIGFNPLHARIFKGQKVSP
>P42084 3.5.2.7~~~hutI~~~Imidazolonepropionase~~~COG1228
MPKQIDTILINIGQLLTMESSGPRAGKSMQDLHVIEDAVVGIHEQKIVFAGQKGAEAGYEADEIIDCSGRLVTPGLVDPH
THLVFGGSREKEMNLKLQGISYLDILAQGGGILSTVKDTRAASEEELLQKAHFHLQRMLSYGTTTAEVKSGYGLEKETEL
KQLRVAKKLHESQPVDLVSTFMGAHAIPPEYQNDPDDFLDQMLSLLPEIKEQELASFADIFTETGVFTVSQSRRYLQKAA
EAGFGLKIHADEIDPLGGAELAGKLKAVSADHLVGTSDEGIKKLAEAGTIAVLLPGTTFYLGKSTYARARAMIDEGVCVS
LATDFNPGSSPTENIQLIMSIAALHLKMTAEEIWHAVTVNAAYAIGKGEEAGQLKAGRSADLVIWQAPNYMYIPYHYGVN
HVHQVMKNGTIVVNREGAILG
>A0QRN6 3.5.2.7~~~hutI~~~Imidazolonepropionase~~~COG1228
MTTLLIDNIGSLVTNDPTLDAGPLGVLRDAAVVVEDGRIAWYGATSAAPAADTRLDAAGRAVIPGFVDSHAHLVFAGDRS
EEFAARMSGTPYQAGGIRTTVTATRDATDATLKSTVTRLAAEALRSGTTTLECKSGYGLTVEQELRSLQVAAEITDEVTF
MGAHVVPPEYAETPDDYVELVCTAMLDACAPAAKWVDVFCERGAFDLDQSRAILQAGIARGLQPRVHANQLGPGPGVQLA
VECNAASADHVTHVSDADIAALAGSNTVATLLPAAEFSTRAAYPDGRRLIDAGVTVALSPDCNPGSSFTTNMPFCIAVAV
REMHLTPDEAVWAATAGGARALRRDDVGHLAVGARADLALLDAPSHIHLAYRPGVPLVAAVLRNGEIVWQTKEVTS
>P64418 3.5.2.7~~~hutI~~~Imidazolonepropionase~~~
MNDLIINHIAELILPKSTDKPLKGKELDELNVVKNGTVVIKDGKIVYAGQHTDDYDATETIDASGKVVSPALVDAHTHLT
FGGSREHEMSLKRQGKSYLEILEMGGGILSTVNATRETSEDDLFKKAEHDLLTMIKHGVLAVESKSGYGLDRENELKQLK
VSNRLAEKYDLDMKHTFLGPHAVPKEASSNEAFLEEMIALLPEVKQYADFADIFCETGVFTIEQSQHYMQKAKEAGFKVK
IHADEIDPLGGLELAIDEQAISADHLVASSDKGKEKLRNSDTVAVLLPATTFYLGKEDYADARGMLDNNGAIALATDYNP
GSSVTNNLQLVMAIAALKLKLSPSEVWNAVTVNAAKAIDINAGTINTGDKANLVIWDAPNHEYIPYHFGINHAEKVIKDG
KVIVDNTLSFKA
>P10943 ~~~hutP~~~Hut operon positive regulatory protein~~~
MTLHKERRIGRLSVLLLLNEAEESTQVEELERDGWKVCLGKVGSMDAHKVVAAIETASKKSGVIQSEGYRESHALYHATM
EALHGVTRGEMLLGSLLRTVGLRFAVLRGNPYESEAEGDWIAVSLYGTIGAPIKGLEHETFGVGINHI
>A4IK89 ~~~hutP~~~Hut operon positive regulatory protein~~~
MGKEKSVRIGRQALLLAMLDEGEEGAILDELRASNWRYCQGRVGAMEPQKIVAAIETAAKRHEVVDGSLYRDMHALYHAI
LEAVHGVTRGQVELGDLLRTAGLRFAVVRGTPYEQPKEGEWIAVALYGTIGAPVRGLEHEAVGLGINHI
>P25503 4.2.1.49~~~hutU~~~Urocanate hydratase~~~COG2987
MTDVKKSIRANRGTELECLGWEQEAVLRMLRNNLDPEVAEKPEDLIVYGGIGKAARDWDAFHAIEHSLKTLKNDETLLVQ
SGKPVGMFRTHPQAPRVLLANSVLVPKWADWEHFHELEKKGLMMYGQMTAGSWIYIGSQGILQGTYETFAELARQHFGGS
LKGTLTLTAGLGGMGGAQPLSVTMNEGVVIAVEVDEKRIDKRIETKYCDRKTASIEEALAWAEEAKLAGKPLSIALLGNA
AEVHHTLLNRGVKIDIVTDQTSAHDPLIGYVPEGYSLDEADRLRQDTPELYVRLAKQSMKKHVEAMLAFQQKGSIVFDYG
NNIRQVAKDEGLENAFDFPGFVPAYIRPLFCEGKGPFRWAALSGDPADIYRTDALLKELFPTNKALHRWIDMAQEKVTFQ
GLPSRICWLGYGERKKMGLAINELVRTGELKAPVVIGRDHLDCGSVASPNRETEAMKDGSDAVGDWAVLNALVNTAAGAS
WVSFHHGGGVGMGYSLHAGMVAVADGSELADERLARVLTSDPGMGIIRHADAGYERAVEVAKEQDIIVPMQK
>Q5L084 4.2.1.49~~~hutU~~~Urocanate hydratase~~~COG2987
MAEKRTVSPPAGTERRAKGWIQEAALRMLNNNLHPDVAERPDELIVYGGIGKAARNWECYEAIVDTLLRLENDETLLIQS
GKPVAVFRTHPDAPRVLIANSNLVPAWATWDHFHELDKKGLIMYGQMTAGSWIYIGSQGIVQGTYETFAEVARQHFGGTL
AGTITLTAGLGGMGGAQPLAVTMNGGVCLAIEVDPARIQRRIDTNYLDTMTDSLDAALEMAKQAKEEKKALSIGLVGNAA
EVLPRLVEMGFVPDVLTDQTSAHDPLNGYIPAGLTLDEAAELRARDPKQYIARAKQSIAAHVRAMLAMQKQGAVTFDYGN
NIRQVAKDEGVDDAFSFPGFVPAYIRPLFCEGKGPFRWVALSGDPEDIYKTDEVILREFSDNERLCHWIRMAQKRIKFQG
LPARICWLGYGERAKFGKIINDMVAKGELKAPIVIGRDHLDSGSVASPNRETEGMKDGSDAIADWPILNALLNAVGGASW
VSVHHGGGVGMGYSIHAGMVIVADGTKEAEKRLERVLTTDPGLGVVRHADAGYELAIRTAKEKGIDMPMLK
>A0QRN3 4.2.1.49~~~hutU~~~Urocanate hydratase~~~COG2987
MEGARPVRAPRGTTLTARSWATEAPLRMLMNNLDPENAERPDDLVVYGGTGRAARNWASFDAMVRTLTTLREDETMLVQS
GKPVGVFQTHEWAPRVLIANSNLVGDWATWPEFRRLEAMGLTMYGQMTAGSWIYIGTQGIVQGTYETFAAAAEKRFGGTL
AGTLTLTGGCGGMGGAQPLAVTLNGGACLIVDVDEARLRRRVEHRYLDEVADNLDDAVTKAVATRKDKRAWSVGVVGNAA
EVFPELLRRGVPIDLVTDQTSAHDPLSYLPIGISVEDWEDYATKKPDEFTERAEESMAVQVRAMVEFQDAGAEVFDYGNS
IRDEARKAGYDRAFEFPGFVPAYIRPQFCEGRGPFRWVALSGDPKDIHATDEAIMKLFPDDDRLQKWMRGAREKISFQGL
PARICWLGYGERDKAGVLFNDLVASGKVSAPIVIGRDHLDSGSVASPYRETEAMLDGSDAIADWPLLNALTATSSGATWV
SIHHGGGVGIGRSIHAGQVGVADGTELAAQKLSRLLTNDPGMGVIRHVDAGYERAEEIAAERGVRIPMREGE
>Q9HU83 4.2.1.49~~~hutU~~~Urocanate hydratase~~~
MTTPSKFRDIEIRAPRGTTLTAKSWLTEAPLRMLMNNLDPEVAENPRELVVYGGIGRAARNWECYDRIVETLKQLNDDET
LLVQSGKPVGVFKTHANAPRVLIANSNLVPHWATWEHFNELDAKGLAMYGQMTAGSWIYIGSQGIVQGTYETFVEAGRQH
YDGNLKGRWVLTAGLGGMGGAQPLAATLAGACSLNIECQQSRIDFRLRSRYVDEQAKDLDDALARIQRYTAEGKAISIAL
LGNAAEILPELVRRGVRPDMVTDQTSAHDPLNGYLPAGWSWEEYRDRAQTDPAAVVKAAKQSMAVHVRAMLAFQQQGVPT
FDYGNNIRQMAKEEGVANAFDFPGFVPAYIRPLFCRGIGPFRWAALSGDPQDIYKTDAKVKQLIPDDAHLHRWLDMARER
ISFQGLPARICWVGLGLRAKLGLAFNEMVRTGELSAPIVIGRDHLDSGSVASPNRETEAMQDGSDAVSDWPLLNALLNTA
SGATWVSLHHGGGVGMGFSQHSGMVIVCDGSDEAAERIARVLTNDPGTGVMRHADAGYQVAIDCAKEQGLNLPMITAQR
>P25080 4.2.1.49~~~hutU~~~Urocanate hydratase~~~COG2987
MTDNNKYRDVEIRAPRGNKLTAKSWLTEAPLRMLMNNLDPQVAENPKELVVYGGIGRAARNWECYDKIVETLTRLEDDET
LLVQSGKPVGVFKTHSNAPRVLIANSNLVPHWANWEHFNELDAKGLAMYGQMTAGSWIYIGSQGIVQGTYETFVEAGRQH
YGGTVKAKWVLTAGLGGMGGAQPLAATLAGACSLNIECQQSRIDFRLETRYVDEQATDLDDALVRIAKYTAEGKAISIAL
HGNAAEILPELVKRGVRPDMVTDQTSAHDPLNGYLPAGWTWEQYRDRAQTEPAAVVKAAKQSMAVHVQAMLDFQKQGVPT
FDYGNNIRQMAKEEGVANAFDFPGFVPAYIRPLFCRGVGPFRWAALSGEAEDIYKTDAKVKELIPDDAHLHRWLDMARER
ISFQGLPARICWVGLGLRAKLGLAFNEMVRSGELSAPVVIGRDHLDSGSVSSPNRETEAMRDGSDAVSDWPLLNALLNTA
GGATWVSLHHGGGVGMGFSQHSGMVIVCDGTDEAAERIARVLTNDPGTGVMRHADAGYDIAIDCAKEQGLDLPMITG
>Q9AGU4 4.2.1.49~~~hutU~~~Urocanate hydratase~~~COG2987
MTSTTPKSPAAFTRHRDGEIRAARGTQLTAKSWMTEAPLRMLMNNLDPQVAENPTELVVYGGIGRAARNWECYDKIVESL
TNLNDDETLLVQSGKPVGVFKTHSNAPRVLIANSNLVPHWATWEHFNELDAKGLAMYGQMTAGSWINIGSQGIVQGTYET
FVEAGRQHYNGSLKGKWVLTAGLGGMGGAQPLAATLAGACSLNIECQQSRIDFRLATRYVDEQALDLDDALVRIAKYTAE
GKAISIALCGNAAELLPEMVRRGVRPDMVTDQTSAHDPLNGYLPKGWTWEQYRDRAVTDPAAVVKAAKASMGEHVEAMLA
FQKAGIPTFDYGNNIRQMAKEVGVENAFDFPGFVPAYIRPLFCRGVGPFRWVALSGDAEDIYKTDAKVKELIADDAHLHN
WLDMARERISFQGLPARICWVGLGQRAKLGLAFNEMVRSGELKAPIVIGRDHLDSGSVSSPNRETESMKDGSDAVSDWPL
LNALLNTASGATWVSLHHGGGVGMGFSQHSGMVIVCDGTDEAAERIARVLHNDPATGVMRHADAGYDIAIDCANEQGLNL
PMING
>P67417 4.2.1.49~~~hutU~~~Urocanate hydratase~~~
MRKIQAKKGLSIECKGWEQEAVLRMLYNNLDPEVAERPEDLVVYGGIGKAARNWEAFEAIEKTLRELESDETMLVQSGKP
VAVFKTHEEAPRVLISNSVLVPEWANWDHFNELDKKGLIMYGQMTAGSWIYIGSQGIVQGTYETFAELGNQHFNGDLAGT
VTLTAGLGGMGGAQPLAITMNHGVAICVDVDETRVDKRIDTKYCDVKTADLDEALKLAEEAKERGEGLSIGLVGNAVDIH
QAILEKGFKIDIITDQTSAHDPLNGYVPQGYSVEEAKVLREKDPKKYVELSQASMAKHVELMLEFQKRGAVAFDYGNNIR
QVAFNNGVKNAFDFPGFVPAYIRPLFCEGKGPFRFAALSGDPKDIERADEEMRKLFPENEKLLRWLDLAEEKISYQGLPS
RIAWLGYGERAKMGLALNRLVRDGEISAPIVIGRDHLDAGSVASPNRETESMKDGSDAVGDWAVLNALINTAAGGSWISF
HHGGGVGMGYSLHAGMVVVADGSERAERRLERVLTTDPGMGVARHVDAGYDIAIQTAKEKGIHIPMIDKAGDK
>Q9KL40 ~~~hutX~~~Intracellular heme transport protein HutX~~~COG3721
MYSGAYSFVQISTAAYRISITRLEKTMESLQQQVAQLLEQQPTLLPAAMAEQLNVTEFDIVHALPEEMVAVVDGSHAQTI
LESLPEWGPVTTIMTIAGSIFEVKAPFPKGKVARGYYNLMGRDGELHGHLKLENISHVALVSKPFMGRESHYFGFFTAQG
ENAFKIYLGRDEKRELIPEQVARFKAMQQQHKQ
>Q9KL41 1.14.99.58~~~hutZ~~~Heme oxygenase HutZ~~~COG0748
MDQQVKQERLQGRLEPEIKEFRQERKTLQLATVDAQGRPNVSYAPFVQNQEGYFVLISHIARHARNLEVNPQVSIMMIED
ETEAKQLFARKRLTFDAVASMVERDSELWCQVIAQMGERFGEIIDGLSQLQDFMLFRLQPEQGLFVKGFGQAYQVSGDDL
VDFVHLEEGHRKISNG
>P42505 ~~~hvrA~~~Trans-acting regulatory protein HvrA~~~
MDIDALSLNELKALRSKVDRAIVTYEERKKKEAFAELDEIARKMGYPLAEILTMVETKPRKTVAAKYANPANPSETWTGR
GRKPKWVEAALASGKSLEDLTI
>Q9I5N9 7.4.2.8~~~hxcR~~~Type II secretion system protein HxcR~~~
MSLLPYAWAKAQRALLRPGEHGATLLVSPRTPGWAISEVRQRHAPASLESVRDDELDTLLASAYSDTGSAAAVVGAAESE
VDLDRLMDDIPEVTDLLDTQDGAPVIRMINALLTQAARDEASDIHIEPFETHSVVRYRVDGALRDVVAPRKALHAALVSR
IKIMAQLDIAEKRLPQDGRIALRVAGRPIDIRVSTVPTGHGERVVMRLLDKQAGRLRLETLGMAPGVLAPLDNLIRQPHG
IVLVTGPTGSGKTTTLYAALARLDASTSNILTVEDPVEYDLPGISQIQVNARIDMTFAVALRAILRQDPDIIMIGEIRDL
ETAQIAVQASLTGHLVLATLHTNDAVSAVTRLVDMGVEPFLLASSMLGVLAQRLVRRLCTHCRVEEDGGWRAVGCPACNQ
TGYSGRTGIHELFVIDDEIRRLVHQGRAEQDLREAARAAGMRSMREDGERWIASGSTTLEEILRVTRDA
>P42406 ~~~hxlR~~~HTH-type transcriptional activator HxlR~~~COG1733
MSRMDDKRFNCEKELTLAVIGGKWKMLILWHLGKEGTKRFNELKTLIPDITQKILVNQLRELEQDMIVHREVYPVVPPKV
EYSLTPHGESLMPILEAMYEWGKGYMELIDIDKNVMKESL
>P77625 3.1.3.22~~~hxpA~~~Hexitol phosphatase A~~~COG0637
MRCKGFLFDLDGTLVDSLPAVERAWSNWARRHGLAPEEVLAFIHGKQAITSLRHFMAGKSEADIAAEFTRLEHIEATETE
GITALPGAIALLSHLNKAGIPWAIVTSGSMPVARARHKIAGLPAPEVFVTAERVKRGKPEPDAYLLGAQLLGLAPQECVV
VEDAPAGVLSGLAAGCHVIAVNAPADTPRLNEVDLVLHSLEQITVTKQPNGDVIIQ
>P77247 3.1.3.68~~~hxpB~~~Hexitol phosphatase B~~~COG0637
MSTPRQILAAIFDMDGLLIDSEPLWDRAELDVMASLGVDISRRNELPDTLGLRIDMVVDLWYARQPWNGPSRQEVVERVI
ARAISLVEETRPLLPGVREAVALCKEQGLLVGLASASPLHMLEKVLTMFDLRDSFDALASAEKLPYSKPHPQVYLDCAAK
LGVDPLTCVALEDSVNGMIASKAARMRSIVVPAPEAQNDPRFVLADVKLSSLTELTAKDLLG
>Q8NR92 1.13.11.37~~~~~~Hydroxyquinol 1,2-dioxygenase~~~COG3485
MTISAQQQAVEEDLVERVLASFDSCENPRLKLVMKSLTVHLHDFIRDVRLTEEEWNYAIDFLTKVGHITDDKRQEFVLLS
DTLGASMQTIAVNNEAYEDATEATVFGPFFVDDAPLVQNGDDIAFGAVGQPAWVEGTVKDTEGNPIPNARIEVWECDEDG
LYDVQYADERSAGRAHLYSDENGEYHFWGLTPVPYPIPHDGPVGQMLQAVGRSPVRCAHLHFMVTAPEKRTLVTHIFVEG
DPQLEIGDSVFGVKDSLIKKFVEQPAGTATPDGRDVGDQTWARTRFDIVLAPGNV
>Q8NL92 1.13.11.37~~~~~~Hydroxyquinol 1,2-dioxygenase~~~COG3485
MTTTTADHNISAQQKAVEENLVNRVLQSFDACENPRLKQLMESLVVHLHDFIRDVRLTEDEWNYAIDFLTAVGHITDDKR
QEFVLLSDTLGASMQTIAVNNEAYENSTEATVFGPFFLDDAPEVELGGDIAGGAQGQAAWIEGTVTDTEGNPVPNARIEV
WECDEDGLYDVQYADERMAGRAYMHTDANGDYRFWGLTPVPYPIPHDGPVGNMLKAVGRSPVRCAHLHFMVTAPELRTLV
THIFVEGDPQLEIGDSVFGVKDSLIKKFEEQAPGTPTPDGRDLGDQTWARTRFDIVLAPGA
>P44602 ~~~hxuA~~~Heme/hemopexin-binding protein~~~COG3210
MYKLNVISLIILTTYTGATYASARDLPQGSSVVVGEANVSTIGNKMTIDQKTPTTQIDWHSFDIGQNKEVEFKQPDANSV
AYNRVTGGNASQIQGKLTANGKVYLANPNGVIITQGAEINVAGLFATTKDLERISENGNGNGNKFTRKLKDGQVVKEGQV
INKGKIKAKDFVVLNGDKVINEGEIDATNNGKVYLSSGYNFTFTLSDSSISVALEDNAVQSIVQNEGIIKAGDITLNAKG
RNQALDSLVMNNGVLEATKVSNKNGKVVLSADDVQLNNKSDIKGESEVVFTNEPKNKIKITSQTGSKVTSPKINFTGKSV
NINGDFGRDDSKAHYNEEHKRLDTEVNIDVPDNENIRIAEKDNTGTGTGTDSFIQTGALSSLLANNGKVNLKGKDVNISG
RIHIDSFRGSDSLLKLTNQGHIKINHADIHSTGRLFFITSLQNEKDSQSDITITDSKINLGNGAMGLGRSLDKENCDNQR
WCRTETSQRKKFDVHMRNVVFDQVDDVVVAGGFKKVNLDNIVATGKTNFYIDGGVSRNNSRYEYGVLDLDKRTLLSELDQ
RRRRWKYYNDLDLDMNKAYWHRFDMFATKNTGRSTIKDTEINISNSKINLKNGFVHLLAEKIKLDNSKIDITFDKDNSQD
ISTQINRLGMNGKVSMVNSHIKIVGDEKSDISAKAPYATMFLIGELIGEKSSIFVKSHQGYTFRTDGDTKIAGKNSKDDL
KITAINTGGRTGKEVIINGAPGSIDNDANIANMAFTIGDNANTKTTIENADITALAPNGGTAYLSSKGVEIEVNPNSNFT
FFELPREKNFNQTKIKGDSTKLSERGFARLYDKINGVRASNLSAEQLNVTDASEKIINTKLVSSLDVEKLVSVAVCDAGK
GCEEQQFGDKGNNTKVSVGELETEQ
>P45354 ~~~hxuA~~~Heme/hemopexin-binding protein~~~
MYKLNVISLIILTTCSGAAYASTPDFPQHHKTVFGTVTIEKTTADKMTIKQGSDKAQIDWKSFDIGQKKEVKFEQPNEHA
VAYNRVIGGNASQIQGKLTANGKVYLANPNGVIITQGAEINVAGLLATTKDLERISENSNSYQFTRRTKDRQVLKEGLVL
KDGQVVKEGQVINEGNITAQDFVVLNGDEVINKGNINVEKNSTINGKVYLSSGYNFTFTLPDSGISVALEDNTVQGIVKN
EGSIKAGEITLSAKGRKQALDSLVMNNGVLEATKVSNKNGKVVLSADNVELNNESNIKGEIVTFGADVTSNKELKDNIKI
TSKTGSKVTSPKINFTGKSVNINGNFGREDSTTHYKDEFKKLNTEVNIDVPDNENIRIADIEDNTGTGTTGTGTSSFIQT
GALSSLLANNGKVNLKGNNVNISGRIHIDSFRGSDSLLKLTNKGHIDINNADIHSKGRLFFITSLQNEEDFKSNITITDS
KINLGNGAMGLGRSVDEKDYDNRWQKTEGSQRKKFDVKMSNVEFNQVDDVILAGGFEKVNLDKIVATGQTNFYIDGGVSR
NGRKYEYGVLDLDKRTQLSELNQGRRRWGYYYDLELDMNRAYLYRFDLFATKNTGRSTIKDTEINISNSNINLKNGFVHL
LAEKIKLDNSKIDITFDKDNSQDTLAQTNRLGMNGKVSMINSHIKIVGDEKEGISPTGTYATMFLIGELIGEKSSIFVKS
HQGYTFKTDGNTKIAGKYSKEDLKITAINTGGRAAEEVLINGALGSADNDANIANMAFTIGDSANTKTTIENADITALAP
NGGTAYLSSKDVEIEVKPNSNFTFFELPREKNLNQTKINGASTKLSERGFARLYDKINGVRASNLSAEQLNVTDASEKII
NTKLVSSLDVEKLVSVAVCDAGNGCEEQQFGDKGNNTKVSVGELEAEQ
>P45356 ~~~hxuB~~~Heme/hemopexin transporter protein HuxB~~~
MKMRPRYSVIASAVSLGFVLSKSVMALGQPDTGSLNRELEQRQIQSEAKPSGELFNQTANSPYTAQYKQGLKFPLTQVQI
LDRNNQEVVTDELAHILKNYVGKEVSLSDLSNLANEISEFYRHNNYLVAKAILPPQEIEQGTVKILLLKGNVGEIRLQNH
SALSNKFVSRLSNTTVNTSEFILKDELEKFALTINDVPGVNAGLQLSAGKKVGEANLLIKINDAKRFSSYVSVDNQGNKY
TGRYRLAAGTKVSNLNGWGDELKLDLMSSNQANLKNARIDYSSLIDGYSTRFGVTANYLDYKLGGNFKSLQSQGHSHTLG
AYLLHPTIRTPNFRLSTKVSFNHQNLTDKQQAVYVKQKRKINSLTAGIDGSWNLIKDGTTYFSLSTLFGNLANQTSEKKH
NAVENFQPKSHFTVYNYRLSHEQILPKSFAFNIGINGQFADKTLESSQKMLLGGLSGVRGHQAGAASVDEGHLIQTEFKH
YLPVFSQSVLVSSLFYDYGLGKYYKNSQFLEQGVKNSVKLQSVGAGLSLSDAGSYAINVSVAKPLDNNINNADKHQFWLS
MIKTF
>P44600 ~~~hxuC~~~Heme/hemopexin utilization protein C~~~COG4771
MRFSKLSLAITTTLVTANALAQSVELDSINVIATRDPSRFAYTPEKQSKDSLLSKQATSVADALEDIPNVDVRGGSRSIA
QKPNIRGLSDNRVVQVIDGVRQNFDLAHRGSYFLPMSLIQEIEVIKGPSSSLWGSGALGGVVAMRTPNALDLLKNNDKFG
VKIRQGYQTANNLSEKDVSVFAANDKFDVLISGFYNNADNLRTGKGNKLNNTAYKQFGGLAKFGWQINDANRVELSHRET
RFKQTAPSNNEVENELTNEQITDQIKKFHGQKDDLLPPTTQPSPSERSEFYSKVKTRLGSVSYLTDQQIPDQSTVFNYYL
TPDNPYLNTHIALYNNKTIEKEQRKVSGVKDQTKLTTRGINLRNSSELSHISFVYGVDYMRDKIRTERGTNGSDAKFRAD
PYNANSNTTGVYLIAHIPLFGEKLLVSPSVRYDHYDTSSKTVKYKDNHLSPATKLTWIVTNWLDFTAKYNEAFRAPSMQE
RFVSGAHFGANTLGLDHINRFVANPNLRPETAKNKEITANLHFDSLFKQGDKFKIEATYFRNDVKDFINLKIFNDAKTSA
SAGANPNTNGALLPKNSQYQNITNARLSGIELQAQYQTERLTLFTNYGSTKGKDKDSGEALSNIAASKIGVGVNYALVKD
KFTVGATVTHYAAQRRVPKDHSVTYPSYILTDLRATYAPLKGEWKNLRLDFALENLFDRKYQPAFSLMEGTGRNAKISAV
YSF
>P19930 3.4.23.-~~~hyaD~~~Hydrogenase 1 maturation protease~~~COG0680
MSEQRVVVMGLGNLLWADEGFGVRVAERLYAHYHWPEYVEIVDGGTQGLNLLGYVESASHLLILDAIDYGLEPGTLRTYA
GERIPAYLSAKKMSLHQNSFSEVLALADIRGHLPAHIALVGLQPAMLDDYGGSLSELAREQLPAAEQAALAQLAAWGIVP
QPANESRCLNYDCLSMENYEGVRLRQYRMTQEEQG
>P19931 ~~~hyaE~~~Hydrogenase-1 operon protein HyaE~~~COG1999
MSNDTPFDALWQRMLARGWTPVSESRLDDWLTQAPDGVVLLSSDPKRTPEVSDNPVMIGELLREFPDYTWQVAIADLEQS
EAIGDRFGVFRFPATLVFTGGNYRGVLNGIHPWAELINLMRGLVEPQQERAS
>E8MGH8 3.2.1.185~~~hypBA1~~~Non-reducing end beta-L-arabinofuranosidase~~~
MNVTITSPFWKRRRDQIVESVIPYQWGVMNDEIDTTVPDDPAGNQLADSKSHAVANLKVAAGELDDEFHGMVFQDSDVYK
WLEEAAYALAYHPDPELKALCDRTVDLIARAQQSDGYLDTPYQIKSGVWADRPRFSLIQQSHEMYVMGHYIEAAVAYHQV
TGNEQALEVAKKMADCLDANFGPEEGKIHGADGHPEIELALAKLYEETGEKRYLTLSQYLIDVRGQDPQFYAKQLKAMNG
DNIFHDLGFYKPTYFQAAEPVRDQQTADGHAVRVGYLCTGVAHVGRLLGDQGLIDTAKRFWKNIVTRRMYVTGAIGSTHV
GESFTYDYDLPNDTMYGETCASVAMSMFAQQMLDLEPKGEYADVLEKELFNGSIAGISLDGKQYYYVNALETTPDGLDNP
DRHHVLSHRVDWFGCACCPANIARLIASVDRYIYTERDGGKTVLSHQFIANTAEFASGLTVEQRSNFPWDGHVEYTVSLP
ASATDSSVRFGLRIPGWSRGSYTLTVNGKPAVGSLEDGFVYLVVNAGDTLEIALELDMSVKFVRANSRVRSDAGQVAVMR
GPLVYCAEQVDNPGDLWNYRLADGVTGADAAVAFQADLLGGVDTVDLPAVREHADEDDAPLYVDADEPRAGEPATLRLVP
YYSWANREIGEMRVFQRR
>E8MGH9 3.2.1.187~~~hypBA2~~~Beta-L-arabinobiosidase~~~
MHHSTRKRWLASIGAVAAVATLATGGAVTAQAADAPVIKNADVAYPSFKGSDDPMKTAANNTTYNPAVSYLQETFDNDVK
NLAGIDTDHDFWIDKILTRTGAQPTGKGTNDKGAYSYEGSDGNNYLFTRGRAAYMYTHTPNQLGFVGDTAYWDQTSRSGF
TVTVNADGSNQTLNEDASQRKQTPSYFTSLFQTGGKSLKIKEVKYITYNNVMVANLTVESTQDRDVTLTTASPFAAEGAD
GATELTGRVNVKNNLTTIYPRFSANNQDGSNWIVSGGKLTSTLSLKANEPQTVKIQLGLIANELPDSTKEYEARYTGDLK
DAAASYKDSVTTYNKWWVDNAPYVDTPEDNIDKTVVYRWWLSRFNMLDANMPGNTFQYPTSIEGVLGYNNQIVLTSGMFM
MDTKWFRNPEYSYGTWLSAGDTAKKSKAGYYYYHDNPGDPANWNHSYTQYITRAGWDSYKVHGGPSTVAEELADQGAEDV
QGLLASKSEPDNNDNQNNNDNSLIDWSWWSMTGNDADAVSFSEPGRSGQRMDRADGSANMWANANAAAQAYKAAGDTANA
EKMQAIADKIQKEVTTELWDKSDNLLKHKWLNDGAFAKYKEINNYYPYSEGLMPTGNEDYNKALRLFEDSNEFPIFPFFT
ANQADKAALNFPGSNNFSIINAQPLLQVYSAGIRNYDAAKNGYITNEQFKKLLYWVAFAHYQGGDNNYPDQNEFWNEDNN
NVGDVNGDGVINNLDKNLDAAQNGGKITYRSWIHHTQLGTTNWTMVEDVAGMVPREDNKIELNPIEIPGWNYFTVNNLRY
HDQDVSIVWDKDGSHYGGPAGYSLYVGGKLAFTSDKLAHLIYDPAAGTVEVKDDSSAQVTVGAEAVKNVKAANQVTFNAD
QRVTDLFAKSGTNVDSASKSTTNVAKDADVTGTTYAEKDTNYPAKNAVDGKTVMESFWGTKGSENKTDTLNIKFKDGKQK
IDDIRLYFYQSSSSQTISGYAEPANYKLEYQKDDGTWAPIADQVRTPNYAGANYNRIQFTPVETTTIRVTFTPQAGMAVG
VKEIEAYNTGIKADGTSENQTPQVDAYVSSSTSSGAKLVGTVKDDGLPAEGDVTTTWSQVSGPEGGTAKFVDASAASTTV
TFNKEGDYVLKLTASDGEKEGSKEITVHGIPSDGTVNVAPQSSASASYTNGYQPKDNAKKVIDGQVVYANTPNETWNNWG
DSTGVEPWLQLKWAGKVPLKKAKVFFWTDGGGVPMVSSWKLQYADADGNWQDVKLADGQSYTVNRNEGNEVKFADAVETD
KLRVVFPKGAIVGASEFEAYAIEPVSVDEVNRLVQTGSKADDLKLPSTVSAVYTDGSRRDLAVTWGKVTDAQLAADAVFD
VKGTVAGALNGTVAHIAARSDTASQTVGNAQPVEQTVYQNAKSIDLPATVPVKFPNGYNDDRKVTWKDADIKAIDLTKVG
DYEVAGTVDDGSSSAAAKLTVHVVADPNGSSTPEPEPEPLVGWIEGKATRTTISPDSEATWSPAEGKLNDGVVVDDTWPT
TDDQNVNDKVWGSWGKAKDGMYAQYDFGQSVTIDQSRAQFWANFAETDDSKGGLEVPDAWKIQYLAEDGSWKDVEPTEDY
TVVRNSPASRADTDAKGWSAVTFKPVTTKSLRLVLTPYTGSSTFGAAVAEWGVHGIDGTEPEPTPVDKTALESALDTANG
LDASRYTAASWAEFQQIIDAAQAVYDDANATAEQVAEQVTKLEDGQKALVALATDVEKSTLQAAIDAAKAEAASGKYTDK
SVEALNKAIEAAEGVLKVGEVGEVTQAAVQEASASLNKAVKALEEKPAAETVKKESLEASIEQAKKADKSKYTEEAWQAL
QSQIAAAQKVYDDKDAKQADVDAAQDALDKAFWATKVEQKPGSQQPGVTDTDKDDKDNKGDRVPPTGAAVSVVAAAAVLL
TAAGVTILKRRQSGDHGSARHSA
>P37180 ~~~hybB~~~Probable Ni/Fe-hydrogenase 2 b-type cytochrome subunit~~~COG5557
MSHDPQPLGGKIISKPVMIFGPLIVICMLLIVKRLVFGLGSVSDLNGGFPWGVWIAFDLLIGTGFACGGWALAWAVYVFN
RGQYHPLVRPALLASLFGYSLGGLSITIDVGRYWNLPYFYIPGHFNVNSVLFETAVCMTIYIGVMALEFAPALFERLGWK
VSLQRLNKVMFFIIALGALLPTMHQSSMGSLMISAGYKVHPLWQSYEMLPLFSLLTAFIMGFSIVIFEGSLVQAGLRGNG
PDEKSLFVKLTNTISVLLAIFIVLRFGELIYRDKLSLAFAGDFYSVMFWIEVLLMLFPLVVLRVAKLRNDSRMLFLSALS
ALLGCATWRLTYSLVAFNPGGGYAYFPTWEELLISIGFVAIEICAYIVLIRLLPILPPLKQNDHNRHEASKA
>P37182 3.4.23.-~~~hybD~~~Hydrogenase 2 maturation protease~~~COG0680
MRILVLGVGNILLTDEAIGVRIVEALEQRYILPDYVEILDGGTAGMELLGDMANRDHLIIADAIVSKKNAPGTMMILRDE
EVPALFTNKISPHQLGLADVLSALRFTGEFPKKLTLVGVIPESLEPHIGLTPTVEAMIEPALEQVLAALRESGVEAIPRE
AIHD
>P0AAN1 ~~~hybE~~~Hydrogenase-2 operon protein HybE~~~COG1773
MTEEIAGFQTSPKAQVQAAFEEIARRSMHDLSFLHPSMPVYVSDFTLFEGQWTGCVITPWMLSAVIFPGPDQLWPLRKVS
EKIGLQLPYGTMTFTVGELDGVSQYLSCSLMSPLSHSMSIEEGQRLTDDCARMILSLPVTNPDVPHAGRRALLFGRRSGE
NA
>P0A703 ~~~hybF~~~Hydrogenase maturation factor HybF~~~COG0375
MHELSLCQSAVEIIQRQAEQHDVKRVTAVWLEIGALSCVEESAVRFSFEIVCHGTVAQGCDLHIVYKPAQAWCWDCSQVV
EIHQHDAQCPLCHGERLRVDTGDSLIVKSIEVE
>P0AAM7 ~~~hybG~~~Hydrogenase maturation factor HybG~~~COG0298
MCIGVPGQVLAVGEDIHQLAQVEVCGIKRDVNIALICEGNPADLLGQWVLVHVGFAMSIIDEDEAKATLDALRQMDYDIT
SA
>P0AEV4 ~~~hycA~~~Formate hydrogenlyase regulatory protein HycA~~~
MTIWEISEKADYIAQRHRRLQDQWHIYCNSLVQGITLSKARLHHAMSCAPDKELCFVLFEHFRIYVTLADGFNSHTIEYY
VETKDGEDKQRIAQAQLSIDGMIDGKVNIRDREQVLEHYLEKIAGVYDSLYTAIENNVPVNLSQLVKGQSPAA
>P0AAK1 ~~~hycB~~~Formate hydrogenlyase subunit 2~~~COG1142
MNRFVIADSTLCIGCHTCEAACSETHRQHGLQSMPRLRVMLNEKESAPQLCHHCEDAPCAVVCPVNAITRVDGAVQLNES
LCVSCKLCGIACPFGAIEFSGSRPLDIPANANTPKAPPAPPAPARVSTLLDWVPGIRAIAVKCDLCSFDEQGPACVRMCP
TKALHLVDNTDIARVSKRKRELTFNTDFGDLTLFQQAQSGEAK
>P16429 ~~~hycC~~~Formate hydrogenlyase subunit 3~~~COG0651
MSAISLINSGVAWFVAAAVLAFLFSFQKALSGWIAGIGGAVGSLYTAAAGFTVLTGAVGVSGALSLVSYDVQISPLNAIW
LITLGLCGLFVSLYNIDWHRHAQVKCNGLQINMLMAAAVCAVIASNLGMFVVMAEIMALCAVFLTSNSKEGKLWFALGRL
GTLLLAIACWLLWQRYGTLDLRLLDMRMQQLPLGSDIWLLGVIGFGLLAGIIPLHGWVPQAHANASAPAAALFSTVVMKI
GLLGILTLSLLGGNAPLWWGIALLVLGMITAFVGGLYALVEHNIQRLLAYHTLENIGIILLGLGAGVTGIALEQPALIAL
GLVGGLYHLLNHSLFKSVLFLGAGSVWFRTGHRDIEKLGGIGKKMPVISIAMLVGLMAMAALPPLNGFAGEWVIYQSFFK
LSNSGAFVARLLGPLLAVGLAITGALAVMCMAKVYGVTFLGAPRTKEAENATCAPLLMSVSVVALAICCVIGGVAAPWLL
PMLSAAVPLPLEPANTTVSQPMITLLLIACPLLPFIIMAICKGDRLPSRSRGAAWVCGYDHEKSMVITAHGFAMPVKQAF
APVLKLRKWLNPVSLVPGWQCEGSALLFRRMALVELAVLVVIIVSRGA
>P16430 ~~~hycD~~~Formate hydrogenlyase subunit 4~~~COG0650
MSVLYPLIQALVLFAVAPLLSGITRVARARLHNRRGPGVLQEYRDIIKLLGRQSVGPDASGWVFRLTPYVMVGVMLTIAT
ALPVVTVGSPLPQLGDLITLLYLFAIARFFFAISGLDTGSPFTAIGASREAMLGVLVEPMLLLGLWVAAQVAGSTNISNI
TDTVYHWPLSQSIPLVLALCACAFATFIEMGKLPFDLAEAEQELQEGPLSEYSGSGFGVMKWGISLKQLVVLQMFVGVFI
PWGQMETFTAGGLLLALVIAIVKLVVGVLVIALFENSMARLRLDITPRITWAGFGFAFLAFVSLLAA
>P16431 ~~~hycE~~~Formate hydrogenlyase subunit 5~~~COG3261
MSEEKLGQHYLAALNEAFPGVVLDHAWQTKDQLTVTVKVNYLPEVVEFLYYKQGGWLSVLFGNDERKLNGHYAVYYVLSM
EKGTKCWITVRVEVDANKPEYPSVTPRVPAAVWGEREVRDMYGLIPVGLPDERRLVLPDDWPDELYPLRKDSMDYRQRPA
PTTDAETYEFINELGDKKNNVVPIGPLHVTSDEPGHFRLFVDGENIIDADYRLFYVHRGMEKLAETRMGYNEVTFLSDRV
CGICGFAHSTAYTTSVENAMGIQVPERAQMIRAILLEVERLHSHLLNLGLACHFTGFDSGFMQFFRVRETSMKMAEILTG
ARKTYGLNLIGGIRRDLLKDDMIQTRQLAQQMRREVQELVDVLLSTPNMEQRTVGIGRLDPEIARDFSNVGPMVRASGHA
RDTRADHPFVGYGLLPMEVHSEQGCDVISRLKVRINEVYTALNMIDYGLDNLPGGPLMVEGFTYIPHRFALGFAEAPRGD
DIHWSMTGDNQKLYRWRCRAATYANWPTLRYMLRGNTVSDAPLIIGSLDPCYSCTDRMTVVDVRKKKSKVVPYKELERYS
IERKNSPLK
>P16432 ~~~hycF~~~Formate hydrogenlyase subunit 6~~~COG1143
MFTFIKKVIKTGTATSSYPLEPIAVDKNFRGKPEQNPQQCIGCAACVNACPSNALTVETDLATGELAWEFNLGHCIFCGR
CEEVCPTAAIKLSQEYELAVWKKEDFLQQSRFALCNCRVCNRPFAVQKEIDYAIALLKHNGDSRAENHRESFETCPECKR
QKCLVPSDRIELTRHMKEAI
>P16433 ~~~hycG~~~Formate hydrogenlyase subunit 7~~~COG3260
MSNLLGPRDANGIPVPMTVDESIASMKASLLKKIKRSAYVYRVDCGGCNGCEIEIFATLSPLFDAERFGIKVVPSPRHAD
ILLFTGAVTRAMRSPALRAWQSAPDPKICISYGACGNSGGIFHDLYCVWGGTDKIVPVDVYIPGCPPTPAATLYGFAMAL
GLLEQKIHARGPGELDEQPAEILHGDMVQPLRVKVDREARRLAGYRYGRQIADDYLTQLGQGEEQVARWLEAENDPRLNE
IVSHLNHVVEEARIR
>P0AEV9 3.4.23.51~~~hycI~~~Hydrogenase 3 maturation protease~~~COG0680
MTDVLLCVGNSMMGDDGAGPLLAEKCAAAPKGNWVVIDGGSAPENDIVAIRELRPTRLLIVDATDMGLNPGEIRIIDPDD
IAEMFMMTTHNMPLNYLIDQLKEDIGEVIFLGIQPDIVGFYYPMTQPIKDAVETVYQRLEGWEGNGGFAQLAVEEE
>Q45515 3.5.2.-~~~~~~D-hydantoinase~~~
MTKLIKNGTIVTATDIYEADLLIQDGKIAVIGRNLDESGAEVIDATGCYVFPGGIDPHTHLDMPFGGTVTKDDFESGTIA
AAFGGTTTIIDFCLTNKGEPLKKAIETWHNKATGKAVIDYGFHLMISEITDDVLEELPKVIEEEGITSFKVFMAYKDVFQ
ADDGTLYRTLVAAKELGALVMVHAENGDVIDYLTKKALEDGHTDPIYHALTRPPELEGEATGRACQLTELAGSQLYVVHV
SCAQAVEKIAEARNKGLNVWGETCPQYLVLDQSYLEKPNFEGAKYVWSPPLREKWHQEVLWNALKNGQLQTLGSDQCSFD
FKGQKELGRGDFTKIPNGGPIIEDRVSILFSEGVKKGRITLNQFVDIVSTRIAKLFGLFPKKGTIAVGADADLVIFDPTV
ERVISAETHHMAVDYNPFEGMKVTGEPVSVLCRGEFVVRDKQFVGKPGYGQYVKRAKYGALMADQDVVKMS
>Q9I676 3.5.2.2~~~dht~~~D-hydantoinase/dihydropyrimidinase~~~
MSLLIRGATVVTHEESYRADVLCANGLIQAIGENLETPSGCDVLDGGGQYLMPGGIDPHTHMQLPFMGTVASEDFFSGTA
AGLAGGTTSIIDFVIPNPRQSLLEAFHTWRGWAQKSAADYGFHVAITWWSDEVAREMGELVAQHGVNSFKHFMAYKNAIM
AADDTLVASFERCLELGAVPTVHAENGELVFHLQQKLLAQGLTGPEAHPLSRPPQVEGEAASRAIRIAETLGTPLYLVHI
SSREALDEIAYARAKGQPVYGEVLAGHLLLDDSVYRHPDWATAAGYVMSPPFRPVEHQEALWRGLQSGNLHTTATDHCCF
CAEQKAMGRDDFSKIPNGTAGIEDRMALLWDAGVNSGRLSMHEFVALTSTNTAKIFNLFPRKGAIRVGADADLVLWDPQG
SRTLSAATHHQRVDFNIFEGRTVRGIPSHTISQGKLLWAAGDLRAEPGAGRYVERPAYPSVYEVLGRRAERQRPVAVER
>Q59699 3.5.2.2~~~dht~~~D-hydantoinase/dihydropyrimidinase~~~COG0044
MSLLIRGATVVTHEESYPADVLCVDGLIRAIGPNLEPPTDCEILDGSGQYLMPGGIDPHTHMQLPFMGTVASEDFFSGTA
AGLAGGTTSIIDFVIPNPQQSLLEAFHTWRGWAQKSASDYGFHVAITWWSEQVAEEMGELVAKHGVNSFKHFMAYKNAIM
AADDTLVASFERCLQLGAVPTVHAENGELVYHLQKKLLAQGMTGPEAHPLSRPSQVEGEAASRAIRIAETIGTPLYVVHI
SSREALDEITYARAKGQPVYGEVLPGHLLLDDSVYRDPDWATAAGYVMSPPFRPREHQEALWRGLQSGNLHTTATDHCCF
CAEQKAMGRDDFSRIPNGTAGIEDRMAVLWDAGVNSGRLSMHEFVALTSTNTAKIFNLFPRKGAIRVGADADLVLWDPQG
TRTLSAQTHHQRVDFNIFEGRTVRGVPSHTISQGKVLWADGDLRRRGRGGAVCGTAGVSVGVRGAGATRRTAAPDARSAL
RPLGLLRSPSPASQI
>Q8VTT5 3.5.2.-~~~hyuA~~~D-hydantoinase~~~
MDIIIKNGTIVTADGISRADLGIKDGKITQIGGALGPAERTIDAAGRYVFPGGIDVHTHVETVSFNTQSADTFATATVAA
ACGGTTTIVDFCQQDRGHSLAEAVAKWDGMAGGKSAIDYGYHIIVLDPTDSVIEELEVLPDLGITSFKVFMAYRGMNMID
DVTLLKTLDKAVKTGSLVMVHAENGDAADYLRDKFVAEGKTAPIYHALSRPPRVEAEATARALALAEIVNAPIYIVHVTC
EESLEEVMRAKSRGVRALAETCTHYLYLTKEDLERPDFEGAKYVFTPPARAKKDHDVLWNALRNGVFETVSSDHCSWLFK
GHKDRGRNDFRAIPNGAPGVEERLMMVYQGVNEGRISLTQFVELVATRPAKVFGMFPQKGTIAVGSDADIVLWDPEAEMV
IEQTAMHNAMDYSSYEGHKVKGVPKTVLLRGKVIVDEGSYVGEPTDGKFLKRRKYKQ
>O52683 1.12.1.4~~~hydA~~~Bifurcating [FeFe] hydrogenase alpha subunit~~~COG1905
MKIYVDGREVIINDNERNLLEALKNVGIEIPNLCYLSEASIYGACRMCLVEINGQITTSCTLKPYEGMKVKTNTPEIYEM
RRNILELILATHNRDCTTCDRNGSCKLQKYAEDFGIRKIRFEALKKEHVRDESAPVVRDTSKCILCGDCVRVCEEIQGVG
VIEFAKRGFESVVTTAFDTPLIETECVLCGQCVAYCPTGALSIRNDIDKLIEALESDKIVIGMIAPAVRAAIQEEFGIDE
DVAMAEKLVSFLKTIGFDKVFDVSFGADLVAYEEAHEFYERLKKGERLPQFTSCCPAWVKHAEHTYPQYLQNLSSVKSPQ
QALGTVIKKIYARKLGVPEEKIFLVSFMPCTAKKFEAEREEHEGIVDIVLTTRELAQLIKMSRIDINRVEPQPFDRPYGV
SSQAGLGFGKAGGVFSCVLSVLNEEIGIEKVDVKSPEDGIRVAEVTLKDGTSFKGAVIYGLGKVKKFLEERKDVEIIEVM
ACNYGCVGGGGQPYPNDSRIREHRAKVLRDTMGIKSLLTPVENLFLMKLYEEDLKDEHTRHEILHTTYRPRRRYPEKDVE
ILPVPNGEKRTVKVCLGTSCYTKGSYEILKKLVDYVKENDMEGKIEVLGTFCVENCGASPNVIVDDKIIGGATFEKVLEE
LSKNG
>O52682 1.12.1.4~~~hydB~~~Bifurcating [FeFe] hydrogenase beta subunit~~~COG1894
MFKNAKEFVQYANKLKTLREKKLNGVSIYVCVGTGCTAKGALKVYSAFEEELKKRNLLGQVTLEKIDDDKVTLNRTGCCG
RCSSGPLVKIMPYRFFYSNVAPEDVPEIVDRTVLKGEPIERLFLTDPLTGEKVPRIEDTTLFKNQDFYIMEAIGESECDS
IEDYIARSGYESLVKALTSMTPEEIIETVKASGLRGRGGGGFPTGLKWEFTRKAQGDIKFVVCNGDEGDPGAFMNRTLLE
RDPHLVLEGMIIAGYAVGAQKGYAYIRAEYPFAVKMFKKAIEDARKLGLLGENILGTGFSFDLEVKEGAGAFVCGEETAL
LASIEGKRGMPRPKPPFPAQSGLWGKPTLINNVETYANIPRILRDGVENYRKRGTENSPGTKMFSVAGPLKATGIIEVEF
GTTLRDIIYNICGGFVEGEEFKAVQIGGPSGACLSEDFIDMPLDYDTLKKADAMVGSGGIVVITKKTCMVEVARFFLDFT
KRESCGKCVPCREGTMQAYNILEKFTHGKATYEDLKTLEHLSKTIKTASLCGLGKTAPNPILSTLKLFREEYIAHIEGEC
PSGMCTAFKKYVINPDICKGCGLCARSCPQNAITGERGKPYTIDQEKCVKCGLCASKCPFKAIELV
>O52681 1.12.1.4~~~hydC~~~Bifurcating [FeFe] hydrogenase gamma subunit~~~COG1905
MERHFEKVEEILKKYGYKRENLIKILLEIQEIYRYLPEDVINYVSTAMGIPPAKIYGVATFYAQFSLKPKGKYTIMVCDG
TACHMAGSPEVLKAIEEETGLTPGNVTEDLMFSLDQVGCLGACALAPVMVINGEVYGNLTADKVKEILRKIKEKERESAN
V
>Q9X0Z6 1.8.-.-~~~~~~[FeFe] hydrogenase maturase subunit HydE~~~COG0502
MTGREILEKLERREFTREVLKEALSINDRGFNEALFKLADEIRRKYVGDEVHIRAIIEFSNVCRKNCLYCGLRRDNKNLK
RYRMTPEEIVERARLAVQFGAKTIVLQSGEDPYYMPDVISDIVKEIKKMGVAVTLSLGEWPREYYEKWKEAGADRYLLRH
ETANPVLHRKLRPDTSFENRLNCLLTLKELGYETGAGSMVGLPGQTIDDLVDDLLFLKEHDFDMVGIGPFIPHPDTPLAN
EKKGDFTLTLKMVALTRILLPDSNIPATTAMGTIVPGGREITLRCGANVIMPNWTPSPYRQLYQLYPGKICVFEKDTACI
PCVMKMIELLGRKPGRDWGGRKRVFETV
>P81006 3.5.2.-~~~lhyD~~~L-hydantoinase~~~
MFDVIVKNCRLVSSDGITEADILVKDGKVAAISADTSDVEASRTIDAGGKFVMPGVVDEHVHIIDMDLKNRYGRFELDSE
SAAVGGITTIIEMPITFPPTTTLDAFLEKKKQAGQRLKVDFALYGGGVPGNLPEIRKMHDAGAVGFKSMMAASVPGMFDA
VSDGELFEIFQEIAACGSVIVVHAENETIIQALQKQIKAAGGKDMAAYEASQPVFQENEAIQRALLLQKEAGCRLIVLHV
SNPDGVELIHQAQSEGQDVHCESGPQYLNITTDDAERIGPYMKVAPPVRSAEMNIRLWEQLENGLIDTLGSDHGGHPVED
KEPGWKDVWKAGNGALGLETSLPMMLTNGVNKGRLSLERLVEVMCEKPAKLFGIYPQKGTLQVGSDADLLILDLDIDTKV
DASQFRSLHKYSPFDGMPVTGAPVLTMVRGTVVAEKGEVLVEQGFGQFVTRRNYEASK
>Q9F466 5.1.99.5~~~hyuA~~~Hydantoin racemase~~~
MRILVINPNSSSALTESVADAAQQVVATGTIISAINPSRGPAVIEGSFDEALATFHLIEEVERAERENPPDAYVIACFGD
PGLDAVKELTDRPVVGVAEAAIHMSSFVAATFSIVSILPRVRKHLHELVRQAGATNRLASIKLPNLGVMAFHEDEHAALE
TLKQAAKEAVQEDGAESIVLGCAGMVGFARQLSDELGVPVIDPVEAACRVAESLVALGYQTSKANSYQKPTEKQYL
>Q00924 5.1.99.5~~~hyuE~~~Hydantoin racemase~~~
MKIKVINPNTTLAMTKGIEHAAKSAARSDTQIVAVSPKMGPASIESYYDEYLSIPGVIEEIKKGEEEGVDAFVIACWGDP
GLHAAREVTDKPVVGIAESSVYLASMLAARFSVVTVLPRIKTMLEDLVDSYGMQKRVLNIRTTPMGVLDFERDPEAGIEM
LRQEGKRAVEEDNAEAILLGCAGMAEFADSLEKELGVPVIDGVVAGVKFAETIVDLGKKTSKLKTYKYPEKKEYVGALEN
FGRNQTTTK
>Q6TMG4 5.1.99.5~~~hyuA~~~Hydantoin racemase~~~
MHIHLINPNSTASMTAQALESALLVKHAHTHVSASNPTDTPASIEGGADEAMSVPGMLAEIRQGEAQGVDAYVIACFDDP
GLHAAREVAKGPVIGICQAAVQVAMTISRRFSVITTLPRSVPIIEDLVSDYGAERHCRKVRAIDLPVLALEEDPQRAERL
LLKEIEIAKAEDGAEAIVLGCAGMSSLCDRLQKATGVPVIDGVTAAVKMAEALLGAGYATSKVNTYAYPRIKAAAGHKVC
A
>Q834W6 5.1.1.-~~~~~~Hydrophobic dipeptide epimerase~~~COG4948
MKIKQVHVRASKIKLKETFTIALGTIESADSAIVEIETEEGLVGYGEGGPGIFITGETLAGTLETIELFGQAIIGLNPFN
IEKIHEVMDKISAFAPAAKAAIDIACYDLMGQKAQLPLYQLLGGYDNQVITDITLGIDEPNVMAQKAVEKVKLGFDTLKI
KVGTGIEADIARVKAIREAVGFDIKLRLDANQAWTPKDAVKAIQALADYQIELVEQPVKRRDLEGLKYVTSQVNTTIMAD
ESCFDAQDALELVKKGTVDVINIKLMKCGGIHEALKINQICETAGIECMIGCMAEETTIGITAAAHLAAAQKNITRADLD
ATFGLETAPVTGGVSLEAKPLLELGEAAGLGISH
>O52866 3.3.2.10~~~~~~Soluble epoxide hydrolase~~~
MSTEITHHQAMINGYRMHYVTAGSGYPLVLLHGWPQSWYEWRNVIPALAEQFTVIAPDLRGLGDSEKPMTGFDKRTMATD
VRELVSHLGYDKVGVIGHDWGGSVAFYFAYDNRDLVERLFILDMIPGLIKAGDSFPIPVALMINHIFFHGGNPDWATALI
SKDVNLYLRRFLTTLDYNYSPNVFSEEDIAEYVRVNSLPGSIRSGCQWYATGLREDTENLAKATDKLTIPVIAWGGSHFL
GDIRPAWQEVAENVEGGAVENCGHFVPEEKPQFVIDTALKFFAPLR
>P23481 1.-.-.-~~~hyfA~~~Hydrogenase-4 component A~~~COG1142
MNRFVVAEPLWCTGCNTCLAACSDVHKTQGLQQHPRLALAKTSTITAPVVCHHCEEAPCLQVCPVNAISQRDDAIQLNES
LCIGCKLCAVVCPFGAISASGSRPVNAHAQYVFQAEGSLKDGEENAPTQHALLRWEPGVQTVAVKCDLCDFLPEGPACVR
ACPNQALRLITGDSLQRQMKEKQRLAASWFANGGEDPLSLTQEQR
>P23482 1.-.-.-~~~hyfB~~~Hydrogenase-4 component B~~~COG0651
MDALQLLTWSLILYLFASLASLFLLGLDRLAIKLSGITSLVGGVIGIISGITQLHAGVTLVARFAPPFEFADLTLRMDSL
SAFMVLVISLLVVVCSLYSLTYMREYEGKGAAAMGFFMNIFIASMVALLVMDNAFWFIVLFEMMSLSSWFLVIARQDKTS
INAGMLYFFIAHAGSVLIMIAFLLMGRESGSLDFASFRTLSLSPGLASAVFLLAFFGFGAKAGMMPLHSWLPRAHPAAPS
HASALMSGVMVKIGIFGILKVAMDLLAQTGLPLWWGILVMAIGAISALLGVLYALAEQDIKRLLAWSTVENVGIILLAVG
VAMVGLSLHDPLLTVVGLLGALFHLLNHALFKGLLFLGAGAIISRLHTHDMEKMGALAKRMPWTAAACLIGCLAISAIPP
LNGFISEWYTWQSLFSLSRVEAVALQLAGPIAMVMLAVTGGLAVMCFVKMYGITFCGAPRSTHAEEAQEVPNTMIVAMLL
LAALCVLIALSASWLAPKIMHIAHAFTNTPPATVASGIALVPGTFHTQVTPSLLLLLLLAMPLLPGLYWLWCRSRRAAFR
RTGDAWACGYGWENAMAPSGNGVMQPLRVVFSALFRLRQQLDPTLRLNKGLAHVTARAQSTEPFWDERVIRPIVSATQRL
AKEIQHLQSGDFRLYCLYVVAALVVLLIAIAV
>P77858 1.-.-.-~~~hyfC~~~Hydrogenase-4 component C~~~COG0650
MRQTLCDGYLVIFALAQAVILLMLTPLFTGISRQIRARMHSRRGPGIWQDYRDIHKLFKRQEVAPTSSGLMFRLMPWVLI
SSMLVLAMALPLFITVSPFAGGGDLITLIYLLALFRFFFALSGLDTGSPFAGVGASRELTLGILVEPMLILSLLVLALIA
GSTHIEMISNTLAMGWNSPLTTVLALLACGFACFIEMGKIPFDVAEAEQELQEGPLTEYSGAGLALAKWGLGLKQVVMAS
LFVALFLPFGRAQELSLACLLTSLVVTLLKVLLIFVLASIAENTLARGRFLLIHHVTWLGFSLAALAWVFWLTGL
>P0AEW1 1.-.-.-~~~hyfE~~~Hydrogenase-4 component E~~~COG4237
MTGSMIVNNLAGLMMLTSLFVISVKSYRLSCGFYACQSLVLVSIFATLSCLFAAEQLLIWSASAFITKVLLVPLIMTYAA
RNIPQNIPEKALFGPAMMALLAALIVLLCAFVVQPVKLPMATGLKPALAVALGHFLLGLLCIVSQRNILRQIFGYCLMEN
GSHLVLALLAWRAPELVEIGIATDAIFAVIVMVLLARKIWRTHGTLDVNNLTALKG
>P77329 1.-.-.-~~~hyfG~~~Hydrogenase-4 component G~~~COG3261
MNVNSSSNRGEAILAALKTQFPGAVLDEERQTPEQVTITVKINLLPDVVQYLYYQHDGWLPVLFGNDERTLNGHYAVYYA
LSMEGAEKCWIVVKALVDADSREFPSVTPRVPAAVWGEREIRDMYGLIPVGLPDQRRLVLPDDWPEDMHPLRKDAMDYRL
RPEPTTDSETYPFINEGNSDARVIPVGPLHITSDEPGHFRLFVDGEQIVDADYRLFYVHRGMEKLAETRMGYNEVTFLSD
RVCGICGFAHSVAYTNSVENALGIEVPQRAHTIRSILLEVERLHSHLLNLGLSCHFVGFDTGFMQFFRVREKSMTMAELL
IGSRKTYGLNLIGGVRRDILKEQRLQTLKLVREMRADVSELVEMLLATPNMEQRTQGIGILDRQIARDLRFDHPYADYGN
IPKTLFTFTGGDVFSRVMVRVKETFDSLAMLEFALDNMPDTPLLTEGFSYKPHAFALGFVEAPRGEDVHWSMLGDNQKLF
RWRCRAATYANWPVLRYMLRGNTVSDAPLIIGSLDPCYSCTDRVTLVDVRKRQSKTVPYKEIERYGIDRNRSPLK
>P71229 ~~~hyfR~~~DNA-binding transcriptional activator HyfR~~~COG3604
MAMSDEAMFAPPQGITIEAVNGMLAERLAQKHGKASLLRAFIPLPPPFSPVQLIELHVLKSNFYYRYHDDGSDVTATTEY
QGEMVDYSRHAVLLGSSGMAELRFIRTHGSRFTSQDCTLFNWLARIITPVLQSWLNDEEQQVALRLLEKDRDHHRVLVDI
TNAVLSHLDLDDLIADVAREIHHFFGLASVSMVLGDHRKNEKFSLWCSDLSASHCACLPRCMPGESVLLTQTLQTRQPTL
THRADDLFLWQRDPLLLLLASNGCESALLIPLTFGNHTPGALLLAHTSSTLFSEENCQLLQHIADRIAIAVGNADAWRSM
TDLQESLQQENHQLSEQLLSNLGIGDIIYQSQAMEDLLQQVDIVAKSDSTVLICGETGTGKEVIARAIHQLSPRRDKPLV
KINCAAIPASLLESELFGHDKGAFTGAINTHRGRFEIADGGTLFLDEIGDLPLELQPKLLRVLQEREIERLGGSRTIPVN
VRVIAATNRDLWQMVEDRQFRSDLFYRLNVFPLELPPLRDRPEDIPLLAKHFTQKMARHMNRAIDAIPTEALRQLMSWDW
PGNVRELENVIERAVLLTRGNSLNLHLNVRQSRLLPTLNEDSALRSSMAQLLHPTTPENDEEERQRIVQVLRETNGIVAG
PRGAATRLGMKRTTLLSRMQRLGISVREVL
>O30478 4.1.3.45~~~~~~3-hydroxybenzoate synthase~~~
MNPSSLVLNGLTSYFENGRARVVPPVGRNILGVVNYASVCEYPTLDHGYPELEINMVAPTAEPFAEVWVTDAESEHGERD
GITYAHDGEYFFCAGRVPPTGRYTEATRAAYVTMFELLEEFGYSSVFRMWNFIGDINRDNAEGMEVYRDFCRGRAEAFEQ
CRLEFDQFPAATGIGSRGGGIAFYLLACRSGGHVHIENPRQVPAYHYPKRYGPRAPRFARATYLPSRAADGVGGQVFVSG
TASVLGHETAHEGDLVKQCRLALENIELVISGGNLAAHGISAGHGLTALRNIKVYVRRSEDVPAVREICREAFSPDADIV
YLTVDVCRSDLLVEIEGVVM
>P30147 5.3.1.22~~~hyi~~~Hydroxypyruvate isomerase~~~COG3622
MLRFSANLSMLFGEYDFLARFEKAAQCGFRGVEFMFPYDYDIEELKHVLASNKLEHTLHNLPAGDWAAGERGIACIPGRE
EEFRDGVAAAIRYARALGNKKINCLVGKTPAGFSSEQIHATLVENLRYAANMLMKEDILLLIEPINHFDIPGFHLTGTRQ
ALKLIDDVGCCNLKIQYDIYHMQRMEGELTNTMTQWADKIGHLQIADNPHRGEPGTGEINYDYLFKVIENSDYNGWVGCE
YKPQTTTEAGLRWMDPYR
>P0A700 ~~~hypA~~~Hydrogenase maturation factor HypA~~~COG0375
MHEITLCQRALELIEQQAAKHGAKRVTGVWLKIGAFSCVETSSLAFCFDLVCRGSVAEGCKLHLEEQEAECWCETCQQYV
TLLTQRVRRCPQCHGDMLQIVADDGLQIRRIEIDQE
>P0A0U5 ~~~hypA~~~Hydrogenase maturation factor HypA~~~COG0375
MHEYSVVSSLIALCEEHAKKNQAHKIERVVVGIGERSAMDKSLFVSAFETFREESLVCKDAILDIVDEKVELECKDCSHV
FKPNALDYGVCEKCHSKNVIITQGNEMRLLSLEMLAE
>P0A0U4 ~~~hypA~~~Hydrogenase/urease maturation factor HypA~~~COG0375
MHEYSVVSSLIALCEEHAKKNQAHKIERVVVGIGERSAMDKSLFVSAFETFREESLVCKDAILDIVDEKVELECKDCSHV
FKPNALDYGVCEKCHSKNVIITQGNEMRLLSLEMLAE
>P0AAN3 ~~~hypB~~~Hydrogenase maturation factor HypB~~~COG0378
MCTTCGCGEGNLYIEGDEHNPHSAFRSAPFAPAARPKMKITGIKAPEFTPSQTEEGDLHYGHGEAGTHAPGMSQRRMLEV
EIDVLDKNNRLAERNRARFAARKQLVLNLVSSPGSGKTTLLTETLMRLKDSVPCAVIEGDQQTVNDAARIRATGTPAIQV
NTGKGCHLDAQMIADAAPRLPLDDNGILFIENVGNLVCPASFDLGEKHKVAVLSVTEGEDKPLKYPHMFAAASLMLLNKV
DLLPYLNFDVEKCIACAREVNPEIEIILISATSGEGMDQWLNWLETQRCA
>O25560 ~~~hypB~~~Hydrogenase/urease maturation factor HypB~~~COG0378
MSEQRQESLQNNPNLSKKDVKIVEKILSKNDIKAAEMKERYLKEGLYVLNFMSSPGSGKTTMLENLADFKDFKFCVVEGD
LQTNRDADRLRKKGVSAHQITTGEACHLEASMIEGAFDLLKDEGALEKSDFLIIENVGNLVCPSSYNLGAAMNIVLLSVP
EGDDKVLKYPTMFMCADAVIISKADMVEVFNFRVSQVKEDMQKLKPEAPIFLMSSKDPKSLEDFKNFLLEKKRENYQSTH
SF
>P28155 ~~~hypB~~~Hydrogenase maturation factor HypB~~~
MCTVCGCGTSAIEGHTHEVGDDGHGHHHHDGHHDHDHDHDHHRGDHEHDDHHHAEDGSVHYSKGIAGVHVPGMSQERIIQ
VEKDILSKNDAYAAENRRHFERQGVFALNFVSSPGSGKTSLLVRTIKDLKDRLSISVIEGDQQTSNDAARIRETGARAIQ
INTGKGCHLDAHMVGHAVEDLAPEPGSALFIENVGNLVCPAAFDLGEAHKVVVLSVTEGEDKPLKYPDMFAAADLMILNK
ADLLPHLDFNTGFCIANALRVNPRLQTLTVSARTGEGMEAFYAWLEVSAARRAIRSKVA
>P0AAM3 ~~~hypC~~~Hydrogenase maturation factor HypC~~~COG0298
MCIGVPGQIRTIDGNQAKVDVCGIQRDVDLTLVGSCDENGQPRVGQWVLVHVGFAMSVINEAEARDTLDALQNMFDVEPD
VGALLYGEEK
>A0A031WDE4 4.2.1.172~~~pflD~~~Trans-4-hydroxy-L-proline dehydratase~~~
MARGTFERTKKLREESINAEPHISIERAVLMTEAYKKYEGSVEIPVLRALSFKHYIENRTLSINDGELIVGEKGDSPNGA
PTYPEICCHTMEDLEVMHNRDIINFSVSEEARKIHKEEIIPFWKKRQTRDKIINAMTPEWLAAYEAGMFTEFMEQRAPGH
TVCGDTIYKKGFLDLKKDIEARLKELDFLNDLDAYNKKADLEAMAIACDAMVILGKRYAEKARQMAEEETDEAKKKDLLL
IAETCDVVPAHKPETYHQAIQMYWFVHIGVTTELNIWDAFTPGRLDQHLNPFYERDVENGILDRDRAQELLECLWVKFNN
QPAPPKVGITLKESSTYTDFANINTGGINPDGQDGVNEVSYIILDVMDEMKLIQPSSNVQISKKTPQKFLKRACEISRKG
WGQPAFYNTEAIVQELMEAGKTIEDARLGGTSGCVETGCFGKEAYVLTGYMNIPKILELTLNNGYDPISKKQIGIETGDP
RNFQSYEELFEAFKKQLHYMIDIKIEGNAVIENICAKHMPCPLMSTIVDDCIEKGKDYQRGGARYNTRYIQGVGIGTITD
SLTAIKYNVFDKKKFDMDTLLKALDANFEGYEAILNLVSNKTPKYGNDDDYADEIMQEIFNAYYNEVTGRPTVCGGEYRV
DMLPTTCHIYFGEIMGASPNGRLCAKPVSEGISPEKGGDTNGPTAVIKSCAKMDHIKTGGTLLNQRFAPSVVQGEKGLDN
MANLVRAYFNMDGHHIQFNVFDKNVLLEAQKNPQDYKDLIVRVAGYSDHFNNLSRTLQDEIIGRTEQTF
>P24192 ~~~hypD~~~Hydrogenase maturation factor HypD~~~COG0409
MRFVDEYRAPEQVMQLIEHLRERASHLSYTAERPLRIMEVCGGHTHAIFKFGLDQLLPENVEFIHGPGCPVCVLPMGRID
TCVEIASHPEVIFCTFGDAMRVPGKQGSLLQAKARGADVRIVYSPMDALKLAQENPTRKVVFFGLGFETTMPTTAITLQQ
AKARDVQNFYFFCQHITLIPTLRSLLEQPDNGIDAFLAPGHVSMVIGTDAYNFIASDFHRPLVVAGFEPLDLLQGVVMLV
QQKIAAHSKVENQYRRVVPDAGNLLAQQAIADVFCVNGDSEWRGLGVIESSGVHLTPDYQRFDAEAHFRPAPQQVCDDPR
ARCGEVLTGKCKPHQCPLFGNTCNPQTAFGALMVSSEGACAAWYQYRQQESEA
>P26411 ~~~hypD~~~Hydrogenase maturation factor HypD~~~
MKFASEFRDPALAKGLLAEIARLADQIGATAEKPVHIMEICGGHTHSIFRYGLDKLIHPGIEFIHGPGCPVCVLPRARVD
ECIEIAGRPEVIFCTFGDAMRVPGSKLSLMQAKAAGADIRMVYSPLDALELARRNPGREVVFFGLGFETTTPSTALAIQQ
AAREGLANFSVFCNHITVPEPIRALLDDPYMRLDGFIGPGHVSMVIGIHPYDFIAEDYGKPLVVAGFEPTDLLQSVLMVL
RQISQGRAAIENQYARVVPEHGNRVSLAAIADVYERRPSFEWRGLGEIDASGLRIRAAYRAHDAEEKFGVGYAGQRAAVE
EAEGCACGAVMTGRMKPVACAQFGKGCTPEMPLGALMVSSEGACAAYWQYGGARAAE
>P31905 4.2.1.-~~~hypE~~~Carbamoyl dehydratase HypE~~~COG0309
MSGTVKLGYQRPLNIKSGRIDMGHGAGGRAAAQLIQELFVAAFDNEWLRQGNDQAAFAMPAGARMVMATDAHVVSPLFFP
GGDIGSLSVHGTINDVAMAGAKPLYLAASFILEEGFPLADLKRIVESMAGAAREAGVPIVTGDTKVVEQGKGDGVFITTT
GVGVVPAGILIDGAGARPGDAILLSGTMGEHGVAILSKRESLEFDTEIRSDSAALHDLVAQMLAVVPGVRVLRDPTRGGL
ATTLNEISSQSGVGMVLDEAAIPVLPQVDAACELLGLDPLYVANEGKLVAICAAADADALLAAMRGHPLGREARRIGEVI
EDGRHFVQMRTKFGGMRVVDWLSGEQLPRIC
>P24193 4.2.1.-~~~hypE~~~Carbamoyl dehydratase HypE~~~COG0309
MNNIQLAHGSGGQAMQQLINSLFMEAFANPWLAEQEDQARLDLAQLVAEGDRLAFSTDSYVIDPLFFPGGNIGKLAICGT
ANDVAVSGAIPRYLSCGFILEEGLPMETLKAVVTSMAETARAAGIAIVTGDTKVVQRGAVDKLFINTAGMGAIPANIHWG
AQTLTAGDVLLVSGTLGDHGATILNLREQLGLDGELVSDCAVLTPLIQTLRDIPGVKALRDATRGGVNAVVHEFAAACGC
GIELSEAALPVKPAVRGVCELLGLDALNFANEGKLVIAVERNAAEQVLAALHSHPLGKDAALIGEVVERKGVRLAGLYGV
KRTLDLPHAEPLPRIC
>P30131 6.2.-.-~~~hypF~~~Carbamoyltransferase HypF~~~COG0068
MAKNTSCGVQLRIRGKVQGVGFRPFVWQLAQQLNLHGDVCNDGDGVEVRLREDPETFLVQLYQHCPPLARIDSVEREPFI
WSQLPTEFTIRQSTGGTMNTQIVPDAATCPACLAEMNTPGERRYRYPFINCTHCGPRFTIIRAMPYDRPFTVMAAFPLCP
ACDKEYRDPLDRRFHAQPVACPECGPHLEWVSHGEHAEQEAALQAAIAQLKMGKIVAIKGIGGFHLACDARNSNAVATLR
ARKHRPAKPLAVMLPVADGLPDAARQLLTTPAAPIVLVDKKYVPELCDDIAPDLNEVGVMLPANPLQHLLLQELQCPLVM
TSGNLSGKPPAISNEQALADLQGIADGFLIHNRDIVQRMDDSVVRESGEMLRRSRGYVPDALALPPGFKNVPPVLCLGAD
LKNTFCLVRGEQAVLSQHLGDLSDDGIQMQWREALRLMQNIYDFTPQYVVHDAHPGYVSSQWAREMNLPTQTVLHHHAHA
AACLAEHQWPLDGGDVIALTLDGIGMGENGALWGGECLRVNYRECEHLGGLPAVALPGGDLAAKQPWRNLLAQCLRFVPE
WQNYSETASVQQQNWSVLARAIERGINAPLASSCGRFFDAVAAALGCAPATLSYEGEAACALEALAASCHGVTHPVTMPR
VDNQLDLATFWQQWLNWQAPVNQRAWAFHDALAQGFAALMREQATMRGITTLVFSGGVIHNRLLRARLAHYLADFTLLFP
QSLPAGDGGLSLGQGVIAAARWLAGEVQNG
>Q02987 6.2.-.-~~~hypF~~~Carbamoyltransferase HypF~~~
MQAWRIRVRGQVQGVGFRPFVWQLARARGLRGVVLNDAEGVLIRVAGDLGDFAAALRDQAPPLARVDAVEVTAAVCDDLP
EGFQIAASGAAGAETRVTPDAATCPDCLAEIRGEGRRRGYAFTNCTHCGPRFSILQSLPYDRARTTMAPFAMCPACRAEY
EDPADRRFHAQPIACPDCGPRLWLEAGGAELPGDAIGLAAARLKAGEILAVKGLGGFHLACDATNADAVDLLRARKRRPA
KPFALMAREEDLARIVAVSPAALAALRDPAAPIVLMPARGSLPETLAPGMAELGVMLPYTPLHHLLLDAFGGVLVMTSGN
LSGAPQVIGNDEAREKLSAFADAFLMHDRAIARRLDDSVVRVDPPMVLRRARGQVPGTLPLPPGFETAPQIVAYGGQMKA
ALCLIKTGQALLGHHLGELDEALTWEAFLQADADYAALFDHRPQAVAVDLHPDFRASRHGAARAGRLGVPLIAVQHHHAH
LAACLGENLWPKDGGKVAVIVLDGLGLGPDGTVWGGELLLGDYKGFERVAWLKPAPLIGGDRAQIEPWRNALVRLDAAGL
SDLADRLFPAAPRDLARQLAAKGINAPLSSSAGRLFDAVAACLGICPMRQSYEGEAAMRLESLAADTGPVPDLPCVGGAI
DPAPLFQLLAAGERPDRVAHALHASLAQAFAAEARRLIEAGQAEAVALTGGCFQNSRLATMTRNFLADQGILTQGRIPAN
DGGLALGQALVAAAKLESN
>P0CZ00 4.2.2.1~~~~~~Hyaluronate lyase~~~
MFGTPSRRTFLTASALSAMALAASPTVTDAIAAPGPDSWSALCERWIDIITGRRAARTSDPRARAIIAKTDRKVAEILTD
LVSGSSRQTVLISADLRKEQSPFITKTARAIESMACGWATPGSSYHKDPEILSACIEGLRDFCRLRYNPSQDEYGNWWDW
EDGASRAVADVMCILHDVLPPEVMSAAAAGIDHFIPDPWFQQPGSVKPTANPVQPVVSTGANRMDLTRAVMCRSIATGDE
KRLRHAVDGLPDAWRVTTEGDGFRADGGFIQHSHIPYTGGYGDVLFSGLAMLFPLVSGMRFDIDESARKAFHDQVERGFI
PVMYNGQILDDVRGRSISRINESAAMHGISIARAMLMMADALPTHRAEQWRGIVHGWMARNTFDHLSEPSTLVDISLFDA
AAKAPRPGVVDAELLRVHGPSRPATADWLITVSNCSDRIAWYEYGNGENEWAYRTSQGMRYLLLPGDMGQYEDGYWATVD
YSAPTGTTVDSTPLKRAVGASWAAKTPTNEWSGGLASGSWSAAASHITSQDSALKARRLWVGLKDAMVELTTDVTTDASR
AITVVEHRKVASSSTKLLVDGNRVSSATSFQNPRWAHLDGVGGYVFATDTDLSADVATRKGTWIDVNPSRKVKGADEVIE
RAYASLHGHPPRSSSPWALLPTASRSHTMALATRPGVEPFTVLRNDGNRPGRASAGALLTKDPTVVTTLAFWKPATCGGV
AVNRPALVQTRESANQMEVVIVEPTQKRGSLTVTIEGSWKVKTADSHVDVSCENAAGTLHVDTAGLGGQSVRVTLARQVT
QTPSGGGRHDRA
>Q53591 4.2.2.1~~~hylB~~~Hyaluronate lyase~~~COG5492
MKQVVDNQTQNKELVKNGDFNQTNPVSGSWSHTSAREWSAWIDKENTADKSPIIQRTEQGQVSLSSDKGFRGAVTQKVNI
DPTKKYEVKFDIETSNKAGQAFLRIMEKKDNNTRLWLSEMTSGTTNKHTLTKIYNPKLNVSEVTLELYYEKGTGSATFDN
ISMKAKGPKDSEHPQPVTTQIEESVNTALNKNYVFNKADYQYTLTNPSLGKIVGGILYPNATGSTTVKISDKSGKIIKEV
PLSVTASTEDKFTKLLDKWNDVTIGNHVYDTNDSNMQKINQKLDETNAKNIKTIKLDSNHTFLWKDLDNLNNSAQLTATY
RRLEDLAKQITNPHSTIYKNEKAIRTVKESLAWLHQNFYNVNKDIEGSANWWDFEIGVPRSITATLALMNNYFTDAEIKT
YTDPIEHFVPDAGYFRKTLDNPFKALGGNLVDMGRVKIIEGLLRKDNTIIEKTSHSLKNLFTTATKAEGFYADGSYIDHT
NVAYTGAYGNVLIDGLTQLLPIIQETDYKISNQELDMVYKWINQSFLPLIVKGELMDMSRGRSISREAASSHAAAVEVLR
GFLRLANMSNEERNLDLKSTIKTIITSNKFYNVFNNLKSYSDIANMNKMLNDSTVATKPLKSNLSTFNSMDRLAYYNAEK
DFGFALSLHSKRTLNYEGMNDENTRDWYTGDGMFYLYNSDQSHYSNHFWPTVNPYKMAGTTEKDAKREDTTKEFMSKHSK
DAKEKTGQVTGTSDFVGSVKLNDHFALAAMDFTNWDRTLTAQKGWVILNDKIVFLGSNIKNTNGIGNVSTTIDQRKDDSK
TPYTTYVNGKTIDLKQASSQQFTDTKSVFLESKEPGRNIGYIFFKNSTIDIERKEQTGTWNSINRTSKNTSIVSNPFITI
SQKHDNKGDSYGYMMVPNIDRTSFDKLANSKEVELLENSSKQQVIYDKNSQTWAVIKHDNQESLINNQFKMNKAGLYLVQ
KVGNDYQNVYYQPQTMTKTDQLAI
>Q54873 4.2.2.1~~~~~~Hyaluronate lyase~~~COG5492
MQTKTKKLIVSLSSLVLSGFLLNHYMTIGAEETTTNTIQQSQKEVQYQQRDTKNLVENGDFGQTEDGSSPWTGSKAQGWS
AWVDQKNSADASTRVIEAKDGAITISSHEKLRAALHRMVPIEAKKKYKLRFKIKTDNKIGIAKVRIIEESGKDKRLWNSA
TTSGTKDWQTIEADYSPTLDVDKIKLELFYETGTGTVSFKDIELVEVADQLSEDSQTDKQLEEKIDLPIGKKHVFSLADY
TYKVENPDVASVKNGILEPLKEGTTNVIVSKDGKEVKKIPLKILASVKDAYTDRLDDWNGIIAGNQYYDSKNEQMAKLNQ
ELEGKVADSLSSISSQADRTYLWEKFSNYKTSANLTATYRKLEEMAKQVTNPSSRYYQDETVVRTVRDSMEWMHKHVYNS
EKSIVGNWWDYEIGTPRAINNTLSLMKEYFSDEEIKKYTDVIEKFVPDPEHFRKTTDNPFKALGGNLVDMGRVKVIAGLL
RKDDQEISSTIRSIEQVFKLVDQGEGFYQDGSYIDHTNVAYTGAYGNVLIDGLSQLLPVIQKTKNPIDKDKMQTMYHWID
KSFAPLLVNGELMDMSRGRSISRANSEGHVAAVEVLRGIHRIADMSEGETKQCLQSLVKTIVQSDSYYDVFKNLKTYKDI
SLMQSLLSDAGVASVPRPSYLSAFNKMDKTAMYNAEKGFGFGLSLFSSRTLNYEHMNKENKRGWYTSDGMFYLYNGDLSH
YSDGYWPTVNPYKMPGTTETDAKRADSDTGKVLPSAFVGTSKLDDANATATMDFTNWNQTLTAHKSWFMLKDKIAFLGSN
IQNTSTDTAATTIDQRKLESGNPYKVYVNDKEASLTEQEKDYPETQSVFLESFDSKKNIGYFFFKKSSISMSKALQKGAW
KDINEGQSDKEVENEFLTISQAHKQNRDSYGYMLIPNVDRATFNQMIKELESSLIENNETLQSVYDAKQGVWGIVKYDDS
VSTISNQFQVLKRGVYTIRKEGDEYKIAYYNPETQESAPDQEVFKKLEQAAQPQVQNSKEKEKSEEEKNHSDQKNLPQTG
EGQSILASLGFLLLGAFYLFRRGKNN
>A0A3D9VCI6 4.2.2.1~~~tchly8B~~~Hyaluronate lyase~~~
MSWNRRSFLGALGVTCLAGAGMVPIVRPRTAAAADEFDLLRERWCSLVTGSGYDPDVEPFKSRLAALGAEAEQYLTTLAP
GETSLWPDLPLDTSTWNMTLSARRLRTMAVAYLVPGTGHTGNSAMAEAAVTAFDELTTRFYAPPHWWGNWWDWLIGTPQA
LNDFCALLYEQLGPELIDRYVQRVDHYVDPGAIDRTTGANRGWLCEVTAVRGVLGKSPEMMAKARDGLSPIMVYVTDGDG
FYRDGSFIQHEYYAYTGSYGISLLQSVSGLFALLAGSTWEIVDPNRQVLFDSIENSFAPFVYNGLLMDAVAGRVISREAE
HDHWRGHLLAASVLRMAEAGSPEEAKRWRGIVKGWLLRESEPRYMGDQTLTMAAVADAQAVLDDPTIEPLPEPVEHRIFA
AMDQAVHRRPTWAFSISMRSVRTAFYETINGENLKGWHTGVGMTYWWGADFGNDHYTDGFWPTADPYRLPGTTVSRKPLE
DGVGNNVLPTEAWAGGTTDGEFAAVGQSIQALESTLRGRKSWFCLDDAVVCLGAGITCADGYAVDTTVDQRNLGENGVHD
FRLNGIPSPTSGTWSLTVPNARWAHLEGFGGYVFPGGARVSAIRETRTGSWYDINVGGPRDELRRRYVTVYLDHGVDPVD
ASYVYLVMPGATRQETIRRAADRRWLRVLANTADRQAISVPSLGFVGANFFAPGTVDALTVDQPCSVLVRVADGRATICV
SDPRQDGSTVRVTWNRPVASVVSSDPTVRVVEAGERLVLDVTVEETAGMTQRAVVALA
>Q9F464 3.5.1.87~~~hyuC~~~N-carbamoyl-L-amino-acid hydrolase~~~
MTLQKAQAERIEKEIRELSRFSAEGPGVTRLTYTPEHAAARETLIAAMKAAALSVREDALGNIIGRREGTDPELPAIAVG
SHFDSVRNGGMFDGTAGVVCALEAARVMLENGYVNRHPFEFIAIVEEEGARFSSGMLGGRAIAGLVADRELDSLVDEDGV
SVRQAATAFGLKPGELQAAARSAADLRAFIELHIEQGPILEQEQIEIGVVTSIVGVRALRVAVKGRSDHAGTTPMHLRQD
ALVPAALMVREVNRFVNEIADGTVATVGHLTVAPGGGNQVPGEVDFTLDLRSPHEESLRVLIDRISVMVGEVASQAGVAA
DVDEFFNLSPVQLAPTMVDAVREAASALQFTHRDISSGAGHDSMFIAQVTDVGMVFVPSRAGRSHVPEEWTDFDDLRKGT
EVVLRVMKALDR
>Q01264 3.5.1.87~~~hyuC~~~N-carbamoyl-L-amino-acid hydrolase~~~
MKTVTISKERLRIHIEQLGEIGKTKDKGVQRLALSKEDREATLLVSEWMREAGLTVTHDHFGNLIGRKEGETPSLPSVMI
GSHIDSVRNGGKFDGVIGVLAGIEIVHAISEANVVHEHSIEVVAFCEEEGSRFNDGLFGSRGMVGKVKPEDLQKVDDNNV
TRYEALKTFGFGIDPDFTHQSIREIGDIKHYFEMHIEQGPYLEKNNYPIGIVSGIAGPSWFKVRLVGEAGHAGTVPMSLR
KDPLVGAAEVIKEVETLCMNDPNAPTVGTVGRIAAFPGGSNIIPESVEFTLDIRDIELERRNKIIEKIEEKIKLVSNTRG
LEYQIEKNMAAVPVKCSENLINSLKQSCKELEIDAPIIVSGAGHDAMFLAEITEIGMVFVRCRNGISHSPKEWAEIDDIL
TGTKVLYESIIKHI
>Q6DTN4 3.5.1.87~~~hyuC~~~N-carbamoyl-L-amino-acid hydrolase~~~
MAAPGENRRVNADRLWDSLMEMAKIGPGVAGGNNRQTLTDADGEGRRLFQSWCEEAGLSMGVDKMGTMFLTRPGTDPDAL
PVHIGSHLDTQPTGGKFDGVLGVLSGLEAVRTMNDLGIKTKHPIVVTNWTNEEGARFAPAMLASGVFAGVHTLEYAYARK
DPEGKSFGDELKRIGWLGDEEVGARKMHAYFEYHIEQGPILEAENKQIGVVTHCQGLWWLEFTLTGREAHTGSTPMDMRV
NAGLAMARILEMVQTVAMENQPGAVGGVGQMFFSPNSRNVLPGKVVFTVDIRSPDQAKLDGMRARIEAEAPKICERLGVG
CSIEAVGHFDPVTFDPKLVETVRGAAEKLGYSHMNLVSGAGHDACWAAKVAPTTMIMCPCVGGLSHNEAEDISREWAAAG
ADVLFHAVLETAEIVE
>D6R8X8 ~~~hyuP~~~Hydantoin permease~~~
MNSTPIEEARSLLNPSNAPTRYAERSVGPFSLAAIWFAMAIQVAIFIAAGQMTSSFQVWQVIVAIAAGCTIAVILLFFTQ
SAAIRWGINFTVAARMPFGIRGSLIPITLKALLSLFWFGFQTWLGALALDEITRLLTGFTNLPLWIVIFGAIQVVTTFYG
ITFIRWMNVFASPVLLAMGVYMVYLMLDGADVSLGEVMSMGGENPGMPFSTAIMIFVGGWIAVVVSIHDIVKECKVDPNA
SREGQTKADARYATAQWLGMVPASIIFGFIGAASMVLVGEWNPVIAITEVVGGVSIPMAILFQVFVLLATWSTNPAANLL
SPAYTLCSTFPRVFTFKTGVIVSAVVGLLMMPWQFAGVLNTFLNLLASALGPLAGIMISDYFLVRRRRISLHDLYRTKGI
YTYWRGVNWVALAVYAVALAVSFLTPDLMFVTGLIAALLLHIPAMRWVAKTFPLFSEAESRNEDYLRPIGPVAPADESAT
ANTKEQNQR
>Q1Q0T2 1.7.2.7~~~~~~Hydrazine synthase subunit alpha~~~
MGKRKLGVIASAFVAGALVCGSTLVNAEPVMTGGPVQGKALWTDYSGMSKEVQGPVSQILFTQSPRTAKGDPYQNYPHYI
PEGSRIVLFDLNTKELKVLTNDFATAFDPCTYWDGKKFAFAGVHKKGGGCQIWEMNIDGSGLRQMTDLKGTCRSPIYYAA
GSIEEGEGRIIWRDRYFEGDWKEHGMVEKTGMIIFSGSPEGVMDEFHNPYAYNLYRLDTQGGKIIQRITGHVLSGIEFPH
LNTTIDQITYNLSSNFDPWLTPDGNILFSSVQANGSRAGGEGRVMICVDNWDGAYPRPIYGNCDGEIGGTSGRSQAKITF
GDRKIVYVESPYMNWGVGQLAAVSWDAPFNKTYEKLTGKDGGLYRSPYPLPDDRMLVSYAERGDFGIYWFNFSKCAAGDK
VYDDPNWNDHQPAPVYVKYKPRWINTFTAGKNFGVTVVTYQPFDQVKVEGYPHSWGTWICFDTTLSDQPVGPYPHQKAKN
VSHGDIKAVRIIQGYQCVEPDSTRFRVGAGAHLLGGERSSSNSGTAFQQRGIIGYQYVESDGSTVTSQLSDVPYYMQILD
DKGMSVQTALTWAYLRPYHGRICSGCHYGSYRGRAFKNIHAKALYNWWYDDRSHYDSPFAFRYLKFDNDGNYKGVKHGED
VVVPSDIYYGGPSGTTSQPVEGLTLDKQRTVDFRRDIQPILDAKCAMCHDSNNPPNLGGGLELVSVDGIAAYSRAYNSLL
EPQRGKDPNIGGKYVNPSAAINSLLVWRLYEAELSANAPREKIFPIEGRLLHNKFLTQDERYAIVEWIDLGAQWDNIPGP
DFYPGYLVK
>Q1Q0T4 ~~~~~~Hydrazine synthase subunit beta~~~
MVIRRKMNKMIRKGMIGAVMLGAAVAISGGVATAGYIQGTHVKTDLPGPFHITMSPDGSTLFISNQSGHSVTFVDARTQK
VTGEVAVRVQPEASAVTPDGAFLYVCNAESDSVSVVDIQRKQEIKEIKVGDWPSGIKISPDGKTAYVACSGCMWNAIDVI
DTGRMEKVRSIYTSDYGPRMVEISPDGKTLVAILDTVGSINRSVDFIDIASGRVVENRVIHESSNLRDVVYTPDGKYIAV
THQTPKNWLPVCEAENGQVFTNNVTIIETKAGGKVARLPLDDLNNYDGNPYGMAMDPKGKYLYIGVRGMHRVTILDMDKV
LGLVRSSTQEELDYLRDDLGLVRDYLVARVPTGLGPSSVCLSPDGKFCYAANYFSNNVTVIRTAVD
>Q1Q0T3 1.7.2.7~~~~~~Hydrazine synthase subunit gamma~~~
MAREMRLGGKERMKTGVVKIGLVAALGVVGLISAGGVYAGQPRVISTIQTGATWEPLGREEPLTVPEVHFRVKHSPFKSE
LVRYGQFQFNDAAWSLQGSYSCASCHYERGQTTGLIWDLGDEGWGSWKNTKYIRGGRYLPPFRHEGFTGHPDEIVGATSS
LDRVCGRDPGFVFRSENFSPMRLEALICYIRALEFTGSPFRNADGSLTEAQKRGQKIFEDPKVGCLECHPGDPMDPRALF
SDAQTHDVGTGRVGVNGFRSTPGKVFNISALEAGEDPYGVESNTPIIGLDLVKEFDTPTLRDIYASGTYFHDGGARTLMD
TINNTVNDKDMHGRTSHLKQQELQDLVEYLKAL
>P01093 ~~~~~~Alpha-amylase inhibitor Haim-1~~~
DAGNRIAAPACVHFTADWRYTFVTNDCSIDYSVTVAYGDGTDVPCRSANPGDILTFPGYGTRGNEVLGAVLCATDGSA
>P09921 ~~~~~~Alpha-amylase inhibitor Paim-1~~~
ASEPAPACVVMYESWRYTTAANNCADTVSVSVAYQDGATGPCATLPPGAVTTVGEGYLGEHGHPDHLALCPSS
>P20078 ~~~~~~Alpha-amylase inhibitor Haim-2~~~
MKRYVCSTFVACVMVLCVIPASGAAAHEAVAEDAGNRIAAPACVHFTADWRYTFVTNDCSIDYSVTVAYGDGTDVPCRSA
NPGDILTFPGYGTRGNEVLGAVLCATDGSALPVDRERAVR
>P20596 ~~~~~~Alpha-amylase inhibitor Paim-2~~~
SDASEPAPACVVMYESWRYTTAANNCADTVSVSVAYQDGATGPCATLPPGAVTTVGEGYLGEHGHPDHLALCPSS
>P37595 3.4.19.5~~~iaaA~~~Isoaspartyl peptidase~~~COG1446
MGKAVIAIHGGAGAISRAQMSLQQELRYIEALSAIVETGQKMLEAGESALDVVTEAVRLLEECPLFNAGIGAVFTRDETH
ELDACVMDGNTLKAGAVAGVSHLRNPVLAARLVMEQSPHVMMIGEGAENFAFARGMERVSPEIFSTSLRYEQLLAARKEG
ATVLDHSGAPLDEKQKMGTVGAVALDLDGNLAAATSTGGMTNKLPGRVGDSPLVGAGCYANNASVAVSCTGTGEVFIRAL
AAYDIAALMDYGGLSLAEACERVVMEKLPALGGSGGLIAIDHEGNVALPFNTEGMYRAWGYAGDTPTTGIYREKGDTVAT
Q
>Q7CQV5 3.4.19.5~~~iaaA~~~Isoaspartyl peptidase~~~
MNKAVIAIHGGAGAIARAQMSHEQELRYIQALSEIVESGQKMLEAGDSALDVVTEAVRLLEACPLFNAGIGAVYTRDGTH
ELDACVMDGNTLKAGAVAGVSHVRHPVLAARLVMERSPHVLMVGEGAENFAFSQGMARVSPDIFSTPARYEQLLAARAAG
EMALDHSGAPLDETKKMGTVGAVARDKFGNLAAATSTGGMTNKLPGRVGDSPLVGAGCYANNASVAVSCTGTGEVFIRTL
AAYDIAALMEYGGLSLADACERVVMEKLPALGGSGGLIAVDHEGNVALPFNSEGMYRAWGYAGDTPTTGIYRE
>Q5P603 6.2.1.75~~~iaaB~~~Indoleacetate--CoA ligase~~~COG0318
MELSEWIDRHAGLEPGKTAIRFPERDLSYAQLAGLVERLASALKASGVAHRSCVAYLGYNSPEMLATLFACARLGALFMP
LNWRLAGPEHRQLLADCPPSVLFVEPRFVAQIDAFRDALADVTLVAFDAPPQGWISYEALLERSGDAVPRDPQVGPQTPL
LICYTSGTTGKPKGALLSQGALAWNAVNSIDLHELSADDRILTTLPLFHVGGLNNQTTPALSAGATVVLHPKFDADATFD
AIEQERITLTVLVPAQLEMMIARPRWQSADLSSLRMITTGSTIVPERLIREVHRRGVPLVQIYGSTETCPIAAYVKPADA
QRKAGSAGRAAPHCSLRIVGDDGHDVKPGATGEILVRGPNVMNAYWNDLQASAAVLKDGWFRTGDMGHQDGEGYLWVDGR
KKEMIISGGENIYPAEIENLLGESPDIAEVAVVGRLDERWGEVVVAVVVPLEGRTLDAGHVLQLLEGRIARYKLPKEVVF
LDELPRTALGKVRKDDVRQLVARKTFMEQT
>Q5P5Z9 1.3.3.17~~~IaaF~~~Benzylmalonyl-CoA dehydrogenase~~~COG1960
MDFDLTDEQRAIQDTFARFSDERIAPQAAALDEARAFPRALFRELAELGFFGMRYPESVGGSGLALSEFCLALSEVARGS
MSLAGAVAMQSLMGTKFLQLLGNADIVERLFKPALRGDRIGAICMTEPNAGSDLESIATTATRVDGGYVINGQKTWITSA
PVADFFTVFARAGDEKKLTIFLVEKDVPGITVGREIHKMGVWALPTSEVAFDGCFVPDSHRLSKEEGDGEGHLKKTLAEI
RIITGAMALGVARAALFAAVRYAGERKQFGKPINRFQAIQLKLADMATGLEAATTLVHRAAWLCDMKRPHHKEAAMAKLF
ATETAAGICDDAARVLASYGYAMEYPVQRYLRDVRFTLIGGGTSEILKLVIAKEVSS
>Q5P5Z3 2.8.3.28~~~iaaL~~~Phenylsuccinyl-CoA transferase~~~COG1804
MKRRTVLSMEQALSMPYATLRFAQLGWRVIRLESTPSRGGLPGDPNRYIGANVVDDDRRTYFIAPNVGKEAIAINLKEPD
GQALLRRLLVELDVDVFCCNTVPRRYEQLGIDYETLSRTKPDLIWAGISAMGPDYPDAPGYDPVLQAMAGYMELTGDADG
PPTLAGVPIVDLKAGDEVFANVMLALLERAETGKGSRIDVSMLQAAASWLITTLPLLDFDCQPAEITRCGNAHRKFIPTN
VYPTADGFIYMAIGSDVQWRRLTEIPKFASLGAAPRATNEGRHKERDAIHRDMAAVTTRFATAEIAADFRDATIPHAPIH
DIPAVRDMEAVRRRLTTTRTPDGRLVHMQPMAVDVAGASGELAFPAKYGQDTCAVLREAGYADEAIAQLRERGIVAG
>O50173 3.5.1.134~~~iaaspH~~~Indole-3-acetyl-aspartic acid hydrolase~~~
MPLLNEYIRQLLPEMTQWRRDLHHYAESGWVEFRTASKVAEQLHQLGYDLTLGRDAVDADSRMGLPDEITLANAFQRARE
QGAPEPWLSAFEGGFTGIVATLDTGRPGPTLAFRVDMDALDLNEDTDGHHRPFREDFASCNPGMMHACGHDGHTAIGLGL
AHVLKQYADRLHGVIKLIFQPAEEGTRGARAMVAAGVVDDVDYFTAIHIGTGVPAGTVVCGSDNFMATTKFDALFTGVAA
HAGGKPEDGRNALLAAAQAAIALHAIAPHSAGASRVNVGVMQAGTGRNVVPSGALLKVETRGETEDINRYVFERAREVIH
GAAAMYGASVELRLMGAATSSAPSPGWVHYLREQAARVPGVEQAIDRIAAPAGSEDATLMMARVQQHNGLASYMVFGTEL
SAGHHNEQFDFDENVMAIAVETLALTALNFPWQRGV
>P07512 ~~~~~~Alpha-amylase inhibitor Z-2685~~~
ATGSPVAECVEYFQSWRYTDVHNGCADAVSVTVEYTHGQWAPCRVIEPGGWATFAGYGTDGNYVTGLHTCDPATPS
>P01092 ~~~~~~Alpha-amylase inhibitor HOE-467A~~~
MRVRALRLAALVGAGAALALSPLAAGPASADTTVSEPAPSCVTLYQSWRYSQADNGCAQTVTVKVVYEDDTEGLCYAVAP
GQITTVGDGYIGSHGHARYLARCL
>P94531 3.2.1.55~~~abfA~~~Intracellular exo-alpha-(1->5)-L-arabinofuranosidase 1~~~COG3534
MKKARMIVDKEYKIGEVDKRIYGSFIEHMGRAVYEGIYEPDHPEADEDGFRKDVQSLIKELQVPIIRYPGGNFLSGYNWE
DGVGPVENRPRRLDLAWQTTETNEVGTNEFLSWAKKVNTEVNMAVNLGTRGIDAARNLVEYCNHPKGSYWSDLRRSHGYE
QPYGIKTWCLGNEMDGPWQIGHKTADEYGRLAAETAKVMKWVDPSIELVACGSSNSGMPTFIDWEAKVLEHTYEHVDYIS
LHTYYGNRDNNLPNYLARSMDLDHFIKSVAATCDYVKAKTRSKKTINLSLDEWNVWYHSNEADKKVEPWITARPILEDIY
NFEDALLVGSLLITMLQHADRVKIACLAQLVNVIAPIMTEKGGEAWRQPIFYPYMHASVYGRGESLKPLISSPKYDCSDF
TDVPYVDAAVVYSEEEETLTIFAVNKAEDQMETEISLRGFESYQIAEHIVLEHQDIKATNQHNRKNVVPHSNGSSSVSEN
GLTAHFTPLSWNVIRLKKQS
>P94552 3.2.1.55~~~abf2~~~Intracellular exo-alpha-L-arabinofuranosidase 2~~~COG3534
MSEHQAVIQTDIAKGTINKNIYGHFAEHLGRGIYEGIWVGTDSDIPNINGIRKDVLEALKQLHIPVLRWPGGCFADEYHW
ANGVGDRKTMLNTHWGGTIESNEFGTHEFMMLCELLECEPYICGNVGSGTVQEMSEWIEYMTFEEGTPMSDWRKQNGREE
PWKLKYFGVGNENWGCGGNMHPEYYADLYRRFQTYVRNYSGNDIYKIAGGANVDDFNWTDVLMKKAAGLMDGLSLHYYTI
PGDFWKGKGSATEFTEDEWFITMKKAKYIDELIQKHGTIMDRYDPEQRVGLIIDEWGTWFDPEPGTNPGFLYQQNTIRDA
LVAASHFHIFHQHCRRVQMANIAQTVNVLQAMILTEGERMLLTPTYHVFNMFKVHQDASLLATETMSADYEWNGETLPQI
SISASKQAEGDINITICNIDHQNKAEAEIELRGLHKAADHSGVILTAEKMNAHNTFDDPHHVKPESFRQYTLSKNKLKVK
LPPMSVVLLTLRADS
>A3DIH0 3.2.1.55~~~~~~Intracellular exo-alpha-(1->5)-L-arabinofuranosidase~~~COG3534
MKKARMTVDKDYKIAEIDKRIYGSFVEHLGRAVYDGLYQPGNSKSDEDGFRKDVIELVKELNVPIIRYPGGNFVSNYFWE
DGVGPVEDRPRRLDLAWKSIEPNQVGINEFAKWCKKVNAEIMMAVNLGTRGISDACNLLEYCNHPGGSKYSDMRIKHGVK
EPHNIKVWCLGNEMDGPWQVGHKTMDEYGRIAEETARAMKMIDPSIELVACGSSSKDMPTFPQWEATVLDYAYDYVDYIS
LHQYYGNKENDTADFLAKSDDLDDFIRSVIATCDYIKAKKRSKKDIYLSFDEWNVWYHSNNEDANIMQNEPWRIAPPLLE
DIYTFEDALLVGLMLITLMKHADRIKIACLAQLINVIAPIVTERNGGAAWRQTIFYPFMHASKYGRGIVLQPVINSPLHD
TSKHEDVTDIESVAIYNEEKEEVTIFAVNRNIHEDIVLVSDVRGMKDYRLLEHIVLEHQDLKIRNSVNGEEVYPKNSDKS
SFDDGILTSMLRRASWNVIRIGK
>Q841V6 3.2.1.55~~~abfB~~~Intracellular exo-alpha-(1->5)-L-arabinofuranosidase~~~COG3534
MTTHNSQYSAETTHPDKQESSPAPTAAGTTASNVSTTGNATTPDASIALNADATPVADVPPRLFGSFVEHLGRCVYGGIY
EPSHPTADENGFRQDVLDLVKELGVTCVRYPGGNFVSNYNWEDGIGPRENRPMRRDLAWHCTETNEMGIDDFYRWSQKAG
TEIMLAVNMGTRGLKAALDELEYVNGAPGTAWADQRVANGIEEPMDIKMWCIGNEMDGPWQVGHMSPEEYAGAVDKVAHA
MKLAESGLELVACGSSGAYMPTFGTWEKTVLTKAYENLDFVSCHAYYFDRGHKTRAAASMQDFLASSEDMTKFIATVSDA
ADQAREANNGTKDIALSFDEWGVWYSDKWNEQEDQWKAEAAQGLHHEPWPKSPHLLEDIYTAADAVVEGSLMITLLKHCD
RVRSASRAQLVNVIAPIMAEEHGPAWRQTTFYPFAEAALHARGQAYAPAISSPTIHTEAYGDVPAIDAVVTWDEQARTGL
LLAVNRDANTPHTLTIDLSGLPGLPGLGTLALGKAQLLHEDDPYRTNTAEAPEAVTPQPLDIAMNATGTCTATLPAISWI
SVEFHG
>Q9XBQ3 3.2.1.55~~~abfA~~~Intracellular exo-alpha-(1->5)-L-arabinofuranosidase~~~
MATKKATMIIEKDFKIAEIDKRIYGSFIEHLGRAVYGGIYEPGHPQADENGFRQDVIELVKELQVPIIRYPGGNFVSGYN
WEDGVGPKEQRPRRLDLAWKSVETNEIGLNEFMDWAKMVGAEVNMAVNLGTRGIDAARNLVEYCNHPSGSYYSDLRIAHG
YKEPHKIKTWCLGNEMDGPWQIGHKTAVEYGRIACEAAKVMKWVDPTIELVVCGSSNRNMPTFAEWEATVLDHTYDHVDY
ISLHQYYGNRDNDTANYLALSLEMDDFIRSVVAIADYVKAKKRSKKTIHLSFDEWNVWYHSNEADKLIEPWTVAPPLLED
IYNFEDALLVGCMLITLMKHADRVKIACLAQLVNVIAPIMTEKNGPAWKQTIYYPFMHASVYGRGVALHPVISSPKYDSK
DFTDVPYLESIAVYNEEKEEVTIFAVNRDMEDALLLECDVRSFEDYRVIEHIVLEHDNVKQTNSAQSSPVVPHRNGDAQL
SDRKVSATLPKLSWNVIRLGKR
>Q82P90 3.2.1.55~~~~~~Extracellular exo-alpha-(1->5)-L-arabinofuranosidase~~~COG3940
MRRLTVRLFTAVLAALALLTMGTPAHATAPASPSVTFTNPLAEKRADPHIFKHTDGYYYFTATVPEYDRIVLRRATTLQG
LATAPETTIWTKHASGVMGAHIWAPEIHFIDGKWYVYFAAGSTSDVWAIRMYVLESGAANPLTGSWTEKGQIATPVSSFS
LDATTFVVNGVRHLAWAQRNPAEDNNTSLFIAKMANPWTISGTPTEISQPTLSWETVGYKVNEGPAVIQHGGKVFLTYSA
SATDANYCLGMLSASASADLLNAASWTKSSQPVFKTSEATGQYGPGHNSFTVSEDGKSDILVYHDRNYKDISGDPLNDPN
RRTRLQKVYWNADGTPNFGIPVADGVTPVRFSSYNYPDRYIRHWDFRARIEANVTNLADSQFRVVTGLAGSGTISLESAN
YPGYYLRHKNYEVWVEKNDGSSAFKNDASFSRRAGLADSADGIAFESYNYPGRYLRHYENLLRIQPVSTALDRQDATFYA
E
>P82594 3.2.1.55~~~~~~Extracellular exo-alpha-(1->5)-L-arabinofuranosidase~~~
MCTREAVRMSREHDLPEIPSRRLLLKGAAAAGALTAVPGVAHAAPRPAPYENPLVRQRADPHIHRHTDGRYYFTATAPEY
DRIVLRRSRTLGGLSTAAESVIWRAHPTGDMAAHIWAPELHRIGGKWYVYFAAAPAEDVWRIRIWVLENSHPDPFKGTWE
EKGQVRTAWETFSLDATTFTHRGARYLCWAQHEPGADNNTGLFLSEMANPWTLTGPQIRLSTPEYDWECVGYKVNEGPYA
LKRNGRIFLTYSASATDHHYCVGMFTADAGGNLMDPGNWSKSPIPVFTGNETTKQYGPGHNCFTVAEDGRSDVLVYHARQ
YKEIVGDP
>P53627 3.2.1.55~~~abfA~~~Intracellular exo-alpha-(1->5)-L-arabinofuranosidase~~~
MRTARFTLDPAFTVGAVNPRLFGSFVEHLGRCVYTGVFEPGHPTADAEGLRQDVLELVRELGVTAVRYPGGNFVSGYKWE
DSVGPVEDRPRRLDLAWRSTETNRFGLSEYIAFLKKIGPQAEPMMAVNLGTRGVAEALELQEYANHPSGTALSDLRAEHG
DKDPFGIRLWCLGNEMDGPWQTGHKTAEEYGRVAAETARAMRQIDPDVELVACGSSGQSMETFAEWEATVLKETYDLVDH
ISLHAYYEPHDGDVDSFLASAVDMESFIENVVATCDHVGARLKSKKKINLSFDEWNVWYMTKTQAEVSALDWPEAPRLLE
DNYSVMDAVVFGSLLIALLRHADRVTVACLAQLVNVIAPIMTEPGGPAWRQTTFFPFSQASKYGRGEVLDVRVDSPTYDT
AKYGEADLLHATAVVRARRSVTVFAVNRSRTGALPLEVALSGLELTEVVEHSALADADPDARNTLAEPERVVPHPVDGTS
LRDGRLTAALEPLSWNSIRCADPAPGQPPRRPGEGTGFTGTPPAAPPSSSSAPRPDPTARRSPDRTARARVLAAARVRRM
PFGRTKVCGAPVRPPTYAPRFQPFRKTWTRWAPAPRSGSPSRRSPTEASIPGGTSSRNVVRYQVTPWRRSPPGSAPGTPA
PTRRRRTRAGASRGAPRTARRC
>B3EYM8 3.2.1.99~~~abnB~~~Intracellular endo-alpha-(1->5)-L-arabinanase~~~
MVHFHPFGNVNFYEMDWSLKGDLWAHDPVIAKEGSRWYVFHTGSGIQIKTSEDGVHWENMGWVFPSLPDWYKQYVPEKDE
DHLWAPDICFYNGIYYLYYSVSTFGKNTSVIGLATNQTLDPRDPDYEWKDMGPVIHSTASDNYNAIDPNVVFDQEGQPWL
SFGSFWSGIQLIQLDTETMKPAAQAELLTIASRGEEPNAIEAPFIVCRNGYYYLFVSFDFCCRGIESTYKIAVGRSKDIT
GPYVDKNGVSMMQGGGTILDEGNDRWIGPGHCAVYFSGVSAILVNHAYDALKNGEPTLQIRPLYWDDEGWPYLSV
>Q93HT9 3.2.1.99~~~abn-ts~~~Intracellular endo-alpha-(1->5)-L-arabinanase~~~
MVHFHPFGNVNFYEMDWSLKGDLWAHDPVIAKEGSRWYVFHTGSGIQIKTSEDGVHWENMGRVFPSLPDWCKQYVPEKDE
DHLWAPDICFYNGIYYLYYSVSTFGKNTSVIGLATNRTLDPRDPDYEWKDMGPVIHSTASDNYNAIDPNVVFDQEGQPWL
SFGSFWSGIQLIQLDTETMKPAAQAELLTIASRGEEPNAIEAPFIVCRNGYYYLFVSFDFCCRGIESTYKIAVGRSKDIT
GPYVDKNGVSMMQGGGTILDAGNDRWIGPGHCAVYFSGVSAILVNHAYDALKNGEPTLQIRPLYWDDEGWPYL
>D0C6T7 1.14.13.235~~~iacA~~~Indole-3-acetate monooxygenase~~~
MMNKLSKMEFAAQDKAVDLDALCQEIRERACAGEFDNQAYVSQDIIEKLKKIGVYRALVPKRFGGEEWSPRQFCELIETL
SKADGSVGWVASFGMSPAYLGSLPEETLKELYQNGPDVVFAGGIFPPQPAEITDEGVVVRGRWKFSSGCMGADIVGVGIS
PLKNNEMQGLPRMAVMPANKAKIEMTWDTVGLKGTGSHDLVVEDVLVEKKWTFVRGEPSKLSEPFFKYPSLSLATQVLTV
VGIGVAAAALEEFEKLAPGKASITGGSEIANRPVTQYEFAQADAEFQAAKSWFYQTMDIVWNEIIAGREATAEQISDMRL
ACTHAARVCAKVTRKMQMLAGMTAIYTNNPFSRFVNDTNVVTQHAFMGDATLQNAGLVSFGLKPAPGYL
>B0FXI0 1.14.13.235~~~iacA~~~Indole-3-acetate monooxygenase~~~
MDSLCLRAAPASALASGPAFEALLDGVRDRARLGEFDRQRHISRDVIDAFKAHGVYRALVPKRFGGLECSPAAFCEMIER
ISHADGSAGWVASFGMSPVYLAALPLETIAEIYGNSPDTVFAGGIFPPQAAEIVSGGFKINGRWKYSSGSMGADIVGVGI
APRNGDKLDLPRLAVLPRSQARIEETWDTVGLLGTGSHDLVVEDVVVGEQWTFVRGGKPNLDEPFFRYPSLSFATQVLSV
VGLGIARAALDELSGMASGRISVTGAPALADRPLAQVDVAKAEAALRSARAFFYESIERAWEHVLAGDPVPVDVTNLLRL
SSTHAARVAAEVARSAQMLSGMTGIYNESPLARCVNDAQVVTQHAFMGDVTYQNAGAMFFGKQPLPGYL
>B0FXI7 ~~~iacR~~~HTH-type transcriptional repressor IacR~~~
MSNAKNTSAASPARKGHSHHDPASDEFRKEDFPFYWLARVHGRYTQNMERLLKKIDLDVPRWRVLWILNENGESSISEIS
THAIAKLSTITKIVYRMKEDGLVDTAPSPEDGRVTQVRITEVGLQNIERMQEVTRELFQRSFKGLTEAQVQRLNRMLEVV
FHNLETL
>Q8YQ14 ~~~iacT~~~Iron and copper transporter IacT~~~COG4773
MVFVECGAALKLNQCIYIGIASTICLLITQKANAQEKPVNTKNIGLITNIPRLSDIERPPTSVKDWLSQSAPTPPKIKIT
GVRINRTDDNFEIILETPDGEISAPETLQEGNIFIADIPNAVLALPEGKEFREDTPVDGISYVTVTQQESNNTVRVTIAS
SGKLPPIQVVNQSNGLTIALTPTSPDIELIVTAQKRPEDAQDVPLSLTVIPQQEIEDAQIRSFQDIANNTPNFSFLPTTA
GSADFSYYSVRGLNNFNFLANQDTVGFYIDDVPFDYGGFLDVGLIDLERVEVLRGPQSTLYGRSSPAGVVNVISRPPSNQ
PEMRISALYGSYNNRELQLSLSDAIIPDKLAFRLAGAYNARDGVFDNTFLNKPIGERSQLTGRAQILWTPTPEWNISFNA
YASDNDNGNPTFSRQNAENPFQVSQEVDGFHRLSTNTQALKISYNGDGFRATSITTRRFSNQNTLVGDNFPGDLLQQIIG
INSTLWSQEFRLQSPESADRLRWLLGGYYESRNFNVLDDTFKYSDAGAVFFGLPASGSDRVSAEQNRHTYAIFGQIDYKP
IAPLTLFAGWRYETADAELDRRRVFVNPDGTANPPTAEVRNATLNSDAFIPRFGLQYRFNPNLMAYATIAKGYRPSGFNY
RADTEDTRRFQEETTWTYEAGLKSSWLDDRLSANLSIFQSDVDNYQVLLTDDFGFFRNVTNANVKVTGLEFELKANPLQG
LDLIAGIGYVDSKFKNYRNSFTNRDFSNNRVPFAPELTYNLAVQYRSPGGIFARAELRGYGITYFDDANQVKQDPYALVN
ARIGYEGEKYGIYLYANNLFDTRYITSGFLFPPPNVTAGFGDPVTYGVRVSASF
>P39377 3.4.19.-~~~iadA~~~Isoaspartyl dipeptidase~~~COG1820
MIDYTAAGFTLLQGAHLYAPEDRGICDVLVANGKIIAVASNIPSDIVPNCTVVDLSGQILCPGFIDQHVHLIGGGGEAGP
TTRTPEVALSRLTEAGVTSVVGLLGTDSISRHPESLLAKTRALNEEGISAWMLTGAYHVPSRTITGSVEKDVAIIDRVIG
VKCAISDHRSAAPDVYHLANMAAESRVGGLLGGKPGVTVFHMGDSKKALQPIYDLLENCDVPISKLLPTHVNRNVPLFEQ
ALEFARKGGTIDITSSIDEPVAPAEGIARAVQAGIPLARVTLSSDGNGSQPFFDDEGNLTHIGVAGFETLLETVQVLVKD
YDFSISDALRPLTSSVAGFLNLTGKGEILPGNDADLLVMTPELRIEQVYARGKLMVKDGKACVKGTFETA
>A0A291SJC7 4.2.3.157~~~~~~Isoafricanol synthase~~~
MHTHASRPHARQSALPRRAALFDFPASADLSPDTGAARQHTIQWLSRFRVFENHASVEEYDALRFDVLTGLFYPRATGAD
LNLGSDLVGWYFVFDDQFDGELGCRPEEVARLVADVIRVTEEDMAPGGTGGGEGPLLESFRDLWHRINSGRPRVWRDRFR
HHWLEYLHSYHREALERTGAAPADGGGDAPRSVEDVLALRRHSIGVQPCLDLNEPFGGYTLPSALHGGFPLARMREATDD
VVVFTNDIASLDKELAVGDVHNSVIVQWKLAGGGVEDAVRHIAGLANARYGWFEETAARLPELLAEAGADPGTHRAVGRY
VDGMRHVMTGNLGWSLRTARYDERGTEAVSGGRERPWARLTGAEDLIRAGRGAPPPPGSGPDTRQPMPSEPSQLA
>G2P5T1 4.2.3.157~~~~~~Isoafricanol synthase~~~
MHAHASRPQARQTTLLRRAALFDFPASADLSPGTEAARHHTIQWLSRFGVFEGHESVAEYDALRFDVLAGLFYPRATGAD
LNLGSDLVGWYFVFDDQFDGELGSRPEAVARLVADVIRITEEDTAHGRAQDGEGPLLESFRDLWRRISSGRPQVWRDRFR
HHWLEYLHSYHREALERTGALPGAGGDAPRSVEAVLALRRHSIGVQPCLDLNEPFGGYTLPPALHGGFPMARMREATDDV
VVFTNDIASLDKELAVGDVHNSVIVQWERAGGELEDAVRHIADLANARYRWFEETAARLPALLTEAGADPGTHHAVGRYV
DGMRHVMTGNLGWSVRTARYDERGTEAVSGGRQRPWAQLTGAEELIRAGRGAPLPPLGSGSGSR
>P0A9W6 ~~~ibaG~~~Acid stress protein IbaG~~~COG5007
MENNEIQSVLMNALSLQEVHVSGDGSHFQVIAVGELFDGMSRVKKQQTVYGPLMEYIADNRIHAVSIKAYTPAEWARDRK
LNGF
>Q086E4 ~~~~~~Ice-binding protein 1~~~COG4932
MNHSIKKTYLVFTMLLGFILLAGCNGDNNNDNSNNDNNGVLLTSIAVTPATPSMPLGLKQQFTAMGTYSDGTSSDITNSA
TWSSDDSTVATINGSGLAMGVIPGSVAITASLIDSSSNEQSATTTLTITDATLTALAITPVNPSLAKGLTKQFMATGTYS
DGTSPDVTTSVTWSSANTLVATVNASGLASGVAIGSSIITASLGSDETTTELNITDAILSSIALTPVEPSIAKGITQQFT
AIGTYSDGISVDITASSNWSSADTLVATMNTSGAAKGVSIGSSIITADFQAQSATSLLTVTDASLTSIMLTPANPHIPKG
NTLQLTATGIYSDGISVDITSSAIWSSADTLIATVNADGVVSGITSGSAIITATSAALSATTTVTVTDTTLTSIAVTPGN
QTIVKGSNKQLTATGTYSDGSLANITASVTWSSADTLVATVNNSGLASGIETGSSLISASSGALSGSTNLTITGAALNSI
VVSPTNLSLVKGMNKQFAATATYSDGSVADISTSVTWSSADTLVATIDVNGLANGKAAGSSLITATSGAQSNSTNLTVTD
ATLNSIDVTPINPSIIKNSSQNFVATGHYSDGSTTNITSTVMWSSADTLVATLNPNEQLNSGRATAIEVGSSVIQASLSG
VFADTTLNVTAALPNNPLAPELGEVARFAMLASQAITTTSGSAIVDGDLGILDQARSYYAGFTPGVNAGEFDELTNGLSY
AGDDSTPPYVVPVPYASMVAFINQSRTDLGIAYNFLAADPNPNAATQVCPIELGNLTLTRGVYKTAADVTLQTGTLTLDG
EGDPDSVFIFTIGGNLTSGAPGGDIVLINGAQAKNIYWRTAGKTVIGTNTNFSGNVFAWSEVNVRTGANVTGRLFAVTDQ
VTLDANAVTKAN
>P0C054 ~~~ibpA~~~Small heat shock protein IbpA~~~COG0071
MRNFDLSPLYRSAIGFDRLFNHLENNQSQSNGGYPPYNVELVDENHYRIAIAVAGFAESELEITAQDNLLVVKGAHADEQ
KERTYLYQGIAERNFERKFQLAENIHVRGANLVNGLLYIDLERVIPEAKKPRRIEIN
>Q06277 ~~~ibpA~~~Protein adenylyltransferase and cysteine protease IbpA~~~
MNKNCYKLIFSKTRGCLVPVAECITSAVDSGSSDSVVVSEKTDEEDRQGSIEDYRLSNVCLSVKTFLNPVSSALCLNWKS
VSVLLLSMVAAPNFAQSAEEAAKAEKTPKLTEIQNGNDGIQLETKNQNIGVGAGTTENNHPTKLYKTENNVIVIDIAKPN
DKGISDNRFQKFNIPNGAVFKNNKDQQRSELVGYLEGNKNLADKEAKVILNQVTGSELSQIKGALEILGTKADLVIANQH
GINLNGVQTINAGRFVATTSKLIDPNKMEFDVTQGTVTIDVNGFATDNLPYLDIVAKKIEQKGTIGNKEKEKNKTSETEI
TFIAGKGKIKYNIENDGKTKLEVQKDSNTSQPSDKEEVAITGASTGAMHGKSIKLIVTEQGAGVKHDGIILSENDIKIES
NKGDIDLGDKLQAKNEISLNNAKRITIANEITADKSITITADDVKLKNNKEASATEEAKLKGKGKLASKKVKVEAKKSLV
LDDETKVVATDLELKSQTLTNQGRIYGNKVKIDTDKLVNKKEIYAEDNLDITTKGKTVTVSVNKDNKRKADVKEETVADL
DVGFENTGTIESKSKAKLTFKDNTSFVSKGNKFIKAKDELTIDAQNVVISENDELQTTARLTINAAGNVVNNGLLASGKT
LTINAKQGSIYNEKGILGAREQLTLSAKGNNKETEGNIINGADSLLHSEGKMELDAENTVYNLGNIFAKSDLTVKANELI
NDVKLSGSITKKSPYSVLNRYRRSDIASHGWHNNDYRLWINPIEFEKAEVKVEKAGLIRAEGNFKFEGKKGDNQQDATLT
NHGVINVKNTFEAQNAKVVNNMKAYQANLLTEFFKQKQDITFNYQPRARLFLSALSGQAERKFNSLEELFDGLFSEQPIT
NSSSYYADNSQAVHLLEEIKSPTFQKAMTLVFGANWKNEDHKKLSQRWKEFKEKQDAHFDYRPTDKAKILAQRINGKIDE
LKNGSTGGFSESERITVGQHKFDLSKVEFRSEVNRKENLNNSNVDLSALSDLLSIPNLFVDNSVQLDKTVDKNIEIDEED
EFLLKPHTGEEPDLLNENELSENGKFLDKLLGEIGEKTYIREVSDDWERDPDEPDEPDYKTESRLETRDRFDTLPSEVQD
KLRQKFNEYKEKAQQKRQAEALQAKTKNEQLQSDLETGYKEEEKRQAKNDLEKQAELQQLDQQEKEKLAKEKELQGKINE
EKQQEALAKQKQEQQKQADAKAKIEEEKRLEEYRKELAKDHQIEEALSKNQFLKEVDDTRPKVETDPLYRTKLQYINQDE
YFGSKYFLNKVGSSTDAGKKVAVIGDNYLEHQLITKSIEKKVDNHLALKYQVNDAQLVKKLIDNSYFESKELGLKVGEAL
TKEQQNQLKQDIVWYVKANINNKEVLLPQVYFANKTLRDAEKFKGLGDALIRANEINLKTRDVLNSGTISGKNIDIEAEN
KIKNRGDILSEESTRLVGHKGIDNTARSFVNGNGDVEVQRASIRTEGHLHLEADEDSDINSKGSDIKGKTGFVKARNFNT
TDTHRTEHSVEKGRIFSKKGEILGYRKESTQKAISVGSNTEFDHVHFAIKNDVNQEGSKIKAKVVTGVVQGDYNTKAGRN
AQQTERYIRLDQEYSSGHISGAGFTVSHERDSQNGEKTNIGGASSNTGTGFTLGGSFSETREKETSLTHTNSDLQVDHGI
LHVLKKAEIGGVDINKHKFTGKAVEEDEAKAEQQAKAKAAPDATDNAAQKEEPKFKVLSQSEVDDLMTEKSANDLFNKYK
KVKEDEGFELSAKEITSNKQKDEYHLDSERSVLKFGIETEGHSAIADAVSHVAKEIVEAQRGVKQDGTVALQHISDVANI
VTGELVGGSSKFGFERNYETNKVKETSDIRTKIAGNITLSAHGGNLQLKNVESDANSKLTLQAKRNVDILDGETTRESTE
RQSRQKFAFGINSGCSVMSGGCNGGVSGSVDGNESFTTEKSVTHNNSLLRAKNLKIAAGKDLNLISSNIKADHLDLNIKG
KTNIVSKQDSFDRLYRGFDFSASAGAALSSSTLVKGNGSFGAGYTHEVENRKLLNQQAGIVANRITGQIKDLDLVAAHFI
NKDENSGFRVSGNVTSQQLNDSHHKDGGSVGVSVGINERGASSFNVRGGRAEQKHYDAVQKSVISGINLKDNNVTGEIVD
DLSKAKTVTRDDVYASTQFNFEVADLVELGEKAKSKLQSKFSKAVNNDAEQPTTTRISSEDVVEMVDNPLYGSNADVRKL
RTLDEVGEGYSTLGDQNANKGRKLPNGSDDIYSLLGKVKVSGDEPVYDKVSAEGAYDLLGDSNANKGRTLRNNSDDLYST
VGDANSDISRIRSNVYDEIAAGPYSLLGRTKAAEEHIYEQIGEGPYSLLGNGSAVRNRTLGGESNSTYSTVGDANSDISR
IRSNVYDEIVAGPYSLLGKPKAAEEHIYEQIGEGPYSLLGNGSAVRNRTLGGESDSPYSTVGDANSDISRIRSNVYDEIV
AGPYSLLGRTKAAEEHIYEQIGEGPYSLLGNGSAVRNRTLGGESDSPYSLLGGEGTRNKVLADTIESIYSTLSRPQASSN
LEMVDNPLYDSVRRSASDQLPELPTVRNLLNSDTEAGNGTYSEITSRTRNANDPLPPLPNEFRTRLSQGADLADHVYDTI
GSIYSVLSKPKASSNLEMVDNPLYGSVRRAAGDQLPELPTVKTLLNKVEEVGNEIYSEITSKTRSANDPLPALPNFRLTQ
EVDTADHIYADINDVVNRANKAKRDLPATPEATPKVAVDGGDYATIGEVSPLQPRASRQQGSSDYEEIPLPQETAPQKTS
PVKRTSAEGEDGYATIAEVLQPRAAKGQVSDYETIPLDEPSQAAVRTERSAVEGDYAEITSPSIQPRSARGQSGGEEFEP
FPSEFSSEPQSPKRALPAENAVVNELGNELKARLKSKEDQANPAKAEVSEPIYATLDKSPEGLARAKAKGDEAAAANPIV
KTRVEDDVAPELPARPSNLSDSISNETIAENGQSVALGTPKSAVAESNRNNNGNQKLQSEGAEGVSPKTKSEDKSWFAKV
KDFFFAKSNKSQAKEAKSEQETVSKPNYDSLEDDLNLKNLLALEDKRGSSFEENVLKNPEFLAEAREIAKKYIPEATIKQ
MGNSPEFDEILTEGAKKVEKRINDALTFKPSVDEFNEIQGLVKNIQKGSAVDDLNAQTLAITEALADTSKTIQRNPKLKE
EVQGAIEEFLKSSQGKELTVEMIEKLNHGLRPDEGSDRLLYKKENLTKENAVFSSPQASKIQLNETVDFINQAIKQNVEP
SVLAGLVYQRLIAYHPFAEGNGRMARVVVNKILLDAGYPPFTKFSSEFETQIIPQTKATAKSATSAEVVKEFLTELGKKS
SPQEGGANNQNGQATSPVTLKSKDVSEVENTQSADSLTIKQPEQGKAGGQLPSVPKVETSVNEVAPLSSVPAELKDAAGG
NKKAAEKSEGATGVEKEKTTLFQRVKQFFTGSKSGAKPVAGDETANKVNYQDLEDNLNLKGLISLEDDRNANFESNVLKN
EKFLDEAREISKKSIPEATVKQMSHLPEFDDILTEGAKKVESRINKAITFRPSVEEFSEIQDLVKTLPKTKVIEDLSTKT
NEITEALAATSKTIQRTPELKEQLKTAIEDFLQNSQGKPLTVQMIENLNHGLRPDEGEGRLLYKKENLTKENAVFSSPEA
AKIQLAETVDFINRAKNEGIEPSVVGALVYQRLIAYHPFAEGNGRMARVIVNKILLDAGYPAFTKFSDEFEPQIIPQTKA
STKSATSSEVVVEFLKELAKKGSKEDNEQNLEKTDRTSTDLTESAVENSAALSSGTVRSATVSETVTETEQAKAKPVSDL
VSSKDLVEQQRTVLQRIQDQFQPLKVKSKIDAVRSSVEEFGGEVSFKFAQSKGEVYKEIVKHIETQNGVCESTCAHWIAK
NVNPTDENFFNTLYEGGKKGHLKKETIDSIKKLQTEFINSGSATQQFKLTDSWLQEQGVVPKEKKVADFVRRDEVSGTVS
KNDVSSLVKAILDTGDDTAGVKKISINLEGGSHTVSAAVDGSKVTFFDPNFGEMTFPTHQQFENWLKNAFWQKSGYAGKQ
EGRRFFNVVNYKKNN
>P0C058 ~~~ibpB~~~Small heat shock protein IbpB~~~COG0071
MRNFDLSPLMRQWIGFDKLANALQNAGESQSFPPYNIEKSDDNHYRITLALAGFRQEDLEIQLEGTRLSVKGTPEQPKEE
KKWLHQGLMNQPFSLSFTLAENMEVSGATFVNGLLHIDLIRNEPEPIAAQRIAISERPALNS
>A5XB26 ~~~~~~Ice-binding protein~~~
MKTLISNSKKVLIPLIMGSIFAGNVMAAGPYAVELGEAGTFTILSKSGITDVYPSTVTGNVGTSPITGAALLLNCDEVTG
AMYTVDSAGPLPCSINSPYLLELAVSDMGIAYNDAAGRVPADHTELGTGEIGGLTLEPGVYKWSSDVNISTDVTFNGTMD
DVWIMQISGNLNQANAKRVTLTGGALAKNIFWQVAGYTALGTYASFEGIVLSKTLISVNTGTTVNGRLLAQTAVTLQKNT
INAPTEQYEEAPL
>H7FWB6 ~~~~~~Ice-binding protein~~~COG3420
MKILKRIPVLAVLLVGLMTNCSNDSDSSSLSVANSTYETTALNSQKSSTDQPNSGSKSGQTLDLVNLGVAANFAILSKTG
ITDVYKSAITGDVGASPITGAAILLKCDEVTGTIFSVDAAGPACKITDASRLTTAVGDMQIAYDNAAGRLNPDFLNLGAG
TIGGKTLTPGLYKWTSTLNIPTDITISGSSTDVWIFQVAGNLNMSSAVRITLAGGAQAKNIFWQTAGAVTLGSTSHFEGN
ILSQTGINMKTAASINGRMMAQTAVTLQMNTVTIPQ
>A0A7D5JNZ7 ~~~~~~Ice-binding protein~~~
MLKINRKYAIILAIVAFSSFQTEAKAASISMLGTASNFGVLGGSTVTNTGPSVITESLGVSTGSSATGFPPAIVNGTIFT
SDTVAAQAQVDNATAYNKLASLIPNKDLTGLDLGGLTLTPGVYSFSSSAQLTGILTLDNLGDPNALFVFQIGSTLTTASN
SSIVTTNGDAPNVFFQIGSSATLGTGTQFMGNILALTSITLTTGVNIDCGRALAQNGAVTMDTNKVSNACYTKPQEKAVV
PEPDSSLAVLGSGLVSLLFAFRKRFRKGW
>Q5HKQ0 2.4.1.-~~~icaA~~~Poly-beta-1,6-N-acetyl-D-glucosamine synthase~~~COG1215
MHVFNFLLFYPIFMSIYWIVGSIYYFFIKEKPFNRSLLVKSEHQQVEGISFLLACYNESETVQDTLSSVLSLEYPEKEII
IINDGSSDNTAEIIYDFKKNHDFKFVDLEVNRGKANALNEGIKQASYEYVMCLDADTVIDDDAPFYMIEDFKKNPKLGAV
TGNPRIRNKSSILGKIQTIEYASIIGCIKRSQSLAGAINTISGVFTLFKKSALKDVGYWDTDMITEDIAVSWKLHLFDYE
IKYEPRALCWMLVPETIGGLWKQRVRWAQGGHEVLLRDFWPTIKTKKLSLYILMFEQIASITWVYIVLCYLSFLVITANI
LDYTYLKYSFSIFFFSSFTMTFINIIQFTVALFIDSRYEKKNIVGLIFLSWYPTLYWVINAAVVIMAFPKALKRKKGGYA
TWSSPDRGNIQR
>Q6TYB1 3.5.1.-~~~icaB~~~Poly-beta-1,6-N-acetyl-D-glucosamine N-deacetylase~~~
MKPFKLIFISALMILIMTNATPISHLNAQANEENKKLKYEKNSALALNYHRVRKKDPLNDFISLLSGSKEIKNYSVTDQE
FKSQIQWLKAHDAKFLTLKEFIKYKEKGKFPKRSVWINFDDMDQTIYDNAFPVLKKYHIPATGFLITNHIGSTNFHNLNL
LSKKQLDEMYETGLWDFESHTHDLHALKKGNKSKFLDSSQSVASKDIKKSEHYLNKNYPKNERALAYPYGLINDDKIKAM
KKNGIQYGFTLQEKAVTPDADNYRIPRILVSNDAFETLIKEWDGFDEEK
>Q5HKP7 ~~~icaC~~~Probable poly-beta-1,6-N-acetyl-D-glucosamine export protein~~~COG3936
MKKNKLELVYLRAFICVIIIVTHLLTQITLENEQMSDSSLILQYYIRNIFIFGTPSFIILSQLLTTLNYESVTINYLFSR
FKYIFIPYLLIGLFYSYSESLITASSFKKQFIENVVLGQWYGYFIIIIMQFFVLSYIIYKINFRLFNSKILLLLAFIVQQ
SYLHYFLNNDTFHQFMTHYYPLSENTMILGWIFYFFLGGYIGYNYEKILSFLEKYLIIVIMLTLGAYVLFIAVSGSDYWN
VTSFTYTLTLYNSVMFFLLLGVCMHFKTMLLNTIKAISAFSFFIYLLHPIILDSLFAYTNIFEDNTIVFLAISLLMILGI
CIGVGMMLREFYIFRFVIGKQPYKLQFDQYQPNWN
>Q5HKP9 ~~~icaD~~~Poly-beta-1,6-N-acetyl-D-glucosamine synthesis protein IcaD~~~
MVKPRQRQYPTVTSYLNIVRESLFITISGVFWMYCIVVMIVYIGTLINSQMESVITIRIALNVENTEIYKLFGWMSLFVL
IIFIFFTFSLAFQKYKKGRDI
>A0A4P7TQN2 ~~~icaR~~~Invasion chromosome antigen R~~~
MGKISDLNYSQHITLADNFKQKNEALDTWYVGMNDFARIAGGQNSRSNILSPRAFLEFLAKIFTLGYVDFSKRSNEAGRN
MMAHIEFSSYSKDTDGNEKMKFYMNNPEGERADLSKVKIEITLASASTKGIREGHTVIIFKQSDGSTNRYEGKSFERKDD
SSLHLITNKVLACYQREANKKIARLLNNHQKLNNIQELNDSQELNNSQKLNNSQELNNSQVSCKGSVDSTITDLLEKALN
KG
>Q5HCN2 ~~~icaR~~~Biofilm operon icaADBC HTH-type negative transcriptional regulator IcaR~~~
MKDKIIDNAITLFSEKGYDGTTLDDIAKSVNIKKASLYYHFDSKKSIYEQSVKCCFDYLNNIIMMNQNKSNYSIDALYQF
LFEFIFDIEERYIRMYVQLSNTPEEFSGNIYGQIQDLNQSLSKEIAKFYDESKIKMTKEDFQNLILLFLESWYLKASFSQ
KFGAVEESKSQFKDEVYSLLNIFLKK
>A6QKF4 ~~~icaR~~~Biofilm operon icaADBC HTH-type negative transcriptional regulator IcaR~~~
MKDKIIDNAITLFSEKGYDGTTLDDIAKSVNIKKASLYYHFDSKKSIYEQSVKCCFDYLNNIIMMNQNKSNYSIDALYQF
LFEFIFDIEERYIRMYVQLSNTPEEFSGNIYGQIQDLNQSLSKEIAKFYDESKIKMTKEDFQNLILLFLESWYLKASFSQ
KFGAVEESKSQFKDEVYSLLNIFLKK
>Q5HKQ1 ~~~icaR~~~Biofilm operon icaADBC HTH-type negative transcriptional regulator IcaR~~~COG1309
MKDKIIDNAITLFSEKGYDGTTLDDISKSVNIKKASLYYHYDNKEEIYRKSVENCFNYFIDFLLRNHDDNYSIDGLYQFL
FKFIFDVDERYIKLYVQLSSAPEALNSEIKHHLQEINTTLHDELIKYYDPTHIALDKEDFINLILLFLETWYFRASFSQK
FGIIEDSKNRFKDQVYSLLNVFLKK
>A0A4P7TKK3 ~~~icaT~~~Invasion chromosome antigen T~~~
MLPSISINNTSAAYPESINENNNDEINGLVQEFKNLFNGKEGISTCIKHLLELIKNAIRVNDDPYRSNINNPSVTYIDIG
SNDTDHITIGIDNQEPIELPANYKDKELVRTIINDNIVEKTHDINNKEMIFSALKEIYDGDPGFIFDKISHKLRHTVTEF
DESGKSEPTDLFTWYGKDKKGDSLAIVIKNKNGNDYLSLGYYDQDDYHIQRGIRINGDSLTQYCSENARSASAWFESSKA
IMAESFATGSDHQVVNELNGERLREPNEVFKRLGRAIRYNFQVDDAKFRRDNVKEIISTLFANKVDVDHPENKYKDFKNL
EDKVEKRLQNRQTKYQNEINQLSALGVNFDDI
>P06620 ~~~inaZ~~~Ice nucleation protein~~~
MNLDKALVLRTCANNMADHCGLIWPASGTVESRYWQSTRRHENGLVGLLWGAGTSAFLSVHADARWIVCEVAVADIISLE
EPGMVKFPRAEVVHVGDRISASHFISARQADPASTSTSTLTPMPTAIPTPMPAVASVTLPVAEQARHEVFDVASVSAAAA
PVNTLPVTTPQNVQTATYGSTLSGDNHSRLIAGYGSNETAGNHSDLIAGYGSTGTAGSDSWLVAGYGSTQTAGGDSALTA
GYGSTQTAREGSNLTAGYGSTGTAGSDSSLIAGYGSTQTSGGDSSLTAGYGSTQTAQEGSNLTAGYGSTGTAGSDSSLIA
GYGSTQTSGGDSSLTAGYGSTQTAQEGSNLTAGYGSTGTAGVDSSLIAGYGSTQTSGSDSALTAGYGSTQTAQEGSNLTA
GYGSTGTAGSDSSLIAGYGSTQTSGSDSSLTAGYGSTQTAQEGSILTAGYGSTGTAGVDSSLIAGYGSTQTSGSDSALTA
GYGSTQTAQEGSNLTAGYGSTGTAGADSSLIAGYGSTQTSGSESSLTAGYGSTQTAREGSTLTAGYGSTGTAGADSSLIA
GYGSTQTSGSESSLTAGYGSTQTAQQGSVLTSGYGSTQTAGAASNLTTGYGSTGTAGHESFIIAGYGSTQTAGHKSILTA
GYGSTQTARDGSDLIAGYGSTGTAGSGSSLIAGYGSTQTASYRSMLTAGYGSTQTAREHSDLVTGYGSTSTAGSNSSLIA
GYGSTQTAGFKSILTAGYGSTQTAQERTSLVAGYGSTSTAGYSSSLIAGYGSTQTAGYESTLTAGYGSTQTAQENSSLTT
GYGSTSTAGYSSSLIAGYGSTQTAGYESTLTAGYGSTQTAQERSDLVTGYGSTSTAGYASSLIAGYGSTQTAGYESTLTA
GYGSTQTAQENSSLTTGYGSTSTAGFASSLISGYGSTQTAGYKSTLTAGYGSTQTAEYGSSLTAGYGSTATAGQDSSLIA
GYGSSLTSGIRSFLTAGYGSTLIAGLRSVLIAGYGSSLTSGVRSTLTAGYGSNQIASYGSSLIAGHESIQVAGNKSMLIA
GKGSSQTAGFRSTLIAGAGSVQLAGDRSRLIAGADSNQTAGDRSKLLAGNNSYLTAGDRSKLTGGHDCTLMAGDQSRLTA
GKNSVLTAGARSKLIGSEGSTLSAGEDSILIFRLWDGKRYRQLVARTGENGVEADIPYYVNEDDDIVDKPDEDDDWIEVK
>P16528 ~~~iclR~~~Transcriptional repressor IclR~~~COG1414
MVAPIPAKRGRKPAVATAPATGQVQSLTRGLKLLEWIAESNGSVALTELAQQAGLPNSTTHRLLTTMQQQGFVRQVGELG
HWAIGAHAFMVGSSFLQSRNLLAIVHPILRNLMEESGETVNMAVLDQSDHEAIIIDQVQCTHLMRMSAPIGGKLPMHASG
AGKAFLAQLSEEQVTKLLHRKGLHAYTHATLVSPVHLKEDLAQTRKRGYSFDDEEHALGLRCLAACIFDEHREPFAAISI
SGPISRITDDRVTEFGAMVIKAAKEVTLAYGGMR
>Q1LRY0 ~~~icmF~~~Fused isobutyryl-CoA mutase~~~COG1703
MTDLSDVSRTAAAKPPAVPGRGPANKVRFVTAASLFDGHDASINIMRRILQSQGCEVIHLGHNRSVQEVVTAALQEDVQG
IAISSYQGGHVEYFKYMIDLLREHGGEHIQVFGGGGGVIVPDEIRELQAYGVARIYSPEDGQRMGLAGMITDMAQRCDID
LTRYAPTTLDTVVAGDRRALAQLITALENGKADPELVSALHAQAKAAAVPVLGITGTGGAGKSSLTDELIRRFRLDQDDA
LSIAVISIDPSRRKSGGALLGDRIRMNAINHPNIFMRSLATREAGSEISQALPDVIAACKAARFDLVIVETSGIGQGDAA
IVPHVDLSLYVMTPEFGAASQLEKIDMLDFADFVAINKFDRKGAQDAWRDVAKQVQRNREQWHSRAEDMPVYGTQASRFN
DDGVTMLYQGLVGALGARGMSLKPGTLPNLEGRISTGQNVIVPPARSRYLAELADTVRAYHRRVVAQSKLARERQQLRAA
HDMLQGAGHESAALETLASERDVSLGAVERKLLAMWPQMQQAYSGDEYVVKIRDKEIRTGLISTTLSGTKIRKVVLPRFE
DEGEILKWLMRENVPGSFPYTAGVFAFKREGEDPTRMFAGEGDAFRTNRRFKLVSEGMEAKRLSTAFDSVTLYGEDPHER
PDIYGKVGNSGVSIATLEDMKVLYDGFDLTNPSTSVSMTINGPAPTILAMFMNTAIDQQIDRFRADNGRDPTADEEAKIR
AWVLQNVRGTVQADILKEDQGQNTCIFSTEFSLKVMGDIQEYFVHHQVRNFYSVSISGYHIAEAGANPISQLAFTLANGF
TYVEAYLARGMHIDDFAPNLSFFFSNGMDPEYSVLGRVARRIWAVTMRDKYGANDRSQKLKYHIQTSGRSLHAQEIDFND
IRTTLQALIAIYDNCNSLHTNAYDEAITTPTAESVRRALAIQLIINREWGVAKCENPNQGSFLIEELTDLVEEAVLQEFE
RIAERGGVLGAMETGYQRGKIQEESLYYEQLKHDGTLPIIGVNTFRNPNGDPTPQTLELARSSEDEKQSQLHRLTEFHGA
HQADAEAMLARLRQAVIDNRNVFAVLMDAVRVCSLGQITHALFEVGGQYRRNM
>Q5KUG0 ~~~icmF~~~Fused isobutyryl-CoA mutase~~~COG1703
MAHIYRPKHHVRFVTASSLFDGHDASINIMRRILQASGAEVIHLGHNRSVEEIVNAAIQEDVQGIAVSSYQGGHMEFFKY
MYDLLQERGASHIRIYGGGGGVIIPREIKELHEYGIARIFSPEDGRRLGLQGMINVMLEECDFPTVTVVTDELERLPSGD
VQAIARLITLCEYRAEGENKEAAAAAEAAIEQVKALEKRVPVLGITGTGGAGKSSLTDELVRRFLNEIPDIKIAILSVDP
TKQKTGGALLGDRIRMNSINSPRVYMRSLATRHSRTELSPAIRDAISVVKAAGFDLVIIETSGIGQGDAAITEVCDVSMY
VMTSEFGAPTQLEKIDMIDYADLIVINKFERKGSEDAKRQVQKQYQRSHQLFDRDVSEMPVYGTIASQFNDPGTNTLFVA
LVDTINKKAGTNWKTSLKTVANVEKHNVIIPNERRYYLREIAETVRSYHRRAEQQVEVARRLFQIEGAIEAAKERGEAED
VIRALETLKADYEAKLTPESKRILATWEETKAKYAAKQFVTKVRDKEIVTELTTKTLSGLDIPKVVLPKFKDYGEILRWV
YKENVPGSFPYTAGVFPFKRQGEDPKRQFAGEGTPERTNRRFHYLCKEDKAKRLSTAFDSVTLYGEDPDYRPDIFGKVGE
SGVSVCTLDDMKKLYKGFDLCDPLTSVSMTINGPAPILLAMFMNTAIDQQVEKKEAELGRPLTPEEYEQVKEWTLQTVRG
TVQADILKEDQGQNTCIFSTDFALKMMGDIQEYFIKHRVRNYYSVSISGYHIAEAGANPITQLAFTLANGFTYVEYYLSR
GMHIDDFAPNLSFFFSNGLDPEYSVIGRVARRIWAIVMREKYGANERSQKLKYHIQTSGRSLHAQEIDFNDIRTTLQALL
AIYDNCNSLHTNAYDEAITTPTEESVRRAMAIQLIITKEFGLTKNENPLQGSFIIEELTDLVEEAVLQEFERLNDRGGVL
GAMEMQYQRGKIQDESLYYETKKHTGELPIIGVNTFLNPNPPSEDELNNIQLARATYEEKETQIRNLREFQERNKDKAGP
ALERLKQVATSGGNIFEELMETVKVASLGQITRALYEVGGQYRRNM
>Q5Z110 ~~~icmF~~~Fused isobutyryl-CoA mutase~~~COG1703
MADSTLHQPAYPVRFVTSAALFDGHDAAINIMRRILQSQGAEVIHLGHNRAVHEVVAAAVEEDVQGVAVSSYQGGHVEYF
EYLASALRDAGAGHVRVFGGGGGVIVPEEIERLARSGVRIFSPEDGQRLGLPGMINELIQTCDVDLTGERPAVEAVLAGE
RTALARVITCLQQDALPAADRDALLAAARDRTVPVLGITGTGGSGKSSLTDELVRRLRTDQQDKLRVAILAVDPTRRRGG
GALLGDRIRMNSLDGTHVFFRSLATRGGHELPHDIDAVIAACKAAGYDLVILETPGIGQGDAAIVDHVDVAMYVMTPEFG
AASQLEKIDMLDFADVVAINKFERRGGADAVRDVSRQLLRNREAFGADPADMPVFGTSAATFNDDGVTALYQHLLELLGA
RGLPVDEGVLPRVQTRVSTRFAQIIPTARVRYLAEIADTVRTYHARTRDQVAAAQRVQRLELVAAELPGDAAVADLLARA
RAELDPENAALLARWPEVAESYRGPEQVVRVRDREIRTTLRRESLSGSSIPRVALPRFTDHGELLRFLRSENLPGHFPFT
AGVFPFKRDNEDPARMFAGEGDPFRTNRRFKVLSEHSEAKRLSTAFDSVTLYGRDPDERPDIYGKVGTSGVSIATVDDMK
ALYDGFDLTAPTTSVSMTINGPAPTILAFFLNTAIDQALDRFRAAEGREPTADEAADLRARTLATVRGTVQADILKEDQG
QNTCIFSTEFSLRMMADIQEWFVRNKVRNFYSVSISGYHIAEAGANPISQLAFTLANGFTYVEAYLARGMHIDDFAPNLS
FFFSNGMDPEYSVIGRVARRIWAIALRDKYGAAERSQKLKYHVQTSGRSLHAQEMNFNDIRTTLQALIAIYDNCNSLHTN
AYDEAVTTPTEDSVRRALAIQLIINREWGLAMNENPLQGSFIIDELTDLAEEAVLTEFERISERGGVLGAMETGYQRGKI
QDESMLYEHRKHDGSLPIIGVNTFRNPHGEPERTLELARATEREKQSQLDRVREFQRRHRTQAQAALARLEEVARTDENI
FEVLMDAARVCSLQQVTETFFTVGGQYRRNV
>Q146L7 ~~~icmF~~~Fused isobutyryl-CoA mutase~~~COG1703
MTDLSTPQRAGSHKLPAGRRLRFVTAAALFDGHDASINIMRRILQASGVEVIHLGHNRSVDEVATAALHEDADGVAVSSY
QGGHNEYFRYLVDLLRARGGERIKVFGGGGGVIVPEEIAGLERYGVEKIYSPQDGQRLGLQGMIDDMIARCAEGARAAAA
TGESQVGAWAAEFSEHGLPRFDSRDDVGVDRQGAVARNPSSEASRVAAAGRGDHLDRGVRAASTADTADTANTANTANTA
NTGSVADAADAADAADAADAADAASTASTASTASTASTASTAGIPDPASLVFRRLAQLISAFETAAIDVNTRDKLSALAE
VTAIPLLGITGTGGAGKSSLTDELIRRFRLDYGDALTIAVLAIDPSRRKSGGALLGDRIRMNAIGDWGGGARVYMRSMAT
REASSEISDSLPDALMLCKAAGFDLIVVETSGIGQGNAAIVPFVDESLYVMTPEFGAASQLEKIDMLDFASFVAINKFDR
KGARDALRDVAKQVQRNRADFAKSPEAMPVFGTIASRFNDDGVTALYRHVAEALRKHGLRSGGGRLAAPEDLRFSSGRNA
IVPPARVRYLADIAQTIHAYRERADAQARLARERWQLIEARRMLVETGEAARSTVATSASPGASASSKANACTSTSSKAN
ASPGANTTANSNASATSGTATPTDALNPTLSQLDTLITQRTASLGERERILLDTWPEIVAAYSGTEHIVRVRDREIRTAL
TVATLSGSEVRKVSLPKFVDHGEILRWLMLDNLPGYFPFTAGVFPFRRENEDPTRMFAGEGDPQRTNRRFKLLSEGMPAK
RLSTAFDSVTLYGEEPHERPDIYGKVGNSGVSVATLDDMKTLYDGFDLCAPETSVSMTINGPAPTILAMFFNVAIDQQIA
RTTQRQGRPLTEDELAATRRTALENVRGTVQADILKEDQGQNTCIFSTEFSLKVMGDIQAYFVEHGVRNFYSVSISGYHI
AEAGANPISQLAYTLANGFTYVEAYLARGMSIDDFAPNLSFFFSNGMDPEYTVLGRVARRIWAVAMRERYGANERSQKLK
YHVQTSGRSLHAQEIDFNDIRTTLQALIAIYDNCNSLHTNAFDEAITTPTEESVRRAVAIQLIINREWGLAKNQNPNQGS
FVIEELTDLVEEAVLAEFDRLTERGGVLGAMETGYQRGRIQDESMLYEHRKHDGSYPIVGVNTFLSAHPHEAPQPIALAR
STDDEKQSQLQRLRAFQAQHRDAAPAALERLKRAVIDDENVFAVLMDVVRVCSLGQITHALFEVGGQYRRNM
>Q5ZYD0 ~~~icmS~~~Type 4 adapter protein IcmS~~~
MERDISKCMAKIAASMNAKFYLNDRFVSFDEVFSETGLLPAIAKRADQLCSLCLGYGLGATYDESEGALLGIRVVFDEVT
PNVLRLLCMTDVMNELIQGGPSRDYTPLDELMYD
>Q5ZS31 ~~~icmW~~~Type 4 adapter protein IcmW~~~
MPDLSHEASAKYWFEYLDPMIYRVITFMESVENWTLDGNPELEEAMKQLGQELDDIEKIDLGLLAEEDKFIRIVGNIKSG
RGLRLLQAIDTVHPGSASRVLIHAEETSLSSSDPAGFFLKRNIVFERLRLLSRVFCQYRLKLVLRALEGDE
>Q7BCK4 ~~~icsA~~~Outer membrane autotransporter IcsA~~~
MNQIHKFFCNMTQCSQGGAGELPTVKEKTCKLSFSPFVVGASLLLGGPIAFATPLSGTQELHFSEDNYEKLLTPVDGLSP
LGAGEDGMDAWYITSSNPSHASRTKLRINSDIMISAGHGGAGDNNDGNSCGGNGGDSITGSDLSIINQGMILGGSGGSGA
DHNGDGGEAVTGDNLFIINGEIISGGHGGDSYSDSDGGNGGDAVTGVNLPIINKGTISGGNGGNNYGEGDGGNGGDAITG
SSLSVINKGTFAGGNGGAAYGYGYDGYGGNAITGDNLSVINNGAILGGNGGHWGDAINGSNMTIANSGYIISGKEDDGTQ
NVAGNAIHITGGNNSLILHEGSVITGDVQVNNSSILKIINNDYTGTTPTIEGDLCAGDCTTVSLSGNKFTVSGDVSFGEN
SSLNLAGISSLEASGNMSFGNNVKVEAIINNWAQKDYKLLSADKGITGFSVSNISIINPLLTTGAIDYTKSYISDQNKLI
YGLSWNDTDGDSHGEFNLKENAELTVSTILADNLSHHNINSWDGKSLTKSGEGTLILAEKNTYSGFTNINAGILKMGTVE
AMTRTAGVIVNKGATLNFSGMNQTVNTLLNSGTVLINNINAPFLPDPVIVTGNMTLEKNGHVILNNSSSNVGQTYVQKGN
WHGKGGILSLGAVLGNDNSKTDRLEIAGHASGITYVAVTNEGGSGDKTLEGVQIISTDSSDKNAFIQKGRIVAGSYDYRL
KQGTVSGLNTNKWYLTSQMDNQESKQMSNQESTQMSSRRASSQLVSSLNLGEGSIHTWRPEAGSYIANLIAMNTMFSPSL
YDRHGSTIVDPTTGQLSETTMWIRTVGGHNEHNLADRQLKTTANRMVYQIGGDILKTNFTDHDGLHVGIMGAYGYQDSKT
HNKYTSYSSRGTVSGYTAGLYSSWFQDEKERTGLYMDAWLQYSWFNNTVKGDGLTGEKYSSKGITGALEAGYIYPTIRWT
AHNNIDNALYLNPQVQITRHGVKANDYIEHNGTMVTSSGGNNIQAKLGLRTSLISQSCIDKETLRKFEPFLEVNWKWSSK
QYGVIMNGMSNHQIGNRNVIELKTGVGGRLADNLSIWGNVSQQLGNNSYRDTQGILGVKYTF
>P33546 2.3.1.-~~~icsB~~~N-epsilon-fatty acyltransferase IcsB~~~
MSLKISNFIDASNTKGPIRVEDTEHGPILIAQKFNLKDLFFRTLSTINAKINSQILNEQLKNYRLENQKSLLLFLNTLAS
EKSAESAFAAYEAAKNSIQHSFTGRDIKLMLNTAERFHGIGTAKNLERHLVFRCWGNRGITHLGHTSISIKNNLLQEPTH
TYLSWYPGGNVTKDTEINYLFEKRSGYSVDTYKQDKLNMISEQTAERLDAGQEVRNLLNSKQDQNNNKKIFFPRANQKKD
PYGYWGVSADKVYIPLSGDNKTKDGKISHNLFGLDETNMSKFICKKKADAFRQLANYKLISKSENCAGMALNVLKAGNSE
IYFPLPDVKLVATPNDVYAYANKVRQRIESLNQSYNEIMKYIESDFDLSRLTQLRRSYLKSFNKINLIHTPKTFKPLSIS
LYKHPTENVSSEDFDAVINACHSYLVKSAPSNMTRVLNELKTEATDKKEEIIEKSIKIIDYYNSLKSPDLGTKLYIHDLL
QINKLLLNNSHSNI
>O33641 3.4.23.-~~~icsP~~~Outer membrane protease IcsP~~~
MKLKFFVLALCVPAIFTTHATTNYPLFIPDNISTDISLGSLSGKTKERVYHPKEGGRKISQLDWKYSNATIVRGGIDWKL
IPKVSFGVSGWTTLGNQKASMVDKDWNNSNTPQVWTDQSWHPNTHLRDANEFELNLKGWLLNNLDYRLGLIAGYQESRYS
FNAMGGSYIYSENGGSRNKKGAHPSGERTIGYKQLFKIPYIGLTANYRHENFEFGAELKYSGWVLSSDTDKHYQTETIFK
DEIKNQNYCSVAANIGYYVTPSAKFYIEGSRNYISNKKGDTSLYEQSTNISGTIKNSASIEYIGFLTSAGIKYIF
>P9WMH1 ~~~ideR~~~Iron-dependent repressor IdeR~~~COG1321
MNELVDTTEMYLRTIYDLEEEGVTPLRARIAERLDQSGPTVSQTVSRMERDGLLRVAGDRHLELTEKGRALAIAVMRKHR
LAERLLVDVIGLPWEEVHAEACRWEHVMSEDVERRLVKVLNNPTTSPFGNPIPGLVELGVGPEPGADDANLVRLTELPAG
SPVAVVVRQLTEHVQGDIDLITRLKDAGVVPNARVTVETTPGGGVTIVIPGHENVTLPHEMAHAVKVEKV
>P41560 1.1.1.42~~~icdI~~~Isocitrate dehydrogenase [NADP] 1~~~
MTNKIIIPTTGDKITFIDGKLSVPNNPIIPYIEGDGIGVDVTPPMLKVVNAAVAKAYGGDRKIEWLEVYAGEKATKMYDS
ETWLPEETLNILQEYKVSIKGPLTTPVGGGMSSLNVAIRQMLDLYVCQRPVQWFTGVPSPVKRPSEVDMVIFRENTEDIY
AGIEYKAGSDKAKSVIKFLIEEMGASNIRFTENCGIGIKPVSKEGSQRLVRQAIQYAIDNNKDSVTLVHKGNIMKFTEGA
FKDWGYELAIEEFGASLLHGGPWCSLKNPNTGKEIIIKDVIADAMLQQVLLRPAEYSVIATLNLNGDYLSDALAAQVGGI
GIAPGANLGDEVAVFEATHGTAPKYAGKNKVNPGSVILSAEMMLRHMGWLEADLLLKGMSGAIQAKTVTYDFERLMDDAT
LVSCSAFGDCIIDHM
>P41561 1.1.1.42~~~icd2~~~Isocitrate dehydrogenase [NADP] 2~~~
MSTDNSKIIYTITDEAPALATYSLLPIIQAYTASSGINVETRDISLAGRILANFPKYLTKEQRIDDALAELGELAQTPEA
NIIKLPNISASIPQLEAVIKELQAKGYDLPHYPAEPQNEAEESIKLTYAKILGSAVNPVLREGNSDRRAPASVKQYARNN
PHSMGAWSKESKSHVAHMASGDFYGSEKSVTIDGATSVNIEFVAKNGDVTLLKSKLPLLDKEIIDASVMSKSALVEFFET
EINKAKEEDVLLSLHLKATMMKVSDPVMFGHAVRVFYKDVFAKHAATFEQLGVDADNGIGDVYAKIARLPAAQKEEIEAD
LQAVYATRPEMAMVDSDKGITNLHVPSDVIIDASMPAALRASGMMWGPDGKQKDTKFMIPDRNYAGVFSAVVDFCRENGA
FNPATMGTVPNVGLMAQKAEEYGSHDKTFTMKAAGTVRVVNSQGERLIEQEVAQGDIYRMCQVKDAPIQDWVKLAVTRAR
ATGTPTVFWLDENRGHDEQMIKKVNTYLADHDTTGLDIQILEPVKACEFTLARVAKGEDAISVTGNVLRDYLTDLFPILE
LGTSAKMLSIVPLMNGGGLFETGAGGSAPKHVQQFEKENHLRWDSLGEFLALAASLEHVAVTTGNARAQILADTLDAATG
KFLDTNKSPSRKVGELDNRGSHFYLAMYWAQALAAQTTDTELQASFSSVAQALTKQEEKIVAELNAAQGPAIDLNGYYFA
DTKLAEKAMRPSETFNTILSALL
>P16100 1.1.1.42~~~icd~~~Isocitrate dehydrogenase [NADP]~~~
MSTPKIIYTLTDEAPALATYSLLPIIKAFTGSSGIAVETRDISLAGRLIATFPEYLTDTQKISDDLAELGKLATTPDANI
IKLPNISASVPQLKAAIKELQQQGYKLPDYPEEPKTDTEKDVKARYDKIKGSAVNPVLREGNSDRRAPLSVKNYARKHPH
KMGAWSADSKSHVAHMDNGDFYGSEKAALIGAPGSVKIELIAKDGSSTVLKAKTSVQAGEIIDSSVMSKNALRNFIAAEI
EDAKKQGVLLSVHLKATMMKVSDPIMFGQIVSEFYKDALTKHAEVLKQIGFDVNNGIGDLYARIKTLPEAKQKEIEADIQ
AVYAQRPQLAMVNSDKGITNLHVPSDVIVDASMPAMIRDSGKMWGPDGKLHDTKAVIPDRCYAGVYQVVIEDCKQHGAFD
PTTMGSVPNVGLMAQKAEEYGSHDKTFQIPADGVVRVTDESGKLLLEQSVEAGDIWRMCQAKDAPIQDWVKLAVNRARAT
NTPAVFWLDPARAHDAQVIAKVERYLKDYDTSGLDIRILSPVEATRFSLARIREGKDTISVTGNVLRDYLTDLFPIMELG
TSAKMLSIVPLMSGGGLFETGAGGSAPKHVQQFLEEGYLRWDSLGEFLALAASLEHLGNAYKNPKALVLASTLDQATGKI
LDNNKSPARKVGEIDNRGSHFYLALYWAQALAAQTEDKELQAQFTGIAKALTDNETKIVGELAAAQGKPVDIAGYYHPNT
DLTSKAIRPSATFNAALAPLA
>P39126 1.1.1.42~~~icd~~~Isocitrate dehydrogenase [NADP]~~~COG0538
MAQGEKITVSNGVLNVPNNPIIPFIEGDGTGPDIWNAASKVLEAAVEKAYKGEKKITWKEVYAGEKAYNKTGEWLPAETL
DVIREYFIAIKGPLTTPVGGGIRSLNVALRQELDLFVCLRPVRYFTGVPSPVKRPEDTDMVIFRENTEDIYAGIEYAKGS
EEVQKLISFLQNELNVNKIRFPETSGIGIKPVSEEGTSRLVRAAIDYAIEHGRKSVTLVHKGNIMKFTEGAFKNWGYELA
EKEYGDKVFTWAQYDRIAEEQGKDAANKAQSEAEAAGKIIIKDSIADIFLQQILTRPNEFDVVATMNLNGDYISDALAAQ
VGGIGIAPGANINYETGHAIFEATHGTAPKYAGLDKVNPSSVILSGVLLLEHLGWNEAADLVIKSMEKTIASKVVTYDFA
RLMDGATEVKCSEFGEELIKNMD
>P50216 1.1.1.42~~~icd~~~Isocitrate dehydrogenase [NADP]~~~COG2838
MAKIIWTRTDEAPLLATYSLKPVVEAFAATAGIEVETRDISLAGRILAQFPERLTEDQKVGNALAELGELAKTPEANIIK
LPNISASVPQLKAAIKELQDQGYDIPELPDNATTDEEKDILARYNAVKGSAVNPVLREGNSDRRAPIAVKNFVKKFPHRM
GEWSADSKTNVATMDANDFRHNEKSIILDAADEVQIKHIAADGTETILKDSLKLLEGEVLDGTVLSAKALDAFLLEQVAR
AKAEGILFSAHLKATMMKVSDPIIFGHVVRAYFADVFAQYGEQLLAAGLNGENGLAAILSGLESLDNGEEIKAAFEKGLE
DGPDLAMVNSARGITNLHVPSDVIVDASMPAMIRTSGHMWNKDDQEQDTLAIIPDSSYAGVYQTVIEDCRKNGAFDPTTM
GTVPNVGLMAQKAEEYGSHDKTFRIEADGVVQVVSSNGDVLIEHDVEANDIWRACQVKDAPIQDWVKLAVTRSRLSGMPA
VFWLDPERAHDRNLASLVEKYLADHDTEGLDIQILSPVEATQLSIDRIRRGEDTISVTGNVLRDYNTDLFPILELGTSAK
MLSVVPLMAGGGLFETGAGGSAPKHVQQVQEENHLRWDSLGEFLALAESFRHELNNNGNTKAGVLADALDKATEKLLNEE
KSPSRKVGEIDNRGSHFWLTKFWADELAAQTEDADLAATFAPVAEALNTGAADIDAALLAVQGGATDLGGYYSPNEEKLT
NIMRPVAQFNEIVDALKK
>Q9ZH99 1.1.1.42~~~icd~~~Isocitrate dehydrogenase [NADP]~~~COG0538
MTELTGVSIVTYQHIKVPSQGEKITVNKAVLEVPDRPIIPFIEGDGIGIDIAPVMKNVVDAAVEKSYAGKRKIEWMEIYA
GEKATKVYGKDNWLPDETLEAIKEYQVAIKGPLTTPVGGGIRSLNVALRQQLDLYVCLRPVRYFTGVPSPVKTPEKVNMV
IFRENSEDIYAGIEWPAGSPEAVKLINFLQNEMGVKKIRFPETAGIGIKPVSKEGTSRLVRRAIQYAIDNDRDSVTLVHK
GNIMKFTEGAFKDWGYEVAVKEFGAKPLDGGPWHVFENPKTGQKITIKDVIADAFLQQILLRPAEYSVIATLNLNGDYIS
DALAAEVGGIGIAPGANLSDTVGLFEATHGTAPKYAGQDKVNPGSLILSAEMMLRYLGWKEAADLVVQGIEGAIESKTVT
YDFARLMTGAKEVSTSQFGKAIIKHIL
>P08200 1.1.1.42~~~icd~~~Isocitrate dehydrogenase [NADP]~~~COG0538
MESKVVVPAQGKKITLQNGKLNVPENPIIPYIEGDGIGVDVTPAMLKVVDAAVEKAYKGERKISWMEIYTGEKSTQVYGQ
DVWLPAETLDLIREYRVAIKGPLTTPVGGGIRSLNVALRQELDLYICLRPVRYYQGTPSPVKHPELTDMVIFRENSEDIY
AGIEWKADSADAEKVIKFLREEMGVKKIRFPEHCGIGIKPCSEEGTKRLVRAAIEYAIANDRDSVTLVHKGNIMKFTEGA
FKDWGYQLAREEFGGELIDGGPWLKVKNPNTGKEIVIKDVIADAFLQQILLRPAEYDVIACMNLNGDYISDALAAQVGGI
GIAPGANIGDECALFEATHGTAPKYAGQDKVNPGSIILSAEMMLRHMGWTEAADLIVKGMEGAINAKTVTYDFERLMDGA
KLLKCSEFGDAIIENM
>P9WKL1 1.1.1.42~~~icd~~~Isocitrate dehydrogenase [NADP]~~~COG0538
MSNAPKIKVSGPVVELDGDEMTRVIWKLIKDMLILPYLDIRLDYYDLGIEHRDATDDQVTIDAAYAIKKHGVGVKCATIT
PDEARVEEFNLKKMWLSPNGTIRNILGGTIFREPIVISNVPRLVPGWTKPIVIGRHAFGDQYRATNFKVDQPGTVTLTFT
PADGSAPIVHEMVSIPEDGGVVLGMYNFKESIRDFARASFSYGLNAKWPVYLSTKNTILKAYDGMFKDEFERVYEEEFKA
QFEAAGLTYEHRLIDDMVAACLKWEGGYVWACKNYDGDVQSDTVAQGYGSLGLMTSVLMTADGKTVEAEAAHGTVTRHYR
QYQAGKPTSTNPIASIFAWTRGLQHRGKLDGTPEVIDFAHKLESVVIATVESGKMTKDLAILIGPEQDWLNSEEFLDAIA
DNLEKELAN
>Q02NB5 1.1.1.42~~~icd~~~Isocitrate dehydrogenase [NADP]~~~
MGYQKIQVPATGDKITVNADMSLSVPKNPIIPFIEGDGIGVDISPVMIKVVDAAVEKAYKGERKIAWMEVYAGEKATQVY
DQDTWLPQETLDAVRDYVVSIKGPLTTPVGGGIRSLNVALRQQLDLYVCQRPVRWFEGVPSPVKKPGDVDMVIFRENSED
IYAGVEWKAGSPEAEKVIKFLTEEMGVKKIRFTENCGIGIKPVSQEGTKRLVRKALQYAVDNDRSSVTLVHKGNIMKFTE
GAFKDWGYEVARDEFGAELLDGGPWMQFKNPKTGKNVVVKDVIADAMLQQILLRPAEYDVIATLNLNGDYLSDALAAEVG
GIGIAPGANLSDSVAMFEATHGTAPKYAGQDKVNPGSLILSAEMMLRHMGWTEAADLIIKGTNGAIAAKTVTYDFERLMD
GATLLSCSEFGDAMIAKM
>P99167 1.1.1.42~~~icd~~~Isocitrate dehydrogenase [NADP]~~~
MTAEKITQGTEGLNVPNEPIIPFIIGDGIGPDIWKAASRVIDAAVEKAYNGEKRIEWKEVLAGQKAFDTTGEWLPQETLD
TIKEYLIAVKGPLTTPIGGGIRSLNVALRQELDLFTCLRPVRWFKGVPSPVKRPQDVDMVIFRENTEDIYAGIEFKEGTT
EVKKVIDFLQNEMGATNIRFPETSGIGIKPVSKEGTERLVRAAIQYAIDNNRKSVTLVHKGNIMKFTEGSFKQWGYDLAL
SEFGDQVFTWQQYDEIVENEGRDAANAAQEKAEKEGKIIIKDSIADIFLQQILTRPAEHDVVATMNLNGDYISDALAAQV
GGIGIAPGANINYETGHAIFEATHGTAPKYAGLNKVNPSSVILSSVLMLEHLGWQEAADKITDSIEDTIASKVVTYDFAR
LMDGAEEVSTSAFADELIKNLK
>P80046 1.1.1.42~~~icd~~~Isocitrate dehydrogenase [NADP]~~~COG0538
MYEKLQPPSVGSKITFVAGKPVVPNDPIIPYIRGDGTGVDIWPATELVINAAIAKAYGGREEINWFKVYAGDEACELYGT
YQIFPEDTLTAIKEYGVAIKGPLTTPVGGGIRSLNVALRQIFDLYTCVRPCRYYPGTPSPHKTPEKLDIIVYRENTEDIY
LGIEWAEGTEGAKKLIAYLNDELIPTTPALGKKQIRLDSGIGIKPISKTGSQRLVRRAILHAKRLPKAKQMVTLVHKGNI
MKFTEGPFRDWGYELATTEFRAECVTERESWICGNKESNPDLTIEANAHMIDPGYDTLTEEKQAVIKQEVEQVLNSIWES
HGNGQWKEKVMVNDRIADSIFQQIQTRPDEYSILATMNLNGDYLSDAAAAVVGGLGMGPGANIGDSAAIFEATHGTAPKH
AGLDRINPGSVILSGVMMLEFMGWQEAADLIKKGIGAAIANREVTYDLARLMEPKVDKPLKCSEFAQAIVSHFDD
>P33197 1.1.1.42~~~icd~~~Isocitrate dehydrogenase [NADP]~~~COG0473
MPLITTETGKKMHVLEDGRKLITVIPGDGIGPECVEATLKVLEAAKAPLAYEVREAGASVFRRGIASGVPQETIESIRKT
RVVLKGPLETPVGYGEKSANVTLRKLFETYANVRPVREFPNVPTPYAGRGIDLVVVRENVEDLYAGIEHMQTPSVAQTLK
LISWKGSEKIVRFAFELARAEGRKKVHCATKSNIMKLAEGTLKRAFEQVAQEYPDIEAVHIIVDNAAHQLVKRPEQFEVI
VTTNMNGDILSDLTSGLIGGLGFAPSANIGNEVAIFEAVHGSAPKYAGKNVINPTAVLLSAVMMLRYLEEFATADLIENA
LLYTLEEGRVLTGDVVGYDRGAKTTEYTEAIIQNLGKTPRKTQVRGYKPFRLPQVDGAIAPIVPRSRRVVGVDVFVETNL
LPEALGKALEDLAAGTPFRLKMISNRGTQVYPPTGGLTDLVDHYRCRFLYTGEGEAKDPEILDLVSRVASRFRWMHLEKL
QEFDGEPGFTKAQGED
>P50740 5.3.3.2~~~fni~~~Isopentenyl-diphosphate delta-isomerase~~~COG1304
MTRAERKRQHINHALSIGQKRETGLDDITFVHVSLPDLALEQVDISTKIGELSSSSPIFINAMTGGGGKLTYEINKSLAR
AASQAGIPLAVGSQMSALKDPSERLSYEIVRKENPNGLIFANLGSEATAAQAKEAVEMIGANALQIHLNVIQEIVMPEGD
RSFSGALKRIEQICSRVSVPVIVKEVGFGMSKASAGKLYEAGAAAVDIGGYGGTNFSKIENLRRQRQISFFNSWGISTAA
SLAEIRSEFPASTMIASGGLQDALDVAKAIALGASCTGMAGHFLKALTDSGEEGLLEEIQLILEELKLIMTVLGARTIAD
LQKAPLVIKGETHHWLTERGVNTSSYSVR
>P99172 5.3.3.2~~~fni~~~Isopentenyl-diphosphate delta-isomerase~~~
MSDFQREQRKNEHVEIAMAQSDAMHSDFDKMRFVHHSIPSINVNDIDLTSQTPDLTMAYPIYINAMTGGSEWTKNINEKL
AVVARETRLAMAVGSTHAALRNPRMAETFTIARKMNPEGMIFSNVGADVPVEKALEAVELLEAQALQIHVNSPQELVMPE
GNREFVTWLDNIASIVSRVSVPVIIKEVGFGMSKELMHDLQQIGVKYVDVSGKGGTNFVDIENERRANKDMDYLSSWGQS
TVESLLETTAYQSEISVFASGGLRTPLDAIKSLALGAKATGMSRPFLNQVENNGIAHTVAYVESFIEHMKSIMTMLDAKN
IDDLTQKQIVFSPEIMSWIEQRSLNIHRG
>Q9KWG2 5.3.3.2~~~fni~~~Isopentenyl-diphosphate delta-isomerase~~~
MTSAQRKDDHVRLAIEQHNAHSGRNQFDDVSFVHHALAGIDRPDVSLATSFAGISWQVPIYINAMTGGSEKTGLINRDLA
TAARETGVPIASGSMNAYIKDPSCADTFRVLRDENPNGFVIANINATTTVDNAQRAIDLIEANALQIHINTAQETPMPEG
DRSFASWVPQIEKIAAAVDIPVIVKEVGNGLSRQTILLLADLGVQAADVSGRGGTDFARIENGRRELGDYAFLHGWGQST
AACLLDAQDISLPVLASGGVRHPLDVVRALALGARAVGSSAGFLRTLMDDGVDALITKLTTWLDQLAALQTMLGARTPAD
LTRCDVLLHGELRDFCADRGIDTRRLAQRSSSIEALQTTGSTR
>Q8DUI9 5.3.3.2~~~fni~~~Isopentenyl-diphosphate delta-isomerase~~~COG1304
MTNRKDDHIKYALDYRSPYNSFDDIELIHHSLPDYDLAEIDLSTHFAGQDFDFPFYINAMTGGSQKGKEVNEKLAQVADT
CGLLFVTGSYSTALKNPDDTSYQVKKSRPHLLLATNIGLDKPYQAGLQAVRDLQPLFLQVHINLMQELLMPEGEREFRSW
KKHLSDYAKKLQLPFILKEVGFGMDVKTIQTAIDLGVKTVDISGRGGTSFAYIENRRGGNRSYLNQWGQTTAQVLLNAQP
LMDKVEILASGGIRHPLDIIKALVLGAKAVGLSRTMLELVEQHSVHEVIAIVNGWKEDLRLIMCALNCQTIAELRNVDYL
LYGRLREGQRQ
>B2ILS5 5.3.3.2~~~fni~~~Isopentenyl-diphosphate delta-isomerase~~~
MTTNRKDEHILYALEQKSSYNSFDEVELIHSSLPLYNLDEIDLSTEFAGRKWDFPFYINAMTGGSNKGREINQKLAQVAE
SCGILFVTGSYSAALKNPTDDSFSVKSSHPNLLLGTNIGLDKPVELGLQTVEEMNPVLLQVHVNVMQELLMPEGERKFRS
WQSHLADYSKQIPVPIVLKEVGFGMDAKTIERAYEFGVRTVDLSGRGGTSFAYIENRRSGQRDYLNQWGQSTMQALLNAQ
EWKDKVELLVSGGVRNPLDMIKCLVFGAKAVGLSRTVLELVETYTVEEVIGIVQGWKADLRLIMCSLNCATIADLQKVDY
LLYGKLKEAKDQMKKA
>Q746I8 5.3.3.2~~~fni~~~Isopentenyl-diphosphate delta-isomerase~~~COG1304
MNIRERKRKHLEACLEGEVAYQKTTTGLEGFRLRYQALAGLALGEVDLTTPFLGKTLKAPFLIGAMTGGEENGERINLAL
AEAAEALGVGMMLGSGRILLERPEALRSFRVRKVAPKALLIANLGLAQLRRYGRDDLLRLVEALEADALAFHVNPLQEAV
QRGDTDFRGLVERLAELLPLPFPVMVKEVGHGLSREAALALRDLPLAAVDVAGAGGTSWARVEEWVRFGEVRHPELCEIG
IPTARAILEVREVLPHLPLVASGGVYTGTDGAKALALGADLLAVARPLLRPALEGAERVAAWIGDYLEELRTALFAIGAK
NPKEARGRVERV
>Q31L64 ~~~idiA~~~Iron deficiency-induced protein A~~~COG1840
MSESMFSRRDFLLGGTALAGTLLLDSFGDWRRRAEAAEGEVNLYSGRHYNTDNQIYREFTQKTGIKVNLIEGEADALLAR
LKSEGSRSPADVFITVDAGRLWQATQANLLRPLTQAQAPKLYQAVPANLRDPQGRWFALSKRARVIMYNRDRVNASQLST
YEDLANPKWRNQILVRSSSNVYNLSLTGEMIAADGAAKTEAWARGLVQNFARQPQGGDTPQILACAAGVGSLAIANTYYL
VRLFKSKKAEEREAARKIKVFFPNQKGRGTHVNISGAGIVRTAPNPRAAQLLLEYLLSSQAQAVFARGNGEYPVLRGVSL
DPILAGFGQFKESKISASVFGANNAQALQLMDRAGWK
>Q5N0R0 ~~~idiA~~~Iron deficiency-induced protein A~~~COG1840
MSESMFSRRDFLLGGTALAGTLLLDSFGDWRRRAEAAEGEVNLYSGRHYNTDNQIYREFTQKTGIKVNLIEGEADALLAR
LKSEGSRSPADVFITVDAGRLWQATQANLLRPLTQAQAPKLYQAVPANLRDPQGRWFALSKRARVIMYNRDRVNASQLST
YEDLANPKWRNQILVRSSSNVYNLSLTGEMIAADGAAKTEAWARGLVQNFARQPQGGDTPQILACAAGVGSLAIANTYYL
VRLFKSKKAEEREAARKIKVFFPNQKGRGTHVNISGAGIVRTAPNPRAAQLLLEYLLSSQAQAVFARGNGEYPVLRGVSL
DPILAGFGQFKESKISASVFGANNAQALQLMDRAGWK
>Q8DLH9 ~~~idiA~~~Iron deficiency-induced protein A~~~COG1840
MEKVGRRVFLGMGAAATAYVTHHLWNQNAESSYAQQSSGGVINVYSARHYDTDKALYNTFTQQTGIRVNIIEAEADALIE
RIRSEGSRTPADVLITVDAGRLWRAQEAGILQPIQSRVLNSVVPANLREPQGHWFGLSRRVRVLIYNKSRVNPSQLSTYE
DLANPKWRRQILTRSSSNIYNQSLTGSLLAIHGAQKTEQWARGLVQNFARPPEGNDTAQIRACAEGVGSVAIANHYYLAR
LIASDKEQDRAVAAKVGLFFPNQRDRGAHVNISGAGVVAGAPNRQGAIRFLEYLVSPKAQEMFAMANFEYPVRAGVPVHP
IVKQFGNFRGQNVNAAVFGRNNAEALRIMDRAGWR
>Q46822 5.3.3.2~~~idi~~~Isopentenyl-diphosphate Delta-isomerase~~~COG1443
MQTEHVILLNAQGVPTGTLEKYAAHTADTRLHLAFSSWLFNAKGQLLVTRRALSKKAWPGVWTNSVCGHPQLGESNEDAV
IRRCRYELGVEITPPESIYPDFRYRATDPSGIVENEVCPVFAARTTSALQINDDEVMDYQWCDLADVLHGIDATPWAFSP
WMVMQATNREARKRLSAFTQLK
>P9WKK5 5.3.3.2~~~idi~~~Isopentenyl-diphosphate Delta-isomerase~~~COG1443
MTRSYRPAPPIERVVLLNDRGDATGVADKATVHTGDTPLHLAFSSYVFDLHDQLLITRRAATKRTWPAVWTNSCCGHPLP
GESLPGAIRRRLAAELGLTPDRVDLILPGFRYRAAMADGTVENEICPVYRVQVDQQPRPNSDEVDAIRWLSWEQFVRDVT
AGVIAPVSPWCRSQLGYLTKLGPCPAQWPVADDCRLPKAAHGN
>P26173 5.3.3.2~~~idi~~~Isopentenyl-diphosphate Delta-isomerase~~~COG1443
MAEEMIPAWVEGVLQPVEKLEAHRKGLRHLAISVFVTRGNKVLLQQRALSKYHTPGLWANTCCTHPYWGEDAPTCAARRL
GQELGIVGLKLRHMGQLEYRADVNNGMIEHEVVEVFTAEAPEGIEPQPDPEEVADTEWVRIDALRSEIHANPERFTPWLK
IYIEQHRDMIFPPVTA
>Q8ZM82 5.3.3.2~~~idi~~~Isopentenyl-diphosphate Delta-isomerase~~~
MTEEHVVLLDEQDKPSGTLEKYAAHTLNTPLHLAFSCWLFNEDGQLLVTRRSLSKKAWPGVWTNSVCGHPQQGETTEEAI
IRRCRFELGVEITDLTPVYPHFSYRATDPNGIVENEVCPVFAARATSVLQVNSEEVMDYQWSEFKSVWKSLLATPWAFSP
WMVMQASDEQARERLLNYCQR
>P0DPC6 ~~~idlP~~~iraD leader peptide~~~
MENEHQYSGARCSGQAAYVAKRQECAK
>P39346 1.1.1.264~~~idnD~~~L-idonate 5-dehydrogenase (NAD(P)(+))~~~COG1063
MQVKTQSCVVAGKKTVAVTEQTIDWNNNGTLVQITRGGICGSDLHYYQEGKVGNFMIKAPMVLGHEVIGKVIHSDSSELH
EGQTVAINPSKPCGHCKYCIEHNENQCTDMRFFGSAMYFPHVDGGFTRYKMVETSQCVPYPAKADEKVMAFAEPLAVAIH
AAHQAGELQGKRVFISGVGPIGCLIVSAVKTLGAAEIVCADVSPRSLSLGKEMGADVLVNPQNDDMDHWKAEKGYFDVSF
EVSGHPSSVNTCLEVTRARGVMVQVGMGGAMAEFPMMTLIGKEISLRGSFRFTSEFNTAVSWLANGVINPLPLLSAEYPF
TDLEEALRFAGDKTQAAKVQLVF
>P0A9P9 1.1.1.69~~~idnO~~~5-keto-D-gluconate 5-reductase~~~COG1028
MNDLFSLAGKNILITGSAQGIGFLLATGLGKYGAQIIINDITAERAELAVEKLHQEGIQAVAAPFNVTHKHEIDAAVEHI
EKDIGPIDVLVNNAGIQRRHPFTEFPEQEWNDVIAVNQTAVFLVSQAVTRHMVERKAGKVINICSMQSELGRDTITPYAA
SKGAVKMLTRGMCVELARHNIQVNGIAPGYFKTEMTKALVEDEAFTAWLCKRTPAARWGDPQELIGAAVFLSSKASDFVN
GHLLFVDGGMLVAV
>P39344 ~~~idnT~~~Gnt-II system L-idonate transporter~~~COG2610
MPLIIIAAGVALLLILMIGFKVNGFIALVLVAAVVGFAEGMDAQAVLHSIQNGIGSTLGGLAMILGFGAMLGKLISDTGA
AQRIATTLIATFGKKRVQWALVITGLVVGLAMFFEVGFVLLLPLVFTIVASSGLPLLYVGVPMVAALSVTHCFLPPHPGP
TAIATIFEANLGTTLLYGFIITIPTVIVAGPLFSKLLTRFEKAPPEGLFNPHLFSEEEMPSFWNSIFAAVIPVILMAIAA
VCEITLPKTNTVRLFFEFVGNPAVALFIAIVIAIFTLGRRNGRTIEQIMDIIGDSIGAIAMIVFIIAGGGAFKQVLVDSG
VGHYISHLMTGTTLSPLLMCWTVAALLRIALGSATVAAITTAGVVLPIINVTHADPALMVLATGAGSVIASHVNDPGFWL
FKGYFNLTVGETLRTWTVMETLISIMGLLGVLAINAVLH
>E2GIN1 1.14.11.45~~~ido~~~L-isoleucine-4-hydroxylase~~~
MKMSGFSIEEKVHEFESKGFLEISNEIFLQEEENHSLLTQAQLDYYNLEDDAYGECRARSYSRYIKYVDSPDYILDNSND
YFQSKEYNYDDGGKVRQFNSINDSFLCNPLIQNIVRFDTEFAFKTNIIDKSKDLIIGLHQVRYKATKERPSFSSPIWLHK
DDEPVVFLHLMNLSNTAIGGDNLIANSPREINQFISLKEPLETLVFGQKVFHAVTPLGTECSTEAFRDILLVTFSYKETK
>P0DV66 1.-.-.-~~~idrA~~~Iodate reductase subunit IdrA~~~
MSENIKQGGAGTFMQAPQDSVPLPPKDAEVMTTACDYCTVACGYKVYRWPVGKEGGMKAKDNAFNADFPHQQIFGAWASP
AQHNIVNHNGRSHHVLVLPDRDTTVVNPGGNHSIRGGTLAQKCYNPSNRTSERLLYPMIRVRGTLMPVSWDLATEVMADI
SQYILAKYGEHAWAMKTYSYQYFENTYAITKLGMTSIGTPAFAWHDKASATNDATGLDDAGVNSFASSDQDWADCEVAFL
SGVDPYETKTTLFTQWMMPGDKKFIFVTPHRTMGVAWAESTGRGMWLPIIPGTDTVLHLALARIIVENGWQDQAFIDKWV
ANKWEVDSGYGRGTRNTGWQWRTTWGKWQSDWKDYSAWILKQKEGELETAAKITGLRAEDIRKAAEWIAKPKADGTRVKA
SFMLEKGNYWTNNYMNSASLASLGLICGSGNRPGQMISRGGGHQRGGMSAGGGSGWLSPEKYPGRRKKSFNLDRWMMNGN
VRFAWVIGTTWTAAMMASQALQDKMFSLTRGNPHQISSLDRKAIFETLKQRVDSGGTVIANSDIYPCVPVGTEYADIVLP
AATWGEDNFTRCNSERRLRLYSKFYDAPGEAKPDWWIIAKFAQKMGYDKDGSYQWKNSNDVFEEAARFGRNGVLNYHPLV
VKAKEKGVKGHELLRTYGTDGIQTPIRMKDGELVGTQRLHDPANDWDEVEGQEVKRKWLYAFGTHSGKAILLKTPWDYPG
WSQFYKAALPRKEKGEVWVTNGRVNETWQSGFDDLRKPYLSQRWPYPMLIMHPDEAKPRGIESGDFVQVYNDTVYIQMGE
PQGVKEDDLYFDTLMKNGHIKTTDGQFVAVAIVSEEMRPGVVMANFNYPQAPANSVVHAVPDPMTNNYRYKLGRGVLTKV
GESPYKHSFTSMTLKPRDIV
>A0A391NTR7 1.-.-.-~~~idrA~~~Iodate reductase subunit IdrA~~~
MSKPDEYLSSNSVPLPPQDADVLTTACDYCIVACGYKVYRWPVGKEGGAKASENALGADFPHQMMMGAWASPAQHNVVSY
RDQPHHVVVIADKDATVVNPGGNHSIRGGTLAEKCYNPSNRTRERLQHPMIRVNGKLTPVSWDLATEVMADISQYIIAKY
GEHAWAVKSHSYQYFENVYAITKLAMTSIGTPAFAWHDKCSATNDATGLDDAGIDSFASSYEDWADCEVAFLSGVDPYET
KTTLFTSHMMPGDKKFVFVTPHMTMGVAWSVKAGRGLWLPIIPGTDTVLHMALARIIIENDWQDQPFIDKWIANSWEVDS
GYGRGTRNTGWQWRTTWGTWQSDWQDYRKFILAQEESKLDVAAQITGLSADDIRTAAEWIAKPKADGSHPKTSFMCEKGN
YWSNNYMNSASFASLGLICGSGNRKGRMISRGGGHQRGGLSAGGNSEWLSPEKYPGRRKKSFNLDLWLMEGNIRFAWVIG
TTWVAAMMGSNALEAKMRSLTAESPHQIKSLDRAAIFETLKARVDSGGMVMANSDLYPVVPVGTDFADIVYPVASWGEDN
FTRCNSERRLRLYSKFYDAPGEAKPDWWIVQKFAQKMGLDKDGGYSWKDSNDVFEEVARFSRDGVLNYHPLAEKAKASGI
KAQELLRGYGTTGIQTPIRERGGELVGTVRLHDPDNDWGEIEGSTVHTKALVAFNTHSGKAILLKSPWQYAGWIQFYEAI
KPRAAKGEVWVTNGRVNETWQSGFDDRRKPYLSQRWPEGFIFINPEDARKKGIESGDYVEVVNDTVYIQTGQPQGVLDAD
LTFNQLMADGHIKITTGRFKTIAVVSDEMRPGVCQANFNVPSSPANAVVSAVPDPMTNNYRYKLGRGVLNKVGESPYKHN
FTQMSLKPRNIV
>P0DV67 1.-.-.-~~~idrB~~~Iodate reductase subunit IdrB~~~
MTTHPIHLHHDDPAHGGERACMSRRSFLLAGGAMVTLASLPGTAVAALKALKADYPAVKIGKLSRLKTGEPLEFAYPYPD
VNNILVKLGAEAGGGVGPQADVVAFNQQCTHMGGPLQGTYKAKHQALGPCPLHLTTFDLTRHGMVISGHATESLPQIVLE
VRGDDIYAVGVQGLIYGYSSNRAGR
>A0A391NZA8 1.-.-.-~~~idrB~~~Iodate reductase subunit IdrB~~~
MSENIIPVRAVPAHDHEHDGERACMSRRRFLLFGGTSVALLSIASLPGVAQVMQALKADYARQRIGSLSALKTGEPLDFN
YPYPDVRNILVKLGVAAGGGIGADKDIVAFNQQCTHMGGPLDGTYKAEHQILGPCPLHLTTFDLTRHGMVASGHATESLP
QIVLEVQGDDIYAIGVLGLVYGFDSMNDVQPA
>P0DV68 1.11.1.5~~~idrP1~~~Cytochrome-c peroxidase IdrP1~~~
MGHIRSIRLALAVAAVCTAASAAAGDAKFPPLGPLPPVPVPADNPMTADKVALGKQLFWDNRLSGDGSTPCVSCHLPALG
WGDGGAISRGYPGTKHWRNSQTIVNSAYYNKLFWAGSVTSLEAQAPSAAEGGVAGNGDRSLMEMRLRFIPEYVAAFKNVF
GADWPRMTQAYAAIAAYQRTVVSDATRVPFDRWQAGDKAAMSAEAQRGYALFSGKAGCIACHNGPLASDQRFYNLGLPEH
PDLAEDPLLQITHRWEQYQKGTTEDGYRHADRDKGYYYQTKNPKDIGKFRTPSLREVKYTGPYMHNGTLATLDEVVAFYN
AGGGTAPGKTDKLKPLGLTEQESKDLVAFVEALSMTEPLIHDDPKLPGDYQPLATQ
>A0A391NGM7 1.11.1.5~~~idrP1~~~Cytochrome-c peroxidase IdrP1~~~
MNNRKPLQLSLLVASLAVAFTASATNADAHPPLAPLPPVPVPKDNPQSAEKIALGKQLFWDYRLSGDGSMPCVSCHLPAL
GWGDGGQISRGYPGTKHWRNSQTILNSAYYNKLFWEGSVNSLEEQAPSAAEGAVAGNGDPSVMEMRLRFVPEYVDAFKNV
FGSQWPRMNDAYRAIASYQRTVVSDASKVPFDRYANGDKNALDTSQKRGMALFNGKAGCVQCHNGPLASDQKYYDLGLPD
FAGFVDDPLYQVTHRWEHYQKGVSEPRYRAANMDYGLYYVTKNPKDVGKFRTPSLREAKYTAPYMHNGVFTSLQEVVDFY
DRGGGSGTSKSELLKPLKLAAQEKQDLIAFIEALSMSEPLLHDDPTLPGEYQPLPAPIK
>P0DV69 1.11.1.5~~~idrP2~~~Cytochrome-c peroxidase IdrP2~~~
MTTHQSIRRLSRIAALVGLAFVAGTVAAADGKAELQALPEAKAGNADMVELGKHLFFDTRLSGDMGVSCASCHDPAKGFS
DGMPLSAGYPSVEYFRNAPTLINSRFKNVFMWDGRLDGADMGTLVRDMLTEAHTMNMDSRLMQERLSQVPEYVAMWQKFR
KDDINGMRVYGVVGEYVKTLVSQNAPIDRFLKGDGSALTSQQKDGYEIFTGKGGCVACHNGPLGSDGQVHNTGVPENPEV
LKNPNRTVTLLRHYATSGMPNYMNARTDLGHYAISKDPADMNKFATPSLRELKYTAPYMHNGMLTTLDQVVDFYNQGGGQ
GSELTPLGLSGSEKKALVAFLEALSGEPLNVVAPTLPDYQPRQFGKN
>A0A391NKV7 1.11.1.5~~~idrP2~~~Cytochrome-c peroxidase IdrP2~~~
MKWHRGRLTQTLGAMGLTATLTVAAQAAGQGDMLDLAPMPPAKAGNPAMIELGKQFFFDRRLSGDWGVSCASCHDPAKGW
GDGLALSKGYPSMEYFRNSPTVLNAAHRKRFLWDGRLDGADPGTLARDMITEAHTMNMDGRLMQERLQQVPEYAALWQKW
RNDDINGMRVFNAVGEFITSLETRNAPFDDFAKGDSTAITKEAQHGYALFKGKAGCVSCHNGPIGSDGKLHKTGVPEHPD
VLNNPLRTITMLRHYATSGMPNYMSARSDVGAYAISKDERDVGKFQTAQLRDLKYTAPYMHNGVFDTLEEVVAFYNQGGG
EGSALSPLTLSTAEQQALVAFLLTLSGDPLIVEDPGQPDMQPRVFGKN
>C3VA26 1.13.11.88~~~iem~~~Isoeugenol monooxygenase~~~
MARLNRNDPQLVGTLLPTRIEADLFDLEVDGEIPKSINGTFYRNTPEPQVTPQKFHTFIDGDGMASAFHFEDGHVDFISR
WVKTARFTAERLARKSLFGMYRNPYTDDTSVKGLDRTVANTSIISHHGKVLAVKEDGLPYELDPRTLETRGRFDYDGQVT
SQTHTAHPKYDPETGDLLFFGSAAKGEATPDMAYYIVDKHGKVTHETWFEQPYGAFMHDFAITRNWSIFPIMPATNSLSR
LKAKQPIYMWEPELGSYIGVLPRRGQGSQIRWLKAPALWVFHVVNAWEVGTKIYIDLMESEILPFPFPNSQNQPFAPEKA
VPRLTRWEIDLDSSSDEIKRTRLHDFFAEMPIMDFRFALQCNRYGFMGVDDPRKPLAHQQAEKIFAYNSLGIWDNHRGDY
DLWYSGEASAAQEPAFVPRSPTAAEGDGYLLTVVGRLDENRSDLVILDTQDIQSGPVATIKLPFRLRAALHGCWVPRP
>A5HV13 1.13.11.88~~~iso~~~Isoeugenol monooxygenase~~~
MATFDRNDPQLAGTMFPTRIEANVFDLEIEGEIPRAINGSFFRNTPEPQVTTQPFHTFIDGDGLASAFHFEDGQVDFVSR
WVCTPRFEAERSARKSLFGMYRNPFTDDPSVEGIDRTVANTSIITHHGKVLAAKEDGLPYELDPQTLETRGRYDYKGQVT
SHTHTAHPKFDPQTGEMLLFGSAAKGERTLDMAYYIVDRYGKVTHETWFKQPYGAFMHDFAVTRNWSIFPIMPATNSLER
LKAKQPIYMWEPERGSYIGVLPRRGQGKDIRWFRAPALWVFHVVNAWEEGNRILIDLMESEILPFPFPNSQNLPFDPSKA
VPRLTRWEIDLNSGNDEMKRTQLHEYFAEMPIMDFRFALQDHRYAYMGVDDPRRPLAHQQAEKIFAYNSLGVWDNHRKDY
ELWFTGKMSAAQEPAFVPRSPDAPEGDGYLLSVVGRLDEDRSDLVILDTQCLAAGPVATVKLPFRLRAALHGCWQSKN
>P20458 ~~~infA~~~Translation initiation factor IF-1~~~COG0361
MAKDDVIEVEGTIVETLPNAMFKVELENGHTVLAHVSGKIRMHFIRILPGDKVTVELSPYDLTRGRITYRYK
>Q2YPC1 ~~~infA~~~Translation initiation factor IF-1~~~
MAKEEVLEFPGVVTELLPNAMFRVKLENEHEIIAHTAGRMRKNRIRVLAGDKVLVEMTPYDLTKGRITYRFK
>Q2SU48 ~~~infA~~~Translation initiation factor IF-1~~~
MAKDDVIQMQGEVIENLPNATFRVKLENGHVVLGHISGKMRMHYIRILPGDKVTVELTPYDLSRARIVFRAK
>Q18CI2 ~~~infA~~~Translation initiation factor IF-1~~~COG0361
MAKKDVIELEGTVSEALPNAMFKVKLENGHEILCHISGKLRMNFIRILEGDKVNVELSPYDLTRGRITWRKK
>P69222 ~~~infA~~~Translation initiation factor IF-1~~~COG0361
MAKEDNIEMQGTVLETLPNTMFRVELENGHVVTAHISGKMRKNYIRILTGDKVTVELTPYDLSKGRIVFRSR
>Q50298 ~~~infA~~~Translation initiation factor IF-1~~~
MQPKFNNQAKQDKLVLTGKILEIIHGDKFRVLLENNVEVDAHLAGKMRMRRLRILPGDLVEVEFSPYDLKLGRIIGRK
>A0QSL3 ~~~infA~~~Translation initiation factor IF-1~~~COG0361
MAKKDGAIEVEGRVIEPLPNAMFRIELENGHKVLAHISGKMRQHYIRILPEDRVVVELSPYDLSRGRIVYRYK
>P9WKK3 ~~~infA~~~Translation initiation factor IF-1~~~COG0361
MAKKDGAIEVEGRVVEPLPNAMFRIELENGHKVLAHISGKMRQHYIRILPEDRVVVELSPYDLSRGRIVYRYK
>P65116 ~~~infA~~~Translation initiation factor IF-1~~~
MSKEDSFEMEGTVVDTLPNTMFRVELENGHVVTAHISGKMRKNYIRILTGDKVRVELTPYDLSKGRITYRAR
>Q2FW28 ~~~infA~~~Translation initiation factor IF-1~~~COG0361
MAKQDVIELEGTVLDTLPNAMFKVELENGHEILAHVSGKIRMNYIRILPGDKVTVEMSPYDLTRGRITYRYK
>P65119 ~~~infA~~~Translation initiation factor IF-1~~~
MAKQDVIELEGTVLDTLPNAMFKVELENGHEILAHVSGKIRMNYIRILPGDKVTVEMSPYDLTRGRITYRYK
>P65121 ~~~infA~~~Translation initiation factor IF-1~~~COG0361
MAKDDVIEVEGKVVDTMPNAMFTVELENGHQILATVSGKIRKNYIRILAGDRVTVEMSPYDLTRGRITYRFK
>Q5SHR1 ~~~infA~~~Translation initiation factor IF-1~~~COG0361
MAKEKDTIRTEGVVTEALPNATFRVKLDSGPEILAYISGKMRMHYIRILPGDRVVVEITPYDPTRGRIVYRK
>Q8KLI6 ~~~infA~~~Translation initiation factor IF-1~~~
MAKEKDTIRTEGVVTEALPNATFRVKLDSGPEILAYISGKMRMHYIRILPGDRVVVEITPYDPTRGRIVYRK
>P0A705 ~~~infB~~~Translation initiation factor IF-2~~~COG0532
MTDVTIKTLAAERQTSVERLVQQFADAGIRKSADDSVSAQEKQTLIDHLNQKNSGPDKLTLQRKTRSTLNIPGTGGKSKS
VQIEVRKKRTFVKRDPQEAERLAAEEQAQREAEEQARREAEESAKREAQQKAEREAAEQAKREAAEQAKREAAEKDKVSN
QQDDMTKNAQAEKARREQEAAELKRKAEEEARRKLEEEARRVAEEARRMAEENKWTDNAEPTEDSSDYHVTTSQHARQAE
DESDREVEGGRGRGRNAKAARPKKGNKHAESKADREEARAAVRGGKGGKRKGSSLQQGFQKPAQAVNRDVVIGETITVGE
LANKMAVKGSQVIKAMMKLGAMATINQVIDQETAQLVAEEMGHKVILRRENELEEAVMSDRDTGAAAEPRAPVVTIMGHV
DHGKTSLLDYIRSTKVASGEAGGITQHIGAYHVETENGMITFLDTPGHAAFTSMRARGAQATDIVVLVVAADDGVMPQTI
EAIQHAKAAQVPVVVAVNKIDKPEADPDRVKNELSQYGILPEEWGGESQFVHVSAKAGTGIDELLDAILLQAEVLELKAV
RKGMASGAVIESFLDKGRGPVATVLVREGTLHKGDIVLCGFEYGRVRAMRNELGQEVLEAGPSIPVEILGLSGVPAAGDE
VTVVRDEKKAREVALYRQGKFREVKLARQQKSKLENMFANMTEGEVHEVNIVLKADVQGSVEAISDSLLKLSTDEVKVKI
IGSGVGGITETDATLAAASNAILVGFNVRADASARKVIEAESLDLRYYSVIYNLIDEVKAAMSGMLSPELKQQIIGLAEV
RDVFKSPKFGAIAGCMVTEGVVKRHNPIRVLRDNVVIYEGELESLRRFKDDVNEVRNGMECGIGVKNYNDVRTGDVIEVF
EIIEIQRTIA
>P04766 ~~~infB~~~Translation initiation factor IF-2~~~
MSKMRVYEYAKKQNVPSKDVIHKLKEMNIEVNNHMAMLEADVVEKLDHQYRPKAEKKTETKNEKKAEKKTDKPKRPMPAK
TADFSDEEIFDDVKEAAKPAKKKGAAKGKETKRTEAQQQEKKAFQAAKKKGKGPAKGKKQAAPAAKQVPQPAKKEKELPK
KITFEGSLTVAELAKKLGREPSEIIKKLFMLGVMATINQDLDKDAIELICSDYGVEVEEKVTIDETNFEAIEIADAPEDL
VERPPVVTIMGHVDHGKTTLLDAIRHSKVTEQEAGGITQHIGAYQVTVNDKKITFLDTPGHEAFTTMRARGRQVTDIVIL
VVAADDGVMPQTVEAINHAKAANVPIIVAINKMDKPEANPDRVMQELMEYNLVPEEWGGDTIFCKLSAKTKEGLDHLLEM
ILLVSEMEELKANPNRRAVGTVIEAKLDKGRGPVATLLVQAGTLKVGDPIVVGTTYGRVRAMVNDSGRRVKEAGPSMPVE
ITGLHDVPQAGDRFMVFEDEKKARQIGEARAQRQLQEQRSVKTRVSLDDLFEQIKQGEMKELNLIVKADVQGSVEALVAA
LQKIDVEGVRVKIIHAAVGAITESDISLATASNAIVIGFNVRPDANAKRAAESEKVDIRLHRIIYNVIEEIEAAMKGMLD
PEYEEKVIGQAEVRQTFKVSKVGTIAGCYVTDGKITRDSKVRLIRQGIVVYEGEIDSLKRYKDDVREVAQGYECGLTIKN
FNDIKEGDVIEAYVMQEVARA
>P9WKK1 ~~~infB~~~Translation initiation factor IF-2~~~COG0481
MAAGKARVHELAKELGVTSKEVLARLSEQGEFVKSASSTVEAPVARRLRESFGGSKPAPAKGTAKSPGKGPDKSLDKALD
AAIDMAAGNGKATAAPAKAADSGGAAIVSPTTPAAPEPPTAVPPSPQAPHPGMAPGARPGPVPKPGIRTPRVGNNPFSSA
QPADRPIPRPPAPRPGTARPGVPRPGASPGSMPPRPGGAVGGARPPRPGAPRPGGRPGAPGAGRSDAGGGNYRGGGVGAA
PGTGFRGRPGGGGGGRPGQRGGAAGAFGRPGGAPRRGRKSKRQKRQEYDSMQAPVVGGVRLPHGNGETIRLARGASLSDF
ADKIDANPAALVQALFNLGEMVTATQSVGDETLELLGSEMNYNVQVVSPEDEDRELLESFDLSYGEDEGGEEDLQVRPPV
VTVMGHVDHGKTRLLDTIRKANVREAEAGGITQHIGAYQVAVDLDGSQRLITFIDTPGHEAFTAMRARGAKATDIAILVV
AADDGVMPQTVEAINHAQAADVPIVVAVNKIDKEGADPAKIRGQLTEYGLVPEEFGGDTMFVDISAKQGTNIEALEEAVL
LTADAALDLRANPDMEAQGVAIEAHLDRGRGPVATVLVQRGTLRVGDSVVAGDAYGRVRRMVDEHGEDVEVALPSRPVQV
IGFTSVPGAGDNFLVVDEDRIARQIADRRSARKRNALAARSRKRISLEDLDSALKETSQLNLILKGDNAGTVEALEEALM
GIQVDDEVVLRVIDRGVGGITETNVNLASASDAVIIGFNVRAEGKATELASREGVEIRYYSVIYQAIDEIEQALRGLLKP
IYEENQLGRAEIRALFRSSKVGLIAGCLVTSGVMRRNAKARLLRDNIVVAENLSIASLRREKDDVTEVRDGFECGLTLGY
ADIKEGDVIESYELVQKERA
>Q9HV55 ~~~infB~~~Translation initiation factor IF-2~~~
MTQVTVKELAQVVDTPVERLLLQMRDAGLPHTSAEQVVTDSEKQALLTHLKGSHGDRASEPRKITLQRKTTTTLKVGGSK
TVSVEVRKKKTYVKRSPDEIEAERQRELEEQRAAEEAERLKAEEAAARQRAEEEARKAEEAARAKAAQEAAATAGAEPAV
VADVAVAEPVAKPAAVEERKKEEPRRVPKRDEDDDRRDRKHTQHRPSVKEKEKVPAPRVAPRSTDEESDGYRRGGRGGKS
KLKKRNQHGFQNPTGPIVREVNIGETITVAELAAQMSVKGAEVVKFMFKMGSPVTINQVLDQETAQLVAEELGHKVKLVS
ENALEEQLAESLKFEGEAVTRAPVVTVMGHVDHGKTSLLDYIRRAKVAAGEAGGITQHIGAYHVETERGMVTFLDTPGHA
AFTAMRARGAQATDIVILVVAADDGVMPQTQEAVQHAKAAGVPIVVAVNKIDKPEANPDNIKNGLAALDVIPEEWGGDAP
FVPVSAKLGTGVDELLEAVLLQAEVLELKATPSAPGRGVVVESRLDKGRGPVATVLVQDGTLRQGDMVLVGINYGRVRAM
LDENGKPIKEAGPSIPVEILGLDGTPDAGDEMTVVADEKKAREVALFRQGKFREVKLARAHAGKLENIFENMGQEEKKTL
NIVLKADVRGSLEALQGSLSGLGNDEVQVRVVGGGVGGITESDANLALASNAVLFGFNVRADAGARKIVEAEGLDMRYYN
VIYDIIEDVKKALTGMLGSDLRENILGIAEVRDVFRSPKFGAIAGCMVTEGMVHRNRPIRVLRDDVVIFEGELESLRRFK
DDVAEVRAGMECGIGVKSYNDVKVGDKIEVFEKVEVARSL
>P65134 ~~~infB~~~Translation initiation factor IF-2~~~
MSKQRIYEYAKELNLKSKEIIDELKSMNIEVSNHMQALEDDQIKALDKKFKKEQKNDNKQSTQNNHQKSNNQNQNKGQQK
DNKKNQQQNNKGNKGNKKNNRNNKKNNKNNKPQNQPAAPKEIPSKVTYQEGITVGEFADKLNVESSEIIKKLFLLGIVAN
INQSLNQETIELIADDYGVEVEEEVVINEEDLSIYFEDEKDDPEAIERPAVVTIMGHVDHGKTTLLDSIRHTKVTAGEAG
GITQHIGAYQIENDGKKITFLDTPGHAAFTTMRARGAQVTDITILVVAADDGVMPQTIEAINHAKEAEVPIIVAVNKIDK
PTSNPDRVMQELTEYGLIPEDWGGETIFVPLSALSGDGIDDLLEMIGLVAEVQELKANPKNRAVGTVIEAELDKSRGPSA
SLLVQNGTLNVGDAIVVGNTYGRIRAMVNDLGQRIKTAGPSTPVEITGINDVPQAGDRFVVFSDEKQARRIGESRHEASI
VQQRQESKNVSLDNLFEQMKQGEMKDLNVIIKGDVQGSVEALAASLMKIDVEGVNVRIIHTAVGAINESDVTLANASNGI
IIGFNVRPDSGAKRAAEAENVDMRLHRVIYNVIEEIESAMKGLLDPEFEEQVIGQAEVRQTFKVSKVGTIAGCYVTEGKI
TRNAGVRIIRDGIVQYEGELDTLKRFKDDAKEVAKGYECGITIENYNDLKEGDVIEAFEMVEIKR
>P48515 ~~~infB~~~Translation initiation factor IF-2~~~COG0532
MAKVRIYQLAKELGMETQELLELLDQMGVAYKSHASTLEEKDAEAVRELVKEQRGLQEKLAEEERRKSLPRRPPVVVIMG
HVDHGKTTLLDYLRKSRIAEKEAGGITQHVGAFEVKTPQGTVVFIDTPGHEAFTTIRQRGAKVADIAVIVIAADDGIMPQ
TEEAIAHAKAAGAKLIFAINKIDLPQADPEKVKRQLMERGFVPEEYGGDAIVIPISAKTGQGVQDLLEMILLLAELEDYR
ADPNAEPRGVILESKLDKQAGIIANMLVQEGTFRVGDYVVAGEAYGRIRAMMDADGNQRKEAGPGSAVQVLGFQELPHAG
DVVEWVPDLEAAKEIAEERKEERKAREEEEKARRPRTMAELLRAMQEEGRKELNLILRADTQGSLEAIQHILARESTEDV
KINILLAQVGAPTESDVLLAQTANAAILAFGVNPPGSVKKKAEEKGVLLKTFRIIYDLVDEVRNMVKGQREPQYKEEVLG
QAEVRAIFRLPTGKQVAGCMVTQGRIPRNAEVRVLRDGQVIWQGRIASLKRFKEDVREVAQGYECGIGLDGFDDFREGDV
IEAFQMVEVPA
>Q5HWW2 ~~~infC~~~Translation initiation factor IF-3~~~
MSKEKEVLLNEEIRADEIRCVGDDGKVYGIISSDEALEIANRLGLDLVMIAADAKPPVCKIMDYGKFRYQQEKKQKEAKK
KQKVIDIKEIKLSVKIAQNDINYKVKHALEFLEQGKHVRFRVFLKGREMATPEAGVALLEKIWTMIENEANRDKEPNFEG
RYVNMLVTPKKA
>P0A707 ~~~infC~~~Translation initiation factor IF-3~~~COG0290
MKGGKRVQTARPNRINGEIRAQEVRLTGLEGEQLGIVSLREALEKAEEAGVDLVEISPNAEPPVCRIMDYGKFLYEKSKS
SKEQKKKQKVIQVKEIKFRPGTDEGDYQVKLRSLIRFLEEGDKAKITLRFRGREMAHQQIGMEVLNRVKDDLQELAVVES
FPTKIEGRQMIMVLAPKKKQ
>P03000 ~~~infC~~~Translation initiation factor IF-3~~~
MSKDFIINEQIRAREVRLIDQNGDQLGIKSKQEALEIAARRNLDLVLVAPNAKPPVCRIMDYGKFRFEQQKKEKEARKKQ
KVINVKEVRLSPTIEEHDFNTKLRNARKFLEKGDKVKATIRFKGRAITHKEIGQRVLDRLSEACADIAVVETAPKMDGRN
MFLVLAPKNDNK
>Q9ZMV2 ~~~infC~~~Translation initiation factor IF-3~~~COG0290
MSRNEVLLNGDINFKEVRCVGDNGEVYGIISSKEALKIAQNLGLDLVLISASAKPPVCKVMDYNKFRYQNEKKIKEAKKK
QKQIEIKEIKLSTQIAQNDINYKVKHAREFIESNKHVKFKVVLKGRESQNSKAGLDVLFRVQTMMQDLANPEKEPKTEGR
FVSWMFVPKAKEAPKNEKKTKENNPPFNRINLMKGENHAKNED
>P9WKJ9 ~~~infC~~~Translation initiation factor IF-3~~~COG0290
MSTETRVNERIRVPEVRLIGPGGEQVGIVRIEDALRVAADADLDLVEVAPNARPPVCKIMDYGKYKYEAAQKARESRRNQ
QQTVVKEQKLRPKIDDHDYETKKGHVVRFLEAGSKVKVTIMFRGREQSRPELGYRLLQRLGADVADYGFIETSAKQDGRN
MTMVLAPHRGAKTRARARHPGEPAGGPPPKPTAGDSKAAPN
>P48516 ~~~infC~~~Translation initiation factor IF-3~~~
MIREQRSSRGGSRDQRTNRRIRAREVRVVGSDGSQLGVMPLEAALDRARTEGLDLVEISPMASPPVCKIMDYGKFKYEEK
KKASEAKRAQVTVLLKEVKLRPKTEEHDYEFKVRNTRRFIEDGNKAKVVIQFRGREITHREQGTAILDDVAKDLKDVAVV
EQMPRMEGRLMFMILAPTPKVAQKARELVRQAATAAKRPPPPGAPGAGKSAAGASSGAEEKAEETAEEKKEAQAAPAAAE
AQSPTAS
>Q9I0A0 ~~~infC~~~Translation initiation factor IF-3~~~
MIIKREMRQDKRAQPKPPINENISAREVRLIGADGQQVGVVSIDEAIRLAEEAKLDLVEISADAVPPVCRIMDYGKHLFE
KKKQAAVAKKNQKQAQVKEIKFRPGTEEGDYQVKLRNLVRFLSEGDKAKVSLRFRGREMAHQELGMELLKRVEADLVEYG
TVEQHPKLEGRQLMMVIAPKKKK
>P65140 ~~~infC~~~Translation initiation factor IF-3~~~
MSTIAKDQTQINDKIRAKELRLIGQDGEQIGVKSKREALEMAERVDLDLVVVAPNAKPPVARIMDYGKFKFEQQKKEKEM
KKKQKIINVKEIRLSPTIEEHDFQTKLKNGRKFLTKGDKCKVSIRFRGRAITHKEIGQRVLEKYADECKDIATVEQKPKM
DGRQMFIMLAPTAEK
>P65144 ~~~infC~~~Translation initiation factor IF-3~~~COG0290
MFFSNKTKEVKTIAKQDLFINDEIRVREVRLIGLEGEQLGIKPLSEAQALADNANVDLVLIQPQAKPPVAKIMDYGKFKF
EYQKKQKEQRKKQSVVTVKEVRLSPTIDKGDFDTKLRNARKFLEKGNKVKVSIRFKGRMITHKEIGAKVLAEFAEATQDI
AIIEQRAKMDGRQMFMQLAPATDKK
>Q5SKU2 ~~~infC~~~Translation initiation factor IF-3~~~COG0290
MKEYLTNERIRAKQVRVVGPDGKQLGIMDTREALRLAQEMDLDLVLVGPNADPPVARIMDYSKWRYEQQMAEKEARKKAK
RTEVKSIKFRVKIDEHDYQTKLGHIKRFLQEGHKVKVTIMFRGREVAHPELGERILNRVTEDLKDLAVVEMKPEMLGRDM
NMLLAPVKVSA
>O05443 ~~~~~~Immunity factor for TNT~~~
MTIGVDLSTDLQDWIRLSGMNMIQGSETNDGRTILWNKGGEVRYFIDRLAGWYVITSSDRMSREGYEFAAASMSVIEKYL
YGYFGGSVRSERELPAIRAPFQPEELMPEYSIGTMTFAGRQRDTLIDSSGTVVAITAADRLVELSHYLDVSVNVIKDSFL
DSEGKPLFTLWKDYKG
>P44969 3.4.21.72~~~iga~~~Immunoglobulin A1 protease autotransporter~~~COG3266
MLNKKFKLNFIALTVAYALTPYTEAALVRDDVDYQIFRDFAENKGRFSVGATNVEVRDKNNHSLGNVLPNGIPMIDFSVV
DVDKRIATLINPQYVVGVKHVSNGVSELHFGNLNGNMNNGNAKSHRDVSSEENRYFSVEKNEYPTKLNGKAVTTEDQTQK
RREDYYMPRLDKFVTEVAPIEASTASSDAGTYNDQNKYPAFVRLGSGSQFIYKKGDNYSLILNNHEVGGNNLKLVGDAYT
YGIAGTPYKVNHENNGLIGFGNSKEEHSDPKGILSQDPLTNYAVLGDSGSPLFVYDREKGKWLFLGSYDFWAGYNKKSWQ
EWNIYKPEFAKTVLDKDTAGSLTGSNTQYNWNPTGKTSVISNGSESLNVDLFDSSQDTDSKKNNHGKSVTLRGSGTLTLN
NNIDQGAGGLFFEGDYEVKGTSDSTTWKGAGVSVADGKTVTWKVHNPKSDRLAKIGKGTLIVEGKGENKGSLKVGDGTVI
LKQQADANNKVKAFSQVGIVSGRSTVVLNDDKQVDPNSIYFGFRGGRLDANGNNLTFEHIRNIDDGARLVNHNTSKTSTV
TITGESLITDPNTITPYNIDAPDEDNPYAFRRIKDGGQLYLNLENYTYYALRKGASTRSELPKNSGESNENWLYMGKTSD
EAKRNVMNHINNERMNGFNGYFGEEEGKNNGNLNVTFKGKSEQNRFLLTGGTNLNGDLKVEKGTLFLSGRPTPHARDIAG
ISSTKKDQHFAENNEVVVEDDWINRNFKATNINVTNNATLYSGRNVANITSNITASDNAKVHIGYKAGDTVCVRSDYTGY
VTCTTDKLSDKALNSFNATNVSGNVNLSGNANFVLGKANLFGTISGTGNSQVRLTENSHWHLTGDSNVNQLNLDKGHIHL
NAQNDANKVTTYNTLTVNSLSGNGSFYYLTDLSNKQGDKVVVTKSATGNFTLQVADKTGEPTKNELTLFDASNATRNNLN
VSLVGNTVDLGAWKYKLRNVNGRYDLYNPEVEKRNQTVDTTNITTPNNIQADVPSVPSNNEEIARVETPVPPPAPATPSE
TTETVAENSKQESKTVEKNEQDATETTAQNGEVAEEAKPSVKANTQTNEVAQSGSETEETQTTEIKETAKVEKEEKAKVE
KDEIQEAPQMASETSPKQAKPAPKEVSTDTKVEETQVQAQPQTQSTTVAAAEATSPNSKPAEETQPSEKTNAEPVTPVVS
KNQTENTTDQPTEREKTAKVETEKTQEPPQVASQASPKQEQSETVQPQAVLESENVPTVNNAEEVQAQLQTQTSATVSTK
QPAPENSINTGSATAITETAEKSDKPQTETAASTEDASQHKANTVADNSVANNSESSDPKSRRRRSISQPQETSAEETTA
ASTDETTIADNSKRSKPNRRSRRSVRSEPTVTNGSDRSTVALRDLTSTNTNAVISDAMAKAQFVALNVGKAVSQHISQLE
MNNEGQYNVWVSNTSMNENYSSSQYRRFSSKSTQTQLGWDQTISNNVQLGGVFTYVRNSNNFDKASSKNTLAQVNFYSKY
YADNHWYLGIDLGYGKFQSNLKTNHNAKFARHTAQFGLTAGKAFNLGNFGITPIVGVRYSYLSNANFALAKDRIKVNPIS
VKTAFAQVDLSYTYHLGEFSVTPILSARYDTNQGSGKINVNQYDFAYNVENQQQYNAGLKLKYHNVKLSLIGGLTKAKQA
EKQKTAELKLSFSF
>Q54875 3.4.24.13~~~iga~~~Immunoglobulin A1 protease~~~
MEKYFGEKQERFSFRKLSVGLVSATISSLFFMSVLASSSVDAQETAGVHYKYVADSELSSEEKKQLVYDIPTYVENDDET
YYLVYKLNSQNQLAELPNTGSKNERQALVAGASLAALGILIFAVSKKKVKNKTVLHLVLVAGIGNGVLVSVHALENHLLL
NYNTDYELTSGEKLPLPKEISGYTYIGYIKEGKTTSDFEVSNQEKSAATPTKQQKVDYNVTPNFVDHPSTVQAIQEQTPV
SSTKPTEVQVVEKPFSTELINPRKEEKQSSDSQEQLAEHKNLETKKEEKISPKEKTGVNTLNPQDEVLSGQLNKPELLYR
EETIETKIDFQEEIQENPDLAEGTVRVKQEGKLGKKVEIVRIFSVNKEEVSREIVSTSTTAPSPRIVEKGTKKTQVIKEQ
PETGVEHKDVQSGAIVEPAIQPELPEAVVSDKGEPEVQPTLPEAVVTDKGEPAVQPELPEAVVSDKGEPEQVAPLPEYKG
NIEQVKPETPVEKTKEQGPEKTEEVPVKPTEETPVNPNEGTTEGTSIQGAENPVQPAEDTQTNSGKIANENTGEVSNKPS
DSKPPVEESNQPEKNGTATKPENSGNTTSENGQTEPEPSNGNSTEDVSTKSNTSNSNGNEEIKQENELDPDKKVEDPEKT
LELRNVSDLELYSLSNGTYKQHISLEQVPSNPNSYFVKVKSSSFKDVYLPVASISEGRKNDKILYKITAKVEKLQQEIES
RYKDNFTFYLAKKGTEETTNFTSFSNLVKAINQNLSGTYHLGASLNANEVELSTDDKSYIKGTFTGQLIGEKDGKHYAIY
NLKKPLFENLSGATVEKLSLKNVAISGKNDIGSLANEATNGTKIKQVHVDGVLAGERGVGGLLAKADQSSIAESSFKGRI
VNTYETTDSYNIGGLVGHLTGKNASIAKSKATVTISSNTNRSDQTVGGLAGLVDRDAQIQDSYAEGDINNVKHFGRVAGV
AGNLWDRTSGDVRHAGSLTNVLSDVNVTNGNAITGYHYTGMKVANTFSSKANRVFNVTLEKNEVVSKESFEERGTMLDAS
QIASKKAEINLITPPIVEPLSTSGKKDSDFSKIAHYQANRALVYKNIEKLLPFYNKATIVKYGNLVKENSILYQKELLSA
VMMKDDQVITDIISNKQTANKLLLHYKDHSSEKFDLRYQADFANLAEYSIGDSGLLYTPNQFLYHQDSIINQVLPELNRV
NYQSDAVRNTLGISPEVKLTELYLEEQFTKTKEHLAENLKKLLSSDAGLVTDNEVMTGYIIDKIKRNKEALLLGMSYLER
WYNFSYGQVNVKDLVMYHPDFFGKGNTSPLDTLIELGKSGFNNLLAKNNVDTYAISLASHHGTTDLFSTLENYRKVFLPD
KTNNDWFKSQTKAYIVEEKSNIEEVKTKQGLVGTKYSIGVYDRITSATWKYRNMVLPLLTLPERSVFVISTISSLGFGAY
DRYRNKEHQANGDLNSFVEKSAHETAERQRDHYDYWYRILDEKGREKLYRNILLYDAYKFGTNHTEGKATEVADFDSPNP
AMKHFFGPVGNKVGHNGHGAYATGDAVYYMGYRMLDKDGAITYTHEMTHNSDQDIYLGGYGRRSGLGPEFFAKGLLQAPD
QPSDATITINSILKHSKSDSKEGERLQVLDPTTRFKDATDLQKYVHNMFDVVYMLEYLEGKSIVKKLNVYQKIEALRKIE
NQYLTDPADGNDVYATNVVKNLTEDEAKKLTSFDSLIDNNILSAREYKAGTYERNGYFTIKLFAPIFSALSGEKGTPGDL
MGRRIAFELLAAKGFKDGMVPYISNQYEEDAKQQGQTINLYGKERGLVTDELVLKKVFDGKYKTWAEFKTAMYQERVDQF
GNLKQVTFKDPTKPWPRYGTKTINNVDELQKLMDEAVLQDAKERNYYYWNNYNPETDSAVHKLKRAIFKAYLDQTNDFRR
SIFENKK
>P42782 3.4.21.72~~~iga~~~Immunoglobulin A1 protease autotransporter~~~
MLNKKFKLNFIALTVAYALTPYTEAALVRDDVDYQIFRDFAENKGKFSVGATNVLVKDKNNKDLGTALPNGIPMIDFSVV
DVDKRIATLINPQYVVGVKHVSNGVSELHFGNLNGNMNNGNAKAHRDVSSEENRYFSVEKNEYPTKLNGKTVTTEDQTQK
RREDYYMPRLDKFVTEVAPIEASTASSDAGTYNDQNKYPAFVRLGSGSQFIYKKGDNYSLILNNHEVGGNNLKLVGDAYT
YGIAGTPYKVNHENNGLIGFGNSKEEHSDPKGILSQDPLTNYAVLGDSGSPLFVYDREKGKWLFLGSYDFWAGYNKKSWQ
EWNIYKSQFTKDVLNKDSAGSLIGSKTDYSWSSNGKTSTITGGEKSLNVDLADGKDKPNHGKSVTFEGSGTLTLNNNIDQ
GAGGLFFEGDYEVKGTSDNTTWKGAGVSVAEGKTVTWKVHNPQYDRLAKIGKGTLIVEGTGDNKGSLKVGDGTVILKQQT
NGSGQHAFASVGIVSGRSTLVLNDDKQVDPNSIYFGFRGGRLDLNGNSLTFDHIRNIDDGARLVNHNMTNASNITITGES
LITDPNTITPYNIDAPDEDNPYAFRRIKDGGQLYLNLENYTYYALRKGASTRSELPKNSGESNENWLYMGKTSDEAKRNV
MNHINNERMNGFNGYFGEEEGKNNGNLNVTFKGKSEQNRFLLTGGTNLNGDLTVEKGTLFLSGRPTPHARDIAGISSTKK
DPHFAENNEVVVEDDWINRNFKATTMNVTGNASLYSGRNVANITSNITASNKAQVHIGYKTGDTVCVRSDYTGYVTCTTD
KLSDKALNSFNPTNLRGNVNLTESANFVLGKANLFGTIQSRGNSQVRLTENSHWHLTGNSDVHQLDLANGHIHLNSADNS
NNVTKYNTLTVNSLSGNGSFYYLTDLSNKQGDKVVVTKSATGNFTLQVADKTGEPNHNELTLFDASKAQRDHLNVSLVGN
TVDLGAWKYKLRNVNGRYDLYNPEVEKRNQTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETV
AENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQ
EVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPE
NTTPATTQPTVNSESSNKPKNRHRRSVRSVPHNVEPATTSSNDRSTVALCDLTSTNTNAVLSDARAKAQFVALNVGKAVS
QHISQLEMNNEGQYNVWVSNTSMNKNYSSSQYRRFSSKSTQTQLGWDQTISNNVQLGGVFTYVRNSNNFDKATSKNTLAQ
VNFYSKYYADNHWYLGIDLGYGKFQSKLQTNHNAKFARHTAQFGLTAGKAFNLGNFGITPIVGVRYSYLSNADFALDQAR
IKVNPISVKTAFAQVDLSYTYHLGEFSVTPILSARYDANQGSGKINVNGYDFAYNVENQQQYNAGLKLKYHNVKLSLIGG
LTKAKQAEKQKTAELKLSFSF
>Q59947 3.4.24.13~~~iga~~~Immunoglobulin A1 protease~~~COG0810
MEKYFGEKQERFSFRKLSVGLVSATISSLFFMSVLASSSVDAQETAGVHYKYVADSELSSEEKKQLVYDIPTYVENDDET
YYLVYKLNSQNQLAELPNTGSKNERQALVAGASLAALGILIFAVSKKKVKNKTVLHLVLVAGMGNGVLVSVHALENHLLL
NYNTDYELTSGEKLPLPKEISGYTYIGYIKEGKTTSDFEVSNQEKSAATPTKQQKVDYNVTPNFVDHPSTVQAIQEQTPV
SSTKPTEVQVVEKPFSTELINPRKEEKQSSDSQEQLAEHKNLETKKEEKISPKEKTGVNTLNPQDEVLSGQLNKPELLYR
EETIETKIDFQEEIQENPDLAEGTVRVKQEGKLGKKVEIVRIFSVNKEEVSREIVSTSTTAPSPRIVEKGTKKTQVIKEQ
PETGVEHKDVQSGAIVEPAIQPELPEAVVSDKGEPEVQPTLPEAVVTDKGETEVQPESPDTVVSDKGEPEQVAPLPEYKG
NIEQVKPETPVEKTKEQGPEKTEEVPVKPTEETPVNPNEGTTEGTSIQEAENPVQPAEESTTNSEKVSPDTSSENTGEVS
SNPSDSTTSVGESNKPEHNDSKNENSEKTVEEVPVNPNEGTVEGTSNQETEKPVQPAEETQTNSGKIANENTGEVSNKPS
DSKPPVEESNQPEKNGTATKPENSGNTTSENGQTEPEKKLELRNVSDIELYSQTNGTYRQHVSLDGIPENTDTYFVKVKS
SAFKDVYIPVASITEEKRNGQSVYKITAKAEKLQQELENKYVDNFTFYLDKKAKEENTNFTSFSNLVKAINQNPSGTYHL
AASLNANEVELGPDERSYIKDTFTGRLIGEKDGKNYAIYNLKKPLFENLSGATVEKLSLKNVAISGKNDIGSLANEATNG
TKIKQVHVDGVLAGERGVGGLLAKADQSSIAESSFKGRIVNTYETTDAYNIGGLVGHLTGKNASIAKSKATVTISSNTNR
SDQTVGGLAGLVDQDAHIQNSYAEGDINNVKHFGKVAGVAGYLWDRTSGEEKHAGELTNVLSDVNVTNGNAITGYHYTGM
KVANTFSSKANRVFNVTLEKDEVVSKESFEERGTMLDASQIVSKKAEINPLTLPTVEPLSTSGKKDSDFSKIAHYQANRA
LVYKNIEKLLPFYNKSTIVKYGNLVKENSLLYQKELLSAVMMKDDQVITDIVSNKQTANKLLLHYNDHSSEKFDLKYQTD
FANLAEYNLGNTGLLYTPNQFLYDRDSIVKEVLPELQKLDYQSDAIRKTLGISPEVKLTELYLEDQFSKTKQNLGDSLKK
LLSADAGLASDNSVTRGYLVDKIKNNKEALLLGLTYLERWYNFNYGQVNVKDLVMYHPDFFGKGNTSPLDTLIELGKSGF
NNLLAKNNVDTYGISLASQHGATDLFSTLEHYRKVFLPNTSNNDWFKSETKAYIVEEKSTIEEVKTKQGLAGTKYSIGVY
DRITSATWKYRNMVLPLLTLPERSVFVISTMSSLGFGAYDRYRSSDHKAGKALNDFVEENARETAKRQRDHYDYWYRILD
EQSREKLYRTILLYDAYKFGDDTTSGKATAEAKFDSSNPAMKNFFGPVGNKVVHNQHGAYATGDGVYYMSYRMLDKDGAI
TYTHEMTHDSDQDIYLGGYGRRNGLGPEFFAKGLLQAPDQPSDATITINSILKHSKSDSTEGSRLQVLDPTERFQNAADL
QNYVHNMFDLIYMMEYLEGQSIVNKLSVYQKMAALRKIENKYVKDPADGNEVYATNVVKELTEAEARNLNSFESLIDHNI
LSAREYQSGDYERNGYYTIKLFAPIYSALSSEKGTPGDLMGRRIAYELLAAKGFKDGMVPYISNQYEEDAKQQGQTINLY
GKERGLVTDELVLKKVFDGKYKTWAEFKTAMYQERVDQFGNLKQVTFKDPTKPWPSYGTKTINNVDELQALMDQAVLKDA
EGPRWSNYDPEIDSAVHKLKRAIFKAYLDQTNDFRSSIFENKK
>Q59986 3.4.24.13~~~iga~~~Immunoglobulin A1 protease~~~
MKKFLGEKQTRFAFRKLAVGLVSAAISSLFFVSIVGVDSVQAQEKLNVHYKYVTDTEITPQEKELIVSGVPRMPEGNEET
YYLVYRLNSNAGAKTLPNTGDNNSNTMMAAGLLLTTIGLVVFAVSKRKVQSKFLLTVLVGASVGGGLILSVDALENGSLL
QYNAEYQVSAGESLPSPGEISGYTYVGYIKDESIKKLLDNKIPDNQQNANVDKEALNQNKKLDYSVSFDKNGLKNQTVGV
NTIEPQDEVLSGRVAKPELLYKETSIETEIAYGEQIQENPDLAEGTVRVKQEGKPGRKIEVVRIFTVDNAEVSREVLSTK
IEEATPKIVEKGTKKLEAPSEKPVTSNLVQPEQVAPLPEYTGVQSGAIVEPEQVASLPEYSGTLSGAIVEPEQIEPEIGG
VQSGAIVEPEQVTPLPEYTGTQAGAVVSPEQVAPLPEYTGTQSGAIVEPAQVTPLPEYTGVQSGAIVKPAQVTPLPEYTG
TQSGAIVEPEQVTPSPEYTGVQAGAIVEPEQVASLPEYTGSQAGAIVEPEQVEPPQEYTGNIEPAAPEAENPTEKAQEPK
EQKQEPEKNIELRNVSDVELYSLADGKYKQHVSLDAIPSNQENYFVKVKSSKFKDVFLPISSIVDSTKDGQPVYKITASA
EKLKQDVNNKYEDNFTFYLAKKAEREVTNFTSFSNLVQAINNNLNGTYYLAASLNANEVELENGASSYIKGRFTGKLFGS
KDGKNYAIYNLKKPLFDTLSAATVENLTLKDVNISGKTDIGALANEANNATRINNVHVDGVLAGERGIGGLVWKADNSKI
SNSSFKGRIVNSYETKAPYNIGGLVGQLTGINALVDKSKATITISSNADSTNQTVGGLAGLVEKDALISNSYAEGNINNV
KRFGSVAGVAGYLWDRDSSEERHAGRLHNVLSDINVMNGNAISGYHYRGMRITDSYSNKDNRVYKVTLEKDEVVTKESLE
ERGTILDVSQIASKKSEINSLSAPKVETLLTSTNKESDFSKVKDYQASRALAYKNIEKLLPFYNKATIVKYGNLVKEDST
LYEKEILSAVMMKDNEVITDIASHKEAANKLLIHYKDHSSEKLDLTYQSDFSKLAEYRVGDTGLIYTPNQFLQNHSSIVN
EVLPDLKAVDYQSEAIRNTLGISSGVSLTELYLEEQFAKTKENLANTLEKLLSADAVIASENQTINGYVVDKIKRNKEAL
LLGLTYLERWYNFNYGDVNVKDLVMYHMDFFGKGNVSPLDTIIELGKSGFNNLLAKNNVDAYNISLANNNATKDLFSTLA
NYREVFLPNKTNNQWFKEQTKAYIVEEKSAIDEVRVKQEQAGSKYSIGVYDRITSDTWKYRNMVLPLLTMPERSVFVIST
ISSLGFGAYDRYRNNEHRAGAELNKFVEDNAQETAKRQRDHYDYWYRILDEQGREKLYRNILVYDAYKFGDDTTVDKATV
EAQFDSSNPAMKYFFGPVGNKVVHNKHGAYATGDSVYYMGYRMLDKDGAITYTHEMTHDSDNEIYLGGYGRRSGLGPEFF
AKGLLQAPDHPDDATITVNSILKYDKNDASEKSRLQVLDPTKRFQNADDLKNYVHNMFDVIYMLEYLEGMSIVNRLSDVQ
KVNALRKIENKYVRDADGNDVYATNVIKNITMADAQKLNSFNSLIENDILSAREYKNGDVERNGYHTIKLFSPIYSALSS
EKGTPGDLMGRRIAYELLAAKGFKDGMVPYISNQYEDDAKQNGKTISIYGKTRGLVTDDLVLRKVFNGQFNNWTEFKKAM
YEERKNKFDSLNKVTFDDTRQPWTSYATKTISTVEELQTLMDEAVLQDANDNWYSWSGYKPEYNSAVHKLKKAVFKAYLD
QTKDFRKSIFENQK
>P45800 ~~~yrfF~~~Putative membrane protein IgaA homolog~~~
MSTIVIFLAALLACSLLAGWLIKVRSRRRQLPWTNAFADAQTRKLTPEERSAVENYLESLTQVLQVPGPTGASAAPISLA
LNAESNNVMMLTHAITRYGISTDDPNKWRYYLDSVEVHLPPFWEQYINDENTVELIHTDSLPLVISLNGHTLQEYMQETR
SYALQPVPSTQASIRGEESEQIELLNIRKETHEEYALSRPRGLREALLIVASFLMFFFCLITPDVFVPWLAGGALLLLGA
GLWGLFAPPAKSSLREIHCLRGTPRRWGLFGENDQEQINNISLGIIDLVYPAHWQPYIAQDLGQQTDIDIYLDRHVVRQG
RYLSLHDEVKNFPLQHWLRSTIIAAGSLLVLFMLLFWIPLDMPLKFTLSWMKGAQTIEATSVKQLADAGVRVGDTLRISG
TGMCNIRTSGTWSAKTNSPFLPFDCSQIIWNDARSLPLPESELVNKATALTEAVNRQLHPKPEDESRVSASLRSAIQKSG
MVLLDDFGDIVLKTADLCSAKDDCVRLKNALVNLGNSKDWDALVKRANAGKLDGVNVLLRPVSAESLDNLVATSTAPFIT
HETARAAQSLNSPAPGGFLIVSDEGSDFVDQPWPSASLYDYPPQEQWNAFQKLAQMLMHTPFNAEGIVTKIFTDANGTQH
IGLHPIPDRSGLWRYLSTTLLLLTMLGSAIYNGVQAWRRYQRHRTRMMEIQAYYESCLNPQLITPSESLIE
>E1WIS2 ~~~igaA~~~Intracellular growth attenuator protein IgaA~~~
MSTILIFIAALLACSLLAIWRFRVKSRRGSLPWISAFQDAQTRKLLPEERSAVENYLDNLSQIQQVPGPTGASAAPISLT
LNAESNSVVILTHSITRYGITTDDPNKWRYYLDSVEVHLPPFWEQYINDENNVELILTDTLPLVISLNGHTLQEYMQESR
GYALQNTASTQASIRGEESEQIELLNIRQETHEEYALSRPAGLREALLIVASFLLFFFCLITPDVFVPWMIGGAILLLAA
GLWGLFAPPSKSALREIHCLRGTPRRWGLFGENNQEQINNISLGIIDLIYPAHWQPYITQDLGQQTDIDIYLDRHVARQG
RFLSLHDEVKNFPLQHWLRSTVIAIGSLLVLFMLLFWIPLDMPIKFTLSWMKGAQTIEATTVKQLEKAGVRVGDTLHLSG
KGMCNIHSGATWSGQSNSPFMPFDCSQIIWNDAPALPLPESDLVNKAMALSQAVNRQLHPKPEDDSRVSASLRSAIQKSG
MVLLDDFGDIVLKTADLCAAEDECVRLKNALVNLGNSKDWNALVKRANAGKLDGVNVLLRPVSAESLENLVTTSTAPFIS
RETARAAQSLNSPAPGGFLIASDEGSELVDQTWPSTPLYDYPAQEQWSAFQRLAQTLMQTPFSAEGIVTSVYTDANGTQH
ISLHRIPDKSGWWRYLGTTLLMLAMIVSAVYNGIQAFRRYQRHRTRMADIQEYYESCLNPRLTVSPENLI
>P09790 3.4.21.72~~~iga~~~IgA-specific serine endopeptidase autotransporter~~~
MKAKRFKINAISLSIFLAYALTPYSEAALVRDDVDYQIFRDFAENKGKFFVGATDLSVKNKRGQNIGNALSNVPMIDFSV
ADVNKRIATVVDPQYAVSVKHAKAEVHTFYYGQYNGHNDVADKENEYRVVEQNNYEPHKAWGASNLGRLEDYNMARFNKF
VTEVAPIAPTDAGGGLDTYKDKNRFSSFVRIGAGRQLVYEKGVYHQEGNEKGYDLRDLSQAYRYAIAGTPYKDINIDQTM
NTEGLIGFGNHNKQYSAEELKQALSQDALTNYGVLGDSGSPLFAFDKQKNQWVFLGTYDYWAGYGKKSWQEWNIYKKEFA
DKIKQHDNAGTVKGNGEHHWKTTGTNSHIGSTAVRLANNEGDANNGQNVTFEDNGTLVLNQNINQGAGGLFFKGDYTVKG
ANNDITWLGAGIDVADGKKVVWQVKNPNGDRLAKIGKGTLEINGTGVNQGQLKVGDGTVILNQKADADKKVQAFSQVGIV
SGRGTLVLNSSNQINPDNLYFGFRGGRLDANGNDLTFEHIRNVDEGARIVNHNTDHASTITLTGKSLITNPNSLSVHSIQ
NDYDEDDYSYYYRPRRPIPQGKDLYYKNYRYYALKSGGRLNAPMPENGVAENNDWIFMGYTQEEARKNAMNHKNNRRIGD
FGGFFDEENGKGHNGALNLNFNGKSAQKRFLLTGGANLNGKISVTQGNVLLSGRPTPHARDFVNKSSARKDAHFSKNNEV
VFEDDWINRTFKAAEIAVNQSASFSSGRNVSDITANITATDNAKVNLGYKNGDEVCVRSDYTGYVTCNTGNLSDKALNSF
DATRINGNVNLNQNAALVLGKAALWGKIQGQGNSRVSLNQHSKWHLTGDSQVHNLSLADSHIHLNNASDAQSANKYHTIK
INHLSGNGHFHYLTDLAKNLGDKVLVKESASGHYQLHVQNKTGEPNQEGLDLFDASSVQDRSRLFVSLANHYVDLGALRY
TIKTENGITRLYNPYAGNGRPVKPAPSPAANTASQAQKATQTDGAQIAKPQNIVVAPPSPQANQAEEALRQQAKAEQVKR
QQAAEAEKVARQKDEEAKRKAAEIARQQEEARKAAELAAKQKAEAERKARELARQKAEEASHQANAKPKRRRRRAILPRP
PAPVFSLDDYDAKDNSESSIGNLARVIPRMGRELINDYEEIPLEELEDEAEEERRQATQFHSKSRNRRAISSEPSSDEDA
SESVSTSDKHPQDNTELHEKVETAGLQPRAAQPRTQAAAQADAVSTNTNSALSDAMASTQSILLDTGAYLTRHIAQKSRA
DAEKNSVWMSNTGYGRDYASAQYRRFSSKRTQTQIGIDRSLSENMQIGGVLTYSDSQHTFDQAGGKNTFVQANLYGKYYL
NDAWYVAGDIGAGSLRSRLQTQQKANFNRTSIQTGLTLGNTLKINQFEIVPSAGIRYSRLSSADYKLGDDSVKVSSMAVK
TLTAGLDFAYRFKVGNLTVKPLLSAAYFANYGKGGVNVGGKSFAYKADNQQQYSAGVALLYRNVTLNVNGSITKGKQLEK
QKSGQIKIQIRF
>C5W022 3.4.22.-~~~ide~~~IgM protease~~~
MNIQERFSLRKSAVGLVSVSLLCAIYTSTVAADTVVTGVNEIIEESQVKDEVSIESEKNESLDGSNIEIVEEIADNIPSP
VIAEGEVAVEMKVDRGTENVVSRNDTEVTTSEQNQIEVTETKEILNQTSYQTESGEQRQIIWAHGITPPAMEQSGGFVKE
KYGDYLNYTAPFEAGKGYYDTNKSLNASFIDLNLCFAAVSSNMVHWWLEQNSSYVERYLKEKKGTVNVEENYAITDLRRY
INSFQNQQNSRVFDMFKTYYGYRTNGFVSDALVDLFINGYKPKAQGGVNLEDSQLVPDSRGGFFYDVFKEKKLTNRIFSG
SYERFGEDVRTVLESKGLLGLTYRTLGYATHIVTVWGAEYDNQGKIKAVYITDSDDQQEQIGLKRMGITRDASGNPRLNN
HMKNNSAGALLDYVHTIRLGQDLWEEYFNPLAKAKETASQTLADTKKALDLSIQGQSELPESMRLIYLEKLNNLYNQGIL
SIQKAESSEMLSGALENGLNSLKSLDFPISEVGNALAPDLPVGDRSTVSDVDSLSSQETSSTNLEADTENAGIIADGTNQ
LHFPVEAQTTSSVEAEGDNVFEQEADTLPIIIENKDEFGSELSRNMQTSETDSLVVAVEEDVKNDEVAQVEELLESEKVE
NQSSELLSDTLIVESANDKEEDRVEAVVSEQPDSIPHQNVEISLVEPTNVETETVVTPINDAATPHGSPTYIDNSVTESV
ATPLEKDSIQAGETEIAEPTSSESTNVETETVVTPVNDVATPHGSPTYIDNSVTESVATPLEKDSIQAGETEIAEPTSSE
STNVETETVVTPVNDVATPHGSPTYIDNSVTESVATPLEKDSIQAGETEIAEPTSSESTSVEAELVDNSEIHAATSSVTP
CGSSAYADGSTTESVATPLEKDSIQTGNTEIAEPTSSKSTNVEAASVDNSEIHADASLTAVSSVNLDNPVIEPVAISLIG
SKRDTNAEVEVSSLSKREVRKTNTDGLISVQSKVIKKELLESSLAEAGSPLLEATIAQSSNSNSTEIGMSYQNTVLLESN
NTERQVSKAEIVMEHKETELVETVSSASEPVVLVENISQTSNNTIESGKNMGVQSQAGAKQILGVEQSSKVSTPTSRQIM
GVGLLTLVLGSALGLLKKRRK
>P0A6X7 ~~~ihfA~~~Integration host factor subunit alpha~~~COG0776
MALTKAEMSEYLFDKLGLSKRDAKELVELFFEEIRRALENGEQVKLSGFGNFDLRDKNQRPGRNPKTGEDIPITARRVVT
FRPGQKLKSRVENASPKDE
>Q51472 ~~~ihfA~~~Integration host factor subunit alpha~~~
MGALTKAEIAERLYEELGLNKREAKELVELFFEEIRQALEHNEQVKLSGFGNFDLRDKRQRPGRNPKTGEEIPITARRVV
TFRPGQKLKARVEAYAGTKS
>P30787 ~~~ihfA~~~Integration host factor subunit alpha~~~
MSEKTLTRMDLSEAVFREVGLSRNESAQLVETVLQHMSDALVRGETVKISSFGTFSVRDKTSRMGRNPKTGEEVPISPRR
VLSFRPSHLMKDRVAERNAK
>P0A6Y1 ~~~ihfB~~~Integration host factor subunit beta~~~COG0776
MTKSELIERLATQQSHIPAKTVEDAVKEMLEHMASTLAQGERIEIRGFGSFSLHYRAPRTGRNPKTGDKVELEGKYVPHF
KPGKELRDRANIYG
>Q06607 ~~~ihfB~~~Integration host factor subunit beta~~~
MIRSELIAKIAEENPHLFQRDVEKIVNTIFEEIIEAMARGDRVELRGFGAFSVKKRDARTGRNPRTGTSVAVDEKHVPFF
KTGKLLRDRLNGGEE
>A0QWS8 ~~~mIHF~~~Integration host factor~~~COG0099
MALPQLTDEQRAAALEKAAAARRARAELKDRLKRGGTNLKQVLTDAETDEVLGKMKVSALLEALPKVGKVKAQEIMTELE
IAPTRRLRGLGDRQRKALLEKFDQS
>P71658 ~~~mihF~~~Integration host factor~~~COG0099
MALPQLTDEQRAAALEKAAAARRARAELKDRLKRGGTNLTQVLKDAESDEVLGKMKVSALLEALPKVGKVKAQEIMTELE
IAPTRRLRGLGDRQRKALLEKFGSA
>Q9KXR9 ~~~sihF~~~Integration host factor~~~COG0099
MALPPLTPEQRAAALEKAAAARRERAEVKNRLKHSGASLHEVIKQGQENDVIGKMKVSALLESLPGVGKVRAKQIMERLG
ISESRRVRGLGSNQIASLEREFGSTGS
>M1GRN3 5.1.1.21~~~~~~Isoleucine 2-epimerase~~~
MGKLDKASKLIDEENKYYARSARINYYNLVIDHAHGATLVDVDGNKYIDLLASASAINVGHTHEKVVKAIADQAQKLIHY
TPAYFHHVPGMELSEKLAKIAPGNSPKMVSFGNSGSDANDAIIKFARAYTGRQYIVSYMGSYHGSTYGSQTLSGSSLNMT
RKIGPMLPSVVHVPYPDSYRTYPGETEHDVSLRYFNEFKKPFESFLPADETACVLIEPIQGDGGIIKAPEEYMQLVYKFC
HEHGILFAIDEVNQGLGRTGKMWAIQQFKDIEPDLMSVGKSLASGMPLSAVIGKKEVMQSLDAPAHLFTTAGNPVCSAAS
LATLDVIEYEGLVEKSATDGAYAKQRFLEMQQRHPMIGDVRMWGLNGGIELVKDPKTKEPDSDAATKVIYYAFAHGVVII
TLAGNILRFQPPLVIPREQLDQALQVLDDAFTAVENGEVTIPKDTGKIGW
>P04968 4.3.1.19~~~ilvA~~~L-threonine dehydratase biosynthetic IlvA~~~COG1171
MADSQPLSGAPEGAEYLRAVLRAPVYEAAQVTPLQKMEKLSSRLDNVILVKREDRQPVHSFKLRGAYAMMAGLTEEQKAH
GVITASAGNHAQGVAFSSARLGVKALIVMPTATADIKVDAVRGFGGEVLLHGANFDEAKAKAIELSQQQGFTWVPPFDHP
MVIAGQGTLALELLQQDAHLDRVFVPVGGGGLAAGVAVLIKQLMPQIKVIAVEAEDSACLKAALDAGHPVDLPRVGLFAE
GVAVKRIGDETFRLCQEYLDDIITVDSDAICAAMKDLFEDVRAVAEPSGALALAGMKKYIALHNIRGERLAHILSGANVN
FHGLRYVSERCELGEQREALLAVTIPEEKGSFLKFCQLLGGRSVTEFNYRFADAKNACIFVGVRLSRGLEERKEILQMLN
DGGYSVVDLSDDEMAKLHVRYMVGGRPSHPLQERLYSFEFPESPGALLRFLNTLGTYWNISLFHYRSHGTDYGRVLAAFE
LGDHEPDFETRLNELGYDCHDETNNPAFRFFLAG
>P9WG95 4.3.1.19~~~ilvA~~~L-threonine dehydratase biosynthetic IlvA~~~COG1171
MSAELSQSPSSSPLFSLSGADIDRAAKRIAPVVTPTPLQPSDRLSAITGATVYLKREDLQTVRSYKLRGAYNLLVQLSDE
ELAAGVVCSSAGNHAQGFAYACRCLGVHGRVYVPAKTPKQKRDRIRYHGGEFIDLIVGGSTYDLAAAAALEDVERTGATL
VPPFDDLRTIAGQGTIAVEVLGQLEDEPDLVVVPVGGGGCIAGITTYLAERTTNTAVLGVEPAGAAAMMAALAAGEPVTL
DHVDQFVDGAAVNRAGTLTYAALAAAGDMVSLTTVDEGAVCTAMLDLYQNEGIIAEPAGALSVAGLLEADIEPGSTVVCL
ISGGNNDVSRYGEVLERSLVHLGLKHYFLVDFPQEPGALRRFLDDVLGPNDDITLFEYVKRNNRETGEALVGIELGSAAD
LDGLLARMRATDIHVEALEPGSPAYRYLL
>P9WG41 2.2.1.6~~~ilvB1~~~Acetolactate synthase large subunit IlvB1~~~COG0028
MSAPTKPHSPTFKPEPHSAANEPKHPAARPKHVALQQLTGAQAVIRSLEELGVDVIFGIPGGAVLPVYDPLFDSKKLRHV
LVRHEQGAGHAASGYAHVTGRVGVCMATSGPGATNLVTPLADAQMDSIPVVAITGQVGRGLIGTDAFQEADISGITMPIT
KHNFLVRSGDDIPRVLAEAFHIAASGRPGAVLVDIPKDVLQGQCTFSWPPRMELPGYKPNTKPHSRQVREAAKLIAAARK
PVLYVGGGVIRGEATEQLRELAELTGIPVVTTLMARGAFPDSHRQNLGMPGMHGTVAAVAALQRSDLLIALGTRFDDRVT
GKLDSFAPEAKVIHADIDPAEIGKNRHADVPIVGDVKAVITELIAMLRHHHIPGTIEMADWWAYLNGVRKTYPLSYGPQS
DGSLSPEYVIEKLGEIAGPDAVFVAGVGQHQMWAAQFIRYEKPRSWLNSGGLGTMGFAIPAAMGAKIALPGTEVWAIDGD
GCFQMTNQELATCAVEGIPVKVALINNGNLGMVRQWQSLFYAERYSQTDLATHSHRIPDFVKLAEALGCVGLRCEREEDV
VDVINQARAINDCPVVIDFIVGADAQVWPMVAAGTSNDEIQAARGIRPLFDDITEGHA
>O06335 2.2.1.6~~~ilvB2~~~Putative acetolactate synthase large subunit IlvB2~~~COG0028
MTVGDHLVARMRAAGISVVCGLPTSRLDSLLVRLSRDAGFQIVLARHEGGAGYLADGFARASGKSAAVFVAGPGATNVIS
AVANASVNQVPMLILTGEVAVGEFGLHSQQDTSDDGLGLGATFRRFCRCSVSIESIANARSKIDSAFRALASIPRGPVHI
ALPRDLVDERLPAHQLGTAAAGLGGLRTLAPCGPDVADEVIGRLDRSRAPMLVLGNGCRLDGIGEQIVAFCEKAGLPFAT
TPNGRGIVAETHPLSLGVLGIFGDGRADEYLFDTPCDLLIAVGVSFGGLVTRSFSPRWRGLKADVVHVDPDPSAVGRFVA
TSLGITTSGRAFVNALNCGRPPRFCRRVGVRPPAPAALPGTPQARGESIHPLELMHELDRELAPNATICADVGTCISWTF
RGIPVRRPGRFFATVDFSPMGCGIAGAIGVALARPEEHVICIAGDGAFLMHGTEISTAVAHGIRVTWAVLNDGQMSASAG
PVSGRMDPSPVARIGANDLAAMARALGAEGIRVDTRCELRAGVQKALAATGPCVLDIAIDPEINKPDIGLGR
>P37251 2.2.1.6~~~ilvB~~~Acetolactate synthase large subunit~~~COG0028
MGTNVQVDSASAECTQTMSGALMLIESLKKEKVEMIFGYPGGAVLPIYDKLYNSGLVHILPRHEQGAIHAAEGYARVSGK
PGVVIATSGPGATNLVTGLADAMIDSLPLVVFTGQVATSVIGSDAFQEADILGITMPVTKHSYQVRQPEDLPRIIKEAFH
IATTGRPGPVLIDIPKDVATIEGEFSYDHEMNLPGYQPTTEPNYLQIRKLVEAVSSAKKPVILAGAGVLHGKASEELKNY
AEQQQIPVAHTLLGLGGFPADHPLFLGMAGMHGTYTANMALHECDLLISIGARFDDRVTGNLKHFARNAKIAHIDIDPAE
IGKIMKTQIPVVGDSKIVLQELIKQDGKQSDSSEWKKQLAEWKEEYPLWYVDNEEEGFKPQKLIEYIHQFTKGEAIVATD
VGQHQMWSAQFYPFQKADKWVTSGGLGTMGFGLPAAIGAQLAEKDATVVAVVGDGGFQMTLQELDVIRELNLPVKVVILN
NACLGMVRQWQEIFYEERYSESKFASQPDFVKLSEAYGIKGIRISSEAEAKEKLEEALTSREPVVIDVRVASEEKVFPMV
APGKGLHEMVGVKP
>P08142 2.2.1.6~~~ilvB~~~Acetolactate synthase isozyme 1 large subunit~~~COG0028
MASSGTTSTRKRFTGAEFIVHFLEQQGIKIVTGIPGGSILPVYDALSQSTQIRHILARHEQGAGFIAQGMARTDGKPAVC
MACSGPGATNLVTAIADARLDSIPLICITGQVPASMIGTDAFQEVDTYGISIPITKHNYLVRHIEELPQVMSDAFRIAQS
GRPGPVWIDIPKDVQTAVFEIETQPAMAEKAAAPAFSEESIRDAAAMINAAKRPVLYLGGGVINAPARVRELAEKAQLPT
TMTLMALGMLPKAHPLSLGMLGMHGVRSTNYILQEADLLIVLGARFDDRAIGKTEQFCPNAKIIHVDIDRAELGKIKQPH
VAIQADVDDVLAQLIPLVEAQPRAEWHQLVADLQREFPCPIPKACDPLSHYGLINAVAACVDDNAIITTDVGQHQMWTAQ
AYPLNRPRQWLTSGGLGTMGFGLPAAIGAALANPDRKVLCFSGDGSLMMNIQEMATASENQLDVKIILMNNEALGLVHQQ
QSLFYEQGVFAATYPGKINFMQIAAGFGLETCDLNNEADPQASLQEIINRPGPALIHVRIDAEEKVYPMVPPGAANTEMV
GE
>P27696 2.2.1.6~~~budB~~~Acetolactate synthase, catabolic~~~
MDKQYPVRQWAHGADLVVSQLEAQGVRQVFGIPGAKIDKVFDSLLDSSIRIIPVRHEANAAFMAAAVGRITGKAGVALVT
SGPGCSNLITGMATANSEGDPVVALGGAVKRADKAKQVHQSMDTVAMFSPVTKYAIEVTAPDALAEVVSNAFRAAEQGRP
GSAFVSLPQDVVDGPVSGKVLPASGAPQMGAAPDDAIDQVAKLIAQAKNPIFLLGLMASQPENSKALRRLLETSHIPVTS
TYQAAGAVNQDNFSRFAGRVGLFNNQAGDRLLQLADLVICIGYSPVEYEPAMWNSGNATLVHIDVLPAYEERNYTPDVEL
VGDIAGTLNKLAQNIDHRLVLSPQAAEILRDRQHQRELLDRRGAQLNQFALHPLRIVRAMQDIVNSDVTLTVDMGSFHIW
IARYLYTFRARQVMISNGQQTMGVALPWAIGAWLVNPERKVVSVSGDGGFLQSSMELETAVRLKANVLHLIWVDNGYNMV
AIQEEKKYQRLSGVEFGPMDFKAYAESFGAKGFAVESAEALEPTLRAAMDVDGPAVVAIPVDYRDNPLLMGQLHLSQIL
>Q81S27 1.1.1.86~~~ilvC2~~~Ketol-acid reductoisomerase (NADP(+)) 2~~~COG0059
MKTYYEQDANVGLLQGKTVAVIGYGSQGHAQAQNLRDSGVEVVVGVRPGKSFEVAKADGFEVMSVSEAVRTAQVVQMLLP
DEQQAHVYKAEVEENLREGQMLLFSHGFNIHFGQINPPSYVDVAMVAPKSPGHLVRRVFQEGNGVPALVAVHQDATGTAL
HVALAYAKGVGCTRAGVIETTFQEETETDLFGEQAVLCGGVTALVKAGFETLTEGGYRPEIAYFECLHELKLIVDLMYEG
GLTNMRHSISDTAEFGDYVTGSRIVTDETKKEMKRVLTEIQQGEFAKKWILENQAGRPTYNAMKKAEQNHQLEKVGEELR
EMMSWIHAPKELVKK
>C8WR67 1.1.1.86~~~ilvC~~~Ketol-acid reductoisomerase (NADP(+))~~~COG0059
MEKIYYDADISIQPLADKRIAVIGYGSQGHAHAQNLRDSGFDVVIGLRPGSSWAKAEADGFRVMAVGEAVEESDVIMILL
PDERQPAVYEREIRPYLTAGKALAFAHGFNIHFSQIQPPKDVDVFMVAPKGPGHLVRRVYEAGGGVPALIAVHQDASGQA
KDLALAYARGIGAGRAGILTTTFREETETDLFGEQAVLCGGLSALIKAGFETLVEAGYQPEIAYFECLHEMKLIVDLIYE
GGLEYMRYSISDTAQWGDFTSGPRIINEETKKEMRRILADIQSGAFAKSWILENQANRPMFNAINRRELEHPIEVVGRKL
RSMMPFIKAKRPGDDRVPATADRA
>C1DFH7 1.1.1.86~~~ilvC~~~Ketol-acid reductoisomerase (NADP(+))~~~COG0059
MKVYYDKDCDLSIIQSKKVAIIGYGSQGHAHACNLKDSGVDVYVGLRAGSASVAKAEAHGLTVKSVKDAVAAADVVMILT
PDEFQGRLYKDEIEPNLKKGATLAFAHGFSIHYNQVVPRADLDVIMIAPKAPGHTVRSEFVRGGGIPDLIAVYQDASGNA
KNLALSYACGVGGGRTGIIETTFKDETETDLFGEQAVLCGGCVELVKAGFETLVEAGYAPEMAYFECLHELKLIVDLMFE
GGIANMNYSISNNAEYGEYVTGPEVINEQSRQAMRNALKRIQDGEYAKMFITEGAANYPSMTAYRRNNAAHQIEVVGEKL
RTMMPWIAANKIVDKTKN
>P37253 1.1.1.86~~~ilvC~~~Ketol-acid reductoisomerase (NADP(+))~~~COG0059
MVKVYYNGDIKENVLAGKTVAVIGYGSQGHAHALNLKESGVDVIVGVRQGKSFTQAQEDGHKVFSVKEAAAQAEIIMVLL
PDEQQQKVYEAEIKDELTAGKSLVFAHGFNVHFHQIVPPADVDVFLVAPKGPGHLVRRTYEQGAGVPALFAIYQDVTGEA
RDKALAYAKGIGGARAGVLETTFKEETETDLFGEQAVLCGGLSALVKAGFETLTEAGYQPELAYFECLHELKLIVDLMYE
EGLAGMRYSISDTAQWGDFVSGPRVVDAKVKESMKEVLKDIQNGTFAKEWIVENQVNRPRFNAINASENEHQIEVVGRKL
REMMPFVKQGKKKEAVVSVAQN
>Q9PHN5 1.1.1.86~~~ilvC~~~Ketol-acid reductoisomerase (NADP(+))~~~COG0059
MAITVYYDKDCDLNLIKSKKVAIIGFGSQGHAHAMNLRDNGVNVTIGLREGSVSAVKAKNAGFEVVSVSEASKIADVIMI
LAPDEIQADIFNVEIKPNLSEGKAIAFAHGFNIHYGQIVVPKGVDVIMIAPKAPGHTVRNEFTLGGGTPCLIAIHQDESK
NAKNLALSYASAIGGGRTGIIETTFKAETETDLFGEQAVLCGGLSALIQAGFETLVEAGYEPEMAYFECLHEMKLIVDLI
YQGGIADMRYSISNTAEYGDYITGPKIITEETKKAMKGVLKDIQNGVFAKDFILERRAGFARMHAERKNMNDSLIEKTGR
NLRAMMPWISAKKLVDKDKN
>Q5HVD9 1.1.1.86~~~ilvC~~~Ketol-acid reductoisomerase (NADP(+))~~~
MAITVYYDKDCDLNLIKSKKVAIIGFGSQGHAHAMNLRDNGVNVTIGLREGSVSAVKAKNAGFEVMSVSEASKIADVIMI
LAPDEIQADIFNVEIKPNLSEGKAIAFAHGFNIHYGQIVVPKGVDVIMIAPKAPGHTVRNEFTLGGGTPCLIAIHQDESK
NAKNLALSYASAIGGGRTGIIETTFKAETETDLFGEQAVLCGGLSALIQAGFETLVEAGYEPEMAYFECLHEMKLIVDLI
YQGGIADMRYSISNTAEYGDYITGPKIITEETKKAMKGVLKDIQNGVFAKDFILERRAGFARMHAERKNMNDSLIEKTGR
NLRAMMPWISAKKLVDKDKN
>Q57179 1.1.1.86~~~ilvC~~~Ketol-acid reductoisomerase (NADP(+))~~~COG0059
MAIELLYDADADLSLIQGRKVAIVGYGSQGHAHSQNLRDSGVEVVIGLREGSKSAEKAKEAGFEVKTTAEAAAWADVIML
LAPDTSQAEIFTNDIEPNLNAGDALLFGHGLNIHFDLIKPADDIIVGMVAPKGPGHLVRRQFVDGKGVPCLIAVDQDPTG
TAQALTLSYAAAIGGARAGVIPTTFEAETVTDLFGEQAVLCGGTEELVKVGFEVLTEAGYEPEMAYFEVLHELKLIVDLM
FEGGISNMNYSVSDTAEFGGYLSGPRVIDADTKSRMKDILTDIQDGTFTKRLIANVENGNTELEGLRASYNNHPIEETGA
KLRDLMSWVKVDARAETA
>A8ZTR0 1.1.1.382~~~~~~Ketol-acid reductoisomerase (NAD(+))~~~COG0059
MPTINFGGVEENVVTSEEFTLKKAREVLKNEVITVLGYGVQGPAQALNLKDNGFEVIIGQLEGDAYWEKAIADGFVPGKT
LFPIEEAAKKGTIIKMLLSDAGQVAVWPKVKKCLKKGDALYFSHGFGIVYKDQTGIVPPKNVDVILVAPKGSGTNVRRNF
KDGSGINSSYAVFQDATGRAEERTIALGIAIGSGYLFPTTFEKEVFSDLTGERGVLMGCLAGTMEAQYNVLRKHGHSPSE
AFNETVEELTQSLIRLVAENGMDWMFANCSTTAQRGALDWAPKFRDAVAPVFDSLYRRVKNGAETRRVLKVNSAPNYLEK
LRKELDTIKNSEMWQAGAAVRALRPENRKKKK
>P05793 1.1.1.86~~~ilvC~~~Ketol-acid reductoisomerase (NADP(+))~~~COG0059
MANYFNTLNLRQQLAQLGKCRFMGRDEFADGASYLQGKKVVIVGCGAQGLNQGLNMRDSGLDISYALRKEAIAEKRASWR
KATENGFKVGTYEELIPQADLVINLTPDKQHSDVVRTVQPLMKDGAALGYSHGFNIVEVGEQIRKDITVVMVAPKCPGTE
VREEYKRGFGVPTLIAVHPENDPKGEGMAIAKAWAAATGGHRAGVLESSFVAEVKSDLMGEQTILCGMLQAGSLLCFDKL
VEEGTDPAYAEKLIQFGWETITEALKQGGITLMMDRLSNPAKLRAYALSEQLKEIMAPLFQKHMDDIISGEFSSGMMADW
ANDDKKLLTWREETGKTAFETAPQYEGKIGEQEYFDKGVLMIAMVKAGVELAFETMVDSGIIEESAYYESLHELPLIANT
IARKRLYEMNVVISDTAEYGNYLFSYACVPLLKPFMAELQPGDLGKAIPEGAVDNGQLRDVNEAIRSHAIEQVGKKLRGY
MTDMKRIAVAG
>B4U6I9 1.1.1.383~~~ilvC~~~Ketol-acid reductoisomerase (NAD(P)(+))~~~COG0059
MAKIYYDEDASLGILAMKTVAIVGYGSQGHAHALNLRDSGIRVIVALDDKSPHRKTAMEDGFSVYTTSRATQEADVIMIL
TPDTVQPAVYKECIEPNLTPGKAIAFAHGFNIHFGQIVPPKDIDVFMVAPKGPGHLVRWMYEEGKGVPALISIHQDATGS
CRDIALAYAKGIGATRAGVIETTFREETETDLFGEQAVLCGGATALIKAGFETLVEAGYQPEMAYFECLHELKLIVDLIY
QHGIAGMRYSISDTAKYGDVTRGDRVYEAVKPLMKQMLKEIQDGEFAREWILENQANRPVYNALLNKDKEHLVEKVGKEL
RQMMPWLSGKELK
>Q02138 1.1.1.86~~~ilvC~~~Ketol-acid reductoisomerase (NADP(+))~~~COG0059
MAVTMYYEDDVEVSALAGKQIAVIGYGSQGHAHAQNLRDSGHNVIIGVRHGKSFDKAKEDGFETFEVGEAVAKADVIMVL
APDELQQSIYEEDIKPNLKAGSALGFAHGFNIHFGYIKVPEDVDVFMVAPKAPGHLVRRTYTEGFGTPALFVSHQNASGH
AREIAMDWAKGIGCARVGIIETTFKEETEEDLFGEQAVLCGGLTALVEAGFETLTEAGYAGELAYFEVLHEMKLIVDLMY
EGGFTKMRQSISNTAEFGDYVTGPRIITDAVKKNMKLVLADIQSGKFAQDFVDDFKAGRPKLTAYREAAKNLEIEKIGAE
LRKAMPFTQSGDDDAFKIYQ
>A0QUX8 1.1.1.86~~~ilvC~~~Ketol-acid reductoisomerase (NADP(+))~~~COG0059
MAVEMFYDDDADLSIIQGRKVAVIGYGSQGHAHSLSLRDSGVQVKVGLKEGSKSREKAEEQGLEVDTPAEVAKWADVIML
LAPDTAQASIFTNDIEPNLEDGNALFFGHGLNIHFGLIKAPENVTVGMVAPKGPGHLVRRQFVDGKGVPCLIAIDQDPKG
EGQALALSYAAAIGGARAGVIKTTFKEETETDLFGEQAVLCGGTEELIKTGFEVMVEAGYAPEMAYFEVLHELKLIVDLI
YEGGIARMNYSVSDTAEFGGYLSGPRVIDADTKKRMQDILKDIQDGSFVKRLVANVEGGNKELEALRKANAEHPIEVTGK
KLRDLMSWVDRPITETA
>P9WKJ7 1.1.1.86~~~ilvC~~~Ketol-acid reductoisomerase (NADP(+))~~~COG0059
MALEMFYDDDADLSIIQGRKVGVIGYGSQGHAHSLSLRDSGVQVRVGLKQGSRSRPKVEEQGLDVDTPAEVAKWADVVMV
LAPDTAQAEIFAGDIEPNLKPGDALFFGHGLNVHFGLIKPPADVAVAMVAPKGPGHLVRRQFVDGKGVPCLVAVEQDPRG
DGLALALSYAKAIGGTRAGVIKTTFKDETETDLFGEQTVLCGGTEELVKAGFEVMVEAGYPAELAYFEVLHELKLIVDLM
YEGGLARMYYSVSDTAEFGGYLSGPRVIDAGTKERMRDILREIQDGSFVHKLVADVEGGNKQLEELRRQNAEHPIEVVGK
KLRDLMSWVDRPITETA
>Q9HVA2 1.1.1.86~~~ilvC~~~Ketol-acid reductoisomerase (NADP(+))~~~
MRVFYDKDCDLSIIQGKKVAIIGYGSQGHAHACNLKDSGVDVTVGLRSGSATVAKAEAHGLKVADVKTAVAAADVVMILT
PDEFQGRLYKEEIEPNLKKGATLAFAHGFSIHYNQVVPRADLDVIMIAPKAPGHTVRSEFVKGGGIPDLIAIYQDASGNA
KNVALSYACGVGGGRTGIIETTFKDETETDLFGEQAVLCGGCVELVKAGFETLVEAGYAPEMAYFECLHELKLIVDLMYE
GGIANMNYSISNNAEYGEYVTGPEVINAESRAAMRNALKRIQDGEYAKMFITEGAANYPSMTAYRRNNAAHPIEQIGEKL
RAMMPWIAANKIVDKSKN
>P05989 1.1.1.86~~~ilvC~~~Ketol-acid reductoisomerase (NADP(+))~~~
MANYFNTLNLRQQLAQLGKCRFMGRDEFADGASYLQGKKVVIVGCGAQGLNQGLNMRDSGLDISYALRKEAIAEKRASWR
KATENGFKVGTYEELIPQADLVVNLTPDKQHSDVVRSVQPLMKDGAALGYSHGFNIVEVGEQIRKDITVVMVAPKCPGTE
VREEYKRGFGVPTLIAVHPENDPQGEGMAIAKAWAAATGGHRAGVLESSFVAEVKSDLMGEQTILCGMLQAGSLLCFDKL
VAEGTDPAYAEKLIQFGWETITEALKQGGITLMMDRLSNPAKLRAYALSEQLKEIMAPLFQKHMDDIISGEFSSGMMADW
ANDDKKLLTWREETGKTAFETAPQYEGKIGEQEYFDKGVLMIAMVKAGVELAFETMVDSGIIEESAYYESLHELPLIANT
IARKRLYEMNVVISDTAEYGNYLFSYACVPLLKPFMAELQPGDLGSAIPEGAVDNAQLRDVNDAIRSHAIEQVGKKLRGY
MTDMKRIAVAG
>D0WGK0 1.1.1.86~~~ilvC~~~Ketol-acid reductoisomerase (NADP(+))~~~COG0059
MSVKTKEKEMAVTILYEQDVDPKVIQGLKVGIIGYGSQGHAHALNLMDSGVDVRVGLREGSSSWKTAEEAGLKVTDMDTA
AEEADVIMVLVPDEIQPKVYQEHIAAHLKAGNTLAFAHGFNIHYGYIVPPEDVNVIMCAPKGPGHIVRRQFTEGSGVPDL
ACVQQDATGNAWDIVLSYCWGVGGARSGIIKATFAEETEEDLFGEQAVLCGGLVELVKAGFETLTEAGYPPELAYFECYH
EMKMIVDLMYESGIHFMNYSISNTAEYGEYYAGPKVINEQSREAMKEILKRIQDGSFAQEFVDDCNNGHKRLLEQREAIN
THPIETTGAQIRSMFSWIKKED
>Q2FWK4 1.1.1.86~~~ilvC~~~Ketol-acid reductoisomerase (NADP(+))~~~COG0059
MTTVYYDQDVKTDALQGKKIAVVGYGSQGHAHAQNLKDNGYDVVIGIRPGRSFDKAKEDGFDVFPVAEAVKQADVIMVLL
PDEIQGDVYKNEIEPNLEKHNALAFAHGFNIHFGVIQPPADVDVFLVAPKGPGHLVRRTFVEGSAVPSLFGIQQGASGQA
RNIALSYAKGIGATRAGVIETTFKEETETDLFGEQAVLCGGVSKLIQSGFETLVEAGYQPELAYFEVLHEMKLIVDLMYE
GGMENVRYSISNTAEFGDYVSGPRVITPDVKENMKAVLTDIQNGNFSNRFIEDNKNGFKEFYKLREEQHGHQIEKVGREL
REMMPFIKSKSIEK
>Q04M32 1.1.1.86~~~ilvC~~~Ketol-acid reductoisomerase (NADP(+))~~~COG0059
MTVQMEYEKDVKVAALDGKKIAVIGYGSQGHAHAQNLRDSGRDVIIGVRPGKSFDKAKEDGFDTYTVAEATKLADVIMIL
APDEIQQELYEAEIAPNLEAGNAVGFAHGFNIHFEFIKVPADVDVFMCAPKGPGHLVRRTYEEGFGVPALYAVYQDATGN
AKNIAMDWCKGVGAARVGLLETTYKEETEEDLFGEQAVLCGGLTALIEAGFEVLTEAGYAPELAYFEVLHEMKLIVDLIY
EGGFKKMRQSISNTAEYGDYVSGPRVITEQVKENMKAVLADIQNGKFANDFVNDYKAGRPKLTAYREQAANLEIEKVGAE
LRKAMPFVGKNDDDAFKIYN
>Q0AV19 1.1.1.383~~~ilvC~~~Ketol-acid reductoisomerase (NAD(P)(+))~~~COG0059
MARMFYDADANLENLKGKTIAVMGFGSQGHAQAQNLKESGLNVIVGLRKPFDEASEKEWNAVIAAGITPMSVAEAAEAAD
VIQILLPDEVQARVYNAEIKPYLKAGNALGFSHGFNIHFGQIVPPAFVDVFMVAPKSPGHLVRRMYVKGAGVPGLVAVQQ
DYSGKAKDLALAYACGIGCTRAGVIETSFQEETETDLFGEQCVLCGGVTELVKAGFETLVEAGYQPEIAYFECMHELKLI
VDLMYEGGMSYMRYSISDTAEWGDYTKGPEIIGEEARYAMYEALQDIQDGSFAKGWLLENMVGRPRFNALKRQNREHLIE
EVGAELRGMMPWLKETK
>P29107 1.1.1.86~~~ilvC~~~Ketol-acid reductoisomerase (NADP(+))~~~COG0059
MARMYYDQDANLDLLAGKTVAIIGYGSQGHAHALNLKDSGVNVVVGLYSGSKSVAKAEGAGLKVLSVAEAAKAADLIMIL
LPDEVQKTVYEAEIAPNLVAGNVLLFAHGFNINFAQIVPPADVDVVMAAPKGPGHLVRRTYEQGQGVPALFAVYQDASGQ
ARDYAMAYAKGIGGTRAGILETTFREETETDLFGEQVVLCGGLTALIKAGFDTLVEAGYQPELAYFECLHEVKLIVDLIV
EGGLAKMRDSISNTAEYGDLTRGPRIVTEETKAEMRQILDEIQSGQFAREFVLENQAGKPGFTAMRRRESEELIEEVGKD
LRAMFSWLKDR
>K4LVZ1 1.1.1.382~~~ilvC~~~Ketol-acid reductoisomerase (NAD(+))~~~COG0059
MKIYYDQDADLQYLDGKTVAVIGYGSQGHAQSQNLRDSGVKVVVADIPSSENWKKAEEAQFQPLTADEAAREADIIQILV
PDEKQAALYRESIAPNLRPGKALVFSHGFNIHFKQIVPPPDVDVFMVAPKGPGHLVRRMYEEGAGVPSLVAVEQDYSGQA
LNLALAYAKGIGATRAGVIQTTFKEETETDLFGEQAVLCGGITELIRAGFDTLVDAGYQPEIAYFECLHEMKLIVDLIYE
GGISTMRYSISDTAEYGDLTRGKRIITEATREEMKKILKEIQDGVFAREWLLENQVGRPVYNALRRKEQNHLIETVGARL
RGMMPWLKKKVI
>P51785 4.2.1.9~~~ilvD~~~Dihydroxy-acid dehydratase~~~COG0129
MAELRSNMITQGIDRAPHRSLLRAAGVKEEDFGKPFIAVCNSYIDIVPGHVHLQEFGKIVKEAIREAGGVPFEFNTIGVD
DGIAMGHIGMRYSLPSREIIADSVETVVSAHWFDGMVCIPNCDKITPGMLMAAMRINIPTIFVSGGPMAAGRTSDGRKIS
LSSVFEGVGAYQAGKINENELQELEQFGCPTCGSCSGMFTANSMNCLSEALGLALPGNGTILATSPERKEFVRKSAAQLM
ETIRKDIKPRDIVTVKAIDNAFALDMALGGSTNTVLHTLALANEAGVEYSLERINEVAERVPHLAKLAPASDVFIEDLHE
AGGVSAALNELSKKEGALHLDALTVTGKTLGETIAGHEVKDYDVIHPLDQPFTEKGGLAVLFGNLAPDGAIIKTGGVQNG
ITRHEGPAVVFDSQDEALDGIINRKVKEGDVVIIRYEGPKGGPGMPEMLAPTSQIVGMGLGPKVALITDGRFSGASRGLS
IGHVSPEAAEGGPLAFVENGDHIIVDIEKRILDVQVPEEEWEKRKANWKGFEPKVKTGYLARYSKLVTSANTGGIMKI
>P55186 4.2.1.9~~~ilvD~~~Dihydroxy-acid dehydratase~~~COG0129
MPPYRSRTTTHGRNMAGARGLWRATGMKDEDFGKPIIAVANSFTQFVPGHVHLKDLGQLVAREIEAAGGVAKEFNTIAVD
DGIAMGHGGMLYSLPSRDLIADSVEYMVNAHCADAIVCISNCDKITPGMLMAAMRLNIPVVFVSGGPMEAGKVTVKGKIR
ALDLVDAMVVAADDSYSDEEVEAIEKAACPTCGSCSGMFTANSMNCLTEALGLSLPGNGSVLATHADREALFKEAGRVVV
DLCQRWYEQEDATALPRGIATRAAFENAMSLDIAMGGSTNTVLHLLAAAHEGGIDFSMADIDRLSRHVPCLSKVAPAKSD
VHMEDVHRAGGVMAILGELERGGLIDASQPTVHAPTMGEALARWDIGRTNSQIAHEFFKAAPGGKPTQVAFSQAARWEEL
DLDRENGVIRSVEHPFSKDGGLAVLFGNLAPEGCIVKTAGVDESILTFRGTARVFESQDAAVSGILGGQVKAGEVVVIRY
EGPKGGPGMQEMLYPTTYLKSKGLGAACALVTDGRFSGGTSGLSIGHVSPEAGEGGLIALVETGDPILIDIPTRGITLEV
SDAVLAARREAQLARGKDAWTPLNRKRDLTPALRAYAAMTTNAARGAVRDVSQIERG
>P05791 4.2.1.9~~~ilvD~~~Dihydroxy-acid dehydratase~~~COG0129
MPKYRSATTTHGRNMAGARALWRATGMTDADFGKPIIAVVNSFTQFVPGHVHLRDLGKLVAEQIEAAGGVAKEFNTIAVD
DGIAMGHGGMLYSLPSRELIADSVEYMVNAHCADAMVCISNCDKITPGMLMASLRLNIPVIFVSGGPMEAGKTKLSDQII
KLDLVDAMIQGADPKVSDSQSDQVERSACPTCGSCSGMFTANSMNCLTEALGLSQPGNGSLLATHADRKQLFLNAGKRIV
ELTKRYYEQNDESALPRNIASKAAFENAMTLDIAMGGSTNTVLHLLAAAQEAEIDFTMSDIDKLSRKVPQLCKVAPSTQK
YHMEDVHRAGGVIGILGELDRAGLLNRDVKNVLGLTLPQTLEQYDVMLTQDDAVKNMFRAGPAGIRTTQAFSQDCRWDTL
DDDRANGCIRSLEHAYSKDGGLAVLYGNFAENGCIVKTAGVDDSILKFTGPAKVYESQDDAVEAILGGKVVAGDVVVIRY
EGPKGGPGMQEMLYPTSFLKSMGLGKACALITDGRFSGGTSGLSIGHVSPEAASGGSIGLIEDGDLIAIDIPNRGIQLQV
SDAELAARREAQDARGDKAWTPKNRERQVSFALRAYASLATSADKGAVRDKSKLGG
>P9WKJ5 4.2.1.9~~~ilvD~~~Dihydroxy-acid dehydratase~~~COG0129
MPQTTDEAASVSTVADIKPRSRDVTDGLEKAAARGMLRAVGMDDEDFAKPQIGVASSWNEITPCNLSLDRLANAVKEGVF
SAGGYPLEFGTISVSDGISMGHEGMHFSLVSREVIADSVEVVMQAERLDGSVLLAGCDKSLPGMLMAAARLDLAAVFLYA
GSILPGRAKLSDGSERDVTIIDAFEAVGACSRGLMSRADVDAIERAICPGEGACGGMYTANTMASAAEALGMSLPGSAAP
PATDRRRDGFARRSGQAVVELLRRGITARDILTKEAFENAIAVVMAFGGSTNAVLHLLAIAHEANVALSLQDFSRIGSGV
PHLADVKPFGRHVMSDVDHIGGVPVVMKALLDAGLLHGDCLTVTGHTMAENLAAITPPDPDGKVLRALANPIHPSGGITI
LHGSLAPEGAVVKTAGFDSDVFEGTARVFDGERAALDALEDGTITVGDAVVIRYEGPKGGPGMREMLAITGAIKGAGLGK
DVLLLTDGRFSGGTTGLCVGHIAPEAVDGGPIALLRNGDRIRLDVAGRVLDVLADPAEFASRQQDFSPPPPRYTTGVLSK
YVKLVSSAAVGAVCG
>P74689 4.2.1.9~~~ilvD~~~Dihydroxy-acid dehydratase~~~COG0129
MSNNPRSQVITQGTQRSPNRAMLRAVGFGDDDFTKPIVGIANGYSTITPCNMGINDLALRAEAGLRTAGAMPQLFGTITI
SDGISMGTEGMKYSLVSREVIADSIETVCNGQRMDGVLAIGGCDKNMPGAMIAMARLNIPSIFVYGGTIKPGHYAGEDLT
VVSAFEAVGQYSAGKIDEETLYGIERNACPGAGSCGGMFTANTMSSAFEAMGMSLPYSSTMAAVDGEKADSTEESAKVLV
EAIKKQILPSQILTRKAFENAIAVIMAVGGSTNAVLHLLAIANTIGVPLSLDDFETIRHKVPVLCDLKPSGKYVTTNLHA
AGGIPQVMKILLVNGILHGDALTITGQTIAEVLADIPDQPPAGQDVIHSWDDPVYQEGHLAVLKGNLATEGSVAKISGVK
KPVITGPAKVFESEEDCLEAILAGKIQAGDVVVVRYEGPKGGPGMREMLAPTSAIIGAGLGDSVGLITDGRFSGGTYGLV
VGHVAPEAYVGGAIALVQEGDQITIDAGKRLLQLNISEEELAQRRAQWTPPQPRYPRGILAKYAKLVSSSSLGAVTDIDL
F
>O31461 2.6.1.42~~~ilvE~~~Branched-chain-amino-acid transaminase 1~~~COG0115
MNKLIEREKTVYYKEKPDPSSLGFGQYFTDYMFVMDYEEGIGWHHPRIAPYAPLTLDPSSSVFHYGQAVFEGLKAYRTDD
GRVLLFRPDQNIKRLNRSCERMSMPPLDEELVLEALTQLVELEKDWVPKEKGTSLYIRPFVIATEPSLGVKASRSYTFMI
VLSPVGSYYGDDQLKPVRIYVEDEYVRAVNGGVGFAKTAGNYAASLQAQRKANELGYDQVLWLDAIEKKYVEEVGSMNIF
FVINGEAVTPALSGSILSGVTRASAIELIRSWGIPVREERISIDEVYAASARGELTEVFGTGTAAVVTPVGELNIHGKTV
IVGDGQIGDLSKKLYETITDIQLGKVKGPFNWTVEV
>P39576 2.6.1.42~~~ilvK~~~Branched-chain-amino-acid aminotransferase 2~~~COG0115
MTKQTIRVELTSTKKPKPDPNQLSFGRVFTDHMFVMDYAADKGWYDPRIIPYQPLSMDPAAMVYHYGQTVFEGLKAYVSE
DDHVLLFRPEKNMERLNQSNDRLCIPQIDEEQVLEGLKQLVAIDKDWIPNAEGTSLYIRPFIIATEPFLGVAASHTYKLL
IILSPVGSYYKEGIKPVKIAVESEFVRAVKGGTGNAKTAGNYASSLKAQQVAEEKGFSQVLWLDGIEKKYIEEVGSMNIF
FKINGEIVTPMLNGSILEGITRNSVIALLKHWGLQVSERKIAIDEVIQAHKDGILEEAFGTGTAAVISPVGELIWQDETL
SINNGETGEIAKKLYDTITGIQKGAVADEFGWTTEVAALTESK
>P0AB80 2.6.1.42~~~ilvE~~~Branched-chain-amino-acid aminotransferase~~~COG0115
MTTKKADYIWFNGEMVRWEDAKVHVMSHALHYGTSVFEGIRCYDSHKGPVVFRHREHMQRLHDSAKIYRFPVSQSIDELM
EACRDVIRKNNLTSAYIRPLIFVGDVGMGVNPPAGYSTDVIIAAFPWGAYLGAEALEQGIDAMVSSWNRAAPNTIPTAAK
AGGNYLSSLLVGSEARRHGYQEGIALDVNGYISEGAGENLFEVKDGVLFTPPFTSSALPGITRDAIIKLAKELGIEVREQ
VLSRESLYLADEVFMSGTAAEITPVRSVDGIQVGEGRCGPVTKRIQQAFFGLFTGETEDKWGWLDQVNQ
>A0R066 2.6.1.42~~~ilvE~~~Branched-chain-amino-acid aminotransferase~~~COG0115
MNSGPLEFTVSANTNPATDAVRESILANPGFGKYYTDHMVSIDYTVDEGWHNAQVIPYGPIQLDPSAIVLHYGQEIFEGL
KAYRWADGSIVSFRPEANAARLQSSARRLAIPELPEEVFIESLRQLIAVDEKWVPPAGGEESLYLRPFVIATEPGLGVRP
SNEYRYLLIASPAGAYFKGGIKPVSVWLSHEYVRASPGGTGAAKFGGNYAASLLAQAQAAEMGCDQVVWLDAIERRYVEE
MGGMNLFFVFGSGGSARLVTPELSGSLLPGITRDSLLQLATDAGFAVEERKIDVDEWQKKAGAGEITEVFACGTAAVITP
VSHVKHHDGEFTIADGQPGEITMALRDTLTGIQRGTFADTHGWMARLN
>P9WQ75 2.6.1.42~~~ilvE~~~Branched-chain-amino-acid aminotransferase~~~COG0115
MTSGSLQFTVLRAVNPATDAQRESMLREPGFGKYHTDHMVSIDYAEGRGWHNARVIPYGPIELDPSAIVLHYAQEVFEGL
KAYRWADGSIVSFRADANAARLRSSARRLAIPELPDAVFIESLRQLIAVDKAWVPGAGGEEALYLRPFIFATEPGLGVRP
ATQYRYLLIASPAGAYFKGGIAPVSVWVSTEYVRACPGGTGAAKFGGNYAASLLAQAEAAENGCDQVVWLDAVERRYIEE
MGGMNIFFVLGSGGSARLVTPELSGSLLPGITRDSLLQLAIDAGFAVEERRIDIDEWQKKAAAGEITEVFACGTAAVITP
VARVRHGASEFRIADGQPGEVTMALRDTLTGIQRGTFADTHGWMARLG
>O86428 2.6.1.42~~~ilvE~~~Branched-chain-amino-acid aminotransferase~~~
MSMADRDGVIWYDGELVQWRDATTHVLTHTLHYGMGVFEGVRAYDTPQGTAIFRLQAHTDRLFDSAHIMNMQIPYSRDEI
NEATRAAVRENNLESAYIRPMVFYGSEGMGLRASGLKVHVIIAAWSWGAYMGEEALQQGIKVRTSSFTRHHVNISMTRAK
SNGAYINSMLALQEAISGGADEAMMLDPEGYVAEGSGENIFIIKDGVIYTPEVTACLNGITRNTILTLAAEHGFKLVEKR
ITRDEVYIADEAFFTGTAAEVTPIREVDGRKIGAGRRGPVTEKLQKAYFDLVSGKTEAHAEWRTLVK
>P0A1A5 2.6.1.42~~~ilvE~~~Branched-chain-amino-acid aminotransferase~~~
MTTKKADYIWFNGEMVRWEDAKVHVMSHALHYGTSVFEGIRCYDSHKGPVVFRHREHMQRLRDSAKIYRFPVSQSIDELM
EACRDVIRKNNLTSAYIRPLVFVGDVGMGVNPPPGYTTDVIIAAFPWGAYLGAEALDQGIDAMVSSWNRAAPNTIPTAAK
AGGNYLSSLLVGSEARRHGYQEGIALDVNGYISEGAGENLFEVKDGVLFTPPFTSSALPGITRDAIIKLAKELGIEVREQ
VLSRESLYLADEVFMSGTAAEITPVRSVDGIQVGEGRCGPVTKRIQQAFFGLFTGETEDKWGWLDPVNS
>P99138 2.6.1.42~~~ilvE~~~Probable branched-chain-amino-acid aminotransferase~~~
MSQAVKVERRETLKQKPNTSQLGFGKYFTDYMLSYDYDADKGWHDLKIVPYGPIEISPAAQGVHYGQSVFEGLKAYKRDG
EVALFRPEENFKRLNNSLARLEMPQVDEAELLEGLKQLVDIERDWIPEGEGQSLYIRPFVFATEGALGVGASHQYKLLII
LSPSGAYYGGETLKPTKIYVEDEYVRAVRGGVGFAKVAGNYAASLLAQTNANKLGYDQVLWLDGVEQKYIEEVGSMNIFF
VENGKVITPELNGSILPGITRKSIIELAKNLGYEVEERRVSIDELFESYDKGELTEVFGSGTAAVISPVGTLRYEDREIV
INNNETGEITQKLYDVYTGIQNGTLEDKNGWRVVVPKY
>P74921 2.6.1.42~~~ilvE~~~Probable branched-chain-amino-acid aminotransferase~~~COG0115
MLIWWRGKFRRADEISLDFSLFEKSLQGAVYETLRTYSRAPFAAYKHYTRLKRSADFFNLPLSLSFDEFTKVLKAGADEF
KQEVRIKVYLFPDSGEVLFVFSPLNIPDLETGVEVKISNVRRIPDLSTPPALKITGRTDIVLARREIVDCYDVILLGLNG
QVCEGSFSNVFLVKEGKLITPSLDSGILDGITRENVIKLAKSLEIPVEERVVWVWELFEADEMFLTHTSAGVVPVRRLNE
HSFFEEEPGPVTATLMENFEPFVLNLEENWVGI
>P0DP90 2.2.1.6~~~ilvG~~~Acetolactate synthase isozyme 2 large subunit~~~
MNGAQWVVHALRAQGVNTVFGYPGGAIMPVYDALYDGGVEHLLCRHEQGAAMAAIGYARATGKTGVCIATSGPGATNLIT
GLADALLDSIPVVAITGQVSAPFIGTDAFQEVDVLGLSLACTKHSFLVQSLEELPRIMAEAFDVACSGRPGPVLVDIPKD
IQLASGDLEPWFTTVENEVTFPHAEVEQARQMLAKAQKPMLYVGGGVGMAQAVPALREFLAATKMPATCTLKGLGAVEAD
YPYYLGMLGMHGTKAANFAVQECDLLIAVGARFDDRVTGKLNTFAPHASVIHMDIDPAEMNKLRQAHVALQGDLNALLPA
LQQPLNQYDWQQHCAQLRDEHSWRYDHPGDAIYAPLLLKQLSDRKPADCVVTTDVGQHQMWAAQHIAHTRPENFITSSGL
GTMGFGLPAAVGAQVARPNDTVVCISGDGSFMMNVQELGTVKRKQLPLKIVLLDNQRLGMVRQWQQLFFQERYSETTLTD
NPDFLMLASAFGIHGQHITRKDQVEAALDTMLNSDGPYLLHVSIDELENVWPLVPPGASNSEMLEKLS
>P9WG39 2.2.1.6~~~ilvG~~~Acetolactate synthase large subunit IlvG~~~COG0028
MSTDTAPAQTMHAGRLIARRLKASGIDTVFTLSGGHLFSIYDGCREEGIRLIDTRHEQTAAFAAEGWSKVTRVPGVAALT
AGPGITNGMSAMAAAQQNQSPLVVLGGRAPALRWGMGSLQEIDHVPFVAPVARFAATAQSAENAGLLVDQALQAAVSAPS
GVAFVDFPMDHAFSMSSDNGRPGALTELPAGPTPAGDALDRAAGLLSTAQRPVIMAGTNVWWGHAEAALLRLVEERHIPV
LMNGMARGVVPADHRLAFSRARSKALGEADVALIVGVPMDFRLGFGGVFGSTTQLIVADRVEPAREHPRPVAAGLYGDLT
ATLSALAGSGGTDHQGWIEELATAETMARDLEKAELVDDRIPLHPMRVYAELAALLERDALVVIDAGDFGSYAGRMIDSY
LPGCWLDSGPFGCLGSGPGYALAAKLARPQRQVVLLQGDGAFGFSGMEWDTLVRHNVAVVSVIGNNGIWGLEKHPMEALY
GYSVVAELRPGTRYDEVVRALGGHGELVSVPAELRPALERAFASGLPAVVNVLTDPSVAYPRRSNLA
>P00894 2.2.1.6~~~ilvH~~~Acetolactate synthase isozyme 3 small subunit~~~COG0440
MRRILSVLLENESGALSRVIGLFSQRGYNIESLTVAPTDDPTLSRMTIQTVGDEKVLEQIEKQLHKLVDVLRVSELGQGA
HVEREIMLVKIQASGYGRDEVKRNTEIFRGQIIDVTPSLYTVQLAGTSGKLDAFLASIRDVAKIVEVARSGVVGLSRGDK
IMR
>A0QUX7 2.2.1.6~~~ilvH~~~Acetolactate synthase small subunit~~~COG0440
MSNGTPTHTLSVLVEDKPGVLARVSSLFSRRGFNIQSLAVGATEQKDMSRMTIVVSVEDSPLEQITKQLNKLINVIKIVE
QEEDNSVSRELALIKVRADATTRGQIIEAVNLFRAKVVDVSTESLTIEATGTPEKLEALLRVLEPYGIREIAQSGVVSVS
RGPRGIGAAK
>P9WKJ3 2.2.1.6~~~ilvH~~~Putative acetolactate synthase small subunit~~~COG0440
MSPKTHTLSVLVEDKPGVLARVAALFSRRGFNIESLAVGATECKDRSRMTIVVSAEDTPLEQITKQLNKLINVIKIVEQD
DEHSVSRELALIKVQADAGSRSQVIEAVNLFRANVIDVSPESLTVEATGNRGKLEALLRVLEPFGIREIAQSGMVSLSRG
PRGIGTAK
>P00893 2.2.1.6~~~ilvI~~~Acetolactate synthase isozyme 3 large subunit~~~COG0028
MEMLSGAEMVVRSLIDQGVKQVFGYPGGAVLDIYDALHTVGGIDHVLVRHEQAAVHMADGLARATGEVGVVLVTSGPGAT
NAITGIATAYMDSIPLVVLSGQVATSLIGYDAFQECDMVGISRPVVKHSFLVKQTEDIPQVLKKAFWLAASGRPGPVVVD
LPKDILNPANKLPYVWPESVSMRSYNPTTTGHKGQIKRALQTLVAAKKPVVYVGGGAITAGCHQQLKETVEALNLPVVCS
LMGLGAFPATHRQALGMLGMHGTYEANMTMHNADVIFAVGVRFDDRTTNNLAKYCPNATVLHIDIDPTSISKTVTADIPI
VGDARQVLEQMLELLSQESAHQPLDEIRDWWQQIEQWRARQCLKYDTHSEKIKPQAVIETLWRLTKGDAYVTSDVGQHQM
FAALYYPFDKPRRWINSGGLGTMGFGLPAALGVKMALPEETVVCVTGDGSIQMNIQELSTALQYELPVLVVNLNNRYLGM
VKQWQDMIYSGRHSQSYMQSLPDFVRLAEAYGHVGIQISHPHELESKLSEALEQVRNNRLVFVDVTVDGSEHVYPMQIRG
GGMDEMWLSKTERT
>P0ADG1 2.2.1.6~~~ilvM~~~Acetolactate synthase isozyme 2 small subunit~~~COG3978
MMQHQVNVSARFNPETLERVLRVVRHRGFHVCSMNMAAASDAQNINIELTVASPRSVDLLFSQLNKLVDVAHVAICQSTT
TSQQIRA
>P0ADF8 2.2.1.6~~~ilvN~~~Acetolactate synthase isozyme 1 small subunit~~~COG0440
MQNTTHDNVILELTVRNHPGVMTHVCGLFARRAFNVEGILCLPIQDSDKSHIWLLVNDDQRLEQMISQIDKLEDVVKVQR
NQSDPTMFNKIAVFFQ
>Q04789 2.2.1.6~~~alsS~~~Acetolactate synthase~~~COG0028
MTKATKEQKSLVKNRGAELVVDCLVEQGVTHVFGIPGAKIDAVFDALQDKGPEIIVARHEQNAAFMAQAVGRLTGKPGVV
LVTSGPGASNLATGLLTANTEGDPVVALAGNVIRADRLKRTHQSLDNAALFQPITKYSVEVQDVKNIPEAVTNAFRIASA
GQAGAAFVSFPQDVVNEVTNTKNVRAVAAPKLGPAADDAISAAIAKIQTAKLPVVLVGMKGGRPEAIKAVRKLLKKVQLP
FVETYQAAGTLSRDLEDQYFGRIGLFRNQPGDLLLEQADVVLTIGYDPIEYDPKFWNINGDRTIIHLDEIIADIDHAYQP
DLELIGDIPSTINHIEHDAVKVEFAEREQKILSDLKQYMHEGEQVPADWKSDRAHPLEIVKELRNAVDDHVTVTCDIGSH
AIWMSRYFRSYEPLTLMISNGMQTLGVALPWAIGASLVKPGEKVVSVSGDGGFLFSAMELETAVRLKAPIVHIVWNDSTY
DMVAFQQLKKYNRTSAVDFGNIDIVKYAESFGATGLRVESPDQLADVLRQGMNAEGPVIIDVPVDYSDNINLASDKLPKE
FGELMKTKAL
>O53554 2.2.1.6~~~ilvX~~~Putative acetolactate synthase large subunit IlvX~~~COG0028
MNGAQALINTLVDGGVDVCFANPGTSEMHFVAALDAVPRMRGMLTLFEGVATGAADGYARIAGRPAAVLLHLGPGLGNGL
ANLHNARRARVPMVVVVGDHATYHKKYDAPLESDIDAVAGTVSGWVRRTEAAADVGADAEAAIAASRSGSQIATLILPAD
VCWSDGAHAAAGVPAQAAAAPVDVGPVAGVLRSGEPAMMLIGGDATRGPGLTAAARIVQATGARWLCETFPTCLERGAGI
PAVERLAYFAEGAAAQLDGVKHLVLAGARSPVSFFAYPGMPSDLVPAGCEVHVLAEPGGAADALAALADEVAPGTVAPVA
GASRPQLPTGDLTSVSAADVVGALLPERAIVVDESNTCGVLLPQATAGAPAHDWLTLTGGAIGYGIPAAVGAAVAAPDRP
VLCLESDGSAMYTISGLWSQARENLDVTTVIYNNGAYDILRIELQRVGAGSDPGPKALDLLDISRPTMDFVKIAEGMGVP
ARRVTTCEEFADALRAAFAEPGPHLIDVVVPSLVG
>A6M2W4 ~~~~~~D-galactonate dehydratase family member Cbei_4837~~~COG4948
MEPTIITDVLCYITKPDRHNLVVVKVETNKGIYGLGCATFQQRPKAVSLVVSEYLKPILIGRDANNIEDLWQMMMVNSYW
RNGPILNNAISGVDMALWDIKGKLANMPLYQLFGGKSRDAIAAYTHAVADNLEDLYTEIDEIRKKGYQHIRCQLGFYGGN
SSEFHTTDNPTQGSYFDQDEYMRTTVSMFSSLREKYGYKFHILHDVHERLFPNQAVQFAKDVEKYKPYFIEDILPPDQNE
WLGQIRSQTSTPLATGELFNNPMEWKSLIANRQVDFIRCHVSQIGGITPALKLGSLCAAFGVRIAWHTPSDITPIGVAVN
IHLNINLHNAAIQENIEINDNTRCVFSGIPEAKNGFFYPIESPGIGVDIDENEIIKYPVEYRPHEWTQSRIPDGTIVTP
>A4W7D6 ~~~~~~D-galactonate dehydratase family member Ent638_0932~~~COG4948
MTPVIIKNIECFITRPDRHNLVTVRVTTEQGITGHGCATFQQRPLAVKTLVDEYLQPLMIGRDANNIEDLWQMMNVNAYW
RNGPLMNNAISGVDMALWDIKGQLAGMPLYQLFGGKSRDAIPAYSHASGETLEALFASVDALIAQGYRHIRCQLGFYGGT
PSALHAPDNPTPGAWFDQQEYMSNTVEMFHALREKYGWKLHILHDVHERLFPQQAVQLAKQLEPFQPYFIEDILPPQQSA
WLEQVRQQSCVPLALGELFNNPAEWHDLIVNRRIDFIRCHVSQIGGITPALKLAHLCQAFGVRLAWHGPGDMTPIGVAVN
THLNIHLHNAAIQEFIPRSATTNDVFPGAPEVKEGFVYPPVQPGIGVGFNEALALAHPVLYRPHEWTQSRLPDGTIHTP
>A5KUH4 ~~~~~~D-galactonate dehydratase family member VSWAT3_13707~~~
MKETIISDIHCIITKPDRHNLITVVVETNEGVTGFGCATFQQRPLAVKTMVDEYLKPILIGKNANNIEDLWQMMMVNAYW
RNGPVINNAISGVDMALWDIKAKLAGMPLHQLFGGKSRDAIPVYTHATSDTMEGIYDLVEGFLEKGYKHIRCQLGFYGGV
PTDLHTTQNPTEGSYYDQDQYMDNTLTMFKSLREKYGNQFHILHDVHERLFPNQAIQFAKEVEQYKPYFIEDILPPNQTE
WLDNIRSQSSVSLGLGELFNNPEEWKSLIANRRIDFIRCHVSQIGGITPALKLGHLCQNFGVRIAWHCPPDMTPIGAAVN
THLNVHLHNAAIQEHVEYNGNTHKVFPNAAEPINGYLYASEIAGIGVEIDREAAAEFPVMYRPHEWTQSRLPDGAIHTP
>D0X4R4 ~~~~~~D-galactonate dehydratase family member VME_00770~~~
MNKNTISNIECVITKPDRHNLITVIVETESGVTGYGCATFQQRPLAVKTMVDEYLKPLLIGKDANNIEDLWQMMMVNAYW
RNGPVINNAISGVDMALWDIKAKIANMPLHQLFGGKSRDAIQVYTHATSDTMEGLYEQVDKYLEQGYQHIRCQLGFYGGV
PENIQTAQNPTQGSYYDQDQYIENTVEMFKNLREKYGKQFHILHDVHERLFPNQAIQFAKQIEQYNPFFIEDILPPSQTE
WLDNIRNQSSVSLALGELFNNPEEWKALIINRRVDFIRCHVSQIGGITPALKLGHFCESFGVRIAWHCPPDMTPIGAAVN
THLNVHLHNAAIQEHVEYKANTQRVFPNAAEPINGYLYASEIAGIGVEMDREAAQDFPVEYRPHEWTQSRLPDGSIHTP
>Q747K6 ~~~imcH~~~Cytochrome c-type protein ImcH~~~COG3005
MTLRKTAGYLWNPISLIGFLLAVVATGLIIAFIAMEMITGIDHPYIGLLVYFAFPGMLILGLILVPIGAWRVRNQRRTEV
PEEVPPYPRVDFNDPHKRRLFIFFVLASVIFVLIVSVASILGFEFTESTTFCGELCHVVMEPEHKAWQGSPHARVKCVEC
HVGPGAEWYVKAKLSGLRQVWAVLTHSYHFPIATPIENLRPARDTCEQCHWPEKFYSGRQRVFYHYAPNKENTPREINML
IKIGGTPKSPHAMGIHWHIGTEVTYIARDRKRLDIPYVAVKQKDGSIVEYMDTEKPLTREEIAKAEKRRMDCIDCHNRPT
HIYRSPAREMDEHIVSGQIDAGLPYIKKVAVEILEQPYKSKEEAHAAIEAKLPEYYAKNFPEVAKVKAAAINQAVAHVKD
IYSRNFFPRMKVTWSTYPNHIGHFYTPGCFRCHDGKHKTSTGKIISKDCNMCHEMIGQKGENIPEGKVVKEFVHPADIGD
ALYNVNCSDCHMAAAEDSAGGEGPGKH
>P21879 1.1.1.205~~~guaB~~~Inosine-5'-monophosphate dehydrogenase~~~COG0516
MWESKFSKEGLTFDDVLLVPAKSEVLPRDVDLSVELTKTLKLNIPVISAGMDTVTESAMAIAMARQGGLGIIHKNMSIEQ
QAEQVDKVKRSERGVITNPFFLTPDHQVFDAEHLMGKYRISGVPIVNNEEDQKLVGIITNRDLRFISDYSMKISDVMTKE
ELVTASVGTTLDEAEKILQKHKIEKLPLVDDQNKLKGLITIKDIEKVIEFPNSSKDIHGRLIVGAAVGVTGDTMTRVKKL
VEANVDVIVIDTAHGHSQGVLNTVTKIRETYPELNIIAGNVATAEATRALIEAGADVVKVGIGPGSICTTRVVAGVGVPQ
ITAIYDCATEARKHGKTIIADGGIKFSGDITKALAAGGHAVMLGSLLAGTSESPGETEIYQGRRFKVYRGMGSVAAMEKG
SKDRYFQEENKKFVPEGIEGRTPYKGPVEETVYQLVGGLRSGMGYCGSKDLRALREEAQFIRMTGAGLRESHPHDVQITK
ESPNYTIS
>P49058 1.1.1.205~~~guaB~~~Inosine-5'-monophosphate dehydrogenase~~~
MPNKITKEALTFDDVSLIPRKSSVLPSEVSLKTQLTKNISLNIPFLSSAMDTVTESQMAIAIAKEGGIGIIHKNMSIEAQ
RKEIEKVKTYKFQKTINTNGDTNEQKPEIFTAKQHLEKSDAYKNAEHKEDFPNACKDLNNKLRVGAAVSIDIDTIERVEE
LVKAHVDILVIDSAHGHSTRIIELIKKIKTKYPNLDLIAGNIVTKEAALDLISVGADCLKVGIGPGSICTTRIVAGVGVP
QITAICDVYEACNNTNICIIADGGIRFSGDVVKAIAAGADSVMIGNLFAGTKESPSEEIIYNGKKFKSYVGMGSISAMKR
GSKSRYFQLENNEPKKLVPEGIEGMVPYSGKLKDILTQLKGGLMSGMGYLGAATISDLKINSKFVKISHSSLKESHPHDV
FSIT
>P0ADG7 1.1.1.205~~~guaB~~~Inosine-5'-monophosphate dehydrogenase~~~COG0516
MLRIAKEALTFDDVLLVPAHSTVLPNTADLSTQLTKTIRLNIPMLSAAMDTVTEARLAIALAQEGGIGFIHKNMSIERQA
EEVRRVKKHESGVVTDPQTVLPTTTLREVKELTERNGFAGYPVVTEENELVGIITGRDVRFVTDLNQPVSVYMTPKERLV
TVREGEAREVVLAKMHEKRVEKALVVDDEFHLIGMITVKDFQKAERKPNACKDEQGRLRVGAAVGAGAGNEERVDALVAA
GVDVLLIDSSHGHSEGVLQRIRETRAKYPDLQIIGGNVATAAGARALAEAGCSAVKVGIGPGSICTTRIVTGVGVPQITA
VADAVEALEGTGIPVIADGGIRFSGDIAKAIAAGASAVMVGSMLAGTEESPGEIELYQGRSYKSYRGMGSLGAMSKGSSD
RYFQSDNAADKLVPEGIEGRVAYKGRLKEIIHQQMGGLRSCMGLTGCGTIDELRTKAEFVRISGAGIQESHVHDVTITKE
SPNYRLGS
>P9WKI7 1.1.1.205~~~guaB~~~Inosine-5'-monophosphate dehydrogenase~~~COG0516
MSRGMSGLEDSSDLVVSPYVRMGGLTTDPVPTGGDDPHKVAMLGLTFDDVLLLPAASDVVPATADTSSQLTKKIRLKVPL
VSSAMDTVTESRMAIAMARAGGMGVLHRNLPVAEQAGQVEMVKRSEAGMVTDPVTCRPDNTLAQVDALCARFRISGLPVV
DDDGALVGIITNRDMRFEVDQSKQVAEVMTKAPLITAQEGVSASAALGLLRRNKIEKLPVVDGRGRLTGLITVKDFVKTE
QHPLATKDSDGRLLVGAAVGVGGDAWVRAMMLVDAGVDVLVVDTAHAHNRLVLDMVGKLKSEVGDRVEVVGGNVATRSAA
AALVDAGADAVKVGVGPGSICTTRVVAGVGAPQITAILEAVAACRPAGVPVIADGGLQYSGDIAKALAAGASTAMLGSLL
AGTAEAPGELIFVNGKQYKSYRGMGSLGAMRGRGGATSYSKDRYFADDALSEDKLVPEGIEGRVPFRGPLSSVIHQLTGG
LRAAMGYTGSPTIEVLQQAQFVRITPAGLKESHPHDVAMTVEAPNYYAR
>P99106 1.1.1.205~~~guaB~~~Inosine-5'-monophosphate dehydrogenase~~~
MWESKFAKESLTFDDVLLIPAQSDILPKDVDLSVQLSDKVKLNIPVISAGMDTVTESKMAIAMARQGGLGVIHKNMGVEE
QADEVQKVKRSENGVISNPFFLTPEESVYEAEALMGKYRISGVPIVDNKEDRNLVGILTNRDLRFIEDFSIKIVDVMTQE
NLITAPVNTTLEEAEKILQKHKIEKLPLVKDGRLEGLITIKDIEKVIEFPNAAKDEHGRLLVAAAIGISKDTDIRAQKLV
EAGVDVLVIDTAHGHSKGVIDQVKHIKKTYPEITLVAGNVATAEATKDLFEAGADIVKVGIGPGSICTTRVVAGVGVPQI
TAIYDCATEARKHGKAIIADGGIKFSGDIIKALAAGGHAVMLGSLLAGTEESPGATEIFQGRQYKVYRGMGSLGAMEKGS
NDRYFQEDKAPKKFVPEGIEGRTAYKGALQDTIYQLMGGVRAGMGYTGSHDLRELREEAQFTRMGPAGLAESHPHNIQIT
KESPNYSF
>Q5X9A3 1.1.1.205~~~guaB~~~Inosine-5'-monophosphate dehydrogenase~~~
MSNWDTKFLKKGYTFDDVLLIPAESHVLPNEVDLKTKLADNLTLNIPIITAAMDTVTGSKMAIAIARAGGLGVIHKNMSI
TEQAEEVRKVKRSENGVIIDPFFLTPEHKVSEAEELMQRYRISGVPIVETLANRKLVGIITNRDMRFISDYNAPISEHMT
SEHLVTAAVGTDLETAERILHEHRIEKLPLVDNSGRLSGLITIKDIEKVIEFPHAAKDEFGRLLVAAAVGVTSDTFERAE
ALFEAGADAIVIDTAHGHSAGVLRKIAEIRAHFPNRTLIAGNIATAEGARALYDAGVDVVKVGIGPGSICTTRVVAGVGV
PQVTAIYDAAAVAREYGKTIIADGGIKYSGDIVKALAAGGNAVMLGSMFAGTDEAPGETEIYQGRKFKTYRGMGSIAAMK
KGSSDRYFQGSVNEANKLVPEGIEGRVAYKGAASDIVFQMLGGIRSGMGYVGAGDIQELHENAQFVEMSGAGLIESHPHD
VQITNEAPNYSVH
>P0C0H6 1.1.1.205~~~guaB~~~Inosine-5'-monophosphate dehydrogenase~~~COG0516
MSNWDTKFLKKGYTFDDVLLIPAESHVLPNEVDLKTKLADNLTLNIPIITAAMDTVTGSKMAIAIARAGGLGVIHKNMSI
TEQAEEVRKVKRSENGVIIDPFFLTPEHKVSEAEELMQRYRISGVPIVETLANRKLVGIITNRDMRFISDYNAPISEHMT
SEHLVTAAVGTDLETAERILHEHRIEKLPLVDNSGRLSGLITIKDIEKVIEFPHAAKDEFGRLLVAAAVGVTSDTFERAE
ALFEAGADAIVIDTAHGHSAGVLRKIAEIRAHFPNRTLIAGNIATAEGARALYDAGVDVVKVGIGPGSICTTRVVAGVGV
PQVTAIYDAAAVAREYGKTIIADGGIKYSGDIVKALAAGGNAVMLGSMFAGTDEAPGETEIYQGRKFKTYRGMGSIAAMK
KGSSDRYFQGSVNEANKLVPEGIEGRVAYKGAASDIVFQMLGGIRSGMGYVGAGDIQELHENAQFVEMSGAGLIESHPHD
VQITNEAPNYSVH
>Q44052 3.2.1.94~~~imd~~~Isomalto-dextranase~~~
MMNLSRRTLLTTGSAATLAYALGMAGSAQAATAVTARPGVPVTAAPPLRLASRNSVFTRSGAGPRYWNIYGYSFPHNAPI
PENEWKANIDWLAGNFADFGYDIACTDGWIEGSSRTTGNGYITSYNDSWQHDWAYWANYLAARKMKLGVYYNPLWVHRAA
VEDASKTVLGRPDVKIADLVVPGDFFARDIGGNQLYWLDVTKSGAKEYVQGYVRYFKDLGVPYLRIDFLSWYEDGRDANI
GQVNAPHGRANYELALSWINEAAGEDMEVSLVMPHMFQDGSAELANGDLVRINADADKGGWDRLSGMRQNWQDAWPNWAN
PFCGFTGWSHRNGRGQLILDGDFMRASTFASDEERKTMMNLMVAAGSPLAIADTYQQIGNNAWVYTNKEVLQLNADGLVG
KPLYRSATPFSKDPGSRDTERWAGQLPDGSWGVALFNRSDTETVTKTIDFAKDLGLATGGNVRDLWEHRNLGMDSRATAA
LAPHASAIFRVTPPKMHGTTRYPAAFAAWGGGAGFNNNHPGYDGNGFVDGLQAGSGSADPLVTFAVQVPHRAATPSGYRY
ANATDDNTTSKTTTKKANPEKADRSTVDGPVHVSFPGLATWDTWGVAAGTITLDAGLNLVTIGRGATDKGAINLNWIELD
M
>A0A0F5HNH9 1.16.3.1~~~IMEF~~~Ferritin-like protein~~~
MKEELDAFHQIFTTTKEAIERFMAMLTPVIENAEDDHERLYYHHIYEEEEQRLSRLDVLIPLIEKFQDETDEGLFSPSNN
AFNRLLQELNLEKFGLHNFIEHVDLALFSFTDEERQTLLKELRKDAYEGYQYVKEKLAEINARFDHDYADPHAHHDEHRD
HLADMPSAGSSHEEVQPVAHKKKGFTVGSLIQ
>P01077 ~~~smpI~~~Metalloproteinase inhibitor~~~
MVRKRALGLAGSALTLVLGAVGFTAPAQAAPSCPAGSLCTYSGTGLSGARTVIPASDMEKAGTDGVKLPASARSFANGTH
FTLRYGPARKVTCVRFPCYQYATVGKVAPGAQLRSLPSPGATVTVGQDLGD
>D2PPM8 3.2.1.205~~~~~~Isomaltose glucohydrolase~~~COG3387
MTTSARDTGLDSHELARLHELARHSHAVITRHQDAGGAYPAAPTFSAYRGYAWLRDGSFTAEGISRYGDVASAGRFHDWV
DGVLRRRRGQVDDLLAAVDRGEVPSNEGMLPTRFTFDGNDGSDPWWDFQTDGYGMWLWSVVTHAARHGLDLERWRAGIDV
AVDYLLAFWDRPCYDWWEEHVEHRHVSTLGAIHGGLVAVGTCAALRSAPWSAATLQVAARIRSLVSAEGVVDGHLVKWLG
SSAVDGSLPACVVPFGLVPPDDDVAAMTRAAVAKDLDVDGGVHRFAADVFYGGGQWILLSALLGWNLAAAGDTAGALRHL
RWIADQADADGDLPEQVPHHLLHPGSRAEWVARWGTVATPLLWSHGMYLILADELGLLPPAAKDA
>Q06578 ~~~imm1~~~Pyocin-S1 immunity protein~~~
MKSKISEYTEKEFLEFVEDIYTNNKKKFPTEESHIQAVLEFKKLTEHPSGSDLLYYPNENREDSPAGVVKEVKEWRASKG
LPGFKAG
>P04482 ~~~imm~~~Colicin-E2 immunity protein~~~
MELKHSISDYTEAEFLEFVKKICRAEGATEEDDNKLVREFERLTEHPDGSDLIYYPRDDREDSPEGIVKEIKEWRAANGK
SGFKQG
>Q06579 ~~~imm2~~~Pyocin-S2 immunity protein~~~
MKSKISEYTEKEFLEFVKDIYTNNKKKFPTEESHIQAVLEFKKLTEHPSGSDLLYYPNENREDSPAGVVKEVKEWRASKG
LPGFKAG
>P02984 ~~~imm~~~Colicin-E3 immunity protein~~~
MGLKLDLTWFDKSTEDFKGEEYSKDFGDDGSVMESLGVPFKDNVNNGCFDVIAEWVPLLQPYFNHQIDISDNEYFVSFDY
RDGDW
>P13476 ~~~imm~~~Colicin-E5 immunity protein~~~
MKLSPKAAIEVCNEAAKKGLWILGIDGGHWLNPGFRIDSSASWTYDMPEEYKSKIPENNRLAIENIKDDIENGYTAFIIT
LKM
>Q03708 ~~~imm~~~Colicin-E7 immunity protein~~~
MELKNSISDYTEAEFVQLLKEIEKENVAATDDVLDVLLEHFVKITEHPDGTDLIYYPSDNRDDSPEGIVKEIKEWRAANG
KPGFKQG
>P09881 ~~~imm~~~Colicin-E8 immunity protein~~~
MELKNSISDYTETEFKKIIEDIINCEGDEKKQDDNLEHFISVTEHPSGSDLIYYPEGNNDGSPEAVIKEIKEWRAANGKS
GFKQG
>P13479 ~~~imm~~~Colicin-E9 immunity protein~~~
MELKHSISDYTEAEFLQLVTTICNADTSSEEELVKLVTHFEEMTEHPSGSDLIYYPKEGDDDSPSGIVNTVKQWRAANGK
SGFKQG
>P96630 3.4.-.-~~~immA~~~Metallopeptidase ImmA~~~COG2856
MITIYTSKGIKHKVQSVIKTHGTNNVYEICDIQKIYILKNDLGQANGLLQHDKATDQYLIHINENLQHQQFVIAHELGHY
FLHKRLNTFKVVNCSKVLKDKLEHQASLFASELILTDKMLNEALPYIQGFSKEQIAAYFNVPSFVTDYKLSQIGSFSNRI
YSHEISAFG
>P05701 ~~~cai~~~Colicin-A immunity protein~~~
MMNEHSIDTDNRKANNALYLFIIIGLIPLLCIFVVYYKTPDALLLRKIATSTENLPSITSSYNPLMTKVMDIYCKTAPFL
ALILYILTFKIRKLINNTDRNTVLRSCLLSPLVYAAIVYLFCFRNFELTTAGRPVRLMATNDATLLLFYIGLYSIIFFTT
YITLFTPVTAFKLLKKRQ
>P02986 ~~~cim~~~Cloacin immunity protein~~~
MGLKLHIHWFDKKTEEFKGGEYSKDFGDDGSVIESLGMPLKDNINNGWFDVEKPWVSILQPHFKNVIDISKFDYFVSFVY
RDGNW
>P11899 ~~~cdi~~~Colicin-D immunity protein~~~
MNKMAMIDLAKLFLASKITAIEFSERICVERRRLYGVKDLSPNILNCGEELFMAAERFEPDADRANYEIDDNGLKVEVRS
ILEKFKL
>P18002 ~~~cmi~~~Colicin-M immunity protein~~~
MLTLYGYIRNVFLYRMNDRSCGDFMKVISMKFIFILTIIALAAVFFWSEDKGPACYQVSDEQARTFVKNDYLQRMKRWDN
DVQLLGTEIPKITWEKIERSLTDVEDEKTLLVPFKAEGPDGKRMYYGMYHCEEGYVEYAND
>P96631 ~~~immR~~~HTH-type transcriptional regulator ImmR~~~COG1396
MSLGKRLKEARQKAGYTQKEAAEKLNIGNNNLSNYERDYRDPDTDTLLKLSNLYNVSTDYLLGKDEVSKKNETDLLNKTI
NEAIQELKDEDTLLFMNDGEFDEETARLVKKALKNGIKFIDELKKKE
>C8ZZN2 ~~~~~~D-galactonate dehydratase family member EGBG_01401~~~COG4948
MTPTIITDVKSFAIKPDRHNLVVVKVETNKGISGLGCSTFQFRPLAVKTVVDEYLRPLLMGRDANEIEDIWQVMNVNSYW
RNGPITNNAISGIDMALWDIKGQLADMPLYQLLGGKARTAIPAYTHAVADNLDDLYHEIDRFLAAGYRYIRCQLGFYGGN
PSQLQTPEEPISGSYFDQTDYMETTLKMFAAIKEKYGNQFQMLHDVHERLHPNQAIQFAKAAEPYQLFFLEDILPPDQSH
WLTQLRSQSATPIATGELFNNPMEWQELVKNRQIDFMRAHVSQIGGITPALKLAHFCDAMGVRIAWHTPSDISPVGLAVN
THLNIHLHNAAIQETIELPANTQSVFVGSPQPKGGFFYPMEKSGIGITFDEEAAADFPVVYRPHEWTQSRTPDGTLITP
>A0QX86 3.1.3.25~~~impA~~~Inositol-1-monophosphatase ImpA~~~COG0483
MTVVGELDPQKLTALVATAAEILDAASVPFVAGHRADSAVRKQGNDFATEVDLAIERQVVRALTEATGIGVHGEEFGGEP
IDSPLVWVLDPIDGTFNYAAGSPMAAILLGLLADGEPVAGLTWLPFTGEKYSALVGGPLYSDGKPCPPLGSPTLADSIIG
IQTFNIDSRGRFPGRYRVEVLANLSRVCSRVRMHGATGVDLAYVAAGILGGAISFGHHIWDHAAGVALVRAAGGVVTDLT
GAPWTVDSKSVLAAAPGVHEKMLEIVKSTGKPEDYL
>O53907 3.1.3.25~~~impA~~~Probable inositol 1-monophosphatase ImpA~~~COG0483
MHLDSLVAPLVEQASAILDAATALFLVGHRADSAVRKKGNDFATEVDLAIERQVVAALVAATGIEVHGEEFGGPAVDSRW
VWVLDPIDGTINYAAGSPLAAILLGLLHDGVPVAGLTWMPFTDPRYTAVAGGPLIKNGVPQPPLADAELANVLVGVGTFS
ADSRGQFPGRYRLAVLEKLSRVSSRLRMHGSTGIDLVFVADGILGGAISFGGHVWDHAAGVALVRAAGGVVTDLAGQPWT
PASRSALAGPPRVHAQILEILGSIGEPEDY
>Q9I5W4 3.4.24.-~~~impA~~~Immunomodulating metalloprotease~~~
MSLSTTAFPSLQGENMSRSPIPRHRALLAGFCLAGALSAQAATQEEILDAALVSGDSSQLTDSHLVALRLQQQVERIRQT
RTQLLDGLYQNLSQAYDPGAASMWVLPANPDNTLPFLIGDKGRVLASLSLEAGGRGLAYGTNVLTQLSGTNAAHAPLLKR
AVQWLVNGDPGAATAKDFKVSVVGVDKTAALNGLKSAGLQPADAACNALTDASCASTSKLLVLGNGASAASLSATVRARL
QAGLPILFVHTNGWNQSSTGQQILAGLGLQEGPYGGNYWDKDRVPSSRTRTRSVELGGAYGQDPALVQQIVDGSWRTDYD
WSKCTSYVGRTTCDDVPGLSDFSKRVDVLKGALDAYNQKAQNLFALPGTTSLRLWLLWADAVRQNIRYPMDKAADTARFQ
ETFVADAIVGYVREAGAAQKELGSYAGQRQQSMPVSGSEETLTLTLPSAQGFTAIGRMAAPGKRLSIRIEDAGQASLAVG
LNTQRIGSTRLWNTRQYDRPRFLKSPDIKLQANQSVALVSPYGGLLQLVYSGATPGQTVTVKVTGAASQPFLDIQPGEDS
SQAIADFIQALDADKADWLEIRSGSVEVHAKVEKVRGSIDKDYGGDVQRFIRELNEVFIDDAYTLAGFAIPNQAKTPAIQ
QECAARGWDCDSETLHKLPGTQHINVDQYAQCGGGCSGNPYDQTWGLNPRGWGESHELGHNLQVNRLKVYGGRSGEISNQ
IFPLHKDWRVLREFGQNLDDTRVNYRNAYNLIVAGRAEADPLAGVYKRLWEDPGTYALNGERMAFYTQWVHYWADLKNDP
LQGWDIWTLLYLHQRQVDKSDWDANKAALGYGTYAQRPGNSGDASSTDGNDNLLLGLSWLTQRDQRPTFALWGIRTSAAA
QAQVAAYGFAEQPAFFYANNRTNEYSTVKLLDMSQGSPAWPFP
>D5RAW5 ~~~impX~~~Riboflavin transporter ImpX~~~
MDNHIKGALLVCLAATMWGFDGIALTPRLFSLHVPFVVFILHLLPLILMSILFGKEEVKNIKKLQKNDLFFFFCVALFGG
CLGTLCIVKALFLVNFKHLTVVTLLQKLQPIFAIILARLLLKEKLKRAYLFWGFLALLGGYLLTFEFHLPEFVSSDNLLP
ASLYSLLAAFSFGSATVFGKRILKSASFRTALYLRYLMTSCIMFVIVTFTSGFGDFLVATAGNWLIFVIIALTTGSGAIL
LYYFGLRYITAKVATMCELCFPISSVVFDYLINGNVLSPVQIASAILMIISIIKISKLN
>B8H429 ~~~imuA~~~Protein ImuA~~~
MEAGTRTPTPVLSFGEPSIDGCFPGGGLPLGGWHEVTGAGLEDETGAAPAAFVTQLIRGLTDRKGGAVVWVARRADLFAP
GLLGLGFPAARLIQVRARDEAETLSLLEDALSTQGVAAAVAEAEAPDLTAGRRLQLACEKRGGFGVVLHRRPYGGRAGGK
PRLVSGSASFSRWRIAPAPSGPPPDDIGLGPPRWRVELERCRGGRPGGWILQAQEAGHGPHPFRLVSQLADHDVAAAEAG
RRFG
>B8H428 ~~~imuB~~~Protein ImuB~~~
MGLFPGQKAADALALVPDLVTADHDPAADRAALEALCDWCVRFSPAVAIDGDDGLFLDITGTDHLWGGEAAMLVDLVSRL
ARWGVPARAAIADTAGAAWALARFGPDLAIAPPGEQTAAIATLPVAALRLGDAAEAQLPRLGLHRVGQVLALPRAQLAKR
FGLAAVLRLDQALGAASEALTFRRPASPWFDRLAFFEPISAPEDLARVAADALALICARLEAEGRGAKRFEVVFHRLDGR
AFPVRVGLARIGRDAQRLARLVKPKLDMVDPGFGIEVVTVHAFAVEPMAAAQARLDADAAASADETLAPLIDRLVNRLGE
NRVWRADPFESHVPERSVVRVGPLDPPPAARWDPDRPRPVRLFKRPEAIVAIAAELPDYPPRLFTWRGRSHRVRRAEGPE
RIGQEWWRVGVEKGQTGPGKIRDYYRVEDDTGGRFWIFRQGLYGGEDAPKWWIHGLFG
>P27294 ~~~inaA~~~Protein InaA~~~COG3642
MAVSAKYDEFNHWWATEGDWVEEPNYRRNGMSGVQCVERNGKKLYVKRMTHHLFHSVRYPFGRPTIVREVAVIKELERAG
VIVPKIVFGEAVKIEGEWRALLVTEDMAGFISIADWYAQHAVSPYSDEVRQAMLKAVALAFKKMHSINRQHGCCYVRHIY
VKTEGNAEAGFLDLEKSRRRLRRDKAINHDFRQLEKYLEPIPKADWEQVKAYYYAM
>P23382 3.4.24.-~~~ina~~~Immune inhibitor A~~~
MKDAKADTKEKLNQPATGTPAATGPVKGGLNGKVPTSPAKQKAYNGDVRKDKVLVLLVEYADFKHNNIDKEPGYMYSEDF
NKEHYEKMLYGDEPFALEDGSKIETFKQYYEEQSGGSYTVDGTVTKWLTVPGKAADYGADAATGHDNKGPKGPRDLVKDA
LKAAVDSGLDLSQFDQFDQYDVNGDGNQNQPDGLIDHLMIIHAGVGQEAGGGKLGDDAIWSHRWTVGPKPFAIEGTQAKV
PYWGGKMAAFDYTIEPEDGAVGVNAHEYGHDLGLPDEYDTDYTGHGEPIQAWSVMSGGTWAGKIAGTTPTSFSPQNKEFF
QKTIGGNWANIVEVDYEKLNKGIGLATYLDQSVTKSNRPGMIRVNLPDKDVKTIRPAFGKQYYYSTKGDNLHTTLETPLF
DLTNATNAKFDFKSLYEIEAEYDFLEVHAVTEDGQKTLIERLGEKANSGNAEATNGKWIDKSYDLSQFKGKKVKLTFEYI
TDGGLALNGGLLDNASLTVDGKVTFSDDAEGTPQLKLDGFVVSSGTEKKKHNYYVEWRNHTGSDSALKFARGPEYNSGMV
VWYADSAYADNWVGLHPGHGFLGVVDSHPEAIVGTLNGKPTIDSSTRFQIADAAFSFDKTPAWKVVSPTRGTYTYDGLAG
VAKFDDSKTYINQQIPDAGRILPKLGLKFEVVGQADDNSAGAVRLYR
>H2VFV1 ~~~incA~~~Inclusion membrane protein A~~~
MTVSTDNTSPVISRASSPTFGDHGKDFDNNKIIPISIEAPTSSAAAVGAKTAIEPEGRSPLLQRICYLVKIIAAIALFVV
GIAALVCLYLGSVISTPSLILMLAIMLVSFVIVITAIRDGTPSQVVRHMKQQIQQFGEENTRLHTAVENLKAVNVELSEQ
INQLKQLHTRLSDFGDRLEANTGDFTALIADFQLSLEEFKSVGTKVETMLSPFEKLAQSLKETFSQEAVQAMMSSVTELR
TNLNALKELITENKTVIEQLKADAQLREEQVRFLEKRKQELEEACSTLSHSIATLQESTTLLKDSTTNLHAVESRLIGVM
VQDGAESSTVEEASQDDSAQPQDENQSDAGEHKDS
>A0A0H3MD02 ~~~incA~~~Inclusion membrane protein A~~~
MTTPTLIVTPPSPPAPSYSANRVPQPSLMDKIKKIAAIASLILIGTIGFLALLGHLVGFLIAPQITIVLLALFITSLAGN
ALYLQKTANLHLYQDLQREVGSLKEINFMLSVLQKEFLHLSKEFATTSKDLSAVSQDFYSCLQGFRDNYKGFESLLDEYK
NSTEEMRKLFSQEIIADLKGSVASLREEIRFLTPLAEEVRRLAHNQESLTAAIEELKTIRDSLRDEIGQLSQLSKTLTSQ
IALQRKESSDLCSQIRETLSSPRKSASPSTKSS
>P0CI27 ~~~incA~~~Inclusion membrane protein A~~~
MTTPTLIVTPPSPPAPSYSANRVPQPSLMDKIKKIAAIASLILIGTIGFLALLGHLVGFLIAPQITIVLLALFIISLAGN
ALYLQKTANLHLYQDLQREVGSLKEINFMLSVLQKEFLHLSKEFATTSKDLSAVSQDFYSCLQGFRDNYKGFESLLDEYK
NSTEEMRKLFSQEIIADLKGSVASLREEIRFLTPLAEEVRRLAHNQQSLTVVIEELKTIRDSLRDEIGQLSQLSKTLTSQ
IALQRKESSDLCSQIRETLSSPRKSASPSTKSS
>B0B9M3 ~~~incD~~~Inclusion membrane protein D~~~
MTKVYAHSIQQERVVDRIALLERCLDLSNSLPTAKRLAAVAVATILAIALLVVAGLLFSGVLCSPVSVVAASLFFGVGAF
LLGGALVGGVLTTEAVTRERLHRSQTLMWNNLCCKTAEVEQKISTASANAKSNDKLENSVSKKGAS
>B0B9M4 ~~~incE~~~Inclusion membrane protein E~~~
MECVKQLCRNHLRLDNLTDPVRSVLTKGTTAEKVQLAACCLGVVCSIICLALGIAAAAVGVSCGGFALGLGIIAILLGIV
LFATSALDVLENHGLVGCPFKLPCKSSPANEPAVQFFKGKNGSADQVILVTQ
>P0DJI4 ~~~incE~~~Inclusion membrane protein E~~~
MECVKQLCRNHLCLDSLTGPVRSVLTQGTTAEKVQLVVSCLGVVCSIICLALGIAAAAVGVSCSGFAIGLGVIAILLGIV
LFAISALDVLEDHGLVGCPFKLPCKSSPANEPTVQFFKGKNGSADKVILVTQ
>B0B9M5 ~~~incF~~~Inclusion membrane protein F~~~
MGDVMIQSVKTESGLVDGHHGSCDSLGCVVGALAKVAKLVVALAALVLNGALCVLSLVALCVGATPVGPLAVLVATTLAS
FLCVAYVLFIAAKDRGWIASTNKC
>A0A0H3MGR4 ~~~incG~~~Inclusion membrane protein G~~~
MICCDKVLSSVQSMPVIDKCSVTKCLQTAKQAVVLALSLFAVFASGSLSILSAAVLFSGTAAVLPYLLILTTALLGCVYA
VIVLLRSLSAVVQSCKKRSPEEIEGAARPSDQQESGGRLSEESASPQASPTSSTLRLESAFRSIGDSVSGAFDDINKDNS
RSRSRSF
>P0DPS6 ~~~incG~~~Inclusion membrane protein G~~~
MICCDKVLSSVQSMPVIDKCSVTKCLQTAKQAAVLALSLFAVFASGSLSILSAAVLFSGTAAVLPYLLILTTALLGFVCA
VIVLLRNLSAVVQSCKKRSPEEIEGAARPSDQQESGGRLSEESASPQASPTSSTFGLESALRSIGDSVSGAFDDINKDNS
RSRSHSF
>A0A0D4BS77 2.1.1.47~~~ind1~~~Indolepyruvate C-methyltransferase~~~
MTRTDFAQSAVASIFTGAIASHAAVLADDLGLFDALAKGKLRNRDLDRSPWLRNRIRISGALEALCRVGAVQRCTDGYEL
TDVGTELAGQVPVFRLWLGGYASVLAGQISIGADPATGVHGGIVAESSGAIGARYLDETIVNLLESLRPEGRICDIGCGT
GARLLRVCRRVNQPGIGYDLSAKAVEAARETVDEARRIGVDIDVRQGDATALTQDHPDVDIVTQAFMTHHIAPDEYCAAV
LRSYRSRFPRARYLVIFDTVPSQDSEEPEIFAPGFDYIHALQNMEPRSRGAARRMFTEAGYICREEVELAVPNSYAWVLE
MRDREGPAS
>A0A0D4BSN8 1.1.1.397~~~ind2~~~Beta-methylindole-3-pyruvate reductase~~~
MKLDDKRILIIGAGEVGTAVAEDLVNRSDPTEIIIHTSRQQTMDMRVGHLKEMAGPRTLLTGSWGDIFAPYELTHRSRSE
INDRNVRLALAEFFLQPSGEAQLRRTTIYELISRHRPHIVIDAVNSASVCTYTEDPHQTCGELLDLARGTGGPRTAEAPA
ELPAVTPDIADVATDALLSLSTPILHRYVDSLRRAMADFQVERFIKVSTTGLGGMGYNCPYTHGSVTEFGLSDALVGKIG
SAGVLHQLLWNLHHTAGCDVRLVIPAALIGWESVRHGAYTSRGRPVALQDCSRPLPLHLDRPLGEHAAASSVAEPAAEDE
PSAEMVHVPAGDNSTYSRAEMSLSTALGQFESVTREEVAAAVLDTLLGSTRFDLFTAMDTASLQSSYLAAQMRTSTLTSM
RQLEKAYDRPSIVSGNLGPTISKDLLELHVLCTAAGSLEQARTMSTTVLASSASALVREDVYLRQQALSIGLAVLLPDDQ
WLAGPRLSVPSRIDPEAKVTRADIDDWSRQGWVDLRPARILHWQENLRRIEQDASAGKTAFALDDTAYDVGEVLAYHYKL
TGQARRIKGL
>A0A0D4BSP3 2.1.1.328~~~ind7~~~N-demethylindolmycin N-methyltransferase~~~
MHTDWETSESAEDYSRNTAAAQWEPMGYPAVFRSLALATTDSDNAPPILDYGCGPGFVDRHVAEKYGRRVIAVDISSSMI
DLARSQHSHPLVTYRHVPDSQLDFLGDKEIGGCMSCFVLMQMADSDTQVEICRRIRRTLAPGAMLAVLNTHPDSVGIQFA
TLRNGEPDRVYQPGDPMTTVLTTDKGVLRLQDYYWRVTDYVHALEAAGFHEVTVEHLPPPPADPTPHPQFLLVRGTA
>P0AEW6 2.7.1.73~~~gsk~~~Guanosine-inosine kinase~~~COG0524
MKFPGKRKSKHYFPVNARDPLLQQFQPENETSAAWVVGIDQTLVDIEAKVDDEFIERYGLSAGHSLVIEDDVAEALYQEL
KQKNLITHQFAGGTIGNTMHNYSVLADDRSVLLGVMCSNIEIGSYAYRYLCNTSSRTDLNYLQGVDGPIGRCFTLIGESG
ERTFAISPGHMNQLRAESIPEDVIAGASALVLTSYLVRCKPGEPMPEATMKAIEYAKKYNVPVVLTLGTKFVIAENPQWW
QQFLKDHVSILAMNEDEAEALTGESDPLLASDKALDWVDLVLCTAGPIGLYMAGFTEDEAKRKTQHPLLPGAIAEFNQYE
FSRAMRHKDCQNPLRVYSHIAPYMGGPEKIMNTNGAGDGALAALLHDITANSYHRSNVPNSSKHKFTWLTYSSLAQVCKY
ANRVSYQVLNQHSPRLTRGLPEREDSLEESYWDR
>O24767 2.7.1.73~~~gsk~~~Guanosine-inosine kinase~~~
MNKIAVIGKVFVDIKGTSFAPLHKDAKNVGDITFSNGGTGRNVAQNLAVLGNEVRFISTVTNDQIGVGVLDELKSYGANV
DHVEMLEDHGMGMWLAVMDNEGDLQTSISKQPDAKLLEEAILRQSIYALDGVDAVAIDLDLSVTVLERLIHLCRKMELPL
FGVCGHLSVIERNRHLLQGFTGFICSREEAEILSDLSIVTVEDAIHVANELAKKGAPFTVVTMSELGAVYVDRRTATSGH
VGTKKVKVVDSTGAGDSFFSAVLSELTQEKSAEEALKLGMKVAAEVIASTENGLVPEMLDALQ
>P0A5Y7 1.3.1.9~~~inhA~~~Enoyl-[acyl-carrier-protein] reductase [NADH]~~~
MTGLLDGKRILVSGIITDSSIAFHIARVAQEQGAQLVLTGFDRLRLIQRITDRLPAKAPLLELDVQNEEHLASLAGRVTE
AIGAGNKLDGVVHSIGFMPQTGMGINPFFDAPYADVSKGIHISAYSYASMAKALLPIMNPGGSIVGMDFDPSRAMPAYNW
MTVAKSALESVNRFVAREAGKYGVRSNLVAAGPIRTLAMSAIVGGALGEEAGAQIQLLEEGWDQRAPIGWNMKDATPVAK
TVCALLSDWLPATTGDIIYADGGAHTQLL
>P42829 1.3.1.9~~~inhA~~~Enoyl-[acyl-carrier-protein] reductase [NADH]~~~COG0623
MTGLLEGKRILVTGIITDSSIAFHIAKVAQEAGAELVLTGFDRLKLVKRIADRLPKPAPLLELDVQNEEHLSTLADRITA
EIGEGNKIDGVVHSIGFMPQSGMGINPFFDAPYEDVSKGIHISAYSYASLAKAVLPIMNPGGGIVGMDFDPTRAMPAYNW
MTVAKSALESVNRFVAREAGKVGVRSNLVAAGPIRTLAMSAIVGGALGDEAGQQMQLLEEGWDQRAPLGWNMKDPTPVAK
TVCALLSDWLPATTGTVIYADGGASTQLL
>P9WGR1 1.3.1.9~~~inhA~~~Enoyl-[acyl-carrier-protein] reductase [NADH]~~~COG0623
MTGLLDGKRILVSGIITDSSIAFHIARVAQEQGAQLVLTGFDRLRLIQRITDRLPAKAPLLELDVQNEEHLASLAGRVTE
AIGAGNKLDGVVHSIGFMPQTGMGINPFFDAPYADVSKGIHISAYSYASMAKALLPIMNPGGSIVGMDFDPSRAMPAYNW
MTVAKSALESVNRFVAREAGKYGVRSNLVAAGPIRTLAMSAIVGGALGEEAGAQIQLLEEGWDQRAPIGWNMKDATPVAK
TVCALLSDWLPATTGDIIYADGGAHTQLL
>Q8G9F9 4.2.1.103~~~inhA~~~Isonitrile hydratase~~~
MALQIGFLLFPQVQQLDLTGPYDVLASLPDVQVHLVWKDLVPVTSSTGLQLKPTTTFEDCPVLDVICVPGGAGVGPLMED
EQTLDFIRSQAAQARYVTSVCTGSLVLGAAGLLQGKRATTHWAYHDLLPTLGAIPVKDRVVRDGNLFTGGGITAGIDFAL
TLAQELVGVDTAQLVQLQLEYAPAPPFDSGSPDTAPSAVVDEARKRAAPSLKLRTEITERAAAKLNLR
>P18958 ~~~inh~~~Proteinase inhibitor~~~
MKQLIIATLLSALSGGCMASSLRLPSAAELSGQWVLSGAEQHCDIRLNTDVLDGTTWKLAGDTACLQKLLPEAPVGWRPT
PDGLTLTQADGSAVAFFSRNRDRYEHKLVDGSVRTLKKKA
>Q03026 ~~~inh~~~Proteinase inhibitor~~~
MSASAKLSRMVCLLCGFFSTGISMASSLILLSASDLAGQWTLQQDEAPAICHLELRDSEVAEASGYDLGGDTACLTRWLP
SEPRAWRPTPAGIALLERGGLTLMLLGRQGEGDYRVQKGDGGQLVLRRATP
>Q9KGS7 ~~~inh~~~Alkaline proteinase inhibitor~~~
MPSSVQATAGLLATLMMFCGEVAMARSLLLAEPSQLAGQWQAVLSSPQDNAQTQAMQDKPSNSCLVELKVDQTLGGQTDC
LGQWLGDEPVRWFTEPDGLSLIGKQDSRTHLGLRQGDHYQMTLKSGLILRLERNKSQSAH
>Q54478 ~~~inh~~~Alkaline proteinase inhibitor~~~
MKGTLTRAALAAGGMMVTSAVMAGSLALPTAQSLAGQWEVADSERQCQIEFLANEQSETNGYQLVDRQRCLQSVFAAEVV
AGAGPDGIALLQADGSTLAFFSRDGDLYRNQLGAGDALTLKALA
>P9WJ99 ~~~iniA~~~Isoniazid-induced protein IniA~~~COG0699
MVPAGLCAYRDLRRKRARKWGDTVTQPDDPRRVGVIVELIDHTIAIAKLNERGDLVQRLTRARQRITDPQVRVVIAGLLK
QGKSQLLNSLLNLPAARVGDDEATVVITVVSYSAQPSARLVLAAGPDGTTAAVDIPVDDISTDVRRAPHAGGREVLRVEV
GAPSPLLRGGLAFIDTPGVGGLGQPHLSATLGLLPEADAVLVVSDTSQEFTEPEMWFVRQAHQICPVGAVVATKTDLYPR
WREIVNANAAHLQRARVPMPIIAVSSLLRSHAVTLNDKELNEESNFPAIVKFLSEQVLSRATERVRAGVLGEIRSATEQL
AVSLGSELSVVNDPNLRDRLASDLERRKREAQQAVQQTALWQQVLGDGFNDLTADVDHDLRTRFRTVTEDAERQIDSCDP
TAHWAEIGNDVENAIATAVGDNFVWAYQRSEALADDVARSFADAGLDSVLSAELSPHVMGTDFGRLKALGRMESKPLRRG
HKMIIGMRGSYGGVVMIGMLSSVVGLGLFNPLSVGAGLILGRMAYKEDKQNRLLRVRSEAKANVRRFVDDISFVVSKQSR
DRLKMIQRLLRDHYREIAEEITRSLTESLQATIAAAQVAETERDNRIRELQRQLGILSQVNDNLAGLEPTLTPRASLGRA
>P9WJ97 ~~~iniB~~~Isoniazid-induced protein IniB~~~
MTSLIDYILSLFRSEDAARSFVAAPGRAMTSAGLIDIAPHQISSVAANVVPGLNLGAGDPMSGLRQAVAARHGFAQDVAN
VGFAGDAGAGVASVITTDVGAGLASGLGAGFLGQGGLALAASSGGFGGQVGLAAQVGLGFTAVIEAEVGAQVGAGLGIGT
GLGAQAGMGFGGGVGLGLGGQAGGVIGGSAAGAIGAGVGGRLGGNGQIGVAGQGAVGAGVGAGVGGQAGIASQIGVSAGG
GLGGVGNVSGLTGVSSNAVLASNASGQAGLIASEGAALNGAAMPHLSGPLAGVGVGGQAGAAGGAGLGFGAVGHPTPQPA
ALGAAGVVAKTEAAAGVVGGVGGATAAGVGGAHGDILGHEGAALGSVDTVNAGVTPVEHGLVLPSGPLIHGGTGGYGGMN
PPVTDAPAPQVPARAQPMTTAAEHTPAVTQPQHTPVEPPVHDKPPSHSVFDVGHEPPVTHTPPAPIELPSYGLFGLPGF
>P9WJ95 ~~~iniC~~~Isoniazid-induced protein IniC~~~COG0699
MSTSDRVRAILHATIQAYRGAPAYRQRGDVFCQLDRIGARLAEPLRIALAGTLKAGKSTLVNALVGDDIAPTDATEATRI
VTWFRHGPTPRVTANHRGGRRANVPITRRGGLSFDLRRINPAELIDLEVEWPAEELIDATIVDTPGTSSLACDASERTLR
LLVPADGVPRVDAVVFLLRTLNAADVALLKQIGGLVGGSVGALGIIGVASRADEIGAGRIDAMLSANDVAKRFTRELNQM
GICQAVVPVSGLLALTARTLRQTEFIALRKLAGAERTELNRALLSVDRFVRRDSPLPVDAGIRAQLLERFGMFGIRMSIA
VLAAGVTDSTGLAAELLERSGLVALRNVIDQQFAQRSDMLKAHTALVSLRRFVQTHPVPATPYVIADIDPLLADTHAFEE
LRMLSLLPSRATTLNDDEIASLRRIIGGSGTSAAARLGLDPANSREAPRAALAAAQHWRRRAAHPLNDPFTTRACRAAVR
SAEAMVAEFSARR
>P0DJM0 ~~~inlA~~~Internalin A~~~COG4886
MRKKRYVWLKSILVAILVFGSGVWINTSNGTNAQAATITQDTPINQIFTDTALAEKMKTVLGKTNVTDTVSQTDLDQVTT
LQADRLGIKSIDGVEYLNNLTQINFSNNQLTDITPLKNLTKLVDILMNNNQIADITPLANLTNLTGLTLFNNQITDIDPL
KNLTNLNRLELSSNTISDISALSGLTSLQQLSFGNQVTDLKPLANLTTLERLDISSNKVSDISVLAKLTNLESLIATNNQ
ISDITPLGILTNLDELSLNGNQLKDIGTLASLTNLTDLDLANNQISNLAPLSGLTKLTELKLGANQISNISPLAGLTALT
NLELNENQLEDISPISNLKNLTYLTLYFNNISDISPVSSLTKLQRLFFYNNKVSDVSSLANLTNINWLSAGHNQISDLTP
LANLTRITQLGLNDQAWTNAPVNYKANVSIPNTVKNVTGALIAPATISDGGSYTEPDITWNLPSYTNEVSYTFSQPVTIG
KGTTTFSGTVTQPLKAIFNVKFHVDGKETTKEVEAGNLLTEPAKPVKEGHTFVGWFDAQTGGTKWNFSTDKMPTNDINLY
AQFSINSYTATFDNDGVTTSQTVDYQGLLQEPTAPTKEGYTFKGWYDAKTGGDKWDFATSKMPAKNITLYAQYSANSYTA
TFDVDGKSTTQAVDYQGLLKEPKAPTKAGYTFKGWYDEKTDGKKWDFATDKMPANDITLYAQFTKNPVAPPTTGGNTPPT
TNNGGNTTPPSANIPGSDTSNTSTGNSASTTSTMNAYDPYNSKEASLPTTGDSDNALYLLLGLLAVGTAMALTKKARASK
>P0DQD3 ~~~inlB~~~Internalin B~~~
MKEKHNPRRKYCLISGLAIIFSLWIIIGNGAKVQAETITVSTPIKQIFPDDAFAETIKDNLKKKSVTDAVTQNELNSIDQ
IIANNSDIKSVQGIQYLPNVTKLFLNGNKLTDIKPLTNLKNLGWLFLDENKIKDLSSLKDLKKLKSLSLEHNGISDINGL
VHLPQLESLYLGNNKITDITVLSRLTKLDTLSLEDNQISDIVPLAGLTKLQNLYLSKNHISDLRALAGLKNLDVLELFSQ
ECLNKPINHQSNLVVPNTVKNTDGSLVTPEIISDDGDYEKPNVKWHLPEFTNEVSFIFYQPVTIGKAKARFHGRVTQPLK
EVYTVSYDVDGTVIKTKVEAGTRITAPKPPTKQGYVFKGWYTEKNGGHEWNFNTDYMSGNDFTLYAVFKAETTEKTVNLT
RYVKYIRGNAGIYKLPREDNSLKQGTLASHRCKALTVDREARNGGKLWYRLKNIGWTKAENLSLDRYDKMEYDKGVTAYA
RVRNASGNSVWTKPYNTAGAKHVNKLSVYQGKNMRILREAKTPITTWYQFSIGGKVIGWVDTRALNTFYKQSMEKPTRLT
RYVSANKAGESYYKVPVADNPVKRGTLAKYKNQKLIVDCQATIEGQLWYRIRTSSTFIGWTKAANLRAQK
>P0DQD2 ~~~inlB~~~Internalin B~~~
MKEKHNPRRKYCLISGLAIIFSLWIIIGNGAKVQAETITVPTPIKQIFSDDAFAETIKDNLKKKSVTDAVTQNELNSIDQ
IIANNSDIKSVQGIQYLPNVTKLFLNGNKLTDIKPLANLKNLGWLFLDENKVKDLSSLKDLKKLKSLSLEHNGISDINGL
VHLPQLESLYLGNNKITDITVLSRLTKLDTLSLEDNQISDIVPLAGLTKLQNLYLSKNHISDLRALAGLKNLDVLELFSQ
ECLNKPINHQSNLVVPNTVKNTDGSLVTPEIISDDGDYEKPNVKWHLPEFTNEVSFIFYQPVTIGKAKARFHGRVTQPLK
EVYTVSYDVDGTVIKTKVEAGTRITAPKPPTKQGYVFKGWYTEKNGGHEWNFNTDYMSGNDFTLYAVFKAETTEKAVNLT
RYVKYIRGNAGIYKLPREDNSLKQGTLASHRCKALTVDREARNGGKLWYRLKNIGWTKAENLSLDRYDKMEYDKGVTAYA
RVRNASGNSVWTKPYNTAGAKHVNKLSVYQGKNMRILREAKTPITTWYQFSIGGKVIGWVDTRALNTFYKQSMEKPTRLT
RYVSANKAGESYYKVPVADNPVKRGTLAKYKNQKLIVDCQATIEGQLWYRIRTSSTFIGWTKAANLRAQK
>P71451 ~~~inlC~~~Internalin C~~~
MLKKNNWLQNAVIAMLVLIVGLCINMGSGTKVQAESIQRPTPINQVFPDPGLANAVKQNLGKQSVTDLVSQKELSGVQNF
NGDNSNIQSLAGMQFFTNLKELHLSHNQISDLSPLKDLTKLEELSVNRNRLKNLNGIPSACLSRLFLDNNELRDTDSLIH
LKNLEILSIRNNKLKSIVMLGFLSKLEVLDLHGNEITNTGGLTRLKKVNWIDLTGQKCVNEPVKYQPELYITNTVKDPDG
RWISPYYISNGGSYVDGCVLWELPVYTDEVSYKFSEYINVGETEAIFDGTVTQPIKN
>Q7AP87 ~~~inlH~~~Internalin H~~~COG4886
MKKRWNSVFKLVLMVTAILGLSLYVTTSQGVEVRAESITQPTAINVIFPDPALANAIKIAAGKSNVTDTVTQADLDGITT
LSAFGTGVTTIEGVQYLNNLIGLELKDNQITDLTPLKNLTKITELELSGNPLKNVSAIAGLQSIKTLDLTSTQITDVTPL
AGLSNLQVLYLDLNQITNISPLAGLTNLQYLSIGNAQVSDLTPLANLSKLTTLKADDNKISDISPLASLPNLIEVHLKNN
QISDVSPLANTSNLFIVTLTNQTITNQPVFYQNNLVVPNVVKGPSGAPIAPATISDNGTYASPNLTWNLTSFINNVSYTF
NQSVTFKNTTVPFSGTVTQPLTEAYTAVFDVDGKQTSVTVGANELIKEPTAPTKEGYTFTGWYDAKTGGTKWDFATDKMP
AEDITLYAQFTINSYTATFDIDGKLTTQKVTYQSLLEEPVAPTKDGYTFTGWYDAKTGGTKWDFATGKMPAGNITLYAQF
TKNDNPNPDDPTTNTPTGNGDGTSNPSNSGGNTTLPTAGDENTMLPIFIGVFLLGTATLILRKTIKVK
>Q8YA32 ~~~inlI~~~Internalin I~~~COG4886
MKKKFSIVIISVLLLGYLAPFDTLLVGADETTVSEDTAVKTAEADSATEGIESETGSDDETAEEPKEAKEAEASKETTEK
EEKAKTEEPASNIKTEINTDKSQLKQTSLKAAVPAGSTYNSLFPDDNLAKKLAVIITGNAAATGNESVDSAALLAISQLD
LSGETGNDPTDISNIEGLQYLENLTSLNLSENNISDLAPLKDLVNLVSLNLSSNRTLVNLSGVEDLVNLQELNVSANKAL
EDISQVASLPVLKEISAQGCNIKTLELKNPAGAVLPELETFYLQENDLTNLTSLAKLPKLKNLYIKGNASLKSLETLNGA
TKLQLIDASNCTDLETLGDISGLSELEMIQLSGCSKLKEITSLKNLPNLVNITADSCAIEDLGTLNNLPKLQTLVLSDNE
NLTNITAITDLPQLKTLTLDGCGITSIGTLDNLPKLEKLDLKENQITSISEITDLPRLSYLDVSVNNLTTIGDLKKLPLL
EWLNVSSNRLSDVSTLTNFPSLNYINISNNVIRTVGKMTELPSLKEFYAQNNSISDISMIHDMPNLRKVDASNNLITNIG
TFDNLPKLQSLDVHSNRITSTSVIHDLPSLETFNAQTNLITNIGTMDNLPDLTYVNLSFNRIPSLAPIGDLPNLETLIVS
DNNSYLRSLGTMDGVPKLRILDLQNNYLNYTGTEGNLSSLSDLTNLTELNLRNNVYIDDISGLSTLSRLIYLNLDSNKIE
DISALSNLTNLQELTLENNKIENISALSDLENLNKLVVSKNKIIDISPVANMVNRGAIVTASNQTYTLPTVLSYQSSFTI
DNPVIWYDGTLLAPSSIGNSGNYKDGKITWTNMTATSSSTLFNFNRLKDGLTFSGTVTQPYKSAAKVTADAEQTYTIGDT
ISEEQFLKDVNAKSSDGAPVTSDFATVVDLNTFGEYEVTLTSEKDGIQGDSCKVIVKVLHGAPVISADQTISYDKHATIT
EKQFLEDIHASTDLDTAITTNFSTAVNLNKGGDYTVALNSENEDGVKAETVYVTVTVNKDPAPIISAKTEITYDKFSKKT
EAAFLDDIDADTNDGSIVTSNFATAVNLDKAGDYTVTLNSINSDGVAGTPTAIIVHVEKEKIATISTNTAQQYEKYAKIN
ETQFLKDVHASINASPTTAVLESDFETVVKLDVPGTYTVTITATNEDGGVSAPKEVSVIVRKIPAPEITADKEITYPKFD
EVSEAEFLNDIHATISDKNVAITSNFSTDVNLNKAGDYTVTLNATNEDGVKATPVEVIVHVQQGERPVITADATISYDKF
ANITEAKFLEDIHATSSDGQSSTVITSNFQTATNFKTAMSYTVTLNAVNEDGISAEPVAVTVTINKEPAAALKADAEVSY
AKNEAVTESDFFKDVHLEGTEAPSTAKATSNFDSVVDRSKTGDYTVTINATNEDGAVSTPIEVIVHIEAESAPVITANAE
VKYNKHEQTDERRFLYDSEAKIDEANVEIKTDFAEKVDINKVGTYTVTLTATNEDGQAANPVEVSVIVSDAAAEKVNVKY
VDENGSEISAAETLTGNLDETFSIDAKSIAGYKCDATLSGVFSTVEQTVVFHYKAIKPGVVTIKYEDTNGKAVDEDKQIT
GEVGDDFEAEAQTVSGYSCRAIASGKITEEPQTITFTYSTATPSKKSGEITVQYVDESGKKLADSKKVTGNIDDSYSVEA
KAIEGYSVVGDDSAKGVFTEKSQTVTFKYKKNTQVSKDDPKVKGKTNQPSSTDTKLKVDNNSLPATGDTENMILAVLIGF
NMLIVASIFLFRKPKTNQ
>Q8Y3L4 ~~~inlJ~~~Internalin J~~~COG4886
MKTTKIVIASLVSLTMVSNPLLTFAATNDVIDNTTEITTDKETSSTQPTIKNTLKAGQTQSFNDWFPDDNFASEVAAAFE
MQATDTISEEQLATLTSLDCHNSSITDMTGIEKLTGLTKLICTSNNITTLDLSQNTNLTYLACDSNKLTNLDVTPLTKLT
YLNCDTNKLTKLDVSQNPLLTYLNCARNTLTEIDVSHNTQLTELDCHLNKKITKLDVTPQTQLTTLDCSFNKITELDVSQ
NKLLNRLNCDTNNITKLDLNQNIQLTFLDCSSNKLTEIDVTPLTQLTYFDCSVNPLTELDVSTLSKLTTLHCIQTDLLEI
DLTHNTQLIYFQAEGCRKIKELDVTHNTQLYLLDCQAAGITELDLSQNPKLVYLYLNNTELTELDVSHNTKLKSLSCVNA
HIQDFSSVGKIPALNNNFEAEGQTITMPKETLTNNSLTIAVSPDLLDQFGNPMNIEPGDGGVYDQATNTITWENLSTDNP
AVTYTFTSENGAIVGTVTTPFEAPQPIKGEDVTVHYLDDKGEKLADDEVLSGNLDDPYTSSAKDIPDYTLTTTPDNATGT
FTTTSQSVTYVYTKNIVAAEPVTVNYVDDTGKTLSPSEILNGNVGDTYNATAKQIDGYTLSAEPTNATGQFTSSAQTVNY
IYTKNPAPEKGVVEIHYVDEDNKQLNSTTEISGTIGDNYTTEPKTIEGYTLTTTPGNATGTFTTGSQTVTYVYTKNIEAA
EPITVNYVDANGKTLAPSETLNGNVGDTYKATAKQIDGYTLSAEPTNATGQFTSSAQTVNYIYTKNTNTDQPLPTKKPTN
TTPTKPSNLKTTEVKKASDTLPKTGDSAPWKSALLGVFLSSTALVIWKKKK
>Q10896 2.3.1.-~~~nrp~~~Isonitrile lipopeptide synthase~~~COG1020
MHRVRLSRSQRNLYNGVRQDNNPALYLIGKSYRFRRLELARFLAALHATVLDNPVQLCVLENSGADYPDLVPRLRFGDIV
RVGSADEHLQSTWCSGILGKPLVRHTVHTDPNGYVTGLDVHTHHILLDGGATGTIEADLARYLTTDPAGETPSVGAGLAK
LREAHRRETAKVEESRGRLSAVVQRELADEAYHGGHGHSVSDAPGTAAKGVLHESATICGNAFDAILTLSEAQRVPLNVL
VAAAAVAVDASLRQNTETLLVHTVDNRFGDSDLNVATCLVNSVAQTVRFPPFASVSDVVRTLDRGYVKAVRRRWLREEHY
RRMYLAINRTSHVEALTLNFIREPCAPGLRPFLSEVPIATDIGPVEGMTVASVLDEEQRTLNLAIWNRADLPACKTHPKV
AERIAAALESMAAMWDRPIAMIVNDWFGIGPDGTRCQGDWPARQPSTPAWFLDSARGVHQFLGRRRFVYPWVAWLVQRGA
APGDVLVFTDDDTDKTIDLLIACHLAGCGYSVCDTADEISVRTNAITEHGDGILVTVVDVAATQLAVVGHDELRKVVDER
VTQVTHDALLATKTAYIMPTSGTTGQPKLVRISHGSLAVFCDAISRAYGWGAHDTVLQCAPLTSDISVEEIFGGAACGAR
LVRSAAMKTGDLAALVDDLVARETTIVDLPTAVWQLLCADGDAIDAIGRSRLRQIVIGGEAIRCSAVDKWLESAASQGIS
LLSSYGPTEATVVATFLPIVCDQTTMDGALLRLGRPILPNTVFLAFGEVVIVGDLVADGYLGIDGDGFGTVTAADGSRRR
AFATGDRVTVDAEGFPVFSGRKDAVVKISGKRVDIAEVTRRIAEDPAVSDVAVELHSGSLGVWFKSQRTREGEQDAAAAT
RIRLVLVSLGVSSFFVVGVPNIPRKPNGKIDSDNLPRLPQWSAAGLNTAETGQRAAGLSQIWSRQLGRAIGPDSSLLGEG
IGSLDLIRILPETRRYLGWRLSLLDLIGADTAANLADYAPTPDAPTGEDRFRPLVAAQRPAAIPLSFAQRRLWFLDQLQR
PAPVYNMAVALRLRGYLDTEALGAAVADVVGRHESLRTVFPAVDGVPRQLVIEARRADLGCDIVDATAWPADRLQRAIEE
AARHSFDLATEIPLRTWLFRIADDEHVLVAVAHHIAADGWSVAPLTADLSAAYASRCAGRAPDWAPLPVQYVDYTLWQRE
ILGDLDDSDSPIAAQLAYWENALAGMPERLRLPTARPYPPVADQRGASLVVDWPASVQQQVRRIARQHNATSFMVVAAGL
AVLLSKLSGSPDVAVGFPIAGRSDPALDNLVGFFVNTLVLRVNLAGDPSFAELLGQVRARSLAAYENQDVPFEVLVDRLK
PTRALTHHPLIQVMLAWQDNPVGQLNLGDLQATPMPIDTRTARMDLVFSLAERFSEGSEPAGIGGAVEYRTDVFEAQAID
VLIERLRKVLVAVAAAPERTVSSIDALDGTERARLDEWGNRAVLTAPAPTPVSIPQMLAAQVARIPEAEAVCCGDASMTY
RELDEASNRLAHRLAGCGAGPGECVALLFERCAPAVVAMVAVLKTGAAYLPIDPANPPPRVAFMLGDAVPVAAVTTAGLR
SRLAGHDLPIIDVVDALAAYPGTPPPMPAAVNLAYILYTSGTTGEPKGVGITHRNVTRLFASLPARLSAAQVWSQCHSYG
FDASAWEIWGALLGGGRLVIVPESVAASPNDFHGLLVAEHVSVLTQTPAAVAMLPTQGLESVALVVAGEACPAALVDRWA
PGRVMLNAYGPTETTICAAISAPLRPGSGMPPIGVPVSGAALFVLDSWLRPVPAGVAGELYIAGAGVGVGYWRRAGLTAS
RFVACPFGGSGARMYRTGDLVCWRADGQLEFLGRTDDQVKIRGYRIELGEVATALAELAGVGQAVVIAREDRPGDKRLVG
YATEIAPGAVDPAGLRAQLAQRLPGYLVPAAVVVIDALPLTVNGKLDHRALPAPEYGDTNGYRAPAGPVEKTVAGIFARV
LGLERVGVDDSFFELGGDSLAAMRVIAAINTTLNADLPVRALLHASSTRGLSQLLGRDARPTSDPRLVSVHGDNPTEVHA
SDLTLDRFIDADTLATAVNLPGPSPELRTVLLTGATGFLGRYLVLELLRRLDVDGRLICLVRAESDEDARRRLEKTFDSG
DPELLRHFKELAADRLEVVAGDKSEPDLGLDQPMWRRLAETVDLIVDSAAMVNAFPYHELFGPNVAGTAELIRIALTTKL
KPFTYVSTADVGAAIEPSAFTEDADIRVISPTRTVDGGWAGGYGTSKWAGEVLLREANDLCALPVAVFRCGMILADTSYA
GQLNMSDWVTRMVLSLMATGIAPRSFYEPDSEGNRQRAHFDGLPVTFVAEAIAVLGARVAGSSLAGFATYHVMNPHDDGI
GLDEYVDWLIEAGYPIRRIDDFAEWLQRFEASLGALPDRQRRHSVLPMLLASNSQRLQPLKPTRGCSAPTDRFRAAVRAA
KVGSDKDNPDIPHVSAPTIINYVTNLQLLGLL
>P0DX16 2.3.1.-~~~scoA~~~Isonitrile lipopeptide synthase~~~
MSPHDDAINGTGDMTDRRPLLAAQEGIWTGQQLDPDSPAYNTAEYVHIDGPVDSAVFDTALHHVVAETKALNVAFVVDEQ
GQPWETDAPAGDWHLHTADLTAEPDPHAAALAWMDRDMARPVDLARRPVFGHALLRIAPEQYLWYHRVHHIALDGFGLSL
VARRVAEVYTALTVGEPVADSGFGTLASVRDEERVYRESARFAKDRDYWADRFADRPPVATPAGRTALPARTFHRRVVDL
GAVQTETLRAVARDLEVTWSEVLLAVTAARLHHATGASEIVLSLPVMGRLGSVSLRVPCMVRNILPLRVTVTASDSLREL
AARISRELRSGLPHQRYRYEQLRRDLRLVGGQRRLSGPGVNIMPFEYDLRFAGHPSTVHNVSAGPVDDLSVNVYDRAEGA
GLRIAVDANPDLYDEADVTALQEGLLSLLGQAVAAPDRALGELRTREAVPVLDGGPLPGPVRPVLGLIADHAAQRGGSVA
VEHDGRSITYAQLFGSARDLARRLAARQVGRGDVVAVAVPRGIDAITAILGVLLSGAAYCPLDPTAPRARKAELLDDARP
ALVLTASAHAADFGDRAVVRLDQPEPESQEAARPTAPAPTAPAPEDLAYVIHTSGSTGRPKGVEIGHRALAHFVAGATHR
YGLHHGDRVVQFAALHFDTSVEEVFLTLCAGATLVVRTDDMTDSVPGFLDACARLRISFLDLPTAYWHELAYAISTGAAA
LPAEVRTVVIGGEAALPERVDRWRKAVGTSVRLLNTYGPTEATVVATVADLHDPSLAPGDVPIGLPLPGTRAAVVDGELH
LLGDNLAVGYRGDRPPDAARFAPLDAVHEAPRAYRTGDLVRIGDDGQLRYLGRSDTEFKISGHRVHPAEVESALLAHPGV
RDAAVVGQLLHDGTRRLVAHVVPDGPAPAVALIRDHLRAALPAAMVPSAVEFLDRLPRTSAGKIDRNALAAMAPDVHVPD
PDAQVPDPGAETAAHDSTLERTIAAVWQQVLAVAAVSARDDVFDLGAQSLQVIQVANRLSVELRRDVKVAWLFQHPTPAE
LARFLKQQEQQAHAQVQPRPAGPGLPPTLLADAVLDPDIRPGGGHPRAAGTPDRVLLTGATGFVGVHLLAELLTSTDAEV
VCTVRAPSPAAAAARIHQTLEIHQIHLSDVARKRITAVPADLARPRLGLDEALFAELTRTCGAIVHNGATVSIMREYATL
RAANTESTRDLLRMAAVRSTPLHFVSTLSVAPPIGLAPEVPEAFLPPHTGLRYGYQQSKWAAERLLEQAAERGLPVTVHR
LGRIVGPHATGYVNERDFLWSVLRAGVPAGIVPDLFEEETWTPVDHIAQALVHLSLGQRPPTATVFNHATTPVRLSDVYD
WLEEYGYPLRRMPLAQWRAELRGSSGAFGAVATTLAFFDSWDADTDEATGPELRLGRVRADNVVTGLHGSGITCPSVDRD
LVFRYLDHCVTTGTLPAPAGKQGHPAMPAK
>P9WM65 ~~~~~~Acyl carrier protein Rv0100~~~
MRDRILAAVCDVLYIDEADLIDGDETDLRDLGLDSVRFVLLMKQLGVNRQSELPSRLAANPSIAGWLRELEAVCTEFG
>B2HKM1 6.2.1.-~~~mmaC~~~Fatty acid--[acyl-carrier-protein] ligase MmaC~~~COG0318
MSDLPATVLERIIEQAQRRPEAIALRRCDGTSALPYSELAAEVDRYAGALRAQSASRGSRVLVISDNGPETYLAVLACAK
LGAIAVMADGNLPPATIDRFCQITDPVAVLIAPGSKVGSSSLPEGLAAIPAIRVDIGSGTGEFAHSPDTDRPATEPGLGA
DDPLAMIFTSGTTGEPKAVLLANRTFFAVPDILRNEGLNWVTWVDGETTYSPLPATHIGGLWWILTCLMRGGLCITGGEN
TPSLMQILNSNAVNTTCLVPTLLSKLVSELKSAATTVPSLRLLGYGGSRAIAADVRFIEATGVRTAQVYGLSETGCTALC
LPTDDDSIAKIEAGAVGRPYPGVEVYLAADDEADGAGPNAPGAGPSASFGTLWIKSPANMLGYWSNPQRTQEVLIDGWVN
TGDLLERHEDGFFYIKGRSSEMIISGGVNIAPDEVDRIAEGVPGVREAACFEIPDPEFGALVGLAVVAATDMDASAARKL
KHTIAAHYRRESESVARPSTIVIVSEIPRTQSGKVMRTSLAAAANQVQTGG
>P9WQ55 6.2.1.20~~~~~~Medium/long-chain-fatty-acid--[acyl-carrier-protein] ligase FadD10~~~COG0318
MGGKKFQAMPQLPSTVLDRVFEQARQQPEAIALRRCDGTSALRYRELVAEVGGLAADLRAQSVSRGSRVLVISDNGPETY
LSVLACAKLGAIAVMADGNLPIAAIERFCQITDPAAALVAPGSKMASSAVPEALHSIPVIAVDIAAVTRESEHSLDAASL
AGNADQGSEDPLAMIFTSGTTGEPKAVLLANRTFFAVPDILQKEGLNWVTWVVGETTYSPLPATHIGGLWWILTCLMHGG
LCVTGGENTTSLLEILTTNAVATTCLVPTLLSKLVSELKSANATVPSLRLVGYGGSRAIAADVRFIEATGVRTAQVYGLS
ETGCTALCLPTDDGSIVKIEAGAVGRPYPGVDVYLAATDGIGPTAPGAGPSASFGTLWIKSPANMLGYWNNPERTAEVLI
DGWVNTGDLLERREDGFFYIKGRSSEMIICGGVNIAPDEVDRIAEGVSGVREAACYEIPDEEFGALVGLAVVASAELDES
AARALKHTIAARFRRESEPMARPSTIVIVTDIPRTQSGKVMRASLAAAATADKARVVVRG
>P0DX14 6.2.1.-~~~scoC~~~Fatty acid--[acyl-carrier-protein] ligase ScoC~~~
MDRLHHPQLQTLVQTTSFHAQHEPTTPAVLCEGRTLTYEQLHRESNRIAHALKAAGLAPGDRVAYLGKESEHYYEILFGC
AKSGTVLVPVNWRLTAPEVSHILQDSGTRLLFLEDEFGPVVEKMPAAPPETIVALGESFAAWKASHLDTDPKPHDVTPDT
PVAQLYTSGTTGLPKGVVLAHRSFFAIRDALASEGLDWIDWRVGDIALIGIPGFHIGGLWWATQNFNAGTTVVAMRAFAA
RQAVDLIRDLGITTACVVPAMLRMMLTEPGVGAKDFTTLRKTVYGGSPISEALLEESLAVLDCEFAQIYGLTETGNTAVC
LPPAAHVPGGSLMQAAGHPYPGVRSKVIDGEGRELPPGAVGEVCLATPARMVEYWGLPDKTAETLVDGWIHTGDAGYVDE
DGYVFIRDRIKDAILVAGENVYPAEIENVLEGHPGVAEAVVVGAPDERWGEYVHAFVVAAPGQQPSPRDLHTFLVPQLAS
FKLPARYEFIDSVPRNPSGKILRRELRDRFWGDSARKVN
>B2HKM2 4.3.2.11~~~mmaD~~~(2E)-enoyl-[ACP] glycyltransferase~~~
MSTTDLTSPAHAAVESAGTTAIAEDLLARVLEPYSYKGCRYLLDARYHADDDSVLAYGNFTISESAYIRSTGHFNAVELI
LCFNQLAYSAFAPAVANEEIPQLRGWSLEDYFQHQLASMFIRNSSSRFNRPINPAKFSARLQCRNLQVVQRTWRYLLVPC
AIEFWDEDGGAASGEVELAALNIP
>P9WM67 4.3.2.11~~~fcoT~~~(2E)-enoyl-[ACP] glycyltransferase~~~
MSHTDLTPCTRVLASSGTVPIAEELLARVLEPYSCKGCRYLIDAQYSATEDSVLAYGNFTIGESAYIRSTGHFNAVELIL
CFNQLAYSAFAPAVLNEEIRVLRGWSIDDYCQHQLSSMLIRKASSRFRKPLNPQKFSARLLCRDLQVIERTWRYLKVPCV
IEFWDENGGAASGEIELAALNIP
>P0DX13 4.3.2.11~~~scoD~~~(2E)-enoyl-[ACP] glycyltransferase~~~
MTDEALLTQVLMPYKDHCKYLRSAVVTETDGRASARCEFEIPESCYIDDTGHLNSVEVNICYNQMMYYLVAKSVKEGLGT
GFESWTLEDFWKHQLPDILIARFSSNFRRPVNPRVFSGEMEFRSVTRRAPAGGSPFVHADTAFRYWDADAGRCDGEATLA
FVNVP
>P9WG83 1.14.11.78~~~~~~(3R)-3-[(carboxymethyl)amino]fatty acid oxygenase/decarboxylase~~~COG2175
MTLKVKGEGLGAQVTGVDPKNLDDITTDEIRDIVYTNKLVVLKDVHPSPREFIKLGRIIGQIVPYYEPMYHHEDHPEIFV
SSTEEGQGVPKTGAFWHIDYMFMPEPFAFSMVLPLAVPGHDRGTYFIDLARVWQSLPAAKRDPARGTVSTHDPRRHIKIR
PSDVYRPIGEVWDEINRTTPPIKWPTVIRHPKTGQEILYICATGTTKIEDKDGNPVDPEVLQELMAATGQLDPEYQSPFI
HTQHYQVGDIILWDNRVLMHRAKHGSAAGTLTTYRLTMLDGLKTPGYAA
>A0A3B6UEU3 1.14.11.78~~~ScoE~~~(3R)-3-[(carboxymethyl)amino]fatty acid oxygenase/decarboxylase~~~
MQIDEQPGNAIGAAVEGFDHATASDADIDALKSTIYTKKIAVLKGQDLSPQQFLALGKRLGRPEAYYEPMYQHPEVTEIF
VSSNVPENGKQIGVPKTGKFWHADYQFMPDPFGITLIYPQVIPEKNRGTYFIDMGRAYDRLPEDLKKEISGTYCRHSVRK
YFKIRPHDVYRPISEIIEEVERKTPAVVQPTTFTHPMTGETVLYISEGFTVGIEDQDGKPLDEELLKRLFDATGQLDESF
EHDNIHLQSFEQGDLLVWDNRSLIHRARHTTTPEPTVSYRVTVHDERKLHDGIQAA
>Q8A7J8 5.5.1.4~~~~~~Inositol-3-phosphate synthase 1~~~COG1260
MKQEIKPATGRLGVLVVGVGGAVATTMIVGTLASRKGLAKPIGSITQLATMRMENNEEKLIKDVVPLTDLNDIVFGGWDI
FPDNAYEAAMYAEVLKEKDLNGVKDELEAIKPMPAAFDHNWAKRLNGTHIKKAATRWEMVEQLRQDIRDFKAANNCERVV
VLWAASTEIYIPLSDEHMSLAALEKAMKDNNTEVISPSMCYAYAAIAEDAPFVMGAPNLCVDTPAMWEFSKQKNVPISGK
DFKSGQTLMKTVLAPMFKTRMLGVNGWFSTNILGNRDGEVLDDPDNFKTKEVSKLSVIDTIFEPEKYPDLYGDVYHKVRI
NYYPPRKDNKEAWDNIDIFGWMGYPMEIKVNFLCRDSILAAPIALDLVLFSDLAMRAGMCGIQTWLSFFCKSPMHDFEHQ
PEHDLFTQWRMVKQTLRNMIGEKEPDYLA
>Q8NLE6 5.5.1.4~~~ino-1~~~Inositol-3-phosphate synthase~~~COG1260
MSTSTIRVAIAGVGNCATSLIQGVEYYRNADPSETVPGLMHVKFGDYHVGDIEFVAAFDVDAEKVGIDLADATEASQNCT
IKIADVPQTGINVLRGPTLDGLGDHYRATIDESTAEPVDVVQALIDAKADVLVSYLPVGSEEADKFYAQAAIDAGCAFVN
ALPVFIASDPEWAKKFTDAGIPIVGDDIKSQIGATITHRVLARLFEERGVRVDRTMQLNVGGNMDFKNMLDRNRLESKKV
SKTQAVTSNIPDGPLSGKVEDRNVHIGPSDHVQWLDDRKWAYVRLEGTAFGGVPLNLEYKLEVWDSPNSAGIIIDAVRAA
KIALDRGIGGPIMPASSYLMKSPPEQLPDDVARERLEAFIIEA
>A0R7G6 5.5.1.4~~~ino1~~~Inositol-3-phosphate synthase~~~COG1260
MSEHAGEIRVAIVGVGNCASSLVQGVQYYRNADENTTVPGLMHVKFGPYHVRDVNFVAAFDVDAKKVGFDLSEAIFASEN
NTIKIADVPPTDVIVQRGPTLDGIGKYYADTIEVSDAEPVDVVKVLKEAEVDVLVSYLPVGSEEADKFYAQCAIDAGVAF
VNALPVFIASDPVWAKKFEDAGVPIVGDDIKSQVGATITHRVMAKLFEDRGVTLDRTYQLNVGGNMDFLNMLERSRLESK
KVSKTQAVTSNLSGALAGKVEDKNVHIGPSDHVAWLDDRKWAYVRLEGRAFGDVPLNLEYKLEVWDSPNSAGVIIDAVRA
AKIAKDRGIGGPIEAASAYLMKSPPKQLADDVARAELETFIEG
>P9WKI1 5.5.1.4~~~ino1~~~Inositol-3-phosphate synthase~~~COG1260
MSEHQSLPAPEASTEVRVAIVGVGNCASSLVQGVEYYYNADDTSTVPGLMHVRFGPYHVRDVKFVAAFDVDAKKVGFDLS
DAIFASENNTIKIADVAPTNVIVQRGPTLDGIGKYYADTIELSDAEPVDVVQALKEAKVDVLVSYLPVGSEEADKFYAQC
AIDAGVAFVNALPVFIASDPVWAKKFTDARVPIVGDDIKSQVGATITHRVLAKLFEDRGVQLDRTMQLNVGGNMDFLNML
ERERLESKKISKTQAVTSNLKREFKTKDVHIGPSDHVGWLDDRKWAYVRLEGRAFGDVPLNLEYKLEVWDSPNSAGVIID
AVRAAKIAKDRGIGGPVIPASAYLMKSPPEQLPDDIARAQLEEFIIG
>Q8A7J9 3.1.3.64~~~~~~Putative phosphatidylinositol-3-phosphatase~~~COG1267
MKRPSFLPVLIGTGFGSGFSPFAPGTAGALLASIIWIALYFLLPFTALLWTTAALVVLFTFAGIWAANKLESCWGEDPSR
VVVDEMVGVWIPLLAVPDNDRWYWYVIAAFALFRIFDIVKPLGVRKMENFKGGVGVMMDDVLAGVYSFILIAVARWVIG
>A1T557 ~~~~~~INSIG protein homolog~~~
MRLRISEAVVLFLLGAVAALIGDHSHVVTGTTVYHTDAVPFVWSSPFWFPILVGAATASLAELRLHLPAPRDGVTARQAL
GGVAAVVGTYVTTALVHAFPVVPVTALVCAAAAITWCVLGDGPGAACGVVIAVIGPAVEIALVQLGVFAYHPDSDGLFGV
APFLAPLYFAFGVVAALLGELAVARRPQLGPPVCDTVSRGPGAG
>P19769 ~~~insK~~~Putative transposase InsK for insertion sequence element IS150~~~COG2801
MKVLNELRQFYPLDELLRAAEIPRSTFYYHLKALSKPDKYADVKKRISEIYHENRGRYGYRRVTLSLHREGKQINHKAVQ
RLMGTLSLKAAIKVKRYRSYRGEVGQTAPNVLQRDFKATRPNEKWVTDVTEFAVNGRKLYLSPVIDLFNNEVISYSLSER
PVMNMVENMLDQAFKKLNPHEHPVLHSDQGWQYRMRRYQNILKEHGIKQSMSRKGNCLDNAVVECFFGTLKSECFYLDEF
SNISELKDAVTEYIEYYNSRRISLKLKGLTPIEYRNQTYMPRV
>P9WMB3 ~~~~~~Putative prophage phiRv2 integrase~~~COG0582
MTQTGKRQRRKFGRIRQFNSGRWQASYTGPDGRVYIAPKTFNAKIDAEAWLTDRRREIDRQLWSPASGQEDRPGAPFGEY
AEGWLKQRGIKDRTRAHYRKLLDNHILATFADTDLRDITPAAVRRWYATTAVGTPTMRAHSYSLLRAIMQTALADDLIDS
NPCRISGASTARRVHKIRPATLDELETITKAMPDPYQAFVLMAAWLAMRYGELTELRRKDIDLHGEVARVRRAVVRVGEG
FKVTTPKSDAGVRDISIPPHLIPAIEDHLHKHVNPGRESLLFPSVNDPNRHLAPSALYRMFYKARKAAGRPDLRVHDLRH
SGAVLAASTGATLAELMQRLGHSTAGAALRYQHAAKGRDREIAALLSKLAENQEM
>P32053 ~~~intA~~~Prophage integrase IntA~~~COG0582
MARKTKPLTDTEIKAAKPKDADYQLYDGDGLTLLIKSSGSKLWQFRYYRPLTKQRTKQSFGAYPAVSLSDARKLRAESKV
LLAKDIDPQEHQKEQVRNSQEAKTNTFLLVAERWWNVKKTSVTEDYADDIWRSLERDIFPAIGDISITEIKAHTLVKAVQ
PVQARGALETVRRLCQRINEVMIYAQNTGLIDAVPSVNIGKAFEKPQKKNMPSIRPDQLPQLMHTMRTASISMSTRCLFM
WQLLTITRPAEAAEARWDEIDFNASEWKIPAARMKMNRDHTVPLSDGALAILEMMKPLSGGREFIFPSRIKPNQPMNSQT
VNAALKRAGLGGVLVSHGLRSIASTALNEEGFPPDVIEAALAHVDKNEVRRAYNRSDYLEQRRPMMQWWADLVKAADSGS
IVLTHLSKIRLVG
>P76168 ~~~intQ~~~Putative defective protein IntQ~~~COG0582
MITDVWKYRGKSTGELRSSVCYAIKTGVFDYAKQFPSSRNLEKFGEARQDLTIKELAEKFLALKETEVAKTSLNTYRAVI
KNILSIIGEKNLASSINKEKLLEVRKELLTGYQIPKSNYIVTQPGRSAVTVNNYMTNLNAVFQFGVDNGYLADNPFKGIS
PLKESRTIPDPLSREEFIRLIDACRNQQAKNLWCVSVYTGVRPGELCALGWEDIDLKNGTMMIRRNLAKDRFTVPKTQAG
TNRVIHLIKPAIDALRSQMTLTRLSKEHIIDVHFREYGRTEKQKCTFVFQPEVSARVKNYGDHFTVDSIRQMWDAAIKRA
GLRHRKSYQSRHTYACWSLTAGANPAFIANQMGHADAQMVFQVYGKWMSENNNAQVALLNTQLSEFAPTMPHNEAMKN
>P37326 ~~~intS~~~Prophage integrase IntS~~~COG0582
MLTVKQIEAAKPKEKPYRLLDGNGLYLYVPVSGKKVWQLRYKIDGKEKILTVGKYPLMTLQEARDKAWTARKDISVGIDP
VKAKKASSNNNSFSAIYKEWYEHKKQVWSVGYATELAKMFDDDILPIIGGLEIQDIEPMQLLEVIRRFEDRGAMERANKA
RRRCGEVFRYAIVTGRAKYNPAPDLADAMKGYRKKNFPFLPADQIPAFNKALATFSGSIVSLIATKVLRYTALRTKELRS
MLWKNVDFENRIITIDASVMKGRKIHVVPMSDQVVELLTTLSSITKPVSEFVFAGRNDKKKPICENAVLLVIKQIGYEGL
ESGHGFRHEFSTIMNEHEWPADAIEVQLAHANGGSVRGIYNHAQYLDKRREMMQWWADWLDEKVE
>P19870 4.2.2.17~~~~~~Inulin fructotransferase [DFA-I-forming]~~~
MANTVYDVTTWSGATISPYVDIGAVINQIIADIKANQTSQAARPGAVIYIPPGHYDLLTRVVVDVSFLQIKGSGHGFLSE
AIRDESSTGSWVETQPGASHIRVKNTDGNREAFLVSRSGDPNVVGRLNSIEFKGFCLDGVTDSKPYSPGNSKIGISVQSD
NDSFHVEGMGFVYLEHAIIVKGADAPNITNNFIAECGSCIELTGASQVAKITNNFLISAWAGYSIYAENAEGPLITGNSL
LWAANITLSDCNRVSISSNKLLSNFPSMVALLGNCSENLIAANHFRRVSGDGTSTRFDDLFGLVHIEGNNNTVTGNMFSF
NVPASSISPSGATPTIILVKSGDSNYLATNNIVSNVSAMVVLDGSTTATRIIYSAKNSQLNAYTTSYTLVPTP
>D3WYV9 2.4.1.9~~~inuGB~~~Inulosucrase~~~
MLENKNHKKMSLSGKSLLMGTLSTAAIVLSASTVNAATTNADNVTKNQTVAVSATTTNNETNNQVSSSSEKTADSKTEKD
TNLTSAATKEVKADAAKTTSPVNNVKTVADTTTTTKETTDNTEKSPVNFSADVKKNDAVKQDEKAATAVKANTEVKANET
STKSASKDNKAELKGQIKDIVKESGVDTSKLTDDQINELNKISFSKEAKSGTQLTYSDFKKIAKTLIEQDARYAVPFFNA
SKIKNMPAAKTLDAQTGKVEDLEIWDSWPVQDAKTGYVSNWNGYQLVIGMMGVPNTNDNHIYLLYNKYGDNNFNNWKNAG
PIFGLGTPVIQQWSGSATLNKDGSIQLYYTKVDTSDNNTNHQKIASATVYLNLEKNQDKISIAHVDNDHIVFEGDGYHYQ
TYNQWKKTNKGADNIAMRDAHVIDDKDGNRYLVFEASTGTENYQGADQIYQWLNYGGTNKDNLGDFLQILSNSDIKDRAK
WSNAAIGIIKLNNDTKNPGVEKVYTPLISAPMVSDEIERPDVVRLGNKYYLFAATRLNRGSNDDAWMAANKAVGDNVAMI
GYVSDNLTHGYVPLNESGVVLTASVPANWRTATYSYYAVPVEGRDDQLLITSYITNRGEVAGKGMHATWAPSFLLQINPD
NTTTVLAKMTNQGDWIWDDSSENADMMGVLEKDAPNSAALPGEWGKPVDWDLIGGYNLKPHQPVTPIPNVPTTPEKPENP
TTPNTPDTPHTPTTPNTPDTPRTPEVPTTPVKKTTQSELRS
>Q74K42 2.4.1.9~~~inuJ~~~Inulosucrase~~~COG1621
MLENKNHKKISLSGKSLLMGTLSTAAIVLSASTANAATINADNVNENQTVEVTASSVNNENNKQVTEKDSADKSTSDVAE
DANTKKSNENTETTEKNTQTVVTNAPVSDVKNTNTVTAETPVDKVVNNSDQKTTNAATTDTKKDDVKQVEKKDSVDKTNA
EENKDSSVKPAENATKAELKGQVKDIVEESGVDTSKLTNDQINELNKINFSKEAKSGTQLTYNDFKKIAKTLIEQDARYA
IPFFNASKIKNMPAAKTLDAQSGKVEDLEIWDSWPVQDAKTGYVSNWNGYQLVIGMMGVPNVNDNHIYLLYNKYGDNDFN
HWKNAGPIFGLGTPVIQQWSGSATLNKDGSIQLYYTKVDTSDNNTNHQKLASATVYLNLEKDQDKISIAHVDNDHIVFEG
DGYHYQTYDQWKETNKGADNIAMRDAHVIDDDNGNRYLVFEASTGTENYQGDDQIYQWLNYGGTNKDNLGDFFQILSNSD
IKDRAKWSNAAIGIIKLNDDVKNPSVAKVYSPLISAPMVSDEIERPDVVKLGNKYYLFAATRLNRGSNDDAWMATNKAVG
DNVAMIGYVSDNLTHGYVPLNESGVVLTASVPANWRTATYSYYAVPVEGRDDQLLITSYITNRGEVAGKGMHATWAPSFL
LQINPDNTTTVLAKMTNQGDWIWDDSSENPDMMGVLEKDAPNSAALPGEWGKPVDWDLIGGYNLKPHQPVTPIPNVPTTP
ETPTTPDKPEVPTTPEVPTTPETPTPEAPKNPVKKTSQSKLPKAGDKNSFAAVVLGAVSSILGAVGLTGVSKRKRNN
>P0A1I3 ~~~invA~~~Invasion protein InvA~~~
MLLSLLNSARLRPELLILVLMVMIISMFVIPLPTYLVDFLIALNIVLAILVFMGSFYIDRILSFSTFPAVLLITTLFRLA
LSISTSRLILIEADAGEIIATFGQFVIGDSLAVGFVVFSIVTVVQFIVITKGSERVAEVAARFSLDGMPGKQMSIDADLK
AGIIDADAARERRSVLERESQLYGSFDGAMKFIKGDAIAGIIIIFVNFIGGISVGMTRHGMDLSSALSTYTMLTIGDGLV
AQIPALLIAISAGFIVTRVNGDSDNMGRNIMTQLLNNPFVLVVTAILTISMGTLPGFPLPVFVILSVVLSVLFYFKFREA
KRSAAKPKTSKGEQPLSIEEKEGSSLGLIGDLDKVSTETVPLILLVPKSRREDLEKAQLAERLRSQFFIDYGVRLPEVLL
RDGEGLDDNSIVLLINEIRVEQFTVYFDLMRVVNYSDEVVSFGINPTIHQQGSSQYFWVTHEEGEKLRELGYVLRNALDE
LYHCLAVTLARNVNEYFGIQETKHMLDQLEAKFPDLLKEVLRHATVQRISEVLQRLLSERVSVRNMKLIMEALALWAPRE
KDVINLVEHIRGAMARYICHKFANGGELRAVMVSAEVEDVIRKGIRQTSGSTFLSLDPEASANLMDLITLKLDDLLIAHK
DLVLLTSVDVRRFIKKMIEGRFPDLEVLSFGEIADSKSVNVIKTI
>P11922 ~~~~~~Invasin~~~
MVFQPISEFLLIRNAGMSMYFNKIISFNIISRIVICIFLICGMFMAGASEKYDANAPQQVQPYSVSSSAFENLHPNNEME
SSINPFSASDTERNAAIIDRANKEQETEAVNKMISTGARLAASGRASDVAHSMVGDAVNQEIKQWLNRFGTAQVNLNFDK
NFSLKESSLDWLAPWYDSASFLFFSQLGIRNKDSRNTLNLGVGIRTLENGWLYGLNTFYDNDLTGHNHRIGLGAEAWTDY
LQLAANGYFRLNGWHSSRDFSDYKERPATGGDLRANAYLPALPQLGGKLMYEQYTGERVALFGKDNLQRNPYAVTAGINY
TPVPLLTVGVDQRMGKSSKHETQWNLQMNYRLGESFQSQLSPSAVAGTRLLAESRYNLVDRNNNIVLEYQKQQVVKLTLS
PATISGLPGQVYQVNAQVQGASAVREIVWSDAELIAAGGTLTPLSTTQFNLVLPPYKRTAQVSRVTDDLTANFYSLSALA
VDHQGNRSNSFTLSVTVQQPQLTLTAAVIGDGAPANGKTAITVEFTVADFEGKPLAGQEVVITTNNGALPNKITEKTDAN
GVARIALTNTTDGVTVVTAEVEGQRQSVDTHFVKGTIAADKSTLAAVPTSIIADGLMASTITLELKDTYGDPQAGANVAF
DTTLGNMGVITDHNDGTYSAPLTSTTLGVATVTVKVDGAAFSVPSVTVNFTADPIPDAGRSSFTVSTPDILADGTMSSTL
SFVPVDKNGHFISGMQGLSFTQNGVPVSISPITEQPDSYTATVVGNTAGDVTITPQVDTLILSTLQKKISLFPVPTLTGI
LVNGQNFATDKGFPKTIFKNATFQLQMDNDVANNTQYEWSSSFTPNVSVNDQGQVTITYQTYSEVAVTAKSKKFPSYSVS
YRFYPNRWIYDGGTSLVSSLEASRQCQGSDMSAVLESSRATNGTRAPDGTLWGEWGSLTAYSSDWQSGEYWVKKTSTDFE
TMNMDTGALVQGPAYLAFPLCALAI
>F8DT27 3.2.1.26~~~sacC~~~Extracellular sucrase~~~COG1621
MFNFNASRWTRAQAMKVNKFDLTTSMPEIGTDFPIMRDDLWLWDTWPLRDINGNPVSFKGWNVIFSLVADRNIPWNDRHS
HARIGYFYSKDGKSWVYGGHLLQESANTRTAEWSGGTIMAPGSRNQVETFFTSTLFDKNGVREAVAAVTKGRIYADSEGV
WFKGFDQSTDLFQADGLFYQNYAENNLWNFRDPHVFINPEDGETYALFEANVATVRGEDDIGEDEIGPVPANTVVPKDAN
LCSASIGIARCLSPDRTEWELLPPLLTAFGVNDQMERPHVIFQNGLTYLFTISHDSTYADGLTGSDGLYGFVSENGIFGP
YEPLNGSGLVLGGPASQPTEAYAHYIMNNGLVESFINEIIDPKSGKVIAGGSLAPTVRVELQGHETFATEVFDYGYIPAS
YAWPVWPFPDRRK
>P69343 ~~~invF~~~Invasion protein InvF~~~
MSFSESRHNENCLIQEGALLFCEQAVVAPVSGDLVFRPLKIEVLSKLLAFIDGAGLVDTTYAESDKWVLLSPEFRAIWQD
RKRCEYWFLQQIITPSPAFNKVLALLRKSESYWLVGYLLAQSTSGNTMRMLGEDYGVSYTHFRRLCSRALGGKAKSELRN
WRMAQSLLNSVEGHENITQLAVNHGYSSPSHFSSEIKELIGVSPRKLSNIIQLADK
>Q7P0P8 ~~~rsfS~~~Ribosomal silencing factor RsfS~~~COG0799
MEIQEISKLAIEALEDIKGKDIIELDTSKLTSLFQRMIVATGDSNRQVKALANSVQVKLKEAGVDIVGSEGHESGEWVLV
DAGDVVVHVMLPAVRDYYDIEALWGGQKPSFAVGAAKPWSAV
>P0AAT6 ~~~rsfS~~~Ribosomal silencing factor RsfS~~~COG0799
MQGKALQDFVIDKIDDLKGQDIIALDVQGKSSITDCMIICTGTSSRHVMSIADHVVQESRAAGLLPLGVEGENSADWIVV
DLGDVIVHVMQEESRRLYELEKLWS
>Q9KD89 ~~~rsfS~~~Ribosomal silencing factor RsfS~~~COG0799
MSNQELLQLAVNAVDDKKAEQVVALNMKGISLIADFFLICHGNSEKQVQAIAHELKKVAQEQGIEIKRLEGYEQARWVLI
DLGDVVVHVFHKDERAYYNLEKLWGDAPTVELEGVIS
>Q02SH2 ~~~rsfS~~~Ribosomal silencing factor RsfS~~~
MQTEQLVQVAIDALEDLKAQDIVTLDVRDKTSVTDYMVIACGSSSRQVKSLADNVLTKAKENGVKPLGSEGLESGEWALL
DLGDVVVHVMLPATRQFYDLERLWQGAEQSRAQHQPEE
>Q97P97 ~~~rsfS~~~Ribosomal silencing factor RsfS~~~COG0799
MNEKELLELVVKAADEKRAEDILALDVQDLTSVTDYLVITSSMNSRQLDAIAANIREKVAQAGFKGSHVEGDAAGGWVLL
DLGAVVVHIFSEEMRAHYNLEKLWHEANSVDISEALA
>P73658 ~~~rsfS~~~Ribosomal silencing factor RsfS~~~COG0799
MTEISRFDAYPPTLTVNPIVPADAPATEHLVWTIAQAAEERKAGDLVILKVTDVSYLADYFVICTGFSRTQVRAIADNIE
KQVELVHGQLPTHTEGNSESIWVLQDFGDVLVHTFMPEEREFYKLEAFWGHAQEQTLADIATAIGVAYNAPTSP
>O83720 ~~~rsfS~~~Ribosomal silencing factor RsfS~~~COG0799
MSANGAASAVAEALCDARAEDVCVFDVSARCGWADFAVVATVPGLLHGTHRLVCEQAARFGLREVHRKKRGLCEEQWRVL
DFGSILVHLMSAQARAFYDLDRLWQDCLVAR
>Q5NLX3 ~~~rsfS~~~Ribosomal silencing factor RsfS~~~COG0799
MPAPSSPRKNQTSFDPEMLLKLVTDSLDDDQALEIATIPLAGKSSIADYMVIASGRSSRQVTAMAQKLADRIKAATGYVS
KIEGLPAADWVLLDAGDIIIHLFRPEVRSFYNLERMWGFGDESDQPVSQSVLS
>P42412 1.2.1.-~~~iolA~~~Malonate-semialdehyde dehydrogenase~~~COG1012
MAEIRKLKNYINGEWVESKTDQYEDVVNPATKEVLCQVPISTKEDIDYAAQTAAEAFKTWSKVAVPRRARILFNFQQLLS
QHKEELAHLITIENGKNTKEALGEVGRGIENVEFAAGAPSLMMGDSLASIATDVEAANYRYPIGVVGGIAPFNFPMMVPC
WMFPMAIALGNTFILKPSERTPLLTEKLVELFEKAGLPKGVFNVVYGAHDVVNGILEHPEIKAISFVGSKPVGEYVYKKG
SENLKRVQSLTGAKNHTIVLNDANLEDTVTNIVGAAFGSAGERCMACAVVTVEEGIADEFMAKLQEKVADIKIGNGLDDG
VFLGPVIREDNKKRTLSYIEKGLEEGARLVCDGRENVSDDGYFVGPTIFDNVTTEMTIWKDEIFAPVLSVIRVKNLKEAI
EIANKSEFANGACLFTSNSNAIRYFRENIDAGMLGINLGVPAPMAFFPFSGWKSSFFGTLHANGKDSVDFYTRKKVVTAR
YPAPDFN
>P42413 5.3.1.30~~~iolB~~~5-deoxy-glucuronate isomerase~~~COG3718
MSYLLRKPQSHEVSNGVKLVHEVTTSNSDLTYVEFKVLDLASGSSYTEELKKQEICIVAVTGKITVTDHESTFENIGTRE
SVFERKPTDSVYISNDRAFEITAVSDARVALCYSPSEKQLPTKLIKAEDNGIEHRGQFSNKRTVHNILPDSDPSANSLLV
VEVYTDSGNWSSYPPHKHDQDNLPEESFLEETYYHELDPGQGFVFQRVYTDDRSIDETMTVGNENVVIVPAGYHPVGVPD
GYTSYYLNVMAGPTRKWKFYNDPAHEWILER
>P42414 2.7.1.92~~~iolC~~~5-dehydro-2-deoxygluconokinase~~~COG0524
MKYTFNEEKAFDIVAIGRACIDLNAVEYNRPMEETMTFSKYVGGSPANIAIGSAKLGLKAGFIGKIPDDQHGRFIESYMR
KTGVDTTQMIVDQDGHKAGLAFTEILSPEECSILMYRDDVADLYLEPSEVSEDYIANAKMLLVSGTALAKSPSREAVLKA
VQYAKKHQVKVVFELDYRPYTWQSSDETAVYYSLVAEQSDIVIGTRDEFDVMENRTGGSNEESVNHLFGHSADLVVIKHG
VEGSYAYSKSGEVFRAQAYKTKVLKTFGAGDSYASAFIYGLVSGKDIETALKYGSASASIVVSKHSSSEAMPTAEEIEQL
IEAQS
>Q9KAG8 2.7.1.92~~~iolC~~~5-dehydro-2-deoxygluconokinase~~~COG0524
MTYELSTDREFDLIAIGRACIDLNAVEYNRPMEETMTFSKYVGGSPANIVIGSSKLGLKAGFIGKIADDQHGRFIESYMR
GVGVDTSNLVVDQEGHKTGLAFTEIKSPEECSILMYRQDVADLYLSPEEVNEAYIRRSKLLLVSGTALSKSPSREAVLKA
IRLAKRNDVKVVFELDYRPYSWETPEETAVYYSLVAEQSDIVIGTREEFDVLENRTEKGDNDETIRYLFKHSPELIVIKH
GVEGSFAYTKAGEAYRGYAYKTKVLKTFGAGDSYASAFLYALISGKGIETALKYGSASASIVVSKHSSSDAMPSVEEIEA
LIEKDETITIA
>P42415 3.7.1.22~~~iolD~~~3D-(3,5/4)-trihydroxycyclohexane-1,2-dione hydrolase~~~COG3962
MGKKIRLTTAQALIKFLNQQYIHVDGKEEPFVEGIFTIFGHGNVLGIGQALEQDAGHLKVYQGKNEQGMAHAAMAYSKQM
LRRKIYAVSTSVGPGAANLVAAAGTALANNIPVLLIPADTFATRQPDPVLQQMEQEYSAAITTNDALKPVSRYWDRITRP
EQLMSSLLRAFEVMTDPAKAGPATICISQDVEGEAYDFDESFFVKRVHYIDRMQPSERELQGAAELIKSSKKPVILVGGG
AKYSGARDELVAISEAYNIPLVETQAGKSTVEADFANNLGGMGITGTLAANKAARQADLIIGIGTRYTDFATSSKTAFDF
DKAKFLNINVSRMQAYKLDAFQVVADAKVTLGKLHGLLEGYESEFGTTIRELKDEWLAERERLSKVTFKREAFDPEIKNH
FSQEVLNEYADALNTELPQTTALLTINETIPEDSVIICSAGSLPGDLQRLWHSNVPNTYHLEYGYSCMGYEVSGTLGLKL
AHPDREVYSIVGDGSFLMLHSELITAIQYNKKINVLLFDNSGFGCINNLQMDHGSGSYYCEFRTDDNQILNVDYAKVAEG
YGAKTYRANTVEELKAALEDAKKQDVSTLIEMKVLPKTMTDGYDSWWHVGVAEVSEQESVQKAYEAKEKKLESAKQY
>P42416 4.2.1.44~~~iolE~~~Inosose dehydratase~~~COG1082
MGKNEILWGIAPIGWRNDDMPEIGAGNTLQHLLSDIVVARFQGTEVGGFFPEPAILNKELKLRNLRIAGKWFSSFILRDG
LGEAAKTFTLHCEYLQQVNADVAVVSEQTYSVQSLEKNVFTEKPHFTDDEWERLCEGLNHLGEIAAQHGLKLVYHHHLGT
GVQTAEEVDRLMAGTDPAHVHLLYDTGHAYISDGDYMGMLEKHIGRIKHVHFKDARLNVMEQCRLEGQSFRQSFLKGMFT
VPGDGCIDFREVYQLLLKHSYSGWIVIEAEQDPDVANPLEYALIARNYIDQQLLDLA
>Q88S37 4.2.1.44~~~iolE~~~Inosose dehydratase~~~COG1082
MSSKAEKDIKWGIAPIGWRNDDIPSIGKDNNLQQLLSDIVVAGFQGTEVGGFFPGPEKLNYELKLRNLEIAGQWFSSYII
RDGIEKASEAFEKHCQYLKAINAPVAVVSEQTYTIQRSDTANIFKDKPYFTDKEWDEVCKGLNHYGEIAAKYGLKVAYHH
HMGTGIQTKEETDRLMANTDPKLVGLLYDTGHIAVSDGDYMALLNAHIDRVVHVHFKDVRRSKEEECRAKGLTFQGSFLN
GMFTVPGDGDLDFKPVYDKLIANNYKGWIVVEAEQDPSKANPLEMAQIAHRYIKQHLIEN
>P26935 1.1.1.18~~~iolG~~~Inositol 2-dehydrogenase/D-chiro-inositol 3-dehydrogenase~~~COG0673
MSLRIGVIGTGAIGKEHINRITNKLSGAEIVAVTDVNQEAAQKVVEQYQLNATVYPNDDSLLADENVDAVLVTSWGPAHE
SSVLKAIKAQKYVFCEKPLATTAEGCMRIVEEEIKVGKRLVQVGFMRRYDSGYVQLKEALDNHVIGEPLMIHCAHRNPTV
GDNYTTDMAVVDTLVHEIDVLHWLVNDDYESVQVIYPKKSKNALPHLKDPQIVVIETKGGIVINAEIYVNCKYGYDIQCE
IVGEDGIIKLPEPSSISLRKEGRFSTDILMDWQRRFVAAYDVEIQDFIDSIQKKGEVSGPTAWDGYIAAVTTDACVKAQE
SGQKEKVELKEKPEFYQSFTTVQN
>Q8ZK57 1.1.1.18~~~iolG~~~Inositol 2-dehydrogenase~~~
MTLKAGIVGIGMIGSDHLRRLANTVSGVEVVAVCDIVAGRAQAALDKYAIEAKDYNDYHDLINDKDVEVVIITASNEAHA
DVAVAALNANKYVFCEKPLAVTAADCQRVIEAEQKNGKRMVQIGFMRRYDKGYVQLKNIIDSGEIGQPLMVHGRHYNAST
VPEYKTPQAIYETLIHEIDVMHWLLNEDYKTVKVYFPRQSSLVTTLRDPQLVVMETTSGINIVVEVFVNCQYGYDIHCDV
TGEKGMAELPTVASAAVRKAAKYSTDILVDWKQRFIDAYDIEFQDFFDRLNAGLPPAGPTSWDGYLAAVTADACVKSQET
GNTEIVELPSKPDFYK
>Q9WYP5 1.1.1.18~~~iolG~~~Myo-inositol 2-dehydrogenase~~~COG0673
MRIGVIGLGRIGTIHAENLKMIDDAILYAISDVREDRLREMKEKLGVEKAYKDPHELIEDPNVDAVLVCSSTNTHSELVI
ACAKAKKHVFCEKPLSLNLADVDRMIEETKKADVILFTGFNRRFDRNFKKLKEAVENGTIGKPHVLRITSRDPAPPPLDY
IRVSGGIFLDMTIHDFDMARYIMGEEVEEVFADGSVLVDEEIGKAGDVDTAVVVLRFKSGALGVIDNSRRAVYGYDQRIE
VFGSKGRIFADNVRETTVVLTDEQGDRGSRYLYFFLERYRDSYLEELKTFIKNVKSGEPPAVSGEDGKMALLLGYAAKKS
LEEKRSVKLEEVIG
>P42419 5.3.99.11~~~iolI~~~Inosose isomerase~~~COG1082
MKLCFNEATTLENSNLKLDLELCEKHGYDYIEIRTMDKLPEYLKDHSLDDLAEYFQTHHIKPLALNALVFFNNRDEKGHN
EIITEFKGMMETCKTLGVKYVVAVPLVTEQKIVKEEIKKSSVDVLTELSDIAEPYGVKIALEFVGHPQCTVNTFEQAYEI
VNTVNRDNVGLVLDSFHFHAMGSNIESLKQADGKKIFIYHIDDTEDFPIGFLTDEDRVWPGQGAIDLDAHLSALKEIGFS
DVVSVELFRPEYYKLTAEEAIQTAKKTTVDVVSKYFSM
>P42420 4.1.2.29~~~iolJ~~~6-phospho-5-dehydro-2-deoxy-D-gluconate aldolase~~~COG0191
MAFVSMKELLEDAKREQYAIGQFNINGLQWTKAILQAAQKEQSPVIAAASDRLVDYLGGFKTIAAMVGALIEDMAITVPV
VLHLDHGSSAERCRQAIDAGFSSVMIDGSHQPIDENIAMTKEVTDYAAKHGVSVEAEVGTVGGMEDGLVGGVRYADITEC
ERIVKETNIDALAAALGSVHGKYQGEPNLGFKEMEAISRMTDIPLVLHGASGIPQDQIKKAITLGHAKININTECMVAWT
DETRRMFQENSDLYEPRGYLTPGIEAVEETVRSKMREFGSAGKAAKQQVG
>Q9WYP3 1.1.1.-~~~iolM~~~scyllo-inosose 3-dehydrogenase~~~COG1063
MRAVRLHAKWDPRPEFKLGPKDIEGKLTWLGSKVWRYPEVRVEEVPEPRIEKPTEIIIKVKACGICGSDVHMAQTDEEGY
ILYPGLTGFPVTLGHEFSGVVVEAGPEAINRRTNKRFEIGEPVCAEEMLWCGHCRPCAEGFPNHCENLNELGFNVDGAFA
EYVKVDAKYAWSLRELEGVYEGDRLFLAGSLVEPTSVAYNAVIVRGGGIRPGDNVVILGGGPIGLAAVAILKHAGASKVI
LSEPSEVRRNLAKELGADHVIDPTKENFVEAVLDYTNGLGAKLFLEATGVPQLVWPQIEEVIWRARGINATVAIVARADA
KIPLTGEVFQVRRAQIVGSQGHSGHGTFPRVISLMASGMDMTKIISKTVSMEEIPEYIKRLQTDKSLVKVTMLNE
>Q9WYP4 3.7.1.-~~~iolN~~~3-dehydro-scyllo-inosose hydrolase~~~COG1402
MERPTGVYFQTMTMKQIRERLKQCDLIIIPVGSTENHGPNAPTGEDTFLVTRMAEQVALKTGCTVAEPIWYGYHPYHHIG
MPGTVPVKDEAFIDYLVSVIAGFWNTGFRKQILLNGHGQEFVIPIAIHKFAKIFQVPAIIINLNWYHAIQDKFKTKEEGG
PYETPFIHADEVETSWSLALFPEFMHQEWAVDTEPKGFLPEGHIDKAGNLLHRPIAWYGHVGGGPIEVVAYPEGVVGKAT
LASAEKAKEGVEALLDYLEKLVRDIMERFPAGKLPPAEMLSQRPKEELEALTKEPLTEGWRNLYTAGNLWG
>Q9WYP7 5.1.3.-~~~iolO~~~5-keto-L-gluconate epimerase~~~COG1082
MKLSLVISTSDAAFDALAFKGDLRKGMELAKRVGYQAVEIAVRDPSIVDWNEVKILSEELNLPICAIGTGQAYLADGLSL
THPNDEIRKKAIERVVKHTEVAGMFGALVIIGLVRGRREGRSYEETEELFIESMKRLLELTEHAKFVIEPLNRYETDFIN
TIDDALRILRKINSNRVGILADTFHMNIEEVNIPESLKRAGEKLYHFHVADSNRWAPGCGHFDFRSVFNTLKEIGYNRYV
SVECLPLPGGMEEAAEIAFKTLKELIIKLT
>P46336 1.1.1.-~~~iolS~~~Aldo-keto reductase IolS~~~COG0667
MKKAKLGKSDLQVFPIGLGTNAVGGHNLYPNLNEETGKELVREAIRNGVTMLDTAYIYGIGRSEELIGEVLREFNREDVV
IATKAAHRKQGNDFVFDNSPDFLKKSVDESLKRLNTDYIDLFYIHFPDEHTPKDEAVNALNEMKKAGKIRSIGVSNFSLE
QLKEANKDGLVDVLQGEYNLLNREAEKTFFPYTKEHNISFIPYFPLVSGLLAGKYTEDTTFPEGDLRNEQEHFKGERFKE
NIRKVNKLAPIAEKHNVDIPHIVLAWYLARPEIDILIPGAKRADQLIDNIKTADVTLSQEDISFIDKLFA
>O34718 ~~~iolT~~~Major myo-inositol transporter IolT~~~COG2814
MNKQGNQMSFLRTIILVSTFGGLLFGYDTGVLNGALPYMGEPDQLNLNAFTEGLVTSSLLFGAALGAVFGGRMSDFNGRR
KNILFLAVIFFISTIGCTFAPNVTVMIISRFVLGIAVGGASVTVPAYLAEMSPVESRGRMVTQNELMIVSGQLLAFVFNA
ILGTTMGDNSHVWRFMLVIASLPALFLFFGMIRMPESPRWLVSKGRKEDALRVLKKIRDEKRAAAELQEIEFAFKKEDQL
EKATFKDLSVPWVRRIVFIGLGIAIVQQITGVNSIMYYGTEILRNSGFQTEAALIGNIANGVISVLATFVGIWLLGRVGR
RPMLMTGLIGTTTALLLIGIFSLVLEGSPALPYVVLSLTVTFLAFQQGAISPVTWLMLSEIFPLRLRGLGMGVTVFCLWM
VNFAVSFTFPILLAAIGLSTTFFIFVGLGICSVLFVKRFLPETKGLSLEQLEENFRAYDHSGAKKDSGAEVIG
>O05265 1.1.1.371~~~iolU~~~scyllo-inositol 2-dehydrogenase (NADP(+)) IolU~~~COG0673
MIRFAIIGTNWITDRFLESAADIEDFQLTAVYSRSAERAGEFAAKHNAAHAFSDLQEMAASDCFDAVYIASPNALHKEQA
VLFMNHGKHVLCEKPFASNTKETEEMISAAKANGVVLMEAMKTTFLPNFKELKKHLHKIGTVRRFTASYCQYSSRYDAFR
SGTVLNAFQPELSNGSLMDIGVYCIYPAVVLFGAPKDVKANGYALSSGVDGEGTVILSYDGFEAVLMHSKISTSYAPAEI
QGEDGTIVIDTIHRPERVEIRYRDGRLENIAIPDPKPAMFYEAEEFVTLIKENKLESEENTFERSLTTAKIMEEARKQMG
IVYPADQA
>O32223 1.1.1.371~~~iolW~~~scyllo-inositol 2-dehydrogenase (NADP(+)) IolW~~~COG0673
MITLLKGRRKVDTIKVGILGYGLSGSVFHGPLLDVLDEYQISKIMTSRTEEVKRDFPDAEVVHELEEITNDPAIELVIVT
TPSGLHYEHTMACIQAGKHVVMEKPMTATAEEGETLKRAADEKGVLLSVYHNRRWDNDFLTIKKLISEGSLEDINTYQVS
YNRYRPEVQARWREKEGTATGTLYDLGSHIIDQTLHLFGMPKAVTANVMAQRENAETVDYFHLTLDYGKLQAILYGGSIV
PANGPRYQIHGKDSSFIKYGIDGQEDALRAGRKPEDDSWGADVPEFYGKLTTIRGSDKKTETIPSVNGSYLTYYRKIAES
IREGAALPVTAEEGINVIRIIEAAMESSKEKRTIMLEH
>P40332 1.1.1.370~~~iolX~~~scyllo-inositol 2-dehydrogenase (NAD(+))~~~COG0673
MEHQVRCAVLGLGRLGYYHAKNLVTSVPGAKLVCVGDPLKGRAEQVARELGIEKWSEDPYEVLEDPGIDAVIIVTPTSTH
GDMIIKAAENGKQIFVEKPLTLSLEESKAASEKVKETGVICQVGFMRRFDPAYADAKRRIDAGEIGKPIYYKGFTRDQGA
PPAEFIKHSGGIFIDCSIHDYDIARYLLGAEITSVSGHGRILNNPFMEQYGDVDQALTYIEFDSGAAGDVEASRTSPYGH
DIRAEVIGTEGSIFIGTLRHQHVTILSAKGSSFDIIPDFQTRFHEAYCLELQHFAECVRNGKTPIVTDIDATINLEVGIA
ATNSFRNGMPVQLDVKRAYTGM
>Q51697 1.3.99.16~~~iorA~~~Isoquinoline 1-oxidoreductase subunit alpha~~~
MIEFILNGQPVRVTEVPEDAPLLWVVREHLKLSGTKFGCGLGLCGACTVHINGEAARSCITPLSVVARQSVTTIEGLDPQ
HAHPLQRAWIAEQVPQCGYCQSGQIMQAAALLKKVPKPSDAQIVEAMDGNLCRCGTYQRIKIAIHRAAKEAA
>Q51698 1.3.99.16~~~iorB~~~Isoquinoline 1-oxidoreductase subunit beta~~~
MKTVLPSVPETVRLSRRGFLVQAGTITCSVAFGSVPAAAGDTAESTPSIAAVSPNVWVRVHADGIVDIVCPAVELGQGAH
TALPRFVAEELDADWDRVRVQQAGASDKVYGNPLAWGTQFTAASRTTVGYFDVLRVAGAQARFVLVQTAARRWSVPADQL
ETQKGVVLHRRSRRSATYGELVASVQVPESFPHFFARNEATQPADDYFGAAPPSVVAQAAGPASGAIALKHRSTYRLIGK
DAPRKDIPPKVNGQACYGMDVQVPGMLYAMVETGPVAGMAPERVDDGAARQVPGIHHVLSLPHGVAVVGRDIFAVRAARA
RLLVNWKANPDKQSYDSGQVLDEFSDLCRNGIERNAVQAWKQGELSSIDAVFARPDVRIESFEMQSDLVYQAPMEPQSAV
IQPHADGSAEAWVGTQWPTVEQGFAAGILGIAPDKLTMHLPLVGGGFGRRLEPGALVDAAHIVRAIGKTVKVIWSREDDL
KRNPFRQALACRVEAAVLEKDQRILALRHTVAADSWLARLFPQYFNAYQQTDPGNWIGGMVAYDVPLQRIDALTPRRSVD
VCYMRGIGVAQVKFAQESLVDQIARRLNADPVDFRLAHLNTSPRGAAVVRTVAEMSDWKRRSADAGGGMALGLAYTPYSN
AHVALVSEVHFNRSENTLSVSRVWCAVDVGMVAQPDIVKAQMEGGIIQGLSVALMERVQVAKGVLQHSNFHDYPMLRMSQ
VPQIHVRLVETDQAMAGVAELGLLQIGPAINNAFARITGQHLRSLPMRPALAQMKRSGPTA
>A0A0H2USG1 2.3.2.27~~~~~~E3 ubiquitin-protein ligase IpaH1.4~~~
MIKSTNIQAIGSGIMHQINNVYSLTPLSLPMELTPSCNEFYLKTWSEWEKNGTPGEQRNIAFNRLKICLQNQEAELNLSE
LDLKTLPDLPPQITTLEIRKNLLTHLPDLPPMLKVIHAQFNQLESLPALPETLEELNAGDNKIKELPFLPENLTHLRVHN
NRLHILPLLPPELKLLVVSGNRLDSIPPFPDKLEGLALANNFIEQLPELPFSMNRAVLMNNNLTTLPESVLRLAQNAFVN
VAGNPLSGHTMRTLQQITTGPDYSGPQIFFSMGNSATISAPEHSLADAVTAWFPENKQSDVSQIWHAFEHEEHANTFSAF
LDRLSDTVSARNTSGFREQVAAWLEKLSASAELRQQSFAVAADATESCEDRVALTWNNLRKTLLVHQASEGLFDNDTGAL
LSLGREMFRLEILEDIARDKVRTLHFVDEIEVYLAFQTMLAEKLQLSTAVKEMRFYGVSGVTANDLRTAEAMVRSREENE
FTDWFSLWGPWHAVLKRTEADRWAQAEEQKYEMLENEYSQRVADRLKASGLSGDADAEREAGAQVMRETEQQIYRQLTDE
VLALRLSENGSNHIA
>A0A0H2USC0 2.3.2.27~~~~~~E3 ubiquitin-protein ligase IpaH2.5~~~
MIKSTNIQVIGSGIMHQINNIHSLTLFSLPVSLSPSCNEYYLKVWSEWERNGTPGEQRNIAFNRLKICLQNQEAELNLSE
LDLKTLPDLPPQITTLEIRKNLLTHLPDLPPMLKVIHAQFNQLESLPALPETLEELNAGDNKIKELPFLPENLTHLRVHN
NRLHILPLLPPELKLLVVSGNRLDSIPPFPDKLEGLALANNFIEQLPELPFSMNRAVLMNNNLTTLPESVLRLAQNAFVN
VAGNPLSGHTMRTLQQITTGPDYSGPQIFFSMGNSATISAPEHSLADAVTAWFPENKQSDVSQIWHAFEHEEHANTFSAF
LDRLSDTVSARNTSGFREQVAAWLEKLSASAELRQQSFAVAADATESCEDRVALTWNNLRKTLLVHQASEGLFDNDTGAL
LSLGREMFRLEILEDIARDKVRTLHFVDEIEVYLAFQTMLAEKLQLSTAVKEMRFYGVSGVTANDLRTAEAMVRSREENE
FTDWFSLWGPWHAVLKRTEADRWAQAEEQKYEMLENEYSQRVADRLKASGLSGDADAEREAGAQVMRETEQQIYRQLTDE
VLA
>Q83RJ4 2.3.2.27~~~ipaH3~~~E3 ubiquitin-protein ligase ipaH3~~~
MSIMLPINNNFSLSQNSFYNTISGTYADYFSAWDKWEKQALPGENRNEAVSLLKECLINQFSELQLNRLNLSSLPDNLPP
QITVLEITQNALISLPELPASLEYLDACDNRLSTLPELPASLKHLDVDNNQLTMLPELPALLEYINADNNQLTMLPELPT
SLEVLSVRNNQLTFLPELPESLEALDVSTNLLESLPAVPVRNHHSEETEIFFRCRENRITHIPENILSLDPTCTIILEDN
PLSSRIRESLSQQTAQPDYHGPRIYFSMSDGQQNTLHRPLADAVTAWFPENKQSDVSQIWHAFEHEEHANTFSAFLDRLS
DTVSARNTSGFREQVAAWLEKLSTSAELRQQSFAVAADATESCEDRVALTWNNLRKTLLVHQASEGLFDNDTGALLSLGR
EMFRLEILEDIARDKVRTLHFVDEIEVYLAFQTMLAEKLQLSTAVKEMRFYGVSGVTANDLRTAEAMVRSREENEFTDWF
SLWGPWHAVLKRTEADRWAQAEEQKYEMLENEYSQRVADRLKASGLSGDADAEREAGAQVMRETEQQIYRQVTDEVLALR
LSENGSQLHHS
>P18009 2.3.2.27~~~~~~Probable E3 ubiquitin-protein ligase ipaH4.5~~~
MKPINNHSFFRSLCGLSCISRLSVEEQCTRDYHRIWDDWAREGTTTENRIQAVRLLKICLDTREPVLNLSLLKLRSLPPL
PLHIRELNISNNELISLPENSPLLTELHVNGNNLNILPTLPSQLIKLNISFNRNLSCLPSLPPYLQSLSARFNSLETLPE
LPSTLTILRIEGNRLTVLPELPHRLQELFVSGNRLQELPEFPQSLKYLKVGENQLRRLSRLPQELLALDVSNNLLTSLPE
NIITLPICTNVNISGNPLSTHVLQSLQRLTSSPDYHGPQIYFSMSDGQQNTLHRPLADAVTAWFPENKQSDVSQIWHAFE
HEEHANTFSAFLDRLSDTVSARNTSGFREQVAAWLEKLSASAELRQQSFAVAADATESCEDRVALTWNNLRKTLLVHQAS
EGLFDNDTGALLSLGREMFRLEILEDIARDKVRTLHFVDEIEVYLAFQTMLAEKLQLSTAVKEMRFYGVSGVTANDLRTA
EAMVRSREENEFTDWFSLWGPWHAVLKRTEADRWAQAEEQKYEMLENEYSQRVADRLKASGLSGDADAEREAGAQVMRET
EQQIYRQLTDEVLA
>P18014 2.3.2.27~~~~~~Probable E3 ubiquitin-protein ligase ipaH7.8~~~
MFSVNNTHSSVSCSPSINSNSTSNEHYLRILTEWEKNSSPGEERGIAFNRLSQCFQNQEAVLNLSDLNLTSLPELPKHIS
ALIVENNKLTSLPKLPAFLKELNADNNRLSVIPELPESLTTLSVRSNQLENLPVLPNHLTSLFVENNRLYNLPALPEKLK
FLHVYYNRLTTLPDLPDKLEILCAQRNNLVTFPQFSDRNNIRQKEYYFHFNQITTLPESFSQLDSSYRINISGNPLSTRV
LQSLQRLTSSPDYHGPQIYFSMSDGQQNTLHRPLADAVTAWFPENKQSDVSQIWHAFEHEEHANTFSAFLDRLSDTVSAR
NTSGFREQVAAWLEKLSASAELRQQSFAVAADATESCEDRVALTWNNLRKTLLVHQASEGLFDNDTGALLSLGREMFRLE
ILEDIARDKVRTLHFVDEIEVYLAFQTMLAEKLQLSTAVKEMRFYGVSGVTANDLRTAEAMVRSREENEFTDWFSLWGPW
HAVLKRTEADRWAQAEEQKYEMLENEYSQRVADRLKASGLSGDADAEREAGAQVMRETEQQIYRQLTDEVLALRLSENGS
RLHHS
>Q8VSC3 2.3.2.27~~~~~~E3 ubiquitin-protein ligase ipaH9.8~~~
MLPINNNFSLPQNSFYNTISGTYADYFSAWDKWEKQALPGEERDEAVSRLKECLINNSDELRLDRLNLSSLPDNLPAQIT
LLNVSYNQLTNLPELPVTLKKLYSASNKLSELPVLPPALESLQVQHNELENLPALPDSLLTMNISYNEIVSLPSLPQALK
NLRATRNFLTELPAFSEGNNPVVREYFFDRNQISHIPESILNLRNECSIHISDNPLSSHALPALQRLTSSPDYHGPRIYF
SMSDGQQNTLHRPLADAVTAWFPENKQSDVSQIWHAFEHEEHANTFSAFLDRLSDTVSARNTSGFREQVAAWLEKLSASA
ELRQQSFAVAADATESCEDRVALTWNNLRKTLLVHQASEGLFDNDTGALLSLGREMFRLEILEDIARDKVRTLHFVDEIE
VYLAFQTMLAEKLQLSTAVKEMRFYGVSGVTANDLRTAEAMVRSREENEFTDWFSLWGPWHAVLKRTEADRWAQAEEQKY
EMLENEYPQRVADRLKASGLSGDADAEREAGAQVMRETEQQIYRQLTDEVLALRLPENGSQLHHS
>P18010 ~~~ipaA~~~Invasin IpaA~~~
MHNVNNTQAPTFLYKATSPSSTEYSELKSKISDIHSSQTSLKTPASVSEKENFATSFNQKCLDFLFSSSGKEDVLRSIYS
NSMNAYAKSEILEFSNVLYSLVHQNGLNFENEKGLQKIVAQYSELIIKDKLSQDSAFGPWSAKNKKLHQLRQNIEHRLAL
LAQQHTSGEALSLGQKLLNTEVSSFIKNNILAELKLSNETVSSLKLDDLVDAQAKLAFDSLRNQRKNTIDSKGFGIGKLS
RDLNTVAVFPELLRKVLNDILEDIKDSHPIQDGLPTPPEDMPDGGPTPGANEKTSQPVIHYHINNDNRTYDNRVFDNRVY
DNSYHENPENDAQSPTSQTNDLLSRNGNSLLNPQRALVQKVTSVLPHSISDTVQTFANNSALEKVFNHTPDNSDGIGSDL
LTTSSQERSANNSLSRGHRPLNIQNSSTTPPLHPEGVTSSNDNSSDTTKSSASLSHRVASQINKFNSNTDSKVLQTDFLS
RNGDTYLTRETIFEASKKVTNSLSNLISLIGTKSGTQERELQEKSKDITKSTTEHRINNKLKVTDANIRNYVTETNADTI
DKNHAIYEKAKEVSSALSKVLSKIDDTSAELLTDDISDLKNNNDITAENNNIYKAAKDVTTSLSKVLKNINKD
>P18013 ~~~ipaD~~~Invasin IpaD~~~
MNITTLTNSISTSSFSPNNTNGSSTETVNSDIKTTTSSHPVSSLTMLNDTLHNIRTTNQALKKELSQKTLTKTSLEEIAL
HSSQISMDVNKSAQLLDILSRNEYPINKDARELLHSAPKEAELDGDQMISHRELWAKIANSINDINEQYLKVYEHAVSSY
TQMYQDFSAVLSSLAGWISPGGNDGNSVKLQVNSLKKALEELKEKYKDKPLYPANNTVSQEQANKWLTELGGTIGKVSQK
NGGYVVSINMTPIDNMLKSLDNLGGNGEVVLDNAKYQAWNAGFSAEDETMKNNLQTLVQKYSNANSIFDNLVKVLSSTIS
SCTDTDKLFLHF
>Q54150 3.4.22.-~~~ipaJ~~~Cysteine protease IpaJ~~~
MSEQRKPCKRGCIHTGVMLYGVLLQGAIPREYMISHQTDVRVNENRVNEQGCFLARKQMYDNSCGAASLLCAAKELGVDK
IPQYKGSMSEMTRKSSLDLDNRCERDLYLITSGNYNPRIHKDNIADAGYSMPDKIVMATRLLGLNAYVVEESNIFSQVIS
FIYPDARDLLIGMGCNIVHQRDVLSSNQRVLEAVAVSFIGVPVGLHWVLCRPDGSYMDPAVGENYSCFSTMELGARRSNS
NFIGYTKIGISIVITNEAL
>P9WPW1 4.1.99.-~~~ipdA~~~Cholesterol ring-cleaving hydrolase IpdA subunit~~~COG1788
MPDKRTALDDAVAQLRSGMTIGIAGWGSRRKPMAFVRAILRSDVTDLTVVTYGGPDLGLLCSAGKVKRVYYGFVSLDSPP
FYDPWFAHARTSGAIEAREMDEGMLRCGLQAAAQRLPFLPIRAGLGSSVPQFWAGELQTVTSPYPAPGGGYETLIAMPAL
RLDAAFAHLNLGDSHGNAAYTGIDPYFDDLFLMAAERRFLSVERIVATEELVKSVPPQALLVNRMMVDAIVEAPGGAHFT
TAAPDYGRDEQFQRHYAEAASTQVGWQQFVHTYLSGTEADYQAAVHNFGASR
>Q0S7P9 4.1.99.-~~~ipdA~~~Cholesterol ring-cleaving hydrolase IpdA subunit~~~COG1788
MVSKRDKRISLDDAVGELRSGMTIGIGGWGSRRKPMALVRALLRSDVTDLTVVTYGGPDLGLLCSAGKVTKAYYGFVSLD
SAPFYDPWFAKARTAGEIAVREMDEGMVKCGLEAAAARLPFLPIRAGLGSDVRRFWGDELRTVTSPYPDASGKSETLIAM
PALNLDAALVHLNLGDKHGNAAYTGVDPYFDDLYCAAAEKRFVSVERVVETEELVKTVPLQNLILNRMMVDGVVEAPNGA
HFTLAGDSYGRDEKFQRHYAESAKTPQAWQQFVATYLSGSEDDYQAAVKKFAEEQA
>P9WPV9 4.1.99.-~~~ipdB~~~Cholesterol ring-cleaving hydrolase IpdB subunit~~~COG2057
MSTRAEVCAVACAELFRDAGEIMISPMTNMASVGARLARLTFAPDILLTDGEAQLLADTPALGKTGAPNRIEGWMPFGRV
FETLAWGRRHVVMGANQVDRYGNQNISAFGPLQRPTRQMFGVRGSPGNTINHATSYWVGNHCKRVFVEAVDVVSGIGYDK
VDPDNPAFRFVNVYRVVSNLGVFDFGGPDHSMRAVSLHPGVTPGDVRDATSFEVHDLDAAEQTRLPTDDELHLIRAVIDP
KSLRDREIRS
>Q0S7Q0 4.1.99.-~~~ipdB~~~Cholesterol ring-cleaving hydrolase IpdB subunit~~~COG2057
MSETITEVTRAEYCAIACADIFSGAGEIMASPMATLPLIGARLARLTTEPDLLITDGEALIFADTPAVGAKAPIEGWMPF
RKVFDVVASGRRHVVMGANQIDRHGNQNLSAFGPLQQPTRQMFGVRGAPGNTINHPTSYWVGKHTSRVFCDTVDIVSGVG
YDQIDPENPAYRFHHLHRVVSNLGVFDFGGPDHTFRALSLHPGVTADQVADNTSFEVAGLADAGVTREPTDEELRLIREV
LDPRSLRDREVSV
>P71847 1.3.1.-~~~ipdC~~~(3aS,4S,5R,7aS)-5-hydroxy-7a-methyl-1-oxo-octahydro-1H-indene-4-carboxyl-CoA dehydrogenase~~~COG2070
MRLRTPLTELIGIEHPVVQTGMGWVAGARLVSATANAGGLGILASATMTLDELAAAITKVKAVTDKPFGVNIRADAADAG
DRVELMIREGVRVASFALAPKQQLIARLKEAGAVVIPSIGAAKHARKVAAWGADAMIVQGGEGGGHTGPVATTLLLPSVL
DAVAGTGIPVIAAGGFFDGRGLAAALCYGAAGVAMGTRFLLTSDSTVPDAVKRRYLQAGLDGTVVTTRVDGMPHRVLRTE
LVEKLESGSRARGFAAALRNAGKFRRMSQMTWRSMIRDGLTMRHGKELTWSQVLMAANTPMLLKAGLVDGNTEAGVLASG
QVAGILDDLPSCKELIESIVLDAITHLQTASALVE
>I6Y3V5 1.3.99.-~~~ipdE1~~~Acyl-CoA dehydrogenase IpdE1~~~COG1960
MQDVEEFRAQVRGWLADNLAGEFAALKGLGGPGREHEAFEERRAWNQRLAAAGLTCLGWPEEHGGRGLSTAHRVAFYEEY
ARADAPDKVNHFGEELLGPTLIAFGTPQQQRRFLPRIRDVTELWCQGYSEPGAGSDLASVATTAELDGDQWVINGQKVWT
SLAHLSQWCFVLARTEKGSQRHAGLSYLLVPLDQPGVQIRPIVQITGTAEFNEVFFDDARTDADLVVGAPGDGWRVAMAT
LTFERGVSTLGQQIVYARELSNLVELARRTAAADDPLIRERLTRAWTGLRAMRSYALATMEGPAVEQPGQDNVSKLLWAN
WHRNLGELAMDVIGKPGMTMPDGEFDEWQRLYLFTRADTIYGGSNEIQRNIIAERVLGLPREAKG
>I6YCF5 1.3.99.-~~~ipdE2~~~Acyl-CoA dehydrogenase IpdE2~~~COG1960
MTPPEERQMLRETVASLVAKHAGPAAVRAAMASDRGYDESLWRLLCEQVGAAALVIPEELGGAGGELADAAIVVQELGRA
LVPSPLLGTTLAELALLAAAKPDAQALTELAQGSAIGALVLDPDYVVNGDIADIVVAATSGQLTRWTRFSAQPVATMDPT
RRLARLQSEETEPLCPDPGIADTAAILLAAEQIGAAERCLQLTVEYAKSRVQFGRPIGSFQALKHRMADLYVTIAAARAV
VADACHAPTPTNAATARLAASEALSTAAAEGIQLHGGIAITWEHDMHLYFKRAHGSAQLLESPREVLRRLESEVWESP
>I6YCF0 1.1.1.-~~~ipdF~~~(5R,7aS)-5-hydroxy-7a-methyl-1-oxo-2,3,5,6,7,7a-hexahydro-1H-indene-carboxyl-CoA reductase~~~COG1028
MNLSVAPKEIAGHGLLDGKVVVVTAAAGTGIGSATARRALAEGADVVISDHHERRLGETAAELSALGLGRVEHVVCDVTS
TAQVDALIDSTTARMGRLDVLVNNAGLGGQTPVADMTDDEWDRVLDVSLTSVFRATRAALRYFRDAPHGGVIVNNASVLG
WRAQHSQSHYAAAKAGVMALTRCSAIEAAEYGVRINAVSPSIARHKFLDKTASAELLDRLAAGEAFGRAAEPWEVAATIA
FLASDYSSYLTGEVISVSCQHP
>P33547 ~~~ipgA~~~Chaperone protein IpgA~~~
MCRKLYDKLYEITGAKLDFNDKNQAFILLEEQIPVCITDNDEYIFLTGLLNEHELFTENIINPEHILILNYSLSRDYGSS
ICLLPDTHQCVLTKKHYKKYLSPDELIESLYEFLFCIKLTIANITSEVN
>P0A2U4 ~~~ipgC~~~Chaperone protein IpgC~~~
MSLNITENESISTAVIDAINSGATLKDINAIPDDMMDDIYSYAYDFYNKGRIEEAEVFFRFLCIYDFYNVDYIMGLAAIY
QIKEQFQQAADLYAVAFALGKNDYTPVFHTGQCQLRLKAPLKAKECFELVIQHSNDEKLKIKAQSYLDAIQDIKE
>Q07566 3.1.3.78~~~ipgD~~~Inositol phosphate phosphatase IpgD~~~
MHITNLGLHQVSFQSGDSYKGAEETGKHKGVSVISYQRVKNGERNKGIEALNRLYLQNQTSLTGKSLLFARDKAEVFCEA
IKLAGGDTSKIKAMMERLDTYKLGEVNKRHINELNKVISEEIRAQLGIKNKKELQTKIKQIFTDYLNNKNWGPVNKNISH
HGKNYSFQLTPASHMKIGNKNIFVKEYNGKGICCASTRERDHIANMWLSKVVDDEGKEIFSGIRHGVISAYGLKKNSSER
AVAARNKAEELVSAALYSRPELLSQALSGKTVDLKIVSTSLLTPTSLTGGEESMLKDQVSALKGLNSKRGGPTKLLIRNS
DGLLKEVSVNLKVVTFNFGVNELALKMGLGWRNVDKLNDESICSLLGDNFLKNGVIGGWAAEAIEKNPPCKNDVIYLANQ
IKEIVNNKLQKNDNGEPYKLSQRVTLLAYTIGAVPCWNCKSGKDRTGMQDAEIKREIIRKHETGQFSQLNSKLSSEEKRL
FSTILMNSGNMEIQEMNTGVPGNKVMKKLPLSSLELSYSERIGDPKIWNMVKGYSSFV
>Q6XVY3 ~~~ipgE~~~Chaperone protein IpgE~~~
MEDLADVICRALGIPLIDIDDQAIMLDDDVLIYIEKEGDSINLLCPFCALPENINDLIYALSLNYSEKICLATDDEGGNL
IARLDLTGINEFEDVYVNTEYYISRVRWLKDEFARRMKGY
>Q05918 3.1.3.48~~~iphP~~~Tyrosine-protein phosphatase~~~
MKTHHANLALALMLGLSSSATAVAADAPQAVATKAAAPNVKPVAADAHGVIPDGAPGMCARSPACRATAIPADAFVRTAD
LGRLTDADRDALAALGVKLDIDLRTADEEAQSPDLLARDDRFDYQRISLMGTEKMDLQKMMTSFPDSLGEAYVQWLGHSQ
PQFKQVFQRIAAQQDGAVLFHCTAGKDRTGIIAGLLLDLAGVPKAEIVHNYAISAHYLEGQPKDSDERADHGAGQAEPGD
RPQDGGHGRYRAGQHGAVLAALHSQYGGAEGYLKSIGVSEQEIQQLKVRLGQAG
>P39804 ~~~ipi~~~Intracellular proteinase inhibitor~~~
MENQEVVLSIDAIQEPEQIKFNMSLKNQSERAIEFQFSTGQKFELVVYDSEHKERYRYSKEKMFTQAFQNLTLESGETYD
FSDVWKEVPEPGTYEVKVTFKGRAENLKQVQAVQQFEVK
>Q8NMF0 ~~~ipsA~~~HTH-type transcriptional regulator IpsA~~~COG1609
MGRKQQYGTLASIAAKLGISRTTVSNAYNRPEQLSAELRQRILDTAEDMGYLGPDPVARSLRTRRAGAIGVLLTEDLTYA
FEDMASVDFLAGVAQAAGDTQLTLIPASPASSVDHVSAQQLVNNAAVDGVVIYSVAKGDPHIDAIRARGLPAVIADQPAR
EEGMPFIAPNNRKAIAPAAQALIDAGHRKIGILSIRLDRANNDGEVTRERLENAQYQVQRDRVRGAMEVFIEAGIDPGTV
PIMECWINNRQHNFEVAKELLETHPDLTAVLCTVDALAFGVLEYLKSVGKSAPADLSLTGFDGTHMALARDLTTVIQPNK
LKGFKAGETLLKMIDKEYVEPEVELETSFHPGSTVAPI
>P58758 2.5.1.27~~~tzs~~~Adenylate dimethylallyltransferase~~~
MLLHLIYGPTCSGKTDMAIQIAQETGWPVVALDRVQCCPQIATGSGRPLESELQSTRRIYLDSRPLTEGILDAESAHRRL
IFEVDWRKSEEGLILEGGSISLLNCMAKSPFWRSGFQWHVKRLRLGDSDAFLTRAKQRVAEMFAIREDRPSLLEELAELW
NYPAARPILEDIDGYRCAIRFARKHDLAISQLPNIDAGRHVELIEAIANEYLEHALSQERDFPQWPEDGAGQPVCPVTLT
RIR
>P0A3L6 2.5.1.27~~~izt~~~Adenylate dimethylallyltransferase~~~
MDLRLIFGPTCTGKTSTAVALAQQTGLPVLSLDRVQCCPQLSTGSGRPTVEELKGTSRLYLDDRPLVKGIIAAKQAHERL
MGEVYNYEAHGGLILEGGSISLLKCMAQSSYWSADFRWHIIRHELADEETFMNVAKARVKQMLRPAAGLSIIQELVDLWK
EPRLRPILKEIDGYRYAMLFASQNQITSDMLLQLDADMEDKLIHGIAQEYLIHARRQEQKFPRVNAAAYDGFEGHPFGMY
>P46376 2.5.1.27~~~fas4~~~Adenylate dimethylallyltransferase~~~
MKESTMAQTQARFDRVRWEPGVYAIVGATGIGKSAEASKLALSHSAPIVVADRIQCYSDLLVTSGRAFDAKVEGLNRVWL
DNRTIHQGNFDPDEAFDRLIKVLTSYVDRGEAVVMEGGSISLILRFAQTISNLPFPAVVNVMPIPDRQHYFAQQCARARQ
MLRGDSTGRNLLTELAEAWVLGDQHNFIASVAGLDCVLDWCATHSVTPEELANRDLTTEVLDELAASMGGRYVEHGVLQQ
EIFLRTFGAPGVTAR
>Q936T0 6.3.2.-~~~ipuC~~~Glutamate--isopropylamine ligase~~~
MSEENKKQILKVRDFIEKHNIDTIRLGAVDIDGVWRGKQVGAEYFLNKAALDGTQISNILFGWDVADHLVDGLEFTGWDS
GYPDIALIPDLSTLSLVPWQEKTASVLCDIQHLNGEPLNLSPRNLLRKAIEKAEQLGYKCYAAYEFEFYLLNDSIASISA
DQWRSINPVEKSGHCYSMLHHSSSSDIMGEVRKYMRDAGIVLEATNSEHGPGQYEINIKYDDALKAADDAIFVKNGIKEI
AAKHGMTATFMAKPSAEWSGSSGHVHMSLSDLAGTPVFANPENPGALSEVGYNFLAGMVALAREMSAIYLPNINSYKRTA
GASWAGGNSSWGFDNRTVSHRAITSAGSAARVENRIPGADTNPYLVIAASLLSGLYGIENKLKPKDPILGNAYKVSPELA
RPLAASLEEAAGIFRESEMARVIFPNEFVEHYAQMKVWEIKQSNSFVNNWELARYLDII
>Q936S7 3.4.-.-~~~ipuF~~~Gamma-glutamyl-L-1-hydroxyisopropylamide hydrolase~~~
MEKLRILICDGNTEADRASFKKFVGCAPSKQFESLLKNYNSQIRTEIAFPADPGPLMTLPLGAYDGILITGSNSHIYEAQ
PGNLRQIEFAQKAFASGTPMFGVCWGMQLAVVAAGGEVLPSRVADCSCETPFATGVELTSYGSGHPMHHSRTSGFDVFSF
HSDEVTRLPGGAVVTARNRNFIQAVEIKHGRSTFWGVQYHPELSGWDQAGFLRESARSLVEDGSYETLNHVEHAAQAISM
FKAGAQISEENLVHFEGVDTNSFEFRPLEILNWLDHLVIPTAKRKFGWGGGWLQK
>P19514 3.6.1.1~~~ppa~~~Inorganic pyrophosphatase~~~
MAFENKIVEAFIEIPTGSQNKYEFDKERGIFKLDRVLYSPMFYPAEYGYLQNTLALDGDPLDILVITTNPPFPGCVIDTR
VIGYLNMVDSGEEDAKLIGVPVEDPRFDEVRSIEDLPQHKLKEIAHFFERYKDLQGKRTEIGTWEGPEAAAKLIDECIAR
YNEQK
>O84777 3.6.1.1~~~ppa~~~Inorganic pyrophosphatase~~~
MSKTPLSIAHPWHGPVLTRDDYESLCCYIEITPADSVKFELDKETGILKVDRPQKFSNFCPCLYGLLPKTYCGDLSGEYS
GQQSNRENIKGDGDPLDICVLTEKNITQGNILLQARPIGGIRILDSEEADDKIIAVLEDDLVYGNIEDISECPGTVLDMI
QHYFLTYKATPESLIQAKPAKIEIVGLYGKKEAQKVIRLAHEDYCNLFM
>Q8XIQ9 3.6.1.1~~~~~~Cobalt-dependent inorganic pyrophosphatase~~~
MKDVIYITGHKNPDSDSICAALAYAEFKNKTQDTPAIPVRLGNVSQETQYILDYFGVEAPQFLETVKLKVEDLEMDKIAP
LAPEVSLKMAWNIMRDKNLKSIPVADGNNHLLGMLSTSNITATYMDIWDSNILAKSATSLDNILDTLSAEAQNINEERKV
FPGKVVVAAMQAESLKEFISEGDIAIAGDRAEIQAELIELKVSLLIVTGGHTPSKEIIELAKKNNITVITTPHDSFTASR
LIVQSLPVDYVMTKDNLVAVSTDDLVEDVKVTMSETRYSNYPVIDENNKVVGSIARFHLISTHKKKVIQVDHNERGQSVH
GLEDAEVLEIIDHHRVADIQTGNPIYFRNEPLGSTSTIVAKRFFENGIRPSREAAGLLCGAIISDTLLFKSPTCTPQDVK
MCRKLAEIAGIVPETFAKEMFKAGTSLKGKSIEEIFNADFKPFTIEGVKVGVAQVNTMDIEGFMPLKGEMLDYMNQKAES
MGLEMIMLLLTDIINEGSQILVAGRSPEIAEEAFKVKLEDSTTFLPGVLSRKKQVVPPLTQIITTRVSK
>P0A7A9 3.6.1.1~~~ppa~~~Inorganic pyrophosphatase~~~COG0221
MSLLNVPAGKDLPEDIYVVIEIPANADPIKYEIDKESGALFVDRFMSTAMFYPCNYGYINHTLSLDGDPVDVLVPTPYPL
QPGSVIRCRPVGVLKMTDEAGEDAKLVAVPHSKLSKEYDHIKDVNDLPELLKAQIAHFFEHYKDLEKGKWVKVEGWENAE
AAKAEIVASFERAKNK
>O05724 3.6.1.1~~~ppa~~~Inorganic pyrophosphatase~~~
MAFENKIVEAFIEIPTGSQNKYEFDKERGIFKLDRVLYSPMFYPAEYGYLQNTLALDGDPLDILVITTNPTFPGCVIDTR
VIGYLNMVDSGEEDAKLIGVPVEDPRFDEVRSIEDLPQHKLKEIAHFFERYKDLQGKRTEIGTWEGPEAAAKLIDECIAR
YNEQK
>P56153 3.6.1.1~~~ppa~~~Inorganic pyrophosphatase~~~COG0221
MNLEKLEVSHDADSLCVVIEISKHSNIKYELDKESGALMVDRVLYGAQNYPANYGFVPNTLGSDGDPVDALVLSDVAFQA
GSVVKARLVGVLNMEDESGMDEKLIALPIDKIDPTHSYVKDIDDLSKHTLDKIKHFFETYKDLEPNKWVKVKGFENKESA
IKVLEKAIKAYQG
>O69540 3.6.1.1~~~ppa~~~Inorganic pyrophosphatase~~~COG0221
MQFDVTIEIPKGQRNKYEVDHKTGRVRLDRYLYTPMAYPTDYGFIEDTLGEDGDPLDALVLLPEPLFPGVLVEARPVGMF
RMVDEHGGDDKVLCVPVNDHRWDHIHGIIDVPTFELDAIKHFFVHYKDLEPGKFVKAADWVGRDEAEAEVQRSVERFKAG
GH
>P9WI55 3.6.1.1~~~ppa~~~Inorganic pyrophosphatase~~~COG0221
MQFDVTIEIPKGQRNKYEVDHETGRVRLDRYLYTPMAYPTDYGFIEDTLGDDGDPLDALVLLPQPVFPGVLVAARPVGMF
RMVDEHGGDDKVLCVPAGDPRWDHVQDIGDVPAFELDAIKHFFVHYKDLEPGKFVKAADWVDRAEAEAEVQRSVERFKAG
TH
>Q9K0G4 3.6.1.1~~~ppa~~~Inorganic pyrophosphatase~~~
MADFNQILTPGDVDGGIINVVNEIPAGSNHKIEWNRKLAAFQLDRVEPAIFAKPTNYGFIPQTLDEDGDELDVLLVTEQP
LATGVFLEARVIGVMKFVDDGEVDDKIVCVPADDRNNGNAYKTLSDLPQQLIKQIEFHFNHYKDLKKAGTTKVESWGDAE
EAKKVIKESIERWNKQA
>P80562 3.6.1.1~~~ppa~~~Inorganic pyrophosphatase~~~COG0221
MDLSRIPAQPKPGVINILIEIAGGSQNKYEFDKDLEAFALDRVLYSSVKYPYDYGFVPNTLADDGDPLDGMVIIDEPTFP
GCVIAARPIGFLEMIDGGDRDEKILAVPDKDPRYAHVKSLNDVAPHRLDEIAEFFRSYKNLEKKVTQILGWQDVDQVKAL
VDQSIKAYK
>Q9HWZ6 3.6.1.1~~~ppa~~~Inorganic pyrophosphatase~~~
MSYSKIPAGKDLPNDIYVAIEIPANHAPIKYEIDKDTDCLFVDRFMATPMFYPANYGFIPNTLADDGDPLDVLVVTPYPV
APGSVIRARPVGVLHMTDEAGGDAKLIAVPHDKLSVLYKDVKEYTDLPALLLEQIKHFFENYKDLEKGKWVKVEGWGNAD
AARAEITKAVAAFQK
>P58733 3.6.1.1~~~ppa~~~Inorganic pyrophosphatase~~~
MDLSRIPPQPKAGILNVLIEIPAGSKNKYEFDKDLNAFALDRVLYSSVQYPYDYGFVPITNNLADDGDPLDGMVIMVPPT
FPGVATARPIGMLQMVDGGDRDEKFLCVPAKDPRYTYVKSANDLAGHRLDEIFEFFRSYKNLFKKPTEFFGWKGDVAGLP
LVEECVKNYYKTYCKNDHGK
>Q9ZCW5 3.6.1.1~~~ppa~~~Inorganic pyrophosphatase~~~COG0221
MFIKKIKAKANNNEINVIIEIPMNSGPIKYEFDKESGALFVDRFMQTTMSYPCNYGFIPDTLSNDGDPVDVLVVAHHPVV
PGSVIKCRAIGVLMMEDESGLDEKIIAVPTSKLDITFDHIKELDDLCEMLKKRIVHFFEHYKDLEKGKWVKVTGWGDKVK
AETLIKEGIDRN
>P80507 3.6.1.1~~~ppa~~~Inorganic pyrophosphatase~~~COG0221
MDLSRIPAQPKAGLINVLIEIPAGSKNKYEFDKDMNCFALDRVLYSSVQYPYDYGFIPNTLADDGDPLDGMVIMDQPTFP
GCVITARPIGMLEMIDGGDRDEKILCVPAKDPRYTYVKSINDLAGHRLDEIAEFFRSYKNLEKKVTEILGWKDVDAVLPL
VEECVKNYK
>P38576 3.6.1.1~~~ppa~~~Inorganic pyrophosphatase~~~COG0221
MANLKSLPVGDKAPEVVHMVIEVPRGSGNKYEYDPDLGAIKLDRVLPGAQFYPGDYGFIPSTLAEDGDPLDGLVLSTYPL
LPGVVVEVRVVGLLLMEDEKGGDAKVIGVVAEDQRLDHIQDIGDVPEGVKQEIQHFFETYKALEAKKGKWVKVTGWRDRK
AALEEVRACIARYKG
>P39375 ~~~iraD~~~Anti-adapter protein IraD~~~COG3518
MMRQSLQAVLPEISGNKTSSLRKSVCSDLLTLFNSPHSALPSLLVSGMPEWQVHNPSDKHLQSWYCRQLRSALLFHEPRI
AALQVNLKEAYCHTLAISLEIMLYHDDEPLTFDLVWDNGGWRSATLENVS
>P75987 ~~~iraM~~~Anti-adapter protein IraM~~~
MKWIVIDTVIQPTCGISFSAIWGNMKMIIWYQSTIFLPPGSIFTPVKSGIILKDKEYPITIYHIAPFNKDLWSLLKSSQE
CPPGESKITNKCLHNSCIIKICPYGLK
>Q9F0N8 ~~~iraM~~~Anti-adapter protein IraM~~~
MEWKVVDTVISPSTGVSFSCIHSLKNLRLTLWYQADVYMPPGSIIIPFNKGVLINDKLYPVTVYNVTRFNPVLWKSLKEN
SHCPGNCNPKSEACSYPFECLVSVCPFGLTRNIQIDNKKV
>P0AAN9 ~~~iraP~~~Anti-adapter protein IraP~~~
MKNLIAELLFKLAQKEEESKELCAQVEALEIIVTAMLRNMAQNDQQRLIDQVEGALYEVKPDASIPDDDTELLRDYVKKL
LKHPRQ
>Q7CR46 ~~~iraP~~~Anti-adapter protein IraP~~~
MKNLIAELLLKLAQKEEESKELVAQVEALEIIVTAMLRNMAQNEQEMLIRQVEGALEGVKPDASVPDHDTELLRQYVKKL
LRHPRH
>D9N164 ~~~~~~Inward rectifier potassium channel Kirbac3.1~~~
MTGGMKPPARKPRILNSDGSSNITRLGLEKRGWLDDHYHDLLTVSWPVFITLITGLYLVTNALFALAYLACGDVIENARP
GSFTDAFFFSVQTMATIGYGKLIPIGPLANTLVTLEALCGMLGLAVAASLIYARFTRPTAGVLFSSRMVISDFEGKPTLM
MRLANLRIEQIIEADVHLVLVRSEISQEGMVFRRFHDLTLTRSRSPIFSLSWTVMHPIDHHSPIYGETDETLRNSHSEFL
VLFTGHHEAFAQNVHARHAYSCDEIIWGGHFVDVFTTLPDGRRALDLGKFHEIAQ
>A0A0H2V630 2.4.1.369~~~iroB~~~Enterobactin C-glucosyltransferase~~~COG1819
MRILFVGPPLYGLLYPVLSLAQAFRVNGHEVLIASGGQFAQKAAEAGLVVFDAAPGLDSEAGYRHHEAQRKKSNIGTQMG
NFSFFSEEMADHLVEFAGHWRPDLIIYPPLGVIGPLIAAKYDIPVVMQTVGFGHTPWHIRGVTRSLTDAYRRHNVGATPR
DMAWIDVTPPSMSILENDGEPIIPMQYVPYNGGAVWEPWWERRPDRKRLLVSLGTVKPMVDGLDLIAWVMDSASEVDAEI
ILHISANARSDLRSLPSNVRLVDWIPMGVFLNGADGFIHHGGAGNTLTALHAGIPQIVFGQGADRPVNARVVAERGCGII
PGDVGLSSNMINAFLNNRSLRKASEEVAAEMAAQPCPGEVAKSLITMVQKG
>A0A0H2V660 3.1.1.109~~~iroD~~~Iron(III) salmochelin esterase~~~COG2382
MLNMQQHPSAIASLRNQLAAGHIANLTDFWREAESLNVPLVTPVEGAEDEREVTFLWRARHPLQGVYLRLNRVTDKEHVE
KGMMSALPETDIWTLTLRLPASYCGSYSLLEIPPGTTAETIALSGGRFATLAGKADPLNKMPEINVRGNAKESVLTLDKA
PALSEWNGGFHTGQLLTSMRIIAGKSRQVRLYIPDIDISQPLGLVVLPDGETWFDHLGVCAAIDAAINNRRIVPVAVLGI
DNINEHERTEILGGRSKLIKDIAGHLLPMIRAEQPQRQWADRSRTVLAGQSLGGISALMGARYAPETFGLVLSHSPSMWW
TPERTSRPGLFSETDTSWVSEHLLSAPPQGVRISLCVGSLEGSTVPHVQQLHQRLITAGVESHCAIYTGGHDYAWWRGAL
IDGIGLLQG
>A0A0H2V871 3.1.1.107~~~iroE~~~Apo-salmochelin esterase~~~COG2819
MYAREYRSTRPHKAIFFHLSCLTLICSAQVYAKPDMRPLGPNIADKGSVFYHFSATSFDSVDGTRHYRVWTAVPNTTAPA
SGYPILYMLDGNAVMDRLDDELLKQLSEKTPPVIVAVGYQTNLPFDLNSRAYDYTPAAESRKTDLHSGRFSRKSGGSNNF
RQLLETRIAPKVEQGLNIDRQRRGLWGHSYGGLFVLDSWLSSSYFRSYYSASPSLGRGYDALLSRVTAVEPLQFCTKHLA
IMEGSATQGDNRETHAVGVLSKIHTTLTILKDKGVNAVFWDFPNLGHGPMFNASFRQALLDISGENANYTAGCHELSH
>P50500 1.16.3.-~~~iro~~~Iron oxidase~~~
MSEKDKMITRRDALRNIAVVVGSVATTTMMGVGVADAGSMPKAAVQYQDTPKGKDHCSVCAQFIAPHSCKVVAGNISPNG
WCVAFVPKSA
>P12608 ~~~irpA~~~Iron-regulated protein A~~~COG3487
MIVTGSQVRQGLNTWFVLPLRRTAIGLGCAGVATLFSACGQTQALITNQTIQGFVDQVVVPSYVSVAAGATQLEQALQTY
QQAPTAANLEAARQAWRVARDRWEQTECFAFGPADSEGFDGAMDTWPIDRQGLKTAAAQPVEQREDSRKGFHAIEELLFA
ATEPTLSDRQHLVILATDLTKQAQGLVTRWQQASDQPAYRSVLLSAGSTDSAYPTLNAAGTEIVQGLVDSLSEVASEKIG
GPLETQEPDRFESFVSRNTLSDLRNNWTGAWNVYRGQRSDGVAAGSLQQRLQQQHPVIAQQLDQQFATARQALWAIPEPI
ETNLASPRGKVAVLTAQTAIAAVSDTLERQVLPLVQ
>C1CZ84 3.4.24.-~~~irrE~~~Radiation response metalloprotease IrrE~~~COG2856
MTDPAPPPTALAAAKARMRELAASYGAGLPGRDTHSLMHGLDGITLTFMPMGQRDGAYDPEHHVILINSQVRPERQRFTL
AHEISHALLLGDDDLLSDLHDEYEGDRLEQVIETLCNVGAAALLMPAELIDDLLTRFGPTGRALAELARRADVSATSALY
ALAERTAPPVIYAVCALSRQEDEGEGGGAKELTVRASSASAGVKYSLSAGTPVPDDHPAALALDTRLPLAQDSYVPFRSG
RRMPAYVDAFPERQRVLVSFALPAGRSEPDADKPEAPGDQS
>Q9RXY7 3.4.24.-~~~irrE~~~Radiation response metalloprotease IrrE~~~COG2856
MPSANVSPPCPSGVRGGGMGPKAKAEASKPHPQIPVKLPFVTAPDALAAAKARMRDLAAAYVAALPGRDTHSLMAGVPGV
DLKFMPLGWRDGAFDPEHNVILINSAARPERQRFTLAHEIGHAILLGDDDLLSDIHDAYEGERLEQVIETLCNVAAAAIL
MPEPVIAEMLERFGPTGRALAELAKRAEVSASSALYALTEQTPVPVIYAVCAPGKPPREQAASDEDAGPSTEKVLTVRAS
SSTRGVKYTLASGTPVPADHPAALALATGMEVREESYVPFRSGRKMKAEVDAYPSRGIVAVSFEFDPARLGRKDSEQADR
DEPQDAAQ
>A0R6H8 7.2.2.-~~~irtA~~~Mycobactin import ATP-binding/permease protein IrtA~~~COG1132
MARGIQGVMMRGFGARDHQATVVSTEAITPNLLRLRMVSPTLFEDAVAEPTSWLRFWFPDPAGSKTEFQRAYTMSEMSPE
TGEFAIDVVLHEPAGPASRWARSAKPGDAIAVMTLGSAGFSVPEDPPAGYLLIGDAAATPAINGIIGVVPHDIPIEVYLE
EHDENDRLIPIAEHPRMRVHWVVREDATSLAGAIEARDWSNWYCWVTPEAGSLKHLRTRLRDEFGFPKAELHPQAYWTEG
RAMGTKRGDDDKTPEVNPAPRADKPEAPAPAAAGRGNWRAQAAGRLLAPLKTTLIISGVLQAIITLVQLAPFVLLVELAR
LLLSGASSDRLWTLGVVAISLLGTGSFLAAALTLWLHLVDARFARDLRTGLLTKMSRLPLGWFTARGSGSIKQLVQDDTL
SLHYLITHAIPDAVAAVIAPVAVLVYLFVVDWRLALVMFVPVLIYLVLMTVMTIQSGPKIAQSQRWAERMSAEAGAYLEG
QPVVRVFGGAAASSFRRRLDEYIGFLVAWQKPFTGKKSMMDLVTRPGTFLWLIVAVGTPMITSGAMDPVDILPFLLLGTT
FGVRLLGIAYGLGGIRGGMLAARRIQTTLDETELVIREQTGKRDGEPAVVFDNVTFGYRPDIPVLHDISLQLTPGTVTAL
VGPSGSGKSTLAALLARFHDVDAGAIRLGGRDIRTLTADELYRQVGFVLQDTQLVGGTVAENIALADPDASIERIQDAAR
DAQIHDRIMRLPNGYDTPLGAASSLSGGEKQRLTIARAILADTPVLILDEATAFADPESEYLVQQALNRLTRDRTVLVIA
HRLHTITHADQIVVLEGGRIVETGTHERLLDAAGRYRQLWETGQRPALATAAGPTGEAVR
>G7CBF5 7.2.2.-~~~irtA~~~Mycobactin import ATP-binding/permease protein IrtA~~~COG1132
MARGFQGVMLRGLGARDHQATVVDKEYIAPHFVRVRLVSPTLFDEVIVEPTSWLRFWFPDPDGSDTEFQRAYTITESDPE
TGRFAVDMVLHEPAGPASTWARTVEPGATIAVMSMGSRGFSVPEDPEDRPVGYLLIGDSASTPAINGIIEVVPHDIPIEL
YLEQHHDDDVLIPLAEHPRLRVHRVSRDDASSLAAALELRDWSNWYCWAGPEAGALKQVRTRLRDEFGFPKREVYAQAYW
TEGRAMGSSRGETSTPAKPAAKTAPAKAAAKPAAASGAGTPEHAAAPAAATTGAPQAAPAPGAAQPRTPVRGRWRAEAGS
RLLAPLKKPLIVSGVLQALITLIELAPFVLLVELARLLLGGAEAERLWTLGLTAVSLIGLGAVLAAAMTLWLHRVDARFA
HELRGRLLTKLSRLPLGWFTRRGSASTKQLVQDDTLALHYLITHAIPDAVAAVVAPVAVLVYLFVADWRVALVLFIPVLV
YLVLMSVMTIQSGSKIAQAPRWAERMGGEAGAFLEGQPVIRIFGGAAASRFRRRLDDYIDFLVSWQRPFVGKKTLMDLVT
RPATFLWIILVAGVPLVVTGRMDPVNLLPFLLLGTTFGARLLGIGYGLSGIQTGMLAARRIQTVLDEPELVVRDRTGQAG
TDHASGDQARPGTVELDRVSFEYRPGVPVIRDVTLTLRPGTVTALVGPSGSGKSTLAALVARFHDVTQGAIRVDGRDIRT
LTADELYRRVGFVLQDAQLVHGSVAENIALAEPDAGLERIRTAARDAQIHDRITRMPDGYDSVLGAGSALSGGERQRVTI
ARAILADTPVLVLDEATAFADPESEYLVQQAINRLTRDRTVLVIAHRLHTITHADQIVVLDDGRIVEVGTHDELLAAGGR
YRGLWDSGRYSSPDAGRPVSADAVEVGR
>P9WQJ9 7.2.2.-~~~irtA~~~Mycobactin import ATP-binding/permease protein IrtA~~~COG1132
MARGLQGVMLRSFGARDHTATVIETISIAPHFVRVRMVSPTLFQDAEAEPAAWLRFWFPDPNGSNTEFQRAYTISEADPA
AGRFAVDVVLHDPAGPASSWARTVKPGATIAVMSLMGSSRFDVPEEQPAGYLLIGDSASIPGMNGIIETVPNDVPIEMYL
EQHDDNDTLIPLAKHPRLRVRWVMRRDEKSLAEAIENRDWSDWYAWATPEAAALKCVRVRLRDEFGFPKSEIHAQAYWNA
GRAMGTHRATEPAATEPEVGAAPQPESAVPAPARGSWRAQAASRLLAPLKLPLVLSGVLAALVTLAQLAPFVLLVELSRL
LVSGAGAHRLFTVGFAAVGLLGTGALLAAALTLWLHVIDARFARALRLRLLSKLSRLPLGWFTSRGSGSIKKLVTDDTLA
LHYLVTHAVPDAVAAVVAPVGVLVYLFVVDWRVALVLFGPVLVYLTITSSLTIQSGPRIVQAQRWAEKMNGEAGSYLEGQ
PVIRVFGAASSSFRRRLDEYIGFLVAWQRPLAGKKTLMDLATRPATFLWLIAATGTLLVATHRMDPVNLLPFMFLGTTFG
ARLLGIAYGLGGLRTGLLAARHLQVTLDETELAVREHPREPLDGEAPATVVFDHVTFGYRPGVPVIQDVSLTLRPGTVTA
LVGPSGSGKSTLATLLARFHDVERGAIRVGGQDIRSLAADELYTRVGFVLQEAQLVHGTAAENIALAVPDAPAEQVQVAA
REAQIHDRVLRLPDGYDTVLGANSGLSGGERQRLTIARAILGDTPVLILDEATAFADPESEYLVQQALNRLTRDRTVLVI
AHRLHTITRADQIVVLDHGRIVERGTHEELLAAGGRYCRLWDTGQGSRVAVAAAQDGTR
>A0R6H7 7.2.2.-~~~irtB~~~Mycobactin import ATP-binding/permease protein IrtB~~~COG1132
MIRTLIALVPADKRGTLGLYTVLTVLSVVIRAAGTVLLVPLVAALFGDTPQDAWPWLGWLTAATAAGWIVDTTTSRLGFD
LGFAVLDHTQHDVADRMPNIRLDWLTAENTATARAAIASTGPELVGLVVNLLTPLIGAVLLPAAIAVALVAVSPPLGLAA
LAGVVVLLGAMWASNRLSRKADTVADETNSAFTERIIEFARTQQALRAARRVEPARSLVGDALGAQHGAGVRLLAMQIPG
QLLFSLASQLALILLAGMATWLTVRGELSVPEAVAMIVVVARYLEPFTSLSELTPAIESTRGTLGRIRAVLDAPTLTAGD
AAPADTKSAPRIEFDCVTFGYGDHPVLDDVSFVLEPGSTTAIVGPSGSGKSTILSLIAGLHQPTEGRVLIDGVDAASLDD
ESRRAATSVVFQQPYLFDGSIRDNILVGDPGADEDRLAAAVRLARVDELTARLPNGDASKVGEAGAALSGGERQRVSIAR
ALVKPAPVLLVDEATSALDTENEAAVVDALTADLRHRTRVIVAHRLASIRHADRVLFLDGGRIVEDGTIDGLLAAGGRFD
EFWRRQHEAADWQITH
>G7CBF6 7.2.2.-~~~irtB~~~Mycobactin import ATP-binding/permease protein IrtB~~~COG1132
MIRTLLRLVPAEKRGAVAGYAVLTLLSVLLRAVGAVLLIPLLAALFSDTPSDAWLWLGWLTAVTLAGWVTDTNTARLGFD
LGFAVLSRTQHDMADRLPNVAMSWFTPDNTATARQAIAATGPELAGLVVNLLTPLIGAALLPAAIGVALLFVSVPLGLAA
LAGVAVLFGALALSGRLSRAADKVAGETNSAFTERIIEFARTQQALRAARRVEPARSQVGSALAAQHGAGLRLLTMQIPG
QVLFSLAGQVALIGFAGMAVWLTVRGQLGVPEAIALIVVLVRYLEPFAAIADLAPALETTRATLNRIQAVLDAPTLPAGR
RRLDRTGAAPSIEFDDVRFSYGDEVVLDGVSFTLRPGNTTAIVGPSGSGKTTILSLIAGLQQPASGRVLLDGVDVTTLDP
EARRAAVSVVFQHPYLFDGTLRDNVLVGDPEADPDDVTAAMRLARVDELLDRLPDGDATVVGEGGTALSGGERQRVSIAR
ALLKPAPVLLVDEATSALDNANEAAVVDALTADPRPRTRVIVAHRLASIRHADRVLFVEAGRVVEDGAIDELLAAGGRFA
QFWAQQQAASEWAIGSTAR
>P9WQJ7 7.2.2.-~~~irtB~~~Mycobactin import ATP-binding/permease protein IrtB~~~COG1132
MIRTWIALVPNDHRARLIGFALLAFCSVVARAVGTVLLVPLMAALFGEAPQRAWLWLGWLSAATVAGWVLDAVTARIGIE
LGFAVLNHTQHDVADRLPVVRLDWFTAENTATARQAIAATGPELVGLVVNLVTPLTSAILLPAVIALALLPISWQLGVAA
LAGVPLLLGALWASAAFARRADTAADKANTALTERIIEFARTQQALRAARRVEPARSLVGNALASQHTATMRLLGMQIPG
QLLFSIASQLALIVLAGTTAALTITGTLTVPEAIALIVVMVRYLEPFTAVSELAPALESTRATLGRIGSVLTAPVMVAGS
GTWRDGAVVPRIEFDDVAFGYDGGSGPVLDGVSFCLQPGTTTAIVGPSGCGKSTILALIAGLHQPTRGRVLIDGTDVATL
DARAQQAVCSVVFQHPYLFHGTIRDNVFAADPGASDDQFAQAVRLARVDELIARLPDGANTIVGEAGSALSGGERQRVSI
ARALLKAAPVLLVDEATSALDAENEAAVVDALAADPRSRTRVIVAHRLASIRHADRVLFVDDGRVVEDGSISELLTAGGR
FSQFWRQQHEAAEWQILAE
>Q2FV52 3.2.-.-~~~isaA~~~Probable transglycosylase IsaA~~~COG0741
MKKTIMASSLAVALGVTGYAAGTGHQAHAAEVNVDQAHLVDLAHNHQDQLNAAPIKDGAYDIHFVKDGFQYNFTSNGTTW
SWSYEAANGQTAGFSNVAGADYTTSYNQGSNVQSVSYNAQSSNSNVEAVSAPTYHNYSTSTTSSSVRLSNGNTAGATGSS
AAQIMAQRTGVSASTWAAIIARESNGQVNAYNPSGASGLFQTMPGWGPTNTVDQQINAAVKAYKAQGLGAWGF
>Q5HCY1 3.2.-.-~~~isaA~~~Probable transglycosylase IsaA~~~
MKKTIMASSLAVALGVTGYAAGTGHQAHAAEVNVDQAHLVDLAHNHQDQLNAAPIKDGAYDIHFVKDGFQYNFTSNGTTW
SWSYEAANGQTAGFSNVAGADYTTSYNQGSNVQSVSYNAQSSNSNVEAVSAPTYHNYSTSTTSSSVRLSNGNTAGATGSS
AAQIMAQRTGVSASTWAAIIARESNGQVNAYNPSGASGLFQTMPGWGPTNTVDQQINAAVKAYKAQGLGAWGF
>P65645 3.2.-.-~~~isaA~~~Probable transglycosylase IsaA~~~
MKKTIMASSLAVALGVTGYAAGTGHQAHAAEVNVDQAHLVDLAHNHQDQLNAAPIKDGAYDIHFVKDGFQYNFTSNGTTW
SWSYEAANGQTAGFSNVAGADYTTSYNQGSDVQSVSYNAQSSNSNVEAVSAPTYHNYSTSTTSSSVRLSNGNTAGATGSS
AAQIMAQRTGVSASTWAAIIARESNGQVNAYNPSGASGLFQTMPGWGPTNTVDQQINAAVKAYKAQGLGAWGF
>P99160 3.2.-.-~~~isaA~~~Probable transglycosylase IsaA~~~
MKKTIMASSLAVALGVTGYAAGTGHQAHAAEVNVDQAHLVDLAHNHQDQLNAAPIKDGAYDIHFVKDGFQYNFTSNGTTW
SWSYEAANGQTAGFSNVAGADYTTSYNQGSDVQSVSYNAQSSNSNVEAVSAPTYHNYSTSTTSSSVRLSNGNTAGATGSS
AAQIMAQRTGVSASTWAAIIARESNGQVNAYNPSGASGLFQTMPGWGPTNTVDQQINAAVKAYKAQGLGAWGF
>P60158 3.2.-.-~~~isaA~~~Probable transglycosylase IsaA~~~
MKKTIMASSLAVALGVTGYAAGTGHQAHAAEVNVDQAHLVDLAHNHQDQLNAAPIKDGAYDIHFVKDGFQYNFTSNGTTW
SWSYEAANGQTAGFSNVAGADYTTSYNQGSNVQSVSYNAQSSNSNVEAVSAPTYHNYSTSTTSSSVRLSNGNTAGATGSS
AAQIMAQRTGVSASTWAAIIARESNGQVNAYNPSGASGLFQTMPGWGPTNTVDQQINAAVKAYKAQGLGAWGF
>A7IY64 3.2.-.-~~~isaA~~~Probable transglycosylase IsaA~~~COG1388
MKKTILASSLAVALGVTGYATTADHNQAHASEENIDKAHLADLAQNNPEELNQKPLHAGAYNYNFVLGGNEYTFTSNGQS
WSWNYTAAGAQSATSNSVQDVTTQATTNTNETSASEVSAQKQSSNTPVAAVEAPKASSNTQTSAATRTYKVAQTSAASTG
GSVKAQFLAAGGTEAMWNSIVMPESSGNPNAVNPAGYRGLGQTKESWGSGSVASQTKGMINYGESRYGSMEAAMTFRASH
GWW
>Q2FUX3 ~~~isaB~~~Immunodominant staphylococcal antigen B~~~
MNKTSKVCVAATLALGTLIGVTVVENSAPTSKQAQAAITPYYTYNGYIGNNANFILDKNFINAIKYDNVKFNGIKLAKTN
TIKKVEKYDQTFKGVSAKGNEASQLQFVVKNNISLKDIQKAYGKDLKKENGKTKEADSGIFYYQNAKKTLGIWFVVDHNR
VVEVTVGHTPYKTSK
>Q7A377 ~~~isaB~~~Immunodominant staphylococcal antigen B~~~
MNKTSKVCVAATLALGTLIGVTVVENSAPTSKQAQAAITPYYTYNGYIGNNANFILDKNFINAIKYDNVKFNGIKLAKTN
TIKKVEKYDQTFKGVSAKGNEASQLQFVVKNNISLKDIQKAYGKDLKKENGKTKEADSGIFYYQNAKKTLGIWFVVDHNR
VVEVTVGHTPYKTSK
>Q9LAB5 ~~~isaB~~~Immunodominant staphylococcal antigen B~~~
MNKTSKVCVAATLALGTLIGVTVVENSAPTSKQAQAAITPYYTYNGYIGNNANFILDKNFINAIKYDNVKFNGIKLAKTN
TIKKVEKYDQTFKGVSAKGNEASQLQFVVKNNISLKDIQKAYGKDLKKENGKTKEADSGIFYYQNAKKTLGIWFVVDHNR
VVEVTVGHTPYKTSK
>A0NLY7 3.5.2.20~~~~~~Isatin hydrolase~~~COG1878
MSAQSALSGLGAKLLSGEVEVVDCTGVLGPNTPILQLPPDFAKNTPKVEIHKISEYDSDGPFFAWNWMVLGEHSGTHFDA
PHHWITGKDYSDGFTDTLDVQRLIAPVNVIDCSKESAADPDFLLTADLIKAWEAEHGEIGAGEWVVMRTDWDKRAGDEAA
FLNADETGPHSPGPTPDAIEYLLSKKIVGWGSQCIGTDAGQAGGMEPPFPAHNLLHRDNCFGLASLANLDKLPAKGAILI
AAPLKIERGTGSPIRALALVPKA
>P0AAC8 ~~~iscA~~~Iron-binding protein IscA~~~COG0316
MSITLSDSAAARVNTFLANRGKGFGLRLGVRTSGCSGMAYVLEFVDEPTPEDIVFEDKGVKVVVDGKSLQFLDGTQLDFV
KEGLNEGFKFTNPNVKDECGCGESFHV
>P0AGK8 ~~~iscR~~~HTH-type transcriptional regulator IscR~~~COG1959
MRLTSKGRYAVTAMLDVALNSEAGPVPLADISERQGISLSYLEQLFSRLRKNGLVSSVRGPGGGYLLGKDASSIAVGEVI
SAVDESVDATRCQGKGGCQGGDKCLTHALWRDLSDRLTGFLNNITLGELVNNQEVLDVSGRQHTHDAPRTRTQDAIDVKL
RA
>P9WQ71 2.8.1.7~~~iscS~~~IscS-like cysteine desulfurase~~~COG1104
MAYLDHAATTPMHPAAIEAMAAVQRTIGNASSLHTSGRSARRRIEEARELIADKLGARPSEVIFTAGGTESDNLAVKGIY
WARRDAEPHRRRIVTTEVEHHAVLDSVNWLVEHEGAHVTWLPTAADGSVSATALREALQSHDDVALVSVMWANNEVGTIL
PIAEMSVVAMEFGVPMHSDAIQAVGQLPLDFGASGLSAMSVAGHKFGGPPGVGALLLRRDVTCVPLMHGGGQERDIRSGT
PDVASAVGMATAAQIAVDGLEENSARLRLLRDRLVEGVLAEIDDVCLNGADDPMRLAGNAHFTFRGCEGDALLMLLDANG
IECSTGSACTAGVAQPSHVLIAMGVDAASARGSLRLSLGHTSVEADVDAALEVLPGAVARARRAALAAAGASR
>O31269 2.8.1.7~~~iscS~~~Cysteine desulfurase IscS~~~
MKLPIYLDYSATTPVDPRVAQKMCECLTMEGNFGNPASRSHVFGWKAEEAVENARRQVAELVNADPREIVWTSGATESDN
LAIKGVAHFNASKGKHIITSKIEHKAVLDTTRQLEREGFEVTYLEPGEDGLITPAMVAAALREDTILVSVMHVNNEIGTV
NDIAAIGELTRSRGVLYHVDAAQSTGKVAIDLERMKVDLMSFSAHKTYGPKGIGALYVRRKPRVRLEAQMHGGGHERGMR
SGTLATHQIVGMGEAFRIAREEMAAESRRIAGLSHRFHEQVSTLEEVYLNGSATARVPHNLNLSFNYVEGESLIMSLRDL
AVSSGSACTSASLEPSYVLRALGRNDELAHSSIRFTFGRFTTEEEVDYAARKVCEAVGKLRELSPLWDMYKDGVDLSKIE
WQAH
>P0A6B9 2.8.1.7~~~iscS~~~Cysteine desulfurase IscS~~~COG1104
MKLPIYLDYSATTPVDPRVAEKMMQFMTMDGTFGNPASRSHRFGWQAEEAVDIARNQIADLVGADPREIVFTSGATESDN
LAIKGAANFYQKKGKHIITSKTEHKAVLDTCRQLEREGFEVTYLAPQRNGIIDLKELEAAMRDDTILVSIMHVNNEIGVV
QDIAAIGEMCRARGIIYHVDATQSVGKLPIDLSQLKVDLMSFSGHKIYGPKGIGALYVRRKPRVRIEAQMHGGGHERGMR
SGTLPVHQIVGMGEAYRIAKEEMATEMERLRGLRNRLWNGIKDIEEVYLNGDLEHGAPNILNVSFNYVEGESLIMALKDL
AVSSGSACTSASLEPSYVLRALGLNDELAHSSIRFSLGRFTTEEEIDYTIELVRKSIGRLRDLSPLWEMYKQGVDLNSIE
WAHH
>P0A6B7 2.8.1.7~~~iscS~~~Cysteine desulfurase IscS~~~COG1104
MKLPIYLDYSATTPVDPRVAEKMMQFMTMDGTFGNPASRSHRFGWQAEEAVDIARNQIADLVGADPREIVFTSGATESDN
LAIKGAANFYQKKGKHIITSKTEHKAVLDTCRQLEREGFEVTYLAPQRNGIIDLKELEAAMRDDTILVSIMHVNNEIGVV
QDIAAIGEMCRARGIIYHVDATQSVGKLPIDLSQLKVDLMSFSGHKIYGPKGIGALYVRRKPRVRIEAQMHGGGHERGMR
SGTLPVHQIVGMGEAYRIAKEEMATEMERLRGLRNRLWNGIKDIEEVYLNGDLEHGAPNILNVSFNYVEGESLIMALKDL
AVSSGSACTSASLEPSYVLRALGLNDELAHSSIRFSLGRFTTEEEIDYTIELVRKSIGRLRDLSPLWEMYKQGVDLNSIE
WAHH
>O25008 2.8.1.7~~~iscS~~~Cysteine desulfurase IscS~~~COG1104
MLQRIYLDNNATTRIDPKVKEIMDPFLRDHYGNPSSLHQFGTETHPAIAEALDKLYKGINARDIDDVIITSCATESNNWV
LKGVYFDECLKKGKNHIVTTVAEHPAVRSTCNFLESLGVEVTYLPINEHGSITAEQVKEAITEKTALVSVMWANNETGLI
FPIEEIGAICKEKGVLFHTDAVQAIGKIPVDVLKANADFLSFSAHKFHGPKGIGGLYIRSGVGLTPLFHGGEHMNGRRSG
TLNVPYIVGMGEAMKLAVEHLDYEKEVVGKLRDKLEEALLKIPDVMVVGDRIHRVPNTTLVSVRGIEGEAMLWDLNRSNI
AASTGSACASEDLEANPVMVAIGASKELAHTAIRLSLSRFNTEAEIDKTIEVFSQAAVRLRNISSSY
>O54055 2.8.1.7~~~iscS~~~Cysteine desulfurase IscS~~~
MEKRFIYADNAATTAVSEEVLSAMLPYFRTAYGNASSIYKLGRDAQRDVELAREKVAKALGAEPREIYFTSCGSESDNWA
IKGTAELMAKKGKKHIVTSVFEHHAVLHTCEYLEKHGYEVTYVPVNDKGLIDPEDVRKAVREDTALVTIMYANNEIGTIQ
PIEEIAAVCREKGVLFHTDAVQAVGHVDIDVHKQGIDMLSLSGHKIHAQKGIGAIYIRKGIVLPNLVHGGGQERGKRAGT
ENVPAIVGLGVAIEAAVRNTAEKAAIIIPRRNRIIDELLKIPYTRLNGDREKRLPGNINISFEGIEGESLLLMLDLNGIC
ASSGSACTSGSLDPSHVLLSIGLKHAVAHGSLRLSIEEDVSDEDVEYIIETIPKVVQRLRSMSPVWERMMKGEKYD
>O67045 ~~~iscU~~~Iron-sulfur cluster assembly scaffold protein IscU~~~COG0822
MSFEYNEKVLDHFLNPRNVGVLEDANGVGQCGNPACGDAMLFTIKVNPENDVIEDVRFKTFGCGSAIAVSSMLTEMVKGK
PIQYALNLTYKDIFEELGGLPPQKIHCTNLGLETLHVAIKDYLMKQGRVEEASKIPDCYEEEEEQEESKEFEFLSGT
>O31270 ~~~iscU~~~Iron-sulfur cluster assembly scaffold protein IscU~~~
MAYSDKVIDHYENPRNVGKLDAQDPDVGTGMVGAPACGDVMRLQIKVNEQGIIEDAKFKTYGCGSAIASSSLATEWMKGR
TLEEAETIKNTQIAEELALPPVKIHCSVLAEDAIKAAVRDYKHKKGLV
>P0ACD6 ~~~iscU~~~Iron-sulfur cluster assembly scaffold protein IscU~~~COG0822
MAYSEKVIDHYENPRNVGSFDNNDENVGSGMVGAPACGDVMKLQIKVNDEGIIEDARFKTYGCGSAIASSSLVTEWVKGK
SLDEAQAIKNTDIAEELELPPVKIHCSILAEDAIKAAIADYKSKREAK
>P0ACD4 ~~~iscU~~~Iron-sulfur cluster assembly scaffold protein IscU~~~COG0822
MAYSEKVIDHYENPRNVGSFDNNDENVGSGMVGAPACGDVMKLQIKVNDEGIIEDARFKTYGCGSAIASSSLVTEWVKGK
SLDEAQAIKNTDIAEELELPPVKIHCSILAEDAIKAAIADYKSKREAK
>Q57074 ~~~iscU~~~Iron-sulfur cluster assembly scaffold protein IscU~~~COG0822
MAYSEKVIDHYENPRNVGSLDKKDSNVGTGMVGAPACGDVMQLQIKVDDNGIIEDAKFKTYGCGSAIASSSLITEWVKGK
SLEEAGAIKNSQIAEELELPPVKVHCSILAEDAIKAAIADYKAKQG
>Q9A1G2 ~~~iscU~~~Iron-sulfur cluster assembly scaffold protein IscU~~~
MALSKLNHLYMAVVADHSKRPHHHGQLDGVEAVQLNNPTCGDVISLTVKFDEDKIEDIAFAGNGCTISTASSSMMTDAVI
GKSKEEALALADIFSEMVQGQENPAQKELGEAELLAGVAKFPQRIKCSTLAWNALKEAIKRSANAQHLTDQNVKEGKNV
>P0C0M0 ~~~iscX~~~Protein IscX~~~COG2975
MGLKWTDSREIGEALYDAYPDLDPKTVRFTDMHQWICDLEDFDDDPQASNEKILEAILLVWLDEAE
>P0C0L9 ~~~iscX~~~Protein IscX~~~COG2975
MGLKWTDSREIGEALYDAYPDLDPKTVRFTDMHQWICDLEDFDDDPQASNEKILEAILLVWLDEAE
>Q2FHV1 ~~~isdA~~~Iron-regulated surface determinant protein A~~~
MTKHYLNSKYQSEQRSSAMKKITMGTASIILGSLVYIGADSQQVNAATEATNATNNQSTQVSQATSQPINFQVQKDGSSE
KSHMDDYMQHPGKVIKQNNKYYFQTVLNNASFWKEYKFYNANNQELATTVVNDNKKADTRTINVAVEPGYKSLTTKVHIV
VPQINYNHRYTTHLEFEKAIPTLADAAKPNNVKPVQPKPAQPKTPTEQTKPVQPKVEKVKPTVTTTSKVEDNHSTKVVST
DTTKDQTKTQTAHTVKTAQTAQEQNKVQTPVKDVATAKSESNNQAVSDNKSQQTNKVTKHNETPKQASKAKELPKTGLTS
VDNFISTVAFATLALLGSLSLLLFKRKESK
>Q2FZE9 ~~~isdA~~~Iron-regulated surface determinant protein A~~~COG5386
MTKHYLNSKYQSEQRSSAMKKITMGTASIILGSLVYIGADSQQVNAATEATNATNNQSTQVSQATSQPINFQVQKDGSSE
KSHMDDYMQHPGKVIKQNNKYYFQTVLNNASFWKEYKFYNANNQELATTVVNDNKKADTRTINVAVEPGYKSLTTKVHIV
VPQINYNHRYTTHLEFEKAIPTLADAAKPNNVKPVQPKPAQPKTPTEQTKPVQPKVEKVKPTVTTTSKVEDNHSTKVVST
DTTKDQTKTQTAHTVKTAQTAQEQNKVQTPVKDVATAKSESNNQAVSDNKSQQTNKVTKHNETPKQASKAKELPKTGLTS
VDNFISTVAFATLALLGSLSLLLFKRKESK
>A6QG31 ~~~isdA~~~Iron-regulated surface determinant protein A~~~
MTKHYLNSKYQSEQRSSAMKKITMGTASIILGSLVYIGADSQQVNAATEATNATNNQSTQVSQATSQPINFQVQKDGSSE
KSHMDDYMQHPGKVIKQNNKYYFQTVLNNASFWKEYKFYNANNQELATTVVNDNKKADTRTINVAVEPGYKSLTTKVHIV
VPQINYNHRYTTHLEFEKAIPTLADAAKPNNVKPVQPKPAQPKTPTEQTKPVQPKVEKVKPTVTTTSKVEDNHSTKVVST
DTTKDQTKTQTAHTVKTAQTAQEQNKVQTPVKDVATAKSESNNQAVSDNKSQQTNKVTKHNETPKQASKAKELPKTGLTS
VDNFISTVAFATLALLGSLSLLLFKRKESK
>Q99UX4 ~~~isdA~~~Iron-regulated surface determinant protein A~~~
MTKHYLNSKYQSEQRSSAMKKITMGTASIILGSLVYIGADSQQVNAATEATNATNNQSTQVSQATSQPINFQVQKDGSSE
KSHMDDYMQHPGKVIKQNNKYYFQTVLNNASFWKEYKFYNANNQELATTVVNDNKKADTRTINVAVEPGYKSLTTKVHIV
VPQINYNHRYTTHLEFEKAIPTLADAAKPNNVKPVQPKPAQPKTPTEQTKPVQPKVEKVKPTVTTTSKVEDNHSTKVVST
DTTKDQTKTQTAHTVKTAQTAQEQNKVQTPVKDVATAKSESNNQAVSDNKSQQTNKVTKHNETPKQASKAKELPKTGLTS
VDNFISTVAFATLALLGSLSLLLFKRKESK
>Q7A655 ~~~isdA~~~Iron-regulated surface determinant protein A~~~
MTKHYLNSKYQSEQRSSAMKKITMGTASIILGSLVYIGADSQQVNAATEATNATNNQSTQVSQATSQPINFQVQKDGSSE
KSHMDDYMQHPGKVIKQNNKYYFQTVLNNASFWKEYKFYNANNQELATTVVNDNKKADTRTINVAVEPGYKSLTTKVHIV
VPQINYNHRYTTHLEFEKAIPTLADAAKPNNVKPVQPKPAQPKTPTEQTKPVQPKVEKVKPTVTTTSKVEDNHSTKVVST
DTTKDQTKTQTAHTVKTAQTAQEQNKVQTPVKDVATAKSESNNQAVSDNKSQQTNKVTKHNETPKQASKAKELPKTGLTS
VDNFISTVAFATLALLGSLSLLLFKRKESK
>P0C1S5 ~~~isdA~~~Iron-regulated surface determinant protein A~~~
MTKHYLNSKYQSEQRSSAMKKITMGTASIILGSLVYIGADSQQVNAATEATNATNNQSTQVSQATSQPINFQVQKDGSSE
KSHMDDYMQHPGKVIKQNNKYYFQAVLNNASFWKEYKFYNANNQELATTVVNDDKKADTRTINVAVEPGYKSLTTKVHIV
VPQINYNHRYTTHLEFEKAIPTLADAAKPNNVKPVQPKPAQPKTPTEQTKPVQPKVEKVKPAVTAPSKNENRQTTKVVSS
EATKDQSQTQSARTVKTTQTAQDQNKVQTPVKDVATAKSESNNQAVSDNKSQQTNKVTKQNEVHKQGPSKDSKAKELPKT
GLTSVDNFISTVAFATLALLGSLSLLLFKRKESK
>Q7A152 ~~~isdA~~~Iron-regulated surface determinant protein A~~~
MTKHYLNSKYQSEQRSSAMKKITMGTASIILGSLVYIGADSQQVNAATEATNATNNQSTQVSQATSQPINFQVQKDGSSE
KSHMDDYMQHPGKVIKQNNKYYFQTVLNNASFWKEYKFYNANNQELATTVVNDNKKADTRTINVAVEPGYKSLTTKVHIV
VPQINYNHRYTTHLEFEKAIPTLADAAKPNNVKPVQPKPAQPKTPTEQTKPVQPKVEKVKPTVTTTSKVEDNHSTKVVST
DTTKDQTKTQTAHTVKTAQTAQEQNKVQTPVKDVATAKSESNNQAVSDNKSQQTNKVTKHNETPKQASKAKELPKTGLTS
VDNFISTVAFATLALLGSLSLLLFKRKESK
>Q2FZF0 ~~~isdB~~~Iron-regulated surface determinant protein B~~~COG5180
MNKQQKEFKSFYSIRKSSLGVASVAISTLLLLMSNGEAQAAAEETGGTNTEAQPKTEAVASPTTTSEKAPETKPVANAVS
VSNKEVEAPTSETKEAKEVKEVKAPKETKEVKPAAKATNNTYPILNQELREAIKNPAIKDKDHSAPNSRPIDFEMKKKDG
TQQFYHYASSVKPARVIFTDSKPEIELGLQSGQFWRKFEVYEGDKKLPIKLVSYDTVKDYAYIRFSVSNGTKAVKIVSST
HFNNKEEKYDYTLMEFAQPIYNSADKFKTEEDYKAEKLLAPYKKAKTLERQVYELNKIQDKLPEKLKAEYKKKLEDTKKA
LDEQVKSAITEFQNVQPTNEKMTDLQDTKYVVYESVENNESMMDTFVKHPIKTGMLNGKKYMVMETTNDDYWKDFMVEGQ
RVRTISKDAKNNTRTIIFPYVEGKTLYDAIVKVHVKTIDYDGQYHVRIVDKEAFTKANTDKSNKKEQQDNSAKKEATPAT
PSKPTPSPVEKESQKQDSQKDDNKQLPSVEKENDASSESGKDKTPATKPTKGEVESSSTTPTKVVSTTQNVAKPTTASSK
TTKDVVQTSAGSSEAKDSAPLQKANIKNTNDGHTQSQNNKNTQENKAKSLPQTGEESNKDMTLPLMALLALSSIVAFVLP
RKRKN
>Q5HGV5 ~~~isdB~~~Iron-regulated surface determinant protein B~~~
MNKQQKEFKSFYSIRKSSLGVASVAISTLLLLMSNGEAQAAAEETGGTNTEAQPKTEAVASPTTTSEKAPETKPVANAVS
VSNKEVEAPTSETKEAKEVKEVKAPKETKEVKPAAKATNNTYPILNQELREAIKNPAIKDKDHSAPNSRPIDFEMKKKDG
TQQFYHYASSVKPARVIFTDSKPEIELGLQSGQFWRKFEVYEGDKKLPIKLVSYDTVKDYAYIRFSVSNGTKAVKIVSST
HFNNKEEKYDYTLMEFAQPIYNSADKFKTEEDYKAEKLLAPYKKAKTLERQVYELNKIQDKLPEKLKAEYKKKLEDTKKA
LDEQVKSAITEFQNVQPTNEKMTDLQDTKYVVYESVENNESMMDTFVKHPIKTGMLNGKKYMVMETTNDDYWKDFMVEGQ
RVRTISKDAKNNTRTIIFPYVEGKTLYDAIVKVHVKTIDYDGQYHVRIVDKEAFTKANTDKSNKKEQQDNSAKKEATPAT
PSKPTPSPVEKESQKQDSQKDDNKQLPSVEKENDASSESGKDKTPATKPTKGEVESSSTTPTKVVSTTQNVAKPTTASSK
TTKDVVQTSAGSSEAKDSAPLQKANIKNTNDGHTQSQNNKNTQENKAKSLPQTGEESNKDMTLPLMALLALSSIVAFVLP
RKRKN
>A6QG30 ~~~isdB~~~Iron-regulated surface determinant protein B~~~
MNKQQKEFKSFYSIRKSSLGVASVAISTLLLLMSNGEAQAAAEETGGTNTEAQPKTEAVASPTTTSEKAPETKPVANAVS
VSNKEVEAPTSETKEAKEVKEVKAPKETKEVKPAAKATNNTYPILNQELREAIKNPAIKDKDHSAPNSRPIDFEMKKKDG
TQQFYHYASSVKPARVIFTDSKPEIELGLQSGQFWRKFEVYEGDKKLPIKLVSYDTVKDYAYIRFSVSNGTKAVKIVSST
HFNNKEEKYDYTLMEFAQPIYNSADKFKTEEDYKAEKLLAPYKKAKTLERQVYELNKIQDKLPEKLKAEYKKKLEDTKKA
LDEQVKSAITEFQNVQPTNEKMTDLQDTKYVVYESVENNESMMDTFVKHPIKTGMLNGKKYMVMETTNDDYWKDFMVEGQ
RVRTISKDAKNNTRTIIFPYVEGKTLYDAIVKVHVKTIDYDGQYHVRIVDKEAFTKANTDKSNKKEQQDNSAKKEATPAT
PSKPTPSPVEKESQKQDSQKDDNKQLPSVEKENDASSESGKDKTPATKPTKGEVESSSTTPTKVVSTTQNVAKPTTASSK
TTKDVVQTSAGSSEAKDSAPLQKANIKNTNDGHTQSQNNKNTQENKAKSLPQTGEESNKDMTLPLMALLALSSIVAFVLP
RKRKN
>Q7A656 ~~~isdB~~~Iron-regulated surface determinant protein B~~~
MNKQQKEFKSFYSIRKSSLGVASVAISTLLLLMSNGEAQAAAEETGGTNTEAQPKTEAVASPTTTSEKAPETKPVANAVS
VSNKEVEAPTSETKEAKEVKEVKAPKETKAVKPAAKATNNTYPILNQELREAIKNPAIKDKDHSAPNSRPIDFEMKKENG
EQQFYHYASSVKPARVIFTDSKPEIELGLQSGQFWRKFEVYEGDKKLPIKLVSYDTVKDYAYIRFSVSNGTKAVKIVSST
HFNNKEEKYDYTLMEFAQPIYNSADKFKTEEDYKAEKLLAPYKKAKTLERQVYELNKIQDKLPEKLKAEYKKKLEDTKKA
LDEQVKSAITEFQNVQPTNEKMTDLQDTKYVVYESVENNESMMDTFVKHPIKTGMLNGKKYMVMETTNDDYWKDFMVEGQ
RVRTISKDAKNNTRTIIFPYVEGKTLYDAIVKVHVKTIDYDGQYHVRIVDKEAFTKANTDKSNKKEQQDNSAKKEATPAT
PSKPTPSPVEKESQKQDSQKDDNKQLPSVEKENDASSESGKDKTPATKPTKGEVESSSTTPTKVVSTTQNVAKPTTASSK
TTKDVVQTSAGSSEAKDSAPLQKANIKNTNDGHTQSQNNKNTQENKAKSLPQTGEESNKDMTLPLMALLALSSIVAFVLP
RKRKN
>Q6GHV7 ~~~isdB~~~Iron-regulated surface determinant protein B~~~
MNKQQKEFKSFYSIRKSSLGVASVAISTLLLLMSNGEAKAAEETGVTNTEAQPKTEAVASPTTTTTEKAPEAKPVAKPVA
NAVSVSNKEVVAPTTETKEAKEVKAVKEVKAPKEAKEEKPAAKADNNTYPILNQELREAIKNPAIKDKDHSAPNSRPIDF
EMKKKDGTQQFYHYASSVKPARVIFTDSKPEIELGLQSGQFWRKFEVYEGDKKLPIKLVSYDTVKDYAYIRFSVSNGTKA
VKIVSSTHFNNKEEKYDYTLMEFAQPIYNSADKFKTEEDYKAEKLLAPYKKAKTLERQVYELNKIQDKLPEKLKAEYKKK
LEETKKALDEQVKSAITEFQNVQPTNEKMTDLQDTKYVVYESVENNESMMDAFVKHPIKTGMLNGKKYMVMETTNDDYWK
DFMVEGQRVRTISKDAKNNTRTIIFPYVEGKTLYDAIVKVHVKTIDYDGQYHVRIVDKEAFTKANADKTNKKEQQDNSAK
KETTPAMPSKPTTPPVEKESQKQDSQKDDNKQSPSVEKENDASSESGKDKMPVTKPAKAEVESSSTTPTKVVSTTQNVAK
PTTASSETTKDVVQTSAGSSEAKDSAPLQKANIKNTNDGHTQSQNNKNTQENKAKSLPQTGEESNKDMTLPLMALIALSS
IVAFVLPRKRKN
>Q8NX66 ~~~isdB~~~Iron-regulated surface determinant protein B~~~
MNKQQKEFKSFYSIRKSSLGVASVAISTLLLLMSNGEAQAAAEETGGTNTEAQPKTEAVASPTTTSEKAPETKPVANAVS
VSNKEVEAPTSETKEAKEVKEVKAPKETKEVKPAAKATNNTYPILNQELREAIKNPAIKDKDHSAPNSRPIDFEMKKKDG
TQQFYHYASSVKPARVIFTDSKPEIELGLQSGQFWRKFEVYEGDKKLPIKLVSYDTVKDYAYIRFSVSNGTKAVKIVSST
HFNNKEEKYDYTLMEFAQPIYNSADKFKTEEDYKAEKLLAPYKKAKTLERQVYELNKIQDKLPEKLKAEYKKKLEDTKKA
LDEQVKSAITEFQNVQPTNEKMTDLQDTKYVVYESVENNESMMDTFVKHPIKTGMLNGKKYMVMETTNDDYWKDFMVEGQ
RVRTISKDAKNNTRTIIFPYVEGKTLYDAIVKVHVKTIDYDGQYHVRIVDKEAFTKANTDKSNKKEQQDNSAKKEATPAT
PSKPTPSPVEKESQKQDSQKDDNKQLPSVEKENDASSESGKDKTPATKPTKGEVESSSTTPTKVVSTTQNVAKPTTASSK
TTKDVVQTSAGSSEAKDSAPLQKANIKNTNDGHTQSQNNKNTQENKAKSLPQTGEESNKDMTLPLMALLALSSIVAFVLP
RKRKN
>Q8KQR1 ~~~isdC~~~Iron-regulated surface determinant protein C~~~COG5386
MKNILKVFNTTILALIIIIATFSNSANAADSGTLNYEVYKYNTNDTSIANDYFNKPAKYIKKNGKLYVQITVNHSHWITG
MSIEGHKENIISKNTAKDERTSEFEVSKLNGKIDGKIDVYIDEKVNGKPFKYDHHYNITYKFNGPTDVAGANAPGKDDKN
SASGSDKGSDGTTTGQSESNSSNKDKVENPQTNAGTPAYIYAIPVASLALLIAITLFVRKKSKGNVE
>A6QG32 ~~~isdC~~~Iron-regulated surface determinant protein C~~~
MKNILKVFNTTILALIIIIATFSNSANAADSGTLNYEVYKYNTNDTSIANDYFNKPAKYIKKNGKLYVQITVNHSHWITG
MSIEGHKENIISKNTAKDERTSEFEVSKLNGKIDGKIDVYIDEKVNGKPFKYDHHYNITYKFNGPTDVAGANAPGKDDKN
SASGSDKGSDGTTTGQSESNSSNKDKVENPQTNAGTPAYIYAIPVASLALLIAITLFVRKKSKGNVE
>Q7A151 ~~~isdC~~~Iron-regulated surface determinant protein C~~~
MKNILKVFNTTILALIIIIATFSNSANAADSGTLNYEVYKYNTNDTSIANDYFNKPAKYIKKNGKLYVQITVNHSHWITG
MSIEGHKENIISKNTAKDERTSEFEVSKLNGKIDGKIDVYIDEKVNGKPFKYDHHYNITYKFNGPTDVAGANAPGKDDKN
SASGSDKGSDGTTTGQSESNSSNKDKVENPQTNAGTPAYIYAIPVASLALLIAITLFVRKKSKGNVE
>Q2FZE6 ~~~isdE~~~High-affinity heme uptake system protein IsdE~~~COG0614
MRIIKYLTILVISVVILTSCQSSSSQESTKSGEFRIVPTTVALTMTLDKLDLPIVGKPTSYKTLPNRYKDVPEIGQPMEP
NVEAVKKLKPTHVLSVSTIKDEMQPFYKQLNMKGYFYDFDSLKGMQKSITQLGDQFNRKAQAKELNDHLNSVKQKIENKA
AKQKKHPKVLILMGVPGSYLVATDKSYIGDLVKIAGGENVIKVKDRQYISSNTENLLNINPDIILRLPHGMPEEVKKMFQ
KEFKQNDIWKHFKAVKNNHVYDLEEVPFGITANVDADKAMTQLYDLFYKDKK
>A6QG34 ~~~isdE~~~High-affinity heme uptake system protein IsdE~~~
MRIIKYLTILVISVVILTSCQSSSSQESTKSGEFRIVPTTVALTMTLDKLDLPIVGKPTSYKTLPNRYKDVPEIGQPMEP
NVEAVKKLKPTHVLSVSTIKDEMQPFYKQLNMKGYFYDFDSLKGMQKSITQLGDQFNRKAQAKELNDHLNSVKQKIENKA
AKQKKHPKVLILMGVPGSYLVATDKSYIGDLVKIAGGENVIKVKDRQYISSNTENLLNINPDIILRLPHGMPEEVKKMFQ
KEFKQNDIWKHFKAVKNNHVYDLEEVPFGITANVDADKAMTQLYDLFYKDKK
>Q7A652 ~~~isdE~~~High-affinity heme uptake system protein IsdE~~~
MRIIKYLTILVISVVILTSCQSSSSQESTKSGEFRIVPTTVALTMTLDKLDLPIVGKPTSYKTLPNRYKDVPEIGQPMEP
NVEAVKKLKPTHVLSVSTIKDEMQPFYKQLNMKGYFYDFDSLKGMQKSITQLGDQFNRKAQAKELNDHLNSVKQKIENKA
AKQKKHPKVLILMGVPGSYLVATDKSYIGDLVKIAGGENVIKVKDRQYISSNTENLLNINPDIILRLPHGMPEEVKKMFQ
KEFKQNDIWKHFKAVKNNHVYDLEEVPFGITANVDADKAMTQLYDLFYKDKK
>Q2FZE5 ~~~isdF~~~Probable heme-iron transport system permease protein IsdF~~~COG0609
MMIKNKKKLLFLCLLVILIATAYISFVTGTIKLSFNDLFTKFTTGSNEAVDSIIDLRLPRILIALMVGAMLAVSGALLQA
ALQNPLAEANIIGVSSGALIMRALCMLFIPQLYFYLPLLSFIGGLIPFLIIILLHSKFRFNAVSMILVGVALFVLLNGVL
EILTQNPLMKIPQGLTMKIWSDVYILAVSALLGLILTLLLSPKLNLLNLDDIQARSIGFNIDRYRWLTGLLAVFLASATV
AIVGQLAFLGIIVHVVRKLVGGNYRVLIPFSTVIGAWLLLVADLLGRVIQPPLEIPANAILMIVGGPMLIYLICQSQRNR
I
>A6QG35 ~~~isdF~~~Probable heme-iron transport system permease protein IsdF~~~
MMIKNKKKLLFLCLLVILIATAYISFVTGTIKLSFNDLFTKFTTGSNEAVDSIIDLRLPRILIALMVGAMLAVSGALLQA
ALQNPLAEANIIGVSSGALIMRALCMLFIPQLYFYLPLLSFIGGLIPFLIIILLHSKFRFNAVSMILVGVALFVLLNGVL
EILTQNPLMKIPQGLTMKIWSDVYILAVSALLGLILTLLLSPKLNLLNLDDIQARSIGFNIDRYRWLTGLLAVFLASATV
AIVGQLAFLGIIVPHVVRKLVGGNYRVLIPFSTVIGAWLLLVADLLGRVIQPPLEIPANAILMIVGGPMLIYLICQSQRN
RI
>Q8NX64 ~~~isdF~~~Probable heme-iron transport system permease protein IsdF~~~
MMIKNKKKLLFLCLLVILIATAYISFVTGTIKLSFNDLITKFTTGNNEAVDSIIDLRLPRILIALMVGAMLAVSGALLQA
ALQNPLAEANIIGVSSGALIMRALCMLFIPQLYFYLPLLSFIGGLIPFLIIILLHSKFRFNAVSMILVGVALFVLLNGVL
EILTQNPLMKIPQGLTMKIWSDVYILAVSALLGLILTLLLSPKLNLLNLDDIQARSIGFNIDRYRWLTGLLAVFLASATV
AIVGQLAFLGIIVPHVVRKLVGGNYRVLIPFSTVIGAWLLLVADLLGRVIQPPLEIPANAILMIVGGPMLIYLICQSQRN
RI
>Q2FG07 ~~~isdH~~~Iron-regulated surface determinant protein H~~~
MNKHHPKLRSFYSIRKSTLGVASVIVSTLFLITSQHQAQAAENTNTSDKISENQNNNATTTQPPKDTNQTQPATQPANTA
KNYPAADESLKDAIKDPALENKEHDIGPREQVNFQLLDKNNETQYYHFFSIKDPADVYYTKKKAEVELDINTASTWKKFE
VYENNQKLPVRLVSYSPVPEDHAYIRFPVSDGTQELKIVSSTQIDDGEETNYDYTKLVFAKPIYNDPSLVKSDTNDAVVT
NDQSSSVASNQTNTNTSNQNTSTINNANNQPQATTNMSQPAQPKSSTNADQASSQPAHETNSNGNTNDKTNESSNQSDVN
QQYPPADESLQDAIKNPAIIDKEHTADNWRPIDFQMKNDKGERQFYHYASTVEPATVIFTKTGPIIELGLKTASTWKKFE
VYEGDKKLPVELVSYDSDKDYAYIRFPVSNGTREVKIVSSIEYGENIHEDYDYTLMVFAQPITNNPDDYVDEETYNLQKL
LAPYHKAKTLERQVYELEKLQEKLPEKYKAEYKKKLDQTRVELADQVKSAVTEFENVTPTNDQLTDLQEAHFVVFESEEN
SESVMDGFVEHPFYTATLNGQKYVVMKTKDDSYWKDLIVEGKRVTTVSKDPKNNSRTLIFPYIPDKAVYNAIVKVVVANI
GYEGQYHVRIINQDINTKDDDTSQNNTSEPLNVQTGQEGKVADTDVAENSSTATNPKDASDKADVIEPESDVVKDADNNI
DKDVQHDVDHLSDMSDNNHFDKYDLKEMDTQIAKDTDRNVDKDADNSVGMSSNVDTDKDSNKNKDKVIQLNHIADKNNHT
GKAAKLDVVKQNYNNTDKVTDKKTTEHLPSDIHKTVDKTVKTKEKAGTPSKENKLSQSKMLPKTGETTSSQSWWGLYALL
GMLALFIPKFRKESK
>Q2FXJ2 ~~~isdH~~~Iron-regulated surface determinant protein H~~~COG5386
MNKHHPKLRSFYSIRKSTLGVASVIVSTLFLITSQHQAQAAENTNTSDKISENQNNNATTTQPPKDTNQTQPATQPANTA
KNYPAADESLKDAIKDPALENKEHDIGPREQVNFQLLDKNNETQYYHFFSIKDPADVYYTKKKAEVELDINTASTWKKFE
VYENNQKLPVRLVSYSPVPEDHAYIRFPVSDGTQELKIVSSTQIDDGEETNYDYTKLVFAKPIYNDPSLVKSDTNDAVVT
NDQSSSVASNQTNTNTSNQNISTINNANNQPQATTNMSQPAQPKSSTNADQASSQPAHETNSNGNTNDKTNESSNQSDVN
QQYPPADESLQDAIKNPAIIDKEHTADNWRPIDFQMKNDKGERQFYHYASTVEPATVIFTKTGPIIELGLKTASTWKKFE
VYEGDKKLPVELVSYDSDKDYAYIRFPVSNGTREVKIVSSIEYGENIHEDYDYTLMVFAQPITNNPDDYVDEETYNLQKL
LAPYHKAKTLERQVYELEKLQEKLPEKYKAEYKKKLDQTRVELADQVKSAVTEFENVTPTNDQLTDLQEAHFVVFESEEN
SESVMDGFVEHPFYTATLNGQKYVVMKTKDDSYWKDLIVEGKRVTTVSKDPKNNSRTLIFPYIPDKAVYNAIVKVVVANI
GYEGQYHVRIINQDINTKDDDTSQNNTSEPLNVQTGQEGKVADTDVAENSSTATNPKDASDKADVIEPESDVVKDADNNI
DKDVQHDVDHLSDMSDNNHFDKYDLKEMDTQIAKDTDRNVDKDADNSVGMSSNVDTDKDSNKNKDKVIQLNHIADKNNHT
GKAAKLDVVKQNYNNTDKVTDKKTTEHLPSDIHKTVDKTVKTKEKAGTPSKENKLSQSKMLPKTGETTSSQSWWGLYALL
GMLALFIPKFRKESK
>Q5HF43 ~~~isdH~~~Iron-regulated surface determinant protein H~~~
MNKHHPKLRSFYSIRKSTLGVASVIVSTLFLITSQHQAQAAENTNTSDKISENQNNNATTTQPPKDTNQTQPATQPANTA
KNYPAADESLKDAIKDPALENKEHDIGPREQVNFQLLDKNNETQYYHFFSIKDPADVYYTKKKAEVELDINTASTWKKFE
VYENNQKLPVRLVSYSPVPEDHAYIRFPVSDGTQELKIVSSTQIDDGEETNYDYTKLVFAKPIYNDPSLVKSDTNDAVVT
NDQSSSVASNQTNTNTSNQNTSTINNANNQPQATTNMSQPAQPKSSTNADQASSQPAHETNSNGNTNDKTNESSNQSDVN
QQYPPADESLQDAIKNPAIIDKEHTADNWRPIDFQMKNDKGERQFYHYASTVEPATVIFTKTGPIIELGLKTASTWKKFE
VYEGDKKLPVELVSYDSDKDYAYIRFPVSNGTREVKIVSSIEYGENIHEDYDYTLMVFAQPITNNPDDYVDEETYNLQKL
LAPYHKAKTLERQVYELEKLQEKLPEKYKAEYKKKLDQTRVELADQVKSAVTEFENVTPTNDQLTDLQEAHFVVFESEEN
SESVMDGFVEHPFYTATLNGQKYVVMKTKDDSYWKDLIVEGKRVTTVSKDPKNNSRTLIFPYIPDKAVYNAIVKVVVANI
GYEGQYHVRIINQDINTKDDDTSQNNTSEPLNVQTGQEGKVADTDVAENSSTATNPKDASDKADVIEPESDVVKDADNNI
DKDVQHDVDHLSDMSDNNHFDKYDLKEMDTQIAKDTDRNVDKDADNSVGMSSNVDTDKDSNKNKDKVIQLNHIADKNNHT
GKAAKLDVVKQNYNNTDKVTDKKTTEHLPSDIHKTVDKTVKTKEKAGTPSKENKLSQSKMLPKTGETTSSQSWWGLYALL
GMLALFIPKFRKESK
>Q931P4 ~~~isdH~~~Iron-regulated surface determinant protein H~~~
MNKHHPKLRSFYSIRKSTLGVASVIVSTLFLITSQHQAQAAENTNTSDKISENQNNNATTTQQPKDTNQTQPATQPVITA
KNYPAADESLKDAIKDPALENKEHDIGPREQVNFQLLDKNNETQYYHFFSIKDPADVYYTKKKAEVELDINTASTWKKFE
VYENNQKLPVRLVSYSPVPEDHAYIRFPVSDGTQELKIVSSTQIDDGEETNYDYTKLVFAKPIYNDPSLVKSDTNDAVVT
NDQSSSDASNQTNTNTSNQNTSTTNNANNQPQATTNMSQPAQPKSSANADQASSQPAHETNSNGNTNDKTNESSNQSDVN
QQYPPADESLQDAIKNPAIIDKEHTADNWRPIDFQMKNDKGERQFYHYASTVEPATVIFTKTGPVIELGLKTASTWKKFE
VYEGDKKLPVELVSYDSDKDYAYIRFPVSNGTRDVKIVSSIEYGENIHEDYDYTLMVFAQPITNNPDDYVDEETYNLQKL
LAPYHKAKTLERQVYELEKLQEKLPEKYKAEYKKKLDQTRVELADQVKSAVTEFENVTPTNDQLTDLQEAHFVVFESEEN
SESVMDGFVEHPFYTATLNGQKYVVMKTKDDSYWKDLIVEGKRVTTVSKDPKNNSRTLIFPYIPDKAVYNAIVKVVVANI
GYEGQYHVRIINQDINTKDDDTSQNNTSEPLNVQTGQEGKVADTDVAENSSTATNPKDASDKADVIEPESDVVKDADNNI
DKDVQHDVDHLSDMSDNNHFDKYDLKEMDTQIAKDTDRNVDNSVGMSSNVDTDKDSNKNKDKVIQLAHIADKNNHTGKAA
KLDVVKQNYNNTDKVTDKKTTEHLPSDIHKTVDKTVKTKEKAGTPSKENKLSQSKMLPKTGETTSSQSWWGLYALLGMLA
LFIPKFRKESK
>Q99TD3 ~~~isdH~~~Iron-regulated surface determinant protein H~~~
MNKHHPKLRSFYSIRKSTLGVASVIVSTLFLITSQHQAQAAENTNTSDKISENQNNNATTTQQPKDTNQTQPATQPVITA
KNYPAADESLKDAIKDPALENKEHDIGPREQVNFQLLDKNNETQYYHFFSIKDPADVYYTKKKAEVELDINTASTWKKFE
VYENNQKLPVRLVSYSPVPEDHAYIRFPVSDGTQELKIVSSTQIDDGEETNYDYTKLVFAKPIYNDPSLVKSDTNDAVVT
NDQSSSDASNQTNTNTSNQNTSTTNNANNQPQATTNMSQPAQPKSSANADQASSQPAHETNSNGNTNDKTNESSNQSDVN
QQYPPADESLQDAIKNPAIIDKEHTADNWRPIDFQMKNDKGERQFYHYASTVEPATVIFTKTGPVIELGLKTASTWKKFE
VYEGDKKLPVELVSYDSDKDYAYIRFPVSNGTRDVKNVSSIEYGENIHEDYDYTLMVFAQPITNNPDDYVDEETYNLQKL
LAPYHKAKTLERQVYELEKLQEKLPEKYKAEYKKKLDQTRVELADQVKSAVTEFENVTPTNDQLTDLQEAHFVVFESEEN
SESVMDGFVEHPFYTATLNGQKYVVMKTKDDSYWKDLIVEGKRVTTVSKDPKNNSRTLIFPYIPDKAVYNAIVKVVVANI
GYEGQYHVRIINQDINTKDDDTSQNNTSEPLNVQTGQEGKVADTDVAENSSTATNPKDASDKADVIEPESDVVKDADNNI
DKDVQHDVDHLSDMSDNNHFDKYDLKEMDTQIAKDTDRNVDNSVGMSSNVDTDKDSNKNKDKVIQLAHIADKNNHTGKAA
KLDVVKQNYNNTDKVTDKKTTEHLPSDIHKTVDKTVKTKEKAGTPSKENKLSQSKMLPKTGETTSSQSWWGLYALLGMLA
LFIPKFRKESK
>Q6G8J7 ~~~isdH~~~Iron-regulated surface determinant protein H~~~
MNKHHPKLRSFYSIRKSILGVASVIVSTLFLITSQHQAQAAENTNTSDKISENQNNNATTTQPPKDTNQTQPATQPANTA
KTYPAADESLKDAIKDPALENKEHDIGPREQVNFQLLDKNNETQYYHFFSIKDPADVYYTKKKAEVELDINTASTWKKFE
VYENNQKLPVRLVSYSPVPEDHAYIRFPVSDGTQELKIVSSTQIDDGEETNYDYTKLVFAKPIYNDPSLVKSDTNDAVVT
NDQSSSDASNQTNTNTSNQNTSTINNANNQPQATTNMSQPAQPKSSANADQASSQPAHETNSNGNTNDKTNESSNQSDVN
QQYPPADESLQDAIKNPAIIDKEHTADNWRPIDFQMKNDKGERQFYHYASTVEPATVIFTKTGPIIELGLKTASTWKKFE
VYEGDKKLPVELVSYDSDKDYAYIRFPVSNGTREVKIVSSIEYGENIHEDYDYTLMVFAQPITNNPDDYVDEETYNLQKL
LAPYHKAKTLERQVYELEKLQEKLPEKYKAEYKKKLDQTRVELADQVKSAVTEFENVTPTNDQLTDVQEAHFVVFESEEN
SESVMDGFVEHPFYTATLNGQKYVVMKTKDDSYWKDLIVEGKRVTTVSKDPKNNSRTLIFPYIPDKAVYNAIVKVVVANI
GYEGQYHVRIINQDINTKDDDTSQNNTSEPLNVQTGQEGKVADTDVAENSSTATNPKDASDKADVIEPDSDVVKDADNNI
DKDVQHDVDHLSDMSDNNHFDKYDLKEMDTQIAKDTDRNVDKGADNSVGMSSNVDTDKDSNKNKDKVIQLNHIADKNNHN
GKAAKLDVVKQNYNNTDKVTDKKTTEHLPSDIHKTVDKTVKTKEKAGTPSKENKLSQSKMLPKTGETTSSQSWWGLYALL
GMLALFIPKFRKESK
>Q8NW39 ~~~isdH~~~Iron-regulated surface determinant protein H~~~
MNKHHPKLRSFYSIRKSILGVASVIVSTLFLITSQHQAQAAENTNTSDKISENQNNNATTTQPPKDTNQTQPATQPANTA
KTYPAADESLKDAIKDPALENKEHDIGPREQVNFQLLDKNNETQYYHFFSIKDPADVYYTKKKAEVELDINTASTWKKFE
VYENNQKLPVRLVSYSPVPEDHAYIRFPVSDGTQELKIVSSTQIDDGEETNYDYTKLVFAKPIYNDPSLVKSDTNDAVVT
NDQSSSDASNQTNTNTSNQNTSTINNANNQPQATTNMSQPAQPKSSANADQASSQPAHETNSNGNTNDKTNESSNQSDVN
QQYPPADESLQDAIKNPAIIDKEHTADNWRPIDFQMKNDKGERQFYHYASTVEPATVIFTKTGPIIELGLKTASTWKKFE
VYEGDKKLPVELVSYDSDKDYAYIRFPVSNGTREVKIVSSIEYGENIHEDYDYTLMVFAQPITNNPDDYVDEETYNLQKL
LAPYHKAKTLERQVYELEKLQEKLPEKYKAEYKKKLDQTRVELADQVKSAVTEFENVTPTNDQLTDVQEAHFVVFESEEN
SESVMDGFVEHPFYTATLNGQKYVVMKTKDDSYWKDLIVEGKRVTTVSKDPKNNSRTLIFPYIPDKAVYNAIVKVVVANI
GYEGQYHVRIINQDINTKDDDTSQNNTSEPLNVQTGQEGKVADTDVAENSSTATNPKDASDKADVIEPDSDVVKDADNNI
DKDVQHDVDHLSDMSDNNHFDKYDLKEMDTQIAKDTDRNVDKGADNSVGMSSNVDTDKDSNKNKDKVIQLNHIADKNNHN
GKAAKLDVVKQNYNNTDKVTDKKTTEHLPSDIHKTVDKTVKTKEKAGTPSKENKLSQSKMLPKTGETTSSQSWWGLYALL
GMLALFIPKFRKESK
>Q1QU27 1.1.1.313~~~isfD2~~~Sulfoacetaldehyde reductase 2~~~COG4221
MSDIVLITGATSGFGRAAARRFADAGWSLILTGRREERLTELAEELSQRVRVHTAVLDVRDEKAVQSVIDELPEAFRRVK
TLVNNAGLALAPQPAQDVDLADWHTMIDTNIKGLVNVTHAVLPTLIETGAGASIVNLGSVAGQWPYPGSHVYGASKAFVQ
QFTYNLRCDLQGTGVRVTDVAPGMAETEFTLVRTGGDQAASDALYRDTTPLQAEDVAELIFYTATLPAHVNVNRLEVMPT
RQAWSAFAVDRD
>Q1R183 1.1.1.313~~~isfD~~~Sulfoacetaldehyde reductase~~~COG4221
MTDCVFITGATSGFGRAAAHRFAAAGWSLVLTGRRLERLEALKEELQGRVPVHIIALDVRDSDVVDAAVAALPEGFTRVR
TLLNNAGLALAPQSAQHTDRSDWHTMIDTNVTGLVNVTHALLPTLIDVGEGATIVNVGSIAGQWPYPGSHVYGASKAFVK
QFSYNLRCDLLGTGVRVTDLAPGIAETEFTLVRTGGDQAASDALYRGTTALTAEDIAEQMFYIATLPPHVNFNRLEVMPT
RQAWSAFAIDYDA
>D3U1D9 1.1.1.313~~~isfD~~~Sulfoacetaldehyde reductase~~~COG4221
MATSKVVFITGATSGFGEAAAQVFADAGWSLVLSGRRFERLKTLQDKLASQVPVHIIELDVRDSDSVAAAVAALPADFAD
ITTLINNAGLALSPQPAQKVDLDDWKTMIDTNVTGLVNVTHALLPTLINHGAGASIINIGSIAGQWPYPGSHVYGASKAF
VKQFSYNLRCDLLGTGVRVTDLAPGIAETEFTLVRTKGDQAASDNLYRGTTPLSARDIAEQMFYIATLPDHMNINRVEVM
PVRQAWQPFAIDRD
>Q188R7 ~~~~~~Iron-sulfur flavoprotein CD630_04720~~~COG0655
MIITVINGSPRKNGATSKVLTYLYKDIERLIPDVKINYFDLSEVNPSYCIGCLNCYKMGKCINQNDKVEYIHDIITKSDG
VIFGSPTYGSSVTGLFKVFTDRAHMMLERLLYRKPCIAVTTYENARGSKAISFIKSMVLDSGGYVCGSLSIKTGFNQNPI
TEKVESKIQKVSKKFIYCIEEKKNPPVLSQIYNFIAINAVLKPMAFKDIEQYKGIIDRWEEQGII
>Q9F488 ~~~isiA~~~Iron stress-induced chlorophyll-binding protein~~~
MQTYDNPKVKYDWWAGNARFANLSGLFIGAHVAQAALTTLWAGAFTWFELSRYQSGVPMGEQGLILLPHLATLGFGVGAG
GQIVDTYPYFVIGALHIISSAVLGAGALLHTFKGPENLKDATGPAREFHFEWDDANKLGLILGHHLLFLGAGALLLVAKA
MFWGGLYDTTIHDVRVVTEPTLNPFIIFGYQTHFASVDNLEDVVGGHIYVGLLLIFGGVWHILVKPLAWAKKLLIFSGEA
ILSYSLGGIALAGFVAAYFCAVNTLAYPVEFYGPPLEVKFWYLRPYFADTIQLPYGNYTSRAWLANTHFFLAFFFLQGHL
WHALRAIGFDFKRVEKALSAVETSS
>Q8YQ35 ~~~isiA~~~Iron stress-induced chlorophyll-binding protein~~~
MQTYDNPNIKYDWWAGNARFANLSGLFIGAHVAQAALTTLWAGAFTWFEISRYKPEIPMGEQGLILLPHLATLGFGVGVS
GQVVNTYPYFVIGALHLISSAVLGAGALFHTFKGPRNLKNTTGSARKFHFEWNDPKQLGLILGHHLLFLGMAALLLVGKA
MFWGGLYDATTQVVRVVNHPTLNPFVIYGYQTHFASVNNLEDLVGGHIYVGLILIGGGIWHIVKEPLPWAKKLLIFSGEA
ILSYSLGGIALAGFVAAYFCAVNTLAYPVEFYGAPLELKFGVTPYFADTVKLADGGYSARAWLANAHFFLAFFFLQGHLW
HALRAIGVDFRQIEKSLNAISSAE
>P15347 ~~~isiA~~~Iron stress-induced chlorophyll-binding protein~~~
MQTYNNPEVTYDWWAGNARFANLSGLFIAAHVAQAALIMFWAGAFTLYEISWLTADQSMGEQGLILLPHLATLGLGVGDG
GQVTDTYPLFVVGAVHLIASAVLGAGALFHTFRAPSDLAAASGAAKRFHFDWNDPKQLGLILGHHLLFLGVGALLLVAKA
TTWGGLYDAASQTVRLVTEPTLNPAVIYGYQTHFASIDNLEDLVGGHVYVGVMLIAGGIWHILVPPFQWTKKVLIYSGEA
ILSYSLGGIALAGFVAAYFCAVNTLAYPVEFYGAPLEIKLGVTPYFADTVQLPFGAHTPRAWLSNAHFFLAFFCLQGHLW
HALRAMGFDFRRVEKALSSVEA
>P31157 ~~~isiA~~~Iron stress-induced chlorophyll-binding protein~~~
MQTYDNPDVKYEWWAGNARFADLSGQFIGAHVAHAALIVFWAGAFTLFEISYFDPTLPMGEQNLILLPHLATLGLGIEGN
GAINTEPYFVIGAIHLISSAVLAAGGLFHVLRGPQDLKTATGPARRFHFDWEDPKQLGLILGHHLLLLGLGAFLLVAKAM
YFGGLYDTATQTVRLVTEPTLDPAVIYGYQTHFATVDNLEDIVGGHIYVGVLLVAGGIWHILVPPLQWAKKVLLFSGEAI
LSYSLGAIALAGFVAAYFCAVNTTAYPVEFYGPVLDVKLSIVPYFADTIELPMNEHTSRAWLANAHFFFAFFFLQGHLWH
ALRAMGFDFRRVEKVLSDPLDA
>Q55274 ~~~isiA~~~Iron stress-induced chlorophyll-binding protein~~~
MQTYGNDTVQYEWWAGNARFADQSGLFIAAHVAQAALTAFWAGAFTLFEISRFDPTQAMGDQGLILLPHLATLGWGVGDG
GQIVDTYPYFVIGSIHLIASAVLGAGALFHTLRAPADLSTLKGQGKKFHFTWENPQQLGIILGHHLLFLGAGALLLAGKA
MYWGGLYDATTQTVRLVSQPTLDPLVIYGYQTHFASISSLEDLVGGHIFVGFLLIGGGIWHILVPPLGWAKKVLLFSGEA
ILSYSLGGIALAGFVAAYFCAVNTLAYPPEFYGPPLAIKLGIFPYFADTVELPMHAHTSRAWLANAHFFLAFFFLQGHLW
HALRALGFDFKRVEQAFDSLQT
>E5Y378 4.4.1.38~~~islA~~~Isethionate sulfite-lyase~~~COG1882
MTQVAEIKSPHEQRLEDNIAGKEDIYRESHKRVFKLLERFDGQKPAIDVERALYFTQSMAETVGQPLVLRWAKALMNVAK
NITVMVQDDQLLLGRCGGHDGRYGILYPELDGDFLDIAVRDLPTRPQSPASISPEDAKIVVEQIAPFWKGRTYHEALNKA
LPAEVHKLTYDDPDGLISRFIVNETSSFRSSIQWVHDYEVVLKRGFNGLKQEMEEKLAALDPASPVDQVDKRPFIEATIL
VCDAIVLWAKRHADAARKAAEACADPVRKAELIRMAENAEHVPANPARDFYEAVQSQYFTQMFSRLEQKTGTTISNGRMD
QYFYPFYKKDMEAGILTDEKTLEYLECMWVGMAEFIDMYISPAGGAFNEGYAHWEAVTIGGQTPDGRDATNDLTYLFLKS
KREFPLHYPDLAARIHSRAPERYLWDVAETIKFGSGFPKLCNDEECIPLYVSKGATFEEALDYAVSGCIEIRMPNRDTYT
SGGAYTNFASAVEMALYDGKMKKYGDVQLGIQTGDARKFKSWDEFWNAYVQQHMLLLRTTFIQQYIVIQTRAKHFAQPMG
SVLHALCRKHCIDLHQPQIPEGLNFGYFEFMGLGTVIDSLAAIKKLVFEDKKLTMDQLIDALEANFEGYEDIQQLLRTAP
CYGNDDEYADEIGRELDRMAVSFAAKYGKEMGINNDARYVPFTSHVPFGKVVSATPNGRVAWFPLADGSSPSHGADHNGP
TAILLSNHNTKNYGMRARAARLINVKFTPKCVEGDAGTEKLVQFIRTWCDLKLWHIQFNVINADTLKKAQKDPQKYRNLI
VRIAGYSAYFVDLTPDLQNDLIARTGHDQM
>B8J0R1 4.4.1.38~~~islA~~~Isethionate sulfite-lyase~~~COG1882
MSMTTCECRSPQEQRLYDKIEGREDRFRKTHPRVFRLLERFEGQKPRIDIERALYFTQSMQETEGQPLVLRWAKALMHIA
RNMTVYVQEDQLLLGRAGCDGRYGILYPELDGDFLDIAVRDLPTRKTSPATITPEDARRVVEEIAPYWKGKTYHEALNAA
LPAEVHKLTYDDPEGLISRFIVNETSSFRSSIQWVHDYEKILKRGFNSIKKEAREKLAALDPLSAKDDREKRPFLEAVMI
VCDAIVLWAKRHAVLAREMAEKESDPVRKAELLRMAENAEHVPGEPARDFWEACQSQWFTQMFSRIEQKTGTTISNGRMD
QYFQPYYKQDREAGKITEAQAMELLECMWVGMAEFIDMYISPTGGAFNEGYAHWEAVTVGGQTPDGRDASNDLTYLILKS
KREFPLHYPDLAARIHSRAPERYLWDVAETIKYGSGFPKLINDEEIVPLYVSKGATFEEALDYAVSGCTEARMPNRDTYT
SGGAYINFAAAVEMVLRNGRMKKYGDQKLGVETGDPRSFTTWDQFWNAYVEQHLLFLKTAFTQQYIINKLRAEHFAQPMG
SAMHDLCMKHCIDLHQEQIPEGINLGYFEYMGLGTVVDSLAAVKKLVFEEKKLSMDKLIAAIDADFEGYEDVRALLRSAP
CYGNNDEYADAIGRDIDRISVEYGNKYSMSDLGIHNDVRYVPFTSHVPFGKVVSATPNGRTDGFPLSDGSSASHGADVNG
PTAVLLSNCTTKNMGLRDRAARMLNIKFTPKCVEGEQGTEKLVSFIRTFCDLKLWHVQFNVVNKGTLVAAQKDPQKYRNL
IVRIAGYSAYFVDLSPDLQNDLIARTEHDVM
>Q727N1 4.4.1.38~~~iseG~~~Isethionate sulfite-lyase~~~COG1882
MQCCNQLSPHEQRLQDKIEGKVDRYRATHERVFTILESFDNTRPRIDVERAKYFTESMKATEGQPLPLRWAKALMHIAEN
MTVYIDDHQLICGRAGYQGRYGVLYPELDGDFLGTAIEDLPNRAESPFAITPEDARVVVEEIAPFWKGKTYHEALNLALP
ADVHKLTYDDPQGLMSRFIVNETSSFRSSIQWVHDYEKVLKRGFRSIKEEALEKIAALDPMSPCDNVEKRPFLEAIVIVC
DAIILWAKRHAKLAAELAAKETDPTRKRELETMAEICAWVPENPARTFHEAVQAQWFTQVFSRIEQKTGTIVSNGRMDQY
FWPFYEKDLAEGRITEDSALELLECMWVGMAQYVDLYISPTGGAFNEGYAHWEAVTIGGQTPEGRDATNDLTYLFLKSKR
EFPLHYPDLAARIHSRSPERYLWEVAETIKDGSGFPKLINDEEVVPLYVSKGATFAEALDYAVSGCTEARMPNRDTYTSG
GAYINFAAALEMVLYNGKMLKYGDTDLGAHTGDPCEFKTWEEFWNAYVTQHHLFLKTAFVQQHIINNLRARHFAQPMGSS
LHDLCMKHCLDLHTPQIPEGINLGYFEYMGFGTVVDSLSAIKKLVFEDKKLTMGELIEALKCNFEGKEDIQQLLKSAPCY
GNNDDYADSIARDIDALSVKYGRRYSPELGMHNDVRYVPFTSHVPFGRVVSATPNGRKAWSALSDGSSASHGADVNGPTA
ILQSNFNSKNYGMRDRAARMLNIKFTPKCVEGEEGSQKLVSFIRTFCDLKLWHVQFNVINKETLLAAQRDPEKYRNLIVR
IAGYSAYFVDLSPDLQNDLIARTGHDVM
>Q312S2 4.4.1.38~~~islA~~~Isethionate sulfite-lyase~~~COG1882
MQCCTTPLSPHEQRLQDKIAGKEDSFRKSHERVFNILDSFDGKRPRIDVERAKLFTDSMKETEGQPLVLRWAKAMKHVAE
HITVYIDDDQLICGRGGCPGRYGVLYPELDGDFLDLAIEDLPNRTESPFTITEADARVVVEEIAPYWKGKTYHEDLNLAL
PSDVHKLTYDDPQGLKSRFIVNETSSFRSSIQWVHDYEKVLKRGFRGLKEEAQEKIAGLDPLSPRDNVEKRPFLEAIVIV
CDAIILWANRHAKLAADMAAAETNPVRKAELETMAEICAWVPENPARNFYEAVQAQWFTQMFSRLEQKTGTIVSNGRMDQ
YFWPFYRKDIEEGRITEESALELLECMWVGMAQYVDLYISPAGGAFNEGYAHWEAVTIGGQTPQGLDATNDLTYLFLKSK
REFPLHYPDLAARIHSRSPERYLHDVAETIKFGSGFPKLINDEEIVPLYVSKGASFEEALDYAVSGCTEARMPNRDTYTS
GGAYINFAAALEMVLYNGRMLKYGENELGLETGDPTRFETWEEFWNAYVLQHEHFLRAAFIQQHIINNVRARHFAQPMGS
ALHDLCMKHCLDLHTPQIPEGINLGYFEYMGFGTVVDSLAAIKKLVFEDKKLTMQEVIEALKCNFEGKEDVQQMLKSAPC
YGNNDEYADSIAREIDAISVKYGRRYSPELGMHNDVRYVPFTSHVPFGKVVSATPNGRLAWTPLSDGSSASHGADVNGPT
AVLQSNFSSKNYGYRDRAARMLNIKFTPKCVEGDEGTEKLVSFIRTFCDLKLWHVQFNVINRDTLIAAQKDPEKYRSLIV
RIAGYSAYFVDLSPDLQNDLIARTQHDAM
>E5Y377 1.97.1.-~~~islB~~~Isethionate sulfite-lyase activating enzyme~~~COG1180
MGSFEDRKATGTVFNIQKYSVHDGPGIRTIVFLKGCPLSCKWCSNPESQASHPQVAYNKGRCIGCHRCIKACEHDAITVN
EDGTLSLDRGKCDVCKTLDCAHACPAQGMIIYGENKTVDQILKEVEKDALFYARSGGGMTLSGGEPLMHADIALPLLREA
RHRRIKTAIETCGCIPWDTLKEAAPYLNYVLFDVKQMDSEKHREGVGVGNELILSNLKKLLTEFPNLHVQVRTPIIPGFN
DNDEFAYALGEFLKGYENVGYEALPYHRLGTQKYDFLSREYAMGDVSLPDGVAQRIQRIVDETRGAVTEEKK
>B8J0R0 1.97.1.-~~~islB~~~Isethionate sulfite-lyase activating enzyme~~~COG1180
MCLDDKQQGMVFNIQKYSVHDGPGIRTIVFLKGCSLSCRWCSNPESQKSCAELACNPGRCIDISKCGHCLTACPHGAITC
GDDDKPRIDRSHCADCSIPCAEVCPAQGLLVYGKKRAVGDVLRVVEQDMAFYARSGGGLTLSGGEPLLQGSFAVALLREA
RARRIRTAVETCGMVPADTVREAAPHLSYVLYDIKHMNSEIHETQTGLPNARILENFRILAEEFPHLPILARTPVIPGFN
DNEKAVAAIARFIKAYPHVNYELLPYHRLGTQKYHFLGREVPMGEVSLNKAVTDGLQKTALDILGERVQIPR
>Q727N0 1.97.1.-~~~iseH~~~Isethionate sulfite-lyase activating enzyme~~~COG1180
MSSIADRKTTGITFNIQKYSVHDGPGIRTVVFLKGCPLKCRWCSNPESQRKSVELAYNTGRCLTLAKCVRCVEICTAGAI
SRAEDDTISIDRALCNDCEQLCSGACPSNALITYGAHKTVDEVLRAVEQDSLFYARSGGGMTISGGEPFAQPAFTLALLR
EARRRRVHTAVETCGYASWDDMAAALPFLNYVLYDIKNLDDARHKEATGVSNQRIVENLRALRAEFPGIPVLVRTPVIPG
FNDNEADIAAIAALTRELGVSYQLLPYHRLGTQKYHFLDREAPMGEVTLDAETMRRLEAVVAQTADS
>Q312S3 1.97.1.-~~~islB~~~Isethionate sulfite-lyase activating enzyme~~~COG1180
MSSFEDKKTTGITFNIQKYSVHDGPGIRTVVFLKGCPLRCRWCSNPESQRRRIELAYNTGRCLTLTKCVRCVEVCTMNAI
TRADDDTISIDRALCEECGMFCAEACPSKALITYGTTRTVDEVLNVVEQDSVFYARSGGGITLSGGEPFAQPAFALALLR
EARRRHIHTAVETCGYASWSDMEPALEYVKFVHYDIKSLDDEKHRSATGVSNVRIIENLRNIRSRFPALKVVVRTPVIPG
FNDTEEDIRAIARLTAELEVEYQLLPYHRLGTQKYTFLDRQAPMGEVVLDEQVMTALNAVVAAEHATDG
>O32611 3.2.1.68~~~iam~~~Isoamylase~~~
MDPHAPQRQRSGQRLRALALAALACALSPAHAAIDAQQLGARYDAAQANLAFRVYSSRATRVEVFLYKNPTGSQEVARLA
LSKDPATQVWSLSLPTSTIKNTYGITGAVYYGYRAWGPNWPYDAAWTKGSATGFVSDVDNAGNRFNPNKLLLDPYAREIS
QDPNTATCADGTIYATGAAHRNKDSGLCASKGIALAADATSVGSKPTRALKDEVIYEVHVRGLTRNDDSVPAAERGTYKG
AARKAAALAALGVTAVEFLPVQETQNDQNDVDPNSTAGDNYWGYMTLNYFAPDRRYAYDKSAGGPTREWKAMVKAFHDAG
IKVYIDVVYNHTGEGGPWSGTDGLSVYNLLSFRGLDNPAYYSLSSDYKYPWDNTGVGGNYNTRHPIAQNLIVDSLAYWRD
ALGVDGFRFDLASVLGNSCQHGCFNFDKNDSGNALNRIVAELPPRPAAGGAGADLIAEPWAIGGNSYQVGGFPAGWAEWN
GLYRDALRKKQNKLGVETVTPGTLATRFAGSNDLYGDDGRKPWHSINFVVAHDGFTLNDLYAYNDKQNNQPWPYGPSDGG
EDHNLSWNQGGIVAEQRKAARTGLALLMLSAGVPMITGGDEALRTQFGNNNTYNLDSAANWLYWSRSALEADHETYTKRL
IAFRKAHPALRPANFYSASDTNGNVMEQLRWFKPDGAQADSAYFNGADNHALAWRIDGSEFGDSASAIYVAYNGWSGAVD
FKLPWPGTGKQWYRVTDTATWNEGPNAVALPGSETLIGGENTVYGMQARSLLLLIAK
>P10342 3.2.1.68~~~iam~~~Isoamylase~~~
MKCPKILAALLGCAVLAGVPAMPAHAAINSMSLGASYDAQQANITFRVYSSQATRIVLYLYSAGYGVQESATYTLSPAGS
GVWAVTVPVSSIKAAGITGAVYYGYRAWGPNWPYASNWGKGSQAGFVSDVDANGDRFNPNKLLLDPYAQEVSQDPLNPSN
QNGNVFASGASYRTTDSGIYAPKGVVLVPSTQSTGTKPTRAQKDDVIYEVHVRGFTEQDTSIPAQYRGTYYGAGLKASYL
ASLGVTAVEFLPVQETQNDANDVVPNSDANQNYWGYMTENYFSPDRRYAYNKAAGGPTAEFQAMVQAFHNAGIKVYMDVV
YNHTAEGGTWTSSDPTTATIYSWRGLDNATYYELTSGNQYFYDNTGIGANFNTYNTVAQNLIVDSLAYWANTMGVDGFRF
DLASVLGNSCLNGAYTASAPNCPNGGYNFDAADSNVAINRILREFTVRPAAGGSGLDLFAEPWAIGGNSYQLGGFPQGWS
EWNGLFRDSLRQAQNELGSMTIYVTQDANDFSGSSNLFQSSGRSPWNSINFIDVHDGMTLKDVYSCNGANNSQAWPYGPS
DGGTSTNYSWDQGMSAGTGAAVDQRRAARTGMAFEMLSAGTPLMQGGDEYLRTLQCNNNAYNLDSSANWLTYSWTTDQSN
FYTFAQRLIAFRKAHPALRPSSWYSGSQLTWYQPSGAVADSNYWNNTSNYAIAYAINGPSLGDSNSIYVAYNGWSSSVTF
TLPAPPSGTQWYRVTDTCDWNDGASTFVAPGSETLIGGAGTTYGQCGQSLLLLISK
>P26501 3.2.1.68~~~iam~~~Isoamylase~~~
MKCPKILAALLGCAVLAGVPAMPAHAAINSMSLGASYDAQQANITFRVYSSQATRIVLYLYSAGYGVQESATYTLSPAGS
GVWAVTVPVSSIKAAGITGAVYYGYRAWGPNWPYASNWGKGSQAGFVSDVDANGDRFNPNKLLLDPYAQEVSQDPLNPSN
QNGNVFASGASYRTTDSGIYAPKGVVLVPSTQSTGTKPTRAQKDDVIYEVHVRGFTEQDTSIPAQYRGTYYGAGLKASYL
ASLGVTAVEFLPVQETQNDANDVVPNSDANQNYWGYMTENYFSPDRRYAYNKAAGGPTAEFQAMVQAFHNAGIKVYMDVV
YNHTAEGGTWTSSDPTTATIYSWRGLDNTTYYELTSGNQYFYDNTGIGANFNTYNTVAQNLIVDSLAYWANTMGVDGFRF
DLASVLGNSCLNGAYTASAPNCPNGGYNFDAADSNVAINRILREFTVRPAAGGSGLDLFAEPWAIGGNSYQLGGFPQGWS
EWNGLFRDSLRQAQNELGSMTIYVTQDANDFSGSSNLFQSSGRSPWNSINFIDVHDGMTLKDVYSCNGANNSQAWPYGPS
DGGTSTNYSWDQGMSAGTGAAVDQRRAARTGMAFEMLSAGTPLMQGGDEYLRTLQCNNNAYNLDSSANWLTYSWTTDQSN
FYTFAQRLIAFRKAHPALRPSSWYSGSQLTWYQPSGAVADSNYWNNTSNYAIAYAINGPSLGDSNSIYVAYNGWSSSVTF
TLPAPPSGTQWYRVTDTCDWNDGASTFVAPGSETLIGGAGTTYGQCGQSLLLLISK
>Q9RBP5 1.1.1.398~~~isoH~~~1-hydroxy-2-glutathionyl-2-methyl-3-butene dehydrogenase~~~
MSTVLVVGADKGIAHSISRQLHDRGEDVIAACLFDGADLAAAGITVEPGVDVTSQESVEALAARLSEKGVKLDAVFHVAG
VMWLDEVGSLDYDLIRRQIEINTLGPLRTIEAVRPLLNEGAKVGIVTSRVGSLGDNTSGGMYSYRISKAAANMVGLNFHH
DLSKDGVSVLLLHPGMVATDLTKDFPGEHSYITPEQAAAGLIKNIDNLTPETSGRFQHSDGTFLQW
>Q9RBP4 4.4.1.34~~~isoI~~~Isoprene-epoxide--glutathione S-transferase~~~
MITVYGYVPAWGIPDISPYVTKVVNYLSFTGIEFEYKTQDLATLDQDSPHGKLPYIVDSDGTKVGDSNTIIEYLKNKFGD
KLDADLSKQQLAQALAFHRLIEEHLYWSGIIQARWQDDAGWETYIPFIVQGAEVTPEMRVGLDAFRARILDGFNGQGMGR
RSEEVVAEFFRADIDALSDFLDDKPFILGDKVHSIDASLYSTLRHIADQPQQWLGSGYVQTKPNLVDYLERIRKQYDI
>P11018 3.4.21.-~~~isp~~~Major intracellular serine protease~~~COG1404
MNGEIRLIPYVTNEQIMDVNELPEGIKVIKAPEMWAKGVKGKNIKVAVLDTGCDTSHPDLKNQIIGGKNFTDDDGGKEDA
ISDYNGHGTHVAGTIAANDSNGGIAGVAPEASLLIVKVLGGENGSGQYEWIINGINYAVEQKVDIISMSLGGPSDVPELK
EAVKNAVKNGVLVVCAAGNEGDGDERTEELSYPAAYNEVIAVGSVSVARELSEFSNANKEIDLVAPGENILSTLPNKKYG
KLTGTSMAAPHVSGALALIKSYEEESFQRKLSESEVFAQLIRRTLPLDIAKTLAGNGFLYLTAPDELAEKAEQSHLLTL
>P22939 2.5.1.10~~~ispA~~~Farnesyl diphosphate synthase~~~COG0142
MDFPQQLEACVKQANQALSRFIAPLPFQNTPVVETMQYGALLGGKRLRPFLVYATGHMFGVSTNTLDAPAAAVECIHAYS
LIHDDLPAMDDDDLRRGLPTCHVKFGEANAILAGDALQTLAFSILSDADMPEVSDRDRISMISELASASGIAGMCGGQAL
DLDAEGKHVPLDALERIHRHKTGALIRAAVRLGALSAGDKGRRALPVLDKYAESIGLAFQVQDDILDVVGDTATLGKRQG
ADQQLGKSTYPALLGLEQARKKARDLIDDARQSLKQLAEQSLDTSALEALADYIIQRNK
>Q08291 2.5.1.10~~~~~~Farnesyl diphosphate synthase~~~
MAQLSVEQFLNEQKQAVETALSRYIERLEGPAKLKKAMAYSLEAGGKRIRPLLLLSTVRALGKDPAVGLPVACAIEMIHT
YSLIHDDLPSMDNDDLRRGKPTNHKVFGEAMAILAGDGLLTYAFQLITEIDDERIPPSVRLRLIERLAKAAGPEGMVAGQ
AADMEGEGKTLTLSELEYIHRHKTGKMLQYSVHAGALIGGADARQTRELDEFAAHLGLAFQIRDDILDIEGAEEKIGKPV
GSDQSNNKATYPALLSLAGAKEKLAFHIEAAQRHLRNADVDGAALAYICELVAARDH
>P0AD57 2.5.1.90~~~ispB~~~Octaprenyl diphosphate synthase~~~COG0142
MNLEKINELTAQDMAGVNAAILEQLNSDVQLINQLGYYIVSGGGKRIRPMIAVLAARAVGYEGNAHVTIAALIEFIHTAT
LLHDDVVDESDMRRGKATANAAFGNAASVLVGDFIYTRAFQMMTSLGSLKVLEVMSEAVNVIAEGEVLQLMNVNDPDITE
ENYMRVIYSKTARLFEAAAQCSGILAGCTPEEEKGLQDYGRYLGTAFQLIDDLLDYNADGEQLGKNVGDDLNEGKPTLPL
LHAMHHGTPEQAQMIRTAIEQGNGRHLLEPVLEAMNACGSLEWTRQRAEEEADKAIAALQVLPDTPWREALIGLAHIAVQ
RDR
>P44916 2.5.1.90~~~ispB~~~Octaprenyl diphosphate synthase~~~COG0142
MKKQDLMSIDEIQKLADPDMQKVNQNILAQLNSDVPLIGQLGFYIVQGGGKRIRPLIAVLAARSLGFEGSNSITCATFVE
FIHTASLLHDDVVDESDMRRGRATANAEFGNAASVLVGDFIYTRAFQLVAQLESLKILSIMADATNVLAEGEVQQLMNVN
DPETSEANYMRVIYSKTARLFEVAGQAAAIVAGGTEAQEKALQDYGRYLGTAFQLVDDVLDYSANTQALGKNVGDDLAEG
KPTLPLLHAMRHGNAQQAALIREAIEQGGKREAIDEVLAIMTEHKSLDYAMNRAKEEAQKAVDAIEILPESEYKQALISL
AYLSVDRNY
>Q9PM68 ~~~ispDF~~~Bifunctional enzyme IspD/IspF~~~COG0245
MSEMSLIMLAAGNSTRFNTKVKKQFLRLGNDPLWLYATKNLSSFYPFKKIVVTSSNITYMKKFTKNYEFIEGGDTRAESL
KKALELIDSEFVMVSDVARVLVSKNLFDRLIENLDKADCITPALKVADTTLFDNEALQREKIKLIQTPQISKTKLLKKAL
DQNLEFTDDSTAIAAMGGKIWFVEGEENARKLTFKEDLKKLDLPTPSFEIFTGNGFDVHEFGENRPLLLAGVQIHPTMGL
KAHSDGDVLAHSLTDAILGAAGLGDIGELYPDTDMKFKNANSMELLKQAYDKVREIGFELINIDICVMAQSPKLKDFKQA
MQSNIAHTLDLDEFRINVKATTTEKLGFIGRKEGMAVLSSVNLKYFDWTRL
>Q06755 2.7.7.60~~~ispD~~~2-C-methyl-D-erythritol 4-phosphate cytidylyltransferase~~~COG1211
MSYDVVIPAAGQGKRMKAGRNKLFIELKGDPVIIHTLRVFDSHRQCDKIILVINEQEREHFQQLLSDYPFQTSIELVAGG
DERQHSVYKGLKAVKQEKIVLVHDGARPFIKHEQIDELIAEAEQTGAAILAVPVKDTIKRVQDLQVSETIERSSLWAVQT
PQAFRLSLLMKAHAEAERKGFLGTDDASLVEQMEGGSVRVVEGSYTNIKLTTPDDLTSAEAIMESESGNKHV
>Q2SWT6 2.7.7.60~~~ispD~~~2-C-methyl-D-erythritol 4-phosphate cytidylyltransferase~~~
MTSRLFALIPCAGTGSRSGSALPKQYRTLAGRALLHYTLAAFDACSEFAQTLVVISPDDAHFDARRFAGLRFAVRRCGGA
SRQASVMNGLIQLAEFGATDADWVLVHDAARPGITPALIRTLIGALKDDPVGGIVALPVADTLKRVPAGGDAIERTESRN
GLWQAQTPQMFRIGMLRDAIQRAQLEGRDLTDEASAIEWAGHTPRVVQGSLRNFKVTYPEDFDLAEAILAHPARAS
>Q46893 2.7.7.60~~~ispD~~~2-C-methyl-D-erythritol 4-phosphate cytidylyltransferase~~~COG1211
MATTHLDVCAVVPAAGFGRRMQTECPKQYLSIGNQTILEHSVHALLAHPRVKRVVIAISPGDSRFAQLPLANHPQITVVD
GGDERADSVLAGLKAAGDAQWVLVHDAARPCLHQDDLARLLALSETSRTGGILAAPVRDTMKRAEPGKNAIAHTVDRNGL
WHALTPQFFPRELLHDCLTRALNEGATITDEASALEYCGFHPQLVEGRADNIKVTRPEDLALAEFYLTRTIHQENT
>Q743W5 2.7.7.60~~~ispD~~~2-C-methyl-D-erythritol 4-phosphate cytidylyltransferase~~~COG1211
MVPETGLAPETGSSGTVAAVVPAAGSGERLAAGIPKAFCEIDGASMLARAVAGLLDSKVVDHVVVAVPADRVDEAKRLLP
GQATVVAGGADRTASVRLALAAVPGNPAFVLVHDAARALTPPALIARVVQALRDGHRAVVPALPLHDTVKAVDANGVVLG
TPERDGLRAVQTPQGFATDLLLRAYAAGAGTAGFTDDASLVEHVGGQVQVVDGDPLAFKITTQLDLLLAETIVRR
>P9WKG9 2.7.7.60~~~ispD~~~2-C-methyl-D-erythritol 4-phosphate cytidylyltransferase~~~COG1211
MVREAGEVVAIVPAAGSGERLAVGVPKAFYQLDGQTLIERAVDGLLDSGVVDTVVVAVPADRTDEARQILGHRAMIVAGG
SNRTDTVNLALTVLSGTAEPEFVLVHDAARALTPPALVARVVEALRDGYAAVVPVLPLSDTIKAVDANGVVLGTPERAGL
RAVQTPQGFTTDLLLRSYQRGSLDLPAAEYTDDASLVEHIGGQVQVVDGDPLAFKITTKLDLLLAQAIVRG
>Q5F829 2.7.7.60~~~ispD~~~2-C-methyl-D-erythritol 4-phosphate cytidylyltransferase~~~
MKRKNIALIPAAGIGVRFGADKPKQYVEIGSKTVLEHVLGIFERHEAVDLTVVVVSPEDTFADKVQTAFPQVRVWKNGGQ
TRAETVRNGVAKLLETGLAAETDNILVHDAARCCLPSEALARLIEQAGNAAEGGILAVPVADTLKRAESGQISATVDRSG
LWQAQTPQLFQAGLLHRALAAENLGGITDEASAVEKLGVRPLLIQGDARNLKLTQPQDAYIVRLLLNAV
>Q9X1B3 2.7.7.60~~~ispD~~~2-C-methyl-D-erythritol 4-phosphate cytidylyltransferase~~~COG1211
MNVAILLAAGKGERMSENVPKQFLEIEGRMLFEYPLSTFLKSEAIDGVVIVTRREWFEVVEKRVFHEKVLGIVEGGDTRS
QSVRSALEFLEKFSPSYVLVHDSARPFLRKKHVSEVLRRARETGAATLALKNSDALVRVENDRIEYIPRKGVYRILTPQA
FSYEILKKAHENGGEWADDTEPVQKLGVKIALVEGDPLCFKVTFKEDLELARIIAREWERIP
>Q5SLX2 2.7.7.60~~~ispD~~~2-C-methyl-D-erythritol 4-phosphate cytidylyltransferase~~~COG1211
MEVSVLIPAAGNGLRLGRGPKAFLQVGGRTLLEWTLAAFRDAAEVLVALPPGAEPPKGLGAVFLEGGATRQASVARLLEA
ASLPLVLVHDVARPFVSRGLVARVLEAAQRSGAAVPVLPVPDTLMAPEGEAYGRVVPREAFRLVQTPQGFFTALLREAHA
YARRKGLEASDDAQLVQALGYPVALVEGEATAFKITHPQDLVLAEALARVWSA
>O67060 2.7.1.148~~~ispE~~~4-diphosphocytidyl-2-C-methyl-D-erythritol kinase~~~COG1947
MIKVLSPAKINLGLWVLGRLPSGYHEILTLYQEIPFYDEIYIREGVLRVETNIGIPQEENLVYKGLREFERITGIEINYS
IFIQKNIPPGAGLGGGSSNLAVVLKKVNELLGSPLSEEELRELVGSISADAPFFLLGKSAIGRGKGEVLEPVETEISGKI
TLVIPQVSSSTGRVYSSLREEHFVTPEYAEEKIQRIISGEVEEIENVLGDIARELYPEINEVYRFVEYLGFKPFVSGSGS
TVYFFGGASEELKKAAKMRGWKVVELEL
>Q8FI04 2.7.1.148~~~ispE~~~4-diphosphocytidyl-2-C-methyl-D-erythritol kinase~~~COG1947
MRTQWPSPAKLNLFLYITGQRADGYHTLQTLFQFLDYGDTISIELRDDGDIRLLTPVEGVEHEDNLIVRAARLLMKTAAD
SGRLSTGSGANISIDKRLPMGGGLGGGSSNAATVLVALNHLWQCGLSMDELAEMGLTLGADVPVFVRGHAAFAEGVGEIL
MPVDPPEKWYLVAHPGVSIPTPVIFKDPELPRNTPKRSIETLLKCEFSNDCEVIARKRFREVDAVLSWLLEYAPSRLTGT
GACVFAEFDTESEARQVLEQAPEWLNGFVAKGVNLSPLHRAML
>P62615 2.7.1.148~~~ispE~~~4-diphosphocytidyl-2-C-methyl-D-erythritol kinase~~~COG1947
MRTQWPSPAKLNLFLYITGQRADGYHTLQTLFQFLDYGDTISIELRDDGDIRLLTPVEGVEHEDNLIVRAARLLMKTAAD
SGRLPTGSGANISIDKRLPMGGGLGGGSSNAATVLVALNHLWQCGLSMDELAEMGLTLGADVPVFVRGHAAFAEGVGEIL
TPVDPPEKWYLVAHPGVSIPTPVIFKDPELPRNTPKRSIETLLKCEFSNDCEVIARKRFREVDAVLSWLLEYAPSRLTGT
GACVFAEFDTESEARQVLEQAPEWLNGFVAKGANLSPLHRAML
>P9WKG7 2.7.1.148~~~ispE~~~4-diphosphocytidyl-2-C-methyl-D-erythritol kinase~~~COG1947
MSASDGNTAELWVPTGSVTVRVPGKVNLYLAVGDRREDGYHELTTVFHAVSLVDEVTVRNADVLSLELVGEGADQLPTDE
RNLAWQAAELMAEHVGRAPDVSIMIDKSIPVAGGMAGGSADAAAVLVAMNSLWELNVPRRDLRMLAARLGSDVPFALHGG
TALGTGRGEELATVLSRNTFHWVLAFADSGLLTSAVYNELDRLREVGDPPRLGEPGPVLAALAAGDPDQLAPLLGNEMQA
AAVSLDPALARALRAGVEAGALAGIVSGSGPTCAFLCTSASSAIDVGAQLSGAGVCRTVRVATGPVPGARVVSAPTEV
>P83700 2.7.1.148~~~ispE~~~4-diphosphocytidyl-2-C-methyl-D-erythritol kinase~~~COG1947
MERLAPAKVNLGLSVRFRREDGYHELHTLFAPFSLADRLVVEPVSSGLHFQGPYGRENLAYRAASLYLEAAGQPGGVRIL
LEKRIPEGAGLGGGSSDAAQVLLALQALYPAEVDLFALARTLGADVPFFLLGRGAEARGVGERLKPLALPPVPAVVFFPG
LRVPTPLVYRAVRPEDFGPDLPVEAILEALARGEEPPYWNSLEGPAFRLFPELKEVRGRMRALGLRGVLMSGSGSAFFGL
AEGPDHARRAAEALRAWGRAWAGTLGGGDAGSGPA
>Q06756 4.6.1.12~~~ispF~~~2-C-methyl-D-erythritol 2,4-cyclodiphosphate synthase~~~COG0245
MFRIGQGFDVHQLVEGRPLIIGGIEIPYEKGLLGHSDADVLLHTVADACLGAVGEGDIGKHFPDTDPEFKDADSFKLLQH
VWGIVKQKGYVLGNIDCTIIAQKPKMLPYIEDMRKRIAEGLEADVSQVNVKATTTEKLGFTGRAEGIAAQATVLIQKG
>B4EC22 4.6.1.12~~~ispF~~~2-C-methyl-D-erythritol 2,4-cyclodiphosphate synthase~~~COG0245
MDFRIGQGYDVHQLVEGRPLIIGGVTIPYERGLLGHSDADVLLHAITDALFGAAALGDIGRHFSDTDAAFKGADSRVLLR
ACAERVKAAGFTIQNVDSTVIAQAPKLAPHIDGMRANIAADLGLPLERVNVKAKTNEKLGYLGRGEGIEAQAAALLVKQG
G
>A3NWD9 4.6.1.12~~~ispF~~~2-C-methyl-D-erythritol 2,4-cyclodiphosphate synthase~~~
MDFRIGQGYDVHQLVPGRPLIIGGVTIPYERGLLGHSDADVLLHAITDALFGAAALGDIGRHFSDTDPRFKGADSRALLR
ECASRVAQAGFAIRNVDSTIIAQAPKLAPHIDAMRANIAADLDLPLDRVNVKAKTNEKLGYLGRGEGIEAQAAALVVREA
AA
>Q3JRA0 4.6.1.12~~~ispF~~~2-C-methyl-D-erythritol 2,4-cyclodiphosphate synthase~~~
MDFRIGQGYDVHQLVPGRPLIIGGVTIPYERGLLGHSDADVLLHAITDALFGAAALGDIGRHFSDTDPRFKGADSRALLR
ECASRVAQAGFAIRNVDSTIIAQAPKLAPHIDAMRANIAADLDLPLDRVNVKAKTNEKLGYLGRGEGIEAQAAALVVREA
AA
>Q63T71 4.6.1.12~~~ispF~~~2-C-methyl-D-erythritol 2,4-cyclodiphosphate synthase~~~COG0245
MDFRIGQGYDVHQLVPGRPLIIGGVTIPYERGLLGHSDADVLLHAITDALFGAAALGDIGRHFSDTDPRFKGADSRALLR
ECASRVAQAGFAIRNVDSTIIAQAPKLAPHIDAMRANIAADLDLPLDRVNVKAKTNEKLGYLGRGEGIEAQAAALVVREA
AA
>P62617 4.6.1.12~~~ispF~~~2-C-methyl-D-erythritol 2,4-cyclodiphosphate synthase~~~COG0245
MRIGHGFDVHAFGGEGPIIIGGVRIPYEKGLLAHSDGDVALHALTDALLGAAALGDIGKLFPDTDPAFKGADSRELLREA
WRRIQAKGYTLGNVDVTIIAQAPKMLPHIPQMRVFIAEDLGCHMDDVNVKATTTEKLGFTGRGEGIACEAVALLIKATK
>Q5NFU1 4.6.1.12~~~ispF~~~2-C-methyl-D-erythritol 2,4-cyclodiphosphate synthase~~~COG0245
MSFRIGHGYDVHKFTSAKQNIIIGGVEIAYHLGLEAHSDGDVLIHALCDAILGALGLGDIGKHFLDTDNQFKNIDSKFFL
AEIKKMLDKKQYSISNIDCTIIAQAPKMLPHIEKMRACLANILEIQISQINIKATTTERLGFIGREEGIATHVVCLLYR
>P44815 4.6.1.12~~~ispF~~~2-C-methyl-D-erythritol 2,4-cyclodiphosphate synthase~~~COG0245
MIRIGHGFDVHAFGEDRPLIIGGVEVPYHTGFIAHSDGDVALHALTDAILGAAALGDIGKLFPDTDMQYKNADSRGLLRE
AFRQVQEKGYKIGNVDITIIAQAPKMRPHIDAMRAKIAEDLQCDIEQVNVKATTTEKLGFTGRQEGIACEAVALLIRQ
>P9WKG5 4.6.1.12~~~ispF~~~2-C-methyl-D-erythritol 2,4-cyclodiphosphate synthase~~~COG0245
MNQLPRVGLGTDVHPIEPGRPCWLVGLLFPSADGCAGHSDGDVAVHALCDAVLSAAGLGDIGEVFGVDDPRWQGVSGADM
LRHVVVLITQHGYRVGNAVVQVIGNRPKIGWRRLEAQAVLSRLLNAPVSVSATTTDGLGLTGRGEGLAAIATALVVSLR
>P57708 4.6.1.12~~~ispF~~~2-C-methyl-D-erythritol 2,4-cyclodiphosphate synthase~~~
MRIGHGYDVHRFGEGDFITLGGVRIPHKHGLVAHSDGDVLLHALSDALLGAAALGDIGKHFPDTDPRFKGADSRALLRHV
VAIVAEKGWKVGNVDATIVAQAPKMAPHIETMRGLIAEDLGVAVDQVNVKATTTERLGFTGREEGIAVHAVALLMAR
>Q8ZMF7 4.6.1.12~~~ispF~~~2-C-methyl-D-erythritol 2,4-cyclodiphosphate synthase~~~
MRIGHGFDVHAFGGEGPIIIGGVRIPYEKGLLAHSDGDVALHALTDALLGAAALGDIGKLFPDTDPAFKGADSRELLREA
WRRIQAKGYTLGNVDVTIIAQAPKMLPHIPQMRVFIAEDLGCHMDEVNVKATTTEKLGFTGRGEGIACEAVALLMKAAK
>Q8EBR3 4.6.1.12~~~ispF~~~2-C-methyl-D-erythritol 2,4-cyclodiphosphate synthase~~~COG0245
MKIRIGHGFDVHKFGEPRPLILCGVEVPYETGLVAHSDGDVVLHAISDAILGAMALGDIGKHFPDTDAAYKGADSRVLLR
HCYALAKAKGFELGNLDVTIIAQAPKMAPHIEDMRQVLAADLNADVADINVKATTTEKLGFTGRKEGIAVEAVVLLSRQ
>Q8RQP5 4.6.1.12~~~ispF~~~2-C-methyl-D-erythritol 2,4-cyclodiphosphate synthase~~~COG0245
MRIGYGEDSHRLEEGRPLYLCGLLIPSPVGALAHSDGDAALHALTDALLSAYGLGDIGLLFPDTDPRWRGERSEVFLREA
LRLVEARGAKLLQASLVLTLDRPKLGPHRKALVDSLSRLLRLPQDRIGLTFKTSEGLAPSHVQARAVVLLDG
>Q8ZBP7 4.6.1.12~~~ispF~~~2-C-methyl-D-erythritol 2,4-cyclodiphosphate synthase~~~COG0245
MRIGHGFDVHKFGENGSGPLIIGGVRIPYEKGLLAHSDGDVALHAATDALLGAAALGDIGKLFPDTDPAFKGADSRGLLR
EAYRRILAKGYKLGNLDITIIAQAPKMAPHIPQMRVNLAEDLQCHMDDINVKATTTEQLGFTGRGEGIACEAVVLLVNVE
QG
>O67496 1.17.7.3~~~ispG~~~4-hydroxy-3-methylbut-2-en-1-yl diphosphate synthase (flavodoxin)~~~COG0821
MIQKRKTRQIRVGNVKIGGDAPIVVQSMTSTKTHDVEATLNQIKRLYEAGCEIVRVAVPHKEDVEALEEIVKKSPMPVIA
DIHFAPSYAFLSMEKGVHGIRINPGNIGKEEIVREIVEEAKRRGVAVRIGVNSGSLEKDLLEKYGYPSAEALAESALRWS
EKFEKWGFTNYKVSIKGSDVLQNVRANLIFAERTDVPLHIGITEAGMGTKGIIKSSVGIGILLYMGIGDTVRVSLTDDPV
VEVETAYEILKSLGLRRRGVEIVACPTCGRIEVDLPKVVKEVQEKLSGVKTPLKVAVMGCVVNAIGEAREADIGLACGRG
FAWLFKHGKPIKKVDESEMVDELLKEIQNMEKDGGTN
>Q81LV7 1.17.7.3~~~ispG~~~4-hydroxy-3-methylbut-2-en-1-yl diphosphate synthase (flavodoxin)~~~COG0821
MTHRTKTRPVKVGNLTIGGNNELIIQSMTTTKTHDVEATVAEIKRLEEAGCQVVRVAVPDERAANAIADIKKQINIPLVA
DIHFDYRLALKAIEGGIDKVRINPGNIGRRHKVEAVVNAAKERGIPIRIGVNAGSLERHILEKYGYPTADGMVESALHHI
KILEDLDFHDIIVSMKASDVNLAIEAYEKAARAFDYPLHLGITESGTLFAGTVKSAAGLGAILNKGIGNTLRISLSADPV
EEVKVARELLKSFGLASNAATLISCPTCGRIEIDLISIANEVEEYISTLQVPIKVAVLGCAVNGPGEAREADIGIAGARG
EGLLFRKGQVVRKVPEEIMVEELKKEIDVIAAEMAAEREKEKETQEQ
>P62620 1.17.7.3~~~ispG~~~4-hydroxy-3-methylbut-2-en-1-yl diphosphate synthase (flavodoxin)~~~COG0821
MHNQAPIQRRKSTRIYVGNVPIGDGAPIAVQSMTNTRTTDVEATVNQIKALERVGADIVRVSVPTMDAAEAFKLIKQQVN
VPLVADIHFDYRIALKVAEYGVDCLRINPGNIGNEERIRMVVDCARDKNIPIRIGVNAGSLEKDLQEKYGEPTPQALLES
AMRHVDHLDRLNFDQFKVSVKASDVFLAVESYRLLAKQIDQPLHLGITEAGGARSGAVKSAIGLGLLLSEGIGDTLRVSL
AADPVEEIKVGFDILKSLRIRSRGINFIACPTCSRQEFDVIGTVNALEQRLEDIITPMDVSIIGCVVNGPGEALVSTLGV
TGGNKKSGLYEDGVRKDRLDNNDMIDQLEARIRAKASQLDEARRIDVQQVEK
>P9WKG3 1.17.7.3~~~ispG~~~4-hydroxy-3-methylbut-2-en-1-yl diphosphate synthase (flavodoxin)~~~COG0821
MTVGLGMPQPPAPTLAPRRATRQLMVGNVGVGSDHPVSVQSMCTTKTHDVNSTLQQIAELTAAGCDIVRVACPRQEDADA
LAEIARHSQIPVVADIHFQPRYIFAAIDAGCAAVRVNPGNIKEFDGRVGEVAKAAGAAGIPIRIGVNAGSLDKRFMEKYG
KATPEALVESALWEASLFEEHGFGDIKISVKHNDPVVMVAAYELLAARCDYPLHLGVTEAGPAFQGTIKSAVAFGALLSR
GIGDTIRVSLSAPPVEEVKVGNQVLESLNLRPRSLEIVSCPSCGRAQVDVYTLANEVTAGLDGLDVPLRVAVMGCVVNGP
GEAREADLGVASGNGKGQIFVRGEVIKTVPEAQIVETLIEEAMRLAAEMGEQDPGATPSGSPIVTVS
>Q72H18 1.17.7.3~~~ispG~~~4-hydroxy-3-methylbut-2-en-1-yl diphosphate synthase (flavodoxin)~~~COG0821
MEGMRRPTPTVYVGRVPIGGAHPIAVQSMTNTPTRDVEATTAQVLELHRAGSEIVRLTVNDEEAAKAVPEIKRRLLAEGV
EVPLVGDFHFNGHLLLRKYPKMAEALDKFRINPGTLGRGRHKDEHFAEMIRIAMDLGKPVRIGANWGSLDPALLTELMDR
NASRPEPKSAHEVVLEALVESAVRAYEAALEMGLGEDKLVLSAKVSKARDLVWVYRELARRTQAPLHLGLTEAGMGVKGI
VASAAALAPLLLEGIGDTIRVSLTPSPKEPRTKEVEVAQEILQALGLRAFAPEVTSCPGCGRTTSTFFQELAEEVSRRLK
ERLPEWRARYPGVEELKVAVMGCVVNGPGESKHAHIGISLPGAGEEPKAPVYADGKLLTILKGEGIAEEFLRLVEDYVKT
RFAPKA
>Q5SLI8 1.17.7.3~~~ispG~~~4-hydroxy-3-methylbut-2-en-1-yl diphosphate synthase (flavodoxin)~~~COG0821
MEGMRRPTPTVYVGRVPIGGAHPIAVQSMTNTPTRDVEATTAQVLELHRAGSEIVRLTVNDEEAAKAVPEIKRRLLAEGA
EVPLVGDFHFNGHLLLRKYPKMAEALDKFRINPGTLGRGRHKDEHFAEMIRIAMDLGKPVRIGANWGSLDPALLTELMDR
NARRPEPKSAHEVVLEALVESAVRAYEAALEMGLGEDKLVLSAKVSKARDLVWVYRELARRTQAPLHLGLTEAGMGVKGI
VASAAALAPLLLEGIGDTIRVSLTPAPGEPRTKEVEVAQEILQALGLRAFAPEVTSCPGCGRTTSTFFQELAEEVSRRLK
ERLPEWRARYPGVEELKVAVMGCVVNGPGESKHAHIGISLPGAGEEPKAPVYADGKLLTILKGEGIAEEFLRLVEDYVKT
RFAPKA
>Q84GJ3 1.17.7.3~~~ispG~~~4-hydroxy-3-methylbut-2-en-1-yl diphosphate synthase (flavodoxin)~~~
MEGMRRPTPTVYVGRVPIGGAHPIAVQSMTNTPTRDVEATTAQVLELHRAGSEIVRLTVNDEEAAKAVPEIKRRLLAEGV
EVPLVGDFHFNGHLLLRKYPKMAEALDKFRINPGTLGRGRHKDEHFAEMIRIAMDLGKPVRIGANWGSLDPALLTELMDR
NASRPEPKSAHEVVLEALVESAVRAYEAALEMGLGEDKLVLSAKVSKARDLVWVYRELARRTQAPLHLGLTEAGMGVKGI
VASAAALAPLLLEGIGDTIRVSLTPSPKEPRTKEVEVAQEILQALGLRAFAPEVTSCPGCGRTTSTFFQELAEEVSRRLK
ERLPEWRARYPGVEELKVAVMGCVVNGPGESKHAHIGISLPGAGEEPKAPVYADGKLLTILKGEGIAEEFLRLVEDYVKT
RFAPKA
>Q8DK70 1.17.7.1~~~ispG~~~4-hydroxy-3-methylbut-2-en-1-yl diphosphate synthase (ferredoxin)~~~COG0821
MQTLPSPVQATPTETAIVRRKTRPVPIGSVVIGGGHPVAVQSMINEDTLDIEGSVAAIRRLHEIGCEIVRVTVPSLAHAK
AMEEIRDRLYKTYKPVPLVADVHHNGMKIALEVAKYVDNVRINPGLYVFEKPKPNRTEYTQAEFDEIGAKIRETLEPLVI
SLRDQGKSMRIGVNHGSLAERMLFTYGDTPEGMVESALEFIRICESLNFYNLEISLKASRVPVMIAANRLMVKRMDELGM
DYPLHLGVTEAGDGEYGRIKSTAGIATLLAEGIGDTIRVSLTEAPEKEIPVCYGILQALGLRRTMVEYVACPSCGRTLFN
LEEVLHKVREATKHLTGLNIAVMGCIVNGPGEMADADYGYVGKQPGYISLYRGREEVKKVPEAEGVAALVELIKADGRWV
DP
>P9WKF9 1.17.7.4~~~ispH1~~~4-hydroxy-3-methylbut-2-enyl diphosphate reductase 1~~~COG0761
MAEVFVGPVAQGYASGEVTVLLASPRSFCAGVERAIETVKRVLDVAEGPVYVRKQIVHNTVVVAELRDRGAVFVEDLDEI
PDPPPPGAVVVFSAHGVSPAVRAGADERGLQVVDATCPLVAKVHAEAARFAARGDTVVFIGHAGHEETEGTLGVAPRSTL
LVQTPADVAALNLPEGTQLSYLTQTTLALDETADVIDALRARFPTLGQPPSEDICYATTNRQRALQSMVGECDVVLVIGS
CNSSNSRRLVELAQRSGTPAYLIDGPDDIEPEWLSSVSTIGVTAGASAPPRLVGQVIDALRGYASITVVERSIATETVRF
GLPKQVRAQ
>P9WKG1 1.17.7.4~~~ispH2~~~4-hydroxy-3-methylbut-2-enyl diphosphate reductase 2~~~COG0761
MVPTVDMGIPGASVSSRSVADRPNRKRVLLAEPRGYCAGVDRAVETVERALQKHGPPVYVRHEIVHNRHVVDTLAKAGAV
FVEETEQVPEGAIVVFSAHGVAPTVHVSASERNLQVIDATCPLVTKVHNEARRFARDDYDILLIGHEGHEEVVGTAGEAP
DHVQLVDGVDAVDQVTVRDEDKVVWLSQTTLSVDETMEIVGRLRRRFPKLQDPPSDDICYATQNRQVAVKAMAPECELVI
VVGSRNSSNSVRLVEVALGAGARAAHLVDWADDIDSAWLDGVTTVGVTSGASVPEVLVRGVLERLAECGYDIVQPVTTAN
ETLVFALPRELRSPR
>O67625 1.17.7.4~~~ispH~~~4-hydroxy-3-methylbut-2-enyl diphosphate reductase~~~COG0761
MVDIIIAEHAGFCFGVKRAVKLAEESLKESQGKVYTLGPIIHNPQEVNRLKNLGVFPSQGEEFKEGDTVIIRSHGIPPEK
EEALRKKGLKVIDATCPYVKAVHEAVCQLTREGYFVVLVGEKNHPEVIGTLGYLRACNGKGIVVETLEDIGEALKHERVG
IVAQTTQNEEFFKEVVGEIALWVKEVKVINTICNATSLRQESVKKLAPEVDVMIIIGGKNSGNTRRLYYISKELNPNTYH
IETAEELQPEWFRGVKRVGISAGASTPDWIIEQVKSRIQEICEGQLVSS
>Q72G08 1.17.7.4~~~ispH~~~4-hydroxy-3-methylbut-2-enyl diphosphate reductase~~~COG0761
MNVIRARTAGFCMGVSLALRKLDREVDRAEEKAAQGSPRCRIATFGPIIHNPQVLEAYAGMGVRCLRQVDEVEAGDHVVI
RAHGVPQQQEKALRSRDAVVVDATCPKVKKAQLGIEEQCRAGRTLLLFGEAEHPEVRGLLSYAGEGALVFGSVDELEGLP
LQPETEYFLAAQTTQDRVAFEVARAWLHERLGHEVPVLETICDATRLRQQEAIDIARKVDAMVVVGGFDSGNTRRLADVA
AAQGVFTVHVENESQLPVEQLRGCGIIGLTAGASTPKSIIDATQRFLESL
>P62623 1.17.7.4~~~ispH~~~4-hydroxy-3-methylbut-2-enyl diphosphate reductase~~~COG0761
MQILLANPRGFCAGVDRAISIVENALAIYGAPIYVRHEVVHNRYVVDSLRERGAIFIEQISEVPDGAILIFSAHGVSQAV
RNEAKSRDLTVFDATCPLVTKVHMEVARASRRGEESILIGHAGHPEVEGTMGQYSNPEGGMYLVESPDDVWKLTVKNEEK
LSFMTQTTLSVDDTSDVIDALRKRFPKIVGPRKDDICYATTNRQEAVRALAEQAEVVLVVGSKNSSNSNRLAELAQRMGK
RAFLIDDAKDIQEEWVKEVKCVGVTAGASAPDILVQNVVARLQQLGGGEAIPLEGREENIVFEVPKELRVDIREVD
>O31751 2.5.1.-~~~uppS~~~Isoprenyl transferase~~~COG0020
MLNILKNWKNQQTAASNLERYTKEDILKGEIPEHIAIIMDGNGRWAKKRSLPRIAGHHEGMKVVKRTTKLANELGVKVLT
LYAFSTENWKRPKMEVDFLMKLPEEFLNTYLPELVEENVQVRIIGDETALPAHTLRAIEKAVQDTAQNDGMILNFALNYG
GRTEIVSAAKSLAEKVKEGSLNIEDIDESLFSTYLMTESLQDPELLIRTSGEIRLSNFMLWQVAYSEFVFTDVLWPDFKE
DHFLQALGEFQQRGRRFGGI
>Q9PP99 2.5.1.-~~~uppS~~~Isoprenyl transferase~~~COG0020
MNELKHLAVVMDGNRRWARAKGFLAKLGYSQGVKTMQKLMEVCMEENISNLSLFAFSTENWKRPKDEIDFIFELLDRCLD
EALEKFEKNNVRLRAIGDLSRLEDKVREKITLVEEKTKHCDALCVNLAISYGARDEIIRAAKRVIEKKLELNEENLTQNL
DLPLDVDLMLRVGNAKRLSNFLLWQCSYAEIYFSETLFPSLTKREFKRIIKEFRNRERTFGK
>Q831K9 2.5.1.-~~~uppS~~~Isoprenyl transferase~~~COG0020
MLRFFPQKNKYVEEASQYAFDKEGQIPQHIAIIMDGNGRWAQNRRLPRIAGHKEGMDTVKKITKHASHLGVKVLTLYAFS
TENWKRPTDEVNFLMQLPVDFFDTFVPELIKENVKVNVMGYQEFLPSHTQDAVKRAIEQTKDNTGMVLNFALNYGARAEL
LTAMKQIAAEVSEKAYTADEITEETIADHLMTGFLPTELRDPELLIRTSGEERISNFLLWQIAYSELFFTKALWPDFSGD
TLETAIASFQNRNRRFGGLKETTETEGSDPQ
>P55984 2.5.1.-~~~uppS~~~Isoprenyl transferase~~~COG0020
MDNTLKHLAIIMDGNGRWAKLKNKARAYGHKKGVKTLKDITIWCANHKLECLTLYAFSTENWKRPKSEVDFLMKMLKKYL
KDERSTYLNNNIRFRAIGDLEGFSKELRDTILQLENDTRHFKDFTQVLALNYGSKNELSRAFKSLLESPPSHINLLESLE
NEISNRLDTHDLPEVDLLLRTGGEMRLSNFLLWQSSYAELFFTPILWPDFTPKDLENIISDFYKRVRKFGELKC
>P60477 2.5.1.-~~~uppS~~~Isoprenyl transferase~~~
MFKKLINKKNTINNYNEELDSSNIPEHIAIIMDGNGRWAKKRKMPRIKGHYEGMQTIKKITRIASDIGVKYLTLYAFSTE
NWSRPESEVNYIMNLPVNFLKTFLPELIEKNVKVETIGFTDKLPKSTIEAINNAKEKTANNTGLKLIFAINYGGRAELVH
SIKNMFDELHQQGLNSDIIDETYINNHLMTKDYPDPELLIRTSGEQRISNFLIWQVSYSEFIFNQKLWPDFDEDELIKCI
KIYQSRQRRFGGLSEE
>Q97SR4 2.5.1.-~~~uppS~~~Isoprenyl transferase~~~COG0020
MFGFFKKDKAVEVEVPTQVPAHIGIIMDGNGRWAKKRMQPRVFGHKAGMEALQTVTKAANKLGVKVITVYAFSTENWTRP
DQEVKFIMNLPVEFYDNYVPELHANNVKIQMIGETDRLPKQTFEALTKAEELTKNNTGLILNFALNYGGRAEITQALKLI
SQDVLDAKINPGDITEELIGNYLFTQHLPKDLRDPDLIIRTSGELRLSNFLPWQGAYSELYFTDTLWPDFDEAALQEAIL
AYNRRHRRFGGV
>Q8DRB3 2.5.1.-~~~uppS~~~Isoprenyl transferase~~~COG0020
MFGFFKKDKAVEVEVPTQVPAHIGIIMDGNGRWAKKRMQPRVFGHKAGMEALQTVTKAANKLGVKVITVYAFSTENWTRP
DQEVKFIMNLPVEFYDNYVPELHANNVKIQMIGETDRLPKQTFEALTKAEELTKNNTGLILNFALNYGGRAEITQALKLI
SQDVLDAKINPGDITEELIGNYLFTQHLPKDLRDPDLIIRTSGELRLSNFLPWQGAYSELYFTDTLWPDFDEAALQEAIL
AYNRRHRRFGGV
>P29139 3.4.21.-~~~isp~~~Intracellular serine protease~~~COG1404
MERKVHIIPYQVIKQEQQVNEIPRGVEMIQAPAVWNQTRGRGVKVAVLDTGCDADHPDLKARIIGGRNFTDDDEGDPEIF
KDYNGHGTHVAGTIAATENENGVVGVAPEADLLIIKVLNKQGSGQYDWIIQGIYYAIEQKVDIISMSLGGPEDVPELHEA
VKKAVASQILVMCAAGNEGDGDDRTDELGYPGCYNEVISVGAINFDRHASEFSNSNNEVDLVAPGEDILSTVPGGKYATF
SGTSMATPHVAGALALIKQLANASFERDLTEPELYAQLIKRTIPLGNSPKMEGNGLLYLTAVEELSRIFDTQRVAGILST
ASLKVK
>Q45619 ~~~~~~Insertion sequence IS5376 putative ATP-binding protein~~~
MKERIHEYCHRLHLPVMAERWSAMAEYASTHNISYSEFLFRLLEAEIVEKQARSIQTLIKLSKLPYRKTIDTFDFTAQPS
VDERRIRELLTLSFIDRKENILFLGPPGIGKTHLAISIGMEAIARGYKTYFITAHDLVNQLRRADQEGKLEKKLRVFVKP
TVLIIDEMGYLKLDPNSAHYLFQVIARRYEHAPIILTSNKSFGEWGEIVGDSVLATAMLDRLLHHSIIFNLKGESYRLRE
KRLQEEKQKDQ
>Q47316 6.3.2.38~~~iucA~~~N(2)-citryl-N(6)-acetyl-N(6)-hydroxylysine synthase~~~
MILPSEKSATDVAAQCFLNALIRETKDWQLAEYPPDELIIPLDEQKSLHFRVAYFSPTQHHRFAFPAHLVTASGSYPVDF
TTLSRLIIDKLRHQLFLPVPLCETFHQRVLESYAHTQQTIDARHDWAILREKALNFGEAEQALLTGHAFHPAPKSHEPFN
RQEAERYLPDMAPHFPLRWFSVDKTQIAGESLHLNLQQRLTRFAAENAPQLLNELSDNQWLFPLRPWQGEYLFQQVWCQA
LFAKGLIRDLGEAGTSWLPTTSSRSLYCATSRDMIKFSLSVRLTNSVRTLSVKEVERGMRLARLAQTDGWQMLQARFPTF
RVMQEDDWTGLRDLNGNIMQESLFSPAWKTLLLEQPQSQTNVLVSLTQAGPHGGDSLLVSAVKRLSDRLGITVQQAAHAW
VDAYCQQVLKPLFTAEADYGLVLLAHQQNILVQMLGDLPVGFIYRDCQGSAFMPHATEWLDTIDEAQAENIFTREQLLRY
FPYYLLVNSTFAVTAALGAAGLDSEANLMARVRTLLAEVRDQVTHKTCLNYVLESPYWNVKGNFFCYLNDHNENTIVDPS
VIYFDFANPLQAQEV
>Q76BS7 6.3.2.38~~~iucA~~~N(2)-citryl-N(6)-acetyl-N(6)-hydroxylysine synthase~~~COG4264
MHRNNERINLEKSNDWANTNEPSIACFLNSLARESQSVQLLWGEDGKRVYRLPLANSDSINIPLSYFSSLGSHEYCLPAL
LHTQDSIKTLSVEQLIEHIVNEPALVGIVSEAKKAIFTKRVLESHRNTEQAIEHSPYQEQLFTEQLDFKTAEQGLLIGHS
FHPAPKSREQFSLSDAKLYSPELGGQFKLFWLSVEQSLLTSGSSADIHFNQRFEALVAHDPKLVEALQNAQQQGHELLPV
HPWQWHVMVENPSIKGYIATKQIQNLGQLGATWYPTSSTRSLYAPGLPYMLKFSLSVKLTNSIRNLSLKECDSWNDLNDL
FQHPQLAQQLGNGRGFQLMQEPAYIGLKDLNGKIIDESLVAFRDNPLMNNPAEEAVVLATLTQQNPYGGSSLVAARIEHY
ATQQHLSLHQAASLWFDAYCRHAVVPLFHLQANFGIVFLAHQQNIVMQLEQGFPVGMYYRDCQGTGYTDLAFKLFGEQLG
DRKEALENYWNQDKVRRYFAYYLIINSTFNLISAICANLDVEESELIEILYHNLNALLQSGVKDDLCLRYVLTSEALCCK
GNFFCYLQNFNENSIPDPAVIYFDLPNPLARVEEIAHV
>Q47317 2.3.1.102~~~iucB~~~N(6)-hydroxylysine O-acetyltransferase~~~
MSGPNIVHSGYGLRCEKLDKPLNLGWGLDNSAVLHWPGELPTGWLCDALDQIFIAAPQLSAVVLPWSEWCEEPQALTLFG
QVQSDIIHRSAFWQLPLWLSSPANRASGEMVFDAEREIYFPQRPPRPQGEVYRRYDPRIRRMLSFRIADPVSDAERFTRW
MNDPRVEYFWEQSGSLEVQIAYLERQLTSKHAFPLIGCFDDRPVSNIEIYWAAEDRIGRHYSWQPFDRGLHLLVGEQQWR
GAHYVQSWLRGVTHYLLLNEPRTQRTVLEPRTDNQRLFRHLEPAGYRTIKEFDFPHKRSRMVMADRHHFFTEVGL
>Q47318 6.3.2.39~~~iucC~~~Aerobactin synthase~~~
MNHKDWDLVNRRLVAKMLSELEYEQVFHAESQGDDRYCINLPGAQWRFIAERGIWGWLWIDAQTLRCADEPVLAQTLLMQ
LKQVLSMSDATVAEHMQDLYATLLGDLQLLKARRGLSASDLINLNADRLQCLLSGHPKFVFNKGRRGWGKEALERYAPEY
ANTFRLHWLAVKREHMIWRCDNEMDIHQLLTAAMDPQEFARFSQVWQENGLDHNWLPLPVHPWQWQEKIATDFIADFGEG
RMVSLGEFGDQWLAQQSLRTLTNASRRGGLDIKLPLTIYNTSCYRGIPGRYIAAGPLASRWLQQVFATDATLVQSGAVIL
GEPAAGYVSHEGYAALARAPYRYQEMLGVIWRENPCRWLKPDESPFLMATLMEWDENNQPLAGAYIDRSGLDAETWLTQL
FRVVVVPLYHLLCRYGVALIAHGQNITLAMKEGVPQRVLLKDFQGDMRLVKEEFPEMDSLPQEVRDVTSRLSADYLIHDL
QTGHFVTVLRFISPLMVRLGVPERRFYQLLAAVLSDYMKKHPQMSERFALFSLFRPQIIRVVLNPVKLTWPDLDGGSRML
PNYLEDLQNPLWLVTQEYES
>Q76BS5 6.3.2.39~~~iucC~~~Aerobactin synthase~~~
MVMDQLALHRYWAVANQKMVGKILSEFAYEQAFQFEPTAQGYQLNLENGTRYCFAGEENIWGQVMIDPTSITRHAEIEAD
EPISAALLMRDLQPLLKMPDDAFAEHLEDLNATLLGDCKLMQRNEAITARDLAMLPCEQQQTYFDGHPKFVFNKGRRGWG
SDDLKRYAPEAERVSNWVGSRFITRFCSSPPTMKSHGKPCCKAPSRPMKSSRWTVCWLPISLDSTIIVMFRFILWQWSNK
LALLFVREIATKQLVYLGEFGDHFLPQLSLRTLSNVTRPAGYDIKLPLTVMNTSCYRGIPGRYILAGPTASDWIDQVFKS
DPLLIAKQAEVLQEPAAAFAAQADYALLPNAPYRYHELLGVIWRESAASKLKAGERAILMAALMESDNQGQPLIAEYVQA
SGLTLEAWLSKLFDAVVIPYYHLLCNYGVSLIAHGQNVTLVLENHAPKRILLKDFQGDMRLVSREYPEQASLDDSVKKVT
VRLPEHLIIHDLQTGHFVTTLRFISPLVAKLGFSEPQFYRLLGDRLKAYMAAHREYQPRFEQFDLFKPRILRIGLNLAKF
RHSTDASASRMLPDMDDMLNNPLTKALEHQG
>P11295 1.14.13.59~~~iucD~~~L-lysine N6-monooxygenase~~~
MKKSVDFIGVGTGPFNLSIAALSHQIEELDCLFFDEHPHFSWHPGMLVPDCHMQTVFLKDLVSAVAPTNPYSFVNYLVKH
KKFYRFLTSRLRTVSREEFSDYLRWAAEDMNNLYFSHTVENIDFDKKRRLFLVQTSQGEYFARNICLGTGKQPYLPPCVK
HMTQSCFHASEMNLRRPDLSGKRITVVGGGQSGADLFLNALRGEWGEAAEINWVSRRNNFNALDEAAFADEYFTPEYISG
FSGLEEDIRHQLLDEQKMTSDGITADSLLTIYRELYHRFEVLRKPRNIRLLPSRSVTTLESSGPGWKLLMEHHLDQGRES
LESDVVIFATGYRSALPQILPSLMPLITMHDKNTFKVRDDFTLEWSGPKENNIFVVNASMQTHGIAEPQLSLMAWRSARI
LNRVMGRDLFDLSMPPALIQWRSGT
>P14542 ~~~iutA~~~Ferric aerobactin receptor~~~
MMISKKYTLWALNPLLLTMMAPAVAQQTDDETFVVSANRSNRTVAEMAQTTWVIENAELEQQIQGGKELKDALAQLIPGL
DVSSRSRTNYGMNVRGRPLVVLVDGVRLNSSRTDSRQLDSIDPFNMHHIEVIFGATSLYGGGSTGGLINIVTKKGQPETM
MEFEAGTKSGFSSSKDHDERIAGAVSGGNEHISGRLSVAYQKFGGWFDGNGDATLLDNTQTGLQYSDRLDIMGTGTLNID
ESRQLQLITQYYKSQGDDDYGLNLGKGFSAIRGTSTPFVSNGLNSDRIPGTDGHLISLQYSDSAFLGQELVGQVYYRDES
LRFYPFPTVNANKQVTAFSSSQQDTDQYGMKLTLNSKPMDGWQITWGLDADHERFTSNQMFFDLAQASASGGLNNKKIYT
TGRYPSYDITNLAAFLQSGYDINNLFTLNGGVRYQYTENKIDDFIGYAQQRQIGAGKATSADAFWRLSRLRHFLFNAGLL
MHITEPQQAWLNFSQGLELPDPGKYYGRGIYGAAVNGHLPLTKSVNVSDSKLEGVKVDSYELGWRFTGNNLRTQIAAYYS
ISDKSVVANKDLTISVVDDKRRIYGVEGAVDYLIPDTDWSTGVNFNVLKTESKVNGTWQKYDVKTASPSKATAYIGWAPD
PWSLRVQSTTSFDVSDAQGYKVDGYTTVDLLGSYQLPVGTLSFSIENLFDRDYTTVWGQRAPLYYSPGYGPASLYDYKGR
GRTFGLNYSVLF
>Q6U607 ~~~iutA~~~Ferric aerobactin receptor~~~
MMISKKYTLWALNPLLLTMMAPAVAQQTDDETFVVSANRSNRTVAEMAQTTWVIENAELEQQIQGGKELKDALAQLIPGL
DVSSRSRTNYGMNVRGRPLVVLVDGVRLNSSRTDSRQLDSIDPFNIDRIEVISGATSLYGGGSTGGLINIVTKKGQPETI
MEFEAGTKSGFSSSKDHDERIAGAVSGGNEHISGRLSVAYQKFGGWFDGNGDATLLDNTQTGLQYSDRLDIMGTGTLNID
ESRQLQLITQYYKSQGDDDYGLNLGKGFSAIRGTSTPFVSNGLNSDRIPGTERHLISLQYSDSAFLGQELVGQVYYRDES
LRFYPFPTVNANKQVTAFSSSQQDTDQYGMKLTLNSKPMDGWQITWGLDADHERFTSNQMFFDLAQASASGGLNNKKIYT
TGRYPSYDITNLAAFLQSGYDINNLFTLNGGVRYQYTENKIDDFIGYAQQRQIAAGKATSADAIPGGSVDYDNFLFNAGL
LMHITERQQAWLNFSQGVELPDPGKYYGRGIYGAAVNGHLPLTKSVNVSDSKLEGVKVDSYELGWRFTGNNLRTQIAAYY
SISDKSVVANKDLTISVVDDKRRIYGVEGAVDYLIPDTDWSTGVNFNVLKTESKVNGTWQKYDVKTASPSKATAYIGWAP
DPWSLRVQSTTSFDVSDAQGYKVDGYTTVDLLGSYQLPVGTLSFSIENLFDRDYTTVWGQRAPLYYSPGYGPASLYDYKG
RGRTFGLNYSVLF
>A0A1W7HCY1 1.16.1.10~~~iutB~~~Ferric aerobactin reductase IutB~~~
MSGHSFFEHLFEHSQHVTPYLHGAIKPRPERCAEHGFIHIEHASSDHIRALYESLKLAHPEAGAAYWLTRTWTLLCWQPL
YVAFIAIYSCQGLPKLSSMGQHVQPRFVSGYQFDDDEYRQGSEQELIAHAGKELCALFDYFRQEMSLWTRIRPGFTQHLF
ADGVFGCLVKLSQFYPTLSGDYFLEQARLWLAACQLPEKLIQSLRYDETSRQLSLVRTSCCLVYKCQGRELCRDCPRHPD
NKRE
>P0AD59 ~~~ivy~~~Inhibitor of vertebrate lysozyme~~~
MGRISSGGMMFKAITTVAALVIATSAMAQDDLTISSLAKGETTKAAFNQMVQGHKLPAWVMKGGTYTPAQTVTLGDETYQ
VMSACKPHDCGSQRIAVMWSEKSNQMTGLFSTIDEKTSQEKLTWLNVNDALSIDGKTVLFAALTGSLENHPDGFNFK
>Q9HXB1 ~~~ivy~~~Inhibitor of vertebrate lysozyme~~~
MNGVSRLLSLALLGAALHWAPAQAEEQPRLFELLGQPGYKATWHAMFKGESDVPKWVSDASGPSSPSTSLSLEGQPYVLA
NSCKPHDCGNNRLLVAFRGDKSAAYGLQVSLPDEPAEVMQTPSKYATYRWYGEPSRQVRELLMKQLESDPNWK
>P0CI72 3.5.4.11~~~~~~Isoxanthopterin deaminase~~~
MTTYDTQPSTLIRNAAAIMTGGRGTADDPSRVPGPDIRIVGDTIDAIGALAPRPGETIVDATDCVIYPAWVNTHHHLFQS
LLKGEPAGLDATLTPWLAATPYRFRALFDERRFRLAARIGLIELARSGCATVADHNYVYYPGMPFDSSAILFEEAEKLGL
RFVLLRGGATQTRQLEADLPTALRPETLDAYVADIERLAARYHDASPRAMRRVVMAPTTVLYSISPREMRETAAVARRLG
LRMHSHLSETVGYQDSAYSMYGKSPVAFCGEHDWLGSDVWYAHLVKVDADEIALLAQTGTGVAHCPQSNGRLGSGICPVR
EMADAGVPVSIGVDGAASNEAADMISEVHMTWLAQRARLGMLAQPAYRGGSFEGGAGAASIAEVIHWGTAGGARVMGLDE
VGKVAVGYAADIAVYRLDDPRYFGLHDPAIGPVASGGRPSVMALFSAGKRVVVDDLIEGVDIKELGGEARRVVRELLREV
VV
>Q9PMS6 3.6.1.66~~~~~~dITP/XTP pyrophosphatase~~~COG0127
MKIILATSNKHKVLELKEILKDFEIYAFDEVLMPFEIEENGKTFKENALIKARAVFNALDEKQKKDFIALSDDSGICVDV
LEGNPGIYSARFSGKGDDKSNRDKLVNEMIKKGFKQSKAYYVAAIAMVGLMGEFSTHGTMHGKVIDTEKGENGFGYDSLF
IPKGFDKTLAQLSVDEKNNISHRFKALELAKIILKILNKG
>Q83FA3 3.6.1.66~~~~~~dITP/XTP pyrophosphatase~~~COG0127
MLEIVLASQNSSKLAEMQELLRDLEIKFIPQTEFSVPDIEETGSTFVENAIIKARHAAKQTGLPALADDSGLTIAALNSA
PGVFSSRYAGKNATDAERIQKVLEALEAADDSDRSASFHCVIALMENENDPAPLICHGVWEGEIAREPRGKNGFGYDPIF
YVPSHQRTAAELDPQEKNAISHRGQALEQLSTVLTEAFLV
>P52061 3.6.1.66~~~rdgB~~~dITP/XTP pyrophosphatase~~~COG0127
MQKVVLATGNVGKVRELASLLSDFGLDIVAQTDLGVDSAEETGLTFIENAILKARHAAKVTALPAIADDSGLAVDVLGGA
PGIYSARYSGEDATDQKNLQKLLETMKDVPDDQRQARFHCVLVYLRHAEDPTPLVCHGSWPGVITREPAGTGGFGYDPIF
FVPSEGKTAAELTREEKSAISHRGQALKLLLDALRNG
>P44598 3.6.1.66~~~~~~dITP/XTP pyrophosphatase~~~COG0127
MKQKIVLATGNKGKVKEMADVLSDFGFEVIAQTDLGIESPEETGLTFVENALLKARYASEKSGLPAIADDSGLVVSALNG
APGLYSARYAGEEGNDAKNREKLLAELAHIAQEQRQAKFVSCIVFLQHPTDPSPIIAEGECCGVIGFEEKGENGFGYDSL
FFSPEQGCTFAELETAEKKKISHRAKALSVLKSKL
>B1MLZ4 3.6.1.66~~~~~~dITP/XTP pyrophosphatase~~~
MKILVASRNPKKLAELSRVLESSGVSGVELVSLTDVPEYEEVPETGASFEDNALIKAREGVKHTGLACVADDSGLAVDAL
NWMPGVLSARWSGRHGDDAANTALLLAQLSDIPDERRGAAFVSACALVTPEGEEVVVEGRWKGSIARIPAGQNGFGYDPI
FVPRGGLRTAAELTPEEKDAVSHRGRALAALLPMLRNLVNLGRTAP
>P9WMR7 3.6.1.66~~~~~~dITP/XTP pyrophosphatase~~~COG0127
MALVTKLLVASRNRKKLAELRRVLDGAGLSGLTLLSLGDVSPLPETPETGVTFEDNALAKARDAFSATGLASVADDSGLE
VAALGGMPGVLSARWSGRYGDDAANTALLLAQLCDVPDERRGAAFVSACALVSGSGEVVVRGEWPGTIAREPRGDGGFGY
DPVFVPYGDDRTAAQLSPAEKDAVSHRGRALALLLPALRSLATG
>P99094 3.6.1.66~~~~~~dITP/XTP pyrophosphatase~~~
MKEIVIASNNQGKINDFKVIFPDYHVIGISELIPDFDVEETGSTFEENAILKSEAAAKALNKTVIADDSGLEVFALNGEP
GIYSARYAGENKSDEANIEKLLNKLGNTTDRRAQFVCVISMSGPDMETKVFKGTVSGEIADGKYGENGFGYDPIFYVPKL
DKTMAQLSKEQKGQISHRRNAINLLQAFLEGEKNV
>Q9WY06 3.6.1.66~~~~~~dITP/XTP pyrophosphatase~~~COG0127
MKKLTVYLATTNPHKVEEIKMIAPEWMEILPSPEKIEVVEDGETFLENSVKKAVVYGKKLKHPVMADDSGLVIYSLGGFP
GVMSARFMEEHSYKEKMRTILKMLEGKDRRAAFVCSATFFDPVENTLISVEDRVEGRIANEIRGTGGFGYDPFFIPDGYD
KTFGEIPHLKEKISHRSKAFRKLFSVLEKILESENR
>F4KU78 1.21.1.1~~~IYD~~~Iodotyrosine deiodinase~~~COG0778
MKQKPAFIPYAGAQFEPEEMLSKSAEYYQFMDHRRTVREFSNRAIPLEVIENIVMTASTAPSGAHKQPWTFVVVSDPQIK
AKIRQAAEKEEFESYNGRMSNEWLEDLQPFGTDWHKPFLEIAPYLIVVFRKAYDVLPDGTQRKNYYVQESVGIACGFLLA
AIHQAGLVALTHTPSPMNFLQKILQRPENERPFLLVPVGYPAEGAMVPDLQRKDKAAVMVVY
>B9K712 1.21.1.1~~~IYD~~~Iodotyrosine deiodinase~~~COG0778
MKMLYDLAKKRKTVRRFKKEKPPLEDLIYSLKVANEAPSGMNAQPWRFLIVEDEKLKGQIRRVCERSEKTFYENVRGRLK
EWLDEKRFTWRKPFLKEAPYLLLVFSEKSAPYSRESVWLAVGYLLLALEEKGLGSVPYTPPDFREVEKLVNTPSELRLEV
ILPVGYPDDPKPKYPRNEVIVRYNTF
>Q72IJ9 3.4.19.-~~~~~~Probable TtuB-protein conjugate cleaving protease~~~COG1310
MAPPSVPIVYAGGRGGCPGGYHGLVLYVPRGLLEETRAHLLREAPKEGVGLWAGRREVERVIPLPNVHPSPLTAYLADPL
ALLKALKALEREGLSLLAIYHSHPKGPALPSPRDIKEARWRVPYVIFGTDGVRAFLLPEGQEVALVVL
>P9WJW9 ~~~jefA~~~Drug efflux pump JefA~~~COG0477
MTPRQRLTVLATGLGIFMVFVDVNIVNVALPSIQKVFHTGEQGLQWAVAGYSLGMAAVLMSCALLGDRYGRRRSFVFGVT
LFVVSSIVCVLPVSLAVFTVARVIQGLGAAFISVLSLALLSHSFPNPRMKARAISNWMAIGMVGAASAPALGGLMVDGLG
WRSVFLVNVPLGAIVWLLTLVGVDESQDPEPTQLDWVGQLTLIPAVALIAYTIIEAPRFDRQSAGFVAALLLAAGVLLWL
FVRHEHRAAFPLVDLKLFAEPLYRSVLIVYFVVMSCFFGTLMVITQHFQNVRDLSPLHAGLMMLPVPAGFGVASLLAGRA
VNKWGPQLPVLTCLAAMFIGLAIFAISMDHAHPVALVGLTIFGAGAGGCATPLLHLGMTKVDDGRAGMAAGMLNLQRSLG
GIFGVAFLGTIVAAWLGAALPNTMADEIPDPIARAIVVDVIVDSANPHAHAAFIGPGHRITAAQEDEIVLAADAVFVSGI
KLALGGAAVLLTGAFVLGWTRFPRTPAS
>B9IS84 ~~~jetB~~~Wadjet protein JetB~~~
MQNVSEREREEMGIVVNYLFSHNFLLKEFEREKYHLAVRNKDIIKQYLQVIGWDFIVDEKHGCIVIVSPHYEHRLKLKKD
ETIWLLVLRLIYEEKRSALSISQYPFTTLQEIKGKYETFRLPFVSKTKLRELVQIGKQNQLLRPIDNDIESDDCRFQLFH
SCIHVLQQGDLNVLYEKIKSYSEGGDHSEMDEETTIN
>B9IS85 ~~~jetC~~~Wadjet protein JetC~~~
MKKLRLINWHYYSDETILFGKQTVISGHTGAGKSTVIDALQVLFISDERKIKFNSAAYEEANRTLINYLRGKIGTEEKPF
VREGYFTTYIVAEFYDEKAGESFVIGISIDVFKDDEKVKEYFIIPKSEINMISFFSTKDEKRYVEKQADFCKKIREQFPE
AIIEKSSNQYQKALLQRFGGLHERFVKTFARALSFKPIDNMKDFVYKNILDEKELKIDVMRNIFQTHEELQRELEELKER
KEELERIDNIYLECVKLEADISIQEYVLRGLEYLLIQEEKSMCKKSIEQREKELRKCESDQKKTAEQKEHARKKETEYEI
KIKDSAEQKRQKQLQEQIAQAKKECGDLECTKNIYVHSLAREEKDVSSLLNYQGNEYFSLSKDEKHALEIGRDCLAFLSH
NDGTGGNREEQTLNKLGESLKRISGRFYKSTAELEHRSAELKTEEKELLSDIENLKRRKRPYPMSVEKLKGLLEKHLEDQ
SKVWILCEELEIKNDKWRNALEGYLNTQRFDILVEPHMFATALSIYEKEKWNLGLEGVGLVDTEKEQTYLGKVEKGSLAE
EIVGGNPIVQARIHHLLGRVIKADNEQELRKYKTAVTATCMSYQRLVARQIPRKVYETPYIGAHAIQKQLEIKEENLKEI
QTELQIVGYYIKDFKKWIEILEDKQSDYKNYILNFSLNDSILEFNKNINKWKSELNTLDLSRLESLRQKLKEWNGKYNQF
NGEEGRLFEQIGKVKEELQRVNAELWKKEKAATEILEKWKNWKFEYRIELLQEAEQRYEQAISTNKAYGAIKNKYENNKK
ENQNKYEEKRGFLESERKSYNEARTFQGIIQAKDNKQYEEALRKIANLDIPKFEQEIKETLQQAEEEFQSHFIYKMREAI
QAARREFNQLNHALGRFKFRNDTYRFVIKPSEQYKKFYDVIMDERVQPEISLFDFGDEDRAEILKDLFGRLVVGEYGENE
EFVDYRNYLDFDLSINNENGTRFMSNLLREQSGGETQTPFYIAILASFQHLYRNKNTIRLVVFDEAFNKMDEERIQISLR
LIKQLDLQLIAAVPDEKMAHMAAEADTAIIINRIGHSCFTDILSYPREDEAIGLQEQDSFSLIE
>B9IS86 ~~~jetD~~~Wadjet protein JetD~~~
MDYKSKILSVLLNKYENSKTAHTGERSAQRPQFSFRQKHELSKAYNDEMDYTNRLEINTALKDLIRKKIIEVKWEKWEEN
RIAEKVYLQYDFIPQAYREAGIEPKIEKMNRILKVLEPLAVHSWEWVRQWYKEVQQSFQNNKTARINLNDVKGYELLVKA
LSRLEGLEDSIPKRTFSQLVFGDTKLFETTIQNRLLIIYKRYGDIEYESDKEYLESIGILENIQPVYIKGNVDIRVRGEK
IALGSFPGGFGLMDETIKELEIQYVHDESIMLIENMTTYYEQIKKNNNILFIYTGGFPKKNVQQLLKKLNIYLENHPVPV
YHYGDLDYGGIQIFEYIKRSFFSGLEPYMMDVATYRQFVKYGMEFGEGYEEKLLKMLENEQYSLWHELIKEMLKEKKRVE
QEVIVRNVI
>P0AEW9 2.7.1.56~~~fruK~~~1-phosphofructokinase~~~COG1105
MSRRVATITLNPAYDLVGFCPEIERGEVNLVKTTGLHAAGKGINVAKVLKDLGIDVTVGGFLGKDNQDGFQQLFSELGIA
NRFQVVQGRTRINVKLTEKDGEVTDFNFSGFEVTPADWERFVTDSLSWLGQFDMVCVSGSLPSGVSPEAFTDWMTRLRSQ
CPCIIFDSSREALVAGLKAAPWLVKPNRRELEIWAGRKLPEMKDVIEAAHALREQGIAHVVISLGAEGALWVNASGEWIA
KPPSVDVVSTVGAGDSMVGGLIYGLLMRESSEHTLRLATAVAALAVSQSNVGITDRPQLAAMMARVDLQPFN
>P75038 2.7.1.56~~~fruK~~~Putative 1-phosphofructokinase~~~
MLNHNSKVWIVNYACAIDYYLDKHKQQRGVLTPGGKGINMAIVMALFGIKPTVLTFLGQPTKDLFLQLLKPYQLDLVSFP
ATTQTRINVKLLDGAQTTEINDVTPLIEEQAVHEMIAYLKANVKPNDLLVLNGRFLQRDLVKLLDVAFSLTKYVVLDVDE
PQLLQLLNQRQPWLMKPNRDEFVAMVNANNSNVDQQELVQLIKQFQTTQNLLMSDGAQGAYFFDQQQLLFMEAIPPQQLV
STTGAGDTLLGVFLANLLLDKDPVGSLKVAVNYASATISKLAVVNSNDQIVLKATNYYYL
>Q9KM71 2.7.1.56~~~fruk~~~1-phosphofructokinase~~~COG1105
MTKKVVTITLNPALDLTGSVNQLNVGSVSLVGQSSLHAAGKGVNVAKVLSELGAQVTVTGFLGRDNQELFCQLFEQLGVQ
DAFIRIAGATRINVKLVEQSGAVSDINFPGIQVTEADIEAFEATLQRLAQDHDYFVLAGSLPQGISPQRCAGWIAQLRSM
NKKVLFDSSRDALLAGLDAKPWLIKPNDEELSQWCGRELTTLTDCQQAAAELAQKQIENIVISMGAEGVMWLHENQWLHA
KPPKMQVVSTVGAGDTLVAGLCWGHMQRMEKESLLRFATALSALAVTQVGVGLGDREQLNTLQQQIQVSALYPTMGA
>P23354 2.7.1.56~~~fruK~~~1-phosphofructokinase~~~COG1105
MSLQAITVTLNPAIDQTIQLDRLQPGAVHRASSVRNDAGGKGINVAACLADWGSQVAALGVLGVGNAGVFEALFRERGIT
DHCHRVAGDTRTNLKLVEAQVNETTDINLPGLQLGQAHLQGVADHLAPLLRAGLPVVLSGSLPAGLPEDSWAQLQAQASA
AGARVLLDTSGAPLVAALAAAPVAMPYAVKPNRHELEAWTGHPLGDHAALTAAAHALIARGIQLVVISMGTEGALFVQRD
QQLIARPPRLAQGSSVGAGDAMVAGLAAALLDDATELEQCARLATAFSMCRLESGDARRITPEGVRDAAAAVVIGAVP
>Q6L741 2.6.1.94~~~kacL~~~2'-deamino-2'-hydroxyneamine transaminase~~~
MSTHPVLDWSRSAEHLRRSHGVTTDPRPDEDGHYPCVLTRGSGTRVYDLDGNAYLDLTGSFGSVLIGHAEPAVVRAVTDV
LSEGNLFYTGASPRRLALAERLLDWFPWSEQAIFYRTGSCAVSAAARLAQHATGRNRVLSSGYHGWHDWHLEAVPEAKPK
TFESYATEFHNDLALYRSWLDRHGEEIAAVVVTPEPHRFDHAYYQELREVAKEHGCLFVVDEVKTGFRAGAGGFSALAGI
EPDAVTVSKGMANGHSISAVVGQRQLTQELSEAHVWSTYQNEQVGFAAALASLDFLERHDVAAVTRRTGEAVRQGVLQLF
AEHGLPVGAPGWGPMFELDFDAADEGLAERLEAALLRHGIFCDTGDDFNMMFHTAEHTDELLERFAAALGDL
>P73302 2.7.4.3~~~adk1~~~Adenylate kinase 1~~~COG0563
MAMAKGLIFLGAPGSGKGTQAVGLAETLGIPHISTGDMLRQAIADGTELGNQAKGYMDKGELVPDQLILGLIEERLGHKD
AKAGWILDGFPRNVNQAIFLDELLVNIGHRTHWVINLKVPDEVIVERLLARGRADDNETTIRNRLLVYTEQTAPLMAYYQ
EQGKLYSLDGNQPVEAIATNLEKLVKP
>O66490 2.7.4.3~~~adk~~~Adenylate kinase~~~COG0563
MILVFLGPPGAGKGTQAKRLAKEKGFVHISTGDILREAVQKGTPLGKKAKEYMERGELVPDDLIIALIEEVFPKHGNVIF
DGFPRTVKQAEALDEMLEKKGLKVDHVLLFEVPDEVVIERLSGRRINPETGEVYHVKYNPPPPGVKVIQREDDKPEVIKK
RLEVYREQTAPLIEYYKKKGILRIIDASKPVEEVYRQVLEVIGDGN
>P16304 2.7.4.3~~~adk~~~Adenylate kinase~~~COG0563
MNLVLMGLPGAGKGTQGERIVEDYGIPHISTGDMFRAAMKEETPLGLEAKSYIDKGELVPDEVTIGIVKERLGKDDCERG
FLLDGFPRTVAQAEALEEILEEYGKPIDYVINIEVDKDVLMERLTGRRICSVCGTTYHLVFNPPKTPGICDKDGGELYQR
ADDNEETVSKRLEVNMKQTQPLLDFYSEKGYLANVNGQQDIQDVYADVKDLLGGLKK
>J7RC67 2.7.4.3~~~adk~~~Adenylate kinase~~~COG0563
MRLILLGPPGAGKGTQAAFLTQHYGIPQISTGDMLRAAVKAGTPLGLEAKKVMDAGGLVSDDLIIGLVRDRLTQPDCANG
YLFDGFPRTIPQADALKSAGIALDYVVEIEVPESDIIERMSGRRVHPASGRSYHVRFNPPKAEGVDDVTGEPLVQRDDDR
EETVRHRLNVYQNQTRPLVDYYSSWAQSDAAAAPKYRKISGVGSVDEIKSRLSQALQS
>Q3JVB1 2.7.4.3~~~adk~~~Adenylate kinase~~~
MRLILLGAPGAGKGTQANFIKEKFGIPQISTGDMLRAAVKAGTPLGVEAKTYMDEGKLVPDSLIIGLVKERLKEADCANG
YLFDGFPRTIAQADAMKEAGVAIDYVLEIDVPFSEIIERMSGRRTHPASGRTYHVKFNPPKVEGKDDVTGEPLVQRDDDK
EETVKKRLDVYEAQTKPLITYYGDWARRGAENGLKAPAYRKISGLGAVEEIRARVFDALK
>P69441 2.7.4.3~~~adk~~~Adenylate kinase~~~COG0563
MRIILLGAPGAGKGTQAQFIMEKYGIPQISTGDMLRAAVKSGSELGKQAKDIMDAGKLVTDELVIALVKERIAQEDCRNG
FLLDGFPRTIPQADAMKEAGINVDYVLEFDVPDELIVDRIVGRRVHAPSGRVYHVKFNPPKVEGKDDVTGEELTTRKDDQ
EETVRKRLVEYHQMTAPLIGYYSKEAEAGNTKYAKVDGTKPVAEVRADLEKILG
>Q5NFR4 2.7.4.3~~~adk~~~Adenylate kinase~~~COG0563
MRIILLGAPGAGKGTQAKIIEQKYNIAHISTGDMIRETIKSGSALGQELKKVLDAGELVSDEFIIKIVKDRISKNDCNNG
FLLDGVPRTIPQAQELDKLGVNIDYIVEVDVADNLLIERITGRRIHPASGRTYHTKFNPPKVADKDDVTGEPLITRTDDN
EDTVKQRLSVYHAQTAKLIDFYRNFSSTNTKIPKYIKINGDQAVEKVSQDIFDQLNKR
>P27142 2.7.4.3~~~adk~~~Adenylate kinase~~~
MNLVLMGLPGAGKGTQAEKIVAAYGIPHISTGDMFRAAMKEGTPLGLQAKQYMDRGDLVPDEVTIGIVRERLSKDDCQNG
FLLDGFPRTVAQAEALETMLADIGRKLDYVIHIDVRQDVLMERLTGRRICRNCGATYHLIFHPPAKPGVCDKCGGELYQR
ADDNEATVANRLEVNMKQMKPLVDFYEQKGYLRNINGEQDMEKVFADIRELLGGLAR
>A0QSH8 2.7.4.3~~~adk~~~Adenylate kinase~~~COG0563
MRVVLLGPPGAGKGTQAEKLSEKLGIPQISTGDLFRKNIGDGTPLGLEAKRYLDAGDLVPAELTNRLVEDRIDQPDAAEG
FILDGYPRSVEQAGALKDMLAARNTKLDAVLEFQVSEDELLTRLKGRGRADDTDEVIRNRMKVYREETEPLLEYYRDDLK
TVNAVGALDEVFARALSALGQ
>P9WKF5 2.7.4.3~~~adk~~~Adenylate kinase~~~COG0563
MRVLLLGPPGAGKGTQAVKLAEKLGIPQISTGELFRRNIEEGTKLGVEAKRYLDAGDLVPSDLTNELVDDRLNNPDAANG
FILDGYPRSVEQAKALHEMLERRGTDIDAVLEFRVSEEVLLERLKGRGRADDTDDVILNRMKVYRDETAPLLEYYRDQLK
TVDAVGTMDEVFARALRALGK
>P10772 2.7.4.3~~~adk~~~Adenylate kinase~~~
MAINIILLGPPGAGKGTQARRLIDERGLVQLSTGDMLREARSSGTEMGKRVAEVMDRGELVTDEIVIGLIREKLGQGGKG
FIFDGFPRTLAQADALQALMAEMDQRIDAVIEMRVDDAALVSRISGRFTCGNCGEVYHDVTKPTKEPGKCDVCGSTDLRR
RADDNEESLKTRLMEYYKKTSPLIGYYYVKGNLNPVDGLAEIDEVAAQVAKVMDKIPA
>Q6LTE1 2.7.4.3~~~adk~~~Adenylate kinase~~~COG0563
MRIILLGAPGAGKGTQAQFIMAKFGIPQISTGDMLRAAIKAGTELGKQAKSVIDAGQLVSDDIILGLVKERIAQDDCAKG
FLLDGFPRTIPQADGLKEVGVVVDYVIEFDVADSVIVERMAGRRAHLASGRTYHNVYNPPKVEGKDDVTGEDLVIREDDK
EETVLARLGVYHNQTAPLIAYYGKEAEAGNTQYLKFDGTKAVAEVSAELEKALA
>P84139 2.7.4.3~~~adk~~~Adenylate kinase~~~
MNIVLMGLPGAGKGTQADRIVEKYGTPHISTGDMFRAAIQEGTELGVKAKSFMDQGALVPDEVTIGIVRERLSKSDCDNG
FLLDGFPRTVPQAEALDQLLADMGRKIEHVLNIQVEKEELIARLTGRRICKVCGTSYHLLFNPPQVEGKCDKDGGELYQR
ADDNPDTVTNRLEVNMNQTAPLLAFYDSKEVLVNINGQKDIKDVFKDLDVILQGNGQ
>P99062 2.7.4.3~~~adk~~~Adenylate kinase~~~
MNIILMGLPGAGKGTQASEIVKKFPIPHISTGDMFRKAIKEETELGKEAKSYMDRGELVPDEVTVGIVKERISEDDAKKG
FLLDGFPRTIEQAEALNNIMSELDRNIDAVINIEVPEEELMNRLTGRRICESCGTTYHLVFNPPKVEGICDIDGGKLYQR
EDDNPETVANRLSVNIKQSKPILDFYDQKGVLKNIDGSKDISDVTKDVIDILDHL
>Q04ML5 2.7.4.3~~~adk~~~Adenylate kinase~~~COG0563
MNLLIMGLPGAGKGTQAAKIVEQFHVAHISTGDMFRAAMANQTEMGVLAKSYIDKGELVPDEVTNGIVKERLSQDDIKET
GFLLDGYPRTIEQAHALDKTLAELGIELEGIINIEVNPDSLLERLSGRIIHRVTGETFHKVFNPPVDYKEEDYYQREDDK
PETVKRRLDVNIAQGEPIIAHYRAKGLVHDIEGNQDINDVFSDIEKVLTNLK
>Q5XEB4 2.7.4.3~~~adk~~~Adenylate kinase~~~
MNLLIMGLPGAGKGTQAAKIVEEFGVAHISTGDMFRAAMANQTEMGRLAKSYIDKGELVPDEVTNGIVKERLAEDDIAEK
GFLLDGYPRTIEQAHALDATLEELGLRLDGVINIKVDPSCLVERLSGRIINRKTGETFHKVFNPPVDYKEEDYYQREDDK
PETVKRRLDVNMAQEEPILEHYRKLDLVTDIEGNQEITDVFADVEKALLELK
>Q97SU1 2.7.4.3~~~adk~~~Adenylate kinase~~~COG0563
MNLLIMGLPGAGKGTQAAKIVEQFHVAHISTGDMFRAAMANQTEMGVLAKSYIDKGELVPDEVTNGIVKERLSQDDIKET
GFLLDGYPRTIEQAHALDKTLAELGIELEGVINIEVNPDSLLERLSGRIIHRVTGETFHKVFNPPVDYKEEDYYQREDDK
PETVKRRLDVNIAQGEPIIAHYRAKGLVHDIEGNQDINDVFSDIEKVLTNLK
>Q5SHQ9 2.7.4.3~~~adk~~~Adenylate kinase~~~COG0563
MDVGQAVIFLGPPGAGKGTQASRLAQELGFKKLSTGDILRDHVARGTPLGERVRPIMERGDLVPDDLILELIREELAERV
IFDGFPRTLAQAEALDRLLSETGTRLLGVVLVEVPEEELVRRILRRAELEGRSDDNEETVRRRLEVYREKTEPLVGYYEA
RGVLKRVDGLGTPDEVYARIRAALGI
>Q9KTB7 2.7.4.3~~~adk~~~Adenylate kinase~~~COG0563
MRIILLGAPGAGKGTQAQFIMEKFGIPQISTGDMLRAAIKAGTELGKQAKAVIDAGQLVSDDIILGLIKERIAQADCEKG
FLLDGFPRTIPQADGLKEMGINVDYVIEFDVADDVIVERMAGRRAHLPSGRTYHVVYNPPKVEGKDDVTGEDLVIREDDK
EETVRARLNVYHTQTAPLIEYYGKEAAAGKTQYLKFDGTKQVSEVSADIAKALA
>Q8YT42 ~~~kaiA~~~Circadian clock oscillator protein KaiA~~~
MTQEVDQQILLQQLKSDYRQILLSYFTTDKALKEKIDKFINAVFCANIPVPEIIEIHMELIDEFSKQLRLEGRGDETLMD
YRLTLIDILAHLCEAYRGAIFK
>Q79PF6 ~~~kaiA~~~Circadian clock oscillator protein KaiA~~~
MLSQIAICIWVESTAILQDCQRALSADRYQLQVCESGEMLLEYAQTHRDQIDCLILVAANPSFRAVVQQLCFEGVVVPAI
VVGDRDSEDPDEPAKEQLYHSAELHLGIHQLEQLPYQVDAALAEFLRLAPVETMADHIMLMGANHDPELSSQQRDLAQRL
QERLGYLGVYYKRDPDRFLRNLPAYESQKLHQAMQTSYREIVLSYFSPNSNLNQSIDNFVNMAFFADVPVTKVVEIHMEL
MDEFAKKLRVEGRSEDILLDYRLTLIDVIAHLCEMYRRSIPRET
>P74644 ~~~kaiA~~~Circadian clock oscillator protein KaiA~~~
MQSPLSLCLFAPEHVAHRLRSIFQGDRHYLSTFQALDDFCAFLEDKPERIDCLLVYYEANSLPVLNRLYEQGRLLPIILL
EPSPSALAKTTDEHPTIVYHNAEIHLPESQWSELPTVVDRAIAHYLHLGPICTLPNQTETIPAPIVDESSQSFLLLQQRR
LADKLKERLGYLGVYYKRKPSHFYRNFSPQEKQEYLEDLSSQYREIILSYFSDEGTVNDLLDQFVNQAFFADLAISQILE
IHMELMDEFSQHLKLEGRSEEVLLDYRLVLIDILAHLGEMYRRSIPREDIPFDVYYQTD
>Q79V62 ~~~kaiA~~~Circadian clock oscillator protein KaiA~~~
MAQSTALTICGLVYSPAIGQELVRLHTSDIDELVYFSSEREFCNYLEARRNSIACLILEWGEGTPQIITYLHHSATLLPA
ILIFPAAPAPPPAGPHYHIAEVILTTDQLDQLNRQIEEAITGFVKLCPGCAVPPHVLFRLPALKESSNVDPQHRLSQKLK
ERLGYLGVYYKRDTAFFFRRMSPADKRKLLDELRSIYRTIVLEYFNTDAKVNERIDEFVSKAFFADISVSQVLEIHVELM
DTFSKQLKLEGRSEDILLDYRLTLIDVIAHLCEMYRRSIPREV
>Q6L8K1 ~~~kaiA~~~Circadian clock oscillator protein KaiA~~~
MAQSTALTICGLVYSPAIGQELVRLHTSDIDELVYFSSEREFCNYLEARRNSVACLILEWGEGTPQIITYLHHSATLLPA
ILIFPAAPAPPPAGPHYHIAEVILTTDQLDQLNRQIEEAITGFVKLCPGCAVPPHVLFRLPALKESSNVDPQHRLSQKLK
ERLGYLGVYYKRDTAFFFRRMSPADKRKLLDELRSIYRTIVLEYFNTDAKVNERIDEFVSKAFFADISVSQVLEIHVELM
DTFSKQLKLEGRSEDILLDYRLTLIDVIAHLCEMYRRSIPREV
>P74645 ~~~kaiB1~~~Circadian clock oscillator protein KaiB1~~~COG4251
MSPFKKTYVLKLYVAGNTPNSVRALKMLKNILEQEFQGVYALKVIDVLKNPQLAEEDKILATPTLAKILPPPVRKIIGDL
SDREKVLIGLDLLYDEIREREAEDQ
>Q55819 ~~~kaiB3~~~Circadian clock protein KaiB3~~~COG4251
MDMNRIVLRLYITGNSVRSQQAIANIYRICQEDLGDQYNVEIIDVLEQPQRAEEEKIMVTPTLIKQLPPPLQRIIGDMSN
TEKVLLGLDIVPEGLQVRLPED
>Q8YT41 ~~~kaiB~~~Circadian clock oscillator protein KaiB~~~COG4251
MNKARKTYVLKLYVAGNTPNSVRALKTLKNILEQEFQGIYALKVIDVLKNPQLAEEDKILATPTLSKILPPPVRKIIGDL
SDRERVLIGLDLLYEELTEEDWEAQSNL
>Q79PF5 ~~~kaiB~~~Circadian clock oscillator protein KaiB~~~COG4251
MSPRKTYILKLYVAGNTPNSVRALKTLKNILEVEFQGVYALKVIDVLKNPQLAEEDKILATPTLAKVLPLPVRRIIGDLS
DREKVLIGLDLLYGELQDSDDF
>Q79V61 ~~~kaiB~~~Circadian clock oscillator protein KaiB~~~COG4251
MAPLRKTYVLKLYVAGNTPNSVRALKTLNNILEKEFKGVYALKVIDVLKNPQLAEEDKILATPTLAKVLPPPVRRIIGDL
SNREKVLIGLDLLYEEIGDQAEDDLGLE
>P74646 2.7.11.1~~~kaiC1~~~Circadian clock oscillator protein KaiC 1~~~COG0467
MNLPIVNERNRPDVPRKGVQKIRTVIEGFDEITHGGLPIGRTTLVSGTSGTGKTLLAVQFLYQGIHHFDYPGLFITFEES
PSDIIENAYSFGWDLQQLIDDGKLFILDASPDPEGQEVVGTFDLSALIERIQYAVRKYKAKLVSIDSVTAVFQQYDAASV
VRREIFRLVARLKQLQVTSIMTTERVEEYGPIARFGVEEFVSDNVVVLRNVLEGERRRRTVEILKLRGTTHMKGEYPFTI
THDGINIFPLGAMRLTQRSSNARISSGVQTLDEMCGGGFFKDSIILATGATGTGKTLLVSKFLQEGCRQRERAILFAYEE
SRAQLSRNASSWGIDFEEMEHKGLLKLLCTYPESAGLEDHLQMIKSEISEFKPSRIAIDSLSALARGVTNNAFRQFVIGV
TGYAKQEEITGFFTNTTDQFMGAHSITESHISTITDTILMLQYVEIRGEMSRALNVFKMRGSWHDKGIREYSISHDGPDI
RDSFRNYERIISGSPTRISVDEKSELSRIVRGVKDKTAE
>P73860 2.7.11.1~~~kaiC2~~~Circadian clock protein KaiC2~~~COG0467
MTDNSQSLSLIKCPTGIQGFDEITNGGLPQGRPTLICGSAGCGKTLFGVEFLVRGAVEYGEPGVLVSFEESAKEIIQNVA
SLGWNLQDLVAEEKILIDHIYVEASEIQETGEYDLEALFIRLGYAINKIGAKRILLDTIEVLFSGLENTNIVRAELRRLF
HWLKQKGVTAVITGERGDKNLTRQGLEEYVSDCVIKLDQKTVEEVATRTIQVVKYRGSRHSNNEYPFLIEENGISVLPIT
SLILNHSVSQERISTGIPQLDDMFGGQGYYRGSSILVTGRAGTGKTTLAAFFAQATCLRGERCLYLATEESPQQICRNLN
SIGLDLSPYLDSQLLQFDATRPTNYNLEMRLFKIHSWVRNFKPSLVVVDPMSNLITSGNLNQTKNFFMRLIDYLKSQKIT
VFLTDLTGGNVGYDNEQTEVGVSSLMDTWLELQTLRINGERNRILYILKSRGMAHSNQVREFILSNDGVDLIEAYIGEGQ
VLTGTQRINQILEEEAIAKRRQQALELSKRNFERKKYLLQAKIDALQMKLASQDEELEVLMLEEKEFKQTMLANRNLIKK
SRHIYQNP
>P74503 2.7.11.1~~~kaiC3~~~Circadian clock protein KaiC3~~~COG0467
MIDQETDGIEKLETGIPGFDFLSDGGLPLGRATLIAGTAGSAKTIFASQFLVEGIQRGENGVFVTFEEPPKALRKNMRGF
GWDIQQWENEGKWVFVDASPQPGDRPIVSGEYDLGALIARIEHAVRKYKASRISLDSLGAIFSHLSDSAQVRSDLFRLAS
ALRELGVTAIMTAERVEEYGEISRYGVEEFVADNVVIVRNVLADEKRRRTIEILKYRGTDHQKGEFPFTIINKKGIVIIP
LSAIELEQKSSDIRITSGSEELDRMCGSGFFRDSIILVSGATGTGKTLMVTEFMDGGVANGERCLVFAFEESREQLIRNA
TGWGVDFKQMEKEGKLKVVCRYPETTNLENHLIMMKDIIQEFKPNRVAVDSLSALERVSTLKSFREFIIGLTSFIKQQEI
GGLFTSTTPNLLGGASITDAHISTITDSIILLRYVEMYGEMRRGITVLKMRGSMHDKDIREFSIDHQGMHIGKPFRNVTG
ILAGTPMYTAQSEVERLSGLFDEKI
>Q79PF4 2.7.11.1~~~kaiC~~~Circadian clock oscillator protein KaiC~~~COG0467
MTSAEMTSPNNNSEHQAIAKMRTMIEGFDDISHGGLPIGRSTLVSGTSGTGKTLFSIQFLYNGIIEFDEPGVFVTFEETP
QDIIKNARSFGWDLAKLVDEGKLFILDASPDPEGQEVVGGFDLSALIERINYAIQKYRARRVSIDSVTSVFQQYDASSVV
RRELFRLVARLKQIGATTVMTTERIEEYGPIARYGVEEFVSDNVVILRNVLEGERRRRTLEILKLRGTSHMKGEYPFTIT
DHGINIFPLGAMRLTQRSSNVRVSSGVVRLDEMCGGGFFKDSIILATGATGTGKTLLVSRFVENACANKERAILFAYEES
RAQLLRNAYSWGMDFEEMERQNLLKIVCAYPESAGLEDHLQIIKSEINDFKPARIAIDSLSALARGVSNNAFRQFVIGVT
GYAKQEEITGLFTNTSDQFMGAHSITDSHISTITDTIILLQYVEIRGEMSRAINVFKMRGSWHDKAIREFMISDKGPDIK
DSFRNFERIISGSPTRITVDEKSELSRIVRGVQEKGPES
>Q8GGL1 2.7.11.1~~~kaiC~~~Circadian clock oscillator protein KaiC~~~
MNQSLGPSEPEKPQDNTAEDSTEPTPDNHRADLSELRGIPKKQTGIEGFEDISHGGLPLGRTTLVSGTSGTGKTLFAMQF
LYNGIVKYQEPGIFVTFEETPADIIRNASSFGWDLQALIDRGQLFILDASPDPEGYEVSGNFDLSALIERIQYAIRKYKA
KRVSIDSVTAIFQQYDPAGVVRRELFRLTARLKQANVTTVMTTERTDEYGPIARYGVEEFVSDNVVILRNILEGEKRRRT
IEILKLRGTTHMKGEYPFTITNDGINIFPLGAMQLTQRSSNVRVSSGIEKLDEMCGGGFFKDSIILATGATGTGKTSLVS
KFLERGCLDGERCILFAYEESRAQLSRNASSWGIDLEEFERQGLLKIICAYPESAGLEDHLQKIKTEMMAFKPSRMAIDS
LSALARGVSQNAFRQFVIGVTGLAKQEEITGFFTNTTDQFMGSHSITESHISTITDTIILLQYVEIRGEMARALNVFKMR
GSWHDKGIREYLISNAGIQIRDSFRGYERIISGSPTRINVDEKNELSRIVQNVQALEEEGL
>Q79V60 2.7.11.1~~~kaiC~~~Circadian clock oscillator protein KaiC~~~COG0467
MTNLPEHQSSPTEQSSAEVKKIPTMIEGFDDISHGGLPQGRTTLVSGTSGTGKTLFAVQFLYNGITIFNEPGIFVTFEES
PQDIIKNALSFGWNLQSLIDQGKLFILDASPDPDGQEVAGDFDLSALIERIQYAIRKYKATRVSIDSVTAVFQQYDAASV
VRREIFRLAFRLKQLGVTTIMTTERVDEYGPVARFGVEEFVSDNVVILRNVLEGERRRRTVEILKLRGTTHMKGEYPFTI
NNGINIFPLGAMRLTQRSSNVRVSSGVKTLDEMCGGGFFKDSIILATGATGTGKTLLVSKFLETGCQQGERALLFAYEES
RAQLSRNASSWGIDFEELERRGLLRIICAYPESAGLEDHLQIIKSEIADFKPSRVAIDSLSALARGVSNNAFRQFVIGVT
GFAKQEEITGFFTNTTDQFMGSNSITESHISTITDTILLLQYVEIRGEMSRAINVFKMRGSWHDKGIREYVITEKGAEIR
DSFRNFEGIISGTPTRISVDEKTELARIAKGMQDLESE
>Q6L8J9 2.7.11.1~~~kaiC~~~Circadian clock oscillator protein KaiC~~~
MTNLPEHQSSPTEQSSAEVKKIPTMIEGFDDISHGGLPQGRTTLVSGTSGTGKTLFAVQFLYNGITIFNEPGIFVTFEES
PQDIIKNALSFGWNLQSLIDQGKLFILDASPDPDGQEVAGDFDLSALIERIQYAIRKYKATRVSIDSVTAVFQQYDAASV
VRREIFRLAFRLKQLGVTTIMTTERVDEYGPVARFGVEEFVSDNVVILRNVLEGERRRRTVEILKLRGTTHMKGEYPFTI
NNGINIFPLGAMRLTQRSSNVRVSSGVKTLDEMCGGGFFKDSIILATGATGTGKTLLVSKFLETGCQQGERALLFAYEES
RAQLSRNASSWGIDFEELERRGLLRIICAYPESAGLEDHLQIIKSEIADFKPSRVAIDSLSALARGVSNNAFRQFVIGVT
GFAKQEEITGFFTNTTDQFMGSNSITESHISTITDTILLLQYVEIRGEMSRAINVFKMRGSWHDKGIREYVITEKGAEIR
DSFRNFEGIISGTPTRISVDEKTELARIAKGMQDLESE
>Q8RHX1 4.3.1.14~~~kal~~~3-aminobutyryl-CoA ammonia lyase~~~COG1607
MKSLIRLRMSSHDAHYGGNLVDGARMLQLFGDVATELLIQLDGDEGLFKAYDSVEFMAPVFAGDYIEAEGEIVNVGNSSR
KMVFEARKVIVPRPDISDSAADVLAEPIVVCRATGTCVTPKDKQRGKK
>O34676 5.4.3.2~~~kamA~~~L-lysine 2,3-aminomutase~~~COG1509
MKNKWYKPKRHWKEIELWKDVPEEKWNDWLWQLTHTVRTLDDLKKVINLTEDEEEGVRISTKTIPLNITPYYASLMDPDN
PRCPVRMQSVPLSEEMHKTKYDLEDPLHEDEDSPVPGLTHRYPDRVLFLVTNQCSMYCRYCTRRRFSGQIGMGVPKKQLD
AAIAYIRETPEIRDCLISGGDGLLINDQILEYILKELRSIPHLEVIRIGTRAPVVFPQRITDHLCEILKKYHPVWLNTHF
NTSIEMTEESVEACEKLVNAGVPVGNQAVVLAGINDSVPIMKKLMHDLVKIRVRPYYIYQCDLSEGIGHFRAPVSKGLEI
IEGLRGHTSGYAVPTFVVDAPGGGGKIALQPNYVLSQSPDKVILRNFEGVITSYPEPENYIPNQADAYFESVFPETADKK
EPIGLSAIFADKEVSFTPENVDRIKRREAYIANPEHETLKDRREKRDQLKEKKFLAQQKKQKETECGGDSS
>Q9XBQ8 5.4.3.2~~~kamA~~~L-lysine 2,3-aminomutase~~~
MINRRYELFKDVSDADWNDWRWQVRNRIETVEELKKYIPLTKEEEEGVAQCVKSLRMAITPYYLSLIDPNDPNDPVRKQA
IPTALELNKAAADLEDPLHEDTDSPVPGLTHRYPDRVLLLITDMCSMYCRHCTRRRFAGQSDDSMPMERIDKAIDYIRNT
PQVRDVLLSGGDALLVSDETLEYIIAKLREIPHVEIVRIGSRTPVVLPQRITPELVNMLKKYHPVWLNTHFNHPNEITEE
STRACQLLADAGVPLGNQSVLLRGVNDCVHVMKELVNKLVKIRVRPYYIYQCDLSLGLEHFRTPVSKGIEIIEGLRGHTS
GYCVPTFVVDAPGGGGKTPVMPNYVISQSHDKVILRNFEGVITTYSEPINYTPGCNCDVCTGKKKVHKVGVAGLLNGEGM
ALEPVGLERNKRHVQE
>Q8RHX4 5.4.3.2~~~kamA~~~L-lysine 2,3-aminomutase~~~COG1509
MNTVNTRKKFFPNVTDEEWNDWTWQVKNRLESVEDLKKYVDLSEEETEGVVRTLETLRMAITPYYFSLIDLNSDRCPIRK
QAIPTIQEIHQSDADLLDPLHEDEDSPVPGLTHRYPDRVLLLITDMCSMYCRHCTRRRFAGSSDDAMPMDRIDKAIEYIA
KTPQVRDVLLSGGDALLVSDKKLESIIQKLRAIPHVEIIRIGSRTPVVLPQRITPELCNMLKKYHPIWLNTHFNHPQEVT
PEAKKACEMLADAGVPLGNQTVLLRGINDSVPVMKRLVHDLVMMRVRPYYIYQCDLSMGLEHFRTPVSKGIEIIEGLRGH
TSGYAVPTFVVDAPGGGGKTPVMPQYVISQSPHRVVLRNFEGVITTYTEPENYTHEPCYDEEKFEKMYEISGVYMLDEGL
KMSLEPSHLARHERNKKRAEAEGKK
>P25920 2.1.1.180~~~kamB~~~16S rRNA (adenine(1408)-N(1))-methyltransferase~~~
MRRVVGKRVQEFSDAEFEQLRSQYDDVVLDVGTGDGKHPYKVARQNPSRLVVALDADKSRMEKISAKAAAKPAKGGLPNL
LYLWATAERLPPLSGVGELHVLMPWGSLLRGVLGSSPEMLRGMAAVCRPGASFLVALNLHAWRPSVPEVGEHPEPTPDSA
DEWLAPRYAEAGWKLADCRYLEPEEVAGLETSWTRRLHSSRDRFDVLALTGTISP
>E3PRJ5 5.4.3.3~~~kamD~~~Lysine 5,6-aminomutase alpha subunit~~~COG0274
MISVESKLNLDFNLVEKARAKAKAIAIDTQEFIEKHTTVTVERAVCRLLGIDGVDTDEVPLPNIVVDHIKENNGLNLGAA
MYIANAVLNTGKTPQEIAQAISAGELDLTKLPMKDLFEVKTKALSMAKETVEKIKNNRSIRESRFEEYGDKSGPLLYVIV
ATGNIYEDITQAVAAAKQGADVIAVIRTTGQSLLDYVPYGATTEGFGGTYATQENFRLMREALDKVGAEVGKYIRLCNYC
SGLCMPEIAAMGAIERLDVMLNDALYGILFRDINMQRTMIDQNFSRIINGFAGVIINTGEDNYLTTADAFEEAHTVLASQ
FINEQFALLAGLPEEQMGLGHAFEMDPELKNGFLYELSQAQMAREIFPKAPLKYMPPTKFMTGNIFKGHIQDALFNMVTI
MTNQRIHLLGMLTEALHTPFMSDRALSIENAQYIFNNMESISEEIQFKEDGLIQKRAGFVLEKANELLEEIEQLGLFDTL
EKGIFGGVKRPKDGGKGLNGVVSKDENYYNPFVELMLNK
>Q8RHX7 5.4.3.3~~~~~~Lysine 5,6-aminomutase alpha subunit~~~COG0274
MGKLDLDWGLVKEARESAKKIAADAQVFIDAHSTVTVERTICRLLGIDGVDEFGVPLPNVIVDFIKDNGNISLGVAKYIG
NAMIETKLQPQEIAEKVAKKELDITKMQWHDDFDIQLALKDITHSTVERIKANRKAREDYLEQFGGDKKGPYIYVIVATG
NIYEDVTQAVAAARQGADVVAVIRTTGQSLLDFVPFGATTEGFGGTMATQENFRIMRKALDDVGVELGRYIRLCNYCSGL
CMPEIAAMGALERLDMMLNDALYGILFRDINMKRTLVDQFFSRIINGFAGVIINTGEDNYLTTADAIEEAHTVLASQFIN
EQFALVAGLPEEQMGLGHAFEMEPGTENGFLLELAQAQMAREIFPKAPLKYMPPTKFMTGNIFKGHIQDALFNIVTITTG
QKVHLLGMLTEAIHTPFMSDRALSIENARYIFNNLKDFGNDIEFKKGGIMNTRAQEVLKKAAELLKTIETMGIFKTIEKG
VFGGVRRPIDGGKGLAGVFEKDNTYFNPFIPLMLGGDR
>E3PRJ4 5.4.3.3~~~kamE~~~Lysine 5,6-aminomutase beta subunit~~~COG2185
MSSGLYSMEKKEFDKVLDLERVKPYGDTMNDGKVQLSFTLPLKNNERSAEAAKQIALKMGLEEPSVVMQQSLDEEFTFFV
VYGNFVQSVNYNEIHVEAVNSEILSMEETDEYIKENIGRKIVVVGASTGTDAHTVGIDAIMNMKGYAGHYGLERYEMIDA
YNLGSQVANEDFIKKAVELEADVLLVSQTVTQKNVHIQNMTHLIELLEAEGLRDRFVLLCGGPRINNEIAKELGYDAGFG
PGRFADDVATFAVKTLNDRMNS
>Q8RHX8 5.4.3.3~~~~~~Lysine 5,6-aminomutase beta subunit~~~COG5012
MSSGLYSTEKRDFDTTLDLTQIRPYGDTMNDGKVQMSFTLPVACNEKGIEAALQLARKMGFVNPAVAFSEALDKEFSFYV
VYGATSFSVDYTAIKVQALEIDTMDMHECEKYIEENFGREVVMVGASTGTDAHTVGIDAIMNMKGYAGHYGLERYKGVRA
YNLGSQVPNEEFIKKAIELKADALLVSQTVTQKDVHIENLTNLVELLEAEGLRDKIILIAGGARITNDLAKELGYDAGFG
PGKYADDVATFILKEMVQRGMNK
>Q65CC7 2.4.1.301~~~kanE~~~Alpha-D-kanosaminyltransferase~~~
MHLLVRALVEEMAGRGVPHRVLTMSPPKVPKDIRIGQRIKVHARRLPVLPIPSDLEGYFGLVGAWAKGSLLWVLRNRKRL
RREIGARVHAHCDGSGAAAFYPYLMSRILGVPLVVQIHSSRYLSQHPTTLFERVTDPIAKWAERHAVRKAAAVLMLTDRA
RDEMRRKAQLPAERVHRLAYLASDQFKDADTEARRAELRERYGLDDRPIVLYVGRIAAEKGVEYYIEAAAELTRRGRDCQ
FVIAGDGPARPDLEKLIGARGLRDRVTITGFMSHEFIPSMISLGELVVLPSRYEELGIVILECMTMRRPLVAHDVNGVNK
LIEDGTTGIVVPPFRTPEMADAVERLLDDPELRERMAENAAPLPAAKYSLSAAGDQLAGIYREIGL
>Q65CC1 2.4.1.284~~~kanF~~~2-deoxystreptamine glucosyltransferase~~~
MQVQILRMSRALAELGVRQQVLTVGFPGLPRVRRDSENLVVRITRAPLPRLRSRITGLVGLNQAWLAAALTECVKLRRRW
PADLIQVHLDGQLWALLAGPVAARLVGVPYTVTVHCSRLAVYQPMSTVDRIQHPLVTAVERWALRRAAGITTLTERTATV
LAAELGAAQRVIDVVPDAVDPDRAEAAPAEVERLKKRFGLPQEGGPVIGFVGRIAHEKGWRHAVQAVAELADAGRDFTFL
VVGDGPQRADMEAAVAEAGLTDRFVFTGFLPNDEIPAVMTALDVLLMPSVHEELGGSAVEAMLAGTPVAAYGVGGLCDTV
GKVTPSLLAAPGQVAELARTVKRVLDDPAPVLAELRAGREWLADEFGVHHAAGLALAHYERVLGKER
>Q6L732 1.14.11.37~~~kanJ~~~Kanamycin B dioxygenase~~~
MALAAPPGELTLALTPDDKTLDPASLDRALAILAEHGILVLTGMLRTRLTDQLRTAMLDDLPEVLRQQDVPTNFVPGHVQ
QDPPVRESLLFPDVLLNPVVYQITHAVLGADARNAVYSGNMNLPGSHEQPVHLDEPHLWPGISHPPYCLCVDVPLIDFTL
ENGSTEYWPGSHVLNPDECYDERGCVLPAELERRRAVAPPVRFPIPVGSVVIRDGRLWHRGVPNLSAAPRPLLAMTHYTE
WFDMPPIQLPDTVKSWVDGSDRHTHAHFVAGDVDHLTGDHPFAVR
>Q2MFU6 1.1.1.355~~~kanK~~~2'-dehydrokanamycin reductase~~~
MSSQLALRGPELSANLCKPEEDTLRVLVTGGSGNVGVGVVRALNAARHHVVVASRGYSPALLPEGVRAVRLERTEPDAYT
RLVAAEKPDAVIDLTCHDAADAAVTLRACAGVDRVVVVSSVTAAGPATTTPVTEATAAPPLSEYGIDKLAVEETVRAAWA
DGTSQALLVRLGAVYRLGADLDGQLAEDGCWLAHAAAGAPAVLADDGAARWNLLHADDAGAALAELLANDRARGVLVHLA
SRHPLPWRELYERVHHALGRPFNPVSVPAEWAAEQLEDAEFLAETSRWDQVFDLGLLDRLAPSYQERGGPSRVTEVALWL
IRQGRVGDAELGAEIQELPARLAAVRTAPGLV
>P05058 2.7.7.-~~~knt~~~Kanamycin nucleotidyltransferase~~~
MNGPIIMTREERMKIVHEIKERILDKYGDDVKAIGVYGSLGRQTDGPYSDIEMMCVMSTEEAEFSHEWTTGEWKVEVNFD
SEEILLDYASQVESDWPLTHGQFFSILPIYDSGGYLEKVYQTAKSVEAQKFHDAICALIVEELFEYAGKWRNIRVQGPTT
FLPSLTVQVAMAGAMLIGLHHRICYTTSASVLTEAVKQSDLPSGYDHLCQFVMSGQLSDSEKLLESLENFWNGIQEWTER
HGYIVDVSKRIPF
>P05057 2.7.7.-~~~knt~~~Kanamycin nucleotidyltransferase~~~
MNGPIIMTREERMKIVHEIKERILDKYGDDVKAIGVYGSLGRQTDGPYSDIEMMCVMSTEEAEFSHEWTTGEWKVEVNFD
SEEILLDYASQVESDWPLTHGQFFSILPIYDSGGYLEKVYQTAKSVEAQTFHDAICALIVEELFEYAGKWRNIRVQGPTT
FLPSLTVQVAMAGAMLIGLHHRICYTTSASVLTEAVKQSDLPSGYDHLCQFVMSGQLSDSEKLLESLENFWNGIQEWTER
HGYIVDVSKRIPF
>Q08429 ~~~kapB~~~Kinase-associated lipoprotein B~~~
MSTFETGSIVKGFYKTGVYIGEITACRPQHYLVKVKAVLTHPAQGDLHHPKQADVPFFHERKALAYGEQTNIPHHMVKPY
DGEVPDYTESLREATAQMRAKLNEDGSEWAKRSLHNLDILEKEYFNRP
>H8ESN0 2.3.1.293~~~kasA~~~3-oxoacyl-[acyl-carrier-protein] synthase 1~~~
MSQPSTANGGFPSVVVTAVTATTSISPDIESTWKGLLAGESGIHALEDEFVTKWDLAVKIGGHLKDPVDSHMGRLDMRRM
SYVQRMGKLLGGQLWESAGSPEVDPDRFAVVVGTGLGGAERIVESYDLMNAGGPRKVSPLAVQMIMPNGAAAVIGLQLGA
RAGVMTPVSACSSGSEAIAHAWRQIVMGDADVAVCGGVEGPIEALPIAAFSMMRAMSTRNDEPERASRPFDKDRDGFVFG
EAGALMLIETEEHAKARGAKPLARLLGAGITSDAFHMVAPAADGVRAGRAMTRSLELAGLSPADIDHVNAHGTATPIGDA
AEANAIRVAGCDQAAVYAPKSALGHSIGAVGALESVLTVLTLRDGVIPPTLNYETPDPEIDLDVVAGEPRYGDYRYAVNN
SFGFGGHNVALAFGRY
>P9WQD9 2.3.1.293~~~kasA~~~3-oxoacyl-[acyl-carrier-protein] synthase 1~~~COG0304
MSQPSTANGGFPSVVVTAVTATTSISPDIESTWKGLLAGESGIHALEDEFVTKWDLAVKIGGHLKDPVDSHMGRLDMRRM
SYVQRMGKLLGGQLWESAGSPEVDPDRFAVVVGTGLGGAERIVESYDLMNAGGPRKVSPLAVQMIMPNGAAAVIGLQLGA
RAGVMTPVSACSSGSEAIAHAWRQIVMGDADVAVCGGVEGPIEALPIAAFSMMRAMSTRNDEPERASRPFDKDRDGFVFG
EAGALMLIETEEHAKARGAKPLARLLGAGITSDAFHMVAPAADGVRAGRAMTRSLELAGLSPADIDHVNAHGTATPIGDA
AEANAIRVAGCDQAAVYAPKSALGHSIGAVGALESVLTVLTLRDGVIPPTLNYETPDPEIDLDVVAGEPRYGDYRYAVNN
SFGFGGHNVALAFGRY
>P9WQD7 2.3.1.294~~~kasB~~~3-oxoacyl-[acyl-carrier-protein] synthase 2~~~COG0304
MTELVTGKAFPYVVVTGIAMTTALATDAETTWKLLLDRQSGIRTLDDPFVEEFDLPVRIGGHLLEEFDHQLTRIELRRMG
YLQRMSTVLSRRLWENAGSPEVDTNRLMVSIGTGLGSAEELVFSYDDMRARGMKAVSPLTVQKYMPNGAAAAVGLERHAK
AGVMTPVSACASGAEAIARAWQQIVLGEADAAICGGVETRIEAVPIAGFAQMRIVMSTNNDDPAGACRPFDRDRDGFVFG
EGGALLLIETEEHAKARGANILARIMGASITSDGFHMVAPDPNGERAGHAITRAIQLAGLAPGDIDHVNAHATGTQVGDL
AEGRAINNALGGNRPAVYAPKSALGHSVGAVGAVESILTVLALRDQVIPPTLNLVNLDPEIDLDVVAGEPRPGNYRYAIN
NSFGFGGHNVAIAFGRY
>Q9ZGM4 1.11.1.21~~~katG1~~~Catalase-peroxidase 1~~~COG0376
MDGKVGSTTTGCPVIHGGMTSTGTSNTAWWPNALNLDILHQHDTKTNPMEKDFNYREEVKKLDFEALKKDLHALMTDSQA
WWPADWGHYGGLMIRMSWHAAGSYRVADGRGGAGTGNQRFAPLNSWPDNVNLDKARRLLWPIKKKYGNKISWADLIVLAG
TIAYESMGLKTFGFGFGREDIWHPEKDVYWGSEQEWLGAKRYDGKSRESLENPLAAVQMGLIYVNPEGVNGQPDPLRTAQ
DVRVTFGRMAMNDEETVALTAGGHTVGKCHGNGNAKLLGPNPEAANVEDQGLGWINKTTRGIGRNTVSSGIEGAWTTHPT
QWDNGYFYLLLNYDWELKKSPAGAWQWEPIHIKEEDKPVDVEDPAIRHNPIMTDADMAMKMDPVYRKIAERFYQDPDYFA
EVFARAWFKLTHRDMGPKTRYIGPDVPKEDLIWQDPVPAGNRAYDIAAAKAKIAASNLTIGEMVSTAWDSARTFRGSDKR
GGANGARIRLKPQKDWEGNEPQRLTKVLQILEDIATDTGASVADVIILAGNVGIEKAAKAAGFDIIVPFAPGRGDATDDM
TDAESFDVLEPLHDGYRNWLKKTYDVRPEELMLDRTQLMGLTAHEMTVLVGGLRVLGTNHNNTQYGVFTDRVGALTNDFF
VNLTDMANVWIPSKDNLYEIRDRKAGNIKWTATRVDLVFGSNSILRSYAEVYAQDDNKGKFIQDFVAAWTKVMNADRFDL
A
>Q4F6N6 1.11.1.21~~~katG2~~~Catalase-peroxidase 2~~~COG0376
MSNEGQCPFNHANGGGTTNRDWWPNELRLDLLSQHSSKTDPLDPGFNYAEAFNSLDLDALRKDLAALMTDSQDWWPADFG
HYGPLFVRMAWHSAGTYRMGDGRGGAGRGQQRFAPLNSWPDNVSLDKARRLLWPIKQKYGQKISWADLLILTGDVALTTM
GFKTFGYAGGREDTWEPDRDVYWGSETTWLGGDLRYDKGGACESQHGGNAGRNLENPLAAVQMGLIYVNPEGPDGNPDPV
AAAYDIREVFGRMAMNDEETVALIAGGHAFGKTHGAGPADNVGLEPEAAGLEQQGLGWKNSFGTGKGADTITSGLEVTWS
DTPTQWGMGFFKNLFGYEWELTKSPAGAHQWVAKNAEPTIPHAHDPSKKLLPTMLTTDLSLRFDPVYEKISRHFMDNPDV
FADAFARAWFKLTHRDMGPRARYLGPDVPTEELIWQDPIPAVDHVLVDDTDVAPLKETILASGLSVAELVSTAWASASTF
RGSDKRGGANGARIRLAPQKDWAVNEPARLAKVLKVLERIQGEFNSTQPGGKKISLADLIVLAGGAGIEQAAKRAGHDVV
VPFAPGRMDASQEQTDAHSFAVLEPVADGFRNFVKGKFAVPAEALLIDKAQLLTLTAPQMTALVGGLRVLNVQTGDEKHG
VFTDQPETLTVDFFRNLLDMATEWKPIAGEDTYEGRDRRTGELKWTGTRVDLVFGSNAVLRALSEVYASADGEAKFIRDF
VAAWVKVMNLDRFDLA
>Q9WXB9 1.11.1.21~~~katG2~~~Catalase-peroxidase 2~~~COG0376
MFKRTIPLFAAFTLAISPSIFPNYAHAQEDKPKTNQYWWPKMLDLSPLRQPNATSNPMGEKFNYAEEFNSLDLNAVIEDL
KKLMTTSQDWWPADYGNYGPLFIRMSWHAAGTYRIYDGRGGANGGFQRFAPQNSWPDNANLDKARRLLWPIKQKYGRKIS
WADLLVLAGNVAMESMGFKTIGFAGGREDAWEAININWGPEGKWLESKRQDKDGKLEKPLAATVMGLIYVNPEGPNGVPD
PLAAAEKIRETFGRMAMNDEETVALIAGGHAFGKTHGAASGKYLGPAPEAAGIEEQGFGWKNSYGSGKGKDTITSGLEGA
WTVTPTHWSHNYLQNLFNFNWVKTKSPGGAIQWVPENSNASSMVPDAFDPSKRHAPVMLTTDLALKFDPVYSKIAKRFLD
NPKEFDDAFARAWFKLIHRDMGPRSRYLGSLVPKEAMIWQDPVPPVDYKLVDANDIANLKGKILNSGLTTSELVKTAWAS
ASTFRGTDMRGGANGARIRLAPQKDWPANDPQELAKVLKTLESIQNNFNNAQADGKKISLADLIVLGGNAAIEQAAKQAG
YDIIVPFTPGRTDATQGMTDVKSFEVLEPKADGFRNYFDKSNNMSPPEMLVEKASLLKLSVPEMTVLVGGMRVLNANTGQ
NQYGVFTDKPGTLNNDFFINLLSMSTEWKKSSETEGIYEGYERKTGKLKWKATSVDLIFGANSELRAVAEAYATDDAKEK
FIQDFINAWVKVMTADRFDIKAANANINS
>A0QXX7 1.11.1.21~~~katG2~~~Catalase-peroxidase 2~~~COG0376
MSSDTSDSRPPNPDTKTASTSESENPAIPSPKPKSGAPLRNQDWWPNQIDVSRLHPHPPQGNPLGEDFDYAEEFAKLDVN
ALKADLTALMTQSQDWWPADYGHYGGLFIRMSWHSAGTYRIHDGRGGGGQGAQRFAPINSWPDNVSLDKARRLLWPIKQK
YGNKISWADLLVFTGNVALESMGFKTFGFGFGREDIWEPEEILFGEEDEWLGTDKRYGGGEQRQLAEPYGATTMGLIYVN
PEGPEGQPDPLAAAHDIRETFGRMAMNDEETAALIVGGHTFGKTHGAGDASLVGPEPEAAPIEQQGLGWKSSYGTGKGPD
TITSGLEVVWTNTPTKWDNSFLEILYGYEWELTKSPAGAWQFTAKDGAGAGTIPDPFGGPGRNPTMLVTDISMRVDPIYG
KITRRWLDHPEELSEAFAKAWYKLLHRDMGPISRYLGPWVAEPQLWQDPVPAVDHPLVDDQDIAALKSTVLDSGLSTGQL
IKTAWASAASYRNTDKRGGANGARVRLEPQKNWDVNEPAELATVLPVLERIQQDFNASASGGKKVSLADLIVLAGSAAIE
KAAKDGGYNVTVPFAPGRTDASQENTDVESFAVLEPRADGFRNYVRPGEKVQLEKMLLERAYFLGVTAPQLTALVGGLRA
LDVNHGGTKHGVFTDRPGALTNDFFVNLLDMGTEWKTSETTENVYEGVDRKTGQLKWTATANDLVFGSHSVLRAVAEVYA
QSDNGERFVNDFVKAWVKVMNNDRFDLK
>Q3JNW6 1.11.1.21~~~katG~~~Catalase-peroxidase~~~
MSNEAKCPFHQAAGNGTSNRDWWPNQLDLSILHRHSSLSDPMGKDFNYAQAFEKLDLAAVKRDLHALMTTSQDWWPADFG
HYGGLFIRMAWHSAGTYRTADGRGGAGEGQQRFAPLNSWPDNANLDKARRLLWPIKQKYGRAISWADLLILTGNVALESM
GFKTFGFAGGRADTWEPEDVYWGSEKIWLELSGGPNSRYSGDRQLENPLAAVQMGLIYVNPEGPDGNPDPVAAARDIRDT
FARMAMNDEETVALIAGGHTFGKTHGAGPASNVGAEPEAAGIEAQGLGWKSAYRTGKGADAITSGLEVTWTTTPTQWSHN
FFENLFGYEWELTKSPAGAHQWVAKGADAVIPDAFDPSKKHRPTMLTTDLSLRFDPAYEKISRRFHENPEQFADAFARAW
FKLTHRDMGPRARYLGPEVPAEVLLWQDPIPAVDHPLIDAADAAELKAKVLASGLTVSQLVSTAWAAASTFRGSDKRGGA
NGARIRLAPQKDWEANQPEQLAAVLETLEAIRTAFNGAQRGGKQVSLADLIVLAGCAGVEQAAKNAGHAVTVPFAPGRAD
ASQEQTDVESMAVLEPVADGFRNYLKGKYRVPAEVLLVDKAQLLTLSAPEMTVLLGGLRVLGANVGQSRHGVFTAREQAL
TNDFFVNLLDMGTEWKPTAADADVFEGRDRATGELKWTGTRVDLVFGSHSQLRALAEVYGSADAQEKFVRDFVAVWNKVM
NLDRFDLA
>Q939D2 1.11.1.21~~~katG~~~Catalase-peroxidase~~~COG0376
MSNEAKCPFHQAAGNGTSNRDWWPNQLDLSILHRHSSLSDPMGKDFNYAQAFEKLDLAAVKRDLHALMTTSQDWWPADFG
HYGGLFIRMAWHSAGTYRTADGRGGAGEGQQRFAPLNSWPDNANLDKARRLLWPIKQKYGRAISWADLLILTGNVALESM
GFKTFGFAGGRADTWEPEDVYWGSEKIWLELSGGPNSRYSGDRQLENPLAAVQMGLIYVNPEGPDGNPDPVAAARDIRDT
FARMAMNDEETVALIAGGHTFGKTHGAGPASNVGAEPEAAGIEAQGLGWKSAYRTGKGADAITSGLEVTWTTTPTQWSHN
FFENLFGYEWELTKSPAGAHQWVAKGADAVIPDAFDPSKKHRPTMLTTDLSLRFDPAYEKISRRFHENPEQFADAFARAW
FKLTHRDMGPRARYLGPEVPAEVLLWQDPIPAVDHPLIDAADAAELKAKVLASGLTVSQLVSTAWAAASTFRGSDKRGGA
NGARIRLAPQKDWEANQPEQLAAVLETLEAIRTAFNGAQRGGKQVSLADLIVLAGCAGVEQAAKNAGHAVTVPFAPGRAD
ASQEQTDVESMAVLEPVADGFRNYLKGKYRVPAEVLLVDKAQLLTLSAPEMTVLLGGLRVLGANVGQSRHGVFTAREQAL
TNDFFVNLLDMGTEWKPTAADADVFEGRDRATGELKWTGTRVDLVFGSHSQLRALAEVYGSADAQEKFVRDFVAVWNKVM
NLDRFDLA
>P13029 1.11.1.21~~~katG~~~Catalase-peroxidase~~~COG0376
MSTSDDIHNTTATGKCPFHQGGHDQSAGAGTTTRDWWPNQLRVDLLNQHSNRSNPLGEDFDYRKEFSKLDYYGLKKDLKA
LLTESQPWWPADWGSYAGLFIRMAWHGAGTYRSIDGRGGAGRGQQRFAPLNSWPDNVSLDKARRLLWPIKQKYGQKISWA
DLFILAGNVALENSGFRTFGFGAGREDVWEPDLDVNWGDEKAWLTHRHPEALAKAPLGATEMGLIYVNPEGPDHSGEPLS
AAAAIRATFGNMGMNDEETVALIAGGHTLGKTHGAGPTSNVGPDPEAAPIEEQGLGWASTYGSGVGADAITSGLEVVWTQ
TPTQWSNYFFENLFKYEWVQTRSPAGAIQFEAVDAPEIIPDPFDPSKKRKPTMLVTDLTLRFDPEFEKISRRFLNDPQAF
NEAFARAWFKLTHRDMGPKSRYIGPEVPKEDLIWQDPLPQPIYNPTEQDIIDLKFAIADSGLSVSELVSVAWASASTFRG
GDKRGGANGARLALMPQRDWDVNAAAVRALPVLEKIQKESGKASLADIIVLAGVVGVEKAASAAGLSIHVPFAPGRVDAR
QDQTDIEMFELLEPIADGFRNYRARLDVSTTESLLIDKAQQLTLTAPEMTALVGGMRVLGANFDGSKNGVFTDRVGVLSN
DFFVNLLDMRYEWKATDESKELFEGRDRETGEVKFTASRADLVFGSNSVLRAVAEVYASSDAHEKFVKDFVAAWVKVMNL
DRFDLL
>Q5NGV7 1.11.1.21~~~katG~~~Catalase-peroxidase~~~COG0376
MLKKIVTALGMSGMLLASSNAIAEDTTTKNDNLSPQSVDLSPLRNLNKLDSPMDKDYNYHQAFKKLDTEQLKKDMQDLLT
QSQDWWPADFGNYGPFFIRLSWHDAGTYRIYDGRGGANRGQQRFSPLNSWPDNVNLDKARQLLWPIKQKYGDAVSWSDLI
VLAGTVSLESMGMKPIGFAFGREDDWQGDDTNWGLSPEEIMSSNVRDGKLAPAYAATQMGLIYVNPEGPDGKPDIKGAAS
EIRQAFRAMGMTDKETVALIAGGHTFGKTHGAVPEDKVKQAIGPAPDKAPIEQQGLGWHNSYGTGNGDDTMGSGLEGSWT
STPTFWNHDFLHNLYNLDWKKTLSPAGAHQWTPTNAKPENMVPDAHKPGVKHKPIMFTTDLALKEDDGFNKYTQEFYNNP
EEFKEEFAKAWFKLTHRDMGPKSRYIGPWIPEQNFIWQDPVPAADYKQVSTQDIAQLEQDIINSGLTNQQLIKTAWDSAS
TYRKTDYRGGSNGARIALAPEKDWQMNEPAKLEVVLTKLKEIQTNFNNSKTDGTKVSLADLIVLGGNVGVEQAAKQAGYN
IQMPFVPGRTDATQAQTDIESFNYLKTKSDGFINYTDGSVSADKLPQTLVEKASMLDLNIPEMTVLVGGMRALDVNYDNS
QEGVLTTTPGQLNNSFFVNLLDMSTQWKKSDKKDGEYIGIDRKTGKQKWTASPVDLIFGSNSELKAVAQVYAENGNEQKF
VNDFAKAWHKVMMLGRFDVQQ
>P14412 1.11.1.21~~~katG~~~Catalase-peroxidase~~~
MENQNRQNAAQCPFHGSVTNQSSNRTTNKDWWPNQLNLSILHQHDRKTNPHDEEFNYAEEFQKLDYWALKEDLRKLMTES
QDWWPADYGHYGPLFIRMAWHSAGTYRIGDGRGGASTGTQRFAPLNSWPDNANLDKARRLLWPIKKKYGNKISWADLFIL
AGNVAIESMGGKTIGFGGGRVDVWHPEEDVYWGSEKEWLASERYSGDRELENPLAAVQMGLIYVNPEGPDGKPDPKAAAR
DIRETFRRMGMNDEETVALIAGGHTFGKAHGAGPATHVGPEPEAAPIEAQGLGWISSYGKGKGSDTITSGIEGAWTPTPT
QWDTSYFDMLFGYDWWLTKSPAGAWQWMAVDPDEKDLAPDAEDPSKKVPTMMMTTDLALRFDPEYEKIARRFHQNPEEFA
EAFARAWFKLTHRDMGPKTRYLGPEVPKEDFIWQDPIPEVDYELTEAEIEEIKAKILNSGLTVSELVKTAWASASTFRNS
DKRGGANGARIRLAPQKDWEVNEPERLAKVLSVYEDIQRELPKKVSIADLIVLGGSAAVEKAARDAGFDVKVPFFPGRGD
ATQEQTDVESFAVLEPFADGFRNYQKQEYSVPPEELLVDKAQLLGLTAPEMTVLVGGLRVLGANYRDLPHGVFTDRIGVL
TNDFFVNLLDMNYEWVPTDSGIYEIRDRKTGEVRWTATRVDLIFGSNSILRSYAEFYAQDDNQEKFVRDFINAWVKVMNA
DRFDLVKKARESVTA
>D8INT8 1.11.1.21~~~katG~~~Catalase-peroxidase~~~COG0376
MSNEAKCPFNHTAGSGTSNRDWWPKQLRVDLLAQHSSKSNPMGEDFDYAEAFKSLDLAAVKADLAKVMTDSQDWWPADFG
HYGPLFVRMAWHSAGTYRIGDGRGGAGRGQQRFAPLNSWPDNVNLDKARRLLWPVKQKYGNKISWADLLILTGNVALETM
GFKTFGFAGGRADVWEPDLDVYWGTESTWLGGDDRYGKGKGSSSQGEIPADAHRHGQEQARTAPAGRNLENPLAAVQMGL
IYVNPEGPEGNPDPLAAAHDIRETFARMAMDDEETVALIAGGHTFGKTHGAGDAKHVGREPEGEDMDSQGLGWKSSFGSG
VGGDTISSGLEVTWTQTPAQWSNYFFENLFKYEWELTKSPAGAHQWVAKGADAVIPHAHGGAPLLPTMLTTDLSLRFDPA
YEKISRRFLEHPEQFADAFARAWFKLTHRDLGPRSRYLGPEVPAEELIWQDPLPQAEGAQIDAADVAALKAKVLGSGLSV
PELVATAWASASTFRGGDMRGGANGARIRLAPQKDWAANQPAQLAKVLKTLEGIQSAFNQGGKKVSLADLIVLAGSAAVE
KAAQDAGVAVAVPFRAGRVDASQEQTDAASFAPLEPIVDGFRNFQKQRYAVRGEDMLIDKAQQLTLSAPEMTVLVGGLRV
LGNNVGGSTKGMFTDRVGVLSNDFFVNLLDMATEWKSTSPAQEEFEGRDRKTGAVKWAGTRVDLVFGSNAVLRALAEVYA
SADAKEKFVKDFVAAWVKVMELDRFDLK
>P46817 1.11.1.21~~~katG~~~Catalase-peroxidase~~~
MPEQHPPITETTTGAASNGCPVVGHMKYPVEGGGNQDWWPNRLNLKVLHQNPAVADPMGAAFDYAAEVATIDVDALTRDI
EEVMTTSQPWWPADYGHYGPLFIRMAWHAAGTYRIHDGRGGAGGGMQRFAPLNSWPDNASLDKARRLLWPVKKKYGKKLS
WADLIVFAGNCALESMGFKTFGFGFGRVDQWEPDEVYWGKEATWLGDERYSGKRDLENPLAAVQMGLIYVNPEGPNGNPD
PMAAAVDIRETFRRMAMNDVETAALIVGGHTFGKTHGAGPADLVGPEPEAAPLEQMGLGWKSSYGTGTGKDAITSGIEVV
WTNTPTKWDNSFLEILYGYEWELTKSPAGAWQYTAKDGAGAGTIPDPFGGPGRSPTMLATDLSLRVDPIYERITRRWLEH
PEELADEFAKAWYKLIHRDMGPVARYLGPLVPKQTLLWQDPVPAVSHDLVGEAEIASLKSQILASGLTVSQLVSTAWAAA
SSFRGSDKRGGANGGRIRLQPQVGWEVNDPDGDLRKVIRTLEEIQESFNSAAPGNIKVSFADLVVLGGCAAIEKAAKAAG
HNITVPFTPGRTDASQEQTDVESFAVLEPKADGFRNYLGKGNPLPAEYMLLDKANLLTLSAPEMTVLVGGLRVLGANYKR
LPLGVFTEASESLTNDFFVNLLDMGITWEPSPADDGTYQGKDGSGKVKWTGSRVDLVFGSNSELRALVEVYGADDAQPKF
VQDFVAAWDKVMNLDRFDVR
>H8F3Q9 1.11.1.21~~~katG~~~Catalase-peroxidase~~~
MPEQHPPITETTTGAASNGCPVVGHMKYPVEGGGNQDWWPNRLNLKVLHQNPAVADPMGAAFDYAAEVATIDVDALTRDI
EEVMTTSQPWWPADYGHYGPLFIRMAWHAAGTYRIHDGRGGAGGGMQRFAPLNSWPDNASLDKARRLLWPVKKKYGKKLS
WADLIVFAGNCALESMGFKTFGFGFGRVDQWEPDEVYWGKEATWLGDERYSGKRDLENPLAAVQMGLIYVNPEGPNGNPD
PMAAAVDIRETFRRMAMNDVETAALIVGGHTFGKTHGAGPADLVGPEPEAAPLEQMGLGWKSSYGTGTGKDAITSGIEVV
WTNTPTKWDNSFLEILYGYEWELTKSPAGAWQYTAKDGAGAGTIPDPFGGPGRSPTMLATDLSLRVDPIYERITRRWLEH
PEELADEFAKAWYKLIHRDMGPVARYLGPLVPKQTLLWQDPVPAVSHDLVGEAEIASLKSQIRASGLTVSQLVSTAWAAA
SSFRGSDKRGGANGGRIRLQPQVGWEVNDPDGDLRKVIRTLEEIQESFNSAAPGNIKVSFADLVVLGGCAAIEKAAKAAG
HNITVPFTPGRTDASQEQTDVESFAVLEPKADGFRNYLGKGNPLPAEYMLLDKANLLTLSAPEMTVLVGGLRVLGANYKR
LPLGVFTEASESLTNDFFVNLLDMGITWEPSPADDGTYQGKDGSGKVKWTGSRVDLVFGSNSELRALVEVYGADDAQPKF
VQDFVAAWDKVMNLDRFDVR
>P9WIE5 1.11.1.21~~~katG~~~Catalase-peroxidase~~~COG0376
MPEQHPPITETTTGAASNGCPVVGHMKYPVEGGGNQDWWPNRLNLKVLHQNPAVADPMGAAFDYAAEVATIDVDALTRDI
EEVMTTSQPWWPADYGHYGPLFIRMAWHAAGTYRIHDGRGGAGGGMQRFAPLNSWPDNASLDKARRLLWPVKKKYGKKLS
WADLIVFAGNCALESMGFKTFGFGFGRVDQWEPDEVYWGKEATWLGDERYSGKRDLENPLAAVQMGLIYVNPEGPNGNPD
PMAAAVDIRETFRRMAMNDVETAALIVGGHTFGKTHGAGPADLVGPEPEAAPLEQMGLGWKSSYGTGTGKDAITSGIEVV
WTNTPTKWDNSFLEILYGYEWELTKSPAGAWQYTAKDGAGAGTIPDPFGGPGRSPTMLATDLSLRVDPIYERITRRWLEH
PEELADEFAKAWYKLIHRDMGPVARYLGPLVPKQTLLWQDPVPAVSHDLVGEAEIASLKSQIRASGLTVSQLVSTAWAAA
SSFRGSDKRGGANGGRIRLQPQVGWEVNDPDGDLRKVIRTLEEIQESFNSAAPGNIKVSFADLVVLGGCAAIEKAAKAAG
HNITVPFTPGRTDASQEQTDVESFAVLEPKADGFRNYLGKGNPLPAEYMLLDKANLLTLSAPEMTVLVGGLRVLGANYKR
LPLGVFTEASESLTNDFFVNLLDMGITWEPSPADDGTYQGKDGSGKVKWTGSRVDLVFGSNSELRALVEVYGADDAQPKF
VQDFVAAWDKVMNLDRFDVR
>Q9R2E9 1.11.1.21~~~katG~~~Catalase-peroxidase~~~
MPEATEHPPIGEAQTEPAQSGCPMVIKPPVEGGSNRDWWPNAVNLKMLQKDPEVIDPIDEGYDYREAVQTLDVDQLARDF
DELCTNSQDWWPADFGHYGPLFIRMSWHAAGTYRVQDGRGGAGKGMQRFAPLNSWPDNVSLDKARRLLWPLKKKYGKKLS
WSDLIVYAGNRAMENMGFKTAGFAFGRPDYWEPEEDVYWGAEHEWLGSQDRYAGANGDRTKLENPLGASHMGLIYVNPEG
PEGNPDPIAAAIDIRETFGRMAMNDVETAALIVGGHTFGKTHGATDIVNGPEPEAAPLEQMGLGWSNPGVGIDTVSSGLE
VTWTHTPTKWDNSFLEILYGNEWELFKSPAGANQWRPKDNGWADSVPMAQGTGKTHPAMLTTDLSMRMDPIYGEITRRWL
DHPEELAEEYAKAWFKLLHRDMGPVQRYLGPLVPTQTWLWQDIVPAGKPLSDADVATLKGAIADSGLTVQQLVSTAWKAA
SSFRISDMRGGANGGRIRLQPQLGWESNEPDELAQVISKLEEIQGSSGIDVSFADLVVLGGNVGIETAAKAAGFDIEVPF
SSGRGDATQEQTDVEAFSYLEPKADGFRNYVGKGLNLPAEYQLIDQANLLNLSAPQMTVLIGGLRALGITHGDSKLGVLT
DTPGQLTNDYFVNLTDMGVKWAPAPADDGTYVGTDRDTGEVKYTASRVDLLFGSNSQLRALAEVYAEDDSRDKFVKDFVA
AWVNVMDADRYDIGKGA
>Q2JZT8 1.11.1.21~~~katG~~~Catalase-peroxidase~~~
MDNPTDTAGKCPVAHGNKPRGPSNRDWWPNQLNVQILHHNSGRADPLGKDFDYAEEFKKLDLDALKKDLHALMTDSQDWW
PADFGHYGGLFIRMAWHSAGTYRITDGRGGAGQGQQRFAPLNSWPDNANLDKARRLLWPIKQKYGNRISWADLLILTGNV
ALESMGFKTFGFAGGRADVWEPEELYWGPEGTWLGDERYSGERQLAEPLGAVQMGLIYVNPEGPNGNPDPVAAARDIRET
FARMAMNDEETVALIAGGHTFGKTHGAGDPSFIGAEPEGGAIEDQGLGWKSSFGTGVGKDAITAGLEVTWSQTPTKWSNY
FFENLFAYEWELTKSPAGAHQWRAKNAEASIPDAYEPGKKHVPTMLTTDLSLRFDPIYEKISRRFLENPDQFADAFARAW
FKLTHRDMGPKVRYLGPEVPAEDLIWQDVIPAVDHPLVDDKDIAELKAKVLATGLTVQELVSTAWASASTFRGSDKRGGA
NGARIRLAPQKDWEANQPAQLAKVLGVLEGIQKDFNAAQTGAKKISLADLIVLAGAAGVEKAAAAGGNAVSVPLTPGRMD
ASEAQTDAHSFAPLEPRIDGFRNYVNGKRLQFMKPEEALVDRAQLLTLTGPEMTVLVGGLRVLKAGNPEHGVFTSRPETL
TNDFFVNLLDVATQWVPATGKEGVYEGRDRKTGAAKWTGTRVDLIFGSHSQLRAFAEVYGQADAKQKFVKDFVAAWNKVM
NADRFDLV
>P37743 1.11.1.21~~~katG~~~Catalase-peroxidase~~~
MDGKDKATGKCPVMHGAMTAAGVSNTSWWPNALNLDILHQHDTKGNPLNGFDYRAAVKGLDVGLRADLHALMTDSQPWWP
ADWGHYGGLMIRMAWHAAGSYRAADGRGGGNTGKPARFAPLNSWPDNVSLDKARRLLWPIKKKYGNAVSWADLILFAGTV
AYESMGLKTFGFGFGREDIWAPEKDVYWGAEKDWLAPSDGRYGDLAKPETMENPLAAVQMGLIYVNPEGVNGQPDPARTA
LHIRETFARMGMNDEETVALTAGGHTVGKAHGNGDAKALGPDPEAADVTVRALAGRTRIWAARRRRPSPRGSRAPGPRIR
RAGTWAISRCSSGHDWELTKSPAGAWQWKPVTIAEEAKPLDATDLTTRHDPLMTDADMAMKVDPSTMRSVRSSWPIRPPS
TTLSRAPGSSCCIATWGRRRATSAPMCPPRIWSAGPGAAGPTGWDVAKVKAQIAASGLSVADLVATAWDSARTFRQSDYR
GGANGARIRLAPQKDWAGNEPERLAGACGARTDRGGAGASVADVIVLAGNLGVEQAAAGVSRWRCPSPPVAAMRAAMTDG
PSLTCWSRCMTASATG
>Q9RJH9 1.11.1.21~~~katG~~~Catalase-peroxidase~~~COG0376
MSENHDAIVTDAKTEETDGCPVAHGRAPHPTQGGGNRQWWPERLNLKILAKNPAVANPLGEEFDYAEAFEALDLAAVKRD
IAEVLTTSQDWWPADFGNYGPLMIRMAWHSAGTYRISDGRGGAGAGQQRFAPLNSWPDNGNLDKARRLLWPVKKKYGQNL
SWADLLVLTGNVALETMGFETFGFAGGRADVWEAEEDVYWGPETTWLDDRRYTGDRELENPLGAVQMGLIYVNPEGPNGN
PDPIAAARDIRETFRRMAMNDEETVALIAGGHTFGKTHGAGPADAVGDDPEAAAMEQQGLGWKSTHGTGKGGDAITSGLE
VTWTSTPTQWGNGFFKNLFEFEYELEQSPAGANQWVAKDAPEIIPDAHDPAKKHRPRMLTTDLSLRLDPIYGPISRRFYE
NPEEFADAFARAWFKLTHRDMGPKSLYLGPEVPEETLIWQDPLPEPEGEVIDAEDVATLKTKLLESGLSVSQLVTTAWAS
ASTFRGSDKRGGANGARIRLEPQRGWEVNEPDELAQVLRVLEGVQREFNSGSGAKKVSLADLIVLGGSAAVEKAAKEAGF
PVEVPFAAGRVDATEEHTDAESFEALEPTADGFRNYLGKGNRLPAEFLLLDRANLLTLSAPEMTVLVGGLRVLGAGHQQS
QLGVFTRTPGSLTNDFFVNLLDLGTTWKSTSEDRTTFEGRDAATGEVKWAGSRADLVFGSNAELRALAEVYASDDAGEKF
VHDFVAAWVKVMNLDRFDLA
>O87864 1.11.1.21~~~katG~~~Catalase-peroxidase~~~
MTENHDAIVTDAKSEGSGGCPVAHDRALHPTQGGGNRQWWPERLNLKILAKNPAVANPLDEDFDYAEAFKALDLAAVKRD
IAEVLTTSQDWWPADFGNYGPLMIRMAWHSAGTYRISDGRGGAGAGQQRFAPLNSWPDNGNLDKARRLLWPVKKKYGQSI
SWADLLILTGNVALETMGFKTFGFGGGRADVWEAEEDVYWGPETTWLDDRRYTGDRELENPLGAVQMGLIYVNPEGPNGN
PDPIAAARDIRETFRRMAMNDEETVALIAGGHTFGKTHGAGPADHVGADPEAASLEEQGLGWRSTYGTGKGADAITSGLE
VTWTSTPTQWSNGFFKNLFEYEYELEQSPAGAHQWVAKNAPEIIPDAHDPSKKHRPRMLTTDLSLRFDPIYEPISRRFYE
NPEEFADAFARAWYKLTHRDMGPKSLYLGPEVPEETLLWQDPLPEREGELIDDADIAILKTKLLESGLSVSQLVTTAWAS
ASTFRASDKRGGANGARIRLAPQRGWEVNDPDQLAQVLRTLENVQQEFNASSGAKKVSLADLIVLGGAAGVEKAAKEAGF
EIQVPFTPGRVDATEEHTDVESFEALEPTADGFRNYLGKGNRLPAEYLLLDKANLLNLSAPEMTVLVGGLRVLGANHQQS
QLGVFTKTPGVLTNDFFVNLLDMGTTWKATSEDQTTFEGRDAATGEVKWAGSRADLVFGSNSELRALAEVYASDDAKEKF
VKDFVAAWHKVMDADRFDLV
>Q31MN3 1.11.1.21~~~katG~~~Catalase-peroxidase~~~COG0376
MTATQGKCPVMHGGATTVNISTAEWWPKALNLDILSQHDRKTNPMGPDFNYQEEVKKLDVAALKQDLQALMTDSQDWWPA
DWGHYGGLMIRLTWHAAGTYRIADGRGGAGTGNQRFAPLNSWPDNTNLDKARRLLWPIKQKYGNKLSWADLIAYAGTIAY
ESMGLKTFGFAFGREDIWHPEKDIYWGPEKEWVPPSTNPNSRYTGDRELENPLAAVTMGLIYVNPEGVDGNPDPLKTAHD
VRVTFARMAMNDEETVALTAGGHTVGKCHGNGNAALLGPEPEGADVEDQGLGWINKTQSGIGRNAVTSGLEGAWTPHPTQ
WDNGYFRMLLNYDWELKKSPAGAWQWEPINPREEDLPVDVEDPSIRRNLVMTDADMAMKMDPEYRKISERFYQDPAYFAD
VFARAWFKLTHRDMGPKARYIGPDVPQEDLIWQDPIPAGNRNYDVQAVKDRIAASGLSISELVSTAWDSARTYRNSDKRG
GANGARIRLAPQKDWEGNEPDRLAKVLAVLEGIAAATGASVADVIVLAGNVGVEQAARAAGVEIVLPFAPGRGDATAEQT
DTESFAVLEPIHDGYRNWLKQDYAATPEELLLDRTQLLGLTAPEMTVLIGGLRVLGTNHGGTKHGVFTDREGVLTNDFFV
NLTDMNYLWKPAGKNLYEICDRKTNQVKWTATRVDLVFGSNSILRAYSELYAQDDNKEKFVRDFVAAWTKVMNADRFDLD
>Q5MZ99 1.11.1.21~~~katG~~~Catalase-peroxidase~~~COG0376
MTATQGKCPVMHGGATTVNISTAEWWPKALNLDILSQHDRKTNPMGPDFNYQEEVKKLDVAALKQDLQALMTDSQDWWPA
DWGHYGGLMIRLTWHAAGTYRIADGRGGAGTGNQRFAPLNSWPDNTNLDKARRLLWPIKQKYGNKLSWADLIAYAGTIAY
ESMGLKTFGFAFGREDIWHPEKDIYWGPEKEWVPPSTNPNSRYTGDRELENPLAAVTMGLIYVNPEGVDGNPDPLKTAHD
VRVTFARMAMNDEETVALTAGGHTVGKCHGNGNAALLGPEPEGADVEDQGLGWINKTQSGIGRNAVTSGLEGAWTPHPTQ
WDNGYFRMLLNYDWELKKSPAGAWQWEPINPREEDLPVDVEDPSIRRNLVMTDADMAMKMDPEYRKISERFYQDPAYFAD
VFARAWFKLTHRDMGPKARYIGPDVPQEDLIWQDPIPAGNRNYDVQAVKDRIAASGLSISELVSTAWDSARTYRNSDKRG
GANGARIRLAPQKDWEGNEPDRLAKVLAVLEGIAAATGASVADVIVLAGNVGVEQAARAAGVEIVLPFAPGRGDATAEQT
DTESFAVLEPIHDGYRNWLKQDYAATPEELLLDRTQLLGLTAPEMTVLIGGLRVLGTNHGGTKHGVFTDREGVLTNDFFV
NLTDMNYLWKPAGKNLYEICDRKTNQVKWTATRVDLVFGSNSILRAYSELYAQDDNKEKFVRDFVAAWTKVMNADRFDLD
>P73911 1.11.1.21~~~katG~~~Catalase-peroxidase~~~COG0376
MGTQPARKLRNRVFPHPHNHRKEKPMANDQVPASKCPVMHGANTTGQNGNLNWWPNALNLDILHQHDRKTNPMDDGFNYA
EAFQQLDLAAVKQDLHHLMTDSQSWWPADWGHYGGLMIRMAWHAAGTYRIADGRGGAATGNQRFAPLNSWPDNVNLDKAR
RLLWPIKKKYGNKLSWGDLIILAGTMAYESMGLKVYGFAGGREDIWHPEKDIYWGAEKEWLASSDHRYGSEDRESLENPL
AAVQMGLIYVNPEGVDGHPDPLCTAQDVRTTFARMAMNDEETVALTAGGHTVGKCHGNSKAELIGPEPEGADVVEQGLGW
HNQNGKGVGRETMSSGIEGAWTTHPTQWDNGYFYMLFNHEWELKKSPAGAWQWEPVNIKEEDKPVDVEDPNIRHNPIMTD
ADMAMIKDPIYRQISERFYREPDYFAEVFAKAWFKLTHRDLGPKSRYLGPDVPQEDLIWQDPIPPVDYTLSEGEIKELEQ
QILASGLTVSELVCTAWDSARTFRSSDYRGGANGARIRLEPQKNWPGNEPTRLAKVLAVLENIQANFAKPVSLADLIVLG
GGAAIAKAALDGGIEVNVPFLPGRGDATQAMTDAESFTPLEPIHDGYRNWLKQDYAVSPEELLLERTQLMGLTAPEMTVL
IGGMRVLGTNHGGTKHGVFTDRVGVLSNDFFVNLTDMAYQWRPAGNNLYEIGDRQTGEVKWTATKVDLVFGSNSILRSYA
EVYAQDDNREKFVRDFVAAWTKVMNADRFDLPRG
>Q9X6B0 1.11.1.21~~~katG~~~Catalase-peroxidase~~~COG0376
MLKKILPVLITLAIVHNTPTAWAAEAPKTDSFYLPKSLDLSPLRLHNIESNPYGKDFNYAQQFKTLDLEAVKKDIKTVLT
TSQDWWPADYGNYGPFFIRMAWHGAGTYRIYDGRGGADGGQQRFEPLNSWPDNANLDKARRLLWPIKKKYGAKISWGDLM
VLTGNVALESMGFKTLGFAGGREDDWQSDLVYWGAGNKMLSDNRDKNGKLPKPLAATQMGLIYVNPEGPNGKPDPVAAAK
DIREAFARMAMNDEETVALIAGGHTFGKAHGAASPEKCLGAAPGEAGLEQQGLGWANKCGSGNGKDTITSGLEGAWTTDP
THFTMQYLSNLYKHEWVLTKSPAGAWQWKPKNAANVVPDATDPTKFHPLMMFTTDIALKVDPEYKKITTRFLENPEEFKM
AFARAWFKLTHRDMGPAARYLGDEVPKETFIWQDPLPAANYKMIDSADISELKDKILKTGLSDTKLIKTAWASASTFRGT
DFRGGDNGARIRLAPQKDWPVNDPAELHSVLAALMEVQNNFNKDRSDGKKVSLSDLIVLGGNAAIEDAAKKAGYSISIPF
TPGRTDASQEETDVSSFAVLEPTADGFRNYYDAKRNTLSPIASLIDRANKLELTVPEMTVLIGGLRVLDVNSGGSKAGVL
TNTPGQLNNNFFVNLLDMSTKWTKSPKAEGYFDGYDRKTGKLKWTASSVDLVFGSNPELRAVAEVYASDDAKEKFVHDFT
KVWEKVMNLDRFDIKNN
>B0VH76 2.6.1.111~~~kat~~~3-aminobutyryl-CoA aminotransferase~~~COG0001
MAEKLKLARSMSLFEEAKQLVPGGVAGIRRPYNFVPGEYPIFFDHGKGGRVVDVDGNEYIDFLCAYGPIIIGYREDEIDD
AVINQIKNKGFCFSLTQEMQNTLVKKLRELIPCCEMAALVKTGSDATTIAIRVARGYTGKTKIARYGYHGWHDWCVEVKG
GIPPKLYEDIYEFHYNDLDSLKAILEANKDDMAGIIITPIGHPNGAEVQMPKPGYLEAVRELANQYHCLLIFDEIRSGFR
CSLGGAQKLFGVTPDLSTFGKAMANGYAIAALVGKEEYMQVLADKVFLSSTFFPNSDGIVAAIKTIEILERDRILDVVAA
KGRKFGAEVEKVVEESGVPVNFTGAPWMPYITFKKDEAGLYKKLRTEYYTQLIRHNVFMQPYHHGYICYRHTDEDLAYTV
EAIRESLAEVKKML
>A0A139GI49 ~~~kgpE~~~Kawaguchipeptin peptide~~~
MKNPTLLPKLTAPVERPAVTSSDLKQASSVDAAWLNGDNNWSTPFAGVNAAWLNGDNNWSTPFAGVNAAWLNGDNNWSTP
FAADGAE
>P0AB74 4.1.2.40~~~kbaY~~~D-tagatose-1,6-bisphosphate aldolase subunit KbaY~~~COG0191
MSIISTKYLLQDAQANGYAVPAFNIHNAETIQAILEVCSEMRSPVILAGTPGTFKHIALEEIYALCSAYSTTYNMPLALH
LDHHESLDDIRRKVHAGVRSAMIDGSHFPFAENVKLVKSVVDFCHSQDCSVEAELGRLGGVEDDMSVDAESAFLTDPQEA
KRFVELTGVDSLAVAIGTAHGLYSKTPKIDFQRLAEIREVVDVPLVLHGASDVPDEFVRRTIELGVTKVNVATELKIAFA
GAVKAWFAENPQGNDPRYYMRVGMDAMKEVVRNKINVCGSANRISA
>Q9KIP8 4.1.2.40~~~kbaY~~~D-tagatose-1,6-bisphosphate aldolase subunit KbaY~~~COG0191
MSIISTKYLLQDAQANGYAVPAFNIHNAETIQAILEVCSEMRSPVILAGTPGTFKHIALEEIYALCSAYSTTYNMPLALH
LDHHESLDDIRRKVHAGVRSAMIDGSHFPFAENVKLVKSVVDFCHSQDCSVEAELGRLGGVEDDMSVDAESAFLTDPQEA
KRFVELTGVDSLAVAIGTAHGLYSKTPKIDFQRLAEIREVVDVPLVLHGASDVPDEFVRRTIELGVTKVNVATELKIAFA
GAVKAWFAENPQGNDPRNYMRVGMDAMKEVVRNKINVCGSANRISA
>P0C8K0 ~~~kbaZ~~~D-tagatose-1,6-bisphosphate aldolase subunit KbaZ~~~COG4573
MKHLTEMVRQHKAGKTNGIYAVCSAHPLVLEAAIRYASANQTPLLIEATSNQVDQFGGYTGMTPADFRGFVCQLADSLNF
PQDALILGGDHLGPNRWQNLPAAQAMANADDLIKSYVAAGFKKIHLDCSMSCQDDPIPLTDDIVAERAARLAKVAEETCL
EHFGEADLEYVIGTEVPVPGGAHETLSELAVTTPDAARATLEAHRHAFEKQGLNAIWPRIIALVVQPGVEFDHTNVIDYQ
PAKASALSQMVENYETLIFEAHSTDYQTPQSLRQLVIDHFAILKVGPALTFALREALFSLAAIEEELVPAKACSGLRQVL
EDVMLDRPEYWQSHYHGDGNARRLARGYSYSDRVRYYWPDSQIDDAFAHLVRNLADSPIPLPLISQYLPLQYVKVRSGEL
QPTPRELIINHIQDILAQYHTACEGQ
>P0C8K1 ~~~kbaZ~~~D-tagatose-1,6-bisphosphate aldolase subunit KbaZ~~~COG4573
MKHLTEMVRQHKAGKTNGIYAVCSAHPLVLEAAIRYASANQTPLLIEATSNQVDQFGGYTGMTPADFRGFVCQLADSLNF
PQDALILGGDHLGPNRWQNLPAAQAMANADDLIKSYVAAGFKKIHLDCSMSCQDDPIPLTDDIVAERAARLAKVAEETCL
EHFGEADLEYVIGTEVPVPGGAHETLSELAVTTPDAARATLEAHRHAFEKQGLNAIWPRIIALVVQPGVEFDHTNVIDYQ
PAKASALSQMVENYETLIFEAHSTDYQTPQSLRQLVIDHFAILKVGPALTFALREALFSLAAIEEELVPAKACSGLRQVL
EDVMLDRPEYWQSHYHGDGNARRLARGYSYSDRVRYYWPDSQIDDAFAHLVRNLADSPIPLPLISQYLPLQYVKVRSGEL
QPTPRELIINHIQDILAQYHTACEGQ
>P0AB77 2.3.1.29~~~kbl~~~2-amino-3-ketobutyrate coenzyme A ligase~~~COG0156
MRGEFYQQLTNDLETARAEGLFKEERIITSAQQADITVADGSHVINFCANNYLGLANHPDLIAAAKAGMDSHGFGMASVR
FICGTQDSHKELEQKLAAFLGMEDAILYSSCFDANGGLFETLLGAEDAIISDALNHASIIDGVRLCKAKRYRYANNDMQE
LEARLKEAREAGARHVLIATDGVFSMDGVIANLKGVCDLADKYDALVMVDDSHAVGFVGENGRGSHEYCDVMGRVDIITG
TLGKALGGASGGYTAARKEVVEWLRQRSRPYLFSNSLAPAIVAASIKVLEMVEAGSELRDRLWANARQFREQMSAAGFTL
AGADHAIIPVMLGDAVVAQKFARELQKEGIYVTGFFYPVVPKGQARIRTQMSAAHTPEQITRAVEAFTRIGKQLGVIA
>P0ADE6 ~~~kbp~~~Potassium binding protein Kbp~~~COG1652
MGLFNFVKDAGEKLWDAVTGQHDKDDQAKKVQEHLNKTGIPDADKVNIQIADGKATVTGDGLSQEAKEKILVAVGNISGI
ASVDDQVKTATPATASQFYTVKSGDTLSAISKQVYGNANLYNKIFEANKPMLKSPDKIYPGQVLRIPEE
>B0VHH0 2.3.1.247~~~kce~~~3-keto-5-aminohexanoate cleavage enzyme~~~COG3246
MEPLILTAAITGAETTRADQPNLPITPEEQAKEAKACFEAGARVIHLHIREDDGRPSQRLDRFQEAISAIREVVPEIIIQ
ISTGGAVGESFDKRLAPLALKPEMATLNAGTLNFGDDIFINHPADIIRLAEAFKQYNVVPEVEVYESGMVDAVARLIKKG
IITQNPLHIQFVLGVPGGMSGKPKNLMYMMEHLKEEIPTATWAVAGIGRWHIPTSLIAMVTGGHIRCGFEDNIFYHKGVI
AESNAQLVARLARIAKEIGRPLATPEQAREILALNK
>Q8RHX2 2.3.1.247~~~kce~~~3-keto-5-aminohexanoate cleavage enzyme~~~COG3246
MMEKLIITAAICGAEVTKEHNPAVPYTVEEIAREAESAYKAGASIIHLHVREDDGTPTQDKERFRKCIEAIREKCPDVII
QPSTGGAVGMTDLERLQPTELHPEMATLDCGTCNFGGDEIFVNTENTIKNFGKILIERGVKPEIEVFDKGMIDYAIRYQK
QGFIQKPMHFDFVLGVQMSASARDLVFMSESIPEGSTWTVAGVGRHQFQMAALAIVMGGHVRVGFEDNVYIDKGILAKSN
GELVERVVRLAKELGREIATPDEARQILSLKK
>P31069 ~~~kch~~~Voltage-gated potassium channel Kch~~~COG1226
MSHWATFKQTATNLWVTLRHDILALAVFLNGLLIFKTIYGMSVNLLDIFHIKAFSELDLSLLANAPLFMLGVFLVLNSIG
LLFRAKLAWAISIILLLIALIYTLHFYPWLKFSIGFCIFTLVFLLILRKDFSHSSAAAGTIFAFISFTTLLFYSTYGALY
LSEGFNPRIESLMTAFYFSIETMSTVGYGDIVPVSESARLFTISVIISGITVFATSMTSIFGPLIRGGFNKLVKGNNHTM
HRKDHFIVCGHSILAINTILQLNQRGQNVTVISNLPEDDIKQLEQRLGDNADVIPGDSNDSSVLKKAGIDRCRAILALSD
NDADNAFVVLSAKDMSSDVKTVLAVSDSKNLNKIKMVHPDIILSPQLFGSEILARVLNGEEINNDMLVSMLLNSGHGIFS
DNDELETKADSKESAQK
>P0A333 ~~~kcsA~~~pH-gated potassium channel KcsA~~~COG1226
MPPMLSGLLARLVKLLLGRHGSALHWRAAGAATVLLVIVLLAGSYLAVLAERGAPGAQLITYPRALWWSVETATTVGYGD
LYPVTLWGRLVAVVVMVAGITSFGLVTAALATWFVGREQERRGHFVRHSEKAAEEAYTRTTRALHERFDRLERMLDDNRR
>P0A334 ~~~kcsA~~~pH-gated potassium channel KcsA~~~
MPPMLSGLLARLVKLLLGRHGSALHWRAAGAATVLLVIVLLAGSYLAVLAERGAPGAQLITYPRALWWSVETATTVGYGD
LYPVTLWGRLVAVVVMVAGITSFGLVTAALATWFVGREQERRGHFVRHSEKAAEEAYTRTTRALHERFDRLERMLDDNRR
>P38493 2.7.4.25~~~cmk~~~Cytidylate kinase~~~COG0283
MEKKLSIAIDGPAAAGKSTVAKIVAEKKSYIYIDTGAMYRAITYAALQENVDLTDEEKLAELLKRTDIELITTKDGQKVF
VNGTDVTEAIRTDEISNQVSIAAKHRSVREEMVKRQQQLGEKGGVVMDGRDIGTHVLPNAEVKIFLLASVEERAKRRYEE
NVKKGFDVNYETLIEEIARRDKLDSEREVSPLRKAEDALEIDTTSLSIQEVADKILEAVEQKSR
>P0A6I0 2.7.4.25~~~cmk~~~Cytidylate kinase~~~COG0283
MTAIAPVITIDGPSGAGKGTLCKAMAEALQWHLLDSGAIYRVLALAALHHHVDVASEDALVPLASHLDVRFVSTNGNLEV
ILEGEDVSGEIRTQEVANAASQVAAFPRVREALLRRQRAFRELPGLIADGRDMGTVVFPDAPVKIFLDASSEERAHRRML
QLQEKGFSVNFERLLAEIKERDDRDRNRAVAPLVPAADALVLDSTTLSIEQVIEKALQYARQKLALA
>P9WPA9 2.7.4.25~~~cmk~~~Cytidylate kinase~~~COG0283
MSRLSAAVVAIDGPAGTGKSSVSRRLARELGARFLDTGAMYRIVTLAVLRAGADPSDIAAVETIASTVQMSLGYDPDGDS
CYLAGEDVSVEIRGDAVTRAVSAVSSVPAVRTRLVELQRTMAEGPGSIVVEGRDIGTVVFPDAPVKIFLTASAETRARRR
NAQNVAAGLADDYDGVLADVRRRDHLDSTRAVSPLQAAGDAVIVDTSDMTEAEVVAHLLELVTRRSEAVR
>P63807 2.7.4.25~~~cmk~~~Cytidylate kinase~~~
MKAINIALDGPAAAGKSTIAKRVASELSMIYVDTGAMYRALTYKYLKLNKTEDFAKLVDQTTLDLTYKADKGQCVILDNE
DVTDFLRNNDVTQHVSYVASKEPVRSFAVKKQKELAAEKGIVMDGRDIGTVVLPDADLKVYMIASVEERAERRYKDNQLR
GIESNFEDLKRDIEARDQYDMNREISPLRKADDAVTLDTTGKSIEEVTDEILAMVSQIK
>Q5XCU3 2.7.4.25~~~cmk~~~Cytidylate kinase~~~
MKAIKIAIDGPASSGKSTVAKIIAKNLGYTYLDTGAMYRSATYIALTHGYTDKEVALILEELEKNPISFKKAKDGSQLVF
LGDEDVTLVIRQNDVTNNVSWVSALPEIREELVHQQRRIAQAGGIIMDGRDIGTVVLPDAELKIFLVASVEERAERRYKE
NLEKGIESDFETLKEEIAARDYKDSHRKVSPLKAAEDALIFDTTGVSIDGVVQFIQEKAEKIVDMS
>Q97PK6 2.7.4.25~~~cmk~~~Cytidylate kinase~~~COG0283
MKTIQIAIDGPASSGKSTVAKIIAKDFGFTYLDTGAMYRAATYMALKNQLGVEEVEALLALLDQHPISFGRSETGDQLVF
VGDVDITHPIRENEVTNHVSAIAAIPQVREKLVSLQQEIAQQGGIVMDGRDIGTVVLPQAELKIFLVASVDERAERRYKE
NIAKGIETDLETLKKEIAARDYKDSHRETSPLKQAEDAVYLDTTGLNIQEVVEKIKAEAEKRM
>Q5SL35 2.7.4.25~~~cmk~~~Cytidylate kinase~~~COG0283
MRGIVTIDGPSASGKSSVARRVAAALGVPYLSSGLLYRAAAFLALRAGVDPGDEEGLLALLEGLGVRLLAQAEGNRVLAD
GEDLTSFLHTPEVDRVVSAVARLPGVRAWVNRRLKEVPPPFVAEGRDMGTAVFPEAAHKFYLTASPEVRAWRRARERPQA
YEEVLRDLLRRDERDKAQSAPAPDALVLDTGGMTLDEVVAWVLAHIRR
>B1JRD8 2.7.4.25~~~cmk~~~Cytidylate kinase~~~
MTAIAPVITVDGPSGAGKGTLCKALAESLNWRLLDSGAIYRVLALAALHHQVDISTEEALVPLAAHLDVRFVSQNGQLQV
ILEGEDVSNEIRTETVGNTASQAAAFPRVREALLRRQRAFREAPGLIADGRDMGTIVFPDAPVKIFLDASSQERAHRRML
QLQERGFNVNFERLLAEIQERDNRDRNRSVAPLVPAADALVLDSTSMSIEQVIEQALAYAQRILALPLKK
>Q1JUQ0 4.2.1.43~~~araD~~~L-2-keto-3-deoxyarabonate dehydratase~~~
MTSSSTPRHRGIFPVVPTTFADTGELDLASQKRAVDFMIDAGSDGLCILANFSEQFAITDDERDVLTRTILEHVAGRVPV
IVTTSHYSTQVCAARSLRAQQLGAAMVMAMPPYHGATFRVPEAQIFEFYARVSDAIAIPIMVQDAPASGTALSAPFLARM
AREIEQVAYFKIETPGAANKLRELIRLGGDAIEGPWDGEEAITLLADLHAGATGAMTGGGFPDGIRPILEAWREGRHDDA
YARYQAWLPLINHENRQSGILTAKALMREGGVIASERPRHPMPELHPDTRAELLAIARRLDPLVLRWAH
>P9WG37 4.1.1.-~~~kdc~~~Alpha-keto-acid decarboxylase~~~COG3961
MTPQKSDACSDPVYTVGDYLLDRLAELGVSEIFGVPGDYNLQFLDHIVAHPTIRWVGSANELNAGYAADGYGRLRGMSAV
VTTFGVGELSVTNAIAGSYAEHVPVVHIVGGPTKDAQGTRRALHHSLGDGDFEHFLRISREITCAQANLMPATAGREIDR
VLSEVREQKRPGYILLSSDVARFPTEPPAAPLPRYPGGTSPRALSLFTKAAIELIADHQLTVLADLLVHRLQAVKELEAL
LAADVVPHATLMWGKSLLDESSPNFLGIYAGAASAERVRAAIEGAPVLVTAGVVFTDMVSGFFSQRIDPARTIDIGQYQS
SVADQVFAPLEMSAALQALATILTGRGISSPPVVPPPAEPPPAMPARDEPLTQQMVWDRVCSALTPGNVVLADQGTSFYG
MADHRLPQGVTFIGQPLWGSIGYTLPAAVGAAVAHPDRRTVLLIGDGAAQLTVQELGTFSREGLSPVIVVVNNDGYTVER
AIHGETAPYNDIVSWNWTELPSALGVTNHLAFRAQTYGQLDDALTVAAARRDRMVLVEVVLPRLEIPRLLGQLVGSMAPQ
>Q8RHX3 1.4.1.11~~~kdd~~~L-erythro-3,5-diaminohexanoate dehydrogenase~~~COG0604
MKKGCKYGTHRVIEPAGVLPQPAKKISNDMEIFSNEILIDVIALNIDSASFTQIEEEAGHDVEKVKAKIKEIVAERGKMQ
NPVTGSGGMLIGTVEKIGDDLVGKTDLKVGDKIATLVSLSLTPLRIDEIINIKPEIDRVEIKGKAILFESGIYAVLPKDM
PENLALAALDVAGAPAQVAKLVKPCQSVAILGSAGKSGMLCAYEAVKRVGPTGKVIGVVRNDKEKALLQRVSDKVKIVIA
DATKPMDVLHAVLEANDAKEVDVAINCVNVPNTEMSTILPVKEFGIAYFFSMATGFSKAALGAEGVGKDITMIVGNGYTV
DHAAITLEELRESAVLREIFNEIYL
>Q6FFQ1 4.2.1.41~~~~~~Probable 5-dehydro-4-deoxyglucarate dehydratase~~~COG0329
MDALELKNIVSDGLLSFPVTDFDQNGDFNAASYAKRLEWLAPYGASALFAAGGTGEFFSLTGDEYSDVIKTAVDACKGSV
PIIAGAGGPTRQAILQAQEAERLGAHGILLMPHYLTEASQEGLVEHVKQVCNAVNFGVIFYNRSVSKLNVDSLQQLVESC
PNLIGFKDSSGQIDMMTEVVQTLGDRLSYLGGLPTAEIFAAPYKALGSPVYSSAVFNFIPKTAMEFYNALRNDDFATTQR
LIRDFFLPLIKIRNRKSGYAVSMVKAGAKIVGHDAGPVRPPLSDLTPQDYEDLAALIATLGPQ
>Q8UB77 4.2.1.41~~~~~~Probable 5-dehydro-4-deoxyglucarate dehydratase~~~COG0329
MNPEQIKTALGSGLLSFPVTHFDAEGRFAADSYREHVEWLAGYKAPVLFAAGGTGEFFSLKPDEIPTIVAAAKEVAGETA
IVSGCGYGTEIAVDIARSVEKVGADGILLLPHYLIDAPQEGLYAHIKKVCQSVGIGVMVYNRDNSVLQADTLARLCDECP
NLVGFKDGTGDIGLVRQITAKMGDRLMYLGGMPTAELFAEAYLGAGFTTYSSAVFNFVPGLANEFYAALRAGERATCERI
LVDFFYPFMAIRNRAKGYAVSAVKAGVRLQGFNAGPVRAPLKDLTNEEIGMLEALIGTHKRKA
>Q8A712 3.1.3.103~~~~~~2-keto-3-deoxy-D-glycero-D-galacto-9-phosphonononic acid phosphatase~~~COG1778
MKEIKLILTDIDGVWTDGGMFYDQTGNEWKKFNTSDSAGIFWAHNKGIPVGILTGEKTEIVRRRAEKLKVDYLFQGVVDK
LSAAEELCNELGINLEQVAYIGDDLNDAKLLKRVGIAGVPASAPFYIRRLSTIFLEKRGGEGVFREFVEKVLGINLEDFI
AVIQ
>P50845 2.7.1.45~~~kdgK~~~2-dehydro-3-deoxygluconokinase~~~COG0524
MKLDAVTFGESMAMFYANEYGGLHEVSTFSKGLAGAESNVACGLARLGFRMGWMSKVGNDQLGTFILQELKKEGVDVSRV
IRSQDENPTGLLLKSKVKEGDPQVTYYRKNSAASTLTTAEYPRDYFQCAGHLHVTGIPPALSAEMKDFTYHVMNDMRNAG
KTISFDPNVRPSLWPDQATMVHTINDLAGLADWFFPGIAEGELLTGEKTPEGIADYYLKKGASFVAIKLGKEGAYFKTGT
SEGFLEGCRVDRVVDTVGAGDGFAVGVISGILDGLSYKDAVQRGNAIGALQVQAPGDMDGLPTREKLASFLSAQRTVHQK
KGDY
>P45416 2.7.1.45~~~kdgK~~~2-dehydro-3-deoxygluconokinase~~~COG0524
MTTKNIAIIGECMIELSQKGADLNRGFGGDTLNTAVYISRQVKPDALDVHYVTALGTDSFSSEMMASWQKEGVKTDLIQR
LDNKLPGLYFIETDATGERTFYYWRNDAAARYWLESPDADTISQQLAQFDYIYLSGISLAILNQASRARLLTVLRACRAN
GGKVIFDNNYRPRLWQSKEETRQAYSDMLACTDIAFLTLDDEDMLWGELPVDEVLKRTHGAGVMEVVIKRGADACLVSIQ
GEALLEVPAIKLPKEKVVDTTAAGDSFSAGYLSVRLNGGSAQDAAKRGHLTASTVIQYRGAIIPLEAMPA
>P37647 2.7.1.45~~~kdgK~~~2-dehydro-3-deoxygluconokinase~~~COG0524
MSKKIAVIGECMIELSEKGADVKRGFGGDTLNTSVYIARQVDPAALTVHYVTALGTDSFSQQMLDAWHGENVDTSLTQRM
ENRLPGLYYIETDSTGERTFYYWRNEAAAKFWLESEQSAAICEELANFDYLYLSGISLAILSPTSREKLLSLLRECRANG
GKVIFDNNYRPRLWASKEETQQVYQQMLECTDIAFLTLDDEDALWGQQPVEDVIARTHNAGVKEVVVKRGADSCLVSIAG
EGLVDVPAVKLPKEKVIDTTAAGDSFSAGYLAVRLTGGSAEDAAKRGHLTASTVIQYRGAIIPREAMPA
>E0J5J4 2.7.1.45~~~kdgK~~~2-dehydro-3-deoxygluconokinase~~~
MSKKIAVIGECMIELSEKGADVKRGFGGDTLNTSVYIARQVDPAALTVHYVTALGTDSFSQQMLDAWHGENVDTSLTQRM
ENRLPGLYYIETDSTGERTFYYWRNEAAAKFWLESEQSAAICEELANFDYLYLSGISLAILSPTSREKLLSLLRECRANG
GKVIFDNNYRPRLWASKEETQQVYQQMLECTDIAFLTLDDEDALWGQQPVEDVIARTHNAGVKEVVVKRGADSCLVSIAG
EGLVDVPAVKLPKEKVIDTTAAGDSFSAGYLAVRLTGGSAENAAKRGHLTASTVIQYRGAIIPREAMPA
>Q15UF1 2.7.1.45~~~~~~2-dehydro-3-deoxygluconokinase~~~COG0524
MRNPTFIAIGECMVELSVTAQNKLQHSYAGDTYNSLVYAKRWHNELDCYFLSGIGQDSFSTLMTAHWQQHGISDEFALTS
DDHNVGIYAIKNDDSGERHFDYWRKESAATQLMSLIEQSDCESHWPHFDLVYFSGISLGILSDEDKDKLINLITRLKAKG
SKVAFDPNYRPKMWANKAHAIRWLEAAYTVSDIVLPGTEDHHDLLGHASVSEIVNYCKQYDVQELVVKAGKEGVFAFDCG
QALCHVPFTPADRQLDTTAAGDSFAGVYLACRMADKPMKLSIEHASAAAGLVVQHQGAIVEHTIFDAFRQRLANHTVA
>Q53W83 2.7.1.45~~~kdgK~~~2-dehydro-3-deoxygluconokinase~~~
MLEVVTAGEPLVALVPQEPGHLRGKRLLEVYVGGAEVNVAVALARLGVKVGFVGRVGEDELGAMVEERLRAEGVDLTHFR
RAPGFTGLYLREYLPLGQGRVFYYRKGSAGSALAPGAFDPDYLEGVRFLHLSGITPALSPEARAFSLWAMEEAKRRGVRV
SLDVNYRQTLWSPEEARGFLERALPGVDLLFLSEEEAELLFGRVEEALRALSAPEVVLKRGAKGAWAFVDGRRVEGSAFA
VEAVDPVGAGDAFAAGYLAGAVWGLPVEERLRLANLLGASVAASRGDHEGAPYREDLEVLLKATQTFMR
>P0ABN1 2.7.1.107~~~dgkA~~~Diacylglycerol kinase~~~COG0818
MANNTTGFTRIIKAAGYSWKGLRAAWINEAAFRQEGVAVLLAVVIACWLDVDAITRVLLISSVMLVMIVEILNSAIEAVV
DRIGSEYHELSGRAKDMGSAAVLIAIIVAVITWCILLWSHFG
>Q934G3 ~~~kdgM~~~Oligogalacturonate-specific porin KdgM~~~COG1452
MKIKLLTLAVASLVSVNALAVSIDYRHEMQDTAQAGHKDRLLISHRFANGFGLSSEVKWAQSSADKTPNKPFNEQVSNGT
EVVASYVYKFNSVFSIEPGFSLESGSSNNNYRPYLRGRANVTDDLSVALRYRPYFKRNSGNIGKDNTMDKGYTLTGNIDY
TFLKDYTIGYELEYKKGTSGKTILSDNDDYDITHNVKLSYKWDKNWKPYVEVGNVSGSETTDERQTRYRVGVQYSF
>P50844 ~~~kdgR~~~HTH-type transcriptional regulator KdgR~~~COG1609
MKKKTTGHTTIKDVAECAGVSKSTVSRYINGKIDAISPEKVKNIKKAIAELNYRPSKMAQGLKIKKSKLIGFVVADITNP
FSVAAFRGVEEVCDQYGYSIMVCNTDNSPEKEREMLLKLEAHSVEGLILNATGENKDVLRAFAEQQIPTILIDRKLPDLK
LDTVTTDNRWITKEILQKVYSKGYTDVALFTEPISSISPRAERAAVYQEMASVQNVNGLVRLHEIDVKDKEQLKAELRSF
HKEMPEQKKAILALNGLIMLKIISCMEELGLRIPQDIGIAGFDDTEWYKLIGPGITTIAQPSHDMGRTAMERVLKRIEGD
KGAPQTIELEAKVIMRKSL
>P37728 ~~~kdgR~~~HTH-type transcriptional regulator KdgR~~~
MIFNRSVTYSNARLPYSKRSLYTKTRVLFFLKQKILSRVTTKMAIADLDKQPDSVSSVLKVFGILQALGEEREIGITELS
QRVMMSKSTVYRFLQTMKSLGYVAQEGESEKYSLTLKLFELGAKALQNVDLIRSADIQMRELSALTRETIHLGALDEDSI
VYIHKIDSMYNLRMYSRIGRRNPLHSTAIGKVLLAWRDREEVKEILSQVEFKRTTVHTIGSTEELLPQLDLVRQQGYGED
NEEQEEGLRCIAVPVFDRFGVVIAGLSISFPTIRFSEDNKHEYVAMLHTAARNISDQMGYHDYPF
>P76268 ~~~kdgR~~~HTH-type transcriptional regulator KdgR~~~COG1414
MANADLDKQPDSVSSVLKVFGILQALGEEREIGITELSQRVMMSKSTVYRFLQTMKTLGYVAQEGESEKYSLTLKLFELG
ARALQNVDLIRSADIQMRELSRLTKETIHLGALDEDSIVYIHKIDSMYNLRMYSRIGRRNPLYSTAIGKVLLAWRDRDEV
KQILEGVEYKRSTERTITSTEALLPVLDQVREQGYGEDNEEQEEGLRCIAVPVFDRFGVVIAGLSISFPTLRFSEERLQE
YVAMLHTAARKISAQMGYHDYPF
>P50847 ~~~kdgT~~~2-keto-3-deoxygluconate permease~~~
MKIKATIERVPGGMMIIPLFLGAALNTFAPGTAEFFGGFTGALITGTLPILGVFIFCVGATIDFRSSGYIARKGITLLLG
KIGFAALLGVIAAQFIPDDGIQSGFFAGLSVLAIVAVMNETNGGLYLALMNHMGRKEDAGAFAFISTESGPFMTMVTFGV
TGLAAFPWETLAATVIPFLLGCILGNLDHDLRDLFSKVVPAIIPFFAFSLGNTLNFGMLIQSGLLGIFIGVSVVILSGSS
LFLLDRFIARGDGVAGVAASSTAGAAVAVPYALAEANASFAPVAESATAIIATSVIVTSLLTPLATVWVDKKIKQKKRRT
PPPKNQMTIN
>P15701 ~~~kdgT~~~2-keto-3-deoxygluconate permease~~~
MHIKRSIEKIPGGMMLVPLFLGALCHTFAPGAGKYFGSFTNGLISGTVPILAVWFFCMGASIRLSATGTVLRKSGTLVVT
KIAVAWVVAAVASRILPENGVEVGFFAGLSTLALVAAMDMTNGGLYASIMQQYGTKEESGAFVLMSLESGPLMTMVILGT
AGIASFEPHVFVGAVLPFLVGFALGNLDPELRDFFSRAVQTLIPFFAFALGNTIDLSVIGQTGLLGVLLGISVIIITGIP
LIVADKVLGGGDGTAGIAASSSAGAAVATPVLIAEMVPAFKPVAPAATTLVATSVIVTSVLVPIITAMWSKRVKGGDGTV
PKEDAVEEKAEQQRRRIIK
>P0A712 ~~~kdgT~~~2-keto-3-deoxygluconate permease~~~
MQIKRSIEKIPGGMMLVPLFLGALCHTFSPGAGKYFGSFTNGMITGTVPILAVWFFCMGASIKLSATGTVLRKSGTLVVT
KIAVAWVVAAIASRIIPEHGVEVGFFAGLSTLALVAAMDMTNGGLYASIMQQYGTKEEAGAFVLMSLESGPLMTMIILGT
AGIASFEPHVFVGAVLPFLVGFALGNLDPELREFFSKAVQTLIPFFAFALGNTIDLTVIAQTGLLGILLGVAVIIVTGIP
LIIADKLIGGGDGTAGIAASSSAGAAVATPVLIAEMVPAFKPMAPAATSLVATAVIVTSILVPILTSIWSRKVKARAAKI
EILGTVK
>O87681 1.5.99.14~~~kdhA~~~6-hydroxypseudooxynicotine dehydrogenase complex subunit alpha~~~
MKPPSFDYVVADSVEHALRLLADGGDDAKIIAGGQSLVPLLNFRMSRPSLLVDINRVPGLANIRKSDQTIAIGALTRHAK
LTTSKTISQNLPILSEAAAWIAHPQIRNRGTIGGSLAHADAAAELPVVLLALDAYVTAQSLQGERKIPLKELLVSHFVSS
ILPGELIVEVNVPQLPHGSGAAFDEFSRRHGDYAIGGAASIVTLDEQGKCSRARITVLGGGSTAIRCQEAENILIDSTLS
SHDIAAAAHAAVQGLDPVPTVHGSAQYRAQVIRTMVERTLAKALHRARPTKESMDH
>O87682 1.5.99.14~~~kdhB~~~6-hydroxypseudooxynicotine dehydrogenase complex subunit beta~~~
MNAFRLTVEVNGVTHATDVEPRRLLADFLRDDLHLRGTRVGCEHGVCGSCTVLLDGQPVRSCTVLAVQANNSRIETVESL
QKDGQLHPLQRSFSKCHALQCGFCTSGFLMTLKPLYDDEDVTLDATSAREAISGNICRCTGYQQIVEATVDAFHCRDHND
>Q933N0 1.5.99.14~~~kdhC~~~6-hydroxypseudooxynicotine dehydrogenase complex subunit gamma~~~
MMAKAKALIPDNGRAGADEGNRQAWIGQEVLRREDRRLLTGTATFAGDLGVPGQLHMRIVRSTQAHARIVSIDATEAEKT
PGVRMVITSEHTRHLGSVLLEELGYHEIYENIEDFSHPVLAVDKVLYVGQPVVAVLAVDPYLAEDAAELVSIEYEPLPVL
LDPEEALTGKVELFPGRGNEGARIKKAYGDIDRAFAEAEHVIRHKYVTNRHSGVPMEPRAVVVQPDPARDTLFIWGTVHV
HDNRRIIAKMLNLPEVNVRMKHVEIGGSFGVKGGVFPENVVAAWAARTLGVPIKWTEDRVEHMTSTSHAREMVHKLELAL
DAEGRILGMKDEIFHNHGAYFRQAEPLVSDITAGIVFGPYRVPAYDATLHAVFTNKTPVGAYRAPGRYESTFARERIFDL
ACAEIGLSKTEFRRRNLLTAEDLPWTPGLDIVHEPYHFDSGDVVKHFNEALEAANFSEWLEESKRLRADGRKVGVGLGVL
MDKAGLGLFETGGVEVSRAGRVTVKTGGSSVGQGIETVLAQIVAEELQIAPENIDIVHSDTELIPDGVGSWSSRSTVLAG
GAARKAALAVVEKARRLASEMLEADPDDLELTAGSFKVKGTDQQISLYEIAAARDPFTARADNDEPGLAADAVYMNNAMN
YPYGVTLVQIELDPDTGGHRILRFSTSTEAGRVINPLTTRGQIIGAAVQGIGGALYEEFLYEEDGQPITTSFMDYLLPSA
QEMPNVDCFVTEDAKSPDNPFGAKGLGEIGIIAAGAAIASAIDDAIADGVHTDRLPVTPEQIFSRCQGLNKAER
>O86224 2.7.1.166~~~kdkA~~~3-deoxy-D-manno-octulosonic acid kinase~~~COG3642
MHQFQQDNQYFIFNFDRTFEQATEFFQAEFWQKQERVIGSAKGRGITYFLQTEDWFGVNCALRHYYRGGLWGKLNKDRYR
FSALETTRSFAEFHLLQRLYEAGLPVPKPIAARIQKGKLGICYQADILTEKIENAQDLTALLQTQTLPKETWMQIGRLIR
KLHDLQICHTDLNAHNILLQQTEQGQKCWLLDFDKCGEKSADFWKVQNLNRLKRSFEKEVGRMNIQFTEQNWADLTSAYH
Q
>Q8EEB1 2.6.1.109~~~kdnA~~~8-amino-3,8-dideoxy-alpha-D-manno-octulosonate transaminase~~~COG0399
MPGFELFGPEEKQEVADVMEHGFTFRYNFDHMRNDRWKTRDMEQLLCEKMNVKHAHLLSSGTAALQTAMMAAGIGAGDEV
IVPPFTFVASVEAIFMAGAVPIFAEIDETLCLSPEGIEAVITPRTKAINLVHMCGSMAKMDEIKAICKKHNLVLLEDACQ
AIGGSYKGQALGTIGDVGCYSFDSVKTITCGEGGAVITNNTEIYDNAHMFSDHGHDHIGKDRGAESHPIMGLNFRISEMN
AALGLAQLRKLDTIIDIQRKNKKAIKDAMASIPEVSFREIPDPEGDSAGFLSFMLPTEARTQEISKKLAANGVDGCFYWY
VNNWHYLKNWKHIQELKAPAALPITLIADRPDYTQISVPKSDAIMSRTISMLIKLSWTDAQIAERIENIKKAFAQ
>Q8EEB0 1.1.3.48~~~kdnB~~~3-deoxy-alpha-D-manno-octulosonate 8-oxidase~~~COG1454
MSFKNFKVVEKMIFGRGSFVQLDDVLAAQRKADDDFVVFLVDDVHQGKPLEARIPVKAQDLLIWVNVDEEPSTIQIDALT
EQVQAFNGKLPVSVVGLGGGSTMDVAKAVSLMLTNPGGSAMYQGWDLIKKPAVHHIGIPTISGTGAEASRTAVLCGPVRK
LGLNSDYTVFDQIIMDSELIDGVETDQWFYTGMDCYIHCVESLEGTFLNEFSKAYAEKAMDLCRQVYLEDHPEKDDKLMM
ASFMGGMSIAYSQVGACHAVSYGLSYILGYHHGIGNCIAFDVLEEFYPEGVAEFRLMMKKHNITLPKNICKDLPDETIAK
MVAVTKSMGPLWANVYGPTWEEKVTDEMLTALFRRI
>P0A057 ~~~kdpA2~~~Potassium-transporting ATPase potassium-binding subunit 2~~~
MSIVLFLIVFILLSLIVSRYLYSVALNVPSKIDVVFNPIEKLIYQLIGTKLEHMSGKTYIKHFLLFNGLMGGLSFVLLLI
QQWLFLNPNHNLNQSVSLAFNTMASFLTNTNLQHYAGETDLSYLTQMCVITFLMFTSAASGYAVCIAMLRRLTGMTDVIG
NFYQDITRFIVRVLIPFALIISLFLISQGTPQTLKGNLVIETLSGVKQTIAYGPMASLESIKHLGTNGGGFLGANSSTPF
ENPTYWSNYAEALSMMLIPGSLVFLFGRMLKTKLQIHPHAIMIFVAMFVMFIGFLVTCLYFEFAGNPVLHHLGIAGGNME
GKETRFGIGLSALFTTITTAFTTGTVNNMHDSLTPLGGMVPMVLMMLNAVFGGEGVGLMNMLIYVMLTVFICSLMIGKTP
SYLGMKIEGKEMKLIALSFLVHPLLILVFSALAFIVPGASDALTNPQFHGVSQVLYEFTSSSANNGSGFEGLGDNTVFWN
ISTGIVMLLARYIPIVLQILIVSSLVNKKTYQQHTQDVPINNLFFSSVLIIFIILLSGLTFLPDLMLGPIGEQLLLHA
>O32327 ~~~kdpA~~~Potassium-transporting ATPase potassium-binding subunit~~~COG2060
MEILQIAIILIVFVLLCIPIGRYMYKVSEHKKTLLDPVLDKIDGFIYKLSGIQKEEEMNWKQYIFALLMCNAVPAIIGYI
ILRIQAVGIFNPNHVKGMEQGLTFNTIISFLTNTNLQDYAGETGASYLSQMIVITFFMFFAAATGIAVALAFIRALSGKK
KLGNFYVDLVRITTRILLPLSIIVAIFYIGQGVPQTLSANKTVTTIEGKLQNIPLGPVASLEAIKLIGTNGGGFFSANSS
HPFENPTPLTNSVQIITLLLLAGSMVVCFGHMIKKKKQAVAIFAAMMVLLLAGAAICFSAEKAGNPALSRIGLSQSMGNL
EGKEERFGIAGSSLFTTVTTDTSCGAVNNMHDSLTPIGGAVPLINMMLNVIFGGVGVGFMNMIMYAILTVFLCGLMVGRT
PEFLNKKIEGKEIKLVAFAIIVHPFLILMSSALALTTKQGLAGISNPGFHGLTQVLYQFTSSAANNGSGFEGLIDNTMFW
NVSAGVVMFLGRYLSIIILLAVASSFAAKRAVPATQGTFKTDNTIFTVTLIVIIVIIGALTFLPAVALGPISEYLTL
>P03959 ~~~kdpA~~~Potassium-transporting ATPase potassium-binding subunit~~~COG2060
MAAQGFLLIATFLLVLMVLARPLGSGLARLINDIPLPGTTGVERVLFRALGVSDREMNWKQYLCAILGLNMLGLAVLFFM
LLGQHYLPLNPQQLPGLSWDLALNTAVSFVTNTNWQSYSGETTLSYFSQMAGLTVQNFLSAASGIAVIFALIRAFTRQSM
STLGNAWVDLLRITLWVLVPVALLIALFFIQQGALQNFLPYQAVNTVEGAQQLLPMGPVASQEAIKMLGTNGGGFFNANS
SHPFENPTALTNFVQMLAIFLIPTALCFAFGEVMGDRRQGRMLLWAMSVIFVICVGVVMWAEVQGNPHLLALGTDSSINM
EGKESRFGVLVSSLFAVVTTAASCGAVIAMHDSFTALGGMVPMWLMQIGEVVFGGVGSGLYGMMLFVLLAVFIAGLMIGR
TPEYLGKKIDVREMKLTALAILVTPTLVLMGAALAMMTDAGRSAMLNPGPHGFSEVLYAVSSAANNNGSAFAGLSANSPF
WNCLLAFCMFVGRFGVIIPVMAIAGSLVSKKSQAASSGTLPTHGPLFVGLLIGTVLLVGALTFIPALALGPVAEYLS
>P9WKF3 ~~~kdpA~~~Potassium-transporting ATPase potassium-binding subunit~~~COG2060
MSGTSWLQFAALIAVLLLTAPALGGYLAKIYGDEAKKPGDRVFGPIERVIYQVCRVDPGSEQRWSTYALSVLAFSVMSFL
LLYGIARFQGVLPFNPTDKPAVTDHVAFNAAVSFMTNTNWQSYSGEATMSHFTQMTGLAVQNFVSASAGMCVLAALIRGL
ARKRASTLGNFWVDLARTVLRIMFPLSFVVAILLVSQGVIQNLHGFIVANTLEGAPQLIPGGPVASQVAIKQLGTNGGGF
FNVNSAHPFENYTPIGNFVENWAILIIPFALCFAFGKMVHDRRQGWAVLAIMGIIWIGMSVAAMSFEAKGNPRLDALGVT
QQTTVDQSGGNLEGKEVRFGVGASGLWAASTTGTSNGSVNSMHDSYTPLGGMVPLAHMMLGEVSPGGTGVGLNGLLVMAI
LAVFIAGLMVGRTPEYLGKKIQATEMKLVTLYILAMPIALLSFAAASVLISSALASRNNPGPHGLSEILYAYTSGANNNG
SAFAGLTASTWSYDTTIGVAMLIGRFFLIIPVLAIAGSLARKGTTPVTAATFPTHKPLFVGLVIGVVLIVGGLTFFPALA
LGPIVEQLSTQ
>P0A008 7.2.2.6~~~kdpB1~~~Potassium-transporting ATPase ATP-binding subunit 1~~~
MAETTKIFESHLVKQALKDSVLKLYPVYMIKNPIMFVVEVGMLLALGLTIYPDLFHQESVSRLYVFSIFIILLLTLVFAN
FSEALAEGRGKAQANALRQTQTEMKARRIKQDGSYEMIDASDLKKGHIVRVATGEQIPNDGKVIKGLATVDESAITGESA
PVIKESGGDFDNVIGGTSVASDWLEVEITSEPGHSFLDKMIGLVEGATRKKTPNEIALFTLLMTLTIIFLVVILTMYPLA
KFLNFNLSIAMLIALAVCLIPTTIGGLLSAIGIAGMDRVTQFNILAKSGRSVETCGDVNVLILDKTGTITYGNRMADAFI
PVKSSSFERLVKAAYESSIADDTPEGRSIVKLAYKQHIDLPQEVGEYIPFTAETRMSGVKFTTREVYKGAPNSMVKRVKE
AGGHIPVDLDALVKGVSKKGGTPLVVLEDNEILGVIYLKDVIKDGLVERFRELREMGIETVMCTGDNELTAATIAKEAGV
DRFVAECKPEDKINVIREEQAKGHIVAMTGDGTNDAPALAEANVGLAMNSGTMSAKEAANLIDLDSNPTKLMEVVLIGKQ
LLMTRGSLTTFSIANDIAKYFAILPAMFMAAMPAMNHLNIMHLHSPESAVLSALIFNALIIVLLIPIAMKGVKFKGASTQ
TILMKNMLVYGLGGMIVPFIGIKLIDLIIQLFV
>P03960 7.2.2.6~~~kdpB~~~Potassium-transporting ATPase ATP-binding subunit~~~COG2216
MSRKQLALFEPTLVVQALKEAVKKLNPQAQWRNPVMFIVWIGSLLTTCISIAMASGAMPGNALFSAAISGWLWITVLFAN
FAEALAEGRSKAQANSLKGVKKTAFARKLREPKYGAAADKVPADQLRKGDIVLVEAGDIIPCDGEVIEGGASVDESAITG
ESAPVIRESGGDFASVTGGTRILSDWLVIECSVNPGETFLDRMIAMVEGAQRRKTPNEIALTILLIALTIVFLLATATLW
PFSAWGGNAVSVTVLVALLVCLIPTTIGGLLSAIGVAGMSRMLGANVIATSGRAVEAAGDVDVLLLDKTGTITLGNRQAS
EFIPAQGVDEKTLADAAQLASLADETPEGRSIVILAKQRFNLRERDVQSLHATFVPFTAQSRMSGINIDNRMIRKGSVDA
IRRHVEANGGHFPTDVDQKVDQVARQGATPLVVVEGSRVLGVIALKDIVKGGIKERFAQLRKMGIKTVMITGDNRLTAAA
IAAEAGVDDFLAEATPEAKLALIRQYQAEGRLVAMTGDGTNDAPALAQADVAVAMNSGTQAAKEAGNMVDLDSNPTKLIE
VVHIGKQMLMTRGSLTTFSIANDVAKYFAIIPAAFAATYPQLNALNIMCLHSPDSAILSAVIFNALIIVFLIPLALKGVS
YKPLTASAMLRRNLWIYGLGGLLVPFIGIKVIDLLLTVCGLV
>P0A059 ~~~kdpC2~~~Potassium-transporting ATPase KdpC subunit 2~~~
MQTIRKSLGLVLIMFVLCGFIFPLTVTALGQVLFPEQANGSLVKQDGKVIGSKLIGQQWTEPKYFHGRISAVNYNMNANE
VKESGGPASGGSNYGNSNPELKKRVQETIKQEGKKISSDAVTASGSGLDPDITVDNAKQQVKRIAKERNIDASKINHLID
ENKQASPMADDYVNVLKLNITLDKL
>P94606 ~~~kdpC~~~Potassium-transporting ATPase KdpC subunit~~~COG2156
MKYFKSALRLGIVLIIICGLIYPLFITAVGQTVFHNKANGSIVTFKGKEVGSALLGQNFTDKRFFRGRVSSVNYNTYTKN
DSNKDEVASGSQNLAPSNKDLKNRVKKDIDDFLKTHPGVKKDEIPTDLLTSSGSGLDPDISPKAAEIQVPSVSKATGISQ
SKLKQIIKKCTEGRTLGVLGEERVNVLKVNLEVASMLKNSKIGE
>P03961 ~~~kdpC~~~Potassium-transporting ATPase KdpC subunit~~~COG2156
MSGLRPALSTFIFLLLITGGVYPLLTTVLGQWWFPWQANGSLIREGDTVRGSALIGQNFTGNGYFHGRPSATAEMPYNPQ
ASGGSNLAVSNPELDKLIAARVAALRAANPDASASVPVELVTASASGLDNNITPQAAAWQIPRVAKARNLSVEQLTQLIA
KYSQQPLVKYIGQPVVNIVELNLALDKLDE
>P9WKF1 ~~~kdpC~~~Potassium-transporting ATPase KdpC subunit~~~COG2156
MRRQLLPALTMLLVFTVITGIVYPLAVTGVGQLFFGDQANGALLERDGQVIGSAHIGQQFTAAKYFHPRPSSAGDGYDAA
ASSGSNLGPTNEKLLAAVAERVTAYRKENNLPADTLVPVDAVTGSGSGLDPAISVVNAKLQAPRVAQARNISIRQVERLI
EDHTDARGLGFLGERAVNVLRLNLALDRL
>P21865 2.7.13.3~~~kdpD~~~Sensor protein KdpD~~~COG2205
MNNEPLRPDPDRLLEQTAAPHRGKLKVFFGACAGVGKTWAMLAEAQRLRAQGLDIVVGVVETHGRKDTAAMLEGLAVLPL
KRQAYRGRHISEFDLDAALARRPALILMDELAHSNAPGSRHPKRWQDIEELLEAGIDVFTTVNVQHLESLNDVVSGVTGI
QVRETVPDPFFDAADDVVLVDLPPDDLRQRLKEGKVYIAGQAERAIEHFFRKGNLIALRELALRRTADRVDEQMRAWRGH
PGEEKVWHTRDAILLCIGHNTGSEKLVRAAARLASRLGSVWHAVYVETPALHRLPEKKRRAILSALRLAQELGAETATLS
DPAEEKAVVRYAREHNLGKIILGRPASRRWWRRETFADRLARIAPDLDQVLVALDEPPARTINNAPDNRSFKDKWRVQIQ
GCVVAAALCAVITLIAMQWLMAFDAANLVMLYLLGVVVVALFYGRWPSVVATVINVVSFDLFFIAPRGTLAVSDVQYLLT
FAVMLTVGLVIGNLTAGVRYQARVARYREQRTRHLYEMSKALAVGRSPQDIAATSEQFIASTFHARSQVLLPDDNGKLQP
LTHPQGMTPWDDAIAQWSFDKGLPAGAGTDTLPGVPYQILPLKSGEKTYGLVVVEPGNLRQLMIPEQQRLLETFTLLVAN
ALERLTLTASEEQARMASEREQIRNALLAALSHDLRTPLTVLFGQAEILTLDLASEGSPHARQASEIRQHVLNTTRLVNN
LLDMARIQSGGFNLKKEWLTLEEVVGSALQMLEPGLSSPINLSLPEPLTLIHVDGPLFERVLINLLENAVKYAGAQAEIG
IDAHVEGENLQLDVWDNGPGLPPGQEQTIFDKFARGNKESAVPGVGLGLAICRAIVDVHGGTITAFNRPEGGACFRVTLP
QQTAPELEEFHEDM
>P9WGL3 2.7.13.3~~~kdpD~~~Sensor protein KdpD~~~COG2205
MTLLFADLCAIFTPYRWMIEHVTTKRGQLRIYLGAAPGVGKTYAMLGEAHRRLERGTDVVAAVVETHGRNKTAKLLEGIE
MIPPRYVEYRGARFPELDVEAVLRRHPQVVLVDELAHTNTPGSKNPKRWQDVQEILDAGITVISTVNIQHLEGLNDVVEQ
ITGIEQKEKIPDEIVRAADQVELVDITPEALRRRLAHGNVYAAERVDAALSNYFRTGNLTALREIALLWLADQVDAALEK
YRADKKITATWEARERVVVAVTGGPESETLVRRASRIASKSSAELMVVHVIRGDGLAGVSAPQLGRVRELATSLGATMHT
VVGDDVPTALLDFAREMNATQLVVGTSRRSRWARLFDEGIGARTVQEPGGIDVHMVTHPAASRASGWSRVSPRERHIASW
LAALVVPSVICAITVAWLDRFMGIGGESALFFIGVLIVALLGGVAPAALSALLSGMLLNYFLTEPRYTWTIAEPDAAVTE
FVLLAMAVAVAVLVDGAASRTREARRASQEAELLALFAGSVLRGADLATLLQRVRETYSQRAVTMLRVRQGASTGETVAC
VGTNPCRDVDSADTAIEVGDDEFWMLMAGRKLAARDRRVLTAVATQAAGLVKQRELAEEAGQAEAIARADELRRSLLSAV
SHDLRTPLAAAKVAVSSLRTEDVAFSPEDTAELLATIEESIDQLTALVANLLDSSRLAAGVIRPQLRRAYLEEAVQRALV
SIGKGATGFYRSGIDRVKVDVGDAVAMADAGLLERVLANLIDNALRYAPDCVVRVNAGRVRERVLINVIDEGPGVPRGTE
EQLFAPFQRPGDHDNTTGVGLGMSVARGFVEAMGGTISATDTPGGGLTVVIDLAAPEDRP
>Q2FWH7 2.7.13.3~~~kdpD~~~Sensor histidine kinase KdpD~~~COG2205
MSNTESLNIGKKRGSLTIYIGYSPGVGKTFEMLSNAIELFQSNVDIKIGYIEPHQRDETNALAEQLPKITTNFTKHGSHH
FQYLDVDRIIEESPTIVLIDELAHTNISRDRHEKRYMDIEEILNHGIDVHTTLNIQHIESLSSQIELMTGVHVKERVPDY
FIMSADVLEVVDISPEQLIKRLKAGKVYKKDRLDVAFSNFFTYAHLSELRTLTLRTVADLMSDKEKVRHNHKTSLKPHIA
VAISGSIYNEAVIKEAFHIAQKEHAKFTAIYIDVFEKNRQYKDSQKQVHQHLMLAKSLGAKVKVVYSQTVALGLDEWCKN
QDVTKLIIGQHIRNKWRDFFNTPLIDHLMSFEHSYKIEIVPIKQIPVELKMNKSPYRPKGKRFAIDMLKMILIQIICVMM
GLWIYQLDKHESSTIILMIFLIGIILLSIWTRSFIIGFLAAIINVFVFNYFFTEPRYTFEVYRFDYPITFIVSILTSILT
SALLKQIKFQYSITKKQLYRTDLLFQFNDSIKQTYTVENLLINAGYQINQLLQQSITIYVINQSKVIKTIPLQNHIDNTT
QQHEQALSWVIKNERQAGATTDTFPGINKWLIPIGTSPIKGILAIDYQSSQVINPYDASILESMLNELSLAVENVTLLKQ
TRESMLQAERQLTHSNFLRSISHDIRTPLTTIMGNLDILVSHSKDMSIIEKEQLLVHSFQESQYLYLLVTNILSLTKLQS
SNVQIKLQPYLVSELVEEIDMILERRHLKKRITVSSSVNLQFIHIDSKLILQALFNLIENAVKHTSTDTKINLSIRYASY
EQIEFAVIDEGPGISLEEQQKIFEPFYTGSNKYFKDNQKESMGLGLYLVQTILHKHQSNLQYKPNQPHGSIFYFNIYTDF
NEGDV
>P21866 ~~~kdpE~~~KDP operon transcriptional regulatory protein KdpE~~~COG0745
MTNVLIVEDEQAIRRFLRTALEGDGMRVFEAETLQRGLLEAATRKPDLIILDLGLPDGDGIEFIRDLRQWSAVPVIVLSA
RSEESDKIAALDAGADDYLSKPFGIGELQARLRVALRRHSATTAPDPLVKFSDVTVDLAARVIHRGEEEVHLTPIEFRLL
AVLLNNAGKVLTQRQLLNQVWGPNAVEHSHYLRIYMGHLRQKLEQDPARPRHFITETGIGYRFML
>P9WGN1 ~~~kdpE~~~Transcriptional regulatory protein KdpE~~~COG0745
MTLVLVIDDEPQILRALRINLTVRGYQVITASTGAGALRAAAEHPPDVVILDLGLPDMSGIDVLGGLRGWLTAPVIVLSA
RTDSSDKVQALDAGADDYVTKPFGMDEFLARLRAAVRRNTAAAELEQPVIETDSFTVDLAGKKVIKDGAEVHLTPTEWGM
LEMLARNRGKLVGRGELLKEVWGPAYATETHYLRVYLAQLRRKLEDDPSHPKHLLTESGMGYRFEA
>Q2FWH6 ~~~kdpE~~~Transcriptional regulatory protein KdpE~~~COG0745
MQSKILIIEDDHAITHLLDVALTLDYYNVTTADNATQAHFKIQIDKPDVILLDLGLPDKDGLCLISEIRQHTDIPIIVIS
ARQEEQTIIQALDNGANDYMTKPFNVDELRARIRVIERIAKSHQETNIVFTNGLLSIDFGSKSVVINNQEVHLTPNEFSL
LELLSNHKGKVLTYEMILKRIYGYVNKTEMPSLRVHMTSLRQKLSQCHEDAKDIIKTHPRIGYQMLQWKEK
>P36937 ~~~kdpF~~~Potassium-transporting ATPase KdpF subunit~~~
MSAGVITGVLLVFLLLGYLVYALINAEAF
>Q1NEI6 1.1.1.401~~~LRA5~~~2-dehydro-3-deoxy-L-rhamnonate dehydrogenase (NAD(+))~~~COG1028
MSVFAGRYAGRCAIVTGGASGLGKQVAARIIAEGGAVALWDLNGDALAATQAEIDATHVVALDVSDHAAVAAAAKDSAAA
LGKVDILICSAGITGATVPVWEFPVDSFQRVIDINLNGLFYCNREVVPFMLENGYGRIVNLASVAGKEGNPNASAYSASK
AGVIGFTKSLGKELAGKGVIANALTPATFESPILDQLPQSQVDYMRSKIPMGRLGLVEESAAMVCFMASEECSFTTASTF
DTSGGRTTF
>P0DOW0 1.1.1.401~~~~~~2-dehydro-3-deoxy-L-rhamnonate dehydrogenase (NAD(+))~~~
MKTLTWTAKETMSILSAPAPVPEPGWIALRVAGVGICGSELSGYLGHNELRKPPLVMGHEFSGVVEEVGHGVTNVKIGDL
VTANPLVTCGRCIHCLRGERQRCESRRIIGIDFPGAYAERVLVPSNQCYAVKDAIDGALVEPLACAVRAVGLARIKVGDT
AVVIGAGIIGLMTVRLLGLSGAKRIAVVDPNDERLKISQLWGATEMAPNLGALLTDNHPQSFDCVIDAVGLSTTRRDSLN
ALIRGGRAVWIGLHEALTHLDGNQIVRDELEVRGSFCYTDDEFIRAVSLINSQKFLPVDRQWLDVRSLEEGPAAFKELVN
GSPFSKIILTF
>B7H226 2.5.1.55~~~kdsA~~~2-dehydro-3-deoxyphosphooctonate aldolase~~~
MSQLKPQEVVRLGDIQMANHLPFVLFGGMNVLESKDLAFEIAETYIDICKRLDIPYVFKASFDKANRSSLHSFRGPGLEK
GIEWLGDIKKHFNVPIITDVHEPYQAAPVAEVADIIQLPAFLSRQTDLVEAMAKTQAIINIKKAQFLAPHEMRHILHKCL
EAGNDKLILCERGSAFGYNNLVVDMLGFDIMKEMNVPVFFDVTHALQTPGGRSDSAGGRRAQITTLARAGMATGLAGLFL
ESHPDPDKAKCDGPSALRLSQLEPFLAQLKELDTLVKGFKKLDTH
>O66496 2.5.1.55~~~kdsA~~~2-dehydro-3-deoxyphosphooctonate aldolase~~~COG2877
MEKFLVIAGPCAIESEELLLKVGEEIKRLSEKFKEVEFVFKSSFDKANRSSIHSFRGHGLEYGVKALRKVKEEFGLKITT
DIHESWQAEPVAEVADIIQIPAFLCRQTDLLLAAAKTGRAVNVKKGQFLAPWDTKNVVEKLKFGGAKEIYLTERGTTFGY
NNLVVDFRSLPIMKQWAKVIYDATHSVQLPGGLGDKSGGMREFIFPLIRAAVAVGCDGVFMETHPEPEKALSDASTQLPL
SQLEGIIEAILEIREVASKYYETIPVK
>Q8YHF1 2.5.1.55~~~kdsA~~~2-dehydro-3-deoxyphosphooctonate aldolase~~~COG2877
MVTANSTVKVGNVTFSNSAPLALIAGPCQMETRDHAFEMAGRLKEMTDKLGIGLVYKSSFDKANRTSLKAARGIGLEKAL
EVFSDLKKEYGFPVLTDIHTEEQCAAVAPVVDVLQIPAFLCRQTDLLIAAARTGRVVNVKKGQFLAPWDMKNVLAKITES
GNPNVLATERGVSFGYNTLVSDMRALPIMAGLGAPVIFDATHSVQQPGGQGGSTGGQREFVETLARAAVAVGVAGLFIET
HEDPDNAPSDGPNMVPIDKMPALLEKLMAFDRIAKAL
>B4EDA2 2.5.1.55~~~kdsA~~~2-dehydro-3-deoxyphosphooctonate aldolase~~~COG2877
MKLCDFEVGLDQPFFLIAGTCVVESEQMTIDTAGRLKEICEKLNVPFIYKSSYDKANRSSGKSFRGLGMDEGLRILSEVK
RQLGLPVLTDVHSIDEIEQVASVVDVLQTPAFLCRQTDFIHACARSGKPVNIKKGQFLAPHDMKNVIDKARDAAREAGLS
EDRFMACERGVSFGYNNLVSDMRSLAIMRETNAPVVFDATHSVQLPGGQGTSSGGQREFVPVLARAAVATGVAGLFMETH
PNPAEAKSDGPNAVPLNRMGALLETLVTLDQAVKRNPFLENDFN
>P0A715 2.5.1.55~~~kdsA~~~2-dehydro-3-deoxyphosphooctonate aldolase~~~COG2877
MKQKVVSIGDINVANDLPFVLFGGMNVLESRDLAMRICEHYVTVTQKLGIPYVFKASFDKANRSSIHSYRGPGLEEGMKI
FQELKQTFGVKIITDVHEPSQAQPVADVVDVIQLPAFLARQTDLVEAMAKTGAVINVKKPQFVSPGQMGNIVDKFKEGGN
EKVILCDRGANFGYDNLVVDMLGFSIMKKVSGNSPVIFDVTHALQCRDPFGAASGGRRAQVAELARAGMAVGLAGLFIEA
HPDPEHAKCDGPSALPLAKLEPFLKQMKAIDDLVKGFEELDTSK
>P45251 2.5.1.55~~~kdsA~~~2-dehydro-3-deoxyphosphooctonate aldolase~~~COG2877
MQNKIVKIGNIDVANDKPFVLFGGMNVLESRDMAMQVCEAYVKVTEKLGVPYVFKASFDKANRSSIHSYRGPGMEEGLKI
FQELKDTFGVKIITDVHEIYQCQPVADVVDIIQLPAFLARQTDLVEAMAKTGAVINVKKPQFLSPSQMGNIVEKIEECGN
DKIILCDRGTNFGYDNLIVDMLGFSVMKKASKGSPVIFDVTHSLQCRDPFGAASSGRRAQVTELARSGLAVGIAGLFLEA
HPNPNQAKCDGPSALPLSALEGFVSQMKAIDDLVKSFPELDTSI
>P56060 2.5.1.55~~~kdsA~~~2-dehydro-3-deoxyphosphooctonate aldolase~~~COG2877
MKTSKTKTPKSVLIAGPCVIESLENLRSIATKLQPLANNERLDFYFKASFDKANRTSLESYRGPGLEKGLEMLQTIKEEF
GYKILTDVHESYQASVAAKVADILQIPAFLCRQTDLIVEVSQTNAIVNIKKGQFMNPKDMQYSVLKALKTRDKSIQSPTY
ETALKNGVWLCERGSSFGYGNLVVDMRSLKIMREFAPVIFDATHSVQMPGGANGKSSGDSSFAPILARAAAAVGIDGLFA
ETHVDPKNALSDGANMLKPDELEQLVTDMLKIQNLF
>Q5ZWA3 2.5.1.55~~~kdsA~~~2-dehydro-3-deoxyphosphooctonate aldolase~~~COG2877
MRLCGFEAGLDKPLFLIAGPCVIESEELALETAGYLKEMCSQLNIPFIYKSSFDKANRSSISSYRGPGFEKGLSILEKVK
SQIGVPVLTDVHEDTPLFEVSSVVDVLQTPAFLCRQTNFIQKVAAMNKPVNIKKGQFLAPWEMKHVIAKAKAQGNEQIMA
CERGVSFGYNNLVSDMRSLVIMRETGCPVVYDATHSVQLPGGNNGVSGGQREFIPALARAAVAVGISGLFMETHPDPDKA
LSDGPNSWPLDKMKQLLESLKAADEVYKKYSTDF
>Q9JZ55 2.5.1.55~~~kdsA~~~2-dehydro-3-deoxyphosphooctonate aldolase~~~
MDIKINDITLGNNSPFVLFGGINVLESLDSTLQTCAHYVEVTRKLGIPYIFKASFDKANRSSIHSYRGVGLEEGLKIFEK
VKAEFGIPVITDVHEPHQCQPVAEVCDVIQLPAFLARQTDLVVAMAKTGNVVNIKKPQFLSPSQMKNIVEKFHEAGNGKL
ILCERGSSFGYDNLVVDMLGFGVMKQTCGNLPVIFDVTHSLQTRDAGSAASGGRRAQALDLALAGMATRLAGLFLESHPD
PKLAKCDGPSALPLHLLEDFLIRIKALDDLIKSQPILTIE
>Q9ZFK4 2.5.1.55~~~kdsA~~~2-dehydro-3-deoxyphosphooctonate aldolase~~~
MAQKIVRVGDIQIGNDLPFVLFGGMNVLESRDLAMQVCEEYVRVTEKLGIPYVFKASFDKANRSSIHSFRGPGLEEGMKI
FEEIKKTFKVPVITDVHEPFQAQPVAEVCDIIQLPAFLSRQTDLVVAMARTNAVINIKKAQFLAPQEMKHILTKCEEAGN
DRLILCERGSSFGYNNLVVDMLGFGIMKQFEYPVFFDVTHALQMPGGRADSAGGRRAQVTDLAKAGLSQKLAGLFLEAHP
DPEHAKCDGPCALRLNKLEAFLSQLKQLDELIKSFPAIETA
>A5F692 2.5.1.55~~~kdsA~~~2-dehydro-3-deoxyphosphooctonate aldolase~~~COG2877
MEHKIVHVGDIPVANDKPFTLFAGMNVLESRDLAMQICEHYVKVTDKLGIPYVFKASFDKANRSSVHSYRGPGLEEGMKI
FQELKETFGVKIITDVHTEAQAQPVADVVDVIQLPAFLARQTDLVEAMAKTGAVINVKKPQFMSPGQVGNIVEKFAECGN
DKVILCERGSCHGYDNLVVDMLGFGVMKQASNGSPIIFDVTHSLQMRDPSGAASGGRREQTVELAKAGLATGIAGLFIEA
HPNPDKARCDGPSALPLDKLEPFLAQMKALDDLIKSFAHIDIR
>A3M4Z0 2.7.7.38~~~kdsB~~~3-deoxy-manno-octulosonate cytidylyltransferase~~~
MKHIVIPARFSSSRLPGKPLLLIHDRPMILRVVDQAKKVEGFDDLCVATDDERIAEICRAEGVDVVLTSADHPSGTDRLS
EVARIKGWDADDIIVNVQGDEPLLPAQLVQQVAKLLVDKPNCSMSTLCEPIHALDEFQRDSIVKVVMSKQNEALYFSRAT
IPYDRDGAKRDEPTLHTQAFRHLGLYAYRVSLLQEYVTWEMGKLEKLESLEQLRVLENGHRIAIAVAEANLPPGVDTQAD
LDRLNNMPVESFE
>O66914 2.7.7.38~~~kdsB~~~3-deoxy-manno-octulosonate cytidylyltransferase~~~COG1212
MRRAVIIPARLGSTRLKEKPLKNLLGKPLIRWVVEGLVKTGERVILATDSERVKEVVEDLCEVFLTPSDLPSGSDRVLYV
VRDLDVDLIINYQGDEPFVYEEDIKLIFRELEKGERVVTLARKDKEAYERPEDVKVVLDREGYALYFSRSPIPYFRKNDT
FYPLKHVGIYGFRKETLMEFGAMPPSKLEQIEGLEQLRLLENGIKIKVLITENYYHGVDTEEDLKIVEEKLKNL
>Q83E52 2.7.7.38~~~kdsB~~~3-deoxy-manno-octulosonate cytidylyltransferase~~~COG1212
MEFRVIIPARFDSTRLPGKALVDIAGKPMIQHVYESAIKSGAEEVVIATDDKRIRQVAEDFGAVVCMTSSDHQSGTERIA
EAAVALGFEDDEIIVCLQGDEPLIPPDAIRKLAEDLDEHDNVKVASLCTPITEVDELFNPHSTKVVLNRRNYALYFSHAP
IPWGRDTFSDKENLQLNGSHYRHVGIYAYRVGFLEEYLSWDACPAEKMEALEQLRILWHGGRIHMVVAKSKCPPGVDTEE
DLERVRAYF
>P04951 2.7.7.38~~~kdsB~~~3-deoxy-manno-octulosonate cytidylyltransferase~~~COG1212
MSFVVIIPARYASTRLPGKPLVDINGKPMIVHVLERARESGAERIIVATDHEDVARAVEAAGGEVCMTRADHQSGTERLA
EVVEKCAFSDDTVIVNVQGDEPMIPATIIRQVADNLAQRQVGMATLAVPIHNAEEAFNPNAVKVVLDAEGYALYFSRATI
PWDRDRFAEGLETVGDNFLRHLGIYGYRAGFIRRYVNWQPSPLEHIEMLEQLRVLWYGEKIHVAVAQEVPGTGVDTPEDL
ERVRAEMR
>P44490 2.7.7.38~~~kdsB~~~3-deoxy-manno-octulosonate cytidylyltransferase~~~COG1212
MSFTVIIPARFASSRLPGKPLADIKGKPMIQHVFEKALQSGASRVIIATDNENVADVAKSFGAEVCMTSVNHNSGTERLA
EVVEKLAIPDNEIIVNIQGDEPLIPPVIVRQVADNLAKFNVNMASLAVKIHDAEELFNPNAVKVLTDKDGYVLYFSRSVI
PYDRDQFMNLQDVQKVQLSDAYLRHIGIYAYRAGFIKQYVQWAPTQLENLEKLEQLRVLYNGERIHVELAKEVPAVGVDT
AEDLEKVRAILAAN
>Q9HZM5 2.7.7.38~~~kdsB~~~3-deoxy-manno-octulosonate cytidylyltransferase~~~
MTQAFTVVIPARYASTRLPGKPLQDIAGQPMIQRVWNQARKSAASRVVVATDDERILAACQGFGAEALLTRAEHNSGTDR
LEEVASRLGLASDAIVVNVQGDEPLIPPALIDQVAANLAAHPEAAIATLAEPIHEVSALFNPNVVKVATDIDGLALTFSR
APLPWARDAFARDRDSLPEGVPYRRHIGIYAYRVGFLADFVAWGPCWLENAESLEQLRALWHGVRIHVADARENMLPGVD
TPEDLERVRRVLGG
>Q8EEA9 2.7.7.90~~~kdsB~~~8-amino-3,8-dideoxy-manno-octulosonate cytidylyltransferase~~~COG1212
MNVTLLIPARYGSSRFPGKPLAPINGKPMIQHVYERASLAKGLTNIYVATDDERIKSAVEGFGGKVVMTSPDAASGTDRI
NDAINQLGLKDDDLVINLQGDQPLIDPTSIEQVISLFERHPGEFEMATLGYEIVNKAELDDPMHVKMVFDNDYYALYFSR
ARIPFGRDTKDYPVYKHLGVYAYTRRFVQAFAALPLGRLEDLEKLEQLRALEYGHKIKVAISAFDSIEVDTPEDIRKCEQ
RLAVD
>Q9KQX2 2.7.7.38~~~kdsB~~~3-deoxy-manno-octulosonate cytidylyltransferase~~~COG1212
MSFTVVIPARYQSTRLPGKPLADIGGKPMIQWVYEQAMQAGADRVIIATDDERVEQAVQAFGGVVCMTSPNHQSGTERLA
EVVAKMAIPADHIVVNVQGDEPLIPPAIIRQVADNLAACSAPMATLAVEIEDEAEVFNPNAVKVITDKSGYALYFSRATI
PWDRDNFAKADKAIVQPLLRHIGIYAYRAGFINTYLDWQPSQLEKIECLEQLRVLWHGEKIHVAVALEAPPAGVDTPEDL
EVVRRIVAERAQ
>Q8ZGA4 2.7.7.38~~~kdsB~~~3-deoxy-manno-octulosonate cytidylyltransferase~~~COG1212
MSFIAIIPARYASTRLPGKPLADIAGKPMVVHVMERALASGADRVIVATDHPDVVKAVEAAGGEVCLTRADHQSGTERLA
EVIEHYGFADDDIIVNVQGDEPLVPPVIIRQVADNLAACSAGMATLAVPIASSEEAFNPNAVKVVMDAQGYALYFSRATI
PWERERFAQSKETIGDCFLRHIGIYAYRAGFIRRYVNWAPSQLEQIELLEQLRVLWYGEKIHVAVAKAVPAVGVDTQSDL
DRVRAIMLNQ
>A0A140N5J7 3.1.3.45~~~kdsC~~~3-deoxy-D-manno-octulosonate 8-phosphate phosphatase KdsC~~~COG1778
MSKAGASLATCYGPVSADVMAKAENIRLLILDVDGVLSDGLIYMGNNGEELKAFNVRDGYGIRCALTSDIEVAIITGRKA
KLVEDRCATLGITHLYQGQSNKLIAFSDLLEKLAIAPENVAYVGDDLIDWPVMEKVGLSVAVADAHPLLIPRADYVTRIA
GGRGAVREVCDLLLLAQGKLDEAKGQSI
>P67653 3.1.3.45~~~kdsC~~~3-deoxy-D-manno-octulosonate 8-phosphate phosphatase KdsC~~~COG1778
MSKAGASLATCYGPVSADVMAKAENIRLLILDVDGVLSDGLIYMGNNGEELKAFNVRDGYGIRCALTSDIEVAIITGRKA
KLVEDRCATLGITHLYQGQSNKLIAFSDLLEKLAIAPENVAYVGDDLIDWPVMEKVGLSVAVADAHPLLIPRADYVTRIA
GGRGAVREVCDLLLLAQGKLDEAKGQSI
>P45314 3.1.3.45~~~~~~3-deoxy-D-manno-octulosonate 8-phosphate phosphatase KdsC~~~COG1778
MQQKLENIKFVITDVDGVLTDGQLHYDANGEAIKSFHVRDGLGIKMLMDADIQVAVLSGRDSPILRRRIADLGIKLFFLG
KLEKETACFDLMKQAGVTAEQTAYIGDDSVDLPAFAACGTSFAVADAPIYVKNAVDHVLSTHGGKGAFREMSDMILQAQG
KSSVFDTAQGFLKSVKSMGQ
>Q8Z3G5 3.1.3.45~~~kdsC~~~3-deoxy-D-manno-octulosonate 8-phosphate phosphatase KdsC~~~COG1778
MSKAGASLATCYGPVSTHVMTKAENIRLLILDVDGVLSDGLIYMGNNGEELKAFNVRDGYGIRCALTSNIEVAIITGRKA
KLVEDRCATLGIVHLYQGQSNKLIAFSDLLEKLAIAPENVAYVGDDLIDWPVMEKVGLSVAVADAHPLLIPRADYVTHIA
GGRGAVREVCDLLLLAQGKLDEAKGQSI
>Q8ZB47 3.1.3.45~~~kdsC~~~3-deoxy-D-manno-octulosonate 8-phosphate phosphatase KdsC~~~COG1778
MSNTAYIDTCYGPVADDVIQRAANIRLLICDVDGVMSDGLIYMGNQGEELKAFNVRDGYGIRCLITSDIDVAIITGRRAK
LLEDRANTLGITHLYQGQSDKLVAYHELLATLQCQPEQVAYIGDDLIDWPVMAQVGLSVAVADAHPLLLPKAHYVTRIKG
GRGAVREVCDLILLAQDKLEGATGLSI
>P45395 5.3.1.13~~~kdsD~~~Arabinose 5-phosphate isomerase KdsD~~~COG0517
MSHVELQPGFDFQQAGKEVLAIERECLAELDQYINQNFTLACEKMFWCKGKVVVMGMGKSGHIGRKMAATFASTGTPSFF
VHPGEAAHGDLGMVTPQDVVIAISNSGESSEITALIPVLKRLHVPLICITGRPESSMARAADVHLCVKVAKEACPLGLAP
TSSTTATLVMGDALAVALLKARGFTAEDFALSHPGGALGRKLLLRVNDIMHTGDEIPHVKKTASLRDALLEVTRKNLGMT
VICDDNMMIEGIFTDGDLRRVFDMGVDVRQLSIADVMTPGGIRVRPGILAVEALNLMQSRHITSVMVADGDHLLGVLHMH
DLLRAGVV
>Q5NGP7 5.3.1.13~~~kdsD~~~Arabinose 5-phosphate isomerase KdsD~~~COG0517
MEISMTSHINNAVETFRLEIETLEKLKNSIDENFEKACEIILENNRDKSRVIITGMGKSGHIGKKMAATFASTGTPAFFV
HPGEAGHGDFGMITKNDVLIAISNSGTSSEIMGLLPMIKHLDIPIIAITSNPKSILARNSNVTLNLHVDKEACPLNLAPT
SSTTATLVLGDALAIALLKAKNFSEKDFAFSHPNGALGRKLILKVENIMRKGNEIPIVKPTDNIRKAILEISDKGVGNTL
VAENNTLLGIFTDGDLRRMFEAESFNSQRAISEVMTKNPKSISKEEMAITALEKMEKYEITSLAVVDNGHNILGIVTMHD
LIKLELR
>Q9HVW0 5.3.1.13~~~kdsD~~~Arabinose 5-phosphate isomerase KdsD~~~
MNMSQNLDFIHSAQRTIGLERDAVDSLLARIGDDFVKACELLLAGKGRVVVVGMGKSGHVGKKIAATLASTGTPSFFVHP
AEASHGDMGMITKDDVVLALSNSGSTAEIVTLLPLIKRLGITLISMTGNPESPLAKAAEVNLDASVGQEACPLNLAPTSS
TTVTLVLGDALAIALLEARGFTAEDFAFSHPGGALGRRLLLKVEDVMHVGEGLPQVLLGTSLTGALMEMTRKGLGMTVVL
DEHGKLAGIFTDGDLRRALDRGIDVRQVTIDQVMTVHGKTVRAEILAAEALKIMEDNKIGALVVVDADDRPVGALNMHDL
LRAGVM
>P0DOV8 4.1.2.58~~~~~~2-dehydro-3,6-dideoxy-6-sulfogluconate aldolase~~~
MLKRQSAPLGTWLMSASASTAEALGYAGFDWLLVDMEHVPIEFRDLWHILQAIQCTGAQPIVRVAANDPVLLKRALDLGS
TNVMVPFVENAEQARAAVSAVKYPPMGTRGFAAVHRASRYGTWKGYGQQANDSVSCILQIETATALANLEEIAAVPGVDA
LFLGPGDLSSVCGHIGNPAHPDIQAMISDAIVRCKAIGMPIGIVGGTPELVGSYLEQGYAFAAVASDMAMMMSKANELLV
ALKGRQAPEAVATAY
>Q8A945 1.1.1.102~~~kdsr~~~3-ketodihydrosphingosine reductase~~~COG4221
MQPQIILITGASSGFGKITAQMLSEQGHIVYGTSRKPSENIGKVRMLVVDVTNSISVRQAVEQIISEQGRMDVLINNAGM
GIGGALELATEEEVSMQMNTNFFGVVNMCKAVLPYMRKARRGKIINISSIGGVMGIPYQGFYSASKFAVEGYSEALALEV
HPFHIKVCLVQPGDFNTGFTDNRNISELTGQNEDYADSFLRSLKIIEKEERNGCHPRKLGAAICKIVARKNPPFRTKVGP
LVQVLFAKSKSWLPDNMMQYALRIFYAIR
>O66663 2.4.99.12~~~kdtA~~~3-deoxy-D-manno-octulosonic acid transferase~~~COG1519
MQFEVLKRFFPKESLKNCKGALWVHTASIGEFNTFLPILKELKREHRILLTYFSPRAREYLKTKSDFYDCLHPLPLDNPF
SVKRFEELSKPKALIVVEREFWPSLIIFTKVPKILVNAYAKGSLIEKILSKKFDLIIMRTQEDVEKFKTFGAKRVFSCGN
LKFICQKGKGIKLKGEFIVAGSIHTGEVEIILKAFKEIKKTYSSLKLILVPRHIENAKIFEKKARDFGFKTSFFENLEGD
VILVDRFGILKELYPVGKIAIVGGTFVNIGGHNLLEPTCWGIPVIYGPYTHKVNDLKEFLEKEGAGFEVKNETELVTKLT
ELLSVKKEIKVEEKSREIKGCYLEKLREFLRGL
>Q45374 2.4.99.12~~~waaA~~~3-deoxy-D-manno-octulosonic acid transferase~~~
MGRGVYTLALRGLAPLIWLWMWRRARRAGGQWELFAPARFGRAGARAPAPLAAPVWVHAVSLGETRAAQPLVQALLERGL
PVLLTHTTATGRAEGERLFGAAIGRGQLQQAWLPYDFPGATRRFLARHAPRCGLLMEREVWPNLLAAARAQGVPMALVSA
RFSASSLRQAGWLGQALREALAGLDRVLAQTDEDGARLCQAGANAYTVTGSLKFDVALPEAQLRVGHAWAGATGRPVIAL
ASTREGEDAMFIEAIGALQAHRAATPRPLILLIPRHPQRFDEAAAQLQAAGLAYARRSAGSGEPGPHIDVLLGDTLGEMP
FYYAAADVAIVGGSFARLGGQNLIEACAAGTPVIVGPHTFNFKDAARDAIAAGAALRAPDARTALDWALQLLAEPARRQA
MSEAARAWTAAHAGATRRTLDALEDWLG
>F0T4D1 2.4.99.12~~~waaA~~~3-deoxy-D-manno-octulosonic acid transferase~~~
MIKGRRTKLHTFLYDCFLIFAFMVGLPRILYKRFVHGKYTKSLGIRFGFKKPEVPGTGPVAWFHGASVGETALLLPLLKR
FMKEYPEWRCVVTSCTESGHENAHRLFGPLGVTTFILPLDLSIIIKPVVRAISPSLLVFSEGDCWLNFIEEAKRLGATAV
IINGKLSANSCKRFTILKRFGRNYFSPVDGFLLQDEQHKARFLQLGVDKEKIQVTGNIKTYTETLSENNQRDYWREKLQL
AQDTELLVLGSVHPKDVEVWLPVVRELRRNLKVLWVPRHIERSKELEALLSKENISYGLWSKEATFAQHDAIIVDAIGWL
KQLYSAADLAFVGGTFDDRIGGHNLLEPLQCGVPLIFGPHIQSQSDLAERLLSMGAGCCLDKTNIVKVITFLLDHPEERA
AYIQKGAMFLHEEKVAFDRTWESFKRYIPCVKI
>Q46222 2.4.99.12~~~waaA~~~3-deoxy-D-manno-octulosonic acid transferase~~~COG1519
MMLRGVHRIFKCFYDVVLVCAFVIALPKLLYKMLVYGKYKKSLAVRFGLKKPHVPGEGPLVWFHGASVGEVRLLLPVLEK
FCEEFPGWRCLVTSCTELGVQVASQVFIPMGATVSILPLDFSIIIKSVVAKLRPSLVVFSEGDCWLNFIEEAKRIGATTL
VINGRISIDSSKRFKFLKRLGKNYFSPVDGFLLQDEVQKQRFLSLGIPEHKLQVTGNIKTYVAAQTALHLERETWRDRLR
LPTDSKLVILGSMHRSDAGKWLPVVQKLIKEGVSVLWVPRHVEKTKDVEESLHRLHIPYGLWSRGANFSYVPVVVVDEIG
LLKQLYVAGDLAFVGGTFDPKIGGHNLLEPLQCEVPLIFGPHITSQSELAQRLLLSGAGLCLDEIEPIIDTVSFLLNNQE
VREAYVQKGKVFVKAETASFDRTWRALKSYIPLYKNS
>B0B9V8 2.4.99.12~~~waaA~~~3-deoxy-D-manno-octulosonic acid transferase~~~
MIRRWLTSRLYDAFLVCAFFVSAPRIFYKVFFHGKYIDSWKIRFGVQKPFVKGEGPLVWFHGASVGEVSLLAPLLNRWRE
EFPEWRFVVTTCSEAGVHTARRLYESLGATVFVLPLDLSCIIKSVVRKLAPDIVIFSEGDCWLHFLTESKRLGAKAFLIN
GKLSEHSCKRFSFLKRLGRNYFAPLDLLILQDELYKQRFMQIGISSDKIHVTGNMKTFIESSLATNRRDFWRAKLQISSQ
DRLIVLGSMHPKDVEVWAEVVSHFHNSSTKILWVPRHLEKLKEHAKLLEKAGILFGLWSQGASFRQYNSLIMDAMGVLKD
IYSAADIAFVGGTFDPSVGGHNLLEPLQKEAPLMFGPYIYSQSVLAERLREKEAGLSVNKETLLDVVTDLLQNEKNRQAY
IEKGKSFLKQEENSFQQTWEILKSQITCMKI
>P0AC75 2.4.99.12~~~waaA~~~3-deoxy-D-manno-octulosonic acid transferase~~~COG1519
MLELLYTALLYLIQPLIWIRLWVRGRKAPAYRKRWGERYGFYRHPLKPGGIMLHSVSVGETLAAIPLVRALRHRYPDLPI
TVTTMTPTGSERVQSAFGKDVQHVYLPYDLPDALNRFLNKVDPKLVLIMETELWPNLIAALHKRKIPLVIANARLSARSA
AGYAKLGKFVRRLLRRITLIAAQNEEDGARFVALGAKNNQVTVTGSLKFDISVTPQLAAKAVTLRRQWAPHRPVWIATST
HEGEESVVIAAHQALLQQFPNLLLILVPRHPERFPDAINLVRQAGLSYITRSSGEVPSTSTQVVVGDTMGELMLLYGIAD
LAFVGGSLVERGGHNPLEAAAHAIPVLMGPHTFNFKDICARLEQASGLITVTDATTLAKEVSSLLTDADYRSFYGRHAVE
VLYQNQGALQRLLQLLEPYLPPKTH
>P44806 2.4.99.12~~~waaA~~~3-deoxy-D-manno-octulosonic acid transferase~~~COG1519
MWRFFYTSLLLICQPLILCFIGLLSVKSPRYRQRLAERYGFYGNASCPPPQGIFIHAASVGEVIAATPLVRQLQQDYPHL
SITFTTFTPTGSERVKATFGDSVFHYYLPLDLPFSIHRFINFVQPKLCIVMETELWPNLIHQLFLRNIPFVIANARLSAR
SAHRYGKIKAHLQTMWSQISLIAAQDNISGKRYATLGYPKEKLNITGNIKYDLNTNDELLRKIDSLRTLWKQDRPIWIAA
STHNGEDEIILKSHRALLAKYPNLLLLLVPRHPERFNVVADLLKKEKFQFIRRSTNELPNENTQVILGDSMGELMLMYGI
SDIAFVGGSLVKHGGHNPLEPLAFKMPVITGKHTFNFPEIFRMLVEVQGVLEVNSTADALERAVEALLNSKESRERLGNA
GYEVLMENRGALQRLLDLLKPYLERNV
>P50842 1.1.1.127~~~kduD~~~2-dehydro-3-deoxy-D-gluconate 5-dehydrogenase~~~COG1028
MGYLHDAFSLKGKTALVTGPGTGIGQGIAKALAGAGADIIGTSHTSSLSETQQLVEQEGRIFTSFTLDMSKPEAIKDSAA
ELFENRQIDILVNNAGIIHREKAEDFPEENWQHVLNVNLNSLFILTQLAGRHMLKRGHGKIINIASLLSFQGGILVPAYT
ASKHAVAGLTKSFANEWAASGIQVNAIAPGYISTANTKPIRDDEKRNEDILKRIPAGRWGQADDIGGTAVFLASRASDYV
NGHILAVDGGWLSR
>Q05528 1.1.1.127~~~kduD~~~2-dehydro-3-deoxy-D-gluconate 5-dehydrogenase~~~COG1028
MILNTFNLQGKVALITGCDTGLGQGMAVGLAEAGCDIVGVNIVEPKETIEKVTAVGRRFLSLTADMSDISGHAALVEKAV
AEFGKVDILVNNAGIIRREDAIEFSEKNWDDVMNLNIKSVFFMSQTVARQFIKQGHGGKIINIASMLSFQGGIRVPSYTA
SKSAVMGITRLLANEWAKHNINVNAIAPGYMATNNTQQLRADQDRSKEILDRIPAGRWGLPQDLQGPAVFLASSASDYVN
GYTIAVDGGWLAR
>P37769 1.1.1.127~~~kduD~~~2-dehydro-3-deoxy-D-gluconate 5-dehydrogenase~~~COG1028
MILSAFSLEGKVAVVTGCDTGLGQGMALGLAQAGCDIVGINIVEPTETIEQVTALGRRFLSLTADLRKIDGIPALLDRAV
AEFGHIDILVNNAGLIRREDALEFSEKDWDDVMNLNIKSVFFMSQAAAKHFIAQGNGGKIINIASMLSFQGGIRVPSYTA
SKSGVMGVTRLMANEWAKHNINVNAIAPGYMATNNTQQLRADEQRSAEILDRIPAGRWGLPSDLMGPIVFLASSASDYVN
GYTIAVDGGWLAR
>Q838L9 5.3.1.17~~~kduI1~~~4-deoxy-L-threo-5-hexosulose-uronate ketol-isomerase 1~~~COG3717
METRYTHSPADIRHYSTEQLRDEFLVEKVFIPGAISLTYTHNDRMIFGGVTPTTEELEIILDKELGVDYFLERRELGVIN
IGGPGFIEIDGAKETMKKQDGYYIGKETKHVRFSSENPDNPAKFYISCVPAHHKYPNVKISIDEITPMETGDPLTLNQRK
IYQYIHPNVCESCQLQMGYTILEPGSAWNTMPCHTHERRMEAYVYFDMEEDTRIFHMMGKPDETKHLVMSNEQAAISPSW
SIHSGVGTSNYSFIWAMCGENITYTDMDMVAMDQLK
>P50843 5.3.1.17~~~kduI~~~4-deoxy-L-threo-5-hexosulose-uronate ketol-isomerase~~~COG3717
MENRYSVHPEQVKRFTTEELRSHFLMDSLFTENKLTMYYSHEDRVVIGGAAPGQSELKLDAGDFLKTDFFLERREIGIIN
VGQPGAVRVGDDEYVLQTKDFLYIGMGNQDVSFSSLNGEKAKFYFVSACAHKSYPTQKAALSELTPDRLGDDAASNVRSL
YKVIHQDGIKSCQLMMGITMLDQNNNWNTMPAHVHDRRMEAYLYLDLEKDSKVFHFMGQPDETRHLVVGNEQAVLSPAWS
IHSGAGTSNYSFVWAMAGENYTFTDMDLIPMDGLK
>Q05529 5.3.1.17~~~kduI~~~4-deoxy-L-threo-5-hexosulose-uronate ketol-isomerase~~~COG3717
MQVRQSIHSDHARQLDTAGLRREFLIEHIFDADACTMTYSHIDRIIVGGVMPVHQAVTVGEDVGKQLGVSYFLERRELGA
INIGGAGVVSVDGERYAIGHEEAIYIGKGARDIRFTSVDPAKPARFYYNSAPAHTTFPTRKITAAEASPQTIGDDATSNR
RTINKYIVPDVLPTCQLTMGLTKLAEGNLWNTMPCHTHERRMEVYFYFDMDEETAVFHMMGQPQETRHILVHNEQAVISP
SWSIHSGVGTKRYTFIWGMVGENQVFSDMDHVKVSELR
>Q46938 5.3.1.17~~~kduI~~~4-deoxy-L-threo-5-hexosulose-uronate ketol-isomerase~~~COG3717
MDVRQSIHSAHAKTLDTQGLRNEFLVEKVFVADEYTMVYSHIDRIIVGGIMPITKTVSVGGEVGKQLGVSYFLERRELGV
INIGGAGTITVDGQCYEIGHRDALYVGKGAKEVVFASIDTGTPAKFYYNCAPAHTTYPTKKVTPDEVSPVTLGDNLTSNR
RTINKYFVPDVLETCQLSMGLTELAPGNLWNTMPCHTHERRMEVYFYFNMDDDACVFHMMGQPQETRHIVMHNEQAVISP
SWSIHSGVGTKAYTFIWGMVGENQVFDDMDHVAVKDLR
>P41249 ~~~~~~Apokedarcidin~~~
ASAAVSVSPATGLADGATVTVSASGFATSTSATALQCAILADGRGACNVAEFHDFSLSGGEGTTSVVVRRSFTGYVMPDG
PEVGAVDCDTAPGGCEIVVGGNTGEYGNAAISFG
>P03819 ~~~kefC~~~Glutathione-regulated potassium-efflux system protein KefC~~~COG0475
MDSHTLIQALIYLGSAALIVPIAVRLGLGSVLGYLIAGCIIGPWGLRLVTDAESILHFAEIGVVLMLFIIGLELDPQRLW
KLRAAVFGCGALQMVICGGLLGLFCMLLGLRWQVAELIGMTLALSSTAIAMQAMNERNLMVTQMGRSAFAVLLFQDIAAI
PLVAMIPLLATSSASTTMGAFALSALKVAGALVLVVLLGRYVTRPALRFVARSGLREVFSAVALFLVFGFGLLLEEVGLS
MAMGAFLAGVLLASSEYRHALESDIEPFKGLLLGLFFIGVGMSIDFGTLLENPLRIVILLLGFLIIKIAMLWLIARPLQV
PNKQRRWFAVLLGQGSEFAFVVFGAAQMANVLEPEWAKSLTLAVALSMAATPILLVILNRLEQSSTEEAREADEIDEEQP
RVIIAGFGRFGQITGRLLLSSGVKMVVLDHDPDHIETLRKFGMKVFYGDATRMDLLESAGAAKAEVLINAIDDPQTNLQL
TEMVKEHFPHLQIIARARDVDHYIRLRQAGVEKPERETFEGALKTGRLALESLGLGPYEARERADVFRRFNIQMVEEMAM
VENDTKARAAVYKRTSAMLSEIITEDREHLSLIQRHGWQGTEEGKHTGNMADEPETKPSS
>P0A754 1.6.5.2~~~kefF~~~Glutathione-regulated potassium-efflux system ancillary protein KefF~~~COG2249
MILIIYAHPYPHHSHANKRMLEQARTLEGVEIRSLYQLYPDFNIDIAAEQEALSRADLIVWQHPMQWYSIPPLLKLWIDK
VFSHGWAYGHGGTALHGKHLLWAVTTGGGESHFEIGAHPGFDVLSQPLQATAIYCGLNWLPPFAMHCTFICDDETLEGQA
RHYKQRLLEWQEAHHG
>A0A1L7NQ96 5.1.3.-~~~DAE~~~Ketose 3-epimerase~~~
MKIGCHGLVWTGHFDAEGIRYSVQKTREAGFDLVEFPLMDPFSFDVQTAKSALAEHGLAASASLGLSDATDVSSEDPAVV
KAGEELLNRAVDVLAELGATDFCGVIYSAMKKYMEPATAAGLANSKAAVGRVADRASDLGINVSLEVVNRYETNVLNTGR
QALAYLEELNRPNLGIHLDTYHMNIEESDMFSPILDTAEALRYVHIGESHRGYLGTGSVDFDTFFKALGRIGYDGPVVFE
SFSSSVVAPDLSRMLGIWRNLWADNEELGAHANAFIRDKLTAIKTIELH
>A0R2B1 2.2.1.5~~~kgd~~~Multifunctional 2-oxoglutarate metabolism enzyme~~~COG0508
MSSSPSPFGQNEWLVEEMYRKFRDDPSSVDPSWHEFLVDYSPEPTTDSASNGRTTTAAPVTPPTPAPAPAPEPKAAPKPA
AKTEAKPAKPAKSATPAKGDESQILRGAAAAVVKNMNASLEVPTATSVRAIPAKLMIDNRVVINNHLKRTRGGKISFTHL
LGYAIVQAVKKFPNMNRHFAVVDGKPTAITPAHTNLGLAIDLQGKDGNRSLVVAAIKRCETMRFGQFIAAYEDIVRRARD
GKLTAEDFSGVTISLTNPGTLGTVHSVPRLMQGQGAIIGAGAMEYPAEFQGASEERIADLGIGKLITLTSTYDHRIIQGA
ESGDFLRTIHQLLLDDDFFDEIFRELGIPYEPVRWRTDNPDSIEDKNARVIELIAAYRNRGHLMADIDPLRLDNTRFRSH
PDLDVNSHGLTLWDLDREFKVDGFAGVQRKKLRDILSVLRDAYCRHVGVEYTHILEPEQQRWIQERVETKHDKPTVAEQK
YILSKLNAAEAFETFLQTKYVGQKRFSLEGAETVIPMMDAVIDQCAEHGLDEVVIAMPHRGRLNVLANIVGKPYSQIFSE
FEGNLNPSQAHGSGDVKYHLGATGTYIQMFGDNDIEVSLTANPSHLEAVDPVLEGLVRAKQDLLDTGEEGSDNRFSVVPL
MLHGDAAFAGQGVVAETLNLALLRGYRTGGTIHIVVNNQIGFTTAPTDSRSSEYCTDVAKMIGAPIFHVNGDDPEACAWV
ARLAVDFRQAFKKDVVIDMLCYRRRGHNEGDDPSMTQPYMYDVIDTKRGSRKAYTEALIGRGDISMKEAEDALRDYQGQL
ERVFNEVRELEKHEIEPSESVEADQQIPSKLATAVDKAMLQRIGDAHLALPEGFTVHPRVRPVLEKRREMAYEGRIDWAF
AELLALGSLIAEGKLVRLSGQDTQRGTFTQRHAVIVDRKTGEEFTPLQLLATNPDGTPTGGKFLVYNSALSEFAAVGFEY
GYSVGNPDAMVLWEAQFGDFVNGAQSIIDEFISSGEAKWGQLSDVVLLLPHGHEGQGPDHTSGRIERFLQLWAEGSMTIA
MPSTPANYFHLLRRHGKDGIQRPLIVFTPKSMLRNKAAVSDIRDFTESKFRSVLEEPMYTDGEGDRNKVTRLLLTSGKIY
YELAARKAKENREDVAIVRIEQLAPLPRRRLAETLDRYPNVKEKFWVQEEPANQGAWPSFGLTLPEILPDHFTGLKRISR
RAMSAPSSGSSKVHAVEQQEILDTAFG
>P9WIS5 2.2.1.5~~~kgd~~~Multifunctional 2-oxoglutarate metabolism enzyme~~~COG0508
MANISSPFGQNEWLVEEMYRKFRDDPSSVDPSWHEFLVDYSPEPTSQPAAEPTRVTSPLVAERAAAAAPQAPPKPADTAA
AGNGVVAALAAKTAVPPPAEGDEVAVLRGAAAAVVKNMSASLEVPTATSVRAVPAKLLIDNRIVINNQLKRTRGGKISFT
HLLGYALVQAVKKFPNMNRHYTEVDGKPTAVTPAHTNLGLAIDLQGKDGKRSLVVAGIKRCETMRFAQFVTAYEDIVRRA
RDGKLTTEDFAGVTISLTNPGTIGTVHSVPRLMPGQGAIIGVGAMEYPAEFQGASEERIAELGIGKLITLTSTYDHRIIQ
GAESGDFLRTIHELLLSDGFWDEVFRELSIPYLPVRWSTDNPDSIVDKNARVMNLIAAYRNRGHLMADTDPLRLDKARFR
SHPDLEVLTHGLTLWDLDRVFKVDGFAGAQYKKLRDVLGLLRDAYCRHIGVEYAHILDPEQKEWLEQRVETKHVKPTVAQ
QKYILSKLNAAEAFETFLQTKYVGQKRFSLEGAESVIPMMDAAIDQCAEHGLDEVVIGMPHRGRLNVLANIVGKPYSQIF
TEFEGNLNPSQAHGSGDVKYHLGATGLYLQMFGDNDIQVSLTANPSHLEAVDPVLEGLVRAKQDLLDHGSIDSDGQRAFS
VVPLMLHGDAAFAGQGVVAETLNLANLPGYRVGGTIHIIVNNQIGFTTAPEYSRSSEYCTDVAKMIGAPIFHVNGDDPEA
CVWVARLAVDFRQRFKKDVVIDMLCYRRRGHNEGDDPSMTNPYVYDVVDTKRGARKSYTEALIGRGDISMKEAEDALRDY
QGQLERVFNEVRELEKHGVQPSESVESDQMIPAGLATAVDKSLLARIGDAFLALPNGFTAHPRVQPVLEKRREMAYEGKI
DWAFGELLALGSLVAEGKLVRLSGQDSRRGTFSQRHSVLIDRHTGEEFTPLQLLATNSDGSPTGGKFLVYDSPLSEYAAV
GFEYGYTVGNPDAVVLWEAQFGDFVNGAQSIIDEFISSGEAKWGQLSNVVLLLPHGHEGQGPDHTSARIERFLQLWAEGS
MTIAMPSTPSNYFHLLRRHALDGIQRPLIVFTPKSMLRHKAAVSEIKDFTEIKFRSVLEEPTYEDGIGDRNKVSRILLTS
GKLYYELAARKAKDNRNDLAIVRLEQLAPLPRRRLRETLDRYENVKEFFWVQEEPANQGAWPRFGLELPELLPDKLAGIK
RISRRAMSAPSSGSSKVHAVEQQEILDEAFG
>P72194 3.4.22.47~~~kgp~~~Lys-gingipain 381~~~
MRKLLLLIAASLLGVGLYAQSAKIKLDAPTTRTTCTNNSFKQFDASFSFNEVELTKVETKGGTFASVSIPGAFPTGEVGS
PEVPAVRKLIAVPVGATPVVRVKSFTEQVYSLNQYGSEKLMPHQPSMSKSDDPEKVPFVYNAAAYARKGFVGQELTQVEM
LGTMRGVRIAALTINPVQYDVVANQLKVRNNIEIEVSFQGADEVATQRLYDASFSPYFETAYKQLFNRDVYTDHGDLYNT
PVRMLVVAGAKFKEALKPWLTWKAQKGFYLDVHYTDEAEVGTTNASIKAFIHKKYNDGLAASAAPVFLALVGDTDVISGE
KGKKTKKVTDLYYSAVDGDYFPEMYTFRMSASSPEELTNIIDKVLMYEKATMPDKSYLEKALLIAGADSYWNPKIGQQTI
KYAVQYYYNQDHGYTDVYSYPKAPYTGCYSHLNTGVGFANYTAHGSETSWADPSLTATQVKALTNKDKYFLAIGNCCVTA
QFDYPQPCFGEVMTRVKEKGAYAYIGSSPNSYWGEDYYWSVGANAVFGVQPTFEGTSMGSYDATFLEDSYNTVNSIMWAG
NLAATHAGNIGNITHIGAHYYWEAYHVLGDGSVMPYRAMPKTNTYTLPASLPQNQASYSIQASAGSYVAISKDGVLYGTG
VANASGVATVNMTKQITENGNYDVVITRSNYLPVIKQIQAGEPSPYQPVSNLTATTQGQKVTLKWDAPSAKKAEASREVK
RIGDGLFVTIEPANDVRANEAKVVLAADNVWGDNTGYQFLLDADHNTFGSVIPATGPLFTGTASSNLYSANFEYLIPANA
DPVVTTQNIIVTGQGEVVIPGGVYDYCITNPEPASGKMWIAGDGGNQPARYDDFTFEAGKKYTFTMRRAGMGDGTDMEVE
DDSPASYTYTVYRDGTKIQEGLTATTFEEDGVAAGNHEYCVEVKYTAGVSPKVCKDVTVEGSNEFAPVQNLTGSAVGQKV
TLKWDAPNGTPNPNPNPNPGTTTLSESFENGIPASWKTIDADGDGHGWKPGNAPGIAGYNSNGCVYSESFGLGGIGVLTP
DNYLITPALDLPNGGKLTFWVCAQDANYASEHYAVYASSTGNDASNFTNALLEETITAKGVRSPEAIRGRIQGTWRQKTV
DLPAGTKYVAFRHFQSTDMFYIDLDEVEIKANGKRADFTETFESSTHGEAPAEWTTIDADGDGQDWLCLSSGQLDWLTAH
GGTNVVASFSWNGMALNPDNYLISKDVTGATKVKYYYAVNDGFPGDHYAVMISKTGTNAGDFTVVFEETPNGINKGGARF
GLSTEANGAKPQSVWIERTVDLPAGTKYVAFRHYNCSDLNYILLDDIQFTMGGSPTPTDYTYTVYRDGTKIKEGLTETTF
EEDGVATGNHEYCVEVKYTAGVSPKVCVNVTINPTQFNPVKNLKAQPDGGDVVLKWEAPSGKRGELLNEDFEGDAIPTGW
TALDADGDGNNWDITLNEFTRGERHVLSPLRASNVAISYSSLLQGQEYLPLTPNNFLITPKVEGAKKITYKVGSPGLPQW
SHDHYALCISKSGTAAADFEVIFEETMTYTQGGANLTREKDLPAGTKYVAFRHYNCTDVLGIMIDDVVITGEGEGPSYTY
TVYRDGTKIQEGLTETTYRDAGMSAQSHEYCVEVKYAAGVSPKVCVDYIPDGVADVTAQKPYTLTVVGKTITVTCQGEAM
IYDMNGRRLAAGRNTVVYTAQGGYYAVMVVVDGKSYVEKLAIK
>P72197 3.4.22.47~~~kgp~~~Lys-gingipain HG66~~~
MRKLLLLIAASLLGVGLYAQNAKIKLDAPTTRTTCTNNSFKQFDASFSFNEVELTKVETKGGTFASVSIPGAFPTGEVGS
PEVPAVRKLIAVPVGATPVVRVKSFTEQVYSLNQYGSEKLMPHQPSMSKSDDPEKVPFAYNAAAYARKGFVGQELTQVEM
LGTMRGVRIAALTINPVQYDVVANQLKVRNNIEIEVSFQGADEVATQRLYDASFSPYFETAYKQLFNRDVYTDHGDLYNT
PVRMLVVAGAKFKEALKPWLTWKAQKGFYLDVHYTDEAEVGTTNASIKAFIHKKYNDGLAASAAPVFLALVGDTDVISGE
KGKKTKKVTDLYYSAVDGDYFPEMYTFRMSASSPEELTNIIDKVLMYEKATMPDKSYLEKALLIAGADSYWNPKIGQQTI
KYAVQYYYNQDHGYTDVYSYPKAPYTGCYSHLNTGVGFANYTAHGSETSWADPSVTATQVKALTNKNKYFLAIGNCCVTA
QFDYPQPCFGEVMTRVKEKGAYAYIGSSPNSYWGEDYYWSVGANAVFGVQPTFEGTSMGSYDATFLEDSYNTVNSIMWAG
NLAATHAENIGNVTHIGAHYYWEAYHVLGDGSVMPYRAMPKTNTYTLPASLPQNQASYSIQASAGSYVAISKDGVLYGTG
VANASGVATVNMTKQITENGNYDVVITRSNYLPVIKQIQAGEPSPYQPVSNLTATTQGQKVTLKWDAPSAKKAEGSREVK
RIGDGLFVTIEPANDVRANEAKVVLAADNVWGDNTGYQFLLDADHNTFGSVIPATGPLFTGTASSNLYSANFEYLIPANA
DPVVTTQNIIVTGQGEVVIPGGVYDYCITNPEPASGKMWIAGDGGNQPARYDDFTFEAGKKYTFTMRRAGMGDGTDMEVE
DDSPASYTYTVYRDGTKIKEGLTATTFEEDGVAAGNHEYCVEVKYTAGVSPKVCKDVTVEGSNEFAPVQNLTGSAVGQKV
TLKWDAPNGTPNPNPNPNPGTTTLSESFENGIPASWKTIDADGDGHGWKPGNAPGIAGYNSNGCVYSESFGLGGIGVLTP
DNYLITPALDLPNGGKLTFWVCAQDANYASEHYAVYASSTGNDASNFTNALLEETITAKGVRSPEAIRGRIQGTWRQKTV
DLPAGTKYVAFRHFQSTDMFYIDLDEVEIKANGKRADFTETFESSTHGEAPAEWTTIDADGDGQGWLCLSSGQLDWLTAH
GGTNVVASFSWNGMALNPDNYLISKDVTGATKVKYYYAVNDGFPGDHYAVMISKTGTNAGDFTVVFEETPNGINKGGARF
GLSTEADGAKPQSVWIERTVDLPAGTKYVAFRHYNCSDLNYILLDDIQFTMGGSPTPTDYTYTVYRDGTKIKEGLTETTF
EEDGVATGNHEYCVEVKYTAGVSPKKCVNVTINPTQFNPVKNLKAQPDGGDVVLKWEAPSAKKAEGSREVKRIGDGLFVT
IEPANDVRANEAKVVLAADNVWGDNTGYQFLLDADHNTFGSVIPATGPLFTGTASSNLYSANFEYLIPANADPVVTTQNI
IVTGQGEVVIPGGVYDYCITNPEPASGKMWIAGDGGNQPARYDDFTFEAGKKYTFTMRRAGMGDGTDMEVEDDSPASYTY
TVYRDGTKIKEGLTETTYRDAGMSAQSHEYCVEVKYAAGVSPKVCVDYIPDGVADVTAQKPYTLTVVGKTITVTCQGEAM
IYDMNGRRLAAGRNTVVYTAQGGYYAVMVVVDGKSYVEKLAVK
>Q51817 3.4.22.47~~~kgp~~~Lys-gingipain W83~~~
MRKLLLLIAASLLGVGLYAQSAKIKLDAPTTRTTCTNNSFKQFDASFSFNEVELTKVETKGGTFASVSIPGAFPTGEVGS
PEVPAVRKLIAVPVGATPVVRVKSFTEQVYSLNQYGSEKLMPHQPSMSKSDDPEKVPFVYNAAAYARKGFVGQELTQVEM
LGTMRGVRIAALTINPVQYDVVANQLKVRNNIEIEVSFQGADEVATQRLYDASFSPYFETAYKQLFNRDVYTDHGDLYNT
PVRMLVVAGAKFKEALKPWLTWKAQKGFYLDVHYTDEAEVGTTNASIKAFIHKKYNDGLAASAAPVFLALVGDTDVISGE
KGKKTKKVTDLYYSAVDGDYFPEMYTFRMSASSPEELTNIIDKVLMYEKATMPDKSYLEKVLLIAGADYSWNSQVGQPTI
KYGMQYYYNQEHGYTDVYNYLKAPYTGCYSHLNTGVSFANYTAHGSETAWADPLLTTSQLKALTNKDKYFLAIGNCCITA
QFDYVQPCFGEVITRVKEKGAYAYIGSSPNSYWGEDYYWSVGANAVFGVQPTFEGTSMGSYDATFLEDSYNTVNSIMWAG
NLAATHAGNIGNITHIGAHYYWEAYHVLGDGSVMPYRAMPKTNTYTLPASLPQNQASYSIQASAGSYVAISKDGVLYGTG
VANASGVATVSMTKQITENGNYDVVITRSNYLPVIKQIQVGEPSPYQPVSNLTATTQGQKVTLKWEAPSAKKAEGSREVK
RIGDGLFVTIEPANDVRANEAKVVLAADNVWGDNTGYQFLLDADHNTFGSVIPATGPLFTGTASSNLYSANFEYLVPANA
DPVVTTQNIIVTGQGEVVIPGGVYDYCITNPEPASGKMWIAGDGGNQPARYDDFTFEAGKKYTFTMRRAGMGDGTDMEVE
DDSPASYTYTVYRDGTKIKEGLTATTFEEDGVAAGNHEYCVEVKYTAGVSPKVCKDVTVEGSNEFAPVQNLTGSSVGQKV
TLKWDAPNGTPNPNPNPNPNPGTTLSESFENGIPASWKTIDADGDGHGWKPGNAPGIAGYNSNGCVYSESFGLGGIGVLT
PDNYLITPALDLPNGGKLTFWVCAQDANYASEHYAVYASSTGNDASNFTNALLEETITAKGVRSPKAIRGRIQGTWRQKT
VDLPAGTKYVAFRHFQSTDMFYIDLDEVEIKANGKRADFTETFESSTHGEAPAEWTTIDADGDGQGWLCLSSGQLDWLTA
HGGSNVVSSFSWNGMALNPDNYLISKDVTGATKVKYYYAVNDGFPGDHYAVMISKTGTNAGDFTVVFEETPNGINKGGAR
FGLSTEANGAKPQSVWIERTVDLPAGTKYVAFRHYNCSDLNYILLDDIQFTMGGSPTPTDYTYTVYRDGTKIKEGLTETT
FEEDGVATGNHEYCVEVKYTAGVSPKKCVDVTVNSTQFNPVQNLTAEQAPNSMDAILKWNAPASKRAEVLNEDFENGIPA
SWKTIDADGDGNNWTTTPPPGGSSFAGHNSAICVSSASHINFEGPQNPDNYLVTPELSLPGGGTLTFWVCAQDANYASEH
YAVYASSTGNDASNFANALLEEVLTAKTVVTAPEAIRGTRAQGTWYQKTVQLPAGTKYVAFRHFGCTDFFWINLDDVVIT
SGNAPSYTYTIYRNNTQIASGVTETTYRDPDLATGFYTYGVKVVYPNGESAIETATLNITSLADVTAQKPYTLTVVGKTI
TVTCQGEAMIYDMNGRRLAAGRNTVVYTAQGGHYAVMVVVDGKSYVEKLAVK
>B2RLK2 3.4.22.47~~~kgp~~~Lys-gingipain~~~COG1974
MRKLLLLIAASLLGVGLYAQSAKIKLDAPTTRTTCTNNSFKQFDASFSFNEVELTKVETKGGTFASVSIPGAFPTGEVGS
PEVPAVRKLIAVPVGATPVVRVKSFTEQVYSLNQYGSEKLMPHQPSMSKSDDPEKVPFVYNAAAYARKGFVGQELTQVEM
LGTMRGVRIAALTINPVQYDVVANQLKVRNNIEIEVSFQGADEVATQRLYDASFSPYFETAYKQLFNRDVYTDHGDLYNT
PVRMLVVAGAKFKEALKPWLTWKAQKGFYLDVHYTDEAEVGTTNASIKAFIHKKYNDGLAASAAPVFLALVGDTDVISGE
KGKKTKKVTDLYYSAVDGDYFPEMYTFRMSASSPEELTNIIDKVLMYEKATMPDKSYLEKALLIAGADSYWNPKIGQQTI
KYAVQYYYNQDHGYTDVYSYPKAPYTGCYSHLNTGVGFANYTAHGSETSWADPSLTATQVKALTNKDKYFLAIGNCCVTA
QFDYPQPCFGEVMTRVKEKGAYAYIGSSPNSYWGEDYYWSVGANAVFGVQPTFEGTSMGSYDATFLEDSYNTVNSIMWAG
NLAATHAGNIGNITHIGAHYYWEAYHVLGDGSVMPYRAMPKTNTYTLPASLPQNQASYSIQASAGSYVAISKDGVLYGTG
VANASGVATVNMTKQITENGNYDVVITRSNYLPVIKQIQAGEPSPYQPVSNLTATTQGQKVTLKWDAPSAKKAEASREVK
RIGDGLFVTIEPANDVRANEAKVVLAADNVWGDNTGYQFLLDADHNTFGSVIPATGPLFTGTASSNLYSANFEYLIPANA
DPVVTTQNIIVTGQGEVVIPGGVYDYCITNPEPASGKMWIAGDGGNQPARYDDFTFEAGKKYTFTMRRAGMGDGTDMEVE
DDSPASYTYTVYRDGTKIQEGLTATTFEEDGVAAGNHEYCVEVKYTAGVSPKVCKDVTVEGSNEFAPVQNLTGSAVGQKV
TLKWDAPNGTPNPNPNPNPGTTTLSESFENGIPASWKTIDADGDGHGWKPGNAPGIAGYNSNGCVYSESFGLGGIGVLTP
DNYLITPALDLPNGGKLTFWVCAQDANYASEHYAVYASSTGNDASNFTNALLEETITAKGVRSPEAIRGRIQGTWRQKTV
DLPAGTKYVAFRHFQSTDMFYIDLDEVEIKANGKRADFTETFESSTHGEAPAEWTTIDADGDGQDWLCLSSGQLDWLTAH
GGTNVVASFSWNGMALNPDNYLISKDVTGATKVKYYYAVNDGFPGDHYAVMISKTGTNAGDFTVVFEETPNGINKGGARF
GLSTEANGAKPQSVWIERTVDLPAGTKYVAFRHYNCSDLNYILLDDIQFTMGGSPTPTDYTYTVYRDGTKIKEGLTETTF
EEDGVATGNHEYCVEVKYTAGVSPKVCVNVTINPTQFNPVKNLKAQPDGGDVVLKWEAPSGKRGELLNEDFEGDAIPTGW
TALDADGDGNNWDITLNEFTRGERHVLSPLRASNVAISYSSLLQGQEYLPLTPNNFLITPKVEGAKKITYKVGSPGLPQW
SHDHYALCISKSGTAAADFEVIFEETMTYTQGGANLTREKDLPAGTKYVAFRHYNCTDVLGIMIDDVVITGEGEGPSYTY
TVYRDGTKIQEGLTETTYRDAGMSAQSHEYCVEVKYAAGVSPKVCVDYIPDGVADVTAQKPYTLTVVGKTITVTCQGEAM
IYDMNGRRLAAGRNTVVYTAQGGYYAVMVVVDGKSYVEKLAIK
>Q1JUP4 1.2.1.26~~~araE~~~Alpha-ketoglutaric semialdehyde dehydrogenase 1~~~
MANVTYTDTQLLIDGEWVDAASGKTIDVVNPATGKPIGRVAHAGIADLDRALAAAQSGFEAWRKVPAHERAATMRKAAAL
VRERADAIAQLMTQEQGKPLTEARVEVLSAADIIEWFADEGRRVYGRIVPPRNLGAQQTVVKEPVGPVAAFTPWNFPVNQ
VVRKLSAALATGCSFLVKAPEETPASPAALLRAFVDAGVPAGVIGLVYGDPAEISSYLIPHPVIRKVTFTGSTPVGKQLA
SLAGLHMKRATMELGGHAPVIVAEDADVALAVKAAGGAKFRNAGQVCISPTRFLVHNSIRDEFTRALVKHAEGLKVGNGL
EEGTTLGALANPRRLTAMASVIDNARKVGASIETGGERIGSEGNFFAPTVIANVPLDADVFNNEPFGPVAAIRGFDKLEE
AIAEANRLPFGLAGYAFTRSFANVHLLTQRLEVGMLWINQPATPWPEMPFGGVKDSGYGSEGGPEALEPYLVTKSVTVMA
V
>Q08IC0 1.2.1.26~~~~~~Alpha-ketoglutaric semialdehyde dehydrogenase 2~~~
MQLTGEMLIGAEAVAGSAGTLRAFDPSKGEPIDAPVFGVAAQADVERACELARDAFDAYRAQPLAARAAFLEAIADEIVA
LGDALIERAHAETGLPVARLQGERGRTVGQLRLFARVVRDGRFLAASIDPAQPARTPLPRSDLRLQKVGLGPVVVFGASN
FPLAFSVAGGDTASALAAGCPVIVKAHEAHLGTSELVGRAIRAAVAKTGMPAGVFSLLVGPGRVIGGALVSHPAVQAVGF
TGSRQGGMALVQIANARPQPIPVYAEMSSINPVVLFPAALAARGDAIATGFVDSLTLGVGQFCTNPGLVLAIDGPDLDRF
ETVAAQALAKKPAGVMLTQGIADAYRNGRGKLAELPGVREIGAGEAAQTDCQAGGALYEVGAQAFLAEPAFSHEVFGPAS
LIVRCRDLDEVARVLEALEGQLTATLQMDADDKPLARRLLPVLERKAGRLLVNGYPTGVEVCDAMVHGGPFPATSNPAVT
SVGATAIERFLRPVCYQDFPDDLLPEGLQESNPLAIPRLRDGKAE
>Q08IB7 1.2.1.26~~~LhpG~~~Alpha-ketoglutaric semialdehyde dehydrogenase 3~~~
MQLTGHLLIGQSAIAGQNGTLHAIAAATGEPLDPAFGGASLHDLDTACALADDAFDTYRDTSLEARAAFLDAIGRHIMAL
GDELIERCVIETGLPRARIEGERGRTVGQLALFASLVRDGGFLDARIDPARPERKPLPRVDLRLRNIAVGPVAVFGASNF
PLAFSVAGGDTASALAAGCPVIVKAHSAHPGTSALVGRAIQQAARECGMPAGVFSLLFDASREIGQALVADPRIKAVGFT
GSRRGGVALMHIAAARPEPIPVYAEMSSINPVLLLPAALDARHDAIAPQFVASLTLGAGQFCTNPGLVLAVDGPALRAFE
EAAAAAVRAAPAQTMLTPHIHASYEQGVAALRDHAAVELLAQGAEGNRLQARAALLATSAEAFITHPELRDEVFGPASLI
VRCPDADTLHRVLKSLEGQLTIAAHLADGDAPLFAALRPLLERKAGRILVNGFGTGVEVGHAMVHGGPFPATSDTRTTSV
GARAIERFLRPVSYQDLPDALLPEAIRSGNPLNVPQRIDGVPAPREANHV
>Q6FFQ0 1.2.1.26~~~~~~Alpha-ketoglutaric semialdehyde dehydrogenase~~~COG1012
MSENNGKQFINGQRVAANAPTIESINATDYQPTGYLFSQATLDEVDQAAQAAYQAFLKYQHTTQQQRADFLDEIAIQIEN
LGSKLQEVAAQETGLPLVRLQGETGRVTGQLRLFAELLRRGDFYGARIDTALPERKPLPRVDLRQYKIGVGPVAVFGASN
FPLAFSTAGGDTVAALAAGCSVVFKAHSGHMATAELVAQAIEKAILNSGIPSGTFNMIFGSRVGANLVEHPLIQAAGFTG
SLEGGMALFNLAQNRPQPIPFFAEMSSVNPVIVMPEALNARGEKVAQDTVASFNMGCGQFCTKPGLIIGIKSPAFDQFVT
ALIDTTRTAVPQIMLNQGTLKSYQQGIDALLNEQGFKCIASGQAPELISQAQPHLFQADQSVLLSGNPKLQHEVFGPMSI
VIAVDDEATLLNGLEKLAGQLTATIIADESDLPQAKELLNLLTRKAGRVLFNGFPTGVEVSDAMVHGGPFPATSDSRGTS
VGTGAIERFLRPVCYQNTSQVLLPDVLKDGNPLHITRLVNGVLTQN
>P42236 1.2.1.26~~~gucD~~~Alpha-ketoglutaric semialdehyde dehydrogenase~~~COG1012
MSVITEQNTYLNFINGEWVKSQSGDMVKVENPADVNDIVGYVQNSTAEDVERAVTAANEAKTAWRKLTGAERGQYLYKTA
DIMEQRLEEIAACATREMGKTLPEAKGETARGIAILRYYAGEGMRKTGDVIPSTDKDALMFTTRVPLGVVGVISPWNFPV
AIPIWKMAPALVYGNTVVIKPATETAVTCAKIIACFEEAGLPAGVINLVTGPGSVVGQGLAEHDGVNAVTFTGSNQVGKI
IGQAALARGAKYQLEMGGKNPVIVADDADLEAAAEAVITGAFRSTGQKCTATSRVIVQSGIYERFKEKLLQRTKDITIGD
SLKEDVWMGPIASKNQLDNCLSYIEKGKQEGASLLIGGEKLENGKYQNGYYVQPAIFDNVTSEMTIAQEEIFGPVIALIK
VDSIEEALNIANDVKFGLSASIFTENIGRMLSFIDEIDAGLVRINAESAGVELQAPFGGMKQSSSHSREQGEAAKDFFTA
IKTVFVKP
>P0AEX3 ~~~kgtP~~~Alpha-ketoglutarate permease~~~COG0477
MAESTVTADSKLTSSDTRRRIWAIVGASSGNLVEWFDFYVYSFCSLYFAHIFFPSGNTTTQLLQTAGVFAAGFLMRPIGG
WLFGRIADKHGRKKSMLLSVCMMCFGSLVIACLPGYETIGTWAPALLLLARLFQGLSVGGEYGTSATYMSEVAVEGRKGF
YASFQYVTLIGGQLLALLVVVVLQHTMEDAALREWGWRIPFALGAVLAVVALWLRRQLDETSQQETRALKEAGSLKGLWR
NRRAFIMVLGFTAAGSLCFYTFTTYMQKYLVNTAGMHANVASGIMTAALFVFMLIQPLIGALSDKIGRRTSMLCFGSLAA
IFTVPILSALQNVSSPYAAFGLVMCALLIVSFYTSISGILKAEMFPAQVRALGVGLSYAVANAIFGGSAEYVALSLKSIG
METAFFWYVTLMAVVAFLVSLMLHRKGKGMRL
>Q2GLF7 2.7.4.8~~~gmk~~~Guanylate kinase~~~COG0194
MLKSVGVILVLSSPSGCGKTTVANKLLEKQKNNIVKSVSVTTRAARKGEKEGKDYYFVDREEFLRLCSNGEIIEHAEVFG
NFYGVPRKNLEDNVDKGVSTLLVIDWQGAFKFMEMMREHVVSIFIMPPSMEELRRRLCGRRADDSEVVEARLKGAAFEIS
HCEAYDYVIVNEDIEETADRISNILRAEQMKTCRQVGLRELLESRFPIED
>Q6G439 2.7.4.8~~~gmk~~~Guanylate kinase~~~COG0194
MVTFFENELSAKKRNQRRGFLFILSSPSGAGKSTLSRLLLKDGKLELSISMTTRQKRPSEVDGLHYHFISKKEFKRKRDG
NEFIEWAEVHGNYYGTLRESVENVLSTGRDMLFDIDYQGTKQLQKKMPGDTVSVFILPPSMKELISRLYRRAEDSQDIIN
LRLKNARTEMQHWRSYDYVIINENLNQSVSLIKSIYLAETVKRERCFFLEPFINGLIAEKID
>Q83EL7 2.7.4.8~~~gmk~~~Guanylate kinase~~~COG0194
MNKANLFIISAPSGAGKTSLVRALVKALAEIKISISHTTRPKRPGDQEGVDYFFIDETRFQAMVKEGAFLEHATIYERHY
GTEKDWVLRQLKAGRDVLLEIDWQGARQIRELFPPALSIFILPPSIEALRERLIKRRQDDTAIIEQRLALAREEMAHYKE
FDYLVVNDNFDQAVQNLIHIISAERLQRDVQEKKLSRLLAELVEKQ
>P60546 2.7.4.8~~~gmk~~~Guanylate kinase~~~COG0194
MAQGTLYIVSAPSGAGKSSLIQALLKTQPLYDTQVSVSHTTRQPRPGEVHGEHYFFVNHDEFKEMISRDAFLEHAEVFGN
YYGTSREAIEQVLATGVDVFLDIDWQGAQQIRQKMPHARSIFILPPSKIELDRRLRGRGQDSEEVIAKRMAQAVAEMSHY
AEYDYLIVNDDFDTALTDLKTIIRAERLRMSRQKQRHDALISKLLAD
>Q8Y672 2.7.4.8~~~gmk~~~Guanylate kinase~~~COG0194
MTERGLLIVLSGPSGVGKGTVREAVFKDPETSFDYSISMTTRLPREGEQDGVDYYFRSREVFEQAIKDGKMLEYAEYVGN
YYGTPLEYVEEKLAAGVDIFLEIEVQGAMQVRKAMPEGIFIFLTPPDLSELKNRIIGRGTESMEVVEERMETAKKEIEMM
ASYDYAVVNDVVANAVQKIKGIVETEHLKTERVIHRYKKMLEGLQ
>P9WKE9 2.7.4.8~~~gmk~~~Guanylate kinase~~~COG3709
MSVGEGPDTKPTARGQPAAVGRVVVLSGPSAVGKSTVVRCLRERIPNLHFSVSATTRAPRPGEVDGVDYHFIDPTRFQQL
IDQGELLEWAEIHGGLHRSGTLAQPVRAAAATGVPVLIEVDLAGARAIKKTMPEAVTVFLAPPSWQDLQARLIGRGTETA
DVIQRRLDTARIELAAQGDFDKVVVNRRLESACAELVSLLVGTAPGSP
>Q9HTM2 2.7.4.8~~~gmk~~~Guanylate kinase~~~
MSGTLYIVSAPSGAGKTSLVKALLDAAPEVRVSVSHTTRGMRPGEVDGVNYHFTSREEFLAMLERNEFLEHAEVFGNLYG
TSQRWVEKTLAEGLDLILEIDWQGAQQVRRLMPEAQSIFILPPSQEALRQRLTNRGQDSDEVIERRMREAVSEMSHYVEY
DHLVINDDFAHALDDLKAIFRARQLRQDAQQQRHAELLGRLLA
>Q5HGM3 2.7.4.8~~~gmk~~~Guanylate kinase~~~
MDNEKGLLIVLSGPSGVGKGTVRKRIFEDPSTSYKYSISMTTRQMREGEVDGVDYFFKTRDAFEALIKDDQFIEYAEYVG
NYYGTPVQYVKDTMDEGHDVFLEIEVEGAKQVRKKFPDALFIFLAPPSLEHLRERLVGRGTESDEKIQSRINEARKEVEM
MNLYDYVVVNDEVELAKNRIQCIVEAEHLKRERVEAKYRKMILEAKK
>P99176 2.7.4.8~~~gmk~~~Guanylate kinase~~~
MDNEKGLLIVLSGPSGVGKGTVRKRIFEDPSTSYKYSISMTTRQMREGEVDGVDYFFKTRDAFEALIKDDQFIEYAEYVG
NYYGTPVQYVKDTMDEGHDVFLEIEVEGAKQVRKKFPDALFIFLAPPSLDHLRERLVGRGTESNEKIQSRINEARKEVEM
MNLYDYVVVNDEVELAKNRIQCIVEAEHLKRERVEAKYRKMILEAKK
>Q9KNM4 2.7.4.8~~~gmk~~~Guanylate kinase~~~COG0194
MGKGTLYIVSAPSGAGKSSLIAALLEQNPTYAMKVSVSHTTRGMRPGEQDGVHYHFVEKEHFIELIGKGEFLEYAEVFGN
YYGTSRVWIENTLNKGIDVFLDIDWQGARQIRSQMPEAKSIFILPPSKEELERRLNTRGQDSDAVIAKRMGEAKSEISHY
SEYDYVIINDDFDVALMDFKAIIRAERLKQDKQAAKYSAMLSALLAE
>P9WFM7 ~~~khpA~~~RNA-binding protein KhpA~~~COG1837
MSAVVVDAVEHLVRGIVDNPDDVRVDLITSRRGRTVEVHVHPDDLGKVIGRGGRTATALRTLVAGIGGRGIRVDVVDTDQ
>P0A4Q4 ~~~khpA~~~RNA-binding protein KhpA~~~COG1837
MLEEALEHLVKGIVDNPDDVQVASRNLRRGRVLEVRVHPDDLGKVIGRNGRTARALRTVVGAIGGRGVRVDLVDVDHVR
>A0A0H2ZMB4 ~~~khpA~~~RNA-binding protein KhpA~~~COG1837
MDTIENLIIAIVKPLISQPDALTIKIEDTPEFLEYHLNLDQSDVGRVIGRKGRTISAIRTIVYSVPTEYKKVRIVIDEK
>Q8DQG4 ~~~khpA~~~RNA-binding protein KhpA~~~COG1837
MDTIENLIIAIVKPLISQPDALTIKIEDTPEFLEYHLNLDQSDVGRVIGRKGRTISAIRTIVYSVPTEYKKVRIVIDEK
>D0VX24 ~~~khpB~~~RNA-binding protein KhpB~~~
MDMVTVTAKTVEEAVTKALIELQTTSDKLTYEIVEKGSAGFLGIGSKPAIIRAKRKETLQDKAIEFLEQVFDAMNMAVDI
SVEYNETEKEMNVNLKGDDMGILIGKRGQTLDSLQYLVSLVVNKSSSDYIRVKLDTENYRERRKETLETLAKNIAYKVKR
TKRSVSLEPMNPYERRIIHAALQNDKYVVTRSDGEEPFRHVIISLKRENRRDRNDRSDRNEK
>F9ULM5 ~~~khpB~~~RNA-binding protein KhpB~~~COG1847
MTVFEGNTVAAAIAAGLKQLHRTRDQVEVEVIAEAKKGFLGLGKHPAQVRLTVVPASAAPATTPTSATATAQQSVATEST
TAPTMPRPTVQTPKSTPTRQAKTSQATTSAAKPATSKAKAVAKPASMAVTTGPVIADTDQSKPATTSKTKSVAADQSQTP
RTPAEIAARQAANETAVRALCDYLLAVVKELGVTADLDVDFGNRYATLNFDTTKQGLLIGKHGRTINALQDLAQVYMNHH
GASHVNVVLDVDDYRERRAATLKRLAESTAREVIATGKQVFLDPMPSFERKLIHAELANNHHVTTFSEGRDPHRAVVVAI
RK
>A0A0H2ZPS7 ~~~kphB~~~RNA-binding protein KhpB~~~COG1847
MVVFTGSTVEEAIQKGLKELDIPRMKAHIKVISREKKGFLGLFGKKPAQVDIEAISETTVVKANQQVVKGVPKKINDLNE
PVKTVSEETVDLGHVVDAIKKIEEEGQGISDEVKAEILKHERHASTILEETGHIEILNELQIEEAMREEAGADDLETEQD
QAESQELEDLGLKVETNFDIEQVATEVMAYVQTIIDDMDVEATLSNDYNRRSINLQIDTNEPGRIIGYHGKVLKALQLLA
QNYLYNRYSRTFYVTINVNDYVEHRAEVLQTYAQKLATRVLEEGRSHKTDPMSNSERKIIHRIISRMDGVTSYSEGDEPN
RYVVVDTE
>Q8CY87 ~~~khpB~~~RNA-binding protein KhpB~~~COG1847
MVVFTGSTVEEAIQKGLKELDIPRMKAHIKVISREKKGFLGLFGKKPAQVDIEAISETTVVKANQQVVKGVPKKINDLNE
PVKTVSEETVDLGHVVDAIKKIEEEGQGISDEVKAEILKHERHASTILEETGHIEILNELQIEEAMREEAGADDLETEQD
QAESQELEDLGLKVETNFDIEQVATEVMAYVQTIIDDMDVEATLSNDYNRRSINLQIDTNEPGRIIGYHGKVLKALQLLA
QNYLYNRYSRTFYVTINVNDYVEHRAEVLQTYAQKLATRVLEEGRSHKTDPMSNSERKIIHRIISRMDGVTSYSEGDEPN
RYVVVDTE
>Q8UHA8 2.7.1.39~~~thrB~~~Homoserine kinase~~~COG2334
MAVYTDITEDELRNFLTQYDVGSLTSYKGIAEGVENSNFLLHTTKDPLILTLYEKRVEKNDLPFFLGLMQHLAAKGLSCP
LPLPRKDGELLGELSGRPAALISFLEGMWLRKPEAKHCREVGKALAAMHLASEGFEIKRPNALSVDGWKVLWDKSEERAD
EVEKGLREEIRPEIDYLAAHWPKDLPAGVIHADLFQDNVFFLGDELSGLIDFYFACNDLLAYDVSICLNAWCFEKDGAYN
VTKGKALLEGYQSVRPLSEAELEALPLLSRGSALRFFLTRLYDWLTTPAGALVVKKDPLEYLRKLRFHRTIANVAEYGLA
GE
>O66132 2.7.1.39~~~thrB~~~Homoserine kinase~~~COG0083
MIKIYAPASIGNVGVGFDILGAAIIPVNGSLLGDFVTVKLSNKFNLVNKGIFSNKLPKNTEQNIVWKCWLKFCNTIKRNI
PVSIILEKNMPIGSGLGSSACSIVATLVAMNEFCDKPLNSKELLLLMGEVEGEISGSIHYDNVAPCYLGGLQLILEDSKI
ISQTIPNFKNWFWIVAWPGTKVPTAEARDILPKKYKKETCIKNSRYLAGFIHASYSQQPHLAARLMQDFIAEPYRIKLLP
NYLYVKEKIKKIGAISSGISGSGPTIFSISDNINTAQKISAWLTENYLQNTTGFVHICFLDSKGVRKIG
>P07128 2.7.1.39~~~thrB~~~Homoserine kinase~~~COG0083
MAIELNVGRKVTVTVPGSSANLGPGFDTLGLALSVYDTVEVEIIPSGLEVEVFGEGQGEVPLDGSHLVVKAIRAGLKAAD
AEVPGLRVVCHNNIPQSRGLGSSAAAAVAGVAAANGLADFPLTQEQIVQLSSAFEGHPDNAAASVLGGAVVSWTNLSIDG
KSQPQYAAVPLEVQDNIRATALVPNFHASTEAVRRVLPTEVTHIDARFNVSRVAVMIVALQQRPDLLWEGTRDRLHQPYR
AEVLPITSEWVNRLRNRGYAAYLSGAGPTAMVLSTEPIPDKVLEDARESGIKVLELEVAGPVKVEVNQP
>P00547 2.7.1.39~~~thrB~~~Homoserine kinase~~~COG0083
MVKVYAPASSANMSVGFDVLGAAVTPVDGALLGDVVTVEAAETFSLNNLGRFADKLPSEPRENIVYQCWERFCQELGKQI
PVAMTLEKNMPIGSGLGSSACSVVAALMAMNEHCGKPLNDTRLLALMGELEGRISGSIHYDNVAPCFLGGMQLMIEENDI
ISQQVPGFDEWLWVLAYPGIKVSTAEARAILPAQYRRQDCIAHGRHLAGFIHACYSRQPELAAKLMKDVIAEPYRERLLP
GFRQARQAVAEIGAVASGISGSGPTLFALCDKPETAQRVADWLGKNYLQNQEGFVHICRLDTAGARVLEN
>Q8Y4A6 2.7.1.39~~~thrB~~~Homoserine kinase~~~COG0083
MRIRVPATTANLGPGFDSCGLALTLYLTLDIGAEADSWYIEHNIGGGIPHDETNVIIETALNLAPNLTPHHLVMTCDIPP
ARGLGSSSAAVVAGIELANTLAELNLSKEEKVRIAAEIEGHPDNVAPAVLGNWVVGAKLDGEDFYVRHLFPDCALIAFIP
KAELLTSESRGVLPDTLPFKEAVQASSIANVMIAAILRNDMTLAGEMMERDLWHEKYRSQLVPHLAQIRDVAKNQGAYAA
CLSGAGPTVLVFAPRNLANKLQTSLQTLEIDADVLLLDVEGSGAEVFR
>B1MLU6 2.7.1.39~~~thrB~~~Homoserine kinase~~~
MSEVLPAGLATTVLVPASSANLGPGFDSLGIALSLYDEIEVNTTESGLKVAVEGQGAGEVPLDGSHLVVRAIERGLAAGG
AAAPGLIVQCHNKIPHSRGLGSSAAAAVAGLGVANGLLAKAGRAVLSDDVLVQLASEFEGHPDNAAASVLGGAVVSWSET
SGATPIYAATRLDVHPDIKIVAAIPEEQSSTAHTRVLLPQAVTHVDARFNISRVALLTVALTARPDLLMTATEDRLHQPQ
RASAMPASADVLAYLRSQGVAAVLSGAGPAVLALTTVDLPDSAVKYAEDQGFSLVAMAVSAGVSVR
>P9WKE7 2.7.1.39~~~thrB~~~Homoserine kinase~~~COG0083
MVTQALLPSGLVASAVVAASSANLGPGFDSVGLALSLYDEIIVETTDSGLTVTVDGEGGDQVPLGPEHLVVRAVQHGLQA
AGVSAAGLAVRCRNAIPHSRGLGSSAAAVVGGLAAVNGLVVQTDSSPSSDAELIQLASEFEGHPDNAAAAVLGGAVVSWT
DHSGDRPNYSAVSLRLHPDIRLFTAIPEQRSSTAETRVLLPAQVSHDDARFNVSRAALLVVALTERPDLLMAATEDLLHQ
PQRAAAMTASAEYLRLLRRHNVAAALSGAGPSLIALSTDSELPTDAVEFGAAKGFAVTELTVGEAVRWSPTVRVPG
>Q1CMW6 2.7.1.39~~~thrB~~~Homoserine kinase~~~
MVKIYAPASIGNVSVGFDVLGAAVSPIDGTLLGDCVSVTAAERFSLHNEGRFVSKLPDDPKQNIVYQCWERFCQEMGKEI
PVAMVLEKNMPIGSGLGSSACSVVAGLMAMNEFCGQPLDKVTLLGMMGELEGRVSGSIHFDNVAPCYLGGMQLILEQEGY
ISQDVPGFSDWLWVMAYPGIKVSTAEARAILPAQYRRQDCITHGRNLAGFIHACHTQQPDLAAKMMKDVIAEPYRTQLLP
GFAAARQAAQDIGALACGISGSGPTLFAVCNDQATAQRMAGWLQNHYLQNDEGFVHICRLDTAGARLLG
>O07534 ~~~khtS~~~K(+)/H(+) antiporter modulator KhtS~~~
MRREERNMDKLLISFLLSLFMVYFPPSDVVLPSQFEASTDSYVPMSSYPQETQSAKTPSPGSMHPAELIKEYSPLAQSVR
QLSVKPLDEPLINRLEKALAVPVKYQSNYLRI
>O07535 ~~~khtT~~~K(+)/H(+) antiporter subunit KhtT~~~COG0490
MNIKENDLPGIGKKFEIETRSHEKMTIIIHDDGRREIYRFNDRDPDELLSNISLDDSEARQIAAILGGMVYKPQALESIE
MAFSDLIIEWFKVEKGAKSIGRTLGELDVRQNYDVTVIAIIKHNQEKLLNPGADSIIEENDTLVLSGERKHLKKLIHDFL
SGEGV
>O07536 ~~~khtU~~~K(+)/H(+) antiporter subunit KhtU~~~COG0475
MDHLVFEVGTALVLVAIASVIANKIKFSIIPFLIVLGMLVGPHAPKMGIIDLTFIQSSEIIEFFGRMGVLFLLFYLGLEF
SVGKLIKSGKSIAVGGTIYILINFSLGLLYGFITGFSFLEVLILAGVITISSSAIVAKVLVDLKRTANPETELILGIIMF
EDIFLAVYLSVVSGLILGDATSVGSALLSILIAFGYMLLFFIAARKLPPLLNKLLDIRSNEVFIIVIFAALFFIAGFSET
IHVAEAIGALLLGLVFSETEHSDRIEHLVVPFRDFFGAMFFFSFGLSIDPFSLGEAVWLALGAVILTILGNFIAGMVAGR
RAGLSHKASSNIGLTIVSRGEFSIIVANLGIAGGLSATLKPFAALYVLILAILGPLVTKESKRIYRLLNKVFKWKPEVQP
AKKQG
>P00557 2.7.1.163~~~hph~~~Hygromycin-B 4-O-kinase~~~
MKKPELTATSVEKFLIEKFDSVSDLMQLSEGEESRAFSFDVGGRGYVLRVNSCADGFYKDRYVYRHFASAALPIPEVLDI
GEFSESLTYCISRRAQGVTLQDLPETELPAVLQPVAEAMDAIAAADLSQTSGFGPFGPQGIGQYTTWRDFICAIADPHVY
HWQTVMDDTVSASVAQALDELMLWAEDCPEVRHLVHADFGSNNVLTDNGRITAVIDWSEAMFGDSQYEVANIFFWRPWLA
CMEQQTRYFERRHPELAGSPRLRAYMLRIGLDQLYQSLVDGNFDDAAWAQGRCDAIVRSGAGTVGRTQIARRSAAVWTDG
CVEVLADSGNRRPSTRPRAKE
>P09979 2.7.1.119~~~hyg~~~Hygromycin-B 7''-O-kinase~~~
MTQESLLLLDRIDSDDSYASLRNDQEFWEPLARRALEELGLPVPPVLRVPGESTNPVLVGEPDPVIKLFGEHWCGPESLA
SESEAYAVLADAPVPVPRLLGRGELRPGTGAWPWPYLVMSRMTGTTWRSAMDGTTDRNALLALARELGRVLGRLHRVPLT
GNTVLTPHSEVFPELLRERRAATVEDHRGWGYLSPRLLDRLEDWLPDVDTLLAGREPRFVHGDLHGTNIFVDLAATEVTG
IVDFTDVYAGDSRYSLVQLHLNAFRGDREILAALLDGAQWKRTEDFARELLAFTFLHDFEVFEETPLDLSGFTDPEELAQ
FLWGPPDTAPGA
>B3TMR8 1.1.1.384~~~~~~dTDP-3,4-didehydro-2,6-dideoxy-alpha-D-glucose 3-reductase~~~
MENPANANPIRVGVIGCADIAWRRALPALEAEPLTEVTAIASRRWDRAKRFTERFGGEPVEGYPALLERDDVDAVYVPLP
AVLHAEWIDRALRAGKHVLAEKPLTTDRPQAERLFAVARERGLLLMENFMFLHHPQHRQVADMLDEGVIGEIRSFAASFT
IPPKPQGDIRYQADVGGGALLDIGVYPIRAAGLFLGADLEFVGAVLRHERDRDVVVGGNALLTTRQGVTAQLTFGMEHAY
TNNYEFRGSTGRLWMNRVFTPPATYQPVVHIERQDHAEQFVLPAHDQFAKSIRAFAQAVLSGEHPREWSEDSLRQASLVD
AVRTGARDIYFP
>P38393 ~~~kilR~~~Killing protein KilR~~~
MIAHHFGTDEIPRQCVTPGDYVLHEGRTYIASANNIKKRKLYIRNLTTKTFITDRMIKVFLGRDGLPVKAESW
>P96589 ~~~kimA~~~Potassium transporter KimA~~~COG0531
MYHSIKRFLIGKPLKSQAAGEQKLTKLKALAMLSSDALSSVAYGTEQILIILATISAAAFWYSIPIAVGVLILLLALILS
YRQIIYAYPQGGGAYIVSKENLGEKPGLIAGGSLLVDYILTVAVSISAGTDAITSAFPALHDYHVPIAIFLVLVIMILNL
RGLSESASILAYPVYLFVVALLVLIAVGLFKLMTGQIDQPAHHTSLGTPVAGITLFLLLKAFSSGCSALTGVEAISNAIP
AFKNPPARNAARTLAMMGILLAILFSGITVLAYGYGTAPKPDETVVSQIASETFGRNVFYYVIQGVTSLILVLAANTGFS
AFPQLAFNLARDQYMPRMFTVRGDRLGFSNGIIFLGFASIVLIILFGGQTEHLIPLYAVGVFIPFTLSQTGMCMKWIKQK
PKGWIGKMLINSCGALISFMVLSILFVTKFNVVWPVLIFMPIVVLLFFAIKNHYTAVGEQLRIVDKEPEEIKGTVVIVPV
AGVTTVVQKSIHYAKSLSDQVIAVHVSFDREQEKKFEKRWEELNNGVRLVTLHSSYRSLVHPFDKFLETVEAKAKKEQFS
VMVLFPQFITKKRWHTILHNQSAFLLRVRLFWKKDIMVATLPYHFKK
>P16497 2.7.13.3~~~kinA~~~Sporulation kinase A~~~COG4191
MEQDTQHVKPLQTKTDIHAVLASNGRIIYISANSKLHLGYLQGEMIGSFLKTFLHEEDQFLVESYFYNEHHLMPCTFRFI
KKDHTIVWVEAAVEIVTTRAERTEREIILKMKVLEEETGHQSLNCEKHEIEPASPESTTYITDDYERLVENLPSPLCISV
KGKIVYVNSAMLSMLGAKSKDAIIGKSSYEFIEEEYHDIVKNRIIRMQKGMEVGMIEQTWKRLDGTPVHLEVKASPTVYK
NQQAELLLLIDISSRKKFQTILQKSRERYQLLIQNSIDTIAVIHNGKWVFMNESGISLFEAATYEDLIGKNIYDQLHPCD
HEDVKERIQNIAEQKTESEIVKQSWFTFQNRVIYTEMVCIPTTFFGEAAVQVILRDISERKQTEELMLKSEKLSIAGQLA
AGIAHEIRNPLTAIKGFLQLMKPTMEGNEHYFDIVFSELSRIELILSELLMLAKPQQNAVKEYLNLKKLIGEVSALLETQ
ANLNGIFIRTSYEKDSIYINGDQNQLKQVFINLIKNAVESMPDGGTVDIIITEDEHSVHVTVKDEGEGIPEKVLNRIGEP
FLTTKEKGTGLGLMVTFNIIENHQGVIHVDSHPEKGTAFKISFPKK
>O34206 2.7.13.3~~~kinB~~~Alginate biosynthesis sensor protein KinB~~~COG5002
MSMPLPMKLRTRLFLSISALITVSLFGLLLGLFSVMQLGRAQEQRMSHHYATIEVSQQLRQLLGDQLVILLRETPDGQAL
ERSQNDFRRVLEQGRANTVDSAEQAALDGVRDAYLQLQAHTPALLEAPMVDNDGFSEAFNGLRLRLQDLQQLALAGISDA
ETSARHRAYLVAGLLGLVGVAILLIGFVTAHSIARRFGAPIETLARAADRIGEGDFDVTLPMTNVAEVGQLTRRFGLMAE
ALRQYRKTSVEEVLSGERRLQAVLDSIDDGLVIFDNQGRIEHANPVAIRQLFVSNDPHGKRIDEILSDVDVQEAVEKALL
GEVQDEAMPDLVVDVAGESRLLAWSLYPVTHPGGHSVGAVLVVRDVTEQRAFERVRSEFVLRASHELRTPVTGMQMAFSL
LRERLDFPAESREADLIQTVDEEMSRLVLLINDLLNFSRYQTGMQKLELASCDLVDLLTQAQQRFIPKGEARRVSLQLEL
GDELPRLQLDRLQIERVIDNLLENALRHSSEGGQIHLQARRQGDRVLIAVEDNGEGIPFSQQGRIFEPFVQVGRKKGGAG
LGLELCKEIIQLHGGRIAVRSQPGQGARFYMLLPV
>P39764 2.7.13.3~~~kinC~~~Sporulation kinase C~~~COG3852
MRKYQARIISIILAMIFIMFWDYLFYFIGKNPINWPVDIVYTAVTLVSVWMLAYYIDEKQQLVKKMKDNEWKYKQLSEEK
NRIMDNLQEIVFQTNAKGEITYLNQAWASITGFSISECMGTMYNDYFIKEKHVADHINTQIQNKASSGMFTAKYVTKNGT
IFWGEVHYKLYYDRDDQFTGSLGTMSDITERKEAEDELIEINERLARESQKLSITSELAAGIAHEVRNPLTSVSGFLQIM
KTQYPDRKDYFDIIFSEIKRIDLVLSELLLLAKPQAITFKTHQLNEILKQVTTLLDTNAILSNIVIEKNFKETDGCMING
DENQLKQVFINIIKNGIEAMPKGGVVTISTAKTASHAVISVKDEGNGMPQEKLKQIGKPFYSTKEKGTGLGLPICLRILK
EHDGELKIESEAGKGSVFQVVLPLKSDS
>O31671 2.7.13.3~~~kinD~~~Sporulation kinase D~~~COG3852
MLERCKLKILKGACGRVKLYIILVVIPAIVISFFVYEKEKDTIAAEHKQEASVLLNLHRNKINYLIGETMARMTSLSIAI
DRPVDIKKMQSILEKTFDSEPRFSGLYFLNAKGDVTASTTELKTKVNLADRSFFIKAKETKKTVISDSYSSRITGQPIFT
ICVPVLDSKRNVTDYLVAAIQIDYLKNLINLLSPDVYIEVVNQDGKMIFASGQASHAEDQKPVSGYLDDISWNMKVYPNP
VTIEELSKSLVLPLSCIIVLLNILFILVLYYLLKRQTQLERSENEAQKLELIGTLAASTAHEIRNPLTGISGFIQLLQKK
YKGEEDQLYFSIIEQEIKRINQIVSEFLVLGKPTAEKWELNSLQDIIGEIMPIIYSEGNLYNVEVELQYLTEQPLLVKCT
KDHIKQVILNVAKNGLESMPEGGKLTISLGALDKKAIIKVVDNGEGISQEMLDHIFLPFVTSKEKGTGLGLVVCKRIVLM
YGGSIHIESEVRRGTEVTITLPVSAS
>O31661 2.7.13.3~~~kinE~~~Sporulation kinase E~~~COG5002
METLGVQTNSELREELNRLKEENARLKKELNQHQVIVNNTLDAIFICDNEMRIVQANEATERMLQVDSEDLKKRSVLDFL
FSIPKDELNLSVKKFFKKGFLWKEVPIRLDCGATKYIEFLAKRGIGEDFFFVVMRDISSKKILEREFSMNEQLFKDLFDR
AVDGIVLFDKDGGFIDANLSFCKSFEINHNELSHLSLYEFIDSGSRKDFDNIWKALNRKGKAKGELPVKLRSGVQKLFEF
TITSNIISGFYMSIMRDITEKRSMELQLFKSEERFREIFENAMDAIIIWSNDGRIVKANQSACKIFELPMNLLLKRKLCD
FLVDSQQKYSITKRKYAKYGEIREELLFQMGNGQFKELEFTSKRTILENQHLTILRNVSDRKRMEKELRESELKFRKVFN
GSMDGNVLFDNQYRIIDANPLASHILGLSHEEIKQHSLLDIISAYEIENLASPARQINFDEMDNEIPFLLSSGDNRKLEF
SFKRNIIQNMNLAIFKDVTERKELEERLRKSDTLHVVGELAAGIAHEIRNPMTALKGFIQLLKGSVEGDYALYFNVITSE
LKRIESIITEFLILAKPQAIMYEEKHVTQIMRDTIDLLNAQANLSNVQMQLDLIDDIPPIYCEPNQLKQVFINILKNAIE
VMPDGGNIFVTIKALDQDHVLISLKDEGIGMTEDKLKRLGEPFYTTKERGTGLGLMVSYKIIEEHQGEIMVESEEGKGTV
FHITLPVRQNAEERRNDE
>P42968 ~~~kipR~~~HTH-type transcriptional regulator KipR~~~COG1414
MQNKNKTVVKSMALLNLFLHKPSLTLSELVSLTGMPKTSVHRMVSSLEEMGFLSRDASGAYSLGLVFLEFGQLVADRLDI
RKIAKPVMEELCREVDEAVQLIMRDGNEAIYVEKIEGTQTVRLYTAIGRRSPLYAGACARSILSFLPREEIEAYIKQTEL
ISIGSGTITDPEKLLQEIDASVQNGYTVSYSELENYTAAIGAPIFNHERQVAAGISIAGFEARFTEDRLPYLTEKVKDAA
LQISRKIGYT
>Q81JX0 2.7.1.21~~~tdk~~~Thymidine kinase~~~COG1435
MYLINQNGWIEVICGSMFSGKSEELIRRVRRTQFAKQHAIVFKPCIDNRYSEEDVVSHNGLKVKAVPVSASKDIFKHITE
EMDVIAIDEVQFFDGDIVEVVQVLANRGYRVIVAGLDQDFRGLPFGQVPQLMAIAEHVTKLQAVCSACGSPASRTQRLID
GEPAAFDDPIILVGASESYEPRCRHCHAVPTKQR
>Q814U0 2.7.1.21~~~tdk~~~Thymidine kinase~~~
MYLINQNGWIEVICGSMFSGKSEELIRRVRRTQFAKQHAIVFKPCIDNRYSEEDVVSHNGLKVKAVPVSASKDIFEHITE
ELDVIAIDEVQFFDGDIVEVVQVLANRGYRVIVAGLDQDFRGLPFGQVPQLMAIAEHVTKLQAVCSVCGSPASRTQRLID
GEPAAFDDPIILVGASESYEPRCRHCHAVPANKDK
>Q97F65 2.7.1.21~~~tdk~~~Thymidine kinase~~~COG1435
MYRPKDHGWVEVIVGPMYSGKSEELIRRIRRAKIAKQKIQVFKPEIDNRYSKEDVVSHMGEKEQAVAIKNSREILKYFEE
DTEVIAIDEVQFFDDEIVEIVNKIAESGRRVICAGLDMDFRGKPFGPIPELMAIAEFVDKIQAICVVCGNPATRTQRLIN
GKPAFYDDPVVLIGAMESYEARCRKCHVVPQKKEV
>P75070 2.7.1.21~~~tdk~~~Thymidine kinase~~~
MSFSQVFHQSPRGWIEVICGPMFSGKTEELLRKIKRWKLAKIPVIIFKPKIDTRQQHLVKSRNGHSDEAIEINSPLEIYD
YLTKDRFDVVAIDEAQFFSSEIVEVVKSLNDLGINVIVSGLDTDFRAEPFGSIPQLLAIADKICKLDAVCNVCGQLAQRT
QRIVSKSNETVLIGDIEAYEPRCKLHQPSAG
>P65231 2.7.1.21~~~tdk~~~Thymidine kinase~~~
MYETYHSGWIECITGSMFSGKSEELIRRLRRGIYAKQKVVVFKPAIDDRYHKEKVVSHNGNAIEAINISKASEIMTHNLT
NVDVIGIDEVQFFDDEIVSIVEKLSADGHRVIVAGLDMDFRGEPFEPMPKLMAVSEQVTKLQAVCAVCGSSSSRTQRLIN
GKPAKIDDPIILVGANESYEPRCRAHHIVAPSDNNKEEL
>Q9WYN2 2.7.1.21~~~tdk~~~Thymidine kinase~~~COG1435
MSGKLTVITGPMYSGKTTELLSFVEIYKLGKKKVAVFKPKIDSRYHSTMIVSHSGNGVEAHVIERPEEMRKYIEEDTRGV
FIDEVQFFNPSLFEVVKDLLDRGIDVFCAGLDLTHKQNPFETTALLLSLADTVIKKKAVCHRCGEYNATLTLKVAGGEEE
IDVGGQEKYIAVCRDCYNTLKKRV
>Q9PPP5 2.7.1.21~~~tdk~~~Thymidine kinase~~~COG1435
MAKVNAFSKKIGWIELITGPMFAGKTAELIRRLHRLEYADVKYLVFKPKIDTRSIRNIQSRTGTSLPSVEVESAPEILNY
IMSNSFNDETKVIGIDEVQFFDDRICEVANILAENGFVVIISGLDKNFKGEPFGPIAKLFTYADKITKLTAICNECGAEA
THSLRKIDGKHADYNDDIVKIGCQEFYSAVCRHHHKVPNRPYLNSNSEEFIKFFKNKKRNKNI
>P00552 2.7.1.95~~~neo~~~Aminoglycoside 3'-phosphotransferase~~~
MIEQDGLHAGSPAAWVERLFGYDWAQQTIGCSDAAVFRLSAQGRPVLFVKTDLSGALNELQDEAARLSWLATTGVPCAAV
LDVVTEAGRDWLLLGEVPGQDLLSSHLAPAEKVSIMADAMRRLHTLDPATCPFDHQAKHRIERARTRMEAGLVDQDDLDE
EHQGLAPAELFARLKARMPDGEDLVVTHGDACLPNIMVENGRFSGFIDCGRLGVADRYQDIALATRDIAEELGGEWADRF
LVLYGIAAPDSQRIAFYRLLDEFF
>P0A3Y5 2.7.1.95~~~aphA~~~Aminoglycoside 3'-phosphotransferase~~~
MAKMRISPELKKLIEKYRCVKDTEGMSPAKVYKLVGENENLYLKMTDSRYKGTTYDVEREKDMMLWLEGKLPVPKVLHFE
RHDGWSNLLMSEADGVLCSEEYEDEQSPEKIIELYAECIRLFHSIDISDCPYTNSLDSRLAELDYLLNNDLADVDCENWE
EDTPFKDPRELYDFLKTEKPEEELVFSHGDLGDSNIFVKDGKVSGFIDLGRSGRADKWYDIAFCVRSIREDIGEEQYVEL
FFDLLGIKPDWEKIKYYILLDELF
>P00553 2.7.1.95~~~aphA4~~~Aminoglycoside 3'-phosphotransferase~~~
MNESTRNWPEELLELLGQTELTVNKIGYSGDHVYHVKEYRGTPAFLKIAPSVWWRTLRPEIEALAWLDGKLPVPKILYTA
EHGGMDYLLMEALGGKDGSHETIQAKRKLFVKLYAEGLRSVHGLDIRECPLSNGLEKKLRDAKRIVDESLVDPADIKEEY
DCTPEELYGLLLESKPVTEDLVFAHGDYCAPNLIIDGEKLSGFIDLGRAGVADRYQDISLAIRSLRHDYGDDRYKALFLE
LYGLDGLDEDKVRYYIRLDEFF
>P52603 ~~~klcA~~~Antirestriction protein KlcA~~~
MTDVQIPSPIVATRVAEADRLRFLPTYFGPSMLRMLRGEALVFGWMGRLCAAYHGGFWHFYTLSNGGFYMAPEHDGRLRI
EVDGNGFAGELSADAAGIVATLFALNQLCAELAGTADADALIDRYHHLAAFASEHAEAAAIYRAID
>P52605 ~~~klcB~~~Protein KlcB~~~
MQDDNIKRRNDAIVAGRLFSGSVQDRETERCHHCGELLHPFPEPEYWMQYPLPTCCVLDGLKFCDDYRKPDCIAAYLVAN
PTPPEATTPAARRRTKARKSKPQTEDKDARIAALAATLPEDRAGLLAVAADAVAAVHDAVLNRADLVADVAGERYAAAVW
KLNGGTFFGCAGDQDAAERVIERHCRATPGVVPMWGQEGDFLASVDGMRVWVEVESGYGGLTTVHFQFHAVDLDGPFISE
TGYRSHYDHARGGMTVDQVADGVLRALLRSHRRYLDARDQDRLADEPLPAWLAGITPPPRRVRAVVEDWRKPDELPPGFA
WVDAVLPAHQAFIARKWAASAKAKLAAARAKAQEPAGQRREPVTPAKPEPEPAKDEDAPAWPATFFPGLRCEIVSVHHPV
FAKEIGKHVIITKISPETRQVWAHDDKPPRYRINRNGRKVCEYDPRCIESCYGYDQLRAAI
>D0EM77 3.4.24.-~~~kly~~~Karilysin~~~COG0265
MKRFILLFFLSTIAIFKVYSQRLYDNGPLTGDNNYVLQGSKWNKTTLKYYIYNSSSHLTTTERENAIRSAFALWSDKSTL
SFIQVYNPNQADIKIKWEKGNHGDGYPFDGNTGILAHAFYPPPAGGNYAGHLHFDGDENWSINGSGIDLITVAAHEIGHL
LGIEHSNVSSALMYPYYTGIKRQLDNDDCLAVWDLYGYPFSISGPSSVCDQATYTVENLLSGATVQWSVSNPNIATINSS
NGVLTCRGNGICEVRATINNSSVALTPLKICLGTPISQDITLTVESLNSNGTLCTDNPNAIMADHPGGNHLGYIREYEWR
ISNGWQIAHHPGDNGIYADHFIVTVIPLSPLPGSPTVSVRARSECGWGTWKEVQIPAVSCSRTISPFTLSPNPATDEVIL
QLMETDEVSGLSVLSTDRSAYEIQIWSGMRMLRSFRTNEPTFQISMTGLPAGLYFVRVVKNGQTYTQKLIKK
>Q11PP7 1.14.13.9~~~kmo~~~Kynurenine 3-monooxygenase~~~COG0654
MKEQITICGAGLVGSLLAVYLIERGFSVRVFEKRKDPRKNEADAGRSINLAISHRGIHALKDAQTGLEKEALKLAVPMYG
RAIHDLHGHVSFQAYGEASQHINSIGRGALNKLLITTAENLGVHFLFEHTCTDYHAAGEQWLFSDITGNTVATQSKEIVI
GADGAFSIVRSFLSKQQQPQPQIETLEYGYKELEIASAHTETITNNQALHIWPRERFMLIALPNEDGSYTATLFLPLKGE
ISFEALQSDQDIQLFFKKYFPDTENLFPDLTEQFYRHPTSKLFTIHSSNWFNAHTLLIGDAAHALVPFYGQGMNAGFEDC
RILAEIIDGKSKTNWSEIFAEFYNQRKENADAISDLALQNFIEMRDHVADASFLLRKKIEKHLHQELEDAFIPQYTMVSF
TDISYKEAMETGLLHQKILDEIMAIPDIEAAWPTEELKNKVITVTKKYI
>Q84HF5 1.14.13.9~~~kmo~~~Kynurenine 3-monooxygenase~~~
MTATDNARQVTIIGAGLAGTLVARLLARNGWQVNLFERRPDPRIETGARGRSINLALAERGAHALRLAGLEREVLAEAVM
MRGRMVHVPGTPPNLQPYGRDDSEVIWSINRDRLNRILLDGAEAAGASIHFNLGLDSVDFARQRLTLSNVSGERLEKRFH
LLIGADGCNSAVRQAMASVVDLGEHLETQPHGYKELQITPEASAQFNLEPNALHIWPHGDYMCIALPNLDRSFTVTLFLH
HQSPAAQPASPCFAQLVDGHAARRFFQRQFPDLSPMLDSLEQDFEHHPTGKLATLRLTTWHVGGQAVLLGDAAHPMVPFH
GQGMNCALEDAVALAEHLQSAADNASALAAFTAQRQPDALAIQAMALENYVEMSSKVASPTYLLERELGQIMAQRQPTRF
IPRYSMVTFSRLPYAQAMARGQIQEQLLKFAVANHSDLTSINLDAVEHEVTRCLPPLSHLC
>O53838 ~~~kmtR~~~HTH-type transcriptional regulator KmtR~~~COG0640
MYADSGPDPLPDDQVCLVVEVFRMLADATRVQVLWSLADREMSVNELAEQVGKPAPSVSQHLAKLRMARLVRTRRDGTTI
FYRLENEHVRQLVIDAVFNAEHAGPGIPRHHRAAGGLQSVAKASATKDVG
>P77154 2.4.1.230~~~ycjT~~~Kojibiose phosphorylase~~~COG1554
MTRPVTLSEPHFSQHTLNKYASLMAQGNGYLGLRASHEEDYTRQTRGMYLAGLYHRAGKGEINELVNLPDVVGMEIAING
EVFSLSHEAWQRELDFASGELRRNVVWRTSNGSGYTIASRRFVSADQLPLIALEITITPLDADASVLISTGIDATQTNHG
RQHLDETQVRVFGQHLMQGSYTTQDGRSDVAISCCCKVSGDVQQCYTAKERRLLQHTSAQLHAGETMTLQKLVWIDWRDD
RQAALDEWGSASLRQLEMCAQQSYDQLLAASTENWRQWWQKRRITVNGGEAHDQQALDYALYHLRIMTPAHDERSSIAAK
GLTGEGYKGHVFWDTEVFLLPFHLFSDPTVARSLLRYRWHNLPGAQEKARRNGWQGALFPWESARSGEEETPEFAAINIR
TGLRQKVASAQAEHHLVADIAWAVIQYWQTTGDESFIAHEGMALLLETAKFWISRAVRVNDRLEIHDVIGPDEYTEHVNN
NAYTSYMARYNVQQALNIARQFGCSDDAFIHRAEMFLKELWMPEIQPDGVLPQDDSFMAKPAINLAKYKAAAGKQTILLD
YSRAEVNEMQILKQADVVMLNYMLPEQFSAASCLANLQFYEPRTIHDSSLSKAIHGIVAARCGLLTQSYQFWREGTEIDL
GADPHSCDDGIHAAATGAIWLGAIQGFAGVSVRDGELHLNPALPEQWQQLSFPLFWQGCELQVTLDAQRIAIRTSAPVSL
RLNGQLITVAEESVFCLGDFILPFNGTATKHQEDE
>Q8L163 2.4.1.230~~~kojP~~~Kojibiose phosphorylase~~~
MVKHMFLEDVNNLISDDKWLIFQNEYNTEVNPRYETLFTLTNGYMGVRGTFEEGSEGERSGNFIAGIFDKSDAQVREIVN
AQNWLRIKLYVEGEELSLDKCQLIEFKRILDMKKGILFRSMLIKDSKDRITRIEGYRFISRSDLHRSAIKLFVTPVNYSG
VVGIESIIDGTVLNSADSPKHRVKHLKVADNSSLNKSGVYLETATIDDDIRIATGSAVRLYHYEDKEKNNIAKFKRFLPL
GEMSIEYFEFDGTENKTVVIDKFIITYTSRDVKKGLLKSTVEKELFAFAGEGIDKELQRHIEVYEELWSVADINIEGDEE
ADKALRFNIFHLMSSVNENDPMVSIAAKALHGEGYKGHVFWDTEIFMLPFFIYVHPKAAKTLLMYRYNMLDAARKNAALN
GYKGAQYPWESADTGEEETPKWGFDYMGNPVRIWTGDLEHHITADIAFAVWEYFRATEDIEFMLNYGAEVIFETARFWVS
RCEYVKELDRYEINNVIGPDEFHEHVDNNAYTDYLAKWNIKKGLELINMLKEKYPEHYHAISNKKCLTNEEMEKWKEVEE
KIYIPYDKDKKLIEQFEGYFDKKDYVIDKFDENNMPIWPEGVDITKLGDTQLIKQADVVMLMLLLGEEFDEETKRINYEY
YEKRTMHKSSLGPSMYAIMGLKVGDHKNAYQSFMRSANVDLVDNQGNTKEGLHAASAGGTWQVVVFGFGGMEIDKEGALN
INSWLPEKWDKLSYKVFWKGNLIEVIVTKQEVTVKKLKGKGNIKVKVKGKELTIE
>P03052 ~~~trfB~~~TrfB transcriptional repressor protein~~~
MKKRLTESQFQEAIQGLEVGQQTIEIARGVLVDGKPQATFATSLGLTRGAVSQAVHRVWAAFEDKNLPEGYARVTAVLPE
HQAYIVRKWEADAKKKQETKR
>O53182 1.2.7.3~~~korA~~~2-oxoglutarate oxidoreductase subunit KorA~~~COG0674
MDPNGSGAGPESHDAAFHAAPDRQRLENVVIRFAGDSGDGMQLTGDRFTSEAALFGNDLATQPNYPAEIRAPAGTLPGVS
SFQIQIADYDILTAGDRPDVLVAMNPAALKANIGDLPLGGMVIVNSDEFTKRNLTKVGYVTNPLESGELSDYVVHTVAMT
TLTLGAVEAIGASKKDGQRAKNMFALGLLSWMYGRELEHSEAFIREKFARKPEIAEANVLALKAGWNYGETTEAFGTTYE
IPPATLPPGEYRQISGNTALAYGIVVAGQLAGLPVVLGSYPITPASDILHELSKHKNFNVVTFQAEDEIGGICAALGAAY
GGALGVTSTSGPGISLKSEALGLGVMTELPLLVIDVQRGGPSTGLPTKTEQADLLQALYGRNGESPVAVLAPRSPADCFE
TALEAVRIAVSYHTPVILLSDGAIANGSEPWRIPDVNALPPIKHTFAKPGEPFQPYARDRETLARQFAIPGTPGLEHRIG
GLEAANGSGDISYEPTNHDLMVRLRQAKIDGIHVPDLEVDDPTGDAELLLIGWGSSYGPIGEACRRARRRGTKVAHAHLR
YLNPFPANLGEVLRRYPKVVAPELNLGQLAQVLRGKYLVDVQSVTKVKGVSFLADEIGRFIRAALAGRLAELEQDKTLVA
RLSAATAGAGANG
>P07674 ~~~korB~~~Transcriptional repressor protein KorB~~~
MTAAQAKTTKKNTAAAAQEAAGAAQPSGLGLDSIGDLSSLLDAPAASQGGSGPIELDLDLIDEDPHQPRTADNPGFSPES
IAEIGATIKERGVKSPISVRENQEQPGRYIINHGARRYRGSKWAGKKSIPAFIDNDYNEADQVIENLQRNELTPREIADF
IGRELAKGKKKGDIAKEIGKSPAFITQHVTLLDLPEKIADAFNTGRVRDVTVVNELVTAFKKRPEEVEAWLDDDTQEITR
GTVKLLREFLDEKGRDPNTVDAFNGQTDAERDAEAGDGQDGEDGDQDGKDAKEKGAKEPDPDKLKKAIVQVEHDERPARL
ILNRRPPAEGYAWLKYEDDGQEFEANLADVKLVALIEG
>O53181 1.2.7.3~~~korB~~~2-oxoglutarate oxidoreductase subunit KorB~~~COG1013
MTRSGDEAQLMTGVTGDLAGTELGLTPSLTKNAGVPTTDQPQKGKDFTSDQEVRWCPGCGDYVILNTIRNFLPELGLRRE
NIVFISGIGCSSRFPYYLETYGFHSIHGRAPAIATGLALAREDLSVWVVTGDGDALSIGGNHLIHALRRNINVTILLFNN
RIYGLTKGQYSPTSEVGKVTKSTPMGSLDHPFNPVSLALGAEATFVGRALDSDRNGLTEVLRAAAQHRGAALVEILQDCP
IFNDGSFDALRKEGAEERVIKVRHGEPIVFGANGEYCVVKSGFGLEVAKTADVAIDEIIVHDAQVDDPAYAFALSRLSDQ
NLDHTVLGIFRHISRPTYDDAARSQVVAARNAAPSGTAALQSLLHGRDTWTVD
>A9AWD6 3.1.7.12~~~~~~(+)-kolavelool synthase~~~
MRPTPTLAEFLHAPLTTIRQVAPATMVFSSGGSRRKAALANMSAAGEEYARWSHQQLLKCLELFFSHGIKHLFLPMLLPN
QFQETTPNYREHIEQWVAWGAASQTMLEYYQEHNWRVRLLDTQYSPILADAAQRLQQPYDHPDQPTLWWFVVRDSEDPWQ
IIFQAAQKTVFKTRSQAIEAIYGEPIPPAELFVSFGKPQVNHDLLPPLLVGELQCYWTQKPGYTLSEEEFRQILYDFAFL
RKTWQVDKTERTQAALAFRQHWERGPILGLGQQLGPFWYPQSTSIESEL
>P12033 2.7.1.19~~~prkA~~~Phosphoribulokinase 1~~~
MSKKHPIISVTGSSGAGTSTVKHTFDQIFRREGVKAVSIEGDAFHRFNRADMKAELDRRYAAGDATFSHFSYEANELKEL
ERVFREYGETGQGRTRTYVHDDAEAARTGVAPGNFTDWRDFDSDSHLLFYEGLHGAVVNSEVNIAGLADLKIGVVPVINL
EWIQKIHRDRATRGYTTEAVTDVILRRMHAYVHCIVPQFSQTDINFQRVPVVDTSNPFIARWIPTADESVVVIRFRNPRG
IDFPYLTSMIHGSWMSRANSIVVPGNKLDLAMQLILTPLIDRVVRESKVA
>P19924 2.7.1.19~~~cfxP~~~Phosphoribulokinase, plasmid~~~COG3954
MSERYPIIAITGSSGAGTTSVTRTFENIFCREGVKSVVIEGDSFHRYDRAEMKVKMAEAERTGNMNFSHFGAENNLFGDL
ESLFRSYAESGTGMRRRYLHSTEEAAPFGQQPGTFTAWEPLPADTDLLFYEGLHGGVVTDEVNVAQYPNLLIGVVPVINL
EWIQKLWRDKKQRGYSTEAVTDTILRRMPDYVNYICPQFSRTHVNFQRVPCVDTSNPFISREIPAPDESMVVIRFANPKG
IDFQYLLSMIHDSFMSRANTIVVPGGKMELAMQLIFTPFVLRMMERRKRAAL
>P19923 2.7.1.19~~~cfxP~~~Phosphoribulokinase, chromosomal~~~COG3954
MSERYPIIAITGSSGAGTTSVTRTFENIFRREGVKSVVIEGDSFHRYDRAEMKVKMAEAERTGNMNFSHFGEENNLFGEL
ENLFRSYAETGTGMHRHYLHSPEEAAPFGQEPGTFTQWEPLPADTDLLFYEGLHGGVVTDSVNVAQYPNLLIGVVPVINL
EWIQKLWRDKKQRGYSTEAVTDTILRRMPDYVNYICPQFSRTHVNFQRVPCVDTSNPFISREIPAPDESMVVIRFANPKG
IDFQYLLSMIHDSFMSRANTIVVPGGKMELAMQLIFTPFVLRMMERRKRAAQ
>P37101 2.7.1.19~~~prk~~~Phosphoribulokinase~~~COG0572
MTTQLDRVVLIGVAGDSGCGKSTFLRRLTDLFGEEFMTVICLDDYHSLDRQGRKAAGVTALDPRANNFDLMYEQIKTLKS
GQSIMKPIYNHETGLLDPPEKVEPNKVVVIEGLHPLYDERVRELVDFGVYLDISEEVKINWKIQRDMAERGHTYEDILAS
INARKPDFTAYIEPQKQYADVVIQVLPTRLIEDKESKLLRVRLVQKEGVKFFEPAYLFDEGSTIDWRPCGRKLTCTYPGI
KMYYGPDNFMGNEVSLLEVDGRFENLEEMVYVENHLSKTGTKYYGEMTELLLKHKDYPGTDNGTGLFQVLVGLKMRKVYE
QLTAEAKVPASV
>Q5XC85 2.7.6.1~~~prs2~~~Putative ribose-phosphate pyrophosphokinase 2~~~
MTERYADKQIKLFSLTSNLPIAEKIAKAAGIPLGKMSSRQFSDGEIMINIEETVRGDDIYIIQSTSFPVNDNLWELLIMI
DACKRASANTVNIVLPYFGYSRQDRVAKPREPITAKLVANMLTKAGIDRVVTLDLHAVQVQGFFDIPVDNLFTVPLFAER
YSKLGLSGSDVVVVSPKNSGIKRARSLAEYLDSPIAIIDYAQDDSEREQGYIIGDVSGKKAILIDDILNTGKTFAEAAKI
LERSGATDTYAVASHGLFAGGAAEVLETAPIKEIIVTDSVKTKNRVPENVTYLSASDLIAEAIIRIHERRPLSPLFSYQP
KGKNNA
>P14193 2.7.6.1~~~prs~~~Ribose-phosphate pyrophosphokinase~~~COG0462
MSNQYGDKNLKIFSLNSNPELAKEIADIVGVQLGKCSVTRFSDGEVQINIEESIRGCDCYIIQSTSDPVNEHIMELLIMV
DALKRASAKTINIVIPYYGYARQDRKARSREPITAKLFANLLETAGATRVIALDLHAPQIQGFFDIPIDHLMGVPILGEY
FEGKNLEDIVIVSPDHGGVTRARKLADRLKAPIAIIDKRRPRPNVAEVMNIVGNIEGKTAILIDDIIDTAGTITLAANAL
VENGAKEVYACCTHPVLSGPAVERINNSTIKELVVTNSIKLPEEKKIERFKQLSVGPLLAEAIIRVHEQQSVSYLFS
>Q63XL8 2.7.6.1~~~prs~~~Ribose-phosphate pyrophosphokinase~~~COG0462
MSSHDGLMVFTGNANPALAQEVVKILGIPLGKAMVSRFSDGEIQVEIQENVRGKDVFVLQSTCAPTNDNLMELMIMVDAL
KRASAGRITAAIPYFGYARQDRRPRSARVAISAKVVANMLEIAGVERIITMDLHADQIQGFFDIPVDNIYATPILLGDLR
KQNYPDLLVVSPDVGGVVRARALAKQLNCDLAIIDKRRPKANVAEVMNIIGEVEGRTCVIMDDMVDTAGTLCKAAQVLKE
RGAKQVFAYATHPVLSGGAADRIAASALDELVVTDTIPLSAESLACPKIRALSSAGLLAETFSRIRRGDSVMSLFAES
>P0A717 2.7.6.1~~~prs~~~Ribose-phosphate pyrophosphokinase~~~COG0462
MPDMKLFAGNATPELAQRIANRLYTSLGDAAVGRFSDGEVSVQINENVRGGDIFIIQSTCAPTNDNLMELVVMVDALRRA
SAGRITAVIPYFGYARQDRRVRSARVPITAKVVADFLSSVGVDRVLTVDLHAEQIQGFFDVPVDNVFGSPILLEDMLQLN
LDNPIVVSPDIGGVVRARAIAKLLNDTDMAIIDKRRPRANVSQVMHIIGDVAGRDCVLVDDMIDTGGTLCKAAEALKERG
AKRVFAYATHPIFSGNAANNLRNSVIDEVVVCDTIPLSDEIKSLPNVRTLTLSGMLAEAIRRISNEESISAMFEH
>P9WKE3 2.7.6.1~~~prs~~~Ribose-phosphate pyrophosphokinase~~~COG0462
MSHDWTDNRKNLMLFAGRAHPELAEQVAKELDVHVTSQDAREFANGEIFVRFHESVRGCDAFVLQSCPAPVNRWLMEQLI
MIDALKRGSAKRITAVMPFYPYARQDKKHRGREPISARLIADLLKTAGADRIVTVDLHTDQIQGFFDGPVDHMRGQNLLT
GYIRDNYPDGNMVVVSPDSGRVRIAEKWADALGGVPLAFIHKTRDPRVPNQVVSNRVVGDVAGRTCVLIDDMIDTGGTIA
GAVALLHNDGAGDVIIAATHGVLSDPAAQRLASCGAREVIVTNTLPIGEDKRFPQLTVLSIAPLLASTIRAVFENGSVTG
LFDGDA
>P65235 2.7.6.1~~~prs~~~Ribose-phosphate pyrophosphokinase~~~
MAAYDSLMVFTGNANPELAQRVVRHLDISLGNASVSKFSDGEVAVELLENVRGRDVFILQPTCAPTNDNLMEILTMADAL
KRASAGRITTAIPYFGYARQDRRPRSVRVPISAKLVANMLYSAGIDRVLTVDLHADQIQGFFDIPVDNIYATPILLNDIK
QQRIENLTVVSPDIGGVVRARAVAKSLNADLAIIDKRRPKANVAEVMNIIGDIQGRTCLIVDDMIDTANTLCKAAVALKE
RGAERVLAYASHAVFSGEAVSRIASSEIDQVVVTDTIPLSEAAKNCDRIRQVTIAGLLAETVRRISNEESVSYLFNEEVM
TGSMLLP
>P0A1V6 2.7.6.1~~~prs~~~Ribose-phosphate pyrophosphokinase~~~
MPDMKLFAGNATPELAQRIANRLYTSLGDAAVGRFSDGEVSVQINENVRGGDIFIIQSTCAPTNDNLMELVVMVDALRRA
SAGRITAVIPYFGYARQDRRVRSARVPITAKVVADFLSSVGVDRVLTVDLHAEQIQGFFDVPVDNVFGSPILLEDMLQLN
LDNPIVVSPDIGGVVRARAIAKLLNDTDMAIIDKRRPRANVSQVMHIIGDVAGRDCVLVDDMIDTGGTLCKAAEALKERG
AKRVFAYATHPIFSGNAANNLRNSVIDEVVVCDTIPLTDEIKALPNVRTLTLSGMLAEAIRRISNEESISAMFEH
>P65237 2.7.6.1~~~prs~~~Ribose-phosphate pyrophosphokinase~~~
MLNNEYKNSSLKIFSLKGNEALAQEVADQVGIELGKCSVKRFSDGEIQINIEESIRGCDVFIIQPTSYPVNLHLMELLIM
IDACKRASAATINIVVPYYGYARQDRKARSREPITAKLVANLIETAGATRMIALDLHAPQIQGFFDIPIDHLMGVPILAK
HFKDDPNINPEECVVVSPDHGGVTRARKLADILKTPIAIIDKRRPRPNVAEVMNIVGEIEGRTAIIIDDIIDTAGTITLA
AQALKDKGAKEVYACCTHPVLSGPAKERIENSAIKELIVTNSIHLDEDRKPSNTKELSVAGLIAQAIIRVYERESVSVLF
D
>Q03961 ~~~kpsD~~~Polysialic acid transport protein KpsD~~~
MKLFKSILLIAACHAAQASAAIDINADPNLTGAAPLTGILNGQQSDTQNMSGFDNTPPPSPPVVMSRMFGAQLFNGTSAD
SGATVGFNPDYILNPGDSIQVRLWGAFTFDGALQVDPKGNIFLPNVGPVKVAGVSNSQLNALVTSKVKEVYQSNVNVYAS
LLQAQPVKVYVTGFVRNPGLYGGVTSDSLLNYLIKAGGVDPERGSYVDIVVKRGNRVRSNVNLYDFLLNGKLGLSQFADG
DTIIVGPRQHTFSVQGDVFNSYDFEFRESSIPVTEALSWARPKPGATHITIMRKQGLQKRSEYYPISSAPGRMLQNGDTL
IVSTDRYAGTIQVRVEGAHSGEHAMVLPYGSTMRAVLEKVRPNSMSQMNAVQLYRPSVAQRQKEMLNLSLQKLEEASLSA
QSSTKEEASLRMQEAQLISRFVAKARTVVPKGEVILNESNIDSVLLEDGDVINIPEKTSLVMVHGEVLFPNAVSWQKGMT
TEDYIEKCGGLTQKSGNARIIVIRQNGAAVNAEDVDSLKPGDEIMVLPKYESKNIEVTRGISTILYQLAVGAKVILSL
>Q8FDQ2 5.3.1.13~~~kpsF~~~Arabinose 5-phosphate isomerase KpsF~~~COG0517
MSERHLPDDQSSTIDPYLITSVRQTLAEQSAALQNLSKQLDSGQYQRVLNLIMNCKGHVILSGMGKSGHVGRKISATLAS
TGTPSFFIHPAEAFHGDLGMITPYDLLILISASGETDEILKLVPSLKNFGNRIIAITNNGNSTLAKNADAVLELHMANET
CPNNLAPTTSTTLTMAIGDALAIAMIHQRKFMPNDFARYHPGGSLGRRLLTRVADVMQHDVPAVQLDASFKTVIQRITSG
CQGMVMVEDAEGGLAGIITDGDLRRFMEKEGSLTSATAAQMMTREPLTLPEDTMIIEAEEKMQKHRVSTLLVTNKANKVT
GLVRIFD
>P23889 ~~~kpsM~~~Polysialic acid transport protein KpsM~~~
MARSGFEVQKVTVEALFLREIRTRFGKFRLGYLWAILEPSAHLLILLGILGYVMHRTMPDISFPVFLLNGLIPFFIFSSI
SKRSIGAIEANQGLFNYRPVKPIDTIIARALLETLIYVAVYILLMLIVWMTGEYFEITNFLQLVLTWSLLIILSCGVGLI
FMVVGKTFPEMQKVLPILLKPLYFISCIMFPLHSIPKQYWSYLLWNPLVHVVELSREAVMPGYISEGVSLNYLAMFTLVT
LFIGLALYRTREEAMLTS
>P23888 ~~~kpsT~~~Polysialic acid transport ATP-binding protein KpsT~~~
MIKIENLTKSYRTPTGRHYVFKNLNIIFPKGYNIALIGQNGAGKSTLLRIIGGIDRPDSGNIITEHKISWPVGLAGGFQG
SLTGRENVKFVARLYAKRDELNERVDFVEEFSELGKYFDMPIKTYSSGMRSRLAFGLSMAFKFDYYLIDEITAVGDAKFK
KKCSDIFDKIREKSHLIMVSHSERALKEYCDVAIYLNKEGQGKFYKNVTEAIADYKKDL
>P42216 2.7.7.38~~~kpsU~~~3-deoxy-manno-octulosonate cytidylyltransferase~~~
MSKAVIVIPARYGSSRLPGKPLLDIVGKPMIQHVYERALQVAGVAEVWVATDDPRVEQAVQAFGGKAIMTRNDHESGTDR
LVEVMHKVEADIYINLQGDEPMIRPRDVETLLQGMRDDPALPVATLCHAISAAEAAEPSTVKVVVNTRQDALYFSRSPIP
YPRNAEKARYLKHVGIYAYRRDVLQNYSQLPESMPEQAESLEQLRLMNAGINIRTFEVAATGPGVDTPACLEKVRALMAQ
ELAENA
>A3DJX6 2.7.1.-~~~kptA~~~Probable RNA 2'-phosphotransferase~~~COG1859
MVLIDYSKLSKEVAYALRHAPWEYGLELDAEGWVDINQLLSSLHECEKWKKVSEHDLHVMIEKSDKKRYEISNGKIRALY
GHSIPQRIIKEQKCPPEVLYHGTARRFVKSIKEKGLQPQGRQYVHLSADVETALQVGKRRDIKPVLLIVNALEAWSEGIK
FYLGNDKVWLADAIPSKYIRFE
>P70789 2.7.1.40~~~ttuE~~~Pyruvate kinase~~~
MFIRSNRRAKIVATVGPASSSPAILRSLFLAGVDTFRLNFSHGSRDDHAAAYRHIRALEKELGTSIGILQDLQGPKIRIG
VLHEGRLQLTKDAEIRFVCGTEPGRGLMDIPLPHREIFAAVKPGDDLLIDDGRVRVRALGVSDEFIDAKVIVAGPISNRK
GVNLPGTVLDISPLTPKDRKDLEFGLELGVDWIALSFVQTARDMIEARSLVSDRAGLIAKIEKPSALDEIDDIVALSDAI
MVARGDLGVEIPPEDVPGRQKELIRACRIAAKPVIVATQMLDSMVTSPTPTRAEASDVAGAIYDGADAVMLSAESATGAF
PVETVEIMSRIIEKTEKHKFYRPILEATEPQIAHTPPHAVATAAADVALALKAPVIVAFTVSGTTASRISRARPPLPILA
LTPSEQTARQLGLMWGVVSLLSPTVDTYEQSVDRATQAAVQTGLAEKSDQIVVVTGFPFATAGSTNNLRVTQAG
>P0AD61 2.7.1.40~~~pykF~~~Pyruvate kinase I~~~COG0469
MKKTKIVCTIGPKTESEEMLAKMLDAGMNVMRLNFSHGDYAEHGQRIQNLRNVMSKTGKTAAILLDTKGPEIRTMKLEGG
NDVSLKAGQTFTFTTDKSVIGNSEMVAVTYEGFTTDLSVGNTVLVDDGLIGMEVTAIEGNKVICKVLNNGDLGENKGVNL
PGVSIALPALAEKDKQDLIFGCEQGVDFVAASFIRKRSDVIEIREHLKAHGGENIHIISKIENQEGLNNFDEILEASDGI
MVARGDLGVEIPVEEVIFAQKMMIEKCIRARKVVITATQMLDSMIKNPRPTRAEAGDVANAILDGTDAVMLSGESAKGKY
PLEAVSIMATICERTDRVMNSRLEFNNDNRKLRITEAVCRGAVETAEKLDAPLIVVATQGGKSARAVRKYFPDATILALT
TNEKTAHQLVLSKGVVPQLVKEITSTDDFYRLGKELALQSGLAHKGDVVVMVSGALVPSGTTNTASVHVL
>P77983 2.7.1.40~~~pykF~~~Pyruvate kinase I~~~
MKKTKIVCTIGPKTESEEMLSKMLDAGMNVMRLNFSHGDYAEHGQRIQNLRNVMSKTGKKAAILLDTKGPEIRTIKLEGG
NDVSLKAGQTFTFTTDKSVVGNNEIVAVTYEGFTSDLSVGNTVLVDDGLIGMEVTAIEGNKVICKVLNNGDLGENKGVNL
PGVSIALPALAEKDKQDLIFGCEQGVDFVAASFIRKRSDVVEIREHLKAHGGENIQIISKIENQEGLNNFDEILEASDGI
MVARGDLGVEIPVEEVIFAQKMMIEKCIRARKVVITATQMLDSMIKNPRPTRAEAGDVANAILDGTDAVMLSGESAKGKY
PLEAVSIMATICERTDRVMNSRLDYNNDSRKLRITEAVCRGAVETAEKLEAPLIVVATQGGKSARAVRKYFPDATILALT
TNEVTARQLVLSKGVVSQLVKEINSTDDFYRLGKDVALQSGLAQKGDVVVMVSGALVPSGTTNTASVHVL
>Q44473 2.7.1.40~~~ttuE~~~Pyruvate kinase~~~
MFIRNNRRSKIVATVGPASSSPDMLRSLFLAGVDTFRLNFSHGARADHAEVYRNIRALEQEHDAAIAVLQDLQGPKIRIG
VLAHGRLDLARGSTIGFILGREGGEGMNDIPLPHREIFEVAVPGMDLLIDDGRIKVRIMEVMDGRLVCEVLNGGALSNRK
GVNVPGAVLDISPLTAKDREDLEFGLELGVDWVALSFVQRARDMIEARSLVGDRAGLIAKIEKPSALDDIEDIVRLSDSV
MVARGDLGVEIPPEDVPGKQKEIIRACRLAAKPVIVATQMLDSMVSSPTPTRAEASDVAGAIYDGADAVMLSAETATGAY
PVEAVEIMNRIIEKTEKHKHYRPILEATEPDVAQSPPHAVATAAANVAVALGSPVVVAYTSSGTTAARISRARPALPILA
LTPSEQVARRLNMFWGVVGVRSQDVHTYEASLIHAQQAVQEAKLASPSDHIVIVAGFPFAQQGSTNNLRVVQIAATDNLE
IA
>P21599 2.7.1.40~~~pykA~~~Pyruvate kinase II~~~COG0469
MSRRLRRTKIVTTLGPATDRDNNLEKVIAAGANVVRMNFSHGSPEDHKMRADKVREIAAKLGRHVAILGDLQGPKIRVST
FKEGKVFLNIGDKFLLDANLGKGEGDKEKVGIDYKGLPADVVPGDILLLDDGRVQLKVLEVQGMKVFTEVTVGGPLSNNK
GINKLGGGLSAEALTEKDKADIKTAALIGVDYLAVSFPRCGEDLNYARRLARDAGCDAKIVAKVERAEAVCSQDAMDDII
LASDVVMVARGDLGVEIGDPELVGIQKALIRRARQLNRAVITATQMMESMITNPMPTRAEVMDVANAVLDGTDAVMLSAE
TAAGQYPSETVAAMARVCLGAEKIPSINVSKHRLDVQFDNVEEAIAMSAMYAANHLKGVTAIITMTESGRTALMTSRISS
GLPIFAMSRHERTLNLTALYRGVTPVHFDSANDGVAAASEAVNLLRDKGYLMSGDLVIVTQGDVMSTVGSTNTTRILTVE
>Q8ZNW0 2.7.1.40~~~pykA~~~Pyruvate kinase II~~~
MSRRLRRTKIVTTLGPATDRDNNLEKVIAAGANVVRMNFSHGSPEDHKMRADKVREIAAKLGRHVAILGDLQGPKIRVST
FKEGKVFLNIGDKFLLDANLGKGEGDKEKVGIDYKGLPADVVPGDILLLDDGRVQLKVLEVQGMKVFTEVTVGGPLSNNK
GINKLGGGLSAEALTEKDKADIQTAALIGVDYLAVSFPRCGEDLNYARRLARDAGCDAKIVAKVERAEAVCDQNAMDDII
LASDVVMVARGDLGVEIGDPELVGIQKALIRRARQLNRAVITATQMMESMITNPMPTRAEVMDVANAVLDGTDAVMLSAE
TAAGQYPSETVAAMARVCLGAEKIPSINVSKHRLDVQFDNVEEAIAMSAMYAANHLKGVTAIITMTESGRTALMTSRISS
GLPIFAMSRHERTLNLTALYRGVTPVHFDSAADGVVAAHEAVNLLRDKGYLVSGDLVIVTQGDVMSTVGSTNTTRILTVE
>P80885 2.7.1.40~~~pyk~~~Pyruvate kinase~~~COG0469
MRKTKIVCTIGPASESIEMLTKLMESGMNVARLNFSHGDFEEHGARIKNIREASKKLGKNVGILLDTKGPEIRTHTMENG
GIELETGKELIISMDEVVGTTDKISVTYEGLVHDVEQGSTILLDDGLIGLEVLDVDAAKREIKTKVLNNGTLKNKKGVNV
PGVSVNLPGITEKDARDIVFGIEQGVDFIAPSFIRRSTDVLEIRELLEEHNAQDIQIIPKIENQEGVDNIDAILEVSDGL
MVARGDLGVEIPAEEVPLVQKELIKKCNALGKPVITATQMLDSMQRNPRPTRAEASDVANAIFDGTDAIMLSGETAAGSY
PVEAVQTMHNIASRSEEALNYKEILSKRRDQVGMTITDAIGQSVAHTAINLNAAAIVTPTESGHTARMIAKYRPQAPIVA
VTVNDSISRKLALVSGVFAESGQNASSTDEMLEDAVQKSLNSGIVKHGDLIVITAGTVGESGTTNLMKVHTVGDIIAKGQ
GIGRKSAYGPVVVAQNAKEAEQKMTDGAVLVTKSTDRDMIASLEKASALITEEGGLTSHAAVVGLSLGIPVIVGLENATS
ILTDGQDITVDASRGAVYQGRASVL
>Q46078 2.7.1.40~~~pyk~~~Pyruvate kinase~~~COG0469
MDRRTKIVCTLGPAVASADGILRLVEDGMDVARLNFSHGDHPDHEQNYKWVREAAEKTGRAVGILADLQGPKIRLGRFTD
GATVWENGETIRITVDDVEGTHDRVSTTYKNLAKDAKPGDRLLVDDGKVGLVCVSVEGNDVICEVVEGGPVSNNKGVSLP
GMDISVPALSEKDIRDLRFALKLGVDFIALSFVRSPADAELVHKIMDEEGRRVPVIAKLEKPEAVTSLEPIVLAFDAVMV
ARGDLGVEVPLEEVPLVQKRAIQIARENAKPVIVATQMLDSMIENSRPTRAEASDVANAVLDGADAVMLSGETSVGKDPH
NVVRTMSRIVRFAETDGRVPDLTHIPRTKRGVISYSARDIAERLNARALVAFTTSGDTAKRVARLHSHLPLLVFTPNEAV
RSELALTWGATTFLCPPVSDTDDMMREVDRALLAMPEYNKGDMMVVVAGSPPGVTGNTNMIHVHLLGDDTRIAKL
>Q02499 2.7.1.40~~~pyk~~~Pyruvate kinase~~~
MKRKTKIVCTIGPASESVDKLVQLMEAGMNVARLNFSHGDHEEHGRRIANIREAAKRTGRTVAILLDTKGPEIRTHNMEN
GAIELKEGSKLVISMSEVLGTPEKISVTYPSLIDDVSVGAKILLDDGLISLEVNAVDKQAGEIVTTVLNGGVLKNKKGVN
VPGVKVNLPGITEKDRADILFGIRQGIDFIAASFVRRASDVLEIRELLEAHDALHIQIIAKIENEEGVANIDEILEAADG
LMVARGDLGVEIPAEEVPLIQKLLIKKCNMLGKPVITATQMLDSMQRNPRPTRAEASDVANAIFDGTDAVMLSGETAAGQ
YPVEAVKTMHQIALRTEQALEHRDILSQRTKESQTTITDAIGQSVAHTALNLDVAAIVTPTVSGKTPQMVAKYRPKAPII
AVTSNEAVSRRLALVWGVYTKEAPHVNTTDEMLDVAVDAAVRSGLVKHGDLVVITAGVPVGETGSTNLMKVHVISDLLAK
GQGIGRKSAFGKAVVAKTAEEARQKMVDGGILVTVSTDADMMPAIEKAAAIITEEGGLTSHAAVVGLSLGIPVIVGVENA
TTLFKDGQEITVDGGFGAVYRGHASVL
>P34038 2.7.1.40~~~pyk~~~Pyruvate kinase~~~
MKKTKIVSTLGPASDDIETITKLAEAGANVFRFNFSHGNHEEHLARMNMVREVEKKTGKLLGIALDTKGAEIRTTDQEGG
KFTINTGDEIRVSMDATKAGNKDMIHVTYPGLFDDTHVGGTVLIDDGAVGLTIKAKDEEKRELVCEAQNTGVIGSKKGVN
APGVEIRLPGITEKDTDDIRFGLKHGINFIFASFVRKAQDVLDIRALCEEANASYVKIFPKIESQEGIDNIDEILQVSDG
LMVARGDMGVEIPFINVPFVQKTLIKKCNALGKPVITATQMLDSMQENPRPTRAEVTDVANAVLDGTDATMLSGESANGL
YPVQSVQAMHDIDVRTEKELDTRNTLALQRFEEYKGSNVTEAIGESVVRTAQELGVKTIIAATSSGYTARMISKYRPDAT
IVALTFDEKIQHSLGIVWGVEPVLAKKPSNTDEMFEEAARVAKEHGFVKDGDLVIIVAGVPFGQSGTTNLMKLQIIGNQL
AQGLGVGTGSVIGKAVVANSAEEANAKVHEGDILVAKTTDKDYMPAIKKASGMIVEASGLTSHAAVVGVSLGIPVVVGVA
DATSKIADGSTLTVDARRGAIYQGEVSNL
>P78031 2.7.1.40~~~pyk~~~Pyruvate kinase~~~
MIHHLKRTKIIATCGPALTKKLWTLAMLDDPAYAAMKAEAYANIENIIKNGVTVIRLNFSHGNHEEQAVRIKIVRDVAKK
LNLPVSIMLDTNGPEIRVFETAPEGLKILKDSEVVINTTTKEVAKNNQFSVSDASGTYNMVNDVKVGQKILVDDGKLSLV
VKRIDTKNNQVICVAQNDHTIFTKKRLNLPNADYSIPFLSAKDLRDIDFGLTHQIDYIAASFVNTTENIKQLRDYLASKN
AKHVKLIAKIESNHALNNIDGIIKASDGIMVARGDLGLEIPYYKVPYWQRYMIKACRFFNKRVITATQMLDSLEKNIQPT
RAEVTDVYFAVDRGNDATMLSGETANGAFPLNAVYVMKMIDKQSETFFDYQYNLNYYMANSKARHSEFWKQVVLPLAQKT
APKRKLINSDFKYDFVVHATNNLNEIYALSNARLAAAVIILTNDPQVYTGHGVDYGIFPYLIDQKPQSLSKAEFKSLANV
AIKHYQQHGEISQLKQCLGVFHNKIISL
>P9WKE5 2.7.1.40~~~pyk~~~Pyruvate kinase~~~COG0469
MTRRGKIVCTLGPATQRDDLVRALVEAGMDVARMNFSHGDYDDHKVAYERVRVASDATGRAVGVLADLQGPKIRLGRFAS
GATHWAEGETVRITVGACEGSHDRVSTTYKRLAQDAVAGDRVLVDDGKVALVVDAVEGDDVVCTVVEGGPVSDNKGISLP
GMNVTAPALSEKDIEDLTFALNLGVDMVALSFVRSPADVELVHEVMDRIGRRVPVIAKLEKPEAIDNLEAIVLAFDAVMV
ARGDLGVELPLEEVPLVQKRAIQMARENAKPVIVATQMLDSMIENSRPTRAEASDVANAVLDGADALMLSGETSVGKYPL
AAVRTMSRIICAVEENSTAAPPLTHIPRTKRGVISYAARDIGERLDAKALVAFTQSGDTVRRLARLHTPLPLLAFTAWPE
VRSQLAMTWGTETFIVPKMQSTDGMIRQVDKSLLELARYKRGDLVVIVAGAPPGTVGSTNLIHVHRIGEDDV
>Q7A559 2.7.1.40~~~pyk~~~Pyruvate kinase~~~
MRKTKIVCTIGPASESEEMIEKLINAGMNVARLNFSHGSHEEHKGRIDTIRKVAKRLDKIVAILLDTKGPEIRTHNMKDG
IIELERGNEVIVSMNEVEGTPEKFSVTYENLINDVQVGSYILLDDGLIELQVKDIDHAKKEVKCDILNSGELKNKKGVNL
PGVRVSLPGITEKDAEDIRFGIKENVDFIAASFVRRPSDVLEIREILEEQKANISVFPKIENQEGIDNIAEILEVSDGLM
VARGDMGVEIPPEKVPMVQKDLIRQCNKLGKPVITATQMLDSMQRNPRATRAEASDVANAIYDGTDAVMLSGETAAGLYP
EEAVKTMRNIAVSAEAAQDYKKLLSDRTKLVETSLVNAIGISVAHTALNLNVKAIVAATESGSTARTISKYRPHSDIIAV
TPSEETARQCSIVWGVQPVVKKGRKSTDALLNNAVATAVETGRVSNGDLIIITAGVPTGETGTTNMMKIHLVGDEIANGQ
GIGRGSVVGTTLVAETVKDLEGKDLSDKVIVTNSIDETFVPYVEKALGLITEENGITSPSAIVGLEKGIPTVVGVEKAVK
NISNNMLVTIDAAQGKIFEGYANVL
>Q6GG09 2.7.1.40~~~pyk~~~Pyruvate kinase~~~
MRKTKIVCTIGPASESEEMIEKLINAGMNVARLNFSHGSHEEHKGRIDTIRKVAKRLDKIVAILLDTKGPEIRTHNMKDG
IIELERGNEVIVSMNEVEGTPEKFSVTYENLINDVQVGSYILLDDGLIELQVKDIDHAKKEVKCDILNSGELKNKKGVNL
PGVRVSLPGITEKDAEDIRFGIKENVDFIAASFVRRPSDVLEIREILEEQKANISVFPKIENQEGIDNIEEILEVSDGLM
VARGDMGVEIPPEKVPMVQKDLIRQCNKLGKPVITATQMLDSMQRNPRATRAEASDVANAIYDGTDAVMLSGETAAGLYP
EEAVKTMRNIAVSAEAAQDYKKLLSDRTKLVETSLVNAIGISVAHTALNLNVKAIVAATESGSTARTISKYRPHSDIIAV
TPSEETARQCSIVWGVQPVVKKGRKSTDALLNNAVATAVETGRVTNGDLIIITAGVPTGETGTTNMMKIHLVGDEIANGQ
GIGRGSVVGTTLVAETVKDLEGKDLSDKVIVTNSIDETFVPYVEKALGLITEENGITSPSAIVGLEKGIPTVVGVEKAVK
NISNNVLVTIDAAQGKIFEGYANVL
>Q9WY51 2.7.1.40~~~pyk~~~Pyruvate kinase~~~COG0469
MRSTKIVCTVGPRTDSYEMIEKMIDLGVNVFRINTSHGDWNEQEQKILKIKDLREKKKKPVAILIDLAGPKIRTGYLEKE
FVELKEGQIFTLTTKEILGNEHIVSVNLSSLPKDVKKGDTILLSDGEIVLEVIETTDTEVKTVVKVGGKITHRRGVNVPT
ADLSVESITDRDREFIKLGTLHDVEFFALSFVRKPEDVLKAKEEIRKHGKEIPVISKIETKKALERLEEIIKVSDGIMVA
RGDLGVEIPIEEVPIVQKEIIKLSKYYSKPVIVATQILESMIENPFPTRAEVTDIANAIFDGADALLLTAETAVGKHPLE
AIKVLSKVAKEAEKKLEFFRTIEYDTSDISEAISHACWQLSESLNAKLIITPTISGSTAVRVSKYNVSQPIVALTPEEKT
YYRLSLVRKVIPVLAEKCSQELEFIEKGLKKVEEMGLAEKGDLVVLTSGVPGKVGTTNTIRVLKVD
>Q607C7 5.1.1.-~~~~~~L-Lys-D/L-Arg epimerase~~~COG4948
MKIADIQVRTEHFPLTRPYRIAFRSIEEIDNLIVEIRTADGLLGLGAASPERHVTGETLEACHAALDHDRLGWLMGRDIR
TLPRLCRELAERLPAAPAARAALDMALHDLVAQCLGLPLVEILGRAHDSLPTSVTIGIKPVEETLAEAREHLALGFRVLK
VKLCGDEEQDFERLRRLHETLAGRAVVRVDPNQSYDRDGLLRLDRLVQELGIEFIEQPFPAGRTDWLRALPKAIRRRIAA
DESLLGPADAFALAAPPAACGIFNIKLMKCGGLAPARRIATIAETAGIDLMWGCMDESRISIAAALHAALACPATRYLDL
DGSFDLARDVAEGGFILEDGRLRVTERPGLGLVYPD
>Q2J7L5 2.1.1.179~~~Krm~~~16S rRNA (guanine(1405)-N(7))-methyltransferase~~~
MAVRDGGGSAPVGSQEQAVDRVRETVARSRRYGAVAPETVRRLAERALVASRGDEPEAVKRTKRSLHEIYGAYLPERAPG
YPGLLRDIGAAVGGGDPDAVAAAVSRAMRVHASTRERLPYLREFYAAVFGAVPTPAVVQDLACGLNPLAFGSMGLPAQTT
YLASDIDSQQMEFLDRALDLLEVEHRVEVVDLVSGAVPAQHADVTLVLKTLPLLERQRAGAGWELVDALRSPFVVVSFPT
RSLGQRSKGMFQTYSAAFEAQAAERGWTFDQAEIANELIYIVRR
>P62584 ~~~~~~P fimbrial regulatory protein KS71A~~~
MSEYMKNEILEFLNRHNGGKTAEIAEALAVTDYQARYYLLLLEKAGMVQRSPLRRGMATYWFLKGEKQAGQSCSSTT
>Q9ZG90 3.2.1.103~~~~~~Keratan-sulfate endo-1,4-beta-galactosidase~~~
MRKTKFWLVLSLIATSLSIFACKKDSTATKNPIPEVSKAKASTKLLNATTVATTDYELIWSDEFNSSGGFDSTKWSYADR
GTVAWNKYMTSLPAYASQDGSNLVLRMDNAVVAGDPVAYHAGGVKSMGKFSMTYGKVEVRAKFTQGRGSWPAIWMMPEPA
TAYGGWPSCGEIDSMEHVNNESVMYHTIHNGSVTNANGGSTASKSATYNTTDYNLYTMIWSPNDIRFYVNNSLQYTYARV
SGGGTQQWPFDVPFYLILNQAGGAGWPGAITNADLPFSMQVDYVRVYKLPLFSNGDFESGVIYPWTTWGGGSSVVSTDAR
TGTKCIRETGGETSIEQYLTGLTPNTTYRFGGYAKVSAAGQSVSIGVKNYGGTAVDATIGTTSYSNNSVTFTTGANNTTA
TVYFYKPLSGTVYGDDFYLEKL
>F1CMX0 1.14.15.30~~~kshA~~~3-ketosteroid-9-alpha-monooxygenase, oxygenase component~~~
MSLGTSEQSEIREIVAGSAPARFARGWHCLGLAKDFKDGKPHSVHAFGTKLVVWADSNDEIRILDAYCRHMGGDLSQGTV
KGDEIACPFHDWRWGGNGRCKNIPYARRVPPIAKTRAWHTLDQDGLLFVWHDPQGNPPPADVTIPRIAGATSDEWTDWVW
YTTEVDTNCREIIDNIVDMAHFFYVHYSFPVYFKNVFEGHVASQFMRGQAREDTRPHANGQPKMIGSRSDASYFGPSFMI
DDLVYEYEGYDVESVLINCHYPVSQDKFVLMYGMIVKKSDRLEGEKALQTAQQFGNFIAKGFEQDIEIWRNKTRIDNPLL
CEEDGPVYQLRRWYEQFYVDVEDVAPEMTDRFEFEMDTTRPVAAWMKEVEANIARKAALDTETRSAPEQSTTAG
>F1CMX6 1.14.13.-~~~kshA~~~Probable 3-ketosteroid-9-alpha-monooxygenase, oxygenase component~~~
MGSTDTEDQVRTIDVGTPPERYARGWHCLGLVRDFADGKPHQVDAFGTSLVVFAGEDGKLNVLDAYCRHMGGNLAQGSVK
GNTIACPFHDWRWRGDGKCAEIPYARRVPPLARTRTWPVAEVSGQLFVWHDPQGSKPPAELAVPEVPTYGDPGWTDWVWN
SIEVTGSHCREIVDNVVDMAHFFYVHYGMPTYFRNVFEGHTATQVMRSLPRADAVGVSQATNYSAESRSDATYYGPSYMI
DKLWSAGRDPESTPNIYLINCHYPISPTSFRLQYGVMVERPEGVPPEQAEQIAQAVAQGVAIGFEQDVEIWKNKSRIDNP
LLCEEDGPVYQLRRWYEQFYVDVEDIRPEMVNRFEYEIDTTRALTSWQAEVDENVAAGRSAFAPNLTRAREAASAESGS
>F1CMX8 1.14.15.30~~~kshA~~~3-ketosteroid-9-alpha-monooxygenase, oxygenase component~~~
MAQIREIDVGEVRTRFARGWHCLGLSRTFKDGKPHAVEAFGTKLVVWADSNGEPKVLDAYCRHMGGDLSQGEIKGDSVAC
PFHDWRWGGNGKCTDIPYARRVPPLARTRSWITMEKHGQLFVWNDPEGNTPPPEVTIPEIEQYGSDEWTDWTWNQIRIEG
SNCREIIDNVVDMAHFFYIHYAFPTFFKNVFEGHIAEQYLNTRGRPDKGMATQYGLESTLESYAAYYGPSYMINPLKNNY
GGYQTESVLINCHYPITHDSFMLQYGIIVKKPQGMSPEQSDVLAAKLTEGVGEGFLQDVEIWKNKTKIENPLLCEEDGPV
YQLRRWYEQFYVDVADVTEKMTGRFEFEVDTAKANEAWEKEVAENLERKKREEEQGKQEAEV
>B6V6V5 1.14.15.30~~~kshA~~~3-ketosteroid-9-alpha-monooxygenase, oxygenase component~~~
MTVPQERIEIRNIDPGTNPTRFARGWHCIGLAKDFRDGKPHQVKVFGTDLVVFADTAGKLHVLDAFCRHMGGNLARGEIK
GDTIACPFHDWRWNGQGRCEAVPYARRTPKLGRTKAWTTMERNGVLFVWHCPQGSEPTPELAIPEIEGYEDGQWSDWTWT
TIHVEGSHCREIVDNVVDMAHFFYVHFQMPEYFKNVFDGHIAGQHMRSYGRDDIKTGVQMDLPEAQTISDAFYYGPSFML
DTIYTVSEGTTIESKLINCHYPVTNNSFVLQFGTIVKKIEGMSEEQAAEMATMFTDGLEEQFAQDIEIWKHKSRIENPLL
TEEDGPVYQLRRWYNQFYVDLEDVTPDMTQRFEFEVDTSRALESWHKEVEENLAGTAE
>F1CMY8 1.14.15.30~~~kshA~~~3-ketosteroid-9-alpha-monooxygenase, oxygenase component~~~
MSIDTARSGSDDDVEIREIQAAAAPTRFARGWHCLGLLRDFQDGKPHSIEAFGTKLVVFADSKGQLNVLDAYCRHMGGDL
SRGEVKGDSIACPFHDWRWNGKGKCTDIPYARRVPPIAKTRAWTTLERNGQLYVWNDPQGNPPPEDVTIPEIAGYGTDEW
TDWSWKSLRIKGSHCREIVDNVVDMAHFFYIHYSFPRYFKNVFEGHTATQYMHSTGREDVISGTNYDDPNAELRSEATYF
GPSYMIDWLESDANGQTIETILINCHYPVSNNEFVLQYGAIVKKLPGVSDEIAAGMAEQFAEGVQLGFEQDVEIWKNKAP
IDNPLLSEEDGPVYQLRRWYQQFYVDVEDITEDMTKRFEFEIDTTRAVASWQKEVAENLAKQAEGSTATP
>A0R4R3 1.14.15.30~~~kshA~~~3-ketosteroid-9-alpha-monooxygenase, oxygenase component~~~COG4638
MATETVGIREIDTGALPDRYARGWHCLGPVKNFSDGKPHSVNIFGTKLVVFADSKGELNVLDAYCRHMGGDLSKGTVKGD
EVACPFHDWRWGGDGKCKLVPYAKRTPRLARTRSWHTDVRGGLLFVWHDHEGNPPQPEVRIPEIPEWHSGEWTDWKWNSM
LIEGSNCREIIDNVTDMAHFFYIHFGLPTYFKNVFEGHIASQYLHNVGRPDVNDLGTAYGEAKLDSEASYFGPSFMINWL
HNTYGEFKAESILINCHYPVTQDSFVLQWGVIVEKPKGLDDATTEKLADAFTEGVSKGFLQDVEIWKHKTRIDNPLLVEE
DGAVYQMRRWYQQFYVDVADITPDMTDRFEMEVDTTAAVEKWNIEVQENLKAQAEAEKAEQSS
>P71875 1.14.15.30~~~kshA~~~3-ketosteroid-9-alpha-monooxygenase, oxygenase component~~~COG4638
MSTDTSGVGVREIDAGALPTRYARGWHCLGVAKDYLEGKPHGVEAFGTKLVVFADSHGDLKVLDGYCRHMGGDLSEGTVK
GDEVACPFHDWRWGGDGRCKLVPYARRTPRMARTRSWTTDVRSGLLFVWHDHEGNPPDPAVRIPEIPEAASDEWTDWRWN
RILIEGSNCRDIIDNVTDMAHFFYIHFGLPTYFKNVFEGHIASQYLHNVGRPDVDDLGTSYGEAHLDSEASYFGPSFMIN
WLHNRYGNYKSESILINCHYPVTQNSFVLQWGVIVEKPKGMSEEMTDKLSRVFTEGVSKGFLQDVEIWKHKTRIDNPLLV
EEDGAVYQLRRWYEQFYVDVADIKPEMVERFEIEVDTKRANEFWNAEVEKNLKSREVSDDVPAEQH
>P9WJ93 1.14.15.30~~~hmp~~~3-ketosteroid-9-alpha-monooxygenase, ferredoxin reductase component~~~COG1018
MTEAIGDEPLGDHVLELQIAEVVDETDEARSLVFAVPDGSDDPEIPPRRLRYAPGQFLTLRVPSERTGSVARCYSLCSSP
YTDDALAVTVKRTADGYASNWLCDHAQVGMRIHVLAPSGNFVPTTLDADFLLLAAGSGITPIMSICKSALAEGGGQVTLL
YANRDDRSVIFGDALRELAAKYPDRLTVLHWLESLQGLPSASALAKLVAPYTDRPVFICGPGPFMQAARDALAALKVPAQ
QVHIEVFKSLESDPFAAVKVDDSGDEAPATAVVELDGQTHTVSWPRTAKLLDVLLAAGLDAPFSCREGHCGACACTLRAG
KVNMGVNDVLEQQDLDEGLILACQSRPESDSVEVTYDE
>B6V6V6 1.14.15.30~~~kshB~~~3-ketosteroid-9-alpha-monooxygenase, ferredoxin reductase component~~~
MTTVEVPHGSRSVILTVSAVVEETADTRSIVFAVPDELRDKFAYRPGQFLTLRIPSDRTGSVARCYSLASSPFTDDAPKV
TVKRTSDGYGSNWLCDNIATGQTLEVLPPAGVFTPKSLDHDFLLFGAGSGITPVISILKSALTQGGGKVVLVYANRDEKS
VIFAEELRALAEKYPTRLTVVHWLESVQGLPTADQLAAIAAPYESYEAFMCGPGPFMDTVHQALNTVGMPRARVHAEVFN
SLSGDPFADQAPVEVSDEDAADAATVEVELDGEVHKLSWPRKQTLVDIMLAKGIDVPYSCQEGECGSCACTVLEGKVEME
NCDVLDPEDIEAGYILGCQARPVTDHLKIEF
>A0R4Z6 ~~~kstR2~~~HTH-type transcriptional repressor KstR2~~~COG1309
MAPDTPSQPASRRDELLQLAATMFADRGLKATTVRDIADSAGILSGSLYHHFKSKEQMVEEVLRDFLDWLFGRYQQILDT
ATSPLEKLTGLFMASFEAIEHRHAQVVIYQDEAKRLSDLPQFDFVETRNKEQRKMWVDILQEGVADGSFRPDLDVDLVYR
FIRDTTWVSVRWYKPGGPLSAEQVGQQYLAIVLGGITQSQGDKHA
>P9WMB9 ~~~kstR2~~~HTH-type transcriptional repressor KstR2~~~COG1309
MDRVAGQVNSRRGELLELAAAMFAERGLRATTVRDIADGAGILSGSLYHHFASKEEMVDELLRGFLDWLFARYRDIVDST
ANPLERLQGLFMASFEAIEHHHAQVVIYQDEAQRLASQPRFSYIEDRNKQQRKMWVDVLNQGIEEGYFRPDLDVDLVYRF
IRDTTWVSVRWYRPGGPLTAQQVGQQYLAIVLGGITKEGV
>Q0S7V2 ~~~kstR2~~~HTH-type transcriptional repressor KstR2~~~COG1309
MTPPPADDTSGKSGRRTELLDIAATLFAERGLRATTVRDIADAAGILSGSLYHHFDSKESMVDEILRGFLDDLFGKYREI
VASGLDSRATLEALVTTSYEAIDASHSAVAIYQDEVKHLVANERFTYLSELNTEFRELWMGVLEAGVKDGSFRSDIDVEL
AFRFLRDTAWVAVRWYRPGGSVTVDTVAKQYLSIVLDGLASPHN
>A0R528 ~~~kstR~~~HTH-type transcriptional repressor KstR~~~COG1309
MTNVAVLSESELGSEAQRERRKRILDATLAIASKGGYEAVQMRAVAERADVAVGTLYRYFPSKVHLLVSALGREFERIDA
KTDRAALAGGTPYQRLNFMVGKLNRAMQRNPLLTEAMTRAFVFADASAAGEVDHVGKLMDSMFARAMSDGEPTEDQYHIA
RVISDVWLSNLLAWLTRRASATDVSKRLDLAVRLLIGTEEQPKI
>P96856 ~~~kstR~~~HTH-type transcriptional repressor KstR~~~COG1309
MSSANTNTSSAPDAPPRAVMKVAVLAESELGSEAQRERRKRILDATMAIASKGGYEAVQMRAVADRADVAVGTLYRYFPS
KVHLLVSALGREFSRIDAKTDRSAVAGATPFQRLNFMVGKLNRAMQRNPLLTEAMTRAYVFADASAASEVDQVEKLIDSM
FARAMANGEPTEDQYHIARVISDVWLSNLLAWLTRRASATDVSKRLDLAVRLLIGDQDSA
>Q0S868 ~~~kstR~~~HTH-type transcriptional repressor KstR~~~COG1309
MTTSSRSRSSTVAAATLGEDDLSSNAQKERRKRILDATLALASKGGYEAVQMRAVAERADVAVGTLYRYFPSKVHLLVSA
LAREFERIDSRGKNPPGRNPLERMQLILSQITRAMQRDPLLTEAMTRAFMFADASAAAEVDQVGKLMDRLFARAMTDTEP
TEDQLAVARVISDVWLSNLVAWLTRRSSATDVANRLELTVELLLGDGSRRPE
>Q3XZZ9 2.7.1.-~~~~~~Probable ketoamine kinase HMPREF0351_12196~~~
MDIQTVLSDLKLNGKVIPVVGGDVNQTYRIKTEHRAYFLKIHPNVKKGFFEAEVDGLKELSAFVRVPDTYMLGETSEGAY
LLMEWIEPGKGDQRDLAAALANLHQQTAPQFGFRKDNYLGTLVQKNSFEEDWWTFFFKDRLESQISLAEETNRWNVQRQE
KYLRFKERVLKSVEPKKITPRLLHGDLWSGNVFFDQQGHPVFVDPAVSYGNREQDIAMSQLFGGFRPEFLDAYQTIFPLE
KGWKDRLPIYQLYYLLAHLNMFGESYGSQVDQLLENF
>F9UPU7 2.7.1.-~~~~~~Probable ketoamine kinase lp_1983~~~COG3001
MHLTKTWLAQLPLTDIQQVQPVSGGDINAAFQIITRHHQYFLKVQPHNDVTFFDHEVAGLRLLGAVTKTPRVIASGTIAT
DGYLLLDWLATGTGSQSALGAAVAKVHHQHHAQFGLDHDFTAGKLPKINHWQTDWATFYTQQRLDVLVNLAKEHHLWSET
REMHYHRLRQQLLQDSHMHTVKPSLLHGDLWSGNYLFDTTGTPVLIDPDVFYGDREMDLAMTTIFGGFDTDFYQAYQAAY
PVAPGMQDRLPSYQLYYLLAHLNLFGETYGPAVDRILMQY
>Q2FV31 2.7.1.-~~~~~~Probable ketoamine kinase SAOUHSC_02908~~~COG3001
MNEQWLEHLPLKDIKEISPVSGGDVNEAYRVETDTDTFFLLVQRGRKESFYAAEIAGLNEFERAGITAPRVIASGEVNGD
AYLVMTYLEEGASGSQRQLGQLVAQLHSQQQEEGKFGFSLPYEGGDISFDNHWQDDWCTIFVDKRLDHLKDELLNRGLWD
ANDIKVYDKVRRQIVAELEKHQSKPSLLHGDLWGGNYMFLQDGRPALFDPAPLYGDREFDIGITTVFGGFTSEFYDAYNK
HYPLAKGASYRLEFYRLYLLMVHLLKFGEMYRDSVAHSMDKILQDTTS
>Q5SJ35 2.7.1.-~~~~~~Probable ketoamine kinase TTHA1179~~~COG3001
MDPLALLRKAGLEAEGPALPLHGGDISRVWRVGRFVVKTAQDPPPGLFRAEARGLQALAERGVRVPRVHWVGEEGLVLAY
LEPGPEDWEGLARTLAALHRRREGSYLAEPGFLGTFPLPGREGGEWTAFFYERCVLPLLEATWDRLQGLGPKVEALYQRP
LPAEGPAPLHGDLWHGNVYFAREGPALLDPSFFVGERGVDLAMMRLFGGFPRRFWEVYGELYPVPEEVERALPRYQVYYL
LAHVHFFGQGYLGALWRAISAS
>O67099 2.7.4.9~~~tmk~~~Thymidylate kinase~~~COG0125
MLIAFEGIDGSGKTTQAKKLYEYLKQKGYFVSLYREPGGTKVGEVLREILLTEELDERTELLLFEASRSKLIEEKIIPDL
KRDKVVILDRFVLSTIAYQGYGKGLDVEFIKNLNEFATRGVKPDITLLLDIPVDIALRRLKEKNRFENKEFLEKVRKGFL
ELAKEEENVVVIDASGEEEEVFKEILRALSGVLRV
>Q2SWM4 2.7.4.9~~~tmk~~~Thymidylate kinase~~~
MARGKFITFEGIDGAGKTTHLQWFCDRLQERLGPAGRHVVVTREPGGTRLGETLREILLNQPMDLETEALLMFAGRREHL
ALVIEPALARGDWVVSDRFTDATFAYQGGGRGLPRDKLEALERWVQGGFQPDLTVLFDVPPQIASARRGAVRMPDKFESE
SDAFFARTRAEYLRRAQEAPHRFVIVDSSEPIAQIRKQLEGVLAAL
>P0A720 2.7.4.9~~~tmk~~~Thymidylate kinase~~~COG0125
MRSKYIVIEGLEGAGKTTARNVVVETLEQLGIRDMVFTREPGGTQLAEKLRSLVLDIKSVGDEVITDKAEVLMFYAARVQ
LVETVIKPALANGTWVIGDRHDLSTQAYQGGGRGIDQHMLATLRDAVLGDFRPDLTLYLDVTPEVGLKRARARGELDRIE
QESFDFFNRTRARYLELAAQDKSIHTIDATQPLEAVMDAIRTTVTHWVKELDA
>Q2GHN3 2.7.4.9~~~tmk~~~Thymidylate kinase~~~COG0125
MFITFEGIDGSGKTTQSHLLAEYLSEIYGVNNVVLTREPGGTLLNESVRNLLFKAQGLDSLSELLFFIAMRREHFVKIIK
PSLMQKKIVICDRFIDSTIAYQGYGQGIDCSLIDQLNDLVIDVYPDITFIIDVDINESLSRSCKNGYEFADMEFYYRVRD
GFYDIAKKNPHRCHVITDKSETYDIDDINFVHLEVIKVLQMV
>P44719 2.7.4.9~~~tmk~~~Thymidylate kinase~~~COG0125
MKGKFIVIEGLEGAGKSSAHQSVVRVLHELGIQDVVFTREPGGTPLAEKLRHLIKHETEEPVTDKAELLMLYAARIQLVE
NVIKPALMQGKWVVGDRHDMSSQAYQGGGRQLDPHFMLTLKETVLGNFEPDLTIYLDIDPSVGLARARGRGELDRIEQMD
LDFFHRTRARYLELVKDNPKAVVINAEQSIELVQADIESAVKNWWKSNEK
>O26009 2.7.4.9~~~tmk~~~Thymidylate kinase~~~COG0125
MYVVLEGVDGAGKSTQVELLKDRFKNALFTKEPGGTRMGESLRRIALNENISELARAFLFLSDRAEHTESVIKPALKEKK
LIISDRSLISGMAYSQFSSLELNLLATQSVLPAKIILLLIDKEGLKQRLSLKSLDKIENQGIEKLLHIQQKLKTHAYALQ
EKFGCEVLELDAKESVKNLHEKIAAFIKCAV
>P9WKE1 2.7.4.9~~~tmk~~~Thymidylate kinase~~~COG0125
MLIAIEGVDGAGKRTLVEKLSGAFRAAGRSVATLAFPRYGQSVAADIAAEALHGEHGDLASSVYAMATLFALDRAGAVHT
IQGLCRGYDVVILDRYVASNAAYSAARLHENAAGKAAAWVQRIEFARLGLPKPDWQVLLAVSAELAGERSRGRAQRDPGR
ARDNYERDAELQQRTGAVYAELAAQGWGGRWLVVGADVDPGRLAATLAPPDVPS
>Q9HZN8 2.7.4.9~~~tmk~~~Thymidylate kinase~~~
MTGLFVTLEGPEGAGKSTNRDYLAERLRERGIEVQLTREPGGTPLAERIRELLLAPSDEPMAADTELLLMFAARAQHLAG
VIRPALARGAVVLCDRFTDATYAYQGGGRGLPEARIAALESFVQGDLRPDLTLVFDLPVEIGLARAAARGRLDRFEQEDR
RFFEAVRQTYLQRAAQAPERYQVLDAGLPLAEVQAGLDRLLPNLLERLNG
>A7WYM2 2.7.4.9~~~tmk~~~Thymidylate kinase~~~
MSAFITFEGPEGSGKTTVINEVYHRLVKDYDVIMTREPGGVPTGEEIRKIVLEGNDMDIRTEAMLFAASRREHLVLKVIP
ALKEGKVVLCDRYIDSSLAYQGYARGIGVEEVRALNEFAINGLYPDLTIYLNVSAEVGRERIIKNSRDQNRLDQEDLKFH
EKVIEGYQEIIHNESQRFKSVNADQPLENVVEDTYQTIIKYLEKI
>P65248 2.7.4.9~~~tmk~~~Thymidylate kinase~~~
MSAFITFEGPEGSGKTTVINEVYHRLVKDYDVIMTREPGGVPTGEEIRKIVLEGNDMDIRTEAMLFAASRREHLVLKVIP
ALKEGKVVLCDRYIDSSLAYQGYARGIGVEEVRALNEFAINGLYPDLTIYLNVSAEVGRERIIKNSRDQNRLDQEDLKFH
EKVIEGYQEIIHNESQRFKSVNADQPLENVVEDTYQTIIKYLEKI
>P65249 2.7.4.9~~~tmk~~~Thymidylate kinase~~~
MSAFITFEGPEGSGKTTVINEVYHRLVKDYDVIMTREPGGVPTGEEIRKIVLEGNDMDIRTEAMLFAASRREHLVLKVIP
ALKEGKVVLCDRYIDSSLAYQGYARGIGVEEVRALNEFAINGLYPDLTIYLNVSAEVGRERIIKNSRDQNRLDQEDLKFH
EKVIEGYQEIIHNESQRFKSVNADQPLENVVEDTYQTIIKYLEKI
>Q6GJI9 2.7.4.9~~~tmk~~~Thymidylate kinase~~~
MSAFITFEGPEGSGKTTVINEVYHRLVKDYDVIMTREPGGVPTGEEIRKIVLEGNDMDIRTEAMLFAASRREHLVLKVIP
ALKEGKVVLCDRYIDSSLAYQGYARGIGVEEVRALNEFAINGLYPDLTIYLNVSAEVGRERIIKNSRDQNRLDQEDLKFH
EKVIEGYQEIIHNESQRFKSVNADQPLENVVEDTYQTIIKYLEKI
>Q97R91 2.7.4.9~~~tmk~~~Thymidylate kinase~~~COG0125
MSKGFLVSLEGPEGAGKTSVLEALLPILEEKGVEVLTTREPGGVLIGEKIREVILDPSHTQMDAKTELLLYIASRRQHLV
EKVLPALEAGKLVIMDRFIDSSVAYQGFGRGLDIEAIDWLNQFATDGLKPDLTLYFDIEVEEGLARIAANSDREVNRLDL
EGLDLHKKVRQGYLSLLDKEGNRIVKIDASLPLEQVVETTKAVLFDGMGLAK
>Q9X0I3 2.7.4.9~~~tmk~~~Thymidylate kinase~~~COG0125
MFITFEGIDGSGKSTQIQLLAQYLEKRGKKVILKREPGGTETGEKIRKILLEEEVTPKAELFLFLASRNLLVTEIKQYLS
EGYAVLLDRYTDSSVAYQGFGRNLGKEIVEELNDFATDGLIPDLTFYIDVDVETALKRKGELNRFEKREFLERVREGYLV
LAREHPERIVVLDGKRSIEEIHRDVVREVKRRWKLDV
>Q5SHX3 2.7.4.9~~~tmk~~~Thymidylate kinase~~~COG0125
MPGLFLTLEGLDGSGKTTQARRLAAFLEAQGRPVLLTREPGGGLPEVRSLLLTQELSPEAEYLLFSADRAEHVRKVILPG
LAAGKVVISDRYLDSSLAYQGYGRGLPLPWLREVAREATRGLKPRLTFLLDLPPEAALRRVRRPDRLEGLGLEFFRRVRE
GYLALARAEPGRFVVLDATLPEEEIARAIQAHLRPLLP
>Q9KQI2 2.7.4.9~~~tmk~~~Thymidylate kinase~~~COG0125
MNAKFIVIEGLEGAGKSTAIQVVVETLQQNGIDHITRTREPGGTLLAEKLRALVKEEHPGEELQDITELLLVYAARVQLV
ENVIKPALARGEWVVGDRHDMSSQAYQGGGRQIAPSTMQSLKQTALGDFKPDLTLYLDIDPKLGLERARGRGELDRIEKM
DISFFERARERYLELANSDDSVVMIDAAQSIEQVTADIRRALQDWLSQVNRV
>O69169 2.7.4.9~~~tmk~~~Thymidylate kinase~~~COG0125
MNSKFIVIEGLEGAGKTTTRDTVVAVLRAQGINDIVFTREPGGTPLAEKLRDLIKQGIDGEVLTDKAEVLMLYAARVQLV
ENVIKPALARGSWVVGDRHDLSSQAYQGGGRGIDSQLMASLRDTVLGEFRPDLTLYLDLPPAVGLARARARGELDRIEQE
SLAFFERTRARYLELAASDASIKTIDASQPIEQVSASISQALAQWLTNQEPV
>O32080 ~~~ktrA~~~Ktr system potassium uptake protein A~~~COG0569
MGRIKNKQFAVIGLGRFGGSICKELHRMGHEVLAVDINEEKVNAYASYATHAVIANATEENELLSLGIRNFEYVIVAIGA
NIQASTLTTLLLKELDIPNIWVKAQNYYHHKVLEKIGADRIIHPEKDMGVKIAQSLSDENVLNYIDLSDEYSIVELLATR
KLDSKSIIDLNVRAKYGCTILAIKHHGDICLSPAPEDIIREQDCLVIMGHKKDIKRFENEGM
>O87952 ~~~ktrA~~~Ktr system potassium uptake protein A~~~COG0569
MKTGDKQFAVIGLGRFGLAVCKELQDSGSQVLAVDINEDRVKEAAGFVSQAIVANCTHEETVAELKLDDYDMVMIAIGAD
VNASILATLIAKEAGVKSVWVKANDRFQARVLQKIGADHIIMPERDMGIRVARKMLDKRVLEFHPLGSGLAMTEFVVGSR
LMGKTLSDLALCKVEGVQVLGYKRGPEIIKAPDMSTTLEIGDLIIVVGPQDKLANKLKSL
>O32081 ~~~ktrB~~~Ktr system potassium uptake protein B~~~COG0168
MTLQKDKVIKWVRFTPPQVLAIGFFLTIIIGAVLLMLPISTTKPLSWIDALFTAASATTVTGLAVVDTGTQFTVFGQTVI
MGLIQIGGLGFMTFAVLIVMILGKKIGLKERMLVQEALNQPTIGGVIGLVKVLFLFSISIELIAALILSIRLVPQYGWSS
GLFASLFHAISAFNNAGFSLWPDNLMSYVGDPTVNLVITFLFITGGIGFTVLFDVMKNRRFKTFSLHTKLMLTGTLMLNA
IAMLTVFILEYSNPGTLGHLHIVDKLWASYFQAVTPRTAGFNSLDFGSMREGTIVFTLLLMFIGAGSASTASGIKLTTFI
VILTSVIAYLRGKKETVIFRRSIKYPIIIKALAVSVTSLFIVFLGIFALTITEQAPFLQIVFETFSAFGTVGLTMGLTPE
LTTAGKCIIIVIMFIGRIGPLTFVFSFAKTEQSNIRYPDGEVFTG
>O87953 ~~~ktrB~~~Ktr system potassium uptake protein B~~~COG0168
MTQFHQRGVFYVPDGKRDKAKGGEPRIILLSFLGVLLPSAVLLTLPVFSVSGLSITDALFTATSAISVTGLGVVDTGQHF
TLAGKILLMCLMQIGGLGQMTLSAVLLYMFGVRLSLRQQALAKEALGQERQVNLRRLVKKIVTFALVAEAIGFVFLSYRW
VPEMGWQTGMFYALFHSISAFNNAGFALFSDSMMSFVNDPLVSFTLAGLFIFGGLGFTVIGDVWRHWRKGFHFLHIHTKI
MLIATPLLLLVGTVLFWLLERHNPNTMGSLTTGGQWLAAFFQSASARTAGFNSVDLTQFTQPALLIMIVLMLIGAGSTST
GGGIKVSTFAVAFMATWTFLRQKKHVVMFKRTVNWPTVTKSLAIIVVSGAILTTAMFLLMLTEKASFDKVMFETISAFAT
VGLTAGLTAELSEPGKYIMIVVMIIGRIGPLTLAYMLARPEPTLIKYPEDTVLTG
>P39760 ~~~ktrC~~~Ktr system potassium uptake protein C~~~COG0569
MKKEFAVIGLGRFGGSICKALSEEGVEVMAMDIDEDKVNEYAKIASHAVIGDSTDESVLKNLGLRNFDHVIVAIGENIQA
SILTTLILKELGVHTITVKAQNDYHEKVLSKIGADHIVHPERDMAKRIAHNIVSNNVLDYLELSEEHSLVEIVANSRLAG
NTLLDLDIRAKYGINIVAIKRGKEVIVSPLATEVIHQEDILIVIGSVTDISRFEKRVLHTK
>P63183 ~~~kup~~~Low affinity potassium transport system protein Kup~~~COG3158
MSTDNKQSLPAITLAAIGVVYGDIGTSPLYTLRECLSGQFGFGVERDAVFGFLSLIFWLLIFVVSIKYLTFVMRADNAGE
GGILTLMSLAGRNTSARTTSMLVIMGLIGGSFFYGEVVITPAISVMSAIEGLEIVAPQLDTWIVPLSIIVLTLLFMIQKH
GTAMVGKLFAPIMLTWFLILAGLGLRSIIANPEVLHALNPMWAVHFFLEYKTVSFIALGAVVLSITGVEALYADMGHFGK
FPIRLAWFTVVLPSLTLNYFGQGALLLKNPEAIKNPFFLLAPDWALIPLLIIAALATVIASQAVISGVFSLTRQAVRLGY
LSPMRIIHTSEMESGQIYIPFVNWMLYVAVVIVIVSFEHSSNLAAAYGIAVTGTMVLTSILSTTVARQNWHWNKYFVALI
LIAFLCVDIPLFTANLDKLLSGGWLPLSLGTVMFIVMTTWKSERFRLLRRMHEHGNSLEAMIASLEKSPPVRVPGTAVYM
SRAINVIPFALMHNLKHNKVLHERVILLTLRTEDAPYVHNVRRVQIEQLSPTFWRVVASYGWRETPNVEEVFHRCGLEGL
SCRMMETSFFMSHESLILGKRPWYLRLRGKLYLLLQRNALRAPDQFEIPPNRVIELGTQVEI
>O34859 ~~~ku~~~Non-homologous end joining protein Ku~~~COG1273
MNRTPSLHTKEKKGFIDMHTMWKGSISFGLVNIPIKLYAATEDKDIKLRSLHKEDHAPIKYEKVCTNCEKTLSPDEIVKG
YEYVKGKYVVLTDEDLKSLKQEHEEKAVEIVDFVQLQEIDPIYFNRSYFVGPGDNGTKAYTLLREALRSTGKIGIANMTI
RSKQQLAILRVYENCIVMESIHYPDEVRSAAQVPGVPDQSNVNDKELQTAITLIDELTAKFEPEKYEDTYRQALLQRVND
KLENKETAVTPDKAPPREDVIDLVSALQASIDRTRRPNRETPAAAPAQAAEPKGAGDKKQKTTRKKASGTS
>A0R3S7 ~~~ku~~~Non-homologous end joining protein Ku~~~COG1273
MRSIWKGSIAFGLVNVPVKVYSATEDHDIKFHQVHAKDNGRIRYKRVCEVCGEVVEYRDINKAFESDDGQMVVITDEDIA
TLPEERSREIEVVEFIPAEQLDPLMYDKSYFLEPDSKSSKSYVLLAKTLAETDRIAIVHFSLRNKSRLAALRVKDFSKRD
VMMIHTLLWPDEIRDPDFPILDKEVQIKPAELKMAGQVVESMTDDFKPDLYHDDYQEQLRELVQAKLEGGEAFSVEEQPA
ELDEGTEDVSDLLAKLEASVKARKGGKSDSKDDSDSESDSKESKSDSKPAKKAPAKKAAAKKSTAKKAPAKKAAAKKS
>P9WKD9 ~~~mku~~~Non-homologous end joining protein Ku~~~COG1273
MRAIWTGSIAFGLVNVPVKVYSATADHDIRFHQVHAKDNGRIRYKRVCEACGEVVDYRDLARAYESGDGQMVAITDDDIA
SLPEERSREIEVLEFVPAADVDPMMFDRSYFLEPDSKSSKSYVLLAKTLAETDRMAIVHFTLRNKTRLAALRVKDFGKRE
VMMVHTLLWPDEIRDPDFPVLDQKVEIKPAELKMAGQVVDSMADDFNPDRYHDTYQEQLQELIDTKLEGGQAFTAEDQPR
LLDEPEDVSDLLAKLEASVKARSKANSNVPTPP
>Q9I1W5 ~~~ku~~~Non-homologous end joining protein Ku~~~
MARAIWKGAISFGLVHIPVSLSAATSSQGIDFDWLDQRSMEPVGYKRVNKVTGKEIERENIVKGVEYEKGRYVVLSEEEI
RAAHPKSTQTIEIFAFVDSQEIPLQHFDTPYYLVPDRRGGKVYALLRETLERTGKVALANVVLHTRQHLALLRPLQDALV
LITLRWPSQVRSLDGLELDESVTEAKLDKRELEMAKRLVEDMASHWEPDEYKDSFSDKIMKLVEEKAAKGQLHAVEEEEE
VAGKGADIIDLTDLLKRSLRSRAGGGKDKGSEKAGADAKGRAKSGASRSRRKA
>A9AWD5 5.5.1.29~~~~~~(+)-kolavenyl diphosphate synthase~~~COG1657
MSLIVDILIDDLRALIRDLGQNGGLMSPSVYDTSQALRLYPTPSEEHVWPAVNWLISQQQSDGGWGNPSMPLSRAVPTLA
AILALRRHCQRRSTFDGLLEAKRFLRRQLEYWEKPLPDNLPVGMELLLPYMLEEAYREEHQDDIDDVPIKLRLNIPLAPY
RELIALGEHKRSLIQQKKPRAGTAPVYSWEAWASHADPELIDGSGGIGHSPAATAAWLFAANHNPNLRNEIAGAENYLRQ
ASLATSESAPCIMPTAWPIPRFEQSFSLYALVTGGILDFPSIQDVLKPQIADLHQALKPRGIGFSDDFMPDGDDTAAAVA
VLIAAGYPVDLAILNQFEREPYFVAYHGELQPSISLTARAVHALDLAGVDISRWWKIFIDAQKLDGSWSGDKWNTSWLYT
TCHVLIALKNSPYKTAMKEAVAALQVHQHPDGGWGIINRSTTVETAYAVLALQNLREAGLLDDDDIHMLQRGYNWLCIHY
RPFRMKEYQCWLNKEIYCPQRIDRAYELSAMLAVTLGELKL
>Q81PP9 3.5.1.9~~~kynB~~~Kynurenine formamidase~~~COG1878
MKTSKWIDISQPLNNDIATWPGDTPFSYEVLWSKEESGSVNVGKLTMSIHTGTHIDAPFHFDNDGKKVLDLDIQVYVGPT
RIIDVSNLESIGKKELEKFHLEGVERLLLRTSSHGKANEFPDIIPHLRADIAPFLSEKGIRLIGVDVPSVDPLDDKELAA
HHQLFKHSIHILENVVLDHVADGDYELIALPLALSDADGSPVRAVIRPI
>Q81CK1 3.5.1.9~~~kynB~~~Kynurenine formamidase~~~
MKTSEWIDISQPLNNNIATWPGDTPFSYEVSWSKEESGSVNVGKLTMSIHTGTHIDAPFHFDNDGKKVLDLDVQVYVGPA
RIIDVSNLESIGKKELESFHLEGVERLLLRTSSHGKAEEFPEVIPHLRADIASFLSEKGIRLIGVDVPSVDPLDDKELAA
HHQLFKHGIHILENVVLDHVADGDYELIALPLALTDADGSPVRAVIRPI
>B4E9I9 3.5.1.9~~~kynB~~~Kynurenine formamidase~~~COG1878
MDTLWDISPPVSPATPVWPGDTPVAVERVWRMEAGSPVNVARLTLSPHTGAHCDAPLHYDADGAPIGAVPLDTYLGPCRV
IHCIGAAPVVRPADVEAALDGVPPRVLLRTYARAAVEQWDSNFCAVAPDTVDLLAAHGVKLIGIDTPSLDPQESKTMDAH
RRVRAHRMAILEGIVLDDVPPGDYELIALPLKFATLDASPVRAVLRALPAQAS
>P0C8P4 3.5.1.9~~~kynB~~~Kynurenine formamidase~~~COG1878
MPQAPQLHDGRRIWDISPAVSPATPVWPGDTPFQHDPAWQLDEHCPVNVGRITMSPHTGAHADAPLHYAADGAPIGAVPL
DAYLGPCRVIHCIGAAPRVEPQHIAHALAGTPPRVLLRTYAQAPQGKWDSAFCAVAPETISLLARHGVRLIGIDTPSLDP
ETSKTMDAHHAVRDHQLAILEGIVLDEVPAGDYELIALPLRLATLDASPVRAVLRELP
>Q9I234 3.5.1.9~~~kynB~~~Kynurenine formamidase~~~
MTSLRYWDISPALDPNTPTWPGDTPFQQEWAARLDEQCPVNVGRITLSPHTGAHVDGPLHYRADGLPIGQVPLDIYMGPC
RVIHCIGANPLVTPEHLAGQLDDLPSRVLLRTFERVPANWPEGFCAIAPATIECLAERGVRLVGIDTPSLDPQHSKTLDA
HHAVGRHGMAILEGVVLDDVPAGDYELLALPLKFTHLDASPVRAVLRALPTAE
>P83788 3.7.1.3~~~kynU~~~Kynureninase~~~
MTTRNDCLALDAQDSLAPLRQQFALPEGVIYLDGNSLGARPVAALARAQAVIAEEWGNGLIRSWNSAGWRDLSERLGNRL
ATLIGARDGEVVVTDTTSINLFKVLSAALRVQATRSPERRVIVTETSNFPTDLYIAEGLADMLQQGYTLRLVDSPEELPQ
AIDQDTAVVMLTHVNYKTGYMHDMQALTALSHECGALAIWDLAHSAGAVPVDLHQAGADYAIGCTYKYLNGGPGSQAFVW
VSPQLCDLVPQPLSGWFGHSRQFAMEPRYEPSNGIARYLCGTQPITSLAMVECGLDVFAQTDMASLRRKSLALTDLFIEL
VEQRCAAHELTLVTPREHAKRGSHVSFEHPEGYAVIQALIDRGVIGDYREPRIMRFGFTPLYTTFTEVWDAVQILGEILD
RKTWAQAQFQVRHSVT
>A0Q6X4 ~~~~~~Bacterial lipoprotein FTN_1103~~~
MKYGNLMMTKKKLLIGMVTISGIVILGSCGKTETVNELLIVDQCNDVRDLCRLELANAQVSRYTNFLGKTIKRLQSQTPL
RDIQGTVTWNASAGTSLADNSDVQSELGLSCQDDNCTANSNSTAYTLPVGSNTISVSGTVTVDGKTIDLATDVPALVINT
SAAGSSVHVFPTELEGNLTLQDLVDSLNQGRHYAHATFSADGSNLKIQCDPGYVWLDDINPEYGGQSSAASARSVAMVSW
VEELEEFRVDEFRFLHFDMSSLTLNGVRLGNHVFWEMGCWPT
>Q5ZWE8 3.4.22.-~~~~~~Serine protease Lpg1137~~~
MIQRGFTMQERREKQGNNSPYLLTPYEMANLVFKTGAISFTVSAGTQPFQYLLNKLQFSQSGTPSGLSGGLFRGMYRGFL
PYAIAGQKRGAVAVTHKQTNKVTEEEEFEAPFRQRWWGTIFFSQADLLVSNGLSGKARLQNVGVINAENFKWSLSNFWKL
TSVNWGSRSFAGGVNFALIGFAGDYVSSFYKFDKDLYNKILGGATSGVIATLFTTAPNAYADSKLLQTKVAENNRLITVS
PYTMFGQMKSHVKAVGLKEAFMTFFKVSYLQQVAVRAPQAAITFALIFGMDEYMGPQPLKKVWPGRVEELESENPSPSPT
KK
>S2DJ52 1.1.99.2~~~~~~L-2-hydroxyglutarate dehydrogenase~~~COG0579
MDFQVIIIGGGIVGLATGLKIKQRNPNIKVALLEKEEEVAKHQTGNNSGVIHSGLYYKPGSLKAKNCIEGYHELVRFCEE
ENIPFELTGKVVVATRKEQVPLLNSLLERGLQNGLKGTRSITLDELKHFEPYCAGVAAIHVPQTGIVDYKLVAEKYAEKF
QILGGQVFLGHKVIKVETQNTASIIHTSKGSFSTNLLINCAGLYSDKVAQMNQKESLDVKIIPFRGEYYKIKKEREYLVK
NLIYPVPDPNFPFLGVHFTRMMKGGVEAGPNAVLAFKREGYKKSQVNFSELAETLSWPGFQKVASKYWKTGMGELFRSFS
KKAFTDALKELIPDIQESDLIEGGAGVRAQACDRTGGLLDDFCIREDQNAIHVLNAPSPAATSSLSIGGTVCEWALKRF
>Q48501 ~~~acdT~~~Bacteriocin acidocin 8912~~~
MISSHQKTLTDKELALISGGKTHYPTNAWKSLWKGFWESLRYTDGF
>Q76KX0 3.5.1.101~~~laaA~~~L-amino acid amidase~~~
MEFIEKIREGYAAFGAYQTWYRVTGDLSSGRTPLVVIHGGPGCTHDYVDAFKDVAASGHAVIHYDQLGNGRSTHLPDKDP
SFWTVGLFLEELNNLLDHLQISDNYAILGQSWGGMLGSEHAILQPKGLRAFIPANSPTCMRTWVSEANRLRKLLPEGVHE
TLLKHETAGTYQDPEYLAASRVFYDHHVCRVIPWPEEVARTFAAVDADPTVYHAMSGPTEFHVIGSLKDWKSTGRLSAIN
VPTLVISGRHDEATPLVVKPFLDEIADVRWALFEDSSHMPHVEERQACMGTVVKFLDEVCSAKYKVLKAS
>Q55629 1.4.5.-~~~~~~L-amino acid dehydrogenase~~~COG1231
MVIRSGKTNLNPPCALMAPSSSCDCIIVGSGLSGLIAARNLSRVNYSVLVIEAQERLGGRMYGEYLPSGQWIDRGGQWVG
PTQDRFLALLNEYNIERFPSPADGLKVLLFDGKRYEFDGFFQGVFQGEAPKISSDEWNDAMVAWEKFNTLAQSLDEQHPE
ATPENKKLDSQTFADWIKENTHTAFGHWYFSYMCRAVGFLGPAEPSQVSLLHILWGHKSASQGENPEAELLHGGAGQIPQ
KIAAELGNSILLGEPVIHIAQDDKGVEVTTTTGKYQGKFAIVATPPHLAGRITYSPPMPPLRQQLTQRVPMGTCCKLLIS
YDRPFWREKGLAGIGLGNTTWIELCADSSDPTTGVGVIASFVVGDRYGKWIAMGEAERRQGVLSDLALYFGEEALSPETY
DEVDWPSEQWVGGGYAAFMPPGVWTSFGQALSAPVGRIHWAGTEIAPRWAGFFDGAIRTGEAAAKAIIGLL
>Q31LZ8 ~~~labA~~~Low amplitude and bright protein LabA~~~COG1432
MAFRPSRLAIFIDGNNMFYAQQKNGWFFDPRRVLNYFANRPEIELVNAYWYTGLKDPQDQRGFRDALVSLGYTVRTKMLK
EFHDESNGNRYFQRANLDIEIVIDMFNTVEQYDEIVLFSGDGDFERAIELLRAKQTHITVVSTDGMIARELRNATDRYID
LNDIRSFIEKTERPEPFVSAPVIAPA
>P23494 5.3.1.26~~~lacA~~~Galactose-6-phosphate isomerase subunit LacA~~~
MAIVVGADLKGTRLKDVVKNFLVEEGFEVIDVTKDGQDFVDVTLAVASEVNKDEQNLGIVIDAYGAGPFMVATKIKGMVA
AEVSDERSAYMTRGHNNARMITVGAEIVGDELAKNIAKAFVNGKYDGGRHQVRVDMLNKMC
>P23495 5.3.1.26~~~lacB~~~Galactose-6-phosphate isomerase subunit LacB~~~
MRIAIGCDHIVTDVKMAVSEFLKSKGYEVLDFGTYDHVRTHYPIYGKKVGEAVVSGQADLGVCICGTGVGINNAVNKVPG
VRSALVRDMTSALYAKEELNANVIGFGGMITGGLLMNDIIEAFIEAEYKPTEENKKLIAKIEHVETHNAHQADEEFFTEF
LEKWDRGEYHD
>P65647 5.3.1.26~~~lacB~~~Galactose-6-phosphate isomerase subunit LacB~~~
MKIALGCDHIVTDTKMRVSEFLKSKGHEVIDVGTYDFTRTHYPIFGKKVGEQVVSGNADLGVCICGTGVGINNAVNKVPG
VRSALVRDMTSALYAKEELNANVIGFGGRIIGELLMCDIIDAFINAEYKATEENKKLIAKIKHLETSNADQADPHFFDEF
LEKWDRGEYHD
>Q833W9 2.7.1.144~~~lacC~~~Tagatose-6-phosphate kinase~~~COG1105
MIVTVTMNPSIDISYLLDHLKLDTVNRTSQVTKTPGGKGLNVTRVIHDLGGDVIATGVLGGFHGAFIANELKKANIPQAF
TSIKEETRDSIAILHEGNQTEILEAGPTVSPEEISNFLENFDQLIKQAEIVTISGSLAKGLPSDFYQELVQKAHAQEVKV
LLDTSGDSLRQVLQGPWKPYLIKPNLEELEGLLGQDFSENPLAAVQTALTKPMFAGIEWIVISLGKDGAIAKHHDQFYRV
KIPTIQAKNPVGSGDATIAGLAYGLAKDAPAAELLKWGMAAGMANAQERMTGHVDVENVKKHLMNIQVVEIAK
>P23391 2.7.1.144~~~lacC~~~Tagatose-6-phosphate kinase~~~
MILTVTLNPSVDISYPLETLKIDTVNRVKDVSKTAGGKGLNVTRVLYESGDKVTATGFLGGKIGEFIESELEQSPVSPAF
YKISGNTRNCIAILHEGNQTEILEQGPTISHEEAEGFLDHYSNLIKQSEVVTISGSLPSGLPNDYYEKLIQLASDEGVAV
VLDCSGAPLETVLKSSAKPTAIKPNNEELSQLLGKEVTKDIEELKDVLKESLFSGIEWIVVSLGRNGAFAKHGDVFYKVD
IPDIPVVNPVGSGDSTVAGIASALNSKKSDADLLKHAMTLGMLNAQETMTGHVNMTNYETLNSQIGVKEV
>P0A0B9 2.7.1.144~~~lacC~~~Tagatose-6-phosphate kinase~~~COG1105
MILTLTLNPSVDISYPLTALKLDDVNRVQEVSKTAGGKGLNVTRVLAQVGEPVLASGFIGGELGQFIAKKLDHADIKHAF
YNIKGETRNCIAILHEGQQTEILEQGPEIDNQEAAGFIKHFEQLLEKVEAVAISGSLPKGLNQDYYAQIIERCQNKGVPV
ILDCSGATLQTVLENPYKPTVIKPNISELYQLLNQPLDESLESLKQAVSQPLFEGIEWIIVSLGAQGAFAKHNHTFYRVN
IPTISVLNPVGSGDSTVAGITSAILNHENDHDLLKKANTLGMLNAQEAQTGYVNLNNYDDLFNQIEVLEV
>Q5HE12 2.7.1.144~~~lacC~~~Tagatose-6-phosphate kinase~~~
MILTLTLNPSVDISYPLTALKLDDVNRVQEVSKTAGGKGLNVTRVLAQVGEPVLASGFIGGELGQFIAKKLDHADIKHAF
YNIKGETRNCIAILHEGQQTEILEQGPEIDNQEAAGFIKHFEQLLEKVEAVAISGSLPKGLNQDYYAQIIERCQNKGVPV
ILDCSGATLQTVLENPYKPTVIKPNISELYQLLNQPLDESLESLKQAVSQPLFEGIEWIIVSLGAQGAFAKHNHTFYRVN
IPTISVLNPVGSGDSTVAGITSAILNHENDHDLLKKANTLGMLNAQEAQTGYVNLNNYDDLFNQIEVLEV
>P63703 4.1.2.40~~~lacD1~~~Tagatose 1,6-diphosphate aldolase 1~~~
MTITANKRHYLEKVSHQGIISALAFDQRGALKQMMAAHQEGEATVTQIETLKVLVSEELTPYASSILLDPEYGLLATKVR
ANQTGLLLAYEKTGYDATTTSRLPDCLVEWSVKRLKAAGADAIKFLLYYDVDGDEQINLQKQAYIERIGSECTAEDIPFF
LELLSYDERISDNNSAAYAKLKPHKVNGAMSVFSDKRFGVDVLKVEVPVNMAYVEGFTEGEVHYSQAEAIKAFQDQEAAS
HLPYIYLSAGVSAKLFQETLYFAAAAGAQFSGVLCGRATWAGSVPVYITKGEDEARKWLCTEGFQNIDELNRVLEETASP
WTEKI
>Q8DWE5 4.1.2.40~~~lacD2~~~Tagatose 1,6-diphosphate aldolase 2~~~COG3684
MILSQQKYNYLAKVSDSNGVISALAFDQRGALKCLMAQYQMKEPTVAQMEELKVLVSEELTPYASSILLDPEYGLPAAQA
RDREAGLLLAYEKTGYDANTTSRLPDCLVDWSIKRLKEAGADAVKFLLYYDVDGDPQVNVQKQAYIERIGSECQAEDIPF
FLEILTYDETISNNSSVEFAKVKVHKVNDAMKVFSAERFGIDVLKVEVPVNMVYVEGFAEGEVVYSKEEAAQAFREQEAS
TDLPYIYLSAGVSAELFQETLVFAHKAGAKFNGVLCGRATWAGSVQVYMEEGKEAARQWLRTSGLQNINELNKVLKTTAS
PWTEKVSVG
>P63705 4.1.2.40~~~lacD2~~~Tagatose 1,6-diphosphate aldolase 2~~~
MTITLTENKRKSMEKLSVDGVISALAFDQRGALKRMMAQHQTKEPTVEQIEELKSLVSEELTPFASSILLDPEYGLPASR
VRSEEAGLLLAYEKTGYDATTTSRLPDCLDVWSAKRIKEAGAEAVKFLLYYDIDGDQDVNEQKKAYIERIGSECRAEDIP
FYLEILTYDEKIADNASPEFAKVKAHKVNEAMKVFSKERFGVDVLKVEVPVNMKFVEGFADGEVLFTKEEAAQAFRDQEA
STDLPYIYLSAGVSAKLFQDTLVFAAESGAKFNGVLCGRATWAGSVKVYIEEGPQAAREWLRTEGFKNIDELNKVLDKTA
SPWTEKM
>P26593 4.1.2.40~~~lacD~~~Tagatose 1,6-diphosphate aldolase~~~
MVLTEQKRKSLEKLSDKNGFISALAFDQRGALKRLMAQYQDTEPTVAQMEELKVLVADELTKYASSMLLDPEYGLPATKA
LDKEAGLLLAFEKTGYDTSSTKRLPDCLDVWSAKRIKEQGADAVKFLLYYDVDSSDELNQQKQAYIERVGSECVAEDIPF
FLEILAYDEEISDAGSVEYAKVKPRKVIEAMKVFSDPRFNIDVLKVEVPVNVKYVEGFADGEVVYSKAEAADFFKAQEEA
TNLPYIYLSAGVSAKLFQETLQFAHDSGAKFNGVLCGRATWAGSVEPYIKEGEKAAREWLRTTGFENIDELNKVLVKTAS
PWTDKV
>P0A011 4.1.2.40~~~lacD~~~Tagatose 1,6-diphosphate aldolase~~~COG3684
MSKSNQKIASIEQLSNNEGIISALAFDQRGALKRMMAKHQTEEPTVAQIEQLKVLVAEELTQYASSILLDPEYGLPASDA
RNKDCGLLLAYEKTGYDVNAKGRLPDCLVEWSAKRLKEQGANAVKFLLYYDVDDAEEINIQKKAYIERIGSECVAEDIPF
FLEVLTYDDNIPDNGSVEFAKVKPRKVNEAMKLFSEPRFNVDVLKVEVPVNMKYVEGFAEGEVVYTKEEAAQHFKDQDAA
THLPYIYLSAGVSAELFQETLKFAHEAGAKFNGVLCGRATWSGAVQVYIEQGEDAAREWLRTTGFKNIDDLNKVLKDTAT
SWKQRK
>Q5HE13 4.1.2.40~~~lacD~~~Tagatose 1,6-diphosphate aldolase~~~
MSKSNQKIASIEQLSNNEGIISALAFDQRGALKRMMAKHQTEEPTVAQIEQLKVLVAEELTQYASSILLDPEYGLPASDA
RNKDCGLLLAYEKTGYDVNAKGRLPDCLVEWSAKRLKEQGANAVKFLLYYDVDDAEEINIQKKAYIERIGSECVAEDIPF
FLEVLTYDDNIPDNGSVEFAKVKPRKVNEAMKLFSEPRFNVDVLKVEVPVNMKYVEGFAEGEVVYTKEEAAQHFKDQDAA
THLPYIYLSAGVSAELFQETLKFAHEAGAKFNGVLCGRATWSGAVQVYIEQGEDAAREWLRTTGFKNIDDLNKVLKDTAT
SWKQRK
>P0A009 4.1.2.40~~~lacD~~~Tagatose 1,6-diphosphate aldolase~~~
MSKSNQKIASIEQLSNNEGIISALAFDQRGALKRMMAKHQTEEPTVAQIEQLKVLVAEELTQYASSILLDPEYGLPASDA
RNKDCGLLLAYEKTGYDVNAKGRLPDCLVEWSAKRLKEQGANAVKFLLYYDVDDAEEINIQKKAYIERIGSECVAEDIPF
FLEVLTYDDNIPDNGSVEFAKVKPRKVNEAMKLFSEPRFNVDVLKVEVPVNMKYVEGFAEGEVVYTKEEAAQHFKDQDAA
THLPYIYLSAGVSAELFQETLKFAHEAGAKFNGVLCGRATWSGAVQVYIEQGEDAAREWLRTTGFKNIDDLNKVLKDTAT
SWKQRK
>P0A010 4.1.2.40~~~lacD~~~Tagatose 1,6-diphosphate aldolase~~~
MSKSNQKIASIEQLSNNEGIISALAFDQRGALKRMMAKHQTEEPTVAQIEQLKVLVAEELTQYASSILLDPEYGLPASDA
RNKDCGLLLAYEKTGYDVNAKGRLPDCLVEWSAKRLKEQGANAVKFLLYYDVDDAEEINIQKKAYIERIGSECVAEDIPF
FLEVLTYDDNIPDNGSVEFAKVKPRKVNEAMKLFSEPRFNVDVLKVEVPVNMKYVEGFAEGEVVYTKEEAAQHFKDQDAA
THLPYIYLSAGVSAELFQETLKFAHEAGAKFNGVLCGRATWSGAVQVYIEQGEDAAREWLRTTGFKNIDDLNKVLKDTAT
SWKQRK
>P29822 ~~~lacE~~~Lactose-binding protein~~~
MDYSRLLKRSVSAALTAAALLCSTAAFAGEVTIWCWDPNFNVAIMKEAAERYTAKHPDTTFNIVDFAKADVEQKLQTGLA
SGMTDTLPDIVLIEDYGAQKYLQSFPGSFAALTDKIDFSGFAKYKVDLMTLEGQVYGVPFDSGVTGLYYRTDYLEQAGFK
PEDMQNLTWDRFIEIGKEVKAKTGHEMMALDANDGGLIRIMMQSGGQWYFNEDGSLNITGNAALKAALETQARIVNERVA
KPTSGSNDGIRALTSGDVASVLRGVWITGTVKSQPDQAGKWALTAIPKLNIEGATAASNLGGSSWYVLEASAEKDEAIDF
LNEIYAKDLDFYQKILTERGAVGSLLAARTGEAYQKPDDFFGGQTVWQNFADWLVQVPAVNYGIFTNELDTAVTANFPAL
VKGTPVDEVLKAIEDQAAGQIQ
>P29823 ~~~lacF~~~Lactose transport system permease protein LacF~~~
MATTSRSSLKRYYDVNGWLFVAPAIALISVFMLYPILRSLVLSLYTGRGMMLKFSGTGNLVRLWNDPVFWQALQNTVIFF
VVQVPIMITMALILAAMLNNPKLRYSGLFRTMIFLPCVSSLVAYSILFKSMFSLDGVVNNTLLAIGIIGEPIGWLTDPFW
AKVLIIIAITWRWTGYNMIFYLAALQNIDRSIYEAAKIDGVPSWGRFAFLTIPMLKPVILFTTITSTIGTLQLFDEVYNF
TEGTGGPANSTLTLSLYIYNLTFRFMPSFSYAATVSYVIVLMVAVLSFLQFYAARERK
>P11546 3.2.1.85~~~lacG~~~6-phospho-beta-galactosidase~~~
MTKTLPKDFIFGGATAAYQAEGATHTDGKGPVAWDKYLEDNYWYTAEPASDFYHKYPVDLELAEEYGVNGIRISIAWSRI
FPTGYGEVNEKGVEFYHKLFAECHKRHVEPFVTLHHFDTPEALHSNGDFLNRENIEHFIDYAAFCFEEFPEVNYWTTFNE
IGPIGDGQYLVGKFPPGIKYDLAKVFQSHHNMMVSHARAVKLYKDKGYKGEIGVVHALPTKYPYDPENPADVRAAELEDI
IHNKFILDATYLGHYSDKTMEGVNHILAENGGELDLRDEDFQALDAAKDLNDFLGINYYMSDWMQAFDGETEIIHNGKGE
KGSSKYQIKGVGRRVAPDYVPRTDWDWIIYPEGLYDQIMRVKNDYPNYKKIYITENGLGYKDEFVDNTVYDDGRIDYVKQ
HLEVLSDAIADGANVKGYFIWSLMDVFSWSNGYEKRYGLFYVDFDTQERYPKKSAHWYKKLAETQVIE
>C7N8L9 3.2.1.85~~~lacG~~~6-phospho-beta-galactosidase~~~COG2723
MSKKLPEDFIFGGATAAYQAEGAIKIDGKGPVAWDKFLEENYWYTAEPASDFYHQYPVDLKLCEEFGINGIRISIAWSRI
FPNGYGEVNPKGVEFYHKLFAECKKRKVEPFVTLHHFDTPEVLHSNGDFLNRENIEHFVNYAKFCFEEFSEVNYWTTFNE
IGPIGDGQYLVGKFPPGIKYDFEKLFQSHHNMVLAHAKAVNLFKKNGYHGEIGMVCALPTKYPYDPNNPKDVRAAELDDI
IHNKFILDATFKGEYSKNTMEGVNHILQVNGGKLDLREEDFEELKAAKDLNDFLGINYYMSDWMAEYDGETEIIHNATGN
KGSSKYQIKGVGQRKANESIPRTDWDWIIYPQGLYDQISRVKKDYPNYKKIYITENGLGYKDVFEDNTVYDDARIDYIRQ
HLEVISDAIKDGANVKGYFLWSLMDVFSWSNGYEKRYGLFYVDFETQKRYPKKSAYWYKKVSETKEV
>P29824 ~~~lacG~~~Lactose transport system permease protein LacG~~~
MMTTLRRRLPDIVQYSVLSLAAFLSIFPFIWMVIGTTNTTSQIIRGKVTFGTALFDNIASFFAQVDVPLVFWNSVKIALV
GTALTLLVSSLAGYGFEMFRSKLRERVYTVILLTLMVPFAALMIPLFMLMGQAGLLNTHIAIMLPMIASAFIIFYFRQAS
KAFPTELRDAAKVDGLKEWQIFFYIYVPVMRSTYAAAFVIVFMLNWNNYLWPLIVLQSNDTKTITLVVSSLASAYSPEYG
TVMIGTILATLPTLLVFFAMQRQFVQGMLGSVK
>P67768 3.2.1.85~~~lacG~~~6-phospho-beta-galactosidase~~~
MTKTLPEDFIFGGATAAYQAEGATNTDGKGRVAWDTYLEENYWYTAEPASDFYNRYPVDLELSEKFGVNGIRISIAWSRI
FPNGYGEVNPKGVEYYHKLFAECHKRHVEPFVTLHHFDTPEVLHKDGDFLNRKTIDYFVDYAEYCFKEFPEVKYWTTFNE
IGPIGDGQYLVGKFPPGIKYDFEKVFQSHHNMMVAHARAVKLFKDGGYQGEIGVVHALPTKYPFDPSNPEDVRAAELEDI
IHNKFILDATYLGKYSRETMEGVQHILSVNGGKLNITDEDYAILDAAKDLNDFLGINYYMSDWMRGYDGESEITHNATGD
KGGSKYQLKGVGQREFDVDVPRTDWDWMIYPQGLYDQIMRVVKDYPNYHKIYITENGLGYKDEFIESEKTVHDDARIDYV
RQHLNVIADAIKDGANVKGYFIWSLMDVFSWSNGYEKRYGLFYVDFETQERYPKKSAYWYKELAETKEIK
>P03023 ~~~lacI~~~Lactose operon repressor~~~COG1609
MKPVTLYDVAEYAGVSYQTVSRVVNQASHVSAKTREKVEAAMAELNYIPNRVAQQLAGKQSLLIGVATSSLALHAPSQIV
AAIKSRADQLGASVVVSMVERSGVEACKAAVHNLLAQRVSGLIINYPLDDQDAIAVEAACTNVPALFLDVSDQTPINSII
FSHEDGTRLGVEHLVALGHQQIALLAGPLSSVSARLRLAGWHKYLTRNQIQPIAEREGDWSAMSGFQQTMQMLNEGIVPT
AMLVANDQMALGAMRAITESGLRVGADISVVGYDDTEDSSCYIPPLTTIKQDFRLLGQTSVDRLLQLSQGQAVKGNQLLP
VSLVKRKTTLAPNTQTASPRALADSLMQLARQVSRLESGQ
>Q7WTB0 ~~~lacR~~~HTH-type transcriptional regulator LacR~~~COG1609
MRTIKEIALESGYSPATVSRLLNNDPNLSITADTKNKILEIANKLGYWEDHQEKKIKPTIALLYRVNHNEQLQDEYFTSL
KQALVSTVERDALKMKTFYDIEDLIKNASLFQGFIGVGAEPIENAQLVKLHKVLPNGVFVDTNPAPELFDSIRPNLPFTV
KNAIDLFIKNGINKIGFIGGVGPKHDHIQENDLRSITFVEYMKTRGMDTKWTCVEGPVSVENGYKLGKMVLAKYKNDLPE
AFLIASDTLAVGVLQAFNEENVNVPKDTKILSINNSNVVKYVSPPLSSFNINQQEMIDMALDTLTHLIIRPDRPNIDIRM
NTNLVVRKSFVPQEK
>P67744 ~~~lacR~~~Lactose phosphotransferase system repressor~~~
MNKHERLDEIAKLVNKKGTIRTNEIVEGLNVSDMTVRRDLIELENKGILTKIHGGARSNSTFQYKEISHKEKHTRQIAEK
RYIARKAASLIEDGDTLFFGPGTTVELLAEEVNHHTLTIITNCLPVYKILLEKQTAHFRVYLIGGEMRHITEAFVGEMAN
AMLEKLRFSKMFFSSNAVNKGAVMTSTLDEAYTQQLALSNSIEKYLLIDHTKVGKEDFTSFCQLNELTAVVMDYEDEEKV
ETIKTYIEVVD
>P23826 ~~~lasA~~~Bacteriocin lactocin-S~~~
MKTEKKVLDELSLHASAKMGARDVESSMNADSTPVLASVAVSMELLPTASVLYSDVAGCFKYSAKHHC
>P23496 ~~~lacX~~~Protein LacX, plasmid~~~
MTIELKNEYLTVQFKTLGGQLTSIKDKDGLEYLWQADPEYWNGQAPILFPICGSLRNDWAIYRPQERPFFTGLIRRHGFV
RKEEFTLEEVNENSVTFSIKPNAEMLDNYLYQFELRVVYTLNGKSIRTEFQVTNLETEKTMPYFIGAHPAFNCPLVEGEK
YEDYSLEFSEVESCSIPKSFPETGLLDLQDRTPFLENQKSLDLDYSLFSHDAITLDRLKSRSVTLRSRKSGKGLRVDFDD
FPNLILWSTTNKSPFIALEPWSGLSTSLEEGNILEDKPQVTKVLPLDTSKKSYDITILN
>P02920 ~~~lacY~~~Lactose permease~~~COG2223
MYYLKNTNFWMFGLFFFFYFFIMGAYFPFFPIWLHDINHISKSDTGIIFAAISLFSLLFQPLFGLLSDKLGLRKYLLWII
TGMLVMFAPFFIFIFGPLLQYNILVGSIVGGIYLGFCFNAGAPAVEAFIEKVSRRSNFEFGRARMFGCVGWALCASIVGI
MFTINNQFVFWLGSGCALILAVLLFFAKTDAPSSATVANAVGANHSAFSLKLALELFRQPKLWFLSLYVIGVSCTYDVFD
QQFANFFTSFFATGEQGTRVFGYVTTMGELLNASIMFFAPLIINRIGGKNALLLAGTIMSVRIIGSSFATSALEVVILKT
LHMFEVPFLLVGCFKYITSQFEVRFSATIYLVCFCFFKQLAMIFMSVLAGNMYESIGFQGAYLVLGLVALGFTLISVFTL
SGPGPLSLLRRQVNEVA
>Q7WTB2 ~~~lacS~~~Lactose permease~~~COG2190
MHNHKVSGKQIVSYASFCLGNLGHSAFYGVMSTYFIIFITSGMFSGLNQSVADKLIGLITGLMVLVRIIELVIDPILGNV
VDNTKTRWGKFKPWILIGTVVSAALLLILFTGIFGLAQQNWILFAILFVLIYIAFDVFYSLSDVSYWGMVPALSEDSHER
GIYTSLGAFSGIIGWNSLPIIVVPLVTGVTYAVTGKHEEGAPGWFAFAAVISALAIICALIVCFGTKEKHNIIRDSAKQK
TTLRQVFGAIFHNDQILWPSLAYLLYSLAAVITNGVLFYMYKFVIGKPNDFWVVGIIATIIGCCINPSFPVLNKYIPRKW
LFIAGQTCMVLAYVLFIFGHNNVFLMDLGLVLFNINFALLVTVLTLTDAIEYGQLKIGQRNEAVVLAVRPMIDKFAGAVS
NALVGYVAIAAGMTGSATAADMTSKGINTFNMMALYIPLALAVLSIVVFSLKVTLSEKKHAQVIEELKSKLAQGEIEKKT
SVDTGTKEVTIYAPADGELMQMSSVVDEDGKPFPGKGFAIEPSSGQIYAPFDGTIKFTFGTKHAFEIVSQNGLQVVVHVG
LGTVNLRGEGFETFYDDGQTVKKGDKLLEFDRDLALNNGYKDTIVIFYTQPGRIQNSGTIQAGKDIKHGEKVVDVQFK
>A4IU28 1.14.14.28~~~ladA~~~Long-chain alkane monooxygenase~~~COG2141
MTKKIHINAFEMNCVGHIAHGLWRHPENQRHRYTDLNYWTELAQLLEKGKFDALFLADVVGIYDVYRQSRDTAVREAVQI
PVNDPLMLISAMAYVTKHLAFAVTFSTTYEHPYGHARRMSTLDHLTKGRIAWNVVTSHLPSADKNFGIKKILEHDERYDL
ADEYLEVCYKLWEGSWEDNAVIRDIENNIYTDPSKVHEINHSGKYFEVPGPHLCEPSPQRTPVIYQAGMSERGREFAAKH
AECVFLGGKDVETLKFFVDDIRKRAKKYGRNPDHIKMFAGICVIVGKTHDEAMEKLNSFQKYWSLEGHLAHYGGGTGYDL
SKYSSNDYIGSISVGEIINNMSKLDGKWFKLSVGTPKKVADEMQYLVEEAGIDGFNLVQYVSPGTFVDFIELVVPELQKR
GLYRVDYEEGTYREKLFGKGNYRLPDDHIAARYRNISSNV
>D5SJ87 5.5.1.30~~~~~~Labda-7,13-dienyl diphosphate synthase~~~COG1657
MPVDVGTLPPPAPREAVSAAAHLLASVDGDPWGRTSPTVYETARVHAWAPHLPGRDRRVTWLLDQQRAGGLWGDGPPAYQ
VLPTLAAVTALLAELDRHPEAGHSSLGGRLAAAVAAGLDTLHGLSHHDPLPDTAAVELLVPGLITEVNDRLDAIDPEAAH
PALAPVPHGRRLTAVHGIPALPRHRLAERLARFARLPVKLHHCFEALAPVCPPGLVPARPDHLLGSSSAATAAWLATATA
APGAPGLDRLLRSTAARYGGLFPETARITVFERLWVLTTLHRAGLLATFEPLARRWVSALAAPGGVPGVPGFEPDADDTA
VTLHLATELGVPYRPEVLDPFRTGDHFACYLGEDTGSVSTNAHVLLALGTWTRHHPDTADHGNTIRLLGRWLVERQHGDG
HWDDKWHASPYYATAKVTAALSRHGGPEAADALRRAARWVRETRRTDGSWGIWGGTAEETAYAAQILLDAPEPPTDVLGC
AHAHLTARADDDGPPPALWHDKTLFAPDAIVRAEVLSTLRRLDRRLPAPAPVPPGFDAARTGPAD
>P24022 ~~~lafA~~~Bacteriocin lactacin-F subunit LafA~~~
MKQFNYLSHKDLAVVVGGRNNWQTNVGGAVGSAMIGATVGGTICGPACAVAGAHYLPILWTAVTAATGGFGKIRK
>Q03476 ~~~lafL~~~Flagellar protein LafL~~~COG1580
MTKQQMIAMFIAMIITSALVSAATIMGGIWYLNKQAQDSGETSSLLENSPLSFLVTEQPTSKGPSFHPLDKVVLSIKGKK
QTHFVMLELAIETRRPERIKDIDNYMPMVQNSLLKLFSDKTFDELQQTGAIDILQNEVKQTLLVAFAKTDIVRDIDDVLL
TKYVVQ
>Q03474 ~~~lafS~~~RNA polymerase sigma factor for flagellar operon~~~COG1191
MLDMNPQETYTAPEEVNTPSRPIDENALLQRHQVMVKRVVNQLRVHATSHCSIEDMQQIGLIALVEAGRRYGDIDDTHFP
AFAVCRVRGAILDELRRLDWRSRKTRQQAHELNDVTRDLTRSLGRMPTDSEIIKALGTDEQDYYNRQNAALAGEMQSLDQ
LMENSTDSHFGGQYDGMEHEHIRRSLDSALGRLSKRDQLLLTLFYQHELNLHEIALVLDLTPPRICQLHKQALKQLNQLM
SS
>Q03477 ~~~lafT~~~Chemotaxis protein LafT~~~COG1291
MQKFLGVLTILVCVFGGYMWAGGKLGAIWQPAEFLIIIGAAAGSLIIGNPPHVLKEMRQQVPATIKGPTEEYEYYMELMA
LLNNLLETARSRGFKFLDSHIEAPEQSSIFLMYPLVSEDHRLISFITDNLRLMAMGQMSPHELEGLLEQEIEAIQNELLL
PSRSLQRTAEALPGFGILAAVGGIIITMQAIDGSIALIGYHVAAALVGTFIGIFGCYCGLDPLSNAMAQRVKRNMTAFEC
VRATLVAYVAKKPTLLAIDAGRKHIQLDIKPTFNQMEKWLAEQEG
>Q03478 ~~~lafU~~~Chemotaxis protein LafU~~~COG1360
MQKQEHVVFKRAKAHGHDEPHGGAWKVAFADFMIALMALFLVLWVMQVVDKEERKAIVAHLHSSSVFDKSYGNPFDTSQS
ISPIDLAQDSSVPSKHNSNHVVSSYFQGDGDGPEINSLVPGTFDTQEQLAALAKVIEEMTAQINAQGNVNVTVTPQGLRI
VLQDDYKQHMFSRGGAELTPFFEDLLLALAPLFEQVTNPLIISGHTDAIPFKKRFGRQSNWALSASRADVARKTLVEGGM
PDDRVMQVTGMSDRALLNPDEPDSSENRRIELFILTTPAAKVLETLFGNQDDSELQKAKQKAEFNQPVIRQEVIRYSADA
EKQEAKIQAL
>P59852 3.4.22.-~~~lagD~~~Lactococcin-G-processing and transport ATP-binding protein LagD~~~
MKKIIYQQDEKDCGVACIAMILKHYGTEITIQRLRELSGTDLDGTSAFGIKKTFEKLGFDAPAFKAGDETWQEKDIPLPL
IAHIISEQKYQHYVVVYKVKGDEIWIADPAKGKIRKTISEFSKEWTGVLLFPKPKAEYKPSIERVDSLSTFFPILIKQKS
LFITIFGIISSYYFQGLLDNIIPNQARSTLNILSIGLIFVYLFRVLFEYSRSYLLLLIGQRMSMSIMLGYFKHVLSLPLS
FFATRKSGEIISRFLDANKIIDALASATLSLILDIGMVILVGTTLAIQSTQLFLLTLAFLPFYILVVYVFIRSYDKANTE
EMSAGAEVNSSIIESLKGIETIKSYNGENHVYDRVDSEFVTLMKKSFKSVTLDNVQQSLKMVIELISSVLILWLGSSYVI
DGKISLGQLITYNALLVFFTEPLQNIINLQVKMQKARVANKRLNEIMSISPEQRNTNINISKNIFNKDIKLDKVSFSYNM
KLPVLRDVSLEIYSKSKVALVGVSGSGKSTLAKLLVKFYDPSEGNITYGDINCQDIENHKLRNHVTYVPQESFFFNGTII
DNLTFGLSHQPEFEKIFRACKAACLVDFINQQPLRFDSVLEEGGNNLSGGQKQRLAIARAILNDSEIIIFDEATSGLDTL
LEKEILEYLIKLQDKTIIFIAHHLSIAKACDEIIVLDQGILVGRGTHEELSEKEGVYRRLLNA
>G2NFJ9 3.2.1.58~~~~~~Exo-beta-1,3-glucanase~~~
MHVPPTDPARSAPPASPHRRRRPKALGLTALAAAMLMAVPTTQAAFGSDVRPAAAQEVVGGGDLGPNVLVFDPSTPDIQG
KVDEVFRKQESNQFGTDRYALMFKPGTYNDINAQIGFYTSIAGLGLNPDDTTFNGDVTVDAGWFDGNATQNFWRSAENLA
LNPVNGTNRWAVSQAAPFRRMHVKGGLNLAPDGYGWASGGYIADSKIDGEVGPYSQQQWYTRDSSVGGWGNGVWNMTFSG
VEGAPAQSFPEPPYTTLETTPVSREKPFLYLDGDDYKVFVPAKRTNARGTSWGNGTPEGESLPLDQFYVVKPGATAETIN
AAVDQGLHLLFTPGVYHVDQPIEIDRANTVALGLGLATIIPDNGVTALKVGDVDGVKVAGLLVDAGPVNSETLVEVGSDG
ASGDHAANPTSLQDVFVRIGGAGPGKATTSIVVNSNDTIIDHTWVWRADHGEGVGWETNRADYGVHVKGDNVLATGLFVE
HFNKYDVQWSGENGKTIFYQNEKAYDAPDQAAIQNGDIKGYAAYKVDDSVTTHEGWGMGSYCYFNVNPDIRQQHGFQAPV
KPGVKFHDLLVVSLGGKGQYEHVINDIGDPTSGDTTIPSQVVSFP
>Q8CVI4 ~~~lamB~~~Maltoporin~~~COG4580
MMITLRKLPLAVAVAAGVMSAQAMAVDFHGYARSGIGWTGSGGEQQCFQTTGAQSKYRLGNECETYAELKLGQEVWKEGD
KSFYFDTNVAYSVAQQNDWEATDPAFREANVQGKNLIEWLPGSTIWAGKRFYQRHDVHMIDFYYWDISGPGAGLENIDVG
FGKLSLAATRSSEAGGSSSFASNNIYDYTNETANDVFDVRLAQMEINPGGTLELGVDYGRANLRDNYRLVDGASKDGWLF
TAEHTQSVLKGFNKFVVQYATDSMTSQGKGLSQGSGVAFDNEKFAYNINNNGHMLRILDHGAISMGDNWDMMYVGMYQDI
NWDNDNGTKWWTVGIRPMYKWTPIMSTVMEIGYDNVESQRTGDKNNQYKITLAQQWQAGDSIWSRPAIRVFATYAKWDEK
WGYDYTGSSSTNPYYGKAVSADFNGGSFGRGDSDEWTFGAQMEIWW
>P02943 ~~~lamB~~~Maltoporin~~~COG4580
MMITLRKLPLAVAVAAGVMSAQAMAVDFHGYARSGIGWTGSGGEQQCFQTTGAQSKYRLGNECETYAELKLGQEVWKEGD
KSFYFDTNVAYSVAQQNDWEATDPAFREANVQGKNLIEWLPGSTIWAGKRFYQRHDVHMIDFYYWDISGPGAGLENIDVG
FGKLSLAATRSSEAGGSSSFASNNIYDYTNETANDVFDVRLAQMEINPGGTLELGVDYGRANLRDNYRLVDGASKDGWLF
TAEHTQSVLKGFNKFVVQYATDSMTSQGKGLSQGSGVAFDNEKFAYNINNNGHMLRILDHGAISMGDNWDMMYVGMYQDI
NWDNDNGTKWWTVGIRPMYKWTPIMSTVMEIGYDNVESQRTGDKNNQYKITLAQQWQAGDSIWSRPAIRVFATYAKWDEK
WGYDYTGNADNNANFGKAVPADFNGGSFGRGDSDEWTFGAQMEIWW
>P26466 ~~~lamB~~~Maltoporin~~~
MMITLRKLPLAVAVAAGVMSAQAMAVDFHGYARSGIGWTGSGGEQQCFQATGAQSKYRLGNECETYAELKLGQEVWKEGD
KSFYFDTNVAYSVNQQNDWESTDPAFREANVQGKNLIEWLPGSTIWAGKRFYQRHDVHMIDFYYWDISGPGAGIENIDLG
FGKLSLAATRSTEAGGSYTFSSQNIYDEVKDTANDVFDVRLAGLQTNPDGVLELGVDYGRANTTDGYKLADGASKDGWMF
TAEHTQSMLKGYNKFVVQYATDAMTTQGKGQARGSDGSSSFTEELSDGTKINYANKVINNNGNMWRILDHGAISLGDKWD
LMYVGMYQNIDWDNNLGTEWWTVGVRPMYKWTPIMSTLLEVGYDNVKSQQTGDRNNQYKITLAQQWQAGDSIWSRPAIRI
FATYAKWDEKWGYIKDGDNISRYAAATNSGISTNSRGDSDEWTFGAQMEIWW
>Q7V8T1 ~~~~~~Lantipeptide prochlorosin 1.1~~~
MSEEQLKAFIAKVQADTSLQEQLKAEGADVVAIAKAAGFSITTEDLEKEHRQTLSDDDLEGVAGGFFCVQGTANRFTINV
C
>P86047 ~~~elxA~~~Lantibiotic epilancin 15X~~~
MKKELFDLNLNKDIEAQKSDLNPQSASIVKTTIKASKKLCRGFTLTCGCHFTGKK
>Q7TV59 ~~~~~~Lantipeptide prochlorosin 1.7~~~
MSEEQLKAFIAKVQADTSLQEQLKVEGADVVAIAKASGFAITTEDLKAHQANSQKNLSDAELEGVAGGTIGGTIVSITCE
TCDLLVGKMC
>Q7V7B7 ~~~~~~Lantipeptide prochlorosin 3.3~~~
MSEEQLKAFIAKVQGDSSLQEQLKAEGADVVAIAKAAGFTIKQQDLNAAASELSDEELEAASGGGDTGIQAVLHTAGCYG
GTKMCRA
>Q7V735 ~~~~~~Lantipeptide prochlorosin 4.3~~~
MSEEQLKAFIAKVQADTSLQEQLKAEGADVVAIAKAAGFTITTEDLNSHRQNLTDDELEGVAGGTASGGCDTSMFCY
>P36499 ~~~lctA~~~Lantibiotic lacticin-481~~~
MKEQNSFNLLQEVTESELDLILGAKGGSGVIHTISHECNMNSWQFVFTCCS
>P85065 ~~~~~~Lantibiotic 107891~~~
VTSWSLCTPGCTSPGGGSNCSFCC
>O87236 ~~~ltnA1~~~Lantibiotic lacticin 3147 A1~~~
MNKNEIETQPVTWLEEVSDQNFDEDVFGACSTNTFSLSDYWGNNGAWCTLTHECMAWCK
>O87237 ~~~ltnA2~~~Lantibiotic lacticin 3147 A2~~~
MKEKNMKKNDTIELQLGKYLEDDMIELAEGDESHGGTTPATPAISILSAYISTNTCPTTKCTRAC
>P56650 ~~~garA~~~Lantibiotic actagardine~~~
MSALAIEKSWKDVDLRDGATSHPAGLGFGELTFEDLREDRTIYAASSGWVCTLTIECGTVICAC
>P83674 ~~~rumA1~~~Ruminococcin-A~~~
MRNDVLTLTNPMEEKELEQILGGGNGVLKTISHECNMNTWQFLFTCC
>H2A7G5 ~~~~~~Lantibiotic macedovicin~~~
MMNATENQIFVETVSDQELEMLIGGADRGWIKTLTKDCPNVISSICAGTIITACKNCA
>O54329 ~~~mutA~~~Lantibiotic mutacin-2~~~
MNKLNSNAVVSLNEVSDSELDTILGGNRWWQGVVPTVSYECRMNSWQHVFTCC
>P36501 ~~~scnA~~~Lantibiotic streptococcin A-FF22~~~
MEKNNEVINSIQEVSLEELDQIIGAGKNGVFKTISHECHLNTWAFLATCCS
>P36500 ~~~salA~~~Lantibiotic salivaricin-A~~~
MKNSKDILNNAIEEVSEKELMEVAGGKRGSGWIATITDDCPNSVFVCC
>Q57312 ~~~elkA~~~Lantibiotic epilancin~~~
MNNSLFDLNLNKGVETQKSDLSPQSASVLKTSIKVSKKYCKGVTLTCGCNITGGK
>P08136 ~~~epiA~~~Lantibiotic epidermin~~~
MEAVKEKNDLFNLDVKVNAKESNDSGAEPRIASKFICTPGCAKTGSFNSYCC
>P21838 ~~~gdmA~~~Lantibiotic gallidermin~~~
MEAVKEKNELFDLDVKVNAKESNDSGAEPRIASKFLCTPGCAKTGSFNSYCC
>Q65DC4 ~~~lanA1~~~Lantibiotic lichenicidin A1~~~
MSKKEMILSWKNPMYRTESSYHPAGNILKELQEEEQHSIAGGTITLSTCAILSKPLGNNGYLCTVTKECMPSCN
>P86475 ~~~lchA1~~~Lantibiotic lichenicidin VK21 A1~~~
MSKKEMILSWKNPMYRTESSYHPAGNILKELQEEEQHSIAGGTITLSTCAILSKPLGNNGYLCTVTKECMPSCN
>P86720 ~~~lanA2~~~Lantibiotic lichenicidin A2~~~
MKNSAAREAFKGANHPAGMVSEEELKALVGGNDVNPETTPATTSSWTCITAGVTVSASLCPTTKCTSRC
>P86476 ~~~lchA2~~~Lantibiotic lichenicidin VK21 A2~~~
MKTMKNSAAREAFKGANHPAGMVSEEELKALVGGNDVNPETTPATTSSWTCITAGVTVSASLCPTTKCTSRC
>P80666 ~~~~~~Lantibiotic mutacin B-Ny266~~~
FKSWSFCTPGCAKTGSFNSYCC
>B5MFD0 ~~~nukA~~~Lantibiotic nukacin~~~
MENSKIMKDIEVANLLEEVQEDELNEVLGAKKKSGVIPTVSHDCHMNTFQFMFTCCS
>E0WX65 ~~~nukA~~~Lantibiotic nukacin~~~
MENSKVMKDIEVANLLEEVQEDELNEVLGAKKKSGVIPTVSHDCHMNSFQFVFTCCS
>Q9KWM4 ~~~nukA~~~Lantibiotic nukacin~~~
MENSKVMKDIEVANLLEEVQEDELNEVLGAKKKSGVIPTVSHDCHMNSFQFVFTCCS
>Q2QBT0 ~~~nsuA~~~Lantibiotic nisin-U~~~
MNNEDFNLDLIKISKENNSGASPRITSKSLCTPGCKTGILMTCPLKTATCGCHFG
>P13068 ~~~spaN~~~Lantibiotic nisin-A~~~
MSTKDFNLDLVSVSKKDSGASPRITSISLCTPGCKTGALMGCNMKTATCHCSIHVSK
>O68586 ~~~lanA~~~Lantibiotic mutacin-1140~~~
MSNTQLLEVLGTETFDVQEDLFAFDTTDTTIVASNDDPDTRFKSWSLCTPGCARTGSFNSYCC
>P86013 ~~~paenA~~~Lantibiotic paenibacillin~~~
MKVDQMFDLDLRKSYEASELSPQASIIKTTIKVSKAVCKTLTCICTGSCSNCK
>P19578 ~~~pepA~~~Lantibiotic Pep5~~~
MKNNKNLFDLEIKKETSQNTDELEPQTAGPAIRASVKQCQKTLKATRLFTVSCKGKNGCK
>O88038 ~~~ramS~~~Lanthionine-containing peptide SapB precursor RamS~~~
MNLFDLQSMETPKEEAMGDVETGSRASLLLCGDSSLSITTCN
>P29559 ~~~nisZ~~~Lantibiotic nisin-Z~~~
MSTKDFNLDLVSVSKKDSGASPRITSISLCTPGCKTGALMGCNMKTATCNCSIHVSK
>P0ACV4 ~~~lapA~~~Lipopolysaccharide assembly protein A~~~COG3771
MKYLLIFLLVLAIFVISVTLGAQNDQQVTFNYLLAQGEYRISTLLAVLFAAGFAIGWLICGLFWLRVRVSLARAERKIKR
LENQLSPATDVAVVPHSSAAKE
>P0AB60 ~~~lapB~~~Lipopolysaccharide assembly protein B~~~COG2956
MLELLFLLLPVAAAYGWYMGRRSAQQNKQDEANRLSRDYVAGVNFLLSNQQDKAVDLFLDMLKEDTGTVEAHLTLGNLFR
SRGEVDRAIRIHQTLMESASLTYEQRLLAIQQLGRDYMAAGLYDRAEDMFNQLTDETDFRIGALQQLLQIYQATSEWQKA
IDVAERLVKLGKDKQRVEIAHFYCELALQHMASDDLDRAMTLLKKGAAADKNSARVSIMMGRVFMAKGEYAKAVESLQRV
ISQDRELVSETLEMLQTCYQQLGKTAEWAEFLQRAVEENTGADAELMLADIIEARDGSEAAQVYITRQLQRHPTMRVFHK
LMDYHLNEAEEGRAKESLMVLRDMVGEKVRSKPRYRCQKCGFTAYTLYWHCPSCRAWSTIKPIRGLDGL
>P0AB58 ~~~lapB~~~Lipopolysaccharide assembly protein B~~~COG2956
MLELLFLLLPVAAAYGWYMGRRSAQQNKQDEANRLSRDYVAGVNFLLSNQQDKAVDLFLDMLKEDTGTVEAHLTLGNLFR
SRGEVDRAIRIHQTLMESASLTYEQRLLAIQQLGRDYMAAGLYDRAEDMFNQLTDETDFRIGALQQLLQIYQATSEWQKA
IDVAERLVKLGKDKQRVEIAHFYCELALQHMASDDLDRAMTLLKKGAAADKNSARVSIMMGRVFMAKGEYAKAVESLQRV
ISQDRELVSETLEMLQTCYQQLGKTAEWAEFLQRAVEENTGADAELMLADIIEARDGSEAAQVYITRQLQRHPTMRVFHK
LMDYHLNEAEEGRAKESLMVLRDMVGEKVRSKPRYRCQKCGFTAYTLYWHCPSCRAWSTIKPIRGLDGL
>Q02PA2 3.4.11.1~~~lap~~~Aminopeptidase~~~
MSNKNNLRYALGALALSVSAASLAAPSEAQQFTEFWTPGKPNPSICKSPLLVSTPLGLPRCLQASNVVKRLQKLEDIASL
NDGNRAAATPGYQASVDYVKQTLQKAGYKVSVQPFPFTAYYPKGPGSLSATVPQPVTYEWEKDFTYLSQTEAGDVTAKVV
PVDLSLGAGNTSTSGCEAEDFANFPAGSIALIQRGTCNFEQKAENAAAAGAAGVIIFNQGNTDDRKGLENVTVGESYEGG
IPVIFATYDNGVAWSQTPDLQLHLVVDVVRKKTETYNVVAETRRGNPNNVVMVGAHLDSVFEGPGINDNGSGSAAQLEMA
VLLAKALPVNKVRFAWWGAEEAGLVGSTHYVQNLAPEEKKKIKAYLNFDMIGSPNFGNFIYDGDGSDFGLQGPPGSAAIE
RLFEAYFRLRGQQSEGTEIDFRSDYAEFFNSGIAFGGLFTGAEGLKTEEQAQKYGGTAGKAYDECYHSKCDGIANINQDA
LEIHSDAMAFVTSWLSLSTKVVDDEIAAAGQKAQSRSLQMQKSASQIERWGHDFIK
>Q9HZQ8 3.4.11.1~~~lap~~~Aminopeptidase~~~
MSNKNNLRYALGALALSVSAASLAAPSEAQQFTEFWTPGKPNPSICKSPLLVSTPLGLPRCLQASNVVKRLQKLEDIASL
NDGNRAAATPGYQASVDYVKQTLQKAGYKVSVQPFPFTAYYPKGPGSLSATVPQPVTYEWEKDFTYLSQTEAGDVTAKVV
PVDLSLGAGNTSTSGCEAEDFANFPAGSIALIQRGTCNFEQKAENAAAAGAAGVIIFNQGNTDDRKGLENVTVGESYEGG
IPVIFATYDNGVAWSQTPDLQLHLVVDVVRKKTETYNVVAETRRGNPNNVVMVGAHLDSVFEGPGINDNGSGSAAQLEMA
VLLAKALPVNKVRFAWWGAEEAGLVGSTHYVQNLAPEEKKKIKAYLNFDMIGSPNFGNFIYDGDGSDFGLQGPPGSAAIE
RLFEAYFRLRGQQSEGTEIDFRSDYAEFFNSGIAFGGLFTGAEGLKTEEQAQKYGGTAGKAYDECYHSKCDGIANINQDA
LEIHSDAMAFVTSWLSLSTKVVDDEIAAAGQKAQSRSLQMQKSASQIERWGHDFIK
>F9USS9 5.1.2.1~~~larA~~~Lactate racemase~~~COG3875
MVAIDLPYDKRTITAQIDDENYAGKLVSQAATYHNKLSEQETVEKSLDNPIGSDKLEELARGKHNIVIISSDHTRPVPSH
IITPILLRRLRSVAPDARIRILVATGFHRPSTHEELVNKYGEDIVNNEEIVMHVSTDDSSMVKIGQLPSGGDCIINKVAA
EADLLISEGFIESHFFAGFSGGRKSVLPGIASYKTIMANHSGEFINSPKARTGNLMHNSIHKDMVYAARTAKLAFIINVV
LDEDKKIIGSFAGDMEAAHKVGCDFVKELSSVPAIDCDIAISTNGGYPLDQNIYQAVKGMTAAEATNKEGGTIIMVAGAR
DGHGGEGFYHNLADVDDPKEFLDQAINTPRLKTIPDQWTAQIFARILVHHHVIFVSDLVDPDLITNMHMELAKTLDEAME
KAYAREGQAAKVTVIPDGLGVIVK
>D9TQ02 5.1.2.1~~~larA~~~Lactate racemase~~~COG3875
MANIEIPYGKSKLAFDLPDERIQGILRSKAGSYKVNMSEEDIVKRALENPIGTKRLQDLAEGKKNIVIITSDHTRPVPSR
ITLPLLLDEIRKKNKSANVKILIATGFHRGTTLQEMKAKFGEDLVENEQFVVHDSRNSENMELIGTLPSGGKLEINKLAV
EADLLVAEGFIEPHFFAGFSGGRKSILPGIASVQCILANHCSEFIKNPYARTGVLENNPIHRDMIYAAKKANLAFILNVV
IDSSHKIVNAFAGHSEKAHLKGCEFVSEIATVNAKPADIVITSNGGYPLDQNIYQSVKGMTAGEAACKDGGVIIIAAECA
DGHGGEGFYRWFKESKDPQDVMNKILSRGRDETLPDQWEAQILARILINHKVIMVTDSKNYEYVKDMFMTPAKDLGEALK
IAESIVNNDSKINVIPDGVSVIVREK
>F9UST0 2.5.1.143~~~larB~~~Pyridinium-3,5-biscarboxylic acid mononucleotide synthase~~~COG1691
MATTAEILQQVAAGQLSPTAAAQQLEAGKTAALGFANVDLDRQRRNGFPEVIYGAGKTATQIVGIVQALSQQTLPILTTR
LSAEKFAALQPALPTAVYHATAQCMTVGEQPAPKTPGYIAVVTAGTSDQPVAEEAAVTAETFGNRVERVYDVGVAGIHRL
FAKLDVIRGARVVIVIAGMEGALASVVGGLVDKPVIAVPTSVGYGTSFQGMTALLTMLNSCASGITVVNIDNGFGAAYSA
SMVNQM
>F9UST1 4.99.1.12~~~larC~~~Pyridinium-3,5-bisthiocarboxylic acid mononucleotide nickel insertion protein~~~COG1641
MQTLYLDAFSGISGDMFLGALLDLGLDFEQLKTELAKLHVHGYELTQQREAQSSIYGTSFDVQVAGGKDHGFVEHHHHQH
EAGHHHDHEARHLADIEALIDGSDLSDTVKHHAKAIFMEIAQAEAAVHHMPLAEVHFHEVGALDSIVDIVGCCIGLELMQ
IDTIMASPLSDGSGFINVAHGQMPVPVPAVMQMRVGSAIPIQQRLDVHTELITPTGMGLVKTLVREFGPLPENAVPTRVG
YGFGKRDTGGFNALRAVLFEKKKLSQQIVNRTADAVLMIEANLDDQTGEGLGYVMNQLLTAGAYDVFFTPIQMKKDRPAT
KLTVLGNVNDKDLLTKLILQETTTIGVRYQTWQRTIMQRHFLTVATPYGDVQVKVATYQDIEKKMPEYADCAQLAQQFHI
PFRTVYQAALVAVDQLDEEA
>F9UST3 ~~~larD~~~D/L-lactic acid transporter~~~COG0580
MVHQLIAEFMGTALMIIFGVGVHCSSVLKGTKYRGSGHIFAITTWGFGISVALFIFGNVCINPAMVLAQCLLGNIAWSLF
IPYSVAEVLGGVVGSVIVWIMYADHFKASTDEISPITIRNLFCTAPAVRNLPRNFFVELFDTFIFISGILAISEIKTPGI
VPIGVGLLVWAIGMGLGGPTGFAMNLARDMGPRIAHAILPIANKADSDWQYGIIVPGIAPFVGAAIAAWFMHGFFGIN
>F9UST4 4.4.1.37~~~larE~~~Pyridinium-3,5-bisthiocarboxylic acid mononucleotide synthase~~~COG1606
MATLATKKATLVAALKDLQRVTVAFSGGIDSTLVLKMALDVLGRDNVTAVVANSELFTDEEFDKAMSLAEELGANVQGTT
LDYLSDDHIKNNTPDSWYYAKKMFYSRLNDIAANNGSAAVLDGMIKNDENDYRPGLKARSEAGARSLLQEADFFKTDVRA
LAQELGLTNWNKVASCSVSSRFPYGTTLTHDNIAQVMAAEKYLRSLGFPTVRVRFHNDIARIELPEARIGDFLVFNDRVN
RQLQSLGFRYVTLDLGGFRSGRMNDTLTKAQLATFA
>H7C8I3 ~~~larA~~~Lariatin~~~
MTSQPSKKTYNAPSLVQRGKFARTTAGSQLVYREWVGHSNVIKPGP
>F9USS7 ~~~larMN~~~Probable fused nickel transport protein LarMN~~~COG0310
MHIPDNYLSPATCGTLVTAMAPVWTVAVLKVKVQIKKHHETLPMLGIAASLAFLIMMFNLPIPGGTTAHAVGGTLLAVLI
GPWAACLALTVTLLLQALLFGDGGILAFGANALNMAVIMPFVGYACYRLGQKWHHEKLGLAIGAYLGINMAALVAGIELG
LQPILAHTASGAPLYCPYGLNITIPAMLTAHLLVAGWVEVVFTLLVFQFVKRVAPTNLYQTPTRRNQRPWIALLLGLAVL
SPLGLLASNTAWGEWSPQELQQRLAQQHISTHAPQGMVHGFHFQALFSDYAIAGLPVSVGYILSAITAVLIFLLLIRGLQ
HETDTSQH
>Q890D1 7.2.2.-~~~larO~~~Nickel import ATP-binding protein LarO~~~COG1122
MIKLVNICYDYPDTCGLKDLSLTVNSGDFICLMGPNGSGKSTLLRLLSGLASPTSGAYQFHDQPITTTYLADAQNRQQLH
QRIGMVFQNTDVQLFNTSVTEEVAFGPRQLGLSAAMVAQRVADCLQLTDCANLADRVPYQLSGGEKKRVALASVLALNPE
ILLLDEPLNGLTIAAQQQMLTLLQRLQAAGKTIIMASHNYQQVQAVGERFIIFNSTHQVDADLTRADLDQQPARQAQLMT
L
>F9USS6 ~~~larQ~~~Nickel permease LarQ~~~COG0619
MKPTRPNTDIPAWLQTTTTTPTPAIKAKFWQRNQRHLRQLLSRLAQPAPVTATSHWRVAPQFKLIQLLLLVILIALSNNL
ILLWSLALLVGCQLLWLPPRQLRRFMGSWLISVGMAMLFVLPSYWLAGPTTLLFFGLKTSLMLANAQYYRLTTPFQDLLA
GLKALHCPDLLIMTLAIAITYLRMLGQHLLLTMEALELRTVAPTAHPYRLIGALFGNLYLKSYTYALELYAAMEARGFNG
HYVRSTGRRTHWRDYLALSPAIIVWILFIFWRH
>F9USS8 ~~~larR~~~Lactate racemization regulatory protein~~~COG0664
MVLTDIEYLLSYLEAHHVPTIKKKRHTYLTYHGLAEHYTYVLKDGIIKNSIILQDGREYNLSYIAKPDVISLLRDEVSRS
TDQPFNVRIESEYATFYQVNRVAFWKYVNSTPELQNYVKNYYRKKLSENILRLQRMVMNGKKGAICAFIYSLVDLFGRKV
NEGILIDFVVTNDDIAGFCGISSRSSVNRMLKELRTDGVITVKNHKFIIQDVSYLLDQIAN
>Q5ZZ30 2.4.2.31~~~Lart1~~~NAD(+)--arginine ADP-ribosyltransferase Lart1~~~
MYSKYPAFFLNKNIKSSSGVQFSNVVKIPSAIESLYRGDNNLTGIIFLLPTLITGVFCQNFPEVVDIEQIRLHKLTNLSN
DFHMVSMSEDPQIALDWGNGCFITIDPVSFSDYIVDVHATFSENQLNLPGRMEREKEHVALAVPFCSIKKITIHNKELAN
PFYLSIPQENHEAKMELNTLYGELISLLRKKYTQEVDEKEEQIALRTYAIRYLDFYAKFCGCDNPFDKTIAQLSELYPEF
MSNFLQSSHFSSKTGLMKEIVVNSLDNLFKEHPYTKSIDASYIYRVKESTTCYEDDWAKPVYD
>Q02L18 3.4.24.-~~~lasA~~~Protease LasA~~~
MQHKRSRALASPRSPFLFALLALAVGGTANAHDDGLPAFRYSAELLGQLQLPSVALPLNDELFLYGRDAEAFDLEAYLAL
NAPALRDKSEYLEHWSGYYSINPKVLLTLMVMQSGPLGAPDERALAAPLGRLSAKRGFDAQVRDVLQQLSRRYYGFEEYQ
LRQAAARKAVGEDGLNAASAALLGLLREGAKASAVQGGNPLGAYAQTFQRLFGTPAAELLQPRNRVARQLQAKAALAPPS
NLMQLPWRQGYSWQPNGAHSNTGSGYPYSSFDASYDWPRWGSATYSVVAAHAGTVRVLSRCQVRVTHPSGWATNYYHMDQ
IQVSNGQQVSADTKLGVYASNINTALCEGGSSTGPHLHFSLLYNGAFVSLQGASFGPYRINVGTSNYDNDCRRYYFYNQS
AGTTHCAFRPLYNPGLAL
>P14789 3.4.24.-~~~lasA~~~Protease LasA~~~
MQHKRSRAMASPRSPFLFVLLALAVGGTANAHDDGLPAFRYSAELLGQLQLPSVALPLNDDLFLYGRDAEAFDLEAYLAL
NAPALRDKSEYLEHWSGYYSINPKVLLTLMVMQSGPLGAPDERALAAPLGRLSAKRGFDAQVRDVLQQLSRRYYGFEEYQ
LRQAAARKAVGEDGLNAASAALLGLLREGAKVSAVQGGNPLGAYAQTFQRLFGTPAAELLQPSNRVARQLQAKAALAPPS
NLMQLPWRQGYSWQPNGAHSNTGSGYPYSSFDASYDWPRWGSATYSVVAAHAGTVRVLSRCQVRVTHPSGWATNYYHMDQ
IQVSNGQQVSADTKLGVYAGNINTALCEGGSSTGPHLHFSLLYNGAFVSLQGASFGPYRINVGTSNYDNDCRRYYFYNQS
AGTTHCAFRPLYNPGLAL
>P33883 2.3.1.184~~~lasI~~~Acyl-homoserine-lactone synthase~~~
MIVQIGRREEFDKKLLGEMHKLRAQVFKERKGWDVSVIDEMEIDGYDALSPYYMLIQEDTPEAQVFGCWRILDTTGPYML
KNTFPELLHGKEAPCSPHIWELSRFAINSGQKGSLGFSDCTLEAMRALARYSLQNDIQTLVTVTTVGVEKMMIRAGLDVS
RFGPHLKIGIERAVALRIELNAKTQIALYGGVLVEQRLAVS
>P25084 ~~~lasR~~~Transcriptional activator protein LasR~~~
MALVDGFLELERSSGKLEWSAILQKMASDLGFSKILFGLLPKDSQDYENAFIVGNYPAAWREHYDRAGYARVDPTVSHCT
QSVLPIFWEPSIYQTRKQHEFFEEASAAGLVYGLTMPLHGARGELGALSLSVEAENRAEANRFMESVLPTLWMLKDYALQ
SGAGLAFEHPVSKPVVLTSREKEVLQWCAIGKTSWEISVICNCSEANVNFHMGNIRRKFGVTSRRVAAIMAVNLGLITL
>B5GRC8 4.2.3.192~~~~~~Labda-7,13(16),14-triene synthase~~~
MGRRARSARSFGVSPLWGGVSVRSGDRGEAAVGGLWEVPDFWGLFPSRISPLAGEVESGTRVWLDGWRLVEEAGPGERLK
ASKVGRLVALAYPDAPADLLRWAADLFAWLTAFDDVHVEAPGVTTAELGPHMASFVGVLETGTAPGAAPTPFPAALAELL
DRARELLTPLQEERVRARLGKVFVAMLWEITTRERTVSTAEYETMRPHTFFSAVGAALVEPCAGLDLSHGVRADPGVRRL
TQALATLWERTNDLYSFAYEQRALGSVPRTLPWLIAQERGLPLDAAFAEAGRWCEEEAVLAHRLIGELSASAREGVPEYA
GAVAHAIGGTRRLYEVSDRWREE
>P9WQ77 2.6.1.36~~~lat~~~Probable L-lysine-epsilon aminotransferase~~~COG0160
MAAVVKSVALAGRPTTPDRVHEVLGRSMLVDGLDIVLDLTRSGGSYLVDAITGRRYLDMFTFVASSALGMNPPALVDDRE
FHAELMQAALNKPSNSDVYSVAMARFVETFARVLGDPALPHLFFVEGGALAVENALKAAFDWKSRHNQAHGIDPALGTQV
LHLRGAFHGRSGYTLSLTNTKPTITARFPKFDWPRIDAPYMRPGLDEPAMAALEAEALRQARAAFETRPHDIACFVAEPI
QGEGGDRHFRPEFFAAMRELCDEFDALLIFDEVQTGCGLTGTAWAYQQLDVAPDIVAFGKKTQVCGVMAGRRVDEVADNV
FAVPSRLNSTWGGNLTDMVRARRILEVIEAEGLFERAVQHGKYLRARLDELAADFPAVVLDPRGRGLMCAFSLPTTADRD
ELIRQLWQRAVIVLPAGADTVRFRPPLTVSTAEIDAAIAAVRSALPVVT
>Q01767 2.6.1.36~~~lat~~~L-lysine-epsilon aminotransferase~~~COG0160
MGEAARHPDGDFSDVGNLHAQDVHQALEQHMLVDGYDLVLDLDASSGVWLVDAVTQKRYLDLFSFFASAPLGINPPSIVE
DPAFMRELAVAAVNKPSNPDLYSVPYARFVKTFARVLGDPRLRRLFFVDGGALAVENALKAALDWKAQKLGLAEPDTDRL
QVLHLERSFHGRSGYTMSLTNTEPSKTARFPKFGWPRISSPALQHPPAEHTGANQEAERRALEAAREAFAAADGMIACFI
AEPIQGEGGDNHLSAEFLQAMQRLCHENDALFVLDEVQSGCGITGTAWAYQQLGLQPDLVAFGKKTQVCGVMGGGRIDEV
PENVFAVSSRISSTWGGNLADMVRATRLLETIERTQVFDTVVQRGKYFRDGLEDLAARHPSVVTNARGRGLMCAVDLPDT
RTRNEVLRLMYTEHQVIALPCGGRSLRFRPALTIAEHEIDQALQALASSVTPVAESV
>Q06379 ~~~lbpA~~~Lactoferrin-binding protein A~~~
MNKKHGFPLTLTALAIATAFPAYAAQAGGATPDAAQTQSLKEITVRAAKVGRRSKEATGLGKIVKTSETLNKEQVLGIRD
LTRYDPGVAVVEQGNGASGGYSIRGVDKNRVAVSVDGVAQIQAFTVQGSLSGYGGRGGSGAINEIEYENISTVEIDKGAG
SSDHGSGALGGAVAFRTKEAADLISDGKSWGIQAKTAYGSKNRQFMKSLGAGFSKDGWEGLLIRTERQGRETRPHGDIAD
GVEYGIDRLDAFRQTYDIKRKTREPFFSVEGERESKPVAKLAGYGKYLNNQLNRWVKERIEQNQPLSAEEEAQVREAQAR
HENLSAQAYTGGGRILPDPMDYRSGSWLAKLGYRFGGRHYVGGVFEDTKQRYDIRDMTEKQYYGTDEAEKFRDKSGVYDG
DDFRDGLYFVPNIEEWKGDKNLVRGIGLKYSRTKFIDEHHRRRRMGLLYRYENEAYSDNWADKAVLSFDKQGVATDNNTL
KLNCAVYPAVDKSCRASADKPYSYDSSDRFHYREQHNVLNASFEKSLKNKWTKHHLTLGFGYDASKAISRPEQLSHNAAR
ISESTGFDENNQDKYLLGKPEVVEGSVCGYIETLRSRKCVPRKINGSNIHISLNDRFSIGKYFDFSLGGRYDRKNFTTSE
ELVRSGRYVDRSWNSGILFKPNRHFSVSYRASSGFRTPSFQELFGIDIYHDYPKGWQRPALKSEKAANREIGLQWKGDFG
FLEISSFRNRYTDMIAVADHKTKLPNQAGQLTEIDIRDYYNAQNMSLQGVNILGKIDWNGVYGKLPEGLYTTLAYNRIKP
KSVSNRPGLSLRSYALDAVQPSRYVLGFGYDQPEGKWGANIMLTYSKGKNPDELAYLAGDQKRYSTKRASSSWSTADVSA
YLNLKKRLTLRAAIYNIGNYRYVTWESLRQTAESTANRHGGDSNYGRYAAPGRNFSLALEMKF
>A1XI29 5.5.1.-~~~lbtA~~~Putative C(50) carotenoid beta-cyclase subunit A~~~
MIGLSYLLVQVVSFAGILVIDHRWKLAAFRAPAAAALAVTASVALLLTWDVLGVRSGVFFRGQTDFMTGLLVAPEIPFEE
VVFLAFLSHLALVCAAGVSRAVDHARDSRAARASRPSRMTGERR
>A1XI30 ~~~lbtBC~~~C(50) beta-cyclic carotenoids biosynthesis protein LbtBC~~~
MTSLYTTLNLTMSIPVVAVALLAAWRLRGPERRRWLIGVGGALLILMILTAVFDNIMISAGLVAYDDSLTSGIRLGVAPI
EDFAYAVAAAVFVPSVWALLTASPRVGAEVGSPTVSGRGDALLTRAPEPGDDDEVRTPERPGTPGLLTTLFWSSRPVSWV
NTAAPFALAYFLATGGFDLVGVIGTIFFLVPYNLAMYGINDVFDYESDLRNPRKGGVEGSVLERSRHTATLVASAVTTVP
FLVYLVLTGTVESSLWLAASAFAVIAYSAKGLRFKEIPFLDSLTSAFHFVSPAIVGWTIAGAELTGGVWACLIAFMLWGA
ASQAFGAVQDVRFDREADLKSVATVLGARAAVWFALACYVAAVVVLLAAAPWPASGAAFAILPYLATVAAYVGVTDADAE
RTNEGWKRFLVLNMLAGFCVTQIVLWSVLVWS
>P80959 ~~~~~~Bacteriocin lactocin-705~~~
GMSGYIQGIPDFLKGYLHGISAANKHKKGRL
>P34034 ~~~lcnA~~~Bacteriocin leucocin-A~~~
MMNMKPTESYEQLDNSALEQVVGGKYYGNGVHCTKSGCSVNWGEAFSAGVHRLANGGNGFW
>P81052 ~~~~~~Bacteriocin leucocin-B~~~
KGKGFWSWASKATSWLTGPQQPGSPLLKKHR
>P81053 ~~~~~~Bacteriocin leucocin-C~~~
KNYGNGVHCTKKGCSVDWGYAWTNIANNSVMNGLTGGNAGWHN
>Q9HTH8 1.1.1.108~~~lcdH~~~L-carnitine dehydrogenase~~~
MSFVTEIKTFAALGSGVIGSGWIARALAHGLDVVAWDPAPGAEAALRARVANAWPALRKQGLAPGAAQERLRFVASIEEC
VGDADFIQESAPERLDLKLDLHARISAAARPDVLIGSSTSGLLPSEFYAEASHPERCLVGHPFNPVYLLPLVEVVGGERT
AAEAVRAAMRVYESLGMRPLHVRKEVPGFIADRLLEALWREALHLVNDGVATTGEIDDAIRFGAGLRWSFMGTFLTYTLA
GGNAGMRHFMAQFGPALQLPWTYLPAPELTEALIDRVVEGTAEQQGARSIAELERYRDDCLLAVLGAIRETKARHGFAFA
E
>D7URM0 1.1.1.108~~~lcdH~~~L-carnitine dehydrogenase~~~
MPFITHIKTFAALGSGVIGSGWVARALAHGLDVIAWDPAPGAEQALRQRVANAWPALEKQGLAAGAAQHRLSFVSSIEEC
VRDADFIQESAPERLDLKLDLHAKISAAAKPDAIIASSTSGLLPSEFYESSSHPERCVVGHPFNPVYLLPLVEIVGGRHT
APEAIEAAKGIYTELGMRPLHVRKEVPGFIADRLLEALWREALHLVNDGVATTGEIDDAIRFGAGLRWSFMGTFLTYTLA
GGDAGMRHFMQQFGPALKLPWTYLPAPELTERLIDEVVDGTAAQVGERSIAELERYRDDTLLAVLEAIGTSKAKHGMTFS
E
>D7UNT2 1.1.1.108~~~lcdH~~~L-carnitine dehydrogenase~~~
MSFITKAACVGGGVIGGAWVARFALAGIDVKIFDPHPEAERIIGEVMANAERAYAMLTMAPLPPKGKLTFCKSIEEAVEG
ADWIQESVPERLELKRGVITKIDAAARPDALIGSSTSGLLPSDLQSEMHHPERMFVAHPYNPVYLLPLVELVGGKKTSKA
TIERAMQGVEQIGMKGVVIAKEIEAFVGDRLLEALWREALWLIQDDICHTETLDNVMRYSFGMRWAQMGLFETYRIAGGE
AGMRHFLAQFGPCLKWPWTKFTDVVDLDDALVEKIGAQSDAQAAGRSIRELERIRDENLVGIMHALKSGNGGEGWGAGKL
LADFEAKLWANARKPEADLGDVKPLRILDTKVSAAWVDYNGHMTEHRYLQVFGDTSDGVLRLIGVDLDYVRDGHSYYTVE
THIRNLGDEASGEALYSTCQILSSDEKRLHIFSTIYNAATNEAVATAEQMMLHVDSKAGKAVAAPEAVLSKLRAITEAHA
QLQTPDGAGRFVGQKRA
>A3KKC4 1.14.99.54~~~~~~Lytic cellulose monooxygenase~~~
MARRSRYISLAAVMATLLSALGVTFLLGQGRAEAHGVAMMPGSRTYLCQLDAKTGTGALDPTNPACRSALDTSGATALYN
WFAVLDSNAGGRGAGYVPDGTLCSAGNRSPYDFRGYNAARSDWPRTHLTSGSTIQVNYSNWAAHPGDFRVYLTKPGWSPT
SELGWDDLELIETVTDPPQRGSAGADGGHYYWDLALPSGRSGDALIFMQWVRSDSQENFFSCSDVVFDGGNGEVTGIRGS
GGTPTPTPTPTTPPTTPPPTHSGSCMAVYNVENSWSGGFQGSVEVMNHGTEPLNGWAVQWKPGNGTTLGGVWNGSPTRGT
DGTVKVRNVDHNRVVPPDGSVTFGFTATSTGNDFPVGTIGCVAP
>P69451 6.2.1.3~~~fadD~~~Long-chain-fatty-acid--CoA ligase~~~COG0318
MKKVWLNRYPADVPTEINPDRYQSLVDMFEQSVARYADQPAFVNMGEVMTFRKLEERSRAFAAYLQQGLGLKKGDRVALM
MPNLLQYPVALFGILRAGMIVVNVNPLYTPRELEHQLNDSGASAIVIVSNFAHTLEKVVDKTAVQHVILTRMGDQLSTAK
GTVVNFVVKYIKRLVPKYHLPDAISFRSALHNGYRMQYVKPELVPEDLAFLQYTGGTTGVAKGAMLTHRNMLANLEQVNA
TYGPLLHPGKELVVTALPLYHIFALTINCLLFIELGGQNLLITNPRDIPGLVKELAKYPFTAITGVNTLFNALLNNKEFQ
QLDFSSLHLSAGGGMPVQQVVAERWVKLTGQYLLEGYGLTECAPLVSVNPYDIDYHSGSIGLPVPSTEAKLVDDDDNEVP
PGQPGELCVKGPQVMLGYWQRPDATDEIIKNGWLHTGDIAVMDEEGFLRIVDRKKDMILVSGFNVYPNEIEDVVMQHPGV
QEVAAVGVPSGSSGEAVKIFVVKKDPSLTEESLVTFCRRQLTGYKVPKLVEFRDELPKSNVGKILRRELRDEARGKVDNK
A
>O07610 6.2.1.3~~~lcfB~~~Long-chain-fatty-acid--CoA ligase~~~COG0318
MNLVSKLEETASEKPDSIACRFKDHMMTYQELNEYIQRFADGLQEAGMEKGDHLALLLGNSPDFIIAFFGALKAGIVVVP
INPLYTPTEIGYMLTNGDVKAIVGVSQLLPLYESMHESLPKVELVILCQTGEAEPEAADPEVRMKMTTFAKILRPTSAAK
QNQEPVPDDTAVILYTSGTTGKPKGAMLTHQNLYSNANDVAGYLGMDERDNVVCALPMFHVFCLTVCMNAPLMSGATVLI
EPQFSPASVFKLVKQQQATIFAGVPTMYNYLFQHENGKKDDFSSIRLCISGGASMPVALLTAFEEKFGVTILEGYGLSEA
SPVTCFNPFDRGRKPGSIGTSILHVENKVVDPLGRELPAHQVGELIVKGPNVMKGYYKMPMETEHALKDGWLYTGDLARR
DEDGYFYIVDRKKDMIIVGGYNVYPREVEEVLYSHPDVKEAVVIGVPDPQSGEAVKGYVVPKRSGVTEEDIMQHCEKHLA
KYKRPAAITFLDDIPKNATGKMLRRALRDILPQ
>Q5SKN9 6.2.1.3~~~~~~Long-chain-fatty-acid--CoA ligase~~~COG0318
MEGERMNAFPSTMMDEELNLWDFLERAAALFGRKEVVSRLHTGEVHRTTYAEVYQRARRLMGGLRALGVGVGDRVATLGF
NHFRHLEAYFAVPGMGAVLHTANPRLSPKEIAYILNHAEDKVLLFDPNLLPLVEAIRGELKTVQHFVVMDEKAPEGYLAY
EEALGEEADPVRVPERAACGMAYTTGTTGLPKGVVYSHRALVLHSLAASLVDGTALSEKDVVLPVVPMFHVNAWCLPYAA
TLVGAKQVLPGPRLDPASLVELFDGEGVTFTAGVPTVWLALADYLESTGHRLKTLRRLVVGGSAAPRSLIARFERMGVEV
RQGYGLTETSPVVVQNFVKSHLESLSEEEKLTLKAKTGLPIPLVRLRVADEEGRPVPKDGKALGEVQLKGPWITGGYYGN
EEATRSALTPDGFFRTGDIAVWDEEGYVEIKDRLKDLIKSGGEWISSVDLENALMGHPKVKEAAVVAIPHPKWQERPLAV
VVPRGEKPTPEELNEHLLKAGFAKWQLPDAYVFAEEIPRTSAGKFLKRALREQYKNYYGGA
>P36961 ~~~~~~Bacteriocin lactococcin-G subunit alpha~~~
GTWDDIGQGIGRVAYWVGKAMGNMSDVNQASRINRKKKH
>P36962 ~~~~~~Bacteriocin lactococcin-G subunit beta~~~
KKWGWLAWVDPAYEFIKGFGKGAIKEGNKDKWKNI
>Q838S1 1.14.99.53~~~~~~Lytic chitin monooxygenase~~~COG3397
MKKSLLTIVLAFSFVLGGAALAPTVSEAHGYVASPGSRAFFGSSAGGNLNTNVGRAQWEPQSIEAPKNTFITGKLASAGV
SGFEPLDEQTATRWHKTNITTGPLDITWNLTAQHRTASWDYYITKNGWNPNQPLDIKNFDKIASIDGKQEVPNKVVKQTI
NIPTDRKGYHVIYAVWGIGDTVNAFYQAIDVNIQ
>A3KIM2 1.14.99.53~~~~~~Lytic chitin monooxygenase~~~
MHAGRKTAVLIGAALAPVIAVSLPAASASAHGYISNPPSRQAQCAAGTVSCGDITYEPQSVEGPKGLTSCSGGNSRFAEL
DDDSKGWAVTPVPRNATFSWKLTAQHSTSTWEYYVGGQRIALFDDGGAKPGAVVDHQVDFGGLDGRQKVLAVWNVADTDN
AFYACIDVNVGG
>P0A3M7 ~~~lciA~~~Lactococcin-A immunity protein~~~
MKKKQIEFENELRSMLATALEKDISQEERNALNIAEKALDNSEYLPKIILNLRKALTPLAINRTLNHDLSELYKFITSSK
ASNKNLGGGLIMSWGRLF
>P0A313 ~~~lcnA~~~Bacteriocin lactococcin-A~~~
MKNQLNFNIVSDEELSEANGGKLTFIQSTAAGDLYYNTNTHKYVYQQTQNAFGAAANTIVNGWMGGAAGGFGLHH
>P0A312 ~~~lcnA~~~Bacteriocin lactococcin-A~~~
MKNQLNFNIVSDEELSEANGGKLTFIQSTAAGDLYYNTNTHKYVYQQTQNAFGAAANTIVNGWMGGAAGGFGLHH
>P35518 ~~~lcnB~~~Bacteriocin lactococcin-B~~~
MKNQLNFNIVSDEELAEVNGGSLQYVMSAGPYTWYKDTRTGKTICKQTIDTASYTFGVMAEGWGKTFH
>P0A3G5 ~~~lcnD~~~Lactococcin A secretion protein LcnD~~~
MFDKKLLESSELYDKRYRNFSTLIILPLFILLVGGVIFTFFAHKELTVISTGSIEPTKIVAKIQSTNANPIIENNLKEGE
AVKENSLLLKYNGTPEQTQLSELLTQKKQALDKKVQLDLLQRSLTNEKNEFPTADSFGYEKSFENYEAQVKSLEATIQKS
NQAVEDQNKSTESQKQAIQNQVATLQQAIQNYSEIENAVSSGGGVSQDNPYLSQYNSYQAQQATLEADLKNQKNPDETAK
QAAKSQEESLKSQFLSGLASSKDSLKSQIQSFNVQESSLTGSNAYDNSQSSQILTLKSQALSASNKEMTDLNSTLTDLET
KISLQKQDDQYSQVFAEQAGVLHVLPDILGMKKIPIGTPIAEIYPLLKSETQVNLTSYIPSTQISGMKVGQKVRFTVQQN
LPQPEILTGIINQIDSAPTAFKEGNAYKVSATTTINAKDLPNIRYGLQGKTVTIIGKKTYFNYFLDKIMGRGNQ
>P83002 ~~~~~~Bacteriocin lactococcin MMFII~~~
TSYGNGVHCNKSKCWIDVSELETYKAGTVSNPKDILW
>Q8NN75 ~~~lcoP~~~Betaine/ectoine transporter LcoP~~~COG1292
MSTNSGNNLPESQESPEEPHYPHDTHPGLVPGISVDAQRNKFGLDKTVFGVTAALILAFIAWGISSPDSVSSVSSTMFSW
AMTNTGWLLNFVMLIGIGTMLYIAFSRYGRIKLGTDEDEPEFSRFSWIAMMFGAGIGVGIFFFGPSEPLWHYLSPPPHTV
EGSTPESLHQALAQSHFHWGLSAWGLYALVGGALAYSSYRRGRVTLISSTFRSLFGEKTEGIAGRLIDMMAIIATLFGTA
ATLGLSAIQVGQGVQIISGASEITNNILIAIIAILTIGFIISSVSGVSKGIRYLSNLNISLTLGLVLFVFITGPTLFLLN
LIPSSVLEYGSEFLSMAGKSLSWGEETIEFQAGWTAFYWAWWIAWTPFVGMFIARISRGRTLREFALITMAIPSFILILA
FTIFGGTAITMNRENVDGFDGSSSKEQVLFDMFSNLPLYSITPFILIFVLAVFFVTSADSASVVMGTMSSQGNPAPNKLI
VVFWGLCMMGIAVVMLLTGGESALTGLQNLTILIAIPFALVLIVMAIAFIKDLSTDPAAIRQRYAKAAISNAVVRGLEEH
GDDFELSIEPAEEGRGAGATFDSTADHITDWYQRTDEEGNDVDYDFTTGKWADGWTPESTEEGEVDAKKD
>Q8NSD6 ~~~lcpA~~~Cell wall biosynthesis protein LcpA~~~COG1316
MTEKYRPVRDIKPAPAAMQSTKQAGHPVFRSVVAFVSVLVLLVSGLGYLAVGKVDGVASGNLNLGGGRGIQDGNAADGAT
DILLVGSDSRSDAQGNTLTEEELAMLRAGDEENDNTDTIMVIRVPNDGSSATAVSIPRDTYIHDDDYGNMKINGVYGAYK
DARRAELMEQGFTNESELETRAKDAGREGLIDAVSDLTGITVDHYAEVGLLGFVLLTDAVGGVEVCLNNAVDEPLSGANF
PAGRQTLGGSDALSYVRQRHDLPRGDLDRIVRQQSYMASLVNQVLSSGTLTNPAKLSALADAVTRSVVIDEGWEIMSFAT
QLQNLAGGNVTFATIPVTSIDGTGDYGESVVTIDVNQVHAFFQEALGEAEPAPEDGSDDQSADQAPDLSEVEVHVLNASY
VEGLANGIAAQLQELGYSIAETGNAAEGLYYESQILAAEEDSAKALAISEALGGLPIVANSSLDDNTVIVVSAGDYAGPT
AEANAVTSSTVGQPGADVGEPIESPEFDAGGDGPRCVN
>Q8NLN8 ~~~lcpB~~~Probable cell wall biosynthesis protein LcpB~~~COG1316
MDSPGQGEIARDSQGRPILDRYGRPVRVRPQPRQTPPTPRTPPVNETRVYQPRQTPPRQTPPRQTPPRQMPPRQTPPRQV
PPQQQYQQPGQIGQVRPQPPVIAGDGGRRRKAISFKPRGCLGTIAGVLAVGLVLVFVVTLWADSKLNRVDATPATQVANT
AGTNWLLVGSDSRQGLSDEDIERLGTGGDIGVGRTDTIMVLHMPRTGEPTLLSIPRDSYVNVPGWGMDKANAAFTVGGPE
LLTQTVEEATGLRIDHYAEIGMGGLANMVDAVGGVEMCPAEPMYDPLANLDIQAGCQEFDGAAALGYVRTRATALGDLDR
VVRQREFFSALLSTATSPGTLLNPFRTFPMISNAVGTFTVGEGDHVWHLARLALAMRGGIVTETVPIASFADYDVGNVAI
WDEAGAEALFSSMR
>Q3L8N0 1.13.-.-~~~lcp~~~Rubber oxygenase~~~
MDGFSRRRMLMTGGALGAVGALGAATRALARPLWTWSPSASVAGTGVGVDPEYVWDEEADPVLAAVIDRGEVPAVNALLK
QWTRNDQALPGGLPGDLREFMEHARRMPSWADKAALDRGAQFSKTKGIYVGALYGLGSGLMSTAIPRESRAVYYSKGGAD
MKDRIAKTARLGYDIGDLDAYLPHGSMIVTAVKTRMVHAAVRHLLPQSPAWSQTSGGQKIPISQADIMVTWHSLATFVMR
KMKQWGVRVNTADAEAYLHVWQVSAHMLGVSDEYIPATWDAANAQSKQVLDPILAHTPEGEALTEVLLGIVAELDAGLTR
PLIGAFSRYTLGGEVGDMIGLAKQPVLERLIATAWPLLVAFREGLIPLPAVPAVLWTLEEALRKFVLLFLSEGRRIAIDI
PDVNRPS
>P0C2V3 ~~~lcrD~~~Low calcium response locus protein D~~~
MNPHDLEWLNRIGERKDIMLAVLLLAVVFMMVLPLPPLVLDILIAVNMTISVVLLMIAIYINSPLQFSAFPAVLLVTTLF
RLALSVSTTRMILLQADAGQIVYTFGNFVVGGNFIVGIVIFLIITIVQFLVITKGSERVAEVSARFSLDAMPGKQMSIDG
DMRAGVIDVNEARERRATIEKESQMFGSMDGAMKFVKGDAIAGLIIIFVNILGGVTIGVTQKGLAAAEALQLYSILTVGD
GMVSQVPALLIAITAGIIVTRVSSEDSSDLGSDIGKQVVAQPKAMLIGGVLLLLFGLIPGFPTVTFLILALLVGCGGYML
SRKQSRNDEANQDLQSLLTSGSGAPAARTKAKTSGANKGRLGEQEAFAMTVPLLIDVDSSQQEALEAIALNDELVRVRRA
LYLDLGVPFPGIHLRFNEGMGEGEYLISLQEVPVARGELKAGYLLVRESVSQLELLGIPYEKGEHLLPDQETFWVSVEYE
ERLEKSQLEFFSHSQVLTWHLSHVLREYAEDFIGIQETRYLLEQMEGGYGELIKEVQRIVPLQRMTEILQRLVGEDISIR
NMRSILEAMVEWGQKEKDVVQLTEYIRSSLKRYICYKYANGNNILPAYLFDQEVEEKIRSGVRQTSAGSYLALDPAVTES
LLEQVRKTIGDLSQIQSKPVLIVSMDIRRYVRKLIESEYYGLPVLSYQELTQQINIQPLGRVCL
>P0C7U7 ~~~lcrV~~~Virulence-associated V antigen~~~
MIRAYEQNPQHFIEDLEKVRVEQLTGHGSSVLEELVQLVKDKNIDISIKYDPRKDSEVFANRVITDDIELLKKILAYFLP
EDAILKGGHYDNQLQNGIKRVKEFLESSPNTQWELRAFMAVMHFSLTADRIDDDILKVIVDSMNHHGDARSKLREELAEL
TAELKIYSVIQAEINKHLSSSGTINIHDKSINLMDKNLYGYTDEEIFKASAEYKILEKMPQTTIQVDGSEKKIVSIKDFL
GSENKRTGALGNLKNSYSYNKDNNELSHFATTCSDKSRPLNDLVSQKTTQLSDITSRFNSAIEALNRFIQKYDSVMQRLL
DDTSGK
>H6LBB0 1.1.1.436~~~lctB~~~Lactate dehydrogenase (NAD(+),ferredoxin) subunit LctB~~~COG2086
MKILVCIKQVPGTSNVEVDPETGVLIRDGVESKLNPYDLFGLETAFRLKEQLGGTITTLSMGPMQSKEVLMESFYMGADE
GCLLSDRKFGGADVVATSYTLAQGTKRLGDFDLIICGKQTTDGDTAQVGPEMAEFLGIPHVTNVIKILAADEKGLTLQMN
MEESLEIQRVPYPCLITVDKDIYTPRLPSYKRKLDISKNPEIKILTLKDMYDTNEKKYGLSGSPTQVERIFPPESNVEKT
SFEGDGKVLAKALLGILTEKKYLG
>H6LBB1 1.1.1.436~~~lctC~~~Lactate dehydrogenase (NAD(+),ferredoxin) subunit LctC~~~COG1145
MAGIKIIKENVDRETFEALAEICPFDAFSYENDKLEVTAACKMCKMCLKKGPEGVLILEEDEKVAIDKSLYRGITVYVDH
IEGQIHPVTFELIGKARELAAVIGHPVYALLMGTNITEKADELLKYGVDKVFVYDKPELKHFVIEPYANVLEDFIEKVKP
SSILVGATNVGRSLAPRVAARYRTGLTADCTILEMKENTDLVQIRPAFGGNIMAQIVTENTRPQFCTVRYKVFTAPERVN
EPWGDVEMMDIEKAKLVSAIEVMEVIKKEKGIDLSEAETIVAVGRGVKCEKDLDMIHEFAEKIGATVACTRPGIEAGWFD
ARLQIGLSGRTVKPKLIIALGISGAVQFAAGMQNSEYIIAINSDPKAPIFNIAHCGMVGDLYEILPELLTMIEGPENNKD
TETISIPEAIETPERMVV
>H6LBS1 1.1.1.436~~~lctD~~~Lactate dehydrogenase (NAD(+),ferredoxin) subunit LctD~~~COG0277
MNYKKVEASDIAAIKELIPAERVFVGTEIGEDFSHDELGSIHSYPEVLIKVTSTEEVSKIMKYAYEHNIPVVVRGSGTGL
VGACVPLFGGIMLETTLMNNILELDTENLTVTVEPGVLLMELSKFVEENDLFYPPDPGEKSATIAGNISTNAGGMRAVKY
GVTRDYVRGLTVVLANGEIIELGGKIVKNSSGYSLKDLVIGSEGTLCVITKAILKLLPLPKMTLSLLIPFENISDAAGIV
PKIIKSKAIPTAIEFMERQTILFAEDFLGKKFPDSSSNAYILLTFDGNTKEQVEAEYETVANLCLAEGAKDVYIVDTVER
KDSVWSARGAFLEAIKASTTEMDECDVVVPRNRIAEFIEFTHDLAKEMDVRIPSFGHAGDGNLHIYVCRDELCQADWEAK
LAEAMDRMYAKALTFEGLVSGEHGIGYAKRKYLLNDFGTEHLALMAGIKQTFDPKNLLNPKKVCQM
>P55910 ~~~lctP~~~L-lactate permease~~~COG1620
MWEQLYDPFGNEYVSALVALTPILFFLLALTVLKMKGILAAFLTLAVSFFVSVWAFHMPVEKAISSVLLGIGSGLWPIGY
IVLMAVWLYKIAVKTGKFTIIRSSIAGISPDQRLQLLLIGFCFNAFLEGAAGFGVPIAISAALLVELGFKPLKAAALCLI
ANAASGAFGAIGIPVITGAQIGDLSALELSRTLMWTLPMISFLIPFLLVFLLDRMKGIKQTWPALLVVSGGYTAVQTLTM
AVLGPELANILAALFSMGGLALFLRKWQPKEIYREEGAGDAGEKKAYRAADIAKAWSPFYILTAAITIWSLPAFKALFQE
GGLLYQSTLLFKMPFLHQQIMKMPPIAPSAMPLDAVFKVDLLSATGTAILAAVIVTGLFSKKFSSRDAFASLKETGKELW
VPIMTICFVMGFANLANFAGLSSSIGLALAKTGDLFPFVSPVLGWIGVFITGSVVSNNALFGHLQVVTGAQIGAGSDLLL
AANTAGGVMAKLVSPQSIAIAAAAVGQTGKESKLFKRTVAYSLILLLIICIWTFILARLGV
>Q55276 5.5.1.19~~~crtL~~~Lycopene beta cyclase~~~COG0644
MFDALVIGSGPAGLAIAAELAQRGLKVQGLSPVDPFHPWENTYGIWGPELDSLGLEHLFGHRWSNCVSYFGEAPVQHQYN
YGLFDRAQLQQHWLRQCEQGGLQWQLGKAAAIAHDSHHSCVTTAAGQELQARLVVDTTGHQAAFIQRPHSDAIAYQAAYG
IIGQFSQPPIEPHQFVLMDYRSDHLSPEERQLPPTFLYAMDLGNDVYFVEETSLAACPAIPYDRLKQRLYQRLATRGVTV
QVIQHEEYCLFPMNLPLPDLTQSVVGFGGAASMVHPASGYMVGALLRRAPDLANAIAAGLNASSSLTTAELATQAWRGLW
PTEKIRKHYIYQFGLEKLMRFSEAQLNHHFQTFFGLPKEQWYGFLTNTLSLPELIQAMLRLFAQAPNDVRWGLMEQQGRE
LQLFWQAIAAR
>P76008 3.4.17.13~~~ldcA~~~Murein tetrapeptide carboxypeptidase~~~COG1619
MSLFHLIAPSGYCIKQHAALRGIQRLTDAGHQVNNVEVIARRCERFAGTETERLEDLNSLARLTTPNTIVLAVRGGYGAS
RLLADIDWQALVARQQHDPLLICGHSDFTAIQCGLLAHGNVITFSGPMLVANFGADELNAFTEHHFWLALRNETFTIEWQ
GEGPTCRAEGTLWGGNLAMLISLIGTPWMPKIENGILVLEDINEHPFRVERMLLQLYHAGILPRQKAIILGSFSGSTPND
YDAGYNLESVYAFLRSRLSIPLITGLDFGHEQRTVTLPLGAHAILNNTREGTQLTISGHPVLKM
>Q9I2S7 4.1.1.18~~~ldcA~~~Lysine decarboxylase LdcA~~~
MYKDLKFPVLIVHRDIKADTVAGERVRGIAHELEQDGFSILSTASSAEGRIVASTHHGLACILVAAEGAGENQRLLQDVV
ELIRVARVRAPQLPIFALGEQVTIENAPAESMADLHQLRGILYLFEDTVPFLARQVARAARNYLAGLLPPFFRALVEHTA
QSNYSWHTPGHGGGVAYRKSPVGQAFHQFFGENTLRSDLSVSVPELGSLLDHTGPLAEAEDRAARNFGADHTFFVINGTS
TANKIVWHSMVGREDLVLVDRNCHKSILHSIIMTGAIPLYLTPERNELGIIGPIPLSEFSKQSIAAKIAASPLARGREPK
VKLAVVTNSTYDGLCYNAELIKQTLGDSVEVLHFDEAWYAYAAFHEFYDGRYGMGTSRSEEGPLVFATHSTHKMLAAFSQ
ASMIHVQDGGTRKLDVARFNEAFMMHISTSPQYGIIASLDVASAMMEGPAGRSLIQETFDEALSFRRALANVRQNLDRND
WWFGVWQPEQVEGTDQVGTHDWVLEPSADWHGFGDIAEDYVLLDPIKVTLTTPGLSAGGKLSEQGIPAAIVSRFLWERGL
VVEKTGLYSFLVLFSMGITKGKWSTLVTELLEFKRCYDANLPLLDVLPSVAQAGGKRYNGVGLRDLSDAMHASYRDNATA
KAMKRMYTVLPEVAMRPSEAYDKLVRGEVEAVPIARLEGRIAAVMLVPYPPGIPLIMPGERFTEATRSILDYLEFARTFE
RAFPGFDSDVHGLQHQDGPSGRCYTVECIKE
>P52095 4.1.1.18~~~ldcC~~~Constitutive lysine decarboxylase~~~COG1982
MNIIAIMGPHGVFYKDEPIKELESALVAQGFQIIWPQNSVDLLKFIEHNPRICGVIFDWDEYSLDLCSDINQLNEYLPLY
AFINTHSTMDVSVQDMRMALWFFEYALGQAEDIAIRMRQYTDEYLDNITPPFTKALFTYVKERKYTFCTPGHMGGTAYQK
SPVGCLFYDFFGGNTLKADVSISVTELGSLLDHTGPHLEAEEYIARTFGAEQSYIVTNGTSTSNKIVGMYAAPSGSTLLI
DRNCHKSLAHLLMMNDVVPVWLKPTRNALGILGGIPRREFTRDSIEEKVAATTQAQWPVHAVITNSTYDGLLYNTDWIKQ
TLDVPSIHFDSAWVPYTHFHPIYQGKSGMSGERVAGKVIFETQSTHKMLAALSQASLIHIKGEYDEEAFNEAFMMHTTTS
PSYPIVASVETAAAMLRGNPGKRLINRSVERALHFRKEVQRLREESDGWFFDIWQPPQVDEAECWPVAPGEQWHGFNDAD
ADHMFLDPVKVTILTPGMDEQGNMSEEGIPAALVAKFLDERGIVVEKTGPYNLLFLFSIGIDKTKAMGLLRGLTEFKRSY
DLNLRIKNMLPDLYAEDPDFYRNMRIQDLAQGIHKLIRKHDLPGLMLRAFDTLPEMIMTPHQAWQRQIKGEVETIALEQL
VGRVSANMILPYPPGVPLLMPGEMLTKESRTVLDFLLMLCSVGQHYPGFETDIHGAKQDEDGVYRVRVLKMAG
>P0A9H3 4.1.1.18~~~cadA~~~Inducible lysine decarboxylase~~~COG1982
MNVIAILNHMGVYFKEEPIRELHRALERLNFQIVYPNDRDDLLKLIENNARLCGVIFDWDKYNLELCEEISKMNENLPLY
AFANTYSTLDVSLNDLRLQISFFEYALGAAEDIANKIKQTTDEYINTILPPLTKALFKYVREGKYTFCTPGHMGGTAFQK
SPVGSLFYDFFGPNTMKSDISISVSELGSLLDHSGPHKEAEQYIARVFNADRSYMVTNGTSTANKIVGMYSAPAGSTILI
DRNCHKSLTHLMMMSDVTPIYFRPTRNAYGILGGIPQSEFQHATIAKRVKETPNATWPVHAVITNSTYDGLLYNTDFIKK
TLDVKSIHFDSAWVPYTNFSPIYEGKCGMSGGRVEGKVIYETQSTHKLLAAFSQASMIHVKGDVNEETFNEAYMMHTTTS
PHYGIVASTETAAAMMKGNAGKRLINGSIERAIKFRKEIKRLRTESDGWFFDVWQPDHIDTTECWPLRSDSTWHGFKNID
NEHMYLDPIKVTLLTPGMEKDGTMSDFGIPASIVAKYLDEHGIVVEKTGPYNLLFLFSIGIDKTKALSLLRALTDFKRAF
DLNLRVKNMLPSLYREDPEFYENMRIQELAQNIHKLIVHHNLPDLMYRAFEVLPTMVMTPYAAFQKELHGMTEEVYLDEM
VGRINANMILPYPPGVPLVMPGEMITEESRPVLEFLQMLCEIGAHYPGFETDIHGAYRQADGRYTVKVLKEESKK
>O34851 3.4.16.-~~~ykfA~~~Probable murein peptide carboxypeptidase~~~COG1619
MKGVFSLNYKPKALNKGDTVGVIAPASPPDPKKLDTALLFLEELGLQVKLGKALKNQHGYLAGQDDERLADLHEMFRDDE
VKAVLCACGGFGTGRIAAGIDFSLIRKHPKIFWGYSDITFLHTAIHQNTGLVTFHGPMLSTDIGLDDVHPLTKASYKQLF
QETEFTYTEELSPLTELVPGKAEGELVGGNLSLLTSTLGTPFEIDTRGKLLFIEDIDEEPYQIDRMLNQLKMGGKLTDAA
GILVCDFHNCVPVKREKSLSLEQVLEDYIISAGRPALRGFKIGHCSPSIAVPIGAKAAMNTAEKTAVIEAGVSEGALKT
>Q9HTZ1 3.4.17.13~~~~~~Murein tetrapeptide carboxypeptidase~~~
MTSRPSSDQTWQPIDGRVALIAPASAIATDVLEATLRQLEVHGVDYHLGRHVEARYRYLAGTVEQRLEDLHNAFDMPDIT
AVWCLRGGYGCGQLLPGLDWGRLQAASPRPLIGFSDISVLLSAFHRHGLPAIHGPVATGLGLSPLSAPREQQERLASLAS
VSRLLAGIDHELPVQHLGGHKQRVEGALIGGNLTALACMAGTLGGLHAPAGSILVLEDVGEPYYRLERSLWQLLESIDAR
QLGAICLGSFTDCPRKEVAHSLERIFGEYAAAIEVPLYHHLPSGHGAQNRAWPYGKTAVLEGNRLRW
>Q81EP4 1.1.1.27~~~ldh1~~~L-lactate dehydrogenase 1~~~
MKKGINRVVLVGTGAVGCSYAYCMINQAVAEEFVLVDVNEAKAEGEAMDLSHAVPFAPAPTRVWKGSYEDCKDADLVVIT
AGLPQKPGETRLDLVEKNAKIFKQIVRSIMDSGFDGIFLIATNPVDILTYVTWKESGLPKERVIGSGTTLDSARFRYMLG
EYFDIGPHNIHAYIIGEHGDTELPVWSHVSVGIQKLQTLLEKDNTYNQEDLDKIFINVRDAAYHIIERKGATYYGIGMSL
LRVTKAILNDENSVLTVSAYLEGQYGQKDVYIGVPAVLNRGGVREILEVELSEDEELKFDHSVQVLKETMAPVL
>P04034 1.1.1.27~~~ldh1~~~L-lactate dehydrogenase 1~~~
MADKQRKKVILVGDGAVGSSYAFALVNQGIAQELGIVDLFKEKTQGDAEDLSHALAFTSPKKIYSADYSDASDADLVVLT
SGAPQKPGETRLDLVEKNLRITKDVVTKIVASGFKGIFLVAANPVDILTYATWKFSGFPKNRVVGSGTSLDTARFRQALA
EKVDVDARSIHAYIMGEHGDSEFAVWSHANVAGVKLEQWFQENDYLNEAEIVKLFESVRDAAYSIIAKKGATFYGVAVAL
ARITKAILDDEHAVLPVSVFQDGQYGVSDCYLGQPAVVGAEGVVNPIHIPLNDAEMQKMEASGAQLKAIIDEAFAKEEFA
SAVKN
>P14561 1.1.1.27~~~ldh1~~~L-lactate dehydrogenase 1~~~
MKQRNVNRVALIGAGSVGSSYAFALLNQSITEELVIIDLNENKAMGDAMDLNHGKVFAPNPTKTWYGTYSDCKDADIVCI
CAGANQKPGETRLDLVEKNLRIFKGIVEEIMASGFDGIFLIATNPVDILTYATWKFSGLPKERIIGSGTILDTGRFRFLL
GEYFDIAPANVHAYIIGEHGDTELPVWSHADIGGISITELIKRNPEYTMKDLDELFINVRDAAYQIIEKKGATFYGIAMG
LARITKAILNNENSVLTVSTYLDGEYGTEDVYMGVPAVVNRNGIREIVELTLNEQERQQFKHSANVLKEILAPNFKEQ
>Q5HJD7 1.1.1.27~~~ldh1~~~L-lactate dehydrogenase 1~~~
MNKFKGNKVVLIGNGAVGSSYAFSLVNQSIVDELVIIDLDTEKVRGDVMDLKHATPYSPTTVRVKAGEYSDCHDADLVVI
CAGAAQKPGETRLDLVSKNLKIFKSIVGEVMASKFDGIFLVATNPVDILAYATWKFSGLPKERVIGSGTILDSARFRLLL
SEAFDVAPRSVDAQIIGEHGDTELPVWSHANIAGQPLKTLLEQRPEGKAQIEQIFVQTRDAAYDIIQAKGATYYGVAMGL
ARITEAIFRNEDAVLTVSALLEGEYEEEDVYIGVPAVINRNGIRNVVEIPLNDEEQSKFAHSAKTLKDIMAEAEELK
>A6QDL6 1.1.1.27~~~ldh1~~~L-lactate dehydrogenase 1~~~
MNKFKGNKVVLIGNGAVGSSYAFSLVNQSIVDELVIIDLDTEKVRGDVMDLKHATPYSPTTVRVKAGEYSDCHDADLVVI
CAGAAQKPGETRLDLVSKNLKIFKSIVGEVMASKFDGIFLVATNPVDILAYATWKFSGLPKERVIGSGTILDSARFRLLL
SEAFDVAPRSVDAQIIGEHGDTELPVWSHANIAGQPLKTLLEQRPEGKAQIEQIFVQTRDAAYDIIQAKGATYYGVAMGL
ARITEAIFRNEDAVLTVSALLEGEYEEEDVYIGVPAVINRNGIRNVVEIPLNDEEQSKFAHSAKTLKDIMAEAEELK
>P65256 1.1.1.27~~~ldh1~~~L-lactate dehydrogenase 1~~~
MNKFKGNKVVLIGNGAVGSSYAFSLVNQSIVDELVIIDLDTEKVRGDVMDLKHATPYSPTTVRVKAGEYSDCHDADLVVI
CAGAAQKPGETRLDLVSKNLKIFKSIVGEVMASKFDGIFLVATNPVDILAYATWKFSGLPKERVIGSGTILDSARFRLLL
SEAFDVAPRSVDAQIIGEHGDTELPVWSHANIAGQPLKTLLEQRPEGKAQIEQIFVQTRDAAYDIIQAKGATYYGVAMGL
ARITEAIFRNEDAVLTVSALLEGEYDEEDVYIGVPAVINRNGIRNVVEIPLNDEEQSKFAHSAKTLKDIMAEAEELK
>E8ME30 1.1.1.27~~~ldh2~~~L-lactate dehydrogenase 2~~~
MAETTVKPTKLAVIGAGAVGSTLAFAAAQRGIAREIVLEDIAKERVEAEVLDMQHGSSFYPTVSIDGSDDPEICRDADMV
VITAGPRQKPGQSRLELVGATVNILKAIMPNLVKVAPNAIYMLITNPVDIATHVAQKLTGLPENQIFGSGTNLDSARLRF
LIAQQTGVNVKNVHAYIAGEHGDSEVPLWESATIGGVPMCDWTPLPGHDPLDADKREEIHQEVKNAAYKIINGKGATNYA
IGMSGVDIIEAVLHDTNRILPVSSMLKDFHGISDICMSVPTLLNRQGVNNTINTPVSDKELAALKRSAETLKETAAQFGF
>P0CW93 1.1.1.27~~~ldh2~~~L-lactate dehydrogenase 2~~~
MAETTVKPTKLAVIGAGAVGSTLAFAAAQRGIAREIVLEDIAKERVEAEVLDMQHGSSFYPTVSIDGSDDPEICRDADMV
VITAGPRQKPGQSRLELVGATVNILKAIMPNLVKVAPNAIYMLITNPVDIATHVAQKLTGLPENQIFGSGTNLDSARLRF
LIAQQTGVNVKNVHAYIAGEHGDSEVPLWESATIGGVPMCDWTPLPGHDPLDADKREEIHQEVKNAAYKIINGKGATNYA
IGMSGVDIIEAVLHDTNRILPVSSMLKDFHGISDICMSVPTLLNRQGVNNTINTPVSDKELAALKRSAETLKETAAQFGF
>Q5HCV0 1.1.1.27~~~ldh2~~~L-lactate dehydrogenase 2~~~
MKTFGKKVVLIGDGSVGSSYAFAMVTQGVADEFVIIDIAKDKVKADVQDLNHGTVHSPSPVDVKAGEYEDCKDADLVVIT
AGAPQKPGETRLQLVEKNTKIMKSIVKSVMDSGFDGYFLIAANPVDILTRFVKEYTGLPAERVIGSGTVLDSARLQYLIS
QELGVAPSSVDASIIGEHGDTELAVWSQANVAGISVYDTLKEQTGSEAKAEEIYVNTRDAAYEIIQAKGSTYYGIALALM
RISKAILNNENNVLNVSIQLDGQYGGHKGVYLGVPTLVNQHGAVKIYEMPLSAEEQALFDKSVKTLEDTFDSIKYLLED
>A6QK89 1.1.1.27~~~ldh2~~~L-lactate dehydrogenase 2~~~
MKTFGKKVVLIGDGSVGSSYAFAMVTQGVADEFVIIDIAKDKVKADVQDLNHGTVHSPSPVDVKAGEYEDCKDADLVVIT
AGAPQKPGETRLQLVEKNTKIMKSIVKSVMDSGFDGYFLIAANPVDILTRFVKEYTGLPAERVIGSGTVLDSARLQYLIS
QELGVAPSSVDASIIGEHGDTELAVWSQANVAGISVYDTLKEQTGSEAKAEEIYVNTRDAAYEIIQAKGSTYYGIALALM
RISKAILNNENNVLNVSIQLDGQYGGHKGVYLGVPTLVNQHGAVKIYEMPLSAEEQALFDKSVKTLEDTFDSIKYLLED
>P99119 1.1.1.27~~~ldh2~~~L-lactate dehydrogenase 2~~~
MKTFGKKVVLIGDGSVGSSYAFAMVTQGVADEFVIIDIAKDKVKADVQDLNHGTVHSPSPVDVKAGEYEDCKDADLVVIT
AGAPQKPGETRLQLVEKNTKIMKSIVKSVMDSGFDGYFLIAANPVDILTRFVKEYTGLPAERVIGSGTVLDSARLQYLIS
QELGVAPSSVDASIIGEHGDTELAVWSQANVAGISVYDTLKEQTGSEAKAEEIYVNTRDAAYEIIQAKGSTYYGIALALM
RISKAILNNENNVLNVSIQLDGQYGGHKGVYLGVPTLVNQHGAVKIYEMPLSAEEQALFDKSVKILEDTFDSIKYLLED
>P52643 1.1.1.28~~~ldhA~~~D-lactate dehydrogenase~~~COG1052
MKLAVYSTKQYDKKYLQQVNESFGFELEFFDFLLTEKTAKTANGCEAVCIFVNDDGSRPVLEELKKHGVKYIALRCAGFN
NVDLDAAKELGLKVVRVPAYDPEAVAEHAIGMMMTLNRRIHRAYQRTRDANFSLEGLTGFTMYGKTAGVIGTGKIGVAML
RILKGFGMRLLAFDPYPSAAALELGVEYVDLPTLFSESDVISLHCPLTPENYHLLNEAAFEQMKNGVMIVNTSRGALIDS
QAAIEALKNQKIGSLGMDVYENERDLFFEDKSNDVIQDDVFRRLSACHNVLFTGHQAFLTAEALTSISQTTLQNLSNLEK
GETCPNELV
>P26297 1.1.1.28~~~ldhA~~~D-lactate dehydrogenase~~~COG1052
MTKIFAYAIREDEKPFLKEWEDAHKDVEVEYTDKLLTPETVALAKGADGVVVYQQLDYTAETLQALADNGITKMSLRNVG
VDNIDMAKAKELGFQITNVPVYSPNAIAEHAAIQAARILRQDKAMDEKVARHDLRWAPTIGREVRDQVVGVIGTGHIGQV
FMQIMEGFGAKVIAYDIFRNPELEKKGYYVDSLDDLYKQADVISLHVPDVPANVHMINDESIAKMKQDVVIVNVSRGPLV
DTDAVIRGLDSGKIFGYAMDVYEGEVGIFNEDWEGKEFPDARLADLIARPNVLVTPHTAFYTTHAVRNMVVKAFDNNLEL
VEGKEAETPVKVG
>P30901 1.1.1.28~~~~~~D-lactate dehydrogenase~~~COG1052
MTKVFAYAIRKDEEPFLNEWKEAHKDIDVDYTDKLLTPETAKLAKGADGVVVYQQLDYTADTLQALADAGVTKMSLRNVG
VDNIDMDKAKELGFQITNVPVYSPNAIAEHAAIQAARVLRQDKRMDEKMAKRDLRWAPTIGREVRDQVVGVVGTGHIGQV
FMRIMEGFGAKVIAYDIFKNPELEKKGYYVDSLDDLYKQADVISLHVPDVPANVHMINDKSIAEMKDGVVIVNCSRGRLV
DTDAVIRGLDSGKIFGFVMDTYEDEVGVFNKDWEGKEFPDKRLADLIDRPNVLVTPHTAFYTTHAVRNMVVKAFNNNLKL
INGEKPDSPVALNKNKF
>P26298 1.1.1.28~~~~~~D-lactate dehydrogenase~~~
MKIIAYAVRDDERPFFDTWMKENPDVEVKLVPELLTEDNVDLAKGFDGADVYQQKDYTAEVLNKLADEGVKNISLRNVGV
DNLDVPTVKARGLNISNVPAYSPNAIAELSVTQLMQLLRQTPMFNKKLAKQDFRWAPDIAKELNTMTVGVIGTGRIGRAA
IDIFKGFGAKVIGYDVYRNAELEKEGMYVDTLDELYAQADVITLHVPALKDNYHMLNADAFSKMKDGAYILNFARGTLID
SEDLIKALDSGKVAGAALVTYEYETKIFNKDLEGQTIDDKVFMNLFNRDNVLITPHTAFYTETAVHNMVHVSMNSNKQFI
ETGKADTQVKFD
>Q59642 1.1.1.28~~~ldhD~~~D-lactate/D-glycerate dehydrogenase~~~
MKIIAYGIRDDEKPYLDEWVTKNHIEVKAVPDLLDSSNIDLAKDYDGVVAYQQKPYTADLFDKMHEFGIHAFSLRNVGLD
NVPADALKKNDIKISNVPAYSPRAIAELSVTQLLALLRKIPEFEYKMAHGDYRWEPDIGLELNQMTVGVIGTGRIGRAAI
DIFKPFGAKVIAYDVFRNPALEKEGMYVDTLEELYQQANVITLHVPALKDNYHMLDEKAFGQMQDGTFILNFARGTLVDT
PALLKALDSGKVAGAALDTYENEVGIFDVDHGDQPIDDPVFNDLMSRRNVMITPHAAFYTRPAVKNMVQIALDNNRDLIE
KNSSKNEVKFE
>P99116 1.1.1.28~~~ldhD~~~D-lactate dehydrogenase~~~
MTKIMFFGTRDYEKEMALNWGKKNNVEVTTSKELLSSATVDQLKDYDGVTTMQFGKLENDVYPKLESYGIKQIAQRTAGF
DMYDLDLAKKHNIVISNVPSYSPETIAEYSVSIALQLVRRFPDIERRVQTHDFTWQAEIMSKPVKNMTVAIIGTGRIGAA
TAKIYAGFGATITAYDAYPNKDLDFLTYKDSVKEAIKDADIISLHVPANKESYHLFDKAMFDHVKKGAILVNAARGAVIN
TPDLIAAVNDGTLLGAAIDTYENEAAYFTNDWTNKDIDDKTLLELIEHERILVTPHIAFFSDEAVQNLVEGGLNAALSVI
NTGTCETRLN
>P72357 1.1.1.28~~~ldhD~~~D-lactate dehydrogenase~~~
MTKIMFFGTRDYEKEMALNWGKKNNVEVTTSKELLSSATVDQLKDYDGVTTMQFGKLENDVYPKLESYGIKQIAQRTAGF
DMYDLDLAKKHNIVISNVPSYSPETIAEYSVSIALQLVRRFPDIERRVQAHDFTWQAEIMSKPVKNMTVAIIGTGRIGAA
TAKIYAGFGATITAYDAYPNKDLDFLTYKDSVKEAIKDADIISLHVPANKESYHLFDKAMFDHVKKGAILVNAARGAVIN
TPDLIAAVNDGTLLGAAIDTYENEAAYFTNDWTNKDIDDKTLLELIEHERILVTPHIAFFSDEAVQNLVEGGLNAALSVI
NTGTCETRLN
>O83080 1.1.1.28~~~ldhD~~~D-lactate dehydrogenase~~~COG1052
MRCVVFNLREEEAPYVEKWKQSHPGVVVDTYEEPLTAKNKELLKGYEGLVVMQFLAMEDEVYDYMGACKLKVLSTRTAGF
DMYNATLLKKHGIRLTNVPSYSPNAIGEYALAAALQLTRHAREIETFVRKRDFRWQKPILSKELRCSRVGILGTGRIGQA
AARLFKGVGAQVVGFDPYPNDAAKEWLTYVSMDELLSTSDVISLHMPATKDSHHLINAKTIAQMKDGVYLVNTARGAVID
SQALLDSLDKGKIAGAALDAYEFEGPYIPKDNGNNPITDTVYARLVAHERIIYTPHIAFYTETAIENMVFNSLDACTTVL
RGEPCAAEIKL
>P13714 1.1.1.27~~~ldh~~~L-lactate dehydrogenase~~~COG0039
MNKHVNKVALIGAGFVGSSYAFALINQGITDELVVIDVNKEKAMGDVMDLNHGKAFAPQPVKTSYGTYEDCKDADIVCIC
AGANQKPGETRLELVEKNLKIFKGIVSEVMASGFDGIFLVATNPVDILTYATWKFSGLPKERVIGSGTTLDSARFRFMLS
EYFGAAPQNVHAHIIGEHGDTELPVWSHANVGGVPVSELVEKNDAYKQEELDQIVDDVKNAAYHIIEKKGATYYGVAMSL
ARITKAILHNENSILTVSTYLDGQYGADDVYIGVPAVVNRGGIAGITELNLNEKEKEQFLHSAGVLKNILKPHFAEQKVN
>Q07251 1.1.1.27~~~ldh~~~L-lactate dehydrogenase~~~COG2055
MKISLTSARQLARDILAAQQVPADIADDVAEHLVESDRCGYISHGLSILPNYRTALDGHSVNPQGRAKCVLDQGTLMVFD
GDGGFGQHVGKSVMQAAIERVRQHGHCIVTLRRSHHLGRMGHYGEMAAAAGFVLLSFTNVINRAPVVAPFGGRVARLTTN
PLCFAGPMPNGRPPLVVDIATSAIAINKARVLAEKGEPAPEGSIIGADGNPTTDASTMFGEHPGALLPFGGHKGYALGVV
AELLAGVLSGGGTIQPDNPRGGVATNNLFAVLLNPALDLGLDWQSAEVEAFVRYLHDTPPAPGVDRVQYPGEYEAANRAQ
ASDTLNINPAIWRNLERLAQSLNVAVPTA
>P50933 1.1.1.27~~~ldh~~~L-lactate dehydrogenase~~~COG0039
MKVGVVGTGFVGSTAAFALVLRGSCSELVLVDRDEDRAQAEAEDIAHAAPVSHGTRVWHGGHSELADAQVVILTAGANQK
PGESRLDLLEKNADIFRELVPQITRAAPDAVLLVTSNPVDLLTDLATQLAPGQPVIGSGTVLDSARFRHLMAQHAGVDGT
HAHGYVLGEHGDSEVLAWSSAMVAGMPVADFMQAQNLPWNEQVRAKIDEGTRNAAASIIEGKRATYYGIGAALARITEAV
LRDRRAVLTVSAPTPEYGVSLSLPRVVGRQGVLSTLHPKLTGDEQQKLEQSAGVLRGFKQQLGL
>P00344 1.1.1.27~~~ldh~~~L-lactate dehydrogenase~~~
MKNNGGARVVVIGAGFVGASYVFALMNQGIADEIVLIDANESKAIGDAMDFNHGKVFAPKPVDIWHGDYDDCRDADLVVI
CAGANQKPGETRLDLVDKNIAIFRSIVESVMASGFQGLFLVATNPVDILTYATWKFSGLPHERVIGSGTILDTARFRFLL
GEYFSVAPQNVHAYIIGEHGDTELPVWSQAYIGVMPIRKLVESKGEEAQKDLERIFVNVRDAAYQIIEKKGATYYGIAMG
LARVTRAILHNENAILTVSAYLDGLYGERDVYIGVPAVINRNGIREVIEIELNDDEKNRFHHSAATLKSVLARAFTR
>P00343 1.1.1.27~~~ldh~~~L-lactate dehydrogenase~~~COG0039
MASITDKDHQKVILVGDGAVGSSYAYAMVLQGIAQEIGIVDIFKDKTKGDAIDLSNALPFTSPKKIYSAEYSDAKDADLV
VITAGAPQKPGETRLDLVNKNLKILKSIVDPIVDSGFNGIFLVAANPVDILTYATWKLSGFPKNRVVGSGTSLDTARFRQ
SIAEMVNVDARSVHAYIMGEHGDTEFPVWSHANIGGVTIAEWVKAHPEIKEDKLVKMFEDVRDAAYEIIKLKGATFYGIA
TALARISKAILNDENAVLPLSVYMDGQYGLNDIYIGTPAVINRNGIQNILEIPLTDHEEESMQKSASQLKKVLTDAFAKN
DIETRQ
>O32765 1.1.1.27~~~ldh~~~L-lactate dehydrogenase~~~COG0039
MAREEKPRKVILVGDGAVGSTFAFSMVQQGIAEELGIIDIAKEHVEGDAIDLADATPWTSPKNIYAADYPDCKDADLVVI
TAGAPQKPGETRLDLVNKNLKILSSIVEPVVESGFEGIFLVVANPVDILTHATWRMSGFPKDRVIGSGTSLDTGRLQKVI
GKMENVDPSSVNAYMLGEHGDTEFPAWSYNNVAGVKVADWVKAHNMPESKLEDIHQEVKDMAYDIINKKGATFYGIGTAS
AMIAKAILNDEHRVLPLSVPMDGEYGLHDLHIGTPAVVGRKGLEQVIEMPLSDKEQELMTASADQLKKVMDKAFKETGVK
VRQ
>D8KFT1 1.1.1.27~~~ldh~~~L-lactate dehydrogenase~~~
MKITSRKVVVIGTGFVGTSIAYSMINQGLVNELVLIDVNQDKAEGEALDLLDGISWAQENVIVRAGNYKDCENADIVVIT
AGVNQKPGQSRLDLVNTNAKIMRSIVTQVMDSGFDGIFVIASNPVDILTYVAWETSGLDQSRIVGTGTTLDTTRFRKELA
TKLEIDPRSVHGYIIGEHGDSEVAVWSHTTIGGKPILEFIVKNKKIGLEDLSNLSNKVKNAAYEIIDKKQATYYGIGMST
ARIVKAILNNEQVILPVSAYLRGEYGQEGVFTGVPSVVNQNGVREIIELNIDAYEMKQFEKSVSQLKEVIESIK
>P56511 1.1.1.27~~~ldh~~~L-lactate dehydrogenase~~~
MSSMPNHQKVVLVGDGAVGSSYAFAMAQQGIAEEFVIVDVVKDRTKGDALDLEDAQAFTAPKKIYSGEYSDCKDADLVVI
TAGAPQKPGESRLDLVNKNLNILSSIVKPVVDSGFDGIFLVAANPVDILTYATWKFSGFPKERVIGSGTSLDSSRLRVAL
GKQFNVDPRSVDAYIMGEHGDSEFAAYSTATIGTRPVRDVAKEQGVSDDDLAKLEDGVRNKAYDIINLKGATFYGIGTAL
MRISKAILRDENAVLPVGAYMDGQYGLNDIYIGTPAIIGGTGLKQIIESPLSADELKKMQDSAATLKKVLNDGLAELENK
>P0C0J3 1.1.1.27~~~ldh~~~L-lactate dehydrogenase~~~COG0039
MKPIKIALIGAGNVGNSFLYAAMNQGLASEYGIIDINPDFADGNAFDFEDASASLPFPISVSRYEYKDLKDADFIVITAG
RPQKPGETRLELVADNIRIIREIALKVKESGFSGISIIVANPVDIITRAYRDASGFSDQKVIGSGTVLDTARLQFAIAKR
AKVSPNSVQAYVMGEHGDSSFVAYSNIKIAGECFCAYSKLTGIDSSNYEKELEYPVSRRAYEIINRKRATFYGIGAAIAK
IVSNIIKDTKNIMIAGANLRGEYGFHGVNIGVPVVLGANGIEKIIEISLNDKEKEKFAKSVAIIDKIYQDAIKNI
>P78007 1.1.1.27~~~ldh~~~L-lactate dehydrogenase~~~
MKSLKVALIGSGAVGTSFLYAAMSRGLASEYMVIDINEKSQVGNVFDLQDAVPSSPQYSKVIAGDYKQLKDYDFIFIGAG
RPQKQGGETRLQLLEGNVEIMKNIAKAVKESGFKGITLIASNPVDIMAYTYLKVTGFEPNKVIGSGTLLDSARLKFAIAE
KYGMSSRDVQAYVLGEHGDSSVSIISSAKIAGLPLKHFSKASDIEKEFAEIDHFIRRRAYEIIERKGATFYGIGEATAEV
AELILRDTKEVRVVASLINGQYGAKDVMFGTPCVLGRNGVEKILEIELSATEKAGLDKSIQVLKDNIKLAKL
>Q59645 1.1.1.27~~~ldh~~~L-lactate dehydrogenase~~~
MSNIQNHQKVVLVGDGAVGSSYAFAMAEEGIAEEFVIVDVVKVRTVGDALDLEDATPFTAPKNIYSGEYSDCKDADLVVI
TAGAPQKPGETRLDLVNKNLNILSTILKPVVDSGFDGIFLVAANPVDILTYATWKFSGFPKEKVIGSGISLDTARLRVAL
GKKFNVSPESVDAYILGEHGDSEFAAYSSATIGTKPLLEIAKEEGVSTDELAEIEDSVRNKAYEIINKKGATFYGVGTAL
MRISKAILRDENAVLPVGAYMDGEYGLNDIYIGTPAVINGQGLNRVIEAPLSDDEKKKMTDSATTLKKVLTDGLNALAEK
QDK
>A9BGZ9 1.1.1.27~~~ldh~~~L-lactate dehydrogenase~~~COG0039
MKISIIGTGRVGSSTAFALINAAVADEIVLYDLNKEMAEGEALDLLHATTFHKRMIIRAGEYSDIEGSDIVLITAGAAQK
PGETRLDLTIKNAKIIKGISENIKKYAPNTLIINITNPVDVMSYVVWKVTGFESNRVIGTGTILDTARLRALIGKNCGVS
PMSVHAYIIGEHGDSELAAWSSAMIGGVPIKGFCRNCPYKDNCNKDLSKIFDDVKNSAYTIISKKGATNYGIASATTALV
ESIIKNEGRVYTPSVLLDDVYIGYPAVINKDGVERTIDITLNDEETEKFESSKSIIKEYLESIKNLL
>P00345 1.1.1.27~~~ldh~~~L-lactate dehydrogenase~~~
MKTQFTPKTRKVAVIGTGFVGSSYAFSMVNQGIANELVLIDMNKEKAEGEARDINHGMPFATPMKIWAGDYKDCADADLA
VITAGANQAPGETRLDLVEKNVKIFECIVKDIMNSGFDGIILVATNPVDILAHVTQKVSGLPNGRVIGSGTILDTARFRY
LLSDYFEVDSRNVHAYIMGEHGDTEFPVWSHAQIGGVKLEHFINTAAIEKEPDMQHLFEQTRDAAYHIINRKGATYYGIA
MGLVRITKAILDDENSILTVSALLEGQYGISDVYIGVPAIINKNGVRQIIELNLTPHEQQQLEHSASILKQTRDRAFV
>Q9EVR0 1.1.1.27~~~ldh~~~L-lactate dehydrogenase~~~COG0039
MNNRRKIVVIGASNVGSAVANKIADFQLATEVVLIDLNEDKAWGEAKDSSHATSCIYSTNIKFHLGDYEDCKDANIIVIT
AGPSIRPGETPDRLKLAGTNAKIMSSVMGEIVKRTKEAMIIMITNPLDVATYVVSTQFDYPRNLILGTGTMLETYRFRRI
LADKYQVDPKNINGYVLGEHGNAAFVAWSTTGCAGFPIDDLDEYFHRTEKLSHEAVEQELVQVAYDVINKKGFTNTGIAM
AACRFIKSVLYDEHTILPCSAVLEGEYGIKDVALSIPRMVCADGIMRSFEVHLTDDELEKMHKAAQSVRSALDGAGIK
>P0A3N0 1.1.1.27~~~ldh~~~L-lactate dehydrogenase~~~COG0039
MTSTKQHKKVILVGDGAVGSSYAFALVNQGIAQELGIIEIPQLHEKAVGDALDLSHALAFTSPKKIYAAQYSDCADADLV
VITAGAPQKPGETRLDLVGKNLAINKSIVTQVVESGFKGIFLVAANPVDVLTYSTWKFSGFPKERVIGSGTSLDSARFRQ
ALAEKLDVDARSVHAYIMGEHGDSEFAVWSHANIAGVNLEEFLKDTQNVQEAELIELFEGVRDAAYTIINKKGATYYGIA
VALARITKAILDDENAVLPLSVFQEGQYGVENVFIGQPAVVGAHGIVRPVNIPLNDAETQKMQASAKELQAIIDEAWKNP
EFQEASKN
>P13715 1.1.1.27~~~ldh~~~L-lactate dehydrogenase~~~
MKVGIVGSGFVGSATAYALVLQGVAREVVLVDLDRKLAQAHAEDILHATPFAHPVWVRSGWYEDLEGARVVIVAAGVAQR
PGETRLQLLDRNAQVFADVVPKILKAAPEAVLLIATNPVDVMTQVAYRLSGLPPERVVGSGTILDTARFRALLAQHLLVA
PQSVHAYVVGEHGDSEVLVWSSAQVGGVDLEAFAQARGRALTPDDRLRIDEGVRRAAYRIIEGKGATYYGIGAGLARLTR
AILTDEKGVFTVSLFTPEVEGVEEVALSLPRILGARGVEATLYPRLNEEERQALRRSAEILKGAASALGF
>P06150 1.1.1.27~~~ldh~~~L-lactate dehydrogenase~~~
MKVGIVGSGMVGSATAYALALLGVAREVVLVDLDRKLAQAHAEDILHATPFAHPVWVRAGSYGDLEGARAVVLAAGVAQR
PGETRLQLLDRNAQVFAQVVPRVLEAAPEAVLLVATNPVDVMTQVAYRLSALPPGRVVGSGTILDTARFRALLAEHLRVA
PQSVHAYVLGEHGDSEVLVWSSAQVGGVPLLEFAEARGRALSPEDRARIDEGVRRAAYRIIEGKGATYYGIGAGLARLVR
AILTDEKGVYTVSAFTPEVEGVLEVSLSLPRILGAGGVEGTVYPSLSPEEREALRRSAEILKEAAFALGF
>P16115 1.1.1.27~~~ldh~~~L-lactate dehydrogenase~~~COG0039
MKIGIVGLGRVGSSTAFALLMKGFAREMVLIDVDKKRAEGDALDLIHGTPFTRRANIYAGDYADLKGSDVVIVAAGVPQK
PGETRLQLLGRNARVMKEIARNVSKYAPDSIVIVVTNPVDVLTYFFLKESGMDPRKVFGSGTVLDTARLRTLIAQHCGFS
PRSVHVYVIGEHGDSEVPVWSGAMIGGIPLQNMCQICQKCDSKILENFAEKTKRAAYEIIERKGATHYAIALAVADIVES
IFFDEKRVLTLSVYLEDYLGVKDLCISVPVTLGKHGVERILELNLNEEELEAFRKSASILKNAINEITAEENKHQNTSG
>Q5SJA1 1.1.1.27~~~ldh~~~L-lactate dehydrogenase~~~COG0039
MKVGIVGSGMVGSATAYALALLGVAREVVLVDLDRKLAQAHAEDILHATPFAHPVWVRAGSYGDLEGARAVVLAAGVAQR
PGETRLQLLDRNAQVFAQVVPRVLEAAPEAVLLVATNPVDVMTQVAYRLSGLPPGRVVGSGTILDTARFRALLAEYLRVA
PQSVHAYVLGEHGDSEVLVWSSAQVGGVPLLEFAEARGRALSPEDRARIDEGVRRAAYRIIEGKGATYYGIGAGLARLVR
AILTDEKGVYTVSAFTPEVEGVLEVSLSLPRILGAGGVEGTVYPSLSPEEREALRRSAEILKEAAFALGF
>E1XUJ2 4.2.1.127~~~ldi~~~Linalool dehydratase/isomerase~~~
MRFTLKTTAIVSAAALLAGFGPPPRAAELPPGRLATTEDYFAQQAKQAVTPDVMAQLAYMNYIDFISPFYSRGCSFEAWE
LKHTPQRVIKYSIAFYAYGLASVALIDPKLRALAGHDLDIAVSKMKCKRVWGDWEEDGFGTDPIEKENIMYKGHLNLMYG
LYQLVTGSRRYEAEHAHLTRIIHDEIAANPFAGIVCEPDNYFVQCNSVAYLSLWVYDRLHGTDYRAATRAWLDFIQKDLI
DPERGAFYLSYHPESGAVKPWISAYTTAWTLAMVHGMDPAFSERYYPRFKQTFVEVYDEGRKARVRETAGTDDADGGVGL
ASAFTLLLAREMGDQQLFDQLLNHLEPPAKPSIVSASLRYEHPGSLLFDELLFLAKVHAGFGALLRMPPPAAKLAGK
>Q2FWC5 6.3.2.-~~~~~~L-aspartate--L-methionine ligase~~~COG1181
MTNNGNEPNLTLSDLYDKDVVYTSRPSYISNPWLKPDEHQSNFLTGRELLIANQLPVIVHEASATDKLHQLFQVIGKEVP
NSIYTFNNQQSYENLIKQLAHKENKKIYFQYIHDETILNQQYYALDKTLFVALNNKARIPEWTNGKFLPKRKVVKIEQFE
NEIKNWEFPLVIKPGDDLPTAGGYGVMICYHDADLQKAITRIKEATAETNSLIIEQKIEEKANYCVQFAYSESLGIQYLG
AATQLTDKYGFYNGNENTTNVPEHVIEAGRQIMENGVNQGFFGVAGFDLLVDEDDNVYAIDLNFRQNGSTSMLLLANELN
SGYQKFYSYHSKGDNTHFFNTILKYVKEGSLYPLSYYDGDWYGEDKVKSRFGCIWHGDSKETVLENERAFLAELEHY
>Q8GLI4 ~~~ldpA~~~Light dependent period A~~~COG1149
MSASFPLEALQAGHWFKLICGASYQHLPVIRNLALTYALAGADCVDVAAEPAVVRSALAGLAAYERLTGRDRPWLMVSLN
DGEDLHFRRAWFDPDRCPTDCPRPCERVCPTDAITSTGVQRDRCYGCGRCLPICPLGLIEAQAWQVDAAALLPELLDLGI
NAIEIHTQVGHQKEFQQLWQQLQPWLPQLQAIAISCPAHPEAIAYLWDLQELVGNTVETVIWQTDGRPMSGDIGAGTTHA
AVRFAQAMLTEGPPGHVQLAGGTNAHTVAKLQELKLLAPQVSRFVAGIACGGAARTPLAELLEPLAEQQRSLEAHPEVLQ
AAVTTAIALVGPLKQAVGGATRSIPAFLLRTP
>Q2G278 3.5.1.28~~~~~~Probable autolysin LDP~~~COG1388
MKKSLTVTVSSVLAFLALNNAAHAQQHGTQVKTPVQHNYVSNVQAQTQSPTTYTVVAGDSLYKIALEHHLTLNQLYSYNP
GVTPLIFPGDVISLVPQNKVKQTKAVKSPVRKASQAKKVVKQPVQQASKKVVVKQAPKQAVTKTVNVAYKPAQVQKSVPT
VPVAHNYNKSVANRGNLYAYGNCTYYAFDRRAQLGRSIGSLWGNANNWNYAAKVAGFKVDKTPEVGAIFQTAAGPYGHVG
VVESVNPNGTITVSEMNYAGFNVKSSRTILNPGKYNYIH
>P0DPD0 ~~~ldrA~~~Small toxic polypeptide LdrA~~~
MTLAQFAMIFWHDLAAPILAGIITAAIVSWWRNRK
>Q6BF25 ~~~ldrD~~~Small toxic polypeptide LdrD~~~
MTFAELGMAFWHDLAAPVIAGILASMIVNWLNKRK
>Q53W63 ~~~~~~Transcriptional regulator LdrP~~~
MKRFARKETIYLRGEEARTLYRLEEGLVRVVELLPDGRLITLRHVLPGDYFGEEALEGKAYRYTAEAMTEAVVQGLEPRA
MDHEALHRVARNLARQMRRVQAYEAHLQTGELRARIARYLLFLADTPLSARDRQGIYVTVSHEEIADATASIRESVSKVL
ADLRREGLIATAYRRVYLLDLAALEREAGSALEAA
>O53638 2.3.2.-~~~ldtA~~~L,D-transpeptidase 1~~~COG1376
MRRVVRYLSVVVAITLMLTAESVSIATAAVPPLQPIPGVASVSPANGAVVGVAHPVVVTFTTPVTDRRAVERSIRISTPH
NTTGHFEWVASNVVRWVPHRYWPPHTRVSVGVQELTEGFETGDALIGVASISAHTFTVSRNGEVLRTMPASLGKPSRPTP
IGSFHAMSKERTVVMDSRTIGIPLNSSDGYLLTAHYAVRVTWSGVYVHSAPWSVNSQGYANVSHGCINLSPDNAAWYFDA
VTVGDPIEVVG
>O53223 2.3.2.-~~~ldtB~~~L,D-transpeptidase 2~~~
MPKVGIAAQAGRTRVRRAWLTALMMTAVMIGAVACGSGRGPAPIKVIADKGTPFADLLVPKLTASVTDGAVGVTVDAPVS
VTAADGVLAAVTMVNDNGRPVAGRLSPDGLRWSTTEQLGYNRRYTLNATALGLGGAATRQLTFQTSSPAHLTMPYVMPGD
GEVVGVGEPVAIRFDENIADRGAAEKAIKITTNPPVEGAFYWLNNREVRWRPEHFWKPGTAVDVAVNTYGVDLGEGMFGE
DNVQTHFTIGDEVIATADDNTKILTVRVNGEVVKSMPTSMGKDSTPTANGIYIVGSRYKHIIMDSSTYGVPVNSPNGYRT
DVDWATQISYSGVFVHSAPWSVGAQGHTNTSHGCLNVSPSNAQWFYDHVKRGDIVEVVNTVGGTLPGIDGLGDWNIPWDQ
WRAGNAKA
>I6Y9J2 2.3.2.-~~~ldtB~~~L,D-transpeptidase 2~~~COG1376
MPKVGIAAQAGRTRVRRAWLTALMMTAVMIGAVACGSGRGPAPIKVIADKGTPFADLLVPKLTASVTDGAVGVTVDAPVS
VTAADGVLAAVTMVNDNGRPVAGRLSPDGLRWSTTEQLGYNRRYTLNATALGLGGAATRQLTFQTSSPAHLTMPYVMPGD
GEVVGVGEPVAIRFDENIADRGAAEKAIKITTNPPVEGAFYWLNNREVRWRPEHFWKPGTAVDVAVNTYGVDLGEGMFGE
DNVQTHFTIGDEVIATADDNTKILTVRVNGEVVKSMPTSMGKDSTPTANGIYIVGSRYKHIIMDSSTYGVPVNSPNGYRT
DVDWATQISYSGVFVHSAPWSVGAQGHTNTSHGCLNVSPSNAQWFYDHVKRGDIVEVVNTVGGTLPGIDGLGDWNIPWDQ
WRAGNAKA
>O06825 2.3.2.-~~~~~~Probable L,D-transpeptidase 3~~~COG1376
MRAVFGCAIAVVGIAGSVVAGPADIHLVAAKQSYGFAVASVLPTRGQVVGVAHPVVVTFSAPITNPANRHAAERAVEVKS
TPAMTGKFEWLDNDVVQWVPDRFWPAHSTVELSVGSLSSDFKTGPAVVGVASISQHTFTVSIDGVEEGPPPPLPAPHHRV
HFGEDGVMPASMGRPEYPTPVGSYTVLSKERSVIMDSSSVGIPVDDPDGYRLSVDYAVRITSRGLYVHSAPWALPALGLE
NVSHGCISLSREDAEWYYNAVDIGDPVIVQE
>O07436 2.3.2.-~~~~~~L,D-transpeptidase 4~~~COG1376
MPHWAEERHRRESNYVALEAGLDEGESIRRSEHSRSGCGADAGCWRCRGGPGRGSRRSRRSRGPGGTAGPVDPPAVDLLA
PPPDPLALPPALDPLAPPPPDPLAPPPPDPLAVPVAAGPVAGQDPTSFVGPPPFRPPTFNPVDGAMVGVAKPIVINFAVP
IADRAMAESAIHISSIPPVPGKFYWMSPTQVRWRPFEFWPANTAVNIDAAGTKSSFRTGDSLVATADDATHQMTITRNGV
VQKTFPMSMGMVSGGHQTPNGTYYVLEKFATVVMDSSTYGVPVNSAQGYKLTVSDAVRIDNSGNFVHSAPWSVADQGKRN
VTHGCINLSPANAKWFYDNFGSGDPVVVKNSVGTYNKNDGAQDWQI
>P9WKV2 2.3.2.-~~~~~~L,D-transpeptidase 5~~~
MVIRVLFRPVSLIPVNNSSTPQSQGPISRRLALTALGFGVLAPNVLVACAGKVTKLAEKRPPPAPRLTFRPADSAADVVP
IAPISVEVGDGWFQRVALTNSAGKVVAGAYSRDRTIYTITEPLGYDTTYTWSGSAVGHDGKAVPVAGKFTTVAPVKTINA
GFQLADGQTVGIAAPVIIQFDSPISDKAAVERALTVTTDPPVEGGWAWLPDEAQGARVHWRPREYYPAGTTVDVDAKLYG
LPFGDGAYGAQDMSLHFQIGRRQVVKAEVSSHRIQVVTDAGVIMDFPCSYGEADLARNVTRNGIHVVTEKYSDFYMSNPA
AGYSHIHERWAVRISNNGEFIHANPMSAGAQGNSNVTNGCINLSTENAEQYYRSAVYGDPVEVTGSSIQLSYADGDIWDW
AVDWDTWVSMSALPPPAAKPAATQIPVTAPVTPSDAPTPSGTPTTTNGPGG
>P9WKV3 2.3.2.-~~~lprQ~~~L,D-transpeptidase 5~~~COG1376
MVIRVLFRPVSLIPVNNSSTPQSQGPISRRLALTALGFGVLAPNVLVACAGKVTKLAEKRPPPAPRLTFRPADSAADVVP
IAPISVEVGDGWFQRVALTNSAGKVVAGAYSRDRTIYTITEPLGYDTTYTWSGSAVGHDGKAVPVAGKFTTVAPVKTINA
GFQLADGQTVGIAAPVIIQFDSPISDKAAVERALTVTTDPPVEGGWAWLPDEAQGARVHWRPREYYPAGTTVDVDAKLYG
LPFGDGAYGAQDMSLHFQIGRRQVVKAEVSSHRIQVVTDAGVIMDFPCSYGEADLARNVTRNGIHVVTEKYSDFYMSNPA
AGYSHIHERWAVRISNNGEFIHANPMSAGAQGNSNVTNGCINLSTENAEQYYRSAVYGDPVEVTGSSIQLSYADGDIWDW
AVDWDTWVSMSALPPPAAKPAATQIPVTAPVTPSDAPTPSGTPTTTNGPGG
>P84330 ~~~~~~Lectin OAA~~~
ALYNVENQWGGSSAPWNEGGQWEIGSRSDQNVVAINVESGDDGQTLNGTMTYAGEGPIGFRATLLGNNSYEVENQWGGDS
APWHSGGNWILGSRENQNVVAINVESGDDGQTLNGTMTYAGEGPIGFKGTTL
>P15917 3.4.24.83~~~lef~~~Lethal factor~~~
MNIKKEFIKVISMSCLVTAITLSGPVFIPLVQGAGGHGDVGMHVKEKEKNKDENKRKDEERNKTQEEHLKEIMKHIVKIE
VKGEEAVKKEAAEKLLEKVPSDVLEMYKAIGGKIYIVDGDITKHISLEALSEDKKKIKDIYGKDALLHEHYVYAKEGYEP
VLVIQSSEDYVENTEKALNVYYEIGKILSRDILSKINQPYQKFLDVLNTIKNASDSDGQDLLFTNQLKEHPTDFSVEFLE
QNSNEVQEVFAKAFAYYIEPQHRDVLQLYAPEAFNYMDKFNEQEINLSLEELKDQRMLARYEKWEKIKQHYQHWSDSLSE
EGRGLLKKLQIPIEPKKDDIIHSLSQEEKELLKRIQIDSSDFLSTEEKEFLKKLQIDIRDSLSEEEKELLNRIQVDSSNP
LSEKEKEFLKKLKLDIQPYDINQRLQDTGGLIDSPSINLDVRKQYKRDIQNIDALLHQSIGSTLYNKIYLYENMNINNLT
ATLGADLVDSTDNTKINRGIFNEFKKNFKYSISSNYMIVDINERPALDNERLKWRIQLSPDTRAGYLENGKLILQRNIGL
EIKDVQIIKQSEKEYIRIDAKVVPKSKIDTKIQEAQLNINQEWNKALGLPKYTKLITFNVHNRYASNIVESAYLILNEWK
NNIQSDLIKKVTNYLVDGNGRFVFTDITLPNIAEQYTHQDEIYEQVHSKGLYVPESRSILLHGPSKGVELRNDSEGFIHE
FGHAVDDYAGYLLDKNQSDLVTNSKKFIDIFKEEGSNLTSYGRTNEAEFFAEAFRLMHSTDHAERLKVQKNAPKTFQFIN
DQIKFIINS
>Q5ZXN5 3.1.3.-~~~lem3~~~Phosphocholine hydrolase Lem3~~~COG1391
MKLRYIINENKLVFTSCNMRDKIITGKKIIFSQSVAKDQTKNLSSFLSERFYSVNQSHNHSIIIGSSLSHQENDIEHDTI
LDTSGVLVTTDTNGIVNGARVAITDGLGGGNGDQEEDDEIYRVSHSSCENFLNCDQNIDTTLSLITQPKASDKKQTAPKT
LQHTEASMAAFIYQNHPGKGYIGEFANIGDGLIIILDKRFKIKHMVSACHIYRGFGTWTPPSLQALATTANKDALLVRQT
LKLAEGDIIISMTDGVWGELKTSLIAQTNDRRDIGVDKEYFKTLFDELTDAPYPSSFDIARIITQRAMSRSLERRKTLIK
LINEIEQQHFHEKSVKTINEVLEYFIKTGHVETAQTLKAILFEDGLSDGITYFENIEIPLEMVMHDLKSRTVGDCSTINV
TRIPYHLDELIRGFINYPEKHQILAPLFKARVKSEADLEEAFHRLSLEMVQPEIECPISETHFERAFKKETLDKTQAVLT
HYFRISTGLDSKKNYQERLNDLSAYLSKESSLEKNDIKLLLSMLDSEIKPKTGVFQTLFGENQNKLYKAFHKKIELQLLD
SEIENKNELK
>A8AVK0 ~~~lemA~~~Protein LemA~~~COG1704
MSFIITIAVIVVIVLFVISVYNSLVRARMQTQEAWSQIDVQLKRRNDLLPNLIETVKGYGKYEQATLEKVTQLRAQVASA
SSPADAMKASDALTRQISGIFAVAESYPDLKANENYLKLQEELTNTENKISYSRQLYNSVAGNYNVKLQAFPSNVIAGMF
AFRPADFLSTPEEEKAVPKVDFGSNGLGE
>P15378 ~~~comC~~~Prepilin leader peptidase/N-methyltransferase~~~COG1989
MLSILFIFGLILGSFYYTAGCRIPLHLSIIAPRSSCPFCRRTLTPAELIPILSFLFQKGKCKSCGHRISFMYPAAELVTA
CLFAAAGIRFGISLELFPAVVFISLLIIVAVTDIHFMLIPNRILIFFLPFLAAARLISPLDSWYAGLLGAAAGFLFLAVI
AAITHGGVGGGDIKLFAVIGFVLGVKMLAAAFFFSVLIGALYGAAAVLTGRLAKRQPLPFAPAIAAGSILAYLYGDSIIS
FYIKMALG
>P25960 ~~~gspO~~~Prepilin leader peptidase/N-methyltransferase~~~COG1989
MTMLLPLFILVGFIADYFVNAIAYHLSPLEDKTALTFRQVLVHFRQKKYAWHDTVPLILCVAAAIACALAPFTPIVTGAL
FLYFCFVLTLSVIDFRTQLLPDKLTLPLLWLGLVFNAQYGLIDLHDAVYGAVAGYGVLWCVYWGVWLVCHKEGLGYGDFK
LLAAAGAWCGWQTLPMILLIASLGGIGYAIVSQLLQRRTITTIAFGPWLALGSMINLGYLAWISY
>P31712 ~~~outO~~~Prepilin leader peptidase/N-methyltransferase~~~
MDDLREFAQLFPAWWFGALGVLGLIVGSFLNVVIYRLPIMLERRWRQDIELETGVADPDTRYNLWWPPSSCPHCQQAIAV
KDNIPLFSWLWLRGRSRCCHQSVSVQYPLVEVITMLAFLAAGLLWLPGMALWGALILLSFLLVLTVIDIKTLLLPDELTL
SLLWMGLLFNLSGTFVSLNDAVVGAMAGYLSLWLLYWAFKYATGKEALGYGDFKLLAALGAWLGWQALPNLVLVAALSGL
VVTLIWRGLRKEDTAKPLAFGPWLAIGGVFGMIMNGFNL
>P22610 ~~~pilD~~~Prepilin leader peptidase/N-methyltransferase~~~
MPLLDYLASHPLAFVLCTILLGLLVGSFLNVVVHRLPKMMERNWKAEAREALGLEPEPKQATYNLVLPNSACPRCGHEIR
PWENIPLVSYLALGGKCSSCKAAIGKRYPLVELATALLSGYVAWHFGFTWQAGAMLLLTWGLLAMSLIDADHQLLPDVLV
LPLLWLGLIANHFGLFASLDDALFGAVFGYLSLWSVFWLFKLVTGKEGMGYGDFKLLAMLGAWGGWQILPLTILLSSLVG
AILGVIMLRLRNAESGTPIPFGPYLAIAGWIALLWGDQITRTYLQFAGFK
>O67618 3.6.5.n1~~~lepA~~~Elongation factor 4~~~COG0481
MEQKNVRNFCIIAHVDHGKSTLADRLLEYTGAISEREKREQLLDTLDVERERGITVKMQAVRMFYKAKDGNTYKLHLIDT
PGHVDFSYEVSRALAACEGALLLIDASQGIEAQTVANFWKAVEQDLVIIPVINKIDLPSADVDRVKKQIEEVLGLDPEEA
ILASAKEGIGIEEILEAIVNRIPPPKGDPQKPLKALIFDSYYDPYRGAVAFVRIFDGEVKPGDKIMLMSTGKEYEVTEVG
AQTPKMTKFDKLSAGDVGYIAASIKDVRDIRIGDTITHAKNPTKEPVPGFQPAKPMVYAGIYPAEDTTYEELRDALEKYA
INDAAIVYEPESSPALGMGFRVGFLGLLHMEIVQERLEREYGVKIITTAPNVIYRVKKKFTDEVIEVRNPMDFPDNAGLI
EYVEEPFVLVTIITPKEYVGPIIQLCQEKRGIQKNMTYLDPNTVYLEYEMPLSEIIVDFHDKIKSISRGFASYDYEFIGY
RPSDLIKLTVLINKKPVDALSFIVHADRAQKFARRVAEKLRETIPRQLFEVHIQVAKGGKVIASERIKPLRANVTAKCYG
GDVTRKKKLLENQKEGKKRMKQFGKVQLPQEAFLSVLKVE
>P60785 3.6.5.n1~~~lepA~~~Elongation factor 4~~~COG0481
MKNIRNFSIIAHIDHGKSTLSDRIIQICGGLSDREMEAQVLDSMDLERERGITIKAQSVTLDYKASDGETYQLNFIDTPG
HVDFSYEVSRSLAACEGALLVVDAGQGVEAQTLANCYTAMEMDLEVVPVLNKIDLPAADPERVAEEIEDIVGIDATDAVR
CSAKTGVGVQDVLERLVRDIPPPEGDPEGPLQALIIDSWFDNYLGVVSLIRIKNGTLRKGDKVKVMSTGQTYNADRLGIF
TPKQVDRTELKCGEVGWLVCAIKDIHGAPVGDTLTLARNPAEKALPGFKKVKPQVYAGLFPVSSDDYEAFRDALGKLSLN
DASLFYEPESSSALGFGFRCGFLGLLHMEIIQERLEREYDLDLITTAPTVVYEVETTSREVIYVDSPSKLPAVNNIYELR
EPIAECHMLLPQAYLGNVITLCVEKRGVQTNMVYHGNQVALTYEIPMAEVVLDFFDRLKSTSRGYASLDYNFKRFQASDM
VRVDVLINGERVDALALITHRDNSQNRGRELVEKMKDLIPRQQFDIAIQAAIGTHIIARSTVKQLRKNVLAKCYGGDISR
KKKLLQKQKEGKKRMKQIGNVELPQEAFLAILHVGKDNK
>P9WK97 3.6.5.n1~~~lepA~~~Elongation factor 4~~~COG0481
MRTPCSQHRRDRPSAIGSQLPDADTLDTRQPPLQEIPISSFADKTFTAPAQIRNFCIIAHIDHGKSTLADRMLQLTGVVD
ERSMRAQYLDRMDIERERGITIKAQNVRLPWRVDKTDYVLHLIDTPGHVDFTYEVSRALEACEGAVLLVDAAQGIEAQTL
ANLYLALDRDLHIIPVLNKIDLPAADPDRYAAEMAHIIGCEPAEVLRVSGKTGEGVSDLLDEVVRQVPPPQGDAEAPTRA
MIFDSVYDIYRGVVTYVRVVDGKISPRERIMMMSTGATHELLEVGIVSPEPKPCEGLGVGEVGYLITGVKDVRQSKVGDT
VTSLSRARGAAAEALTGYREPKPMVYSGLYPVDGSDYPNLRDALDKLQLNDAALTYEPETSVALGFGFRCGFLGLLHMEI
TRERLEREFGLDLISTSPNVVYRVHKDDGTEIRVTNPSDWPEGKIRTVYEPVVKTTIIAPSEFIGTIMELCQSRRGELGG
MDYLSPERVELRYTMPLGEIIFDFFDALKSRTRGYASLDYEEAGEQEAALVKVDILLQGEAVDAFSAIVHKDTAYAYGNK
MTTKLKELIPRQQFEVPVQAAIGSKIIARENIRAIRKDVLSKCYGGDITRKRKLLEKQKEGKKRMKTIGRVEVPQEAFVA
ALSTDAAGDKGKK
>Q2FXY7 3.6.5.n1~~~lepA~~~Elongation factor 4~~~COG0481
MDNEQRLKRRENIRNFSIIAHIDHGKSTLADRILENTKSVETRDMQDQLLDSMDLERERGITIKLNAVRLKYEAKDGNTY
TFHLIDTPGHVDFTYEVSRSLAACEGAILVVDAAQGIEAQTLANVYLALDNELELLPVINKIDLPAAEPERVKQEIEDMI
GLDQDDVVLASAKSNIGIEEILEKIVEVVPAPDGDPEAPLKALIFDSEYDPYRGVISSIRIVDGVVKAGDKIRMMATGKE
FEVTEVGINTPKQLPVDELTVGDVGYIIASIKNVDDSRVGDTITLASRPASEPLQGYKKMNPMVYCGLFPIDNKNYNDLR
EALEKLQLNDASLEFEPESSQALGFGYRTGFLGMLHMEIIQERIEREFGIELIATAPSVIYQCVLRDGSEVTVDNPAQMP
DRDKIDKIFEPYVRATMMVPNDYVGAVMELCQRKRGQFINMDYLDDIRVNIVYELPLAEVVFDFFDQLKSNTKGYASFDY
EFIENKESNLVKMDILLNGDKVDALSFIVHRDFAYERGKALVEKLKTLIPRQQFEVPVQAAIGQKIVARTNIKSMGKNVL
AKCYGGDISRKRKLLEKQKAGKAKMKAVGNVEIPQDAFLAVLKMDDE
>P65272 3.6.5.n1~~~lepA~~~Elongation factor 4~~~
MDNEQRLKRRENIRNFSIIAHIDHGKSTLADRILENTKSVETRDMQDQLLDSMDLERERGITIKLNAVRLKYEAKDGNTY
TFHLIDTPGHVDFTYEVSRSLAACEGAILVVDAAQGIEAQTLANVYLALDNELELLPVINKIDLPAAEPERVKQEIEDMI
GLDQDDVVLASAKSNIGIEEILEKIVEVVPAPDGDPEAPLKALIFDSEYDPYRGVISSIRIVDGVVKAGDKIRMMATGKE
FEVTEVGINTPKQLPVDELTVGDVGYIIASIKNVDDSRVGDTITLASRPASEPLQGYKKMNPMVYCGLFPIDNKNYNDLR
EALEKLQLNDASLEFEPESSQALGFGYRTGFLGMLHMEIIQERIEREFGIELIATAPSVIYQCILRDGSEVTVDNPAQMP
DRDKIDKIFEPYVRATMMVPNDYVGAVMELCQRKRGQFINMDYLDDIRVNIVYELPLAEVVFDFFDQLKSNTKGYASFDY
EFIENKESNLVKMDILLNGDKVDALSFIVHRDFAYERGKALVEKLKTLIPRQQFEVPVQAAIGQKIVARTNIKSMGKNVL
AKCYGGDISRKRKLLEKQKAGKAKMKAVGNVEIPQDAFLAVLKMDDE
>Q5SKA7 3.6.5.n1~~~lepA~~~Elongation factor 4~~~COG0481
MVRMDLSRIRNFSIIAHVDHGKSTLADRILELTHAVSDREMREQFLDSLELERERGITIKASAVRVTYRAKDGEEYVFHL
IDTPGHVDFTYEVSRALAAVEGVLLVVDASQGVEAETLAKFYMALEHGHVIIPVINKIDLPNARPLEVALEVEEVLGLPA
DEAIFASGKTGEGVEEILEAIVQRIPPPKGDPEAPLKALIFDSVYDAYQGVIPYLRLFEGRVRPGDRIRIYSTGKEFTVD
KVGVFTPQGLVATEALEAGEVGWLVAAIRDIHDVQVGDTITLADRPTPSPYPGFRPAKPVVFAGLYPVDSGDYGKLRDAL
EKLKLNDAALTFEPESSTALGFGFRCGFLGLLHAEIVQERLEREFGLSLIATAPSVVYKVRLKSGEEVEVHNPADLPDPT
RIEEILEPYVKLTIFTPEEYVGSLMQLLQEKRGRLVNMNYLPGAQKRVELVYEAPFAEILYDFHDRLKSVSRGYASMDYE
QAGYRPGDLVKVNVLVHGEVVDALTFIAHREKAYTMARAIVDKLAEVIPRQLFEVPIQAAIGGKIIARATVKALRKDVLA
KCYGGDVTRKKKLLEKQKEGKKRLKAIGKVEVPQEAFLAVLSAGRDEPKG
>P37943 3.4.21.89~~~sipP~~~Signal peptidase I P~~~
MTKEKVFKKKSSILEWGKAIVIAVILALLIRNFLFEPYVVEGKSMDPTLVDSERLFVNKTVKYTGNFKRGDIIILNGKEK
STHYVKRLIGLPGDTVEMKNDHLFINGNEVKEPYLSYNKENAKKVGINLTGDFGPIKVPKDKYFVMGDNRQESMDSRNGL
GLFTKDDIQGTEEFVFFPFSNMRKAK
>P28628 3.4.21.89~~~sipS~~~Signal peptidase I S~~~COG0681
MKSENVSKKKSILEWAKAIVIAVVLALLIRNFIFAPYVVDGDSMYPTLHNRERVFVNMTVKYIGEFDRGDIVVLNGDDVH
YVKRIIGLPGDTVEMKNDQLYINGKKVDEPYLAANKKRAKQDGFDHLTDDFGPVKVPDNKYFVMGDNRRNSMDSRNGLGL
FTKKQIAGTSKFVFYPFNEMRKTN
>P71013 3.4.21.89~~~sipT~~~Signal peptidase I T~~~COG0681
MTEEKNTNTEKTAKKKTNTYLEWGKAIVIAVLLALLIRHFLFEPYLVEGSSMYPTLHDGERLFVNKTVNYIGELKRGDIV
IINGETSKIHYVKRLIGKPGETVQMKDDTLYINGKKVAEPYLSKNKKEAEKLGVSLTGDFGPVKVPKGKYFVMGDNRLNS
MDSRNGLGLIAEDRIVGTSKFVFFPFNEMRQTK
>P42959 3.4.21.89~~~sipU~~~Signal peptidase I U~~~COG0681
MNAKTITLKKKRKIKTIVVLSIIMIAALIFTIRLVFYKPFLIEGSSMAPTLKDSERILVDKAVKWTGGFHRGDIIVIHDK
KSGRSFVKRLIGLPGDSIKMKNDQLYINDKKVEEPYLKEYKQEVKESGVTLTGDFEVEVPSGKYFVMGDNRLNSLDSRNG
MGMPSEDDIIGTESLVFYPFGEMRQAK
>O07560 3.4.21.89~~~sipV~~~Signal peptidase I V~~~COG0681
MKKRFWFLAGVVSVVLAIQVKNAVFIDYKVEGVSMNPTFQEGNELLVNKFSHRFKTIHRFDIVLFKGPDHKVLIKRVIGL
PGETIKYKDDQLYVNGKQVAEPFLKHLKSVSAGSHVTGDFSLKDVTGTSKVPKGKYFVVGDNRIYSFDSRHFGPIREKNI
VGVISDAE
>P54506 3.4.21.89~~~sipW~~~Signal peptidase I W~~~COG0681
MKLISNILYVIIFTLIIVLTLVVISTRSSGGEPAVFGYTLKSVLSGSMEPEFNTGSLILVKEITDVKELQKGDVITFMQD
ANTAVTHRIVDITKQGDHLLFKTKGDNNAAADSAPVSDENVRAQYTGFQLPYAGYMLHFASQPIGTAVLLIVPGVMLLVY
AFVTISSAIREIERKTKALETDTKDSTMST
>P00803 3.4.21.89~~~lepB~~~Signal peptidase I~~~COG0681
MANMFALILVIATLVTGILWCVDKFFFAPKRRERQAAAQAAAGDSLDKATLKKVAPKPGWLETGASVFPVLAIVLIVRSF
IYEPFQIPSGSMMPTLLIGDFILVEKFAYGIKDPIYQKTLIETGHPKRGDIVVFKYPEDPKLDYIKRAVGLPGDKVTYDP
VSKELTIQPGCSSGQACENALPVTYSNVEPSDFVQTFSRRNGGEATSGFFEVPKNETKENGIRLSERKETLGDVTHRILT
VPIAQDQVGMYYQQPGQQLATWIVPPGQYFMMGDNRDNSADSRYWGFVPEANLVGRATAIWMSFDKQEGEWPTGLRLSRI
GGIH
>P9WKA1 3.4.21.89~~~lepB~~~Signal peptidase I~~~COG0681
MTETTDSPSERQPGPAEPELSSRDPDIAGQVFDAAPFDAAPDADSEGDSKAAKTDEPRPAKRSTLREFAVLAVIAVVLYY
VMLTFVARPYLIPSESMEPTLHGCSTCVGDRIMVDKLSYRFGSPQPGDVIVFRGPPSWNVGYKSIRSHNVAVRWVQNALS
FIGFVPPDENDLVKRVIAVGGQTVQCRSDTGLTVNGRPLKEPYLDPATMMADPSIYPCLGSEFGPVTVPPGRVWVMGDNR
THSADSRAHCPLLCTDDPLPGTVPVANVIGKARLIVWPPSRWGVVRSVNPQQGR
>A8GQT7 3.4.21.89~~~lepB~~~Signal peptidase I~~~
MQTDNTKSNTNKTAKQEWGSFAFVICIALLIRILIMEPFTVPTGSMKATILENDYIFSTKYSYGYSNYSLSFFDFIPLFK
GRIFAREPDRGDIVVFRPPNDMSVRYIKRLIGLPGDKIQLIDDVIYINDKKIERTEVGTYISEEGIKYLKFKETLPNGRT
YFSYKLAPIYGVIYNDRYGNTDVFYVPEGKYFFLGDNRDQSNDSRVNLGFVPFENFIAKAQFIWFSTKITWWDNDIGVIN
LVLKLKPWVESVRLNRIFRNLYNTDA
>Q8L2J7 3.4.21.89~~~lepB~~~Signal peptidase I~~~COG0681
MNRDNTKTNKTVKQEFASFTFVICIALVIRILIMEPFTVPTGSMKATILENDYIFSTKYSYGYSNYSLSFFDFIPLFKGR
VFAREPERGDIVVFRPPNDMSVRYIKRLIGLPGDKIQLIDDVIYINDKKIERTEVGTYIGEDGIKYLKFKETLPNGRTYF
SYKLAPIFGIISNDRYSNTGVFYVPEGQYFFLGDNRDRSNDSRVNLGFVPFENFIGKAQFIWFSTKITWWDNDIGIINLI
LKLKPWIESVRLSRIFKNLYNVDE
>Q5HHB9 3.4.21.89~~~spsB~~~Signal peptidase IB~~~
MKKEILEWIISIAVAFVILFIVGKFIVTPYTIKGESMDPTLKDGERVAVNIVGYKTGGLEKGNVVVFHANKNDDYVKRVI
GVPGDKVEYKNDTLYVNGKKQDEPYLNYNLKHKQGDYITGTFQVKDLPNANPKSNVIPKGKYLVLGDNREVSKDSRAFGL
IDEDQIVGKVSFRFWPFSEFKHNFNPENTKN
>P0A068 3.4.21.89~~~spsB~~~Signal peptidase IB~~~
MKKELLEWIISIAVAFVILFIVGKFIVTPYTIKGESMDPTLKDGERVAVNIIGYKTGGLEKGNVVVFHANKNDDYVKRVI
GVPGDKVEYKNDTLYVNGKKQDEPYLNYNLKHKQGDYITGTFQVKDLPNANPKSNVIPKGKYLVLGDNREVSKDSRAFGL
IDEDQIVGKVSFRFWPFSEFKHNFNPENTKN
>B9JN20 5.3.1.33~~~lerI~~~L-erythrulose-1-phosphate isomerase~~~COG0149
MTASPRYWIGTSWKMNKTLAEARGFAEALRDADALRDPAIQRFIIPPFTAVREVKSILSDTSVKVGAQNMHWADQGAWTG
EVSPLMLRDCNLDIVELGHSERREHFGETNETVGLKTEAAVRHGLIPLICIGETLSDRESGRAAEILSEQVVGALSKLSG
SQKQAQILLAYEPVWAIGEKGIPAEPSYADARQAEIIAVAEKVLGRRIPCLYGGSVNPDNCEELISCPHIDGLFIGRSAW
NVEGYLDILAKCAAKLRGDTK
>A0R756 5.3.1.33~~~lerI~~~L-erythrulose-1-phosphate isomerase~~~COG0149
MPDARALGAAQLWIGTSWKMNKGLAESRGYARELAEYVAAKPPAGVQPFIIPSFTALTTVRDALGDDSPVLLGVQNAHWE
DHGAWTGEVSVAQAKDAGAQIVEIGHSERREHFGETVETTRLKVAAALHHGLVPLLCIGESAENKQAGESSRFILEQAAG
ALEGLTDEHLARVLIAYEPIWAIGENGRPATVEELRQPFDDLAREYGCRTMGLLYGGSVNTDNAEDLLGIDHVTGLFIGR
AAWQLPGYVRILEMAAAHPKAKA
>Q6D8V5 5.3.1.33~~~lerI~~~L-erythrulose-1-phosphate isomerase~~~COG0149
MSSRKLTLGVSLKMYFGYQQTLDWCQKIHEIAEQHPLASLPSARLFVLPAFPTLAPVVQRFAQSPVHVGAQDLHWTDNGA
FTGEVSGTMLHEMGCRYVEIGHAERRRYFGETDEHFALKTAAAWRNGLTPVLCVGEEQRGSTQQAIDTCQAQLAAALNLA
QKQQLTGDLVLAYEPQWAIGSTEPAPTAYISEVCQALKQHLPTQAGVREGRIIYGGSAGPGLLSQLGDAVDGLFLGRFAH
DPAAFNAIMDEAFTLSSQA
>A0R758 2.7.1.209~~~lerK~~~L-erythrulose 1-kinase~~~COG2376
MTYLLNSPDDFADEAVRGLVAANPDLLTEVPGGVVRSTETPKGQPALVIGGGSGHYPAFAGWVGPGMGHGAPCGNIFSSP
SASEVYSVVRNAENGGGVILGFGNYAGDVLHFGLAAEKLRHEGIDVRIVTVSDDIASNSPENHRDRRGVAGDLPVFKIAG
AAIEAGADLDEAERVAWKANDATRSFGLAFEGCTLPGATEPLFHVEKGWMGVGLGIHGEPGVRDNRLGTAAEVADMLFDE
VTAEEPPRGENGYDGRVAVILNGLGTVKYEELFVVYGRIAERLAQQGFTVVRPEVGEFVTSLDMAGVSLTMVFLDDELER
LWTAPVETPAYRRGAMPAVDRTPRTTTWDAAETTIPEASEGSRECARNIVAVLETFQQVCADNEAELGRIDAVAGDGDHG
QGMSFGSRGAAQAARDAVDRNAGARTTLLLAGQAWADAAGGTSGALWGAALTSAGGVFSDTDGADEQAAVDAICAGIDAI
LRLGGAQPGDKTMVDAAVPFRDALVKAFDTQAGPAITSAARVAREAAEKTADITARRGRARVLGEKSVGTPDPGALSFAM
LMKALGEHLTR
>Q6D8V6 2.7.1.209~~~lerK~~~L-erythrulose kinase~~~COG2376
MTYLFNQPSSFARELTEGFVAAHADKVRQVPGGVVRSTRSREGGVAIVVGGGSGHYPAFAGLVGQGLAHGAAMGNLFASP
SAQQICSVARAAHNGGGVLLTFGNYAGDVLHFGQAKARLNAEGIPCELLAVTDDISSAPLNEWQKRRGVAGDLMVFKAVS
AAAEAGYDLAAVLEVAERANQRTRSLGVAFSGCTLPGAEHPLFTVPEGMMAVGMGIHGEPGIRDVPISTADELAELLVSS
LLKEVPHGITTLSGQRISVVLNGLGGVKYEELFVVYRRVSQLLVEQGLTVVEPEVGELVTSFNMAGLSLTLFWLDEELER
FWRAPADAPAFRKGSMSPGEPLAERTFVAELEVIPNATAASKAAAHCVAAALNAARDIVLANVTELGRIDAIAGDGDHGI
GMERGVIAAADKATEMLERQAGAGTLLQRAADAWADQAGGTSGAIWGVALNALGTVLGDEQRPDGRRVADGVRQAKESVM
HFGKAKPGDKTLVDALIPFSLALTQRVETGMSLPEAWQQAAQCAQQAADDTAQLLPKIGRARPLAEKSLGTPDAGAISLA
MILDAVSAVLNSDTTSTSSHQTATQAESER
>Q8F445 2.3.3.13~~~leuA1~~~2-isopropylmalate synthase 1~~~
MDLKDYVRIFDTTLRDGEQCPGAAMTENEKLEIASQLATMKVDIIEAGFPVSSPVQFQAVERIARETEGPMIAALARAMK
ADIEAASKALQPAKKRRIHTFIASSPIHMKYKLGKEPKEVLKMAVEAVTLCRQFVDDVEFSPEDATRSEPEFLRELCEAV
IAAGATTINIPDTVGYTTPAEYGGLFKFLLSNVRGAEKIIFSAHCHNDLGLATANSLAAVQNGARQIECTINGIGERAGN
TAMEEVVMAMRTRKDTFGIQTQIKTEEIARASYLVKTITGMLVQPNKAIVGANAFAHESGIHQDGVIKHRETYEIMKPET
VGLSSNRMVLGRHSGRAGFKDRIVKLGFSPQVEELEAAYQRFLEIADRKKEIYDEDIRALFSEEARKSTGDRFQLEGFTV
STGTKSTPTAGVRILIDGHVREESATGDGPVDAIYKAIQKTTGMDPEVSRLVISPVTEGQDAMAEASVTLEYKGDRVVGK
GSSTDIIEACSRAYISALNRL
>Q8F8T4 2.3.3.13~~~leuA2~~~2-isopropylmalate synthase 2~~~
MKQDSQSENESIVCDLSVSEDQRNVKFFSDLQTPIPKHLPFFMDVTLRDGNQALRRPWNLEQKETIFKQLLKLGVQGIEV
GFASSNNQEFEACKYLSSIAPDNVVISSLSRAVEKEIEVSWKAIRFAPKPRIHIVYPVSAFTIQNVLKISPEKVLDRISQ
SVAYAKSLVGSKGEVQFSGEHFGDSLENLDFAAEAFQIALNNGADVVNLPNTVERYRPWLFVSMVKAVANLLPEDTRISI
HTHNDLGMATATTVESYFAGAVQLETALNGLGERAGNTNTYEVAIALHNCGVEVPLNFSTIYETSRLVSYLSEIPIYEKA
PLIGEDVISHRSGIHQDGVAKTRHLQKGAYRAFDAALIGRPEGDRIEFTNQSGKSAVYCILKDAGENITLEEAGRLQPIL
KKISEDLGRRELTLEEIRIEWNRLLRAI
>P42455 2.3.3.13~~~leuA~~~2-isopropylmalate synthase~~~COG0119
MSPNDAFISAPAKIETPVGPRNEGQPAWNKQRGSSMPVNRYMPFEVEVEDISLPDRTWPDKKITVAPQWCAVDLRDGNQA
LIDPMSPERKRRMFELLVQMGFKEIEVGFPSASQTDFDFVREIIEKGMIPDDVTIQVLVQAREHLIRRTFEACEGAKNVI
VHFYNSTSILQRNVVFRMDKVQVKKLATDAAELIKTIAQDYPDTNWRWQYSPESFTGTEVEYAKEVVDAVVEVMDPTPEN
PMIINLPSTVEMITPNVYADSIEWMHRNLNRRDSIILSLHPHNDRGTGVGAAELGYMAGADRIEGCLFGNGERTGNVCLV
TLALNMLTQGVDPQLDFTDIRQIRSTVEYCNQLRVPERHPYGGDLVFTAFSGSHQDAVNKGLDAMAAKVQPGASSTEVSW
EQLRDTEWEVPYLPIDPKDVGRDYEAVIRVNSQSGKGGVAYIMKTDHGLQIPRSMQVEFSTVVQNVTDAEGGEVNSKAMW
DIFATEYLERTAPVEQIALRVENAQTENEDASITAELIHNGKDVTVDGRGNGPLAAYANALEKLGIDVEIQEYNQHARTS
GDDAEAAAYVLAEVNGRKVWGVGIAGSITYASLKAVTSAVNRALDVNHEAVLAGGV
>P09151 2.3.3.13~~~leuA~~~2-isopropylmalate synthase~~~COG0119
MSQQVIIFDTTLRDGEQALQASLSVKEKLQIALALERMGVDVMEVGFPVSSPGDFESVQTIARQVKNSRVCALARCVEKD
IDVAAESLKVAEAFRIHTFIATSPMHIATKLRSTLDEVIERAIYMVKRARNYTDDVEFSCEDAGRTPIADLARVVEAAIN
AGATTINIPDTVGYTMPFEFAGIISGLYERVPNIDKAIISVHTHDDLGLAVGNSLAAVHAGARQVEGAMNGIGERAGNCS
LEEVIMAIKVRKDILNVHTAINHQEIWRTSQLVSQICNMPIPANKAIVGSGAFAHSSGIHQDGVLKNRENYEIMTPESIG
LNQIQLNLTSRSGRAAVKHRMDEMGYKESEYNLDNLYDAFLKLADKKGQVFDYDLEALAFIGKQQEEPEHFRLDYFSVQS
GSNDIATAAVKLACGEEVKAEAANGNGPVDAVYQAINRITEYNVELVKYSLTAKGHGKDALGQVDIVANYNGRRFHGVGL
ATDIVESSAKAMVHVLNNIWRAAEVEKELQRKAQHNENNKETV
>Q71Y35 2.3.3.13~~~leuA~~~2-isopropylmalate synthase~~~
MKKIQFFDTTLRDGEQTPGVNFDVKEKIQIALQLEKLGIDVIEAGFPISSPGDFECVKAIAKAIKHCSVTGLARCVEGDI
DRAEEALKDAVSPQIHIFLATSDVHMEYKLKMSRAEVLASIKHHISYARQKFDVVQFSPEDATRSDRAFLIEAVQTAIDA
GATVINIPDTVGYTNPTEFGQLFQDLRREIKQFDDIIFASHCHDDLGMATANALAAIENGARRVEGTINGIGERAGNTAL
EEVAVALHIRKDFYQAETNIVLNQFKNSSDLISRLSGMPVPRNKAVIGGNAYAHESGIHQDGVLKNPDTYEIITPALVGV
DKNSLPLGKLSGKHAFNTRMEEMGYTLTEQEQKDAFKRFKQLADAKKEVTEEDLHALILGQSSESADDFELKHLQVQYVT
GGVQGAIVRIEERDGALVEDAATGSGSIEAIYNTINRLMKQDIELTDYRIQAITAGQDAQAEVHVVIKNDKGAVFHGIGI
DFDVLTASAKAYLQASGKSKTASKQADFEEVK
>P9WQB3 2.3.3.13~~~leuA~~~2-isopropylmalate synthase~~~COG0119
MTTSESPDAYTESFGAHTIVKPAGPPRVGQPSWNPQRASSMPVNRYRPFAEEVEPIRLRNRTWPDRVIDRAPLWCAVDLR
DGNQALIDPMSPARKRRMFDLLVRMGYKEIEVGFPSASQTDFDFVREIIEQGAIPDDVTIQVLTQCRPELIERTFQACSG
APRAIVHFYNSTSILQRRVVFRANRAEVQAIATDGARKCVEQAAKYPGTQWRFEYSPESYTGTELEYAKQVCDAVGEVIA
PTPERPIIFNLPATVEMTTPNVYADSIEWMSRNLANRESVILSLHPHNDRGTAVAAAELGFAAGADRIEGCLFGNGERTG
NVCLVTLGLNLFSRGVDPQIDFSNIDEIRRTVEYCNQLPVHERHPYGGDLVYTAFSGSHQDAINKGLDAMKLDADAADCD
VDDMLWQVPYLPIDPRDVGRTYEAVIRVNSQSGKGGVAYIMKTDHGLSLPRRLQIEFSQVIQKIAEGTAGEGGEVSPKEM
WDAFAEEYLAPVRPLERIRQHVDAADDDGGTTSITATVKINGVETEISGSGNGPLAAFVHALADVGFDVAVLDYYEHAMS
AGDDAQAAAYVEASVTIASPAQPGEAGRHASDPVTIASPAQPGEAGRHASDPVTSKTVWGVGIAPSITTASLRAVVSAVN
RAAR
>Q9JZG1 2.3.3.13~~~leuA~~~2-isopropylmalate synthase~~~
MTQTNRVIIFDTTLRDGEQSPGAAMTKEEKIRVARQLEKLGVDIIEAGFAAASPGDFEAVNAIAKTITKSTVCSLSRAIE
RDIRQAGEAVAPAPKKRIHTFIATSPIHMEYKLKMKPKQVIEAAVKAVKIAREYTDDVEFSCEDALRSEIDFLAEICGAV
IEAGATTINIPDTVGYSIPYKTEEFFRELIAKTPNGGKVVWSAHCHNDLGLAVANSLAALKGGARQVECTVNGLGERAGN
ASVEEIVMALKVRHDLFGLETGIDTTQIVPSSKLVSTITGYPVQPNKAIVGANAFSHESGIHQDGVLKHRETYEIMSAES
VGWATNRLSLGKLSGRNAFKTKLADLGIELESEEALNAAFARFKELADKKREIFDEDLHALVSDEMGSMNAESYKFISQK
ISTETGEEPRADIVFSIKGEEKRASATGSGPVDAIFKAIESVAQSGAALQIYSVNAVTQGTESQGETSVRLARGNRVVNG
QGADTDVLVATAKAYLSALSKLEFSAAKPKAQGSGTI
>P15875 2.3.3.13~~~leuA~~~2-isopropylmalate synthase~~~
MSQQVIIFDTTLRDGEQALQASLSAKEKLQIALALERMGVDVMEVGFPVSSPGDFESVQTIARTIKNSRVCALARCVEKD
IDVAAQALKVADAFRIHTFIATSPMHIATKLRSTLDEVIERAVYMVKRARNYTDDVEFSCEDAGRTPVDDLARVVEAAIN
AGARTINIPDTVGYTMPFEFAGIISGLYERVPNIDKAIISVHTHDDLGIAVGNSLAAVHAGARQVEGAMNGIGERAGNCA
LEEVIMAIKVRKDIMNVHTNINHHEIWRTSQTVSQICNMPIPANKAIVGSGAFAHSSGIHQDGVLKNRENYEIMTPESIG
LNQIQLNLTSRSGRAAVKHRMEEMGYKDTDYNMDHLYDAFLKLADKKGQVFDYDLEALAFINKQQEEPEHFRLDYFSVQS
GSSDIATASVKLACGEEIKAEAANGNGPVDAIYQAINRITGYDVELVKYDLNAKGQGKDALGQVDIVVNHHGRRFHGVGL
ATDIVESSAKAMVHVLNNIWRAAEVEKELQRKAQNKENNKETV
>Q56268 1.1.1.85~~~leuB~~~3-isopropylmalate dehydrogenase~~~
MKKIAIFAGDGIGPEIVAAARQVLDAVDQAAHLGLRCTEGLVGGAALDASDDPLPAASLQLAMAADAVILGAVGGPRWDA
YPPAKRPEQGLLRLRKGLDLYANLRPAQIFPQLLDASPLRPELVRDVDILVVRELTGDIYFGQPRGLEVIDGKRRGFNTM
VYDEDEIRRIAHVAFRAAQGRRKQLCSVDKANVLETTRLWREVVTEVARDYPDVRLSHMYVDNAAMQLIRAPAQFDVLLT
GNMFGDILSDEASQLTGSIGMLPSASLGEGRAMYEPIHGSAPDIAGQDKANPLATILSVAMMLRHSLNAEPWAQRVEAAV
QRVLDQGLRTADIAAPGTPVIGTKAMGAAVVNALNLKD
>Q2T7H6 1.1.1.85~~~leuB~~~3-isopropylmalate dehydrogenase~~~
MKIAVLPGDGIGPEIVNEAVKVLNALDEKFELEHAPVGGAGYEASGHPLPDATLALAKEADAILFGAVGDWKYDSLERAL
RPEQAILGLRKHLELFANFRPAICYPQLVDASPLKPELVAGLDILIVRELNGDIYFGQPRGVRAAPDGPFAGEREGFDTM
RYSEPEVRRIAHVAFQAAQKRAKKLLSVDKSNVLETSQFWRDVMIDVSKEYADVELSHMYVDNAAMQLAKAPKQFDVIVT
GNMFGDILSDEASMLTGSIGMLPSASLDKNNKGLYEPSHGSAPDIAGKGIANPLATILSAAMLLRYSLNRAEQADRIERA
VKTVLEQGYRTGDIATPGCRQVGTAAMGDAVVAAL
>Q9PLW0 1.1.1.85~~~leuB~~~3-isopropylmalate dehydrogenase~~~COG0473
MKTYKVAVLAGDGIGPLVMKEALKILTFIAQKYNFSFEFNEAKIGGASIDAYGVALSDETLKLCEQSDAILFGSVGGPKW
DNLPIDQRPERASLLPLRKHFNLFANLRPCKIYESLTHASPLKNEIIQKGVDILCVRELTGGIYFGKQDLGKESAYDTEI
YTKKEIERIARIAFESARIRKKKVHLIDKANVLASSILWREVVANVAKDYQDINLEYMYVDNAAMQIVKNPSIFDVMLCS
NLFGDILSDELAAINGSLGLLSSASLNDKGFGLYEPAGGSAPDIAHLNIANPIAQILSAALMLKYSFKEEQAAQDIENAI
SLALAQGKMTKDLNAKSYLNTDEMGDCILEILKENDNG
>P30125 1.1.1.85~~~leuB~~~3-isopropylmalate dehydrogenase~~~COG0473
MSKNYHIAVLPGDGIGPEVMTQALKVLDAVRNRFAMRITTSHYDVGGAAIDNHGQPLPPATVEGCEQADAVLFGSVGGPK
WEHLPPDQQPERGALLPLRKHFKLFSNLRPAKLYQGLEAFCPLRADIAANGFDILCVRELTGGIYFGQPKGREGSGQYEK
AFDTEVYHRFEIERIARIAFESARKRRHKVTSIDKANVLQSSILWREIVNEIATEYPDVELAHMYIDNATMQLIKDPSQF
DVLLCSNLFGDILSDECAMITGSMGMLPSASLNEQGFGLYEPAGGSAPDIAGKNIANPIAQILSLALLLRYSLDADDAAC
AIERAINRALEEGIRTGDLARGAAAVSTDEMGDIIARYVAEGV
>P43860 1.1.1.85~~~leuB~~~3-isopropylmalate dehydrogenase~~~COG0473
MQSYNIAVLAGDGIGPEVMAEAIKVLNRVQEKFGFKLNFNEFFVGGAAIEHCGYPLPAETLKGCDQADAILFGSVGGPKW
TNLPPDQQPERGALLPLRKHFKLFCNLRPATLYKGLEKFCPLRADIAAKGFDMVVVRELTGGIYFGQPKGREGDGVQTKA
FDTEVYYKYEIERIARAAFEAAMKRNKKVTSVDKANVLQSSILWRETVTEMAKDYPEVTLEHIYIDNATMQLIKSPESFD
VLLCSNIFGDIISDEAAMITGSMGMLPSASLNEEGFGLYEPAGGSAPDIAGKGIANPIAQILSAAMMLRYSFNLNEAADA
IESAVQKVLASGHRTADLADDSTPVSTAEMGTLITQAI
>P9WKK9 1.1.1.85~~~leuB~~~3-isopropylmalate dehydrogenase~~~COG0473
MKLAIIAGDGIGPEVTAEAVKVLDAVVPGVQKTSYDLGARRFHATGEVLPDSVVAELRNHDAILLGAIGDPSVPSGVLER
GLLLRLRFELDHHINLRPARLYPGVASPLSGNPGIDFVVVREGTEGPYTGNGGAIRVGTPNEVATEVSVNTAFGVRRVVA
DAFERARRRRKHLTLVHKTNVLTFAGGLWLRTVDEVGECYPDVEVAYQHVDAATIHMITDPGRFDVIVTDNLFGDIITDL
AAAVCGGIGLAASGNIDATRANPSMFEPVHGSAPDIAGQGIADPTAAIMSVALLLSHLGEHDAAARVDRAVEAHLATRGS
ERLATSDVGERIAAAL
>P37412 1.1.1.85~~~leuB~~~3-isopropylmalate dehydrogenase~~~
MSKNYHIAVLPGDGIGPEVMAQALKVMDAVRSRFDMRITTSHYDVGGIAIDNHGHPLPKATVEGCEQADAILFGSVGGPK
WENLPPESQPERGALLPLRKHFKLFSNLRPAKLYQGLEAFCPLRADIAANGFDILCVRELTGGIYFGQPKGREGSGQYEK
AFDTEVYHRFEIERIARIAFESARKRRRKVTSIDKANVLQSSILWREIVNDVAKTYPDVELAHMYIDNATMQLIKDPSQF
DVLLCSNLFGDILSDECAMITGSMGMLPSASLNEQGFGLYEPAGGSAPDIAGKNIANPIAQILSLALLLRYSLDANDAAT
AIEQAINRALEEGVRTGDLARGAAAVSTDEMGDIIARYVAEGV
>Q8E9N3 1.1.1.85~~~leuB~~~3-isopropylmalate dehydrogenase~~~COG0473
MSYQIAVLAGDGIGPEVMAEARKVLKAVEARFGLNIEYTEYDVGGIAIDNHGCPLPEATLKGCEAADAILFGSVGGPKWE
KLPPNEQPERGALLPLRGHFELFCNLRPAKLHDGLEHMSPLRSDISARGFDVLCVRELTGGIYFGKPKGRQGEGESEEAF
DTMRYSRREISRIARIAFEAARGRRKKVTSVDKANVLACSVLWRQVVEEVAVDFPDVELEHIYIDNATMQLLRRPDEFDV
MLCSNLFGDILSDEIAMLTGSMGLLSSASMNSTGFGLFEPAGGSAPDIAGKGIANPIAQILSAALMLRHSLKQEEAASAI
ERAVTKALNSGYLTGELLSSDQRHKAKTTVQMGDFIADAVKAGV
>Q9WZ26 1.1.1.85~~~leuB~~~3-isopropylmalate dehydrogenase~~~COG0473
MKIAVLPGDGIGPEVVREALKVLEVVEKKTGKTFEKVFGHIGGDAIDRFGEPLPEETKKICLEADAIFLGSVGGPKWDDL
PPEKRPEIGGLLALRKMLNLYANIRPIKVYRSLVHVSPLKEKVIGSGVDLVTVRELSYGVYYGQPRGLDEEKGFDTMIYD
RKTVERIARTAFEIAKNRRKKVTSVDKANVLYSSMLWRKVVNEVAREYPDVELTHIYVDNAAMQLILKPSQFDVILTTNM
FGDILSDESAALPGSLGLLPSASFGDKNLYEPAGGSAPDIAGKNIANPIAQILSLAMMLEHSFGMVEEARKIERAVELVI
EEGYRTRDIAEDPEKAVSTSQMGDLICKKLEEIW
>Q5SIY4 1.1.1.85~~~leuB~~~3-isopropylmalate dehydrogenase~~~COG0473
MKVAVLPGDGIGPEVTEAALKVLRALDEAEGLGLAYEVFPFGGAAIDAFGEPFPEPTRKGVEEAEAVLLGSVGGPKWDGL
PRKIRPETGLLSLRKSQDLFANLRPAKVFPGLERLSPLKEEIARGVDVLIVRELTGGIYFGEPRGMSEAEAWNTERYSKP
EVERVARVAFEAARKRRKHVVSVDKANVLEVGEFWRKTVEEVGRGYPDVALEHQYVDAMAMHLVRSPARFDVVVTGNIFG
DILSDLASVLPGSLGLLPSASLGRGTPVFEPVHGSAPDIAGKGIANPTAAILSAAMMLEHAFGLVELARKVEDAVAKALL
ETPPPDLGGSAGTEAFTATVLRHLA
>P61495 1.1.1.85~~~leuB~~~3-isopropylmalate dehydrogenase~~~
MKVAVLPGDGIGPEVTEAALKVLRALDEAEGLGLAYEVFPFGGAAIDAFGEPFPEPTRKGVEEAEAVLLGSVGGPKWDGL
PRKISPETGLLSLRKSQDLFANLRPAKVFPGLERLSPLKEEIARGVDVLIVRELTGGIYFGEPRGMSEAEAWNTERYSKP
EVERVARVAFEAARKRRKHVVSVDKANVLEVGEFWRKTVEEVGRGYPDVALEHQYVDAAAMHLVRSPARFDVVVTGNIFG
DILSDLASVLPGSLGLLPSASLGRGTPVFEPVHGSAPDIAGKGIANPTAAILSAAMMLEHAFGLVELARKVEDAVAKALL
ETPPPDLGGSAGTEAFTATVLRHLA
>P12010 1.1.1.85~~~leuB~~~3-isopropylmalate dehydrogenase~~~
MKMKLAVLPGDGIGPEVMDAAIRVLKTVLDNDGHEAVFENALIGGAAIDEAGTPLPEETLDICRRSDAILLGAVGGPKWD
HNPASLRPEKGLLGLRKEMGLFANLRPVKAYATLLNASPLKRERVENVDLVIVRELTGGLYFGRPSERRGPGENEVVDTL
AYTREEIERIIEKAFQLAQIRRKKLASVDKANVLESSRMWREIAEETAKKYPDVELSHMLVDSTSMQLIANPGQFDVIVT
ENMFGDILSDEASVITGSLGMLPSASLRSDRFGMYEPVHGSAPDIAGQGKANPLGTVLSAALMLRYSFGLEKEAAAIEKA
VDDVLQDGYCTGDLQVANGKVVSTIELTDRLIEKLNNSAARPRIFQ
>P15717 4.2.1.33~~~leuC1~~~3-isopropylmalate dehydratase large subunit 1~~~
MAKTLYEKLFDAHVVFEAPNETPLLYIDRHLVHEVTSPQAFDGLRAHHRPVRQPGKTFATMDHNVSTQTKDINASGEMAR
IQMQELIKNCNEFGVELYDLNHPYQGIVHVMGPEQGVTLPGMTIVCGDSHTATHGAFGALAFGIGTSEVEHVLATQTLKQ
GRAKTMKIEVTGNAAPGITAKDIVLAIIGKTGSAGGTGHVVEFCGDAIRALSMEGRMTLCNMAIEMGAKAGLVAPDETTF
NYVKGRLHAPKGRDFDEAVEYWKTLKTDDGATFDTVVALRAEEIAPQVTWGTNPGQVISVTDIIPDPASFSDPVERASAE
KALAYMGLQPGVPLTDVAIDKVFIGSCTNSRIEDLRAAAEVAKGRKVAPGVQALVVPGSGPVKAQAEAEGLDKIFIEAGF
EWRLPGCSMCLAMNNDRLNPGERCASTSNRNFEGRQGRGGRTHLVSPAMAAAAAVTGHFADIRSIK
>P80858 4.2.1.33~~~leuC~~~3-isopropylmalate dehydratase large subunit~~~COG0065
MMPRTIIEKIWDQHIVKHGEGKPDLLYIDLHLIHEVTSPQAFEGLRQKGRKVRRPQNTFATMDHNIPTVNRFEIKDEVAK
RQVTALERNCEEFGVRLADLHSVDQGIVHVVGPELGLTLPGKTIVCGDSHTSTHGAFGALAFGIGTSEVEHVLSTQTLWQ
QRPKTLEVRVDGTLQKGVTAKDVILAVIGKYGVKFGTGYVIEYTGEVFRNMTMDERMTVCNMSIEAGARAGLIAPDEVTF
EYCKNRKYTPKGEEFDKAVEEWKALRTDPGAVYDKSIVLDGNKISPMVTWGINPGMVLPVDSEVPAPESFSAEDDKKEAI
RAYEYMGLTPHQKIEDIKVEHVFIGSCTNSRMTDLRQAADMIKGKKVADSVRAIVVPGSQSVKLQAEKEGLDQIFLEAGF
EWRESGCSMCLSMNNDVVPEGERCASTSNRNFEGRQGKGARTHLVSPAMAAMAAIHGHFVDVRKFYQEKTVV
>P58946 4.2.1.33~~~leuC~~~3-isopropylmalate dehydratase large subunit~~~COG0065
MTSPVENSTSTEKLTLAEKVWRDHVVSKGENGEPDLLYIDLQLLHEVTSPQAFDGLRMTGRKLRHPELHLATEDHNVPTE
GIKTGSLLEINDKISRLQVSTLRDNCEEFGVRLHPMGDVRQGIVHTVGPQLGATQPGMTIVCGDSHTSTHGAFGSMAFGI
GTSEVEHVMATQTLPLKPFKTMAIEVTGELQPGVSSKDLILAIIAKIGTGGGQGYVLEYRGEAIRKMSMDARMTMCNMSI
EAGARAGMIAPDQTTFDYVEGREMAPKGADWDEAVAYWKTLPTDEGATFDKVVEIDGSALTPFITWGTNPGQGLPLGESV
PSPEDFTNDNDKAAAEKALQYMDLVPGTPLRDIKIDTVFLGSCTNARIEDLQIAADILKGHKIADGMRMMVVPSSTWIKQ
EAEALGLDKIFTDAGAEWRTAGCSMCLGMNPDQLKPGERSASTSNRNFEGRQGPGGRTHLVSPAVAAATAIRGTLSSPAD
I
>Q726X4 4.2.1.33~~~leuC~~~3-isopropylmalate dehydratase large subunit~~~COG0065
MAHTLAQKILQRHTDEAITDAGQIVRCRVSMVLANDITAPLAIKSFRAMGAKRVFDKDRVALVMDHFTPQKDIEAAQQVK
LTREFAREMGVTHYYEGGDCGVEHALLPELGLVGPGDVVVGADSHTCTYGGLGAFATGLGSTDVAGAMALGETWFKVPPT
IRATFTGTLPAYVGAKDLILTLIGAIGVDGALYRALEFDGAAIEALDVEGRMTMANMAIEAGGKAGLFAADAKTLTYCTT
AGRTGDTAFSADAGAVYERELSFDVTGMTPVVACPHLPDNVKPVSEVKDVTVQQVVIGSCTNGRIGDLREAAAVLRGRKV
SRDVRCIVLPATPGIWRQALREGLIETFMEAGCIVGPATCGPCLGGHMGILADGERAIATTNRNFKGRMGSLESEVYLSG
PATAAASAVTGVITDPSTL
>P0A6A6 4.2.1.33~~~leuC~~~3-isopropylmalate dehydratase large subunit~~~COG0065
MAKTLYEKLFDAHVVYEAENETPLLYIDRHLVHEVTSPQAFDGLRAHGRPVRQPGKTFATMDHNVSTQTKDINACGEMAR
IQMQELIKNCKEFGVELYDLNHPYQGIVHVMGPEQGVTLPGMTIVCGDSHTATHGAFGALAFGIGTSEVEHVLATQTLKQ
GRAKTMKIEVQGKAAPGITAKDIVLAIIGKTGSAGGTGHVVEFCGEAIRDLSMEGRMTLCNMAIEMGAKAGLVAPDETTF
NYVKGRLHAPKGKDFDDAVAYWKTLQTDEGATFDTVVTLQAEEISPQVTWGTNPGQVISVNDNIPDPASFADPVERASAE
KALAYMGLKPGIPLTEVAIDKVFIGSCTNSRIEDLRAAAEIAKGRKVAPGVQALVVPGSGPVKAQAEAEGLDKIFIEAGF
EWRLPGCSMCLAMNNDRLNPGERCASTSNRNFEGRQGRGGRTHLVSPAMAAAAAVTGHFADIRNIK
>Q7TXH6 4.2.1.33~~~leuC~~~3-isopropylmalate dehydratase large subunit~~~
MALQTGEPRTLAEKIWDDHIVVSGGGCAPDLIYIDLHLVHEVTSPQAFDGLRLAGRRVRRPELTLATEDHNVPTVDIDQP
IADPVSRTQVETLRRNCAEFGIRLHSMGDIEQGIVHVVGPQLGLTQPGMTIVCGDSHTSTHGAFGALAMGIGTSEVEHVL
ATQTLPLRPFKTMAVNVDGRLPDGVSAKDIILALIAKIGTGGGQGHVIEYRGSAIESLSMEGRMTICNMSIEAGARAGMV
APDETTYAFLRGRPHAPTGAQWDTALVYWQRLRTDVGAVFDTEVYLDAASLSPFVTWGTNPGQGVPLAAAVPDPQLMTDD
AERQAAEKALAYMDLRPGTAMREIAVDAVFVGSCTNGRIEDLRVVAEVLRGRKVADGVRMLIVPGSMRVRAQAEAEGLGE
IFTDAGAQWRQAGCSMCLGMNPDQLASGERCAATSNRNFEGRQGAGGRTHLVSPAVAAATAVRGTLSSPADLN
>P9WQF5 4.2.1.33~~~leuC~~~3-isopropylmalate dehydratase large subunit~~~COG0065
MALQTGEPRTLAEKIWDDHIVVSGGGCAPDLIYIDLHLVHEVTSPQAFDGLRLAGRRVRRPELTLATEDHNVPTVDIDQP
IADPVSRTQVETLRRNCAEFGIRLHSMGDIEQGIVHVVGPQLGLTQPGMTIVCGDSHTSTHGAFGALAMGIGTSEVEHVL
ATQTLPLRPFKTMAVNVDGRLPDGVSAKDIILALIAKIGTGGGQGHVIEYRGSAIESLSMEGRMTICNMSIEAGARAGMV
APDETTYAFLRGRPHAPTGAQWDTALVYWQRLRTDVGAVFDTEVYLDAASLSPFVTWGTNPGQGVPLAAAVPDPQLMTDD
AERQAAEKALAYMDLRPGTAMRDIAVDAVFVGSCTNGRIEDLRVVAEVLRGRKVADGVRMLIVPGSMRVRAQAEAEGLGE
IFTDAGAQWRQAGCSMCLGMNPDQLASGERCAATSNRNFEGRQGAGGRTHLVSPAVAAATAVRGTLSSPADLN
>P04787 4.2.1.33~~~leuD1~~~3-isopropylmalate dehydratase small subunit 1~~~
MAEKFTQHTGLVVPLDAANVDTDAIIPKQFLQKVTRTGFGAHLFNDWRFLDEKGQQPNPEFVLNFPEYQGASILLARENF
GCGSSREHAPWALTDYGFKVVIAPSFADIFYGNSFNNQLLPVTLSDAQVDELFALVKANPGIKFEVDLEAQVVKAGDKTY
SFKIDDFRRHCMLNGLDSIGLTLQHEDAIAAYENKQPAFMR
>Q8NQV7 4.2.1.33~~~leuD~~~3-isopropylmalate dehydratase small subunit~~~COG0066
MEKFTTYTGVGVPLQRSNVDTDQIIPAVYLKRVTRTGFEDGLFSNWRQNDPNFVLNTDTYKNGSVLVAGPDFGTGSSREH
AVWALMDYGFRAVFSSRFADIFRGNSGKAGMLTGIMEQSDIELLWKLMEQTPGLELTVNLEKQIVTAGDVVISFEVDPYI
RWRLMEGLDDAGLTLRKLDEIEDYEAKRPAFKPRTNA
>Q726X3 4.2.1.33~~~leuD~~~3-isopropylmalate dehydratase small subunit~~~COG0066
MRYAGTAHKVGDHIDTDAIIPARFLVTTDAQKLGENCMEGLEHGWVARVKSGDIMVGGRNFGCGSSREHAPIAILGAGMP
VVVAHSFARIFYRNGFNMGLLLLEVGDDVDKIADGDDIEVDAASGVITNRTTGATITCAPVPQSMRELLDTGGLVPYVRA
RLERENG
>P30126 4.2.1.33~~~leuD~~~3-isopropylmalate dehydratase small subunit~~~COG0066
MAEKFIKHTGLVVPLDAANVDTDAIIPKQFLQKVTRTGFGAHLFNDWRFLDEKGQQPNPDFVLNFPQYQGASILLARENF
GCGSSREHAPWALTDYGFKVVIAPSFADIFYGNSFNNQLLPVKLSDAEVDELFALVKANPGIHFDVDLEAQEVKAGEKTY
RFTIDAFRRHCMMNGLDSIGLTLQHDDAIAAYEAKQPAFMN
>P65278 4.2.1.33~~~leuD~~~3-isopropylmalate dehydratase small subunit~~~
MEAFHTHSGIGVPLRRSNVDTDQIIPAVFLKRVTRTGFEDGLFAGWRSDPAFVLNLSPFDRGSVLVAGPDFGTGSSREHA
VWALMDYGFRVVISSRFGDIFRGNAGKAGLLAAEVAQDDVELLWKLIEQSPGLEITANLQDRIITAATVVLPFKIDDHSA
WRLLEGLDDIALTLRKLDEIEAFEGACAYWKPRTLPAP
>P9WK95 4.2.1.33~~~leuD~~~3-isopropylmalate dehydratase small subunit~~~COG0066
MEAFHTHSGIGVPLRRSNVDTDQIIPAVFLKRVTRTGFEDGLFAGWRSDPAFVLNLSPFDRGSVLVAGPDFGTGSSREHA
VWALMDYGFRVVISSRFGDIFRGNAGKAGLLAAEVAQDDVELLWKLIEQSPGLEITANLQDRIITAATVVLPFKIDDHSA
WRLLEGLDDIALTLRKLDEIEAFEGACAYWKPRTLPAP
>Q8DTG5 4.2.1.33~~~leuD~~~3-isopropylmalate dehydratase small subunit~~~COG0066
MEEFTIYTGTTVPLMNDNIDTDQILPKQFLKLIDKKGFGKYLMYEWRYLDNNYTENPDFIFNQPEYREASILITGDNFGA
GSSREHAAWALADYGFKVIVAGSFGDIHYNNDLNNGILPIIQPKEVRDKLAKLKPTDEVTVNLFEQKIYSPVGDFSFDID
GEWKHKLLNGLDDIGITLQYEDLIAQYEQNRPSYWH
>P74207 4.2.1.33~~~leuD~~~3-isopropylmalate dehydratase small subunit~~~COG0066
MSQVKQIQGKALPLVGDDIDTDRIIPARFLRCVTFDGLGEHVFADDRQQQGGNHPFDLSQYQDATVLVVNRNFGCGSSRE
HAPQAIIKWGIKAIIGESFAEIFLGNCLANGVPCVTAPHGQIADLQQAITADPNLAVNLDLTTAAVTYGDRSFPVILSDG
AQQMLLDGQWDTCGQLVQNQGKIAATAEKLPYLHWQTSAA
>P76249 ~~~leuE~~~Leucine efflux protein~~~COG1280
MFAEYGVLNYWTYLVGAIFIVLVPGPNTLFVLKNSVSSGMKGGYLAACGVFIGDAVLMFLAWAGVATLIKTTPILFNIVR
YLGAFYLLYLGSKILYATLKGKNSEAKSDEPQYGAIFKRALILSLTNPKAILFYVSFFVQFIDVNAPHTGISFFILAATL
ELVSFCYLSFLIISGAFVTQYIRTKKKLAKVGNSLIGLMFVGFAARLATLQS
>P10151 ~~~leuO~~~HTH-type transcriptional regulator LeuO~~~COG0583
MPEVQTDHPETAELSKPQLRMVDLNLLTVFDAVMQEQNITRAAHVLGMSQPAVSNAVARLKVMFNDELFVRYGRGIQPTA
RAFQLFGSVRQALQLVQNELPGSGFEPASSERVFHLCVCSPLDSILTSQIYNHIEQIAPNIHVMFKSSLNQNTEHQLRYQ
ETEFVISYEDFHRPEFTSVPLFKDEMVLVASKNHPTIKGPLLKHDVYNEQHAAVSLDRFASFSQPWYDTVDKQASIAYQG
MAMMSVLSVVSQTHLVAIAPRWLAEEFAESLELQVLPLPLKQNSRTCYLSWHEAAGRDKGHQWMEEQLVSICKR
>O07003 3.2.1.64~~~levB~~~Levanbiose-producing levanase~~~COG1621
MNYIKAGKWLTVFLTFLGILLFIDLFPKEEHDQKTKSKQKPDYRAAYHFTTPDKWKNDPQKPIYFDGKYHYFYLYNRDYP
KGNGTEWRHAVSEDLVHWTDEGVAIPKYTNPDGDIWTGSVVVDKENTAGFGKNALVAIVTQPSAKDKKQEQYLWYSTDKG
KSFKFYSGNPVMPNPGTDDFRDPKVIWDDQDNKWVMVMAEGSKIGFYESDNLKDWHYTSGFFPEQAGMVECPDLYMMRAS
DGTNKWVLGASANGKPWGKPNTYAYWTGSFDGKEFKADQTEAQWLDYGFDWYGGVTFEDSKSTDPLEKRYALAWMNNWDY
ANNTPTMKNGFNGTDSVIRELRLKEQDGTYSLVSQPIEALEQLTVSTDEIEDQDVNGSKTLSITGDTYQLDTDLSWSELK
NAGVRLRESEDQKRHIDVGIFAEGGYAYVNRAATNQPDKSNTYVESKAPYDVNKRKVHLKILVDKTTIEVFVGDGKTVFS
NEVFPKPEDKGITLYSDGGTASFKNITVKHFDSIHE
>Q70XJ9 2.4.1.10~~~levS~~~Levansucrase~~~
MTKEHKKMYKAGKYWAVATLVSASILMEVGVTTHADAVENNKYDGTANVNIDCQANVDGKIISTDDNATSGSTKQESSIA
NDNATSGSTKQESSIANDNATSGSTKQESSIANDNATSGSTKQESSVANDNATSGSTKQESSVANDNATSGSTKQESSVA
NDNATSGSTKQESSVANDTKTAVVDESKNTSNTENDNSQLKQTNNEQPSAATQANLKKLNHEAAKAVQNAKIDAGSLTDE
QINELNKINFSKSAEKGAKLTFKDLEGIGNAIVKQDPQYAVPYFNAKEIKNMPASYTVDAQTGKMAHLDVWDSWPVQDPT
GYVSNYKGYQLVIAMMGIPNTPNGDNHIYLLYNKYGDNDFSHWRNAGSIFGTNENNVYQEWSGSAIVNDNGTIQLFYTSN
DTSDYKLNDQRLATATLNLDVDDNGVAIKSVDNYHILFEGDGFHYQTYDQFANGKDRKNDDYCLRDPHVVQSENGDRYLV
FEANTGMEDYQSDDQIYNWANYGGDDAFNIKSFFKLLNNKNDRELASLANGAIGILKLNNDQTNPKVEEVYSPLVSTLMA
SDEVERVNVVKLGDKYYLFSATRVSRGSDRELNAKDITIVGDNVAMIGYVSDNLMGKYKPLNNSGVVLTASVPANWRTAT
YSYYAVPVEGHPDQVLITSYMSNKDFASGEGNYATLAPSFIVQINPDDTTTVLARATNQGDWVWDDSSRNDNMLGVLKEG
AVNSAALPGEWGKPVDWSLINRSSGLGLKPHQPVNPSQPTTPATPVNPSQPTTPATPVNPSQPTTPATPVNPSATTTPAT
PVNPSATTTPAKPVNPSQPTTPAKPVQAGQATATNFVDQRLPQTGENNSQSQTMSFIGILLAMFGSLLGFLGIKKRRND
>D3WYW0 2.4.1.10~~~levG~~~Levansucrase~~~
MLENKKHKKMSLSGKSLLMGTLSTAAIVLSASTVNAATNTDTVDNANASQVTTVKASASVNKNDNSGLKENATNDKVAGT
ETNLNSSLNSGKETSSQVNDSKEDSSSTQVGSTPISSAIINNGKASSDLNQDSDNISDHFKDNNSQGQSSTSSEKTELKG
KIKEIVNNSGIDVTKLTNDQINNLNKVNFDNDPQDGTKLTLNDLDAIGQALIRRDPKYAVPYFNAKEIKNMDAAETKDAQ
TGKTETLEIWDSWPVQDPITGYVSNYKGYQLVIAMMGMPKKNDNHIYLLYNKYNDNEFSHWRNAGSIFGYNETPDLQEWS
GSAIVNKDGSVQLFYTKNDTSNGKLNDQQLATANLKLNVDNNGVSIASVDNDHVIFIGDGKHYQTYDQFSNGKNRNRDNY
TLRDPHVVEEENGDRYLVFEANTGSNNYQGEDQVYRWANYGGNDKFNVNNFLSYFGNNDDQALASVANGALGILKLSGDQ
NNPTVKLDDVYSPLVTSLMVSDEMERPDIVKVGNKYYLFSATRLSRGTKGEITRLANKVVGDNVAMIGFVSDSLTHGYVP
LNGSGVVLTASVPANWRTATYSYYAVPIEGKENQLLITAYMTNRGEVAGKGNNSTWAPSFILQLNPDNTTTVLAKLTNQG
VWVWNGDSENKNMIGSLEKDSPNSAALDGEWGKFIDWDAINSYSLKPHQPVTPNVPTTPEKPENPTTPNTPDTPRTPEVP
TTPVKKTTQSELPKAGAKDGIAATILGAISSMLGVIGLAGISKRKRNN
>P31080 3.4.21.88~~~lexA~~~LexA repressor~~~COG1974
MTKLSKRQLDILRFIKAEVKSKGYPPSVREIGEAVGLASSSTVHGHLARLETKGLIRRDPTKPRAIEILDEEVDIPQSQV
VNVPVIGKVTAGSPITAVENIEEYFPLPDRMVPPDEHVFMLEIMGDSMIDAGILDKDYVIVKQQNTANNGEIVVAMTEDD
EATVKRFYKEDTHIRLQPENPTMEPIILQNVSILGKVIGVFRTVH
>Q9ZFA4 3.4.21.88~~~lexA~~~LexA repressor~~~COG1974
MLTRKQMELLDFIKTRMDRDGVPPSFDEMKDALDLRSKSGIHRLITALEERGFIRRLAHRARAIEIVKLPEAMERAGFSA
RAAKAAAAPLPKGAVTVETAGALDLPLMGRIAAGLPIEAINGGPQSVTVPGMMLSGRGQHYALEVKGDSMIAAGINDGDI
VVIREQQTADNGDIVVALVADHEATLKRYRRRGGMIALEPANDSYETQVYPEQMVKVQGRLVGLIRSY
>P0A7C2 3.4.21.88~~~lexA~~~LexA repressor~~~COG1974
MKALTARQQEVFDLIRDHISQTGMPPTRAEIAQRLGFRSPNAAEEHLKALARKGVIEIVSGASRGIRLLQEEEEGLPLVG
RVAAGEPLLAQQHIEGHYQVDPSLFKPNADFLLRVSGMSMKDIGIMDGDLLAVHKTQDVRNGQVVVARIDDEVTVKRLKK
QGNKVELLPENSEFKPIVVDLRQQSFTIEGLAVGVIRNGDWL
>P0DN68 3.4.21.88~~~lexA~~~LexA repressor~~~COG1974
MENTNEKRKEMTARQEEIYEYIKKYSKENHMPPTVREIGNHFDISSTNGVRSILAALIKKGYINRSPRLSRGIEILSDDK
ESSKEVASNTIEIPIVGRVAAGTPILAVQNLEGTVTIDRDFLACRSDVFALRVKGDSMINAGIFDGDLIFARQQKTADLG
EIVVAQIDNEATVKYYHPSADHVELRPANPKYKPIIVNNRKDFSIAGRVIGVMRKVN
>P9WHR7 3.4.21.88~~~lexA~~~LexA repressor~~~COG1974
MNDSNDTSVAGGAAGADSRVLSADSALTERQRTILDVIRASVTSRGYPPSIREIGDAVGLTSTSSVAHQLRTLERKGYLR
RDPNRPRAVNVRGADDAALPPVTEVAGSDALPEPTFVPVLGRIAAGGPILAEEAVEDVFPLPRELVGEGTLFLLKVIGDS
MVEAAICDGDWVVVRQQNVADNGDIVAAMIDGEATVKTFKRAGGQVWLMPHNPAFDPIPGNDATVLGKVVTVIRKV
>B1ZZZ4 3.4.21.88~~~lexA~~~LexA repressor~~~COG1974
MLTEKQEAILDYIRSVQAQRGVPPSTREIQRHFGYESQNAAMNHLRALARKGQLHQVDGATWGLKVSEVQGHFELPIYGT
IPAGVPSMQEQQPKETITFDPAVFRLRRPERLWGLEVHGDSMIDAHILDGDIAVLERREAKPGDIVAALVDETTTTLKRL
AYVKGKPVLKPENARYALIVPKDRLEIQGVFVGLIGRAKR
>P73722 ~~~lexA~~~Transcription regulator LexA~~~COG1974
MEPLTRAQKELFDWLVSYIDETQHAPSIRQMMRAMNLRSPAPIQSRLERLRNKGYVDWTDGKARTLRILHQKPKGVSVIG
ELKGGELVEADAEEVEKIDFAPLMKKSSVFALRVMSNDLVDDFIVEGDMLILRSVTGEEEIEDGELVAASIKGGKIAIKR
YYQDGTKVVLKASNNKGPGQELKASDVEIQGILMGVWRNFQGV
>O33927 3.4.21.88~~~lexA~~~LexA repressor~~~COG1974
MKDLTERQRKVLLFIEEFIEKNGYPPSVREIARRFRITPRGALLHLIALEKKGYIERKNGKPRALRISKSIRNKIPLIGE
IRAGEKREAIEYLEDYIEIPESFLSSGYDHFLLKVKGESMIEEHICDGDLVLVRRQDWAQNGDIVAAMVDGEVTLKKFYQ
RGDTVELRPANREMSSMFFRAEKVKILGKVVGVFRKL
>P0DOW5 3.4.21.88~~~lexA~~~LexA repressor~~~
MLTERQQELLDFLRVYQRQQGVMPSTRDIQLHFGFASQTAAMSHLKALERKGVIRRLAGKARAVVFPEVMERETVDIPIF
GLIPAGFTADNPEHSDGNLTLDLRTMGLSPRSKPFALKVRGDSMTGAHIIQGDYVILEQRDPRPKDIVAALMDGETTLKR
YLVDNGQPFLRAENPSYPDLIPARELMIQGVMVGLFRPYNGR
>P60512 3.4.21.88~~~lexA~~~LexA repressor~~~
MDLTDTQQAILALIAERIDADGVPPSQTEIARAFGFKGIRAAQYHLEALEHAGAIRRVPGQARGIRLAGQGAQTRTAPVS
EVARDDVLRLPVLGRVAAGLPIGADIGSDDFVVLDRVFFSPSPDYLLKVQGDSMRDEGIFNGDLIGVHRTRDARSGQIVV
ARIDEEITVKLLKIGKDRIRLLPRNPDYAPIEVLPDQDFAIEGLYCGLLRPNR
>A0R5K5 ~~~lfrA~~~Multidrug efflux pump LfrA~~~COG0477
MSTCIEGTPSTTRTPTRAWVALAVLALPVLLIAIDNTVLAFALPLIAEDFRPSATTQLWIVDVYSLVLAALLVAMGSLGD
RLGRRRLLLIGGAGFAVVSALAAFAPSAELLVGARALLGVFGAMLMPSTLSLIRNIFTDASARRLAIAIWASCFTAGSAL
GPIVGGALLEHFHWGAVFLVAVPILLPLLVLGPRLVPESRDPNPGPFDPVSIVLSFTTMLPIVWAVKTAAHDGLSAAAAA
AFAVGIVSGALFVRRQNRSATPMLDIGLFKVMPFTSSILANFLSIIGLIGFIFFISQHLQLVLGLSPLTAGLVTLPGAVV
SMIAGLAVVKAAKRFAPDTLMVTGLVFVAVGFLMILLFRHNLTVAAIIASFVVLELGVGVSQTVSNDTIVASVPAAKSGA
ASAVSETAYELGAVVGTATLGTIFTAFYRSNVDVPAGLTPEQTGAAAESIGGAAAVAADLPAATATQLLDSARAAFDSGI
APTAVIAAMLVLAAAAVVGVAFRR
>A0R5K4 ~~~lfrR~~~HTH-type transcriptional repressor LfrR~~~COG1309
MTSPSIESGARERTRRAILDAAMLVLADHPTAALGDIAAAAGVGRSTVHRYYPERTDLLRALARHVHDLSNAAIERADPT
SGPVDAALRRVVESQLDLGPIVLFVYYEPSILADPELAAYFDIGDEAIVEVLNRASTERPEYPPGWARRVFWALMQAGYE
AAKDGMPRHQIVDAIMTSLTSGIITLPRT
>P0A8P1 2.3.2.6~~~aat~~~Leucyl/phenylalanyl-tRNA--protein transferase~~~COG2360
MRLVQLSRHSIAFPSPEGALREPNGLLALGGDLSPARLLMAYQRGIFPWFSPGDPILWWSPDPRAVLWPESLHISRSMKR
FHKRSPYRVTMNYAFGQVIEGCASDREEGTWITRGVVEAYHRLHELGHAHSIEVWREDELVGGMYGVAQGTLFCGESMFS
RMENASKTALLVFCEEFIGHGGKLIDCQVLNDHTASLGACEIPRRDYLNYLNQMRLGRLPNNFWVPRCLFSPQE
>A0A0H3KNE7 1.1.1.435~~~~~~L-fucose dehydrogenase~~~COG1028
MDLNLQDKVVIVTGGASGIGGAISMRLAEERAIPVVFARHAPDGAFLDALAQRQPRATYLPVELQDDAQCRDAVAQTIAT
FGRLDGLVNNAGVNDGIGLDAGRDAFVASLERNLIHYYAMAHYCVPHLKATRGAIVNISSKTAVTGQGNTSGYCASKGAQ
LALTREWAVALREHGVRVNAVIPAEVMTPLYRNWIATFEDPEAKLAEIAAKVPLGRRFTTPDEIADTAVFLLSPRASHTT
GEWLFVDGGYTHLDRALV
>F0M433 1.1.1.425~~~lgdh~~~Levoglucosan dehydrogenase~~~COG0673
MQNLNVGLIGGGFMGKAHSLAYAAMPMFFWPAPALPVRKVIAEANPELAAEAARRFGFENSTSDWRSIIDDPDIHVVDIA
TPNHLHAEIAIAAAEAGKHIICEKPLARTGEESKAMYDAVKDKNIVHMVAFNYRRTPAVALAKKYIEEGAIGRILSFRGT
YLQDWSADPNSPLSWRFQKSIAGSGALGDIATHVIDMARYLVGEFSAVNAVLSTWIPERPLQSGGADALGTVRGGEGPKG
PVDVDDEVMTMIRFANGAVGSVEATRNAHGRNNYITFEIHGTEGSIVFNYERRDELQVAFASDQADRRGFRTVYTGPAHP
YGEGLWPIPALGIGYGETKIIEAHDFFKAIAEGGSVSPSFADGYQVALIDDAIVESAAKESWVDVPQISA
>P39400 1.1.1.414~~~lgoD~~~L-galactonate-5-dehydrogenase~~~COG1063
MSTMNVLICQQPKELVWKQREIPIPGDNEALIKIKSVGICGTDIHAWGGNQPFFSYPRVLGHEICGEIVGLGKNIADLKN
GQQVAVIPYVACQQCPACKSGRTNCCEKISVIGVHQDGGFSEYLSVPVANILPADGIDPQAAALIEPFAISAHAVRRAAI
APGEQVLVVGAGPIGLGAAAIAKADGAQVVVADTSPARREHVATRLELPLLDPSAEDFDAQLRAQFGGSLAQKVIDATGN
QHAMNNTVNLIRHGGTVVFVGLFKGELQFSDPEFHKKETTMMGSRNATPEDFAKVGRLMAEGKITADMMLTHRYPFATLA
ETYERDVINNRELIKGVITF
>P39399 ~~~lgoR~~~Probable HTH-type transcriptional regulator LgoR~~~COG1802
MSRSQNLRHNVINQVIDDMARGHIPSPLPSQSALAEMYNISRTTVRHILSHLRECGVLTQVGNDYVIARKPDHDDGFACT
TASMSEQNKVFEQAFFTMINQRQLRPGETFSELQLARAAGVSPVVVREYLLKFGRYNLIHSEKRGQWSMKQFDQSYAEQL
FELREMLETHSLQHFLNLPDHDPRWLQAKTMLERHRLLRDNIGNSFRMFSQLDRDFHSLLLSAADNIFFDQSLEIISVIF
HFHYQWDESDLKQRNIIAVDEHMTILSALICRSDLDATLALRNHLNSAKQSMIRSINENTRYAH
>P39398 ~~~lgoT~~~Probable L-galactonate transporter~~~COG2271
MEKENITIDPRSSFTPSSSADIPVPPDGLVQRSTRIKRIQTTAMLLLFFAAVINYLDRSSLSVANLTIREELGLSATEIG
ALLSVFSLAYGIAQLPCGPLLDRKGPRLMLGLGMFFWSLFQAMSGMVHNFTQFVLVRIGMGIGEAPMNPCGVKVINDWFN
IKERGRPMGFFNAASTIGVAVSPPILAAMMLVMGWRGMFITIGVLGIFLAIGWYMLYRNREHVELTAVEQAYLNAGSVNA
RRDPLSFAEWRSLFRNRTMWGMMLGFSGINYTAWLYLAWLPGYLQTAYNLDLKSTGLMAAIPFLFGAAGMLVNGYVTDWL
VKGGMAPIKSRKICIIAGMFCSAAFTLIVPQATTSMTAVLLIGMALFCIHFAGTSCWGLIHVAVASRMTASVGSIQNFAS
FICASFAPIITGFIVDTTHSFRLALIICGCVTAAGALAYIFLVRQPINDPRKD
>A0A1B1PF34 1.4.3.11~~~lgoX~~~L-glutamate oxidase precursor~~~
MTETPRDNSATRARWQTCLKLARELLLVGPDDKDLKLSYLHTLIDTGRLGPTHHPRKKILVIGAGITGLVAGRLLKDAGY
DVTIIEANESRVGGRIKTFRATKHHQPFDDAAQYAEAGAMRLPDFHPLVLALVDKLGLGRRQFYNVDVGPSTGSGPEVPV
PPVTYTSFTGQTWTNGDDSPDFREPDKRGNSWIRANRVQVRRADYTASPERINEGFHLTGDEVRAPVVKMVDDALESVRD
YYSDVVDGKRVNKPFDEWVEGWARVIRDFDGYSMGGFLRDHAGLSDEAIEAVGTLENTSSRLHLSFFHSFLSRSDINPTV
RYWEIPGGSWRLPHALHEGLRDEVRLGHRMIRLEYHDPSRDADPEGTAVGPDGWGVTVETVAENDPQAPPRRWTADLAIV
TIPFSALRFVEIVPSMSYKKRRAIVETHYDSATKVLLEFSHRWWEFTEDDWREELERIAPGVYEYYRLGPEAAGEPARMP
TLAEADAGLLGAAVKDSGVTEEMRQIGSTMPLRGPALRPATHSFGGGSATDNPNRFMYYPSHRVEGSTGGVVLASYSWSD
DAARWDSMRSAERYVYALRNLQALHGRRIEVFFTGRGATKSWARDPYAFGEAAIYTAHQMTSFHLDASRPEGPVHFAGEH
TSLKHAWIEGALEVHTA
>Q8L3C7 1.4.3.11~~~lgoX~~~L-glutamate oxidase precursor~~~
MTTDTARRHTGAERANEMTYEQLARELLLVGPAPTNEDLKLRYLDVLIDNGLNPPGPPKRILIVGAGIAGLVAGDLLTRA
GHDVTILEANANRVGGRIKTFHAKKGEPSPFADPAQYAEAGAMRLPSFHPLTLALIDKLGLKRRLFFNVDIDPQTGNQDA
PVPPVFYKSFKDGKTWTNGAPSPEFKEPDKRNHTWIRTNREQVRRAQYATDPSSINEGFHLTGCETRLTVSDMVNQALEP
VRDYYSVKQDDGTRVNKPFKEWLAGWADVVRDFDGYSMGRFLREYAEFSDEAVEAIGTIENMTSRLHLAFFHSFLGRSDI
DPRATYWEIEGGSRMLPETLAKDLRDQIVMGQRMVRLEYYDPGRDGHHGELTGPGGPAVAIQTVPEGEPYAATQTWTGDL
AIVTIPFSSLRFVKVTPPFSYKKRRAVIETHYDQATKVLLEFSRRWWEFTEADWKRELDAIAPGLYDYYQQWGEDDAEAA
LALPQSVRNLPTGLLGAHPSVDESRIGEEQVEYYRNSELRGGVRPATNAYGGGSTTDNPNRFMYYPSHPVPGTQGGVVLA
AYSWSDDAARWDSFDDAERYGYALENLQSVHGRRIEVFYTGAGQTQSWLRDPYACGEAAVYTPHQMTAFHLDVVRPEGPV
YFAGEHVSLKHAWIEGAVETAVRAAIAVNEAPVGDTGVTAAAGRRGAAAATEPMREEALTS
>D6A5I3 1.4.3.11~~~lgoX~~~L-glutamate oxidase precursor~~~COG1231
MTEDHAVVRSDGGLSRRSFAAVAGTATVATALTSGVAAALPAPAASGDSRGADFDRCLAVARALLVLDSDDRPLVPRYQS
VLQKGLPAQRRTRPKNVLVIGAGPAGLVAAWLLKRAGHRVTVLEANGNRAGGRVKTFRSGGHERAEQPFADPRQYAEAGA
MRIPGSHPLVMELIDQFELKKRRFHYVDVDSEGRPANRTWIHVNGIRVRRADYARAPRRVNRSFGVPRAHWDTPAAAILR
SVLDPVRDEFSRVGRDGKRVDKPLPERLQGWARVVQRFGDWSMFRFLTEHAGLDERTIDLIGTLENLTSRLPLSFIHSFI
GSSLISPDTPFYELEGGTAVLPDALLERVRKDVRFDRRVTRIQYHHPDRPSPDVEQVRSKGPHVWVDTVSEGRDGPVVRE
QFTADVAVVTVPFSGLRHVQIAPPLSYGKRRAVCELHYDSATKVLLEFSRRWWEFDEADWKRELRAVDPGLYDAYRTGRA
AADGSLLGAHPSVPAGHITAGQRTHYAANRAVARDQPEAVDVVGGGSVSDNANRFMFHPSHPVPGSAGGVVLASYSWADD
ALRWDSLDDEARYPHALCGLQQVYGQRIEVFYTGAGRTQSWLRDPYAYGEASVLLPGQHTELLSAIPVAEGPLHFAGDHT
SVKPAWIEGAVESAVRAALEIHTA
>Q70LM7 ~~~lgrA~~~Linear gramicidin synthase subunit A~~~
MRILFLTTFMSKGNKVVRYLESLHHEVVICQEKVHAQSANLQEIDWIVSYAYGYILDKEIVSRFRGRIINLHPSLLPWNK
GRDPVFWSVWDETPKGVTIHLIDEHVDTGDILVQEEIAFADEDTLLDCYNKANQAIEELFIREWENIVHGRIAPYRQTAG
GTLHFKADRDFYKNLNMTTVRELLALKRLCAEPKRGEKPIDKTFHQLFEQQVEMTPDHVAVVDRGQSLTYKQLNERANQL
AHHLRGKGVKPDDQVAIMLDKSLDMIVSILAVMKAGGAYVPIDPDYPGERIAYMLADSSAAILLTNALHEEKANGACDII
DVHDPDSYSENTNNLPHVNRPDDLVYVMYTSGSTGLAKGVMIEHHNLVNFCEWYRPYFGVTPADKALVYSSFSFDGSALD
IFTHLLAGAALHIVPSERKYDLDALNDYCNQEGITISYLPTGAAEQFMQMDNQSFRVVITGGDVLKKIERNGTYKLYNGY
GPTECTIMVTMFEVDKPYANIPIGKPIDRTRILILDEALALQPIGVAGELFIVGEGLGRGYLNRPELTAEKFIVHPQTGE
RMYRTGDRARFLPDGNIEFLGRLDNLVKIRGYRIEPGEIEPFLMNHPLIELTTVLAKEQADGRKYLVGYYVAPEEIPHGE
LREWLGNDLPDYMIPTYFVHMKAFPLTANGKVDRRALPDVQADAELLGEDYVAPTDELEQQLAQVWSHVLGIPQMGIDDH
FLERGGDSIKVMQLIHQLKNIGLSLRYDQLFTHPTIRQLKRLLTEQKQVSLEPLRELDEQAEYETSAVEKRMYIIQQQDV
ESIAYNVVYTINFPLTVDTEQIRVALEQLVLRHEGLRSTYHMRGDEIVKRIVPRAELSFVRQTGEEESVQSLLAEQIKPF
DLAKAPLLRAGVIETADKKVLWFDSHHILLDGLSKSILARELQALLGQQVLSPVEKTYKSFARWQNEWFASDEYEQQIAY
WKTLLQGELPAVQLPTKKRPPQLTFDGAIQMYRVNPEITRKLKATAAKHDLTLYMLMLTIVSIWLSKMNSDSNQVILGTV
TDGRQHPDTRELLGMFVNTLPLLLSIDHEESFLHNLQQVKAKLLPALQNQYVPFDKILEAARVKREGNRHPLFDVMFMMQ
GAPETELESNMHHINAGISKFDLTLEVLERENGLNIVFEYNTHLFDEGMILRMVAQFEHLLLQAVHGLDQQVKRFELVTE
DEKRDLFLRVNDTAKAYPNKLIMSMLEDWAAATPDKTALVFREQRVTYRELNERVNQLAHTLREKGVQPDDLVMLMAERS
VEMMVAIFAVLKAGGAYLPIDPHSPAERIAYIFADSGAKLVLAQSPFVEKASMAEVVLDLNSASSYAADTSNPPLVNQPG
DLVYVMYTSGSTGKPKGVMIEHGALLNVLHGMQDEYPLLQDDAFLLKTTYIFDISVAEIFGWVPGRGKLVILEPEAEKNP
KAIWQAVVGAGITHINFVPSMLIPFVEYLEGRTEANRLRYILACGEAMPDELVPKVYEVLPEVKLENIYGPTEATIYASR
YSLAKGSQESPVPIGKPLPNYRMYIINRHGQLQPIGVPGELCIAGASLARGYLNNPALTEEKFTPHPLEKGERIYRTGDL
ARYREDGNIEYLGRMDHQVKIRGYRIELDEIRSKLIQEETIQDAVVVARNDQNGQAYLCAYLLSEQEWTVGQLRELLRRE
LPEYMIPAHFVLLKQFPLTANGKLDRKALPEPDGSVKAEAEYAAPRTELEATLAHIWGEVLGIERIGIRDNFFELGGDSI
KGLQIASRLQRINWTMVINHLFLYPTIEQIAPFVTSEQVVIEQGLVEGLVKLTPIQRDFFERITADRHHWNQARMLFCRD
GLEREWVVETLNALVLQHDALRMRFRETEQGIVQFHQGNEGKLFGFHVFDCTEELDIAKKVEEQANVLQSGMNLQEGPLV
QAALFMTRTGDHLLLAIHQLVVDEASWRIILEDFQTAYKQKAAGEPIALPNKTHSYQSWAEELHNAANSKKLTSELGYWR
KIASSPTRPLPQDQEPLSRTEQSTATAAIRFAKAETANLLHEANHAYQTEAQELLLAALGMALRDWTRADDVTVFLEKDG
RESAAKGLDVSRTVGWFHSLFPVVLSAARSGDPGEQIKQVKEMLRAIPHQGSGYSILKQLTDLRHKHPDDFTLQPKIVVH
AWEQLDAGLETDWLTLSHLPQGSVRGANAERMQQLDVFSKISNGELTIHIQYHRDEYRKATIDKLLELYQAHLNALLAHC
LQKTETELTPSDFVDKNLSRSELDDIMDLISDL
>Q70LM6 ~~~lgrB~~~Linear gramicidin synthase subunit B~~~
MSRKKVDNIYPLTPMQEGMLFHSLLDEGSESYFEQMRFTIKGLIDPAILEQSLNALIERHDILRTVFLLEKVQKPRQIVL
RERKTKVQVLDITHLSEGEQAAYLEDFAQKDRQASFDLAKDVLIRLTLVRTSADTHTLFWSHHHILLDGWCIPIVLNDFF
QIYQQRKGGLPVELGPVYPYSTYISWLGEQDAEEAKASWAEYISGYEPTSFIHKQGGKNSYRQAELVFAIEQGLTDSLNK
LAKQLHVTLNNLFRAIWGLMLQRQCNTEDVVFGSVVSGRPSHLPNVEQMVGLFINTVPIRVQAGAEQTFSELVKQVQQEA
LSLAKYHYLSLADIQGNQQLIDHILLFQNYPMGQQFLTRLNQYNEEFTLTHLSAFEQTNYDLNVMVTPSDVITIKYIYNA
AVFSEEQLLHISRQLTTIMTQVTNAPDILLQKLEVVDPAEKQLQLHSFNDTYRHYPTDKLIHQIFEERAEREPERIALVM
GEQVLTYRELNEKANQLAKLLRARGIGPESMVSLLTERSAEMMIAILAIFKAGGAYLPIDPSHPKERIEYILQDSRSELL
LVNHRFLGAVDFADRIIDLEAAEIYQGAADNLECVSHANHLAYVIYTSGSTGKPKGVMIEHASLLNIIFALQELYPLLEN
DAYLLKTTYTFDVSVAEIFGWILGSGRLVILDPGAEKEPAHIWETMVNHGVTHVNFVPSMLIPFVDYVRDQQQESPLRYI
FAAGEAMPSELVGKVYEALPGVILENIYGPTESTIYATKYSLAKDSQDVLVPIGKPLANIQTHIVNKHGQLQPVGVPGEL
CIAGASLARGYWNNEALTNEKFVPHPFAAGQRMYRTGDLARYRQDGNIEYLGRIDHQVKIRGYRIELDEIRAQLIQEASI
RDAVVIARTDHNGQAYLCAYFIADKQWTVNALREALRQTLPDYMVPSHFIQMEEFPLTSSGKIDRKALPLPDGRVHTGNV
YLAPRNPVEELVVRIWEEVLNVSQVGVHDNFFELGGHSLLATQVLSRTAKLFHVRLPMREIFTHQTVAELARRIQALRHG
AEADKHSPIQPSALQRADELPLSYAQQRLWFLDRLIPDSAMYNIPVGFRLRGTVDELVLERALNEIIQRHESLRTTFVDV
DGRALQVIHTDVHLSLGVTDLRDKPAAAKDAEWKQMAEEDAATPFRLDQWPLLRAMLIRLEEQESVLWLNVHHIISDGWS
MDVLVNELSEVYETLLKGEALPLAALPIQYRDYAVWQREKSQDDVWKEQLRYWKNKLDGSEPLLPLPTDRPRAVVQSYRG
DHLSFYVPGEVGQKLRELGRQEGATLFMTLLAAFKSFLYRYTHANDILIGTPVAGRNRQEIENLIGFFVNMLVLRTDLSD
DPTFVELLRRVRETAFDAFANEDVPFEKLVDELQIERSLSYSPLFQVLFAVQGMSTGVREGETLAIAPDEVTLNQTTKFD
LTLTMIEAADNGLKGVFEYSTDLFDRTTIERMAEHFGNLLQAIAADPGQKIVELPLLGGAEQSRMLVEWNQTDVAYSLDL
LVHERVARIAQELPEQFAVIGEQGALTYAQLDAKANQLAHALLKRGIGSEDLVGICVERSSEMQIGQLAILKAGAAYVPM
DPAYPRERLAFMIKDAGMSLVLTQERLLDALPQEAAALLCLDRDWQEIAAESTAAPAIKTNADQLAYVIYTSGSTGTPKG
VEIEHGSLLNLVNWHQRAYSVSAEDRASQIAGTAFDASVWETWPYLTAGATICQPREEIRLSPEKLRDWLVETGITISFL
PTPLAENLLPLPWPTGAALRYMLTGGDTLHQYPTADVPFTLVNQYGPTENTVVATAGAVPVLGERESAPTIGRPIDNVSV
YVLDENRQPVPVGVVGELYIGGKSLARGYRNRPDLTEASFVPNPFSPIEGARMYRTGDLVRYAADGSIEFIGRADDQVSI
RGFRVELGEIESALYAHPAVAESVVIVREDVTPGVKRLVAYAVLHEGEERQTSELRQSLKEMLPDYMVPSAIVLMEALPL
TPNGKVDRRALPLPDVAQTEWEGSFVEPQSDVERKLAEIWQEVLGVETIGVHDNFFELGGDSILTIQIVSRANQAGLQLT
PKHLFDAQTLAELAASAVVLEKAPEMQAEQGIVTGELPLTPIQTWFFEQDVRHVHHWNQSVMLAVREELDMTALTQAFAA
LPRQHDALRLRFQQVNGTWQAAHGEIADEDVLLVADLSSVPEAEREARMRHITDELQASLDIEKGPLHRAAYFQLGAEQR
LFIVIHHLVVDGVSWRIILEDLQTAYEQVKAGQKIAWPQKTTSFKSWAEELTTYAEQSAVDEYWTGMDSEQACGLPVDHP
QGKNTEGLAVQVKAKLSADETRALLQEVPAAYRTQINDVLLSALTRTITDWTNKRALYVSVEGHGREPIVDGVDVSRTVG
WFTSLYPVLLETEPDLAWGDLLKSIKEQVRAIPDKGIGYGIHRYLSRDGQTAEMLRAKPQPEISFNYLGQFGQGQTTDAA
LFQIIPNWSASNVSEDETRLYKLDVMSMVAQDQLEMSWTFSRDLYEPGTIEKLAHDYVQALRAIIAHCRTEQAGGYTPSD
FPLAELDQNSLDKFIGHNRLIENVYTLTPLQEGMLFHSLYEQAGGDYVVQLALKLEHVNVEAFSAAWQKVVERHAILRTS
FLWSGLEKPHQVVHAKVKTFVERLDWRHLTAAEQEAGLQTYLEQDRKRGFDLARPPLMRWTLIRLDASTFQFVWSFHHML
LDGWSTPIVFQDWQAFYAAASHGKEASLPAIPPFSAYIAWLKRQNLEEAQQYWRDYLQGFGVPTPLGMGKSGGSAGQPKE
YADHKLLLSERATANLLAFARKHQLTLNTVVQGAWALILARYAGEAEVVFGTTNLGRPTDLPDAEAMVGLFINTLPVRVL
FPEQTTVIDWLQSLQQAQSEMRQYEFTPLVDIQSWSEVPRGQSLFDSIFVFENYLSGTSVDSESGMLLGEVKAVEQTSYP
LTLVVAPGEELMLKLIYETGRFEQPAMDKVLAQLSSVLEAIMREPHEQLADLSIITEAERHKLLVEWNATDMPYERNLVM
HQLFEAQVEATPDAQALVVGTERLTYAELNKRANQLAHYLRAQGVGPEVLVAVLMERTTEMIVALLGIIKAGGAYVPIDP
AYPQDRIGYTLDDSQAAIVLTQERLLPMLPEHTAQVICLDRDWACMAVQPEANVPNLAAPTNLSYVIYTSGSTGLPKGVA
IQHSSVIAFIFWAKTVFSAEEMSGVLASTSICFDLSVYEIFVTLSCGGKVILADNALHLPSLPAAKEVTLINTVPSAAKE
LVRMNAIPPSVRVVNLAGEPLPNTLAQSLYALGHVQKVFNLYGPSEDTTYSTYVQVTKGAKTEPTIGRPLANTQAYVLDA
KLQPVPLGLPGELYLGGDGLARGYLKRPKMTAERFLPNPFHPDPDARMYSTGDLVRYLPDGQLEYLGRIDHQVKIRGYRI
ELGELEAVLRSHPQIKEAVVVAKEDKLGEKRLVAYITTKDGECGDRAVLTSWAKAKLPEFMVPSFFVWLDAMPLTPNGKI
DRKQLPEPEWGQVASAAGYVAPRNQTEVLVASIWADVLGIEQVGVHDNFFELGGHSLLATRVASRLRETFAKEVPIRAIF
ERPTVAELSETLGAIGQNETEAQMLPVSREAHLPLSFAQQRLWFLDRLMPDSTLYNIPSAVRLLGDLDIAAWEKSLQVLI
QRHESLRTTFGDVDGEAVQVIHSRLDGKLNVIDLRGMPADEREAEAHRLAGLEAATPFDLSQGPLLRTTLIRLAEQECVF
LFNLHHIIFDGWSIGIFLKEMRALYEAFVREEAPELAEITVQYADYAVWQRKWLEGEVLAEQLAYWKEKLSGAEPLLALP
TDQPRPAVQTHDGAMHTIKLSGELYAKLNKLSQEEGATLFMTLLAAFQVLLYRYSGQEDILVGSPVAGRNRQETEPLIGF
FINTLVLRTDLSGEPTFRELLARVRETAFEAYAHQDLPFEKLVDELELERSLSYSPLFQVMFVLQNFQLNLDEKAGIRVA
DFEMDKHLVTSKYDLTLTMAEKQNGLFATFEYNTALFHEATMERLSQHFIQLLEAIVHMPDQGIARLPLLNQSERAQLLV
EWNDTTTAYPRNKRVDQLFRETALLYPERLAVVAGNQTLTYAELERRANQTANYLQQKGVRPGALVGLCVKRSLEMLIGM
LGILKAGGAYVPLDPDYPEERLAYMMGDAGITVLLTQEQLMPGLPSGERTTIALDRDWPLIAKESEQAPDVDTTAESLAY
VIYTSGSTGLPKGTLVVHRGIVRLVKETDYVTITEQDVFLQASTVSFDAATFEIWGSLLNGAKLVLLPPELPSLAEIGQA
IQSHHVTTLWLTAGLFTLMVDHHKEYLSGVRQLLVGGDIVSVPHVKKALEIAGLTVINGYGPTENTTFTCCNPVTVMPES
AHTFPIGRPIKNTTAYVLDRHMQPVPIGVTGELYIGGDGLAEGYLNRPDLTAERFVPNPFATDQAARLYRTGDLVRYLPD
GLIEFIGRLDNQVKIRGFRIELSEVEAVLAKHPAITASVVIVHENEAGMKQLVAYAVKDAEQELGTAELRQHFKAHVPDY
MVPAAFVMLDALPLTPNGKVDRKALPAPVLERSREEDAFAAATSHVEQTLADIWCAVLRMDRIGIHDNFFELGGDSILSI
QIVARANKAGIHLTPKQLFDQQTIAELAKVAGQSTKVDAEQGNVTGEVPLLPIQTWFFEQKQPTPHHWNQSMLLQVNEPL
EEECLSQAVAQLLAHHDALRLRYTFADGQWKQTYADVDSEVPLQVEDLSMSPPAQQARKIEKLAQQAQASLDLQNGPLLK
VVYFDLGYDRPGRLLMVIHHLAVDGVSWRILIEDLQTAYGQAEKGNKIQLPPKTTSYKAWAEKLHKYASSERMLVDQDYW
LKAADELSGHPLPVHDWAENTEANGRMWTIHLEEEETDALLQKVPSRYRVQINDILLTALALAYGKWTGESALLVNLEGH
GREELFEDVDLSRTVGWFTSMYPLLIQLEPNTSSEDALARVKEKLQQIPHKGLGYGLLRYMAQDPELVEKLKAIPQAPLS
FNYLGQFHQAADAKALLAYAEGERGANSGPDNRRTHLIDVVGAVTEGKLGLSFLYNGRLYSESHIETFARHYTDALQSLI
QAEKQSYRAEDFEDADLSQSALNKVLARLKNRKGNELHGGSH
>Q70LM4 ~~~lgrD~~~Linear gramicidin synthase subunit D~~~
MNNIETYYPVTPLQQGLIFHSLLEPESGAYIVQMGLKLQGPLNIPLFEQAWQCLVDRHAIFRTRFVGGKVKEYVQVVLKD
LKISLVEHDLIHLSSSEQEAFLHHFAKEDRKRGFDIEQAPLMRLNVFHLNSETVHFLWTLHHVLIDGWSMPLVFGEVFAA
YEMLSKGQPLSLPPVRAYRDYIVWLKKQDLQQAEAFWRTYMQGFTEATPLSFGRAYKNPYLDQKQYRELDLTVSEQTSKA
LQTLARQHRLTVNTIVQGAWALLLNRYSGQDDIVFGATVSGRPADLPGVETMIGLFINTLPVRVQVNAEESVINWLKTLQ
QQQADFRQYEYTPLVEIQGWSDVPRGQSLFESILVFENMPVGKSGGGESAISIVDVYSEEQTNYPFTLVAASGKTIDIKV
KFDESQFELAAIERVVDQLHSLLSSIAKNAKQRIGDLSLISESERQQVLVEWNQTAEDYPSGLCIHQAFEQQAEKTPDAV
AVAYKNRELTYAQLNERANQLAHRLIRKGVKPDTLVGICLERSPEMIIGILGVMKAGAAYVPIDPAHPQERIAYMVADSQ
ASALLTQQSLLEILPVTAAHVICLDSDLLADEPVDNASSEVTEQNLAYVIYTSGSTGLPKGVMIEHHSAINLAYALIDAF
DIQPTSRVLQFTSFSFDVSVSEVVMALLAGATLVIEDRESLLPGPELIQVLQEQRITTVSMVSSVLAALPDADLPDLHTL
IVGGEAPSRELVARYAPGRQFFNCYGPTEATVCSTMMLCQAGMNNPPIGRPIANATVYVLDANLNPVPVGVPGELYIGGK
GLARGYWNRPELTAESFIPHPFGTAGERLYRTGDLVRYRQDGNLEFLGRIDHQVKIRGYRIELGEIENAIRQHPAVQEAV
VIAREEKAGDKRLAAYLVAAGKAQPPAEEIALFLKETLPEYMVPAGVVWLDAIPLTVNGKVDRRALPVPDWGQLSTKREY
VAPRTPTEEMVANIWSQVLSVERVGSFDDFFELGGHSLLATQTVSRLKEAFGVDLPLRVLFECSTVNKLSEWIAAAGEDK
SGLSRIPLVPVSRDRHLPLSFAQQRLWFFDRLMPNSALYNIPTAVRLQGELDMDALEQSLQTIIQRHESLRTTFTDHNGE
AVSVIHPEIDWKLERIDLRERSEEMRNEAGLRLAKEEANRPFDLVTGPLMRATIIQTDERDFIFLLNVHHIIADGWSAGI
LIRELFHCYQAFAKAEAPQLAELPIQYADYAYWQREWLTSDVLDEQLSYWRAKLGGAEPLLALPTDRPRPAVQSYAGSSI
SLLFDDELRANLLALSKREGTTLFMTLLAAFQVFLYRYTGQDDILVGTPEAGRSRQETEGLIGFFINTLVMRTDLSGEPS
FKEVLARVRETALGAYAHQDLPFEKLVDELNVERSLSYSPLFQVMFVLQNIPVQADALDGIRILPLEGSQQVETTKFDLT
LTMAEAANGLAATFEYNTALFERNTVERMIGHFSSLLKAVAANANQAITALPLMSEVEEQQLVLEWNDTAVAYSTEQLVH
ELVAQVARDMPDQPAVVTRDQLLTYGQLEAKANQLAHYLQKQGVGRGSLVGICVERSVEMVIGQLAIMKAGAAYIPMDPA
YPKERLAFMMHDASMAIVLTQAKLRQKLPADTSRLICLDADWETIAQEPTAALVNTTAASDLAYVIYTSGSTGTPKGVEI
EHAALLNLIFWHQRAYDVTATDRASQIAGTAFDASVWEIWPYVTKGATLYLPEEEIRLVPEKLRDWLVASNITVSFLPTP
LTESMLALEWPGDTALRYMLTGGDKLHHYPSEKIPFTLVNQYGPTENTVVATAGIVPKEAGQTAAPTIGRPIDNVQVYIL
DAHRQPVPVGVSGELYIGGSSLARGYLNRPDLTQERFVAHPFTEKAGARLYRTGDLVRSLPDGSIEFIGRADDQTSIRGF
RVELGEVETAIVALPAVKEAVVTVCTDKQGTKRLAAYLVLEEGAALATGDIRKALKETLPDYMVPAFFTQLAYLPLTPNG
KVDRKNLPAPDFQRPELEGEFVSPSTEKERRLAAIWKDVLGIEQIGIHDNFFELGGDSILSIQIVSRANQAGLSLAPKQL
FEYQTIAELAEIVEEKAAVQAEQGAVTGELPLLPIQKWFFRLPLANRDHWNQSVLLSIQAGIDPAALKQAVGQLMFQHDA
FRMRYTQSESGWLQAMDAPSETIPFRVEDLSQLAPEEQSSAIEAIANETQTQLSLRAGQVVQTIYFHLGKEVPGRLLIVA
HHLVVDGVSWRIILEDLQHAYQQIAAGQEVKLPAKTTSYKEWAQELERYAHSEAFKHEKSYWLSKSSVHSTELPADMPDS
AENTEATVKSVHFSLTVEETKALLQQVPQAYRTQINDVLLAALAKALGQWTGKRSVFVNVEGHGREELAEHLDLSRTVGW
FTSMYPVHLQWDETFSVRRALLTTKEELRAIPNKGLGYGVLRYLHAEQEIVDAISRIQADVLFNYMGKIDQIVGSDSLFG
SAPESSGANLCPSAQRHHLLDVNSVVAGEQLHVTWRYSEKLQRESTIAAVAESFMAALREIVAHCTLPEAGGYSPSDFPL
AVLEQKQIDKHIGFDRQIEDVYTLSPLQQGMLFHSLYNQDSGDYVVQFAVTFQNLDVSVLEKAWQNVLDRHSILRTHFVW
EGLSEPHQVVRKDVKVTLTKEDWRHLQADVQDEMLAAFLEEDRRRSFDIAQAPLSRWVVFQTKDEEYRFVWSFHHVLLDG
WSVPIVLNELLAHYAAISEGREGKLVPSQPFSQYIAWLKRQDREKAKPFWTDQLKGFHEPTSLGMGKNVAASQQKQYKEQ
SVLLSEEATEHLQSFTREHQLTLNTLVQGAWGWILGSYSGEEEVLFGATGSGRPADLPGVETMVGSFINTLPVRVPLQTD
ATLLAWLKDLQRRQLEIREYEYTPLFDIQGWSELPRGSALFESILVFENYPTVQAAKKGEDEAASATSGVSLEIHDVAAV
EQTNYPLTLVAAPGKQVAFKLKYDQDRFDDAMIERVLNQMTRLMVYMSKSPELRLNDVALMDEDERKQVLIDWNRTEKEY
PRELCLHHAFEQQAAKTPENIALEYKEQSLSYAGLNERANQLAHLLIAQGVKPDTTVAICVERSMEMIIGILGVLKAGAA
YVPIDPAHPEERIAYMLDDSQAVVVLTQAGLADKFTQAAAPVICLGEKLFADRAHVDVDNIQTDVASTNLAYVIYTSGTT
GLPKGVAVEHRSAMNMVQAYIAYFGLDESSRVLQFTSFSFDVSVSEIWQALLSGGTLVIEDRESLLPGPDLVRTLRERRI
SKVSMASSLLASLPVAEYPDLAVLEVGGDACSRELVARYATGRKFFNCYGPTEATVGTVIKQLTLDDDTPTIGRPFPNTK
LYVLDQNRKPVPVGVPGELYIGGECLARGYWNRPELTAERFVANPFGQPGERLYRTGDLVRYLPDGNVDYLGRFDDQVKI
RGYRIELGEIAEALRQHAAIREAVVLAREVRPGDKRLAAYLTSAAEQELSVDEIKQWLKEKLPDYMVPASYTWLPAIPLN
VNGKVDRKALPAPDWGQITAAYVAPRNPLEEMIANVFAEVLAVEKVGIDDNFFELGGHSLLATQTVSRLREIVGVELQLR
TLFEHPTVAGLGEQLELLTKQSSRKLAPPIGKVSRKEPLPLSFTQQRLWFLEQFTQNSSINNIPSFLRIQGELDVAAWEA
SFSAIILRHESLRTSFEVRDGRPVQVIQPHGDWAMTRIDLRALEPAEREAEIKRLAEQAIVQPFDLTKGLLLRASLVQLD
ANDFVFLFVMHHIASDGWSMGILLSELMTNYKAFRQGEASPLGELPIQYADFAVWQREWLSGEVLAEQLGYWREKLKGSE
PLLQLPTDRPRPPVQTYEGEKMSVQFGAELLKQLQSLSRKEGATLFMTLFAAFQTLLYRYTNQDDILVGTPIAGRNKQET
EQLIGYFINTLVLRTDMSGHPSFRELLARVRETALEAYAHQDVPFEKLLDELQLERSMSYSPLFQVMFILQNIPVQAEPA
GDIQLSSFDLELGAVTSKFDMTVTMVETPDGLLATLEYNKALFDSSTITRMVEHFHKLMEEIVANPDQSITLLPLMREEE
EQLLITEWNRTEVPYSREKCVHEMIEEMVSKAPDSIALIVGEQRVTYGELNRQANQLAHYLRKQGVGPEVLVGICAERTV
EMMIGLLAILKAGGAYVPIDPAYPAERIAYIIGHSQIPVLLTQEHLLPTLPEHQAKVICLDRDWATVAVESEENPGKLAT
SDNLIYVIYTSGSTGNPKGVALEHRSVIYFLSWAHDTYTPEEMSGVLFSTSICFDLSVYEMFATLTMGGKVIMAENALQL
PALPAADQVTLVNTVPSAATELVRMKGIPASVRVINLCGEPLSNRLAQELYAFPHVEKVFNLYGPTEDTVYSTHAIVTKG
ATNEPLIGRPQFNTHVFVLDSHRKPVPVGVPGELYLSGSGLARGYLHRPDLTAERFVQNPFREPGARMYRTGDLVRYLPD
GNLQFVGRVDYQVKIRGYRIELGEIESVLNRFPGVKEVVLLAREDREGDKCLVAYIVFEADCTSKIHDLNHFLADKLPAY
MIPQHYMILDSLPKTPNGKLDRKALPKPEYDRSEAGVEYVAPQTPVEIMLHAHWAAVLEMETIGVHDNFFEIGGHSLLAT
QLIFKVREELQLEVPLRILFETPTIAGMAKTIEEIIKHGLTSVSQEIDAKGLQDEVALDPAILAEQPYEGDPSQFQAALL
TGATGFLGAFLLRDLLQMTDADIYCLVRASGEEEGLARLRKTLQLYELWDEAQAHRIIPVIGDLAQPRLGLSAGQFDALA
ATVDVIYHNGALVNFVYPYAALKKANVIGTEEIIRLAAAKKTKPVHFVSTIFTFASEEGEESVAVREEDMPENSRILTSG
YTQSKWVAEHIVNLARQRGIPTAIYRCGRMTGDSETGACQKDDLMWRIAAGIIDLGKAPDMSGDLDMMPVDFASKGIVHL
SMTEHSVNSNFHLLNPNATDYDDLIAAIENKGFELERVTMDEWIEAVQEDAKDKGMDANSAAPLGNLFSDGHSSRGSVVY
VGNKTTRLLRQADIECPEIDEEVFAKVLDYFARTGQLRVTQNTRN
>Q70LM8 1.1.-.-~~~lgrE~~~Linear gramicidin dehydrogenase LgrE~~~
MQKTHVSPSRWLLSPKMTAEAEVLLFSFHYAGGHAGIYREWQKKLPVQIGVCPVQLPGRSNRFMEPYYTDLSVMIRELAE
ALLPHLNRPFAFFGHSMGALVSFELARYLRNQYGIKPRHMFASGRHAPHLPDPGEAIHHLPDAEFLKGLRTLNGTPKELF
ENEENEEILQMLLPMLRADFTICEQYQYQEEEPLGCGLTAIGGWQDPDITVAHMEAWRKHTSASFQMHMLQGDHFFLHSE
QEQLLAIIESTLQSYLVGYRGIG
>O34752 2.5.1.145~~~lgt~~~Phosphatidylglycerol--prolipoprotein diacylglyceryl transferase~~~COG0682
MNEAIEPLNPIAFQLGPLAVHWYGIIIGLGALLGLWIAMRESEKRGLQKDTFIDLVLFAIPIAIICARIYYVAFEWDYYA
AHPGEIIKIWKGGIAIHGGLIGAILTGYVFSRVKNLSFWKLADIAAPSILLGQAIGRWGNFMNQEAHGEAVSRAFLENLH
LPEFIINQMYINGQYYHPTFLYESLWSFVGVIVLLLLRRANLRRGEMFLIYIIWYSIGRYFIEGMRTDSLMLTDSLRIAQ
VISIVLIVLAVAAIIFRRVKGYSKERYAE
>P60955 2.5.1.145~~~lgt~~~Phosphatidylglycerol--prolipoprotein diacylglyceryl transferase~~~COG0682
MTSSYLHFPEFDPVIFSIGPVALHWYGLMYLVGFIFAMWLATRRANRPGSGWTKNEVENLLYAGFLGVFLGGRIGYVLFY
NFPQFMADPLYLFRVWDGGMSFHGGLIGVIVVMIIFARRTKRSFFQVSDFIAPLIPFGLGAGRLGNFINGELWGRVDPNF
PFAMLFPGSRTEDILLLQTNPQWQSIFDTYGVLPRHPSQLYELLLEGVVLFIILNLYIRKPRPMGAVSGLFLIGYGAFRI
IVEFFRQPDAQFTGAWVQYISMGQILSIPMIVAGVIMMVWAYRRSPQQHVS
>P9WK93 2.5.1.145~~~lgt~~~Phosphatidylglycerol--prolipoprotein diacylglyceryl transferase~~~COG0682
MRMLPSYIPSPPRGVWYLGPLPVRAYAVCVITGIIVALLIGDRRLTARGGERGMTYDIALWAVPFGLIGGRLYHLATDWR
TYFGDGGAGLAAALRIWDGGLGIWGAVTLGVMGAWIGCRRCGIPLPVLLDAVAPGVVLAQAIGRLGNYFNQELYGRETTM
PWGLEIFYRRDPSGFDVPNSLDGVSTGQVAFVVQPTFLYELIWNVLVFVALIYIDRRFIIGHGRLFGFYVAFYCAGRFCV
ELLRDDPATLIAGIRINSFTSTFVFIGAVVYIILAPKGREAPGALRGSEYVVDEALEREPAELAAAAVASAASAVGPVGP
GEPNQPDDVAEAVKAEVAEVTDEVAAESVVQVADRDGESTPAVEETSEADIEREQPGDLAGQAPAAHQVDAEAASAAPEE
PAALASEAHDETEPEVPEKAAPIPDPAKPDELAVAGPGDDPAEPDGIRRQDDFSSRRRRWWRLRRRRQ
>P60959 2.5.1.145~~~lgt~~~Phosphatidylglycerol--prolipoprotein diacylglyceryl transferase~~~
MTSSYLHFPDFDPVIFSIGPVALHWYGLMYLVGFVFAMWLAVRRANRPGSGWTKNEVENLLYAGFLGVFLGGRIGYVLFY
NFPLFLDNPLYLFRVWDGGMSFHGGLIGVILVMIIFARRTKRSFFQVSDFIAPLIPFGLGAGRLGNFINGELWGRVDPDF
RFAMLFPGSRAEDIALLPSHPQWQPIFDTYGVLPRHPSQLYELALEGVVLFIILNLFIRKPRPMGAVSGLFLIGYGAFRI
IVEFFRQPDAQFTGAWVQYISMGQILSIPMIIAGAIMMVWAYRRRPQQHVS
>P60962 2.5.1.145~~~lgt~~~Phosphatidylglycerol--prolipoprotein diacylglyceryl transferase~~~
MGIVFNYIDPVAFNLGPLSVRWYGIIIAVGILLGYFVAQRALVKAGLHKDTLVDIIFYSALFGFIAARIYFVIFQWPYYA
ENPSEIIKIWHGGIAIHGGLIGGFIAGVIVCKVKNLNPFQIGDIVAPSIILAQGIGRWGNFMNHEAHGGPVSRAFLEQLH
LPNFIIENMYINGQYYHPTFLYESIWDVAGFIILVNIRKHLKLGETFFLYLTWYSIGRFFIEGLRTDSLMLTSNIRVAQL
VSILLILISISLIVYRRIKYNPPLYSKVGALPWPTKKVK
>P0AC81 4.4.1.5~~~gloA~~~Lactoylglutathione lyase~~~COG0346
MRLLHTMLRVGDLQRSIDFYTKVLGMKLLRTSENPEYKYSLAFVGYGPETEEAVIELTYNWGVDKYELGTAYGHIALSVD
NAAEACEKIRQNGGNVTREAGPVKGGTTVIAFVEDPDGYKIELIEEKDAGRGLGN
>P44638 4.4.1.5~~~gloA~~~Lactoylglutathione lyase~~~COG0346
MQILHTMLRVGDLDRSIKFYQDVLGMRLLRTSENPEYKYTLAFLGYEDGESAAEIELTYNWGVDKYEHGTAYGHIAIGVD
DIYATCEAVRASGGNVTREAGPVKGGSTVIAFVEDPDGYKIEFIENKSTKSGLGN
>P16635 4.4.1.5~~~gloA~~~Lactoylglutathione lyase~~~
MSLNDLNTLPGVTAQADPATAQFVFNHTMLRVKDIEKSLDFYTRVLGFKLVDKRDFVEAKFSLYFLALVDPATIPADDDA
RHQWMKSIPGVLELTHNHGTERDADFAYHHGNTDPRGFGHICVSVPDVVAACERFEALQVPFQKRLSDGRMNHLAFIKDP
DGYWVEVIQPTPL
>Q3J1A4 ~~~pufA~~~Light-harvesting protein B-875 alpha chain~~~
MSKFYKIWMIFDPRRVFVAQGVFLFLLAVMIHLILLSTPSYNWLEISAAKYNRVAVAE
>P0C0X9 ~~~pufA~~~Light-harvesting protein B-875 alpha chain~~~
MSKFYKIWMIFDPRRVFVAQGVFLFLLAVMIHLILLSTPSYNWLEISAAKYNRVAVAE
>P35089 ~~~~~~Light-harvesting protein B-800/820 alpha chain~~~
MNQGKIWTVVPPAFGLPLMLGAVAITALLVHAAVLTHTTWYAAFLQGGVKKAA
>P02948 ~~~pufA~~~Light-harvesting protein B-870 alpha chain~~~
MSKFYKIWLVFDPRRVFVAQGVFLFLLAVLIHLILLSTPAFNWLTVATAKHGYVAAAQ
>P35101 ~~~pucAA~~~Light-harvesting protein B-800-850 alpha chain A~~~
MNQARIWTVVKPTVGLPLLLGSVTVIAILVHFAVLSHTTWFSKYWNGKAAAIESSVNVG
>P80588 ~~~~~~Light-harvesting polypeptide B-885 alpha-1 chain~~~
SAPAQWKLWLVMDPRTVMIGTAAWLGVLALLIHFLLLGTERFNWIDTGLKEQKATAAAQAAITPAPVTAAAK
>P0DJO0 ~~~pufA~~~Light-harvesting protein B-870 alpha chain~~~
MWRIWRLFDPMRAMVAQAVFLLGLAVLIHLMLLGTNKYNWLDGAKKAPAATAVAPVPAEVTSLAQAK
>Q3J144 ~~~pucA~~~Light-harvesting protein B-800/850 alpha chain~~~
MTNGKIWLVVKPTVGVPLFLSAAVIASVVIHAAVLTTTTWLPAYYQGSAAVAAE
>P0C0Y0 ~~~pucA~~~Light-harvesting protein B-800/850 alpha chain~~~
MTNGKIWLVVKPTVGVPLFLSAAVIASVVIHAAVLTTTTWLPAYYQGSAAVAAE
>P80103 ~~~~~~Light-harvesting protein B800/830/1020 alpha-2 chain~~~
MWKLWKFVDFRMTAVGFHIFFALIAFAVHFACISSERFNWLEGAPAAEYYMDENPGIWKRTSYDG
>P35090 ~~~~~~Light-harvesting protein B-800/820 alpha chain~~~
MNQGKIWTVVNPAVGLPLLLGSVAITALLVHLAVLTHTTWFPAFTQGGLKKAA
>P07367 ~~~pucA~~~Light-harvesting protein B-800/850 alpha chain~~~
MNNAKIWTVVKPSTGIPLILGAVAVAALIVHAGLLTNTTWFANYWNGNPMATVVAVAPAQ
>P35102 ~~~pucAB~~~Light-harvesting protein B-800-850 alpha chain B~~~
MNQGRIWTVVNPGVGLPLLLGSVTVIAILVHYAVLSNTTWFPKYWNGATVAAPAAAPAPAAPAAKK
>P95655 ~~~pucA~~~Light-harvesting protein B-800/850 alpha chain~~~
MNNAKMWLVVKPTVGIPLFLVACAIASFLVHLMLVLTTGWMGDYYSGSFEAASLVSNATTLLS
>P80589 ~~~~~~Light-harvesting polypeptide B-885 alpha-2 chain~~~
SAPAQWKLWLVMDPRTVMIGTAAWLGVLALLIHFLLLGTERFNWIDTGLKEQKATAAAQA
>P77799 ~~~pucA~~~Light-harvesting protein B-800/850 alpha chain~~~
MNQGKVWRVVKPTVGVPVYLGAVAVTALILHGGLLAKTDWFGAYWNGGKKAAAAAAAVAPAPVAAPQAPAQ
>P35091 ~~~~~~Light-harvesting protein B-800/850 alpha chain~~~
MNQGKIWTVVNPSVGLPLLLGSVTVIAILVHAAVLSHTTWFPAYWQGGLKKAA
>P35103 ~~~pucAC~~~Light-harvesting protein B-800-850 alpha chain C~~~
MNQGRIWTVVSPTVGLPLLLGSVAAIAFAVHFAVLENTSWVAAFMNGKSVAAAPAPAAPAAPAKK
>P26789 ~~~~~~Light-harvesting protein B-800/850 alpha chain~~~
MNQGKIWTVVNPAIGIPALLGSVTVIAILVHLAILSHTTWFPAYWQGGVKKAA
>P35104 ~~~pucAD~~~Light-harvesting protein B-800-850 alpha chain D~~~
MNQGRIWTVVKPTVGLPLLLGSVAIMVFLVHFAVLTHTTWVAKFMNGKAAAIESSIKAV
>P35105 ~~~pucAE~~~Light-harvesting protein B-800-850 alpha chain E~~~
MNQGRIWTVVKPTVGLPLLLGSVTVIAILVHFAVLSNTTWFSKYWNGKAAAIESSVSIG
>P35092 ~~~~~~Light-harvesting protein B-880 alpha chain~~~
MYKLWLLFDPRRALVALSAFLFVLALIIHFIALSTDRFNWLEGKPAVKAA
>P35093 ~~~~~~Light-harvesting protein B-880 alpha chain~~~
MYKLWLLFDPRRTLVALSAFLFVLGLIIHFISLSTDRFNWLEGKPAVRA
>P80259 ~~~~~~Light-harvesting protein B-880 alpha chain~~~
MWKVWLLFDPRRTLVALFTFLFVLALLIHFILLSTDRFNWMQGAPTAPAQTS
>P04123 ~~~pufA~~~Light-harvesting protein B-1015 alpha chain~~~
MATEYRTASWKLWLILDPRRVLTALFVYLTVIALLIHFGLLSTDRLNWWEFQRGLPKAASLVVVPPAVG
>P07503 ~~~puf2A~~~Light-harvesting protein B-808/866 alpha chain~~~
MQPRSPVRTNIVIFTILGFVVALLIHFIVLSSPEYNWLSNAEGGALLLSAARALFGI
>P97253 ~~~A1~~~Light-harvesting protein B-800/850 alpha chain~~~
MSNPKDDYKIWLVINPSTWLPVIWIVATVVAIAVHAAVLAAPGFNWIALGAAKSAAK
>P02947 ~~~~~~Light-harvesting protein B-870 alpha chain~~~
MWRIWQLFDPRQALVGLATFLFVLALLIHFILLSTERFNWLEGASTKPVQTSMVMPSSDLAV
>P80586 ~~~~~~Light-harvesting polypeptide B-800/860 alpha chain~~~
MTNGKIWLVVKPTVGLPIGMLFAALLAVLIHGLLFVDGRLKSWWSEFPVAKPAVVSVQAAPAPVAAEVK
>Q3J1A3 ~~~pufB~~~Light-harvesting protein B-875 beta chain~~~
MADKSDLGYTGLTDEQAQELHSVYMSGLWLFSAVAIVAHLAVYIWRPWF
>P0C0Y1 ~~~pufB~~~Light-harvesting protein B-875 beta chain~~~
MADKSDLGYTGLTDEQAQELHSVYMSGLWPFSAVAIVAHLAVYIWRPWF
>P95673 ~~~B1~~~Light-harvesting protein B-800/850 beta 1 chain~~~
MAERSLSGLTEEEAIAVHDQFKTTFSAFIILAAVAHVLVWVWKPWF
>P35094 ~~~~~~Light-harvesting protein B-800/820 beta chain~~~
AEVLTSEQAEELHKHVIDGTRVFLVIAAIAHFLAFTLTPWLH
>P02950 ~~~pufB~~~Light-harvesting protein B-870 beta chain~~~
MADKNDLSFTGLTDEQAQELHAVYMSGLSAFIAVAVLAHLAVMIWRPWF
>P35106 ~~~pucBA~~~Light-harvesting protein B-800-850 beta chain A~~~
MADKTLTGLTVEESEELHKHVIDGTRIFGAIAIVAHFLAYVYSPWLH
>P80590 ~~~~~~Light-harvesting polypeptide B-885 beta-1 chain~~~
AEDRKSLSGLTEQEAQEFGTLYTQGVAFVAVIAVVAHALVWAWRPWLQ
>P0DJO1 ~~~pufB~~~Light-harvesting protein B-870 beta chain~~~
MAERKGSISGLTDDEAQEFHKFWVQGFVGFTAVAVVAHFLVWVWRPWL
>Q3J145 ~~~pucB~~~Light-harvesting protein B-800/850 beta chain~~~
MTDDLNKVWPSGLTVAEAEEVHKQLILGTRVFGGMALIAHFLAAAATPWLG
>P0C0Y2 ~~~pucB~~~Light-harvesting protein B-800/850 beta chain~~~
MTDDLNKVWPSGLTVAEAEEVHKQLILGTRVFGGMALIAHFLAAAATPWLG
>P11696 ~~~~~~Light-harvesting protein B800/830/1020 beta-2 chain~~~
TDIRTGLTDEECQEIHEMNMLGMHAYWSIGLIANALAYAWRPFHQGRAGNRLEDHAPDYVRSALT
>P95674 ~~~B2~~~Light-harvesting protein B-800/850 beta 2 chain~~~
MAERSLSGLTEEEAVAVHAQFQTTFSAFIVLAAVAHVLVWVWKPWF
>P35095 ~~~puc1B~~~Light-harvesting protein B-800/820 beta-1 chain~~~
MADKPLTADQAEELHKYVIDGARAFVAIAAFAHVLAYSLTPWLH
>P07368 ~~~pucB~~~Light-harvesting protein B-800/850 beta chain~~~
MTDDKAGPSGLSLKEAEEIHSYLIDGTRVFGAMALVAHILSAIATPWLG
>P35107 ~~~pucBB~~~Light-harvesting protein B-800-850 beta chain B~~~
MADDPNKVWPTGLTIAESEELHKHVIDGTRIFGAIAIVAHFLAYVYSPWLH
>P95654 ~~~pucB~~~Light-harvesting protein B-800/850 beta chain~~~
MTDDMDKVWPTGLTLAEAEEVHKQLIDGTRVFGAIALFAHFLAAIATPWLG
>P80591 ~~~~~~Light-harvesting polypeptide B-885 beta-2 chain~~~
AEDRKSLSGLTEQEAQEFGTLYTQGVAFVAVIAIVAHALVWAWRPWLQ
>P35096 ~~~~~~Light-harvesting protein B-800/820 beta-2 chain~~~
AVLSPEQSEELHKYVIDGARAFLGIALVAHFLAFSATPWLH
>P72281 ~~~pucB~~~Light-harvesting protein B-800/850 beta chain~~~
MADDANKVWPSGLTTAEAEELQKGLVDGTRVFGVIAVLAHILAYAYTPWLH
>P35097 ~~~~~~Light-harvesting protein B-800-850 beta chain~~~
ADDVKGLTGLTAAESEELHKHVIDGTRVFFVIAIFAHVLAFAFSPWLH
>P35109 ~~~pucBD~~~Light-harvesting protein B-800-850 beta chain D~~~
MVDDPNKVWPTGLTIAESEELHKHVIDGSRIFVAIAIVAHFLAYVYSPWLH
>P26790 ~~~~~~Light-harvesting protein B-800/850 beta chain~~~
ATLTAEQSEELHKYVIDGTRVFLGLALVAHFLAFSATPWLH
>P35098 ~~~~~~Light-harvesting protein B-880 beta chain~~~
AEDRSSLSGVSDAEAKEFHALFVSSFMGFMVVAVLAHVLAWAWRPWIPGPKGWA
>P35099 ~~~~~~Light-harvesting protein B-880 beta chain~~~
AEDRSSLSGVSDAEAKEFHALFVSSFTAFIVIAVLAHVLAWAWRPWIPGPKGWA
>P80260 ~~~~~~Light-harvesting protein B-880 beta chain~~~
AEIDRPVSLSGLTEGEAREFHGVFMTSFMVFIAVAIVAHILAWMWRPWIPGPEGYA
>P04124 ~~~pufB~~~Light-harvesting protein B-1015 beta chain~~~
MADLKPSLTGLTEEEAKEFHGIFVTSTVLYLATAVIVHYLVWTARPWIAPIPKGWVNLEGVQSALSYLV
>P09927 ~~~puf2B~~~Light-harvesting protein B-808/866 beta chain~~~
MRDDDDLVPPKWRPLFNNQDWLLHDIVVKSFYGFGVIAAIAHLLVYLWKPWLP
>Q2RQ23 ~~~~~~Light-harvesting protein B-870 beta chain~~~
MAEVKQESLSGITEGEAKEFHKIFTSSILVFFGVAAFAHLLVWIWRPWVPGPNGYSALETLTQTLTYLS
>P0C190 ~~~~~~Light-harvesting protein B-870 beta chain~~~
EVKQESLSGITEGEAKEFHKIFTSSILVFFGVAAFAHLLVWIWRPWVPGPNGYS
>P80587 ~~~~~~Light-harvesting polypeptide B-800/860 beta chain~~~
ADANKVWPTGLTVAEAEELHTYVTNGFRVFVGIAVVAHVLVFAAHPWGRGGALVA
>P37339 1.1.5.13~~~lhgD~~~L-2-hydroxyglutarate dehydrogenase~~~COG0579
MYDFVIIGGGIIGMSTAMQLIDVYPDARIALLEKESAPACHQTGHNSGVIHAGVYYTPGSLKAQFCLAGNRATKAFCDQN
GIRYDNCGKMLVATSDLEMERMRALWERTAANGIEREWLNADELREREPNITGLGGIFVPSSGIVSYRDVTAAMAKIFQS
RGGEIIYNAEVSGLNEHKNGVVIRTRQGGEYEASTLISCSGLMADRLVKMLGLEPGFIICPFRGEYFRLAPEHNQIVNHL
IYPIPDPAMPFLGVHLTRMIDGSVTVGPNAVLAFKREGYRKRDFSFSDTLEILGSSGIRRVLQNHLRSGLGEMKNSLCKS
GYLRLVQKYCPRLSLSDLQPWPAGVRAQAVSPDGKLIDDFLFVTTPRTIHTCNAPSPAATSAIPIGAHIVSKVQTLLASQ
SNPGRTLRAARSVDALHAAFNQ
>P04126 ~~~~~~Light-harvesting protein B-1015 gamma chain~~~
YFAADGSVVPSISDWNLWVPLGILGIPTIWIALTYR
>O32199 ~~~liaF~~~Protein LiaF~~~COG4758
MTKKQLLGLIIALFGISMFLQIIGIGDLLFWPLFFLIAGYFLKKYSRDWLGSVMYIFAAFLFLKNLFSITFNLFGYAFAA
FLIYAGYRLIKGKPIFEPNEKQVNLNKKEHHEPPKDVKHPDMRSFFIGELQMMKQPFDLNDLNVSGFIGDIKIDLSKAMI
PEGESTIVISGVIGNVDIYVPSDLEVAVSSAVFIGDINLIGSKKSGLSTKVYAASTDFSESKRRVKVSVSLFIGDVDVKY
V
>O32200 ~~~liaG~~~Protein LiaG~~~COG3595
MKKMLGKLLITAGILVFFAVVVKDVFAAGLGFINGTEADSASASPRDIDSMVIESDSKDVRIIAEERSDISAGISGDSGK
LFVTENKRKLELTVKEKEFQFLNGFNRSTLIVRLPYDYKGDLAVRTSSGDVSVAGNDHLALSGLNAVSASGNTSVTDVRI
QDLKVKASSGDVSISNTVSKTAGIDLASGDANLVHVSGSLDVKMTSGDFNAVLKKVTGPVSVTLTSGDANVSLPQNGSFA
VNALSASGDVSSPYSFADKGHKEQHHITGTQGSGRHPIDIKTDSGDLAIR
>O32201 ~~~liaH~~~Protein LiaH~~~COG1842
MVLKRIRDMFVASVHEGLDKMENPKVMLNQYVRDMESDIAKAKQTIVKQHTIAYQFKKKYEEAAEVAGKRKNQAQLAFDA
GEEELAKKALTEMKYLEGKAAEHKASYEQANSQLADLKEQLAALETKLQDVKDKKQALIARANAAKAKEHMNTTFDKIDS
ESAYREFLRIENRIEEMEIRANYSKSAEAGTELTRKEFADDVEAEIEKMRTLSLQKSDQTKAANE
>O32202 ~~~liaI~~~Protein LiaI~~~COG4758
MKINKKTIGGFLLIVFGISVFFGGGSFGFIIPLAIGSLMTYAGIKRFAAGKTITGIIVGGIGAIMLICSLPFVVGIALAA
AMVYYGWKLMKNGSADNGVSSFDPEPASAAYQSHFDDEWEEFLKKK
>O32197 ~~~liaR~~~Transcriptional regulatory protein LiaR~~~COG2197
MIRVLLIDDHEMVRMGLAAFLEAQPDIEVIGEASDGSEGVRLAVELSPDVILMDLVMEGMDGIEATKQICRELSDPKIIV
LTSFIDDDKVYPVIEAGALSYLLKTSKAAEIADAIRAASKGEPKLESKVAGKVLSRLRHSGENALPHESLTKRELEILCL
IAEGKTNKEIGEELFITIKTVKTHITNILSKLDVSDRTQAAVYAHRNHLVN
>O32198 2.7.13.3~~~liaS~~~Sensor histidine kinase LiaS~~~COG4585
MRKKMLASLQWRAIRMTTGISLLLFVCLISFMMFYYRLDPLVLLSSSWFGIPFILILLLISVTVGFASGYMYGNRLKTRI
DTLIESILTFENGNFAYRIPPLGDDEIGLAADQLNEMAKRVELQVASLQKLSNERAEWQAQMKKSVISEERQRLARDLHD
AVSQQLFAISMMTSAVLEHVKDADDKTVKRIRMVEHMAGEAQNEMRALLLHLRPVTLEGKGLKEGLTELLDEFRKKQPID
IEWDIQDTAISKGVEDHLFRIVQEALSNVFRHSKASKVTVILGIKNSQLRLKVIDNGKGFKMDQVKASSYGLNSMKERAS
EIGGVAEVISVEGKGTQIEVKVPIFPEEKGENERDSSIID
>Q03974 2.-.-.-~~~lex1~~~Lipooligosaccharide biosynthesis protein lex-1~~~COG3306
MSAIENIVISMENATERRKHITKQFESKKLSFSFFNAYTYQSINQSINQSINQSINQSINQSINQSINQSNSILHNIEES
RILTKGEKGCLISHFLLWNKCVNENFEYLKIFEDDVILGENAEVFLNQNEWLKTRFDFNDIFIIRLETFLQPVKLEKQTK
IPPFNSRNFDILKSTHWGTAGYIISQGAAKYVIEYLKNIPSDEIVAVDELIFNKLVDVDNYIVYQLNPAICIQELQANQS
KSVLTSGLEKERQKRSKIRKKKTLKQRLTRIKENIIRALNRKKWKEQQRIKEMQGKEIVRFM
>P46320 3.2.1.86~~~licH~~~Probable 6-phospho-beta-glucosidase~~~COG1486
MTKGLKIVTIGGGSSYTPELVEGFIKRYDELPVRELWLVDIPEGEEKLNIVGTLAKRMVEKAGVPIDIHLTLDRRKALKD
ADFVTTQFRVGLLQARAKDERIPLKYGVIGQETNGPGGLFKGLRTIPVILEIAKDIEELCPNAWLVNFTNPAGMVTEALL
RYSNLKKVVGLCNVPIGIKMGVAKALDVDVDRVEVQFAGLNHMVFGLDVFLDGVSVKEQVIEAMGDPKNAMTMKNISGAE
WEPDFLKALNVIPCGYHRYYFKTKEMLEHELEASQTEGTRAEVVQKVEKELFELYKDPNLAIKPPQLEKRGGAYYSDAAC
NLISSIYNDKHDIQPVNTINNGAIASIPDDSAVEVNCVMTKTGPKPIAVGDLPVSVRGLVQQIKSFERVAAEAAVTGDYQ
TALLAMTINPLVPSDTVAKQILDEMLEAHKAYLPQFFNKIEA
>P46321 ~~~licR~~~Probable licABCH operon regulator~~~COG1762
MLHGRLRDILRLLMAAEAPVTSSFFAAQLNVTTRTVRNDIKELQGVLSGHGAFVQSVRGSGYKLRIDDEQVFRTLLQDEF
QQKKGLPVLPEERMAYLMKRLLLADHYLKLDELAEELFISKSTLQTDLKEVKKRLLPYRIVMETRPNYGFKLRGDEVQMR
YCMAEYIVDERETEIDVLNEKADILPKEEIEIIRSAILKKMKNDRIPLSNMGLNNLIIHIAIACKRIRTENYVSLFPKDM
DHILHQKEYQAAEAIVKELESKLSVTFPKDETAYITMHLLGTKRMTQSQCGEDTFSIEEETDQLTLAMIKAVDRELKLGI
LHDKELKIGLALHMKPAISRNRYGMNLRNPMLAAIKEHYPLAFEAGIIAGIVIKEQTGIEIHENEIGYLALHFGAAIERK
KTESPPKRCIIVCASGAGSAQLLREKLRSHFGKRLDILGTAEYYSLDQMSYESIDFVISTIPIKKELPVPVLKVNTILGG
TDFTKIESILSDEKEKANRYLKKELVFFQEDLRSKEEVIQFLGQKVVECGFADEEIIDSIFEREDMSPTCFGNLVAIPHP
LVPQTKTTFWAVCTLKKPIDWESQRVQFVCLLCVEKENKADLQSMYKLLGSILDDPAAMNQLIKCRSYQELSDVFDQKML
S
>P39805 ~~~licT~~~Transcription antiterminator LicT~~~COG3711
MKIAKVINNNVISVVNEQGKELVVMGRGLAFQKKSGDDVDEARIEKVFTLDNKDVSEKFKTLLYDIPIECMEVSEEIISY
AKLQLGKKLNDSIYVSLTDHINFAIQRNQKGLDIKNALLWETKRLYKDEFAIGKEALVMVKNKTGVSLPEDEAGFIALHI
VNAELNEEMPNIINITKVMQEILSIVKYHFKIEFNEESLHYYRFVTHLKFFAQRLFNGTHMESQDDFLLDTVKEKYHRAY
ECTKKIQTYIEREYEHKLTSDELLYLTIHIERVVKQA
>P81715 3.4.24.-~~~lieA~~~Leupeptin-inactivating enzyme 1~~~
MSLSVSRRLAAVTAFAVAGLFASAVPAALAAPSAVAAAPTPPDIPLANVKAHLSQLSTIAANNGGNRAHGRAGYKASIDY
VKGKLDAAGFTTTLQTFTSSGATGYNLIADWPGGDPNSVLMAGSHLDSVTSGAGINDNGSGSAAVLETALAVSRAGLQPT
KHLRFGWWGAEELGLIGSKYYVNNLPAAEKAKISGYLNFDMIGSPNPGYFVYDDDPTIEQTFKNYYAGLGVPTEIETEGD
GRSDHAPFKNAGIPVGGLFSGADYTKTAAQAQKWGGTSGQAFDRCYHSSCDSLTNINDTALDRNSDAVAYAIWTLGAGTP
VPPGQSFENTADVNVPDSPAAAVSSPITVSGVTGNAPATTKVDVNIVHTYRGDLVVDLVAPDGTVYNLHNRSGGSADNLV
QTYTVNASSEVANGVWNLRVKDTAAQDVGYINSWKITF
>Q05490 ~~~lifO~~~Lipase-specific foldase~~~
MAQADRPARGGLAARPMRGASFALAGLVACAACAAVVLWLRPAAPSPAPAGAVAGGPAAGVPAAASGAAEAAMPLPAALP
GALAGSHAPRLPLAAGGRLARTRAVREFFDYCLTAQGELTPAALDALVRREIAAQLDGSPAQAEALGVWRRYRAYFDALA
QLPGDGAVLGDKLDPAAMQLALDQRAALADRTLGEWAEPFFGDEQRRQRHDLERIRIANDTTLSPEQKAARLAALDAQLT
PDERAQQAALHAQQDAVTKIADLQKAGATPDQMRAQIAQTLGPEAAARAAQMQQDDEAWQTRYQAYAAERDRIAAQGLAP
QDRDARIAQLRQQTFTAPGEAIRAASLDRGAGG
>Q01725 ~~~lifO~~~Lipase chaperone~~~
MKKILLLIPLAFAASLAWFVWLEPSPAPETAPPASPQAGADRAPPAASAGEAVPAPQVMPAKVAPLPTSFRGTSVDGSFS
VDASGNLLITRDIRNLFDYFLSAVGEEPLQQSLDRLRAYIAAELQEPARGQALALMQQYIDYKKELVLLERDLPRLADLD
ALRQREAAVKALRARIFSNEAHVAFFADEETYNQFTLERLAIRQDGKLSAEEKAAAIDRLRASLPEDQQESVLPQLQSEL
QQQTAALQAAGAGPEAIRQMRQQLVGAEATTRLEQLDRQRSAWKGRLDDYFAEKSRIEGNTGLSEADRRAAVERLAEERF
SEQERLRLGALEQMRQAEQR
>P25772 6.5.1.2~~~ligB~~~DNA ligase B~~~COG0272
MKVWMAILIGILCWQSSVWAVCPAWSPARAQEEISRLQQQIKQWDDDYWKEGKSEVEDGVYDQLSARLTQWQRCFGSEPR
DVMMPPLNGAVMHPVAHTGVRKMVDKNALSLWMRERSDLWVQPKVDGVAVTLVYRDGKLNKAISRGNGLKGEDWTQKVSL
ISAVPQTVSGPLANSTLQGEIFLQREGHIQQQMGGINARAKVAGLMMRQDDSDTLNSLGVFVWAWPDGPQLMSDRLKELA
TAGFTLTQTYTRAVKNADEVARVRNEWWKAELPFVTDGVVVRAAKEPESRHWLPGQAEWLVAWKYQPVAQVAEVKAIQFA
VGKSGKISVVASLAPVMLDDKKVQRVNIGSVRRWQEWDIAPGDQILVSLAGQGIPRIDDVVWRGAERTKPTPPENRFNSL
TCYFASDVCQEQFISRLVWLGAKQVLGLDGIGEAGWRALHQTHRFEHIFSWLLLTPEQLQNTPGIAKSKSAQLWHQFNLA
RKQPFTRWVMAMGIPLTRAALNASDERSWSQLLFSTEQFWQQLPGTGSGRARQVIEWKENAQIKKLGSWLAAQQITGFEP
>A0R5T2 6.5.1.1~~~ligC~~~DNA ligase C1~~~COG1793
MDLPVQPPIEPMLAKAQVKVPDEAGVWSYEPKWDGFRALVFRDGDDVVLQSRNGKDLGRYFPELLDALRDELVEKCVLDG
EVVVPRDIAGRVRLDWESLSQRIHPAASRIKMLAEQTPAHFIGFDALALGDRSLLKEPFRVRREALAEAVDNKRWCHVTR
TSEDPALGTEWLKTFEGAGLDGVIAKRLDGPYLPGKREMVKVKHHRDADCVAMGYRIHKSGDGIGSILLGLYRDDGELQM
VGGAASFTAKDRIKLLAELEPLREGDEMREGDPSRWNSAADKRWTPLRPEKVCEVAYDQMEGNSVEGRRFRHAVKFLRWR
PDREPSSCTFDQLDTPLNYDLYDVLEEQ
>L0TDE1 6.5.1.1~~~ligC~~~DNA ligase C~~~COG1793
MQLPVMPPVSPMLAKSVTAIPPDASYEPKWDGFRSICFRDGDQVELGSRNERPMTRYFPELVAAIRAELPHRCVIDGEII
IATDHGLDFEALQQRIHPAESRVRMLADRTPASFIAFDLLALGDDDYTGRPFSERRAALVDAVTGSGADADLSIHVTPAT
TDMATAQRWFSEFEGAGLDGVIAKPPHITYQPDKRVMFKIKHLRTADCVVAGYRVHKSGSDAIGSLLLGLYQEDGQLASV
GVIGAFPMAERRRLLTELQPLVTSFDDHPWNWAAHVAGQRTPRKNEFSRWNVGKDLSFVPLRPERVVEVRYDRMEGARFR
HTAQFNRWRPDRDPRSCSYAQLERPLTVSLSDIVPGLR
>Q9KWL3 1.1.1.312~~~ligC~~~4-carboxy-2-hydroxymuconate-6-semialdehyde dehydrogenase~~~
MRIALAGAGAFGEKHLDGLKNIDGVEIVSIISRKAEQAAEVAAKYGAKHSGTDLSEALARDDVDAVILCTPTQMHAEQAI
ACMNAGKHVQVEIPLADSWADAEAVMKKSQETGLVCMVGHTRRFNPSHQYIHNKIVAGELAIQQMDVQTYFFRRKNMNAK
GEPRSWTDHLLWHHAAHTVDLFAYQAGKIVQANAVQGPIHPELGIAMDMSIQLKSETGAICTLSLSFNNDGPLGTFFRYI
CDNGTWIARYDDLVTGKEEPVDVSKVDVSMNGIELQDREFIAAIREGREPNSSVARVLDCYRVLGELEVQLEKQG
>O34398 ~~~ligd~~~Bifunctional non-homologous end joining protein LigD~~~COG1793
MAFTMQPVLTSSPPIGAEWRYEVKYDGYRCILRIHSSGVTLTSRNGVELSSTFPEITQFAKTAFQHLEKELPLTLDGEIV
CLVNPCRADFEHLQVRGRLKRPDKIQESANARPCCFLAFDLLERSGEDVTLLSYLDRKKSLRELISAAKLPASPDPYAKE
TIQSIPCYDHFDQLWEMVIKYDGEGIVAKKTNSKWLEKKRSSDWLKYKNFKQAYVCITGFNPNNGFLTVSVLKNGIMTPI
ASVSHGMRDEEKSAIREIMEQHGHQTPSGEFTLEPSICAAVQYLTILQGTLREVSFIGFEFQMDWTECTYAQVIRHSKPV
HPKLQFTSLDKIIFEKNKKTKEDFIQYMIEVSDYLLPFLKNRAVTVIRYPHGSRSESFFQKNKPDYAPDFVQSFYDGSHE
HIVCEDMSTLLWLCNQLALEFHVPFQTIKSRRPAEIVIDLDPPSRDDFLMAVQAANELKRLLDSFGITSYPKLSGNKGIQ
LYIPLSPEAFTYEETRQFTQLIAEYCTNAFPELFTTERLIKNRHCKLYLDYLQHAEGKTIICPYSTRGNELGTVAAPLYW
HEVQSSLTPALFTIDTVIDRIKKQGCPFFDFYRNPQDEPLSAILHQLKKKS
>A0R3R7 ~~~ligD~~~Multifunctional non-homologous end joining protein LigD~~~COG1793
MARHPWGMERYERVRLTNPDKVLYPATGTTKAEVFDYYLSIAQVMVPHIAGRPVTRKRWPNGVAEEAFFEKQLASSAPSW
LERGSITHKSGTTTYPIINTREGLAWVAQQASLEVHVPQWRFEDGDQGPATRIVFDLDPGEGVTMTQLCEIAHEVRALMT
DLDLETYPLTSGSKGLHLYVPLAEPISSRGASVLARRVAQQLEQAMPKLVTATMTKSLRAGKVFLDWSQNNAAKTTIAPY
SLRGRDHPTVAAPRTWDEIADPELRHLRFDEVLDRLDEYGDLLAPLDADAPIADKLTTYRSMRDASKTPEPVPKEIPKTG
NNDKFVIQEHHARRLHYDLRLERDGVLVSFAVPKNLPETTAENRLAVHTEDHPIEYLAFHGSIPKGEYGAGDMVIWDSGS
YETEKFRVPEELDNPDDSHGEIIVTLHGEKVDGRYALIQTKGKNWLAHRMKDQKNARPEDFAPMLATEGSVAKYKAKQWA
FEGKWDGYRVIIDADHGQLQIRSRTGREVTGEYPQFKALAADLAEHHVVLDGEAVALDESGVPSFGQMQNRARSTRVEFW
AFDILWLDGRSLLRAKYSDRRKILEALADGGGLIVPDQLPGDGPEAMEHVRKKRFEGVVAKKWDSTYQPGRRSSSWIKDK
IWNTQEVVIGGWRQGEGGRSSGIGALVLGIPGPEGLQFVGRVGTGFTEKELSKLKDMLKPLHTDESPFNAPLPKVDARGV
TFVRPELVGEVRYSERTSDGRLRQPSWRGLRPDKTPDEVVWE
>P9WNV3 ~~~ligD~~~Multifunctional non-homologous end joining DNA repair protein LigD~~~COG1793
MGSASEQRVTLTNADKVLYPATGTTKSDIFDYYAGVAEVMLGHIAGRPATRKRWPNGVDQPAFFEKQLALSAPPWLSRAT
VAHRSGTTTYPIIDSATGLAWIAQQAALEVHVPQWRFVAEPGSGELNPGPATRLVFDLDPGEGVMMAQLAEVARAVRDLL
ADIGLVTFPVTSGSKGLHLYTPLDEPVSSRGATVLAKRVAQRLEQAMPALVTSTMTKSLRAGKVFVDWSQNSGSKTTIAP
YSLRGRTHPTVAAPRTWAELDDPALRQLSYDEVLTRIARDGDLLERLDADAPVADRLTRYRRMRDASKTPEPIPTAKPVT
GDGNTFVIQEHHARRPHYDFRLECDGVLVSWAVPKNLPDNTSVNHLAIHTEDHPLEYATFEGAIPSGEYGAGKVIIWDSG
TYDTEKFHDDPHTGEVIVNLHGGRISGRYALIRTNGDRWLAHRLKNQKDQKVFEFDNLAPMLATHGTVAGLKASQWAFEG
KWDGYRLLVEADHGAVRLRSRSGRDVTAEYPQLRALAEDLADHHVVLDGEAVVLDSSGVPSFSQMQNRGRDTRVEFWAFD
LLYLDGRALLGTRYQDRRKLLETLANATSLTVPELLPGDGAQAFACSRKHGWEGVIAKRRDSRYQPGRRCASWVKDKHWN
TQEVVIGGWRAGEGGRSSGVGSLLMGIPGPGGLQFAGRVGTGLSERELANLKEMLAPLHTDESPFDVPLPARDAKGITYV
KPALVAEVRYSEWTPEGRLRQSSWRGLRPDKKPSEVVRE
>Q9I1X7 ~~~ligD~~~Multifunctional non-homologous end joining protein LigD~~~
MPSSKPLAEYARKRDFRQTPEPSGRKPRKDSTGLLRYCVQKHDASRLHYDFRLELDGTLKSWAVPKGPCLDPAVKRLAVQ
VEDHPLDYADFEGSIPQGHYGAGDVIVWDRGAWTPLDDPREGLEKGHLSFALDGEKLSGRWHLIRTNLRGKQSQWFLVKA
KDGEARSLDRFDVLKERPDSVLSERTLLPRHGEAATPAARPARRGKSGGKTPMPEWIAPELASLVEQPPRGEWAYELKLD
GYRLMSRIEDGHVRLLTRNGHDWTERLPHLEKALAGLGLQRSWLDGELVVLDEEGRPDFQALQNAFEEGRGENILYVLFD
LPYHEGEDLRDVALEERRARLEALLEGRDEDPLRFSATLAEDPRDLLASACKLGLEGVIGKRLGSAYRSRRSNDWIKLKC
QLRQEFVIVGYTEPKGSRRHIGALLLGLYSPDEERRLRYAGKVGSGFTAASLKKVRERLEPLAVRSSPLAKVPPARETGS
VQWVRPQQLCEVSYAQMTRGGIIRQAVFHGLREDKPAREVTGERPAGPPPLRGARKASAGASRAATAGVRISHPQRLIDP
SIQASKLELAEFHARYADLLLRDLRERPVSLVRGPDGIGGELFFQKHAARLKIPGIVQLDPALDPGHPPLLQIRSAEALV
GAVQMGSIEFHTWNASLANLERPDRFVLDLDPDPALPWKRMLEATQLSLTLLDELGLRAFLKTSGGKGMHLLVPLERRHG
WDEVKDFAQAISQHLARLMPERFSAVSGPRNRVGKIFVDYLRNSRGASTVAAYSVRAREGLPVSVPVFREELDSLQGANQ
WNLRSLPQRLDELAGDDPWADYAGTRQRISAAMRRQLGRG
>P27457 ~~~ligE~~~Beta-etherase~~~
MARNNTITLYDLQLESGCTISPYVWRTKYALKHKGFDIDIVPGGFTGILERTGGRSERVPVIVDDGEWVLDSWVIAEYLD
EKYPDRPMLFEGPTQKNLMKFLDNWLWSTAVGPWFRCYILDYHDLSLPQDRDYVRWSREQWFLGGQRLEDVQAGREDRLP
LVPPTLEPFRRILAETKWLGGDQPNFADYSALAVFLWTASVARTPPLTEDDPLRDWLDRGFDLFDGLGRHPGMNPLFGLK
LREGDPEPFVRQTGPAGAGGQALNKGPQTTKMPPRVAEKAD
>P30347 ~~~ligF~~~Protein LigF~~~
MTLKLYSFGPGANSLKPLATLYEKGLEFEQVFVDPSKFEQHSDWFKKINPRGQVPALWHDGKVVTESTVICEYLEDVFPE
SGNSLRPADPFKRAEMRVWTKWVDEYFCWCVSTIGWAFGIKAIAQKMSDEEFEEHINKNVPIPEQQLKWRRARNGFPQEM
LDEEFRKVGVSVARLEETLSKQDYLVDTGYSLADICNFAIANGLQRPGGFFGDYVNQEKTPGLCAWLDRINARPAIKEMF
EKSKREDLLKRQNEKVA
>Q93PS7 3.1.1.57~~~pmdD~~~2-pyrone-4,6-dicarbaxylate hydrolase~~~
MSQFEKTPGWLDWYANPSKPQFKLPAGAVDAHCHVFGPGNEFPFAPERKYTPCDASKAQLYALRDHLGFARNVVVQATCH
GADNRAMVDACKSSGGKARGVATVKRSISDAELQQLHDAGVRGVRFNFVKRLVDFTPKDELMEIAGRIAKLGWHVVIYFE
AVDLPELWDFFTALPTTVVVDHMGRPDVTKGVDSEEFALFLKFMREHQNVWSKVSCPERLSVTGPKALNGEQNAYRDVVP
FARRVVEEFPDRVLWGTDWPHPNLKDHMPDDGLLVDFIPHIAPTAELQQKLLVDNPMRLYWPEEV
>O87170 3.1.1.57~~~ligI~~~2-pyrone-4,6-dicarboxylate hydrolase~~~COG3618
MTNDERILSWNETPSKPRYTPPPGAIDAHCHVFGPMAQFPFSPKAKYLPRDAGPDMLFALRDHLGFARNVIVQASCHGTD
NAATLDAIARAQGKARGIAVVDPAIDEAELAALHEGGMRGIRFNFLKRLVDDAPKDKFLEVAGRLPAGWHVVIYFEADIL
EELRPFMDAIPVPIVIDHMGRPDVRQGPDGADMKAFRRLLDSREDIWFKATCPDRLDPAGPPWDDFARSVAPLVADYADR
VIWGTDWPHPNMQDAIPDDGLVVDMIPRIAPTPELQHKMLVTNPMRLYWSEEM
>G2IQQ5 4.2.1.-~~~ligJ~~~2-keto-4-carboxy-3-hexenedioate hydratase~~~COG2159
MMMIIDCHGHYTVLPKAHDEWREQQKAAFKAGQPAPPYPEISDDEIRETIEANQLRLIKERGADMTIFSPRASAMAPHVG
DQSVAVPWAQACNNLIARVVDLFPETFAGVCMLPQSPEADMTSSIAELERCVNELGFIGCNLNPDPGGGHFKHPPLTDRF
WYPFYEKMVELDVPAMIHVSGSCNPAMHATGAYYLAADTIAFMQLLQGNLFADFPTLRFIIPHGGGAVPYHWGRFRGLAD
MLKQPSLDTLLMNNVFFDTCVYHQPGINLLADVIDNKNILFGSEMVGAVRGIDPTTGHYFDDTKRYIDALDISDQERHAI
FEGNTRRVFPRLDAKLKARGL
>G2IQQ8 4.1.3.17~~~ligK~~~4-carboxy-4-hydroxy-2-oxoadipate aldolase~~~COG0684
MRGAAMGVVVQNIERAPLEVIDGLAACGVATVHEAQGRTGLLASYMRPIYRGARVAGSALTISAPPGDNWMVHVAIEQLK
AGDILLLAPTSPCEDGYFGDLLATSAQARGCRGLVIDAGVRDVRDLTEMNFPVWSKAIYAQGTVKNTLGSVNVPVVCANA
LVNPGDVIVADDDGVCVVPLANAEKVLEAARAREANEGDKREKMANGVLGLDLYKMRERLEKEGLKYV
>G2IQS7 2.1.1.341~~~ligM~~~Vanillate/3-O-methylgallate O-demethylase~~~COG0404
MSAPTNLEQVLAAGGNTVEMLRNSQIGAYVYPVVAPEFSNWRTEQWAWRNSAVLFDQTHHMVDLYIRGKDALKLLSDTMI
NSPKGWEPNKAKQYVPVTPYGHVIGDGIIFYLAEEEFVYVGRAPAANWLMYHAQTGGYNVDIVHDDRSPSRPMGKPVQRI
SWRFQIQGPKAWDVIEKLHGGTLEKLKFFNMAEMNIAGMKIRTLRHGMAGAPGLEIWGPYETQEKARNAILEAGKEFGLI
PVGSRAYPSNTLESGWIPSPLPAIYTGDKLKAYREWLPANSYEASGAIGGSFVSSNIEDYYVNPYEIGYGPFVKFDHDFI
GRDALEAIDPATQRKKVTLAWNGDDMAKIYASLFDTEADAHYKFFDLPLANYANTNADAVLDAAGNVVGMSMFTGYSYNE
KRALSLATIDHEIPVGTELTVLWGEENGGTRKTTVEPHKQMAVRAVVSPVPYSVTARETYEGGWRKAAVTA
>Q0KJL4 5.3.3.-~~~ligU~~~(4E)-oxalomesaconate Delta-isomerase~~~
MPRRDRNMDSAPCMWMRGGTSKGGYFLRADLPADTAARDAFLLAVMGSPDPRQIDGMGGADPLTSKVAVVSKSERPGIDV
DYLFLQVFVDQAIVTDAQNCGNILAGVGPFAIERGLVAASGDETRVAIFMENTGQVAVATVRTPGGSVTYAGDAAIDGVP
GTHAPIPTEFRDTAGSSCGALLPSGNAVDVVNGLPVTLIDNGMPCVVMKAADVGITGYEDRDSLDANAELKAKIEAIRLA
VGELMNLGDVTEKSVPKMMLVAPPRDGGAVCVRSFIPHRAHATIGVLGAVSVATACLIPGSPAAEVAVVPEGARKTLSIE
HPTGEMSCVLEVDDAGNVVSAALLRTARKLMDGVVFV
>G2IN04 1.14.13.-~~~ligXa~~~5,5'-dehydrodivanillate O-demethylase oxygenase subunit~~~COG4638
MLSAEQNDKLARVGPGTPMGELLRRYWHPIGGESEFETKATRPVRLMGEDLVLYKDLSGNYGLMDRHCPHRRADMACGMV
EADGLRCSYHGWMFDAQGACTEQPFEDTANPKGRYKDKVRIKAYPVRALGGLLWAYMGPLPAPELPDWEPFSWKNGFRQI
VISVLPCNWLQGQENSMDPIHFEWMHANWSKRLRGETGPYGPKHLKIDFREYDYGFTYNRIREDTDETNPLWTIGRACLW
PNAMFTGDHFEYRVPIDDETMMSVGWFFTRVPRDAEPYVQESIPVWHGPIKDAQGEWITSHVMNQDFVAWIGQGTISDRT
QENLGLSDKGIGMMRRQFLRDMEKISRGEDPKAIIRDPAINKAIPLPTIHRDAVMEGMTAEEIEAGGALHLKRFIFQYGQ
PEHVLKMQQDAMRISQDNKGYVDA
>G2IN77 1.14.13.-~~~ligXc~~~5,5'-dehydrodivanillate O-demethylase ferredoxin subunit~~~COG0633
MAQLKVVTRDGSLHEFEAPDGYTVMEAIRDQGIDELLAICGGCCSCATCHVFVEEAFLDKLPPLKGDEDDLLDSSDHRQA
NSRLSCQLPIGPELGGMTVTIAPED
>G2ITT5 1.14.13.-~~~ligXd~~~5,5'-dehydrodivanillate O-demethylase ferredoxin reductase subunit~~~COG1251
MPHFDCLIVGGGHAGAQAAILLRQLKFEGTVGLISGETEYPYERPPLSKDYLAGEKIFDRILLRPRNFWGDQGIELFLGE
RVKALQPAEHSLTTASGAEFTYGKLIWAGGGVARRLSCPGGTAKGLFTVRTRADVDAVMAVLPQAERFAIVGGGYIGLEA
AAVLSKLGKQVTLIEALDRVLARVAGPELSAFFEDEHRAHGVDVRLACGVEAIEADEQDRATGVRLADGTIIPTDAVIVG
IGIVPETGPLLLAGASGGNGVDVDEYCLTSLPDVYAIGDCAAHENRFAEGRRVRVESVQNANDQARTAVQHIIGTPAPYD
AVPWFWSNQYDLRLQTVGLAVAHDERVVRGDPATRSFSVVYLRQGHVVALDCVNRTKDYVQGRALVVDGTRVDRDRLADA
DTPLKELTAAQG
>O85057 1.14.14.51~~~~~~Limonene hydroxylase~~~
MGSKYAAGHYSCSSYFQSLDIPENRQFVQGMKKRYGQDTVISSVMANTYSGIQMILEAIVHLRSTDRKKILNYLYNKTFP
SPSGNITIESNHHLSREVRIGQANLDGQFDIVWSSEQPIPAKPLMTNTIIDSANEEQIWKYVVESMGEETADGVLVLDQD
QTILYANSAAYSFLRVKQGDILKEEQLREISHQLIKKETSKYGVQLFIFKRAKRGPLLVTKPDKEPYRFGRVVTYNPSFE
KELRTASIASQSDANVLILGETGSGKEVLARTIHEQSPRRNGPFVALNAGAIPRELIASELFGYVEGAFTGARKGGRPGK
FEVADGGTLFLDEIGDMPLELQVNLLRVLEERKVIRIGDHKERPINVRVIAATNRNLKEEIAYRGSFRSDLYYRLNVFTI
HIPPLRDRKEDIETLSLQFLKNFHQHYCGKGTCHLSNSALQLLQSYNWPGNIRELRNVIERAFLLAIDEPEILPIHLPEE
IQNANCAIPPSSVNNLKDVEKKMIEQALKESKSLTEAAKKLGITRSTLYRKIKQWKIHKTTFS
>Q8YK97 1.13.11.61~~~~~~Linolenate 9R-lipoxygenase~~~
MDLNTYLKLLNLLDSESQKIMLELQAMFSAAGLALRGRGTHTDGIIVKGNLTVLHSSDVPSHSLFTPGKKYDVIFRHANI
VGGAKDDALINGRGSAIRIGNIGDDLSKPRLLDLVLNTGEVFGLPTARLYHQFFGSDFHQKSDMLASGSLRRYAVEAALR
NPDSFTELYYHTQLCYEWVDSKKKSRYARFRLLNPNQSTEGGLLDDSVEIGPRLVLPRKRGDTREKNYLRNEFRQRLTDG
NIVEYVLQAQFRSIEDVAVDCSNIWDPNTYPWLDIAAIVLNQDESENDYYQEIAYNPGNTHYDLKLPNSYSVDDFASLGV
SGALVHYFGSIVRAERTQYLYGSKDDLPGKPVYFPLPVTEIPSKRFLFLLEKYNFLTDNSYPSDGEHDKIEALVSAMPTT
ALDLAVGTTDPTDIPDSYFLERRLNGYNPGAIRESSGQEGWTHELTHNLAKYDIKPGLHFPDFVQCRLFVDKQNGVKLHS
IKIDDHEITPCQEQWQYAKRTYLQAEFLSQELKLHLARCHFNIEQYVMAIKRRLAPTHPVRAFINPHLEGLIFINSSAVP
KIIGSTGFIPIASMLTQGSIVDVMKNELSKLSYMWNPIADLPRDIPGDLFTPAATAYWELLNNYVEQGLLQPFEDELRTE
VNAIQVDELFAELKERSLYSGDQPPKYDSSELKSLLMYIIYHSSFLHSWANFKQYDDAGNPNHVSMGDYSQYDQQTQDKI
RFSQRSLTWVLSSIRYNSVAVYGSDLLKQLIREKSSILEPGLPLEDLMMSINI
>Q9ZAG3 3.3.2.8~~~limA~~~Limonene-1,2-epoxide hydrolase~~~
MTSKIEQPRWASKDSAAGAASTPDEKIVLEFMDALTSNDAAKLIEYFAEDTMYQNMPLPPAYGRDAVEQTLAGLFTVMSI
DAVETFHIGSSNGLVYTERVDVLRALPTGKSYNLSILGVFQLTEGKITGWRDYFDLREFEEAVDLPLRG
>Q9EUT9 1.14.13.107~~~limB~~~Limonene 1,2-monooxygenase~~~
MTDDFGRVDFGAFLAPWHRADSDANFAIHQDLELVEHLDRLGFAEFWLGEHHSGGVEIVASPEMFMAAAAQRTQRIKLGL
GVVSLPYHHPFLVADRLVLLDHLSRGRMIFGAGPGQLADDAKMLGIDPIDSRRKMEEAFDVIHRLLAGETVTQKTDWFTC
QDAYLHVAPYSNIQKAVTATVSPTGPKLAGKYGSGILSLAATNPVGVEKLAEHWKIAEDIAAENGQTVDRADWRLSGIMH
VAETEEQARADVRHGLLYLMNYLSNITPGFAAAPDVDSLIDGINDAGLAVIGTPEMAVTQIRRLQEKSGGFGKFLVLHGE
WASTTAALHSFELIAQQVAPHFNGDLGPRLRGYNQTMNSNRSAADITQAAQEEAQKRFEAERAIRTN
>Q9RA05 1.1.1.n4~~~limC~~~(-)-trans-carveol dehydrogenase~~~
MARVEGQVALITGAARGQGRSHAIKLAEEGADVILVDVPNDVVDIGYPLGTADELDQTAKDVENLGRKAIVIHADVRDLE
SLTAEVDRAVSTLGRLDIVSANAGIASVPFLSHDIPDNTWRQMIDINLTGVWHTAKVAVPHILAGERGGSIVLTSSAAGL
KGYAQISHYSAAKHGVVGLMRSLALELAPHRVRVNSLHPTQVNTPMIQNEGTYRIFSPDLENPTREDFEIASTTTNALPI
PWVESVDVSNALLFLVSEDARYITGAAIPVDAGTTLK
>P59766 4.5.1.-~~~linA1~~~Hexachlorocyclohexane dehydrochlorinase 1~~~
MSDLDRLASRAAIQDLYSDQLIGVDKRQEGRLASIWWDDAEWTIEGIGTYKGPEGALDLANNVLWPMYHETIHYGTNLRL
EFVSADKVNGIGDVLCLGNLVEGNQSILIAAVYTNEYERRDGVWKLSKLNGCMNYFTPLAGIHFAPPGALLQKS
>P59767 4.5.1.-~~~linA2~~~Hexachlorocyclohexane dehydrochlorinase 2~~~
MSDLDRLASRAAIQDLYSDKLIAVDKRQEGRLASIWWDDAEWTIEGIGTYKGPEGALDLANNVLWPMFHECIHYGTNLRL
EFVSADKVNGIGDVLLLGNLVEGNQSILIAAVFTDEYERRDGVWKFSKRNACTNYFTPLAGIHFAPPGIHFAPSGA
>P51697 4.5.1.-~~~linA~~~Gamma-hexachlorocyclohexane dehydrochlorinase~~~COG3631
MSDLDRLASRAAIQDLYSDKLIAVDKRQEGRLASIWWDDAEWTIEGIGTYKGPEGALDLANNVLWPMFHECIHYGTNLRL
EFVSADKVNGIGDVLLLGNLVEGNQSILIAAVFTDEYERRDGVWKFSKRNACTNYFTPLAGIHFAPPGIHFAPSGA
>P06107 ~~~linA~~~Lincosamide resistance protein~~~
MKNNNVTEKELFYILDLFEHMKVTYWLDGGWGVDVLTGKQQREHRDIDIDFDAQHTQKVIQKLEDIGYKIEVHWMPSRME
LKHEEYGYLDIHPINLNDDGSITQANPEGGNYVFQNDWFSETNYKDRKIPCISKEAQLLFHSGYDLTETDHFDIKNLKSI
T
>D4Z2G1 3.8.1.5~~~linB~~~Haloalkane dehalogenase~~~COG0596
MSLGAKPFGEKKFIEIKGRRMAYIDEGTGDPILFQHGNPTSSYLWRNIMPHCAGLGRLIACDLIGMGDSDKLDPSGPERY
AYAEHRDYLDALWEALDLGDRVVLVVHDWGSALGFDWARRHRERVQGIAYMEAIAMPIEWADFPEQDRDLFQAFRSQAGE
ELVLQDNVFVEQVLPGLILRPLSEAEMAAYREPFLAAGEARRPTLSWPRQIPIAGTPADVVAIARDYAGWLSESPIPKLF
INAEPGALTTGRMRDFCRTWPNQTEITVAGAHFIQEDSPDEIGAAIAAFVRRLRPA
>D4YYG1 1.1.1.-~~~linC~~~2,5-dichloro-2,5-cyclohexadiene-1,4-diol dehydrogenase~~~COG1028
MSDLSGKTIIVTGGGSGIGRATVELLVASGANVAVADINDEAGEAVVAASGGKAAYFRCDIAQEEDVKALVAQTLAAFGG
LDGAFNNAAIPQAGLPLAEVSLERFRQSMDINVTGTFLCMKYQILAMIERGTKGSIVNTASAAGVVGVPMHGEYVGAKHA
VVGLTRVAAADYGKHGIRVNALVPGAVRTPMLQRAMDNDAGLEPYLNSIHPIGRFSEPHEQAQAAVWLLSDAASFVTGSC
LAADGGFTAI
>D4Z909 2.5.1.-~~~linD~~~2,5-dichlorohydroquinone reductive dechlorinase~~~
MSADTETLARKVREEVIKPEQSTLISPDRQSPSLLRREATVEPRFELFHFVFSVCSQKVRGTLMEKGVTFGSNELTILPP
QNENYCPQYVRLRLRSEAAAKHRPVSSFTGQSSVDSEGFDPLVVPTLVDHETGRILADSKAICLYLCDALSGGTDLLPAD
IREAVLKQVQLADTTPHVALLYGADPDGDRRPESMQAVMPGIHAHKIDAVRRNIPLADGDPLLLEAYQHKIVKEEAAASF
VINEPQMRTAISKAEQLVTDLDRDLGASTGPWLFGDRFTLADLFWAVSLYRFLWLGYSGFWKDGAGKPRVEAYANRLFAR
PSVKDAIIQWPGHPPSENVIHLLSNA
>Q9WXE6 1.13.11.66~~~linE~~~Chlorohydroquinone/hydroquinone 1,2-dioxygenase~~~
MMQLPERVEGLHHITVATGSAQGDVDLLVKTLGQRLVKKTMFYDGARPVYHLYFGNELGEPGTLYTTFPVRQAGYTGKRG
AGQISAVSYNAPVGTLSWWQEHLIKRAVTVSEVRERFGQKYLSFEHPDCGVGFEIIEQDTDGQFEPWDSPYVPKEVALRG
FHSWTATLNRNEEMDSFMRNAWNLKPQGRDGNYQRYAFGNGGAAKVLDVYIDEDERPGTWALGEGQVHHAAFEVADLDVQ
AALKFDVEGLGYTDFSDRKHRGYFESIYVRTPGGVLFEASVTLGFTHDESPEKLGSEVKVAPQLEGVKDELLRTMNDPIV
I
>Q5W9E3 1.3.1.-~~~linF~~~Maleylacetate reductase~~~COG1454
MQFVYDPLPYRVIFGAGSVRRVADELSHVGSRALVLSTPEQAGSAQELAATLGDKAVGLFSKAVMHVPVATVDAAAAVAR
ELDADCTVAIGGGSTVGLAKALSLRLDLPSLVVPTTYAGSEVTPIWGLTEDGIKTTGRDKKVLPKVVVYDPDLTLSLPAE
MSIASGLNAIAHAMEGLYAFDGNPIVSLMAEESIRALARSLPLIKADPTDAKARGDALYGCWLAGSVLGAASVALHHKLC
HTLGGTFDMPHAQTHTAVLPHAIAYNAPSVPEAMERASRALGGGDPATKLYELAVGLGAEMSLAKLGMPKDGIAKAAALA
VANPYPNPRPITEEGIVQLLSRAVEGLPPITA
>D4Z260 1.1.1.-~~~linX~~~2,5-dichloro-2,5-cyclohexadiene-1,4-diol dehydrogenase LinX~~~COG1028
MANRLAGKVALITGGASGLGAAQAKRFAEEGAKVVIGDLNEEMAKGVVAEIRAAGGDALFIRLDVTDAASWNNAIAAAVE
AFGGLTTLSNTAGIIHPGGFEEESIEGWNKMVAVNQTAIFLGIKAAIPELVKSGNGSIINISSLIGMFPTAGNASYCATK
AAVRIMSKAAALEFVDRGVRVNTIVPGGMNTPITANVPPDVLKQQTSQIPMGKLGDPIDIANGALFLASDEAKYITGVDL
PIDGGWSVGV
>P0DV83 ~~~~~~Outer membrane lipoprotein BBA14~~~
MQIKNFPFLFLLNSLIIFSCSTIASLPEEPSSPQESTLKALSLYEAHLSSYIMYLQTFLVKTKQKVNNKNYPEFTLFDTS
KLKKDQTLKSIKTNIAALKNHIDKIKPIAMQIYKKYSKNIP
>P19833 3.1.1.3~~~lip1~~~Lipase 1~~~
MFIMIKKSELAKAIIVTGALVFSIPTLAEVTLSETTVSSIKSEATVSSTKKALPATPSDCIADSKITAVALSDTRDNGPF
SIRTKRISRQSAKGFGGGTIHYPTNASGCGLLGAIAVVPGYVSYENSIKWWGPRLASWGFVVITINTNSIYDDPDSRAAQ
LNAALDNMIADDTVGSMIDPKRLGAIGWSMGGGGALKLATERSTVRAIMPLAPYHDKSYGEVKTPTLVIACEDDRIAETK
KYANAFYKNAIGPKMKVEVNNGSHFCPSYRFNEILLSKPGIAWMQRYINNDTRFDKFLCANENYSKSPRISAYDYKDCP
>P40601 3.1.1.3~~~lip-1~~~Lipase 1~~~
MKRSFIFAPGMLALSISAISNAHAYNNLYVFGDSLSDGGNNGRYTVDGINGTESKLYNDFIAQQLGIELVNSKKGGTNYA
AGGATAVADLNNKHNTQDQVMGYLASHSNRADHNGMYVHWIGGNDVDAALRNPADAQKIITESAMAASSQVHALLNAGAG
LVIVPTVPDVGMTPKIMEFVLSKGGATSKDLAKIHAVVNGYPTIDKDTRLQVIHGVFKQIGSDVSGGDAKKAEETTKQLI
DGYNELSSNASKLVDNYNQLEDMALSQENGNIVRVDVNALLHEVIANPLRYGFLNTIGYACAQGVNAGSCRSKDTGFDAS
KPFLFADDFHPTPEAHHIVSQYTVSVLNAPYRVMLLTNANNVPVKGALASLDGRLQQLRNVDNEQGKLGVFGGYSGNHSH
TLTLGSDYQIMDNILLGGMISRYQDNSSPADNFHYDGRGYVFTAYGLWRYYDKGWISGDLHYLDMKYEDITRGIVLNDWL
RKENASTSGHQWGGRITAGWDIPLTSAVTTSPIIQYAWDKSYVKGYRESGNNSTAMHFGEQRYDSQVGTLGWRLDTNFGY
FNPYAEVRFNHQFGDKRYQIRSAINSTQTSFVSESQKQDTHWREYTIGMNAVITKDWGAFASISRNDGDVQNHTYSFSLG
VNASF
>Q02104 3.1.1.3~~~lip1~~~Lipase 1~~~
MLLKRLCFAALFSLSMVGCTNAPNALAVNTTQKIIQYERNKSDLEIKSLTLASGDKMVYAENGNVAGEPLLLIHGFGGNK
DNFTRIARQLEGYHLIIPDLLGFGESSKPMSADYRSEAQRTRLHELLQAKGLASNIHVGGNSMGGAISVAYAAKYPKDVK
SLWLVDSAGFWSAGIPKSLEGATLENNPLLIKSNEDFYKMYDFVMYKPPYLPKSVKAVFAQERIKNKELDAKILEQIVTD
NVEERAKIIAQYKIPTLVVWGDKDQIIKPETVNLIKKIIPQAQVIMMEDVGHVPMVEALDETADNYKAFRSILEAQR
>Q2FUU5 3.1.1.3~~~lipA~~~Lipase 1~~~COG1075
MKSQNKYSIRKFSVGASSILIATLLFLSGGQAQAAEKQVNMGNSQEDTVTAQSIGDQQTRENANYQRENGVDEQQHTENL
TKNLHNDKTISEENHRKTDDLNKDQLKDDKKSSLNNKNIQRDTTKNNNANPSDVNQGLEQAINDGKQSKVASQQQSKEAD
NSQDSNANNNLPSQSRIKEAPSLNKLDQTSQREIVNETEIEKVQPQQNNQANDKITNYNFNNEQEVKPQKDEKTLSVSDL
KNNQKSPVEPTKDNDKKNGLNLLKSSAVATLPNKGTKELTAKAKDDQTNKVAKQGQYKNQDPIVLVHGFNGFTDDINPSV
LAHYWGGNKMNIRQDLEENGYKAYEASISAFGSNYDRAVELYYYIKGGRVDYGAAHAAKYGHERYGKTYEGIYKDWKPGQ
KVHLVGHSMGGQTIRQLEELLRNGNREEIEYQKKHGGEISPLFKGNHDNMISSITTLGTPHNGTHASDLAGNEALVRQIV
FDIGKMFGNKNSRVDFGLAQWGLKQKPNESYIDYVKRVKQSNLWKSKDNGFYDLTREGATDLNRKTSLNPNIVYKTYTGE
ATHKALNSDRQKADLNMFFPFVITGNLIGKATEKEWRENDGLVSVISSQHPFNQAYTKATDKIQKGIWQVTPTKHDWDHV
DFVGQDSSDTVRTREELQDFWHHLADDLVKTEKLTDTKQA
>Q9S2A5 3.1.1.3~~~~~~Lipase 1~~~COG2755
MRRFRLVGFLSSLVLAAGAALTGAATAQAAQPAAADGYVALGDSYSSGVGAGSYISSSGDCKRSTKAHPYLWAAAHSPST
FDFTACSGARTGDVLSGQLGPLSSGTGLVSISIGGNDAGFADTMTTCVLQSESSCLSRIATAEAYVDSTLPGKLDGVYSA
ISDKAPNAHVVVIGYPRFYKLGTTCIGLSETKRTAINKASDHLNTVLAQRAAAHGFTFGDVRTTFTGHELCSGSPWLHSV
NWLNIGESYHPTAAGQSGGYLPVLNGAA
>P24484 3.1.1.3~~~lip2~~~Lipase 2~~~
MPILPVPALNALLTKTIKTIKTGAAKNAHQHHVLHHTLKGLDNLPAPVLERINRRLKASTAEQYPLADAHLRLILAISNK
LKRPLAIDKLPKLRQKFGTDAVSLQAPSVWQQNADASGSTENAVSWQDKTIANADGGDMTVRCYQKSTQNSERKSTDEAA
MLFFHGGGFCIGDIDTHHEFCHTVCAQTGWAVVSVDYRMAPEYPAPTALKDCLAAYAWLAEHSQSLGASPSRIVLSGDSA
GGCLAALVAQQVIKPIDALWQDNNQAPAADKKVNDTFKNSLADLPRPLAQLPLYPVTDYEAEYPSWELYGEGLLLDHNDA
EVFNSAYTQHSGLPQSHPLISVMHGDNTQLCPSYIVVAELDILRDEGLAYAELLQKEGVQVQTYTVLGAPHGFINLMSVH
QGLGNQTTYIINEFACLVQNLLTSEGDKPNLRA
>Q2G155 3.1.1.3~~~lip2~~~Lipase 2~~~COG1075
MLRGQEERKYSIRKYSIGVVSVLAATMFVVSSHEAQASEKTSTNAAAQKETLNQPGEQGNAITSHQMQSGKQLDDMHKEN
GKSGTVTEGKDTLQSSKHQSTQNSKTIRTQNDNQVKQDSERQGSKQSHQNNATNNTERQNDQVQNTHHAERNGSQSTTSQ
SNDVDKSQPSIPAQKVIPNHDKAAPTSTTPPSNDKTAPKSTKAQDATTDKHPNQQDTHQPAHQIIDAKQDDTVRQSEQKP
QVGDLSKHIDGQNSPEKPTDKNTDNKQLIKDALQAPKTRSTTNAAADAKKVRPLKANQVQPLNKYPVVFVHGFLGLVGDN
APALYPNYWGGNKFKVIEELRKQGYNVHQASVSAFGSNYDRAVELYYYIKGGRVDYGAAHAAKYGHERYGKTYKGIMPNW
EPGKKVHLVGHSMGGQTIRLMEEFLRNGNKEEIAYHKAHGGEISPLFTGGHNNMVASITTLATPHNGSQAADKFGNTEAV
RKIMFALNRFMGNKYSNIDLGLTQWGFKQLPNESYIDYIKRVSKSKIWTSDDNAAYDLTLDGSAKLNNMTSMNPNITYTT
YTGVSSHTGPLGYENPDLGTFFLMATTSRIIGHDAREEWRKNDGVVPVISSLHPSNQPFVNVTNDEPATRRGIWQVKPII
QGWDHVDFIGVDFLDFKRKGAELANFYTGIINDLLRVEATESKGTQLKAS
>Q7A7P2 3.1.1.3~~~lip2~~~Lipase 2~~~
MLRGQEERKYSIRKYSIGVVSVLAATMFVVSSHEAQASEKTPTSNAAAQKETLNQPGEQGNAITSHQMQSGKQLDDMHKE
NGKSGTVTEGKDTLQSSKHQSTQNSKTIRTQNDNQVKQDSERQGSKQSHQNNATNNTERQNDQVQNTHHAERNGSQSTTS
QSNDVDKSQPSIPAQKVLPNHDKAAPTSTTPPSNDKTAPKSTKAQDATTDKHPNQQDTHQPAHQIIDAKQDDTVRQSEQK
PQVGDLSKHIDGQNSPEKPTDKNTDNKQLIKDALQAPKTRSTTNAAADAKKVRPLKANQVQPLNKYPVVFVHGFLGLVGD
NAPALYPNYWGGNKFKVIEELRKQGYNVHQASVSAFGSNYDRAVELYYYIKGGRVDYGAAHAAKYGHERYGKTYKGIMPN
WEPGKKVHLVGHSMGGQTIRLMEEFLRNGNKEEIAYHKAHGGEISPLFTGGHNNMVASITTLATPHNGSQAADKFGNTEA
VRKIMFALNRFMGNKYSNIDLGLTQWGFKQLPNESYIDYIKRVSKSKIWTSDDNAAYDLTLDGSAKLNNMTSMNPNITYT
TYTGVSSHTGPLGYENPDLGTFFLMDTTSRIIGHDAREEWRKNDGVVPVISSLHPSNQPFINVTNDEPATRRGIWQVKPI
IQGWDHVDFIGVDFLDFKRKGAELANFYTGIINDLLRVEATESKGTQLKAS
>P10335 3.1.1.3~~~lip2~~~Lipase 2~~~
MLRGQEERKYSIRKYSIGVVSVLAATMFVVSSHEAQASEKTSTNAAAQKETLNQPGEQGNAITSHQMQSGKQLDDMHKEN
GKSGTVTEGKDTLQSSKHQSTQNSKTIRTQNDNQVKQDSERQGSKQSHQNNATNNTERQNDQVQNTHHAERNGSQSTTSQ
SNDVDKSQPSIPAQKVIPNHDKAAPTSTTPPSNDKTAPKSTKAQDATTDKHPNQQDTHQPAHQIIDAKQDDTVRQSEQKP
QVGDLSKHIDGQNSPEKPTDKNTDNKQLIKDALQAPKTRSTTNAAADAKKVRPLKANQVQPLNKYPVVFVHGFLGLVGDN
APALYPNYWGGNKFKVIEELRKQGYNVHQASVSAFGSNYDRAVELYYYIKGGRVDYGAAHAAKYGHERYGKTYKGIMPNW
EPGKKVHLVGHSMGGQTIRLMEEFLRNGNKEEIAYHKAHGGEISPLFTGGHNNMVASITTLATPHNGSQAADKFGNTEAV
RKIMFALNRFMGNKYSNIDLGLTQWGFKQLPNESYIDYIKRVSKSKIWTSDDNAAYDLTLDGSAKLNNMTSMNPNITYTT
YTGVSSHTGPLGYENPDLGTFFLMATTSRIIGHDAREEWRKNDGVVPVISSLHPSNQPFVNVTNDEPATRRGIWQVKPII
QGWDHVDFIGVDFLDFKRKGAELANFYTGIINDLLRVEATESKGTQLKAS
>Q93J06 3.1.1.3~~~~~~Lipase 2~~~COG2755
MPKPALRRVMTATVAAVGTLALGLTDATAHAAPAQATPTLDYVALGDSYSAGSGVLPVDPANLLCLRSTANYPHVIADTT
GARLTDVTCGAAQTADFTRAQYPGVAPQLDALGTGTDLVTLTIGGNDNSTFINAITACGTAGVLSGGKGSPCKDRHGTSF
DDEIEANTYPALKEALLGVRARAPHARVAALGYPWITPATADPSCFLKLPLAAGDVPYLRAIQAHLNDAVRRAAEETGAT
YVDFSGVSDGHDACEAPGTRWIEPLLFGHSLVPVHPNALGERRMAEHTMDVLGLD
>P24640 3.1.1.3~~~lip3~~~Lipase 3~~~
MLLKRLGLAALFSLSMVGCTTAPNTLAVNTTQKIIQYERSKSDLEVKSLTLASGDKMVYAENDNVTGEPLLLIHGFGGNK
DNFTRIADKLEGYHLIIPDLLGFGNSSKPMTADYRADAQATRLHELMQAKGLASNTHVGGNSMGGAISVAYAAKYPKEIK
SLWLVDTAGFWSAGVPKSLEGATLENNPLLINSKEDFYKMYDFVMYKPPYIPKSVKAVFAQERINNKALDTKILEQIVTD
NVEERAKIIAKYNIPTLVVWGDKDQVIKPETTELIKEIIPQAQVIMMNDVGHVPMVEAVKDTANDYKAFRDGLKK
>Q8DLC2 2.8.1.8~~~lipA2~~~Lipoyl synthase 2~~~COG0320
MALSRPLPSWLRKPLGKASEISTVQRLVRQYGIHTICEEGRCPNRGECYGQKTATFLLLGPTCTRACAFCQVEKGHAPAA
VDPEEPTKIAAAVATLGLRYVVLTSVARDDLPDQGAGQFVATMAAIRQRCPGTEIEVLSPDFRMDRGRLSQRDCIAQIVA
AQPACYNHNLETVRRLQGPVRRGATYESSLRVLATVKELNPDIPTKSGLMLGLGETEAEIIETLKDLRRVGCDRLTLGQY
LPPSLSHLPVVKYWTPEEFNTLGNIARELGFSHVRSGPLVRSSYHAAEGG
>O32129 2.8.1.8~~~lipA~~~Lipoyl synthase~~~COG0320
MAKKDEHLRKPEWLKIKLNTNENYTGLKKLMRENNLHTVCEEAKCPNIHECWAVRRTATFMILGSVCTRACRFCAVKTGL
PTELDLQEPERVADSVALMNLKHAVITAVARDDQKDGGAGIFAETVRAIRRKSPFTTIEVLPSDMGGNYDNLKTLMDTRP
DILNHNIETVRRLTPRVRARATYDRSLEFLRRAKEMQPDIPTKSSIMIGLGETKEEIIEVMDDLLANNVDIMAIGQYLQP
TKKHLKVQKYYHPDEFAELKEIAMQKGFSHCEAGPLVRSSYHADEQVNEASKKRQAQA
>P60716 2.8.1.8~~~lipA~~~Lipoyl synthase~~~COG0320
MSKPIVMERGVKYRDADKMALIPVKNVATEREALLRKPEWMKIKLPADSTRIQGIKAAMRKNGLHSVCEEASCPNLAECF
NHGTATFMILGAICTRRCPFCDVAHGRPVAPDANEPVKLAQTIADMALRYVVITSVDRDDLRDGGAQHFADCITAIREKS
PQIKIETLVPDFRGRMDRALDILTATPPDVFNHNLENVPRIYRQVRPGADYNWSLKLLERFKEAHPEIPTKSGLMVGLGE
TNEEIIEVMRDLRRHGVTMLTLGQYLQPSRHHLPVQRYVSPDEFDEMKAEALAMGFTHAACGPFVRSSYHADLQAKGMEV
K
>P9WK91 2.8.1.8~~~lipA~~~Lipoyl synthase~~~COG0320
MSVAAEGRRLLRLEVRNAQTPIERKPPWIKTRARIGPEYTELKNLVRREGLHTVCEEAGCPNIFECWEDREATFLIGGDQ
CTRRCDFCQIDTGKPAELDRDEPRRVADSVRTMGLRYATVTGVARDDLPDGGAWLYAATVRAIKELNPSTGVELLIPDFN
GEPTRLAEVFESGPEVLAHNVETVPRIFKRIRPAFTYRRSLGVLTAARDAGLVTKSNLILGLGETSDEVRTALGDLRDAG
CDIVTITQYLRPSARHHPVERWVKPEEFVQFARFAEGLGFAGVLAGPLVRSSYRAGRLYEQARNSRALASR
>P26504 3.1.1.3~~~~~~Lipase~~~
MGVFDYKNLGTEASKTLFADATAITLYTYHNLDNGFAVGYQQHGLGLGCRHTGRGVARQHRLPGSDPPAFPGILTRKRPP
WTRCTQPVGRQSSASALGYGGKVDARGTFFGEKAGYTTAQAEVLGKYDDAGKLLEIGIGFRGTSGPRESLITTPCRSGQR
PARRAGPQGLCEKLCRRTFGGLLKTVADYAGAHGLSGKDVLVSGHSLGGLAVNSMADLSTSKWAGFYKDANYLAYASPTQ
SAGDKVLNIGYENDPVFRALDGSTFNLSSLGVHDKAHESTTDNIVSFNDHYASTLWNVLPFSIANLSTWVSHLPSAYGDG
MTRVLESGFYEQMTRDSTIILCPTWSDPARANTWVQDLNRNAEPHTGNTFIIGSDGNDLIQGGKGADFIEGGKGNDTIRD
NSGHNTFLFSGHFGQDRIIGYQPTGWCSRAPTAAPTCATTRRPWGPIRC
>P65286 2.8.1.8~~~lipA~~~Lipoyl synthase~~~
MATKNEEILRKPDWLKIKLNTNENYTGLKKMMREKNLNTVCEEAKCPNIHECWGARRTATFMILGAVCTRACRFCAVKTG
LPNELDLNEPERVAESVELMNLKHVVITAVARDDLRDAGSNVYAETVRKVRERNPFTTIEILPSDMGGDYDALETLMASR
PDILNHNIETVRRLTPRVRARATYDRTLEFLRRSKELQPDIPTKSSIMVGLGETIEEIYETMDDLRANDVDILTIGQYLQ
PSRKHLKVQKYYTPLEFGKLRKVAMDKGFKHCQAGPLVRSSYHADEQVNEAAKEKQRQGEAQLNS
>P60720 2.3.1.181~~~lipB~~~Octanoyltransferase~~~COG0321
MYQDKILVRQLGLQPYEPISQAMHEFTDTRDDSTLDEIWLVEHYPVFTQGQAGKAEHILMPGDIPVIQSDRGGQVTYHGP
GQQVMYVLLNLKRRKLGVRELVTLLEQTVVNTLAELGIEAHPRADAPGVYVGEKKICSLGLRIRRGCSFHGLALNVNMDL
SPFLRINPCGYAGMEMAKISQWKPEATTNNIAPRLLENILALLNNPDFEYITA
>P9WK83 2.3.1.181~~~lipB~~~Octanoyltransferase~~~COG0321
MTGSIRSKLSAIDVRQLGTVDYRTAWQLQRELADARVAGGADTLLLLEHPAVYTAGRRTETHERPIDGTPVVDTDRGGKI
TWHGPGQLVGYPIIGLAEPLDVVNYVRRLEESLIQVCADLGLHAGRVDGRSGVWLPGRPARKVAAIGVRVSRATTLHGFA
LNCDCDLAAFTAIVPCGISDAAVTSLSAELGRTVTVDEVRATVAAAVCAALDGVLPVGDRVPSHAVPSPL
>Q5SLQ3 2.3.1.181~~~lipB~~~Octanoyltransferase~~~COG0321
MEFLVEDLGLVPYGEAWAYQKRVHREVVAGNRPPTLLLLEHPRVITLGRKATGENLLFPESWYRENGFELYWVERGGDVT
YHGPGQLVGYPIFPVGREVRRFLRQIEEAIVRVAAGYGISAYPTPGYAGVWVGEDKLCAIGVAVKEGVSFHGFALNVNTD
LNDFTVIVPCGLKGKGVTSLEKLLGRKVPMEEAKARVVAAFAEVFGLRPVEGSVHEA
>Q65NA4 3.-.-.-~~~lipC~~~Spore germination lipase LipC~~~COG2755
MTLQYTALGDSLTVGVGAGLFEPGFVQRYKRKMEEDLNEEVSLIVFAKSGLETSEILAMLNEPFIMEQVKKADVITITGC
GNDLLQSLEIYEKEKDEHVFLEASSHCQKNYSGMLEKIREIKGEKDTRYLVRLLNLYNPFPSIELADKWISGFNRHLKQL
ESAPQIKVIDTYAVFKGREKEYLSIDRVHPSSRGYEAMSEKLRAAGYGRLEG
>P42969 3.-.-.-~~~lipC~~~Spore germination lipase LipC~~~COG2755
MVLRYTALGDSLTTGRGSGLFSPGFVQRFGDMMEADLKTRTAINIFARSGLNTEEILGLLSYPYVQKCIRDADMITITGC
GNDLIDSVLAYQTSKDETIFSRVSAHCHENFEKMIAKVAEIKGENPSPYAIRVFNLYNPFPEIDIAGKWITSFNSHLETL
ASAPHVKIADAYSIFKGKEQEYLSFDGVHPNSKGYQAMAEAVHKLGYKELSVS
>P96402 3.1.1.-~~~lipC~~~Esterase LipC~~~COG0657
MNQRRAAGSTGVAYIRWLLRARPADYMLALSVAGGSLPVVGKHLKPLGGVTAIGVWGARHASDFLSATAKDLLTPGINEV
RRRDRASTQEVSVAALRGIVSPDDLAVEWPAPERTPPVCGALRHRRYVHRRRVLYGDDPAQLLDVWRRKDMPTKPAPVLI
FVPGGAWVHGSRAIQGYAVLSRLAAQGWVCLSIDYRVAPHHRWPRHILDVKTAIAWARANVDKFGGDRNFIAVAGCSAGG
HLSALAGLTANDPQYQAELPEGSDTSVDAVVGIYGRYDWEDRSTPERARFVDFLERVVVQRTIDRHPEVFRDASPIQRVT
RNAPPFLVIHGSRDCVIPVEQARSFVERLRAVSRSQVGYLELPGAGHGFDLLDGARTGPTAHAIALFLNQVHRSRAQFAK
EVI
>P40600 3.1.1.3~~~lip~~~Extracellular lipase~~~
MKKKLIYAAVVSALLAGCGGSDDNKGDTSSYLDYLLTGSNAVGPSALAARAWDGTLKFSTETADLSNPVSAMSTLDGWST
TQAIQIVPVTSSGITVQAPTTAEFGASVAPLYLLEVTFDSTALRPSGVKKVLTYGVDFVVAASAWQAEPGSAQAVEPLPC
LANDSGHRTAERQSRRCLKAGSDYGNYKNNAGSNAQEQTINGLIALQEGLFKAATGIATDHVIFSDWFGTQSGADVLVAV
KGAAASVLKADPVTLDAAKLWKQDAWEHQPARHLYPGRDRPTCLPDPAGCRAVPAAEQKDAIATAFGPVLRSTRLLKRPR
SIPVPSSCLTSSPHRRPQVPGARPRPSPGTVPSQPVRHRQCAEGVTRSDRRAGGGGRGSGPAGDADCRSDPPERAAGRGE
QADWGDAHLRRQAAGRRAEHWSLQPAADAGRGAIRADACLRQGCPQHHHGCHHLSARRDLGQRERLRPGAGPDLEDLCRH
AGGQEGGAGGDRSSAARRAWLRLSGSMDTVTTSDNPTPYLNLSYLTVARDNLKQSVAICWACVWRLAWPTPRAIGTAGSL
KVHFLGHSLGASRVPTCCGRQPDHRQRASGCPVQVRYRWPGHAGSHSAAAAELADFGPTIKMGVLTSGSAELKAGFTAYA
PNCTDGGAYLLRQRVPAEPGRGHSATAATRCRVQLCGPVGAGFG
>Q7D5F9 3.1.1.1~~~lipF~~~Carboxylesterase/phospholipase LipF~~~
MSSYYARRPLQSSGCSNSDSCWDGAPIEITESGPSVAGRLAALASRMTIKPLMTVGSYLSPLPLPLGFVDFACRVWRPGQ
GTVRTTINLPNATAQLVRAPGVRAADGAGRVVLYLHGGAFVMCGPNSHSRIVNALSGFAESPVLIVDYRLIPKHSLGMAL
DDCHDAYQWLRARGYRPEQIVLAGDSAGGYLALALAQRLQCDDEKPAAIVAISPLLQLAKGPKQDHPNIGTDAMFPARAF
DALAAWVRAAAAKNMVDGRPEDLYEPLDHIESSLPPTLIHVSGSEVLLHDAQLGAGKLAAAGVCAEVRVWPGQAHLFQLA
TPLVPEATRSLRQIGQFIRDATADSSLSPVHRSRYVAGSPRAASRGAFGQSPI
>O06350 3.1.1.1~~~lipF~~~Carboxylesterase/phospholipase LipF~~~COG0657
MSSYYARRPLQSSGCSNSDSCWDGAPIEITESGPSVAGRLAALASRMTIKPLMTVGSYLSPLPLPLGFVDFACRVWRPGQ
GTVRTTINLPNATAQLVRAPGVRAADGAGRVVLYLHGGAFVMCGPNSHSRIVNALSGFAESPVLIVDYRLIPKHSLGMAL
DDCHDAYQWLRARGYRPEQIVLAGDSAGGYLALALAQRLQCDDEKPAAIVAISPLLQLAKGPKQDHPNIGTDAMFPARAF
DALAAWVRAAAAKNMVDGRPEDLYEPLDHIESSLPPTLIHVSGSEVLLHDAQLGAGKLAAAGVCAEVRVWPGQAHLFQLA
TPLVPEATRSLRQIGQFIRDATADSSLSPVHRSRYVAGSPRAASRGAFGQSPI
>P71668 3.1.1.-~~~lipI~~~Esterase LipI~~~COG0657
MPSLDNTADEKPAIDPILLKVLDAVPFRLSIDDGIEAVRQRLRDLPRQPVHPELRVVDLAIDGPAGPIGTRIYWPPTCPD
QAEAPVVLYFHGGGFVMGDLDTHDGTCRQHAVGADAIVVSVDYRLAPEHPYPAAIEDAWAATRWVAEHGRQVGADLGRIA
VAGDSAGGTIAAVIAQRARDMGGPPIVFQLLWYPSTLWDQSLPSLAENADAPILDVKAIAAFSRWYAGEIDLHNPPAPMA
PGRAENLADLPPAYIAVAGYDPLRDDGIRYGELLAAAGVPVEVHNAQTLVHGYVGYAGVVPAATEATNRGLVALRVVLHG
>O07732 ~~~lipJ~~~Bifunctional lipase/adenylate cyclase LipJ~~~COG2114
MAQAPHIHRTRYAKCGDMDIAYQVLGDGPTDLLVLPGPFVPIDSIDDEPSLYRFHRRLASFSRVIRLDHRGVGLSSRLAA
ITTLGPKFWAQDAIAVMDAVGCEQATIFAPSFHAMNGLVLAADYPERVRSLIVVNGSARPLWAPDYPVGAQVRRADPFLT
VALEPDAVERGFDVLSIVAPTVAGDDVFRAWWDLAGNRAGPPSIARAVSKVIAEADVRDVLGHIEAPTLILHRVGSTYIP
VGHGRYLAEHIAGSRLVELPGTDTLYWVGDTGPMLDEIEEFITGVRGGADAERMLATIMFTDIVGSTQHAAALGDDRWRD
LLDNHDTIVCHEIQRFGGREVNTAGDGFVATFTSPSAAIACADDIVDAVAALGIEVRIGIHAGEVEVRDASHGTDVAGVA
VHIGARVCALAGPSEVLVSSTVRDIVAGSRHRFAERGEQELKGVPGRWRLCVLMRDDATRTR
>Q8Y489 2.3.1.200~~~lipL~~~Lipoyl-[GcvH]:protein N-lipoyltransferase~~~COG0095
MNIENTLLKQDVWRFIDNTTINPAFDAIQSFATDDTLCRSVGARMAPSTVRGWVHEKTVSLGIQDSKLPDIDKGIAFLQK
QGYRVVVRNSGGLAVVLDSGVLNLSMVLPDAERGIAIERGYETMFTLIKDMFVDCNEVIEAKEIEDSYCPGSYDLSIQGK
KFAGISQRRMAKGVAVQIYLAIDGDQTTRSELIRDFYTISGKAKQTKYTFPDVNPNVMGSLSDLMKNDISLNGTLVRLFN
SLRHYAGDLVSGTLTSEELDLFPAYYERLIARNDKVLT
>P39648 2.3.1.204~~~lipL~~~Octanoyl-[GcvH]:protein N-octanoyltransferase~~~COG0095
MANQPIDLLMQPKWRVIDQSSLGPLFDAKQSFAMDDTLCMSVGKGVSPATARSWVHHDTIVLGIQDTRLPFLQDGISLLE
SEGYRVIVRNSGGLAVVLDDGVLNISLIFEDEKKGIDIDKGYEAMVELMRRMLRPYNAKIEAYEIEGSYCPGSYDLSING
KKFAGISQRRVRGGVAVQIYLCADKSGSERADLIRRFYQAALKDKQNDKKGVYPEIRPETMASLSELLQKDISVQDLMFA
LLTELKALSTHLYSAGLSIDEEMEFEKNLVRMAERNAKVFG
>Q9K6A7 2.3.1.204~~~lipL~~~Octanoyl-[GcvH]:protein N-octanoyltransferase~~~COG0095
MSLLLQQHLSQPWRFLDHTSFGPTFQALQSFAYDDTLCTSIGKSQSPPTLRAWVHHNTVVLGIQDSRLPQIKAGIEALKG
FQHDVIVRNSGGLAVVLDSGILNLSLVLKEEKGFSIDDGYELMYELICSMFQDHREQIEAREIVGSYCPGSYDLSIDGKK
FAGISQRRIRGGVAVQIYLCVSGSGAERAKMIRTFYDKAVAGQPTKFVYPRIKPETMASLSELLGQPHNVSDVLLKALMT
LQQHGASLLTESLSADEWLLYEQHFARISERNEKLLAE
>P71778 3.1.1.-~~~lipL~~~Esterase/beta-lactamase LipL~~~COG1680
MMVDTGVDHRAVSSHDGPDAGRRVFGAADPRFACVVRAFASMFPGRRFGGGALAVYLDGQPVVDVWKGWADRAGWVPWSA
DSAPMVFSATKGMTATVIHRLADRGLIDYEAPVAEYWPAFGANGKATLTVRDVMRHQAGLSGLRGATQQDLLDHVVMEER
LAAAVPGRLLGKSAYHALTFGWLMSGLARAVTGKDMRLLFREELAEPLDTDGLHLGRPPADAPTRVAEIIMPQDIAANAV
LTCAMRRLAHRFSGGFRSMYFPGAIAAVQGEAPLLDAEIPAANGVATARALARMYGAIANGGEIDGIRFLSRELVTGLTR
NRRQVLPDRNLLVPLNFHLGYHGMPIGNVMPGFGHVGLGGSIGWTDPETGVAFALVHNRLLSPLVMTDHAGFVGIYHLIR
QAAAQARKRGYQPVTPFGAPYSEPGAAAG
>D4QF25 1.14.11.49~~~lipL~~~Uridine-5'-phosphate dioxygenase~~~
MQLMKSSYLELTARGHVTDLLKPDDTLEMLETYGFAVTQSPVEAVSTAHAYREIAAIREDFGLGEPYVPLLYRDRDEPTV
TAVTRKGGSDHPVFHTGEAQGWHTDGLLEDIGTIKTTLLYCVSPAHRGGRTFLLNAGRVFEELRMEDPEAADVLLRDTIL
GRRSTIPGVDREAVGPVFLELGDGHYATRYGEGRVERWYPADAAEQHALDRALRFFRARRDDPDVRIDLLLRAGQCLIFR
NDVLAHGRENFTDDPQRPRLLLRSLHTNAPKKPS
>P54511 2.3.1.181~~~lipM~~~Octanoyltransferase LipM~~~COG0095
MQKETWRFIDSGNASPAFNMALDEALLYWHSEKKIPPVIRFYGWNPATLSVGYFQNIKKEINFEAVHKYNLGFVRRPTGG
RGVLHDQELTYSVIVSEEHPEMPATVTEAYRVISEGILQGFRNLGLDAYFAIPRTEKEKESLKNPRSSVCFDAPSWYELV
VEGRKVAGSAQTRQKGVILQHGSILLDLDEDKLFDLFLYPSERVRERMQRNFKNKAVAINELIEKRVTMDEARKAFKEGF
ETGLNIHLEPYELSQEELDFVHHLAETKYASDEWNYKR
>Q50681 3.1.1.-~~~lipM~~~Probable carboxylic ester hydrolase LipM~~~COG0657
MGAPRLIHVIRQIGALVVAAVTAAATINAYRPLARNGFASLWSWFIGLVVTEFPLPTLASQLGGLVLTAQRLTRPVRAVS
WLVAAFSALGLLNLSRAGRQADAQLTAALDSGLGPDRRTASAGLWRRPAGGGTAKTPGPLRMLRIYRDYAHDGDISYGEY
GRANHLDIWRRPDLDLTGTAPVLFQIPGGAWTTGNKRGQAHPLMSHLAELGWICVAINYRHSPRNTWPDHIIDVKRALAW
VKAHISEYGGDPDFIAITGGSAGGHLSSLAALTPNDPRFQPGFEEADTRVQAAVPFYGVYDFTRLQDAMHPMMLPLLERM
VVKQPRTANMQSYLDASPVTHISADAPPFFVLHGRNDSLVPVQQARGFVDQLRQVSKQPVVYAELPFTQHAFDLLGSARA
AHTAIAVEQFLAEVYATQHAGSEPGPAVAIP
>P95125 3.1.1.-~~~lipN~~~Carboxylic ester hydrolase LipN~~~COG0657
MTKSLPGVADLRLGANHPRMWTRRVQGTVVNVGVKVLPWIPTPAKRILSAGRSVIIDGNTLDPTLQLMLSTSRIFGVDGL
AVDDDIVASRAHMRAICEAMPGPQIHVDVTDLSIPGPAGEIPARHYRPSGGGATPLLVFYHGGGWTLGDLDTHDALCRLT
CRDADIQVLSIDYRLAPEHPAPAAVEDAYAAFVWAHEHASDEFGALPGRVAVGGDSAGGNLSAVVCQLARDKARYEGGPT
PVLQWLLYPRTDFTAQTRSMGLFGNGFLLTKRDIDWFHTQYLRDSDVDPADPRLSPLLAESLSGLAPALIAVAGFDPLRD
EGESYAKALRAAGTAVDLRYLGSLTHGFLNLFQLGGGSAAGTNELISALRAHLSRV
>P37966 ~~~lipO~~~Lipoprotein LipO~~~COG1653
MKIRMRKKWMALPLAAMMIAGCSHSETSNSASGSKDTIKIMAPLLSPESPSDKSPSLKALEKYTGKEIKVTWVPDSSYND
KFNIVMASGEMPHAIVIKDKSAGFIKSVKAGAFWELSPYLKDYKNLSQADEKILKNSSVNGEVYGIYRTRDLIRACMIIR
TDWLKNVGLDMPETLDDFYEVLKAFKEKDPDGNGKDDTYGMVVPKWMGLGNGSPWDVLQIWFGAPNRYGVENGKLIPDFT
TKEYMDALTFFKKLYDEGLINKDFAVMDSAKWNDPVVKGKAGVIVDTGSRASQIQSAMEEADESNKDIIDIVGSLEGPNG
KRTFPTSGYSGMITIPKSSVKTEKELKEVLSFLDKMNDKEAQILTNNGVKGRNYELKDGVFTSLEKNNKSLLYEHEGLAQ
FSMSIPKSEYYIEDQKTKLFQHRKDIITEGEKIAVFNPAESLVSDVYTQKGAQLDNIILDARTQFIIGEIDEKGFDDAVE
LWKKSGGNELMKDLNKLYQSSK
>I6Y9F7 3.1.1.-~~~lipQ~~~Esterase LipQ~~~COG0657
MHIASVTSRCSRAGAEALRQGAQLAADARDTCRAGALLLRGSPCAIGWVAGWLSAEFPARVVTGHALSRISPRSIGRFGT
SWAAQRADQILHAALVDAFGPDFRDLVWHPTGEQSEAARRSGLLNLPHIPGPHRRYAAQTSDIPYGPGGRENLLDIWRRP
DLAPGRRAPVLIQVPGGAWTINGKRPQAYPLMSRMVELGWICVSINYSKSPRCTWPAHIVDVKRAIAWVRENIADYGGDP
DFITITGGSAGAHLAALAALSANDPALQPGFESADTAVQAAAPYYGVYDLTNAENMHEMMMPFLEHFVMRSRYVDNPGLF
KAASPISYVHSEAPPFFVLHGEKDPMVPSAQSRAFSAALRDAGAATVSYAELPNAHHAFDLAATVRSRMVAEAVSDFLGV
IYGRRMGARKGSLALSSPPAS
>P9WK85 3.1.1.-~~~lipR~~~Putative acetyl-hydrolase LipR~~~COG0657
MNLRKNVIRSVLRGARPLFASRRLGIAGRRVLLATLTAGARAPKGTRFQRVSIAGVPVQRVQPPHAATSGTLIYLHGGAY
ALGSARGYRGLAAQLAAAAGMTALVPDYTRAPHAHYPVALEEMAAVYTRLLDDGLDPKTTVIAGDSAGGGLTLALAMALR
DRGIQAPAALGLICPWADLAVDIEATRPALRDPLILPSMCTEWAPRYVGSSDPRLPGISPVYGDMSGLPPIVMQTAGDDP
ICVDADKIETACAASKTSIEHRRFAGMWHDFHLQVSLLPEARDAIADLGARLRGHLHQSQGQPRGVVK
>O53424 3.1.1.-~~~lipU~~~Esterase LipU~~~COG0657
MAVRPVLAVGSYLPHAPWPWGVIDQAARVLLPASTTVRAAVSLPNASAQLVRASGVLPADGTRRAVLYLHGGAFLTCGAN
SHGRLVELLSKFADSPVLVVDYRLIPKHSIGMALDDCHDGYRWLRLLGYEPEQIVLAGDSAGGYLALALAQRLQEVGEEP
AALVAISPLLQLAKEHKQAHPNIKTDAMFPARAFDALDALVASAAARNQVDGEPEELYEPLEHITPGLPRTLIHVSGSEV
LLHDAQLAAAKLAAAGVPAEVRVWPGQVHDFQVAASMLPEAIRSLRQIGEYIREATG
>L0TC47 3.1.1.1~~~lipV~~~Lipase LipV~~~COG0596
MIIDLHVQRYGPSGPARVLTIHGVTEHGRIWHRLAHHLPEIPIAAPDLLGHGRSPWAAPWTIDANVSALAALLDNQGDGP
VVVVGHSFGGAVAMHLAAARPDQVAALVLLDPAVALDGSRVREVVDAMLASPDYLDPAEARAEKATGAWADVDPPVLDAE
LDEHLVALPNGRYGWRISLPAMVCYWSELARDIVLPPVGTATTLVRAVRASPAYVSDQLLAALDKRLGADFELLDFDCGH
MVPQAKPTEVAAVIRSRLGPR
>I6Y2J4 3.1.1.3~~~lipY~~~Triacylglycerol lipase~~~COG0657
MVSYVVALPEVMSAAATDVASIGSVVATASQGVAGATTTVLAAAEDEVSAAIAALFSGHGQDYQALSAQLAVFHERFVQA
LTGAAKGYAAAELANASLLQSEFASGIGNGFATIHQEIQRAPTALAAGFTQVPPFAAAQAGIFTGTPSGAAGFDIASLWP
VKPLLSLSALETHFAIPNNPLLALIASDIPPLSWFLGNSPPPLLNSLLGQTVQYTTYDGMSVVQITPAHPTGEYVVAIHG
GAFILPPSIFHWLNYSVTAYQTGATVQVPIYPLVQEGGTAGTVVPAMAGLISTQIAQHGVSNVSVVGDSAGGNLALAAAQ
YMVSQGNPVPSSMVLLSPWLDVGTWQISQAWAGNLAVNDPLVSPLYGSLNGLPPTYVYSGSLDPLAQQAVVLEHTAVVQG
APFSFVLAPWQIHDWILLTPWGLLSWPQINQQLGIAA
>Q5U780 3.1.1.3~~~~~~Lipase~~~
MKCCRIMFVLLGLWFVFGLSVPGGRTEAASLRANDAPIVLLHGFTGWGREEMFGFKYWGGVRGDIEQWLNDNGYRTYTLA
VGPLSSNWDRACEAYAQLVGGTVDYGAAHAAKHGHARFGRTYPGLLPELKRGGRIHIIAHSQGGQTARMLVSLLENGSQE
EREYAKAHNVSLSPLFEGGHHFVLSVTTIATPHDGTTLVNMVDFTDRFFDLQKAVLEAAAVASNVPYTSQVYDFKLDQWG
LRRQPGESFDHYFERLKRSPVWTSTDTARYDLSVSGAEKLNQWVQASPNTYYLSFSTERTYRGALTGNHYPELGMNAFSA
VVCAPFLGSYRNPTLGIDDRWLENDGIVNTVSMNGPKRGSSDRIVPYDGTLKKGVWNDMGTYNVDHLEIIGVDPNPSFDI
RAFYLRLAEQLASLRP
>P22088 3.1.1.3~~~lip~~~Triacylglycerol lipase~~~COG1075
MARTMRSRVVAGAVACAMSIAPFAGTTAVMTLATTHAAMAATAPAAGYAATRYPIILVHGLSGTDKYAGVLEYWYGIQED
LQQNGATVYVANLSGFQSDDGPNGRGEQLLAYVKTVLAATGATKVNLVGHSQGGLSSRYVAAVAPDLVASVTTIGTPHRG
SEFADFVQDVLAYDPTGLSSSVIAAFVNVFGILTSSSHNTNQDALAALQTLTTARAATYNQNYPSAGLGAPGSCQTGAPT
ETVGGNTHLLYSWAGTAIQPTLSVFGVTGATDTSTLPLVDPANVLDLSTLALFGTGTVMINRGSGQNDGLVSKCSALYGK
VLSTSYKWNHLDEINQLLGVRGAYAEDPVAVIRTHANRLKLAGV
>P0DUB8 3.1.1.3~~~lip~~~Triacylglycerol lipase~~~
MVRSMRSRVAARAVAWALAVMPLAGAAGLTMAASPAAVAADTYAATRYPVILVHGLAGTDKFANVVDYWYGIQSDLQSHG
AKVYVANLSGFQSDDGPNGRGEQLLAYVKQVLAATGATKVNLIGHSQGGLTSRYVAAVAPQLVASVTTIGTPHRGSEFAD
FVQDVLKTDPTGLSSTVIAAFVNVFGTLVSSSHNTDQDALAALRTLTTAQTATYNRNFPSAGLGAPGSCQTGAATETVGG
SQHLLYSWGGTAIQPTSTVLGVTGATDTSTGTLDVANVTDPSTLALLATGAVMINRASGQNDGLVSRCSSLFGQVISTSY
HWNHLDEINQLLGVRGANAEDPVAVIRTHVNRLKLQGV
>P26876 3.1.1.3~~~lip~~~Triacylglycerol lipase~~~
MKKKSLLPLGLAIGLASLAASPLIQASTYTQTKYPIVLAHGMLGFDNILGVDYWFGIPSALRRDGAQVYVTEVSQLDTSE
VRGEQLLQQVEEIVALSGQPKVNLIGHSHGGPTIRYVAAVRPDLIASATSVGAPHKGSDTADFLRQIPPGSAGEAVLSGL
VNSLGALISFLSSGSTGTQNSLGSLESLNSEGAARFNAKYPQGIPTSACGEGAYKVNGVSYYSWSGSSPLTNFLDPSDAF
LGASSLTFKNGTANDGLVGTCSSHLGMVIRDNYRMNHLDEVNQVFGLTSLFETSPVSVYRQHANRLKNASL
>P08658 3.1.1.3~~~lips~~~Triacylglycerol lipase~~~COG1075
MDDSVNTRYPILLVHGLFGFDRIGSHHYFHGIKQALNECGASVFVPIISAANDNEARGDQLLKQIHNLRRQVGAQRVNLI
GHSQGALTARYVAAIAPELIASVTSVSGPNHGSELADRLRLAFVPGRLGETVAAALTTSFSAFLSALSGHPRLPQNALNA
LNALTTDGVAAFNRQYPQGLPDRWGGMGPAQVNAVHYYSWSGIIKGSRLAESLNLLDPLHNALRVFDSFFTRETRENDGM
VGRFSSHLGQVIRSDYPLDHLDTINHMARGSRRRINPVELYIEHAKRLKEAGL
>P0DUB9 3.1.1.3~~~lip~~~Triacylglycerol lipase~~~
ADTYAATRYPVILVHGLAGTDKFANVVDYWYGIQSDLQSHGAKVYVANLSGFQSDDGPNGRGEQLLAYVKQVLAATGATK
VNLIGHSQGGLTSRYVAAVAPQLVASVTTIGTPHRGSEFADFVQDVLKTDPTGLSSTVIAAFVNVFGTLVSSSHNTDQDA
LAALRTLTTAQTATYNRNFPSAGLGAPGSCQTGAATETVGGSQHLLYSWGGTAIQPTSTVLGVTGATDTSTGTLDVANVT
DPSTLALLATGAVMINRASGQNDGLVSRCSSLFGQVISTSYHWNHLDEINQLLGVRGANAEDPVAVIRTHVNRLKLQGV
>P25275 3.1.1.3~~~lip~~~Triacylglycerol lipase~~~
MARTMRSRVVAGAVACAMSIAPFAGTTAVMTLATTHAAMAATAPADGYAATRYPIILVHGLSGTDKYAGVVEYWYGIQED
LQQNGATVYVANLSGFQSDDGANGRGEQLLAYVKTVLAATGATKVNLVGHSQGGLTSRYVAAVAPDLVASVTTIGTPHRG
SEFADFVQNVLAYDPTGLSSSVIAAFVNVFGILTSSSHNTNQDALAALQTLTTARAATYNQNYPSAGLGAPGSCQTGAPT
ETVGGNTHLLYSWAGTAIQPTLSVFGITGATDTSTVPLVDLANVLDPSTLALFGTGTVMINRGSGQNDGLVSKCSALYGK
VLSTSYKWNHLDEINQLLGVRGAYAEDPVAVIRTHANRLKLAGV
>P26877 3.1.1.3~~~lipL~~~Triacylglycerol lipase~~~
MKKKSLLPLGLAIGLASLAASPLIQASTYTQTKYPIVLAHGMLGFDNILGVDYWFGIPSALRRDGAQVYVTEVSQLDTSE
VRGEQLLQQVEEIVALSGQPKVNLIGHSHGGPTIRYVAAVRPDLMPSATSVGAPHKGSDTADFLRQIPPGSAGEAVLSGL
VNSLGALISFLSSGSAGTQNSLGSLESLNSEGAARFNAKYPQGIPTSACGEGAYKVNGVSYYSWSGSSPLTNFLDPSDAF
LGASSLTFKNGTANDGLVGTCSSHLGMVIRDNYRMNHLDEVNQVFGLTSLFETSPVSVYRQHANRLKNASL
>P0C0R3 3.1.1.3~~~lip~~~Lipase~~~
MKTRQNKYSIRKFSVGASSILIAALLFMGGGSAQAAEQQQDKGTVENSTTQSIGDGNEKLSEQQSTQNKNVNEKSNVNSI
TENESLHNETPKNEDLIQQQKDSQNDNKSESVVEQNKENGAFVQNHSEEKPQQEQVELEKHASENNQTLHSKAAQSNEDV
KTKPSQLDNTAAKQEDSQKENLSKQDTQSSKTTDLLRATAQNQSKDSQSTEEINKEVNNDTQQVTAKNDDAKVESFNLNS
KEEPLKVDKQANPTTDKDKSSKNDKGSQDGLANLESNAVATTNKQSKQQVSEKNEDQTNKSAKQKQYKNNDPIILVHGFN
GFTDDINPSVLTHYWGGDKMNIRQDLEENGYEAYEASISAFGSNYDRAVELYYYIKGGRVDYGAAHAAKYGHERYGKTYE
GVYKDWKPGQKIHLVGHSMGGQTIRQLEELLRHGNPEEVEYQKQHGGEISPLYQGGHDNMVSSITTLGTPHNGTHASDLL
GNEAIVRQLAYDVGKMYGNKDSRVDFGLEHWGLKQKPNESYIQYVKRVQNSKLWKSKDSGLHDLTRDGATDLNRKTSLNP
NIVYKTYTGESTHKTLAGKQKADLNMFLPFTITGNLIGKAKEKEWRENDGLVSVISSQHPFNQKYVEATDKNQKGVWQVT
PTKHDWDHVDFVGQDSTDTKRTRDELQQFWHGLAEDLVQSEQLTSTNK
>P04635 3.1.1.3~~~lip~~~Lipase~~~
MKETKHQHTFSIRKSAYGAASVMVASCIFVIGGGVAEANDSTTQTTTPLEVAQTSQQETHTHQTPVTSLHTATPEHVDDS
KEATPLPEKAESPKTEVTVQPSSHTQEVPALHKKTQQQPAYKDKTVPESTIASKSVESNKATENEMSPVEHHASNVEKRE
DRLETNETTPPSVDREFSHKIINNTHVNPKTDGQTNVNVDTKTIDTVSPKDDRIDTAQPKQVDVPKENTTAQNKFTSQAS
DKKPTVKAAPEAVQNPENPKNKDPFVFVHGFTGFVGEVAAKGENHWGGTKANLRNHLRKAGYETYEASVSALASNHERAV
ELYYYLKGGRVDYGAAHSEKYGHERYGKTYEGVLKDWKPGHPVHFIGHSMGGQTIRLLEHYLRFGDKAEIAYQQQHGGII
SELFKGGQDNMVTSITTIATPHNGTHASDDIGNTPTIRNILYSFAQMSSHLGTIDFGMDHWGFKRKDGESLTDYNKRIAE
SKIWDSEDTGLYDLTREGAEKINQKTELNPNIYYKTYTGVATHETQLGKHIADLGMEFTKILTGNYIGSVDDILWRPNDG
LVSEISSQHPSDEKNISVDENSELHKGTWQVMPTMKGWDHSDFIGNDALDTKHSAIELTNFYHSISDYLMRIEKAESTKN
A
>Q93MW7 3.1.1.3~~~~~~Lipase~~~
MRLSRRAATASALLLTPALALFGASAAVSAPRIQATDYVALGDSYSSGVGAGSYDSSSGSCKRSTKSYPALWAASHTGTR
FNFTACSGARTGDVLAKQLTPVNSGTDLVSITIGGNDAGFADTMTTCNLQGESACLARIAKARAYIQQTLPAQLDQVYDA
IDSRAPAAQVVVLGYPRFYKLGGSCAVGLSEKSRAAINAAADDINAVTAKRAADHGFAFGDVNTTFAGHELCSGAPWLHS
VTLPVENSYHPTANGQSKGYLPVLNSAT
>P15493 3.1.1.3~~~hlyC~~~Triacylglycerol lipase~~~COG1075
MNKIIILIALSLFSSSIWAGTSAHALSQQGYTQTRYPIVLVHGLFGFDTLAGMDYFHGIPQSLTRDGAQVYVAQVSATNS
SERRGEQLLAQVESLLAVTGAKKVNLIGHSHGGPTIRYVASVRPDLVASVTSIGGVHKGSAVADLVRGVIPSGSVSEQVA
VGLTQGLVALIDLLSGGKAHPQDPLASLAALTTEGSLKFNQYYPEGVPTSACGEGAYQVNGVRYYSWSGAATVTNILDPS
DVAMGLIGLVFNEPNDGLVATCSTHLGKVIRDDYRMNHLDEINGLLGIHSLFETDPVTLYRQHANRLKQAGL
>N6Z5E2 5.4.4.8~~~lis~~~Linalool isomerase~~~
MESTRMLRQPIQLLQGHKGPVTASRHRRNAVVYALLCLLALLPVATGQSAAWQAAGLGLFMPGAGFLALGGAWALLFPLT
VFVFWLAVIAWFWSGMVVAPLTLWLGTAALAGWLAGEAIWPPAVYLAPAAAAATFLFFQYRGAKRRAKDREHFKFRQSFF
AESLAEVHQRAATEPEPGERELTPDQLQGVRYLLELALQPVGQYKGYTIIDQFQPAALRYQLNHIGFALGMVQGHYTPNF
QGYLGQAQRNVIDTYRERKVWGYWVYESMWGHFNFSDFDPARKDNIMLTGWYGMHVGQYMLNAGDTRYSQPGSLSFRLND
KTCYHHDIHSINQSVRENFQSSDFCLYPCEPNWVYPVCNMYGMSSLAVYDTLFERRDTAQVLPKWLHMLDTEFTDQKGSL
VGLRSYWTGLEMPFYTGEAGFAFFANIFSTDLARKLWAVGRKELSMCLTQDAEGQTRLTLPKEALAFFDTIDAGNYRPGK
LFAYVAVQMCAREFGDDELAEAARRSMDQDCGPVVENGVARYTKGSTLANIWGVEGRLMRTGDFRNSFVKGPPSSVFDGP
LLGDARYPEILVAKAFSRGDDLELVLYPGAGDGPQTLGFERLKPGVRYVVEGAASGEFTADADGRASLAVTLSGRTALHI
KPGH
>Q9I2A0 4.1.3.26~~~liuE~~~3-hydroxy-3-isohexenylglutaryl-CoA/hydroxy-methylglutaryl-CoA lyase~~~
MNLPKKVRLVEVGPRDGLQNEKQPIEVADKIRLVDDLSAAGLDYIEVGSFVSPKWVPQMAGSAEVFAGIRQRPGVTYAAL
APNLKGFEAALESGVKEVAVFAAASEAFSQRNINCSIKDSLERFVPVLEAARQHQVRVRGYISCVLGCPYDGDVDPRQVA
WVARELQQMGCYEVSLGDTIGVGTAGATRRLIEAVASEVPRERLAGHFHDTYGQALANIYASLLEGIAVFDSSVAGLGGC
PYAKGATGNVASEDVLYLLNGLEIHTGVDMHALVDAGQRICAVLGKSNGSRAAKALLAKA
>Q8YEE8 ~~~~~~Leu/Ile/Val-binding protein homolog 3~~~COG0683
MNLKLLSSVAFAATIGFASAAYADITIGVIAPLTGPVAAFGDQVKKGAETAVEVINKAGGIKGEKVVLKFADDAGEPKQG
VSAANQIVGDGIKFVVGLVTTGVAVPVSDVLSENGVLMVTPTATGPDLTARGLENVFRTCGRDGQQAEVMADYVLKNMKD
KKVAVIHDKGAYGKGLADAFKAAINKGGITEVHYDSVTPGDKDFSALVTKLKSAGAEVVYFGGYHAEGGLLSRQLHDAGM
QALVLGGEGLSNTEYWAIGGTNAQGTLFTNAKDATKNPAAKDAIQALKAKNIPAEAFTMNAYAAVEVIKAGIERAGSTDD
SAAVAKALHDGKPIETAIGTLTYSETGDLSSPSFDIFKWDDGKIVGLE
>Q1MDE9 ~~~braC3~~~Leu/Ile/Val-binding protein BraC3~~~COG0683
MTLKTLTATLVASLAFAPLAHADITIGLIAPLTGPVAAYGDQVKNGAQTAVDEINKKGGILGEKVVLELADDAGEPKQGV
SAANKVVGDGIRFVVGPVTSGVAIPVSDVLAENGVLMVTPTATAPDLTKRGLTNVLRTCGRDDQQAEVAAKYVLKNFKDK
RVAIVNDKGAYGKGLADAFKATLNAGGITEVVNDAITPGDKDFSALTTRIKSEKVDVVYFGGYHPEGGLLARQLHDLAAN
ATIIGGDGLSNTEFWAIGTDAAGGTIFTNASDATKSPDSKAAADALAAKNIPAEAFTLNAYAAVEVLKAGIEKAGSAEDA
EAVATALKDGKEIPTAIGKVTYGETGDLTSQSFSLYKWEAGKIVAAE
>P22731 ~~~livF~~~High-affinity branched-chain amino acid transport ATP-binding protein LivF~~~COG0410
MEKVMLSFDKVSAHYGKIQALHEVSLHINQGEIVTLIGANGAGKTTLLGTLCGDPRATSGRIVFDDKDITDWQTAKIMRE
AVAIVPEGRRVFSRMTVEENLAMGGFFAERDQFQERIKWVYELFPRLHERRIQRAGTMSGGEQQMLAIGRALMSNPRLLL
LDEPSLGLAPIIIQQIFDTIEQLREQGMTIFLVEQNANQALKLADRGYVLENGHVVLSDTGDALLANEAVRSAYLGG
>P0AEX7 ~~~livH~~~High-affinity branched-chain amino acid transport system permease protein LivH~~~COG0559
MSEQFLYFLQQMFNGVTLGSTYALIAIGYTMVYGIIGMINFAHGEVYMIGSYVSFMIIAALMMMGIDTGWLLVAAGFVGA
IVIASAYGWSIERVAYRPVRNSKRLIALISAIGMSIFLQNYVSLTEGSRDVALPSLFNGQWVVGHSENFSASITTMQAVI
WIVTFLAMLALTIFIRYSRMGRACRACAEDLKMASLLGINTDRVIALTFVIGAAMAAVAGVLLGQFYGVINPYIGFMAGM
KAFTAAVLGGIGSIPGAMIGGLILGIAEALSSAYLSTEYKDVVSFALLILVLLVMPTGILGRPEVEKV
>P0AD96 ~~~livJ~~~Leu/Ile/Val-binding protein~~~COG0683
MNIKGKALLAGCIALAFSNMALAEDIKVAVVGAMSGPVAQYGDQEFTGAEQAVADINAKGGIKGNKLQIVKYDDACDPKQ
AVAVANKVVNDGIKYVIGHLCSSSTQPASDIYEDEGILMITPAATAPELTARGYQLILRTTGLDSDQGPTAAKYILEKVK
PQRIAIVHDKQQYGEGLARAVQDGLKKGNANVVFFDGITAGEKDFSTLVARLKKENIDFVYYGGYHPEMGQILRQARAAG
LKTQFMGPEGVANVSLSNIAGESAEGLLVTKPKNYDQVPANKPIVDAIKAKKQDPSGAFVWTTYAALQSLQAGLNQSDDP
AEIAKYLKANSVDTVMGPLTWDEKGDLKGFEFGVFDWHANGTATDAK
>P17215 ~~~livJ~~~Leu/Ile/Val/Thr-binding protein~~~
MKGKTLLAGCIALSLSHMAFADDIKVAVVGAMSGPVAQYGDQEFTGAEQAIADINAKGGIKGDKLVAVKYDDACDPKQAV
AVANKVVNDGIKYVIGHLCSSSTQPASDIYEDEGILMITPAATAPELTARGYKLVLRTTGLDSDQGPTAAKYILEKVKPQ
RIAIIHDKQQYGEGLARAVQDGLKKGGVNVVFFDGITAGEKDFSTLVARLKKENIDFVYYGGYHPEMGQILRQSRAAGLK
TQFMGPEGVANVSLSNIAGESAEGLLVTKPKNYDQVPANKPIVDAIKAKKQDPSGAFVWTTYAALQSLQAGLNHSDDPAE
IAKYLKGATVDTVMGPLSWDEKGDLKGFEFGVFDWHANGTATDAK
>P04816 ~~~livK~~~Leucine-specific-binding protein~~~COG0683
MKRNAKTIIAGMIALAISHTAMADDIKVAVVGAMSGPIAQWGDMEFNGARQAIKDINAKGGIKGDKLVGVEYDDACDPKQ
AVAVANKIVNDGIKYVIGHLCSSSTQPASDIYEDEGILMISPGATNPELTQRGYQHIMRTAGLDSSQGPTAAKYILETVK
PQRIAIIHDKQQYGEGLARSVQDGLKAANANVVFFDGITAGEKDFSALIARLKKENIDFVYYGGYYPEMGQMLRQARSVG
LKTQFMGPEGVGNASLSNIAGDAAEGMLVTMPKRYDQDPANQGIVDALKADKKDPSGPYVWITYAAVQSLATALERTGSD
EPLALVKDLKANGANTVIGPLNWDEKGDLKGFDFGVFQWHADGSSTAAK
>P0A1W6 ~~~livK~~~Leucine-specific-binding protein~~~
MKRKAKTIIAGIVALAVSQGAMADDIKVAIVGAMSGPVAQWGDMEFNGARQAIKDINAKGGIKGDKLVGVEYDDACDPKQ
AVAVANKIVNDGIQYVIGHLCSSSTQPASDIYEDEGILMISPGATNPELTQRGYQYIMRTAGLDSSQGPTAAKYILETVK
PQRIAIIHDKQQYGEGLARSVQDGLKQGNANIVFFDGITAGEKDFSALIARLQKENIDFVYYGGYYPEMGQMLRQARANG
LKTQFMGPEGVGNASLSNIAGGAAEGMLVTMPKRYDQDPANKAIVEALKADKKDPSGPYVWITYAAVQSLATAMTRSASH
APLDLVKDLKANGADTVIGPLKWDEKGDLKGFEFGVFQWHADGSSTVAK
>Q2MF66 1.1.3.-~~~livQ~~~6'''-hydroxyparomomycin C oxidase~~~
MERLRGPSPLENTTARHPAPLGPAHRDGLEPGTADRVWDVCVIGSGASGAVAADRLVRQGLDVLMVEEGFRLAPHVGLDE
AESLSRQALARDGEGNWTDEGWPWTTSNLGGGTVYYGGASFRYRPFDFDPGELVHTDGVDVRWPYTLADLVPYYEVLERR
LGVCGGDAPGIHRGSRHSRGPAHQPSPAARVLRAAGESLGYRPFPTPLAINRDPHGGRAACARDSLCVSHLCPTGAKGDV
VAVFLAPLAAHPNFALRTGVRALRLEQDRSGEVAAVRCLDRQTGQAHRVRARVYVVACNAIQSAALLLRSRTPYSPDGVG
NHSHLVGRGLCMKLSEYLSGTVDADPAVLADPYTNTGPFSTVAFLDHYLDPDCPGGFGGLIYESKRDQRHKLVHDALELR
IETILADHPNLDNRVGLSTHLDEDGMPAVVIDYTPDPRDLDRLRYMTGRCERLLRTAGARGIRSRSTGFAQGSSHLHGTC
RAGHDPARSVVDAWGRVHSADNVYIVDGSFMPYPGGLNPTLTIQAHALRTSRAIASHLAADRAAHV
>Q7BHI8 ~~~lktA~~~Leukotoxin~~~
MGTRLTTLSNGLKNTLTATKSGLHKAGQSLTQAGSSLKTGAKKIILYIPQNYQYDTEQGNGLQDLVKAAEELGIEVQREE
RNNIATAQTSLGTIQTAIGLTERGIVLSAPQIDKLLQKTKAGQALGSAESIVQNANKAKTVLSGIQSILGSVLAGMDLDE
ALQNNSNQHALAKAGLELTNSLIENIANSVKTLDEFGEQISQFGSKLQNIKGLGTLGDKLKNIGGLDKAGLGLDVISGLL
SGATAALVLADKNASTAKKVGAGFELANQVVGNITKAVSSYILAQRVAAGLSSTGPVAALIASTVSLAISPLAFAGIADK
FNHAKSLESYAERFKKLGYDGDNLLAEYQRGTGTIDASVTAINTALAAIAGGVSAAAAGSVIASPIALLVSGITGVISTI
LQYSKQAMFEHVANKIHNKIVEWEKNNHGKNYFENGYDARYLANLQDNMKFLLNLNKELQAERVIAITQQQWDNNIGDLA
GISRLGEKVLSGKAYVDAFEEGKHIKADKLVQLDSANGIIDVSNSGKAKTQHILFRTPLLTPGTEHRERVQTGKYEYITK
LNINRVDSWKITDGAASSTFDLTNVVQRIGIELDNAGNVTKTKETKIIAKLGEGDDNVFVGSGTTEIDGGEGYDRVHYSR
GNYGALTIDATKETEQGSYTVNRFVETGKALHEVTSTHTALVGNREEKIEYRHSNNQHHAGYYTKDTLKAVEEIIGTSHN
DIFKGSKFNDAFNGGDGVDTIDGNDGNDRLFGGKGDDILDGGNGDDFIDGGKGNDLLHGGKGDDIFVHRKGDGNDIITDS
DGNDKLSFSDSNLKDLTFEKVKHNLVITNSKKEKVTIQNWFREADFAKEVPNYKATKDEKIEEIIGQNGERITSKQVDDL
IAKGNGKITQDELSKVVDNYELLKHSKNVTNSLDKLISSVSAFTSSNDSRNVLVAPTSMLDQSLSSLQFARAA
>Q9CFU9 7.2.2.10~~~yoaB~~~Calcium-transporting ATPase 1~~~COG0474
MQPYNQSVNEVLEETKSQFEGLSPKEVKNRQAKDGFNELKEKKKTSTWELFIDTLKDPMVIILLLVAFVQLFLGEFVESL
VIFIVLMINSVVAVVQTKRAESSLDALRQMSAPSAKVLRNGEKTSIPARELVVGDIVSLEAGDFIPADGRLIDVQNLRVE
EGMLTGESEPVEKFSDVIEGEVALGDRKNMVFSSSLVVYGRADFLVTAIAEQTEIGKIAQMLETAEAKQTPLQQKLEKFG
KQLGWVILALCALIFAVQILRLFTTNQTADMQKAVLDSFMFAVAVAVAAIPEALSSVVTIVLSVGTNKMAKQHAIMRNLP
AVETLGSTSVICTDKTGTLTQNKMTVVDSFLPTQGSKELTDLTQADQKLLLNAMVLCNDSSFSQEGQLLGDPTEVALIAY
SDKIGYPYQDLREKSPRLAEFPFDSERKLMSTINDFEGQKTIFVKGGPDVLFNRCNQVFLDGKVQEFTPELKEKFQAQNE
AFSQKALRVLAYAYKPVSDNKTELTLTDENDLILIGLSAMIDPPREAVYDSIAEAKKAGIKTIMITGDHKTTAQAIAKDI
GLMNEGDMALTGQELDALTEDELRENLEKISVYARVSPENKIRIVRAWQNEHQVTAMTGDGVNDAPALKQANIGIAMGSG
TDVAKDASSMILTDDNFVSIVSAVSIGRVVYDNIKKSISYLFSGNLGAIIAIVFALVVGWVNPFTALQLLFINLVNDSVP
AIALGMEKAEPDVMEKAPRQLNEGIFANGLMRVILIRGSLIGIAAIISQYVGQKTSPEMGVAMAFTTLILARTLQTFAAR
SNSQNILKLGFTTNKYVLMAVTFCLALYSLTTLPFLREIFSIPAAFGWSQWIVAAGLAVIAVICMEILKSIKGVFEKH
>P33232 1.1.-.-~~~lldD~~~L-lactate dehydrogenase~~~COG1304
MIISAASDYRAAAQRILPPFLFHYMDGGAYSEYTLRRNVEDLSEVALRQRILKNMSDLSLETTLFNEKLSMPVALAPVGL
CGMYARRGEVQAAKAADAHGIPFTLSTVSVCPIEEVAPAIKRPMWFQLYVLRDRGFMRNALERAKAAGCSTLVFTVDMPT
PGARYRDAHSGMSGPNAAMRRYLQAVTHPQWAWDVGLNGRPHDLGNISAYLGKPTGLEDYIGWLGNNFDPSISWKDLEWI
RDFWDGPMVIKGILDPEDARDAVRFGADGIVVSNHGGRQLDGVLSSARALPAIADAVKGDIAILADSGIRNGLDVVRMIA
LGADTVLLGRAFLYALATAGQAGVANLLNLIEKEMKVAMTLTGAKSISEITQDSLVQGLGKELPAALAPMAKGNAA
>P9WND5 1.1.-.-~~~lldD~~~Putative L-lactate dehydrogenase~~~COG1304
MAVNRRVPRVRDLAPLLQFNRPQFDTSKRRLGAALTIQDLRRIAKRRTPRAAFDYADGGAEDELSIARARQGFRDIEFHP
TILRDVTTVCAGWNVLGQPTVLPFGIAPTGFTRLMHTEGEIAGARAAAAAGIPFSLSTLATCAIEDLVIAVPQGRKWFQL
YMWRDRDRSMALVRRVAAAGFDTMLVTVDVPVAGARLRDVRNGMSIPPALTLRTVLDAMGHPRWWFDLLTTEPLAFASLD
RWPGTVGEYLNTVFDPSLTFDDLAWIKSQWPGKLVVKGIQTLDDARAVVDRGVDGIVLSNHGGRQLDRAPVPFHLLPHVA
RELGKHTEILVDTGIMSGADIVAAIALGARCTLIGRAYLYGLMAGGEAGVNRAIEILQTGVIRTMRLLGVTCLEELSPRH
VTQLRRLGPIGAPT
>P33231 ~~~lldP~~~L-lactate permease~~~COG1620
MNLWQQNYDPAGNIWLSSLIASLPILFFFFALIKLKLKGYVAASWTVAIALAVALLFYKMPVANALASVVYGFFYGLWPI
AWIIIAAVFVYKISVKTGQFDIIRSSILSITPDQRLQMLIVGFCFGAFLEGAAGFGAPVAITAALLVGLGFKPLYAAGLC
LIVNTAPVAFGAMGIPILVAGQVTGIDSFEIGQMVGRQLPFMTIIVLFWIMAIMDGWRGIKETWPAVVVAGGSFAIAQYL
SSNFIGPELPDIISSLVSLLCLTLFLKRWQPVRVFRFGDLGASQVDMTLAHTGYTAGQVLRAWTPFLFLTATVTLWSIPP
FKALFASGGALYEWVINIPVPYLDKLVARMPPVVSEATAYAAVFKFDWFSATGTAILFAALLSIVWLKMKPSDAISTFGS
TLKELALPIYSIGMVLAFAFISNYSGLSSTLALALAHTGHAFTFFSPFLGWLGVFLTGSDTSSNALFAALQATAAQQIGV
SDLLLVAANTTGGVTGKMISPQSIAIACAAVGLVGKESDLFRFTVKHSLIFTCIVGVITTLQAYVLTWMIP
>P0ACL7 ~~~lldR~~~Putative L-lactate dehydrogenase operon regulatory protein~~~COG2186
MIVLPRRLSDEVADRVRALIDEKNLEAGMKLPAERQLAMQLGVSRNSLREALAKLVSEGVLLSRRGGGTFIRWRHDTWSE
QNIVQPLKTLMADDPDYSFDILEARYAIEASTAWHAAMRATPGDKEKIQLCFEATLSEDPDIASQADVRFHLAIAEASHN
IVLLQTMRGFFDVLQSSVKHSRQRMYLVPPVFSQLTEQHQAVIDAIFAGDADGARKAMMAHLSFVHTTMKRFDEDQARHA
RITRLPGEHNEHSREKNA
>Q5ZT84 1.13.11.27~~~lly~~~4-hydroxyphenylpyruvate dioxygenase~~~COG3185
MQNNNPCGLDGFAFLEFSGPDRNKLHQQFSEMGFQAVAHHKNQDITLFKQGEIQFIVNAASHCQAEAHASTHGPGACAMG
FKVKDAKAAFQHAIAHGGIAFQDAPHANHGLPAIQAIGGSVIYFVDEEHQPFSHEWNITSPEPVVGNGLTAIDHLTHNVY
RGNMDKWASFYASIFNFQEIRFFNIKGKMTGLVSRALGSPCGKIKIPLNESKDDLSQIEEFLHEYHGEGIQHIALNTNDI
YKTVNGLRKQGVKFLDVPDTYYEMINDRLPWHKEPLNQLHAEKILIDGEADPKDGLLLQIFTENIFGPVFFEIIQRKGNQ
GFGEGNFQALFEAIERDQVRRGTLKELS
>Q8Y8Q5 7.2.2.10~~~~~~Calcium-transporting ATPase lmo0841~~~COG0474
MEIYRKSAAETFTQLEATEKGLTTSEVTKRQEKYGFNELKNKKKDPLWKLFLETFKDPMVIVLVIAALVQLVLGEVVESL
IIFLVLIVNSIISVVQTRKAESSLDALREMSAPVAKVIRDGSKQSIHARELVPGDVVILDAGDFVPADGRLFESGSLKID
EGMLTGESEAVEKYIDTIPDEVGLGDRVNMVFSGSLVVYGRGMFVVTGTASETEIGKIAGLLETAEAKQTPLQRKLESFS
KKLGLGILALCVLIFAVEAGRVLLGDNSADMATAILNAFMFAVAVAVAAIPEALSSIVTIVLAVGTNKMAKQHAIIRKLP
AVETLGSTSVICTDKTGTLTQNKMTVVDYYLPDGTKENFPESPENWSEGERRLIHIAVLCNDSNINSEGKELGDPTEVAL
IAFSNKNNQDYNEIREKFIREGEIPFDSDRKLMSTLHTFNENKAMLTKGGPDVMFARCSYVFLDGEEKPMTEEILAKLKE
TNEEFSNQALRVLAYGYKRMPADTTELKLEDEQDIVLVGLTAMIDPPREAVYASIEESKKAGIRTVMITGDHKTTAQAIG
RDIGLMDADDIALTGQELDAMPEEELDKKLEHIAVYARVSPENKIRIVKAWQKKGKITAMTGDGVNDAPALKQADIGVAM
GSGTDVAKDSAAMILTDDNFVSIVDAVGVGRTVFDNIKKSIAYLFAGNLGAIIAILFALVLDWINPFTALQLLFINLVND
SLPAIALGMEKAEPDVMKRKPRDINEGIFAGGTMRAVISRGVLIGIAVIISQYIGMQISPEMSVAMAFTTLILARTLQTF
AARSNVQTAFGAGFFSNKYVIGAVLLCFVLYGITVLPGAREIFSIPASFGLHEWSIAAGLALAAVVMMEIIKVVQNKFFK
>P21795 1.13.12.4~~~~~~L-lactate 2-monooxygenase~~~
MSNWGDYENEIYGQGLVGVAPTLPMSYADWEAHAQQALPPGVLSYVAGGSGDEHTQRANVEAFKHWGLMPRMLMAATERD
LSVELWGKTWAAPMFFAPIGVIALCAQDGHGDAASAQASARTGVPYITSTLAVSSLEDIRKHAGDTPAYFQLYYPEDRDL
AESFIRRAEEAGYDGLVITLDTWIFGWRPRDLTISNFPFLRGLCLTNYVTDPVFQKKFKAHSGVEAEGLRDNPRLAADFW
HGLFGHSVTWEDIDWVRSITKMPVILKGIQHPDDARRAVDSGVDGIYCSNHGGRQANGGLPALDCLPEVVKASGDTPVLF
DSGIRTGADVVKALAMGASAVGIGRPYAWGAALGGSKGIEHVARSLLAEADLIMAVDGYRNLKELTIDALRPTR
>Q9CHL8 7.6.2.2~~~lmrA~~~Multidrug resistance ABC transporter ATP-binding and permease protein~~~COG1132
MERGPQMANRIEGKAVDKTSIKHFIKLIRAAKPRYLFFIIGILAGIVGTLIQLQVPKMVQPLVNSFGHGVNGGKVALVIA
LYIGSAAVSAIAAIVLGIFGESVVKNLRTRVWDKMIHLPVKYFDEVKTGEMSSRLANDTTQVKNLIANSIPQAFTSILLL
VGSIVFMLQMQWRLTLAMIIAVPVVMLIMFPIMTFGQKIGRTRQDSLANFQGIASESLSEIRLVKSSNAEKQASKKAEND
VNALYKIGVKEAIFDGLMSPVMMLSMMLMIFGLLAYGIYLISTGVMSLGTLLGMMMYLMNLIGAVPTVATFFTELAKASG
STGRLTELLDEEQEVLHQGESLDLEGKTLSARHVDFAYDDSEQILRDISFEAQPNSIIAFAGPSGGGKSTIFSLLERFYQ
PTAGEITIDGQPIDNISLENWRSQIGFVSQDSAIMAGTIRENLTYGLEGDYTDEDLWQVLDLAFARSFVENMPDQLNTEV
GERGVKISGGQRQRLAIARAFLRNPKILMLDEATASLDSESESMVQKALDSLMKGRTTLVIAHRLSTIVDADKIYFIEKG
QITGSGKHNELVATHPLYAKYVSEQLTVGQ
>B3TLD6 3.2.1.140~~~lnbB~~~Lacto-N-biosidase~~~
MEKSSNRRFGVRTVAAIVAGLMVGGMCTAMTASAADDSAAGYSATAPVNLTRPATVPSMDGWTDGTGAWTLGEGTRVVSS
DALAARAQSLASELTKFTDVDIKAATGSATGKDISLTLDASKKAELGDEGFKLNIGSKGLEVIGATDIGVFYGTRSVSQM
LRQGQLTLPAGTVATKPKYKERGATLCACQINISTDWIDRFLSDMADLRLNYVLLEMKLKPEEDNTKKAATWSYYTRDDV
KKFVKKANNYGIDVIPEINSPGHMNVWLENYPEYQLADNSGRKDPNKLDISNPEAVKFYKTLIDEYDGVFTTKYWHMGAD
EYMIGTSFDNYSKLKTFAEKQYGAGATPNDAFTGFINDIDKYVKAKGKQLRIWNDGIVNTKNVSLNKDIVIEYWYGAGRK
PQELVQDGYTLMNATQALYWSRSAQVYKVNAARLYNNNWNVGTFDGGRQIDKNYDKLTGAKVSIWPDSSYFQTENEVEKE
IFDGMRFISQMTWSDSRPWATWNDMKADIDKIGYPLDIREYDYTPVDAGIYDIPQLKSISKGPWELITTPDGYYQMKDTV
SGKCLALFTGSKHLDVVTQVGARPELRNCADVSVGQDQRNTANERNTQKWQIRADKDGKYTISPALTQQRLAIATGNEQN
IDLETHRPAAGTVAQFPADLVSDNALFTLTGHMGMSATVDSKTVNPASPSKITVKVRAASNANTGDVTVTPVVPEGWEIK
PGSVSLKSIPAGKAAIAYFNVVNTTGTGDATVQFKLTNTKTGEELGTTSVALTGSLTKDVEASDYAASSQETTGEHAPVG
NAFDKNANTFWHSKYSNPSANLPHWLAFKASPGEGNKIAAITHLYRQDKLNGPAKNVAVYVVAASDANSVADVTNWGEPV
ATAEFPYTKELQTIALPNTIPSGDVYVKFQINDAWGLTETSAGVTWAAVAELAATAKATPVELTEPEQPKDNPEVTETPE
ATGVTVSGDGVANGALSLKKGTTAQLTAKVAPDDADQAVTWASSDDKVVTVDKTGKVTAVAKGVAKVTATTANGKSASVT
VTVTEDSEVPGPTGPTEPTKPGTEKPTTKPTTKPNDGKLSATGADTAVLATIAALFALAGGAVVAVRRRSVR
>P0DW93 3.2.1.140~~~lnbX~~~Lacto-N-biosidase~~~
MTSRQGRQAIAATAAMGVAVALALPTAAFAQSATQGKETATTTSSGTTYYVSSAHGDDANAGTSENAPWKSLTKVNDIAS
DLGPGDSVLLEYGSEFNDQYLHIKDTAGNADAPITISAYGDADEGKPVIASNGVKGSQWEQDYRANVGNHKNKGTVSTTL
LLKDVSYITVSNLEITNDDADVYDPIDTWKWTDTPDSDGTKLDRSASRMDRTGVAGIAENGATMSNVTLDNLYIHDVDGN
IYNKHMANGGIYFMAHYPMENTSAETDVWLREHVSRFDHVTIRNSTVKDVDRWGIAVGYTAYLNYIDANYGDGSIDDALI
AKYGSTNVRIENNYVKGAGGDAITLMYCDRPVIEHNVGDSVSKHINTQDYTQPGSYGGRVAAGIWPWRCKDPVFQYNEMY
NNLNAEHGNGDGQAWDADYGDGTLYQYNYSYGNSFASLMICNWYAVNTTFRYNISQNDRQGVFDLPSNGPGNHIYNNTVY
VDADSQVLTKRSNSQSLFENNIFINATNTKKTETWNRGSQNGGQTYDNNMYVNYANKPTSDANAIEADDVSAVLAGAGSA
PTSALKSGAEHARTGEKAAFDGYRPVAGSKAINAGKVVSDLNDYAVENDFLGNAVKGRPDLGAVESDVVSVTMASSKYET
GTETDSGTGDKTKVIHVTFTDKNPVTVKELLSNVSADKGVDKAVYRVADAKSGKSADARSAESEPNMLDRLLSLLPGSDR
NAKDDETKLADSEPVRDGDILRFSAEGTDETDEYTIRQRITWDWVADYEQGVADFDWKAQRRTSAGGEWTTISAYDGSWP
NTVYDQYYGVGVNGTLAELSGDRKQTHGLLIDKPGDGLPTAMAWKAPESGTVMLSLKTFADKIAEPYLRQNADNAGKKVT
LSLMRNDETLCSADDLSVYQKSSEQFAQCLAEHGSIDVQEGDWIRIVADAETGVKAPSLHISPVITYEDKAPAAPKQNVR
YDVSYAATDAVVGTQSAVAAAFTADGGEADAPDGVAFAFKDGGDEGEASPVIDASTGAVTFTPAAGQYGATVTRTVVVTY
ADGSSDETTVTFRVAQSHAQRLNVLYPTVRGDAGTDLKRTPKFTLKADGAAASVPEGTTFALGANAPAGASVDMANGTVT
LNSGVGGTVTVPVTVTFADDGASVSSTARFEVTAPAALGSSELETATVDGVNVVYAPFSADSPMTVAQLLAKVTAEPSGA
DKGVYRDGVRLEAGAELAENDVLRFSAKGSTVSDDYVVKSKTTWDWVNDFQVRVQGPIWYGQRQTEADGVWSDIADFDAT
YPNWMYETYYGPGVDYANHSLPTDRSAIHGLISDSPASAGGSAMAWKAPKAGTVKVSIREDEPYLRQDGSNGKALTLRLM
HDDKVVCFADLTVSKQRSEEFANCVADKGEIAVEAGDWIRVTATSASGMNKPSAHISPVIAYMAASTPGPEPVPVDKSTL
KATVEEALGLAESDYTDESWAALVAARDAAQTVLDDDAATAEQVETAQNALRDAIDGLEKKPVDPDPNPKPDPNPDPDPT
PDPDPDPGPDTKPGDGSGNGSGTGNGSGSGNGSTGSGSDGATTGGKLTATGADVAGAAAMVALTAAAGIGLAAAARRRR
>P0DW94 ~~~lnbY~~~Chaperone for lacto-N-biosidase~~~
MPRRHRFAAAIAAVAVAAVLLVTLTVAVVTHGDGAFAPAGTPAGAGASAGIGSDTGSNASEDSDMFPTIVFGDTVIERKE
YVAALKAQHGAARLYFRQTYGVDPAEDGWDKAHDGEVPCRWLASRAIDELRRRHAAYLIGVDLGQVADDSYASIVARMEA
VNSGNAELKSDGGIVYGRTGFDIDSYLSYELSALKNAYTGDESNPGMSLSDDEVRRYYDEHDWTKDGVDGKTPLDEVRGN
VKAQMRSERYDELVSQRAEAIDVTDLPWDALYRFTAGRLG
>E8MF13 2.4.1.211~~~lnpA~~~1,3-beta-galactosyl-N-acetylhexosamine phosphorylase~~~
MTSTGRFTLPSEENFAEKTKELAELWGADAIRNSDGTHLDEAVLALGKKIYNAYFPTRAHNEWITLHMDETPQVYLLTDR
ILAESDTVDIPLMESFFAEQLKPNRDADPHKYWEVVDRTTGEVVDSANWTLDADEDTVHVSGVAAWHEYTVSFLAYIIWD
PVEMYNHLTNDWGDKEHEIPFDIYHPATRKFVFDTFEQWLKDSPQTDVVRFTTFFYQFTLLFDEKRREKVVDWFGCACTV
SPRALDDFEAKYGYRLRPEDFVDGGAYNSAWRVPRKAQRDWIDFLSGFVRENVKQLADMSHAAGKEAMMFLGDQWIGTEP
YKDGFDELGLDAVVGSIGDGTTTRMIADIPGVKYTEGRFLPYFFPDTFYEGNDPSIEGLDNWRKARRAILRSPISRMGYG
GYLSLAAKFPKFVDTVTHIANEFRDIHDRTGGVAAEGELNVAILNSWGKMRSWMAFTVAHALPNKQTYSYYGILESLSGM
RVNVRFISFDDVLAHGIDSDIDVIINGGPVDTAFTGGDVWTNPKLVETVRAWVRGGGAFVGVGEPSSAPRFQTGRFFQLA
DVIGVDEERYQTLSVDKYFPPVVPDHFITADVPVDPAAREAWEQAGYRIPLSGCGGGQSIKPLGGIDFGEPVLNTYPVNE
NVTLLRADGGQVQLATNDYGKGRGVYISGLPYSAANARLLERVLFYASHNEDKYAAWSSSNPECEVAHFPEQGLYCVINN
TDQPQKTTVTLADGTTEDFDLPDSGIAWREA
>P94438 2.7.13.3~~~lnrJ~~~Sensor histidine kinase LnrJ~~~COG4585
MKALFFTRMFTLMVSCLMYLSIVKEDNWFGYVFIAAGAAMYAANHVLLTKETNAIWFCLIDIAIGFSFGFIFPGTGLFII
MLCPVAVAFFLRGFPKRTAWSVLCLSSILFLTVLIRTYAMFGNEFVIDHLTSMTFVVFCGVVGKLIRKLLDAQDTAKQQF
QELTESHLALSAAHQELHLYAKQVEELTAIYERNRMAREIHDTVGHKMTALLVQLQLLREWQKRDSQKADETVGVCETLA
REALDDVRLSVRTLQTENDPSLIESLKQLTEDFCKNAGVTTEFAVSGDPAIIPLSLHPTLIRTVQEALTNAKRHGGAAAC
SIQLACTTDSISLVIKDDGKGNPEAALGFGLLNMKKRAAEHGGMIRFESERDQGFTVNAEFSLANKKWSFGPVQQKESLS
>P94439 ~~~lnrK~~~Transcriptional regulatory protein LnrK~~~COG2197
MIKIIITDDQDIVREGLASLLQLREELDVIATARNGQEAFEKAKELEPDIVLMDIRMPVSNGVEGTKLITSSLPSVKVLM
LTTFKDSALIAEALEEGASGYLLKDMSADTIVKAVMTVHSGGMVLPPELTAQMLNEWKREKQLKGINEIEKPNELLDLTE
REIEVLAELGYGLNNKEIAEKLYITEGTVKNHVSNIISKLAVRDRTQAAIYSVRYGVSVF
>P94440 7.6.2.-~~~lnrL~~~Linearmycin resistance ATP-binding protein LnrL~~~COG1131
MLQAENIKKAYGKKTIVKGISFSLKKGESFGLLGPNGAGKSTTISMISGLVPHDSGNITVGGYVIGKETAKAKQKIGIVP
QEIALYPTLTAHENLMFWGKMYGLTHGEAKKRAAEVLEYVGLTERAKDKIETFSGGMKRRINIGAALMHKPELLIMDEPT
VGIDPQSRNHILETVKQLNETGMTVIYTSHYMEEVEFLCDRIGIIDQGEMIAIGTKTDLCSRLGGDTIIQLTVSGINEAF
LVAIRSLAHVNDVTVHELELKIDISAAHHEKVVTSLLAEATAHHINLLSLQVQEPNLERLFLNLTGRTLRD
>P94441 ~~~lnrM~~~Linearmycin resistance permease protein LnrM~~~COG0842
MKKSIWIAWKDVKIRITDRKGFMMLILMPLILTCILGAALGSVVDGGSRIDDIKVGYIQSDQSDTANMFTKDVLKKMKSI
KVTKVGSKDKMKKLIDEKKIDVGIVIPNHWEAGKTSAVVNAAPDQTLKSSIIETAASSFIEQYKAVKEAASGSMDYISKT
EAVKQGKLDPAQFAEKLAKTLEKETGDKLTIAEKSVGSKAVTSFQYYSAAMLCMFMLFHITVGAKSFLQEKDTETLARML
MTPAQKSVILFGKWLGTYLFAIIQFFIFLIVTINVFGVDWGGNLLLVSVLGLSYAAAVSGISMLLASCISDMKTADAIGG
FGIQLLAVLGGSMLPLYQFPDVLQSVSKAVPNRWALDGFLSLMEGGGWADLQKPVLLFAAIGFCSLVIGIRRLHTR
>P94442 ~~~lnrN~~~Linearmycin resistance permease protein LnrN~~~COG0842
MKKILAICGIELSLIFKKPQNYLIMFAAPLLLTFVFGSMLSGNDDKVRLAIVDQDDTILSQHYIRQLKAHDDMYVFENMS
ESKASEKLKQKKIAGIIVISRSFQTQLEKGKHPELIFRHGPELSEAPMVKQYAESALATLNIQVTAAKTASQTAGENWKA
AYKTVFAKKHEDIVPAVTRQTLSDKKEGAEASDTASRAAGFSILFVMLTMMGAAGTILEARKNGVWSRLLTASVSRAEIG
AGYVLSFFVIGWIQFGILLLSTHWLFGINWGNPAAVIVLVSLFLLTVVGIGLMIAANVRTPEQQLAFGNLFVIATCMVSG
MYWPIDIEPKFMQSIAEFLPQKWAMSGLTEIIANGARVTDILGICGILLAFAAITFAAGLKALRA
>D5SL78 4.2.3.26~~~~~~R-linalool synthase~~~COG2124
MQEFEFAVPAPSRVSPDLARARARHLDWVHAMDLVRGEEARRRYEFSCVADIGAYGYPHATGADLDLCVDVLGWTFLFDD
QFDAGDGRERDALAVCAELTDLLWKGTAATAASPPIVVAFSDCWERMRAGMSDAWRRRTVHEWVDYLAGWPTKLADRAHG
AVLDPAAHLRARHRTICCRPLFALAERVGGYEVPRRAWHSSRLDGMRFTTSDAVIGMNELHSFEKDRAQGHANLVLSLVH
HGGLTGPEAVTRVCDLVQGSIESFLRLRSGLPELGRALGVEGAVLDRYADALSAFCRGYHDWGRGASRYTTRDHPGDLGL
ENLVARSSG
>Q8Y9T5 ~~~lntA~~~Listeria nuclear targeted protein A~~~
MKKLVAWFNGLSKMWKVVVIIGAVFVVIIALTTGEDEGEQTKTKKDSNKVVKTASRPKLSTKDLALIKADLAEFEARELS
SEKILKDTIKEESWSDLDFANDNINQMIGTMKRYQQEILSIDAIKRSSEASADTEAFKKIFKEWSEFKIERIQVTIDLLN
GKKDSEAVFKKTYPNQIIFKKVRTNKLQTALNNLKVGYELLDSQK
>Q8FJY4 2.3.1.269~~~lnt~~~Apolipoprotein N-acyltransferase~~~COG0815
MAFASLIERQRIRLLLALLFGACGTLAFSPYDVWPAAIISLMGLQALTFNRRPLQSAAIGFCWGFGLFGSGINWVYVSIA
TFGGMPGPVNIFLVVLLAAYLSLYTGLFAGVLSRLWPKTTWLRVAIAAPALWQVTEFLRGWVLTGFPWLQFGYSQIDGPL
KGLAPIMGVEAINFLLMMVSGLLALALVKRNWRPLVVAVVLFALPFPLRYIQWFTPQPEKTIQVSMVQGDIPQSLKWDGD
QLLNTLKIYYNATAPLMGKSSLIIWPESAITDLEINQQPFLKALDGELRDKGSSLVTGIVDARLNKQNRYDTYNTIITLG
KGAPYSYESADRYNKNHLVPFGEFVPLESILRPLAPFFDLPMSSFSRGPYIQPPLSLNGIQLTAAICYEIILGEQVRDNF
RPDTDYLLTISNDAWFGKSIGPWQHFQMARMRALELARPLLRSTNNGITAVIGPQGEIQAMIPQFTREVLTTNVTPTTGL
TPYARTGNWPLWVLTALFGFAAVLMSLRARKH
>P23930 2.3.1.269~~~lnt~~~Apolipoprotein N-acyltransferase~~~COG0815
MAFASLIERQRIRLLLALLFGACGTLAFSPYDVWPAAIISLMGLQALTFNRRPLQSAAIGFCWGFGLFGSGINWVYVSIA
TFGGMPGPVNIFLVVLLAAYLSLYTGLFAGVLSRLWPKTTWLRVAIAAPALWQVTEFLRGWVLTGFPWLQFGYSQIDGPL
KGLAPIMGVEAINFLLMMVSGLLALALVKRNWRPLVVAVVLFALPFPLRYIQWFTPQPEKTIQVSMVQGDIPQSLKWDEG
QLLNTLKIYYNATAPLMGKSSLIIWPESAITDLEINQQPFLKALDGELRDKGSSLVTGIVDARLNKQNRYDTYNTIITLG
KGAPYSYESADRYNKNHLVPFGEFVPLESILRPLAPFFDLPMSSFSRGPYIQPPLSANGIELTAAICYEIILGEQVRDNF
RPDTDYLLTISNDAWFGKSIGPWQHFQMARMRALELARPLLRSTNNGITAVIGPQGEIQAMIPQFTREVLTTNVTPTTGL
TPYARTGNWPLWVLTALFGFAAVLMSLRQRRK
>A0QZ13 2.3.1.269~~~lnt~~~Apolipoprotein N-acyltransferase~~~COG0815
MIPAVTDDDPLEDPLDDDVAPGLDDAEPEPEPRDEHDEPSRPATGSRIGGWVARRGSRFGKGVLDRCAPLSAAIGGGLAL
WLSFPPIGWWFTAFPGLALLGWVLTRTATTKAGGFGYGVLFGLAFYVPLLPWISGLVGAVPWLALAFAESLFCGLFGLGA
VVVVRLPGWPLWFATLWVAAEWAKSTFPFGGFPWGASSYGQTNGPLLALARIGGAPLVSFAVALIGFSLTLLTAQIVWWW
RHGHKPGVPAPAVMLPGVAIAASLLVTALVWPQVRQSGTGAGDDTAVTVAAVQGNVPRLGLEFNAQRRAVLDNHVKETLR
LADDVKAGRAAQPMFVIWPENSSDIDPLLNADASAQITTAAEAIDAPILVGGVVRADGYTPDNPVANNTVIVWEPTDGPG
ERHDKQIVQPFGEYLPWRGFFKHLSSYADRAGYFVPGTGTGVVHAAGVPIGITTCWEVIFDRAARESVLNGAQVLAVPSN
NATFDEAMSAQQLAFGKLRAVEHDRYVVVAGTTGISAVIAPDGHEISRTEWFQPAYLDNQIRLKTDLTPATKWGPIVQAV
LVIAGVAVLLIAILHNGRFAPRMLRRRSATTVKR
>Q9ZI86 2.3.1.269~~~lnt~~~Apolipoprotein N-acyltransferase~~~
MRWISRPGWPGHLLALAAGALTPLALAPFDYWPLAILSIALLYLGLRGLPGKSALWRGWWYGFGAFGAGTSWIYVSIHDY
GAASVPLASLLMLGFTAGVAFFFALPAWLWARCLRRDNAPLGDALAFAALWLALELFRSWFLTGFPWLYAGYSQLQGPLA
GLVPVGGVWLSSFVIALSAALLVNLPRLFPHGASLLLGLVLLLGPWAAGLYLKGHAWTHSAGEPLRVVAIQGNIAQELKW
DPNQVRAQLDLYRDLSLPQQDVDLIVWPETAVPILQDMASGYLGAMGQVADEKNAALITGVPVRERLADGKSRYFNGITV
VGEGAGTYLKQKLVPFGEYVPLQDLLRGLIAFFDLPMSDFARGPADQPLLKAKGYQIAPYICYEVVYPEFAAALAAQSQV
LLTVSNDTWFGTSIGPLQHLQMAQMRALESGRWMIRATNNGVTGLIDPYGRIVRQIPQFQQGILRGEVIPMQGLTPYLQY
RVWPLAGLAGVLLLWALLGRQLRPQERRLFG
>F2JXJ3 1.4.3.20~~~lodA~~~L-lysine 6-oxidase~~~
MALSVHPSIGVARLGNANTDNFVLNPMEIGGLPYEHDVDLKPTTTVVNFKDEAGCIRRQGQVFKVFGASNEELTLDSPNV
KNIEWTVHLANKKAAWYEFRELNGNLLYGRDNSYSARGVPWRNASKTASSERQSLIIDLGPRSVSGVMATVEISINNIPE
TYLHPSYPSGELLQGSKHFESLGTLRTDSQGRLIVLGGYGFAGGNTDLSGYGGGDDWYDDISDGSVTCVVTYSDDSSETS
TAWMVVGSPDFAPEIVNISTLSDTCFDVGVRNFDLVPDMYDSATGHYKSDYVANFDRDILPIIQRISQYQWVSNVQSMSG
FFSFQFDYRDGSAANKANRMKYYNYFRQLDNKVIGDYDQPQQVLMSSEVEGDILPLMPMNSGSNSVSSSNFYDLTDNVVE
KFLALDATQLFLLGQWAEGEFTAGPADDYPVSDMDTASIGNCVGLPMCPGIEMTWSLQNPVIYKDAYQIKHYQDKAYFDV
NGLTPERDECEEETGCEPGDLTKRMACPWQADFFNCTIQTVNFSEPSVNKASQTETVTSRTHYEWGNLPAGVSVPDQSSV
SATKNVDEKVPLPPAYYSYWWPPQSPWDVLTGELDTEGQLHSHLPAGQQINYARGINSYSQMVEHWSALAFIRDRNQNND
GFPFFTETERNHELFDFKEVLVGQVTGNSEDNETSLPVFFINANKESLEGKGTKKGKLMASYFEERAFSKVRSSNIRPRS
GTRMRG
>F2JXJ2 1.-.-.-~~~lodB~~~Putative FAD-dependent oxidoreductase LodB~~~COG0644
MESYDAIVIGGGPAGAASALSLLTHHNKRVLLLERGDFSQARIGEQVSHSIFDFLAYLDLPVSEFGESCFSPNYGKTSLW
GSSIESHHLSMFATQGATYQLDRAAFDETLLMAFVERGGTVIPRCKQMKIEQSDSVWQVQFVHPEQGEQTVCCDYLVDAS
GRQSKLSAMLGVEPVMDDQLVGVGAFIRNPDNAFEQHQRIESCEYGWWYMAGLSSELAVVTCFTDMDIMREMRLNKASVW
NQYLAETSAIADCVKGSETTHPKLWVKQAHSQYCTSELPDRFIAVGDAALSFDPVSSMGIGFAMTSACHSTRALVSDSKD
AVLQYQQDMARIYQEYHVTKTRIYQREKRWPNQLFWQRRHAFSALQHAS
>B2HS63 3.2.2.n1~~~~~~Cytokinin riboside 5'-monophosphate phosphoribohydrolase~~~COG1611
MTAKSDEPGRWTVAVYCAAAPTHPELLELAGAVGAAIAARGWTLVWGGGHVSAMGAVSSAARAHGGWTVGVIPKMLVHRE
LADHDADELVVTETMWERKQVMEDRANAFITLPGGVGTLDELLDVWTEGYLGMHDKSIVVLDPWGHFDGLRAWLSELADT
GYVSRTAMERLIVVDNLDDALQACAPG
>O05306 3.2.2.n1~~~log~~~Cytokinin riboside 5'-monophosphate phosphoribohydrolase~~~COG1611
MSAKIDITGDWTVAVYCAASPTHAELLELAAEVGAAIAGRGWTLVWGGGHVSAMGAVASAARACGGWTVGVIPKMLVYRE
LADHDADELIVTDTMWERKQIMEDRSDAFIVLPGGVGTLDELFDAWTDGYLGTHDKPIVMVDPWGHFDGLRAWLNGLLDT
GYVSPTAMERLVVVDNVKDALRACAPS
>P48636 3.2.2.n1~~~~~~Cytokinin riboside 5'-monophosphate phosphoribohydrolase~~~
MTLRSVCVFCGASPGASPVYQEAAVALGRHLAERGLTLVYGGGAVGLMGTVADAALAAGGEVIGIIPQSLQEAEIGHKGL
TRLEVVDGMHARKARMAELADAFIALPGGLGTLEELFEVWTWGQLGYHAKPLGLLEVNGFYDPLLTFLDHLVDERFVRAE
HRGMLQRGASPEALLDALAAWTPSVAPKWVDRTPQ
>P46378 3.2.2.n1~~~fas6~~~Cytokinin riboside 5'-monophosphate phosphoribohydrolase~~~COG1611
MNLRPMPATTVSAQARPTPKSVTVFCGAMPGRGTKYGQLAEGMGRAIARSKLRLVYGGARVGLMGTLANAALDSGGTVVG
VIPESFTAIPEAAHHGLTELHVVHDMHQRKALMAELGDAFIALPGGVGTAEEFFEVLTWSHLGLHNKPCVLLNDNEYYRP
LLSYIEHAAVEGFITPATRSRVIVCKDIEGAIAAIRSP
>P25894 3.4.24.-~~~loiP~~~Metalloprotease LoiP~~~COG0501
MKIRALLVAMSVATVLTGCQNMDSNGLLSSGAEAFQAYSLSDAQVKTLSDQACQEMDSKATIAPANSEYAKRLTTIANAL
GNNINGQPVNYKVYMAKDVNAFAMANGCIRVYSGLMDMMTDNEVEAVIGHEMGHVALGHVKKGMQVALGTNAVRVAAASA
GGIVGSLSQSQLGNLGEKLVNSQFSQRQEAEADDYSYDLLRQRGISPAGLATSFEKLAKLEEGRQSSMFDDHPASAERAQ
HIRDRMSADGIK
>P39917 ~~~lolA~~~Outer-membrane lipoprotein carrier protein~~~COG2834
MNTIKILIGLLGIFLFSLSGIVSAQSDATTQLSQLLSNFRTYQAKFNQITFDGQDRVIQQSHGRVMIMRPGRFRWETDSP
TKQIIITNGKTLWVYDVDLSQATQQPLAQKTNINPASLLSGSVKDLKQKFTITISPTPDAATFQLVPNLGKSLNFNWIRL
KFSKKQLTEMTVLNNLDERSIFQFSQIKVNAPLSSTLFEFKPSRGIDVVKQ
>P61316 ~~~lolA~~~Outer-membrane lipoprotein carrier protein~~~COG2834
MKKIAITCALLSSLVASSVWADAASDLKSRLDKVSSFHASFTQKVTDGSGAAVQEGQGDLWVKRPNLFNWHMTQPDESIL
VSDGKTLWFYNPFVEQATATWLKDATGNTPFMLIARNQSSDWQQYNIKQNGDDFVLTPKASNGNLKQFTINVGRDGTIHQ
FSAVEQDDQRSSYQLKSQQNGAVDAAKFTFTPPQGVTVDDQRK
>Q9I0M4 ~~~lolA~~~Outer-membrane lipoprotein carrier protein~~~
MRLIRTLFVAALAMGASLAHADDSAAVQRLTGLLNKAQTLTARFSQLTLDGSGTRLQETAGQLSLKRPGLFRWHTDAPNE
QLLISNGEKVWLYDPDLEQVTIQKLDQRLTQTPALLLSGDISKISESFAITYKEGGNVVDFVLKPKTKDTLFDTLRLSFR
SGKVNDMQMIDGVGQRTNILFFDVKMNEALDAKQFTFDVPPGVDVIQE
>Q8ZGC6 ~~~lolA~~~Outer-membrane lipoprotein carrier protein~~~COG2834
MKRLLVACCFLSGLISASALADASTDLQNRLSKVNSFHASFSQAVTSSDGAVVQEGEGELWVKRPNLFNWHMTSPDESVL
ISDGETLWFYNPFVEQATATWLKNATGNTPFMLITRNNPDDWKQYNVKQKGDDFELTPKSASGNLKQFAISVTPSGTIKS
FTAVEQDGQRSAYTLKSQQSSVVDASKFTFTPPKGVTLDDQR
>Q83AQ2 ~~~lolB~~~Outer-membrane lipoprotein LolB~~~COG3017
MSLISNNEERSLRVRYCIAIALSALLISGCTTLRLPNQSTSVYHQQTWAQRYYDLSRISQWNIDGAFSIQQPGKTIIAAY
DWQQKGMNYRIRIHSSLDIYSVNISGRPGMVTLWRSPRQHYTASTPEQLMQQQLGWQLPLSNLYYWIRGIPAPGAYQADF
DTYTHLIALQQSGWHIRFSQYTTVGSVDLPRTLQLSNGSLAVKIVVKHWQ
>P61320 ~~~lolB~~~Outer-membrane lipoprotein LolB~~~COG3017
MPLPDFRLIRLLPLAALVLTACSVTTPKGPGKSPDSPQWRQHQQDVRNLNQYQTRGAFAYISDQQKVYARFFWQQTGQDR
YRLLLTNPLGSTELELNAQPGNVQLVDNKGQRYTADDAEEMIGKLTGMPIPLNSLRQWILGLPGDATDYKLDDQYRLSEI
TYSQNGKNWKVVYGGYDTKTQPAMPANMELTDGGQRIKLKMDNWIVK
>P0ADC3 ~~~lolC~~~Lipoprotein-releasing system transmembrane protein LolC~~~COG4591
MYQPVALFIGLRYMRGRAADRFGRFVSWLSTIGITLGVMALVTVLSVMNGFERELQNNILGLMPQAILSSEHGSLNPQQL
PETAVKLDGVNRVAPITTGDVVLQSARSVAVGVMLGIDPAQKDPLTPYLVNVKQTDLEPGKYNVILGEQLASQLGVNRGD
QIRVMVPSASQFTPMGRIPSQRLFNVIGTFAANSEVDGYEMLVNIEDASRLMRYPAGNITGWRLWLDEPLKVDSLSQQKL
PEGSKWQDWRDRKGELFQAVRMEKNMMGLLLSLIVAVAAFNIITSLGLMVMEKQGEVAILQTQGLTPRQIMMVFMVQGAS
AGIIGAILGAALGALLASQLNNLMPIIGVLLDGAALPVAIEPLQVIVIALVAMAIALLSTLYPSWRAAATQPAEALRYE
>O66646 7.6.2.-~~~lolD~~~Lipoprotein-releasing system ATP-binding protein LolD~~~COG1136
MAEILRAENIKKVIRGYEILKGISLSVKKGEFVSIIGASGSGKSTLLYILGLLDAPTEGKVFLEGKEVDYTNEKELSLLR
NRKLGFVFQFHYLIPELTALENVIVPMLKMGKPKKEAKERGEYLLSELGLGDKLSRKPYELSGGEQQRVAIARALANEPI
LLFADEPTGNLDSANTKRVMDIFLKINEGGTSIVMVTHERELAELTHRTLEMKDGKVVGEITRV
>P75957 7.6.2.-~~~lolD~~~Lipoprotein-releasing system ATP-binding protein LolD~~~COG1136
MNKILLQCDNLCKRYQEGSVQTDVLHNVSFSVGEGEMMAIVGSSGSGKSTLLHLLGGLDTPTSGDVIFNGQPMSKLSSAA
KAELRNQKLGFIYQFHHLLPDFTALENVAMPLLIGKKKPAEINSRALEMLKAVGLDHRANHRPSELSGGERQRVAIARAL
VNNPRLVLADEPTGNLDARNADSIFQLLGELNRLQGTAFLVVTHDLQLAKRMSRQLEMRDGRLTAELSLMGAE
>P75958 ~~~lolE~~~Lipoprotein-releasing system transmembrane protein LolE~~~COG4591
MAMPLSLLIGLRFSRGRRRGGMVSLISVISTIGIALGVAVLIVGLSAMNGFERELNNRILAVVPHGEIEAVDQPWTNWQE
ALDHVQKVPGIAAAAPYINFTGLVESGANLRAIQVKGVNPQQEQRLSALPSFVQGDAWRNFKAGEQQIIIGKGVADALKV
KQGDWVSIMIPNSNPEHKLMQPKRVRLHVAGILQLSGQLDHSFAMIPLADAQQYLDMGSSVSGIALKMTDVFNANKLVRD
AGEVTNSYVYIKSWIGTYGYMYRDIQMIRAIMYLAMVLVIGVACFNIVSTLVMAVKDKSGDIAVLRTLGAKDGLIRAIFV
WYGLLAGLFGSLCGVIIGVVVSLQLTPIIEWIEKLIGHQFLSSDIYFIDFLPSELHWLDVFYVLVTALLLSLLASWYPAR
RASNIDPARVLSGQ
>P37945 3.4.21.53~~~lon1~~~Lon protease 1~~~COG0466
MAEELKRSIPLLPLRGLLVYPTMVLHLDVGRDKSVQALEQAMMHDHMIFLATQQDISIDEPGEDEIFTVGTYTKIKQMLK
LPNGTIRVLVEGLKRAHIVKYNEHEDYTSVDIQLIHEDDSKDTEDEALMRTLLDHFDQYIKISKKISAETYAAVTDIEEP
GRMADIVASHLPLKLKDKQDILETADVKDRLNKVIDFINNEKEVLEIEKKIGQRVKRSMERTQKEYYLREQMKAIQKELG
DKEGKTGEVQTLTEKIEEAGMPDHVKETALKELNRYEKIPSSSAESSVIRNYIDWLVALPWTDETDDKLDLKEAGRLLDE
EHHGLEKVKERILEYLAVQKLTKSLKGPILCLAGPPGVGKTSLAKSIAKSLGRKFVRISLGGVRDESEIRGHRRTYVGAM
PGRIIQGMKKAGKLNPVFLLDEIDKMSSDFRGDPSSAMLEVLDPEQNSSFSDHYIEETFDLSKVLFIATANNLATIPGPL
RDRMEIINIAGYTEIEKLEIVKDHLLPKQIKEHGLKKSNLQLRDQAILDIIRYYTREAGVRSLERQLAAICRKAAKAIVA
EERKRITVTEKNLQDFIGKRIFRYGQAETEDQVGVVTGLAYTTVGGDTLSIEVSLSPGKGKLILTGKLGDVMRESAQAAF
SYVRSKTEELGIEPDFHEKYDIHIHVPEGAVPKDGPSAGITMATALVSALTGRAVSREVGMTGEITLRGRVLPIGGLKEK
ALGAHRAGLTTIIAPKDNEKDIEDIPESVREGLTFILASHLDEVLEHALVGEKK
>P36773 3.4.21.53~~~lon1~~~Lon protease 1~~~
MFFGRDDKKEAQKRGLTVPLLPLRDIIVFPHMVVPLFVGREKSIAALKDAMAHKGPDDKAVILLAAQKKAKTNDPTPDDI
FHFGTLGHVIQLLPLPDGTVKVLVEGVRRAKVKKFHPNDAFFMVEVEEVEEQTEKTVELEALVRSVHSVFEAFVKLNKRI
PPEMLMQVASIDDPARLADTIVAHLSLKLNDKQALLETESPAKRLEKLYELMQGEIEILQVEKKIRTRVKKQMEKTQKEY
YLNEQMQAIQKELGERDEFKNEIQEIEEKLKNKRMSKEATLKVKKELKKLRMMSPMSAEATVVRNYIDWIISLPWYDETQ
DRLDVTEAETVLNEDHYGLKKPKERILEYLAVQQLVKKLKGPVLCFVGPPGVGKTSLARSIARATGRKFVRLSLGGVRDE
AEIRGHRRTYIGAMPGKLIQSLKKAGSNNPVFLLDEIDKMSTDFRGDPSAALLEVLDPEQNHTFNDHYLDLDYDLSKVMF
ICTANTMHNIPGPLQDRMEVIRIAGYTEPEKLSIARRYLIPKEQEANGLSDLKVDISDPALRTIIHRYTRESGVRSLERE
IGGVFRKIARDVLKNGKRDIDVDRKMAMKFLGTPRYRYGMAEAEDQVGIVTGLAWTELGGEILTTEATIMPGKGKLIITG
KLGEVMQESAQAAMSYVRSRAERFGIDRKVFENYDIHVHLPEGAIPKDGPSAGVTICTALVSALTRVLIRRDVAMTGEIT
LRGRVLPIGGLKEKTLAAHRAGIKTVLIPKANKKDLKDIPLKIRKQLRIVPVEFVDDVLREALVLEKPEEFGRKPTTDGG
KLGGTTELPASPAVAPA
>Q72KS4 3.4.21.53~~~lon1~~~Lon protease 1~~~COG0466
MKDFLRLELPVLPLRNTVVLPHTTTGVDVGRLKSKRAVEEALSADRLLFLVTQKDPEVDDPAPEDLYAVGTLAVVKQAMR
LPDGTLQVMVEARSRARLLSYVAAPYLRAVGEAIPEPPLKDPELARVLVNEVQEAFERYLQNHKTLRLDRYQQEAVKSTR
DPAILADLVAHHATWTLEEKQTILETPEVEERLKRVLALLLRDLERFELDKKIAARVKEQMDQNQREYYLREQMKAIQKE
LGGGEDFLTEIEELRERIEKKGMPEPVKEKALKELKRLERMQPGSPEATVSRTYLDWLLEVPWTEADPEVLDISVTKRVL
DEDHYGLKEVKERILEYLAVRQLTQGKEVKGHAPILCFVGPPGVGKTSLGKSIARSMNRRFHRISLGGVRDEAEIRGHRR
TYIGALPGKIIQGMKQVGVVNPVFLLDEIDKLSSDWRGDPAAALLEVLDPEQNHTFTDHYLDVPYDLSKVFFITTANTLS
TIPRPLLDRMEVIEIPGYTLHEKRAIARYFRWPFQVKEAGLEGRLEITDRAIERIVQEYTREAGVRNLDRELSKVARKAA
KDYLEKPWEGVRVVDAEDLEAYLGVPKYRPDRAEKEPQVGAAQGLAWTPYGGTLLTIEAVAVPGTGKVNLTGNLGEVMKE
SAHAALTYLRAHREEWGLPEGFHKDYDLHIHVPEGATPKDGPSAGITIATALASALTGRPVRMDIAMTGEITLRGRVLPI
GGVKEKLLAAHQAGIHRVILPKENAAELKEVPEEILKDLEIHFVEEVGEVLKLLLLPPPPPPAVQPDRPQPGVGA
>P42425 3.4.21.53~~~lon2~~~Lon protease 2~~~COG0470
MSWTGIALFIQLFFGIIIGLYFWNLLKNQRTQKVTIDKESKKEMEQLRKMRAISLSEPLSEKVRPKSFKDIVGQEDGIKA
LKAALCGPNPQHVIVYGPPGVGKTAAARLVLEEAKKHKQSPFKEQAVFVELDATTARFDERGIADPLIGSVHDPIYQGAG
AMGQAGIPQPKQGAVTHAHGGVLFIDEIGELHPIQMNKMLKVLEDRKVFLDSAYYSEENTQIPNHIHDIFQNGLPADFRL
IGATTRMPNEIPPAIRSRCLEVFFRELEKDELKTVAKTAADKIEKNISEEGLDLLTSYTRNGREAVNMIQIAAGMAVTEN
RKDITIEDIEWVIHSSQLTPKHEQKIGVEPQVGIVNGLAVYGPNSGSLLEIEVSVTAAQDKGSINITGIAEEESIGSQSK
SIRRKSMAKGSVENVLTVLRTMGMKPSDYDIHINFPGGIPIDGPSAGIAMAAGIFSAIHKIPIDNTVAMTGEISLNGLVK
PIGGVIPKIKAAKQSGAKKVIIPYENQQAILKQIDGIEIIAVKTFQEVLDEILVNPPTEQKPFHIEINKESV
>P36774 3.4.21.53~~~lon2~~~Lon protease 2~~~
MSDEKKKGSAASAMPTAMAPPGLINKEDIPQVLPILPLRNSVFFPGGVLPLAVGRQKTIALIKDAVRDDQVIGVVTQRRA
EEEDPGAADLYTMGTVARIVKLLKMGEDNYSLVVQGLARFRVVELVQEAPYLKARVDAVEDKTSSENVEVEALGINLKKL
AREVIELMPELPAAATELVESITHPGHLADLIAANVDVPIEEKQAVLETVDLKARMKLVLELLNRKREILKLSNKIDSAV
KGEMSKTQREYYLRQQLKAIKEELGEMGEEEEELDELQERLKKAGLPPDVEKVANKELNRLKTIPAASSEYTVARTYLDW
IADLPWAKISEDNLDIENARQQLDKDHFGIKKVKKRILEYLAVRKLKNDMRGPILCLVGPPGVGKTSLGQSVAKATGRKF
VRLSLGGVRDEAEIRGHRRTYVGALPGRFIQSMKKAGTKNPVMMLDEIDKLGADFRGDPSAALLEVLDPEQNNTFSDHYL
DVPFDLSKVMFVATANQLDPIPGPLRDRMEIIELTGYTFEEKQSIARIHLVPKQLKEHGLSPDHIDITDEALLTLTTAYT
REAGVRNLERRIADICRAVAVEVAGGKTEKQTINADRVKEILGPEMFYSEVAERTEVPGVATGLAWTAAGGDLLFIEATK
MAGKGGMTLTGQLGDVMKESATAALSYLRSKAEQLGISPNFLEKTDLHLHFPAGSIPKDGPSAGVTILTALTSLLTGIRV
RHDTAMTGEATLRGLVLPVGGIKEKVLAAHRAGIKRVILPERCRKDLIDVPDQARNELEFIFVTHMDDVLKAALETPPVG
VAGTPGGEPGKEAPLPKPAESAPEVRA
>O66605 3.4.21.53~~~lon~~~Lon protease~~~COG0466
MNELFQTPQVEAGIKEYPLMPLRDIVIFPTMVQPLFVGRRFSIRAIEEANKKDKLIFLVLQKDKDVEEPKEEDIYKVGVV
AYILRTVPIEDARVKVLVQGLKRGVIKKLEWKEDHYVAQVDVIEERDIPPESQTIEDKALIKAVKESIDKLVSLGKQIIP
DLVVLIKELEEPGKLADMVASILDIKSSQAQEILETFDPRERLKKVYKFLQDEIGLLEVKQRISEIARERMEKEQREYYL
RQQLKAIQEELGEAGGIKAEIEEYTKKFEEVKECMPEEGVKEVEKNIKRLERLHPESAEAGVIRTWLDWVLDLPWCTRTE
DNYDLERAREILDRDHYDLEKVKDRIIEYLAIRKLTQGKEAPTQILAFVGPPGVGKTSLGRSIAEALGRKFVRIALGGIR
DEAEIRGHRRTYVGAMPGRIIQAIKQAGTKNPVIMLDEIDKLAISFQGDPAAALLEVLDPEQNKKFTDLYIGIPFDLSEV
IFICTGNRADTIPTPLLDRMELIMLSGYSEEEKLFIAKKHLIPKLIPLHGFSPEEIEFTDEAILEIIRGYTREAGVRNLQ
RQISAVLRKIAVKKLQGEKGPFNITPELVRKLLGVPRYRPEREKKPLVGVATGLAWTEVGGEIMFIEATKMKGKGSLVLT
GSLGDIMKESAQAALSYIRSKAEDYGIDPDIFSQVDVHVHVPEGAVPKDGPSAGVAIATALLSLFTDIPVRMDVAMTGEI
TLRGRVLPVGGLKEKILAAKRAEIYEVILPAKNKDEVMEELPEYVREKMTLHFVDNLEEVFKIALVREPKPLKEA
>P77810 3.4.21.53~~~lon~~~Lon protease~~~
MKEAQSMFEIPRGALYPVPPLRDIVVFPHMIVPLFVGREKSVRALEDVMKDDKQILLVTQKNAAQDDPTPADIYSVGTVG
TVLQLLKLPDGTVKVLVEGGQRASITKFAENEDFFQAHADLVEEKVGESQELEALGRAVVSQFEQYIKLNKKIPPEVLVS
INQIEEPGKLADTVASHLALKIPEKQQLLECATVSERLERVYAFMEGEIGVLQVEKRIRNRVKRQMEKTQREYYLNEQLK
AIQKELGETEDGRDESAELEEKINKTRFSKEARDKALAELKKLRSMSPMSAEATVVRNYLDWMLSIPWKKRTKVKKDLKL
AQKILDADHYGLEKVKERILEYLRVQNRMNKVKGPIQSLVGPPGVGKTSLGKSIAKSTGRNFVRMSLGGVRDEAEVRGHR
RTYIGSMPGKVIQGMKKAKSSNPLFLLDEIDKLGADWRGDPSSALLEVLDPEQNGTFNDHYLEVDYDLSDVMFVCTANTM
RMPQPLLDRMEIIRVAGYTEDEKVEISKRHLIEKQVEANGLKKGEFAISDDALRDLIRYYTREAGVRSLEREIANLCRKA
VKEILMKGSAGAKVSVTRRNLDKYAGVRRFHFGEAELEDLVGVTTGLAWTEVGGELLSIEAVSLPGKGRVTTTGKLGDVM
KESVQAAESYVKSRATAFGIKPTLFEKRDIHVHVPEGATPKDGPSAGVAMITSIVSVLTGIAVRKDVAMTGEITLRGRVL
PIGGLKEKLLAALRGGLKHVLIPKDNEKDLAEIPDNVKRGLEIIPVSTVDDVLKHALVREVEPIEWKEPEAVEPAVAKPQ
TDGGGEVLRH
>Q2YPX3 3.4.21.53~~~lon~~~Lon protease~~~
MTGIEQKTPVGGSETGGADGLYAVLPLRDIVVFPHMIVPLFVGREKSIRALEEVMGVDKQILLATQKNAADDDPAPDAIY
EIGTIANVLQLLKLPDGTVKVLVEGTARAKISKFTDREDYHEAYAAALQEPEEDAVEIEALARSVVPDFENYVKLNKKIS
PEVVGAASQIDDYSKLADTVASHLAIKIPEKQEMLSVLSVRERLEKALSFMEAEISVLQVEKRIRSRVKRQMEKTQREYY
LNEQMKAIQKELGDSEDGRDEVAEIEERITKTKLSKEAREKALAELKKLRSMSPMSAEATVVRNYLDWLLSIPWGKKSKV
KQDLNFAQEVLDAEHFGLGKVKERIVEYLAVQARSTKIKGPILCLVGPPGVGKTSLARSIAKATGREYVRMSLGGVRDEA
EIRGHRRTYIGSMPGKVIQSMKKAKKSNPLFLLDEIDKMGQDFRGDPSSAMLEVLDPEQNATFMDHYLEVEYDLSNVMFV
TTANTMNIPVPLLDRMEIIRIAGYTEDEKLEIAKRHLLPKAIKDHALQPKEFSVTEDALRNVIRHYTREAGVRSLEREVM
TLARKAVTEILKTKKKSVKITDKNLSDYLGVEKFRFGQIDGEDQVGVVTGLAWTEVGGELLTIEGVMMPGKGRMTVTGNL
RDVMKESISAAASYVRSRAIDFGIEPPLFDKRDIHVHVPEGATPKDGPSAGIAMVTAIVSVLTGIPVRKDIAMTGEVTLR
GRVLPIGGLKEKLLATLRGGIKKVLIPEENAKDLAEIPDNVKNNLEIVPVSRVGEVLKHALVRQPEPIEWTEQENPTAVP
PVEDEAGASLAH
>O69300 3.4.21.53~~~lon~~~Lon protease~~~COG0466
MQIEEIQNYPANLPVLVEDELFLYPFMITPIFINDSSNMKALDLAIKNDSMLFVAPSKLENGRNFDEIYNCGVIGTIMRK
VPLPDGRVKILFQGYAKGKIIEQISNKPLEAKIELIKEDFLEGTKKEALLEVLKEKVKNLANISHYFSPDLLRTIEEGFD
ASRICDLILNTVRIKKQVAYEFFVLTDLEQKLVKLIDLIAQEIEANKIQKEIKNKVHSRIDKVNKEYFLKEQLRQIQKEL
GSDTQKEDEVREYQKRLELKKKFMHEDAYKEIKKQIEKFERIHQDNSEASMIQTYIETALDIPFEKISKKKLDIKEVSKQ
LNHDHYALNKPKERIEEYFAVRELLEKRKIAEKDGAKVILCLYGPPGVGKTSLANSVSKALKRELIRIALGGLEDVNELR
GHRRTYIGAMPGRITQGLIEAKQINPVIVLDEIDKLNRSFRGDPSAVLLEILDPEQNSKFRDYYLNFNIDLSKVIFIATA
NDISNIPAPLRDRMEFIELSSYTPSEKFHIMKKYLIPDELKKHGLKSNELSIDDETIELIISDYTRESGVRNLRRKVAEL
CRKSAKKLLLENIKKVIINTKNLNEFLDKKVFEIEKNNGENQVGQVNGLAWTSVGGDVLKVEAVKIKGKGELTLTGSLGD
VMKESARIAFSMIKVLIDEGKIKIPKKIIIDPKVNVYDSYNIHIHVPDGATPKDGPSAGITISTAIASIFSDKKVKADVA
MTGEIDLKGKVLPIGGLKEKLIAAYKADIKTALIPRKNYERDLKDIPSEVRDNMEIIAVDTFSDVLEYTLV
>B8GX12 3.4.21.53~~~lon~~~Lon protease~~~
MSELRTLPVLPLRDIVVFPHMVVPLFVGRDKSVRALEEVMRGDKQILLVTQKNSADDDPAPGDIFEVGVLATVLQLLKLP
DGTVKVLVEGKARAAVVSFTDQESYYEAQIGEVSEDDGAGPEAEALSRAVVEQFENYVKLNKKVPPEALASIPQIAEPGK
LADSIAAHLSVKIGDKQNLLEIFDVVKRLEKVFALMEGEISVLQVEKKIRSRVKRQMEKTQREYYLNEQMKAIQRELGDP
DDARDELIDLEKRIKKTKLSKEARTKAESELKKLRNMSPMSAESTVVRNYLDWLLSIPWGKAKTKKIDLVESEGILDADH
YGLEKVKERILEYLAVQARTNSLKGPILCLVGPPGVGKTSLGKSIAKATGREFVRMSLGGVRDEAEIRGHRRTYIGSMPG
KVVQSMKKAKTTNAFVLLDEIDKMGSDYRGDPASALLEVLDPSQNSTFGDHYLEVDYDLSQVMFVTTANSLNMPQPLLDR
MEIIRIPGYTEDEKLEIAKRHILPKLAKDHGLKPAEFIVPDKAIRDLIRYYTREAGVRSLERELGALARKTVRDLAREKV
ASITIDDERLAKYAGVKKYRYGETDEVDQVGIVTGLAWTEFGGDILTIEAVKMPGKGRMQITGNLKDVMKESIAAANSYV
RSRALQFGIKPPVFEKTDVHIHVPDGATPKDGPSAGIAMALAMVSVLTGIPIRKDIAMTGEITLRGRVTAIGGLKEKLLA
ALRSGVKTVLIPQENEKDLADVPQTVKDGLEIIPVSTVDEVLKHALTGPLTPVEWNEAEEPITTSAKKDDGDSDAMLTH
>P0A9M0 3.4.21.53~~~lon~~~Lon protease~~~COG0466
MNPERSERIEIPVLPLRDVVVYPHMVIPLFVGREKSIRCLEAAMDHDKKIMLVAQKEASTDEPGVNDLFTVGTVASILQM
LKLPDGTVKVLVEGLQRARISALSDNGEHFSAKAEYLESPTIDEREQEVLVRTAISQFEGYIKLNKKIPPEVLTSLNSID
DPARLADTIAAHMPLKLADKQSVLEMSDVNERLEYLMAMMESEIDLLQVEKRIRNRVKKQMEKSQREYYLNEQMKAIQKE
LGEMDDAPDENEALKRKIDAAKMPKEAKEKAEAELQKLKMMSPMSAEATVVRGYIDWMVQVPWNARSKVKKDLRQAQEIL
DTDHYGLERVKDRILEYLAVQSRVNKIKGPILCLVGPPGVGKTSLGQSIAKATGRKYVRMALGGVRDEAEIRGHRRTYIG
SMPGKLIQKMAKVGVKNPLFLLDEIDKMSSDMRGDPASALLEVLDPEQNVAFSDHYLEVDYDLSDVMFVATSNSMNIPAP
LLDRMEVIRLSGYTEDEKLNIAKRHLLPKQIERNALKKGELTVDDSAIIGIIRYYTREAGVRGLEREISKLCRKAVKQLL
LDKSLKHIEINGDNLHDYLGVQRFDYGRADNENRVGQVTGLAWTEVGGDLLTIETACVPGKGKLTYTGSLGEVMQESIQA
ALTVVRARAEKLGINPDFYEKRDIHVHVPEGATPKDGPSAGIAMCTALVSCLTGNPVRADVAMTGEITLRGQVLPIGGLK
EKLLAAHRGGIKTVLIPFENKRDLEEIPDNVIADLDIHPVKRIEEVLTLALQNEPSGMQVVTAK
>P46067 3.4.21.53~~~lon~~~Lon protease~~~
MNPERSERIEIPVLPLRDVVVYPHMVIPLFVGREKSIRCLEAAMDHDKKIMLVAQKEASTDEPGINDLFSVGTVASILQM
LKLPDATVKVLVEGLQRARISALSDNGDHFTAKAEYLTSPEIEEREQEVLVRTAINQFEGYIKLNKKIPPEVLTSLNSID
DAARLADTVAAHMPLKLSDKQSVLEMSDVDERLEYLMAMMESEIDLLQVEKRIRNRVKKQMEKSQREYYLNEQMKAIQKE
LGEMDDAPDENEALKRKIDAARMPKEAREKTEAELQKLKMMSPMSAEATVVRGYIDWMVQVPWNARSKVEKDLQKRRQTL
DTDHFGLERVKDRILEYLAVQSRVSKIKGPILCLVGPPGVGKTSLGQSIAKATGRKYVRMALGGVRDEAEIRGHRRTYIG
SMPGKLIQKMAKVGVKNPLFLLDEIDKMSSDMRGDPASALLEVLDPEQNIAFNDHYLEVDYDLSDVMFVATSNSMNIPAP
LLDRMEVIRLSGYTEDEKLNIAKKHLLSKQIERNALKEHELIVDDSAIVGIIRYYTRERGVRSLERELSKLCRKAVKTLL
MDKSVKHIEINADNLKDYLGVQRYDYGRADSENRVGQVTGLAWTEVGGDLLTIETACVPGKGKLTYTGSLGEVMQESIQA
ALTVVRARAEKLGINGDFYEKRDIHVHVPEGATPKDGPSAGIAMCTALVSCLTGNPVRADVAMTGEITLRGQVLPIGGLK
EKLLAAHRGGIKTVLIPDENKRDLEEIPENVIADLDIHPVKRIEEVLALALQNAPYGMQVASVK
>O31147 3.4.21.53~~~lon~~~Lon protease~~~
MAEAKTVPVLFLNDSIVLPGMVVPIELDDAARAAVDAARASESGELLIAPRLEDRYPAYGVLASIVQIGRLPNGDAAAVV
RGERRAHIGSGTSGPGAALWVQVEEVTDPEPTDETKKLAGEYKKLLLAMLQRRDAWQIVDMVNKITDPSALADTAGYASY
LTGTQKRELLETTDVDRRLSLLIGWTGDHLAETEVNDKIAEDVRTGMEKQQKEFLLRQQLAAIRKELGELDDNGDGSSDD
YRARIEQADLPEKVREAALREVGKLERASDQSPEGGWIRTWLDTVLDLPWNVRTEDSTDLARAREILDTDHHGLSDVKDR
IVEYLAVRGAAPQRGMAVVGGRGSGAVMVLAGPPGVGKTSLGESVARALDRKFVRVALGGVRDEAEIRGHRRTYVGALPG
RIVRAIGEAGSMNPVVLLDEIDKVGSDYRGDPAAALLEVLDPAQNHTFRDHYLDLDLDLSDVVFLVTANVIENIPSALLD
RMELVEIDGYTADDKLAIAQGFLLPRQRERGGLTSDEVTVTEAALRKIAADYTREPGVRQFERLLAKAMRKAATKLADHP
QAAPITIDEPDLVEYLGRPRFLPESAERTAVPGVATGLAVTGLGGDVLYIEANSTEGEPGLQLTGQLGDVMKESAQIAMS
YVRAHAKQLGVDPEALNRRIHIHVPAGAVPKDGPSAGVTMVTALVSMATGRKVRGDVGMTGEVTLNGRVLPIGGVKQKLL
AAQRAGLSTVFIPQRNQPDLDDVPADVLDALDVRPMTDVADIIAAALEPAHEASTAAAA
>Q2YKK7 2.7.13.3~~~~~~Blue-light-activated histidine kinase~~~
MAIDLRPFIPFGRGALSQATDPFRAAVEFTLMPMLITNPHLPDNPIVFANPAFLKLTGYEADEVMGRNCRFLQGHGTDPA
HVRAIKSAIAAEKPIDIDIINYKKSGEAFWNRLHISPVHNANGRLQHFVSSQLDVTLELSRLVELEKERKTLSIETARSK
DQLDYIVEVANIGFWTREFYSGKMTCSAECRRIYGFTPDEPVHFDTILDLVVLEDRMTVVQKAHQAVTGEPYSIEYRIVT
RLGETRWLETRAKALTGENPLVLGIVQDVTERKKAEANKALVSREIAHRFKNSMAMVQSIANQTLRNTYDPEQANRLFSE
RLRALSQAHDMLLKENWAGATIQQICATALAPFNSTFANRIHMSGPHLLVSDRVTVALSLAFYELATNAVKYGALSNEKG
VINITWAIMEDKGEKKFHMRWAESRGPEVMQPARRGFGQRLLHSVLAEELKAKCDVEFAASGLLIDVLAPITPEVFPGMG
HNVPEQRIA
>Q8YC53 2.7.13.3~~~~~~Blue-light-activated histidine kinase~~~COG2202
MAIDLRPFIPFGRGALSQATDPFRAAVEFTLMPMLITNPHLPDNPIVFANPAFLKLTGYEADEVMGRNCRFLQGHGTDPA
HVRAIKSAIAAEKPIDIDIINYKKSGEAFWNRLHISPVHNANGRLQHFVSSQLDVTLELSRLVELEKERKTLSIETARSK
DQLDYIVEVANIGFWTREFYSGKMTCSAECRRIYGFTPDEPVHFDTILDLVVLEDRMTVVQKAHQAVTGEPYSIEYRIVT
RLGETRWLETRAKALTGENPLVLGIVQDVTERKKAEANKALVSREIAHRFKNSMAMVQSIANQTLRNTYDPEQANRLFSE
RLRALSQAHDMLLKENWAGATIQQICATALAPFNSTFANRIHMSGPHLLVSDRVTVALSLAFYELATNAVKYGALSNEKG
VINITWAIMEDKGEKKFHMRWAESRGPEVMQPARRGFGQRLLHSVLAEELKAKCDVEFAASGLLIDVLAPITPEVFPGMG
HNVPEQRIA
>Q881J7 ~~~~~~Blue-light-activated protein~~~COG0784
MSENKTRVDNAATGDIQHQGKDIFFAAVETTRMPMIVTDPNRPDNPIIFSNRAFLEMTGYTAEEILGTNCRFLQGPDTDP
AVVQSIRDAIAQRNDISAEIINYRKDGSSFWNALFISPVYNDAGDLIYFFASQLDISRRKDAEEALRQAQKMEALGQLTG
GIAHDFNNLLQVMGGYIDLIGSAAEKPVIDVQRVQRSVYHAKSAVERASTLTKQLLAFARKQKLQGRVLNLNGLVSIVEP
LIERTFGPEVAIETDLEPALKNCRIDPTQAEVALLNIFINARDALIGRENPKVFIETRNLLVDELANMSYDGLLPGRYVS
IAVTDNGIGMPASIRDRVMDPFFTTKEEGKGSGLGLSMVYGFAKQSGGAARIYTEEGVGTTLRLYFPVDEAGLTNTESPQ
ASDRRLGSSERILIVEDRPDVAELAKMVLDDYGYVSEIVLNAREALKKFESGNMYDLLFTDLIMPGGMNGVMLAREVRRR
YPKVKVLLTTGYAESSIERTDIGGSEFDVVSKPCMPHDLARKVRQVLDGPNGIA
>Q1M667 2.7.13.3~~~lov~~~Blue-light-activated histidine kinase~~~
MTPHTKEKLHGDLPSASSKAASADRKELAAIAFERTRMPMVVTDGRKPDLPIVLANKAFLELTGYPAQEVLGRNCRFLQG
PATSPIAVAEIRAAIAGEREVSVEILNYKKSGEQFWNRLHLSPVHGDDGKILYFFGSQIDMTEYRRIEALEASEHRLLME
VDHRSKNVLAIVDSIVRLSNADDPALYAAAIQHRVQALARAHTLLAARGWTNISLEELIRQQVTPFAATRAIFNGPDINM
PAPVVQPLALVLHELAVNAAHHGALAVAQGRLSISWKPRPSGAGFYIRWQEVGAPTPPKLAKRGFGTVIVGAMVEKQLKG
RLQKIWSDEGLLIDIEIPSAGPTCA
>Q9I4G8 1.13.11.-~~~loxA~~~Lipoxygenase LoxA~~~
MKRRSVLLSGVALSGTALANDSIFFSPLKYLGAEQQRSIDASRSLLDNLIPPSLPQYDNLAGKLARRAVLTSKKLVYVWT
ENFGNVKGVPMARSVPLGELPNVDWLLKTAGVIVELIVNFVASLPASAAAQFERIATGLSGDLEAARQVHEALLEEAKND
PAAAGSLLLRFTELQTRVIAILTRVGLLVDDILKSASNLVTQRGQGDGLNRFRAVFGTLRLPEVADSFRDDEAFAYWRVA
GPNPLLIRRVDALPANFPLGEEQFRRVMGADDSLLEAAASRRLYLLDYAELGKLAPSGAVDKLLTGTGFAYAPIALFALG
KDRARLLPVAIQCGQDPATHPMFVRPAESESDLYWGWQMAKTVVQVAEENYHEMFVHLAQTHLVSEAFCLATQRTLAPSH
PLHVLLAPHFEGTLFINEGAARILLPSAGFIDVMFAAPIQDTQATAGGNRLGFDFYRGMLPESLKARNVDDPLALPDYPY
RDDGLLVWNAIRQWAADYVAVYYASDGDVTADVELAAWVGEVIGSGKVAGFRPITGRSQLVEVLTMVIFTASAQHAAVNF
PQPSMMTYAPAICAMSAAPAPDSPSGKSEADWLKMMPPTLVALEKVNIYHLLGSVYHGRLGDYRQTGFPYAPVFSDRRVT
ASGGPLERFQARLKEVEATIRTRNQARRRPYEYLLPSRIPASTNI
>Q44467 1.1.3.-~~~~~~L-lactate oxidase~~~
MNNNDIEYNAPSEIKYIDVVNTYDLEEEASKVVPHGGFNYIAGASGDEWTKRANDRAWKHKLLYPRLAQDVEAPDTSTEI
LGHKIKAPFIMAPIAAHGLAHTTKEAGTARAVSEFGTIMSISAYSGATFEEISEGLNGGPRWFQIYMAKDDQQNRDILDE
AKSDGATAIILTADSTVSGNRDRDVKNKFVYPFGMPIVQRYLRGTAEGMSLNNIYGASKQKISPRDIEEIAGHSGLPVFV
KGIQHPEDADMAIKRGASGIWVSNHGARQLYEAPGSFDTLPAIAERVNKRVPIVFDSGVRRGEHVAKALASGADVVALGR
PVLFGLALGGWQGAYSVLDYFQKDLTRVMQLTGSQNVEDLKGLDLFDNPYGYEY
>F4G5A4 1.1.3.-~~~~~~L-lactate oxidase~~~COG1304
MSPSDRIPPGVWNAIDYERLAPQAMDAGRHAYVAGGCGWDATVAANRAAFAGWAVLPRLLRDVRAGHTRLQLAGMDLPHP
LLLAPVAHQRLAHPDAEIATARAAQATGSCLVASTLSSCTLEDIAAASGPARWFQLYLQPEREHSLDLLRRAEAAGYRAI
VLTLDASIQLASRGALQAGFAMPADCVSANLARYPQPAPAQPAAGESRIFQGAMRHAPRWDDLRWLLASTRLPVWIKGVL
HPEDARELQAAGAAGLIVSNHGGRSLDGAPASLRMLPALRTAVGAGYPLLLDGGVRSGQDAFKALALGADAVLVGRLQVY
ALAVAGALGVAHMLQMLVEELHACMAQAGCARLSDITHDTLTPSC
>A0A5N1I561 1.1.3.-~~~~~~L-lactate oxidase~~~
MTVYYKGFPQSDRNEAIKMVNVDELEDRVRKVMPEAAYYYIASGSENEWTWRNNTAAFNHFQIVPRSLTNMDNPSTETQF
MGMDLKTPIMICPIACHGIAHKDAEVATAQGAKAAGALFSSSTYANRSVEDIATATGDSPKFFQLYLSKDWDFNKMVFDA
VKSAGYKGIMLTVDALVSGYREANLRTNFTFPVPLDFFTRYVGAEGEGMSVAQMYANSAQKIGPADVAKIKEMSGLPVFV
KGVMNAEDAYMAIGAGADGIVVSNHGGREIDTAPATIDMLPEIAAAVNGRVPIILDSGVRRGSHVFKALALGADLVGIGR
PFLYGLALGGAKGVESVINQINNEFKILMQLTGCKTVEDVKHADIRQINYTADNLPSNTDPSVRRAYPVTKENQMEGTQD
AATGASKH
>C2K1F0 1.1.3.-~~~~~~L-lactate oxidase~~~
MVDAVKADPFGKVNAIDVLDLASLEARAEKVLGRGEFGYISEGSDDGYTMHRNTTAFQDVHMLPRVLQGVENPDQSTTFM
GAKLASPLLTAPIASNTLAHPSGELGLAKGAKEAGIMMSQSTFASKTIAETAAVSDGAPYMFQLYMPKDWEYCQYLLDEA
KQAGALAIILTADSTLGGYREKDVMNHYHLKGRLANLEGYNTGQSGVGAGGLFKESMQKLDLGLISKLASYSGLPIIIKG
IQHPADAVAAITAGAAGIYVSNHGGRQLDGAPGAIEQLPAIAAAVDHRVPIIFDGGVQRGTHVLKALALGADLVGIGRPF
SYGLALGGWQGVKDVADHLKMEINIAMQLTGCQTMADVKQMKVKTTFA
>C0XIJ3 1.1.3.-~~~haox~~~L-lactate oxidase~~~
MVVVNGYKQNENEKKLNVLNLDQLEKQAKEIIPTGGFGYISGGSEDEWTLRENRRAFTHKQIVPRALTNIEKPELETNVF
GIPLKTPLFMVPAAAQGLAHVKGEVDTAKGVAAVGGLMAQSTYSSTSIADTAASGTGAPQFFQLYMSKDWDFNEALLDEA
KRAGVKGIILTVDATVDGYREADIINNFQFPIPMANLTKYSEDDGQGKGIAEIYASAAQKIGSDDVARIANYTDLPVIVK
GIESPEDALYAIGAGASGIYVSNHGGRQLNGGPASFDVLEDVAKAVNGKVPVIFDSGIRRGSDVFKALASGADLVGIGRP
VIYGLALGGAQGVQSVFEHLDHELEIIMQLAGTKTISDVKNAKLLNIRY
>B1HZY7 1.1.3.-~~~~~~L-lactate oxidase~~~
MTNTTDGDLLLKNITAQAPFPICFADLEKAVAEKIPAGPFGYIRSGAGGEQTLRNNRSAFEKYSIVPRFLNDVSNVHTSI
NLFGKTYPTPLLFAPVGMNGMVHEEGELAAVRAAQQLNMPYIQSTVSTYALEDVAEAAPSATKWFQLYWSTNEEIAFSMA
ARAESAGFEAIVLTVDTVMLGWREEDVRNQFSPLKLGYAKGNYINDPVFMASLPNDSFESYVQGVLQNVFHPTLNWEHVR
ELKRRTNLPILLKGILHPEDAKLAIVNGVDGIIVSNHGGRQLDGVIGSLDALPSIVSAVKGQIPIILDSGVYRGMDALKA
LALGADAVAIGRPFIYGLALEGQQGVERVMTNIYDELKVSIALAGTTSIEGLRTITLVKNDGMEVK
>Q8Z0C8 1.1.3.-~~~lox~~~L-lactate oxidase~~~COG1304
MTAISSPINLFEYEQLAKTHLSQMAFDYYISGAGDEITLQENRAVFERIKLRPRMLVDVSQINLTTSVLGQPLQLPLLIA
PMAFQCLAHTEGELATAMAAASAGTGMVLSTLSTKSLEEVAEVGSKFSPSLQWFQLYIHKDRGLTRALVERAYAAGYKAL
CLTVDAPVLGQRERDRRNEFVLPPGLHLANLTTISGLNIPHAPGESGLFTYFAQQLNPALTWDDLEWLQSLSPLPLVLKG
ILRGDDAARAVEYGAKAIVVSNHGGRQLDGAIASLDALPEIVAAVNGKAEVLLDGGIRRGTDIIKALAIGAQAVLIGRPV
LWGLAVGGQAGVSHVISLLQKELNVAMALIGCSQLQDIDTSFLHL
>Q8RNT4 1.13.11.12~~~lox~~~Linoleate 9/13-lipoxygenase~~~COG0753
MKRRSVLLSGVALSGTALANDSIFFSPLKYLGAEQQRSIDASRSLLDNLIPPSLPQYDNLAGKLARRAVLTSKKLVYVWT
ENFANVKGVPMARSVPLGELPNVDWLLKTAGVIVELIVNFVASLPASAAAQFERIAAGLSGDLEAARQVHEALLEEAKND
PAAAGSLLLRFTELQTRVIALLTRVGLLVDDILKSASNLVTQGGQGDGLNRFRAVFGTLRLPEVADSFRDDEAFAYWRVA
GPNPLLIRRVDALPANFPLGEEQFRRVMGADDSLLEAAASRRLYLLDYAELGKLAPSGAVDKLLTGTGFAYAPIALFALG
KDRAGLLPVAIQCGQDPATHPMFVRPAESESDLYWGWQMAKTVVQVAEENYHEMFVHLAQTHLVSEAFCLATQRTLAPSH
PLHVLLAPHFEGTLFINEGAARILLPSAGFIDVMFAAPIQDTQATAGGNRLGFDFYRGMLPESLKARNVDDPAALPDYPY
RDDGLLVWNAIRQWAADYVAVYYASDGDVTADVELAAWVGEVIGSGKVAGFRPITGRSQLVEVLTMVIFTASAQHAAVNF
PQPSMMTYAPAICAMSAAPAPDSPSGKSEADWLKMMPPTLVALEKVNIYHLLGSVYHGRLGDYRQTGFPYAPVFSDRRVT
ASGGPLERFQARLKEVEATIRTRNQARRKPYEYLLPSRIPASTNI
>B7RR92 1.1.3.-~~~~~~L-lactate oxidase~~~COG1304
MESADLRDPDGMPVTLSDFEIDAAGRLSADLLAYLEGGAEAGQSVTENRAAFGRIGLLPKLLSPCAGGHTRTTILGKQAP
HPIMVAPMAFQNLFHPQGESATAMAAAAQDATMVLSCQTSTPPEDIATIPGRRWFQLYMQADHEATMALVTRAVDCGADA
LVVTLDAPINGLRDREVAAGFTLPDDVRPVMLDVLPQPPRPHLRDGQSVVFDGMMVFAPTADDLARLIADSPVPVIVKGC
LRPADATRLIDLGAQGIIVSNHGGRVLDTVPAPITQLAAVVDAVAGAVPVYVDGGIRRGSDVFKALALGAQAVLVGRPVM
HGLIVDGPRGASQVLRRLRDELEVTMALCGCATVADITPDLLTGFSGTGS
>O33655 1.1.3.-~~~lctO~~~L-lactate oxidase~~~COG1304
MENKSEMINATTIEFKTSSAEGSVDFVNVFDLEKMAQKVIPKGAFGYIASGAGDTFTLHENIRSFNHKLIPHGLKGVENP
STEITFIGDKLASPIILAPVAAHKLANEQGEIASAKGVKEFGTIYTTSSYSTTDLPEISQTLGDSPHWFQFYYSKDDGIN
RHIMDRLKAEGVKSIVLTVDATVGGNREVDKRNGFVFPVGMPIVQEYLPNGAGKTMDYVYKATKQALSPKDVEYIAQYSG
LPVYVKGPQCAEDAFRALEAGASGIWVTNHGGRQLDGGPAAFDSLQEVAESVDRRVPIVFDSGVRRGQHVFKALASGADL
VALGRPVIYGLAMGGSVGTRQVFEKINDELKMVMQLAGTQTIDDVKHFKLRHNPYDSSIPFSPKCFKIRLIFRRPNQILG
QFF
>Q72U69 ~~~orfC~~~Antigen Lp49~~~
MNSNPKKKFLKLIKIKSDIILLIPIFLFLVCCKSGDFSLLSSPINREKNGTEIVKFSIHPYKGTVIRLGEEILPFKVLEM
DKNIALVEMAIPVYKDEKEIELKLSSPGFQNSSYRIRKPEELNEKLIALDKEGITHRFISRFKTGFQPKSVRFIDNTRLA
IPLLEDEGMDVLDINSGQTVRLSPPEKYKKKLGFVETISIPEHNELWVSQMQANAVHVFDLKTLAYKATVDLTGKWSKIL
LYDPIRDLVYCSNWISEDISVIDRKTKLEIRKTDKIGLPRGLLLSKDGKELYIAQFSASNQESGGGRLGIYSMDKEKLID
TIGPPGNKRHIVSGNTENKIYVSDMCCSKIEVYDLKEKKVQKSIPVFDKPNTIALSPDGKYLYVSCRGPNHPTEGYLKKG
LVLGKVYVIDTTTDTVKEFWEAGNQPTGLDVSPDNRYLVISDFLDHQIRVYRRDGF
>P08497 ~~~ask~~~Aspartokinase II operon leader peptide~~~
MKKAERGASPKQIKPHRYYLLAVH
>P32717 3.1.6.21~~~yjcS~~~Linear primary-alkylsulfatase~~~COG2015
MNNSRLFRLSRIVIALTAASGMMVNTANAKEEAKAATQYTQQVNQNYAKSLPFSDRQDFDDAQRGFIAPLLDEGILRDAN
GKVYYRADDYKFDINAAAPETVNPSLWRQSQINGISGLFKVTDKMYQVRGQDISNITFVEGEKGIIVIDPLVTPPAAKAA
LDLYFQHRPQKPIVAVIYTHSHTDHYGGVKGIISEADVKSGKVQVIAPAGFMDEAISENVLAGNIMSRRALYSYGLLLPH
NAQGNVGNGLGVTLATGDPSIIAPTKTIVRTGEKMIIDGLEFDFLMTPGSEAPAEMHFYIPALKALCTAENATHTLHNFY
TLRGAKTRDTSKWTEYLNETLDMWGNDAEVLFMPHTWPVWGNKHINDYIGKYRDTIKYIHDQTLHLANQGYTMNEIGDMI
KLPPALANNWASRGYYGSVSHNARAVYNFYLGYYDGNPANLHPYGQVEMGKRYVQALGGSARVINLAQEANKQGDYRWSA
ELLKQVIAANPGDQVAKNLQANNFEQLGYQAESATWRGFYLTGAKELREGVHKFSHGTTGSPDTIRGMSVEMLFDFMAVR
LDSAKAAGKNISLNFNMSNGDNLNLTLNDSVLNYRKTLQPQADASFYISREDLHAVLTGQAKMADLVKAKKAKIIGNGAK
LEEIIACLDNFDLWVNIVTPN
>Q9I5I9 3.1.6.21~~~sdsA1~~~Linear primary-alkylsulfatase~~~
MSRLLALLALAPLLAGAAETTAPKPPSAFTVEAQRRVEAELPFADRADFERADRGLIRRPERLLIRNPDGSVAWQLGGYD
FLLDGKPRDSINPSLQRQALLNLKYGLFEVAEGIYQVRGFDLANITFIRGDSGWIVVDTLTTPATARAAYELVSRELGER
PIRTVIYSHAHADHFGGVRGLVEPQQVASGAVQIIAPAGFMEAAIKENVLAGNAMMRRATYQYGTQLPKGPQGQVDMAIG
KGLARGPLSLLAPTRLIEGEGEDLVLDGVPFTFQNTPGTESPAEMNIWLPRQKALLMAENVVGTLHNLYTLRGAEVRDAL
GWSKYINQALHRFGRQAEVMFAVHNWPRWGNAEIVEVLEKQRDLYGYLHDQTLHLANQGVTIGQVHNRLRLPPSLDQEWY
DRGYHGSVSHNARAVLNRYLGYYDGNPATLDPLSPEDSAGRYVEYMGGAERLLEQARASYARGEYRWVVEVVNRLVFAEP
DNRAARELQADALEQLGYQAENAGWRNSYLSAAYELRHGVPRDQPTMKAGSADALAAMDTGLLFDYLGVRLDAGAAEGKA
LSINLRLPDIGENYLLELKNSHLNNLRGVQSEDAGQTVSIDRADLNRLLLKEVSAVRLVFEGKLKSSGNPLLLGQLFGML
GDFDFWFDIVTPAAKSEG
>Q52556 3.1.6.21~~~sdsA~~~Linear primary-alkylsulfatase~~~
MIEAPEGLIIVDTGESVDQSRKVLAEFRKISDKPIKAIVYTHFHPDHINGVKAFVSEEQVKSGEVRIYAQETLLDNVVTQ
GSLVGPILTMRSGYSFGVALSDEDKRDMNAGLGPLAHEGASTFIAPTDTFRDSLDTTIAGLKVQFLHVPSEAPDEIVLYL
PDNRVLISAEVTQGPTLPNVHTLRGTKFRDPVVWVASLDKLRAFQADVMVPLHGQPVSGREKVEEVLRMTRDAIAYIHDQ
TVRWMNKGLTPDELVEKVKLPPHLAGYTPYLREYYGTVKHSVRQIYQGYLGWFQGDPVDLDPIPPAEKARRLIALMGGRD
KVLMAAGDAYLKGDWQWAAELSGYAIRVDHDDKLARDIKARSFRRLGYASMNINWRNWYLMSAMELEGKLEGDVALEMSR
RVRAAFLSPDMLKNLPARIFLQNWVTRIDPEKSGDVELALGFAFPDIDEAWTLEVRRGVAQLKSGIDPAVPLRLTLDKRY
LDTVISGENSLLKGALLGDVKVDGNLLDIKTFLGCFDFEDAPIALTVR
>F2WP51 3.1.6.21~~~sdsAP~~~Linear primary-alkylsulfatase~~~
MKLNALSTATHGSRSSPVKLWKFSTSFLLAASIIVSGQSWAAETAKPATDATKAANDALLKELPFDDKTSFDLAHKGFIA
PLPAEPIKGEKGNMIWDPSKYGFIKEGEAAPDTTNPSLWRQSQLINISGLFEVTDGIYQVRNYDLSNMTIVEGKDGITIF
DPLISQETAKAALDLYYKHRPKKPVVAVIYTHSHVDHYGGVRGVVDEADVKAGKVKIYAPLGFLEHAVAENVMAGTAMSR
RASYMYGNLLPPDAKGQLGAGLGTTTSAGTVTLIPPTDIIKETGETHVIDGLTYEFMYAPGSEAPAEMLYYIKEKKALNA
AEDSTHTLHNTYSLRGAKIRDPLAWSKYLNEALKLWGDDVQVMYAMHHWPVWGNKEVREQLSLQRDMYRYINDETLRLAN
KGYTMTEIAEQVKLPKKIATKFSNRGYYGSLNHNVKATYVLYLGWFIGNPATLWELPPADKAKRYVEMMGGADAVLKKAK
EYYDKGDFRWVAEVVNHVVFAEPNNQAAKNMQADALEQLGYQAESGPWRNFYLTGAQELRNGVQQLPTPDTASPDTVKAM
DLDLFFDFLAMRLKGPDVADKHITLNLDFTDLKQKYTLEMVNGVLNHTEGMQAKNADATVTLTRETLNNVMLKQTTLKDA
ESSGDIKIEGDKGKLEELMSYMDNFDFWFNIVTP
>P9WHH7 1.6.5.2~~~lpdA~~~NAD(P)H dehydrogenase (quinone)~~~COG1249
MVTRIVILGGGPAGYEAALVAATSHPETTQVTVIDCDGIGGAAVLDDCVPSKTFIASTGLRTELRRAPHLGFHIDFDDAK
ISLPQIHARVKTLAAAQSADITAQLLSMGVQVIAGRGELIDSTPGLARHRIKATAADGSTSEHEADVVLVATGASPRILP
SAQPDGERILTWRQLYDLDALPDHLIVVGSGVTGAEFVDAYTELGVPVTVVASQDHVLPYEDADAALVLEESFAERGVRL
FKNARAASVTRTGAGVLVTMTDGRTVEGSHALMTIGSVPNTSGLGLERVGIQLGRGNYLTVDRVSRTLATGIYAAGDCTG
LLPLASVAAMQGRIAMYHALGEGVSPIRLRTVAATVFTRPEIAAVGVPQSVIDAGSVAARTIMLPLRTNARAKMSEMRHG
FVKIFCRRSTGVVIGGVVVAPIASELILPIAVAVQNRITVNELAQTLAVYPSLSGSITEAARRLMAHDDLDCTAAQDAAE
QLALVPHHLPTSN
>F9UT67 2.5.1.129~~~lpdB~~~Flavin prenyltransferase LpdB~~~COG0163
MKRIVVGITGASGTIYAVDLLEKLHQRPDVEVHLVMSAWAKKNLELETDYSLAQLTALADATYRANDQGAAIASGSFLND
GMVIVPASMKTVAGIAYGFGDNLISRAADVTIKEQRKLVIVPRETPLSVIHLENLTKLAKLGAQIIPPIPAFYNHPQSIQ
DLVNHQTMKILDAFHIHNETDRRWEGD
>F9US27 4.1.1.59~~~lpdC~~~Gallate decarboxylase~~~COG0043
MAEQPWDLRRVLDEIKDDPKNYHETDVEVDPNAELSGVYRYIGAGGTVQRPTQEGPAMMFNNVKGFPDTRVLTGLMASRR
RVGKMFHHDYQTLGQYLNEAVSNPVAPETVAEADAPAHDVVYKATDEGFDIRKLVAAPTNTPQDAGPYITVGVVFGSSMD
KSKSDVTIHRMVLEDKDKLGIYIMPGGRHIGAFAEEYEKANKPMPITINIGLDPAITIGATFEPPTTPFGYNELGVAGAI
RNQAVQLVDGVTVDEKAIARSEYTLEGYIMPNERIQEDINTHTGKAMPEFPGYDGDANPALQVIKVTAVTHRKNAIMQSV
IGPSEEHVSMAGIPTEASILQLVNRAIPGKVTNVYNPPAGGGKLMTIMQIHKDNEADEGIQRQAALLAFSAFKELKTVIL
VDEDVDIFDMNDVIWTMNTRFQADQDLMVLSGMRNHPLDPSERPQYDPKSIRFRGMSSKLVIDGTVPFDMKDQFERAQFM
KVADWEKYLK
>Q8X5K5 ~~~lpfA~~~Probable major fimbrial subunit LpfA~~~COG3539
MEFFMKKVVFALTALALTSGTVFAAESGDGTVKFTGEIVDSPCVLSVDSQNQEVVLGQVQKSVFAAVGDKSPAKPFEIKL
EDCDTTTMKKANVSFSGVGDADKSDLISVSTEAGAAKGVGIGIYDNSNTLVALNGGKASVDLSKGQTVLYFTANYVSTLA
TVTTGYGNAQVDFNLSYE
>Q8X5K6 ~~~lpfB~~~Probable fimbrial chaperone LpfB~~~COG3121
MDRMMKSKFVALALSLFLSQSVLAGGVGLSSTRVIYDGSKKEASLTVQNKSKTEEFLIQSWVDDAAGSKKTPFIITPPLF
KLDPEKNNILRIVNITPGLPQDRESVYWVNVKAIPSKSDDSENKNVLQIAVRTRIKLFYRPAGLKGDVKTAPNELRFTRN
GNQLRVDNPTVFNITFNQFFANDKEIEKAGMVPAKGALNITLPAGVGSVSKIKYNTINDFGSXAEMITKNVD
>Q8X5K8 ~~~lpfC'~~~Probable outer membrane usher protein LpfC'~~~
MPIFQREGHLKYSFAAGEYQAGNYDSASPRFGQLDLIYGLPWGMTAYGGVLISNNYNAFTLGIGKNFGYIGAISIDVTQA
KSELNNDRDSQGQSYRFLYSKSFESGTDFRLAGYRYSTSGFYTFQEATDVRSDADSDYNRYHKRSEIQGNLTQQLGAYGS
VYLNLTQQDYWNDAGKQNTVSAGYNGRIGKVSYSIAYSWNKSPEWDESDRLWSFNISVPLGRAWSNYRVTTDQDGRTNQQ
VGVSGTLLEDRNLSYSVQEGYASNGVGNSGNANVGYQGGSGNVNVGYSYGKDYRQLNYSVRGGVIVHSEGVTLSQPLGET
MTLISVPGARNARVVNNGGVQVDWMGNAIVPYAMPYRENEISLRSDSLGDDVDVENAFQKVVPTRGAIVRARFDTRVGYR
VLMTLLRSAGSPVPFGATATLITDKQNEVSSIVGEEGQLYISGMPEEGRVLIKWGNDASQQCVAPYKLSLELKQGGIIPV
SANCQ
>Q8X5K7 ~~~lpfC~~~Probable outer membrane usher protein LpfC~~~
MSRKTVSRTFSSFSISVVAVAVASTFSAHAGKFNPKFLEDVQGVGQHVDLTMFEKGQEQQLPGIYRVSVYVNEQRMETRT
LEFKEATEAQRKAMGESLVPCLSRTQLAEMGVRVESFPALNLVSAEACVPFDEIIPLASSHFDFSEQKLVLSFPQAAMHQ
VARGTVPESLWDEGIPALLLDYSFSGSNSEYDSTGSSSSYVDDNGTVHHDDGKDTLKSDSYYLNLRSGLNLGAWRLRNYS
TWSHSGGKAQWDNIGTSLSRAIIPFKAQLTMGDTATAGDIFDSVQMRGAMLASDEEMLPDSQRGFAPIVRGIAKSNAEVS
IEQNGYVIYRTYVQPGAFEINDLYPTANSGDLTVIIKEADGSEQRFI
>Q8X5K9 ~~~lpfD~~~Probable minor fimbrial subunit LpfD~~~COG3539
MKAAIALSLLGCVFGFSGKAFAGDAWGPCTPADGTTYHYNVDVDVGIPDAAKNVAGTVLPDVLNWSNGQNVSLICECPDS
YKNEKDTLVQGVSMLPPSGRTVDSMKYYTLTEELEVATNIRISTSVYGFVPFKNQQALQTTGCNKVITTPYMGGAGLLSF
AITKPFIGDSVIPLTLIAELYASKTNKDYGTIPISSVSIQGRVTVTQDCEIKPGTVLDVPFGEFPSSAFKNRQGQMPEGA
TEQEINLSFDCNNISDGIKVALRLEGATNADDPRAVDMGNPDIGVLVKDSSGKILVPNDSSSTTLLNLSSLDSKTHRNAA
IRLLALPISTTGKAPKGGTFEGVTTIYLEME
>Q8X5L0 ~~~lpfE~~~Probable fimbrial subunit LpfE~~~COG3539
MKFKRLLHSGIASLSLVACGVNAATDLGPAGDIHFSITITTKACEMEKSDLEVDMGTMTLQKPAAVGTVLSKKDFTIELK
ECDGISKATVEMDSQSDSDDDSMFALEAGGATGVALKIEDDKGTQQVPKGSSGTPIEWAIDGETTSLHYQASYVVVNTQA
TGGTANALVNFSITYE
>A8DYP9 ~~~uof~~~fur leader peptide~~~
MIRIISRANSVTSSNEVNRLVTGQIPHD
>A0A0H3JR16 6.3.1.20~~~~~~Lipoate--protein ligase 1~~~
MKFISNNNITDPTLNLAMEEYVLKNLPAEESYFLFYINRPSIIVGKNQNTIEEVNQTYIDAHNIDVVRRISGGGAVYHDT
GNLNFSFITDDDGNSFHNFQKFTEPIVQALQSLGVNAELTGRNDIQVGQAKISGNAMVKVKNRMFSHGTLMLNSDLDEVQ
NALKVNPAKIKSKGIKSVRKRVANIQEFLNDPLEIEEFKKIILKTIFGETEVEEYKLTDEDWENIEKLSNDKYRTWAWNY
GRNPKYNFEREEKFEKGFVQIKFDVKRGKIEHAKIFGDFFGVGDVTDLENALVGCLHDFEHIEEALSEYDLYHYFGDIDR
HELIRLMS
>A0A0H3JX98 6.3.1.20~~~~~~Lipoate--protein ligase 2~~~
MYLIEPIRNGEYITDGAIALAMQVYVNQHIFLDEDILFPYYCDPKVEIGRFQNTAIEVNQDYIDKHSIQVVRRDTGGGAV
YVDKGAVNMCCILEQDTSIYGDFQRFYQPAIKALHTLGATDVIQSGRNDLTLNGKKVSGAAMTLMNNRIYGGYSLLLDVN
YEAMDKVLKPNRKKIASKGIKSVRARVGHLREALDEKYRDITIEEFKNLMVTQILGIDDIKEAKRYELTDADWEAIDELA
DKKYKNWDWNYGKSPKYEYNRSERLSSGTVDITISVEQNRIADCRIYGDFFGQGDIKDVEEALQGTKMTREDLMHQLKQL
DIVYYFGNVTVESLVEMILS
>P0DN72 6.3.1.20~~~lplA~~~Lipoate--protein ligase~~~
MYLIEPIRNGKRITDGAVALAMQVYVQENLFLDDDILFPYYCDPKVEIGKFQNAVVETNQEYLKEHHIPVVRRDTGGGAV
YVDSGAVNICYLINDNGVFGDFKRTYQPAIEALHHLGATGVEMSGRNDLVIDGKKVSGAAMTIANGRVYGGYSLLLDVDF
EAMEKALKPNRKKIESKGIRSVRSRVGNIREHLAPQYQGITIEEFKDLMICQLLQIETISQAKRYDLTEKDWQQIDALTE
RKYHNWEWNYGNAPQYRYHRDGRFTGGTVDIHLDIKKGYIAACRIYGDFFGKADIAELEGHLIGTRMEKEDVLATLNAID
LAPYLGAITAEELGDLIFS
>P32099 6.3.1.20~~~lplA~~~Lipoate-protein ligase A~~~COG0095
MSTLRLLISDSYDPWFNLAVEECIFRQMPATQRVLFLWRNADTVVIGRAQNPWKECNTRRMEEDNVRLARRSSGGGAVFH
DLGNTCFTFMAGKPEYDKTISTSIVLNALNALGVSAEASGRNDLVVKTVEGDRKVSGSAYRETKDRGFHHGTLLLNADLS
RLANYLNPDKKKLAAKGITSVRSRVTNLTELLPGITHEQVCEAITEAFFAHYGERVEAEIISPNKTPDLPNFAETFARQS
SWEWNFGQAPAFSHLLDERFTWGGVELHFDVEKGHITRAQVFTDSLNPAPLEALAGRLQGCLYRADMLQQECEALLVDFP
EQEKELRELSAWMAGAVR
>P39130 3.2.1.67~~~lplD~~~Alpha-galacturonidase~~~COG1486
MFHISTLDQIKIAYIGGGSQGWARSLMSDLSIDERMSGTVALYDLDFEAAQKNEVIGNHSGNGRWRYEAVSTLKKALSAA
DIVIISILPGSLDDMEVDVHLPERCGIYQSVGDTVGPGGIIRGLRAVPIFAEIARAIRDYAPESWVINYTNPMSVCTRVL
YKVFPGIKAIGCCHEVFGTQKLLAEMVTERLGIEVPRREDIRVNVLGINHFTWITKASYRHIDLLPIFREFSAHYGESGY
ELEGECWRDSVFCSAHRVAFDLFETYGAIPAAGDRHLAEFLPGPYLKQPEVWKFHLTPISFRKQDRAEKRQETERLIVQQ
RGVAEKASGEEGVNIIAALLGLGELVTNVNMPNQGQVLNLPIQAIVETNAFITRNRVQPILSGALPKGVEMLAARHISNQ
EAVADAGLTKDTGLAFQAFLNDPLVQIDRSDAEQLFNDMLQCIMQS
>A9KTB9 3.2.1.67~~~~~~Alpha-galacturonidase~~~COG1486
MKYNNGKVSDVKIAYIGGGSRGWAWTFMTDLAMEPNMSGKISLYDIDQEAAKNNEIIGNMITRRDDTVGKWNYETANTME
AALTGADFVVISILPGTFDEMEADVHMPERLGIYQSVGDTAGPGGMMRALRTIPMFVTIANAIKEYSPKAWVINYTNPMS
MCVKTLYHVFPEIKAFGCCHEVFGTQKVLKGIAEQELKIDRIDRNDIHVNVLGINHFTWFNYASYQGIDLFPIYCKYIED
HFEEGFEEKDENWANASFACKHRVKFDLFNEFGLIAAAGDRHLTEFMPSERYLKDKETVADWNFGLTTVEWRKKDLEDRL
NKSHRLVSGEEEIKLEPSGEEGILLIKALCGLTRVISNVNIPNTNLQIENLPSTAIVETNAVFERDSIRPIMAGEMPENV
VKLTMPHILNHEYIMEAALTFDKSLVVKAFEQDPLVKDMATKEEVEKLVEDMLDATKAYLPKEWNL
>D3T426 3.2.1.67~~~~~~Alpha-galacturonidase~~~COG1486
MKYNGDKVEGIKIAYIGGGSRGWAWRLMSDLALEQSLSGTVYLYDIDYEAAKTNEIIGNNLKSQWEYKSVDSMEEALKGA
DFVIISILPGTFNEMMSDVHTPEKFGIYQSVGDTTGPGGLFRALRTIPLYVEFANKIKKYCPEAWVINYTNPMALCVKTL
YETFPKIKAFGCCHEVFSTQNLIAKAAKEIEGIECSREDIRTNVLGINHFTWIDKATYKNIDLIPVYKKFVEKYFESGYE
DRGDWKESYFNSANKVKFDLFNKYGIIAAAGDRHLAEFIPFFGYLENPEAVAKWKFHLTPVSWRIKNREELIKKSKKMAK
GEEKFEIEPSGEEGVKQMKALLGLGDLITNVNLPNRGQMEGVEFNTVVETNAFFTKDRVQPIISGKLPDTVNMLLSPHVL
NQKMIFEAAIKKDKELVFHAFLNDPFVRKLTYSDAKKLFNEMFDNAREYLKGW
>I3VRU1 3.2.1.67~~~~~~Alpha-galacturonidase~~~COG1486
MRYTDGKVHDITIAYIGGGSRGWAWNLMTDLAKEESISGTVKLYDIDYDAAHDNEIIGNALSMRQDVKGKWLYKACETLE
ESLKGADFVIISILPGTFDEMESDVHAPEKYGIYQSVGDTVGPGGIVRALRTIPMFVDIANAIKEHCPDAWVINYTNPMT
LCVRTLYEIFPQIKAFGCCHEVFGTQKLLSRALQDIEGIENVPREEIKINVLGINHFTWIDNARYKDIDLMYVYKQFVNK
YYESGFVSDANNNWMNNSFVSAERVKFDLFLRYGVIAAAGDRHLAEFVPGYWYLKDPETVREWMFGLTTVSWRKEDLKRR
LERSKRLKTGEEKFELKETGEEGVRQIKALLGLGDLVTNVNMPNHGQIEGIPYGAVVETNALFSGNKLKPVLSGKLPDNV
NSLVLRQVYNQETTLKAALKRDFDLAFSAFVNDPLVTISLKDAKKLFKEMLENTKKYLDGWKIKA
>O07608 6.3.1.20~~~lplJ~~~Lipoate-protein ligase LplJ~~~COG0095
MLFIDNQNINDPRINLAIEEYCVKHLDPEQQYLLFYVNQPSIIIGKNQNTIEEINTKYVEENGIIVVRRLSGGGAVYHDL
GNLNFSFITKDDGDSFHNFKKFTEPVIQALHQLGVEAELSGRNDIVVDGRKISGNAQFATKGRIFSHGTLMFDSAIDHVV
SALKVKKDKIESKGIKSIRSRVANISEFLDDKMTTEEFRSHLLRHIFNTNDVGNVPEYKLTEKDWETIHQISKERYQNWD
WNYGRSPKFNLNHSKRYPVGSIDLHLEVKKGKIEDCKIFGDFFGVGDVSEIENLLVGKQYERSVIADVLEGVNLKHYFGN
ITKEDFLDLIY
>P39196 ~~~lplT~~~Lysophospholipid transporter LplT~~~COG0477
MSESVHTNTSLWSKGMKAVIVAQFLSAFGDNALLFATLALLKAQFYPEWSQPILQMVFVGAYILFAPFVGQVADSFAKGR
VMMFANGLKLLGAASICFGINPFLGYTLVGVGAAAYSPAKYGILGELTTGSKLVKANGLMEASTIAAILLGSVAGGVLAD
WHVLVALAACALAYGGAVVANIYIPKLAAARPGQSWNLINMTRSFLNACTSLWRNGETRFSLVGTSLFWGAGVTLRFLLV
LWVPVALGITDNATPTYLNAMVAIGIVVGAGAAAKLVTLETVSRCMPAGILIGVVVLIFSLQHELLPAYALLMLIGVMGG
FFVVPLNALLQERGKKSVGAGNAIAVQNLGENSAMLLMLGIYSLAVMIGIPVVPIGIGFGALFALAITALWIWQRRH
>P45464 ~~~lpoA~~~Penicillin-binding protein activator LpoA~~~COG3107
MVPSTFSRLKAARCLPVVLAALIFAGCGTHTPDQSTAYMQGTAQADSAFYLQQMQQSSDDTRINWQLLAIRALVKEGKTG
QAVELFNQLPQELNDAQRREKTLLAVEIKLAQKDFAGAQNLLAKITPADLEQNQQARYWQAKIDASQGRPSIDLLRALIA
QEPLLGAKEKQQNIDATWQALSSMTQEQANTLVINADENILQGWLDLQRVWFDNRNDPDMMKAGIADWQKRYPNNPGAKM
LPTQLVNVKAFKPASTNKIALLLPLNGQAAVFGRTIQQGFEAAKNIGTQPVAAQVAAAPAADVAEQPQPQTVDGVASPAQ
ASVSDLTGEQPAAQPVPVSAPATSTAAVSAPANPSAELKIYDTSSQPLSQILSQVQQDGASIVVGPLLKNNVEELLKSNT
PLNVLALNQPENIENRVNICYFALSPEDEARDAARHIRDQGKQAPLVLIPRSSLGDRVANAFAQEWQKLGGGTVLQQKFG
STSELRAGVNGGSGIALTGSPITLRATTDSGMTTNNPTLQTTPTDDQFTNNGGRVDAVYIVATPGEIAFIKPMIAMRNGS
QSGATLYASSRSAQGTAGPDFRLEMEGLQYSEIPMLAGGNLPLMQQALSAVNNDYSLARMYAMGVDAWSLANHFSQMRQV
QGFEINGNTGSLTANPDCVINRNLSWLQYQQGQVVPVS
>P45299 ~~~lpoA~~~Penicillin-binding protein activator LpoA~~~COG3107
MSILLQGERFKKRLMPILLSMALAGCSNLLGSNFTQTLQKDANASSEFYINKLGQTQELEDQQTYKLLAARVLIRENKVE
QSAALLRELGELNDAQKLDRALIEARISAAKNANEVAQNQLRALDLNKLSPSQKSRYYETLAIVAENRKDMIEAVKARIE
MDKNLTDVQRHQDNIDKTWALLRSANTGVINNASDEGNAALGGWLTLIKAYNDYIRQPVQLSQALQSWKNAYPNHAAATL
FPKELLTLLNFQQTNVSQIGLLLPLSGDGQILGTTIQSGFNDAKGNSTIPVQVFDTSMNSVQDIIAQAKQAGIKTLVGPL
LKQNLDVILADPAQIQGMDVLALNATPNSRAIPQLCYYGLSPEDEAESAANKMWNDGVRNPLVAMPQNDLGQRVGNAFNV
RWQQLAGTDANIRYYNLPADVTYFVQENNSNTTALYAVASPTELAEMKGYLTNIVPNLAIYASSRASASATNTNTDFIAQ
MNGVQFSDIPFFKDTNSPQYQKLAKSTGGEYQLMRLYAMGADAWLLINQFNELRQVPGYRLSGLTGILSADTNCNVERDM
TWYQYQDGAIVPVAN
>P0AB38 ~~~lpoB~~~Penicillin-binding protein activator LpoB~~~COG3417
MTKMSRYALITALAMFLAGCVGQREPAPVEEVKPAPEQPAEPQQPVPTVPSVPTIPQQPGPIEHEDQTAPPAPHIRHYDW
NGAMQPMVSKMLGADGVTAGSVLLVDSVNNRTNGSLNAAEATETLRNALANNGKFTLVSAQQLSMAKQQLGLSPQDSLGT
RSKAIGIARNVGAHYVLYSSASGNVNAPTLQMQLMLVQTGEIIWSGKGAVSQQ
>Q8ZQ08 ~~~lpoB~~~Penicillin-binding protein activator LpoB~~~
MTKMHRYAAIAALAIFLSGCMAQRQPAPVEEVKPAPEQPAQPPQPPVVPSVPTIPQQPGPIEHEDQTGQPAPKVRHYDWN
GAMQPLVSKMLQADGVTAGSVLLVDSVNNRTNGSLNANEATETLRNALANNGKFTLVSVQQLSMAKQQLGLSPQDSLGTR
SKAIGIARNVGAQYVLYSSASGNVNAPALQMQLMLVQTGEIIWSGKGAVQQQ
>E8XH70 ~~~lpp1~~~Major outer membrane lipoprotein Lpp 1~~~
MNRTKLVLGAVILGSTLLAGCSSNAKIDQLSSDVQTLNAKVDQLSNDVNAMRSDVQAAKDDAARANQRLDNQATKYRK
>P0A0V1 ~~~~~~LPP20 lipoprotein~~~
MKNQVKKILGMSVVAAMVIVGCSHAPKSGISKSNKAYKEATKGAPDWVVGDLEKVAKYEKYSGVFLGRAEDLITNNDVDY
STNQATAKARANLAANLKSTLQKDLENEKTRTVDASGKRSISGTDTEKISQLVDKELIASKMLARYVGKDRVFVLVGLDK
QIVDKVREELGMVKK
>P0A0V0 ~~~~~~LPP20 lipoprotein~~~
MKNQVKKILGMSVVAAMVIVGCSHAPKSGISKSNKAYKEATKGAPDWVVGDLEKVAKYEKYSGVFLGRAEDLITNNDVDY
STNQATAKARANLAANLKSTLQKDLENEKTRTVDASGKRSISGTDTEKISQLVDKELIASKMLARYVGKDRVFVLVGLDK
QIVDKVREELGMVKK
>P9WK81 ~~~lppA~~~Putative lipoprotein LppA~~~
MIAPQPISRTLPRWQRIVALTMIGISTALIGGCTMDHNPDTSRRLTGEQKIQLIDSMRNKGSYEAARERLTATARIIADR
VSAAIPGQTWKFDDDPNIQQSDRNGALCDKLTADIARRPIANSVMFGATFSAEDFKIAANIVREEAAKYGATTESSLFNE
SAKRDYDVQGNGYEFRLLQIKFATLNITGDCFLLQKVLDLPAGQLPPEPPIWPTTSTPH
>P9WFN3 ~~~lppC~~~Putative lipoprotein LppC~~~COG1881
MTSTLHRTPLATAGLALVVALGGCGGGGGDSRETPPYVPKATTVDATTPAPAAEPLTIASPMFADGAPIPVQFSCKGANV
APPLTWSSPAGAAELALVVDDPDAVGGLYVHWIVTGIAPGSGSTADGQTPAGGHSVPNSGGRQGYFGPCPPAGTGTHHYR
FTLYHLPVALQLPPGATGVQAAQAIAQAASGQARLVGTFEG
>O07750 ~~~lppE~~~Probable lipoprotein LppE~~~
MCNRLVTVTGVAMVVAAGLSACGQAQTVPRKAARLTIDGVTHTTRPATCSQEHSYRTIDIRNHDSTVQAVVLLSGDRVIP
QWVKIRNVDGFNGSFWHGGVGNARADRARNTYTVAGSAYGISSKKPNTVVSTDFNILAEC
>P9WK77 ~~~lppJ~~~Putative lipoprotein LppJ~~~
MPHSTADRRLRLTRQALLAAAVVPLLAGCALVMHKPHSAGSSNPWDDSAHPLTDDQAMAQVVEPAKQIVAAADLQAVRAG
FSFTSCNDQGDPPYQGTVRMAFLLQGDHDAYFQHVRAAMLSHGWIDGPPPGQYFHGITLHKNGVTANMSLALDHSYGEMI
LDGECRNTTDHHHDDETTNITNQLVQP
>P9WK75 ~~~lppK~~~Putative lipoprotein LppK~~~
MRRNIRVTLGAATIVAALGLSGCSHPEFKRSSPPAPSLPPVTSSPLEAAPITPLPAPEALIDVLSRLADPAVPGTNKVQL
IEGATPENAAALDRFTTALRDGSYLPMTFAANDIAWSDNKPSDVMATVVVTTAHPDNREFTFPMEFVSFKGGWQLSRQTA
EMLLAMGNSPDSTPSATSPAPAPSPTPPG
>P17323 ~~~lppL~~~Lipopeptide~~~
MKRLFLSFVALALLAGSIAACGQKGPLYLPDDEKAKKEHSKDRYGF
>O53505 ~~~lppM~~~Protein LppM~~~
MARTRRRGMLAIAMLLMLVPLATGCLRVRASITISPDDLVSGEIIAAAKPKNSKDTGPALDGDVPFSQKVAVSNYDSDGY
VGSQAVFSDLTFAELPQLANMNSDAAGVNLSLRRNGNIVILEGRADLTSVSDPDADVELTVAFPAAVTSTNGDRIEPEVV
QWKLKPGVVSTMSAQARYTDPNTRSFTGAGIWLGIAAFAAAGVVAVLAWIDRDRSPRLTASGDPPTS
>P9WK71 ~~~lppO~~~Putative lipoprotein LppO~~~
MTDPRHTVRIAVGATALGVSALGATLPACSAHSGPGSPPSAPSAPAAATVMVEGHTHTISGVVECRTSPAVRTATPSESG
TQTTRVNAHDDSASVTLSLSDSTPPDVNGFGISLKIGSVDYQMPYQPVQSPTQVEATRQGKSYTLTGTGHAVIPGQTGMR
ELPFGVHVTCP
>P9WK69 ~~~lppP~~~Putative lipoprotein LppP~~~
MRRQRSAVPILALLALLALLALIVGLGASGCAWKPPTTRPSPPNTCKDSDGPTADTVRQAIAAVPIVVPGSKWVEITRGH
TRNCRLHWVQIIPTIASQSTPQQLLFFDRNIPLGSPTRNPKPYITVLPAGDDTVTVQYQWQIGSDQECCPTGIGTVRFHI
GSDGKLEALGSIPHQ
>Q8NMT9 2.3.2.-~~~lppS~~~Putative L,D-transpeptidase LppS~~~COG1376
MRVFRGRRGAVAGSFLAVLAIGSLALTGCTIERSDAQEQSSQQSTEVEAEEAQAPVISVDDGDEDVDPSESVIVKSMGDG
LSKVTMTNEEGYEVESELSDDGRSWTTAETLGYNRTYTIKATDKNGETATASFSTATPAATTNVALSPLADSVVGVGQTI
GFRFGSPVKDRKAAQDAITVTTSPKVEGGFYWLNNSELRWRPAEYWEPGTEVTVEADIYGKDLGGGVWGETDNATNFTIG
DKVEAVADDATKTMSVYKNGELLRTMPVSFGRDTSEWATPNGTYIIGDRNESMIMDSTTFGLGYEEGGYRTPVKYATQMS
YSGIYVHAAPWSVGAQGSYNTSHGCINVSTENAQWFQEAVKRGDIVTVKNTIGETLSGYDGLGDWNIPWSEWSKGNADQT
SAW
>P9WK67 ~~~lppW~~~Putative lipoprotein LppW~~~COG2367
MRARPLTLLTALAAVTLVVVAGCEARVEAEAYSAADRISSRPQARPQPQPVELLLRAITPPRAPAASPNVGFGELPTRVR
QATDEAAAMGATLSVAVLDRATGQLVSNGNTQIIATASVAKLFIADDLLLAEAEGKVTLSPEDHHALDVMLQSSDDGAAE
RFWSQDGGNAVVTQVARRYGLRSTAPPSDGRWWNTISSAPDLIRYYDMLLDGSGGLPLDRAAVIIADLAQSTPTGIDGYP
QRFGIPDGLYAEPVAVKQGWMCCIGSSWMHLSTGVIGPERRYIMVIESLQPADDATARATITQAVRTMFPNGRI
>A0A0H3MGR5 ~~~lppX~~~Putative phthiocerol dimycocerosate transporter LppX~~~
MNDGKRAVTSAVLVVLGACLALWLSGCSSPKPDAEEQGVPVSPTASDPALLAEIRQSLDATKGLTSVHVAVRTTGKVDSL
LGITSADVDVRANPLAAKGVCTYNDEQGVPFRVQGDNISVKLFDDWSNLGSISELSTSRVLDPAAGVTQLLSGVTNLQAQ
GTEVIDGISTTKITGTIPASSVKMLDPGAKSARPATVWIAQDGSHHLVRASIDLGSGSIQLTQSKWNEPVNVD
>P9WK65 ~~~lppX~~~Putative phthiocerol dimycocerosate transporter LppX~~~
MNDGKRAVTSAVLVVLGACLALWLSGCSSPKPDAEEQGVPVSPTASDPALLAEIRQSLDATKGLTSVHVAVRTTGKVDSL
LGITSADVDVRANPLAAKGVCTYNDEQGVPFRVQGDNISVKLFDDWSNLGSISELSTSRVLDPAAGVTQLLSGVTNLQAQ
GTEVIDGISTTKITGTIPASSVKMLDPGAKSARPATVWIAQDGSHHLVRASIDLGSGSIQLTQSKWNEPVNVD
>P69776 ~~~lpp~~~Major outer membrane lipoprotein Lpp~~~COG4238
MKATKLVLGAVILGSTLLAGCSSNAKIDQLSSDVQTLNAKVDQLSNDVNAMRSDVQAAKDDAARANQRLDNMATKYRK
>P9WK37 ~~~lpqB~~~Lipoprotein LpqB~~~COG5401
MERLMRLTILLFLGAVLAGCASVPSTSAPQAIGTVERPVPSNLPKPSPGMDPDVLLREFLKATADPANRHLAARQFLTES
ASNAWDDAGSALLIDHVVFVETRSAEKVSVTMRADILGSLSDVGVFETAEGQLPDPGPIELVKTSDGWRIDRLPNGVFLD
WQQFQETYKRNTLYFADPTGKTVVPDPRYVAVSDRDQLATELVSKLLAGPRPEMARTVRNLLAPPLRLRGPVTRADGGKS
GIGRGYGGARVDMEKLSTTDPHSRQLLAAQIIWTLARADIRGPYVINADGAPLEDRFAEGWTTSDVAATDPGVADGAAAG
LHALVNGSLVAMDAQRVTPVPGAFGRMPEQTAAAVSRSGRQVASVVTLGRGAPDEAASLWVGDLGGEAVQSADGHSLSRP
SWSLDDAVWVVVDTNVVLRAIQDPASGQPARIPVDSTAVASRFPGAINDLQLSRDGTRAAMVIGGQVILAGVEQTQAGQF
ALTYPRRLGFGLGSSVVSLSWRTGDDIVVTRTDAAHPVSYVNLDGVNSDAPSRGLQTPLTAIAANPSTVYVAGPQGVLMY
SASVESRPGWADVPGLMVPGAAPVLPG
>P9WK63 ~~~lpqE~~~Putative lipoprotein LpqE~~~COG2847
MNRCNIRLRLAGMTTWVASIALLAAALSGCGAGQISQTANQKPAVNGNRLTINNVLLRDIRIQAVQTSDFIQPGKAVDLV
LVAVNQSPDVSDRLVGITSDIGSVTVAGDARLPASGMLFVGTPDGQIVAPGPLPSNQAAKATVNLTKPIANGLTYNFTFK
FEKAGQGSVMVPISAGLATPHE
>A5U990 ~~~lpqH~~~Lipoprotein LpqH~~~
MKRGLTVAVAGAAILVAGLSGCSSNKSTTGSGETTTAAGTTASPGAASGPKVVIDGKDQNVTGSVVCTTAAGNVNIAIGG
AATGIAAVLTDGNPPEVKSVGLGNVNGVTLGYTSGTGQGNASATKDGSHYKITGTATGVDMANPMSPVNKSFEIEVTCS
>P9WK61 ~~~lpqH~~~Lipoprotein LpqH~~~
MKRGLTVAVAGAAILVAGLSGCSSNKSTTGSGETTTAAGTTASPGAASGPKVVIDGKDQNVTGSVVCTTAAGNVNIAIGG
AATGIAAVLTDGNPPEVKSVGLGNVNGVTLGYTSGTGQGNASATKDGSHYKITGTATGVDMANPMSPVNKSFEIEVTCS
>L7N6B0 3.2.1.52~~~lpqI~~~Beta-hexosaminidase LpqI~~~COG1472
MAFPRTLAILAAAAALVVACSHGGTPTGSSTTSGASPATPVAVPVPRSCAEPAGIPALLSPRDKLAQLLVVGVRDAADAQ
AVVTNYHVGGILIGSDTDLTIFDGALAEIVAGGGPLPLAVSVDEEGGRVSRLRSLIGGTGPSARELAQTRTVQQVRDLAR
DRGRQMRKLGITIDFAPVVDVTDAPDDTVIGDRSFGSDPATVTAYAGAYAQGLRDAGVLPVLKHFPGHGRGSGDSHNGGV
TTPPLDDLVGDDLVPYRTLVTQAPVGVMVGHLQVPGLTGSEPASLSKAAVNLLRTGTGYGAPPFDGPVFSDDLSGMAAIS
DRFGVSEAVLRTLQAGADIALWVTTKEVPAVLDRLEQALRAGELPMSAVDRSVVRVATMKGPNPGCGR
>P96264 3.4.11.1~~~lpqL~~~Probable lipoprotein aminopeptidase LpqL~~~COG2234
MVNKSRMMPAVLAVAVVVAFLTTGCIRWSTQSRPVVNGPAAAEFAVALRNRVSTDAMMAHLSKLQDIANANDGTRAVGTP
GYQASVDYVVNTLRNSGFDVQTPEFSARVFKAEKGVVTLGGNTVEARALEYSLGTPPDGVTGPLVAAPADDSPGCSPSDY
DRLPVSGAVVLVDRGVCPFAQKEDAAAQRGAVALIIADNIDEQAMGGTLGANTDVKIPVVSVTKSVGFQLRGQSGPTTVK
LTASTQSFKARNVIAQTKTGSSANVVMAGAHLDSVPEGPGINDNGSGVAAVLETAVQLGNSPHVSNAVRFAFWGAEEFGL
IGSRNYVESLDIDALKGIALYLNFDMLASPNPGYFTYDGDQSLPLDARGQPVVPEGSAGIERTFVAYLKMAGKTAQDTSF
DGRSDYDGFTLAGIPSGGLFSGAEVKKSAEQAELWGGTADEPFDPNYHQKTDTLDHIDRTALGINGAGVAYAVGLYAQDL
GGPNGVPVMADRTRHLIAKP
>O53780 ~~~lpqN~~~Lipoprotein LpqN~~~
MKHFTAAVATVALSLALAGCSFNIKTDSAPTTSPTTTSPTTSTTTTSATTSAQAAGPNYTIADYIRDNHIQETPVHHGDP
GSPTIDLPVPDDWRLLPESSRAPYGGIVYTQPADPNDPPTIVAILSKLTGDIDPAKVLQFAPGELKNLPGFQGSGDGSAA
TLGGFSAWQLGGSYSKNGKLRTVAQKTVVIPSQGAVFVLQLNADALDDETMTLMDAANVIDEQTTITP
>O53859 ~~~lpqS~~~Lipoprotein LpqS~~~
MVWMRSAIVAVALGVTVAAVAAACWLPQLHRHVAHPNHPLTTSVGSEFVINTDHGHLVDNSMPPCPERLATAVLPRSATP
VLLPDVVAAAPGMTAALTDPVAPAARGPPAAQGSVRTGQDLLTRFCLARR
>P9WK59 ~~~lpqT~~~Putative lipoprotein LpqT~~~
MAGRRCPQDSVRPLAVAVAVATLAMSAVACGPKSPDFQSILSTSPTTSAVSTTTEVPVPLWKYLESVGVTGEPVAPSSLT
DLTVSIPTPPGWAPMKNPNITPNTEMIAKGESYPTAMLMVFKLHRDFDIAEALKHGTADARLSTNFTELDSSTADFNGFP
SSMIQGSYDLHGRRLHTWNRIVFPTGAPPAKQRYLVQLTITSLANEAVKHASDIEAIIAGFVVAAK
>A0R2I8 ~~~~~~Probable monoacyl phosphatidylinositol tetramannoside-binding protein LpqW~~~COG0747
MGVPTPARRARLTFGALLAVPTLLLGGCTVSPPPAPQSTETTETTPPPPPKAPTQIIMAIDSIGPGFNPHLLSDQSPVNA
AIASLVLPSSFRPVPDPTSPTGSRWELDTTLLESAEVTNENPFTVTYKIRPEAQWTDNAPIAADDYWYLWRQMVSQPGVV
DPAGYDLITGVQSVEGGKQAVVTFSQPYPAWRELFNDILPAHIVKDIPGGFGAGLARAMPVTGGQFRVETIDPQRDEILL
ARNDRFWSVPAKPDLVLFRRGGAPAALADSIRNGDTQVAQVHGGAATFAQLSAIPDVRTARIVTPRVMQLTLRAQQPKLA
DPQVRKAILGLIDVDLLASVGAGDDNTVTLAQAQVRSPSDPGYVPTAPPAMTRDDALELLRDAGYVSEPVPPPDNTADDP
PPDNGRERIVKDGVPLTIVLGVASNDPTSVAVANTAADQLRNVGIDASVLALDPVALYGDALVNNRVDAVVGWRQAGGDL
ATVLASRYGCRALEATPVATAVPGPATTTSQAPTTTTTTTPPATTTPTPTAPIPAPESGELVQAPSNITGICDRSIQPRI
DAALDGTDDIADVIQAVEPRLWNMATVLPILQDTTIVAAGPSVQNVSLTGAVPVGIVGDAGDWTKTK
>P9WGU7 ~~~lpqW~~~Probable monoacyl phosphatidylinositol tetramannoside-binding protein LpqW~~~COG0747
MGVPSPVRRVCVTVGALVALACMVLAGCTVSPPPAPQSTDTPRSTPPPPRRPTQIIMGIDWIGPGFNPHLLSDLSPVNAA
ISALVLPSAFRPIPDPNTPTGSRWEMDPTLLVSADVTNNHPFTVTYKIRPEAQWTDNAPIAADDFWYLWQQMVTQPGVVD
PAGYHLITSVQSLEGGKQAVVTFAQPYPAWRELFTDILPAHIVKDIPGGFASGLARALPVTGGQFRVENIDPQRDEILIA
RNDRYWGPPSKPGIILFRRAGAPAALADSVRNGDTQVAQVHGGSAAFAQLSAIPDVRTARIVTPRVMQFTLRANVPKLAD
TQVRKAILGLLDVDLLAAVGAGTDNTVTLDQAQIRSPSDPGYVPTAPPAMSSAAALGLLEASGFQVDTNTSVSPAPSVPD
STTTSVSTGPPEVIRGRISKDGEQLTLVIGVAANDPTSVAVANTAADQLRDVGIAATVLALDPVTLYHDALNDNRVDAIV
GWRQAGGNLATLLASRYGCPALQATTVPAANAPTTAPSAPIGPTPSAAPDTATPPPTAPRRPSDPGALVKAPSNLTGICD
RSIQSNIDAALNGTKNINDVITAVEPRLWNMSTVLPILQDTTIVAAGPSVQNVSLSGAVPVGIVGDAGQWVKTGQ
>G7CES0 ~~~lpqY~~~Trehalose-binding lipoprotein LpqY~~~COG1653
MDGRQVVRARRWCATAAVALMTASTVAACGSDSGEIVISYYTPANEAATFTAVAQRCNAELGGRFRIEQRSLPREADAQR
LQLARRLTGNDRSLDVMALDVVWTAEFAEAGWALPLSEDPAGLAEADATTNTLPGPLETAKWNGELYAAPITTNTQLLWY
RADLMDEPPATWDEMLSEAARLHAQGGPSWIAVQGKQYEGLVVWFNTLLESAGGQVLSDDGQRVTLTDTPEHRAATVKAL
EIIKAVATAPGADPSITQTDENTARLALEQGRAALEVNWPYVLPSLLENAIKGGVGFLPLNENPALRGSINDVGTFAPTD
EQFDLALNASKEVFGFARYPGVRPDEPARVTLGGLNLAVASTTRHKAEAFEAVRCLRNEENQRLTSIEGGLPAVRTSLYD
DPQFQAKYPQYEIIRDQLINAAVRPATPVYQAMSTRMSATLAPISQIDPERTADELAEQVQQAIDGKGLIP
>P9WGU9 ~~~lpqY~~~Trehalose-binding lipoprotein LpqY~~~COG1653
MVMSRGRIPRLGAAVLVALTTAAAACGADSQGLVVSFYTPATDGATFTAIAQRCNQQFGGRFTIAQVSLPRSPNEQRLQL
ARRLTGNDRTLDVMALDVVWTAEFAEAGWALPLSDDPAGLAENDAVADTLPGPLATAGWNHKLYAAPVTTNTQLLWYRPD
LVNSPPTDWNAMIAEAARLHAAGEPSWIAVQANQGEGLVVWFNTLLVSAGGSVLSEDGRHVTLTDTPAHRAATVSALQIL
KSVATTPGADPSITRTEEGSARLAFEQGKAALEVNWPFVFASMLENAVKGGVPFLPLNRIPQLAGSINDIGTFTPSDEQF
RIAYDASQQVFGFAPYPAVAPGQPAKVTIGGLNLAVAKTTRHRAEAFEAVRCLRDQHNQRYVSLEGGLPAVRASLYSDPQ
FQAKYPMHAIIRQQLTDAAVRPATPVYQALSIRLAAVLSPITEIDPESTADELAAQAQKAIDGMGLLP
>P9WK55 ~~~lprA~~~Lipoprotein LprA~~~
MKHPPCSVVAAATAILAVVLAIGGCSTEGDAGKASDTAATASNGDAAMLLKQATDAMRKVTGMHVRLAVTGDVPNLRVTK
LEGDISNTPQTVATGSATLLVGNKSEDAKFVYVDGHLYSDLGQPGTYTDFGNGASIYNVSVLLDPNKGLANLLANLKDAS
VAGSQQADGVATTKITGNSSADDIATLAGSRLTSEDVKTVPTTVWIASDGSSHLVQIQIAPTKDTSVTLTMSDWGKQVTA
TKPV
>P9WK53 ~~~lprB~~~Putative lipoprotein LprB~~~COG1188
MRRKVRRLTLAVSALVALFPAVAGCSDSGDNKPGATIPSTPANAEGRHGPFFPQCGGVSDQTVTELTRVTGLVNTAKNSV
GCQWLAGGGILGPHFSFSWYRGSPIGRERKTEELSRASVEDINIDGHSGFIAIGNEPSLGDSLCEVGIQFSDDFIEWSVS
FSQKPFPLPCDIAKELTRQSIANSK
>P9WK49 ~~~lprE~~~Putative lipoprotein LprE~~~
MPGVWSPPCPTTPRVGVVAALVAATLTGCGSGDSTVAKTPEATPSLSTAHPAPPSSEPSPPSATAAPPSNHSAAPVDPCA
VNLASPTIAKVVSELPRDPRSEQPWNPEPLAGNYNECAQLSAVVIKANTNAGNPTTRAVMFHLGKYIPQGVPDTYGFTGI
DTSQCTGDTVALTYASGIGLNNVVKFRWNGGGVELIGNTTGG
>P65315 ~~~lprF~~~Putative diacylated glycolipid transporter LprF~~~
MNGLISQACGSHRPRRPSSLGAVAILIAATLFATVVAGCGKKPTTASSPSPGSPSPEAQQILQDSSKATKGLHSVHVVVT
VNNLSTLPFESVDADVTNQPQGNGQAVGNAKVRMKPNTPVVATEFLVTNKTMYTKRGGDYVSVGPAEKIYDPGIILDKDR
GLGAVVGQVQNPTIQGRDAIDGLATVKVSGTIDAAVIDPIVPQLGKGGGRLPITLWIVDTNASTPAPAANLVRMVIDKDQ
GNVDITLSNWGAPVTIPNPAG
>P9WK47 ~~~lprF~~~Putative diacylated glycolipid transporter LprF~~~
MNGLISQACGSHRPRRPSSLGAVAILIAATLFATVVAGCGKKPTTASSPSPGSPSPEAQQILQDSSKATKGLHSVHVVVT
VNNLSTLPFESVDADVTNQPQGNGQAVGNAKVRMKPNTPVVATEFLVTNKTMYTKRGGDYVSVGPAEKIYDPGIILDKDR
GLGAVVGQVQNPTIQGRDAIDGLATVKVSGTIDAAVIDPIVPQLGKGGGRLPITLWIVDTNASTPAPAANLVRMVIDKDQ
GNVDITLSNWGAPVTIPNPAG
>A0A0H3M3S8 ~~~lprG~~~Lipoarabinomannan carrier protein LprG~~~
MRTPRRHCRRIAVLAAVSIAATVVAGCSSGSKPSGGPLPDAKPLVEEATAQTKALKSAHMVLTVNGKIPGLSLKTLSGDL
TTNPTAATGNVKLTLGGSDIDADFVVFDGILYATLTPNQWSDFGPAADIYDPAQVLNPDTGLANVLANFADAKAEGRDTI
NGQNTIRISGKVSAQAVNQIAPPFNATQPVPATVWIQETGDHQLAQAQLDRGSGNSVQMTLSKWGEKVQVTKPPVS
>Q9CCP6 ~~~lprG~~~Lipoarabinomannan carrier protein LprG~~~
MQAPKHHRRLFAVLATLNTATAVIAGCSSGSNLSSGPLPDATTWVKQATDITKNVTSAHLVLSVNGKITGLPVKTLTGDL
TTHPNTVASGNATITLDGADLNANFVVVDGELYATLTPSKWSDFGKASDIYDVASILNPDAGLANVLANFTGAKTEGRDS
INGQSAVRISGNVSADAVNKIAPPFNATQPMPATVWIQETGDHQLAQIRIDNKSSGNSVQMTLSNWDEPVQVTKPQVS
>P9WK45 ~~~lprG~~~Lipoarabinomannan carrier protein LprG~~~
MRTPRRHCRRIAVLAAVSIAATVVAGCSSGSKPSGGPLPDAKPLVEEATAQTKALKSAHMVLTVNGKIPGLSLKTLSGDL
TTNPTAATGNVKLTLGGSDIDADFVVFDGILYATLTPNQWSDFGPAADIYDPAQVLNPDTGLANVLANFADAKAEGRDTI
NGQNTIRISGKVSAQAVNQIAPPFNATQPVPATVWIQETGDHQLAQAQLDRGSGNSVQMTLSKWGEKVQVTKPPVS
>P9WK43 ~~~lprH~~~Putative lipoprotein LprH~~~
MACLGRPGCRGWAGASLVLVVVLALAACTESVAGRAMRATDRSSGLPTSAKPARARDLLLQDGDRAPFGQVTQSRVGDSY
FTSAVPPECSAALLFKGSPLRPDGSSDHAEAAYNVTGPLPYAESVDVYTNVLNVHDVVWNGFRDVSHCRGDAVGVSRAGR
STPMRLRYFATLSDGVLVWTMSNPRWTCDYGLAVVPHAVLVLSACGFKPGFPMAEWASKRRAQLDSQV
>P9WK41 ~~~lprI~~~Lipoprotein LprI~~~COG4461
MRWIGVLVTALVLSACAANPPANTTSPTAGQSLDCTKPATIVQQLVCHDRQLTSLDHRLSTAYQQALAHRRSAALEAAQS
SWTMLRDACAQDTDPRTCVQEAYQTRLVQLAIADPATATPPVLTYRCPTQDGPLTAQFYNQFDPKTAVLNWKGDQVIVFV
ELSGSGARYGRQGIEYWEHQGEVRLDFHGATFVCRTS
>O33192 ~~~lprJ~~~Putative lipoprotein LprJ~~~
MTAHTHDGTRTWRTGRQATTLLALLAGVFGGAASCAAPIQADMMGNAFLTALTNAGIAYDQPATTVALGRSVCPMVVAPG
GTFESITSRMAEINGMSRDMASTFTIVAIGTYCPAVIAPLMPNRLQA
>I6Y3P1 ~~~lprN~~~Lipoprotein LprN~~~COG1463
MNRIWLRAIILTASSALLAGCQFGGLNSLPLPGTAGHGEGAYSVTVEMADVATLPQNSPVMVDDVTVGSVAGIVAVQRPD
GSFYAAVKLDLDKNVLLPANAVAKVSQTSLLGSLHVELAPPTDRPPTGRLVDGSRITEANTDRFPTTEEVFSALGVVVNK
GNVGALEEIIDETHQAVAGRQAQFVNLVPRLAELTAGLNRQVHDIIDALDGLNRVSAILARDKDNLGRALDTLPDAVRVL
NQNRDHIVDAFAALKRLTMVTSHVLAETKVDFGEDLKDLYSIVKALNDDRKDFVTSLQLLLTFPFPNFGIKQAVRGDYLN
VFTTFDLTLRRIGETFFTTAYFDPNMAHMDEILNPPDFLIGELANLSGQAADPFKIPPGTASGQ
>P0ADV1 ~~~lptA~~~Lipopolysaccharide export system protein LptA~~~COG1934
MKFKTNKLSLNLVLASSLLAASIPAFAVTGDTDQPIHIESDQQSLDMQGNVVTFTGNVIVTQGTIKINADKVVVTRPGGE
QGKEVIDGYGKPATFYQMQDNGKPVEGHASQMHYELAKDFVVLTGNAYLQQVDSNIKGDKITYLVKEQKMQAFSDKGKRV
TTVLVPSQLQDKNNKGQTPAQKKGN
>P45074 ~~~lptA~~~Lipopolysaccharide export system protein LptA~~~COG1934
MKLVSNKILFLATMVLASSSAFALKDDVNQPINIVSDNQSLDMEKSVVTFTDNVVITQGSIVIKANKVVITRPAEKSGKK
ETVEAFGTPVTFHQQLDNGKPVDGKANKVHYDLGNEFLTLTNNAELKQLDSKINGSVITYDVKKQQLKANGNGKSRVTTV
LIPSQLQQAKGK
>Q9HVV7 ~~~lptH~~~Lipopolysaccharide export system protein LptH~~~
MRFVNTLPLIFGLTAALGSSMALALPSDREQPIRVQADSAELDDKQGVAVYRGDVVVTQGSTKLTGNTVTLKQDKNGDIE
VVTSVGKPAYYEQKPAPDKDVTKAYGLTIQYFVTQNRVVLIDQAKVIQEGNTFEGEKIVYDTQRQIVNAGRATGSQVTSP
RPRIDMVIQPKKKAQ
>P0A9V1 7.5.2.-~~~lptB~~~Lipopolysaccharide export system ATP-binding protein LptB~~~COG1137
MATLTAKNLAKAYKGRRVVEDVSLTVNSGEIVGLLGPNGAGKTTTFYMVVGIVPRDAGNIIIDDDDISLLPLHARARRGI
GYLPQEASIFRRLSVYDNLMAVLQIRDDLSAEQREDRANELMEEFHIEHLRDSMGQSLSGGERRRVEIARALAANPKFIL
LDEPFAGVDPISVIDIKRIIEHLRDSGLGVLITDHNVRETLAVCERAYIVSQGHLIAHGTPTEILQDEHVKRVYLGEDFR
L
>P45073 7.5.2.-~~~lptB~~~Lipopolysaccharide export system ATP-binding protein LptB~~~COG1137
MSILTAENLAKSYKSRKVVSDVSLTVNSNEIVGLLGPNGAGKTTTFYMVVGLVRQDQGKIVIDGEDISLLPMHNRAQRGI
GYLPQEASIFRRLTVYENLMAVLEIRKDLTPQQRREKADELIEEFNISHIRDNLGQALSGGERRRVEIARALAANPKFIL
LDEPFAGVDPISVSDIKKIITDLRNRGLGVLITDHNVRETLDVCERAYIVGAGKIIATGTPEQVMNDEQVKRVYLGEQFK
L
>P0A9V4 7.5.2.-~~~lptB~~~Lipopolysaccharide export system ATP-binding protein LptB~~~
MATLTAKNLAKAYKGRRVVEDVSLTVNSGEIVGLLGPNGAGKTTTFYMVVGIVPRDAGNIIIDDDDISLLPLHARARRGI
GYLPQEASIFRRLSVYDNLMAVLQIRDDLSAEQREDRANELMEEFHIEHLRDSMGQSLSGGERRRVEIARALAANPKFIL
LDEPFAGVDPISVIDIKRIIEHLRDSGLGVLITDHNVRETLAVCERAYIVSQGHLIAHGTPTEILQDEHVKRVYLGEDFR
L
>P0ADV9 ~~~lptC~~~Lipopolysaccharide export system protein LptC~~~COG3117
MSKARRWVIIVLSLAVLVMIGINMAEKDDTAQVVVNNNDPTYKSEHTDTLVYNPEGALSYRLIAQHVEYYSDQAVSWFTQ
PVLTTFDKDKIPTWSVKADKAKLTNDRMLYLYGHVEVNALVPDSQLRRITTDNAQINLVTQDVTSEDLVTLYGTTFNSSG
LKMRGNLRSKNAELIEKVRTSYEIQNKQTQP
>P0ADW2 ~~~lptC~~~Lipopolysaccharide export system protein LptC~~~
MSKARRWVIIVLSLAVLVMIGINMAEKDDTAQVVVNNNDPTYKSEHTDTLVYNPEGALSYRLIAQHVEYYSDQAVSWFTQ
PVLTTFDKDKIPTWSVKADKAKLTNDRMLYLYGHVEVNALVPDSQLRRITTDNAQINLVTQDVTSEDLVTLYGTTFNSSG
LKMRGNLRSKNAELIEKVRTSYEIQNKQTQP
>P31554 ~~~lptD~~~LPS-assembly protein LptD~~~COG1452
MKKRIPTLLATMIATALYSQQGLAADLASQCMLGVPSYDRPLVQGDTNDLPVTINADHAKGDYPDDAVFTGSVDIMQGNS
RLQADEVQLHQKEAPGQPEPVRTVDALGNVHYDDNQVILKGPKGWANLNTKDTNVWEGDYQMVGRQGRGKADLMKQRGEN
RYTILDNGSFTSCLPGSDTWSVVGSEIIHDREEQVAEIWNARFKVGPVPIFYSPYLQLPVGDKRRSGFLIPNAKYTTTNY
FEFYLPYYWNIAPNMDATITPHYMHRRGNIMWENEFRYLSQAGAGLMELDYLPSDKVYEDEHPNDDSSRRWLFYWNHSGV
MDQVWRFNVDYTKVSDPSYFNDFDNKYGSSTDGYATQKFSVGYAVQNFNATVSTKQFQVFSEQNTSSYSAEPQLDVNYYQ
NDVGPFDTRIYGQAVHFVNTRDDMPEATRVHLEPTINLPLSNNWGSINTEAKLLATHYQQTNLDWYNSRNTTKLDESVNR
VMPQFKVDGKMVFERDMEMLAPGYTQTLEPRAQYLYVPYRDQSDIYNYDSSLLQSDYSGLFRDRTYGGLDRIASANQVTT
GVTSRIYDDAAVERFNISVGQIYYFTESRTGDDNITWENDDKTGSLVWAGDTYWRISERWGLRGGIQYDTRLDNVATSNS
SIEYRRDEDRLVQLNYRYASPEYIQATLPKYYSTAEQYKNGISQVGAVASWPIADRWSIVGAYYYDTNANKQADSMLGVQ
YSSCCYAIRVGYERKLNGWDNDKQHAVYDNAIGFNIELRGLSSNYGLGTQEMLRSNILPYQNTL
>P44846 ~~~lptD~~~LPS-assembly protein LptD~~~COG1452
MNKKHTLISLAILTALYSQQSLADLHEQCLMGVPKFSGEVVTGDVNALPVYIEADNAEINQPNDATYQGNVDLKQGNRHL
LAQSVQVKQSGNQSTPLRMAYVRNGFDYKDNQINMLGKDAEFNLDSHDGNLTNSEYEFVGRQGRGKADNITLHNNYRVMK
NATFTSCLHGDNAWAVDASEIRQYVKEEYAEMWHARFKIHGVPVFYTPYLQLPIGDRRRSGLLIPSAGTSSQDGLWYAQP
IYWNIAPNYDLTFTPKYMSRRGWQANGEFRYLTSIGEGKVAGEYLGKVRYSEYASDNRKRHLFYWNHNSSFLQNWRLNIN
YTRVSDKRYFNDFDSIYGRSTDGYANQYARIAYYQPNYNFSLSAHQFQIFDDIVNIGPYRAVPQLDFNYHKYDLANGWLN
FKLHSQAVRFDNDSKLMPTAWRFHAEPSLNSLMSNKYGSLNIETKLYATRYEQKKGSGKNAEDVQKTVNRVIPQFKVDLQ
SVLARDITFLKEYTQTFEPHVQYLYRPYRNQSNIGSTLNNDYLGFGYDSALVQQDYYSLFRDRRYSGLDRISSANQVTLG
GTTRFYDIAGEERFNLSAGQIYYLSNSRIDENPANKTPTSSSAWALESNWKISNKWYWRGSYQFDTHTNSTSLANTSLEY
NPEKNNLIQLNYRYANQEYIDQNLGKSANAYQQDIQQVGLVVGWEIANNWAVVGRYYQDLALQKPVEQYLGVQYNSCCWA
ASVGVKRNVTNHQNQTRNEIVYDNSIGITLELRGLGSNDHQSGIQEMLEKGKLPYIRAFSLD
>Q5F651 ~~~lptD~~~LPS-assembly protein LptD~~~
MARLFSLKPLVLALGFCFGTHCAADTVAAEEADGRVAEGGAQGASESAQASDLTLGSTCLFCSNESGSPERTEAAVQGSG
EASVPEDYTRIVADRMEGQSKVKVRAEGSVIIERDGAVLNTDWADYDQSGDTVTVGDRFALQQDGTLIRGETLTYNLDQQ
TGEAHNVRMETEQGGRRLQSVSRTAEMLGEGRYKLTETQFNTCSAGDAGWYVKAASVEADRGKGIGVAKHAAFVFGGVPL
FYTPWADFPLDGNRKSGLLVPSVSAGSDGVSLSVPYYFNLAPNFDATFAPGIIGERGATFDGQIRYLRPDYSGQTDLTWL
PHDKKSGRNNRYQAKWQHRHDISDTLQAGVDFNQVSDSGYYRDFYGGEEIAGNVNLNRRVWLDYGGRAAGGSLNAGLSVQ
KYQTLANQSGYKDEPYAIMPRLSADWHKNAGRAQIGVSAQFTRFSHDGRQDGSRLVVYPGIKWDFSNSWGYVRPKLGLHA
TYYSLDSFGGKASRSVGRVLPVVNIDGGTTFERNTRLFGGGVVQTIEPRLFYNYIPAKSQNDLPNFDSSESSFGYGQLFR
ENLYYGNDRINAANSLSTAVQSRILDGATGEERFRAGIGQKFYFKDDAVMLDGSVGKNPRSRSDWVAFASGGIGGRFTLD
SSIHYNQNDKRAEHYAVGAGYRPAPGKVLNARYKYGRNEKIYLQADGSYFYDKLSQLDLSAQWPLTRNLSAVVRYNYGFE
AKKPIEMLAGAEYKSSCGCWGAGVYAQRYVTGENTYKNAVFFSLQLKDLSSVGRNPAGRMDVAVPGYIPAHSLSAGRNKR
P
>Q9K187 ~~~lptD~~~LPS-assembly protein LptD~~~
MARLFSLKPLVLALGLCFGTHCAAADAVAAEETDNPTAGESVRSVSEPIQPTSLSLGSTCLFCSNESGSPERTEAAVQGS
GEASIPEDYTRIVADRMEGQSQVQVRAEGNVVVERNRTTLNTDWADYDQSGDTVTAGDRFALQQDGTLIRGETLTYNLEQ
QTGEAHNVRMEIEQGGRRLQSVSRTAEMLGEGHYKLTETQFNTCSAGDAGWYVKAASVEADREKGIGVAKHAAFVFGGVP
IFYTPWADFPLDGNRKSGLLVPSLSAGSDGVSLSVPYYFNLAPNLDATFAPSVIGERGAVFDGQVRYLRPDYAGQSDLTW
LPHDKKSGRNNRYQAKWQHRHDISDTLQAGVDFNQVSDSGYYRDFYGNKEIAGNVNLNRRVWLDYGGRAAGGSLNAGLSV
LKYQTLANQSGYKDKPYALMPRLSVEWRKNTGRAQIGVSAQFTRFSHDSRQDGSRLVVYPDIKWDFSNSWGYVRPKLGLH
ATYYSLNRFGSQEARRVSRTLPIVNIDSGATFERNTRMFGGEVLQTLEPRLFYNYIPAKSQNDLPNFDSSESSFGYGQLF
RENLYYGNDRINTANSLSAAVQSRILDGATGEERFRAGIGQKFYFKDDAVMLDGSVGKKPRNRSDWVAFASGSIGSRFIL
DSSIHYNQNDKRAENYAVGASYRPAQGKVLNARYKYGRNEKIYLKSDGSYFYDKLSQLDLSAQWPLTRNLSAVVRYNYGF
EAKKPIEVLAGAEYKSSCGCWGAGVYAQRYVTGENTYKNAVFFSLQLKDLSSVGRNPADRMDVAVPGYITAHSLSAGRNK
RP
>Q9I5U2 ~~~lptD~~~LPS-assembly protein LptD~~~
MAVKSLVFRRKFPLLVTGSLLALQPVAALTVQAADQFDCKVSATGGWDCSPLQNANANLPPRPAHTATSVSTAAAGSSVS
GSGGETVEAEPTQRLVTESGGRALKSRSADYSHLDWIPREKLTAAQLAEIGPYCGGSYIEPVRPGMDDGAPSDESPTYVS
AKASRYEQEKQIATLAGDVVLRQGSMQVEGDEANLHQLENRGELVGNVKLRDKGMLVVGDHAQVQLDNGEAQVDNAEYVI
HKAHARGSALYAKRSENAIIMLKDGTYTRCEPSSNAWTLKGNNVKLNPATGFGTATNATLRVKDFPVFYTPYIYFPIDDR
RQSGFLPPSFSSTSDTGFTLVTPYYFNLAPNYDATLYPRYMAKRGMMLEGEFRYLTHSSEGIVNAAYLNDKDDHREGFPD
YSKDRWLYGLKNTTGLDSRWLAEVDYTRISDPYYFQDLDTDLGVGSTTYVNQRGTLTYRGDTFTGRLNAQAYQLATTTDV
TPYDRLPQITFDGFLPYNPGGMQFTYGTEFVRFDRDLDENIYFNDDGSIRGKRPDASLQGLARATGDRMHLEPGMSLPMT
RSWGYVTPTLKYLYTKYDLDLDSQGKTDLNKRDESFDSNQDRSLPLVKVDSGLYFDRDTTFAGTPFRQTLEPRAMYLYVP
YKDQDSLPVFDTSEPSFSYDSLWRENRFTGKDRIGDANQLSLGVTSRFIEENGFERASISAGQIYYFRDRRVQLPGLTEK
DLKRLNLDPSGLDNDSWRSPYAFAGQYRFNRDWRINSDFNWNPNTSRTESGSAIFHYQPEVDPGKVVNVGYRYRADARRF
DSSRGTFRYGNENDIIKQHDFSVIWPLVPQWSVLARWQYDYNKNRTLEAFGGFEYDSCCWKLRLINRYWLDVDDDAFLVQ
SEKADRGIFLQIVLKGLGGIVGNKTEMFLDKGIQGYRQREDQAM
>Q8ZRW0 ~~~lptD~~~LPS-assembly protein LptD~~~
MKKRIPTLLATMIASALYSHQGLAADLASQCMLGVPSYDRPLVKGDTNDLPVTINADNAKGNYPDDAVFTGNVDIMQGNS
RLQADEVQLHQKQAEGQPEPVRTVDALGNVHYDDNQVILKGPKGWANLNTKDTNVWEGDYQMVGRQGRGKADLMKQRGEN
RYTILENGSFTSCLPGSDTWSVVGSEVIHDREEQVAEIWNARFKVGPVPIFYSPYLQLPVGDKRRSGFLIPNAKYTTKNY
FEFYLPYYWNIAPNMDATITPHYMHRRGNIMWENEFRYLTQAGEGVMELDYLPSDKVYEDDHPKEGDKHRWLFYWQHSGV
MDQVWRFNVDYTKVSDSSYFNDFDSKYGSSTDGYATQKFSVGYAVQNFDATVSTKQFQVFNDQNTSSYSAEPQLDVNYYH
NDLGPFDTRIYGQAVHFVNTKDNMPEATRVHLEPTINLPLSNRWGSLNTEAKLMATHYQQTNLDSYNSDPNNKNKLEDSV
NRVMPQFKVDGKLIFERDMAMLAPGYTQTLEPRVQYLYVPYRDQSGIYNYDSSLLQSDYNGLFRDRTYGGLDRIASANQV
TTGVTTRIYDDAAVERFNVSVGQIYYFTESRTGDDNIKWENDDKTGSLVWAGDTYWRISERWGLRSGVQYDTRLDSVATS
SSSLEYRRDQDRLVQLNYRYASPEYIQATLPSYYSTAEQYKNGINQVGAVASWPIADRWSIVGAYYFDTNSSKPADQMLG
LQYNSCCYAIRVGYERKLNGWDNDKQHAIYDNAIGFNIELRGLSSNYGLGTQEMLRSNILPYQSSM
>Q83SQ0 ~~~lptD~~~LPS-assembly protein LptD~~~
MKKRIPTLLATMIATALYSQQGLAADLASQCMLGVPSYDRPLVQGDTNDLPVTINADHAKGDYPDDAVFTGSVDIMQGNS
RLQADEVQLHQKEAPGQPEPVRTVDALGNVHYDDNQVILKGPKGWANLNTKDTNVWEGDYQMVGRQGRGKADLMKQRGEN
RYTILDNGSFTSCLPGSDTWSVVGSEIIHDREEQVAEIWNARFKVGPVPIFYSPYLQLPVGDKRRSGFLIPNAKYTTTNY
FEFYLPYYWNIAPNMDATITPHYMHRRGNIMWENEFRYLSQAGAGLMELDYLPSDKVYEDEHPNDDSSRRWLFYWNHSGV
MDQVWRFNVDYTKVSDPSYFNDFDNKYGSSTDGYATQKFSVGYAVQNFNATVSTKQFQVFSEQNTSSYSAEPQLDVNYYQ
NDVGPFDTRIYGQAVHFVNTRDDMPEATRVHLEPTINLPLSNNWGSINTEAKFLATHYQQTNLDWYNSRNTTKLDESVNR
VMPQFKVDGKMVFERDMEMLAPGYTQTLEPRAQYLYVPYRDQSDIYNYDSSLLQSDYSGLFRDRTYGGLDRIASANQVTT
GVTSRIYDDAAVERFNISVGQIYYFTESRTGDDNITWENDDKTGSLVWAGDTYWRISERWGLRGGIQYDTRLDNVATSNS
SIEYRRDEDRLVQLNYHYASPEYIQATLPKYYSTAEQYKNGISQVGAVASRPIADRWSIVGAYYYDTNANKQADSMLGVQ
YSSCCYAIRVGYERKLNGWDNDKQHAVYDNAIGFNIELRGLSSNYGLGTQEMLRSNILPYQNTL
>Q8ZIK3 ~~~lptD~~~LPS-assembly protein LptD~~~COG1452
MKKRFPTLLATLIWTALYSQHTLADLAEQCMLGVPTYDQPLVTGDPNQLPVRINADKTEANYPDNALFTGNVIVQQGNST
LTANQVELTQVQKPGEVIPLRTVTATGDVNYDDPQIKLKGPKGWSNLNTKDTDMDKGKYQMVGRQGRGDADLMKLRDQSR
YTILKNGTFTSCLPGDNSWSVVGSEVIHDREEQVVEVWNARFKIGKVPVFYSPYMQLPVGDKRRSGFLIPNAKFTSNNGF
EFLLPYYWNIAPNFDATITPHYMERRGLQWQNEFRYLLAPGSGTMALDWLPNDRIYTGPDGTDKNATRWLYYWGHSGVMD
QVWRFNINYTRVSDPAYFTDLTSQYGSTTDGYATQIFTAGYANENWNATLSSKQFQVFTAAGNSNAYRAQPQLDMNYYKN
DVGPFDMHVYGQAAKFTSVNPTNPEASRFHIEPTVNLPLSNSWGSINTEAKLLATHYQQDIPASFADNASNPKLKDSVNR
VLPQFKVDGKVVFDRSMDWATGFTQTLEPRAQYLYVPYRNQDDIYIYDTTLMQSDYSGLFRDRTYSGLDRIASANQVSTG
LTSRIYDDARVERFNVSVGQIYYFSRSRTGNTEAIDNSNATGSLVWAGDTFWRINDQLGLKGGAQYDTRLGSLTLGNAIM
EYRKDADRMIQLNYRYASPKYIQAAVPKVYNPDYQQGISQVGTTASWPIADRWAIVGAYYYDTKAKQPASQLVGLQYNTC
CWAVNLGYERKITGWNAQGQTSKYDNKIGFNIELRGLSGGHSLGTAQMLNSGILPYQSAF
>P0ADC1 ~~~lptE~~~LPS-assembly lipoprotein LptE~~~COG2980
MRYLATLLLSLAVLITAGCGWHLRDTTQVPSTMKVMILDSGDPNGPLSRAVRNQLRLNGVELLDKETTRKDVPSLRLGKV
SIAKDTASVFRNGQTAEYQMIMTVNATVLIPGRDIYPISAKVFRSFFDNPQMALAKDNEQDMIVKEMYDRAAEQLIRKLP
SIRAADIRSDEEQTSTTTDTPATPARVSTTLGN
>Q8ZQZ7 ~~~lptE~~~LPS-assembly lipoprotein LptE~~~
MRYLVTLLLSLAVLVTAGCGWHLRSTTQVPASMKTMILDSGDPNGPLSRAVRNQLRLNNVNLLDKDTTRKDVPSLRLGTV
TILQDTASVFQDGQTAEYQMVMTVNASVLIPGHDIYPISTKVYRSFFDNPQMALAKDNEQAMIVQEMYDKAAEQLIRKLT
SVRAADIQATKEEATADNETAAPASTPARVSTTLSN
>Q83LX4 ~~~lptE~~~LPS-assembly lipoprotein LptE~~~
MRYLATLLLSLAVLITAGCGWHLRDTTQVPSTMKVMILDSGDPNGPLSRAVRNQLRLNGVELLDKETTRKDVPSLRLGKV
SIAKDTASVFRNGQTAEYQMIMTVNATVLIPGRDIYPISAKVFRSFFDNPQMALAKDNEQDMIVKEMYDRAAEQLIRKLP
SIRAADIRSDEEQTSTTTDTPATPARVSTMLGN
>Q7CJV2 ~~~lptE~~~LPS-assembly lipoprotein LptE~~~COG2980
MRHRILTLLLGLAVLVTAGCGFNLRGTTQVPTELQKLLLESSDPYGPLARSIRQQLRLNNVTIVDDAMRKDIPTLRIIGS
SESQETVSIFRNGVAAENQLVLHVQAQVLIPGHDIYPLQVNVFRTFFDNPLTALAKEAEAEVLRQEMREQAAQQLVRQLL
TVHAAEVKNTQKNGDKPVSDANAAQGSTPTAVNETTLGEPAVSTSAK
>P0AF98 ~~~lptF~~~Lipopolysaccharide export system permease protein LptF~~~COG0795
MIIIRYLVRETLKSQLAILFILLLIFFCQKLVRILGAAVDGDIPANLVLSLLGLGVPEMAQLILPLSLFLGLLMTLGKLY
TESEITVMHACGLSKAVLVKAAMILAVFTAIVAAVNVMWAGPWSSRHQDEVLAEAKANPGMAALAQGQFQQATNGSSVLF
IESVDGSDFKDVFLAQIRPKGNARPSVVVADSGHLTQLRDGSQVVTLNQGTRFEGTALLRDFRITDFQDYQAIIGHQAVA
LDPNDTDQMDMRTLWNTDTDRARAELNWRITLVFTVFMMALMVVPLSVVNPRQGRVLSMLPAMLLYLLFFLIQTSLKSNG
GKGKLDPTLWMWTVNLIYLALAIVLNLWDTVPVRRLRASFSRKGAV
>P0AFA1 ~~~lptF~~~Lipopolysaccharide export system permease protein LptF~~~
MIIIRYLVRETLKSQLAILFILLLIFFCQKLVRILGAAVDGDIPANLVLSLLGLGVPEMAQLILPLSLFLGLLMTLGKLY
TESEITVMHACGLSKAVLVKAAMILAVFTAIVAAVNVMWAGPWSSRHQDEVLAEAKANPGMAALAQGQFQQATNGSSVLF
IESVDGSDFKDVFLAQIRPKGNARPSVVVADSGHLTQLRDGSQVVTLNQGTRFEGTALLRDFRITDFQDYQAIIGHQAVA
LDPNDTDQMDMRTLWNTDTDRARAELNWRITLVFTVFMMALMVVPLSVVNPRQGRVLSMLPAMLLYLLFFLIQTSLKSNG
GKGKLDPTLWMWTVNLIYLALAIVLNLWDTVPVRRLRASFSRKGAV
>P0ADC6 ~~~lptG~~~Lipopolysaccharide export system permease protein LptG~~~COG0795
MQPFGVLDRYIGKTIFTTIMMTLFMLVSLSGIIKFVDQLKKAGQGSYDALGAGMYTLLSVPKDVQIFFPMAALLGALLGL
GMLAQRSELVVMQASGFTRMQVALSVMKTAIPLVLLTMAIGEWVAPQGEQMARNYRAQAMYGGSLLSTQQGLWAKDGNNF
VYIERVKGDEELGGISIYAFNENRRLQSVRYAATAKFDPEHKVWRLSQVDESDLTNPKQITGSQTVSGTWKTNLTPDKLG
VVALDPDALSISGLHNYVKYLKSSGQDAGRYQLNMWSKIFQPLSVAVMMLMALSFIFGPLRSVPMGVRVVTGISFGFVFY
VLDQIFGPLTLVYGIPPIIGALLPSASFFLISLWLLMRKS
>P0AD89 ~~~tnaC~~~Tryptophanase operon leader peptide~~~
MNILHICVTSKWFNIDNKIVDHRP
>Q2SWY6 2.3.1.129~~~lpxA~~~Acyl-[acyl-carrier-protein]--UDP-N-acetylglucosamine O-acyltransferase~~~
MSRIHPTAIIEPGAQLHETVEVGPYAIVGSNVTIGARTTIGSHSVIEGHTTIGEDNRIGHYASVGGRPQDMKYKDEPTRL
VIGDRNTIREFTTIHTGTVQDAGVTTLGDDNWIMAYVHIGHDCRVGSHVVLSSNAQMAGHVEIGDWAIVGGMSGVHQYVR
IGAHSMLGGASALVQDIPPFVIAAGNKAEPHGINVEGLRRRGFSPDAISALRSAYRILYKNSLSLEEAKVQLSELAQAGG
DGDAAVKALVDFVESSQRGIIR
>Q9PIM1 2.3.1.129~~~lpxA~~~Acyl-[acyl-carrier-protein]--UDP-N-acetylglucosamine O-acyltransferase~~~COG1043
MKKIHPSAVIEEGAQLGDDVVIEAYAYVSKDAKIGNNVVIKQGARILSDTTIGDHSRVFSYAIVGDIPQDISYKEEQKSG
VVIGKNATIREFATINSGTAKGDGFTRIGDNAFIMAYCHIAHDCLLGNNIILANNATLAGHVELGDFTVVGGLTPIHQFV
KVGEGCMIAGASALSQDIVPFCLAEGNRASIRSLNLVGIRRRFDKDEVDRLSRAFKTLFRQGDLKENAKNLLENQESENV
KKMCHFILETKRGIPVYRGKNNA
>P0A722 2.3.1.129~~~lpxA~~~Acyl-[acyl-carrier-protein]--UDP-N-acetylglucosamine O-acyltransferase~~~COG1043
MIDKSAFVHPTAIVEEGASIGANAHIGPFCIVGPHVEIGEGTVLKSHVVVNGHTKIGRDNEIYQFASIGEVNQDLKYAGE
PTRVEIGDRNRIRESVTIHRGTVQGGGLTKVGSDNLLMINAHIAHDCTVGNRCILANNATLAGHVSVDDFAIIGGMTAVH
QFCIIGAHVMVGGCSGVAQDVPPYVIAQGNHATPFGVNIEGLKRRGFSREAITAIRNAYKLIYRSGKTLDEVKPEIAELA
ETYPEVKAFTDFFARSTRGLIR
>O25927 2.3.1.129~~~lpxA~~~Acyl-[acyl-carrier-protein]--UDP-N-acetylglucosamine O-acyltransferase~~~COG1043
MSKIAKTAIISPKAEINKGVEIGEFCVIGDGVKLDEGVKLHNNVTLQGHTFVGKNTEIFPFAVLGTQPQDLKYKGEYSEL
IIGEDNLIREFCMINPGTEGGIKKTLIGDKNLLMAYVHVAHDCVIGSHCILANGVTLAGHIEIGDYVNIGGLTAIHQFVR
IAKGCMIAGKSALGKDVPPYCTVEGNRAFIRGLNRHRMRQLLESKDIDFIYALYKRLFRPIPSLRESAKLELEEHANNPF
VKEICSFILESSRGVAYKSSEYSSEEKQEE
>B4F258 2.3.1.129~~~lpxA~~~Acyl-[acyl-carrier-protein]--UDP-N-acetylglucosamine O-acyltransferase~~~COG1043
MIDKSAVIHPSSIIEEGAVIGANVRIGPFCVIGSHVEIGEGTDIKSHVVINGHTRIGRDNQIYQFASIGEVNQDLKYRGE
PTQVIIGDRNLIRESVTIHRGTTQGGNITKIGNDNLLMINTHVAHDCIIGDRCIIANNGTLGGHVTLGDYVIIGGMSAVH
QFCQIGSHVMVGGCSGVAQDVPPFVIAQGNHATPYGLNIEGLKRRGFAKEDLHAIRNAYKILYRNGKTLEEAREEIAQLA
ADNNNQYVKIFSDFLENSAKSNRGIIR
>A6V1E4 2.3.1.129~~~lpxA~~~Acyl-[acyl-carrier-protein]--UDP-N-acetylglucosamine O-acyltransferase~~~
MSLIDPRAIIDPSARLAADVQVGPWSIVGAEVEIGEGTVIGPHVVLKGPTKIGKHNRIYQFSSVGEDTPDLKYKGEPTRL
VIGDHNVIREGVTIHRGTVQDRAETTIGDHNLIMAYAHIGHDSVIGNHCILVNNTALAGHVHVDDWAILSGYTLVHQYCR
IGAHSFSGMGSAIGKDVPAYVTVFGNPAEARSMNFEGMRRRGFSSEAIHALRRAYKVVYRQGHTVEEALAELAESAAQFP
EVAVFRDSIQSATRGITR
>P10441 2.4.1.182~~~lpxB~~~Lipid-A-disaccharide synthase~~~COG0763
MTEQRPLTIALVAGETSGDILGAGLIRALKEHVPNARFVGVAGPRMQAEGCEAWYEMEELAVMGIVEVLGRLRRLLHIRA
DLTKRFGELKPDVFVGIDAPDFNITLEGNLKKQGIKTIHYVSPSVWAWRQKRVFKIGRATDLVLAFLPFEKAFYDKYNVP
CRFIGHTMADAMPLDPDKNAARDVLGIPHDAHCLALLPGSRGAEVEMLSADFLKTAQLLRQTYPDLEIVVPLVNAKRREQ
FERIKAEVAPDLSVHLLDGMGREAMVASDAALLASGTAALECMLAKCPMVVGYRMKPFTFWLAKRLVKTDYVSLPNLLAG
RELVKELLQEECEPQKLAAALLPLLANGKTSHAMHDTFRELHQQIRCNADEQAAQAVLELAQ
>O67648 3.5.1.108~~~lpxC~~~UDP-3-O-acyl-N-acetylglucosamine deacetylase~~~COG0774
MGLEKTVKEKLSFEGVGIHTGEYSKLIIHPEKEGTGIRFFKNGVYIPARHEFVVHTNHSTDLGFKGQRIKTVEHILSVLH
LLEITNVTIEVIGNEIPILDGSGWEFYEAIRKNILNQNREIDYFVVEEPIIVEDEGRLIKAEPSDTLEVTYEGEFKNFLG
RQKFTFVEGNEEEIVLARTFCFDWEIEHIKKVGLGKGGSLKNTLVLGKDKVYNPEGLRYENEPVRHKVFDLIGDLYLLGS
PVKGKFYSFRGGHSLNVKLVKELAKKQKLTRDLPHLPSVQAL
>P0A725 3.5.1.108~~~lpxC~~~UDP-3-O-acyl-N-acetylglucosamine deacetylase~~~COG0774
MIKQRTLKRIVQATGVGLHTGKKVTLTLRPAPANTGVIYRRTDLNPPVDFPADAKSVRDTMLCTCLVNEHDVRISTVEHL
NAALAGLGIDNIVIEVNAPEIPIMDGSAAPFVYLLLDAGIDELNCAKKFVRIKETVRVEDGDKWAEFKPYNGFSLDFTID
FNHPAIDSSNQRYAMNFSADAFMRQISRARTFGFMRDIEYLQSRGLCLGGSFDCAIVVDDYRVLNEDGLRFEDEFVRHKM
LDAIGDLFMCGHNIIGAFTAYKSGHALNNKLLQAVLAKQEAWEYVTFQDDAELPLAFKAPSAVLA
>P47205 3.5.1.108~~~lpxC~~~UDP-3-O-acyl-N-acetylglucosamine deacetylase~~~
MIKQRTLKNIIRATGVGLHSGEKVYLTLKPAPVDTGIVFCRTDLDPVVEIPARAENVGETTMSTTLVKGDVKVDTVEHLL
SAMAGLGIDNAYVELSASEVPIMDGSAGPFVFLIQSAGLQEQEAAKKFIRIKREVSVEEGDKRAVFVPFDGFKVSFEIDF
DHPVFRGRTQQASVDFSSTSFVKEVSRARTFGFMRDIEYLRSQNLALGGSVENAIVVDENRVLNEDGLRYEDEFVKHKIL
DAIGDLYLLGNSLIGEFRGFKSGHALNNQLLRTLIADKDAWEVVTFEDARTAPISYMRPAAAV
>B0VMV2 2.3.1.191~~~lpxD~~~UDP-3-O-acylglucosamine N-acyltransferase~~~
MKVQQYRLDELAHLVKGELIGEGSLQFSNLASLENAEVNHLTFVNGEKHLDQAKVSRAGAYIVTAALKEHLPEKDNFIIV
DNPYLAFAILTHVFDKKISSTGIESTARIHPSAVISETAYIGHYVVIGENCVVGDNTVIQSHTKLDDNVEVGKDCFIDSY
VTITGSSKLRDRVRIHSSTVIGGEGFGFAPYQGKWHRIAQLGSVLIGNDVRIGSNCSIDRGALDNTILEDGVIIDNLVQI
AHNVHIGSNTAIAAKCGIAGSTKIGKNCILAGACGVAGHLSIADNVTLTGMSMVTKNISEAGTYSSGTGLFENNHWKKTI
VRLRQLADVPLTQITKRLDHIQAQIESLESTFNLRK
>P0CD76 2.3.1.191~~~lpxD~~~UDP-3-O-acylglucosamine N-acyltransferase~~~
MSQSTYSLEQLADFLKVEFQGNGATLLSGVEEIEEAKTAHITFLDNEKYAKHLKSSEAGAIIISRTQFQKYRDLNKNFLI
TSESPSLVFQKCLELFITPVDSGFPGIHPTAVIHPTAIIEDHVCIEPYAVVCQHAHVGSACHIGSGSVIGAYSTVGEHSY
IHPRVVIRERVSIGKRVIIQPGAVIGSCGFGYVTSAFGQHKHLKHLGKVIIEDDVEIGANTTIDRGRFKHSVVREGSKID
NLVQIAHQVEVGQHSMIVAQAGIAGSTKIGNHVIIGGQAGITGHICIADHVIMMAQTGVTKSITSPGIYGGAPARPYQEI
HRQVAKVRNLPRLEERIAALEKLVQKLEALSEQH
>P21645 2.3.1.191~~~lpxD~~~UDP-3-O-(3-hydroxymyristoyl)glucosamine N-acyltransferase~~~COG1044
MPSIRLADLAQQLDAELHGDGDIVITGVASMQSAQTGHITFMVNPKYREHLGLCQASAVVMTQDDLPFAKSAALVVKNPY
LTYARMAQILDTTPQPAQNIAPSAVIDATAKLGNNVSIGANAVIESGVELGDNVIIGAGCFVGKNSKIGAGSRLWANVTI
YHEIQIGQNCLIQSGTVVGADGFGYANDRGNWVKIPQIGRVIIGDRVEIGACTTIDRGALDDTIIGNGVIIDNQCQIAHN
VVIGDNTAVAGGVIMAGSLKIGRYCMIGGASVINGHMEICDKVTVTGMGMVMRPITEPGVYSSGIPLQPNKVWRKTAALV
MNIDDMSKRLKSLERKVNQQD
>Q9HXY6 2.3.1.191~~~lpxD~~~UDP-3-O-acylglucosamine N-acyltransferase~~~
MMSTLSYTLGQLAAHVGAEVRGDADLPIQGLATLQEAGPAQLSFLANPQYRKYLPESRAGAVLLTAADADGFAGTALVVA
NPYLAYASLSHLFDRKPKAAAGIHPTAIVAADAEVDPSASVGAYAVIESGARIGAGVSIGAHCVIGARSVIGEGGWLAPR
VTLYHDVTIGARVSIQSGAVIGGEGFGFANEKGVWQKIAQIGGVTIGDDVEIGANTTIDRGALSDTLIGNGVKLDNQIMI
AHNVQIGDHTAMAACVGISGSAKIGRHCMLAGGVGLVGHIEICDNVFVTGMTMVTRSITEPGSYSSGTAMQPAAEWKKSA
ARIRQLDDMARRLQQLEKRLAAVTSSGDASSDA
>P0A1X4 2.3.1.191~~~lpxD~~~UDP-3-O-(3-hydroxymyristoyl)glucosamine N-acyltransferase~~~
MPSIRLADLAEQLDAELHGDGDIVITGVASMQSATTGHITFMVNPKYREHLGLCQASAVVMTQDDLPFAKSAALVVKNPY
LTYARMAQILDTTPQPAQNIAPSAVIDATATLGSNVSVGANAVIESGVQLGDNVVIGAGCFVGKNSKIGAGSRLWANVTI
YHDIQIGENCLIQSSTVIGADGFGYANDRGNWVKIPQLGRVIIGDRVEIGACTTIDRGALDDTVIGNGVIIDNQCQIAHN
VVIGDNTAVAGGVIMAGSLKIGRYCMIGGASVINGHMEICDKVTVTGMGMVMRPITEPGVYSSGIPLQPNKVWRKTAALV
MNIDDMSKRLKAIERKVNQQD
>O24866 3.1.-.-~~~lpxE~~~Lipid A 1-phosphatase~~~COG0671
MKKFLFKQKFCESLPKSFSKTLLALSLGLILLGIFAPFPKVPKQPSVPLMFHFTEHYARFIPTILSVAIPLIQRDAVGLF
QVANASIATTLLTHTTKRALNHVTINDQRLGERPYGGNFNMPSGHSSMVGLAVAFLMRRYSFKKYFWLLPLVPLTMLARI
YLDMHTIGAVLTGLGVGMLCVSLFTSPKKP
>O84467 3.6.1.-~~~lpxG~~~UDP-2,3-diacylglucosamine pyrophosphatase LpxG~~~
MFVSVGITASLTTILAAPVLTWVWANHLEPNLLRVTRLNWNLPKKFAHLHGLRIVQISDLHLNHSTPDAFLKKVSRKISS
LSPDILVFTGDFVCRAKVETPERLKHFLCSLHAPLGCFACLGNHDYATYVSRDIHGKINTISAMNSRPLKRAFTSVYQSL
FASSRNEFADTLNPQIPNPHLVSILRNTPFQLLHNQSATLSDTINIVGLGDFFAKQFDPKKAFTDYNPTLPGIILSHNPD
TIHHLQDYPGDVVFSGHSHGPQISLPWPKFANTITNKLSGLENPELARGLFSFPEESRLLYVNRGLGGWKRIRFCSPPEI
CLMRCLYEP
>P43341 3.6.1.54~~~lpxH~~~UDP-2,3-diacylglucosamine hydrolase~~~COG2908
MATLFIADLHLCVEEPAITAGFLRFLAGEARKADALYILGDLFEAWIGDDDPNPLHRKMAAAIKAVSDSGVPCYFIHGNR
DFLLGKRFARESGMTLLPEEKVLELYGRRVLIMHGDTLCTDDAGYQAFRAKVHKPWLQTLFLALPLFVRKRIAARMRANS
KEANSSKSLAIMDVNQNAVVSAMEKHQVQWLIHGHTHRPAVHELIANQQPAFRVVLGAWHTEGSMVKVTADDVELIHFPF
>P44046 3.6.1.54~~~lpxH~~~UDP-2,3-diacylglucosamine hydrolase~~~COG2908
MKHSYFISDLHLSETQPELTALFVDFMQNLAPQAERLYILGDLFDFWIGDDEQSALIQQVKDLIKFVSDQGVQCYFQHGN
RDFLIGERFSKETGAQLLPDYQLITLYDKKILLCHGDTLCIDDEAYQQFRRRVHQKWLQRLFLCLPLKVRVIIAEKIRAK
SNQDKQAKSQEIMDVNQAFTAEKVQEFGVNLLIHGHTHREAIHQQEEFTRIVLGDWRKNYASILKMDESGEFGFIKD
>A6T5R0 3.6.1.54~~~lpxH~~~UDP-2,3-diacylglucosamine hydrolase~~~
MATLFIADLHLQTEEPAITAGFLRFLQGEARQADALYILGDLFEAWIGDDDPNPLHQQIASAIKAVVDAGVPCYFIHGNR
DFLVGQRFARQSGMILLAEEERLDLYGREVLIMHGDTLCTDDQGYLAFRAKVHTPWIQRLFLALPLFIRHRIAARMRADS
KAANSSKSMEIMDVNPQAVVDAMERHHVQWLIHGHTHRPAVHELQANGQPAWRVVLGAWHSEGSMVKVTPDDVELIHFPF
>Q9I2V0 3.6.1.54~~~lpxH~~~UDP-2,3-diacylglucosamine hydrolase~~~
MSVLFISDLHLEAERPDITRAFLSFLDERARRAEALYILGDFFEAWIGDDGMDAFQRSIAQSLRQVADGGTRIYLMHGNR
DFLIGKAFCREAGCTLLPDPSVIDLYGEPVLLMHGDSLCTRDEAYMRLRRWLRNPLTLWVLRHLPLATRHKLARKLRKES
RAQTRMKAVDIIDVTPEEVPRVMRGHGVRTLIHGHTHRPAEHPLDIDGQPARRIVLGDWDRQGWALEIDANGHRQAPFPL
>Q9A716 3.6.1.54~~~lpxI~~~UDP-2,3-diacylglucosamine pyrophosphatase LpxI~~~COG3494
MRKLGLIAGGGALPVELASHCEAAGRAFAVMRLRSFADPSLDRYPGADVGIGEFGKIFKALRAEGCDVVCFAGNVSRPDF
SALMPDARGLKVLPSLIVAARKGDDALLRRVLDEFEKEGFEIEGAHEVMGEMTLPRGRLGKVSPAPEHMADIDKALDVAR
EIGRLDIGQGAVVCEGLVLAVEAQEGTDAMLRRVADLPEAIRGRAERRLGVLAKAPKPIQETRVDLPTIGVATIHRAARA
GLAGIVGEAGRLLVVDREAVIAAADDLGLFVLGVDPQERP
>O67572 2.7.1.130~~~lpxK~~~Tetraacyldisaccharide 4'-kinase~~~COG1663
MLRSSLLPFSYLYEKIINFRNTLYDKGFLKIKKLPVPVISVGNLSVGGSGKTSFVMYLADLLKDKRVCILSRGYKRKSKG
TLIVSEYGNLKVSWEEAGDEPYLMAKLLPHVSVVASEDRYKGGLLALEKLSPEVFILDDGFQHRKLHRDLNILLLKKKDL
KDRLLPAGNLREPLKEIRRADALVLTYQEVEPFEFFTGKPTFKMFREFCCLLNSDFEEVPFDILKEREVIAFSGLGDNGQ
FRKVLKNLGIKVKEFMSFPDHYDYSDFTPEEGEIYLTTPKDLIKLQGYENVFALNFKVKLEREEKLKKLIYRIFY
>P27300 2.7.1.130~~~lpxK~~~Tetraacyldisaccharide 4'-kinase~~~COG1663
MIEKIWSGESPLWRLLLPLSWLYGLVSGAIRLCYKLKLKRAWRAPVPVVVVGNLTAGGNGKTPVVVWLVEQLQQRGIRVG
VVSRGYGGKAESYPLLLSADTTTAQAGDEPVLIYQRTDAPVAVSPVRSDAVKAILAQHPDVQIIVTDDGLQHYRLARDVE
IVVIDGVRRFGNGWWLPAGPMRERAGRLKSVDAVIVNGGVPRSGEIPMHLLPGQAVNLRTGTRCDVAQLEHVVAMAGIGH
PPRFFATLKMCGVQPEKCVPLADHQSLNHADVSALVSAGQTLVMTEKDAVKCRAFAEENWWYLPVDAQLSGDEPAKLLTQ
LTLLASGN
>P0ACV0 2.3.1.241~~~lpxL~~~Lipid A biosynthesis lauroyltransferase~~~COG1560
MTNLPKFSTALLHPRYWLTWLGIGVLWLVVQLPYPVIYRLGCGLGKLALRFMKRRAKIVHRNLELCFPEMSEQERRKMVV
KNFESVGMGLMETGMAWFWPDRRIARWTEVIGMEHIRDVQAQKRGILLVGIHFLTLELGARQFGMQEPGIGVYRPNDNPL
IDWLQTWGRLRSNKSMLDRKDLKGMIKALKKGEVVWYAPDHDYGPRSSVFVPLFAVEQAATTTGTWMLARMSGACLVPFV
PRRKPDGKGYQLIMLPPECSPPLDDAETTAAWMNKVVEKCIMMAPEQYMWLHRRFKTRPEGVPSRY
>P59198 2.3.1.243~~~lpxM1~~~Lipid A biosynthesis acyltransferase 1~~~
METKKNNSEYIPEFDKSFRHPRYWGAWLGVAAMAGIALTPPKFRDPILARLGRFAGRLGKSSRRRALINLSLCFPERSEA
EREAIVDEMFATAPQAMAMMAELAIRGPEKIQPRVGWQGLEIIEEMRRNNEKVIFLVPHGWAVDIPAMLMASQGQKMAAM
FHNQGNPVFDYVWNTVRRRFGGRLHARNDGIKPFIQSVRQGYWGYYLPDQDHGPEHSEFVDFFATYKATLPAIGRLMKVC
RARVVPLFPIYDGKTHRLTIQVRPPMDDLLEADDHTIERRMNEEVEIFVGPRPEQYTWILKLLKTRKPGEIQPYKRKDLY
PIK
>O06659 2.3.1.243~~~lpxM2~~~Lipid A biosynthesis acyltransferase 2~~~
MKKYKSEFIPEFKKNYLSPVYWFTWFVLGMIAGISMFPPSFRDPVLAKIGRWVGRLSRKARRRATINLSLCFPEKSDTER
EIIVDNMFATALQSIVMMAELAIRGPEKFQKRVFWKGLEILEEIRHNNRNVIFLVPHGWSVDIPAMLLAAQGEKMAAMFH
QQRNPVIDYVWNSVRRKFGGRLHSREDGIKPFIQSVRQGYWGYYLPDQDHGPEYSEFADFFATYKATLPIIGRLMNISQA
MIIPLFPVYDEKKHFLTIEVRPPMDACIASADNKMIARQMNKTVEILVGSHPEQYIWVLKLLKTRKSNEADPYP
>P24205 2.3.1.243~~~lpxM~~~Lipid A biosynthesis myristoyltransferase~~~COG1560
METKKNNSEYIPEFDKSFRHPRYWGAWLGVAAMAGIALTPPKFRDPILARLGRFAGRLGKSSRRRALINLSLCFPERSEA
EREAIVDEMFATAPQAMAMMAELAIRGPEKIQPRVDWQGLEIIEEMRRNNEKVIFLVPHGWAVDIPAMLMASQGQKMAAM
FHNQGNPVFDYVWNTVRRRFGGRLHARNDGIKPFIQSVRQGYWGYYLPDQDHGPEHSEFVDFFATYKATLPAIGRLMKVC
RARVVPLFPIYDGKTHRLTIQVRPPMDDLLEADDHTIARRMNEEVEIFVGPRPEQYTWILKLLKTRKPGEIQPYKRKDLY
PIK
>P0ACV2 2.3.1.242~~~lpxP~~~Lipid A biosynthesis palmitoleoyltransferase~~~COG1560
MFPQCKFSREFLHPRYWLTWFGLGVLWLWVQLPYPVLCFLGTRIGAMARPFLKRRESIARKNLELCFPQHSAEEREKMIA
ENFRSLGMALVETGMAWFWPDSRVRKWFDVEGLDNLKRAQMQNRGVMVVGVHFMSLELGGRVMGLCQPMMATYRPHNNQL
MEWVQTRGRMRSNKAMIGRNNLRGIVGALKKGEAVWFAPDQDYGRKGSSFAPFFAVENVATTNGTYVLSRLSGAAMLTVT
MVRKADYSGYRLFITPEMEGYPTDENQAAAYMNKIIEKEIMRAPEQYLWIHRRFKTRPVGESSLYI
>P76445 2.7.4.29~~~lpxT~~~Lipid A 1-diphosphate synthase~~~COG0671
MIKNLPQIVLLNIVGLALFLSWYIPVNHGFWLPIDADIFYFFNQKLVESKAFLWLVALTNNRAFDGCSLLAMGMLMLSFW
LKENAPGRRRIVIIGLVMLLTAVVLNQLGQALIPVKRASPTLTFTDINRVSELLSVPTKDASRDSFPGDHGMMLLIFSAF
MWRYFGKVAGLIALIIFVVFAFPRVMIGAHWFTDIIVGSMTVILIGLPWVLLTPLSDRLITFFDKSLPGKNKHFQNK
>A0A172U6X0 5.1.3.31~~~~~~L-ribulose 3-epimerase~~~
MAFPKRLEYGGHALVWSGDWSAAGARKAIAGAARAGYDYIEIALLDPWQIDVALTKDLLQEYNLRAHASLGLSAATDVTS
TDPAIVAKGDELLRKATDVLYALGGSELCGVIYCALGKYPGPASRENRANSVAAMQRLADYAADKGINIDLEVVNRYETN
IMNTGLEGLAFLDEVNRPNAFLHLDTYHMNIEENGMAKSVLAAGDRLGYVHIGESHRGYLGTGNVDFASFFAALKQIDYR
GPITFESFSSEIVDPKLSNTLCVWRNLWHDSDDLAGKALEFIKQRY
>Q98FW0 5.1.3.31~~~~~~L-ribulose 3-epimerase~~~COG1082
MARIGIHSFVWSASSAQSELERTLANTREAGFDLIEFSYLDPADVDIGGLAKRIADLGLGVAISIGLPGDGDISSADKAV
AARGVEILNETVALTRDLGGRKVAGILSAGHGLQLEAPTRDQWSRSTAALAKVAETAKAAGVTLNLEIVNRFESNLLNTA
AQGLAFIEDTGSDNIFLHLDTFHMNIEEADVGLAIRHAAGKIGYVHIGESHRGFLGTGNIDFAAIFDALTAVGYADDLSF
ESFSSEIVDENLSKKTAIWRNLWADNMALAKHARAFIGLGLETARRKAELVSARHKP
>P72358 ~~~lrgA~~~Antiholin-like protein LrgA~~~COG1380
MVVKQQKDASKPAHFFHQVIVIALVLFVSKIIESFMPIPMPASVIGLVLLFVLLCTGAVKLGEVEKVGTTLTNNIGLLFV
PAGISVVNSLGVISQAPFLIIGLIIVSTILLLICTGYVTQIIMKVTSRSKGDKVTKKIKIEEAQAHD
>P60643 ~~~lrgB~~~Antiholin-like protein LrgB~~~COG1346
MINHLALNTPYFGILLSVIPFFLATILFEKTNRFFLFAPLFVSMVFGVAFLYLTGIPYKTYKIGGDIIYFFLEPATICFA
IPLYKKREVLVKHWHRIIGGIGIGTVVALLIILTFAKLAQFANDVILSMLPQAATTAIALPVSAGIGGIKELTSLAVILN
GVIIYALGNKFLKLFRITNPIARGLALGTSGHTLGVAPAKELGPVEESMASIALVLVGVVVVAVVPVFVAIFF
>P36771 ~~~lrhA~~~Probable HTH-type transcriptional regulator LrhA~~~COG0583
MISANRPIINLDLDLLRTFVAVADLNTFAAAAAAVCRTQSAVSQQMQRLEQLVGKELFARHGRNKLLTEHGIQLLGYARK
ILRFNDEACSSLMFSNLQGVLTIGASDESADTILPFLLNRVSSVYPKLALDVRVKRNAYMAEMLESQEVDLMVTTHRPSA
FKALNLRTSPTHWYCAAEYILQKGEPIPLVLLDDPSPFRDMVLATLNKADIPWRLAYVASTLPAVRAAVKAGLGVTARPV
EMMSPDLRVLSGVDGLPPLPDTEYLLCYDPSSNNELAQVIYQAMESYHNPWQYSPMSAPEGDDSLLIERDIE
>P96582 ~~~lrpC~~~HTH-type transcriptional regulator LrpC~~~COG1522
MKLDQIDLNIIEELKKDSRLSMRELGRKIKLSPPSVTERVRQLESFGIIKQYTLEVDQKKLGLPVSCIVEATVKNADYER
FKSYIQTLPNIEFCYRIAGAACYMLKINAESLEAVEDFINKTSPYAQTVTHVIFSEIDTKNGRG
>Q54087 ~~~lrp~~~Leucine-rich protein~~~
MELKDYFPEMQVGPHPLGDKEWVSVKEGDQYVHFPKSCLSEKERLLLEVGLGQYEVLQPLGSPWQRYLLDHQGNPPQLFE
TSQFIYLNHQQVLPADLVELLQQMIAGLEVILPISTTQTAFLCRQATSIKVLRSLEGLLPTLESDFGLALTMFVGNAWYQ
VAAGTLRECFEEECQLLTAYLKQKSGGKLLTFAEVMLWSILSHQSFPALTRQFHQFLNPQSDMADVVHALWSEHGNLVQT
AQRLYIHRNSLQYKLDKFAQQSGLHLKQLDDLAFAYLFLLKY
>P0ACJ0 ~~~lrp~~~Leucine-responsive regulatory protein~~~COG1522
MVDSKKRPGKDLDRIDRNILNELQKDGRISNVELSKRVGLSPTPCLERVRRLERQGFIQGYTALLNPHYLDASLLVFVEI
TLNRGAPDVFEQFNTAVQKLEEIQECHLVSGDFDYLLKTRVPDMSAYRKLLGETLLRLPGVNDTRTYVVMEEVKQSNRLV
IKTR
>B5M9L6 1.14.13.-~~~~~~Putative epoxidase LasC~~~
MTNTRSAVVLGGGMAGMLVSSMLARHVGSVTVIDRDAFPAGPDLRKGVPQARHAHILWSGGARIVEELLPGTTDRLLGAG
AHRIGIPDGQVSYTAYGWQHRFPEAQFMIACSRALLDWTVREETLREERIALVEKTEVLALLGDAGRVTGVRVRDQESGE
EREVPADLVVDTTGRGSPSKRLLAELGLPAPEEEFVDSGMVYATRLFRAPEAAATNFPLVSVHADHRAGRPGCNAVLMPI
EDGRWIVTVSGTRGGEPPADDEGFARFARDGVRHPLVGELIAKAQPLTSVERSRSTVNRRLHYDRLATWPEGLVVLGDAV
AAFNPVYGHGMSAAAHSVLALRSQLGQRAFQPGLARAAQRAIAVAVDDAWVLATSHDIGYPGCRTQTRDPRLTRHAGERQ
RVTDLVGLTATRNQVVNRAAVALNTLSAGMASMQDPAVMAAVRRGPEVPAPTEPPLRPDEVARLVSGAGVTA
>B6ZK72 5.5.1.-~~~~~~Epoxide hydrolase LasB~~~
MPAETVRKEVALEYCRRVNAGELEGVLQLFAPDALLVDPLGTEPVVGRAALAARLAPALRGAVHEEPGRPYAAHDGTSVV
LPATVTVGAPGAPPQRRGRTRVMGVIEVGEDGLIREMRVMWGVTDSSWTARPAPDEERRKELAREHCLRINDGDVDGLLK
LYSPRIRFEDPVGSWTRTGLEALRAHATMAVGSNVRETAGLTVAGQDGRHAAVTVSATMDYLPSGPLLARHHLMTLPAPA
DPHRALIGIEYVMVIGVDADGLIDEMRAYWGATDVSLLDPAA
>Q53353 1.13.11.43~~~~~~Lignostilbene-alpha,beta-dioxygenase isozyme I~~~
MAHFPQTPGFSGTLRPLRIEGDILDIEIEGEVPPQLNGTFHRVHPDAQFPPRFEDDQFFNGDGMVSLFRFHDGKIDFRQR
YAQTDKWKVERKAGKSLFGAYRNPLTDDASVQGMIRGTANTNVMVHAGKLYAMKEDSPCLIMDPLTLETEGYTNFDGKLQ
SQTFCAHPKIDPVTGNLCAFAYGAKGLMTLDMAYIEISPTGKLLKEIPFQNPYYCMMHDFGVTEDYAVFAVMPLLSSWDR
LEQRLPFFGFDTTLPCYLGILPRNGDARDLRWFKTGNCFVGHVMNAFNDGTKVHIDMPVSRNNSFPFFDVHGAPFDPVAG
QGFLTRWTVDMASNGDSFEKTERLFDRPDEFPRIDERYATRAYRHGWMLILDTEKPYEAPGGAFYALTNTLGHIDLATGK
SSSWWAGPRCAIQEPCFIPRSPDAPEGDGYVIALVDDHVANYSDLAIFDAQHVDQGPIARAKLPVRIRQGLHGNWADASR
LAVAA
>Q52008 1.13.11.43~~~lsdB~~~Lignostilbene-alpha,beta-dioxygenase isozyme III~~~
MAHFPDTSGMTGVLRPLRIEGDILDLEVEGEIPAQLDGTFHRVHPDAQFPPRFEDDQFFNGDGMVSLFRFHDGKIDFRQR
YAQTDKWKVERKAGKSLFGAYRNPLTDDASVQGMIRGTANTNVMVHAGKLYAMKEDSPCLIMDPLTLETEGYTNFDGKLK
NQTFSAHAKIDPVTGNFCNFGYAATGLLTTDCSYFEIDPAGNLLFETEFQVPYYCMMHDYGLTEHYAIFHIVPCSPNWDR
LKAGLPHFGFDTTLPVWLGVVPRGPGVTNKDVRWFKAPKTIFASHVMNAFEEGSKIHFDTPQAENNAFPFFPDIHGAPFD
PVAARPYLHRWTVDLGSNSEDFAEVRQLTSWIDEFPRVDARYVGQPYRHGWGLVMDPEMEMEFARGRASGFKMNRIGHWD
HATGKEDSWWCGPQSIIQEPCFVPRMADSAEGDGYIIALVDNLITNYSDLVVLDALNLKDGPIGRAKLPIRLRSGLHGNW
ADASKLPIAA
>O82881 3.1.-.-~~~lsoA~~~mRNA endoribonuclease LsoA~~~
MAQNPFKALNINIDKIESALTQNGVTNYSSNVKNERETHISGTYKGIDFLIKLMPSGGNTTIGRASGQNNTYFDEIALII
KENCLYSDTKNFEYTIPKFSDDDRANLFEFLSEEGITITEDNNNDPNCKHQYIMTTSNGDRVRAKIYKRGSIQFQGKYLQ
IASLINDFMCSILNMKEIVEQKNKEFNVDIKKETIESELHSKLPKSIDKIHEDIKKQLSCSLIMKKIDVEMEDYSTYCFS
ALRAIEGFIYQILNDVCNPSSSKNLGEYFTENKPKYIIREIHQETINGEIAEVLCECYTYWHENRHGLFHMKPGIADTKT
INKLESIAIIDTVCQLIDGGVARLKL
>Q7DKW4 ~~~lsoB~~~Antitoxin LsoB~~~
MKKDKKYQIEAIKNKDKTLFIVYATDIYSPSEFFSKIESDLKKKKSKGDVFFDLIIPNGGKKDRYVYTSFNGEKFSSYTL
NKVTKTDEYNDLSELSASFFKKNFDKINVNLLSKATSFALKKGIPI
>P00804 3.4.23.36~~~lspA~~~Lipoprotein signal peptidase~~~COG0597
MSQSICSTGLRWLWLVVVVLIIDLGSKYLILQNFALGDTVPLFPSLNLHYARNYGAAFSFLADSGGWQRWFFAGIAIGIS
VILAVMMYRSKATQKLNNIAYALIIGGALGNLFDRLWHGFVVDMIDFYVGDWHFATFNLADTAICVGAALIVLEGFLPSR
AKKQ
>P9WK99 3.4.23.36~~~lspA~~~Lipoprotein signal peptidase~~~COG0597
MPDEPTGSADPLTSTEEAGGAGEPNAPAPPRRLRMLLSVAVVVLTLDIVTKVVAVQLLPPGQPVSIIGDTVTWTLVRNSG
AAFSMATGYTWVLTLIATGVVVGIFWMGRRLVSPWWALGLGMILGGAMGNLVDRFFRAPGPLRGHVVDFLSVGWWPVFNV
ADPSVVGGAILLVILSIFGFDFDTVGRRHADGDTVGRRKADG
>Q9HVM5 3.4.23.36~~~lspA~~~Lipoprotein signal peptidase~~~
MPDVDRFGRLPWLWITVLVFVLDQVSKAFFQAELSMYQQIVVIPDLFSWTLAYNTGAAFSFLADSSGWQRWLFALIAIVV
SASLVVWLKRLKKGETWLAIALALVLGGALGNLYDRMVLGHVVDFILVHWQNRWYFPAFNLADSAITVGAVMLALDMFRS
KKSGEAAHG
>Q2FHP2 3.4.23.36~~~lspA~~~Lipoprotein signal peptidase~~~
MHKKYFIGTSILIAVFVVIFDQVTKYIIATTMKIGDSFEVIPHFLNITSHRNNGAAWGILSGKMTFFFIITIIILIALVY
FFIKDAQYNLFMQVAISLLFAGALGNFIDRILTGEVVDFIDTNIFGYDFPIFNIADSSLTIGVILIIIALLKDTSNKKEK
EVK
>P9WIP7 ~~~lsr2~~~Nucleoid-associated protein Lsr2~~~
MAKKVTVTLVDDFDGSGAADETVEFGLDGVTYEIDLSTKNATKLRGDLKQWVAAGRRVGGRRRGRSGSGRGRGAIDREQS
AAIREWARRNGHNVSTRGRIPADVIDAYHAAT
>P77257 7.6.2.13~~~lsrA~~~Autoinducer 2 import ATP-binding protein LsrA~~~COG1129
MQTSDTRALPLLCARSVYKQYSGVNVLKGIDFTLHQGEVHALLGGNGAGKSTLMKIIAGITPADSGTLEIEGNNYVRLTP
VHAHQLGIYLVPQEPLLFPSLSIKENILFGLAKKQLSMQKMKNLLAALGCQFDLHSLAGSLDVADRQMVEILRGLMRDSR
ILILDEPTASLTPAETERLFSRLQELLATGVGIVFISHKLPEIRQIADRISVMRDGTIALSGKTSELSTDDIIQAITPAV
REKSLSASQKLWLELPGNRPQHAAGTPVLTLENLTGEGFRNVSLTLNAGEILGLAGLVGAGRTELAETLYGLRTLRGGRI
MLNGKEINKLSTGERLLRGLVYLPEDRQSSGLNLDASLAWNVCALTHNLRGFWAKTAKDNATLERYRRALNIKFNQPEQA
ARTLSGGNQQKILIAKCLEASPQVLIVDEPTRGVDVSARNDIYQLLRSIAAQNVAVLLISSDLEEIELMADRVYVMHQGE
ITHSALTERDINVETIMRVAFGDSQRQEASC
>Q8ZKQ4 7.6.2.13~~~lsrA~~~Autoinducer 2 import ATP-binding protein LsrA~~~
MQISHNTASPLICVQNIYKSYSGVEVLKGIDFTLHAGEVHALLGGNGAGKSTLMKIIAGIVPPDGGTIDIAGVRCSHLTP
LKAHQYGIYLVPQEPLLFPSLSVRENILFGLQGRQASTEKMQQLLKAMGCQLDPASAAGTLDVADRQIVEIMRGLMRDSR
ILILDEPTASLTPAETDRLFTRLQELLKKGVGIVFISHKLPEIRQLAHCVSVMRDGKIALFGKTHDLSTDEIIQAITPAT
QGVSLSANQKLWLELPGSRPQNERGATVLALESLTGEGFMNINLEVRAGEILGLAGLVGAGRTELAETLYGIRPVNAGRM
LFNGQEINALTTQQRLQLGLVYLPEDRQSSGLYLDASLAWNVCSLTHNQKGFWIKPQRDNATLERYHRALNIKLNNAEQA
ARTLSGGNQQKVLIAKCLEASPQLLIVDEPTRGVDVSARSDIYQLLRSIAQQNVAVLFISSDLEEIEQMADRVYVMHQGE
LGGPALCGEEINVDTIMHVAFGEHGASEATC
>P76142 ~~~lsrB~~~Autoinducer 2-binding protein LsrB~~~COG1879
MTLHRFKKIALLSALGIAAISMNVQAAERIAFIPKLVGVGFFTSGGNGAQQAGKELGVDVTYDGPTEPSVSGQVQLINNF
VNQGYNAIIVSAVSPDGLCPALKRAMQRGVRVLTWDSDTKPECRSYYINQGTPAQLGGMLVDMAARQVNKDKAKVAFFYS
SPTVTDQNQWVKEAKAKIAKEHPGWEIVTTQFGYNDATKSLQTAEGILKAYSDLDAIIAPDANALPAAAQAAENLKNDKV
AIVGFSTPNVMRPYVERGTVKEFGLWDVVQQGKISVYVADALLKKGSMKTGDKLDIKGVGQVEVSPNSVQGYDYEADGNG
IVLLPERVIFNKENIGKYDF
>Q8Z2X8 ~~~lsrB~~~Autoinducer 2-binding protein LsrB~~~COG1879
MARHSIKMIALLTAFGLASAVMTVQAAERIAFIPKLVGVGFFTSGGNGAQEAGKALGIDVTYDGPTEPSVSGQVQLVNNF
VNQGYDAIIVSAVSPDGLCPALKRAMQRGVKILTWDSDTKPECRSYYINQGTPKQLGSMLVEMAAHQVDKEKAKVAFFYS
SPTVTDQNQWVKEAKAKISQEHPGWEIVTTQFGYNDATKSLQTAEGIIKAYPDLDAIIAPDANALPAAAQAAENLKRNNL
AIVGFSTPNVMRPYVQRGTVKEFGLWDVVQQGKISVYVANALLKNMPMNVGDSLDIPGIGKVTVSPNSEQGYHYEAKGNG
IVLLPERVIFNKDNIDKYDF
>Q8ZKQ1 ~~~lsrB~~~Autoinducer 2-binding protein LsrB~~~
MARHSIKMIALLTAFGLASAAMTVQAAERIAFIPKLVGVGFFTSGGNGAQEAGKALGIDVTYDGPTEPSVSGQVQLVNNF
VNQGYDAIIVSAVSPDGLCPALKRAMQRGVKILTWDSDTKPECRSYYINQGTPKQLGSMLVEMAAHQVDKEKAKVAFFYS
SPTVTDQNQWVKEAKAKISQEHPGWEIVTTQFGYNDATKSLQTAEGIIKAYPDLDAIIAPDANALPAAAQAAENLKRNNL
AIVGFSTPNVMRPYVQRGTVKEFGLWDVVQQGKISVYVANALLKNMPMNVGDSLDIPGIGKVTVSPNSEQGYHYEAKGNG
IVLLPERVIFNKDNIDKYDF
>Q74PW2 ~~~lsrB~~~Autoinducer 2-binding protein LsrB~~~COG1879
MRTQRLKKLALVCALGFACITTAQAAERIAFIPKLVGVGFFTSGGKGAVDAGKALGVDVTYDGPTEPSVSGQVQLINNFV
NQGYNAIVVSAVSPDGLCPALKRAMQRGVKILTWDSDTKPECRSVYINQGTPNQLGSMLVDMAANQVKKEQAKVAFFYSS
PTVTDQNQWVNEAKKKIQQEHPGWEIVTTQFGYNDATKSLQTAEGILKAYADLDAIIAPDANALPAAAQAAENLKRANVA
IVGFSTPNVMRPYVERGTVKEFGLWDVVNQGKISVYVANEMLKKGDLNVGDKIDIPNIGVVEVSPNRVQGYDYEAKGNGI
VLLPQRVIFTKENISKYDF
>P77672 ~~~lsrC~~~Autoinducer 2 import system permease protein LsrC~~~COG1172
MLKFIQNNREITALLAVVLLFVLPGFLDRQYLSVQTLTMVYSSAQILILLAMGATLVMLTRNIDVSVGSITGMCAVLLGM
LLNAGYSLPVACVATLLLGLLAGFFNGVLVAWLKIPAIVATLGTLGLYRGIMLLWTGGKWIEGLPAELKQLSAPLLLGVS
AIGWLTIILVAFMAWLLAKTAFGRSFYATGDNLQGARQLGVRTEAIRIVAFSLNGCMAALAGIVFASQIGFIPNQTGTGL
EMKAIAACVLGGISLLGGSGAIIGAVLGAWFLTQIDSVLVLLRIPAWWNDFIAGLVLLAVLVFDGRLRCALERNLRRQKY
ARFMTPPPSVKPASSGKKREAA
>Q8ZKQ3 ~~~lsrC~~~Autoinducer 2 import system permease protein LsrC~~~
MLKFIQNNREATALLAIVCLFVFPGALDSQYLSVQTLTMVFSSAQILMLLAIGATMVMLTRNIDVSVGSTTGMCAVLLGV
MLNAGYSLPVACLATLILGIVAGFFNGVLVAWLKIPAIVATLGTLGLYRGIMLLWTGGKWIEGLPAGLKQLSAPVFLGIS
AIGWFTLVLALLMAWLLAKTAFGRNFYATGDNLQGARQLGVRTEMVRIMAFSLNGGMAALAGIVFASQIGFIPNQTGTGL
EMKAIAACVLGGISLLGGSGTVIGAILGAYFLTQIDSVLVLLRIPAWWNDFIAGLVLLGVLVFDGRLRCALQRNLRRQKY
ARFISPPTPLQTEAKTHAQQNKNKEVA
>P0AFS1 ~~~lsrD~~~Autoinducer 2 import system permease protein LsrD~~~COG1172
MRIRYGWELALAALLVIEIVAFGAINPRMLDLNMLLFSTSDFICIGIVALPLTMVIVSGGIDISFGSTIGLCAIALGVLF
QSGVPMPLAILLTLLLGALCGLINAGLIIYTKVNPLVITLGTLYLFAGSALLLSGMAGATGYEGIGGFPMAFTDFANLDV
LGLPVPLIIFLICLLVFWLWLHKTHAGRNVFLIGQSPRVALYSAIPVNRTLCALYAMTGLASAVAAVLLVSYFGSARSDL
GASFLMPAITAVVLGGANIYGGSGSIIGTAIAVLLVGYLQQGLQMAGVPNQVSSALSGALLIVVVVGRSVSLHRQQIKEW
LARRANNPLP
>Q8ZKQ2 ~~~lsrD~~~Autoinducer 2 import system permease protein LsrD~~~
MNPWRRYSWEIALAALLIFEILAFGLINPRLLDINVLLFSTSDFICIGIVALPLTMVIVSGGMDISFGSTIGLCAITLGV
LFQLGMPLPLAIIITLLLGAICGLINAGLIIYTGVNPLVITLGTMYLFGGSALLLSGMAGATGYEGIGGFPTAFTDFANI
SFLGIPMPLIFFLVCCLFFWLLMHRTHMGRNVFLIGQSARVAQYSAIPVNRTLYTVYAMTGCASAIAAVLLVSYFGSARS
DLGASFLMPAITAVVLGGANIYGGSGSIMGSALAALLVGFLQQGLQMAGVPNQISSALSGALLIVVVVGRSVSLHRHQIL
EWYSRRRNAHQA
>Q8ZKP8 5.1.3.-~~~lsrE~~~Putative epimerase LsrE~~~
MNSQFAGLTREACVALLASYPLSVGILAGQWIALHRYLQQLEALNQPLLHLDLMDGQFCPQFTVGPWAVGQLPQTFIKDV
HLMVADQWTAAQACVKAGAHCITLQAEGDIHLHHTLSWLGQQTVPVIGGEMPVIRGISLCPATPLDVIIPILSDVEVIQL
LAVNPGYGSKMRSSDLHERVAQLLCLLGDKREGKIIVIDGSLTQDQLPSLIAQGIDRVVSGSALFRDDRLVENTRSWRAM
FKVAGDTTFLPSTA
>P76143 2.3.1.245~~~lsrF~~~3-hydroxy-5-phosphonooxypentane-2,4-dione thiolase~~~COG1830
MADLDDIKDGKDFRTDQPQKNIPFTLKGCGALDWGMQSRLSRIFNPKTGKTVMLAFDHGYFQGPTTGLERIDINIAPLFE
HADVLMCTRGILRSVVPPATNRPVVLRASGANSILAELSNEAVALSMDDAVRLNSCAVAAQVYIGSEYEHQSIKNIIQLV
DAGMKVGMPTMAVTGVGKDMVRDQRYFSLATRIAAEMGAQIIKTYYVEKGFERIVAGCPVPIVIAGGKKLPEREALEMCW
QAIDQGASGVDMGRNIFQSDHPVAMMKAVQAVVHHNETADRAYELYLSEKQ
>Q8ZKQ0 2.3.1.245~~~lsrF~~~3-hydroxy-5-phosphonooxypentane-2,4-dione thiolase~~~
MADLDDIKDGKDFHTDKPQTNTLFALKGCGALDWGMQSRLARIFNPKTRKTVMLAFDHGYFQGPTTGLERIDINIAPLFE
YADVLMCTRGILRSVVPPAINKPVVLRASGANSILTELSNEAVAVAMDDAVRLNSCAAAAQVYIGSEHEHQSIKNIIQLI
DAGLRVGMPIMAVTGVGKDMARDQRYFSLATRIAAEMGAQIIKTYYVDKGFERIAAGCPVPIVIAGGKKLPEREALEMCY
QAIDQGASGVDMGRNIFQSEDPVAMIKAVHAVVHHNETAERAYELFLSEKS
>P64461 5.3.1.32~~~lsrG~~~(4S)-4-hydroxy-5-phosphonooxypentane-2,3-dione isomerase~~~COG1359
MHVTLVEINVHEDKVDEFIEVFRQNHLGSVQEEGNLRFDVLQDPEVNSRFYIYEAYKDEDAVAFHKTTPHYKTCVAKLES
LMTGPRKKRLFNGLMP
>Q8ZKP9 5.3.1.32~~~lsrG~~~(4S)-4-hydroxy-5-phosphonooxypentane-2,3-dione isomerase~~~
MHVTLVEINVHDDKVEQFIDVFRQNHLGSIKEPGNLRFDVLQDPQVLTRFYIYEAYVDEQAVAFHKTTPHYKTCVEQLEP
LMTGPRTKKVFMGLMP
>Q7CG46 5.3.1.32~~~lsrG~~~(4S)-4-hydroxy-5-phosphonooxypentane-2,3-dione isomerase~~~COG1359
MHVTLVEINVKEDKVDQFIEVFRANHLGSIREAGNLRFDVLRDEHIPTRFYIYEAYTDEAAVAIHKTTPHYLQCVEQLAP
LMTGPRKKTVFIGLMP
>P77432 2.7.1.189~~~lsrK~~~Autoinducer-2 kinase~~~COG1070
MARLFTLSESKYYLMALDAGTGSIRAVIFDLEGNQIAVGQAEWRHLAVPDVPGSMEFDLNKNWQLACECMRQALHNAGIA
PEYIAAVSACSMREGIVLYNNEGAPIWACANVDARAAREVSELKELHNNTFENEVYRATGQTLALSAIPRLLWLAHHRSD
IYRQASTITMISDWLAYMLSGELAVDPSNAGTTGLLDLTTRDWKPALLDMAGLRADILSPVKETGTLLGVVSSQAAELCG
LKAGTPVVVGGGDVQLGCLGLGVVRPAQTAVLGGTFWQQVVNLAAPVTDPEMNVRVNPHVIPGMVQAESISFFTGLTMRW
FRDAFCAEEKLIAERLGIDTYTLLEEMASRVPPGSWGVMPIFSDRMRFKTWYHAAPSFINLSIDPDKCNKATLFRALEEN
AAIVSACNLQQIADFSNIHPSSLVFAGGGSKGKLWSQILADVSGLPVNIPVVKEATALGCAIAAGVGAGIFSSMAETGER
LVRWERTHTPDPEKHELYQDSRDKWQAVYQDQLGLVDHGLTTSLWKAPGL
>Q8ZKQ6 2.7.1.189~~~lsrK~~~Autoinducer-2 kinase~~~
MARLCTHTESGHYLMALDAGTGSVRAVIFDLQGKQIAVGQAEWQHLAVPDVPGSMEFDLAKNWQLACQCIRQALQKAAIP
ATAIAAVSACSMREGIVIYDSNGEPIWACANVDARAAHEVSELKELYDNTFEEEVYRCSGQTLALSAIPRLLWLAHHRPD
IYHRASTVTMISDWMAFMLSGELAVDPSNAGTTGLLDLVTRNWKRSLLQMAGLRSDILSPVKETGTLLGHISQKAAEQCD
LQAGTPVIVGGGDVQLGCLGLGVVRPAQTAVLGGTFWQQVVNLPAPVTDPNMNVRINPHVIPGMVQTESISFFTGLTMRW
FRDAFCAEEKLIAERLGIDAYSLLEDMASRVPPGAYGVMPIFSDVMRFKRWYHAAPSFINLSIDPEKCNKATLFRALEEN
AAIVSACNLQQIAAFSGVQADSLVFAGGGSKGKLWSQILADVTGLTVHVPVVKEATALGCAIAAGVGVGVWPSLAETGEK
LVRWDREHKPNPENFAVYQQAREKWQAVYQDQRALVDGGLTTSLWKAPGL
>P76141 ~~~lsrR~~~Transcriptional regulator LsrR~~~COG2390
MTINDSAISEQGMCEEEQVARIAWFYYHDGLTQSEISDRLGLTRLKVSRLLEKGHQSGIIRVQINSRFEGCLEYETQLRR
QFSLQHVRVIPGLADADVGGRLGIGAAHMLMSLLQPQQMLAIGFGEATMNTLQRLSGFISSQQIRLVTLSGGVGSYMTGI
GQLNAACSVNIIPAPLRASSADIARTLKNENCVKDVLLAAQAADVAIVGIGAVSQQDDATIIRSGYISQGEQLMIGRKGA
VGDILGYFFDAKGDVVTNIKIHNELIGLPLSALKTIPVRVGVAGGENKAEAIAAAMKGGYINALVTDQDTAAAILRS
>Q8ZKQ5 ~~~lsrR~~~Transcriptional regulator LsrR~~~
MSDNTLVSDYGMCEEEQVARIAWFYYHDGLTQSEISERLGLTRLKVSRLLEKGHQSGIIRVQINSRFEGCLEYENALRNH
FALQNIRVLPALPDADIGLRLGIGAAHMLMESLRPQQLLAVGFGEATMTTLKRLSGFISAQQIRLVTLSGGVGPYMTGIG
QLDAACSVSIMPAPLRASSQEIACTLRNENSVRDVMLTAQAADAAIVGIGAINQKDQASILKSGYITQGEQLMIGRKGAV
GDILGYFFDAHGEIIPDIKIHNELIGLKLNSLSTIPTVIGVAGGEQKAEAIIAAMRGNYINALVTDQKTAGKIIQIIEK
>P10547 3.4.24.75~~~lss~~~Lysostaphin~~~
MKKTKNNYYTRPLAIGLSTFALASIVYGGIQNETHASEKSNMDVSKKVAEVETSKAPVENTAEVETSKAPVENTAEVETS
KAPVENTAEVETSKAPVENTAEVETSKAPVENTAEVETSKAPVENTAEVETSKAPVENTAEVETSKAPVENTAEVETSKA
PVENTAEVETSKAPVENTAEVETSKAPVENTAEVETSKAPVENTAEVETSKAPVENTAEVETSKAPVENTAEVETSKALV
QNRTALRAATHEHSAQWLNNYKKGYGYGPYPLGINGGMHYGVDFFMNIGTPVKAISSGKIVEAGWSNYGGGNQIGLIEND
GVHRQWYMHLSKYNVKVGDYVKAGQIIGWSGSTGYSTAPHLHFQRMVNSFSNSTAQDPMPFLKSAGYGKAGGTVTPTPNT
GWKTNKYGTLYKSESASFTPNTDIITRTTGPFRSMPQSGVLKAGQTIHYDEVMKQDGHVWVGYTGNSGQRIYLPVRTWNK
STNTLGVLWGTIK
>P72097 2.4.3.6~~~lst~~~N-acetyllactosaminide alpha-2,3-sialyltransferase~~~
MGLKKACLTVLCLIVFCFGIFYTFDRVNQGERNAVSLLKEKLFNEEGEPVNLIFCYTILQMKVAERIMAQHPGERFYVVL
MSENRNEKYDYYFNQIKDKAERAYFFHLPYGLNKSFNFIPTMAELKVKSMLLPKVKRIYLASLEKVSIAAFLSTYPDAEI
KTFDDGTGNLIQSSSYLGDEFSVNGTIKRNFARMMIGDWSIAKTRNASDEHYTIFKGLKNIMDDGRRKMTYLPLFDASEL
KTGDETGGTVRILLGSPDKEMKEISEKAAKNFKIQYVAPHPRQTYGLSGVTTLNSPYVIEDYILREIKKNPHTRYEIYTF
FSGAALTMKDFPNVHVYALKPASLPEDYWLKPVYALFTQSGIPILTFDDKN
>Q9CNC4 2.4.99.-~~~lst~~~CMP-N-acetylneuraminate:beta-galactoside alpha-2,3-sialyltransferase~~~
MNLIICCTPLQVLIAEKIIAKFPHMPFYGVMLSTVSNKKFDFYAKRLAQQCQGFFSMVQHKDRFNLLKEILYLKRTFSGK
HFDQVFVANINDLQIQFLLSAIDFNLLNTFDDGTINIVPNSLFYQDDPATLQRKLINVLLGNKYSIQSLRALSHTHYTIY
KGFKNIIERVEPIELVAADNSEKVTSAVINVLLGQPVFAEDERNIALAERVIKQFNIHYYLPHPREKYRLAQVNYIDTEL
IFEDYILQQCQTHKYCVYTYFSSAIINIMNKSDNIEVVALKIDTENPAYDACYDLFDELGVNVIDIRE
>O07051 4.1.2.49~~~ltaA~~~L-allo-threonine aldolase~~~
MRYIDLRSDTVTQPTDAMRQCMLHAEVGDDVYGEDPGVNALEAYGADLLGKEAALFVPSGTMSNLLAVMSHCQRGEGAVL
GSAAHIYRYEAQGSAVLGSVALQPVPMQADGSLALADVRAAIAPDDVHFTPTRLVCLENTHNGKVLPLPYLREMRELVDE
HGLQLHLDGARLFNAVVASGHTVRELVAPFDSVSICLSKGLGAPVGSLLVGSHAFIARARRLRKMVGGGMRQAGILAQAG
LFALQQHVVRLADDHRRARQLAEGLAALPGIRLDLAQVQTNMVFLQLTSGESAPLLAFMKARGILFSGYGELRLVTHLQI
HDDDIEEVIDAFTEYLGA
>Q2FZP8 ~~~ltaA~~~Proton-coupled antiporter flippase LtaA~~~COG2814
MQDSSLNNYANHKNFILMLIILFLMEFARGMYILSYINFLPTVTSIAVAITSLAFSIHFIADASTNFVIGFLLKKFGTKI
VLTTGFILAFTSLFLVIWFPASPFVIIFSAMMLGIAVSPIWVIMLSSVEEDKRGKQMGYVYFSWLLGLLVGMVFMNLLIK
VHPTRFAFMMSLVVLIAWILYYFVDVKLTNYNTRPVKAQLRQIVDVTKRHLLLFPGILLQGAAIAALVPILPTYATKVIN
VSTIEYTVAIIIGGIGCAVSMLFLSKLIDNRSRNFMYGVILSGFILYMILIFTLSMIVNIHILWIIALAIGLMYGILLPA
WNTFMARFIKSDEQEETWGVFNSIQGFGSMIGPLFGGLITQFTNNLNNTFYFSALIFLVLAVFYGSYFIVNREKAK
>P75823 4.1.2.48~~~ltaE~~~Low specificity L-threonine aldolase~~~COG2008
MIDLRSDTVTRPSRAMLEAMMAAPVGDDVYGDDPTVNALQDYAAELSGKEAAIFLPTGTQANLVALLSHCERGEEYIVGQ
AAHNYLFEAGGAAVLGSIQPQPIDAAADGTLPLDKVAMKIKPDDIHFARTKLLSLENTHNGKVLPREYLKEAWEFTRERN
LALHVDGARIFNAVVAYGCELKEITQYCDSFTICLSKGLGTPVGSLLVGNRDYIKRAIRWRKMTGGGMRQSGILAAAGIY
ALKNNVARLQEDHDNAAWMAEQLREAGADVMRQDTNMLFVRVGEENAAALGEYMKARNVLINASPIVRLVTHLDVSREQL
AEVAAHWRAFLAR
>O50584 4.1.2.48~~~ltaE~~~Low specificity L-threonine aldolase~~~
MTDQSQQFASDNYSGICPEAWAAMEKANHGHERAYGDDQWTARAADHFRKLFETDCEVFFAFNGTAANSLALSSLCQSYH
SVICSETAHVETDECGAPEFFSNGSKLLTARSEGGKLTPASIREVALKRQDIHYPKPRVVTITQATEVGSVYRPDELKAI
SATCKELGLNLHMDGARFSNACAFLGCTPAELTWKAGIDVLCFGGTKNGMAVGEAILFFNRKLAEDFDYRCKQAGQLASK
MRFLSAPWVGLLEDGAWLRHAAHANHCAQLLSSLVADIPGVELMFPVEANGVFLQMSEPALEALRNKGWRFYTFIGSGGA
RFMCSWDTEEARVRELAADIRAVMSA
>Q797B3 ~~~ltaS1~~~Lipoteichoic acid synthase 1~~~COG1368
MKKLFSYKLSFFVLAVILFWAKTYLSYKTEFNLGVKGTTQEILLIFNPFSSAVFFLGLALLAKGRKSAIIMLIIDFLMTF
VLYANILFYRFFDDFLTFPNIKQSGNVGNMGDGIFSIMAGHDIFYFLDIIILIAVLIWRPELKEYKMKKRFASLVILSGI
ALFFINLHYAEKDRPQLLTRTFDRNYIVKYLGLYNYTIYDGVQTAQTETQRAYASSDDLTSVENYTTSHYAKPNAEYFGS
AKGKNIIKIHLESFQSFLIDYKLNGEEVTPFLNKLAHGGEDVTYFDNFFHQTGQGKTSDAELTMDNSIFGLPEGSAFVTK
GENTYQSLPAILDQKEGYTSAVLHGDYKSFWNRDQIYKHIGYDKFFDASTYDMSDENVINMGLKDKPFFTESIPKLESLK
QPFYAHLITLTNHYPFNLDEKDASLKKATTGDNTVDSYFQTARYLDEALEQFFKELKEAGLYDNSVIMIYGDHNGISENH
NRAMKEILGKEITDYQNAQNQRVPLMIRVPGKKGGVNHTYGGEIDVMPTLLHLEGIDSQKYINFGTDLFSKDHDDTVAFR
NGDFVTPKYTSVDNIIYDTKTGEKLKANEETKNLKTRVNQQLSLSDSVLYKDLLRFHKLNDFKAVDPSDYHYGKEKEIK
>O34952 ~~~ltaS2~~~Lipoteichoic acid synthase 2~~~COG1368
MKTFIKERGLAFFLIAVVLLWIKTYVGYVLNFNLGIDNTIQKILLFVNPLSSSLFFLGFGLLFKKKLQQTAIIVIHFLMS
FLLYANIVYYRFFNDFITIPVIMQAKTNGGQLGDSAFSLMRPTDAFYFIDTIILIILAIKVNKPAETSSKKSFRIIFASS
ILVFLINLAVAESDRPELLTRSFDRNYLVKYLGTYNFTIYDAVQNIKSNSQRALADSSDVTEVENYMKANYDVPNNVYFG
KAEGKNVIYVSLESLQSFIIDYKIDGKEVTPFLNKLAHDNETFYFDNFFHQTGQGKTSDAEFMMENSLYPLAQGSVFVNK
AQNTLQSVPAILKSKNYTSATFHGNTQTFWNRNEMYKAEGIDKFFDSAYYDMNEENTKNYGMKDKPFFKESMPLLESLPQ
PFYTKFITLSNHFPFGMDEGDTDFPAGDFGDSVVDNYFQSAHYLDQSIEQFFNDLKKDGLYDKSIIVMYGDHYGISENHN
KAMAKVLGKDEITDYDNAQLQRVPLFIHAAGVKGEKVHKYAGDVDVAPTILHLLGVDTKDYLMSGSDILSKEHREVIPFR
NGDFISPKYTKISGKYYDTKTGKELDESEVDKSEDSLVKKELEMSDKIINGDLLRFYEPKGFKKVNPSDYDYTKHDEDSS
ETSKDNEDK
>Q2G093 ~~~ltaS~~~Lipoteichoic acid synthase~~~COG1368
MSSQKKKISLFAFFLLTVITITLKTYFSYYVDFSLGVKGLVQNLILLMNPYSLVALVLSVFLFFKGKKAFWFMFIGGFLL
TFLLYANVVYFRFFSDFLTFSTLNQVGNVESMGGAVSASFKWYDFVYFIDTLVYLFILIFKTKWLDTKAFSKKFVPVVMA
ASVALFFLNLAFAETDRPELLTRTFDHKYLVKYLGPYNFTVYDGVKTIENNQQKALASEDDLTKVLNYTKQRQTEPNPEY
YGVAKKKNIIKIHLESFQTFLINKKVNGKEVTPFLNKLSSGKEQFTYFPNFFHQTGQGKTSDSEFTMDNSLYGLPQGSAF
SLKGDNTYQSLPAILDQKQGYKSDVMHGDYKTFWNRDQVYKHFGIDKFYDATYYDMSDKNVVNLGLKDKIFFKDSANYQA
KMKSPFYSHLITLTNHYPFTLDEKDATIEKSNTGDATVDGYIQTARYLDEALEEYINDLKKKGLYDNSVIMIYGDHYGIS
ENHNNAMEKLLGEKITPAKFTDLNRTGFWIKIPGKSGGINNEYAGQVDVMPTILHLAGIDTKNYLMFGTDLFSKGHNQVV
PFRNGDFITKDYKYVNGKIYSNKNNELITTQPADFEKNKKQVEKDLEMSDNVLNGDLFRFYKNPDFKKVNPSKYKYETGP
KANSKK
>Q7A6U1 ~~~ltaS~~~Lipoteichoic acid synthase~~~
MSSQKKKISLFAFFLLTVITITLKTYFSYYVDFSLGVKGLVQNLILLMNPYSLVALVLSVFLFFKGKKAFWFMFIGGFLL
TFLLYANVVYFRFFSDFLTFSTLNQVGNVESMGGAVSASFKWYDFVYFIDTLVYLFILIFKTKWLDTKAFSKKFVPVVMA
ASVALFFLNLAFAETDRPELLTRTFDHKYLVKYLGPYNFTVYDGVKTIENNQQKALASEDDLTKVLNYTKQRQTEPNPEY
YGVAKKKNIIKIHLESFQTFLINKKVNGKEVTPFLNKLSSGKEQFTYFPNFFHQTGQGKTSDSEFTMDNSLYGLPQGSAF
SLKGDNTYQSLPAILDQKQGYKSDVMHGDYKTFWNRDQVYKHFGIDKFYDATYYDMSDKNVVNLGLKDKIFFKDSANYQA
KMKSPFYSHLITLTNHYPFTLDEKDATIEKSNTGDATVDGYIQTARYLDEALEEYINDLKKKGLYDNSVIMIYGDHYGIS
ENHNNAMEKLLGEKITPAKFTDLNRTGFWIKIPGKSGGINNEYAGQVDVMPTILHLAGIDTKNYLMFGTDLFSKGHNQVV
PFRNGDFITKDYKYVNGKIYSNKNNELITTQPADFEKNKKQVEKDLEMSDNVLNGDLFRFYKNPDFKKVNPSKYKYETGP
KANSKK
>Q7A1I3 ~~~ltaS~~~Lipoteichoic acid synthase~~~
MSSQKKKISLFAFFLLTVITITLKTYFSYYVDFSLGVKGLVQNLILLMNPYSLVALVLSVFLFFKGKKAFWFMFIGGFLL
TFLLYANVVYFRFFSDFLTFSTLNQVGNVESMGGAVSASFKWYDFVYFIDTLVYLFILIFKTKWLDTKAFSKKFVPVVMA
ASVALFFLNLAFAETDRPELLTRTFDHKYLVKYLGPYNFTVYDGVKTIENNQQKALASEDDLTKVLNYTKQRQTEPNPEY
YGVAKKKNIIKIHLESFQTFLINKKVNGKEVTPFLNKLSSGKEQFTYFPNFFHQTGQGKTSDSEFTMDNSLYGLPQGSAF
SLKGDNTYQSLPAILDQKQGYKSDVMHGDYKTFWNRDQVYKHFGIDKFYDATYYDMSDKNVVNLGLKDKIFFKDSANYQA
KMKSPFYSHLITLTNHYPFTLDEKDATIEKSNTGDATVDGYIQTARYLDEALEEYINDLKKKGLYDNSVIMIYGDHYGIS
ENHNNAMEKLLGEKITPAKFTDLNRTGFWIKIPGKSGGINNEYAGQVDVMPTILHLAGIDTKNYLMFGTDLFSKGHNQVV
PFRNGDFITKDYKYVNGKIYSNKNNELITTQPADFEKNKKQVEKDLEMSDNVLNGDLFRFYKNPDFKKVNPSKYKYETGP
KANSKK
>A4F2N8 4.3.1.16~~~thadh~~~L-threo-3-hydroxyaspartate ammonia-lyase~~~
MQLSSYHDVIKAAERLEGFANRTPVFTSRTLDAETGAQVFIKCENLQRTGSFKFRGAFNALSRFDEAQRKAGVVAFSSGN
HAQGIALAARLLQMPATIVMPTDAPAAKVAATREYGATVVFYDRITEDREQIGRTLAEQHGMTLIPSYDHPDVLAGQGTA
AKELLEFTGPLDALFVGLGGGGMLSGTALATRALSPDCLLYGVEPEAGNDGQRSFQTGSIVHIDTPATIADGAQTQHLGN
HTFPIIRENVNDILTVSDAELVESMRFFMQRMKMVVEPTGCLGLAALRNLKQQFRGQRVGIIVTGGNVDIEKYASLLKG
>Q0KBC7 1.1.1.411~~~ltnD~~~L-threonate dehydrogenase~~~COG2084
MSRNIGVIGLGAMGFGVAQSLLRAGFNVHACDLRPEVLQRFADAGGVPCASPAELGSRCDVVLTLVVNAQQTEAVLFGAN
GAAAAMQPGKLVIASATVPPGFAEALGRRLAEQGLLMLDAPVSGGAARAASGEMTMMTSGPAEAYSLAEDVLAAIAGKVY
RLGAAHGAGSKVKIINQLLAGVHIAAAAEAMALGLREGVDPDALYDVITHSAGNSWMFENRVPHILKGDYTPLSAVDIFV
KDLGMVLDTARHSKFPLPLSAAAHQMFMMASTAGHGGEDDSAVIKIFPGIELPGKAE
>A0A0H2VA68 1.1.1.411~~~ltnD~~~L-threonate dehydrogenase~~~COG2084
MKTGSEFHVGIVGLGSMGMGAALSCVRAGLSTWGADLNSNACATLKEAGACGVSDNAATFAEKLDALLVLVVNATQVKQV
LFGEKGVAQHLKPGTAVMVSSTIASADAQEIATALAGFGLEMLDAPVSGGAVKAANGEMTVMASGSDIAFERLAPVLEAV
AGKVYRIGSEPGLGSTVKIIHQLLAGVHIAAGAEAMALAARAGIPLDVMYDVVTNAAGNSWMFENRMRHVVDGDYTPHSA
VDIFVKDLGLVADTAKALHFPLPLASTALNMFTSASNAGYGKEDDSAVIKIFSGITLPGAKS
>P44979 1.1.1.411~~~ltnD~~~L-threonate dehydrogenase~~~COG2084
MENQNYSVAVIGLGSMGMGAAVSCINAGLTTYGIDLNPVALEKLKAAGAKAVAANGYDFAHELDAVVILVVNAAQANAVL
FGENGIAKKLKAGTAVMVSSTMAAQDAQIISQKLTELGLIMLDAPVSGGAAKALKGEMTVMASGSKQAFELLQPVLDATA
AKVYNIGEEIGLGATVKIVHQLLAGVHIAAGAEAMALASKAGIPLDVMYDVVTNAAGNSWMFENRMKHVVEGDYTPLSMV
DIFVKDLGLVNDTAKSLHFPLHLASTAYSMFTEASNAGYGKEDDSAVIKIFSGVSLPKKGA
>Q6CZ26 1.1.1.411~~~ltnD~~~L-threonate dehydrogenase~~~COG2084
MKKTSDYAVAVIGLGSMGFGAAASCINAGLTTYGVDINPQALEKLRQAGAAQADTRIDAFADKLDAVVLLVVNATQVNGI
LFGEPQVAAKLKPGTVVMVSSTISAQDAKNIEQRLAEHQLVMLDAPVSGGAAKAAAGDMTVMASGSDLAFEKLKPVLDAV
AGKVYRIGEEIGLGATVKIIHQLLAGVHIAAGAEAMALAARADIPLDIMYDVVTNAAGNSWMFENRMRHVVDGDYTPKSA
VDIFVKDLGLVTDTAKSLHFPLPLASTAFNMFTAASNAGFGKEDDSAVIKIFNGITLPEKKEAP
>I6Y3T7 4.1.3.-~~~ltp2~~~17-hydroxy-3-oxo-4-pregnene-20-carboxyl-CoA lyase~~~COG0183
MLSGQAAIVGIGATDFSKNSGRSELRLAAEAVLDALADAGLSPTDVDGLTTFTMDTNTEIAVARAAGIGELTFFSKIHYG
GGAACATVQHAAMAVATGVADVVVAYRAFNERSGMRFGQVQTRLTENADSTGVDNSFSYPHGLSTPAAQVAMIARRYMHL
SGATSRDFGAVSVADRKHAANNPKAYFYGKPITIEDHQNSRWIAEPLRLLDCCQETDGAVAIVVTSAARARDLKQRPVVI
EAAAQGCSPDQYTMVSYYRPELDGLPEMGLVGRQLWAQSGLTPADVQTAVLYDHFTPFTLIQLEELGFCGKGEAKDFIAD
GAIEVGGRLPINTHGGQLGEAYIHGMNGIAEGVRQLRGTSVNPVAGVEHVLVTAGTGVPTSGLILG
>D1AB74 4.1.3.-~~~ltp2~~~Steroid side-chain-cleaving aldolase~~~COG0183
MSVLPGAAAIAGIGATEFSKNSGRSELQLACEAVLAAIADAGLEPSDVDGLVTFTADTSSEIHVARNTGIGELKFFSRVG
YGGGAACGTVQQAAMAVATGIAEVVVCYRAFNERSGVRYGLGQAGRQMDQGADSAAYAWLLPFGLNTPAQWVAMFARRYM
HEYGATSEDFGRVAVVDRKHAATNPKAWFYQRPITLEDHQNSRWIVEPLHLLDCCQESDGGQALVVVSTERARDLPHPPA
LIWGAAQGSGYDQHMMTSYYRSEITGIPEMGLVGQQLYAQSGLNPSDIGAAILYDHFTPLVLPQLEELGFCARGEAKDFI
ADGNLEIGGRLPCNTHGGQLGEAYIHGMNGIAEAVRLVRGTSVNQPGDVTNVLVTAGTGVPTSGLILGADR
>P0A3U0 ~~~ltrA~~~Group II intron-encoded protein LtrA~~~
MKPTMAILERISKNSQENIDEVFTRLYRYLLRPDIYYVAYQNLYSNKGASTKGILDDTADGFSEEKIKKIIQSLKDGTYY
PQPVRRMYIAKKNSKKMRPLGIPTFTDKLIQEAVRIILESIYEPVFEDVSHGFRPQRSCHTALKTIKREFGGARWFVEGD
IKGCFDNIDHVTLIGLINLKIKDMKMSQLIYKFLKAGYLENWQYHKTYSGTPQGGILSPLLANIYLHELDKFVLQLKMKF
DRESPERITPEYRELHNEIKRISHRLKKLEGEEKAKVLLEYQEKRKRLPTLPCTSQTNKVLKYVRYADDFIISVKGSKED
CQWIKEQLKLFIHNKLKMELSEEKTLITHSSQPARFLGYDIRVRRSGTIKRSGKVKKRTLNGSVELLIPLQDKIRQFIFD
KKIAIQKKDSSWFPVHRKYLIRSTDLEIITIYNSELRGICNYYGLASNFNQLNYFAYLMEYSCLKTIASKHKGTLSKTIS
MFKDGSGSWGIPYEIKQGKQRRYFANFSECKSPYQFTDEISQAPVLYGYARNTLENRLKAKCCELCGTSDENTSYEIHHV
NKVKNLKGKEKWEMAMIAKQRKTLVVCFHCHRHVIHKHK
>P0A3U1 ~~~ltrA~~~Group II intron-encoded protein LtrA~~~COG3344
MKPTMAILERISKNSQENIDEVFTRLYRYLLRPDIYYVAYQNLYSNKGASTKGILDDTADGFSEEKIKKIIQSLKDGTYY
PQPVRRMYIAKKNSKKMRPLGIPTFTDKLIQEAVRIILESIYEPVFEDVSHGFRPQRSCHTALKTIKREFGGARWFVEGD
IKGCFDNIDHVTLIGLINLKIKDMKMSQLIYKFLKAGYLENWQYHKTYSGTPQGGILSPLLANIYLHELDKFVLQLKMKF
DRESPERITPEYRELHNEIKRISHRLKKLEGEEKAKVLLEYQEKRKRLPTLPCTSQTNKVLKYVRYADDFIISVKGSKED
CQWIKEQLKLFIHNKLKMELSEEKTLITHSSQPARFLGYDIRVRRSGTIKRSGKVKKRTLNGSVELLIPLQDKIRQFIFD
KKIAIQKKDSSWFPVHRKYLIRSTDLEIITIYNSELRGICNYYGLASNFNQLNYFAYLMEYSCLKTIASKHKGTLSKTIS
MFKDGSGSWGIPYEIKQGKQRRYFANFSECKSPYQFTDEISQAPVLYGYARNTLENRLKAKCCELCGTSDENTSYEIHHV
NKVKNLKGKEKWEMAMIAKQRKTLVVCFHCHRHVIHKHK
>P16462 ~~~ltxA~~~Leukotoxin~~~COG2931
MATTTLPNTKQQAAQFANSVADRAKENIDAAKEQLQKALDKLGKTGKKLTLYIPKNYKKGNGLTALIKAAQKLGIEVYHE
GKDGPALTNGILNTGKKLLGLTERGLTLFAPELDKWIQGNKHLSNSVGSTGNLTKAIDKVQSVLGTLQAFLNTAFSGMDL
DALIKARQNGKNVTDVQLAKASLNLINELIGTISSITNNVDTFSKQLNKLGEALGQVKHFGSFGDKLKNLPKLGNLGKGL
GALSGVLSAISAALLLANKDADTATKAAAAAELTNKVLGNIGKAITQYLIAQRAAAGLSTTGPVAGLIASVVSLAISPLS
FLGIAKQFDRARMLEEYSKRFKKFGYNGDSLLGQFYKNTGIADAAITTINTVLSAIAAGVGAASAGSLVGAPIGLLVSAI
TSLISGILDASKQAVFEHIANQLADKIKAWENKYGKNYFENGYDARHSAFLEDSLKLFNELREKYKTENILSITQQGWDQ
RIGELAGITRNGDRIQSGKAYVDYLKKGEELAKHSDKFTKQILDPIKGNIDLSGIKGSTTLTFLNPLLTAGKEERKTRQS
GKYEFITELKVKGRTDWKVKGVPNSNGVYDFSNLIQHAVTRDNKVLEARLIANLGAKDDYVFVGSGSTIVNAGDGYDVVD
YSKGRTGALTIDGRNATKAGQYKVERDLSGTQVLQETVSKQETKRGKVTDLLEYRNYKLDYYYTNKGFKAHDELNSVEEI
IGSTLRDKFYGSKFNDVFHGHDGDDLIYGYDGDDRLYGDNGNDEIHGGQGNDKLYGGAGNDRLFGEYGNNYLDGGEGDDH
LEGGNGSDILRGGSGNDKLFGNQGDDLLDGGEGDDQLAGGEGNDIYVYRKEYGHHTITEHSGDKDKLSLANINLKDVSFE
RNGNDLLLKTNNRTAVTFKGWFSKPNSSAGLDEYQRKLLEYAPEKDRARLKRQFELQRGKVDKSLNNKVEEIIGKDGERI
TSQDIDNLFDKSGNKKTISPQELAGLIKNKGKSSSLMSSSRSSSMLTQKSGLSNDISRIISATSGFGSSGKALSASPLQT
NNNFNSYANSLATTA
>P23702 7.4.2.5~~~ltxB~~~Leukotoxin export ATP-binding protein LtxB~~~COG2274
MDSQKNTNLALQALEVLAQYHNISINPEEIKHKFDIDGHGLNQTKWLLAAKSLGLKVRTANKTVDRLPFLHLPALAWRDD
GEHFILLKIDQETDRYLIFDLIQKNPIVLDKNEFEERYQSKVILIASRASIVGNLAKFDFTWFIPAVIKYRKIFIETLIV
SIFLQIFALITPLFFQVVMDKVLVHRGFSTLNVITVALAIVVLFEIILGGLRTYVFAHSTSRIDVELGARLFRHLLALPI
SYFEARRVGDTVARVRELDQIRNFLTGQALTSILDLLFSFIFFAVMWYYSPKLTLVVLGSLPCYVIWSVFISPILRRRLD
DKFARNADNQSFLVESVTAINTIKAMAISPQMTNIWDKQLASYVAVSFKVTVLATIGQQGIQLIQKAVMVINLWLGAHLV
ISGDLSIGQLIAFNMLAGQIISPVIRLAQIWQDFQQVGISVTRLGDVLNSPTENNTASVSLPEIQGEISFRNIKFRYKPD
SPMILNNINLDISQGEVIGIVGRSGSGKSTLTKLIQRFYIPEQGQVLIDGHDLALADPNWLRRQVGVVLQDNVLLNRSIR
ENIALTNPGMPMEKVIAAAKLAGAHDFISELREGYNTVVGEQGAGLSGGQRQRIAIARALVNNPRILIFDEATSALDYES
ENIIMHNMHKICQNRTVLIIAHRLSTVKNADRIIVMDKGEIIEQGKHQELLKDEKGLYSYLHQLQVN
>P16461 2.3.1.-~~~ltxC~~~Leukotoxin-activating lysine-acyltransferase LtxC~~~COG2994
MEKNNNFEMLGYVAWLWANSPLHRNWSLSLLAINVLPAIQYGQYTLLMRDGVPIAFCSWANLSLENEIKYLEDVSSLVYD
DWNSGDRKWFIDWIAPFGHNYVLYKHMRKSFPYDLFRSIRVYKGSSEGKITEFHGGKVDKQLANKIFQQYHFELINELKN
KSEVISIN
>P18790 ~~~ltxD~~~Leukotoxin export protein LtxD~~~COG0845
MKTWLLALYDVLSRYKNVWNETWKIRKQLDSPVREKDENEFLPAHLELIETPVSNAPRFVSYSIMLFLTLAIIVSIFSNV
EIIATASGKFALSGRSKEIKPIENSLVKHIFVKEGEYVKKGELLLKLTALGAEADTLKTKTSLSQAKLEEFRYKSLLEAV
EKDQLPILDFSKIDLPFMTENDQKRVTLLIEEQFSTWQKQRHQKTLNLNKKEAEKLSYLARIKKYEGLINTEQVRLDDFR
ALYKEHAIAKHTVLDEENKYQDAINELEVYKASLMQVENEVLLAKEEQELVTQLFKNDILDKLKQATDNVNLLTFELDKN
NQRQQVSEIRAPVSGTVQQLKVHTIDGVVTTAETLMVVVPEEDSLEVTALIQNKDIGFVKEGQEVVIKVEAFPYTRYGYL
TGKVKNITLDAIEHPKLGLVFNTIIELDKKTLSTEEKEIPLSAGMEITAEIKTGMRSVISYLLSPLEESIDKSLRER
>Q5X159 2.3.2.27~~~lubX~~~E3 ubiquitin-protein ligase LubX~~~
MATRNPFDIDHKSKYLREAALEANLSHPETTPTMLTCPIDSGFLKDPVITPEGFVYNKSSILKWLETKKEDPQSRKPLTA
KDLQPFPELLIIVNRFVETQTNYEKLKNRLVQNARVAARQKEYTEIPDIFLCPISKTLIKTPVITAQGKVYDQEALSNFL
IATGNKDETGKKLSIDDVVVFDELYQQIKVYNFYRKREVQKNQIQPSVSNGFGFFSLNFLTSWLWGTEEKKEKTSSDMTY
>Q5ZRQ0 2.3.2.27~~~lubX~~~E3 ubiquitin-protein ligase LubX~~~COG5113
MGYRIEMATRNPFDIDHKSKYLREAALEANLSHPETTPTMLTCPIDSGFLKDPVITPEGFVYNKSSILKWLETKKEDPQS
RKPLTAKDLQPFPELLIIVNRFVETQTNYEKLKNRLVQNARVAARQKEYTEIPDIFLCPISKTLIKTPVITAQGKVYDQE
ALSNFLIATGNKDETGKKLSIDDVVVFDELYQQIKVYNFYRKREMQKNQIQPSVSSGFGFFSLNFLTSWLWGTEEKKEKT
SSDMTY
>O69690 ~~~lucA~~~Lipid uptake coordinator A~~~
MGRKVAVLWHASFSIGAGVLYFYFVLPRWPELMGDTGHSLGTGLRIATGALVGLAALPVVFTLLRTRKPELGTPQLALSM
RIWSIMAHVLAGALIVGTAISEVWLSLDAAGQWLFGIYGAAAAIAVLGFFGFYLSFVAELPPPPPKPLKPKKPKQRRLRR
KKTAKGDEAEPEAAEEAENTELAAQEDEEAVEAPPESIESPGGEPESATREAPAAETATAEEPRGGLRNRRPTGKTSHRR
RRTRSGVQVAKVDE
>Q934G0 1.17.2.2~~~luh~~~Lupanine 17-hydroxylase [cytochrome c]~~~
MSANKNIWIIRLGVAFVCVAIGAAQANEKDGSAVTSGNWSLLGGGNEQHYFSALKDVNKSNVKNLGLSWFTDMEAGDGLV
GNPLVADGVIYQGGPPGKIYANDLKTGKNLWTYTPEVQYDKDTSWTGFWGTHVNRGLAVDDDNVYIGSYCKLLAVSRTTH
KLTWSSQSCDPKKMQAITGAPRVGGGKVFIGNASGDFGGDRGHLDAFDAKTGKHLWRFYTMPGDPSKPFENDLLAKASKT
WGTDYWKYTKGGVSPWDAITYDEASDTLYFGTDGPSPWSPAQRAPDAGDELFSHSIIAVDASTGAYKWHFQTVQNDGSNM
SATMHIMLADLPVEGVSKRVVMTAPKNGYFYVLDASTGKFISADHYVPVNWTKGLDPKTGRPIPSNEANYWERPGEMTIP
LPGDVGGHNWEAMAYNPELRTVYIPSTLVPVTVVASKDTGELDLDYYYGMRPDATIKTQGDLVAWDPLLQKEKWRAKRSL
PVNGGVLATAGGLVFQGTGDGHFEAFDANTGEKLWSFHVGGSILAAPTTVEVDGDQYLIVASGNGGASGMRGIPRLMNNL
QSQGPARLLAFRLGGKTELPITSTPDFPKPQYPKPTSAMAESGRHIFNANACGACHGFNAEGSTPGLPDLRRSDKLDLAV
MKSIVIDGAFKPLGMPGHPHISDADLQALQAFILQKAWTAYDTQQTLKTSDTGAQ
>Q2FXB1 ~~~lukDv~~~Leucotoxin LukDv~~~
MKMKKLVKSSVASSIALLLLSNTVDAAQHITPVSEKKVDDKITLYKTTATSDNDKLNISQILTFNFIKDKSYDKDTLVLK
AAGNINSGYKKPNPKDYNYSQFYWGGKYNVSVSSESNDAVNVVDYAPKNQNEEFQVQQTLGYSYGGDINISNGLSGGLNG
SKSFSETINYKQESYRTTIDRKTNHKSIGWGVEAHKIMNNGWGPYGRDSYDPTYGNELFLGGRQSSSNAGQNFLPTHQMP
LLARGNFNPEFISVLSHKQNDTKKSKIKVTYQREMDRYTNQWNRLHWVGNNYKNQNTVTFTSTYEVDWQNHTVKLIGTDS
KETNPGV
>O54082 ~~~lukD~~~Leucotoxin LukD~~~
MKIEKLGKSSVASSIALLLLSNTVDAAQNITPKREKKVDDKITLYKTTATSDNDKLNIFQILTFNFIKDKSYDKDTLVLK
AAGNINSGYKNSNPKDYNYSQFYWGGKYNVSVSSESNDAVNVVDYAPKNQNEEFQVQQTLGYSYGGDINISNGLSGGLNG
SKSFSETINYKQESYRTTIDRKTNHKSIGWGVEAHKIMNNGWGPYGRDSYDPTYGNELFLGGDKSSSNAGQNFLPTHQIP
LLARGNFNPEFISVLSHKLFDTKKSKIKVTYQREMDRYTNQWNRSHWVGNNYKNQNTVTFTSTYEVDWQNILLKLIGTDS
KETNPGV
>Q2FXB0 ~~~lukEv~~~Leucotoxin LukEv~~~
MLAATLSVGLIAPLASPIQESRANTNIENIGDGAEVIKRTEDVSSKKWGVTQNVQFDFVKDKKYNKDALIVKMQGFINSR
TSFSDVKGSGYELTKRMIWPFQYNIGLTTKDPNVSLINYLPKNKIETTDVGQTLGYNIGGNFQSAPSIGGNGSFNYSKTI
SYTQKSYVSEVDKQNSKSVKWGVKANEFVTPDGKKSAHDRYLFVQSPNGPTGSAREYFAPDNQLPPLVQSGFNPSFITTL
SHEKGSSDTSEFEISYGRNLDITYATLFPRTGIYAERKHNAFVNRNFVVRYEVNWKTHEIKVKGHN
>O54081 ~~~lukE~~~Leucotoxin LukE~~~
MFKKKMLAASLSVGLIAPLASPIQESRANTNIENIGDGAEVIKRTEDVSSKKWGVTQNVQFDFVKDKKYNKDALIVKMQG
FINSRTSFSDVKGRGYELTKRLIWPFQYNIGLTTKDPNVSLINSITLPKTKIETTDVGQTLGYNIGGNFQSAPSIGGNGS
FNYSKTISYTQKSYVSEVDKQNSKSVKWGVKANKFVTPDGKKFAHDRYLFVQSPNGPTGSAREYFAPDNQLPPLVQSGFN
PSFITTLSHEKGSKLIRVNLKFSYGRNLDITYATLFPRTGIYAERKHNAFVNRNFVVRYKVNWKTHEIKVKGHN
>P31715 ~~~lukF~~~Leukocidin-F subunit~~~
MKMNKLVKSSVATSMALLLLSGTANAEGKITPVSVKKVDDKVTLYKTTATADSDKFKISQILTFNFIKDKSYDKDTLVLK
ATGNINSGFVKPNPNDYDFSKLYWGAKYNVSISSQSNDSVNAVDYAPKNQNEEFQVQNTLGYTFGGDISISNGLSGGLNG
NTAFSETINYKQESYRTLSRNTNYKNVGWGVEAHKIMNGWGPYGRDSFHPTYGNELFLAGRQSSAYAGQNFIAQHQMPLL
SRSNFNPEFLSVLSHRQDRAKKSKITVTYQREMDLYQIRWNGFYWAGANYKNFKTRTFKSTYEIDWENHKVKLLDTKETE
NNK
>Q2FWP0 ~~~~~~Uncharacterized leukocidin-like protein 1~~~
MIKQLCKNITICTLALSTTFTVLPATSFAKINSEIKQVSEKNLDGDTKMYTRTATTSDSQKNITQSLQFNFLTEPNYDKE
TVFIKAKGTIGSGLRILDPNGYWNSTLRWPGSYSVSIQNVDDNNNTNVTDFAPKNQDESREVKYTYGYKTGGDFSINRGG
LTGNITKESNYSETISYQQPSYRTLLDQSTSHKGVGWKVEAHLINNMGHDHTRQLTNDSDNRTKSEIFSLTRNGNLWAKD
NFTPKDKMPVTVSEGFNPEFLAVMSHDKKDKGKSQFVVHYKRSMDEFKIDWNRHGFWGYWSGENHVDKKEEKLSALYEVD
WKTHNVKFVKVLNDNEKK
>P21224 ~~~~~~Uncharacterized leukocidin-like protein 1~~~
MIKQLCKNITICTLALSTTFTVLPATSFAKINSEIKQVSEKNLDGDTKMYTRTATTSDSQKNITQSLQFNFLTEPNYDKE
TVFIKAKGTIGSGLRILDPNGYWNSTLRWPGSYSVSIQNVDDNNNTNVTDFAPKNQDESREVKYTYGYKTGGDFSINRGG
LTGNITKESNYSETISYQQPSYRTLLDQSTSHKGVGWKVEAHLINNMGHDHTRQLTNDSDNRTKSEIFSLTRNGNLWAKD
NFTPKDKMPVTVSEGFNPEFLAVMSHDKKDKGKSQFVVHYKRSMDEFKIDWNRHGFWGYWSGENHVDKKEEKLSALYEVD
WKTHNVKFVKVLNDNEKK
>Q7A4L0 ~~~~~~Uncharacterized leukocidin-like protein 1~~~
MIKQLYKNITICSLAISTALTVFPATSYAKINSEIKAVSEKNLDGDTKMYTRTATTSDSQKNITQSLQFNFLTEPNYDKE
TVFIKAKGTIGSGLRILDPNGYWNSTLRWPGSYSVSIQNVDDNNNTNVTDFAPKNQDESREVKYTYGYKTGGDFSINRGG
LTGNITKESNYSETISYQQPSYRTLLDQSTSHKGVGWKVEAHLINNMGHDHTRQLTNDSDNRTKSEIFSLTRNGNLWAKD
NFTPKDKMPVTVSEGFNPEFLAVMSHDKKDKGKSQFVVHYKRSMDEFKIDWNRHGFWGYWSGENHVDKKEEKLSALYEVD
WKTHDVKFVKVLNDNEKK
>Q2FWN9 ~~~~~~Uncharacterized leukocidin-like protein 2~~~
MKNKKRVLIASSLSCAILLLSAATTQANSAHKDSQDQNKKEHVDKSQQKDKRNVTNKDKNSTAPDDIGKNGKITKRTETV
YDEKTNILQNLQFDFIDDPTYDKNVLLVKKQGSIHSNLKFESHKEEKNSNWLKYPSEYHVDFQVKRNRKTEILDQLPKNK
ISTAKVDSTFSYSSGGKFDSTKGIGRTSSNSYSKTISYNQQNYDTIASGKNNNWHVHWSVIANDLKYGGEVKNRNDELLF
YRNTRIATVENPELSFASKYRYPALVRSGFNPEFLTYLSNEKSNEKTQFEVTYTRNQDILKNRPGIHYAPPILEKNKDGQ
RLIVTYEVDWKNKTVKVVDKYSDDNKPYKEG
>Q5HEH9 ~~~~~~Uncharacterized leukocidin-like protein 2~~~
MKNKKRVLIASSLSCAILLLSAATTQANSAHKDSQDQNKKEHVDKSQQKDKRNVTNKDKNSTAPDDIGKNGKITKRTETV
YDEKTNILQNLQFDFIDDPTYDKNVLLVKKQGSIHSNLKFESHKEEKNSNWLKYPSEYHVDFQVKRNRKTEILDQLPKNK
ISTAKVDSTFSYSSGGKFDSTKGIGRTSSNSYSKTISYNQQNYDTIASGKNNNWHVHWSVIANDLKYGGEVKNRNDELLF
YRNTRIATVENPELSFASKYRYPALVRSGFNPEFLTYLSNEKSNEKTQFEVTYTRNQDILKNRPGIHYAPPILEKNKDGQ
RLIVTYEVDWKNKTVKVVDKYSDDNKPYKEG
>Q99SN7 ~~~~~~Uncharacterized leukocidin-like protein 2~~~
MKNKKRVLIASSLSCAILLLSAATTQANSAHKDSQDQNKKEHVDKSQQKDKRNVTNKDKNSTVPDDIGKNGKITKRTETV
YDEKTNILQNLQFDFIDDPTYDKNVLLVKKQGSIHSNLKFESHKEEKNSNWLKYPSEYHVDFQVKRNRKTEILDQLPKNK
ISTAKVDSTFSYSSGGKFDSTKGIGRTSSNSYSKTISYNQQNYDTIASGKNNNWHVHWSVIANDLKYGGEVKNRNDELLF
YRNTRIATVENPELSFASKYRYPALVRSGFNPEFLTYLSNEKSNEKTQFEVTYTRNQDILKNRPGIHYAPPILEKNKDGQ
RLIVTYEVDWKNKTVKVVDKYSDDNKPYKEG
>P31716 ~~~lukS~~~Leukocidin-S subunit~~~
MLKNKILATTLSVSLLAPLANPLLENAKAANDTEDIGKGSDIEIIKRTEDKTSNKWGVTQNIQFDFVKDTKYNKDALILK
MQGFISSRTTYYNYKKTNHVKAMRWPFQYNIGLKTNDKYVSLINYLPKNKIESTNVSQTLGYNIGGNFQSAPSLGGNGSF
NYSKSISYTQQNYVSEVEQQNSKSVLWGVKANSFATESGQKSAFDSDLFVGYKPHSKDPRDYFVPDSELPPLVQSGFNPS
FIATVSHEKGSSDTSEFEITYGRNMDVTHAIKRSTHYGNSYLDGHRVHNAFVNRNYTVKYEVNWKTHEIKVKGQN
>O07020 ~~~lutA~~~Lactate utilization protein A~~~COG0247
MKVSLFVTCLVDMFQTNVGKATVELLERLGCEVDFPEGQICCGQPAYNSGYVHDAKKAMKRMIETFQDSEYVVSPSGSCT
TMFREYPHLFQDDPKWADKAKKLADKTYELTDFIVNVLGVEDVGATLHTKATLHTSCHMTRLLGVRKEPMKLLSHVKGLQ
FTELPGKHNCCGFGGTFSVKMAQISEQMVDEKVECVEETGAEVLIGADCGCLMNIGGRLGRKDKNVKVMHIAEVLNSR
>O07021 ~~~lutB~~~Lactate utilization protein B~~~COG1139
MAMKIGTDAFKERVSQGIDNEFMRGAVSGAQERLRTRRLEAAEELGNWEEWRSLSEEIRQHVLENLDFYLGQLAENVAKR
GGHVYFAKTAEEASSYIRDVIQKKNGKKIVKSKSMVTEEINLNEVLEKEGCEVVETDLGEYILQIDDHDPPSHIVAPALH
KNKEQIRDVFKERLDYQHTEKPEELVMHARAILRKKFLEADIGITGCNFAIADTGSVSLVTNEGNGRLVSTLPKTQITVM
GMERIVPSFSEFEVLVSMLTRSAVGQRLTSYITALTGPKLEGEVDGPEEFHLVIVDNGRSNILGTEFQSVLQCIRCAACI
NVCPVYRHVGGHSYGSIYSGPIGAVLSPLLGGYDDYKELPYASSLCAACSEACPVKIPLHELLLKHRQNIVEKEGRAPIS
EKLAMKAFGLGASSLSLYKMGSKWAPAAMTPFTEDEKISKGPGPLKNWTQIRDFPAPHKSRFRDWFADRETSERTKEEQ
>O32259 ~~~lutC~~~Lactate utilization protein C~~~COG1556
MTKGTIQNQESFLNRIASSLGRERRTGGVAVPEWAHQPQYKTLEGYSADDLVTVLKNHCVKIHTELIETDSTGLYDALRE
QVSRFSGGPVIIPKDPRFEEYGLKSLLTKDWPSEGTPVWEWDADKGEENIKKAEQANVGITFSEITLAESGTVVLFSSKD
IGRSVSLLPTTYIAIVPKSSIVPRMTQASDIIRQNIANGVTVPSCINYITGPSNSADIEMDLVVGVHGPVKAAYILVSDR
>Q9RT57 ~~~lutC~~~Lactate utilization protein C~~~COG1556
MTTIPSTAEAKLEMLTTINRAIAGSRPEALPPYPVPAPLSRAEILHQFEDRILDYGAAYTHVSAAELPGAIAKALGNARR
VIVPAGIPAPWLTVGMDVLRDEPPLSHAELDRADAVLTGCAVAISETGTIILDHRADQGRRALSLIPDFHICVVREDQIV
QTVREGVEAVAASVREGRPLTWLSGGSATSDIELVRVEGVHGPRRLQVIVVG
>P71067 ~~~lutP~~~L-lactate permease~~~COG1620
MQWTQAYTPIGGNLLLSALAALVPIIFFFWALAIKRMKGYTAGLATLGIALIIAVLVYRMPAEKALMSATQGAVYGLLPI
GWIIVTSVFLYKITVKTGQFDIIRSSVLSITDDRRLQALLIAFSFGAFLEGAAGFGAPVAISAALLVGLGFNPLYAAGIC
LIANTAPVAFGAIGIPITAVEGPTGIPAMEISQMVGRQLPFLSVFIPLYLIIIMSGFRKALEIWPAILVSGVSFAVVQYL
SSNFLGPELPDVLSALVSMAALAVFLKWWKPKTTFRFAGEQESAASIETARTNPAAPAYRGGQIFKAWSPFLLLTAMISV
WGIPSVKSALTGHYEGSAVFLKWLNAVGEKLTFSPGVPFLNNQIVNADGTPIEAVYKLEVLGSAGTAILIAAVLSKFITA
ISWKDWGTVFKETVQELKLPILTIASVVGFAYVTNSSGMSTTLGMTLALTGSMFTFFSPVLGWLGVFITGSDTSANLLFG
NLQKVTALSVGMDPVLSVAANSSGGVTGKMISPQSIAVACAAVGLAGKESDLFRFTIKHSLFLLLLVCIITFLQHHVFSW
MIP
>O07007 ~~~lutR~~~HTH-type transcriptional regulator LutR~~~COG2186
MIKNGELKPGDKLDSVQALAESFQVSRSAVREALSALKAMGLVEMKQGEGTYLKEFELNQISQPLSAALLMKKEDVKQLL
EVRKLLEIGVASLAAEKRTEADLERIQDALKEMGSIEADGELGEKADFAFHLALADASQNELLKHLMNHVSSLLLETMRE
TRKIWLFSKKTSVQRLYEEHERIYNAVAAGNGAQAEAAMLAHLTNVEDVLSGYFEENVQ
>P19839 1.14.14.3~~~luxA~~~Alkanal monooxygenase alpha chain~~~
MKFGNFLLTYQPPQFSQTEVMKRLVKLGRISEECGFDTVWLLEHHFTEFGLLGNPYVAAAYLLGATKKLNVGTAAIVLPT
AHPVRQLEDVNLLDQMSKGRFRFGICRGLYNKDFRVFGTDMNNSRALAECWYGLIKNGMTEGYMEADNEHIKFHKVKVNP
AAYSRGGAPVYVVAESASTTEWAAQFGLPMILSWIINTNEKKAQLELYNEVAQEYGHDIHNIDHCLSYITSVDHDSIKAK
EICRKFLGHWYDSYVNATTIFDDSDQTRGYDFNKGQWRDFVLKGHKDTNRRIDYSYEINPVGTPQECIDIIQKDIDATGI
SNICCGFEANGTVDEIIASMKLFQSDVMPFLKEKQRSLLY
>P29238 1.14.14.3~~~luxA~~~Alkanal monooxygenase alpha chain~~~
MKISNICFSYQPPGESHQEVMERFIRLGVASEELNFDGFYTLEHHFTEFGITGNLYIACANILGRTKRIQVGTMGIVLPT
EHPARHVESLLVLDQLSKGRFNYGTVRGLYHKDFRVFGTSQEDSRKTAENFYSMILDASKTGVLHTDGEVVEFPDVNVYP
EAYSKKQPTCMTAESSETITYLAERGLPMVLSWIIPVSEKVSQMELYNEVAAEHGHDINNIEHILTFICSVNEDGEKADS
VCRNFLENWYDSYKNATNIFNDSNQTRGYDYLKAQWREWVMKGLADPRRRLDYSNELNPVGTPERCIEIIQSNIDATGIK
HITVGFEANGSEQEIRESMELFMEKVAPHLKDPQ
>P23146 1.14.14.3~~~luxA~~~Alkanal monooxygenase alpha chain~~~
MKFGNFLLTYQPPQFSQTEVMKWLVKLGRISEECGFDTVWLLEHHFTEFGLLGNPYVAAAYLLGATKKLNVGTAAIVLPT
AHPVRQLEEVNLLDQMSKGRFRFGICRGLYNKDFRVFGTDMNNSRALMECWYKLIRNGMTEGYMEADNEHIKFHKVKVLP
TAYSQGGAPIYVVAESASTTEWAAQHGLPMILSWIINTNDKKAQIELYNEVAQEYGHDIHNIDHCLSYITSVDHDSMKAK
EICRNFLGHWYDSYVNATTIFDDSDKTKGYDFNKGQWRDFVLKGHKNTNRRVDYSYEINPVGTPQECIDIIQTDIDATGI
SNICCGFEANGTVDEIISSMKLFQSDVMPFLKEKQQFSYYIS
>P24113 1.14.14.3~~~luxA~~~Alkanal monooxygenase alpha chain~~~
MKFGNIFSYQPPGESHKEVMDRFVRLGVASEELNFDTYWLEHHFTEFGLTGNLFVACANLLGRTTKLNVGTMIVLPTAHP
ARQMEDLLLLDQMSKGRFNFGVVRGYHKDFRVFGVTMEDSRAITEDFHTMIMDGTKTGLHTDGKNIEFPDVNVYPEAYLE
KIPTCMTAESATTTWLAERGLPMVLSWIITTSEKKAQMELYNAVRDSGYSEEYIKNVDHSMTLICSVDEDGKKAEDVREF
LGNWYDSYVNATNIFSESNQTRGYDYHKGQKDFVLQGHTNTKRRVDYSNDLNPVGTPEKCIEIQRDIDATGITNITLGFE
ANGSEEEIIASMKRFMQVAPFLKDPK
>P07740 1.14.14.3~~~luxA~~~Alkanal monooxygenase alpha chain~~~
MKFGNFLLTYQPPELSQTEVMKRLVNLGKASEGCGFDTVWLLEHHFTEFGLLGNPYVAAAHLLGATETLNVGTAAIVLPT
AHPVRQAEDVNLLDQMSKGRFRFGICRGLYDKDFRVFGTDMDNSRALMDCWYDLMKEGFNEGYIAADNEHIKFPKIQLNP
SAYTQGGAPVYVVAESASTTEWAAERGLPMILSWIINTHEKKAQLDLYNEVATEHGYDVTKIDHCLSYITSVDHDSNRAK
DICRNFLGHWYDSYVNATKIFDDSDQTKGYDFNKGQWRDFVLKGHKDTNRRIDYSYEINPVGTPEECIAIIQQDIDATGI
DNICCGFEANGSEEEIIASMKLFQSDVMPYLKEKQ
>P09141 1.14.14.3~~~luxB~~~Alkanal monooxygenase beta chain~~~
MNFGLFFLNFQLKGMTSEAVLDNMIDTIALVDKDEYHFKTAFVNEHHFSKNGIVGAPMTAASFLLGLTERLHIGSLNQVI
TTHHPVRIAEEASLLDQMSDGRFILGLSDCVSDFEMDFFKRQRDSQQQQFEACYEILNDGITTNYCYANNDFYNFPKISI
NPHCISKENLKQYILATSMGVVEWAAKKGLPLTYRWSDTLAEKENYYQRYLTVAAENNVDITHVDHQFPLLVNINPDRDI
AKQEMRDYIRGYIAEAYPNTDQEEKIEELIKQHAVGTEDEYYESSKYALEKTGSKNVLLSFESMKNKAAVIDLINMVNEK
IKKNL
>P19840 1.14.14.3~~~luxB~~~Alkanal monooxygenase beta chain~~~
MKFGLFFLNFINSTTVQEQSIVRMQEITEYVDKLNFEQILVYENHFSDNGVVGAPLTVSGFLLGLTEKIKIGSLNHIITT
HHPVAIAEEACLLDQLSEGRFILGFSDCEKKDEMHFFNRPVEYQQQLFEECYEIINDALTTGYCNPDNDFYSFPKISVNP
HAYTPGGPRKYVTATSHHIVEWAAKKGIPLIFKWDDSNDVRYEYAERYKAVADKYDVDLSEIDHQLMILVNYNEDSNKAK
QETRAFISDYVLEMHPNENFENKLEEIIAENAVGNYTECITAAKLAIEKCGAKSVLLSFEPMNDLMSQKNVINIVDDNIK
KYHMEYT
>P29239 1.14.14.3~~~luxB~~~Alkanal monooxygenase beta chain~~~
MNFGLFFLNFQPEGMTSEMVLDNMVDTVALVDKDDYHFKRVLVSEHHFSKNGIIGEPLTAISFLLGLTKRIEIGSLNQVI
TTHHPVRIGEQTGLLDQMSYGRFVLGLSDCVNDFEMDFFKRKRSSQQQQFEACYEILNEALTTNYCQADDDFFNFPRISV
NPHCISEVKQYILASSMGVVEWAARKGLPLTYRWSDSLAEKEKYYQRYLAVAKENNIDVSNIDHQFPLLVNINENRRIAR
DEVREYIQSYVSEAYPTDPNIELRVEELIEQHAVGKVDEYYDSTMHAVKVTGSKNLLLSFESMKNKDDVTKLINMFNQKI
KDNLIK
>P23147 1.14.14.3~~~luxB~~~Alkanal monooxygenase beta chain~~~
MKFGLFFLNFINSTTIQEQSIARMQEITEYVDKLNFEQILVCENHFSDNGVVGAPLTVSGFLLGLTEKIKIGSLNHVITT
HHPVRIAEEACLLDQLSEGRFILGFSDCERKDEMPFFNRPEQYQQQLFEECYDIINDALTTGYCNPNGDFYNFPKISMNP
HAYTQNGPRKYVTATSCHVVEWAAKKGIPLIFKWDDSNEVKHEYAKRYQAIAGEYGVDLAEIDHQLMILVNYSEDSEKAK
EETRAFISDYILAMHPNENFEKKLEEIITENSVGDYMECTTAAKLAMEKCGTKGILLSFESMSDFTHQINAIDIVNDNIK
KYHM
>P12744 1.14.14.3~~~luxB~~~Alkanal monooxygenase beta chain~~~
MNFGLFFLNFPENTSSETVLDNMINTVSLVDKDYKNFTTALVNHHFSKNGIVGAPMTAASFLLGLTERLHIGSLNQITTH
HPVRIAEEASLLDQMSDSRFILGLSDCVNFEMDFFKRQRDSQQLQFEACYDIINEAITTNYCANNDFYNFPRISINPHCL
SKENMKQYILASSVSVEWAAKKALPLTYRWSDTLEDKEILYKRYLEVAKHNIDVSNVEHQFPLLVNLNHDRDVAHQEATA
YVSYIAEVYPHLNQQQKIAELISQHAIGTDNDYYSTLNALERTGSKNVLLSFESMKNHDDVVKVINMNEKIQKNLPSS
>P07739 1.14.14.3~~~luxB~~~Alkanal monooxygenase beta chain~~~
MKFGLFFLNFMNSKRSSDQVIEEMLDTAHYVDQLKFDTLAVYENHFSNNGVVGAPLTVAGFLLGMTKNAKVASLNHVITT
HHPVRVAEEACLLDQMSEGRFAFGFSDCEKSADMRFFNRPTDSQFQLFSECHKIINDAFTTGYCHPNNDFYSFPKISVNP
HAFTEGGPAQFVNATSKEVVEWAAKLGLPLVFRWDDSNAQRKEYAGLYHEVAQAHGVDVSQVRHKLTLLVNQNVDGEAAR
AEARVYLEEFVRESYSNTDFEQKMGELLSENAIGTYEESTQAARVAIECCGAADLLMSFESMEDKAQQRAVIDVVNANIV
KYHS
>P19841 1.2.1.50~~~luxC~~~Long-chain acyl-protein thioester reductase~~~
MCNAEFKGDCMIKKIPMIIGGAERDTSEHEYRELTLNSYKVSIPIINQDDVEAIKSQSVENNLNINQIVNFLYTVGQKWK
SENYSRRLTYIRDLVRFLGYSPEMAKLEANWISMILSSKSALYDIVETELGSRHIVDEWLPQGDCYVKAMPKGKSVHLLA
GNVPLSGVTSIIRAILTKNECIIKTSSADPFTAIALASSFIDTDEHHPISRSMSVMYWSHNEDIAIPQQIMNCADVVVSW
GGYDAIKWATEHTPVNVDILKFGPKKSIAIVDNPVDITASAIGVAHDICFYDQQACFSTQDIYYIGDNIDAFFDELVEQL
NLYMDILPKGDQTFDEKASFSLIEKECQFAKYKVEKGDNQSWLLVKSPLGSFGNQPLARSAYIHHVSDISEITPYIENRI
TQTVTVTPWESSFKYRDVLASHGAERIVESGMNNIFRVGGAHDGMRPLQRLVKYISHERPYTYSTKDVAVKIEQTRYLEE
DKFLVFVP
>P19197 2.3.1.-~~~luxD~~~Acyl transferase~~~
MENESKYKTIDHVICVEGNKKIHVWETLPEENSPKRKNAIIIASGFARRMDHFAGLAEYLSRNGFHVIRYDSLHHVGLSS
GTIDEFTMSIGKQSLLAVVDWLTTRKINNFGMLASSLSARIAYASLSEINASFLITAVGFVNLRYSLERALGFDYLSLPI
NELPNNLDFEGHKLGAEVFARDCLDFGWEDLASTINNMMYLDIPFIAFTANNDNWVKQDEVITLLSNIRSNRCKIYSLLG
SSHDLSENLVVLRNFYQSVTKAAIAMDNDHLDIDVDITEPSFEHLTIATVNERRMRIEIENQAISLS
>P05521 2.3.1.-~~~luxD~~~Acyl transferase~~~
MNNQCKTIAHVLRVNNGQELHVWETPPKENVPFKNNTILIASGFARRMDHFAGLAEYLSENGFHVFRYDSLHHVGLSSGS
IDEFTMTTGKNSLCTVYHWLQTKGTQNIGLIAASLSARVAYEVISDLELSFLITAVGVVNLRDTLEKALGFDYLSLPIDE
LPNDLDFEGHKLGSEVFVRDCFEHHWDTLDSTLDKVANTSVPLIAFTANNDDWVKQEEVYDMLAHIRTGHCKLYSLLGSS
HDLGENLVVLRNFYQSVTKAAIAMDGGSLEIDVDFIEPDFEQLTIATVNERRLKAEIESRTPEMA
>P09142 ~~~luxF~~~Non-fluorescent flavoprotein~~~
MTKWNYGVFFLNFYHVGQQEPSLTMSNALETLRIIDEDTSIYDVVAFSEHHIDKSYNDETKLAPFVSLGKQIHILATSPE
TVVKAAKYGMPLLFKWDDSQQKRIELLNHYQAAAAKFNVDIAGVRHRLMLFVNVNDNPTQAKAELSIYLEDYLSYTQAET
SIDEIINSNAAGNFDTCLHHVAEMAQGLNNKVDFLFCFESMKDQENKKSLMINFDKRVINYRKEHNLN
>P12745 ~~~luxF~~~Non-fluorescent flavoprotein~~~
MNKWNYGVFFVNFYNKGQQEPSKTMNNALETLRIIDEDTSIYDVINIDDHYLVKKDSEDKKLASFITLGEKLYVLATSEN
TVDIAAKYALPLVFKWDDINEERLKLLSFYNASASKYNKNIDLVRHQLMLHVNVNEAETVAKEELKLYIENYVACTQPSN
FNGSIDSIIQSNVTGSYKDCLSYVANLAGKFDNTVDFLLCFESMQDQNKKKSVMIDLNNQVIKFRQDNNLI
>P54298 2.3.1.184~~~luxM~~~Acyl-homoserine-lactone synthase LuxM~~~
MKLMLSLGSLSANSLPIEKKQQVLIDLVIRTYQSHERTELFKAITEYRKNQLIALFPEHANKSYSIIFELMDYRDLIERY
PSTLSEEATLLEKVVGQCFMHWLDFWCECEIAAIKAKFPLKENELPAPQLLFEDSAYYGALVERVEDTQLMVQIPSHPQA
MPLSDAITLSNLELFIQGEKWYEMLSLLSLSQVGKHFIVLKHPVQDSCPTLVASALIQNWSVRDTWLSYAPQFSNEQWNY
CFPSYGYSEFTRLQLFTPSSLSKCYSLPEFDNEFKLQLSDTQAVCEVLRLTVSGNAQQKLYFLYLAQKELMSVLHQAGYK
IGFTIIEQPFMLNFYRAIDAKAYFHSGYCDLNDDGKQTYRGFWNFEMMVKAFSNIDFRGYKRAVRASRKRGSLERDEHV
>A7MRY4 2.7.13.3~~~luxN~~~Autoinducer 1 sensor kinase/phosphatase LuxN~~~
MFDFSLEAIVYAKAITLLATVAVVMMWLFYYCYRLKQKNEVIFGTHHAAYIAYSVCIIAWISSNAYFHTDLLPELGASAG
MFMAKFANLASFFAFAFAYYFSCQLAAEQRKGKVHRWQQGIFVSLTVYSLFINLRPGLTVEHVDIVGPSQFIIEFGPHTS
YFFIGLVSFVVLTLVNLVAMRTNSSKLTLAKTNYMIAGILVFMLSTAVIHLGMTYFMGDFSLTWLPPALSISEMLFVGYA
LLTSRFYSVKYIAYLALSVLLVCAIFVLPLGAIFIPLTESNQWLIAIPICALIGITWQLLYKKTSRYASFLIYGDKKTPV
QQILSLEEDFKLSIDDAMRRLGKLLQIPNDKLRLVTSNYNETFYEEYLSSNRSVLVFDELSEELEYKVSAKRSMKALYDK
MSSNNTALVMPLFGQGKSVTHLLISPHKSNNQMFSNEEISAVQTLLTRVQSTIEADRRIRQSRALANSIAHEMRNPLAQV
QLQFEALKQHIENHAPVEQITLDIENGQAAIQRGRQLIDIILREVSDSSPEHEPIAMTSIHKAVDQAVSHYGFENEKIIE
RIRLPQHTDFVAKLNETLFNFVIFNLIRNAIYYFDSYPDSQIEISTKTGPYENTLIFRDTGPGIDETISHKIFDDFFSYQ
KSGGSGLGLGYCQRVMRSFGGRIECKSKLGTFTEFHLYFPVVPNAPKADTLRTPYFNDWKQNKRSNEHKVAPNVQINNQS
PTVLIVDDKEVQRALVQMYLNQLGVNSLQANNGENAVEVFKANHVDLILMDVQMPVMNGFDASQRIKELSPQTPIVALSG
ESGERELDMINKLMDGRLEKPTTLNALRHVLGNWLNKNTASSACEAERE
>A7MVC2 ~~~luxO~~~Luminescence regulatory protein LuxO~~~
MVEDTASVAALYRSYLTPLGIDINIVGTGRDAIESLNHRIPDLILLDLRLPDMTGMDVLHAVKKSHPDVPIIFMTAHGSI
DTAVEAMRHGSQDFLIKPCEADRLRVTVNNAIRKATKLKNEADNPGNQNYQGFIGSSQTMQQVYRTIDSAASSKASIFIT
GESGTGKEVCAEAIHAASKRGDKPFIAINCAAIPKDLIESELFGHVKGAFTGAANDRQGAAELADGGTLFLDELCEMDLD
LQTKLLRFIQTGTFQKVGSSKMKSVDVRFVCATNRDPWKEVQEGRFREDLYYRLYVIPLHLPPLRERGKDVIEIAYSLLG
YMSHEEGKSFVRFAQDVIERFNSYEWPGNVRQLQNVLRNIVVLNNGKEITLDMLPPPLNQPVVRQSVAKFIEPDIMTVSD
IMPLWMTEKMAIEQAIQACEGNIPRAAGYLDVSPSTIYRKLQAWNSKDEKQNV
>O87455 ~~~luxO~~~Regulatory protein LuxO~~~COG3604
MQEFSLSTLLEMTVGLASGANNEERFHRLLDAVRKAVICDCVVLMSLHNDTLTPLAMQGLTRDTLGRRFVVSEHPRLAQI
CSADLPVRFAADCPLPDPFDGLLLDSEDDLPMHSCMGLPLHFGEQLLGILTLDSLKPDAFDHLSPRSLEILAAIAAATLK
MTLTFSELENQAKQTQLRLEELNQEAWSRDSVEIIGNSGPMLAMKADIDVVAPSQFNILIHGETGVGKELVARTIHQRSN
RKRQPLVYVNCAAIPENLLESELFGHVKGAFTGADRARMGKFALADGGTLFLDEIGELPLSAQSKILRALQNHEIQPVGQ
DRVQTVDVRILAATNRDLKKEVEAGRFRADLYHRLSVYPIYVPPLRERKGDLSLLAGYFVEQARRKLGITQLKLHGDVLS
QLIQYPWPGNVRELEHVINRAALKAKARQRGRPVTTLKVEDLGELQGPRAAMVEPTAQDEPMLGEWIGELGLRDATDEFQ
RHLISETLTQADFNWAEAARRLQTDRANLTRLAKRLGITVSRSHSIERSR
>Q9KT84 ~~~luxO~~~Regulatory protein LuxO~~~COG2204
MVEDTASVAALYRSYLTPLDIDINIVGTGRDAIESIGRREPDLILLDLRLPDMTGMDVLYAVKEKSPDVPIVFMTAHGSI
DTAVEAMRHGAQDFLIKPCEADRLRVTVNNAIRKASKLKNDVDNKNQNYQGFIGSSQTMQAVYRTIDSAASSKASIFITG
ESGTGKEVCAEAIHAASKRGDKPFIAINCAAIPKDLIESELFGHVKGAFTGAATERQGAAEAADGGTLFLDELCEMDLDL
QTKLLRFIQTGTFQKVGSSKMKSVDVRFVCATNRDPWKEVQEGRFREDLYYRLYVIPLHLPPLRARGDDVIEIAYSLLGF
MSKEEGKDFVRLSAEVVERFRQYEWPGNVRQLQNVLRNVVVLNEGREITLDMLPPPLNQMSAPINRALPLAHENKVSVHE
IFPLWMTEKQAIEQAIEACDGNIPRAATYLDVSPSTIYRKLQTWNEKVKEKEKER
>Q06877 ~~~lumP~~~Lumazine protein~~~
MFRGIVQGRGVIRSISKSEDSQRHGIAFPEGMFQLVDVDTVMLVNGCSLTVVRILGDMVYFDIDQALGTTTFDGLKEGDQ
VNLEIHPKFGEVVGRGGLTGNIKGTALVAAIEENDAGFSVLIDIPKGLAENLTVKDDIGIDGISLPITDMSDSIITLNYS
RDLLASTNIASLAKDVKVNVEILNEW
>P25082 ~~~luxL~~~Lumazine protein~~~
MFKGIVQGVGIIKKISKNDDTQRHGITFPKDILDSVEKDTVMLVNGCSVTVVRITGDVVYFDIDQAINTTTFRKLEVGNK
VNLEVRPGFGSLLGKGALTGNIKGVATVDNITEEEDLLKVYIKIPKDLIENISSEDHIGINGVSNSIEEVSNDIICINYP
KNLSITTNLGTLETGSEVNVETLNVSNEW
>P54300 ~~~luxP~~~Autoinducer 2-binding periplasmic protein LuxP~~~
MKKALLFSLISMVGFSPASQATQVLNGYWGYQEFLDEFPEQRNLTNALSEAVRAQPVPLSKPTQRPIKISVVYPGQQVSD
YWVRNIASFEKRLYKLNINYQLNQVFTRPNADIKQQSLSLMEALKSKSDYLIFTLDTTRHRKFVEHVLDSTNTKLILQNI
TTPVREWDKHQPFLYVGFDHAEGSRELATEFGKFFPKHTYYSVLYFSEGYISDVRGDTFIHQVNRDNNFELQSAYYTKAT
KQSGYDAAKASLAKHPDVDFIYACSTDVALGAVDALAELGREDIMINGWGGGSAELDAIQKGDLDITVMRMNDDTGIAMA
EAIKWDLEDKPVPTVYSGDFEIVTKADSPERIEALKKRAFRYSDN
>Q9KLK7 2.7.13.3~~~luxQ~~~Autoinducer 2 sensor kinase/phosphatase LuxQ~~~COG0784
MNIRPSQIKHKQRIASFITHAVVVVMGVLIVSVLFQSYQISSRLMAQEGQRTSVQTSSLIQSLFDFRLAALRIHQDSTAK
NASLINALVSRDSSRLDEFFSSVDELELSNAPDLRFISSHDNILWDDGNASFYGIAQQELNKLIRRVAISGNWHLVQTPS
EGKSVHILMRRSSLIEAGTGQVVGYLYVGIVLNDNFALLENIRSGSNSENLVLAVDTTPLVSTLKGNEPYSLDYVVHSAK
DAMRDSFIVGQTFLEVESVPTYLCVYSIQTNQNVLTLRDNFYFWMAFALISMIGVSIASRWWLQKRIQREIETLMNYTHK
LMDLDTKSEFIGSKIYEFDYFGRTLEQSFRRLANKEKQFEDLFNFALSPTMLWNTSGRLIRMNPSAQIQFLREDAQNHFL
FEILERQLLPTITNAAQGNNPSDVTTEVDGRVYRWNLSPIMVEGQIISIITQGQDITTIAEAEKQSQAARREAEESARVR
AEFLAKMSHELRTPLNGVLGVSQLLKRTPLNDEQREHVAVLCSSGEHLLAVLNDILDFSRLEQGKFRIQKNEFRLKELVC
AIDRIYRPLCNEKGLELVVNSNITTAAIVRSDQIRINQILFNLLNNAIKFTHQGSIRVELQLIEGDPLAQLVIQVVDTGI
GIREQDLTVIFEPFMQAESTTTREYGGSGLGLTIVHSLVEMLSGQLHVSSEYGIGTRFEIQLPIELVEKPDAPQQLLPAP
DPQPLFDKTLRVLLVEDNHTNAFIAQAFCRKYGLDVSWVTDGLQAIEELKIHDYDLVLMDNQLPYLDGVETTRTIKKVLH
LPVVVYACTADGLEETRQAFFHAGAEYVLVKPLKEQTLHKALEHFKHHHGQKNAGLN
>P54302 2.7.13.3~~~luxQ~~~Autoinducer 2 sensor kinase/phosphatase LuxQ~~~
MTTTRSNIKKRRSLATLITKIIILVLAPIILGIFIQSYYFSKQIIWQEVDRTKQQTSALIHNIFDSHFAAIQIHHDSNSK
SEVIRDFYTDRDTDVLNFFFLSIDQSDPSHTPEFRFLTDHKGIIWDDGNAHFYGVNDLILDSLANRVSFSNNWYYINVMT
SIGSRHMLVRRVPILDPSTGEVLGFSFNAVVLDNNFALMEKLKSESNVDNVVLVANSVPLANSLIGDEPYNVADVLQRKS
SDKRLDKLLVIETPIVVNAVTTELCLLTVQDNQSVVTLQIQHILAMLASIIGMIMIALMSREWIESKVSAQLESLMSYTR
SAREEKGFERFGGSDIEEFDHIGSTLESTFEELEAQKKSFRDLFNFALSPIMVWSEESVLIQMNPAARKELVIEDDHEIM
HPVFQGFKEKLTPHLKMAAQGATLTGVNVPIGNKIYRWNLSPIRVDGDISGIIVQGQDITTLIEAEKQSNIARREAEKSA
QARADFLAKMSHEIRTPINGILGVAQLLKDSVDTQEQKNQIDVLCHSGEHLLAVLNDILDFSKIEQGKFNIQKHPFSFTD
TMRTLENIYRPICTNKGVELVIENELDPNVEIFTDQVRLNQILFNLVSNAVKFTPIGSIRLHAELEQFYGAENSVLVVEL
TDTGIGIESDKLDQMFEPFVQEESTTTREYGGSGLGLTIVKNLVDMLEGDVQVRSSKGGGTTFVITLPVKDRERVLRPLE
VSQRIKPEALFDESLKVLLVEDNHTNAFILQAFCKKYKMQVDWAKDGLDAMELLSDTTYDLILMDNQLPHLGGIETTHEI
RQNLRLGTPIYACTADTAKETSDAFMAAGANYVMLKPIKENALHEAFVDFKQRFLVERT
>P12746 ~~~luxR~~~Transcriptional activator protein LuxR~~~
MKNINADDTYRIINKIKACRSNNDINQCLSDMTKMVHCEYYLLAIIYPHSMVKSDISILDNYPKKWRQYYDDANLIKYDP
IVDYSNSNHSPINWNIFENNAVNKKSPNVIKEAKTSGLITGFSFPIHTANNGFGMLSFAHSEKDNYIDSLFLHACMNIPL
IVPSLVDNYRKINIANNKSNNDLTKREKECLAWACEGKSSWDISKILGCSERTVTFHLTNAQMKLNTTNRCQSISKAILT
GAIDCPYFKN
>P21308 ~~~luxR~~~HTH-type transcriptional regulator LuxR~~~
MDSIAKRPRTRLSPLKRKQQLMEIALEVFARRGIGRGGHADIAEIAQVSVATVFNYFPTREDLVDEVLNHVVRQFSNFLS
DNIDLDIHARENIANITNAMIELVSQDCHWLKVWFEWSASTRDEVWPLFVTTNRTNQLLVQNMFIKAIERGEVCDQHEPE
HLANLFHGICYSIFVQANRSKSEAELTNLVSAYLDMLCIYNREHH
>O34667 4.4.1.21~~~luxS~~~S-ribosylhomocysteine lyase~~~COG1854
MPSVESFELDHNAVVAPYVRHCGVHKVGTDGVVNKFDIRFCQPNKQAMKPDTIHTLEHLLAFTIRSHAEKYDHFDIIDIS
PMGCQTGYYLVVSGEPTSAEIVDLLEDTMKEAVEITEIPAANEKQCGQAKLHDLEGAKRLMRFWLSQDKEELLKVFG
>Q9RRU8 4.4.1.21~~~luxS~~~S-ribosylhomocysteine lyase~~~COG1854
MPDMANVESFDLDHTKVKAPYVRLAGVKTTPKGDQISKYDLRFLQPNQGAIDPAAIHTLEHLLAGYMRDHLEGVVDVSPM
GCRTGMYMAVIGEPDEQGVMKAFEAALKDTAGHDQPIPGVSELECGNYRDHDLAAARQHARDVLDQGLKVQETILLER
>P45578 4.4.1.21~~~luxS~~~S-ribosylhomocysteine lyase~~~COG1854
MPLLDSFTVDHTRMEAPAVRVAKTMNTPHGDAITVFDLRFCVPNKEVMPERGIHTLEHLFAGFMRNHLNGNGVEIIDISP
MGCRTGFYMSLIGTPDEQRVADAWKAAMEDVLKVQDQNQIPELNVYQCGTYQMHSLQEAQDIARSILERDVRINSNEELA
LPKEKLQELHI
>P44007 4.4.1.21~~~luxS~~~S-ribosylhomocysteine lyase~~~COG1854
MPLLDSFKVDHTKMNAPAVRIAKTMLTPKGDNITVFDLRFCIPNKEILSPKGIHTLEHLFAGFMRDHLNGDSIEIIDISP
MGCRTGFYMSLIGTPNEQKVSEAWLASMQDVLGVQDQASIPELNIYQCGSYTEHSLEDAHEIAKNVIARGIGVNKNEDLS
LDNSLLK
>I0JJR3 4.4.1.21~~~luxS~~~S-ribosylhomocysteine lyase~~~COG1854
MTQMNVESFNLDHTKVKAPYIRLVGVTEGDKGDKIYKYDIRVKQPNQEHMDMPALHSLEHLMAENSRNHHDRIIDIGPMG
CQTGFYLAVLNDDSYENILQVVENTLKDVLNATEVPACNEVQCGFAASHSLEGAQELARELLNKRNEWTEVF
>Q9ZMW8 4.4.1.21~~~luxS~~~S-ribosylhomocysteine lyase~~~COG1854
MKMNVESFNLDHTKVKAPYVRIADRKKGVNGDLIVKYDVRFKQPNRDHMDMPSLHSLEHLVAEIIRNHANYVVDWSPMGC
QTGFYLTVLNHDNYTEILEVLEKTMQDVLKAKEVPASNEKQCGWAANHTLEGAQNLARAFLDKRAEWSEVGV
>Q8Z4D7 4.4.1.21~~~luxS~~~S-ribosylhomocysteine lyase~~~COG1854
MPLLDSFAVDHTRMQAPAVRGAKTMNTPHGDAITVFDLRFCIPNKEVMPEKGIHTLEHLFAGFMRDHLNGNGVEIIDISP
MGCRTGFYMSLIGTPDEQRVADAWKAAMADVLKVQDQNQIPELNVYQCGTYQMHSLSEAQDIARHILERDVRVNSNKELA
LPKEKLQELHI
>P65330 4.4.1.21~~~luxS~~~S-ribosylhomocysteine lyase~~~
MTKMNVESFNLDHTKVVAPFIRLAGTMEGLNGDVIHKYDIRFKQPNKEHMDMPGLHSLEHLMAENIRNHSDKVVDLSPMG
CQTGFYVSFINHDNYDDVLNIVEATLNDVLNATEVPACNEVQCGWAASHSLEGAKTIAQAFLDKRNEWHDVFGTGK
>P0C5S4 ~~~luxU~~~Phosphorelay protein LuxU~~~
MNTDVLNQQKIEELSAEIGSDNVPVLLDIFLGEMDSYIGTLTELQGSEQLLYLKEISHALKSSAASFGADRLCERAIAID
KKAKANQLQEQGMETSEMLALLHITRDAYRSWTN
>P21578 ~~~luxY~~~Yellow fluorescent protein~~~
MFKGIVEGIGIIEKIDIYTDLDKYAIRFPENMLNGIKKESSIMFNGCFLTVTSVNSNIVWFDIFEKEARKLDTFREYKVG
DRVNLGTFPKFGAASGGHILSARISCVASIIEIIENEDYQQMWIQIPENFTEFLIDKDYIAVDGISLTIDTIKNNQFFIS
LPLKIAQNTNMKWRKKGDKVNVELSNKINANQCW
>Q5ZY48 ~~~lvgA~~~Type 4 adapter protein LvgA~~~
MADGDIEIKAGFVDTDLDDRKLTMIDDLNNPLAIVERVYLIWWHWADFHLHVISPHIDTITPAIVIEPELIPGSNDHEFV
YSIHDSGSKLSTSKSQDMFSAGMSMCKLFYTIEKMVYILVERLKSGGVSMEAEVQIAFAGHEIAQRKAFESIINLPYNVV
VTNFDPGIWGEKYLQNVKRLADKGYGYPPESPRKIYMHPVSSGTTARK
>Q9RNG8 ~~~lvgA~~~Type 4 adapter protein LvgA~~~
MADGDIEIKAGFVDTDLDDRKLTMIDDLNNPLAIVERVYLIWWHWADFHLHVISPHIDTITPAIVIEPELISGSNDHEFV
YSIHDSGSKLSTSKSQDMFSAGMSMCKLFYTIEKMVHILVDRLKSGGVSMEEEVQIAFAGHEIAQRKAFESIINLPYNVV
VTNFDPGIWGEKYLQNVKRLADKGYGYPPESPRENYKHPVSSATTARK
>Q2NCA3 2.7.13.3~~~~~~Blue-light-activated histidine kinase 1~~~COG3920
MPLKGEISAQAGREFDTSRLDLRAIIDPRDLRVDPTRLFLETTQQTRLAICISDPHQPDCPVVYVNQAFLDLTGYAREEI
VGRNCRFLQGADTDPEQVRKLREGIAAERYTVVDLLNYRKDGIPFWNAVHVGPIYGEDGTLQYFYGSQWDITDIVAERRK
AETQRRIAAELRHRTGNIFAVLNAIIGLTSRRERDVSEFADKLSERVSALASAHRMTIMDEPDQEAVAIDDLVTGVMKPY
RNRFAERVTTSGPKIELGPRSVTALGLALHELATNAVKYGALSVDAGRVEISWSREDGDVTLVWQEQGGPTVSQEQSEPV
KGNGTMLIDGMIASLTGSIERDFAAAGLQAKITLPVHQPE
>Q2NB77 2.7.13.3~~~~~~Blue-light-activated histidine kinase 2~~~COG3920
MAVGLAEHDKEAWGRLPFSLTIADISQDDEPLIYVNRAFEQMTGYSRSSVVGRNCRFLQGEKTDPGAVERLAKAIRNCEE
VEETIYNYRADGEGFWNHLLMGPLEDQDEKCRYFVGIQVDMGQSESPDRATELDRQLAEVQHRVKNHLAMIVSMIRIQSS
QAGGVGSQFDSLSRRVEALQLLYQEMDIAGAAKATDKIIPLGAYLGRIASAINHIDGRGAIKVNVQADTVDVPVETAGRI
GLLVSEVLTNALQHAFSDRASGVVQLRSSVMSGEQLRVTVEDDGRGIPEDCDWPNEGNLGSRIVRQLVQGLGAELNVTRG
GTGTIVNIDIPLSQQKTLIADERTKD
>Q2NB98 ~~~~~~Light-activated DNA-binding protein EL222~~~COG4566
MLDMGQDRPIDGSGAPGADDTRVEVQPPAQWVLDLIEASPIASVVSDPRLADNPLIAINQAFTDLTGYSEEECVGRNCRF
LAGSGTEPWLTDKIRQGVREHKPVLVEILNYKKDGTPFRNAVLVAPIYDDDDELLYFLGSQVEVDDDQPNMGMARRERAA
EMLKTLSPRQLEVTTLVASGLRNKEVAARLGLSEKTVKMHRGLVMEKLNLKTSADLVRIAVEAGI
>Q9LBG2 1.1.1.-~~~lvr~~~Levodione reductase~~~
MTATSSPTTRFTDRVVLITGGGSGLGRATAVRLAAEGAKLSLVDVSSEGLEASKAAVLETAPDAEVLTTVADVSDEAQVE
AYVTATTERFGRIDGFFNNAGIEGKQNPTESFTAAEFDKVVSINLRGVFLGLEKVLKIMREQGSGMVVNTASVGGIRGIG
NQSGYAAAKHGVVGLTRNSAVEYGRYGIRINAIAPGAIWTPMVENSMKQLDPENPRKAAEEFIQVNPSKRYGEAPEIAAV
VAFLLSDDASYVNATVVPIDGGQSAAY
>P10773 3.2.1.17~~~lyzB~~~B-enzyme~~~
ISPLGSVTKKNQDSTAYNWTGNKTANGNWPVLGICAVHRKKDIGGSGNSPVIPFGTTLKTDKDIWLPDGVGYKSSFNVDD
TGSGPKKTDYWIDIYYSKDTKAAINYGVVKLSYTYST
>Q5ZU17 3.3.2.2~~~~~~Lysoplasmalogenase~~~COG3714
MTYSFSKPVSWVFLFTAVIYLVSLSFIQYPATTVLKPIPIVCLIVGVFRTSLSSSAKILLILALVFSLAGDVVLTLPFSL
QLELGIACFLLAHCFYITLFLKSFEFNRLHLFYYLPIFLFMGFAAFTMIPYLGNLLIPVMIYFCVLMLMVFSAFQVKKET
LTISSGALFFLISDLTLALNLFIYTQADVRIFVMFTYYVAQFLLTFGLVRLYEKGG
>P9WG51 3.3.2.2~~~~~~Lysoplasmalogenase~~~COG3714
MGSIAGFSSAVLSKLGIPVPYAPRLLAGGWVVAGWAGLAYGVYLTVIALRLPPGSELTGHAMLQPAFKASMAVLLAAAAV
AHPIGRERRWLVPALLLSATGDWLLAIPWWTWAFVFGLGAFLLAHLCFIGALLPLARQAAPSRGRVAAVVAMCVASAGLL
VWFWPHLGKDNLTIPVTVYIVALSAMVCTALLARLPTIWTAVGAVCFAASDSMIGIGRFILGNEALAVPIWWSYAAAEIL
ITAGFFFGREVPDNAAAPTDS
>Q2FVT1 ~~~lyrA~~~Lysostaphin resistance protein A~~~COG1266
MKNNKISGFQWAMTIFVFFVITMALSIMLRDFQSIIGVKHFIFEVTDLAPLIAAIICILVFKYKKVQLAGLKFSISLKVI
ERLLLALILPLIILIIGMYSFNTFADSFILLQSTGLSVPITHILIGHILMAFVVEFGFRSYLQNIVETKMNTFFASIVVG
LMYSVFSANTTYGTEFAAYNFLYTFSFSMILGELIRATKGRTIYIATTFHASMTFGLIFLFSEEIGDLFSIKVIAISTAI
VAVGYIGLSLIIRGIAYLTTRRNLEELEPNNYLDHVNDDEETNHTEAEKSSSNIKDAEKTGVATASTVGVAKNDTENTVA
DEPSIHEGTEKTEPQHHIGNQTESNHDEDHDITSESVESAESVKQAPQSDDLTNDSNEDEIEQSLKEPATYKEDRRSSVV
IDAEKHIEKTEEQSSDKNK
>Q7A3Z2 ~~~lyrA~~~Lysostaphin resistance protein A~~~
MKNNKISGFQWAMTIFVFFVITMALSIMLRDFQSIIGVKHFIFEVTDLAPLIAAIICILVFKYKKVQLAGLKFSISLKVI
ERLLLALILPLIILIIGMYSFNTFADSFILLQSTGLSVPITHILIGHILMAFVVEFGFRSYLQNIVETKMNTFFASIVVG
LMYSVFSANTTYGTEFAAYNFLYTFSFSMILGELIRATKGRTIYIATTFHASMTFGLIFLFSEEIGDLFSIKVIAISTAI
VAVGYIGLSLIIRGIAYLTTRRNLEELEPNNYLDHVNDDEETNHTEAEKSSSNIKDAEKTGVATASTVGVAKNDTENTVA
DEPSIHEGTEKTEPQHHIGNQTESNHDEDHDITSESVESAESVKQAPQSDDLTNDSNEDEIEQSLKEPATYKEDRRSSVV
IDAEKHIEKTEEQSSDKNK
>M4GGR9 5.1.1.5~~~lyr~~~Lysine racemase~~~
MSLGIRYLALLPLFVITACQQPVNYNPPATQVAQVQPAIVNNSWIEISRSALDFNVKKVQSLLGKQSSLCAVLKGDAYGH
DLSLVAPIMIENNVKCIGVTNNQELKEVRDLGFKGRLMRVRNATEQEMAQATNYNVEELIGDLDMAKRLDAIAKQQNKVI
PIHLALNSGGMSRNGLEVDNKSGLEKAKQISQLANLKVVGIMSHYPEEDANKVREDLARFKQQSQQVLEVMGLERNNVTL
HMANTFATITVPESWLDMVRVGGIFYGDTIASTDYKRVMTFKSNIASINYYPKGNTVGYDRTYTLKRDSVLANIPVGYAD
GYRRVFSNAGHALIAGQRVPVLGKTSMNTVIVDITSLNNIKPGDEVVFFGKQGNSEITAEEIEDISGALFTEMSILWGAT
NQRVLVD
>I6XEI5 3.2.1.17~~~~~~Putative peptidoglycan hydrolase Rv2525c~~~COG3757
MSVSRRDVLKFAAATPGVLGLGVVASSLRAAPASAGSLGTLLDYAAGVIPASQIRAAGAVGAIRYVSDRRPGGAWMLGKP
IQLSEARDLSGNGLKIVSCYQYGKGSTADWLGGASAGVQHARRGSELHAAAGGPTSAPIYASIDDNPSYEQYKNQIVPYL
RSWESVIGHQRTGVYANSKTIDWAVNDGLGSYFWQHNWGSPKGYTHPAAHLHQVEIDKRKVGGVGVDVNQILKPQFGQWA
>C7QJ42 1.14.11.-~~~~~~L-lysine 3-hydroxylase~~~COG2175
MKNLSAYEVYESPKTSGESRTEAVSEAAFESDPEVSAILVLTSSEASTLERVADLVTAHALYAAHDFCAQAQLAAAELPS
RVVARLQEFAWGDMNEGHLLIKGLPQVRSLPPTPTSNVHAVAATTPMSRYQALINECVGRMIAYEAEGHGHTFQDMVPSA
MSAHSQTSLGSAVELELHTEQAFSPLRPDFVSLACLRGDPRALTYLFSARQLVATLTTQEIAMLREPMWTTTVDESFLAE
GRTFLLGFERGPIPILSGADDDPFIVFDQDLMRGISAPAQELQQTVIRAYYAERVSHCLAPGEMLLIDNRRAVHGRSIFA
PRFDGADRFLSRSFIVADGSRSRHARSSFGRVVSARFS
>C7PLM6 1.14.11.-~~~~~~L-lysine 4-hydroxylase~~~COG2175
MRPLDVTPTISPGAQDLPRTMHFAAEPPLQPLIIDITEEEKLEITYIGKKLKRKYKSYDDPGFISMLHLNAYTLLPERIA
KVLSNFGTDFSDQQYGAVVLRGLIEIGQDELGPTPRSWQETDHEKIMEYGFISSLLHGAVPSKPVEYFAQRKGGGLMHAI
IPDENMSFTQTGSGSRTDLFVHTEDAFLHNAADFLSFLFLRNEERVPSTLYSIRSHGRPDAILQELFKPIYKCPKDANYA
SEEALGDDIRTSVLYGSRSAPFMRFDAAEQIYNEDANQDPEALHNLKRFWEEARKLIYNDFVPESGDLIFVNNHLCAHGR
NAFLAGFREENGQLVKCERRLMLRMMSKTSLINIREVTHPENPYLIMEEHYGKVYSAHLANL
>A5FF23 1.14.11.-~~~~~~L-lysine 4-hydroxylase~~~COG2175
MKSQSLIEDEIPVKENYAYQIPTSPLIVEVTPQERNILSNVGALLEKAFKSYENPDYIEALHLYSFQLLPERIARILSRF
GTDFSADQYGAIIFRGLLEVDQDHLGPTPANWQSADYSKLNKYGFICSLLHGAVPSKPVQYYAQRKGGGILHAVIPDEKM
AATQTGSGSKTNLYVHTEDAFLLHQADFLSFLYLRNEERVPSTLYSVRSHGKVNKIMEKLFDPIYQCPKDANYQEEINDG
PLASVLYGNKKLPFIRFDAAEQIFNENAGQTPEALYNLTEFWNEAKELINSDYIPDSGDVIFVNNHLCAHGRSAFTAGQK
EENGKLVPCERRQMLRMMSKTSLIHIRSMTHTDDPYFVMEEHLGKVFDQA
>J3BZS6 1.14.11.-~~~~~~L-lysine 4-hydroxylase~~~COG2175
MKSQSIMSVERSAETSLTLEIPTSPLIIKITQQERNILSNVGNLLVKAFGNYENPDYIASLHLHAFQLLPERITRILSQF
GSDFSAEQYGAIVFQGLIEVDQDDLGPTPPNWQGADYGKLNKYGFICSLLHGAVPSKPVQYYAQRKGGGLLHAVIPDEKM
AATQTGSGSKTDLFVHTEDAFLSNQADFLSFLYLRNEERVPSTLYSIRSHGKMNPVMKKLFEPIYQCPKDANYNDEDVAN
SGPTASVLYGNRELPFIRFDAAEQIFNENAGQTSEALGNLMDFWDEAKTLINSDYIPNSGDLIFVNNHLCAHGRSAFIAG
QRIENGEIIKCERRQMLRMMSKTSLIHIRSVTRTDDPYFIMEEHLGKIFDLD
>G8T8D0 1.14.11.-~~~~~~L-lysine 4-hydroxylase~~~COG2175
METIIESRQRINSPGVLPPPLSPLIVDVTPKERASISNVANILLKAFGHYEHPDFISALHLNAFQLLPERIAGILSRFGT
DFSRHQYGALVFRGLTEVDQEALGPTPPSWKETDYSKLVKYGFICSLLHGAIPSKPVQYYAQRKGGGLLHAVIPDEKMSH
TQTGSGSRTDLFVHTEDAFLFNQADFLSFLFLRNEEQVPSTLYSIRSHGDTNAIMAELFKPIYKCPKDANYADDENAGEE
VTTSILYGNRERPFIRFDAAEQIYNEKAGQTPEAMHNLVRFWDEAKQLIYNDFVPDSGDLIFVNNHLCAHGRNSFVAGYR
NENGQLVKCERRLMLRMMSKTSLINIQSVTQLNDPYFIMEEHYGKLFHSQQ
>P09181 ~~~cnl~~~Lysis protein for colicin N~~~
MCGKILLILFFIMTLSACQVNHIRDVKGGTVAPSSSSRLTGLKLSKRSKDPL
>P76594 2.3.1.-~~~patZ~~~Peptidyl-lysine N-acetyltransferase PatZ~~~COG0045
MSQRGLEALLRPKSIAVIGASMKPNRAGYLMMRNLLAGGFNGPVLPVTPAWKAVLGVLAWPDIASLPFTPDLAVLCTNAS
RNLALLEELGEKGCKTCIILSAPASQHEDLRACALRHNMRLLGPNSLGLLAPWQGLNASFSPVPIKRGKLAFISQSAAVS
NTILDWAQQRKMGFSYFIALGDSLDIDVDELLDYLARDSKTSAILLYLEQLSDARRFVSAARSASRNKPILVIKSGRSPA
AQRLLNTTAGMDPAWDAAIQRAGLLRVQDTHELFSAVETLSHMRPLRGDRLMIISNGAAPAALALDALWSRNGKLATLSE
ETCQKLRDALPEHVAISNPLDLRDDASSEHYIKTLDILLHSQDFDALMVIHSPSAAAPATESAQVLIEAVKHHPRSKYVS
LLTNWCGEHSSQEARRLFSEAGLPTYRTPEGTITAFMHMVEYRRNQKQLRETPALPSNLTSNTAEAHLLLQQAIAEGATS
LDTHEVQPILQAYGMNTLPTWIASDSTEAVHIAEQIGYPVALKLRSPDIPHKSEVQGVMLYLRTANEVQQAANAIFDRVK
MAWPQARVHGLLVQSMANRAGAQELRVVVEHDPVFGPLIMLGEGGVEWRPEDQAVVALPPLNMNLARYLVIQGIKSKKIR
ARSALRPLDVAGLSQLLVQVSNLIVDCPEIQRLDIHPLLASGSEFTALDVTLDISPFEGDNESRLAVRPYPHQLEEWVEL
KNGERCLFRPILPEDEPQLQQFISRVTKEDLYYRYFSEINEFTHEDLANMTQIDYDREMAFVAVRRIDQTEEILGVTRAI
SDPDNIDAEFAVLVRSDLKGLGLGRRLMEKLITYTRDHGLQRLNGITMPNNRGMVALARKLGFNVDIQLEEGIVGLTLNL
AQREES
>Q8ZMX2 2.3.1.-~~~pat~~~Peptidyl-lysine N-acetyltransferase Pat~~~
MSQQGLEALLRPKSIAVIGASMKPHRAGYLMMRNLLAGGFNGPVLPVTPAWKAVLGVMAWPDIASLPFTPDLAILCTNAS
RNLALLDALGAKGCKTCIILSAPTSQHEELLACARHYKMRLLGPNSLGLLAPWQGLNASFSPVPIKQGKLAFISQSAAVS
NTILDWAQQREMGFSYFIALGDSLDIDVDELLDYLARDSKTSAILLYLEQLSDARRFVSAARSASRNKPILVIKSGRSPA
AQRLLNTSAGMDPAWDAAIQRAGLLRVQDTHELFSAVETLSHMRPLRGDRLMIISNGAAPAALALDELWSRNGKLATLSE
ETCLQLRQALPAHIDIANPLDLCDDASSEHYVKTLDILLASQDFDALMVIHSPSAAAPGTESAHALIETIKRHPRGKFVT
LLTNWCGEFSSQEARRLFSEAGLPTYRTPEGTITAFMHMVEYRRNQKQLRETPALPSNLTSNTAEAHNLLQRAIAEGAAS
LDTHEVQPILHAYGLHTLPTWIASDSAEAVHIAEQIGYPVALKLRSPDIPHKSEVQGVMLYLRTASEVQQAANAIFDRVK
MAWPQARIHGLLVQSMANRAGAQELRVVVEHDPVFGPLIMLGEGGVEWRPEEQAVVALPPLNMNLARYLVIQGIKQRKIR
ARSALRPLDIVGLSQLLVQVSNLIVDCPEIQRLDIHPLLASASEFTALDVTLDIAPFDGDNESRLAVRPYPHQLEEWVEM
KNGDRCLFRPILPEDEPQLRQFIAQVTKEDLYYRYFSEINEFTHEDLANMTQIDYDREMAFVAVRRMDNAEEILGVTRAI
SDPDNVDAEFAVLVRSDLKGLGLGRRLMEKLIAYTRDHGLKRLNGITMPNNRGMVALARKLGFQVDIQLEEGIVGLTLNL
AKCDES
>Q7M135 3.4.21.50~~~~~~Lysyl endopeptidase~~~
GVSGSCNIDVVCPEGNGHRDVIRSVAAYSKQGTMWCTGSLVNNSANDKKMYFLTANHCGMTTAAIASSMVVYWNYQNSTC
RAPGSSSSGANGDGSLAQSQTGAVVRATNAASDFTLLELNTAANPAYNLFWAGWDRRDQNFAGATAIHHPNVAEKRISHS
TVATEISGYNGATGTSHLHVFWQASGGVTEPGSSGSPIYSPEKRVLGQLHGGPSSCSATGADRSDYYGRVFTSWTGGGTS
ATRLSDWLDAAGTGAQFIDGLDSTGTPPV
>Q02SZ7 3.4.21.50~~~prpL~~~Lysyl endopeptidase~~~
MHKRTYLNACLVLALAAGASQASAAPGASEMAGDVAVLQASPASTGHARFANPNAATSAAGIHFAAPPARRVARAAPLAP
KPGTPLQVGVGLKTATPEIDLATLEWIDTPDGRHTARFPISAAGAASLRAAIRLETRSGSLPDDVLLHFAGAGKEIFEAS
GKDLSLNRPYWSPVIEGDTLTVELVLPANLQPGDLRLSVPQVSYFADSLYKAGYRDGFGASGSCEVDAVCATQSGTRAYD
NATAAVAKMVFTSSADGGSYICTGTLLNNGNSPKRQLFWSAAHCIEDQATAATLQTIWFYNTTQCYGDASTINQSVTVLT
GGANILHRDAKRDTLLLELKRTPPAGVFYQGWSATPIANGSLGHDIHHPRGDAKKYSQGNVSAVGVTYDGHTALTRVDWP
SAVVEGGSSGSGLLTVAGDGSYQLRGGLYGGPSYCGAPTSQRNDYFSDFSGVYSQISRYFAP
>Q9HWK6 3.4.21.50~~~prpL~~~Lysyl endopeptidase~~~
MHKRTYLNACLVLALAAGASQALAAPGASEMAGDVAVLQASPASTGHARFANPNAAISAAGIHFAAPPARRVARAAPLAP
KPGTPLQVGVGLKTATPEIDLTTLEWIDTPDGRHTARFPISAAGAASLRAAIRLETHSGSLPDDVLLHFAGAGKEIFEAS
GKDLSVNRPYWSPVIEGDTLTVELVLPANLQPGDLRLSVPQVSYFADSLYKAGYRDGFGASGSCEVDAVCATQSGTRAYD
NATAAVAKMVFTSSADGGSYICTGTLLNNGNSPKRQLFWSAAHCIEDQATAATLQTIWFYNTTQCYGDASTINQSVTVLT
GGANILHRDAKRDTLLLELKRTPPAGVFYQGWSATPIANGSLGHDIHHPRGDAKKYSQGNVSAVGVTYDGHTALTRVDWP
SAVVEGGSSGSGLLTVAGDGSYQLRGGLYGGPSYCGAPTSQRNDYFSDFSGVYSQISRYFAP
>Q9AJC6 1.4.1.18~~~lysDH~~~Lysine 6-dehydrogenase~~~
MKVLVLGAGLMGKEAARDLVQSQDVEAVTLADVDLAKAEQTVRQLHSKKLAAVRVDAGDPQQLAAAMKGHDVVVNALFYQ
FNETVAKTAIETGVHSVDLGGHIGHITDRVLELHERAQAAGVTIIPDLGVAPGMINILSGYGASQLDEVESILLYVGGIP
VRPEPPLEYNHVFSLEGLLDHYTDPALIIRNGQKQEVPSLSEVEPIYFDRFGPLEAFHTSGGTSTLSRSFPNLKRLEYKT
IRYRGHAEKCKLLVDLTLTRHDVEVEINGCRVKPRDVLLSVLKPLLDLKGKDDVVLLRVIVGGRKDGKETVLEYETVTFN
DRENKVTAMARTTAYTISAVAQLIGRGVITKRGVYPPEQIVPGDVYMDEMKKRGVLISEKRTVHS
>P78285 3.2.1.17~~~rrrD~~~Lysozyme RrrD~~~COG3772
MPPSLRKAVAAAIGGGAIAIASVLITGPSGNDGLEGVSYIPYKDIVGVWTVCHGHTGKDIMLGKTYTKAECKALLNKDLA
TVARQINPYIKVDIPETTRGALYSFVYNVGAGNFRTSTLLRKINQGDIKGACDQLRRWTYAGGKQWKGLMTRREIEREVC
LWGQQ
>P94633 ~~~lysE~~~Lysine exporter LysE~~~COG1279
MEIFITGLLLGASLLLSIGPQNVLVIKQGIKREGLIAVLLVCLISDVFLFIAGTLGVDLLSNAAPIVLDIMRWGGIAYLL
WFAVMAAKDAMTNKVEAPQIIEETEPTVPDDTPLGGSAVATDTRNRVRVEVSVDKQRVWVKPMLMAIVLTWLNPNAYLDA
FVFIGGVGAQYGDTGRWIFAAGAFAASLIWFPLVGFGAAALSRPLSSPKVWRWINVVVAVVMTALAIKLMLMG
>P9WK31 ~~~lysE~~~Lysine exporter LysE~~~COG1279
MNSPLVVGFLACFTLIAAIGAQNAFVLRQGIQREHVLPVVALCTVSDIVLIAAGIAGFGALIGAHPRALNVVKFGGAAFL
IGYGLLAARRAWRPVALIPSGATPVRLAEVLVTCAAFTFLNPHVYLDTVVLLGALANEHSDQRWLFGLGAVTASAVWFAT
LGFGAGRLRGLFTNPGSWRILDGLIAVMMVALGISLTVT
>P94632 ~~~lysG~~~Lysine export transcriptional regulatory protein LysG~~~COG0583
MNPIQLDTLLSIIDEGSFEGASLALSISPSAVSQRVKALEHHVGRVLVSRTQPAKATEAGEVLVQAARKMVLLQAETKAQ
LSGRLAEIPLTIAINADSLSTWFPPVFNEVASWGGATLTLRLEDEAHTLSLLRRGDVLGAVTREANPVAGCEVVELGTMR
HLAIATPSLRDAYMVDGKLDWAAMPVLRFGPKDVLQDRDLDGRVDGPVGRRRVSIVPSAEGFGEAIRRGLGWGLLPETQA
APMLKAGEVILLDEIPIDTPMYWQRWRLESRSLARLTDAVVDAAIEGLRP
>P9WMF5 ~~~lysG~~~HTH-type transcriptional regulator LysG~~~COG0583
MVDPQLDGPQLAALAAVVELGSFDAAAERLHVTPSAVSQRIKSLEQQVGQVLVVREKPCRATTAGIPLLRLAAQTALLES
EALAEMGGNASLKRTRITIAVNADSMATWFSAVFDGLGDVLLDVRIEDQDHSARLLREGVAMGAVTTERNPVPGCRVHPL
GEMRYLPVASRPFVQRHLSDGFTAAAAAKAPSLAWNRDDGLQDMLVRKAFRRAITRPTHFVPTTEGFTAAARAGLGWGMF
PEKLAASPLADGSFVRVCDIHLDVPLYWQCWKLDSPIIARITDTVRAAASGLYRGQQRRRRPG
>Q93R93 2.6.1.118~~~lysJ~~~[LysW]-aminoadipate semialdehyde transaminase~~~COG4992
METRTLEDWRALLEAEKTLDSGVYNKHDLLIVRGQGARVWDAEGNEYIDCVGGYGVANLGHGNPEVVEAVKRQAETLMAM
PQTLPTPMRGEFYRTLTAILPPELNRVFPVNSGTEANEAALKFARAHTGRKKFVAAMRGFSGRTMGSLSVTWEPKYREPF
LPLVEPVEFIPYNDVEALKRAVDEETAAVILEPVQGEGGVRPATPEFLRAAREITQEKGALLILDEIQTGMGRTGKRFAF
EHFGIVPDILTLAKALGGGVPLGVAVMREEVARSMPKGGHGTTFGGNPLAMAAGVAAIRYLERTRLWERAAELGPWFMEK
LRAIPSPKIREVRGMGLMVGLELKEKAAPYIARLEKEHRVLALQAGPTVIRFLPPLVIEKEDLERVVEAVRAVLA
>Q5SHH5 2.6.1.118~~~lysJ~~~[LysW]-aminoadipate semialdehyde transaminase~~~COG4992
METRTLEDWRALLEAEKTLDSGVYNKHDLLIVRGQGARVWDAEGNEYIDCVGGYGVANLGHGNPEVVEAVKRQAETLMAM
PQTLPTPMRGEFYRTLTAILPPELNRVFPVNSGTEANEAALKFARAHTGRKKFVAAMRGFSGRTMGSLSVTWEPKYREPF
LPLVEPVEFIPYNDVEALKRAVDEETAAVILEPVQGEGGVRPATPEFLRAAREITQEKGALLILDEIQTGMGRTGKRFAF
EHFGIVPDILTLAKALGGGVPLGAAVMREEVARSMPKGGHGTTFGGNPLAMAAGVAAIRYLERTRLWERAAELGPWFMEK
LRAIPSPKIREVRGMGLMVGLELKEKAAPYIARLEKEHRVLALQAGPTVIRFLPPLVIEKEDLERVVEAVRAVLA
>Q8VUS5 3.5.1.130~~~lysK~~~[LysW]-lysine hydrolase~~~COG0624
MSKSALDPVEFLKGALEIPSPSGKERLVAEYLAEGMQKLGLKGFVDEADNARGQVGEGPVQVVLLGHIDTVPGQIPVRLE
GGRLFGRGAVDAKGPFVAMIFAAAGLSEEARKRLTVHLVGATEEEAPSSKGARFVAPRLKPHYAVIGEPSGWEGITLGYK
GRLLVKARREKDHFHSAHHEPNAAEELISYFVAIKAWAEAMNVGQRPFDQVQYTLRDFRVHPAELRQVAEMFFDLRLPPR
LPPEEAIRHLTAYAPPTIELEFFGREVPYQGPKDTPLTRAFRQAIRKAGGRPVFKLKTGTSDMNVLAPHWPVPMVAYGPG
DSTLDHTPYEHVEVAEFLKGIEVLRGALEALAQTHAGEKEG
>P25310 3.2.1.17~~~acm~~~Lysozyme M1~~~
MPAYSSLARRGRRPAVVLLGGLVSASLALTLAPTAAAAPLAPPPGKDVGPGEAYMGVGTRIEQGLGAGPDERTIGPADTS
GVQGIDVSHWQGSINWSSVKSAGMSFAYIKATEGTNYKDDRFSANYTNAYNAGIIRGAYHFARPNASSGTAQADYFASNG
GGWSRDNRTLPGVLDIEHNPSGAMCYGLSTTQMRTWINDFHARYKARTTRDVVIYTTASWWNTCTGSWNGMAAKSPFWVA
HWGVSAPTVPSGFPTWTFWQYSATGRVGGVSGDVDRNKFNGSAARLLALANNTA
>Q72LL6 2.6.1.39~~~lysN~~~2-aminoadipate transaminase~~~COG1167
MKPLSWSEAFGKGAGRIQASTIRELLKLTQRPGILSFAGGLPAPELFPKEEAAEAAARILREKGEVALQYSPTEGYAPLR
AFVAEWIGVRPEEVLITTGSQQALDLVGKVFLDEGSPVLLEAPSYMGAIQAFRLQGPRFLTVPAGEEGPDLDALEEVLKR
ERPRFLYLIPSFQNPTGGLTPLPARKRLLQMVMERGLVVVEDDAYRELYFGEARLPSLFELAREAGYPGVIYLGSFSKVL
SPGLRVAFAVAHPEALQKLVQAKQGADLHTPMLNQMLVHELLKEGFSERLERVRRVYREKAQAMLHALDREVPKEVRYTR
PKGGMFVWMELPKGLSAEGLFRRALEENVAFVPGGPFFANGGGENTLRLSYATLDREGIAEGVRRLGRALKGLLALV
>P75826 ~~~lysO~~~Lysine exporter LysO~~~COG2431
MFSGLLIILVPLIVGYLIPLRQQAALKVINQLLSWMVYLILFFMGISLAFLDNLASNLLAILHYSAVSITVILLCNIAAL
MWLERGLPWRNHHQQEKLPSRIAMALESLKLCGVVVIGFAIGLSGLAFLQHATEASEYTLILLLFLVGIQLRNNGMTLKQ
IVLNRRGMIVAVVVVVSSLIGGLINAFILDLPINTALAMASGFGWYSLSGILLTESFGPVIGSAAFFNDLARELIAIMLI
PGLIRRSRSTALGLCGATSMDFTLPVLQRTGGLDMVPAAIVHGFILSLLVPILIAFFSA
>P25737 ~~~lysP~~~Lysine-specific permease LysP~~~COG0833
MVSETKTTEAPGLRRELKARHLTMIAIGGSIGTGLFVASGATISQAGPGGALLSYMLIGLMVYFLMTSLGELAAYMPVSG
SFATYGQNYVEEGFGFALGWNYWYNWAVTIAVDLVAAQLVMSWWFPDTPGWIWSALFLGVIFLLNYISVRGFGEAEYWFS
LIKVTTVIVFIIVGVLMIIGIFKGAQPAGWSNWTIGEAPFAGGFAAMIGVAMIVGFSFQGTELIGIAAGESEDPAKNIPR
AVRQVFWRILLFYVFAILIISLIIPYTDPSLLRNDVKDISVSPFTLVFQHAGLLSAAAVMNAVILTAVLSAGNSGMYAST
RMLYTLACDGKAPRIFAKLSRGGVPRNALYATTVIAGLCFLTSMFGNQTVYLWLLNTSGMTGFIAWLGIAISHYRFRRGY
VLQGHDINDLPYRSGFFPLGPIFAFILCLIITLGQNYEAFLKDTIDWGGVAATYIGIPLFLIIWFGYKLIKGTHFVRYSE
MKFPQNDKK
>A2RNZ6 ~~~lysP~~~Lysine-specific permease LysP~~~COG0833
MRVSLSLTSRCRINFIRERILENSSNSTTETQVKRALKSRHVSMIALGGTIGTGLFLTSGDVIHTAGPFGALTAYVLIGA
MVYFLMTSLGEMATYLPTSGSFSDYGTRYVDPAFGFALGWNYWLNWAITVAVDLTAVALCIKFWLPDVPSWIFSLIALII
VFSINALSVKTFGETEYWLSAIKITVVVLFLIIGFLSIFGIMGGHIDVAKNLSVGNHGFVGGLGSFTTGGGILGVLLVAG
FSFQGTELLGITAGEAENPEKSIPKAMNSIFWRILVFYILSIFVMAAIIPFTDPHLVGGNSAAQSPFTIVFERVGFSIAA
SIMNAVVLTSVVSAANSGMYASTRMLYSLAKDGGAPKIFSKTSKNGIPFIALLATTAVALLTFLTSIYGVSFFTLLVSAS
GLTGFIAWIGIAISHFRFRRAYVAQGKDVKKLPYHAKLFPFGPILALIMTVLVTLGQDPMLLFGKTWVQGVVMYAAIPLF
FILYLGYKFKNKTKLIPLKDVDLSRHKD
>Q9X1T3 5.1.1.5~~~~~~Lysine racemase~~~
MVYPRLLINLKEIEENARKVVEMASRRGIEIVGVTKVTLGDPRFAETLRKAGIGILGESRIRNVLRMKKAGIEGPFMLLR
LPMMSELVEDVKHFDYIMVSDPDVAKKVDELSREMKRNVKIIYMIDVGDLREGVWFEKAVEEIAQCRGANIVGIGTNFGC
YGGIIPTREKFEILLDIKEKLEKNHGFNIEIVSGGNTPALYALENGEIPEGINQLRIGEAIVLGRDITNNRVIDWLSQNT
FLIEAEVIEVKEKPSVPLGKRGLDVFGRKVDFVDRGIRKRAICALGEQDIDSRGLIPVDKGVEVLHASSDHIVLDVTDFG
DVKVGDVFRFRMTYSCLLKAMTSPFVEKVYEPSI
>Q04HB7 5.1.1.5~~~~~~Lysine racemase~~~COG0787
MVEAIHRSTRIEFSKSSLAYNVQYTKQVSGAKTLWLAVKSNAYGHGLLQVSKIARECGVDGLAVSVLDEGIAIRQAGIDD
FILILGPIDVKYAPIASKYHFLTTVSSLDWLKSADKILGKEKLSVNLAVDTGMNRIGVRSKKDLKDEIEFLQEHSDHFSY
DGIFTHFASSDNPDDHYFQRQKNRWYELIDGLIMPRYVHVMNSGAAMYHSKELPGCNSIARVGTVVYGVEPSEGVLGPID
KLKPVFELKSALTFVKKIPAGEGISYGSKFVTSRDTWIGTLPIGYGDGWLAEYQDFQLLIDGQKCRQVGQIAMDQMMVAL
PHEYPIGTEVTLIGKSGKYENTLYDLHKHSGVPPWKITVAFSDRLKRMVVD
>Q5SH22 ~~~lysW~~~Alpha-aminoadipate carrier protein LysW~~~
MVGTCPECGAELRLENPELGELVVCEDCGAELEVVGLDPLRLEPAPEEAEDWGE
>O06136 ~~~LysX2~~~MprF-like domain protein Rv1619~~~COG2898
MVAAAGEPLNCQRANPEVTVKLPSADVVPRLRGRQRVVVHVDSRTARCVGALALVCAACWLIALLAGDYRHAQWAVAGRL
GWSLTVLAAVAFIARGIFLGRPVTAMHATAAGLFLLAGLAAHVLVADLLGEILIAGSGWALMWPTSAHPRPEDLPRVWAL
INATRADSLAPFAMQAGKSHHFSAAGTAALAYRTRIGYAVVSGDPIGDEAQFPQLVADFAAMCHMHGWRIVVVGCSERRL
GLWSDPMVVGQSLRPIPIGRDVVIDVSNFEMTGRRFRNLRQAVKRTHNFGVTTEIVAEQQLDDQRQAELAEVLAASPSGA
RTDRGFCMNLDGVLEGRYPGIQLIIARDASGRVQGFHRYATAGGGSDMSLDVPWRRRGAPNGIDERLSADMIAAAKDAGV
QRLSLAFAAFPDLFGANQLGRLQRVCRALIHILDPLIALESLYRYLRKFHALDERRYVLISMTQVFALALVLLSLEFVPR
RRHL
>P9WFU7 ~~~lysX~~~Lysylphosphatidylglycerol biosynthesis bifunctional protein LysX~~~COG1190
MGLHLTVPGLRRDGRGVQSNSHDTSSKTTADISRCPQHTDAGLQRAATPGISRLLGISSRSVTLTKPRSATRGNSRYHWV
PAAAGWTVGVIATLSLLASVSPLIRWIIKVPREFINDYLFNFPDTNFAWSFVLALLAAALTARKRIAWLVLLANMVLAAV
VNAAEIAAGGNTAAESFGENLGFAVHVVAIVVLVLGYREFWAKVRRGALFRAAAVWLAGAVVGIVASWGLVELFPGSLAP
DERLGYAANRVVGFALADPDLFTGRPHVFLNAIFGLFGAFALIGAAIVLFLSQRADNALTGEDESAIRGLLDLYGKDDSL
GYFATRRDKSVVFASSGRACITYRVEVGVCLASGDPVGDHRAWPQAVDAWLRLCQTYGWAPGVMGASSQGAQTYREAGLT
ALELGDEAILRPADFKLSGPEMRGVRQAVTRARRAGLTVRIRRHRDIAEDEMAQTITRADSWRDTETERGFSMALGRLGD
PADSDCLLVEAIDPHNQVLAMLSLVPWGTTGVSLDLMRRSPQSPNGTIELMVSELALHAESLGITRISLNFAVFRAAFEQ
GAQLGAGPVARLWRGLLVFFSRWWQLETLYRSNMKYQPEWVPRYACYEDARVIPRVGVASVIAEGFLVLPFSRRNRVHTG
HHPAVPERLAATGLLHHDGSAPDVSGLRQVGLTNGDGVERRLPEQVRVRFDKLEKLRSSGIDAFPVGRPPSHTVAQALAA
DHQASVSVSGRIMRIRNYGGVLFAQLRDWSGEMQVLLDNSRLDQGCAADFNAATDLGDLVEMTGHMGASKTGTPSLIVSG
WRLIGKCLRPLPNKWKGLLDPEARVRTRYLDLAVNAESRALITARSSVLRAVRETLFAKGFVEVETPILQQLHGGATARP
FVTHINTYSMDLFLRIAPELYLKRLCVGGVERVFELGRAFRNEGVDFSHNPEFTLLEAYQAHADYLEWIDGCRELIQNAA
QAANGAPIAMRPRTDKGSDGTRHHLEPVDISGIWPVRTVHDAISEALGERIDADTGLTTLRKLCDAAGVPYRTQWDAGAV
VLELYEHLVECRTEQPTFYIDFPTSVSPLTRPHRSKRGVAERWDLVAWGIELGTAYSELTDPVEQRRRLQEQSLLAAGGD
PEAMELDEDFLQAMEYAMPPTGGLGMGIDRVVMLITGRSIRETLPFPLAKPH
>Q5SH23 6.3.2.43~~~lysX~~~Alpha-aminoadipate--LysW ligase LysX~~~COG0189
MLAILYDRIRPDERMLFERAEALGLPYKKVYVPALPMVLGERPKELEGVTVALERCVSQSRGLAAARYLTALGIPVVNRP
EVIEACGDKWATSVALAKAGLPQPKTALATDREEALRLMEAFGYPVVLKPVIGSWGRLLAKVTDRAAAEALLEHKEVLGG
FQHQLFYIQEYVEKPGRDIRVFVVGERAIAAIYRRSAHWITNTARGGQAENCPLTEEVARLSVKAAEAVGGGVVAVDLFE
SERGLLVNEVNHTMEFKNSVHTTGVDIPGEILKYAWSLAS
>O50146 1.2.1.103~~~lysY~~~[LysW]-L-2-aminoadipate 6-phosphate reductase~~~COG0002
MDKKTLSIVGASGYAGGEFLRLALSHPYLEVKQVTSRRFAGEPVHFVHPNLRGRTNLKFIPPEKLEPADILVLALPHGVF
AREFDRYSALAPILIDLSADFRLKDPELYRRYYGEHPRPDLLGCFVYAVPELYREALKGADWIAGAGCNATATLLGLYPL
LKAGVLKPTPIFVTLLISTSAAGAEASPASHHPERAGSIRVYKPTGHRHTAEVVENLPGRPEVHLTAIATDRVRGILMTA
QCFVQDGWSERDVWQAYREAYAGEPFIRLVKQKKGVHRYPDPRFVQGTNYADIGFELEEDTGRLVVMTAIDNLVKGTAGH
ALQALNVRMGWPETLGLDFPGLHP
>O50147 2.7.2.17~~~lysZ~~~[LysW]-aminoadipate kinase~~~COG0548
MIVVKVGGAEGINYEAVAKDAASLWKEGVKLLLVHGGSAETNKVAEALGHPPRFLTHPGGQVSRLTDRKTLEIFEMVYCG
LVNKRLVELLQKEGANAIGLSGLDGRLFVGRRKTAVKYVENGKVKVHRGDYTGTVEEVNKALLDLLLQAGYLPVLTPPAL
SYENEAINTDGDQIAALLATLYGAEALVYLSNVPGLLARYPDEASLVREIPVERIEDPEYLALAQGRMKRKVMGAVEAVR
GGVKRVVFADARVENPIRRALSGEGTVVR
>P34020 3.2.1.17~~~lyc~~~Autolytic lysozyme~~~COG3409
MKGIDIYSGQGSVDFNAVKESGVEVVYIKATEGLTYTDSTYKDFYDGAKNAGLKIGFYHYLRANDPTSEAEHFFNTISGL
SLDCKCAIDVEVTLGQSIDQISSNVRKFADYLINKGLDVCVYTYTNFYKDNLNSTVKDLPLWIAEYGVSKPNIDASYVGF
QYSDSGSVNGISGSADLDEFSEGILVGGTVVIDPGQGGDDNIKAIQQDLNILLKRGLEVDGIEGPETEAAIKDFQSIMGL
TVDGIWGTNTSGAAQQIFSRPLDGVAYPHYEYATRYIQYRVGASVDGTFGSGTKAKVAAWQSNQGLMADGVVGSATWSKL
LDEN
>Q02112 ~~~lytA~~~Membrane-bound protein LytA~~~
MKKFIALLFFILLLSGCGVNSQKSQGEDVSPDSNIETKEGTYVGLADTHTIEVTVDNEPVSLDITEESTSDLDKFNSGDK
VTITYEKNDEGQLLLKDIERAN
>P59205 3.2.1.96~~~lytB~~~Putative endo-beta-N-acetylglucosaminidase~~~COG4193
MKKVRFIFLALLFFLASPEGAMASDGTWQGKQYLKEDGSQAANEWVFDTHYQSWFYIKADANYAENEWLKQGDDYFYLKS
GGYMAKSEWVEDKGAFYYLDQDGKMKRNAWVGTSYVGATGAKVIEDWVYDSQYDAWFYIKADGQHAEKEWLQIKGKDYYF
KSGGYLLTSQWINQAYVNASGAKVQQGWLFDKQYQSWFYIKENGNYADKEWIFENGHYYYLKSGGYMAANEWIWDKESWF
YLKFDGKMAEKEWVYDSHSQAWYYFKSGGYMTANEWIWDKESWFYLKSDGKIAEKEWVYDSHSQAWYYFKSGGYMTANEW
IWDKESWFYLKSDGKIAEKEWVYDSHSQAWYYFKSGGYMAKNETVDGYQLGSDGKWLGGKTTNENAAYYQVVPVTANVYD
SDGEKLSYISQGSVVWLDKDRKSDDKRLAITISGLSGYMKTEDLQALDASKDFIPYYESDGHRFYHYVAQNASIPVASHL
SDMEVGKKYYSADGLHFDGFKLENPFLFKDLTEATNYSAEELDKVFSLLNINNSLLENKGATFKEAEEHYHINALYLLAH
SALESNWGRSKIAKDKNNFFGITAYDTTPYLSAKTFDDVDKGILGATKWIKENYIDRGRTFLGNKASGMNVEYASDPYWG
EKIASVMMKINEKLGGKD
>P59206 3.2.1.96~~~lytB~~~Putative endo-beta-N-acetylglucosaminidase~~~COG4193
MKKVRFIFLALLFFLASPEGAMASDGTWQGKQYLKEDGSQAANEWVFDTHYQSWFYIKADANYAENEWLKQGDDYFYLKS
GGYMAKSEWVEDKGAFYYLDQDGKMKRNAWVGTSYVGATGAKVIEDWVYDSQYDAWFYIKADGQHAEKEWLQIKGKDYYF
KSGGYLLTSQWINQAYVNASGAKVQQGWLFDKQYQSWFYIKENGNYADKEWIFENGHYYYLKSGGYMAANEWIWDKESWF
YLKFDGKIAEKEWVYDSHSQAWYYFKSGGYMAANEWIWDKESWFYLKFDGKMAEKEWVYDSHSQAWYYFKSGGYMTANEW
IWDKESWFYLKSDGKIAEKEWVYDSHSQAWYYFKSGGYMTANEWIWDKESWFYLKSDGKMAEKEWVYDSHSQAWYYFKSG
GYMAKNETVDGYQLGSDGKWLGGKATNKNAAYYQVVPVTANVYDSDGEKLSYISQGSVVWLDKDRKSDDKRLAITISGLS
GYMKTEDLQALDASKDFIPYYESDGHRFYHYVAQNASIPVASHLSDMEVGKKYYSADGLHFDGFKLENPFLFKDLTEATN
YSAEELDKVFSLLNINNSLLENKGATFKEAEEHYHINALYLLAHSALESNWGRSKIAKDKNNFFGITAYDTTPYLSAKTF
DDVDKGILGATKWIKENYIDRGRTFLGNKASGMNVEYASDPYWGEKIASVMMKINEKLGGKD
>Q02114 3.5.1.28~~~lytC~~~N-acetylmuramoyl-L-alanine amidase LytC~~~COG0860
MRSYIKVLTMCFLGLILFVPTALADNSVKRVGGSNRYGTAVQISKQMYSTASTAVIVGGSSYADAISAAPLAYQKNAPLL
YTNSDKLSYETKTRLKEMQTKNVIIVGGTPAVSSNTANQIKSLGISIKRIAGSNRYDTAARVAKAMGATSKAVILNGFLY
ADAPAVIPYAAKNGYPILFTNKTSINSATTSVIKDKGISSTVVVGGTGSISNTVYNKLPSPTRISGSNRYELAANIVQKL
NLSTSTVYVSNGFSYPDSIAGATLAAKKKQSLILTNGENLSTGARKIIGSKNMSNFMIIGNTPAVSTKVANQLKNPVVGE
TIFIDPGHGDQDSGAIGNGLLEKEVNLDIAKRVNTKLNASGALPVLSRSNDTFYSLQERVNKAASAQADLFLSIHANAND
SSSPNGSETYYDTTYQAANSKRLAEQIQPKLAANLGTRDRGVKTAAFYVIKYSKMPSVLVETAFITNASDASKLKQAVYK
DKAAQAIHDGTVSYYR
>P39848 3.2.1.96~~~lytD~~~Beta-N-acetylglucosaminidase~~~COG4193
MKKRLIAPMLLSAASLAFFAMSGSAQAAAYTDYSLYKVEPSNTFSTESQASQAVAKLEKDTGWDASYQASGTTTTYQISA
SGIHSESEAKAILSGLAKQTSITGTSSPVGSKQPYVTISSGAISGEKQANTILAKLKQETGVAGAVKAYGAAQPYMNVMT
SDIADETKVKALIQSLAKQTGIKSSYQPITHTVSVTTIQSGTIVGDSRAAQIKNAFQKESGLQASLKETVKGQAYYTFTT
AAISGEANAKTLLQQLKQSTGITGSYKSINQKTTVESYNVQSAYFKGLSTVKDAISQIKKNTGVSGSYQQVGKSTSYTVN
MKGITKQQLQKIDTFFKKKKWHYTSSSVKKTTTSAAYQITTAKILGEQQANKAAAFFAQKKVKAAKTAAGSTAENQYQLI
SEETSDQAKVTKGLNILKKNQLSASAKSVKKQIADTFKITTESLLDQTKVNQALTFFKSNHISVASQKTGQTAASSYQIT
TEAIISQEEIDRVLTFFKQNHIAVTTSKTGQTAYTQYKIVTTQLSSKTALNNGLTYLKSKSVTPSYTTKSNTLYKISVNE
QFTGNDTAAAASTKLKQLYGWTSSIVKIKNGPQIMKTNYNLSLRDMVQKQMTVSPQTDGAAYVSLTYINTATSTVTADVL
NIRSTPEVSPTNVIGQFKKGDKVKVIGQINGWAKINLGWRNASSDEVVQYVDPNNFSRDSKYYFQFLKLSQTAGLSVTEV
NQKVLAGKGILTGRAKAFIDAANQYSINELYLISHALLETGNGTSALANGLTYNGKTVYNMYGIGAYDSNPNYYGAKYAY
EQGWFTPEAAIIGGAKFIGSSYIHNTAYNQDTLYKMRWSATATHQYATDIGWAYKQVNRMYSLYSLLDGYTLYFDVPEYR
>P54421 3.4.-.-~~~lytE~~~Probable peptidoglycan endopeptidase LytE~~~COG0791
MKKQIITATTAVVLGSTLFAGAASAQSIKVKKGDTLWDLSRKYDTTISKIKSENHLRSDIIYVGQTLSINGKSTSSKSSS
SSSSSSTYKVKSGDSLWKISKKYGMTINELKKLNGLKSDLLRVGQVLKLKGSTSSSSSSSSKVSSSSTSTYKVKSGDSLS
KIASKYGTTVSKLKSLNGLKSDVIYVNQVLKVKGTSTSSSKPASSSSSSSSKTSSTSLNVSKLVSDAKALVGTPYKWGGT
TTSGFDCSGFIWYVLNKQTSVGRTSTAGYWSSMKSIASPSVGDFVFFTTYKSGPSHMGIYIGNNSFIHAGSDGVQISSLN
NSYWKPRYLGAKRF
>O07532 3.4.-.-~~~lytF~~~Peptidoglycan endopeptidase LytF~~~COG0791
MKKKLAAGLTASAIVGTTLVVTPAEAATIKVKSGDSLWKLAQTYNTSVAALTSANHLSTTVLSIGQTLTIPGSKSSTSSS
TSSSTTKKSGSSVYTVKSGDSLWLIANEFKMTVQELKKLNGLSSDLIRAGQKLKVSGTVSSSSSSSKKSNSNKSSSSSSK
SSSNKSSSSSSSTGTYKVQLGDSLWKIANKVNMSIAELKVLNNLKSDTIYVNQVLKTKSSGSDTSSKDNSSKSNQTSATT
KYTVKSGDSLWKIANNYNLTVQQIRNINNLKSDVLYVGQVLKLTGKASSGSSSSSSSSSNASSGTTTTYTVKSGDSLWVI
AQKFNVTAQQIREKNNLKTDVLQVGQKLVISGKASSSSSSGSSNTTSSTSAKINTMISAAKAQLGVPYRWGGTTPSGFDC
SGFIYYVLNKVTSVSRLTAAGYWNTMKSVSQPAVGDFVFFSTYKAGPSHVGIYLGNGEFINANDSGVVISNMNNSYWKQR
YLGAKRYF
>O32083 3.2.1.-~~~lytG~~~Exo-glucosaminidase LytG~~~COG1705
MARKKLKKRKLLISLFFLVSIPLALFVLATTLSKPIEISKETEEIDEQQVFIDSLSGHAQILYEKYHVLPSITIAQAILE
SDWGNSELAAKANNLFGVKGNYKGHHVTMETDEVEKGKRKTIRAKFRKYSTFFESMDDHAQLFVRGTSWNKKKYKPVLEA
GNYKEAATALQTSGYATDPDYADKISAIVEKYDLDEYDEVNPSLKSVDLNASIKDSAVQDVWSKPSTDDRSIRLTSAQSY
VGKDIKVVSKKQKGQSVWYQFQINDKLIGWIDDSAVEIKEAT
>O32130 3.4.-.-~~~lytH~~~L-Ala--D-Glu endopeptidase~~~COG0739
MKVLLSALLLLLFAFEPSASGKKLSDPVLSKRMELYHKIEAVTQIPWYALAAVDQYEENVRSNRKDLPEKAGIISIYIPD
DIWSGPENPNPKDDAPLSIKVFDGIGMDGDGDGKAEVSNDEDILYTFSQYLLSYGTDEDNIRIGLWNYYRRDQTVGIISE
FMKLFKAYGHIDLGEHAFPLPIRTDYSYRSTWGDARGFGGRRIHEGTDIFAHYGLPVKSTCYGVVEMKGWNRFGGWRIGI
RDINNTYHYFAHLNGFAKGIKTGQIVEPGQVIGSVGSSGYGPPGTAGKFPPHLHYGMYKDNGRTEWSFDPYPHLRAWERY
EYQKKK
>Q2FXU3 3.5.1.-~~~lytH~~~Probable cell wall amidase LytH~~~COG0860
MKKIEAWLSKKGLKNKRTLIVVIAFVLFIIFLFLLLNSNSEDSGNITITENAELRTGPNAAYPVIYKVEKGDHFKKIGKV
GKWIEVEDTSSNEKGWIAGWHTNLDIVADNTKEKNPLQGKTIVLDPGHGGSDQGASSNTKYKSLEKDYTLKTAKELQRTL
EKEGATVKMTRTDDTYVSLENRDIKGDAYLSIHNDALESSNANGMTVYWYHDNQRALADTLDATIQKKGLLSNRGSRQEN
YQVLRQTKVPAVLLELGYISNPTDETMIKDQLHRQILEQAIVDGLKIYFSA
>O33599 3.4.24.75~~~lytM~~~Glycyl-glycine endopeptidase LytM~~~COG0739
MKKLTAAAIATMGFATFTMAHQADAAETTNTQQAHTQMSTQSQDVSYGTYYTIDSNGDYHHTPDGNWNQAMFDNKEYSYT
FVDAQGHTHYFYNCYPKNANANGSGQTYVNPATAGDNNDYTASQSQQHINQYGYQSNVGPDASYYSHSNNNQAYNSHDGN
GKVNYPNGTSNQNGGSASKATASGHAKDASWLTSRKQLQPYGQYHGGGAHYGVDYAMPENSPVYSLTDGTVVQAGWSNYG
GGNQVTIKEANSNNYQWYMHNNRLTVSAGDKVKAGDQIAYSGSTGNSTAPHVHFQRMSGGIGNQYAVDPTSYLQSR
>Q9ZNI1 3.-.-.-~~~lytN~~~Probable cell wall hydrolase LytN~~~COG1388
MFVYYCKECFIMNKQQSKVRYSIRKVSIGILSISIGMFLALGMSNKAYADEIDKSKDFTRGYEQNVFAKSELNANKNTTK
DKIKNEGAVKTSDTSLKLDNKSAISNGNEINQDIKISNTPKNSSQGNNLVINNNELTKEIKIANLEAQNSNQKKTNKVTN
NYFGYYSFREAPKTQIYTVKKGDTLSAIALKYKTTVSNIQNTNNIANPNLIFIGQKLKVPMTPLVEPKPKTVSSNNKSNS
NSSTLNYLKTLENRGWDFDGSYGWQCFDLVNVYWNHLYGHGLKGYGAKDIPYANNFNSEAKIYHNTPTFKAEPGDLVVFS
GRFGGGYGHTAIVLNGDYDGKLMKFQSLDQNWNNGGWRKAEVAHKVVHNYENDMIFIRPFKKA
>Q2FX77 3.5.1.28~~~lytO~~~Probable autolysin LytO~~~COG1388
MQAKLTKNEFIEWLKTSEGKQFNVDLWYGFQCFDYANAGWKVLFGLLLKGLGAKDIPFANNFDGLATVYQNTPDFLAQPG
DMVVFGSNYGAGYGHVAWVIEATLDYIIVYEQNWLGGGWTDGIEQPGWGWEKVTRRQHAYDFPMWFIRPNFKSETAPRSV
QSPTQAPKKETAKPQPKAVELKIIKDVVKGYDLPKRGSNPKGIVIHNDAGSKGATAEAYRNGLVNAPLSRLEAGIAHSYV
SGNTVWQALDESQVGWHTANQIGNKYYYGIEVCQSMGADNATFLKNEQATFQECARLLKKWGLPANRNTIRLHNEFTSTS
CPHRSSVLHTGFDPVTRGLLPEDKRLQLKDYFIKQIRAYMDGKIPVATVSNESSASSNTVKPVASAWKRNKYGTYYMEES
ARFTNGNQPITVRKVGPFLSCPVGYQFQPGGYCDYTEVMLQDGHVWVGYTWEGQRYYLPIRTWNGSAPPNQILGDLWGEI
S
>P60611 ~~~lytR~~~Transcriptional regulatory protein LytR~~~COG3279
MKALIIDDEPLARNELTYLLNEIGGFEEINEAENVKETLEALLINQYDIIFLDVNLMDENGIELGAKIQKMKEPPAIIFA
TAHDQYAVQAFELNATDYILKPFGQKRIEQAVNKVRATKAKDDNNASAIANDMSANFDQSLPVEIDDKIHMLKQQNIIGI
GTHNGITTIHTTNHKYETTEPLNRYEKRLNPTYFIRIHRSYIINTKHIKEVQQWFNYTYMVILTNGVKMQVGRSFMKDFK
ASIGLL
>P60609 ~~~lytR~~~Transcriptional regulatory protein LytR~~~
MKALIIDDEPLARNELTYLLNEIGGFEEINEAENVKETLEALLINQYDIIFLDVNLMDENGIELGAKIQKMKEPPAIIFA
TAHDQYAVQAFELNATDYILKPFGQKRIEQAVNKVRATKAKDDNNASAIANDMSANFDQSLPVEIDDKIHMLKQQNIIGI
GTHNGITTIHTTNHKYETTEPLNRYEKRLNPTYFIRIHRSYIINTKHIKEVQQWFNYTYMVILTNGVKMQVGRSFMKDFK
ASIGLL
>Q53705 2.7.13.3~~~lytS~~~Sensor histidine kinase/phosphatase LytS~~~COG3275
MLSLTMLLLERVGLIIILAYVLMNIPYFKNLMNRRRTWKARWQLCIIFSLFALMSNLTGIVIDHQHSLSGSVYFRLDDDV
SLANTRVLTIGVAGLVGGPFVGLFVGVISGIFRVYMGGADAQVYLISSIFIGIIAGYFGLQAQRRKRYPSIAKSAMIGIV
MEMIQMLSILTFSHDKAYAVDLISLIALPMIIVNSVGTAIFMSIIISTLKQEEQMKAVQTHDVLQLMNQTLPYFKEGLNR
ESAQQIAMIIKNLMKVSAVAITSKNEILSHVGAGSDHHIPTNEILTSLSKDVLKSGKLKEVHTKEEIGCSHPNCPLRAAI
VIPLEMHGSIVGTLKMYFTNPNDLTFVERQLAEGLANIFSSQIELGEAETQSKLLKDAEIKSLQAQVSPHFFFNSINTIS
ALVRINSEKARELLLELSYFFRANLQGSKQHTITLDKELSQVRAYLSLEQARYPGRFNININVEDKYRDVLVPPFLIQIL
VENAIKHAFTNRKQGNDIDVSVIKETATHVRIIVQDNGQGISKDKMHLLGETSVESESGTGSALENLNLRLKGLFGKSAA
LQFESTSSGTTFWCVLPYERQEEE
>P60612 2.7.13.3~~~lytS~~~Sensor histidine kinase/phosphatase LytS~~~
MLSLTMLLLERVGLIIILAYVLMNIPYFKNLMNRRRTWKARWQLCIIFSLFALMSNLTGIVIDHQHSLSGSVYFRLDDDV
SLANTRVLTIGVAGLVGGPFVGLFVGVISGIFRVYMGGADAQVYLISSIFIGIIAGYFGLQAQRRKRYPSIAKSAMIGIV
MEMIQMLSILTFSHDKAYAVDLISLIALPMIIVNSVGTAIFMSIIISTLKQEEQMKAVQTHDVLQLMNQTLPYFKEGLNR
ESAQQIAMIIKNLMKVSAVAITSKNEILSHVGAGSDHHIPTNEILTSLSKDVLKSGKLKEVHTKEEIGCSHPNCPLRAAI
VIPLEMHGSIVGTLKMYFTNPNDLTFVERQLAEGLANIFSSQIELGEAETQSKLLKDAEIKSLQAQVSPHFFFNSINTIS
ALVRINSEKARELLLELSYFFRANLQGSKQHTITLDKELSQVRAYLSLEQARYPGRFNININVEDKYRDVLVPPFLIQIL
VENAIKHAFTNRKQGNDIDVSVIKETATHVRIIVQDNGQGISKDKMHLLGETSVESESGTGSALENLNLRLKGLFGKSAA
LQFESTSSGTTFWCVLPYERQEEE
>P37677 2.7.1.-~~~lyx~~~L-xylulose/3-keto-L-gulonate kinase~~~COG1070
MTQYWLGLDCGGSWLKAGLYDREGREAGVQRLPLCALSPQPGWAERDMAELWQCCMAVIRALLTHSGVSGEQIVGIGISA
QGKGLFLLDKNDKPLGNAILSSDRRAMEIVRRWQEDGIPEKLYPLTRQTLWTGHPVSLLRWLKEHEPERYAQIGCVMMTH
DYLRWCLTGVKGCEESNISESNLYNMSLGEYDPCLTDWLGIAEINHALPPVVGSAEICGEITAQTAALTGLKAGTPVVGG
LFDVVSTALCAGIEDEFTLNAVMGTWAVTSGITRGLRDGEAHPYVYGRYVNDGEFIVHEASPTSSGNLEWFTAQWGEISF
DEINQAVASLPKAGGDLFFLPFLYGSNAGLEMTSGFYGMQAIHTRAHLLQAIYEGVVFSHMTHLNRMRERFTDVHTLRVT
GGPAHSDVWMQMLADVSGLRIELPQVEETGCFGAALAARVGTGVYHNFSEAQRDLRHPVRTLLPDMTAHQLYQKKYQRYQ
HLIAALQGFHARIKEHTL
>Q5E1G4 1.1.1.17~~~yggP~~~Mannitol-1-phosphate 5-dehydrogenase~~~COG1063
MTQTTAAVICGEKDIQLRTFELPSISADELLVKNISNSVCLSTYKAALLGSKHKRVPENIDEVPVITGHEYAGVIVEVGE
NLKDQFKAGDSFVLQPAMGLPTGYSAGYSYETFGGNATYSIIPKIAIDLGCVLPYDGSYYADASLAEPMSCIIGAFHASY
HTTQFVYEHEMGIKEGGTLALLACAGPMGIGAIDYAINGPVKPRRIVVTDIDEDRLSRAESLIPVSAAKAQGIELIYVNT
IEMEDPVTYLKSLNDDQGYDDVMVYAAVAQVLEQADALLGNDGCLNFFAGPTDKEFKVPFNFYNVHYESTHIVGTSGGST
GDMVESLELSAQGDINPSFMITHVGGLQAAPHTILNQLDIPGGKKLIYPHIDLPLTAIDNFASLAEQDPFFSELDAILAK
NNYVWNQHAEKALLEFYDVSLSV
>P50468 ~~~~~~M protein, serotype 2.1~~~
MARKDTNKQYSLRKLKTGTASVAVAVAVLGAGFANQTTVKANSKNPVPVKKEAKLSEAELHDKIKNLEEEKAELFEKLDK
VEEEHKKVEEEHKKDHEKLEKKSEDVERHYLRQLDQEYKEQQERQKNLEELERQSQREVEKRYQEQLQKQQQLEKEKQIS
EASRKSLRRDLEASRAAKKDLEAEHQKLKEEKQISEASRKSLRRDLEASRAAKKDLEAEHQKLKEEKQISEASRQGLSRD
LEASRAAKKDLEAEHQKLKEEKQISEASRQGLSRDLEASREAKKKVEADLAEANSKLQALEKLNKELEEGKKLSEKEKAE
LQAKLEAEAKALKEQLAKQAEELAKLKGNQTPNAKVAPQANRSRSAMTQQKRTLPSTGETANPFFTAAAATVMVSAGMLA
LKRKEEN
>P16947 ~~~~~~M protein, serotype 49~~~
MARKDTNKQYSLRKLKTGTASVAVAVAVLGAGFANQTEVKAAEKKVEAKVEVAENNVSSVARREKELYDQIADLTDKNGE
YLERIGELEERQKNLEKLEHQSQVAADKHYQEQAKKHQEYKQEQEERQKNQEQLERKYQREVEKRYQEQLQKQQQLETEK
QISEASRKSLSRDLEASREAKKKVEADLAALTAEHQKLKEEKQISDASRQGLSRDLEASREAKKKVEADLAALTAEHQKL
KEEKQISDASRQGLSRDLEASREAKKKVEADLAEANSKLQALEKLNKELEEGKKLSEKEKAELQARLEAEAKALKEQLAK
QAEELAKLKGNQTPNAKVAPQANRSRSAMTQQKRTLPSTGETANPFFTAAAATVMVSAGMLALKRKEEN
>P02977 ~~~emm5~~~M protein, serotype 5~~~
MARENTNKHYWLRKLKKGTASVAVALSVLGAGLVVNTNEVSAAVTRGTINDPQRAKEALDKYELENHDLKTKNEGLKTEN
EGLKTENEGLKTENEGLKTEKKEHEAENDKLKQQRDTLSTQKETLEREVQNTQYNNETLKIKNGDLTKELNKTRQELANK
QQESKENEKALNELLEKTVKDKIAKEQENKETIGTLKKILDETVKDKIAKEQENKETIGTLKKILDETVKDKLAKEQKSK
QNIGALKQELAKKDEANKISDASRKGLRRDLDASREAKKQLEAEHQKLEEQNKISEASRKGLRRDLDASREAKKQLEAEQ
QKLEEQNKISEASRKGLRRDLDASREAKKQVEKALEEANSKLAALEKLNKELEESKKLTEKEKAELQAKLEAEAKALKEQ
LAKQAEELAKLRAGKASDSQTPDTKPGNKAVPGKGQAPQAGTKPNQNKAPMKETKRQLPSTGETANPFFTAAALTVMATA
GVAAVVKRKEEN
>P08089 ~~~emm6~~~M protein, serotype 6~~~
MAKNNTNRHYSLRKLKKGTASVAVALSVIGAGLVVNTNEVSARVFPRGTVENPDKARELLNKYDVENSMLQANNDKLTTE
NNNLTDQNKNLTTENKNLTDQNKNLTTENKNLTDQNKNLTTENKELKAEENRLTTENKGLTKKLSEAEEEAANKERENKE
AIGTLKKTLDETVKDKIAKEQESKETIGTLKKTLDETVKDKIAKEQESKETIGTLKKTLDETVKDKIAKEQESKETIGTL
KKILDETVKDKIAREQKSKQDIGALKQELAKKDEGNKVSEASRKGLRRDLDASREAKKQVEKDLANLTAELDKVKEEKQI
SDASRQGLRRDLDASREAKKQVEKALEEANSKLAALEKLNKELEESKKLTEKEKAELQAKLEAEAKALKEQLAKQAEELA
KLRAGKASDSQTPDAKPGNKVVPGKGQAPQAGTKPNQNKAPMKETKRQLPSTGETANPFFTAAALTVMATAGVAAVVKRK
EEN
>Q3AEU2 4.3.1.2~~~~~~Methylaspartate ammonia-lyase 1~~~COG3799
MRIKDVLFVKGSSGFYFDDQKAIKSGAVTDGFTYKGKPLTPGFSRVRQGGEAVSIMLFLENGEIAVGDCVAVQYSGVDGR
DPVFLADNFIEVLEEEIKPRLVGYNLVRFREAARYFTNLTDKRGKRYHTALRYGLTQALLDAVAKINRTTMAEVIAEEYG
LDLTLNPVPLFAQSGDDRYINADKMILKRVDVLPHGLFNHPAKTGEEGKNLTEYALWLKQRIKTLGDHDYLPVFHFDVYG
TLGTVFNDNLDRIADYLARLEEKVAPHPLQIEGPVDLGSKERQIEGLKYLQEKLITLGSKVIIVADEWCNNLSDIKEFVD
AGAGGMVQIKSPDLGGVNDIIEAVLYAKEKGTGAYLGGSCNETDVSAKITVHVGLATGPAQLLVKPGMGVDEGLTIMRNE
MMRTLAILQRNKVTFQKKVG
>O66145 4.3.1.2~~~~~~Methylaspartate ammonia-lyase~~~COG3799
MKIKQALFTAGYSSFYFDDQQAIKNGAGHDGFIYTGDPVTPGFTSVRQAGECVSVQLILENGAVAVGDCAAVQYSGAGGR
DPLFLAEHFIPFLNDHIKPLLEGRDVDAFLPNARFFDKLRIDGNLLHTAVRYGLSQALLDATALASGRLKTEVVCDEWQL
PCVPEAIPLFGQSGDDRYIAVDKMILKGVDVLPHALINNVEEKLGFKGEKLREYVRWLSDRILSLRSSPRYHPTLHIDVY
GTIGLIFDMDPVRCAEYIASLEKEAQGLPLYIEGPVDAGNKPDQIRMLTAITKELTRLGSGVKIVADEWCNTYQDIVDFT
DAGSCHMVQIKTPDLGGIHNIVDAVLYCNKHGMEAYQGGTCNETEISARTCVHVALAARPMRMLIKPGMGFDEGLNIVFN
EMNRTIALLQTKD
>Q05514 4.3.1.2~~~~~~Methylaspartate ammonia-lyase~~~
MKIVDVLCTPGLTGFYFDDQRAIKKGAGHDGFTYTGSTVTEGFTQVRQKGESISVLLVLEDGQVAHGDCAAVQYSGAGGR
DPLFLAKDFIPVIEKEIAPKLIGREITNFKPMAEEFDKMTVNGNRLHTAIRYGITQAILDAVAKTRKVTMAEVIRDEYNP
GAEINAVPVFAQSGDDRYDNVDKMIIKEADVLPHALINNVEEKLGLKGEKLLEYVKWLRDRIIKLRVREDYAPIFHIDVY
GTIGAAFDVDIKAMADYIQTLAEAAKPFHLRIEGPMDVEDRQKQMEAMRDLRAELDGRGVDAELVADEWCNTVEDVKFFT
DNKAGHMVQIKTPDLGGVNNIADAIMYCKANGMGAYCGGTCNETNRSAEVTTNIGMACGARQVLAKPGMGVDEGMMIVKN
EMNRVLALVGRRK
>P77791 2.3.1.79~~~maa~~~Maltose O-acetyltransferase~~~COG0110
MSTEKEKMIAGELYRSADETLSRDRLRARQLIHRYNHSLAEEHTLRQQILADLFGQVTEAYIEPTFRCDYGYNIFLGNNF
FANFDCVMLDVCPIRIGDNCMLAPGVHIYTATHPIDPVARNSGAELGKPVTIGNNVWIGGRAVINPGVTIGDNVVVASGA
VVTKDVPDNVVVGGNPARIIKKL
>P71534 1.1.1.100~~~mabA~~~3-oxoacyl-[acyl-carrier-protein] reductase MabA~~~COG1028
MTVTDNPADTAGEATAGRPAFVSRSVLVTGGNRGIGLAIARRLAADGHKVAVTHRGSGAPDDLFGVQCDVTDSAAVDRAF
KEVEEHQGPVEVLVANAGISKDAFLMRMTEERFEEVINTNLTGAFRCAQRASRTMQRKRFGRIIFIGSVSGMWGIGNQAN
YAAAKAGLIGMARSISRELAKAGVTANVVAPGYIDTEMTRALDERIQAGALDFIPAKRVGTAEEVAGAVSFLASEDASYI
AGAVIPVDGGMGMGH
>P9WGT3 1.1.1.100~~~mabA~~~3-oxoacyl-[acyl-carrier-protein] reductase MabA~~~COG1028
MTATATEGAKPPFVSRSVLVTGGNRGIGLAIAQRLAADGHKVAVTHRGSGAPKGLFGVECDVTDSDAVDRAFTAVEEHQG
PVEVLVSNAGLSADAFLMRMTEEKFEKVINANLTGAFRVAQRASRSMQRNKFGRMIFIGSVSGSWGIGNQANYAASKAGV
IGMARSIARELSKANVTANVVAPGYIDTDMTRALDERIQQGALQFIPAKRVGTPAEVAGVVSFLASEDASYISGAVIPVD
GGMGMGH
>A0A1V0ELS9 1.13.11.86~~~mabB~~~5-aminosalicylate 1,2-dioxygenase~~~
MNAPDSFQTDLDALHEDMARANMAPTWKYVSDFVAKEPRVGFRPWLWRWNDVLPLLMRAGDLITPERGAERRSMEHVNPD
LKSAYSTSHTIATAFQLVRAGETAPAHRHAAAAIRFAARSKGGSVYTRVQGERLMMEEFDLLLTPAGTWHEHANETANDI
VWLDALDFPLVNLLKASVFEPGDSDTCEPKPDDFSRQHLGLYRPVGWSDYPEPHPVMRYPWVEMKAALDAAASSGATGSP
FDGIVMAYTNPLNSGPTLPTLSCRAQLLRPKESTCAHRATSSTVYFVISGTGTTVVNGTAYRWGPGDVFVVPNWAWHEHL
NGDSDAYLFSITDEPVMRTLGIYREQAYAAPGPHQLITGEFDSTQQCVRELSSL
>Q8GAI3 1.5.3.19~~~abo~~~4-methylaminobutanoate oxidase (formaldehyde-forming)~~~
MDRLVDRDISASMFTATSHDPLPTHVRTVVVGGGIIGASIAYHLSAAGENDTLLLESNVLGSGTSWHAAGLVTGARGTTT
MTKLAKYGLDFYSRLEQMSGLDVSFQRCGSLSVARTAGRVDELLYAKDVADQQGVRTEWLTEDRYKELWPLATYSGVAGA
LLLPDDGHINPGHATVALAKLAHSLGTQIRENVAVHKVLRQGDLVVGVLTDQGIVHCDRVILACGLWTRDLAATAGVKVP
LYAAEHIHVRSAEIDGAVPELPVYRDLDNSYYIRHEAGRLLVGAFEPDGLPRPVEEIPSNGFAEFGPEWEHFAPIRAKAE
GVVPALASAGFDRFLNAPESFTPDANFAVGETSELSNLFVAAGFNSQGIIFAPGIGKELAEWVISGTPGFDSSAVDVQRF
SGHQNNRNYLKARTKEGLGRLYAMHWPNLQMETGRNVRRTPLHARLAELGACFGEVNGGERANWYGAPGTSPTYDYSYGR
PNWFDRVAEEHKAAREGVVLFDLSPFAKFEVAGPDALEVCQMAATADIDVETDKAVYTLFLNDRAGIELDGTITRLGLDR
FLVVTPSFTQQKTAAYLKRIARGKAAAVFDCTAALATIGVMGPKSRELLSRISPEDWSDEAQRYTHGRMVEIADGYAYSL
RVSFVGELGYELYPSADMAVNVLDALWEAGQDLGLKLAGYHALDSLRSEKGFRHLGHDIGPIDDPYSAGLRFTISMDKPG
GFLGKDALLKLDPTAPDHRTVYVALEDPDPVFVHDETVYCNGLPVGRMTSGSYGHTLGRAVGIAALEPDADLSGDFEVQC
KGRLYPAKVSRRPFYDPKGERLRG
>Q8GAJ0 1.5.3.21~~~mao~~~4-methylaminobutanoate oxidase (methylamine-forming)~~~
MGRIGILGAGLAGLAAATKLAEAGENVTVFEARNRPGGRVWSETLDTPKGSYVIERGAEFVLDGYTSMRRLLSQFGLSLV
DTGMSYYVREPGDTTGITCDDIIRTGREALELASGSGLQGTAEELLAKLPDEPELVDALRARIEISTAVSASEVTARSLQ
HIASFEPKPSWRVAGGNQRLPDAMAAALGSAVRYGETVRAVENISDGGVLVTTDTDTSVFDTVVVALPLAVIRDSQLNLP
TTEARDAALKHVLQGHAAKLHLPLETQPATSAVMSVEGRYWTWTATDESGAVAPVLNAFMGSPSAITRANLKQRPAEWVA
KARALRTDLAIPQDAAALTTVWSEDQLAGGAYAAHAPGVTAAGTALLEKPVGDVFWAGEYSEPEFVGLMEGAIRSGERAA
GRVMQRLETKSGNSDSERSKA
>P75830 ~~~macA~~~Macrolide export protein MacA~~~COG0845
MKKRKTVKKRYVIALVIVIAGLITLWRILNAPVPTYQTLIVRPGDLQQSVLATGKLDALRKVDVGAQVSGQLKTLSVAIG
DKVKKDQLLGVIDPEQAENQIKEVEATLMELRAQRQQAEAELKLARVTYSRQQRLAQTKAVSQQDLDTAATEMAVKQAQI
GTIDAQIKRNQASLDTAKTNLDYTRIVAPMAGEVTQITTLQGQTVIAAQQAPNILTLADMSAMLVKAQVSEADVIHLKPG
QKAWFTVLGDPLTRYEGQIKDVLPTPEKVNDAIFYYARFEVPNPNGLLRLDMTAQVHIQLTDVKNVLTIPLSALGDPVGD
NRYKVKLLRNGETREREVTIGARNDTDVEIVKGLEAGDEVVIGEAKPGAAQ
>Q2EHL8 7.6.2.-~~~macB~~~Macrolide export ATP-binding/permease protein MacB~~~COG0577
MNIIEIKQLNRYFGEGENRVHVLKDISLSIERGDFVAIMGQSGSGKSTLMNIIGCLDTATGGSSKIDGKETIELTNDQLS
DLRSQKFGFIFQRYNLLSSLTAAENVALPAIYAGMPQSQRLERAKQLLEKLGLGDKWQNKPNQLSGGQQQRVSIARALMN
GGEIILADEPTGALDSHSGENVMEILRQLHEEGHTIIMVTHDKHIAASANRIIEIKDGEIISDTQKRQVKSAVKNPSVFK
GRFGFSKDQLMEAFRMSVSAIVAHKMRSLLTMLGIIIGITSVVSVVALGNGSQQKILENIRGIGTNTMTIFNGNGFGDRR
SRHIQNLKISDANTLSKQSYIQSVTPNTSSSGILVVGNKSFTSANLYGIGEQYFDVEGLKLKQGRLLTEDDVDQSNQVVV
LDESAKKAIFANENPLGKTVIFNKRPFRVIGVVSDQQLGGFPGNSLNLYSPYSTVLNKITGGSRIGSITVKISDDVNSTV
AEKSLTELLKSLHGKKDFFIMNSDTIKQTIENTTGTMKLLISSIAFISLIVGGIGVMNIMLVSVTERTKEIGVRMAIGAR
QINILQQFLIEAVLICLIGGVAGILLSVLIGVLFNSFITDFSMDFSTASIVTAVLFSTLIGVLFGYMPAKKAAELNPITA
LAQE
>Q0TJH0 7.6.2.-~~~macB~~~Macrolide export ATP-binding/permease protein MacB~~~
MTPLLELKDIRRSYPAGDEQVEVLKGISLDIYAGEMVAIVGASGSGKSTLMNILGCLDKATSGTYRVAGQDVATLDADAL
AQLRREHFGFIFQRYHLLSHLTAEQNVEVPAVYAGLERKQRLLRAQELLQRLGLEDRTEYYPAQLSGGQQQRVSIARALM
NGGQVILADEPTGALDSHSGEEVMAILHQLRDRGHTVIIVTHDPQVAAQAERVIEIRDGEIVRNPPAVEKVNATGGTEPV
VNTASGWRQFVSGFNEALTMAWRALAANKMRTLLTMLGIIIGIASVVSIVVVGDAAKQMVLADIRSIGTNTIDVYPGNDF
GDDDPQYQQALKYDDLIAIQKQPWVASATPAVSQNLRLRYNNVDVAASANGVSGDYFNVYGMTFSEGNTFNQEQLNGRAQ
VVVLDSNTRRQLFPHKADVVGEVILVGNMPARVIGVAEEKQSMFGSSKVLRVWLPYSTMSGRVMGQSWLNSITVRVKEGF
DSAEAEQQLTRLLSLRHGKKDFFTWNMDGVLKTVEKTTRTLQLFLTLVAVISLVVGGIGVMNIMLVSVTERTREIGIRMA
VGARASDVLQQFLIEAVLVCLVGGALGITLSLLIAFTLQLFLPGWEIGFSPLALLLAFLCSTVTGILFGWLPARNAARLD
PVDALARE
>P75831 7.6.2.-~~~macB~~~Macrolide export ATP-binding/permease protein MacB~~~COG0577
MTPLLELKDIRRSYPAGDEQVEVLKGISLDIYAGEMVAIVGASGSGKSTLMNILGCLDKATSGTYRVAGQDVATLDADAL
AQLRREHFGFIFQRYHLLSHLTAEQNVEVPAVYAGLERKQRLLRAQELLQRLGLEDRTEYYPAQLSGGQQQRVSIARALM
NGGQVILADEPTGALDSHSGEEVMAILHQLRDRGHTVIIVTHDPQVAAQAERVIEIRDGEIVRNPPAIEKVNVTGGTEPV
VNTVSGWRQFVSGFNEALTMAWRALAANKMRTLLTMLGIIIGIASVVSIVVVGDAAKQMVLADIRSIGTNTIDVYPGKDF
GDDDPQYQQALKYDDLIAIQKQPWVASATPAVSQNLRLRYNNVDVAASANGVSGDYFNVYGMTFSEGNTFNQEQLNGRAQ
VVVLDSNTRRQLFPHKADVVGEVILVGNMPARVIGVAEEKQSMFGSSKVLRVWLPYSTMSGRVMGQSWLNSITVRVKEGF
DSAEAEQQLTRLLSLRHGKKDFFTWNMDGVLKTVEKTTRTLQLFLTLVAVISLVVGGIGVMNIMLVSVTERTREIGIRMA
VGARASDVLQQFLIEAVLVCLVGGALGITLSLLIAFTLQLFLPGWEIGFSPLALLLAFLCSTVTGILFGWLPARNAARLD
PVDALARE
>Q5MK06 7.6.2.-~~~macB~~~Macrolide export ATP-binding/permease protein MacB~~~
MSLIECKNINRYFGSGENRVHILKDISLSIEKGDFVAIIGQSGSGKSTLMNILGCLDTAGSGSYRIDGIETAKMQPDELA
ALRRERFGFIFQRYNLLSSLTARDNVALPAVYMGMGGKERSARADKLLQDLGLASKEGNKPGELSGGQQQRVSIARALMN
GGEIIFADEPTGALDTASGKNVMEIIRRLHEAGHTVIMVTHDPGIAANANRVIEIRDGEIISDTSKNPEIPASNVGRIQE
KASWSFYYDQFVEAFRMSVQAVLAHKMRSLLTMLGIIIGIASVVSVVALGNGSQKKILEDISSMGTNTISIFPGRGFGDR
RSGKIKTLTIDDAKIIAKQSYVASATPMTSSGGTLTYRNTDLTASLYGVGEQYFDVRGLKLETGRLFDENDVKEDAQVVV
IDQNVKDKLFADSDPLGKTILFRKRPLTVIGVMKKDENAFGNSDVLMLWSPYTTVMHQITGESHTNSITVKIKDNANTRV
AEKGLAELLKARHGTEDFFMNNSDSIRQMVESTTGTMKLLISSIALISLVVGGIGVMNIMLVSVTERTKEIGIRMAIGAR
RGNILQQFLIEAVLICIIGGLVGVGLSAAVSLVFNHFVTDFPMDISAASVIGAVACSTGIGIAFGFMPANKAAKLNPIDA
LAQD
>P01549 ~~~~~~Macromomycin~~~
MLQNTSRFLARAGATVGVAAGLAFSLPADRDGAPGVTVTPATGLSNGQTVTVSATGLTPGTVYHVGQCAVVEPGVIGCDA
TTSTDVTADAAGKITAQLKVHSSFQAVVGANGTPWGTVNCKVVSCSAGLGSDSGEGAAQAITFA
>A0A0H2ZNG3 ~~~macP~~~Penicillin-binding protein 2a activator MacP~~~
MGKSLLTDEMIERANRGEKISGPPLLDDNEETKILPTSSSRFGYANPKDHGFSQETLKIQVEPSIHKSRRIENTKRNVFN
SKLNKILFAVIFLLILLVLAMKLL
>O06924 2.3.1.187~~~madA~~~Acetyl-S-ACP:malonate ACP transferase~~~
MQKEKVWDKLSTDTEERMNAANELFSDRKVVPSQNGVALLEAVIRAGDRINLEGNNQKQADFLAECLGSCDSEKINNLHM
VQSAVPLPIHLDIFDKGIAKKLDFAYGGPMAAKVAEFLREGKLELGAIHTYLELFARYFMDLTPRVSLICAYEGDKDGNL
YTGFNTEDTPVIAEATKFRQGIVIAQVNKLVDKVQRVDIPGEWVDAVIESPKPFYLEPLFTRDPANITDTQVLMGMMALK
GIYGEYGVQRLNHGIGFFTAAIELLLPTYGNELGLKGKICKHFALNPHPTMIPAIEDGWVESIHSFGGELGMQKYCEARP
DIFFIGPDGSMRSNRAYSQTAGHYATDMFIGGTLQIDKYGNSSTATASRVAGFGGAPNMGCDAKGRRHVTDSWLKCGAEF
EDQAALLGDMPRGKRLVVQMQETFKEKMDPSFVEKLDAWNLAKNANLDLAPVMIYSDDLTHIVTEEGIAYLAKCRGLEER
MAAIRGVAGYTEVGLSADPKETKTLRERGIVKTPEDLGIDRSRANRSMLAAKSVKDLVDCSGGLYEPPARFVNW
>O06923 7.2.4.1~~~madB~~~Carboxybiotin decarboxylase~~~
MEQLMSLFPAISTLFTQDPVISITRIALIIFGFFLSYFGFKRTLEPLIMVPMGLGMIAINAGVLFLEAGVVGTIHLDPLV
SEPSVLVNLMQVNWLQPVYNFTFSNGLIACIVFFGIGAMSDISFILIRPWASIIVALFAEMGTFATLIIGIKMGLLPNEA
AAVATIGGADGPMVLFASLILAKDLFVPIAIIAYLYLSLTYAGYPYLIKLLVPKKYRGLEVEMDFPEVSQRSKFVFSVLA
CMLLCLLLPVASPLILSFFLGIAIKEAQIEPFQNLLETTLTYGSTLFLGLLLGALCEAKTILDPKISLIVVLGITALLIS
GIGGVLGGWIVYWFSKGKFNPVIGIAGVSCLPTTAKIAQKTVTEENPYAVILPLAMGAGVCGLIVSAIATGVFISTLFLL
N
>O06926 2.1.3.10~~~madC~~~Malonyl-S-ACP:biotin-protein carboxyltransferase MADC~~~
MAKWTELQDKSFLEATARERAVGIVDEGTFTEFCGPFDKIYSPHLPLMGEAIEYDDGLVAGVGKIGKKPIFVISQEGRFI
GGSIGEVSGAKMVKTIQLASDLYEEMVSEKPDLPEEMRPAVVISFETGGVRLHEANAGLLAHAEVMDQIQNCRGRVPIIS
LIGSRVGCFGGMGFVAAATDVIIMSQFGRLGLTGPEVIEQEMGKDEFDASDRALVYRTTGGKHKYIIGDCNYLAADSIRS
FRETTTAVLQKPMEEIETFRRIGSMEKIKEQIELVKLSVSLKPKDSMDVWAHAGNENPESLINMTLDEFLAQAKRLKA
>O06927 2.1.3.10~~~madD~~~Malonyl-S-ACP:biotin-protein carboxyltransferase MADD~~~
MEIMMGQGRLAIEKIVDPESFKENTIGESSFEDNEVGPGAVVGTAQIGDQDCTIIASDAMAMNERFPVVYAGIIGLEEGY
KMAMAVYKTIEADKEKKGTEKRPILLIVDTPGNGPGKQEEIFGMNKSTGAYQLALAEARKAGHPIVAMVIGRAISGAFLC
HGLQADRILSLSSKFETMIHVMPLTSVSVITKLDIERLEELSKTNPVFAAGPDFFYQLGGVEELVEEVDGMRSCILKHIA
EIREMKAAGEEARLGPWGRGALGEQRGGRMIRGKVMAMMDKQFFAFAEQNLY
>O06930 4.1.1.-~~~madF~~~Biotin carrier protein MADF~~~
MEIKSKMPGSIIEVKVSVGDNLEAGSLILIMEALKMKQEIRSQEGGVVKELKVNTGDRVSPGQVLAIIE
>O06928 6.2.1.35~~~madH~~~ACP-SH:acetate ligase~~~
MAEQLKELAEMVESFGTAPTMGEMPCRTLATKGINGPTAAHVIEEIHTPFNLAYVTFTTGSTAFQNVVGVTHSEIDGRVR
ASLAAFDMANVERHGKFLVTYAPLVNVFSAEALKIHGLDWFFLQRSSRDAFLLSLCQEKPNVLIGESTFIRSALEDASVL
GLSHSIPQGVIAFTAGTPLDLDLLQVAEKHNWKIHDLYGCQEFGWLTLDGVPLRADITLIPSPKGSDFREFVVGGLPMAD
SFPYAESGHVCNPEGKIITYRRARTNPEYEVIVRETKLSSKETTERVARTILRIKGRVVKVDPALKVSSTKTVLDLVPSV
SAEGKSTSESYRIEGDDKTFLFETLIEAQLALQQTAKTDQVWKKTR
>E1V8J1 1.1.1.40~~~maeB~~~NADP-dependent malic enzyme~~~COG0281
MTDAKRQAALDYHAKPIPGKLSVELTKPTATARDLALAYSPGVAEPVREIARDPENAYLYTGKGNLVAVISDGSAILGLG
NLGPLASKPVMEGKGVLFKRFAGINSIDVEVDAESPQAFIDTVARIADSWGGINLEDIKAPECFEIERALVEQCNIPVFH
DDQHGTAIVTAAGMLNALDIAGKSLESARIVCLGAGAAAIACMKLLVACGARSENLVMLDRKGVIHSGREDLNQYKAMFA
IDTDKRTLADAIEGADVFVGLSGPGLMTEEHIRRMADNPVVFACTNPDPEIHPDLARETRPDVIMATGRSDYPNQVNNVL
GFPFIFRGALDVRATRINEDMKVAAVHALKDLAREPVPQAVLEAYDKDAMSFGRDYIIPTPIDVRLLERVSSAVAQAAVD
SGVARRPYPAHYPLKTVDDVYG
>Q9JS44 ~~~mafA1~~~Adhesin MafA~~~
MKTLLLLIPLVLTACGTLTGIPAHGGGKRFAVEQELVAASSRAAVKEMDLSALKGRKAALYVSVMGDQGSGNISGGRYSI
DALIRGGYHNNPESATQYSYPAYDTTATTKSDALSSVTTSTSLLNAPAAALTKNSGRKGERSAGLSVNGTGDYRNETLLA
NPRDVSFLTNLIQTVFYLRGIEVVPPEYADTDVFVTVDVFGTVRSRTELHLYNAETLKAQTKLEYFAVDRDSRKLLITPK
TAAYESQYQEQYALWTGPYKVSKTVKASDRLMVDFSDITPYGDTTAQNRPDFKQNNGKKPDVGNEVIRRRKGG
>Q2W031 ~~~magA~~~Iron transporter MagA~~~
MELHHPELTYAAIVALAAVLCGGMMTRLKQPAVVGYILAGVVLGPSGFGLVSNRDAVATLAEFGVLMLLFVIGMKLDIIR
FLEVWKTAIFTTVLQIAGSVGTALLLRHGLGWSLGLAVVLGCAVAVSSTAVVIKVLESSDELDTPVGRTTLGILIAQDMA
VVPMMLVLESFETKALLPADMARVVLSVLFLVLLFWWLSKRRIDLPITARLSRDSDLATLSTLAWCFGTAAISGVLDLSP
AYGAFLGGVVLGNSAQRDMLLKRAQPIGSVLLMVFFLSIGLLLDFKFIWKNLGTVLTLLAMVTLFKTALNVTALRLARQD
WPSAFLAGVALAQIGEFSFLLAETGKAVKLISAQETKLVVAVTVLSLVLSPFWLFTMRRMHRVAAVHVHSFRDLVTRLYG
DEARAFARTARRARVLVRRGSWRDDPNAGPGSGI
>O24766 5.2.1.1~~~maiA~~~Maleate isomerase~~~COG3473
MKTYRIGQIVPSSNTTMETEIPAMLQARYAEFPEERFTFHSSRMRMMHVNPEELKAMDIASDRCAVELSDARMSVMAYAC
LVAIMAQGDGYHRVSQARLQNTVKENGVEIPVLSSAGALVDTLKEFGYKKVSIITPYMKPLTKRVADYIEAEGIEVQDSI
SLEVSDNLEVGLLNPENLLEHVKRLNHDGVDAVILSACVQMPSLPAIQRAQDQIGKPVLSAAVWTVYQMLKNLGLETRVP
NAGHILSGVKPQA
>Q5YXQ1 5.2.1.1~~~maiA~~~Maleate isomerase~~~COG3473
MGIRRIGLVVPSSNVTVETEMPALLSRHPGAEFSFHSTRMRMHTVSPEGLAAMNAQRERCVLEIADAAPEVILYACLVAV
MVGGPGEHHRVESAVAEQLATGGSQALVRSSAGALVEGLRALDAQRVALVTPYMRPLAEKVVAYLEAEGFTISDWRALEV
ADNTEVGCIPGEQVMAAARSLDLSEVDALVISCCVQMPSLPLVETAEREFGIPVLSAATAGAYSILRSLDLPVAVPGAGR
LLRQDSAVTAS
>Q88FY4 5.2.1.1~~~maiA~~~Maleate isomerase~~~COG3473
MTQLYRIGQIVPSSNTTMETEIPAMLNARQAIRPERFTFHSSRMRMKQVKKEELAAMDAESDRCAVELSDAKVDVLGYAC
LVAIMAMGLGYHRQSEKRLQQATADNDALAPVITSAGALVEALHVMKAKRIAIVAPYMKPLTELVVNYIREEGFEVQDWR
ALEIPDNLAVARHDPANLPGIVAGMDLEGVDVVVLSACVQMQSLPAVAKVEAQTGKPVVTAAIATTYAMLKALDLEPIVP
GAGALLSGAY
>Q9HI36 ~~~hcpA~~~Major exported protein~~~
MATPAYMSITGTKQGLITAGAFTEDSVGNTYQEGHEDQVMVQGFNHEVIIPRDPQSGQPTGQRVHKPVVITKVFDKASPL
LLAALTSGERLTKVEIQWYRTSAAGTQEHYYTTVLEDAIIVDIKDYMHNCQDPGNAHFTHLEDVHFTYRKITWTHEVSGT
SGSDDWRSPVAG
>O07177 2.7.1.175~~~mak~~~Maltokinase~~~COG3281
MTRSDTLATKLPWSDWLSRQRWYAGRNRELATVKPGVVVALRHNLDLVLVDVTYTDGATERYQVLVGWDFEPASEYGTKA
AIGVADDRTGFDALYDVAGPQFLLSLIVSSAVCGTSTGEVTFTREPDVELPFAAQPRVCDAEQSNTSVIFDRRAILKVFR
RVSSGINPDIELNRVLTRAGNPHVARLLGAYQFGRPNRSPTDALAYALGMVTEYEANAAEGWAMATASVRDLFAEGDLYA
HEVGGDFAGESYRLGEAVASVHATLADSLGTAQATFPVDRMLARLSSTVAVVPELREYAPTIEQQFQKLAAEAITVQRVH
GDLHLGQVLRTPESWLLIDFEGEPGQPLDERRAPDSPLRDVAGVLRSFEYAAYGPLVDQATDKQLAARAREWVERNRAAF
CDGYAVASGIDPRDSALLLGAYELDKAVYETGYETRHRPGWLPIPLRSIARLTAS
>Q7WUM3 2.7.1.175~~~mak1~~~Maltokinase~~~
MTLPFAEWLPKQRWYAGRSRVLASVKEASATPLGEELDLVLVDVEYTDGSSERYQVMVGWGDGPLPEYSTIASIGTADDG
RDGYDALYDPRATRHLLGLVDTSATAGDVTFEKEPGVELPLEAWPRVFDAEQSNTSVIFDEDAILKLFRRVTCGVNPDIE
LNRVLGRAGNPHVARLLGSLQSADDSGPCSLGMVTEYAANSAEGWAMATASARDLFADAEMRADEVGGDFQGESYRLGEA
VASVHRTLAEELGTGPAPFPLDAVLARVRTAAAAVPELQQFVPAITARFEALTGAEVVVQRVHGDLHLGQVLRTPEAWLL
IDFEGEPGQPLDERRMPDSPLRDVAGVLRSYEYAAYQLLVDQDDDEHLAARAREWVDRNRAAFCDGYTNVAGADPREQGA
LLSAYELDKAVYEAAYEARHRPGWLRIPLRSITRLVG
>P23917 2.7.1.4~~~mak~~~Fructokinase~~~COG1940
MRIGIDLGGTKTEVIALGDAGEQLYRHRLPTPRDDYRQTIETIATLVDMAEQATGQRGTVGMGIPGSISPYTGVVKNANS
TWLNGQPFDKDLSARLQREVRLANDANCLAVSEAVDGAAAGAQTVFAVIIGTGCGAGVAFNGRAHIGGNGTAGEWGHNPL
PWMDEDELRYREEVPCYCGKQGCIETFISGTGFAMDYRRLSGHALKGSEIIRLVEESDPVAELALRRYELRLAKSLAHVV
NILDPDVIVLGGGMSNVDRLYQTVGQLIKQFVFGGECETPVRKAKHGDSSGVRGAAWLWPQE
>Q7U2S7 2.7.1.175~~~mak~~~Maltokinase~~~
MTRSDTLATKLPWSDWLPRQRWYAGRNRELATVKPGVVVALRHNLDLVLVDVTYTDGATERYQVLVGWDFEPASEYGTKA
AIGVADDRTGFDALYDVAGPQFLLSLIVSSAVCGTSTGEVTFTREPDVELPFAAQPRVCDAEQSNTSVIFDRRAILKVFR
RVSSGINPDIELNRVLTRAGNPHVARLLGAYQFGRPNRSPTDALAYALGMVTEYEANAAEGWAMATASVRDLFAEGDLYA
HEVGGDFAGESYRLGEAVASVHATLADSLGTAQATFPVDRMLARLSSTVAVVPELREYAPTIEQQFQKLAAEAITVQRVH
GDLHLGQVLRTPESWLLIDFEGEPGQPLDERRAPDSPLRDVAGVLRSFEYAAYGPLVDQATDKQLAARAREWVERNRAAF
CDGYAVASGIDPRDSALLLGAYELDKAVYETGYETRHRPGWLPIPLRSIARLTAS
>A0R6D9 2.7.1.175~~~mak~~~Maltokinase~~~COG3281
MSVEFEDWLTQQRWYAGRNRELVSATTAMAVRLRDGLELVLLQANYADGPDERYQVIVATGSGPIDEYSVVATIGIADGQ
TAYDALYDPDATRYLLSLIDESATVQNVRFVREPDVELPLDAPPRVFGAEQSNTSVVFGEDAIFKLFRRITPGVHPDIEL
NRVLARAGNPHVARLLGSFETEWEGEPYALGMVTEFAANSAEGWDMATTSTRDLFAEGDLYAEEVGGDFAGEAYRLGEAV
ASVHACLAHELGTEEVPFPADVMAQRLAAAVDAVPELREHVPQIEERYHKLADTTMTVQRVHGDLHLGQVLRTPKGWLLI
DFEGEPGQPLDERRRPDTPVRDVAGILRSFEYAAHQRLVDQAGDDDDRARQLAARAREWVTRNCASFCDGYAAEAGTDPR
DSADLLAAYELDKAVYEAAYEARHRPSWLPIPLGSIARLLE
>A1TH50 2.7.1.175~~~mak~~~Maltokinase~~~COG3281
MTLAFGDWIVHRRWYAGRSRELVSAEPAVVTPLRDDLDHILLDVTYTDGTVERYQLVVRWADSPVAGFGEAATIGTALGP
QGERIAYDALFDPDAARHLLRLVDASATVADLRFTREPGATLPLYAPPKVSSAEQSNTSVIFGKDAMLKVFRRVTPGINP
DIELNRVLAQAGNRHVARLLGSFETSWAGPGTDRCALGMVTAFAANSAEGWDMATASAREMFADVVGSDFADESYRLGNA
VASVHATLAEALGTSTEPFPVDTVLARLQSAARSAPELAGRAAAVEERYRRLDGRAITVQRVHGDLHLGQVLRTPDDWLL
IDFEGEPGQPLDERRRPDSPLRDVAGVLRSFEYAAYQKLVELAPEQDADGRLADRARNWVDRNSAAFCAGYAAVAGDDPR
RDGDVLAAYELDKAVYEAAYEARFRPSWLPIPMRSIDRILG
>Q03PA4 5.1.3.21~~~~~~Maltose epimerase~~~COG2017
MEITKSAAGTLNQQDVSKYVLTNQQGTQVAVLTWGATLQEFSVVEDGKRHSLIVNKPDLAGYDHNPYYLCQALGRVAGRI
AGAQFELDGQTVHLEANEEPNASHGGPHGFTFVNWDATTNQTADTASVVLTHTSTPADDRYPGNLETTITYTLTEENRLD
ITFDAQSDAATLFNPTIHTYFNVTDDQHDLDQQWVKLSGDKRLVLDQAKIPTGEMVPTAGTGYDFSQPRTVKDGLDQLHQ
TGQVEYDDAFVVEPSKDTPIATIGDTTGHREVSIYSDRNGLVVFTANPTDDARADVRDYNALAMEAQTLPDAIHHADFGD
VVLPANQPVEHTISYQYTRK
>P0AEY0 ~~~malE~~~Maltose/maltodextrin-binding periplasmic protein~~~COG2182
MKIKTGARILALSALTTMMFSASALAKIEEGKLVIWINGDKGYNGLAEVGKKFEKDTGIKVTVEHPDKLEEKFPQVAATG
DGPDIIFWAHDRFGGYAQSGLLAEITPDKAFQDKLYPFTWDAVRYNGKLIAYPIAVEALSLIYNKDLLPNPPKTWEEIPA
LDKELKAKGKSALMFNLQEPYFTWPLIAADGGYAFKYENGKYDIKDVGVDNAGAKAGLTFLVDLIKNKHMNADTDYSIAE
AAFNKGETAMTINGPWAWSNIDTSKVNYGVTVLPTFKGQPSKPFVGVLSAGINAASPNKELAKEFLENYLLTDEGLEAVN
KDKPLGAVALKSYEEELAKDPRIAATMENAQKGEIMPNIPQMSAFWYAVRTAVINAASGRQTVDEALKDAQTRITK
>P0AEX9 ~~~malE~~~Maltose/maltodextrin-binding periplasmic protein~~~COG2182
MKIKTGARILALSALTTMMFSASALAKIEEGKLVIWINGDKGYNGLAEVGKKFEKDTGIKVTVEHPDKLEEKFPQVAATG
DGPDIIFWAHDRFGGYAQSGLLAEITPDKAFQDKLYPFTWDAVRYNGKLIAYPIAVEALSLIYNKDLLPNPPKTWEEIPA
LDKELKAKGKSALMFNLQEPYFTWPLIAADGGYAFKYENGKYDIKDVGVDNAGAKAGLTFLVDLIKNKHMNADTDYSIAE
AAFNKGETAMTINGPWAWSNIDTSKVNYGVTVLPTFKGQPSKPFVGVLSAGINAASPNKELAKEFLENYLLTDEGLEAVN
KDKPLGAVALKSYEEELAKDPRIAATMENAQKGEIMPNIPQMSAFWYAVRTAVINAASGRQTVDEALKDAQTRITK
>P19576 ~~~malE~~~Maltose/maltodextrin-binding periplasmic protein~~~
MKIKTGVGILALSALTTMMISAPALAKIEEGKLVIWINGDKGYNGLAEVGKKFEQDTGIKVTVEHPDKLEEKFPQVAATG
DGPDIIFWAHDRFGGYAQSGLLAEVTPDKAFQDKLYPFTWDAVRYNGKLIAYPIAVEALSLIYNKDLVPNPPKTWEEIPA
LDKELKVKGKSAIMFNLQEPYFTWPLIAADGGYAFKFENGKYDVKDVGVDNAGAKAGLTFLIDMIKNKNMSADTDYSIAE
AAFNKGETAMTINGPWAWSNIDKSKVNYGVTLLPTFKGKPSKPFVGVLSAGINAASPNKELAKEFLENYLLTDQGLEAVN
KDKPLGAVALKSFQEQLAKDPRIAATMDNAQKGEIMPNIPQMSAFWYAVRTAVINAASGRQTVDAALKDAQSRITK
>P02916 ~~~malF~~~Maltose/maltodextrin transport system permease protein MalF~~~COG1175
MDVIKKKHWWQSDALKWSVLGLLGLLVGYLVVLMYAQGEYLFAITTLILSSAGLYIFANRKAYAWRYVYPGMAGMGLFVL
FPLVCTIAIAFTNYSSTNQLTFERAQEVLLDRSWQAGKTYNFGLYPAGDEWQLALSDGETGKNYLSDAFKFGGEQKLQLK
ETTAQPEGERANLRVITQNRQALSDITAILPDGNKVMMSSLRQFSGTQPLYTLDGDGTLTNNQSGVKYRPNNQIGFYQSI
TADGNWGDEKLSPGYTVTTGWKNFTRVFTDEGIQKPFLAIFVWTVVFSLITVFLTVAVGMVLACLVQWEALRGKAVYRVL
LILPYAVPSFISILIFKGLFNQSFGEINMMLSALFGVKPAWFSDPTTARTMLIIVNTWLGYPYMMILCMGLLKAIPDDLY
EASAMDGAGPFQNFFKITLPLLIKPLTPLMIASFAFNFNNFVLIQLLTNGGPDRLGTTTPAGYTDLLVNYTYRIAFEGGG
GQDFGLAAAIATLIFLLVGALAIVNLKATRMKFD
>P68183 ~~~malG~~~Maltose/maltodextrin transport system permease protein MalG~~~COG3833
MAMVQPKSQKARLFITHLLLLLFIAAIMFPLLMVVAISLRQGNFATGSLIPEQISWDHWKLALGFSVEQADGRITPPPFP
VLLWLWNSVKVAGISAIGIVALSTTCAYAFARMRFPGKATLLKGMLIFQMFPAVLSLVALYALFDRLGEYIPFIGLNTHG
GVIFAYLGGIALHVWTIKGYFETIDSSLEEAAALDGATPWQAFRLVLLPLSVPILAVVFILSFIAAITEVPVASLLLRDV
NSYTLAVGMQQYLNPQNYLWGDFAAAAVMSALPITIVFLLAQRWLVNGLTAGGVKG
>Q97LM4 3.2.1.122~~~malH~~~Maltose-6'-phosphate glucosidase MalH~~~COG1486
MKKFSVVIAGGGSTFTPGIVLMLLDNMDKFPIRKLKFYDNDKERQAIVAGACEIILKEKAPEIEFLATTNPKEAFTDVDF
VMAHIRVGKYAMRELDEKIPLKYGVVGQETCGPGGIAYGMRSIGGVIEILDYMEKYSPNAWMLNYSNPAAIVAEATRKLR
PNSKILNICDMPIGIETRMAEILGLESRKEMTVKYYGLNHFGWWSDIRDKDGNDLMPKLKEHVKKYGYVAENGDTQHTDA
SWNDTFAKAKDVYAVDPSTLPNTYLKYYLFPDYVVEHSNKEYTRANEVMDGREKFVFGECKKVIENQSTKGCKMEIDEHA
SYIVDLARAISYNTHERMLLIVPNNGSIENFDSTGMVEIPCIVGSNGPEPLTMGKIPQFQKGLMEQQVSVEKLVVEAWKE
KSYQKLWQAITLSRTVPSAKVAKQILDELIEVNKDYWPELN
>O06901 3.2.1.122~~~malH~~~Maltose-6'-phosphate glucosidase~~~
MKQFSILIAGGGSTFTPGIILMLLDNLDKFPIRQIKMFDNDAERQAKIGEACAILLKEKAPQIKFSYSTNPEEAFTDIDF
VMAHIRVGKYPMRELDEKIPLRHGVVGQETCGPGGIAYGMRSIGGVIGLIDYMEKYSPNAWMLNYSNPAAIVAEATRRLR
PNSKVLNICDMPIGIEVRMAEILGLESRKDMDIMYYGLNHFGWWKSVRDKQGNDLMPKLREHVSQYGYVVPKGDNQHTEA
SWNDTFAKAKDVLALDPTTLPNTYLKYYLFPDYVVEHSNKEYTRANEVMDGREKFVFGECEKVVKNQSSEGCALHIDEHA
SYIVDLARAIAFNTKEKMLLIVENNGAIVNFDSTAMVEIPCIVGSNGPEPLVVGRIPQFQKGMMEQQVTVEKLTVEAWIE
GSYQKLWQAITMSKTVPSAKVAKDILDDLIEANKEYWPVLK
>P18811 ~~~malI~~~Maltose regulon regulatory protein MalI~~~COG1609
MATAKKITIHDVALAAGVSVSTVSLVLSGKGRISTATGERVNAAIEELGFVRNRQASALRGGQSGVIGLIVRDLSAPFYA
ELTAGLTEALEAQGRMVFLLHGGKDGEQLAQRFSLLLNQGVDGVVIAGAAGSSDDLRRMAEEKAIPVIFASRASYLDDVD
TVRPDNMQAAQLLTEHLIRNGHQRIAWLGGQSSSLTRAERVGGYCATLLKFGLPFHSDWVLECTSSQKQAAEAITALLRH
NPTISAVVCYNETIAMGAWFGLLKAGRQSGESGVDRYFEQQVSLAAFTDATPTTLDDIPVTWASTPARELGITLADRMMQ
KITHEETHSRNLIIPARLIAAK
>P68187 7.5.2.1~~~malK~~~Maltose/maltodextrin import ATP-binding protein MalK~~~COG3842
MASVQLQNVTKAWGEVVVSKDINLDIHEGEFVVFVGPSGCGKSTLLRMIAGLETITSGDLFIGEKRMNDTPPAERGVGMV
FQSYALYPHLSVAENMSFGLKLAGAKKEVINQRVNQVAEVLQLAHLLDRKPKALSGGQRQRVAIGRTLVAEPSVFLLDEP
LSNLDAALRVQMRIEISRLHKRLGRTMIYVTHDQVEAMTLADKIVVLDAGRVAQVGKPLELYHYPADRFVAGFIGSPKMN
FLPVKVTATAIDQVQVELPMPNRQQVWLPVESRDVQVGANMSLGIRPEHLLPSDIADVILEGEVQVVEQLGNETQIHIQI
PSIRQNLVYRQNDVVLVEEGATFAIGLPPERCHLFREDGTACRRLHKEPGV
>Q1R3Q1 7.5.2.1~~~malK~~~Maltose/maltodextrin import ATP-binding protein MalK~~~
MASVQLQNVTKAWGEVVVSKDINLDIHEGEFVVFVGPSGCGKSTLLRMIAGLETITSGDLFIGEKRMNDTPPAERGVGMV
FQSYALYPHLSVAENMSFGLKLAGAKKEVINQRVNQVAEVLQLAHLLDRKPKALSGGQRQRVAIGRTLVAEPSVFLLDEP
LSNLDAALRVQMRIEISRLHKRLGRTMIYVTHDQVEAMTLADKIVVLDAGRVAQVGKPLELYHYPADRFVAGFIGSPKMN
FLPVKVTATAIDQVQVELPMPNRQQVWLPVESRDVQVGANMSLGIRPEHLLPSDIADVILEGEVQVVEQLGNETQIHIQI
PSIRQNLVYRQNDVVLVEEGATFAIGLPPERCHLFREDGTACRRLHKEPGV
>P19566 7.5.2.1~~~malK~~~Maltose/maltodextrin import ATP-binding protein MalK~~~
MASVQLRNVTKAWGDVVVSKDINLDIHDGEFVVFVGPSGCGKSTLLRMIAGLETITSGDLFIGETRMNDIPPAERGVGMV
FQSYALYPHLSVAENMSFGLKLAGAKKEVMNQRVNQVAEVLQLAHLLERKPKALSGGQRQRVAIGRTLVAEPRVFLLDEP
LSNLDAALRVQMRIEISRLHKRLGRTMIYVTHDQVEAMTLADKIVVLDAGRVAQVGKPLELYHYPADRFVAGFIGSPKMN
FLPVKVTATAIEQVQVELPNRQQIWLPVESRGVQVGANMSLGIRPEHLLPSDIADVTLEGEVQVVEQLGHETQIHIQIPA
IRQNLVYRQNDVVLVEEGATFAIGLPPERCHLFREDGSACRRLHQEPGV
>E6ENP7 2.4.1.8~~~malP~~~Maltose phosphorylase~~~
MQSVAFMIDLSIKSLRGILMKQIKRLFQIDPWKIRTTHLDKENLRLQESLTSIGNGYMGMRGNFEEHYSGDHHQGTYLAG
VWYPDKTRVGWWKNGYPEYFGKVINAINFIAMDLQIDGQTIDLATTPYEDFSLELDMQNGVLSRQFTIQTPKNKVRFSFE
RFLSLEKKEAAYIHLTIEMLEGTGTITLHSKLDGDVQNEDSNYEEHFWEEIAIETQETLGFVTTKTIPNNFEIERFTVTA
GMRHFIDGASVVPTYTQQPLALTAELTVSLNEGETTAITKEVLVVTSRDVPETQQITRVNELFSEMTTLYPEAKAGQAAA
WAKRWQLADVVIEGDDEAQQGIRFNLFQLFSTYYGEDDRLNIGPKGFTGEKYGGATYWDTEAYAVPLYLALAKPEVTKNL
LKYRHNQLPQAIHNAQQQGLKGALYPMVTFTGVECHNEWEITFEEIHRNGAIAYAIYNYVNYTGDQDYLKDAGLEVLVAI
ARFWADRVHFSQRHKQYMIHGVTGPNEYENNINNNWYTNTIAAWVLRYTRESYLKFQEETTLKIADDELAKWADIVENMY
FPVDNELGIFVQHDTFLDKDLMPVSDLPLSELPLNQHWSWDKILRSCFIKQADVLQGIYFFNDAFSLEEKRRNFNFYEPM
TVHESSLSPSIHAVLAAELGMEEKAVEMYQRTARLDLDNYNNDTEDGLHITSMTGSWLAIVQGFAQMKTDHQQLKFAPFL
PATWTAYSFHINYRNRLLFVEVAADQVAFTLLDGPAIPLTVYDQKYTLKDRLVLPIRKEEVHV
>Q9PRA0 ~~~~~~Membrane-associated lipoprotein~~~
MKKNKLTTLALILPITILTPIVIASCTNKTKVKKSSSLDKIASNLKLEYFNNKANTKASSVQKDEIKKPLNLPNDVVFSV
KDVFVSHKDQSVLIVKYTLKKGNEIQEYTYEIKGFKSVYEKDKIVNDLSQANEDFKKIVNNIRLKDTFDFKLAAFPNQNY
DQLLPSQIYKNYYQGIEIQQHKYQNELDIKIINFLYPDGDFGSANKNGTLKLSLMLTDKKNNQVYYKLLEVSGFKSNPYG
VDENGTIPGIGTERLKPKNQDDYFSKTQLQRYEIDNEGYLQILKRQNNDKNWKELRPDLNATVSDIKHFDEKAKNVGQDS
YESAAYKGFTLPVYESDGKISGLALAGKDTPKGPSWVDAIGRNQWQIGGLPRTLPNEKYRQEAMQTFSLGILNNDSHKNN
TYNKTAGTTWILDYQKTSDNKYPTKWYFATNLHVADAINENTLSINLMRLMDSAQIKTTFRLSNLDENIYNFGFRSKEHG
KNLLNHGLKKIFDGRDFLKTKPAEYLINSQKEKYKDVGNFTDFAVFELDFEKLELVNVWKNFLGENNGLVTKYNNYNPQE
LAKVITSNYANNKNNQIKFLSKSYLSDYSKIDVPLKYRQEDAKTWFKKYDELFALGWPNSTEDFFFKAYVDDDQLKYRTR
DNFSLWTNSDYRFFNNLTQQEGGQPAFPPERTERGNYLSYAIGFRSFIQKPGIVDAFIAVPQIGNNLYTSSDNKKYINMG
LEYLPKHFAPAGGASGTSVRNQKNELVAIYHAKYDSSKTGLAAAFRSEGYDYQGLYGNYNLPQYDLIYGGGKDQTEKKSY
REAMKDIYQNNNIKTALFPDGFDKIPDEFKFNNN
>O66937 2.4.1.25~~~malQ~~~4-alpha-glucanotransferase~~~COG1640
MRLAGILLHVTSLPSPYGIGDLGKEAYRFLDFLKECGFSLWQVLPLNPTSLEAGNSPYSSNSLFAGNYVLIDPEELLEED
LIKERDLKRFPLGEALYEVVYEYKKELLEKAFKNFRRFELLEDFLKEHSYWLRDYALYMAIKEEEGKEWYEWDEELKRRE
KEALKRVLNKLKGRFYFHVFVQFVFFKQWEKLRRYARERGISIVGDLPMYPSYSSADVWTNPELFKLDGDLKPLFVAGVP
PDFFSKTGQLWGNPVYNWEEHEKEGFRWWIRRVHHNLKLFDFLRLDHFRGFEAYWEVPYGEETAVNGRWVKAPGKTLFKK
LLSYFPKNPFIAEDLGFITDEVRYLRETFKIPGSRVIEFAFYDKESEHLPHNVEENNVYYTSTHDLPPIRGWFENLGEES
RKRLFEYLGREIKEEKVNEELIRLVLISRAKFAIIQMQDLLNLGNEARMNYPGRPFGNWRWRIKEDYTQKKEFIKKLLGI
YGREV
>Q59266 2.4.1.25~~~malQ~~~4-alpha-glucanotransferase~~~
MHISSLPGKYGIGTFGRSAYEFCDFLEKAGQKYWQILPLGQTSYGDSPYQSFSAFAGNPYFIDLDILNEKNLLDKDDYEE
KNFGDNKEMINYGLIFNEKMKVLRKAYMNFNSKDDESFAKFIEDEKKWLDDYSLFMALKYKFNFISWNSWNKDIKLRKNE
EIEKYKDELKEDVNYWKFLQYEFFSQWKNLKDYANKKNIKIIGDIPIYIAQDSSDVWSNPDIFLLNKETLEPLKVSGCPP
DAFSETGQLWGNPIYDWGYLEKTNFEWWVDRIKSSLKLYDILRIDHFRGFEAYWSVDYGEKTAQNGKWIKGPEMKLFNVI
KEKIGDIEIIAEDLGYLTEETLEFKKRTGFPGMKIIQFAFGGDSSNPYLPHNYEKNCVAYTGTHDNDTVRGWFEVTGSKE
EKEKAVEYFKLTEEEGYNWGVIRGVWSSVANTSIGVMQDFLNLGNEARINKPSTLASNWSWRAKDNVFTNELANKIYRLT
RIYGRCE
>P15977 2.4.1.25~~~malQ~~~4-alpha-glucanotransferase~~~COG1640
MESKRLDNAALAAGISPNYINAHGKPQSISAETKRRLLDAMHQRTATKVAVTPVPNVMVYTSGKKMPMVVEGSGEYSWLL
TTEEGTQYKGHVTGGKAFNLPTKLPEGYHTLTLTQDDQRAHCRVIVAPKRCYEPQALLNKQKLWGACVQLYTLRSEKNWG
IGDFGDLKAMLVDVAKRGGSFIGLNPIHALYPANPESASPYSPSSRRWLNVIYIDVNAVEDFHLSEEAQAWWQLPTTQQT
LQQARDADWVDYSTVTALKMTALRMAWKGFAQRDDEQMAAFRQFVAEQGDSLFWQAAFDALHAQQVKEDEMRWGWPAWPE
MYQNVDSPEVRQFCEEHRDDVDFYLWLQWLAYSQFAACWEISQGYEMPIGLYRDLAVGVAEGGAETWCDRELYCLKASVG
APPDILGPLGQNWGLPPMDPHIITARAYEPFIELLRANMQNCGALRIDHVMSMLRLWWIPYGETADQGAYVHYPVDDLLS
ILALESKRHRCMVIGEDLGTVPVEIVGKLRSSGVYSYKVLYFENDHEKTFRAPKAYPEQSMAVAATHDLPTLRGYWECGD
LTLGKTLGLYPDEVVLRGLYQDRELAKQGLLDALHKYGCLPKRAGHKASLMSMTPTLNRGLQRYIADSNSALLGLQPEDW
LDMAEPVNIPGTSYQYKNWRRKLSATLESMFADDGVNKLLKDLDRRRRAAAKKK
>P9WK23 2.4.1.25~~~malQ~~~4-alpha-glucanotransferase~~~COG1640
MTELAPSLVELARRFGIATEYTDWTGRQVLVSEATLVAALAALGVPAQTEQQRNDALAAQLRSYWARPLPATIVMRAGEQ
TQFRVHVTDGAPADVWLQLEDGTTRAEVVQVDNFTPPFDLDGRWIGEASFVLPADLPLGYHRVNLRSGDSQASAAVVVTP
DWLGLPDKLAGRRAWGLAVQLYSVRSRQSWGIGDLTDLANLALWSASAHGAGYVLVNPLHAATLPGPAGRSKPIEPSPYL
PTSRRFVNPLYLRVEAIPELVDLPKRGRVQRLRTNVQQHADQLDTIDRDSAWAAKRAALKLVHRVPRSAGRELAYAAFRT
REGRALDDFATWCALAETYGDDWHRWPKSLRHPDASGVADFVDKHADAVDFHRWLQWQLDEQLASAQSQALRAGMSLGIM
ADLAVGVHPNGADAWALQDVLAQGVTAGAPPDEFNQLGQDWSQPPWRPDRLAEQEYRPFRALIQAALRHAGAVRIDHIIG
LFRLWWIPDGAPPTQGTYVRYDHDAMIGIVALEAHRAGAVVVGEDLGTVEPWVRDYLLLRGLLGTSILWFEQDRDCGPAG
TPLPAERWREYCLSSVTTHDLPPTAGYLAGDQVRLRESLGLLTNPVEAELESARADRAAWMAELRRVGLLADGAEPDSEE
AVLALYRYLGRTPSRLLAVALTDAVGDRRTQNQPGTTDEYPNWRVPLTGPDGQPMLLEDIFTDRRAATLAEAVRAATTSP
MSCW
>O87172 2.4.1.25~~~malQ~~~4-alpha-glucanotransferase~~~
MELPRAFGLLLHPTSLPGPYGVGVLGREARDFLRFLKEAGGRYWQVLPLGPTGYGDSPYQSFSAFAGNPYLIDLRPLAER
GYVRLEDPGFPQGRVDYGLLYAWKWPALKEAFRGFKEKASPEEREAFAAFREREAWWLEDYALFMALKGAHGGLPWNRWP
LPLRKREEKALREAKSALAEEVAFHAFTQWLFFRQWGALKAEAEALGIRIIGDMPIFVAEDSAEVWAHPEWFHLDEEGRP
TVVAGVPPDYFSETGQRWGNPLYRWDVLEREGFSFWIRRLEKALELFHLVRIDHFRGFEAYWEIPASCPTAVEGRWVKAP
GEKLFQKIQEVFGEVPVLAEDLGVITPEVEALRDRFGLPGMKVLQFAFDDGMENPFLPHNYPAHGRVVVYTGTHDNDTTL
GWYRTATPHEKAFMARYLADWGITFREEEEVPWALMHLGMKSVARLAVYPVQDVLALGSEARMNYPGRPSGNWAWRLLPG
ELSPEHGARLRAMAEATERL
>P0A4T1 ~~~malR~~~HTH-type transcriptional regulator MalR~~~COG1609
MPVTIKDVAKAAGVSPSTVTRVIQNKSTISDETKKRVRKAMKELNYHPNLNARSLVSSYTQVIGLVLPDDSDAFYQNPFF
PSVLRGISQVASENHYAIQIATGKDEKERLNAISQMVYGKRVDGLIFLYAQEEDPLVKLVAEEQFPFLILGKSLSPFIPL
VDNDNVQAGFDATEYFIKKGCKRIAFIGGSKKLFVTKDRLTGYEQALKHYKLTTDNNRIYFADEFLEEKGYKFSKRLFKH
DPQIDAIITTDSLLAEGVCNYIAKHQLDVPVLSFDSVNPKLNLAAYVDINSLELGRVSLETILQIINDNKNNKQICYRQL
IAHKIIEK
>P0A4T2 ~~~malR~~~HTH-type transcriptional regulator MalR~~~COG1609
MPVTIKDVAKAAGVSPSTVTRVIQNKSTISDETKKRVRKAMKELNYHPNLNARSLVSSYTQVIGLVLPDDSDAFYQNPFF
PSVLRGISQVASENHYAIQIATGKDEKERLNAISQMVYGKRVDGLIFLYAQEEDPLVKLVAEEQFPFLILGKSLSPFIPL
VDNDNVQAGFDATEYFIKKGCKRIAFIGGSKKLFVTKDRLTGYEQALKHYKLTTDNNRIYFADEFLEEKGYKFSKRLFKH
DPQIDAIITTDSLLAEGVCNYIAKHQLDVPVLSFDSVNPKLNLAAYVDINSLELGRVSLETILQIINDNKNNKQICYRQL
IAHKIIEK
>P06993 ~~~malT~~~HTH-type transcriptional regulator MalT~~~COG2909
MLIPSKLSRPVRLDHTVVRERLLAKLSGANNFRLALITSPAGYGKTTLISQWAAGKNDIGWYSLDEGDNQQERFASYLIA
AVQQATNGHCAICETMAQKRQYASLTSLFAQLFIELAEWHSPLYLVIDDYHLITNPVIHESMRFFIRHQPENLTLVVLSR
NLPQLGIANLRVRDQLLEIGSQQLAFTHQEAKQFFDCRLSSPIEAAESSRICDDVSGWATALQLIALSARQNTHSAHKSA
RRLAGINASHLSDYLVDEVLDNVDLATRHFLLKSAILRSMNDALITRVTGEENGQMRLEEIERQGLFLQRMDDTGEWFCY
HPLFGNFLRQRCQWELAAELPEIHRAAAESWMAQGFPSEAIHHALAAGDALMLRDILLNHAWSLFNHSELSLLEESLKAL
PWDSLLENPQLVLLQAWLMQSQHRYGEVNTLLARAEHEIKDIREDTMHAEFNALRAQVAINDGNPDEAERLAKLALEELP
PGWFYSRIVATSVLGEVLHCKGELTRSLALMQQTEQMARQHDVWHYALWSLIQQSEILFAQGFLQTAWETQEKAFQLINE
QHLEQLPMHEFLVRIRAQLLWAWARLDEAEASARSGIEVLSSYQPQQQLQCLAMLIQCSLARGDLDNARSQLNRLENLLG
NGKYHSDWISNANKVRVIYWQMTGDKAAAANWLRHTAKPEFANNHFLQGQWRNIARAQILLGEFEPAEIVLEELNENARS
LRLMSDLNRNLLLLNQLYWQAGRKSDAQRVLLDALKLANRTGFISHFVIEGEAMAQQLRQLIQLNTLPELEQHRAQRILR
EINQHHRHKFAHFDENFVERLLNHPEVPELIRTSPLTQREWQVLGLIYSGYSNEQIAGELEVAATTIKTHIRNLYQKLGV
AHRQDAVQHAQQLLKMMGYGV
>P59213 ~~~malX~~~Maltooligosaccharide ABC transporter solute-binding lipoprotein~~~COG2182
MSSKFMKSAAVLGTATLASLLLVACGSKTADKPADSGSSEVKELTVYVDEGYKSYIEEVAKAYEKEAGVKVTLKTGDALG
GLDKLSLDNQSGNVPDVMMAPYDRVGSLGSDGQLSEVKLSDGAKTDDTTKSLVTAANGKVYGAPAVIESLVMYYNKDLVK
DAPKTFADLENLAKDSKYAFAGEDGKTTAFLADWTNFYYTYGLLAGNGAYVFGQNGKDAKDIGLANDGSIVGINYAKSWY
EKWPKGMQDTEGAGNLIQTQFQEGKTAAIIDGPWKAQAFKDAKVNYGVATIPTLPNGKEYAAFGGGKAWVIPQAVKNLEA
SQKFVDFLVATEQQKVLYDKTNEIPANTEARSYAEGKNDELTTAVIKQFKNTQPLPNISQMSAVWDPAKNMLFDAVSGQK
DAKTAANDAVTLIKETIKQKFGE
>P59214 ~~~malX~~~Maltooligosaccharide ABC transporter solute-binding lipoprotein~~~COG2182
MSSKFMKSTAVLGTVTLASLLLVACGSKTADKPADSGSSEVKELTVYVDEGYKSYIEEVAKAYEKEAGVKVTLKTGDALG
GLDKLSLDNQSGNVPDVMMAPYDRVGSLGSDGQLSEVKLSDGAKTDDTTKSLVTAANGKVYGAPAVIESLVMYYNKDLVK
DAPKTFADLENLAKDSKYAFAGEDGKTTAFLADWTNFYYTYGLLAGNGAYVFGQNGKDAKDIGLANDGSIAGINYAKSWY
EKWPKGMQDTEGAGNLIQTQFQEGKTAAIIDGPWKAQAFKDAKVNYGVATIPTLPNGKEYAAFGGGKAWVIPQAVKNLEA
SQKFVDFLVATEQQKVLYDKTNEIPANTEARSYAEGKNDELTTAVIKQFKNTQPLPNISQMSAVWDPAKNMLFDAVSGQK
DAKTAANDAVTLIKETIKQKFGE
>P23256 ~~~malY~~~Protein MalY~~~COG1168
MFDFSKVVDRHGTWCTQWDYVADRFGTADLLPFTISDMDFATAPCIIEALNQRLMHGVFGYSRWKNDEFLAAIAHWFSTQ
HYTAIDSQTVVYGPSVIYMVSELIRQWSETGEGVVIHTPAYDAFYKAIEGNQRTVMPVALEKQADGWFCDMGKLEAVLAK
PECKIMLLCSPQNPTGKVWTCDELEIMADLCERHGVRVISDEIHMDMVWGEQPHIPWSNVARGDWALLTSGSKSFNIPAL
TGAYGIIENSSSRDAYLSALKGRDGLSSPSVLALTAHIAAYQQGAPWLDALRIYLKDNLTYIADKMNAAFPELNWQIPQS
TYLAWLDLRPLNIDDNALQKALIEQEKVAIMPGYTYGEEGRGFVRLNAGCPRSKLEKGVAGLINAIRAVR
>P21517 3.2.1.20~~~malZ~~~Maltodextrin glucosidase~~~COG0366
MLNAWHLPVPPFVKQSKDQLLITLWLTGEDPPQRIMLRTEHDNEEMSVPMHKQRSQPQPGVTAWRAAIDLSSGQPRRRYS
FKLLWHDRQRWFTPQGFSRMPPARLEQFAVDVPDIGPQWAADQIFYQIFPDRFARSLPREAEQDHVYYHHAAGQEIILRD
WDEPVTAQAGGSTFYGGDLDGISEKLPYLKKLGVTALYLNPVFKAPSVHKYDTEDYRHVDPQFGGDGALLRLRHNTQQLG
MRLVLDGVFNHSGDSHAWFDRHNRGTGGACHNPESPWRDWYSFSDDGTALDWLGYASLPKLDYQSESLVNEIYRGEDSIV
RHWLKAPWNMDGWRLDVVHMLGEAGGARNNMQHVAGITEAAKETQPEAYIVGEHFGDARQWLQADVEDAAMNYRGFTFPL
WGFLANTDISYDPQQIDAQTCMAWMDNYRAGLSHQQQLRMFNQLDSHDTARFKTLLGRDIARLPLAVVWLFTWPGVPCIY
YGDEVGLDGKNDPFCRKPFPWQVEKQDTALFALYQRMIALRKKSQALRHGGCQVLYAEDNVVVFVRVLNQQRVLVAINRG
EACEVVLPASPFLNAVQWQCKEGHGQLTDGILALPAISATVWMN
>Q93DY9 ~~~mamA~~~Magnetosome protein MamA~~~COG0457
MSSKPSNMLDEVTLYTHYGLSVAKKLGANMVDAFRSAFSVNDDIRQVYYRDKGISHAKAGRYSEAVVMLEQVYDADAFDV
EVALHLGIAYVKTGAVDRGTELLERSIADAPDNIKVATVLGLTYVQVQKYDLAVPLLVKVAEANPVNFNVRFRLGVALDN
LGRFDEAIDSFKIALGLRPNEGKVHRAIAYSYEQMGSHEEALPHFKKANELDERSAV
>P0DO20 ~~~mamA~~~Magnetosome protein MamA~~~
MSSKPSDILDEVTLYAHYGLSVAKKLGMNMVDAFRAAFSVNDDIRQVYYRDKGISHAKAGRYSQAVMLLEQVYDADAFDV
DVALHLGIAYVKTGAVDRGTELLERSLADAPDNVKVATVLGLTYVQVQKYDLAVPLLIKVAEANPINFNVRFRLGVALDN
LGRFDEAIDSFKIALGLRPNEGKVHRAIAFSYEQMGRHEEALPHFKKANELDEGASV
>Q2W8Q0 ~~~mamA~~~Magnetosome protein MamA~~~
MSSKPSDILDEVTLYAHYGLSVAKKLGMNMVDAFRAAFSVNDDIRQVYYRDKGISHAKAGRYSQAVMLLEQVYDADAFDV
DVALHLGIAYVKTGAVDRGTELLERSLADAPDNVKVATVLGLTYVQVQKYDLAVPLLIKVAEANPINFNVRFRLGVALDN
LGRFDEAIDSFKIALGLRPNEGKVHRAIAFSYEQMGRHEEALPHFKKANELDEGASV
>V6F510 ~~~mamB~~~Magnetosome protein MamB~~~COG0053
MKFENCRDCREEVVWWAFTADICMTLFKGILGLMSGSVALVADSLHSGADVVASGVTQLSLKISNKPADERYPFGYGNIQ
YISSAIVGSLLLIGASFLMYGSVVKLISGTYEAPSIFAALGASVTVIVNELMYRYQICVGNENNSPAIIANAWDNRSDAI
SSAAVMVGVIASVIGFPIADTIAAIGVSALVGHIGLELIGKAVHGLMDSSVDTELLQTAWQIATDTPLVHSIYFLRGRHV
GEDVQFDIRLRVDPNLRIKDSSMVAEAVRQRIQDEIPHARDIRLFVSPAPAAVTVRV
>W6KHH6 ~~~mamB~~~Magnetosome protein MamB~~~
MTTAACRKCRDEVIWWAFFINIGQTTYKGVLGVLSGSAALVADAMHSGADVVATLVTMFSVKVSDKKADEKYPFGYGNIQ
FIASSIVGLILFFGALYLMYESTMQIIAGNTSSPSPFAVLGAIVSIATNELMFRYQSCVGRQNNSPAIIANAWDNRSDAL
SSVAVLIGIVAAVVGFPIADRLAAIGVGILVAKIGIELNIDAINGLMDTSVENDVLVDAYNIAKDSQHVHGVHYIRGRNV
GEDVHLDINIYVDADLKVFESDLVADAIRRKIEAEVDHVRDVHVGVTPVRIAA
>Q93DY1 ~~~mamC~~~Magnetosome protein MamC~~~
MSFQLAPYLAKSVPGIGILGGIVGGAAALAKNARLLKDKQITGTEAAIDTGKEAAGAGLATAFSAVAATAVGGGLVVSLG
AALIAGVAAKYAWDLGVDFIEKELRHGKSAEATASDEDILREELA
>A0L9X3 ~~~mamC~~~Magnetosome protein MamC~~~
MAAFNLALYLSKSIPGVGVLGGVIGGSAALAKNLKAKQRGEITTEEAVIDTGKEALGAGLATTVSAYAAGVVGGGLVVSL
GTAFAVAVAGKYAWDYGMEQMEAKLQEKKHQEQGGQTYGDNPDPFDPQELETP
>Q2W8S0 ~~~mamC~~~Magnetosome protein MamC~~~
MPFHLAPYLAKSVPGVGVLGALVGGAAALAKNVRLLKEKRITNTEAAIDTGKETVGAGLATALSAVAATAVGGGLVVSLG
TALVAGVAAKYAWDRGVDLVEKELNRGKAANGASDEDILRDELA
>Q93DY2 ~~~mamD~~~Magnetosome protein MamD~~~
MQDLFLAKVESAMQASQVGALAGQTATVSSVSATTNLATITPTTAGQAPIIVKLDAARQVTELQALMGKTVLVGKTPTTI
GGIGNWIALTPAAGAKTGAAVAGTGQLVMMKVEGTGAAIKLPALAGKSFIVAQPPVAAGTKAAGMLYLNPVGGGDMVAIN
IQNAMTQTGGLVGKTFTVAPSPVIGGTTGKFLVLKPMATGVGKAVGSGAVVAKFVPAAVTGTGGAAAIGAGSATTLMATG
ASTITPVTAAAAGSAMLTAKGVGLGLGLGLGAWGPFALGAIGLAGVVALYTWARRRHGAPDVSDDALLAAVGEE
>Q2W8R9 ~~~mamD~~~Magnetosome protein MamD~~~
MQDLLLAKVESAMQASQVSALAGQTATVTKVSAATNLATITPTAAGQAPIIVKLDATRQVVELQALVGKTVMVGKTPAAI
GGIGNWIALTPVTGAKAAAAATGAGQLVMMKVEGTAAAVNLPALAGKSFTIAQPPVAAGTKAAGMLYLNPVGGGDLIAIN
VQNAATQTGGLVGKTFVVAPSPVIGGTTGKFLVLKPLTAGAGKAVGGGAIAAKFIPAAVTGTGGAAAVGAGSASSLLTAG
AGTVTPITAAGTGSAMLSAKGLGLGLGLGLGAWGPFLLGAAGLAGAAALYVWARRRHGTPDLSDDALLAAAGEE
>V6F2B6 3.4.21.-~~~mamE~~~Magnetosome formation protease MamE~~~COG0265
MTMFNGDVEDGGRSNVSCGKDLKRYLMLMGVVALVVLFGAFIYRQSSGGLRLGAMMEQMTGARGAVNVPAQHGAPSAVVD
PAMSVPARARVAPPSAAGAIATFPPVVDFGPAPVVSGGPFTGVVTLLRNSVVSVTASSSGGQVMPDPLGLVNPDGLPRFA
NPTTRSVENIGTGVIVRNDGFIVTNYHVVRGANSVYVTVKDDVGSIRYSGEIVKMDEALDLALLKITPKVQLTAAVLGDS
DAVNVADEVIAIGTPFGLDMTVSRGIISAKRKTMVIEGMTHSNLLQTDAAINQGNSGGPLVAANGTVVGINTAIYTPNGA
FAGIGFAVPSNQARLFALDEVGWLPTSTAEGPAMGLVAMQRPMGVGVGAAGPVIAAGTPSPHVDGRQNMDCSNCHDIIPA
GNGFQAPMMPVAAPVPPPPIPANAVSPHTDGRQNMTCNTCHQFVGGAAAGPIAFGQPMMPIAAPQQPAPAIRANAANPHT
DGRQNMNCASCHQIIGSVGAAPIAAPGAGGAYRFSQPPGSLAINIQGPRGGQGAVAGSGGSRASLLGAALTPLTQRLGLQ
ANLPAGRGVFVNGVTPNTPAASAGLRPGDVILKVDGRPVHQPEEVAAIMAEMPNGRSVRIGVLRAGDVSNMSLVTGPSGL
AAAVVQAPTAPVVMAGGAPTVPGVQPVIPKVPTEFNWLGMEIETFMAPQPVVGMPGATPVAGGGKGAQVAEVLAGSRAAV
AGLQANDLIIEVNNRPVTSPARLDAAIKAATAAGQQILLKVHRNGQEFWIVL
>Q2W8Q8 3.4.21.-~~~mamE~~~Magnetosome formation protease MamE~~~
MAMFNGDVEDGGRGDASCGKDLKRYLMLMGVVALVVLFGAFIYRQSSGGLRLGAMLEQMGRGTGPAVNVPVQQGGPSAAV
NPAMSVPAGARVAPPSAAGAIATMPPMVDFGPAPIGAGGPFSSVVTLLRNSVVAVTASSANGQAMPDPLGLANPDGLPHF
ANPATRSVENIGTGVIVRNDGFIVTNYHVVRGANSVFVTVQDDVGSTRYSAEIIKMDEALDLALLKVAPKTPLTAAVLGD
SDGVQVADEVIAIGTPFGLDMTVSRGIISAKRKSMVIEGVTHSNLLQTDAAINQGNSGGPLVISNGTVVGINTAIYTPNG
AFAGIGFAVPSNQARLFILDEVGWLPTSTAEGASMGLVAMQRPMGGGVGAAGPAIFAGTRAPHTDGRQNMDCTTCHDLIP
AGNGRPAPMMPIAAPIPPPPIPMGAVSPHTDGRQNMNCANCHQMLGGAAPIAAPGLGGGAYRFAQPPGSLAINIQGPRGG
QSTAAGTGRVTLLGAALTPMSQRLGAQTGVPVGRGVFISGVTPNTPAATAGLRPGDVLLKVDGRPVRLPEEVSAIMVEMH
AGRSVRLGVLRDGDVRNMTLVAGPAGLAAAAVQAPAIADMAQPPMGGMAPTAPGMVAVPGGPAVMPKPPTEFNWLGMEIE
TFQAPRPITGVPGAVPVPGAKGAQVAEVLVGSRAAVAGLQANDLILEVNNRPVAGPARLDAAIKGATNAGQQILLKVNRN
GQEFWIVL
>Q6NE74 ~~~mamF~~~Magnetosome protein MamF~~~COG4818
MAETILIETKTAGGNCRSYLMAGASYLGILCFVPLLMSRDDEYVYFHAKQGLVLWMWSILAMFALHLPGIGKWLFGFSSM
GVLMLSVVGLVSVALRRTWRLPLISHVVALI
>Q2W8Q9 ~~~mamI~~~Magnetosome protein MamI~~~
MPSVIFGLLALALGLLGVTAWWWSVTEFLRGAVPVALLILGLVALASGVQSVRLPRSNKGTASDPDIDG
>V6F519 ~~~mamJ~~~Magnetosome-associated protein MamJ~~~
MAKNRRDRGTDLPGDGDQKISTGPEIVSVTVHPSPNLAAAAKPVQGDIWASLLESSPWSANQGGLVETAQPPSAPIRSQD
PVPVADLVNRWSQPIWRTAPLAGNAESSEEGVVAPSLTQSDSVLAVSDLVIDVQPETDAEVEVSIEPEPALVEPVIEIEA
EAAEVEPEPAPVADLVNRWAQPIWRTAPLAGNAESSEEGVVAPSLTQSDSVLAVSDLVIDVQPEANAEVEVSIEPEPALV
EPVIEIEAEAAEVEPEPAPVEPVIEIEAEAAEVEPEPAPVEPVIEIEAEAAEVEPEPAPVEPAIEIEAIRVELEPVLIDE
VVELVTEFEYSQAESVASADLIANPAPAESSRLAELLDEAAAIAAPAVAVAVEATRQPNKITASVKKRAPVQEVPVEDLL
GGIFGVAGSAVRGVFTIGGGFVDGVVKGGRLVGSNVVAGTRRLAQTIEVSCGSCSSPKCDAEDKNK
>P0DSO6 3.6.1.-~~~mamK-like~~~MamK-like protein~~~
MIVNDNQNILYVGIDFGYSKTVIMTSRGKSLSLKSLVGYPKDFVGLARLGRPYLVGDEAFEMRSYLHLRNPLLDGLLNPI
SEQDIDVTRHFISHIIKCAEPAAGEKVFAVIGVTPRFTAANKKLLLKLAQEYCQNVLLMSAPFLAGNSIGKASGSIIIDI
GAWTTDICAMKGRIPRPEDQSSIAKAGSYIDERLKNSILERYPALQINANIARMVKEQFAFVGRPQLVAACEFRSAGKAV
RCDVTEQVRAACESPFAEIAERIGAVLCVVPPEDQALVLKNIVITGAGAQIRGLPEYVKSMLAPYGDARVSIANEPLMEA
CKGALSMAQEIPPHFWGQL
>Q6NE59 3.6.1.-~~~mamK~~~Actin-like protein MamK~~~COG1077
MWIDLLARERSDKMSEGEGQAKNRLFLGIDLGTSHTAVMTSRGKKFLLKSVVGYPKDVIGLKLLGRPYVVGDEAFEMRSY
LDLRYPLQDGVLSEISDRDIEVARHLLTHVVKSAEPGANDEICAVIGVPARASGANKALLLKMAQEVVHTALVVSEPFMV
GYGLDKLNNTIIVDIGAGTTDICALKGTVPGPEDQVTLTKAGNYLDERLQNAILERHPELQMNTNVACAVKEQFSFVGAR
GEAATFEFRAAGKPVRCDVTESVKIACEALMPDIIESIEILLRSFQPEYQATVLQNIVFAGGGSRIRGLAAYVKDKLRPF
GNADVTCVKDPTFDGCRGALRLAEELPPQYWCQLGDVSGQ
>Q2W8Q6 3.6.1.-~~~mamK~~~Actin-like protein MamK~~~
MSEGEGQAKNRLFLGIDLGTSHTAVMSSRGKKFLLKSVVGYPKDVIGLKLLGRPYVVGDEAFEMRSYLDIRYPLQDGVLS
EISDRDIEVARHLLTHVVKSAEPGPNDEICAVIGVPARASAANKALLLKMAQEVVHTALVVSEPFMVGYGLDKLINTIIV
DIGAGTTDICALKGTVPGPEDQVTLTKAGNYVDERLQNAILERHPELQMNVNVACAVKEQFSFVGTPTEVASFEFRAAGK
PVRADVTEPVKIACEALMPDIIESIETLLRSFQPEYQATVLQNIVFAGGGSRIRGLAAYVKEKLRPFGDANVTCVKDPTF
DGCRGALRLAEELPPQYWRQLGDVSGS
>Q6NE58 ~~~mamL~~~Magnetosome protein MamL~~~
MVRVIGSLVFGGLILLLASSNAHMVETRFGPLIMLAPHFVVLGITFFLGFAIGIVLVFANVMKRRKHKLPGKNIVIKR
>V6F235 ~~~mamM~~~Magnetosome protein MamM~~~COG0053
MRKSGCAVCSRSIGWVGLAVSTVLMVMKAFVGLIGGSQAMLADAMYSLKDMLNALMVIIGTTISSKPLDAEHPYGHGKVE
FILSMVVSVVFIVLTGYLLVHAVQILLDESLHRTPHLIVLWAALVSIGVNVGMYFYSRCVAIETNSPLIKTMAKHHHGDA
TASGAVALGIIGAHYLNMPWIDPAVALWETIDLLLLGKVVFMDAYRGLMDHTAGEAVQNRIVEAAERVPGVRGVIHLRAR
YVGQDIWADMIIGVDPENTVEQAHEICEAVQAAVCGKIRRIESLHVSAEAREIGDTTKPSFSDQPLSFDEVMLSKVDN
>Q6NE56 ~~~mamN~~~Magnetosome protein MamN~~~COG1055
MVGFITLAVFIATFAVIYRWAEGSHLAVLAGAAVLVVIGTISGTYTPRMAVQSIYFETLALIFGMAAISALLARSGVYAY
LAAGTAELSQGQGRWILVMMALVTYGISLASNSLITVAVVVPVTLTVCFRTGIDPVPVIIAEIIAANLGGSSTMIGDFPN
MILASAGKLHFNDFIGGMMPACLILLAVTFLFFEYRQGDWKKAEIPVDLAWVRGEQLRYSDIDHRLLRYGLIIFFITVIG
LVLAGPLKVRPGWIAFVAGLTALALGRFKDEEFFSACGGSDILFFGGLFVMVGALTSVGILDWAVAWLEGVTAGHDRVRA
ILLMWMAAGVTIFVGGGTSAAVFAPVAATLRLDGDGQAAWWALALGIMAGSCAALSGATAGALAMNQYSGFVKRHPELAS
AAAAGLQFTHREYVRWGLPLMGIFLVLSTVYIAVLAG
>Q93DZ1 ~~~mamO~~~Probable membrane transporter protein MamO~~~COG0265
MIEIGETMGDQPTNKIVFCERSWKAPVSILAFLILVTFAWGAYLLDNYDEDDYFRGSDDMSVGQFLVRNVAMPDVQRLYY
TVPPAVVGVGGGGVNAGPVASGAIVGANGYVITTLHSVANVPDITVQVATSAGIRRFPAQVVKTIPGHNLALLKLQTTEK
FLHFRMANIQTVVPGQQVFAFGRNMAGAPLVRQGMVQSSDAPLAVGTTQITHLLRSDAVYSWEQTGGPLVNAQGDLVGIN
IAATGPTGKVEGFTVPAQVIVSHLQDVVRFKTGGAAGVAPPAAQTVAMGSSSWWSKAKAVVGGPTAVPGMGMNVVQGTVT
TGIPSGMPFVDTDHVGGAKIGGYSIADILGLGMLALAAGVTGGMMTMGGGVLQVAGMMVFFGYGMYLIRPVVFLTNVVVY
GAAALRNDKAQLVQWDKVKPLIPWGVAGVVIGYFIGNAIGDSVVGVLLGLFALIMAGKAVLEILQPNAGEDTAEAIAAAE
AGDEMDELMALAEGTTRPKTSGIALPEGPTRSAVLGLPMGLFSGILGISGGVIEVPLQRYIGRISLQNAIANSSVLVFWA
SVAGSVVAFIHGGSTGLIHWEAPVTLALVMIPGAYVGGILGARLMRVLPVRVLKGIYAATMAAIAIKMLTTV
>Q2W8Q2 ~~~mamO~~~Probable membrane transporter protein MamO~~~
MIEVGETMGELPTNKIVFCERSWKTPVSILAFLIFVTFAWGIYLLDHYDEDDNFHGADDLSVGQFLVRNIAMPHVQRLYH
TVPPAVVGVGGGGVNAGPVASGAIVGTNGYVITTLHSVSKLPEISVQVATTGGIRRFPAQVVKTIPGHDLALLKMQTTEK
FLHFRMADVQTVVPGQQVFAFGRNMAGAPLVRQGLVQSADAPLAVGATQITHLLRSDAVYSWEQTGGPLVNAQGDLVGIN
IAATGPTGKVEGFTVPAQVIVSHLQDVVRFKKGSATAPGQPQTQTVAAGSTNWWSKARAVVGGPTAIPGMGMNVVQGNVV
KGNVAPSIPSGMPFIDTDHVGGAKIGGYSVADIVGLVMLALAAGVTGGMMTMGGGVLQVAGMMVFFGYGMYLIRPVVFLT
NVVVYGAASLRNDKAQLVQWDKVKPLIPWGIAGVILGYFIGNAIGDSVVGILLGLFALIMAGKAVMEILQPNAGEETAES
ISATEAEDEMDELMALADGTSRPKASGLALPEGHARSAVLGLPMGLFSGILGISGGVIEVPLQRYVGRISLQNAIANSSV
LVFWASVAGSVVAFLHGSSTGLIHWEAPVTLALVMIPGAYVGGIIGARLMRVLPVRVLKGVYAATMAAIALKMLTSV
>A0A1S7LCW6 1.-.-.-~~~mamP~~~Multi-heme protein MamP~~~
MKLKGTTIVALGMLVVAIMVLASMIDLPGSDMSATPAPPDTPRGAPIVGGQGQAMGLPVAMQRRRGEQRAPVPALSDANG
GFVAPNVQFSEAHWQGMEALPLSIELKRKLKLPLDLEGLLIDETSLNAAVSGLLAGDVLVAINGRKVKTLKKMQKETRRV
QMDRRASLTVYRKGRLLTLTLSEEKNLGLAQVETAPMILPGDIMPHPYRGPCTQCHAIGTTGHITPDPDGIVLPPGPIRA
GAKMPHRDRGPCAACHAIIQ
>Q2W8Q1 1.-.-.-~~~mamP~~~Multi-heme protein MamP~~~
MNSKVALLVVGLAVVLALVIGRQGPVAPQATNTQSQAVAAGPVAAPVAFPQPLYPQAANVAMPVEPDPAAGGGTAPATES
PLPNFVPRKLKVFEGHWQGMDGRLMTEELARKLNYPRGLQGVLLGEVTLNAAFSGLLAGDLIVRIDDTPVTDMESFKAAS
RTVANRSDARISVLRKDNRPGAPVVRKLTVVLREAEGGLGFAQLEGAPMILAGDPRPHGYRGACTDCHPIGQGFELTPDP
DLISLPPPTITRDMVARSVNPHEVRGPCEACHVIK
>Q93DY8 ~~~mamQ~~~Magnetosome protein MamQ~~~COG1704
MAVSDADASSVDKVESITLQRVKQSEELLAQLYVVEESPRRMGRGPVQLMLAISVLSLVAFITTLLMRYNAFVTMYEDAQ
AKRSNFEVMIQRRDNLFGNLVKLTLNHAALEHSIFSHTSDKRKESVEAGKGGPIGSAIEQLMKQGGIGKLLGDGGAGKAL
LGADGGFGNALGRLMAIVEQYPTVQSADTYKHMMTSLVDMEDRIASKREEFNASAATYNVAITKWPWDYLAMITGFKRVE
YFHEKPAGDTPIITPQIFQELLPLTHSQESKN
>Q6NE51 ~~~mamR~~~Magnetosome protein MamR~~~
MIWTAVIKGSALMTFVQGAMALVDKVFGEEILPHRIYSSGEAAQLLGMERLQVLEMVRAGTIKAKKVGDNYRILGSNLVE
YMNR
>Q2W8P5 ~~~mamT~~~Magnetosome protein MamT~~~
MSMEAPRRGRRWVSLGMIALLAAIGLGLYWDQLSTPSGITPATSPRRAEGLLLGRLPLPMEPSLLSPLERLLEPPLRYKL
MTIRHIPPVKPGTGMPHPYVGDCIQCHLMVGGPAAGSQFKTPYGAVLENLSRVRKLGPPILPTSRQPHPPAGRCIKCHDI
VVKVPVDKKGGMRWQL
>V6F2C2 ~~~mamX~~~Magnetosome protein MamX~~~COG0265
MNTKAVAHPDIAVWIMALGIAFSMALVLTALFNANPWEDHTYDLAPPIVAGMAAPHRDGREKMVCSSCHIVTPASAATGP
GAGTLPIVEGTPAPHVDGREKMACASCHTIVKKGSVAKSGKASPAPVAFSQGMPLPEAMSVALAVTPAPAPLGNEAHERM
VPFRYQGKIVSVAGAGTRSVWGDIYIQINDGINPPMWIDLAPLWFLQAEGCLVRPGMFVKGTAFRDPTQASAGLDYAMSV
MANGEVCALRDDHLNGLWANVGGVDAEER
>V6F5F3 ~~~mamY~~~Liposome tubulation protein MamY~~~
MLMNFVNNVSKTINGGARIVYVGSFSWAVLSLLFVTAFSGWNNIFSMLPHEIFILVLTISLPIALIVLIFMLSQIVRTVE
SVKSEISTLSQRDPVSEEAVTMLADLFREHRDAVAAQVAAQVEATAQLVQINQDNRALAAPSPDSGDENPLALLAQMFRE
YRETVTAQLEAQISATTQLVEASRDSRDGIVDELRSQRVLSQEITQELSHIAQSRNVVPVAEPGLDPSQRIDRMRALAEV
LGLALNDLSMTATQLLSEHLNAAHGDREGTQKFISTLTNAYFAGDKNVFFRSLVSEVVNHSDQLQQCAIGAENVRQQISK
ILREAREIRSLVSACDPNDLVRIVFEDGELWALEKALAEHFLIDGTPISDA
>Q2W8K3 ~~~mamY~~~Liposome tubulation protein MamY~~~
MAIAAIMGDVLMLMGFNKAAFGKLNSASRAALIGAVIWAVLSIVYLTIFNGWKNLFTMLPHEFFIVLLSIALPIGLTVLI
LMLSRIVKSVDTLKSEVTTLSRNDVSSEGSVAMLADLFREHRAAIAAQVEAQVEATTQLIRLNQEGRALAAPAQASGTDE
AMTLLAQLFREHREAVAAQLEAQASATAQLVQVTRDSRDGIVDELRSQRVLSQEITQELSQITQSRTVAPAPPGLDPSQR
IDRMRALAEVLGLALNDLSMTATQVLTEHLNAAHGDRDGTRKFISTLTTAYFAGDKNVFFRSLVQEAVNRSEQLRRCAED
TESVRQQISKILREAREIRSLVAACDPNDLVRIVFEDGELWALEKALAEHFLIDGSPVWTETAPDSGMD
>V6F4W4 ~~~mamZ~~~Magnetosome protein MamZ~~~COG2717
MLEAWMPKSGKPSTGTTPADFAPTQWNIIYLLMTVGSLVAALSISIQPLLLDKIFGIAFEKEGAVNADIQVVAEIVSIVC
VGWFGLLSDRIGRVRIIATGFLIAVAGAAMSLLSLQIGLAFGAAGLVLFYLTRVLLTVGADTVQLQLSTLVGDVSSRANR
PRLMGNLVFMMVFGGTMLSAIIMQMADYKGGVFIIMCLPLLIGIAGFQMTRESLRDVAQPQQAPTGDEHPLRQVWSVITS
DPRLQLAFAAAFYTRADVIILSLFFSLWCISVSDLVGVTRTYATAHAAVMIGLLGLAVLAAIPLWRSFIERHSRISAIGA
SLSLAAVGYIWLGMFANPFNWLVALPLLMVGIGHAGCFVTLQVLTVDVSPKPILGAMVGAGYLVGGLGTVMLVQSGGYYF
DALGPRAPFILMGTGKMLVTLYAAWLLANGIDETCDHHLKSTRKVDWKPLVFLTAALPFVWLIGRSVIEGYFSNGSLGEA
PVGFVNRYLGDWAFTFLIISLSMRPVQEITGIKSLAKYRRMIGLFAFFYAVLHVLAYVTLEWALNLGDMASDIYKRPFIL
LGLAAFLLLIPLAFTSTNSQIKKIGGKRWKRLHRATYVINALVALHFILAANHENGEPYVYAAAVIVLLWYRFYQWRGGN
VLRALRIG
>C7H4X2 ~~~~~~Microbial anti-inflammatory molecule~~~
MMMPANYSVIAENEMTYVNGGANFIDAIGAVTAPIWTLDNVKTFNTNIVTLVGNTFLQSTINRTIGVLFSGNTTWKEVGN
IGKNLFGTNVKGNPIEKNNFGDYAMNALGIAAAVYNLGVAPTKNTVKETEVKFTV
>B3PF24 3.2.1.78~~~man5A~~~Mannan endo-1,4-beta-mannosidase~~~COG2730
MSLFTPLSETNVRSHTNTSSVFCRRIKTLVAGLTALGLMLAAVSASAGFYVSGKQLREGNGNNFIMRGVNLPHAWFPDRT
NQALADISATGANSVRVVLSNGRLWSRTPESQVASIISQAKARQLITVLEVHDTTGYGEQTAATLSEAVDYWIAIRNALI
GQEDYVIINIGNEPFGNGQSASTWLNLHRDAINRLRNAGFTHTLMVDAANWGQDWENIMRNNASSLFNSDPRRNVIFSVH
MYEVYPNDTAVNNYMSAFNSMNLPLVVGEFAANHFGSYVDAGSIMARAQQYGFGYLGWSWSGNSSNLSALDVVTNFNAGS
LTTWGNLLINNTNGIRNTSRKATIFGGSGSSSSSAGSCGTAPNGYPYCCNASSATGNGWGWENNRSCVVATTSTSCNWYG
TSYPICVNTSSGWGWENNRSCIAASTCAAQ
>G1K3N4 3.2.1.78~~~~~~Mannan endo-1,4-beta-mannosidase~~~
AGFYVDGNTLYDANGQPFVMRGINHGHAWYKDTASTAIPAIAEQGANTIRIVLSDGGQWEKDDIDTIREVIELAEQNKMV
AVVEVHDATGRDSRSDLNRAVDYWIEMKDALIGKEDTVIINIANEWYGSWDGSAWADGYIDVIPKLRDAGLTHTLMVDAA
GWGQYPQSIHDYGQDVFNADPLKNTMFSIHMYEYAGGDANTVRSNIDRVIDQDLALVIGEFGHRHTDGDVDEDTILSYSE
ETGTGWLAWSWKGNSTEWDYLDLSEDWAGQHLTDWGNRIVHGADGLQETSKPSTVFTDDNGGHPEPPT
>O31646 5.3.1.8~~~manA~~~Mannose-6-phosphate isomerase ManA~~~COG1482
MTTEPLFFKPVFKERIWGGTALADFGYTIPSQRTGECWAFAAHQNGQSVVQNGMYKGFTLSELWEHHRHLFGQLEGDRFP
LLTKILDADQDLSVQVHPNDEYANIHENGELGKTECWYIIDCQKDAEIIYGHNATTKEELTTMIERGEWDELLRRVKVKP
GDFFYVPSGTVHAIGKGILALETQQNSDTTYRLYDYDRKDAEGKLRELHLKKSIEVIEVPSIPERHTVHHEQIEDLLTTT
LIECAYFSVGKWNLSGSASLKQQKPFLLISVIEGEGRMISGEYVYPFKKGDHMLLPYGLGEFKLEGYAECIVSHL
>O05511 5.3.1.8~~~gmuF~~~Probable mannose-6-phosphate isomerase GmuF~~~COG1482
MTHPLFLEPVFKERLWGGTKLRDAFGYAIPSQKTGECWAVSAHAHGSSSVKNGPLAGKTLDQVWKDHPEIFGFPDGKVFP
LLVKLLDANMDLSVQVHPDDDYAKLHENGDLGKTECWYIIDCKDDAELILGHHASTKEEFKQRIESGDWNGLLRRIKIKP
GDFFYVPSGTLHALCKGTLVLEIQQNSDTTYRVYDYDRCNDQGQKRTLHIEKAMEVITIPHIDKVHTPEVKEVGNAEIIV
YVQSDYFSVYKWKISGRAAFPSYQTYLLGSVLSGSGRIINNGIQYECNAGSHFILPAHFGEFTIEGTCEFMISHP
>P39841 5.3.1.8~~~yvyI~~~Putative mannose-6-phosphate isomerase YvyI~~~COG1482
MTQSPIFLTPVFKEKIWGGTALRDRFGYSIPSESTGECWAISAHPKGPSTVANGPYKGKTLIELWEEHREVFGGVEGDRF
PLLTKLLDVKEDTSIKVHPDDYYAGENEEGELGKTECWYIIDCKENAEIIYGHTARSKTELVTMINSGDWEGLLRRIKIK
PGDFYYVPSGTLHALCKGALVLETQQNSDATYRVYDYDRLDSNGSPRELHFAKAVNAATVPHVDGYIDESTESRKGITIK
TFVQGEYFSVYKWDINGEAEMAQDESFLICSVIEGSGLLKYEDKTCPLKKGDHFILPAQMPDFTIKGTCTLIVSHI
>A1A278 3.2.1.78~~~~~~Mannan endo-1,4-beta-mannosidase~~~
MKTTVTKLLATVAAASTIFGMSTLPAFAAEGKSASNGNSVNISDVNATAETRALFDKLKNSGKGDLRFGQQHATDENISS
SASQGDVYETTGKYPAVFGWDAGLALRGAEKPGSGADKNANAKALAQNITDADSKGAIVTLSAHWCNPGTGKDFNDTTAV
ASELLPGGKYSGTFNKELDAIAATAQRAKRSDGTLIPIIFRPLHENNGSWFWWGATHASASEYKELYRYIVDYLRDVKDV
HNLLYAYSPGGVFNGDSTDYLATYPGDQWVDVLGYDEYDSDDSADDSSAWINTVVKDMKMVSDQASQRGKIVALTEFGRS
GDRKFKESGTGDKDTKFFSELAEALAENVPSTAYMMTWANFGGGGDNFQAYTSWKGSDGEADFKAFADSNKNLMASKDNV
DYSNAPAAAMQNGSARIVTPVDGNRVTDTKVVVRVKTEGVKYSDLDLNSAIVTTDRGQNVKLKYSCNGYFTGILDLNAAG
INLDQSKLTLTPQVKTKDGKTLAAADGNGSVTVKLGAKPEQTVDNVEDFDSYDNEAELQSVYSPSHSTKSNLTLVDSPED
NGTKAGNIHYDFVSYPEYNGFQRSHTPKQDWSGFSKLNMFLKADGSDHKFVVQVNAGGVTFEAYPKIDGTDGHVVSLNFG
DADGNGGDFAPASWDTAHAGMKLSQKLLSKVGSFALYINDNGGNRPKSGDLTLDSIKLDGKRDAYAPNTNPTPGNTAKAQ
SVDDFSGYSDDAAAQSAWGNRGHTEVLSLDEGPTDGSKALRFKYDFSNGGWYDVAKYLDGANWSGESVLAFQVKGDGSGN
AIGLQIGTSDGKYFLASVKLDFTGWKQIEIPLVDNANLTQSWPEDANKDNPMTEDDLASIKELVFASQQWNSESDGLDSS
IADIKVEPAENTSNEQTPKDESKTEVKADKEQEQSEDTSADVTAQDPATCPISDEDSKGSTGNTTVTVKPTPDTKEPADN
TGKDGLSRTGSNIISAIAAVAVLLLGGCAVLIARKRKGGDIE
>P49424 3.2.1.78~~~manA~~~Mannan endo-1,4-beta-mannosidase~~~COG4124
MKTITTARLPWAAQSFALGICLIALLGCNHAANKSSASRADVKPVTVKLVDSQATMETRSLFAFMQEQRRHSIMFGHQHE
TTQGLTITRTDGTQSDTFNAVGDFAAVYGWDTLSIVAPKAEGDIVAQVKKAYARGGIITVSSHFDNPKTDTQKGVWPVGT
SWDQTPAVVDSLPGGAYNPVLNGYLDQVAEWANNLKDEQGRLIPVIFRLYHENTGSWFWWGDKQSTPEQYKQLFRYSVEY
LRDVKGVRNFLYAYSPNNFWDVTEANYLERYPGDEWVDVLGFDTYGPVADNADWFRNVVANAALVARMAEARGKIPVISE
IGIRAPDIEAGLYDNQWYRKLISGLKADPDAREIAFLLVWRNAPQGVPGPNGTQVPHYWVPANRPENINNGTLEDFQAFY
ADEFTAFNRDIEQVYQRPTLIVK
>P00946 5.3.1.8~~~manA~~~Mannose-6-phosphate isomerase~~~COG1482
MQKLINSVQNYAWGSKTALTELYGMENPSSQPMAELWMGAHPKSSSRVQNAAGDIVSLRDVIESDKSTLLGEAVAKRFGE
LPFLFKVLCAAQPLSIQVHPNKHNSEIGFAKENAAGIPMDAAERNYKDPNHKPELVFALTPFLAMNAFREFSEIVSLLQP
VAGAHPAIAHFLQQPDAERLSELFASLLNMQGEEKSRALAILKSALDSQQGEPWQTIRLISEFYPEDSGLFSPLLLNVVK
LNPGEAMFLFAETPHAYLQGVALEVMANSDNVLRAGLTPKYIDIPELVANVKFEAKPANQLLTQPVKQGAELDFPIPVDD
FAFSLHDLSDKETTISQQSAAILFCVEGDATLWKGSQQLQLKPGESAFIAANESPVTVKGHGRLARVYNKL
>P49425 3.2.1.78~~~manA~~~Mannan endo-1,4-beta-mannosidase~~~COG4124
MTLLLVWLIFTGVAGEIRLEAEDGELLGVAVDSTLTGYSGRGYVTGFDAPEDSVRFSFEAPRGVYRVVFGVSFSSRFASY
ALRVDDWHQTGSLIKRGGGFFEASIGEIWLDEGAHTMAFQLMNGALDYVRLEPVSYGPPARPPAQLSDSQATASAQALFA
FLLSEYGRHILAGQQQNPYRRDFDAINYVRNVTGKEPALVSFDLIDYSPTREAHGVVHYQTPEDWIAWAGRDGIVSLMWH
WNAPTDLIEDPSQDCYWWYGFYTRCTTFDVAAALADTSSERYRLLLRDIDVIAAQLQKFQQADIPVLWRPLHEAAGGWFW
WGAKGPEPFKQLWRLLYERLVHHHGLHNLIWVYTHEPGAAEWYPGDAYVDIVGRDVYADDPDALMRSDWNELQTLFGGRK
LVALTETGTLPDVEVITDYGIWWSWFSIWTDPFLRDVDPDRLTRVYHSERVLTRDELPDWRSYVLHATTVQPAGDLALAV
YPNPGAGRLHVEVGLPVAAPVVVEVFNLLGQRVFQYQAGMQPAGLWRRAFELALAPGVYLVQVRAGNLVARRRWVSVR
>P25081 5.3.1.8~~~manA~~~Mannose-6-phosphate isomerase~~~
MQKLINSVQNYAWGSKTALTELYGIANPQQQPMAELWMGAHPKSSSRITTANGETVSLRDAIEKNKTAMLGEAVANRFGE
LPFLFKVLCAAQPLSIQVHPNKRNSEIGFAKENAAGIPMDAAERNYKDPNHKPELVFALTPFLAMNAFREFSDIVSLLQP
VAGAHSAIAHFLQVPNAERLSQLFASLLNMQGEEKSRALAVLKAALNSQQGEPWQTIRVISEYYPDDSGLFSPLLLNVVK
LNPGEAMFLFAETPHAYLQGVALEVMANSDNVLRAGLTPKYIDIPELVANVKFEPKPAGELLTAPVKSGAELDFPIPVDD
FAFSLHDLALQETSIGQHSAAILFCVEGEAVLRKDEQRLVLKPGESAFIGADESPVNASGTGRLARVYNKL
>P51529 3.2.1.78~~~manA~~~Mannan endo-1,4-beta-mannosidase~~~
MRNARSTLITTAGMAFAVLGLLFALAGPSAGRAEAAAGGIHVSNGRVVEGNGSAFVMRGVNHAYTWYPDRTGSIADIAAK
GANTVRVVLSSGGRWTKTSASEVSALIGQCKANKVICVLEVHDTTGYGKDGATSLDQAGDYWVGVKSAAWRAQEDYVVVN
IGNEPFGNTNYAAWTDATKSAIGKLRGAGLGHALMVDAPNWGQDWSGTMRSNAASVFASDPDRNTVFSIHMYGVYDTAAE
VRDYLNAFVGNGLPIVVGEFGDQHSDGNPDEDAIMATAQSLGVGYLGWSWSGNGGGVEYLDMVNGFDPNSLTSWGNRILY
GSNGIAATSRTATVYGGGGGSTGGTAPNGYPYCVNGGASDPDGDGWGWENSRSCVVRGSAADH
>Q9I1E0 ~~~~~~Mannitol-binding protein~~~
MNDSIKACLAAACLALPLLAQGAETLTIATVNNNDMIRMQRLSKVFEESHPDIALKWVVLEENVLRQRLTTDIATQGGQF
DLLTIGMYEAALWGAKGWLEPMSGLPADYALDDLLPSVRDGLSVKGTLYALPFYAEASITYYRKDLFQQAGLRMPEQPTW
TQLGEFAARLNRPDQGQYGICLRGKAGWGENMALIGTLANAFGARWFDERWQPEFSGGEWKKALDFYVSTLKRYGPPGAS
SNGFNENLALFNSGKCAIWVDASVAGSFVTDKSQSKVADATGFAFAPREVTDKGASWLYSWALAIPASSRAKDAAKAFAT
WATSQAYGKLVADREGVANVPPGTRASTYSEAYLAAAPFARVTLESLKRVDPNHPTLKPVPYVGIQLVTIPEFQAIGTQV
GKLFSAALTGQMSSDQALAAAQQSTAREMKRAGYPK
>O05512 3.2.1.78~~~gmuG~~~Mannan endo-1,4-beta-mannosidase~~~COG4124
MFKKHTISLLIIFLLASAVLAKPIEAHTVSPVNPNAQQTTKTVMNWLAHLPNRTENRVLSGAFGGYSHDTFSMAEADRIR
SATGQSPAIYGCDYARGWLETANIEDSIDVSCNGDLMSYWKNGGIPQISLHLANPAFQSGHFKTPITNDQYKKILDSSTV
EGKRLNAMLSKIADGLQELENQGVPVLFRPLHEMNGEWFWWGLTSYNQKDNERISLYKQLYKKIYHYMTDTRGLDHLIWV
YSPDANRDFKTDFYPGASYVDIVGLDAYFQDAYSINGYDQLTALNKPFAFTEVGPQTANGSFDYSLFINAIKQKYPKTIY
FLAWNDEWSAAVNKGASALYHDSWTLNKGEIWNGDSLTPIVE
>P16699 3.2.1.78~~~~~~Mannan endo-1,4-beta-mannosidase A and B~~~
MKVYKKVAFVMAFIMFFSVLPTISMSSEANGAALSNPNANQTTKNVYSWLANLPNKSNKRVVSGHFGGYSDSTLAWIKQC
ARELTGKMPGILSCDYKNWQTRLYVADQISYGCNQELINFWNQGGLVTISVHMPNPGFHSGENYKTILPTSQFQNLTNHR
TTEGRRWKDMLDKMADGLDELQNNGVTVLFRPLHEMNGEWFWWGAEGYNQFDQTRANAYISAWRDMYQYFTHERKLNNLI
WVYSPDVYRDHVTSYYPGANYVDIVALDSYHPDPHSLTDQYNRMIALDKPFAFAEIGPPESMAGSFDYSNYIQAIKQKYP
RTVYFLAWNDKWSPHNNRGAWDLFNDSWVVNRGEIDYGQSNPATVLYDFENNTLSWSGCEFTDGGPWTSNEWSANGTQSL
KADVVLGNNSYHLQKTVNRNLSSFKNLEIKVSHSSWGNVGSGMTARVFVKTGSAWRWNAGEFCQFAGKRTTALSIDLTKV
SNLHDVREIGVEYKAPANSNGKTAIYLDHVTVR
>P22533 ~~~manA~~~Beta-mannanase/endoglucanase A~~~
MRLKTKIRKKWLSVLCTVVFLLNILFIANVTILPKVGAATSNDGVVKIDTSTLIGTNHAHCWYRDRLDTALRGIRSWGMN
SVRVVLSNGYRWTKIPASEVANIISLSRSLGFKAIILEVHDTTGYGEDGAACSLAQAVEYWKEIKSVLDGNEDFVIINIG
NEPYGNNNYQNWVNDTKNAIKALRDAGFKHTIMVDAPNWGQDWSNTMRDNAQSIMEADPLRNLVFSIHMYGVYNTASKVE
EYIKSFVDKGLPLVIGEFGHQHTDGDPDEEAIVRYAKQYKIGLFSWSWCGNSSYVGYLDMVNNWDPNNPTPWGQWYKTNA
IGTSSTPTPTSTVTPTPTPTPTPTPTVTATPTPTPTPVSTPATSGQIKVLYANKETNSTTNTIRPWLKVVNSGSSSIDLS
RVTIRYWYTVDGERAQSAISDWAQIGASNVTFKFVKLSSSVSGADYYLEIGFKSGAGQLQPGKDTGEIQMRFNKDDWSNY
NQGNDWSWIQSMTSYGENEKVTAYIDGVLVWGQEPSGATPAPAPTATPTPTPTVTPTPTVTPTPTVTATPTPTPTPTPTP
VSTPATGGQIKVLYANKETNSTTNTIRPWLKVVNSGSSSIDLSRVTIRYWYTVDGERAQSAISDWAQIGASNVTFKFVKL
SSSVSGADYYLEIGFKSGAGQLQPGKDTGEIQIRFNKSDWSNYNQGNDWSWIQSMTSYGENEKVTAYIDGVLVWGQEPSG
TTPSPTSTPTVTVTPTPTPTPTPTPTPTVTPTPTVTPTPTVTATPTPTPTPIPTVTPLPTISPSPSVVEITINTNAGRTQ
ISPYIYGANQDIEGVVHSARRLGGNRLTGYNWENNFSNAGNDWYHSSDDYLCWSMGISGEDAKVPAAVVSKFHEYSLKNN
AYSAVTLQMAGYVSKDNYGTVSENETAPSNRWAEVKFKKDAPLSLNPDLNDNFVYMDEFINYLINKYGMASSPTGIKGYI
LDNEPDLWASTHPRIHPNKVTCKELIEKSVELAKVIKTLDPSAEVFGYASYGFMGYYSLQDAPDWNQVKGEHRWFISWYL
EQMKKASDSFGKRLLDVLDLHWYPEARGGNIRVCFDGENDTSKEVVIARMQAPRTLWDPTYKTSVKGQITAGENSWINQW
FSDYLPIIPNVKADIEKYYPGTKLAISEFDYGGRNHISGGIALADVLGIFGKYGVNFAARWGDSGSYAAAAYNIYLNYDG
KGSKYGNTNVSANTSDVENMPVYASINGQDDSELHIILINRNYDQKLQVKINITSTPKYTKAEIYGFDSNSPEYKKMGNI
DNIESNVFTLEVPKFNGVSHSITLDFNVSIKIIQNEVIKFIRNLVFMRALV
>Q8X7P1 2.7.7.13~~~manC1~~~Mannose-1-phosphate guanylyltransferase 1~~~COG0662
MAQSKLYPVVMAGGSGSRLWPLSRVLYPKQFLCLKGDLTMLQTTICRLNGVECESPVVICNEQHRFIVAEQLRQLNKLTE
NIILEPAGRNTAPAIALAALAAKRHSPENDPLMLVLAADHVIADEDAFRAAVRNAMPYAEAGKLVTFGIVPDLPETGYGY
IRRGEVSAGEQDTVAFEVAQFVEKPNLETAQAYVASGEYYWNSGMFLFRAGRYLEELKKYRPDILDACEKAMSAVDPDLD
FIRVDEDAFLACPEESVDYAVMERTADAVVVPMDAGWSDVGSWSSLWEISAHTAEGNVCHGDVINHKTENSYVYAESGLV
TTVGVKDLVVVQTKDAVLIADRNAVQDVKKVVEQIKADGRHEHRVHREVYRPWGKYDSIDAGDRYQVKRITVKPGEGLSV
QMHHHRAEHWVVVAGTAKVTIDGDIKLLGENESIYIPLGATHCLENPGKIPLDLIEVRSGSYLEEDDVVRFADRYGRV
>B0T0B1 4.2.1.8~~~~~~D-mannonate dehydratase Caul1427~~~COG4948
MLKIIDAKVIVTCPGRNFVTLKITTSDGVTGVGDATLNGRELAVVSYLRDHMIPCLIGRDAHRIEDVWQFFYRGSYWRGG
PVAMTALAAVDMALWDIKAKLAGMPLYQLLGGACREGVMVYGHANGETIEDTIAEARKYQALGYKAIRLQSGVPGLPSTY
GVSGDKMFYEPADGNLPTENVWSTSKYLKHAPKLFEAAREALGDDVHLLHDVHHRLTPIEAGRLGKDLEPYRLFWLEDAV
PAENQAGFRLIRQHTTTPLAVGEIFSHVWDCKQLIEEQLIDYLRATVLHAGGITNLRKIAAFADLHHVRTGCHGATDLSP
ITMAAALHFDLSVSNFGLQEYMRHTPETDAVFPHAYSYKDGMLHPGEAPGLGVDIDEALAGQYPYKRAYLPVNRLEDGTM
YNW
>Q9A4L8 4.2.1.8~~~~~~D-mannonate dehydratase CC2812~~~COG4948
MPKIIAAKTIVTCPGRNFVTLKIMTDEGVYGLGDATLNGRELAVEAYLTQHVIPCLIGRDAHQIEDIWQYLYRGCYWRRG
PVTMAAIAAVDTALWDIKGKIAGLPVYQLLGGACRVGVMVYGHANGETIDETLDNAAVYAQQGYKAIRLQTGVPGMSGTY
GVSKDKFFYEPADSDLPKETIWSTERYLRSTPALFEAARERLGDDLHLLHDVHHRLTPIEAARLGKDLEPYRLFWMEDAT
PAENQASFRLIRQHTTTPLAVGEIFNSIWDCKQLIEEQLIDYIRATVVHAGGITHLKKLASFADLHHVRTGCHGATDLSP
VCMGAALHFDLSIPNFGVQEYMRHTPETDAVFPHAYTFKDGMLHPGDAPGLGVDIDEDLAAKYPYQRAYLPIARRLDGSM
HDW
>B0T4L2 4.2.1.8~~~~~~D-mannonate dehydratase Caul1835~~~COG4948
MPKITAARVVVTCPGRNFVTLKIETSDGVYGVGDATLNGRELPVVSYLTDHVIPCLIGRDAHRIEDIWQYLYKGAYWRRG
PVTMAAIAAVDMALWDIKAKIAGLPLYQLLGGACREGIMVYGHANGATIEETLENAAVYAAQGYKAIRLQSGVPGLKGVY
GVSKDKFFYEPADGDLPTESLWSTEKYLRSAPGLFEAARDKLGWDLHLLHDVHHRLTPIEAGRLGKDLEPYRPFWMEDAV
PAENQASFRLIRQHTTTPLAVGEVFNSIWDCKQLIEEQLIDYIRATVVHAGGITHLRKIASFADLHHVRTGCHGATDLSP
IAMAAALHFDLSIPNFGIQEYMRHTEATDTVFPHAYTFNDGMLHPGDAVGLGVDINETEAAKYPYKRAYLPIARREDGSM
HDW
>Q9AAR4 4.2.1.8~~~~~~D-mannonate dehydratase CC0532~~~COG4948
MLKIIDAKVIVTCPGRNFVTLKITTEDGITGVGDATLNGRELSVVSFLQDHMVPSLIGRDAHQIEDIWQFFYRGSYWRGG
PVAMTALAAVDMALWDIKGKVAGLPVYQLLGGACRTGVTVYGHANGETIEDTIAEAVKYKAMGYKAIRLQTGVPGLASTY
GVSKDKMFYEPADNDLPTENIWSTAKYLNSVPKLFERAREVLGWDVHLLHDVHHRLTPIEAARLGKDLEPYRLFWLEDSV
PAENQAGFRLIRQHTTTPLAVGEIFAHVWDAKQLIEEQLIDYLRATVLHAGGITNLKKIAAFADLHHVKTGCHGATDLSP
VTMAAALHFDMSITNFGLQEYMRHTPETDAVFPHAYTFSDGMLHPGDKPGLGVDIDEDLAAKHPYKRAYLPVNRLEDGTM
FNW
>B3PDB1 4.2.1.-~~~rspA~~~D-galactonate dehydratase family member RspA~~~COG4948
MKIVDAKVIVTCPGRNFVTLKIVTDQGIYGIGDATLNGREKSVVSYLEDYLIPVLIGRDPQQIEDIWQFFYRGAYWRRGP
VGMTALAAIDVALWDIKAKLANMPLYQLLGGKSRERILSYTHANGKDLDSTLEAVRKAKDKGYKAIRVQCGIPGIAKTYG
VSTNTKSYEPADADLPSVEVWSTEKYLNYIPDVFAAVRKEFGPDIHLLHDVHHRLTPIEAARLGKALEPYHLFWMEDAVP
AENQESFKLIRQHTTTPLAVGEVFNSIHDCRELIQNQWIDYIRTTIVHAGGISQMRRIADFASLFHVRTGFHGATDLSPV
CMGAALHFDYWVPNFGIQEHMAHSEQMNAVFPHAYTFNDGYFTPGEKPGHGVDIDEKLAAQYPYKRACLPVNRLEDGTLW
HW
>Q8FHC7 4.2.1.-~~~rspA~~~D-galactonate dehydratase family member RspA~~~COG4948
MHHDKRCKESNMKIVKAEVFVTCPGRNFVTLKITTEDGITGLGDATLNGRELSVASYLQDHLCPQLIGRDAHRIEDIWQF
FYKGAYWRRGPVTMSAISAVDMALWDIKAKAANMPLYQLLGGASREGVMVYCHTTGHSIDEALDDYARHQELGFKAIRVQ
CGIPGMKTTYGMSKGKGLAYEPATKGQWPEEQLWSTEKYLDFMPKLFDAVRNKFGFDEHLLHDMHHRLTPIEAARFGKSI
EDYRMFWMEDPTPAENQECFRLIRQHTVTPIAVGEVFNSIWDCKQLIEEQLIDYIRTTLTHAGGITGMRRIADFASLYQV
RTGSHGPSDLSPVCMAAALHFDLWVPNFGVQEYMGYSEQMLEVFPHNWTFDNGYMHPGEKPGLGIEFDEKLAAKYPYEPA
YLPVARLEDGTLWNW
>D8ADB5 4.2.1.-~~~rspA~~~D-galactonate dehydratase family member RspA~~~
MHHDKRCKESNMKIVKAEVFVTCPGRNFVTLKITTEDGITGLGDATLNGRELSVASYLQDHLCPQLIGRDAHRIEDIWQF
FYKGAYWRRGPVTMSAISAVDMALWDIKAKAANMPLYQLLGGASREGVMVYCHTTGHSIDEALDDYARHQELGFKAIRVQ
CGIPGMKTTYGMSKGKGLAYEPATKGQWPEEQLWSTEKYLDFMPKLFDAVRNKFGFNEHLLHDMHHRLTPIEAARFGKSI
EDYRMFWMEDPTPAENQECFRLIRQHTVTPIAVGEVFNSIWDCKQLIEEQLIDYIRTTLTHAGGITGMRRIADFASLYQV
RTGSHGPSDLSPVCMAAALHFDLWVPNFGVQEYMGYSEQMLEVFPHNWTFDNGYMHSGDKPGLGIEFDEKLAAKYPYEPA
YLPVARLEDGTLWNW
>A4WA78 4.2.1.-~~~~~~D-galactonate dehydratase family member Ent638_1932~~~COG4948
MKIVGAEVFVTCPGRNFVTLKITTDDGLVGLGDATLNGRELSVASYLKDHLCPQLIGRDAHRIEDIWQFFYKGAYWRRGP
VTMSAISAVDMALWDIKAKAAGMPLYQLLGGASREGVMVYCHTTGHTIDDVLEDYARHKEMGFKAIRVQCGVPGMKTTYG
MSKGKGLAYEPATKGDWPEEQLWSTEKYLDFTPKLFDAVRSKFGYDEHLLHDMHHRLTPIEAARFGKSIEEFRLFWMEDP
TPAENQECFRLIRQHTVTPIAVGEVFNSIWDCKQLIEEQLIDYIRTTMTHAGGITGMRRIADFASLYQVRTGSHGPSDLS
PICHAAALHFDLWVPNFGVQEYMGYSEQMLEVFPHSWTFDNGYMHPGEKPGLGIEFDEKLAAKYPYDPAYLPVARLEDGT
LWNW
>B1ELW6 4.2.1.-~~~rspA~~~D-galactonate dehydratase family member RspA~~~COG4948
MKIVRAEVFVTCPGRNFVTLKITTEDGITGLGDATLNGRELSVASYLQDHLCPQLIGRDAHRIEDIWQFFYKGAYWRRGP
VTMSAISAVDMALWDIKAKVANMPLYQLLGGASREGVMVYCHTTGHSIEEALDDYARHQELGFKAIRVQCGIPGMKTTYG
MSKGKGLAYEPATKGQWPEEQLWSTEKYLDFMPKLFDAVRNKFGFDEHLLHDMHHRLTPIEAARFGKSIEDYRMFWMEDP
TPAENQECFRLIRQHTVTPIAVGEVFNSIWDCKQLIEEQLIDYIRTTLTHAGGITGMRRIADFASLYQVRTGSHGPSDLS
PVCMAAALHFDLWVPNFGVQEYMGYSEQMLEVFPHNWTFDNGYMHPGDKPGLGIEFDEKLAAKYPYEPAYLPVARLEDGT
LWNW
>A6VRA1 4.2.1.-~~~~~~D-galactonate dehydratase family member Mmwyl1_0037~~~COG4948
MKIRSAKVIVTCPGRNLVTLKIETDEGVYGIGDATLNGREKSVVSYLEDHVIPTLIGKDPQRVEDIWQYLYRGAYWRRGP
VGMTAIAAVDVALWDIKAKLAGMPLYQLLGGKSREKVMVYGHATGLDIESCLEEVRKHVELGYKAVRVQCGIPGIPTTYG
VSKEAGKPYEPADSALPAEHVWSTEKYLNNVPELFAAVRKEFGEDLHILHDVHHRLTPIQAARLGKEVEKFHLFWLEDCT
AVENQSSYELIRKHTTTPLAIGEVFNSLSDCQELIQNQLIDYIRATITHAGGITNIRRIADFASVFHVKTGFHGATDLSP
VCMGAALHFDTWVPNFGIQEHMPHTKETDLVFPHAYEFNDGFFTPGDVPGHGVDIDEEIAAKYPYKPAYLPVNRLEDGTL
WNW
>A4XF23 4.2.1.8~~~manD~~~D-mannonate dehydratase~~~COG4948
MKITAARVIITCPGRNFVTLKIETDQGVYGIGDATLNGRELSVVAYLQEHVAPCLIGMDPRRIEDIWQYVYRGAYWRRGP
VTMRAIAAVDMALWDIKAKMAGMPLYQLLGGRSRDGIMVYGHANGSDIAETVEAVGHYIDMGYKAIRAQTGVPGIKDAYG
VGRGKLYYEPADASLPSVTGWDTRKALNYVPKLFEELRKTYGFDHHLLHDGHHRYTPQEAANLGKMLEPYQLFWLEDCTP
AENQEAFRLVRQHTVTPLAVGEIFNTIWDAKDLIQNQLIDYIRATVVGAGGLTHLRRIADLASLYQVRTGCHGATDLSPV
TMGCALHFDTWVPNFGIQEYMRHTEETDAVFPHDYWFEKGELFVGETPGHGVDIDEELAAKYPYKPAYLPVARLEDGTMW
NW
>C6D9S0 4.2.1.-~~~~~~D-galactonate dehydratase family member PC1_0802~~~COG4948
MKIVSAEVFVTCPGRNFVTLKITTDSGLTGLGDATLNGRELPVASYLNDHVCPQLIGRDAHQIEDIWQYFYKGAYWRRGP
VTMSAISAVDMALWDIKAKAANMPLYQLLGGASRTGVMVYCHTTGHSIDEVLDDYAKHRDQGFKAIRVQCGVPGMETTYG
MAKGKGLAYEPATKGSLPEEQLWSTEKYLDFTPKLFEAVRDKFGFNEHLLHDMHHRLTPIEAARFGKSVEDYRLFWMEDP
TPAENQACFRLIRQHTVTPIAVGEVFNSIWDCKQLIEEQLIDYIRTTITHAGGITGMRRIADFASLYQVRTGSHGPSDLS
PICMAAALHFDLWVPNFGVQEYMGYSEQMLEVFPHSWTFDNGYMHPGEKPGLGIEFDEKLAAKYPYDPAYLPVARLEDGT
LWNW
>A5V6Z0 4.2.1.8~~~~~~D-mannonate dehydratase~~~COG4948
MKITGARVIVTCPDRNFVTLKIETDEGLTGIGDATLNGRELAVASYLTDHVIPCLIGRDAHRIEDIWNYLYRGAYWRRGP
VTMSAIAAVDTALWDIKAKAAGLPLYQLLGGRSRDGVMVYGHANGRDIEETTDEVARYIEMGYRAIRAQTGVPGLASTYG
VSSDKMYYEPADAALPTENIWSTEKYLDHVPKLFDRLRDRFGFDHHLLHDVHHRLTPIEAGRLGKSLEPYRLFWMEDATP
AENQEAFRLIRQHTVTPLAVGEVFNTIWDAKDLIQNQLIDYIRATVVHAGGISHLRRIADLAALYQVRTGCHGATDLSPV
CMGAALHFDIWVPNFGVQEYMRHTEATDAVFPHAYSFASGYMTPGDVPGHGVEIDEKLAAKYPYKPCSLPVNRLEDGTLW
HW
>B5RAG0 4.2.1.-~~~rspA~~~D-galactonate dehydratase family member RspA~~~
MKIVGAEVFVTCPGRNFVTLKITTEDGITGLGDATLNGRELPVASYLTDHLCPQLIGRDARRIEDIWQFFYKGAYWRRGP
VTMSAISAVDMALWDIKAKAANMPLYQLLGGASREGVMVYCHTTGHTIDDVLEDYARHKEQGFKAIRVQCGVPGMKTTYG
MAKGKGLAYEPATKGQWPEEQLWSTEKYLDFTPKLFDAVRNTFGFNEHLLHDMHHRLTPIEAARFGKGIEDYRLFWMEDP
TPAENQACFRLIRQHTVTPIAVGEVFNSIWDCKQLIEEQLIDYIRTTLTHAGGITGMRRIADFASLYQVRTGSHGPSDLS
PVCMAAALHFDLWVPNFGVQEFMGYSEQMLEVFPHNWTFEGGYMHPGDKPGLGIEFDEKLAAKYPYDPAYLPVARLEDGT
LWNW
>Q1NAJ2 4.2.1.8~~~~~~D-mannonate dehydratase~~~COG4948
MPKIIDAKVIITCPGRNFVTLKIMTDEGVYGLGDATLNGRELAVASYLTDHVIPCLIGRDAHRIEDLWQYLYKGAYWRRG
PVTMTAIAAVDMALWDIKGKIAGLPVYQLLGGASREGVMVYGHANGTTIEDTVKVALDYQAQGYKAIRLQCGVPGMASTY
GVSKDKYFYEPADADLPTENIWNTSKYLRIVPELFKAARESLGWDVHLLHDIHHRLTPIEAGRLGQDLEPYRPFWLEDAT
PAENQEAFRLIRQHTTAPLAVGEIFNSIWDAKDLIQNQLIDYIRATVVHAGGITHLRRIAALADLYQIRTGCHGATDLSP
VCMAAALHFDLSVPNFGIQEYMRHMPETDAVFPHAYTFADGMMHPGDQPGLGVDIDEDLAAGYEYKRAFLPVNRLEDGTM
FNW
>D9UNB2 4.2.1.-~~~~~~D-galactonate dehydratase family member SSLG_02014~~~COG4948
MASSAHSDSASADASAEIPAEILAPAPWSTPADDSEHLRITAVRTFLTAPQGCPYVIVRVETNQPGLYGLGCASDPQRTL
AIRSVVDDYYAPMLLGRDPSDIEDLHRLLFNSGYWRGGSIGQNALAGVDVALWDIKGKVAGLPLHQLLGGRAREAADAYT
HVDGDNAGEIAEKVLAAHERGYRHVRVQVSVPGTDTYGTAPRDAAEARRRELRAGSWDSLAYLRHVPPVLREIRERVGTG
VELLHDAHERLTPSQARELVHEVEDARLFFLEDALAPEDAAHFDQLRAAGSVPLAVGELYHDVMMYLPLLQRQVIDFARI
RIPTLGGLTPTRKLVAAVELFGARTAPHGPGDVSPVGMAANLGLDLSSPAFGVQEAATFREATREVFPGTPVPERGRFHG
TGLPGLGVDFDEAAARKYPVPEPLRHDRWALLRNGDGSVQRP
>G7TAD9 4.2.1.8~~~~~~D-mannonate dehydratase~~~COG4948
MSQPSDSTAPLQGSARDREIVEARVIVTCPGRNFVTLKIRTRSGITGVGDATLNGRELAVAAYLQEHLVPNLIGRDAGRI
EDIWQFFYRGAYWRRGPVTMSAIAAVDVALWDILGKMAGMPLYQLLGGRSREGALVYGHANGRDIAETSDEVGRFREMGF
IAIRAQCGVPGIKKTYGISVGGKPYEPAESELPTETVWSTPRYLGVVPRLFEQLRADHGDEIELLHDAHHRLTPIEAARL
GRDLEPYRLFWLEDATPAENQRAFEIIRQHTVTPLAVGEVFNSIWDCKHLIEQQLIDYIRTTIVHAGGLTHVRRLADFAA
LHQVRTGFHGATDLSPVCMGAALHFDTWVPNFGIQEYMFHSDEANAVFPHDYQFRAGRLHCGETPGHGVDIDEALAARYP
YTPKQLPILRLEDGTMGDW
>Q84DC4 3.5.1.86~~~mdlY~~~Mandelamide hydrolase~~~
MRHPVDMPEKVGTDAKRLFAQPEHLWELTLTEASALVRHRRITSRQLVEAWLSRIADFSELNAFISVDAAAALKQADSYD
HYLEAGGDPLPLGGVPIAVKDNIQVVGFANTAGTPALSKFFPTCNARVIEPLLKAGAIVVGKTNMHELAFGTSGYNTAYH
IPGVIGVRNAFDHSCIAGGSSSGSGTAVGALLIPAALGTDTGGSVRQPGAVNGCVGFRPTVGRYPVDGITPISPTRDTPG
PIARSVEDIVLLDSIITGALPAEVPAAESIRLGVVDQLWADLSEPVRKLTEDALRKLEQQGVQIVRVSMSEIFEMSHAVS
MPLALHECRSALTEYLSANETGVSFDELVAGISSPDVRTIFEDYILPGRLGELEGQSVDLEQAYATAMKDARPKLIQSFE
FLFKEHQLDAIIHPTTPDLAIKSNPAATSFEAFARMIRNADPASNAGMPGISLPAGLSQQEGLPVGIEIEGLPGSDARLL
SIANFIESILGRGPTPTRSGVESKISM
>F2JVT6 5.3.1.7~~~~~~D-mannose isomerase~~~COG2942
MSYPAFDSKTFLEAHIEKTMAFYFPTCIDPEGGFFQFFKDDGSVYDPNTRHLVSSTRFIFNFAQAYLHTNIAEYKHAAVH
GIQYLRQRHQSQSGGYVWLLDGGTNLDETNHCYGLAFVILAYSNALQIGLSEAEVWIEVTYDLLETHFWENKHGLYLDEI
SSDWKTVSPYRGQNANMHMCEALMSAFDATQNPKYLDRAKLLAKNICQKQASLSNSNEVWEHYTNDWQIDWDYNKNDPKH
LFRPWGFQPGHQTEWAKLLLMLDKRSPENWYLPKAKYLFDLAYKKAWDTKKGGLHYGYAPDGTVCDPDKYFWVQAESFAA
AWLLYKATKDETYYKQYLTLWEFSWNHMIDHTFGAWYRILDENNAQYDNNKSPAGKTDYHTMGACYEVLKTLTL
>A0A0P9JFY5 5.3.1.7~~~~~~D-mannose isomerase~~~
MDNNNHTFSSWLRSPAHHQWLALEGKRLLGFAKAAKLENGFGGLDDYGRLMVGATAGTMNTARMTHCFAMAHVQGIPGCA
ALIDHGIAALSGPLHDAEHGGWFSAALEDHGKTDKQAYLHAFVALAASSAVVAGRPAAQALLSDVIQVIQSRFWSDEEGA
MRESFSQDWSDEEPYRGANSNMHSTEAFLALADVTGDAQWLDRALSIVERVIHQHAGANNFQVIEHFTSGWQPLPDYNRE
NPADGFRPFGTTPGHAFEWARLVLHLEAARRRAGRSNPDWLLDDARQLFANACRYGWDVDGAPGIVYTLDWQNKPVVRHR
LHWTHCEAAAAAAALLQRTGEQQYEDWYRCFWEFNETLFIDIEHGSWRHELNERNEPSEDIWPGKPDLYHAYQATLLPVL
PLAPSLASAMAGLD
>A0A077LPS9 5.3.1.7~~~manI~~~D-mannose isomerase~~~
MTLWTARAAHRAWLDAEARRLVDFAAAADHPEHGFAWLDGSGAPLPEQGVHTWITCRVTHVAALAHLEGIPGASALADHG
LRALAGPLRDPEHDGWFTALDSRGTVADSRKEAYQHAFVLLAAASATVAGRPGARELLDAAAAVIEQRFWEEETGRCRES
WDAAWHADEPYRGANSNMHLVEAFLAAFDATGDRVWAERALRIAHFFVHEVAAPRDWRLPEHFTPDWQVVADYNTDDRAH
PFRPYGVTVGHVLEWARLLVHVEAALPDPPSWLLADAEAMFAAAVARGWSVDGTEGFVYTLDYDDTPVVRSRMHWVVAEA
ISAAAVLGQRTGDERYEHWYRTWWDHAATYFVDTVQGSWHHELDPTLAPPPGGTWSGKPDVYHAYQATRLPLLPLAPSLA
GALATVG
>O31644 ~~~manR~~~Transcriptional regulator ManR~~~COG1762
MEYINTRQKEILYLLLSEPDDYLVVQDFADRVQCSEKTIRNDLKVIEDYLNEHSHAQLIRKPGLGVYLHIEEQERTWLSQ
QLHTEHFSSRQRSDKERMLHIAYDLLMNPKPVSAKDIAARHFVNRSSIKKDLYAVEEWLKRFDLTLVSRQRLGLKVEGNE
RNKRKALARISDLIHNTAFTSQFIKSKFLHYEVDFVTKEIKSLQKKHSLYFTDETFESLLLHTLLMVRRIKMKQPISLSP
KEMAAVKKKKEYQWTFACLQRLEPVFAIRFPEEEAVYLTLHILGGKVRYPLQTEENLENAVLPKVVGHLINRVSELKMMD
FHKDQDLINGLNIHLNTVLQRLSYDLSVANPMLNDIKKMYPYLFHLIIDVLEDINQTFDLHIPEEEAAYLTLHFQAAIER
MQGSSETHKKAVIVCHMGIGMSQLLRTKIERKYHQIAVMACIAKADLKDYIKKHEDIDLVISTIALENITVPHIVVSPLL
EPGEEKKLSAFIRQLGESHRQKQKTFQMLNNTTPFLVFLQQEAEHRYKLIEQLATALFEKGYVDKDYAVHAVMREKMSAT
NIGSGIAIPHANAKFIKQSAIAIATLKEPLEWGNEKVSLVFMLAVKHEDQTMTKQLFSELSYLSEQPAFVQKLTKETNVM
TFLSHLDY
>P11444 5.1.2.2~~~mdlA~~~Mandelate racemase~~~
MSEVLITGLRTRAVNVPLAYPVHTAVGTVGTAPLVLIDLATSAGVVGHSYLFAYTPVALKSLKQLLDDMAAMIVNEPLAP
VSLEAMLAKRFCLAGYTGLIRMAAAGIDMAAWDALGKVHETPLVKLLGANARPVQAYDSHSLDGVKLATERAVTAAELGF
RAVKTKIGYPALDQDLAVVRSIRQAVGDDFGIMVDYNQSLDVPAAIKRSQALQQEGVTWIEEPTLQHDYEGHQRIQSKLN
VPVQMGENWLGPEEMFKALSIGACRLAMPDAMKIGGVTGWIRASALAQQFGIPMSSHLFQEISAHLLAATPTAHWLERLD
LAGSVIEPTLTFEGGNAVIPDLPGVGIIWREKEIGKYLV
>P54572 1.1.1.38~~~mleA~~~NAD-dependent malic enzyme 1~~~COG0281
MIAKHMIRTLMIETPSVPGNLGRVATAIGLLGGDIGEVETVKVGPNYTMRNITVQVENEEQLQEVIAAVQALGEGIRLHT
VSDEVLSAHEGGKIQMKSKMPIRSLAELGRVYTPGVADVCRLIEKEPEKASIYTTISNSVAIVTDGTAILGLGNIGSVAG
MPVMEGKAALFDQLAGISGIPILLDTSDPEEIIKTVKHISPGFSGILLEDIGSPHCFEIEDRLKEELNIPVMHDDQHGTA
VVTLAAAISACRSAGVDLKEAKVGQIGLGAAGVAICRMFMAYGVNAVYGTDKSESAMNRLEQYGGQAVSSIEELMETCDI
VIATTGVPGLIKPAFVRSGQVILALSNPKPEIEPEAALQAGAAYAADGRSVNNVLGFPGIFRGALNAKSTEINHDMLVAA
AEAIAACTKQGDVVPQPLDSKVHHAVAAAVEHAALTAVK
>P26616 1.1.1.38~~~maeA~~~NAD-dependent malic enzyme~~~COG0281
MEPKTKKQRSLYIPYAGPVLLEFPLLNKGSAFSMEERRNFNLLGLLPEVVETIEEQAERAWIQYQGFKTEIDKHIYLRNI
QDTNETLFYRLVNNHLDEMMPVIYTPTVGAACERFSEIYRRSRGVFISYQNRHNMDDILQNVPNHNIKVIVVTDGERILG
LGDQGIGGMGIPIGKLSLYTACGGISPAYTLPVVLDVGTNNQQLLNDPLYMGWRNPRITDDEYYEFVDEFIQAVKQRWPD
VLLQFEDFAQKNAMPLLNRYRNEICSFNDDIQGTAAVTVGTLIAASRAAGGQLSEKKIVFLGAGSAGCGIAEMIISQTQR
EGLSEEAARQKVFMVDRFGLLTDKMPNLLPFQTKLVQKRENLSDWDTDSDVLSLLDVVRNVKPDILIGVSGQTGLFTEEI
IREMHKHCPRPIVMPLSNPTSRVEATPQDIIAWTEGNALVATGSPFNPVVWKDKIYPIAQCNNAFIFPGIGLGVIASGAS
RITDEMLMSASETLAQYSPLVLNGEGMVLPELKDIQKVSRAIAFAVGKMAQQQGVAVKTSAEALQQAIDDNFWQAEYRDY
RRTSI
>O30807 1.1.1.39~~~dme~~~NAD-dependent malic enzyme~~~COG0280
MNTGDKAKSQAVPASGDIDQQALFFHRYPRPGKLEIQPTKPLGNQRDLALAYSPGVAAPCLAIKDNPETAADFTARANLV
AVVSNGTAVLGLGNIGPLASKPVMEGKAVLFKKFAGIDVFDIEIDAPTVDRMVDVISALEPTFGGINLEDIKAPECFEVE
RRLREKMEIPVFHDDQHGTAIIVAAAVLNGLELAGKDIAEAKIVASGAGAAALACLNLLVTLGARRENIWVHDIEGLVYK
GREALMDEWKAVYAQESDNRVLADSIGGADVFLGLSAAGVLKPELLARMAEKPLIMALANPTPEIMPEVARAARPDAMIC
TGRSDFPNQVNNVLCFPHIFRGALDCGARTINEEMKMAAVRAIAGLAREEPSDVAARAYSGETPVFGPDYLIPSPFDQRL
ILRIAPAVAKAAAESGVATRPIQDFDAYLDKLNRFVFRSGFIMKPVFAAAKNAAKNRVIFAEGEDERVLRAAQVLLEEGT
AKPILIGRPQIIETRLRRYGLRIRPDVDFEVVNPEGDPRYRDYVDDYFALVGRLGVIPEAARTIVRTNTTVIGALAVKRG
EADALICGVEGRYSRHLRDVSQIIGKRSGVLDFSALSLLISQRGATFFTDTYVSFSPSAEEIAQTTVMAANEIRRFGITP
RAALVSHSNFGSRDSESAFKMRTALQLVRELAPDLEVDGEMHGDSAISEVLRQRVMPDSTLNGEANLLVFPNLDAANITL
GVVKTMTDSLHVGPILLGSALPAHILSPSVTSRGVVNMAALAVVESSHPV
>P45868 1.1.1.38~~~maeA~~~NAD-dependent malic enzyme 2~~~COG0281
MGYYLTWLTISRKKEICLNNIKKTKEGHLETTLRGKEVLSIPTLNKGVAFSLEERQELGLEGLLPPTVLSLDQQAQRAYE
QFQAQPDRLRQNVYLSDLANRNEVLFYKLLKNHLREMLPVVYTPTVGEAIQEYSHEYRRPQGIYLSIDNIDGIEKAFENL
HATAGDIDLIVATDSESILGIGDWGVGGINIAIGKLAVYTAAAGIDPSRVIPVVLDVGTNNEKLLNDPLYIGNKHERVQG
ERYEAFIDAYVKAALKFFPKALLHWEDLGNKNARNIMKKYNHEILTFNDDIQGTGAITLAGVLAAMKKTGASIKDQRVVI
FGAGSAGIGIADQIRDTMVLAGLSEEEANKRFYTLDYRGLLTEDIEGILDFQKPYLRNADEVKDWKRDEKGQIPFDEVVR
QAKPTILIGTSGVSGAFTEEIVKEMASHVDRPVIMPMSNPTHLAEAVPEDLFKWTDGKVLIATGSPFDNVEYNGVSYEIG
QSNNAFAFPGLGLGSIVAEARIITPAMFAATADAIAEMVDLETPGAGLLPSIDKLQEVSIQVAIAVAEAAIKDGVANRQP
EDVKQAVLDAMWTPEYKKVIAK
>P76558 1.1.1.40~~~maeB~~~NADP-dependent malic enzyme~~~COG0280
MDDQLKQSALDFHEFPVPGKIQVSPTKPLATQRDLALAYSPGVAAPCLEIEKDPLKAYKYTARGNLVAVISNGTAVLGLG
NIGALAGKPVMEGKGVLFKKFAGIDVFDIEVDELDPDKFIEVVAALEPTFGGINLEDIKAPECFYIEQKLRERMNIPVFH
DDQHGTAIISTAAILNGLRVVEKNISDVRMVVSGAGAAAIACMNLLVALGLQKHNIVVCDSKGVIYQGREPNMAETKAAY
AVVDDGKRTLDDVIEGADIFLGCSGPKVLTQEMVKKMARAPMILALANPEPEILPPLAKEVRPDAIICTGRSDYPNQVNN
VLCFPFIFRGALDVGATAINEEMKLAAVRAIAELAHAEQSEVVASAYGDQDLSFGPEYIIPKPFDPRLIVKIAPAVAKAA
MESGVATRPIADFDVYIDKLTEFVYKTNLFMKPIFSQARKAPKRVVLPEGEEARVLHATQELVTLGLAKPILIGRPNVIE
MRIQKLGLQIKAGVDFEIVNNESDPRFKEYWTEYFQIMKRRGVTQEQAQRALISNPTVIGAIMVQRGEADAMICGTVGDY
HEHFSVVKNVFGYRDGVHTAGAMNALLLPSGNTFIADTYVNDEPDAEELAEITLMAAETVRRFGIEPRVALLSHSNFGSS
DCPSSSKMRQALELVRERAPELMIDGEMHGDAALVEAIRNDRMPDSSLKGSANILVMPNMEAARISYNLLRVSSSEGVTV
GPVLMGVAKPVHVLTPIASVRRIVNMVALAVVEAQTQPL
>O30808 1.1.1.40~~~tme~~~NADP-dependent malic enzyme~~~COG0280
MPGIDKTDRAMTSVTAQEALDFHSQGRPGKLEISPTKPMATQRDLSLAYSPGVAVPVKAIADDPATAYDYTARGNMVAVI
SNGTAILGLGNLGALASKPVMEGKAVLFKRFADVDSIDLEVDTENVDEFVNCVRFLGPSFGGINLEDIKAPDCFIIEQRL
REVMDIPVFHDDQHGTAIIAAAGLVNALTLTGRDFKTAKLVCNGAGAAAIACIELIKAMGFNPENIILCDTKGVIYKGRT
DGMNQWKSAHAVETDRRTLAEALDGADVFFGLSAKGALSADMVRSMGARPIIFAMANPDPEITPEEVALIRDDAIVATGR
SDYPNQVNNVLGFPYIFRGALDVRASTINDAMKIAAAEALANLAKEDVPDDVAAAYQGNRPRFGPQYIIPVPFDPRLISA
IPMAVAKAAMETGVARKPIEDLKAYGQQLSARRDPIASTLQRIVERVRRQPKRIVFAEGEEVQMMRSAIAYANQQLGTAL
LLGREEVMRETAEREGIDLDRAGIQIVNARLSKRVGAYTDFLYSRLQRKGYLFRDVQRLINTDRNHFAASMVALGDADGM
VTGLTRNYSTALEDVRRCIDPKPGHRVIGVSIALCRGRTVLVADTAVHDMPTSEELADIAEEAAGLAKRLGYVPRVAMLA
YSTFGHPSGERSERVREAVKILDRRRVDFEYDGEMAADVALNARVMEQYPFCRLSGTANVLVMPAFHSASISTKMLQELG
GSTVIGPLLVGLDKSVQIASMSAKDSDLVNLAAIAAYNAGT
>O34389 1.1.1.38~~~malS~~~NAD-dependent malic enzyme 3~~~COG0281
MKQFRVTNEGDIQTTLRGLEVLSVPFLNKGVAFTEEERKELGLKGFLPPKVLTIDDQAKRAYEQYSAQPDDLSKNVYLTA
LHDRNETLFYRLLNDHLGEMLPIVYTPTVGTAIQRYSHEYRKPRGLYLSIDDPDGMKEAFKQYKDQSDTIDLIVATDAEG
ILGIGDWGVGGIAISIGKLAVYTAAAGIDPSRVLAVVLDAGTNQESLLNDPLYVGNQHSRVRGERYDQFIDDYVALARET
FPNALLHWEDFGAKNARSILKRYKDKVCTFNDDIQGTGAVSLAAVLSCAKASKVPLRDHRVVIFGAGTAGIGIAEQLREA
LVREGLSEEESYKRFWCIDRNGLLTDDMDQLLDFQKPYARSADEVKDYQRNGDGGGIDLLEVVRQAKPTILIGTSTVSGA
FTEEIVKEMASHVKRPAILPMSNPTTLSEAKPEDLIEWTEGRALITTGSPFPPVEYNGVTYHIGQANNALVFPGLGLGTI
VTKSKLITDGMFEACARAIAGMVNVGVPGAPMLPKVEDLRTVSATVAVEVAKTAMKEGVATEEPEDIIQAVQDAMWYPVY
KPIRAI
>O34962 1.1.1.40~~~ytsJ~~~Bifunctional malic/malolactic enzyme~~~COG0281
MSLREEALHLHKVNQGKLESKSKVEVRNAKDLSLAYSPGVAEPCKDIHEDINKVYDYTMKGNMVAVVTDGTAVLGLGNIG
PEAALPVMEGKAVLFKSFAGVDAFPIALNTNDVDKIVETVKLLEPTFGGVNLEDIAAPNCFIIEERLKKETNIPVFHDDQ
HGTAIVTVAGLVNALKLSGKSMSSIKVVANGAGAAGIAIIKLLHHYGVRDIVMCDSKGAIYEGRPNGMNDVKNEVAKFTN
QDRKDGSLKDVIVDADVFIGVSVAGALTKEMVQSMAKDPIIFAMANPNPEIMPEDAREAGASVVGTGRSDFPNQVNNVLA
FPGIFRGALDVRATHINEEMKIAAVEAIASLVSEDELSADYVIPAPFDKRVAPAVAKAVAKAAMETGVARITVDPEEVAE
KTRKLTIIGE
>P16468 1.1.1.38~~~~~~NAD-dependent malic enzyme~~~
MALPGGAAMNITIRLQFEKDIVSFSDIAAAIGKAGGDIVGIDVISSSKVHTVRDITVSALDTKQCDLIIEALKKIRGVKI
VNVSDRTFLMHIGGKIETNSKIPVKTRDDLSRVYTPGVARVCTAIAEDPRKAYSLTIKRNTVAVVSDGTAVLGLGDIGPY
AAMPVMEGKAMLFKEFAGVDAFPICLDTKDTEEIIQIVKAIAPAFGGINLEDISAPRCFEIEKRLKEELDIPVFHDDQHG
TAVVLLAGLLNALKIVDKKLEDIKVVLTGIGAAGIACTKILLAAGVRNIIGVDRHGAIHRDETYENPYWQEYAQLTNPDN
LKGSLSDVIAGADVFIGVSAPGILKVEDVKKMARDPIVFAMANPIPEIDPELAEPYVRVMATGRSDYPNQINNVLCFPGI
FRGALDCRAREINEEMKLAAAKAIASVVTEDELNETYIIPSVFNSKVVERVRQAVVEAAYRTGVARKDNIPVGGYTGQ
>P9WK25 1.1.1.38~~~mez~~~Putative malate oxidoreductase [NAD]~~~COG0281
MSDARVPRIPAALSAPSLNRGVGFTHAQRRRLGLTGRLPSAVLTLDQQAERVWHQLQSLATELGRNLLLEQLHYRHEVLY
FKVLADHLPELMPVVYTPTVGEAIQRFSDEYRGQRGLFLSIDEPDEIEEAFNTLGLGPEDVDLIVCTDAEAILGIGDWGV
GGIQIAVGKLALYTAGGGVDPRRCLAVSLDVGTDNEQLLADPFYLGNRHARRRGREYDEFVSRYIETAQRLFPRAILHFE
DFGPANARKILDTYGTDYCVFNDDMQGTGAVVLAAVYSGLKVTGIPLRDQTIVVFGAGTAGMGIADQIRDAMVADGATLE
QAVSQIWPIDRPGLLFDDMDDLRDFQVPYAKNRHQLGVAVGDRVGLSDAIKIASPTILLGCSTVYGAFTKEVVEAMTASC
KHPMIFPLSNPTSRMEAIPADVLAWSNGRALLATGSPVAPVEFDETTYVIGQANNVLAFPGIGLGVIVAGARLITRRMLH
AAAKAIAHQANPTNPGDSLLPDVQNLRAISTTVAEAVYRAAVQDGVASRTHDDVRQAIVDTMWLPAYD
>P19994 3.4.11.18~~~map~~~Methionine aminopeptidase 1~~~COG0024
MIICKTPRELGIMREAGRIVALTHEELKKHIKPGISTKELDQIAERFIKKQGAIPSFKGYNGFRGSICVSVNEELVHGIP
GSRVLKDGDIISIDIGAKLNGYHGDSAWTYPVGNISDDDKKLLEVTEESLYKGLQEAKPGERLSNISHAIQTYVENEQFS
VVREYVGHGVGQDLHEDPQIPHYGPPNKGPRLKPGMVLAIEPMVNAGSRYVKTLADNWTVVTVDGKKCAHFEHTIAITET
GFDILTRV
>P9WK21 3.4.11.18~~~map-1~~~Methionine aminopeptidase 1~~~COG0024
MRPLARLRGRRVVPQRSAGELDAMAAAGAVVAAALRAIRAAAAPGTSSLSLDEIAESVIRESGATPSFLGYHGYPASICA
SINDRVVHGIPSTAEVLAPGDLVSIDCGAVLDGWHGDAAITFGVGALSDADEALSEATRESLQAGIAAMVVGNRLTDVAH
AIETGTRAAELRYGRSFGIVAGYGGHGIGRQMHMDPFLPNEGAPGRGPLLAAGSVLAIEPMLTLGTTKTVVLDDKWTVTT
ADGSRAAHWEHTVAVTDDGPRILTLG
>O34484 3.4.11.18~~~mapB~~~Methionine aminopeptidase 2~~~COG0024
MIVTNDQELEGLKKIGRIVALAREEMKRKAEPGMSTKDLDLIGKAVLDEHGAVSAPEKEYDFPGVTCISVNDEVAHGIPS
TSKILKAGDLVNIDISAEFGGFYSDTGISFVLGEGEERLHKLCQCAENAFQKGLQQAKAGKRQNQIGRAVYHEARSQGFT
VIKTLTGHGIGRSLHEAPNHIMNYYDPFDNALFKNGTVIALEPFISTKAETIVEAGDGWTFKTPDKSMVAQVEHTIVITK
DEPIILTKL
>P9WK19 3.4.11.18~~~map~~~Methionine aminopeptidase 2~~~COG0024
MPSRTALSPGVLSPTRPVPNWIARPEYVGKPAAQEGSEPWVQTPEVIEKMRVAGRIAAGALAEAGKAVAPGVTTDELDRI
AHEYLVDNGAYPSTLGYKGFPKSCCTSLNEVICHGIPDSTVITDGDIVNIDVTAYIGGVHGDTNATFPAGDVADEHRLLV
DRTREATMRAINTVKPGRALSVIGRVIESYANRFGYNVVRDFTGHGIGTTFHNGLVVLHYDQPAVETIMQPGMTFTIEPM
INLGALDYEIWDDGWTVVTKDRKWTAQFEHTLLVTDTGVEILTCL
>P0AE18 3.4.11.18~~~map~~~Methionine aminopeptidase~~~COG0024
MAISIKTPEDIEKMRVAGRLAAEVLEMIEPYVKPGVSTGELDRICNDYIVNEQHAVSACLGYHGYPKSVCISINEVVCHG
IPDDAKLLKDGDIVNIDVTVIKDGFHGDTSKMFIVGKPTIMGERLCRITQESLYLALRMVKPGINLREIGAAIQKFVEAE
GFSVVREYCGHGIGRGFHEEPQVLHYDSRETNVVLKPGMTFTIEPMVNAGKKEIRTMKDGWTVKTKDRSLSAQYEHTIVV
TDNGCEILTLRKDDTIPAIISHDE
>Q9ZCD3 3.4.11.18~~~map~~~Methionine aminopeptidase~~~COG0024
MTIKIHTEKDFIKMRAAGKLAAETLDFITDHVKPNVTTNSLNDLCHNFITSHNAIPAPLNYKGFPKSICTSINHVVCHGI
PNDKPLKNGDIVNIDVTVILDGWYGDTSRMYYVGDVAIKPKRLIQVTYDAMMKGIEVVRPGAKLGDIGYAIQSYAEKHNY
SVVRDYTGHGIGRVFHDKPSILNYGRNGTGLTLKEGMFFTVEPMINAGNYDTILSKLDGWTVTTRDKSLSAQFEHTIGVT
KDGFEIFTLSPKKLDYPPY
>P0A1X6 3.4.11.18~~~map~~~Methionine aminopeptidase~~~
MAISIKTSEDIEKMRVAGRLAAEVLEMIEPYIKPGVTTGELDRICNDYIVNEQHAISACLGYHGYPKSVCISINEVVCHG
IPDDAKHLKDGDIVNIDVTVIKDEFHGDTSKMFIVGKPTILGERLCRVTQESLYLGIKMVKPGIRLRTIGAAIQKYAEGE
GFSVVREYCGHGIGRGFHEEPQVLHYDADDGGVVLQPGMTFTIEPMLNAGDYRIRTMKDGWTVKTKDRSLSAQYEHTIVV
TENGCEILTLRKDDTIPAIITHDE
>P0A078 3.4.11.18~~~map~~~Methionine aminopeptidase~~~
MIVKTEEELQALKEIGYICAKVRNTMQAATKPGITTKELDNIAKELFEEYGAISAPIHDENFPGQTCISVNEEVAHGIPS
KRVIREGDLVNIDVSALKNGYYADTGISFVVGESDDPMKQKVCDVATMAFENAIAKVKPGTKLSNIGKAVHNTARQNDLK
VIKNLTGHGVGLSLHEAPAHVLNYFDPKDKTLLTEGMVLAIEPFISSNASFVTEGKNEWAFETSDKSFVAQIEHTVIVTK
DGPILTTKIEEE
>P99121 3.4.11.18~~~map~~~Methionine aminopeptidase~~~
MIVKTEEELQALKEIGYICAKVRNTMQAATKPGITTKELDNIAKELFEEYGAISAPIHDENFPGQTCISVNEEVAHGIPS
KRVIREGDLVNIDVSALKNGYYADTGISFVVGESDDPMKQKVCDVATMAFENAIAKVKPGTKLSNIGKAVHNTARQNDLK
VIKNLTGHGVGLSLHEAPAHVLNYFDPKDKTLLTEGMVLAIEPFISSNASFVTEGKNEWAFETSDKSFVAQIEHTVIVTK
DGPILTTKIEEE
>E6ENP9 3.1.3.90~~~mapP~~~Maltose 6'-phosphate phosphatase~~~
MNLLTINTHSWLEEEPLKKLEEIAKVILSSESEIIALQEVNQKVASKKVPLEQLTTFCPIATQTPIHEDNFAYLIVQYLA
EKGQHYYWSWEMSHIGYAIYEEGNALLSKCPLTSEALLISESQEPTNYRTRKILVAETESSKGTLTVVSGHFSWWETPCT
GFAYEWLQLEKYLAMGQQPLVILGDLNNPAGTTGYQLVENSYLPIQDAFVVAEETSGEATVEKKIDGWEENEAALRIDYA
FVPKQWHVRKYEVIFDGRKTPIVSDHFGLLIQLK
>Q8DR55 ~~~mapZ~~~Mid-cell-anchored protein Z~~~
MSKKRRNRHKKEAQEPQFDFDEAKELTVGQAIRKNEEVEAGVLPEDSILDKYVKQHRDEIEADKFATRQYKKEEFVETQS
LDDLIQEMREAVEKSEASSEEVPSSEDILLPLPLDDEEQGLDPLLLDDENPTEMTEEVEEEQNLSRLDQEDSEKKSKKGF
ILTVLALVSVIICVSAYYVYRQVARSTKEIETSQSTTANQSDVDDFNTLYDAFYTNSNKTALKNSQFDKLSQLKTLLDKL
EGSREHTLAKSKYDSLATQIKAIQDVNAQFEKPAIVDGVLDTNAKAKSDAKFTDIKTGNTELDKVLDKAISLGKSQQTST
SSSSSSQTSSSSSSQASSNTTSEPKPSSSNETRSSRSEVNMGLSSAGVAVQRSASRVAYNQSAIDDSNNSAWDFADGVLE
QILATSRSRGYITGDQYILERVNIVNGNGYYNLYKPDGTYLFTLNCKTGYFVGNGAGHADDLDY
>Q99QS1 ~~~map~~~Protein map~~~
MKFKSLITTTLALGVIASTGANFNTNEASAAAKPLDKSSSTLHHGHSNIQIPYTITVNGTSQNILSSLTFNKNQNISYKD
IENKVKSVLYFNRGISDIDLRLSKQAEYTVHFKNGTKRVIDLKSGIYTADLINTSDIKAISVNVDTKKQPKDKAKANVQV
PYTITVNGTSQNILSNLTFNKNQNISYKDLEGKVKSVLESNRGITDVDLRLSKQAKYTVNFKNGTKKVIDLKSGIYTANL
INSSDIKSININVDTKKHIENKAKRNYQVPYSINLNGTSTNILSNLSFSNKPWTNYKNLTSQIKSVLKHDRGISEQDLKY
AKKAYYTVYFKNGGKRILQLNSKNYTANLVHAKDVKRIEITVKTGTKAKADRYVPYTIAVNGTSTPILSKLKISNKQLIS
YKYLNDKVKSVLKSERGISDLDLKFAKQAKYTVYFKNGKKQVVNLKSDIFTPNLFSAKDIKKIDIDVKQYTKSKKK
>P69775 ~~~map~~~Protein map~~~
MKFKSLITTTLALGVIASTGANFNTNEASAAAKPLDKSSSTLHHGHSNIQIPYTITVNGTSQNILSSLTFNKNQNISYKD
IENKVKSVLYFNRGISDIDLRLSKQAEYTVHFKNGTKRVIDLKSGIYTADLINTSDIKAISVNVDTKKQPKDKAKANVQV
PYTITVNGTSQNILSNLTFNKNQNISYKDLEGKVKSVLESNRGITDVDLRLSKQAKYTVNFKNGTKKVIDLKSGIYTANL
INSSDIKSININVDTKKHIENKAKRNYQVPYSINLNGTSTNILSNLSFSNKPWTNYKNLTSQIKSVLKHDRGISEQDLKY
AKKAYYTVYFKNGGKRILQLNSKNYTANLVHAKDVKRIEITVKTGTKAKADRYVPYTIAVNGTSTPILSKLKISNKQLIS
YKYLNDKVKSVLKSERGISDLDLKFAKQAKYTVYFKNGKKQVVNLKSDIFTPNLFSAKDIKKIDIDVKQYTKSKKK
>Q53599 ~~~map~~~Protein map~~~
MKFKSLITTTLALGVIASTGANLDTNEASAAAKQIDKSSSSLHHGYSKIQIPYTITVNGTSQNILSSLTFNKNQQISYKD
IENKVKSVLYFNRGISDIDLRLSKQAKYTVHFKNGTKRVVDLKAGIHTADLINTSDIKAISVNVDTKKQVKDKEAKANVQ
VPYTITVNGTSQNILSNLTFKKNQQISYKDLENNVKSVLKSNRGITDVDLRLSKQAKFTVNFKNGTKKVIDLKAGIYTAN
LINTGGIKNININVETKKQAKDKEAKVNNQVPYSINLNGTTTNIQSNLAFSNKPWTNYKNLTTKVKSVLKSDRGVSERDL
KHAKKAYYTVYFKNGGKRVIHLNSNIYTANLVHAKDVKRIEVTVKTVSKVKAERYVPYTIAVNGASNPTLSDLKFTGDSR
VSYSDIKKKVKSVLKHDRGIGERELKYAEKATYTVHFKNGTKKVINLNSNISQLNLLYVKDIKNIDIDVKTGAKAKVYSY
VPYTIAVNGTTTPIASKLKLSNKQLIGYQDLNKKVKSVLKHDRGINDIELKFAKQAKYTIHFKNGKTQVVDLKSDIFTRN
LFSVKDIKKIDINVKQQSKSNKALNKVTNKATKVKFPVTINGFSNLVSNEFAFLHPHKITTNDLNAKLRLALRSDQGITK
HDIGLSERTVYKVYFKDGSSKLEDLKAAKQDSKVFKATDIKKVDIEIKF
>P0ACH5 ~~~marA~~~Multiple antibiotic resistance protein MarA~~~COG4977
MSRRNTDAITIHSILDWIEDNLESPLSLEKVSERSGYSKWHLQRMFKKETGHSLGQYIRSRKMTEIAQKLKESNEPILYL
AERYGFESQQTLTRTFKNYFDVPPHKYRMTNMQGESRFLHPLNHYNS
>P0AEY1 ~~~marC~~~UPF0056 inner membrane protein MarC~~~COG2095
MLDLFKAIGLGLVVLLPLANPLTTVALFLGLAGNMNSAERNRQSLMASVYVFAIMMVAYYAGQLVMDTFGISIPGLRIAG
GLIVAFIGFRMLFPQQKAIDSPEAKSKSEELEDEPSANIAFVPLAMPSTAGPGTIAMIISSASTVRQSSTFADWVLMVAP
PLIFFLVAVILWGSLRSSGAIMRLVGKGGIEAISRLMGFLLVCMGVQFIINGILEIIKTYH
>P29399 ~~~mstI~~~Marinostatin-L~~~
MKTTPFFANLLASQTRELTENELEMTAGGTASQQSPVQEVPEQPFATMRYPSDSDEDGFNFPV
>P27245 ~~~marR~~~Multiple antibiotic resistance protein MarR~~~COG1846
MKSTSDLFNEIIPLGRLIHMVNQKKDRLLNEYLSPLDITAAQFKVLCSIRCAACITPVELKKVLSVDLGALTRMLDRLVC
KGWVERLPNPNDKRGVLVKLTTGGAAICEQCHQLVGQDLHQELTKNLTADEVATLEYLLKKVLP
>Q9KS12 3.4.22.-~~~rtxA~~~Multifunctional-autoprocessing repeats-in-toxin~~~COG1073
MVFYLIPKRRVWLMGKPFWRSVEYFFTGNYSADDGNNNIVAIGFGGQIHAYGGDDHVTVGSIGATVYTGSGNDTVVGGSA
YLKVEDSTGHLIVKGAAGYADINKSGDGNVSFAGAAGGVSIDHLGNHGDVSYGGAAAYNGITRKGLSGNVTFAGAGGYNA
LWHETNQGNLSFTGAGAGNKLDRTWSNRYQGSHGDVTFDGAGAANSISSRVETGNITFRGAGADNHLVRKGKVGDITLQG
AGASNRIERTHQAEDVYTQTRGNIRFEGVGGYNSLYSDVAHGDIHFSGGGAYNTIIRKGSGNDFAKEGMTNAKADEIVLT
KAVMSGSWIGQDHHVTAVKSASEPNTYLFAFADSTYTKINKVQLRNDPQTGELKYYSTAWYKEVNHLSNLANQDISDNGG
FTAVNINGAYTLSDLKVEHQQSVTVHAVEKSLTEYEWVTYANGAVIDAKEVSLSDAKMGGHAIYADGTKVDVKAVKSNRQ
PNTYIYAKVLGPYTKIVVVELANDPETGALKYQARSWYKEGDHTANIANQDISSATGYNPMGKGGYSLSDLHYSVNAVRS
TSETVADIEEYTDQTLFKPANDSGESSGDVRFNGAGGGNVIKSNVTRGNVHFNGGGIANVILHSSQFGNTEFNGGGAANV
IVKSGEEGDLTFRGAGLANVLVHQSEQGKMDVYAGGAVNVLVRLGDGQYLAHLLAYGNISVQKGSGDSRVVMLGGYNTHT
QIGSGNGLWLAAGGFNVMTQVGKGDVAAVLAGGANVLTKMGEGELTSGMLGGANVITHISNDDQLSNTTAVALGGANILT
KKGKGNTLAVMGGGANVLTHVGDGTTTGVMVGGANILTKVGNGDTTGILLGVGNVLTHVGDGQTLGVMGAAGNIFTKVGD
GTSIAVMIGAGNIFTHVGEGNAWALMGGLGNVFTKVGNGDALALMVAEANVFTHIGDGMSVALMLAKGNVATKVGNGTTL
AAMVGNVNIFTHIGHGSTFAAMIGQANIMTKVGNDLTAALMVGKANIMTHVGDGTSLGLFAGEVNVMTKVGNGTTLAAMF
GKANIMTHVGDGLTGVLALGEANIVTKLGDDFMGVVAAAKANVVTHVGDATTAAVLAGKGNILTKVGEGTTVGLLISDVG
NVMTHVGDGTTIGIAKGKANLITKVGDGLGVNVTWGQANVFTQVGDGDRYNFAKGEANLITKVGDGQEVSVVQGEANIIT
HVGNGDDYTGAWGKANVITKVGHGQNVVLAKGEANIVTQVGDGDSFNALWSKGNIVTKVGDGMQVTAAKGQANITTTVGN
GLNVTAAYGDANINTKVGDGVSVNVAWGKYNINTKVGDGLNVAVMKGKANANIHVGDGLNINASYAQNNVAIKVGNGDFY
SLAVASSNTSSNKLSALFDNIKQTVLGVGGSQAINYLVQGDEASSSGTHKGRGAIATPEITKLDGFQMDAIKEVSSDLGD
SLTGSVTKVDTPDLNKMQHALNVDDSSVQAPNLIVNGDFELGEHGWQSTHGVEASYAGSVYGVEGEGHGARVTELDTYTN
TSLYQDLANLAQGEVIAVSFDFAKRAGLSNNEGIEVLWNGEVVFSSSGDESAWQQKNLKLTAQAGSNRIEFKGTGHNDGL
GYILDNVVATSESSQQANAIREHATQNPAAQNALSDKERAEADRQRLEQEKQKQLDAVAGSQSQLESTDQQALENNGQAQ
RDAVKEESEAVTAELAKLAQGLDVLDGQATHTGESGDQWRNDFAGGLLDGVQSQLDDAKQLANDKIAAAKQTLSDNNSKV
KESVAKSEAGVAQGEQNRAGVEQDIADAQADAEKRKADALAKGKDAQQAESDAHHAVNNAQSRGDRDVQLAENKANQAQA
DAQGAKQNEGDRPDRQGVTGSGLSGNAHSVEGAGETDSHVNTDSQTNADGRFSEGLTEQEQEALEGATNAVNRLQINAGI
RAKNSVSSMTSMFSETNSKSIVVPTKVSPEPERQEVTRRDVRISGVNLESLSAVQGSQPTGQLASKSVPGFKSHFASTSI
GIENELSGLVVVLPKNSAQTFGYVHDSQGNPLFMLTKDMNQGGYSNPVGINDIQGVNNWQTHTIELVTYPSEISDTAAVE
SRKEAMLWLAKEFTDHINQSNHQSLPHLVSDDGRFTLVISNSKHLIAAGNGTSIDAQGKTIGMTPSGQQATMAISAKEFG
TSSSPEVRLLESAPWYQAGLRDEFLANAKNTTLDDPATAQNVYAYLTSVYSKTADLAKEYGIYINDWDPASEGFSPNAQG
LTDPKVKNAWSILPRTKPVRMLELLSAEDSRYVRQQIAEKLKGTYSESLAKNVFEYFQYGGEVAGHGINNATTGSVQQPE
PAILFEFRSVPSALSDFVPKTASTVKVDVKALDHFDSASRKAIITEVNALVSGSEDFDAWYQEYRASKGQPPVKNPKSSA
SANHKAEWLMTQHAEQWAKITAPYTDNHETLTSTKLASNDKEELHALGETSNLENNKQQENVASIINTMLNDMLPFYALR
TERNLLVQEGDEGFEVRAWPGTEDKSKTIILEDPEDAAQHKAIERFILANFDNFEQMPDELFLVDNKVISHHEGRTHVLA
QKVDGAWQYNATVELMSVTELLDAANVTGKIRGESYQQVIDALTDYHASITEHADYEPESVEKLLNLRKKIEGYVLGHPD
SGRVEAMNSLLNQVNTRLDEVSLLSVAEQTIQAQNSFSRLYDQLEAANLKESKHLYLDQNGDFVTKGKGNLANIDLLGSR
EAVLEKVKLTVSNEYGQTVADTIFAGLSAKDLAKDGKGVDIAGLNKVHQAIEQHLSPVSATLYIWKPSDHSALGHAALQI
GQGRTQLEGQAAADFNQQNYVSWWPLGSKSSNISNILNVATKDQPDLKLRWSDFSQPAHQNDTLEHDVASEENDGFGLHD
GDIKLKRFIEKLNAAKGIDASFKEASEGYASVLLGNPDMLETTSIPAHVFQPFVEQWNDTSYDMMDVAHRFAQELRLQAQ
RSDDPELLEKRIGNVIRQFAERALEEIETFKASQADQGRVFRINLEGLDVAAMQAEWHRLSNDPDARYQLLTKNCSSTVA
KVLKAGGADKLIGHTWLPKFGVWTPTELFNFGQALQEAQLEIAAKKQSHQVTDVLDALSGNEKPKENVAIENDGTPPRDK
ESLSPLTRFLNNELYGDKEARRKIGEITQTLLDHAVEKGESQKITLQGEAGRLTGYYHQGTAPSEGETSSPSGKVVLFLH
GSGSSAEEQASAIRNHYQKQGIDMLAVNLRGYGESDGGPSEKGLYQDARTMFNYLVNDKGIDPSNIIIHGYSMGGPIAAD
LARYAAQNGQAVSGLLLDRPMPSMTKAITAHEVANPAGIVGAIAKAVNGQFSVEKNLEGLPKETSILLLTDNEGLGNEGE
KLRTKLTASGYNVTGEQTFYGHEASNRLMSQYADQIVSGLSSSASVDEDLDQQGLDTTSTKDQGISNKNDHLQVVDSKEA
LADGKILHNQNVNSWGPITVTPTTDGGETRFDGQIIVQMENDPVVAKAAANLAGKHAESSVVVQLDSDGNYRVVYGDPSK
LDGKLRWQLVGHGRDHSETNNTRLSGYSADELAVKLAKFQQSFNQAENINNKPDHISIVGCSLVSDDKQKGFGHQFINAM
DANGLRVDVSVRSSELAVDEAGRKHTKDANGDWVQKAENNKVSLSWDAQGEVVAKDERIRNGIAEGDIDLSRIGVNNVDE
PARGAIGDNNDVFDAPEKRKPETEVIANSSSSNQFSYSGNIQVNVGEGEFTAVNWGTSNVGIKVGTGGFKSLAFGDNNVM
VHIGDGESKHSVDIGGYQALEGAQMFLGNRNVSFNFGHSNDLILMMDKSIPTPPLVNPFDGAARISGVLQGIATSGEGED
WLAAQEQQWTLSGAKKFVKDMSGLDQSSSVDYTTLVELDSQNERDSRGLKHDAEATLNKQYNQWLSGNGNSGTSQLSRAD
KLRQANEKLAFNFAVGGQGADIQVTTGNWNFMFGDNIQSILDTNLGSLFGLMTQQFTATGQAKTTFTYTPQDLPRQLKNK
LLGQLAGVGAETTLADIFGVDYTASGQIVSRNGQAVDGVAILKEMLEVIGEFSGDQLQAFVDPAKLLDSLKAGIDMGADG
IKSFAETHGLKEKAPEEEKDNSSVSVNGANVNSAQGATVADGNTETAETQDRAFGFNSLNLPNLFATIFSQDKQKEMKSL
VENLKQNLTADLLNMKEKTFDFLRNSGHLQGDGDINISLGNYNFNWGGDGKDLGAYLGDNNNFWGGRGDDVFYATGKSNI
FTGGEGNDMGVLMGRENMMFGGDGNDTAVVAGRINHVFLGAGDDQSFVFGEGGEIDTGSGRDYVVTSGNFNRVDTGDDQD
YSVTIGNNNQVELGAGNDFANIFGNYNRINAGAGNDVVKLMGYHAVLNGGDGDDHLIATAISKFSQFNGGEGRDLMVLGG
YQNTFKGGTDVDSFVVSGDVIDNLVEDIRSEDNIVFNGIDWQKLWFERSGYDLKLSILRDPSNDSDQSKFEHIGSVTFSD
YFNGNRAQVVIGMSEKDLSGEREYTMLSDSAIDALVQAMSGFEPQAGDNGFIDSLESKSQAAISMAWSDVVHKKGLMV
>A0A2S3R7M0 3.4.22.-~~~~~~Multifunctional-autoprocessing repeats-in-toxin~~~
MGKPFWRSVEYFFTGNYSADDGNNSIVAIGFGGEIHAYGGDDHVTVGSIGATVYTGSGNDTVVGGSAYLRVEDTTGHLSV
KGAAGYADINKSGDGNVSFAGAAGGVSIDHLGNNGDVSYGGAAAYNGITRKGLSGNVTFKGAGGYNALWHETNQGNLSFA
GAGAGNKLDRTWFNRYQGSRGDVTFDGAGAANSISSRVETGNITFRGAGADNHLVRKGKVGDITLQGAGASNRIERTRQA
EDVYAQTRGNIRFEGVGGYNSLYSDVAHGDIHFSGGGAYNTITRKGSGSSFDAQGMEYAKAEDIVLTAAQMHGLSIDNGN
KFHAVTAVKSEREPNTYLFAIADGTYTKINKVRLYNDPETGKLKYYSEAWFKRGNHLAELARSDVSSAGGFEVNPINGGY
TLANIAVEHQQSVTVHAVEKNLTEYEWVTYANGTLIDAKDVALSEAKMGGHAISTDGTTVDVQAVKSNRKPNTYVYAKVL
GPYTKIVVVELANDPKTGALKYQARSWYKEGDHTANLANEDISSANGYHSMGKGGYSLSDLHYSVNAVRSTSETVADIDE
YTDQTLFKPATDSGESSGDVRFNGAGGGNVIKSNVTRGNVYFNGGGIANVILHSSQFGNTEFNGGGAANVIVKSGEEGDL
TFRGAGLANVLVHQSKQGKMDVYAGGAVNVLVRIGDGQYLAHLLAYGNISVHKGNGNSRVVMLGGYNTHTQIGSGNGLWL
AAGGFNVMTQVGKGDVASVLAGGANVLTKVGDGDLTAGMLGGANVITHISGDNETSNTTAVALGGANILTKKGKGNTLAV
MGGGANVLTHVGDGTTTGVMVGGANILTKVGNGDTTGIMLGVGNVLTHVGDGQTLGVMGAAGNIFTKVGDGTSIAVMIGA
GNIFTHVGEGNAWALMGGLGNVFTKVGNGDALALMVAEANVFTHIGDGMSVALMLAKGNVATKVGNGTTLAAMVGNANIF
THVGSGSTFAAMIGQANIMTKVGNDLTAALMVGKANIYTHVGDGTSLGIFAGEVNVMTKIGNGTTLAAMFGKANIMTHVG
DGLTGVLALGEANIVTKVGDDFMGVVAAAKANVVTHVGDATTAAVLAGKGNILTKVGEGTTVGLLISDIGNVMTHVGDGT
TIGIAKGKANIITKVGDGLGVNVAWGQANVFTQVGDGDRYNFAKGEANIITKVGDGKEVSVVQGKANIITHVGNGDDYTG
AWGKANVITKVGNGRNVVLAKGEANIVTQVGDGDSFNALWSKGNIVTKVGDGMQVTAAKGKANITTTVGDGLSVTAAYGD
ANINTKVGDGVSVNVAWGKYNINTKVGDGLNVAVMKGKANANIHVGDGLNINASYAQNNVAIKVGNGDFYSLAVASSNTS
SNKLSALFDNIKQTLLGVGGSQAINYLVQGDEASSSGTQKGRGAIATPEITKLDGFQMEAIEEVGSDLGDSLTGSVTKVD
TPDLNKMQNALDVDGSSDQTQAPNLIVNGDFEQGDRGWKSTHGVEASYSGNVYGVNGEGHGARVTELDTYTNTSLYQDLT
DLTEGEVIAVSFDFAKRAGLSNNEGIEVLWNGEVVFSSSGDASAWQQKTLKLTAHAGSNRIEFKGTGHNDGLGYILDNVV
AKSESSQQANAVSEHATQNQASQNALSDKERAEADRQRLEQEKQKQLDAVAGSQSQLESTDQQALGNNGQAQRDAVKEES
EAVTAELTKLAQGLDVLDGQATHTGESGDQWRNDFAGGLLDGVQSQLDDAKQLANDKIAAAKQTQSDNNSKVKESVAKSE
AGVAQGEQNRAGAEQDIAEAKADAETRKADAVAKSNDAKQAESDAHSAANDAQSRGDRDAMNAENKANQAQNDAKGTKQN
EGDRPDREGVAGSGLSGNAHSVEGAGETGSHITTDSQTNADGRFSEGLSEQEQEALEGATNAVNRLQINAGIRGKNSGST
ITSMFTETNSDSIVVPTTASQDVVRKEIRISGVNLEGLGEASHDSAESLVAARAEKVANLYRWLDTDNDVATDKYVPVPG
FERVDVDVSDEVKQRMIQSMSGYIEHTDNQVPKDQAEALATLFVESTLDYDWDKRVEFLTKLESYGYSFEAPHAEKSIVS
FWSGKNFKQYRDILDNAQTDGKKVVYDIDVKGNAFAIDLNKHLMRWGGLFLDPDNAEQNQLKSSIDAATFSNTGFWSSVY
ATGAQNDVYVIAEGGVRLGNYFWNVELPALRQLQREGLVGEIRLLDKPVSEYKDLPADQIGRRLTDAGVAVKVRFDALSH
ERQAELLADNPDGYKADTLVELDVKLSAIDSMLRESLPFYSLRTERNLLVQEGEEGFEVRSWPGIDGKSKTILLDNPEDA
AQQKSIERFILANFDNFEQMPDELFLVDNKVLSHHDGRTRIIAQKEDGAWTYNTNVELMSVTELLDAAHVNGKVRGDSYQ
QVIDALTEYHASTVEHADYELESVEKLLNLRKQIEGYVLGHPDSGRVEAMNSLLNQVNSRLEEVSVLAVSEQSIKAHDSF
SRLYDQLDNANLKESKHLYLDGNGDFVTKGKGNLATIDQLGGSDAVLEKVKAAVTHEYGQVVADTIFARLSANDLAKDGK
GIDIAGLNKVHQAIEQHMSPVSATMYIWKPSDHSTLGHAALQIGQGRTQLEGQAAADFNKQNYVSWWPLGSKSSNIRNIF
NVATEDQPDLKLRWSDFSQPAHQNDTLEHDMASEENDGFGLKDGETKLKRFIEKLNAAKGIDASYKDASEGYASVLLGNP
DMLASTGIPAHVFQPFVDQWNDTSYDMMDVANRFAEELQKQAQASGDPALVEKRIDNVVRLFAERALEEIEAFKASQADE
GRVFRINLEGLDVAAMQAEWNRLSNDPDARYQLLTKNCSSTVAKVLKAGGADKLIGHTWRPKFGVWTPTELFNFGQALQE
AQLEIAAKKQSHQVTDVLDALSGNEKHKENVTIENDGTPPRDKESLSPLTRFLNNELYGEKDARRKIGEITQTLLDHAVE
NGESQKVTLKGEAGRLTGYYHQGAASSEGETSATSGKVVLFLHGSGSSAEEQASAIRNHYQKQGIDMLAVNLRGYGESDG
GPSEKGLYQDARTMFNYLVNDKGIDPSNIIIHGYSMGGPIAADLARYAAQNGQAVSGLLLDRPMPSMTKAITAHEMANPA
GIVGAIAKAVNGQFSVEKNLKGLPKETPILLLTDNEGLGEEGEKLRAKLAIAGYNVTGEQTFYGHEASNRLMGQYADQIV
SGLFNAEQAAVEAGEVLKGLEKDFKRYGDALKPDTSVPGKSKDIRTTKDFLNGYKNDHAKEIVDGFRSDMSIKQLVDLFV
KGNWSAEQKGALAWEIESRALKVTFQNKSEKYNRLFREIASAGVVDAKATEQLAPQLMLLNLSNDGFGGRCDPLSKLVLV
AKQLENDGQVGVARQLLEKMYSAAAVLSNPTLYSDSEKANASKLLSSLAAIHAKNPMHDTSMKVWQEKLEGKQALTVNGV
VEKITDASANGKPVLLELDAPGHAMAAWAKDSGDDRVYGFYDPNAGIVEFSSAEKFGDYLTRFFGKSDLDMAQSYKLGKN
DAGEAIFNRVVVMDGNTLASYKPTFGDKTTMQGILDLPVFDATPIKKPTGGVASDLEALGDKTKVVVDLAQIFTVQELKE
RAKVFAKPIGASYQGILDQLDLVHQAKGRDQIAASFELNKKINDYIAEHPTSGRNQALTQLKEQVTSALFIGKMQVAQAG
IDAIAQTRPELAARIFMVAIEEANGKHVGLTDMMVRWANEDPYLAPKHGYKGETPSDLGFDAKYHVDLGEHYADFKQWLE
TSQSNGLLSKATLDESTKTVHLGYSYQELQDLTGAESVQMAFYFLKEAAKKADPISGDSAEMILLKKFADQSYLSQLDSD
RMDQIEGIYRSSHETDIDAWDRRYSGTGYDELTNKLASATGVDEQLAVLLDDRKGLLIGEVHGSDVNGLRFVNEQMDALK
KQGVTVIGLEHLRSDLAQPLIDRYLATGVMSSELSAMLKTKHLDVTLFENARANGIRIVALDANSSARPNVQGTEHGLMY
RAGAANNIAVEVLQNLPDGEKFVAIYGKAHLQSHKGIEGFVPGITHRLDLPALKVSDSNQFTVEQDDVSLRVVYDDVANK
PKITFKDSLSGANTALHNQNVNDWERVVVTPTADGGESRFDGQIIVQMENDDVVAKAAANLAGKHPESSVVVQIDSDGNY
RVVYGDPSKLDGKLRWQLVGHGRDDSESNNTRLSGYSADELAVKLAKFQQSFNQAENINNKPDHISIVGCSLVSDDKQKG
FGHQFINAMDANGLRVDVSVRSSELAVDEAGRKHTKDANGDWVQKAENNKVSLSWDEQGEVVAKDERIRNGIAEGDIDLS
RIGVSDVDEPARGAIGDNNDVFDAPEKRKAETETSSSSANNKLSYSGNIQVNVGDGEFTAVNWGTSNVGIKVGTGGFKSL
AFGDNNVMVHIGNGESKHSFDIGGYQALEGAQMFIGNRNVSFNLGRSNDLIVMMDKSIPTPPLVNPFDGAARISGVLQSI
ATSGEGQDWLAAQEQQWTLSGAKKFVKDMSGLDQSSSVDYTSLVELDSQNERSSRGLKHDAEAALNKQYNQWLSGNSDSD
TSKLSRADKLRQANEKLAFNFAVGGQGADIQVTTGNWNFMFGDNIQSILDTNLGSLFGLMTQQFSATGQAKTTFTYTPED
LPRQLKNKLLGQLAGVGAETTLADIFGVDYTASGQIVSRNGEAVDGVAILKEMLEVIGEFSGDQLQAFVDPAKLLDSLKS
GINMGADGIKSFAETHGLKEKAPEEEEDNSSVSVNGASVNSAQGATVADGSTETAETPDRAFGFNSLNLPNLFATIFSQD
KQKEMKSLVENLKENLTADLLNMKEKTFDFLRNSGHLQGDGDINISLGNYNFNWGGDGKDLGAYLGDNNNFWGGRGDDVF
YATGTSNIFTGGEGNDMGVLMGRENMMFGGDGNDTAVVAGRINHVFLGAGDDQSFVFGEGGEIDTGSGRDYVVTSGNFNR
VDTGDDQDYSVTIGNNNQVELGAGNDFANVFGNYNRINASAGNDVVKLMGYHAVLNGGEGEDHLIAAAISKFSQFNGGEG
RDLMVLGGYQNTFKGGTDVDSFVVSGDVIDNLVEDIRSEDNIVFNGIDWQKLWFERSGYDLKLSILRDPASDSDQAKFEH
IGSVTFSDYFNGNRAQVIIAMGEKDATGEREYTTLSESAIDALVQAMSGFDPQAGDNGFMDNLDSKSRVAITTAWADVVH
KKGITV
>Q1DB00 2.7.10.2~~~masK~~~Tyrosine-protein kinase MasK~~~COG0515
MSPPQTTLPVTEAGLVPLLQPYGPYVLVRKLAEGGMAEIFLAKLLGADGFERNVVIKRMLPHLTNNPDFVEMFRDEARLA
AKLAHPNIVQIQELGFAEGCYYICMEYLAGEDFSTTLRLAGRKRHYVPLPVVLRVLIDAARGLHFAHEFTNEAGQPLNVV
HRDISPSNLYLTYQGQVKVLDFGIAKAESRLVNTRTGVVKGKYMYMAPEQARGKEVDRRADIFALGVSLYEALTHVRPFS
RENDLAVLNALLQGELKPPRELRPDLPEELEAILLKAMAFKPEDRYPTAEAFADALETFLSEHLSGSGAMPLGAFLKGHF
GEERFTERSRIPTLATLTATYGGAAAGAQGQAPGAEPHGTNLYGVLAREGDATSAQRPGMSMRPSSPGVPAHGAASRGST
SPESAPTAGGRRWRTLAVGLAGGLMLAAAGIVGYRQWMTTPASVSLVPATVPVVEAVAPEAAAAQVGAPMEAVAPVGAAA
QAGSLTDAVANGAGGDVGETDSAQLSVDAAGVTETDEAGLAGAASDVEAEADEEGADAAPVRSKKASSQKRVTLGIDDVQ
RVVSRGRARITTCFERYKADLPSSQGEVQVQLTIVSSGKVRAGTRGPLASSGVGRCLEAQAERLRFPPHRDQEVTVVMPF
SWRVTQ
>P08997 2.3.3.9~~~aceB~~~Malate synthase A~~~COG2225
MTEQATTTDELAFTRPYGEQEKQILTAEAVEFLTELVTHFTPQRNKLLAARIQQQQDIDNGTLPDFISETASIRDADWKI
RGIPADLEDRRVEITGPVERKMVINALNANVKVFMADFEDSLAPDWNKVIDGQINLRDAVNGTISYTNEAGKIYQLKPNP
AVLICRVRGLHLPEKHVTWRGEAIPGSLFDFALYFFHNYQALLAKGSGPYFYLPKTQSWQEAAWWSEVFSYAEDRFNLPR
GTIKATLLIETLPAVFQMDEILHALRDHIVGLNCGRWDYIFSYIKTLKNYPDRVLPDRQAVTMDKPFLNAYSRLLIKTCH
KRGAFAMGGMAAFIPSKDEEHNNQVLNKVKADKSLEANNGHDGTWIAHPGLADTAMAVFNDILGSRKNQLEVMREQDAPI
TADQLLAPCDGERTEEGMRANIRVAVQYIEAWISGNGCVPIYGLMEDAATAEISRTSIWQWIHHQKTLSNGKPVTKALFR
QMLGEEMKVIASELGEERFSQGRFDDAARLMEQITTSDELIDFLTLPGYRLLA
>P77947 2.3.3.9~~~aceB~~~Malate synthase~~~
MSAPAPSPLAIVDAEPLPRQEEVLTDAALAFVAELLHRRFTRAVTNSSPRRADAAREIARTSTLDFLPETAAVRADDSWR
VAGPAALNDRRVEITGPTDRKMTINALNSGAKVWLADFEEPSAPTWENVVLGQVNLSDAYTRNIDFTDERTGKTYALRPL
KSWRPSSWPRGWHLNERHLVDPDGSFGARRVVVDFGLYFFHNAQRLLDLGKGPYFYLPKTESHLEARLWNDVFVFAQDYV
GIPQGTVRATVLIETITAAYEMEEILYELRDHASGLNAGRWDYLFSIVKNFPATRPKFVLPDRNVTMTKLTMTAFMRRYT
NCSSYCHNARASAAIGGMAAFIPSRDAEVNKVAFEKVRADKDREANDGFDGSWVAHPDLVPIAMESFDRCWHRPNQKDRL
REDVHVEAADLIAVDSLEATTYAGSSSTAVQVGIRYIEAWLRGLGRVAIFNLMEDAATAEISRSQIWQWINAGVEFEHGE
KATPIWPRKSRRGRKWRPVREELGEEPFAARDWQHAHDLVVQVSLDEDYADFLTLPAYERLRG
>P42450 2.3.3.9~~~glcB~~~Malate synthase G~~~COG2225
MTEQELLSAQTADNAGTDSTERVDAGGMQVAKVLYDFVTEAVLPRVGVDAEKFWSGFAAIARDLTPRNRELLARRDELQM
LIDDYHRNNSGTIDQEAYEDFLKEIGYLVEEPEAAEIRTQNVDTEISSTAGPQLVVPILNARFALNAANARWGSLYDALY
GTNAIPETDGAEKGKEYNPVRGQKVIEWGREFLDSVVPLDGASHADVEKYNITDGKLAAHIGDSVYRLKNRESYRGFTGN
FLDPEAILLETNGLHIELQIDPVHPIGKADKTGLKDIVLESAITTIMDFEDSVAAVDAEDKTLGYSNWFGLNTGELKEEM
SKNGRIFTRELNKDRVYIGRNGTELVLHGRSLLFVRNVGHLMQNPSILIDGEEIFEGIMDAVLTTVCAIPGIAPQNKMRN
SRKGSIYIVKPKQHGPEEVAFTNELFGRVEDLLDLPRHTLKVGVMDEERRTSVNLDASIMEVADRLAFINTGFLDRTGDE
IHTSMEAGAMVRKADMQTAPWKQAYENNNVDAGIQRGLPGKAQIGKGMWAMTELMAEMLEKKIGQPREGANTAWVPSPTG
ATLHATHYHLVDVFKVQDELRAAGRRDSLRNILTIPTAPNTNWSEEEKKEEMDNNCQSILGYVVRWVEHGVGCSKVPDIH
DIDLMEDRATLRISSQMLANWIRHDVVSKEQVLESLERMAVVVDKQNAGDEAYRDMAPNYDASLAFQAAKDLIFEGTKSP
SGYTEPILHARRREFKAKN
>P37330 2.3.3.9~~~glcB~~~Malate synthase G~~~COG2225
MSQTITQSRLRIDANFKRFVDEEVLPGTGLDAAAFWRNFDEIVHDLAPENRQLLAERDRIQAALDEWHRSNPGPVKDKAA
YKSFLRELGYLVPQPERVTVETTGIDSEITSQAGPQLVVPAMNARYALNAANARWGSLYDALYGSDIIPQEGAMVSGYDP
QRGEQVIAWVRRFLDESLPLENGSYQDVVAFKVVDKQLRIQLKNGKETTLRTPAQFVGYRGDAAAPTCILLKNNGLHIEL
QIDANGRIGKDDPAHINDVIVEAAISTILDCEDSVAAVDAEDKILLYRNLLGLMQGTLQEKMEKNGRQIVRKLNDDRHYT
AADGSEISLHGRSLLFIRNVGHLMTIPVIWDSEGNEIPEGILDGVMTGAIALYDLKVQKNSRTGSVYIVKPKMHGPQEVA
FANKLFTRIETMLGMAPNTLKMGIMDEERRTSLNLRSCIAQARNRVAFINTGFLDRTGDEMHSVMEAGPMLRKNQMKSTP
WIKAYERNNVLSGLFCGLRGKAQIGKGMWAMPDLMADMYSQKGDQLRAGANTAWVPSPTAATLHALHYHQTNVQSVQANI
AQTEFNAEFEPLLDDLLTIPVAENANWSAQEIQQELDNNVQGILGYVVRWVEQGIGCSKVPDIHNVALMEDRATLRISSQ
HIANWLRHGILTKEQVQASLENMAKVVDQQNAGDPAYRPMAGNFANSCAFKAASDLIFLGVKQPNGYTEPLLHAWRLREK
ESH
>B8ZSN3 2.3.3.9~~~glcB~~~Malate synthase G~~~
MTDRVSAGNLRVARVLYDFVNNEALPGTDINPNSFWSGVAKVVADLTPQNQSLLNSRDELQAQIDKWHRHRVIEPFDVDA
YRQFLIDIGYLLPEPDDFTISTSGVDDEITMTAGPQLVVPVLNARFALNAANARWGSLYDALYGTDTIPETEGAEKGSEY
NKIRGDKVIAYARKFMDQAVPLASDSWTNATGVSIFDGQLQIAIGTNSTGLASPEKFVGYNRQLRSSNWSVLLANHGLHI
EVLIDPESPIGKTDPVGIKDVILESAITTIMDFEDSVTAVDADDKVRGYRNWLGLNKGDLTEEVNKDGKTFTRVLNADRS
YTTPDGGELTLPGRSLLFVRNVGHLTTSDAILVDGGDGQEKEVFEGIIDAVFTGLAAIHGLKTGEANGPLTNSRTGSIYI
VKPKMHGPAEVAFTCELFSRVEDVLGLPQGTLKVGIMDEERRTTLNLKACIKAAADRVVFINTGFLDRTGDEIHTSMEAG
PMIRKGAMKNSTWIKAYEDANVDIGLAAGFKGKAQIGKGMWAMTELMADMVEQKIGQPKAGATTAWVPSPTAATLHAMHY
HQVDVAAVQQELTGQRRATVDQLLTIPLAKELAWAPEEIREEVDNDCQSILGYVVRWVDQGIGCSKVPDIHNVALMEDRA
TLRISSQLLANWLRHGVITSEDVRASLERMAPLVDQQNAEDPAYRPMAPNFDDSIAFLAAQELILSGAQQPNGYTEPILH
RRRREFKAQNR
>B2HSY2 2.3.3.9~~~glcB~~~Malate synthase G~~~COG2225
MTDRVSAGNLRVARVLYDFVNNEALPGTDIDQDSFWAGVDKVVTDLTPQNQDLLKTRDDLQAQIDKWHRHRVIEPLDPQA
YREFLTEIGYLLPAPEDFTITTSGVDDEITTTAGPQLVVPILNARFALNAANARWGSLYDALYGTDVISESDGAEKGRGY
NKVRGDKVIAYARQFLDDSVPLAGASYTDATGFKVEDGQLVVSLADTSAALADPGQFAGYTGTAENPKSILLANHGLHIE
ILIDPESQIGATDGAGVKDVILESAITTIMDFEDSVAAVDADDKVLGYRNWLGLNRGDLSEDVTKDDKTFTRVLNTDRTY
TAPHGGELTLPGRSLLFVRNVGHLMTNDAIVSDAEGAEGAPVFEGIMDALFTGLIAIHGLRSTDANGLLTNSRTGSIYIV
KPKMHGPAEVAFTCELFSRVEDVLGLPQGTMKVGIMDEERRTTLNLKACIKAAADRVVFINTGFLDRTGDEIHTSMEAGP
MIRKGAMKNTAWIKAYEDANVDTGLAAGFSGKAQIGKGMWAMTELMADMVEQKIAQPKAGATTAWVPSPTAATLHAMHYH
KVDVFAVQKELQGKTRTSVDELLTIPLAKELAWAPEEIREEVDNNCQSILGYVVRWIDQGVGCSKVPDIHNVALMEDRAT
LRISSQLLANWLRHGVITSEDARASLERMAPLVDKQNAGDPEYHAMAPNFDDSIAFLAAQDLILSGAQQPNGYTEPILHR
RRRELKARAGA
>P9WK16 2.3.3.9~~~glcB~~~Malate synthase G~~~
MTDRVSVGNLRIARVLYDFVNNEALPGTDIDPDSFWAGVDKVVADLTPQNQALLNARDELQAQIDKWHRRRVIEPIDMDA
YRQFLTEIGYLLPEPDDFTITTSGVDAEITTTAGPQLVVPVLNARFALNAANARWGSLYDALYGTDVIPETDGAEKGPTY
NKVRGDKVIAYARKFLDDSVPLSSGSFGDATGFTVQDGQLVVALPDKSTGLANPGQFAGYTGAAESPTSVLLINHGLHIE
ILIDPESQVGTTDRAGVKDVILESAITTIMDFEDSVAAVDAADKVLGYRNWLGLNKGDLAAAVDKDGTAFLRVLNRDRNY
TAPGGGQFTLPGRSLMFVRNVGHLMTNDAIVDTDGSEVFEGIMDALFTGLIAIHGLKASDVNGPLINSRTGSIYIVKPKM
HGPAEVAFTCELFSRVEDVLGLPQNTMKIGIMDEERRTTVNLKACIKAAADRVVFINTGFLDRTGDEIHTSMEAGPMVRK
GTMKSQPWILAYEDHNVDAGLAAGFSGRAQVGKGMWTMTELMADMVETKIAQPRAGASTAWVPSPTAATLHALHYHQVDV
AAVQQGLAGKRRATIEQLLTIPLAKELAWAPDEIREEVDNNCQSILGYVVRWVDQGVGCSKVPDIHDVALMEDRATLRIS
SQLLANWLRHGVITSADVRASLERMAPLVDRQNAGDVAYRPMAPNFDDSIAFLAAQELILSGAQQPNGYTEPILHRRRRE
FKARAAEKPAPSDRAGDDAAR
>P9WK17 2.3.3.9~~~glcB~~~Malate synthase G~~~COG2225
MTDRVSVGNLRIARVLYDFVNNEALPGTDIDPDSFWAGVDKVVADLTPQNQALLNARDELQAQIDKWHRRRVIEPIDMDA
YRQFLTEIGYLLPEPDDFTITTSGVDAEITTTAGPQLVVPVLNARFALNAANARWGSLYDALYGTDVIPETDGAEKGPTY
NKVRGDKVIAYARKFLDDSVPLSSGSFGDATGFTVQDGQLVVALPDKSTGLANPGQFAGYTGAAESPTSVLLINHGLHIE
ILIDPESQVGTTDRAGVKDVILESAITTIMDFEDSVAAVDAADKVLGYRNWLGLNKGDLAAAVDKDGTAFLRVLNRDRNY
TAPGGGQFTLPGRSLMFVRNVGHLMTNDAIVDTDGSEVFEGIMDALFTGLIAIHGLKASDVNGPLINSRTGSIYIVKPKM
HGPAEVAFTCELFSRVEDVLGLPQNTMKIGIMDEERRTTVNLKACIKAAADRVVFINTGFLDRTGDEIHTSMEAGPMVRK
GTMKSQPWILAYEDHNVDAGLAAGFSGRAQVGKGMWTMTELMADMVETKIAQPRAGASTAWVPSPTAATLHALHYHQVDV
AAVQQGLAGKRRATIEQLLTIPLAKELAWAPDEIREEVDNNCQSILGYVVRWVDQGVGCSKVPDIHDVALMEDRATLRIS
SQLLANWLRHGVITSADVRASLERMAPLVDRQNAGDVAYRPMAPNFDDSIAFLAAQELILSGAQQPNGYTEPILHRRRRE
FKARAAEKPAPSDRAGDDAAR
>Q9I636 2.3.3.9~~~glcB~~~Malate synthase G~~~
MTERVQVGGLQVAKVLFDFVNNEAIPGTGVSADTFWTGAEAVINDLAPKNKALLAKRDELQAKIDGWHQARAGQAHDAVA
YKAFLEEIGYLLPEAEDFQAGTQNVDDEIARMAGPQLVVPVMNARFALNASNARWGSLYDALYGTDVISEEGGAEKGKGY
NKVRGDKVIAFARAFLDEAAPLESGSHVDATSYSVKNGALVVALKNGSETGLKNAGQFLAFQGDAAKPQAVLLKHNGLHF
EIQIDPSSPVGQTDAAGVKDVLMEAALTTIMDCEDSVAAVDADDKVVIYRNWLGLMKGDLAEEVSKGGSTFTRTMNPDRV
YTRADGSELTLHGRSLLFVRNVGHLMTNDAILDKDGNEVPEGIQDGLFTSLIAIHDLNGNTSRKNSRTGSVYIVKPKMHG
PEEAAFTNELFGRVEDVLGLPRNTLKVGIMDEERRTTVNLKACIKAAKDRVVFINTGFLDRTGDEIHTSMEAGAVVRKGA
MKSEKWIGAYENNNVDVGLATGLQGKAQIGKGMWAMPDLMAAMLEQKIGHPLAGANTAWVPSPTAATLHALHYHKVDVFA
RQAELAKRTPASVDDILTIPLAPNTNWTAEEIKNEVDNNAQGILGYVVRWIDQGVGCSKVPDINDVGLMEDRATLRISSQ
LLANWLRHGVISQEQVVESLKRMAVVVDRQNASDPSYRPMAPNFDDNVAFQAALELVVEGTRQPNGYTEPVLHRRRREFK
AKNGL
>P0A8N0 ~~~matP~~~Macrodomain Ter protein~~~COG3120
MKYQQLENLESGWKWKYLVKKHREGELITRYIEASAAQEAVDVLLSLENEPVLVNGWIDKHMNPELVNRMKQTIRARRKR
HFNAEHQHTRKKSIDLEFIVWQRLAGLAQRRGKTLSETIVQLIEDAENKEKYANKMSSLKQDLQALLGKE
>Q8ZG78 ~~~matP~~~Macrodomain Ter protein~~~COG3120
MKYQQLENLESGWKWAYLVKKHREGEAITRHIENSAAQDAVEQLMKLENEPVKVQEWIDAHMNVNLATRMKQTIRARRKR
HFNAEHQHTRKKSIDLEFLVWQRLAVLARRRGNTLSDTVVQLIEDAERKEKYASQMSSLKQDLKDILDKEV
>Q56461 ~~~mauD~~~Methylamine utilization protein MauD~~~COG1225
MTFLIASNILLWIAFLGVTVVMLGLMRQVGLLHERSSPMGAMITDHGPDIGDMAPEFDLPDYFGRSVHIGGASERPTLLM
FTAPTCPVCDKLFPIIKSIARAEKIGVVMISDGAPEEHARFLKNHELGQIRYVVSAEIGMAFQVGKIPYGVLVDGEGVIR
AKGLTNTREHLESLLEADKTGFASLQQFMASRKKNAA
>Q56460 ~~~mauE~~~Methylamine utilization protein MauE~~~COG2259
MQQVLQEPLIHWALRSFLAALFATAALSKLTGMEEFHGVVRNFRLLPDMASRAVAMVLPVAELAVAAGLMIPALAAPAAL
AAAALLGVFGLAIAVNVLRGRTQIDCGCFRNGMKQRISWAMVARNAVLTAMALGAAALLPAARPGGTADLATGMLAGSVL
FLLYFSASMLGGLPARHPSTASVKGR
>Q56463 ~~~mauF~~~Methylamine utilization protein MauF~~~COG0785
MASLDNFDMAAGRTDGVADCVAFPGRFSTWTRALILAASAAGGGAAALAMDAAHVALVLGLAAFAGGLLSTWSPCGYSSI
SLLRPTGKGARAVLDWLPTFATHGLGYALGALILGTLLGAIGGIAGLSGFATSFGLGLLAVIGLAYGAHQLDFLRVPYPQ
RRAQVPHDARQRFPKWVVGGLYGLSLGLDYLTYVQTPLLYLVTAAAVLSGNVAEAVALIAIFNLGRYLPVAVNLLPVTDY
QIQSWLGRNQERAAIADGAILTAVGAAFAMLALA
>Q51658 1.-.-.-~~~mauG~~~Methylamine utilization protein MauG~~~COG1858
MLRLACLAPLAILIPAAGTAEQARPADDALAALGAQLFVDPALSRNATQSCATCHDPARAFTDPREGKAGLAVSVGDDGQ
SHGDRNTPTLGYAALVPAFHRDANGKYKGGQFWDGRADDLKQQAGQPMLNPVEMAMPDRAAVAARLRDDPAYRTGFEALF
GKGVLDDPERAFDAAAEALAAYQATGEFSPFDSKYDRVMRGEEKFTPLEEFGYTVFITWNCRLCHMQRKQGVAERETFTN
FEYHNIGLPVNETAREASGLGADHVDHGLLARPGIEDPAQSGRFKVPSLRNVAVTGPYMHNGVFTDLRTAILFYNKYTSR
RPEAKINPETGAPWGEPEVARNLSLAELQSGLMLDDGRVDALVAFLETLTDRRYEPLLEESRAAQKD
>Q56464 ~~~mauJ~~~Methylamine utilization protein MauJ~~~
MWIPYDLRASLKGETDRETLRLSRADGQPRQFLVGFFLRNPVTQAWELDLAAEDGLPELPAVPAGASFSICPNEAGKLAE
VIYRLPARSATEALELAHADLQPRLLAWLARVGRGMAIAGWRVADMTHRARWRSTPFRPSAMSLDFALAPVDRDLAPVVE
LFQRARNAPDPASRLLAAFAVLSAAVGHPAMAGSGAATLSITQDMLIHSGAIVLPDPLMGIALADLIALLRPEHDRLVGR
DGVLLPVLDDLAGQKRLSLLANLADLVAHRLIQAELAARHRDAPAPAMAAGA
>Q5ZT21 ~~~mavE~~~Effector protein MavE~~~
MTRFIMLSFVTGYRKSISCNRFSRNVVSPYFFMRTIIMTRFERNFLINSLMFLETILSVDKKLDDAIHHFTQGQYENPRY
QINSRITNADDWSKEDKLKFTSAIAEAIALVSEKYENPTSETTEQIQSARNILLDNYVPLLTANTDPENRLKSVRENSSQ
IRKELIAKLKDEVPYKSQFENPYVLFPFVAATVAVAATAASVLFGNKP
>O53451 ~~~mazE3~~~Antitoxin MazE3~~~
MYLPWGVVLAGGANGFGAGAYQTGTICEVSTQIAVRLPDEIVAFIDDEVRGQHARSRAAVVLRALERERRRRLAERDAEI
LATNTSATGDLDTLAGHCARTALDID
>P9WJ91 ~~~mazE4~~~Probable antitoxin MazE4~~~
MPFLVALSGIISGVRDHSMTVRLDQQTRQRLQDIVKGGYRSANAAIVDAINKRWEALHDEQLDAAYAAAIHDNPAYPYES
EAERSAARARRNARQQRSAQ
>P9WJ89 ~~~mazE5~~~Antitoxin MazE5~~~
MKTARLQVTLRCAVDLINSSSDQCFARIEHVASDQADPRPGVWHSSGMNRIRLSTTVDAALLTSARDMRAGITDAALIDE
ALAALLARHRSAEVDASYAAYDKHPVDEPDEWGDLASWRRAAGDS
>P9WJ87 ~~~mazE6~~~Antitoxin MazE6~~~COG0864
MKTAISLPDETFDRVSRRASELGMSRSEFFTKAAQRYLHELDAQLLTGQIDRALESIHGTDEAEALAVANAYRVLETMDD
EW
>P9WJ85 ~~~mazE7~~~Antitoxin MazE7~~~
MSTSTTIRVSTQTRDRLAAQARERGISMSALLTELAAQAERQAIFRAEREASHAETTTQAVRDEDREWEGTVGDGLG
>P0CL61 ~~~mazE9~~~Antitoxin MazE9~~~COG3609
MKLSVSLSDDDVAILDAYVKRAGLPSRSAGLQHAIRVLRYPTLEDDYANAWQEWSAAGDTDAWEQTVGDGVGDAPR
>P0AE72 ~~~mazE~~~Antitoxin MazE~~~COG2336
MIHSSVKRWGNSPAVRIPATLMQALNLNIDDEVKIDLVDGKLIIEPVRKEPVFTLAELVNDITPENLHENIDWGEPKDKE
VW
>A0R0N3 ~~~mazE~~~Antitoxin MazE~~~
MTPARDRVRRHRERLRRQGLRPVQIWVPDVNAPEFRREAHRQSELVAAGEHEAEDQAFVDAISVDWDDA
>P0C7B4 ~~~mazE~~~Antitoxin MazE~~~
MLSFSQNRSHSLEQSLKEGYSQMADLNLSLANEAFPIECEACDCNETYLSSNSTNE
>Q7A4G8 ~~~mazE~~~Antitoxin MazE~~~
MLSFSQNRSHSLEQSLKEGYSQMADLNLSLANEAFPIECEACDCNETYLSSNSTNE
>P9WII1 3.1.-.-~~~mazF2~~~Probable endoribonuclease MazF2~~~COG2337
MRRGELWFAATPGGDRPVLVLTRDPVADRIGAVVVVALTRTRRGLVSELELTAVENRVPSDCVVNFDNIHTLPRTAFRRR
ITRLSPARLHEACQTLRASTGC
>P9WIH9 3.1.-.-~~~mazF3~~~Endoribonuclease MazF3~~~COG2337
MRPIHIAQLDKARPVLILTREVVRPHLTNVTVAPITTTVRGLATEVPVDAVNGLNQPSVVSCDNTQTIPVCDLGRQIGYL
LASQEPALAEAIGNAFDLDWVVA
>P9WII5 3.1.-.-~~~mazF4~~~Endoribonuclease MazF4~~~COG2337
MNAPLRGQVYRCDLGYGAKPWLIVSNNARNRHTADVVAVRLTTTRRTIPTWVAMGPSDPLTGYVNADNIETLGKDELGDY
LGEVTPATMNKINTALATALGLPWP
>P95272 3.1.-.-~~~mazF5~~~Probable endoribonuclease MazF5~~~COG2337
MTALPARGEVWWCEMAEIGRRPVVVLSRDAAIPRLRRALVAPCTTTIRGLASEVVLEPGSDPIPRRSAVNLDSVESVSVA
VLVNRLGRLADIRMRAICTALEVAVDCSR
>P9WII3 3.1.27.-~~~mazF6~~~Endoribonuclease MazF6~~~COG2337
MVISRAEIYWADLGPPSGSQPAKRRPVLVIQSDPYNASRLATVIAAVITSNTALAAMPGNVFLPATTTRLPRDSVVNVTA
IVTLNKTDLTDRVGEVPASLMHEVDRGLRRVLDL
>P0CL62 3.1.-.-~~~mazF7~~~Probable endoribonuclease MazF7~~~COG2337
MAEPRRGDLWLVSLGAARAGEPGKHRPAVVVSVDELLTGIDDELVVVVPVSSSRSRTPLRPPVAPSEGVAADSVAVCRGV
RAVARARLVERLGALKPATMRAIENALTLILGLPTGPERGEAATHSPVRWTGGRDP
>P71650 3.1.-.-~~~mazF9~~~Endoribonuclease MazF9~~~COG2337
MMRRGEIWQVDLDPARGSEANNQRPAVVVSNDRANATATRLGRGVITVVPVTSNIAKVYPFQVLLSATTTGLQVDCKAQA
EQIRSIATERLLRPIGRVSAAELAQLDEALKLHLDLWS
>P0AE70 3.1.27.-~~~mazF~~~Endoribonuclease toxin MazF~~~COG2337
MVSRYVPDMGDLIWVDFDPTKGSEQAGHRPAVVLSPFMYNNKTGMCLCVPCTTQSKGYPFEVVLSGQERDGVALADQVKS
IAWRARGATKKGTVAPEELQLIKAKINVLIG
>A0R0N4 3.1.-.-~~~mazF~~~Probable endoribonuclease MazF~~~COG2337
MRRGDIYTAAARGAYTGKPRPVLIIQDDRFDATASVTVVPFTTSDTDAPLLRIRIEPTPATGLSTVSSLMIDKVTTVPRS
SLTHRVGRLAETDMVRTDRALLVFLGIAG
>Q2FWI8 3.1.-.-~~~mazF~~~Endoribonuclease MazF~~~COG2337
MIRRGDVYLADLSPVQGSEQGGVRPVVIIQNDTGNKYSPTVIVAAITGRINKAKIPTHVEIEKKKYKLDKDSVILLEQIR
TLDKKRLKEKLTYLSDDKMKEVDNALMISLGLNAVAHQKN
>A6QIR4 3.1.-.-~~~mazF~~~Endoribonuclease MazF~~~
MIRRGDVYLADLSPVQGSEQGGVRPVVIIQNDTGNKYSPTVIVAAITGRINKAKIPTHVEIEKKKYKLDKDSVILLEQIR
TLDKKRLKEKLTYLSDDKMKEVDNALMISLGLNAVAHQKN
>Q7A2N3 3.1.-.-~~~mazF~~~Endoribonuclease MazF~~~
MIRRGDVYLADLSPVQGSEQGGVRPVVIIQNDTGNKYSPTVIVAAITGRINKAKIPTHVEIEKKKYKLDKDSVILLEQIR
TLDKKRLKEKLTYLSDDKMKEVDNALMISLGLNAVAHQKN
>Q7A4G9 3.1.-.-~~~mazF~~~Endoribonuclease MazF~~~
MIRRGDVYLADLSPVQGSEQGGVRPVVIIQNDTGNKYSPTVIVAAITGRINKAKIPTHVEIEKKKYKLDKDSVILLEQIR
TLDKKRLKEKLTYLSDDKMKEVDNALMISLGLNAVAHQKN
>P0AEY3 3.6.1.8~~~mazG~~~Nucleoside triphosphate pyrophosphohydrolase~~~COG3956
MNQIDRLLTIMQRLRDPENGCPWDKEQTFATIAPYTLEETYEVLDAIAREDFDDLRGELGDLLFQVVFYAQMAQEEGRFD
FNDICAAISDKLERRHPHVFADSSAENSSEVLARWEQIKTEERAQKAQHSALDDIPRSLPALMRAQKIQKRCANVGFDWT
TLGPVVDKVYEEIDEVMYEARQAVVDQAKLEEEMGDLLFATVNLARHLGTKAEIALQKANEKFERRFREVERIVAARGLE
MTGVDLETMEEVWQQVKRQEIDL
>A0R3C4 3.6.1.8~~~mazG~~~Nucleoside triphosphate pyrophosphohydrolase~~~COG3956
MIVVLVDPRRPALVPVDAVEFLTGDVQYTEEMPVKVPWSLPSARPAYDGEDAPVLLSSDPEHPVVKARLAAGDRLIAAPE
PQPGERLVDAVALMDKLRTSGPWESEQTHDSLRRYLLEETYELFDAVRSGNADELREELGDVLLQVLFHARIAEDAPHHP
FSIDDVADALVRKLGNRVPAVLAGESISLDEQLAQWEERKAQEKKVKARASSMDDVPTGQPALALAQKVLARVSQAGLPA
ELIPASLTSVSVSADTDSENELRTAVLEFMDTVREVEAAVAAGRRGEDVPEELDVAPLGVISEDEWRAYWPGAESSASEA
EPEE
>P96379 3.6.1.8~~~mazG~~~Nucleoside triphosphate pyrophosphohydrolase~~~COG3956
MIVVLVDPRRPTLVPVEAIEFLRGEVQYTEEMPVAVPWSLPAARSAHAGNDAPVLLSSDPNHPAVITRLAAGARLISAPD
SQRGERLVDAVAMMDKLRTAGPWESEQTHDSLRRYLLEETYELLDAVRSGSVDQLREELGDLLLQVLFHARIAEDASQSP
FTIDDVADTLMRKLGNRAPGVLAGESISLEDQLAQWEAAKASEKARKSVADDVHTGQPALALAQKVIQRAQKAGLPAHLI
PDEITSVSVSADVDAENTLRTAVLDFIDRLRCAERAIAVARRGSNVAEQLDVTPLGVITEQEWLAHWPTAVNDSRGGSKK
RKGMR
>Q9X015 3.6.1.1~~~mazG~~~Nucleoside triphosphate pyrophosphohydrolase/pyrophosphatase MazG~~~COG3956
MKEAGILFEELVSIMEKLRSPEGCEWDRKQTHESLKPYLIEECYELIEAIDEKNDDMMKEELGDVLLQVVFHAQIARERG
AFTIEDVIRTLNEKLIRRHPHVFGDSPGYSYKQWEDIKAQEKGKKKSSRIGEINPLVPALSMARRIQENASQVGFDWKDP
EGVYEKIEEELKELKEAKDPRELEEEFGDLLFSIVNLSRFLNVDPESALRKATRKFVERFKKMEELIEKDGLVLEELPIE
KLDEYWEKAKGGDET
>P9WLP6 ~~~mbcA~~~Mycobacterial cidal antitoxin MbcA~~~
MGVNVLASTVSGAIERLGLTYEEVGDIVDASPRSVARWTAGQVVPQRLNKQRLIELAYVADALAEVLPRDQANVWMFSPN
RLLEHRKPADLVRDGEYQRVLALIDAMAEGVFV
>P9WLP7 ~~~mbcA~~~Mycobacterial cidal antitoxin MbcA~~~
MGVNVLASTVSGAIERLGLTYEEVGDIVDASPRSVARWTAGQVVPQRLNKQRLIELAYVADALAEVLPRDQANVWMFSPN
RLLEHRKPADLVRDGEYQRVLALIDAMAEGVFV
>E3YBA4 1.11.1.-~~~mbnA~~~Methanobactin mb-OB3b~~~
MTVKIAQKKVLPVIGRAAALCGSCYPCSCM
>P9WLP8 2.4.2.-~~~mbcT~~~NAD(+) phosphorylase MbcT~~~
MSDALDEGLVQRIDARGTIEWSETCYRYTGAHRDALSGEGARRFGGRWNPPLLFPAIYLADSAQACMVEVERAAQAASTT
AEKMLEAAYRLHTIDVTDLAVLDLTTPQAREAVGLENDDIYGDDWSGCQAVGHAAWFLHMQGVLVPAAGGVGLVVTAYEQ
RTRPGQLQLRQSVDLTPALYQELRAT
>P9WLP9 2.4.2.-~~~mbcT~~~NAD(+) phosphorylase MbcT~~~COG5654
MSDALDEGLVQRIDARGTIEWSETCYRYTGAHRDALSGEGARRFGGRWNPPLLFPAIYLADSAQACMVEVERAAQAASTT
AEKMLEAAYRLHTIDVTDLAVLDLTTPQAREAVGLENDDIYGDDWSGCQAVGHAAWFLHMQGVLVPAAGGVGLVVTAYEQ
RTRPGQLQLRQSVDLTPALYQELRAT
>H9XP47 1.1.1.-~~~budC~~~Meso-2,3-butanediol dehydrogenase~~~
MRFDNKVVVITGAGNGMGEAAARRFSAEGAIVVLADWAKEAVDKVAASLPKGRAMAVHIDVSDHVAVEKMMNEVAEKLGR
IDVLLNNAGVHVAGSVLETSIDDWRRIAGVDIDGVVFCSKFALPHLLKTKGCIVNTASVSGLGGDWGAAYYCAAKGAVVN
LTRAMALDHGGDGVRINSVCPSLVKTNMTNGWPQEIRDKFNERIALGRAAEPEEVAAVMAFLASDDASFINGANIPVDGG
ATASDGQPKIV
>P13658 5.6.2.1~~~mbeA~~~DNA relaxase MbeA~~~
MIVKFHARGKGGGSGPVDYLLGRERNREGATVLQGNPEEVRELIDATPFAKKYTSGVLSFAEKELPPGGREKVMASFERV
LMPGLEKNQYSILWVEHQDKGRLELNFVIPNMELQTGKRLQPYYDRADRPRIDAWQTLVNHHYGLHDPNAPENRRTLTLP
DNLPETKQALAEGVTRGIDALYHAGEIKGRQDVIQALTEAGLEVVRVTRTSISIADPNGGKNIRLKGAFYEQSFADGRGV
REKAERESRIYRENAEQRVQEARRICKRGCDIKRDENQRRYSPVHSLDRGIAGKTPGRGERGDDAAQEGRVKAGREYGHD
VTGDSLSPVYREWRDALVSWREDTGEPGRNQEAGRDIAETEREDMGRGVCAGREQEIPCPSVREISGGDSLSGERVGTSE
GVTQSDRAGNTFAERLRAAATGLYAAAERMGERLRGIAEDVFAYATGQRDAERAGHAVESAGAALERADRTLEPVIQREL
EIREERLIQEREHVLSLERERQPEIQERTLDGPSLGW
>P13657 ~~~mbeC~~~Mobilization protein MbeC~~~
MIPMKRERMLTIRVTDDEHARLLERCEGKQLAVWMRRVCLGEPVARSGKLPTLAPPLLRQLAAIGNNLNQTARKVNSGQW
SSGDRVQVVAALMAIGDELRRLRLAVREQGARDDS
>P07386 ~~~mbhA~~~Myxobacterial hemagglutinin~~~
MAAYLVQNQWGGSQATWNPGGLWLIGARDKQNVVALDIKSDDGGKTLKGTMTYNGEGPIGFRGTLSSANNYTVENQWGGT
SAPWQPGGVWVLGARDKQNIVAVSIKSNDGGKTLTGTTTYNGEGPIGFKSEVTDGDTYSVENQWGGSAAPWHSGGVWVLG
TRGKQNVINVDAKSNDGGKTLSGTMTYNGEGPIGFRGTLTSPDTYTVENQWGGSTAPWNPGGFWMIGARNGQNVVALNVA
SSDGGKTLAGTMIYNGEGPIGFRARLG
>O33406 1.12.99.6~~~hoxL~~~Uptake hydrogenase large subunit~~~
MSVIQTPNGYKLDNSGRRVVVDPVTRIEGHMRCEVNVDSNNVIRNAVSTGTMWRGLEVILKGRDPRDAWAFVERICGVCT
GCHALASVRAVENALDIRIPPNAHLIREIMAKVLQWHDHVVHFYHLHALDWVNPVNALKADPKATSELQQLVGPNHPMSS
PGYFRDIQNRLKRFVESGELGIFKNGYWDNPAYKLSPEADLMATAHYLEALDIQKEIVKIHTIFGGKNPHPNFMVGGVPC
AINMDGDLAAGAPLNMERLNFVRARIEEAYEFSKNVYIPDVIAIATFYKGWLYGGGLSATNVMDYGDYAKVNYDKSTDQL
KGGAILNGNWNEVFPVDAADPEQIQEFVAHSWYKYPDEAKGLHPWDGVTEHNYALGPNTKGTRTDIKQLDEAAKYSWIKS
PRWRGHAVEVGPLSRYILNYAQGNQYVIEQVDSSLAAFNKLAGTNLTPKQALPSTIGRTLARALEAHYCAAMMLDDWKEL
IGNIKAGDSSTANVEKWDPSTWPKEAKGYGLVAAPRGANGHWIRIKDGKIANYQCIVPTTWNGSPRDPAGNIGAFEASLM
NTPMERPEEPVEILRTLHSFDPCLACSTHVMSEDGENLAKVTVR
>P31891 1.12.99.6~~~hoxG~~~Uptake hydrogenase large subunit~~~COG0374
MSAYATQGFNLDDRGRRIVVDPVTRIEGHMRCEVNVDANNVIRNAVSTGTMWRGLEVILKGRDPRDAWAFVERICGVCTG
CHALASVRAVENALDIRIPKNAHLIREIMAKTLQVHDHAVHFYHLHALDWVDVMSALKADPKRTSELQQLVSPAHPLSSA
GYFRDIQNRLKRFVESGQLGPFMNGYWGSKAYVLPPEANLMAVTHYLEALDLQKEWVKIHTIFGGKNPHPNYLVGGVPCA
INLDGIGAASAPVNMERLSFVKARIDEIIEFNKNVYVPDVLAIGTLYKQAGWLYGGGLAATNVLDYGEYPNVAYNKSTDQ
LPGGAILNGNWDEVFPVDPRDSQQVQEFVSHSWYKYADESVGLHPWDGVTEPNYVLGANTKGTRTRIEQIDESAKYSWIK
SPRWRGHAMEVGPLSRYILAYAHARSGNKYAERPKEQLEYSAQMINSAIPKALGLPETQYTLKQLLPSTIGRTLARALES
QYCGEMMHSDWHDLVANIRAGDTATANVDKWDPATWPLQAKGVGTVAAPRGALGHWIRIKDGRIENYQCVVPTTWNGSPR
DYKGQIGAFEASLMNTPMVNPEQPVEILRTLHSFDPCLACSTHVMSAEGQELTTVKVR
>P0ACD8 1.12.99.6~~~hyaB~~~Hydrogenase-1 large chain~~~COG0374
MSTQYETQGYTINNAGRRLVVDPITRIEGHMRCEVNINDQNVITNAVSCGTMFRGLEIILQGRDPRDAWAFVERICGVCT
GVHALASVYAIEDAIGIKVPDNANIIRNIMLATLWCHDHLVHFYQLAGMDWIDVLDALKADPRKTSELAQSLSSWPKSSP
GYFFDVQNRLKKFVEGGQLGIFRNGYWGHPQYKLPPEANLMGFAHYLEALDFQREIVKIHAVFGGKNPHPNWIVGGMPCA
INIDESGAVGAVNMERLNLVQSIITRTADFINNVMIPDALAIGQFNKPWSEIGTGLSDKCVLSYGAFPDIANDFGEKSLL
MPGGAVINGDFNNVLPVDLVDPQQVQEFVDHAWYRYPNDQVGRHPFDGITDPWYNPGDVKGSDTNIQQLNEQERYSWIKA
PRWRGNAMEVGPLARTLIAYHKGDAATVESVDRMMSALNLPLSGIQSTLGRILCRAHEAQWAAGKLQYFFDKLMTNLKNG
NLATASTEKWEPATWPTECRGVGFTEAPRGALGHWAAIRDGKIDLYQCVVPTTWNASPRDPKGQIGAYEAALMNTKMAIP
EQPLEILRTLHSFDPCLACSTHVLGDDGSELISVQVR
>P31883 1.12.5.1~~~hydB~~~Quinone-reactive Ni/Fe-hydrogenase large chain~~~COG0374
MTKRIVVDPITRIEGHLRIEVVVDENNVIQDAFSTATLWRGLETILKGRDPRDAGFFTQRICGVCTYSHYKAGISAVENA
LGIKPPLNAELIRSLMSISLILHDHTVHFYHLHGLDWCDITSALKADPVAASKLAFKYSPNPIATGADELTAVQKRVAEF
AAKGNLGPFANAYWGHKTYRFSPEQNLIVLSHYLKALEVQRVAAQMMAIWGAKQPHPQSLTVGGVTSVMDALDPSRLGDW
LTKYKYVADFVNRAYYADVVMAAEVFKSEPSVLGGCNVKNFYSYQEIPLNKTEWMYSTGIVMDGDITKVHEINEDLITEE
ATHAWYKENKALHPYDGQQDPNYTGFKDMETVGPDGTMVKTKVIDEKGKYTWIKAPRYGGKPLEVGPLATIVVGLAAKNP
RIEKVATQFLKDTGLPLAALFTTLGRTAARMLECKLSADYGFEAFNSLIANLKVDQSTYTTYKIDKNKEYKGRYMGTVPR
GVLSHWVRIKNGVIQNYQAVVPSTWNAGPRDANGTKGPYEASLVGMKLQDLSQPLEIIRVIHSFDPCIACAVHVMDTKGN
ELSQYRVDPITVGCNL
>P0ACE0 1.12.99.6~~~hybC~~~Hydrogenase-2 large chain~~~COG0374
MSQRITIDPVTRIEGHLRIDCEIENGVVSKAWASGTMWRGMEEIVKNRDPRDAWMIVQRICGVCTTTHALSSVRAAESAL
NIDVPVNAQYIRNIILAAHTTHDHIVHFYQLSALDWVDITSALQADPTKASEMLKGVSTWHLNSPEEFTKVQNKIKDLVA
SGQLGIFANGYWGHPAMKLPPEVNLIAVAHYLQALECQRDANRVVALLGGKTPHIQNLAVGGVANPINLDGLGVLNLERL
MYIKSFIDKLSDFVEQVYKVDTAVIAAFYPEWLTRGKGAVNYLSVPEFPTDSKNGSFLFPGGYIENADLSSYRPITSHSD
EYLIKGIQESAKHSWYKDEAPQAPWEGTTIPAYDGWSDDGKYSWVKSPTFYGKTVEVGPLANMLVKLAAGRESTQNKLNE
IVAIYQKLTGNTLEVAQLHSTLGRIIGRTVHCCELQDILQNQYSALITNIGKGDHTTFVKPNIPATGEFKGVGFLEAPRG
MLSHWMVIKDGIISNYQAVVPSTWNSGPRNFNDDVGPYEQSLVGTPVADPNKPLEVVRTIHSFDPCMACAVHVVDADGNE
VVSVKVL
>O33405 1.12.99.6~~~hoxS~~~Uptake hydrogenase small subunit~~~
MTPTETFYEVMRRQGVTRRSFLKFCSLTATALGLGPAYTSEIAHAMETKPRTPVLWLHGLECTCCSESFIRSAHPLVKDV
VLSMISLDYDDTLMAAAGHQAEAALADTIERYKGNYILAVEGNPPLNEDGMFCIIGGKPFVDQLRYAAKHAKAIISWGSC
ASHGCVQAARPNPTRATPVHQVITDKPIIKVPGCPPIAEVMTGVITYMLTFGKLPELDRTGRPKMFYSQRIHDKCYRRPH
FDAGQFVESFDDEGARRGYCLYKVGCKGPTTYNACSTIRWNEGTSFPIQAGHGCIGCSEEGFWDKGSWYARLQDIHQFGI
EANADQIGGTVAVGAAGAVAAHAAVSALKRAQTKRQTTTTTTPKEHV
>P21950 1.12.99.6~~~hoxK~~~Uptake hydrogenase small subunit~~~
MSRLETFYDVMRRQGITRRSFLKYCSLTAAALGLGPAFAPRIAHAMETKPRTPVLWLHGLECTCCSESFIRSAHPLVKDV
VLSMISLDYDDTLMAAAGHQAEAALEETMRKYKGEYILAVEGNPPLNEDGMFCIVGGKPFIEQLRHVAKDAKAVIAWGSC
ASWGCVQAARPNPTQAVPIHKVITDKPIVKVPGCPPIAEVMTGVITYMLTFGKLPELDRQGRPKMFYGQRIHDKCYRRPH
FDAGQFVEHWDDEGARKGYCLYKVGCKGPTSYNACSTVRWNEGTSFPIQAGHGCIGCSEDGFWDKGSFYERLTTIPQFGI
EKNADEIGAAVAGGVGAAIAAHAAVTAIKRLQNKGDRP
>P31892 1.12.99.6~~~hoxK~~~Uptake hydrogenase small subunit~~~COG1740
MVETFYEVMRRQGISRRSFLKYCSLTATSLGLGPSFLPQIAHAMETKPRTPVLWLHGLECTCCSESFIRSAHPLAKDVVL
SMISLDYDDTLMAAAGHQAEAILEEIMTKYKGNYILAVEGNPPLNQDGMSCIIGGRPFIEQLKYVAKDAKAIISWGSCAS
WGCVQAAKPNPTQATPVHKVITDKPIIKVPGCPPIAEVMTGVITYMLTFDRIPELDRQGRPKMFYSQRIHDKCYRRPHFD
AGQFVEEWDDESARKGFCLYKMGCKGPTTYNACSTTRWNEGTSFPIQSGHGCIGCSEDGFWDKGSFYDRLTGISQFGVEA
NADKIGGTASVVVGAAVTAHAAASAIKRASKKNETSGSEH
>P69739 1.12.99.6~~~hyaA~~~Hydrogenase-1 small chain~~~COG1740
MNNEETFYQAMRRQGVTRRSFLKYCSLAATSLGLGAGMAPKIAWALENKPRIPVVWIHGLECTCCTESFIRSAHPLAKDV
ILSLISLDYDDTLMAAAGTQAEEVFEDIITQYNGKYILAVEGNPPLGEQGMFCISSGRPFIEKLKRAAAGASAIIAWGTC
ASWGCVQAARPNPTQATPIDKVITDKPIIKVPGCPPIPDVMSAIITYMVTFDRLPDVDRMGRPLMFYGQRIHDKCYRRAH
FDAGEFVQSWDDDAARKGYCLYKMGCKGPTTYNACSSTRWNDGVSFPIQSGHGCLGCAENGFWDRGSFYSRVVDIPQMGT
HSTADTVGLTALGVVAAAVGVHAVASAVDQRRRHNQQPTETEHQPGNEDKQA
>P31884 1.12.5.1~~~hydA~~~Quinone-reactive Ni/Fe-hydrogenase small chain~~~COG1740
MLEEKGIERRDFMKWAGAMTAMLSLPATFTPLTAKAAELADRLPVIWLHMAECTGCSESLLRTDGPGIDSLIFDYISLEY
HETVMAAAGWQAEHNLEHAIEKYKGRYVLMVEGGIPAGSSEFYLTVGPHGTTGAEHARHASANAAAIFAIGSCSSFGGVQ
AARPNPTNAQPLSKVTNKPVINVPGCPPSEKNIVGNVLHFILFGTLPSVDAFNRPMWAYGLRIHDLCERRGRFDAGEFVQ
EFGDEGAKKGYCLYKVGCKGPYTFNNCSKLRFNQHTSWPVQAGHGCIGCSEPDFWDTMGPFEEPVANRLYATAFDGLGAD
KTADKIGITLLAATAVGVAAHAVLSMMVKDKENN
>P69741 1.12.99.6~~~hybO~~~Hydrogenase-2 small chain~~~COG1740
MTGDNTLIHSHGINRRDFMKLCAALAATMGLSSKAAAEMAESVTNPQRPPVIWIGAQECTGCTESLLRATHPTVENLVLE
TISLEYHEVLSAAFGHQVEENKHNALEKYKGQYVLVVDGSIPLKDNGIYCMVAGEPIVDHIRKAAEGAAAIIAIGSCSAW
GGVAAAGVNPTGAVSLQEVLPGKTVINIPGCPPNPHNFLATVAHIITYGKPPKLDDKNRPTFAYGRLIHEHCERRPHFDA
GRFAKEFGDEGHREGWCLYHLGCKGPETYGNCSTLQFCDVGGVWPVAIGHPCYGCNEEGIGFHKGIHQLANVENQTPRSQ
KPDVNAKEGGNVSAGAIGLLGGVVGLVAGVSVMAVRELGRQQKKDNADSRGE
>P28697 ~~~mbiA~~~Uncharacterized protein MbiA~~~
MRVSWLESKCDTPFANNLSFISSGSSSSSSFTLASTACRNSCLCSSSIFFQVLRRNCSSNCCSISNVDISLSAFSFNRFE
TSSKMARYNLPCPRSLLAILSPPKCCNSPAISCQLRRCCSGCPSIDLNSSLRISTLERRVLPFSLWVSNRAKFANCSSLQ
C
>P39751 ~~~mbl~~~Cell shape-determining protein Mbl~~~COG1077
MFARDIGIDLGTANVLIHVKGKGIVLNEPSVVALDKNSGKVLAVGEEARRMVGRTPGNIVAIRPLKDGVIADFEVTEAML
KHFINKLNVKGLFSKPRMLICCPTNITSVEQKAIKEAAEKSGGKHVYLEEEPKVAAIGAGMEIFQPSGNMVVDIGGGTTD
IAVISMGDIVTSSSIKMAGDKFDMEILNYIKREYKLLIGERTAEDIKIKVATVFPDARHEEISIRGRDMVSGLPRTITVN
SKEVEEALRESVAVIVQAAKQVLERTPPELSADIIDRGVIITGGGALLNGLDQLLAEELKVPVLVAENPMDCVAIGTGVM
LDNMDKLPKRKLS
>P71716 6.2.1.-~~~mbtA~~~Salicyl-AMP ligase / salicyl-S-ArCP synthetase~~~COG1021
MPPKAADGRRPSPDGGLGGFVPFPADRAASYRAAGYWSGRTLDTVLSDAARRWPDRLAVADAGDRPGHGGLSYAELDQRA
DRAAAALHGLGITPGDRVLLQLPNGCQFAVALFALLRAGAIPVMCLPGHRAAELGHFAAVSAATGLVVADVASGFDYRPM
ARELVADHPTLRHVIVDGDPGPFVSWAQLCAQAGTGSPAPPADPGSPALLLVSGGTTGMPKLIPRTHDDYVFNATASAAL
CRLSADDVYLVVLAAGHNFPLACPGLLGAMTVGATAVFAPDPSPEAAFAAIERHGVTVTALVPALAKLWAQSCEWEPVTP
KSLRLLQVGGSKLEPEDARRVRTALTPGLQQVFGMAEGLLNFTRIGDPPEVVEHTQGRPLCPADELRIVNADGEPVGPGE
EGELLVRGPYTLNGYFAAERDNERCFDPDGFYRSGDLVRRRDDGNLVVTGRVKDVICRAGETIAASDLEEQLLSHPAIFS
AAAVGLPDQYLGEKICAAVVFAGAPITLAELNGYLDRRGVAAHTRPDQLVAMPALPTTPIGKIDKRAIVRQLGIATGPVT
TQRCH
>P9WQ62 6.3.2.-~~~mbtB~~~Phenyloxazoline synthase MbtB~~~
MVHATACSEIIRAEVAELLGVRADALHPGANLVGQGLDSIRMMSLVGRWRRKGIAVDFATLAATPTIEAWSQLVSAGTGV
APTAVAAPGDAGLSQEGEPFPVAPMQHAMWVGRHDHQQLGGVAGHLYVEFDGARVDPDRLRAAATRLALRHPMLRVQFLP
DGTQRIPPAAGSRDFPISVADLRHVAPDVVDQRLAGIRDAKSHQQLDGAVFELALTLLPGERTRLHVDLDMQAADAMSYR
ILLADLAALYDGREPPALGYTYREYRQAIEAEETLPQPVRDADRDWWAQRIPQLPDPPALPTRAGGERDRRRSTRRWHWL
DPQTRDALFARARARGITPAMTLAAAFANVLARWSASSRFLLNLPLFSRQALHPDVDLLVGDFTSSLLLDVDLTGARTAA
ARAQAVQEALRTAAGHSAYPGLSVLRDLSRHRGTQVLAPVVFTSALGLGDLFCPDVTEQFGTPGWIISQGPQVLLDAQVT
EFDGGVLVNWDVREGVFAPGVIDAMFTHQVDELLRLAAGDDAWDAPSPSALPAAQRAVRAALNGRTAAPSTEALHDGFFR
QAQQQPDAPAVFASSGDLSYAQLRDQASAVAAALRAAGLRVGDTVAVLGPKTGEQVAAVLGILAAGGVYLPIGVDQPRDR
AERILATGSVNLALVCGPPCQVRVPVPTLLLADVLAAAPAEFVPGPSDPTALAYVLFTSGSTGEPKGVEVAHDAAMNTVE
TFIRHFELGAADRWLALATLECDMSVLDIFAALRSGGAIVVVDEAQRRDPDAWARLIDTYEVTALNFMPGWLDMLLEVGG
GRLSSLRAVAVGGDWVRPDLARRLQVQAPSARFAGLGGATETAVHATIFEVQDAANLPPDWASVPYGVPFPNNACRVVAD
SGDDCPDWVAGELWVSGRGIARGYRGRPELTAERFVEHDGRTWYRTGDLARYWHDGTLEFVGRADHRVKISGYRVELGEI
EAALQRLPGVHAAAATVLPGGSDVLAAAVCVDDAGVTAESIRQQLADLVPAHMIPRHVTLLDRIPFTDSGKIDRAEVGAL
LAAEVERSGDRSAPYAAPRTVLQRALRRIVADILGRANDAVGVHDDFFALGGDSVLATQVVAGIRRWLDSPSLMVADMFA
ARTIAALAQLLTGREANADRLELVAEVYLEIANMTSADVMAALDPIEQPAQPAFKPWVKRFTGTDKPGAVLVFPHAGGAA
AAYRWLAKSLVANDVDTFVVQYPQRADRRSHPAADSIEALALELFEAGDWHLTAPLTLFGHCMGAIVAFEFARLAERNGV
PVRALWASSGQAPSTVAASGPLPTADRDVLADMVDLGGTDPVLLEDEEFVELLVPAVKADYRALSGYSCPPDVRIRANIH
AVGGNRDHRISREMLTSWETHTSGRFTLSHFDGGHFYLNDHLDAVARMVSADVR
>P9WQ63 6.3.2.-~~~mbtB~~~Phenyloxazoline synthase MbtB~~~COG1020
MVHATACSEIIRAEVAELLGVRADALHPGANLVGQGLDSIRMMSLVGRWRRKGIAVDFATLAATPTIEAWSQLVSAGTGV
APTAVAAPGDAGLSQEGEPFPLAPMQHAMWVGRHDHQQLGGVAGHLYVEFDGARVDPDRLRAAATRLALRHPMLRVQFLP
DGTQRIPPAAGSRDFPISVADLRHVAPDVVDQRLAGIRDAKSHQQLDGAVFELALTLLPGERTRLHVDLDMQAADAMSYR
ILLADLAALYDGREPPALGYTYREYRQAIEAEETLPQPVRDADRDWWAQRIPQLPDPPALPTRAGGERDRRRSTRRWHWL
DPQTRDALFARARARGITPAMTLAAAFANVLARWSASSRFLLNLPLFSRQALHPDVDLLVGDFTSSLLLDVDLTGARTAA
ARAQAVQEALRSAAGHSAYPGLSVLRDLSRHRGTQVLAPVVFTSALGLGDLFCPDVTEQFGTPGWIISQGPQVLLDAQVT
EFDGGVLVNWDVREGVFAPGVIDAMFTHQVDELLRLAAGDDAWDAPSPSALPAAQRAVRAALNGRTAAPSTEALHDGFFR
QAQQQPDAPAVFASSGDLSYAQLRDQASAVAAALRAAGLRVGDTVAVLGPKTGEQVAAVLGILAAGGVYLPIGVDQPRDR
AERILATGSVNLALVCGPPCQVRVPVPTLLLADVLAAAPAEFVPGPSDPTALAYVLFTSGSTGEPKGVEVAHDAAMNTVE
TFIRHFELGAADRWLALATLECDMSVLDIFAALRSGGAIVVVDEAQRRDPDAWARLIDTYEVTALNFMPGWLDMLLEVGG
GRLSSLRAVAVGGDWVRPDLARRLQVQAPSARFAGLGGATETAVHATIFEVQDAANLPPDWASVPYGVPFPNNACRVVAD
SGDDCPDWVAGELWVSGRGIARGYRGRPELTAERFVEHDGRTWYRTGDLARYWHDGTLEFVGRADHRVKISGYRVELGEI
EAALQRLPGVHAAAATVLPGGSDVLAAAVCVDDAGVTAESIRQQLADLVPAHMIPRHVTLLDRIPFTDSGKIDRAEVGAL
LAAEVERSGDRSAPYAAPRTVLQRALRRIVADILGRANDAVGVHDDFFALGGDSVLATQVVAGIRRWLDSPSLMVADMFA
ARTIAALAQLLTGREANADRLELVAEVYLEIANMTSADVMAALDPIEQPAQPAFKPWVKRFTGTDKPGAVLVFPHAGGAA
AAYRWLAKSLVANDVDTFVVQYPQRADRRSHPAADSIEALALELFEAGDWHLTAPLTLFGHCMGAIVAFEFARLAERNGV
PVRALWASSGQAPSTVAASGPLPTADRDVLADMVDLGGTDPVLLEDEEFVELLVPAVKADYRALSGYSCPPDVRIRANIH
AVGGNRDHRISREMLTSWETHTSGRFTLSHFDGGHFYLNDHLDAVARMVSADVR
>P9WKF7 1.14.13.59~~~mbtG~~~L-lysine N6-monooxygenase MbtG~~~COG3486
MNPTLAVLGAGAKAVAVAAKASVLRDMGVDVPDVIAVERIGVGANWQASGGWTDGAHRLGTSPEKDVGFPYRSALVPRRN
AELDERMTRYSWQSYLIATASFAEWIDRGRPAPTHRRWSQYLAWVADHIGLKVIHGEVERLAVTGDRWALCTHETTVQAD
ALMITGPGQAEKSLLPGNPRVLSIAQFWDRAAGHDRINAERVAVIGGGETAASMLNELFRHRVSTITVISPQVTLFTRGE
GFFENSLFSDPTDWAALTFDERRDALARTDRGVFSATVQEALLADDRIHHLRGRVAHAVGRQGQIRLTLSTNRGSENFET
VHGFDLVIDGSGADPLWFTSLFSQHTLDLLELGLGGPLTADRLQEAIGYDLAVTDVTPKLFLPTLSGLTQGPGFPNLSCL
GLLSDRVLGAGIFTPTKHNDTRRSGEHQSFR
>P9WIP5 ~~~mbtH~~~Protein MbtH~~~COG3251
MSTNPFDDDNGAFFVLVNDEDQHSLWPVFADIPAGWRVVHGEASRAACLDYVEKNWTDLRPKSLRDAMVED
>P9WFX1 5.4.99.5~~~mbtI~~~Salicylate synthase~~~COG1169
MSELSVATGAVSTASSSIPMPAGVNPADLAAELAAVVTESVDEDYLLYECDGQWVLAAGVQAMVELDSDELRVIRDGVTR
RQQWSGRPGAALGEAVDRLLLETDQAFGWVAFEFGVHRYGLQQRLAPHTPLARVFSPRTRIMVSEKEIRLFDAGIRHREA
IDRLLATGVREVPQSRSVDVSDDPSGFRRRVAVAVDEIAAGRYHKVILSRCVEVPFAIDFPLTYRLGRRHNTPVRSFLLQ
LGGIRALGYSPELVTAVRADGVVITEPLAGTRALGRGPAIDRLARDDLESNSKEIVEHAISVRSSLEEITDIAEPGSAAV
IDFMTVRERGSVQHLGSTIRARLDPSSDRMAALEALFPAVTASGIPKAAGVEAIFRLDECPRGLYSGAVVMLSADGGLDA
ALTLRAAYQVGGRTWLRAGAGIIEESEPEREFEETCEKLSTLTPYLVARQ
>P9WK15 2.3.1.-~~~mbtK~~~Lysine N-acyltransferase MbtK~~~COG1670
MTKPTSAGQADDALVRLARERFDLPDQVRRLARPPVPSLEPPYGLRVAQLTDAEMLAEWMNRPHLAAAWEYDWPASRWRQ
HLNAQLEGTYSLPLIGSWHGTDGGYLELYWAAKDLISHYYDADPYDLGLHAAIADLSKVNRGFGPLLLPRIVASVFANEP
RCRRIMFDPDHRNTATRRLCEWAGCKFLGEHDTTNRRMALYALEAPTTAA
>P9WQF1 ~~~mbtL~~~Acyl carrier protein MbtL~~~COG0236
MTSSPSTVSTTLLSILRDDLNIDLTRVTPDARLVDDVGLDSVAFAVGMVAIEERLGVALSEEELLTCDTVGELEAAIAAK
YRDE
>A0QUA1 6.2.1.20~~~mbtM~~~Medium/long-chain-fatty-acid--[acyl-carrier-protein] ligase MbtM~~~COG0318
MNVLSAALTEAMTTSSADLVVFEPETRTWHRHPWGQVHLRAQNVAERIGQDGSSAVGIVGEPTVEGVAAILGALLAGSAV
SILPGLVRGADPDQWADSTLNRFANIGVTTVFSHGSYLEQLRTRDSSLVIHDDAEVAHAQRSTTLELGAPLGEFAVLQGT
AGSTGTPRTAQLRPDAVLANLRGLAERVGLAGSDIGCSWLPLYHDMGLTFLLSAAVGGTETWQAPTTAFASAPFSWVHWL
TESRATLTAAPNMAYGLIGKYSRRLTDVDLSAMRFALNGGEPVDIDGTARFGTELSRFGFDPGALSPSYGLAESSCAVTV
PVPGVGLKVDEITVTTEAGSSTQKLAVLGHAIAGMEVRLQPGDEDAGVVDREVGEVEIRGTSMMSGYRGEAPLDPGEWFP
TGDLGYLTDDGLVICGRKKELITVAGRNIFPTEIERIAARVKGVREGAVVAVGTNERAVRPGLVIAAEFRGPDEAGARSE
VVQRVASECGVVPADVVFLAPGSLPRTSSGKLRRLEVKRQLEESKG
>P9WQ41 6.2.1.20~~~mbtM~~~Medium/long-chain-fatty-acid--[acyl-carrier-protein] ligase MbtM~~~COG0318
MSELAAVLTRSMQASAGDLMVLDRETSLWCRHPWPEVHGLAESVAAWLLDHDRPAAVGLVGEPTVELVAAIQGAWLAGAA
VSILPGPVRGANDQRWADATLTRFLGIGVRTVLSQGSYLARLRSVDTAGVTIGDLSTAAHTNRSATPVASEGPAVLQGTA
GSTGAPRTAILSPGAVLSNLRGLNQRVGTDAATDVGCSWLPLYHDMGLAFVLSAALAGAPLWLAPTTAFTASPFRWLSWL
SDSGATMTAAPNFAYNLIGKYARRVSEVDLGALRVTLNGGEPVDCDGLTRFAEAMAPFGFDAGAVLPSYGLAESTCAVTV
PVPGIGLLADRVIDGSGAHKHAVLGNPIPGMEVRISCGDQAAGNASREIGEIEIRGASMMAGYLGQQPIDPDDWFATGDL
GYLGAGGLVVCGRAKEVISIAGRNIFPTEVELVAAQVRGVREGAVVALGTGDRSTRPGLVVAAEFRGPDEANARAELIQR
VASECGIVPSDVVFVSPGSLPRTSSGKLRRLAVRRSLEMAD
>P9WQF9 1.3.99.-~~~mbtN~~~Acyl-[acyl-carrier-protein] dehydrogenase MbtN~~~COG1960
MTAGSDLDDFRGLLAKAFDERVVAWTAEAEAQERFPRQLIEHLGVCGVFDAKWATDARPDVGKLVELAFALGQLASAGIG
VGVSLHDSAIAILRRFGKSDYLRDICDQAIRGAAVLCIGASEESGGSDLQIVETEIRSRDGGFEVRGVKKFVSLSPIADH
IMVVARSVDHDPTSRHGNVAVVAVPAAQVSVQTPYRKVGAGPLDTAAVCIDTWVPADALVARAGTGLAAISWGLAHERMS
IAGQIAASCQRAIGITLARMMSRRQFGQTLFEHQALRLRMADLQARVDLLRYALHGIAEQGRLELRTAAAVKVTAARLGE
EVISECMHIFGGAGYLVDETTLGKWWRDMKLARVGGGTDEVLWELVAAGMTPDHDGYAAVVGASKA
>Q3J5L6 4.1.3.24~~~mcl1~~~L-malyl-CoA/beta-methylmalyl-CoA lyase~~~COG2301
MSFRLQPAPPARPNRCQLFGPGSRPALFEKMAASAADVINLDLEDSVAPDDKAQARANIIEAINGLDWGRKYLSVRINGL
DTPFWYRDVVDLLEQAGDRLDQIMIPKVGCAADVYAVDALVTAIERAKGRTKPLSFEVIIESAAGIAHVEEIAASSPRLQ
AMSLGAADFAASMGMQTTGIGGTQENYYMLHDGQKHWSDPWHWAQAAIVAACRTHGILPVDGPFGDFSDDEGFRAQARRS
ATLGMVGKWAIHPKQVALANEVFTPSETAVTEAREILAAMDAAKARGEGATVYKGRLVDIASIKQAEVIVRQAEMISA
>B6E2X2 4.1.3.24~~~mcl1~~~L-malyl-CoA/beta-methylmalyl-CoA lyase~~~
MSFRTQPPAPARLNRCQLFGPGSRPAIFEKMAQSAADVINLDLEDSVAPDDKPQARRNIIEASHNIDWGNKYLSVRINGL
DTPFWYRDVVELLEDGSERIDQIMIPKVGCAADVYAVDALVTAIEAAKGRKKRISLEVIIESAAGIAHVEEIAAASPRLQ
AMSLGAADFAASMGMATTGIGGTQENYYMLHAGVKHWSDPWHWAQAAIVAACRTHGILPVDGPFGDFSDDEGFRAQALRS
ATLGMVGKWAIHPKQVALANEVFTPSDAAVAEAREILAAMEKAKAEGAGATVYKGRLVDIASIRQAEVIVRQAEMAKV
>Q02251 2.3.1.111~~~mas~~~Mycocerosic acid synthase~~~
MESRVTPVAVIGMGCRLPGGINSPDKLWESLLRGDDLVTEIPPDRWDADDYYDPEPGVPGRSVSRWGGFLDDVAGFDAEF
FGISEREATSIDPQQRLLLETSWEAIEHAGLDPASLAGSSTAVFTGLTHEDYLVLTTTAGGLASPYVVTGLNNSVASGRI
AHTLGLHGPAMTFDTACSSGLMAVHLACRSLHDGEADLALAGGCAVLLEPHACVAASAQGMLSSTGRCHSFDADADGFVR
SEGCAMVLLKRLPDALRDGNRIFAVVRGTATNQDGRTETLTMPSEDAQVAVYRAALAAAGVQPETVGVVEAHGTGTPIGD
PIEYRSLARVYGAGTPCALGSAKSNMGHSTASAGTVGLIKAILSLRHGVVPPLLHFNRLPDELSDVETGLFVPQAVTPWP
NGNDHTPKRVAVSSFGMSGTNVHAIVEEAPAEASAPESSPGDAEVGPRLFMLSSTSSDALRQTARQLATWVEEHQDCVAA
SDLAYTLARGRAHRPVRTAVVAANLPELVEGLREVADGDALYDAAVGHGDRGPVWVFSGQGSQWAAMGTQLLASEPVFAA
TIAKLEPVIAAESGFSVTEAITAQQTVTGIDKVQPAVFAVQVALAATMEQTYGVRPGAVVGHSMGESAAAVVAGALSLED
AARVICRRSKLMTRIAGAGAMGSVELPAKQVNSELMARGIDDVVVSVVASPQSTVIGGTSDTVRDLIARWEQRDVMAREV
AVDVASHSPQVDPILDDLAAALADIAPMTPKVPYYSATLFDPREQPVCDGAYWVDNLRNTVQFAAAVQAAMEDGYRVFAE
LSPHPLLTHAVEQTGRSLDMSVAALAGMRREQPLPHGLRGLLTELHRAGAALDYSALYPAGRLVDAPLPAWTHARLFIDD
DGQEQRAQGACTITVHPLLGSHVRLTEEPERHVWQGDVGTSVLSWLSDHQVHNVAALPGAAYCEMALAAAAEVFGEAAEV
RDITFEQMLLLDEQTPIDAVASIDAPGVVNFTVETNRDGETTRHATAALRAAEDDCPPPGYDITALLQAHPHAVNGTAMR
ESFAERGVTLGAAFGGLTTAHTAEAGAATVLAEVALPASIRFQQGAYRIHPALLDACFQSVGAGVQAGTATGGLLLPLGV
RSLRAYGPTRNARYCYTRLTKAFNDGTRGGEADLDVLDEHGTVLLAVRGLRMGTGTSERDERDRLVSERLLTLGWQQRAL
PEVGDGEAGSWLLIDTSNAVDTPDMLASTLTDALKSHGPQGTECASLSWSVQDTPPNDQAGLEKLGSQLRGRDGVVIVYG
PRVGDPDEHSLLAGREQVRHLVRITRELAEFEGELPRLFVVTRQAQIVKPHDSGERANLEQAGLRGLLRVISSEHPMLRT
TLIDVDEHTDVERVAQQLLSGSEEDETAWRNGDWYVARLTPSPLGHEERRTAVLDPDHDGMRVQVRRPGDLQTLEFVASD
RVPPGPGQIEVAVSMSSINFADVLIAFGRFPIIDDREPQLGMDFVGVVTAVGEGVTGHQVGDRVGGFSEGGCWRTFLTCD
ANLAVTLPPGLTDEQAITAATAHATAWYGLNDLAQIKAGDKVLIHSATGGVGQAAISIARAKGAEIFATAGNPAKRAMLR
DMGVEHVYDSRSVEFAEQIRRDTDGYGVDIVLNSLTGAAQRAGLELLAFGGRFVEIGKADVYGNTRLGLFPFRRGLTFYY
LDLALMSVTQPDRVRELLATVFKLTADGVLTAPQCTHYPLAEAADAIRAMSNAEHTGKLVLDVPRSGRRSVAVTPEQAPL
YRRDGSYIITGGLGGLGLFFASKLAAAGCGRIVLTARSQPNPKARQTIEGLRAAGADIVVECGNIAEPDTADRLVSAATA
TGLPLRGVLHSAAVVEDATLTNITDELIDRDWSPKVFGSWNLHRATLGQPLDWFCLFSSGAALLGSPGQGAYAAANSWVD
VFAHWRRAQGLPVSAIAWGAWGEVGRATFLAEGGEIMITPEEGAYAFETLVRHDRAYSGYIPILGAPWLADLVRRSPWGE
MFASTGQRSRGPSKFRMELLSLPQDEWAGRLRRLLVEQASVILRRTIDADRSFIEYGLDSLGMLEMRTHVETETGIRLTP
KVIATNNTARALAQYLADTLAEEQAAAPAAS
>A0A7R7ZDZ6 1.11.1.6~~~CatBsu~~~Manganese catalase~~~
MFKHTKMLQHPAKPDRPDPLFAKKMQEILGGQFGEISVAMQYLFQGWNTRGNEKYKDLLMDTATEELGHVEMIATMIARL
LEDAPLDQQEKAAEDPVIGSILGGMNPHHAIVSGLGAMPESSTGVPWSGGYIVASGNLLADFRANLNAESQGRLQVARLF
EMTDDKGVKDMLSFLLARDTMHQNQWLAAIKELEAQEGPVVPGTFPKALEKQEFSHQLINFSEGEESAKQNWLNEKAPDG
EAFEYVKEAKTFGEKPELKPAPPCVHNTLPGRE
>P80878 1.11.1.6~~~ydbD~~~Manganese catalase~~~COG3546
MFKHTKMLQHPAKPDRPDPLFAKKMQEILGGQFGEISVAMQYLFQGWNTRGNEKYKDLLMDTATEELGHVEMIATMIARL
LEDAPLDQQEKAAEDPVIGSILGGMNPHHAIVSGLGAMPESSTGVPWSGGYIVASGNLLADFRANLNAESQGRLQVARLF
EMTDDKGVKDMLSFLLARDTMHQNQWLAAIKELEAQEGPVVPGTFPKALEKQEFSHQLINFSEGEVSAEQNWLNEKAPDG
EAFEYVKEAKTFGEKPELKPAPPFVHNTLPGRE
>P60355 1.11.1.6~~~~~~Manganese catalase~~~
MFKHTRKLQYNAKPDRSDPIMARRLQESLGGQWGETTGMMSYLSQGWASTGAEKYKDLLLDTGTEEMAHVEMISTMIGYL
LEDAPFGPEDLKRDPSLATTMAGMDPEHSLVHGLNASLNNPNGAAWNAGYVTSSGNLVADMRFNVVRESEARLQVSRLYS
MTEDEGVRDMLKFLLARETQHQLQFMKAQEELEEKYGIIVPGDMKEIEHSEFSHVLMNFSDGDGSKAFEGQVAKDGEKFT
YQENPEAMGGIPHIKPGDPRLHNHQG
>A0R2W9 3.5.1.115~~~mca~~~Mycothiol S-conjugate amidase~~~COG2120
MSELRLMAVHAHPDDESSKGAATTARYAAEGARVMVVTLTGGERGDILNPAMDLPEVHGRIAEVRRDEMAKAAEILGVEH
HWLGFVDSGLPEGDPLPPLPDGCFALVPLEEPVKRLVRVIREFRPHVMTTYDENGGYPHPDHIRCHQVSVAAYEAAADHL
LYPDAGEPWAVQKLYYNHGFLRQRMQLLQEEFAKNGQEGPFAKWLEHWDPDNDVFANRVTTRVHCAEYFHQRDDALRAHA
TQIDPKGDFFHAPIEWQQRLWPTEEFELARARVPVTLPEDDLFKGVEP
>P9WJN1 3.5.1.115~~~mca~~~Mycothiol S-conjugate amidase~~~COG2120
MSELRLMAVHAHPDDESSKGAATLARYADEGHRVLVVTLTGGERGEILNPAMDLPDVHGRIAEIRRDEMTKAAEILGVEH
TWLGFVDSGLPKGDLPPPLPDDCFARVPLEVSTEALVRVVREFRPHVMTTYDENGGYPHPDHIRCHQVSVAAYEAAGDFC
RFPDAGEPWTVSKLYYVHGFLRERMQMLQDEFARHGQRGPFEQWLAYWDPDHDFLTSRVTTRVECSKYFSQRDDALRAHA
TQIDPNAEFFAAPLAWQERLWPTEEFELARSRIPARPPETELFAGIEP
>Q9ADK0 3.5.1.115~~~mca~~~Mycothiol S-conjugate amidase~~~COG2120
MTDQLRLMAVHAHPDDESSKGAATMAKYVSEGVDVLVVTCTGGERGSILNPKLQGDAYIEENIHEVRRKEMDEAREILGV
GQEWLGFVDSGLPEGDPLPPLPEGCFALEDVDKAAGELVRKIRSFRPQVITTYDENGGYPHPDHIMTHKITMVAFEGAAD
TEKYPESEYGTAYQPLKVYYNQGFNRPRTEALHHALLDRGLESPYEDWLKRWSEFERKERTLTTHVPCADFFEIRDKALI
AHATQIDPEGGWFRVPMEIQKEVWPTEEYELAKSLVETSLPEDDLFAGIRDNA
>P0AAX6 ~~~mcbA~~~Uncharacterized protein McbA~~~
MKKCLTLLIATVLSGISLTAYAAQPMSNLDSGQLRPAGTVSATGASNLSDLEDKLAEKAREQGAKGYVINSAGGNDQMFG
TATIYK
>P05834 ~~~mcbA~~~Bacteriocin microcin B17~~~
MELKASEFGVVLSVDALKLSRQSPLGVGIGGGGGGGGGGSCGGQGGGCGGCSNGCSGGNGGSGGSGSHI
>P23184 ~~~mcbB~~~Microcin B17-processing protein McbB~~~
MVLPDIKKGKDMINILPFEIISRNTKTLLITYISSVDITHEGMKKVLESLRSKQGIISEYLLDKLLDESLIDKDKGKEFL
ITTGVINKTKTSPLWVNSVIISDVPHLFSNAREQWKCDGVFVSHIIDIKDNNINVSDSTLIWLHLENYHSDIVKRIYSKF
ESNPGVAFIQSYYLKESFRIDGVYSPDLGTPCHFCHIERWLSREEKSFRRNEMSWANLLQLLKKYQMTLPALALGESERG
FSYHLIKRRLQELTGTSLVKSHVDNFMSSVSADLITCILCKEPVIHWQACSCLER
>P23185 ~~~mcbC~~~Microcin B17-processing protein McbC~~~
MSKHELSLVEVTHYTDPEVLAIVKDFHVRGNFASLPEFAERTFVSAVPLAHLEKFENKEVLFRPGFSSVINISSSHNFSR
ERLPSGINFCDKNKLSIRTIEKLLVNAFSSPDPGSVRRPYPSGGALYPIEVFLCRLSENTENWQAGTNVYHYLPLSQALE
PVATCNTQSLYRSLSGGDSERLGKPHFALVYCIIFEKALFKYRYRGYRMALMETGSMYQNAVLVADQIGLKNRVWAGYTD
SYVAKTMNLDQRTVAPLIVQFFGDVNDDKCLQ
>P23186 ~~~mcbD~~~Microcin B17-processing protein McbD~~~
MINVYSNLMSAWPATMAMSPKLNRNMPTFSQIWDYERITPASAAGETLKSIQGAIGEYFERRHFFNEIVTGGQKTLYEMM
PPSAAKAFTEAFFQISSLTRDEIITHKFKTVRAFNLFSLEQQEIPAVIIALDNITAADDLKFYPDRDTCGCSFHGSLNDA
IEGSLCEFMETQSLLLYWLQGKANTEISSEIVTGINHIDEILLALRSEGDIRIFDITLPGAPGHAVLTLYGTKNKISRIK
YSTGLSYANSLKKALCKSVVELWQSYICLHNFLIGGYTDDDIIDSYQRHFMSCNKYESFTDLCENTVLLSDDVKLTLEEN
ITSDTNLLNYLQQISDNIFVYYARERVSNSLVWYTKIVSPDFFLHMNNSGAININNKIYHTGDGIKVRESKMVPFP
>P76114 ~~~mcbR~~~HTH-type transcriptional regulator McbR~~~COG1802
MPGTGKMKHVSLTLQVENDLKHQLSIGALKPGARLITKNLAEQLGMSITPVREALLRLVSVNALSVAPAQAFTVPEVGKR
QLDEINRIRYELELMAVALAVENLTPQDLAELQELLEKLQQAQEKGDMEQIINVNRLFRLAIYHRSNMPILCEMIEQLWV
RMGPGLHYLYEAINPAELREHIENYHLLLAALKAKDKEGCRHCLAEIMQQNIAILYQQYNR
>O05393 2.5.1.134~~~mccA~~~O-acetylserine dependent cystathionine beta-synthase~~~COG0031
MTVITDITELIGNTPLLRLKNFDVPEGVAVYAKLEMMNPGGSIKDRLGDMLIRDALDSGKVKPGGVIIEATAGNTGIGLA
LSARKYGLKAIFCVPEHFSREKQQIMQALGASIIHTPRQDGMQGAIQKAIQLETEIENSYCVLQFKNRVNPSTYYKTLGP
EMWEALDGNIHTFVAGAGSGGTFAGTASFLKEKNPAVKTVIVEPVGSILNGGEPHAHKTEGIGMEFIPDYMDKSHFDEIY
TVTDENAFRLVKEAAEKEGLLIGSSSGAALYAALEEAKKASAGTNIVTVFPDSSDRYISKQIYEGGI
>Q8EJI6 1.8.99.-~~~sirA~~~Dissimilatory sulfite reductase SirA~~~COG0484
MKRWKTKTALGVLFCLGSAVSATTIASDAKSDGKVVPGVGNKQQTHYTQDILANPKVSENLMEKSRGVKTLQDYIVQEQE
LFDFLFENHPVFKYDAEGRLKGTYKVSDRGEEYLHGGDSVAYSKHSKEVNSTDGTAVRYSAYEDGQRPKALQYRLGAKSI
LDFPNKFVGPEKCGECHGPQYEKWRRSRHSKTIRFPGEHPEVDNDLKKPMYTTKDTSILPSGITPDAIYATVGTPRTKYG
FIDAYLVRGTYHVKDGLLKDGTGTMVAGGNQFSRGWAEWLTPEMAAKINKAIPSFPLKMEDFGTSGSHQWGMSSYGAKYE
KEFLFQPASSYCEMCHSFKFDFQTKEEFFAALGNPKELQKHTISKGITCEECHGAGGHLDGGIGGGMPSNCERCHQRFNF
VEELAETPQGQEKLEYAFNVKMKSSCPSCGTEGSQMFASAHYDKGMRCSTCHDPHEVTDGDWKSGITKPKIIKECTDCHT
AQAEIAKNTNTHSNQTCQSCHMPNMGSCENFTAIQFPDMAGFDNVRKSHMWKIDVDPLRKTLNPPEGKSRDATTKGWTVA
KDENGYNYLDLMWTCARTSASDHDVTENKGCHSQFQSELEVGLHFEDQMEIYGEVQKWQKPVKDLFGQVLQGLQRIDKLL
EVTQLPVDKKTEVLMLTDKAQDVIKLVEADGSWGAHGPRYTQKRLDAALTYVQQAQAIIDGNGYNAKM
>Q7MSJ8 1.8.99.-~~~mccA~~~Dissimilatory sulfite reductase MccA~~~COG0484
MLSGWSVLKGGNMKYWDKALLSLFMCVSTLSIAATHAVAMEGMQMTKEAREIIAHPKGTKESRGVISLQDYIVEEQAMYD
WLFKNHPIFTKYGGKTVGKLVVKDRGEEWIEEGRGNDFSKASKRSGGEGFSSMMYRVARNSTLQYPNKFIGPEKCGECHP
AQYETWSRSRHATTIRFPGEHPEVNNKLNDPVFDKDTASILPQGITPDVVYCTVGHIRTKFGFFDAWLLRGTYHVEGGLL
KNGTGQIVAGGNQWQRTWALNLSPEVAKKIKKWVPDFPVTLEEYGDNGGYVRGLASYAAKYKKSMSFQASTSYCEVCHPW
KFDFKNESEFYAALGNAKELQKHTISKGVSCEECHGAGGHLEGGSGLLISNCERCHQRFSYSPDLMRNNPLNAGKPDLAL
SSKFKSMGPGCGSEGSQTYFTAHYEKGMRCATCHDPHDVTGNVTGEKGIKGVSYNSEQGYLSSLYSKPKLKKECTDCHKE
QAYIQSKADTHSKNSCASCHMPFMMSCENFYAIQFQDQAGFDTQRRAHIWKIDVDPARKSLVAGSTSKDPRDGKDWHFER
NEEGRNFVDLMWACARTTWADKDQAEAKGCHSPVVSELKETLHFKDQKQVYNEVMGWQTPVKDKFTQVKVGIQGLYSLLE
VKKLAPSDKTRVYELIEKAQDTVDLIEKDGSWGMHGFKYTKQRLDAAVEYINEAQRIMKKSL
>O05394 4.4.1.1~~~mccB~~~Cystathionine gamma-lyase~~~COG0626
MKKKTLMIHGGITGDEKTGAVSVPIYQVSTYKQPKAGQHTGYEYSRTANPTRTALEALVTELESGEAGYAFSSGMAAITA
VMMLFNSGDHVVLTDDVYGGTYRVMTKVLNRLGIESTFVDTSSREEVEKAIRPNTKAIYIETPTNPLLKITDLTLMADIA
KKAGVLLIVDNTFNTPYFQQPLTLGADIVLHSATKYLGGHSDVVGGLVVTASKELGEELHFVQNSTGGVLGPQDSWLLMR
GIKTLGLRMEAIDQNARKIASFLENHPAVQTLYYPGSSNHPGHELAKTQGAGFGGMISFDIGSEERVDAFLGNLKLFTIA
ESLGAVESLISVPARMTHASIPRERRLELGITDGLIRISVGIEDAEDLLEDIGQALENI
>Q47511 ~~~mccF~~~Microcin C7 self-immunity protein MccF~~~
MMIQSHPLLAAPLAVGDTIGFFSSSAPATVTAKNRFFRGVEFLQRKGFKLVSGKLTGKTDFYRSGTIKERAQEFNELVYN
PDITCIMSTIGGDNSNSLLPFLDYDAIIANPKIIIGYSDTTALLAGIYAKTGLITFYGPALIPSFGEHPPLVDITYESFI
KILTRKQSGIYTYTLPEKWSDESINWNENKILRPKKLYKNNCAFYGSGKVEGRVIGGNLNTLTGIWGSEWMPEILNGDIL
FIEDSRKSIATIERLFSMLKLNRVFDKVSAIILGKHELFDCAGSKRRPYEVLTEVLDGKQIPVLDGFDCSHTHPMLTLPL
GVKLAIDFDNKNISITEQYLSTEK
>D3JV03 1.3.8.12~~~mcd~~~(2S)-methylsuccinyl-CoA dehydrogenase~~~
MTGQPLLGDLLTLASDALPEVEALFETARSALKERVTTDGKVSSKALEEEQFAAHALSWLATYVESLRQMRAWAGRLETE
GRFGEMEALILQIAFGEYLAQIRGGIPMSQTETARVQDIGIELGHPGEAVRRLIQAGNTPAARARLVALMRDNHGRATFG
ASGLDEELEMIRDQFRRFADERVAPHAHGWHMRDELIPMEIVEALAEMGVFGLTIPEEFGGFGLSKASMVVVSEELSRGY
IGVGSLGTRSEIAAELILCGGTDAQKAAWLPKLASGEILPTAVFTEPNTGSDLGSLRTRAVKDGDEWVVHGNKTWITHAA
RTHVMTLLARTDLETTDYRGLSMFLAEKVPGTDADPFPTPGMTGGEIEVLGYRGMKEYEIGFDGFRVKAENLLGGVEGQG
FKQLMQTFESARIQTAARAIGVAQNALEVGMQYAEERKQFGKALIEFPRVAGKLAMMAVEIMVARQLTYHSAWEKDHGQR
CDLEAGMAKLLGARVAWAAADNALQIHGGNGFALEYQISRILCDARILNIFEGAAEIQAQVIARRLLD
>P9WMG5 ~~~mce2R~~~HTH-type transcriptional regulator Mce2R~~~COG2186
MALQPVTRRSVPEEVFEQIATDVLTGEMPPGEALPSERRLAELLGVSRPAVREALKRLSAAGLVEVRQGDVTTVRDFRRH
AGLDLLPRLLFRNGELDISVVRSILEARLRNFPKVAELAAERNEPELAELLQDSLRALDTEEDPIVWQRHTLDFWDHVVD
SAGSIVDRLMYNAFRAAYEPTLAALTTTMTAAAKRPSDYRKLADAICSGDPTGAKKAAQDLLELANTSLMAVLVSQASRQ
>P95251 ~~~mce3R~~~Transcriptional repressor Mce3R~~~COG1309
MASVAQPVRRRPKDRKKQILDQAVGLFIERGFHSVKLEDIAEAAGVTARALYRHYDNKQALLAEAIRTGQDQYQSARRLT
EGETEPTPRPLNADLEDLIAAAVASRALTVLWQREARYLNEDDRTAVRRRINAIVAGMRDSVLLEVPDLSPQHSELRAWA
VSSTLTSLGRHSLSLPGEELKKLLYQACMAAARTPPVCELPPLPAGDAARDEADVLFSRYETLLAAGARLFRAQGYPAVN
TSEIGKGAGIAGPGLYRSFSSKQAILDALIRRLDEWRCLECIRALRANQQAAQRLRGLVQGHVRISLDAPDLVAVSVTEL
SHASVEVRDGYLRNQGDREAVWIDLIGKLVPATSVAQGRLLVAAAISFIEDVARTWHLTRYAGVADEISGLALAILTSGA
GNLLRA
>Q9Z4N4 ~~~mceA~~~Microcin E492~~~
MREISQKDLNLAFGAGETDPNTQLLNDLGNNMAWGAALGAPGGLGSAALGAAGGALQTVGQGLIDHGPVNVPIPVLIGPS
WNGSGSGYNSATSSSGSGS
>P62530 ~~~mchB~~~Microcin H47~~~
MREITESQLRYISGAGGAPATSANAAGAAAIVGALAGIPGGPLGVVVGAVSAGLTTAIGSTVGSGSASSSAGGGS
>O86200 ~~~mchI~~~Microcin H47 immunity protein MchI~~~
MSYKKLYQLTAIFSLPLTILLVSLSSLRIVGEGNSYVDVFLSFIIFLGFIELIHGIRKILVWSGWKNGS
>Q3IZ78 4.2.1.148~~~mch~~~Mesaconyl-CoA hydratase~~~COG2030
MKTNAGRFFEDYRLGETIAHAVPRTVSGGERALYHALYPARHALSSSDEFARACGLPAAPVDELMAFHLVFGKTVPDISL
NAVANLGYAEGRWLKPVFPGDTLRAESTVIGLKENSNGASGVVWVRTRGLNQQGEAVLSYVRWVMVRKRDTAAPAPAPTV
PELAGSVAASDLVIPEGLSFTDYDLTLAGEPHRWGDYAVGEKIDHVDGVTVEESEHMLATRLWQNTAKVHFDATNRPDGR
RLIYGGHVISLARTLSFNGLANAQMIVALNAGAHANPCFAGDTVRAWSEVLDKAETADPGVGALRLRLVAMKHGTEPFVT
RSEDGKYLPGVLLDLDYWALVPR
>A9WC34 4.2.1.148~~~mch~~~Beta-methylmalyl-CoA dehydratase~~~COG2030
MSAKTNPGNFFEDFRLGQTIVHATPRTITEGDVALYTSLYGSRFALTSSTPFAQSLGLERAPIDSLLVFHIVFGKTVPDI
SLNAIANLGYAGGRFGAVVYPGDTLSTTSKVIGLRQNKDGKTGVVYVHSVGVNQWDEVVLEYIRWVMVRKRDPNAPAPET
VVPDLPDSVPVTDLTVPYTVSAANYNLAHAGSNYLWDDYEVGEKIDHVDGVTIEEAEHMQATRLYQNTARVHFNLHVERE
GRFGRRIVYGGHIISLARSLSFNGLANALSIAAINSGRHTNPSFAGDTIYAWSEILAKMAIPGRTDIGALRVRTVATKDR
PCHDFPYRDAEGNYDPAVVLDFDYTVLMPRRG
>O85014 3.5.4.27~~~mch~~~Methenyltetrahydromethanopterin cyclohydrolase~~~COG3252
MSSNTSAPSLNALAGPLVESLVADAAKLRLIVAQENGARTVDAGANARGSIEAGRRIAEICLGGLGTVTIAPIGPVASWP
YTVVVHSADPVLACLGSQYAGWSLADEEGDSGFFALGSGPGRAVAVVEELYKELGYRDNATTTALVLESGSAPPASVVNK
VAAATGLAPENVTFIYAPTQSLAGSTQVVARVLEVALHKAHTVGFDLHKILDGIGSAPLSPPHPDFIQAMGRTNDAIIYG
GRVQLFVDADDADAKQLAEQIPSTTSADHGAPFAEIFSRVNGDFYKIDGALFSPAEAIVTSVKTGKSFRGGRLEPQLVDA
SFV
>L8EBJ9 ~~~mciZ~~~Cell division inhibitor MciZ~~~
MKVHRMPKGVVLVGKAWEIRAKLKEYGRTFQYVKDWISKP
>Q9X2V7 ~~~mcjA~~~Microcin J25~~~
MIKHFHFNKLSSGKKNNVPSPAKGVIQIKKSASQLTKGGAGHVPEYFVGIGTPISFYG
>Q9X2W0 ~~~mcjD~~~Microcin-J25 export ATP-binding/permease protein McjD~~~
MERKQKNSLFNYIYSLMDVRGKFLFFSMLFITSLSSIIISISPLILAKITDLLSGSLSNFSYEYLVLLACLYMFCVISNK
ASVFLFMILQSSLRINMQKKMSLKYLRELYNENITNLSKNNAGYTTQSLNQASNDIYILVRNVSQNILSPVIQLISTIVV
VLSTKDWFSAGVFFLYILVFVIFNTRLTGSLASLRKHSMDITLNSYSLLSDTVDNMIAAKKNNALRLISERYEDALTQEN
NAQKKYWLLSSKVLLLNSLLAVILFGSVFIYNILGVLNGVVSIGHFIMITSYIILLSTPVENIGALLSEIRQSMSSLAGF
IQRHAENKATSPSIPFLNMERKLNLSIRELSFSYSDDKKILNSVSLDLFTGKMYSLTGPSGSGKSTLVKIISGYYKNYFG
DIYLNDISLRNISDEDLNDAIYYLTQDDYIFMDTLRFNLRLANYDASENEIFKVLKLANLSVVNNEPVSLDTHLINRGNN
YSGGQKQRISLARLFLRKPAIIIIDEATSALDYINESEILSSIRTHFPDALIINISHRINLLECSDCVYVLNEGNIVASG
HFRDLMVSNEYISGLASVTE
>S5N020 4.1.3.24~~~mcl~~~Malyl-CoA/beta-methylmalyl-CoA/citramalyl-CoA lyase~~~
MRKLAHNFYKPLAIGAPEPIRELPVRPERVVHFFPPHVEKIRARIPEVAKQVDVLCGNLEDAIPMDAKEAARNGFIEVVK
ATDFGDTALWVRVNALNSPWVLDDIAEIVAAVGNKLDVIMIPKVEGPWDIHFVDQYLALLEARHQIKKPILIHALLETAQ
GMVNLEEIAGASPRMHGFSLGPADLAASRGMKTTRVGGGHPFYGVLADPQEGQAERPFYQQDLWHYTIARMVDVAVAHGL
RAFYGPFGDIKDEAACEAQFRNAFLLGCTGAWSLAPNQIPIAKRVFSPDVNEVLFAKRILEAMPDGSGVAMIDGKMQDDA
TWKQAKVIVDLARMIAKKDPDLAQAYGL
>Q3J4D7 5.4.99.2~~~mcm~~~Methylmalonyl-CoA mutase~~~COG1884
MTEDLDAWRKLAEKELKGKSPDSLTWNTLEGIPVKPLYTRADLAGMEHLDGLPGVAPFTRGVRATMYAGRPWTIRQYAGF
STAEASNAFYRKALAAGQQGVSVAFDLATHRGYDSDHPRVVGDVGKAGVAIDSIEDMKILFNGIPLEKISVSMTMNGAVI
PILANFIVTGEEQGVPRAALSGTIQNDILKEFMVRNTYIYPPEPSMRIIADIIEYTSKEMPKFNSISISGYHMQEAGANL
VQELAYTLADGREYVRAALARGMNVDDFAGRLSFFFAIGMNFFMEAAKLRAARLLWHRIMSEFAPKKPGSLMLRTHCQTS
GVSLQEQDPYNNVIRTAYEAMSAALGGTQSLHTNALDEAIALPTEFSARIARNTQIILQEETGVTRVVDPLAGSYYVESL
TAELAEKAWALIEEVEAMGGMTKAVASGMPKLRIEESAARRQAAIDRGEDVIVGVNKYRLAKEDPIEILDIDNVAVRDAQ
IARLEKMRATRDEAACQAALDELTRRAAEGGNLLEAAVDASRARASVGEISMAMEKVFGRHRAEVKTLSGVYGAAYEGDD
GFAQIQRDVESFAEEEGRRPRMLVVKMGQDGHDRGAKVIATAFADIGFDVDVGTLFQTPEEAAQDAIDNDVHVVGISSLA
AGHKTLAPKLIEALKEKGAGEILVICGGVIPQQDYDFLQQAGVKAIFGPGTNIPSAAKHILDLIREARS
>O86028 5.4.99.2~~~bhbA~~~Methylmalonyl-CoA mutase~~~COG1884
MTEKTIKDWEALAEKELRVSPEGLVWHTPEGIDVKPLYTSDDMSGIGHLNSLPGFEPFVRGPRATMYAGRPWTVRQYAGF
STAEASNAFYRRNLAAGQQGVSVAFDLATHRGYDSDHPRVQGDVGKAGVAIDSVEDMKILFDGIPLDRISVSMTMNGAVI
PILASFIVAGEEQGVSRDKLSGTIQNDILKEFMVRNTYIYPPEPSMRIVADIIEYTAKEMPKFNSISISGYHMQEAGATL
VQELAFTLADGREYVRAALAKGLNVDDFAGRLSFFFAIGMNFFMEAAKLRAARLLWTRIMQEFKPEKASSLMLRTHCQTS
GVSLQEQDPYNNIVRTAFEAMSAVLGGTQSLHTNSFDEAMALPTDFSARIARNTQLILQHETGVTKVVDPLAGSYYVESL
TNELAEKAWGLIEEVEALGGMTKAVNAGLPKRLIEEAATRRQAAVDRAEEVIVGVNKYRLENEQPIDILQIDNAAVRTAQ
VKRIEETRRRRDSQKMKQALDALADVARSGKGNLLAAAVEAARARATVGEITDAMREAFGDYTAIPEVVTDIYGKAYEGD
PELGVLAGRLGEATKRLGHKPKIMVAKLGQDGHDRGAKVIASAFGDIGFDVVAGPLFQTPEEAADLALAEEVTVIGVSSL
AAGHRTLMPQLAEALKKRGGEDIIVVCGGVIPRQDYDYLMENGVAAVFGPGTQVLDAARAVLDLIEGKRRNV
>Q46971 ~~~mcnN~~~Microcin N~~~
MRELDREELNCVGGAGDPLADPNSQIVRQIMSNAAWGAAFGARGGLGGMAVGAAGGVTQTVLQGAAAHMPVNVPIPKVPM
GPSWNGSKG
>Q69HT9 1.-.-.-~~~mco~~~Multicopper oxidase mco~~~
MMDMKENDQKRNDMMDMKSHDERKNLNSSQGKNEITFPKVLDPKKDNNGYKSYTLKAQKGKTEFYKGNFSNTLGYNGNLL
GPTLKLKKGDKVKIKLVNNLDENTTFHWHGLEIDGKVDGGPSQVIKPGKEKTIKFEVKQEAATLWYHPHPSPNTAKQVYN
GLSGLLYIEDDKKNNYPSNYGKNDLPIIIQDKTFVSKKLNYTKTKDEDGTQGDTVLVNGKVDPKLTTKEGKIRLRLLNGS
NARDLNLKLSNNQSFEYIASEGGHLEKTKKLKEINLAPSARKEIVIDLSKMKEDKVNLVDNDETVILPIINKEKSTNKDT
TPKVDKKIKLEGMDDNVTINGKKFDPNRIDFTQKVNRKETWEIENVKDKMSGMKHPFHIHGTQFKVLSVDGKKPSEDMRG
KKDVISLEPGQKAKIEVVFKNTGTYMFHCHILEHEDNGMMGQIKVTK
>Q2W8M7 ~~~~~~Methyl-accepting chemotaxis protein Amb0994~~~
METTLGSYARTLSLGMLVPSAICLLAGTFGLLGGSSIALWVVIAVSLLGVVGGVKIGGSARRMAGDLSTAIHVLSRSASG
DLNARILDVRGSGGIGALQHSINRLLDLAEAFGKEAFAAVESANHGRYYRRIITTGLRGDFVLYAKTINQALKRMEARDA
EFIAFANNQVKPVVNAVAAAATELEASSGAMSAQSTDTSHQAMTVAAAAEQASVNVQAVASAVEEFSASIKEISTQVHRA
AAVASEAAGVASRTDTTVHGLSDAAQRIGAIVSLINDIAAQTNLLALNATIEAARAGDAGKGFAVVANEVKNLANQTARA
TEDITSQVAHIQEVAAEAIKAIQEITRTVSQIEETSSAVAGAVEEQNAVTVEIARNVAEAATGTSSVSSAIITVQATAAE
ATESAGQVADAASELSRQSENLSREVDGFIARIGGR
>P02942 ~~~tsr~~~Methyl-accepting chemotaxis protein I~~~COG0840
MLKRIKIVTSLLLVLAVFGLLQLTSGGLFFNALKNDKENFTVLQTIRQQQSTLNGSWVALLQTRNTLNRAGIRYMMDQNN
IGSGSTVAELMESASISLKQAEKNWADYEALPRDPRQSTAAAAEIKRNYDIYHNALAELIQLLGAGKINEFFDQPTQGYQ
DGFEKQYVAYMEQNDRLHDIAVSDNNASYSQAMWILVGVMIVVLAVIFAVWFGIKASLVAPMNRLIDSIRHIAGGDLVKP
IEVDGSNEMGQLAESLRHMQGELMRTVGDVRNGANAIYSGASEIATGNNDLSSRTEQQAASLEETAASMEQLTATVKQNA
ENARQASHLALSASETAQRGGKVVDNVVQTMRDISTSSQKIADIISVIDGIAFQTNILALNAAVEAARAGEQGRGFAVVA
GEVRNLAQRSAQAAREIKSLIEDSVGKVDVGSTLVESAGETMAEIVSAVTRVTDIMGEIASASDEQSRGIDQVGLAVAEM
DRVTQQNAALVEESAAAAAALEEQASRLTEAVAVFRIQQQQRETSAVVKTVTPAAPRKMAVADSEENWETF
>Q9WYR0 ~~~mcp1~~~Methyl-accepting chemotaxis protein 1~~~COG0840
MSLRKKVFLLMIVVVAGLLLSFFLIYRSVSNSIINSVRSNTENQAKALSKFVVEKLNNVTNVARSAATYLGSQFFEAYMI
TNQLKTTVEKEKSTFAFAFSALSFNKSAALTDGNRVDRVDFADYEKYIKAVEGKDIFFMPETFQGTPVLTVVVPIETMNT
RTGIVGFGINLSENSDLWKAVVEEGKASKSGYGLLVTSDGKVLIHKDMGNFMKDVKELGGFEKAFEEAKSGGEKYVEYEY
NGEKKYTVWEKVPGYDFYIFSTGYLEELLAEGRKATLGTIVTYVVFGGVIFAVLFVSMMPVVKRMRQQVEKVKRFGEGDL
TVEFEAKGRDELTQIEESLKEAALSLKEMIVSIIEAAKELSGASEEIKVLSEESHKMSENLHEEAKKILDEANNMSSALT
EVTSGVEEVAASAQNISKITQDLTERSEAVTKAAREGTERVEAVGGVINKLKGSAERQRDYLRELVDSAKTIGEIVDTIS
SIAEQTNLLALNAAIEAARAGEAGRGFAVVADEIRKLAEESQRATEDIAKMLSSLRTTIEHVENGSKEMFEGVDEIAVMG
EEVTKRFREILGRIEEINSMIENTAATAQEQGAAAEEMASAMDNVTKIVEGVVESLNRMESLIEDQTTSAAKVSQAAERL
SELSEQLSTLVQKFKV
>Q2W4T8 ~~~~~~Methyl-accepting chemotaxis protein Amb2333~~~
MEGPYQRLPWGIWRMSISNWRFRAKIFLIVVLSLLGMGAIVAVNLANLHNELMAARKIKTQHVVETAHSLIGHYVKLSQS
GQMSTDAAQAAAIEAIKTMRYAGTEYFWINSLAGKMVVHPIRPDMLGKDLMGLKDPAGKLFFEAMIDVVKKDKAGFVDYL
WPKPGLDQPVPKVSYVKGIEEWGWLVGSGIYVDDVDSAFRAEVMNLGGIVTGVVLLVLLVSWWIGKNVVDGMRNATGGIR
KLAEGDTSVEIKGHERGDEIGELVQAAEIFREHSLTMKRMSEERAEQRRQAEAERRSTLAGLATELERGVKSTVVTVSES
AGRMRSTATGMAGAIDNASQESQAVAAAAQQTSSNVEAVAAAAEELSSSIRGIGSQVAESTQIAKEAVDAANRTDGVVRG
LSEAADRIGEVVRLINDIAGQTNLLALNATIEAARAGEAGKGFAVVANEVKHLASQTAKATEEIGQQIASIQSTTADAVG
AIESIGKTIGRMDEIANAIAEAVEQQGAATQEIARNVHEAADGAQEVSHHISSISRTASEAGVAARELLGAAAELAGESE
TLRNGVDRFLGEVRAM
>P07017 ~~~tar~~~Methyl-accepting chemotaxis protein II~~~COG0840
MINRIRVVTLLVMVLGVFALLQLISGSLFFSSLHHSQKSFVVSNQLREQQGELTSTWDLMLQTRINLSRSAVRMMMDSSN
QQSNAKVELLDSARKTLAQAATHYKKFKSMAPLPEMVATSRNIDEKYKNYYTALTELIDYLDYGNTGAYFAQPTQGMQNA
MGEAFAQYALSSEKLYRDIVTDNADDYRFAQWQLAVIALVVVLILLVAWYGIRRMLLTPLAKIIAHIREIAGGNLANTLT
IDGRSEMGDLAQSVSHMQRSLTDTVTHVREGSDAIYAGTREIAAGNTDLSSRTEQQASALEETAASMEQLTATVKQNADN
ARQASQLAQSASDTAQHGGKVVDGVVKTMHEIADSSKKIADIISVIDGIAFQTNILALNAAVEAARAGEQGRGFAVVAGE
VRNLASRSAQAAKEIKALIEDSVSRVDTGSVLVESAGETMNNIVNAVTRVTDIMGEIASASDEQSRGIDQVALAVSEMDR
VTQQNASLVQESAAAAAALEEQASRLTQAVSAFRLAASPLTNKPQTPSRPASEQPPAQPRLRIAEQDPNWETF
>P02941 ~~~tar~~~Methyl-accepting chemotaxis protein II~~~
MFNRIRVVTMLMMVLGVFALLQLVSGGLLFSSLQHNQQGFVISNELRQQQSELTSTWDLMLQTRINLSRSAARMMMDASN
QQSSAKTDLLQNAKTTLAQAAAHYANFKNMTPLPAMAEASANVDEKYQRYQAALAELIQFLDNGNMDAYFAQPTQGMQNA
LGEALGNYARVSENLYRQTFDQSAHDYRFAQWQLGVLAVVLVLILMVVWFGIRHALLNPLARVITHIREIASGDLTKTLT
VSGRNEIGELAGTVEHMQRSLIDTVTQVREGSDAIYSGTSEIAAGNTDLSSRTEQQASALEETAASMEQLTATVKQNADN
ARQASQLAQSASETARHGGKVVDGVVNTMHEIADSSKKIADIISVIDGIAFQTNILALNAAVEAARAGEQGRGFAVVAGE
VRNLASRSAQAAKEIKALIEDSVSRVDTGSVLVESAGETMTDIVNAVTRVTDIMGEIASASDEQSRGIDQVALAVSEMDR
VTQQNASLVQESAAAAAALEEQASRLTQAVSAFRLASRPLAVNKPEMRLSVNAQSGNTPQSLAARDDANWETF
>Q9X0M7 ~~~mcp2~~~Methyl-accepting chemotaxis protein 2~~~COG0840
MSLKGKTLLVSTITLAAVVLVALLGGSVFLKAGQNVRKAFEEYELAVEALDKLGELETKVALFVNNAAKIEEVSSLFNEL
KKVADKIPSLKEHMDALERNISEIISGKTEVVSRIQSSVDQVKEDIMANLDRTRENLDKEISYSSELIRNVLFIVLPIVA
VASGVFLFVMISRSLRLLKPVMEASRSLRNNDLTINIQEAKGKDEISTLLNEFKASIEYLRNNLKDVQTETFSVAESIEE
ISKANEEITNQLLGISKEMDNISTRIESISASVQETTAGSEEISSATKNIADSAQQAASFADQSTQLAKEAGDALKKVIE
VTRMISNSAKDVERVVESFQKGAEEITSFVETINAIAEQTNLLALNAAIEAARAGEAGRGFAVVADEIRKLAEESQQASE
NVRRVVNEIRSIAEDAGKVSSEITARVEEGTKLADEADEKLNSIVGAVERINEMLQNIAAAIEEQTAAVDEITTAMTENA
KNAEEITNSVKEVNARLQEISASTEEVTSRVQTIRENVQMLKEIVARYKI
>P05704 ~~~trg~~~Methyl-accepting chemotaxis protein III~~~COG0840
MNTTPSQRLGFLHHIRLVPLFACILGGILVLFALSSALAGYFLWQADRDQRDVTAEIEIRTGLANSSDFLRSARINMIQA
GAASRIAEMEAMKRNIAQAESEIKQSQQGYRAYQNRPVKTPADEALDTELNQRFQAYITGMQPMLKYAKNGMFEAIINHE
SEQIRPLDNAYTDILNKAVKIRSTRANQLAELAHQRTRLGGMFMIGAFVLALVMTLITFMVLRRIVIRPLQHAAQRIEKI
ASGDLTMNDEPAGRNEIGRLSRHLQQMQHSLGMTVGTVRQGAEEIYRGTSEISAGNADLSSRTEEQAAAIEQTAASMEQL
TATVKQNADNAHHASKLAQEASIKASDGGQTVSGVVKTMGAISTSSKKISEITAVINSIAFQTNILALNAAVEAARAGEQ
GRGFAVVASEVRTLASRSAQAAKEIEGLISESVRLIDLGSDEVATAGKTMSTIVDAVASVTHIMQEIAAASDEQSRGITQ
VSQAISEMDKVTQQNASLVEEASAAAVSLEEQAARLTEAVDVFRLHKHSVSAEPRGAGEPVSFATV
>Q9X0N0 ~~~mcp3~~~Methyl-accepting chemotaxis protein 3~~~COG0840
MKSVASKLLLGFGLVCAGLVLFGLLTFYNILSLEKIVADTANINRAIVELAINQAGVLVAVQNKDKSLLSSSVEGLRTSL
DDIKAYQSDFSGENLKLLQESIAHLEEMIRITDSLIVDGVDQSIYDRFVELQAEIRNPLRKLVQNLGVENVSMTKNIKRN
IIFFLVVVCAAAMFIAIFTTRNLTTPLKKLAVLVENLSHGVLNVEIEKIRSKDEIGKAAMAVEKLREILLDIITGINKAS
SEVSSSSEELSATSEELSANVNSISEALVSLNKEADENSATLEEFTASIEELSSTADSNSKSAQAMLESTQRVHEQVEKS
TERIREITEKAHSTREMSENTKQALNRLLSMAENINSIVDTINSIAEQTNLLALNAAIEAARAGEAGRGFAVVADEIRKL
AEESKAATQQIGEILGKLRDEINNSSKIVESTASAIEETASLVESIKDVFESIRIAMEDVQSRVESVAASTQEQSASLEE
LSAGVTRLTELLNKTRENTSSANSALQEANAALEELSASAQSLAELAQELQRRIEFFKI
>P07018 ~~~tap~~~Methyl-accepting chemotaxis protein IV~~~COG0840
MFNRIRISTTLFLILILCGILQIGSNGMSFWAFRDDLQRLNQVEQSNQQRAALAQTRAVMLQASTALNKAGTLTALSYPA
DDIKTLMTTARASLTQSTTLFKSFMAMTAGNEHVRGLQKETEKSFARWHNDLEHQATWLESNQLSDFLTAPVQGSQNAFD
VNFEAWQLEINHVLEAASAQSQRNYQISALVFISMIIVAAIYISSALWWTRKMIVQPLAIIGSHFDSIAAGNLARPIAVY
GRNEITAIFASLKTMQQALRGTVSDVRKGSQEMHIGIAEIVAGNNDLSSRTEQQAASLAQTAASMEQLTATVGQNADNAR
QASELAKNAATTAQAGGVQVSTMTHTMQEIATSSQKIGDIISVIDGIAFQTNILALNAAVEAARAGEQGRGFAVVAGEVR
NLASRSAQAAKEIKGLIEESVNRVQQGSKLVNNAAATMIDIVSSVTRVNDIMGEIASASEEQQRGIEQVAQAVSQMDQVT
QQNASLVEEAAVATEQLANQADHLSSRVAVFTLEEHEVARHESVQLQIAPVVS
>Q9X1E2 ~~~mcp4~~~Methyl-accepting chemotaxis protein 4~~~COG0840
MRSIASKVLVIGVVVVLAFFVTQYVLLNTTVFNSIMERKKEEAKHLVESVYGILERAYEMEQKGELTREQAQELAKSLIG
KIRYDDNNYFWINDTHPRMVFHPIKPEMNGQDLSNYKDPNGVYLFNEMVKVAKEKGEGFVSYSWPKAGSDKPEPKISYVK
LFEPWGWIVGTGIYVDDVKVTVGNLIFRNVLTVSVIGIAVIIMIFFYGRVLSRKTKAVLSALEKISSGDLSVSVDIKSKD
EFGLIAQKLNETVGNLRKMVQEIDKSQDEVERVSEELFALSQQLRSALEEIARASDTISKEVQNASASIEEVTSGSEEVS
ANSQNISKLIQEISENADNIADFARNGQRVLEEAVKKVEDVSENSRETADVVSNVTESARNIEEIVRTIQSIAEQTNLLA
LNAAIEAARAGEAGRGFAVVADEIRKLAEESQKATEEISQILENIREGVERTNEMSKKNVEITKDARRLVEESYESFNQI
VTRIEDLAARIEGIAASAQELSAASEEMSSALDAVAKTTTTVADEVEEVSENITEQEKAAKRIADIGTELKKLSDELKED
VERFKI
>P39214 ~~~mcpA~~~Methyl-accepting chemotaxis protein McpA~~~COG0840
MKKILQLIKQRSITRKLLVSFLSILIIPVVILAIFAYQSASSSLDRQMMGSALENVQQLNEIINTSIGEKENSADYFSEW
LTKEKYNAKSNASIAEKFSQYISINKDVESIYTSDTKGHFTRYPDLPMPSGYNPVERDWYKKAVANKGKVVITDPYKTAS
TNTMVVTIAQQTKDGSGVIAINMTIENLLKTTKKVNIGTQGYAFIMTKDKKVVAHPNEQSGTELKGDWLDKMLSADKGDF
QYTMDGDKKKMAFDTNKLTGWKIGGTMYLDEIHEAAQPVLHLALIVLAAAIIIGIIVMTLIIRSITTPLKQLVGSSKRIS
EGDLTETIDIRSKDELGELGKSFNNMASSLRSLIHAIQDSVDNVAASSEELTASAAQTSKATEHITLAIEQFSNGNEKQN
ENIETAAEHIYQMNDGLTNMAQASEVITDSSVQSTEIASEGGKLVHQTVGQMNVIDKSVKEAEQVVRGLETKSKDITNIL
RVINGIADQTNLLALNAAIEAARAGEYGRGFSVVAEEVRKLAVQSADSAKEIEGLIIEIVKEINTSLGMFQSVNQEVQTG
LDITDKTEMSFKRISEMTNQIAGELQNMSATVQQLSASSEEVSGASEHIASISKESSAHIQDIAASAEEQLASMEEISSS
AETLSSMAEELRDMTKRFKIE
>Q9I6V2 ~~~mcpA~~~Methyl-accepting chemotaxis protein McpA~~~
MRESLGVSLPSRPLLLKLGLLSLPWALCAGLHAVGLGVWLVLALGWFASLAMLVLLLRAPRQAVANANPQADAASPAWSA
AQRALADETAQLDGHARQIDELLHSAIRQLSDSFHGLAERIDTQRGLSHSLIERYDGRGQVDEGINFQDFVRTTQQTLSL
FVEATLETSRTSQQLVERMDQVRLKITEILQSTQDMDAIAKQTNLLALNAAIEAARAGESGRGFAVVADEVRALSTRSTE
FSAAIRKHVDVVHHEIQDAESAISQLADKDMSFALDSKHKIQGMLDDLEAMNRHTLKVIHELDRLSLEVGQGVDAAVTAL
QFQDMGSQLLGQMRKHHARLGAFALGLGALEARPRQEWPERVAREVEELRRPLPSPVSQNSVNVGEVELF
>Q88KP1 ~~~mcpA~~~Methyl-accepting chemotaxis protein McpA~~~COG0840
MSALRPPLIGSRSRNMNLKFRHKILLSACGVVVLAFALFTLYNDYLQRNTIRQNIEASVQQSGALTASSVQNWMSGRILV
LENLAQDIGQQGAGDTLAGLIEQPSYTRNFLFTYLGQANGEFTQRPDAQMPAGYDPRQRPWYGAAANAGQTVLTAPYQGA
VGGLMVTIATPVKSKRNGELIGVVGGDVTLDTLVEIINSVDFGGIGHAFLADANGQVIVSPNKDQVMKNLKDIYPGSNLR
VAAGMQDVTLDGQDRIISFAPVAGLPSAQWYIGLSIDRDKAYAALSQFRTSAIIAMLIAVAAIAGLLGLLIPVLMSPLTT
MGRAMRDIAEGEGDLTRRLAVQNKDEFGELATSFNRFVERIHASISEVSSATRLVHDLSEKVVSASNASIIGSEEQSMRT
NSVAAAINELGAATQEIARNAADASQHASGASEQAHGGREVVEEAISAMTALSQRISESCAQIETLNASTDEIGKILDVI
KGISQQTNLLALNAAIEAARAGEAGRGFAVVADEVRNLAHRTQESAEEIHRMITSLQVGSREAVHTMNTSQVSSEQTVQV
ANQAGERLASVTQRIGEIDGMNQSVATATEEQTAVVESLNLDITQINALNQQGVENLNETLRHCDQLAQQAGRLKQLVGS
FRI
>P39215 ~~~mcpB~~~Methyl-accepting chemotaxis protein McpB~~~COG0840
MKTFINWLKKPSISKKLIVSFIAILIIPILILEFSSYRSASGKLDQEIMGNAKNSVDTFNTTVTNDLGEKAKAVTFFSES
LKRSAFKGKSNQEEIKAKFSQYVSINQGVARIYGGADNGTYVQAPKEKLPEGYDPRQRPWYQDAMKAGGEIVVTDPYVAA
SDGSMVITIAQELKDGSGVVAMDITIDKLLEQMKQIKVGKEGYAFIATKNKTYVAHKNHKAGEKLSGDWVAKMYANDSGE
LQYTLNNEDKKMTYTTNELTGWKIAGTMYMDEIKDASKSVLTTGMIVLIASIVAGGILILFIVRSITKPLKRLVQSSKTI
SRGDLTETIEIHSKDELGELGESFNEMGQSLRSLISAIQDSVNNVAASSEQLTASAGQTSKATEHITMAIEQFSNGNEEQ
SEKVESSSHQLNLMNEGLQQVSQTSSDITKASIQSTEIAGTGEKFVQQTVGQMNSINQSVQQAEAVVKGLEGKSKDITSI
LRVINGIADQTNLLALNAAIEAARAGESGRGFSVVAEEVRKLAVQSADSAKEIEKLIQEIVAEIDTSLHMFKEVNQEVQS
GLVVTDNTKESFQSIFSMTNEIAGKLQTMNSTVEQLSDRSQHVSAAVSGIADVSKESSASIQDIAASAEEQLASMEEISS
SATTLAQMAEELRDLTKQFKIE
>Q9I6V6 ~~~mcpB~~~Methyl-accepting chemotaxis protein McpB~~~
MGLFNAHAVAQQRADRIATLLQSFADGQLDTAVGEAPAPGYERLYDSLRALQRQLREQRAELQQVESLEAGLAEMSRQHE
AGWIDQTIPAERLEGRAARIAKGVNELVAAHIAVKMKVVSVVTAYGQGNFEPLMDRLPGKKAQITEAIDGVRERLRGAAE
ATSAQLATAAYNARIKSALDNVSANVMIADNDLNIIYMNRTVSEMLGRAEADIRKQLPNFDAGRLMGANIDVFHKNPAHQ
RHLLANLTGVHKAELNLGGRRFSLDVVPVFNDANERLGSAVQWTDRTEEHRAEQEVSQLVQAAAAGDFSKRVEEAGKEGF
FLRLAKDLNSLVDTADRGLRDVSRMLGALAQGDLTQRIEADYQGTFGQLKDFSNDTAQSLSRMLGQIREAADTINTAASE
IASGNAELSARTEQQASSLEETASSMEELTSTVKLNAENARQANSLAANASEVATQGGTVVQKVVSTMSSINESARKIAD
IIGVIDGIAFQTNILALNAAVEAARAGEQGRGFAVVAGEVRTLAQRSAAAAKEIKTLISDSVDKVENGNTLVAQAGQTMS
DIVVAIRRVTDIMSEIAAASAEQSTGIEEVNSAVSQMDDMTQQNAALVEEAAAAAEAMQEQAGLLNQSVAVFRLDTPPSV
VQLASARPSAPRPSAPAPLARSGMARASKARKEDGWEEF
>P54576 ~~~mcpC~~~Methyl-accepting chemotaxis protein McpC~~~COG0840
MFKKLHMKIAVFVSIMLIITVVLLMLSSYLTLKPMITEDGKNTTQNVTQSLEQNIELQLKSYAISLSRLANGELTHTFVT
KPSKEASRLFHDDIKQIKDNDDYVAMAYIGTAKKEMFTYPKADFAEDYDPTSRPWYKLAAETPDQVVWTEPYKDVVTGDM
IVTASKAILDRQKVIGVASYDLKLSAIQSMVNKQKVPYKGFAFLADASGNLLAHPSNQGKNISKDQTLQTIASEKKGIQD
VNGKMVVYQTIGETGWKVGTQFDTDQLMWISDKMNRANLWISLIALIITIILSYFLAKTITGPIQQLIVKTKAVSAGDLT
VRAESKSKDEVGILTRDFNLMVENMKEMVEQVRLSSGKVSDTSEQLTAVAAETNETSGQIAKAIEEVAAGASEQASEVET
INEKSESLSTKIRQIAEEAGGIKERSKSSEDASYKGLDALGQLLMKSNEANMETKKVETMLLDLENQTKNIEEVVTAISN
ISDQTNLLALNASIEAARAGESGRGFAVVADEVRKLAEQSALSTKHISETVKLIQLETKEASHAMVEASRMNDEQNSAIH
ETGEVLNTITAEMQSLVQGIDHIYAEIQRMSEEQLAISEAIQSISAISQESAAAAEEVNASTDEQLVTLDKVKHSTETLK
HASQELMNTIAKFTL
>Q88N45 ~~~mcpG~~~Methyl-accepting chemotaxis protein McpG~~~COG0840
MNKSLRFSHKILLAASLIVILAFSLFTLYNDYLQRNAIREDLENYLAEMGASTSTNIRNLFEGRIKLVENLAQNIAQDPA
NAETLMGQNALISSFLTVYLGKVDGGFSVRPDAKMPDGYDPRTRPWYKDGMNASGATLTEPYIDMTTNKMVIGILSKVSS
SVGVVGGDLALDGLVQIINSLNFGGMGYAFLVNDQGKILVHPDKDLVMKSLSDLFPQHTPKLTGELTEVQSDGQTRLLTF
SPITGLPSANWYIGLSVDKDKAFSMLSTFRTSAVIATVVAVVIIIGLLGLLIRVLMQPLHTMTRAMEDIAEGEGDLTKRL
HIHSHDEFGVLGNAFNRFVERIHSSIREVSSATEQVNEVALRVISASNSSMTNSDEQSNRTNSVAAAINELGAAAQEIAG
NAAQASQHASSARLLAEEGQQVVERNIAAMNRLSDLIVTSSAHIETLNSKTVNIGQILEVITSISQQTNLLALNAAIEAA
RAGEAGRGFAVVADEVRNLAHRTQESAQQVQTMIEELQVGARESVDTMEQSQRHSQDSMQIANQAGERLDSVTVRIGEID
GMNQSVATATEEQTAVVEAINMDINEINMLNQEGVENLQATLRACSDLEQQAGRLKHLVGSFRI
>Q88R14 ~~~mcpH~~~Methyl-accepting chemotaxis protein McpH~~~COG0840
MRIWRKSIQLQLITSMGAALLASILVVVIIFTVALNRLTDRYLVDTALPASIEAIRNDIERMLGQPLVAAADIAGNTLLR
DWLAAGEDPAQAPQFIEYLTAAKQRNHAFTTLFASTETGHYYNENGLDRTLSRSNPKDKWFYGYIDSGAERFINIDIDGA
TGELALFIDYRVEKEGKLVGVAGMGLRMTELSKLIHDFSFGEHGKVFLVRNDGLIQVHPDAAFSGKRQLAEQLGADAAKG
VMTGGESLRSSRFSRDGERYLALGLPLRDLNWTLVAEVPESEIYAQMHQAVWLTSLIGGAVALVSLLLVVLLARGLVRPI
RRVTAALVQIGSGAGDLSHRLDDSRQDELGDLARGFNRFLDSQRSLIGEVLSTSERLRRAVEQVTQVVDNTAERSGRQQE
MTEMVATAVHEMGLTVQDIARNAGDAAQASQSARDEALQAREVVQRSIRGIEGMSGDIGKAADAVSQLADEVASVDEVLA
VIRSISEQTNLLALNAAIEAARAGEMGRGFAVVADEVRTLARRTQLSTDEVQQMIQRLKLGAGSAVSSMQAGQQATGSGV
ESSQRTGASLSAITDQVEHISDMNHQVATATEEQSAVTEEINRTVQGISDLARETAAEVQGCREECQALRGLADDLARQM
GGFRL
>Q9HUB1 ~~~mcpK~~~Methyl-accepting chemotaxis protein McpK~~~
MYDWWVLQLAKLSVSRKLMVGFGVLLALLLLVVISSNRTLTHQTALSEQLAEVASLMEQTQQAEQGRLAFEAGSDPRQAE
QVRQTLAGMLQRLQALRDSELDPAALAHQVEAIEAYRKAFDDLAAADQQRSAARGVLVGTAQQALDSFARLEELMDASLA
QQAGDPQALQRSRAVADLHQQLLMVRYQVRGYVFERSDKAEQAAFAAFDALRQAATTLRGQLPGEADAALEQAMGSLQGY
RGGIEQFRAGVIRTRQAQQAMQSSTQDMARAGRTLTEAGRQLRESTASRDRASLWLIAALALAFGCVAGWAINRQIVRPL
DEALAQAEAIAAGDLGKRPQNPLTLQRRDELGQLQRVMQRMGDSLRELVGRISDGVSQLASSAEELSAVTEQTRAGVNSQ
KVETDQVATAMHEMAATVQDVARNAELASQAARQADEEARQGDAVVDQAVTRIERLASEMDVSSEAMARLKNESEQIGSV
LDVIKSVAEQTNLLALNAAIEAARAGDAGRGFAVVADEVRGLAQRTQQSTAEIEGLIQRLQQGAGEAAERLENSRSLTAS
TVELARRAGAALDSITRTVSDIQNMNLQIATAAEQQSTVAEEINRSVLSVRDVAEQSAAASEQTAASSGELARLGTQLQA
QVGRFRL
>Q9I055 ~~~mcpN~~~Methyl-accepting chemotaxis protein McpN~~~
MNESVARVFDRILRGLGLKTLNAQFLLSYALMFGLAACASVALYLSMSISPETINVAGAQRMLSQKMAREALQLRLGAGD
PKALAATIAQYERSAADLDAGNAERNVSRMGAPEIAAQRQKVAQIWGRYRAMLDQVAQPASQVDLRGFSQYSTELLGELN
NLVSLMSARADSVQHTQMWIAFGCLLAILVLVVLGRQFGLAPLMRQLRGLEVALTEVGAANFTHALAAGHADNEIGRIVA
GYERMRQDVSGLLANVKRSAAETDKDVAEALEQALGAGDQVARQHQDLDQVATAMNEMSATVAEVARHANHAAHSTRDAA
ALAHEGRRLVEHASSQTGALAEELEQTALALNTLHQHAGSVGQVLTVISSIAEQTNLLALNAAIEAARAGEAGRGFAVVA
DEVRSLANRTQQSTQEIQGLIEQLQDGANDAVAAMRGSASHAQSNLVEADSAAQALGRIVATVEELDGLNQQIATAAEEQ
SQVAQDIDRNITNVSGLSEQAHEGTAAVLSANQRVKEHMAGLRVVLGRFRT
>Q88IY8 ~~~mcpP~~~Methyl-accepting chemotaxis protein McpP~~~COG0840
MNTLRSMSISRRLWLILVVAVAMLVVLGLLMLRQIHGDLYQAKAEKTRHVVQTAAGVLAYYQGLEAAGTLSREAAQQQAL
QVVRALRYDHDDYFWINDLGPKMIMHPANPKLDDQDLSAIRDPDGFAVFNEMVALARQQDAGPVNYRWPKPGASEPVAKT
SYIQLFKPWGWIIGSGVYVDDVQAEFARQLRDASLVGVGIALLMALVVMLIARSIARPLQEAVQAMGNIASGESDLTRRL
DTHGSDEITHLGEHFNRFNGKLQGVVGQLQGAAHALAQSAGHVGDNAGAAQQRSAQQSLQMDQVATAVNEVTYAVQDVAK
TAEQAAGEMRTAQQQVTHGQQAIHGSLAQIDRLSLTIDEAVQVIRDLAGHSTRIGGVLDVIRSIAEQTNLLALNAAIEAA
RAGEQGRGFAVVADEVRLLAQRTAQSTAEIHTMIEHLQSQSDAAVKAIDTSSEASRQTVEQAREAGASLDAINQVLNNLT
ALNASIASATLQQSHVVEEINRNVLDTAGLSQQTADAARQSSDAGVALGRLSEELEQLLRQFRV
>Q88D09 ~~~mcpQ~~~Methyl-accepting chemotaxis protein McpQ~~~COG0840
MYQWLAQSLGNVSVNRKLGLGFGLVLLLTLAITLTGWHGMDSIIDRGDKLGNISVIQQYTQELRIARQQYDRRRDDASLA
ELEKALSNLDRQVQLMLGQIEQPADHQRLEQQREAVRIYQQAFNELKQADQRREASRDVLGSSADKAVDLIGRVQRSLLQ
GANINQYQHAVDVSALLQQARFQVRGYTYSGNADYQQTALKAIDQALAELRALPAKVPAEHAASLDDAATAMGGYRDAVT
QFGNAQLASEQALQRMVEQGTVLLQASQMMTASQTEVRDAAAAQAKTLLTVATVLALALGLLAAWAITRQIIIPLRQTLR
AAERVASGDLTQSLQVQRRDELGQLQASMHRMTQGLRELIGGIGDGVTQIASAAEELSAVTEQTSAGVNNQKVETDQVAT
AMNQMTATVHEVARNAEQASEAALMADQQAREGDRVVGEAVAQIERLASEVVNSSEAMNLLKTESDKIGSVLDVIKSVAQ
QTNLLALNAAIEAARAGEAGRGFAVVADEVRSLAQRTQQSTEEIEELIAGLQSGTQRVASVMDNSRQLTDSSVELTRRAG
SSLETITRTVSSIQAMNQQIATAAEEQTAVAEEINRSVMNVRDISDQTSAASEETASSSVELARLGTHLQGLVGRFRL
>Q88E10 ~~~mcpS~~~Methyl-accepting chemotaxis protein McpS~~~COG0840
MNSWFANISVNLKLGLGFGLVLVLTGLLALTGWTSLGSLIDRSNWMGDIGQLNKDLTDLRIARLQYMIANGDDTAAANTL
AKLDAFSKQQAYLATTFKSPENVKLLGELGDTISAYKLSLNKMRQGYDATRAARVSMDSSAIRADQAMDALSQEVMARPE
ADSVRLAQYQLISKARQQLLQVRIDVRGYIAENSSANEQAALRQLDAALADTDNLKRQLPSEDARLQQFENAVLAYRDAV
RQFRDAVANITTSRAEMTVQGADIVKRSDALYQIQLERRDIESTQARSLQAIATLLALLVGVLAAVLITRQITRPLQDTL
VAVEKIASGDLTQHMRVTRRDELGVLQQGIARMGTTLRELISGIRDGVTQIASAAEELSAVTEQTSAGANSQKVETDQVA
TAMHEMAATVQEVARNAEQASHAATGADDEARAGDRVVGEAIGQIERLAEDMHRSTEAMNLLQQESQKIGSVMDVIKSVA
EQTNLLALNAAIEAARAGEAGRGFAVVADEVRGLAQRTQKSTEEIEELIASLQHGTQQVANAMQGSRALTDSSVELARKA
GSSLESITSTVSSIQSMNQQIAAAAEQQSAVAEEISRSILNVRDVSEQTAAASDETAASSVELARLGGQLQTLVSQFRV
>Q88NI1 ~~~mcpU~~~Methyl-accepting chemotaxis protein McpU~~~COG0840
MPLRRLSIQWKITLLAGLCLLAIVALLVATSLTQAHRSAALVNQANTAMLEDSARQRLQAHAETQALRIQRYFMDAYQYG
NGFARLVQVLKDRGGSDLRAELTRQARASLAGNPDVIGLYLVFQPNALDQQDSHYLGQDAMGSNESGRFSLYWSQPSPGT
LELEAMPETMLGDTSIGSNGAAKNRWLTCPQDTARTCMLEPYLDEVNGRQVLMTSIALPLLEHGKVVGVVGLDIGLANLQ
QLSVNGRRDLFDGQGQVSIATAAGLLAGNSRDDSVLGKPMDKSVADGLLRVAHPFTPIPDTAPWQVVLELPESVLQAPAV
ALNQRLDAHNQNANLTSLLIGLGTAIAGLLLVWLTARGVTRPILAVAARLEDIASGEGDLTRRLDYAHQDELGQLTGWFN
RFLDKLQPVIAQVKGSLQEARNTADQSAAIASQTSDGMQQQHREIEQVATAANEMSATALDVAHNASQAAQAARAADQAS
QEGLQLVDSTRQGIDRLAAGMNTAMDEARALEDRSGQIGSVLEVIRTIAEQTNLLALNAAIEAARAGEAGRGFAVVADEV
RGLAQRTQVSVEEIRQVIEGLQQGTQDVVGAMHAGQRQAQDSAARMEQALPALQRIGEAVAVISDMNLQIASAAEEQSAV
AEEVNRNVAGIRDVTESLAGQADESARISQALNRLANQQQALMEQFRV
>A0A0R6L508 2.7.-.-~~~mcr1~~~Probable phosphatidylethanolamine transferase Mcr-1~~~
MMQHTSVWYRRSVSPFVLVASVAVFLTATANLTFFDKISQTYPIADNLGFVLTIAVVLFGAMLLITTLLSSYRYVLKPVL
ILLLIMGAVTSYFTDTYGTVYDTTMLQNALQTDQAETKDLLNAAFIMRIIGLGVLPSLLVAFVKVDYPTWGKGLMRRLGL
IVASLALILLPVVAFSSHYASFFRVHKPLRSYVNPIMPIYSVGKLASIEYKKASAPKDTIYHAKDAVQATKPDMRKPRLV
VFVVGETARADHVSFNGYERDTFPQLAKIDGVTNFSNVTSCGTSTAYSVPCMFSYLGADEYDVDTAKYQENVLDTLDRLG
VSILWRDNNSDSKGVMDKLPKAQFADYKSATNNAICNTNPYNECRDVGMLVGLDDFVAANNGKDMLIMLHQMGNHGPAYF
KRYDEKFAKFTPVCEGNELAKCEHQSLINAYDNALLATDDFIAQSIQWLQTHSNAYDVSMLYVSDHGESLGENGVYLHGM
PNAFAPKEQRSVPAFFWTDKQTGITPMATDTVLTHDAITPTLLKLFDVTADKVKDRTAFIR
>P24200 3.1.21.-~~~mcrA~~~Type IV methyl-directed restriction enzyme EcoKMcrA~~~COG1403
MHVFDNNGIELKAECSIGEEDGVYGLILESWGPGDRNKDYNIALDYIIERLVDSGVSQVVVYLASSSVRKHMHSLDERKI
HPGEYFTLIGNSPRDIRLKMCGYQAYFSRTGRKEIPSGNRTKRILINVPGIYSDSFWASIIRGELSELSQPTDDESLLNM
RVSKLIKKTLSQPEGSRKPVEVERLQKVYVRDPMVKAWILQQSKGICENCGKNAPFYLNDGNPYLEVHHVIPLSSGGADT
TDNCVALCPNCHRELHYSKNAKELIEMLYVNINRLQK
>P43485 1.5.3.-~~~mcrA~~~Mitomycin radical oxidase~~~
MSTQWGWALEPDQPGYDDARLGLNRAAESRPAYVVEAADEQEVAAAVRLAAEQKRPVGVMATGHGPSVSADDAVLVNTRR
MEGVSVDAARATAWIEAGARWRKVLEHTAPHGLAPLNGSSPNVGAVGYLVGGGAGLLGRRFGYAADHVRRLRLVTADGRL
RDVTAGTDPDLFWAVRGGKDNFGLVVGMEVDLFPVTRLYGGGLYFAGEATAEVLHAYAEWVRHVPEEMASSVLLVHNPDL
PDVPEPLRGRFITHLRIAYSGEPADGEHLVRPLRELGPILLDTVRDMPYAEVGTIHHEPTSMPYVAYDRNVLLSDLTDDA
VDIIVALAGPDAGAPFVTELRHFGGAYARPPKVPNCVGGRDAAFSLFTGAVPEAEGLRRRDDLLDRLRPWSTGGTNLNFA
GVEDISPASVEAAYTPADFARLRAVKAQYDPDNMFRVNFNIPPAESWT
>P15005 3.1.21.-~~~mcrB~~~Type IV methyl-directed restriction enzyme EcoKMcrB subunit~~~COG1401
MESIQPWIEKFIKQAQQQRSQSTKDYPTSYRNLRVKLSFGYGNFTSIPWFAFLGEGQEASNGIYPVILYYKDFDELVLAY
GISDTNEPHAQWQFSSDIPKTIAEYFQATSGVYPKKYGQSYYACSQKVSQGIDYTRFASMLDNIINDYKLIFNSGKSVIP
PMSKTESYCLEDALNDLFIPETTIETILKRLTIKKNIILQGPPGVGKTFVARRLAYLLTGEKAPQRVNMVQFHQSYSYED
FIQGYRPNGVGFRRKDGIFYNFCQQAKEQPEKKYIFIIDEINRANLSKVFGEVMMLMEHDKRGENWSVPLTYSENDEERF
YVPENVYIIGLMNTADRSLAVVDYALRRRFSFIDIEPGFDTPQFRNFLLNKKAEPSFVESLCQKMNELNQEISKEATILG
KGFRIGHSYFCCGLEDGTSPDTQWLNEIVMTDIAPLLEEYFFDDPYKQQKWTNKLLGDS
>P15006 ~~~mcrC~~~Type IV methyl-directed restriction enzyme EcoKMcrBC~~~COG4268
MEQPVIPVRNIYYMLTYAWGYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLELDYNPNTEIIPGIKGRIEFAK
TIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEKLNSTIRDEARSLYRKLPGISTLHLTPQHFSYLNGGKNTR
YYKFVISVCKFIVNNSIPGQNKGHYRFYDFERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLLPR
METDITIRSSEKILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPENGENIGGLLIYPHVDTAVKHRYKINGFD
IGLCTVNLGQEWPCIHQELLDIFDEYLK
>P37569 ~~~mcsA~~~Protein-arginine kinase activator protein~~~COG3880
MICQECHERPATFHFTKVVNGEKIEVHICEQCAKENSDSYGISANQGFSIHNLLSGLLNMDSSFQNAGTQMFSHSEQISA
CPKCGMTFQQFRKIGRFGCSECYKTFHSNITPILRKVHSGNTVHAGKIPKRIGGNLHVRRQIDMLKKELESLIHQEEFEN
AAHVRDQIRLLEQSLKSTDSEEEQE
>Q2G0P7 ~~~mcsA~~~Protein-arginine kinase activator protein~~~COG3880
MLCENCQLNEAELKVKVTSKNKTEEKMVCQTCAEGHHPWNQANEQPEYQEHQDNFEEAFVVKQILQHLATKHGINFQEVA
FKEEKRCPSCHMTLKDIAHVGKFGCANCYATFKDDIIDIVRRVQGGQFEHVGKTPHSSHKKIALKRKIEEKNEYLKKLIE
IQDFEEAAIVRDEIKALKAESEVQHDDA
>P37570 2.7.14.1~~~mcsB~~~Protein-arginine kinase~~~COG3869
MSLKHFIQDALSSWMKQKGPESDIVLSSRIRLARNFEHIRFPTRYSNEEASSIIQQFEDQFSEQEIPGIGKFVLIRMNDA
QPLEKRVLVEKHLISPNLTESPFGGCLLSENEEVSVMLNEEDHIRIQCLFPGFQLLEAMKAANQVDDWIEEKVDYAFNEQ
RGYLTSCPTNVGTGLRASVMMHLPALVLTRQINRIIPAINQLGLVVRGIYGEGSEAVGNIFQISNQITLGKSEQDIVEDL
NSVAAQLIEQERSAREAIYQTSKIELEDRVYRSYGVLSNCRMIESKETAKCLSDVRLGIDLGIIKGLSSNILNELMILTQ
PGFLQQYSGGALRPNERDIRRAALIRERLHLEMNGKRQEDESI
>P0DMM5 2.7.14.1~~~mcsB~~~Protein-arginine kinase~~~
MSFGKFFNTAVSAWMSQEGPNSDIVLSSRIRLARNIVDFRFPTLFSSEEAKQIVALFERAFVHRPYGEAGRFELLKMSEL
QPIEKRVLVEKHLISPHLAEDSPFGACLLSENEEISIMINEEDHIRIQCLFPGLQLAEALEAASELDDWIEGHVNYAFDE
RLGYLTSCPTNVGTGLRASVMMHLPALVLTQQINRIIPAINQLGLVVRGTYGEGSEALGNIFQISNQITLGKSEEDIVAD
LHTIVEQLIAQERAARQALVKTLGIQLEDKVFRSYGILANCRVIDSKEAAQCLSDVRLGIDLGYIKNVSRNILNELMILT
QPGFLQQYAGGVLRPEERDVRRAALIRERLRMETRRKMEGDER
>Q2G0P6 2.7.14.1~~~mcsB~~~Protein-arginine kinase~~~COG3869
MTHNIHDNISQWMKSNEETPIVMSSRIRLARNLENHVHPLMYATENDGFRVINEVQDALPNFELMRLDQMDQQSKMKMVA
KHLISPELIKQPAAAVLVNDDESLSVMINEEDHIRIQAMGTDTTLQALYNQASSIDDELDRSLDISYDEQLGYLTTCPTN
IGTGMRASVMLHLPGLSIMKRMTRIAQTINRFGYTIRGIYGEGSQVYGHTYQVSNQLTLGKSELEIIETLTEVVNQIIHE
EKQIRQKLDTYNQLETQDRVFRSLGILQNCRMITMEEASYRLSEVKLGIDLNYIELQNFKFNELMVAIQSPFLLDEEDDK
SVKEKRADILREHIK
>P65206 2.7.14.1~~~mcsB~~~Protein-arginine kinase~~~
MTHNIHDNISQWMKSNEETPIVMSSRIRLARNLENHVHPLMYATENDGFRVINEVQDALPNFELMRLDQMDQQSKMKMVA
KHLISPELIKQPAAAVLVNDDESLSVMINEEDHIRIQAMGTDTTLQALYNQASSIDDELDRSLDISYDEQLGYLTTCPTN
IGTGMRASVMLHLPGLSIMKRMTRIAQTINRFGYTIRGIYGEGSQVYGHTYQVSNQLTLGKSELEIIETLTEVVNQIIHD
EKQIRQKLDTYNQLETQDRVFRSLGILQNCRMITMEEASYRLSEVKLGIDLNYIELQNFKFNELMVAIQSPFLLDEEDDK
SVKEKRADILREHIK
>P9WJ83 ~~~mctB~~~Copper transporter MctB~~~
MISLRQHAVSLAAVFLALAMGVVLGSGFFSDTLLSSLRSEKRDLYTQIDRLTDQRDALREKLSAADNFDIQVGSRIVHDA
LVGKSVVIFRTPDAHDDDIAAVSKIVGQAGGAVTATVSLTQEFVEANSAEKLRSVVNSSILPAGSQLSTKLVDQGSQAGD
LLGIALLSNADPAAPTVEQAQRDTVLAALRETGFITYQPRDRIGTANATVVVTGGALSTDAGNQGVSVARFAAALAPRGS
GTLLAGRDGSANRPAAVAVTRADADMAAEISTVDDIDAEPGRITVILALHDLINGGHVGHYGTGHGAMSVTVSQ
>Q8NS49 ~~~mctC~~~Monocarboxylic acid transporter~~~COG4147
MNSTILLAQDAVSEGVGNPILNISVFVVFIIVTMTVVLRVGKSTSESTDFYTGGASFSGTQNGLAIAGDYLSAASFLGIV
GAISLNGYDGFLYSIGFFVAWLVALLLVAEPLRNVGRFTMADVLSFRLRQKPVRVAAACGTLAVTLFYLIAQMAGAGSLV
SVLLDIHEFKWQAVVVGIVGIVMIAYVLLGGMKGTTYVQMIKAVLLVGGVAIMTVLTFVKVSGGLTTLLNDAVEKHAASD
YAATKGYDPTQILEPGLQYGATLTTQLDFISLALALCLGTAGLPHVLMRFYTVPTAKEARKSVTWAIVLIGAFYLMTLVL
GYGAAALVGPDRVIAAPGAANAAAPLLAFELGGSIFMALISAVAFATVLAVVAGLAITASAAVGHDIYNAVIRNGQSTEA
EQVRVSRITVVVIGLISIVLGILAMTQNVAFLVALAFAVAASANLPTILYSLYWKKFNTTGAVAAIYTGLISALLLIFLS
PAVSGNDSAMVPGADWAIFPLKNPGLVSIPLAFIAGWIGTLVGKPDNMDDLAAEMEVRSLTGVGVEKAVDH
>D3JV05 3.1.2.30~~~mcl2~~~(3S)-malyl-CoA thioesterase~~~COG2301
MAHQAHPFRSVLYIPGSKERALEKAQGLAADAIIFDLEDAVAHDEKIHARRLLKTTLETADYGHRFRIVRVNGMDTEWGR
ADLEAFAEAKADAILIPKVSRAADLEAVAALVPDLPLWAMMETAQGMLNAAEIAAHPRLSGMVMGTNDLAKELGSRYRPD
RLAMQAGLGLCLLAARAHGLTIVDGVYNAFKDEEGLRAECEQGRDMGFDGKTLIHPAQLEIANAVFSPSPAEIELANRQI
AAFEEAERHGQGVAVVDGKIVENLHIVTARQTLAKAEAIAAFRAS
>Q1M7A2 ~~~mctP~~~Monocarboxylate transport permease protein~~~
MTTDINGTALAVFIFFFVLVTVMGFVASRWRKPETLAHIDEWGLGGRNFGTWITWFLVGGDFYTAYTVIAVPALVYTVGA
YGFFALPYTIVVYPFVFMVMPVLWKRAKDFGYVTAGDVVHGQYGSRGLELAVAATGVIATMPYIALQLVGMTAVLKALGL
HGELPLAIAFIVLALYTYSAGLRAPALIAFVKDIMIYIVVIAAVALIPSKLGGYANVFASADAAFQAKGSGNLLLGGNQY
VAYATLALGSALAAFMYPHTLTGIFASNSGKTIRKNAIMLPAYTLLLGLLALLGYMGHAANLKLDSANDVVPTLFKTLFS
GWFSGFAFAAIAIGALVPAAVMSIGAANLFTRNFWKAYVDPDVSDAGEAKVAKITSLVVKVGALLVIIFLPTQFALDLQL
LGGIWILQTLPALVFGLYTNWFRAPGLLAGWFVGFGGGTFLVWDAGWKPLHLISLGGEPFTVYTGLLALAANIAVAVVVN
ALLPAKAPVRA
>A9WC36 5.4.1.3~~~mct~~~2-methylfumaryl-CoA isomerase~~~COG1804
MKGILHGLRVVEGSAFVAAPLGGMTLAQLGADVIRFDPIGGGLDYKRWPVTLDGKHSLFWAGLNKGKRSIAIDIRHPRGQ
ELLTQLICAPGEHAGLFITNFPARGWLSYDELKRHRADLIMVNLVGRRDGGSEVDYTVNPQLGLPFMTGPVTTPDVVNHV
LPAWDIVTGQMIALGLLAAERHRRLTGEGQLVKIALKDVGLAMIGHLGMIAEVMINDTDRPRQGNYLYGAFGRDFETLDG
KRVMVVGLTDLQWKALGKATGLTDAFNALGARLGLNMDEEGDRFRARHEIAALLEPWFHARTLAEVRRIFEQHRVTWAPY
RTVREAIAQDPDCSTDNPMFAMVEQPGIGSYLMPGSPLDFTAVPRLPVQPAPRLGEHTDEILLEVLGLSEAEVGRLHDEG
IVAGPDRAA
>P0AEY7 1.6.5.10~~~mdaB~~~NADPH:quinone oxidoreductase MdaB~~~COG2249
MSNILIINGAKKFAHSNGQLNDTLTEVADGTLRDLGHDVRIVRADSDYDVKAEVQNFLWADVVIWQMPGWWMGAPWTVKK
YIDDVFTEGHGTLYASDGRTRKDPSKKYGSGGLVQGKKYMLSLTWNAPMEAFTEKDQFFHGVGVDGVYLPFHKANQFLGM
EPLPTFIANDVIKMPDVPRYTEEYRKHLVEIFG
>P0AEY5 1.6.5.10~~~mdaB~~~NADPH:quinone oxidoreductase MdaB~~~COG2249
MSNILIINGAKKFAHSNGQLNDTLTEVADGTLRDLGHDVRIVRADSDYDVKAEVQNFLWADVVIWQMPGWWMGAPWTVKK
YIDDVFTEGHGTLYASDGRTRKDPSKKYGSGGLVQGKKYMLSLTWNAPMEAFTEKDQFFHGVGVDGVYLPFHKANQFLGM
EPLPTFIANDVIKMPDVPRYTEEYRKHLVEIFG
>O32712 ~~~mdcC~~~Malonate decarboxylase acyl carrier protein~~~
MEQITLSFPASRALSGRALAGVVGSGDMEVLYTAAQSATLNVQITTSVDNSQARWQALFDRLNLINGLPAGQLIIHDFGA
TPGVARIRIEQVFEEAAHA
>O06925 ~~~mdcC~~~Malonate decarboxylase acyl carrier protein~~~
MEGMLNELNFKFKSENPVDVVLPKHHYGVVGSGDLEVLLKKHELEGAVEIRVVSPVRGFDHVWEKVLEKVISDAEVGNVA
IEINDNNATPVVVALRLAQALSEAKSAEQSVN
>Q4K4F7 ~~~mdcC~~~Malonate decarboxylase acyl carrier protein~~~COG3052
METLSFEFPAGQPGRGRALVGCVGSGDLEVLLEPGQPGKLSIQVQTSVNGSASRWQHLFERLFDGQTPPALLIDIHDFGA
TPGVVRLRLEQGFEEIGHD
>P71426 2.7.7.66~~~mdcG~~~Phosphoribosyl-dephospho-CoA transferase~~~
MSATPRPHDLVWLNHASALEDIAEPWVAQQWRAALPVVVRRDVDDQARVPVGVRGMKREQRAAGWVQARNIVRSVTPEML
VDREVLLHSPFVSQPPVQGAIALTLHRWPWGWGVTGSTGYALATEIPVLHAASDLDLLIRAPQPLDREALLEWQTRVAQL
PCRADTQVETPYGAFALNEWLRDGRALLKTSRGARLTATPWHREE
>Q89V40 2.1.1.334~~~mddA1~~~Methanethiol S-methyltransferase 1~~~COG2020
MERPIRHGAKTKQCRCLGTTELSMSQIDHQVHSIGPEVTGSRIFKFIAFLYGIAAYLVFFVTILYAIGFVMGLVVPKTID
TGTDTSTAEAVIINLLLMALFAVQHSVMARQRFKTWWTQFVSKPIERSTYVLFASLSLLLLFWQWRPLPTVIWEVEDPDL
AVTLVTVSFAGWVLVFTSTFIINHFELFGLHQVTNHLVGKEATPPRFKTPLLYKFVRHPIYLGFIVAFWAAPVMTAGHLL
FAAVTTIYIFVGIALEERDLIDLFGDEYRQYKQRVSMLIPWRRSV
>Q89I98 2.1.1.334~~~mddA2~~~Methanethiol S-methyltransferase 2~~~COG2020
MFARLAILLYAIVSYAAFTVSFLYALGFVGNYVVPKSIDVGSPTNLGEAILVNLLLMSLFAIQHSVMARPAFKRWWAKFL
PLACQRSTYVLLSSLILLLLFWQWRPIPTPVWQTSGIAAWLLIGVHWLGWLIAFASTHMIDHFDLFGLRQAFVAFRGTEI
SGQSFRTPLLYKIVRHPLMLGFLLAFWATPAMTAGHLLFALANTAYILVALQFEERDLIAEFGATYQDYRRRVPMLVPRL
FARRRTDDRKSPRPVGAPR
>B1WZQ6 2.1.1.334~~~mddA~~~Methanethiol S-methyltransferase~~~COG2020
MQQQQTLSNSFLSRTFVLLYGVFCYFTFLLTCVYAVGFIGNIFLPKSLDSKSQESLVTALLIDVGLLGIFALQHSVMARK
QFKAWWTRLIPKPMERSTYVLFSSLALMLVFWQWHPIGITIWNFDNLVGQIIFYSLFALGWVIVLVSTFLINHFDLFGLR
QVYLYFEEKEYTPLKFKTPAFYQYVRHPLYVGWFLVFWMTPIMTVAHLVFASVTTIYILVAIQLEEKDLVAIHGEKYENY
RRQVPMLIPFIGKKS
>O05883 2.1.1.334~~~mddA~~~Methanethiol S-methyltransferase~~~COG2020
MKRYLTIIYGAASYLVFLVAFGYAIGFVGDVVVPRTVDHAIAAPIGQAVVVNLVLLGVFAVQHSVMARQGFKRWWTRFVP
PSIERSTYVLLASVALLLLYWQWRTMPAVIWDVRQPAGRVALWALFWLGWATVLTSTFMINHFELFGLRQVYLAWRGKPY
TEIGFQAHLLYRWVRHPIMLGFVVAFWATPMMTAGHLLFAIGATGYILVALQFEERDLLAALGDQYRDYRREVSMLLPWP
HRHT
>A6VCX3 2.3.1.-~~~~~~L-methionine sulfoximine/L-methionine sulfone acetyltransferase~~~
MSASIRDAGVADLPGILAIYNDAVGNTTAIWNETPVDLANRQAWFDARARQGYPILVASDAAGEVLGYASYGDWRPFEGF
RGTVEHSVYVRDDQRGKGLGVQLLQALIERARAQGLHVMVAAIESGNAASIGLHRRLGFEISGQMPQVGQKFGRWLDLTF
MQLNLDPTRSAP
>Q9HUU7 2.3.1.-~~~pitA~~~L-methionine sulfoximine/L-methionine sulfone acetyltransferase~~~
MSASIRDAGVADLPGILAIYNDAVGNTTAIWNETPVDLANRQAWFDTRARQGYPILVASDAAGEVLGYASYGDWRPFEGF
RGTVEHSVYVRDDQRGKGLGVQLLQALIERARAQGLHVMVAAIESGNAASIGLHRRLGFEISGQMPQVGQKFGRWLDLTF
MQLNLDPTRSAP
>A0A0F6P9C0 2.1.1.334~~~mddA~~~Methanethiol S-methyltransferase~~~
MHTRTAPRVRPPSRAAKLAGLLYSLLSYLFFLMTLLYLIGFVGNVGVPKTIDSGPGASWPLALLVDVLLITLFAVQHSVM
ARKSFKQWWRPVVPAPIERATYVLASSVVLVVMFWLWQPIDLRVWQVESRLGSAVLTTLFWLGWGLILVATFLISHFELF
GVKQALDALRPAKPVDGSFRTPLLYKIVRHPMYMGFLMAFWATPEMTVGHLVFALTSTIYILIGTQLEEKDLVEIFGEKY
RNYQKNVGMLLPSLRRNPGSQD
>W6VBF4 2.1.1.334~~~mddA~~~Methanethiol S-methyltransferase~~~
MNPPNRTGHRFFVFSGKLAGLLYSLCCYLFFLLTALYLIGFLAGIGVPKDINSGPGITWPLAVLVDAILITLFAAQHSGM
ARKNFKRWWMRFIPATLERATYVLSSCLVLALLFVLWQPIATPVWNVESPWGKGLLIALFWLGWGIVLLATFLISHFELF
GVKQTLDAWRKRIPEKPAFKSPWLYKLVRHPLYVGFLIAFWATPDMTAGHLLFAILSTSYILIGAHLEEKDLVDSLGEVY
QSYQQEVGMLVPKRNQTKGR
>Q8ZPD3 2.3.1.-~~~yncA~~~L-methionine sulfoximine/L-methionine sulfone acetyltransferase~~~
MTIRFADKADCAAITEIYNHAVLHTAAIWNDRTVDTDNRLAWYEARQLLGYPVLVSEENGVVTGYASFGDWRSFDGFRYT
VEHSVYVHPAHQGKGLGRKLLSRLIDEARRCGKHVMVAGIESQNAASIRLHHSLGFTVTAQMPQVGVKFGRWLDLTFMQL
QLDEHAAPDAC
>Q99S98 ~~~sepA~~~Multidrug resistance efflux pump SepA~~~
MIVNYLKHKFYNLLTTMIVLFIFVLSGAIFLTFLGFGLYGLSRILIYFRLGDFTYNRSMYDNLLYYGSYIIFGYFIIFAV
EHLMDYFRKMLPENAYFRGATFHLISYTVATTLFYFIIHLNYVYINIDFWVIMVIIGFLYVCKLQFYPESKNLNNRK
>P0AEY8 ~~~mdfA~~~Multidrug transporter MdfA~~~COG2814
MQNKLASGARLGRQALLFPLCLVLYEFSTYIGNDMIQPGMLAVVEQYQAGIDWVPTSMTAYLAGGMFLQWLLGPLSDRIG
RRPVMLAGVVWFIVTCLAILLAQNIEQFTLLRFLQGISLCFIGAVGYAAIQESFEEAVCIKITALMANVALIAPLLGPLV
GAAWIHVLPWEGMFVLFAALAAISFFGLQRAMPETATRIGEKLSLKELGRDYKLVLKNGRFVAGALALGFVSLPLLAWIA
QSPIIIITGEQLSSYEYGLLQVPIFGALIAGNLLLARLTSRRTVRSLIIMGGWPIMIGLLVAAAATVISSHAYLWMTAGL
SIYAFGIGLANAGLVRLTLFASDMSKGTVSAAMGMLQMLIFTVGIEISKHAWLNGGNGLFNLFNLVNGILWLSLMVIFLK
DKQMGNSHEG
>Q9ZF99 1.1.1.37~~~mdh~~~Malate dehydrogenase~~~
MAKTPMRVAVTGAAGQICYSLLFRIANGDMLGKDQPVILQLLEIPNEKAQKALQGVMMEIDDCAFPLLAGMTAHADPMTA
FKDADVALLVGARPRGPGMERKDLLEANAQIFTVQGKAIDAVASRNIKVLVVGNPANTNAYIAMKSAPSLPAKNFTAMLR
LDHNRALSQIAAKTGKPVSSIEKLFVWGNHSPTMYADYRYAQIDGASVKDMINDDAWNRDTFLPTVGKRGAAIIDARGVS
SAASAANAAIDHIHDWVLGTAGKWTTMGIPSDGSYGIPEGVIFGFPVTTENGEYKIVQGLSIDAFSQERINVTLNELLEE
QNGVQHLLG
>Q6HSF4 1.1.1.37~~~mdh~~~Malate dehydrogenase~~~COG0039
MTIKRKKVSVIGAGFTGATTAFLLAQKELADVVLVDIPQLENPTKGKALDMLEASPVQGFDANIIGTSDYADTADSDVVV
ITAGIARKPGMSRDDLVATNSKIMKSITRDIAKHSPNAIIVVLTNPVDAMTYSVFKEAGFPKERVIGQSGVLDTARFRTF
IAQELNLSVKDITGFVLGGHGDDMVPLVRYSYAGGIPLETLIPKERLEAIVERTRKGGGEIVGLLGNGSAYYAPAASLVE
MTEAILKDQRRVLPAIAYLEGEYGYSDLYLGVPVILGGNGIEKIIELELLADEKEALDRSVESVRNVMKVLV
>P49814 1.1.1.37~~~mdh~~~Malate dehydrogenase~~~COG0039
MGNTRKKVSVIGAGFTGATTAFLIAQKELADVVLVDIPQLENPTKGKALDMLEASPVQGFDAKITGTSNYEDTAGSDIVV
ITAGIARKPGMSRDDLVSTNEKIMRSVTQEIVKYSPDSIIVVLTNPVDAMTYAVYKESGFPKERVIGQSGVLDTARFRTF
VAEELNLSVKDVTGFVLGGHGDDMVPLVRYSYAGGIPLETLIPKERIDAIVERTRKGGGEIVNLLGNGSAYYAPAASLTE
MVEAILKDQRRVLPTIAYLEGEYGYEGIYLGVPTIVGGNGLEQIIELELTDYERAQLNKSVESVKNVMKVLS
>Q2YLR9 1.1.1.37~~~mdh~~~Malate dehydrogenase~~~
MARNKIALIGSGMIGGTLAHLAGLKELGDVVLFDIAEGTPQGKGLDIAESSPVDGFDAKFTGANDYAAIEGADVVIVTAG
VPRKPGMSRDDLLGINLKVMEQVGAGIKKYAPEAFVICITNPLDAMVWALQKFSGLPAHKVVGMAGVLDSARFRYFLSEE
FNVSVEDVTVFVLGGHGDSMVPLARYSTVAGIPLPDLVKMGWTSQDKLDKIIQRTRDGGAEIVGLLKTGSAFYAPAASAI
QMAESYLKDKKRVLPVAAQLSGQYGVKDMYVGVPTVIGANGVERIIEIDLDKDEKAQFDKSVASVAGLCEACIGIAPSLK
>Q3JKE9 1.1.1.37~~~mdh~~~Malate dehydrogenase~~~
MAKPAKRVAVTGAAGQIAYSLLFRIANGDLLGKDQPVILQLLDLPQAQAAVKGVVMELDDCAFPLLAGVVITDDPKVAFK
DADVALLVGARPRSKGMERKDLLSANAEIFTVQGAALNEVASRDVKVLVVGNPANTNAYIAMKSAPDLPKKNFTAMLRLD
HNRALSQLAAKSGKPVASIEKLAVWGNHSPTMYPDFRFATAEGESLLKLINDDVWNRDTFIPTVGKRGAAIIEARGLSSA
ASAANAAIDHVRDWVLGTNGKWVTMGIPSDGSYGIPEDIIYGVPVICENGEYKRVEGLEIDAFSREKMDGTLAELLEERD
GVAHLLK
>P80536 1.1.1.37~~~mdh~~~Malate dehydrogenase~~~COG0039
MAKPAKRVAVTGAAGQIAYSLLFRIANGDLLGKDQPVILQLLDLPQAQAAVKGVVMELDDCAFPLLAGVVITDDPKVAFK
DADVALLVGARPRSKGMERKDLLSANAEIFTVQGAALNEVASRDVKVLVVGNPANTNAYIAMKSAPDLPKKNFTAMLRLD
HNRALSQLAAKSGKPVASIEKLAVWGNHSPTMYPDFRFATAEGESLLKLINDDVWNRDTFIPTVGKRGAAIIEARGLSSA
ASAANAAIDHVRDWVLGTNGKWVTMGIPSDGSYGIPEDIIYGVPVICENGEYKRVEGLEIDAFSREKMDGTLAELLEERD
GVAHLLK
>P80040 1.1.1.37~~~mdh~~~Malate dehydrogenase~~~COG0039
MRKKISIIGAGFVGSTTAHWLAAKELGDIVLLDFVEGVPQGKALDLYEASPIEGFDVRVTGTNNYADTANSDVIVVTSGA
PRKPGMSREDLIKVNADITRACISQAAPLSPNAVIIMVNNPLDAMTYLAAEVSGFPKERVIGQAGVLDAARYRTFIAMEA
GVSVEDVQAMLMGGHGDEMVPLPRFSTISGIPVSEFIAPDRLAQIVERTRKGGGEIVNLLKTGSAYYAPAAATAQMVEAV
LKDKKRVMPVAAYLTGQYGLNDIYFGVPVILGAGGVEKILELPLNEEEMALLNASAKAVRATLDTLKSL
>B3QPY9 1.1.1.37~~~mdh~~~Malate dehydrogenase~~~COG0039
MKITVIGAGNVGATTAFRIADKKLARELVLLDVVEGIPQGKGLDMYETGPVGLFDTKITGSNDYADTADSDIVIITAGLP
RKPGMTREDLLMKNAGIVKEVTDNIMKHSKNPIIIVVSNPLDIMTHVAWVRSGLPKERVIGMAGVLDAARFRSFIAMELG
VSMQDINACVLGGHGDAMVPVVKYTTVAGIPISDLLPAETIDKLVERTRNGGAEIVEHLKQGSAFYAPASSVVEMVESIV
LDRKRVLPCAVGLEGQYGIDKTFVGVPVKLGRNGVEQIYEINLDQADLDLLQKSAKIVDENCKMLESTIG
>P80039 1.1.1.37~~~mdh~~~Malate dehydrogenase~~~COG0039
MKITVIGAGNVGATTAFRLAEKQLARELVLLDVVEGIPQGKALDMYESGPVGLFDTKVTGSNDYADTANSDIVVITAGLP
RKPGMTREDLLSMNAGIVREVTGRIMEHSKNPIIVVVSNPLDIMTHVAWQKSGLPKERVIGMAGVLDSARFRSFIAMELG
VSMQDVTACVLGGHGDAMVPVVKYTTVAGIPVADLISAERIAELVERTRTGGAEIVNHLKQGSAFYAPATSVVEMVESIV
LDRKRVLTCAVSLDGQYGIDGTFVGVPVKLGKNGVEHIYEIKLDQSDLDLLQKSAKIVDENCKMLDASQG
>Q8NN33 1.1.1.37~~~mdh~~~Malate dehydrogenase~~~COG0039
MNSPQNVSTKKVTVTGAAGQISYSLLWRIANGEVFGTDTPVELKLLEIPQALGGAEGVAMELLDSAFPLLRNITITADAN
EAFDGANAAFLVGAKPRGKGEERADLLANNGKIFGPQGKAINDNAADDIRVLVVGNPANTNALIASAAAPDVPASRFNAM
MRLDHNRAISQLATKLGRGSAEFNNIVVWGNHSATQFPDITYATVGGEKVTDLVDHDWYVEEFIPRVANRGAEIIEVRGK
SSAASAASSAIDHMRDWVQGTEAWSSAAIPSTGAYGIPEGIFVGLPTVSRNGEWEIVEGLEISDFQRARIDANAQELQAE
REAVRDLL
>P61889 1.1.1.37~~~mdh~~~Malate dehydrogenase~~~COG0039
MKVAVLGAAGGIGQALALLLKTQLPSGSELSLYDIAPVTPGVAVDLSHIPTAVKIKGFSGEDATPALEGADVVLISAGVA
RKPGMDRSDLFNVNAGIVKNLVQQVAKTCPKACIGIITNPVNTTVAIAAEVLKKAGVYDKNKLFGVTTLDIIRSNTFVAE
LKGKQPGEVEVPVIGGHSGVTILPLLSQVPGVSFTEQEVADLTKRIQNAGTEVVEAKAGGGSATLSMGQAAARFGLSLVR
ALQGEQGVVECAYVEGDGQYARFFSQPLLLGKNGVEERKSIGTLSAFEQNALEGMLDTLKKDIALGEEFVNK
>Q25QU7 1.1.1.37~~~mdh~~~Malate dehydrogenase~~~
MKVTIVGAGNVGATCADVISYRGIASEVVLLDIKEGFAEGKALDIMQCATNTGFNTKVSGVTNDYSKTAGSDVVVITSGI
PRKPGMTREELIGINAGIVKTVAENVLKHSPNTIIVVVSNPMDTMTYLALKATGVPKNRIIGMGGALDSSRFRTYLSLAL
DKPANDISAMVIGGHGDTTMIPLTRLASYNGIPVTEFLSEEVLQKVAADTMVGGATLTGLLGTSAWYAPGASVAYLVDSI
LNDQKKMIACSVFVEGEYGQNDICIGVPCIIGKNGVEEILDIKLNDQEKALFAKSADAVRGMNDALKSILV
>P44427 1.1.1.37~~~mdh~~~Malate dehydrogenase~~~COG0039
MKVAVLGAAGGIGQALALLLKLQLPAGTDLSLYDIAPVTPGVAVDVSHIPTAVNVKGFSGEDPTPALEGADVVLISAGVA
RKPGMDRSDLFNINAGIVRGLIEKVAVTCPKACVGIITNPVNTTVAIAAEVLKKAGVYDKRKLFGVTTLDVLRSETFVAE
LKGLNVSRTSVPVIGGHSGVTILPLLSQVQYAKWNEDEIEPLTKRIQNAGTEVLNAKAGGGSATLSMAQAAARFARSLVK
GLSGETVVECTYVEGDGKYARFFSQPVRLGKEGVEEILPIGPLSNFEQQALENMLPTLRADIELGEKFING
>Q5ZT13 1.1.1.37~~~mdh~~~Malate dehydrogenase~~~COG0039
MTNNRVRVAVTGAAGQIGYALVFRIASGQMFGPNTEVELNLLELEPALPSLEGVAMELDDCAFPLLKRIVCTADLNKAMD
GVNWALLVGSVPRKQGMERSDLLQINGGIFTKQGQAINDYASDDVRVFVVGNPCNTNCLIAMNHAKDVPSDRFYAMTTLD
ELRARTQLAKKAGVDITAVTQMTIWGNHSATQYPDFYNAKINGTSAAQVINDETWLKETFVSTVQQRGAAVIKARGSSSA
ASAANAIITGVNHLVTDTPAGESFSMCRRSKGEYGVDEGLIFSFPCRREHGELKVVENLEFNDFGRERFNTTLNELRSER
DTVKSLGLLD
>Q65T37 1.1.1.37~~~mdh~~~Malate dehydrogenase~~~COG0039
MKVAVLGAAGGIGQALALLLKLQLPAGSSLSLYDVAPVTPGVAKDLSHIPTDVVVEGFAGTDPSEALKGADIVLISAGVA
RKPGMTRADLFGVNAGIIRSLTEKVAEQCPKACVGIITNPVNAMVAIAAEVLKKAGVYDKRKLFGITTLDILRAETFIAE
LKGLDPTRVTIPVIGGHSGVTILPLLSQVQNVEWSSEEEIIALTHRIQNAGTEVVEAKAGGGSATLSMAQAAARFALALV
KASQGAKVVECAYVEGDGKYARFFAQPVRLGTEGVEEYLTLGKLSAFEEKALNAMLETLQGDIKSGEDFING
>Q84FY8 1.1.1.37~~~mdh~~~Malate dehydrogenase~~~COG0039
MARSKIALIGAGQIGGTLAHLAGLKELGDVVLFDIVDGVPQGKALDIAESAPVDGFDAKYSGASDYSAIAGADVVIVTAG
VPRKPGMSRDDLIGINLKVMEAVGAGIKEHAPDAFVICITNPLDAMVWALQKFSGLPTNKVVGMAGVLDSARFRHFLAEE
FGVSVEDVTAFVLGGHGDDMVPLTRYSTVAGVPLTDLVKLGWTTQEKLDAMVERTRKGGGEIVNLLKTGSAFYAPAASAI
AMAESYLRDKKRVLPCAAYLDGQYGIDGLYVGVPVVIGENGVERVLEVTFNDDEKAMFEKSVNSVKGLIEACKSVNDKLA
>A9W386 1.1.1.37~~~mdh~~~Malate dehydrogenase~~~COG0039
MARSKIALIGAGQIGGTLAHLAGLKELGDVVLFDIVDGVPQGKALDIAESAPVDGFDAKYSGASDYSAIAGADVVIVTAG
VPRKPGMSRDDLIGINLKVMEAVGAGIKEHAPDAFVICITNPLDAMVWALQKFSGLPTNKVVGMAGVLDSARFRHFLAEE
FGVSVEDVTAFVLGGHGDDMVPLTRYSTVAGVPLTDLVKLGWTTQEKLDAMVERTRKGGGEIVNLLKTGSAFYAPAASAI
AMAESYLRDKKRVLPCAAYLDGQYGIDGLYVGVPVVIGENGVERVLEVTFNDDEKAMFEKSVNSVKGLIEACKSVNDKLA
>Q7X3X5 1.1.1.37~~~mdh~~~Malate dehydrogenase~~~
MKVAVLGAAGGIGQALALLLKTQLPAGSELSLYDIAPVTPGVAVDLSHIPTDVTITGFSGIDPTAALVGADVVLISAGVA
RKPGMDRSDLFNINAGIIKNLASKCAEVCPTACIGIITNPVNTTVPIAAEVLKQAGVYDKRKLFGITTLDVIRSETFVSA
LKGISLADVAVPVIGGHSGATILPLLSQVKGVEFTAEEIATLTTRIQNAGTEVVEAKAGGGSATLSMGHAAARFGLSLVR
ALQGEKGIVECTYVDGGSEHATFFAQPVLLGKNGVEEVLAYGDLSDFETNARDAMLEELKANITLGEEFVAG
>P48364 1.1.1.37~~~mdh~~~Malate dehydrogenase~~~
MKVAVLGAAGGIGQALALLLKTQLPAGSDLSLYDIAPVTPGVAVDLSHIPTDVTIAGFAGMDPTDALVGADVVLISAGVA
RKPGMDRSDLFNINAGIIKNLAGKCAEVCPNACIGIITNPVNTTVPIAAEVLKQAGVYDKRKLFGITTLDVIRSETFVSA
LKGISLADVEVPVIGGHSGVTILPLLSQVKGVEFTAEEVVALTARIQNAGTEVVEAKAGGGSATLSMGQAAARFGLSLVR
ALQGEKGIVECTYVDGGSEHATFFAQPVLLGKNGVEEVLAYGELSEFETNARDAMLEELKANITLGEEFVAG
>P9WK13 1.1.1.37~~~mdh~~~Malate dehydrogenase~~~COG0039
MSASPLKVAVTGAAGQIGYSLLFRLASGSLLGPDRPIELRLLEIEPALQALEGVVMELDDCAFPLLSGVEIGSDPQKIFD
GVSLALLVGARPRGAGMERSDLLEANGAIFTAQGKALNAVAADDVRVGVTGNPANTNALIAMTNAPDIPRERFSALTRLD
HNRAISQLAAKTGAAVTDIKKMTIWGNHSATQYPDLFHAEVAGKNAAEVVNDQAWIEDEFIPTVAKRGAAIIDARGASSA
ASAASATIDAARDWLLGTPADDWVSMAVVSDGSYGVPEGLISSFPVTTKGGNWTIVSGLEIDEFSRGRIDKSTAELADER
SAVTELGLI
>P0C890 1.1.1.37~~~mdh~~~Malate dehydrogenase~~~
MKITVIGAGNVGATTAFRIADKKLARELVLLDVVEGIPQGKGLDMYETGPVGLFDTKITGSNDYADTADSDIVIITAGLP
RKPGMTREDLLMKNAGIVKEVTDNIMKHSKNPIIIVVSNPLDIMTHVAWVRSGLPKERVIGMAGVLDAARFRSFIAMELG
VSMQDINACVLGGHGDAMVPVVKYTTVAGIPISDLLPAETIDKLVERTRNGGAEIVEHLKQGSAFYSPGSSVVEMVESIV
LDRKRVLPCAVGLEGQYGIDKTFVGVPVKLGRNGVEQIYEINLDQADLDLLQKSAKIVDENCKMLESTIG
>P80458 1.1.1.37~~~mdh~~~Malate dehydrogenase~~~COG0039
MARDKIALIGSGQIGGTLAHLVGLKELGDVVLFDIAEGVPQGKALDIAESSPVDGFDSKLTGANSYEAIEGARVVIVTAG
VPRKPGMSRDDLLSINLKVMEQVGAGIKKYAPDAFVICITNPLDAMVWALQKASGLPAKKVVGMAGVLDSARFRYFLADE
FNVSVEDVTAFVLGGHGDTMVPLVKYSTVAGIPLPDLVKMGWTSQARLDEIVDRTRNGGAEIVNLLKTGSAFYAPASSAI
AMAESYLKDKKRVVPVAAHLNGEYGVKDMYVGVPVVIGDKGVERIVEIELAGKDKEAFDRSVAAVQGLVEACKKIAPDLL
GR
>Q2S289 1.1.1.37~~~mdh~~~Malate dehydrogenase~~~COG0039
MKVTVIGAGNVGATVAECVARQDVAKEVVMVDIKDGMPQGKALDMRESSPIHGFDTRVTGTNDYGPTEDSDVCIITAGLP
RSPGMSRDDLLAKNTEIVGGVTEQFVEGSPDSTIIVVANPLDVMTYVAYEASGFPTNRVMGMAGVLDTGRFRSFIAEELD
VSVRDVQALLMGGHGDTMVPLPRYTTVGGIPVPQLIDDARIEEIVERTKGAGGEIVDLMGTSAWYAPGAAAAEMTEAILK
DNKRILPCAAYCDGEYGLDDLFIGVPVKLGAGGVEEVIEVDLDADEKAQLKTSAGHVHSNLDDLQRLRDEGKIG
>P82177 1.1.1.37~~~mdh~~~Malate dehydrogenase~~~COG0039
MKVAVLGAAGGIGQALALLLKTQLPAGSKLSLYDIAPVTPGVAVDLSHIPTAVEIKGFAGEDPTPALVGADVVLISAGVA
RKPGMDRSDLFNINAGIVRNLIEKVAVTCPKALVGIITNPVNTTVAIAAEVMKKAGVYDKNRLFGVTTLDVIRSETFIAE
LKGLNVADVKINVIGGHSGVTILPLLSQVEGVTFSDEEVASLTKRIQNAGTEVVEAKAGGGSATLSMGQAACRFGMSLVR
GLQGEANVVECAYVDGGSEHAEFFAQPVLLGKNGIEKVLPYGEVSAFEANARDSMLDTLKGDIKLGVDFVK
>Q82HS2 1.1.1.37~~~mdh~~~Malate dehydrogenase~~~COG0039
MTRTPVNVTVTGAAGQIGYALLFRIASGQLLGADVPVKLRLLEITPALKAAEGTAMELDDCAFPLLQGIDITDDPNVAFD
GTNVGLLVGARPRTKGMERGDLLSANGGIFKPQGKAINDNAADDVKILVVGNPANTNALIAQAAAPDVPAERFTAMTRLD
HNRALTQLAKKTGSTVADIKRLTIWGNHSATQYPDIFHASVAGKNAAEVVNDEKWLAEDFIPTVAKRGAAIIEARGASSA
ASAANAAIDHVYTWVNGTADGDWTSMGIPSDGSYGVPEGLISSFPVTTKDGRYEIVQGLEINEFSRARIDASVKELEEER
EAVRALGLI
>Q9K3J3 1.1.1.37~~~mdh~~~Malate dehydrogenase~~~COG0039
MTRTPVNVTVTGAAGQIGYALLFRIASGQLLGADVPVKLRLLEITPALKAAEGTAMELDDCAFPLLQGIEITDDPNVAFD
GANVALLVGARPRTKGMERGDLLEANGGIFKPQGKAINDHAADDIKVLVVGNPANTNALIAQAAAPDVPAERFTAMTRLD
HNRALTQLAKKTGSTVADIKRLTIWGNHSATQYPDIFHATVAGKNAAETVNDEKWLADEFIPTVAKRGAAIIEARGASSA
ASAANAAIDHVYTWVNGTAEGDWTSMGIPSDGSYGVPEGIISSFPVTTKDGSYEIVQGLDINEFSRARIDASVKELSEER
EAVRGLGLI
>A0LFF8 1.1.1.37~~~mdh~~~Malate dehydrogenase~~~COG0039
MAKKPVRVTVTGAAGQIGYALLFRVASGQMLGPDQPIILQMLELPIDKVQAALKGVMMELEDCAFPLLADMIGTGDPKVA
FKDSDYALLVGARPRGPGMERKDLLLENAKIFIEQGKAMNAVASRDIRVIVVGNPANTNAWIAMKSAPDLPKGNFTAMLR
LDHNRAKSQLATRTGKPVASVEKMIVWGNHSPTMYPDIRFCTVDGQPAVKLVNDEAWYRNEYIPKVGKRGAAIIEARGLS
SAASAANAAIDHMHDWALGTNGKWVTMGLPSDGSYGIPEGTMYGVPVTCTPGKYERVKGLEIDAFSREKMDFTLKELTEE
QAGVKEMVK
>A0A0S3QTC6 1.1.1.37~~~mdh~~~Malate dehydrogenase~~~
MGKRAKITVVGAGHVGEHVAMFCAIKELGDVVLIDIVEDMPQGKALDMFEATPLEGWDSRIVGTNDYADTADSDIVVITA
GSPRKPGMSRDDLLEINAKIIKAVTEQVAKYSPNAVIIVVTNPLDAMTQLAWNVSGFPKNRVLGQAGNLDSARFRAFIAM
ELGVSVKEISAMVLGGHGDDMVPLPRFTTVSGIPITELIPPDRIEALVQRTRVGGGEIVKLLKTGSAYYAPALATVEMVE
AILKDQKRIQPCAALCEGEYGINGVYCGVPCLLGANGVEKIIELKLTDDELKALQASAGRVKGLIDKLTEWGYIK
>P10584 1.1.1.37~~~mdh~~~Malate dehydrogenase~~~
MKAPVRVAVTGAAGQIGYSLLFRIAAGEMLGKDQPVILQLLEIPQAMKALEGVVMELEDCAFPLLAGLEATDDPKVAFKD
ADYALLVGAAPRKAGMERRDLLQVNGKIFTEQGRALAEVAKKDVKVLVVGNPANTNALIAYKNAPGLNPRNFTAMTRLDH
NRAKAQLAKKTGTGVDRIRRMTVWGNHSSTMFPDLFHAEVDGRPALELVDMEWYEKVFIPTVAQRGAAIIQARGASSAAS
AANAAIEHIRDWALGTPEGDWVSMAVPSQGEYGIPEGIVYSFPVTAKDGAYRVVEGLEINEFARKRMEITAQELLDEMEQ
VKALGLI
>Q8DEC2 1.1.1.37~~~mdh~~~Malate dehydrogenase~~~
MKVAVIGAAGGIGQALALLLKNRLPAGSDLALYDIAPVTPGVAADLSHIPTHVSIKGYAGEDPTPALEGADVVLISAGVA
RKPGMDRADLFNVNAGIVKSLAERIAVVCPNACIGIITNPVNTTVPIAAEVLKKAGVYDKRKLFGVTTLDVIRSETFVAE
LKGQDPGEVRVPVIGGHSGVTILPLLSQVEGVEFSDEEIAALTKRIQNAGTEVVEAKAGGGSATLSMGQAACRFGLALVK
ALQGEEVIEYAYVEGNGEHASFFAQPVKLGKEGVEEILPYGELSDFEKAALDGMLETLNSDIQIGVDFVK
>P0AAG5 7.6.2.2~~~mdlB~~~Multidrug resistance-like ATP-binding protein MdlB~~~COG1132
MRSFSQLWPTLKRLLAYGSPWRKPLGIAVLMMWVAAAAEVSGPLLISYFIDNMVAKNNLPLKVVAGLAAAYVGLQLFAAG
LHYAQSLLFNRAAVGVVQQLRTDVMDAALRQPLSEFDTQPVGQVISRVTNDTEVIRDLYVTVVATVLRSAALVGAMLVAM
FSLDWRMALVAIMIFPVVLVVMVIYQRYSTPIVRRVRAYLADINDGFNEIINGMSVIQQFRQQARFGERMGEASRSHYMA
RMQTLRLDGFLLRPLLSLFSSLILCGLLMLFGFSASGTIEVGVLYAFISYLGRLNEPLIELTTQQAMLQQAVVAGERVFE
LMDGPRQQYGNDDRPLQSGTIEVDNVSFAYRDDNLVLKNINLSVPSRNFVALVGHTGSGKSTLASLLMGYYPLTEGEIRL
DGRPLSSLSHSALRQGVAMVQQDPVVLADTFLANVTLGRDISEERVWQALETVQLAELARSMSDGIYTPLGEQGNNLSVG
QKQLLALARVLVETPQILILDEATASIDSGTEQAIQHALAAVREHTTLVVIAHRLSTIVDADTILVLHRGQAVEQGTHQQ
LLAAQGRYWQMYQLQLAGEELAASVREEESLSA
>P20932 1.1.99.31~~~mdlB~~~(S)-mandelate dehydrogenase~~~
MSQNLFNVEDYRKLRQKRLPKMVYDYLEGGAEDEYGVKHNRDVFQQWRFKPKRLVDVSRRSLQAEVLGKRQSMPLLIGPT
GLNGALWPKGDLALARAATKAGIPFVLSTASNMSIEDLARQCDGDLWFQLYVIHREIAQGMVLKALHTGYTTLVLTTDVA
VNGYRERDLHNRFKIPMSYSAKVVLDGCLHPRWSLDFVRHGMPQLANFVSSQTSSLEMQAALMSRQMDASFNWEALRWLR
DLWPHKLLVKGLLSAEDADRCIAEGADGVILSNHGGRQLDCAISPMEVLAQSVAKTGKPVLIDSGFRRGSDIVKALALGA
EAVLLGRATLYGLAARGETGVDEVLTLLKADIDRTLAQIGCPDITSLSPDYLQNEGVTNTAPVDHLIGKGTHA
>P20906 4.1.1.7~~~mdlC~~~Benzoylformate decarboxylase~~~
MASVHGTTYELLRRQGIDTVFGNPGSNELPFLKDFPEDFRYILALQEACVVGIADGYAQASRKPAFINLHSAAGTGNAMG
ALSNAWNSHSPLIVTAGQQTRAMIGVEALLTNVDAANLPRPLVKWSYEPASAAEVPHAMSRAIHMASMAPQGPVYLSVPY
DDWDKDADPQSHHLFDRHVSSSVRLNDQDLDILVKALNSASNPAIVLGPDVDAANANADCVMLAERLKAPVWVAPSAPRC
PFPTRHPCFRGLMPAGIAAISQLLEGHDVVLVIGAPVFRYHQYDPGQYLKPGTRLISVTCDPLEAARAPMGDAIVADIGA
MASALANLVEESSRQLPTAAPEPAKVDQDAGRLHPETVFDTLNDMAPENAIYLNESTSTTAQMWQRLNMRNPGSYYFCAA
GGLGFALPAAIGVQLAEPERQVIAVIGDGSANYSISALWTAAQYNIPTIFVIMNNGTYGALRWFAGVLEAENVPGLDVPG
IDFRALAKGYGVQALKADNLEQLKGSLQEALSAKGPVLIEVSTVSPVK
>Q84DC3 1.2.1.28~~~mdlD~~~NAD(P)-dependent benzaldehyde dehydrogenase~~~
MNYLSPAKIDSLFSAQKAYFATRATADVGFRKQSLERLKEAVINNKEALYSALAEDLGKPKDVVDLAEIGAVLHEIDFAL
AHLDEWVAPVSVPSPDIIAPSECYVVQEPYGVTYIIGPFNYPVNLTLTPLIGAIIGGNTCIIKPSETTPETSAVIEKIIA
EAFAPEYVAVIQGGRDENSHLLSLPFDFIFFTGSPNVGKVVMQAAAKHLTPVVLELGGKCPLIVLPDADLDQTVNQLMFG
KFINSGQTCIAPDYLYVHYSVKDALLERLVERVKTELPEINSTGKLVTERQVQRLVSLLEATQGQVLVGSQADVSKRALS
ATVVDGVEWNDPLMSEELFGPILPVLEFDSVRTAIDQVNKHHPKPLAVYVFGKDMDVAKGIINQIQSGDAQVNGVMLHAF
SPYLPFGGIGASGMGEYHGHFSYLTFTHKKSVRIVP
>E8WYN5 4.1.2.10~~~~~~(R)-mandelonitrile lyase~~~COG1917
MEIKRVGSQASGKGPADWFTGTVRIDPLFQAPDPALVAGASVTFEPGARTAWHTHPLGQTLIVTAGCGWAQREGGAVEEI
HPGDVVWFSPGEKHWHGAAPTTAMTHLAIQERLDGKAVDWMEHVTDEQYRR
>G8FRC5 1.14.13.229~~~mdpJ~~~Tert-butanol monooxygenase / tert-amyl alcohol desaturase oxygenase subunit~~~
MGNREPLAAAGQGTAYSGYRLRDLQNVAPTNLEILRTGPGTPMGEYMRRYWQPVCLSQELTDVPKAIRILHEDLVAFRDR
RGNVGVLHRKCAHRGASLEFGIVQERGIRCCYHGWHFDVDGSLLEAPAEPPDTKLKETVCQGAYPAFERNGLVFAYMGPA
DRRPEFPVFDGYVLPKGTRLIPFSNVFDCNWLQVYENQIDHYHTALLHNNMTVAGVDAKLADGATLQGGFGEMPIIDWHP
TDDNNGMIFTAGRRLSDDEVWIRISQMGLPNWMQNAAIVAAAPQRHSGPAMSRWQVPVDDEHSIAFGWRHFNDEVDPEHR
GREEECGVDKIDFLIGQTRHRPYEETQRVPGDYEAIVSQGPIALHGLEHPGRSDVGVYMCRSLLRDAVAGKAPPDPVRVK
AGSTDGQTLPRYASDSRLRIRRRPSREADSDVIRKAAHQVFAIMKECDELPVVQRRPHVLRRLDEIEASL
>G8FRC6 1.-.-.-~~~mdpK~~~Tert-butanol monooxygenase / tert-amyl alcohol desaturase reductase subunit~~~
MYQLSHTGKYPKTALNLRVRQITYQGIGINAYEFVREDGGELEEFTAGAHVDLYFRDGRVRQYSLCNDPAERRRYLIAVL
RDDNGRGGSIAIHERVHTQRLVAVGHPRNNFPLIEGAPHQVLLAGGIGITPLKAMVHRLERMGADYTLHYCAKSSAHAAF
QEELAPMAAKGRVIMHFDGGNPAKGLDIAALLRRYEPGWQLYYCGPPGFMEACTRACTHWPAEAVHFEYFVGAPVLPDDG
VPQDIGSDALALGFQIKIASTGTVLTVPNDKSIAQVLGEHGIEVPTSCQSGLCGTCKVRYLAGDVEHRDYLLSAEARTQF
LTTCVSRSKGATLVLDL
>A1KF14 7.6.2.-~~~~~~Multidrug efflux ATP-binding/permease protein BCG_0231~~~
MRTNCWWRLSGYVMRHRRDLLLGFGAALAGTVIAVLVPLVTKRVIDDAIAADHRPLAPWAVVLVAAAGATYLLTYVRRYY
GGRIAHLVQHDLRMDAFQALLRWDGRQQDRWSSGQLIVRTTNDLQLVQALLFDVPNVLRHVLTLLLGVAVMTWLSVPLAL
LAVLLVPVIGLIAHRSRRLLAAATHCAQEHKAAVTGVVDAAVCGIRVVKAFGQEERETVKLVMASRALYAAQLRVARLNA
HFGPLLQTLPALGQMAVFALGGWMAAQGSITVGTFVAFWACLTLLARPACDLAGMLTIAQQARAGAVRVLELIDSRPTLV
DGTKPLSLEARLSLEFQRVSFGYVADRPVLREISLSVRAGETLAVVGAPGSGKSTLASLATRCYDVTQGAVRIGGQDVRE
LTLDSLRSAIGLVPEDAVLFSGTIGANIAYGRPDATPEQIATAARAAHIEEFVNTLPDGYQTAVGARGLTLSGGQRQRIA
LARALLHQPRLLIMDDPTSAVDAVIECGIQEVLREAIADRTAVIFTRRRSMLTLADRVAVLDSGRLLDVGTPDEVWERCP
RYRELLSPAPDLADDLVVAERSPVCRPVAGLGTKAAQHTNVHNPGPHDHPPGPDPLRRLLREFRGPLALSLLLVAVQTCA
GLLPPLLIRHGIDVGIRRHVLSALWWAALAGTATVVIRWVVQWGSAMVAGYTGEQVLFRLRSVVFAHAQRLGLDAFEDDG
DAQIVTAVTADVEAIVAFLRTGLVVAVISVVTLVGILVALLAIRARLVLLIFTTMPVLALATWQFRRASNWTYRRARHRL
GTVTATLREYAAGLRIAQAFRAEYRGLQSYFAHSDDYRRLGVRGQRLLALYYPFVALLCSLATTLVLLDGAREVRAGVIS
VGALVTYLLYIELLYTPIGELAQMFDDYQRAAVAAGRIRSLLSTRTPSSPAARPVGTLRGEVVFDAVHYSYRTREVPALA
GINLRIPAGQTVVFVGSTGSGKSTLIKLVARFYDPTHGTVRVDGCDLREFDVDGYRNRLGIVTQEQYVFAGTVRDAIAYG
RPDATDAQVERAAREVGAHPMITALDNGYLHQVTAGGRNLSAGQLQLLALARARLVDPDILLLDEATVALDPATEAVVQR
ATLTLAARRTTLIVAHGLAIAEHADRIVVLEHGTVVEDGAHTELLAAGGHYSRLWAAHTRLCSPEITQLQCIDA
>O53645 7.6.2.-~~~~~~Multidrug efflux ATP-binding/permease protein Rv0194~~~COG1132
MRTNCWWRLSGYVMRHRRDLLLGFGAALAGTVIAVLVPLVTKRVIDDAIAADHRPLAPWAVVLVAAAGATYLLMYVRRYY
GGRIAHLVQHDLRMDAFQALLRWDGRQQDRWSSGQLIVRTTNDLQLVQALLFDVPNVLRHVLTLLLGVAVMTWLSVPLAL
LAVLLVPVIGLIAHRSRRLLAAATHCAQEHKAAVTGVVDAAVCGIRVVKAFGQEERETVKLVTASRALYAAQLRVARLNA
HFGPLLQTLPALGQMAVFALGGWMAAQGSITVGTFVAFWACLTLLARPACDLAGMLTIAQQARAGAVRVLELIDSRPTLV
DGTKPLSPEARLSLEFQRVSFGYVADRPVLREISLSVRAGETLAVVGAPGSGKSTLASLATRCYDVTQGAVRIGGQDVRE
LTLDSLRSAIGLVPEDAVLFSGTIGANIAYGRPDATPEQIATAARAAHIEEFVNTLPDGYQTAVGARGLTLSGGQRQRIA
LARALLHQPRLLIMDDPTSAVDAVIECGIQEVLREAIADRTAVIFTRRRSMLTLADRVAVLDSGRLLDVGTPDEVWERCP
RYRELLSPAPDLADDLVVAERSPVCRPVAGLGTKAAQHTNVHNPGPHDHPPGPDPLRRLLREFRGPLALSLLLVAVQTCA
GLLPPLLIRHGIDVGIRRHVLSALWWAALAGTATVVIRWVVQWGSAMVAGYTGEQVLFRLRSVVFAHAQRLGLDAFEDDG
DAQIVTAVTADVEAIVAFLRTGLVVAVISVVTLVGILVALLAIRARLVLLIFTTMPVLALATWQFRRASNWTYRRARHRL
GTVTATLREYAAGLRIAQAFRAEYRGLQSYFAHSDDYRRLGVRGQRLLALYYPFVALLCSLATTLVLLDGAREVRAGVIS
VGALVTYLLYIELLYTPIGELAQMFDDYQRAAVAAGRIRSLLSTRTPSSPAARPVGTLRGEVVFDAVHYSYRTREVPALA
GINLRIPAGQTVVFVGSTGSGKSTLIKLVARFYDPTHGTVRVDGCDLREFDVDGYRNRLGIVTQEQYVFAGTVRDAIAYG
RPDATDAQVERAAREVGAHPMITALDNGYLHQVTAGGRNLSAGQLQLLALARARLVDPDILLLDEATVALDPATEAVVQR
ATLTLAARRTTLIVAHGLAIAEHADRIVVLEHGTVVEDGAHTELLAAGGHYSRLWAAHTRLCSPEITQLQCIDA
>A0A1C7E424 ~~~mdrP~~~Na(+), Li(+), K(+)/H(+) antiporter~~~
MKIKDWNRSLKVRLVGEFFMNTSFWMVFPFLAIYFAEEFGKGLAGMLLIISQIFSVAANLFGGYFADRFGRKRMLVSAAV
AQGFAFLLFALANSPWLTSPELSFVAFTLAGMCGSLYWPASQAMIADVIPEKYRSDVFAVFYTTLNIAVVIGPLFGAVLF
FSYRFELLLTVAIISVLLGLLLRFYTEETLSAEVLEKWKEGNATGWRGALLTQVKDYGIILKDRVFLLFVIAGILGAQTF
MQLDLVIPVYLKETIDRQTILDFLGREWSVTGETSFGILLAENGLIVALLTVVITRWMTKFPEKWVFFFSALLFGLSMAI
FPMTSWFWIFFVAMAVFTFAELMVVGLQQSFISKLAPESMRGQYFAAASLRYTIGRMIAPISIPMTAWFGFGWTFIILGS
FAVLSGFVYLWMFHLYDKRTVANI
>O67214 2.4.1.361~~~mds~~~GDP-mannose:di-myo-inositol-1,3'-phosphate beta-1,2-mannosyltransferase~~~COG0438
MNVGIFSRWNATCGVSLHAEMIGRELLRRGYPITVFAPYLESASRWWHHKLIRPDEEYVVRCYEELSPDGKEGKIDIEKV
LEREIDFLIVESYEKLPYKDVEKLVKILKDKGIPSIAIIHEGDYEDIRYTDMNIFEKVCVFDERYVKEVLKDRVSEEKVE
IIPYPCYPVREGSREFAEDGVIKFFSFGRQPKEEYCPYIEGLKVFKRDFPNVKYRIVRAMEPLKIFEDFVEQEERILDYE
EIVKELHSADFHLLPKGNTKRVVVSSTLYQVLGTLTLTVVPDNRFFETLPHGEEAPVIFYRDVLELVKELKKASADEEYR
KKIRENASKFVEENSVERITDRFENLINSILVKNVH
>Q9WYJ4 2.4.1.361~~~mds~~~GDP-mannose:di-myo-inositol-1,3'-phosphate beta-1,2-mannosyltransferase~~~COG0438
MKIGFLSRWGATCGVGMHAEILAREFIRMGHEVVVFAPTEESASKEVKYYKRTEAQDPEFVKREIYTEVDNVTEEGWVKE
EEILKENLDLLIIETFWRVPVKPLTRLIEKLKIPVISVFHEANIFKAREVVKLPCDKIVVFDRRFYDEILEFYEIPREKV
EVISYPVMKPYDAEPERPVSEDKFLFFSFGRQPVEEYCDFLNALKKLRKRFDNVHYWIIRSDGRVDYEAEWITQWQKRPT
VEKLYSYLKGSNVHLLPKGNTPNVVVSSTLYQIIASETPIVIRDSRFVETIETDVYGFGPIVKYRNIHDLVHKLELLMLD
RELVEDIKKEVRVFVEKYGGDKIAQEFLDLAKTITK
>P76397 ~~~mdtA~~~Multidrug resistance protein MdtA~~~COG0845
MKGSYKSRWVIVIVVVIAAIAAFWFWQGRNDSRSAAPGATKQAQQSPAGGRRGMRSGPLAPVQAATAVEQAVPRYLTGLG
TITAANTVTVRSRVDGQLIALHFQEGQQVKAGDLLAEIDPSQFKVALAQAQGQLAKDKATLANARRDLARYQQLAKTNLV
SRQELDAQQALVSETEGTIKADEASVASAQLQLDWSRITAPVDGRVGLKQVDVGNQISSGDTTGIVVITQTHPIDLVFTL
PESDIATVVQAQKAGKPLVVEAWDRTNSKKLSEGTLLSLDNQIDATTGTIKVKARFNNQDDALFPNQFVNARMLVDTEQN
AVVIPTAALQMGNEGHFVWVLNSENKVSKHLVTPGIQDSQKVVIRAGISAGDRVVTDGIDRLTEGAKVEVVEAQSATTPE
EKATSREYAKKGARS
>A7ZNP8 ~~~mdtB~~~Multidrug resistance protein MdtB~~~
MQVLPPSSTGGPSRLFIMRPVATTLLMVAILLAGIIGYRALPVSALPEVDYPTIQVVTLYPGASPDVMTSAVTAPLERQF
GQMSGLKQMSSQSSGGASVITLQFQLTLPLDVAEQEVQAAINAATNLLPSDLPNPPVYSKVNPADPPIMTLAVTSTAMPM
TQVEDMVETRVAQKISQISGVGLVTLSGGQRPAVRVKLNAQAIAALGLTSETVRTAITGANVNSAKGSLDGPSRAVTLSA
NDQMQSAEEYRQLIIAYQNGAPIRLGDVATVEQGAENSWLGAWANKEQAIVMNVQRQPGANIISTADSIRQMLPQLTESL
PKSVKVTVLSDRTTNIRASVDDTQFELMMAIALVVMIIYLFLRNIPATIIPGVAVPLSLIGTFAVMVFLDFSINNLTLMA
LTIATGFVVDDAIVVIENISRYIEKGEKPLAAALKGAGEIGFTIISLTFSLIAVLIPLLFMGDIVGRLFREFAITLAVAI
LISAVVSLTLTPMMCARMLSQESLRKQNRFSRASEKMFDRIIAAYGRGLAKVLNHPWLTLSVALSTLLLSVLLWVFIPKG
FFPVQDNGIIQGTLQAPQSSSFANMAQRQRQVADVILQDPAVQSLTSFVGVDGTNPSLNSARLQINLKPLDERDDRVQKV
IARLQTAVDKVPGVDLFLQPTQDLTIDTQVSRTQYQFTLQATSLDALSTWVPQLMEKLQQLPQLSDVSSDWQDKGLVAYV
NVDRDSASRLGISMADVDNALYNAFGQRLISTIYTQANQYRVVLEHNTENTPGLAALDTIRLTSSDGGVVPLSSIAKIEQ
RFAPLSINHLDQFPVTTISFNVPDNYSLGDAVQAIMDTEKTLNLPVDITTQFQGSTLAFQSALGSTVWLIVAAVVAMYIV
LGILYESFIHPITILSTLPTAGVGALLALMIAGSELDVIAIIGIILLIGIVKKNAIMMIDFALAAEREQGMSPRDAIYQA
CLLRFRPILMTTLAALLGALPLMLSTGVGAELRRPLGIGMVGGLIVSQVLTLFTTPVIYLLFDRLALWTKSRFARHEEEA
>B7UTB3 ~~~mdtB~~~Multidrug resistance protein MdtB~~~
MQVLPPSSTGGPSRLFIMRPVATTLLMVAILLAGIIGYQALPVSALPEVDYPTIQVVTLYPGASPDVMTSAVTAPLERQF
GQMSGLKQMSSQSSGGASVITLQFQLTLPLDVAEQEVQAAINAATNLLPSDLPNPPVYSKVNPADPPIMTLAVTSTAMPM
TQVEDMVETRVAQKISQISGVGLVTLSGGQRPAVRVKLNAQAIAALGLTSETVRTAITGANVNSAKGSLDGPSRAVTLSA
NDQMQSAEEYRQLIIAYQNGAPIRLGDVATVEQGAENSWLGAWANKEQAIVMNVQRQPGANIISTADSIRQMLPQLTESL
PKSVKVTVLSDRTTNIRASVDDTQFELMMAIALVVMIIYLFLRNIPATIIPGVAVPLSLIGTFAVMVFLDFSINNLTLMA
LTIATGFVVDDAIVVIENISRYIEKGEKPLAAALKGAGEIGFTIISLTFSLIAVLIPLLFMGDIVGRLFREFAITLAVAI
LISAVVSLTLTPMMCARMLSQESLRKQNRFSRASEKMFERIIAAYGQGLAKVLNHPWLTLSVALSTLLLSVLLWVFIPKG
FFPVQDNGIIQGTLQAPQSSSFANMAQRQRQVADVILQDPAVQSLTSFVGVDGTNPSLNSARLQINLKPLDERDDRVQKV
IARLQTAVDKVPGVDLFLQPTQDLTIDTQVSRTQYQFTLQATSLDALSTWVPQLMEKLQQLPQLSDVSSDWQDKGLVAYV
NVDRDSASRLGISMADVDNALYNAFGQRLISTIYTQANQYRVVLEHNTENTPGLAALDTIRLTSSDGGVVPLSSIAKIEQ
RFAPLSINHLDQFPVTTISFNVPDNYSLGDAVQAIMDTEKTLNLPVDITTQFQGSTLAFQSALGSTVWLIVAAVVAMYIV
LGILYESFIHPITILSTLPTAGVGALLALLIAGSELDVIAIIGIILLIGIVKKNAIMMIDFALAAEREQGMSPRDAIYQA
CLLRFRPILMTTLAALLGALPLMLSTGVGAELRRPLGIGMVGGLIVSQVLTLFTTPVIYLLFDRLALWTKSRFARHEEEA
>B7ME87 ~~~mdtB~~~Multidrug resistance protein MdtB~~~
MQVLPPSSTGGPSRLFIMRPVATTLLMVAILLAGIIGYRALPVSALPEVDYPTIQVVTLYPGASPDVMTSAVTAPLERQF
GQMSGLKQMSSQSSGGASVITLQFQLTLPLDVAEQEVQAAINAATNLLPSDLPNPPVYSKVNPADPPIMTLAVTSTAMPM
TQVEDMVETRVAQKISQISGVGLVTLSGGQRPAVRVKLNAQAIAALGLTSETVRTAITGANVNSAKGSLDGPSRAVTLSA
NDQMQSAEEYRQLIIAYQNGAPIRLGDVATVEQGAENSWLGAWANKEQAIVMNVQRQPGANIISTADSIRQMLPQLTESL
PKSVKVTVLSDRTTNIRASVDDTQFELMMAIALVVMIIYLFLRNIPATIIPGVAVPLSLIGTFAVMVFLDFSINNLTLMA
LTIATGFVVDDAIVVIENISRYIEKGEKPLAAALKGAGEIGFTIISLTFSLIAVLIPLLFMGDIVGRLFREFAITLAVAI
LISAVVSLTLTPMMCARMLSQESLRKQNRFSRASEKMFDRIIAAYGRGLAKVLNHPWLTLSVALSTLLLSVLLWVFIPKG
FFPVQDNGIIQGTLQAPQSSSFANMAQRQRQVADVILQDPAVQSLTSFVGVDGTNPSLNSARLQINLKPLDERDDRVQKV
IARLQTAVDKVPGVDLFLQPTQDLTIDTQVSRTQYQFTLQATSLDALSTWVPQLMEKLQQLPQLSDVSSDWQDKGLVAYV
NVDRDSASRLGISMADVDNALYNAFGQRLISTIYTQANQYRVVLEHNTENTPGLAALDTIRLTSSDGGVVPLSSIAKIEQ
RFAPLSINHLDQFPVTTISFNVPDNYSLGDAVQAIMDTEKTLNLPVDITTQFQGSTLAFQSALGSTVWLIVAAVVAMYIV
LGILYESFIHPITILSTLPTAGVGALLALMIAGSELDVIAIIGIILLIGIVKKNAIMMIDFALAAEREQGMSPREAIYQA
CLLRFRPILMTTLAALLGALPLMLSTGVGAELRRPLGIGMVGGLIVSQVLTLFTTPVIYLLFDRLALWTKSRFARHEEEA
>B7L9U8 ~~~mdtB~~~Multidrug resistance protein MdtB~~~
MQVLPPSSTGGPSRLFIMRPVATTLLMVAILLAGIIGYRALPVSALPEVDYPTIQVVTLYPGASPDVMTSAVTAPLERQF
GQMSGLKQMSSQSSGGASVITLQFQLTLPLDVAEQEVQAAINAATNLLPSDLPNPPVYSKVNPADPPIMTLAVTSTAMPM
TQVEDMVETRVAQKISQISGVGLVTLSGGQRPAVRVKLNAQAIAALGLTSETVRTAITGANVNSAKGSLDGPSRAVTLSA
NDQMQSAEEYRQLIIAYQNGAPIRLGDVATVEQGAENSWLGAWANKEQAIVMNVQRQPGANIISTADSIRQMLPQLTESL
PKSVKVTVLSDRTTNIRASVDDTQFELMMAIALVVMIIYLFLRNIPATIIPGVAVPLSLIGTFAVMVFLDFSINNLTLMA
LTIATGFVVDDAIVVIENISRYIEKGEKPLAAALKGAGEIGFTIISLTFSLIAVLIPLLFMGDIVGRLFREFAITLAVAI
LISAVVSLTLTPMMCARMLSQESLRKQNRFSRASEKMFDRIIAAYGQGLAKVLNHPWLTLSVALSTLLLSVLLWVFIPKG
FFPVQDNGIIQGTLQAPQSSSFANMAQRQRQVADVILQDPAVQSLTSFVGVDGTNPSLNSARLQINLKPLEERDDRVQKV
IARLQTAVDKVPGVDLFLQPTQDLTIDTQVSRTQYQFTLQATSLDALSTWVPQLMEKLQQLPQLSDVSSDWQDKGLVAYV
NVDRDSASRLGISMADVDNALYNAFGQRLISTIYTQANQYRVVLEHNTENTPGLAALDTIRLTSSDGGVVPLSSIAKIEQ
RFAPLSINHLDQFPVTTISFNVPDNYSLGDAVQAIMDTEKTLNLPVDITTQFQGSTLAFQSALGSTVWLIVAAVVAMYIV
LGILYESFIHPITILSTLPTAGVGALLALMIAGSELDVIAIIGIILLIGIVKKNAIMMIDFALAAEREQGMSPRDAIYQA
CLLRFRPILMTTLAALLGALPLMLSTGVGAELRRPLGIGMVGGLIVSQVLTLFTTPVIYLLFDRLALWTKSRFARHEEEA
>Q8X7J4 ~~~mdtB~~~Multidrug resistance protein MdtB~~~COG0841
MQVLPPSSTGGPSRLFIMRPVATTLLMVAILLAGIIGYRALPVSALPEVDYPTIQVVTLYPGASPDVMTSAVTAPLERQF
GQMSGLKQMSSQSSGGASVITLQFQLTLPLDVAEQEVQAAINAATNLLPSDLPNPPVYSKVNPADPPIMTLAVTSTAMPM
TQVEDMVETRVAQKISQISGVGLVTLSGGQRPAVRVKLNAQAIAALGLTSETVRTAITGANVNSAKGSLDGPSRAVTLSA
NDQMQSAEEYRQLIIAYQNGAPIRLGDVATVEQGAENSWLGAWANKEQAIVMNVQRQPGANIISTADSIRQMLPQLTESL
PKSVKVTVLSDRTTNIRASVDDTQFELMMAIALVVMIIYLFLRNIPATIIPGVAVPLSLIGTFAVMVFLDFSINNLTLMA
LTIATGFVVDDAIVVIENISRYIEKGEKPLAAALKGAGEIGFTIISLTFSLIAVLIPLLFMGDIVGRLFREFAITLAVAI
LISAVVSLTLTPMMCARMLSQESLRKQNRFSRASEKMFDRIIAAYGRGLAKVLNHPWLTLSVALSTLLLSVLLWVFIPKG
FFPVQDNGIIQGTLQAPQSSSFANMAQRQRQVADVILQDPAVQSLTSFVGVDGTNPSLNSARLQINLKPLDERDDRVQKV
IARLQTAVDKVPGVDLFLQPTQDLTIDTQVSRTQYQFTLQATSLDALSTWVPQLMEKLQQLPQISDVSSDWQDKGLVAYV
NVDRDSASRLGISMADVDNALYNAFGQRLISTIYTQANQYRVVLEHNTENTPGLAALDTIRLTSSDGGVVPLSSIAKIEQ
RFAPLSINHLDQFPVTTISFNVPDNYSLGDAVQAIMDTEKTLNLPVDITTQFQGSTLAFQSALGSTVWLIVAAVVAMYIV
LGILYESFIHPITILSTLPTAGVGALLALMIAGSELDVIAIIGIILLIGIVKKNAIMMIDFALAAEREQGMSPRDAIYQA
CLLRFRPILMTTLAALLGALPLMLSTGVGAELRRPLGIGMVGGLIVSQVLTLFTTPVIYLLFDRLALWTKSRFARHEEEA
>B5YUD3 ~~~mdtB~~~Multidrug resistance protein MdtB~~~
MQVLPPSSTGGPSRLFIMRPVATTLLMVAILLAGIIGYRALPVSALPEVDYPTIQVVTLYPGASPDVMTSAVTAPLERQF
GQMSGLKQMSSQSSGGASVITLQFQLTLPLDVAEQEVQAAINAATNLLPSDLPNPPVYSKVNPADPPIMTLAVTSTAMPM
TQVEDMVETRVAQKISQISGVGLVTLSGGQRPAVRVKLNAQAIAALGLTSETVRTAITGANVNSAKGSLDGPSRAVTLSA
NDQMQSAEEYRQLIIAYQNGAPIRLGDVATVEQGAENSWLGAWANKEQAIVMNVQRQPGANIISTADSIRQMLPQLTESL
PKSVKVTVLSDRTTNIRASVDDTQFELMMAITLVVMIIYLFLRNIPATIIPGVAVPLSLIGTFAVMVFLDFSINNLTLMA
LTIATGFVVDDAIVVIENISRYIEKGEKPLAAALKGAGEIGFTIISLTFSLIAVLIPLLFMGDIVGRLFREFAITLAVAI
LISAVVSLTLTPMMCARMLSQESLRKQNRFSRASEKMFDRIIAAYGRGLAKVLNHPWLTLSVALSTLLLSVLLWVFIPKG
FFPVQDNGIIQGTLQAPQSSSFANMAQRQRQVADVILQDPAVQSLTSFVGVDGTNPSLNSARLQINLKPLDERDDRVQKV
IARLQTAVDKVPGVDLFLQPTQDLTIDTQVSRTQYQFTLQATSLDALSTWVPQLMEKLQQLPQISDVSSDWQDKGLVAYV
NVDRDSASRLGISMADVDNALYNAFGQRLISTIYTQANQYRVVLEHNTENTPGLAALDTIRLTSSDGGVVPLSSIAKIEQ
RFAPLSINHLDQFPVTTISFNVPDNYSLGDAVQAIMDTEKTLNLPVDITTQFQGSTLAFQSALGSTVWLIVAAVVAMYIV
LGILYESFIHPITILSTLPTAGVGALLALMIAGSELDVIAIIGIILLIGIVKKNAIMMIDFALAAEREQGMSPRDAIYQA
CLLRFRPILMTTLAALLGALPLMLSTGVGAELRRPLGIGMVGGLIVSQVLTLFTTPVIYLLFDRLALWTKSRFARHEEEA
>B7NQB1 ~~~mdtB~~~Multidrug resistance protein MdtB~~~
MQVLPPSSTGGPSRLFIMRPVATTLLMVAILLAGIIGYRALPVSALPEVDYPTIQVVTLYPGASPDVMTSAVTAPLERQF
GQMSGLKQMSSQSSGGASVITLQFQLTLPLDVAEQEVQAAINAATNLLPSDLPNPPVYSKVNPADPPIMTLAVTSTAMPM
TQVEDMVETRVAQKISQISGVGLVTLSGGQRPAVRVKLNAQAIAALGLTSETVRTAITGANVNSAKGSLDGPSRAVTLSA
NDQMQSAEEYRQLIIAYQNSAPIRLGDVATVEQGAENSWLGAWANKEQAIVMNVQRQPGANIISTADSIRQMLPQLTESL
PKSVKVTVLSDRTTNIRASVDDTQFELMMAIALVVMIIYLFLRNIPATIIPGVAVPLSLIGTFAVMVFLDFSINNLTLMA
LTIATGFVVDDAIVVIENISRYIEKGEKPLAAALKGAGEIGFTIISLTFSLIAVLIPLLFMGDIVGRLFREFAITLAVAI
LISAVVSLTLTPMMCARMLSQESLRKQNRFSRASEKMFDRIIAAYGRGLAKVLNHPWLTLSVALSTLLLSVLLWVFIPKG
FFPVQDNGIIQGTLQAPQSSSFTNMAQRQRQVADVILQDPAVQSLTSFVGVDGTNPSLNSARLQINLKPLDERDDRVQKV
IARLQTAVDKVPGVDLFLQPTQDLTIDTQVSRTQYQFTLQATSLDALSTWVPQLVEKLQQLPQLSDVSSDWQDKGLVAYV
NVDRDSASRLGISMADVDNALYNAFGQRLISTIYTQANQYRVVLEHNTENTPGLAALDTIRLTSSDGGVVPLSSIAKIEQ
RFAPLSINHLDQFPVTTISFNVPDNYSLGDAVQAIMDTEKTLNLPVDITTQFQGSTLAFQSALGSTVWLIVAAVVAMYIV
LGILYESFIHPITILSTLPTAGVGALLALMIAGSELDVIAIIGIILLIGIVKKNAIMMIDFALAAEREQGMSPRDAIYQA
CLLRFRPILMTTLAALLGALPLMLSTGVGAELRRPLGIGMVGGLIVSQVLTLFTTPVIYLLFDRLALWTKSRFARHEEEA
>B7MWY8 ~~~mdtB~~~Multidrug resistance protein MdtB~~~
MQVLPPSSTGGPSRLFIMRPVATTLLMVAILLAGIIGYRALPVSALPEVDYPTIQVVTLYPGASPDVMTSAVTAPLERQF
GQMSGLKQMSSQSSGGASVITLQFQLTLPLDVAEQEVQAAINAATNLLPSDLPNPPVYSKVNPADPPIMTLAVTSTAMPM
TQVEDMVETRVAQKISQISGVGLVTLSGGQRPAVRVKLNAQAIAALGLTSETVRTAITGANVNSAKGSLDGPSRAVTLSA
NDQMQSAEEYRQLIIAYQNGAPIRLGDVATVEQGAENSWLGAWANKEQAIVMNVQRQPGANIISTADSIRQMLPQLTESL
PKSVKVTVLSDRTTNIRASVDDTQFELMMAIALVVMIIYLFLRNIPATIIPGVAVPLSLIGTFAVMVFLDFSINNLTLMA
LTIATGFVVDDAIVVIENISRYIEKGEKPLAAALKGAGEIGFTIISLTFSLIAVLIPLLFMGDIVGRLFREFAITLAVAI
LISAVVSLTLTPMMCARMLSQESLRKQNRFSRASEKMFDRIIAAYGRGLAKVLNHPWLTLSVALSTLLLSVLLWVFIPKG
FFPVQDNGIIQGTLQAPQSSSFANMAQRQRQVADVILQDPAVQSLTSFVGVDGTNPSLNSARLQINLKPLDERDDRVQKV
IARLQTAVDKVPGVDLFLQPTQDLTIDTQVSRTQYQFTLQATSLDALSTWVPQLMEKLQQLPQLSDVSSDWQDKGLVAYV
NVDRDSASRLGISMADVDNALYNAFGQRLISTIYTQANQYRVVLEHNTENTPGLAALDTIRLTSSDGGVVPLSSIAKIEP
RFAPLSINHLDQFPVTTISFNVPDNYSLGDAVQAIMDTEKTLNLPVDITTQFQGSTLAFQSALGSTVWLIVAAVVAMYIV
LGILYESFIHPITILSTLPTAGVGALLALMIAGSELDVIAIIGIILLIGIVKKNAIMMIDFALAAEREQGMSPREAIYQA
CLLRFRPILMTTLAALLGALPLMLSTGVGAELRRPLGIGMVGGLIVSQVLTLFTTPVIYLLFDRLALWTKSRFARHEEEA
>B7M458 ~~~mdtB~~~Multidrug resistance protein MdtB~~~
MQVLPPSSTGGPSRLFIMRPVATTLLMVAILLAGIIGYRALPVSALPEVDYPTIQVVTLYPGASPDVMTSAVTAPLERQF
GQMSGLKQMSSQSSGGASVITLQFQLTLPLDVAEQEVQAAINAATNLLPSDLPNPPVYSKVNPADPPIMTLAVTSTAMPM
TQVEDMVETRVAQKISQISGVGLVTLSGGQRPAVRVKLNAQAIAALGLTSETVRTAITGANVNSAKGSLDGPSRAVTLSA
NDQMQSAEEYRQLIIAYQNGAPIRLGDVATVEQGAENSWLGAWANKEQAIVMNVQRQPGANIISTADSIRQMLPQLTESL
PKSVKVTVLSDRTTNIRASVNDTQFELMMAIALVVMIIYLFLRNIPATIIPGFAVPLSLIGTFAVMVFLDFSINNLTLMA
LTIATGFVVDDAIVVIENISRYIEKGEKPLAAALKGAGEIGFTIISLTFSLIAVLIPLLFMGDIVGRLFREFAITLAVAI
LISAVVSLTLTPMMCARMLSQESLRKQNRFSRASEKMFDRIIAAYGRGLAKVLNHPWLTLSVALSTLLLSVLLWVFIPKG
FFPVQDNGIIQGTLQAPQSSSFANMAQRQRQVADVILQDPAVQSLTSFVGVDGTNPSLNSARLQINLKPLDERDDRVQKV
IARLQTAVDKVPGVDLFLQPTQDLTIDTQVSRTQYQFTLQATSLDALSTWVPELMEKLQQLPQLSDVSSDWQDKGLVAYV
NVDRDSASRLGISMADVDNALYNAFGQRLISTIYTQANQYRVVLEHNTENTPGLAALDTIRLTSSDGGVVPLSSIAKIEQ
RFAPLSINHLDQFPVTTISFNVPDNYSLGDAVQAIMDTEKTLNLPVDITTQFQGSTLAFQSALGSTVWLIVAAVVAMYIV
LGILYESFIHPITILSTLPTAGVGALLALMIAGSELDVIAIIGIILLIGIVKKNAIMMIDFALAAEREQGMSPRDAIYQA
CLLRFRPILMTTLAALLGALPLMLSTGVGAELRRPLGIGMVGGLIVSQVLTLFTTPVIYLLFDRLALWTKSRFARHEEEA
>C4ZSG3 ~~~mdtB~~~Multidrug resistance protein MdtB~~~
MQVLPPSSTGGPSRLFIMRPVATTLLMVAILLAGIIGYRALPVSALPEVDYPTIQVVTLYPGASPDVMTSAVTAPLERQF
GQMSGLKQMSSQSSGGASVITLQFQLTLPLDVAEQEVQAAINAATNLLPSDLPNPPVYSKVNPADPPIMTLAVTSTAMPM
TQVEDMVETRVAQKISQISGVGLVTLSGGQRPAVRVKLNAQAIAALGLTSETVRTAITGANVNSAKGSLDGPSRAVTLSA
NDQMQSAEEYRQLIIAYQNGAPIRLGDVATVEQGAENSWLGAWANKEQAIVMNVQRQPGANIISTADSIRQMLPQLTESL
PKSVKVTVLSDRTTNIRASVDDTQFELMMAIALVVMIIYLFLRNIPATIIPGVAVPLSLIGTFAVMVFLDFSINNLTLMA
LTIATGFVVDDAIVVIENISRYIEKGEKPLAAALKGAGEIGFTIISLTFSLIAVLIPLLFMGDIVGRLFREFAITLAVAI
LISAVVSLTLTPMMCARMLSQESLRKQNRFSRASEKMFDRIIAAYGRGLAKVLNHPWLTLSVALSTLLLSVLLWVFIPKG
FFPVQDNGIIQGTLQAPQSSSFANMAQRQRQVADVILQDPAVQSLTSFVGVDGTNPSLNSARLQINLKPLDERDDRVQKV
IARLQTAVDKVPGVDLFLQPTQDLTIDTQVSRTQYQFTLQATSLDALSTWVPQLMEKLQQLPQLSDVSSDWQDKGLVAYV
NVDRDSASRLGISMADVDNALYNAFGQRLISTIYTQANQYRVVLEHNTENTPGLAALDTIRLTSSDGGVVPLSSIAKIEQ
RFAPLSINHLDQFPVTTISFNVPDNYSLGDAVQAIMDTEKTLNLPVDITTQFQGSTLAFQSALGSTVWLIVAAVVAMYIV
LGILYESFIHPITILSTLPTAGVGALLALLIAGSELDVIAIIGIILLIGIVKKNAIMMIDFALAAEREQGMSPREAIYQA
CLLRFRPILMTTLAALLGALPLMLSTGVGAELRRPLGIGMVGGLIVSQVLTLFTTPVIYLLFDRLALWTKSRFARHEEEA
>B1X7H1 ~~~mdtB~~~Multidrug resistance protein MdtB~~~
MQVLPPSSTGGPSRLFIMRPVATTLLMVAILLAGIIGYRALPVSALPEVDYPTIQVVTLYPGASPDVMTSAVTAPLERQF
GQMSGLKQMSSQSSGGASVITLQFQLTLPLDVAEQEVQAAINAATNLLPSDLPNPPVYSKVNPADPPIMTLAVTSTAMPM
TQVEDMVETRVAQKISQISGVGLVTLSGGQRPAVRVKLNAQAIAALGLTSETVRTAITGANVNSAKGSLDGPSRAVTLSA
NDQMQSAEEYRQLIIAYQNGAPIRLGDVATVEQGAENSWLGAWANKEQAIVMNVQRQPGANIISTADSIRQMLPQLTESL
PKSVKVTVLSDRTTNIRASVDDTQFELMMAIALVVMIIYLFLRNIPATIIPGVAVPLSLIGTFAVMVFLDFSINNLTLMA
LTIATGFVVDDAIVVIENISRYIEKGEKPLAAALKGAGEIGFTIISLTFSLIAVLIPLLFMGDIVGRLFREFAITLAVAI
LISAVVSLTLTPMMCARMLSQESLRKQNRFSRASEKMFDRIIAAYGRGLAKVLNHPWLTLSVALSTLLLSVLLWVFIPKG
FFPVQDNGIIQGTLQAPQSSSFANMAQRQRQVADVILQDPAVQSLTSFVGVDGTNPSLNSARLQINLKPLDERDDRVQKV
IARLQTAVDKVPGVDLFLQPTQDLTIDTQVSRTQYQFTLQATSLDALSTWVPQLMEKLQQLPQLSDVSSDWQDKGLVAYV
NVDRDSASRLGISMADVDNALYNAFGQRLISTIYTQANQYRVVLEHNTENTPGLAALDTIRLTSSDGGVVPLSSIAKIEQ
RFAPLSINHLDQFPVTTISFNVPDNYSLGDAVQAIMDTEKTLNLPVDITTQFQGSTLAFQSALGSTVWLIVAAVVAMYIV
LGILYESFIHPITILSTLPTAGVGALLALLIAGSELDVIAIIGIILLIGIVKKNAIMMIDFALAAEREQGMSPREAIYQA
CLLRFRPILMTTLAALLGALPLMLSTGVGAELRRPLGIGMVGGLIVSQVLTLFTTPVIYLLFDRLALWTKSRFARHEEEA
>A8A1U7 ~~~mdtB~~~Multidrug resistance protein MdtB~~~
MQVLPPSSTGGPSRLFIMRPVATTLLMVAILLAGIIGYRALPVSALPEVDYPTIQVVTLYPGASPDVMTSAVTAPLERQF
GQMSGLKQMSSQSSGGASVITLQFQLTLPLDVAEQEVQAAINAATNLLPSDLPNPPVYSKVNPADPPIMTLAVTSIAMPM
TQVEDMVETRVAQKISQISGVGLVTLSGGQRPAVRVKLNAQAIAALGLTSETVRTAITGANVNSAKGSLDGPSRAVTLSA
NDQMQSAEEYRQLIIAYQNGAPIRLGDVATVEQGAENSWLGAWANKEQAIVMNVQRQPGANIISTADSIRQMLPQLTESL
PKSVKVTVLSDRTTNIRASVDDTQFELMMAIALVVMIIYLFLRNIPATIIPGVAVPLSLIGTFAVMVFLDFSINNLTLMA
LTIATGFVVDDAIVVIENISRYIEKGEKPLAAALKGAGEIGFTIISLTFSLIAVLIPLLFMGDIVGRLFREFAITLAVAI
LISAVVSLTLTPMMCARMLSQESLRKQNRFSRASEKMFDRIIAAYGRGLAKVLNHPWLTLSVALSTLLLSVLLWVFIPKG
FFPVQDNGIIQGTLQAPQSSSFANMAQRQRQVADVILQDPAVQSLTSFVGVDGTNPSLNSARLQINLKPLDERDDRVQKV
IARLQTAVDKVPGVDLFLQPTQDLTIDTQVSRTQYQFTLQATSLDALSTWVPQLMEKLQQLPQLSDVSSDWQDKGLVAYV
NVDRDSASRLGISMADVDNALYNAFGQRLISTIYTQANQYRVVLEHNTENTPGLAALDTIRLTSSDGGVVPLSSIAKIEQ
RFAPLSINHLDQFPVTTISFNVPDNYSLGDAVQAIMDTEKTLNLPVDITTQFQGSTLAFQSALGSTVWLIVAAVVAMYIV
LGILYESFIHPITILSTLPTAGVGALLALLIAGSELDVIAIIGIILLIGIVKKNAIMMIDFALAAEREQGMSPREAIYQA
CLLRFRPILMTTLAALLGALPLMLSTGVGAELRRPLGIGMVGGLIVSQVLTLFTTPVIYLLFDRLALWTKSRFARHEEEA
>A1ACT0 ~~~mdtB~~~Multidrug resistance protein MdtB~~~
MQVLPPSSTGGPSRLFIMRPVATTLLMVAILLAGIIGYRALPVSALPEVDYPTIQVVTLYPGASPDVMTSAVTAPLERQF
GQMSGLKQMSSQSSGGASVITLQFQLTLPLDVAEQEVQAAINAATNLLPSDLPNPPVYSKVNPADPPIMTLAVTSTAMPM
TQVEDMVETRVAQKISQISGVGLVTLSGGQRPAVRVKLNAQAIAALGLTSETVRTAITGANVNSAKGSLDGPSRAVTLSA
NDQMQSAEEYRQLIIAYQNGAPIRLGDVATVEQGAENSWLGAWANKEQAIVMNVQRQPGANIISTADSIRQMLPQLTESL
PKSVKVTVLSDRTTNIRASVDDTQFELMMAIALVVMIIYLFLRNIPATIIPGVAVPLSLIGTFAVMVFLDFSINNLTLMA
LTIATGFVVDDAIVVIENISRYIEKGEKPLAAALKGAGEIGFTIISLTFSLIAVLIPLLFMGDIVGRLFREFAITLAVAI
LISAVVSLTLTPMMCARMLSQESLRKQNRFSRASEKMFDRIIAAYGRGLAKVLNHPWLTLSVALSTLLLSVLLWVFIPKG
FFPVQDNGIIQGTLQAPQSSSFANMAQRQRQVADVILQDPAVQSLTSFVGVDGTNPSLNSARLQINLKPLDERDDRVQKV
IARLQTAVDKVPGVDLFLQPTQDLTIDTQVSRTQYQFTLQATSLDALSTWVPQLMEKLQQLPQLSDVSSDWQDKGLVAYV
NVDRDSASRLGISMADVDNALYNAFGQRLISTIYTQANQYRVVLEHNTENTPGLAALDTIRLTSSDGGVVPLSSIAKIEQ
RFAPLSINHLDQFPVTTISFNVPDNYSLGDAVQAIMDTEKTLNLPVDITTQFQGSTLAFQSALGSTVWLIVAAVVAMYIV
LGILYESFIHPITILSTLPTAGVGALLALMIAGSELDVIAIIGIILLIGIVKKNAIMMIDFALAAEREQGMSPREAIYQA
CLLRFRPILMTTLAALLGALPLMLSTGVGAELRRPLGIGMVGGLIVSQVLTLFTTPVIYLLFDRLALWTKSRFARHEEEA
>Q0TG15 ~~~mdtB~~~Multidrug resistance protein MdtB~~~
MQVLPPSSTGGPSRLFIMRPVATTLLMVAILLAGIIGYRALPVSALPEVDYPTIQVVTLYPGASPDVMTSAVTAPLERQF
GQMSGLKQMSSQSSGGASVITLQFQLTLPLDVAEQEVQAAINAATNLLPSDLPNPPVYSKVNPADPPIMTLAVTSTAMPM
TQVEDMVETRVAQKISQISGVGLVTLSGGQRPAVRVKLNAQAIAALGLTSESVRTAITGANVNSAKGSLDGPSRAVTLSA
NDQMQSAEEYRQLIIAYQNGAPIRLGDVATVEQGAENSWLGAWANKEQAIVMNVQRQPGANIISTADSIRQMLPQLTESL
PKSVKVTVLSDRTTNIRASVDDTQFELMMAIALVVMIIYLFLRNIPATIIPGVAVPLSLIGTFAVMVFLDFSINNLTLMA
LTIATGFVVDDAIVVIENISRYIEKGEKPLAAALKGAGEIGFTIISLTFSLIAVLIPLLFMGDIVGRLFREFAITLAVAI
LISAVVSLTLTPMMCARMLSQESLRKQNRFSRASEKMFDRIIAAYGRGLAKVLNHPWLTLSVALSTLLLSVLLWVFIPKG
FFPVQDNGIIQGTLQAPQSSSFANMAQRQRQVADVILQDPAVQSLTSFVGVDGTNPSLNSARLQINLKPLDERDDRVQKV
IARLQTAVDKVPGVDLFLQPTQDLTIDTQVSRTQYQFTLQATSLDALSTWVPQLMEKLQQLPQLSDVSSDWQDKGLVAYV
NVDRDSASRLGISMADVDNALYNAFGQRLISTIYTQANQYRVVLEHNTENTPGLAALDTIRLTSSDGGVVPLSSIAKIEQ
RFAPLSINHLDQFPVTTISFNVPDNYSLGDAVQAIMDTEKTLNLPVDITTQFQGSTLAFQSALGSTVWLIVAAVVAMYIV
LGILYESFIHPITILSTLPTAGVGALLALMIAGSELDVIAIIGIILLIGIVKKNAIMMIDFALAAEREQGMSPREAIYQA
CLLRFRPILMTTLAALLGALPLMLSTGVGAELRRPLGIGMVGGLIVSQVLTLFTTPVIYLLFDRLALWTKSRFARHEEEA
>Q8FG04 ~~~mdtB~~~Multidrug resistance protein MdtB~~~COG0841
MQVLPPSSTGGPSRLFIMRPVATTLLMVAILLAGIIGYRALPVSALPEVDYPTIQVVTLYPGASPDVMTSAVTAPLERQF
GQMSGLKQMSSQSSGGASVITLQFQLTLPLDVAEQEVQAAINAATNLLPSDLPNPPVYSKVNPADPPIMTLAVTSTAMPM
TQVEDMVETRVAQKISQISGVGLVTLSGGQRPAVRVKLNAQAIAALGLTSETVRTAITGANVNSAKGSLDGPSRAVTLSA
NDQMQSAEEYRQLIIAYQNGAPIRLGDVATVEQGAENSWLGAWANKEQAIVMNVQRQPGANIISTADSIRQMLPQLTESL
PKSVKVTVLSDRTTNIRASVDDTQFELMMAIALVVMIIYLFLRNIPATIIPGVAVPLSLIGTFAVMVFLDFSINNLTLMA
LTIATGFVVDDAIVVIENISRYIEKGEKPLAAALKGAGEIGFTIISLTFSLIAVLIPLLFMGDIVGRLFREFAITLAVAI
LISAVVSLTLTPMMCARMLSQESLRKQNRFSRASEKMFDRIIAAYGRGLAKVLNHPWLTLSVALSTLLLSVLLWVFIPKG
FFPVQDNGIIQGTLQAPQSSSFANMAQRQRQVADVILQDPAVQSLTSFVGVDGTNPSLNSARLQINLKPLDERDDRVQKV
IARLQTAVDKVPGVDLFLQPTQDLTIDTQVSRTQYQFTLQATSLDALSTWVPQLMEKLQQLPQLSDVSSDWQDKGLVAYV
NVDRDSASRLGISMADVDNALYNAFGQRLISTIYTQANQYRVVLEHNTENTPGLAALDTIRLTSSDGGVVPLSSIAKIEQ
RFAPLSINHLDQFPVTTISFNVPDNYSLGDAVQAIMDTEKTLNLPVDITTQFQGSTLAFQSALGSTVWLIVAAVVAMYIV
LGILYESFIHPITILSTLPTAGVGALLALMIAGSELDVIAIIGIILLIGIVKKNAIMMIDFALAAEREQGMSPREAIYQA
CLLRFRPILMTTLAALLGALPLMLSTGVGAELRRPLGIGMVGGLIVSQVLTLFTTPVIYLLFDRLALWTKSRFARHEEEA
>B1IYZ9 ~~~mdtB~~~Multidrug resistance protein MdtB~~~
MQVLPPSSTGGPSRLFIMRPVATTLLMVAILLAGIIGYRALPVSALPEVDYPTIQVVTLYPGASPDVMTSAVTAPLERQF
GQMSGLKQMSSQSSGGASVITLQFQLTLPLDVAEQEVQAAINAATNLLPSDLPNPPVYSKVNPADPPIMTLAVTSTAMPM
TQVEDMVETRVAQKISQISGVGLVTLSGGQRPAVRVKLNAQAIAALGLTSETVRTAITGANVNSAKGSLDGPSRAVTLSA
NDQMQSAEEYRQLIIAYQNGAPIRLGDVATVEQGAENSWLGAWANKEQAIVMNVQRQPGANIISTADSIRQMLPQLTESL
PKSVKVTVLSDRTTNIRASVDDTQFELMMAIALVVMIIYLFLRNIPATIIPGVAVPLSLIGTFAVMVFLDFSINNLTLMA
LTIATGFVVDDAIVVIENISRYIEKGEKPLAAALKGAGEIGFTIISLTFSLIAVLIPLLFMGDIVGRLFREFAITLAVAI
LISAVVSLTLTPMMCARMLSQESLRKQNRFSRASEKMFDRIIAAYGRGLAKVLNHPWLTLSVALSTLLLSVLLWVFIPKG
FFPVQDNGIIQGTLQAPQSSSFANMAQRQRQVADVILQDPAVQSLTSFVGVDGTNPSLNSARLQINLKPLDERDDRVQKV
IARLQTAVDKVPGVDLFLQPTQDLTIDTQVSRTQYQFTLQATSLDALSTWVPQLMEKLQQLPQLSDVSSDWQDKGLVAYV
NVDRDSASRLGISMADVDNALYNAFGQRLISTIYTQANQYRVVLEHNTENTPGLAALDTIRLTSSDGGVVPLSSIAKIEQ
RFAPLSINHLDQFPVTTISFNVPDNYSLGDAVQAIMDTEKTLNLPVDITTQFQGSTLAFQSALGSTVWLIVAAVVAMYIV
LGILYESFIHPITILSTLPTAGVGALLALLIAGSELDVIAIIGIILLIGIVKKNAIMMIDFALAAEREQGMSPREAIYQA
CLLRFRPILMTTLAALLGALPLMLSTGVGAELRRPLGIGMVGGLIVSQVLTLFTTPVIYLLFDRLALWTKSRFARHEEEA
>P76398 ~~~mdtB~~~Multidrug resistance protein MdtB~~~COG0841
MQVLPPSSTGGPSRLFIMRPVATTLLMVAILLAGIIGYRALPVSALPEVDYPTIQVVTLYPGASPDVMTSAVTAPLERQF
GQMSGLKQMSSQSSGGASVITLQFQLTLPLDVAEQEVQAAINAATNLLPSDLPNPPVYSKVNPADPPIMTLAVTSTAMPM
TQVEDMVETRVAQKISQISGVGLVTLSGGQRPAVRVKLNAQAIAALGLTSETVRTAITGANVNSAKGSLDGPSRAVTLSA
NDQMQSAEEYRQLIIAYQNGAPIRLGDVATVEQGAENSWLGAWANKEQAIVMNVQRQPGANIISTADSIRQMLPQLTESL
PKSVKVTVLSDRTTNIRASVDDTQFELMMAIALVVMIIYLFLRNIPATIIPGVAVPLSLIGTFAVMVFLDFSINNLTLMA
LTIATGFVVDDAIVVIENISRYIEKGEKPLAAALKGAGEIGFTIISLTFSLIAVLIPLLFMGDIVGRLFREFAITLAVAI
LISAVVSLTLTPMMCARMLSQESLRKQNRFSRASEKMFDRIIAAYGRGLAKVLNHPWLTLSVALSTLLLSVLLWVFIPKG
FFPVQDNGIIQGTLQAPQSSSFANMAQRQRQVADVILQDPAVQSLTSFVGVDGTNPSLNSARLQINLKPLDERDDRVQKV
IARLQTAVDKVPGVDLFLQPTQDLTIDTQVSRTQYQFTLQATSLDALSTWVPQLMEKLQQLPQLSDVSSDWQDKGLVAYV
NVDRDSASRLGISMADVDNALYNAFGQRLISTIYTQANQYRVVLEHNTENTPGLAALDTIRLTSSDGGVVPLSSIAKIEQ
RFAPLSINHLDQFPVTTISFNVPDNYSLGDAVQAIMDTEKTLNLPVDITTQFQGSTLAFQSALGSTVWLIVAAVVAMYIV
LGILYESFIHPITILSTLPTAGVGALLALLIAGSELDVIAIIGIILLIGIVKKNAIMMIDFALAAEREQGMSPREAIYQA
CLLRFRPILMTTLAALLGALPLMLSTGVGAELRRPLGIGMVGGLIVSQVLTLFTTPVIYLLFDRLALWTKSRFARHEEEA
>B7NCB1 ~~~mdtB~~~Multidrug resistance protein MdtB~~~
MQVLPPSSTGGPSRLFIMRPVATTLLMVAILLAGIIGYRALPVSALPEVDYPTIQVVTLYPGASPDVMTSAVTAPLERQF
GQMSGLKQMSSQSSGGASVITLQFQLTLPLDVAEQEVQAAINAATNLLPSDLPNPPVYSKVNPADPPIMTLAVTSTAMPM
TQVEDMVETRVAQKISQISGVGLVTLSGGQRPAVRVKLNAQAIAALGLTSETVRTAITGANVNSAKGSLDGPSRAVTLSA
NDQMQSAEEYRQLIIAYQNGAPIRLGDVATVEQGAENSWLGAWANKEQAIVMNVQRQPGANIISTADSIRQMLPQLTESL
PKSVKVTVLSDRTTNIRASVNDTQFELMMAIALVVMIIYLFLRNIPATIIPGVAVPLSLIGTFAVMVFLDFSINNLTLMA
LTIATGFVVDDAIVVIENISRYIEKGEKPLAAALKGAGEIGFTIISLTFSLIAVLIPLLFMGDIVGRLFREFAITLAVAI
LISAVVSLTLTPMMCARMLSQESLRKQNRFSRASEKMFERIIAAYGRGLAKVLNHPWLTLSVALSTLLLSVLLWVFIPKG
FFPVQDNGIIQGTLQAPQSSSFTNMAQRQRQVADVILQDPAVQSLTSFVGVDGTNPSLNSARLQINLKPLDERDDRVQKV
IARLQTAVDKVPGVDLFLQPTQDLTIDTQVSRTQYQFTLQATSLDALSTWVPQLMEKLQQLPQLSDVSSDWQDKGLVAYV
NVDRDSASRLGINMADVDNALYNAFGQRLISTIYTQANQYRVVLEHNTENTPGLAALDTIRLTSSDGGVVPLSSIAKIEQ
RFAPLSINHLDQFPVTTISFNVPDNYSLGDAVQAIMDTEKTLNLPVDITTQFQGSTLAFQSALGSTVWLIVAAVVAMYIV
LGILYESFIHPITILSTLPTAGVGALLALLIAGSELDVIAIIGIILLIGIVKKNAIMMIDFALAAEREQGMSPRDAIYQA
CLLRFRPILMTTLAALLGALPLMLSTGVGAELRRPLGIGMVGGLIVSQVLTLFTTPVIYLLFDRLALWTKSRFARHEEEA
>B6HYS0 ~~~mdtB~~~Multidrug resistance protein MdtB~~~
MQVLPPSSTGGPSRLFIMRPVATTLLMVAILLAGIIGYRALPVSALPEVDYPTIQVVTLYPGASPDVMTSAVTAPLERQF
GQMSGLKQMSSQSSGGASVITLQFQLTLPLDVAEQEVQAAINAATNLLPSDLPNPPVYSKVNPADPPIMTLAVTSTAMPM
TQVEDMVETRVAQKISQISGVGLVTLSGGQRPAVRVKLNAQAIAALGLTSETVRTAITGANVNSAKGSLDGPSRAVTLSA
NDQMQSAEEYRQLIIAYQNGAPIRLGDVATVEQGAENSWLGAWANKEQAIVMNVQRQPGANIISTADSIRQMLPQLTESL
PKSVKVTVLSDRTTNIRASVDDTQFELMMAIALVVMIIYLFLRNIPATIIPGVAVPLSLIGTFAVMVFLDFSINNLTLMA
LTIATGFVVDDAIVVIENISRYIEKGEKPLAAALKGAGEIGFTIISLTFSLIAVLIPLLFMGDIVGRLFREFAITLAVAI
LISAVVSLTLTPMMCARMLSQESLRKQNRFSRASEKMFDRIIAAYGRGLAKVLNHPWLTLSVALSTLLLSVLLWVFIPKG
FFPVQDNGIIQGTLQAPQSSSFANMAQRQRQVADVILQDPAVQSLTSFVGVDGTNPSLNSARLQINLKPLDERDDRVQKV
IARLQTAVDKVPGVDLFLQPTQDLTIDTQVSRTQYQFTLQATSLDALSTWVPQLMEKLQQLPQLSDVSSDWQDKGLVAYV
NVDRDSASRLGISMADVDNALYNAFGQRLISTIYTQANQYRVVLEHNTEITPGLAALDTIRLTSSDGGVVPLSSIAKVEQ
RFAPLSINHLDQFPVTTISFNVPDNYSLGDAVQAIMDTEKTLNLPVDITTQFQGSTLAFQSALGSTVWLIVAAVVAMYIV
LGILYESFIHPITILSTLPTAGVGALLALMIAGSELDVIAIIGIILLIGIVKKNAIMMIDFALAAEREQGMSPRDAIYQA
CLLRFRPILMTTLAALLGALPLMLSTGVGAELRRPLGIGMVGGLIVSQVLTLFTTPVIYLLFDRLALWTKSRFARHEEEA
>B1LNW6 ~~~mdtB~~~Multidrug resistance protein MdtB~~~
MQVLPPSSTGGPSRLFIMRPVATTLLMVAILLAGIIGYRALPVSALPEVDYPTIQVVTLYPGASPDVMTSAVTAPLERQF
GQMSGLKQMSSQSSGGASVITLQFQLTLPLDVAEQEVQAAINAATNLLPSDLPNPPVYSKVNPADPPIMTLAVTSTAMPM
TQVEDMVETRVAQKISQISGVGLVTLSGGQRPAVRVKLNAQAIAALGLTSETVRTAITGANVNSAKGSLDGPSRAVTLSA
NDQMQSAEEYRQLIIAYQNGAPIRLGDVATVEQGAENSWLGAWANKEQAIVMNVQRQPGANIISTADSIRQMLPQLTESL
PKSVKVTVLSDRTTNIRASVDDTQFELMMAIALVVMIIYLFLRNIPATIIPGVAVPLSLIGTFAVMVFLDFSINNLTLMA
LTIATGFVVDDAIVVIENISRYIEKGEKPLAAALKGAGEIGFTIISLTFSLIAVLIPLLFMGDIVGRLFREFAITLAVAI
LISAVVSLTLTPMMCARMLSQESLRKQNRFSRASEKMFDRIIAAYGRGLAKVLNHPWLTLSVALSTLLLSVLLWVFIPKG
FFPVQDNGIIQGTLQAPQSSSFANMAQRQRQVADVILQDPAVQSLTSFVGVDGTNPSLNSARLQINLKPLDERDDRVQKV
IARLQTAVDKVPGVDLFLQPTQDLTIDTQVSRTQYQFTLQATSLDALSTWVPQLMEKLQQLPQLSDVSSDWQDKGLVAYV
NVDRDSASRLGISMADVDNALYNAFGQRLISTIYTQANQYRVVLEHNTENTPGLAALDTIRLTSSDGGVVPLSSIAKIEQ
RFAPLSINHLDQFPVTTISFNVPDNYSLGDAVQAIMDTEKTLNLPVDITTQFQGSTLAFQSALGSTVWLIVAAVVAMYIV
LGILYESFIHPITILSTLPTAGVGALLALMIAGSELDVIAIIGIILLIGIVKKNAIMMIDFALAAEREQGMSPREAIYQA
CLLRFRPILMTTLAALLGALPLMLSTGVGAELRRPLGIGMVGGLIVSQVLTLFTTPVIYLLFDRLALWTKSRFARHEEEA
>Q1R9Z6 ~~~mdtB~~~Multidrug resistance protein MdtB~~~
MQVLPPSSTGGPSRLFIMRPVATTLLMVAILLAGIIGYRALPVSALPEVDYPTIQVVTLYPGASPDVMTSAVTAPLERQF
GQMSGLKQMSSQSSGGASVITLQFQLTLPLDVAEQEVQAAINAATNLLPSDLPNPPVYSKVNPADPPIMTLAVTSTAMPM
TQVEDMVETRVAQKISQISGVGLVTLSGGQRPAVRVKLNAQAIAALGLTSETVRTAITGANVNSAKGSLDGPSRAVTLSA
NDQMQSAEEYRQLIIAYQNGAPIRLGDVATVEQGAENSWLGAWANKEQAIVMNVQRQPGANIISTADSIRQMLPQLTESL
PKSVKVTVLSDRTTNIRASVDDTQFELMMAIALVVMIIYLFLRNIPATIIPGVAVPLSLIGTFAVMVFLDFSINNLTLMA
LTIATGFVVDDAIVVIENISRYIEKGEKPLAAALKGAGEIGFTIISLTFSLIAVLIPLLFMGDIVGRLFREFAITLAVAI
LISAVVSLTLTPMMCARMLSQESLRKQNRFSRASEKMFDRIIAAYGRGLAKVLNHPWLTLSVALSTLLLSVLLWVFIPKG
FFPVQDNGIIQGTLQAPQSSSFANMAQRQRQVADVILQDPAVQSLTSFVGVDGTNPSLNSARLQINLKPLDERDDRVQKV
IARLQTAVDKVPGVDLFLQPTQDLTIDTQVSRTQYQFTLQATSLDALSTWVPQLMEKLQQLPQLSDVSSDWQDKGLVAYV
NVDRDSASRLGISMADVDNALYNAFGQRLISTIYTQANQYRVVLEHNTENTPGLAALDTIRLTSSDGGVVPLSSIAKIEQ
RFAPLSINHLDQFPVTTISFNVPDNYSLGDAVQAIMDTEKTLNLPVDITTQFQGSTLAFQSALGSTVWLIVAAVVAMYIV
LGILYESFIHPITILSTLPTAGVGALLALMIAGSELDVIAIIGIILLIGIVKKNAIMMIDFALAAEREQGMSPREAIYQA
CLLRFRPILMTTLAALLGALPLMLSTGVGAELRRPLGIGMVGGLIVSQVLTLFTTPVIYLLFDRLALWTKSRFARHEEEA
>B7LV39 ~~~mdtB~~~Multidrug resistance protein MdtB~~~
MQVLPPSSTGGPSRLFIMRPVATTLLMVAILLAGIIGYRALPVSALPEVDYPTIQVVTLYPGASPDVMTSAVTAPLERQF
GQMSGLKQMSSQSSGGASVITLQFQLTLPLDVAEQEVQAAINAATNLLPSDLPNPPVYSKVNPADPPIMTLAVTSTAMPM
TQVEDMVETRVAQKISQISGVGLVTLSGGQRPAVRVKLNAQAIAALGLTSETVRTAITGANVNSAKGSLDGPSRAVTLSA
NDQMQSAEEYRQLIIAYQNGAPIRLGDVATVEQGAENSWLGAWANKEQAIVMNVQRQPGANIISTADSIRQMLPQLTESL
PKSVKVTVLSDRTTNIRASVNDTQFELMMAIALVVMIIYLFLRNIPATIIPGVAVPLSLIGTFAVMVFLDFSINNLTLMA
LTIATGFVVDDAIVVIENISRYIEKGEKPLAAALKGAGEIGFTIISLTFSLIAVLIPLLFMGDIVGRLFREFAITLAVAI
LISAVVSLTLTPMMCARMLSQESLRKQNRFSRASEKMFDRIIAAYGRGLAKVLNHPWLTLSVALSTLLLSVLLWVFIPKG
FFPVQDNGIIQGTLQAPQSSSFANMAQRQRQVADVILQDPAVQSLTSFVGVDGTNPSLNSARLQINLKPLDERDDRVQKV
IARLQTAVDKVPGVDLFLQPTQDLTIDTQVSRTQYQFTLQATSLDALSTWVPQLVEKLQQLPQLSDVSSDWQDQGLVAYV
NVDRDSASRLGISMADVDNALYNAFGQRLISTIYTQANQYRVVLEHNTENTPGLAALDTIRLTSSDGGVVPLSSIAKIEQ
RFAPLSINHLDQFPVTTISFNMPDNYSLGDAVQAIMDTEKTLNLPVDITTQFQGSTLAFQSALGSTVWLIVAAVVAMYIV
LGILYESFIHPITILSTLPTAGVGALLALLIAGSELDVIAIIGIILLIGIVKKNAIMMIDFALAAEREQGMSPRDAIYQA
CLLRFRPILMTTLAALLGALPLMLSTGVGAELRRPLGIGMVGGLVVSQVLTLFTTPVIYLLFDRLALWTKSRFARHEEEA
>A7ZNP9 ~~~mdtC~~~Multidrug resistance protein MdtC~~~
MKFFALFIYRPVATILLSVAITLCGILGFRMLPVAPLPQVDFPVIMVSASLPGASPETMASSVATPLERSLGRIAGVSEM
TSSSSLGSTRIILQFDFDRDINGAARDVQAAINAAQSLLPSGMPSRPTYRKANPSDAPIMILTLTSDTYSQGELYDFAST
QLAPTISQIDGVGDVDVGGSSLPAVRVGLNPQALFNQGVSLDDVRTAISNANVRKPQGALEDGTHRWQIQTNDELKTAAE
YQPLIIHYNNGGAVRLGDVATVTDSVQDVRNAGMTNAKPAILLMIRKLPEANIIQTVDSIRAKLPELQETIPAAIDLQIA
QDRSPTIRASLEEVEQTLIISVALVILVVFLFLRSGRATIIPAVAVPVSLIGTFAAMYLCGFSLNNLSLMALTIATGFVV
DDAIVVLENIARHLEAGMKPLQAALQGTREVGFTVLSMSLSLVAVFLPLLLMGGLPGRLLREFAVTLSVAIVISLLVSLT
LTPMMCGWMLKASKPREQKRLRGFGRMLVALQQGYGKSLKWVLNHTRLVGVVLLGTIALNIWLYISIPKTFFPEQDTGVL
MGGIQADQSISFQAMRGKLQDFMKIIRDDPAVDNVTGFTGGSRVNSGMMFITLKPRDERSETAQQIIDRLRVKLAKEPGA
NLFLMAVQDIRVGGRQSNASYQYTLLSDDLAALREWEPKIRKKLATLPELADVNSDQQDNGAEMNLVYDRDTMARLGIDV
QAANSLLNNAFGQRQISTIYQPMNQYKVVMEVDPRYTQDISALEKMFVINNEGKAIPLSYFAKWQPANAPLSVNHQGLSA
ASTISFNLPTGKSLSDASAAIDRAMTQLGVPSTVRGSFAGTAQVFQETMNSQVILIIAAIATVYIVLGILYESYVHPLTI
LSTLPSAGVGALLALELFNAPFSLIALIGIMLLIGIVKKNAIMMVDFALEAQRHGNLTPQEAIFQACLLRFRPIMMTTLA
ALFGALPLVLSGGDGSELRQPLGITIVGGLVMSQLLTLYTTPVVYLFFDRLRLRFSRKPKQTVTE
>B7UTB4 ~~~mdtC~~~Multidrug resistance protein MdtC~~~
MKFFALFIYRPVATILLSVAITLCGILGFRMLPVAPLPQVDFPVIMVSASLPGASPETMASSVATPLERSLGRIAGVSEM
TSSSSLGSTRIILQFDFDRDINGAARDVQAAINAAQSLLPSGMPSRPTYRKANPSDAPIMILTLTSDTYSQGELYDFAST
QLAPTISQIDGVGDVDVGGSSLPAVRVGLNPQALFNQGVSLDDVRTAISNANVRKPQGALEDGTHRWQIQTNDELKTAAE
YQPLIIHYNNGGAVRLGDVATVTDSVQDVRNAGMTNAKPAILLMIRKLPEANIIQTVDSIRARLPELQSTIPAAIDLQIA
QDRSPTIRASLEEVEQTLIISVALVILVVFLFLRSGRATIIPAVAVPVSLIGTFAAMYLCGFSLNNLSLMALTIATGFVV
DDAIVVLENIARHLEAGMKPLQAALQGTREVGFTVLSMSLSLVAVFLPLLLMGGLPGRLLREFAVTLSVAIGISLLVSLT
LTPMMCGWMLKASKPREQKRLRGFGRMLVALQQGYGKSLKWVLNHTRLVGVVLLGTIALNIWLYISIPKTFFPEQDTGVL
MGGIQADQSISFQAMRGKLQDFMKIIRDDPAVDNVTGFTGGSRVNSGMMFITLKPRGERSETAQQIIDRLRKKLAKEPGA
NLFLMAVQDIRVGGRQANASYQYTLLSDDLAALREWEPKIRKKLATLPELADVNSDQEDNGAEMNLIYDRDTMARLGIDV
QAANSLLNNAFGQRQISTIYQPMNQYKVVMEVDPRYTQDISALEKMFVINNEGKAIPLSYFAKWQPANAPLSVNHQGLSA
ASTISFNLPTGKSLSDASAAIDRAMTQLGVPSTVRGSFAGTAQVFQETMNSQVILIIAAIATVYIVLGILYESYVHPLTI
LSTLPSAGVGALLALELFNAPFSLIALIGIMLLIGIVKKNAIMMVDFALEAQRHGNLTPQEAIFQACLLRFRPIMMTTLA
ALFGALPLVLSGGDGSELRQPLGITIVGGLVMSQLLTLYTTPVVYLFFDRLRLRFSRKPKQAVTE
>B7ME88 ~~~mdtC~~~Multidrug resistance protein MdtC~~~
MKFFALFIYRPVATILLSVAITLCGILGFRMLPVAPLPQVDFPVIMVSASLPGASPETMASSVATPLERSLGRIAGVSEM
TSSSSLGSTRIILQFDFDRDINGAARDVQAAINAAQSLLPSGMPSRPTYRKANPSDAPIMILTLTSDTYSQGELYDFAST
QLAPTISQIDGVGDVDVGGSSLPAVRVGLNPQALFNQGVSLDDVRTAISNANVRKPQGALEDGTHRWQIQTNDELKTAAE
YQPLIIHYNNGGAVRLGDVATVTDSVQDVRNAGMTNAKPAILLMIRKLPEANIIQTVDSIRARLPELQSTIPAAIDLQIA
QDRSPTIRASLEEVEQTLIISVALVILVVFLFLRSGRATIIPAVAVPVSLIGTFAAMYLCGFSLNNLSLMALTIATGFVV
DDAIVVLENIARHLEAGMKPLQAALQGTREVGFTVLSMSLSLVAVFLPLLLMGGLPGRLLREFAVTLSVAIGISLLVSLT
LTPMMCGWMLKASKPREQKRLRGFGRMLVALQQGYGKSLKWVLNHTRLVGVVLLGTIALNIWLYISIPKTFFPEQDTGVL
MGGIQADQSISFQAMRGKLQDFMKIIRDDPAVDNVTGFTGGSRVNSGMMFITLKPRGERSETAQQIIDRLRKKLAKEPGA
NLFLMAVQDIRVGGRQANASYQYTLLSDDLAALREWEPKIRKKLATLPELADVNSDQEDNGAEMNLIYDRDTMARLGIDV
QAANSLLNNAFGQRQISTIYQPMNQYKVVMEVDPRYTQDISALEKMFVINNEGKAIPLSYFAKWQPANAPLSVNHQGLSA
ASTISFNLPTGKSLSDASAAIDRAMTQLGVPSTVRGSFAGTAQVFQETMNSQVILIIAAIATVYIVLGILYESYVHPLTI
LSTLPSAGVGALLALQLFNAPFSLIALIGIMLLIGIVKKNAIMMVYFALEAQRHGNLTPQEAIFQACLLRFRPIMMTTLA
ALFGALPLVLSGGDGSELRQPLGITIVGGLVMSQLLTLYTTPVVYLFFDRLRLRFSRKPKQAVTE
>B7L9U9 ~~~mdtC~~~Multidrug resistance protein MdtC~~~
MKFFALFIYRPVATILLSVAITLCGILGFRMLPVAPLPQVDFPVIMVSASLPGASPETMASSVATPLERSLGRIAGVSEM
TSSSSLGSTRIILQFDFDRDINGAARDVQAAINAAQSLLPSGMPSRPTYRKANPSDAPIMILTLTSDTYSQGELYDFAST
QLAPTISQIDGVGDVDVGGSSLPAVRVGLNPQALFNQGVSLDDVRTAISNANVRKPQGALEDGTHRWQIQTNDELKTAAE
YQPLIIHYNNGGAVRLGDVATVTDSVQDVRNAGMTNAKPAILLMIRKLPEANIIQTVDSIRAKLPELQETIPAAIDLQIA
QDRSPTIRASLEEVEQTLIISVALVILVVFLFLRSGRATIIPAVAVPVSLIGTFAAMYLCGFSLNNLSLMALTIATGFVV
DDAIVVLENIARHLEAGMKPLQAALQGTREVGFTVLSMSLSLVAVFLPLLLMGGLPGRLLREFAVTLSVAIGISLLVSLT
LTPMMCGWMLKASKPREQKRLRGFGRMLVALQQGYGKSLKWVLNHTRLVGVVLLGTIALNIWLYISIPKTFFPEQDTGVL
MGGIQADQSISFQAMRGKLQDFMKIIRDDPAVDNVTGFTGGSRVNSGMMFITLKPRGERSETAQQIIDRLRVKLAKEPGA
NLFLMAVQDIRVGGRQSNASYQYTLLSDDLAALREWEPKIRKKLATLPELADVNSDQQDNGAEMNLVYDRDTMARLGIDV
QAANSLLNNAFGQRQISTIYQPMNQYKVVMEVDPRYTQDISALEKMFVINNEGKAIPLSYFAKWQPANAPLSVNHQGLSA
ASTISFNLPTGKSLSDASAAIDRAMTQLGVPSTVRGSFAGTAQVFQETMNSQVILIIAAIATVYIVLGILYESYVHPLTI
LSTLPSAGVGALLALELFNAPFSLIALIGIMLLIGIVKKNAIMMVDFALEAQRHGNLTPQEAIFQACLLRFRPIMMTTLA
ALFGALPLVLSGGDGSELRQPLGITIVGGLVMSQLLTLYTTPVVYLFFDRLRLRFSRKPKQTVTE
>Q7ACM1 ~~~mdtC~~~Multidrug resistance protein MdtC~~~COG0841
MKFFALFIYRPVATILLSVAITLCGILGFRMLPVAPLPQVDFPVIMVSASLPGASPETMASSVATPLERSLGRIAGVSEM
TSSSSLGSTRIILQFDFDRDINGAARDVQAAINAAQSLLPSGMPSRPTYRKANPSDAPIMILTLTSDTYSQGELYDFAST
QLAPTISQIDGVGDVDVGGSSLPAVRVGLNPQALFNQGVSLDDVRTAISNANVRKPQGALEDGTHRWQIQTNDELKTAAE
YQPLIIHYNNGGAVRLGDVATVTDSVQDVRNAGMTNAKPAILLMIRKLPEANIIQTVDSIRAKLPELQETIPAAIDLQIA
QDRSPTIRASLEEVEQTLIISVALVILVVFLFLRSGRATIIPAVAVPVSLIGTFAAMYLCGFSLNNLSLMALTIATGFVV
DDAIVVLENIARHLEAGMKPLQAALQGTREVGFTVLSMSLSLVAVFLPLLLMGGLPGRLLREFAVTLSVAIGISLLVSLT
LTPMMCGWMLKASKPREQKRLRGFGRMLVALQQGYGKSLKWVLNHTRLVGMVLLGTIALNIWLYISIPKTFFPEQDTGVL
MGGIQADQSISFQAMRGKLQDFMKIIRDDPAVDNVTGFTGGSRVNSGMMFITLKPRDERSETAQQIIDRLRVKLAKEPGA
NLFLMAVQDIRVGGRQSNASYQYTLLSDDLAALREWEPKIRKKLATLPELADVNSDQQDNGAEMNLVYDRDTMARLGIDV
QAANSLLNNAFGQRQISTIYQPMNQYKVVMEVDPRYTQDISALEKMFVINNEGKAIPLSYFAKWQPANAPLSVNHQGLSA
ASTISFNLPTGKSLSDASAAIDRAMTQLGVPSTVRGSFAGTAQVFQETMNSQVILIIAAIATVYIVLGILYESYVHPLTI
LSTLPSAGVGALLALELFNAPFSLIALIGIMLLIGIVKKNAIMMVDFALEAQRHGNLTPQEAIFQACLLRFRPIMMTTLA
ALFGALPLVLSGGDGSELRQPLGITIVGGLVMSQLLTLYTTPVVYLFFDRLRLRFSRKSKQTVTE
>B5YUD4 ~~~mdtC~~~Multidrug resistance protein MdtC~~~
MKFFALFIYRPVATILLSVAITLCGILGFRMLPVAPLPQVDFPVIMVSASLPGASPETMASSVATPLERSLGRIAGVSEM
TSSSSLGSTRIILQFDFDRDINGAARDVQAAINAAQSLLPSGMPSRPTYRKANPSDAPIMILTLTSDTYSQGELYDFAST
QLAPTISQIDGVGDVDVGGSSLPAVRVGLNPQALFNQGVSLDDVRTAISNANVRKPQGALEDGTHRWQIQTNDELKTAAE
YQPLIIHYNNGGAVRLGDVATVTDSVQDVRNAGMTNAKPAILLMIRKLPEANIIQTVDSIRAKLPELQETIPAAIDLQIA
QDRSPTIRASLEEVEQTLIISVALVILVVFLFLRSGRATIIPAVAVPVSLIGTFAAMYLCGFSLNNLSLMALTIATGFVV
DDAIVVLENIARHLEAGMKPLQAALQGTREVGFTVLSMSLSLVAVFLPLLLMGGLPGRLLREFAVTLSVAIGISLLVSLT
LTPMMCGWMLKASKPREQKRLRGFGRMLVALQQGYGKSLKWVLNHTRLVGMVLLGTIALNIWLYISIPKTFFPEQDTGVL
MGGIQADQSISFQAMRGKLQDFMKIIRDDPAVDNVTGFTGGSRVNSGMMFITLKPRDERSETAQQIIDRLRVKLAKEPGA
NLFLMAVQDIRVGGRQSNASYQYTLLSDDLAALREWEPKIRKKLATLPELADVNSDQQDNGAEMNLVYDRDTMARLGIDV
QAANSLLNNAFGQRQISTIYQPMNQYKVVMEVDPRYTQDISALEKMFVINNEGKAIPLSYFAKWQPANAPLSVNHQGLSA
ASTISFNLPTGKSLSDASAAIDRAMTQLGVPSTVRGSFAGTAQVFQETMNSQVILIIAAIATVYIVLGILYESYVHPLTI
LSTLPSAGVGALLALELFNAPFSLIALIGIMLLIGIVKKNAIMMVDFALEAQRHGNLTPQEAIFQACLLRFRPIMMTTLA
ALFGALPLVLSGGDGSELRQPLGITIVGGLVMSQLLTLYTTPVVYLFFDRLRLRFSRKSKQTVTE
>B7NQB0 ~~~mdtC~~~Multidrug resistance protein MdtC~~~
MKFFALFIYRPVATILLSVAITLCGILGFRMLPVAPLPQVDFPVIMVSASLPSASPETMASSVATPLERSLGRIAGVSEM
TSSSSLGSTRIILQFDFDRDINGAARDVQAAINAAQSLLPSGMPSRPTYRKANPSDAPIMILTLTSDTYSQGELYDFAST
QLAPTISQIDGVGDVDVGGSSLPAVRVGLNPQALFNQGVSLDDVRTAISNANVRKPQGALEDGTHRWQIQTNDELKTAAE
YQPLIIHYNNGGAVRLGDVATVTDSVQDVRNAGMTNAKPAILLMIRKLPEANIIQTVDSIRARLPELQSTIPAAIDLQIA
QDRSPTIRASLEEVEQTLIISVALVILVVFLFLRSGRATIIPAVAVPVSLIGTFAAMYLCGFSLNNLSLMALTIATGFVV
DDAIVVLENIARHLEAGMKPLQAALQGTREVGFTVLSMSLSLVAVFLPLLLMGGLPGRLLREFAVTLSVAIGISLLVSLT
LTPMMCGWMLKASKPREQKRLRGFGRVLVALQQGYGKSLKWVLNHTRLVGVVLLGTIALNIWLYISIPKTFFPEQDTGVL
MGGIQADQSISFQAMRGKLQDFMKIIRDDPAVDNVTGFTGGSRVNSGMMFITLKPRDERSETAQQIIDRLRVKLAKEPGA
NLFLMAVQDIRVGGRQANASYQYTLLSDDLPALREWEPKIRKKLATLPELADVNSDQQDNGAEMNLVYDRDTMARLGIDV
QAANSLLNNAFGQRQISTIYQPMNQYKVVMEVDPRYTQDISALEKMFVINNEGKAIPLSYFAKWQPANAPLSVNHQGLSA
ASTISFNLPTGKSLSDASAAIDRAMTQLGVPSTVRGSFAGTAQVFQETMNSQVILIIAAIATVYIVLGILYESYVHPLTI
LSTLPSAGVGALLALELFNAPFSLIALIGIMLLIGIVKKNAIMMVDFALEAQRHGNLTPQEAIFQACLLRFRPIMMTTLA
ALFGALPLVLSGGDGSELRQPLGITIVGGLVMSQLLTLYTTPVVYLFFDRLRLRFSRKPKQAVTE
>B7MWY9 ~~~mdtC~~~Multidrug resistance protein MdtC~~~
MKFFALFIYRPVATILLSVAITLCGILGFRMLPVAPLPQVDFPVIMVSASLPGASPETMASSVATPLERSLGRIAGVSEM
TSSSSLGSTRIILQFDFDRDINGAARDVQAAINAAQSLLPSGMPSRPTYRKANPSDAPIMILTLTSDTYSQGELYDFAST
QLAPTISQIDGVGDVDVGGSSLPAVRVGLNPQALFNQGVSLDDVRTAISNANVRKPQGALEDDTHRWQIQTNDELKTAAE
YQPLIIHYNNGGAVRLGDVATVTDSVQDVRNAGMTNAKPAILLMIRKLPEANIIQTVDSIRARLPELQSTIPAAIDLQIA
QDRSPTIRASLEEVEQTLVISVALVILVVFLFLRSGRATIIPAVAVPVSLIGTFAAMYLCGFSLNNLSLMALTIATGFVV
DDAIVVLENIARHLEAGMKPLQAALQGTREVGFTVLSMSLSLVAVFLPLLLMGGLPGRLLREFAVTLSVAIGISLLVSLT
LTPMMCGWMLKASKPREQKRLRGFGRMLVALQQGYGKSLKWVLNHTRLVGAVLLGTIALNIWLYISIPKTFFPEQDTGVL
MGGIQADQSISFQAMRGKLQDFMKIIRDEPAVDNVTGFTGGSRVNSGMMFITLKPRGERSETAQQIIDRLRKKLAKEPGA
NLFLMAVQDIRVGGRQANASYQYTLLSDDLAALREWEPKIRKKLATLPELADVNSDQEDNGAEMNLIYDRDTMARLGIDV
QAANSLLNNAFGQRQISTIYQPMNQYKVVMEVDPRYTQDISALEKMFVINNEGKAIPLSYFAKWQPANAPLSVNHQGLSA
ASTISFNLPTGKSLSDASAAIDRAMTQLGVPSTVRGSFAGTAQVFQETMNSQVILIIAAIATVYIVLGILYESYVHPLTI
LSTLPSAGVGALLALELFNAPFSLIALIGIMLLIGIVKKNAIMMVDFALEAQRHGNLTPQEAIFQACLLRFRPIMMTTLA
ALFGALPLVLSGGDGSELRQPLGITIVGGLVMSQLLTLYTTPVVYLFFDRLRLRFSRKPKQAVTE
>B7M459 ~~~mdtC~~~Multidrug resistance protein MdtC~~~
MKFFALFIYRPVATILLSVAITLCGILGFRMLPVAPLPQVDFPVIMVSASLPGASPETMASSVATPLERSLGRIAGVSEM
TSSSSLGSTRIILQFDFDRDINGAARDVQAAINAAQSLLPSGMPSRPTYRKANPSDAPIMILTLTSDTYSQGELYDFAST
QLAPTISQIDGVGDVDVGGSSLPAVRVGLNPQALFNQGVSLDDVRTAISNANVRKPQGALEDGTHRWQIQTNDELKTAAE
YQPLIIHYNNGGAVRLGDVATVTDSVQDVRNAGMTNAKPAILLMIRKLPEANIIQTVDSIRAKLPELQETIPAAIDLQIA
QDRSPTIRASLEEVEQTLIISVALVILVVFLFLRSGRATIIPAVVVPVSLIGTFAAMYLCGFSLNNLSLMALTIATGFVV
DDAIVVLENIARHLEAGMKPLQAALQGTREVGFTVLSMSLSLVAVFLPLLLMGGLPGRLLREFAVTLSVAIGISLLVSLT
LTPMMCGWMLKASKPREQKRLRGFGRMLVALQQGYGKSLKWVLNHTRLVGVVLLGTIALNIWLYISIPKTFFPEQDTGVL
MGGIQADQSISFQAMRGKLQDFMKIIRDDPAVDNVTGFTGGSRVNSGMMFITLKPRDERSETAQQIIDRLRVKLAKEPGA
NLFLMAVQDIRVGGRQSNASYQYTLLSDDLAALREWEPKIRKKLATLPELADVNSDQQDNGAEMNLVYDRDTMARLGIDV
QAANSLLNNAFGQRQISTIYQPMNQYKVVMEVDPRYTQDISALEKMFVINNEGKAIPLSYFAKWQPANAPLSVNHQGLSA
ASTISFNLPTGKSLSDASAAIDRAMTQLGVPSTVRGSFAGTAQVFQETMNSQVILIIAAIATVYIVLGILYESYVHPLTI
LSTLPSAGVGALLALELFNAPFSLIALIGIMLLIGIVKKNAIMMVDFALEAQRHGNLTPQEAIFQACLLRFRPIMMTTLA
ALFGALPLVLSGGDGSELRQPLGITIVGGLVMSQLLTLYTTPVVYLFFDRLRLRFSRKPKQTVTE
>C4ZSG4 ~~~mdtC~~~Multidrug resistance protein MdtC~~~
MKFFALFIYRPVATILLSVAITLCGILGFRMLPVAPLPQVDFPVIIVSASLPGASPETMASSVATPLERSLGRIAGVSEM
TSSSSLGSTRIILQFDFDRDINGAARDVQAAINAAQSLLPSGMPSRPTYRKANPSDAPIMILTLTSDTYSQGELYDFAST
QLAPTISQIDGVGDVDVGGSSLPAVRVGLNPQALFNQGVSLDDVRTAVSNANVRKPQGALEDGTHRWQIQTNDELKTAAE
YQPLIIHYNNGGAVRLGDVATVTDSVQDVRNAGMTNAKPAILLMIRKLPEANIIQTVDSIRAKLPELQETIPAAIDLQIA
QDRSPTIRASLEEVEQTLIISVALVILVVFLFLRSGRATIIPAVSVPVSLIGTFAAMYLCGFSLNNLSLMALTIATGFVV
DDAIVVLENIARHLEAGMKPLQAALQGTREVGFTVLSMSLSLVAVFLPLLLMGGLPGRLLREFAVTLSVAIGISLLVSLT
LTPMMCGWMLKASKPREQKRLRGFGRMLVALQQGYGKSLKWVLNHTRLVGVVLLGTIALNIWLYISIPKTFFPEQDTGVL
MGGIQADQSISFQAMRGKLQDFMKIIRDDPAVDNVTGFTGGSRVNSGMMFITLKPRDERSETAQQIIDRLRVKLAKEPGA
NLFLMAVQDIRVGGRQSNASYQYTLLSDDLAALREWEPKIRKKLATLPELADVNSDQQDNGAEMNLVYDRDTMARLGIDV
QAANSLLNNAFGQRQISTIYQPMNQYKVVMEVDPRYTQDISALEKMFVINNEGKAIPLSYFAKWQPANAPLSVNHQGLSA
ASTISFNLPTGKSLSDASAAIDRAMTQLGVPSTVRGSFAGTAQVFQETMNSQVILIIAAIATVYIVLGILYESYVHPLTI
LSTLPSAGVGALLALELFNAPFSLIALIGIMLLIGIVKKNAIMMVDFALEAQRHGNLTPQEAIFQACLLRFRPIMMTTLA
ALFGALPLVLSGGDGSELRQPLGITIVGGLVMSQLLTLYTTPVVYLFFDRLRLRFSRKPKQTVTE
>B1X7H2 ~~~mdtC~~~Multidrug resistance protein MdtC~~~
MKFFALFIYRPVATILLSVAITLCGILGFRMLPVAPLPQVDFPVIIVSASLPGASPETMASSVATPLERSLGRIAGVSEM
TSSSSLGSTRIILQFDFDRDINGAARDVQAAINAAQSLLPSGMPSRPTYRKANPSDAPIMILTLTSDTYSQGELYDFAST
QLAPTISQIDGVGDVDVGGSSLPAVRVGLNPQALFNQGVSLDDVRTAVSNANVRKPQGALEDGTHRWQIQTNDELKTAAE
YQPLIIHYNNGGAVRLGDVATVTDSVQDVRNAGMTNAKPAILLMIRKLPEANIIQTVDSIRAKLPELQETIPAAIDLQIA
QDRSPTIRASLEEVEQTLIISVALVILVVFLFLRSGRATIIPAVSVPVSLIGTFAAMYLCGFSLNNLSLMALTIATGFVV
DDAIVVLENIARHLEAGMKPLQAALQGTREVGFTVLSMSLSLVAVFLPLLLMGGLPGRLLREFAVTLSVAIGISLLVSLT
LTPMMCGWMLKASKPREQKRLRGFGRMLVALQQGYGKSLKWVLNHTRLVGVVLLGTIALNIWLYISIPKTFFPEQDTGVL
MGGIQADQSISFQAMRGKLQDFMKIIRDDPAVDNVTGFTGGSRVNSGMMFITLKPRDERSETAQQIIDRLRVKLAKEPGA
NLFLMAVQDIRVGGRQSNASYQYTLLSDDLAALREWEPKIRKKLATLPELADVNSDQQDNGAEMNLVYDRDTMARLGIDV
QAANSLLNNAFGQRQISTIYQPMNQYKVVMEVDPRYTQDISALEKMFVINNEGKAIPLSYFAKWQPANAPLSVNHQGLSA
ASTISFNLPTGKSLSDASAAIDRAMTQLGVPSTVRGSFAGTAQVFQETMNSQVILIIAAIATVYIVLGILYESYVHPLTI
LSTLPSAGVGALLALELFNAPFSLIALIGIMLLIGIVKKNAIMMVDFALEAQRHGNLTPQEAIFQACLLRFRPIMMTTLA
ALFGALPLVLSGGDGSELRQPLGITIVGGLVMSQLLTLYTTPVVYLFFDRLRLRFSRKPKQTVTE
>A8A1U8 ~~~mdtC~~~Multidrug resistance protein MdtC~~~
MKFFALFIYRPVATILLSVAITLCGILGFRMLPVAPLPQVDFPVIMVSASLPGASPETMASSVATPLERSLGRIAGVSEM
TSSSSLGSTRIILQFDFDRDINGAARDVQAAINAAQSLLPSGMPSRPTYRKANPSDAPIMILTLTSDTYSQGELYDFAST
QLAPTISQIDGVGDVDVGGSSLPAVRVGLNPQALFNQGVSLDDVRTAVSNANVRKPQGALEDGTHRWQIQTNDELKTAAE
YQPLIIHYNNGGAVRLGDVATVTDSVQDVRNAGMTNAKPAILLMIRKLPEANIIQTVDSIRAKLPELQETIPAAIDLQIA
QDRSPTIRASLEEVEQTLIISVALVILVVFLFLRSGRATIIPAVSVPVSLIGTFAAMYLCGFSLNNLSLMALTIATGFVV
DDAIVVLENIARHLEAGMKPLQAALQGTREVGFTVLSMSLSLVAVFLPLLLMGGLPGRLLREFTVTLSVAIGISLLVSLT
LTPMMCGWMLKASKPREQKRLRGFGRMLVALQQGYGKSLKWVLNHTRLVGVVLLGTIALNIWLYISIPKTFFPEQDTGVL
MGGIQADQSISFQAMRGKLQDFMKIIRDDPAVDNVTGFTGGSRVNSGMMFITLKPRDERSETAQQIIDRLRVKLAKEPGA
NLFLMAVQDIRVGGRQSNASYQYTLLSDDLAALREWEPKIRKKLATLPELADVNSDQQDNGAEMNLVYDRDTMARLGIDV
QAANSLLNNAFGQRQISTIYQPMNQYKVVMEVDPRYTQDISALEKMFVINNEGKAIPLSYFAKWQPANAPLSVNHQGLSA
ASTISFNLPTGKSLSDASAAIDRAMTQLGVPSTVRGSFAGTAQVFQETMNSQVILIIAAIATVYIVLGILYESYVHPLTI
LSTLPSAGVGALLALELFNAPFSLIALIGIMLLIGIVKKNAIMMVDFALEAQRHGNLTPQEAIFQACLLRFRPIMMTTLA
ALFGALPLVLSGGDGSELRHPLGITIVGGLVMSQLLTLYTTPVVYLFFDRLRLRFSRKPKQTVTE
>A1ACT1 ~~~mdtC~~~Multidrug resistance protein MdtC~~~
MKFFALFIYRPVATILLSVAITLCGILGFRMLPVAPLPQVDFPVIMVSASLPGASPETMASSVATPLERSLGRIAGVSEM
TSSSSLGSTRIILQFDFDRDINGAARDVQAAINAAQSLLPSGMPSRPTYRKANPSDAPIMILTLTSDTYSQGELYDFAST
QLAPTISQIDGVGDVDVGGSSLPAVRVGLNPQALFNQGVSLDDVRTAISNANVRKPQGALEDGTHRWQIQTNDELKTAAE
YQPLIIHYNNGGAVRLGDVATVTDSVQDVRNAGMTNAKPAILLMIRKLPEANIIQTVDSIRARLPELQSTIPAAIDLQIA
QDRSPTIRASLEEVEQTLIISVALVILVVFLFLRSGRATIIPAVAVPVSLIGTFAAMYLCGFSLNNLSLMALTIATGFVV
DDAIVVLENIARHLEAGMKPLQAALQGTREVGFTVLSMSLSLVAVFLPLLLMGGLPGRLLREFAVTLSVAIGISLLVSLT
LTPMMCGWMLKASKPREQKRLRGFGRMLVALQQGYGKSLKWVLNHTRLVGVVLLGTIALNIWLYISIPKTFFPEQDTGVL
MGGIQADQSISFQAMRGKLQDFMKIIRDDPAVDNVTGFTGGSRVNSGMMFITLKPRGERSETAQQIIDRLRKKLAKEPGA
NLFLMAVQDIRVGGRQANASYQYTLLSDDLAALREWEPKIRKKLATLPELADVNSDQEDNGAEMNLIYDRDTMARLGIDV
QAANSLLNNAFGQRQISTIYQPMNQYKVVMEVDPRYTQDISALEKMFVINNEGKAIPLSYFAKWQPANAPLSVNHQGLSA
ASTISFNLPTGKSLSDASAAIDRAMTQLGVPSTVRGSFAGTAQVFQETMNSQVILIIAAIATVYIVLGILYESYVHPLTI
LSTLPSAGVGALLALQLFNAPFSLIALIGIMLLIGIVKKNAIMMVYFALEAQRHGNLTPQEAIFQACLLRFRPIMMTTLA
ALFGALPLVLSGGDGSELRQPLGITIVGGLVMSQLLTLYTTPVVYLFFDRLRLRFSRKPKQAVTE
>Q0TG14 ~~~mdtC~~~Multidrug resistance protein MdtC~~~
MKFFALFIYRPVATILLSVAITLCGILGFRMLPVAPLPQVDFPVIMVSASLPGASPETMASSVATPLERSLGRIAGVSEM
TSSSSLGSTRIILQFDFDRDINGAARDVQAAINAAQSLLPSGMPSRPTYRKANPSDAPIMILTLTSDTYSQGELYDFAST
QLAPTISQIDGVGDVDVGGSSLPAVRVGLNPQALFNQGVSLDDVRTAISNANVRKPQGALEDDTHRWQIQTNDELKTAAE
YQPLIIHYNNGGAVRLGDVATVTDSVQDVRNAGMTNAKPAILLMIRKLPEANIIQTVDSIRARLPELQSTIPAAIDLQIA
QDRSPTIRASLEEVEQTLIISVALVILVVFLFLRSGRATIIPAVAVPVSLIGTFAAMYLCGFSLNNLSLMALTIATGFVV
DDAIVVLENIARHLEAGMKPLQAALQGTREVGFTVLSMSLSLVAVFLPLLLMGGLPGRLLREFAVTLSVAIGISLLVSLT
LTPMMCGWMLKASKPREQKRLRGFGRMLVALQQGYGKSLKWVLNHTRLVGAVLLGTIALNIWLYISIPKTFFPEQDTGVL
MGGIQADQSISFQAMRGKLQDFMKIIRDDPAVDNVTGFTGGSRVNSGMMFITLKPRGERSETAQQIIDRLRKKLAKEPGA
NLFLMAVQDIRVGGRQANASYQYTLLSDDLAALREWEPKIRKKLATLPELADVNSDQEDNGAEMNLIYDRDTMARLGIDV
QAANSLLNNAFGQRQISTIYQPMNQYKVVMEVDPRYTQDISALEKMFVINNEGKAIPLSYFAKWQPANAPLSVNHQGLSA
ASTISFNLPTGKSLSDASAAIDRAMTQLGVPSTVRGSFAGTAQVFQETMNSQVILIIAAIATVYIVLGILYESYVHPLTI
LSTLPSAGVGALLALELFNAPFSLIALIGIMLLIGIVKKNAIMMVDFALEAQRHGNLTPQEAIFQACLLRFRPIMMTTLA
ALFGALPLVLSGGDGSELRQPLGITIVGGLVMSQLLTLYTTPVVYLFFDRLRLRFSRKPKQAVTE
>Q8FG03 ~~~mdtC~~~Multidrug resistance protein MdtC~~~COG0841
MKFFALFIYRPVATILLSVAITLCGILGFRMLPVAPLPQVDFPVIMVSASLPGASPETMASSVATPLERSLGRIAGVSEM
TSSSSLGSTRIILQFDFDRDINGAARDVQAAINAAQSLLPSGMPSRPTYRKANPSDAPIMILTLTSDTYSQGELYDFAST
QLAPTISQIDGVGDVDVGGSSLPAVRVGLNPQALFNQGVSLDDVRTAISNANVRKPQGALEDGTHRWQIQTNDELKTAAE
YQPLIIHYNNGGAVRLGDVATVTDSVQDVRNAGMTNAKPAILLMIRKLPEANIIQTVDSIRARLPELQSTIPAAIDLQIA
QDRSPTIRASLEEVEQTLIISVALVILVVFLFLRSGRATIIPAVAVPVSLIGTFAAMYLCGFSLNNLSLMALTIATGFVV
DDAIVVLENIARHLEAGMKPLQAALQGTREVGFTVLSMSLSLVAVFLPLLLMGGLPGRLLREFAVTLSVAIGISLLVSLT
LTPMMCGWMLKASKPREQKRLRGFGRMLVALQQGYGKSLKWVLNHTRLVGAVLLGTIALNIWLYISIPKTFFPEQDTGVL
MGGIQADQSISFQAMRGKLQDFMKIIRDDPAVDNVTGFTGGSRVNSGMMFITLKPRGERSETAQQIIDRLRKKLAKEPGA
NLFLMAVQDIRVGGRQANASYQYTLLSDDLAALREWEPKIRKKLATLPELADVNSDQEDNGAEMNLIYDRDTMARLGIDV
QAANSLLNNAFGQRQISTIYQPMNQYKVVMEVDPRYTQDISALEKMFVINNEGKAIPLSYFAKWQPANAPLSVNHQGLSA
ASTISFNLPTGKSLSDASAAIDRAMTQLGVPSTVRGSFAGTAQVFQETMNSQVILIIAAIATVYIVLGILYESYVHPLTI
LSTLPSAGVGALLALELFNAPFSLIALIGIMLLIGIVKKNAIMMVDFALEAQRHGNLTPQEAIFQACLLRFRPIMMTTLA
ALFGALPLVLSGGDGSELRQPLGITIVGGLVMSQLLTLYTTPVVYLFFDRLRLRFSRKPKQAVTE
>B1IYZ8 ~~~mdtC~~~Multidrug resistance protein MdtC~~~
MKFFALFIYRPVATILLSVAITLCGILGFRMLPVAPLPQVDFPVIMVSASLPGASPETMASSVATPLERSLGRIAGVSEM
TSSSSLGSTRIILQFDFDRDINGAARDVQAAINAAQSLLPSGMPSRPTYRKANPSDAPIMILTLTSDTYSQGELYDFAST
QLAPTISQIDGVGDVDVGGSSLPAVRVGLNPQALFNQGVSLDDVRTAVSNANVRKPQGALEDGTHRWQIQTNDELKTAAE
YQPLIIHYNNGGAVRLGDVATVTDSVQDVRNAGMTNAKPAILLMIRKLPEANIIQTVDSIRAKLPELQETIPAAIDLQIA
QDRSPTIRASLEEVEQTLIISVALVILVVFLFLRSGRATIIPAVSVPVSLIGTFAAMYLCGFSLNNLSLMALTIATGFVV
DDAIVVLENIARHLEAGMKPLQAALQGTREVGFTVLSMSLSLVAVFLPLLLMGGLPGRLLREFAVTLSVAIGISLLVSLT
LTPMMCGWMLKASKPREQKRLRGFGRMLVALQQGYGKSLKWVLNHTRLVGVVLLGTIALNIWLYISIPKTFFPEQDTGVL
MGGIQADQSISFQAMRGKLQDFMKIIRDDPAVDNVTGFTGGSRVNSGMMFITLKPRDERSETAQQIIDRLRVKLAKEPGA
NLFLMAVQDIRVGGRQSNASYQYTLLSDDLAALREWEPKIRKKLATLPELADVNSDQQDNGAEMNLVYDRDTMARLGIDV
QAANSLLNNAFGQRQISTIYQPMNQYKVVMEVDPRYTQDISALEKMFVINNEGKAIPLSYFAKWQPANAPLSVNHQGLSA
ASTISFNLPTGKSLSDASAAIDRAMTQLGVPSTVRGSFAGTAQVFQETMNSQVILIIAAIATVYIVLGILYESYVHPLTI
LSTLPSAGVGALLALELFNAPFSLIALIGIMLLIGIVKKNAIMMVDFALEAQRHGNLTPQEAIFQACLLRFRPIMMTTLA
ALFGALPLVLSGGDGSELRQPLGITIVGGLVMSQLLTLYTTPVVYLFFDRLRLRFSRKPKQTVTE
>P76399 ~~~mdtC~~~Multidrug resistance protein MdtC~~~COG0841
MKFFALFIYRPVATILLSVAITLCGILGFRMLPVAPLPQVDFPVIIVSASLPGASPETMASSVATPLERSLGRIAGVSEM
TSSSSLGSTRIILQFDFDRDINGAARDVQAAINAAQSLLPSGMPSRPTYRKANPSDAPIMILTLTSDTYSQGELYDFAST
QLAPTISQIDGVGDVDVGGSSLPAVRVGLNPQALFNQGVSLDDVRTAVSNANVRKPQGALEDGTHRWQIQTNDELKTAAE
YQPLIIHYNNGGAVRLGDVATVTDSVQDVRNAGMTNAKPAILLMIRKLPEANIIQTVDSIRAKLPELQETIPAAIDLQIA
QDRSPTIRASLEEVEQTLIISVALVILVVFLFLRSGRATIIPAVSVPVSLIGTFAAMYLCGFSLNNLSLMALTIATGFVV
DDAIVVLENIARHLEAGMKPLQAALQGTREVGFTVLSMSLSLVAVFLPLLLMGGLPGRLLREFAVTLSVAIGISLLVSLT
LTPMMCGWMLKASKPREQKRLRGFGRMLVALQQGYGKSLKWVLNHTRLVGVVLLGTIALNIWLYISIPKTFFPEQDTGVL
MGGIQADQSISFQAMRGKLQDFMKIIRDDPAVDNVTGFTGGSRVNSGMMFITLKPRDERSETAQQIIDRLRVKLAKEPGA
NLFLMAVQDIRVGGRQSNASYQYTLLSDDLAALREWEPKIRKKLATLPELADVNSDQQDNGAEMNLVYDRDTMARLGIDV
QAANSLLNNAFGQRQISTIYQPMNQYKVVMEVDPRYTQDISALEKMFVINNEGKAIPLSYFAKWQPANAPLSVNHQGLSA
ASTISFNLPTGKSLSDASAAIDRAMTQLGVPSTVRGSFAGTAQVFQETMNSQVILIIAAIATVYIVLGILYESYVHPLTI
LSTLPSAGVGALLALELFNAPFSLIALIGIMLLIGIVKKNAIMMVDFALEAQRHGNLTPQEAIFQACLLRFRPIMMTTLA
ALFGALPLVLSGGDGSELRQPLGITIVGGLVMSQLLTLYTTPVVYLFFDRLRLRFSRKPKQTVTE
>B7NCB2 ~~~mdtC~~~Multidrug resistance protein MdtC~~~
MKFFALFIYRPVATILLSVAITLCGILGFRMLPVAPLPQVDFPVIMVSASLPGASPETMASSVATPLERSLGRIAGVSEM
TSSSSLGSTRIILQFDFDRDINGAARDVQAAINAAQSLLPSGMPSRPTYRKANPSDAPIMILTLTSDTYSQGELYDFAST
QLAPTISQIDGVGDVDVGGSSLPAVRVGLNPQALFNQGVSLDDVRTAISNANVRKPQGALEDGTHRWQIQTNDELKTAAE
YQPLIIHYNNGGAVRLGDVATVTDSVQDVRNAGMTNAKPAILLMIRKLPEANIIQTVDSIRAKLPELQETIPAAIDLQIA
QDRSPTIRASLEEVEQTLIISVALVILVVFLFLRSGRATIIPAVAVPVSLIGTFAAMYLCGFSLNNLSLMALTIATGFVV
DDAIVVLENIARHLEAGMKPLQAALQGTREVGFTVLSMSLSLVAVFLPLLLMGGLPGRLLREFAVTLSVAIGISLLVSLT
LTPMMCGWMLKASKPREQKRLRGFGRMLVALQQGYGKSLKWGLNHTRLVGVVLLGTIALNIWLYISIPKTFFPEQDTGVL
MGGIQADQSISFQAMRGKLQDFMKIIRDDPAVDNVTGFTGGSRVNSGMMFITLKPHDERSETAQQIIDRLRVKLAKEPGA
NLFLMAVQDIRVGGRQSNASYQYTLLSDDLAALREWEPKIRKKLATLPELADVNSDQQDNGAEMNLVYDRDTMARLGIDV
QAANSLLNNAFGQRQISTIYQPMNQYKVVMEVDPRYTQDISALEKMFVINNEGKAIPLSYFAKWQPANAPLSVNHQGLSA
ASTISFNLPTGKSLSDASAAIDRAMTQLGVPSTVRGSFAGTAQVFQETMNSQVILIIAAIATVYIVLGILYESYVHPLTI
LSTLPSAGVGALLALELFNAPFSLIALIGIMLLIGIVKKNAIMMVDFALEAQRHGNLTPQEAIFQACLLRFRPIMMTTLA
ALFGALPLVLSGGDGSELRQPLGITIVGGLVMSQLLTLYTTPVVYLFFDRLRLRFSRKPKQTVTE
>B6HYS1 ~~~mdtC~~~Multidrug resistance protein MdtC~~~
MKFFALFIYRPVATILLSVAITLCGILGFRMLPVAPLPQVDFPVIMVSASLPGASPETMASSVATPLERSLGRIAGVSEM
TSSSSLGSTRIILQFDFDRDINGAARDVQAAINAAQSLLPSGMPSRPTYRKANPSDAPIMILTLTSDTYSQGELYDFAST
QLAPTISQIDGVGDVDVGGSSLPAVRVGLNPQALFNQGVSLDDVRTAISNANVRKPQGALEDGTHRWQIQTNDELKTAAE
YQPLIIHYNNGGAVRLGDVATVTDSVQDVRNAGMTNAKPAILLMIRKLPEANIIQTVDSIRAKLPELQETIPAAIDLQIA
QDRSPTIRASLEEVEQTLIISVALVILVVFLFLRSGRATIIPAVVVPVSLIGTFAAMYLCGFSLNNLSLMALTIATGFVV
DDAIVVLENIARHLEAGMKPLQAALQGTREVGFTVLSMSLSLVAVFLPLLLMGGLPGRLLREFAVTLSVAIGISLLVSLT
LTPMMCGWMLKASKPREQKRLRGFGRMLVALQQGYGKSLKWVLNHTRLVGVVLLGTIALNIWLYISIPKTFFPEQDTGVL
MGGIQADQSISFQAMRGKLQDFMKIIRDDPAVDNVTGFTGGSRVNSGMMFITLKPRDERSETAQQIIDRLRVKLAKEPGA
NLFLMAVQDIRVGGRQSNASYQYTLLSDDLAALREWEPKIRKKLATLPELADVNSDQQDNGAEMNLVYDRDTMARLGIDV
QAANSLLNNAFGQRQISTIYQPMNQYKVVMEVDPRYTQDISALEKMFVINNEGKAIPLSYFAKWQPANAPLSVNHQGLSA
ASTISFNLPTGKSLSDASAAIDRAMTQLGVPSTVRGSFAGTAQVFQETMNSQVILIIAAIATVYIVLGILYESYVHPLTI
LSTLPSAGVGALLALELFNAPFSLIALIGIMLLIGIVKKNAIMMVDFALEAQRHGNLTPQEAIFQACLLRFRPIMMTTLA
ALFGALPLVLSGGDGSELRQPLGITIVGGLVMSQLLTLYTTPVVYLFFDRLRLRFSRKPKQTVTE
>B1LNW5 ~~~mdtC~~~Multidrug resistance protein MdtC~~~
MKFFALFIYRPVATILLSVAITLCGILGFRMLPVAPLPQVDFPVIMVSASLPGASPETMASSVATPLERSLGRIAGVSEM
TSSSSLGSTRIILQFDFDRDINGAARDVQAAINAAQSLLPSGMPSRPTYRKANPSDAPIMILTLTSDTYSQGELYDFAST
QLAPTISQIDGVGDVDVGGSSLPAVRVGLNPQALFNQGVSLDDVRTAISNANVRKPQGVLEDGTHRWQIQTNDELKTAAE
YQPLIIHYNNGGAVRLGDVATVTDSVQDVRNAGMTNAKPAILLMIRKLPEANIIQTVDSIRARLPELQSTIPAAIDLQIA
QDRSPTIRASLEEVEQTLIISVALVILVVFLFLRSGRATIIPAVAVPVSLIGTFAAMYLCGFSLNNLSLMALTIATGFVV
DDAIVVLENIARHLEAGMKPLQAALQGTREVGFTVLSMSLSLVAVFLPLLLMGGLPGRLLREFAVTLSVAIGISLLVSLT
LTPMMCGWMLKASKPREQKRLRGFGRMLVALQQGYGKSLKWVLNHTRLVGVVLLGTIALNIWLYISIPKTFFPEQDTGVL
MGGIQADQSISFQAMRGKLQDFMKIIRDDPAVDNVTGFTGGSRVNSGMMFITLKPRGERSETAQQIIDRLRVKLAKEPGA
NLFLMAVQDIRVGGRQSNASYQYTLLSDDLAALREWEPKIRKKLATLPELADVNSDQQDNGAEMNLVYDRDTMARLGIDV
QAANSLLNNAFGQRQISTIYQPMNQYKVVMEVDPRYTQDISALEKMFVINNEGKAIPLSYFAKWQPANAPLSVNHQGLSA
ASTISFNLPTGKSLSDASAAIDRAMTQLGVPSTVRGSFAGTAQVFQETMNSQVILIIAAIATVYIVLGILYESYVHPLTI
LSTLPSAGVGALLALELFNAPFSLIALIGIMLLIGIVKKNAIMMVDFALEAQRHGNLTPQEAIFQACLLRFRPIMMTTLA
ALFGALPLVLSGGDGSELRQPLGITIVGGLVMSQLLTLYTTPVVYLFFDRLRLRFSRKPKQTVTE
>Q1R9Z5 ~~~mdtC~~~Multidrug resistance protein MdtC~~~
MKFFALFIYRPVATILLSVAITLCGILGFRMLPVAPLPQVDFPVIMVSASLPGASPETMASSVATPLERSLGRIAGVSEM
TSSSSLGSTRIILQFDFDRDINGAARDVQAAINAAQSLLPSGMPSRPTYRKANPSDAPIMILTLTSDTYSQGELYDFAST
QLAPTISQIDGVGDVDVGGSSLPAVRVGLNPQALFNQGVSLDDVRTAISNANVRKPQGALEDGTHRWQIQTNDELKTAAE
YQPLIIHYNNGGAVRLGDVATVTDSVQDVRNAGMTNAKPAILLMIRKLPEANIIQTVDSIRARLPELQSTIPAAIDLQIA
QDRSPTIRASLEEVEQTLIISVALVILVVFLFLRSGRATIIPAVAVPVSLIGTFAAMYLCGFSLNNLSLMALTIATGFVV
DDAIVVLENIARHLEAGMKPLQAALQGTREVGFTVLSMSLSLVAVFLPLLLMGGLPGRLLREFAVTLSVAIGISLLVSLT
LTPMMCGWMLKASKPREQKRLRGFGRMLVALQQGYGKSLKWVLNHTRLVGVVLLGTIALNIWLYISIPKTFFPEQDTGVL
MGGIQADQSISFQAMRGKLQDFMKIIRDDPAVDNVTGFTGGSRVNSGMMFITLKPRGERSETAQQIIDRLRKKLAKEPGA
NLFLMAVQDIRVGGRQANASYQYTLLSDDLAALREWEPKIRKKLATLPELADVNSDQEDNGAEMNLIYDRDTMARLGIDV
QAANSLLNNAFGQRQISTIYQPMNQYKVVMEVDPRYTQDISALEKMFVINNEGKAIPLSYFAKWQPANAPLSVNHQGLSA
ASTISFNLPTGKSLSDASAAIDRAMTQLGVPSTVRGSFAGTAQVFQETMNSQVILIIAAIATVYIVLGILYESYVHPLTI
LSTLPSAGVGALLALQLFNAPFSLIALIGIMLLIGIVKKNAIMMVDFALEAQRHGNLTPQEAIFQACLLRFRPIMMTTLA
ALFGALPLVLSGGDGSELRQPLGITIVGGLVMSQLLTLYTTPVVYLFFDRLRLRFSRKPKQAVTE
>B7LV40 ~~~mdtC~~~Multidrug resistance protein MdtC~~~
MKFFALFIYRPVATILLSVAITLCGILGFRMLPVAPLPQVDFPVIMVSASLPGASPETMASSVATPLERSLGRIAGVSEM
TSSSSLGSTRIILQFDFDRDINGAARDVQAAINAAQSLLPSGMPSRPTYRKANPSDAPIMILTLTSDTYSQGELYDFAST
QLAPTISQIDGVGDVDVGGSSLPAVRVGLNPQALFNQGVSLDDVRTAISNANVRKPQGALEDGTHRWQIQTNDELKTAAE
YQPLIIHYNNGGAVRLGDVATVTDSVQDVRNAGMTNAKPAILLMIRKLPEANIIQTVDSIRAKLPELQETIPAAIDLQIA
QDRSPTIRASLEEVEQTLIISVALVILVVFLFLRSGRATIIPAVAVPVSLIGTFAAMYLCGFSLNNLSLMALTIATGFVV
DDAIVVLENIARHLEAGMKPLQAALQGTREVGFTVLSMSLSLVAVFLPLLLMGGLPGRLLREFAVTLSVAIGISLLVSLT
LTPMMCGWMLKASKPREQKRLRGFGRMLVALQQGYGKSLKWVLNHTRLVGVVLLGTIALNIWLYISIPKTFFPEQDTGVL
MGGIQADQSISFQAMRGKLQDFMKIIRDDPAVDNVTGFTGGSRVNSGMMFITLKPRDERSETAQQIIDRLRVKLAKEPGA
NLFLMAVQDIRVGGRQSNASYQYTLLSDDLAALREWEPKIRKKLATLPELADVNSDQQDNGAEMNLVYDRDTMARLGIDV
QAANSLLNNAFGQRQISTIYQPMNQYKVVMEVDPRYTQDISALEKMFVINNEGKAIPLSYFAKWQPANAPLSVNHQGLSA
ASTISFNLPTGKSLSDASAAIDRAMTQLGVPSTVRGSFAGTAQVFQETMNSQVILIIAAIATVYIVLGILYESYVHPLTI
LSTLPSAGVGALLALELFNAPFSLIALIGIMLLIGIVKKNAIMMVDFALEAQRHGNLTPQEAIFQACLLRFRPIMMTTLA
ALFGALPLVLSGGDGSELRQPLGITIVGGLVMSQLLTLYTTPVVYLFFDRLRLRFSRKPKQTVTE
>P36554 ~~~mdtD~~~Putative multidrug resistance protein MdtD~~~COG2814
MTDLPDSTRWQLWIVAFGFFMQSLDTTIVNTALPSMAQSLGESPLHMHMVIVSYVLTVAVMLPASGWLADKVGVRNIFFT
AIVLFTLGSLFCALSGTLNELLLARALQGVGGAMMVPVGRLTVMKIVPREQYMAAMTFVTLPGQVGPLLGPALGGLLVEY
ASWHWIFLINIPVGIIGAIATLLLMPNYTMQTRRFDLSGFLLLAVGMAVLTLALDGSKGTGLSPLTIAGLVAVGVVALVL
YLLHARNNNRALFSLKLFRTRTFSLGLAGSFAGRIGSGMLPFMTPVFLQIGLGFSPFHAGLMMIPMVLGSMGMKRIVVQV
VNRFGYRRVLVATTLGLSLVTLLFMTTALLGWYYVLPFVLFLQGMVNSTRFSSMNTLTLKDLPDNLASSGNSLLSMIMQL
SMSIGVTIAGLLLGLFGSQHVSVDSGTTQTVFMYTWLSMALIIALPAFIFARVPNDTHQNVAISRRKRSAQ
>P37636 ~~~mdtE~~~Multidrug resistance protein MdtE~~~COG0845
MNRRRKLLIPLLFCGAMLTACDDKSAENAAAMTPEVGVVTLSPGSVNVLSELPGRTVPYEVAEIRPQVGGIIIKRNFIEG
DKVNQGDSLYQIDPAPLQAELNSAKGSLAKALSTASNARITFNRQASLLKTNYVSRQDYDTARTQLNEAEANVTVAKAAV
EQATINLQYANVTSPITGVSGKSSVTVGALVTANQADSLVTVQRLDPIYVDLTQSVQDFLRMKEEVASGQIKQVQGSTPV
QLNLENGKRYSQTGTLKFSDPTVDETTGSVTLRAIFPNPNGDLLPGMYVTALVDEGSRQNVLLVPQEGVTHNAQGKATAL
ILDKDDVVQLREIEASKAIGDQWVVTSGLQAGDRVIVSGLQRIRPGIKARAISSSQENASTESKQ
>P37637 ~~~mdtF~~~Multidrug resistance protein MdtF~~~COG0841
MANYFIDRPVFAWVLAIIMMLAGGLAIMNLPVAQYPQIAPPTITVSATYPGADAQTVEDSVTQVIEQNMNGLDGLMYMSS
TSDAAGNASITLTFETGTSPDIAQVQVQNKLQLAMPSLPEAVQQQGISVDKSSSNILMVAAFISDNGSLNQYDIADYVAS
NIKDPLSRTAGVGSVQLFGSEYAMRIWLDPQKLNKYNLVPSDVISQIKVQNNQISGGQLGGMPQAADQQLNASIIVQTRL
QTPEEFGKILLKVQQDGSQVLLRDVARVELGAEDYSTVARYNGKPAAGIAIKLAAGANALDTSRAVKEELNRLSAYFPAS
LKTVYPYDTTPFIEISIQEVFKTLVEAIILVFLVMYLFLQNFRATIIPTIAVPVVILGTFAILSAVGFTINTLTMFGMVL
AIGLLVDDAIVVVENVERVIAEDKLPPKEATHKSMGQIQRALVGIAVVLSAVFMPMAFMSGATGEIYRQFSITLISSMLL
SVFVAMSLTPALCATILKAAPEGGHKPNALFARFNTLFEKSTQHYTDSTRSLLRCTGRYMVVYLLICAGMAVLFLRTPTS
FLPEEDQGVFMTTAQLPSGATMVNTTKVLQQVTDYYLTKEKDNVQSVFTVGGFGFSGQGQNNGLAFISLKPWSERVGEEN
SVTAIIQRAMIALSSINKAVVFPFNLPAVAELGTASGFDMELLDNGNLGHEKLTQARNELLSLAAQSPNQVTGVRPNGLE
DTPMFKVNVNAAKAEAMGVALSDINQTISTAFGSSYVNDFLNQGRVKKVYVQAGTPFRMLPDNINQWYVRNASGTMAPLS
AYSSTEWTYGSPRLERYNGIPSMEILGEAAAGKSTGDAMKFMADLVAKLPAGVGYSWTGLSYQEALSSNQAPALYAISLV
VVFLALAALYESWSIPFSVMLVVPLGVVGALLATDLRGLSNDVYFQVGLLTTIGLSAKNAILIVEFAVEMMQKEGKTPIE
AIIEAARMRLRPILMTSLAFILGVLPLVISHGAGSGAQNAVGTGVMGGMFAATVLAIYFVPVFFVVVEHLFARFKKA
>P69367 ~~~mdtH~~~Multidrug resistance protein MdtH~~~COG0477
MSRVSQARNLGKYFLLIDNMLVVLGFFVVFPLISIRFVDQMGWAAVMVGIALGLRQFIQQGLGIFGGAIADRFGAKPMIV
TGMLMRAAGFATMGIAHEPWLLWFSCLLSGLGGTLFDPPRSALVVKLIRPQQRGRFFSLLMMQDSAGAVIGALLGSWLLQ
YDFRLVCATGAVLFVLCAAFNAWLLPAWKLSTVRTPVREGMTRVMRDKRFVTYVLTLAGYYMLAVQVMLMLPIMVNDVAG
APSAVKWMYAIEACLSLTLLYPIARWSEKHFRLEHRLMAGLLIMSLSMMPVGMVSGLQQLFTLICLFYIGSIIAEPARET
LSASLADARARGSYMGFSRLGLAIGGAIGYIGGGWLFDLGKSAHQPELPWMMLGIIGIFTFLALGWQFSQKRAARRLLER
DA
>P69210 ~~~mdtI~~~Spermidine export protein MdtI~~~COG2076
MAQFEWVHAAWLALAIVLEIVANVFLKFSDGFRRKIFGLLSLAAVLAAFSALSQAVKGIDLSVAYALWGGFGIAATLAAG
WILFGQRLNRKGWIGLVLLLAGMIMVKLA
>P69212 ~~~mdtJ~~~Spermidine export protein MdtJ~~~COG2076
MYIYWILLGLAIATEITGTLSMKWASVSEGNGGFILMLVMISLSYIFLSFAVKKIALGVAYALWEGIGILFITLFSVLLF
DESLSLMKIAGLTTLVAGIVLIKSGTRKARKPELEVNHGAV
>P37340 ~~~mdtK~~~Multidrug resistance protein MdtK~~~COG0534
MQKYISEARLLLALAIPVILAQIAQTAMGFVDTVMAGGYSATDMAAVAIGTSIWLPAILFGHGLLLALTPVIAQLNGSGR
RERIAHQVRQGFWLAGFVSVLIMLVLWNAGYIIRSMENIDPALADKAVGYLRALLWGAPGYLFFQVARNQCEGLAKTKPG
MVMGFIGLLVNIPVNYIFIYGHFGMPELGGVGCGVATAAVYWVMFLAMVSYIKRARSMRDIRNEKGTAKPDPAVMKRLIQ
LGLPIALALFFEVTLFAVVALLVSPLGIVDVAGHQIALNFSSLMFVLPMSLAAAVTIRVGYRLGQGSTLDAQTAARTGLM
VGVCMATLTAIFTVSLREQIALLYNDNPEVVTLAAHLMLLAAVYQISDSIQVIGSGILRGYKDTRSIFYITFTAYWVLGL
PSGYILALTDLVVEPMGPAGFWIGFIIGLTSAAIMMMLRMRFLQRLPSAIILQRASR
>P31462 ~~~mdtL~~~Multidrug resistance protein MdtL~~~COG2814
MSRFLICSFALVLLYPAGIDMYLVGLPRIAADLNASEAQLHIAFSVYLAGMAAAMLFAGKVADRSGRKPVAIPGAALFII
ASVFCSLAETSTLFLAGRFLQGLGAGCCYVVAFAILRDTLDDRRRAKVLSLLNGITCIIPVLAPVLGHLIMLKFPWQSLF
WAMAMMGIAVLMLSLFILKETRPAAPAASDKPRENSESLLNRFFLSRVVITTLSVSVILTFVNTSPVLLMEIMGFERGEY
ATIMALTAGVSMTVSFSTPFALGIFKPRTLMITSQVLFLAAGITLAVSPSHAVSLFGITLICAGFSVGFGVAMSQALGPF
SLRAGVASSTLGIAQVCGSSLWIWLAAVVGIGAWNMLIGILIACSIVSLLLIMFVAPGRPVAAHEEIHHHA
>P39386 ~~~mdtM~~~Multidrug resistance protein MdtM~~~COG2814
MPRFFTRHAATLFFPMALILYDFAAYLSTDLIQPGIINVVRDFNADVSLAPAAVSLYLAGGMALQWLLGPLSDRIGRRPV
LITGALIFTLACAATMFTTSMTQFLIARAIQGTSICFIATVGYVTVQEAFGQTKGIKLMAIITSIVLIAPIIGPLSGAAL
MHFMHWKVLFAIIAVMGFISFVGLLLAMPETVKRGAVPFSAKSVLRDFRNVFCNRLFLFGAATISLSYIPMMSWVAVSPV
ILIDAGSLTTSQFAWTQVPVFGAVIVANAIVARFVKDPTEPRFIWRAVPIQLVGLSLLIVGNLLSPHVWLWSVLGTSLYA
FGIGLIFPTLFRFTLFSNKLPKGTVSASLNMVILMVMSVSVEIGRWLWFNGGRLPFHLLAVVAGVIVVFTLAGLLNRVRQ
HQAAELVEEQ
>Q8XFG0 ~~~mdtM~~~Multidrug resistance protein MdtM~~~COG2814
MQRIIQFFSQRATTLFFPMALILYDFAAYLTTDLIQPGIINVVRDFNADVSLAPASVSLYLAGGMALQWLLGPLSDRIGR
RPVLIAGALIFTLACAATLLTTSMTQFLVARFVQGTSICFIATVGYVTVQEAFGQTKAIKLMAIITSIVLVAPVIGPLSG
AALMHFVHWKVLFGIIAVMGLLALCGLLLAMPETVQRGAVPFSAVSVLRDFRNVFRNPIFLTGAATLSLSYIPMMSWVAV
SPVILIDAGGMSTSQFAWAQVPVFGAVIVANMIVVRLVKDPTRPRFIWRAVPIQLSGLATLLLGNLLLPHVWLWSVLGTS
LYAFGIGMIFPTLFRFTLFSNNLPKGTVSASLNMVILTVMAVSVEVGRWLWFHGGRLPFHLLAAVAGVIVVFTLATLLQR
VRQHEAAELAAEK
>P32716 ~~~mdtN~~~Multidrug resistance protein MdtN~~~COG1566
MESTPKKAPRSKFPALLVVALALVALVFVIWRVDSAPSTNDAYASADTIDVVPEVSGRIVELAVTDNQAVKQGDLLFRID
PRPYEANLAKAEASLAALDKQIMLTQRSVDAQQFGADSVNATVEKARAAAKQATDTLRRTEPLLKEGFVSAEDVDRARTA
QRAAEADLNAVLLQAQSAASAVSGVDALVAQRAAVEADIALTKLHLEMATVRAPFDGRVISLKTSVGQFASAMRPIFTLI
DTRHWYVIANFRETDLKNIRSGTPATIRLMSDSGKTFEGKVDSIGYGVLPDDGGLVLGGLPKVSRSINWVRVAQRFPVKI
MVDKPDPEMFRIGASAVANLEPQ
>P32715 ~~~mdtO~~~Multidrug resistance protein MdtO~~~COG1289
MSALNSLPLPVVRLLAFFHEELSERRPGRVPQTVQLWVGCLLVILISMTFEIPFVALSLAVLFYGIQSNAFYTKFVAILF
VVATVLEIGSLFLIYKWSYGEPLIRLIIAGPILMGCMFLMRTHRLGLVFFAVAIVAIYGQTFPAMLDYPEVVVRLTLWCI
VVGLYPTLLMTLIGVLWFPSRAISQMHQALNDRLDDAISHLTDSLAPLPETRIEREALALQKLNVFCLADDANWRTQNAW
WQSCVATVTYIYSTLNRYDPTSFADSQAIIEFRQKLASEINKLQHAVAEGQCWQSDWRISESEAMAARECNLENICQTLL
QLGQMDPNTPPTPAAKPPSMAADAFTNPDYMRYAVKTLLACLICYTFYSGVDWEGIHTCMLTCVIVANPNVGSSYQKMVL
RFGGAFCGAILALLFTLLVMPWLDNIVELLFVLAPIFLLGAWIATSSERSSYIGTQMVVTFALATLENVFGPVYDLVEIR
DRALGIIIGTVVSAVIYTFVWPESEARTLPQKLAGTLGMLSKVMRIPRQQEVTALRTYLQIRIGLHAAFNACEEMCQRVA
LERQLDSEERALLIERSQTVIRQGRDLLHAWDATWNSAQALDNALQPDRAGQFADALEKYAAGLATALSRSPQITLEETP
ASQAILPTLLKQEQHVCQLFARLPDWTAPALTPATEQAQGATQ
>O06989 ~~~mdxE~~~Maltodextrin-binding protein MdxE~~~COG2182
MVLLKKGFAILAASFLAIGLAACSSSKNPASSDGKKVLTVSVEETYKEYIESIKTKFEKENDVTVKIVEKQMFEQLEALP
LDGPAGNAPDVMLAAYDRIGGLGQQGHLLDIKPSNTKSFGDKEMQQVTVDGKVYGMPLVIETLILYYNKDLLKTAPKTFK
DLEKLTEDPRFAFASEKGKSTGFLAKWTDFYMSYGLLAGYGGYVFGKNGTDSGDIGLNNKGAVEAVKYAEKWFETYWPKG
MQDNSSADDFIQQMFLEGKAAAIIGGPWSAANYKEAKLNYGAAPIPTLPNGEEYAPFAGGKGWVASKYTKEPELAEKWLE
YAANDANAYAFYEDTNEVPANTAARKKADEQKNELTSAVIKQYETATPTPNIPEMAEVWTGAESLIFDAASGKKSTQTSA
NDAVNVIKENIKEKYVK
>O86311 7.6.2.-~~~~~~Multidrug efflux system ATP-binding protein Rv1218c~~~COG1131
MSADNHQVPIEIRGLTKHFGSVRALDGLDLTVREGEVHGFLGPNGAGKSTTLRILLGLVKADGGSVRLLGGDPWTDAVDL
HRHIAYVPGDVTLWPSLTGGETIDLLARMRGGIDNARRAELIERFGLDPTKKARTYSKGNRQKVSLISALSSHATLLLLD
EPSSGLDPLMENVFQQCIGEARQRGVTVLLSSHILAETEALCEKVTIIRAGKTVESGSLDALRHLSRTSIKAEMIGDPGD
LSQIKGVEDISIEGTTVRAQVDSESLRELIQVLGHAGVRSLVSQPPTLEELFLRHYSLGPEVAAEQQVATP
>P37958 ~~~mecA~~~Adapter protein MecA 1~~~COG4862
MEIERINEHTVKFYMSYGDIEDRGFDREEIWYNRERSEELFWEVMDEVHEEEEFAVEGPLWIQVQALDKGLEIIVTKAQL
SKDGQKLELPIPEDKKQEPASEDLDALLDDFQKEEQAVNQEEKEQKLQFVLRFGDFEDVISLSKLNVNGSKTTLYSFENR
YYLYVDFCNMTDEEVENQLSILLEYATESSISIHRLEEYGKLIISEHALETIKKHFAS
>P50734 ~~~mecB~~~Adapter protein MecA 2~~~COG4862
MRLERLNYNKIKIFLTLDDLTDRGLTKEDLWKDSFKVHQLFKDMMNEANTELGFEANGPIAVEVYSLQAQGMVVIVTKNQ
DADSEDDEYDDDYIEMQVKLDESADIIYQFHSFEDIIQLSESLQRIGITGGTVYHYDGQYFLSLEDLGSHTAEGVVAVLA
EYGNPTTLTIYRLQEYGKLIMDGNAVETIQTHFS
>Q2G1U5 ~~~mecA~~~Adapter protein MecA~~~COG4862
MRIERVDDTTVKLFITYSDIEARGFSREDLWTNRKRGEEFFWSMMDEINEEEDFVVEGPLWIQVHAFEKGVEVTISKSKN
EDMMNMSDDDATDQFDEQVQELLAQTLEGEDQLEELFEQRTKEKEAQGSKRQKSSARKNTRTIIVKFNDLEDVINYAYHS
NPITTEFEDLLYMVDGTYYYAVYFDSHVDQEVINDSYSQLLEFAYPTDRTEVYLNDYAKIIMSHNVTAQVRRYFPETTE
>Q97Q68 ~~~mecA~~~Adapter protein MecA~~~COG4862
MKMKQISDTTLKITISLEDLMDRGMEIADFLVPQEKTEEFFYAILDELEMPDSFLDTGMLSFRVTPKPDKVDVFVTKSKI
DQNLDFEDLSDLPDMEELAQMSPDEFIKTLEKSIADKTKDDIEAIQSLEQVEVKEEEQEQAEQEAESKKEPYIYYILSFA
KLADLVAFAKTVTFEMETSELYKMNERYYLTILVDIENHPSPYPAWLLARMREFADDSDISRSVLQEYGQVLMSHDAVLN
LQKIG
>P68261 ~~~mecI~~~Methicillin resistance regulatory protein MecI~~~
MDNKTYEISSAEWEVMNIIWMKKYASANNIIEEIQMQKDWSPKTIRTLITRLYKKGFIDRKKDNKIFQYYSLVEESDIKY
KTSKNFINKVYKGGFNSLVLNFVEKEDLSQDEIEELRNILNKK
>P0A0B0 ~~~mecR1~~~Methicillin resistance mecR1 protein~~~
MLSSFLMLSIISSLLTICVIFLVRMLYIKYTQNIMSHKIWLLVLVSTLIPLIPFYKISNFTFSKDMMNRNVSDTTSSVSH
MLDGQQSSVTKDLAINVNQFETSNITYMILLIWVFGSLLCLFYMIKAFRQIDVIKSSSLESSYLNERLKVCQSKMQFYKK
HITISYSSNIDNPMVFGLVKSQIVLPTVVVETMNDKEIEYIILHELSHVKSHDLIFNQLYVVFKMIFWFNPALYISKTMM
DNDCEKVCDRNVLKILNRHEHIRYGESILKCSILKSQHINNVAAQYLLGFNSNIKERVKYIALYDSMPKPNRNKRIVAYI
VCSISLLIQAPLLSAHVQQDKYETNVSYKKLNQLAPYFKGFDGSFVLYNEREQAYSIYNEPESKQRYSPNSTYKIYLALM
AFDQNLLSLNHTEQQWDKHQYPFKEWNQDQNLNSSMKYSVNWYYENLNKHLRQDEVKSYLDLIEYGNEEISGNENYWNES
SLKISAIEQVNLLKNMKQHNMHFDNKAIEKVENSMTLKQKDTYKYVGKTGTGIVNHKEANGWFVGYVETKDNTYYFATHL
KGEDNANGEKAQQISERILKEMELI
>P0A0B1 ~~~mecR1~~~Methicillin resistance mecR1 protein~~~
MLSSFLMLSIISSLLTICVIFLVRMLYIKYTQNIMSHKIWLLVLVSTLIPLIPFYKISNFTFSKDMMNRNVSDTTSSVSH
MLDGQQSSVTKDLAINVNQFETSNITYMILLIWVFGSLLCLFYMIKAFRQIDVIKSSSLESSYLNERLKVCQSKMQFYKK
HITISYSSNIDNPMVFGLVKSQIVLPTVVVETMNDKEIEYIILHELSHVKSHDLIFNQLYVVFKMIFWFNPALYISKTMM
DNDCEKVCDRNVLKILNRHEHIRYGESILKCSILKSQHINNVAAQYLLGFNSNIKERVKYIALYDSMPKPNRNKRIVAYI
VCSISLLIQAPLLSAHVQQDKYETNVSYKKLNQLAPYFKGFDGSFVLYNEREQAYSIYNEPESKQRYSPNSTYKIYLALM
AFDQNLLSLNHTEQQWDKHQYPFKEWNQDQNLNSSMKYSVNWYYENLNKHLRQDEVKSYLDLIEYGNEEISGNENYWNES
SLKISAIEQVNLLKNMKQHNMHFDNKAIEKVENSMTLKQKDTYKYVGKTGTGIVNHKEANGWFVGYVETKDNTYYFATHL
KGEDNANGEKAQQISERILKEMELI
>P9WHS1 3.13.1.6~~~mec~~~CysO-cysteine peptidase~~~COG1310
MLLRKGTVYVLVIRADLVNAMVAHARRDHPDEACGVLAGPEGSDRPERHIPMTNAERSPTFYRLDSGEQLKVWRAMEDAD
EVPVVIYHSHTATEAYPSRTDVKLATEPDAHYVLVSTRDPHRHELRSYRIVDGAVTEEPVNVVEQY
>P31005 1.1.1.244~~~mdh~~~NAD-dependent methanol dehydrogenase~~~
MTNFFIPPASVIGRGAVKEVGTRLKQIGAKKALIVTDAFLHSTGLSEEVAKNIREAGLDVAIFPKAQPDPADTQVHEGVD
VFKQENCDALVSIGGGSSHDTAKAIGLVAANGGRINDYQGVNSVEKPVVPVVAITTTAGTGSETTSLAVITDSARKVKMP
VIDEKITPTVAIVDPELMVKKPAGLTIATGMDALSHAIEAYVAKGATPVTDAFAIQAMKLINEYLPKAVANGEDIEAREA
MAYAQYMAGVAFNNGGLGLVHSISHQVGGVYKLQHGICNSVNMPHVCAFNLIAKTERFAHIAELLGENVSGLSTAAAAER
AIVALERYNKNFGIPSGYAEMGVKEEDIELLAKNAFEDVCTQSNPRVATVQDIAQIIKNAL
>Q9F837 2.6.1.106~~~megDII~~~dTDP-3-amino-3,4,6-trideoxy-alpha-D-glucose transaminase~~~
MTTYVWSYLLEYERERADILDAVQKVFASGSLILGQSVENFETEYARYHGIAHCVGVDNGTNAVKLALESVGVGRDDEVV
TVSNTAAPTVLAIDEIGARPVFVDVRDEDYLMDTDLVEAAVTPRTKAIVPVHLYGQCVDMTALRELADRRGLKLVEDCAQ
AHGARRDGRLAGTMSDAAAFSFYPTKVLGAYGDGGAVVTNDDETARALRRLRYYGMEEVYYVTRTPGHNSRLDEVQAEIL
RRKLTRLDAYVAGRRAVAQRYVDGLADLQDSHGLELPVVTDGNEHVFYVYVVRHPRRDEIIKRLRDGYDISLNISYPWPV
HTMTGFAHLGVASGSLPVTERLAGEIFSLPMYPSLPHDLQDRVIEAVREVITGL
>Q8RDT4 4.4.1.11~~~~~~L-methionine gamma-lyase~~~COG0626
MEMKKSGLGTTAIHAGTLKNLYGTLAMPIYQTSTFIFDSAEQGGRRFALEEAGYIYTRLGNPTTTVLENKIAALEEGEAG
IAMSSGMGAISSTLWTVLKAGDHVVTDKTLYGCTFALMNHGLTRFGVEVTFVDTSNLEEVKNAMKKNTRVVYLETPANPN
LKIVDLEALSKIAHTNPNTLVIVDNTFATPYMQKPLKLGVDIVVHSATKYLNGHGDVIAGLVVTRQELADQIRFVGLKDM
TGAVLGPQEAYYIIRGLKTFEIRMERHCKNARTIVDFLNKHPKVEKVYYPGLETHPGYEIAKKQMKDFGAMISFELKGGF
EAGKTLLNNLKLCSLAVSLGDTETLIQHPASMTHSPYTKEEREVAGITDGLVRLSVGLENVEDIIADLEQGLEKI
>Q8L0X4 4.4.1.11~~~mgl~~~L-methionine gamma-lyase~~~
METKKYGLGTTAIHAGTLKNLYGTLAMPIYQTSTFIFDSAEQGGRRFALEEAGYIYTRLGNPTTTVLENKIAALEEGEAA
VATSSGMGAISSTLWTVLKAGDHVVTDKTLYGCTFALMCHGLTRFGIEVTFVDTSNLDEVKNAMKKNTRVVYLETPANPN
LKIVDLEALSKLAHTNPNTLVIVDNTFATPYMQKPLKLGADIVVHSVTKYINGHGDVIAGLVITNKELADQIRFIGLKDM
TGAVLGPQDAYYIIRGMKTFEIRMERHCKNAKKVVEFLNKHPKIERVYYPGLETHPGHEIAKKQMKDFGAMISFELKGGF
EAGKTLLNNLKLCSLAVSLGDTETLIQHPASMTHSPYTKEEREAAGITDGLVRLSVGLENVEDIIADLEQGLEKI
>Q7MX71 4.4.1.11~~~mgl~~~L-methionine gamma-lyase~~~COG0626
MKKEDLMRSGFATRAIHGGAIENAFGCLATPIYQTSTFVFDTAEQGGRRFAGEEDGYIYTRLGNPNCTQVEEKLAMLEGG
EAAASASSGIGAISSAIWVCVKAGDHIVAGKTLYGCTFAFLTHGLSRYGVEVTLVDTRHPEEVEAAIRPNTKLVYLETPA
NPNMYLTDIKAVCDIAHKHEGVRVMVDNTYCTPYICRPLELGADIVVHSATKYLNGHGDVIAGFVVGKEDYIKEVKLVGV
KDLTGANMSPFDAYLISRGMKTLQIRMEQHCRNAQTVAEFLEKHPAVEAVYFPGLPSFPQYELAKKQMALPGAMIAFEVK
GGCEAGKKLMNNLHLCSLAVSLGDTETLIQHPASMTHSPYTPEERAASDISEGLVRLSVGLENVEDIIADLKHGLDSLI
>P13254 4.4.1.11~~~mdeA~~~L-methionine gamma-lyase~~~
MHGSNKLPGFATRAIHHGYDPQDHGGALVPPVYQTATFTFPTVEYGAACFAGEQAGHFYSRISNPTLNLLEARMASLEGG
EAGLALASGMGAITSTLWTLLRPGDEVLLGNTLYGCTFAFLHHGIGEFGVKLRHVDMADLQALEAAMTPATRVIYFESPA
NPNMHMADIAGVAKIARKHGATVVVDNTYCTPYLQRPLELGADLVVHSATKYLSGHGDITAGIVVGSQALVDRIRLQGLK
DMTGAVLSPHDAALLMRGIKTLNLRMDRHCANAQVLAEFLARQPQVELIHYPGLASFPQYTLARQQMSQPGGMIAFELKG
GIGAGRRFMNALQLFSRAVSLGDAESLAQHPASMTHSSYTPEERAHYGISEGLVRLSVGLEDIDDLLADVQQALKASA
>Q826W3 4.4.1.11~~~mgl~~~L-methionine gamma-lyase~~~COG0626
MDDGRGAGGFGGGPAVRALDTEAVHAGRDDLARQGLHAAPIDLSTTYPSYDSRAEAARIDAFAADGAEPAGPPVYGRLGN
PTVARFETALARLEGTDSAVAFASGMAALSAVLLVRNAMGLRHVVAVRPLYGCSDHLLTAGLLGSEVTWVDPAGVADALR
PDTGLVMVESPANPTLAELDLRALAHACGSVPLLADNTFATPVLQRPAEHGARLVLHSATKYLGGHGDVMAGVVACDEEF
ARGLRQIRFATGGVLHPLAGYLLLRGLSTLPIRVRAASSNAAELARRLAADPRVARVHYPRIGGAMIAFEVYGDPHEVIA
GVRLITPAVSLGSVDSLIQHPASISHRIVDAADRRGAGVSDRLLRLSVGLEDVEDLWADLDGALGTDRLPETAGAGREPS
RTALRLPERAADR
>Q73KL7 4.4.1.11~~~megL~~~L-methionine gamma-lyase~~~COG0626
MNRKELEKLGFASKQIHAGSIKNKYGALATPIYQTSTFAFDSAEQGGRRFALEEEGYIYTRLGNPTTTVVEEKLACLENG
EACMSASSGIGAVTSCIWSIVNAGDHIVAGKTLYGCTFAFLNHGLSRFGVDVTFVDTRDPENVKKALKPNTKIVYLETPA
NPNMYLCDIAAVSKIAHAHNPECKVIVDNTYMTPYLQRPLDLGADVVLHSATKYLNGHGDVIAGFVVGKKEFIDQVRFVG
VKDMTGSTLGPFEAYLIGRGMKTLDIRMEKHCANAQKVAEFLEKHPAVESIAFPGLKSFPQYELAKKQMKLCGAMIAFTV
KGGLEAGKTLINSVKFATIAVSLGDAETLIQHPASMTHSPYTPEERAASDIAEGLVRLSVGLEDAEDIIADLKQALDKLV
K
>A9WC41 4.2.1.153~~~meh~~~Mesaconyl-C(4)-CoA hydratase~~~COG3777
MSSADWMAWIGRTEQVEDDICLAQAIAAAATLEPPSGAPTADSPLPPLWHWFYFLPRAPQSQLSSDGHPQRGGFIPPIPY
PRRMFAGARIRFHHPLRIGQPARREGVIRNITQKSGRSGPLAFVTVGYQIYQHEMLCIEEEQDIVYREPGAPVPAPTPVE
LPPVHDAITRTVVPDPRLLFRFSALTFNAHRIHYDRPYAQHEEGYPGLVVHGPLVAVLLMELARHHTSRPIVGFSFRSQA
PLFDLAPFRLLARPNGDRIDLEAQGPDGATALSATVELGG
>C0HL39 ~~~~~~Mejucin~~~
LGPQLNKGCATCSIGAACLVDGPIPDEIAG
>Q0MRG5 3.1.1.113~~~mekB~~~Ethyl acetate hydrolase~~~
MNSYYTEENHGPFELINIGPLPLEEGRCMPECLLAVAVHGALNADKSNAILVPTWYSGTSKAMEQIYIGEGRALDPSKYC
IIVVNQIGNGLSSSASNTGGSLAGPGFANVRIGDDVSAQHTLLTEYFGIESLALVVGGSMGAQQTYEWAVRYPDFVKRAA
AIAGTARNSEHDFLFTEILIEAITTDPAFQAGLYRSSSAVAAGLERHAKLWTLMGWSPEFFRTGRHKALGFESMQMFVDG
FMKRYFAPMDPNNLLTMAWKWQRGDVSRHTGGDLAKALGRIKAKTYVMPISHDQFFTVDDCLSEQKMIPNSEFRPLRSID
GHLGLFGTDAQMLDQLDAHLAELLSSPAY
>A9AWD7 2.1.1.347~~~~~~(+)-O-methylkolavelool synthase~~~COG2230
MTVLSTDIPAPNSPSISDVEAYYDAMGPFYKLIWGDSVHGGYWPAGLEDMSLPEAQEHLTNLMIEKTPIKPGQHMLDLGC
GTGLPAIRMASAKQCHVHGLTVAHGQVAEAQATIQAMQMQELVHINWGNAMELPFEADFFNAAWAFESIFHMPSRLTVLQ
EANRVLQAGSYFVLTDIVEVKSLSPEQQQIFFPAFQINTLTTKQGYLDLFAQTGFEQLELIDLTAGIEKTLAHTKLGIEQ
KRAELAAIYPPEMLGMIEQTWPMVEKIYAEFVRYVLIVARKRG
>G1UB44 3.2.1.22~~~melA~~~Alpha-galactosidase Mel36A~~~COG3345
MTSNLIKFDDQNKVFHLHNKQISYLLSIEDGGTLSHLYFGGAVKNYNNQLKYPRLDRGFSGNLPESLDRTFSRDSLPKEY
SSAGEMDFHTPATIVRNPDGSNALFLAYKSYKIEDGKPDLKGLPHSWTKEDDEAQTLIVTLEDKVSKLEYDLLYTIYRDR
PVIVRSVQVHNHGEEAVYLEKVASMQMDYVDKDFEVITLPGAHANERRVQRENIGQGIKVFSSYRGTSSHQMNPFMALVD
HDTNEFMGEAYGFALAYSGNHKFEVERDQFGQIHVNTGINDYNFKWKLNPNEEFQTPEVLMVYSDQGLNKMSQAFHSLIH
ERIMRSKFKDQIRPVLVNNWEATYFDFNEDKLKTIVDKAKKLGLEMFVLDDGWFGHRDDDNSSLGDWKVYKKKFPNGLGH
FADYVHEQGLKFGLWFEPEMISYESNLYKEHPDYLMHVPGRKPCPSRNQYVLELGRKEVRDNIFEQMVKILDSKKIDYIK
WDMNRSLSDIYESDLPADQQGEAYHRYVLGYYDLLNKLVTRYPDILFEGCSGGGGRFDVGQAYYTPQIWASDNTDAIERL
KIQYGTSLVYPQSMMTSHVSVSPNEQNGRITPFNTRGAVAMWGDLGYELDLTKMSDEESDQVVKQVTEYKKIREVTQFGT
LYRLKASASNQCAWMMVDSNKNEAVVTVVNVMAHAQPYCTKTKLAGLDPDKRYKNLETDEVFGGDELMHLGFYDPIERGD
FKAKMYHFKAIN
>P23996 ~~~melA~~~Protein MelA~~~
MASEQNPLGLLGIEFTEFATPDLDFMHKVFIDFGFSKLKKHKQKDIVYYKQNDINFLLNNEKQGFSAQFAKTHGPAISSM
GWRVEDANFAFEGAVARGAKPAADEVKDLPYPAIYGIGDSLIYFIDTFGDDNNIYTSDFEALDEPIITQEKGFIEVDHLT
NNVHKGTMEYWSNFYKDIFGFTEVRYFDIKGSQTALISYALRSPDGSFCIPINEGKGDDRNQIDEYLKEYDGPGVQHLAF
RSRDIVASLDAMEGSSIQTLDIIPEYYDTIFEKLPQVTEDRDRIKHHQILVDGDEDGYLLQIFTKNLFGPIFIEIIQRKN
NLGFGEGNFKALFESIERDQVRRGVL
>P02921 ~~~melB~~~Melibiose permease~~~COG2211
MSISMTTKLSYGFGAFGKDFAIGIVYMYLMYYYTDVVGLSVGLVGTLFLVARIWDAINDPIMGWIVNATRSRWGKFKPWI
LIGTLANSVILFLLFSAHLFEGTTQIVFVCVTYILWGMTYTIMDIPFWSLVPTITLDKREREQLVPYPRFFASLAGFVTA
GVTLPFVNYVGGGDRGFGFQMFTLVLIAFFIVSTIITLRNVHEVFSSDNQPSAEGSHLTLKAIVALIYKNDQLSCLLGMA
LAYNVASNIITGFAIYYFSYVIGDADLFPYYLSYAGAANLVTLVFFPRLVKSLSRRILWAGASILPVLSCGVLLLMALMS
YHNVVLIVIAGILLNVGTALFWVLQVIMVADIVDYGEYKLHVRCESIAYSVQTMVVKGGSAFAAFFIAVVLGMIGYVPNV
EQSTQALLGMQFIMIALPTLFFMVTLILYFRFYRLNGDTLRRIQIHLLDKYRKVPPEPVHADIPVGAVSDVKA
>O07366 ~~~melB~~~Melibiose permease~~~
MSISMTTKLSYGFGAFGKDFAIGIVYMYLMYYYTDIVGLSVGVVGTLFLVARILDAIADPIMGWIVNCTRSRWGKFKPWI
LIGTITNSVVLYMLFSAHHFSGGALLAWVWLTYLLWGFTYTIMDVPFWSLVPTITLDKREREQLVPYPRFFASLAGFVTA
GVTLPFVSAVGGADRGFGFQMFTLVLIAFFVISTLVTLRNVHEVYSSDSGVSEDSSHLSLGQMVALIYKNDQLACLLGMA
LAYNTAANIIAGFAIYYFTYVIGSAEMFPYYMSYAGAANLLTLILFPRLVKGLSRRILWAGASIMPVLGCGVLLLMALSG
VYNIALISLAGVLLNIGTALFWVLQVIMVADTVDYGEYTMNIRCESIAYSVQTLVVKAGSAFAAWFIAIVLGIIGYVPNT
AQSPHTLLGMQAIMIALPTLFFALTLFLYFRYYKLNGDMLRRIQIHLLDKYRRVPENVVEPERPIVVPNQV
>Q02581 ~~~melB~~~Melibiose permease~~~
MSISMTTKLSYGFGAFGKDFAIGIVYMYLMYYYTDIVGLSVGVVGTLFLVARILDAIADPIMGWIVNCTRSRWGKFKPWI
LIGTITNSVVLYMLFSAHHFSGGALLAWVWLTYLLWGFTYTIMDVPFWSLVPTITLDKREREQLVPYPRFFASLAGFVTA
GVTLPFVNAVGGADRGFGFQMFTLVLIAFFVVSTLVTLRNVHEVYSSDSGVSEDSSHLSLRQMVALIYKNDQLACLLGMA
LAYNTAANIIAGFAIYYFTYVIGSAEMFPYYMSYAGAANLLTLILFPRLVKGLSRRILWAGASIMPVLGCGVLLLMALGG
VYNIALISLAGVLLNIGTALFWVLQVIMVADTVDYGEYTMNIRCESIAYSVQTLVVKAGSAFAAWFIAIVLGIIGYVPNV
VQSSHTLLGMQAIMIALPTLFFALTLFLYFRYYKLNGDMLRRIQIHLLDKYRRVPENDVEPERPIVVPNQV
>P30878 ~~~melB~~~Melibiose permease~~~
MSISLTTKLSYGFGAFGKDFAIGIVYMYLMYYYTDVVGLSVGLVGTLFLVARIWDAINDPIMGWIVNATRSRWGKFKPWI
LIGTLTNSLVLFLLFSAHLFEGTAQVVFVCVTYILWGMTYTIMDIPFWSLVPTITLDKREREQLVPFPRFFASLAGFVTA
GITLPFVSYVGGADRGFGFQMFTLVLIAFFIASTIVTLRNVHEVYSSDNGVTAGRPHLTLKTIVGLIYKNDQLSCLLGMA
LAYNIASNIINGFAIYYFTYVIGDADLFPYYLSYAGAANLLTLIVFPRLVKMLSRRILWAGASVMPVLSCAGLFAMALAD
IHNAALIVAAGIFLNIGTALFWVLQVIMVADTVDYGEFKLNIRCESIAYSVQTMVVKGGSAFAAFFIALVLGLIGYTPNV
AQSAQTLQGMQFIMIVLPVLFFMMTLVLYFRYYRLNGDMLRKIQIHLLDKYRKTPPFVEQPDSPAISVVATSDVKA
>O34518 ~~~melC~~~Melibiose/raffinose/stachyose import permease protein MelC~~~COG0395
MRAARTKSMRIITLLAAIVACAHFIPFYILLTTSLKAKGDYSSKWIFPADISFHNFSEAWERASLGNSFINTMIITGFSA
LLLIIFGSLAAYPLARRETKLNKAVFALLISIMIIPPLTSMVPLYRMVVDAGMVNTHAIAIFINTAAYMPLTVFLYSGFI
RSTIPKELVEAARIDGAGMLKIFFTIVFPLLKPITATICIISCVFIWNDYQFAIFFLQDQKVQTLTVAMAGFFGENANNL
HLVAAAALMAMLPMVVLFLALQKYFIAGLSSGAVKG
>O34706 ~~~melD~~~Melibiose/raffinose/stachyose import permease protein MelD~~~COG1175
MSEIARDVHVKQVKPKKQSSLWWMYIPALLSVLVFMIYPFVKGTLITFTNWNGFSQVYQWVGFAQYERLFSDPDTWHILK
NTLHYGLGSTFFQNVVGLLYALLLNQSIKTKAVTRTIVYLPVMISPLIMGYIWYFFFSYDGGALNDLLGVFGISPINALA
SPSLNPWIIVMINTYQYVGIAMVVYLAGLQSIPKDYYEAAQMDGAKQGQQFFTITLPLLMPSITINIVINIIGGLKLFDV
IIALTAGGPGNASQSMSTFMYDLYFKRQDAGYAATQGIFMAFVILIISFCALAYFKRKETEMS
>O34335 ~~~melE~~~Melibiose/raffinose/stachyose-binding protein MelE~~~COG1653
MKHTFVLFLSLILLVLPGCSAEKSSADTAKKTLTIYSTMSTDSERDTFRKLAAAFEKEHSDIHVSLHFPGNDYENMMRVR
MAANDLPDLFDTHGWGKIRYGEYTADLRDMKWTQDLDPNLNSILKNKSGKVYAYPINQAKDGLAYNRNILDRYGIAPPET
MDDFIKALRTIKEKSKGSIVPFWFAGYDKSSFAQYYDQFATPLLITDPAHNEKKQLINGTFQWSKFTYLSEILKQMQKEK
LINIDAVTAKKSQLIELMAQNKIAFTMQGGTLGQDVAQINPNVKVGIIPTPAIHPGDDPIWIGGERYTLAAWKDSPQLKE
AKDFIAFMARPANAKQMAEATSLPSGLTNVKADIFYANDYEYYQDVKVEPYFDRLYLPNGMWDVLGTVGQELAADILAPQ
DISQKLGREYKRLREQSETQGAENNE
>O34829 ~~~melR~~~HTH-type transcriptional repressor MelR~~~COG1609
MVRIKDIALKAKVSSATVSRILNEDESLSVAGETRQRVINIAEELGYQTVAKRRKSRGQKQRAQPLIGVLSCLSPDQERQ
DPYFSSIRKGIEKECFEQEIFITNSIHLGSFQEHIFRELDGVIVIGRVHDEAVKHISGRLEHAVFINHSPDPQAYDSIGI
DFESASRQAIDHLFDLGYKRLGYIGGQEKEHTLKDGQSIRRTIEDKRLTAFLESAAPQPEHVLIGEYSMREGYRLMKKAI
DQGHLPEAFFIASDSMAIGALKALQEAGLQVPRDTAIVSFNGIEEAEFASTPLTTVKVYTEEMGRTGVKLLLDRLNGRTL
PQHVTLPTTLIVRQSCGCTAKEVT
>P0ACH8 ~~~melR~~~Melibiose operon regulatory protein~~~COG2169
MNTDTFMCSSDEKQTRSPLSLYSEYQRMEIEFRAPHIMPTSHWHGQVEVNVPFDGDVEYLINNEKVNINQGHITLFWACT
PHQLTDTGTCQSMAIFNLPMHLFLSWPLDKDLINHVTHGMVIKSLATQQLSPFEVRRWQQELNSPNEQIRQLAIDEIGLM
LKRFSLSGWEPILVNKTSRTHKNSVSRHAQFYVSQMLGFIAENYDQALTINDVAEHVKLNANYAMGIFQRVMQLTMKQYI
TAMRINHVRALLSDTDKSILDIALTAGFRSSSRFYSTFGKYVGMSPQQYRKLSQQRRQTFPG
>P96517 ~~~melY~~~Melibiose permease~~~COG2211
MNTTTCTHKDNPNFWIFGLFFFLYFFIMATCFPFLPIWLSDIIGLNKTHTGIVFSCISLSAIAFQPVLGVISDKLGLKKH
LLWIISVLLFLFAPFFLYVFAPLLKTNIWLGALSGGLYIGFVFSAGSGAIEAYIERVSRNSAFEYGKARMFGCLGWGLCA
STGGILFGIDPSYVFWMGSAAALLLMLLLVVAKPKPNQTAQVMNALGANQPQITAKKVFNLFRQRRMWMFILYVIGVACV
YDVFDQQFATFFKTFFATPQEGTRAFGFATTAGEICNAIIMFCSPWIINRIGAKNTLLIAGLIMATRIIGSSFATTAVEV
IALKMLHALEVPFLLVGAFKYITGVFDTRLSATIYLIGFQFAKQSAAIFLSAFAGNMYDRIGFQETYLMLGCFVLAITVV
SAFTLSSRQEIAAAAGAAALTSQSR
>P22869 1.14.13.25~~~mmoX~~~Methane monooxygenase component A alpha chain~~~COG3350
MALSTATKAATDALAANRAPTSVNAQEVHRWLQSFNWDFKNNRTKYATKYKMANETKEQFKLIAKEYARMEAVKDERQFG
SLQDALTRLNAGVRVHPKWNETMKVVSNFLEVGEYNAIAATGMLWDSAQAAEQKNGYLAQVLDEIRHTHQCAYVNYYFAK
NGQDPAGHNDARRTRTIGPLWKGMKRVFSDGFISGDAVECSLNLQLVGEACFTNPLIVAVTEWAAANGDEITPTVFLSIE
TDELRHMANGYQTVVSIANDPASAKYLNTDLNNAFWTQQKYFTPVLGMLFEYGSKFKVEPWVKTWNRWVYEDWGGIWIGR
LGKYGVESPRSLKDAKQDAYWAHHDLYLLAYALWPTGFFRLALPDQEEMEWFEANYPGWYDHYGKIYEEWRARGCEDPSS
GFIPLMWFIENNHPIYIDRVSQVPFCPSLAKGASTLRVHEYNGQMHTFSDQWGERMWLAEPERYECQNIFEQYEGRELSE
VIAELHGLRSDGKTLIAQPHVRGDKLWTLDDIKRLNCVFKNPVKAFN
>P27353 1.14.13.25~~~mmoX~~~Methane monooxygenase component A alpha chain~~~
MAISLATKAATDALKVNRAPVGVEPQEVHKWLQSFNWDFKENRTKYPTKYHMANETKEQFKVIAKEYARMEAAKDERQFG
TLLDGLTRLGAGNKVHPRWGETMKVISNFLEVGEYNAIAASAMLWDSATAAEQKNGYLAQVLDEIRHTHQCAFINHYYSK
HYHDPAGHNDARRTRAIGPLWKGMKRVFADGFISGDAVECSVNLQLVGEACFTNPLIVAVTEWASANGDEITPTVFLSVE
TDELRHMANGYQTVVSIANDPASAKFLNTDLNNAFWTQQKYFTPVLGYLFEYGSKFKVEPWVKTWNRWVYEDWGGIWIGR
LGKYGVESPASLRDAKRDAYWAHHDLALAAYAMWPLGFARLALPDEEDQAWFEANYPGWADHYGKIFNEWKKLGYEDPKS
GFIPYQWLLANGHDVYIDRVSQVPFIPSLAKGTGSLRVHEFNGKKHSLTDDWGERQWLIEPERYECHNVFEQYEGRELSE
VIAEGHGVRSDGKTLIAQPHTRGDNLWTLEDIKRAGCVFPDPLAKF
>P18798 1.14.13.25~~~mmoY~~~Methane monooxygenase component A beta chain~~~
MSMLGERRRGLTDPEMAAVILKALPEAPLDGNNKMGYFVTPRWKRLTEYEALTVYAQPNADWIAGGLDWGDWTQKFHGGR
PSWGNETTELRTVDWFKHRDPLRRWHAPYVKDKAEEWRYTDRFLQGYSADGQIRAMNPTWRDEFINRYWGAFLFNEYGLF
NAHSQGAREALSDVTRVSLAFWGFDKIDIAQMIQLERGFLAKIVPGFDESTAVPKAEWTNGEVYKSARLAVEGLWQEVFD
WNESAFSVHAVYDALFGQFVRREFFQRLAPRFGDNLTPFFINQAQTYFQIAKQGVQDLYYNCLGDDPEFSDYNRTVMRNW
TGKWLEPTIAALRDFMGLFAKLPAGTTDKEEITASLYRVVDDWIEDYASRIDFKADRDQIVKAVLAGLK
>P27354 1.14.13.25~~~mmoY~~~Methane monooxygenase component A beta chain~~~
MSQPQSSQVTKRGLTDPERAAIIAAAVPDHALDTQRKYHYFIQPRWKPLSEYEQLSCYAQPNPDWIAGGLDWGDWTQKFH
GGRPSWGNESTELRTTDWYRHRDPARRWHHPYVKDKSEEARYTQRFLAAYSSEGSIRTIDPYWRDEILNKYFGALLYSEY
GLFNAHSSVGRDCLSDTIRQTAVFAALDKVDNAQMIQMERLFIAKLVPGFDASTDVPKKIWTTDPIYSGARATVQEIWQG
VQDWNEILWAGHAVMIATFGQFARREFFQRLATVYGDTLTPFFTAQSQTYFQTTRGAIDDLFVYCLANDSEFGAHNRTFL
NAWTEHYLASSVAALKDFVGLYAKVEKSRADRSRRRLRGAAASSAIGRSITPDKIGFRVDVDQKVDAVLAGYKN
>P11987 1.14.13.25~~~mmoZ~~~Methane monooxygenase component A gamma chain~~~
MAKLGIHSNDTRDAWVNKIAQLNTLEKAAEMLKQFRMDHTTPFRNSYELDNDYLWIEAKLEEKVAVLKARAFNEVDFRHK
TAFGEDAKSVLDGTVAKMNAAKDKWEAEKIHIGFRQAYKPPIMPVNYFLDGERQLGTRLMELRNLNYYDTPLEELRKQRG
VRVVHLQSPH
>P27355 1.14.13.25~~~mmoZ~~~Methane monooxygenase component A gamma chain~~~
MAKREPIHDNSIRTEWEAKIAKLTSVDQATKFIQDFRLAYTSPFRKSYDIDVDYQYIERKIEEKLSVLKTEKLPVADLIT
KATTGEDRAAVEATWIAKIKAAKSKYEADGIHIEFRQLYKPPVLPVNVFLRTDAALGTVLMEIRNTDYYGTPLEGLRKEP
GVKVLHLQA
>P32166 2.5.1.74~~~menA~~~1,4-dihydroxy-2-naphthoate octaprenyltransferase~~~COG1575
MTEQQISRTQAWLESLRPKTLPLAFAAIIVGTALAWWQGHFDPLVALLALITAGLLQILSNLANDYGDAVKGSDKPDRIG
PLRGMQKGVITQQEMKRALIITVVLICLSGLALVAVACHTLADFVGFLILGGLSIIAAITYTVGNRPYGYIGLGDISVLV
FFGWLSVMGSWYLQAHTLIPALILPATACGLLATAVLNINNLRDINSDRENGKNTLVVRLGEVNARRYHACLLMGSLVCL
ALFNLFSLHSLWGWLFLLAAPLLVKQARYVMREMDPVAMRPMLERTVKGALLTNLLFVLGIFLSQWAA
>P9WIP3 2.5.1.74~~~menA~~~1,4-dihydroxy-2-naphthoate octaprenyltransferase~~~COG1575
MASFAQWVSGARPRTLPNAIAPVVAGTGAAAWLHAAVWWKALLALAVAVALVIGVNYANDYSDGIRGTDDDRVGPVRLVG
SRLATPRSVLTAAMTSLALGALAGLVLALLSAPWLIAVGAICIAGAWLYTGGSKPYGYAGFGELAVFVFFGPVAVLGTQY
TQALRVDWVGLAQAVATGALSCSVLVANNLRDIPTDARADKITLAVRLGDARTRMLYQGLLAVAGVLTFVLMLATPWCVV
GLVAAPLALRAAGPVRSGRGGRELIPVLRDTGLAMLVWALAVAGALAFGQLS
>P23966 4.1.3.36~~~menB~~~1,4-dihydroxy-2-naphthoyl-CoA synthase~~~COG0447
MAEWKTKRTYDEILYETYNGIAKITINRPEVHNAFTPKTVAEMIDAFADARDDQNVGVIVLAGAGDKAFCSGGDQKVRGH
GGYVGDDQIPRLNVLDLQRLIRVIPKPVVAMVSGYAIGGGHVLHIVCDLTIAADNAIFGQTGPKVGSFDAGYGSGYLARI
VGHKKAREIWYLCRQYNAQEALDMGLVNTVVPLEQLEEETIKWCEEMLEKSPTALRFLKAAFNADTDGLAGIQQFAGDAT
LLYYTTDEAKEGRDSFKEKRKPDFGQFPRFP
>P0ABU0 4.1.3.36~~~menB~~~1,4-dihydroxy-2-naphthoyl-CoA synthase~~~COG0447
MIYPDEAMLYAPVEWHDCSEGFEDIRYEKSTDGIAKITINRPQVRNAFRPLTVKEMIQALADARYDDNIGVIILTGAGDK
AFCSGGDQKVRGDYGGYKDDSGVHHLNVLDFQRQIRTCPKPVVAMVAGYSIGGGHVLHMMCDLTIAADNAIFGQTGPKVG
SFDGGWGASYMARIVGQKKAREIWFLCRQYDAKQALDMGLVNTVVPLADLEKETVRWCREMLQNSPMALRCLKAALNADC
DGQAGLQELAGNATMLFYMTEEGQEGRNAFNQKRQPDFSKFKRNP
>A0QRD3 4.1.3.36~~~menB~~~1,4-dihydroxy-2-naphthoyl-CoA synthase~~~COG0447
MSSQAASDNPFDPAMWERVPGFDDLTDITYHRHVLDGARQPTVRVAFDRPEVRNAFRPHTVDELYRVLDHARMSSDVGVI
LLTGNGPSPKDGGWAFCSGGDQRIRGRTGYQYASGETAETVDPARAGRLHILEVQRLIRFMPKVVICLVNGWAAGGGHSL
HVTCDLTLASREHARFKQTDADVGSFDGGFGSAYLARQTGQKFAREIFFLGRAYDAQTMHQMGAVNEVVDHADLEKAGLQ
YAAEINGKSPQAIRMLKFAFNLIDDGLVGQQVFAGEATRLAYMTDEAVEGRDAFLEKRDPDWSRFPRYF
>P9WNP5 4.1.3.36~~~menB~~~1,4-dihydroxy-2-naphthoyl-CoA synthase~~~COG0447
MVAPAGEQGRSSTALSDNPFDAKAWRLVDGFDDLTDITYHRHVDDATVRVAFNRPEVRNAFRPHTVDELYRVLDHARMSP
DVGVVLLTGNGPSPKDGGWAFCSGGDQRIRGRSGYQYASGDTADTVDVARAGRLHILEVQRLIRFMPKVVICLVNGWAAG
GGHSLHVVCDLTLASREYARFKQTDADVGSFDGGYGSAYLARQVGQKFAREIFFLGRTYTAEQMHQMGAVNAVAEHAELE
TVGLQWAAEINAKSPQAQRMLKFAFNLLDDGLVGQQLFAGEATRLAYMTDEAVEGRDAFLQKRPPDWSPFPRYF
>Q7CQ56 4.1.3.36~~~menB~~~1,4-dihydroxy-2-naphthoyl-CoA synthase~~~
MIYPDETMLYAPVEWHDCSEGYTDIRYEKSTDGIAKITINRPQVRNAFRPLTVKEMIQALADARYDDNVGVIILTGEGDK
AFCAGGDQKVRGDYGGYQDDSGVHHLNVLDFQRQIRTCPKPVVAMVAGYSIGGGHVLHMMCDLTIAAENAIFGQTGPKVG
SFDGGWGASYMARIVGQKKAREIWFLCRQYDAQQALDMGLVNTVVPLADLEKETVRWCREMLQNSPMALRCLKAALNADC
DGQAGLQELAGNATMLFYMTEEGQEGRNAFNQKRQPDFSKFKRNP
>Q5HH38 4.1.3.36~~~menB~~~1,4-dihydroxy-2-naphthoyl-CoA synthase~~~
MTNRQWETLREYDEIKYEFYEGIAKVTINRPEVRNAFTPKTVAEMIDAFSRARDDQNVSVIVLTGEGDLAFCSGGDQKKR
GHGGYVGEDQIPRLNVLDLQRLIRIIPKPVIAMVKGYAVGGGNVLNVVCDLTIAADNAIFGQTGPKVGSFDAGYGSGYLA
RIVGHKKAREIWYLCRQYNAQEALDMGLVNTVVPLEKVEDETVQWCKEIMKHSPTALRFLKAAMNADTDGLAGLQQMAGD
ATLLYYTTDEAKEGRDAFKEKRDPDFDQFPKFP
>Q7A6A9 4.1.3.36~~~menB~~~1,4-dihydroxy-2-naphthoyl-CoA synthase~~~
MTNRQWETLREYDEIKYEFYEGIAKVTINRPEVRNAFTPKTVAEMIDAFSRARDDQNVSVIVLTGEGDLAFCSGGDQKKR
GHGGYVGEDQIPRLNVLDLQRLIRIIPKPVIAMVKGYAVGGGNVLNVVCDLTIAADNAIFGQTGPKVGSFDAGYGSGYLA
RIVGHKKAREIWYLCRQYNAQEALDMGLVNTVVPLDKVEDETVQWCKEIMKHSPTALRFLKAAMNADTDGLAGLQQMAGD
ATLLYYTTDEAKEGRDAFKEKRDPDFDQFPKFP
>O34514 4.2.1.113~~~menC~~~o-succinylbenzoate synthase~~~COG4948
MIEIEKITLYHLSMNLKKPFKNSIETLQERKFLIVEAIDTSGVTGWGEVSAFSSPWYTEETIGTCLHMLKDFFIPNVVGR
EFNHPSEVPDSLARYKGNRMAKAGLESAVWDIYAKKKGVSLAEALGGTRDKVPAGVVVGLAPLDDMLKEIESYQKEGYQR
IKIKIQPGQDVELVKAIRSRFPTIPLMADANSSYELKDISRLKELDDYHLMMIEQPLQADDIVDHRHLQKHLKTAICLDE
SICSVDDARRAIELGSCKIINIKPSRVGGLTEALKIHDLCKEHHMQVWCGGMLETGISRAQNVALASLPQFTIPGDISSS
SRYWDEDIVTPDIRIDNGFISVSKQPGLGVEVNQDIMRKYVTKMDVFTQHG
>Q6MQC7 4.2.1.113~~~menC~~~o-succinylbenzoate synthase~~~COG4948
MIKISYSPYTLKPVQSLNAATAATAREGVLLKVEWNDGLYGFADLHPWPELGDLSLEEQLSDLRMGRMTTQIEQSIWLAR
RDALLRKEKKHVFDGGEKIKNNYLLSHFQDLKPGFLDGLKNEGYNTVKVKMGRDLQKEADMLTHIAASGMRMRLDFNALG
SWQTFEKFMVNLPLTVRPLIEYVEDPFPFDFHAWGEARKLAKIALDNQYDKVPWGKIASAPFDVIVIKPAKTDVDKAVAQ
CQKWNLKLAVTSYMDHPVGVVHAVGVAMELKDKYGDMILESGCLTHRLYQMDSFAAELSTQGPYLLKNKGTGVGFDKLLE
ALTWYQLKVR
>Q6ARP5 4.2.1.113~~~menC~~~o-succinylbenzoate synthase~~~COG4948
MSGMELSYRRSDLIFKRPAGTSRGVLTSKPTWFVRLDIDGHGGQGEVSLIPGLSLDPEEQIGRELDLLARRLRAEEPIRL
RQFLAERGGADFSDYRSVLTDIAGILDSWQVSTDGRFPALRFALEMALLDLLSGGRQEWFASDFTRGEKRIPVNGLIWMG
EAAFMQEQIEAKLAEGYGCLKLKIGAIDFDKECALLAGIRESFSPQQLEIRVDANGAFSPANAPQRLKRLSQFHLHSIEQ
PIRQHQWSEMAALCANSPLAIALDEELIGLGAEQRSAMLDAIRPQYIILKPSLLGGFHYAGQWIELARERGIGFWITSAL
ESNLGLAAIAQWTALYQPTMPQGLGTGQLYTNNLPSNLAVDGGLLGVS
>P29208 4.2.1.113~~~menC~~~o-succinylbenzoate synthase~~~COG1441
MRSAQVYRWQIPMDAGVVLRDRRLKTRDGLYVCLREGEREGWGEISPLPGFSQETWEEAQSVLLAWVNNWLAGDCELPQM
PSVAFGVSCALAELTDTLPQAANYRAAPLCNGDPDDLILKLADMPGEKVAKVKVGLYEAVRDGMVVNLLLEAIPDLHLRL
DANRAWTPLKGQQFAKYVNPDYRDRIAFLEEPCKTRDDSRAFARETGIAIAWDESLREPDFAFVAEEGVRAVVIKPTLTG
SLEKVREQVQAAHALGLTAVISSSIESSLGLTQLARIAAWLTPDTIPGLDTLDLMQAQQVRRWPGSTLPVVEVDALERLL
>Q838J7 4.2.1.113~~~menC~~~o-succinylbenzoate synthase~~~COG4948
MNIQSIETYQVRLPLKTPFVTSYGRLEEKAFDLFVITDEQGNQGFGELVAFEQPDYVQETLVTERFIIQQHLIPLLLTEA
IEQPQEVSTIFEEVKGHWMGKAALETAIWDLYAKRQQKSLTEFFGPTRRKIPVGISLGIQEDLPQLLKQVQLAVEKGYQR
VKLKIRPGYDVEPVALIRQHFPNLPLMVDANSAYTLADLPQLQRLDHYQLAMIEQPFAADDFLDHAQLQRELKTRICLDE
NIRSLKDCQVALALGSCRSINLKIPRVGGIHEALKIAAFCQENDLLVWLGGMFESGVGRALNLQFASQPTFSFPGDISAT
ERYFYEDIITEPFILEQGTMTVPQGLGIGVTLSQTNLLKYSQYQKIM
>Q5L1G9 4.2.1.113~~~menC~~~o-succinylbenzoate synthase~~~COG4948
MAINIEYVILRHLQMELKAPFTTSFGTFQTKELILVEAVDCDGVSGWGESVAFSAPWYSEETVKTNWHMLEEFLVPLLFS
KPLRHPAELPERFAAIRQNNMAKAALEGAVWDLYAKRLGVPLCQALGGTKKEIEVGVSIGIQPTVADLLQVIERYVAQGY
RRIKVKIKPGWDVDVIRDVRRAFPDVPLMADANSAYTFADAKRLQALDEFGLMMIEQPLAADDLVDHARLQPLLQTPICL
DESIRSYDDARKALDLGSCRIINIKIGRVGGLWEAKRIHDLCAERGVSVWCGGMLEAGVGRAHNIAITTLENFALPGDTA
ASSHYWERDIITPEVEVQGGLIRVPNAPGIGYEVDRRQVERYTQFAKVFHRTATA
>B1A612 4.2.1.113~~~menC~~~o-succinylbenzoate synthase~~~
MAINIEYVILRHLQMELKAPFTTSFGTFQTKEFILVEVVDCDGVSGWGESVAFSVPWYSEETVKTNWHMLEEFLVPLLFS
KPLRHPAELPERFAAIRQNNMAKAALEGAVWDLYAKRLGVPLCQALGGTKKEIEVGVSIGIQPTVDDLLQVIERYVAQGY
RRIKVKIKPGWDVDVIRDVRRAFPDVPLMADANSAYTLADAKRLQALDEFGLMMIEQPLAADDLVDHARLQPLLKTPICL
DESIRSYDDARKALDLGSCRIINIKIGRVGGLWEAKRIHDLCAERGVPVWCGGMLEAGVGRAHNIAITTLENFALPGDTA
ASSHYWERDIITPEVEVHNGLIRVPNAPGIGYDVDRRQVERYTQFAKLFHRTATA
>A0A0P0ZBS7 4.2.1.113~~~menC~~~o-succinylbenzoate synthase~~~
MAINIEYVILRHLQMELKAPFTTSFGTFQRKELILVEVVDRDGVSGWGESVAFSAPWYSEETVKTNWHMLEDFLVPLALA
EPIHHPEELSKRFSAIRQNNMAKAALEGAVWDLYAKRLGVPLSQALGGAKKDIEVGVSIGIQPTVADLLQVIERYVAQGY
RRIKVKIKPSWDVDVIREVRRVFPDVPLMADANSAYTLVDADRLKALDEFGLLMIEQPLAADDLVDHARLQPLLQTPICL
DESIRSYDDARKALDLGSCRIINIKIGRVGGLGEAKRIHDLCAERGAPVWCGGMLEAGVGRAHNIAITTLENFTLPGDTA
ASSHYWERDIITPEVEVHGGLIRVPDAPGIGYDVDRRQVERYTQFAKVFHRTATA
>Q927X3 4.2.1.113~~~menC~~~o-succinylbenzoate synthase~~~COG4948
MYFQKARLIHAELPLLAPFKTSYGELKSKDFYIIELINEEGIHGYGELEAFPLPDYTEETLSSAILIIKEQLLPLLAQRK
IRKPEEIQELFSWIQGNEMAKAAVELAVWDAFAKMEKRSLAKMIGATKESIKVGVSIGLQQNVETLLQLVNQYVDQGYER
VKLKIAPNKDIQFVEAVRKSFPKLSLMADANSAYNREDFLLLKELDQYDLEMIEQPFGTKDFVDHAWLQKQLKTRICLDE
NIRSVKDVEQAHSIGSCRAINLKLARVGGMSSALKIAEYCALNEILVWCGGMLEAGVGRAHNIALAARNEFVFPGDISAS
NRFFAEDIVTPAFELNQGRLKVPTNEGIGVTLDLKVLKKYTKSTEEILLNKGWS
>P9WJP3 4.2.1.113~~~menC~~~o-succinylbenzoate synthase~~~COG4948
MIPVLPPLEALLDRLYVVALPMRVRFRGITTREVALIEGPAGWGEFGAFVEYQSAQACAWLASAIETAYCAPPPVRRDRV
PINATVPAVAAAQVGEVLARFPGARTAKVKVAEPGQSLADDIERVNAVRELVPMVRVDANGGWGVAEAVAAAAALTADGP
LEYLEQPCATVAELAELRRRVDVPIAADESIRKAEDPLAVVRAQAADIAVLKVAPLGGISALLDIAARIAVPVVVSSALD
SAVGIAAGLTAAAALPELDHACGLGTGGLFEEDVAEPAAPVDGFLAVARTTPDPARLQALGAPPQRRQWWIDRVKACYSL
LVPSFG
>P58486 4.2.1.113~~~menC~~~o-succinylbenzoate synthase~~~
MRSAQVYRWQIPMDAGVVLRDRRLKTRDGLYVCLRDGEREGWGEISPLPGFSQETWEEAQTALLTWVNDWLQGSEGLPEM
PSVAFGASCALAELTGVLPEAADYRAAPLCTGDPDDLVLRLADMPGEKIAKVKVGLYEAVRDGMVVNLLLEAIPDLHLRL
DANRAWTPLKAQQFAKYVNPDYRARIAFLEEPCKTRDDSRAFARETGIAIAWDESLREADFTFEAEEGVRAVVIKPTLTG
SLDKVREQVAAAHALGLTAVISSSIESSLGLTQLARIAAWLTPGTLPGLDTLHLMQAQQIRPWPGSALPCLKREELERLL
>A0A0H2WWB5 4.2.1.113~~~menC~~~o-succinylbenzoate synthase~~~
MKLTALHFYKYSEPFKSQIVTPKVTLTHRDCLFIELIDDKGNAYFGECNAFQTDWYDHETIASVKHVIEQWFEDNRNKSF
ETYEAALKLVDSLENTPAARATIVMALYQMFHVLPSFSVAYGATASGLSNKQLESLKATKPTRIKLKWTPQIMHQIRVLR
ELDFHFQLVIDANESLDRQDFTQLQLLAREQVLYIEEPFKDISMLDEVADGTIPPIALDEKATSLLDIINLIELYNVKVV
VLKPFRLGGIDKVQTAIDTLKSHGAKVVIGGMYEYGLSRYFTAMLARKGDYPGDVTPAGYYFEQDVVAHSGILKEGRLEF
RPPLVDITQLQPY
>Q47Q21 4.2.1.113~~~menC~~~o-succinylbenzoate synthase~~~COG4948
MTGRAFAIPLRTRFRGITVREGMLVRGAAGWGEFSPFAEYGPRECARWWAACYEAAELGWPAPVRDTVPVNATVPAVGPE
EAARIVASSGCTTAKVKVAERGQSEANDVARVEAVRDALGPRGRVRIDVNGAWDVDTAVRMIRLLDRFELEYVEQPCATV
DELAEVRRRVSVPIAADESIRRAEDPLRVRDAEAADVVVLKVQPLGGVRAALRLAEECGLPVVVSSAVETSVGLAAGVAL
AAALPELPYACGLATLRLLHADVCDDPLLPVHGVLPVRRVDVSEQRLAEVEIDPAAWQARLAAARAAWEQVEREPGP
>Q8DJP8 4.2.1.113~~~menC~~~o-succinylbenzoate synthase~~~COG4948
MRWQWRIYEEPLQEPLTTAQGVWRSRSGIYLRLEDEQGQVGYGEIAPLPGWGSETLNADIALCQQLPGHLTPEIMATIPE
ALPAAQFGFATAWQSVGRLPYRVRPWPICALLGSGQAALEQWQQSWQRGQTTFKWKVGVMSPEEEQAILKALLAALPPGA
KLRLDANGSWDRATANRWFAWLDRHGNGKIEYVEQPLPPDQWQALLSLAQTVTTAIALDESVVSAAEVQRWVDRGWPGFF
VIKTALFGDPDSLSLLLRRGLEPQRLVFSSALEGAIARTAIFHLLETWQPCHALGFGVDRWRSAPLLTTLTAYERLWERL
DQ
>P23970 2.2.1.9~~~menD~~~2-succinyl-5-enolpyruvyl-6-hydroxy-3-cyclohexene-1-carboxylate synthase~~~COG1165
MTVNPITHYIGSFIDEFALSGITDAVVCPGSRSTPLAVLCAAHPDISVHVQIDERSAGFFALGLAKAKQRPVLLICTSGT
AAANFYPAVVEAHYSRVPIIVLTADRPHELREVGAPQAINQHFLFGNFVKFFTDSALPEESPQMLRYIRTLASRAAGEAQ
KRPMGPVHVNVPLREPLMPDLSDEPFGRMRTGRHVSVKTGTQSVDRESLSDVAEMLAEAEKGMIVCGELHSDADKENIIA
LSKALQYPILADPLSNLRNGVHDKSTVIDAYDSFLKDDELKRKLRPDVVIRFGPMPVSKPVFLWLKDDPTIQQIVIDEDG
GWRDPTQASAHMIHCNASVFAEEIMAGLTAATRSSEWLEKWQFVNGRFREHLQTISSEDVSFEGNLYRILQHLVPENSSL
FVGNSMPIRDVDTFFEKQDRPFRIYSNRGANGIDGVVSSAMGVCEGTKAPVTLVIGDLSFYHDLNGLLAAKKLGIPLTVI
LVNNDGGGIFSFLPQASEKTHFEDLFGTPTGLDFKHAAALYGGTYSCPASWDEFKTAYAPQADKPGLHLIEIKTDRQSRV
QLHRDMLNEAVREVKKQWEL
>P17109 2.2.1.9~~~menD~~~2-succinyl-5-enolpyruvyl-6-hydroxy-3-cyclohexene-1-carboxylate synthase~~~COG1165
MSVSAFNRRWAAVILEALTRHGVRHICIAPGSRSTPLTLAAAENSAFIHHTHFDERGLGHLALGLAKVSKQPVAVIVTSG
TAVANLYPALIEAGLTGEKLILLTADRPPELIDCGANQAIRQPGMFASHPTHSISLPRPTQDIPARWLVSTIDHALGTLH
AGGVHINCPFAEPLYGEMDDTGLSWQQRLGDWWQDDKPWLREAPRLESEKQRDWFFWRQKRGVVVAGRMSAEEGKKVALW
AQTLGWPLIGDVLSQTGQPLPCADLWLGNAKATSELQQAQIVVQLGSSLTGKRLLQWQASCEPEEYWIVDDIEGRLDPAH
HRGRRLIANIADWLELHPAEKRQPWCVEIPRLAEQAMQAVIARRDAFGEAQLAHRICDYLPEQGQLFVGNSLVVRLIDAL
SQLPAGYPVYSNRGASGIDGLLSTAAGVQRASGKPTLAIVGDLSALYDLNALALLRQVSAPLVLIVVNNNGGQIFSLLPT
PQSERERFYLMPQNVHFEHAAAMFELKYHRPQNWQELETAFADAWRTPTTTVIEMVVNDTDGAQTLQQLLAQVSHL
>Q71YZ2 2.2.1.9~~~menD~~~2-succinyl-5-enolpyruvyl-6-hydroxy-3-cyclohexene-1-carboxylate synthase~~~
MTNHEQVLTDYLAAFIEELVQAGVKEAIISPGSRSTPLALMMAEHPILKIYVDVDERSAGFFALGLAKASKRPVVLLCTS
GTAAANYFPAVAEANLSQIPLIVLTADRPHELRNVGAPQAMDQLHLYGSHVKDFTDMALPENSEEMLRYAKWHGSRAVDI
AMKTPRGPVHLNFPLREPLVPILEPSPFTATGKKHHHVHIYYTHEVLDDSSIQKMVTECTGKKGVFVVGPIDKKELEQPM
VDLAKKLGWPILADPLSGLRSYGALDEVVIDQYDAFLKEAEIIDKLTPEVVIRFGSMPVSKPLKNWLEQLSDIRFYVVDP
GAAWKDPIKAVTDMIHCDERFLLDIMQQNMPDDAKDAAWLNGWTSYNKVAREIVLAEMANTTILEEGKIVAELRRLLPDK
AGLFIGNSMPIRDVDTYFSQIDKKIKMLANRGANGIDGVVSSALGASVVFQPMFLLIGDLSFYHDMNGLLMAKKYKMNLT
IVIVNNDGGGIFSFLPQANEPKYFESLFGTSTELDFRFAAAFYDADYHEAKSVDELEEAIDKASYHKGLDIIEVKTNRHE
NKANHQALWVKIADALKALD
>P9WK11 2.2.1.9~~~menD~~~2-succinyl-5-enolpyruvyl-6-hydroxy-3-cyclohexene-1-carboxylate synthase~~~COG1165
MNPSTTQARVVVDELIRGGVRDVVLCPGSRNAPLAFALQDADRSGRIRLHVRIDERTAGYLAIGLAIGAGAPVCVAMTSG
TAVANLGPAVVEANYARVPLIVLSANRPYELLGTGANQTMEQLGYFGTQVRASISLGLAEDAPERTSALNATWRSATCRV
LAAATGARTANAGPVHFDIPLREPLVPDPEPLGAVTPPGRPAGKPWTYTPPVTFDQPLDIDLSVDTVVISGHGAGVHPNL
AALPTVAEPTAPRSGDNPLHPLALPLLRPQQVIMLGRPTLHRPVSVLLADAEVPVFALTTGPRWPDVSGNSQATGTRAVT
TGAPRPAWLDRCAAMNRHAIAAVREQLAAHPLTTGLHVAAAVSHALRPGDQLVLGASNPVRDVALAGLDTRGIRVRSNRG
VAGIDGTVSTAIGAALAYEGAHERTGSPDSPPRTIALIGDLTFVHDSSGLLIGPTEPIPRSLTIVVSNDNGGGIFELLEQ
GDPRFSDVSSRIFGTPHDVDVGALCRAYHVESRQIEVDELGPTLDQPGAGMRVLEVKADRSSLRQLHAAIKAAL
>Q2FZL7 2.2.1.9~~~menD~~~2-succinyl-5-enolpyruvyl-6-hydroxy-3-cyclohexene-1-carboxylate synthase~~~COG1165
MGNHKAALTKQVFTFASELYAYGVREVVISPGSRSTPLALAFEAHPNIKTWIHPDERSAAFFAVGLIKGSERPVAILCTS
GTAAANYTPAIAESQISRIPLIVLTSDRPHELRSVGAPQAINQVNMFNNYVSYEFDMPIADDSKETIDAIYYQMQIASQY
LYGPHKGPIHFNLPFRDPLTPDLNATELLTSEMKILPHYQKSIDASALRHILNKKKGLIIVGDMQHQEVDQILTYSTIYD
LPILADPLSHLRKFDHPNVICTYDLLFRSGLDLNVDFVIRVGKPVISKKLNQWLKKTDAFQILVQNNDKIDVFPIAPDIS
YEISANDFFRSLMEDTTVNRVSWLEKWQCLEKKGRKEIKCYLEQATDESAFVGELIKKTSEKDALFISNSMPIRDVDNLL
LNKNIDVYANRGANGIDGIVSTALGMAVHKRITLLIGDLSFYHDMNGLLMSKLNNIQMNIVLLNNDGGGIFSYLPQKESA
TDYFERLFGTPTGLDFEYTAKLYQFDFKRFNSVSEFKNATLLSETSTIYELITNREDNFKQHQILYQKLSEMIHDTL
>P23971 6.2.1.26~~~menE~~~2-succinylbenzoate--CoA ligase~~~COG0318
MLTEQPNWLMQRAQLTPERIALIYEDQTVTFAELFAASKRMAEQLAAHSVRKGDTAAILLQNRAEMVYAVHACFLLGVKA
VLLNTKLSTHERLFQLEDSGSGFLLTDSSFEKKEYEHIVQTIDVDELMKEAAEEIEIEAYMQMDATATLMYTSGTTGKPK
GVQQTFGNHYFSAVSSALNLGITEQDRWLIALPLFHISGLSALFKSVIYGMTVVLHQRFSVSDVLHSINRHEVTMISAVQ
TMLASLLEETNRCPESIRCILLGGGPAPLPLLEECREKGFPVFQSYGMTETCSQIVTLSPEFSMEKLGSAGKPLFSCEIK
IERDGQVCEPYEHGEIMVKGPNVMKSYFNRESANEASFQNGWLKTGDLGYLDNEGFLYVLDRRSDLIISGGENIYPAEVE
SVLLSHPAVAEAGVSGAEDKKWGKVPHAYLVLHKPVSAGELTDYCKERLAKYKIPAKFFVLDRLPRNASNKLLRNQLKDA
RKGELL
>P37353 6.2.1.26~~~menE~~~2-succinylbenzoate--CoA ligase~~~COG0318
MIFSDWPWRHWRQVRGETIALRLNDEQLNWRELCARVDELASGFAVQGVVEGSGVMLRAWNTPQTLLAWLALLQCGARVL
PVNPQLPQPLLEELLPNLTLQFALVPDGENTFPALTSLHIQLVEGAHAATWQPTRLCSMTLTSGSTGLPKAAVHTYQAHL
ASAQGVLSLIPFGDHDDWLLSLPLFHVSGQGIMWRWLYAGARMTVRDKQPLEQMLAGCTHASLVPTQLWRLLVNRSSVSL
KAVLLGGAAIPVELTEQAREQGIRCFCGYGLTEFASTVCAKEADGLADVGSPLPGREVKIVNNEVWLRAASMAEGYWRNG
QLVSLVNDEGWYATRDRGEMHNGKLTIVGRLDNLFFSGGEGIQPEEVERVIAAHPAVLQVFIVPVADKEFGHRPVAVMEY
DHESVDLSEWVKDKLARFQQPVRWLTLPPELKNGGIKISRQALKEWVQRQQ
>P9WQ39 6.2.1.26~~~menE~~~Probable 2-succinylbenzoate--CoA ligase~~~COG0318
MRALHVPAGSATALLLPALQRVLGGSDPALVAVPTQHESLLGALRVGEQIDDDVALVVTTSGTTGPPKGAMLTAAALTAS
ASAAHDRLGGPGSWLLAVPPYHIAGLAVLVRSVIAGSVPVELNVSAGFDVTELPNAIKRLGSGRRYTSLVAAQLAKALTD
PAATAALAELDAVLIGGGPAPRPILDAAAAAGITVVRTYGMSETSGGCVYDGVPLDGVRLRVLAGGRIAIGGATLAKGYR
NPVSPDPFAEPGWFHTDDLGALESGDSGVLTVLGRADEAISTGGFTVLPQPVEAALGTHPAVRDCAVFGLADDRLGQRVV
AAIVVGDGCPPPTLEALRAHVARTLDVTAAPRELHVVNVLPRRGIGKVDRAALVRRFAGEADQ
>P63526 6.2.1.26~~~menE~~~2-succinylbenzoate--CoA ligase~~~
MDFWLYKQAQQNGHHIAITDGQESYTYQNLYCEASLLAKRLKAYQQSRVGLYIDNSIQSIILIHACWLANIEIAMINTRL
TPNEMTNQMRSIDVQLIFCTLPLELRGFQIVSLDDIEFAGRDITTNGLLDNTMGIQYDTSNETVVPKESPSNILNTSFNL
DDIASIMFTSGTTGPQKAVPQTFRNHYASAIGCKESLGFDRDTNWLSVLPIYHISGLSVLLRAVIEGFTVRIVDKFNAEQ
ILTMIKNERITHISLVPQTLNWLMQQGLHEPYNLQKILLGGAKLSATMIETALQYNLPIYNSFGMTETCSQFLTATPEML
HARPDTVGMPSANVDVKIKNPNKEGHGELMIKGANVMNGYLYPTDLTGTFENGYFNTGDIAEIDHEGYVMIYDRRKDLII
SGGENIYPYQIETVAKQFPGISDAVCVGHPDDTWGQVPKLYFVSESDISKAQLIAYLSKHLAKYKVPKHFEKVDTLPYTS
TGKLQRNKLYRG
>P38051 5.4.4.2~~~menF~~~Isochorismate synthase MenF~~~COG1169
MQSLTTALENLLRHLSQEIPATPGIRVIDIPFPLKDAFDALSWLASQQTYPQFYWQQRNGDEEAVVLGAITRFTSLDQAQ
RFLRQHPEHADLRIWGLNAFDPSQGNLLLPRLEWRRCGGKATLRLTLFSESSLQHDAIQAKEFIATLVSIKPLPGLHLTT
TREQHWPDKTGWTQLIELATKTIAEGELDKVVLARATDLHFASPVNAAAMMAASRRLNLNCYHFYMAFDGENAFLGSSPE
RLWRRRDKALRTEALAGTVANNPDDKQAQQLGEWLMADDKNQRENMLVVEDICQRLQADTQTLDVLPPQVLRLRKVQHLR
RCIWTSLNKADDVICLHQLQPTAAVAGLPRDLARQFIARHEPFTREWYAGSAGYLSLQQSEFCVSLRSAKISGNVVRLYA
GAGIVRGSDPEQEWQEIDNKAAGLRTLLQME
>P9WFW9 5.4.4.2~~~menF~~~Putative isochorismate synthase MenF~~~COG1169
MSAHVATLHPEPPFALCGPRGTLIARGVRTRYCDVRAAQAALRSGTAPILLGALPFDVSRPAALMVPDGVLRARKLPDWP
TGPLPKVRVAAALPPPADYLTRIGRARDLLAAFDGPLHKVVLARAVQLTADAPLDARVLLRRLVVADPTAYGYLVDLTSA
GNDDTGAALVGASPELLVARSGNRVMCKPFAGSAPRAADPKLDAANAAALASSAKNRHEHQLVVDTMRVALEPLCEDLTI
PAQPQLNRTAAVWHLCTAITGRLRNISTTAIDLALALHPTPAVGGVPTKAATELIAELEGDRGFYAGAVGWCDGRGDGHW
VVSIRCAQLSADRRAALAHAGGGIVAESDPDDELEETTTKFATILTALGVEQ
>P31113 2.1.1.163~~~menG~~~Demethylmenaquinone methyltransferase~~~COG2226
MQDSKEQRVHGVFEKIYKNYDQMNSVISFQQHKKWRDKTMRIMNVKEGAKALDVCCGTADWTIALAKAAGKSGEIKGLDF
SENMLSVGEQKVKDGGFSQIELLHGNAMELPFDDDTFDYVTIGFGLRNVPDYLTVLKEMRRVVKPGGQVVCLETSQPEMF
GFRQAYFMYFKYIMPFFGKLFAKSYKEYSWLQESARDFPGMKELAGLFEEAGLKNVKYHSFTGGVAATHIGWK
>P9WFR3 2.1.1.163~~~menG~~~Demethylmenaquinone methyltransferase~~~COG2226
MSRAALDKDPRDVASMFDGVARKYDLTNTVLSLGQDRYWRRATRSALRIGPGQKVLDLAAGTAVSTVELTKSGAWCVAAD
FSVGMLAAGAARKVPKVAGDATRLPFGDDVFDAVTISFGLRNVANQQAALREMARVTRPGGRLLVCEFSTPTNALFATAY
KEYLMRALPRVARAVSSNPEAYEYLAESIRAWPDQAVLAHQISRAGWSGVRWRNLTGGIVALHAGYKPGKQTPQ
>P67062 2.1.1.163~~~menG~~~Demethylmenaquinone methyltransferase~~~
MADNKANKEQVHRVFQNISKKYDRLNNIISFEQHKVWRKRVMKDMGVRKGTKALDVCCGTGDWTIALSKAVGPTGEVTGI
DFSENMLEVGKEKTASMENVKLVHGDAMELPFEDNSFDYVTIGFGLRNVPDYLVALKEMNRVLKPGGMVVCLETSQPTLP
VFKQMYALYFKFVMPIFGKLFAKSKEEYEWLQQSTFNFPGKEELKRMFEEAGFINVRVRSFTGGVAAMHLGYKEKDNTKG
D
>P37355 4.2.99.20~~~menH~~~2-succinyl-6-hydroxy-2,4-cyclohexadiene-1-carboxylate synthase~~~COG0596
MILHAQAKHGKPGLPWLVFLHGFSGDCHEWQEVGEAFADYSRLYVDLPGHGGSAAISVDGFDDVTDLLRKTLVSYNILDF
WLVGYSLGGRVAMMAACQGLAGLCGVIVEGGHPGLQNAEQRAERQRSDRQWVQRFLTEPLTAVFADWYQQPVFASLNDDQ
RRELVALRSNNNGATLAAMLEATSLAVQPDLRANLSARTFAFYYLCGERDSKFRALAAELAADCHVIPRAGHNAHRENPA
GVIASLAQILRF
>P77781 3.1.2.28~~~menI~~~1,4-dihydroxy-2-naphthoyl-CoA hydrolase~~~COG2050
MIWKRKITLEALNAMGEGNMVGFLDIRFEHIGDDTLEATMPVDSRTKQPFGLLHGGASVVLAESIGSVAGYLCTEGEQKV
VGLEINANHVRSAREGRVRGVCKPLHLGSRHQVWQIEIFDEKGRLCCSSRLTTAIL
>A0A0H3GEM5 3.1.2.28~~~menI~~~1,4-dihydroxy-2-naphthoyl-CoA hydrolase~~~
MGLQETIGIEIVSVEKGKAVVQLEVTEKVHQPFGYLHGGVSVVLAEHAASIGAAKSIEPDEIVFGLEINANHLASKQAGL
VTATAEAIHIGKSTQVWEIKITDETEKLLCISRCTIAVKKKRK
>A0QRI8 1.3.99.38~~~menJ~~~Menaquinone reductase~~~COG0644
MNTRADVVVVGAGPAGSAAAAWAARAGRDVVVVDAAQFPRDKACGDGLTPRAVAELQRLGMASWLDTRIRHHGLRMSGFG
ADVEIPWPGPSFPATSSAVPRTELDDRIRSVAADDGAKMMLGTKVVDVTHDSSGRVDAVVLDDGNTVGCAQLIVADGARS
TLGRVLGRTWHRETVYGVAIRGYIASPRASEPWITSHLELRSPEGKVLPGYGWMFPLGNGEVNIGVGALATAKRPADAAL
RPLMSYYADLRREEWGLVGEPRAGLSALLPMGGAVSGVAGPNWMLIGDAAACVNPLNGEGIDYGLETGRLAAELMTSGGV
TDYSSAWPTLLQEHYARGFSVARRLALLLTLPRFLQVTGPVAMRSATLMTIAVRVMGNLVTDEDADWIARVWRTAGLASR
RIDQRVPFS
>P9WNY9 1.3.99.38~~~menJ~~~Menaquinone reductase~~~COG0644
MSVDDSADVVVVGAGPAGSAAAAWAARAGRDVLVIDTATFPRDKPCGDGLTPRAVAELHQLGLGKWLADHIRHRGLRMSG
FGGEVEVDWPGPSFPSYGSAVARLELDDRIRKVAEDTGARMLLGAKAVAVHHDSSRRVVSLTLADGTEVGCRQLIVADGA
RSPLGRKLGRRWHRETVYGVAVRGYLSTAYSDDPWLTSHLELRSPDGAVLPGYGWIFPLGNGEVNIGVGALSTSRRPADL
ALRPLISYYTDLRRDEWGFTGQPRAVSSALLPMGGAVSGVAGSNWMLIGDAAACVNPLNGEGIDYGLETGRLAAELLDSR
DLARLWPSLLADRYGRGFSVARRLALLLTFPRFLPTTGPITMRSTALMNIAVRVMSNLVTDDDRDWVARVWRGGGQLSRL
VDRRPPFS
>Q9I060 3.4.24.33~~~~~~Peptidyl-Asp metalloendopeptidase~~~
MKKSLLCSTLALAVASAAQAAPKTVDIMVLYTPAATQTANGRDIDARIASYIEFANTAYEKSGVNLRLRLVHKQRLDWAD
YPTVTGANLDRFMRDPQVQRLREQYGADLVSLVNRSQNSGNGYITCGIGYMGSGDKNSGRFHGNAKDIAYNLTGVDCGLN
TFAHEAGHNMGLRHSYEQDLESSYYDPRYAHSGTYEWSRGYGVQGRFATVMAYPHAFGTNKQAPFFANPRLVNAECANQP
CGREEHADAVRALNSMATQIADFRPTKVPGTVNPGSGGDTPTPPDLPWCTKAKLGGLLGDGEFASMEGWRAWSGNAQLSL
VNVAKGCRDNALLVDVRGFDLLVRPIAPLRAGSGYRLSGKVMLKAANTRETVRMALLSERADGALAYNPAQSVELSVSGN
EFSRLEKTFDYRPAADQRNLYVAVWSDSGASLLVDEMNLQEAQAAPPSVPPAPKRIAYDFESGIGGWSGVHASARATRVA
SAGRLALEAYQRRYAGTGASTSLLGNLEAGRTYAFSADVRVGDGRGSQAMTYAYLYLESQGRPGEYLPLGYKVVENGRWA
SLRGQVQLPKGPIKRAELMILSGNQQESMFIDNVQLLQK
>P0C0T5 3.4.24.-~~~mepA~~~Penicillin-insensitive murein endopeptidase~~~COG3770
MNKTAIALLALLASSASLAATPWQKITQPVPGSAQSIGSFSNGCIVGADTLPIQSEHYQVMRTDQRRYFGHPDLVMFIQR
LSSQVSNLGMGTVLIGDMGMPAGGRFNGGHASHQTGLDVDIFLQLPKTRWTSAQLLRPQALDLVSRDGKHVVSTLWKPEI
FSLIKLAAQDKDVTRIFVNPAIKQQLCLDAGTDRDWLRKVRPWFQHRAHMHVRLRCPADSLECEDQPLPPSGDGCGAELQ
SWFEPPKPGTTKPEKKTPPPLPPSCQALLDEHVI
>P0C069 ~~~mepA~~~Multidrug/solvent efflux pump periplasmic linker protein MepA~~~COG0845
MQFKPAVTALVSAVALATLLSGCKKEEAAPAAQAPQVGVVTIQPQAFTLTSELPGRTSAYRVAEVRPQVNGIILKRLFKE
GSEVKEGQQLYQIDPAVYEATLANAKANLLATRSLAERYKQLIDEQAVSKQEYDDANAKRLQAEASLKSAQIDLRYTKVL
APISGRIGRSSFTEGALVSNGQTDAMATIQQLDPIYVDVTQSTAELLKLRRDLESGQLQKAGDNAASVQLVLEDGSLFKQ
EGRLEFSEVAVDETTGSVTLRALFPNPDHTLLPGMFVHARLKAGVNANAILAPQQGVTRDLKGAPTALVVNQENKVELRQ
LKASRTLGSDWLIEEGLNPGDRLITEGLQYVRPGVEVKVSDATNVKKPAGPDQANAAKADAKAE
>Q2G140 ~~~mepA~~~Multidrug export protein MepA~~~COG0534
MKDEQLYYFEKSPVFKAMMHFSLPMMIGTLLSVIYGILNIYFIGFLEDSHMISAISLTLPVFAILMGLGNLFGVGAGTYI
SRLLGAKDYSKSKFVSSFSIYGGIALGLIVILVTLPFSDQIAAILGARGETLALTSNYLKVMFLSAPFVILFFILEQFAR
AIGAPMVSMIGMLASVGLNIILDPILIFGFDLNVVGAALGTAISNVAAALFFIIYFMKNSDVVSVNIKLAKPNKEMLSEI
FKIGIPAFLMSILMGFTGLVLNLFLAHYGNFAIASYGISFRLVQFPELIIMGLCEGVVPLIAYNFMANKGRMKDVIKAVI
MSIGVIFVVCMSAVFTIGHHMVGLFTTDQAIVEMATFILKVTMASLLLNGIGFLFTGMLQATGQGRGATIMAILQGAIII
PVLFIMNALFGLTGVIWSLLIAESLCALAAMLIVYLLRDRLTVDTSELIEG
>Q7A7N0 ~~~mepA~~~Multidrug export protein MepA~~~
MKDEQLYYFEKSPVFKAMMHFSLPMMIGTLLSVIYGILNIYFIGFLEDSHMISAISLTLPVFAILMGLGNLFGVGAGTYI
SRLLGAKDYSKSKFVSSFSIYGGIALGLIVILVTLPFSDQIAAILGARGETLALTSNYLKVMFLSAPFVILFFILEQFAR
AIGAPMISMIGMLASVGLNIILDPILIFGFDLNVVGAALGTAISNVAAALFFIIYFMKNSDVVSVNIKLAKPNKEMLSEI
FKIGIPAFLMSILMGFTGLVLNLFLAHYGNFAIASYGISFRLVQFPELIIMGLCEGVVPLIAYNFMANKGRMKDVIKAVI
MSIGVIFVVCMIAVFTIGHHMVGLFTTDQAIVEMATFILKVTMASLLLNGIGFLFTGMLQATGQGRGATIMAILQGAIII
PVLFIMNALFGLTGVIWSLLIAESLCALAAMLIVYLLRDRLTVDTSELIEG
>P0C070 ~~~mepB~~~Multidrug/solvent efflux pump membrane transporter MepB~~~
MSKFFIDRPIFAWVIALVIMLVGALSILKLPINQYPSIAPPAIAIAVTYPGASAQTVQDTVVQVIEQQLNGIDNLRYVSS
ESNSDGSMTITATFEQGTNPDTAQVQVQNKLNLATPLLPQEVQQQGIRVTKAVKNFLLVIGLVSEDGSMTKDDLANYIVS
NMQDPISRTAGVGDFQVFGAQYAMRIWLDPAKLNKFQLTPVDVKTAVAAQNVQVSSGQLGGLPALPGTQLNATIIGKTRL
QTAEQFESILLKVNKDGSQVRLGDVAQVGLGGENYAVSAQFNGKPASGLAVKLATGANALDTAKALRETIKGLEPFFPPG
VKAVFPYDTTPVVTESISGVIHTLIEAVVLVFLVMYLFLQNFRATIITTMTVPVVLLGTFGILAAAGFSINTLTMFAMVL
AIGLLVDDAIVVVENVERVMSEEGLPPKEATKRSMEQIQGALVGIALVLSAVLLPMAFFGGSTGVIYRQFSITIVSAMGL
SVLVALIFTPALCATMLKPLKKGEHHTAKGGFFGWFNRNFDRSVNGYERSVGAILRNKVPFLLAYALIVVGMIWLFARIP
TAFLPEEDQGVLFAQVQTPAGSSAERTQVVVDQMREYLLKDEADTVSSVFTVNGFNFAGRGQSSGMAFIMLKPWDERSKE
NSVFALAQRAQQHFFTFRDAMVFAFAPPAVLELGNATGFDVFLQDRGGVGHEKLMEARNQFLAKAAQSKILSAVRPNGLN
DEPQYQLTIDDERASALGVTIADINNTLSIALGASYVNDFIDRGRVKKVYIQGEPSARMSPEDLQKWYVRNGAGEMVPFS
SFAKGEWTYGSPKLSRYNGVEAMEILGAPAPGYSTGEAMAEVERIAGELPSGIGFSWTGMSYEEKLSGSQMPALFALSVL
FVFLCLAALYESWSIPIAVVLVVPLGIIGALIATSLRGLSNDVYFLVGLLTTIGLAAKNAILIVEFAKELHEQGRSLYDA
AIEACRMRLRPIIMTSLAFILGVVPLTIASGAGAGSQHAIGTGVIGGMISATVLAIFWVPLFFVAVSSLFGSKEPEKDVT
PENPRYEAGQ
>P0C071 ~~~mepC~~~Multidrug/solvent efflux pump outer membrane protein MepC~~~
MTKSLLSLAVTAFILGGCSLIPDYQTPEAPVAAQWPQGPAYSPTQSADVAAAEQGWRQFFHDPALQQLIQTSLVNNRDLR
VAALNLDAYRAQYRIQRADLFPAVSATGSGSRQRVPANMSQTGESGITSQYSATLGVSAYELDLFGRVRSLTEQALETYL
SSEQARRSTQIALVASVANAYYTWQADQALFKLTEETLKTYEESYNLTRRSNEVGVASALDVSQARTAVEGARVKYSQYQ
RLVAQDVNSLTVLLGTGIPADLAKPLELDADQLAEVPAGLPSDILQRRPDIQEAEHLLKAANANIGAARAAFFPSISLTA
NAGSLSPDMGHLFSGGQGTWLFQPQINLPIFNAGSLKASLDYSKIQKDINVAKYEKTIQTAFQEVSDGLAARKTFEEQLQ
AQRDLVQANQDYYRLAERRYRIGIDSNLTFLDAQRNLFSAQQALIGDRLSQLTSEVNLYKALGGGWYEQTGQANQQASVE
TPKG
>P76190 3.4.-.-~~~mepH~~~Murein DD-endopeptidase MepH~~~COG0791
MARINRISITLCALLFTTLPLTPMAHASKQARESSATTHITKKADKKKSTATTKKTQKTAKKAASKSTTKSKTASSVKKS
SITASKNAKTRSKHAVNKTASASFTEKCTKRKGYKSHCVKVKNAASGTLADAHKAKVQKATKVAMNKLMQQIGKPYRWGG
SSPRTGFDCSGLVYYAYKDLVKIRIPRTANEMYHLRDAAPIERSELKNGDLVFFRTQGRGTADHVGVYVGNGKFIQSPRT
GQEIQITSLSEDYWQRHYVGARRVMTPKTLR
>P0AFS9 3.4.24.-~~~mepM~~~Murein DD-endopeptidase MepM~~~COG0739
MQQIARSVALAFNNLPRPHRVMLGSLTVLTLAVAVWRPYVYHRDATPIVKTIELEQNEIRSLLPEASEPIDQAAQEDEAI
PQDELDDKIAGEAGVHEYVVSTGDTLSSILNQYGIDMGDITQLAAADKELRNLKIGQQLSWTLTADGELQRLTWEVSRRE
TRTYDRTAANGFKMTSEMQQGEWVNNLLKGTVGGSFVASARNAGLTSAEVSAVIKAMQWQMDFRKLKKGDEFAVLMSREM
LDGKREQSQLLGVRLRSEGKDYYAIRAEDGKFYDRNGTGLAKGFLRFPTAKQFRISSNFNPRRTNPVTGRVAPHRGVDFA
MPQGTPVLSVGDGEVVVAKRSGAAGYYVAIRHGRSYTTRYMHLRKILVKPGQKVKRGDRIALSGNTGRSTGPHLHYEVWI
NQQAVNPLTAKLPRTEGLTGSDRREFLAQAKEIVPQLRFD
>O05318 ~~~~~~Multidrug efflux system permease protein Rv1217c~~~COG3559
MSSTVIDRARPAGHRAPHRGSGFTGTLGLLRLYLRRDRVSLPLWVLLLSVPLATVYIASVETVYPDRSARAAAAAAIMAS
PAQRALYGPVYNDSLGAVGIWKAGMFHTLIAVAVILTVIRHTRADEESGRAELIDSTVVGRYANLTGALLLSFGASIATG
AIGALGLLATDVAPAGSVAFGVALAASGMVFTAVAAVAAQLSPSARFTRAVAFAVLGTAFALRAIGDAGSGTLSWCSPLG
WSLQVRPYAGERWWVLLLSLATAAVLTVLAYRLRAGRDVGAGLIAERPGAGTAGPMLSEPFGLAWRLNRGSLLLWTVGLC
LYGLVMGSVVHGIGDQLGDNTAVRDIVTRMGGTGALEQAFLALAFTMIGMVAAAFAVSLTLRLHQEETGLRAETLLAGAV
SRTHWLASHLAMALAGSAVATLISGVAAGLAYGMTVGDVGGKLPTVVGTAAVQLPAVWLLSAVTVGLFGLAPRFTPVAWG
VLVGFIALYLLGSLAGFPQMLLNLEPFAHIPRVGGGDFTAVPLLWLLAIDAALITLGAMAFRRRDVRC
>P0AFV4 3.4.-.-~~~mepS~~~Murein DD-endopeptidase MepS/Murein LD-carboxypeptidase~~~COG0791
MVKSQPILRYILRGIPAIAVAVLLSACSANNTAKNMHPETRAVGSETSSLQASQDEFENLVRNVDVKSRIMDQYADWKGV
RYRLGGSTKKGIDCSGFVQRTFREQFGLELPRSTYEQQEMGKSVSRSNLRTGDLVLFRAGSTGRHVGIYIGNNQFVHAST
SSGVIISSMNEPYWKKRYNEARRVLSRS
>P17239 1.16.1.1~~~merA~~~Mercuric reductase~~~
MTENAPTELAITGMTCDGCAAHVRKALEGVPGVREAQVSYPDATARVVLEGEVPMQRLIKAVVASGYGVHPRSDGASSTN
DGQELHIAVIGSGGAAMACALKAVERGARVTLIERSTIGGTCVNIGCVPSKIMIRAAHIAHLRRESPFDGGIQAVAPTIQ
RTALLVQQQARVDELRHAKYEGILDGNPAITVLRGEARFKDSRSVVVHLNDGGERVVMFDRCLVATGASPAVPPIPGLKD
TPYWTSTEGLVSESIPERLAVIGSSVVALELAQAFARLGSHVTILARGTLFLREDPAIGEAITAAFRAEGIEVLEHTQAS
QVAYADGEFVLATGHGELRADKLLVATGRAPNTRRLNLEAAGVAINAQGAIVIDQGMRTNSPNIYAAGDCTDQPQFVYVA
AAAGTRAAINMMGGSAALDLTAMPAVVFTDPQVATVGYSGAEAHRDGIETDSRTLTLDNVPRALANFNTRGFIKLVAEVG
SGRLIGVQVVAPEAGELIQTAALAIRNRMTVQELADQLFPYLTMVEGLKLAAQTFTRDVKQLSCCAG
>P16171 1.16.1.1~~~merA~~~Mercuric reductase~~~
MKKYRVNVQGMTCSGCEQHVAVALENMGAKAIEVDFRRGEAVFELPDDVKVEDAKNAIADANYHPGEAEEFQSEQKTNLL
KKYRLNVEGMTCTGCEEHIAVALENAGAKGIEVDFRRGEALFELPYDVDIDIAKTAITDAQYQPGEAEEIQVQSEKRTDV
SLNDEGNYDYDYIIIGSGGAAFSSAIEAVALNAKVAMIERGTVGGTCVNVGCVPSKTLLRAGEINHLAKNNPFVGLHTSA
SNVDLAPLVKQKNDLVTEMRNEKYVNLIDDYGFELIKGESKFVNENTVEVNGNQITAKRFLIATGASSTAPNIPGLDEVD
YLTSTSLLELKKVPNRLTVIGSGYIGMELGQLFHNLGSEVTLIQRSERLLKEYDPEISEAITKALTEQGINLVTGATYER
VEQDGDIKKVHVEINGKKRIIEAEQLLIATGRKPIQTSLNLHAAGVEVGSRGEIVIDDYLKTTNSRIYSAGDVTPGPQFV
YVAAYEGGLAARNAIGGLNQKVNLEVVPGVTFTSPSIATVGLTEQQAKEKGYEVKTSVLPLDAVPRALVNRETTGVFKLV
ADAKTLKVLGAHVVAENAGDVIYAATLAVKFGLTVGDLRETMAPYLTMAEGLKLAVLTFDKDVSKLSCCAG
>P00392 1.16.1.1~~~merA~~~Mercuric reductase~~~
MTHLKITGMTCDSCAAHVKEALEKVPGVQSALVSYPKGTAQLAIVPGTSPDALTAAVAGLGYKATLADAPLADNRVGLLD
KVRGWMAAAEKHSGNEPPVQVAVIGSGGAAMAAALKAVEQGAQVTLIERGTIGGTCVNVGCVPSKIMIRAAHIAHLRRES
PFDGGIAATVPTIDRSKLLAQQQARVDELRHAKYEGILGGNPAITVVHGEARFKDDQSLTVRLNEGGERVVMFDRCLVAT
GASPAVPPIPGLKESPYWTSTEALASDTIPERLAVIGSSVVALELAQAFARLGSKVTVLARNTLFFREDPAIGEAVTAAF
RAEGIEVLEHTQASQVAHMDGEFVLTTTHGELRADKLLVATGRTPNTRSLALDAAGVTVNAQGAIVIDQGMRTSNPNIYA
AGDCTDQPQFVYVAAAAGTRAAINMTGGDAALDLTAMPAVVFTDPQVATVGYSEAEAHHDGIETDSRTLTLDNVPRALAN
FDTRGFIKLVIEEGSHRLIGVQAVAPEAGELIQTAALAIRNRMTVQELADQLFPYLTMVEGLKLAAQTFNKDVKQLSCCA
G
>P30341 1.16.1.1~~~merA~~~Mercuric reductase~~~
MLQAHTGYDLAIIGSGAGAFAAAIAARNKGRSVVMVERGTTGGTCVNVGCVPSKALLAAAEARHGAQAASRFPGIQATEP
ALDFPALISGKDTLVGQLRAEKYTDLAAEYGWQIVHGTATFADGPMLEVALNDGGTATVEAAHYLIATGSAPTAPHIDGL
DQVDYLTSTTAMELQQLPEHLLILGGGYVGLEQAQLFARLGSRVTLAVRSRLASREEPEISAGIENIFREEGITVHTRTQ
LRAVRRDGEGILATLTGPDGDQQVRASHLLIATGRRSVTNGLGLERVGVKTGERGEVVVDEYLRTDNPRIWAAGDVTCHP
DFVYVAAAHGTLVADNALDGAERTLDYTALPKVTFTSPAIASVGLTEAQLTEAGIAHQTRTLSLENVPRALVNRDTRGLV
KLIAERGTGKLLAAHVLAEGAGDVITAATYAITAGLTVDQLARTWHPYLTMAEALKLAAQTFTSDVAKLSCCAG
>P77072 4.99.1.2~~~merB~~~Alkylmercury lyase~~~
MKLAPYILELLTSVNRTNGTADLLVPLLRELAKGRPVSRTTLAGILDWPAERVAAVLEQATSTEYDKDGNIIGYGLTLRE
TSYVFEIDDRRLYAWCALDTLIFPALIGRTARVSSHCAATGAPVSLTVSPSEIQAVEPAGMAVSLVLPQEAADVRQSFCC
HVHFFASVPTAEDWASKHQGLEGLAIVSVHEAFGLGQEFNRHLLQTMSSRTP
>P22905 ~~~merC~~~Mercuric transport protein MerC~~~
MSAITRIIDKIGIVGTIVGSFSCAMCFPAAASLGAAIGLGFLSQWEGLFVQWLIPIFASVALLATLAGWFSHRQWQRTLL
GSIGPVLALVGVFGLTHHFLDKDLARVIFYTGLVVMFLVSIWDMVNPANRRCATDGCETPAPRS
>Q50919 ~~~merC~~~Mercuric transport protein MerC~~~
MGLMTRIADKTGALGSVVSAMGCAACFPALASFGAAIGLGFLSQYEGLFISRLLPLFAALAFLANALGWFSHRQWLRSLL
GMIGPAIVFAATVWLLGNWWTANLMYVGLALMIGVSIWDFVSPAHRRCGPDGCELPAKRL
>P13113 ~~~merP~~~Mercuric transport protein periplasmic component~~~
MKKLFASLAIAAVVAPVWAATQTVTLSVPGMTCSACPITVKKAISKVEGVSKVNVTFETREAVVTFDDAKTSVQKLTKAT
EDAGYPSSVKK
>P04129 ~~~merP~~~Mercuric transport protein periplasmic component~~~
MKKLFASLALAAAVAPVWAATQTVTLAVPGMTCAACPITVKKALSKVEGVSKVDVGFEKREAVVTFDDTKASVQKLTKAT
ADAGYPSSVKQ
>P22853 ~~~merR1~~~Mercuric resistance operon regulatory protein~~~
MKFRIGELADKCGVNKETIRYYERLGLIPEPERTEKGYRMYSQQTVDRLHFIKRMQELGFTLNEIDKLLGVVDRDEAKCR
DMYDFTILKIEDIQRKIEDLKRIERMLMDLKERCPENKDIYECPIIETLMKK
>P0A183 ~~~merR~~~Mercuric resistance operon regulatory protein~~~COG0789
MENNLENLTIGVFAKAAGVNVETIRFYQRKGLLLEPDKPYGSIRRYGEADVTRVRFVKSAQRLGFSLDEIAELLRLEDGT
HCEEASSLAEHKLKDVREKMADLARMEAVLSELVCACHARRGNVSCPLIASLQGGASLAGSAMP
>P04140 ~~~merT~~~Mercuric transport protein MerT~~~COG2608
MSEPKTGRGALFTGGLAAILASACCLGPLVLIALGFSGAWIGNLAVLEPYRPIFIGVALVALFFAWRRIYRQAAACKPGE
VCAIPQVRATYKLIFWIVAALVLVALGFPYVMPFFY
>K5B7F3 2.1.1.365~~~meT1~~~MMP 1-O-methyltransferase~~~COG4122
MTDIRDTDALFALADRVTGFMPADEGRTLYETAVRYLGDGVGVEIGTYCGKSTVLLGAAARQTGGVVFTVDHHHGSEEHQ
PGWEYHDPSLVDPVTGLFDTLPRLRHTLDEADLYDHVVAVVGKSAVVARGWRTPLRFLFIDGGHTEEAAQRDFDGWARWV
EVGGALVIHDVFPDPKDGGQAPFHIYQRALNTGDFREVNAYGSMRVLERTSGIAGQPL
>E3H7X6 2.3.1.31~~~metAA1~~~Homoserine O-acetyltransferase 1~~~COG1897
MPIVIPKKLPAFDTLKGENIFVMNKSRAFSQDIRPLKIVILNLMPNKIVTETQLLRLLGNTPLQIEITLLKTRTYASKNT
SQDHLTSFYKTFEDIKNHTFDGLIITGAPIEHLQFEDVDYWEELKEVMEFSKSNVTSTMHICWGSQAGLYYHYGIPKFPT
DKKIFGIFKHKIFNLKTKITRGFDDEFLVPHSRHTTVMRGDIENVPELEILAESEDAGICLVATRDRKHIFISGHLEYEK
DTLKSEYFRDLDKGRSIDIPKNYFKDDNPENDPVVTWRAHAHLLFSNWLNYCVYQETPYILK
>E3HDJ8 2.3.1.31~~~metAA2~~~Homoserine O-acetyltransferase 2~~~
MCCIAGAPSLSGRERAEKEGIRFKENKGELKIGIINLMPFKEEVEYQFYAVLGRFDISVEVEFLYPENHVFKNTDGSYIK
DNYYPLGELNNRNYDAIIMTGAPVELLDFQKVNYWDEIKNLIKSNKLPALYICWGAQAALYVKYGIEKFTLNEKLLGIFR
HRTNKNPFVSGEFWAPHSRNTQNSSKDIKNAGLRILAESDEAGVYMASDRDYREFYISGHGEYQRERLKYEYSRDQNLFP
KNYFPEDDPKKEPPMKWDSHRKEFYYKWLSHIREKKFSNISDKR
>Q7CWE8 2.3.1.31~~~metAA~~~Homoserine O-acetyltransferase~~~COG1897
MPIKIPDTLPAFETLVHEGVRVMTETAAIRQDIRPLQIGLLNLMPNKIKTEIQMARLVGASPLQVELSLIRIGGHRAKNT
PEEHLLSFYQTWEEVRHRKFDGFIITGAPIELLDYEDVTYWNEMQQIFEWTQTNVHSTLNVCWGAMAAIYHFHGVPKYEL
KEKAFGVYRHRNLSPSSIYLNGFSDDFQVPVSRWTEVRRADIEKHPELEILMESDEMGVCLAHEKAGNRLYMFNHVEYDS
TSLADEYFRDVNSGVPIKLPHDYFPHNDPELAPLNRWRSHAHLFFGNWINEIYQTTPYDPQAIGKLAA
>A0A1D3PDD4 2.3.1.31~~~metAA~~~Homoserine O-acetyltransferase~~~
MRERVAMPIKIPDHLPAKEILLKENIFIMDESRAYTQDIRPLKICILNLMPTKQETETQLLRLLGNTPLQVDVSLLHPST
HSPRNTSKEHLNLFYKTIDEVKQQKFDGMIITGAPVETLPFHDVNYWNEMTSILDWTTTNVTSTLHICWGAQAGLYHHYG
IKKKPLTTKLFGVYSHKLEVKNVNLLRGFDDVFYAPHSRHTTVSREDIERVDELIVLSSSEEAGVYIASSKDGKRVFVMG
HSEYDAHTLKQEYERDVKRGIACDPPFNYFPEGNVDALPPLQWRAHSNLLFSNWLNYYVYQETPYHLDD
>Q72X44 2.3.1.31~~~metAA~~~Homoserine O-acetyltransferase~~~
MPIIIDKDLPARKVLQEENIFVMTKERAETQDIRALKIAILNLMPTKQETEAQLLRLIGNTPLQLDVHLLHMESHLSRNV
AQEHLTSFYKTFRDIENEKFDGLIITGAPVETLSFEEVDYWEELKRIMEYSKTNVTSTLHICWGAQAGLYHHYGVQKYPL
KEKMFGVFEHEVREQHVKLLQGFDELFFAPHSRHTEVRESDIREVKELTLLANSEEAGVHLVIGQEGRQVFALGHSEYSC
DTLKQEYERDRDKGLNIDVPKNYFKHDNPNEKPLVRWRSHGNLLFSNWLNYYVYQETPYVL
>Q814M3 2.3.1.31~~~metAA~~~Homoserine O-acetyltransferase~~~
MPIIIDKDLPARKVLQKENIFVMTKERAETQDIRALKIAILNLMPTKQDTEAQLLRLIGNTPLQLDVHLLHMESHLSRNV
TQEHLTSFYKTFRDIENEKFDGLIITGAPVETLAFEEVDYWEELKHIMEYSKTNVTSTLHICWGAQAGLYYHYGVPKYPL
KEKMFGVFEHEVCEQHVKLLQGFDELFFAPHSRHTEVRENDIREVKELTLLANSEEAGVHLVIGPEGRQVFALGHSEYSC
ETLKQEYERDRDKGLNIDVPKNYFKHNNPDEKPLVRWRSHGNLLFSNWLNYYVYQETPYIL
>Q5LHS7 2.3.1.31~~~metAA~~~Homoserine O-acetyltransferase~~~COG1897
MPLNLPDKLPAIELLKEENIFVIDNSRATQQDIRPLRIVILNLMPLKITTETDLVRLLSNTPLQVEISFMKIKSHTSKNT
PIEHMKTFYTDFDKMREDRYDGMIITGAPVEQMEFEEVNYWDEITEIFDWARTHVTSTLYICWAAQAGLYHHYGIPKYAL
DKKMFGIFKHRTLLPLHPIFRGFDDEFYVPHSRHTEVRKEDILKVPELTLLSESDDSGVYMVVARGGREFFVTGHSEYSP
LTLDTEYRRDVSKGLPIEIPRNYYVNDDPDKGPLVRWRGHANLLFSNWLNYFVYQETPYNIEDIR
>A1A3D2 2.3.1.31~~~metAA~~~Homoserine O-acetyltransferase~~~
MPIKIPSGLPARDILDSERIFALEKPEAERQRVRPLKLVILNLMPKKIETETQLLRLISKSPLQVEVDFMKTSTHEATHV
SADHLVKFYETLDAFKDNYYDGLVVTGAPVEHLDFEQVDYWDEFKQILDWASTHVFSTMYLCWGAMGALNYRYGVRKELL
PEKLFGVFPQYLQDEYCFLTNGFDEICLQPHSRLAGVNEGDIAHNPELQVLTWGPKSGPGLIATRDFSEVFALGHWEYGK
YTLAEEYERDMKKGMTNVPFPENYFPHDDPKLEPLFAWRAHANLLWRNWLNWVYQTTPYDLSEVPQLREEKRLGTDRSIR
HEPGSPRVDAFTPFSHDGYGVIRG
>E2NPN0 2.3.1.31~~~metAA~~~Homoserine O-acetyltransferase~~~
MPINIPDDLPAYKVLTAENIFVMNQTRATTQRIRPLKILIVNIMPLKITTETQLLRLLSNTPLQLEVELIHMSTHDSKNV
PKEHLLTFYKTFKDIKENSYDGMIITGAPVEQMPFEEVDYWNELSEIFEWAKTHVFSSFFICWASQAALHYYYDIDKYLL
KHKLTGVYRHHTNQRKMRRKILRGFDYQFYAPHSRYTTVLKEDISSNPNLDILAESDDAGVYLVASKDGSQFFVTGHPEY
DPDTLDKEYKRDKEKPGVIAELPKNYYLDDDPSQEIQVKWRSHAYLLFSNWLNYYVYQETPYDLSDLHERKK
>Q3J205 2.3.1.31~~~metAA~~~Homoserine O-acetyltransferase~~~COG1897
MPITLPATLPAFDVLTHEGVMVMTPERAARQDIRPLRIGLLNLMPKKIQTENQFARLIGATPLQIDFQLIRMTEHQTKNT
AAEHMEAFYRPFQEVKHEKFDGLIITGAPIEHLDFADVTYWDELCEVMDWTQTNVQSTFGVCWGGMAMIYHFHRVQKHRL
QAKAFGCFRHRNVAPTSPYLRGFSDDFVIPVSRWTEMRQAEIDAAPGLRTLLASDEAGPCLVEDPGHRALYIFNHFEYDS
DTLKQEYDRDVANGKPINVPANYYPDDDPSKPPLNRWRSHAHLLYGNWINEIYQSTPYDPQQIGR
>A5N8M0 2.3.1.31~~~metAA~~~Homoserine O-acetyltransferase~~~COG1897
MPIKIKDDLPAAEILNSENIFIMPENVAFHQDIRPLKIAILNLMPIKITTEVQLLRLIGNTALQIEIELLHLKTHVCKNT
SEEYLTNFYRTFDEIKNEKFDGLIITGAPIEQLEFSRVNYWEELKDIMEWSKCHVYSTLHICWGAQAGLYYHYGIPKYIL
KEKLFGVFKHEVTEEKEKLVRGFDDEFYVPHSRYTEVKREDVEKVKELTILAQSKKAGVYLILDNKGRRIFVTGHSEYDP
LTLKDEYMRDISKGEDIKMPENYFPDDNPDRKPVVKWRSHADLLFSNWLNYYVYQETPYDLSEMPPSFL
>A0A0D8BWP6 2.3.1.31~~~metAA~~~Homoserine O-acetyltransferase~~~
MPINIPKDLPAKEILEQENIFVMDEERAYSQDIRPLNIIILNLMPEKEKAETQLLRLLGNSPLQVNVTFLRPATHEPKTT
SKHHLEQFYTIFPHIRHRKFDGMIITGAPVEQLPFEEVTYWDELTDIMEWTKTNVTSTLHICWGAQAGLYYHYGIPKYPL
PEKCFGVFNHTVEAKNVKLLRGFDDVFRMPHSRHTDVKREDIEKVPDLTILSMSDKAGVCLVASNDGRRIFLTGHPEYDA
TTLKEEYERDLAKGLPIHIPESYFPNDDPSQPPLNTWRSHANLLFVNWLNYYVYQETPYEWE
>Q0BX37 2.3.1.31~~~metAA~~~Homoserine O-acetyltransferase~~~COG1897
MPIRIPDTLPARETLLKEGVAVMTETDAVRQDIRPLQIALLNLMPNKIRTETQIARLIGASPLQVELTLVRVGGHTPKNT
SQEHLISFYQTWEEIRERKFDGFIITGAPIEQMPFEEVNYWEELTQIFDWTQTNVHSPFYICWGAMAAAYHFHGLPKYQL
EAKAFGVFRHRNLDLASPYLSGFSDDFAIPVSRWTEVRRADVLERQSLRLLIDSDDTGPCLLEDRARRSLYMFNHVEYDS
FSLKEEFERDREAGKQIQVPFGYYPGDDPARQPLNRWRSHAHLLFGNWINQTYQSTPFSLDEIGQ
>Q03V79 2.3.1.31~~~metAA~~~Homoserine O-acetyltransferase~~~COG1897
MSVILNNGLLKRESFVIGRFEVLEPTINILLVNLMPNRLQTEKQFTRLLSHLPINVRVTFAVPSEHKIRHDTDAIMTNYV
TLNDIWHKKFDGMIVTGAPVDRMKFEQIDYWDEFRHLLEWRKTHVTESLFACWAAYGAGYAERNFPVKALSEKISGVFQA
SQIFKRHSLLKDLENISMPQSRYFTVPNFGVARRLKVAGDDILGAFILRDEHVNSTYITGHFEYDTETLENEYLRDIAID
PNTIKPKNYFYNNKPTNTWQTYAEKFFVNWGELLMEKMTSSRSTIPTLNQERNKLGLGTSQCKYL
>Q2GAJ2 2.3.1.31~~~metAA~~~Homoserine O-acetyltransferase~~~COG1897
MPIRIADNLPARRTLEAEGVIVMSETEAARQDIRPMRIALLNLMPDKITTETQIARLLGATPLQVDLELVRISDHVSKNT
SAGHISAFYRPWDDVRAEKYDGLIVTGAPVETIPYEEVSYWDELRRIFDWSQSNVHRTLSVCWGAMAALYHFHGIEKHGL
PTKASGVFRHVNHAPASPWMRGLPDVFDVPVSRWSEVRREDLPEGRGLSVLADSAETGLCLIDDPAMRTLHMFNHLEYDT
LTLAGEYARDEGKYLPRNYFPGDDPQAMPANTWRGHGHLLYGNWINETYQTTPYDLADIGR
>A6LC32 2.3.1.31~~~metAA~~~Homoserine O-acetyltransferase~~~COG1897
MPLNLQKNLPAIELLKKEHIFVMDSLRASEQDIRPLRVVVLNLMPLKITTETDLVRLLSNTPLQVELDFMKIKGHTPKNT
PIEHMQEFYKDFDEMADDFYDGMIVTGAPVEQMPFEEVSYWEEITQIFDWARTHVTSTLYICWAAQAGLYHFYGVPKYDL
PAKMFGVFRHSLREPFVPIFRGFDDEFFVPHSRHTEIRREDIMKVPELTLLSESEESGVYMAMARGGREFFITGHSEYSP
YTLNDEYMRDLGKGLPINKPRNYYRNNDPAQGPVVRWRGHANLLFTNWLNYYVYQETPFRREDIKKLGSL
>Q5LSN6 2.3.1.31~~~metAA~~~Homoserine O-acetyltransferase~~~COG1897
MPIKIPAHLPAYDILTREGVMVMSEDQAARQDIRPLRIGLLNLMPKKIQTETQFARLIGATPLQIELSLIRMTEHQTKTT
ASEHMEEFYRPFQEVRDEKFDGLIITGAPIEHLEFSDVTYWDELGEVFAWTQSNVHSTFGVCWGGMAMINHFHGIRKHML
DHKAFGCFRHRNLDPASPYLRGFSDDFVIPVSRWTEVKQAEVDAVPELVTLLGSDEVGPCLISDPGHRALYIFNHFEYDS
DTLKQEYDRDVEGGTAINVPINYYPDDDPSRKPLNRWRSHAHLLYGNWISEIYETTPYDMARIGLESTDLRG
>Q97PM9 2.3.1.31~~~metAA~~~Homoserine O-acetyltransferase~~~COG1897
MPIRIDKKLPAVEILRTENIFVMDDQRAAHQDIRPLKILILNLMPQKMVTETQLLRHLANTPLQLDIDFLYMESHRSKTT
RSEHMETFYKTFPEVKDEYFDGMIITGAPVEHLPFEEVDYWEEFRQMLEWSKTHVYSTLHICWGAQAGLYLRYGVEKYQM
DSKLSGIYPQDTLKEGHLLFRGFDDSYVSPHSRHTEISKEEVLNKTNLEILSEGPQVGVSILASRDLREIYSFGHLEYDR
DTLAKEYFRDRDAGFDPHIPENYFKDDDVNQVPCLCWSSSAALFFSNWVDHAVYQETPFDWRKIEDDASAYGYL
>E8LIC0 2.3.1.31~~~metAA~~~Homoserine O-acetyltransferase~~~COG1897
MPINIPNGLPAASVLAGEQIFVMTEERATHQDIRPLTLLFLNLMPKKIATEIQYMRKLSNTPLQVNIDLLRVDKHISKNT
PQPHLDTFYKDFEEIEGRNYDGMIITGAPLDQIDFSDVTYWDKLEKIITWSKEHVTSTLFSCWGVAAGLKIFYDLPLINR
KEKLSGVFLHHTAQSLNPLIRGFDDTFLAPHSRFIDFPSDVIRKHTDLEILADSEITGMFLAATPDRRQVFVTGHPEYDA
TTLSDEYRRDLAAGKDPKLPENYFPHDDPTQIPSCVWRSHASLLFGNWLNYYVYQITPYKW
>Q9WZY3 2.3.1.31~~~metAA~~~Homoserine O-acetyltransferase~~~COG1897
MPINVPSGLPAVKVLAKEGIFVMTEKRAIHQDIRPLEILILNLMPDKIKTEIQLLRLLGNTPLQVNVTLLYTETHKPKHT
PIEHILKFYTTFSAVKDRKFDGFIITGAPVELLPFEEVDYWEELTEIMEWSRHNVYSTMFICWAAQAGLYYFYGIPKYEL
PQKLSGVYKHRVAKDSVLFRGHDDFFWAPHSRYTEVKKEDIDKVPELEILAESDEAGVYVVANKSERQIFVTGHPEYDRY
TLRDEYYRDIGRNLKVPIPANYFPNDDPTKTPILTWWSHAHLFFSNWLNYCIYQKTPYRLEDIH
>A5IIQ2 2.3.1.31~~~metAA~~~Homoserine O-acetyltransferase~~~COG1897
MPINVPSGLPAVKVLAKEGIFVMTEKRAIHQDIRPLEILILNLMPDKIKTEIQLLRLLGNTPLQVNVTLLYTETHKPKHT
PIEHILKFYTTFSAVKDRKFDGFIITGAPVELLPFEEVDYWEELTEIMEWSRHNVYSTMFICWAAQAGLYYFYGIPKYEL
PQKLSGVYKHRVAKDSVLFRGHDDFFWAPHSRYTEVKKEDIDKVPELEILAESDEAGVYVVANKSERQIFVTGHPEYDRY
TLRDEYYRDIGRNLKVPIPANYFPNDDPTKTPILTWWSHAHLFFSNWLNYCIYQKTPYRLEDIH
>D3RNP0 2.3.1.46~~~metAS~~~Homoserine O-succinyltransferase~~~COG1897
MPIVAHNDLPTFARLREEGQQILDPDFALHQDIRELHIGLLNMMPDAALAATERQFFRLIGESNQIAQFYVHPFTLKELE
RGPEARAHVERYYQSFDQIRADGLDALIITGANVTGPDLALEPFWEPLIEVVDWAYDNVCSTLCSCLATHAVMQFRYGER
RRRLPAKRWGVFPHRVLARTHPLVADVNTCFDVPHSRFNDISHAQFVGAGLHVLVESEEAGVHLAVSPDGFRQVFFQGHP
EYDTISLLKEYKREVGRFAAGARSDYPPFPDNYFGLQTRAILNEHRECVIRALDQGRKPPELPERLIAAALHNTWHDTAE
GVIGNWMGLIYQITHHDRRQTFMAGIDPNDPLGLCCG
>B4RUL1 2.3.1.46~~~metAS~~~Homoserine O-succinyltransferase~~~
MPIRIPEQLPAQNVLLGENIFTMDMDRAANQDIRPLEVGILNLMPNKIETEVQLLRLLSNTPLQINVDLIRIDNQAPKNT
PQSHMDAFYHDFSSVVNKKYDGLIVTGAPLALIDYEEVKYWETMTTILEWAQRHVNSTLYLCWAAHAAMYHFYGITRELR
DEKFSGVFKHKVNDPNNELLRGFDPSFYAPHSRYGHIDTSLYNSVDGLNVVAESDEVGAYIVASEDKRMVFVTGHPEYDP
DTLKDEYLRDIAAGQTPPIPKNYFEGDDPATSPIVQWRSHGSLLFTNWLNYYVYQTTPYDLSQLAEKSQPKR
>A0A1D3PCJ5 2.3.1.46~~~metAS~~~Homoserine O-succinyltransferase~~~
MTLLFDGDRRIKSPALAPAGGDRRARDPIELTIGLVNNMPDSALKATDVQIARLLQQAAPWHVRIRLHCFSLPSIARSPM
ASSHVAQTYTDIDRLDGLDIDGLIVTGAEPVAARLRDESYWPDLAAIVDWARTNTKTTIWSCLAAHAAVLHLDDIERQRL
ASKCSGVFDCVKVRDDWLTHGIDAPLQVPHSRLNAVNEPLLAERGYDILTRSAEVGVDIFARTMPSRFVFFQGHPEYDAL
SLQREYMRDIARYLAGQREDYPRPPRSYFSAESEAVLNTFEIRARARRDPTLAAELPGLTLRPDLAAGHAAKLLFRNWIG
YLADG
>C8NA81 2.3.1.46~~~metAS~~~Homoserine O-succinyltransferase~~~
MPLVAHRELESLDRLRAEGQEILDVRRASQQDIRELHIGLLNLMPDGALKATERQFLRLIGNSNRIAQFYVHIFTVPGVP
RSADMQAYIDSHYENFDDLAHDGLDAIIFTGTNPLHADLAQEAYWPHVQRVFDWADKNVTSVLCSCLASHLALQHFHGIA
RKRRDEKLFGVFSHRVLDRSHPMLANINTRFDMPHSRWNGISAAQLEARGLPVLVAGEESGVAMASSPDGFRQIYFQGHP
EYDRSSLLKEFRRDLALYQDGKLPQPPKLPTHYFTPAGQRLIRDYIESGRPISDFPEAQLADEVDVTWRDTAKALFANWL
GLVYQLTHKERHLQYMDGIDPADPLGRLRR
>P07623 2.3.1.46~~~metAS~~~Homoserine O-succinyltransferase~~~COG1897
MPIRVPDELPAVNFLREENVFVMTTSRASGQEIRPLKVLILNLMPKKIETENQFLRLLSNSPLQVDIQLLRIDSRESRNT
PAEHLNNFYCNFEDIQDQNFDGLIVTGAPLGLVEFNDVAYWPQIKQVLEWSKDHVTSTLFVCWAVQAALNILYGIPKQTR
TEKLSGVYEHHILHPHALLTRGFDDSFLAPHSRYADFPAALIRDYTDLEILAETEEGDAYLFASKDKRIAFVTGHPEYDA
QTLAQEFFRDVEAGLDPDVPYNYFPHNDPQNTPRASWRSHGNLLFTNWLNYYVYQITPYDLRHMNPTLD
>Q606Y5 2.3.1.46~~~metAS~~~Homoserine O-succinyltransferase~~~COG1897
MPLVAHTDLPTFQRLREEGQDVLSVERAARQDIREMHIGLLNMMPDAALEATERQFFRLVGGANPIVQFHMHPFTIEGLP
RGDQAAEHIARYYESFDRIREEGLDGLIVSGANVTQPHLQQEAFWQPLTEVFDWARSNVTSILCSCLATHALFQYSYGVE
RTHLGFKRWGVYSHRVVEPLHPLVADINTRFDVPHSRYNEIFREDMEAAGLRVLVESEEAGVHLAVSPDLFRVIYFQAHP
EYDTVSLLKEYKREILRYFSGEREDYPPFPEHYFSLEVGAALNDYGQALRSARRAGRAPPPFPEEFVLRHLDNTWRDTAK
AVFNNWLGKIYQITDQDRRKPFMAHIDPDNPLGLA
>C7C8V4 2.3.1.46~~~metAS~~~Homoserine O-succinyltransferase~~~
MLDHGTPIQPVSLPAAELDLGAVGGIAWPEPVQRRLRVGLLNNMPDSALVQTERQFRRLIGPGVELRLFSLDTVPRGPLA
RAHLDRFYETQGALAGAGLDALVVTGAEPKAERLADEPFFPALAAVVDWADASGVPTLFSCLAAHAAVLHLDGIERRPLP
TKHSGIYACTAVAHHPLLAGMPASVPVPHSRWNDLPEQALTARGYRVLRRSEQVGVDLFVRERGASMVFLQGHPEYDGDT
LAREYRRDIGRFLDGERDTPPALPENYYVDEAVLRLDAFAAVARAYRSPALHADFPTMAETLPRPAAWQEAAAGLFRNWL
ALVSDRVALAA
>G3ISL7 2.3.1.46~~~metAS~~~Homoserine O-succinyltransferase~~~COG1897
MPLVAHTDLPTFQRLREEGEEILDPDRASNQDIREMHIGLLNMMPDAALEATERQFFRLVGACNQIVQFHVHPFTIEGLK
RSPEAQAHIAKYYESFEQIKRDGLDALIISGANVSHPRLPEEDFWQPLSEVFFWAKENVTSILCSCLATHALIQYCYGIE
RTRLPAKRWGVFSHKLTDRTHPLVAEINTRFDVPHSRFNEIFQSDMERHGLKVLAVSKEAGVHLAVSPDGFRIVFFQGHP
EYDDISLLKEYKREILRFYRAERDDYPPFPENYFNDVVQQILVDYEQRVRSAKQSGQRLEEFPESLILEHIDNTWSDTAK
AVFNNWLGKIYQLTHQERGLPFMDGVDPNNPLGL
>Q3J7P0 2.3.1.46~~~metAS~~~Homoserine O-succinyltransferase~~~COG1897
MPLVANSDLPAFERLREEGETVIPRDVALHQDIREMHIGLLNMMPDAALAATERQFFRLIGESNQIAQFYIHPFTLKEIQ
RSLEANHYVERYYQTFEQIQAEGLDALIITGANVTQPQLSLEPFWKPLIKVISWAYENVTSTLCSCLATHAVLDFRYGQK
RRRLSSKRWGVYSHRVVNRSHPLVRGVNTRFDVPHSRFNEISRDQFEAAGLHVLAESEKGGAHLAVSEDLFRIVFCQGHP
EYDSISLLKEYKREILRFASGQRDNYPPFPENYFSPKIQAILEEYQEQIIIARDKDLPLPQLPEPLIVDYLDNTWHDTAE
AIINNWMGNVYQITHSDRKRPFMEDIAPDDPLGLRRPT
>G4RES5 2.3.1.46~~~metAS~~~Homoserine O-succinyltransferase~~~COG1897
MPIRIPDQLPARKTLETEGVVVMDQSRSARQDIRPLQFGLLNLMPNKQRTETQFARLIASTPLQIDLTLVRVADPLSKST
PEDYLQNFYSTWEDVRAKKFDGFVVTGAPIANMPFEDVRYWPEMLEIMDWTQTNVHHTMFICWGAQAALHHLHGVKRYRM
EHKAFGVYRHKVLDTRHPFLRGFSDDLAVPVSRYNDIDRQSLSPDLDILIDSDEVGICMLDDRKYRAAYMLNHLEYDNTS
LADEYHRDIEAGLDTPLPVNLFPGNDPSRMPENRWRSHAHLLFQNWINEIYQTTPYELEKVGTGEW
>Q15RG1 2.3.1.46~~~metAS~~~Homoserine O-succinyltransferase~~~COG1897
MPIRIPTELPAQSILNSENIFVMDTDRAAMQDIRPLEVGILNLMPNKMETEVQFLRLLSNTPLQINVDLIRIDNQAPKNT
PESHMKAFYQNFDDVADKQYDGLIVTGAPLALLDYDDVKYWQKMKVVLEWAQRHVQSTLFSCWAAHAALYHFHGKNRRLR
DEKLSGIFAHQVKDEHNELMRGFDPVFHAPHSRYGEISVADYESVEGLSVLASSEKTGAYIVASDDKRLVFVTGHPEYDP
DSLDQEYKRDLKAGLTPNMPENYYPDDDPAQGPLVTWRAHGSLLFTNWLNYYVYQNTPYDLASLSNKA
>Q3IHM8 2.3.1.46~~~metAS~~~Homoserine O-succinyltransferase~~~COG1897
MPITVKDELPAIARLRQENVFVMPQTRAKTQEIRPMRLAILNLMPNKVETEVQFIRLLANSPLQVNVELLRLDTHRSSSN
SEQHLDTFYRYFSEVKNNNYDALIVTGAPLAHLEYQDVAYWDEFTAILDWAEQHVTSTLFSCWAAHAALYHHYKIKRDLK
TDKLCGVFTHQCYFEHGALTRGFDDTFLVPHSRYGHVDVNKINACNELVVLAGSEKVGAYLVKNKSGSQVFITGHPEYDA
DSLKAEYQRDCEKSDNAPKPENYFPDDDATKQPSKTWQSHAFLLFSNWLNYYVYQTTPYDINLVSQDVRTNNYAE
>A0A1D3PCI9 2.3.1.46~~~metAS~~~Homoserine O-succinyltransferase~~~
MTLLFDKTGPIDSPTLAPASVDNHCRSPDRSAAARVIEIGLVNNMSDAALRATERQFMRLLRAGSGEHLVRLHCFALPSV
QRSPATRQRIDSLYADIADLRHTRLDALIVTGAEPRAATLQSEPYWDEMRALVDWAEANTRSTIWSCLAAHAAVLHLDGI
ERERLPQKCSGVFAGEQVNDDALLSDLPSPLKVPHSRLNDLAADRLAARGYEVLTHAPNAGVDIFARQGRSRFVFFQGHP
EYDATSLQREYLRDIGRFLTGERHDYPEFPVDYFDADIEDALDAFRAEAEAARDPAIIARLPHLALRQGTAEGIETTANA
LFRNWLISLASEP
>A6WLE9 2.3.1.46~~~metAS~~~Homoserine O-succinyltransferase~~~
MPVRIPDHLPAAEVLESENIFVMSETRAANQDIRPMKVLILNLMPNKIETETQLLRLLGNTPLQVDVDLLRIHDKESKHT
SIDHMNTFYRDFEAVRHKNYDGLIITGAPLGQIDFEDVVYWDHIREIIDWSQEHVTSVLFLCWAAHAGLYHLYGLNRKIL
QQKRSGVFVHRRTSQHFPLLRGFDDEFFAPHSRFAEMDVEEIRQHPQLQLLAESDEAGAYLVLSRNNRNLFVMGHPEYQK
STLNEEYQRDLSQGLDPNVPQNYYRNDDPKADAIARWHSHGSLLVSNWLNYYVYQLTPYDLSDMTAMTPWESR
>Q3SM51 2.3.1.46~~~metAS~~~Homoserine O-succinyltransferase~~~COG1897
MPLVAHNALPTFERLRRDGITVLSTERAANQEIRELHVGLLNMMPDAALEATERQFYRLVGESNPIAQFYMHPFTLETLP
RGDKARAHIGRHYEKFEDIRAAGLDALIITGANVSHANLADEAFWQPLIEVIDWAWDNVTSTLCSCLATHAVMQFRYAQT
RVLQPRKIWGVFEHRVTDVRHPLVADVNTRFDVPHSRWNDVSRAQFEAAGVKVLVESEEAGVHLAVSGDGLRTVFFQGHP
EYDTVSLLKEYKRDLLLATTGALAEPPFPRRYFDRKAQAFLAEFARRTRVGETLVFPEAEVVPLLDNTWHDTAEAVIGNW
IGCVYQVTHRERGLPFMPGVDPDNPLGLA
>L0E1U3 2.3.1.46~~~metAS~~~Homoserine O-succinyltransferase~~~COG1897
MPLVAHSNLPTFERLRKEGGTVLPNDYALHQDIRALHIGLLNMMPDAALAATERQFFRLVGESNQIAQFYMHPFTLAELP
RGPGGQAHVERYYETFDTIQREGLDALIITGANVSQPDLALEPFWEPLAEVVEWAWKNVTSTLCSCLTTHAVMQSRYGER
RRHRGAKLWGVFDHRVVDRTHPLVAGVNTRFDVPHSRFNDVSREQFDRHRLKVLVESERAGVHLAVSEDGFRLVFFQGHP
EYDSISLLKEYKREVLRFVNGEREEFPPLPERYLSPQAAAILEEHRERVEQARQRRVPAPELPEPLLVGRLDNTWHDSAL
AVVNNWIGNVYQFTNHDRRIPFRPGVDPNAPLNWSR
>I3BN39 2.3.1.46~~~metAS~~~Homoserine O-succinyltransferase~~~COG1897
MPLVAHTRLPTFERLKQEGQTVLSEDYAFQQDIRELHIGFLNMMPDAALAATERQFLRLVNESNLIAQFHIHPFTLGTLP
RGDKAQAHIAQYYDKFEDLQEQGLDALIITGANPAAPHLEDEPFWDGLCEVVAWAQENVTSTLCSCLASHALVQHLWGIR
RRPLGFKRWGVYSHAVTMPEHPLVNDLNTRFDVPHSRFNQIDREPLEAVGVQVLVESTEAGVHMAVSPDLFRMVFMQGHP
EYDTVSLLKEYRREVTRWFDGTRADYPPFPQNYLRPKAKAILNEYRLEQEKAKRLGKPLPDFPEKLLLPMLHNTWCDTAK
VFYSNWIGKVYQLTNNDRRKPFMEGVNPDDPLGLRQQLGI
>P00935 2.5.1.48~~~metB~~~Cystathionine gamma-synthase~~~COG0626
MTRKQATIAVRSGLNDDEQYGCVVPPIHLSSTYNFTGFNEPRAHDYSRRGNPTRDVVQRALAELEGGAGAVLTNTGMSAI
HLVTTVFLKPGDLLVAPHDCYGGSYRLFDSLAKRGCYRVLFVDQGDEQALRAALAEKPKLVLVESPSNPLLRVVDIAKIC
HLAREVGAVSVVDNTFLSPALQNPLALGADLVLHSCTKYLNGHSDVVAGVVIAKDPDVVTELAWWANNIGVTGGAFDSYL
LLRGLRTLVPRMELAQRNAQAIVKYLQTQPLVKKLYHPSLPENQGHEIAARQQKGFGAMLSFELDGDEQTLRRFLGGLSL
FTLAESLGGVESLISHAATMTHAGMAPEARAAAGISETLLRISTGIEDGEDLIADLENGFRAANKG
>Q1M0P5 2.5.1.48~~~metB~~~Cystathionine gamma-synthase~~~COG0626
MHMQTKLIHGGISEDATTGAVSVPIYQTSTYRQDAIGHHKGYEYSRSGNPTRFALEELIADLEGGVKGFAFASGLAGIHA
VFSLLQSGDHVLLGDDVYGGTFRLFNKVLVKNGLSCTIIDTSDLSQIKKAIKPNTKALYLETPSNPLLKITDLAQCASVA
KDHGLLTIVDNTFATPYYQNPLLLGADIVVHSGTKYLGGHSDVVAGLVTTNNEALAQEIAFFQNAIGGVLGPQDSWLLQR
GIKTLGLRMQAHQKNALCVAEFLEKHPKVERVYYPGLPTHPNYELAKKQMRGFSGMLSFTLKNDSEATPFVESLKLFILG
ESLGGVESLVGVPAFMTHACIPKTQREAAGIRDGLVRLSVGIEHEQDLLEDLEQAFAKIS
>P56069 2.5.1.48~~~metB~~~Cystathionine gamma-synthase~~~COG0626
MRMQTKLIHGGISEDATTGAVSVPIYQTSTYRQDAIGRHKGYEYSRSGNPTRFALEELIADLEGGVKGFAFASGLAGIHA
VFSLLQSGDHVLLGDDVYGGTFRLFNQVLVKNGLSCTIIDTSDISQIKKAIKPNTKALYLETPSNPLLKITDLAQCASVA
KDHGLLTIVDNTFATPYYQNPLLLGADIVAHSGTKYLGGHSDVVAGLVTTNNEALAQEIAFFQNAIGGVLGPQDSWLLQR
GIKTLGLRMEAHQKNALCVAEFLEKHPKVERVYYPGLPTHPNYELAKKQMRGFSGMLSFTLKNDSEAVAFVESLKLFILG
ESLGGVESLVGIPAFMTHACIPKTQREAAGIRDGLVRLSVGIEHEQDLLEDLEQAFAKIG
>P9WGB7 2.5.1.48~~~metB~~~Cystathionine gamma-synthase~~~COG0626
MSEDRTGHQGISGPATRAIHAGYRPDPATGAVNVPIYASSTFAQDGVGGLRGGFEYARTGNPTRAALEASLAAVEEGAFA
RAFSSGMAATDCALRAMLRPGDHVVIPDDAYGGTFRLIDKVFTRWDVQYTPVRLADLDAVGAAITPRTRLIWVETPTNPL
LSIADITAIAELGTDRSAKVLVDNTFASPALQQPLRLGADVVLHSTTKYIGGHSDVVGGALVTNDEELDEEFAFLQNGAG
AVPGPFDAYLTMRGLKTLVLRMQRHSENACAVAEFLADHPSVSSVLYPGLPSHPGHEIAARQMRGFGGMVSVRMRAGRRA
AQDLCAKTRVFILAESLGGVESLIEHPSAMTHASTAGSQLEVPDDLVRLSVGIEDIADLLGDLEQALG
>O31632 4.4.1.13~~~metC~~~Cystathionine beta-lyase MetC~~~COG0626
MSKHNWTLETQLVHNPFKTDGGTGAVSVPIQHASTFHQSSFEEFGAYDYSRSGTPTRTALEETIAALEGGTRGFAFSSGM
AAISTAFLLLSQGDHVLVTEDVYGGTFRMVTEVLTRFGIEHTFVDMTDRNEVARSIKPNTKVIYMETPSNPTLGITDIKA
VVQLAKENGCLTFLDNTFMTPALQRPLDLGVDIVLHSATKFLSGHSDVLSGLAAVKDEELGKQLYKLQNAFGAVLGVQDC
WLVLRGLKTLQVRLEKASQTAQRLAEFFQKHPAVKRVYYPGLADHPGAETHKSQSTGAGAVLSFELESKEAVKKLVENVS
LPVFAVSLGAVESILSYPATMSHAAMPKEEREKRGITDGLLRLSVGVEHADDLEHDFEQALKEIAPVSVR
>Q07703 4.4.1.13~~~metC~~~Cystathionine beta-lyase~~~
MSDTSAKHIDTLLQHLGSAPFNPDTGAAPVNLPSVRASTVRFQSLAKLEDAQRRKAAGERASTYGRMGMDTHAALEQVFA
ELEGGTHCYLASSGLAGISMVFLSLLSAGEHALVADCAYGPVHELHEAVLSRLGIDVTFFDAKADLASLVRPTTRLIFAE
APGSLLFEMLDMPALARFAKQHDLILATDNTWGSGYIYRPLTLGAQVSVIAGTKYVGGHSDLMLGAVVTNDEAIAKRLNR
TQYALGYSVSADDAWLALRGVRTMPVRMAQHARHALEVCEFLQNRPEVVRLYHPAWPADPGHALWQRDCSGSNGMLAVQL
GLSPQAARDFVNALTLFGIGFSWGGFESLVQLVTPGELARHQYWQGGSDALVRLHIGLESPADLIADLAQALDRAA
>Q83A83 4.4.1.13~~~metC~~~Cystathionine beta-lyase~~~COG0626
MTANNNKKSHIDTRVIHAGQKPDPLTGAVMTPIYTASTYAQKSPGVHQGYEYSRSQNPTRFAYERCVADLESGQHGFAFA
SGMAATATILELLQPGDHVVVMDDVYGGSYRLFENVRKRSAGLSFSFVDFTDENKVREAVTAKTKMLWVESPSNPRLKIV
DLAKIAEIAKEKNIIAVADNTFATPIIQRPLELGFDIVTHSATKYLNGHSDIIGGVAVVGDNKTLAEQLKYLQNAIGAIA
APFDSFMVLRGLKTLAIRMERHCENAMQLAQWLEKHPKVKRVYYPGLPSHPQHSIAKKQMRYFGGMISVELKCDLNETKK
VLERCQLFTLAESLGGVESLIEHPAIMTHASIPQAERQKLGITDGFIRLSVGIEAITDLRHDLEAAL
>P06721 4.4.1.13~~~metC~~~Cystathionine beta-lyase MetC~~~COG0626
MADKKLDTQLVNAGRSKKYTLGAVNSVIQRASSLVFDSVEAKKHATRNRANGELFYGRRGTLTHFSLQQAMCELEGGAGC
VLFPCGAAAVANSILAFIEQGDHVLMTNTAYEPSQDFCSKILSKLGVTTSWFDPLIGADIVKHLQPNTKIVFLESPGSIT
MEVHDVPAIVAAVRSVVPDAIIMIDNTWAAGVLFKALDFGIDVSIQAATKYLVGHSDAMIGTAVCNARCWEQLRENAYLM
GQMVDADTAYITSRGLRTLGVRLRQHHESSLKVAEWLAEHPQVARVNHPALPGSKGHEFWKRDFTGSSGLFSFVLKKKLN
NEELANYLDNFSLFSMAYSWGGYESLILANQPEHIAAIRPQGEIDFSGTLIRLHIGLEDVDDLIADLDAGFARIV
>P0C2T9 4.4.1.13~~~metC~~~Cystathionine beta-lyase~~~
MTSLKTKVIHGGISTDRTTGAVSVPIYQTSTYKQNGLGQPKEYEYSRSGNPTRHALEELIADLEGGVQGFAFSSGLAGIH
AVLSLFSAGDHIILADDVYGGTFRLVDKVLTKTGIIYDLVDLSNLEDLKAAFKAETKAVYFETPSNPLLKVLDIKEISSI
AKAHNALTLVDNTFATPYLQQPIALGADIVLHSATKYLGGHSDVVAGLVTTNSNELAIEIGFLQNSIGAVLGPQDSWLVQ
RGIKTLAPRMEAHSANAQKIAEFLEASQAVSKVYYPGLVNHEGHEIAKKQMTAFGGMISFELTDENAVKNFVENLRYFTL
AESLGGVESLIEVPAVMTHASIPKELREEIGIKDGLIRLSVGVEALEDLLTDLKEALEKE
>A2RM21 4.4.1.13~~~metC~~~Cystathionine beta-lyase~~~COG0626
MTSIKTKVIHGGISTDKTTGAVSVPIYQTSTYKQNGLGQPKEYEYSRSGNPTRHALEELIADLEGGVQGFAFSSGLAGIH
AVLSLFSAGDHIILADDVYGGTFRLMDKVLTKTGIIYDLVDLSNLDDLKAAFKEETKAIYFETPSNPLLKVLDIKEISAI
AKAHDALTLVDNTFATPYLQQPIALGADIVLHSATKYLGGHSDVVAGLVTTNSKELASEIGFLQNSIGAVLGPQDSWLVQ
RGIKTLALRMEAHSANAQKIAEFLETSKAVSKVYYPGLNSHPGHEIAKKQMSAFGGMISFELTDENAVKDFVENLSYFTL
AESLGGVESLIEVPAVMTHASIPKELREEIGIKDGLIRLSVGVEAIEDLLTDIKEALEKK
>Q4L332 4.4.1.13~~~metC~~~Cystathionine beta-lyase MetC~~~COG0626
MSLSKETEVIFDEHRGVDYDSANPPLYDSSTFHQKVLGGNAKFDYARSGNPNRQLLEEKLAKLEGGQYAFAYASGIAAIS
AVLLTLKANDHVILPDDVYGGTFRLTEQILNRFDIQFTTVNATQPKEIERAIQPNTKLIYVETPSNPCFKITDIRAVAAI
AKRHHLLLAVDNTFMTPLGQSPLALGADIVVHSATKFLGGHSDIIAGAAITNRKDVADALYLLQNGTGTALSAHDSWTLA
KHLKTLPVRFKQSTSNAEKLVAFLKEREEIAEVYYPGNSSLHLSQANSGGAVIGFRLKDETKTQDFVDALTLPLVSVSLG
GVETILSHPATMSHAAVPEDVRNERGITFGLFRLSVGLEQPQELIADLNYALKEAFNESIIESITEQRFSS
>P80877 2.1.1.14~~~metE~~~5-methyltetrahydropteroyltriglutamate--homocysteine methyltransferase~~~COG0620
MTTIKTSNLGFPRIGLNREWKKALEAYWKGSTDKDTFLKQIDELFLSAVKTQIDQQIDVVPVSDFTQYDHVLDTAVSFNW
IPKRFRHLTDATDTYFAIARGIKDAVSSEMTKWFNTNYHYIVPEYDESIEFRLTRNKQLEDYRRIKQEYGVETKPVIVGP
YTFVTLAKGYEPSEAKAIQKRLVPLYVQLLKELEEEGVKWVQIDEPALVTASSEDVRGAKELFESITSELSSLNVLLQTY
FDSVDAYEELISYPVQGIGLDFVHDKGRNLEQLKTHGFPTDKVLAAGVIDGRNIWKADLEERLDAVLDILSIAKVDELWI
QPSSSLLHVPVAKHPDEHLEKDLLNGLSYAKEKLAELTALKEGLVSGKAAISEEIQQAKADIQALKQFATGANSEQKKEL
EQLTDKDFKRPIPFEERLALQNESLGLPLLPTTTIGSFPQSAEVRSARQKWRKAEWSDEQYQNFINAETKRWIDIQEELE
LDVLVHGEFERTDMVEYFGEKLAGFAFTKYAWVQSYGSRCVRPPVIYGDVEFIEPMTVKDTVYAQSLTSKHVKGMLTGPV
TILNWSFPRNDISRKEIAFQIGLALRKEVKALEDAGIQIIQVDEPALREGLPLKTRDWDEYLTWAAEAFRLTTSSVKNET
QIHTHMCYSNFEDIVDTINDLDADVITIEHSRSHGGFLDYLKNHPYLKGLGLGVYDIHSPRVPSTEEMYNIIVDALAVCP
TDRFWVNPDCGLKTRQQEETVAALKNMVEAAKQARAQQTQLV
>P25665 2.1.1.14~~~metE~~~5-methyltetrahydropteroyltriglutamate--homocysteine methyltransferase~~~COG0620
MTILNHTLGFPRVGLRRELKKAQESYWAGNSTREELLAVGRELRARHWDQQKQAGIDLLPVGDFAWYDHVLTTSLLLGNV
PARHQNKDGSVDIDTLFRIGRGRAPTGEPAAAAEMTKWFNTNYHYMVPEFVKGQQFKLTWTQLLDEVDEALALGHKVKPV
LLGPVTWLWLGKVKGEQFDRLSLLNDILPVYQQVLAELAKRGIEWVQIDEPALVLELPQAWLDAYKPAYDALQGQVKLLL
TTYFEGVTPNLDTITALPVQGLHVDLVHGKDDVAELHKRLPSDWLLSAGLINGRNVWRADLTEKYAQIKDIVGKRDLWVA
SSCSLLHSPIDLSVETRLDAEVKSWFAFALQKCHELALLRDALNSGDTAALAEWSAPIQARRHSTRVHNPAVEKRLAAIT
AQDSQRANVYEVRAEAQRARFKLPAWPTTTIGSFPQTTEIRTLRLDFKKGNLDANNYRTGIAEHIKQAIVEQERLGLDVL
VHGEAERNDMVEYFGEHLDGFVFTQNGWVQSYGSRCVKPPIVIGDISRPAPITVEWAKYAQSLTDKPVKGMLTGPVTILC
WSFPREDVSRETIAKQIALALRDEVADLEAAGIGIIQIDEPALREGLPLRRSDWDAYLQWGVEAFRINAAVAKDDTQIHT
HMCYCEFNDIMDSIAALDADVITIETSRSDMELLESFEEFDYPNEIGPGVYDIHSPNVPSVEWIEALLKKAAKRIPAERL
WVNPDCGLKTRGWPETRAALANMVQAAQNLRRG
>P9WK07 2.1.1.14~~~metE~~~5-methyltetrahydropteroyltriglutamate--homocysteine methyltransferase~~~COG0620
MTQPVRRQPFTATITGSPRIGPRRELKRATEGYWAGRTSRSELEAVAATLRRDTWSALAAAGLDSVPVNTFSYYDQMLDT
AVLLGALPPRVSPVSDGLDRYFAAARGTDQIAPLEMTKWFDTNYHYLVPEIGPSTTFTLHPGKVLAELKEALGQGIPARP
VIIGPITFLLLSKAVDGAGAPIERLEELVPVYSELLSLLADGGAQWVQFDEPALVTDLSPDAPALAEAVYTALCSVSNRP
AIYVATYFGDPGAALPALARTPVEAIGVDLVAGADTSVAGVPELAGKTLVAGVVDGRNVWRTDLEAALGTLATLLGSAAT
VAVSTSCSTLHVPYSLEPETDLDDALRSWLAFGAEKVREVVVLARALRDGHDAVADEIASSRAAIASRKRDPRLHNGQIR
ARIEAIVASGAHRGNAAQRRASQDARLHLPPLPTTTIGSYPQTSAIRVARAALRAGEIDEAEYVRRMRQEITEVIALQER
LGLDVLVHGEPERNDMVQYFAEQLAGFFATQNGWVQSYGSRCVRPPILYGDVSRPRAMTVEWITYAQSLTDKPVKGMLTG
PVTILAWSFVRDDQPLADTANQVALAIRDETVDLQSAGIAVIQVDEPALRELLPLRRADQAEYLRWAVGAFRLATSGVSD
ATQIHTHLCYSEFGEVIGAIADLDADVTSIEAARSHMEVLDDLNAIGFANGVGPGVYDIHSPRVPSAEEMADSLRAALRA
VPAERLWVNPDCGLKTRNVDEVTASLHNMVAAAREVRAG
>Q9JZQ2 2.1.1.14~~~metE~~~5-methyltetrahydropteroyltriglutamate--homocysteine methyltransferase~~~
MTTLHFSGFPRVGAFRELKFAQEKYWRKEISEQELLAVAKDLREKNWKHQVAANADFVAVGDFTFYDHILDLQVATGAIP
ARFGFDSQNLSLEQFFQLARGNKDQFAIEMTKWFDTNYHYLVPEFHADTEFKANAKHYVQQLQEAQALGLKAKPTVVGPL
TFLWVGKEKGAVEFDRLSLLPKLLPVYVEILTALVEAGAEWIQIDEPALAVDLPKEWVEAYKDVYATLSKVSAKILLSTY
FGSVAEHAALLKALPVDGLHIDLVRAPEQLDAFADYDKVLSAGVIDGRNIWRANLNKVLETVEPLQAKLGDRLWISSSCS
LLHTPFDLSVEEKLKANKPDLYSWLAFTLQKTQELRVLKAALNEGRDSVAEELAASQAAADSRANSSEIHRADVAKRLAD
LPANADQRKSPFADRIKAQQAWLNLPLLPTTNIGSFPQTTEIRQARSAFKKGELSAADYEAAMKKEIALVVEEQEKLDLD
VLVHGEAERNDMVEYFGELLSGFAFTQYGWVQSYGSRCVKPPIIFGDVSRPEAMTVAWSTYAQSLTKRPMKGMLTGPVTI
LQWSFVRNDIPRSTVCKQIALALNDEVLDLEKAGIKVIQIDEPAIREGLPLKRADWDAYLNWAGESFRLSSAGCEDSTQI
HTHMCYSEFNDILPAIAAMDADVITIETSRSDMELLTAFGEFQYPNDIGPGVYDIHSPRVPTEAEVEHLLRKAIEVVPVE
RLWVNPDCGLKTRGWKETLEQLQVMMNVTRKLRAELAK
>Q8CWX6 2.1.1.14~~~metE~~~5-methyltetrahydropteroyltriglutamate--homocysteine methyltransferase~~~COG0620
MTKVSSLGYPRLGENREWKKLIEAYWAGKVSKNDLFAGAKELRLDFLKKQLNAGLDLIPVGDFSLYDHILDLSVQFNIIP
KRFAKEPIDIDLYFAIARGNKENVASSMKKWFNTNYHYIVPEWSKQRPKLNNNRLLDLYLEAREVVGDKAKPVITGPITY
VALSTGVEDFTAAVKSLLPLYKQVFTELVKAGASYIQVDEPIFVTDEGKDYLQAAKAVYAYFAKEVPDAKFIFQTYFEGL
IDSQVLSQLPVDAFGLDFVYGLEENLEAIKTGAFKGKEIFAGVIDGRNIWSSDFVKTSALLETIEEQSAALTIQPSCSLL
HVPVTTKNETDLDPVLRNGLAFADEKLTEVKRLAEHLDGREDPAYDLHIAHFDALQAADFRNVKLEDLSRVATKRPSDFA
KRRDIQQEKLHLPLLPTTTIGSFPQSREIRRTRLAWKRGDISDAEYKQFIQAEIERWIRIQEDLDLDVLVHGEFERVDMV
EFFGQKLAGFTTTKFGWVQSYGSRAVKPPIIYGDVQHLEPITVEETVYAQSLTDRPVKGMLTGPITITNWSFERTDIPRD
QLFNQIGLAIKDEIKLLENAGIAIIQVDEAALREGLPLRKSKQKAYLDDAVHAFHIATSSVKDETQIHTHMCYSKFDEII
DAIRALDADVISIETSRSHGDIIESFETAVYPLGIGLGVYDIHSPRVPTKEEVVANIERPLRQLSPTQFWVNPDCGLKTR
QEPETIAALKVLVAATKEVRQKLGN
>Q9X112 2.1.1.14~~~metE~~~5-methyltetrahydropteroyltriglutamate--homocysteine methyltransferase~~~COG0620
MKAYAFGFPKIGEKREFKKALEDFWKGKITEEQFEEEMNKLRMYMVENYRKNVDVIPSNELSYYDFVLDTAVMVGAVPER
FGEYRGLSTYFDMARGGKALEMTKFFNTNYHYLVPEIETEEFYLLENKPLEDYLFFKSKGIETAPWVIGPFTFLYLSKRN
GEWIRRPNQMEKLLESLVSVYKEVFEKLVENGCKEILVNEPAFVCDLEKAHWDLILNVYRELSEFPLTVFTYYDSVSDYE
ACVSLPVKRLHFDFVSNEENLKNLEKHGFPEDKKLVAGVINGRQPWKVDLRKVASLVEKLGASAISNSCPLFHLPVTLEL
ENNLPGGLKEKLAFAKEKLEELKMLKDFLEGKTFDLPNVSFEDFAVDLQAVERVRNLPEDSFRREKEYTERDRIQRERLN
LPLFPTTTIGSFPQTPEVRKMRSKYRKGEISKEEYEAFIKEQIKKAIELQEEIGLDVLVHGEFERTDMVEFFAEKLNGIA
TTQNGWVLSYGSRCYRPPIIYGTVTRPEPMTLKEITYAQSLTEKPVKGMLTGPVTIMSWSYYREDIPEREIAYQIALAIN
EEVKDLEEAGIKIVQIDEPAFREKAPIKKSKWPEYFEWAINAFNLAANARPETQIHAHMCYSDFNEIIEYIHQLEFDVIS
IEASRSKGEIISAFENFKGWIKQIGVGVWDIHSPAVPSINEMREIVERVLRVLPKELIWINPDCGLKTRNWDEVIPSLRN
MVALAKEMREKFES
>P0AEZ1 1.5.1.54~~~metF~~~5,10-methylenetetrahydrofolate reductase~~~COG0685
MSFFHASQRDALNQSLAEVQGQINVSFEFFPPRTSEMEQTLWNSIDRLSSLKPKFVSVTYGANSGERDRTHSIIKGIKDR
TGLEAAPHLTCIDATPDELRTIARDYWNNGIRHIVALRGDLPPGSGKPEMYASDLVTLLKEVADFDISVAAYPEVHPEAK
SAQADLLNLKRKVDAGANRAITQFFFDVESYLRFRDRCVSAGIDVEIIPGILPVSNFKQAKKFADMTNVRIPAWMAQMFD
GLDDDAETRKLVGANIAMDMVKILSREGVKDFHFYTLNRAEMSYAICHTLGVRPGL
>P45208 1.5.1.54~~~metF~~~5,10-methylenetetrahydrofolate reductase~~~COG0685
MSYAKEIDTLNQHIADFNKKINVSFEFFPPKNEKMETLLWDSIHRLKVLKPKFVSVTYGANSGERDRTHGIVKAIKQETG
LEAAPHLTGIDATPEELKQIARDYWDSGIRRIVALRGDEPKGYAKKPFYASDLVELLRSVADFDISVAAYPEVHPEAKSA
QADLINLKRKIDAGANHVITQFFFDIENYLRFRDRCASIGIDTEIVPGILPVTNFKQLQKMASFTNVKIPAWLVKAYDGL
DNDPTTRNLVAASVAMDMVKILSREGVNDFHFYTLNRSELTYAICHMLGVRP
>Q9JZQ3 1.5.1.54~~~metF~~~5,10-methylenetetrahydrofolate reductase~~~
MNYAKEINALNNSLSDLKGDINVSFEFFPPKNEQMETMLWDSIHRLQTLHPKFVSVTYGANSGERDRTHGIVKRIKQETG
LEAAPHLTGIDASPDELRQIAKDYWDSGIRRIVALRGDEPAGYEKKPFYAEDLVKLLRSVADFDISVAAYPEVHPEAKSA
QADLINLKRKIDAGANHVITQFFFDVERYLRFRDRCVMLGIDVEIVPGILPVTNFKQLGKMAQVTNVKIPSWLSQMYEGL
DDDQGTRNLVAASIAIDMVKVLSREGVKDFHFYTLNRSELTYAICHILGVRP
>G2IQS8 1.5.1.54~~~metF~~~5,10-methylenetetrahydrofolate reductase~~~COG0685
MATATLDKAALSRLFTDYSLEITPKDVEALENAAHMIPPGTLISVTFLPGAEYEDRARAAKRIQELGFRPVPHLSARRLI
DEADLRTYLDMLKGVIDLKHVFVIAGDPNEPLGIYEDALALIDSGILKEYGIEHCGISGYPEGHPDITDEKLAKAMHDKV
ASLKRQGIDYSIMTQFGFDAEPVLEWLKQIRSEGIDGPVRIGLAGPASIKTLLRFAARCGVGTSAKVVKKYGLSITSLIG
SAGPDPVIEDLTPVLGPEHGQVHLHFYPFGGLVKTNEWIVNFKGKQGI
>O54235 1.5.1.54~~~metF~~~5,10-methylenetetrahydrofolate reductase~~~
MALGTASTRTDRARTVRDILATGKTTYSFEFSAPKTPKGEKNLWSALRRVEAVAPDFVSVTYGAGGSTRAGTVRETQQIV
ADTTLTPVAHLTAVDHSVAELRNIIGQYADAGIRNMLAVRGDPPGDPNADWIAHPEGLTYAAELVRLIKESGDFCVGVAA
FPEMHPRSADWDTDVTNFVDKCRAGADYAITQMFFQPDSYLRLRDRVAAAGCATPVIPEVMPVTSVKMLERLPKLSNASF
PAELKERILTAKDDPAAVRSIGIEFATEFCARLLAEGVPGLHFITLNNSTATLEIYENLGLHHPPRA
>P13009 2.1.1.13~~~metH~~~Methionine synthase~~~COG0646
MSSKVEQLRAQLNERILVLDGGMGTMIQSYRLNEADFRGERFADWPCDLKGNNDLLVLSKPEVIAAIHNAYFEAGADIIE
TNTFNSTTIAMADYQMESLSAEINFAAAKLARACADEWTARTPEKPRYVAGVLGPTNRTASISPDVNDPAFRNITFDGLV
AAYRESTKALVEGGADLILIETVFDTLNAKAAVFAVKTEFEALGVELPIMISGTITDASGRTLSGQTTEAFYNSLRHAEA
LTFGLNCALGPDELRQYVQELSRIAECYVTAHPNAGLPNAFGEYDLDADTMAKQIREWAQAGFLNIVGGCCGTTPQHIAA
MSRAVEGLAPRKLPEIPVACRLSGLEPLNIGEDSLFVNVGERTNVTGSAKFKRLIKEEKYSEALDVARQQVENGAQIIDI
NMDEGMLDAEAAMVRFLNLIAGEPDIARVPIMIDSSKWDVIEKGLKCIQGKGIVNSISMKEGVDAFIHHAKLLRRYGAAV
VVMAFDEQGQADTRARKIEICRRAYKILTEEVGFPPEDIIFDPNIFAVATGIEEHNNYAQDFIGACEDIKRELPHALISG
GVSNVSFSFRGNDPVREAIHAVFLYYAIRNGMDMGIVNAGQLAIYDDLPAELRDAVEDVILNRRDDGTERLLELAEKYRG
SKTDDTANAQQAEWRSWEVNKRLEYSLVKGITEFIEQDTEEARQQATRPIEVIEGPLMDGMNVVGDLFGEGKMFLPQVVK
SARVMKQAVAYLEPFIEASKEQGKTNGKMVIATVKGDVHDIGKNIVGVVLQCNNYEIVDLGVMVPAEKILRTAKEVNADL
IGLSGLITPSLDEMVNVAKEMERQGFTIPLLIGGATTSKAHTAVKIEQNYSGPTVYVQNASRTVGVVAALLSDTQRDDFV
ARTRKEYETVRIQHGRKKPRTPPVTLEAARDNDFAFDWQAYTPPVAHRLGVQEVEASIETLRNYIDWTPFFMTWSLAGKY
PRILEDEVVGVEAQRLFKDANDMLDKLSAEKTLNPRGVVGLFPANRVGDDIEIYRDETRTHVINVSHHLRQQTEKTGFAN
YCLADFVAPKLSGKADYIGAFAVTGGLEEDALADAFEAQHDDYNKIMVKALADRLAEAFAEYLHERVRKVYWGYAPNENL
SNEELIRENYQGIRPAPGYPACPEHTEKATIWELLEVEKHTGMKLTESFAMWPGASVSGWYFSHPDSKYYAVAQIQRDQV
EDYARRKGMSVTEVERWLAPNLGYDAD
>O33259 2.1.1.13~~~metH~~~Methionine synthase~~~COG0646
MTAADKHLYDTDLLDVLSQRVMVGDGAMGTQLQAADLTLDDFRGLEGCNEILNETRPDVLETIHRNYFEAGADAVETNTF
GCNLSNLGDYDIADRIRDLSQKGTAIARRVADELGSPDRKRYVLGSMGPGTKLPTLGHTEYAVIRDAYTEAALGMLDGGA
DAILVETCQDLLQLKAAVLGSRRAMTRAGRHIPVFAHVTVETTGTMLLGSEIGAALTAVEPLGVDMIGLNCATGPAEMSE
HLRHLSRHARIPVSVMPNAGLPVLGAKGAEYPLLPDELAEALAGFIAEFGLSLVGGCCGTTPAHIREVAAAVANIKRPER
QVSYEPSVSSLYTAIPFAQDASVLVIGERTNANGSKGFREAMIAEDYQKCLDIAKDQTRDGAHLLDLCVDYVGRDGVADM
KALASRLATSSTLPIMLDSTETAVLQAGLEHLGGRCAINSVNYEDGDGPESRFAKTMALVAEHGAAVVALTIDEEGQART
AQKKVEIAERLINDITGNWGVDESSILIDTLTFTIATGQEESRRDGIETIEAIRELKKRHPDVQTTLGLSNISFGLNPAA
RQVLNSVFLHECQEAGLDSAIVHASKILPMNRIPEEQRNVALDLVYDRRREDYDPLQELMRLFEGVSAASSKEDRLAELA
GLPLFERLAQRIVDGERNGLDADLDEAMTQKPPLQIINEHLLAGMKTVGELFGSGQMQLPFVLQSAEVMKAAVAYLEPHM
ERSDDDSGKGRIVLATVKGDVHDIGKNLVDIILSNNGYEVVNIGIKQPIATILEVAEDKSADVVGMSGLLVKSTVVMKEN
LEEMNTRGVAEKFPVLLGGAALTRSYVENDLAEIYQGEVHYARDAFEGLKLMDTIMSAKRGEAPDENSPEAIKAREKEAE
RKARHQRSKRIAAQRKAAEEPVEVPERSDVAADIEVPAPPFWGSRIVKGLAVADYTGLLDERALFLGQWGLRGQRGGEGP
SYEDLVETEGRPRLRYWLDRLSTDGILAHAAVVYGYFPAVSEGNDIVVLTEPKPDAPVRYRFHFPRQQRGRFLCIADFIR
SRELAAERGEVDVLPFQLVTMGQPIADFANELFASNAYRDYLEVHGIGVQLTEALAEYWHRRIREELKFSGDRAMAAEDP
EAKEDYFKLGYRGARFAFGYGACPDLEDRAKMMALLEPERIGVTLSEELQLHPEQSTDAFVLHHPEAKYFNV
>O31631 2.5.1.-~~~metI~~~Cystathionine gamma-synthase/O-acetylhomoserine (thiol)-lyase~~~COG0626
MSQHVETKLAQIGNRSDEVTGTVSAPIYLSTAYRHRGIGESTGFDYVRTKNPTRQLVEDAIANLENGARGLAFSSGMAAI
QTIMALFKSGDELIVSSDLYGGTYRLFENEWKKYGLTFHYDDFSDEDCLRSKITPNTKAVFVETPTNPLMQEADIEHIAR
ITKEHGLLLIVDNTFYTPVLQRPLELGADIVIHSATKYLGGHNDLLAGLVVVKDERLGEEMFQHQNAIGAVLPPFDSWLL
MRGMKTLSLRMRQHQANAQELAAFLEEQEEISDVLYPGKGGMLSFRLQKEEWVNPFLKALKTICFAESLGGVESFITYPA
TQTHMDIPEEIRIANGVCNRLLRFSVGIEHAEDLKEDLKQALCQVKEGAVSFE
>P31547 ~~~metI~~~D-methionine transport system permease protein MetI~~~COG2011
MSEPMMWLLVRGVWETLAMTFVSGFFGFVIGLPVGVLLYVTRPGQIIANAKLYRTVSAIVNIFRSIPFIILLVWMIPFTR
VIVGTSIGLQAAIVPLTVGAAPFIARMVENALLEIPTGLIEASRAMGATPMQIVRKVLLPEALPGLVNAATITLITLVGY
SAMGGAVGAGGLGQIGYQYGYIGYNATVMNTVLVLLVILVYLIQFAGDRIVRAVTRK
>P0A8U6 ~~~metJ~~~Met repressor~~~COG3060
MAEWSGEYISPYAEHGKKSEQVKKITVSIPLKVLKILTDERTRRQVNNLRHATNSELLCEAFLHAFTGQPLPDDADLRKE
RSDEIPEAAKEIMREMGINPETWEY
>Q63YH5 2.5.1.6~~~metK~~~S-adenosylmethionine synthase~~~COG0192
MANDYLFTSESVSEGHPDKVADQISDAILDAILAQDKYSRVAAETLCNTGLVVLAGEITTTANIDYIQIARDTIKRIGYD
NTDYGIDYRGCAVLVAYDKQSPDIAQGVDRAHDNNLDQGAGDQGLMFGYACDETPELMPLPIHLSHRLVERQANLRRDGR
LPWLRPDAKSQVTVRYVDGKPHSIDTVVLSTQHAPEIDLPALREAVIEEVIKPTLPADLIKGDIKFLVNPTGRFVIGGPQ
GDCGLTGRKIIVDTYGGAAPHGGGAFSGKDPSKVDRSAAYAGRYVAKNIVAAGLASRALIQVSYAIGVAEPTSVMVNTFG
TGRVSDETITKLVREHFDLRPKGIIQMLDLLRPIYEKTAAYGHFGREEPEFSWEAADKALALAEAAGVEPAVQVA
>Q729A3 2.5.1.6~~~metK~~~S-adenosylmethionine synthase~~~COG0192
MIPSKGKYYFTSESVTEGHPDKVADQISDAVLDVLLAQDPNSRVACETLVTTGMAVIAGEITTRGYADLPHVVRETIRNI
GYNSSEMGFDWQTCAVISSIDKQSADIAQGVDRATNEDQGAGDQGMMFGFACDETATLMPAPIYWAHQLSQRLTEVRKDG
TVDIFRPDGKTQVSFEYVDGKPVRINNVVVSTQHKDSASQADIIDAVKTHVIRPILEPSGFFDEKACDIFINTTGRFVIG
GPMGDCGLTGRKIIQDTYGGMGHHGGGAFSGKDASKVDRSGAYMARYIAKNVVASGLAPKCEVQIAYCIGVAEPVSVLVS
SQGTASVPDEVLTRAVREVFDLRPFHITRRLDLLRPIYGKTSCYGHFGRELPEFTWEHTDAAADLRTAAKV
>P0A817 2.5.1.6~~~metK~~~S-adenosylmethionine synthase~~~COG0192
MAKHLFTSESVSEGHPDKIADQISDAVLDAILEQDPKARVACETYVKTGMVLVGGEITTSAWVDIEEITRNTVREIGYVH
SDMGFDANSCAVLSAIGKQSPDINQGVDRADPLEQGAGDQGLMFGYATNETDVLMPAPITYAHRLVQRQAEVRKNGTLPW
LRPDAKSQVTFQYDDGKIVGIDAVVLSTQHSEEIDQKSLQEAVMEEIIKPILPAEWLTSATKFFINPTGRFVIGGPMGDC
GLTGRKIIVDTYGGMARHGGGAFSGKDPSKVDRSAAYAARYVAKNIVAAGLADRCEIQVSYAIGVAEPTSIMVETFGTEK
VPSEQLTLLVREFFDLRPYGLIQMLDLLHPIYKETAAYGHFGREHFPWEKTDKAQLLRDAAGLK
>Q88XB8 2.5.1.6~~~metK~~~S-adenosylmethionine synthase~~~COG0192
MSERHLFTSESVSEGHPDKIADQISDAILDAMLAQDPQARVAVETSVTTGLVLVFGEVSTKAYVDIQKVVRDTIKSIGYV
DGQYGFDGDNCAVLVSLDEQSPDIAQGVDDSLETRSGDADPLDQIGAGDQGMMFGYAINETPELMPLPIALSHRLMRKIA
ALRKDGTIKWLRPDAKAQVTVEYDEDNQPKRIDTVVLSTQHDPDVDLDTIRQTVIDQVIKAVLPADLLDDQTKYLVNPTG
RFVIGGPQGDAGLTGRKVIVDTYGGFAHHGGGAFSGKDATKVDRSASYAARYIAKNVVAAGLADQVEVQLAYAIGVAEPV
SIAVDTAGTGKVSDEALINAIRENFDLRPAGIIKMLDLQRPIYRQTAAYGHFGRTDIDLPWEHTDKVDALKAVFK
>A0QI26 2.5.1.6~~~metK~~~S-adenosylmethionine synthase~~~
MSEKGRLFTSESVTEGHPDKICDAISDSVLDALLAQDPRSRVAVETLVTTGQVHVVGEVTTTAKEAFADITNTVRERILD
IGYDSSDKGFDGASCGVNIGIGAQSPDIAQGVDTAHETRVEGAADPLDAQGAGDQGLMFGYAINDTPERMPLPIALAHRL
SRRLTEVRKNGVLPYLRPDGKTQVTIEFEDDVPVRLDTVVISTQHAADIDLENTLTPDIREKVLNTVLNDLAHDTLDTSS
TRLLVNPTGKFVVGGPMGDAGLTGRKIIVDTYGGWARHGGGAFSGKDPSKVDRSAAYAMRWVAKNIVAAGLAERVEVQVA
YAIGKAAPVGLFIETFGTATVDPVKIEKIVPEVFDLRPGAIIRDLDLLRPIYAQTAAYGHFGRTDVELPWEQLNKVDDLK
RAI
>B2HP50 2.5.1.6~~~metK~~~S-adenosylmethionine synthase~~~COG0192
MSEKGRLFTSESVTEGHPDKICDAVSDSVLDALLAADPRSRVAVETLVTTGQVHVVGEVTTTAKEAFADITNIVRERILD
IGYDSSDKGFDGASCGVNIGIGAQSPDIAQGVDTAHEARVEGAADPLDAQGAGDQGLMFGYAINDTPELMPLPIALAHRL
SRRLTEVRKNGVLPYLRPDGKTQVTIAYEDRVPVRLDTVVISTQHADDIDLVKTLDPDIREQVLKTVLDDLAHDTLDASA
VRVLVNPTGKFVLGGPMGDAGLTGRKIIVDTYGGWARHGGGAFSGKDPSKVDRSAAYAMRWVAKNVVAAGLAERVEVQVA
YAIGKAAPVGLFVETFGSEAVDPVKIEKAIGEVFDLRPGAIIRDLNLLRPIYAPTAAYGHFGRTDVDLPWERLDKVDDLK
RAI
>A0QWT3 2.5.1.6~~~metK~~~S-adenosylmethionine synthase~~~COG0192
MSKGRLFTSESVTEGHPDKICDAISDSVLDALLEQDPKSRVAVETLVTTGQVHVAGEVTTTAYADIPKIVRDRILDIGYD
SSTKGFDGASCGVNVAIGAQSPDIAQGVDTAHETRVEGKADPLDLQGAGDQGLMFGYAIGDTPELMPLPIALAHRLARRL
TEVRKNGVLDYLRPDGKTQVTIQYDGTTPVRLDTVVLSTQHADGIDLEGTLTPDIREKVVNTVLADLGHETLDTSDYRLL
VNPTGKFVLGGPMGDAGLTGRKIIVDTYGGWARHGGGAFSGKDPSKVDRSAAYAMRWVAKNVVAAGLAERVEVQVAYAIG
KAAPVGLFVETFGSETVDPAKIEKAIGEVFDLRPAAIVRDLDLLRPIYAPTAAYGHFGRTDIELPWEQTNKVDDLKSAI
>P9WGV1 2.5.1.6~~~metK~~~S-adenosylmethionine synthase~~~COG0192
MSEKGRLFTSESVTEGHPDKICDAISDSVLDALLAADPRSRVAVETLVTTGQVHVVGEVTTSAKEAFADITNTVRARILE
IGYDSSDKGFDGATCGVNIGIGAQSPDIAQGVDTAHEARVEGAADPLDSQGAGDQGLMFGYAINATPELMPLPIALAHRL
SRRLTEVRKNGVLPYLRPDGKTQVTIAYEDNVPVRLDTVVISTQHAADIDLEKTLDPDIREKVLNTVLDDLAHETLDAST
VRVLVNPTGKFVLGGPMGDAGLTGRKIIVDTYGGWARHGGGAFSGKDPSKVDRSAAYAMRWVAKNVVAAGLAERVEVQVA
YAIGKAAPVGLFVETFGTETEDPVKIEKAIGEVFDLRPGAIIRDLNLLRPIYAPTAAYGHFGRTDVELPWEQLDKVDDLK
RAI
>Q5FAC0 2.5.1.6~~~metK~~~S-adenosylmethionine synthase~~~
MSEYLFTSESVSEGHPDKVADQVSDAILDAILAQDPKARVAAETLVNTGLCVLAGEITTTAQVDYIKVARETIKRIGYNS
SELGFDANGCAVGVYYDQQSPDIAQGVNEGEGIDLNQGAGDQGLMFGYACDETPTLMPFAIYYSHRLMQRQSELRKDGRL
PWLRPDAKAQLTVVYDSETGKVKRIDTVVLSTQHDPAISQEELSKAVIEQIIKPVLPPELLTDETKYLINPTGRFVIGGP
QGDCGLTGRKIIVDTYGGAAPHGGGAFSGKDPSKVDRSAAYACRYVAKNIVAAGLATQCQIQVSYAIGVAEPTSISIDTF
GTGKISEEKLIALVCEHFDLRPKGIVQMLDLLRPIYGKSAAYGHFGREEPEFTWERTDKAASLKAAAGL
>P66767 2.5.1.6~~~metK~~~S-adenosylmethionine synthase~~~
MLNNKRLFTSESVTEGHPDKIADQVSDAILDAILKDDPNARVACETTVTTGMALIAGEISTTTYVDIPKVVRETIKEIGY
TRAKYGYDYETMAILTAIDEQSPDIAQGVDKALEYRDKDSEEEIEATGAGDQGLMFGYATNETETYMPLAIYLSHQLAKR
LSDVRKDGTLNYLRPDGKVQVTVEYDENDNPVRIDTIVVSTQHADDVTLEQIQEDIKAHVIYPTVPENLINEQTKFYINP
TGRFVIGGPQGDAGLTGRKIIVDTYGGYARHGGGCFSGKDPTKVDRSAAYAARYVAKNIVAAGLADQCEVQLAYAIGVAE
PVSIAIDTFGTGKVSEGQLVEAVRKHFDLRPAGIIKMLDLKQPIYKQTAAYGHFGRTDVLFPWEKLDKVEELKDAVKY
>Q72I53 2.5.1.6~~~metK~~~S-adenosylmethionine synthase~~~COG0192
MRALRLVTSESVTEGHPDKLADRISDAILDALIAQDKKARVAAETLVTTGLVFVAGEITTEGYVDIPNLVRKTVREVGYT
RAKYGFDADTCAVLTAIDEQSPDIAGGVNLSYEWRVLKSTDPLDRVGAGDQGLMFGYATDETPELMPLPITLAHRLTMRL
AEVRKTGLLPYLRPDGKAQVTVVYEGDKPLYVKTVVVSAQHSPEVEQEQLREDLIREVVRQAIPPEYLKDGETEYLINPS
GRFILGGPHADTGLTGRKIIVDTYGGAVPHGGGAFSGKDPTKVDRSASYYARYMAKNIVAAGLARRALVELAYAIGKARP
VSLRVETFGTGVLPDEKLTEIAKKVFDPRPLAIIEELDLLRPIYTPTSAYGHFGRPGFPWEETDRVEALRREAGL
>Q831K6 7.4.2.11~~~metN2~~~Methionine import ATP-binding protein MetN 2~~~COG1135
MALIELRHVKKEFSGKAGKVTALKDIDLTVESGDIYGIIGYSGAGKSTLVRLLNGLETPTEGEVEIQGQDIALLPNKELR
NFRKKIGMIFQHFNLLWSRTVLENIMLPLEIAGVPKQNRKSRAEELIKLVGLEGRETAYPSQLSGGQKQRVGIARALANN
PDILLCDEATSALDPQTTDEVLELLLKINQELNLTVVLITHEMHVIRKICNRVAVMEYGEIVEEGKVIDIFKKPQTEIAK
RFIQQEADKNIEETELVVEEMLEQYPNGKIVRLLFHGEQAKLPIISHIVQEYQVEVSIIQGNIQQTKQGAVGSLYIQLLG
EEQNILAAIEGLRKLRVETEVIGNE
>Q99VG8 7.4.2.11~~~metN2~~~Methionine import ATP-binding protein MetN 2~~~
MIELKEVVKEYRTKNKEVLAVDHVNLSIRAGSIYGVIGFSGAGKSTLIRMFNHLEAPTSGEVIIDGDHIGQLSKNGLRAK
RQKVNMIFQHFNLLWSRTVLKNIMFPLEIAGVPRRRAKQKALELVELVGLKGREKAYPSELSGGQKQRVGIARALANDPT
VLLCDEATSALDPQTTDEILDLLLKIREQQNLTIVLITHEMHVIRRICDEVAVMESGKVIEHGPVTQVFENPQHTVTKRF
VKEDLNDDFETSLTELEPLEKDAYIVRLVFAGSTTTEPIVSSLSTAYDIKINILEANIKNTKNGTVGFLVLHIPYISSVD
FGKFEKELIERQVKMEVLRHG
>Q7A6M2 7.4.2.11~~~metN2~~~Methionine import ATP-binding protein MetN 2~~~
MIELKEVVKEYRTKNKEVLAVDHVNLSIRAGSIYGVIGFSGAGKSTLIRMFNHLEAPTSGEVIIDGDHIGQLSKNGLRAK
RQKVNMIFQHFNLLWSRTVLKNIMFPLEIAGVPRRRAKQKALELVELVGLKGREKAYPSELSGGQKQRVGIARALANDPT
VLLCDEATSALDPQTTDEILDLLLKIREQQNLTIVLITHEMHVIRRICDEVAVMESGKVIEHGPVTQVFENPQHTVTKRF
VKEDLNDDFETSLTELEPLEKDAYIVRLVFAGSTTTEPIVSSLSTAYDIKINILEANIKNTKNGTVGFLVLHIPYISSVD
FGKFEKELIERQVKMEVLRHG
>O32169 7.4.2.11~~~metN~~~Methionine import ATP-binding protein MetN~~~COG1135
MINLQDVSKVYKSKHGDVNAVQNVSLSIKKGEIFGIIGYSGAGKSSLIRLLNGLEKPTSGTVEVAGTKINEVNGRGLRKA
RHEISMIFQHFNLLWSRTVRDNIMFPLEIAGVKKSERIKRANELIKLVGLEGKEKSYPSQLSGGQKQRVGIARALANNPK
VLLCDEATSALDPQTTDSILDLLSDINERLGLTIVLITHEMHVIRKICNRVAVMENGKVVEEGEVLDVFKNPKEQMTKRF
VQQVTEPEETKETLQHLLDDTASGKMVQLTFVGESAEQPLITEMIRNFNVSVNILQGKISQTKDGAYGSLFIHIDGDEEE
VQNVIRFINDKQVKAEVITNV
>P30750 7.4.2.11~~~metN~~~Methionine import ATP-binding protein MetN~~~COG1135
MIKLSNITKVFHQGTRTIQALNNVSLHVPAGQIYGVIGASGAGKSTLIRCVNLLERPTEGSVLVDGQELTTLSESELTKA
RRQIGMIFQHFNLLSSRTVFGNVALPLELDNTPKDEVKRRVTELLSLVGLGDKHDSYPSNLSGGQKQRVAIARALASNPK
VLLCDEATSALDPATTRSILELLKDINRRLGLTILLITHEMDVVKRICDCVAVISNGELIEQDTVSEVFSHPKTPLAQKF
IQSTLHLDIPEDYQERLQAEPFTDCVPMLRLEFTGQSVDAPLLSETARRFNVNNNIISAQMDYAGGVKFGIMLTEMHGTQ
QDTQAAIAWLQEHHVKVEVLGYV
>Q87RS1 7.4.2.11~~~metN~~~Methionine import ATP-binding protein MetN~~~COG1135
MIEIKNVNKVFYQGSKEILALKDINLHIAKGTIFGVIGSSGAGKSTLIRCVNMLEAPSSGSIIVDGVDLTTLSKKQLVET
RRNIGMIFQHFNLLSSRTVFDNVALPLELAGKDKSQITTKVTELLKLVGLADKHESYPSNLSGGQKQRVAIARALASDPS
VLLCDEATSALDPATTQSILELLKEINRKLNITILLITHEMEVVKSICHEVAIIGGGELVEKGTVGDIFAHPKTELAHEF
IRSTLDLSIPEDYQARLQPNRVEGSYPLVRMEFTGATVDAPLMSQISRKYNIDVSILSSDLDYAGGVKFGMMVAELFGNE
QDDSAAIEYLREHNVKVEVLGYVL
>O32168 ~~~metP~~~Methionine import system permease protein MetP~~~COG2011
MFEKYFPNVDLTELWNATYETLYMTLISLLFAFVIGVILGLLLFLTSKGSLWQNKAVNSVIAAVVNIFRSIPFLILIILL
LGFTKFLVGTILGPNAALPALVIGSAPFYARLVEIALREVDKGVIEAAKSMGAKTSTIIFKVLIPESMPALISGITVTAI
ALIGSTAIAGAIGSGGLGNLAYVEGYQSNNADVTFVATVFILIIVFIIQIIGDLITNIIDKR
>O32167 ~~~metQ~~~Methionine-binding lipoprotein MetQ~~~COG1464
MKKLFLGALLLVFAGVMAACGSNNGAESGKKEIVVAATKTPHAEILKEAEPLLKEKGYTLKVKVLSDYKMYNKALADKEV
DANYFQHIPYLEQEMKENTDYKLVNAGAVHLEPFGIYSKTYKSLKDLPDGATIILTNNVAEQGRMLAMLENAGLITLDSK
VETVDATLKDIKKNPKNLEFKKVAPELTAKAYENKEGDAVFINVNYAIQNKLNPKKDAIEVESTKNNPYANIIAVRKGEE
DSAKIKALMEVLHSKKIKDFIEKKYDGAVLPVSE
>P28635 ~~~metQ~~~D-methionine-binding lipoprotein MetQ~~~COG1464
MAFKFKTFAAVGALIGSLALVGCGQDEKDPNHIKVGVIVGAEQQVAEVAQKVAKDKYGLDVELVTFNDYVLPNEALSKGD
IDANAFQHKPYLDQQLKDRGYKLVAVGNTFVYPIAGYSKKIKSLDELQDGSQVAVPNDPTNLGRSLLLLQKVGLIKLKDG
VGLLPTVLDVVENPKNLKIVELEAPQLPRSLDDAQIALAVINTTYASQIGLTPAKDGIFVEDKESPYVNLIVTREDNKDA
ENVKKFVQAYQSDEVYEAANKVFNGGAVKGW
>P31728 ~~~metQ~~~Probable D-methionine-binding lipoprotein MetQ~~~COG1464
MKLKQLFAITAIASALVLTGCKEDKKPEAAAAPLKIKVGVMSGPEHQVAEIAAKVAKEKYGLDVQFVEFNDYALPNEAVS
KGDLDANAMQHKPYLDEDAKAKNLNNLVIVGNTFVYPLAGYSKKIKNVNELQDGAKVVVPNDPTNRGRALILLEKQGLIK
LKDANNLLSTVLDIVENPKKLNITEVDTSVAARALDDVDLAVVNNTYAGQVGLNAQDDGVFVEDKDSPYVNIIVSRTDNK
DSKAVQDFIKSYQTEEVYQEAQKHFKDGVVKGW
>A4J778 1.16.1.-~~~~~~Metal reductase~~~COG0446
MSILFSPAQIGTLQLRNRIIMTPMHLGYTPNGEVTDQLIEFYRVRARGGAGLIVVGGCGIDRIGNAYGMTQLDDDRFIPG
LRRLADAVQAEGAKIVAQLYQAGRYAHSALTGQPAVAPSPIPSKLTGETPVELTEEKIAEIVASFAKAAKRAKTAGFDGV
EIIASAGYLISQFLSPLTNKRTDRYGGDLQARMTFGLEVVAAVREAVGSDYPIIVRVAGNDFMPGSHTNTEAQVFCQAME
KSGVNAINVTGGWHETQVPQITMNVPPGAYAYLAYGIKQAVSIPVIACNRINTPDLAEAILQEGKADFIGMARSLMADPE
LPNKAMSGHPEQIRPCIGCNQGCLDHVFRMKPVSCLVNAEAGREAELSLTPTSQPGKILVIGAGAAGLEFARVAALRGHK
VTIWEESDQAGGQLILAAAPPGRKDFLHLRTYLVNACRDLGVEIQYHTKATPENILSAVQEGKFNRVVIATGAHPITPPI
PIEEGVKVIQAWDVLAGRSKAGQNIIIVGGGAVGVETALLLAESGTLDNETLRFLMLQQAETEKELYRLLIQGTKKITVL
EMANGIGRDIGPSTRWSMLADLKRHQVNCLDETTVLEIRREGVLVKNAGTQKILPADTVILAVGSRSQNELYQALQGKVE
YLSIIGDAIKPRKVMDAIHQAYNEAIKY
>P0A9F9 ~~~metR~~~HTH-type transcriptional regulator MetR~~~COG0583
MIEVKHLKTLQALRNCGSLAAAAATLHQTQSALSHQFSDLEQRLGFRLFVRKSQPLRFTPQGEILLQLANQVLPQISQAL
QACNEPQQTRLRIAIECHSCIQWLTPALENFHKNWPQVEMDFKSGVTFDPQPALQQGELDLVMTSDILPRSGLHYSPMFD
YEVRLVLAPDHPLAAKTRITPEDLASETLLIYPVQRSRLDVWRHFLQPAGVSPSLKSVDNTLLLIQMVAARMGIAALPHW
VVESFERQGLVVTKTLGEGLWSRLYAAVRDGEQRQPVTEAFIRSARNHACDHLPFVKSAERPTYDAPTVRPGSPARL
>Q9AAS1 2.3.1.31~~~metXA~~~Homoserine O-acetyltransferase~~~COG2021
MAALDPITPAGGGTWRFPANEPLRLDSGGVIEGLEIAYQTYGQLNADKSNAVLICHALTGDQHVASPHPTTGKPGWWQRL
VGPGKPLDPARHFIICSNVIGGCMGSTGPASINPATGKTYGLSFPVITIADMVRAQAMLVSALGVETLFAVVGGSMGGMQ
VQQWAVDYPERMFSAVVLASASRHSAQNIAFHEVGRQAIMADPDWRGGAYAEHGVRPEKGLAVARMAAHITYLSEPALQR
KFGRELQRDGLSWGFDADFQVESYLRHQGSSFVDRFDANSYLYITRAMDYFDIAASHGGVLAKAFTRARNVRFCVLSFSS
DWLYPTAENRHLVRALTAAGARAAFAEIESDKGHDAFLLDEPVMDAALEGFLASAERDRGLV
>A9WKM8 2.3.1.31~~~metXA~~~Homoserine O-acetyltransferase~~~COG2021
MEAIVQAPTPEGVGIVRTQRMHWTTPLTLTSGATLGPITLAYETYGELAPDRSNAILILHALSGDAHAAGFHSPTDRKPG
WWDAMIGPGRPFDTNRYFVICSNVIGGCRGSTGPSSPHPSDGRPYGSRFPLITIEDMVHAQQRLIDALGIDTLLAVAGGS
MGGFQALAWTVEYPQRVRGAILLATSARSSPQTVAWNYIGRRAIMADPRWRGGDYYDSDAPRDGLAVARMLGHITYLCEE
KLEQRFGRRVDGDALDLGPRFAIEHYLEHQAARFNDRFDANSYLVITRAMDNWDLTARYGSLTAAFDLTRARFLALAYSS
DWLYPPAETYQMAAAAQAAGRSFTTHLITTDAGHDAFLTDVAAQSELIRDFLNRLMTE
>O68640 2.3.1.31~~~metXA~~~Homoserine O-acetyltransferase~~~COG2021
MPTLAPSGQLEIQAIGDVSTEAGAIITNAEIAYHRWGEYRVDKEGRSNVVLIEHALTGDSNAADWWADLLGPGKAINTDI
YCVICTNVIGGCNGSTGPGSMHPDGNFWGNRFPATSIRDQVNAEKQFLDALGITTVAAVLGGSMGGARTLEWAAMYPETV
GAAAVLAVSARASAWQIGIQSAQIKAIENDHHWHEGNYYESGCNPATGLGAARRIAHLTYRGELEIDERFGTKAQKNENP
LGPYRKPDQRFAVESYLDYQADKLVQRFDAGSYVLLTDALNRHDIGRDRGGLNKALESIKVPVLVAGVDTDILYPYHQQE
HLSRNLGNLLAMAKIVSPVGHDAFLTESRQMDRIVRNFFSLISPDEDNPSTYIEFYI
>G0J5N4 2.3.1.31~~~metXA~~~Homoserine O-acetyltransferase~~~COG2021
MNLQSPHLTIEMTQEIFYCQEALSLESGESFPEFQLSFTTQGQLNANKDNVIWVLHALTGDANPHEWWSGLIGEDKFFDP
SKYFIVCANFLGSCYGSTQPLSNNPNNGKPYYYDFPNITTRDIASALDKLRIHLGLEKINTVIGGSLGGQVGLEWAVSLG
EKLENAIIVASNAKASPWIIGFNETQRMAIESDSTWGKTQPEAGKKGLETARAIGMLSYRHPMTFLQNQSETEEKRDDFK
ISSYLRYQGLKLANRFNAMSYWILSKAMDSHDIGRGRGGTPVALSNIKCKVLSIGVDTDILFTSEESRYISKHVPKGTYR
EISSIYGHDAFLIEYEQLQYILKSFYLENNG
>D3P9D1 2.3.1.31~~~metXA~~~Homoserine O-acetyltransferase~~~COG2021
MKENSVGLVKTKYVTFKDDFYFESGRILSPITVAYETYGKLNEKKDNAILICHALTGSAHAAGYNSPDDQKPGWWDDMIG
PGKAFDTDKYFIICSNFLGSCYGTTGPASIDPSTGKPYGLKFPVFTVKDMVKLQKKLIDYLGIEKLLCVAGGSMGGMQAL
EWAVTFPEKTYSIIPIATAGRITPMAIAFNTIGRFAIMKDPNWMNGDYYGKTFPRDGLAIARMAGHITYMSDKSFHKKFG
RRYATFGGIYDFFGYFEVENYLRYNGYKFTERFDANSYLYIIKAMDIFDLSYGYGSYEEAIGRIEADSLFITFTSDFLFP
SYQTEEIVNIMKNHGKNPEWVNIESDYGHDAFLLEFDTQTSCIKEFLSKIYNKVANQ
>Q1J115 2.3.1.31~~~metXA~~~Homoserine O-acetyltransferase~~~COG2021
MTALISQPDLLPPPAPERCPPQQTARLFRETPLLLDCGQVVQDVRVAYHTYGTPSDHAILVLHALTGTSAVHEWWPDFLG
EGKPLDPTRDYIVCANVLGGCAGSTGPAELPRVNGEDPPLTLRDMARVGRALLEELGVRRVSVIGASMGGMLAYAWLLEC
PDLVDRAVIIGAPARHSPWAIGLNTAARNAIRAAPGGEGLKVARQIAMLSYRSPESFALTQSGWGTRRPGTPDITTYLEH
QGEKLSTRFCERSYLALTGAMDRFQPTDAELRSIRVPVLVVGISSDVLYPPAEVRTYAGLLPRGQYLELQSPHGHDAFLI
DPQGLPEAAAAFLHGA
>P45131 2.3.1.31~~~metXA~~~Homoserine O-acetyltransferase~~~COG2021
MSVQNVVLFDTQPLTLMLGGKLSHINVAYQTYGTLNAEKNNAVLICHALTGDAEPYFDDGRDGWWQNFMGAGLALDTDRY
FFISSNVLGGCKGTTGPSSINPQTGKPYGSQFPNIVVQDIVKVQKALLDHLGISHLKAIIGGSFGGMQANQWAIDYPDFM
DNIVNLCSSIYFSAEAIGFNHVMRQAVINDPNFNGGDYYEGTPPDQGLSIARMLGMLTYRTDLQLAKAFGRATKSDGSFW
GDYFQVESYLSYQGKKFLERFDANSYLHLLRALDMYDPSLGYDNVKEALSRIKARYTLVSVTTDQLFKPIDLYKSKQLLE
QSGVDLHFYEFPSDYGHDAFLVDYDQFEKRIRDGLAGN
>Q8F4I0 2.3.1.31~~~metXA~~~Homoserine O-acetyltransferase~~~
MNETGSIGIIETKYAEFKELILNNGSVLSPVVIAYETYGTLSSSKNNAILICHALSGDAHAAGYHSGSDKKPGWWDDYIG
PGKSFDTNQYFIICSNVIGGCKGSSGPLSIHPETSTPYGSRFPFVSIQDMVKAQKLLVESLGIEKLFCVAGGSMGGMQAL
EWSIAYPNSLSNCIVMASTAEHSAMQIAFNEVGRQAILSDPNWKNGLYDENSPRKGLALARMVGHITYLSDDKMREKFGR
NPPRGNILSTDFAVGSYLIYQGESFVDRFDANSYIYVTKALDHYSLGKGKELTAALSNATCRFLVVSYSSDWLYPPAQSR
EIVKSLEAADKRVFYVELQSGEGHDSFLLKNPKQIEILKGFLENPN
>A0QSZ0 2.3.1.31~~~metXA~~~Homoserine O-acetyltransferase~~~COG2021
MTIIEERATDTGMATVPLPAEGEIGLVHIGALTLENGTVLPDVTIAVQRWGELAPDRGNVVMVLHALTGDSHVTGPAGDG
HPTAGWWDGVAGPGAPIDTDHWCAIATNVLGGCRGSTGPGSLAPDGKPWGSRFPQITIRDQVAADRAALAALGITEVAAV
VGGSMGGARALEWLVTHPDDVRAGLVLAVGARATADQIGTQSTQVAAIKADPDWQGGDYHGTGRAPTEGMEIARRFAHLT
YRGEEELDDRFANTPQDDEDPLTGGRYAVQSYLEYQGGKLARRFDPGTYVVLSDALSSHDVGRGRGGVEAALRSCPVPVV
VGGITSDRLYPIRLQQELAELLPGCQGLDVVDSIYGHDGFLVETELVGKLIRRTLELAQR
>P9WJY9 2.3.1.31~~~metXA~~~Homoserine O-acetyltransferase~~~COG2021
MTISDVPTQTLPAEGEIGLIDVGSLQLESGAVIDDVCIAVQRWGKLSPARDNVVVVLHALTGDSHITGPAGPGHPTPGWW
DGVAGPGAPIDTTRWCAVATNVLGGCRGSTGPSSLARDGKPWGSRFPLISIRDQVQADVAALAALGITEVAAVVGGSMGG
ARALEWVVGYPDRVRAGLLLAVGARATADQIGTQTTQIAAIKADPDWQSGDYHETGRAPDAGLRLARRFAHLTYRGEIEL
DTRFANHNQGNEDPTAGGRYAVQSYLEHQGDKLLSRFDAGSYVILTEALNSHDVGRGRGGVSAALRACPVPVVVGGITSD
RLYPLRLQQELADLLPGCAGLRVVESVYGHDGFLVETEAVGELIRQTLGLADREGACRR
>B9L9I6 2.3.1.31~~~metXA~~~Homoserine O-acetyltransferase~~~COG2021
MKIETKIAKFTKPLYLESGRILEPWQIIYETYGELNEKKDNVILITHALSGSHHAAGMYEGDRKPGWWDGLIGDGKAIDT
TKYFVISTNVIGSCFGSTSPMSPIHPGSSERYRLKFPVVTIKDMVKAQKILLDSLGIRHLKAIVGGSMGGMQALRFAVDF
PGFCENIIPIATTYQTKPYVIAINKSMIEAIRADSEFKNGNYDPDIIKQNGLKGLAAARMIGYLNYISPKTFERKFGREY
VKTDGMFELFGRFQVESYLEYNGAMFPKWFDPLSYIYILKAISLFDISRGFVSLEDAFSQIKDKLHLISFSGDTLFFPEE
MRDIKNYMDKVGGKCNYFEINSDYGHDSFLVELEKFDFIISDILKGEV
>K4ICC9 2.3.1.31~~~metXA~~~Homoserine O-acetyltransferase~~~COG2021
MPEIFKYQDPFQLENGEVLPELEVSYSTLGKLNKEKSNVIWVCHALTANAQPEDWWRGLIGNEKGIDTEKYFIVCANIIG
SCYGSTNPKSINPETGEVYGLNFPLFSIRDVTKSLELLSEALEIEHIQFLIGGSMGGMQAMEWAIEKPDKIKNLILLATN
AKHSSWGIALNETQRMAIEADSTFYKKETNSGKKGLEAARAIALLSYRNYNTYRHTQVDQEHTADHFRASTYQKYQGEKL
SKRFNAKCYWYLSKAMDSHNVGRNRGDCKKALAKIKAETLVIAVQSDLLFPVEEQRFLAQYIPKGKLEIIDSIYGHDGFL
IEVEKIKSLVHKHFKL
>Q2S5A6 2.3.1.31~~~metXA~~~Homoserine O-acetyltransferase~~~COG2021
MSQTLTVPTLTLENGTTLRDVPVAYRTWGTLNATGTNAVLVCHALTGDTNVADWWGGLLGPGRALDPTEDFVVCLNVPGS
PYGSVAPVTVNPDTGERYGAGFPPFTTRDTVRLHRRALETLGVQRVACAVGGSMGGMHVLEWAFEATDDGAPFVRSLVPI
AVGGRHTAWQIGWGAAQRQAIFADPKWRDGTYPPDDPPTNGLATARMMAMVSYRSRPSLDGRFGRDAMPEQDGTPYAVES
YLHHHGNKLVDRFDANCYVALTRQMDTHDVARGRGDYAKVLRAIEQPSLVVGIDSDVLYPLSEQEELAEHLPSATLEVLS
APHGHDTFLIELDALNDLVSTWRANICSSVAA
>C4XNQ9 2.3.1.31~~~metXA~~~Homoserine O-acetyltransferase~~~COG2021
MSEYIEHSPSGTSVGRVEKRFFTVAEADSPLTVESGRALGPVVLAYETCGQLNERADNAVLVLHALTGDSHAAGYYEPGD
AKPGWWDLMIGPGKPIDTDRYYVICSNVIGGCMGSTGPSSLDPATGQPYGLTFPVITIGDMVRAQKRLVEHLGVTKLLSV
VGGSMGGMQALEWSVRYPDMVRTAVPLATTTKHSALAIAFNEVARQAIMADPNWNGGNYYDGVPPAHGLAVGRMIGHITY
LSDEAMRQKFDRRLQDRCENSFVLEEPDFQVESYLRYQGQKFVDRFDANSFLYITKAADYFNLEASHGCGSAVAAFAKAK
CRYLVASFSSDWLYPTYQSRSMVQAMKKNGLDVSFVELEAKWGHDAFLLPNARLSGMIARFLDRALVDAAKEDARAL
>E0UR96 2.3.1.31~~~metXA~~~Homoserine O-acetyltransferase~~~COG2021
MSLNLQTYTEHFTNPLYLESGRILEPYDITYETYGTMNEDKSNVVVVCHALTGSHHAAGLYEDETKPGWWDGFIGSGKAI
DTDKYFVICSNVIGSCFGSTGPMSLQHPYQEPYRYKFPVVSIKDMVKAQRILFDRLDIHRVHAIVGGSMGGMQALQFAIH
YPNFANKIIALATTHATQPWAIAFNKVAQESILNDPDFKQGYYDPDLLKEQGLSGMAVGRMAGHISFLSHESMREKFGRD
YKLTDGLYELFGKFQVESYLEYNGYNFTKWFDPLAYLYITKAINIYDLSRGFDSLAEALKRVTSALYLVSFKNDLLFKNF
EMKEIADELDKIGNKNHSYIDVKSDYGHDAFLVELNKFENHVKDALNG
>Q9RA51 2.3.1.31~~~metXA~~~Homoserine O-acetyltransferase~~~COG2021
MSEIALEAWGEHEALLLKPPRSPLSIPPPKPRTAVLFPRREGFYTELGGYLPEVRLRFETYGTLSRRRDNAVLVFHALTG
SAHLAGTYDEETFRSLSPLEQAFGREGWWDSLVGPGRILDPALYYVVSANHLGSCYGSTGPLSLDPHTGRPYGRDFPPLT
IRDLARAQARLLDHLGVEKAIVIGGSLGGMVALEFALMYPERVKKLVVLAAPARHGPWARAFNHLSRQAILQDPEYQKGN
PAPKGMALARGIAMMSYRAPEGFEARWGAEPELGETYLDYQGEKFLRRFHAESYLVLSRAMDTHDVGRGRGGVEEALKRL
RAIPSLFVGIDTDLLYPAWEVRQAAKAAGARYREIKSPHGHDAFLIETDQVEEILDAFLP
>B3E278 2.3.1.31~~~metXA~~~Homoserine O-acetyltransferase~~~COG2021
MSIGVVHEQTITFEAGIRLESGRILAPITLVYELYGTMNADCSNVIMVEHAWTGDAHLAGKRREDDPKPGWWDAIVGPGR
LLDTDRYCVLCSNVIGSCYGSTGPASINPRTGKRYNLSFPVITVRDMVRAQELLLDHLGIRRLLCVMGGSMGGMQALEWA
TQYPERVASVVALATTPRPSPQAISLNAVARWAIYNDPTWKKGEYKHNPKDGLALARGIGHITFLSDESMWQKFERRFSA
KDGLFDFFGQFEVERYLNYNGYNFVDRFDANCFLYLAKALDLYDVAWGYESMTDAFSRITAPIQFFAFSSDWLYPPYQTE
EMVTCLQGLGKEVEYHLIQSAYGHDAFLLEHETFTPMVRSLLERVAP
>Q3M5Q6 2.3.1.31~~~metXA~~~Homoserine O-acetyltransferase~~~COG2021
MNYQDFISEQTEYYHLPVPFELEGGGVLTGVQVAYRTWGKLNSAGDNGVLICHALTGSADADEWWEGLLGANKALDSDRD
FIICSNILGSCYGTTGATSINPQTGIPYGASFPAITIRDMVRLQAALIQHLGIKSLQLVIGGSLGGMQVLEWALLYPEIV
QAIAPIATSGRHSAWCIGLSEAQRQAIYADPNWKGGNYTKEQPPSQGLAVARMMAMSAYRSWQSFTARFGRQYDAVADQF
AIASYLQHHGQKLVQRFDANTYITLTQAMDSHDVAQGRDYKSVLQSIKQPALVVAIDSDILYPPTEQQELADFIPDAQLG
WLQSSYGHDAFLIDIATLSQLVINFRQSLSLKTFSDVTT
>Q6FEQ3 2.3.1.46~~~metXS~~~Homoserine O-succinyltransferase~~~COG2021
MSFPSDSVGLVTPQKFQFEEPLHLECGRVLPRFELMVETYGTLNADYSNAILICHALSGHHHAAGYHHDDDKKAGWWDAC
IGPGKAIDTNKFFVVALNNIGGCNGSTGPTSPNPENENRPYGPDFPLVTVRDWVKTQALLSDHLGIQSWYAVIGGSLGGM
QALQWSVDYPDRLKNCVIIASAPKLSAQNIAFNEVARQSILSDPDFYHGRYLEHDSYPKRGLILARMVGHITYLSEEAMK
QKFGRDLKSGKFMYGFDVEFQVESYLRYQGEQFSRNFDANTYLIMTKALDYFDPSREYEQSLTKAMAQTQCRFLVVSFTT
DWRFAPERSQEIVDALITNHKPVTYLDVDAEQGHDSFLFPIPLYVKTLRAFLGGEAHLKSTPQVEIN
>A9I0E6 2.3.1.46~~~metXS~~~Homoserine O-succinyltransferase~~~COG2021
MTTPVPVPNGGPAPAAVPAAAEPGSVGIVTPQLIRFDTPLPLASGQSLQSYELAVETYGTLNAGRTNAVLVCHALNASHH
VAGLAADDPNDVGWWDNMVGPGKPLDTNRFFVIGVNNLGSCFGSTGPASINPATGHPWGAAFPVLTVEDWVHAQARLADH
FGIERFAAVMGGSLGGMQALSWAITCPERVAHCIVIASTPRLSAQNIGFNEVARRAIITDPDFHGGDYYAHNTVPRRGLS
VARMIGHITYLSDDDMAEKFGRTQREPAEGGAYRYGYDVEFEVESYLRYQGEKFSRYFDANTYLLITRALDYFDPARGTG
GDLARALKPAQADFLLVSFSTDWRFPPERSREIVRALLKNGSPVTYAEIDAPHGHDAFLLDDARYHAVVRGYYERIAREL
GLDEPAAGAAPAAAEPACAEGCAA
>F4QV02 2.3.1.46~~~metXS~~~Homoserine O-succinyltransferase~~~
MTLALISTETTEEPSCKASVAAPAPKGGAQIGVKAARRPSELLQRDGAHDVAVAIPDDFELDFGGTLTQKRVIGRLHGKA
NAPLIVVAGGISADRYVHRTETKGLGWWSGAVGVRAPIDLTRFRVLAFDFAPEFGEDVKDAKTPLTITTQDQARLLALLL
DHLGVEKVAAFIGCSYGGMIALAFGELFPDWAEQLVVVSAAHRPHPLATAWRGIQRRILQLGLETGRIDQAVGLARELAM
TTYRTQEEFGDRFDSEAPSHAGQAYPVCDYLQARGRAYRDRTTPSRWLSLSDSIDRHRVEPEAITAPVTLIGFTTDRLCP
IEDMRELADRLPNLWRFEQHASVYGHDAFLKEDKLVADILTSVLKDIDQ
>Q2T284 2.3.1.46~~~metXS~~~Homoserine O-succinyltransferase~~~
MESIGVVAPHTMHFAEPLRLQSGSVLGNYQLVVETYGELNAARSNAVLVCHALNASHHVAGVYADDPRSTGWWDNMVGPG
KPLDTNRFFVIGVNNLGSCFGSTGPMSIDPATGTPYGARFPVVTVEDWVHAQARVADAFGIERFAAVMGGSLGGMQALAW
SLLYPERVAHCIDIASTPKLSAQNIAFNEVARSAILSDPDFHGGDYYAHGVKPRRGLRVARMIGHITYLSDDDMAEKFGR
ALRRADGALDAYNFNFDVEFEVESYLRYQGDKFADYFDANTYLLITRALDYFDPAKAFNGNLSAALAHTKAKYLVASFTT
DWRFAPARSREIVKALLDNRRSVSYAEIDAPHGHDAFLLDDARYHNLVRAYYERIAEEVGA
>B7X2B6 2.3.1.46~~~metXS~~~Homoserine O-succinyltransferase~~~COG2021
MSFIATPQFMHFDEPLPLQSGGSIADYDLAFETYGQLNADKSNAIVVCHALNASHHVAGSYEGQPKSEGWWDNMIGPGKP
VDTDKFFVIGINNLGSCFGSTGPMHTNPATGKPYGADFPVVTVEDWVDAQARLLDRLGIQTLAAVLGGSLGGMQALSWSL
RYPERMRHAVVVASAPNLNAENIAFNEVARRAIVTDPDFNGGHFYEHGVVPARGLRIARMVGHITYLSDDVMNQKFGRSL
RAPTLPAARGSLPPEGTDPTRGGPASDRRDYLYSTQDVEFQIESYLRYQGEKFSGYFDANTYLLITRALDYFDPARCHDG
DLTRALAVAKARFLLVSFTTDWRFAPARSREIVKSLLENNRDVSYAEIDAPHGHDAFLLDDPRYMSVMRSYFEGIAKELK
TTERAGGAA
>S2L5R8 2.3.1.46~~~metXS~~~Homoserine O-succinyltransferase~~~COG2021
MDSDTLMPELPSDSVGLVAPQTAHFDVPLALACGKTLQSYDLVYETYGKLNASRSNAVLICHALSGHHHAAGYHSREDRK
PGWWDAHIGPGKSIDTDRFFVISLNNLGGCHGSTGPCAINPDTGRQWGPDFPMMTVGDWVHSQARLADRLGIERFAAVIG
GSLGGMQVLQWSLAYPERIANAVVIAATPKLSAQNIAFNEVARQAIRSDPDFYDGWYAEHDTLPRRGLKLARMVGHITYL
SEDAMGSKFGRDLRSDDLNFGYDVEFQVESYLRYQGDTFSTSFDANTYLLMTKALDYFDPAAAHDGDLAAAVAPASCPFL
VVSFSTDWRFPPSRSRELVDALIRAGKPVSYVCIDSPHGHDAFLLPETRYQAIFASFMGRVAHDSGLEDS
>D0L1T6 2.3.1.46~~~metXS~~~Homoserine O-succinyltransferase~~~COG2021
MSDASEKLAQSVLSRSVGIVEPKTARFSEPLALDCGRSLPSYELVYETYGQLNDEGSNAVLICHALSGDHHAAGFHAETD
RKPGWWDSAIGPGKPIDTDRFFVVCLNNLGGCKGSTGPLSVDPASGKPYGPDFPIVTVKDWVHAQYRLMQYLGLSGWAAV
IGGSLGGMQVLQWSITYPDAVAHAVVIAAAPRLSAQNIAFNEVARQAIITDPEFYGGRYADHNALPRRGLMLARMLGHIT
YLSDDAMRAKFGRELRAGQVQYGFDVEFQVESYLRYQGTSFVDRFDANTYLLMTKALDYFDPAQASNDDLVAALAEVKAH
FLVVSFTSDWRFSPERSREIVRALLASGKQVSYAEIESNHGHDAFLMTIPYYHRVLAGYMANIDFASTPRGVSSPVYSTG
GAV
>A0A0I9QGZ7 2.3.1.46~~~metXS~~~Homoserine O-succinyltransferase~~~
MSQNTSVGIVTPQKIPFEMPLVLENGKTLPRFDLMIETYGELNAEKNNAVLICHALSGNHHVAGKHSAEDKYTGWWDNMV
GPGKPIDTERFFVVGLNNLGGCDGSSGPLSINPETGREYGADFPMVTVKDWVKSQAALADYLGIEQWAAVVGGSLGGMQA
LQWAISYPERVRHALVIASAPKLSTQNIAFNDVARQAILTDPDFNEGHYRSHNTVPARGLRIARMMGHITYLAEDGLGKK
FGRDLRSNGYQYGYSVEFEVESYLRYQGDKFVGRFDANTYLLMTKALDYFDPAADFGNSLTRAVQDVQAKFFVASFSTDW
RFAPERSHELVKALIAAQKSVQYIEVKSAHGHDAFLMEDEAYMRAVTAYMNNVDKDCRL
>P57714 2.3.1.46~~~metXS~~~Homoserine O-succinyltransferase~~~
MPTVFPDDSVGLVSPQTLHFNEPLELTSGKSLAEYDLVIETYGELNATQSNAVLICHALSGHHHAAGYHSVDERKPGWWD
SCIGPGKPIDTRKFFVVALNNLGGCNGSSGPASINPATGKVYGADFPMVTVEDWVHSQARLADRLGIRQWAAVVGGSLGG
MQALQWTISYPERVRHCLCIASAPKLSAQNIAFNEVARQAILSDPEFLGGYFQEQGVIPKRGLKLARMVGHITYLSDDAM
GAKFGRVLKTEKLNYDLHSVEFQVESYLRYQGEEFSTRFDANTYLLMTKALDYFDPAAAHGDDLVRTLEGVEADFCLMSF
TTDWRFSPARSREIVDALIAAKKNVSYLEIDAPQGHDAFLMPIPRYLQAFSGYMNRISV
>Q88CT3 2.3.1.46~~~metXS~~~Homoserine O-succinyltransferase~~~COG2021
MSTVFPEDSVGLVVPQTARFDEPLALACGRSLASYELVYETYGTLNASASNAVLICHALSGHHHAAGYHAATDRKPGWWD
SCIGPGKPIDTNRFFVVSLNNLGGCNGSTGPSSVNPATGKPYGADFPVLTVEDWVHSQVRLGERLGIQQWAAVVGGSLGG
MQALQWTISYPERVRHCVDIASAPKLSAQNIAFNEVARQAILTDPEFHGGSFQDQGVIPKRGLMLARMVGHITYLSDDSM
GEKFGRELKSDKLNYDFHSVEFQVESYLRYQGEEFSGRFDANTYLLMTKALDYFDPAATHGGDLAATLAHVTADYCIMSF
TTDWRFSPARSREIVDALMAARKNVCYLEIDSPYGHDAFLIPTPRYMQGFSNYMNRIAI
>Q4ZZ78 2.3.1.46~~~metXS~~~Homoserine O-succinyltransferase~~~COG2021
MPTVFPHDSVGLVTPQTAHFSEPLALACGRSLPAYDLIYETYGQLNAARSNAVLICHALSGHHHAAGFHSADDRKPGWWD
SCIGPGKPIDTTKFFVVSLNNLGGCNGSTGPSSIDPDTGKPFGANFPVVTVEDWVNSQARLADLLGIDTWAAVIGGSLGG
MQALQWTISYPNRVRHCLAIASAPKLSAQNIAFNEVARQAILTDPEFHGGSFQERGVIPKRGLMLARMVGHITYLSDDSM
GEKFGRGLKSEKLNYDFHSVEFQVESYLRYQGEEFSGRFDANTYLLMTKALDYFDPAANFNDDLAKTFANATARFCVMSF
TTDWRFSPARSRELVDALMAARKDVCYLEIDAPQGHDAFLIPIPRYLQAFGNYMNRISL
>Q8P6V8 2.3.1.46~~~metXS~~~Homoserine O-succinyltransferase~~~COG2021
MVRIVPSARRTRAPAKLDGRSTPDIAMSLVTTASPLTTADTYTPAADSDAPPAVRGELVINLPMRHAGQRELRLRYELVG
AEQAPVVFVAGGISAHRHLAASAVFPEKGWVEGLVGAGRALDPASRRLLAFDFLGADGSLDAPIDTADQADAIAALLDAL
GIARLHGFVGYSYGALVGLQFASRHAARLHTLVAVSGAHRAHPYAAAWRALQRRAVALGQLQCAEHHGLALARQFAMLSY
RTPEEFSERFDAPPELINGRVRVAAEDYLDAAGAQYVARTPVNAYLRLSESIDLHRIDPAAVAVPTVVVAVEGDRLVPLA
DLVSLVEGLGPRGSLRVLRSPFGHDAFLKEIDRIDAILTTALRTTGETA
>Q5SK88 2.5.1.-~~~oah1~~~O-acetyl-L-homoserine sulfhydrylase 1~~~COG2873
MRFETLQLHAGYEPEPTTLSRQVPIYPTTSYVFKSPEHAANLFALKEFGNIYSRIMNPTVDVLEKRLAALEGGKAALATA
SGHAAQFLALTTLAQAGDNIVSTPNLYGGTFNQFKVTLKRLGIEVRFTSREERPEEFLALTDEKTRAWWVESIGNPALNI
PDLEALAQAAREKGVALIVDNTFGMGGYLLRPLAWGAALVTHSLTKWVGGHGAVIAGAIVDGGNFPWEGGRYPLLTEPQP
GYHGLRLTEAFGELAFIVKARVDGLRDQGQALGPFEAWVVLLGMETLSLRAERHVENTLHLAHWLLEQPQVAWVNYPGLP
HHPHHDRAQKYFKGKPGAVLTFGLKGGYEAAKRFISRLKLISHLANVGDTRTLAIHPASTTHSQLSPEEQAQAGVSPEMV
RLSVGLEHVEDLKAELKEALA
>Q5SJ58 2.5.1.-~~~oah2~~~O-acetyl-L-homoserine sulfhydrylase 2~~~COG2873
MEYTTLAVLAGLPEDPHGAVGLPIYAVAAYGFKTLEEGQERFATGEGYVYARQKDPTAKALEERLKALEGALEAVVLASG
QAATFAALLALLRPGDEVVAAKGLFGQTIGLFGQVLSLMGVTVRYVDPEPEAVREALSAKTRAVFVETVANPALLVPDLE
ALATLAEEAGVALVVDNTFGAAGALCRPLAWGAHVVVESLTKWASGHGSVLGGAVLSRETELWRNYPQFLQPDLKGQIPW
EALRARCFPERVRTLGLSLCGMALSPFNAYLLFQGLETVALRVARMSETARFLAERLQGHPKVKALRYPGLPEDPAHRNA
RKYLASGGPILTLDLGDLERASRFLGAIRLLKAANLGDARTLLVHPWTTTHSRLKEEARLQAGVTPGLVRVSVGLEDPLD
LLALFEEALEAV
>Q79VI4 2.5.1.49~~~metY~~~O-acetyl-L-homoserine sulfhydrylase~~~COG2873
MPKYDNSNADQWGFETRSIHAGQSVDAQTSARNLPIYQSTAFVFDSAEHAKQRFALEDLGPVYSRLTNPTVEALENRIAS
LEGGVHAVAFSSGQAATTNAILNLAGAGDHIVTSPRLYGGTETLFLITLNRLGIDVSFVENPDDPESWQAAVQPNTKAFF
GETFANPQADVLDIPAVAEVAHRNSVPLIIDNTIATAALVRPLELGADVVVASLTKFYTGNGSGLGGVLIDGGKFDWTVE
KDGKPVFPYFVTPDAAYHGLKYADLGAPAFGLKVRVGLLRDTGSTLSAFNAWAAVQGIDTLSLRLERHNENAIKVAEFLN
NHEKVEKVNFAGLKDSPWYATKEKLGLKYTGSVLTFEIKGGKDEAWAFIDALKLHSNLANIGDVRSLVVHPATTTHSQSD
EAGLARAGVTQSTVRLSVGIETIDDIIADLEGGFAAI
>P94890 2.5.1.-~~~metY~~~O-acetyl-L-homoserine sulfhydrylase~~~
MVGPSGESMPRNFKPETIALHGGQEPDPTTTSRAVPLYQTTSYVFKDTDHAARLFGLQEFGNIYTRLMNPTTDVLEKRVA
ALEGGVAALATASGQSAEMLALLNIVEAGQEIVASSSLYGGTYNLLHYTFPKLGIKVHFVDQSDPENFRKASNDKTRAFY
AETLGNPKLDTLDIAAVSKVAKEVGVPLVIDNTMPSPYLVNPLKHGADIVVHSLTKFLGGHGTSIGGIIIDGGSFNWGNG
KFKNFTEPDPSYHGLKFWEVFGKFEPFGGVNIAFILKARVQGLRDLGPAISPFNAWQILQGVETLPLRMERHSGNALKVA
EFLQKHPKIEWVNYPGLSTDKNYATAKKYHERGLFGAIVGFEIKGGVEKAKKFIDGLELFSLLANIGDAKSLAIHPASTT
HQQLTGPEQISAGVTPGFVRLSVGLENIDDILVDLEEALKNI
>Q9WZY4 2.5.1.-~~~~~~O-acetyl-L-homoserine sulfhydrylase~~~COG2873
MDWKKYGYNTRALHAGYEPPEQATGSRAVPIYQTTSYVFRDSDHAARLFALEEPGFIYTRIGNPTVSVLEERIAALEEGV
GALAVASGQAAITYAILNIAGPGDEIVSGSALYGGTYNLFRHTLYKKSGIIVKFVDETDPKNIEEAITEKTKAVYLETIG
NPGLTVPDFEAIAEIAHRHGVPLIVDNTVAPYIFRPFEHGADIVVYSATKFIGGHGTSIGGLIVDSGKFDWTNGKFPELV
EPDPSYHGVSYVETFKEAAYIAKCRTQLLRDLGSCMSPFNAFLFILGLETLSLRMKKHCENALKIVEFLKSHPAVSWVNY
PIAEGNKTRENALKYLKEGYGAIVTFGVKGGKEAGKKFIDSLTLISHLANIGDARTLAIHPASTTHQQLTEEEQLKTGVT
PDMIRLSVGIEDVEDIIADLDQALRKSQEG
>P9WGB5 2.5.1.-~~~metZ~~~O-succinylhomoserine sulfhydrylase~~~COG0626
MTDESSVRTPKALPDGVSQATVGVRGGMLRSGFEETAEAMYLTSGYVYGSAAVAEKSFAGELDHYVYSRYGNPTVSVFEE
RLRLIEGAPAAFATASGMAAVFTSLGALLGAGDRLVAARSLFGSCFVVCSEILPRWGVQTVFVDGDDLSQWERALSVPTQ
AVFFETPSNPMQSLVDIAAVTELAHAAGAKVVLDNVFATPLLQQGFPLGVDVVVYSGTKHIDGQGRVLGGAILGDREYID
GPVQKLMRHTGPAMSAFNAWVLLKGLETLAIRVQHSNASAQRIAEFLNGHPSVRWVRYPYLPSHPQYDLAKRQMSGGGTV
VTFALDCPEDVAKQRAFEVLDKMRLIDISNNLGDAKSLVTHPATTTHRAMGPEGRAAIGLGDGVVRISVGLEDTDDLIAD
IDRALS
>P55218 2.5.1.-~~~metZ~~~O-succinylhomoserine sulfhydrylase~~~
MTQDWDAGRLDSDLEGAAFDTLAVRAGQRRTPEGEHGEALFTTSSYVFRTAADAAARFAGEVPGNVYSRYTNPTVRTFEE
RIAALEGAEQAVATASGMSAILALVMSLCSSGDHVLVSRSVFGSTISLFDKYFKRFGIQVDYPPLSDLAAWEAACKPNTK
LFFVESPSNPLAELVDIAALAEIAHAKGALLAVDNCFCTPALQQPLKLGADVVIHSATKYIDGQGRGMGGVVAGRGEQMK
EVVGFLRTAGPTLSPFNAWLFLKGLETLRIRMQAHSASALALAEWLERQPGIERVYYAGLPSHPQHELARRQQSGFGAVV
SFDVKGGRDAAWRFIDATRMVSITTNLGDTKTTIAHPATTSHGRLSPEDRARAGIGDSLIRVAVGLEDLDDLKADMARGL
AAL
>P52477 ~~~mexA~~~Multidrug resistance protein MexA~~~
MQRTPAMRVLVPALLVAISALSGCGKSEAPPPAQTPEVGIVTLEAQTVTLNTELPGRTNAFRIAEVRPQVNGIILKRLFK
EGSDVKAGQQLYQIDPATYEADYQSAQANLASTQEQAQRYKLLVADQAVSKQQYADANAAYLQSKAAVEQARINLRYTKV
LSPISGRIGRSAVTEGALVTNGQANAMATVQQLDPIYVDVTQPSTALLRLRRELASGQLERAGDNAAKVSLKLEDGSQYP
LEGRLEFSEVSVDEGTGSVTIRAVFPNPNNELLPGMFVHAQLQEGVKQKAILAPQQGVTRDLKGQATALVVNAQNKVELR
VIKADRVIGDKWLVTEGLNAGDKIITEGLQFVQPGVEVKTVPAKNVASAQKADAAPAKTDSKG
>P52002 ~~~mexB~~~Multidrug resistance protein MexB~~~
MSKFFIDRPIFAWVIALVIMLAGGLSILSLPVNQYPAIAPPAIAVQVSYPGASAETVQDTVVQVIEQQMNGIDNLRYISS
ESNSDGSMTITVTFEQGTDPDIAQVQVQNKLQLATPLLPQEVQRQGIRVTKAVKNFLMVVGVVSTDGSMTKEDLSNYIVS
NIQDPLSRTKGVGDFQVFGSQYSMRIWLDPAKLNSYQLTPGDVSSAIQAQNVQISSGQLGGLPAVKGQQLNATIIGKTRL
QTAEQFENILLKVNPDGSQVRLKDVADVGLGGQDYSINAQFNGSPASGIAIKLATGANALDTAKAIRQTIANLEPFMPQG
MKVVYPYDTTPVVSASIHEVVKTLGEAILLVFLVMYLFLQNFRATLIPTIAVPVVLLGTFGVLAAFGFSINTLTMFGMVL
AIGLLVDDAIVVVENVERVMAEEGLSPREAARKSMGQIQGALVGIAMVLSAVFLPMAFFGGSTGVIYRQFSITIVSAMAL
SVIVALILTPALCATMLKPIEKGDHGEHKGGFFGWFNRMFLSTTHGYERGVASILKHRAPYLLIYVVIVAGMIWMFTRIP
TAFLPDEDQGVLFAQVQTPPGSSAERTQVVVDSMREYLLEKESSSVSSVFTVTGFNFAGRGQSSGMAFIMLKPWEERPGG
ENSVFELAKRAQMHFFSFKDAMVFAFAPPSVLELGNATGFDLFLQDQAGVGHEVLLQARNKFLMLAAQNPALQRVRPNGM
SDEPQYKLEIDDEKASALGVSLADINSTVSIAWGSSYVNDFIDRGRVKRVYLQGRPDARMNPDDLSKWYVRNDKGEMVPF
NAFATGKWEYGSPKLERYNGVPAMEILGEPAPGLSSGDAMAAVEEIVKQLPKGVGYSWTGLSYEERLSGSQAPALYALSL
LVVFLCLAALYESWSIPFSVMLVVPLGVIGALLATSMRGLSNDVFFQVGLLTTIGLSAKNAILIVEFAKELHEQGKGIVE
AAIEACRMRLRPIVMTSLAFILGVVPLAISTGAGSGSQHAIGTGVIGGMVTATVLAIFWVPLFYVAVSTLFKDEASKQQA
SVEKGQ
>P52003 ~~~mexR~~~Multidrug resistance operon repressor~~~
MNYPVNPDLMPALMAVFQHVRTRIQSELDCQRLDLTPPDVHVLKLIDEQRGLNLQDLGRQMCRDKALITRKIRELEGRNL
VRRERNPSDQRSFQLFLTDEGLAIHQHAEAIMSRVHDELFAPLTPVEQATLVHLLDQCLAAQPLEDI
>B2RHG1 ~~~mfa1~~~Minor fimbrium subunit Mfa1~~~
MKLNKMFLVGALLSLGFASCSKEGNGPDPDNAAKSYMSMTLSMPMGSARAGDGQDQANPDYHYVGEWAGKDKIEKVSIYM
VPQGGPGLVESAEDLDFGTYYENPTIDPATHNAILKPKKGIKVNSAVGKTVKVYVVLNDIAGKAKALLANVNAADFDAKF
KEIIELSTQAQALGTVADGPNPATAAGKIAKKNGTTDETIMMTCLQPSDALTIEAAVSEANAIAGIKNQAKVTVERSVAR
AMVSTKAQSYEIKATTQIGEIAAGSVLATITDIRWVVAQGERRQYLSKKRGTVPENTWVTPGSGFVPTSSTFHTNATEYY
DYAGLWEDHNTNEAVISGTQVPTLADYQLQDVTGELANALSGKFLLPNTHKSGANAASSDYKRGNTAYVLVRAKFTPKKE
AFIDRGKTYSDNTAVPEYVAGEDFFVGENGQFYVSMKSVTDPKVGGVAGMKAHKYVKGKVLYYAWLNPSTTSPDSWWNSP
VVRNNIYHIHIKSIKKLGFNWNPLVPDPDPSNPENPNNPDPNPDEPGTPVPTDPENPLPDQDTFMSVEVTVLPWKVHSYE
VDL
>P0DOA1 ~~~mfa1~~~Minor fimbrium subunit Mfa1~~~
MKLNKMFLVGALLSLGFASCSKEGNGPAPDSSSTADTHMSVSMSLPQHNRAGDNDYNPIGEYGGVDKINDLTVYVVGDGK
IDVRKLSTADLQVNQGASTTSIVTAPFQVKSGEKTVYAIVNITPKVEAALNAATNAADLKVAYEAAYAAFSDAGSEIATL
VNNQDQMIMSGKPVVQTILPNVSAANASVQNKVPIIVKRAAIRASMTITQQPVNGAYEIKALRPGNVEVVIATVSDLKWS
VAQYEKKYYLQQKDNALSPAASFVPASTNDYNGANGAMKHYDYSQLANRITVHQLNAPYSVTDVPNVAYKYVSETTHADN
DYRKGNTTYILVKGKLKPVAAMWADGEQAAYQEGGDLFLGLVTGKFYANEANANAANPASGGAGNPRVVTYKAAAVYYYA
WLNPNTLDPTTWTMSPARRNNIYNVNISKFRNIGLSGNPFVPTDPDPNNPDTPDNPDTPDPEDPDTPNPEEPLPVQKTYM
VVDVTVTPWTLHNYDIEF
>B2RHG2 ~~~mfa2~~~Minor fimbrium anchoring subunit Mfa2~~~
MNKRKHMDIRRLIISLPAIMALWGGLASCDKMIYDNYDDCPRGVYVNFYSQTECAENPSYPAEVARLNVYAFDKDGILRS
ANVFEDVQLSAAKEWLIPLEKDGLYTIFAWGNIDDHYNIGEIKIGETTKQQVLMRLKQDGKWATNIDGTTLWYATSPVVE
LKNMEDGADQYIHTRANLREYTNRVTVSVDSLPHPENYEIKLASSNGSYRFDGTVAKADSTYYPGETKVVGDSTCRAFFT
TLKLESGHENTLSVTHKPTGREIFRTDLVGAILSSQYAQNINLRCINDFDIRLVAHHCNCPDDTYVVVQIWINGWLIHSY
EIEL
>B2RHG3 ~~~mfa3~~~Minor fimbrium tip subunit Mfa3~~~
MMQLKKRYFALILLLFLWSGCDRGVDPQPDPLQPDVYLLVNARAAHTNGEESINMDAEDFEDRVHSLAMLVFDSNTGEKV
AEHFSSSIGSGTSTYVFTVKLKPGQRDFFFVANIPNMQTAMASIVNKSDMNHFMQVFRDLDPIHYHNATNNNGFPMSRMY
SNQTVTIGGTITQPLPFKPDGENNVKLQRVVAKLDVNIVEGVENLQKIELCNANVHYRLVPNQSEPIQFYGPVELRRVGA
TNQWLGYMPEAIVESTKWWGNTGNAENKPINFFRLTTRGGLVYDVPIITHEGAIPGGQYLPFAKGLLADKPSYTVYRNRH
YIYRIKTLPDKIEVKYSICDWNIVTNDTYMGYGYNVGVDEQGNVTITNTMQNCDPHVVRLVAKNGAYFGSQPTDTSVEFT
ELANGASQTFKVNKDAVAVGSAYLEVYYNPDLNATGVVPDKVFIKK
>B2RHG4 ~~~~~~Minor fimbrium tip subunit MfA4~~~
MKKYLLYASLLTSVLLFSCSKNNPSEPVEDRSIEISIRVDDFTKTGETVRYERNQGSAAERLITNLYLLLFDQSGANPAK
YYIAGNTFSGGIWLPDDMKVKLDMTQSEAGERKVYVVANVDNAVKTALDAVANESDLQTVKRTTAMPWSTDIASPFLMSG
NKTHDFLANRLLDNVPLVRAIAKVELNISLSEKFQIVPIIVNGSLSEFKFRYVNFDKETYVVKPTTKPDNLISSANGVWP
QITDWTVWGASLNTSPAPDAGTGYTLDANGKVTALRIVTYLNERDSKGATVEVALPRVDDGTLPPPEFGPELYRLPLPDK
ILRNHWYKYEVEI
>Q7MXK0 ~~~mfA4~~~Minor fimbrium tip subunit MfA4~~~
MKKYLLYASLLTSVLLFSCSKNNPNEPVEDRSIEISIRVDDFTKTGEAVRYERNQGSAAERLITNLYLLLFDQSGANPAK
YYITGNTFTGGTWLPDDMKVKLDMTQSEAGERKVYVVANVDNAVKTALDAVANESDLQTVKRTTAMPWSTDIASPFLMSG
NKTHDFLANRLLDNVPLVRAIAKVELNISLSEKFQIVPIIVNGSLSEFKFRYVNFDKETYVVKPTTKPDNLISSANGVWP
QITDWTVWGASLNTSPAPDAGTGYTLDANGKVTALRIVTYLNERDSKGATVEVALPRVDDGTLPPPEFGPELYRLPLPDK
ILRNHWYKYEVEI
>P81363 ~~~mfa1~~~Minor fimbrium subunit Mfa1~~~
MKLNKMFLVGALLSLGFASCSKEGNGPAPDSSSTADTHMSVSMSLPQHNRAGDNDYNPIGEYGGVDKINDLTVYVVGDGK
IDVRKLSTADLQVNQGASTTSIVTAPFQVKSGEKTVYAIVNITPKVEAALNAATNAADLKVAYEAAYAAFSDAGSEIATL
VNNQDQMIMSGKPVVQTILANVSAANASVQNKVPIIVKRAAIRASMTITQQPVNGAYEIKALRPGNVEVGIATVSDLKWA
VAQYEKKYYLQQKDNALSPAASFVPASTNDYNGANGAMKHYDYSQLANRITVHQLNAPYSVTDVPNVAYKYVSETTHADN
DYRKGNTTYILVKGKLKPVAAMWADGEQAAYQEGGDLFLGLVTGKFYANEANANAANPASGGAGNPRVVTYKAAAVYYYA
WLNPNTLDPTTWTMSPARRNNIYNVNISKFRNIGLSGNPFVPTDPDPNNPDTPDNPDTPDPEDPDTPNPEEPLPVQKTYM
VVDVTVTPWTLHNYDIEF
>B2RHG5 ~~~~~~Minor fimbrium subunit Mfa5~~~
MMKRYTIILAVFLLFCTVFTFQIKARPYERFADVEKPWIQKHSMDSKLVPANKGNLIQAEIVYQSVSEHSDLVISPVNEI
RPANRFPSHRKSFFAENLRASPPVVPVAVDKYAVPVANPMDPENPNAWDVTLKITTKAVTVPVDVVMVIDQSSSMGGQNI
ARLKSAIASGQRFVKKMLPKGMATEGVRIALVSYDHEPHRLSDFTKDTAFLCQKIRALTPIWGTHTQGGLKMARNIMATS
TAVDKHIILMSDGLATEQYPVKNVTTADFIGETGNANDPIDLVIQGAINFPTNYVSNNPSTPLTPNYPTHSSKVGRRNLP
ESKFDYSNLSARITFDGVAGALVYEPRFPHPYYYYFPCNAAINEAQFAKNSGYTIHTIGYDLGDFALANNSLKLTATDEN
HFFTATPANLAAAFDNIAQTINIGIQRGEVTDFVAPGFIVKNLTQSGDVTHLLNVSNGTVHYDVSTKKLTWTTGTILSSS
EATITYRIYADLDYIQNNDIPVNTTSAIGPDLGGFDTNTEAKLTYTNSNGESNQQLIFPRPTVKLGYGVIKRHYVLVNKD
GQPIQANGTVVSSLSEAHVLQSQDFFLPSGGGHIVPKWIKLDKTTEALQYYSVPPTNTVITTADGKRYRFVEVPGSTPNP
GQIGISWKKPAGNAYFAYKLLNYWMGGTTDQQSEWDVTSNWTGAQVPLTGEDVEFATTENFGSPAVADLHVPTTNPKIIG
NLINNSDKDLVVTTNSQLTINGVVEDNNPNVGTIVVKSSKDNPTGTLLFANPGYNQNVGGTVEFYNQGYDCADCGMYRRS
WQYFGIPVNESGFPINDVGGNETVNQWVEPFNGDKWRPAPYAPDTKLQKFKGYQITNDVQAQPTGVYSFKGTLCVCDAFL
NLTRTSGVNYSGANLIGNSYTGAIDIKQGIVFPPEVEQTVYLFNTGTRDQWRKLNGSTVSGFRAGQYLSVPKNTAGQDNL
PDRIPSMHSFLVKMQNGASCTLQILYDKLLKNTTVNNGNGTQITWRSGNSGSANMPSLVMDVLGNESADRLWIFTDGGLS
FGFDNGWDGRKLTEKGLSQLYAMSDIGNDKFQVAGVPELNNLLIGFDADKDGQYTLEFALSDHFAKGGVFLEDLSRGVTR
RIVDGGSYSFDAKRGDSGARFRLSYDEEWVESAEVSVLVGTVGKRILITNNCEHACQANVYTTDGKLLIRLDVKPGSKSM
TEPLIDGAYVVSLQSPATSSNVRKVVVN
>P30958 3.6.4.-~~~mfd~~~Transcription-repair-coupling factor~~~COG1197
MPEQYRYTLPVKAGEQRLLGELTGAACATLVAEIAERHAGPVVLIAPDMQNALRLHDEISQFTDQMVMNLADWETLPYDS
FSPHQDIISSRLSTLYQLPTMQRGVLIVPVNTLMQRVCPHSFLHGHALVMKKGQRLSRDALRTQLDSAGYRHVDQVMEHG
EYATRGALLDLFPMGSELPYRLDFFDDEIDSLRVFDVDSQRTLEEVEAINLLPAHEFPTDKAAIELFRSQWRDTFEVKRD
PEHIYQQVSKGTLPAGIEYWQPLFFSEPLPPLFSYFPANTLLVNTGDLETSAERFQADTLARFENRGVDPMRPLLPPQSL
WLRVDELFSELKNWPRVQLKTEHLPTKAANANLGFQKLPDLAVQAQQKAPLDALRKFLETFDGPVVFSVESEGRREALGE
LLARIKIAPQRIMRLDEASDRGRYLMIGAAEHGFVDTVRNLALICESDLLGERVARRRQDSRRTINPDTLIRNLAELHIG
QPVVHLEHGVGRYAGMTTLEAGGITGEYLMLTYANDAKLYVPVSSLHLISRYAGGAEENAPLHKLGGDAWSRARQKAAEK
VRDVAAELLDIYAQRAAKEGFAFKHDREQYQLFCDSFPFETTPDQAQAINAVLSDMCQPLAMDRLVCGDVGFGKTEVAMR
AAFLAVDNHKQVAVLVPTTLLAQQHYDNFRDRFANWPVRIEMISRFRSAKEQTQILAEVAEGKIDILIGTHKLLQSDVKF
KDLGLLIVDEEHRFGVRHKERIKAMRANVDILTLTATPIPRTLNMAMSGMRDLSIIATPPARRLAVKTFVREYDSMVVRE
AILREILRGGQVYYLYNDVENIQKAAERLAELVPEARIAIGHGQMRERELERVMNDFHHQRFNVLVCTTIIETGIDIPTA
NTIIIERADHFGLAQLHQLRGRVGRSHHQAYAWLLTPHPKAMTTDAQKRLEAIASLEDLGAGFALATHDLEIRGAGELLG
EEQSGSMETIGFSLYMELLENAVDALKAGREPSLEDLTSQQTEVELRMPSLLPDDFIPDVNTRLSFYKRIASAKTENELE
EIKVELIDRFGLLPDPARTLLDIARLRQQAQKLGIRKLEGNEKGGVIEFAEKNHVNPAWLIGLLQKQPQHYRLDGPTRLK
FIQDLSERKTRIEWVRQFMRELEENAIA
>P9WMQ5 3.6.4.-~~~mfd~~~Transcription-repair-coupling factor~~~COG1197
MTAPGPACSDTPIAGLVELALSAPTFQQLMQRAGGRPDELTLIAPASARLLVASALARQGPLLVVTATGREADDLAAELR
GVFGDAVALLPSWETLPHERLSPGVDTVGTRLMALRRLAHPDDAQLGPPLGVVVTSVRSLLQPMTPQLGMMEPLTLTVGD
ESPFDGVVARLVELAYTRVDMVGRRGEFAVRGGILDIFAPTAEHPVRVEFWGDEITEMRMFSVADQRSIPEIDIHTLVAF
ACRELLLSEDVRARAAQLAARHPAAESTVTGSASDMLAKLAEGIAVDGMEAVLPVLWSDGHALLTDQLPDGTPVLVCDPE
KVRTRAADLIRTGREFLEASWSVAALGTAENQAPVDVEQLGGSGFVELDQVRAAAARTGHPWWTLSQLSDESAIELDVRA
APSARGHQRDIDEIFAMLRAHIATGGYAALVAPGTGTAHRVVERLSESDTPAGMLDPGQAPKPGVVGVLQGPLRDGVIIP
GANLVVITETDLTGSRVSAAEGKRLAAKRRNIVDPLALTAGDLVVHDQHGIGRFVEMVERTVGGARREYLVLEYASAKRG
GGAKNTDKLYVPMDSLDQLSRYVGGQAPALSRLGGSDWANTKTKARRAVREIAGELVSLYAKRQASPGHAFSPDTPWQAE
LEDAFGFTETVDQLTAIEEVKADMEKPIPMDRVICGDVGYGKTEIAVRAAFKAVQDGKQVAVLVPTTLLADQHLQTFGER
MSGFPVTIKGLSRFTDAAESRAVIDGLADGSVDIVIGTHRLLQTGVRWKDLGLVVVDEEQRFGVEHKEHIKSLRTHVDVL
TMSATPIPRTLEMSLAGIREMSTILTPPEERYPVLTYVGPHDDKQIAAALRRELLRDGQAFYVHNRVSSIDAAAARVREL
VPEARVVVAHGQMPEDLLETTVQRFWNREHDILVCTTIVETGLDISNANTLIVERADTFGLSQLHQLRGRVGRSRERGYA
YFLYPPQVPLTETAYDRLATIAQNNELGAGMAVALKDLEIRGAGNVLGIEQSGHVAGVGFDLYVRLVGEALETYRDAYRA
AADGQTVRTAEEPKDVRIDLPVDAHLPPDYIASDRLRLEGYRRLAAASSDREVAAVVDELTDRYGALPEPARRLAAVARL
RLLCRGSGITDVTAASAATVRLSPLTLPDSAQVRLKRMYPGAHYRATTATVQVPIPRAGGLGAPRIRDVELVQMVADLIT
ALAGKPRQHIGITNPSPPGEDGRGRNTTIKERQP
>Q7A7B2 3.6.4.-~~~mfd~~~Transcription-repair-coupling factor~~~
MTILTTLIKEDNHFQDLNQVFGQANTLVTGLSPSAKVTMIAEKYAQSNQQLLLITNNLYQADKLETDLLQFIDAEELYKY
PVQDIMTEEFSTQSPQLMSERIRTLTALAQGKKGLFIVPLNGLKKWLTPVEMWQNHQMTLRVGEDIDVDQFLNKLVNMGY
KRESVVSHIGEFSLRGGIIDIFPLIGEPIRIELFDTEIDSIRDFDVETQRSKDNIEEVDITTASDYIITEEVIRHLKEEL
KTAYENTRPKIDKSVRNDLKETYESFKLFESTYFDHQILRRLVAFMYETPSTIIEYFQKDAIIAVDEFNRIKETEESLTV
ESDSFISNIIESGNGFIGQSFIKYDDFETLIEGYPVTYFSLFATTMPIKLNHIIKFSCKPVQQFYGQYDIMRSEFQRYVN
QNYHIVVLVETETKVERMQAMLSEMHIPSITKLHRSMSSGQAVIIEGSLSEGFELPDMGLVVITERELFKSKQKKQRKRT
KAISNAEKIKSYQDLNVGDYIVHVHHGVGRYLGVETLEVGQTHRDYIKLQYKGTDQLFVPVDQMDQVQKYVASEDKTPKL
NKLGGSEWKKTKAKVQQSVEDIAEELIDLYKEREMAEGYQYGEDTAEQTTFELDFPYELTPDQAKSIDEIKDDMQKSRPM
DRLLCGDVGYGKTEVAVRAAFKAVMEGKQVAFLVPTTILAQQHYETLIERMQDFPVEIQLMSRFRTPKEIKQTKEGLKTG
FVDIVVGTHKLLSKDIQYKDLGLLIVDEEQRFGVRHKERIKTLKHNVDVLTLTATPIPRTLHMSMLGVRDLSVIETPPEN
RFPVQTYVLEQNMSFIKEALERELSRDGQVFYLYNKVQSIYEKREQLQMLMPDANIAVAHGQMTERDLEETMLSFINNEY
DILVTTTIIETGVDVPNANTLIIEDADRFGLSQLYQLRGRVGRSSRIGYAYFLHPANKVLTETAEDRLQAIKEFTELGSG
FKIAMRDLNIRGAGNLLGKQQHGFIDTVGFDLYSQMLEEAVNEKRGIKEPESEVPEVEVDLNLDAYLPTEYIANEQAKIE
IYKKLRKTETFDQIIDIKDELIDRFNDYPVEVARLLDIVEIKVHALHSGITLIKDKGKIIDIHLSVKATENIDGEVLFKA
TQPLGRTMKVGVQNNAMTITLTKQNQWLDSLKFLVKCIEESMRISDEA
>A0QSY0 ~~~mfpA~~~Pentapeptide repeat protein MfpA~~~COG1357
MRIGANGDETVWADEEFAGRDFRDEDLSRIRTERVVFTECDFSGVDLSESEHHGSAFRNCTFRRSTIWHSTFTNCSLLGS
VFTECRIRPVTFVECDFTLAVLGGCDLRAVDLSDCRLREVSLVGADLRKAVLRRADLTGSRVQDARLEEADLRGTRVDPT
FWTTAKVRGAKIDIEQALAYAAAHGLAVHGG
>I6YBX3 ~~~mfpA~~~Pentapeptide repeat protein MfpA~~~COG1357
MQQWVDCEFTGRDFRDEDLSRLHTERAMFSECDFSGVNLAESQHRGSAFRNCTFERTTLWHSTFAQCSMLGSVFVACRLR
PLTLDDVDFTLAVLGGNDLRGLNLTGCRLRETSLVDTDLRKCVLRGADLSGARTTGARLDDADLRGATVDPVLWRTASLV
GARVDVDQAVAFAAAHGLCLAGG
>A9CK30 3.1.3.79~~~mfppA~~~Mannosylfructose-phosphate phosphatase~~~COG0561
MKPLRLLSTDLDGTVVGDNDATRRFRDFWHALPDDLRPVLVFNSGRLIDDQLALLEEVPLPQPDYIIGGVGTMLHAKKRS
ELETAYTQSLGTGFDPRKIADVMNRIAGVTMQEERYQHGLKSSWFLHDADAAALGEIEAALLAADIDARIVYSSDRDLDI
LPKAADKGAALAWLCGQLRIGLDESVVSGDTGNDRAMFELKTIRGVIVGNALPELVSLAHQDNRFFHSTAKEADGVIEGL
RHWGLNPR
>A7TZT2 2.4.1.246~~~mfpsA~~~Mannosylfructose-phosphate synthase~~~COG0438
MEKFTKMGPMTTTSETERYPRIALISTHGYVAAHPPLGAADTGGQVVYVLELARKLGQLGYTVDLYTRRFEDQPEFDEVD
ERVRVVRIPCGGRDFIPKEYLHRHLMEWCENALRFIKKNDLNYSFINSHYWDAGVAGQRLSEALKIPHLHTPHSLGIWKK
RQMETDYPEKADTFELEFNFKERIQHELIIYRSCDMVIATTPVQLDVLIEDYGLKRKHIHMIPPGYDDNRFFPVSDATRQ
MIRQRFGFEGKVVLALGRLATNKGYDLLIDGFSVLAEREPEARLHLAVGGENMDEQETTILNQLKERVKSLGLEDKVAFS
GYVADEDLPDIYRAADLFVLSSRYEPFGMTAIEAMASGTPTVVTIHGGLFRAISYGRHALFADPFDKEDLGITMMKPFKH
ERLYGRLSRMGAHKARSLFTWTGIAQQLLALVEGRTMMPVLEEADWAEPWNDGD
>Q7M827 1.3.5.-~~~sdhA~~~8-methylmenaquinol:fumarate reductase flavoprotein subunit~~~COG1053
MSEQFTRREFLQSACITMGALAVSTSGVDRAFASSSLPINTSGIPSCDVLIIGSGAAGLRAAVAARKKDPSLNVIVVSKV
MPTRSATTMAEGGINGVIDFSEGDSFALHAYDTVKGGDFLVDQDTAMKFAEHAGEAIHELDYIGMPFSRDKNGKVDKRYA
GGASKIRCNFSADKTGHILTHTCLDDALKNGVKFLMDHQLLDIGVDNGRCEGVVLRDIRTGTIAPVRAKSVVLATGGYTR
VFWNRTSTPYIATGDGAASAMRAGVAFKDPEMLQFHPTGVCHGGVLITEAARGEGGILLNNQGERFMKNYAKKMELAPRD
IVSRSIETEIREGRAFGKGMEAYVLLDVTHLGKEKIMRNLPQIRHIGLLFENMDLVEKPIAIRPTAHYSMGGIDVMGLES
MSTAIPGLFAAGEAACVSIHGANRLGGNSLCDTVVTGKIAGTNAASFASSAGFGSGTHLHDLTLKWMSRFKEVANGKGEV
NEMYAIREELGAVNWDNMGVFRTESRLVALEDKHNELQARYDALRIPNTNPVFNTAFTEYVELGNILLASRAARMGAEAR
KESRGSHYREDYIKRDDANFLKHSMVTMDSNGKLHLGWKDVVVTQFKIEERKY
>Q7M826 1.3.5.-~~~sdhB~~~8-methylmenaquinol:fumarate reductase iron-sulfur subunit~~~COG0479
MKFIIDRFDGKKNYEQIYTLAKEDIEAKTLLGVLLLIKQTQDITLNFTASCRMAICGACAVRVNGHSYLACDTKMTELFE
EYKNSDTFRISPLGNHRVISDLVVDWEPAIENLRKIKPGLVAKSEFSAKEGCQQNQEEFDRIIKQWDCILCGSCVSECNK
FSADQSDYMEPFVFTQAWRLANDSRSKDPMIHVKPAVANGLWNCVHCHECTNRCPKHISAAEDIANLRVMAMKKGLNTGV
GPAHAKAFHTDLVEGSGRLNEIRLALRIEGVATVARTGMAITLMRAGKMNPLEIFGGHTIKGHEDLVKMIDAAKAATKE
>Q7M825 1.3.5.-~~~sdhE~~~8-methylmenaquinol:fumarate reductase membrane anchor subunit~~~COG2048
MQKEFAFFPGCVLSQAAIESKKSIEAIAPVLGIKLREIEGWSCCGASQAQCVDPLATLVANARNLALAEQMNLPVLTTCS
TCLLMLTRAKAELDRGAKDQINSFLAKGNMSYQGTSEVTSLLWVLAQNVEELKSKVKKPLSNLKVAVFYGCHSLRPEKDL
GFESSTNPTSFETIVKALGAQVVPFEKRLNCCGFHAVYPAESSAMKMTSGIINTAAKSEAHCVVTPCPLCQMQLDIYQED
AQKIAKSKERVPVLHLSQLVGLALGIPAKELGLNHNVIDATKLG
>A1KIJ9 ~~~~~~Probable triacylglyceride transporter BCG_1471c~~~
MRAGRRVAISAGSLAVLLGALDTYVVVTIMRDIMNSVGIPINQLHRITWIVTMYLLGYIAAMPLLGRASDRFGRKLMLQV
SLAGFIIGSVVTALAGHFGDFHMLIAGRTIQGVASGALLPITLALGADLWSQRNRAGVLGGIGAAQELGSVLGPLYGIFI
VWLLHDWRDVFWINVPLTAIAMVMIHFSLPSHDRSTEPERVDLVGGLLLALALGLAVIGLYNPNPDGKHVLPDYGAPLLV
GALVAAVAFFGWERFARTRLIDPAGVHFRPFLSALGASVAAGAALMVTLVDVELFGQGVLQMDQAQAAGMLLWFLIALPI
GAVTGGWIATRAGDRAVAFAGLLIAAYGYWLISHWPVDLLADRHNILGLFTVPAMHTDLVVAGLGLGLVIGPLSSATLRV
VPSAQHGIASAAVVVARMTGMLIGVAALSAWGLYRFNQILAGLSAAIPPNASLLERAAAIGARYQQAFALMYGEIFTITA
IVCVFGAVLGLLISGRKEHADEPEVQEQPTLAPQVEPL
>P9WJY3 ~~~~~~Probable triacylglyceride transporter Rv1410c~~~COG0477
MRAGRRVAISAGSLAVLLGALDTYVVVTIMRDIMNSVGIPINQLHRITWIVTMYLLGYIAAMPLLGRASDRFGRKLMLQV
SLAGFIIGSVVTALAGHFGDFHMLIAGRTIQGVASGALLPITLALGADLWSQRNRAGVLGGIGAAQELGSVLGPLYGIFI
VWLLHDWRDVFWINVPLTAIAMVMIHFSLPSHDRSTEPERVDLVGGLLLALALGLAVIGLYNPNPDGKHVLPDYGAPLLV
GALVAAVAFFGWERFARTRLIDPAGVHFRPFLSALGASVAAGAALMVTLVDVELFGQGVLQMDQAQAAGMLLWFLIALPI
GAVTGGWIATRAGDRAVAFAGLLIAAYGYWLISHWPVDLLADRHNILGLFTVPAMHTDLVVAGLGLGLVIGPLSSATLRV
VPSAQHGIASAAVVVARMTGMLIGVAALSAWGLYRFNQILAGLSAAIPPNASLLERAAAIGARYQQAFALMYGEIFTITA
IVCVFGAVLGLLISGRKEHADEPEVQEQPTLAPQVEPL
>A0QSB6 ~~~mftA~~~Mycofactocin precursor peptide~~~
MEPNQHVEAETELVTETLVEEVSIDGMCGVY
>P0DUE9 ~~~mftA~~~Mycofactocin precursor peptide~~~
MDRETEAETAELVTESLVEEVSIDGMCGVY
>A0PM48 ~~~mftB~~~Peptide chaperone MftB~~~COG0535
MRGLLTVPAPAQAAAGAGAFDPDRGWRLHAQVAVRPEPFGALLYHFGTRKLSFLKNRTILAVVRSLADHPDVRSACRAAG
VDDSEHAPYLHALSVLAGSHMLVPQEADQ
>A0QSB8 4.1.99.26~~~mftC~~~Mycofactocin maturase MftC~~~COG0535
MTSVQPVPRLVEQFERGLDAPICLTWELTYACNLACVHCLSSSGKRDPRELSTQQCKDIIDELERMQVFYVNIGGGEPTV
RSDFWELVDYATAHHVGVKFSTNGVRITPEVAAKLAASDYVDVQISLDGANAEVNDAVRGKGSFDMAVRALENLSNAGFT
DAKISVVVTRQNVDQLDEFAALAARYGATLRITRLRPSGRGADVWDDLHPTAEQQRQLYDWLVAKGDRVLTGDSFFHLSG
LGAPGALAGLNLCGAGRVVCLIDPVGDVYACPFAIHDKFLAGNILSDGGFQNVWQHSELFRELREPQSAGACASCGHFDA
CRGGCMAAKFFTGLPLDGPDPECVEGWGAPALEKERVKPKPSGDHSRGTKQGPVALKLLTKPPARFCNESPV
>P9WJ79 4.1.99.26~~~mftC~~~Mycofactocin maturase MftC~~~COG0535
MTSPVPRLIEQFERGLDAPICLTWELTYACNLACVHCLSSSGKRDPGELSTRQCKDIIDELERMQVFYVNIGGGEPTVRP
DFWELVDYATAHHVGVKFSTNGVRITPEVATRLAATDYVDVQISLDGATAEVNDAIRGTGSFDMAVRALQNLAAAGFAGV
KISVVITRRNVAQLDEFATLASRYGATLRITRLRPSGRGTDVWADLHPTADQQVQLYDWLVSKGERVLTGDSFFHLAPLG
QSGALAGLNMCGAGRVVCLIDPVGDVYACPFAIHDHFLAGNVLSDGGFQNVWKNSSLFRELREPQSAGACGSCGHYDSCR
GGCMAAKFFTGLPLDGPDPECVQGHSEPALARERHLPRPRADHSRGRRVSKPVPLTLSMRPPKRPCNESPV
>A0PM49 4.1.99.26~~~mftC~~~Mycofactocin maturase MftC~~~COG0535
MTTAVPRLIEQFEHGLDAPICLTWELTYACNLACVHCLSSSGKRDPGELSTRQCQDIIDELERMQVFYVNIGGGEPTVRP
DFWELVDYATAHHVGVKFSTNGVRINPEVAARLAASDCVDVQISLDGATAEVNDAVRGAGSFAMAVRALENLAAAGFADA
KISVVVTRHNVGQLDDFAALADRYGATLRITRLRPSGRGADVWEELHPTAAQQVALYDWLVAKGERVLTGDSFFHLAPLG
SSGALAGLNMCGAGRVVCLIDPVGDVYACPFAIHDRFLAGNVLTDGGFDQVWKNAPLFRQLREPQSAGACGSCGHYDSCR
GGCMAAKFFTGLPLDGPDPECVQGYGAPALAQERHAPRPRVDHSRGSRE
>A0QSB9 1.4.3.26~~~mftD~~~Pre-mycofactocin synthase~~~COG1304
MNMARDIWFETVAIAQQRARKRLPKSVYSSLISASEKGVTVTDNVESFAELGFAPHVVGAPEKRDMATTVMGQQISLPVI
ISPTGVQAVHPDGEVAVARAAAARGTAMGLSSFASKPIEEVVAVNDKIFFQIYWLGDRDAILARAERAKAAGAVGLIVTT
DWSFSHGRDWGSPKIPEKMDLKTMVTMMPEALTKPRWLWQWGKTMRPPNLRVPNQGARGEDGPPFFQAYGEWMGTPPPTW
EDIAWLREQWDGPFMLKGVIRVDDAKRAVDAGVSAISVSNHGGNNLDGTPAAIRALPVIAEAVGDQVEVLLDGGIRRGSD
VVKAVALGARAVMIGRAYLWGLAAEGQVGVENVLDILRGGIDSALMGLGRSSIHDLVPEDILVPEGFTRALGVPPASGS
>P9WND7 1.4.3.26~~~mftD~~~Pre-mycofactocin synthase~~~COG1304
MAEAWFETVAIAQQRAKRRLPKSVYSSLIAASEKGITVADNVAAFSELGFAPHVIGATDKRDLSTTVMGQEVSLPVIISP
TGVQAVDPGGEVAVARAAAARGTVMGLSSFASKPIEEVIAANPKTFFQVYWQGGRDALAERVERARQAGAVGLVVTTDWT
FSHGRDWGSPKIPEEMNLKTILRLSPEAITRPRWLWKFAKTLRPPDLRVPNQGRRGEPGPPFFAAYGEWMATPPPTWEDI
GWLRELWGGPFMLKGVMRVDDAKRAVDAGVSAISVSNHGGNNLDGTPASIRALPAVSAAVGDQVEVLLDGGIRRGSDVVK
AVALGARAVMIGRAYLWGLAANGQAGVENVLDILRGGIDSALMGLGHASVHDLSPADILVPTGFIRDLGVPSRRDV
>A0PM50 1.4.3.26~~~mftD~~~Pre-mycofactocin synthase~~~COG1304
MADEWFETVAIAQQRAKRRLPKSVYSSLISASEKGITVADNVAAFSELGFAPHVIGAAEKRDMSTTVMGQDISMPVLISP
TGVQAVHPDGEVAVARAAAARGTAMGLSSFASKTIEDVIAANPKIFFQIYWLGGRDAIAERVERARQAGAVGLIVTTDWT
FSHGRDWGSPKIPEQMNLRTILRLSPEAIVRPRWLWKFGKTLRPPDLRVPNQGRRGEPGPAFFAAYGEWMGTPPPTWDDI
AWLRELWGGPFMLKGVMRVDDAKRAVDAGVSAISVSNHGGNNLDGTPASIRALPAVAAAVGDQVEVLLDGGIRRGSDVVK
AVALGARAVLVGRAYLWGLAANGQAGVENVLDILRGGIDSALMGLGHSSIHDLRSDDILIPADFVRRLGR
>A0QSC0 3.4.14.14~~~mftE~~~Mycofactocin precursor peptide peptidase~~~COG1402
MNSAYHRHVAFPSGLGTSTSRQLHSMVPMVLVPVGSTEQHGPHLPLDTDTRIAAAVAGTVVEQFGAPADRDAVVAPPVAY
GASGEHEGFPGTVSIGTAALELLLVEYGRSASKWTSRIVFVNGHGGNVEALAAAVALLRYEGRDAGWVPCSVPDADAHAG
HTETSVLLHISPDDVLTDELVCGNTAPLAELMPRMRSGGVAAVSELGILGDPTTATAAEGERIFAEMVNGCADRIKRWQP
DRNGLLT
>P9WP59 3.4.14.14~~~mftE~~~Mycofactocin precursor peptide peptidase~~~COG1402
MNSSYHRRVPVVGELGSATSSQLPSTSPSIVIPLGSTEQHGPHLPLDTDTRIATAVARTVTARLHAEDLPIAQEEWLMAP
AIAYGASGEHQRFAGTISIGTEALTMLLVEYGRSAACWARRLVFVNGHGGNVGALTRAVGLLRAEGRDAGWCPCTCPGGD
PHAGHTETSVLLHLSPADVRTERWRAGNRAPLPVLLPSMRRGGVAAVSETGVLGDPTTATAAEGRRIFAAMVDDCVRRVA
RWMPQPDGMLT
>A0PM51 3.4.14.14~~~mftE~~~Mycofactocin precursor peptide peptidase~~~COG1402
MNSSYHRRVPVLGELGTSTSSQLPSTWPSILIPLGSTEQHGPHLPLDTDTRIATAVGRAVATRMHGRLTQCQPGWLLAPP
IAYGASGEHQSFAGTISIGAEALRVLLLEYGRSAACWADRLVFVNGHGGNVEALRGAVRQLRAEGRDAGWSACGSAGGDA
HAGHTETSVLLHISPEVVLTDRLSAGNAAPLAELLPSLRRGGVAAVSPIGVLGDPTTATAVEGRRIFAEMVDDCVSRIAR
WTPGPDGMLT
>P9WMX1 2.4.1.-~~~mftF~~~Pre-mycofactocin glycosyltransferase~~~COG1216
MTATRLPDGFAVQVDRRVRVLGDGSALLGGSPTRLLRLAPAARGLLCDGRLKVRDEVSAELARILLDATVAHPRPPSGPS
HRDVTVVIPVRNNASGLRRLVTSLRGLRVIVVDDGSACPVESDDFVGAHCDIEVLHHPHSKGPAAARNTGLAACTTDFVA
FLDSDVTPRRGWLESLLGHFCDPTVALVAPRIVSLVEGENPVARYEALHSSLDLGQREAPVLPHSTVSYVPSAAIVCRSS
AIRDVGGFDETMHSGEDVDLCWRLIEAGARLRYEPIALVAHDHRTQLRDWIARKAFYGGSAAPLAVRHPDKTAPLVISGG
ALMAWILMSIGTGLGRLASLVIAVLTGRRIARAMRCAETSFLDVLAVATRGLWAAALQLASAICRHYWPLALLAAILSRR
CRRVVLIAAVVDGVVDWLRRREGADDDAEPIGPLTYLVLKRVDDLAYGAGLWYGVVRERNIGALKPQIRT
>P9WMB7 ~~~mftR~~~Putative mycofactocin biosynthesis transcriptional regulator MftR~~~COG1309
MPHESRVGRRRSTTPHHISDVAIELFAAHGFTDVSVDDIARAAGIARRTLFRYYASKNAIPWGDFSTHLAQLQGLLDNID
SRIQLRDALRAALLAFNTFDESETIRHRKRMRVILQTPELQAYSMTMYAGWREVIAKFVARRSGGKTTDFMPQTVAWTML
GVALSAYEHWLRDESVSLTEALGAAFDVVGAGLDRLNQ
>P74167 5.1.3.34~~~MgdE~~~Monoglucosyldiacylglycerol epimerase~~~COG0300
MAMAWLMGLGLALASVLWVELVRDCYHALAHVWSPLYRLHGWHHRVFRSDLSVVSTEIYQKAHWYNDVPEALVMLAFGIW
PPLLTWMWQGFSQWPLILAASAGMVYTLGFLLSAIARGVGLPNADEITDLTHRPGPFLTPPAPWMVNRTYHWRHHFDDPN
AYFCGTLTLVDKMLGTALSLKGKKIAVTGASGGFGQALLQELHQQGAKAIAITSCGQEVTVIGAEKTIPIRTEQWCIGQE
DQLKDLLAEIDVLVINHGVNVHQRRDGQAIQEAYEVNTFSALRLMELFLATVKTNRHIATKEIWVNTSEAEVNPAFSPLY
ELSKRALGDVVTLKRLDSPCVIRKLILGPFKSKLNPVGIMSAAWVARQVVKGVKRDSRNIIVTINPITFIAFPVKEFFVS
LYFRCFTKKS
>A9BHJ0 2.4.1.270~~~mggA~~~Mannosylglucosyl-3-phosphoglycerate synthase~~~COG0438
MNNLYIFHYHYIKGGVSTVVRNIVKSLKDAYKITLFGSKKMGIDGIEEVLSYENVDFIDFPELGYIYYDSTDYKTFLELK
ESIKNKLNNYHDERAIYWAHNYNLGKNPAFTEAFKEFITTKNIPTIIQIHDFPECARWENYSFIRKFINSSLYPIRKNIQ
YATINLSDYNRLIKCGIPSENAFYLPNAVEFAKNKDKIDDIDKDEVINKLKKLGYNVDPTNKNILYPTRTIRRKNILEAV
LINRLYGKSNLLVTLPANSDKERPYEKVVKETFESEKVKGAWAISAKDPSLFPYILNISDLFFSSSVLEGFGMIYLESKF
NEKNFLTRKLDVIEDFKNIKEISYYDRFLVSLSPKEINKVKEKYEEQINKIPISEENKNHLRQDLNNKFDKDLIDFSFLP
VELQKKFCMEEEAKLNDLKEINKEIFDKIEMLTSTNHIDQGINLEDFSLKAYKSKIFLLLDKVQARGNAPKGVQGSTKEI
EDTIIDENILKSFLTIDNIRLLFSY
>A9BJC1 3.1.3.-~~~mggB~~~Mannosylglucosyl-3-phosphoglycerate phosphatase~~~COG0737
MKRFLGILVFVVMVLSVFAGPNHLVIFHMNDTHGHVWGTEDGGGFARAATLINQAREEVAKEGGAVLFLHAGDVNTGIPE
SDQLDAVPDFLALHYMGLDAMSLGNHEFDKPFEVLEKQYEVAQFPFLGANFVNEKRGGPVFEPYIIKDYGDFSVGIIGLV
TEQTKVLEPIYLGENTIVDAEETLNMYLPIVQEKADVVIVLAHLGYHADGGRPNLSVEFTTSDELAENVSGVDIIIDGHS
HTLLETPVVINNVIVAQAGDNAENIGRIDLWIDDGRIVDWRGEVIPLTSDIPEDPFIKMFTDAFYQLGSEALNEVVGVTK
VYLDGERAHVRSDETNLSNLIADGMIWKTGADVALMNGGGIRASIEAGEITYRDILTVLPFGNTLYVLELTGKDIMDVLN
YAATIPDGQGAKLHVAGLTAEIKGGKATNVKINGKPIDLNKTYKVVTNNYVAAGGDGYTMLAGKPGYDTYFRDADSLREY
IAHLGTIEDYTSQERLIELDQVK
>Q9X0V7 2.4.1.-~~~mggS~~~Mannosylglucosylglycerate synthase~~~COG0438
MKIALIHYRGGLMDGVSLEMEKWKKVLTKMGHEVHIVAENKKEGVDLTLKEIGFENPDFERVNRNFFGGIKDFLSEKEFL
DFLKEKEEELFHILNEALKDYDLIVPNNIWSLGLFPSLGLALSRLEKNFVAHHHDFWWERKHLIPENRRFREILDKHFPP
DLPNVKHVVINTIAQRELKRRRNIDSVVVPNVMDFSSPITSEEMYHRVREELQIAPGTIVALQATRIDRRKTIELSIDVV
SLLKETLTSKKEADLYNGERYSGEVILLFSGICEDEEYLKELKEYASSKGVSLLVLSEEVRKNTSLFWKLYNAADFVTYP
SILEGWGNQLLEAIAAKKPVVLFEYEVFKSDIKPAGLKYVSLGDRCFRENGLVKVDERILKKAVEEISRLLFDPSLYRET
VEHNFEVGKRHFSLERLEDILSREVLP
>P0AAG8 7.5.2.11~~~mglA~~~Galactose/methyl galactoside import ATP-binding protein MglA~~~COG1129
MVSSTTPSSGEYLLEMSGINKSFPGVKALDNVNLKVRPHSIHALMGENGAGKSTLLKCLFGIYQKDSGTILFQGKEIDFH
SAKEALENGISMVHQELNLVLQRSVMDNMWLGRYPTKGMFVDQDKMYRETKAIFDELDIDIDPRARVGTLSVSQMQMIEI
AKAFSYNAKIVIMDEPTSSLTEKEVNHLFTIIRKLKERGCGIVYISHKMEEIFQLCDEVTVLRDGQWIATEPLAGLTMDK
IIAMMVGRSLNQRFPDKENKPGEVILEVRNLTSLRQPSIRDVSFDLHKGEILGIAGLVGAKRTDIVETLFGIREKSAGTI
TLHGKQINNHNANEAINHGFALVTEERRSTGIYAYLDIGFNSLISNIRNYKNKVGLLDNSRMKSDTQWVIDSMRVKTPGH
RTQIGSLSGGNQQKVIIGRWLLTQPEILMLDEPTRGIDVGAKFEIYQLIAELAKKGKGIIIISSEMPELLGITDRILVMS
NGLVSGIVDTKTTTQNEILRLASLHL
>Q1DB04 ~~~mglA~~~Mutual gliding-motility protein MglA~~~COG1100
MSFINYSSREINCKIVYYGPGLCGKTTNLQYIYNKTAAETKGKLISLSTETDRTLFFDFLPLSLGEIRGFKTRFHLYTVP
GQVFYDASRKLILKGVDGVVFVADSQIERMEANMESLENLRINLAEQGYDLNKIPYVIQYNKRDLPNAVTVEEMRKALNH
RNIPEYQAVAPTGVGVFDTLKAVAKLVLTELKKGG
>P23924 7.5.2.11~~~mglA~~~Galactose/methyl galactoside import ATP-binding protein MglA~~~
MGSTISPPSGEYLLEMRGINKSFPGVKALDNVNLNVRPHSIHALMGENGAGKSTLLKCLFGIYQKDSGSIVFQGKEVDFH
SAKEALENGISMVHQELNLVLQRSVMDNMWLGRYPTKGMFVDQDKMYQDTKAIFDELDIDIDPRARVGTLSVSQMQMIEI
AKAFSYNAKIVIMDEPTSSLTEKEVNHLFTIIRKLKERGCGIVYISHKMEEIFQLCDEITILRDGQWIATQPLEGLDMDK
IIAMMVGRSLNQRFPDKENKPGDVILEVRHLTSLRQPSIRDVSFDLHKGEILGIAGLVGAKRTDIVETLFGIREKSSGTI
TLHGKKINNHTANEAINHGFALVTEERRSTGIYAYLDIGFNSLISNIRNYKNKVGLLDNSRMKSDTQWVIDSMRVKTPGH
RTQIGSLSGGNQQKVIIGRWLLTQPEILMLDEPTRGIDVGAKFEIYQLIAELAKKGKGIIIISSEMPELLGITDRILVMS
NGLVSGIVDTKTTTQNEILRLASLHL
>P0AEE5 ~~~mglB~~~D-galactose/methyl-galactoside binding periplasmic protein MglB~~~COG1879
MNKKVLTLSAVMASMLFGAAAHAADTRIGVTIYKYDDNFMSVVRKAIEQDAKAAPDVQLLMNDSQNDQSKQNDQIDVLLA
KGVKALAINLVDPAAAGTVIEKARGQNVPVVFFNKEPSRKALDSYDKAYYVGTDSKESGIIQGDLIAKHWAANQGWDLNK
DGQIQFVLLKGEPGHPDAEARTTYVIKELNDKGIKTEQLQLDTAMWDTAQAKDKMDAWLSGPNANKIEVVIANNDAMAMG
AVEALKAHNKSSIPVFGVDALPEALALVKSGALAGTVLNDANNQAKATFDLAKNLADGKGAADGTNWKIDNKVVRVPYVG
VDKDNLAEFSKK
>P44883 ~~~mglB~~~D-galactose/methyl-galactoside binding periplasmic protein MglB~~~COG1879
MKKTAVLSTVAFAIALGSASASFAADNRIGVTIYKYDDNFMSLMRKEIDKEAKVVGGIKLLMNDSQNAQSIQNDQVDILL
SKGVKALAINLVDPAAAPTIIGKAKSDNIPVVFFNKDPGAKAIGSYEQAYYVGTDPKESGLIQGDLIAKQWKANPALDLN
KDGKIQFVLLKGEPGHPDAEVRTKYVIEELNAKGIQTEQLFIDTGMWDAAMAKDKVDAWLSSSKANDIEVIISNNDGMAL
GALEATKAHGKKLPIFGVDALPEALQLISKGELAGTVLNDSVNQGKAVVQLSNNLAQGKSATEGTKWELKDRVVRIPYVG
VDKDNLGDFLK
>P23905 ~~~mglB~~~D-galactose/methyl-galactoside binding periplasmic protein MglB~~~
MNKKVLTLSAVMASLLFGAHAHAADTRIGVTIYKYDDNFMSVVRKAIEKDGKSAPDVQLLMNDSQNDQSKQNDQIDVLLA
KGVKALAINLVDPAAAGTVIEKARGQNVPVVFFNKEPSRKALDSYDKAYYVGTDSKESGVIQGDLIAKHWQANQGWDLNK
DGKIQYVLLKGEPGHPDAEARTTYVVKELNDKGIQTEQLALDTAMWDTAQAKDKMDAWLSGPNANKIEVVIANNDAMAMG
AVEALKAHNKSSIPVFGVDALPEALALVKSGAMAGTVLNDANNQAKATFDLAKNLAEGKGAADGTSWKIENKIVRVPYVG
VDKDNLSEFTQK
>Q08255 ~~~mglB~~~Glucose/galactose-binding lipoprotein~~~COG1879
MKENSCTACSRRLALFVGAAVLVVGCSSKTDVTLNRDKPLVFFNRQPSDPLTGKVDMAAMNWNDKTYYVGFDAKFGGSIQ
GKMILDFLASSESSVDRNGDGIIGYVLCIGDVGHNDSKVRTEGIRRALGTWTGSSDPGQAKEGQAVVGGKSYKVVELEGK
AMTGTDGSTWNTNSATESMGSWVAKFADKIDLVISNNDGMAMGCLQASNYPRGLPIFGYDANADAVESVGKGELTGTVSQ
NVDAQAVAVLQIIRNLLDGSSGEDVVANGISRPDAHGNKISAPVQYWEDVKAIMADNSEVTSANWKEYTRGARDAGVRQV
SAPTKKVLLTVHNASNDFLASAYLPALKHYAPLLNVDLTVVQGDGQNELSCLDKFTNLDMFDAFAVNMVKTNSGADYTDK
LKY
>P23200 ~~~mglC~~~Galactose/methyl galactoside import permease protein MglC~~~COG4211
MSALNKKSFLTYLKEGGIYVVLLVLLAIIIFQDPTFLSLLNLSNILTQSSVRIIIALGVAGLIVTQGTDLSAGRQVGLAA
VVAATLLQSMDNANKVFPEMATMPIALVILIVCAIGAVIGLINGLIIAYLNVTPFITTLGTMIIVYGINSLYYDFVGASP
ISGFDSGFSTFAQGFVALGSFRLSYITFYALIAVAFVWVLWNKTRFGKNIFAIGGNPEAAKVSGVNVGLNLLMIYALSGV
FYAFGGMLEAGRIGSATNNLGFMYELDAIAACVVGGVSFSGGVGTVIGVVTGVIIFTVINYGLTYIGVNPYWQYIIKGAI
IIFAVALDSLKYARKK
>Q56036 ~~~mglC~~~Galactose/methyl galactoside import permease protein MglC~~~
MSALNKKSFLTWLKEGGIYVVLLVLLAIIIFQDPTFLSLLNLSNILTQSSVRIIIALGVAGLIVTQGTDLSAGRQVGLAA
VVAATLLQSMENANKVFPEMATMPIALVILIVCAIGAVIGLVNGIIIAYLNVTPFITTLGTMIIVYGINSLYYDFVGASP
ISGFDSGFSTFAQGFVAMGSFRLSYITFYALIAVAFVWVLWNKTRFGKNIFAIGGNPEAAKVSGVNVALNLLMIYALSGV
FYAFGGLLEAGRIGSATNNLGFMYELDAIAACVVGGVSFSGGVGTVFGVVTGVIIFTVINYGLTYIGVNPYWQYIIKGGI
IIFAVALDSLKYARKK
>A0QNZ7 3.1.1.23~~~~~~Monoacylglycerol lipase~~~COG2267
MVSSTRSEHSFAGVGGVRIVYDVWTPDTDPRGVVVLAHGYAEHAGRYHHVAQRFGAAGLLVYALDHRGHGRSGGKRVHLR
DLSEFVEDFRTLVGIAANDHPTLPRIVLGHSMGGGIVFAYGARYPGEYSAMVLSGPAVNAHDGVSPVLVAVAKVLGKLAP
GIPVENLDADAVSRDPEVVAAYKADPMVHHGKLPAGIARALIGLGQSMPQRAAALTAPLLVVHGDKDRLIPVAGSRLLVD
RVASEDVHLKVYPGLYHEVFNEPEQKLVLDDVTSWIVSHL
>O07427 3.1.1.23~~~~~~Monoacylglycerol lipase~~~COG2267
MTTTRTERNFAGIGDVRIVYDVWTPDTAPQAVVVLAHGLGEHARRYDHVAQRLGAAGLVTYALDHRGHGRSGGKRVLVRD
ISEYTADFDTLVGIATREYPGCKRIVLGHSMGGGIVFAYGVERPDNYDLMVLSAPAVAAQDLVSPVVAVAAKLLGVVVPG
LPVQELDFTAISRDPEVVQAYNTDPLVHHGRVPAGIGRALLQVGETMPRRAPALTAPLLVLHGTDDRLIPIEGSRRLVEC
VGSADVQLKEYPGLYHEVFNEPERNQVLDDVVAWLTERL
>P82597 3.1.1.23~~~~~~Thermostable monoacylglycerol lipase~~~
MSEQYPVLSGAEPFYAENGPVGVLLVHGFTGTPHSMRPLAEAYAKAGYTVCLPRLKGHGTHYEDMERTTFHDWVASVEEG
YGWLKQRCQTIFVTGLSMGGTLTLYLAEHHPDICGIVPINAAVDIPAIAAGMTGGGELPRYLDSIGSDLKNPDVKELAYE
KTPTASLLQLARLMAQTKAKLDRIVCPALIFVSDEDHVVPPGNADIIFQGISSTEKEIVRLRNSYHVATLDYDQPMIIER
SLEFFAKHAG
>Q59268 5.4.99.4~~~mgm~~~2-methyleneglutarate mutase~~~
MQEKTKRIIKEDIEAVRAYSDCFDVEMPDLDENGEVIGLPAPYPREVAGTVRSGYRIYDLAKKAKERGWPIQNPILGRNT
AEETYGESQEMYAYADKFDETLFHFVHAEATRHIDPLKGRELINQSRGKGGITPIGEREFIAMGGGSKHPVRINATGDTP
HLSIINALIAGFDGTDIGPVIHVHFGGRGIHDYKTKVVNGYKAIQICAENNIFVQLDSHKHLNNIGGTDGMALAMCLLSE
GLAVHAGLPWELSAIQMNVAGINIYADLAVMRAFRKACHSKSIIAVPETFQNPPGNLVAEAAHFSRMAVTAKLGGADFYR
PKAAESVGIPTGDSMGQAIWGTEDVFGHVVNPDIQSPVIDAREAEIIDEALAVLEATLHLEGLTLEAMTDDFWKQWSDEA
LIDLIVAAGKAGVLDSQRAAGWDLKRHVVVNRDKDGITRYVKGYTPLGVDASRCAQSDEDVEVHVEKAPTRPEKIVLATV
GADAHVNGINVIREAFQDAGYDVVYLRGMNLPESVAEVAAEVGADAVGVSNLLGLGMELFPRVSKRLEELGLRDKMVVCA
GGRIAEKEEEHRQFEEKIQKEGSAFMGMDGFFGPGSSPEDCVKIIGDMINAKKA
>P22747 ~~~~~~Mgp-operon protein 3~~~
MKTMRKQIYKKAYWLLLPFLPLALANTFLVKEDSKNVTAYTPFATPITDSKSDLVSLAQLDSSYQIADQTIHNTNLFVLF
KSRDVKVKYESSGSNNISFDSTSQGEKPSYVVEFTNSTNIGIKWTMVKKYQLDVPNVSSDMNQVLKNLILEQPLTKYTLN
SSLAKEKGKTQREVHLGSGQANQWTSQRNQHDLNNNPSPNASTGFKLTTGNAYRKLSESWPIYEPIDGTKQGKGKDSSGW
SSTEENEAKNDAPSVSGGGSSSGTFNKYLNTKQALESIGILFDDQTPRNVITQLYYASTSKLAVTNNHIVVMGNSFLPSM
WYWVVERSAQENASNKPTWFANTNLDWGEDKQKQFVENQLGYKETTSTNSHNFHSKSFTQPAYLISGIDSVNDQIIFSGF
KAGSVGYDSSSSSSSSSSSSTKDQALAWSTTTSLDSKTGYKDLVTNDTGLNGPINGSFSIQDTFSFVVPYSGNHTNNGTT
GPIKTAYPVKKDQKSTVKINSLINATPLNSYGDEGIGVFDALGLNYNFKSNQERLPSRTDQIFVYGIVSPNELRSAKSSA
DSTGSDTKVNWSNTQSRYLPVPYNYSEGIIDADGFKRPENRGASVTTFSGLKSIAPDGFANSIANFSVGLKAGIDPNPVM
SGKKANYGAVVLTRGGVVRLNFNPGNDSLLSTTDNNIAPISFSFTPFTAAESAVDLTTFKEVTYNQESGLWSYIFDSSLK
PSHDGKQTPVTDNMGFSVITVSRTGIELNQDQATTTLDVAPSALAVQSGIQSTTQTLTGVLPLSEEFSAVIAKDSDQNKI
DIYKNNNGLFEIDTQLSNSVATNNGGLAPSYTENRVDAWGKVEFADNSVLQARNLVDKTVDEIINTPEILNSFFRFTPAF
EDQKATLVATKQSDTSLSVSPRIQFLDGNFYDLNSTIAGVPLNIGFPSRVFAGFAALPAWVIPVSVGSSVGILFILLVLG
LGIGIPMYRVRKLQDASFVNVFKKVDTLTTAVGSVYKKIITQTGVVKKAPSALKAANPSVKKPAAFLKPPVQPPSKPEGE
QKAVEVKSEETKS
>Q50341 ~~~~~~Mgp-operon protein 3~~~
MKSKLKLKRYLLFLPLLPLGTLSLANTYLLQDHNTLTPYTPFTTPLNGGLDVVRAAHLHPSYELVDWKRVGDTKLVALVR
SALVRVKFQDTTSSDQSNTNQNALSFDTQESQKALNGSQSGSSDTSGSNSQDFASYVLIFKAAPRATWVFERKIKLALPY
VKQESQGSGDQGSNGKGSLYKTLQDLLVEQPVTPYTPNAGLARVNGVAQDTVHFGSGQESSWNSQRSQKGLKNNPGPKAV
TGFKLDKGRAYRKLNESWPVYEPLDSTKEGKGKDESSWKNSEKTTAENDAPLVGMVGSGAAGSASSLQGNGSNSSGLKSL
LRSAPVSVPPSSTSNQTLSLSNPAPVGPQAVVSQPAGGATAAVSVNRTASDTATFSKYLNTAQALHQMGVIVPGLEKWGG
NNGTGVVASRQDATSTNLPHAAGASQTGLGTGSPREPALTATSQRAVTVVAGPLRAGNSSETDALPNVITQLYHTSTAQL
AYLNGQIVVMGSDRVPSLWYWVVGEDQESGKATWWAKTELNWGTDKQKQFVENQLGFKDDSNSDSKNSNLKAQGLTQPAY
LIAGLDVVADHLVFAAFKAGAVGYDMTTDSSASTYNQALAWSTTAGLDSDGGYKALVENTAGLNGPINGLFTLLDTFAYV
TPVSGMKGGSQNNEEVQTTYPVKSDQKATAKIASLINASPLNSYGDDGVTVFDALGLNFNFKLNEERLPSRTDQLLVYGI
VNESELKSARENAQSTSDDNSNTKVKWTNTASHYLPVPYYYSANFPEAGNRRRAEQRNGVKISTLESQATDGFANSLLNF
GTGLKAGVDPAPVARGHKPNYSAVLLVRGGVVRLNFNPDTDKLLDSTDKNSEPISFSYTPFGSAESAVDLTTLKDVTYIA
ESGLWFYTFDNGEKPTYDGKQQQVKNRKGYAVITVSRTGIEFNEDANTTTLSQAPAALAVQNGIASSQDDLTGILPLSDE
FSAVITKDQTWTGKVDIYKNTNGLFEKDDQLSENVKRRDNGLVPIYNEGIVDIWGRVDFAANSVLQARNLTDKTVDEVIN
NPDILQSFFKFTPAFDNQRAMLVGEKTSDTTLTVKPKIEYLDGNFYGEDSKIAGIPLNIDFPSRIFAGFAALPSWVIPVS
VGSSVGILLILLILGLGIGIPMYKVRKLQDSSFVDVFKKVDTLTTAVGSVYKKIITQTSVIKKAPSALKAANNAAPKAPV
KPAAPTAPRPPVQPPKKA
>Q5LH68 2.4.1.281~~~~~~4-O-beta-D-mannosyl-D-glucose phosphorylase~~~COG2152
MSLFNDKVAKLLAGHEALLMRKNEPVEEGNGVITRYRYPVLTAAHTPVFWRYDLNEETNPFLMERIGMNATLNAGAIKWD
GKYLMLVRVEGADRKSFFAVAESPNGIDNFRFWEYPVTLPEDVVPATNVYDMRLTAHEDGWIYGIFCAERHDDNAPIGDL
SSATATAGIARTKDLKNWERLPDLKTKSQQRNVVLHPEFVDGKYALYTRPQDGFIDTGSGGGIGWALIDDITHAEVGEEK
IIDKRYYHTIKEVKNGEGPHPIKTPQGWLHLAHGVRNCAAGLRYVLYMYMTSLDDPTRLIASPAGYFMAPVGEERIGDVS
NVLFSNGWIADDDGKVFIYYASSDTRMHVATSTIERLVDYCLHTPQDGFSSSASVEILKNLIERNLRLMK
>E6UIS7 2.4.1.281~~~~~~4-O-beta-D-mannosyl-D-glucose phosphorylase~~~COG2152
MIHEKYTEMRNEQEALLSRKNTKTSFYNGIYDRYEHPVLTREHIPLHWRYDLNKETNPFFQERLGINAVFNAGAIKLNDR
YCLVARVEGNDRKSFFAVAESDKGTEGFRFRQYPVCLPALTDDETNVYDMRLTQHEDGWIYGVFCVEKSAGTADLSEAVA
SAGIARTKDLTNWERLPDLVTLRSPQQRNVTLLPEFVDGKYAFYTRPMDGFIETGSGGGIGFGLADDITHAVIDEERMTS
IRRYHTITESKNGAGATPIKTERGWLNIAHGVRNTAAGLRYVIYCFVTDLSEPWKVIAEPGGYLIAPFKDERVGDVSNVV
FTNGAIVDDNGDVYIYYASSDTRLHVAVSSIDKLLDYAFNTPADALRTAECVKQRCDLIKRNIELL
>Q2G0B1 ~~~mgrA~~~HTH-type transcriptional regulator MgrA~~~COG1846
MSDQHNLKEQLCFSLYNAQRQVNRYYSNKVFKKYNLTYPQFLVLTILWDESPVNVKKVVTELALDTGTVSPLLKRMEQVD
LIKRERSEVDQREVFIHLTDKSETIRPELSNASDKVASASSLSQDEVKELNRLLGKVIHAFDETKEK
>Q7A6X2 ~~~mgrA~~~HTH-type transcriptional regulator MgrA~~~
MSDQHNLKEQLCFSLYNAQRQVNRYYSNKVFKKYNLTYPQFLVLTILWDESPVNVKKVVTELALDTGTVSPLLKRMEQVD
LIKRERSEVDQREVFIHLTDKSETIRPELSNASDKVASASSLSQDEVKELNRLLGKVIHAFDETKEK
>P0C1S0 ~~~mgrA~~~HTH-type transcriptional regulator MgrA~~~
MSDQHNLKEQLCFSLYNAQRQVNRYYSNKVFKKYNLTYPQFLVLTILWDESPVNVKKVVTELALDTGTVSPLLKRMEQVD
LIKRERSEVDQREVFIHLTDKSETIRPELSNASDKVASASSLSQDEVKELNRLLGKVIHAFDETKEK
>P64512 ~~~mgrB~~~PhoP/PhoQ regulator MgrB~~~
MKKFRWVVLVVVVLACLLLWAQVFNMMCDQDVQFFSGICAINQFIPW
>P42980 4.2.3.3~~~mgsA~~~Methylglyoxal synthase~~~COG1803
MKIALIAHDKKKQDMVQFTTAYRDILKNHDLYATGTTGLKIHEATGLQIERFQSGPLGGDQQIGALIAANALDLVIFLRD
PLTAQPHEPDVSALIRLCDVYSIPLATNMGTAEILVRTLDEGVFEFRDLLRGEEPNV
>P0A731 4.2.3.3~~~mgsA~~~Methylglyoxal synthase~~~COG1803
MELTTRTLPARKHIALVAHDHCKQMLMSWVERHQPLLEQHVLYATGTTGNLISRATGMNVNAMLSGPMGGDQQVGALISE
GKIDVLIFFWDPLNAVPHDPDVKALLRLATVWNIPVATNVATADFIIQSPHFNDAVDILIPDYQRYLADRLK
>Q9X0R7 4.2.3.3~~~mgsA~~~Methylglyoxal synthase~~~COG1803
MDKKKRIALIAHDRRKRDLLEWVSFNLGTLSKHELYATGTTGALLQEKLGLKVHRLKSGPLGGDQQIGAMIAEGKIDVLI
FFWDPLEPQAHDVDVKALIRIATVYNIPVAITRSTADFLISSPLMNDVYEKIQIDYEEELERRIRKVVEGEEEET
>Q5SHD6 4.2.3.3~~~mgsA~~~Methylglyoxal synthase~~~COG1803
MKALALIAHDAKKDEMVAFCLRHKDVLARYPLLATGTTGARIQEATGLAVERVLSGPLGGDLQIGARVAEGKVLAVVFLQ
DPLTAKPHEPDVQALMRVCNVHGVPLATNLVAAEALIAWIRKGTPQ
>P54503 ~~~mgsR~~~Regulatory protein MgsR~~~COG1393
MEQQLTFYSYPSCTSCRKTKHWLKAHQIEFNERHLFRETPTREELKYILSLTTEGIDEILATRSQTFKNLNLNIEEMTVN
EVLELLIEKPKLLRRPILVDNKKLVIGYNPGELLKLSKKKTVHQSA
>Q9RFR0 2.4.1.269~~~mgs~~~Mannosylglycerate synthase~~~
MSLVVFPFKHEHPEVLLHNVRVAAAHPRVHEVLCIGYERDQTYEAVERAAPEISRATGTPVSVRLQERLGTLRPGKGDGM
NTALRYFLEETQWERIHFYDADITSFGPDWITKAEEAADFGYGLVRHYFPRASTDAMITWMITRTGFALLWPHTELSWIE
QPLGGELLMRREVAAMLYEDERVRRRSDWGIDTLYTFVTVQQGVSIYECYIPEGKAHRLYGGLDDLRTMLVECFAAIQSL
QHEVVGQPAIHRQEHPHRVPVHIAERVGYDVEATLHRLMQHWTPRQVELLELFTTPVREGLRTCQRRPAFNFMDEMAWAA
TYHVLLEHFQPGDPDWEELLFKLWTTRVLNYTMTVALRGYDYAQQYLYRMLGRYRYQAALENGRGHPVPPRAALSTA
>Q8NT41 2.4.1.-~~~mgtA~~~GDP-mannose-dependent alpha-mannosyltransferase~~~COG0438
MEIIRPMRVAIVAESFLPNVNGVTNSVLRVLEHLKANGHDALVIAPGARDFEEEIGHYLGFEIVRVPTVRVPLIDSLPIG
VPLPSVTSVLREYNPDIIHLASPFVLGGAAAFAARQLRIPAIAIYQTDVAGFSQRYHLAPLATASWEWIKTVHNMCQRTL
APSSMSIDELRDHGINDIFHWARGVDSKRFHPGKRSVALRKSWDPSGAKKIVGFVGRLASEKGVECLAGLSGRSDIQLVI
VGDGPEAKYLQEMMPDAIFTGALGGEELATTYASLDLFVHPGEFETFCQAIQEAQASGVPTIGPRAGGPIDLINEGVNGL
LLDVVDFKETLPAAAEWILDDSRHSEMCAAAWEGVKDKTWEALCTQLLQHYADVIALSQRIPLTFFGPSAEVAKLPLWVA
RALGVRTRISIEA
>P9WMY5 2.4.1.-~~~mgtA~~~GDP-mannose-dependent alpha-mannosyltransferase~~~COG0438
MCGVRVAIVAESFLPQVNGVSNSVVKVLEHLRRTGHEALVIAPDTPPGEDRAERLHDGVRVHRVPSRMFPKVTTLPLGVP
TFRMLRALRGFDPDVVHLASPALLGYGGLHAARRLGVPTVAVYQTDVPGFASSYGIPMTARAAWAWFRHLHRLADRTLAP
STATMESLIAQGIPRVHRWARGVDVQRFAPSARNEVLRRRWSPDGKPIVGFVGRLAPEKHVDRLTGLAASGAVRLVIVGD
GIDRARLQSAMPTAVFTGARYGKELAEAYASMDVFVHSGEHETFCQVVQEALASGLPVIAPDAGGPRDLITPHRTGLLLP
VGEFEHRLPDAVAHLVHERQRYALAARRSVLGRSWPVVCDELLGHYEAVRGRRTTQAA
>P80099 2.4.1.25~~~mgtA~~~4-alpha-glucanotransferase~~~COG0366
MIGYQIYVRSFRDGNLDGVGDFRGLKNAVSYLKELGIDFVWLMPVFSSISFHGYDVVDFYSFKAEYGSEREFKEMIEAFH
DSGIKVVLDLPIHHTGFLHTWFQKALKGDPHYRDYYVWANKETDLDERREWDGEKIWHPLEDGRFYRGLFGPFSPDLNYD
NPQVFDEMKRLVLHLLDMGVDGFRFDAAKHMRDTIEQNVRFWKYFLSDLKGIFLAEIWAEARMVDEHGRIFGYMLNFDTS
HCIKEAVWKENTRVLIESIERAVIGKDYLPVNFTSNHDMSRLASFEGGFSKEKIKLSISILFTLPGVPLVFYGDELGMKG
VYQKPNTEVVLDPFPWNESMCVEGQTFWKWPAYNGPFSGISVEYQKRDPDSILSHTLGWTRFRKENQWIDRAKLEFLCKE
DKFLVYRLYDDQHSLKVFHNLSGEEVVFEGVKMKPYKTEVV
>O86956 2.4.1.25~~~mgtA~~~4-alpha-glucanotransferase~~~
MIGYQIYVRSFRDGNFDGVGDFKGLKGAISYLKELGVDFVWLMPVFSSISFHGYDVVDFYSFKAEYGDEKDFREMIEAFH
DNGIKVVLDLPIHHTGFLHTWFQKALKGDPHYRDYYVWASEKTDLDERREWDNERIWHPLEDGRFYRGLFGPLSPDLNYD
NPQVFEEMKKVVYHLLEMGVDGFRFDAAKHMRDTLEQNVRFWRYFLSDIEGIFLAEIWAESKVVDEHGRIFGYMLNFDTS
HCIKEAVWKENFKVLIESIERALVGKDYLPVNFTSNHDMSRLASFEGGLSEEKVKLSLSILFTLPGVPLIFYGDELGMKG
IYRKPNTEVVLDPFPWSENISLEGQTFWKWPAYNSPFSGVSVEYQKKKRDSILLHIMKWTGFRKENHWLDRANIEFLCKE
EKLLHVYRLVDEGRSLKVIHNLSNGEMVFEGVRVQPYSTEVI
>D0ZLQ7 ~~~mgtC~~~Protein MgtC~~~
MEERMLMFPYILNLLAAMLLGALIGAERQWRQRMAGLRTNALVATGAAVFILSSMTTSPDSPGRIAAQIVSGIGFLGAGV
IMREGMNVRGLNTAATLWCSAGIGVLCGLGQFKNALAATIIILCANILLREAAQRINQLPISAEGEKRYILKVTCNKEDE
SAVRQWLLNIVKEAAICLQGLGSVPAQEQGYKEIRAELVGHADYRKTRELIISRIGDNDNITAIHWSIDSQ
>P0CI70 ~~~mgtC~~~Protein MgtC~~~
MEERMLMFPYILNLLAAMLLGALIGAERQWRQRMAGLRTNALVATGAAVFILSSMTTSPDSPGRIAAQIVSGIGFLGAGV
IMREGMNVRGLNTAATLWCSAGIGVLCGLGQFKNALAATIIILCANILLREAAQRINQLPISAEGEKRYILKVTCNKEDE
SAVRQWLLNIVKEAAICLQGLGSVPAQEQGYKEIRAELVGHADYRKTRELIISRIGDNDNITAIHWSIDSQ
>O34442 ~~~mgtE~~~Magnesium transporter MgtE~~~COG2239
MVQNMTYDELILRIIILLRDGKIRDFRSVIDELQPYDMAFIFKEMPEKHRARYLSYLTVDDITDMIGELEREFQLVVLNK
VGKTKATLAMNKMDNDDLAQLLEEMDEELKEQLLSSMEASESKAVQLLMNYPADSAGRMMTNRYVWIPQHYTVKDAVVKL
KSFAEIAESINYLYVINESKQLVGVLSYRDLILGEPEEKVQDLMFTRVISADALQDQEEVARLIERYDFLAIPVVEENNV
LVGIVTVDDIIDVVIREADEDYEKFAASGKDITFDTKAYVAAYRRLPWLILLLFIGLISGSIISYFEDALKQVVALAFFM
PMVSGMTGNTGTQSLAVVIRGLSKEEMNKKTIVRLIFREFRTSIFIGAVCSVLIAIVSIIWQGNALLGFVVASSLFLTLI
IGTMSGTIIPIILHKLKVDPAIASGPLITTLNDILSLLIYFGIATAFIHSL
>Q830V1 ~~~mgtE~~~Magnesium transporter MgtE~~~COG2239
MNEGQEMEEQFALLLETLKNQQMNEFRELFLALHIYEQGQFYQSLDEKDRQHLYNYLSPKELADMFDVIEEDNENMKDYL
AEMRPSYAADMLAEMYTDNAVDLLNMLDKSQKAKYLSLLSSEEAGEIKELLHYEDETAGAIMTTEFVSIVANQTVRSAMY
VLKNQADMAETIYYVYVVDQENHLVGVISLRDLIVNDDDTLIADILNERVISVHVGDDQEDVAQTIRDYDFLAVPVTDYD
DHLLGIVTVDDIIDVIDDEAASDYSGLAGVDVEEVSENPLKAASKRLPWLITLLFLGMSTASLISNYESLVSEASILAVF
ISLITGTAGNAGTQSLAVAVRRLAMKDEKDSNFGRLILSEVLTGLVTGAVTGLTIMIVVGVWQHNLPLGFVIGMAMLCAI
TVANLAGSLIPMLMDKLGFDPAVASGPFITTLSDLTSVLIYFNIASMFMRYFV
>Q5SMG8 ~~~mgtE~~~Magnesium transporter MgtE~~~COG2239
MEEKLAVSLQEALQEGDTRALREVLEEIHPQDLLALWDELKGEHRYVVLTLLPKAKAAEVLSHLSPEEQAEYLKTLPPWR
LREILEELSLDDLADALQAVRKEDPAYFQRLKDLLDPRTRAEVEALARYEEDEAGGLMTPEYVAVREGMTVEEVLRFLRR
AAPDAETIYYIYVVDEKGRLKGVLSLRDLIVADPRTRVAEIMNPKVVYVRTDTDQEEVARLMADYDFTVLPVVDEEGRLV
GIVTVDDVLDVLEAEATEDIHKLGAVDVPDLVYSEAGPVALWLARVRWLVILILTGMVTSSILQGFESVLEAVTALAFYV
PVLLGTGGNTGNQSATLIIRALATRDLDLRDWRRVFLKEMGVGLLLGLTLSFLLVGKVYWDGHPLLLPVVGVSLVLIVFF
ANLVGAMLPFLLRRLGVDPALVSNPLVATLSDVTGLLIYLSVARLLLEAV
>A5A616 ~~~mgtS~~~Small protein MgtS~~~
MLGNMNVFMAVLGIILFSGFLAAYFSHKWDD
>P0DSF3 ~~~mgtT~~~Protein MgtT~~~
MNGDNPSPNRPLVTVVYKGPDFYDGEKKPPVNRR
>Q93Q23 2.4.1.129~~~mgt~~~Monofunctional glycosyltransferase~~~COG0744
MKRSDRYSNSNEHFEHMKHEPHYNTYYQPVGKPPKKKKSKRILLKILLTILIIIALFIGIMYFLSTRDNVDELRKIENKS
SFVSADNMPEYVKGAFISMEDERFYNHHGFDLKGTTRALFSTISDRDVQGGSTITQQVVKNYFYDNDRSFTRKVKELFVA
HRVEKQYNKNEILSFYLNNIYFGDNQYTLEGAANHYFGTTVNKNSTTMSHITVLQSAILASKVNAPSVYNINNMSENFTQ
RVSTNLEKMKQQNYINETQYQQAMSQLNR
>Q99T05 2.4.1.129~~~mgt~~~Monofunctional glycosyltransferase~~~
MKRSDRYSNSNEHFEHMKHEPHYNTYYQPVGKPPKKKKSKRILLKILLTILIIIALFIGIMYFLSTRDNVDELRKIENKS
SFVSADNMPEYVKGAFISMEDERFYNHHGFDLKGTTRALFSTISDRDVQGGSTITQQVVKNYFYDNDRSFTRKVKELFVA
HRVEKQYNKNEILSFYLNNIYFGDNQYTLEGAANHYFGTTVNKNSTTMSHITVLQSAILASKVNAPSVYNINNMSENFTQ
RVSTNLEKMKQQNYINETQYQQAMSQLNR
>Q7A4S6 2.4.1.129~~~mgt~~~Monofunctional glycosyltransferase~~~
MKRSDRYSNSNEHFEHMKHEPHYNTYYQPVGKPPKKKKSKRILLKILLTILIIIALFIGIMYFLSTRDNVDELRKIENKS
SFVSADNMPEYVKGAFISMEDERFYNHHGFDLKGTTRALFSTISDRDVQGGSTITQQVVKNYFYDNDRSFTRKVKELFVA
HRVEKQYNKNEILSFYLNNIYFGDNQYTLEGAANHYFGTTVNKNSTTMSHITVLQSAILASKVNAPSVYNINNMSENFTQ
RVSTNLEKMKQQNYINETQYQQAMSQLNR
>Q7A0I6 2.4.1.129~~~mgt~~~Monofunctional glycosyltransferase~~~
MKRSDRYSNSNEHFEHMKHEPHYNTYYQPVGKPPKKKKSKRILLKILLTILIIIALFIGIMYFLSTRDNVDELRKIENKS
SFVSADNMPEYVKGAFISMEDERFYNHHGFDLKGTTRALFSTISDRDVQGGSTITQQVVKNYFYDNDRSFTRKVKELFVA
HRVEKQYNKNEILSFYLNNIYFGDNQYTLEGAANHYFGTTVNKNSTTMSHITVLQSAILASKVNAPSVYNINNMSENFTQ
RVSTNLEKMKQQNYINETQYQQAMSQLNR
>P80435 2.7.7.97~~~acmA~~~3-hydroxy-4-methylanthranilate adenylyltransferase~~~
MADKWWGEQLLGRGDDGDLWAVSAAPVTRGELRAGVAGLRLRFRESGISEGSSVLLRMTPSFTYLQVLLALWSCGAQVVL
VDFRLKPAEFEPLVERVRPQYLVVAAGAGGPVTGFRQESDFEVRRLAGGRPAEDGVVLVQFSSGSTGRPKVIGRPAGSVL
AELDRHAGLPGTPGPGERVLLLNSVMHNMGLMTGVLHALAAGATLVVPPTFRPAEVLRLMARTEVSVMYGTPVHYDLLAR
TADRPERLSLRLAVSGGERVPEETRQRFLAAFGLPICQVYGVTEIGLIAGDLSGRCIPPEIGPPVPGVELEIDGEELLVR
MDRSPYLYGEHTDRYRDGWLRTFDRVGRDPETGVLSILGRSDSLVVVGGLKVDLTEVEAALLDHPRVAEVVVTHQDAIEA
FVGGDEDLTADELTAWCRERLSAVKIPKRFFVTRQLPRNSMGKLARDRALMHRHITSERSAQSRPAELKGAS
>C0LTL9 2.7.7.97~~~sibE~~~3-hydroxy-4-methylanthranilate adenylyltransferase~~~
MSALPSLRVPTLPVAFEAAARAAGDRIALEYGSSSITYRELDAAANRFARRLLREGVTPGSRVGLHMTRCLELYIAMIGL
LKAGAVVVPLNPSHPVTVRRSVVREADLPLTLRDVPAGLSSVTVERDVHELLAAGADLDDTSPGLCTDPESTAFILFTSG
STGRPRGVRIAHRGIARVASYNGEVEVRPDDCFLQLAPYSFAASTTDIWLSLLHGARVVVLPSQLPSLPKLAHTIKEYGV
TFLNLPGGLMNLLIDAHPEAFAKVRTVIVSGDFPSAPHLARVMKAVPGTVYNAFGCTENSALTAVHPMTPEDLQLGVVPI
GLPLPGVGLHVLGEDMTPCAPGEVGEMHISGAGLAQGYLGLPEETAAKFPTVDGVRMLRTGDWARTTPAGEVVLVGRTDQ
MVKIRGFRVELREVELAADQSGLVEKAVVRAVDATDGQKELVLFCTTATGEAPSIEALLADLKSRLPDYMLPARVHHLAE
LPVNVNGKLDRMALREPRAVLVEPDGTAQQQPVVRDIIATTVTRLLAVVTGREPIGVSDSFLASGANSLQVIQLAASLHD
VMGVDVRPEDIFQLDNAESLAGHIRALRQGHREVPA
>Q5EXK5 ~~~mhbT~~~3-hydroxybenzoate transporter MhbT~~~COG2271
MTQRLELQKLLNAAPVGARQWRVIICCFLVVMLDGFDTAAIGFIAPDIRSHWQLTAGDLAPLFGAGLLGLTAGALLCGPL
SDRFGRKRVIEFCVALFGLFSLLSAFAPDLQMLVFLRFLTGLGLGGAMPNTITMTSEYLPARRRGALVTLMFCGFTLGSA
FGGVVSAQLVPLIGWHGILVLGGVLPLILAVALVPLLPESPRWQIRRQLPQATIARTVGAITGERYNDTHFWLDEPAAGA
KGSISQLFAGRQLAITLMLWVVFFMSLLIIYLLSSWMPTLLNHRGIDLQHASWVTAAFQIGGTFGALLLGVLMDRFNPYR
VLALSYGLGAVCIVMIGLSENGLWLMALAIFGTGIGISGSQVGLNALTATLYPTQSRATGVSWSNAVGRCGAIVGSLSGG
VMMAMNFSFDTLFFVIAVPAVVSAVMLILLTLAVRNPTSVPAALPSAGIVNK
>A0A0K8P8E7 3.1.1.102~~~~~~Mono(2-hydroxyethyl) terephthalate hydrolase~~~
MQTTVTTMLLASVALAACAGGGSTPLPLPQQQPPQQEPPPPPVPLASRAACEALKDGNGDMVWPNAATVVEVAAWRDAAP
ATASAAALPEHCEVSGAIAKRTGIDGYPYEIKFRLRMPAEWNGRFFMEGGSGTNGSLSAATGSIGGGQIASALSRNFATI
ATDGGHDNAVNDNPDALGTVAFGLDPQARLDMGYNSYDQVTQAGKAAVARFYGRAADKSYFIGCSEGGREGMMLSQRFPS
HYDGIVAGAPGYQLPKAGISGAWTTQSLAPAAVGLDAQGVPLINKSFSDADLHLLSQAILGTCDALDGLADGIVDNYRAC
QAAFDPATAANPANGQALQCVGAKTADCLSPVQVTAIKRAMAGPVNSAGTPLYNRWAWDAGMSGLSGTTYNQGWRSWWLG
SFNSSANNAQRVSGFSARSWLVDFATPPEPMPMTQVAARMMKFDFDIDPLKIWATSGQFTQSSMDWHGATSTDLAAFRDR
GGKMILYHGMSDAAFSALDTADYYERLGAAMPGAAGFARLFLVPGMNHCSGGPGTDRFDMLTPLVAWVERGEAPDQISAW
SGTPGYFGVAARTRPLCPYPQIARYKGSGDINTEANFACAAPP
>P77397 1.14.13.127~~~mhpA~~~3-(3-hydroxy-phenyl)propionate/3-hydroxycinnamic acid hydroxylase~~~COG0654
MAIQHPDIQPAVNHSVQVAIAGAGPVGLMMANYLGQMGIDVLVVEKLDKLIDYPRAIGIDDEALRTMQSVGLVDDVLPHT
TPWHAMRFLTPKGRCFADIQPMTDEFGWPRRNAFIQPQVDAVMLEGVSRFPNVRCLFSRELEAFSQQDDEVTLHLKTAEG
QREIVKAQWLVACDGGASFVRRTLNVPFEGKTAPNQWIVVDIANDPLSTPHIYLCCDPVRPYVSAALPHAVRRFEFMVMP
GETEEQLREPQNMRKLLSKVLPNPDNVELIRQRVYTHNARLAQRFRIDRVLLAGDAAHIMPVWQGQGYNSGMRDAFNLAW
KLALVIQGKARDALLDTYQQERRDHAKAMIDLSVTAGNVLAPPKRWQGTLRDGVSWLLNYLPPVKRYFLEMRFKPMPQYY
GGALMREGEAKHSPVGKMFIQPKVTLENGDVTLLDNAIGANFAVIGWGCNPLWGMSDEQIQQWRALGTRFIQVVPEVQIH
TAQDNHDGVLRVGDTQGRLRSWFAQHNASLVVMRPDRFVAATAIPQTLGKTLNKLASVMTLTRPDADVSVEKVA
>Q9S157 1.13.11.16~~~mhpB~~~2,3-dihydroxyphenylpropionate/2,3-dihydroxicinnamic acid 1,2-dioxygenase~~~
MSGSAIATARRAFLGMSHSPLLGLNPVAADDQIAIDKAIAAARAAVHEFAPELIVLLGPDHYNGFFNELMPPFCIGSQAT
AVGDYLSPAGPLNVAGELAIALADHLMDRHFDIAVSRRMLVDHGFSQALQFLWGDEMDTPPVIPIFMNAVAQPGIARMAR
CKALGEGVGSFLDQLPLRTLLIGSGGLSHEPPVPTLAHPDPAVRERITVRSTPTEQERELKTERVKAAGLALAHGDSWMK
PLNPEWDLQWMDAMASGQLDGLCAMNEASIGAMAGNSAHESKTWLVARSALPANTRLSCPVRAYRAIPSLIAGYGVMFMH
H
>P17295 1.13.11.16~~~mhpB~~~2,3-dihydroxyphenylpropionate/2,3-dihydroxicinnamic acid 1,2-dioxygenase~~~
MPIQLECLSHTPLHGYVDPAPEVVAEVERVQAAARDRVRAFDPELVVVFAPDHFNGFFYDVMPPFCIGAAATAIGDFKSL
AGKLPVPADLALSLAESVMAADIDVALSHRMQVDHGCADALAALTGSLHRYPVIPVFINSVAPPMATLRRARLLGDAVGR
FLSRAGKRVLVVGSGGISHEPPVPELAGASEEVAERLIAGRNPSPESAARQARTVAAAKSFVAGDSHLHPLNPEWDRAFL
SLLASGELTAVDGMTNDAITRDGGKSAHEIRTWVAAFGALAAYGPYRASLDFYRAIPEWIAGFATMHAEPAAV
>P0ABR9 1.13.11.16~~~mhpB~~~2,3-dihydroxyphenylpropionate/2,3-dihydroxicinnamic acid 1,2-dioxygenase~~~COG3384
MHAYLHCLSHSPLVGYVDPAQEVLDEVNGVIASARERIAAFSPELVVLFAPDHYNGFFYDVMPPFCLGVGATAIGDFGSA
AGELPVPVELAEACAHAVMKSGIDLAVSYCMQVDHGFAQPLEFLLGGLDKVPVLPVFINGVATPLPGFQRTRMLGEAIGR
FTSTLNKRVLFLGSGGLSHQPPVPELAKADAHMRDRLLGSGKDLPASERELRQQRVISAAEKFVEDQRTLHPLNPIWDNQ
FMTLLEQGRIQELDAVSNEELSAIAGKSTHEIKTWVAAFAAISAFGNWRSEGRYYRPIPEWIAGFGSLSARTEN
>Q9KH19 1.13.11.16~~~mhpB~~~2,3-dihydroxyphenylpropionate/2,3-dihydroxicinnamic acid 1,2-dioxygenase~~~
MPVALCAMSHSPLMGRNDPEQEVIDAVDAAFDHARRFVADFAPDLIVIFAPDHYNGVFYDLLPPFCIGAAAQSVGDYGTE
AGPLDVDRDAAYAVARDVLDSGIDVAFSERMHVDHGFAQALQLLVGSITAVPTVPIFINSVAEPLGPVSRVRLLGEAVGR
AAAKLDKRVLFVGSGGLSHDPPVPQFATAPEEVRERLIDGRNPSAAERDAREQRVITAGRDFAAGTAAIQPLNPEWDRHL
LDVLASGDLEQIDAWTNDWFVEQAGHSSHEVRTWIAAYAAMSAAGKYRVTSTFYREIHEWIAGFGITTAVAVDE
>Q8KZP5 3.7.1.14~~~mhpC~~~2-hydroxy-6-oxononadienedioate/2-hydroxy-6-oxononatrienedioate hydrolase~~~
MSELNESTTSKFVTINEKGLSNFRIHLNDAGEGEAVIMLHGGGPGAGGWSNYYRNIGPFVKAGYRVILQDAPGFNKSDTV
VMDEQRGLVNARSVKGMMDVLGIEKAHLVGNSMGGAGALNFALEYPERTGKLILMGPGGLGNSLFTAMPMEGIKLLFKLY
AEPSLDTLKQMLNVFLFDQSLITDELVQGRWANIQRNPEHLKNFLLSSQKLPLSSWNVSPRMGEIKAKTLVTWGRDDRFV
PLDHGLKLVANMPDAQLHVFPRCGHWAQWEHADAFNRLTLDFLANG
>P77044 3.7.1.14~~~mhpC~~~2-hydroxy-6-oxononadienedioate/2-hydroxy-6-oxononatrienedioate hydrolase~~~COG0596
MSYQPQTEAATSRFLNVEEAGKTLRIHFNDCGQGDETVVLLHGSGPGATGWANFSRNIDPLVEAGYRVILLDCPGWGKSD
SVVNSGSRSDLNARILKSVVDQLDIAKIHLLGNSMGGHSSVAFTLKWPERVGKLVLMGGGTGGMSLFTPMPTEGIKRLNQ
LYRQPTIENLKLMMDIFVFDTSDLTDALFEARLNNMLSRRDHLENFVKSLEANPKQFPDFGPRLAEIKAQTLIVWGRNDR
FVPMDAGLRLLSGIAGSELHIFRDCGHWAQWEHADAFNQLVLNFLARP
>Q9S156 4.2.1.80~~~mhpD~~~2-keto-4-pentenoate hydratase~~~
MPTTTQIQAWAERLRHAEATATPIAPLREEITDNDSAYAVQLVNVQYAQSQGRRIVGRKIGLTSLAVQKQLGVDQPDFGT
LFADMLYGDDEAVPLSRTLQPKVEAEVALVLAKDLERPDTTLVDVISATAYVLPAIEIVGSRIADWNIRFIDTVADNASS
GLVVLGAVPTALNALDLKLCQMQMTRNGDVVSTGSGGACLGHPLNAAVWLARRLANLGQPLRAGDLVLTGALGPMVAVNA
GDRFEARISGIGSVCAQFEG
>P77608 4.2.1.80~~~mhpD~~~2-keto-4-pentenoate hydratase~~~COG3971
MTKHTLEQLAADLRRAAEQGEAIAPLRDLIGIDNAEAAYAIQHINVQHDVAQGRRVVGRKVGLTHPKVQQQLGVDQPDFG
TLFADMCYGDNEIIPFSRVLQPRIEAEIALVLNRDLPATDITFDELYNAIEWVLPALEVVGSRIRDWSIQFVDTVADNAS
CGVYVIGGPAQRPAGLDLKNCAMKMTRNNEEVSSGRGSECLGHPLNAAVWLARKMASLGEPLRTGDIILTGALGPMVAVN
AGDRFEAHIEGIGSVAATFSSAAPKGSLS
>P77589 ~~~mhpT~~~3-(3-hydroxy-phenyl)propionate transporter~~~COG2814
MSTRTPSSSSSRLMLTIGLCFLVALMEGLDLQAAGIAAGGIAQAFALDKMQMGWIFSAGILGLLPGALVGGMLADRYGRK
RILIGSVALFGLFSLATAIAWDFPSLVFARLMTGVGLGAALPNLIALTSEAAGPRFRGTAVSLMYCGVPIGAALAATLGF
AGANLAWQTVFWVGGVVPLILVPLLMRWLPESAVFAGEKQSAPPLRALFAPETATATLLLWLCYFFTLLVVYMLINWLPL
LLVEQGFQPSQAAGVMFALQMGAASGTLMLGALMDKLRPVTMSLLIYSGMLASLLALGTVSSFNGMLLAGFVAGLFATGG
QSVLYALAPLFYSSQIRATGVGTAVAVGRLGAMSGPLLAGKMLALGTGTVGVMAASAPGILVAGLAVFILMSRRSRIQPC
ADA
>O34689 1.13.11.-~~~mhqA~~~Putative ring-cleaving dioxygenase MhqA~~~COG0346
MKVNGIHHVSALTADAQKNLDFYKKVLGLKLVKKSVNQDEPTMYHLFYGDEVANPGTELTFFEIPRIAPFHAGTNSISSI
GLRVPGTEALHYWKERFEEQQVTHSGISKRAGRDILAFQDHEGQRLVLTADEEGKGYGLPVKQSGIPEEFSFRGLGPVEL
TVPYAEPTLHVLTNILGFTEISREPVEGQGTAVILESGEGGAATEIHLIERNDLPRERQGKGSVHHVAFRVRDEEELAGW
HRIISREGFSNSGIVERYYFKALYFREPNGILFELSTDGPGFMVDENLDELGQTIALPPYLEHRRAEIEAKLKPIQ
>O34842 3.1.-.-~~~mhqD~~~Putative hydrolase MhqD~~~COG0400
MKHIYEKGTSDNVLLLLHGTGGNEHDLLSLGRFIDPDAHLLGVRGSVLENGMPRFFKRLSEGVFDEKDLVVRTRELKDFI
DEAAETHQFNRGRVIAVGYSNGANIAASLLFHYKDVLKGAILHHPMVPIRGIELPDMAGLPVFIGAGKYDPLCTKEESEE
LYRYLRDSGASASVYWQDGGHQLTQHEAEQAREWYKEAIV
>O34543 1.13.11.-~~~mhqE~~~Putative ring-cleaving dioxygenase MhqE~~~COG0346
MKTEGLHHVTAFARDPQENLRFYTEVLGLRLVKKTVNFDDPGTYHFYFGNQNGDPGTIMTFFPFQGSGQGTVGKGQAGRV
YFSVPSGSLSFWKERLEKSGLSLEEKTLFGEKGLIFDDTEDLPLAIMEDAKSGKSEWTPDGITTNEAITGMKGVLLYSYD
PQATIQLLTESFGYTKVAEEDQIVRLASSAAVGGVIDVHLHPEKRGVGGYGTVHHVAFRTKKKEQAKWLPIIAENHLPSS
EILDREYFTSVYFREKGGILFEIATDEPGFMTDETFAELGTSLKLPEWLEKHRQQITDILPEL
>P96692 1.-.-.-~~~mhqN~~~Putative NAD(P)H nitroreductase MhqN~~~COG0778
MAEFTHLVNERRSASNFLSGHPITKEDLNEMFELVALAPSAFNLQHTKYVTVLDQDVKEKLKQAANGQYKVVSSSAVLLV
LGDKQAYQQAADIYEGLKVLGILNKQEYDHMVQDTVSFYENRGEQFKRDEAIRNASLSAMMFMLSAKEKGWDTCPMIGFD
AEAVKRILNIDDQFEVVMMITIGKEKTESRRPRGYRKPVNEFVEYM
>P96693 1.13.11.-~~~mhqO~~~Putative ring-cleaving dioxygenase MhqO~~~COG0346
MAKKTMGIHHITAIVGHPQENTDFYAGVLGLRLVKQTVNFDDPGTYHLYFGNEGGKPGTIITFFPWAGARQGVIGDGQVG
VTSYVVPKGAMAFWEKRLEKFNVPYTKIERFGEQYVEFDDPHGLHLEIVEREEGEANTWTFGEVTPDVAIKGFGGATLLS
EQPDKTADLLENIMGLERVGKEGDFVRYRSAGDIGNVIDLKLTPIGRGQMGAGTVHHIAWRANDDEDQLDWQRYIASHGY
GVTPVRDRNYFNAIYFREHGEILFEIATDPPGFAHDETQETMGEKLMLPVQYEPHRTQIEQGLLPFEVRELD
>P96694 1.-.-.-~~~mhqP~~~Putative oxidoreductase MhqP~~~COG2259
MEDAGLLLIRIMIGVVFLFYGTQKLFGWFGGYGIKGTGQWFESIGVKPGNVAAALSGLGELVSGILFILGVFLPLGAAII
TIIMLGAIVKVHGAKGFANGAGGFEYNLVLIAVSIGVALIGSGAYALHF
>O31672 ~~~mhqR~~~HTH-type transcriptional regulator MhqR~~~COG1846
MTEKSLKLFIVLSRAYRSINDHMNKHIHKHGLNPTEFAVLELLYHKGDQPLQQIGDKILLASGSITYVVDKLEQKELLIR
KASPTDRRVTFAQITEKGIGLLNDIFPDHAAEIDEMISVLSEEEVEMCTEMLKRVGLNAKQFHNK
>P9WKH3 1.14.99.57~~~mhuD~~~Heme oxygenase (mycobilin-producing)~~~COG2329
MPVVKINAIEVPAGAGPELEKRFAHRAHAVENSPGFLGFQLLRPVKGEERYFVVTHWESDEAFQAWANGPAIAAHAGHRA
NPVATGASLLEFEVVLDVGGTGKTA
>O68965 1.1.1.18~~~idhA~~~Inositol 2-dehydrogenase~~~COG0673
MTVRFGLLGAGRIGKVHAKAVSGNADARLVAVADAFPAAAEAIAGAYGCEVRTIDAIEAAADIDAVVICTPTDTHADLIE
RFARAGKAIFCEKPIDLDAERVRACLKVVSDTKAKLMVGFNRRFDPHFMAVRKAIDDGRIGEVEMVTITSRDPSAPPVDY
IKRSGGIFRDMTIHDFDMARFLLGEEPVSVTATAAVLIDKAIGDAGDYDSVSVILQTASGKQAIISNSRRATYGYDQRIE
VHGSKGAVAAENQRPVSIEIATGDGYTRPPLHDFFMTRYTEAYANEIESFIAAIEKGAEIAPSGNDGLAALALADAAVRS
VAEKRQISIA
>P16384 2.5.1.75~~~miaA~~~tRNA dimethylallyltransferase~~~COG0324
MSDISKASLPKAIFLMGPTASGKTALAIELRKILPVELISVDSALIYKGMDIGTAKPNAEELLAAPHRLLDIRDPSQAYS
AADFRRDALAEMADITAAGRIPLLVGGTMLYFKALLEGLSPLPSADPEVRARIEQQAAEQGWESLHRQLQEVDPVAAARI
HPNDPQRLSRALEVFFISGKTLTELTQTSGDALPYQVHQFAIAPASRELLHQRIEQRFHQMLASGFEAEVRALFARGDLH
TDLPSIRCVGYRQMWSYLEGEISYDEMVYRGVCATRQLAKRQITWLRGWEGVHWLDSEKPEQARDEVLQVVGAIAG
>Q9KAC3 2.5.1.75~~~miaA~~~tRNA dimethylallyltransferase~~~COG0324
MKEKLVAIVGPTAVGKTKTSVMLAKRLNGEVISGDSMQVYRGMDIGTAKITAEEMDGVPHHLIDIKDPSESFSVADFQDL
ATPLITEIHERGRLPFLVGGTGLYVNAVIHQFNLGDIRADEDYRHELEAFVNSYGVQALHDKLSKIDPKAAAAIHPNNYR
RVIRALEIIKLTGKTVTEQARHEEETPSPYNLVMIGLTMERDVLYDRINRRVDQMVEEGLIDEAKKLYDRGIRDCQSVQA
IGYKEMYDYLDGNVTLEEAIDTLKRNSRRYAKRQLTWFRNKANVTWFDMTDVDFDKKIMEIHNFIAGKLEEKSK
>P9WJW1 2.5.1.75~~~miaA~~~tRNA dimethylallyltransferase~~~COG0324
MRPLAIIGPTGAGKSQLALDVAARLGARVSVEIVNADAMQLYRGMDIGTAKLPVSERRGIPHHQLDVLDVTETATVARYQ
RAAAADIEAIAARGAVPVVVGGSMLYVQSLLDDWSFPATDPSVRARWERRLAEVGVDRLHAELARRDPAAAAAILPTDAR
RTVRALEVVELTGQPFAASAPRIGAPRWDTVIVGLDCQTTILDERLARRTDLMFDQGLVEEVRTLLRNGLREGVTASRAL
GYAQVIAALDAGAGADMMRAAREQTYLGTRRYVRRQRSWFRRDHRVHWLDAGVASSPDRARLVDDAVRLWRHVT
>Q9HUL9 2.5.1.75~~~miaA~~~tRNA dimethylallyltransferase~~~
MSSLPPAIFLMGPTAAGKTDLAMALADALPCELISVDSALIYRGMDIGTAKPSRELLARYPHRLIDIRDPAESYSAAEFR
ADALAAMAKATARGRIPLLVGGTMLYYKALLEGLADMPGADPEVRAAIEAEAQAEGWEALHRQLAEVDPESAARIHPNDP
QRLMRALEVYRLGGVSMSDLRRRQSAEKADFDASGRNQLPYTVAQLAIAPEQRQVLHARIAQRFRQMLEQGFIAEVEALH
ARSDLHAGLPSIRAVGYRQVWDYLDGKLSYAEMTERGIIATRQLAKRQFTWLRSWSHLHWMDSLAGDNLPRALKYLKTVS
ILA
>P37724 2.5.1.75~~~miaA~~~tRNA dimethylallyltransferase~~~
MNDVSKASLPKAIFLMGPTASGKTALAIELRKVLPVELISVDSALIYRGMDIGTAKPNADELKAAPHRLLDIRDPSQAYS
AADFRRDALAQMAEITAAGRIPLLVGGTMLYFKALLEGLSPLPSADPEVRSRIEQQAAELGWEALHQQLQEIDPVAAARI
HPNDPQRLSRALEVFFISGKTLTELTQTSGDALPYQVHQFAIAPASRELLHQRIELRFHQMLASGFEAEVRALFARGDLH
TDLPSIRCVGYRQMWSYIEGEISYDEMVYRGVCATRQLAKRQMTWLRGWEGVRWLDSENPDRARKEVLQVVGAIAD
>Q8CQL3 2.5.1.75~~~miaA~~~tRNA dimethylallyltransferase~~~COG0324
MTEMTKPFLIVIVGPTASGKTELSIEVAKKFNGEIISGDSMQVYQGMDIGTAKVTTEEMEGIPHYMIDILPPDASFSAYE
FKKRAEKYIKDITRRGKVPIIAGGTGLYIQSLLYNYAFEDESISEDKMKQVKLKLKELEHLNNNKLHEYLASFDKESAKD
IHPNNRKRVLRAIEYYLKTKKLLSSRKKVQQFTENYDTLLIGIEMSRETLYLRINKRVDIMLGHGLFNEVQHLVEQGFEA
SQSMQAIGYKELVPVIKGNISMENAVEKLKQHSRQYAKRQLTWFKNKMNVHWLNKERMSLQMMLDEITTQINKRSSNHDC
KRKHPRPSTREL
>Q97RW5 2.5.1.75~~~miaA~~~tRNA dimethylallyltransferase~~~COG0324
MKTKIIVIVGPTAVGKTALAIEVAKRFNGEVVSGDSQQVYRGLDIGTAKASPEEQAAVPHHLIDVREITESYSAFDFVSE
AKMTIEGIHNRGKLAIIAGGTGLYIQSLLEGYHLGGETPHEEILAYRASLEPYSDEELAHLVDQAGLEIPQFNRRRAMRA
LEIAHFGQDLENQETLYEPLIICLDDERSQLYERINHRVDLMFEAGLLDEAKWLFDHSPNVQAAKGIGYKELFPYFRGEQ
TLEEASESLKQATRRFAKRQLTWFRNRMQVTFYQIGESGVQDRILSQIEEFLDD
>O31778 2.8.4.3~~~miaB~~~tRNA-2-methylthio-N(6)-dimethylallyladenosine synthase~~~COG0621
MNEKQKLESGQVHPSDKKSEKDYSKYFEAVYIPPSLKDAKKRGKEAVTYHNDFKISEQFKGLGDGRKFYIRTYGCQMNEH
DTEVMAGIFMALGYEATNSVDDANVILLNTCAIRENAENKVFGELGHLKALKKNNPDLILGVCGCMSQEESVVNRILKKH
PFVDMIFGTHNIHRLPELLSEAYLSKEMVVEVWSKEGDVIENLPKVRNGKIKGWVNIMYGCDKFCTYCIVPYTRGKERSR
RPEDIIQEVRRLASEGYKEITLLGQNVNAYGKDFEDMTYGLGDLMDELRKIDIPRIRFTTSHPRDFDDRLIEVLAKGGNL
LDHIHLPVQSGSSEVLKLMARKYDRERYMELVRKIKEAMPNASLTTDIIVGFPNETDEQFEETLSLYREVEFDSAYTFIY
SPREGTPAAKMKDNVPMRVKKERLQRLNALVNEISAKKMKEYEGKVVEVLVEGESKNNPDILAGYTEKSKLVNFKGPKEA
IGKIVRVKIQQAKTWSLDGEMVGEAIEVK
>P0AEI1 2.8.4.3~~~miaB~~~tRNA-2-methylthio-N(6)-dimethylallyladenosine synthase~~~COG0621
MTKKLHIKTWGCQMNEYDSSKMADLLDATHGYQLTDVAEEADVLLLNTCSIREKAQEKVFHQLGRWKLLKEKNPDLIIGV
GGCVASQEGEHIRQRAHYVDIIFGPQTLHRLPEMINSVRGDRSPVVDISFPEIEKFDRLPEPRAEGPTAFVSIMEGCNKY
CTYCVVPYTRGEEVSRPSDDILFEIAQLAAQGVREVNLLGQNVNAWRGENYDGTTGSFADLLRLVAAIDGIDRIRFTTSH
PIEFTDDIIEVYRDTPELVSFLHLPVQSGSDRILNLMGRTHTALEYKAIIRKLRAARPDIQISSDFIVGFPGETTEDFEK
TMKLIADVNFDMSYSFIFSARPGTPAADMVDDVPEEEKKQRLYILQERINQQAMAWSRRMLGTTQRILVEGTSRKSIMEL
SGRTENNRVVNFEGTPDMIGKFVDVEITDVYPNSLRGKVVRTEDEMGLRVAETPESVIARTRKENDLGVGYYQP
>P9WK05 2.8.4.3~~~miaB~~~tRNA-2-methylthio-N(6)-dimethylallyladenosine synthase~~~COG0621
MSSASPLARCCDEATPSAGPRAAQPPYHGPVTSMVAHDAAAGVTGEGAGPPVRRAPARTYQVRTYGCQMNVHDSERLAGL
LEAAGYRRATDGSEADVVVFNTCAVRENADNRLYGNLSHLAPRKRANPDMQIAVGGCLAQKDRDAVLRRAPWVDVVFGTH
NIGSLPTLLERARHNKVAQVEIAEALQQFPSSLPSSRESAYAAWVSISVGCNNSCTFCIVPSLRGREVDRSPADILAEVR
SLVNDGVLEVTLLGQNVNAYGVSFADPALPRNRGAFAELLRACGDIDGLERVRFTSPHPAEFTDDVIEAMAQTRNVCPAL
HMPLQSGSDRILRAMRRSYRAERYLGIIERVRAAIPHAAITTDLIVGFPGETEEDFAATLDVVRRARFAAAFTFQYSKRP
GTPAAQLDGQLPKAVVQERYERLIALQEQISLEANRALVGQAVEVLVATGEGRKDTVTARMSGRARDGRLVHFTAGQPRV
RPGDVITTKVTEAAPHHLIADAGVLTHRRTRAGDAHTAGQPGRAVGLGMPGVGLPVSAAKPGGCR
>Q9RCI2 2.8.4.3~~~miaB~~~tRNA-2-methylthio-N(6)-dimethylallyladenosine synthase~~~
MTKKLHIKTWGCQMNEYDSSKMADLLDATHGYQLTDVAEEADVLLLNTCSIREKAQEKVFHQLGRWRLLKEKNPDLIIGV
GGCVASQEGEHIRQRAHYVDIIFGPQTLHRLPEMINSVRGDRSPVVDISFPEIEKFDRLPEPRAEGPTAFVSIMEGCNKY
CTYCVVPYTRGEEVSRPSDDILFEIAQLAAQGVREVNLLGQNVNAWRGENYDGTTGTFADLLRLVAAIDGIDRIRFTTSH
PIEFTDDIIEVYRDTPELVSFLHLPVQSGSDRVLNLMGRTHTALEYKAIIRKLRAARPDIQISSDFIVGFPGETTDDFEK
TMKLIADVNFDMSYSFIFSARPGTPAADMVDDVPEEEKKQRLYILQERINQQAMAWSRRMLGTTQRILVEGTSRKNIMEL
SGRTENNRVVNFEGTPEMIGKFVDVEITDVYPNSLRGKVVRTEDEMGLRVAETPESVIARTRKENELGVGFYQP
>Q7A5W3 2.8.4.3~~~miaB~~~tRNA-2-methylthio-N(6)-dimethylallyladenosine synthase~~~
MNEEQRKASSVDVLAERDKKAEKDYSKYFEHVYQPPNLKEAKKRGKQEVRYNRDFQIDEKYRGMGNERTFLIKTYGCQMN
AHDTEVIAGILEALGYQATTDINTADVILINTCAIRENAENKVFSEIGNLKHLKKERPDILIGVCGCMSQEESVVNKILK
SYQNVDMIFGTHNIHHLPEILEEAYLSKAMVVEVWSKEGDVIENLPKVREGNIKAWVNIMYGCDKFCTYCIVPFTRGKER
SRRPEDIIDEVRELAREGYKEITLLGQNVNSYGKDLQDIEYDLGDLLQAISKIAIPRVRFTTSHPWDFTDHMIDVISEGG
NIVPHIHLPVQSGNNAVLKIMGRKYTRESYLDLVKRIKDRIPNVALTTDIIVGYPNESEEQFEETLTLYDEVGFEHAYTY
LYSQRDGTPAAKMKDNVPLNVKKERLQRLNKKVGHYSQIAMSKYEGQTVTVLCEGSSKKDDQVLAGYTDKNKLVNFKAPK
EMIGKLVEVRIDEAKQYSLNGSFVKEVEPEMVIQ
>Q9WZC1 2.8.4.3~~~miaB~~~tRNA-2-methylthio-N(6)-dimethylallyladenosine synthase~~~COG0621
MRFYIKTFGCQMNENDSEAMAGLLVKEGFTPASSPEEADVVIINTCAVRRKSEEKAYSELGQVLKLKKKKKIVVGVAGCV
AEKEREKFLEKGADFVLGTRAVPRVTEAVKKALEGEKVALFEDHLDEYTHELPRIRTSRHHAWVTIIHGCDRFCTYCIVP
YTRGRERSRPMADILEEVKKLAEQGYREVTFLGQNVDAYGKDLKDGSSLAKLLEEASKIEGIERIWFLTSYPTDFSDELI
EVIAKNPKVAKSVHLPVQSGSNRILKLMNRRYTKEEYLALLEKIRSKVPEVAISSDIIVGFPTETEEDFMETVDLVEKAQ
FERLNLAIYSPREGTVAWKYYKDDVPYEEKVRRMQFLMNLQKRINRKLNERYRGKTVRIIVEAQAKNGLFYGRDIRNKII
AFEGEDWMIGRFADVKVEKITAGPLYGKVVWVEKTPSPVSSSE
>Q88KV1 1.14.99.69~~~miaE~~~tRNA 2-(methylsulfanyl)-N(6)-isopentenyladenosine(37) hydroxylase~~~COG4445
MSLIPEIDAFLGCPTPDAWIEAALADQETLLIDHKNCEFKAASTALSLIAKYNTHLDLINMMSRLAREELVHHEQVLRLM
KRRGVPLRPVSAGRYASGLRRLVRAHEPVKLVDTLVVGAFIEARSCERFAALVPHLDEELGRFYHGLLKSEARHYQGYLK
LAHNYGDEADIARCVELVRAAEMELIQSPDQELRFHSGIPQALAA
>Q08015 1.14.99.69~~~miaE~~~tRNA 2-(methylsulfanyl)-N(6)-isopentenyladenosine(37) hydroxylase~~~
MTVRQRLLSYFFNRLRMNYPQILSPVLNFLHCPTPQAWIVQARDPQNLPLLLTDHLICELKAAQTALLLVRKYVADKSGA
DALLSWLQPYEAFAFRQGPEPDFVALHKQISKSAMPQTDDPWGRQLIDRMVLLIKEELHHFWQVREVMQARNIPYVKITA
SRYAKGMLKAVRTHEPLTLIDKLICGAYIEARSCERFAALAPWLDEDLQTFYLSLLRSEARHYQDYLALAQQISAEDISA
RVRYFGEVEADLILSPDREFRFHSGVPAAG
>A4FG19 4.2.3.118~~~~~~2-methylisoborneol synthase~~~COG3170
MIELIGHETPVPSQQQHTGGVRGTSACTPPGVGERTTVLYCPPPPPERPEVAAEINRRVVVWMQGLGLGGEDNVAGVYKH
DPGRGITLCHPGSQDVERMTAAGKMIVAETAVDDYFCETNSRRDANDQTIGPNLSLAQSAIDAPRLTPDLQALWNKCRDD
HPVLRAQHEAFGDLERISSPAQAQRVRHDIAQLYLGYNAENGWRLLNRLPPVWQYLANRQMNSFRPCLNLTDALDGYELA
PQLYAHPLVQDCTARATLIATLYNDLASCEREIREHGLPFNLPAVIAAEERIALDEAFVRACEIHNELIQALEEATGHAA
SALADPALSRYLTGLWSWLAGSRHWHFTTARHRA
>A3KI17 4.2.3.118~~~~~~2-methylisoborneol synthase~~~
MPDSGPLGPHSPDHRPTPATTVPDAPASKPPDVAVTPTASEFLAALHPPVPIPSPSPPSGSASAAADTPDATTVGSALQR
ILRGPTGPGTAALALSVRHDPPSLPGSPAPAEPAAGRAVPGLYHHPVPEPDPARVEEVSRRIKRWAEDEVQLYPEDWEGE
FDGFSVGRYMVACHPDAPTVDHLMLATRLMVAENAVDDCYCEDHGGSPVGLGGRLLLAHTAIDPFHTTAEYAPPWRESLT
SDAPRRAYRSAMDYFVRAATPSQADRYRHDMARLHLGYLAEAAWAQTDHVPEVWEYLAMRQFNNFRPCPTITDTVGGYEL
PADLHARPDMQRVIALAGNATTIVNDLYSYTKELDSPGRHLNLPVVIAERERLSERDAYLKAVEVHNELQHAFEAAAAEL
AKACPLPTVLRFLKGVAAWVDGNHDWHRTNTYRYSLPDFW
>Q9F1Y6 4.2.3.118~~~~~~2-methylisoborneol synthase~~~COG3170
MPDSGTLGTPPPEQGPTPPTTLPDVPAPVIPSASVTSAASDFLAALHPPVTVPDPAPPPPPAPAAGNPPDTVTGDSVLQR
ILRGPTGPGTTSLAPAVRYGRQPGPEAPASAPPAAGRAVPGLYHHPVPEPDPVRVEEVSRRIKRWAEDEVQLYPEEWEGQ
FDGFSVGRYMVGCHPDAPTVDHLMLATRLMVAENAVDDCYCEDHGGSPVGLGGRLLLAHTAIDHFHSTAEYTPTWQASLA
ADAPRRAYDSAMGYFVRAATPSQSDRYRHDMARLHLGYLAEGAWAQTGHVPEVWEYLAMRQFNNFRPCPTITDTVGGYEL
PADLHARPDMQRVIALAGNATTIVNDLYSYTKELNSPGRHLNLPVVIAEREQLCERDAYLKAVEVHNELQHSFEAAAADL
AEACPLPPVLRFLRGVAAWVDGNHDWHRTNTYRYSLPDFW
>D3KYU2 4.2.3.118~~~tpc~~~2-methylisoborneol synthase~~~
MPDSGSLGPPTSLPEQPPAPPATAPDAPAATVTDRPVTSSVAHFLAGLHPPVTRPSSPPSPSMPPASSNPSSPPSSSMPP
ASWAPPSPLSPPAPSLPPTSPPATAPETSAATGSDSVVRRVPVGPTGLGTTALSLARRQAAVPPDAVPAPSGPSAEGPVV
PGLYHHPIPEPDPVRVAEVSRRIKRWAEDEVRLYPEEWEGQFDGFSVGRYMVACHPDAPTVDHLMLATRLMVAENAVDDC
YCEDHGGSPVGLGGRLLLAHTALDHLHTTAEYAPEWSESLGSDAPRRAYRSAMDHFVRAATPSQADRYRHDMARLHLGYL
AEAAWAETGHVPEVCEYLAMRQFNNFRPCPTITDTVGGYELPADLHARPDMQRVIALAGNATTIVNDLYSYTKELDSPGR
HLNLPVVIAEREHLSDRDAYLKAVEVHNELMHAFEAAAAELAADCPVPAVLRFLRGVAAWVDGNHDWHRTNTYRYSLPDF
W
>Q8YT18 4.2.1.1~~~~~~Metal-independent carbonic anhydrase~~~COG4337
MNLFKPRILVLFAATALISGIAIVAQTSVADSGDKITATSSLKTPIVNRAITESEVLAAQKAWGEALVAISTTYDAKGKA
SAKALAEKVIDDAYGYQFGPVLFKPTLAISPRTFRTTRAGALAYFVGDDKAFPEDKGFALSSWRKVEIKNAAIFITGNTA
TTMGNVIITDKQGKATTVDKTWQFLKDDHGKLRIITHHSSLPYEQ
>Q09T02 ~~~micA~~~Lantibiotic michiganin-A~~~
MNDILETETPVMVSPRWDMLLDAGEDTSPSVQTQIDAEFRRVVSPYMSSSGWLCTLTIECGTIICACR
>Q7WY64 ~~~mifM~~~Membrane protein insertion and folding monitor~~~
MTMFVESINDVLFLVDFFTIILPALTAIGIAFLLRECRAGEQWKSKRTDEHQTVFHINRTDFLIIIYHRITTWIRKVFRM
NSPVNDEEDAGSLLL
>O33855 6.2.1.2~~~mig~~~Medium-chain acyl-CoA ligase Mig~~~
MSDTTTAFTVPAVAKAVAAAIPDRELIIQGDRRYTYRQVIERSNRLAAYLHSQGLGCHTEREALAGHEVGQDLLGLYAYN
GNEFVEALLGAFAARVAPFNVNFRYVKSELHYLLADSEATALIYHAAFAPRVAEILPELPRLRVLIQIADESGNELLDGA
VDYEDALASVSAQPPPVRHCPDDLYVLYTGGTTGMPKGVLWRQHDIFMTSFGGRNLMTGEPSSSIDEIVQRAASGPGTKL
MILPPLIHGAAQWSVMTAITTGQTVVFPTVVDHLDAEDVVRTIEREKVMVVTVVGDAMARPLVAAIEKGIADVSSLAVVA
NGGALLTPFVKQRLIEVLPNAVVVDGVGSSETGAQMHHMSTPGAVATGTFNAGPDTFVAAEDLSAILPPGHEGMGWLAQR
GYVPLGYKGDAAKTAKTFPVIDGVRYAVPGDRARHHADGHIELLGRDSVCINSGGEKIFVEEVETAIASHPAVADVVVAG
RPSERWGQEVVAVVALSDGAAVDAGELIAHASNSLARYKLPKAIVFRPVIERSPSGKADYRWAREQAVNG
>Q0QLE6 5.3.3.6~~~mii~~~3-methylitaconate isomerase~~~
MSDQMRIPCVIMRAGTSKGIFLKGNDLPADQELRDKVILRIFGSPDVRQIDGLAGADPLTSKLAIIGPSTHPDADVDYTF
AQVSITDAVVDYNGNCGNISAGVGPFAIDESFVKAVEPMTRVCIHNTNTGKLLYAEVEVEDGKAKVSGDCKIDGVPGTNA
PELMDFSDTAGAATGKVLPTGNVVDVLSTSKGDIDVSIVDVANPCIFVHAKDVNMTGTETPDVINGNADLLAYLEEIRAK
CCVKIGMAATEKEASEKSPAFPMIAFVTKPEDYVDFSTGNTISGDDVDLVSRLMFMQVLHKTYAGTATACTGSAARIPGT
IVNQVLRDTGDEDTVRIGHPAGVIPVVSIVKDGKVEKAALIRTARRIMEGYVYVEKAKLV
>E9RFS9 1.14.13.227~~~mimA~~~Propane 2-monooxygenase, hydroxylase component large subunit~~~
MSRQSLTKAHAKISELTWEPTFATPATRFGTDYTFEKAPKKDPLKQIMRSYFSMEEEKDNRVYGAMDGAIRGNMFRQVQQ
RWLEWQKLFLSIIPFPEISAARAMPMAIDAVPNPEIHNGLAVQMIDEVRHSTIQMNLKKLYMNNYIDPAGFDMTEKAFAN
NYAGTIGRQFGEGFITGDAITAANIYLTVVAETAFTNTLFVAMPDEAAANGDYLLPTVFHSVQSDESRHISNGYSILLMA
LADERNRPLLERDLRYAWWNNHCVVDAAIGTFIEYGTKDRRKDRESYAEMWRRWIYDDYYRSYLLPLEKYGLTIPHDLVE
EAWKRIVEKGYVHEVARFFATGWPVNYWRIDTMTDTDFEWFEHKYPGWYNKFGKWWENYNRLAYPGRNKPIAFEEVGYQY
PHRCWTCMVPALIREDMIVEKVDGQWRTYCSETCYWTDAVAFRGEYEGRATPNMGRLTGFREWETLHHGKDLADIVTDLG
YVRDDGKTLVGQPHLDLDPQKMWTLDDVRGNTFNSPNVLLNQMTNDERDAHVAAYRAGGVPA
>A0QTU8 1.14.13.227~~~mimA~~~Propane 2-monooxygenase, hydroxylase component large subunit~~~COG3350
MSRQSLTKAHAKISELTWEPTFATPATRFGTDYTFEKAPKKDPLKQIMRSYFSMEEEKDNRVYGAMDGAIRGNMFRQVQQ
RWLEWQKLFLSIIPFPEISAARAMPMAIDAVPNPEIHNGLAVQMIDEVRHSTIQMNLKKLYMNNYIDPSGFDMTEKAFAN
NYAGTIGRQFGEGFITGDAITAANIYLTVVAETAFTNTLFVAMPDEAAANGDYLLPTVFHSVQSDESRHISNGYSILLMA
LADERNRPLLERDLRYAWWNNHCVVDAAIGTFIEYGTKDRRKDRESYAEMWRRWIYDDYYRSYLLPLEKYGLTIPHDLVE
EAWKRIVEKGYVHEVARFFATGWPVNYWRIDTMTDTDFEWFEHKYPGWYSKFGKWWENYNRLAYPGRNKPIAFEEVGYQY
PHRCWTCMVPALIREDMIVEKVDGQWRTYCSETCYWTDAVAFRGEYEGRETPNMGRLTGFREWETLHHGKDLADIVTDLG
YVRDDGKTLVGQPHLNLDPQKMWTLDDVRGNTFNSPNVLLNQMTDDERDAHVAAYRAGGVPA
>E9RFT0 1.18.1.-~~~mimB~~~Propane 2-monooxygenase, reductase component~~~
MADSHKINFEPVDIEMDVREDENILDAAFRQGIHLMHGCREGRCSACKSYVLDGEIQMENYSTFACNDAEVDEGFVLLCR
SHAFSDCTIELLNFDEDELLGGIPIQDVRTEVLAVEPKTRDIVSLRLKPVEPGKFDFKPGQYADLHIPGTEEHRSFSMAT
TPSCSDEVEFLIKKYPGGKFSALLDGHIQVGDEIALTGPYGSFTLKDGHVLPVVCIGGGAGMAPILSLLRHMNETGNGRP
ARFYYGARTAADLFYLDEILELGKGIKDFQFIACLSESAEGQVPGAVAVEEGMVTDVVARHESAIAKTEVYLCGPPPMVD
AALGFLDANSVPKDQVFYDSFTSPIFDQ
>A0QTU9 1.18.1.-~~~mimB~~~Propane 2-monooxygenase, reductase component~~~COG1018
MADSHKINFDPVDIEMEVREDENILDAAFRQGIHLMHGCREGRCSACKSYVLDGEIQMESYSTFACNDAEVDEGYVLLCR
SHAFSDCTIELLNFDEDELLGGIPIQDVRTQVQAVEPKTRDIVSLRLKPIEPGKFDFKPGQYADLHIPGTDEHRSFSMAT
TQSRSDEVEFLIKKYPGGKFSALLDGHIQVGDEIALTGPYGSFTLKDGHVLPVVCIGGGAGMAPILSLLRHMNETENSRP
ARFYYGARTPADLFYLDEILELGKGIKDFRFIACLSESADGEVPGRVTVEEGMVTDVVARHETAIAKTEVYLCGPPPMVD
AALMFLDANCVPKDQVFYDSFTSPIFDQ
>E9RFT1 1.14.13.227~~~mimC~~~Propane 2-monooxygenase, hydroxylase component small subunit~~~
MSAPEKPRERSFPKIEFTDSEAGAKVFPSSKSRSFSYFTPAKLRATMYEDVTVDVQPDPDRHLTQGWIYGFGNGPGGYPK
DWTTAKSSNWHAFLDPNEEWNQTIYRNNAAVVRQVELCLKNAKRARVYDGWHTIWLTFIERHVGAWMHAENGLALHVFTS
IQRSGPTNMINTAVAVNAAHKMRFAQDLALFNLDLAEATDAFDGSVHRAVWQEAPEWQPTRRVVEELTAVGDWCQLLFAT
NIVFEQLVGSLFRSELIMQIAARNGDYITPTIVGTGEHDYDRDLNYSRNLFRLLTRDPEHGEANKALFAEWLGIWVPRCL
DAARALQPIWSTPADKAVTFASSLKAAKAKFSALLEEIDLDIPEELDK
>I7FA35 1.14.13.227~~~mimC~~~Propane 2-monooxygenase, hydroxylase component small subunit~~~
MSAPEKPRERSFPKIEFTDSEAGAKEFPSSKSRSYSYFTPAKLRATMYEDVTVDVQPDPDRHLTQGWIYGFGNGPGGYPK
DWTTAKSSDWHAFRDPNEEWNQTIYRNNAAVVRQVELCLKNAKRARVYDGWNSTWLTFIERNVGAWMHAENGLALHVFTS
IQRSGPTNMINTAVAVNAAHKMRFAQDLALFNLDLAEATEAFDGSAHRAVWQEAREWQATRKVVEELTAVGDWCQLLFAT
NIVFEQLVGSLFRTELIMQIAARNGDYITPTIVGTGEHDYDRDLNYTRNLFRLLTRDPEHGEANKALFTEWLGIWVPRCL
DAALALQPIWSAPADKAVTFASSLDAAKAKFTALLEEIDLDIPEELNK
>E9RFT2 ~~~mimD~~~Propane 2-monooxygenase, effector component~~~
MSSMQFGAATEFSNRCGVTLMNTPIGRVVAEVMGAKEGVELTEYPSMIRVDGVKLLSFDYDELTDALGQEFDGSIFEEIS
STHYGRMVHLDDKTMLFASPEDAAEYIGFDLTAQ
>A0QTV0 ~~~mimD~~~Propane 2-monooxygenase, effector component~~~
MSSMQFGAATEFSNKCGVTLMNTPIGRVVAEVMGAKEGVELTEYPSMIRVDGVKLLSFDYDELTEALGEEFDGSIFEEIS
STHYGRMVHLDDKTMLFASPEDAAEYIGFDLTAK
>O67034 ~~~minC~~~Probable septum site-determining protein MinC~~~COG0850
MIEIKGKTLPVIQIKIKEKGNIDKLLKELKEKLSHNIFKGSLIILENPEVLKPEERKKVEEILKEFSRGFIEGKKEGKEK
REESRLLIIERTLRAGQRIEHRGDILILGDVNKDAEVLAGGNIIVMGKLRGVAKAGLIGDHSAVIVALKMEPQLLQIGKK
KAIMSEADRNSPGYPEVAKIEGEDIVLEPIEGAERWLKLLL
>Q01463 ~~~minC~~~Septum site-determining protein MinC~~~COG0850
MKTKKQQYVTIKGTKNGLTLHLDDACSFDELLDGLQNMLSIEQYTDGKGQKISVHVKLGNRFLYKEQEEQLTELIASKKD
LFVHSIDSEVITKKEAQQIREEAEIISVSKIVRSGQVLQVKGDLLLIGDVNPGGTVRAGGNIFVLGSLKGIAHAGFNGNN
QAVIAASEMLPTQLRINHVLNRSPDHIQKGNEMECAYLDTDGNMVIERLQHLAHLRPDLTRLEGGM
>P18196 ~~~minC~~~Septum site-determining protein MinC~~~COG0850
MSNTPIELKGSSFTLSVVHLHEAEPKVIHQALEDKIAQAPAFLKHAPVVLNVSALEDPVNWSAMHKAVSATGLRVIGVSG
CKDAQLKAEIEKMGLPILTEGKEKAPRPAPTPQAPAQNTTPVTKTRLIDTPVRSGQRIYAPQCDLIVTSHVSAGAELIAD
GNIHVYGMMRGRALAGASGDRETQIFCTNLMAELVSIAGEYWLSDQIPAEFYGKAARLQLVENALTVQPLN
>Q9HYZ7 ~~~minC~~~Probable septum site-determining protein MinC~~~
MSQADLLDQDPVFQLKGSMLAVTILELAHNDLARLERQLADKVAQAPNFFRDTPLVMALDKLPEGEGRLDLPALLEVCRR
HGLRTLAIRAGREEDIAAAQALDLPVLPPSGARERPLDIKDSAPRKPAEEPSPSAGEARPEPAKAEEKPAEPVSRPTKVV
KTPVRGGMQIYAAGGDLIVLAAVSPGAELLADGNIHVYGPMRGRALAGVKGDATARIFCQQLAAELVSIAGNYKVAEDLR
RSPQWGKAVHVSLSGDVLNITRL
>P65359 ~~~minC~~~Septum site-determining protein MinC~~~
MSNTPIELKGSSFTLSVVHLHEAEPEVIRQALEDKIAQAPAFLKHAPVVINVSGLESPVNWPELHKIVTSTGLRIIGVSG
CKDASLKVEIDRMGLPLLTEGKEKAVRPAPVEPATPSEPPQNANPITKTRLIDVPVRSGQRIYAPQCDLIVTSHVSAGAE
LIADGNIHVYGMMRGRALAGASGDREAQIFCTHLTAELVSIAGVYWLSDKIPAEFYGKAARLRLADNALTVQPLN
>Q9X0D7 ~~~minC~~~Probable septum site-determining protein MinC~~~COG0850
MVDFKMTKEGLVLLIKDYQNLEEVLNAISARITQMGGFFAKGDRISLMIENHNKHSQDIPRIVSHLRNLGLEVSQILVGS
TVEGKENDLKVQSRTTVESTGKVIKRNIRSGQTVVHSGDVIVFGNVNKGAEILAGGSVVVFGKAQGNIRAGLNEGGQAVV
AALDLQTSLIQIAGFITHSKGEENVPSIAHVKGNRIVIEPFDKVSFERSE
>Q01464 ~~~minD~~~Septum site-determining protein MinD~~~COG2894
MGEAIVITSGKGGVGKTTTSANLGTALAILGKRVCLVDTDIGLRNLDVVMGLENRIIYDLVDVVEGRCKMHQALVKDKRF
DDLLYLMPAAQTSDKTAVAPEQIKNMVQELKQEFDYVIIDCPAGIEQGYKNAVSGADKAIVVTTPEISAVRDADRIIGLL
EQEENIEPPRLVVNRIRNHLMKNGDTMDIDEIVQHLSIDLLGIVADDDEVIKASNHGEPIAMDPKNRASIAYRNIARRIL
GESVPLQVLEEQNKGMMAKIKSFFGVRS
>P0AEZ3 ~~~minD~~~Septum site-determining protein MinD~~~COG2894
MARIIVVTSGKGGVGKTTSSAAIATGLAQKGKKTVVIDFDIGLRNLDLIMGCERRVVYDFVNVIQGDATLNQALIKDKRT
ENLYILPASQTRDKDALTREGVAKVLDDLKAMDFEFIVCDSPAGIETGALMALYFADEAIITTNPEVSSVRDSDRILGIL
ASKSRRAENGEEPIKEHLLLTRYNPGRVSRGDMLSMEDVLEILRIKLVGVIPEDQSVLRASNQGEPVILDINADAGKAYA
DTVERLLGEERPFRFIEEEKKGFLKRLFGG
>Q7DDS7 ~~~minD~~~Septum site-determining protein MinD~~~
MAKIIVVTSGKGGVGKTTTSASIATGLALRGYKTAVIDFDVGLRNLDLIMGCERRVVYDLINVIQGEATLNQALIKDKNC
ENLFILPASQTRDKDALTREGVEKVMQELSGKKMGFEYIICDSPAGIEQGALMALYFADEAIVTTNPEVSSVRDSDRILG
ILQSKSHKAEQGGSVKEHLLITRYSPERVAKGEMLSVQDICDILHIPLLGVIPESQNVLQASNSGEPVIHQDSVAASEAY
KDVIARLLGENREMRFLEAEKKSFFKRLFGG
>P0A734 ~~~minE~~~Cell division topological specificity factor~~~COG0851
MALLDFFLSRKKNTANIAKERLQIIVAERRRSDAEPHYLPQLRKDILEVICKYVQIDPEMVTVQLEQKDGDISILELNVT
LPEAEELK
>O25099 ~~~minE~~~Cell division topological specificity factor~~~COG0851
MSLFDFFKNKGSAATATDRLKLILAKERTLNLPYMEEMRKEIIAVIQKYTKSSDIHFKTLDSNQSVETIEVEIILPR
>P58152 ~~~minE~~~Cell division topological specificity factor~~~
MSLIELLFGRKQKTATVARDRLQIIIAQERAQEGQTPDYLPTLRKELMEVLSKYVNVSLDNIRISQEKQDGMDVLELNIT
LPEQKKV
>O34375 ~~~minJ~~~Cell division topological determinant MinJ~~~COG0265
MSVQWGIELLKSAGLFFLHPLFWFFIIITLAFGYVRIKRERKTFHTRIADIYDDLKFTYTKGLIPGLLLSVILGGLGISI
PLGLLAIIAVITAAAAFTLRANWMSAAYIVSVSMLIGFGLQIYQAEPFLERFPQGFAVVWPAVAVFLGLLIITEGAVAYR
SAHVRTSPALVVSSRGLPIGQQLANRVWLLPLFLLVPGNGLESHLSWWPVFTVPGGSFHFLWIPYFVGFGQRVQGSLPET
SIRITAKRVCILGLAVAVLGAASLLWTPLAGAAVCTALLGRIFLSIKQRVNDNAAPFYFSKRDQGLMVLGIIPNTPAEDL
ELKIGEIITKVNGIPVKNVSDFYEALQHNRAYVKLEIIGLNGEIRFDQRASYEGEHHELGILFVKDDREDEAVASGS
>P03817 ~~~mioC~~~Protein MioC~~~COG0716
MADITLISGSTLGGAEYVAEHLAEKLEEAGFTTETLHGPLLEDLPASGIWLVISSTHGAGDIPDNLSPFYEALQEQKPDL
SAVRFGAIGIGSREYDTFCGAIDKLEAELKNSGAKQTGETLKINILDHDIPEDPAEEWLGSWVNLLK
>P0A908 ~~~mipA~~~MltA-interacting protein~~~COG3713
MTKLKLLALGVLIATSAGVAHAEGKFSLGAGVGVVEHPYKDYDTDVYPVPVINYEGDNFWFRGLGGGYYLWNDATDKLSI
TAYWSPLYFKAKDSGDHQMRHLDDRKSTMMAGLSYAHFTQYGYLRTTLAGDTLDNSNGIVWDMAWLYRYTNGGLTVTPGI
GVQWNSENQNEYYYGVSRKESARSGLRGYNPNDSWSPYLELSASYNFLGDWSVYGTARYTRLSDEVTDSPMVDKSWTGLI
STGITYKF
>P51752 5.2.1.8~~~mip~~~Peptidyl-prolyl cis-trans isomerase Mip~~~COG0545
MKRLILPFLSVGLLLGTTAHAATPLKTEQDKLSYSMGVMTGKAFRKHDIKIDPQTFSMGLSDAYLGKETQMTEAEMRQTL
QQFEKQSLQKMQHKMKQTAQQNAEKSRAFLTANKNKPGVKTLANGLQYKVLQAGQGQSPTLNDEVTVNYEGRLINGTVFD
SSYKRGQPATFPLKSVIKGWQEALTRMKPGAIWEIYVPPQLAYGEQGAPGVIGPNEALIFKVNLISVKKK
>Q5ZXE0 5.2.1.8~~~mip~~~Outer membrane protein MIP~~~COG0545
MKMKLVTAAVMGLAMSTAMAATDATSLATDKDKLSYSIGADLGKNFKNQGIDVNPEAMAKGMQDAMSGAQLALTEQQMKD
VLNKFQKDLMAKRTAEFNKKADENKVKGEAFLTENKNKPGVVVLPSGLQYKVINSGNGVKPGKSDTVTVEYTGRLIDGTV
FDSTEKTGKPATFQVSQVIPGWTEALQLMPAGSTWEIYVPSGLAYGPRSVGGPIGPNETLIFKIHLISVKKSS
>Q70YI1 5.2.1.8~~~mip~~~Outer membrane protein MIP~~~COG0545
MKMKLVTAAVMGLAMSTAMAATDATSLATDKDKLSYSIGADLGKNFKNQGIDVNPEAMAKGMQDAMSGAQLALTEQQMKD
VLNKFQKDLMAKRTAEFNKKADENKVKGEAFLTENKNKPGVVVLPSGLQYKVINAGNGVKPGKSDTVTVEYTGRLIDGTV
FDSTEKTGKPATFQVSQVIPGWTEALQLMPAGSTWEIYVPSGLAYGPRSVGGPIGPNETLIFKIHLISVKKSS
>Q01625 ~~~misCA~~~Membrane protein insertase MisCA~~~COG0706
MLLKRRIGLLLSMVGVFMLLAGCSSVKEPITADSPHFWDKYVVYPLSELITYVAKLTGDNYGLSIILVTILIRLLILPLM
IKQLRSSKAMQALQPEMQKLKEKYSSKDQKTQQKLQQETMALFQKHGVNPLAGCFPILIQMPILIGFYHAIMRTQAISEH
SFLWFDLGEKDPYYILPIVAGVATFVQQKLMMAGNAQQNPQMAMMLWIMPIMIIVFAINFPAALSLYWVVGNLFMIAQTF
LIKGPDIKKNPEPQKAGGKKK
>P54544 ~~~misCB~~~Membrane protein insertase MisCB~~~COG0706
MLKTYQKLLAMGIFLIVLCSGNAAFAATNQVGGLSNVGFFHDYLIEPFSALLKGVAGLFHGEYGLSIILVTIIVRIVVLP
LFVNQFKKQRIFQEKMAVIKPQVDSIQVKLKKTKDPEKQKELQMEMMKLYQEHNINPLAMGCLPMLIQSPIMIGLYYAIR
STPEIASHSFLWFSLGQSDILMSLSAGIMYFVQAYIAQKLSAKYSAVPQNPAAQQSAKLMVFIFPVMMTIFSLNVPAALP
LYWFTSGLFLTVQNIVLQMTHHKSKKTAALTESVK
>P9WQL5 ~~~mkl~~~Probable ribonucleotide transport ATP-binding protein mkl~~~COG1135
MRYSDSYHTTGRWQPRASTEGFPMGVSIEVNGLTKSFGSSRIWEDVTLTIPAGEVSVLLGPSGTGKSVFLKSLIGLLRPE
RGSIIIDGTDIIECSAKELYEIRTLFGVLFQDGALFGSMNLYDNTAFPLREHTKKKESEIRDIVMEKLALVGLGGDEKKF
PGEISGGMRKRAGLARALVLDPQIILCDEPDSGLDPVRTAYLSQLIMDINAQIDATILIVTHNINIARTVPDNMGMLFRK
HLVMFGPREVLLTSDEPVVRQFLNGRRIGPIGMSEEKDEATMAEEQALLDAGHHAGGVEEIEGVPPQISATPGMPERKAV
ARRQARVREMLHTLPKKAQAAILDDLEGTHKYAVHEIGQ
>P76506 ~~~mlaA~~~Intermembrane phospholipid transport system lipoprotein MlaA~~~COG2853
MKLRLSALALGTTLLVGCASSGTDQQGRSDPLEGFNRTMYNFNFNVLDPYIVRPVAVAWRDYVPQPARNGLSNFTGNLEE
PAVMVNYFLQGDPYQGMVHFTRFFLNTILGMGGFIDVAGMANPKLQRTEPHRFGSTLGHYGVGYGPYVQLPFYGSFTLRD
DGGDMADGFYPVLSWLTWPMSVGKWTLEGIETRAQLLDSDGLLRQSSDPYIMVREAYFQRHDFIANGGELKPQENPNAQA
IQDDLKDIDSE
>P43262 ~~~mlaA~~~Intermembrane phospholipid transport system lipoprotein MlaA~~~
MKLRLSALALGTTLLVGCASSGTDQQGRSDPLEGFNRTMYNFNFNVLDPYIVRPVAVAWRDYVPQPARNGLSNFTGNLEE
PAVMVNYFLQGDPYQGMVHFTRFFLNTILGMGGFIDVAGMANPKLQRTEPHRFGSTLGHYGVGYGPYVQLPFYGSFTLRD
DGGDMADALYPVLSWLTWPMSVGKWTLEGIETRAQLLDSDGLLRQSSDPYIMVREAYFQRHDFIANGGELKPQENPNAQA
IQDDLKDIDSE
>P64602 ~~~mlaB~~~Intermembrane phospholipid transport system binding protein MlaB~~~COG3113
MSESLSWMQTGDTLALSGELDQDVLLPLWEMREEAVKGITCIDLSRVSRVDTGGLALLLHLIDLAKKQGNNVTLQGVNDK
VYTLAKLYNLPADVLPR
>P0ADV7 ~~~mlaC~~~Intermembrane phospholipid transport system binding protein MlaC~~~COG2854
MFKRLMMVALLVIAPLSAATAADQTNPYKLMDEAAQKTFDRLKNEQPQIRANPDYLRTIVDQELLPYVQVKYAGALVLGQ
YYKSATPAQREAYFAAFREYLKQAYGQALAMYHGQTYQIAPEQPLGDKTIVPIRVTIIDPNGRPPVRLDFQWRKNSQTGN
WQAYDMIAEGVSMITTKQNEWGTLLRTKGIDGLTAQLKSISQQKITLEEKK
>P45028 ~~~mlaC~~~Intermembrane phospholipid transport system binding protein MlaC~~~COG2854
MNLIQLKKWFTILTFVLTAFLVTRTAIAETSPYVLMQQAADKLFSDIQANQSKIKQDPNYLRTIVRNDLLPYVNLEYAGS
KVLGSYYKSTSAEQREKFFKTFGELIEQKYAQALTNYSNQKIQIESEKELGDNNFINIRVNIIQANGVAPILLYFKWRKG
NKSGEWKVYDMVGAGVSMLEDTIKNWVGILNKQGIDTLITKMQQSASQPIIFNQ
>P64604 ~~~mlaD~~~Intermembrane phospholipid transport system binding protein MlaD~~~COG1463
MQTKKNEIWVGIFLLAALLAALFVCLKAANVTSIRTEPTYTLYATFDNIGGLKARSPVSIGGVVVGRVADITLDPKTYLP
RVTLEIEQRYNHIPDTSSLSIRTSGLLGEQYLALNVGFEDPELGTAILKDGDTIQDTKSAMVLEDLIGQFLYGSKGDDNK
NSGDAPAAAPGNNETTEPVGTTK
>P64606 ~~~mlaE~~~Intermembrane phospholipid transport system permease protein MlaE~~~COG0767
MLLNALASLGHKGIKTLRTFGRAGLMLFNALVGKPEFRKHAPLLVRQLYNVGVLSMLIIVVSGVFIGMVLGLQGYLVLTT
YSAETSLGMLVALSLLRELGPVVAALLFAGRAGSALTAEIGLMRATEQLSSMEMMAVDPLRRVISPRFWAGVISLPLLTV
IFVAVGIWGGSLVGVSWKGIDSGFFWSAMQNAVDWRMDLVNCLIKSVVFAITVTWISLFNGYDAIPTSAGISRATTRTVV
HSSLAVLGLDFVLTALMFGN
>P63386 7.6.2.-~~~mlaF~~~Intermembrane phospholipid transport system ATP-binding protein MlaF~~~COG1127
MEQSVANLVDMRDVSFTRGNRCIFDNISLTVPRGKITAIMGPSGIGKTTLLRLIGGQIAPDHGEILFDGENIPAMSRSRL
YTVRKRMSMLFQSGALFTDMNVFDNVAYPLREHTQLPAPLLHSTVMMKLEAVGLRGAAKLMPSELSGGMARRAALARAIA
LEPDLIMFDEPFVGQDPITMGVLVKLISELNSALGVTCVVVSHDVPEVLSIADHAWILADKKIVAHGSAQALQANPDPRV
RQFLDGIADGPVPFRYPAGDYHADLLPGS
>P45031 7.6.2.-~~~mlaF~~~Intermembrane phospholipid transport system ATP-binding protein MlaF~~~COG1127
MNQNLIEVKNLTFKRGDRVIYDNLNLQVKKGKITAIMGPSGIGKTTLLKLIGGQLMPEQGEILFDGQDICRLSNRELYEV
RKRMGMLFQSGALFTDISTFDNVAFPIREHTHLPENLIRQIVLMKLEAVGLRGAAALMPSELSGGMARRAALARAIALDP
DLIMFDEPFTGQDPISMGVILSLIKRLNEALNLTSIVVSHDVEEVLSIADYAYIIADQKVIAEGTSEQLLQSQDLRVVQF
LKGESDGPVRFKYPAQDYVKELFE
>P50456 ~~~mlc~~~Protein mlc~~~COG1940
MVAENQPGHIDQIKQTNAGAVYRLIDQLGPVSRIDLSRLAQLAPASITKIVREMLEAHLVQELEIKEAGNRGRPAVGLVV
ETEAWHYLSLRISRGEIFLALRDLSSKLVVEESQELALKDDLPLLDRIISHIDQFFIRHQKKLERLTSIAITLPGIIDTE
NGIVHRMPFYEDVKEMPLGEALEQHTGVPVYIQHDISAWTMAEALFGASRGARDVIQVVIDHNVGAGVITDGHLLHAGSS
SLVEIGHTQVDPYGKRCYCGNHGCLETIASVDSILELAQLRLNQSMSSMLHGQPLTVDSLCQAALRGDLLAKDIITGVGA
HVGRILAIMVNLFNPQKILIGSPLSKAADILFPVISDSIRQQALPAYSQHISVESTQFSNQGTMAGAALVKDAMYNGSLL
IRLLQG
>P54571 ~~~mleN~~~Malate-2H(+)/Na(+)-lactate antiporter~~~COG1757
MKDVRLPTLFEIIIVLGVFLALVLSFTVFLDLPIQLALFVSWFIAMLLGIRLGYSYKDLQNAIVHGISNGLEAVLILVSV
GALIGTWIAGGVVPTLIYYGLEFIHPSIFLLATLIICSIMSVATGTSWGTVGTAGIAMIAIGEGLGIPLPLVAGAILSGA
YFGDKLSPLSDSTVLASSLSKVDVLAHVRAMLYLSIPAYVITAILFTVVGFMYGGKNIDLDKVEFLKSSLQNTFDIHIWM
LIPAVLVIVLLAMKKPSMPVIVIGALLGAIWAVVFQGMDIAHAIATAYNGFSIKTDVEFLNGLLNRGGIVGMLDSLVVII
FGLGFGGLLEKLGVLKVIVSTFEKKLTSAGNVTLSTLIVAFLANIFGCAMYVSLILTPKIMEDSYDRLHLDRRVLSRNSE
VGGTLTSGMVPWSDNGIYMAGILGVSTFSYLPFMWLSFVAIGLAIIYGYTGKFIWYTKNNTVKAEKLG
>F9UMS6 4.1.1.101~~~mleS~~~Malolactic enzyme~~~COG0281
MTKTASEILNNPFLNKGTAFTKEERQALGLTGTLPSKVQTIDEQATQAYAQFKSKPSRLEQRIFLMNLFNENRTLFFHLM
DEHVVEFMPIVYDPVVADSIEQYNELFLDPQNAAFVSVDAPEDIEATLKNAADGRDIRLVVVTDAEGILGMGDWGVNGVD
IAIGKLMVYTAAAGIDPSQVLPVSIDAGTNNQKLLDDPLYLGNRHKCVSGEQYYDVIDKFVAAEQQLFPDSLLHFEDFGR
DNAQVILDKYKDQIATFNDDIQGTGMVVLAGILGALNISKESIKDQKILSFGAGTAGMGIANQILDELMQAGLTEEEAKQ
HFYAVDKQGLLFDDTEGLTPAQKAFTRKRSEFSNADELTNLEAVVKAVHPTVMIGTSTQPGTFTESIIKEMAAHTERPII
FPLSNPTKLAEAKAEDLIKWTDGRALVATGIPADDVEYKGVTYQIGQGNNALMYPGLGFGLIASTAKVLNAETLSAACHA
LGGIVDTSKPGAAVLPPVAKITEFSQKLAEVVAQSVIDQKLNKEPIADAKQAVADMKWVPEYRAISK
>A0A095AMW7 4.1.1.101~~~mleS~~~Malolactic enzyme~~~
MNTTGYDILRNPFLNKGTAFSEAERQQLGLTGTLPSQIQTIEEQAEQAYKQFQAKSPLLEKRIFLMNLFNENVTLFYHLM
DQHVSEFMPIVYDPVVAESIEQYNEIYTNPQNAAFLSVDRPEDVENALKNAAAGRDIKLVVVTDAEGILGMGDWGVNGVD
IAVGKLMVYTAAAGIDPATVLPVSIDAGTNNKELLHNPLYLGNKHERIAGEQYLEFIDKFVTAEQNLFPESLLHWEDFGR
SNAQVILDKYKESIATFNDDIQGTGMIVLAGIFGALNISKQKLVDQKFVTFGAGTAGMGIVNQIFSELKQAGLSDDEARN
HFYLVDKQGLLFDDTEGLTAAQKPFTRSRKEFVNPEQLINLETIVKELHPTVLIGTSTQPGTFTETIVKSMAENTERPII
FPLSNPTKLAEATAEDLIKWTGGKALVATGIPAADVDYKGVTYKIGQGNNALIYPGLGFGLVASTAKLLTQETISAAIHA
LGGLVDTDEPGAAVLPPVSNLTDFSQKIAEITAQSVVNQGLNREKIVDPKQAVQDAKWSAEY
>Q48796 4.1.1.101~~~mleA~~~Malolactic enzyme~~~
MTDPVSILNDPFINKGTAFTEAEREELGLNGLLPAKVQALQEQVDQTYAQFQSKVSNLEKRLFLMEIFNTNHVLFYKLFS
QHVVEFMPIVYDPTIADTIENYSELFVEPQGAAFLDINHPENIQSTLKNAANGRDIKLLVVSDAEGILGIGDWGVQGVDI
AVGKLMVYTVAAGIDPSTVLAVVIDAGTNNEKLLKDPMYLGNKFNRVRGDKYYDFIDKFVNHAESLFPNLYLHWEDFGRS
NASNILNSYKDKIATFNDDIQGTGIVVLAGVLGALKISGQKLTDQTYMSFGAGTAGMGIVKQLHEEMVEQGLSDEEAKKH
FFLVDKQGLLFDDDPDLTPEQKPFAAKRSDFKNANQLTNLQAAVEAVHPTILVGTSTHPNSFTEEIVKDMSGYTERPIIF
PISNPTKLAEAKAEDVLKWSNGKALIGTGVPVDDIEYEGNAYQIGQANNALIYPGLGFGAIAAQSKLLTPEMISAAAHSL
GGIVDTTKVGAAVLPPVSKLADFSRTVAVAVAKKAVEQGLNRQPIDDVEKAVDDLKWDPKY
>Q9EX73 3.1.1.83~~~mlhB~~~Monoterpene epsilon-lactone hydrolase~~~
MSATDTARAKELLASLVSMPDATIDDFRALYEQVCATFELPDDAQVEPVDANGADALWVSAPGVSADTVAVVVHGGGFTM
GSAHGYRELGYRLSKSGNLRALVVDYRLAPESPFPAPVDDVVAAYRYARSLDGVENVFLVGDSAGGGIAMSALITLRDAG
EQLPDAAVVLSPLVDLAGESPSLVDRAHLDPLPAAVLVNGMGGLYLNGLDVRHPVASPMHGDLTGLPATLVLVGTDEGLH
DDSTRLVDKLKAADVEVQLEIGEGLPHIWPIFSFHPDAVAATDRIGEFLRSHVAAPR
>P28224 ~~~mliC~~~Membrane-bound lysozyme inhibitor of C-type lysozyme~~~COG3895
MTMKKLLIIILPVLLSGCSAFNQLVERMQTDTLEYQCDEKPLTVKLNNPRQEVSFVYDNQLLHLKQGISASGARYTDGIY
VFWSKGDEATVYKRDRIVLNNCQLQNPQR
>Q9I574 ~~~mliC~~~Membrane-bound lysozyme inhibitor of C-type lysozyme~~~
MKKALWLLLAAVPVVLVACGGSDDDKQTAQVDYLALPGDAKLDTRSVDYKCENGRKFTVQYLNKGDNSLAVVPVSDNSTL
VFSNVISASGAKYAAGQYIWWTKGEEATLYGDWKGGEPTDGVACKER
>H7C7P1 ~~~mlpA~~~Lipoprotein MlpA~~~
MKIINILFCLFLLLLNSCNSNDNDTLKNNAQQTKSRGKRDLTQKEATPEKPKSKEELLREKLSEDQKTHLDWLKEALGND
GEFDKFLGYDESKIKTALDHIKSELDKCNGNDADQQKTTFKQTVQGALSGGIDGFGSNNAVTTCGNGS
>Q9S0H7 ~~~mlpC~~~Lipoprotein MlpC~~~
MKIINILFCLFLLMLNGCNSNDNDTLKNNAQQTKRRGKRDLTQKETTQEKPKSKEELLREKLSDDQKTHLDWLKPALTGA
GEFDKFLENDDDKIKSALDHIKTQLDSCNGDQAEQQKTTFKTVVTEFFKNGDIDNFATGAVSNCNNGG
>Q9S0E8 ~~~mlpD~~~Lipoprotein MlpD~~~
MKIINILFCLFLLMLNGCNSNDTNNSQTKSRQKRDLTQKEATQEKPKSKEELLREKLNDNQKTHLDWLKEALGNDGEFNK
FLGYDESKIKSALDHIKSELDSCTGDKVENKNTFKQVVQEALKGGIDGFENTASSTCKNS
>Q9S0B7 ~~~mlpD~~~Lipoprotein MlpF~~~
MKIINILFCLFLLLLNSCNSNDNDTLKNNAQQTKSRGKRDLTQKEATPEKPKSKEELLREKLSEDQKTHLDWLKEALGND
GEFDKFLGYDESKIKSALNHIKSELDKCTGDNSEQQKSTFKQTVQGFFSGGNIDNFANNAVSNCNNGGS
>Q9S083 ~~~mlpG~~~Lipoprotein MlpG~~~
MKIINILFCLFLLMLNGCNSNDTNTKQTKSRQKRDLTQKEATQEKPKSKSKEDLLREKLSDDQKTQLDWLKTALTGVGKF
DKFLENDEGKIKSALEHIKTELDKCNGNDEGKNTFKTTVQGFFSGGNIDNFADQATATCN
>Q9S069 ~~~mlpH~~~Lipoprotein MlpH~~~
MKIINILFCLFLLMLNGCNSNDNDTLKNNAQQTKSRRKRDLTQKEVTQEKPKSKEELLREKLNDDQKTQLDWLKTALTDA
GEFDKFLENNEDKIKSALDHIKSELDKCNGKENGDVQKNTFKQVVQGALKGGIDGFGASNATTTCNGS
>Q9S043 ~~~mlpI~~~Lipoprotein MlpI~~~
MKIINILFCLFLLMLNSCNSNDTNTSQTKSRQKRDLTQKEATQEKPKSKEDLLREKLSEDQKTHLDWLKTALTGAGEFDK
FLGYDEDKIKGALNHIKSELDKCTGDNSEQQKSTFKEVVKGALGGGIDSFATSASSTCQAQQ
>Q9RZZ0 ~~~mlpJ~~~Lipoprotein MlpJ~~~
MKIINILFCISLLLLNSCNSNDNDTLKNNAQQTKSRKKRDLSQEELPQQEKITLTSDEEKMFTSLINVFKYTIEKLNNEI
QGCMNGNKSKCNDFFDWLSEDIQKQKELAGAFTKVYNFLKSKAQNETFDTYIKGAIDCKKNTPQDCNKNNKYGDGDNLIE
QYFRGVANDMSNRNSNEEIYQYLKDELLKEDNHYAGLTANWQN
>P33358 ~~~mlrA~~~HTH-type transcriptional regulator MlrA~~~COG0789
MALYTIGEVALLCDINPVTLRAWQRRYGLLKPQRTDGGHRLFNDADIDRIREIKRWIDNGVQVSKVKMLLSNENVDVQNG
WRDQQETLLTYLQSGNLHSLRTWIKERGQDYPAQTLTTHLFIPLRRRLQCQQPTLQALLAILDGVLINYIAICLASARKK
QGKDALVVGWNIQDTTRLWLEGWIASQQGWRIDVLAHSLNQLRPELFEGRTLLVWCGENRTSAQQQQLTSWQEQGHDIFP
LGI
>P0A935 4.2.2.n1~~~mltA~~~Membrane-bound lytic murein transglycosylase A~~~COG2821
MKGRWVKYLLMGTVVAMLAACSSKPTDRGQQYKDGKFTQPFSLVNQPDAVGAPINAGDFAEQINHIRNSSPRLYGNQSNV
YNAVQEWLRAGGDTRNMRQFGIDAWQMEGADNYGNVQFTGYYTPVIQARHTRQGEFQYPIYRMPPKRGRLPSRAEIYAGA
LSDKYILAYSNSLMDNFIMDVQGSGYIDFGDGSPLNFFSYAGKNGHAYRSIGKVLIDRGEVKKEDMSMQAIRHWGETHSE
AEVRELLEQNPSFVFFKPQSFAPVKGASAVPLVGRASVASDRSIIPPGTTLLAEVPLLDNNGKFNGQYELRLMVALDVGG
AIKGQHFDIYQGIGPEAGHRAGWYNHYGRVWVLKTAPGAGNVFSG
>P41052 4.2.2.n1~~~mltB~~~Membrane-bound lytic murein transglycosylase B~~~COG2951
MFKRRYVTLLPLFVLLAACSSKPKPTETDTTTGTPSGGFLLEPQHNVMQMGGDFANNPNAQQFIDKMVNKHGFDRQQLQE
ILSQAKRLDSVLRLMDNQAPTTSVKPPSGPNGAWLRYRKKFITPDNVQNGVVFWNQYEDALNRAWQVYGVPPEIIVGIIG
VETRWGRVMGKTRILDALATLSFNYPRRAEYFSGELETFLLMARDEQDDPLNLKGSFAGAMGYGQFMPSSYKQYAVDFSG
DGHINLWDPVDAIGSVANYFKAHGWVKGDQVAVMANGQAPGLPNGFKTKYSISQLAAAGLTPQQPLGNHQQASLLRLDVG
TGYQYWYGLPNFYTITRYNHSTHYAMAVWQLGQAVALARVQ
>P0C066 4.2.2.n1~~~mltC~~~Membrane-bound lytic murein transglycosylase C~~~COG0741
MKKYLALALIAPLLISCSTTKKGDTYNEAWVKDTNGFDILMGQFAHNIENIWGFKEVVIAGPKDYVKYTDQYQTRSHINF
DDGTITIETIAGTEPAAHLRRAIIKTLLMGDDPSSVDLYSDVDDITISKEPFLYGQVVDNTGQPIRWEGRASNFADYLLK
NRLKSRSNGLRIIYSVTINMVPNHLDKRAHKYLGMVRQASRKYGVDESLILAIMQTESSFNPYAVSRSDALGLMQVVQHT
AGKDVFRSQGKSGTPSRSFLFDPASNIDTGTAYLAMLNNVYLGGIDNPTSRRYAVITAYNGGAGSVLRVFSNDKIQAANI
INTMTPGDVYQTLTTRHPSAESRRYLYKVNTAQKSYRRR
>P0AEZ7 4.2.2.n1~~~mltD~~~Membrane-bound lytic murein transglycosylase D~~~COG0741
MKAKAILLASVLLVGCQSTGNVQQHAQSLSAAGQGEAAKFTSQARWMDDGTSIAPDGDLWAFIGDELKMGIPENDRIREQ
KQKYLRNKSYLHDVTLRAEPYMYWIAGQVKKRNMPMELVLLPIVESAFDPHATSGANAAGIWQIIPSTGRNYGLKQTRNY
DARRDVVASTTAALNMMQRLNKMFDGDWLLTVAAYNSGEGRVMKAIKTNKARGKSTDFWSLPLPQETKQYVPKMLALSDI
LKNSKRYGVRLPTTDESRALARVHLSSPVEMAKVADMAGISVSKLKTFNAGVKGSTLGASGPQYVMVPKKHADQLRESLA
SGEIAAVQSTLVADNTPLNSRVYTVRSGDTLSSIASRLGVSTKDLQQWNKLRGSKLKPGQSLTIGAGSSAQRLANNSDSI
TYRVRKGDSLSSIAKRHGVNIKDVMRWNSDTANLQPGDKLTLFVKNNNMPDS
>P0AGC5 4.2.2.n1~~~mltF~~~Membrane-bound lytic murein transglycosylase F~~~COG4623
MKKLKINYLFIGILALLLAVALWPSIPWFGKADNRIAAIQARGELRVSTIHTPLTYNEINGKPFGLDYELAKQFADYLGV
KLKVTVRQNISQLFDDLDNGNADLLAAGLVYNSERVKNYQPGPTYYSVSQQLVYKVGQYRPRTLGNLTAEQLTVAPGHVV
VNDLQTLKETKFPELSWKVDDKKGSAELMEDVIEGKLDYTIADSVAISLFQRVHPELAVALDITDEQPVTWFSPLDGDNT
LSAALLDFFNEMNEDGTLARIEEKYLGHGDDFDYVDTRTFLRAVDAVLPQLKPLFEKYAEEIDWRLLAAIAYQESHWDAQ
ATSPTGVRGMMMLTKNTAQSLGITDRTDAEQSISGGVRYLQDMMSKVPESVPENERIWFALAAYNMGYAHMLDARALTAK
TKGNPDSWADVKQRLPLLSQKPYYSKLTYGYARGHEAYAYVENIRKYQISLVGYLQEKEKQATEAAMQLAQDYPAVSPTE
LGKEKFPFLSFLSQSSSNYLTHSPSLLFSRKGSEEKQN
>Q9HXN1 4.2.2.n1~~~mltF~~~Membrane-bound lytic murein transglycosylase F~~~
MFALTAYRLRCAAWLLATGIFLLLAGCSEAKAPTALERVQKEGVLRVITRNSPATYFQDRNGETGFEYELAKRFAERLGV
ELKIETADNLDDLYAQLSREGGPALAAAGLTPGREDDASVRYSHTYLDVTPQIIYRNGQQRPTRPEDLVGKRIMVLKGSS
HAEQLAELKKQYPELKYEESDAVEVVDLLRMVDVGDIDLTLVDSNELAMNQVYFPNVRVAFDFGEARGLAWALPGGDDDS
LMNEVNAFLDQAKKEGLLQRLKDRYYGHVDVLGYVGAYTFAQHLQQRLPRYESHFKQSGKQLDTDWRLLAAIGYQESLWQ
PGATSKTGVRGLMMLTNRTAQAMGVSNRLDPKQSIQGGSKYFVQIRSELPESIKEPDRSWFALAAYNIGGAHLEDARKMA
EKEGLNPNKWLDVKKMLPRLAQKQWYAKTRYGYARGGETVHFVQNVRRYYDILTWVTQPQMEGSQIAESGLHLPGVNKTR
PEEDSGDEKL
>Q74SQ6 4.2.2.n1~~~mltF~~~Membrane-bound lytic murein transglycosylase F~~~COG4623
MTRIKLSYFTIGLVALLLALALWPNIPWRNGQEGQLDQIKARGELRVSTISSPLIYSTEKDTPSGFDYELAKRFADYLGV
KLVIIPHHNIDDLFDALDNDDTDLLAAGLIYNRERLNRARTGPAYYSVSQQLVYRLGSPRPKSFSDLKGQVVVASGSAHM
TTLKRLKQTKYPELNWSSSVDKSGKELLEQVAEGKLDYTLGDSATIALLQRIHPQLAVAFDVTDEEPVTWYFKQSDDDSL
YAAMLDFYSEMVEDGSLARLEEKYLGHVGSFDYVDTKTFLSAIDNVLPSYQHLFEKHAGDIDWKLLAVIAYQESHWNPQA
TSPTGVRGLMMLTRATADGLGVKDRVDPEESIRGGAIYLQRLMKKLPETIPEDERIWFALAAYNLGYGHMLDARRLTKNQ
NGNPDSWVDVKMRLPMLSQKRYYPSTTYGYARGHEAYNYVENIRRYQVSLVGYLQEKEKKAAQHAAIEAELGKSNPVVGP
GWSIGD
>P28306 4.2.2.-~~~mltG~~~Endolytic murein transglycosylase~~~COG1559
MKKVLLIILLLLVVLGIAAGVGVWKVRHLADSKLLIKEETIFTLKPGTGRLALGEQLYADKIINRPRVFQWLLRIEPDLS
HFKAGTYRFTPQMTVREMLKLLESGKEAQFPLRLVEGMRLSDYLKQLREAPYIKHTLSDDKYATVAQALELENPEWIEGW
FWPDTWMYTANTTDVALLKRAHKKMVKAVDSAWEGRADGLPYKDKNQLVTMASIIEKETAVASERDKVASVFINRLRIGM
RLQTDPTVIYGMGERYNGKLSRADLETPTAYNTYTITGLPPGAIATPGADSLKAAAHPAKTPYLYFVADGKGGHTFNTNL
ASHNKSVQDYLKVLKEKNAQ
>A0A0H2ZLQ1 4.2.2.-~~~mltG~~~Endolytic murein transglycosylase~~~COG1559
MSEKSREEEKLSFKEQILRDLEKVKGYDEVLKEDEAVVRTPANEPSAEELMADSLSTVEEIMRKAPTVPTHPSQGVPASP
ADEIQRETPGVPSHPSQDVPSSPAEESGSRPGPGPVRPKKLEREYNETPTRVAVSYTTAEKKAEQAGPETPTPATETVDI
IRDTSRRSRREGAKPAKPKKEKKSHVKAFVISFLVFLALLSAGGYFGYQYVLDSLLPIDANSKKYVTVGIPEGSNVQEIG
TTLEKAGLVKHGLIFSFYAKYKNYTDLKAGYYNLQKSMSTEDLLKELQKGGTDEPQEPVLATLTIPEGYTLDQIAQTVGQ
LQGDFKESLTAEAFLAKVQDETFISQAVAKYPTLLESLPVKDSGARYRLEGYLFPATYSIKESTTIESLIDEMLAAMDKN
LSPYYSTIKSKNLTVNELLTIASLVEKEGAKTEDRKLIAGVFYNRLNRDMPLQSNIAILYAQGKLGQNISLAEDVAIDTN
IDSPYNVYKNVGLMPGPVDSPSLDAIESSINQTKSDNLYFVADVTEGKVYYANNQEDHDRNVAEHVNSKLN
>Q8CYJ8 4.2.2.-~~~mltG~~~Endolytic murein transglycosylase~~~COG1559
MSEKSREEEKLSFKEQILRDLEKVKGYDEVLKEDEAVVRTPANEPSAEELMADSLSTVEEIMRKAPTVPTHPSQGVPASP
ADEIQRETPGVPSHPSQDVPSSPAEESGSRPGPGPVRPKKLEREYNETPTRVAVSYTTAEKKAEQAGPETPTPATETVDI
IRDTSRRSRREGAKPAKPKKEKKSHVKAFVISFLVFLALLSAGGYFGYQYVLDSLLPIDANSKKYVTVGIPEGSNVQEIG
TTLEKAGLVKHGLIFSFYAKYKNYTDLKAGYYNLQKSMSTEDLLKELQKGGTDEPQEPVLATLTIPEGYTLDQIAQTVGQ
LQGDFKESLTAEAFLAKVQDETFISQAVAKYPTLLESLPVKDSGARYRLEGYLFPATYSIKESTTIESLIDEMLAAMDKN
LSLYYSTIKSKNLTVNELLTIASLVEKEGAKTEDRKLIAGVFYNRLNRDMPLQSNIAILYAQGKLGQNISLAEDVAIDTN
IDSPYNVYKNVGLMPGPVDSPSLDAIESSINQTKSDNLYFVADVTEGKVYYANNQEDHDRNVAEHVNSKLN
>V6F2Z8 ~~~~~~Magnetosome membrane protein 22~~~COG3597
MAAQTAASEAPAPAAAPADSPTTAGPTPDSVGRDLVAENMIKDYVLAAVAASIVPVPLFDIAAVVAIELRMIQKLSELYG
KPFSESLGRSVIASLAGGVVGYGAGMAVAVSLTKLIPGVGWMLGMVSLPVIAGATTYAIGRVFVKHYENGGDIFNLSADA
MRAYYKQQFEKGKALAAKVKARKEAAAVDDVAAAH
>A5U030 2.1.1.-~~~mmaA1~~~Mycolic acid methyltransferase MmaA1~~~COG2230
MAKLRPYYEESQSAYDISDDFFALFLDPTWVYTCAYFERDDMTLEEAQLAKVDLALDKLNLEPGMTLLDVGCGWGGALVR
AVEKYDVNVIGLTLSRNHYERSKDRLAAIGTQRRAEARLQGWEEFEENVDRIVSFEAFDAFKKERYLTFFERSYDILPDD
GRMLLHSLFTYDRRWLHEQGIALTMSDLRFLKFLRESIFPGGELPSEPDIVDNAQAAGFTIEHVQLLQQHYARTLDAWAA
NLQAARERAIAVQSEEVYNNFMHYLTGCAERFRRGLINVAQFTMTK
>P9WPB1 2.1.1.-~~~mmaA1~~~Mycolic acid methyltransferase MmaA1~~~COG2230
MAKLRPYYEESQSAYDISDDFFALFLDPTWVYTCAYFERDDMTLEEAQLAKVDLALDKLNLEPGMTLLDVGCGWGGALVR
AVEKYDVNVIGLTLSRNHYERSKDRLAAIGTQRRAEARLQGWEEFEENVDRIVSFEAFDAFKKERYLTFFERSYDILPDD
GRMLLHSLFTYDRRWLHEQGIALTMSDLRFLKFLRESIFPGGELPSEPDIVDNAQAAGFTIEHVQLLQQHYARTLDAWAA
NLQAARERAIAVQSEEVYNNFMHYLTGCAERFRRGLINVAQFTMTK
>Q7U1J9 2.1.1.79~~~cmaC~~~Cyclopropane mycolic acid synthase MmaA2~~~
MVNDLTPHFEDVQAHYDLSDDFFRLFLDPTQTYSCAHFEREDMTLEEAQIAKIDLALGKLGLQPGMTLLDIGCGWGATMR
RAIAQYDVNVVGLTLSKNQAAHVQKSFDEMDTPLDRRVLLAGWEQFNEPVDRIVSIGAFEHFGHDRHADFFARAHKILPP
DGVLLLHTITGLTRQQMVDHGLPLTLWLARFLKFIATEIFPGGQPPTIEMVEEQSAKTGFTLTRRQSLQPHYARTLDLWA
EALQEHKSEAIAIQSEEVYERYMKYLTGCAKLFRVGYIDVNQFTLAK
>A5U029 2.1.1.79~~~mmaA2~~~Cyclopropane mycolic acid synthase MmaA2~~~COG2230
MVNDLTPHFEDVQAHYDLSDDFFRLFLDPTQTYSCAHFEREDMTLEEAQIAKIDLALGKLGLQPGMTLLDIGCGWGATMR
RAIAQYDVNVVGLTLSKNQAAHVQKSFDEMDTPRDRRVLLAGWEQFNEPVDRIVSIGAFEHFGHDRHADFFARAHKILPP
DGVLLLHTITGLTRQQMVDHGLPLTLWLARFLKFIATEIFPGGQPPTIEMVEEQSAKTGFTLTRRQSLQPHYARTLDLWA
EALQEHKSEAIAIQSEEVYERYMKYLTGCAKLFRVGYIDVNQFTLAK
>Q79FX6 2.1.1.79~~~mmaA2~~~Cyclopropane mycolic acid synthase MmaA2~~~COG2230
MVNDLTPHFEDVQAHYDLSDDFFRLFLDPTQTYSCAHFEREDMTLEEAQIAKIDLALGKLGLQPGMTLLDIGCGWGATMR
RAIAQYDVNVVGLTLSKNQAAHVQKSFDEMDTPRDRRVLLAGWEQFNEPVDRIVSIGAFEHFGHDRHADFFARAHKILPP
DGVLLLHTITGLTRQQMVDHGLPLTLWLARFLKFIATEIFPGGQPPTIEMVEEQSAKTGFTLTRRQSLQPHYARTLDLWA
EALQEHKSEAIAIQSEEVYERYMKYLTGCAKLFRVGYIDVNQFTLAK
>Q7U1K0 2.1.1.-~~~cmaB~~~Methoxy mycolic acid synthase MmaA3~~~
MSDNSTGTTKSRSNVDDVQAHYDLSDAFFALFQDPTRTYSCAYFERDDMTLHEAQVAKLDLTLGKLGLEPGMTLLDVGCG
WGSVMKRAVERYDVNVVGLTLSKNQHAYCQQVLDKVDTNRSHRVLLSDWANFSEPVDRIVTIEAIEHFGFERYDDFFKFA
YNAMPADGVMLLHSITGLHVKQVIERGIPLTMEMAKFIRFIVTDIFPGGRLPTIETIEEHVTKAGFTITDIQSLQPHFAR
TLDLWAEALQAHKDEAIEIQSAEVYERYMKYLTGCAKAFRMGYIDCNQFTLAK
>A5U028 2.1.1.-~~~mmaA3~~~Methoxy mycolic acid synthase MmaA3~~~COG2230
MSDNSTGTTKSRSNVDDVQAHYDLSDAFFALFQDPTRTYSCAYFERDDMTLHEAQVAKLDLTLGKLGLEPGMTLLDVGCG
WGSVMKRAVERYDVNVVGLTLSKNQHAYCQQVLDKVDTNRSHRVLLSDWANFSEPVDRIVTIEAIEHFGFERYDDFFKFA
YNAMPADGVMLLHSITGLHVKQVIERGIPLTMEMAKFIRFIVTDIFPGGRLPTIETIEEHVTKAGFTITDIQSLQPHFAR
TLDLWAEALQAHKDEAIEIQSAEVYERYMKYLTGCAKAFRMGYIDCNQFTLAK
>P0CH91 2.1.1.-~~~mmaA3~~~Methoxy mycolic acid synthase MmaA3~~~COG2230
MSDNSTGTTKSRSNVDDVQAHYDLSDAFFALFQDPTRTYSCAYFERDDMTLHEAQVAKLDLTLGKLGLEPGMTLLDVGCG
WGSVMKRAVERYDVNVVGLTLSKNQHAYCQQVLDKVDTNRSHRVLLSDWANFSEPVDRIVTIEAIEHFGFERYDDFFKFA
YNAMPADGVMLLHSITGLHVKQVIERGIPLTMEMAKFIRFIVTDIFPGGRLPTIETIEEHVTKAGFTITDIQSLQPHFAR
TLDLWAEALQAHKDEAIEIQSAEVYERYMKYLTGCAKAFRMGYIDCNQFTLAK
>Q7U1K1 2.1.1.-~~~cmaA~~~Hydroxymycolate synthase MmaA4~~~
MTRMAEKPISPTKTRTRFEDIQAHYDVSDDFFALFQDPTRTYSCAYFEPPELTLEEAQYAKVDLNLDKLDLKPGMTLLDI
GCGWGTTMRRAVERLDVNVIGLTLSKNQHARCEQVLASIDTNRSRQVLLQGWEDFAEPVDRIVSIEAFEHFGHENYDDFF
KRCFNIMPADGRMTVQSSVSYHPYEMAARGKKLSFETARFIKFIVTEIFPGGRLPSTEMMVEHGEKAGFTVPEPLSLRPH
YIKTLRIWGDTLQSNKDKAIEVTSEEVYNRYMKYLRGCEHYFTDEMLDCSLVTYLKPGAAA
>A5U027 2.1.1.-~~~mmaA4~~~Hydroxymycolate synthase MmaA4~~~COG2230
MTRMAEKPISPTKTRTRFEDIQAHYDVSDDFFALFQDPTRTYSCAYFEPPELTLEEAQYAKVDLNLDKLDLKPGMTLLDI
GCGWGTTMRRAVERFDVNVIGLTLSKNQHARCEQVLASIDTNRSRQVLLQGWEDFAEPVDRIVSIEAFEHFGHENYDDFF
KRCFNIMPADGRMTVQSSVSYHPYEMAARGKKLSFETARFIKFIVTEIFPGGRLPSTEMMVEHGEKAGFTVPEPLSLRPH
YIKTLRIWGDTLQSNKDKAIEVTSEEVYNRYMKYLRGCEHYFTDEMLDCSLVTYLKPGAAA
>Q79FX8 2.1.1.-~~~mmaA4~~~Hydroxymycolate synthase MmaA4~~~COG2230
MTRMAEKPISPTKTRTRFEDIQAHYDVSDDFFALFQDPTRTYSCAYFEPPELTLEEAQYAKVDLNLDKLDLKPGMTLLDI
GCGWGTTMRRAVERFDVNVIGLTLSKNQHARCEQVLASIDTNRSRQVLLQGWEDFAEPVDRIVSIEAFEHFGHENYDDFF
KRCFNIMPADGRMTVQSSVSYHPYEMAARGKKLSFETARFIKFIVTEIFPGGRLPSTEMMVEHGEKAGFTVPEPLSLRPH
YIKTLRIWGDTLQSNKDKAIEVTSEEVYNRYMKYLRGCEHYFTDEMLDCSLVTYLKPGAAA
>I6WZK7 1.16.3.1~~~mmcO~~~Multicopper oxidase MmcO~~~COG2132
MPELATSGNAFDKRRFSRRGFLGAGIASGFALAACASKPTASGAAGMTAAIDAAEAARPHSGRTVTATLTPQPARIDLGG
PIVSTLTYGNTIPGPLIRATVGDEIVVSVTNRLGDPTSVHWHGIALRNDMDGTEPATANIGPGGDFTYRFSVPDPGTYWA
HPHVGLQGDHGLYLPVVVDDPTEPGHYDAEWIIILDDWTDGIGKSPQQLYGELTDPNKPTMQNTTGMPEGEGVDSNLLGG
DGGDIAYPYYLINGRIPVAATSFKAKPGQRIRIRIINSAADTAFRIALAGHSMTVTHTDGYPVIPTEVDALLIGMAERYD
VMVTAAGGVFPLVALAEGKNALARALLSTGAGSPPDPQFRPDELNWRVGTVEMFTAATTANLGRPEPTHDLPVTLGGTMA
KYDWTINGEPYSTTNPLHVRLGQRPTLMFDNTTMMYHPIHLHGHTFQMIKADGSPGARKDTVIVLPKQKMRAVLVADNPG
VWVMHCHNNYHQVAGMATRLDYIL
>Q9X5T6 2.1.1.316~~~mmcR~~~Mitomycin biosynthesis 6-O-methyltransferase~~~
MTVEQTPENPGTAARAAAEETVNDILQGAWKARAIHVAVELGVPELLQEGPRTATALAEATGAHEQTLRRLLRLLATVGV
FDDLGHDDLFAQNALSAVLLPDPASPVATDARFQAAPWHWRAWEQLTHSVRTGEASFDVANGTSFWQLTHEDPKARELFN
RAMGSVSLTEAGQVAAAYDFSGAATAVDIGGGRGSLMAAVLDAFPGLRGTLLERPPVAEEARELLTGRGLADRCEILPGD
FFETIPDGADVYLIKHVLHDWDDDDVVRILRRIATAMKPDSRLLVIDNLIDERPAASTLFVDLLLLVLVGGAERSESEFA
ALLEKSGLRVERSLPCGAGPVRIVEIRRA
>O54028 7.2.4.3~~~mmdA~~~Methylmalonyl-CoA decarboxylase subunit alpha~~~
MSVAAKKIQDLQKKKEKIALGGGIKRIEKQHASGKMTARERLAYLFDEGTFVEMDAFVQHRCTNFGMDKQDLPSESVVTG
YGMVDGRVVYAFSQDFTVTGGALGEMHAKKICKAMDMAGKVGAPVVGLNDSGGARIQEAVDALSGYGDIFYRNSIYSGVV
PQISAILGPCAGGAVYSPALTDFIFMVDQTSQMFITGPQVIKTVTGEEVTAEQLGGAMTHNSTSGCAQFISQDDKACIDD
IRRLISFLPSNNMEKAPEFGCEDDLNIQFPELDALMPDNPNKAYNMFDVITKIVDNGDYMEYQPHYSKNIITCFARVNGK
SVGIIANQPQVMAGCLDIDSGDKCAKFIRTCDAFNIPLLTIVDVPGFLPGVTQEYGGIIRHGAKILYAYSEATVPKVTLI
TRKAYGGAYVAMCSKSLGADVVLAWPTAEIAVMGPAGAVNIIFRKDIKDAKDPAATTKQKLDEYTTEFANPYQAARRGLV
DDVIEPKTSRQRIVDAFNMLEGKREKLPAKKHGNIPL
>Q57079 7.2.4.3~~~mmdA~~~Methylmalonyl-CoA decarboxylase subunit alpha~~~
MATVQEKIELLHEKLAKVKAGGGEKRVEKQHAQGKMTARERLAKLFDDNSFVELDQFVKHRCVNFGQEKKELPGEGVVTG
YGTIDGRLVYAFAQDFTVEGGSLGEMHAAKIVKVQRLAMKMGAPIVGINDSGGARIQEAVDALAGYGKIFFENTNASGVI
PQISVIMGPCAGGAVYSPALTDFIYMVKNTSQMFITGPAVIKSVTGEEVTAEDLGGAMAHNSVSGVAHFAAENEDDCIAQ
IRYLLGFLPSNNMEDAPLVDTGDDPTREDESLNSLLPDNSNMPYDMKDVIAATVDNGEYYEVQPFYATNIITCFARFDGQ
SVGIIANQPKVMAGCLDINASDKSSRFIRFCDAFNIPIVNFVDVPGFLPGTNQEWGGIIRHGAKMLYAYSEATVPKITVI
TRKAYGGSYLAMCSQDLGADQVYAWPTSEIAVMGPAGAANIIFKKDEDKDAKTAKYVEEFATPYKAAERGFVDVVIEPKQ
TRPAVINALAMLASKRENRAPKKHGNIPL
>O54031 7.2.4.3~~~mmdB~~~Methylmalonyl-CoA decarboxylase subunit beta~~~
MLQAILDFYHSTGFYGLNMGSIIMMLVACVFLYLAIAKEFEPLLLVPISFGILLTNLPFAGMMAEPLLEVHEKLSASGAH
LYTAHTAEPGGLLYYLFQGDHLGIFPPLIFLGVGAMTDFGPLISNPKSLLLGAAAQFGIFVTFFGAIASGLFTAQEAASI
GIIGGADGPTAIFLSSKLAPHLMGPIAVAAYSYMALVPIIQPPIMTALTSETERKIKMSQLRLVSKREKIIFPIVVTILV
SLIVPPAATLVGMLMLGNLFRECGVVGRLEDTAKNALINIITIFLGVTVGATATAEAFLKVETLAILGLGIVAFGIGTGS
GVLLAKFMNKLSKEPINPLLGSAGVSAVPMAARVSQVVGQKADPTNFLLMHAMGPNVAGVIGSAVSAGVLLSLFG
>Q57286 7.2.4.3~~~mmdB~~~Methylmalonyl-CoA decarboxylase subunit beta~~~
MEAFAVAIQSVINDSGFLAFTTGNAIMILVGLILLYLAFAREFEPLLLGPIAFGCLLANIPRNGFEEGVMALISAGISQE
IFPPLIFLGVGAMTDFGPLIANPKTLLLGAAAQIGVFAALGGAMMLGFTAQEAAAIGIIGGADGPTSIYLATKLAPHLLG
AIAVAAYSYMSLVPLIQPPVMKLFTTQKEREIVMEQLREVTRFEKIVFPIVATIFISLLLPSITSLLGMLMLGNLFRESG
VTDRLSDTSQNALINTVTIFLATGTGLTMSAEHFLSLETIKIILLGLFAFICGTAGGVLFGKLMSLVDGGKTNPLIGSAG
VSAVPMAARVSQVVGAKANPANFLLMHAMGPNVAGVIGTAVAAGTMLAMLSNH
>O54030 7.2.4.3~~~mmdC~~~Methylmalonyl-CoA decarboxylase subunit gamma~~~
MKNFKVTVNGTEYDVAVEEMGGAAVASAPAARPAAAPAPAAPKPAAAPAPAPAPKTTAAGAGAGANTVTAPMPGTILNVG
CHAGDKVSKGDTLVVLEAMKMENEIMAPHDGVVSEVRVQQGASVNAGDILVVLS
>Q57111 7.2.4.3~~~mmdC~~~Methylmalonyl-CoA decarboxylase subunit gamma~~~
MKKFNVTVNGTAYDVEVNEVKAAAPAAAPKAAPAAAPAPKAAPAPAPAPAAAAAPVPAGAETVKAPMPGKILSVAVSAGQ
AVKKGETLLILEAMKMQNEIAAPHDAVVSEVRVSANQTVSTGDDMVVLG
>O54029 7.2.4.3~~~mmdD~~~Methylmalonyl-CoA decarboxylase subunit delta~~~
MNITELMELFSNPETIKTLETGDLMTGIGVTVVLGMGITVVALIFLMYIIGGMAAIMAEKPKEVKETAAAAPKPEAAAAP
APAANNDEELVAVIAAAVAAQLGTSASNLIIRNVTRSLDTTPAWGRAGIVDQMATRL
>Q56724 7.2.4.3~~~mmdD~~~Methylmalonyl-CoA decarboxylase subunit delta~~~
MEGQAVTTNPWLIMAINMTVVFAVLIALGILMEIVHLIDPTKKKKEAPAATAPVATPTATPVAPANASAQNEDEVVAAIV
GAIVAMGYSSEQIASIRPTATSAKWRLEGRLSGRG
>Q57490 7.2.4.3~~~mmdE~~~Methylmalonyl-CoA decarboxylase subunit epsilon~~~
MSNATTTNGKAPSQDVVAVIVGALAAMGYSADQIAHIRPIVSYNWKMEGRLRGNR
>P45858 2.3.3.-~~~mmgD~~~Citrate/2-methylcitrate synthase~~~COG0372
MEEKQHYSPGLDGVIAAETHISYLDTQSSQILIRGYDLIELSETKSYLELVHLLLEGRLPEESEMETLERKINSASSLPA
DHLRLLELLPEDTHPMDGLRTGLSALAGYDRQIDDRSPSANKERAYQLLGKMPALTAASYRIINKKEPILPLQTLSYSAN
FLYMMTGKLPSSLEEQIFDRSLVLYSEHEMPNSTFAARVIASTHSDLYGALTGAVASLKGNLHGGANEAVMYLLLEAKTT
SDFEQLLQTKLKRKEKIMGFGHRVYMKKMDPRALMMKEALQQLCDKAGDHRLYEMCEAGERLMEKEKGLYPNLDYYAAPV
YWMLGIPIPLYTPIFFSARTSGLCAHVIEQHANNRLFRPRVSYMGPRYQTKS
>P45859 4.2.1.-~~~mmgE~~~Citrate/2-methylcitrate dehydratase~~~COG2079
MPKTDRVIEEITDYVLEKEITSAEAYTTAGHVLLDTLGCGILALRYPECTKLLGPIVPGTTVPNGSKVPGTSYVLDPVRA
AFNIGCMIRWLDYNDTWLAAEWGHPSDNLGGILAAADYVSRVRLSEGKEPLTVRDVLEMMIKAHEIQGVLALENSLNRVG
LDHVLFVKVATTAVAAKLLGGGREEIKNALSNAWIDNAALRTYRHSPNTGSRKSWAAGDATSRGVHLALMSLKGEMGYPT
ALSAPGWGFQDVLFNKKEIKLARPLDAYVMENVLFKVSYPAEFHAQTAAESAVILHPQVKNRIDEIDRVVIRTHESAIRI
IDKKGPLHNPADRDHCLQYITAIGLLFGDITAQHYEAETANDPRIDKLRDKMEVTENKTYTEDYLKPDKRSISNAVQVHF
KDGTSTEMVECEFPLGHRFRREEAVPKLLEKFSDNLKTHFPDKQHKHIYERCTSYETLQTMRVNEFVDMFCM
>P54528 4.1.3.-~~~mmgF~~~2-methylisocitrate lyase~~~COG2513
MSWIVNKQSSQEELAGRFRKLMSAPDILQIPGAHDGMAALLAKEAGFSAIYLSGAAYTASRGLPDLGIITSAEIAERAKD
LVRAADLPLLVDIDTGFGGVLNAARTAREMLEARVAAVQMEDQQLPKKCGHLNGKQLVPIKEMAQKIKAIKQAAPSLIVV
ARTDARAQEGLDAAIKRSEAYIEAGADAIFPEALQAENEFRQFAERIPVPLLANMTEFGKTPYYRADEFEDMGFHMVIYP
VTSLRAAAKACERMFGLMKEHGSQKEGLHDMQTRKELYDTISYYDYEALDKTIAKTVLPDE
>P18797 ~~~mmoB~~~Methane monooxygenase regulatory protein B~~~
MSVNSNAYDAGIMGLKGKDFADQFFADENQVVHESDTVVLVLKKSDEINTFIEEILLTDYKKNVNPTVNVEDRAGYWWIK
ANGKIEVDCDEISELLGRQFNVYDFLVDVSSTIGRAYTLGNKFTITSELMGLDRKLEDYHA
>P27356 ~~~mmoB~~~Methane monooxygenase regulatory protein B~~~
MSSAHNAYNAGIMQKTGKAFADEFFAEENQVVHESNAVVLVLMKSDEIDAIIEDIVLKGGKAKNPSIVVEDKAGFWWIKA
DGAIEIDAAEAGELLGKPFSVYDLLINVSSTVGRAYTLGTKFTITSELMGLDRALTDI
>P22868 1.14.13.25~~~mmoC~~~Methane monooxygenase component C~~~COG0543
MQRVHTITAVTEDGESLRFECRSDEDVITAALRQNIFLMSSCREGGCATCKALCSEGDYDLKGCSVQALPPEEEEEGLVL
LCRTYPKTDLEIELPYTHCRISFGEVGSFEAEVVGLNWVSSNTVQFLLQKRPDECGNRGVKFEPGQFMDLTIPGTDVSRS
YSPANLPNPEGRLEFLIRVLPEGRFSDYLRNDARVGQVLSVKGPLGVFGLKERGMAPRYFVAGGTGLAPVVSMVRQMQEW
TAPNETRIYFGVNTEPELFYIDELKSLERSMRNLTVKACVWHPSGDWEGEQGSPIDALREDLESSDANPDIYLCGPPGMI
DAACELVRSRGIPGEQVFFEKFLPSGAA
>Q53563 1.14.13.25~~~mmoC~~~Methane monooxygenase component C~~~
MYQIVIETEDGETCRRMRPSEDWISRAEAERNLLASCRAGCATCKADCTDGDYELIDVKVQAVPPDEEEDGKVLLCRTFP
RSDLHLLVPYTYDRISFEAIQTNWLAEILACDRVSSNVVRLVLQRSRPMAARISLNFVPGQFVDIEIPGTHTRRSYSMAS
VAEDGQLEFIIRLLPDGAFSKFLQTEAKVGMRVDLRGPAGSFFLHDHGGRSRVFVAGGTGLSPVLSMIRQLGKASDPSPA
TLLFGVTNREELFYVDELKTLAQSMPTLGVRIAVVNDDGGNGVDKGTVIDLLRAELEIDLLLGHARRRRRRETARSCRED
HRDRCPAWRSDFLEKFLASG
>P22867 ~~~mmoD~~~Methane monooxygenase component D~~~
MVESAFQPFSGDADEWFEEPRPQAGFFPSADWHLLKRDETYAAYAKDLDFMWRWVIVREERIVQEGCSISLESSIRAVTH
VLNYFGMTEQRAPAEDRTGGVQH
>Q9A710 3.4.24.-~~~mmpA~~~Metalloprotease MmpA~~~COG0750
MIGFLIMLVSLLFVLSVVVTVHELGHYWAARACGVAIERFSIGFGAPLISWRDKRGVEWCVASIPLGGYVRFAGDENAAS
VPDQNDLDAMRNEIRRREGDDAVNRYFHFKPVWQRAFIAVAGPMANFILAILVFAVILVSFGAQKTSTTVGEVVAGTPAA
AAGFKPGDVILKADNRQIRSFQDIQGYVALRANMPIDFAVERDGRTVHLTATPRLVERQNEISGRVKVGELGLRSAPGGR
FERSSLLSAIPDATVEVWDMIKTIAFYLGRLLMGQLPADQISGIIGIGHTAGAVTNGVVEQAPNGKALAIGLIYSQFWLI
ASLSVSIGFMNLLPIPVLDGGHLVMYAYEAVAKRPLRAEFQAAGFRAGLALILGFMLFAAWNDLNRYDVFKFIGGLFT
>P9WJV9 ~~~mmpL1~~~Probable transport protein MmpL1~~~COG0642
MRSQRLAGHLSAAARTIHALSLPIILFWVALTIVVNVVAPQLQSVARTHSVALGPHDAPSLIAMKRIGKDFQQFDSDTTA
MVLLEGQEKLGDEAHRFYDVLVTKLSQDTTHVQHIENFWGDPLTAAGSQSADGKAAYVQLNLTGDQGGSQANESVAAVQR
IVDSVPPPPGIKAYVTGPGPLGADRVVYGDRSLHTITGISIAVIAIMLFIAYRSLSAALIMLLTVGLELLAVRGIISTFA
VNDLMGLSTFTVNVLVALTIAASTDYIIFLVGRYQEARATGQNREAAYYTMFGGTAHVVLASGLTVAGAMYCLGFTRLPY
FNTLASPCAIGLVTVMLASLTLAPAIIAVASRFGLFDPKRATTKRRWRRIGTVVVRWPGPVLAATLLIALIGLLALPKYQ
TNYNERYYIPSAAPSNIGYLASDRHFPQARMEPEVLMVEADHDLRNPTDMLILDRIAKTVFHTPGIARVQSITRPLGAPI
DHSSIPFQLGMQSTMTIENLQNLKDRVADLSTLTDQLQRMIDITQRTQELTRQLTDATHDMNAHTRQMRDNANELRDRIA
DFDDFWRPLRSFTYWERHCFDIPICWSMRSLLNSMDNVDKLTEDLANLTDDTERMDTTQRQLLAQLDPTIATMQTVKDLA
QTLTSAFSGLVTQMEDMTRNATVMGRTFDAANNDDSFYLPPEAFQNPDFQRGLKLFLSPDGTCARFVITHRGDPASAEGI
SHIDPIMQAADEAVKGTPLQAASIYLAGTSSTYKDIHEGTLYDVMIAVVASLCLIFIIMLGITRSVVASAVIVGTVALSL
GSAFGLSVLIWQHILHMPLHWLVLPMAIIVMLAVGSDYNLLLIARFQEEIGAGLKTGMIRAMAGTGRVVTIAGLVFAFTM
GSMVASDLRVVGQIGTTIMIGLLFDTLVVRSYMTPALATLLGRWFWWPRRVDRLARQPQVLGPRRTTALSAERAALLQ
>P9WJV7 ~~~mmpL2~~~Probable transport protein MmpL2~~~COG1033
MSERHAALTSLPPILPRLIRRFAVVIVLLWLGFTAFVNLAVPQLEVVGKAHSVSMSPSDAASIQAIKRVGQVFGEFDSDN
AVTIVLEGDQPLGGDAHRFYSDLMRKLSADTRHVAHIQDFWGDPLTAAGSQSADDRAAYVVVYLVGNNETEAYDSVHAVR
HMVDTTPPPHGVKAYVTGPAALNADQAEAGDKSIAKVTAITSMVIAAMLLVIYRSVITAVLVLIMVGIDLGAIRGFIALL
ADHNIFSLSTFATNLLVLMAIAASTDYAIFMLGRYHESRYAGEDRETAFYTMFHGTAHVILGSGLTIAGAMYCLSFARLP
YFETLGAPIAIGMLVAVLAALTLGPAVLTVGSFFKLFDPKRRMNTRRWRRVGTAIVRWPGPVLAATCLVASIGLLALPSY
RTTYDLRKFMPASMPSNVGDAAAGRRFSRARLNPEVLLIETDHDMRNPVDMLVLDKVAKNIYHSPGIEQVKAITRPLGTT
IKHTSIPFIISMQGVNSSEQMEFMKDRIDDILVQVAAMNTSIETMHRMYALMGEVIDNTVDMDHLTHDMSDITATLRDHL
ADFEDFFRPIRSYFYWEKHCFDVPLCWSIRSIFDMFDSVDQLSEKLEYLVKDMDILITLLPQMRAQMPPMISAMTTMRDM
MLIWHGTLGAFYKQQERNNKDPGAMGRVFDAAQIDDSFYLPQSAFENPDFKRGLKMFLSPDGKAARFVIALEGDPATPEG
ISRVEPIKREAREAIKGTPLQGAAIYLGGTAATFKDIREGARYDLLIAGVAAISLILIIMMIITRSVVAAVVIVGTVVLS
MGASFGLSVLVWQDILGIELYWMVLAMSVILLLAVGSDYNLLLISRLKEEIGAGLNTGIIRAMAGTGGVVTAAGMVFAVT
MSLFVFSDLRIIGQIGTTIGLGLLFDTLVVRSFMTPSIAALLGRWFWWPLRVRPRPASQMLRPFAPRRLVRALLLPSGQH
PSATGAHE
>A0QP27 ~~~mmpL3~~~Trehalose monomycolate exporter MmpL3~~~COG2409
MFAWWGRTVYQFRYIVIGVMVALCLGGGVYGISLGNHVTQSGFYDEGSQSVAASLIGDEVYGRDRTSHVVAILTPPDDKK
VTDKAWQKKVTEELDQVVKDHEDQIVGWVGWLKAPDTTDPTVSAMKTQDLRHTFISIPLQGDDDDEILKNYQVVEPELQQ
VNGGDIRLAGLNPLASELTGTIGEDQKRAEVAAIPLVAVVLFFVFGTVIAAALPAIIGGLAIAGALGIMRLVAEFTPVHF
FAQPVVTLIGLGIAIDYGLFIVSRFREEIAEGYDTEAAVRRTVMTSGRTVVFSAVIIVASSVPLLLFPQGFLKSITYAII
ASVMLAAILSITVLAAALAILGPRVDALGVTTLLKIPFLANWQFSRRIIDWFAEKTQKTKTREEVERGFWGRLVNVVMKR
PIAFAAPILVVMVLLIIPLGQLSLGGISEKYLPPDNAVRQSQEQFDKLFPGFRTEPLTLVMKREDGEPITDAQIADMRAK
ALTVSGFTDPDNDPEKMWKERPANDSGSKDPSVRVIQNGLENRNDAAKKIDELRALQPPHGIEVFVGGTPALEQDSIHSL
FDKLPLMALILIVTTTVLMFLAFGSVVLPIKAALMSALTLGSTMGILTWMFVDGHGSGLMNYTPQPLMAPMIGLIIAVIW
GLSTDYEVFLVSRMVEARERGMSTAEAIRIGTATTGRLITGAALILAVVAGAFVFSDLVMMKYLAFGLLIALLLDATIIR
MFLVPAVMKLLGDDCWWAPRWMKRVQEKLGLGETELPDERKRPTVRESETDQRALVGVGAPPPPPRPHDPTHPAPEPVRP
MPPMRSNAPSAAGTARISTPPQPPQPPQAPAQQAGDEPATTRFAMARNAVRNAVNSAVHGGAGSAAAPTERAPRPGGPAQ
PPAPPQREEREIESWLGALRGPAPAKNVPQPPAQPQRPSTDTTRAMPPQGRPPAGPADRGNENAPTTAFSAQRPPNGGAP
ADATTAIPTPPQREQEPSTEKLNTREDAPEDPETKRRGGGMSAQDLLRREGRL
>P9WJV5 ~~~mmpL3~~~Trehalose monomycolate exporter MmpL3~~~COG2409
MFAWWGRTVYRYRFIVIGVMVALCLGGGVFGLSLGKHVTQSGFYDDGSQSVQASVLGDQVYGRDRSGHIVAIFQAPAGKT
VDDPAWSKKVVDELNRFQQDHPDQVLGWAGYLRASQATGMATADKKYTFVSIPLKGDDDDTILNNYKAIAPDLQRLDGGT
VKLAGLQPVAEALTGTIATDQRRMEVLALPLVAVVLFFVFGGVIAAGLPVMVGGLCIAGALGIMRFLAIFGPVHYFAQPV
VSLIGLGIAIDYGLFIVSRFREEIAEGYDTETAVRRTVITAGRTVTFSAVLIVASAIGLLLFPQGFLKSLTYATIASVML
SAILSITVLPACLGILGKHVDALGVRTLFRVPFLANWKISAAYLNWLADRLQRTKTREEVEAGFWGKLVNRVMKRPVLFA
APIVIIMILLIIPVGKLSLGGISEKYLPPTNSVRQAQEEFDKLFPGYRTNPLTLVIQTSNHQPVTDAQIADIRSKAMAIG
GFIEPDNDPANMWQERAYAVGASKDPSVRVLQNGLINPADASKKLTELRAITPPKGITVLVGGTPALELDSIHGLFAKMP
LMVVILLTTTIVLMFLAFGSVVLPIKATLMSALTLGSTMGILTWIFVDGHFSKWLNFTPTPLTAPVIGLIIALVFGLSTD
YEVFLVSRMVEARERGMSTQEAIRIGTAATGRIITAAALIVAVVAGAFVFSDLVMMKYLAFGLMAALLLDATVVRMFLVP
SVMKLLGDDCWWAPRWARRLQTRIGLGEIHLPDERKRPVSNGRPARPPVTAGLVAARAAGDPRPPHDPTHPLAESPRPAR
SSPASSPELTPALEATAAPAAPSGASTTRMQIGSSTEPPTTRLAAAGRSVQSPASTPPPTPTPPSAPSAGQTRAMPLAAN
RSTDAAGDPAEPTAALPIIRSDGDDSEAATEQLNARGTSDKTRQRRRGGGALSAQDLLRREGRL
>P9WJV3 ~~~mmpL4~~~Siderophore exporter MmpL4~~~COG1033
MSTKFANDSNTNARPEKPFIARMIHAFAVPIILGWLAVCVVVTVFVPSLEAVGQERSVSLSPKDAPSFEAMGRIGMVFKE
GDSDSFAMVIIEGNQPLGDAAHKYYDGLVAQLRADKKHVQSVQDLWGDPLTAAGVQSNDGKAAYVQLSLAGNQGTPLANE
SVEAVRSIVESTPAPPGIKAYVTGPSALAADMHHSGDRSMARITMVTVAVIFIMLLLVYRSIITVVLLLITVGVELTAAR
GVVAVLGHSGAIGLTTFAVSLLTSLAIAAGTDYGIFIIGRYQEARQAGEDKEAAYYTMYRGTAHVILGSGLTIAGATFCL
SFARMPYFQTLGIPCAVGMLVAVAVALTLGPAVLHVGSRFGLFDPKRLLKVRGWRRVGTVVVRWPLPVLVATCAIALVGL
LALPGYKTSYNDRDYLPDFIPANQGYAAADRHFSQARMKPEILMIESDHDMRNPADFLVLDKLAKGIFRVPGISRVQAIT
RPEGTTMDHTSIPFQISMQNAGQLQTIKYQRDRANDMLKQADEMATTIAVLTRMHSLMAEMASTTHRMVGDTEEMKEITE
ELRDHVADFDDFWRPIRSYFYWEKHCYGIPICWSFRSIFDALDGIDKLSEQIGVLLGDLREMDRLMPQMVAQIPPQIEAM
ENMRTMILTMHSTMTGIFDQMLEMSDNATAMGKAFDAAKNDDSFYLPPEVFKNKDFQRAMKSFLSSDGHAARFIILHRGD
PQSPEGIKSIDAIRTAAEESLKGTPLEDAKIYLAGTAAVFHDISEGAQWDLLIAAISSLCLIFIIMLIITRAFIAAAVIV
GTVALSLGASFGLSVLLWQHILAIHLHWLVLAMSVIVLLAVGSDYNLLLVSRFKQEIGAGLKTGIIRSMGGTGKVVTNAG
LVFAVTMASMAVSDLRVIGQVGTTIGLGLLFDTLIVRSFMTPSIAALLGRWFWWPLRVRSRPARTPTVPSETQPAGRPLA
MSSDRLG
>P9WJV1 ~~~mmpL5~~~Siderophore exporter MmpL5~~~COG1033
MIVQRTAAPTGSVPPDRHAARPFIPRMIRTFAVPIILGWLVTIAVLNVTVPQLETVGQIQAVSMSPDAAPSMISMKHIGK
VFEEGDSDSAAMIVLEGQRPLGDAAHAFYDQMIGRLQADTTHVQSLQDFWGDPLTATGAQSSDGKAAYVQVKLAGNQGES
LANESVEAVKTIVERLAPPPGVKVYVTGSAALVADQQQAGDRSLQVIEAVTFTVIIVMLLLVYRSIITSAIMLTMVVLGL
LATRGGVAFLGFHRIIGLSTFATNLLVVLAIAAATDYAIFLIGRYQEARGLGQDRESAYYTMFGGTAHVVLGSGLTIAGA
TFCLSFTRLPYFQTLGVPLAIGMVIVVAAALTLGPAIIAVTSRFGKLLEPKRMARVRGWRKVGAAIVRWPGPILVGAVAL
ALVGLLTLPGYRTNYNDRNYLPADLPANEGYAAAERHFSQARMNPEVLMVESDHDMRNSADFLVINKIAKAIFAVEGISR
VQAITRPDGKPIEHTSIPFLISMQGTSQKLTEKYNQDLTARMLEQVNDIQSNIDQMERMHSLTQQMADVTHEMVIQMTGM
VVDVEELRNHIADFDDFFRPIRSYFYWEKHCYDIPVCWSLRSVFDTLDGIDVMTEDINNLLPLMQRLDTLMPQLTAMMPE
MIQTMKSMKAQMLSMHSTQEGLQDQMAAMQEDSAAMGEAFDASRNDDSFYLPPEVFDNPDFQRGLEQFLSPDGHAVRFII
SHEGDPMSQAGIARIAKIKTAAKEAIKGTPLEGSAIYLGGTAAMFKDLSDGNTYDLMIAGISALCLIFIIMLITTRSVVA
AAVIVGTVVLSLGASFGLSVLIWQHILGIELHWLVLAMAVIILLAVGADYNLLLVARLKEEIHAGINTGIIRAMGGSGSV
VTAAGLVFAFTMMSFAVSELTVMAQVGTTIGMGLLFDTLIVRSFMTPSIAALLGKWFWWPQVVRQRPIPQPWPSPASART
FALV
>P9WJU7 ~~~mmpL7~~~Phthiocerol dimycocerosate exporter MmpL7~~~COG2409
MPSPAGRLHRIRYIRLKKSSPDCRATITSGSADGQRRSPRLTNLLVVAAWVAAAVIANLLLTFTQAEPHDTSPALLPQDA
KTAAATSRIAQAFPGTGSNAIAYLVVEGGSTLEPQDQPYYDAAVGALRADTRHVGSVLDWWSDPVTAPLGTSPDGRSATA
MVWLRGEAGTTQAAESLDAVRSVLRQLPPSEGLRASIVVPAITNDMPMQITAWQSATIVTVAAVIAVLLLLRARLSVRAA
AIVLLTADLSLAVAWPLAAVVRGHDWGTDSVFSWTLAAVLTIGTITAATMLAARLGSDAGHSAAPTYRDSLPAFALPGAC
VAIFTGPLLLARTPALHGVGTAGLGVFVALAASLTVLPALIALAGASRQLPAPTTGAGWTGRLSLPVSSASALGTAAVLA
ICMLPIIGMRWGVAENPTRQGGAQVLPGNALPDVVVIKSARDLRDPAALIAINQVSHRLVEVPGVRKVESAAWPAGVPWT
DASLSSAAGRLADQLGQQAGSFVPAVTAIKSMKSIIEQMSGAVDQLDSTVNVTLAGARQAQQYLDPMLAAARNLKNKTTE
LSEYLETIHTWIVGFTNCPDDVLCTAMRKVIEPYDIVVTGMNELSTGADRISAISTQTMSALSSAPRMVAQMRSALAQVR
SFVPKLETTIQDAMPQIAQASAMLKNLSADFADTGEGGFHLSRKDLADPSYRHVRESMFSSDGTATRLFLYSDGQLDLAA
AARAQQLEIAAGKAMKYGSLVDSQVTVGGAAQIAAAVRDALIHDAVLLAVILLTVVALASMWRGAVHGAAVGVGVLASYL
AALGVSIALWQHLLDRELNALVPLVSFAVLASCGVPYLVAGIKAGRIADEATGARSKGAVSGRGAVAPLAALGGVFGAGL
VLVSGGSFSVLSQIGTVVVLGLGVLITVQRAWLPTTPGRR
>P9WJU5 ~~~mmpL8~~~Sulfolipid-1 exporter MmpL8~~~COG1511
MCDVLMQPVRTPRPSTNLRSKPLRPTGDGGVFPRLGRLIVRRPWVVIAFWVALAGLLAPTVPSLDAISQRHPVAILPSDA
PVLVSTRQMTAAFREAGLQSVAVVVLSDAKGLGAADERSYKELVDALRRDTRDVVMLQDFVTTPPLRELMTSKDNQAWIL
PVGLPGDLGSTQSKQAYARVADIVEHQVAGSTLTANLTGPAATVADLNLTGQRDRSRIEFAITILLLVILLIIYGNPITM
VLPLITIGMSVVVAQRLVAIAGLAGLGIANQSIIFMSGMMVGAGTDYAVFLISRYHDYLRQGADSDQAVKKALTSIGKVI
AASAATVAITFLGMVFTQLGILKTVGPMLGISVAVVFFAAVTLLPALMVLTGRRGWIAPRRDLTRRFWRSSGVHIVRRPK
THLLASALVLVILAGCAGLARYNYDDRKTLPASVESSIGYAALDKHFPSNLIIPEYLFIQSSTDLRTPKALADLEQMVQR
VSQVPGVAMVRGITRPAGRSLEQARTSWQAGEVGSKLDEGSKQIAVHTGDIDKLAGGANLMASKLGDVRAQVNRAISTVG
GLIDALAYLQDLLGGNRVLGELEGAEKLIGSMRALGDTIDADASFVANNTEWASPVLGALDSSPMCTADPACASARTELQ
RLVTARDDGTLAKISELARQLQATRAVQTLAATVSGLRGALATVIRAMGSLGMSSPGGVRSKINLVNKGVNDLADGSRQL
AEGVQLLVDQVKKMGFGLGEASAFLLAMKDTATTPAMAGFYIPPELLSYATGESVKAETMPSEYRDLLGGLNVDQLKKVA
AAFISPDGHSIRYLIQTDLNPFSTAAMDQIDAITAAARGAQPNTALADAKVSVVGLPVVLKDTRDYSDHDLRLIIAMTVC
IVLLILIVLLRAIVAPLYLIGSVIVSYLAALGIGVIVFQFLLGQEMHWSIPGLTFVILVAVGADYNMLLISRLREEAVLG
VRSGVIRTVASTGGVITAAGLIMAASMYGLVFASLGSVVQGAFVLGTGLLLDTFLVRTVTVPAIAVLVGQANWWLPSSWR
PATWWPLGRRRGRAQRTKRKPLLPKEEEEQSPPDDDDLIGLWLHDGLRL
>P9WJU3 ~~~mmpL9~~~Probable transport protein MmpL9~~~COG2409
MVPGEVHMSDTPSGPHPIIPRTIRLAAIPILLCWLGFTVFVSVAVPPLEAIGETRAVAVAPDDAQSMRAMRRAGKVFNEF
DSNSIAMVVLESDQPLGEKAHRYYDHLVDTLVLDQSHIQHIQDFWRDPLTAAGAVSADGKAAYVQLYLAGNMGEALANES
VEAVRKIVANSTPPEGIRTYVTGPAALFADQIAAGDRSMKLITGLTFAVITVLLLLVYRSIATTLLILPMVFIGLGATRG
TIAFLGYHGMVGLSTFVVNILTALAIAAGTDYAIFLVGRYQEARHIGQNREASFYTMYRGTANVILGSGLTIAGATYCLS
FARLTLFHTMGPPLAIGMLVSVAAALTLAPAIIAIAGRFGLLDPKRRLKTRGWRRVGTAVVRWPGPILATSVALALVGLL
ALPGYRPGYNDRYYLRAGTPVNRGYAAADRHFGPARMNPEMLLVESDQDMRNPAGMLVIDKIAKEVLHVSGVERVQAITR
PQGVPLEHASIPFQISMMGATQTMSLPYMRERMADMLTMSDEMLVAINSMEQMLDLVQQLNDVTHEMAATTREIKATTSE
LRDHLADIDDFVRPLRSYFYWEHHCFDIPLCSATRSLFDTLDGVDTLTDQLRALTDDMNKMEALTPQFLALLPPMITTMK
TMRTMMLTMRSTISGVQDQMADMQDHATAMGQAFDTAKSGDSFYLPPEAFDNAEFQQGMKLFLSPNGKAVRFVISHESDP
ASTEGIDRIEAIRAATKDAIKATPLQGAKIYIGGTAATYQDIRDGTKYDILIVGIAAVCLVFIVMLMITQSLIASLVIVG
TVLLSLGTAFGLSVLIWQHFVGLQVHWTIVAMSVIVLLAVGSDYNLLLVSRFKEEVGAGLKTGIIRAMAGTGAVVTSAGL
VFAFTMASMAVSELRVIGQVGTTIGLGLLFDTLVVRSFMTPSIAALLGRWFWWPNMIHSRPTVPEAHTRQGARRIQPHLH
RG
>P9WJU1 ~~~~~~Acyltrehalose exporter MmpL10~~~COG2409
MVGCWVALALVLPMAVPSLAEMAQRHPVAVLPADAPSSVAVRQMAEAFHESGSENILVVLLTDEKGLGAADENVYHTLVD
RLRNDAKDVVMLQDFLTTPPLREVLGSKDGKAWILPIGLAGDLGTPKSYHAYTDVERIVKRTVAGTTLTANVTGPAATVA
DLTDAGARDRASIELAIAVMLLVILMVIYRNPVTMLLPLVTIGASLMTAQALVAGVSLVGGLAVSNQAIVLLSAMIAGAG
TDYAVFLISRYHEYVRLGEHPERAVQRAMMSVGKVIAASAATVGITFLGMRFAKLGVFSTVGPALAIGIAVSFLAAVTLL
PAILVLASPRGWVAPRGERMATFWRRAGTRIVRRPKAYLGASLIGLVALASCASLAHFNYDDRKQLPPSDPSSVGYAAME
HHFSVNQTIPEYLIIHSAHDLRTPRGLADLEQLAQRVSQIPGVAMVRGVTRPNGETLEQARATYQAGQVGNRLGGASRMI
DERTGDLNRLASGANLLADNLGDVRGQVSRAVAGVRSLVDALAYIQNQFGGNKTFNEIDNAARLVSNIHALGDALQVNFD
GIANSFDWLDSVVAALDTSPVCDSNPMCGNARVQFHKLQTARDNGTLDKVVGLARQLQSTRSPQTVSAVVNDLGRSLNSV
VRSLKSLGLDNPDAARARLISMQNGANDLASAGRQVADGVQMLVDQTKNMGIGLNQASAFLMAMGNDASQPSMAGFNVPP
QVLKSEEFKKVAQAFISPDGHTVRYFIQTDLNPFSTAAMDQVNTIIDTAKGAQPNTSLADASISMSGYPVMLRDIRDYYE
RDMRLIVAVTVVVVILILMALLRAIVAPLYLVGSVVISYMSAIGLGVVVFQVFLGQELHWSVPGLAFVVLVAVGADYNML
LASRLRDESALGVRSSVIRTVRCTGGVITAAGLIFAASMSGLLFSSIGTVVQGGFIIGVGILIDTFVVRTITVPAMATLL
GRASWWPGHPWQRCAPEEGQMSARMSARTKTVFQAVADGSKR
>P9WJT9 ~~~~~~Heme uptake protein MmpL11~~~COG2409
MMRLSRNLRRCRWLVFTGWLLALVPAVYLAMTQSGNLTGGGFEVAGSQSLLVHDQLDAHYPDRGAPALALVAAPRPDASY
QDIDNAVALLRQIASELPGVTEAPNPTQRPPQPDRPYVVSLRLDARNAGTSDVAKKLRDRIGVKGDQSGQTANGKVRLYV
IGQGALSAAAAANTKHDIANAERWNLPIILMVLVAVFGSLAAAAIPLALAVCTVVITMGLVFVLSMHTTMSVFVTSTVSM
FGIALAVDYSLFILMRYREELRCGRRPPDAVDAAMATSGLAVVLSGMTVIASLTGIYLINTPALRSMATGAILAVAVAML
TSATLTPAVLATFARAAAKRSALVHWSRRPASTQSWFWSRWVGWVMRRPWITALAASTVLLVMAAPATLMVLGNSLLRQF
DSSHEIRTGAAAAAQALGPGALGPVQVLVRFDAGGASAPEHSQTIAAIRHRIAQAPNVVSVAPPRFADDNGSALLSAVLS
VDPEDLGARDTITWMRTQLPRVAGAAQVDVGGPTALIKDFDDRVSATQPLVLVFVAVIAFLMLLISIRSVFLAFKGVLMT
LLSVAAAYGSLVMVFQWGWARGLGFPALHSIDSTVPPLVLAMTFGLSMDYEIFLLTRIRERFLQTGQTRDAVAYGVRTSA
RTITSAALIMIAVFCGFAFAGMPLVAEIGVACAVAIAVDATVVRLVLVPALMAMFDRWNWWLPRWLAHILPSVDFDRPLP
KVDLGDVVVIPDDFAAAIPPSADVRMVLKSAAKLKRLAPDAICVTDPLAFTGCGCDGKALDQVQLAYRNGIARAISWGQR
PVHPVTVWRKRLAVALDALQTTTWECGGVQTHRAGPGYRRRSPVETTNVALPTGDRLQIPTGAETLRFKGYLIMSRNSSH
DYADFADLVDTMAPETAAAVLAGMDRYYSCQAPGRQWMATQLVGRLADPQPSDLGDQSPGADAQAKWEEVRRRCLSVAVA
MLEEAR
>I6Y8F7 ~~~mmpR5~~~HTH-type transcriptional regulator MmpR5~~~COG1510
MSVNDGVDQMGAEPDIMEFVEQMGGYFESRSLTRLAGRLLGWLLVCDPERQSSEELATALAASSGGISTNARMLIQFGFI
ERLAVAGDRRTYFRLRPNAFAAGERERIRAMAELQDLADVGLRALGDAPPQRSRRLREMRDLLAYMENVVSDALGRYSQR
TGEDD
>P9WJT3 ~~~mmpS2~~~Probable transport accessory protein MmpS2~~~
MISVSGAVKRMWLLLAIVVVAVVGGLGIYRLHSIFGVHEQPTVMVKPDFDVPLFNPKRVTYEVFGPAKTAKIAYLDPDAR
VHRLDSVSLPWSVTVETTLPAVSVNLMAQSNADVISCRIIVNGAVKDERSETSPRALTSCQVSSG
>P9WJT1 ~~~mmpS3~~~Probable transport accessory protein MmpS3~~~
MSGPNPPGREPDEPESEPVSDTGDERASGNHLPPVAGGGDKLPSDQTGETDAYSRAYSAPESEHVTGGPYVPADLRLYDY
DDYEESSDLDDELAAPRWPWVVGVAAIIAAVALVVSVSLLVTRPHTSKLATGDTTSSAPPVQDEITTTKPAPPPPPPAPP
PTTEIPTATETQTVTVTPPPPPPPATTTAPPPATTTTAAAPPPTTTTPTGPRQVTYSVTGTKAPGDIISVTYVDAAGRRR
TQHNVYIPWSMTVTPISQSDVGSVEASSLFRVSKLNCSITTSDGTVLSSNSNDGPQTSC
>P9WJS9 ~~~mmpS4~~~Siderophore export accessory protein MmpS4~~~
MLMRTWIPLVILVVVIVGGFTVHRIRGFFGSENRPSYSDTNLENSKPFNPKHLTYEIFGPPGTVADISYFDVNSEPQRVD
GAVLPWSLHITTNDAAVMGNIVAQGNSDSIGCRITVDGKVRAERVSNEVNAYTYCLVKSA
>P9WJS7 ~~~mmpS5~~~Siderophore export accessory protein MmpS5~~~
MIGTLKRAWIPLLILVVVAIAGFTVQRIRTFFGSEGILVTPKVFADDPEPFDPKVVEYEVSGSGSYVNINYLDLDAKPQR
IDGAALPWSLTLKTTAPSAAPNILAQGDGTSITCRITVDGEVKDERTATGVDALTYCFVKSA
>P9WGF1 ~~~mmr~~~Multidrug resistance protein Mmr~~~COG2076
MIYLYLLCAIFAEVVATSLLKSTEGFTRLWPTVGCLVGYGIAFALLALSISHGMQTDVAYALWSAIGTAAIVLVAVLFLG
SPISVMKVVGVGLIVVGVVTLNLAGAH
>Q2W8J4 ~~~mms5~~~Magnetosome protein Mms5~~~
MLSAKGVSLGLGLGLGAWGPVLLGVVGVAGAIALYGYYKNRNAEPAAAEAV
>Q6NE76 ~~~mms6~~~Magnetite biomineralization protein Mms6~~~
MGEMEREGATAKVGAGKVGAGKVGAAKAGAAPAAAQGAGTKVVAAQGAGTKVVAAQGAGAKAAAVGVGKVGAGAKAVGGT
IWSGKGLALGLGMGLGAWGPLILGVVGAGAVYAYMKSRDIEAAQSDEEVELRDALS
>Q2W8R5 ~~~mms6~~~Magnetite biomineralization protein Mms6~~~
MGEMEREGAAAKAGAAKTGAAKTGTVAKTGIAAKTGVATAVAAPAAPANVAAAQGAGTKVALGAGKAAAGAKVVGGTIWT
GKGLGLGLGLGLGAWGPIILGVVGAGAVYAYMKSRDIESAQSDEEVELRDALA
>P28810 1.2.1.27~~~mmsA~~~Methylmalonate-semialdehyde dehydrogenase [acylating]~~~
MSVPVRHLIAGAFVEGLGAQRIPVSNPLDNSTLAEIACASAEQVEQAVASARETFASWKETPVSERARVMLRYQALLKEH
HDELAKIVSSELGKTFEDAKGDVWRGIEVVEHACNVPSLLMGETVENVARNIDTYSITQPLGVCVGITPFNFPAMIPLWM
FPLAIACGNAFILKPSEQVPLTSVRLAELFLEAGAPKGVLQVVHGGKEQVDQLLKHPQVKAVSFVGSVAVGQYVYHTGTA
HNKRVQSFAGAKNHMVIMPDADKAQVISNLVGASVGAAGQRCMAISVAVLVGAAREWIPEIRDALAKVRPGPWDDSGASY
GPVINPQAKARIERLIGQGVEEGAQLLLDGRGYKVEGYPDGNWVGPTLFAGVRPDMAIYREEVFGPVLCLAEVDSLEQAI
RLINESPYGNGTSIFTSSGAAARTFQHHIEVGQVGINIPIPVPLPFFSFTGWKGSFYGDLHAYGKQGVRFYTETKTVTAR
WFDSDSVAGTNFSIQMR
>P9WNY5 1.1.1.31~~~mmsB~~~Probable 3-hydroxyisobutyrate dehydrogenase~~~COG2084
MTTIAFLGLGNMGAPMSANLVGAGHVVRGFDPAPTAASGAAAHGVAVFRSAPEAVAEADVVITMLPTGEVVRRCYTDVLA
AARPATLFIDSSTISVTDAREVHALAESHGMLQLDAPVSGGVKGAAAATLAFMVGGDESTLRRARPVLEPMAGKIIHCGA
AGAGQAAKVCNNMVLAVQQIAIAEAFVLAEKLGLSAQSLFDVITGATGNCWAVHTNCPVPGPVPTSPANNDFKPGFSTAL
MNKDLGLAMDAVAATGATAPLGSHAADIYAKFAADHADLDFSAVIHTLRARADA
>P28811 1.1.1.31~~~mmsB~~~3-hydroxyisobutyrate dehydrogenase~~~
MTDIAFLGLGNMGGPMAANLLKAGHRVNVFDLQPKAVLGLVEQGAQGADSALQCCEGAEVVISMLPAGQHVESLYLGDDG
LLARVAGKPLLIDCSTIAPETARKVAEAAAAKGLTLLDAPVSGGVGGARAGTLSFIVGGPAEGFARARPVLENMGRNIFH
AGDHGAGQVAKICNNMLLGILMAGTAEALALGVKNGLDPAVLSEVMKQSSGGNWALNLYNPWPGVMPQAPASNGYAGGFQ
VRLMNKDLGLALANAQAVQASTPLGALARNLFSLHAQADAEHEGLDFSSIQKLYRGKD
>Q2W8R4 ~~~mmsF~~~Magnetosome protein MmsF~~~
MTEAILRSTLGARTTVMAALSYLSVLCFVPLLVDRDDEFVYFHAKQGLVIWMWGVLALFALHVPVLGKWIFGFSSMGVLV
FSLLGLVSVVFQRAWKLPLISWVAHRI
>Q47690 2.1.1.10~~~mmuM~~~Homocysteine S-methyltransferase~~~COG2040
MSQNNPLRALLDKQDILLLDGAMATELEARGCNLADSLWSAKVLVENPELIREVHLDYYRAGAQCAITASYQATPAGFAA
RGLDEAQSKALIGKSVELARKAREAYLAENPQAGTLLVAGSVGPYGAYLADGSEYRGDYHCSVEAFQAFHRPRVEALLDA
GADLLACETLPNFSEIEALAELLTAYPRARAWFSFTLRDSEHLSDGTPLRDVVALLAGYPQVVALGINCIALENTTAALQ
HLHGLTVLPLVVYPNSGEHYDAVSKTWHHHGEHCAQLADYLPQWQAAGARLIGGCCRTTPADIAALKARS
>P39131 5.1.3.14~~~mnaA~~~UDP-N-acetylglucosamine 2-epimerase~~~COG0381
MKKLKVMTVFGTRPEAIKMAPLVLELKKYPEIDSYVTVTAQHRQMLDQVLDAFHIKPDFDLNIMKERQTLAEITSNALVR
LDELFKDIKPDIVLVHGDTTTTFAGSLAAFYHQIAVGHVEAGLRTGNKYSPFPEELNRQMTGAIADLHFAPTGQAKDNLL
KENKKADSIFVTGNTAIDALNTTVRDGYSHPVLDQVGEDKMILLTAHRRENLGEPMENMFKAIRRIVGEFEDVQVVYPVH
LNPVVREAAHKHFGDSDRVHLIEPLEVIDFHNFAAKSHFILTDSGGVQEEAPSLGKPVLVLRDTTERPEGVEAGTLKLAG
TDEENIYQLAKQLLTDPDEYKKMSQASNPYGDGEASRRIVEELLFHYGYRKEQPDSFTGK
>P76112 2.3.1.-~~~mnaT~~~L-amino acid N-acyltransferase MnaT~~~COG1247
MSIRFARKADCAAIAEIYNHAVLYTAAIWNDQTVDADNRIAWFEARTLAGYPVLVSEENGVVTGYASFGDWRSFDGFRHT
VEHSVYVHPDHQGKGLGRKLLSRLIDEARDCGKHVMVAGIESQNQASLHLHQSLGFVVTAQMPQVGTKFGRWLDLTFMQL
QLDERTEPDAIG
>A0A0H3AJF5 ~~~mneA~~~Putative manganese exporter~~~COG2119
MFVYGSIINPCPTEHVMSVLAISITTVALAEIGDKTQLLSLLLASRYRKPIPIIAAIFLATLANHALAAWLGVVVADYLS
PDILKWVLVVSFLTMAGWILIPDKLDGEESISTRGPFVASFIAFFMAEIGDKTQIATSILGAQYADALSWVIVGTTLGML
LANVPVVLIGKLSADKMPLGLIRKVTAGLFLLMALATAFF
>C0SP78 ~~~mneP~~~Manganese efflux system protein MneP~~~COG0053
MASEREQISRKVALIALIANLILMAGKVFFGLVGDSEAVFADGIHSAADVVASIAVLAVIGISNKPPDQDHPFGHGKAEV
ISEAIVGIILVIVSVYILIEAILSFVKGPSVPQYSALFAALISYVAKEILYRYSIKQGKKWNSKAIIAIAYDHKGDIVAS
LAAFIGVLLAIIGNSRGWSYLLYADAIASAIVAYLIFKISMELIRPSVDVLMEKSVDPELIEEYKAVIFQCDQVKRIDRI
RAREHGHYKLLDVRLSLDHDLTIKQGHDIAREIRNEIKRQFSDVEEVLIHVNPYFEE
>P46348 ~~~mneS~~~Manganese efflux system protein MneS~~~COG0053
MERYDELKKGESGALVSIAAYLVLSAIKLIIGYLFHSEALTADGLNNTTDIIASVAVLIGLRISQKPPDEDHPYGHFRAE
TIASLIASFIMMVVGLQVLFSAGESIFSAKQETPDMIAAWTAAGGAVLMLIVYRYNKRLAKKVKSQALLAAAADNKSDAF
VSIGTFIGIVAAQFHLAWIDTVTAFVIGLLICKTAWDIFKESSHSLTDGFDIKDISAYKQTIEKISGVSRLKDIKARYLG
STVHVDVVVEVSADLNITESHDIANEIERRMKEEHAIDYSHVHMEPLEQK
>P54745 ~~~mngA~~~PTS system 2-O-alpha-mannosyl-D-glycerate-specific EIIABC component~~~COG1299
MVLFYRAHWRDYKNDQVRIMMNLTTLTHRDALCLNARFTSREEAIHALTQRLAALGKISSTEQFLEEVYRRESLGPTALG
EGLAVPHGKTAAVKEAAFAVATLSEPLQWEGVDGPEAVDLVVLLAIPPNEAGTTHMQLLTALTTRLADDEIRARIQSATT
PDELLSALDDKGGTQPSASFSNAPTIVCVTACPAGIAHTYMAAEYLEKAGRKLGVNVYVEKQGANGIEGRLTADQLNSAT
ACIFAAEVAIKESERFNGIPALSVPVAEPIRHAEALIQQALTLKRSDETRTVQQDTQPVKSVKTELKQALLSGISFAVPL
IVAGGTVLAVAVLLSQIFGLQDLFNEENSWLWMYRKLGGGLLGILMVPVLAAYTAYSLADKPALAPGFAAGLAANMIGSG
FLGAVVGGLIAGYLMRWVKNHLRLSSKFNGFLTFYLYPVLGTLGAGSLMLFVVGEPVAWINNSLTAWLNGLSGSNALLLG
AILGFMCSFDLGGPVNKAAYAFCLGAMANGVYGPYAIFASVKMVSAFTVTASTMLAPRLFKEFEIETGKSTWLLGLAGIT
EGAIPMAIEDPLRVIGSFVLGSMVTGAIVGAMNIGLSTPGAGIFSLFLLHDNGAGGVMAAIGWFGAALVGAAISTAILLM
WRRHAVKHGNYLTDGVMP
>P54746 3.2.1.-~~~mngB~~~Mannosylglycerate hydrolase~~~COG0383
MKAVSRVHITPHMHWDREWYFTTEESRILLVNNMEEILCRLEQDNEYKYYVLDGQTAILEDYFAVKPENKDRVKKQVEAG
KLIIGPWYTQTDTTIVSAESIVRNLMYGMRDCLAFGEPMKIGYLPDSFGMSGQLPHIYNGFGITRTMFWRGCSERHGTDK
TEFLWQSSDGSEVTAQVLPLGYAIGKYLPADENGLRKRLDSYFDVLEKASVTKEILLPNGHDQMPLQQNIFEVMDKLREI
YPQRKFVMSRFEEVFEKIEAQRDNLATLKGEFIDGKYMRVHRTIGSTRMDIKIAHARIENKIVNLLEPLATLAWTLGFEY
HHGLLEKMWKEILKNHAHDSIGCCCSDKVHREIVARFELAEDMADNLIRFYMRKIADNMPQSDADKLVLFNLMPWPREEV
INTTVRLRASQFNLRDDRGQPVPYFIRHAREIDPGLIDRQIVHYGNYDPFMEFDIQINQIVPSMGYRTLYIEANQPGNVI
AAKSDAEGILENAFWQIALNEDGSLQLVDKDSGVRYDRVLQIEESSDDGDEYDYSPAKEEWVITAANAKPQCDIIHEAWQ
SRAVIRYDMAVPLNLSERSARQSTGRVGVVLVVTLSHNSRRIDVDINLDNQADDHRLRVLVPTPFNTDSVLADTQFGSLT
RPVNDSAMNNWQQEGWKEAPVPVWNMLNYVALQEGRNGMAVFSEGLREFEVIGEEKKTFAITLLRGVGLLGKEDLLLRPG
RPSGIKMPVPDSQLRGLLSCRLSLLSYTGTPTAAGVAQQARAWLTPVQCYNKIPWDVMKLNKAGFNVPESYSLLKMPPVG
CLISALKKAEDRQEVILRLFNPAESATCDATVAFSREVISCSETMMDEHITTEENQGSNLSGPFLPGQSRTFSYRLA
>P13669 ~~~mngR~~~Mannosyl-D-glycerate transport/metabolism system repressor MngR~~~COG2188
MGHKPLYRQIADRIREQIARGELKPGDALPTESALQTEFGVSRVTVRQALRQLVEQQILESIQGSGTYVKEERVNYDIFQ
LTSFDEKLSDRHVDTHSEVLIFEVIPADDFLQQQLQITPQDRVWHVKRVRYRKQKPMALEETWMPLALFPDLTWQVMENS
KYHFIEEVKKMVIDRSEQEIIPLMPTEEMSRLLNISQTKPILEKVSRGYLVDGRVFEYSRNAFNTDDYKFTLIAQRKSSR
>P60675 ~~~mnhA1~~~Na(+)/H(+) antiporter subunit A1~~~
MSLLHIAVILPLIFALIIPILYRFFKRIHLGWFVLPVPIVIFIYMLTLIKTTMSGNTVMKTLNWMPHFGMNFDLYLDGLG
LLFSLLISGIGSLVVLYSIGYLSKSEQLGNFYCYLLLFMGAMLGVVLSDNVIILYLFWELTSFSSFLLISFWRERQASIY
GAQKSLIITVFGGLSLLGGIILLAIPTQSFSIQYMIQHASEIQNSPFFIFAMILIMIGAFTKSAQFPFYIWLPDAMEAPT
PVSAYLHSATMVKAGLYLIARMTPIFAASQGWVWTVTLVGLITLFWASLNATKQQDLKGILAFSTVSQLGMIMAMLGIGA
ISYHYQGDDSKIYAAAFTAAIFHLINHATFKGALFMITGAVDHSTGTRDVKKLGGLLTIMPISFTITVITALSMAGVPPF
NGFLSKESFLETTFTASQANLFSVDTLGYLFPIIGIVGSVFTFVYSIKFIMHIFFGQYKPEQLPKKAHEVSILMLLSPAI
LATLVIVLGLFPGILTNSIIEPATSSINHTVIDDVEFHMFHGLTPAFLSTLVIYILGILLIVTFSYWVKLLQRQPGKLTF
NYWYNRSANVIPNYSEKMTNSYVTDYSRNNLVIIFGALILLTFVTIFSVPFNINFKDVSPIRIFEVCIVILLLSAAFLIL
FAKSRLFSIIMLSAVGYAVSVLFIFFKAPDLALTQFVVESISTALFLLCFYHLPNLNRYNEKRSFQLTNALIAGGVGLSV
IIIGLIAYGNRHFESISKFYQEHVYDLAHGKNMVNVILVDFRGMDTLFESSVLGIAGLAVYTMIKLRKKRQTQGNEVKNH
E
>Q9ZNG6 ~~~mnhA1~~~Na(+)/H(+) antiporter subunit A1~~~
MSLLHIAVILPLIFALIIPILYRFFKRIHLGWFVLSVPIVIFIYMLTLIKTTMSGNTVMKTLNWMPHFGMNFDLYLDGLG
LLFSLLISGIGSLVVLYSIGYLSKSEQLGNFYCYLLLFMGAMLGVVLSDNVIILYLFWELTSFSSFLLISFWRERQASIY
GAQKSLIITVFGGLSLLGGIILLAIPTQSFSIQYMIQHASEIQNSPFFIFAMILIMIGAFTKSAQFPFYIWLPDAMEAPT
PVSAYLHSATMVKAGLYLIARMTPIFAASQGWVWTVTLVGLITLFWASLNATKQQDLKGILAFSTVSQLGMIMAMLGIGA
ISYHYQGDDSKIYAAAFTAAIFHLINHATFKGALFMITGAVDHSTGTRDVKKLGGLLTIMPISFTITVITALSMAGVPPF
NGFLSKESFLETTFTASQANLFSVDTLGYLFPIIGIVGSVFTFVYSIKFIMHIFFGQYKPEQLPKKAHEVSILMLLSPAI
LATLVIVFGLFPGILTNSIIEPATSSINHTVIDDVEFHMFHGLTPAFLSTLVIYILGILLIVTFSYWVKLLQRQPGKLTF
NYWYNRSANVIPNYSEKMTNSYVTDYSRNNLVIIFGALILLTFVTIFSVPFNINFKDVSPIRIFEVCIVILLLSAAFLIL
FAKSRLFSIIMLSAVGYAVSVLFIFFKAPDLALTQFVVESISTALFLLCFYHLPNLNRYNEKRSFQLTNALIAGGVGLSV
IIIGLIAYGNRHFESISKFYQEHVYDLAHGKNMVNVILVDFRGMDTLFESSVLGIAGLAVYTMIKLRKKRQTQGNEVKNH
E
>P60678 ~~~mnhB1~~~Na(+)/H(+) antiporter subunit B1~~~
MNRQQNDLILQFAAVIIFFMVMVFGFSLFLAGHYTPGGGFVGGLLFASSLVIITIAFDIETMRKIFPLDFKILIGIGLVF
CIATPIASWFLGKNFFTHVTFDIPLFILEPVHMTTAVFFDFGVLCAVVGTVMTIIISIGENE
>P60682 ~~~mnhC1~~~Na(+)/H(+) antiporter subunit C1~~~
MEIIMIFVSGILTAISVYLVLSKSLIRIVMGTTLLTHAANLFLITMGGLKHGTVPIYEANVKSYVDPIPQALILTAIVIA
FATTAFFLVLAFRTYKELGTDNVESMKGVPEDD
>P60686 ~~~mnhD1~~~Na(+)/H(+) antiporter subunit D1~~~
MIESNMLVLTLVIPVITAILLVFIGKRPIIKRYVALGGTLLTLVAAIINLANVVKHGPIRVELGSWKAPYSIVFVLDIFS
ALLIITSIIITAIVILYSYQTIGIERERYYYYFSVLFMLIGIIGAFTTGDIFNLFVFFEVFLMSSYFLLVIGSTKIQLQE
TIKYVLVNVVSSSFFVMGVAILYSVVGTLNLADISNKLANLSAHDSGLVNIVFILFIFVFATKAGVFPMFVWLPSAYYAP
PIPIIAFFGALLTKVGVYAIARTLSLFFSDNVSFSHYVILFLALLTIIFGCVGAVAYANIKKIILYNVMIAVGVILVGVA
MMTESGMIGAIYYTLHDMLVKLALFLLIGIMIKITGTADLRQFGGLIKRYPVLGWSFFIAALSLAGIPPLSGFYGKFFIV
QSTFERGFYLSGVIVLLSSLVVLYSVIRIFLQGFFGQPKGYDLNNKVDVKYLTTIAIVAVVITVLYGLSADYLYPMVKAG
AETFYNPSTYVKAVLGGK
>P60689 ~~~mnhE1~~~Na(+)/H(+) antiporter subunit E1~~~
MAVQLVLNFIIAVFWLFVTNSYTTNNFVLGFIFGLVLVYLLHRVLPGRFYVITLYRIIKLVIIFLIELIKANFDVLKIII
KPSIKNEPGFFVYHTDLKKDWQIVLLSNLITLTPGTVVLGVSDDRTKIYIHAIDFSTKEQEVESIKTSLEKIVREVGEI
>P60690 ~~~mnhE1~~~Na(+)/H(+) antiporter subunit E1~~~
MAVQLVLNFIIAVFWLFVTNSYTTNNFVLGFIFGLVLVYLLHRVLPGRFYVITLYRIIKLVIIFLIELIKANFDVLKIII
KPSIKNEPGFFVYHTDLKKDWQIVLLSNLITLTPGTVVLGVSDDRTKIYIHAIDFSTKEQEVESIKTSLEKIVREVGEI
>P60694 ~~~mnhF1~~~Na(+)/H(+) antiporter subunit F1~~~
MNHNVIIVIALIIVVISMLAMLIRVVLGPSLADRVVALDAIGLQLMAVIALFSILLNIKYMIVVIMMIGILAFLGTAVFS
KFMDKGKVIEHDQNHTD
>P60698 ~~~mnhG1~~~Na(+)/H(+) antiporter subunit G1~~~
MIKIILISLALIFVIIGALISALAAIGLLRLEDVYSRAHAAGKASTLGAMSLLFGTFLYFIATQGFVNMQLIVAIIFVLI
TGPLSSHMIMKAAYNIKTPYTKKTKVDEISEDLKDTKL
>P25745 2.8.1.13~~~mnmA~~~tRNA-specific 2-thiouridylase MnmA~~~COG0482
MSETAKKVIVGMSGGVDSSVSAWLLQQQGYQVEGLFMKNWEEDDGEEYCTAAADLADAQAVCDKLGIELHTVNFAAEYWD
NVFELFLAEYKAGRTPNPDILCNKEIKFKAFLEFAAEDLGADYIATGHYVRRADVDGKSRLLRGLDSNKDQSYFLYTLSH
EQIAQSLFPVGELEKPQVRKIAEDLGLVTAKKKDSTGICFIGERKFREFLGRYLPAQPGKIITVDGDEIGEHQGLMYHTL
GQRKGLGIGGTKEGTEEPWYVVDKDVENNILVVAQGHEHPRLMSVGLIAQQLHWVDREPFTGTMRCTVKTRYRQTDIPCT
VKALDDDRIEVIFDEPVAAVTPGQSAVFYNGEVCLGGGIIEQRLPLPV
>P9WJS5 2.8.1.13~~~mnmA~~~tRNA-specific 2-thiouridylase MnmA~~~COG0482
MKVLAAMSGGVDSSVAAARMVDAGHEVVGVHMALSTAPGTLRTGSRGCCSKEDAADARRVADVLGIPFYVWDFAEKFKED
VINDFVSSYARGETPNPCVRCNQQIKFAALSARAVALGFDTVATGHYARLSGGRLRRAVDRDKDQSYVLAVLTAQQLRHA
AFPIGDTPKRQIRAEAARRGLAVANKPDSHDICFIPSGNTKAFLGERIGVRRGVVVDADGVVLASHDGVHGFTIGQRRGL
GIAGPGPNGRPRYVTAIDADTATVHVGDVTDLDVQTLTGRAPVFTAGAAPSGPVDCVVQVRAHGETVSAVAELIGDALFV
QLHAPLRGVARGQTLVLYRPDPAGDEVLGSATIAGASGLSTGGNPGA
>Q99TM8 2.8.1.13~~~mnmA~~~tRNA-specific 2-thiouridylase MnmA~~~
MSNKDIRVVVGMSGGVDSSVTAHVLKEQGYDVIGIFMKNWDDTDENGVCTATEDYNDVIEVCNQIGIPYYAVNFEKEYWD
KVFTYFLDEYKKGRTPNPDVMCNKEIKFKAFLDHAMNLGADYVATGHYARIHRHEDGHVEMLRGVDNNKDQTYFLNQLSQ
QQLSKVMFPIGDIEKSEVRRIAEEQGLVTAKKKDSTGICFIGEKNFKTFLSQYLPAQPGDMITLDGKKMGKHSGLMYYTI
GQRHGLGIGGDGDPWFVVGKNLKDNVLYVEQGFHHDALYSDYLIASDYSFVNPEDNDLDQGFECTAKFRYRQKDTKVFVK
RENDHALRVTFAEPVRAITPGQAVVFYQGDVCLGGATIDDVFKNEGQLNYVV
>Q97T38 2.8.1.13~~~mnmA~~~tRNA-specific 2-thiouridylase MnmA~~~COG0482
MSDNSKTRVVVGMSGGVDSSVTALLLKEQGYDVIGIFMKNWDDTDENGVCTATEDYKDVVAVADQIGIPYYSVNFEKEYW
DRVFEYFLAEYRAGRTPNPDVMCNKEIKFKAFLDYAITLGADYVATGHYARVARDEDGTVHMLRGVDNGKDQTYFLSQLS
QEQLQKTMFPLGHLEKPEVRRLAEEAGLSTAKKKDSTGICFIGEKNFKNFLSNYLPAQPGRMMTVDGRDMGEHAGLMYYT
IGQRGGLGIGGQHGGDNAPWFVVGKDLSKNILYVGQGFYHDSLMSTSLEASQVHFTREMPEEFTLECTAKFRYRQPDSKV
TVHVKGEKTEVIFAEPQRAITPGQAVVFYDGEECLGGGLIDNAYRDGQVCQYI
>Q8XCQ7 ~~~mnmC~~~tRNA 5-methylaminomethyl-2-thiouridine biosynthesis bifunctional protein MnmC~~~COG0665
MKHYSIQPANLEFNAEGTPVSRDFDDVYFSNDNGLEETRYVFLGGNQLEARFPEHPHPLFVVAESGFGTGLNFLTLWQAF
DQFREAHPQAQLQRLHFISFEKFPLTRADLALAHQHWPELAPWAEQLQAQWPMPLPGCHRLLLDEGRVTLDLWFGDINEL
ISQLDDSLNQKVDAWFLDGFAPAKNPDMWTQNLFNAMARLARPGGTLATFTSAGFVRRGLQEAGFTMQKRKGFGRKREML
CGVMEQTLPLPCSTPWFNRTGSSKREVAIIGGGIASALLSLALLRRGWQVTLYCADEAPALGASGNRQGALYPLLSKHDE
ALNRFFSNGFTFARRLYDSLPVKFDHDWCGVTQLGWDEKSQHKIAQMLSMDLPEELAVAVEANAVEQITGVTTNCSGITY
PQGGWLCPAELTRNVLELAQQQGLQIYYQYQLQDLSRKDDCWLLTFAGDQQATHSVVVLANGHQISRFSQTSSLPVYSVA
GQVSHIPTTPELAKLKQVLCYDGYLTPQNPANQHHCIGASYHRGSEETAYSEDNQQQNRQRLIDCFPHAQWAKTVDVSKK
EARCGVRCATRDHLPMVGNVPDYEATLVEYASLAEQKDKAVSAPVFDDLFMFAALGSRGLCSAPLCAEILAAQMSDEPIP
MDASTLAALNPNRLWVRKLLKGKAVKAG
>P77182 ~~~mnmC~~~tRNA 5-methylaminomethyl-2-thiouridine biosynthesis bifunctional protein MnmC~~~COG0665
MKHYSIQPANLEFNAEGTPVSRDFDDVYFSNDNGLEETRYVFLGGNQLEVRFPEHPHPLFVVAESGFGTGLNFLTLWQAF
DQFREAHPQAQLQRLHFISFEKFPLTRADLALAHQHWPELAPWAEQLQAQWPMPLPGCHRLLLDEGRVTLDLWFGDINEL
TSQLDDSLNQKVDAWFLDGFAPAKNPDMWTQNLFNAMARLARPGGTLATFTSAGFVRRGLQDAGFTMQKRKGFGRKREML
CGVMEQTLPLPCSAPWFNRTGSSKREAAIIGGGIASALLSLALLRRGWQVTLYCADEAPALGASGNRQGALYPLLSKHDE
ALNRFFSNAFTFARRFYDQLPVKFDHDWCGVTQLGWDEKSQHKIAQMLSMDLPAELAVAVEANAVEQITGVATNCSGITY
PQGGWLCPAELTRNVLELAQQQGLQIYYQYQLQNLSRKDDCWLLNFAGDQQATHSVVVLANGHQISRFSQTSTLPVYSVA
GQVSHIPTTPELAELKQVLCYDGYLTPQNPANQHHCIGASYHRGSEDTAYSEDDQQQNRQRLIDCFPQAQWAKEVDVSDK
EARCGVRCATRDHLPMVGNVPDYEATLVEYASLAEQKDEAVSAPVFDDLFMFAALGSRGLCSAPLCAEILAAQMSDEPIP
MDASTLAALNPNRLWVRKLLKGKAVKAG
>Q8ZD36 ~~~mnmC~~~tRNA 5-methylaminomethyl-2-thiouridine biosynthesis bifunctional protein MnmC~~~COG0665
MNQRPIQTATLSWNEQGTPVSEQFGDIYFSNEDGLEETHHVFLKGNGFPARFASHPQQSCIFAETGFGTGLNFLTLWRDF
ALFRQQSPNATLRRLHYISFEKYPLHVADLASAHARWPELASFAEQLRAQWPLPLAGCHRILLADGAITLDLWFGDVNTL
LPTLDDSLNNQVDAWFLDGFAPAKNPDMWNEQLFNAMARMTRPGGTFSTFTAAGFVRRGLQQAGFNVTKVKGFGQKREML
TGTLPQQIHAPTAPWYHRPAATRCDDIAIIGGGIVSALTALALQRRGAVVTLYCADAQPAQGASGNRQGALYPLLNGKND
ALETFFTSAFTFARRQYDQLLEQGIAFDHQWCGVSQLAFDDKSRGKIEKMLHTQWPVEFAEAMSREQLSELAGLDCAHDG
IHYPAGGWLCPSDLTHALMMLAQQNGMTCHYQHELQRLKRIDSQWQLTFGQSQAAKHHATVILATGHRLPEWEQTHHLPL
SAVRGQVSHIPTTPVLSQLQQVLCYDGYLTPVNPANQHHCIGASYQRGDIATDFRLTEQQENRERLLRCLPQVSWPQQVD
VSDNQARCGVRCAIRDHLPMVGAVPDYAATLAQYQDLSRRIQHGGESEVNDIAVAPVWPELFMVGGLGSRGLCSAPLVAE
ILAAQMFGEPLPLDAKTLAALNPNRFWIRKLLKGRPVQTRSPATQESSR
>Q8KAS1 3.6.-.-~~~mnmE~~~tRNA modification GTPase MnmE~~~COG0486
MSPSDLHLPVPGHPIAAIATPVGVGALAIVRISGAGVLDLADRVFRKVHGSGKLAEAAGYTAHFGRLYDGEEMVDEVIAL
VFRAPRSFTAEQMVEFTCHGGPVVVGRVLRLMLDNGCRLAEPGEFTRRAFLNGRIDLLQAEAIGEMIHARTESAYRTAVS
QMKGDLSVRLGGLREQLIRSCALIELELDFSEEDVEFQSRDELTMQIETLRSEVNRLIDSYQHGRIVSEGVSTVIAGKPN
AGKSTLLNTLLGQERAIVSHMPGTTRDYIEECFIHDKTMFRLTDTAGLREAGEEIEHEGIRRSRMKMAEADLILYLLDLG
TERLDDELTEIRELKAAHPAAKFLTVANKLDRAANADALIRAIADGTGTEVIGISALNGDGIDTLKQHMGDLVKNLDKLH
EASVLVTSLRHYEALRNASDALQNALELIAHESETELIAFELRAALDYVGQITGKVVNEEVLNTIFDKFCIGK
>P25522 3.6.-.-~~~mnmE~~~tRNA modification GTPase MnmE~~~COG0486
MSDNDTIVAQATPPGRGGVGILRISGFKAREVAETVLGKLPKPRYADYLPFKDADGSVLDQGIALWFPGPNSFTGEDVLE
LQGHGGPVILDLLLKRILTIPGLRIARPGEFSERAFLNDKLDLAQAEAIADLIDASSEQAARSALNSLQGAFSARVNHLV
EALTHLRIYVEAAIDFPDEEIDFLSDGKIEAQLNDVIADLDAVRAEARQGSLLREGMKVVIAGRPNAGKSSLLNALAGRE
AAIVTDIAGTTRDVLREHIHIDGMPLHIIDTAGLREASDEVERIGIERAWQEIEQADRVLFMVDGTTTDAVDPAEIWPEF
IARLPAKLPITVVRNKADITGETLGMSEVNGHALIRLSARTGEGVDVLRNHLKQSMGFDTNMEGGFLARRRHLQALEQAA
EHLQQGKAQLLGAWAGELLAEELRLAQQNLSEITGEFTSDDLLGRIFSSFCIGK
>P75104 3.6.-.-~~~mnmE~~~tRNA modification GTPase MnmE~~~
MDTKQTMFALATAPFNSAIHIIRLSGPDVYRIINQITNKEVKPLGMRIQRVWLIDHNQKKVDDVLLFKFVAPNSYTGEDL
IEISCHGSMVIVNEIIGLLLKHGAVQAQPGEFTQRGYLNGKMSLNQAASVNNLVLSPNTTLKDVALNALAGQVDARLEPL
VEKLGQLVMQMEVNLDYPEYTDEQRELVTMNQAVVQITQILNQIVVGQDQLQRLKDPFKIAIIGNTNVGKSSLLNALLDQ
DKAIVSAIKGSTRDIVEGDFALNGHFVKILDTAGIRQHQSALEKAGIQKTFGAIKTANLVIYLLDARQPEPDPKIIARLK
KLKKDFFLVHNKADLVQQSFQVSISAKQKQIQPLVDLLTQYLHQFYSVEQNQLYLISDWQTILLQKAIAELEHFLIKQQN
CLFFDILVVHLRAAHEYILQVLGKNTNYDLINEIFKHFCLGK
>Q8YN91 3.6.-.-~~~mnmE~~~tRNA modification GTPase MnmE~~~COG0486
MAITGTIAAIATAIVPQQGSVGIVRVSGSQAIAIAQTLFDAPGKQVWESHRILYGYIRHPQTRQIVDEALLLLMKAPRSY
TREDVVEFHCHGGIIAVQQVLQLCLESGARLAQPGEFTLRAFLNGRLDLTQAESIADLVGARSPQAAQTALAGLQGKLAH
PIRQLRANCLDILAEIEARIDFEEDLPPLDDEAIISDIENIAAEISQLLATKDKGELLRTGLKVAIVGRPNVGKSSLLNA
WSQSDRAIVTDLPGTTRDVVESQLVVGGIPVQVLDTAGIRETSDQVEKIGVERSRQAANTADLVLLTIDAATGWTTGDQE
IYEQVKHRPLILVMNKIDLVEKQLITSLEYPENITQIVHTAAAQKQGIDSLETAILEIVQTGKVQAADMDLAINQRQAAA
LTQAKMSLEQVQATITQQLPLDFWTIDLRGAIQALGEITGEEVTESVLDRIFSRFCIGK
>Q9WYA4 3.6.5.-~~~mnmE~~~tRNA modification GTPase MnmE~~~COG0486
MDTIVAVATPPGKGAIAILRLSGPDSWKIVQKHLRTRSKIVPRKAIHGWIHENGEDVDEVVVVFYKSPKSYTGEDMVEVM
CHGGPLVVKKLLDLFLKSGARMAEPGEFTKRAFLNGKMDLTSAEAVRDLIEAKSETSLKLSLRNLKGGLRDFVDSLRREL
IEVLAEIRVELDYPDEIETNTGEVVTRLERIKEKLTEELKKADAGILLNRGLRMVIVGKPNVGKSTLLNRLLNEDRAIVT
DIPGTTRDVISEEIVIRGILFRIVDTAGVRSETNDLVERLGIERTLQEIEKADIVLFVLDASSPLDEEDRKILERIKNKR
YLVVINKVDVVEKINEEEIKNKLGTDRHMVKISALKGEGLEKLEESIYRETQEIFERGSDSLITNLRQKQLLENVKGHLE
DAIKSLKEGMPVDMASIDLERALNLLDEVTGRSFREDLLDTIFSNFCVGK
>O66962 ~~~mnmG~~~tRNA uridine 5-carboxymethylaminomethyl modification enzyme MnmG~~~COG0445
MAWVVDEFDVVVIGGGHAGIEAALAAARMGAKTAMFVLNADTIGQMSCNPAIGGIAKGIVVREIDALGGEMGKAIDQTGI
QFKMLNTRKGKAVQSPRAQADKKRYREYMKKVCENQENLYIKQEEVVDIIVKNNQVVGVRTNLGVEYKTKAVVVTTGTFL
NGVIYIGDKMIPGGRLGEPRSEGLSDFYRRFDFPLIRFKTGTPARLDKRTIDFSALEVAPGDDPPPKFSFWTEPVGSYWF
PKGKEQVNCWITYTTPKTHEIIRKNLHRTALYGGLIKGIGPRYCPSIEDKIVKFPDKERHQIFLEPEGLDTIEIYPNGLS
TSLPEEVQWEMYRSIPGLENVVLIRPAYAIEYDVVPPTELYPTLETKKIRGLFHAGNFNGTTGYEEAAGQGIVAGINAAL
RAFGKEPIYLRRDESYIGVMIDDLTTKGVTEPYRLFTSRSEYRLYIRQDNAILRLAKLGRELGLLSEEQYKLVKELEREI
EKWKEFYKSERVSVAVGGDTRSYSVATLMTMNYTLDDVKEKFGYEVPQHPYVKEEVEIQLKYEPYIERERKLNEKLKKLE
DTKIPPDIDYDKIPGLTKEAREKLKKFKPITVGQASRIDGITPAAITALLVYLGKLD
>Q8KA85 ~~~mnmG~~~tRNA uridine 5-carboxymethylaminomethyl modification enzyme MnmG~~~COG0445
MYDVIVVGAGHAGCEAALAVARGGLHCLLITSDLSAVARMSCNPAIGGVAKGQITREIDALGGEMGKAIDATGIQFRMLN
RSKGPAMHSPRAQADKTQYSLYMRRIVEHEPNIDLLQDTVIGVSANSGKFSSVTVRSGRAIQAKAAILACGTFLNGLIHI
GMDHFPGGRSTAEPPVEGLTESLASLGFSFGRLKTGTPPRIDSRSVDYTIVTEQPGDVDPVPFSFSSTSVANRNLVSCYL
TKTTEKTHDILRTGFDRSPLFTGKVQGVGPRYCPSIEDKISRFPDKSSHHIFLEPEGTDTVEMYVNGFSTSLPEDIQIAG
LRSIPGLEEAKMIRPGYAIEYDFFHPWQIRSTMETRPVENLFFAGQINGTSGYEEAAAQGLMAGINAVRKILGKELIVLG
RDQAYIGVLIDDLITKETKEPYRMFTSSAEHRLILRHDNADLRLRKIGYDCNLVSSDDLHRTESIIKRVQHCLEVMKTAK
VTPAEINTLLMNKGLQELKTPARALSLIKRPGISLQDILEHSLSVRSAAEELCNDPRVAEQVQIEIKYEGYIKREQLVAD
RIARLDSLHIPDNFNYDSLNSLSSEGREKLLKHRPATIGQASRILGVSPSDVSILMIRLGR
>Q8XAY0 ~~~mnmG~~~tRNA uridine 5-carboxymethylaminomethyl modification enzyme MnmG~~~COG0445
MFYPDPFDVIIIGGGHAGTEAAMAAARMGQQTLLLTHNIDTLGQMSCNPAIGGIGKGHLVKEVDALGGLMAKAIDQAGIQ
FRILNASKGPAVRATRAQADRVLYRQAVRTALENQPNLMIFQQAVEDLIVENDRVVGAVTQMGLKFRAKAVVLTVGTFLD
GKIHIGLDNYSGGRAGDPPSIPLSRRLRELPLRVGRLKTGTPPRIDARTIDFSVLAQQHGDNPMPVFSFMGNASQHPQQV
PCYITHTNEKTHDVIRSNLDRSPMYAGVIEGVGPRYCPSIEDKVMRFADRNQHQIFLEPEGLTSNEIYPNGISTSLPFDV
QMQIVRSMQGMENAKIVRPGYAIEYDFFDPRDLKPTLESKFIQGLFFAGQINGTTGYEEAAAQGLLAGLNAARLSDDKEG
WAPARSQAYLGVLVDDLCTLGTKEPYRMFTSRAEYRLMLREDNADLRLTEIGRELGLVDDERWARFNEKLENIERERQRL
KSTWVTPSAEAAAEVNAHLTAPLSREASGEDLLRRPEMTYEKLTTLTPFAPALTDEQAAEQVEIQVKYEGYIARQQDEIE
KQLRNENTLLPATLDYRQVSGLSNEVIAKLNDHKPASIGQASRISGVTPAAISILLVWLKKQGMLRRSA
>P0A6U3 ~~~mnmG~~~tRNA uridine 5-carboxymethylaminomethyl modification enzyme MnmG~~~COG0445
MFYPDPFDVIIIGGGHAGTEAAMAAARMGQQTLLLTHNIDTLGQMSCNPAIGGIGKGHLVKEVDALGGLMAKAIDQAGIQ
FRILNASKGPAVRATRAQADRVLYRQAVRTALENQPNLMIFQQAVEDLIVENDRVVGAVTQMGLKFRAKAVVLTVGTFLD
GKIHIGLDNYSGGRAGDPPSIPLSRRLRELPLRVGRLKTGTPPRIDARTIDFSVLAQQHGDNPMPVFSFMGNASQHPQQV
PCYITHTNEKTHDVIRSNLDRSPMYAGVIEGVGPRYCPSIEDKVMRFADRNQHQIFLEPEGLTSNEIYPNGISTSLPFDV
QMQIVRSMQGMENAKIVRPGYAIEYDFFDPRDLKPTLESKFIQGLFFAGQINGTTGYEEAAAQGLLAGLNAARLSADKEG
WAPARSQAYLGVLVDDLCTLGTKEPYRMFTSRAEYRLMLREDNADLRLTEIGRELGLVDDERWARFNEKLENIERERQRL
KSTWVTPSAEAAAEVNAHLTAPLSREASGEDLLRRPEMTYEKLTTLTPFAPALTDEQAAEQVEIQVKYEGYIARQQDEIE
KQLRNENTLLPATLDYRQVSGLSNEVIAKLNDHKPASIGQASRISGVTPAAISILLVWLKKQGMLRRSA
>P64230 ~~~mnmG~~~tRNA uridine 5-carboxymethylaminomethyl modification enzyme MnmG~~~
MVQEYDVIVIGAGHAGVEAGLASARRGAKTLMLTINLDNIAFMPCNPSVGGPAKGIVVREIDALGGQMAKTIDKTHIQMR
MLNTGKGPAVRALRAQADKVLYQQEMKRVIEDEENLHIMQGMVDELIIEDNEVKGVRTNIGTEYLSKAVIITTGTFLRGE
IILGNMKYSSGPNHQLPSITLSDNLRELGFDIVRFKTGTPPRVNSKTIDYSKTEIQPGDDVGRAFSFETTEYILDQLPCW
LTYTNAETHKVIDDNLHLSAMYSGMIKGTGPRYCPSIEDKFVRFNDKPRHQLFLEPEGRNTNEVYVQGLSTSLPEHVQRQ
MLETIPGLEKADMMRAGYAIEYDAIVPTQLWPTLETKMIKNLYTAGQINGTSGYEEAAGQGLMAGINAAGKVLNTGEKIL
SRSDAYIGVLIDDLVTKGTNEPYRLLTSRAEYRLLLRHDNADLRLTDMGYELGMISEERYARFNEKRQQIDAEIKRLSDI
RIKPNEHTQAIIEQHGGSRLKDGILAIDLLRRPEMTYDIILEILEEEHQLNADVEEQVEIQTKYEGYINKSLQQVEKVKR
MEEKKIPEDLDYSKIDSLATEAREKLSEVKPLNIAQASRISGVNPADISILLIYLEQGKLQRVSD
>O34614 2.1.1.61~~~mnmM~~~tRNA (mnm(5)s(2)U34)-methyltransferase~~~COG2519
MILKKILPYSKELLKMAAGEGDIVVDATMGNGHDTQFLAELVGENGHVYAFDIQESAVANTKERLGDMYQARTTLFHKSH
DKIAESLPPETHGKVAAAVFNLGYLPGGDKSITTNGSSTIKAIEQLLSIMKDEGLIVLVVYHGHPEGKAEKNDVLEFCRD
LDQQTARVLTYGFINQQNDPPFIVAIEKKAQISK
>Q2FXG9 2.1.1.61~~~mnmM~~~tRNA (mnm(5)s(2)U34)-methyltransferase~~~COG0275
MKLERILPFSKTLIKQHITPESIVVDATCGNGNDTLFLAEQVPEGHVYGFDIQDLALENTRDKVKDFNHVSLIKDGHENI
EHHINDAHKGHIDAAIFNLGYLPKGDKSIVTKPDTTIQAINSLLSLMSIEGIIVLVIYHGHSEGQIEKHALLDYLSTLDQ
KHAQVLQYQFLNQRNHAPFICAIEKIS
>Q9RCG0 1.1.99.37~~~mno~~~Methanol:N,N-dimethyl-4-nitrosoaniline oxidoreductase~~~
MQVDELLKPFPIKEFHPFPRALLGPGAHEMIGPEALKLGFKKTLVMTSGLRGSDIVHKITESMKYHGLEVVLYDKVESNP
KDYNVMDAVKLYQENKCDSFVSIGGGSSHDACKGARISVAHDGRNVNDFEGFNKSENPRNPPHIAVSTTAGTGSETSWAY
VITDTTTDPDNPHKYVAFDDASVATLAIDDPVLYYSCPIDYTAQCGFDVLAHASEPYVSRLNFEPSLGNALRAIKLTAEN
LRQATWNPSELSGREGMMYAQYIAAQAFNSGGLGIIHSISHAVSAFYDTHHGLNNAIALPRVWAFNMPVAYKRFADMAEA
MGVDTHGMTDVQAADALAAAIRLLRDVGIPEKFTDVTQDSYSKNRLGQGPTKFYEQASVIKGDDEDVDRITNHVLGDACT
PGNAKECTFETVRPVVDHCMNGDLDDLLS
>C5MRT8 1.1.99.37~~~~~~Methanol:N,N-dimethyl-4-nitrosoaniline oxidoreductase~~~
MAIELNQIWDFPIKEFHPFPRALLGVGAHDIIGVEAKNLGFKRTLLMTTGLRGSGIIEELTGKIEYQGVEVVLYDKVESN
PKDYNVMEAAALYQQERCDSIISIGGGSSHDAAKGARVVIAHDGRNINEFEGFAKSTNKQNPPHIAVSTTAGTGSETSWA
YVITDTSDMEHPHKWVGFDEATIVTLAIDDPLLYYTCPQHFTAYCGFDVLAHGSEPYVSRLDFAPSLGNALYSVELVAKH
LREAVFEPRNLKAREGMMNAQYIAGQAFNSGGLGIVHSISHAVSAFFDSHHGLNNAIALPRVWEYNLPSRYERYAQLATA
MGVDTRNMTTVQAADAAVEAAIRLSQDVGIPDNFSQVRVDSYDKNRMNTGKYAGKGEVIKGDDKSVLAISEHIQGDWCTP
GNPREVTVDSMIPVVGHAINGTY
>Q53062 1.1.99.37~~~thcE~~~Methanol:N,N-dimethyl-4-nitrosoaniline oxidoreductase~~~
AIELNQIWDFPIKEFHPFPRALMGVGAHDIIGVEAKNLGFKRTLLMTTGLRGSGIIEELVGKIEYQGVEVVLYDKVESNP
KDYNVMEAAALYQKEKCDSIISIGGGSSHDAAKGARVVIAHDGRNINEFEGFAKSTNKENPPHIAVSTTAGTGSETSWAY
VITDTSDMNNPHKWVGFDEATIVTLAIDDPLLYYTCPQHFTAYCGFDVLAHGSEPFVSRLDFAPSLGNAIYSVELVAKNL
REAVFEPRNLKAREGMMNAQYIAGQAFNSGGLGIVHSISHAVSAFFDSHHGLNNAIALPRVWEYNLPSRYERYAQLAGAL
GVDTRNLTTVQAADAAVEAAIRLAKDVGIPDNFGQVRTDSYAKNQMNTKKYEGRGDVIKGDEKTVRAISEHIQDDWCTPG
NPREVTVESMIPVVDHAINKSYF
>A0A0B0QJN8 2.7.7.108~~~mntA~~~Protein adenylyltransferase MntA~~~
MQDKIPTIAELRELSLRLLTKIPYLKMLVLFGSRATGNINANSDWDFAVLYDEEKYNLYIQNNPLAAFVIPGILGEIFKI
NSDKIDIVELNHCSKLIAHFVARDGKVLYEEPGDEFDKFQQRVLLSNTEIKKIEKTKLENIENFLQRWGV
>O34385 ~~~mntA~~~Manganese-binding lipoprotein MntA~~~COG0803
MRQGLMAAVLFATFALTGCGTDSAGKSADQQLQVTATTSQIADAAENIGGKHVKVTSLMGPGVDPHLYKASQGDTKKLMS
ADVVLYSGLHLEGKMEDVLQKIGEQKQSAAVAEAIPKNKLIPAGEGKTFDPHVWFSIPLWIYAVDEIEAQFSKAMPQHAD
AFRKNAKEYKEDLQYLDKWSRKEIAHIPEKSRVLVTAHDAFAYFGNEYGFKVKGLQGLSTDSDYGLRDVQELVDLLTEKQ
IKAVFVESSVSEKSINAVVEGAKEKGHTVTIGGQLYSDAMGEKGTKEGTYEGMFRHNINTITKALK
>P43933 2.7.7.108~~~mntA~~~Probable protein adenylyltransferase MntA~~~COG1708
MTSFAQLDIKSEELAIVKTILQQLVPDYTVWAFGSRVKGKAKKYSDLDLAIISEEPLDFLARDRLKEAFSESDLPWRVDL
LDWATTSEDFREIIRKVYVVIQEKEKTVEKPTAL
>Q8Y653 ~~~mntA~~~Manganese-binding lipoprotein MntA~~~COG0803
MKKIIVVSLFALVVVLAGCSSQNSDSKKTDGKLNVVATYSILADIVKNVGGNKIELHSIVPVGVDPHEYDPLPANIQSAA
DADLIFYNGLNLETGNGWFDRMLETADKSREDKNQVVELSKGVKPKYLTEKGKTSETDPHAWLDLHNGIIYTENVRDALV
KADPDNADFYKENAKKYIDKLATLDKEAKQKFADLPENQKTLVTSEGAFKYFAARYGLKAAYIWEINTESQGTPDQMKQI
VGIVEKEKVPNLFVETSVDPRSMESVSKETGVPIFAKIFTDSTAKKGEVGDTYLEMMRYNLDKIHDGLAK
>Q8ECH7 2.7.7.108~~~mntA~~~Protein adenylyltransferase MntA~~~COG1669
MQQLNENKIIKLLRDNIPKLQLIYLFGSYSQGTQHRNSDIDIAVLAADTLDNIARWELAQKLASALDSDVDLVDLRSAST
VLCQQVVTQGKQLWGTQQDDELFAVKTISMYQHLQAERQAIIDDVMANTAAKAHRGESL
>O34338 ~~~mntB~~~Manganese transport system ATP-binding protein MntB~~~COG1121
MFPVELDNVTVAYHKKPVLQDISLQVPEGKLIGIIGPNGAGKSTLIKTILGLVPRASGDISIYGKDYKDQRTRIGYVPQR
GSVDWDFPTSPLDVVLMGRYGRIGLLKRPKKADVEMAKAALTKVGMHDYAKRQISQLSGGQQQRVFLARALCQNADIYFM
DEPFAGVDAATERAIMTLLAELKEKGKTVLVVHHDLQTAEDYFDWILLLHLRKIAFGPTENVFTIENLQKTYGGRLTFLK
DKVLAEGHKE
>Q55282 ~~~mntB~~~Manganese transport system membrane protein MntB~~~COG1108
MNQLVVAFPFWHWLVEPLQYEFLIRAIWVSAFVGLVCAVLSCYITLKGWSLMGDAISHAVVPGVVLAYALNIPFAIGAFT
FGFGATVAIGYVKSKTRLKEDAVIGIVFTGFFALGLVLVTKIPSNVDLFHILFGNVLGISQQDIIQTLIAGSITLIVILL
RRKDLLLFCFDPNHAKAIGLRTQVMYYTLLSVLALTIVAALQTAGIILVISMLVTPGSIGYLLSDRFDHMLWYSVVSSVL
SCVLGTYLSYHFDVSTGGMIVVILTTLFVIAMIGAPKYGILAQEWRKRSGPNPEDDENQTVVVDQV
>O35024 ~~~mntC~~~Manganese transport system membrane protein MntC~~~COG1108
MFESLWLQLQHPNTQWVLAGTLLLGTASGVLGSFVLLRKQSLIGDAMAHSALPGVCLAFLFTGQKSLPFFLLGAALAGLL
GTFCIQLIPRLSKTKEDSAIGIVLSVFFGVGIILLTYIQQQGAGSQSGLDSFLFGQAASLVRQDIILIAGISAVLLLLCI
VFFKEFTLITFDLAFAKGLGIPVRFLNGLLACLIVCAVVIGLQTVGVILMAAMLITPAITARYWTERLTGMIIIAGITGG
VSGVAGTLLSTTMKGMATGPLMILSATLLFLFSMICAPKRGLAAKAIRLMRLRRRTSREQVLLAIYEQYEKNNLCVTVES
VRKKRRLSPSLCLKALNDLEQERCIERIENGIWQITSKGIEKGYHTALKQRMYEVYLMHEMELANIESDQDYFDPDRLPR
ETRERLYSLLKLYGRMPERRKASHDAEKGQIANEF
>O34500 ~~~mntD~~~Manganese transport system membrane protein MntD~~~COG1108
MSFEAWIIATGVLVGVSCGLIGTFLVLRSMAMLADAISHTVLLGIVGAFLVTGSLDGIYMFIGAAATGLLTAFLVQLLHS
KGVQSDAAIGVVFTSLFAIGVILLSVYGANVHLDIEHSLMGEIAFVPWNTVTVFGVDIGPKAFWMLASVLVLNVVLISVC
YKEFKIASFDPQMALALGIPVLLIHYVQMGMLSLTTVASFDSVGAVLVVAMLIVPPAAAHLLTDRLLYMLILSALIGGLS
AVMGYFFATWLNVSISGAMAAMTGVCYASAFLFSPANGVITKKLRTLNMQKERAG
>P96593 ~~~mntH~~~Divalent metal cation transporter MntH~~~COG1914
MMNKDITAQSPRSKAVQDALDGKIRGFRGLLPFLGPAFIAAIAYIDPGNFATNISAGSKYGYMLLWVILFSNIMALLIQS
LSAKLGIATGKNLPEVAREEFPKPVSIGLWIQGELVIIATDLAEFIGAALGLYLLFGIPMLEASIIAAIGSFAILELQRR
GYRSLEAGIAGMLFVVVIAFALQTFFAKPDAVSVMKGLFVPAFHGTDSVLLAAGILGATVMPHAIYLHSALTQRRVVGKT
DAERKKIFRFEFIDILIAMLIAGAINASMLIVAAALFFKNGLFVEDLDVAFQQFGHLVSPMSAALFGIGLLVAGLSSSSV
GTLSGDVIMQGFINYRIPLYVRRFITIIPPILIIASGVNPTTALVLSQVVLSFGIAFALIPLIMFTSNKRIMGSLINAKW
ITVVSWLIAVLIVALNVFLIVDTFR
>Q9RTP8 ~~~mntH~~~Divalent metal cation transporter MntH~~~COG1914
MDSRSPSLPDDRPDPPEQHLDARAGATLRGTAGPRGVRRILPFLGPAVIASIAYMDPGNFATNIEGGARYGYSLLWVILA
ANLMAMVIQNLSANLGIASGRNLPELIRERWPRPLVWFYWIQAELVAMATDLAEFLGAALAIQLLTGLPMFWGAVVTGVV
TFWLLNLQKRGTRPLELAVGAFVLMIGVAYLVQVVLARPDLAAVGAGFVPRLQGPGSAYLAVGIIGATVMPHVIYLHSAL
TQGRIQTDTTEEKRRLVRLNRVDVIAAMGLAGLINMSMLAVAAATFHGKNVENAGDLTTAYQTLTPLLGPAASVLFAVAL
LASGLSSSAVGTMAGDVIMQGFMGFHIPLWLRRLITMLPAFIVILLGMDPSSVLILSQVILCFGVPFALVPLLLFTARRD
VMGALVTRRSFTVIGWVIAVIIIALNGYLLWELLGG
>P0A769 ~~~mntH~~~Divalent metal cation transporter MntH~~~COG1914
MTNYRVESSSGRAARKMRLALMGPAFIAAIGYIDPGNFATNIQAGASFGYQLLWVVVWANLMAMLIQILSAKLGIATGKN
LAEQIRDHYPRPVVWFYWVQAEIIAMATDLAEFIGAAIGFKLILGVSLLQGAVLTGIATFLILMLQRRGQKPLEKVIGGL
LLFVAAAYIVELIFSQPNLAQLGKGMVIPSLPTSEAVFLAAGVLGATIMPHVIYLHSSLTQHLHGGSRQQRYSATKWDVA
IAMTIAGFVNLAMMATAAAAFHFSGHTGVADLDEAYLTLQPLLSHAAATVFGLSLVAAGLSSTVVGTLAGQVVMQGFIRF
HIPLWVRRTVTMLPSFIVILMGLDPTRILVMSQVLLSFGIALALVPLLIFTSDSKLMGDLVNSKRVKQTGWVIVVLVVAL
NIWLLVGTALGL
>Q93V04 ~~~mntH~~~Divalent metal cation transporter MntH~~~
MKEGIDMRESVAEEPKHKFFQSTEGENKSLDEVNGSVKVPKNAGFWRTLFAYTGPGILIAVGYMDPGNWITSIAGGAQFK
YSLLSVILISSLIAMLLQSMAARLGIVTGKDLAQLTRERTSKTMGIILWLITESAIMATDVAEIIGSGIAIKLLFNIPLV
VGILITTADVLILLLLMKLGFRKIEAIVATLVAVILLVFTYEVFLAGPQLDQMFAGYMPTKDIVTNKSMLYLALGIVGAT
VMPHDLYLGSSISQTRAVDRHDRQDVAKAIKFTTIDSNLQLTIAFIVNSLLLILGAALFFGTNSTVGRFVDLFNSLNNSH
IVGAIASPMLSMLFAVALLSSGQSSTITGTLSGQIIMEGFIHLRMPLWAQRLLTRLLSVTPVLIFAIYYHGNEAKIENLL
TLSQVFLSVALPFAIVPLVKFTSSKELMGEFVNKAWVKYSAWVATVVLVSLNIYLILQTVGVIG
>P9WIZ5 ~~~mntH~~~Divalent metal cation transporter MntH~~~COG1914
MAGEFRLLSHLCSRGSKVGELAQDTRTSLKTSWYLLGPAFVAAIAYVDPGNVAANVSSGAQFGYLLLWVIVAANVMAALV
QYLSAKLGLVTGRSLPEAIGKRMGRPARLAYWAQAEIVAMATDVAEVIGGAIALRIMFNLPLPIGGIITGVVSLLLLTIQ
DRRGQRLFERVITALLLVIAIGFTASFFVVTPPPNAVLGGLAPRFQGTESVLLAAAIMGATVMPHAVYLHSGLARDRHGH
PDPGPQRRRLLRVTRWDVGLAMLIAGGVNAAMLLVAALNMRGRGDTASIEGAYHAVHDTLGATIAVLFAVGLLASGLASS
SVGAYAGAMIMQGLLHWSVPMLVRRLITLGPALAILTLGFDPTRTLVLSQVVLSFGIPFAVLPLVKLTGSPAVMGGDTNH
RATTWVGWVVAVMVSLLNVMLIYLTVTG
>Q9RPF4 ~~~mntH~~~Divalent metal cation transporter MntH~~~
MTDNRVENSSGRAARKLRLALMGPAFIAAIGYIDPGNFATNIQAGASFGYQLLWVVVWANLMAMLIQILSAKLGIATGKN
LAEQIRDHYPRPVVWFYWVQAEIIAMATDLAEFIGAAIGFKLILGVSLLQGAVLTGIATFLILMLQRRGQKPLEKVIGGL
LLFVAAAYIVELFFSQPDMAQLGKGMVIPALPNPEAVFLAAGVLGATIMPHVIYLHSSLTQHLHGGTRQQRYSATKWDVA
IAMTIAGFVNLAMMATAAAAFHFSGHTGIADLDQAYLTLEPLLSHAAATVFGLSLVAAGLSSTVVGTLAGQVVMQGFVRF
HIPLWVRRTITMLPSFIVILMGLDPTRILVMSQVLLSFGIALALVPLLIFTSNATLMGELVNTRRVKQVGWIIVVLVVAL
NIWLLVGTVMGLS
>Q99UZ7 ~~~mntH~~~Divalent metal cation transporter MntH~~~
MNNKRHSTNEQLSLDEINNTIKFDHRSSNKQKFLSFLGPGLLVAVGYMDPGNWITSMQGGAQYGYTLLFVILISSLSAML
LQSMTVRLGIATGMDLAQMTRHYLSRPIAIIFWIIAELAIIATDIAEVIGSAIALNLLFNIPLIVGALITVLDVFLLLFI
MKYGFRKIEAIVGTLIFTVLFIFIFEVYISSPQLNAVLNGFIPHSEIITNNGILYIALGIIGATIMPHNLYLHSSIVQSR
TYSRHNNEEKAQAIKFATIDSNIQLSIAFVVNCLLLVLGASLFFNSNADDLGGFYDLYHALKTEPVLGATMGAIMSTLFA
VALLASGQNSTITGTLAGQIVMEGFLRLHIPNWLRRLITRSLAVIPVIVCLIIFKGNAAKIEQLLVFSQVFLSIALPFCL
IPLQLATSNKDLMGPFYNKTWVNIISWTLIIILSILNVYLIVQTFQELQS
>P76264 ~~~mntP~~~Probable manganese efflux pump MntP~~~COG1971
MNITATVLLAFGMSMDAFAASIGKGATLHKPKFSEALRTGLIFGAVETLTPLIGWGMGMLASRFVLEWNHWIAFVLLIFL
GGRMIIEGFRGADDEDEEPRRRHGFWLLVTTAIATSLDAMAVGVGLAFLQVNIIATALAIGCATLIMSTLGMMVGRFIGS
IIGKKAEILGGLVLIGIGVQILWTHFHG
>P54512 ~~~mntR~~~HTH-type transcriptional regulator MntR~~~COG1321
MTTPSMEDYIEQIYMLIEEKGYARVSDIAEALAVHPSSVTKMVQKLDKDEYLIYEKYRGLVLTSKGKKIGKRLVYRHELL
EQFLRIIGVDEEKIYNDVEGIEHHLSWNSIDRIGDLVQYFEEDDARKKDLKSIQKKTEHHNQ
>P0A9F1 ~~~mntR~~~Transcriptional regulator MntR~~~COG1321
MSRRAGTPTAKKVTQLVNVEEHVEGFRQVREAHRRELIDDYVELISDLIREVGEARQVDMAARLGVSQPTVAKMLKRLAT
MGLIEMIPWRGVFLTAEGEKLAQESRERHQIVENFLLVLGVSPEIARRDAEGMEHHVSEETLDAFRLFTQKHGAK
>Q9K943 ~~~mntR~~~HTH-type transcriptional regulator MntR~~~COG1321
MPTPSMEDYLERIYLLIEEKGYARVSDIAEALEVHPSSVTKMVQKLDKSDYLVYERYRGLILTAKGKKIGKRLVYRHDLL
EDFLKMIGVDSDHIYEDVEGIEHHLSWDAIDRIGDLVQYFQEDPSRLNDLREVQKKNEE
>P0DKB3 ~~~mntS~~~Small protein MntS~~~
MNEFKRCMRVFSHSPFKVRLMLLSMLCDMVNNKPQQDKPSDK
>P9WJS3 4.1.99.22~~~moaA1~~~GTP 3',8-cyclase 1~~~COG2896
MSTPTLPDMVAPSPRVRVKDRCRRMMGDLRLSVIDQCNLRCRYCMPEEHYTWLPRQDLLSVKEISAIVDVFLSVGVSKVR
ITGGEPLIRPDLPEIVRTLSAKVGEDSGLRDLAITTNGVLLADRVDGLKAAGMKRITVSLDTLQPERFKAISQRNSHDKV
IAGIKAVAAAGFTDTKIDTTVMRGANHDELADLIEFARTVNAEVRFIEYMDVGGATHWAWEKVFTKANMLESLEKRYGRI
EPLPKHDTAPANRYALPDGTTFGIIASTTEPFCATCDRSRLTADGLWLHCLYAISGINLREPLRAGATHDDLVETVTTGW
RRRTDRGAEQRLAQRERGVFLPLSTLKADPHLEMHTRGG
>P9WJS1 4.1.99.22~~~moaA2~~~GTP 3',8-cyclase 2~~~COG2896
MTLTALGMPALRSRTNGIADPRVVPTTGPLVDTFGRVANDLRVSLTDRCNLRCSYCMPERGLRWLPGEQLLRPDELARLI
HIAVTRLGVTSVRFTGGEPLLAHHLDEVVAATARLRPRPEISLTTNGVGLARRAGALAEAGLDRVNVSLDSIDRAHFAAI
TRRDRLAHVLAGLAAAKAAGLTPVKVNAVLDPTTGREDVVDLLRFCLERGYQLRVIEQMPLDAGHSWRRNIALSADDVLA
ALRPHFRLRPDPAPRGSAPAELWLVDAGPNTPRGRFGVIASVSHAFCSTCDRTRLTADGQIRSCLFSTEETDLRRLLRGG
ADDDAIEAAWRAAMWSKPAGHGINAPDFIQPDRPMSAIGG
>P30745 4.1.99.22~~~moaA~~~GTP 3',8-cyclase~~~COG2896
MASQLTDAFARKFYYLRLSITDVCNFRCTYCLPDGYKPSGVTNKGFLTVDEIRRVTRAFARLGTEKVRLTGGEPSLRRDF
TDIIAAVRENDAIRQIAVTTNGYRLERDVASWRDAGLTGINVSVDSLDARQFHAITGQDKFNQVMAGIDAAFEAGFEKVK
VNTVLMRDVNHHQLDTFLNWIQHRPIQLRFIELMETGEGSELFRKHHISGQVLRDELLRRGWIHQLRQRSDGPAQVFCHP
DYAGEIGLIMPYEKDFCATCNRLRVSSIGKLHLCLFGEGGVNLRDLLEDDTQQQALEARISAALREKKQTHFLHQNNTGI
TQNLSYIGG
>Q44118 4.1.99.22~~~moaA~~~GTP 3',8-cyclase~~~
MPAARPAGAGVGLVDRYGRRATDMRLSLTDKCNLRCTYCMPAEGLEWLSKQAVMSASEIVRIVGIGVGRLGVRELRLTGG
EPLVRHDLVDIIAELRRNHPELPISMTTNGVGLAKKVAPLKAAGLTRINVSLDSLHEETFTKLTRRPFLDQVLAGVDAAW
AAGLGPVKLNAVLMRGINDAEAPSLLAWAVERGYELRFIEQMPLDADHGWTRRNMITAAEIRDLLSTDFVLTPDPRARDG
APAERFEVRRRVAGSGAGLGPVLGTVGIIASVTEPFCSDCRRTRITAEGRIMSCLFSREEFDLLVLLRSGASDDDLARRW
QDAMWLKPKAHGMDHVGLDAPDFVQPDRSMSAIGG
>P69848 4.1.99.22~~~moaA~~~GTP 3',8-cyclase~~~COG2896
MVEQIKDKLGRPIRDLRLSVTDRCNFRCDYCMPKEVFGDDFVFLPKNELLTFDEMARIAKVYAELGVKKIRITGGEPLMR
RDLDVLIAKLNQIDGIEDIGLTTNGLLLKKHGQKLYDAGLRRINVSLDAIDDTLFQSINNRNIKATTILEQIDYATSIGL
NVKVNVVIQKGINDDQIIPMLEYFKDKHIEIRFIEFMDVGNDNGWDFSKVVTKDEMLTMIEQHFEIDPVEPKYFGEVAKY
YRHKDNGVQFGLITSVSQSFCSTCTRARLSSDGKFYGCLFATVDGFNVKAFIRSGVTDEELKEQFKALWQIRDDRYSDER
TAQTVANRQRKKINMNYIGG
>Q816R0 ~~~moaB~~~Molybdenum cofactor biosynthesis protein B~~~
MSVTEHKKQAPKEVRCKIVTISDTRTEETDKSGQLLHELLKEAGHKVTSYEIVKDDKESIQQAVLAGYHKEDVDVVLTNG
GTGITKRDVTIEAVSALLDKEIVGFGELFRMISYLEDIGSSAMLSRAIGGTIGRKVVFSMPGSSGAVRLAMNKLILPELG
HITFELHRQ
>P0AEZ9 ~~~moaB~~~Molybdenum cofactor biosynthesis protein B~~~COG0521
MSQVSTEFIPTRIAILTVSNRRGEEDDTSGHYLRDSAQEAGHHVVDKAIVKENRYAIRAQVSAWIASDDVQVVLITGGTG
LTEGDQAPEALLPLFDREVEGFGEVFRMLSFEEIGTSTLQSRAVAGVANKTLIFAMPGSTKACRTAWENIIAPQLDARTR
PCNFHPHLKK
>P99137 ~~~moaB~~~Molybdenum cofactor biosynthesis protein B~~~
MGEHQNVKLNRTVKAAVLTVSDTRDFVTDKGGQCVRQLLQADDVEVSDAHYTIVKDEKVAITTQVKKWLEEDIDVIITTG
GTGIAQRDVTIEAVKPLLTKEIEGFGELFRYLSYVEDVGTRALLSRAVAGTVNNKLIFSIPGSTGAVKLALEKLIKPELN
HLIHELTK
>P9WJR9 4.6.1.17~~~moaC1~~~Cyclic pyranopterin monophosphate synthase 1~~~COG0315
MIDHALALTHIDERGAARMVDVSEKPVTLRVAKASGLVIMKPSTLRMISDGAAAKGDVMAAARIAGIAAAKRTGDLIPLC
HPLGLDAVSVTITPCEPDRVKILATTTTLGRTGVEMEALTAVSVAALTIYDMCKAVDRAMEISQIVLQEKSGGRSGVYRR
SASDLACQSR
>P9WJR7 4.6.1.17~~~moaC2~~~Cyclic pyranopterin monophosphate synthase 2~~~COG0315
MARASGASDYRSGELSHQDERGAAHMVDITEKATTKRTAVAAGILRTSAQVVALISTGGLPKGDALATARVAGIMAAKRT
SDLIPLCHQLALTGVDVDFTVGQLDIEITATVRSTDRTGVEMEALTAVSVAALTLYDMIKAVDPGALIDDIRVLHKEGGR
RGTWTRR
>P0A738 4.6.1.17~~~moaC~~~Cyclic pyranopterin monophosphate synthase~~~COG0315
MSQLTHINAAGEAHMVDVSAKAETVREARAEAFVTMRSETLAMIIDGRHHKGDVFATARIAGIQAAKRTWDLIPLCHPLM
LSKVEVNLQAEPEHNRVRIETLCRLTGKTGVEMEALTAASVAALTIYDMCKAVQKDMVIGPVRLLAKSGGKSGDFKVEAD
D
>Q5L3F4 4.6.1.17~~~moaC~~~Cyclic pyranopterin monophosphate synthase~~~COG0315
MSSFTHFNEQGRAKMVDITHKEDTVRVAVAQTSVTVSREIYEKMTSNAIEKGDVLAVAQVAGVMAAKKTADLIPMCHPLM
LKGVDIAFAWENDGEAHKLVITATVKTKGSTGVEMEALTAASVCALTVYDMCKALDKGMVIGPTYLVEKTGGKSGHYRRK
TD
>Q2FVX9 4.6.1.17~~~moaC~~~Cyclic pyranopterin monophosphate synthase~~~COG0315
MTEFTHINQQGHAKMVDVSDKQITKRTAVAHSSITVNETIFKQISNNTNTKGNVLNTAQIAGIMAAKNTSTLIPMCHPLP
LTGIDVHFSWDETNAPLYTLNIQTTVSTTGKTGVEMEALTAASATALTIYDMTKAVDKGMIIGETYLESKSGGKSGDFQR
QSNQ
>P30748 ~~~moaD~~~Molybdopterin synthase sulfur carrier subunit~~~COG1977
MIKVLFFAQVRELVGTDATEVAADFPTVEALRQHMAAQSDRWALALEDGKLLAAVNQTLVSFDHPLTDGDEVAFFPPVTG
G
>Q7A441 ~~~moaD~~~Molybdopterin synthase sulfur carrier subunit~~~
MKVLYFAEIKDILQKAQEDIVLEQALTVQQFEDLLFERYPQINNKKFQVAVNEEFVQKSDFIQPNDTVALIPPVSGG
>P9WJR3 2.8.1.12~~~moaE1~~~Molybdopterin synthase catalytic subunit 1~~~COG0314
MANVVAEGAYPYCRLTDQPLSVDEVLAAVSGPEQGGIVIFVGNVRDHNAGHDVTRLFYEAYPPMVIRTLMSIIGRCEDKA
EGVRVAVAHRTGELQIGDAAVVIGASAPHRAEAFDAARMCIELLKQEVPIWKKEFSSTGAEWVGDRP
>P9WJR1 2.8.1.12~~~moaE2~~~Molybdopterin synthase catalytic subunit 2~~~COG0314
MTQVLRAALTDQPIFLAEHEELVSHRSAGAIVGFVGMIRDRDGGRGVLRLEYSAHPSAAQVLADLVAEVAEESSGVRAVA
ASHRIGVLQVGEAALVAAVAADHRRAAFGTCAHLVETIKARLPVWKHQFFEDGTDEWVGSV
>O67928 2.8.1.12~~~moaE~~~Molybdopterin synthase catalytic subunit~~~COG0314
MEVGMIPRVYLGHEWFGAERILSEYQVPEDCGAQVLFLGIPRNAPEDGGNIEALEYEAYPEMAIKEMEKIRQETIEKFGV
KEVFIHHRLGLVKIGEPSFLVLAVGGHREETFKACRYAVDETKKRVPIWKKEIFKEGKGEWVLGEKKNASGQTK
>P30749 2.8.1.12~~~moaE~~~Molybdopterin synthase catalytic subunit~~~COG0314
MAETKIVVGPQPFSVGEEYPWLAERDEDGAVVTFTGKVRNHNLGDSVNALTLEHYPGMTEKALAEIVDEARNRWPLGRVT
VIHRIGELWPGDEIVFVGVTSAHRSSAFEAGQFIMDYLKTRAPFWKREATPEGDRWVEARESDQQAAKRW
>P56422 2.8.1.12~~~moaE~~~Molybdopterin synthase catalytic subunit~~~COG0314
MLKIIQGALDTRELLKAYQEEACAKNFGAFCVFVGIVRKEDNIQGLSFDIYEALLKTWFEKWHHKAKDLGVVLKMAHSLG
DVLIGQSSFLCVSMGKNRKNALELYENFIEDFKHNAPIWKYDLIHNKRIYAKERSHPLKGSGLLA
>P65401 2.8.1.12~~~moaE~~~Molybdopterin synthase catalytic subunit~~~
MKQFEIVIEPIQTEQYREFTINEYQGAVVVFTGHVREWTKGVKTEYLEYEAYIPMAEKKLAQIGDEINEKWPGTITSIVH
RIGPLQISDIAVLIAVSSPHRKDAYRANEYAIERIKEIVPIWKKEIWEDGSKWQGHQKGNYEEAKREE
>P54796 ~~~moaF~~~Protein MoaF~~~
MTSEAVFIQVGALADGFAPHGNLLATASLPAGENFTFYVAGSEPQQLVIEDEQTLSWNGKRAPWRATALRPDILFIDFLD
PERDNASISAVCNLTQRNATLVYGQLPDEAPRAGRLQPGRTRVALTAVEVRFVFARLDAQPGPLPGFTDALIGMRNQYTY
SPTERYEHIYLNDNFYAWQCLDGVEKGLADVDRCHYVQVAEDLYLFVWREKIIPTLGVILIDLQQMRTDGKIMGYQGSDF
GALSNFPVGATAKILNVTRHQE
>A1KNB8 ~~~moaR1~~~Transcriptional regulatory protein MoaR1~~~
MGQRPFSPNHRSGVLNATTAGAVQFNVLGPLELNLRGTKLPLGTPKQRAVLAMLLLSRNQVVAADALVQAIWEKSPPARA
RRTVHTYICNLRRTLSDAGVDSRNILVSEPPGYRLLIGDRQQCDLDRFVAAKESGLRASAKGYFSEAIRYLDSALQNWRG
PVLGDLRSFMFVQMFSRALTGDELLVHTKLAEAAIACGRADVVIPKLERLVAMHPYRESLWKQLMLGYYVNEYQSAAIDA
YHRLKSTLAEELGVEPAPTIRALYHKILRQLPMDDLVGRVTRGRVDLRGGNGAKVEELTESDKDLLPIGLA
>O05797 ~~~moaR1~~~Transcriptional regulatory protein MoaR1~~~COG3629
MGQRPFSPNHRSGVLNATTAGAVQFNVLGPLELNLRGTKLPLGTPKQRAVLAMLLLSRNQVVAADALVQAIWEKSPPARA
RRTVHTYICNLRRTLSDAGVDSRNILVSEPPGYRLLIGDRQQCDLDRFVAAKESGLRASAKGYFSEAIRYLDSALQNWRG
PVLGDLRSFMFVQMFSRALTEDELLVHTKLAEAAIACGRADVVIPKLERLVAMHPYRESLWKQLMLGYYVNEYQSAAIDA
YHRLKSTLAEELGVEPAPTIRALYHKILRQLPMDDLVGRVTRGRVDLRGGNGAKVEELTESDKDLLPIGLA
>P07112 ~~~mobA~~~Mobilization protein A~~~
MAIYHLTAKTGSRSGGQSARAKADYIQREGKYARDMDEVLHAESGHMPEFVERPADYWDAADLYERANGRLFKEVEFALP
VELTLDQQKALASEFAQHLTGAERLPYTLAIHAGGGENPHCHLMISERINDGIERPAAQWFKRYNGKTPEKGGAQKTEAL
KPKAWLEQTREAWADHANRALERAGHDARIDHRTLEAQGIERLPGVHLGPNVVEMEGRGIRTDRADVALNIDTANAQIID
LQEYREAIDHERNRQSEEIQRHQRVSGADRTAGPEHGDTGRRSPAGHEPDPAGQRGAGGGVAESPAPDRGGMGGAGQRVA
GGSRRGEQRRAERPERVAGVALEAMANRDAGFHDAYGGAADRIVALARPDATDNRGRLDLAALGGPMKNDRTLQAIGRQL
KAMGCERFDIGVRDATTGQMMNREWSAAEVLQNTPWLKRMNAQGNDVYIRPAEQERHGLVLVDDLSEFDLDDMKAEGREP
ALVVETSPKNYQAWVKVADAAGGELRGQIARTLASEYDADPASADSRHYGRLAGFTNRKDKHTTRAGYQPWVLLRESKGK
TATAGPALVQQAGQQIEQAQRQQEKARRLASLELPERQLSRHRRTALDEYRSEMAGLVKRFGDDLSKCDFIAAQKLASRG
RSAEEIGKAMAEASPALAERKPGHEADYIERTVSKVMGLPSVQLARAELARAPAPRQRGMDRGGPDFSM
>O67413 2.7.7.77~~~mobA~~~Probable molybdenum cofactor guanylyltransferase~~~COG0746
MRTFTWRKGSLSKVNTCYVLAGGKSKRFGEDKLLYEIKGKKVIERVYETAKSVFKEVYIVAKDREKFSFLNAPVVLDEFE
ESASIIGLYTALKHAKEENVFVLSGDLPLMKKETVLYVLENFKEPVSVAKTEKLHTLVGVYSKKLLEKIEERIKKGDYRI
WALLKDVGYNEVEIPEELRYTLLNMNTKEDLKRILAIENHY
>Q6SSJ6 1.14.13.23~~~mobA~~~3-hydroxybenzoate 4-monooxygenase~~~
MQFHLNGFRPGNPLIAPASPLAPAHTEAVPSQVDVLIVGCGPAGLTLAAQLAAFPDIRTCIVEQKEGPMELGQADGIACR
TMEMFEAFEFADSILKEACWINDVTFWKPDPAQPGRIARHGRVQDTEDGLSEFPHVILNQARVHDHYLERMRNSPSRLEP
HYARRVLDVKIDHGAADYPVTVTLERCDAAHAGQIETVQARYVVGCDGARSNVRRAIGRQLVGDSANQAWGVMDVLAVTD
FPDVRYKVAIQSEQGNVLIIPREGGHLVRFYVEMDKLDADERVASRNITVEQLIATAQRVLHPYKLDVKNVPWWSVYEIG
QRICAKYDDVADAVATPDSPLPRVFIAGDACHTHSPKAGQGMNFSMQDSFNLGWKLAAVLRKQCAPELLHTYSSERQVVA
QQLIDFDREWAKMFSDPAKEGGQGGVDPKEFQKYFEQHGRFTAGVGTHYAPSLLTGQASHQALASGFTVGMRFHSAPVVR
VSDAKPLQLGHCGKADGRWRLYAFAGQNDLAQPESGLLALCRFLESDAASPLRRFTPSGQDIDSIFDLRAIFPQAYTEVA
LETLPALLLPPKGQLGMIDYEKVFSPDLKNAGQDIFELRGIDRQQGALVVVRPDQYVAQVLPLGDHAALSAYFESFMRA
>P32173 2.7.7.77~~~mobA~~~Molybdenum cofactor guanylyltransferase~~~COG0746
MNLMTTITGVVLAGGKARRMGGVDKGLLELNGKPLWQHVADALMTQLSHVVVNANRHQEIYQASGLKVIEDSLADYPGPL
AGMLSVMQQEAGEWFLFCPCDTPYIPPDLAARLNHQRKDAPVVWVHDGERDHPTIALVNRAIEPLLLEYLQAGERRVMVF
MRLAGGHAVDFSDHKDAFVNVNTPEELARWQEKR
>P9WJQ9 2.7.7.77~~~mobA~~~Probable molybdenum cofactor guanylyltransferase~~~COG0746
MAELAPDTVPLAGVVLAGGESRRMGRDKATLPLPGGTTTLVEHMVGILGQRCAPVFVMAAPGQPLPTLPVPVLRDELPGL
GPLPATGRGLRAAAEAGVRLAFVCAVDMPYLTVELIEDLARRAVQTDAEVVLPWDGRNHYLAAVYRTDLADRVDTLVGAG
ERKMSALVDASDALRIVMADSRPLTNVNSAAGLHAPMQPGR
>P65405 2.7.7.77~~~mobA~~~Probable molybdenum cofactor guanylyltransferase~~~
MKAIILAGGHSVRFGKPKAFAEVNGETFYSRVIKTLESTNMFNEIIISTNAQLATQFKYPNVVIDDENHNDKGPLAGIYT
IMKQHPEEELFFVVSVDTPMITGKAVSTLYQFLVSHLIENHLDVAAFKEDGRFIPTIAFYSPNALGAITKALHSDNYSFK
NIYHELSTDYLDVRDVDAPSYWYKNINYQHDLDALIQKL
>P32125 ~~~mobB~~~Molybdopterin-guanine dinucleotide biosynthesis adapter protein~~~COG1763
MAGKTMIPLLAFAAWSGTGKTTLLKKLIPALCARGIRPGLIKHTHHDMDVDKPGKDSYELRKAGAAQTIVASQQRWALMT
ETPDEEELDLQFLASRMDTSKLDLILVEGFKHEEIAKIVLFRDGAGHRPEELVIDRHVIAVASDVPLNLDVALLDINDVE
GLADFVVEWMQKQNG
>P44902 ~~~mobB~~~Molybdopterin-guanine dinucleotide biosynthesis adapter protein~~~COG1763
MIFKVIFMNNQIPLLGITGYSGSGKTTLLEKLIPELIARHIRVSVIKHSHHNMQVDKEGKDSWRMKEAGSSQVILANDER
WAIMTETPKPVSLDYLAQQFDRTLTDLVLVEGFKQEPIPKILLHRQEMTKPLPEIDEYVLAVATNYPLEIDRTLLDINRI
PQIADFIENWLHHFHGAR
>Q46810 2.7.7.76~~~mocA~~~Molybdenum cofactor cytidylyltransferase~~~COG2068
MSAIDCIITAAGLSSRMGQWKMMLPWEQGTILDTSIKNALQFCSRIILVTGYRGNELHERYANQSNITIIHNPDYAQGLL
TSVKAAVPAVQTEHCFLTHGDMPTLTIDIFRKIWSLRNDGAILPLHNGIPGHPILVSKPCLMQAIQRPNVTNMRQALLMG
DHYSVEIENAEIILDIDTPDDFITAKERYTEI
>P37329 ~~~modA~~~Molybdate-binding protein ModA~~~COG0725
MARKWLNLFAGAALSFAVAGNALADEGKITVFAAASLTNAMQDIATQFKKEKGVDVVSSFASSSTLARQIEAGAPADLFI
SADQKWMDYAVDKKAIDTATRQTLLGNSLVVVAPKASVQKDFTIDSKTNWTSLLNGGRLAVGDPEHVPAGIYAKEALQKL
GAWDTLSPKLAPAEDVRGALALVERNEAPLGIVYGSDAVASKGVKVVATFPEDSHKKVEYPVAVVEGHNNATVKAFYDYL
KGPQAAEIFKRYGFTIK
>P45323 ~~~modA~~~Molybdate-binding protein ModA~~~COG0725
MKKLTKISTALLIAGLGFSFAASAKVTVFAAASMTDALQQVAKDYAKQNPKNEVVFSFASSSTLAKQVEEGAPADIFVSA
SNKWMKYLSEKDLTVKETEKVLVGNDLVLIAPAKSAVNSVDIAKGEWINALKDSYLSVGDPAHVPAGQYAEEALTKLNLW
DKVKDRLARGKDVRGALALVERAEAPYGIVYSTDAKVSQQVKTVAVFPADSHKPVVYPVSIVKGHDNADSRDFLKYLESD
AAKKVLVGYGFSAK
>P9WGU3 ~~~modA~~~Molybdate-binding protein ModA~~~COG0725
MRWIGLSTGLVSAMLVAGLVACGSNSPASSPAGPTQGARSIVVFAAASLQSAFTQIGEQFKAGNPGVNVNFAFAGSSELA
TQLTQGATADVFASADTAQMDSVAKAGLLAGHPTNFATNTMVIVAAAGNPKKIRSFADLTRPGLNVVVCQPSVPCGSATR
RIEDATGIHLNPVSEELSVTDVLNKVITGQADAGLVYVSDALSVATKVTCVRFPEAAGVVNVYAIAVLKRTSQPALARQF
VAMVTAAAGRRILDQSGFAKP
>Q9I2N2 ~~~modA~~~Tungstate/molybdate/chromate-binding protein ModA~~~
MTTRLPQLLLALLASAVSLAASADEVQVAVAANFTAPIQAIAKEFEKDTGHRLVAAYGATGQFYTQIKNGAPFQVFLSAD
DSTPAKLEQEGEVVPGSRFTYAIGTLALWSPKAGYVDAEGEVLKSGSFRHLSIANPKTAPYGLAATQAMDKLGLAATLGP
KLVEGQNISQAYQFVSSGNAELGFVALSQIYKDGKVATGSAWIVPTELHDPIRQDAVILNKGKDNAAAKALVDYLKGAKA
AALIKSYGYEL
>Q8PHA1 ~~~modA~~~Molybdate-binding protein ModA~~~COG0725
MRMIGFWQRALCVLMLTLPVLASAQTAPVTVFAAASLKESMDEAATAYEKATGTPVRVSYAASSALARQIEQGAPADVFF
SADLEWMDYLQQHGLVLPAQRHNLLGNTLVLVAPASSKLRVDPRAPGAIAKALGENGRLAVGQTASVPAGKYAAAALRKL
GQWDSVSNRLAESESVRAALMLVSRGEAPLGIVYGSDARADAKVRVVATFPDDSHDAIVYPVAALKNSNNPATAAFVSWL
GSKPAKAIFARRGFSLKD
>P0AF01 ~~~modB~~~Molybdenum transport system permease protein ModB~~~COG4149
MILTDPEWQAVLLSLKVSSLAVLFSLPFGIFFAWLLVRCTFPGKALLDSVLHLPLVLPPVVVGYLLLVSMGRRGFIGERL
YDWFGITFAFSWRGAVLAAAVMSFPLMVRAIRLALEGVDVKLEQAARTLGAGRWRVFFTITLPLTLPGIIVGTVLAFARS
LGEFGATITFVSNIPGETRTIPSAMYTLIQTPGGESGAARLCIISIALAMISLLISEWLARISRERAGR
>P9WQL3 7.3.2.5~~~modC~~~Molybdenum import ATP-binding protein ModC~~~COG1118
MSKLQLRAVVADRRLDVEFSVSAGEVLAVLGPNGAGKSTALHVIAGLLRPDAGLVRLGDRVLTDTEAGVNVATHDRRVGL
LLQDPLLFPHLSVAKNVAFGPQCRRGMFGSGRARTRASALRWLREVNAEQFADRKPRQLSGGQAQRVAIARALAAEPDVL
LLDEPLTGLDVAAAAGIRSVLRSVVARSGCAVVLTTHDLLDVFTLADRVLVLESGTIAEIGPVADVLTAPRSRFGARIAG
VNLVNGTIGPDGSLRTQSGAHWYGTPVQDLPTGHEAIAVFPPTAVAVYPEPPHGSPRNIVGLTVAEVDTRGPTVLVRGHD
QPGGAPGLAACITVDAATELRVAPGSRVWFSVKAQEVALHPAPHQHASS
>P0A9G8 ~~~modE~~~DNA-binding transcriptional dual regulator ModE~~~COG2005
MQAEILLTLKLQQKLFADPRRISLLKHIALSGSISQGAKDAGISYKSAWDAINEMNQLSEHILVERATGGKGGGGAVLTR
YGQRLIQLYDLLAQIQQKAFDVLSDDDALPLNSLLAAISRFSLQTSARNQWFGTITARDHDDVQQHVDVLLADGKTRLKV
AITAQSGARLGLDEGKEVLILLKAPWVGITQDEAVAQNADNQLPGIISHIERGAEQCEVLMALPDGQTLCATVPVNEATS
LQQGQNVTAYFNADSVIIATLC
>P31060 ~~~modF~~~ABC transporter ATP-binding protein ModF~~~COG1119
MSSLQILQGTFRLSDTKTLQLPQLTLNAGDSWAFVGSNGSGKSALARALAGELPLLKGERQSQFSHITRLSFEQLQKLVS
DEWQRNNTDMLGPGEDDTGRTTAEIIQDEVKDAPRCMQLAQQFGITALLDRRFKYLSTGETRKTLLCQALMSEPDLLILD
EPFDGLDVASRQQLAERLASLHQSGITLVLVLNRFDEIPEFVQFAGVLADCTLAETGAKEELLQQALVAQLAHSEQLEGV
QLPEPDEPSARHALPANEPRIVLNNGVVSYNDRPILNNLSWQVNPGEHWQIVGPNGAGKSTLLSLVTGDHPQGYSNDLTL
FGRRRGSGETIWDIKKHIGYVSSSLHLDYRVSTTVRNVILSGYFDSIGIYQAVSDRQQKLVQQWLDILGIDKRTADAPFH
SLSWGQQRLALIVRALVKHPTLLILDEPLQGLDPLNRQLIRRFVDVLISEGETQLLFVSHHAEDAPACITHRLEFVPDGG
LYRYVLTKIY
>P9WJQ7 2.10.1.1~~~moeA1~~~Molybdopterin molybdenumtransferase 1~~~COG0303
MRSVEEQQARISAAAVAPRPIRVAIAEAQGLMCAEEVVTERPMPGFDQAAIDGYAVRSVDVAGVGDTGGVQVFADHGDLD
GRDVLTLPVMGTIEAGARTLSRLQPRQAVRVQTGAPLPTLADAVLPLRWTDGGMSRVRVLRGAPSGAYVRRAGDDVQPGD
VAVRAGTIIGAAQVGLLAAVGRERVLVHPRPRLSVMAVGGELVDISRTPGNGQVYDVNSYALAAAGRDACAEVNRVGIVS
NDPTELGEIVEGQLNRAEVVVIAGGVGGAAAEAVRSVLSELGEMEVVRVAMHPGSVQGFGQLGRDGVPTFLLPANPVSAL
VVFEVMVRPLIRLSLGKRHPMRRIVSARTLSPITSVAGRKGYLRGQLMRDQDSGEYLVQALGGAPGASSHLLATLAEANC
LVVVPTGAEQIRTGEIVDVAFLAQHG
>P9WJQ5 2.10.1.1~~~moaE2~~~Molybdopterin molybdenumtransferase 2~~~COG0303
MRSVQEHQRVVAEMMRACRPITVPLTQAQGLVLGGDVVAPLSLPVFDNSAMDGYAVRAEDTSGATPQNPVMLPVAEDIPA
GRADMLTLQPVTAHRIMTGAPVPTGATAIVPVEATDGGVDSVAIRQQATPGKHIRRSGEDVAAGTTVLHNGQIVTPAVLG
LAAALGLAELPVLPRQRVLVISTGSELASPGTPLQPGQIYESNSIMLAAAVRDAGAAVVATATAGDDVAQFGAILDRYAV
DADLIITSGGVSAGAYEVVKDAFGSADYRGGDHGVEFVKVAMQPGMPQGVGRVAGTPIVTLPGNPVSALVSFEVFIRPPL
RMAMGLPDPYRPHRSAVLTASLTSPRGKRQFRRAILDHQAGTVISYGPPASHHLRWLASANGLLDIPEDVVEVAAGTQLQ
VWDLT
>P12281 2.10.1.1~~~moeA~~~Molybdopterin molybdenumtransferase~~~COG0303
MEFTTGLMSLDTALNEMLSRVTPLTAQETLPLVQCFGRILASDVVSPLDVPGFDNSAMDGYAVRLADIASGQPLPVAGKS
FAGQPYHGEWPAGTCIRIMTGAPVPEGCEAVVMQEQTEQMDNGVRFTAEVRSGQNIRRRGEDISAGAVVFPAGTRLTTAE
LPVIASLGIAEVPVIRKVRVALFSTGDELQLPGQPLGDGQIYDTNRLAVHLMLEQLGCEVINLGIIRDDPHALRAAFIEA
DSQADVVISSGGVSVGEADYTKTILEELGEIAFWKLAIKPGKPFAFGKLSNSWFCGLPGNPVSATLTFYQLVQPLLAKLS
GNTASGLPARQRVRTASRLKKTPGRLDFQRGVLQRNADGELEVTTTGHQGSHIFSSFSLGNCFIVLERDRGNVEVGEWVE
VEPFNALFGGL
>P99139 2.10.1.1~~~moeA~~~Molybdopterin molybdenumtransferase~~~
MVVEKRNPIPVKEAIQRIVNQQSSMPAITVALEKSLNHILAEDIVATYDIPRFDKSPYDGFAIRSVDSQGASGQNRIEFK
VIDHIGAGSVSDKLVGDHEAVRIMTGAQIPNGADAVVMFEQTIELEDTFTIRKPFSKNENISLKGEETKTGDVVLKKGQV
INPGAIAVLATYGYAEVKVIKQPSVAVIATGSELLDVNDVLEDGKIRNSNGPMIRALAEKLGLEVGIYKTQKDDLDSGIQ
VVKEAMEKHDIVITTGGVSVGDFDYLPEIYKAVKAEVLFNKVAMRPGSVTTVAFADGKYLFGLSGNPSACFTGFELFVKP
AVKHMCGALEVFPQIIKATLMEDFTKANPFTRFIRAKATLTSAGATVVPSGFNKSGAVVAIAHANCMVMLPGGSRGFKAG
HTVDIILTESDAAEEELLL
>P12282 2.7.7.80~~~moeB~~~Molybdopterin-synthase adenylyltransferase~~~COG0476
MAELSDQEMLRYNRQIILRGFDFDGQEALKDSRVLIVGLGGLGCAASQYLASAGVGNLTLLDFDTVSLSNLQRQTLHSDA
TVGQPKVESARDALTRINPHIAITPVNALLDDAELAALIAEHDLVLDCTDNVAVRNQLNAGCFAAKVPLVSGAAIRMEGQ
ITVFTYQDGEPCYRCLSRLFGENALTCVEAGVMAPLIGVIGSLQAMEAIKMLAGYGKPASGKIVMYDAMTCQFREMKLMR
NPGCEVCGQ
>P9WMN7 ~~~moeZ~~~Probable adenylyltransferase/sulfurtransferase MoeZ~~~COG0476
MSTSLPPLVEPASALSREEVARYSRHLIIPDLGVDGQKRLKNARVLVIGAGGLGAPTLLYLAAAGVGTIGIVDFDVVDES
NLQRQVIHGVADVGRSKAQSARDSIVAINPLIRVRLHELRLAPSNAVDLFKQYDLILDGTDNFATRYLVNDAAVLAGKPY
VWGSIYRFEGQASVFWEDAPDGLGVNYRDLYPEPPPPGMVPSCAEGGVLGIICASVASVMGTEAIKLITGIGETLLGRLL
VYDALEMSYRTITIRKDPSTPKITELVDYEQFCGVVADDAAQAAKGSTITPRELRDWLDSGRKLALIDVRDPVEWDIVHI
DGAQLIPKSLINSGEGLAKLPQDRTAVLYCKTGVRSAEALAAVKKAGFSDAVHLQGGIVAWAKQMQPDMVMY
>P0DJO8 ~~~mogR~~~Motility gene repressor MogR~~~
MPKSEIRKLLQEIKKQVDNPGNSSTTEIKKMASEAGIDEQTAEEIYHLLTEFYQAVEEHGGIEKYMHSNISWLKIELELL
SACYQIAILEDMKVLDISEMLSLNDLRIFPKTPSQLQNTYYKLKKELIQVEDIPKNKPGRKRKTQKNTKKEKTNIFGKVV
PAEFKAPASIKEQISYDKSREKNLVDLLSGVKSNVQLLSENQGEENNVYDLLKSIYSLSSLAVQKEELDKKYQDLQTKCQ
ELEQENSYLKQQNETMTDSFHTLVLQVADFAYASDLDQIQALPLFSQQLVVTLNQLGVFKENYKQM
>P0AF03 2.7.7.75~~~mog~~~Molybdopterin adenylyltransferase~~~COG0521
MNTLRIGLVSISDRASSGVYQDKGIPALEEWLTSALTTPFELETRLIPDEQAIIEQTLCELVDEMSCHLVLTTGGTGPAR
RDVTPDATLAVADREMPGFGEQMRQISLHFVPTAILSRQVGVIRKQALILNLPGQPKSIKETLEGVKDAEGNVVVHGIFA
SVPYCIQLLEGPYVETAPEVVAAFRPKSARRDVSE
>P44645 2.7.7.75~~~mog~~~Molybdopterin adenylyltransferase~~~COG0521
MTALLKIGLVSVSDRASAGVYQDQGIPELQAWLEQALVDPFHLETRLIPDEQPVIEQTLKELVDEQGCHLVLTTGGTGPA
KRDVTPDATLAVADREMPGFGEQMRQVSLHFVPTAILSRQVGVIRKESLILNLPGQPKAIKETLEGVKDKEGNVLVKGIF
SAVPYCLQLINGLYIDTKPEIIESFRPKSARRENLEK
>Q9ZL45 2.7.7.75~~~mog~~~Molybdopterin adenylyltransferase~~~COG0521
MQTIHIGVLSASDRASKGVYEDLSGKAIQEVLSEYLLNPLEFHYEIVADERDLIEKSLIKMCDEYQCDLVVTTGGTGPAL
RDITPEATKKVCQKMLPGFGELMRMTSLKYVPTAILSRQSAGIRNKSLIINLPGKPKSIRECLEAVFPAIPYCVDLILGN
YMQVNEKNIQAFRPKQ
>P44206 ~~~molA~~~Molybdate-binding protein MolA~~~COG0614
MKLKSLLIACLLSSLSFSALADRIITDQLDRKVTIPDHINRAVVLQHQTLNIAVQLDATKQIVGVLSNWKKQLGKNYVRL
APELENMAMPGDLNSVNIESLLALKPDVVFVTNYAPSEMIKQISDVNIPVVAISLRTGEVGEKGKLNPTLTDEDKAYNDG
LKQGIELIAEVFEKKQQGDELVKAAFANRKLLADRLGDVSADKRVRTYMANPDLGTYGSGKYTGLMMEHAGAYNVAAATI
KGFKQVSLENVLEWNPAVILVQDRYPDVVPQILNDQGWANIQALKDKKVFLMPEYAKAWGYPMPEALALGEVWLAKALYP
QRFQDVDLDKMVNDYYQKFYRTSYKPDNAAR
>Q57130 ~~~molB~~~Molybdate import system permease protein MolB~~~COG0609
MQPDSYPKILFGLTLLLVITAVISLGIGRYSLSVPQIGQILWAKATALEIDPVQQQVIFQVRLPRILTALCVGAGLALSG
VVLQGIFRNPLVNPHIIGVTSGSAFGGTLAIFFGFSLYGLFTSTILFGFGTLALVFLFSFKFNQRSLLMLILIGMILSGL
FSALVSLLQYISDTEEKLPSIVFWLMGSFATSNWEKLLFFFVPFLLCSSILLSLSWRLNLLSLDEKEAKALGVKMAPLRW
LVIFLSGSLVACQVAISGSIGWVGLIIPHLSRMLVGANHQSLLPCTMLVGATYMLLVDNVARSLSDAEIPISILTALIGA
PLFGVLVYKLKRGGMNE
>Q57399 7.3.2.5~~~molC~~~Molybdate import ATP-binding protein MolC~~~COG1120
MNKALSVENLGFYYQAENFLFQQLNFDLNKGDILAVLGQNGCGKSTLLDLLLGIHRPIQGKIEVYQSIGFVPQFFSSPFA
YSVLDIVLMGRSTHINTFAKPKSHDYQVAMQALDYLNLTHLAKREFTSLSGGQRQLILIARAIASECKLILLDEPTSALD
LANQDIVLSLLIDLAQSQNMTVVFTTHQPNQVVAIANKTLLLNKQNFKFGETRNILTSENLTALFHLPMFEQQAQYKESF
FTHFVPLYKTLLK
>Q46203 ~~~ompA~~~Major outer membrane porin~~~
MKKLLKSALLFAATGSALSLQALPVGNPAEPSLLIDGTMWEGASGDPCDPCATWCDAISIRAGYYGDYVFDRVLKVDVNK
TFSGMAATPTQATGNASNTNQPEANGRPNIAYGRHMQDAEWFSNAAFLALNIWDRFDIFCTLGASNGYFKASSAAFNLVG
LIGFSAASSISTDLPMQLPNVGITQGVVEFYTDTSFSWSVGARGALWECGCATLGAEFQYAQSNPKIEMLNVTSSPAQFV
IHKPRGYKGASSNFPLPITAGTTEATDTKSATIKYHEWQVGLALSYRLNMLVPYIGVNWSRATFDADTIRIAQPKLKSEI
LNITTWNPSLIGSTTALPNNSGKDVLSDVLQIASIQINKMKSRKACGVAVGATLIDADKWSITGEARLINERAAHMNAQF
RF
>P10332 ~~~ompA~~~Major outer membrane porin~~~
MKKLLKSALLFAATGSALSLQALPVGNPAEPSLLIDGTMWEGASGDPCDPCATWCDAISIRAGYYGDYVFDRVLKVDVNK
TFSGMAATPTQATGNASNTNQPEANGRPNIAYGRHMQDAEWFSNAAFLALNIWDRFDIFCTLGASNGYFKSSSAAFNLVG
LIGFSATSSTSTELPMQLPNVGITQGVVEFYTDTSFSWSVGARGALWECGCATLGAEFQYAQSNPKIEVLNVTSSPAQFV
IHKPRGYKGASSNFPLPITAGTTEATDTKSATIKYHEWQVGLALSYRLNMLVPYIGVNWSRATFDADTIRIAQPKLKSEI
LNITTWNPSLLGSTTTLPNNGGKDVLSDVLQIASIQINKMKSRKACGVAVGATLIDADKWSITGEARLINERAAHMNAQF
RF
>P23732 ~~~ompA~~~Major outer membrane porin, serovar A~~~
MKKLLKSVLVFAALSSASSLQALPVGNPAEPSLMIDGILWEGFGGDPCDPCTTWCDAISMRMGYYGDFVFDRVLKTDVNK
EFQMGAAPTTRDVAGLEKDPVVNVARPNPAYGKHMQDAEMFTNAAYMALNIWDRFDVFCTLGATTGYLKGNSASFNLVGL
FGTKTQSSGFDTANIVPNTALNQAVVELYTDTTFAWSVGARAALWECGCATLGASFQYAQSKPKVEELNVLCNASEFTIN
KPKGYVGAEFPLDITAGTEAATGTKDASIDYHEWQASLALSYRLNMFTPYIGVKWSRVSFDADTIRIAQPKLAKPVLDTT
TLNPTIAGKGTVVSSAENELADTMQIVSLQLNKMKSRKSCGIAVGTTIVDADKYAVTVETRLIDERAAHVNAQFRF
>P23421 ~~~ompA~~~Major outer membrane porin, serovar B~~~
MKKLLKSVLVFAALSSASSLQALPVGNPAEPSLMIDGILWEGFGGDPCDPCTTWVDAISMRMGYYGDFVFDRVLKTDVNK
EFQMGAKPTTTTGNAVAPSTLTARENPAYGRHMQDAEMFTNAACMALNIWDRFDVFCTLGASSGYLKGNSASFNLVGLFG
NNENQTKVSNGAFVPNMSLDQSVVELYTDTAFAWSVGARAALWECGCATLGASFQYAQSKPKVEELNVLCNAAEFTINKP
KGYVGKELPLDLTAGTDAATGTKDASIDYHEWQASLALSYRLNMFTPYIGVKWSRASFDADTIRIAQPKSAETIFDVTTL
NPTIAGAGDVKTSAEGQLGDTMQIVSLQLNKMKSRKSCGIAVGTTIVDADKYAVTVETRLIDERAAHVNAQFRF
>P08780 ~~~ompA~~~Major outer membrane porin, serovar C~~~
MKKLLKSVLVFAALSSASSLQALPVGNPAEPSLMIDGILWEGFGGDPCDPCTTWCDAISMRVGYYGDFVFDRVLKTDVNK
EFQMGAAPTTSDVAGLQNDPTINVARPNPAYGKHMQDAEMFTNAAYMALNIWDRFDVFCTLGATTGYLKGNSASFNLVGL
FGTKTQSSSFNTAKLIPNTALNEAVVELYINTTFAWSVGARAALWECGCATLGASFQYAQSKPKVEELNVLCNASEFTIN
KPKGYVGAEFPLNITAGTEAATGTKDASIDYHEWQASLALSYRLNMFTPYIGVKWSRVSFDADTIRIAQPKLAEAILDVT
TLNRTTAGKGSVVSAGTDNELADTMQIVSLQLNKMKSRKSCGIAVGTTIVDADKYAVTVEARLIDERAAHVNAQFRF
>Q46409 ~~~ompA~~~Major outer membrane porin, serovar D~~~
MKKLLKSVLVFAALSSASSLQALPVGNPAEPSLMIDGILWEGFGGDPCDPCATWCDAISMRVGYYGDFVFDRVLKTDVNK
EFQMGAKPTTDTGNSAAPSTLTARENPAYGRHMQDAEMFTNAACMALNIWDRFDVFCTLGATSGYLKGNSASFNLVGLFG
DNENQKTVKAESVPNMSFDQSVVELYTDTTFAWSVGARAALWECGCATLGASFQYAQSKPKVEELNVLCNAAEFTINKPK
GYVGKEFPLDLTAGTDAATGTKDASIDYHEWQASLALSYRLNMFTPYIGVKWSRASFDADTIRIAQPKSATAIFDTTTLN
PTIAGAGDVKTGAEGQLGDTMQIVSLQLNKMKSRKSCGIAVGTTIVDADKYAVTVETRLIDERAAHVNAQFRF
>P17451 ~~~ompA~~~Major outer membrane porin, serovar E~~~
MKKLLKSVLVFAALSSASSLQALPVGNPAEPSLMIDGILWEGFGGDPCDPCTTWCDAISMRMGYYGDFVFDRVLKTDVNK
EFQMGDKPTSTTGNATAPTTLTARENPAYGRHMQDAEMFTNAACMALNIWDRFDVFCTLGASSGYLKGNSASFNLVGLFG
DNENQSTVKTNSVPNMSLDQSVVELYTDTAFSWSVGARAALWECGCATLGASFQYAQSKPKVEELNVLCNAAEFTINKPK
GYVGQEFPLALIAGTDAATGTKDASIDYHEWQASLALSYRLNMFTPYIGVKWSRASFDADTIRIAQPKSATAIFDTTTLN
PTIAGAGDVKASAEGQLGDTMQIVSLQLNKMKSRKSCGIAVGTTIVDADKYAVTVETRLIDERAAHVNAQFRF
>P16155 ~~~ompA~~~Major outer membrane porin, serovar F~~~
MKKLLKSVLVFAALSSASSLQALPVGNPAEPSLMIDGILWEGFGGDPCDPCTTWCDAISMRMGYYGDFVFDRVLKTDVNK
EFEMGEALAGASGNTTSTLSKLVERTNPAYGKHMQDAEMFTNAACMTLNIWDRFDVFCTLGATSGYLKGNSASFNLVGLF
GDGVNATKPAADSIPNVQLNQSVVELYTDTTFAWSVGARAALWECGCATLGASFQYAQSKPKIEELNVLCNAAEFTINKP
KGYVGKEFPLDLTAGTDAATGTKDASIDYHEWQASLSLSYRLNMFTPYIGVKWSRASFDSDTIRIAQPRLVTPVVDITTL
NPTIAGCGSVAGANTEGQISDTMQIVSLQLNKMKSRKSCGIAVGTTIVDADKYAVTVETRLIDERAAHVNAQFRF
>P13467 ~~~ompA~~~Major outer membrane porin, serovar H~~~
MKKLLKSVLVFAALSSASSLQALPVGNPAEPSLMIDGILWEGFGGDPCDPCATWCDAISMRVGYYGDFVFDRVLKTDVNK
EFQMGAAPTTNDAADLQNDPKTNVARPNPAYGKHMQDAEMFTNAAYMALNIWDRFDVFCTLGATTGYLKGNSASFNLVGL
FGTKTKSSDFNTAKLVPNIALNRAVVELYTDTTFAWSVGARAALWECGCATLGASFQYAQSKPKVEELNVLCNASEFTIN
KPKGYVGAEFPLDITAGTEAATGTKDASIDYHEWQASLALSYRLNMFTPYIGVKWSRVSFDADTIRIAQPKLAEAILDVT
TLNPTIAGKGTVVASGSDNDLADTMQIVSLQLNKMKSRKSCGIAVGTTIVDADKYAVTVETRLIDERAAHVNAQFRF
>Q9XBF4 ~~~ompA~~~Major outer membrane porin~~~
MKKLLKSALLSAAFAGSVGSLQALPVGNPSDPSLLIDGTIWEGAAGDPCDPCATWCDAISLRAGFYGDYVFDRILKVDAP
KTFSMGAKPTGSATANYTTAVDRPNPAYNKHLHDAEWFTNAGFIALNIWDRFDVFCTLGASNGYIKGNSTAFNLVGLFGV
KGTSVAANELPNVSLSNGVVELYTDTSFSWSVGARGALWECGCATLGAEFQYAQSKPKVEELNVICNVAQFSVNKPKGYK
GVAFPLPTDAGVATATGTKSATINYHEWQVGASLSYRLNSLVPYIGVQWSRATFDADNIRIAQPKLPTAVLNLTAWNPSL
LGNTTTLATSDSFSDFMQIVSCQINKFKSRKACGVTVGATLVDADKWSLTAEARLINERAAHVSGQFRF
>P19542 ~~~ompA~~~Major outer membrane porin, serovar L1~~~
MKKLLKSVLVFAALSSASSLQALPVGNPAEPSLMIDGILWEGFGGDPCDPCTTWCDAISMRMGYYGDFVFDRVLQTDVNK
EFQMGAKPTATTGNAAAPSTCTARENPAYGRHMQDAEMFTNAAYMALNIWDRFDVFCTLGATSGYLKGNSASFNLVGLFG
DNENQSTVKKDAVPNMSFDQSVVELYTDTTFAWSVGARAALWECGCATLGASFQYAQSKPKVEELNVLCNAAEFTINKPK
GYVGKEFPLDLTAGTDAATGTKDASIDYHEWQASLALSYRLNMFTPYIGVKWSRASFDADTIRIAQPKLATAIFDTTTLN
PTIAGAGEVKANAEGQLGDTMQIVSLQLNKMKSRKSCGIAVGTTIVDADKYAVTVETRLIDERAAHVNAQFRF
>P75024 ~~~ompA~~~Major outer membrane porin~~~
MKKLLKSVLAFAVLGSASSLHALPVGNPAEPSLMIDGILWEGFGGDPCDPCTTWCDAISLRLGYYGDFVFDRVLKTDVNK
QFEMGAAPTGDADLTTAPTPASRENPAYGKHMQDAEMFTNAAYMALNIWDRFDVFCTLGATSGYLKGNSAAFNLVGLFGR
DETAVAADDIPNVSLSQAVVELYTDTAFAWSVGARAALWECGCATLGASFQYAQSKPKVEELNVLCNAAEFTINKPKGYV
GQEFPLNIKAGTVSATDTKDASIDYHEWQASLALSYRLNMFTPYIGVKWSRASFDADTIRIAQPKLETSILKMTTWNPTI
SGSGIDVDTKITDTLQIVSLQLNKMKSRKSCGLAIGTTIVDADKYAVTVETRLIDERAAHVNAQFRF
>Q07430 ~~~ompA~~~Major outer membrane porin~~~
MKKLLKSALLSAAFAGSVGSLQALPVGNPSDPSLLIDGTIWEGAAGDPCDPCATWCDAISLRAGFYGDYVFDRILKIDAP
KTFSMGAKPTGSATANYTTAVDRPNPAYNKHLYDAEWFTNAGFIALNIWDRFDVFCTLGASNGYVKGNSAAFNLVGLFGV
KGTSVNANELPNVSLSNGVIELYTDTTFAWSVGARGALWECGCATLGAEFQYAQSKPKVEELNVICNVSQFSLNKPKGYK
GVAFPLPTDAGVVTAAGTKSATINYHEWQVGASLSYRLNSLVPYIGVQWSRATFDADNIRIAQPKLPTAILNLTAWNPSL
LGSATAVSSSDQFSDFMQIVSCQINKFKSRKACGVTVGATLVDADKWSLTAEARLINERAAHISGQFRF
>P23114 ~~~ompA~~~Major outer membrane porin, serovar L3~~~
MKKLLKSVLVFAALSSASSLQALPVGNPAEPSLMIDGILWEGFGGDPCDPCTTWCDAISMRVGYYGDFVFDRVLKTDVNK
EFQMGAEPTTSDTAGLSNDPTTNVARPNPAYGKHMQDAEMFTNAAYMALNIWDRFDVFCTLGATTGYLKGNSASFNLVGL
FGTKTQSTNFNTAKLVPNTALNQAVVELYTDTTFAWSVGARAALWECGCATLGASFQYAQSKPKVEELNVLCDASEFTIN
KPKGYVGAEFPLDITAGTEAATGTKDASIDYHEWQASLALSYRLNMFTPYIGVKWSRVSFDADTIRIAQPKLAEAVLDVT
TLNPTIAGKGSVVASGSENELADTMQIVSLQLNKMKSRKSCGIAVGTTIVDADKYAVTVETRLIDERAAHVNAQFRF
>Q00087 ~~~ompA~~~Major outer membrane porin~~~
MKKLLKSALLFAAAGSALSLQALPVGNPAEPSLLIDGTMWEGASGDPCDPCATWCDAISIRAGFYGDYVFDRILKVDVNK
TISGMAAAPTAASGTASNTTVAADRSNFAYGKHLQDAEWCTNAAYLALNIWDRFDVFCTLGASNGYFKASSDAFNLVGLI
GLAGTDFANQRPNVEISQGIVELYTDTAFSWSVGARGALWECGCATLGAEFQYAQSNPKIEMLNVTSSPAQFMIHKPRGY
KGTAANFPLPVAAGTATATDTKSATVKYHEWQVGLALSYRLNMLVPYIGVNWSRATFDADTIRIAQPKLASAILNLTTWN
PTLLGVATTLDTSNKYADFMQIVSMQINKMKSRKACGIAVGATLIDADKWSITGEARLIDERAAHINAQFRF
>Q46407 ~~~ompA~~~Major outer membrane porin~~~
MKKLLKSVLAFAVLGSASSLHALPVGNPAEPSLMIDGILWEGFGGDPCDPCTTWCDAISLRLGYYGDFVFDRVLKTDVNK
QFEMGPVPTTTDTDAAADITTSTPRENPAYGKHMQDAEMFTNAAYMALNIWDRFDVFCTLGATSGYLKGNSASFNLVGLF
GDGVANAANAIATVAADSLPNVSLSQAVVELYTDTAFAWSVGARAALWECGCATLGASFQYAQSKPKVEELNVLCNAAQF
TINKPKGYVGKEFPLALTAGTDSATDTKDASIDYHEWQASLALSYRLNMFTPYIGVKWSRASFDADTIRIAQPKLAEAIL
DVTTWNPTIAGAGTIADGTGAAATANGLADTLQIVSLQLNKMKSRKSCGLAIGTTIVDADKYAVTVETRLIDERAAHVNA
QFRF
>P16567 ~~~ompA~~~Major outer membrane porin~~~
MKKLLKSALLFAATGSALSLQALPVGNPAEPSLLIDGTMWEGASGDPCDPCSTWCDAISIRAGYYGDYVFDRVLKVDVNK
TITGMGAVPTGTAAANYKTPTDRPNIAYGKHLQDAEWFTNAAFLALNIWDRFDIFCTLGASNGYFKASSAAFNLVGLIGV
KGSSIAADQLPNVGITQGIVEFYTDTTFSWSVGARGALWECGCATLGAEFQYAQSNPKIEMLNVVSSPAQFVVHKPRGYK
GTAFPLPLTAGTDQATDTKSATIKYHEWQVGLALSYRLNMLVPYISVNWSRATFDADAIRIAQPKLAAAVLNLTTWNPTL
LGEATALDTSNKFADFLQIASIQINKMKSRKACGVAVGATLIDADKWSITGEARLINERAAHMNAQFRF
>P27455 ~~~ompA~~~Major outer membrane porin~~~
MKKLLKSALLSAAFAGSVGSLQALPVGNPSDPSLLIDGTIWEGAAGDPCDPCATWCDAISLRAGFYGDYVFDRILKVDAP
KTFSMGAKPTGSAAANYTTAVDRPNPAYNKHLHDAEWFTNAGFIALNIWDRFDVFCTLGASNGYIRGNSTAFNLVGLFGV
KGTTVNANELPNVSLSNGVVELYTDTSFSWSVGARGALWECGCATLGAEFQYAQSKPKVEELNVICNVSQFSVNKPKGYK
GVAFPLPTDAGVATATGTKSATINYHEWQVGASLSYRLNSLVPYIGVQWSRATFDADNIRIAQPKLPTAVLNLTAWNPSL
LGNATALSTTDSFSDFMQIVSCQINKFKSRKACGVTVGATLVDADKWSLTAEARLINERAAHVSGQFRF
>P06597 ~~~ompA~~~Major outer membrane porin~~~
MKKLLKSVLVFAALSSASSLQALPVGNPAEPSLMIDGILWEGFGGDPCDPCTTWCDAISMRMGYYGDFVFDRVLQTDVNK
EFQMGAKPTTATGNAAAPSTCTARENPAYGRHMQDAEMFTNAAYMALNIWDRFDVFCTLGATSGYLKGNSASFNLVGLFG
DNENHATVSDSKLVPNMSLDQSVVELYTDTTFAWSAGARAALWECGCATLGASFQYAQSKPKVEELNVLCNAAEFTINKP
KGYVGQEFPLDLKAGTDGVTGTKDASIDYHEWQASLALSYRLNMFTPYIGVKWSRASFDADTIRIAQPKSATTVFDVTTL
NPTIAGAGDVKASAEGQLGDTMQIVSLQLNKMKSRKSCGIAVGTTIVDADKYAVTVETRLIDERAAHVNAQFRF
>Q1I8U1 ~~~mnl~~~Monalysin~~~
MTIKEELGQPQSHSIELDEVSKEAASTRAALTSNLSGRFDQYPTKKGDFAIDGYLLDYSSPKQGCWVDGITVYGDIYIGK
QNWGTYTRPVFAYLQYVETISIPQNVTTTLSYQLTKGHTRSFETSVNAKYSVGANIDIVNVGSEISTGFTRSESWSTTQS
FTDTTEMKGPGTFVIYQVVLVYAHNATSAGRQNANAFAYSKTQAVGSRVDLYYLSAITQRKRVIVPSSNAVTPLDWDTVQ
RNVLMENYNPGSNSGHFSFDWSAYNDPHRRY
>P08854 ~~~mopII~~~Molybdenum-pterin-binding protein 2~~~
MSISARNQLKGKVVGLKKGVVTAEVVLEIAGGNKITSIISLDSVEELGVKEGAELTAVVKSTDVMILA
>Q08385 ~~~mopA~~~Molybdenum-pterin-binding protein MopA~~~
MNEQPLIAALSLQRAGAPRVGGDRIRLLEAIARHGTIAGAAREVGLSYKTAWDAVGTLNNLFEQPLVEAAPGGRTGGNAR
VTEAGQALIAGFGLLEGALTKALGVLEGGVSAPEKALNTLWSLTMRTSNRNTLRCTVTRVTLGAVNAEVELALTDGHSLT
AVITERSATEMGLAPGVEVFALIKASFVMLAAGGDPGRISACNRLTGIVAARTDGPVNTEIILDLGNCKSITAVITHTSA
DALGLAPGVPATALFKASHVILAMP
>Q08386 ~~~mopB~~~Molybdenum-pterin-binding protein MopB~~~
MAATKQGGGDDGRCARGVVLERTGARMGAERVALLAAIGRTGSISAAAREVGLSYKAAWDGVQAMNNLLAAPVVTAAPGG
KAGGGAVLTPAGEKLIAAYGAIEAGVAKLLSSFEKSLNLDPAEVLRGLSLRTSARNAWACKVWSVAADDVAAQVRMRLGE
GQDLTAVITARSAAEMRLAPGSEVLALVKSNFVLLAGAGVPERLSVRNRVRGRVIERIDAPLSSEVTLDLGGGKTITATI
TRDSAEMLDLHPGVETTALIKSSHVILALP
>Q43965 ~~~mopR~~~Phenol regulator MopR~~~
MSPAKDVKVYKKILEQNKDIQDLLDKIVFDAQHGQIWFDENRMLLMHTSILGFLRKDLYQMLGLERTKRFFIRCGYQAGM
RDAEVTSKLRPNLNEAEAFMAGPQMHGIRGMVQVEVNELHLSHDLKQFYADFNWLNSFEAEVHLSEFGASDQPACWMLLG
YACGYSSFVMGQTIIYQETHCVAQGDEHCRIIGKPLSEWENADELIRFMSPDAVSDEIIALQAELNQLKKNIYTEAESDY
TMFNAVGESVAYRKVCDLLKKAAGSKVAVLLQGETGVGKEAFARGIHNGSQRQAQPFVAVNCACIPPDLIESELFGVEKG
AFTGAVQSRMGKFERAHGGTIFLDEVVELSPRAQAALLRMLQEGEFERVGDSRTRQVDVRLVAATNEDLEQAVKDGKFRA
DLYYRLNIFPVIIPPLRERREDIPLLINHFLARFENMYNKTLKGLSDKAKNFMMKYDWPGNIRELENLLERATLLTDHQQ
EIKLDSLFPQHKDLEAVGETAQSLINVEDLFSENFSLDQLEQNIIRSAMDKSQQNVSEAARMLGISRATLDYRLKKITLG
>Q46509 1.2.99.7~~~mop~~~Aldehyde oxidoreductase~~~
MIQKVITVNGIEQNLFVDAEALLSDVLRQQLGLTGVKVGCEQGQCGACSVILDGKVVRACVTKMKRVADGAQITTIEGVG
QPENLHPLQKAWVLHGGAQCGFCSPGFIVSAKGLLDTNADPSREDVRDWFQKHRNACRCTGYKPLVDAVMDAAAVINGKK
PETDLEFKMPADGRIWGSKYPRPTAVAKVTGTLDYGADLGLKMPAGTLHLAMVQAKVSHANIKGIDTSEALTMPGVHSVI
THKDVKGKNRITGLITFPTNKGDGWDRPILCDEKVFQYGDCIALVCADSEANARAAAEKVKVDLEELPAYMSGPAAAAED
AIEIHPGTPNVYFEQPIVKGEDTGPIFASADVTVEGDFYVGRQPHMPIEPDVAFAYMGDDGKCYIHSKSIGVHLHLYMIA
PGVGLEPDQLVLVANPMGGTFGYKFSPTSEALVAVAAMATGRPVHLRYNYQQQQQYTGKRSPWEMNVKFAAKKDGTLLAM
ESDWLVDHGPYSEFGDLLTLRGAQFIGAGYNIPNIRGLGRTVATNHVWGSAFRGYGAPQSMFASECLMDMLAEKLGMDPL
ELRYKNAYRPGDTNPTGQEPEVFSLPDMIDQLRPKYQAALEKAQKESTATHKKGVGISIGVYGSGLDGPDASEAWAELNA
DGTITVHTAWEDHGQGADIGCVGTAHEALRPMGVAPEKIKFTWPNTATTPNSGPSGGSRQQVMTGNAIRVACENLLKACE
KPGGGYYTYDELKAADKPTKITGNWTASGATHCDAVTGLGKPFVVYMYGVFMAEVTVDVATGQTTVDGMTLMADLGSLCN
QLATDGQIYGGLAQGIGLALSEDFEDIKKHATLVGAGFPFIKQIPDKLDIVYVNHPRPDGPFGASGVGELPLTSPHAAII
NAIKSATGVRIYRLPAYPEKVLEALKA
>Q02198 1.1.1.218~~~morA~~~Morphine 6-dehydrogenase~~~
MAGKSPLINLNNGVKMPALGLGVFAASAEETASAIASAISSGYRLIDTARSYNNEAQVGEGIRNSGVDRAEMFVTTKLFN
CDYGYERALRAFDESLGRLGLDYVDLYLLHWPTKDWNATIQSWKAAEKILGDGRARAIGVCNFLEDQLDELIAASDVVPA
VNQIELHPYFAQKPLLAKNRALGIVTEAWSPIGGAINDGDGDNHGGRKHPLTDPVITTIAEAHGRSAAQVILRWHFQNDV
VAIPKSVNPERIAKNIDVFDFALSDAEMAQLDELDTGVRIGPDPRDVDTSSFAEFV
>P84308 ~~~mosA~~~Molybdenum storage protein subunit alpha~~~COG0528
MTDTTNSIKHVISPLARQTLQDRDLTRPVAGKRPIRLLPWLQVVKIGGRVMDRGADAILPLVEELRKLLPEHRLLILTGA
GVRARHVFSVGLDLGLPVGSLAPLAASEAGQNGHILAAMLASEGVSYVEHPTVADQLAIHLSATRAVVGSAFPPYHHHEF
PGSRIPPHRADTGAFLLADAFGAAGLTIVENVDGIYTADPNGPDRGQARFLPETSATDLAKSEGPLPVDRALLDVMATAR
HIERVQVVNGLVPGRLTAALRGEHVGTLIRTGVRPA
>P84253 ~~~mosB~~~Molybdenum storage protein subunit beta~~~COG0528
MANSTAELEELLMQRSLTDPQLQAAAAAAADFRILPDATVIKIGGQSVIDRGRAAVYPLVDEIVAARKNHKLLIGTGAGT
RARHLYSIAAGLGLPAGVLAQLGSSVADQNAAMLGQLLAKHGIPVVGGAGLSAVPLSLAEVNAVVFSGMPPYKLWMRPAA
EGVIPPYRTDAGCFLLAEQFGCKQMIFVKDEDGLYTANPKTSKDATFIPRISVDEMKAKGLHDSILEFPVLDLLQSAQHV
REVQVVNGLVPGNLTRALAGEHVGTIITAS
>E6UBR9 2.4.1.319~~~~~~Beta-1,4-mannooligosaccharide phosphorylase~~~COG2152
MKTQIINGVSLPNIPWQDKPADCKDVIWRYDANPIIPRDQLPTSNSIFNSAVVPYESEKGKFAGVFRVDDKCRNMELHAG
FSKDGIHWDINPDRIVFEQAEKSTEEVNQWGYGYDPRVCFIEDRFWVTWCNAYGWKPTIGVAYTFDFKTFYQCENAFLPF
NRNGVLFPRKINGKYVMFSRPSDSGHTPFGDMFISQSPDMKYWGEHRHVMGPLRAWESKKIGAGPIPIETSEGWLCFYHG
VLESCNGFVYSFSACILDKDEPWKVKYRCAEYLLSPQKIYECVGDVQNVTFPCATLVDADTGRIAIYYGCADTCVSMAFT
TVDDVVDYVKSHSSV
>O67122 ~~~motA~~~Motility protein A~~~COG1291
MDVGTIIGIIAAFLLILISILIGGSITAFINVPSIFIVVGGGMAAAMGAFPLKDFIRGVLAIKKAFLWKPPDLNDVIETI
GEIASKVRKEGILALEGDIELYYQKDPLLGDMIRMLVDGIDINDIKATAEMALAQLDEKMSTEVAVWEKLADLFPAFGMI
GTLIGLIQMLRNLNDPSALGPGMAVALITTLYGAILANAFAIPVANKLKKAKDMEVLVKTIYIEAIEKIQKGENPNVVKQ
EAAIMLGVELPEEV
>P28611 ~~~motA~~~Motility protein A~~~COG1291
MDKTSLIGIILAFVALSVGMVLKGVSFSALANPAAILIIIAGTISAVVIAFPTKEIKKVPTLFRVLFKENKQLTIEELIP
MFSEWAQLARREGLLALEASIEDVDDAFLKNGLSMAVDGQSAEFIRDIMTEEVEAMEDRHQAGAAIFTQAGTYAPTLGVL
GAVIGLIAALSHMDNTDELGHAISAAFVATLLGIFTGYVLWHPFANKLKRKSKQEVKLREVMIEGVLSVLEGQAPKVIEQ
KLLMYLPAKDRLKFAEQGEAQNGEKKEEEA
>P09348 ~~~motA~~~Motility protein A~~~COG1291
MLILLGYLVVLGTVFGGYLMTGGSLGALYQPAELVIIAGAGIGSFIVGNNGKAIKGTLKALPLLFRRSKYTKAMYMDLLA
LLYRLMAKSRQMGMFSLERDIENPRESEIFASYPRILADSVMLDFIVDYLRLIISGHMNTFEIEALMDEEIETHESEAEV
PANSLALVGDSLPAFGIVAAVMGVVHALGSADRPAAELGALIAHAMVGTFLGILLAYGFISPLATVLRQKSAETSKMMQC
VKVTLLSNLNGYAPPIAVEFGRKTLYSSERPSFIELEEHVRAVKNPQQQTTTEEA
>P28612 ~~~motB~~~Motility protein B~~~COG1360
MARKKKKKHEDEHVDESWLVPYADILTLLLALFIVLYASSSIDAAKFQMLSKSFNEVFTGGTGVLDYSSVTPPENESDGI
DEVKKEKEEKEKNKKEKEKAADQEELENVKSQVEKFIKDKKLEHQLETKMTSEGLLITIKDSIFFDSGKATIRKEDVPLA
KEISNLLVINPPRNIIISGHTDNMPIKNSEFQSNWHLSVMRAVNFMGLLIENPKLDAKVFSAKGYGEYKPVASNKTAEGR
SKNRRVEVLILPRGAAETNEK
>P0AF06 ~~~motB~~~Motility protein B~~~COG1360
MKNQAHPIIVVKRRKAKSHGAAHGSWKIAYADFMTAMMAFFLVMWLISISSPKELIQIAEYFRTPLATAVTGGDRISNSE
SPIPGGGDDYTQSQGEVNKQPNIEELKKRMEQSRLRKLRGDLDQLIESDPKLRALRPHLKIDLVQEGLRIQIIDSQNRPM
FRTGSADVEPYMRDILRAIAPVLNGIPNRISLSGHTDDFPYASGEKGYSNWELSADRANASRRELMVGGLDSGKVLRVVG
MAATMRLSDRGPDDAVNRRISLLVLNKQAEQAILHENAESQNEPVSALEKPEVAPQVSVPTMPSAEPR
>P56427 ~~~motB~~~Motility protein B~~~COG1360
MAKKNKPTECPAGEKWAVPYADFLSLLLALFIALYAISAVNKSKVEALKTEFIKIFNYAPKPEAMQPVVVIPPDSGKEEE
QMASESSKPASQNTETKATIARKGEGSVLEQIDQGSILKLPSNLLFENATSDAINQDMMLYIERIAKIIQKLPKRVHINV
RGFTDDTPLVKTRFKSHYELAANRAYRVMKVLIQYGVNPNQLSFSSYGSTNPIAPNDSLENRMKNNRVEIFFSTDANDLS
KIHSILDNEFNPHKQQE
>P55892 ~~~motB~~~Motility protein B~~~
MKNQAHPIVVVKRRRHKPHGGGAHGSWKIAYADFMTAMMAFFLVMWLISISSPKELIQIAEYFRTPLATAVTGGNRIANS
ESPIPGGGDDYTQQQGEVEKQPNIDELKKRMEQSRLNKLRGDLDQLIESDPKLRALRPHLKIDLVQEGLRIQIIDSQNRP
MFKTGSAEVEPYMRDILRAIAPVLNGIPNRISLAGHTDDFPYANGEKGYSNWELSADRANASRRELVAGGLDNGKVLRVV
GMAATMRLSDRGPDDAINRRISLLVLNKQAEQAILHENAESQNEPVSVLQQPAAAPPASVPTSPKAEPR
>Q79FN7 ~~~moxR1~~~Chaperone MoxR1~~~COG0714
MTSAGGFPAGAGGYQTPGGHSASPAHEAPPGGAEGLAAEVHTLERAIFEVKRIIVGQDQLVERMLVGLLSKGHVLLEGVP
GVAKTLAVETFARVVGGTFSRIQFTPDLVPTDIIGTRIYRQGREEFDTELGPVVANFLLADEINRAPAKVQSALLEVMQE
RHVSIGGRTFPMPSPFLVMATQNPIEHEGVYPLPEAQRDRFLFKINVGYPSPEEEREIIYRMGVTPPQAKQILSTGDLLR
LQEIAANNFVHHALVDYVVRVVFATRKPEQLGMNDVKSWVAFGASPRASLGIIAAARSLALVRGRDYVIPQDVIEVIPDV
LRHRLVLTYDALADEISPEIVINRVLQTVALPQVNAVPQQGHSVPPVMQAAAAASGR
>P9WIP1 ~~~~~~Immunogenic protein MPT63~~~
MKLTTMIKTAVAVVAMAAIATFAAPVALAAYPITGKLGSELTMTDTVGQVVLGWKVSDLKSSTAVIPGYPVAGQVWEATA
TVNAIRGSVTPAVSQFNARTADGINYRVLWQAAGPDTISGATIPQGEQSTGKIYFDVTGPSPTIVAMNNGMEDLLIWEP
>P0A5Q5 ~~~~~~Immunogenic protein MPB64~~~
MRIKIFMLVTAVVLLCCSGVATAAPKTYCEELKGTDTGQACQIQMSDPAYNINISLPSYYPDQKSLENYIAQTRDKFLSA
ATSSTPREAPYELNITSATYQSAIPPRGTQAVVLKVYQNAGGTHPTTTYKAFDWDQAYRKPITYDTLWQADTDPLPVVFP
IVQGELSKQTGQQVSIAPNAGLDPVNYQNFAVTNDGVIFFFNPGELLPEAAGPTQVLVPRSAIDSMLA
>P9WIN9 ~~~~~~Immunogenic protein MPT64~~~
MRIKIFMLVTAVVLLCCSGVATAAPKTYCEELKGTDTGQACQIQMSDPAYNINISLPSYYPDQKSLENYIAQTRDKFLSA
ATSSTPREAPYELNITSATYQSAIPPRGTQAVVLKVYQNAGGTHPTTTYKAFDWDQAYRKPITYDTLWQADTDPLPVVFP
IVQGELSKQTGQQVSIAPNAGLDPVNYQNFAVTNDGVIFFFNPGELLPEAAGPTQVLVPRSAIDSMLA
>P0A669 ~~~~~~Immunogenic protein MPB70~~~
MKVKNTIAATSFAAAGLAALAVAVSPPAAAGDLVGPGCAEYAAANPTGPASVQGMSQDPVAVAASNNPELTTLTAALSGQ
LNPQVNLVDTLNSGQYTVFAPTNAAFSKLPASTIDELKTNSSLLTSILTYHVVAGQTSPANVVGTRQTLQGASVTVTGQG
NSLKVGNADVVCGGVSTANATVYMIDSVLMPPA
>P9WNF5 ~~~~~~Immunogenic protein MPT70~~~COG2335
MKVKNTIAATSFAAAGLAALAVAVSPPAAAGDLVGPGCAEYAAANPTGPASVQGMSQDPVAVAASNNPELTTLTAALSGQ
LNPQVNLVDTLNSGQYTVFAPTNAAFSKLPASTIDELKTNSSLLTSILTYHVVAGQTSPANVVGTRQTLQGASVTVTGQG
NSLKVGNADVVCGGVSTANATVYMIDSVLMPPA
>P0CAX7 ~~~~~~Cell surface glycolipoprotein MPB83~~~
MINVQAKPAAAASLAAIAIAFLAGCSSTKPVSQDTSPKPATSPAAPVTTAAMADPAADLIGRGCAQYAAQNPTGPGSVAG
MAQDPVATAASNNPMLSTLTSALSGKLNPDVNLVDTLNGGEYTVFAPTNAAFDKLPAATIDQLKTDAKLLSSILTYHVIA
GQASPSRIDGTHQTLQGADLTVIGARDDLMVNNAGLVCGGVHTANATVYMIDTVLMPPAQ
>C1AFY9 ~~~~~~Cell surface glycolipoprotein MPB83~~~
MINVQAKPAAAASLAAIAIAFLAGCSSTKPVSQDTSPKPATSPAAPVTTAAMADPAADLIGRGCAQYAAQNPTGPGSVAG
MAQDPVATAASNNPMLSTLTSALSGKLNPDVNLVDTLNGGEYTVFAPTNAAFDKLPAATIDQLKTDAKLLSSILTYHVIA
GQASPSRIDGTHQTLQGADLTVIGARDDLMVNNAGLVCGGVHTANATVYMIDTVLMPPAQ
>P9WNF3 ~~~~~~Cell surface glycolipoprotein MPT83~~~COG2335
MINVQAKPAAAASLAAIAIAFLAGCSSTKPVSQDTSPKPATSPAAPVTTAAMADPAADLIGRGCAQYAAQNPTGPGSVAG
MAQDPVATAASNNPMLSTLTSALSGKLNPDVNLVDTLNGGEYTVFAPTNAAFDKLPAATIDQLKTDAKLLSSILTYHVIA
GQASPSRIDGTHQTLQGADLTVIGARDDLMVNNAGLVCGGVHTANATVYMIDTVLMPPAQ
>P0ACV7 3.4.17.-~~~mpaA~~~Murein peptide amidase A~~~COG2866
MTVTRPRAERGAFPPGTEHYGRSLLGAPLIWFPAPAASRESGLILAGTHGDENSSVVTLSCALRTLTPSLRRHHVVLCVN
PDGCQLGLRANANGVDLNRNFPAANWKEGETVYRWNSAAEERDVVLLTGDKPGSEPETQALCQLIHRIQPAWVVSFHDPL
ACIEDPRHSELGEWLAQAFELPLVTSVGYETPGSFGSWCADLNLHCITAEFPPISSDEASEKYLFAMANLLRWHPKDAIR
PS
>P0ACV6 3.4.17.-~~~mpaA~~~Murein peptide amidase A~~~COG2866
MTVTRPRAERGAFPPGTEHYGRSLLGAPLIWFPAPAASRESGLILAGTHGDENSSVVTLSCALRTLTPSLRRHHVVLCVN
PDGCQLGLRANANGVDLNRNFPAANWKEGETVYRWNSAAEERDVVLLTGDKPGSEPETQALCQLIHRIQPAWVVSFHDPL
ACIEDPRHSELGEWLAQAFELPLVTSVGYETPGSFGSWCADLNLHCITAEFPPISSDEASEKYLFAMANLLRWHPKDAIR
PS
>A7N805 3.4.17.-~~~mpaA~~~Murein peptide amidase A~~~
MNRYYSNNQEITVSLIPRTERAAFLITPTSYGKSVLGAPLLYFPAQVESNSRGLILAGTHGDETASIAGLSCALRSLPAE
CLKHDVILSMNPDANQLGTRANANQVDLNRAFPTQNWTEHGTVYRWSSHTPVRDVKVKTGDKEQLEPEVDALISLIELRR
PKFVVSFHEPLAFVDDPAHSDLAKWLGKQFNLPIVDDVDYETPGSFGTWCNERQLPCITVELPPISADLTIEKHLDAFIA
LLQHDPDL
>Q3YAT3 1.1.1.400~~~mpdB~~~2-methyl-1,2-propanediol dehydrogenase~~~
MTTSADQTDVLVIGSGPGGAGVTLKLVQAGYKVTCLEQGPWVTPPEHPHYHREWEIEKQRGWAYDPNVRGLPEDYPVTGF
TTPYLMNNVGGSTMHYAGHWPRYKPVDFRKGTEHGLEGTIDWPISYEELAPYYDENDAIYGISGMVGDPSYPDRTGVDRD
PPVKPGKLGRNFAQALGDLGWHWWPSDNAIITRPREGREADIAAGNELSGSPTGSLSTPTHTHWPTAIALGADLRTHARV
EQIHTKNGKATGATYIDTRTGARHEINAKIVVVSASGIGTPRLLLMSAQKGHPDGLANSNGLVGKYLMHHILRVLASVVR
TSRMEGYKGAFGAPLYSHEFYHTDTNRGFVNGFGMQVARSFGAAYTAMGSHTGYVAPWGKSHRKFFNEHFGNHLMVFMFG
EDLPVETNCVTLDPDAKDSSGLPAARVNWEPHENDIALANYGIDRIFEAARALGAVETNDTGVLNPPPGWHLMGTCRMGN
NPEDSVTNKWHQTWDVPNLFVVDGSSLTTGGAVNPTSTIGALAVRAGDYISRRFSDIVDQRTTPSNEDAPAI
>Q3YAT5 1.2.1.98~~~mpdC~~~Hydroxyisobutyraldehyde dehydrogenase~~~
MTRTLSADADTRTATPPLMYVNGEWLPARSGATFPTIEPSTGRPITEIPRGDSSDVDAAVKAAADVAVEWQFTDAITRAA
LLRRLAELVAENAEELARIESLDSGHYLAKARELVTAIPLWLEYWAGAADKVGGRTIAVPGNKLSFTLLEPLGVTAHIIP
WNYPLLILARSIAPALALGNTCVVKPAEDTSLSALKFAELVHAAGFPAGVFNVVTGYGSEAGAALAAHPEVRGITFTGST
ETGREIARLGGQHIAQVNLELGGKSPLVVFPDAPLEDAVEVAVQGFCSRAGQVCVAGSRLFLHEDIADRFLEMLVSRLET
VTVGDPFDGATQMGPLASKKHYDRVREYIEVGKQEATLLYGGGRPTDTPDDGFFVEPTVFVDVATDARIAREEIFGPVTA
VMRWSSVDDLIATINDSEFGLFAVLWCRDITSALDTAKRLQVGSVMINDWFGELPMTPHGGHKQSGTGREEGLEAVHGYT
QVKHIGINLEPSPAKSADWAGAPL
>P76329 3.1.3.70~~~yedP~~~Mannosyl-3-phosphoglycerate phosphatase~~~COG3769
MFSIQQPLLVFSDLDGTLLDSHSYDWQPAAPWLTRLREANVPVILCSSKTSAEMLYLQKTLGLQGLPLIAENGAVIQLAE
QWQEIDGFPRIISGISHGEISLVLNTLREKEHFKFTTFDDVDDATIAEWTGLSRSQAALTQLHEASVTLIWRDSDERMAQ
FTARLNELGLQFMQGARFWHVLDASAGKDQAANWIIATYQQLSGKRPTTLGLGDGPNDAPLLEVMDYAVIVKGLNREGVH
LHDEDPARVWRTQREGPEGWREGLDHFFSAR
>P37773 6.3.2.45~~~mpl~~~UDP-N-acetylmuramate--L-alanyl-gamma-D-glutamyl-meso-2,6-diaminoheptandioate ligase~~~COG0773
MRIHILGICGTFMGGLAMLARQLGHEVTGSDANVYPPMSTLLEKQGIELIQGYDASQLEPQPDLVIIGNAMTRGNPCVEA
VLEKNIPYMSGPQWLHDFVLRDRWVLAVAGTHGKTTTAGMATWILEQCGYKPGFVIGGVPGNFEVSAHLGESDFFVIEAD
EYDCAFFDKRSKFVHYCPRTLILNNLEFDHADIFDDLKAIQKQFHHLVRIVPGQGRIIWPENDINLKQTMAMGCWSEQEL
VGEQGHWQAKKLTTDASEWEVLLDGEKVGEVKWSLVGEHNMHNGLMAIAAARHVGVAPADAANALGSFINARRRLELRGE
ANGVTVYDDFAHHPTAILATLAALRGKVGGTARIIAVLEPRSNTMKMGICKDDLAPSLGRADEVFLLQPAHIPWQVAEVA
EACVQPAHWSGDVDTLADMVVKTAQPGDHILVMSNGGFGGIHQKLLDGLAKKAEAAQ
>P77348 ~~~mppA~~~Periplasmic murein peptide-binding protein~~~COG4166
MKHSVSVTCCALLVSSISLSYAAEVPSGTVLAEKQELVRHIKDEPASLDPAKAVGLPEIQVIRDLFEGLVNQNEKGEIVP
GVATQWKSNDNRIWTFTLRDNAKWADGTPVTAQDFVYSWQRLVDPKTLSPFAWFAALAGINNAQAIIDGKATPDQLGVTA
VDAHTLKIQLDKPLPWFVNLTANFAFFPVQKANVESGKEWTKPGNLIGNGAYVLKERVVNEKLVVVPNTHYWDNAKTVLQ
KVTFLPINQESAATKRYLAGDIDITESFPKNMYQKLLKDIPGQVYTPPQLGTYYYAFNTQKGPTADQRVRLALSMTIDRR
LMTEKVLGTGEKPAWHFTPDVTAGFTPEPSPFEQMSQEELNAQAKTLLSAAGYGPQKPLKLTLLYNTSENHQKIAIAVAS
MWKKNLGVDVKLQNQEWKTYIDSRNTGNFDVIRASWVGDYNEPSTFLTLLTSTHSGNISRFNNPAYDKVLAQASTENTVK
ARNADYNAAEKILMEQAPIAPIYQYTNGRLIKPWLKGYPINNPEDVAYSRTMYIVKH
>Q643C8 2.1.1.281~~~mppJ~~~Phenylpyruvate C(3)-methyltransferase~~~
MSTEVSEAQARRAVADIFNSTLASSAIGAAWELGALDELRENGKLDVSDFAVRHDLHEPAVVGMFTALASVGIVRREGAT
VVVGPYFDEANHHRSLFHWLNQGSGELFRRMPQVLPNENRTGKFYQRDAGAISYACREISERYFDPAFWAAVDGLGYTPT
TVADLGSGSGERLIQIARRFPGVRGLGVDIADGAIAMAEKEVAAKGFGDQISFVRGDARTIDQVSARGEFAEVDLLTCFM
MGHDFWPRENCVQTLRKLRAAFPNVRRFLLGDATRTVGIPDRELPVFTLGFEFGHDMMGVYLPTLDEWDGVFEEGGWRCV
KKHAIDSLSVSVVFELE
>Q643C1 1.14.11.40~~~mppO~~~Enduracididine beta-hydroxylase~~~
MLTLHLQDDDVAAIDAVADELSRRYDSVESTEFQAESRLYADELPRRVRRALHEYRSTEKSGILVVTGLPVDDSALGATP
ADRRHKPVPSTSLRQDIAFYLIANLLGDPIGWATQQDGFIMHDVYPVQGFEHEQIGWGSEETLTWHTEDAFHPLRTDYLG
LMCLRNPDGVETTACDIADVEIDDETRETLSQERFRILPDDAHRIHGKAPGDESARESALRERSRQRVASALESPDPVAV
LFGDRDDPYLRIDPHYMQGVQGETEQRALETIGAAIDDAMSGVVLSPGDIVFIDNYRVVHGRKPFRARFDGTDRWLRRLN
IARDLRKSREARLAATTRVIY
>P0ACR9 ~~~mprA~~~Transcriptional repressor MprA~~~COG1846
MDSSFTPIEQMLKFRASRHEDFPYQEILLTRLCMHMQSKLLENRNKMLKAQGINETLFMALITLESQENHSIQPSELSCA
LGSSRTNATRIADELEKRGWIERRESDNDRRCLHLQLTEKGHEFLREVLPPQHNCLHQLWSALSTTEKDQLEQITRKLLS
RLDQMEQDGVVLEAMS
>Q7U0X4 ~~~mprA~~~Response regulator MprA~~~
MSVRILVVDDDRAVRESLRRSLSFNGYSVELAHDGVEALDMIASDRPDALVLDVMMPRLDGLEVCRQLRSTGDDLPILVL
TARDSVSERVAGLDAGADDYLPKPFALEELLARMRALLRRTKPEDAAESMAMRFSDLTLDPVTREVNRGQRRISLTRTEF
ALLEMLIANPRRVLTRSRILEEVWGFDFPTSGNALEVYVGYLRRKTEADGEPRLIHTVRGVGYVLRETPP
>A0R3I8 ~~~mprA~~~Response regulator MprA~~~COG0745
MAVRILVVDDDRAVRESLRRSLSFNGYSVELAQDGVEALDAITNNRPDALILDVMMPRLDGLEVCRQLRSTGDDLPILVL
TARDSVSERVAGLDAGADDYLPKPFALEELLARMRALLRRTVSDDSGDSQKMTFSDLTLDPVTREVTRGGRQISLTRTEF
SLLEMLIANPRRVLTRSRILEEVWGFDFPTSGNALEVYIGYLRRKTEAEGEPRLIHTVRGVGYVLRETPP
>P9WGM9 ~~~mprA~~~Response regulator MprA~~~COG0745
MSVRILVVDDDRAVRESLRRSLSFNGYSVELAHDGVEALDMIASDRPDALVLDVMMPRLDGLEVCRQLRGTGDDLPILVL
TARDSVSERVAGLDAGADDYLPKPFALEELLARMRALLRRTKPEDAAESMAMRFSDLTLDPVTREVNRGQRRISLTRTEF
ALLEMLIANPRRVLTRSRILEEVWGFDFPTSGNALEVYVGYLRRKTEADGEPRLIHTVRGVGYVLRETPP
>Q7U0X3 2.7.13.3~~~mprB~~~Signal transduction histidine-protein kinase/phosphatase MprB~~~
MWWFRRRDRAPLRATSSLSLRWRVMLLAMSMVAMVVVLMSFAVYAVISAALYSDIDNQLQSRAQLLIASGSLAADPGKAI
EGTAYSDVNAMLVNPGQSIYTAQQPGQTLPVGAAEKAVIRGELFMSRRTTADQRVLAIRLTNGSSLLISKSLKPTEAVMN
KLRWVLLIVGGIGVAVAAVAGGMVTRAGLRPVGRLTEAAERVARTDDLRPIPVFGSDELARLTEAFNLMLRALAESRERQ
ARLVTDAGHELRTPLTSLRTNVELLMASMAPGAPRLPKQEMVDLRADVLAQIEELSTLVGDLVDLSRGDAGEVVHEPVDM
ADVVDRSLERVRRRRNDIHFDVEVIGWQVYGDTAGLSRMALNLMDNAAKWSPPGGHVGVRLSQLDASHAELVVSDRGPGI
PVQERRLVFERFYRSASARALPGSGLGLAIVKQVVLNHGGLLRIEDTDPGGQPPGTSIYVLLPGRRMPIPQLPGATAGAR
STDIENSRGSANVISVESQSTRAT
>P9WGL1 2.7.13.3~~~mprB~~~Signal transduction histidine-protein kinase/phosphatase MprB~~~COG2205
MWWFRRRDRAPLRATSSLSLRWRVMLLAMSMVAMVVVLMSFAVYAVISAALYSDIDNQLQSRAQLLIASGSLAADPGKAI
EGTAYSDVNAMLVNPGQSIYTAQQPGQTLPVGAAEKAVIRGELFMSRRTTADQRVLAIRLTNGSSLLISKSLKPTEAVMN
KLRWVLLIVGGIGVAVAAVAGGMVTRAGLRPVGRLTEAAERVARTDDLRPIPVFGSDELARLTEAFNLMLRALAESRERQ
ARLVTDAGHELRTPLTSLRTNVELLMASMAPGAPRLPKQEMVDLRADVLAQIEELSTLVGDLVDLSRGDAGEVVHEPVDM
ADVVDRSLERVRRRRNDILFDVEVIGWQVYGDTAGLSRMALNLMDNAAKWSPPGGHVGVRLSQLDASHAELVVSDRGPGI
PVQERRLVFERFYRSASARALPGSGLGLAIVKQVVLNHGGLLRIEDTDPGGQPPGTSIYVLLPGRRMPIPQLPGATAGAR
STDIENSRGSANVISVESQSTRAT
>C0H3X7 2.3.2.3~~~mprF~~~Phosphatidylglycerol lysyltransferase~~~COG0392
MLIKKNALSILKIVFPIAVLLFVIYQSKKELTNLSFKRTLMVINGLERTDLFMLVLIGLLAVAAMSLYDYVLKYSLRLSI
TNGKVFRVSWIANSFNNVLGFGGLAGVGLRMMFYKEHTKDHKALVKGIAWLTSSMLLGLSVFSIFVAARVLPVDEVIHEK
PWLWAVVIGFALILPLSLAVSKIKDRKAGDEENADKVKNPIFAYIGASVVEWLMAGTVIYFALFAMGIHADIRYVFGVFV
IAAIGGMISLVPGGFGSFDLLFLLGMEQLGYHQEAIVTSIVLYRLAYSFIPFILGLFFAAGDLTENTMKRLETNPRIAPA
IETTNVLLVVQRAVLVRILQGSLSLIVFVAGLIVLASVSLPIDRLTVIPHIPRPALLLFNGLSLSSALILLILPIELYKR
TKRSYTMAITALVGGFVFSFLKGLNISAIFVLPMIIVLLVLLKKQFVREQASYTLGQLIFAVALFTVALFNYNLIAGFIW
DRMKKVLRHEYFVHSTSHITHATIMAIIIVPLFFLIFTVVYHKRTKPIGEKADPERLAAFLNEKGGNALSHLGFLGDKRF
YFSSDGNALLLFGKIARRLVVLGDPSGQRESFPLVLEEFLNEAHQKGFSVLFYQIEREDMALYHDFGYNFFKLGEEAYVD
LNTFTLTGKKKAGLRAINNRFEREEYTFHVDHPPFSDAFLEELKQISDEWLGSKKEKGFSLGFFDPSYLQKAPIAYMKNA
EGEIVAFANVMPMYQEGEISVDLMRYRGDAPNGIMDALFIRMFLWAKEEGCTSFNMGMAPLANVGTAFTSFWSERFAAVI
FNNVRYMYSFSGLRAFKEKYKPEWRGKYLAYRKNRSLSVTMFLVTRLIGKSKKDSV
>Q2G2M2 2.3.2.3~~~mprF~~~Phosphatidylglycerol lysyltransferase~~~COG0392
MNQEVKNKIFSILKITFATALFIFVAITLYRELSGINFKDTLVEFSKINRMSLVLLFIGGGASLVILSMYDVILSRALKM
DISLGKVLRVSYIINALNAIVGFGGFIGAGVRAMVYKNYTHDKKKLVHFISLILISMLTGLSLLSLLIVFHVFDASLILD
KITWVRWVLYVVSFFLPLFIIYSMVRPPDKNNRFVGLYCTLVSCVEWLAAAVVLYFCGVIVDAHVSFMSFIAIFIIAALS
GLVSFIPGGFGAFDLVVLLGFKTLGVPEEKVLLMLLLYRFAYYFVPVIIALILSSFEFGTSAKKYIEGSKYFIPAKDVTS
FLMSYQKDIIAKIPSLSLAILVFFTSMIFFVNNLTIVYDALYDGNHLTYYILLAIHTSACLLLLLNVVGIYKQSRRAIIF
AMISILLITVATFFTYASYILITWLAIIFVLLIVAFRRARRLKRPVRMRNIVAMLLFSLFILYVNHIFIAGTLYALDIYT
IEMHTSVLRYYFWLTILIIAIIIGMIAWLFDYQFSKVRISSKIEDCEEIINQYGGNYLSHLIYSGDKQFFTNENKTAFLM
YRYKASSLVVLGDPLGDENAFDELLEAFYNYAEYLGYDVIFYQVTDQHMPLYHNFGNQFFKLGEEAIIDLTQFSTSGKKR
RGFRATLNKFDELNISFEIIEPPFSTEFINELQHVSDLWLDNRQEMHFSVGEFNEEYLSKAPIGVMRNEENEVIAFCSLM
PTYFNDAISVDLIRWLPELDLPLMDGLYLHMLLWSKEQGYTKFNMGMATLSNVGQLHYSYLRERLAGRVFEHFNGLYRFQ
GLRRYKSKYNPNWEPRFLVYRKDNSLWESLSKVMRVIRHK
>Q7A5R9 2.3.2.3~~~mprF~~~Phosphatidylglycerol lysyltransferase~~~
MNQEVKNKIFSILKITFATALFIFVAITLYRELSGINFKDTLVEFSKINRMSLVLLFIGGGASLVILSMYDVILSRALKM
DISLGKVLRVSYIINALNAIVGFGGFIGAGVRAMVYKNYTHDKKKLVHFISLILISMLTGLSLLSLLIVFHVFDASLILD
KITWVRWVLYVVSFFLPLFIIYSMVRPPDKNNRFVGLYCTLVSCVEWLAAAVVLYFCGVIVDAHVSFMSFIAIFIIAALS
GLVSFIPGGFGAFDLVVLLGFKTLGVPEEKVLLMLLLYRFAYYFVPVIIALILSSFEFGTSAKKYIEGSKYFIPAKDVTS
FLMSYQKDIIAKIPSLSLAILVFFTSMIFFVNNLTIVYDALYDGNHLTYYILLAIHTSACLLLLLNVVGIYKQSRRAIIF
AMISILLITVATFFTYASYILITWLAIIFVLLIVAFRRARRLKRPVRMRNIVAMLLFSLFILYVNHIFIAGTLYALDIYT
IEMHTSVLRYYFWLTILIIAIIIGMIAWLFDYQFSKVRISSKIEDCEEIINQYGGNYLSHLIYSGDKQFFTNENKTAFLM
YRYKASSLVVLGDPLGDENAFDELLEAFYNYAEYLGYDVIFYQVTDQHMPLYHNFGNQFFKLGEEAIIDLTQFSTSGKKR
RGFRATLNKFDELNISFEIIEPPFSTEFINELQHVSDLWLDNRQEMHFSVGQFNEEYLSKAPIGVMRNEENEVIAFCSLM
PTYFNDAISVDLIRWLPELDLPLMDGLYLHMLLWSKEQGYTKFNMGMATLSNVGQLHYSYLRERLAGRVFEHFNGLYRFQ
GLRRYKSKYNPNWEPRFLVYRKDNSLWESLSKVMRVIRHK
>P39790 3.4.21.-~~~mpr~~~Extracellular metalloprotease~~~COG3591
MKLVPRFRKQWFAYLTVLCLALAAAVSFGVPAKAAENPQTSVSNTGKEADATKNQTSKADQVSAPYEGTGKTSKSLYGGQ
TELEKNIQTLQPSSIIGTDERTRISSTTSFPYRATVQLSIKYPNTSSTYGCTGFLVNPNTVVTAGHCVYSQDHGWASTIT
AAPGRNGSSYPYGTYSGTMFYSVKGWTESKDTNYDYGAIKLNGSPGNTVGWYGYRTTNSSSPVGLSSSVTGFPCDKTFGT
MWSDTKPIRSAETYKLTYTTDTYGCQSGSPVYRNYSDTGQTAIAIHTNGGSSYNLGTRVTNDVFNNIQYWANQ
>P9WQN7 ~~~~~~MPT51/MPB51 antigen~~~COG0627
MKGRSALLRALWIAALSFGLGGVAVAAEPTAKAAPYENLMVPSPSMGRDIPVAFLAGGPHAVYLLDAFNAGPDVSNWVTA
GNAMNTLAGKGISVVAPAGGAYSMYTNWEQDGSKQWDTFLSAELPDWLAANRGLAPGGHAAVGAAQGGYGAMALAAFHPD
RFGFAGSMSGFLYPSNTTTNGAIAAGMQQFGGVDTNGMWGAPQLGRWKWHDPWVHASLLAQNNTRVWVWSPTNPGASDPA
AMIGQAAEAMGNSRMFYNQYRSVGGHNGHFDFPASGDNGWGSWAPQLGAMSGDIVGAIR
>P9WG65 ~~~~~~Soluble secreted antigen MPT53~~~COG0526
MSLRLVSPIKAFADGIVAVAIAVVLMFGLANTPRAVAADERLQFTATTLSGAPFDGASLQGKPAVLWFWTPWCPFCNAEA
PSLSQVAAANPAVTFVGIATRADVGAMQSFVSKYNLNFTNLNDADGVIWARYNVPWQPAFVFYRADGTSTFVNNPTAAMS
QDELSGRVAALTS
>O53508 2.4.1.-~~~mptA~~~Alpha-(1->6)-mannopyranosyltransferase A~~~
MTTPSHAPAVDLATAKDAVVQHLSRLFEFTTGPQGGPARLGFAGAVLITAGGLGAGSVRQHDPLLESIHMSWLRFGHGLV
LSSILLWTGVGVMLLAWLGLGRRVLAGEATEFTMRATTVIWLAPLLLSVPVFSRDTYSYLAQGALLRDGLDPYAVGPVGN
PNALLDDVSPIWTITTAPYGPAFILVAKFVTVIVGNNVVAGTMLLRLCMLPGLALLVWATPRLASHLGTHGPTALWICVL
NPLVLIHLMGGVHNEMLMVGLMTAGIALTVQGRNVAGIILITVAIAVKATAGIALPFLVWVWLRHLRERRGYRPVQAFLA
AAAISLLIFVAVFAVLSAVAGVGLGWLTALAGSVKIINWLTVPTGAANVIHALGRGLFTVDFYTLLRITRLIGIVIIAVS
LPLLWWRFRRDDRAALTGVAWSMLIVVLFVPAALPWYYSWPLAVAAPLAQARRAIAAIAGLSTWVMVIFKPDGSHGMYSW
LHFWIATACALTAWYVLYRSPDRRGVQAATPVVNTP
>C5CBV8 2.4.1.54~~~~~~Undecaprenyl-phosphate mannosyltransferase~~~COG1216
MRVLTIIPTYNEIESLPLTLGRLRDAVPESDVLVVDDASPDGTGDWADTRAAEDPSVHVLHRTTKDGLGGAYIAGFRWGL
ERGYDVLVEMDADGSHQPEQLPRLLEAVRTADLVIGSRRVPGGKMVNWPTSRKMISWAGSLYPRIMLGLNLTDITAGYRA
YRADTLRAIDLDAIESKGYGFQVDMTFRTARLGKRIVEVPITFVERELGESKMSGGIVGEAVVNVTRWGLAARWEGLRAR
LGL
>Q0PC20 3.2.2.30~~~pfs~~~Aminodeoxyfutalosine nucleosidase~~~COG0775
MMKIAILGAMSEEITPLLETLKDYTKIEHANNTYYFAKYKNHELVLAYSKIGKVNSTLSASVMIEKFGAQALLFTGVAGA
FNPELEIGDLLYATKLAQYDLDITAFGHPLGFVPGNEIFIKTDEKLNNLALEVAKELNIKLRAGIIATGDEFICDEAKKA
KIREIFNADACEMEGASVALVCDALKVPCFILRAMSDKAGEKAEFDFDEFVINSAKISANFVLKMCEKL
>Q9ZMY2 3.2.2.30~~~mtnN~~~Aminodeoxyfutalosine nucleosidase~~~COG0775
MQKIGILGAMREEITPILELFGVDFEEIPLGGNVFHKGVYHNKEIIVAYSKIGKVHSTLTTTSMILAFGVQKVLFSGVAG
SLVKDLKINDLLVATQLVQHDVDLSAFDHPLGFIPESAIFIETSGSLNALAKKIANEQHIALKEGVIASGDQFVHSKERK
EFLVSEFKASAVEMEGASVAFVCQKFGVPCCVLRSISDNADEKAGMSFDEFLEKSAHTSAKFLKSMVDEL
>O24915 3.2.2.30~~~mtnN~~~Aminodeoxyfutalosine nucleosidase~~~COG0775
MVQKIGILGAMREEITPILELFGVDFEEIPLGGNVFHKGVYHNKEIIVAYSKIGKVHSTLTTTSMILAFGVQKVLFSGVA
GSLVKDLKINDLLVAIQLVQHDVDLSAFDHPLGFIPESAIFIETSESLNALAKEVANEQHIVLKEGVIASGDQFVHSKER
KEFLVSEFKASAVEMEGASVAFVCQKFGVPCCVLRSISDNADEEANMSFDAFLEKSAQTSAKFLKSMVDEL
>Q9L0T8 4.2.1.151~~~mqnA~~~Chorismate dehydratase~~~COG1427
MDNSRTRPRVGHIQFLNCLPLYWGLARTGTLLDFELTKDTPEKLSEQLVRGDLDIGPVTLVEFLKNADDLVAFPDIAVGC
DGPVMSCVIVSQVPLDRLDGARVALGSTSRTSVRLAQLLLSERFGVQPDYYTCPPDLSLMMQEADAAVLIGDAALRANMI
DGPRYGLDVHDLGALWKEWTGLPFVFAVWAARRDYAEREPVITRKVHEAFLASRNLSLEEVEKVAEQAARWEAFDEDTLA
KYFTTLDFRFGAPQLEAVTEFARRVGPTTGFPADVKVELLKP
>Q5SK49 4.2.1.151~~~mqnA~~~Chorismate dehydratase~~~COG1427
MRPYVLGLPRYANVAPLHHFLRLEGFRVLHAVPAELNRLLLSGEVGLSLVSSYFYLKHQDSLGLLPDFSVAVLGRVYSVN
LFHKGALPHLARVALTTESATSVALLKLLLKEAGAGPRYERREGGLELLSAYDGVLLIGDRAIKAYAALLPEVPETPHAL
PTRFGEVEVADLSTLWFQRTHLPFVFAVWAYRRENPPPKALVQALREARREGLGRLREVAEAEARRLGVHPLLLQHYLWN
FRYHLEEPDRLGLKAFAEALGLPFAPMFYPE
>A0LR22 3.2.2.26~~~mqnB~~~Futalosine hydrolase~~~COG0775
MSVKRLIITAVAAEADAVASGLDGAQPHPQGSANVRHTATADILVAGVGSAAAAAATAAALARRHYSLVICTGIAGGIGI
AGIGDIVVADAVHPADLGAMSPDGFIPLEHLGIATTANAIDPPVVEELTGPLRYAGLAPVIGGILTVNTVTGTDAHADDL
RRRYPGAVAEAMEGYGVAVAATRAGVRYGELRVVSNRVGRRDRRAWDIPGALRRLEHAFAALGAAWCNDGSGQAAAREID
GGCP
>Q9KXN0 3.2.2.26~~~mqnB~~~Futalosine hydrolase~~~COG0775
MHLLVATAVSVERDAVARAFPAPGTEVSRPGITLHRLPDGWDLLAAGVGPARAAASTAAALTAAALDGRPYDLVVSAGIG
GGFAPEAPVGSLVVADAITAADLGAETADGFLPVTDLGFGTVTHLPPAPLVRAAAEATGARPGTVLTGSTVTGTAARAAL
LRERHPGALAEAMEGFGVAEAAAAHGVPVLELRAVSNPVGPRDRAAWRIGEALAALTDAVGKLAPVLESWKPHER
>Q5SKT7 3.2.2.26~~~mqnB~~~Futalosine hydrolase~~~COG0775
MWLLLSPTRLEAPFLEGEPFAFLAWRGLKGTGFVYLETGIGKVNAAMALAAYAARNPVEKALLFGLAGAYPGGPSLGEAV
LVEEEVEADLGLKEGLAPLGFPALALGERRYFNRFPLDPGLTGELARGLGLKVAVGLTRDLVSETPEEALALARRWGASL
ENMEGAAFARACLALGVRGAELRALSNPAGVRDKAHWRTKEALSALARAVRRLLAEEGGARRPPG
>Q9K864 1.21.98.1~~~mqnC~~~Cyclic dehypoxanthine futalosine synthase~~~COG1060
MSIDGILERAVNGERLSMEDAVKLYESDEVEKMGAAANQIMLKWHPEPITTFVIGRNVNYTNFCDTYCRFCAFYRAPGHK
EGYVLDDEVILKKIQETIDVGGTEILMQGGTNPDLTIDYYTDLLRNIKERFPNIWMHSFSPAEVWKIAEVSSMSVEEVLR
ELHEAGLDSMPGGGAEILTEETRLRVSRLKITWEQWINAMKATKKVGMHGTATMVIGFGESFEERALHLQRVRDAQDETE
CFTAFISWLFQPENTGMYKTKKLTPRDYLKNVAISRLFLDNIPNFQSSWVTMGPEVGKLSLQYGCNDFGSTMIEENVVSA
AGTTHKVNTNKILQLIREAGKIPAQRTTSYEIIRTFEDKEAAEKDFVMQN
>Q5SI12 4.1.-.-~~~mqnD~~~1,4-dihydroxy-6-naphtoate synthase~~~COG2107
MEALRLGFSPCPNDTFIFYALVHGRVESPVPLEPVLEDVETLNRWALEGRLPLTKLSYAAYAQVRDRYVALRSGGALGRG
VGPLVVARGPLQALEGLRVAVPGRHTTAYFLLSLYAQGFVPVEVRYDRILPMVAQGEVEAGLIIHESRFTYPRYGLVQVV
DLGAWWEERTGLPLPLGAILARRDLGEGLIRALDEAVRRSVAYALAHPEEALDYMRAHAQELSDEVIWAHVHTYVNAFSL
DVGEEGERAVARLFAEAEARGLAAPSPRPLFV
>Q5SK48 2.5.1.120~~~mqnE~~~Aminodeoxyfutalosine synthase~~~COG1060
MRGIRDPRLIPIAEKVMEGKRLSFEDGLVLYQTKDLPTLMRLANLVRERKHGHKTYFVHSIRVSQTNICYVGCTFCAFQR
RFGEEGAWDWDVDEVVAWVKERYQPGLTEIHLTAGHHPKRPFAYYLDLVRALKENFPGVQVKAWTAAEIHHFSKIARLPY
REVLKALKEAGLDAMPGGGAEIFAERVRRKIARAKVSAEGWLEIHRTAHELGIPTNATMLYGHIETLEERLDHMDRLRRL
QDETGGFMSFIPLAFQPDGNQLARELGKKEFTTGLDDLRNLAVARLYLDNFPHIKGYWATLTPELAQVSLDWGVTDVDGT
LIEERIVHMAGSPTPQGLTKRELARIILMAGRIPVERDALYREVRVWDRVEA
>A0LRH8 3.5.4.40~~~~~~Aminodeoxyfutalosine deaminase~~~COG1816
MTPHDPVSVEAVPKIELHVHLEGTVEPATVLDIAARNGLALPVSTVDELSALYRVTTFSDFLRLWILTTNVLRKAEDFSQ
VVVDYARRAKRHGAVYIEGIFSPVERVMRGVGWAEIFDGYCEGAERAYAEHGVVVRLTPEAYRGADPELVAEMVRYAGRY
RDRGVVGVGIGGDERARPTRHYAAAFAPAVDLGLGVVPHAGEFPLFPDGASGAATLRETIEALNPVRIRHGIAAAADPAL
VAVIRERGIVLDVCPTSNLRTGAIRDLADHPLPRLAAAGIPCTVGTDDPAVFDTDLSREFTIAARLGVEPRLLYDAGITG
ALCDDDVKSHLRQIGAATTWPTTTATWSTTAAGESL
>Q9RW45 3.5.4.40~~~~~~Aminodeoxyfutalosine deaminase~~~COG0402
MRFSAVSRHHRGASIDPMTFSEATTPDALTPDAHTPRLLTCDVLYTGMGGAQSPGGVVVVGETVAAAGHPDELRRQYPHA
AEERAGAVIAPPPVNAHTHLDMSAYEFQALPYFQWIPEVVIRGRHLRGVAAAQAGADTLTRLGAGGVGDIVWAPEVMDAL
LAREDLSGTLYFEVLNPFPDKADEVFAAARTHLERWRRLERPGLRLGLSPHTPFTVSHRLMRLLSDYAAGEGLPLQIHVA
EHPTELEMFRTGGGPLWDNRMPALYPHTLAEVIGREPGPDLTPVRYLDELGVLAARPTLVHMVNVTPDDIARVARAGCAV
VTCPRSNHHLECGTFDWPAFAAAGVEVALGTDSVASGETLNVREEVTFARQLYPGLDPRVLVRAAVKGGQRVVGGRTPFL
RRGETWQEGFRWELSRDL
>A6Q234 3.5.4.40~~~~~~Aminodeoxyfutalosine deaminase~~~COG0402
MRIIKPFAILTPQTIIQDKAVAFDKKIEAIDTVENLIKKYPNAAVEHDENSLLLPGFANPHLHLEFSANKATLQYGDFIP
WLYSVIRHREDLLPLCDGACLEQTLSSIIQTGTTAIGAISSYGEDLQACIDSALKVVYFNEVIGSNAATADVMYASFLER
FHQSKKHENERFKAAVAIHSPYSVHYILAKRALDIAKKYGSLVSVHFMESRAEREWLDKGSGEFAKFFKEFLNQTRPVND
TKSFLELFKELHTLFVHMVWANEEEIQTIASYNAHIIHCPISNRLLGNGVLDLEKIKSIPYAIATDGLSSNYSLNMYEEL
KAALFVHPNKEATTFAKELIIRATKAGYDALGFEGGEIAVGKDADMQLIDLPEGLTNVEDLYLHVILHTTKPKKVYIQGE
EHVRE
>Q82K09 3.5.4.40~~~add2~~~Aminodeoxyfutalosine deaminase~~~COG1816
MTEHFDARGTRDAQTGRDLHSFIAGLPKAELHVHHVGSASPRIVSELAARHPDSSVPTDPEALADYFTFTDFAHFIKVYL
SVVDLIRTPEDVRLLTYEVARELARQQVRYAELTITPFSSTRRGIDERAFMDAIEDARKSAEAEFGTVLRWCFDIPGEAG
LESAEETVRLATDDRLRPEGLVSFGLGGPEIGVPRPQFKPYFDRAIAAGLRSVPHAGETTGPETVWDALTDLRAERIGHG
TSSAQDPKLLAHLAEHRIPLEVCPTSNIATRAVRTLDEHPVKEFVRAGVVVTINSDDPPMFGTDLNNEYAIAARLLDLDE
RGLAGLAKNSVEASFLDAAGKARIAAEIDTYTAAWLAP
>O86737 3.5.4.40~~~~~~Aminodeoxyfutalosine deaminase~~~COG1816
MRPAYDDPRTTDQPITRARPPPRAARGRRLGEEPLTEHLVDPDVPRDLHAFIAGLPKAELHVHHVGSASPRIVSELAARH
ADSKVPTDPEALVDYFTFTDFAHFIDVYLSVVDLIRTPEDVRLLTYEVARDMARQQVRYAELTITPFSSTRRGIDEGAFM
DAIEDARKAAEAEFGTVLRWCFDIPGEAGLESAEETARLATDDRLRPEGLVSFGLGGPEIGVARPQFKPYFDRAIAAGLH
SVPHAGETTGPQTVWEALIDLRAERIGHGTSSAQDPKLLAHLAERRIPLEVCPTSNIATRAVRTLDEHPIKEFVRAGVPV
TINSDDPPMFGTDLNNEYAVAARLLGLDERGLADLAKNGVEASFLDAPGKARIADEIDTYTAAWLAS
>P65422 1.1.5.4~~~mqo1~~~Probable malate:quinone oxidoreductase 1~~~
MTTQHSKTDVILIGGGIMSATLGTLLKELSPEKNIKVFEKLAQPGEESSNVWNNAGTGHSALCELNYTKEGKDGTVDCSK
AIKINEQYQISKQFWAYLVKTGQLDNPDRFIQAVPHMSFVIGEDNVAFIKSRVATLKKSVLFEKMKLSQDEEEMKSWVPL
MIEGRKSDEPIALTYDETGTDVNFGALTAKLFENLEQRGVGIQYKQNVLDIKKQKSGAWLVKVKDLETNETTTYESDFVF
IGAGGASLPLLQKTGIKQSKHIGGFPVSGLFLRCTNQEVIDRHHAKVYGKAAVGAPPMSVPHLDTRFVDGKRSLLFGPFA
GFSPKFLKTGSHMDLIKSVKPNNIVTMLSAGIKEMSLTKYLVSQLMLSNDERMDDLRVFFPNAKNEDWEVITAGQRVQVI
KDTEDSKGNLQFGTEVITSDDGTLAALLGASPGASTAVDIMFDVLQRCYRDEFKGWEPKIKEMVPSFGYRLTDHEDLYHK
INEEVTKYLQVK
>P99115 1.1.5.4~~~mqo2~~~Probable malate:quinone oxidoreductase 2~~~
MAKSNSKDIVLIGAGVLSTTFGSMLKEIEPDWNIHVYERLDRPAIESSNERNNAGTGHAALCELNYTVLQPDGSIDIEKA
KVINEEFEISKQFWGHLVKSGSIENPREFINPLPHISYVRGKNNVKFLKDRYEAMKAFPMFDNIEYTEDIEVMKKWIPLM
MKGREDNPGIMAASKIDEGTDVNFGELTRKMAKSIEAHPNATVQFNHEVVDFEQLSNGQWEVTVKNRLTGEKFKQVTDYV
FIGAGGGAIPLLQKTGIPESKHLGGFPISGQFLACTNPQVIEQHDAKVYGKEPPGTPPMTVPHLDTRYIDGQRTLLFGPF
ANVGPKFLKNGSNLDLFKSVKTYNITTLLAAAVKNLPLIKYSFDQVIMTKEGCMNHLRTFYPEARNEDWQLYTAGKRVQV
IKDTPEHGKGFIQFGTEVVNSQDHTVIALLGESPGASTSVSVALEVLERNFPEYKTEWAPKIKKMIPSYGESLIEDEKLM
RKIRKQTSKDLELGYYEN
>O69282 1.1.5.4~~~mqo~~~Malate:quinone oxidoreductase~~~COG0579
MSDSPKNAPRITDEADVVLIGAGIMSSTLGAMLRQLEPSWTQIVFERLDGPAQESSSPWNNAGTGHSALCELNYTPEVKG
KVEIAKAVGINEKFQVSRQFWSHLVEEGVLSDPKEFINPVPHVSFGQGADQVAYIKARYEALKDHPLFQGMTYADDEATF
TEKLPLMAKGRDFSDPVAISWIDEGTDINYGAQTKQYLDAAEVEGTEIRYGHEVKSIKADGAKWIVTVKNVHTGDTKTIK
ANFVFVGAGGYALDLLRSAGIPQVKGFAGFPVSGLWLRCTNEELIEQHAAKVYGKASVGAPPMSVPHLDTRVIEGEKGLL
FGPYGGWTPKFLKEGSYLDLFKSIRPDNIPSYLGVAAQEFDLTKYLVTEVLKDQDKRMDALREYMPEAQNGDWETIVAGQ
RVQVIKPAGFPKFGSLEFGTTLINNSEGTIAGLLGASPGASIAPSAMIELLERCFGDRMIEWGDKLKDMIPSYGKKLASE
PALFEQQWARTQKTLKLEEA
>P33940 1.1.5.4~~~mqo~~~Malate:quinone oxidoreductase~~~COG0579
MKKVTAMLFSMAVGLNAVSMAAKAKASEEQETDVLLIGGGIMSATLGTYLRELEPEWSMTMVERLEGVAQESSNGWNNAG
TGHSALMELNYTPQNADGSISIEKAVAINEAFQISRQFWAHQVERGVLRTPRSFINTVPHMSFVWGEDNVNFLRARYAAL
QQSSLFRGMRYSEDHAQIKEWAPLVMEGRDPQQKVAATRTEIGTDVNYGEITRQLIASLQKKSNFSLQLSSEVRALKRND
DNTWTVTVADLKNGTAQNIRAKFVFIGAGGAALKLLQESGIPEAKDYAGFPVGGQFLVSENPDVVNHHLAKVYGKASVGA
PPMSVPHIDTRVLDGKRVVLFGPFATFSTKFLKNGSLWDLMSSTTTSNVMPMMHVGLDNFDLVKYLVSQVMLSEEDRFEA
LKEYYPQAKKEDWRLWQAGQRVQIIKRDAEKGGVLRLGTEVVSDQQGTIAALLGASPGASTAAPIMLNLLEKVFGDRVSS
PQWQATLKAIVPSYGRKLNGDVAATERELQYTSEVLGLNYDKPQAADSTPKPQLKPQPVQKEVADIAL
>O24913 1.1.5.4~~~mqo~~~Malate:quinone oxidoreductase~~~COG0579
MSMEFDAVIIGGGVSGCATFYTLSEYSSLKRVAIVEKCSKLAQISSSAKANSQTIHDGSIETNYTPEKAKKVRLSAYKTR
QYALNKGLQNEVIFETQKMAIGVGDEECEFMKKRYESFKEIFVGLEEFDKQKIKELEPNVILGANGIDRHENIIGHGYRK
DWSTMNFAKLSENFVEEALKLKPNNQVFLNFKVKKIEKRNDTYAVISEDAEEVYAKFVLVNAGSYALPLAQSMGYGLDLG
CLPVAGSFYFVPDLLRGKVYTVQNPKLPFAAVHGDPDAVIKGKTRIGPTALTMPKLERNKCWLKGISLELLKMDLNKDVF
KIAFDLMSDKEIRNYVFKNMVFELPIIGKRKFLKDAQKIIPSLSLEDLEYAHGFGEVRPQVLDRTKRKLELGEKKICTHK
GITFNMTPSPGATSCLQNALVDSQEIAAYLGESFELERFYKDLSPEELEN
>A0QVL2 1.1.5.4~~~mqo~~~Probable malate:quinone oxidoreductase~~~COG0579
MSEANAGTNAKTDVVLVGAGIMSATLGTLIKLLEPNWSITMIERLDGAAAESSDPWNNAGTGHSALCELNYTPALPDGTI
DISKAVNVNEQFQVSRQFWAHAVENGVLPDVRSFLNPVPHVSFVYGADNVQYLKARYNALVTNPLFASMEFIDDKDEFTR
RLPLMAEKRDFSEPVALNWSQHGTDVDFGSLSRQLIGFAAGNGMTTMFGHDVRDLSKNSDGSWTVKVRNRRTGNNFKINA
KFVFVGAGGGALPLLQKSGIPEAKGFGGFPVGGAFLRTNKQHLTSRHNAKVYGLPPLGAPPMSVPHLDTRVINGRQWLLF
GPFAGWSPKFLKQGKVTDLPLSVKPNNLASMLGVGLTEVGLLKYLIGQLLLSEPARVETLREFAPSAVDSDWELDIAGQR
VQVIRRKGAGGVLEFGTTVLAAADGSIAGLLGASPGASTAVPAMLDVLQRCFADRYQAWTPKLKEMVPSLGTKLSDEPKL
FEEVWSWGTKVLKLDVQANTAEAANAPATV
>P9WJP5 1.1.5.4~~~mqo~~~Probable malate:quinone oxidoreductase~~~COG0579
MSDLARTDVVLIGAGIMSATLGVLLRRLEPNWSITLIERLDAVAAESSGPWNNAGTGHSALCEMNYTPEMPDGSIDITKA
VRVNEQFQVTRQFWAYAAENGILTDVRSFLNPVPHVSFVHGSRGVEYLRRRQKALAGNPLFAGTEFIESPDEFARRLPFM
AAKRAFSEPVALNWAADGTDVDFGALAKQLIGYCVQNGTTALFGHEVRNLSRQSDGSWTVTMCNRRTGEKRKLNTKFVFV
GAGGDTLPVLQKSGIKEVKGFAGFPIGGRFLRAGNPALTASHRAKVYGFPAPGAPPLGALHLDLRFVNGKSWLVFGPYAG
WSPKFLKHGQISDLPRSIRPDNLLSVLGVGLTERRLLNYLISQLRLSEPERVSALREFAPSAIDSDWELTIAGQRVQVIR
RDERNGGVLEFGTTVIGDADGSIAGLLGGSPGASTAVAIMLDVLQKCFANRYQSWLPTLKEMVPSLGVQLSNEPALFDEV
WSWSTKALKLGAA
>Q46864 ~~~mqsA~~~Antitoxin MqsA~~~COG2944
MKCPVCHQGEMVSGIKDIPYTFRGRKTVLKGIHGLYCVHCEESIMNKEESDAFMAQVKAFRASVNAETVAPEFIVKVRKK
LSLTQKEASEIFGGGVNAFSRYEKGNAQPHPSTIKLLRVLDKHPELLNEIR
>Q46865 3.1.-.-~~~mqsR~~~mRNA interferase toxin MqsR~~~
MEKRTPHTRLSQVKKLVNAGQVRTTRSALLNADELGLDFDGMCNVIIGLSESDFYKSMTTYSDHTIWQDVYRPRLVTGQV
YLKITVIHDVLIVSFKEK
>O66465 2.7.8.13~~~mraY~~~Phospho-N-acetylmuramoyl-pentapeptide-transferase~~~COG0472
MLYQLALLLKDYWFAFNVLKYITFRSFTAVLIAFFLTLVLSPSFINRLRKIQRLFGGYVREYTPESHEVKKYTPTMGGIV
ILIVVTLSTLLLMRWDIKYTWVVLLSFLSFGTIGFWDDYVKLKNKKGISIKTKFLLQVLSASLISVLIYYWADIDTILYF
PFFKELYVDLGVLYLPFAVFVIVGSANAVNLTDGLDGLAIGPAMTTATALGVVAYAVGHSKIAQYLNIPYVPYAGELTVF
CFALVGAGLGFLWFNSFPAQMFMGDVGSLSIGASLATVALLTKSEFIFAVAAGVFVFETISVILQIIYFRWTGGKRLFKR
APFHHHLELNGLPEPKIVVRMWIISILLAIIAISMLKLR
>P0A6W3 2.7.8.13~~~mraY~~~Phospho-N-acetylmuramoyl-pentapeptide-transferase~~~COG0472
MLVWLAEHLVKYYSGFNVFSYLTFRAIVSLLTALFISLWMGPRMIAHLQKLSFGQVVRNDGPESHFSKRGTPTMGGIMIL
TAIVISVLLWAYPSNPYVWCVLVVLVGYGVIGFVDDYRKVVRKDTKGLIARWKYFWMSVIALGVAFALYLAGKDTPATQL
VVPFFKDVMPQLGLFYILLAYFVIVGTGNAVNLTDGLDGLAIMPTVFVAGGFALVAWATGNMNFASYLHIPYLRHAGELV
IVCTAIVGAGLGFLWFNTYPAQVFMGDVGSLALGGALGIIAVLLRQEFLLVIMGGVFVVETLSVILQVGSFKLRGQRIFR
MAPIHHHYELKGWPEPRVIVRFWIISLMLVLIGLATLKVR
>P9WMW7 2.7.8.13~~~mraY~~~Phospho-N-acetylmuramoyl-pentapeptide-transferase~~~COG0472
MRQILIAVAVAVTVSILLTPVLIRLFTKQGFGHQIREDGPPSHHTKRGTPSMGGVAILAGIWAGYLGAHLAGLAFDGEGI
GASGLLVLGLATALGGVGFIDDLIKIRRSRNLGLNKTAKTVGQITSAVLFGVLVLQFRNAAGLTPGSADLSYVREIATVT
LAPVLFVLFCVVIVSAWSNAVNFTDGLDGLAAGTMAMVTAAYVLITFWQYRNACVTAPGLGCYNVRDPLDLALIAAATAG
ACIGFLWWNAAPAKIFMGDTGSLALGGVIAGLSVTSRTEILAVVLGALFVAEITSVVLQILTFRTTGRRMFRMAPFHHHF
ELVGWAETTVIIRFWLLTAITCGLGVALFYGEWLAAVGA
>Q2FZ93 2.7.8.13~~~mraY~~~Phospho-N-acetylmuramoyl-pentapeptide-transferase~~~COG0472
MIFVYALLALVITFVLVPVLIPTLKRMKFGQSIREEGPQSHMKKTGTPTMGGLTFLLSIVITSLVAIIFVDQANPIILLL
FVTIGFGLIGFIDDYIIVVKKNNQGLTSKQKFLAQIGIAIIFFVLSNVFHLVNFSTSIHIPFTNVAIPLSFAYVIFIVFW
QVGFSNAVNLTDGLDGLATGLSIIGFTMYAIMSFVLGETAIGIFCIIMLFALLGFLPYNINPAKVFMGDTGSLALGGIFA
TISIMLNQELSLIFIGLVFVIETLSVMLQVASFKLTGKRIFKMSPIHHHFELIGWSEWKVVTVFWAVGLISGLIGLWIGV
H
>P55343 ~~~mraZ~~~Transcriptional regulator MraZ~~~COG2001
MFMGEYQHTIDAKGRMIVPAKFREGLGEQFVLTRGLDQCLFGYPMHEWKQIEEKLKALPLTKKDARAFTRFFFSGATECE
LDKQGRVNIASSLLNYAKLEKECVVIGVSNRIELWSKVIWEQYTEEQEDSFAEIAENMIGFDI
>P22186 ~~~mraZ~~~Transcriptional regulator MraZ~~~COG2001
MFRGATLVNLDSKGRLSVPTRYREQLLENAAGQMVCTIDIYHPCLLLYPLPEWEIIEQKLSRLSSMNPVERRVQRLLLGH
ASECQMDGAGRLLIAPVLRQHAGLTKEVMLVGQFNKFELWDETTWHQQVKEDIDAEQLATGDLSERLQDLSL
>P75467 ~~~mraZ~~~Transcriptional regulator MraZ~~~
MLLGTFNITLDAKNRISLPAKLRAFFEGSIVINRGFENCLEVRKPQDFQKYFEQFNSFPSTQKDTRTLKRLIFANANFVD
VDTAGRVLIPNNLINDAKLDKEIVLIGQFDHLEIWDKKLYEDYLANSESLETVAERMKDVK
>A0R025 ~~~mraZ~~~Transcriptional regulator MraZ~~~COG2001
MFLGTYTPKLDDKGRLTLPAKFRDALAGGLMVTKSQDHSLAVYPRDEFEKLARRASQASRSNPEARAFLRSLAAATDEQH
PDAQGRITLSADHRRYANLSKDCVVIGSVDYLEIWDAQAWQEYQQAHEENFSAATDETLRDII
>P9WJN9 ~~~mraZ~~~Transcriptional regulator MraZ~~~COG2001
MFLGTYTPKLDDKGRLTLPAKFRDALAGGLMVTKSQDHSLAVYPRAAFEQLARRASKAPRSNPEARAFLRNLAAGTDEQH
PDSQGRITLSADHRRYASLSKDCVVIGAVDYLEIWDAQAWQNYQQIHEENFSAASDEALGDIF
>P65439 ~~~mraZ~~~Transcriptional regulator MraZ~~~
MFMGEYDHQLDTKGRMIIPSKFRYDLNERFIITRGLDKCLFGYTLDEWQQIEEKMKTLPMTKKDARKFMRMFFSGAVEVE
LDKQGRINIPQNLRKYANLTKECTVIGVSNRIEIWDRETWNDFYEESEESFEDIAEDLIDFDF
>P0AD65 3.4.16.4~~~mrdA~~~Peptidoglycan D,D-transpeptidase MrdA~~~COG0768
MKLQNSFRDYTAESALFVRRALVAFLGILLLTGVLIANLYNLQIVRFTDYQTRSNENRIKLVPIAPSRGIIYDRNGIPLA
LNRTIYQIEMMPEKVDNVQQTLDALRSVVDLTDDDIAAFRKERARSHRFTSIPVKTNLTEVQVARFAVNQYRFPGVEVKG
YKRRYYPYGSALTHVIGYVSKINDKDVERLNNDGKLANYAATHDIGKLGIERYYEDVLHGQTGYEEVEVNNRGRVIRQLK
EVPPQAGHDIYLTLDLKLQQYIETLLAGSRAAVVVTDPRTGGVLALVSTPSYDPNLFVDGISSKDYSALLNDPNTPLVNR
ATQGVYPPASTVKPYVAVSALSAGVITRNTTLFDPGWWQLPGSEKRYRDWKKWGHGRLNVTRSLEESADTFFYQVAYDMG
IDRLSEWMGKFGYGHYTGIDLAEERSGNMPTREWKQKRFKKPWYQGDTIPVGIGQGYWTATPIQMSKALMILINDGIVKV
PHLLMSTAEDGKQVPWVQPHEPPVGDIHSGYWELAKDGMYGVANRPNGTAHKYFASAPYKIAAKSGTAQVFGLKANETYN
AHKIAERLRDHKLMTAFAPYNNPQVAVAMILENGGAGPAVGTLMRQILDHIMLGDNNTDLPAENPAVAAAEDH
>P39763 ~~~mreBH~~~Cell shape-determining protein MreBH~~~COG1077
MFQSTEIGIDLGTANILVYSKNKGIILNEPSVVAVDTTTKAVLAIGADAKNMIGKTPGKIVAVRPMKDGVIADYDMTTDL
LKHIMKKAAKSIGMSFRKPNVVVCTPSGSTAVERRAISDAVKNCGAKNVHLIEEPVAAAIGADLPVDEPVANVVVDIGGG
TTEVAIISFGGVVSCHSIRIGGDQLDEDIVSFVRKKYNLLIGERTAEQVKMEIGHALIEHIPEAMEIRGRDLVTGLPKTI
MLQSNEIQDAMRESLLHILEAIRATLEDCPPELSGDIVDRGVILTGGGALLNGIKEWLTEEIVVPVHVAQNPLESVAIGT
GRSLEVIDKLQKAIK
>Q01465 ~~~mreB~~~Cell shape-determining protein MreB~~~COG1077
MFGIGARDLGIDLGTANTLVFVKGKGIVVREPSVVALQTDTKSIVAVGNDAKNMIGRTPGNVVALRPMKDGVIADYETTA
TMMKYYINQAIKNKGMFARKPYVMVCVPSGITAVEERAVIDATRQAGARDAYPIEEPFAAAIGANLPVWEPTGSMVVDIG
GGTTEVAIISLGGIVTSQSIRVAGDEMDDAIINYIRKTYNLMIGDRTAEAIKMEIGSAEAPEESDNMEIRGRDLLTGLPK
TIEITGKEISNALRDTVSTIVEAVKSTLEKTPPELAADIMDRGIVLTGGGALLRNLDKVISEETKMPVLIAEDPLDCVAI
GTGKALEHIHLFKGKTR
>A0A0H3C7V4 ~~~mreB~~~Cell shape-determining protein MreB~~~
MFSSLFGVISNDIAIDLGTANTLIYQKGKGIVLNEPSVVALRNVGGRKVVHAVGIEAKQMLGRTPGHMEAIRPMRDGVIA
DFEVAEEMIKYFIRKVHNRKGFVNPKVIVCVPSGATAVERRAINDSCLNAGARRVGLIDEPMAAAIGAGLPIHEPTGSMV
VDIGGGTTEVAVLSLSGIVYSRSVRVGGDKMDEAIISYMRRHHNLLIGETTAERIKKEIGTARAPADGEGLSIDVKGRDL
MQGVPREVRISEKQAADALAEPVGQIVEAVKVALEATPPELASDIADKGIMLTGGGALLRGLDAEIRDHTGLPVTVADDP
LSCVALGCGKVLEHPKWMKGVLESTLA
>P0A9X4 ~~~mreB~~~Cell shape-determining protein MreB~~~COG1077
MLKKFRGMFSNDLSIDLGTANTLIYVKGQGIVLNEPSVVAIRQDRAGSPKSVAAVGHDAKQMLGRTPGNIAAIRPMKDGV
IADFFVTEKMLQHFIKQVHSNSFMRPSPRVLVCVPVGATQVERRAIRESAQGAGAREVFLIEEPMAAAIGAGLPVSEATG
SMVVDIGGGTTEVAVISLNGVVYSSSVRIGGDRFDEAIINYVRRNYGSLIGEATAERIKHEIGSAYPGDEVREIEVRGRN
LAEGVPRGFTLNSNEILEALQEPLTGIVSAVMVALEQCPPELASDISERGMVLTGGGALLRNLDRLLMEETGIPVVVAED
PLTCVARGGGKALEMIDMHGGDLFSEE
>Q01466 ~~~mreC~~~Cell shape-determining protein MreC~~~COG1792
MPNKRLMLLLLCIIILVAMIGFSLKGGRNTTWPEKVIGDTTGVFQNIFHTPAEFFAGIFENINDLKNTYKENERLREKLD
GQTQYEAKLQELEEENKSLRDELGHVKSIKDYKPILATVIARSPDNWAKQVTINKGTQQNVAKDMAVTNEKGALIGKIKS
SGLNNFTSAVQLLSDTDRNNRVATKISGKKGSKGYGLIEGYDKEKKRLKMTIIERKDKQDVKKGDLIETSGTGGVFPEGL
TIGEVTDIESDSYGLTKVAYVKPAADLTDLNNVIVVNRDVPTVDTEEEGS
>B8H610 ~~~mreC~~~Cell shape-determining protein MreC~~~
MRFREGPLGDLKVPLTWTAAVALIVAAVIGVAFLLADRRETLQEQAYGVTRQTVDTVARPVSGAIAAPGRWTGLGLDYVR
SYFFTAHENRRLKAELAEMRQWRDRALALQDQNDRFKSLLGLRTDPPIPMAAARVVSDSRGPFANTRLADAGSERGIVVG
NPVLNERGLVGRVVGVSRGVSRVLLLTDIASRTPVMIDRTNARAILTGDGGPNPKLDYLRGVDPIQQGDRVVTSGDGGVV
PRGLPVGAAVKGLDGRWRVVLFADQASIDYVRILLFKDFAQLADEKQLQARSLPPVTTEDPQTSILSNPVSRPVAPTPSP
ATATPSAAPAARPATTATPPQTGAPR
>P16926 ~~~mreC~~~Cell shape-determining protein MreC~~~COG1792
MKPIFSRGPSLQIRLILAVLVALGIIIADSRLGTFSQIRTYMDTAVSPFYFVSNAPRELLDGVSQTLASRDQLELENRAL
RQELLLKNSELLMLGQYKQENARLRELLGSPLRQDEQKMVTQVISTVNDPYSDQVVIDKGSVNGVYEGQPVISDKGVVGQ
VVAVAKLTSRVLLICDATHALPIQVLRNDIRVIAAGNGCTDDLQLEHLPANTDIRVGDVLVTSGLGGRFPEGYPVAVVSS
VKLDTQRAYTVIQARPTAGLQRLRYLLLLWGADRNGANPMTPEEVHRVANERLMQMMPQVLPSPDAMGPKLPEPATGIAQ
PTPQQPATGNAATAPAAPTQPAANRSPQRATPPQSGAQPPARAPGGQ
>Q8Y6Y4 ~~~mreC~~~Cell shape-determining protein MreC~~~COG1792
MPQFFLNKRLIILLISIIVLVALVGFSLRDRENASWPEQFVKDVVGFGENIVAKPTSFISGAVDGVVDLKNTYTENQHLK
ERLEELAQLESEVADLKKENKDLKESLDITDSIRDYDPLNASVISRNPTNWNDQVEIDKGSSDGVKPDMAVTTPSGLIGK
VTTTGAKSATVELLTSSDVKNRVSAKVQGKENAFGIINGYDSDTKLLELKQLPYDMKFKKGQKVVTSGLGGKFPAGIFIG
TIEKVETDKMGLSQTAFIKPGADMYDLNHVTVLKRSAEAGTTDDDTTSSDTTGGQ
>Q8DMY2 ~~~mreC~~~Cell shape-determining protein MreC~~~COG1792
MNRFKKSKYVIIVFVTVLLVSALLATTYSSTIVTKLGDGISLVDRVVQKPFQWFDSVKSDLAHLTRTYNENESLKKQLYQ
LEVKSNEVESLKTENEQLRQLLDMKSKLQATKTLAADVIMRSPVSWKQELTLDAGRSKGASENMLAIANGGLIGSVSKVE
ENSTIVNLLTNTENADKISVKIQHGSTTIYGIIIGYDKENDVLKISQLNSNSDISAGDKVTTGGLGNFNVADIPVGEVVA
TTHSTDYLTREVTVKLSADTHNVDVIELVGNS
>Q01467 ~~~mreD~~~Rod shape-determining protein MreD~~~COG2891
MKRFLLPFVMMLVFSAESIFTDLVHFPFVTDDQVLAPRFLMLVLIFMSAFINQKHAMIYGFIFGFLYDMNYTSLLGVYMF
GFAGLCYLASKAFKVLHTNAFVVILIAVLAVCLLEFYVFGIQSLIHKDIMTFNGFVLDRFIPTILLNIAAALILVLPFRL
FFMSLKKELRDE
>P0ABH4 ~~~mreD~~~Rod shape-determining protein MreD~~~COG2891
MASYRSQGRWVIWLSFLIALLLQIMPWPDNLIVFRPNWVLLILLYWILALPHRVNVGTGFVMGAILDLISGSTLGVRVLA
MSIIAYLVALKYQLFRNLALWQQALVVMLLSLVVDIIVFWAEFLVINVSFRPEVFWSSVVNGVLWPWIFLLMRKVRQQFA
VQ
>Q8DMY3 ~~~mreD~~~Cell shape-determining protein MreD~~~COG2891
MRQLKRVGVFLLLPFFVLIDAHISQLLGSFFPHVHLASHFLFLFLLFETIEVSEYLYLVYCFVIGLVYDVYFFHLIGITT
LLFILLGAFLHKLNSVILLNRWTRMLAMIVLTFLFEMGSYLLAFMVGLTVDSMSIFIVYSLVPTMILNFLWITVFQFIFE
KYYL
>P37960 ~~~mrgA~~~Metalloregulation DNA-binding stress protein~~~COG0783
MKTENAKTNQTLVENSLNTQLSNWFLLYSKLHRFHWYVKGPHFFTLHEKFEELYDHAAETVDTIAERLLAIGGQPVATVK
EYTEHASITDGGNETSASEMVQALVNDYKQISSESKFVIGLAEENQDNATADLFVGLIEEVEKQVWMLSSYLG
>P21648 ~~~mrkD~~~Fimbria adhesin protein~~~
MKKLTLFIGLMALGTTSAWASCWQSNSAYEINMAMGRVVVSPDLPVGSVIATKTWTMPDNNTIYVTCDRNTTLKSDAKVV
AAGLVQGANKVYSTAIPGIGLRFSRKGAISMIYPDSYTTTGSSFRLVGSTFTLDIIKTSTTTGSGTLASGPYTEYGPGFT
ILKTSLNADAITIVSPSCTILGGKNMNVDIGTIKRADLKGVGTWAGGTPFDIKLECSGGVSVSGYANINTSFSGTLATNT
SANQGVLLNEKTGNSAAKGVGVQVIKDNTPLEFNKKHNIGTLQSQETRYITLPLHARFYQYAPTTSTGEVESHLVFNLTY
D
>P21649 ~~~mrkE~~~Protein MrkE~~~
MPGNKIGLLNVHHRVKLLYGEGLHIRNLTPGTEIAFYVPNNSTPQGDGVAVVMSGEKMKVIIVEDEFLAQQELSWLINTH
SQMEIVGSFDDGLDVLKFLQHNKVDAIFLDINIPSLDGVLLAQNISQFAHKPFIVFITAWKEHAVEAFELEAFDYILKPY
QESRIINMLQKLTTAWEQQNNAASGLASAAPRENDTINLIKDERIIVTSIHDIYYAEAHEKMTFVYTRRESFVMPMNITE
FVPDALNIAASGQSKFLLTVAQVSIANRF
>Q8RIL0 ~~~mrnCL~~~Mini-ribonuclease 3-like protein~~~COG1939
MDNVDFSKDIRDYSGLELAFLGDAIWELEIRKYYLQFGYNIPTLNKYVKAKVNAKYQSLIYKKIINDLDEEFKVIGKRAK
NSNIKTFPRSCTVMEYKEATALEAIIGAMYLLKKEEEIKKIINIVIKGE
>Q81J58 3.1.26.-~~~mrnC~~~Mini-ribonuclease 3~~~
MIDAKQLNSLALAYMGDAVYEQYIRYHLLQKGKVRPNQLHRLGTSFVSAKAQAKVVYHLLETAFLTEEEEAVLRRGRNAN
SGTVPKNTDVQTYRHSTAFEALIGYHHLLNNRERLDEIVYKAIAVLEEQEGGTSS
>O31418 3.1.26.-~~~mrnC~~~Mini-ribonuclease 3~~~COG1939
MLEFDTIKDSKQLNGLALAYIGDAIFEVYVRHHLLKQGFTKPNDLHKKSSRIVSAKSQAEILFFLQNQSFFTEEEEAVLK
RGRNAKSGTTPKNTDVQTYRYSTAFEALLGYLFLEKKEERLSQLVAEAIQFGTSGRKTNESAT
>A0A0H2XEK8 3.4.-.-~~~MroQ~~~Membrane-embedded CAAX protease MroQ~~~
MTRLWASLLTVIIYILSQFLPLLIVKKLPFVQYSGIELTKAVIYIQLVLFLIAATTIILINLKIKNPTKLELEVKEPKKY
IIPWALLGFALVMIYQMVVSIVLTQIYGGQQVSPNTEKLIIIARKIPIFIFFVSIIGPLLEEYVFRKVIFGELFNAIKGN
RIVAFIIATTVSSLIFALAHNDFKFIPVYFGMGVIFSLAYVWTKRLAVPIIIHMLQNGFVVIFQLLNPEALKKATEQANF
IYHIFIP
>Q9RGZ5 ~~~mrpA~~~Na(+)/H(+) antiporter subunit A~~~COG1009
MTVLHWATISPFLLAILIPFLYKYARRIHTGWFVLVLPLVLFIYFIQYLSITSTGGVVEHTIPWVPSLGINFTVFVDGLS
LLFALLITGIGTLVILYSIFYLSKKTESLNNFYVYLLMFMGAMLGVVLSDNLIVLYVFWELTSLASSLLISYWFHREKST
YGAQKSMLITVFGGFAMLGGFSLLYVMTGTFSIRGIIENVDLVTSSELFLPAMILVLLGAFTKSAQFPFHIWLPDAMEAP
TPVSAYLHSATMVKAGIYLVARLTPVFAGSAEWFWLLTGFGVVTLLWGSTSAVRQKDLKGILAFSTVSQLGLIMTLLGLG
SAAIYFGESVDPAFYSFAIMAAIFHLINHATFKGSLFMTAGIIDHETGTRDIRKLGGLMAIMPVTFTVSLIGLASMAGLP
PFNGFLSKEMFFTALLRATEMNAFNMETFGIIIVVLAWIASVFTFLYCLIMFFKTFTGKFKPENYDVKVHEAPIGMLISP
VILGSLVIVFGFFPNILAYTIIEPAMQAVLPTVLADGELFHVNIYMWHGFNAELFMTMGVVAAGIILFLMMKNWAKAAFY
MKERDPLNWFYDSSLSGVITGSQFVTRIQMTGLLRDYFAYMIVFMILLLGYTMFRYDAFAIDTTNVTEIAPYIWVITIVF
IVATLSIPFINKRITAVVVVGVIGFLLALLFVVFRAPDLALTQLLIETVTVLLLMLAFYHLPELRKEEFKPRFNVLNLII
SIGVGFFITAIALSSLALGNEAGIEPISQFFIENSKELAGGYNMVNVILVDFRGLDTMLEVLVLGIAALGVIALIKLRMT
GREDV
>Q9K2S2 ~~~mrpA~~~Na(+)/H(+) antiporter subunit A~~~COG1009
MQLLHLAILSPFLFAFIIPFLAKYAKRVHTGWFVLILPVLLFIYFLPMIRMTQSGETLRSVLEWIPSLGINFTVYIDGLG
LLFALLITGIGSLVTLYSIFYLSKEKEQLGPFYVYLLMFMGAMLGVVLVDNVMVLYMFWELTSLSSFLLIGYWYKREKSR
YGAAKSLLITVSGGLCMLGGFILLYLITDSFSIREMVHQVQLIAGHELFIPAMILILLGAFTKSAQFPFYIWLPDAMEAP
TPVSAYLHSATMVKAGIYVIARFSPIFAFSAQWFWIVSLVGLFTMVWGSFHAVKQTDLKSILAFSTVSQLGMIISMLGVS
AAALHYGHTEYYTVAAMAAIFHLINHATFKGSLFMAVGIIDHETGTRDIRKLGGLMAIMPITFTISLIGTFSMAGLPPFN
GFLSKEMFFTSMLRVTHFDLFNVQTWGVLFPLFAWIGSVFTFIYSMKLLFKTFRGNYQPEQLEKQAHEAPVGMLVPPVIL
VALAVSLFFFPNILSYSLIEPAMNSIYPTLLDGHEKFHVHISQWHGVTTELLMTAGIVVIGTIGYLSLNKWKGIYKLFPS
KLTLNRLYDKLLTMMEKGSYRVTKQYMTGFLRDYLLYIFAGFIILIGGAFAIKGGFSFKTEGMAKIGVYEIILTLVMISA
TVATVFARSRLTAIIALGVVGYTLALFFVIFRAPDLALTQLVIETISVALFLLCFYHLPKLRLKTKTRTFRMTNFIISLG
VGVIVTLLGIASSSQRTKDSIASFFVKHSHDLGGGDNVVNVILVDFRGFDTMFEITVLTIAALGIYSMIKTKVKEEGKSG
E
>Q03011 ~~~mrpA~~~Major MR/P fimbria protein~~~COG3539
MKLNKLALVLGLGLSVVAGSALAADQGHGTVKFVGSIIDAPCSITPDTENQTVPLGQISTAALKDGGRSNSRDFKISLEN
CTTETYKTVQTTFTGSEATEVLEGSLGIEGIAKNAAVVITDAGGKQIKLGTPSAAQNLRDGNNDLNFAAYLQGSASEAAV
PGDFTAIATFALTYQ
>Q9RGZ4 ~~~mrpB~~~Na(+)/H(+) antiporter subunit B~~~COG2111
MKNLKSNDVLLHTLTRVVTFIILAFSVYLFFAGHNNPGGGFIGGLMTASALLLMYLGFDMRSIKKAIPFDFTKMIAFGLL
IAIFTGFGGLLVGDPYLTQYFEYYQIPILGETELTTALPFDLGIYLVVIGIALTIILTIAEDDM
>O05259 ~~~mrpB~~~Na(+)/H(+) antiporter subunit B~~~COG2111
MNEQKTNDLILQTATKLVSFIILLFSFYLFLSGHNAPGGGFVGGLITSSSIVLLLLAYDLKTVRSLLPVNFIYVAGAGLL
LAVLTGVGSFVFGAPFLTHTFGYFQLPILGKTELATATIFDLGVYLVVVGITMTIIQTIGEEE
>Q9RGZ3 ~~~mrpC~~~Na(+)/H(+) antiporter subunit C~~~COG1006
MEILMSITAGVLFMVGTYLILTKSLLRVVVGLILLSHGAHLLLLTMAGLQRGAPPLLHLEATTYSDPLPQALILTAIVIS
FGVTSFLLVLAYRTYKEHKTDDLDQLRGSADE
>O05260 ~~~mrpC~~~Na(+)/H(+) antiporter subunit C~~~COG1006
MEILMAVLAGIIFMAATYLLLSKSLLRVIIGTALLSHGVHLMLLTMGGLKKGAAPILSEHAKSFVDPLPQALILTAIVIS
FGVTSFILVMAFRAYQELKSDDMDQMRGNDQHE
>Q9RGZ2 ~~~mrpD~~~Na(+)/H(+) antiporter subunit D~~~COG0651
MNNLVILPILIPFIVGSFLILFAKHHSLQRVISGFAVVGMLLVSIYLAVDVYQNGITVLELGNWQAPFGIVLVADLFATM
MVILASIVGVVCLFFAFQTISSEREKYYFYPFYFFLLAGVNGAFLTGDLFNLFVFFEVMLIASYILIVLGGTKYQLRESL
KYVVINVFASILFIVGVAYIYSITGTLNMADLAVKVGELEQTGVLNVIAVIFLVVFAMKGGLFPLYFWLPRSYFGPPAAI
AALFGGLLTKVGIYAIMRTFTLIFNHDPGFTHTLILILAGLTMFFGVLGAVSQFDFKRILSYHIISQVGYMVMGLGIYTQ
LAIAGAIYYIAHHIIVKAALFLFAGATQRITGTTDLKKMGGLLKTHPWLAWMFFISAISLAGIPPLSGFFSKFALILAAF
LNENYIIAAVALAVGLLTLFSMMKIFIYAFWGEQKHTEEQANFKVGKLLLPIVPLVALTIILGFAAEPIFQYSLQVADQI
LDPTIYIESVLKE
>O05229 ~~~mrpD~~~Na(+)/H(+) antiporter subunit D~~~COG0651
MNNFVILPILIPLLSAILLIFMTKNLMLMRIFSTAASAIGIVISGILVQTVFTKGIQTLSLGGWKAPYGIVLAADQFASL
LVLTTAIIGLLVGLYSFRSVGEKRERSFYYSGVQFLLAGVSGAFLTGDLFNMYVFFELLLIASYMLIVLGGTKIQLRESL
KYIVFNIVSSALFVIGVGFLYAVTGTLNMADLSVKISESGQTGLITVIGVLLLLVFGMKGGIFPLYFWLPGSYYAPPAAI
SALFGALLTKVGLYAITRVFTLIFIHDTAFTHQLMIWLAALTVIFGVIGSLAYSNVMKIVIYNIITAVGVILFGVAVHTP
ASIQGAIYYLIHDMLIKGALFMLAGTLIALTGTASLHKMGGLIKRYPVLGWMFFISAISLAGIPPLSGFVGKFKIAEGGF
AEGEFTISMLILLSSLLVLYSVLRIFIHAFWGEEKETPKPNHRTAKGLLYPAAIFLLLSLLFGLGTEWVSPYVDQAAETL
LNPEKYIEAVLKE
>Q9RGZ1 ~~~mrpE~~~Na(+)/H(+) antiporter subunit E~~~COG1863
MAFQILLNLVIAVIWVNFQNSYTAVDFLIGYVVGIFILFVLRRFLRFDFYMRRIWAIIKLISLFFKELILANIDVIKIVL
SPKMNIQPGIVAVPTKLKTDWELSLLASLISLTPGTLSMDFSDDNKYIYIHAIDVPNKEKMIRDIHDTFERAILEVTK
>Q7WY60 ~~~mrpE~~~Na(+)/H(+) antiporter subunit E~~~COG1863
MAFQILLNVFLAFCWMFLSNSPSAAGFITGYILGMLSLFFFRRFFTRQFYLWKLISIIKLCFIFIKELYLANVSVMKSVL
SPKLNIRPGIFAFKTELTKDWEITMLSLLITLTPGTLVMDISDDRTILYIHAMDIEDAEKAIFDIRESFEKAIQEVSR
>Q9RGZ0 ~~~mrpF~~~Na(+)/H(+) antiporter subunit F~~~COG2212
MFQSILMIVLVVMSISLFVCFIRTLIGPTMSDRIVALDTFGINLIGFIGVIMMLQETLAYSEVVLVISILAFIGSIALSK
FIERGVVFDRG
>O05228 ~~~mrpF~~~Na(+)/H(+) antiporter subunit F~~~COG2212
MFTLILQIALGIMAVSTFLYVIRVIKGPTVPDRVVALDAIGINLIAITALVSILLKTSAFLDIILLLGILSFIGTIAFSK
FLEKGEIIENDRNR
>Q9RGY9 ~~~mrpG~~~Na(+)/H(+) antiporter subunit G~~~COG1320
MTAVEIIISIFVLIGGFLSLLGSIGIIRFPDVYGRLHAATKSATLGVISIMLATFLFFFLVHGEFVGKLLLTILFVFLTA
PVAGMMMGRSAYRVGVPLWEKSTQDDLKKMYEKKMKGSN
>O05227 ~~~mrpG~~~Na(+)/H(+) antiporter subunit G~~~COG1320
MIETAKVVVAVFILLGALICLIASFGVLRLPDVFTRAHAASKGSTLGVNMILLGVFFYLWFVTGELSAKILLGILFIFIT
SPIGGHLICRAAYNSGVKLDERSVQDDYNGIRNFVIKRKEDSYL
>G8QM64 ~~~mrpX~~~Methionine-rich peptide X~~~
MKKLAAVMLTSCLMVAVGASFADEMKKDDMKKDVMMKKDDMAKDEMKKDSMAKDGMKKDAMKKDAMMKKDGMTKDEMKK
>P24202 ~~~mrr~~~Type IV methyl-directed restriction enzyme EcoKMrr~~~COG1715
MTVPTYDKFIEPVLRYLATKPEGAAARDVHEAAADALGLDDSQRAKVITSGQLVYKNRAGWAHDRLKRAGLSQSLSRGKW
CLTPAGFDWVASHPQPMTEQETNHLAFAFVNVKLKSRPDAVDLDPKADSPDHEELAKSSPDDRLDQALKELRDAVADEVL
ENLLQVSPSRFEVIVLDVLHRLGYGGHRDDLQRVGGTGDGGIDGVISLDKLGLEKVYVQAKRWQNTVGRPELQAFYGALA
GQKAKRGVFITTSGFTSQARDFAQSVEGMVLVDGERLVHLMIENEVGVSSRLLKVPKLDMDYFE
>P43683 ~~~mrsA~~~Lantibiotic mersacidin~~~
MSQEAIIRSWKDPFSRENSTQNPAGNPFSELKEAQMDKLVGAGDMEAACTFTLPGGGGVCTLTSECIC
>D5FKJ3 2.1.1.243~~~mrsA~~~2-ketoarginine methyltransferase~~~
MNLLDSIKSENTGFETTLIKGIEPIRQFVLAISIYHLFDTKLFSLLIKHEVASPEVACNELGMEKEKLLGLFRYLKNEGI
LLETIDGFSLSKEGHALAPFEGWYVMLVGGYATTFLQMGERLQEGAGWATRDATKVGVGSCGISHFDAIPLTRSLMAQAP
GTCTKLLDLGCGNGRYLAEFCKALPQIQAWGAEPDRGGFEEAVDLIEKEGLSHRVHISHSGAVEFLDSDFDFEPDFIVLG
FVLHEILGQAGRPAVVNFLKKIVHRFPAINLIIIEVDNQFDNAGAMRHGLALAYYNPYYLLHCFTNQLLVQDADWLDIFA
EAGLSLVTRETTSDQVDSTGLEIGYLLRRA
>Q9RC23 4.1.1.-~~~mrsD~~~Mersacidin decarboxylase~~~
MSISILKDKKLLIGICGSISSVGISSYLLYFKSFFKEIRVVMTKTAEDLIPAHTVSYFCDHVYSEHGENGKRHSHVEIGR
WADIYCIIPATANILGQTANGVAMNLVATTVLAHPHNTIFFPNMNDLMWNKTVVSRNIEQLRKDGHIVIEPVEIMAFEIA
TGTRKPNRGLITPDKALLAIEKGFKERTKHPSLT
>P0DKS9 1.20.4.3~~~mrx1~~~Mycoredoxin 1~~~COG0695
MSNVTIYATDWCPYCRSLLKGLDGQEYDLIDVDQDEEAGEWVKSVNDGNRIVPTVRYSDGTHATNPLAAEVIAKIEALA
>P0A3Q9 ~~~msrAB1~~~Peptide methionine sulfoxide reductase MsrA/MsrB 1~~~COG0225
MAEIYLAGGCFWGLEEYFSRISGVLETSVGYANGQVETTNYQLLKETDHAETVQVIYDEKEVSLREILLYYFRVIDPLSI
NQQGNDRGRQYRTGIYYQDEADLPAIYTVVQEQERMLGRKIAVEVEQLRHYILAEDYHQDYLRKNPSGYCHIDVTDADKP
LIDAANYEKPSQEVLKASLSEESYRVTQEAATEAPFTNAYDQTFEEGIYVDITTGEPLFFAKDKFASGCGWPSFSRPISK
ELIHYYKDLSHGMERIEVRSRSGSAHLGHVFTDGPRELGGLRYCINSASLRFVAKDEMEKAGYGYLLPYLNK
>P0A3R0 ~~~msrAB1~~~Peptide methionine sulfoxide reductase MsrA/MsrB 1~~~COG0225
MAEIYLAGGCFWGLEEYFSRISGVLETSVGYANGQVETTNYQLLKETDHAETVQVIYDEKEVSLREILLYYFRVIDPLSI
NQQGNDRGRQYRTGIYYQDEADLPAIYTVVQEQERMLGRKIAVEVEQLRHYILAEDYHQDYLRKNPSGYCHIDVTDADKP
LIDAANYEKPSQEVLKASLSEESYRVTQEAATEAPFTNAYDQTFEEGIYVDITTGEPLFFAKDKFASGCGWPSFSRPISK
ELIHYYKDLSHGMERIEVRSRSGSAHLGHVFTDGPRELGGLRYCINSASLRFVAKDEMEKAGYGYLLPYLNK
>P60753 7.5.2.6~~~msbA~~~ATP-dependent lipid A-core flippase~~~COG1132
MHNDKDLSTWQTFRRLWPTIAPFKAGLIVAGVALILNAASDTFMLSLLKPLLDDGFGKTDRSVLVWMPLVVIGLMILRGI
TSYVSSYCISWVSGKVVMTMRRRLFGHMMGMPVSFFDKQSTGTLLSRITYDSEQVASSSSGALITVVREGASIIGLFIMM
FYYSWQLSIILIVLAPIVSIAIRVVSKRFRNISKNMQNTMGQVTTSAEQMLKGHKEVLIFGGQEVETKRFDKVSNRMRLQ
GMKMVSASSISDPIIQLIASLALAFVLYAASFPSVMDSLTAGTITVVFSSMIALMRPLKSLTNVNAQFQRGMAACQTLFT
ILDSEQEKDEGKRVIERATGDVEFRNVTFTYPGRDVPALRNINLKIPAGKTVALVGRSGSGKSTIASLITRFYDIDEGEI
LMDGHDLREYTLASLRNQVALVSQNVHLFNDTVANNIAYARTEQYSREQIEEAARMAYAMDFINKMDNGLDTVIGENGVL
LSGGQRQRIAIARALLRDSPILILDEATSALDTESERAIQAALDELQKNRTSLVIAHRLSTIEKADEIVVVEDGVIVERG
THNDLLEHRGVYAQLHKMQFGQ
>Q8FJB1 7.5.2.6~~~msbA~~~ATP-dependent lipid A-core flippase~~~COG1132
MHNDKDLSTWQTFRRLWPTIAPFKAGLIVAGVALILNAASDTFMLSLLKPLLDDGFGKTDRSVLMWMPLVVIGLMILRGI
TSYISSYCISWVSGKVVMTMRRRLFGHMMGMPVSFFDKQSTGTLLSRITYDSEQVASSSSGALITVVREGASIIGLFIMM
FYYSWQLSIILIVLAPIVSIAIRVVSKRFRNISKNMQNTMGQVTTSAEQMLKGHKEVLIFGGQEVETKRFDKVSNRMRLQ
GMKMVSASSISDPIIQLIASLALAFVLYAASFPSVMDSLTAGTITVVFSSMIALMRPLKSLTNVNAQFQRGMAACQTLFT
ILDSEQEKDEGKRVIERATGDVEFRNVTFTYPGRDVPALRNINLKIPAGKTVALVGRSGSGKSTIASLITRFYDIDEGEI
LMDGHDLREYTLASLRNQVALVSQNVHLFNDTVANNIAYARTEQYSREQIEEAARMAYAMDFINKMDNGLDTVIGENGVL
LSGGQRQRIAIARALLRDSPILILDEATSALDTESERAIQAALDELQKNRTSLVIAHRLSTIEKADEIVVVEDGVIVERG
THNDLLEHRGVYAQLHKMQFGQ
>P60752 7.5.2.6~~~msbA~~~ATP-dependent lipid A-core flippase~~~COG1132
MHNDKDLSTWQTFRRLWPTIAPFKAGLIVAGVALILNAASDTFMLSLLKPLLDDGFGKTDRSVLVWMPLVVIGLMILRGI
TSYVSSYCISWVSGKVVMTMRRRLFGHMMGMPVSFFDKQSTGTLLSRITYDSEQVASSSSGALITVVREGASIIGLFIMM
FYYSWQLSIILIVLAPIVSIAIRVVSKRFRNISKNMQNTMGQVTTSAEQMLKGHKEVLIFGGQEVETKRFDKVSNRMRLQ
GMKMVSASSISDPIIQLIASLALAFVLYAASFPSVMDSLTAGTITVVFSSMIALMRPLKSLTNVNAQFQRGMAACQTLFT
ILDSEQEKDEGKRVIERATGDVEFRNVTFTYPGRDVPALRNINLKIPAGKTVALVGRSGSGKSTIASLITRFYDIDEGEI
LMDGHDLREYTLASLRNQVALVSQNVHLFNDTVANNIAYARTEQYSREQIEEAARMAYAMDFINKMDNGLDTVIGENGVL
LSGGQRQRIAIARALLRDSPILILDEATSALDTESERAIQAALDELQKNRTSLVIAHRLSTIEKADEIVVVEDGVIVERG
THNDLLEHRGVYAQLHKMQFGQ
>Q14JW6 7.5.2.6~~~msbA~~~ATP-dependent lipid A-core flippase~~~
MANMIDKIDLKSQGSSNLSGEMTNHQKVGTLYKRLLLQVKHLWHFLLLAAIGSIFFSAADASMIYLINPILNYGFGPGGG
ITKQSATILMLMGVGMVGLLALRSVGSFVSQYFIGSLGQKVVYKFRKDIYKRLMDLPASFFDKHSTGQIISRLLYNVDQV
IEATSTAIITVVQDGTFVIGLIVVMFVSSWQLSLFLIVVGPFLGLFISIINKKFRNLSRNTQSSMGNVTHTAEETIRNYK
EIRIFGAQQKQQNKFFKNLDYTYSQQIRTIALDALTSPVIQIIASLVLAFSLFTIAIFGTNEGDGSSWLTAGSFASFFAA
AAAILKPIKNLTKVNVVIQKAVAATEDIFYILDYPAEKETGSKELAKVDGNVTIKDLSFAFGEHKVLSGVSVDIKAGQTV
AFVGKSGSGKTTLTSIISRFYTQHEGEILLDGVDTRELTLENLRSHLSIVSQNVHLFDDTVYNNIAFGLSREVSEEEVID
ALKRANAYEFVQELSDGINTNIGNNGSKLSGGQRQRISIARALLKNAPVLIFDEATSALDNESERVVQQALESLTKSCTT
IVIAHRLSTVENADKIVVMDGGRVVESGKHQELLEQGGLYTRLYQSGLQ
>P63359 7.5.2.6~~~msbA~~~ATP-dependent lipid A-core flippase~~~
MHNDKDLSTWQTFRRLWPTIAPFKAGLIVAGIALILNAASDTFMLSLLKPLLDDGFGKTDRSVLLWMPLVVIGLMILRGI
TSYISSYCISWVSGKVVMTMRRRLFGHMMGMPVAFFDKQSTGTLLSRITYDSEQVASSSSGALITVVREGASIIGLFIMM
FYYSWQLSIILVVLAPIVSIAIRVVSKRFRSISKNMQNTMGQVTTSAEQMLKGHKEVLIFGGQEVETKRFDKVSNKMRLQ
GMKMVSASSISDPIIQLIASLALAFVLYAASFPSVMDSLTAGTITVVFSSMIALMRPLKSLTNVNAQFQRGMAACQTLFA
ILDSEQEKDEGKRVIDRATGDLEFRNVTFTYPGREVPALRNINLKIPAGKTVALVGRSGSGKSTIASLITRFYDIDEGHI
LMDGHDLREYTLASLRNQVALVSQNVHLFNDTVANNIAYARTEEYSREQIEEAARMAYAMDFINKMDNGLDTIIGENGVL
LSGGQRQRIAIARALLRDSPILILDEATSALDTESERAIQAALDELQKNRTSLVIAHRLSTIEQADEIVVVEDGIIVERG
THSELLAQHGVYAQLHKMQFGQ
>Q9KQW9 7.5.2.6~~~msbA~~~ATP-dependent lipid A-core flippase~~~COG1132
MSLHSDESNWQTFKRLWTYIRLYKAGLVVSTIALVINAAADTYMISLLKPLLDEGFGNAESNFLRILPFMILGLMFVRGL
SGFASSYCLSWVSGNVVMQMRRRLFNHFMHMPVRFFDQESTGGLLSRITYDSEQVAGATSRALVSIVREGASIIGLLTLM
FWNSWQLSLVLIVVAPVVAFAISFVSKRFRKISRNMQTAMGHVTSSAEQMLKGHKVVLSYGGQEVERKRFDKVSNSMRQQ
TMKLVSAQSIADPVIQMIASLALFAVLFLASVDSIRAELTPGTFTVVFSAMFGLMRPLKALTSVTSEFQRGMAACQTLFG
LMDLETERDNGKYEAERVNGEVDVKDVTFTYQGKEKPALSHVSFSIPQGKTVALVGRSGSGKSTIANLFTRFYDVDSGSI
CLDGHDVRDYKLTNLRRHFALVSQNVHLFNDTIANNIAYAAEGEYTREQIEQAARQAHAMEFIENMPQGLDTVIGENGTS
LSGGQRQRVAIARALLRDAPVLILDEATSALDTESERAIQAALDELQKNKTVLVIAHRLSTIEQADEILVVDEGEIIERG
RHADLLAQDGAYAQLHRIQFGE
>P77338 ~~~mscK~~~Mechanosensitive channel MscK~~~COG1196
MTMFQYYKRSRHFVFSAFIAFVFVLLCQNTAFARASSNGDLPTKADLQAQLDSLNKQKDLSAQDKLVQQDLTDTLATLDK
IDRIKEETVQLRQKVAEAPEKMRQATAALTALSDVDNDEETRKILSTLSLRQLETRVAQALDDLQNAQNDLASYNSQLVS
LQTQPERVQNAMYNASQQLQQIRSRLDGTDVGETALRPSQKVLMQAQQALLNAEIDQQRKSLEGNTVLQDTLQKQRDYVT
ANSARLEHQLQLLQEAVNSKRLTLTEKTAQEAVSPDEAARIQANPLVKQELEINQQLSQRLITATENGNQLMQQNIKVKN
WLERALQSERNIKEQIAVLKGSLLLSRILYQQQQTLPSADELENMTNRIADLRLEQFEVNQQRDALFQSDAFVNKLEEGH
TNEVNSEVHDALLQVVDMRRELLDQLNKQLGNQLMMAINLQINQQQLMSVSKNLKSILTQQIFWVNSNRPMDWDWIKAFP
QSLKDEFKSMKITVNWQKAWPAVFIAFLAGLPLLLIAGLIHWRLGWLKAYQQKLASAVGSLRNDSQLNTPKAILIDLIRA
LPVCLIILAVGLILLTMQLNISELLWSFSKKLAIFWLVFGLCWKVLEKNGVAVRHFGMPEQQTSHWRRQIVRISLALLPI
HFWSVVAELSPLHLMDDVLGQAMIFFNLLLIAFLVWPMCRESWRDKESHTMRLVTITVLSIIPIALMVLTATGYFYTTLR
LAGRWIETVYLVIIWNLLYQTVLRGLSVAARRIAWRRALARRQNLVKEGAEGAEPPEEPTIALEQVNQQTLRITMLLMFA
LFGVMFWAIWSDLITVFSYLDSITLWHYNGTEAGAAVVKNVTMGSLLFAIIASMVAWALIRNLPGLLEVLVLSRLNMRQG
ASYAITTILNYIIIAVGAMTVFGSLGVSWDKLQWLAAALSVGLGFGLQEIFGNFVSGLIILFERPVRIGDTVTIGSFSGT
VSKIRIRATTITDFDRKEVIIPNKAFVTERLINWSLTDTTTRLVIRLGVAYGSDLEKVRKVLLKAATEHPRVMHEPMPEV
FFTAFGASTLDHELRLYVRELRDRSRTVDELNRTIDQLCRENDINIAFNQLEVHLHNEKGDEVTEVKRDYKGDDPTPAVG
>P0A742 ~~~mscL~~~Large-conductance mechanosensitive channel~~~COG1970
MSIIKEFREFAMRGNVVDLAVGVIIGAAFGKIVSSLVADIIMPPLGLLIGGIDFKQFAVTLRDAQGDIPAVVMHYGVFIQ
NVFDFLIVAFAIFMAIKLINKLNRKKEEPAAAPAPTKEEVLLTEIRDLLKEQNNRS
>P9WJN5 ~~~mscL~~~Large-conductance mechanosensitive channel~~~COG1970
MLKGFKEFLARGNIVDLAVAVVIGTAFTALVTKFTDSIITPLINRIGVNAQSDVGILRIGIGGGQTIDLNVLLSAAINFF
LIAFAVYFLVVLPYNTLRKKGEVEQPGDTQVVLLTEIRDLLAQTNGDSPGRHGGRGTPSPTDGPRASTESQ
>P68805 ~~~mscL~~~Large-conductance mechanosensitive channel~~~COG1970
MLKEFKEFALKGNVLDLAIAVVMGAAFNKIISSLVENIIMPLIGKIFGSVDFAKEWSFWGIKYGLFIQSVIDFIIIAFAL
FIFVKIANTLMKKEEAEEEAVVEENVVLLTEIRDLLREKK
>Q6G9L1 ~~~mscL~~~Large-conductance mechanosensitive channel~~~
MLKEFKEFALKGNVLDLAIAVVMGAAFNKIISSLVENIIMPLIGKIFGSVDFAKEWSFWGIKYGLFIQSVIDFIIIAFAL
FIFVKIANTLMKKEEAEEEAVVEENVVLLTEIRDLLREKK
>P68806 ~~~mscL~~~Large-conductance mechanosensitive channel~~~
MLKEFKEFALKGNVLDLAIAVVMGAAFNKIISSLVENIIMPLIGKIFGSVDFAKEWSFWGIKYGLFIQSVIDFIIIAFAL
FIFVKIANTLMKKEEAEEEAVVEENVVLLTEIRDLLREKK
>P39285 ~~~mscM~~~Miniconductance mechanosensitive channel MscM~~~COG1511
MRLIITFLMAWCLSWGAYAATAPDSKQITQELEQAKAAKPAQPEVVEALQSALNALEERKGSLERIKQYQQVIDNYPKLS
ATLRAQLNNMRDEPRSVSPGMSTDALNQEILQVSSQLLDKSRQAQQEQERAREIADSLNQLPQQQTDARRQLNEIERRLG
TLTGNTPLNQAQNFALQSDSARLKALVDELELAQLSANNRQELARLRSELAEKESQQLDAYLQALRNQLNSQRQLEAERA
LESTELLAENSADLPKDIVAQFKINRELSAALNQQAQRMDLVASQQRQAASQTLQVRQALNTLREQSQWLGSSNLLGEAL
RAQVARLPEMPKPQQLDTEMAQLRVQRLRYEDLLNKQPLLRQIHQADGQPLTAEQNRILEAQLRTQRELLNSLLQGGDTL
LLELTKLKVSNGQLEDALKEVNEATHRYLFWTSDVRPMTIAWPLEIAQDLRRLISLDTFSQLGKASVMMLTSKETILPLF
GALILVGCSIYSRRYFTRFLERSAAKVGKVTQDHFWLTLRTLFWSILVASPLPVLWMTLGYGLREAWPYPLAVAIGDGVT
ATVPLLWVVMICATFARPNGLFIAHFGWPRERVSRGMRYYLMSIGLIVPLIMALMMFDNLDDREFSGSLGRLCFILICGA
LAVVTLSLKKAGIPLYLNKEGSGDNITNHMLWNMMIGAPLVAILASAVGYLATAQALLARLETSVAIWFLLLVVYHVIRR
WMLIQRRRLAFDRAKHRRAEMLAQRARGEEEAHHHSSPEGAIEVDESEVDLDAISAQSLRLVRSILMLIALLSVIVLWSE
IHSAFGFLENISLWDVTSTVQGVESLEPITLGAVLIAILVFIITTQLVRNLPALLELAILQHLDLTPGTGYAITTITKYL
LMLIGGLVGFSMIGIEWSKLQWLVAALGVGLGFGLQEIFANFISGLIILFEKPIRIGDTVTIRDLTGSVTKINTRATTIS
DWDRKEIIVPNKAFITEQFINWSLSDSVTRVVLTIPAPADANSEEVTEILLTAARRCSLVIDNPAPEVFLVDLQQGIQIF
ELRIYAAEMGHRMPLRHEIHQLILAGFHAHGIDMPFPPFQMRLESLNGKQTGRTLTSAGKGRQAGSL
>Q8R6L9 ~~~mscS~~~Small-conductance mechanosensitive channel~~~COG0668
MWADIYHKLVEIYDIKAVKFLLDVLKILIIAFIGIKFADFLIYRFYKLYSKSKIQLPQRKIDTLTSLTKNAVRYIIYFLA
GASILKLFNIDMTSLLAVAGIGSLAIGFGAQNLVKDMISGFFIIFEDQFSVGDYVTINGISGTVEEIGLRVTKIRGFSDG
LHIIPNGEIKMVTNLTKDSMMAVVNIAFPIDEDVDKIIEGLQEICEEVKKSRDDLIEGPTVLGITDMQDSKLVIMVYAKT
QPMQKWAVERDIRYRVKKMFDQKNISFPYPRTTVILSEKKTN
>P0C0S1 ~~~mscS~~~Small-conductance mechanosensitive channel~~~COG0668
MEDLNVVDSINGAGSWLVANQALLLSYAVNIVAALAIIIVGLIIARMISNAVNRLMISRKIDATVADFLSALVRYGIIAF
TLIAALGRVGVQTASVIAVLGAAGLAVGLALQGSLSNLAAGVLLVMFRPFRAGEYVDLGGVAGTVLSVQIFSTTMRTADG
KIIVIPNGKIIAGNIINFSREPVRRNEFIIGVAYDSDIDQVKQILTNIIQSEDRILKDREMTVRLNELGASSINFVVRVW
SNSGDLQNVYWDVLERIKREFDAAGISFPYPQMDVNFKRVKEDKAA
>T0DVE4 ~~~mscS~~~Small-conductance mechanosensitive channel~~~COG0668
MDEIKTLLVDFFPQAKHFGIILIKAVIVFCIGFYFSFFLRNKTMKLLSKKDEILANFVAQVTFILILIITTIIALSTLGV
QTTSIITVLGTVGIAVALALKDYLSSIAGGIILIILHPFKKGDIIEISGLEGKVEALNFFNTSLRLHDGRLAVLPNRSVA
NSNIINSNNTACRRIEWVCGVGYGSDIELVHKTIKDVIDTMEKIDKNMPTFIGITDFGSSSLNFTIRVWAKIEDGIFNVR
SELIERIKNALDANHIEIPFNKLDIAIKNQDSSK
>Q8NTA6 2.4.1.250~~~mshA~~~D-inositol 3-phosphate glycosyltransferase~~~COG0438
MRVAMISMHTSPLQQPGTGDSGGMNVYILSTATELAKQGIEVDIYTRATRPSQGEIVRVAENLRVINIAAGPYEGLSKEE
LPTQLAAFTGGMLSFTRREKVTYDLIHSHYWLSGQVGWLLRDLWRIPLIHTAHTLAAVKNSYRDDSDTPESEARRICEQQ
LVDNADVLAVNTQEEMQDLMHHYDADPDRISVVSPGADVELYSPGNDRATERSRRELGIPLHTKVVAFVGRLQPFKGPQV
LIKAVAALFDRDPDRNLRVIICGGPSGPNATPDTYRHMAEELGVEKRIRFLDPRPPSELVAVYRAADIVAVPSFNESFGL
VAMEAQASGTPVIAARVGGLPIAVAEGETGLLVDGHSPHAWADALATLLDDDETRIRMGEDAVEHARTFSWAATAAQLSS
LYNDAIANENVDGETHHG
>A0QQZ8 2.4.1.250~~~mshA~~~D-inositol 3-phosphate glycosyltransferase~~~COG0438
MRLATDLETPRRVAVLSVHTSPLAQPGTGDAGGMNVYVLQTALQLARRGVEVEVFTRATSSADAPVVPVAPGVLVRNVVA
GPFEGLDKNDLPTQLCAFTAGVLRAEATHEPGYYDVVHSHYWLSGQVGWLARDRWAVPLVHTAHTLAAVKNAALAAGDAP
EPPLRAVGEQQVVDEADRLIVNTEVEAQQLVSLHNADRSRIDVVHPGVDLDVFTPGSRDAARAVFGLPTDQKIVAFVGRI
QPLKAPDILLRAAAKLPGVRVLIAGGPSGSGLAQPDTLVRLADELGISDRVTFLPPQSREQLVNVYRAADLVAVPSYSES
FGLVAVEAQACGTPVVAAAVGGLPVAVADGVSGALVDGHDIGDWADTISEVLDREPAALSRASAEHAAQFSWAHTVDALL
ASYSRAMSDYRARHPRPAARRSGRRFSMRRGVRT
>P9WMY7 2.4.1.250~~~mshA~~~D-inositol 3-phosphate glycosyltransferase~~~COG0438
MAGVRHDDGSGLIAQRRPVRGEGATRSRGPSGPSNRNVSAADDPRRVALLAVHTSPLAQPGTGDAGGMNVYMLQSALHLA
RRGIEVEIFTRATASADPPVVRVAPGVLVRNVVAGPFEGLDKYDLPTQLCAFAAGVLRAEAVHEPGYYDIVHSHYWLSGQ
VGWLARDRWAVPLVHTAHTLAAVKNAALADGDGPEPPLRTVGEQQVVDEADRLIVNTDDEARQVISLHGADPARIDVVHP
GVDLDVFRPGDRRAARAALGLPVDERVVAFVGRIQPLKAPDIVLRAAAKLPGVRIIVAGGPSGSGLASPDGLVRLADELG
ISARVTFLPPQSHTDLATLFRAADLVAVPSYSESFGLVAVEAQACGTPVVAAAVGGLPVAVRDGITGTLVSGHEVGQWAD
AIDHLLRLCAGPRGRVMSRAAARHAATFSWENTTDALLASYRRAIGEYNAERQRRGGEVISDLVAVGKPRHWTPRRGVGA
>Q9FCG5 2.4.1.250~~~mshA~~~D-inositol 3-phosphate glycosyltransferase~~~COG0438
MSQYVSRLGRRSPAASPRLRLNRKPRRVAMLSVHTSPLHQPGTGDAGGMNVYIVELAQRLAAINIEVEIFTRATTAALPP
AVELAPGVLVRHVDAGPYEGLAKEELPAQLCAFTHGVMQAWAGHRPGHYDLVHSHYWLSGHVGWLAAQRWGAPLVHAMHT
MAKVKNANLADGDTPEPAARVIGETQIVAASDRLIANTAEEADELVRHYAADPDKVAVVHPGVNLERFRPFPKGRVPGPG
QHGNARAAARARLGLPQDALIPLFAGRIQPLKAPDILLRAVAVLLDERPELRSRIVVPVVGGPSGSGLAKPEGLQKLAAR
LGIADVVRFRPPVGQEQLADWFRAASVLVMPSYSESFGLVAIEAQAAGTPVLAAAVGGLPVAVRDGHTGRLVHGHDPAAY
ARVLRDFADNPDLTPRMGDAAARHAQSFGWDSAAATTADVYTAAIQSYRRRVRSHHG
>P9WJN3 3.5.1.103~~~mshB~~~1D-myo-inositol 2-acetamido-2-deoxy-alpha-D-glucopyranoside deacetylase~~~COG2120
MSETPRLLFVHAHPDDESLSNGATIAHYTSRGAQVHVVTCTLGEEGEVIGDRWAQLTADHADQLGGYRIGELTAALRALG
VSAPIYLGGAGRWRDSGMAGTDQRSQRRFVDADPRQTVGALVAIIRELRPHVVVTYDPNGGYGHPDHVHTHTVTTAAVAA
AGVGSGTADHPGDPWTVPKFYWTVLGLSALISGARALVPDDLRPEWVLPRADEIAFGYSDDGIDAVVEADEQARAAKVAA
LAAHATQVVVGPTGRAAALSNNLALPILADEHYVLAGGSAGARDERGWETDLLAGLGFTASGT
>Q9F344 3.5.1.103~~~mshB~~~1D-myo-inositol 2-acetamido-2-deoxy-alpha-D-glucopyranoside deacetylase~~~COG2120
MTDLPGRRLLLVHAHPDDESINNGVTMARYAAEGAHVTLVTCTLGERGEVIPPALAHLSGAALGGHRRGELADAMRALGV
DDFRLLGGPGRYADSGMLGLSDNDDPGCLWQADVDAAAALLVDVIREVRPQVLVTYDPNGGYGHPDHIQAHRIAMRAAEL
AAEAGCPVAKVYWNRVPRSRVEDAFARLRDDLPGLPFEKAAGVEDVPGVVDDERITTEIRGEGTAYAAAKAAAMRAHATQ
ITVAEPYFVLSNDLAQPILTTEYYELVRGERGGEGRENDLFAGIAGTFDTGEATS
>A0QZY0 6.3.1.13~~~mshC~~~L-cysteine:1D-myo-inositol 2-amino-2-deoxy-alpha-D-glucopyranoside ligase~~~COG0215
MQSWSAPAIPVVPGRGPALRLFDSADRQVRPVTPGPTATMYVCGITPYDATHLGHAATYLTFDLVHRLWLDAGHTVQYVQ
NVTDVDDPLFERAERDGIDWRTLGDRETQLFREDMAALRVLPPHDYVAATDAIAEVVEMVEKLLASGAAYIVEDAEYPDV
YFRADATAQFGYESGYDRDTMLTLFAERGGDPDRPGKSDQLDALLWRAERPGEPSWPSPFGRGRPGWHVECSAIALTRIG
TGLDIQGGGSDLIFPHHEYSAAHAESVTGERRFARHYVHTGMIGWDGHKMSKSRGNLVLVSQLRAQGVDPSAIRLGLFSG
HYREDRFWSNEVLDEANARLARWRSATALPEAPDATDVIARVRQYLADDLDTPKALAALDGWCTDALSYGGHDTESPRLV
ATTVDALLGVDL
>P9WJM9 6.3.1.13~~~mshC~~~L-cysteine:1D-myo-inositol 2-amino-2-deoxy-alpha-D-glucopyranoside ligase~~~COG0215
MQSWYCPPVPVLPGRGPQLRLYDSADRQVRPVAPGSKATMYVCGITPYDATHLGHAATYVTFDLIHRLWLDLGHELHYVQ
NITDIDDPLFERADRDGVDWRDLAQAEVALFCEDMAALRVLPPQDYVGATEAIAEMVELIEKMLACGAAYVIDREMGEYQ
DIYFRADATLQFGYESGYDRDTMLRLCEERGGDPRRPGKSDELDALLWRAARPGEPSWPSPFGPGRPGWHVECAAIALSR
IGSGLDIQGGGSDLIFPHHEFTAAHAECVSGERRFARHYVHAGMIGWDGHKMSKSRGNLVLVSALRAQDVEPSAVRLGLL
AGHYRADRFWSQQVLDEATARLHRWRTATALPAGPAAVDVVARVRRYLADDLDTPKAIAALDGWVTDAVEYGGHDAGAPK
LVATAIDALLGVDL
>Q9ADA4 6.3.1.13~~~mshC~~~L-cysteine:1D-myo-inositol 2-amino-2-deoxy-alpha-D-glucopyranoside ligase~~~COG0215
MHAWPASEVPALPGQGRDLRIHDTATGGPVTLDPGPVARIYVCGITPYDATHMGHAATYNAFDLVQRVWLDTKRQVHYVQ
NVTDVDDPLLERAVRDGVDWTALAEQETALFREDMTALRMLPPQHYIGAVEAIPGIVPLVERLRDAGAAYELEGDVYFSV
EADPHFGGVSHLDAATMRLLSAERGGDPDRPGKKNPLDPMLWMAAREGEPSWDGGTLGRGRPGWHIECVAIALDHLGMGF
DVQGGGSDLAFPHHEMGASHAQALTGEFPMAKAYVHAGMVGLDGEKMSKSKGNLVFVSQLRREGVDPAAIRLTLLAHHYR
SDWEWTDQVLQDALARLDRWRAAVSRPDGPPAEALVEEIREALANDLDSPAALAAVDRWAALQQESGGTDIGAPGVVSRA
VDALLGVAL
>P9WJM7 2.3.1.189~~~mshD~~~Mycothiol acetyltransferase~~~COG0454
MTALDWRSALTADEQRSVRALVTATTAVDGVAPVGEQVLRELGQQRTEHLLVAGSRPGGPIIGYLNLSPPRGAGGAMAEL
VVHPQSRRRGIGTAMARAALAKTAGRNQFWAHGTLDPARATASALGLVGVRELIQMRRPLRDIPEPTIPDGVVIRTYAGT
SDDAELLRVNNAAFAGHPEQGGWTAVQLAERRGEAWFDPDGLILAFGDSPRERPGRLLGFHWTKVHPDHPGLGEVYVLGV
DPAAQRRGLGQMLTSIGIVSLARRLGGRKTLDPAVEPAVLLYVESDNVAAVRTYQSLGFTTYSVDTAYALAGTDN
>Q9KZV0 2.3.1.189~~~mshD~~~Mycothiol acetyltransferase~~~COG0456
MTSDDTVRPGRPRSIETLAELTPEQTDAVLALLTEAARTDGQHAVSEQGRLQLRGPAREGVVHLLLTLDGGELVGYAQLE
GTDPVEPPAAELVVHPSHRGQGHGRALGSALLAASGKRLRIWAHGGHSAARHLAQVLGLSLFRELRQLRRPLTGLDLPEP
RLPEGVSVRTFVPGQDDAAWLAVNAAAFAHHPEQGSLTQRDLDDRKAEPWFDPAGFFLAERDGELIGFHWTKVHAEERLG
EVYVLGIRPDTQGGGLGKALTTIGLRHLEGQGLPTAMLYVDADNKAAVAVYERLGFVTHETDLMYRTET
>B2DEU8 2.1.2.7~~~mshmt~~~2-methylserine hydroxymethyltransferase~~~
MTEQTKAYFNTPVHERDPLVAQALDNERKRQQDQIELIASENIVSRAVLDALGHEMTNKTLEGYPGNRFHGGGQFVDVVE
QAAIDRAKELFGCAYANVQPHSGTQANLAVFFLLLKPGDKVLSLDLAAGGHLSHGMKGNLSGRWFESHNYNVDPETEVID
YDEMERIAEEVRPTLLITGGSAYPRELDFERMGKIAKKVGAWFLVDMAHIAGLVAGGAHPSPFPHADIVTCTTTKTLRGP
RGGLILTNNEAWFKKLQSAVFPGVQGSLHSNVLAAKAVCLGEALRPDFKVYAAQVKANARVLAETLIARGVRIVSGGTDT
HIVLVDLSSKGLNGKQAEDLLARANITANKNPIPNDSPRPAEWVGMRLGVSAATTRGMKEDEFRTLGTVIADLIEAEAAG
NADGVVEGAKAKVATLTAAFPVYAH
>B2DEV1 2.1.2.7~~~mshmt~~~2-methylserine hydroxymethyltransferase~~~
MDHATRAHFTMTVGEVDPLLADALASERGRQQNQIELIASENIVSRAVLDALGHEITNKTLEGYPGNRFHGGGQFVDIAE
QAAIDRAKQLFNCGYANVQPHSGTQANLAVFFLLLKPGEKVLSLDLAAGGHLSHGMKANLSGRWFDATNYNVNPQNEVID
LDEMERLAEEIRPKLLITGGSAYPRELDFERMSRIAKKVGAYFLVDMAHIAGLVAGGVHPSPFPHADIVTCTTTKTLRGP
RGGLILTNNEEWYKKLQAAVFPGVQGSLHSNVLAAKAICLGEAMLDDFKVYARQVVANAKVLANTLAERGVRIVSGGTDT
HIVLLDLASKGLLGKQAETLLAKANITSNKNPIPGDSPRPPEWVGMRLGSSAATTRGLKEAEFRVLGTVIADLIDAEVAG
KADDVVEGAKAKIAELTNTFPVYGQ
>B2DEU7 2.1.2.7~~~mshmt~~~2-methylserine hydroxymethyltransferase~~~
MNELTRTFFNSSVHDTDPLIAQALDDERARQKNQIELIASENIVSQAVLDALGHEMTNKTLEGYPGNRFHGGGQFVDVVE
QAAIDRAKQLFNCGYANVQPHSGTQANLAVFFLLVKPGDRILSLDLAAGGHLSHGMKGNLSGRWFEAHNYNVDPQNEVIN
YDEMERIAEEVKPKLLITGGSAYPRELDFARMAQIAKKVGAFFMVDMAHIAGLVAGGAHPSPFPHADIVTCTTTKTLRGP
RGGLILTNNEEWYKKLQTAVFPGVQGSLHSNVLAAKAICLGEALRPEFRDYVAQVVKNAKVLAETLTSRGIRIVSGGTDT
HIVLLDLSSKGLNGKQAEDALARANITSNKNPIPNDSPRPAEWVGMRLGVSAATTRGMKEDEFRKLGNVVADLLEAESAG
NGPEAAEKAKVTVRELTEAFPVYAH
>Q9L0Q1 7.5.2.-~~~msiK~~~Diacetylchitobiose uptake system ATP-binding protein MsiK~~~COG3842
MATVTFDKATRVYPGSTKPAVDGLDIDIADGEFLVLVGPSGCGKSTSLRMLAGLEDVNGGAIRIGDRDVTHLPPKDRDIA
MVFQNYALYPHMSVADNMGFALKIAGVNKAEIRQKVEEAAKILDLTEYLDRKPKALSGGQRQRVAMGRAIVREPQVFLMD
EPLSNLDAKLRVSTRTQIASLQRRLGITTVYVTHDQVEAMTMGDRVAVLKDGLLQQVDSPRNMYDKPANLFVAGFIGSPA
MNLVEVPITDGGVKFGNSVVPVNRDALKAASDKGDRTVTVGVRPEHFDVVELNGGAAKTLSKDSADAPAGLAVSVNVVEE
TGADGYIYGTVEVGGETKDLVVRVSSRAVPEKGATVHVVPRPGEIHVFSSSTGERLTD
>A0A089QRB9 2.3.1.252~~~msl3~~~Mycolipanoate synthase~~~COG0604
MRTATATSVAVIGMACRLPGGIDSPQRLWEALLRGDDLVGEIPADRWDANVYYDPEPGVPGRSVSRWGAFLDDVGGFDCD
FFGLTEREATAIDPQHRLLLEVSWEAIEHAGVDPATLAESQTGVFVGLTHGDYELLSADCGAAEGPYGFTGTSNSFASGR
VAYTLGLHGPAVTVDTACSSGLTAVHQACRSLDDGESDLALAGGVVVTLEPRKSVSGSLQGMLSPTGRCHAFDEAADGFV
SGEGCVVLLLKRLPDAVRDGDRVLAIVRGTAANQDGRTVNIAAPSAQAQIAVYQQALAAAGVEASTVGMVEAHGTGTPVG
DPVEYASLAAVYGTEGPCALTSVKTNFGHLQSASGPLGLMKTILALRHGVVPQNLHFCRLPDQLAEIDTELFVPQANTSW
PDNTGQPRRAAVSSYGMSGTNVHAILEQAPVSEPAASGPELTPEAGGLALFPVSATSAEQLHVTAARLADWVDQNGNAGS
RVSMRDLGYTLSCRRAHRPVRTVVTASSFDELSAALRDVAGDQIPYQPAVGHDDRGPVWVFSGQGSQWPGMGTELLVAEP
VFAATVAAMEPVIARESGFSVTEAMSAPQTVSGIDRVQPTIFAVQVALAAALKSYGVRPGAIIGHSLGEAAAAVVAGALS
LHDGLRVICRRSRLMSRIAGSGAMASVELPGQQVLSELAIRGISDVVLSVVASPTSTVVGGATQSIRDLVAAWEQQDVLA
REVAVDVASHTPQVDPILDELLEVLAEVDPTAPEIPYYSATLWDPRERPSFTGEYWVENLRYTVRFAAAVQAALKDGYRV
FGELAPHPLLTYAVEQNAASLDMPIATLAAMRRGEQLPFGLRGFVADVHNAGAKVDFSVQYPDGRLVDAPLPSWTHRTLM
LSREDSHRSHTGAVQAVHPLLGAHVHLLEEPERHVWQAGVGTGAHPWLGDHRIHNVAAFPGAAYCEMALAAARTTLGELS
EVRDIKFEQTLLLDEQTVVSSAATIAAPGILQFAVESHQEGEPARRASAMLHALEEMPQPPGYDTNALTAAHESSMSGEE
LRKMFNSLGIQYGPAFSGLVAVHTARGDVTTVLAEVALPGAIRSQQSAYASHPALLDACFQSVLVHPEVQKATVGGLMLP
VGVRRLRNYHSTRSAHYCLARVTSSSRAGECEADLDVFDQAGTVLLTVEGLRLAAGISEHERANRVFDERLLTIEWERGE
LPEVPQIDAGSWLLLSASEADPLTAQLADALNAVGAQSTSVASASDVAQLRSLLGGRLTGVVVVTGPPTGGLTQCGRDYV
SQLVGIARELAELPGEPPRLFVVTRSAASVLPSDLANLEQAGLRGLMRVIDSEHPHLGATAIDVDNDETVAALVASQLQS
GSQEDETAWRNGIWYTARLRPGPLRPAERRTAVVEYRRDGMRLQIRTPGDLESLEFVTFDRVAPGPGEIEVAVTASSVNF
ADVLVAFGRYPTFEGYRQQLGIDFAGVVTAVGPDVTEHRIGDHVGGMSANGCWSTFVRCDARLAVTLPPELPVAAAAAVP
TASATAWYALHDLARICSDDKVLIHSGTGGVGQAAIAIARAAGCEIFATAGSAQRRQLLHDMGVEHVYDSRSTEFAEQIR
GDTDGYGVDVVLNSLPGAAQRAGIELLAFGGRFVEIGKRDIYGDTRLGLFPFRRNLSLYAVDLALLTHSHPHTVRRLLKT
VYQHTVEGTLPVPQTTHYPIHDAAVAIRLVGGAGHTGKVVLDVPRTGEGVAVVPPEQVRTSRPDGAYLVTGGLGGLGLFL
AGELAAAGCGRIVLNSRSTPSPHATRVIERLRAAGADIQVECGDIADAATAHRVVAVATASGLPVRGVLHAAAVVEDATL
ANVTDELIDRCWAPKVHGAWNIHRATAAQPLEWFCLFSSAAALVGSPGQGAYAAANSWLDAFAHWRRAQGLPATSIAWGA
WAEIGRATALAEGTGAAIAPAEGARAFQTLLRYGRAYSGYAPIMGTPWLTAFAQRSRFAEAFHATGQNQPATGKFLAELG
SLPREEWPRTVRRLVSDQISLLLRRTIDPDRPLSDYGLDSLGNLELRTRIETETGIRVSPTKITTVRGLAEHVCDELAAA
QSAPV
>Q7TXK8 2.3.1.41~~~~~~Phenolphthiocerol synthesis polyketide synthase type I Pks15/1~~~
MIEEQRTMSVEGADQQSEKLFHYLKKVAVELDETRARLREYEQRATEPVAVVGIGCRFPGGVDGPDGLWDVVSAGRDVVS
EFPTDRGWDVEGLYDPDPDAEGKTYTRWGAFLDDATGFDAGFFGIAPSEVLAMDPQQRLMLEVSWEALEHAGIDPLSLRG
SATGVYTGIFAASYGNRDTGGLQGYGLTGTSISVASGRVSYVLGLQGPAVSVDTACSSSLVAIHWAMSSLRSGECDLALA
GGVTVMGLPSIFVGFSRQRGLAADGRCKAFAAAADGTGWGEGAGVVVLERLSDARRLGHSVLAVVRGSAVNQDGASNGLT
APNGLAQQRVIQAALANAGLSAADVDVVEAHGTATTLGDPIEAQALLSTYGQGRPAEQPLWVGSIKSNMGHTQAAAGVAG
VIKMVQAMRHGVMPATLHVDEPSPRVDWTSGAVSVLTEAREWSVDGRPRRAAVSSFGISGTNAHLILEEAPVPAPAEAPV
EASESTGGPRPSMVPWVISARSAEALTAQAGRLMAHVQANPGLDPIDVGCSLASRSVFEHRAVVVGASREQLIAGLAGLA
AGEPGAGVAVGQPGSVGKTVVVFPGQGAQRIGMGRELYGELPVFAQAFDAVADELDRHLRLPLRDVIWGADADLLDSTEF
AQPALFAVEVASFAVLRDWGVLPDFVMGHSVGELAAAHAAGVLTLADAAMLVVARGRLMQALPAGGAMVAVAASEDEVEP
LLGEGVGIAAINAPESVVISGAQAAANAIADRFAAQGRRVHQLAVSHAFHSPLMEPMLEEFARVAARVQAREPQLGLVSN
VTGELAGPDFGSAQYWVDHVRRPVRFADSARHLQTLGATHFIEAGPGSGLTGSIEQSLAPAEAMVVSMLGKDRPELASAL
GAAGQVFTTGVPVQWSAVFAGSGGRRVQLPTYAFQRRRFWETPGADGPADAAGLGLGATEHALLGAVVERPDSDEVVLTG
RLSLADQPWLADHVVNGVVLFPGAGFVELVIRAGDEVGCALIEELVLAAPLVMHPGVGVQVQVVVGAADESGHRAVSVYS
RGDQSQGWLLNAEGMLGVAAAETPMDLSVWPPEGAESVDISDGYAQLAERGYAYGPAFQGLVAIWRRGSELFAEVVAPGE
AGVAVDRMGMHPAVLDAVLHALGLAVEKTQASTETRLPFCWRGVSLHAGGAGRVRARFASAGADAISVDVCDATGLPVLT
VRSLVTRPITAEQLRAAVTAAGGASDQGPLEVVWSPISVVSGGANGSAPPAPVSWADFCAGSDGDASVVVWELESAGGQA
SSVVGSVYAATHTALEVLQSWLGADRAATLVVLTHGGVGLAGEDISDLAAAAVWGMARSAQAENPGRIVLIDTDAAVDAS
VLAGVGEPQLLVRGGTVHAPRLSPAPALLALPAAESAWRLAAGGGGTLEDLVIQPCPEVQAPLQAGQVRVAVAAVGVNFR
DVVAALGMYPGQAPPLGAEGAGVVLETGPEVTDLAVGDAVMGFLGGAGPLAVVDQQLVTRVPQGWSFAQAAAVPVVFLTA
WYGLADLAEIKAGESVLIHAGTGGVGMAAVQLARQWGVEVFVTASRGKWDTLRAMGFDDDHIGDSRTCEFEEKFLAVTEG
RGVDVVLDSLAGEFVDASLRLLVRGGRFLEMGKTDIRDAQEIAANYPGVQYRAFDLSEAGPARMQEMLAEVRELFDTREL
HRLPVTTWDVRCAPAAFRFMSQARHIGKVVLTMPSALADRLADGTVVITGATGAVGGVLARHLVGAYGVRHLVLASRRGD
RAEGAAELAADLTEAGAKGQVVACDVADRAAVAGLFAQLSREYPPVRGVIHAAGVLDDAVITSLTPDRIDTVLRAKVDAA
WNLHQATSDLDLSMFVLCSSIAATVGSPGQGNYSAANAFLDGLAAHRQAAGLAGISLAWGLWEQPGGMTAHLSSRDLARM
SRSGLAPMSPAEAVELFDAALAIDHPLAVATLLDRAALDARAQAGALPALFSGLARRPRRRQIDDTGDATSSKSALAQRL
HGLAADEQLELLVGLVCLQAAAVLGRPSAEDVDPDTEFGDLGFDSLTAVELRNRLKTATGLTLPPTVIFDHPTPTAVAEY
VAQQMSGSRPTESGDPTSQVVEPAAAEVSVHA
>B2HIL7 2.3.1.41~~~~~~Phenolphthiocerol synthesis polyketide synthase type I Pks15/1~~~COG0604
MTTSGESADQQNDKLFRYLKKVAVELDEARARLREYEQRATEPVAVVGIGCRFPGGADGPEGLWDLVSQGRDAVTEFPND
RGWDTEGLFDPDPDAEGKTYTRWGAFVENATNFDAGFFGIPPSEVLAMDPQQRLMLEVSWEALEHAGIDPMSLRGSSTGV
FTGIFAPSYGGKDVGALQGYGLTGSPVSVASGRVAYVLGLEGPALSVDTACSSSLVAIHWAMASLRSGECDMALAGGVTV
MGLPSIFVGFSRQRGLAADGRCKAFAAAADGTGWGEGAGVLVLERLSDAQRNGHNVLAVVRGSAINQDGASNGLTAPNGL
AQQRVIQAALANCGLTSADVDVVEAHGTATTLGDPIEAEALLATYGQGRPTDQPLWVGSIKSNMGHTQAAAGVAGVIKMV
QAMRHGLMPASLHVDEPSKRVDWESGAVSVLAEARDWPDAGRPRRAGVSSFGISGTNAHVILEEAPAPEAVPDSESNKGE
PSLPVVPWVISARSAEALTAQAGRLLAHVQADPQSNPVDIGFSLAGRSAFEHRAVVVGADRQQLLTGLATLADGAPGAGV
VTGQAGSVGKTAVVFPGQGSQRIGMARELHDQLPVFAEAFDAVADELDRHLRIPLREVMWGSDAALLDSTEFAQPALFAV
EVALFAALQRWGLQPDFVMGHSVGELSAAYVAGVLTLADAAMLVVARGRLMQALPAGGAMVAVAAAEDEVLPSLTDGVGI
AAINAPKSVVISGAEAAVTAISDQFAQQGRRVHRLAVSHAFHSPLMEPMLEEFARIAAQVEAREPQIALVSNVTGELASA
DGGFGSAQYWVEHVRRAVRFADSARQLHTLGVTHFVEVGPGSGLTGSIEQSLAPAEAVVVSMLGKDRPEVASVLTAFGQL
FSTGMSVDWPAVFAGSGATRVDLPTYAFQRRRFWEVPGADGPADATGLGLGGAEHALLGAVVERPDSGGVVLTGRLALAD
QPWLADHVIGGVVLFPGAGFVELAIRAGDEVGCAVVEELVLAAPLVLHPGMGVQVQVIVGAADDSGNRALSVYSRGDQSE
DWLLNAEGMLGVEAASSGADLSVWPPEGAESVDISDGYAQLADRGYAYGPGFQGLVGVWRRDSELFAEVVAPSGVAVDKM
GMHPVVLDAVLHALGLTAEQNPDSDETKLPFCWRGVSLHAGGAGRVRARLTMSGPDSISVEIADAAGLPVLTVGALVTRA
MSAAQLRAAVAAAGGGAPDQGPLDVIWSPIPLSGSGTNGSAQPAVVSWADFCAGGDGGAAGDAGVVVWEPNPAGEDVVGS
VYAATHAALEVLQSWFDGDRAGTLVVLTHGAVAMPGENVSDLAGAAVWGIVRSAQAENPGRIVLVDADAAVEAAELVAVG
EPQLVVRSGAAHAARLAPAAPLLAVPADESAWRLAAGGGGTLEDLVIEPCPEVQAPLAAGQVRVAVRAVGVNFRDVVAAL
GMYPGEAPPLGAEGAGVVLEVGPQVSGVAVGDSVMGFLGGAGPLSVVDQQLITRMPQGWSFAQAAAVPVVFLTALFGLQD
LAKIQPGESVLIHAGTGGVGMAAVQLARHWGVEIFVTASRGKWDTLRAMGFDDDHIGDSRTLDFEEKFLAVTDGRGVDVV
LDSLAGDFVDASLRLLVRGGRFLEMGKTDIRDADKIAANYPGVWYRAFDLSEAGPVRMQEMLAEVRELFDTAVLHRLPVT
TWDVRCAPAAFRFMSQARHIGKVVLTMPSALADGLADATVLITGATGAVGAVLARHMLDAYGVRHLVLASRRGDRAEGAA
ELAAELSEAGANVQVVACDVADRDAVEAMLARLSGEYPPVRGVIHAAGVLDDAVISSLTPERIDTVLRAKVDAAWNLHEA
TLDLDLSMFVLCSSIAATVGSPGQGNYSAANSFLDGLAAHRQAAGLAGISVAWGLWEQSGGMAAHLSSRDLARMSRSGLA
PMNPEQAVGLLDAVLAINHPLMVATLLDRPALEARAQAGGLPPLFAGVVRRPRRRQIEDTGDAAQSKSALAERLNGLSAG
ERQDALVGLVCLQAAAVLGRPSPEDIDPEAGFQDLGFDSLTAVELRNRLKSATGLTLPPTVIFDHPTPTAIAEYVGRQIP
DSQATQAEEEKLPESDGEMVSVTA
>Q9X404 1.14.13.111~~~msmA~~~Methanesulfonate monooxygenase hydroxylase subunit alpha~~~
MPARNHTQWMATPPLQKGEWVDSRVYTDQEIFDEELEKIFKKAWVPFRHESELPKAYDFRTTSIANEPIIVTRGPDNEVR
AFLNVCPHRGMLIERRPSGSLYEGQPSGNPKRMTCMFHAWQFDMKGNCVYISREKEGYQDRLPKESVGLRRLRCEVKFGG
FVWVNLDDNPISLEDWAGEPFQCLRKTLEAEPMEVFHYHKAIVDTNYKLWHDTNCEFYHDFMHYHNRVTGFNDAYFARKN
ESFEHGHILVGTFEVNYDQYEGFESRAGLSFPHLPPNQWYMIDLFPGMNFNLRGSALRCDVVTPLGPNKVMIEFRGLGLK
SDTPEERQTRINHHNSIWGPFGRNLHEDLIGVQGQGTTMRPGQESRRILHGRQENQTIHDENGMRHYYDKWGKWMNRMPS
NPELPYNAPAIAAE
>Q9X405 1.14.13.111~~~msmB~~~Methanesulfonate monooxygenase hydroxylase subunit beta~~~
MDIQTEMTAPPLSGGLDPAQARDAADAVRNAIYRATILLDSQKWDEWLALCADNFVYDIKAWSPEINYDMTYLHGSRKDL
EALIRLLPKHNTDHSPLTRHTTIYTVDVADEGATAKGVSAFIVFQHLLDGTNSHIDAGESRLFLVGKYYDTFRIENGQAL
FTSRETRLENRRLDKGSHWPI
>P70752 1.14.13.111~~~msmC~~~Methanesulfonate monooxygenase ferredoxin subunit~~~
MSWTYLCDAADVAPNSLKLVDANDIRVVVANYGSGFRAIPPICPHMEEPLDESGVIANCVLTCTKHLWAWNLISLELLGE
TEKPLKTYELKEEDGKLLAFIAEELTYDFEEEDDMDDDFFSKS
>Q9X406 1.14.13.111~~~msmD~~~Putative methanesulfonate monooxygenase ferredoxin reductase subunit~~~
MTSLARADDLAAAPLARENCAVSVETKSGVFGFDCAPGETLLYAGLRHGLTLPHECATGTCGTCRARVMTGEVDVAWEEA
PGAARFKRDKGDVLLCQTRAVGDCVLRVPAEVAAKPARHQIPAYRTGLMENIRRLTGDVISFEVALSAPMDFDAGQFVVV
EAPGLEGARAYSMVNFTRSADRIELVVKRKPSGGFGDWLFGATAEGAKVKVFGPLGRATFHADEHKNLLMIAGGSGIAGM
MSILASAAEADHFRTRKGYLFFGVRTLADGFYLQEFAQRVVEAQGNLEVTLALSHEDPAGADHPDHPGVKLASGMVHEVA
GRAMAGRYDDLIAYVAGPPPMVDGALRTLITQGGLSPSAIRYDKFG
>Q00749 ~~~msmE~~~Multiple sugar-binding protein~~~COG1653
MKWYKKIGLLGIVGLTSVLLAACNKSKASQSKEDKVTIEYFNQKKEMDATLKKIIKDFERENPKIHVKMTSVPDAGTVLK
TRVLSGDVPDVINIYPQNMDFQEWSKAGYFYNMTGKAYLNHLKNHYANEYKVNQKVYSVPLTANVSGIYYNKTKFKELGL
KVPETWDEFVKLVEEIKAKKETPFALAGTEGWTLNGYHQLSLISVTGSANAANKYLRFSQPNSIKTSDKILKEDMVRLNL
LADDGNQQKNWKGASYNDALVAFANEKALMTPNGSWALPAIKQQDPKFEIGTFAFPGKKTGNGITVGAGDLALSISAKTK
HLKEAEKFVKYMTTARAMQKYYDVDGSPVAVKGVREDKNSPLQPLTKLAFTDKHYVWLGQHWNSEDDFFTATTNYLMTKN
AKGLADGLNAFFNPMKADVD
>P94360 7.5.2.-~~~msmX~~~Oligosaccharides import ATP-binding protein MsmX~~~COG3842
MAELRMEHIYKFYDQKEPAVDDFNLHIADKEFIVFVGPSGCGKSTTLRMVAGLEEISKGDFYIEGKRVNDVAPKDRDIAM
VFQNYALYPHMTVYDNIAFGLKLRKMPKPEIKKRVEEAAKILGLEEYLHRKPKALSGGQRQRVALGRAIVRDAKVFLMDE
PLSNLDAKLRVQMRAEIIKLHQRLQTTTIYVTHDQTEALTMATRIVVMKDGKIQQIGTPKDVYEFPENVFVGGFIGSPAM
NFFKGKLTDGLIKIGSAALTVPEGKMKVLREKGYIGKEVIFGIRPEDIHDELIVVESYKNSSIKAKINVAELLGSEIMIY
SQIDNQDFIARIDARLDIQSGDELTVAFDMNKGHFFDSETEVRIR
>P40873 1.5.3.1~~~soxA~~~Monomeric sarcosine oxidase~~~
MSIKKDYDVIVVGAGSMGMAAGYYLSKQGVKTLLVDSFHPPHTNGSHHGDTRIIRHAYGEGREYVPFALRAQELWYELEK
ETHHKIFTKTGVLVFGPKGEAPFVAETMEAAKEHSLDVDLLEGSEINKRWPGVTVPENYNAIFEKNSGVLFSENCIRAYR
ELAEANGAKVLTYTPVEDFEIAEDFVKIQTAYGSFTASKLIVSMGAWNSKLLSKLNIEIPLQPYRQVVGFFECDEKKYSN
THGYPAFMVEVPTGIYYGFPSFGGCGLKIGYHTYGQKIDPDTINREFGIYPEDEGNIRKFLETYMPGATGELKSGDVCMY
TKTPDEHFVIDLHPQFSNVAIAAGFSGHGFKFSSVVGETLSQLAVTGKTEHDISIFSINRPALKQKETI
>P40859 1.5.3.1~~~soxA~~~Monomeric sarcosine oxidase~~~
MSTHFDVIVVGAGSMGMAAGYQLAKQGVKTLLVDAFDPPHTNGSHHGDTRIIRHAYGEGREYVPLALRSQELWYELEKET
HHKIFTKTGVLVFGPKGESAFVAETMEAAKEHSLTVDLLEGDEINKRWPGITVPENYNAIFEPNSGVLFSENCIRAYREL
AEARGAKVLTHTRVEDFDISPDSVKIETANGSYTADKLIVSMGAWNSKLLSKLNLDIPLQPYRQVVGFFESDESKYSNDI
DFPGFMVEVPNGIYYGFPSFGGCGLKLGYHTFGQKIDPDTINREFGVYPEDESNLRAFLEEYMPGANGELKRGAVCMYTK
TLDEHFIIDLHPEHSNVVIAAGFSGHGFKFSSGVGEVLSQLALTGKTEHDISIFSINRPALKESLQKTTI
>P23342 1.5.3.1~~~soxA~~~Monomeric sarcosine oxidase~~~
MSTHFDVIVVGAGSMGMAAGYYLAKQGVKTLLVDSFDPPHTNGSHHGDTRIIRHAYGEGREYVPFALRAQELWYELEKET
HHKIFTQTGVLVYGPKGGSAFVSETMEAANIHSLEHELFEGKQLTDRWAGVEVPDNYEAIFEPNSGVLFSENCIQAYREL
AEAHGATVLTYTPVEDFEVTEDLVTIKTAKGSYTANKLVVSMGAWNSKLLSKLDVEIPLQPYRQVVGFFECDEAKYSNNA
HYPAFMVEVENGIYYGFPSFGGSGLKIGYHSYGQQIDPDTINREFGAYPEDEANLRKFLEQYMPGANGELKKGAVCMYTK
TPDEHFVIDLHPKYSNVAIAAGFSGHGFKFSSVVGETLAQLATTGKTEHDISIFSLNRDALKKEAVK
>P40854 1.5.3.1~~~soxA~~~Monomeric sarcosine oxidase~~~
MSPTYDVIVIGLGGMGSAAAHHLSARGARVLGLEKFGPVHNRGSSHGGSRITRQSYFEDPAYVPLLLRAYELYEELERAT
GRNVATLCGGVMAGPPDSRTVSGSLRSATEWDLAHEMLDAKEIRRRFPTLAPDDDEVALFEAKAGLLRPENMVAAHLQLA
TRQGAELRFEEPVLRWEPYRDGVRVHTGENTYTAGQLVICPGAWAPQLLADIGVPITVERQIMYWFQPKGGTGPFVPERH
PVYIWEDADGVQVYGFPAIDGPEKGAKVAFFRKGQHTTPETIDRTVHAHEVRAMADHMSALIPDLPGTFLKAATCMYSNT
PDEHFVIARHPAHPESVTVACGFSGHGFKFVPVVGEILADLALTGATAHPIGLFDPARLTAPAARGVQP
>Q07408 ~~~msp4~~~Major surface antigen 4~~~
MNYRELFTGGLSAATVCACSLLVSGAVVASPMSHEVASEGGVMGGSFYVGAAYSPAFPSVTSFDMRESSKETSYVRGYDK
SIATIDVSVPANFSKSGYTFAFSKNLITSFDGAVGYSLGGARVELEASYRRFATLADGQYAKSGAESLAAITRDANITET
NYFVVKIDEITNTSVMLNGCYDVLHTDLPVSPYVCAGIGASFVDISKQVTTKLAYRGKVGISYQFTPEISLVAGGFYHGL
FDESYKDIPAHNSVKFSGEAKASVKAHIADYGFNLGARFLFS
>A0QR29 ~~~mspA~~~Porin MspA~~~
MKAISRVLIAMVAAIAALFTSTGTSHAGLDNELSLVDGQDRTLTVQQWDTFLNGVFPLDRNRLTREWFHSGRAKYIVAGP
GADEFEGTLELGYQIGFPWSLGVGINFSYTTPNILIDDGDITAPPFGLNSVITPNLFPGVSISADLGNGPGIQEVATFSV
DVSGAEGGVAVSNAHGTVTGAAGGVLLRPFARLIASTGDSVTTYGEPWNMN
>Q9Z4I3 ~~~mspA~~~Major outer membrane protein MspA~~~
MKKALVFFVALAMIGSVFAAEPAAEAKVAEFSGNAAVTFGFDLDTVKAGFKNTTEADLKFNLMNGGDKSTTGNGVWGELK
LVVNALQIRATADVSDGHTFAIQTKKDNDGEDTIFVEIDTAKLHFNDLYVGITSGDFRYGGSFWYPNALNYKDSKEDEKY
TRSRAAKLGYDQGLVLGYEKKDLFKVELAARSKKDTTKKVEKVELVHLSAGAKIKEKEYYKTEPAAVTGDTAQDIFNDGT
LVSVTADPKVKTLNAEGAYYKPVMKDDETNYWTNKFALGLYGEVTPIKDLRIGVGAAYVLGQLGAAASEDDKTNDISVFA
GVDYRFNFNEDFFIQPTVTYNFYNDYKVASKNYAIETNKMNAGLRFGFAKSKSDSENESLLYTFFGQEKLFYETTKNDKG
DQILLPGVSVFGSFNFKENAMKTELPVMLTFYSGELVQNLKAYALFGANLGPDAGKGTAVAMSDAVYKGIIEKKGMQAGL
AASYDVKVNDAVTIVPAAGVLWTHGSQASGNDKMSADEVAVSLKADVKGLVSNTTFTAFWEKASFGKGAASVGGTKTSVD
AVKKGVIGLKAKIAL
>A0QPU4 ~~~mspB~~~Porin MspB~~~
MTAFKRVLIAMISALLAGTTGMFVSAGAAHAGLDNELSLVDGQDRTLTVQQWDTFLNGVFPLDRNRLTREWFHSGRAKYI
VAGPGADEFEGTLELGYQIGFPWSLGVGINFSYTTPNILIDDGDITAPPFGLNSVITPNLFPGVSISADLGNGPGIQEVA
TFSVDVSGPAGGVAVSNAHGTVTGAAGGVLLRPFARLIASTGDSVTTYGEPWNMN
>A0R3I3 ~~~mspC~~~Porin MspC~~~
MKAISRVLIAMISALAAAVAGLFVSAGTSHAGLDNELSLVDGQDRTLTVQQWDTFLNGVFPLDRNRLTREWFHSGRAKYI
VAGPGADEFEGTLELGYQIGFPWSLGVGINFSYTTPNILIDDGDITGPPFGLESVITPNLFPGVSISADLGNGPGIQEVA
TFSVDVSGPAGGVAVSNAHGTVTGAAGGVLLRPFARLIASTGDSVTTYGEPWNMN
>A0R541 ~~~mspD~~~Porin MspD~~~
MRYLVMMFALLVSVTLVSPRPANAVDNQLSVVDGQGRTLTVQQAETFLNGVFPLDRNRLTREWFHSGRATYHVAGPGADE
FEGTLELGYQVGFPWSLGVGINFSYTTPNILIDGGDITQPPFGLDTIITPNLFPGVSISADLGNGPGIQEVATFSVDVKG
AKGAVAVSNAHGTVTGAAGGVLLRPFARLIASTGDSVTTYGEPWNMN
>P0A081 1.8.4.11~~~msrA1~~~Peptide methionine sulfoxide reductase MsrA 1~~~
MNINTAYFAGGCFWCMTKPFDTFDGIEKVTSGYMGGHIENPTYEQVKSGTSGHLETVEIQYDVALFSYNKLLEIFFSVID
PLDTGGQYQDRGPQYQTAIFYTNDHQKELAETYIEQLKNTINADKAIATKILPASQFYKAEDYHQDFYKKNPERYAEEQK
IRQEYKNKQ
>P0A082 1.8.4.11~~~msrA1~~~Peptide methionine sulfoxide reductase MsrA 1~~~
MNINTAYFAGGCFWCMTKPFDTFDGIEKVTSGYMGGHIENPTYEQVKSGTSGHLETVEIQYDVALFSYNKLLEIFFSVID
PLDTGGQYQDRGPQYQTAIFYTNDHQKELAETYIEQLKNTINADKAIATKILPASQFYKAEDYHQDFYKKNPERYAEEQK
IRQEYKNKQ
>P0A086 1.8.4.11~~~msrA2~~~Peptide methionine sulfoxide reductase MsrA 2~~~COG0225
MTKEYATLAGGCFWCMVKPFTSYPGIKSVVSGYSGGHVDNPTYEQVCTNQTGHVEAVQITFDPEVTSFENILDIYFKTFD
PTDDQGQFFDRGESYQPVIFYHDEHQKKAAEFKKQQLNEQGIFKKPVITPIKPYKNFYPAEDYHQDYYKKNPVHYYQYQR
GSGRKAFIESHWGNQNA
>P65446 1.8.4.11~~~msrA2~~~Peptide methionine sulfoxide reductase MsrA 2~~~
MTKEYATLAGGCFWCMVKPFTSYPGIKSVVSGYSGGHVDNPTYEQVCTNKTGHVEAVQITFDPEVTSFENILDIYFKTFD
PTDDQGQFFDRGESYQPVIFYHDEHQKKAAEFKKQQLNEQGIFKKPVITPIKPYKNFYPAEDYHQDYYKKNPVHYYQYQR
GSGRKAFIESHWGNQNA
>Q92Y45 1.8.4.11~~~msrA3~~~Peptide methionine sulfoxide reductase MsrA 3~~~
MTKRAVLAGGCFWGMQDLIRKLPGVIETRVGYTGGDVPNATYRNHGTHAEGIEIIFDPERISYRRILELFFQIHDPTTKD
RQGNDIGTSYRSAIYYVDDEQKRIAQETIADVEASGLWPGKVVTEVEPVRDFWEAEPEHQNYLERYPNGYTCHFPRPNWV
LPRRSAAE
>P86890 ~~~msrAB~~~Peptide methionine sulfoxide reductase msrA/msrB~~~
MGAIMQANENMGSKLPKTDGKVIYLAGGCFWGLEAYMERIYGVVDASSGYANGKTQSTNYQKLHESDHAESVKVVYDPKK
ISLDKLLRYYFKVVDPVSVNKQGNDVGRQYRTGIYYVDNADKKVIDNALKELQKSVKGKIAIEVEPLKNYVRAEIYHQDY
LKKNPNGYCHIDLKKADEVIVDSDKYTKPSDEVLKKKLTQLQYEVTQNKRTEKPFENEYYNKEEEGIYVDITTGEPLFSM
ADKYDSGCGWPSFSKPISKDVVKYEDDESLNMRRTEVLSRIGKAHLGHVFNDGPKELGGLRYCINSASLRFIPLKDMEKE
GYGEFIPYIKKGELKKYIHDKTH
>P45213 ~~~msrAB~~~Peptide methionine sulfoxide reductase MsrA/MsrB~~~COG0225
MKLSKTFLFITALCCATPTLAIQNSTSSSGEQKMAMENTQNIREIYLAGGCFWGMEAYMERIHGVKDAISGYANGNTEKT
SYQMIGLTDHAETVKVTYDANQISLDKLLKYYFKVIDPTSVNKQGNDRGRQYRTGIYYQDGADKAVIGQALAQLQTKYKK
PVQIEVQPLKNYIVAEEYHQDYLKKNPNGYCHIDITKADEPVIDEKDYPKPSDAELKAKLTPLQYSVTQNKHTERSFSNE
YWDNFQPGIYVDITTGEPVFSSNDKFESGCGWPSFTKPIIKDVVHYETDNSFNMQRTEVLSRAGNAHLGHVFDDGPKDKG
GLRYCINSASIKFIPLAEMEKAGYGYLIQSIKK
>O25011 ~~~msrAB~~~Peptide methionine sulfoxide reductase MsrA/MsrB~~~COG0225
MKVLSYLKNFYLFLAIGAIMQASENMGSQHQKTDERVIYLAGGCFWGLEAYMERIYGVIDASSGYANGKTSSTNYEKLHE
SDHAESVKVIYDPKKISLDKLLRYYFKVVDPVSVNKQGNDVGRQYRTGIYYVNSADKEVIDHALKALQKEVKGKIAIEVE
PLKNYVRAEEYHQDYLKKHPSGYCHIDLKKADEVIVDDDKYTKPSDEVLKKKLTKLQYEVTQNKHTEKPFENEYYNKEEE
GIYVDITTGEPLFSSADKYDSGCGWPSFSKPINKDVVKYEDDESLNRKRIEVLSRIGKAHLGHVFNDGPKELGGLRYCIN
SAALRFIPLKDMEKEGYGEFIPYIKKGELKKYINDKKSH
>P14930 ~~~msrAB~~~Peptide methionine sulfoxide reductase MsrA/MsrB~~~
MKHRTFFSLCAKFGCLLALGACSPKIVDAGTATVPHTLSTLKTADNRPASVYLKKDKPTLIKFWASWCPLCLSELGQAEK
WAQDAKFSSANLITVASPGFLHEKKDGEFQKWYAGLNYPKLPVVTDNGGTIAQNLNISVYPSWALIGKDGDVQRIVKGSI
NEAQALALIRNPNADLGSLKHSFYKPDTQKKDSAIMNTRTIYLAGGCFWGLEAYFQRIDGVVDAVSGYANGNTENPSYED
VSYRHTGHAETVKVTYDADKLSLDDILQYYFRVVDPTSLNKQGNDTGTQYRSGVYYTDPAEKAVIAAALKREQQKYQLPL
VVENEPLKNFYDAEEYHQDYLIKNPNGYCHIDIRKADEPLPGKTKAAPQGKGFDAATYKKPSDAELKRTLTEEQYQVTQN
SATEYAFSHEYDHLFKPGIYVDVVSGEPLFSSADKYDSGCGWPSFTRPIDAKSVTEHDDFSFNMRRTEVRSRAADSHLGH
VFPDGPRDKGGLRYCINGASLKFIPLEQMDAAGYGALKGEVK
>Q9JWM8 ~~~msrAB~~~Peptide methionine sulfoxide reductase MsrA/MsrB~~~
MKHRTFFSLCAKFGCLLALGACSPKIVDAGAATVPHTLSTLKTADNRPASVYLKKDKPTLIKFWASWCPLCLSELGQTEK
WAQDAKFSSANLITVASPGFLHEKKDGDFQKWYAGLNYPKLPVVTDNGGTIAQSLNISVYPSWALIGKDGDVQRIVKGSI
NEAQALALIRDPNADLGSLKHSFYKPDTQKKDSKIMNTRTIYLAGGCFWGLEAYFQRIDGVVDAVSGYANGNTKNPSYED
VSYRHTGHAETVKVTYDADKLSLDDILQYFFRVVDPTSLNKQGNDTGTQYRSGVYYTDPAEKAVIAAALKREQQKYQLPL
VVENEPLKNFYDAEEYHQDYLIKNPNGYCHIDIRKADEPLPGKTKTAPQGKGFDAATYKKPSDAELKRTLTEEQYQVTQN
SATEYAFSHEYDHLFKPGIYVDVVSGEPLFSSADKYDSGCGWPSFTRPIDAKSVTEHDDFSYNMRRTEVRSHAADSHLGH
VFPDGPRDKGGLRYCINGASLKFIPLEQMDAAGYGALKSKVK
>P54154 1.8.4.11~~~msrA~~~Peptide methionine sulfoxide reductase MsrA~~~COG0225
MSEKKEIATFAGGCFWCMVKPFDEQPGIEKVVSGYTGGHTENPTYEEVCSETTGHREAVQITFHPDVFPYEKLLELFWQQ
IDPTDAGGQFADRGSSYRAAIFYHNDKQKELAEASKQRLAESGIFKDPIVTDILKAEPFYEAEGYHQHFYKKNPAHYQRY
RTGSGRAGFISEHWGAK
>Q93S39 1.8.4.11~~~msrA~~~Peptide methionine sulfoxide reductase MsrA~~~
MSFLDSYRKKTLMPSTDEALPGRAQPIPTSATHFVNSRPLKEPWPEGYKQVLFGMGCFWGAERLFWQVPGVYVTAVGYAG
GVTPNPTYEETCTGLTGHAEVVLVVYDPKVVSLDELLTLFWEEHDPTQGMRQGNDIGTTYRSVIYTFDKADRDVAEKSRE
AYSQALAGRGLGPITTEIEDAPELYYAEDYHQQYLAKNPNGYCGLRGTGVSCPIPLAQ
>Q6NEL2 1.8.4.11~~~msrA~~~Peptide methionine sulfoxide reductase MsrA~~~
MGWLFGAPRLVEEKDALKGGPHPVLPNPQPHAVLGTLRGQPGTETIYIGIGCYWGAEKLFWETPGVVYTSVGFAGGITPN
PTYRETCTGRTNHTEIVEVVYDPTQVTFDELVVKAMEAHDPTQGYRQGNDTGTQYRSAIYTAGPNAEQQAQRAREIVEHY
APKLAAAGLGRITTEILPLASTPAGEYYMAEDEHQQYLHKNPLGYCPHHSTGVACGIPEA
>P0A744 1.8.4.11~~~msrA~~~Peptide methionine sulfoxide reductase MsrA~~~COG0225
MSLFDKKHLVSPADALPGRNTPMPVATLHAVNGHSMTNVPDGMEIAIFAMGCFWGVERLFWQLPGVYSTAAGYTGGYTPN
PTYREVCSGDTGHAEAVRIVYDPSVISYEQLLQVFWENHDPAQGMRQGNDHGTQYRSAIYPLTPEQDAAARASLERFQAA
MLAADDDRHITTEIANATPFYYAEDDHQQYLHKNPYGYCGIGGIGVCLPPEA
>P47648 1.8.4.11~~~msrA~~~Peptide methionine sulfoxide reductase MsrA~~~COG0225
MKEIYFGGGCFWGIEKYFQLIKGVKKTSVGYLNSRIRNPSYEQVCSGYTNAVEAVKVEYEEKEISLSELIEALFEVIDPT
IRNRQGNDIGTQYRTGIYWTDSSDEKIINDKFLKLQKNYSKPIVTENKKVENYYLAEEYHQDYLKKNPNGYCHIKFD
>P9WJM5 1.8.4.11~~~msrA~~~Peptide methionine sulfoxide reductase MsrA~~~COG0225
MTSNQKAILAGGCFWGLQDLIRNQPGVVSTRVGYSGGNIPNATYRNHGTHAEAVEIIFDPTVTDYRTLLEFFFQIHDPTT
KDRQGNDRGTSYRSAIFYFDEQQKRIALDTIADVEASGLWPGKVVTEVSPAGDFWEAEPEHQDYLQRYPNGYTCHFVRPG
WRLPRRTAESALRASLSPELGT
>P54155 1.8.4.12~~~msrB~~~Peptide methionine sulfoxide reductase MsrB~~~COG0229
MAYNKEEKIKSLNRMQYEVTQNNGTEPPFQNEYWDHKEEGLYVDIVSGKPLFTSKDKFDSQCGWPSFTKPIEEEVEEKLD
TSHGMIRTEVRSRTADSHLGHVFNDGPGPNGLRYCINSAALRFVPKHKLKEEGYESYLHLFNK
>Q3JRF0 1.8.4.12~~~msrB~~~Peptide methionine sulfoxide reductase MsrB~~~
MSGDRDDPRYPYPKDDAELRRRLTPMQYEVTQHAATEPPFTGEYTDTEDAGIYHCVVCGTALFESGAKYHSGCGWPSYFK
PIDGEVIDEKMDYTHGMTRVEVRCNQCGAHLGHVFEDGPRDKTGLRYCINSAALNFEAKPERK
>P0A746 1.8.4.12~~~msrB~~~Peptide methionine sulfoxide reductase MsrB~~~COG0229
MANKPSAEELKKNLSEMQFYVTQNHGTEPPFTGRLLHNKRDGVYHCLICDAPLFHSQTKYDSGCGWPSFYEPVSEESIRY
IKDLSHGMQRIEIRCGNCDAHLGHVFPDGPQPTGERYCVNSASLRFTDGENGEEING
>E6ESW1 1.8.4.12~~~msrB~~~Peptide methionine sulfoxide reductase MsrB~~~
MTKPTEEELKQTLTDLQYAVTQENATERPFSGEYDDFYQDGIYVDIVSGEPLFSSLDKYDAGCGWPSFTKPIEKRGVKEK
ADFSHGMHRVEVRSQEADSHLGHVFTDGPLQEGGLRYCINAAALRFVPVADLEKEGYGEYLSLFK
>P0A088 1.8.4.12~~~msrB~~~Peptide methionine sulfoxide reductase MsrB~~~COG0229
MLKKDKSELTDIEYIVTQENGTEPPFMNEYWNHFAKGIYVDKISGKPLFTSEEKFHSECGWPSFSKALDDDEIIELVDKS
FGMLRTEVRSEESNSHLGHVFNDGPKESGGLRYCINSAAIQFIPYEKLEELGYGDLISHFDK
>P99065 1.8.4.12~~~msrB~~~Peptide methionine sulfoxide reductase MsrB~~~
MLKKDKSELTDIEYIVTQENGTEPPFMNEYWNHFAKGIYVDKISGKPLFTSEEKFHSECGWPSFSKALDDDEIIELVDKS
FGMVRTEVRSEESNSHLGHVFNDGPKESGGLRYCINSAAIQFIPYEKLEELGYGDLISHFDK
>P76270 1.8.4.14~~~msrC~~~Free methionine-R-sulfoxide reductase~~~COG1956
MNKTEFYADLNRDFNALMAGETSFLATLANTSALLYERLTDINWAGFYLLEDDTLVLGPFQGKIACVRIPVGRGVCGTAV
ARNQVQRIEDVHVFDGHIACDAASNSEIVLPLVVKNQIIGVLDIDSTVFGRFTDEDEQGLRQLVAQLEKVLATTDYKKFF
ASVAG
>P76342 1.8.5.-~~~msrP~~~Protein-methionine-sulfoxide reductase catalytic subunit MsrP~~~COG2041
MKKNQFLKESDVTAESVFFMKRRQVLKALGISATALSLPHAAHADLLSWFKGNDRPPAPAGKALEFSKPAAWQNNLPLTP
ADKVSGYNNFYEFGLDKADPAANAGSLKTDPWTLKISGEVAKPLTLDHDDLTRRFPLEERIYRMRCVEAWSMVVPWIGFP
LHKLLALAEPTSNAKYVAFETIYAPEQMPGQQDRFIGGGLKYPYVEGLRLDEAMHPLTLMTVGVYGKALPPQNGAPVRLI
VPWKYGFKGIKSIVSIKLTRERPPTTWNLAAPDEYGFYANVNPYVDHPRWSQATERFIGSGGILDVQRQPTLLFNGYAAQ
VASLYRGLDLRENF
>P76343 ~~~msrQ~~~Protein-methionine-sulfoxide reductase heme-binding subunit MsrQ~~~COG2717
MRLTAKQVTWLKVCLHLAGLLPFLWLVWAINHGGLGADPVKDIQHFTGRTALKFLLATLLITPLARYAKQPLLIRTRRLL
GLWCFAWATLHLTSYALLELGVNNLALLGKELITRPYLTLGIISWVILLALAFTSTQAMQRKLGKHWQQLHNFVYLVAIL
APIHYLWSVKIISPQPLIYAGLAVLLLALRYKKLRSLFNRLRKQVHNKLSV
>Q7BHL7 ~~~msrR~~~Regulatory protein MsrR~~~COG1316
MDKETNDNEYRRQSEHRTSAPKRKKKKKIRKLPIILLIVVILLIALVVYIVHSYNSGVEYAKKHAKDVKVHQFNGPVKND
GKISILVLGADKAQGGQSRTDSIMVVQYDFINKKMKMMSVMRDIYADIPGYGKHKINSAYALGGPELLRKTLDKNLGINP
EYYAVVDFTGFEKMIDELMPEGVPINVEKDMSKNIGVSLKKGNHRLNGKELLGYARFRHDPEGDFGRVRRQQQVMQTLKK
EMVNFRTVVKLPKVAGILRGYVNTNIPDSGIFQTGLSFGIRGEKDVKSLTVPIKNSYEDVNTNTDGSALQINKNTNKQAI
KDFLDED
>Q99Q02 ~~~msrR~~~Regulatory protein MsrR~~~
MDKETNDNEYRRQSEHRTSAPKRKKKKKIRKLPIILLIVVILLIALVVYIVHSYNSGVEYAKKHAKDVKVHQFNGPVKND
GKISILVLGADKAQGGQSRTDSIMVVQYDFINKKMKMMSVMRDIYADIPGYGKHKINSAYALGGPELLRKTLDKNLGINP
EYYAVVDFTGFEKMIDELMPEGVPINVEKDMSKNIGVSLKKGNHRLNGKELLGYARFRHDPEGDFGRVRRQQQVMQTLKK
EMVNFRTVVKLPKVAGILRGYVNTNIPDSGIFQTGLSFGIRGEKDVKSLTVPIKNSYEDVNTNTDGSALQINKNTNKQAI
KDFLDED
>Q5BU39 ~~~mstX~~~Protein mistic~~~
MFCTFFEKHHRKWDILLEKSTGVMEAMKVTSEEKEQLSTAIDRMNEGLDAFIQLYNESEIDEPLIQLDDDTAELMKQARD
MYGQEKLNEKLNTIIKQILSISVSEEGEKE
>Q9I1C2 1.14.14.5~~~msuD~~~Methanesulfonate monooxygenase~~~
MNVFWFLPTHGDGHFLGTSQGARPVSLPYLKQVAQAADSLGYHGVLIPTGRSCEDSWVVASALAPLTERLRFLVAIRPGI
VSPTVSARMAATLDRLSGGRLLINVVTGGDPDENRGDGIHLGHAERYEVTDEFLRVWRRVLQGEAVDFHGKHIHVENAKA
LYPPLQRPYPPLYFGGSSEAAHELAGEQVDVYLTWGEPLPAVAAKIADVRQRAARHGRTVKFGIRLHVIVRETAEEAWRA
ADRLIEHISDETIAAAQQSFARFDSEGQRRMAALHGGRRDRLEIQPNLWAGVGLVRGGAGTALVGDPRQVAERIGEYAEL
GIDSFIFSGYPHLEEAYRFAELVFPLLPEPYASLAGRGLTNLTGPFGEMIANDVLPARAGA
>Q88J85 1.5.1.38~~~msuE~~~FMN reductase (NADPH)~~~COG0431
MNARVIRVVVVSGSLRAPSRTHGLLQALVERLPAVLPKLEVHWVRIAELSASLAGSLERDSASADLQPHLQAIEQADLLL
VGSPVYRASYTGLFKHLFDLVDHQSLKGVPVVLAATGGSERHALMIDHQLRPLFAFFQAHTLPYGLYASVESFDDQRLAD
PAQFERIERVLDTVGAFFHIPVARAA
>P25240 2.1.1.72~~~~~~Type II methyltransferase M.Eco57I~~~
MKFKADQTSQKLRGGYYTPQNLADYVTKWVLSKNPKTILEPSCGDGVFIQAIANNGYNSNIELFCFELFDTEASKALERC
KLNNFSNATITEGDFLVWANECLKKNKQIFDGALGNPPFIRYQFLERNFQEQAQLVFEHLDLKFTKHTNAWVPFLLSSLA
LLKQGGRIGMVIPSEISHVMHAQSLRSYLGHVCSKIVIIDPKEIWFEDTLQGAVILLAEKKQYPDEASQGVGIVSVSGFE
FLQEDPNVLFNDTAGINGETVEGKWTKATLSIDELQLIKRVIAHPDVRKFKDIAKVDVGRYCDGANNYFLVDNETVKLYK
LERFAHPMFGRSQHCPGIIYDEKQHIENQEKGLPTNFLYIDEEFEYLSKSVKNYIKLGEVEEYHKRYKCRIRKPWFKVPS
VYSTEIGMLKRCHDAPRLIHNRVRAYTTDTAYRVSSTVTSTENLVCSFLNPITVITAELEGLFYGGGVLELVPSEIEKLY
ILIVEGLEHNVEELNLLIKDGQIERVIRQQGSLILGTLGFTQEENEKLVEIGRSLEIEGYVSRV
>O52702 2.1.1.37~~~apaLIM~~~Type II methyltransferase M.ApaLI~~~
MNKDEVVVSLFAGAGGFSSGFSQAGLKPLFGAEINADACQTYQENVGSPCHQLDLSTVDPSHIEMLTGGKRPFVVIGGPP
CQGFSTAGPRNFADPRNLLIFNYLNIVERLSPRWLIFENVEGLLTSGGGRDLARLVREFVDMGYSVRLQKVNLAAYGVPQ
TRKRVLIIGNRLGIDFQFPEELYSFDSGKAKKASGKPLAPSLAEAVAGLGPAASDKDALVPYASSEPVNAFDARMRAGNR
VEVVTHHVRVEAAERMQVELLKPGQTMKDLPPELWHESYRRRANRRVSDGTPTEKRGGAPSGIKRLHGNLQSLTITGPAA
REFIHPTEHRPLTIRECARIQTFPDKYRWVGNNASVIQQIGNAVPPLAAERLAKHLRDIDGSFGADTRPAGAMSAKLLGF
VLTEALGMSPALKSTEALLAEMHQGGFVF
>P34882 2.1.1.37~~~aquIMA~~~Type II methyltransferase M.AquIA~~~COG0270
MEKKLISLFSGAGGMDIGFHAAGFSTAVAVEQDPSCCNTLRLNMPDTPVIEGDITSITTQVILEAAKVNPLEIDLVIGGP
PCQSFSLAGKRMGMDDPRGMLVLEFLRVVREALPKCFVMENVKGMINWSKGKALEAIMTEASQPIKYAGKEYKYAVSYHV
LNAADFGVPQFRERVFIVGNRLGKTFQFPEPTHGPSNQARQIDLFGKQLKPYKTVQDAISTLPPATPPSAMALRVSQTIK
DRIKNHGY
>P54462 2.8.4.5~~~mtaB~~~Threonylcarbamoyladenosine tRNA methylthiotransferase MtaB~~~COG0621
MATVAFHTLGCKVNHYETEAIWQLFKEAGYERRDFEQTADVYVINTCTVTNTGDKKSRQVIRRAIRQNPDGVICVTGCYA
QTSPAEIMAIPGVDIVVGTQDREKMLGYIDQYREERQPINGVSNIMKARVYEELDVPAFTDRTRASLKIQEGCNNFCTFC
IIPWARGLLRSRDPEEVIKQAQQLVDAGYKEIVLTGIHTGGYGEDMKDYNFAKLLSELDTRVEGVKRIRISSIEASQITD
EVIEVLDRSDKIVNHLHIPIQSGSNTVLKRMRRKYTMEFFADRLNKLKKALPGLAVTSDVIVGFPGETEEEFMETYNFIK
EHKFSELHVFPYSKRTGTPAARMEDQVDENVKNERVHRLIALSDQLAKEYASQYENEVLEIIPEEAFKETEEENMFVGYT
DNYMKVVFKGTEDMIGKIVKVKILKAGYPYNEGQFVRVVEDEITEHMRLSS
>P34883 2.1.1.37~~~aquIMB~~~Type II methyltransferase M.AquIB~~~
MDIKNVHIKNHEQTAHAPSTLEKIRKVKQGGKLSEQTKTFGSTYRRLDPNQPSPTVTRSGYRDFIHPFEDRMLTVRELAC
LQTFPLDWEFTGTRLDSYSSKRKVTMTQFGQVGNAVPPLLAEAVAKAVSEQLLDVIDEK
>Q9X034 3.5.4.28~~~mtaD~~~5-methylthioadenosine/S-adenosylhomocysteine deaminase~~~COG0402
MIIGNCLILKDFSSEPFWGAVEIENGTIKRVLQGEVKVDLDLSGKLVMPALFNTHTHAPMTLLRGVAEDLSFEEWLFSKV
LPIEDRLTEKMAYYGTILAQMEMARHGIAGFVDMYFHEEWIAKAVRDFGMRALLTRGLVDSNGDDGGRLEENLKLYNEWN
GFEGRIFVGFGPHSPYLCSEEYLKRVFDTAKSLNAPVTIHLYETSKEEYDLEDILNIGLKEVKTIAAHCVHLPERYFGVL
KDIPFFVSHNPASNLKLGNGIAPVQRMIEHGMKVTLGTDGAASNNSLNLFFEMRLASLLQKAQNPRNLDVNTCLKMVTYD
GAQAMGFKSGKIEEGWNADLVVIDLDLPEMFPVQNIKNHLVHAFSGEVFATMVAGKWIYFDGEYPTIDSEEVKRELARIE
KELYSS
>D4ZX35 2.1.1.37~~~aplIM~~~Type II methyltransferase M.AplI~~~COG0270
MSNRLSYWEYLHQELKLNADIQSQLVVLDLFAGCGGFSLGFKAAGFQTIGYEMLADAAATYTRNLQDPCYCQTLEIGQDL
CNHPDVIIGGPPCQPFSVGGLQKGPRDSRDGLPIFIDAIARYQPEIAIFENVRGMLYKNRQYLEKIVAELERLNYRVDIK
LINAVNYGVPQKRERLFVVAYQTAWNWPEAETLAIPYTAGDAIYDTASTIPIGAKFLTPSMLEYIGRYEAKSKCVKPRDI
YLDIPCRTLTCRNLSGATSDMLRLLLPDGRRRRLTVREAARLQSFPDWFELVGSENSQFNQIGNAVPPLLAKAIAKSVKM
TLENKPSRPTDYFSPFPQQLKLPFA
>A0QR54 2.4.2.28~~~mtnP~~~S-methyl-5'-thioadenosine phosphorylase~~~COG0005
MMLGVIGGSGFYTFFGSDARAVSVETPYGPPSAPITVGTVGDHEVAFLPRHGVKHEFSPHTVPYRANLWALRSLGVRRVF
APCAVGSLTPDLGPGSIVVPDQLVDRTSGRDDTYFDSGGIHVAFADPYCPTLRAAATGLPGVVDGGTMVVIQGPRFSTRA
ESRWFASQGFTLVNMTGYPEAVLARELEMCYAAVALVTDLDAGIEVGSGVRAVDVFAEFERNMPPFKKLVFEALEAVEVE
RTCTHCLTHSGVQLPFELP
>O06401 2.4.2.28~~~mtnP~~~S-methyl-5'-thioadenosine phosphorylase~~~COG0005
MHNNGRMLGVIGGSGFYTFFGSDTRTVNSDTPYGQPSAPITIGTIGVHDVAFLPRHGAHHQYSAHAVPYRANMWALRALG
VRRVFGPCAVGSLDPELEPGAVVVPDQLVDRTSGRADTYFDFGGVHAAFADPYCPTLRAAVTGLPGVVDGGTMVVIQGPR
FSTRAESQWFAAAGCNLVNMTGYPEAVLARELELCYAAIALVTDVDAGVAAGDGVKAADVFAAFGENIELLKRLVRAAID
RVADERTCTHCQHHAGVPLPFELP
>Q2RXH9 2.4.2.28~~~mtnP~~~S-methyl-5'-thioadenosine phosphorylase~~~COG0005
MSEAYRQPVLGVIGGSGVYDIDGLEGARWQTVESPFGDVSDQILRGTLDGLEMAFLPRHGRGHVLAPSDVNYRANIDALK
RAGVTEILSVSAVGSLAEDLPPGTFVIADQFIDRTFAREKSFFGKGLVAHVSMAHPVSAWLGDRVEEVLADLAIPHRRGG
TYLCMEGPQFSTLAESNLYRQWGCHVIGMTNMPEAKLAREAEIAYCTVAMVTDFDCWHPDHDHVSVEAVVRVLLQNADKA
RSLVKAMPAKLKDRPYPLPDGSHRSLDNAIITHPDRRNPGMARKLSAVAGRVLG
>P71039 ~~~mta~~~HTH-type transcriptional activator mta~~~COG0789
MKYQVKQVAEISGVSIRTLHHYDNIELLNPSALTDAGYRLYSDADLERLQQILFFKEIGFRLDEIKEMLDHPNFDRKAAL
QSQKEILMKKKQRMDEMIQTIDRTLLSVDGGETMNKRDLFAGLSMKDIEEHQQTYADEVRKLYGKEIAEETEKRTSAYSA
DDWRTIMAEFDSIYRRIAARMKHGPDDAEIQAAVGAFRDHICQYHYDCTLDIFRGLGEVYITDERFTDSINQYGEGLAAF
LREAIIIYCDHQENPRP
>P9WIN7 ~~~~~~Low molecular weight antigen MTB12~~~
MKMVKSIAAGLTAAAAIGAAAAGVTSIMAGGPVVYQMQPVVFGAPLPLDPASAPDVPTAAQLTSLLNSLADPNVSFANKG
SLVEGGIGGTEARIADHKLKKAAEHGDLPLSFSVTNIQPAAAGSATADVSVSGPKLSSPVTQNVTFVNQGGWMLSRASAM
ELLQAAGN
>P23941 2.1.1.113~~~bamHIM~~~Type II methyltransferase M.BamHI~~~
MRFFSVFDIVKNKANQLGYTETEMYAVLKNYNVNKKDLLAYKENGVIPTDKVLNGILSYLGMTKVELELKLGRIPAGLED
VFLNNTKEIAKILENKNSVKLNEFNSIQEIKPYFYTDLGKLYNGDCLELFKQVPDENVDTIFADPPFNLDKEYDEGVTDK
NSFSGYLDWYYKWIDECIRVLKPGGSLFIYNIPKWNTYLSEYLNRKLNFRNWITVDMKFGLPIQNRLYPANYSLLYYVKG
DKPKTFNVQRIPLQTCPHCGREIKDYGGYKNKMNPKGVTLSDVWSDIYPVRHSSSKNRKFNELSVKLLDRIITMSTNEGD
VVLDPFGGSGTTFAVSEMLGRKWIGFELGNCEIIKERLKNKDKDKKLLGKVYEEKNKLFPNRVKELRKKNGLWIDDDFRQ
DHEGNSKGDKKNENNDQISLSLE
>P0DW08 2.1.1.37~~~drmMII~~~Type II methyltransferase M.Bpa9945I~~~
MIVIDLFSGAGGLSEGFHKHDFKIAAHVEKEYWACETIKTRLFYHFLKAQNDLELYHEYLRVSDNYRNIEQSRAFVFQRY
PELREKLEMEVLNRKFGNPHNDPTATSSTQMIQLIQNSLQYSRATSVDLIIGGPPCQAYSLVGRSRMKDSVGKDSRNYLF
QYYKRIVDEFKPKAFVFENVPGILTAKQGKVYQEIKESFDQIGYTVLSGTSQEDRSNVIDFADFGVPQRRKRVILFGFQK
KLNYEYPNFERHKLSWNSPLTTRDVISDLPVLKPKQGHDLRLFEYDTTQGVDQLSPYELMMREDSIGFTNHFARPIKERD
AEIYQIAIEHATQGRQIKYNELPERLKTHKNEKAFLDRFKVHWWDIIPHTVVAHISKDGHYNIHPDIEQCRSLTVREAAR
IQGFPDNYKFEGPRTAQYTQVGNAVPPLMSGIIARAVKDVINGHH
>Q9LAI2 2.1.1.113~~~bslIM~~~Type II beta methyltransferase M.BslI~~~
MNWIFNTLIQFLEDLNIDPSVVSLIDEDAKKLEEQFPKALKHPVVDEEIVYKILCEKYNLNALNVKTISETLNKEYKFGR
NSKTALKKYLDYGKEEYLIQFFNTLMLENNTYIDREYIESVLAFCEPVSKEKIKNEFIKLWNEANEVNEYGKLKDYLLGI
YSKLFSMGLENLRLIEIYNSNESLIKKVFKYESTIKELKEYCLSNQESITAGLAIKMFNEKYMELMKKEYQQDAIALKLE
EHMNQLYVDNNINEYPYIFDRGNDILLLPTEEYDFVYFHIDQDFFNRFQDENKFLDYVLSSIKQIYRVLANEKVFALKID
NIYNNEKNLKWELYPKLTIYSEHFIQTKETARFYKAYDIAKDLLSKHEFRLLENDSEKNRENILKEYFSGKISEDELFSL
VHVNMKKEHFFEFLNRFKYVHYGFTFNDCLVLDRVDKSFANGELENVISNATEILLIFYKFRADQRRIPCPSCGSLNISG
NSYPEINNRSWECKSPYCPDRSKSNRGKRYSKKSNYMQWGAIYPKSHDIIPRELIKKWRRDIIVINNEQEIFEMLVKYFS
FTDEKLLFINTNELPSVVTERENRKVVILSQKLKEKAYTSNVVVKESLEGEIEFFKNGLYLKNFTELYLPEDQRRVSPEI
NNFLNSGGRLKLIQGDSYEVLKSVEDNTFAAAVTSPPYYNAREYSQWPNLYLYFNDMYNIIKECFRTLKPGSVFLYNIAD
IVDNENIIVKSSMGNKRIPLGAYTIYFFQKAGFELLDNIIWDKGEPQSNRQKNDGKFTPHYQKPLNAYEHMFIFKKTGAP
LTLSDDWQSKRGSWIKNIVPFQPVFKINSKGENILGHTAPFPEDIPRFVANVFTKHDNDIILDPFSGSLTSAIASYKSNR
IGLGIELSPDYVELSRDRALLEGVTTKILNFN
>P10283 2.1.1.37~~~bepIM~~~Type II methyltransferase M.BepI~~~
MKVLSLFSGCGGMDLGLEGGFLAHRSSINSDLYASYISDHDENYVYLKKTGFETVFANDILPFAKLAWCNFFKNRVNQPE
NIFHLESIVDVVNNIENKQFSFPNDIDVVTGGFPCQDFSFAGKRKGFDSHKDHNGIIYNEPTEATRGQLYLWLKKVVEIT
KPKVFIAENVKGLVTLGDVKDIIQKDFRNIDDGYVVLDAQVLNAKNYGVAQNRERVIFIGISKRYANKKILDELISLQEK
SEVYPYPPYTHGTDPELKPYATLNQILAHLPEPELASTDKSQQSYSKAKLFKKTQGNIEVNMNGQAPTIRAEHHGNIEFR
RLSKENGGTNLSELHLPQRRLTVRECALIQSFPPDYEFVFNYGKANSVSASAAYKIIGNAVPPLLGFAIGRHLSQIWDKL
FKT
>P13906 2.1.1.37~~~bspRIM~~~Type II methyltransferase M.BspRI~~~
MAIKINEKGRGKFKPAPTYEKEEVRQLLMEKINEEMEAVATATSDISNDEIQYKSDKFNVLSLFCGAGGLDLGFELAGLE
QSLGTDKALEAFKDRDVYNAIRHESVFHTVYANDIFSEALQTYEKNMPNHVFIHEKDIRKIKEFPSANLVIGGFPCPGFS
EAGPRLVDDERNFLYIHFIRCLMQVQPEIFVAENVKGMMTLGGGEVFRQIVEDFGAAGYRVEARLLNARDYGVPQIRERV
IIVGVRNDIDFNYEYPEITHGNEEGLKPYVTLEEAIGDLSLDPGPYFTGSYSTIFMSRNRKKKWTDQSFTIQASGRQAPI
HPGGLPMEKVDKNKWIFPDGEENHRRLSVKEIKRIQTFPDWYEFSDGGNMKVSVNNRLDKQYKQIGNAVPVFLARAVAKS
IAQFAADYLKDNHPHEAPQMKLFI
>P22772 2.1.1.72~~~banIIIM~~~Type II methyltransferase M.BanIII~~~
MELTIEEMLIKQKETGAHYTPTDLGDIIAKRLINELKKSGISGTKKIRGLDPSCGDGELLLSLNRMGKFNNIDNIELIGI
DEDKEAIKEADFRLNEMGINDAKLSGGDFLDMVDLEGNLSLFDDDLSKIEPVDLIIANPPYVRTQVLGADRAQKLAKLFN
LKGRVDLYHAFLVAMTLQLKPGGLIGVITSNKYLANTTGESIRQFLAENYDIIEIMDLGDTKLFSGAVLQAIFFGTKKLN
KGIRQTAPANFYKIYEETDPSKTEVSIKFETLFGLLESSNTGVFNVDEKFYSVSCGKLIVPDSFKEPWVMATDEEYNWIT
NINNNSYCTIQDLCDLKVGIKTTADKVFIKSTWEELPDEIKPEVEVLKLLISTDHASKWRPLERIGNQKILYTHENLNGK
KKAIHFTQYPHALAYLETHRETLEGRKYVIKAKRNWYQIWLPQNPDHWALPKILFPDISPEPKFFYEDEGCCIDGNCYWI
IPKEENNNDILFLILGISNTKYMTNYHDIAFNNKLYPGRTRYLTQYVSNYPLPNPEANYSQEIIDVLRELLFQNPNDERK
IEIENQIENLTALAFGVERL
>P19888 2.1.1.37~~~banIM~~~Type II methyltransferase M.BanI~~~
MKIKFVDLFAGIGGIRIGFERAAKRFELETECVLSSEIDKKACETYALNFKEEPQGDIHEITSFPEFDFLLAGFPCQPFS
YAGKQQGFGDTRGTLFFEVERVLRDNRPKAFLLENVRGLVTHDKGRTLKTIISKLEELGYGVSYLLLNSSTFGVPQNRVR
IYILGILGSKPKLTLTSNVGAADSHKYKNEQISLFDESYATVKDILEDSPSEKYRCSDEFIGQLSKVVGNNFELLHGYRL
IDYRGGNSIHSWELGIKGDCTKEEIEFLNQLIANRRKKIYGTHQDGKALTLEQIRTFYNHDQLEVIIKSLLQKGYLREEE
NKFNPVCGNMSFEVFKFLDPDSISITLTSSDAHKLGVVQNNVPRRITPRECARLQGFPDDFILHSNDNFAYKQLGNSVTV
KVVEKVIEDLFQNNVNELFGQMKLANVV
>P33563 2.1.1.72~~~hsdBM~~~Type II methyltransferase M.BsuBI~~~
MTQILETVDKSRLTVNPLLKNKSELGQFFTPSSISIFMACLFSEDKLNNAKVLDAGAGIGSLTSAFLARLISENIGKADL
HLLEIDEMLEPYLSETLALFKDYIEINSQIIIDDFIEWAAYSLLDEESLLAKDKQRFTHAILNPPYKKIKSNSKHRKLLR
KAGIETVNLYSAFVALTVDLMSDGGEIVFIIPRSFCNGPYFRHFRQHLLNKTSIKHMHLFESRDKAFKDDEVLQENVISK
LEKGTVQEDVKISISTDDSFSVIRSYRYPFEKIVQPNDIEKFIHINTTNEETLIEKHPNVCYSLEELNIEVSTGPVVDFR
VKENLREMPGEGTVPLFYPNHFVGTSLEYPKMMKKPNAIIRNEKVEKWLYPNGHYVVVKRFSSKEEKRRIVAGVLTPESV
NDPVVGFENGLNVLHYNKSGISKEVAYGLYAYLNSTPVDKYFRIFNGHTQVNATDLRTMKFPSRDILISLGKWVIENIEN
VGQVEIDSKLEELLLNDRGNA
>P17044 2.1.1.37~~~hsdFM~~~Type II methyltransferase M.BsuFI~~~
MRGGNRLGAGRKVIPESEKKKRKSVYITDKLYTRIMDTDIENCNNFSQKCMALIELAMENLNKNNQEHSVKRNNILMVRD
TKSTYNKTNNNFEKQNRGIKLTFIDLFAGIGGIRLGFEDKYTKCVFSSEWDKYAAQTYEANYGEKPHGDITKINENDIPD
QDVLLAGFPCQPFSNIGKREGFAHERRNIIFDVLRILKKKQPKMFLLENVKGLLTNDNGNTFRVILDNLKSLGYSVFYEV
MDAQNFGLPQRRERIVIVGFHPDLGINDFSFPKGNPDNKVPINAILEHNPTGYSISKRLQESYLFKKDDGKPQIVDFRCT
YQVNTLVASYHKIQRLTGTFVKDGETGLRLFSELELKRLMGFPVDFKVPVSRTQMYRQFGNSVAVPMIKAVAGAMKERLL
LAEMQVLKK
>P06530 2.1.1.37~~~hsdRM~~~Type II methyltransferase M.BsuRI~~~
MTLKIDIKGRGKYKPASDYSIDDVKNVLMEKIFEESSRIINSDDDLEIIEKVDFRTDKINVLSLFSGCGGLDLGFELAGL
AAVIGEQAAMEAFKDKDRFNELRNKSIFHTIYTNDLFKEANQTYKTNFPGHVIQHEKDIRQVKYFPKCNLILGGFPCPGF
SEAGPRLIDDDRNFLYLHFIRSLIQAQPEIFVAENVKGMMTLGKGEVLNQIIEDFASAGYRVQFKLLNARDYGVPQLRER
VIIEGVRKDISFNYKYPSPTHGEETGLKPFKTLRDSIGDLVTDPGPYFTGSYSSIYMSRNRKKSWDEQSFTIQASGRQAP
LHPGGLSMKKIGKDKWVFPDGEENHRRLSVKEIARVQTFPDWFQFSQGTNSQTSINNRLDKQYKQIGNAVPVLLAKAVAS
PIANWAINYLESSPNNKIKNRERKLSIRTFLRIKTS
>Q04845 2.1.1.113~~~cfrBIM~~~Type II methyltransferase M.CfrBI~~~
MTVKNNYLNRGYLVTSTTQAKKIAKLHLEKLDLADKFSFGLPEVDDRYHVWRVPLVATGHRIGEIVINSKTSTIDYKKSS
KSPFLIKKAMEVVHTPKKVKKEKKIISKSPLNNMLLQGNCAETLKKLPDESVNLVFTSPPYYNAKPEYSEYHTYDEYLSL
LRSVIKECHRVLSEGRFFVINVSPVLIRRASRNEASKRIAVPFDLHRLFIEEGYEFIDDIHWVKPEGAGWALGRGRRFAA
DRNPLQYKPVPVTEYILVYRKKTDKLIDWNIRNHHSKEDVFDSKIGDDYEKTNLWKINPSRNRKHPATFPYGLAERVIKY
YSFKNDVILDPFAGSGTTAKAAIDLGRRFVMCEISKQYIDLIIEGTMLGGDLKIIN
>P43423 2.1.1.72~~~bseCIM~~~Type II methyltransferase M.BseCI~~~
MMSVQKANTVSRQKATGAHFTPDKLAEVIAKRILDYFKGEKNRVIRVLDPACGDGELLLAINKVAQSMNIQLELIGVDFD
IDAINIANERLSRSGHKNFRLINKDFLEMVSEGDNYDLFNIEELEPVDIIIANPPYVRTQILGAEKAQKLREKFNLKGRV
DLYQAFLVAMTQQLKSNGIIGVITSNRYLTTKGGGSTRKFLVSNFNILEIMDLGDSKFFEAAVLPAIFFGEKKNKEYQKE
NSNVPKFFKIYEQSDIEASSSVNSEFNSLIELLEVNKSGLYSVEDKTYSISLGKIISPENYKEPWILATEDEYEWFMKVN
QNAYGFIEDFAHVKVGIKTTADSVFIRSDWGELPEEQIPEDKLLRPIISADQANKWSVSLVGNNKKVLYTHEIRDGQIKA
INLEEFPRAKNYLESHKERLASRKYVLKANRNWYEIWVPHDPSLWDKPKIIFPDTSPEPKFFYEDKGSVVDGNCYWIIPK
KENSNDILFLIMGICNSKFMSKYHDIAFQNKLYAGRRRYLTQYVNKYPIPDPESIYSKEIISLVRELVNNKKETQDINEI
ENRIEKLILRAFDIESLKY
>P25263 2.1.1.37~~~hgiCIM~~~Type II methyltransferase M.HgiCI~~~
MLKFIDLFAGIGGMRLGFEQAMHELGIETACVLSSEIDKHAQTTYAMNFHEQSQGDITQIQDFPSFDFLLAGFPCQPFSY
AGKQKGFGDTRGTLFFEIERILKAYRPKGFLLENVRGLTTHDKGRTFKTILQKLHELNYGVYLILNSSNFQVPQNRLRVY
IVGLDQSQPELTITSHIGATDSHKFKQLSNQASLFDTNKIMLVRDILEDHPLDKYNCSTDFVNKLLAFIGHPIKLNGKRL
IDYRNGNSIHSWELGIKGECTSDEIQFMNALIANRRKKHFGAHQDGKKLTIEQIKTFFEHDDLDSIMQSLITKGYLQEVN
GRFNPVAGNMSFEVFKFLDPDSVSITLVSSDAHKIGVVHQNRIRRITPRECARLQGFPDSFQFHPKDSLAYRQFGNSVSV
PVVKAVILDLFKSADLASCF
>P9WPJ7 4.2.1.1~~~mtcA1~~~Beta-carbonic anhydrase 1~~~COG0288
MTVTDDYLANNVDYASGFKGPLPMPPSKHIAIVACMDARLDVYRMLGIKEGEAHVIRNAGCVVTDDVIRSLAISQRLLGT
REIILLHHTDCGMLTFTDDDFKRAIQDETGIRPTWSPESYPDAVEDVRQSLRRIEVNPFVTKHTSLRGFVFDVATGKLNE
VTP
>P9WPJ9 4.2.1.1~~~mtcA2~~~Carbonic anhydrase 2~~~COG0288
MPNTNPVAAWKALKEGNERFVAGRPQHPSQSVDHRAGLAAGQKPTAVIFGCADSRVAAEIIFDQGLGDMFVVRTAGHVID
SAVLGSIEYAVTVLNVPLIVVLGHDSCGAVNAALAAINDGTLPGGYVRDVVERVAPSVLLGRRDGLSRVDEFEQRHVHET
VAILMARSSAISERIAGGSLAIVGVTYQLDDGRAVLRDHIGNIGEEV
>P0DX06 2.1.1.383~~~mtcB~~~L-carnitine:corrinoid methyltransferase~~~
MIRNSLTDVFFAKDDVENLHEGVLRVLSKVGVKIENDEALGIFEQHGARVENGTVYIGEVLLNKALQTVPANFELQGFDR
TVQVGLDHDPVVIPTNGTPMVLNFDGSYSDTNTDDLVNFYKLIDTSDVMQVTSEIAVDVPGLDKTKDSLLAQTALLMKYS
HKPIYNILGATIHNYKKGSVAQGVRENIQFAKKYYGYDDKYVIYSGTCVISPLGVGWEAMDHFMGFIKENQPISITACSM
TNLTAPGSLYGSVVEDAAAILSIVVLSQLMNPGLPVLYTSLSSMSDMRYVQLCMGAPEFALITLGHIALANFYKIPVRVG
GALGDAFKADYQAGVESFVGLMAPMLSQSAMIPHGCGTMGSFNLTSYEKFIMDEETIRYLMRLRRGFEVSDKRKEKALKD
ITKVGPRGNFLGGRTPKEYREDNYLASEVFNRKGCKENTREEQGDIRDRARKVYDARMEAYELPDTTLEQKKLLNTELPE
QYKFDI
>P38577 ~~~mesY~~~Bacteriocin mesentericin Y105~~~
MTNMKSVEAYQQLDNQNLKKVVGGKYYGNGVHCTKSGCSVNWGEAASAGIHRLANGGNGFW
>P05302 2.1.1.37~~~ddeIM~~~Type II methyltransferase M.DdeI~~~
MNIIDLFAGCGGFSHGFKMAGYNSILAIEKDLWASQTYSFNNPNVSVITEDITTLDPGDLKISVSDVDGIIGGPPCQGFS
LSGNRDQKDPRNSLFVDFVRFVKFFSPKFFVMENVLGILSMKTKSRQYVKDIIAEEFSNVGYKVCVIILNACDYGVPQSR
QRVFFIGLKSDRPLNQQILTPPSKVIESEYTSLEEAISDLPVIEAGEGGEVQDYPVAPRNKYQENMRKGSTCVYNHVAMR
HTQRLVDRFAAIKFGQSVKHVSEEHSQRKRGDANSISGKVFSQNNMRPYPYKPCPTVAASFQSNFIHPFYNRNFTAREGA
RIQSFPDTYIFQGKRTTMSWEKHLSQYQQIGNAVPPLLAQALAERISWYFENINLINDSNVSIKRMVQRSFMSQLNLENN
VNVRQDDNYDKVHSF
>P04043 2.1.1.72~~~dpnM~~~Type II methyltransferase M1.DpnII~~~
MKIKEIKKVTLQPFTKWTGGKRQLLPVIRELIPKTYNRYFEPFVGGGALFFDLAPKDAVINDFNAELINCYQQIKDNPQE
LIEILKVHQEYNSKEYYLDLRSADRDERIDMMSEVQRAARILYMLRVNFNGLYRVNSKNQFNVPYGRYKNPKIVDEELIS
AISVYINNNQLEIKVGDFEKAIVDVRTGDFVYFDPPYIPLSETSAFTSYTHEGFSFADQVRLRDAFKRLSDTGAYVMLSN
SSSALVEELYKDFNIHYVEATRTNGAKSSSRGKISEIIVTNYEK
>P09358 2.1.1.72~~~dpnA~~~Type II methyltransferase M2.DpnII~~~
MKNNEYKYGGVLMTKPYYNKNKMILVHSDTFKFLSKMKPESMDMIFADPPYFLSNGGISNSGGQVVSVDKGDWDKISSFE
EKHEFNRKWIRLAKEVLKPNGTVWISGSLHNIYSVGMALEQEGFKILNNITWQKTNPAPNLSCRYFTHSTETILWARKND
KKARHYYNYDLMKELNDGKQMKDVWTGSLTKKVEKWAGKHPTQKPEYLLERIILASTKEGDYILDPFVGSGTTGVVAKRL
GRRFIGIDAEKEYLKIARKRLEAENETN
>P55818 ~~~mtdA~~~Bifunctional protein MdtA~~~COG0373
MSKKLLFQFDTDATPSVFDVVVGYDGGADHITGYGNVTPDNVGAYVDGTIYTRGGKEKQSTAIFVGGGDMAAGERVFEAV
KKRFFGPFRVSCMLDSNGSNTTAAAGVALVVKAAGGSVKGKKAVVLAGTGPVGMRSAALLAGEGAEVVLCGRKLDKAQAA
ADSVNKRFKVNVTAAETADDASRAEAVKGAHFVFTAGAIGLELLPQAAWQNESSIEIVADYNAQPPLGIGGIDATDKGKE
YGGKRAFGALGIGGLKLKLHRACIAKLFESSEGVFDAEEIYKLAKEMA
>O85012 1.5.1.-~~~mtdB~~~NAD(P)-dependent methylenetetrahydromethanopterin dehydrogenase~~~COG0702
MARSILHMLTPLKHMSPFDVNMAIDAGFETLIPYTGVDLTDVVSLTQDSIFSRAPQDGVRTGIFIGGKNAELALDMVDRA
KKAFVPPFVNHVFADPAGSFTTGAAMVAEVNRALKARFSTDLKGKRIVIFGGAGVVAYVAAVIGALEGAQTVLVGHDGEE
RVSKIAFTMKWRFGIDVGAVDGTLPEARRAAITDADVILSAGPAGVSILTAEDLESAPKLLVASDVNAVPPAGIAGIDVN
AVDVPLPTGKGVGIGALAVGNVKYQTQCRMFRKMLEAQEPLCLDFRDAYKLAVEIAG
>P9WLZ7 3.1.3.16~~~~~~Multidomain regulatory protein Rv1364c~~~COG1366
MAAEMDWDKTVGAAEDVRRIFEHIPAILVGLEGPDHRFVAVNAAYRGFSPLLDTVGQPAREVYPELEGQQIYEMLDRVYQ
TGEPQSGSEWRLQTDYDGSGVEERYFDFVVTPRRRADGSIEGVQLIVDDVTSRVRARQAAEARVEELSERYRNVRDSATV
MQQALLAASVPVVPGADIAAEYLVAAEDTAAGGDWFDALALGDRLVLVVGDVVGHGVEAAAVMSQLRTALRMQISAGYTV
VEALEAVDRFHKQVPGSKSATMCVGSLDFTSGEFQYCTAGHPPPLLVTADASARYVEPTGAGPLGSGTGFPVRSEVLNIG
DAILFYTDGLIERPGRPLEASTAEFADLAASIASGSGGFVLDAPARPIDRLCSDTLELLLRSTGYNDDVTLLAMQRRAPT
PPLHITLDATINAARTVRAQLREWLAEIGADHSDIADIVHAISEFVENAVEHGYATDVSKGIVVAAALAGDGNVRASVID
RGQWKDHRDGARGRGRGLAMAEALVSEARIMHGAGGTTATLTHRLSRPARFVTDTMVRRAAFQQTIDSEFVSLVESGRIV
VRGDVDSTTAATLDRQIAVESRSGIAPVTIDLSAVTHLGSAGVGALAAACDRARKQGTECVLVAPPGSPAHHVLSLVQLP
VVGADTEDIFAQE
>Q9HUH5 ~~~~~~Multidrug transporter PA4990~~~
MTNYLYLAIAIAAEVVATTSLKAVAGFSKPLPLLLVVGGYVLAFSMLVLVMRTLPVGVVYAIWSGLGIVLVSLVAMFVYG
QRLDPAALLGIGLIIAGVLVIQLFSRASGH
>P00472 2.1.1.72~~~ecoRIM~~~Type II methyltransferase M.EcoRI~~~
MARNATNKLLHKAKKSKSDEFYTQYCDIENELQYYREHFSDKVVYCNCDDPRVSNFFKYFAVNFDNLGLKKLIASCYVEN
KEGFSSSEAAKNGFYYEYHKENGKKLVFDDISVSSFCGDGDFRSSESIDLLKKSDIVVTNPPFSLFREYLDQLIKYDKKF
LIIANVNSITYKEVFNLIKENKIWLGVHLGRGVSGFIVPEHYELYGTEARIDSNGNRIISPNNCLWLTNLDVFIRHKDLP
LTRKYFGNESSYPKYDNYDAINVNKTKDIPLDYNGVMGVPITFLHKFNPEQFELIKFRKGVDEKDLSINGKCPYFRILIK
NKRLQK
>P05101 2.1.1.37~~~ecoRIIM~~~Type II methyltransferase M.EcoRII~~~
MSEFELLAQDLLEKAEAEEQLRQENDKKLLGQVLEIYDQKYVAELLRKVGKNEWSRETLNRWINGKCSPKTLTLAEEELL
RKMLPEAPAHHPDYAFRFIDLFAGIGGIRKGFETIGGQCVFTSEWNKEAVRTYKANWFNDAQEHTFNLDIREVTLSDKPE
VPENDAYAYINEHVPDHDVLLAGFPCQPFSLAGVSKKNSLGRAHGFECEAQGTLFFDVARIIRAKKPAIFVLENVKNLKS
HDKGKTFKVIMDTLDELGYEVADAAEMGKNDPKVIDGKHFLPQHRERIVLVGFRRDLNIHQGFTLRDISRFYPEQRPSFG
ELLEPVVDSKYILTPKLWEYLYNYAKKHAAKGNGFGFGLVNPENKESIARTLSARYHKDGSEILIDRGWDMATGETDFAN
EENQAHRPRRLTPRECARLMGFEKVDGRPFRIPVSDTQSYRQFGNSVVVPVFEAVAKLLEPYILKAVNADSCKVERI
>P04393 2.1.1.72~~~ecoRVM~~~Type II methyltransferase M.EcoRV~~~
MKDKVFVPPIKSQGIKTKLVPCIKRIVPKNFNGVWVEPFMGTGVVAFNVAPKDALLCDTNPHLISFYNALKNKDITGDLV
KDFLYREGEKLLLSNGEYYYEVRERFNNYKEPLDFLFLNRSCFNGMIRFNSKGGFNVPFCKKPNRFAQAYITKISNQVDR
ISEIISKGNYTFLCQSFEKTIGMVNRDDVVYCDPPYIGRHVDYFNSWGERDERLLFETLSSLNATFITSTWHHNDYRENK
YVRDLWSSFRILTKEHFYHVGASEKNRSPMVEALITNIAKDIIDHIEKSSGDILVIEE
>P14827 2.1.1.72~~~ecaIM~~~Type II methyltransferase M.EcaI~~~
MAVGLNKKEIDGPASQRAVACDLEPALPPIGARVTLNYPGKMDESIILQKKDTKYLRVGSDFAKESSLILPNSFIWSDNS
LALNRLMVEGKKAKLIYLDPPYATGMGFSSRSNEHAYDDCLTEAAYLEFMRRRLILMREILDDDGTIYVHIGHQMLGELK
CLLDEIFGRERFINLITRRKCSSKNSTKNNFANLNDYILCYSKGKKYIWNRPLKKPDAEWLAKEYPKTDSKGQFKLVPIH
APGVRHGATGGEWKGMLPPPGKHWQYTPEKLDILDASGDIHWSKTGNPRRKVYLTDDKSIGYTDYWEEFRDAHHQSILVT
GYPTEKNFNMMKLIVGASSNPGDLVIDPFCGSGSTLHAASLLQRKWIGIDESLFAAKTVMKRFAIGRAPMGDYVNTSLNK
QTELPLSLNETARHEYVSNDFNIYVDELTASVSKNELAEIQKAYRDLKANQQ
>P14871 2.1.1.72~~~fokIM~~~Type II methyltransferase M.FokI~~~
MRFIGSKVNLLDNIQEVIEENVKDDAHVFMDLFSGTGIVGENFKKDYQVLSNDSLYFSYILLKAKIENNSIPNFSELKKI
GIKEPLHYLENEEFEISHEFFLTHNYSPYMGCERMYFTVENASRIDFIRLTLNRWKNESLINELEFAYLLAILIEAVPFI
SNISGTYGAYLKHWDKRALGKLKLRTLDIGNNHYANKTYNEDANSLIEKVYGDILYIDPPYNGRQYISNYHLLETIALYD
YPEIYGKTGLRPYVESKSLYCQKKEVGNAFNHLIEKANFRHILVSYSSEGLLLEEEIESILKSHGLPETYRIYKMPYRKY
KSKHKQEASELHEYIFYIQKDIALTNSVKSNKKIEVGKHKTNSYIKSPLNYVGGKHKLLNQIVPLFPDKIDTFVDLFSGG
FNVGINVNANKIIATDINTYVVEVLDTMKKTSVEEVIAHIERRIEEYGLSKSNEEGFKAFRNYYNKTKKPLDLYTLICYS
FNYQFRFNNNQEYNNPFGRERSQFSPALKKKLVLFIEALHEKNVQFVCSEFEHFNFSQLDQNDLVYCDPPYLITTGSYND
GNRGFKDWNRLQEIKLLDILDHLNSKGVYFALSNVLSHKGLENELLLEWSKKYNIHHLQHSYSNSSHNTTRGESQEVLIT
NYTNYTK
>P76346 ~~~mtfA~~~Protein MtfA~~~COG3228
MIKWPWKVQESAHQTALPWQEALSIPLLTCLTEQEQSKLVTLAERFLQQKRLVPLQGFELDSLRSCRIALLFCLPVLELG
LEWLDGFHEVLIYPAPFVVDDEWEDDIGLVHNQRIVQSGQSWQQGPIVLNWLDIQDSFDASGFNLIIHEVAHKLDTRNGD
RASGVPFIPLREVAGWEHDLHAAMNNIQEEIELVGENAASIDAYAASDPAECFAVLSEYFFSAPELFAPRFPSLWQRFCQ
FYQQDPLQRLHHANDTDSFSATNVH
>A6TB83 ~~~mtfA~~~Protein MtfA~~~
MFKWPWKADDESGNAEMPWEQALAIPVLAHLSSTEQHKLTQMAARFLQQKRLVALQGLELTPLHQARIAMLFCLPVLELG
IEWLDGFHEVLIYPAPFIVDDEWEDDIGLVHNQRVVQSGQSWQQGPVVLNWLDIQDSFDASGFNLVVHEVAHKLDTRNGD
RASGVPLIPLREVAGWEHDLHAAMNNIQDEIDLVGESAASIDAYAATDPAECFAVLSEYFFSAPELFAPRFPALWQRFCH
FYRQDPLARRRENGLQDEGDRRIVH
>P25282 2.1.1.37~~~hgaIAM~~~Type II methyltransferase M1.HgaI~~~
MKKNIMGLSLFSSAGIGEYFLSRVGIDIIVANELIKKRADLYQKIYPNHKMVIGDIRDQRIFNKVLNIALTNQVDFLIAS
PPCQGMSVAGKNRDVSNMANDNRNYLIMYVIAMIKKLKPAYILIENVPFLLKLELYIDNKLTPIKNILEDEFGSEYHIHF
DILDAADYGTPQRRKRAIIRLNKKGTIWNLPLKQNIVSVEQAIGNLPSIESGKHSGLKWHYGRGHTEQQIEWMKHTPTGK
SAFENLVHYPRKANGEKVKGYHSSYRRIRWDEPAPTITIRNDAISSQRNVHPGRPLLDGTYSDARVLSVLELMRLTGLPD
NWEIPDDTPEILIRQIIGECIPPLLIENITREIFNEN
>P25283 2.1.1.37~~~hgaIBM~~~Type II methlytransferase M2.HgaI~~~
MKINAMSLFSSAGIGELDLHKGNLNFVVANELLKKRADTYQFFYPETKMFQGDISDEKLKREILLSAQQNNVKFLLATPP
CQGLSSVGKNKHQDHFIKDNRNFLIFEVFEFIDVLNLDFILIENVPRFIEMYFPYNGQLLLLEEILKIKYASKYQIDIVI
LNAKDYGICQSRPRAIIKMYKYGITWKLPTIQAEISLQRAIGHLPPLEPGEVSSIKWHSAPNVKPSIIEAIRHTKPGTSA
ISNPIFYPKKDNGERIKGFHNTYKRMEWDKPAPARTTYSGSISSHNNIHPGRLQLDGTYSDPRVLSLLETFIVSSIDENI
EFPPGSSETYIRTIIGEAIPPKLLSAICFPDGENINVK
>Q24SP6 2.1.1.378~~~mtgA~~~[methyl-Co(III) glycine betaine-specific corrinoid protein]--tetrahydrofolate methyltransferase~~~COG1962
MFKFTAQQHVYDINGVKVGGQPGEYPTVLIGSIFYRGHKIVSDGQKGIFDKDAAKALLDQEAELSAETGNPFIIDVLGES
VEALTKYVEFILENTTAPFLLDSISPDVRVGALKNLGKDPEIQKRLIYNSIEEHYTEEELAAIKEAGLKTAVILAFSKKA
LKPNARIDLLQGKDDKEGLIAAAKRAGIEQFLVDPGVLDVASNSWTTEAINVVKEQFGYPGGCAPSNAVYLWKKMRSKGT
PFFEVAGAAVFTYPITQGADFILYGPMMNAPWVYRAIATTDAMIAYNNKLTGVKMGTTEHPLLKIF
>P46022 2.4.1.129~~~mtgA~~~Biosynthetic peptidoglycan transglycosylase~~~COG0744
MSKSRLTVFSFVRRFLLRLMVVLAVFWGGGIALFSVAPVPFSAVMVERQVSAWLHGNFRYVAHSDWVSMDQISPWMGLAV
IAAEDQKFPEHWGFDVASIEKALAHNERNENRIRGASTISQQTAKNLFLWDGRSWVRKGLEAGLTLGIETVWSKKRILTV
YLNIAEFGDGVFGVEAAAQRYFHKPASKLTRSEAALLAAVLPNPLRFKVSSPSGYVRSRQAWILRQMYQLGGEPFMQQHQ
LD
>Q24SP7 2.1.1.376~~~mtgB~~~Glycine betaine methyltransferase~~~COG5598
MGQLLPKYNILTEDQVQKIHENTMKILEEIGIEFEYEPALEVFRREGQKVEGKRVYLTREFVESKLKSAPAEFTLHARNP
ENNVVIGGDNIVFMPGYGAPFIYELDGSRRKTTLQDYENFAKLAGASKNMHLSGGTMAEPQDIPDGVRHLQMLYSSIKNS
DKCFMGSAEGKERAEDSVEIAAILFGGKDVIKEKPVLVSLINSLTPLKYDERMLGALMAYAEAGQAVIIASLVMAGSTGP
ASLAGTLSLQNAEVLAGISLAQSINPGTPVIYGSTSALSDMRSGSLSIGSPECALFISASAQLARFYGVPSRSGGGLNDS
KTVDAQAGYESMMTLMAANLTGVNFVLHTAGILQYFMAMSYEKFIMDDEIAGMLLHYMKGYTFDEDGMAFDVIEKVGPGG
HFLTQKHTRKNHKREFYTPTLSDRSAYDTWAKEKLETKQRAHARWQQILANYVPPALDPEIDAKLQAFIAQRGKEVGEE
>P05102 2.1.1.37~~~hhaIM~~~Type II methyltransferase M.HhaI~~~
MIEIKDKQLTGLRFIDLFAGLGGFRLALESCGAECVYSNEWDKYAQEVYEMNFGEKPEGDITQVNEKTIPDHDILCAGFP
CQAFSISGKQKGFEDSRGTLFFDIARIVREKKPKVVFMENVKNFASHDNGNTLEVVKNTMNELDYSFHAKVLNALDYGIP
QKRERIYMICFRNDLNIQNFQFPKPFELNTFVKDLLLPDSEVEHLVIDRKDLVMTNQEIEQTTPKTVRLGIVGKGGQGER
IYSTRGIAITLSAYGGGIFAKTGGYLVNGKTRKLHPRECARVMGYPDSYKVHPSTSQAYKQFGNSVVINVLQYIAYNIGS
SLNFKPY
>P15446 2.1.1.37~~~hpaIIM~~~Type II methyltransferase M.HpaII~~~
MKDVLDDNLLEEPAAQYSLFEPESNPNLREKFTFIDLFAGIGGFRIAMQNLGGKCIFSSEWDEQAQKTYEANFGDLPYGD
ITLEETKAFIPEKFDILCAGFPCQAFSIAGKRGGFEDTRGTLFFDVAEIIRRHQPKAFFLENVKGLKNHDKGRTLKTILN
VLREDLGYFVPEPAIVNAKNFGVPQNRERIYIVGFHKSTGVNSFSYPEPLDKIVTFADIREEKTVPTKYYLSTQYIDTLR
KHKERHESKGNGFGYEIIPDDGIANAIVVGGMGRERNLVIDHRITDFTPTTNIKGEVNREGIRKMTPREWARLQGFPDSY
VIPVSDASAYKQFGNSVAVPAIQATGKKILEKLGNLYD
>P00473 2.1.1.72~~~hhaIIM~~~Type II methyltransferase M.HhaII~~~
MFSHQDYLSFVNKKNKMNGIDLFKLIPDKAVKIAFFDPQYRGVLDKMSYGNEGKGRGKERAALPQMTDEIIQQFINEFER
VLLPNGYLFLWVDKFHLVEGVKPWLENTPSLSVVDMLTWDKQKIGMGYRTRRRSEYLVVIQKEPKKAKITWTLHNIPDVW
AEKLQSKPHTHSKPIEMQKQLILATTQEGDLILDPASGGYSVFECCKQTNRNFIGCDLIFGDDENEQD
>P20589 2.1.1.37~~~haeIIIM~~~Type II methyltransferase M.HaeIII~~~
MNLISLFSGAGGLDLGFQKAGFRIICANEYDKSIWKTYESNHSAKLIKGDISKISSDEFPKCDGIIGGPPCQSWSEGGSL
RGIDDPRGKLFYEYIRILKQKKPIFFLAENVKGMMAQRHNKAVQEFIQEFDNAGYDVHIILLNANDYGVAQDRKRVFYIG
FRKELNINYLPPIPHLIKPTFKDVIWDLKDNPIPALDKNKTNGNKCIYPNHEYFIGSYSTIFMSRNRVRQWNEPAFTVQA
SGRQCQLHPQAPVMLKVSKNLNKFVEGKEHLYRRLTVRECARVQGFPDDFIFHYESLNDGYKMIGNAVPVNLAYEIAKTI
KSALEICKGN
>P43871 2.1.1.72~~~hindIIIM~~~Type II methyltransferase M.HindIII~~~COG0863
MIDCIYNSDSIFEIKKLDSNSIHAIISDIPYGIDYDDWDILHSNTNSALGGTSSAQHKTSLFKRRGKPLNGWSEADKKRP
QEYQEWVESWSNEWFRVLKSGSSVFVFAGRQFAHRVVVAFENSGFTFKDMLSWEKDKAPHRAQRISCVFERRGDIANTNK
WVGWRVANLRPLFEPILWFQKPYKTGSTLADNLIKHEVGAWNENSLTHWNIQQGALNHSNILKVRITSEDKGYHVAQKPL
NLMKLLIDLVTKEEQIVLDPFAGSGTTLLAAKELNRHFIGYEKNNGIYNIAVNRLGIEKNNCFYNKEKK
>P75430 6.3.3.2~~~~~~5-formyltetrahydrofolate cyclo-ligase~~~
MDKNALRKQILQKRMALSTIEKSHLDQKINQKLVAFLTPKPCIKTIALYEPIKNEVTFVDFFFEFLKINQIRAVYPKVIS
DTEIIFIDQETNTFEPNQIDCFLIPLVGFNKDNYRLGFGKGYYDRYLMQLTRQQPKIGIAYSFQKGDFLADPWDVQLDLI
INDE
>Q9HZK1 2.4.2.44~~~~~~S-methyl-5'-thioinosine phosphorylase~~~
MSVYAIIGGTGLTQLEGLTLSESLPIETPYGAPSAPLQRGRYAGREVLFLARHGHPHRFPPHQVNYRANLWALKQAGAEA
VIAVNAVGGIHAAMGTGHLCVPHQLIDYTSGREHTYFAGDIEHVTHIDFSHPYDEPLRQRLIEALRALGLAHSSHGVYAC
TQGPRLETVAEIARLERDGNDIVGMTGMPEAALARELDLPYACLALVVNPAAGKSAGIITMAEIEQALHDGIGKVREVLA
RVLAG
>P09424 1.1.1.17~~~mtlD~~~Mannitol-1-phosphate 5-dehydrogenase~~~COG0246
MKALHFGAGNIGRGFIGKLLADAGIQLTFADVNQVVLDALNARHSYQVHVVGETEQVDTVSGVNAVSSIGDDVVDLIAQV
DLVTTAVGPVVLERIAPAIAKGQVKRKEQGNESPLNIIACENMVRGTTQLKGHVMNALPEDAKAWVEEHVGFVDSAVDRI
VPPSASATNDPLEVTVETFSEWIVDKTQFKGALPNIPGMELTDNLMAFVERKLFTLNTGHAITAYLGKLAGHQTIRDAIL
DEKIRAVVKGAMEESGAVLIKRYGFDADKHAAYIQKILGRFENPYLKDDVERVGRQPLRKLSAGDRLIKPLLGTLEYGLP
HKNLIEGIAAAMHFRSEDDPQAQELAALIADKGPQAALAQISGLDANSEVVSEAVTAYKAMQ
>Q83PQ0 1.1.1.17~~~mtlD~~~Mannitol-1-phosphate 5-dehydrogenase~~~
MKALHFGAGNIGRGFIGKLLADAGIQLTFADVNQVVLDALNARHSYQVHVVGETEQVDTVSGVNAVSSIGDDVVDLIAQV
DLVTTAVGPVVLERIAPAIAKGLVKRKEQGNESPLNIIACENMVRGTTQLKGHVMNALPEDAKAWVEEHVGFVDSAVDRI
VPPSASATNDPLEVTVETFSEWIVDKTQFKGALPNIPGMELTDNLMAFVERKLFTLNTGHAITAYLGKLAGHQTIRDAIL
DEKIRAVVKGAMEESGAVLIKRYGFDADKHAAYIQKILGRFENPYLKDDVERVGRQPLRKLSAGDRLIKPLLGTLEYSLP
HKNLIQGIAGAMHFRSEDDPQAQELAALIADKGPQAALAQISGLDANSEVVSEAVTAYKAMQ
>Q2FW96 1.1.1.17~~~mtlD~~~Mannitol-1-phosphate 5-dehydrogenase~~~COG0246
MKAVHFGAGNIGRGFIGYILADNNVKVTFADVNEEIINALAHDHQYDVILADESKTTTRVNNVDAINSMQPSEALKQAIL
EADIITTAVGVNILPIIAKSFAPFLKEKTNHVNIVACENAIMATDTLKKAVLDITGPLGNNIHFANSAVDRIVPLQKNEN
ILDVMVEPFYEWVVEKDAWYGPELNHIKYVDDLTPYIERKLLTVNTGHAYLAYAGKFAGKATVLDAVEDSSIEAGLRRVL
AETSQYITNEFDFTEAEQAGYVEKIIDRFNNSYLSDEVTRVGRGTLRKIGPKDRIIKPLTYLYNKDLERTGLLNTAALLL
KYDDTADQETVEKNNYIKEHGLKAFLSEYAKVDDGLADEIIEAYNSLS
>P99140 1.1.1.17~~~mtlD~~~Mannitol-1-phosphate 5-dehydrogenase~~~
MKAVHFGAGNIGRGFIGYILADNNVKVTFADVNEEIINALAHDHQYDVILADESKTTTRVNNVDAINSMQPSEALKQAIL
EADIITTAVGVNILPIIAKSFAPFLKEKTNHVNIVACENAIMATDTLKKAVLDITGPLGNNIHFANSAVDRIVPLQKNEN
ILDVMVEPFYEWVVEKDAWYGPELNHIKYVDDLTPYIERKLLTVNTGHAYLAYAGKFAGKATVLDAVKDSSIEAGLRRVL
AETSQYITNEFDFTEAEQAGYVEKIIDRFNNSYLSDEVTRVGRGTLRKIGPKDRIIKPLTYLYNKDLERTGLLNTAALLL
KYDDTADQETVEKNNYIKEHGLKAFLSEYAKVDDGLADEIIEAYNSLS
>P33216 1.1.1.67~~~mtlK~~~Mannitol 2-dehydrogenase~~~
MTRSVTRPSYDRKALTPGIVHIGVGNFHRAHQAVYLDDLFALGEGHDWAILGAGVRPTDARMREALAAQDNLSTVIELDP
AGHRARQVGAMVGFLPVEADNAALIEAMSDPRIRIVSLTVTEGGYYVDASGAFDPTHPDIVADAAHPARPATAFGAILAA
LRARRDAGVTPFTVMSCDNLPGNGHVTRNAVVGLAELYDAELAGWVKAQVAFPNGMVDRITPATGPHERELAQGFGLADP
VPVTCEPFRQWVIEDHFPAGRPALEKVGVTFTPHVHAYEAMKIRILNGGHAVIAYPSALMDIQLVHAAMAHPLIAAFLHK
VEVEEILPHVPPVPDTSIPDYLTLIESRFSNPEIADTTRRLCLDGSNRQPKFIVPSLRDNLAAGTVPKGLVLLSALWCRY
CFGTTDSGVVVEPNDPNWTALQDRARRAKETPAEWLAMTEVYGDLAQNDLLAAEFAAALEAVWRDGAEAVLRRFLAA
>P96574 ~~~mtlR~~~Transcriptional regulator MtlR~~~COG3711
MYMTAREQKLLKHLLLQNRYITVTELAELMQVSTRTIHRELKSIKPLMETVGLTLDKQPGKGLKAVGSPEGKQKLLTDLS
YEQHEYSADERKLLILCSLLESQEPVKLYTLAHDLQVTNATVSYDLDELEKWISPFGLTLIRKRGFGIQLIGPENAKRKI
VGNLIVNRLDIQMFLEAVELNIKGKTDSSEKMFGVVSKGELLKMERILFQLKEKIAFSLSDSSYIALVVHLTYAIERIKL
GETITMEQNELEELMNAKEYSSALEIAGELERAFGVTIPEAEVGYITIHLRSANRKYKTEYKAQEIELETALQTKRLIAF
ISDKIRMDLTKNYSLYEGLIAHLEPAVSRIKENIEIYNPMKEQIKRDYFLLYMAIEEGVEKYFPGMSFSDDEIAFIVLHF
GSALEIKKEEAKVKALVVCSSGIGSSKMLASRLKKELPEIESFDMSSLIELKGKDVQAYDMIVSTVPIPYENIDYIMVSP
LLNEEDANQVKQYIKRKIPLILNKKRSSKEEAQQADVPDMLEAAESIGRYMEVIQDVLRHFTLAQLKTNPDHSMLLLELF
QQLKKDGLIRDPEKAAVCLAEREKQGGLGIPGTNMALYHLKNDEIVLPFFKMFDLSTPYEVDGMDGNTLRMTRILVMMAP
GSLSAEGSEILSAISSAIIESGESMAGFQEEGGQELYQRLNRIFFTWMKEKNIL
>P0AF10 ~~~mtlR~~~Mannitol operon repressor~~~COG3722
MVDQAQDTLRPNNRLSDMQATMEQTQAFENRVLERLNAGKTVRSFLITAVELLTEAVNLLVLQVFRKDDYAVKYAVEPLL
DGDGPLGDLSVRLKLIYGLGVINRQEYEDAELLMALREELNHDGNEYAFTDDEILGPFGELHCVAALPPPPQFEPADSSL
YAMQIQRYQQAVRSTMVLSLTELISKISLKKAFQK
>Q49400 2.1.1.72~~~~~~Type II methyltransferase M.MgeORF184P~~~
MHHFNRAKKAKNNEFFTLIDEIENEVINYQKQFANKTIFCNCNDGKNSHFFQFFQTNFNQLQLKKLIGFSFNNLSQADKF
TFDGNKVTKTKLKGNGDFSSDESIEVLKQADIVVTNPPFSLFQSFIDLLIQHNKQFLVLGLNAAVSYNHIFTYFKTNKLW
FGYTVNKTMSFSVNSDYQLYNPKTSNFFTKNGKCFQKIAGISWFTNLGKPHYNPFLNTNCFYKNNEKNYPKFDWYDAIYV
NKIKNIPMDWNGLMGVPLTFLNCYNPKQFELVDCLANPYATLDTLKTNAFVKLNQGDVRNVNGKRRYVRVIIKKQQI
>Q50290 2.1.1.72~~~mte1~~~Type II methyltransferase M.MpnI~~~
MHYFNRAKKAKNNEFYTLFEDIAAEVACYPNAFKGKVVLCNCNDGYQSNFWQFFQSQFHALGLKKLVAIAFNPLGNSYQL
NFDGKEIKELPLAGNGSFDSAEAIVLLKQSDIVVTNPPFSLFQDFVCLLAEHGKQFLVLGHNGAVGYNQIFKLFKEEQLW
YGHTVNSSMLFQVQSNFKLYDPKSVNFVKKDGQLFQKVPGISWFTNLKKNQQPAWLKTKSRYQGNEHKYPKFDWYDAIFV
SKVKEIPLDWFGYMGVPLTFLNCFNPKQFELIDCLANPYATLDTLKTNAYVRSHHGDVRNVKGKRRYVRVVIKQRQNVI
>P23192 2.1.1.72~~~mboIIM~~~Type II methyltransferase M1.MboII~~~
MLEINKIHQMNCFDFLDQVENKSVQLAVIDPPYNLSKADWDSFDSHNEFLPFTYRWIDKVLDKLDKDGSLYIFNTPFNCA
FICQYLVSKGMIFQNWITWDKRDGMGSAKRGFSTGQETILFFSKSKNHTFNYDEVRVPYESTDRIKHASEKGILKNGKRW
FPNPNGRLCGEVWHFSSQRHKEKVNGKTVKLTHITPKPRDLIERIIRASSNPNDLVLDCFMGSGTTAIVAKKLGRNFIGC
DMNAEYVNQANFVLNQLEIN
>O31662 5.3.1.23~~~mtnA~~~Methylthioribose-1-phosphate isomerase~~~COG0182
MTHSFAVPRSVEWKETAITILNQQKLPDETEYLELTTKEDVFDAIVTLKVRGAPAIGITAAFGLALAAKDIETDNVTEFR
RRLEDIKQYLNSSRPTAINLSWALERLSHSVENAISVNEAKTNLVHEAIQIQVEDEETCRLIGQNALQLFKKGDRIMTIC
NAGSIATSRYGTALAPFYLAKQKDLGLHIYACETRPVLQGSRLTAWELMQGGIDVTLITDSMAAHTMKEKQISAVIVGAD
RIAKNGDTANKIGTYGLAILANAFDIPFFVAAPLSTFDTKVKCGADIPIEERDPEEVRQISGVRTAPSNVPVFNPAFDIT
PHDLISGIITEKGIMTGNYEEEIEQLFKGEKVH
>B7MMH6 5.3.1.23~~~mtnA~~~Methylthioribose-1-phosphate isomerase~~~
MNIKGKHYRTVWVSGDGKAVEIIDQTKLPFKFEVVALTSAEMAATAIQDMWVRGAPLIGVVAAYGIALGMNHDASDMGLQ
RYYDLLIKTRPTAINLKWALDRMIDTLKDLCVSERKDVAWALAAEIAEEDVALCEQIGLHGAEVIREIAQKKPAGSVVNI
LTHCNAGWLATVDWGTALSPIYKAHENGIPVHVWVDETRPRNQGGLTAFELGSHGIPHTLIADNAGGHLMQHGDVDLCIV
GTDRTTARGDVCNKIGTYLKALAAHDNHVPFYVALPSPTIDWTIEDGKSIPIEQRDGKEQSHVYGINPQGELSWVNTAPE
GTRCGNYAFDVTPARYITGFITERGVCAASKSALADMFADLKSKALQGEQH
>Q2RXI0 5.3.1.23~~~mtnA~~~Methylthioribose-1-phosphate isomerase~~~COG0182
MNVKGTPTRTIWPAREGGAVWIIDQTRLPHEFVTQRLNDLGAVAHAIRAMLVRGAPLIGATAAYGVALGMAEDPSDEGLT
RACQTLLATRPTAVNLRWAIEAMAESLAAVPPDQRAQAAWAKAGAICDEDVALNEAIGDHGLGIIKDLARTKGVEKGGEG
PINILTHCNAGWLATVDWGTALAPLYKAHDAGLPIHVWVDETRPRNQGASLTAWELNSHGVPHTVIADNTGGHLMQHGLV
DMVIVGTDRTTATGDVCNKIGTYLKALAAFDNAVPFYVALPGPTIDWTVNDGLREIPIEQRDAAEVTRVWGRTAAGALEW
VTITPTGSPAANYAFDVTPARLITGLITERGVCAASAAGLAGLYPERAPAPVPAGSAAGKGAAATADGAL
>Q9X013 5.3.1.23~~~mtnA~~~Methylthioribose-1-phosphate isomerase~~~COG0182
MKLKTKTMEWSGNSLKLLDQRKLPFIEEYVECKTHEEVAHAIKEMIVRGAPAIGVAAAFGYVLGLRDYKTGSLTDWMKQV
KETLARTRPTAVNLFWALNRMEKVFFENADRENLFEILENEALKMAYEDIEVNKAIGKNGAQLIKDGSTILTHCNAGALA
TVDYGTALGVIRAAVESGKRIRVFADETRPYLQGARLTAWELMKDGIEVYVITDNMAGWLMKRGLIDAVVVGADRIALNG
DTANKIGTYSLAVLAKRNNIPFYVAAPVSTIDPTIRSGEEIPIEERRPEEVTHCGGNRIAPEGVKVLNPAFDVTENTLIT
AIITEKGVIRPPFEENIKKILEV
>O67788 4.2.1.109~~~mtnB~~~Methylthioribulose-1-phosphate dehydratase~~~COG0235
MNVELFKKFSEKVEEIIEAGRILHSRGWVPATSGNISAKVSEEYIAITASGKHKGKLTPEDILLIDYEGRPVGGGKPSAE
TLLHTTVYKLFPEVNAVVHTHSPNATVISIVEKKDFVELEDYELLKAFPDIHTHEVKIKIPIFPNEQNIPLLAKEVENYF
KTSEDKYGFLIRGHGLYTWGRSMEEALIHTEALEFIFECELKLLSFHS
>O31668 4.2.1.109~~~mtnB~~~Methylthioribulose-1-phosphate dehydratase~~~COG0235
MAAKQERWRELAEVKRELAERDWFPATSGNLSIKVTDEPLTFLVTASGKDKRKETVEDFLLVDQNGEPAESGHSLKPSAE
TLLHTHLYNKTNAGCCLHVHTVNNNVISELYGDQKKITFKGQEIIKALGLWEENAEVTVPIIENPAHIPTLAALFAEEIS
EDSGAVLIRNHGITAWGKTAFEAKRVLEAYEFLFSYHLKLKTLEHQLVK
>Q81MI9 1.13.11.54~~~mtnD~~~Acireductone dioxygenase~~~COG1791
MAQIRIHEVNTRIENEVKVSKFLQEEGVLYEKWNISKLPPHLNENYSLTDENKAEILAVFSKEIADVSARRGYKAHDVIS
LSNSTPNLDELLINFQKEHHHTDDEVRFIVSGHGIFAIEGKDGTFFDVELEPGDLISVPENARHYFTLQDDRQVVAIRIF
VTTEGWVPIY
>Q9ZFE7 1.13.11.54~~~mtnD~~~Acireductone dioxygenase~~~COG1791
MSALTIFSVKDPQNSLWHSTNAEEIQQQLNAKGVRFERWQADRDLGAAPTAETVIAAYQHAIDKLVAEKGYQSWDVISLR
ADNPQKEALREKFLNEHTHGEDEVRFFVEGAGLFCLHIGDEVFQVLCEKNDLISVPAHTPHWFDMGSEPNFTAIRIFDNP
EGWIAQFTGDDIASAYPRLA
>O31665 2.6.1.117~~~mtnE~~~L-glutamine--4-(methylsulfanyl)-2-oxobutanoate aminotransferase~~~COG0436
MKFEQSHVLKELPKQFFASLVQKVNRKLAEGHDVINLGQGNPDQPTPEHIVEEMKRAVADPENHKYSSFRGSYRLKSAAA
AFYKREYGIDLDPETEVAVLFGGKAGLVELPQCLLNPGDTILVPDPGYPDYWSGVTLAKAKMEMMPLVKDRAFLPDYSSI
TAEIREQAKLMYLNYPNNPTGAVATSEFFEDTVRFAAENGICVVHDFAYGAVGFDGCKPLSFLQTEGAKDIGIEIYTLSK
TYNMAGWRVGFAVGNASVIEAINLYQDHMFVSLFRATQEAAAEALLADQTCVAEQNARYESRRNAWITACREIGWDVTAP
AGSFFAWLPVPEGYTSEQFSDLLLEKANVAVAAGNGFGEYGEGYVRVGLLTSEERLKEAAYRIGKLNLFTQKSIDKTL
>O31663 2.7.1.100~~~mtnK~~~Methylthioribose kinase~~~COG4857
MGVTKTPLYETLNESSAVALAVKLGLFPSKSTLTCQEIGDGNLNYVFHIYDQEHDRALIIKQAVPYAKVVGESWPLTIDR
ARIESSALIRQGEHVPHLVPRVFYSDTEMAVTVMEDLSHLKIARKGLIEGENYPHLSQHIGEFLGKTLFYSSDYALEPKV
KKQLVKQFTNPELCDITERLVFTDPFFDHDTNDFEEELRPFVEKLWNNDSVKIEAAKLKKSFLTSAETLIHGDLHTGSIF
ASEHETKVIDPEFAFYGPIGFDVGQFIANLFLNALSRDGADREPLYEHVNQVWETFEETFSEAWQKDSLDVYANIDGYLT
DTLSHIFEEAIGFAGCELIRRTIGLAHVADLDTIVPFDKRIGRKRLALETGTAFIEKRSEFKTITDVIELFKLLVKE
>B7MMH5 2.7.1.100~~~mtnK~~~Methylthioribose kinase~~~
MTDSIPSGYKPLTCDTLPGYLSSRLTPSCEPGGLPEEWKVSEVGDGNLNMVFIVEGTHKTIIVKQALPWLRAGGEGWPLS
LSRAGFEYNVLCQEAKYAGHTLIPQVYFYDPEMALFAMEYLTPHVILRKELINGKKFPKLAEDIGRFLAQTLFNTSDIGM
SAEQKKALTAEFALNHELCKITEDLIFTEPYYNAERNNWTSPELDDAVHKAWADVEMIQVAMRYKYKFMTEAQALLHGDL
HSGSIMVTDTDTKVIDPEFGFMGPMAFDIGNYIGNLLLAYFSRPGWDANEQRRADYQEWLLQQIVQTWSVFTREFRQLWD
NKTQGDAWSTEMYQQNRAALEDAQDQFFATLLEDSLVNAGMEMNRRIIGFAGVAELKQIENTELRAGCERRALTMARDLI
VNARQFKNMDSVIQSAKVK
>Q9F0P1 2.7.1.100~~~mtnK~~~Methylthioribose kinase~~~
MSQYHTFTAHDAVAYAQQFAGIDNPSELVSAQEVGDGNLNLVFKVFDRQGVSRAIVKQALPYVRCVGESWPLTLDRARLE
AQTLVAHYQHSPQHTVKIHHFDPELAVMVMEDLSDHRIWRGELIANVYYPQAARQLGDYLAQVLFHTSDFYLHPHEKKAQ
VAQFINPAMCEITEDLFFNDPYQIHERNNYPAELEADVAALRDDAQLKLAVAALKHRFFAHAEALLHGDIHSGSIFVAEG
SLKAIDAEFGYFGPIGFDIGTAIGNLLLNYCGLPGQLGIRDAAAAREQRLNDIHQLWTTFAERFQALAAEKTRDAALAYP
GYASAFLKKVWADAVGFCGSELIRRSVGLSHVADIDTIQDDAMRHECLRHAITLGRALIVLAERIDSVDELLARVRQYS
>A0KIZ1 3.2.2.9~~~mtnN~~~5'-methylthioadenosine/S-adenosylhomocysteine nucleosidase~~~COG0775
MKVGIIGAMEQEVALLRSQMSNPTTLQLGGCEFYQGTLAGKEVILTRSGIGKVAASVATSLLLEKFAPDCVINTGSAGGF
AQDLHIGDVVIASEMRFHDVDVTAFGYEMGQMAQQPAAFPCDETLIAVAQDCIAEQGKHQTKVGLICTGDQFMCKPDAIA
KARADFPQMLAVEMEGAAIGQVCHMFKVPYLVVRAMSDIAGKEQVESFDAFIEVAGKHSAEVIIKMLGKL
>Q5E2X3 3.2.2.9~~~mtnN~~~5'-methylthioadenosine/S-adenosylhomocysteine nucleosidase~~~COG0775
MKIGIIGAMEQEVAILKDKIEGLSTVTKAGCTFYTGTLNGADVVLLQSGIGKVAAAVGTTLLIAEHNVDVVLNTGSAGGF
DSSLNLGDVVISTEVRHHDADVTAFGYEMGQMAQQPAAFIADEKLITTAEQALTEMSDKHAVRGLICTGDVFVCTPERQE
FIRTHFPSVIAVEMEASAIAQTCHQFNTPFVVVRAISDVADKESPMSFDEFLPLAAQSSSEMVLNMVTLLK
>Q81LL4 3.2.2.9~~~mtnN~~~5'-methylthioadenosine/S-adenosylhomocysteine nucleosidase~~~COG0775
MRIAVIGAMEEEVRILRDKLEQAETETVAGCEFTKGQLAGHEVILLKSGIGKVNAAMSTTILLERYKPEKVINTGSAGGF
HHSLNVGDVVISTEVRHHDVDVTAFNYEYGQVPGMPPGFKADEALVALAEKCMQAEENIQVVKGMIATGDSFMSDPNRVA
AIRDKFENLYAVEMEAAAVAQVCHQYEVPFVIIRALSDIAGKESNVSFDQFLDQAALHSTNFIVKVLEELK
>Q47UY5 3.2.2.9~~~mtnN~~~5'-methylthioadenosine/S-adenosylhomocysteine nucleosidase~~~COG0775
MKAGIIGAMEPEVAILKEKLTDAKSTEHAGYTFHQGQLDGSDVVIVQSGIGKVAAALATAILIDRFQVDYVVNTGSAGGF
DASLKVGDIVVSSEVRYHDVDLTAFGYEIGQLPANPAAFMPHDDLVAAAKKGIEQLSQTAGENIKAVTGLITTGDTFMTK
EEDVAKARANFPTMAAVEMEGAAIAQACLQLKTPFVVIRSLSDIAGKESPHTFEEYLETAAVNSSQLVLNMLGQLKGKVL
SAA
>B7MBE2 3.2.2.9~~~mtnN~~~5'-methylthioadenosine/S-adenosylhomocysteine nucleosidase~~~
MKIGIIGAMEEEVTLLRDKIENRQTISLGGCEIYTGQLNGTEVALLKSGIGKVAAALGATLLLEHCKPDVIINTGSAGGL
APTLKVGDIVVSDEARYHDADVTAFGYEYGQLPGCPAGFKADDKLIAAAEACIAELNLNAVRGLIVSGDAFINGSVGLAK
IRHNFPQAIAVEMEATAIAHVCHNFNVPFVVVRAISDVADQQSHLSFDEFLAVAAKQSSLMVESLVQKLAHG
>P0AF14 3.2.2.9~~~mtnN~~~5'-methylthioadenosine/S-adenosylhomocysteine nucleosidase~~~COG0775
MKIGIIGAMEEEVTLLRDKIENRQTISLGGCEIYTGQLNGTEVALLKSGIGKVAAALGATLLLEHCKPDVIINTGSAGGL
APTLKVGDIVVSDEARYHDADVTAFGYEYGQLPGCPAGFKADDKLIAAAEACIAELNLNAVRGLIVSGDAFINGSVGLAK
IRHNFPQAIAVEMEATAIAHVCHNFNVPFVVVRAISDVADQQSHLSFDEFLAVAAKQSSLMVESLVQKLAHG
>P0AF12 3.2.2.9~~~mtnN~~~5'-methylthioadenosine/S-adenosylhomocysteine nucleosidase~~~COG0775
MKIGIIGAMEEEVTLLRDKIENRQTISLGGCEIYTGQLNGTEVALLKSGIGKVAAALGATLLLEHCKPDVIINTGSAGGL
APTLKVGDIVVSDEARYHDADVTAFGYEYGQLPGCPAGFKADDKLIAAAEACIAELNLNAVRGLIVSGDAFINGSVGLAK
IRHNFPQAIAVEMEATAIAHVCHNFNVPFVVVRAISDVADQQSHLSFDEFLAVAAKQSSLMVESLVQKLAHG
>A6T4W3 3.2.2.9~~~mtnN~~~5'-methylthioadenosine/S-adenosylhomocysteine nucleosidase~~~
MKIGIIGAMEEEVTLLRDKIENRQTITIGGSEIYTGQLHGVDVALLKSGIGKVAAAMGATLLLERCQPDVIINTGSAGGL
ASTLKVGDIVVSDEARYHDADVTAFGYEYGQLPGCPAGFKADEKLVAAAESCIKALDLNAVRGLIVSGDAFINGSVGLAK
IRHNFPQAIAVEMEATAIAHVCHNFKVPFVVVRAISDVADQQSHLSFEEFLAVAARQSTLMVENLVQNLARG
>P9WJM3 3.2.2.9~~~mtnN~~~5'-methylthioadenosine/S-adenosylhomocysteine nucleosidase~~~COG0775
MAVTVGVICAIPQELAYLRGVLVDAKRQQVAQILFDSGQLDAHRVVLAAAGMGKVNTGLTATLLADRFGCRTIVFTGVAG
GLDPELCIGDIVIADRVVQHDFGLLTDERLRPYQPGHIPFIEPTERLGYPVDPAVIDRVKHRLDGFTLAPLSTAAGGGGR
QPRIYYGTILTGDQYLHCERTRNRLHHELGGMAVEMEGGAVAQICASFDIPWLVIRALSDLAGADSGVDFNRFVGEVAAS
SARVLLRLLPVLTAC
>Q99TQ0 3.2.2.9~~~mtnN~~~5'-methylthioadenosine/S-adenosylhomocysteine nucleosidase~~~
MIGIIGAMEEEVTILKNKLTQLSEISVAHVKFYTGILKDREVVITQSGIGKVNAAISTTLLINKFKPDVIINTGSAGALD
ESLNVGDVLISDDVKYHDADATAFGYEYGQIPQMPVAFQSSKPLIEKVSQVVQQQQLTAKVGLIVSGDSFIGSVEQRQKI
KKAFPNAMAVEMEATAIAQTCYQFNVPFVVVRAVSDLANGEAEMSFEAFLEKAAVSSSQTVEALVSQL
>Q7A5B0 3.2.2.9~~~mtnN~~~5'-methylthioadenosine/S-adenosylhomocysteine nucleosidase~~~
MIGIIGAMEEEVTILKNKLTQLSEISVAHVKFYTGILKDREVVITQSGIGKVNAAISTTLLINKFKPDVIINTGSAGALD
ESLNVGDVLISDDVKYHDADATAFGYEYGQIPQMPVAFQSSKPLIEKVSQVVQQQQLTAKVGLIVSGDSFIGSVEQRQKI
KKAFPNAMAVEMEATAIAQTCYQFNVPFVVVRAVSDLANGEAEMSFEAFLEKAAVSSSQTVEALVSQL
>A5F5R2 3.2.2.9~~~mtnN~~~5'-methylthioadenosine/S-adenosylhomocysteine nucleosidase~~~COG0775
MKIGIIGAMQQEVAILKDLIEDVQEVNQAGCTFYSGQIQGVDVVLLQSGIGKVSAALGTALLISQYAPDVVINTGSAGGF
DASLNVGDVVISSEVRHHDADVTAFGYEIGQMAGQPAAFKADEKLMTVAEQALAQLPNTHAVRGLICTGDAFVCTAERQQ
FIRQHFPSVVAVEMEASAIAQTCHQFKVPFVVVRAISDVADKESPLSFEEFLPLAAKSSSAMVLKMVELLK
>Q9KPI8 3.2.2.9~~~mtnN~~~5'-methylthioadenosine/S-adenosylhomocysteine nucleosidase~~~COG0775
MKIGIIGAMQQEVAILKDLIEDVQEVNQAGCTFYSGQIQGVDVVLLQSGIGKVSAALGTALLISQYAPDVVINTGSAGGF
DASLNVGDVVISSEVRHHDADVTAFGYEIGQMAGQPAAFKADEKLMTVAEQALAQLPNTHAVRGLICTGDAFVCTAERQQ
FIRQHFPSVVAVEMEASAIAQTCHQFKVPFVVVRAISDVADKESPLSFEEFLPLAAKSSSAMVLKMVELLK
>O31664 3.5.1.111~~~mtnU~~~2-oxoglutaramate amidase~~~COG0388
MKWTISCLQFDISYGKPSENIKKAEFFIEKESKHADVLVLPELWTTGYDLANLDELADEDGRSAQSWLKKTAKKHGVHIV
AGSVAVRKNSDVYNTMYIADKEGQIIKEYRKAHLFQLMDEHLYLSAGSEDGYFELDGVKSSGLICYDIRFPEWIRKHTTK
GANVLFISAEWPLPRLDHWKSLLIARAIENQCFVAACNCTGSNPDNEFAGHSLIIDPWGRVLAEGGREEGIVRAEIDLQE
SAEVRESIPVFDDIRKDLY
>Q819E8 5.3.2.5~~~mtnW~~~2,3-diketo-5-methylthiopentyl-1-phosphate enolase~~~
MSGIIATYLIHDDSHNLEKKAEQIALGLTIGSWTHLPHLLQEQLKQHKGNVIHVEELAEHEHTNSYLRKKVKRGIIKIEY
PLLNFSPDLPAILTTTFGKLSLDGEVKLIDLTFSDELKKHFPGPKFGIDGIRNLLQVHDRPLLMSIFKGMIGRNIGYLKT
QLRDQAIGGVDIVKDDEILFENALTPLTKRIVSGKEVLQSVYETYGHKTLYAVNLTGRTFDLKENAKRAVQAGADILLFN
VFAYGLDVLQSLAEDDEIPVPIMAHPAVSGAYSASKLYGVSSPLLLGKLLRYAGADFSLFPSPYGSVALEKEEALAISKY
LTEDDASFKKSFSVPSAGIHPGFVPFIVRDFGKDVVINAGGGIHGHPNGAQGGGKAFRTAIDATLQNKPLHEVDDINLHS
ALQIWGNPSYEVKL
>O31666 5.3.2.5~~~mtnW~~~2,3-diketo-5-methylthiopentyl-1-phosphate enolase~~~COG1850
MSELLATYLLTEPGADTEKKAEQIATGLTVGSWTDLPLVKQEQMQKHKGRVIKVEEREGTAASEKQAVITIAYPEINFSQ
DIPALLTTVFGKLSLDGKIKLIDLHFSEAFKRALPGPKFGVYGIRKLLGEFERPLLMSIFKGVIGRDLSDIKEQLRQQAL
GGVDLIKDDEIFFETGLAPFETRIAEGKQILKETYEQTGHKTLYAVNLTGRTADLKDKARRAAELGADALLFNVFAYGLD
VMQGLAEDPEIPVPIMAHPAVSGAFTSSPFYGFSHALLLGKLNRYCGADFSLFPSPYGSVALPRADALAIHEECVREDAF
NQTFAVPSAGIHPGMVPLLMRDFGIDHIINAGGGVHGHPNGAQGGGRAFRAIIDAVLEAQPIDEKAEQCKDLKLALDKWG
KAEAV
>Q5L1E2 5.3.2.5~~~mtnW~~~2,3-diketo-5-methylthiopentyl-1-phosphate enolase~~~COG1850
MSAVMATYLLHDETDIRKKAEGIALGLTIGTWTDLPALEQEQLRKHKGEVVAIEELGESERVNAYFGKRLKRAIVKIAYP
TVNFSADLPALLVTTFGKLSLDGEVRLLDLEFPDEWKRQFPGPRFGIDGIRDRVGVHNRPLLMSIFKGMIGRDLAYLTSE
LKKQALGGVDLVKDDEILFDSELLPFEKRITEGKAALQEVYEQTGKRTLYAVNLTGKTFALKDKAKRAAELGADVLLFNV
FAYGLDVLQALREDEEIAVPIMAHPAFSGAVTPSEFYGVAPSLWLGKLLRLAGADFVLFPSPYGSVALEREQALGIARAL
TDDQEPFARAFPVPSAGIHPGLVPLIIRDFGLDTIVNAGGGIHGHPDGAIGGGRAFRAAIDAVLAGRPLRAAAAENEALQ
KAIDRWGVVEVEA
>A8YER2 5.3.2.5~~~mtnW~~~2,3-diketo-5-methylthiopentyl-1-phosphate enolase~~~
MTIIVDYRFPPAINAEKQAKTIAIGQTAGTWSERHSHRQKQLQQHLAEVVGIREEADGYKVARVRFPQINVENDIASLLT
MIFGKYSMAGAGKVVGVYLPESYGTKAKLGITGIRQRLGVYDRPLVMAIFKPALGLSAQDHADILREVAFAGLDVIKDDE
IMADLPVAPTHERLDCCRRVLEEVRQQTGRNVLYAVNVTGKADELQRKARLLVKHGANALLLNVLTYGFSVLEALASDPA
IDVPIFAHPAFAGAMCAGSDTGLAYSVVLGTMMAHAGADAVLYPAAYGSLPFDPQEEGKIRDILRDRNVFPVPSAGIRPG
IVPQVLGDYGRNVILNAGTGIMDHPSGPASGVRAFFEALARIEAGESFDPANLPEGALKQAILEWG
>O31667 3.1.3.87~~~mtnX~~~2-hydroxy-3-keto-5-methylthiopentenyl-1-phosphate phosphatase~~~COG4359
MTTRKPFIICDFDGTITMNDNIINIMKTFAPPEWMALKDGVLSKTLSIKEGVGRMFGLLPSSLKEEITSFVLEDAKIREG
FREFVAFINEHEIPFYVISGGMDFFVYPLLEGIVEKDRIYCNHASFDNDYIHIDWPHSCKGTCSNQCGCCKPSVIHELSE
PNQYIIMIGDSVTDVEAAKLSDLCFARDYLLNECREQNLNHLPYQDFYEIRKEIENVKEVQEWLQNKNAGESSLK
>P40874 1.5.3.-~~~solA~~~N-methyl-L-tryptophan oxidase~~~COG0665
MKYDLIIIGSGSVGAAAGYYATRAGLNVLMTDAHMPPHQHGSHHGDTRLIRHAYGEGEKYVPLVLRAQTLWDELSRHNEE
DPIFVRSGVINLGPADSTFLANVAHSAEQWQLNVEKLDAQGIMARWPEIRVPDNYIGLFETDSGFLRSELAIKTWIQLAK
EAGCAQLFNCPVTAIRHDDDGVTIETADGEYQAKKAIVCAGTWVKDLLPELPVQPVRKVFAWYQADGRYSVKNKFPAFTG
ELPNGDQYYGFPAENDALKIGKHNGGQVIHSADERVPFAEVASDGSEAFPFLRNVLPGIGCCLYGAACTYDNSPDEDFII
DTLPGHDNTLLITGLSGHGFKFASVLGEIAADFAQDKKSDFDLTPFRLSRFQ
>A0A291P0C1 1.8.3.4~~~mtoX~~~Methanethiol oxidase~~~
MKKHLLAGACALAMGFAVIPGTFADETCNSPFTTALITGQEQYLHVWTLGMPGVGDESDKLVTISVDPKSDKYGKVINTL
SVGGRGEAHHTGFTDDRRYLWAGRLDDNKIFIFDLIDPANPKLIKTITDFADRTGYVGPHTFYALPGRMLIQALSNTKTH
DGQTGLAVYSNAGELVSLHPMPVTDGGDGYGYDIGINPAKNVLLTSSFTGWNNYMMDLGKMVKDPEAMKRFGNTMAIWDL
KSMKAEKILNVPGAPLEIRWSLKPEHNWAYTATALTSKLWLIKQDDKGEWIAKETGTIGDPSKIPLPVDISITADAKGLW
VNTFLDGTTRFYDISEPEHPKEVFSKKMGNQVNMVSQSYDGKRVYFTTSLIANWDKKGAENDQWLKAYDWDGKELVEKFT
VDFNELKLGRAHHMKFSSKTNAAELGTNQSFPTRQ
>Q5LKW0 1.8.3.4~~~mtoX~~~Methanethiol oxidase~~~COG3391
MKRREFGALAAGALAMGLPFRAFADETCQSPYMPKITGQEEFVYVWTLGVEGMGDEQDKLVTIDLRPGSATRGQVINSVS
VGGRNEAHHGGFSADRRFFWTGGLDTNRIFIFDVHSDPSNPKLHKTIDTFVKDSGGVVGPHTFFALPGSMMITGLSNDDD
HGGRTALVEYNDDGEYVATYWMPTADDMQGAVAVGDAVADGYGYDIRALIRKNVMLTSSFTGWSNYMMDFGQMLQDAEAM
KRFGNTIVQWDLHTRQPKKVFNVPGAPLEIRFPWGSNANYAFSTTALTSQLWLIYEDDAGEWQAKAVADIGNPADIPLPV
DISIAADDQTLWINSFMDGKTRLFDISDPHKPFQIYEKVIDRQVNMVSQSWDGKRVYFSSSLLANWDKKGKDDAQYLKAY
NWDGKELVEDFAVDFYELGLGRAHIMRFGSSALYSS
>P11409 2.1.1.113~~~pvuIIM~~~Type II methyltransferase M.PvuII~~~
MMTLNLQTMSSNDMLNFGKKPAYTTSNGSMYIGDSLELLESFPDESISLVMTSPPFALQRKKEYGNLEQHEYVDWFLSFA
KVVNKKLKPDGSFVVDFGGAYMKGVPARSIYNFRVLIRMIDEVGFFLAEDFYWFNPSKLPSPIEWVNKRKIRVKDAVNTV
WWFSKTEWPKSDITKVLAPYSDRMKKLIEDPDKFYTPKTRPSGHDIGKSFSKDNGGSIPPNLLQISNSESNGQYLANCKL
MGIKAHPARFPAKLPEFFIRMLTEPDDLVVDIFGGSNTTGLVAERESRKWISFEMKPEYVAASAFRFLDNNISEEKITDI
YNRILNGESLDLNSII
>P05103 2.1.1.72~~~paeR7IM~~~Type II methyltransferase M.PaeR7I~~~
MAFAPSVAHKPVAAAVCPVMAATEALATEGGLEARGAIFTRSEVVDFILDLAGYTEDQPLHEKRLLEPSFGGGDFLLPII
QRLLSAWRAARPNGTEVDDLGDAIRAVELHHDTFRSTYAAVVALLKREGLSANAATALADRWLSQGDFLLAPLEGQFDFV
VGNPPYVRPELIPAPLLAEYRSRYQTMYDRADIYIPFIERSLTALSAGGNLGFICADRWMKNRYGGPLRSLVAERFHLKV
YVDMVDTPAFHSDVIAYPAITIISREGGGATRIAHRPSIDRATLTTLAGLLSAPTLPKDAGPVRELARVTNGAEPWLLES
SDQMALIRRLEGAFPLLEEAGCKVGIGVATGADKAFIGDFESLDVEPDRKLPLVTTKDIMTGEVQWRGQGVINPFAESGG
LVDLGEYPRLRRYLEARRDVIAGRHCAKKAPANWYRTIDRITPALAARPKLLIPDIKGESHIVFEGGELYPSHNLYYVTS
DDWDLRALQAVLLSAVSRLFVATYSTKMRGGFLRFQAQYLRRIRIPRWADVPEPLRRELAEAAIKRDVQACNRAVFRLYG
LSHEERSALGGNGE
>P0DX09 2.1.1.-~~~mtpB~~~Proline betaine:corrinoid methyltransferase~~~
MYVNRRFYDKYVSTRDVELLHEYSLRVLKEVGVSFDCEEALEIFKKHGATVEGSIVKIDEDLLNQALETAPKTFTITTSA
GETKIGERYKPKTVGCYGPPKFLFEDDEYRVAKKDDMVKFLKLMDTSDVTDFVNNSAYDTPDLDKTKEDFYLPQVAMCLK
YSQKPTYGNVANSMNVRGKSLKQEAKDIAKLYKEFYDIWDRPVLLTNTCALSPLGYSYEVLDNIMGLVEEGQPVTIITCS
MTNLTAPAALLGSVIQNNATILAGIVLTQLINPGNPVIYGTVSTATDMRNVACSIGAPEAQLIQMASLALGRYYQLPVRT
GIAGTDSLKPDYQAGVESFMILMTTYLGKSDFVLNHAGILQAYALGSYEKFVLDEEVNRILLRLNRGIDISDVKAEKVFD
AIKKAGPLGNYLSGRTPKEYRQEHWLTKLFNRQAGNPQPIFDEIGDLRERASKEVEERVAGYTLPDLTKTQKDILNRYLP
EDEKF
>P00474 2.1.1.72~~~pstIM~~~Type II methyltransferase M.PstI~~~
MTEAATQLPISLNILVDSIREAANSTLDETLRSKLGQFMSSSAVSELMANLFESYVGEHEILDAGAGVGSLTAAFVQNAT
LNGAKSISSTCYEISEVMVYNLIQVLDLCKIRAMEFEVNWQQKIIESDFIQASVEQLLIENYSPKYNKAILNPPYLKIAA
KGRERALLQKVGIEASNLYSAFVALAIKQLKSGGELVAITPRSFCNGPYFNDFRKQMLDECSLNKIHVFNSRKSAFKADN
VLQENIIYHLTKGETQRKVVTVYSSTCANDINPTIFEVPFDEIVKSNNPDLFIHIVTNEQERELANKAGGLPCSLSDLGI
QVSTGKVVDFRTRENLSMEYISNSVPLIFPQHLQRCSIVWPITKAKKPNALIVNEATNNLMVPNGIYVLTRRLTAKEEKR
RIVASIYYPDIANVDTVGFDNKINYFHANGKPLDISLAKGLWVFLNSTLIDKYFRQMNGHTQVNATDLRALRYPTREQLE
DIANQVDFGEFEQTKIDEIINQSLQLM
>P0DX08 2.1.1.-~~~mtqA~~~Methylcorrinoid:tetrahydrofolate methyltransferase~~~
MIIIGEKLNGAIPVVKKAIEEKDEAFIRDRAIAQAEAGATYIDVCAGTAPEIELESLKWMMDIVQEAVETPLCIDSPDPE
ILKAVFPLCKKPGLVNSVSGEGNKMDVLLPLFDDADPTWELVAMTCDNAGIPNTVEKKVELTKMMVEEAKKHNLTPNRIH
IDPCVMALSTENHSFLNFKAEIEGIREIYPDIHITSGLSNISFGLPARKLMNQNFMTLSMFVGMDSAVMDPTSRDMMGAI
FATDALLGNDRLCRKYSKAFRQGKIGPVQAK
>P0DX07 ~~~mtqC~~~Quaternary-amine-specific corrinoid protein~~~
MADWKNLTQAVGDLEEDDVMEILNDFVATNPTEAEAEEAVAACQAGMAVVGDLFEEGEYFVGDLIFAGELLTEAINVLKP
VLGSGDTAVAGTILIGTAHGDLHDIGKNIFRSMAEAAGFQVTDLGIDVAIDTFVEKAKEIKPDIIGISGVLTLAIDSMKE
TSDALKAAGVDSKLIIGGNPVTKEACEYVGADDFTTNAAEGVKICQAWVG
>P14751 2.1.1.72~~~rsrIM~~~Type II methyltransferase M.RsrI~~~
MANRSHHNAGHRAMNALRKSGQKHSSESQLGSSEIGTTRHVYDVCDCLDTLAKLPDDSVQLIICDPPYNIMLADWDDHMD
YIGWAKRWLAEAERVLSPTGSIAIFGGLQYQGEAGSGDLISIISHMRQNSKMLLANLIIWNYPNGMSAQRFFANRHEEIA
WFAKTKKYFFDLDAVREPYDEETKAAYMKDKRLNPESVEKGRNPTNVWRMSRLNGNSLERVGHPTQKPAAVIERLVRALS
HPGSTVLDFFAGSGVTARVAIQEGRNSICTDAAPVFKEYYQKQLTFLQDDGLIDKARSYEIVEGAANFGAALQRGDVAS
>Q9RYA1 2.1.1.-~~~~~~Ribosomal RNA large subunit methyltransferase DR_0049~~~COG1092
MSAPASSAPRLRLRVSKAAELHIRDGHPWVYESSVREQNREGEPGELAVVYDRRDRFLAIGLYDPFSPLRLRVLHTGMPT
QLDDAWWAARLDAALARRAALFGPLTAFGDTDGYRVLNGESDGFPGLVVDRYAGVLVMKLYTAAWFPHLRRMLELFAARA
PDFAVVLRLSRNIAERAADLGLHDGQVIYGELAGDSVVFRESGLRFEAEVRQGQKTGFFLDQRENRRRVEGLSEGRRVLN
AFSFSGGFSLYAARGGASEVTSLDISAHALRSAERNFALNPELSAVHKTVQADVFEWLPAGKGSGADYDLVILDPPSLAR
REAEREGAIRAYGKLAEGGLTRLAPGGILVSASCSAHVSAEEFEEAVMSAVRRSGRRWRKLLSSRHAPDHHASFAEAEYL
KAVFLQMD
>A0QTK2 ~~~mtrA~~~DNA-binding response regulator MtrA~~~COG0745
MDTMRQRILVVDDDPSLAEMLTIVLRGEGFDTAVIGDGSQALTAVRELRPDLVLLDLMLPGMNGIDVCRVLRADSGVPIV
MLTAKTDTVDVVLGLESGADDYVMKPFKPKELVARVRARLRRNEDEPAEMLSIGDVEIDVPAHKVTRQGEQISLTPLEFD
LLVALARKPRQVFTRDVLLEQVWGYRHPADTRLVNVHVQRLRAKVEKDPENPQVVLTVRGVGYKAGPP
>P9WGM7 ~~~mtrA~~~DNA-binding response regulator MtrA~~~COG0745
MDTMRQRILVVDDDASLAEMLTIVLRGEGFDTAVIGDGTQALTAVRELRPDLVLLDLMLPGMNGIDVCRVLRADSGVPIV
MLTAKTDTVDVVLGLESGADDYIMKPFKPKELVARVRARLRRNDDEPAEMLSIADVEIDVPAHKVTRNGEQISLTPLEFD
LLVALARKPRQVFTRDVLLEQVWGYRHPADTRLVNVHVQRLRAKVEKDPENPTVVLTVRGVGYKAGPP
>Q9WW32 ~~~mtrA~~~HTH-type transcriptional regulator MtrA~~~
MDILDKLVDLAQLTGSADVQCLLGGQWSVRHETLQCEGLVHIVTAGSGYLCIDGETSPRPVGTGDIVFFPRGLGHVLSHD
GKYGESLQPDIRQNGTFMVKQCGNGLDMSLFCARFRYDTHADLMNGLPETVFLNIAHPSLQYVVSMLQLESEKPLTGTVS
VVNALPSVLLVLILRAYLEQDKDVELSGVLKGWQDKRLGHLIQKVIDKPEDEWNIDKMVAAANMSRAQLMRRFKSQVGLS
PHAFVNHIRLQKGALLLKKTPDSVLEVALSVGFQSETHFGKAFKRQYHVSPGQYRKEGGQK
>P0DSN3 ~~~mtrA~~~Multiheme cytochrome MtrA~~~
MKNSLKMKNLLPALVITMAMSAVMSLCIAPNAYASKWDAKMTPEQVEATLDKKFAEGNYSPKGADSCLMCHKKSEKVMDL
FKGVHGAIDSSKSPMAGLQCEACHGPLGQHNKGGNEPMITFGKQSTLSAEKQNSVCMSCHQDDKRVSWNGSHHDNADVAC
ASCHQVHVAKDPVLSKNTEMEVCTSCHTKQKADMNKRSSHPLKWAQMTCSDCHNPHGSMTDSDLNKPSINETCYSCHAEK
RGPKLWEHAPVTENCVTCHNPHGSVNDAMLKTRAPQLCQQCHASDGHASNAYLGNTGLGSNVGDNAFTGGRSCLNCHSQV
HGSNHPSGKLLQR
>P19466 ~~~mtrB~~~Transcription attenuation protein MtrB~~~
MNQKHSSDFVVIKAVEDGVNVIGLTRGTDTKFHHSEKLDKGEVIIAQFTEHTSAIKVRGEALIQTAYGEMKSEKK
>Q9X6J6 ~~~mtrB~~~Transcription attenuation protein MtrB~~~
MYTNSDFVVIKALEDGVNVIGLTRGADTRFHHSEKLDKGEVLIAQFTEHTSAIKVRGKAYIQTRHGVIESEGKK
>Q9KCC6 ~~~mtrB~~~Transcription attenuation protein MtrB~~~
MNVGDNSNFFVIKAKENGVNVFGMTRGTDTRFHHSEKLDKGEVMIAQFTEHTSAVKIRGKAIIQTSYGTLDTEKDE
>P9WGK9 2.7.13.3~~~mtrB~~~Sensor histidine kinase MtrB~~~COG5000
MIFGSRRRIRGRRGRSGPMTRGLSALSRAVAVAWRRSLQLRVVALTLGLSLAVILALGFVLTSQVTNRVLDIKVRAAIDQ
IERARTTVSGIVNGEETRSLDSSLQLARNTLTSKTDPASGAGLAGAFDAVLMVPGDGPRAASTAGPVDQVPNALRGFVKA
GQAAYQYATVQTEGFSGPALIIGTPTLSRVANLELYLIFPLASEQATITLVRGTMATGGLVLLVLLAGIALLVSRQVVVP
VRSASRIAERFAEGHLSERMPVRGEDDMARLAVSFNDMAESLSRQIAQLEEFGNLQRRFTSDVSHELRTPLTTVRMAADL
IYDHSADLDPTLRRSTELMVSELDRFETLLNDLLEISRHDAGVAELSVEAVDLRTTVNNALGNVGHLAEEAGIELLVDLP
AEQVIAEVDARRVERILRNLIANAIDHAEHKPVRIRMAADEDTVAVTVRDYGVGLRPGEEKLVFSRFWRSDPSRVRRSGG
TGLGLAISVEDARLHQGRLEAWGEPGEGACFRLTLPMVRGHKVTTSPLPMKPIPQPVLQPVAQPNPQPMPPEYKERQRPR
EHAEWSG
>P0DSN2 ~~~mtrB~~~Outer membrane protein MtrB~~~
MKFKLNLITLALLANTGFAIAADGYGLANANTEKVKMSAWSCKGCVVETGTSGTVGVGVGYNSEEDIRSANAFGTSNEVA
GKLDADVTFRGEKGYRASVEAYQLGMDGGRLEVNAGKQGQYNVNVNYRQIATYNSNSALTPYSGVGSDNLTLPDNWVTAG
SSSQMPLLMDSLNSLELSLKRERTGLGFDYQGESLWSTHVSYMREEKTGLKKASGGFFNQSMMLAEPVDYTTDSIEAGIK
LKGDNWFTALNYNGSIFKNEYNQLNFDSAFNPTFGAQTSGSIALDPDNQSHTVSLMGQYNDSTNVLSARLLTGQMSQDQA
LVTSGYGYQVPTEALDAKVDLIGLNLKVVSKVTNSLRLSGSYDYNDRDNNTQIEEWTQVSINNVNGKVAYNTPYDNTSQR
FKVAADYRITRGMKLDGGYDFRRDERNYQDRETTDENTVWARFRVNSFETWDMWVKGSYGQRDGSEYQASEWTSSETNSL
LRKYNLANRDRTQVEARVTHSPIESLTIDFGARYALDDYTDTVIGLTESKDTSYDANISYMITDDLLANAFYNYQIIESE
QAGSSNYSTPTWTGFIEDKVDVVGAGISYNNLLENKLRMGLDYTYSDSNSNTQVRQGITGDYGDYFAKVHNINLYAQYQA
TEKMALRFDYKIENYKDNDAANDIAVNGIWNVVGFGDNSHDYTAQMIMLSMSYKI
>P0DSN4 ~~~mtrC~~~Multiheme cytochrome MtrC~~~
MMNAQTTKIALLLAASAVTMALTGCGGSDGNDGNPGEPGGEPAPAIQILNFTFDKSVITNGVPSVEFTVTNENDLPVVGL
QKMRFAAAQLIPQGATGAGNASQWQYFGDETCDVAATCPGTFVDQKNGHYSYTFNMNLTANAKITYNDQLAQRVLIRAYN
TPLPDGTQVPNSNAFVDFTADTGAAPTYSRKIVATESCNTCHQDLANVKHGGAYSDVNYCATCHTAGKVGVGKEFNVLVH
AKHKDLTLGSLESCQSCHAANDAAPDWGNWSRIPTAATCGSCHSTVDFAAGKGHSQQLDNSNCIACHNSDWTAELHTGKT
ADKKAVIAQLGMQATLVGQTDDTAVLTVSILDKDGNAIDAATVQDKIKRLETVTNVGPNFPIMGYNKSPGSGAAKIAKDL
VKDGALQAGVTLVDGKLVFTTPALPFGTGDTDTAFTFIGLEMCSTGTSLTACTVDSATTSMKAELAFGTKSGNAPSMRHV
NSVNFSTCQGCHSDTFEIHKGHHSGFVMTEQVSHAKDANGKAIVGVDGCVACHTPDGTYASGANKGAFEMKLHVIHGEQG
VIKECTQCHNDFNLDAFKVKGALATSAGKYTTPITATCTSCHAPESIGHGLENMGAIVNGDYVQANQAAQSETCFYCHKP
TPTDHTQVKM
>P39897 ~~~mtrR~~~HTH-type transcriptional regulator MtrR~~~
MRKTKTEALKTKEHLMLAALETFYRKGIARTSLNEIAQAAGVTRGALYWHFKNKEDLFDALFQRICDDIENCIAQDAADA
EGGSWTVFRHTLLHFFERLQSNDIHYKFHNILFLKCEHTEQNAAVIAIARKHQAIWREKITAVLTEAVENQDLADDLDKE
TAVIFIKSTLDGLIWRWFSSGESFDLGKTAPRIIGIMMDNLENHPCLRRK
>P0AAD2 ~~~mtr~~~Tryptophan-specific transport protein~~~COG0814
MATLTTTQTSPSLLGGVVIIGGTIIGAGMFSLPVVMSGAWFFWSMAALIFTWFCMLHSGLMILEANLNYRIGSSFDTITK
DLLGKGWNVVNGISIAFVLYILTYAYISASGSILHHTFAEMSLNVPARAAGFGFALLVAFVVWLSTKAVSRMTAIVLGAK
VITFFLTFGSLLGHVQPATLFNVAESNASYAPYLLMTLPFCLASFGYHGNVPSLMKYYGKDPKTIVKCLVYGTLMALALY
TIWLLATMGNIPRPEFIGIAEKGGNIDVLVQALSGVLNSRSLDLLLVVFSNFAVASSFLGVTLGLFDYLADLFGFDDSAV
GRLKTALLTFAPPVVGGLLFPNGFLYAIGYAGLAATIWAAIVPALLARASRKRFGSPKFRVWGGKPMIALILVFGVGNAL
VHILSSFNLLPVYQ
>P9WHH3 1.8.1.15~~~mtr~~~Mycothione reductase~~~COG1249
METYDIAIIGTGSGNSILDERYASKRAAICEQGTFGGTCLNVGCIPTKMFVYAAEVAKTIRGASRYGIDAHIDRVRWDDV
VSRVFGRIDPIALSGEDYRRCAPNIDVYRTHTRFGPVQADGRYLLRTDAGEEFTAEQVVIAAGSRPVIPPAILASGVDYH
TSDTVMRIAELPEHIVIVGSGFIAAEFAHVFSALGVRVTLVIRGSCLLRHCDDTICERFTRIASTKWELRTHRNVVDGQQ
RGSGVALRLDDGCTINADLLLVATGRVSNADLLDAEQAGVDVEDGRVIVDEYQRTSARGVFALGDVSSPYLLKHVANHEA
RVVQHNLLCDWEDTQSMIVTDHRYVPAAVFTDPQIAAVGLTENQAVAKGLDISVKIQDYGDVAYGWAMEDTSGIVKLITE
RGSGRLLGAHIMGYQASSLIQPLIQAMSFGLTAAEMARGQYWIHPALPEVVENALLGLR
>Q02DS7 ~~~mtr~~~Tryptophan-specific transport protein~~~
MSSSPAQTPSRRPSLLGGSMIIAGTAVGAGMFSLPIAMSGIWFGWSVAVFLLTWFCMLLSGMMILEANLNYPVGSSFSTI
TRDLLGQGWNVVNGLSIAFVLYILTYAYISGGGSIIGYTLSSGLGVTLPEKLAGLLFALAVALVVWWSTRAVDRITTLML
GGMIITFGLSISGLLGRIQPAILFNSGEPDAVYWPYLLATLPFCLTSFGYHGNVPSLMKYYGKDPQRISRSLWIGTLIAL
AIYLLWQASTLGTIPREQFKGIIAGGSNVGTLVEYLHRITASDSLNALLTTFSNLAVASSFLGVTLGLFDYLADLCRFDD
SHFGRFKTALLTFVPPTIGGLLFPNGFIYAIGFAGLAAAFWAVIVPALMARASRKRFGSPLFRAWGGTPAIVLVLLFGVA
NAVAHILASLHWLPEYR
>P23737 2.1.1.37~~~~~~Type II methyltransferase M.Sau96I~~~
MRLNKGSIIEKMKNQNIKTQTELAEKIDISKSQLSFMFSDEYEPLKKNVIKLADVLKVSPNDIILDEEDQMPINSDFNRY
DYKLDEFIDVSNVRKNKDYNVFETFAGAGGLALGLESAGLSTYGAVEIDKNAAETLRINRPKWKVIENDIEFIADNLDEF
IDEEIDILSGGYPCQTFSYAGKRNGFADTRGTLFYPYSKILSKLKPKAFIAENVRGLVNHDDGKTLEVMLKVFIKEGYEV
YWNILNSWNYDVAQKRERIVIIGIREDLVKEQKYPFRFPLAQVYKPVLKDVLKDVPKSKVTAYSDKKREVMKLVPPGGCW
VDLPEQIAKDYMGKSWYSGGGKRGMARRISWDEPCLTLTTSPSQKQTERCHPDETRPFSIREYARIQSFPDEWEFSGGVG
AQYRQIGNAVPVNLAKYIGKSLVHYLNQFN
>P42364 ~~~scaA~~~Manganese ABC transporter substrate-binding lipoprotein scaA~~~
MKKCRFLVLLLLAFVGLAACSSQKSSTDSSSSKLNVVATNSIIADITKNIAGDKINLHSIVPVGQDPHKYEPLPEDVKKT
SKADLIFYNGINLETGGNAWFTKLVENAQKKENKDYYAVSEGVDVIYLEGQNEKGKEDPHAWLNLENGIIYAQNIAKRLI
EKDPDNKATYEKNLKAYIEKLTALDKEAKEKFNNIPEEKKMIVTSEGCPKYFSKAYNVPSAYIWEINTEEEGTPDQIKSL
VEKLRKTKVPSLFVESSVDDRPMKTVSKDTNIPIYAKIFTDSIAEKGEDGDSYYSMMKYNLDKISEGLAK
>P0A4G4 ~~~mtsA~~~Iron ABC transporter substrate-binding lipoprotein MtsA~~~
MGKRMSLILGAFLSVFLLVACSSTGTKTAKSDKLKVVATNSIIADMTKAIAGDKIDLHSIVPIGQDPHEYEPLPEDVEKT
SNADVIFYNGINLEDGGQAWFTKLVKNAQKTKNKDYFAVSDGIDVIYLEGASEKGKEDPHAWLNLENGIIYSKNIAKQLI
AKDPKNKETYEKNLKAYVAKLEKLDKEAKSKFDAIAENKKLIVTSEGCFKYFSKAYGVPSAYIWEINTEEEGTPDQISSL
IEKLKVIKPSALFVESSVDRRPMETVSKDSGIPIYSEIFTDSIAKKGKPGDSYYAMMKWNLDKISEGLAK
>P31305 ~~~fimA~~~Metal ABC transporter substrate-binding lipoprotein FimA~~~
MKKIASVLALFVALLFGLLACSKGSSSGASGKLKVVTTNSILADITKNIAGDKIELHSIVPVGKDPHEYEPLPEDVKKTS
QADLIFYNGINLETGGNAWFTKLVKNANKVENKDYFAVSEGVDVIYLEGQNQAGKEDPHAWLNLENGILYAKNIAKQLIA
KDPKNKDFYEKNLAAYTEKLSKLDQKAKQAFKNIPEDKKMIVTSEGCFKYFSKAYGVPSAYIWEINTEEEGTPEQIKTLV
EKLRQTKVPALFVESSVDERPMKTVAKDTNIPIYAKIFTDSIAKEGEKGDSYYSMMKWNLDKIAEGLSQ
>P0A4G2 ~~~psaA~~~Manganese ABC transporter substrate-binding lipoprotein PsaA~~~COG0803
MKKLGTLLVLFLSAIILVACASGKKDTTSGQKLKVVATNSIIADITKNIAGDKIDLHSIVPIGQDPHEYEPLPEDVKKTS
EADLIFYNGINLETGGNAWFTKLVENAKKTENKDYFAVSDGVDVIYLEGQNEKGKEDPHAWLNLENGIIFAKNIAKQLSA
KDPNNKEFYEKNLKEYTDKLDKLDKESKDKFNKIPAEKKLIVTSEGAFKYFSKAYGVPSAYIWEINTEEEGTPEQIKTLV
EKLRQTKVPSLFVESSVDDRPMKTVSQDTNIPIYAQIFTDSIAEQGKEGDSYYSMMKYNLDKIAEGLAK
>P31304 ~~~ssaB~~~Metal ABC transporter substrate-binding lipoprotein SsaB~~~
MKKLGFLSLLLLAVCTLFACSNQKNASSDSSKLKVVATNSIIADITKNIAGDKIDLHSIVPVGKDPHEYEPLPEDVKKTS
QADLIFYNGINLETGGNAWFTKLVKNANKEENKDYYAVSDGVDVIYLEGQSEKGKEDPHAWLNLENGIIYAQNIAKRLIE
KDPDNKATYEKNLKAYVEKLTALDKEAKEKFNNIPEEKKMIVTSEGCFKYFSKAYNVPSAYIWEINTEEEGTPDQIKSLV
EKLRKTKVPSLFVESSVDDRPMKTVSKDTNIPIHAKIFTDSIADQGEEGDTYYSMMKYNLDKISEGLAK
>P42360 7.2.2.5~~~scaC~~~Manganese import ATP-binding protein ScaC~~~
MLRYINTVGVIYMIEIQNLSVSYQGQLALDKANVTIKGPTITGIIGPNGAGKSTLIKGLLGIVDHQGQALLDGQPLDKEL
KRIAYVEQKINIDYNFPIKVKECVSLGLYPKIKLFQRLKTSDWDKVNQALKIVGLEDFAERQISQLSGGQFQRVLIARCL
VQEADYIFLDEPFVGIDSVSEEIIMKTLRQLRKDGKTILIVHHDLSKVVAYFDQVLLLNKKVVAFGSTESTFTKENMQQT
YGSQLFMNGGA
>P15840 2.1.1.37~~~sssIM~~~Orphan methyltransferase M.SssI~~~
MSKVENKTKKLRVFEAFAGIGAQRKALEKVRKDEYEIVGLAEWYVPAIVMYQAIHNNFHTKLEYKSVSREEMIDYLENKT
LSWNSKNPVSNGYWKRKKDDELKIIYNAIKLSEKEGNIFDIRDLYKRTLKNIDLLTYSFPCQDLSQQGIQKGMKRGSGTR
SGLLWEIERALDSTEKNDLPKYLLMENVGALLHKKNEEELNQWKQKLESLGYQNSIEVLNAADFGSSQARRRVFMISTLN
EFVELPKGDKKPKSIKKVLNKIVSEKDILNNLLKYNLTEFKKTKSNINKASLIGYSKFNSEGYVYDPEFTGPTLTASGAN
SRIKIKDGSNIRKMNSDETFLYIGFDSQDGKRVNEIEFLTENQKIFVCGNSISVEVLEAIIDKIGG
>P14230 2.1.1.113~~~smaIM~~~Type II methyltransferase M.SmaI~~~
MKKHSNNLDLFDQTEECLESNLRSCKIIVGDAREAVQGLDSEIFDCVVTSPPYWGLRDYGNGGQIGAEDNINDYIKDLVD
LFRDVRRTLKDDGTLWLNIGDSYTSGGRTWRDKDDKNKGRAMSYRPPTPEGLKPKDLIGVPWRLAFALQNDGWYLRTDII
WNKPNCQPESVRDRPTRSHEYIFLLSKGKKYYYDWESIKEPASDPKMDKKNRRTVWNINTEPYPGSHFAVFPRAMARLCV
LAGSRPGGKVLDPFFGSGTTGVVCQELDRECVGIELNEEYASLAKERILRRR
>P14385 2.1.1.72~~~taqIM~~~Type II methyltransferase M.TaqI~~~
MGLPPLLSLPSNSAPRSLGRVETPPEVVDFMVSLAEAPRGGRVLEPACAHGPFLRAFREAHGTAYRFVGVEIDPKALDLP
PWAEGILADFLLWEPGEAFDLILGNPPYGIVGEASKYPIHVFKAVKDLYKKAFSTWKGKYNLYGAFLEKAVRLLKPGGVL
VFVVPATWLVLEDFALLREFLAREGKTSVYYLGEVFPQKKVSAVVIRFQKSGKGLSLWDTQESESGFTPILWAEYPHWEG
EIIRFETEETRKLEISGMPLGDLFHIRFAARSPEFKKHPAVRKEPGPGLVPVLTGRNLKPGWVDYEKNHSGLWMPKERAK
ELRDFYATPHLVVAHTKGTRVVAAWDERAYPWREEFHLLPKEGVRLDPSSLVQWLNSEAMQKHVRTLYRDFVPHLTLRML
ERLPVRREYGFHTSPESARNF
>Q03055 2.1.1.72~~~vspIM~~~Type II methyltransferase M.VspI~~~
MKQSALFEADDVEAISIDDVAFSLNVSSASVRNWIKTGYLHKATKNSVTAESFVAFKDEILGTEKLNQRANKSLKDQHDH
SGLEEMIHNIIRSNEVHPEGLSDIYEESLSESYKNKEGVFYTPKEIAADFFDYLPKDCSELTFCDPCCGTGNFLIEAVKR
GFKPCNIYGYDIDEVALEISRSRLKELCGVAESNIEKRDFLSASYQIEQKYDVIFTNPPWGKKLPKKDKDSLADSLATGN
SKDTSAIFFFASMKILNSSGYLGFLLQDAFFNIASYESVRKAALANQIVALIDFGKPFKGLLTKAKGIILRKQCPDDQHA
TICVSGNTKNEVSQRVFEKNPKSIFNFTCSELDLEVVEHILSIPHKTLRGSARWGLGIVTGNNKKFCLPEARGGYIPVYK
GSDITRKG
>Q7UG04 ~~~metXA~~~Bifunctional methionine biosynthesis protein MetXA/MetW~~~COG2021
MTSGRSTRVTMFESQASMGEPSNEDLSSTDDVRTDAPLAYAKYVTFDQSLPLERGGELPEIRCCYETWGTLNDDGSNAVL
VCHAVSGDSHAARHDEDDQPGWWDGLIGPGLPIDTDRLFVVCPNVLGGCRGSTGPGDADPTSPDGKPYGANFPRITIGDI
VEAQKLLADHLGIRQWRAVVGGSLGGHQVLQWINRYPDAAKTCVAIATSPRLNSQALGFDVIARNAIQTDPHYAGGQYYD
KDQRPDTGLAIARMLGHITYLSVEAMEAKFDPDRHDPRQIASQFEQRFSIGSYLAHQGQKFTTRFDANSYVTLSMAMDLF
DLGGTRLKLMETFDEATCDFLLISFSSDWLFPPAQSREIVNALTALDKRVTYAEITTNAGHDAFLIAKDIATYGPLIRER
LRDTETHPAVPSDITLNVDEESILEIIPAGSSVLDLGCGNGQLLAAIRDRHRTPGPPTTEHRLMGVEVAQENLLATAMRG
IDVIDYDLNHGLPAFIDDQFDYVILNATLQAVENVVELLNEMLRVGRHAIISFPNFAYRQLRDHYVTHGRSPKAPGEFDF
DWHNTPNRRFPTIADVRDLLGQLNVVIDEEVFWDVDQGQRIEPDNDPNLNADTAVIAFHRENR
>J2EKT7 ~~~MT~~~Metallothionein~~~
MNELRCGCPDCHCKVDPERVFNHDGEAYCSQACAEQHPNGEPCPAPDCHCERSGKVGGRDITNNQLDEALEETFPASDPI
SP
>P30331 ~~~smtA~~~Metallothionein~~~
MTSTTLVKCACEPCLCNVDPSKAIDRNGLYYCSEACADGHTGGSKGCGHTGCNCHG
>P08002 ~~~~~~Metallothionein~~~
TSTTLVKCACEPCLCNVDPSKAIDRNGLYYCCEACADGHTGGSKGCGHTGCNC
>P38107 ~~~mucA~~~Sigma factor AlgU negative regulatory protein~~~
MSREALQETLSAVMDNEADELELRRVLAACGEDAELRSTWSRYQLARSVMHREPTLPKLDIAAAVSAALADEAAPPKAEK
GPWRMVGRLAVAASVTLAVLAGVRLYNQNDALPQMAQQGTTPQIALPQVKGPAVLAGYSEEQGAPQVITNSSSSDTRWHE
QRLPIYLRQHVQQSAVSGTESALPYARAASLENR
>P38108 ~~~mucB~~~Sigma factor AlgU regulatory protein MucB~~~
MRTTSLLLLLGSLMAVPATQAADASDWLNRLAEADRQNSFQGTFVYERNGSFSTHEIWHRVESDGAVRERLLQLDGARQE
VVRVDGRTQCISGGLADQLADAQLWPVRKFDPSQLASWYDLRLVGESRVAGRPAVVLAVTPRDQHRYGFELHLDRDTGLP
LKSLLLNEKGQLLERFQFTQLNTGAAPAEDQLQAGAECQVVGPAKADGEKTVAWRSEWLPPGFTLTRSFMRRSPVTPDPV
ACLTYGDGLARFSVFIEPLHGAMVGDARSQLGPTVVVSKRLQTDDGGQMVTVVGEVPLGTAERVALSIRPEAAAQK
>P0A9H1 3.2.2.28~~~mug~~~G/U mismatch-specific DNA glycosylase~~~COG3663
MVEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLDYRCGVTKLVDRPTVQANEV
SKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGAQWGKQTLTIGSTQIWVLPNPSGLSRVSLEKLVEAYRELD
QALVVRGR
>P22523 ~~~mukB~~~Chromosome partition protein MukB~~~COG3096
MIERGKFRSLTLINWNGFFARTFDLDELVTTLSGGNGAGKSTTMAAFVTALIPDLTLLHFRNTTEAGATSGSRDKGLHGK
LKAGVCYSMLDTINSRHQRVVVGVRLQQVAGRDRKVDIKPFAIQGLPMSVQPTQLVTETLNERQARVLPLNELKDKLEAM
EGVQFKQFNSITDYHSLMFDLGIIARRLRSASDRSKFYRLIEASLYGGISSAITRSLRDYLLPENSGVRKAFQDMEAALR
ENRMTLEAIRVTQSDRDLFKHLISEATNYVAADYMRHANERRVHLDKALEFRRELHTSRQQLAAEQYKHVDMARELAEHN
GAEGDLEADYQAASDHLNLVQTALRQQEKIERYEADLDELQIRLEEQNEVVAEAIERQQENEARAEAAELEVDELKSQLA
DYQQALDVQQTRAIQYNQAIAALNRAKELCHLPDLTADCAAEWLETFQAKELEATEKMLSLEQKMSMAQTAHSQFEQAYQ
LVVAINGPLARNEAWDVARELLREGVDQRHLAEQVQPLRMRLSELEQRLREQQEAERLLADFCKRQGKNFDIDELEALHQ
ELEARIASLSDSVSNAREERMALRQEQEQLQSRIQSLMQRAPVWLAAQNSLNQLSEQCGEEFTSSQDVTEYLQQLLERER
EAIVERDEVGARKNAVDEEIERLSQPGGSEDQRLNALAERFGGVLLSEIYDDVSLEDAPYFSALYGPSRHAIVVPDLSQV
TEHLEGLTDCPEDLYLIEGDPQSFDDSVFSVDELEKAVVVKIADRQWRYSRFPEVPLFGRAARESRIESLHAEREVLSER
FATLSFDVQKTQRLHQAFSRFIGSHLAVAFESDPEAEIRQLNSRRVELERALSNHENDNQQQRIQFEQAKEGVTALNRIL
PRLNLLADDSLADRVDEIRERLDEAQEAARFVQQFGNQLAKLEPIVSVLQSDPEQFEQLKEDYAYSQQMQRDARQQAFAL
TEVVQRRAHFSYSDSAEMLSGNSDLNEKLRERLEQAEAERTRAREALRGHAAQLSQYNQVLASLKSSYDTKKELLNDLQR
ELQDIGVRADSGAEERARIRRDELHAQLSNNRSRRNQLEKALTFCEAEMDNLTRKLRKLERDYFEMREQVVTAKAGWCAV
MRMVKDNGVERRLHRRELAYLSADDLRSMSDKALGALRLAVADNEHLRDVLRMSEDPKRPERKIQFFVAVYQHLRERIRQ
DIIRTDDPVEAIEQMEIELSRLTEELTSREQKLAISSRSVANIIRKTIQREQNRIRMLNQGLQNVSFGQVNSVRLNVNVR
ETHAMLLDVLSEQHEQHQDLFNSNRLTFSEALAKLYQRLNPQIDMGQRTPQTIGEELLDYRNYLEMEVEVNRGSDGWLRA
ESGALSTGEAIGTGMSILVMVVQSWEDESRRLRGKDISPCRLLFLDEAARLDARSIATLFELCERLQMQLIIAAPENISP
EKGTTYKLVRKVFQNTEHVHVVGLRGFAPQLPETLPGTDEAPSQAS
>Q7VL96 ~~~mukB~~~Chromosome partition protein MukB~~~COG3096
MMNTNELFDQTAVNSSQDKPLNPPFAVAQPANIARGKFRSLTLINWNGFFARTFDFDELVTTLSGGNGAGKSTTMAGFVT
ALIPDLTLLNFRNTTEAGSTSSSRDKGLYGKLKAGVCYAVLETVNSRAQRIITGVRLQQIAGRDKKVDIRPFSLQNVPMA
DSVISLFTEQVANKARVLSLNDLKEKFEETAVTFKPYHSITDYHSFMFDLGILPKRLRSSSDRNKFYKLIEASLYGGISS
VITKSLRDYLLPENSGVRQAFQDMEAALRENRMTLEAIKVTQSDRDMFKHLITEATQYVSADYMRNANERRGNVVSALAQ
RRTWYDTKAKLLVEEQRLIEFSREVADINLTEQSLESEYNVANDHLNLVMNALRHQEKIIRYQDEVDALNEKLEQQQIAL
EEVSEQVEDAQAHTDEIDDRVDGLRSQIADYQQALDAQQTRALQYQQAITALQKAQQLCALPHLDLDNLKDYQTEFEAQA
QDITDHVFELEQRLSISDMTKTQFEKAYQLVCKVSGEIDRSQAWSEATQLLATFPDQKMQAAQAVALRQKLADLEQRLHQ
QQKVQRLIAEFNQQAAKKLTDFTALENHFEQQQIKLEDLEAELANVIELRSVHRQQQEQLTQQYNQLAKTAPAWHTARSV
LTRLEEQCAEQFENSQAIMHCMQEMLRKEREATLERDELARTEAALASQISQLSQADGAEDIRLNQLAERLGGVLLSELY
DDVSLQDAPYFSALYGEARHAIVVRDLTSVKAQLEKLTDCPNDLYLIEGDPTAFDDTVFSAEELYDSVIVKVSNRQWRYS
KFPEVPLFGRAAREKHLITLKTERDDIAEQHAESAFNVQKYQRLHQHLSQFVGTHLNIAFQPDPEMLMQEIALERQEIEG
QLNQAVENEHYLRQQADHLKAELQMLNKILPLANTLADETVMERFEECREQLQSAEENELFVRQFGQYLTQLAPIATSLQ
TDPSKFEQLEHDYQQAKSTQRILQQKVFALSDVMQRRLHFNYSEENQCEGSALTEQLRTDLALAQQEREQARQRLRQAQA
QFTQYNQVLISLRSAYDAKYQMLQELMQEIDDLGVRGDSAAEECARLRRDELQQQLSQQRARKGYLDKQLGIIEAEIDNL
NRLLRKTERDYQTQRELVVQAKASWCLVQKLSRNSDVEKRLNRRELAYQSAEELRSISDKALGALRTAVADNEYLRDSLR
ASEDSRKPENKVAFFIAVYQHLRERIRQDIIRTDDPIDAIEQMEIELSRLINELTSREKKLAISAESVANILRKTIQREQ
NRILQLNQGLQNIAFGQVKGVRLVVNIRDTHSILLNALSDQHEQHKDLFESQKLSFSEALAMLYKRVNPHIELGQRMPQT
IGEELLDYRNYLDLEVETLRGADGWMRAESSALSTGEAIGTGMSILLMVVQSWEEESRRMRAKDILPCRLLFLDEAARLD
AMSINTLFELCERLDMQLLIAAPENISPEHGTTYKLVRKILANQEYVHVVGLKGFGQQMNKST
>P22524 ~~~mukE~~~Chromosome partition protein MukE~~~COG3095
MSSTNIEQVMPVKLAQALANPLFPALDSALRSGRHIGLDELDNHAFLMDFQEYLEEFYARYNVELIRAPEGFFYLRPRST
TLIPRSVLSELDMMVGKILCYLYLSPERLANEGIFTQQELYDELLTLADEAKLLKLVNNRSTGSDVDRQKLQEKVRSSLN
RLRRLGMVWFMGHDSSKFRITESVFRFGADVRAGDDPREAQRRLIRDGEAMPIENHLQLNDETEENQPDSGEEE
>Q7VL95 ~~~mukE~~~Chromosome partition protein MukE~~~COG3095
MTEYIQDAIPAKLAIAIANPIFPQLDSQLRAGRHISIEMLDEHAFLMDFQTELESFYRRYHVDLIRAPEGFFYLRPKAST
LIARSAMSEMEMLVGKVLCYLYLSPERLAQQGIFSQDDVYEELLNLADENKLLKAVNPRSTGSDLDRAKLAEKVGGALRR
LARIGIITRVGEQNSKKFIISEAVFRFGADVRAGDDPREVQLRLIRDGEATTPTLLTTEAIEFAEDGARDELEESEAE
>Q7UD30 ~~~mukE~~~Chromosome partition protein MukE~~~
MPVKLAQALANPLFPALDSALRSGRHIGLDELDNHAFLMDFQEYLEEFYARYNVELIRAPEGFFYLRPRSTTLIPRSVLS
ELDMMVGKILCYLYLSPERLANEGIFTQQELYDELLTLADEAKLLKLVNNRSTGSDVDRQKLQEKVRSSLNRLRRLGMVW
FMGHDSSKFRITESVFRFGADVRAGDDPREAQRRLIRDGEAMPIENHLQLNDETEESQPDSGEEE
>P60293 ~~~mukF~~~Chromosome partition protein MukF~~~COG3006
MSEFSQTVPELVAWARKNDFSISLPVDRLSFLLAVATLNGERLDGEMSEGELVDAFRHVSDAFEQTSETIGVRANNAIND
MVRQRLLNRFTSEQAEGNAIYRLTPLGIGITDYYIRQREFSTLRLSMQLSIVAGELKRAADAAEEGGDEFHWHRNVYAPL
KYSVAEIFDSIDLTQRLMDEQQQQVKDDIAQLLNKDWRAAISSCELLLSETSGTLRELQDTLEAAGDKLQANLLRIQDAT
MTHDDLHFVDRLVFDLQSKLDRIISWGQQSIDLWIGYDRHVHKFIRTAIDMDKNRVFAQRLRQSVQTYFDEPWALTYANA
DRLLDMRDEEMALRDEEVTGELPEDLEYEEFNEIREQLAAIIEEQLAVYKTRQVPLDLGLVVREYLSQYPRARHFDVARI
VIDQAVRLGVAQADFTGLPAKWQPINDYGAKVQAHVIDKY
>Q7VL94 ~~~mukF~~~Chromosome partition protein MukF~~~COG3006
MQNELAQTIPELISWTKEREFSLSLPSDRLAFLLVISIYNNEQTDGELLESDLIDLFRYVSNVFEQSEASLLQRANNAIN
DLVKQRFLNRFSSEFTEGLAIYRVTPLGVGVSDYYVRQREFSSLRLSIQLSIVADEIQRASVAAEQGGDERYWRNNVFAP
LKFSVAEIFDSIDLSQRMMDENQHQIREQIAGLLSQNWHEAIINCQQLLDETSINLRELQDTLNAAGDKLQSQLLRIQSC
LISRDDLAFVDQLIVNLQNKLDRIMSWGQQAIDLWIGYDKHVHKFIRTAIDMDKNRVFGQRLRQSIQNYFSSPWLLYTAK
AEALLDLRDDEAMLNEMEAVGELPMALEYESLTDVQTQIVTAIQAELAHFRDTAQPINLGAVLREQLARYPQSRHFDVAR
IIVDQAVKLGMASQDHQAVYPVWQPIDDFSAAVQAHLIDQYDK
>A0A0H2XHV5 3.2.1.-~~~mupG~~~6-phospho-N-acetylmuramidase~~~
MTGFSVYLGQPLDEAYIKRMIKQGYQMIFTSVQIPEEDDETKYHYFTKLLNLLKHEQVTYLIDANPSILTPSFYDHLRQY
DAQFMIRIDHSTSIEAIEAIMAQGLKCCLNASIISRELLTSLHQQLNDFTLLSFCHNYYPRPDTGLSVDLVNKKNELIYQ
FNPKAQIYGFIVGSDLRGPLHKGLPTIEATRHSHPVVAAKLLQETGVSEVLVGDSLIEMRQAKQLIDFCKHRHFTLCIEE
VFDTTVTYLFDMCHKVRPDNPENVIRSETSRQICPHSIQPQFTTQRRIGSVTVDNLNNGRYQGEMQIVRQTLSAHDNVNV
VAQIIKEDLPLLSCIEPNDTFDFQKTRECKK
>Q9HZ62 3.1.3.105~~~mupP~~~N-acetylmuramic acid 6-phosphate phosphatase~~~
MKRMRLKAVLFDMDGTLLDTAPDFIAITQAMRAAHGLPPVDEQRVRDVVSGGARAMVAAAFGLSLDSPEVEPLRQEFLDR
YQEHCAVLSRPYDGIPELLAAIEKAGLIWGVVTNKPVRFAEPIMQRLGYAERSRVLVCPDHVTRSKPDPEPLLLACSQLG
IDPSRVLFIGDDLRDIESGRDAGTKTAAVRYGYIHPEDNPAHWGADVIVDHPRELIDVLDRALCDC
>Q88M11 3.1.3.105~~~mupP~~~N-acetylmuramic acid 6-phosphate phosphatase~~~COG0546
MRLRAVLFDMDGTLLDTAPDFIAICQAMLAERGLPAVDDNLIRGVISGGARAMVATAFAMDPEADGFEALRLEFLERYQR
DCAVHSKLFEGMAELLADIEKGNLLWGVVTNKPVRFAEPIMQQLGLAERSALLICPDHVKNSKPDPEPLILACKTLNLDP
ASVLFVGDDLRDIESGRDAGTRTAAVRYGYIHPEDNPNNWGADVVVDHPLELRKVIDSALCGC
>P39046 3.2.1.17~~~~~~Muramidase-2~~~COG1388
MENIARKERRRLNETKRFRKVKRSAALVGTAMVGCSVAAPLIQPVQVDADQTPTQFGARINTAAFIAEIATYAQPIAQAN
DLYASVMIAQAVVESGWGSSALSQAPYYNLFGIKGSYQGQTVYMDTLEYLNNKWVSKKEPFRQYPSFAESFNDNAYVLRN
TSFGNGYYYAGTWKSNTKSYTDATACLTGRYATDPGYAGKLNNIITTYGLTKYDTPASGNAGGGVTIGNGGNTGNTSNSG
STSGNSGGSATTTGTTYTVKSGDSVWGISHSFGITMAQLIEWNNIKNNFIYPGQKLTIKGGQSAGSSTTNTGNNASSGNT
SGNTNTSGSTGQATGAKYTVKSGDSVWKIANDHGISMNQLIEWNNIKNNFVYPGQQLVVSKGSSSASGSTSNTSTGNTSS
NTANTGSTTSGSTYTVKAGESVWSVSNKFGISMNQLIQWNNIKNNFIYPGQKLIVKGGSSSSNASTSTANNKNTASSNTS
STATGQATYTVKAGESVWGVANKNGISMNQLIEWNNIKNNFIYPGQKLIVKGGSSKASATATIKPTASTPASTTPTASST
GDTKYTVKAGESVWGVANKHHITMDQLIEWNNIKNNFIYPGQEVIVKKGTAQSTPAKSDEKTYTVKAGESVWGVADSHGI
TMNQLIEWNNIKNNFIYPGQQLIVKK
>Q81K13 2.5.1.7~~~murA1~~~UDP-N-acetylglucosamine 1-carboxyvinyltransferase 1~~~COG0766
MEKIIVRGGKRLNGTVRVEGAKNAVLPIIAAALLASDGKNVLSEVPVLSDVYTINEVLRHLNAEVVFENNQVTIDASKEL
NIEAPFEYVRKMRASVQVMGPLLARNGRARIALPGGCAIGSRPIDQHLKGFEAMGAKVQVGNGFVEAYVEGELKGAKIYL
DFPSVGATENIMSAATLAKGTTILENAAKEPEIVDLANFLNAMGAKVRGAGTGTIRIEGVDKLYGANHSIIPDRIEAGTF
MVAAAITGGDILIENAVPEHLRSITAKMEEMGVKIIEENEGVRVIGPDKLKAVDIKTMPHPGFPTDMQSQMMALLLQADG
TSMITETVFENRFMHVEEFRRMNADIKIEGRSVIMNGPNSLQGAEVGATDLRAAAALILAGLVSEGYTRVTELKHLDRGY
VDFHKKLAALGATIERVNEKVEEVKEQEVSDLHA
>P70965 2.5.1.7~~~murAA~~~UDP-N-acetylglucosamine 1-carboxyvinyltransferase 1~~~COG0766
MEKIIVRGGQKLNGTVKVEGAKNAVLPVIAASLLASEEKSVICDVPTLSDVYTINEVLRHLGADVHFENNEVTVNASYAL
QTEAPFEYVRKMRASVLVMGPLLARTGHARVALPGGCAIGSRPIDQHLKGFEAMGAEIKVGNGFIEAEVKGRLQGAKIYL
DFPSVGATENLIMAAALAEGTTTLENVAKEPEIVDLANYINGMGGKIRGAGTGTIKIEGVEKLHGVKHHIIPDRIEAGTF
MVAAAITEGNVLVKGAVPEHLTSLIAKMEEMGVTIKDEGEGLRVIGPKELKPIDIKTMPHPGFPTDMQSQMMALLLRASG
TSMITETVFENRFMHAEEFRRMNGDIKIEGRSVIINGPVQLQGAEVAATDLRAGAALILAGLVAEGHTRVTELKHLDRGY
VDFHQKLAALGADIERVNDESASEQENKEVVSDLNA
>Q8Y4C4 2.5.1.7~~~murA1~~~UDP-N-acetylglucosamine 1-carboxyvinyltransferase 1~~~COG0766
MEKIIVRGGKQLNGSVKMEGAKNAVLPVIAATLLASKGTSVLKNVPNLSDVFTINEVLKYLNADVSFVNDEVTVDATGEI
TSDAPFEYVRKMRASIVVMGPLLARTGSARVALPGGCAIGSRPVDLHLKGFEAMGAVVKIENGYIEATAEKLVGAKVYLD
FPSVGATQNIMMAATLAEGTTVIENVAREPEIVDLANFLNQMGARVIGAGTEVIRIEGVKELTATEHSIIPDRIEAGTFM
IAAAITGGNVLIEDAVPEHISSLIAKLEEMGVQIIEEENGIRVIGPDKLKAVDVKTMPHPGFPTDMQSQMMVIQMLSEGT
SIMTETVFENRFMHVEEMRRMNADMKIEGHSVIISGPAKLQGAEVAATDLRAAAALILAGLVADGYTQVTELKYLDRGYN
NFHGKLQALGADVERVDDSKVDVTNLASLF
>P84058 2.5.1.7~~~murA1~~~UDP-N-acetylglucosamine 1-carboxyvinyltransferase 1~~~
MDKIVIKGGNKLTGEVKVEGAKNAVLPILTASLLASDKPSKLVNVPALSDVETINNVLTTLNADVTYKKDENAVVVDATK
TLNEEAPYEYVSKMRASILVMGPLLARLGHAIVALPGGCAIGSRPIEQHIKGFEALGAEIHLENGNIYANAKDGLKGTSI
HLDFPSVGATQNIIMAASLAKGKTLIENAAKEPEIVDLANYINEMGGRITGAGTDTITINGVESLHGVEHAIIPDRIEAG
TLLIAGAITRGDIFVRGAIKEHMASLVYKLEEMGVELDYQEDGIRVRAEGELQPVDIKTLPHPGFPTDMQSQMMALLLTA
NGHKVVTETVFENRFMHVAEFKRMNANINVEGRSAKLEGKSQLQGAQVKATDLRAAAALILAGLVADGKTSVTELTHLDR
GYVDLHGKLKQLGADIERIND
>A0A0H2ZNL3 2.5.1.7~~~murA1~~~UDP-N-acetylglucosamine 1-carboxyvinyltransferase~~~COG0766
MRKIVINGGLPLQGEITISGAKNSVVALIPAIILADDVVTLDCVPDISDVASLVEIMELMGATVKRYDDVLEIDPRGVQN
IPMPYGKINSLRASYYFYGSLLGRFGEATVGLPGGCDLGPRPIDLHLKAFEAMGATASYEGDNMKLSAKDTGLHGASIYM
DTVSVGATINTMIAAVKANGRTIIENAAREPEIIDVATLLNNMGAHIRGAGTNIIIIDGVERLHGTRHQVIPDRIEAGTY
ISLAAAVGKGIRINNVLYEHLEGFIAKLEEMGVRMTVSEDSIFVEEQSNLKAINIKTAPYPGFATDLQQPLTPLLLRANG
RGTIVDTIYEKRVNHVFELAKMDADISTTNGHILYTGGRDLRGASVKATDLRAGAALVIAGLMAEGKTEITNIEFILRGY
SDIIEKLRNLGADIRLVED
>B1IBM3 2.5.1.7~~~murA1~~~UDP-N-acetylglucosamine 1-carboxyvinyltransferase~~~
MRKIVINGGLPLQGEITISGAKNSVVALIPAIILADDVVTLDCVPDISDVASLVEIMELMGATVKRYDDVLEIDPRGVQN
IPMPYGKINSLRASYYFYGSLLGRFGEATVGLPGGCDLGPRPIDLHLKAFEAMGATASYEGDNMKLSAKDTGLHGASIYM
DTVSVGATINTMIAAVKANGRTIIENAAREPEIIDVATLLNNMGAHIRGAGTNIIIIDGVERLHGTRHQVIPDRIEAGTY
ISLAAAVGKGIRINNVLYEHLEGFIAKLEEMGVRMTVSEDSIFVEEQSNLKAINIKTAPYPGFATDLQQPLTPLLLRANG
RGTIVDTIYEKRVNHVFELAKMDADISTTNGHILYTGGRDLRGTSVKATDLRAGAALVIAGLMAEGKTEITNIEFILRGY
SDIIEKLRNLGADIRLVED
>Q97NQ4 2.5.1.7~~~murA1~~~UDP-N-acetylglucosamine 1-carboxyvinyltransferase 1~~~COG0766
MDKIVVQGGDNRLVGSVTIEGAKNAVLPLLAATILASEGKTVLQNVPILSDVFIMNQVVGGLNAKVDFDEEAHLVKVDAT
GDITEEAPYKYVSKMRASIVVLGPILARVGHAKVSMPGGCTIGSRPIDLHLKGLEAMGVKISQTAGYIEAKAERLHGAHI
YMDFPSVGATQNLMMAATLADGVTVIENAAREPEIVDLAILLNEMGAKVKGAGTETITITGVEKLHGTTHNVVQDRIEAG
TFMVAAAMTGGDVLIRDAVWEHNRPLIAKLLEMGVEVIEEDEGIRVRSQLENLKAVHVKTLPHPGFPTDMQAQFTALMTV
AKGESTMVETVFENRFQHLEEMRRMGLHSEIIRDTARIVGGQPLQGAEVLSTDLRASAALILTGLVAQGETVVGKLVHLD
RGYYGFHEKLAQLGAKIQRIEASDEDE
>P19670 2.5.1.7~~~murAB~~~UDP-N-acetylglucosamine 1-carboxyvinyltransferase 2~~~COG0766
MEKLNIAGGDSLNGTVHISGAKNSAVALIPATILANSEVTIEGLPEISDIETLRDLLKEIGGNVHFENGEMVVDPTSMIS
MPLPNGKVKKLRASYYLMGAMLGRFKQAVIGLPGGCHLGPRPIDQHIKGFEALGAEVTNEQGAIYLRAERLRGARIYLDV
VSVGATINIMLAAVLAEGKTIIENAAKEPEIIDVATLLTSMGAKIKGAGTNVIRIDGVKELHGCKHTIIPDRIEAGTFMI
AGAAMGKEVIIDNVIPTHLESLTAKLREMGYHIETSDDQLLIVGGQKNLKPVDVKTLVYPGFPTDLQQPMTALLTRAKGT
SVVTDTIYSARFKHIDELRRMGANMKVEGRSAIITGPVELQGAKVKASDLRAGACLVVAGLMADGVTEITGLEHIDRGYS
SLEKKLEGLGATIWRERMTDEEIEQLQNS
>P65457 2.5.1.7~~~murA2~~~UDP-N-acetylglucosamine 1-carboxyvinyltransferase 2~~~
MAQEVIKIRGGRTLNGEVNISGAKNSAVAIIPATLLAQGHVKLEGLPQISDVKTLVSLLEDLNIKASLNGTELEVDTTEI
QNAALPNNKVESLRASYYMMGAMLGRFKKCVIGLPGGCPLGPRPIDQHIKGFKALGAEIDESSTTSMKIEAKELKGAHIF
LDMVSVGATINIMLAAVYATGQTVIENAAKEPEVVDVANFLTSMGANIKGAGTSTIKINGVKELHGSEYQVIPDRIEAGT
YMCIAAACGENVILNNIVPKHVETLTAKFSELGVNVDVRDERIRINNNAPYQFVDIKTLVYPGFATDLQQPITPLLFMAN
GPSFVTDTIYPERFKHVEELKRMGANIEVDEGTATIKPSTLHGAEVYASDLRAGACLIIAGLIAEGVTTIYNVKHIYRGY
TDIVEHLKALGADIWTETV
>B5F9P4 2.5.1.7~~~murA~~~UDP-N-acetylglucosamine 1-carboxyvinyltransferase~~~
MYKFRIQGSDKPLSGEVTISGAKNAALPILFASLLAEEPVEVANVPKLRDVDTTMELLKRLGAEVSRNGSVHIDASGVND
FCAPYDLVKTMRASIWALGPLVARFGKGQVSLPGGCAIGARPVDLHIHGLEQLGATIKLEEGYVKAEVDGRLKGAHIVMD
KVSVGATITVMCAATLAEGTTVLENAAREPEIVDTANFLNAIGAKVSGMGTDTITIEGVERLGGGYHEVVADRIETGTFL
VAAAVSGGKIVCKNTKAHLLEAVLAKLEEAGADVQTGDDWISLDMTGRELKAVNIRTAPHPAFPTDMQAQFTLLNMMAKG
SGIITETIFENRFMHIPELQRMGAHAEIEGNTAICGDTDGLSGAQVMATDLRASASLVIAGCIAKGETIVDRIYHIDRGY
DKIEDKLTALGANIERVHSDDL
>O67315 2.5.1.7~~~murA~~~UDP-N-acetylglucosamine 1-carboxyvinyltransferase~~~COG0766
MKNTTLYTYRDYFVIRGGKPLTGKVKISGAKNAALPIMFATILTEEPCTITNVPDLLDVRNTLLLLRELGAELEFLNNTV
FINPSINSFITNQEIIRRMRASVLSLGPLLGRFGRAVVGLPGGCSIGARPIDQHLKFFKEAGADVEVREGYVYVNLKEKR
RVHFKFDLVTVTGTENALLYLASVPEESILENIALEPEVMDLIEVLKKMGAHVKVEGRSAYVKGSENLKGFTHSVIPDRI
EAGTFMVGAVLTDGEILLENARINHLRAVVEKLKLIGGEVVEENGNLRVFRKESLRACDIETQVYPGFPTDMQAQFMALL
SVAKGKSRIKENIFEHRFHHAQELNRLGANITVRGNTAYVEGVERLYGSEVYSTDLRASASLVLAGLVAQGETVVRDVYH
LDRGYEKLEEKLKKLGADIERVSEL
>Q9PP65 2.5.1.7~~~murA~~~UDP-N-acetylglucosamine 1-carboxyvinyltransferase~~~COG0766
MTYLEIEGTNHLSGNVTISGAKNAALPLIVSSILAKNEVKINNVPNVADIKTLISLLENLGAKVNFQNNSALLNTNTLNQ
TIAKYDIVRKMRASILTLGPLLARFGHCEVSLPGGCAIGQRPIDLHLLALEKMGANIQIKQGYVVASGNLKGNEILFDKI
TVTGSENIIMAAALAKGKTKLLNVAKEPEVVQLCEVLKDAGLEIKGIGTDELEIYGSDGELLEFKEFSVIPDRIEAGTYL
CAGAITNSKITLDKVNATHLSAVLAKLHQMGFETLITEDSITLLPAKEIKPVEIMTSEYPGFPTDMQAQFMALALKANGT
SIIDERLFENRFMHVSELLRMGADIKLNGHIATIVGGKELNAADVMATDLRASSALILAALAAKGTSKVHRIYHLDRGYE
NLEEKFKDLGAKITRLEE
>P0A749 2.5.1.7~~~murA~~~UDP-N-acetylglucosamine 1-carboxyvinyltransferase~~~COG0766
MDKFRVQGPTKLQGEVTISGAKNAALPILFAALLAEEPVEIQNVPKLKDVDTSMKLLSQLGAKVERNGSVHIDARDVNVF
CAPYDLVKTMRASIWALGPLVARFGQGQVSLPGGCTIGARPVDLHISGLEQLGATIKLEEGYVKASVDGRLKGAHIVMDK
VSVGATVTIMCAATLAEGTTIIENAAREPEIVDTANFLITLGAKISGQGTDRIVIEGVERLGGGVYRVLPDRIETGTFLV
AAAISRGKIICRNAQPDTLDAVLAKLRDAGADIEVGEDWISLDMHGKRPKAVNVRTAPHPAFPTDMQAQFTLLNLVAEGT
GFITETVFENRFMHVPELSRMGAHAEIESNTVICHGVEKLSGAQVMATDLRASASLVLAGCIAEGTTVVDRIYHIDRGYE
RIEDKLRALGANIERVKGE
>P33038 2.5.1.7~~~murA~~~UDP-N-acetylglucosamine 1-carboxyvinyltransferase~~~COG0766
MDKFRVQGPTRLQGEVTISGAKNAALPILFAALLAEEPVEIQNVPKLKDIDTTMKLLTQLGTKVERNGSVWIDASNVNNF
SAPYDLVKTMRASIWALGPLVARFGQGQVSLPGGCAIGARPVDLHIFGLEKLGAEIKLEEGYVKASVNGRLKGAHIVMDK
VSVGATVTIMSAATLAEGTTIIENAAREPEIVDTANFLVALGAKISGQGTDRITIEGVERLGGGVYRVLPDRIETGTFLV
AAAISGGKIVCRNAQPDTLDAVLAKLREAGADIETGEDWISLDMHGKRPKAVTVRTAPHPAFPTDMQAQFTLLNLVAEGT
GVITETIFENRFMHVPELIRMGAHAEIESNTVICHGVEKLSGAQVMATDLRASASLVLAGCIAEGTTVVDRIYHIDRGYE
RIEDKLRALGANIERVKGE
>P45025 2.5.1.7~~~murA~~~UDP-N-acetylglucosamine 1-carboxyvinyltransferase~~~COG0766
MDKFRVYGQSRLSGSVNISGAKNAALPILFAAILATEPVKLTNVPELKDIETTLKILRQLGVVVDRDATGAVLLDASNIN
HFTAPYELVKTMRASIWALAPLVARFHQGQVSLPGGCSIGARPVDLHISGLEKLGADIVLEEGYVKAQVSDRLVGTRIVI
EKVSVGATLSIMMAATLAKGTTVIENAAREPEIVDTADFLNKMGAKITGAGSAHITIEGVERLTGCEHSVVPDRIETGTF
LIAAAISGGCVVCQNTKADTLDAVIDKLREAGAQVDVTENSITLDMLGNRPKAVNIRTAPHPGFPTDMQAQFTLLNMVAE
GTSIITETIFENRFMHIPELIRMGGKAEIEGNTAVCHGVEQLSGTEVIATDLRASISLVLAGCIATGETIVDRIYHIDRG
YEHIEDKLRGLGAKIERFSGSDEA
>P9WJM1 2.5.1.7~~~murA~~~UDP-N-acetylglucosamine 1-carboxyvinyltransferase~~~COG0766
MAERFVVTGGNRLSGEVAVGGAKNSVLKLMAATLLAEGTSTITNCPDILDVPLMAEVLRGLGATVELDGDVARITAPDEP
KYDADFAAVRQFRASVCVLGPLVGRCKRARVALPGGDAIGSRPLDMHQAGLRQLGAHCNIEHGCVVARAETLRGAEIQLE
FPSVGATENILMAAVVAEGVTTIHNAAREPDVVDLCTMLNQMGAQVEGAGSPTMTITGVPRLHPTEHRVIGDRIVAATWG
IAAAMTRGDISVAGVDPAHLQLVLHKLHDAGATVTQTDASFRVTQYERPKAVNVATLPFPGFPTDLQPMAIALASIADGT
SMITENVFEARFRFVEEMIRLGADARTDGHHAVVRGLPQLSSAPVWCSDIRAGAGLVLAGLVADGDTEVHDVFHIDRGYP
LFVENLVSLGAEIERVCC
>Q9HVW7 2.5.1.7~~~murA~~~UDP-N-acetylglucosamine 1-carboxyvinyltransferase~~~
MDKLIITGGNRLDGEIRISGAKNSALPILAATLLADTPVTVCNLPHLHDITTMIELFGRMGVQPIIDEKLNVEVDASSIK
TLVAPYELVKTMRASILVLGPMLARFGEAEVALPGGCAIGSRPVDLHIRGLEAMGAQIEVEGGYIKAKAPAGGLRGGHFF
FDTVSVTGTENLMMAAALANGRTVLQNAAREPEVVDLANCLNAMGANVQGAGSDTIVIEGVKRLGGARYDVLPDRIETGT
YLVAAAATGGRVKLKDTDPTILEAVLQKLEEAGAHISTGSNWIELDMKGNRPKAVNVRTAPYPAFPTDMQAQFISMNAVA
EGTGAVIETVFENRFMHVYEMNRMGAQILVEGNTAIVTGVPKLKGAPVMATDLRASASLVIAGLVAEGDTLIDRIYHIDR
GYECIEEKLQLLGAKIRRVPG
>Q88P88 2.5.1.7~~~murA~~~UDP-N-acetylglucosamine 1-carboxyvinyltransferase~~~COG0766
MDKLIITGGARLDGEIRISGAKNAALPILAATLLADGPVTVGNLPHLHDITTMIELFGRMGIEPVIDEKLSVEIDPRTIK
TLVAPYELVKTMRASILVLGPMVARFGEAEVALPGGCAIGSRPVDLHIRGLEAMGAKIEVEGGYIKAKAPEGGLRGAHFF
FDTVSVTGTENIMMAAALAKGRSVLQNAAREPEVVDLANFINAMGGNIQGAGTDTITIDGVERLDSANYRVMPDRIETGT
YLVAAAVTGGRVKVKDTDPTILEAVLEKLKEAGADINTGEDWIELDMHGKRPKAVNLRTAPYPAFPTDMQAQFISLNAIA
EGTGAVIETIFENRFMHVYEMHRMGAQIQVEGNTAIVTGVKALKGAPVMATDLRASASLVLSALVAEGDTLIDRIYHIDR
GYECIEEKLQMLGAKIRRVPG
>B2FRX1 2.5.1.7~~~murA~~~UDP-N-acetylglucosamine 1-carboxyvinyltransferase~~~COG0766
MAKIVVTGGAALHGEVSISGAKNAVLPILCATLLADEPVEITNVPHLHDVVTTVKLLGELGAKVTIDQGTLSRGSAIVVD
PRPVNQHVAPYELVKTMRASILVLGPLLARFGAAEVSLPGGCAIGSRPVDQHIKGLQALGAEIVVENGFIKASAKRLKGG
HFTFDMVSVTGTENVLMGAVLAEGTTVLDNCAMEPEVTDLAHCLIALGAKIEGLGTARLVIEGVERLSGGRHEVLPDRIE
TGTFLVAAAMTGGKVTVNRARPNTMDAVLSKLVEAGAKIETTDDSITLDMQGRRPKAVNLTTAPYPAFPTDMQAQFMALN
CVADGVGVINETIFENRFMHVNELLRLGADIQVEGHTAIVRGSEHLSGAPVMATDLRASASLILAGLMASGDTTIDRIYH
LDRGYENIEEKLSSLGATIRRVP
>Q9KP62 2.5.1.7~~~murA~~~UDP-N-acetylglucosamine 1-carboxyvinyltransferase~~~COG0766
MEKFRVIGSTQPLQGEVTISGAKNAALPILFASILAEEPVEVANVPHLRDIDTTMELLERLGAKVERNGSVHVDAGPINQ
YCAPYDLVKTMRASIWALGPLVARFGQGQVSLPGGCAIGARPVDLHIHGLEQLGATITLEDGYVKAHVDGRLQGAHIVMD
KVSVGATITIMCAATLAEGTTVLDNAAREPEIVDTAMFLNKLGAKISGAGTDSITIEGVERLGGGKHAVVPDRIETGTFL
VAAAVSRGKIVCRNTHAHLLEAVLAKLEEAGAEIECGEDWISLDMTGRELKAVTVRTAPHPGFPTDMQAQFTLLNMMAKG
GGVITETIFENRFMHVPELKRMGAKAEIEGNTVICGDVDRLSGAQVMATDLRASASLVIAGCIAKGETIVDRIYHIDRGY
ERIEDKLSALGANIERFRD
>Q65JX9 1.3.1.98~~~murB~~~UDP-N-acetylenolpyruvoylglucosamine reductase~~~COG0812
MDKVIQELKELEVGKVLENEPLSNHTTIKIGGPADVLVIPKDIQAVKDTMKVVKKHGVKWTAIGRGSNLLVLDEGIRGVV
IKLGQGLDHMEIDGEQVTVGGGYSVVRLATGISKKGLSGLEFAAGIPGSVGGAVYMNAGAHGSDISKVLVKALILFEDGT
IEWLTNEEMAFSYRTSILQNKRPGICLEAVLQLEQKERDQIVAQMQKNKDYRKETQPVSNPCAGSIFRNPLPEHAGRLVE
EAGLKGHQIGGAKVSEMHGNFIVNAGGATAKDVLDLIAFIQKTIKEKYDIDMHTEVEIIGEKR
>P08373 1.3.1.98~~~murB~~~UDP-N-acetylenolpyruvoylglucosamine reductase~~~COG0812
MNHSLKPWNTFGIDHNAQHIVCAEDEQQLLNAWQYATAEGQPVLILGEGSNVLFLEDYRGTVIINRIKGIEIHDEPDAWY
LHVGAGENWHRLVKYTLQEGMPGLENLALIPGCVGSSPIQNIGAYGVELQRVCAYVDSVELATGKQVRLTAKECRFGYRD
SIFKHEYQDRFAIVAVGLRLPKEWQPVLTYGDLTRLDPTTVTPQQVFNAVCHMRTTKLPDPKVNGNAGSFFKNPVVSAET
AKALLSQFPTAPNYPQADGSVKLAAGWLIDQCQLKGMQIGGAAVHRQQALVLINEDNAKSEDVVQLAHHVRQKVGEKFNV
WLEPEVRFIGASGEVSAVETIS
>Q8Y776 1.3.1.98~~~murB~~~UDP-N-acetylenolpyruvoylglucosamine reductase~~~COG0812
MNNLQTKFPHIAIKLNEPLSKYTYTKTGGAADVFVMPKTIEEAQEVVAYCHQNKIPLTILGNGSNLIIKDGGIRGVILHL
DLLQTIERNNTQIVAMSGAKLIDTAKFALNESLSGLEFACGIPGSIGGALHMNAGAYGGEISDVLEAATVLTQTGELKKL
KRSELKAAYRFSTIAEKNYIVLDATFSLALEEKNLIQAKMDELTAAREAKQPLEYPSCGSVFKRPPGHFAGKLIQDSGLQ
GHIIGGAQVSLKHAGFIVNIGGATATDYMNLIAYVQQTVREKFDVELETEVKIIGEDK
>P9WJL9 1.3.1.98~~~murB~~~UDP-N-acetylenolpyruvoylglucosamine reductase~~~COG0812
MKRSGVGSLFAGAHIAEAVPLAPLTTLRVGPIARRVITCTSAEQVVAALRHLDSAAKTGADRPLVFAGGSNLVIAENLTD
LTVVRLANSGITIDGNLVRAEAGAVFDDVVVRAIEQGLGGLECLSGIPGSAGATPVQNVGAYGAEVSDTITRVRLLDRCT
GEVRWVSARDLRFGYRTSVLKHADGLAVPTVVLEVEFALDPSGRSAPLRYGELIAALNATSGERADPQAVREAVLALRAR
KGMVLDPTDHDTWSVGSFFTNPVVTQDVYERLAGDAATRKDGPVPHYPAPDGVKLAAGWLVERAGFGKGYPDAGAAPCRL
STKHALALTNRGGATAEDVVTLARAVRDGVHDVFGITLKPEPVLIGCML
>Q9HZM7 1.3.1.98~~~murB~~~UDP-N-acetylenolpyruvoylglucosamine reductase~~~
MSLELQEHCSLKPYNTFGIDVRARLLAHARDEADVREALALARERGLPLLVIGGGSNLLLTRDVEALVLRMASQGRRIVS
DAADSVLVEAEAGEAWDPFVQWSLERGLAGLENLSLIPGTVGAAPMQNIGAYGVELKDVFDSLTALDRQDGTLREFDRQA
CRFGYRDSLFKQEPDRWLILRVRLRLTRRERLHLDYGPVRQRLEEEGIASPTARDVSRVICAIRREKLPDPAVLGNAGSF
FKNPLVDATQAERLRQAFPDLVGYPQADGRLKLAAGWLIDKGGWKGFRDGPVGVHAQQALVLVNHGGATGAQVRALAERI
QEDVRRRFGVELEPEPNLY
>P65463 1.3.1.98~~~murB~~~UDP-N-acetylenolpyruvoylglucosamine reductase~~~
MINKDIYQALQQLIPNEKIKVDEPLKRYTYTKTGGNADFYITPTKNEEVQAVVKYAYQNEIPVTYLGNGSNIIIREGGIR
GIVISLLSLDHIDVSDDAIIAGSGAAIIDVSRVARDYALTGLEFACGIPGSIGGAVYMNAGAYGGEVKDCIDYALCVNEQ
GSLIKLTTKELELDYRNSIIQKEHLVVLEAAFTLAPGKMTEIQAKMDDLTERRESKQPLEYPSCGSVFQRPPGHFAGKLI
QDSNLQGHRIGGVEVSTKHAGFMVNVDNGTATDYENLIHYVQKTVKEKFGIELNREVRIIGEHPKES
>P61431 1.3.1.98~~~murB~~~UDP-N-acetylenolpyruvoylglucosamine reductase~~~
MINKDIYQALQQLIPNEKIKVDEPLKRYTYTKTGGNADFYITPTKNEEVQAVVKYAYQNEIPVTYLGNGSNIIIREGGIR
GIVISLLSLDHIEVSDDAIIAGSGAAIIDVSRVARDYALTGLEFACGIPGSIGGAVYMNAGAYGGEVKDCIDYALCVNEQ
GSLIKLTTKELELDYRNSIIQKEHLVVLEAAFTLAPGKMTEIQAKMDDLTERRESKQPLEYPSCGSVFQRPPGHFAGKLI
QDSNLQGHRIGGVEVSTKHAGFMVNVDNGTATDYENLIHYVQKTVKEKFGIELNREVRIIGEHPKES
>Q5SJC8 1.3.1.98~~~murB~~~UDP-N-acetylenolpyruvoylglucosamine reductase~~~COG0812
MRVERVLLKDYTTLGVGGPAELWTVETREELKRATEAPYRVLGNGSNLLVLDEGVPERVIRLAGEFQTYDLKGWVGAGTL
LPLLVQEAARAGLSGLEGLLGIPAQVGGAVKMNAGTRFGEMADALEAVEVFHDGAFHVYCPEELGFGYRKSHLPPGGIVT
RVRLKLKERPKEEILRRMAEVDRARKGQPKRKSAGCAFKNPPGQSAGRLIDERGLKGLRVGDAMISLEHGNFIVNLGQAR
AKDVLELVRRVQEELPLELEWEVWP
>Q9KV40 1.3.1.98~~~murB~~~UDP-N-acetylenolpyruvoylglucosamine reductase~~~COG0812
MQIQLGANLKPYHTFGIEQLAAQLVVAESIDDLKALYCSAEWASLPKLIIGKGSNMLFTCHYTGMIVVNRLNGIEHQQDD
DYHRLHVAGGEDWPSLVSWCVEQGIGGLENLALIPGCAGSAPIQNIGAYGVEFKDVCDYVEYLCLETGTVKRLTMEECQF
GYRDSIFKHQLYQKAVVTAVGLKFAKAWQPIIQYGPLKDLSSDCAIHDVYQRVCATRMEKLPDPAVMGNAGSFFKNPVIS
QQAFARLQIEHPDVVAYPAEQGVKVAAGWLIDQAGLKGHQIGGAKVHPKQALVIVNTGDASAQDVLMLAADIQQRVFNCY
GIELEHEVRFIGESEETNLKQWMSEQA
>B7GV74 6.3.2.8~~~murC~~~UDP-N-acetylmuramate--L-alanine ligase~~~
MSPTTAANQAKKLIKVPEMRRIKHIHFVGIGGAGMCGIAEVLANQGYKISGSDIKASKTTQQLEENGIKVYIGHEAENIK
NANVLVVSTAIDPENPEVKAAIEQRIPIVRRAEMLGELMRYRHGIAVAGTHGKTTTTSLLTTMLAEENLDPTYVIGGLLN
STGVNAALGESRFIVAEADESDASFLYLQPMAAIVTNIDADHMDTYEGSFDKLKDTFVQFLHNLPFYGLAVVCGDDANIR
EILPRVGRPVITYGFNEDNDIRAIDVEQDGMRSHFTVLRKGREPLRLTINQPGLHNVLNALAAIGVATDEGVSDEAISRA
LKGFSGVGRRFQVQGEFELGEGNVKLVDDYGHHPKEVEATIKAARQSHPDRRLVMLFQPHRYSRTRDCFDDFIEVLSQVD
QLLLLEVYPAGEKPIVGADSRTLARSIRLRGQVEPILIDPVEGNLQNIMQNVLQPNDLLLTQGAGNVGAISVELAQHHLY
VK
>P17952 6.3.2.8~~~murC~~~UDP-N-acetylmuramate--L-alanine ligase~~~COG0773
MNTQQLAKLRSIVPEMRRVRHIHFVGIGGAGMGGIAEVLANEGYQISGSDLAPNPVTQQLMNLGATIYFNHRPENVRDAS
VVVVSSAISADNPEIVAAHEARIPVIRRAEMLAELMRFRHGIAIAGTHGKTTTTAMVSSIYAEAGLDPTFVNGGLVKAAG
VHARLGHGRYLIAEADESDASFLHLQPMVAIVTNIEADHMDTYQGDFENLKQTFINFLHNLPFYGRAVMCVDDPVIRELL
PRVGRQTTTYGFSEDADVRVEDYQQIGPQGHFTLLRQDKEPMRVTLNAPGRHNALNAAAAVAVATEEGIDDEAILRALES
FQGTGRRFDFLGEFPLEPVNGKSGTAMLVDDYGHHPTEVDATIKAARAGWPDKNLVMLFQPHRFTRTRDLYDDFANVLTQ
VDTLLMLEVYPAGEAPIPGADSRSLCRTIRGRGKIDPILVPDPARVAEMLAPVLTGNDLILVQGAGNIGKIARSLAEIKL
KPQTPEEEQHD
>P45066 6.3.2.8~~~murC~~~UDP-N-acetylmuramate--L-alanine ligase~~~COG0773
MKHSHEEIRKIIPEMRRVQQIHFIGIGGAGMSGIAEILLNEGYQISGSDIADGVVTQRLAQAGAKIYIGHAEEHIEGASV
VVVSSAIKDDNPELVTSKQKRIPVIQRAQMLAEIMRFRHGIAVAGTHGKTTTTAMISMIYTQAKLDPTFVNGGLVKSAGK
NAHLGASRYLIAEADESDASFLHLQPMVSVVTNMEPDHMDTYEGDFEKMKATYVKFLHNLPFYGLAVMCADDPVLMELVP
KVGRQVITYGFSEQADYRIEDYEQTGFQGHYTVICPNNERINVLLNVPGKHNALNATAALAVAKEEGIANEAILEALADF
QGAGRRFDQLGEFIRPNGKVRLVDDYGHHPTEVGVTIKAAREGWGDKRIVMIFQPHRYSRTRDLFDDFVQVLSQVDALIM
LDVYAAGEAPIVGADSKSLCRSIRNLGKVDPILVSDTSQLGDVLDQIIQDGDLILAQGAGSVSKISRGLAESWKN
>P65473 6.3.2.8~~~murC~~~UDP-N-acetylmuramate--L-alanine ligase~~~
MSTEQLPPDLRRVHMVGIGGAGMSGIARILLDRGGLVSGSDAKESRGVHALRARGALIRIGHDASSLDLLPGGATAVVTT
HAAIPKTNPELVEARRRGIPVVLRPAVLAKLMAGRTTLMVTGTHGKTTTTSMLIVALQHCGLDPSFAVGGELGEAGTNAH
HGSGDCFVAEADESDGSLLQYTPHVAVITNIESDHLDFYGSVEAYVAVFDSFVERIVPGGALVVCTDDPGGAALAQRATE
LGIRVLRYGSVPGETMAATLVSWQQQGVGAVAHIRLASELATAQGPRVMRLSVPGRHMALNALGALLAAVQIGAPADEVL
DGLAGFEGVRRRFELVGTCGVGKASVRVFDDYAHHPTEISATLAAARMVLEQGDGGRCMVVFQPHLYSRTKAFAAEFGRA
LNAADEVFVLDVYGAREQPLAGVSGASVAEHVTVPMRYVPDFSAVAQQVAAAASPGDVIVTMGAGDVTLLGPEILTALRV
RANRSAPGRPGVLG
>P9WJL7 6.3.2.8~~~murC~~~UDP-N-acetylmuramate--L-alanine ligase~~~COG0773
MSTEQLPPDLRRVHMVGIGGAGMSGIARILLDRGGLVSGSDAKESRGVHALRARGALIRIGHDASSLDLLPGGATAVVTT
HAAIPKTNPELVEARRRGIPVVLRPAVLAKLMAGRTTLMVTGTHGKTTTTSMLIVALQHCGLDPSFAVGGELGEAGTNAH
HGSGDCFVAEADESDGSLLQYTPHVAVITNIESDHLDFYGSVEAYVAVFDSFVERIVPGGALVVCTDDPGGAALAQRATE
LGIRVLRYGSVPGETMAATLVSWQQQGVGAVAHIRLASELATAQGPRVMRLSVPGRHMALNALGALLAAVQIGAPADEVL
DGLAGFEGVRRRFELVGTCGVGKASVRVFDDYAHHPTEISATLAAARMVLEQGDGGRCMVVFQPHLYSRTKAFAAEFGRA
LNAADEVFVLDVYGAREQPLAGVSGASVAEHVTVPMRYVPDFSAVAQQVAAAASPGDVIVTMGAGDVTLLGPEILTALRV
RANRSAPGRPGVLG
>Q9HW02 6.3.2.8~~~murC~~~UDP-N-acetylmuramate--L-alanine ligase~~~
MVKEPNGVTRTMRRIRRIHFVGIGGAGMCGIAEVLLNLGYEVSGSDLKASAVTERLEKFGAQIFIGHQAENADGADVLVV
SSAINRANPEVASALERRIPVVPRAEMLAELMRYRHGIAVAGTHGKTTTTSLIASVFAAGGLDPTFVIGGRLNAAGTNAQ
LGASRYLVAEADESDASFLHLQPMVAVVTNIDADHMATYGGDFNKLKKTFVEFLHNLPFYGLAVMCVDDPVVREILPQIA
RPTVTYGLSEDADVRAINIRQEGMRTWFTVLRPEREPLDVSVNMPGLHNVLNSLATIVIATDEGISDEAIVQGLSGFQGV
GRRFQVYGELQVEGGSVMLVDDYGHHPREVAAVIKAIRGGWPERRLVMVYQPHRYTRTRDLYEDFVQVLGEANVLLLMEV
YPAGEEPIPGADSRQLCHSIRQRGQLDPIYFERDADLAPLVKPLLRAGDILLCQGAGDVGGLAPQLIKNPLFAGKGGKGA
>Q2FXJ0 6.3.2.8~~~murC~~~UDP-N-acetylmuramate--L-alanine ligase~~~COG0773
MTHYHFVGIKGSGMSSLAQIMHDLGHEVQGSDIENYVFTEVALRNKGIKILPFDANNIKEDMVVIQGNAFASSHEEIVRA
HQLKLDVVSYNDFLGQIIDQYTSVAVTGAHGKTSTTGLLSHVMNGDKKTSFLIGDGTGMGLPESDYFAFEACEYRRHFLS
YKPDYAIMTNIDFDHPDYFKDINDVFDAFQEMAHNVKKGIIAWGDDEHLRKIEADVPIYYYGFKDSDDIYAQNIQITDKG
TAFDVYVDGEFYDHFLSPQYGDHTVLNALAVIAISYLEKLDVTNIKEALETFGGVKRRFNETTIANQVIVDDYAHHPREI
SATIETARKKYPHKEVVAVFQPHTFSRTQAFLNEFAESLSKADRVFLCEIFGSIRENTGALTIQDLIDKIEGASLINEDS
INVLEQFDNAVILFMGAGDIQKLQNAYLDKLGMKNAF
>P65475 6.3.2.8~~~murC~~~UDP-N-acetylmuramate--L-alanine ligase~~~
MTHYHFVGIKGSGMSSLAQIMHDLGHEVQGSDIENYVFTEVALRNKGIKILPFDANNIKEDMVVIQGNAFASSHEEIVRA
HQLKLDVVSYNDFLGQIIDQYTSVAVTGAHGKTSTTGLLSHVMNGDKKTSFLIGDGTGMGLPESDYFAFEACEYRRHFLS
YKPDYAIMTNIDFDHPDYFKDINDVFDAFQEMAHNVKKGIIAWGDDEHLRKIEADVPIYYYGFKDSDDIYAQNIQITDKG
TAFDVYVDGEFYDHFLSPQYGDHTVLNALAVIAISYLEKLDVTNIKEALETFGGVKRRFNETTIANQVIVDDYAHHPREI
SATIETARKKYPHKEVVAVFQPHTFSRTQAFLNEFAESLSKADRVFLCEIFGSIRENTGALTIQDLIDKIEGASLINEDS
INVLEQFDNAVVLFMGAGDIQKLQNAYLDKLGMKNAF
>Q97PS8 6.3.2.8~~~murC~~~UDP-N-acetylmuramate--L-alanine ligase~~~COG0773
MSKTYHFIGIKGSGMSALALMLHQMGHKVQGSDVEKYYFTQRGLEQAGITILPFDEKNLDGDMEIIAGNAFRPDNNVEIA
YADQNGISYKRYHEFLGSFMRDFVSMGVAGAHGKTSTTGMLSHVLSHITDTSFLIGDGTGRGSANAKYFVFESDEYERHF
MPYHPEYSIITNIDFDHPDYFTSLEDVFNAFNDYAKQITKGLFVYGEDAELRKITSDAPIYYYGFEAEGNDFVASDLLRS
ITGSTFTVHFRGQNLGQFHIPTFGRHNIMNATAVIGLLYTAGFDLNLVREHLKTFAGVKRRFTEKIVNDTVIIDDFAHHP
TEIIATLDAARQKYPSKEIVAVFQPHTFTRTIALLDDFAHALNQADAVYLAQIYGSAREVDHGDVKVEDLANKINKKHQV
ITVENVSPLLDHDNAVYVFMGAGDIQTYEYSFERLLSNLTSNVQ
>Q9WY73 6.3.2.8~~~murC~~~UDP-N-acetylmuramate--L-alanine ligase~~~COG0773
MKIHFVGIGGIGMSAVALHEFSNGNDVYGSNIEETERTAYLRKLGIPIFVPHSADNWYDPDLVIKTPAVRDDNPEIVRAR
MERVPIENRLHYFRDTLKREKKEEFAVTGTDGKTTTTAMVAHVLKHLRKSPTVFLGGIMDSLEHGNYEKGNGPVVYELDE
SEEFFSEFSPNYLIITNARGDHLENYGNSLTRYRSAFEKISRNTDLVVTFAEDELTSHLGDVTFGVKKGTYTLEMRSASR
AEQKAMVEKNGKRYLELKLKVPGFHNVLNALAVIALFDSLGYDLAPVLEALEEFRGVHRRFSIAFHDPETNIYVIDDYAH
TPDEIRNLLQTAKEVFENEKIVVIFQPHRYSRLEREDGNFAKALQLADEVVVTEVYDAFEEKKNGISGKMIWDSLKSLGK
EAYFVEKLPELEKVISVSENTVFLFVGAGDIIYSSRRFVERYQSSKSSPSRVLGSNK
>Q8ZIE8 6.3.2.8~~~murC~~~UDP-N-acetylmuramate--L-alanine ligase~~~COG0773
MNTQQLAKLRTIVPEMRRVRHIHFVGIGGAGMGGIAEVLANEGYQISGSDLAPNSVTQHLTALGAQIYFHHRPENVLDAS
VVVVSTAISADNPEIVAAREARIPVIRRAEMLAELMRYRHGIAVAGTHGKTTTTAMLSSIYAEAGLDPTFVNGGLVKAAG
THARLGSSRYLIAEADESDASFLHLQPMVAIVTNIEADHMDTYQGDFENLKQTFINFLHNLPFYGRAVMCIDDPVVRELL
PRVGRHITTYGFSDDADVQIASYRQEGPQGHFTLRRQDKPLIEVTLNAPGRHNALNAAAAVAVATEEGIEDEDILRALVG
FQGTGRRFDFLGNFPLAPVNGKEGSAMLVDDYGHHPTEVDATIKAARAGWPDKRIVMLFQPHRYTRTRDLYDDFANVLSQ
VDVLLMLDVYAAGEPPIPGADSRALCRTIRNRGKLDPILVPDSESAPEMLAQILNGEDLILVQGAGNIGKIARKLAEHKL
QPQLKDEEHHG
>C4RJF7 6.3.2.53~~~murD2~~~UDP-N-acetylmuramoyl-L-alanine--L-glutamate ligase~~~COG0771
MRLSDLRGRTVAVWGAGREGRAAVIAIAAHGPADLVAVDDSANFLALPWEGPLAEAAPLVTGEEGFARLAAAEVVVRSPG
VPNTHPWLVELRGRGVTVTQGSALWMADHARRTVGVTGSKGKSTTSSLISHLLTAVDRPNVFGGNIGVPLLDLPDADLYV
LELSSYQCADLTDSPRVAVVTALFPEHLDAHGGEREYYRDKLNLLAHGPQTIVVNGADPRLAAELGDRPAVRAGSPDTTH
VAPGPDGTPWFHLGDRPLFPRAVLPLVGRHNEGNLCVALAVLAALGVDVVARADALAVAVAGFQGLAHRLTEIADPSGLT
FVDDTLATSPYAAMHAIDAYEGRPVTVIVGGADRGLDYAPLREHLAEREITVLGIPDSGQRIVATLAGLPRVRAEVVDDL
VAAVRRARELTPADGVVLLSPAAPSYGRFRNFEHRSEVFAEAVRDTAGHPAR
>A4X981 6.3.2.53~~~murD2~~~UDP-N-acetylmuramoyl-L-alanine--L-glutamate ligase~~~COG0771
MRLSDLRGRKVAVWGTGREGRAAVVAIAAHGPADLVAVDDGGSTVSPPWDGFLATAAPLVTGDAGAQRLAAADVVVRSPG
VPNTHPWLAELWRRQVPVTQGTALWMADHAARTVGVTGSKGKSTTSSLISHLLAAVDQPNVFGGNIGVPTLDLPAADLYV
LELSSYQCSDLTDSPRVAVVTALFPEHLDAHGGEREYYRDKLNLLAHGPETVVVNGADPRLAAELGDRPVVRAGTPDTTH
VAGGPDGTPWFHLGDQPLFPRAVLPLVGRHNEGNLCVALAVLDVLGVDVLARRDTLAVAVAGFQGLAHRLTEIVDPSGLT
FVDDTLATSPYAAMHAIDAYDGRALTVIVGGADRGLDYTPLRDHLAEREITVIGVPDSGARIVAALDGLPKVRCDVTGDL
VEAVRLARRVTPAGGVVLLSPAAPSYGQFRNFEHRSEVFAQAVRDTAG
>Q2P5V2 6.3.2.53~~~murD2~~~UDP-N-acetylmuramoyl-L-alanine--L-glutamate ligase~~~
MRISQFEGKAVALWGWGREGRGAYRALRAQLPTQSLTMFCNAEEVRELESLADAALHVETDASAQALGRFEIVVKSPGIS
PYRAEALAAAAQGTQFIGGTALWFAEHAQPDGSVPGAICVTGTKGKSTTTALLAHLLRVAGHRTALVGNIGQPLLEVLAP
QPPPAYWAIELSSYQTGDVGRSGARPELAVVLNLFPEHLDWHGDEARYVRDKLSLVTEGRPRIVLLNAADPLLASLQLPD
SEVLWFNHPEGWHLRGDVVYRGEQAIFDSADVPLPGVHNRRNLCAVLAALEALGLDAEALAPAALSFRPLPNRLQVLGSV
DGISYVNDSISTTPYASLAALACFAQRRVALLVGGHDRGLDWHDFARHMAQQAPLEIVTMAANGPRIHALLAPLADAGRF
GLHAANDLEHAMQLARDALGGQGGVVLLSPGAPSFGAYSDYVARGRHFAQLAGFDPAAISAIPGLGVH
>B7H1N2 6.3.2.9~~~murD~~~UDP-N-acetylmuramoylalanine--D-glutamate ligase~~~
MLIQRGGLKVVAGLGISGVSAVNFLHEQGYQVAVTDSRPTPPGHDQIPAGVKTSFGQLDQELLLQAEEIILSPGLAPQLP
EIQAAIAKGISVVGDIQLLRRATDVPIVAITGSNAKSTVTTLIGLMAKDAGKKVAVGGNLGRPALDLLKDQPELLVLELS
SFQLETTSHLNAEVAVVLNMSEDHLDRHGNMLGYHQAKHRIFQGAKKVVFNRDDALSRPLVPDTTPMQSFGLNAPDLNQY
GVLRDADGTLWLARGLQRLIKSSDLYIQGMHNVANALACLALGEAIGLPMESMLETLKQFKGLEHRCEYVKTVHDVRYYN
DSKGTNVGATLAAIDGLGAAIEVKKGKVALILGGQGKGQDFGPLRSSIEKYAKVVVLIGEDAPVIEQAIQGATKILHAAT
LKEAVELCQRETQAEDVVLLSPACASFDMFKSYNDRGQQFVACVNSLV
>P14900 6.3.2.9~~~murD~~~UDP-N-acetylmuramoylalanine--D-glutamate ligase~~~COG0771
MADYQGKNVVIIGLGLTGLSCVDFFLARGVTPRVMDTRMTPPGLDKLPEAVERHTGSLNDEWLMAADLIVASPGIALAHP
SLSAAADAGIEIVGDIELFCREAQAPIVAITGSNGKSTVTTLVGEMAKAAGVNVGVGGNIGLPALMLLDDECELYVLELS
SFQLETTSSLQAVAATILNVTEDHMDRYPFGLQQYRAAKLRIYENAKVCVVNADDALTMPIRGADERCVSFGVNMGDYHL
NHQQGETWLRVKGEKVLNVKEMKLSGQHNYTNALAALALADAAGLPRASSLKALTTFTGLPHRFEVVLEHNGVRWINDSK
ATNVGSTEAALNGLHVDGTLHLLLGGDGKSADFSPLARYLNGDNVRLYCFGRDGAQLAALRPEVAEQTETMEQAMRLLAP
RVQPGDMVLLSPACASLDQFKNFEQRGNEFARLAKELG
>A5U4I2 6.3.2.9~~~murD~~~UDP-N-acetylmuramoylalanine--D-glutamate ligase~~~COG0771
MSGLPRSVPDVLDPLGPGAPVLVAGGRVTGQAVAAVLTRFGATPTVCDDDPVMLRPHAERGLPTVSSSDAVQQITGYALV
VASPGFSPATPLLAAAAAAGVPIWGDVELAWRLDAAGCYGPPRSWLVVTGTNGKTTTTSMLHAMLIAGGRRAVLCGNIGS
AVLDVLDEPAELLAVELSSFQLHWAPSLRPEAGAVLNIAEDHLDWHATMAEYTAAKARVLTGGVAVAGLDDSRAAALLDG
SPAQVRVGFRLGEPAARELGVRDAHLVDRAFSDDLTLLPVASIPVPGPVGVLDALAAAALARSVGVPAGAIADAVTSFRV
GRHRAEVVAVADGITYVDDSKATNPHAARASVLAYPRVVWIAGGLLKGASLHAEVAAMASRLVGAVLIGRDRAAVAEALS
RHAPDVPVVQVVAGEDTGMPATVEVPVACVLDVAKDDKAGETVGAAVMTAAVAAARRMAQPGDTVLLAPAGASFDQFTGY
ADRGEAFATAVRAVIR
>P9WJL5 6.3.2.9~~~murD~~~UDP-N-acetylmuramoylalanine--D-glutamate ligase~~~COG0771
MSGLPRSVPDVLDPLGPGAPVLVAGGRVTGQAVAAVLTRFGATPTVCDDDPVMLRPHAERGLPTVSSSDAVQQITGYALV
VASPGFSPATPLLAAAAAAGVPIWGDVELAWRLDAAGCYGPPRSWLVVTGTNGKTTTTSMLHAMLIAGGRRAVLCGNIGS
AVLDVLDEPAELLAVELSSFQLHWAPSLRPEAGAVLNIAEDHLDWHATMAEYTAAKARVLTGGVAVAGLDDSRAAALLDG
SPAQVRVGFRLGEPAARELGVRDAHLVDRAFSDDLTLLPVASIPVPGPVGVLDALAAAALARSVGVPAGAIADAVTSFRV
GRHRAEVVAVADGITYVDDSKATNPHAARASVLAYPRVVWIAGGLLKGASLHAEVAAMASRLVGAVLIGRDRAAVAEALS
RHAPDVPVVQVVAGEDTGMPATVEVPVACVLDVAKDDKAGETVGAAVMTAAVAAARRMAQPGDTVLLAPAGASFDQFTGY
ADRGEAFATAVRAVIR
>Q9HVZ9 6.3.2.9~~~murD~~~UDP-N-acetylmuramoylalanine--D-glutamate ligase~~~
MSLIASDHFRIVVGLGKSGMSLVRYLARRGLPFAVVDTRENPPELATLRAQYPQVEVRCGELDAEFLCSARELYVSPGLS
LRTPALVQAAAKGVRISGDIDLFAREAKAPIVAITGSNAKSTVTTLVGEMAVAADKRVAVGGNLGTPALDLLADDIELYV
LELSSFQLETCDRLNAEVATVLNVSEDHMDRYDGMADYHLAKHRIFRGARQVVVNRADALTRPLIADTVPCWSFGLNKPD
FKAFGLIEEDGQKWLAFQFDKLLPVGELKIRGAHNYSNALAALALGHAVGLPFDAMLGALKAFSGLAHRCQWVRERQGVS
YYDDSKATNVGAALAAIEGLGADIDGKLVLLAGGDGKGADFHDLREPVARFCRAVVLLGRDAGLIAQALGNAVPLVRVAT
LDEAVRQAAELAREGDAVLLSPACASLDMFKNFEERGRLFAKAVEELA
>Q2FZ92 6.3.2.9~~~murD~~~UDP-N-acetylmuramoylalanine--D-glutamate ligase~~~COG0771
MLNYTGLENKNVLVVGLAKSGYEAAKLLSKLGANVTVNDGKDLSQDAHAKDLESMGISVVSGSHPLTLLDNNPIIVKNPG
IPYTVSIIDEAVKRGLKILTEVELSYLISEAPIIAVTGTNGKTTVTSLIGDMFKKSRLTGRLSGNIGYVASKVAQEVKPT
DYLVTELSSFQLLGIEKYKPHIAIITNIYSAHLDYHENLENYQNAKKQIYKNQTEEDYLICNYHQRQVIESEELKAKTLY
FSTQQEVDGIYIKDGFIVYKGVRIINTEDLVLPGEHNLENILAAVLACILAGVPIKAIIDSLTTFSGIEHRLQYVGTNRT
NKYYNDSKATNTLATQFALNSFNQPIIWLCGGLDRGNEFDELIPYMENVRAMVVFGQTKAKFAKLGNSQGKSVIEANNVE
DAVDKVQDIIEPNDVVLLSPACASWDQYSTFEERGEKFIERFRAHLPSY
>P0A090 6.3.2.9~~~murD~~~UDP-N-acetylmuramoylalanine--D-glutamate ligase~~~
MLNYTGLENKNVLVVGLAKSGYEAAKLLSKLGANVTVNDGKDLSQDAHAKDLESMGISVVSGSHPLTLLDNNPIIVKNPG
IPYTVSIIDEAVKRGLKILTEVELSYLISEAPIIAVTGTNGKTTVTSLIGDMFKKSRLTGRLSGNIGYVASKVAQEVKPT
DYLVTELSSFQLLGIEKYKPHIAIITNIYSAHLDYHENLENYQNAKKQIYKNQTEEDYLICNYHQRQVIESEELKAKTLY
FSTQQEVDGIYIKDGFIVYKGVRIINTEDLVLPGEHNLENILAAVLACILAGVPIKAIIDSLTTFSGIEHRLQYVGTNRT
NKYYNDSKATNTLATQFALNSFNQPIIWLCGGLDRGNEFDELIPYMENVRAMVVFGQTKAKFAKLGNSQGKSVIEANNVE
DAVDKVQDIIEPNDVVLLSPACASWDQYSTFEERGEKFIERFRAHLPSY
>Q8E186 6.3.2.9~~~murD~~~UDP-N-acetylmuramoylalanine--D-glutamate ligase~~~
MKTITTFENKKVLVLGLARSGEAAARLLAKLGAIVTVNDGKPFDENPTAQSLLEEGIKVVCGSHPLELLDEDFCYMIKNP
GIPYNNPMVKKALEKQIPVLTEVELAYLVSESQLIGITGSNGKTTTTTMIAEVLNAGGQRGLLAGNIGFPASEVVQAAND
KDTLVMELSSFQLMGVKEFRPHIAVITNLMPTHLDYHGSFEDYVAAKWNIQNQMSSSDFLVLNFNQGISKELAKTTKATI
VPFSTTEKVDGAYVQDKQLFYKGENIMSVDDIGVPGSHNVENALATIAVAKLAGISNQVIRETLSNFGGVKHRLQSLGKV
HGISFYNDSKSTNILATQKALSGFDNTKVILIAGGLDRGNEFDELIPDITGLKHMVVLGESASRVKRAAQKAGVTYSDAL
DVRDAVHKAYEVAQQGDVILLSPANASWDMYKNFEVRGDEFIDTFESLRGE
>Q97RU8 6.3.2.9~~~murD~~~UDP-N-acetylmuramoylalanine--D-glutamate ligase~~~COG0771
MKVIDQFKNKKVLVLGLAKSGESAARLLDKLGAIVTVNDGKPFEDNPAAQSLLEEGIKVITGGHPLELLDEEFALMVKNP
GIPYNNPMIEKALAKGIPVLTEVELAYLISEAPIIGITGSNGKTTTTTMIGEVLTAAGQHGLLSGNIGYPASQVAQIASD
KDTLVMELSSFQLMGVQEFHPEIAVITNLMPTHIDYHGSFSEYVAAKWNIQNKMTAADFLVLNFNQDLAKDLTSKTEATV
VPFSTLEKVDGAYLEDGQLYFRGEVVMAANEIGVPGSHNVENALATIAVAKLRDVDNQTIKETLSAFGGVKHRLQFVDDI
KGVKFYNDSKSTNILATQKALSGFDNSKVVLIAGGLDRGNEFDELVPDITGLKKMVILGQSAERVKRAADKAGVAYVEAT
DIADATRKAYELATQGDVVLLSPANASWDMYANFEVRGDLFIDTVAELKE
>P0C0D7 6.3.2.9~~~murD~~~UDP-N-acetylmuramoylalanine--D-glutamate ligase~~~COG0771
MKVISNFQNKKILILGLAKSGEAAAKLLTKLGALVTVNDSKPFDQNPAAQALLEEGIKVICGSHPVELLDEDFEYMVKNP
GIPYDNPMVKRALAKEIPILTEVELAYFVSEAPIIGITGSNGKTTTTTMIADVLNAGGQSALLSGNIGYPASKVVQKAIA
GDTLVMELSSFQLVGVNAFRPHIAVITNLMPTHLDYHGSFEDYVAAKWMIQAQMTESDYLILNANQEISATLAKTTQATV
IPFSTQKVVDGAYLKDGILYFKEQAIIAATDLGVPGSHNIENALATIAVAKLSGIADDIIAQCLSHFGGVKHRLQRVGQI
KDITFYNDSKSTNILATQKALSGFDNSRLILIAGGLDRGNEFDDLVPDLLGLKQMIILGESAERMKRAANKAEVSYLEAR
NVAEATELAFKLAQTGDTILLSPANASWDMYPNFEVRGDEFLATFDCLRGDA
>Q9WY76 6.3.2.9~~~murD~~~UDP-N-acetylmuramoylalanine--D-glutamate ligase~~~COG0771
MKIGFLGFGKSNRSLLKYLLNHQEAKFFVSEAKTLDGETKKFLEEHSVEYEEGGHTEKLLDCDVVYVSPGIKPDTSMIEL
LSSRGVKLSTELQFFLDNVDPKKVVGITGTDGKSTATALMYHVLSGRGFKTFLGGNFGTPAVEALEGEYDYYVLEMSSFQ
LFWSERPYLSNFLVLNISEDHLDWHSSFKEYVDSKLKPAFLQTEGDLFVYNKHIERLRNLEGVRSRKIPFWTDENFATEK
ELIVRGKKYTLPGNYPYQMRENILAVSVLYMEMFNELESFLELLRDFKPLPHRMEYLGQIDGRHFYNDSKATSTHAVLGA
LSNFDKVVLIMCGIGKKENYSLFVEKASPKLKHLIMFGEISKELAPFVGKIPHSIVENMEEAFEKAMEVSEKGDVILLSP
GGASFDMYENYAKRGEHFREIFKRHGGDEV
>Q819Q0 6.3.2.13~~~murE~~~UDP-N-acetylmuramoyl-L-alanyl-D-glutamate--2,6-diaminopimelate ligase~~~
MKLHTLVSCLHDFPVVPKENPEITSIEADSRKVKEGSLFVCMKGYTVDSHDFAKQAAAQGAAAIVAERPIDVDVPVVLVK
NTFRSLAVLADYFYGQPTHKLHLIGITGTNGKTTTSHIMDEIMRAHGHKTGLIGTINMKIGDETFEVKNTTPDALTLQQT
FSKMVEQGVDSTVMEVSSHALDLGRVHGCDYDVAVFTNLTQDHLDYHKTMEEYKHAKGLLFAQLGNSYNHNREKYAVLNS
DDNVTEEYMRSTAATVVTYGIDTTSDIMAKNIVMTSGGTTFTLVTPYESVNVTMKLIGKFNVYNVLAATAAGLVSGVKLE
TIIAVIKDLAGVPGRFEVVDGGQNYTVIVDYAHTPDSLENVLKTAKQFAKGDVYCIVGCGGDRDRTKRPIMASVATKYAT
HAIYTSDNPRSEDPAAILDDMVHGASGKNYEMIIDRKEAIHHAIAKAKADDIIIIAGKGHETYQIIGKEVHHFDDREVAK
EAITGRLNNEE
>P22188 6.3.2.13~~~murE~~~UDP-N-acetylmuramoyl-L-alanyl-D-glutamate--2,6-diaminopimelate ligase~~~COG0769
MADRNLRDLLAPWVPDAPSRALREMTLDSRVAAAGDLFVAVVGHQADGRRYIPQAIAQGVAAIIAEAKDEATDGEIREMH
GVPVIYLSQLNERLSALAGRFYHEPSDNLRLVGVTGTNGKTTTTQLLAQWSQLLGEISAVMGTVGNGLLGKVIPTENTTG
SAVDVQHELAGLVDQGATFCAMEVSSHGLVQHRVAALKFAASVFTNLSRDHLDYHGDMEHYEAAKWLLYSEHHCGQAIIN
ADDEVGRRWLAKLPDAVAVSMEDHINPNCHGRWLKATEVNYHDSGATIRFSSSWGDGEIESHLMGAFNVSNLLLALATLL
ALGYPLADLLKTAARLQPVCGRMEVFTAPGKPTVVVDYAHTPDALEKALQAARLHCAGKLWCVFGCGGDRDKGKRPLMGA
IAEEFADVAVVTDDNPRTEEPRAIINDILAGMLDAGHAKVMEGRAEAVTCAVMQAKENDVVLVAGKGHEDYQIVGNQRLD
YSDRVTVARLLGVIA
>P9WJL3 6.3.2.13~~~murE~~~UDP-N-acetylmuramoyl-L-alanyl-D-glutamate--2,6-diaminopimelate ligase~~~COG0769
MSSLARGISRRRTEVATQVEAAPTGLRPNAVVGVRLAALADQVGAALAEGPAQRAVTEDRTVTGVTLRAQDVSPGDLFAA
LTGSTTHGARHVGDAIARGAVAVLTDPAGVAEIAGRAAVPVLVHPAPRGVLGGLAATVYGHPSERLTVIGITGTSGKTTT
TYLVEAGLRAAGRVAGLIGTIGIRVGGADLPSALTTPEAPTLQAMLAAMVERGVDTVVMEVSSHALALGRVDGTRFAVGA
FTNLSRDHLDFHPSMADYFEAKASLFDPDSALRARTAVVCIDDDAGRAMAARAADAITVSAADRPAHWRATDVAPTDAGG
QQFTAIDPAGVGHHIGIRLPGRYNVANCLVALAILDTVGVSPEQAVPGLREIRVPGRLEQIDRGQGFLALVDYAHKPEAL
RSVLTTLAHPDRRLAVVFGAGGDRDPGKRAPMGRIAAQLADLVVVTDDNPRDEDPTAIRREILAGAAEVGGDAQVVEIAD
RRDAIRHAVAWARPGDVVLIAGKGHETGQRGGGRVRPFDDRVELAAALEALERRA
>Q2FZP6 6.3.2.7~~~murE~~~UDP-N-acetylmuramoyl-L-alanyl-D-glutamate--L-lysine ligase~~~COG0769
MDASTLFKKVKVKRVLGSLEQQIDDITTDSRTAREGSIFVASVGYTVDSHKFCQNVADQGCKLVVVNKEQSLPANVTQVV
VPDTLRVASILAHTLYDYPSHQLVTFGVTGTNGKTSIATMIHLIQRKLQKNSAYLGTNGFQINETKTKGANTTPETVSLT
KKIKEAVDAGAESMTLEVSSHGLVLGRLRGVEFDVAIFSNLTQDHLDFHGTMEAYGHAKSLLFSQLGEDLSKEKYVVLNN
DDSFSEYLRTVTPYEVFSYGIDEEAQFMAKNIQESLQGVSFDFVTPFGTYPVKSPYVGKFNISNIMAAMIAVWSKGTSLE
TIIKAVENLEPVEGRLEVLDPSLPIDLIIDYAHTADGMNKLIDAVQPFVKQKLIFLVGMAGERDLTKTPEMGRVACRADY
VIFTPDNPANDDPKMLTAELAKGATHQNYIEFDDRAEGIKHAIDIAEPGDTVVLASKGREPYQIMPGHIKVPHRDDLIGL
EAAYKKFGGGPVD
>P65480 6.3.2.7~~~murE~~~UDP-N-acetylmuramoyl-L-alanyl-D-glutamate--L-lysine ligase~~~
MDASTLFKKVKVKRVLGSLEQQIDDITTDSRTAREGSIFVASVGYTVDSHKFCQNVADQGCKLVVVNKEQSLPANVTQVV
VPDTLRVASILAHTLYDYPSHQLVTFGVTGTNGKTSIATMIHLIQRKLQKNSAYLGTNGFQINETKTKGANTTPETVSLT
KKIKEAVDAGAESMTLEVSSHGLVLGRLRGVEFDVAIFSNLTQDHLDFHGTMEAYGHAKSLLFSQLGEDLSKEKYVVLNN
DDSFSEYLRTVTPYEVFSYGIDEEAQFMAKNIQESLQGVSFDFVTPFGTYPVKSPYVGKFNISNIMAAMIAVWSKGTSLE
TIIKAVENLEPVEGRLEVLDPSLPIDLIIDYAHTADGMNKLIDAVQPFVKQKLIFLVGMAGERDLTKTPEMGRVACRADY
VIFTPDNPANDDPKMLTAELAKGATHQNYIEFDDRAEGIKHAIDIAEPGDTVVLASKGREPYQIMPGHIKVPHRDDLIGL
EAAYKKFGGGPVDQ
>Q9A196 6.3.2.7~~~murE~~~UDP-N-acetylmuramoyl-L-alanyl-D-glutamate--L-lysine ligase~~~
MITIEQLLDILKKDHNFREVLDADGYHYHYQGFSFERLSYDSRQVDGKTLFFAKGATFKADYLKEAITNGLQLYISEVDY
ELGIPVVLVTDIKKAMSLIAMAFYGNPQEKLKLLAFTGTKGKTTAAYFAYHMLKESYKPAMFSTMNTTLDGKTFFKSQLT
TPESLDLFAMMAECVTNGMTHLIMEVSSQAYLVDRVYGLTFDVGVFLNISPDHIGPIEHPTFEDYFYHKRLLMENSRAVV
INSGMDHFSFLADQVADQEHVFYGPLSDNQITTSQAFSFEAKGQLAGHYDIQLIGHFNQENAMAAGLACLRLGASLADIQ
KGIAKTRVPGRMEVLTMTNHAKVFVDYAHNGDSLEKLLSVVEEHQTGKLMLILGAPGNKGESRRADFGRVIHQHPNLTVI
LTADDPNFEDPEDISKEIASHIARPVEIISDREQAIQKAMSLCQGAKDAVIIAGKGADAYQIVKGQQVAYAGDLAIAKHY
L
>Q97PS1 6.3.2.7~~~murE~~~UDP-N-acetylmuramoyl-L-alanyl-D-glutamate--L-lysine ligase~~~COG0769
MIKIETVLDILKKDGLFREIIDQGHYHYNYSKVIFDSISYDSRKVTEDTLFFAKGAAFKKEYLLSAITQGLAWYVAEKDY
EVGIPVIIVNDIKKAMSLIAMEFYGNPQEKLKLLAFTGTKGKTTAAYFAYNILSQGHRPAMLSTMNTTLDGETFFKSALT
TPESIDLFDMMNQAVQNDRTHLIMEVSSQAYLVKRVYGLTFDVGVFLNISPDHIGPIEHPSFEDYFYHKRLLMEKSRAVI
INSDMDHFSVLKEQVEDQDHDFYGSQFDNQIENSKAFSFSATGKLAGDYDIQLIGNFNQENAVAAGLACLRLGASLEDIK
KGIAATRVPGRMEVLTQKNGAKVFIDYAHNGDSLKKLINVVETHQTGKIALVLGSTGNKGESRRKDFGLLLNQHPEIQVF
LTADDPNYEDPMAIADEISSYINHPVEKIADRQEAIKAAMAITNHELDAVIIAGKGADCYQIIQGKKESYPGDTAVAENY
L
>Q9WY79 6.3.2.37~~~murE~~~UDP-N-acetylmuramoyl-L-alanyl-D-glutamate--LD-lysine ligase~~~COG0769
MNISTIVSNLKDLILEVRAPYDLEITGVSNHSSKVKKGDLFICRRGEKFDSHEIIPEVMEKGAVAVVVEREIDLDFPYIQ
VFDSRYFEAKVASLFFEDPWKDVLTFGVTGTNGKTTTTMMIYHMLTSLGERGSVLTTAVKRILGNSYYDDITTPDAITIL
SAMKENREGGGKFFALEVSSHALVQQRVEGVRFDVGIFTNISRDHLDFHGTFENYLKAKLHLFDLLKDDGVAVLNESLAD
AFNRKSRKITFGTSKNADYRLGNIEVSWEGTQFVLETPDGLLKVFTRAIGDFNAYNAAAAIAALHQLGYDPKDLASSLET
FTGVEGRFEVVRGAKKIGLNVVVDFAHSPDALEKLLKNVRKISQGRVIVVFGAGGNSDRGKRPMMSEVASKLADVVILTT
DDPRGEDPEQIMEDLIKGIDKRKPYLVLFDRREAIETALTIANRGDSVVIAGRGHERYQIIDEEKKVPFQDREVVEEIIR
DKLKGRKYAQ
>P11880 6.3.2.10~~~murF~~~UDP-N-acetylmuramoyl-tripeptide--D-alanyl-D-alanine ligase~~~COG0770
MISVTLSQLTDILNGELQGADITLDAVTTDTRKLTPGCLFVALKGERFDAHDFADQAKAGGAGALLVSRPLDIDLPQLIV
KDTRLAFGELAAWVRQQVPARVVALTGSSGKTSVKEMTAAILSQCGNTLYTAGNLNNDIGVPMTLLRLTPEYDYAVIELG
ANHQGEIAWTVSLTRPEAALVNNLAAAHLEGFGSLAGVAKAKGEIFSGLPENGIAIMNADNNDWLNWQSVIGSRKVWRFS
PNAANSDFTATNIHVTSHGTEFTLQTPTGSVDVLLPLPGRHNIANALAAAALSMSVGATLDAIKAGLANLKAVPGRLFPI
QLAENQLLLDDSYNANVGSMTAAVQVLAEMPGYRVLVVGDMAELGAESEACHVQVGEAAKAAGIDRVLSVGKQSHAISTA
SGVGEHFADKTALITRLKLLIAEQQVITILVKGSRSAAMEEVVRALQENGTC
>P9WJL1 6.3.2.10~~~murF~~~UDP-N-acetylmuramoyl-tripeptide--D-alanyl-D-alanine ligase~~~COG0770
MIELTVAQIAEIVGGAVADISPQDAAHRRVTGTVEFDSRAIGPGGLFLALPGARADGHDHAASAVAAGAAVVLAARPVGV
PAIVVPPVAAPNVLAGVLEHDNDGSGAAVLAALAKLATAVAAQLVAGGLTIIGITGSSGKTSTKDLMAAVLAPLGEVVAP
PGSFNNELGHPWTVLRATRRTDYLILEMAARHHGNIAALAEIAPPSIGVVLNVGTAHLGEFGSREVIAQTKAELPQAVPH
SGAVVLNADDPAVAAMAKLTAARVVRVSRDNTGDVWAGPVSLDELARPRFTLHAHDAQAEVRLGVCGDHQVTNALCAAAV
ALECGASVEQVAAALTAAPPVSRHRMQVTTRGDGVTVIDDAYNANPDSMRAGLQALAWIAHQPEATRRSWAVLGEMAELG
EDAIAEHDRIGRLAVRLDVSRLVVVGTGRSISAMHHGAVLEGAWGSGEATADHGADRTAVNVADGDAALALLRAELRPGD
VVLVKASNAAGLGAVADALVADDTCGSVRP
>Q2FWH4 6.3.2.10~~~murF~~~UDP-N-acetylmuramoyl-tripeptide--D-alanyl-D-alanine ligase~~~COG0770
MINVTLKQIQSWIPCEIEDQFLNQEINGVTIDSRAISKNMLFIPFKGENVDGHRFVSKALQDGAGAAFYQRGTPIDENVS
GPIIWVEDTLTALQQLAQAYLRHVNPKVIAVTGSNGKTTTKDMIESVLHTEFKVKKTQGNYNNEIGLPLTILELDNDTEI
SILEMGMSGFHEIEFLSNLAQPDIAVITNIGESHMQDLGSREGIAKAKSEITIGLKDNGTFIYDGDEPLLKPHVKEVENA
KCISIGVATDNALVCSVDDRDTTGISFTINNKEHYDLPILGKHNMKNATIAIAVGHELGLTYNTIYQNLKNVSLTGMRME
QHTLENDITVINDAYNASPTSMRAAIDTLSTLTGRRILILGDVLELGENSKEMHIGVGNYLEEKHIDVLYTFGNEAKYIY
DSGQQHVEKAQHFNSKDDMIEVLINDLKAHDRVLVKGSRGMKLEEVVNALIS
>A3M9Y1 2.4.1.227~~~murG~~~UDP-N-acetylglucosamine--N-acetylmuramyl-(pentapeptide) pyrophosphoryl-undecaprenol N-acetylglucosamine transferase~~~
MTDSQQSKPKHVMMMAAGTGGHVFPALAVAKQLQQQGCQVSWLATPTGMENRLLKDQNIPIYQIDIQGVRGNGVIRKLAA
PFKILKATFSAMRYMKQLKVDAVAGFGGYVAGPGGLAARLLGIPVLIHEQNAVAGFTNAQLSRVAKVVCEAFPNTFPASE
KVVTTGNPVRREITDILSPKWRYDEREQADKPLNILIVGGSLGAKALNERLPPALKQLEVPLNIFHQCGQQQVEATQALY
ADAPANLTIQVLPFIEDMAKAYSEADLIICRAGALTVTEVATAGVAAVFVPLPIAVDDHQTANAKFLADIGAAKICQQST
MTPEVLNQLFTTLMNRQLLTEMAVKARQHAQPNATQHVVDLIQKM
>P17443 2.4.1.227~~~murG~~~UDP-N-acetylglucosamine--N-acetylmuramyl-(pentapeptide) pyrophosphoryl-undecaprenol N-acetylglucosamine transferase~~~COG0707
MSGQGKRLMVMAGGTGGHVFPGLAVAHHLMAQGWQVRWLGTADRMEADLVPKHGIEIDFIRISGLRGKGIKALIAAPLRI
FNAWRQARAIMKAYKPDVVLGMGGYVSGPGGLAAWSLGIPVVLHEQNGIAGLTNKWLAKIATKVMQAFPGAFPNAEVVGN
PVRTDVLALPLPQQRLAGREGPVRVLVVGGSQGARILNQTMPQVAAKLGDSVTIWHQSGKGSQQSVEQAYAEAGQPQHKV
TEFIDDMAAAYAWADVVVCRSGALTVSEIAAAGLPALFVPFQHKDRQQYWNALPLEKAGAAKIIEQPQLSVDAVANTLAG
WSRETLLTMAERARAASIPDATERVANEVSRVARA
>P9WJK9 2.4.1.227~~~murG~~~UDP-N-acetylglucosamine--N-acetylmuramyl-(pentapeptide) pyrophosphoryl-undecaprenol N-acetylglucosamine transferase~~~COG0707
MKDTVSQPAGGRGATAPRPADAASPSCGSSPSADSVSVVLAGGGTAGHVEPAMAVADALVALDPRVRITALGTLRGLETR
LVPQRGYHLELITAVPMPRKPGGDLARLPSRVWRAVREARDVLDDVDADVVVGFGGYVALPAYLAARGLPLPPRRRRRIP
VVIHEANARAGLANRVGAHTADRVLSAVPDSGLRRAEVVGVPVRASIAALDRAVLRAEARAHFGFPDDARVLLVFGGSQG
AVSLNRAVSGAAADLAAAGVCVLHAHGPQNVLELRRRAQGDPPYVAVPYLDRMELAYAAADLVICRAGAMTVAEVSAVGL
PAIYVPLPIGNGEQRLNALPVVNAGGGMVVADAALTPELVARQVAGLLTDPARLAAMTAAAARVGHRDAAGQVARAALAV
ATGAGARTTT
>Q9HW01 2.4.1.227~~~murG~~~UDP-N-acetylglucosamine--N-acetylmuramyl-(pentapeptide) pyrophosphoryl-undecaprenol N-acetylglucosamine transferase~~~
MKGNVLIMAGGTGGHVFPALACAREFQARGYAVHWLGTPRGIENDLVPKAGLPLHLIQVSGLRGKGLKSLVKAPLELLKS
LFQALRVIRQLRPVCVLGLGGYVTGPGGLAARLNGVPLVIHEQNAVAGTANRSLAPIARRVCEAFPDTFPASDKRLTTGN
PVRGELFLDAHARAPLTGRRVNLLVLGGSLGAEPLNKLLPEALAQVPLEIRPAIRHQAGRQHAEITAERYRTVAVEADVA
PFISDMAAAYAWADLVICRAGALTVSELTAAGLPAFLVPLPHAIDDHQTRNAEFLVRSGAGRLLPQKSTGAAELAAQLSE
VLMHPETLRSMADQARSLAKPEATRTVVDACLEVARG
>P65482 2.4.1.227~~~murG~~~UDP-N-acetylglucosamine--N-acetylmuramyl-(pentapeptide) pyrophosphoryl-undecaprenol N-acetylglucosamine transferase~~~
MTKIAFTGGGTVGHVSVNLSLIPTALSQGYEALYIGSKNGIEREMIESQLPEIKYYPISSGKLRRYISLENAKDVFKVLK
GILDARKVLKKEKPDLLFSKGGFVSVPVVIAAKSLNIPTIIHESDLTPGLANKIALKFAKKIYTTFEETLNYLPKEKADF
IGATIREDLKNGNAHNGYQLTGFNENKKVLLVMGGSLGSKKLNSIIRENLDALLQQYQVIHLTGKGLKDAQVKKSGYIQY
EFVKEDLTDLLAITDTVISRAGSNAIYEFLTLRIPMLLVPLGLDQSRGDQIDNANHFADKGYAKTIDEEQLTAQILLQEL
NEMEQERTRIINNMKSYEQSYTKEALFDKMIKDALN
>P94556 5.1.1.3~~~racE~~~Glutamate racemase 1~~~COG0796
MLEQPIGVIDSGVGGLTVAKEIMRQLPKENIIYVGDTKRCPYGPRPEEEVLQYTWELTNYLLENHHIKMLVIACNTATAI
ALDDIQRSVGIPVVGVIQPGARAAIKVTDNQHIGVIGTENTIKSNAYEEALLALNPDLKVENLACPLLVPFVESGKFLDK
TADEIVKTSLYPLKDTSIDSLILGCTHYPILKEAIQRYMGEHVNIISSGDETAREVSTILSYKGLLNQSPIAPDHQFLTT
GARDQFAKIADDWFGHEVGHVECISLQEPIKR
>O05412 5.1.1.3~~~yrpC~~~Glutamate racemase 2~~~COG0796
MKIGFFDSGIGGMTVLYEAIKVLPYEDYIFYADTLNVPYGEKSKGKVKEYIFNAAEFLASQNIKALVIACNTATSIAIED
LRRNFDFPIIGIEPAVKPAINKCTEERKRVLVVATNLTLKEEKFHNLVKEIDHHDLVDCLALPGLVEFAENFDFSEDKII
KYLKNELSSFDLKQYGTIVLGCTHFPFFKNSFEKLFGIKVDMISGSVGTAKQLKKVLADRNQLGKGSGSITFFNSGHKIV
DQEVISKYKRLFEILDETQRSHVGH
>P56868 5.1.1.3~~~murI~~~Glutamate racemase~~~
MKIGIFDSGVGGLTVLKAIRNRYRKVDIVYLGDTARVPYGIRSKDTIIRYSLECAGFLKDKGVDIIVVACNTASAYALER
LKKEINVPVFGVIEPGVKEALKKSRNKKIGVIGTPATVKSGAYQRKLEEGGADVFAKACPLFVPLAEEGLLEGEITRKVV
EHYLKEFKGKIDTLILGCTHYPLLKKEIKKFLGDVEVVDSSEALSLSLHNFIKDDGSSSLELFFTDLSPNLQFLIKLILG
RDYPVKLAEGVFTH
>O31332 5.1.1.3~~~murI~~~Glutamate racemase~~~
MKLNRAIGVIDSGVGGLTVAKELIRQLPKERIIYLGDTARCPYGPRSREEVRQFTWEMTEHLLDLNIKMLVIACNTATAV
VLEEMQKQLPIPVVGVIHPGSRTALKMTNTYHVGIIGTIGTVKSGAYEEALKSINNRVMVESLACPPFVELVESGNFESE
MAYEVVRETLQPLKNTDIDTLILGCTHYPILGPVIKQVMGDKVQLISSGDETAREVSTILYHSKMLNEGEEQSDHLFLTT
GKIGLFKEIASKWFGQPIENVKHIYLEKE
>Q9PM24 5.1.1.3~~~murI~~~Glutamate racemase~~~COG0796
MKIGVFDSGVGGLSVLKSLYEARLFDEIIYYGDTARVPYGVKDKDTIIKFCLEALDFFEQFQIDMLIIACNTASAYALDA
LRAKAHFPVYGVIDAGVEATIKALHDKNKEILVIATKATIKSEEYQKRLLSQGYTNINALATGLFVPMVEEGIFEGDFLQ
SAMEYYFKNITTPDALILACTHFPLLGRSLSKYFGDKTKLIHSGDAIVEFLKERENIDLKNHKAKLHFYASSDVESLKNT
AKIWLNLLRK
>P22634 5.1.1.3~~~murI~~~Glutamate racemase~~~COG0796
MATKLQDGNTPCLAATPSEPRPTVLVFDSGVGGLSVYDEIRHLLPDLHYIYAFDNVAFPYGEKSEAFIVERVVAIVTAVQ
ERYPLALAVVACNTASTVSLPALREKFDFPVVGVVPAIKPAARLTANGIVGLLATRGTVKRSYTHELIARFANECQIEML
GSAEMVELAEAKLHGEDVSLDALKRILRPWLRMKEPPDTVVLGCTHFPLLQEELLQVLPEGTRLVDSGAAIARRTAWLLE
HEAPDAKSADANIAFCMAMTPGAEQLLPVLQRYGFETLEKLAVLG
>Q836J0 5.1.1.3~~~murI~~~Glutamate racemase~~~COG0796
MSNQEAIGLIDSGVGGLTVLKEALKQLPNERLIYLGDTARCPYGPRPAEQVVQFTWEMADFLLKKRIKMLVIACNTATAV
ALEEIKAALPIPVVGVILPGARAAVKVTKNNKIGVIGTLGTIKSASYEIAIKSKAPTIEVTSLDCPKFVPIVESNQYRSS
VAKKIVAETLQALQLKGLDTLILGCTHYPLLRPVIQNVMGSHVTLIDSGAETVGEVSMLLDYFDIAHTPEAPTQPHEFYT
TGSAKMFEEIASSWLGIENLKAQQIHLGGNEND
>Q9ZLT0 5.1.1.3~~~murI~~~Glutamate racemase~~~COG0796
MKIGVFDSGVGGFSVLKSLLKARLFDEIIYYGDSARVPYGTKDPTTIKQFGLEALDFFKPHEIELLIVACNTASALALEE
MQKYSKIPIVGVIEPSILAIKRQVEDKNAPILVLGTKATIQSNAYDNALKQQGYLNISHLATSLFVPLIEESILEGELLE
TCMHYYFTPLEILPEVIILGCTHFPLIAQKIEGYFMGHFALPTPPLLIHSGDAIVEYLQQKYALKNNACTFPKVEFHASG
DVIWLERQAKEWLKL
>Q03469 5.1.1.3~~~murI~~~Glutamate racemase~~~
MDNRPIGVMDSGLGGLSVVRVIQQKLPNEEVIFVGDQGHFPYGTKDQAEVRQLALSIGAFLLKHDVKMMVVACNTATAAA
LPALQAALPIPVIGVIEPGARAALAQDKKGPIGVIATTATTTAGAYPATIERLAPGTPVIAKATQPMVEIVEHGQTGTAK
AQEVVSEQLMTFKEHPVKTLIMGCTHFPFLAPEISKAVGPTVALVDPAKETVATAKSWLEQHQAMGNHAHPNYHLYSTGN
LPDLRAGVNKWLLSGHFDLGTAQIEEGD
>Q8Y7N7 5.1.1.3~~~murI~~~Glutamate racemase~~~COG0796
MKQAIGFIDSGVGGLTVVREVLKQLPHEQVYYLGDTARCPYGPRDKEEVAKFTWEMTNFLVDRGIKMLVIACNTATAAAL
YDIREKLDIPVIGVIQPGSRAALKATRNNKIGVLGTLGTVESMAYPTALKGLNRRVEVDSLACPKFVSVVESGEYKSAIA
KKVVAESLLPLKSTKIDTVILGCTHYPLLKPIIENFMGDGVAVINSGEETASEVSALLDYHNLLDATDEEIEHRFFTTGS
TQIFKDIAKDWLNMPDMTVEHIKLGK
>A0R1X0 5.1.1.3~~~murI~~~Glutamate racemase~~~COG0796
MSDRLAPIGIFDSGVGGLTVARAIIDQLPDEDIVYVGDTGNGPYGPLTIPQIRAHSLAIGDDLVSRGVKALVIACNTASS
ACLRDARERYSPVPVVEVILPAVRRAVAATRNGRIGVIGTQATIASGAYQDAFAAARDTEVFTVACPRFVDFVERGVTSG
RQVLGLAEGYLEPLQLAEVDTLVLGCTHYPMLSGLIQLAMGDNVTLVSSAEETAKDLLRVLTELDLLRPHPDDPSVTAVR
RFEATGDPEAFTALAARFLGPTLDGVRPVRRHAGAGR
>P9WPW9 5.1.1.3~~~murI~~~Glutamate racemase~~~COG0796
MNSPLAPVGVFDSGVGGLTVARAIIDQLPDEDIVYVGDTGNGPYGPLTIPEIRAHALAIGDDLVGRGVKALVIACNSASS
ACLRDARERYQVPVVEVILPAVRRAVAATRNGRIGVIGTRATITSHAYQDAFAAARDTEITAVACPRFVDFVERGVTSGR
QVLGLAQGYLEPLQRAEVDTLVLGCTHYPLLSGLIQLAMGENVTLVSSAEETAKEVVRVLTEIDLLRPHDAPPATRIFEA
TGDPEAFTKLAARFLGPVLGGVQPVHPSRIH
>P63638 5.1.1.3~~~murI~~~Glutamate racemase~~~
MNKPIGVIDSGVGGLTVAKEIMRQLPNETIYYLGDIGRCPYGPRPGEQVKQYTVEIARKLMEFDIKMLVIACNTATAVAL
EYLQKTLSIPVIGVIEPGARTAIMTTRNQNVLVLGTEGTIKSEAYRTHIKRINPHVEVHGVACPGFVPLVEQMRYSDPTI
TSIVIHQTLKRWRNSESDTVILGCTHYPLLYKPIYDYFGGKKTVISSGLETAREVSALLTFSNEHASYTEHPDHRFFATG
DPTHITNIIKEWLNLSVNVERISVND
>Q6GHT5 5.1.1.3~~~murI~~~Glutamate racemase~~~
MNKPIGVIDSGVGGLTVAKEIMRQLPNETIYYLGDIGRCPYGPRPGEQVKQYTVEIARKLMEFDIKMLVIACNTATAVAL
EYLQKTLSIPVIGVIEPGARTAIMTTRNQNVLVLGTEGTIKSEAYRTHIKRINPHVEVHGVACPGFVPLVEQMRYSDPTI
TSIVIHQTLKRWRNSESDTVILGCTHYPLLYKPIYDYFGGKKTVISSGLETAREVSALLTFSNEHASYTEHPDHRFFATG
DTTHITNIIKEWLNLSVNVERISVND
>Q9A1B7 5.1.1.3~~~murI~~~Glutamate racemase~~~
MDTRPIGFLDSGVGGLTVVCELIRQLPHEKIVYIGDSARAPYGPRPKKQIKEYTWELVNFLLTQNVKMIVFACNTATAVA
WEEVKAALDIPVLGVVLPGASAAIKSTTKGQVGVIGTPMTVASDIYRKKIQLLAPSIQVRSLACPKFVPIVESNEMCSSI
AKKIVYDSLAPLVGKIDTLVLGCTHYPLLRPIIQNVMGPSVKLIDSGAECVRDISVLLNYFDINGNYHQKAVEHRFFTTA
NPEIFQEIASIWLKQKINVEHVTL
>P63640 5.1.1.3~~~murI~~~Glutamate racemase~~~COG0796
MDNRPIGFLDSGVGGLTVVRELMRQLPHEEIVYIGDSARAPYGPRPAEQIREYTWQLVNFLLTKDVKMIVIACNTATAVV
WEEIKAQLDIPVLGVILPGASAAIKSSQGGKIGVIGTPMTVQSDIYRQKIHDLDPDLQVESLACPKFAPLVESGALSTSV
TKKVVYETLRPLVGKVDSLILGCTHYPLLRPIIQNVMGPKVQLIDSGAECVRDISVLLNYFEINRGRDAGPLHHRFYTTA
SSQSFAQIGEEWLEKEIHVEHVEL
>Q5SHT7 5.1.1.3~~~murI~~~Glutamate racemase~~~COG0796
MKDPKAPIGVFDSGVGGLTVLKALRRLLPREEFLYFGDTARVPYGGKPLAMVRRFAWEIAGFLLRQGVKAIVVACNTASS
AALPDLAEDLSVPVFGVVEPAARAARGFRKVGLIGTQATVESGAYPRYVDLAWAKACPLFVPLVEEGLWDDPVALLVARH
YLEDAPKDLEALILGCTHYPFLKGAIGAVLPGVALLDSAELTAQEVARALEAEGLLNPEGRGRTFHLVTGDPEAYRALAE
RLGERVEAVRRVSLEEL
>O34674 ~~~murJ~~~Lipid II flippase MurJ~~~COG2244
MSSKLLRGTFVLTLGTYISRILGMVYLIPFSIMVGATGGALFQYGYNQYTLFLNIATMGFPAAVSKFVSKYNSKGDYETS
RKMLKAGMSVMLVTGMIAFFILYLSAPMFAEISLGGKDNNGLTIDHVVYVIRMVSLALLVVPIMSLVRGFFQGHQMMGPT
AVSQVVEQIVRIIFLLSATFLILKVFNGGLVIAVGYATFAALIGAFGGLVVLYIYWNKRKGSLLAMMPNTGPTANLSYKK
MFFELFSYAAPYVFVGLAIPLYNYIDTNTFNKAMIEAGHQAISQDMLAILTLYVQKLVMIPVSLATAFGLTLIPTITESF
TSGNYKLLNQQINQTMQTILFLIIPAVVGISLLSGPTYTFFYGSESLHPELGANILLWYSPVAILFSLFTVNAAILQGIN
KQKFAVVSLVIGVVIKLVLNVPLIKLMQADGAILATALGYIASLLYGFIMIKRHAGYSYKILVKRTVLMLVLSAIMGIAV
KIVQWVLGFFISYQDGQMQAAIVVVIAAAVGGAVYLYCGYRLGFLQKILGRRLPGFFRKGRHAG
>B4E9G3 ~~~murJ~~~Lipid II flippase MurJ~~~COG0728
MNLFRALLTVSGFTLLSRVTGLARETLIARAFGASQYTDAFYVAFRIPNLLRRLSAEGAFSQAFVPILAEFKNQQGHDAT
KALVDAMSTVLAWALAVLSVVGIAGASWVVFAVASGLHSDGQAFPLAVTMTRIMFPYIVFISLTTLASGVLNTYKSFSLP
AFAPVLLNVAFIAAAVFVAPHLKVPVYALAWAVIVGGVLQFLVQLPGLKKVDMVPLIGLNPLRALRHPGVKRVLAKMVPA
TFAVSVAQLSLIINTNIASRLGQGAVSWINYADRLMEFPTALLGVALGTILLPSLSKAHVDADSHEYSALLDWGLRVTFL
LAAPSALALFFFATPLTATLFNYGKFDAHTVTMVARALATYGIGLVGIILIKILAPGFYAKQDIKTPVKIAIGVLIVTQL
SNYVFVPLIGHAGLTLSIGVGACLNSLLLFLGLRKRGIYQPSPGWLRFFVQLVGAALVLAGLMHWCAINFDWTGMRAQPL
DRIALMAACLVLFAALYFGMLWVMGFKYAYFRRRAK
>P0AF16 ~~~murJ~~~Lipid II flippase MurJ~~~COG0728
MNLLKSLAAVSSMTMFSRVLGFARDAIVARIFGAGMATDAFFVAFKLPNLLRRIFAEGAFSQAFVPILAEYKSKQGEDAT
RVFVSYVSGLLTLALAVVTVAGMLAAPWVIMVTAPGFADTADKFALTSQLLKITFPYILLISLASLVGAILNTWNRFSIP
AFAPTLLNISMIGFALFAAPYFNPPVLALAWAVTVGGVLQLVYQLPHLKKIGMLVLPRINFHDAGAMRVVKQMGPAILGV
SVSQISLIINTIFASFLASGSVSWMYYADRLMEFPSGVLGVALGTILLPSLSKSFASGNHDEYNRLMDWGLRLCFLLALP
SAVALGILSGPLTVSLFQYGKFTAFDALMTQRALIAYSVGLIGLIVVKVLAPGFYSRQDIKTPVKIAIVTLILTQLMNLA
FIGPLKHAGLSLSIGLAACLNASLLYWQLRKQKIFTPQPGWMAFLLRLVVAVLVMSGVLLGMLHIMPEWSLGTMPWRLLR
LMAVVLAGIAAYFAALAVLGFKVKEFARRTV
>P37169 ~~~murJ~~~Probable lipid II flippase MurJ~~~
MQEFYARVWNTKEMNLLKSLAAVSSMTMFSRVLGFARDAIVARIFGAGMATDAFFVAFKLPNLLRRIFAEGAFSQAFVPI
LAEYKSKQGEEATRIFVAYVSGLLTLALAVVTVAGMLAAPWVIMVTAPGFADTADKFALTTQLLRITFPYILLISLASLV
GAILNTWNRFSIPAFAPTFLNISMIGFALFAAPYFNPPVLALAWAVTVGGVLQLVYQLPYLKKIGMLVLPRINFRDTGAM
RVVKQMGPAILGVSVSQISLIINTIFASFLASGSVSWMYYADRLMEFPSGVLGVALGTILLPSLSKSFASGNHDEYCRLM
DWGLRLCFLLALPSAVALGILAKPLTVSLFQYGKFTAFDAAMTQRALIAYSVGLIGLIVVKVLAPGFYSRQDIKTPVKIA
IVTLIMTQLMNLAFIGPLKHAGLSLSIGLAACLNASLLYWQLRKQNIFTPQPGWMWFLMRLIISVLVMAAVLFGVLHIMP
EWSQGSMLWRLLRLMAVVIAGIAAYFAALAVLGFKVKEFVRRTA
>B7IE18 ~~~murJ~~~Lipid II flippase MurJ~~~COG0728
MSILFSSILFSIATFFSRILGLFRDVLFAKYFGVSYELDAYFIAIMFPFFLRKVFGEGAMSSAFVPLYSEKSGEEKDKFL
SSVINGFSLIILALVILSYFFPELIINLFGAGSSHETKILAKKLLLITSPSIYFIFLWAISYSILNTNNKFFWPALTPSI
SNITIIIGTFLSTKYGIISPTIGFLIGSILMFFSIIKSIIKHKYYFTIKHFPHFLKLFFPTFMTMVVSQINTVVDMNVVS
FYDKGSISYLQYASRFYLLPYGLFAVSVSTVVLSKISNDRKNFNYHLNDALKTTLFFTIPSMVGLIFLSTPIIRFFYEHG
AFTSKDTLITSKILIAYTLGLPFYGIYSTISRSYHAIKNTKTPFIAATIVSLSNIILDIIFGLKYGPIGVALATSIAGII
GVLYLLFSVKTFPIKDFLKISLNSLIMLFVIYLTDFTDNEFWFLIQILIGILVYLIFSSIFYRDLIRRFLYARKK
>Q97ML3 2.7.1.-~~~murK~~~N-acetylmuramic acid/N-acetylglucosamine kinase~~~COG2971
MKYVIGIDGGGSKTHMKISTLDYKVLLEVFKGPSNINSSTKEEVKRVLQELIMEGLGKLGQSLEECSAICIGTAGADRTE
DKSIIEDMIRSLGYMGKIIVVNDAEIALAGGIEKREGIIVISGTGSICYGRNKEGRSARSGGWGHIIGDEGSGYDIGIKA
IKAALKSFDKRGEKTILEGDILDFLKLKSHEDLINYIYRSGVTKKEIASLTRVVNSAYIKGDLVSKRILKEAARELFLSV
KAVVEVLSMQNKKVVLTTAGGVINNINYLYDEFRKFLNLNYPKVKIISMKNDSAFGAVIIARSECD
>C4RJF8 5.1.1.23~~~murL~~~UDP-N-acetyl-alpha-D-muramoyl-L-alanyl-L-glutamate epimerase~~~COG1365
MPNEQLRRMDAFTFPSYSIDLATGEALFDYALTGPDGEQRFTEVITLPLPAEPPSDATVATLGRVLELLHVVAGVSYYKA
AAPRRLVLPAPLGEAAAALVTAVYTKGLAEYAYRNQLPHVLELTPEIPAGSVPPAREYDNSDLRPLSAVGGGKDSIVSLE
ALRRAGLDPVPFSVNPNHVIVAVNEASGLAPLAARRRIDPVLFDLNAAGARNGHIPVTAINSLIAVATAVLNRLGPVVMS
NERSASDPNLVWNGHEINHQWSKGVEAEGLLRAALAEHAGLTEPYFSLLRSLSELHIARLFAQIDRYDDVVTSCNAAFKL
RDASERWCRDCPKCRFVFLAMAPFMPRERVVHIFGGDLLADETQIPGYRELLGVDGHKPFECVGEVEESVVALSLLAEQD
QWRDAPVVRALVDAVPETAWSAAATSDVFTPGGPHHIPPPYAKALAQ
>A4X982 5.1.1.23~~~murL~~~UDP-N-acetyl-alpha-D-muramoyl-L-alanyl-L-glutamate epimerase~~~COG1365
MPNEQLRRMDAFTFPSYSFDLSTGEALFDYALTGPDGEQRFTEVITLPLPASPPSDERVATLGRVLELLHVIAGVSYYKT
AAPHRLVLPAPLGPSAVALVTAVYTKGLAEYAYRNALPHVLRLRPEVPSGEVTPPVGYDTDGRRPLSAVGGGKDSIVSLE
ALRRDGLDPLPFSVNPNRVIEAVNVASGLPALAARRRIDPVLFDLNAAGALNGHIPVTAINSLIAVATSVLNGLGPVVMS
NERSASDPNLIWDGHQINHQWSKGVEAEGLLRGALEEHAGLTDPYFSLLRQLSELHIARTFARIGGYDDVVTSCNAAFKL
RGASDRWCRDCPKCRFVFLALAPFMPRERITRVFGGDLLADPAQLPGYRELLGVDGHKPFECVGEVEESVVALSLLGEQS
GWRDAPVVSALIDAVPETAWKTAATSAVFTPGGPSFTPTRYADALGSLTEPARNLPG
>P0DQD8 5.1.1.23~~~murL~~~UDP-N-acetyl-alpha-D-muramoyl-L-alanyl-L-glutamate epimerase~~~
MSAFDKHQISTFRFVRCALDAQTGVATLVYAFDQGPELVETVAVPGAPFALGGANATAVQQALQLLHLIAGVSYFKAAVP
PNIAIDSYSIDAETAALVQSVYLHGLGEFAYRNGLQLHGKIRFPVAAQAAAAAPALGLRVHALVAIGGGKDSLVSIEALR
HAGVDQTVSWIGGSQLIRACAERTGLPVLNIGRVLAPELFELNRQGAWNGHIPVTAVNSAILVLAALLNGVDQVVFSNER
SASYGSQIPGTGEVNHQWSKGWAFEQAFGDYVQRHVAADLRYYSLLRPLSELAVARQFAKTDRYDAHFSSCNRNFHIMGE
RPVHRWCGVCPKCHFVFLALAPFMPKTRLVNIFGRNLLDDATQAGGYDALLEFQDHKPFECVGEGRESRTAMAVLASRAE
WKEDAVVKRFIRDIQPQLDPNDLQVEPLMAIEGEHRIPPALWERVRANFAV
>Q45582 4.2.1.126~~~murQ~~~N-acetylmuramic acid 6-phosphate etherase~~~COG2103
MSEPLNLHRLTTESRNSQTVEIHKANTLGILKMINNEDMKVAAAVQEVLPDIKTAVDCAYESFQNGGRLIYTGAGTSGRL
GVMDAVECPPTYSVSPDQVIGIMAGGPEAFLQAAEGIEDSEEAGAEDLRNIQLTSNDTVIAIAASGRTPYAAGALRYARK
VGAHTIALTCNENSAISKDADHSIEVVVGPEAITGSTRMKAATAHKMILNMISTAVMVKIGKVYENLMVDVNVSNKKLKE
RAISIIQSLTNASYDTARYTLEQADHHVKTAIVMLKTSTDQKQAQTLLDEANGFIDKAIEHYHP
>P76535 4.2.1.126~~~murQ~~~N-acetylmuramic acid 6-phosphate etherase~~~COG2103
MQFEKMITEGSNTASAEIDRVSTLEMCRIINDEDKTVPLAVERVLPDIAAAIDVIHAQVSGGGRLIYLGAGTSGRLGILD
ASECPPTYGVKPGLVVGLIAGGEYAIQHAVEGAEDSREGGVNDLKNINLTAQDVVVGIAASGRTPYVIAGLEYARQLGCR
TVGISCNPGSAVSTTAEFAITPIVGAEVVTGSSRMKAGTAQKLVLNMLSTGLMIKSGKVFGNLMVDVVATNEKLHVRQVN
IVKNATGCSAEQAEAALIACERNCKTAIVMVLKNLDAAEAKKRLDQHGGFIRQVLDKE
>P44862 4.2.1.126~~~murQ~~~N-acetylmuramic acid 6-phosphate etherase~~~COG2103
MNDIILKSLSTLITEQRNPNSVDIDRQSTLEIVRLMNEEDKLVPLAIESCLPQISLAVEQIVQAFQQGGRLIYIGAGTSG
RLGVLDASECPPTFGVSTEMVKGIIAGGECAIRHPVEGAEDNTKAVLNDLQSIHFSKNDVLVGIAASGRTPYVIAGLQYA
KSLGALTISIASNPKSEMAEIADIAIETIVGPEILTGSSRLKSGTAQKMVLNMLTTASMILLGKCYENLMVDVQASNEKL
KARAVRIVMQATDCNKTLAEQTLLEADQNAKLAIMMILSTLSKSEAKVLLERHQGKLRNALSK
>P77245 ~~~murR~~~HTH-type transcriptional regulator MurR~~~COG1737
MLYLTKISNAGSEFTENEQKIADFLQANVSELQSVSSRQMAKQLGISQSSIVKFAQKLGAQGFTELRMALIGEYSASREK
TNATALHLHSSITSDDSLEVIARKLNREKELALEQTCALLDYARLQKIIEVISKAPFIQITGLGGSALVGRDLSFKLMKI
GYRVACEADTHVQATVSQALKKGDVQIAISYSGSKKEIVLCAEAARKQGATVIAITSLTDSPLRRLAHFTLDTVSGETEW
RSSSMSTRTAQNSVTDLLFVGLVQLNDVESLKMIQRSSELTQRLK
>A0A0H2WZQ7 6.3.5.13~~~murT~~~Lipid II isoglutaminyl synthase (glutamine-hydrolyzing) subunit MurT~~~
MRQWTAIHLAKLARKASRAVGKRGTDLPGQIARKVDTDILRKLAEQVDDIVFISGTNGKTTTSNLIGHTLKANNIQIIHN
NEGANMAAGITSAFIMQSTPKTKIAVIEIDEGSIPRVLKEVTPSMMVFTNFFRDQMDRFGEIDIMVNNIAETISNKGIKL
LLNADDPFVSRLKIASDTIVYYGMKAHAHEFEQSTMNESRYCPNCGRLLQYDYIHYNQIGHYHCQCGFKREQAKYEISSF
DVAPFLYLNINDEKYDMKIAGDFNAYNALAAYTVLRELGLNEQTIKNGFETYTSDNGRMQYFKKERKEAMINLAKNPAGM
NASLSVGEQLEGEKVYVISLNDNAADGRDTSWIYDADFEKLSKQQIEAIIVTGTRAEELQLRLKLAEVEVPIIVERDIYK
ATAKTMDYKGFTVAIPNYTSLAPMLEQLNRSFEGGQS
>A0A0H3JUU7 6.3.5.13~~~murT~~~Lipid II isoglutaminyl synthase (glutamine-hydrolyzing) subunit MurT~~~
MRQWTAIHLAKLARKASRAVGKRGTDLPGQIARKVDTDVLRKLAEQVDDIVFISGTNGKTTTSNLIGHTLKANNIQIIHN
NEGANMAAGITSAFIMQSTPKTKIAVIEIDEGSIPRVLKEVTPSMMVFTNFFRDQMDRFGEIDIMVNNIAETISNKGIKL
LLNADDPFVSRLKIASDTIVYYGMKAHAHEFEQSTMNESRYCPNCGRLLQYDYIHYNQIGHYHCQCGFKREQAKYEISSF
DVAPFLYLNINDEKYDMKIAGDFNAYNALAAYTVLRELGLNEQTIKNGFETYTSDNGRMQYFKKERKEAMINLAKNPAGM
NASLSVGEQLEGEKVYVISLNDNAADGRDTSWIYDADFEKLSKQQIEAIIVTGTRAEELQLRLKLAEVEVPIIVERDIYK
ATAKTMDYKGFTVAIPNYTSLAPMLEQLNRSFEGGQS
>Q8DNZ9 6.3.5.13~~~murT~~~Lipid II isoglutaminyl synthase (glutamine-hydrolyzing) subunit MurT~~~COG0770
MNLKTTLGLLAGRSSHFVLSRLGRGSTLPGKVALQFDKDILQSLAKNYEIVVVTGTNGKTLTTALTVGILKEVYGQVLTN
PSGANMITGIATTFLTAKSSKTGKNIAVLEIDEASLSRICDYIQPSLFVITNIFRDQMDRFGEIYTTYNMILDAIRKVPT
ATVLLNGDSPLFYKPTIPNPIEYFGFDLEKGPAQLAHYNTEGILCPDCQGILKYEHNTYANLGAYICEGCGCKRPDLDYR
LTKLVELTNNRSRFVIDGQEYGIQIGGLYNIYNALAAVAIARFLGADSQLIKQGFDKSRAVFGRQETFHIGDKECTLVLI
KNPVGATQAIEMIKLAPYPFSLSVLLNANYADGIDTSWIWDADFEQITDMDIPEINAGGVRHSEIARRLRVTGYPAEKIT
ETSNLEQVLKTIENQDCKHAYILATYTAMLEFRELLASRQIVRKEMN
>Q88QT2 2.7.7.99~~~murU~~~N-acetylmuramate alpha-1-phosphate uridylyltransferase~~~COG1208
MKAMILAAGKGERMRPLTLHTPKPLVPVAGQPLIEYHLRALAAAGVTEVVINHAWLGQQIEDHLGDGSRFGLSIRYSPEG
EPLETGGGIFKALPLLGDAPFLLVNGDVWTDYDFARLQAPLQGLAHLVLVDNPGHHGRGDFRLVGEQVVDGDDAPGTLTF
SGISVLHPALFEGCQAGAFKLAPLLRQAMAAGKVSGEHYRGHWVDVGTLERLAEAESLIGERA
>P9WJK7 5.4.99.2~~~mutA~~~Probable methylmalonyl-CoA mutase small subunit~~~COG1884
MSIDVPERADLEQVRGRWRNAVAGVLSKSNRTDSAQLGDHPERLLDTQTADGFAIRALYTAFDELPEPPLPGQWPFVRGG
DPLRDVHSGWKVAEAFPANGATADTNAAVLAALGEGVSALLIRVGESGVAPDRLTALLSGVYLNLAPVILDAGADYRPAC
DVMLALVAQLDPGQRDTLSIDLGADPLTASLRDRPAPPIEEVVAVASRAAGERGLRAITVDGPAFHNLGATAATELAATV
AAAVAYLRVLTESGLVVSDALRQISFRLAADDDQFMTLAKMRALRQLWARVAEVVGDPGGGAAVVHAETSLPMMTQRDPW
VNMLRCTLAAFGAGVGGADTVLVHPFDVAIPGGFPGTAAGFARRIARNTQLLLLEESHVGRVLDPAGGSWFVEELTDRLA
RRAWQRFQAIEARGGFVEAHDFLAGQIAECAARRADDIAHRRLAITGVNEYPNLGEPALPPGDPTSPVRRYAAGFEALRD
RSDHHLARTGARPRVLLLPLGPLAEHNIRTTFATNLLASGGIEAIDPGTVDAGTVGNAVADAGSPSVAVICGTDARYRDE
VADIVQAARAAGVSRVYLAGPEKALGDAAHRPDEFLTAKINVVQALSNLLTRLGA
>P11652 5.4.99.2~~~mutA~~~Methylmalonyl-CoA mutase small subunit~~~
MSSTDQGTNPADTDDLTPTTLSLAGDFPKATEEQWEREVEKVLNRGRPPEKQLTFAECLKRLTVHTVDGIDIVPMYRPKD
APKKLGYPGVAPFTRGTTVRNGDMDAWDVRALHEDPDEKFTRKAILEGLERGVTSLLLRVDPDAIAPEHLDEVLSDVLLE
MTKVEVFSRYDQGAAAEALVSVYERSDKPAKDLALNLGLDPIAFAALQGTEPDLTVLGDWVRRLAKFSPDSRAVTIDANI
YHNAGAGDVAELAWALATGAEYVRALVEQGFTATEAFDTINFRVTATHDQFLTIARLRALREAWARIGEVFGVDEDKRGA
RQNAITSWRDVTREDPYVNILRGSIATFSASVGGAESITTLPFTQALGLPEDDFPLRIARNTGIVLAEEVNIGRVNDPAG
GSYYVESLTRSLADAAWKEFQEVEKLGGMSKAVMTEHVTKVLDACNAERAKRLANRKQPITAVSEFPMIGARSIETKPFP
AAPARKGLAWHRDSEVFEQLMDRSTSVSERPKVFLACLGTRRDFGGREGFSSPVWHIAGIDTPQVEGGTTAEIVEAFKKS
GAQVADLCSSAKVYAQQGLEVAKALKAAGAKALYLSGAFKEFGDDAAEAEKLIDGRLFMGMDVVDTLSSTLDILGVAK
>P9WJK5 5.4.99.2~~~mutB~~~Probable methylmalonyl-CoA mutase large subunit~~~COG2185
MTTKTPVIGSFAGVPLHSERAAQSPTEAAVHTHVAAAAAAHGYTPEQLVWHTPEGIDVTPVYIAADRAAAEAEGYPLHSF
PGEPPFVRGPYPTMYVNQPWTIRQYAGFSTAADSNAFYRRNLAAGQKGLSVAFDLATHRGYDSDHPRVQGDVGMAGVAID
SILDMRQLFDGIDLSTVSVSMTMNGAVLPILALYVVAAEEQGVAPEQLAGTIQNDILKEFMVRNTYIYPPKPSMRIISDI
FAYTSAKMPKFNSISISGYHIQEAGATADLELAYTLADGVDYIRAGLNAGLDIDSFAPRLSFFWGIGMNFFMEVAKLRAG
RLLWSELVAQFAPKSAKSLSLRTHSQTSGWSLTAQDVFNNVARTCIEAMAATQGHTQSLHTNALDEALALPTDFSARIAR
NTQLVLQQESGTTRPIDPWGGSYYVEWLTHRLARRARAHIAEVAEHGGMAQAISDGIPKLRIEEAAARTQARIDSGQQPV
VGVNKYQVPEDHEIEVLKVENSRVRAEQLAKLQRLRAGRDEPAVRAALAELTRAAAEQGRAGADGLGNNLLALAIDAARA
QATVGEISEALEKVYGRHRAEIRTISGVYRDEVGKAPNIAAATELVEKFAEADGRRPRILIAKMGQDGHDRGQKVIATAF
ADIGFDVDVGSLFSTPEEVARQAADNDVHVIGVSSLAAGHLTLVPALRDALAQVGRPDIMIVVGGVIPPGDFDELYAAGA
TAIFPPGTVIADAAIDLLHRLAERLGYTLD
>P11653 5.4.99.2~~~mutB~~~Methylmalonyl-CoA mutase large subunit~~~
MSTLPRFDSVDLGNAPVPADAARRFEELAAKAGTGEAWETAEQIPVGTLFNEDVYKDMDWLDTYAGIPPFVHGPYATMYA
FRPWTIRQYAGFSTAKESNAFYRRNLAAGQKGLSVAFDLPTHRGYDSDNPRVAGDVGMAGVAIDSIYDMRELFAGIPLDQ
MSVSMTMNGAVLPILALYVVTAEEQGVKPEQLAGTIQNDILKEFMVRNTYIYPPQPSMRIISEIFAYTSANMPKWNSISI
SGYHMQEAGATADIEMAYTLADGVDYIRAGESVGLNVDQFAPRLSFFWGIGMNFFMEVAKLRAARMLWAKLVHQFGPKNP
KSMSLRTHSQTSGWSLTAQDVYNNVVRTCIEAMAATQGHTQSLHTNSLDEAIALPTDFSARIARNTQLFLQQESGTTRVI
DPWSGSAYVEELTWDLARKAWGHIQEVEKVGGMAKAIEKGIPKMRIEEAAARTQARIDSGRQPLIGVNKYRLEHEPPLDV
LKVDNSTVLAEQKAKLVKLRAERDPEKVKAALDKITWAAGNPDDKDPDRNLLKLCIDAGRAMATVGEMSDALEKVFGRYT
AQIRTISGVYSKEVKNTPEVEEARELVEEFEQAEGRRPRILLAKMGQDGHDRGQKVIATAYADLGFDVDVGPLFQTPEET
ARQAVEADVHVVGVSSLAGGHLTLVPALRKELDKLGRPDILITVGGVIPEQDFDELRKDGAVEIYTPGTVIPESAISLVK
KLRASLDA
>P06722 ~~~mutH~~~DNA mismatch repair protein MutH~~~COG3066
MSQPRPLLSPPETEEQLLAQAQQLSGYTLGELAALVGLVTPENLKRDKGWIGVLLEIWLGASAGSKPEQDFAALGVELKT
IPVDSLGRPLETTFVCVAPLTGNSGVTWETSHVRHKLKRVLWIPVEGERSIPLAQRRVGSPLLWSPNEEEDRQLREDWEE
LMDMIVLGQVERITARHGEYLQIRPKAANAKALTEAIGARGERILTLPRGFYLKKNFTSALLARHFLIQ
>P44688 ~~~mutH~~~DNA mismatch repair protein MutH~~~COG3066
MIPQTLEQLLSQAQSIAGLTFGELADELHIPVPIDLKRDKGWVGMLLERALGATAGSKAEQDFSHLGVELKTLPINAEGY
PLETTFVSLAPLVQNSGVKWENSHVRHKLSCVLWMPIEGSRHIPLRERHIGAPIFWKPTAEQERQLKQDWEELMDLIVLG
KLDQITARIGEVMQLRPKGANSRAVTKGIGKNGEIIDTLPLGFYLRKEFTAQILNAFLETKSL
>P80925 ~~~~~~Bacteriocin mundticin~~~
KYYGNGVSCNKKGCSVDWGKAIGIIGNNSAANLATGGAAGWSK
>O67518 ~~~mutL~~~DNA mismatch repair protein MutL~~~COG0323
MFVKLLPPEVRKVIAAGEVIESPVDVVKELVENSLDAKATKVEVEIVKGGKRLIRVKDNGTGIHPEDVEKVVLQGATSKI
ETEKDLMNISTYGFRGEALYSISSVSKFKLRSRFFQEKEGKEIEVEAGNILGTRRVGMPVGTEVEVRDLFFNLPVRRKFL
KKEDTERRKVLELIKEYALTNPEVEFTLFSEGRETLKLKKSSLKERVEEVFQTKTEELYAEREGITLRAFVSRNQRQGKY
YVFINKRPIQNKNLKEFLRKVFGYKTLVVLYAELPPFMVDFNVHPKKKEVNILKERKFLELVRELAGKEKPIVDIPLSQP
VKTYKPTYEILGQMDETFILVKDSEYLYFVDQHLLEERINYEKLKDENLACRISVKAGQKLSEEKIRELIKTWRNLENPH
VCPHGRPIYYKIPLREIYEKVGRNY
>P49850 ~~~mutL~~~DNA mismatch repair protein MutL~~~COG0323
MAKVIQLSDELSNKIAAGEVVERPASVVKELVENAIDADSTVIEIDIEEAGLASIRVLDNGEGMENEDCKRAFRRHATSK
IKDENDLFRVRTLGFRGEALPSIASVSHLEITTSTGEGAGTKLVLQGGNIISESRSSSRKGTEIVVSNLFFNTPARLKYM
KTVHTELGNITDVVNRIALAHPEVSIRLRHHGKNLLQTNGNGDVRHVLAAIYGTAVAKKMLPLHVSSLDFEVKGYIALPE
ITRASRNYMSSVVNGRYIKNFPLVKAVHEGYHTLLPIGRHPITFIEITMDPILVDVNVHPSKLEVRLSKETELHDLIRDG
IKDVFKQQQLIPSAQVPKKSAPAIKNEQQFITFDEKPPEKKVPEKSTAPSYSPMKLSSVVKEPVDAEEKLPPLQFDAPPI
VDQEQTLEVSDVSAEQPETFEQECHEEQPQPASDRVPIMYPIGQMHGTYILAQNENGLYIIDQHAAQERIKYEYFREKVG
EVEPEVQEMIVPLTFHYSTNEALIIEQHKQELESVGVFLESFGSNSYIVRCHPAWFPKGEEAELIEEIIQQVLDSKNIDI
KKLREEAAIMMSCKGSIKANRHLRNDEIKALLDDLRSTSDPFTCPHGRPIIIHHSTYEMEKMFKRVM
>P23367 ~~~mutL~~~DNA mismatch repair protein MutL~~~COG0323
MPIQVLPPQLANQIAAGEVVERPASVVKELVENSLDAGATRIDIDIERGGAKLIRIRDNGCGIKKDELALALARHATSKI
ASLDDLEAIISLGFRGEALASISSVSRLTLTSRTAEQQEAWQAYAEGRDMNVTVKPAAHPVGTTLEVLDLFYNTPARRKF
LRTEKTEFNHIDEIIRRIALARFDVTINLSHNGKIVRQYRAVPEGGQKERRLGAICGTAFLEQALAIEWQHGDLTLRGWV
ADPNHTTPALAEIQYCYVNGRMMRDRLINHAIRQACEDKLGADQQPAFVLYLEIDPHQVDVNVHPAKHEVRFHQSRLVHD
FIYQGVLSVLQQQLETPLPLDDEPQPAPRSIPENRVAAGRNHFAEPAAREPVAPRYTPAPASGSRPAAPWPNAQPGYQKQ
QGEVYRQLLQTPAPMQKLKAPEPQEPALAANSQSFGRVLTIVHSDCALLERDGNISLLSLPVAERWLRQAQLTPGEAPVC
AQPLLIPLRLKVSAEEKSALEKAQSALAELGIDFQSDAQHVTIRAVPLPLRQQNLQILIPELIGYLAKQSVFEPGNIAQW
IARNLMSEHAQWSMAQAITLLADVERLCPQLVKTPPGGLLQSVDLHPAIKALKDE
>Q5F8M6 ~~~mutL~~~DNA mismatch repair protein MutL~~~
MPRIAALPDHLVNQIAAGEVVERPANALKEIVENSIDAGATAVDVELEGGGIRLIRVGDNGGGIHPDDIELALHRHATSK
IKTLNDLEHVASMGFRGEGLASIASVSRLTLTSRQEDSSHATQVKAEDGKLSSPTAAAHPVGTTIEAAELFFNTPARRKF
LKSENTEYAHCATMLERLALAHPHIAFSLKRDGKQVFKLPAQSLHERIAAIVGDDFQTASLEIDSGNSALRLYGAIAKPT
FAKGKTDKQYCFVNHRFVRDKVMLHAVKQAYRDVLHNALTPAFVLFLELPPEAVDVNVHPTKTEIRFRDSRQVHQLVFHT
LNKALADTRANLTESVSNAGEVLHDITGVTPAPMPSENDGENLFDSASNHPTGNKPDTRNAFGSSGKTAPMPYQAARAPQ
QHSLSLRESRAAMDTYAELYKKTDDIDLELSQFEQARFGNMPSETPAHKTDTPLSDGIPSQSELPPLGFAIAQLLGIYIL
AQAEDSLLLIDMHAAAERVNYEKMKRQRQENGNLQSQHLLIPVTFAASHEECAALADHAETLAGFGLELSDMGGNTLAVR
AAPVMLGKSDVVSLARDVLGELAQVGSSQTIASHENRILATMSCHGSIRAGRRLTLPEMNALLRDMENTPRSNQCNHGRP
TWVKLTLKELDTLFLRGQ
>P65492 ~~~mutL~~~DNA mismatch repair protein MutL~~~
MGKIKELQTSLANKIAAGEVVERPSSVVKELLENAIDAGATEISIEVEESGVQSIRVVDNGSGIEAEDLGLVFHRHATSK
LDQDEDLFHIRTLGFRGEALASISSVAKVTLKTCTDNANGNEIYVENGEILNHKPAKAKKGTDILVESLFYNTPARLKYI
KSLYTELGKITDIVNRMAMSHPDIRIALISDGKTMLSTNGSGRTNEVMAEIYGMKVARDLVHISGDTSDYHIEGFVAKPE
HSRSNKHYISIFINGRYIKNFMLNKAILEGYHTLLTIGRFPICYINIEMDPILVDVNVHPTKLEVRLSKEEQLYQLIVSK
IQEAFKDRILIPKNNLDYVPKKNKVLYSFEQQKIEFEQRQNTENNQEKTFSSEESNSKSFMAENQNDEIVIKEDSYNPFV
TKTSESLITDDESSGYNNTREKDEDYFKKQQEILQEMDQTFDSNEDASVQNYENKASDDYYDVNDIKGTKSKDPKRRIPY
MEIVGQVHGTYIIAQNEFGMYMIDQHAAQERIKYEYFRDKIGEVTNEVQDLLIPLTFHFSKDEQLVIDQYKNELQQVGIM
LEHFGGHDYIVSSYPVWFPKDEVEEIIKDMIELILEEKKVDIKKLREDVAIMMSCKKSIKANHYLQKHEMSDLIDQLREA
EDPFTCPHGRPIIINFSKYELEKLFKRVM
>P94545 3.1.-.-~~~mutSB~~~Endonuclease MutS2~~~COG1193
MQQKVLSALEFHKVKEQVIGHAASSLGKEMLLELKPSASIDEIKKQLDEVDEASDIIRLRGQAPFGGLVDIRGALRRAEI
GSVLSPSEFTEISGLLYAVKQMKHFITQMAEDGVDIPLIHQHAEQLITLSDLERDINSCIDDHGEVLDHASETLRGIRTQ
LRTLESRVRDRLESMLRSSSASKMLSDTIVTIRNDRFVIPVKQEYRSSYGGIVHDTSSSGATLFIEPQAIVDMNNSLQQA
KVKEKQEIERILRVLTEKTAEYTEELFLDLQVLQTLDFIFAKARYAKAVKATKPIMNDTGFIRLKKARHPLLPPDQVVAN
DIELGRDFSTIVITGPNTGGKTVTLKTLGLLTLMAQSGLHIPADEGSEAAVFEHVFADIGDEQSIEQSLSTFSSHMVNIV
GILEQVNENSLVLFDELGAGTDPQEGAALAMSILDDVHRTNARVLATTHYPELKAYGYNREGVMNASVEFDIETLSPTYK
LLIGVPGRSNAFEISKRLGLPDHIIGQAKSEMTAEHNEVDTMIASLEQSKKRAEEELSETESIRKEAEKLHKELQQQIIE
LNSKKDKMLEEAEQQAAEKVKAAMKEAEDIIHELRTIKEEHKSFKDHELINAKKRLEGAMPAFEKSKKPEKPKTQKRDFK
PGDEVKVLTFGQKGTLLEKTGGNEWNVQIGILKMKVKEKDLEFIKSAPEPKKEKMITAVKGKDYHVSLELDLRGERYENA
LSRVEKYLDDAVLAGYPRVSIIHGKGTGALRKGVQDLLKNHRSVKSSRFGEAGEGGSGVTVVELK
>O25338 3.1.-.-~~~mutS2~~~Endonuclease MutS2~~~COG1193
MSDAPKRSLNPTLMMNNNNTPPKPLEESLDLKEFIALFKTFFAKERDTIALENDLKQTFTYLNEVDAIGLPTPKSVKESD
LIIIKLTKLGTLHLDEIFEIVKRLHYIVVLQNAFKTFTHLKFHERLNAIVLPPFFNDLIALFDDEGKIKQGANATLDALN
ESLNRLKKESVKIIHHYARSKELAPYLVDTQSHLKHGYECLLLKSGFSGAIKGVVLERSANGYFYLLPESAQKIAQKIAQ
IGNEIDCCIVEMCQTLSHSLQKHLLFLKFLFKEFDFLDSLQARLNFAKAYNLEFVMPSFTQKKMILENFSHPILKEPKPL
NLKFEKSMLAVTGVNAGGKTMLLKSLLSAAFLSKHLIPMKINAHHSIIPYFKEIHAIINDPQNSANNISTFAGRMKQFSA
LLSKENMLLGVDEIELGTDADEASSLYKTLLEKLLKQNNQIIITTHHKRLSVLMAENKEVELLAALYDEEKERPTYTFLK
GVIGKSYAFETALRYGVPHFLIEKAKTFYGEDKEKLNVLIENSSALERELKQKNEHLENALKEQEYLKNAWLLEMEKQKE
IFHNKKLELEKSYQQALNILKSEVASKDTSSMHKEIHKASEILSKHKTNQEIPQIITNFQANEKARYKNESVLIVQILDK
GYYWIETELGMRLKAHGSLLKKIQKPPKNKFKPPKTTIPKPKEASLRLDLRGQRSEEALDLLDAFLNDALLGGFEEVLIC
HGKGSGILEKFVKEFLKNHPKVVSFSDAPINLGGSGVKIVKL
>P65496 3.1.-.-~~~mutS2~~~Endonuclease MutS2~~~
MRQKTLDVLEFEKIKSLVANETISDLGLEKVNQMMPATNFETVVFQMEETDEIAQIYNKHRLPSLSGLSKVSAFIHRADI
GGVLNVSELNLIKRLIQVQNQFKTFYNQLVEEDEGVKYPILDDKMNQLPVLTDLFHQINETCDTYDLYDNASYELQGIRS
KISSTNQRIRQNLDRIVKSQANQKKLSDAIVTVRNERNVIPVKAEYRQDFNGIVHDQSASGQTLYIEPSSVVEMNNQISR
LRHDEAIEKERVLTQLTGYVAADKDALLVAEQVMGQLDFLIAKARYSRSIKGTKPIFKEERTVYLPKAYHPLLNRETVVA
NTIEFMEDIETVIITGPNTGGKTVTLKTLGLIIVMAQSGLLIPTLDGSQLSVFKNVYCDIGDEQSIEQSLSTFSSHMTNI
VEILKHADKHSLVLFDELGAGTDPSEGAALAMSILDHVRKIGSLVMATTHYPELKAYSYNREGVMNASVEFDVDTLSPTY
KLLMGVPGRSNAFDISKKLGLSLNIINKAKTMIGTDEKEINEMIESLERNYKRVETQRLELDRLVKEAEQVHDDLSKQYQ
QFQNYEKSLIEEAKEKANQKIKAATKEADDIIKDLRQLREQKGADVKEHELIDKKKRLDDHYEAKSIKQNVQKQKYDKIV
AGDEVKVLSYGQKGEVLEIVNDEEAIVQMGIIKMKLPIEDLEKKQKEKVKPTKMVTRQNRQTIKTELDLRGYRYEDALIE
LDQYLDQAVLSNYEQVYIIHGKGTGALQKGVQQHLKKHKSVSDFRGGMPSEGGFGVTVATLK
>Q9X105 3.1.-.-~~~mutS2~~~Endonuclease MutS2~~~COG1193
MDYLESLDFPKVVEIVKKYALSDLGRKHLDTLKPTVNPWDELELVEELLNYFNRWGEPPIKGLNDISQEVEKVKSGSPLE
PWELLRVSVFLEGCDILKKEFEKREYSRLKETFSRLSSFREFVEEVNRCIEQDGEISDRASPRLREIRTEKKRLSSEIKR
KADDFVRTHSQILQEQMYVYRDGRYLFPVKASMKNAVRGIVHHLSSSGATVFLEPDEFVELNNRVRLLEEEERLEISRIL
RQLTNILLSRLNDLERNVELIARFDSLYARVKFAREFNGTVVKPSSRIRLVNARHPLIPKERVVPINLELPPNKRGFIIT
GPNMGGKTVTVKTVGLFTALMMSGFPLPCDEGTELKVFPKIMADIGEEQSIEQSLSTFSSHMKKIVEIVKNADSDSLVIL
DELGSGTDPVEGAALAIAIIEDLLEKGATIFVTTHLTPVKVFAMNHPLLLNASMEFDPETLSPTYRVLVGVPGGSHAFQI
AEKLGLDKRIIENARSRLSREEMELEGLIRSLHEKISLLEEEKRKLQKEREEYMKLREKYEEDYKKLRRMKIEEFDKELR
ELNDYIRKVKKELDQAIHVAKTGSVDEMREAVKTIEKEKKDLEQKRIEEATEEEIKPGDHVKMEGGTSVGKVVEVKSGTA
LVDFGFLRLKVPVSKLRKTKKEEKKETSTFSYKPSSFRTEIDIRGMTVEEAEPVVKKFIDDLMMNGISKGYIIHGKGTGK
LASGVWEILRKDKRVVSFRFGTPSEGGTGVTVVEVKV
>Q5SHT5 3.1.-.-~~~mutS2~~~Endonuclease MutS2~~~COG1193
MRDVLEVLEFPRVRALLAERAKTPLGRELALALAPLPREEAEKRHELTGEALSYPYALPEAGTLREAYGRALAGARLSGP
ELLKAAKALEEAMALKEELLPLKNALSQVAEGIGDHTPFLERVRKALDEEGAVKDEASPRLAQIRRELRPLRQQILDRLY
ALMDRHREAFQDRFVTLRRERYCVPVRAGMAQKVPGILLDESESGATLFIEPFSVVKLNNRLQALRLKEEEEVNRILRDL
SERLAKDEGVPKTLEALGLLDLVQAQAALARDLGLSRPAFGERYELYRAFHPLIPDAVRNSFALDEKNRILLISGPNMGG
KTALLKTLGLAVLMAQSGLFVAAEKALLAWPDRVYADIGDEQSLQENLSTFAGHLRRLREMLEEATSHSLVLIDELGSGT
DPEEGAALSQAILEALLERGVKGMVTTHLSPLKAFAQGREGIQNASMRFDLEALRPTYELVLGVPGRSYALAIARRLALP
EEVLKRAEALLPEGGRLEALLERLEAERLALEAERERLRRELSQVERLRKALAEREARFEEERAERLKALEEEVRAELLK
VEAELKALKEKARTEGKRDALRELMALRERYAKKAPPPPPPPGLAPGVLVEVPSLGKRGRVVELRGEEVLVQVGPLKMSL
KPQEVKPLPEAEPGKPLLAKPRREVKEVDLRGLTVAEALLEVDQALEEARALGLSTLRLLHGKGTGALRQAIREALRRDK
RVESFADAPPGEGGHGVTVVALRP
>P23909 ~~~mutS~~~DNA mismatch repair protein MutS~~~COG0249
MSAIENFDAHTPMMQQYLRLKAQHPEILLFYRMGDFYELFYDDAKRASQLLDISLTKRGASAGEPIPMAGIPYHAVENYL
AKLVNQGESVAICEQIGDPATSKGPVERKVVRIVTPGTISDEALLQERQDNLLAAIWQDSKGFGYATLDISSGRFRLSEP
ADRETMAAELQRTNPAELLYAEDFAEMSLIEGRRGLRRRPLWEFEIDTARQQLNLQFGTRDLVGFGVENAPRGLCAAGCL
LQYAKDTQRTTLPHIRSITMEREQDSIIMDAATRRNLEITQNLAGGAENTLASVLDCTVTPMGSRMLKRWLHMPVRDTRV
LLERQQTIGALQDFTAGLQPVLRQVGDLERILARLALRTARPRDLARMRHAFQQLPELRAQLETVDSAPVQALREKMGEF
AELRDLLERAIIDTPPVLVRDGGVIASGYNEELDEWRALADGATDYLERLEVRERERTGLDTLKVGFNAVHGYYIQISRG
QSHLAPINYMRRQTLKNAERYIIPELKEYEDKVLTSKGKALALEKQLYEELFDLLLPHLEALQQSASALAELDVLVNLAE
RAYTLNYTCPTFIDKPGIRITEGRHPVVEQVLNEPFIANPLNLSPQRRMLIITGPNMGGKSTYMRQTALIALMAYIGSYV
PAQKVEIGPIDRIFTRVGAADDLASGRSTFMVEMTETANILHNATEYSLVLMDEIGRGTSTYDGLSLAWACAENLANKIK
ALTLFATHYFELTQLPEKMEGVANVHLDALEHGDTIAFMHSVQDGAASKSYGLAVAALAGVPKEVIKRARQKLRELESIS
PNAAATQVDGTQMSLLSVPEETSPAVEALENLDPDSLTPRQALEWIYRLKSLV
>Q5F5J4 ~~~mutS~~~DNA mismatch repair protein MutS~~~
MSKSAVSPMMQQYLGIKAQHTDKLVFYRMGDFYELFLDDAVEAAKLLDITLTTRGQMDGVPIKMAGVPFHAAEQYLARLV
KLGKSVAICEQVGEVGAGKGPVERKVVRIVTPGTLTDSALLEDKETNRIVAVSPDKKYIGLAWASLQSGEFKTKLTTADK
LNDELARLQAAEILLPDSKNAPQLQTASGVTRLNAWQFAADAGEKLLTEYFGCQDLRGFGLDSKEHAVSIGAAGALLNYI
RLTQNLMPQHLDGLSLETDSQYIGMDAATRRNLEITQTLSGKKTPTLFSILDGCATHMGSRLLALWLHHPLRNRAHIRAR
QEAVTALESQYEPLQCHLKSIADIERIAARIAVGNARPRDLASLRDSLFELAQIDLSATGSSLLETLKAVFPETLPVAET
LKAAVMPEPSVWLKDGNVINHGFHPELDELRRIQNHGDEFLLDLEAKERERTGLSTLKVEFNRVHGFYIELSKTQAEQAP
ADYQRRQTLKNAERFITPELKAFEDKVLTAQDQALALEKQLFDGVLKNLRTALPQLQKAAKAAAALDVLSTFSALAKERN
FVRPEFADYPVVHIENGRHPVVEQQVRHFTANHTDLDHKHRLMLLTGPNMGGKSTYMRQVALIVLLAHTGCFVPADAATI
GPVDQIFTRIGASDDLASNRSTFMVEMSETAYILHHATEQSIVLMDEVGRGTSTFDGLALAHAIAEHLLQKNKSFSLFAT
HYFELTYLPEAHAAAVNMHLSALEQGRDIVFLHQIQPGPAGKSYGIAVAKLAGLPVRALKAAQKHLNGLENQAAANRPQL
DIFSTMPSEKGDEPNVDCFVDKAEEKHFEGILAAALENLDPDSLTPREALSELYRLKDLCKSVS
>P0A1Y0 ~~~mutS~~~DNA mismatch repair protein MutS~~~
MNESFDKDFSNHTPMMQQYLKLKAQHPEILLFYRMGDFYELFYDDAKRASQLLDISLTKRGASAGEPIPMAGIPHHAVEN
YLAKLVNQGESVAICEQIGDPATSKGPVERKVVRIVTPGTISDEALLQERQDNLLAAIWQDGKGYGYATLDISSGRFRLS
EPADRETMAAELQRTNPAELLYAEDFAEMALIEGRRGLRRRPLWEFEIDTARQQLNLQFGTRDLVGFGVENASRGLCAAG
CLLQYVKDTQRTSLPHIRSITMERQQDSIIMDAATRRNLEITQNLAGGVENTLAAVLDCTVTPMGSRMLKRWLHMPVRNT
DILRERQQTIGALQDTVSELQPVLRQVGDLERILARLALRTARPRDLARMRHAFQQLPELHAQLETVDSAPVQALRKKMG
DFAELRDLLERAIIDAPPVLVRDGGVIAPGYHEELDEWRALADGATDYLDRLEIRERERTGLDTLKVGYNAVHGYYIQIS
RGQSHLAPINYVRRQTLKNAERYIIPELKEYEDKVLTSKGKALALEKQLYDELFDLLLPHLADLQQSANALAELDVLVNL
AERAWTLNYTCPTFTDKPGIRITEGRHPVVEQVLNEPFIANPLNLSPQRRMLIITGPNMGGKSTYMRQTALIALLAYIGS
YVPAQNVEIGPIDRIFTRVGAADDLASGRSTFMVEMTETANILHNATENSLVLMDEIGRGTSTYDGLSLAWACAENLANK
IKALTLFATHYFELTQLPEKMEGVANVHLDALEHGDTIAFMHSVQDGAASKSYGLAVAALAGVPKEVIKRARQKLRELES
ISPNAAATQVDGTQMSLLAAPEETSPAVEALENLDPDSLTPRQALEWIYRLKSLV
>Q56215 ~~~mutS~~~DNA mismatch repair protein MutS~~~
MEGMLKGEGPGPLPPLLQQYVELRDQYPDYLLLFQVGDFYECFGEDAERLARALGLVLTHKTSKDFTTPMAGIPLRAFEA
YAERLLKMGFRLAVADQVEPAEEAEGLVRREVTQLLTPGTLLQESLLPREANYLAAIATGDGWGLAFLDVSTGEFKGTVL
KSKSALYDELFRHRPAEVLLAPELLENGAFLDEFRKRFPVMLSEAPFEPEGEGPLALRRARGALLAYAQRTQGGALSLQP
FRFYDPGAFMRLPEATLRALEVFEPLRGQDTLFSVLDETRTAPGRRLLQSWLRHPLLDRGPLEARLDRVEGFVREGALRE
GVRRLLYRLADLERLATRLELGRASPKDLGALRRSLQILPELRALLGEEVGLPDLSPLKEELEAALVEDPPLKVSEGGLI
REGYDPDLDALRAAHREGVAYFLELEERERERTGIPTLKVGYNAVFGYYLEVTRPYYERVPKEYRPVQTLKDRQRYTLPE
MKEKEREVYRLEALIRRREEEVFLEVRERAKRQAEALREAARILAELDVYAALAEVAVRYGYVRPRFGDRLQIRAGRHPV
VERRTEFVPNDLEMAHELVLITGPNMAGKSTFLRQTALIALLAQVGSFVPAEEAHLPLFDGIYTRIGASDDLAGGKSTFM
VEMEEVALILKEATENSLVLLDEVGRGTSSLDGVAIATAVAEALHERRAYTLFATHYFELTALGLPRLKNLHVAAREEAG
GLVFYHQVLPGPASKSYGVEVAAMAGLPKEVVARARALLQAMAARREGALDAVLERLLALDPDRLTPLEALRLLQELKAL
ALGAPLDTMKG
>Q56239 ~~~mutS~~~DNA mismatch repair protein MutS~~~COG0249
MGGYGGVKMEGMLKGEGPGPLPPLLQQYVELRDRYPDYLLLFQVGDFYECFGEDAERLARALGLVLTHKTSKDFTTPMAG
IPIRAFDAYAERLLKMGFRLAVADQVEPAEEAEGLVRREVTQLLTPGTLTQEALLPREANYLAAIATGDGWGLAFLDVST
GEFKGTLLKSKSALYDELFRHRPAEVLLAPELRENEAFVAEFRKRFPVMLSEAPFEPQGEGPLALRRAQGALLAYARATQ
GGALSVRPFRLYDPGAFVRLPEASLKALEVFEPLRGQDTLFGVLDETRTAPGRRLLQAWLRHPLLERGPLEARLDRVERF
VREGALREGVRRLLFRLADLERLATRLELSRASPRDLAALRRSLEILPELKGLLGEEVGLPDLSGLLEELRAALVEDPPL
KVSEGGLIREGYDPDLDALRRAHAEGVAYFLDLEAREKERTGIPTLKVGYNAVFGYYLEVTRPYYEKVPQEYRPVQTLKD
RQRYTLPEMKERERELYRLEALIKRREEEVFLALRERARKEAEALREAARILAELDVYAALAEVAVRHGYTRPRFGERLR
IRAGRHPVVERRTAFVPNDLEMAHELVLVTGPNMAGKSTFLRQTALIALLAQIGSFVPAEEAELPLFDGIYTRIGASDDL
AGGKSTFMVEMEEVALVLKEATERSLVLLDEVGRGTSSLDGVAIATALAEALHERRCYTLFATHYFELTALALPRLKNLH
VAAKEEEGGLVFYHQVLPGPASKSYGVEVAEMAGLPKEVVERARALLSAMAARREGALEEVLERLLALDPDRLTPLEALR
FLHELKALALGLPLGSMKG
>A0QUZ2 3.6.1.69~~~mutT1~~~8-oxo-(d)GTP phosphatase~~~COG0406
MMPVDDLQEIPLSKDTTEKSKHTVRAAGAVLWRDASEHGGTTGHPATVEVAVIHRPRYDDWSLPKGKLDQGETEPVAAAR
EIHEETGHTAVLGRRLGRVTYPIPQGTKRVWYWAAKSTGGDFSPNDEVDKLVWLPVDAAMDQLQYPDDRKVLRRFVKRPV
DTKTVLVVRHGTAGRRSRYKGDDRKRPLDKRGRAQAEALVAQLMAFGATTLYAADRVRCHQTIEPLAQELDQLIHNEPLL
TEEAYAADHKAARKRLLEIAGRPGNPVICTQGKVIPGLIEWWCERAKVRPETTGNRKGSTWVLSLSDGELVGADYLSPPD
EK
>P9WIY3 3.6.1.69~~~mutT1~~~8-oxo-(d)GTP phosphatase~~~COG0406
MSIQNSSARRRSAGRIVYAAGAVLWRPGSADSEGPVEIAVIHRPRYDDWSLPKGKVDPGETAPVGAVREILEETGHRANL
GRRLLTVTYPTDSPFRGVKKVHYWAARSTGGEFTPGSEVDELIWLPVPDAMNKLDYAQDRKVLCRFAKHPADTQTVLVVR
HGTAGSKAHFSGDDSKRPLDKRGRAQAEALVPQLLAFGATDVYAADRVRCHQTMEPLAAELNVTIHNEPTLTEESYANNP
KRGRHRVLQIVEQVGTPVICTQGKVIPDLITWWCERDGVHPDKSRNRKGSTWVLSLSAGRLVTADHIGGALAANVRA
>P9WIY1 3.6.1.55~~~mutT2~~~Putative 8-oxo-dGTP diphosphatase 2~~~COG1051
MLNQIVVAGAIVRGCTVLVAQRVRPPELAGRWELPGGKVAAGETERAALARELAEELGLEVADLAVGDRVGDDIALNGTT
TLRAYRVHLLGGEPRARDHRALCWVTAAELHDVDWVPADRGWIADLARTLNGSAADVHRRC
>P9WIX9 3.6.1.55~~~mutT3~~~Putative 8-oxo-dGTP diphosphatase 3~~~COG0494
MPSCPPAYSEQVRGDGDGWVVSDSGVAYWGRYGAAGLLLRAPRPDGTPAVLLQHRALWSHQGGTWGLPGGARDSHETPEQ
TAVRESSEEAGLSAERLEVRATVVTAEVCGVDDTHWTYTTVVADAGELLDTVPNRESAELRWVAENEVADLPLHPGFAAS
WQRLRTAPATVPLARCDERRQRLPRTIQIEAGVFLWCTPGDADQAPSPLGRRISSLL
>P9WIX7 3.6.1.-~~~mutT4~~~Putative mutator protein MutT4~~~COG0494
MSDGEQAKSRRRRGRRRGRRAAATAENHMDAQPAGDATPTPATAKRSRSRSPRRGSTRMRTVHETSAGGLVIDGIDGPRD
AQVAALIGRVDRRGRLLWSLPKGHIELGETAEQTAIREVAEETGIRGSVLAALGRIDYWFVTDGRRVHKTVHHYLMRFLG
GELSDEDLEVAEVAWVPIRELPSRLAYADERRLAEVADELIDKLQSDGPAALPPLPPSSPRRRPQTHSRARHADDSAPGQ
HNGPGPGP
>P08337 3.6.1.55~~~mutT~~~8-oxo-dGTP diphosphatase~~~COG0494
MKKLQIAVGIIRNENNEIFITRRAADAHMANKLEFPGGKIEMGETPEQAVVRELQEEVGITPQHFSLFEKLEYEFPDRHI
TLWFWLVERWEGEPWGKEGQPGEWMSLVGLNADDFPPANEPVIAKLKRL
>P41354 3.6.1.55~~~mutX~~~8-oxo-dGTP diphosphatase~~~COG1051
MPQLATICYIDNGKELLMLHRNKKPNDVHEGKWIGVGGKLERGETPQECAAREILEETGLKAKPVLKGVITFPEFTPDLD
WYTYVFKVTEFEGDLIDCNEGTLEWVPYDEVLSKPTWEGDHTFVEWLLEDKPFFSAKFVYDGDKLLDTQVDFYE
>O31584 3.2.2.31~~~mutY~~~Adenine DNA glycosylase~~~COG1194
MNVLEDKLKQKDIQQFRDDLISWFEREQRVLPWREDQDPYKVWVSEVMLQQTRVETVIPYFLRFVEQFPTVEALADADEE
KVLKAWEGLGYYSRVRNLQSAVKEVKQEYGGIVPPDEKDFGGLKGVGPYTKGAVLSIAYNKPIPAVDGNVMRVMSRILSI
WDDIAKPKTRTIFEDAIRAFISKEKPSEFNQGLMELGALICTPKSPSCLLCPVQQHCSAFEEGTERELPVKSKKKKPGIK
TMAAIVLTDEDGQVYIHKRPSKGLLANLWEFPNLETQKGIKTEREQLIAFLENEYGIQADISDLQGVVEHVFTHLVWNIS
VFFGKVKQVSDTSKLKKVTKEELEQFAFPVSHQKIWKMAGEAAAISAAP
>P17802 3.2.2.31~~~mutY~~~Adenine DNA glycosylase~~~COG1194
MQASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDLANAPLDEVLHLWTGL
GYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEV
ENKLWSLSEQVTPAVGVERFNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQH
EDEVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHTFSHFHLDIVPMWLPVSSFTGCMDEGNA
LWYNLAQPPSVGLAAPVERLLQQLRTGAPV
>P83847 3.2.2.31~~~mutY~~~Adenine DNA glycosylase~~~
MTRETERFPAREFQRDLLDWFARERRDLPWRKDRDPYKVWVSEVMLQQTRVETVIPYFEQFIDRFPTLEALADADEDEVL
KAWEGLGYYSRVRNLHAAVKEVKTRYGGKVPDDPDEFSRLKGVGPYTVGAVLSLAYGVPEPAVDGNVMRVLSRLFLVTDD
IAKPSTRKRFEQIVREIMAYENPGAFNEALIELGALVCTPRRPSCLLCPVQAYCQAFAEGVAEELPVKMKKTAVKQVPLA
VAVLADDEGRVLIRKRDSTGLLANLWEFPSCETDGADGKEKLEQMVGEQYGLQVELTEPIVSFEHAFSHLVWQLTVFPGR
LVHGGPVEEPYRLAPEDELKAYAFPVSHQRVWREYKEWASGVRRPD
>A0R567 3.2.2.31~~~mutY~~~Adenine DNA glycosylase~~~COG1194
MSISPVELLSWYDHARRDLPWRRPGVSAWQILVSEFMLQQTPVSRVEPIWSAWIERWPTASATAAAGPAEVLRAWGKLGY
PRRAKRLHECAVVIASEYDDVVPRDVDTLLTLPGIGAYTARAVACFAYQASVPVVDTNVRRVVTRAVHGAADAPASTRDL
DMVAALLPPDTTAPTFSAALMELGATVCTARSPRCGICPLSHCRWRSAGFPAGTVARRVQRYAGTDRQVRGKLLDVLRDS
TTPVTRAQLDVVWLSDPAQRDRALDSLLVDGLVEQTADGRFALAGEGETGRPA
>P9WQ09 3.2.2.31~~~mutY~~~Adenine DNA glycosylase~~~COG1194
MPHILPEPSVTGPRHISDTNLLAWYQRSHRDLPWREPGVSPWQILVSEFMLQQTPAARVLAIWPDWVRRWPTPSATATAS
TADVLRAWGKLGYPRRAKRLHECATVIARDHNDVVPDDIEILVTLPGVGSYTARAVACFAYRQRVPVVDTNVRRVVARAV
HGRADAGAPSVPRDHADVLALLPHRETAPEFSVALMELGATVCTARTPRCGLCPLDWCAWRHAGYPPSDGPPRRGQAYTG
TDRQVRGRLLDVLRAAEFPVTRAELDVAWLTDTAQRDRALESLLADALVTRTVDGRFALPGEGF
>P13702 1.1.1.88~~~mvaA~~~3-hydroxy-3-methylglutaryl-coenzyme A reductase~~~
MSLDSRLPAFRNLSPAARLDHIGQLLGLSHDDVSLLANAGALPMDIANGMIENVIGTFELPYAVASNFQINGRDVLVPLV
VEEPSIVAAASYMAKLARANGGFTTSSSAPLMHAQVQIVGIQDPLNARLSLLRRKDEIIELANRKDQLLNSLGGGCRDIE
VHTFADTPRGPMLVAHLIVDVRDAMGANTVNTMAEAVAPLMEAITGGQVRLRILSNLADLRLARAQVRITPQQLETAEFS
GEAVIEGILDAYAFAAVDPYRAATHNKGIMNGIDPLIVATGNDWRAVEAGAHAYACRSGHYGSLTTWEKDNNGHLVGTLE
MPMPVGLVGGATKTHPLAQLSLRILGVKTAQALAEIAVAVGLAQNLGAMRALATEGIQRGHMALHARNIAVVAGARGDEV
DWVARQLVEYHDVRADRAVALLKQKRGQ
>Q9I4X0 ~~~mvfR~~~Multiple virulence factor regulator MvfR~~~
MPIHNLNHVNMFLQVIASGSISSAARILRKSHTAVSSAVSNLEIDLCVELVRRDGYKVEPTEQALRLIPYMRSLLNYQQL
IGDIAFNLNKGPRNLRVLLDTAIPPSFCDTVSSVLLDDFNMVSLIRTSPADSLATIKQDNAEIDIAITIDEELKISRFNQ
CVLGYTKAFVVAHPQHPLCNASLHSIASLANYRQISLGSRSGQHSNLLRPVSDKVLFVENFDDMLRLVEAGVGWGIAPHY
FVEERLRNGTLAVLSELYEPGGIDTKVYCYYNTALESERSFLRFLESARQRLRELGRQRFDDAPAWQPSIVETAQRRSGP
KALAYRQRAAPE
>P9WJK3 ~~~mviN~~~Probable peptidoglycan biosynthesis protein MviN~~~COG0728
MRPSPGEVPTASQRQPELSDAALVSHSWAMAFATLISRITGFARIVLLAAILGAALASSFSVANQLPNLVAALVLEATFT
AIFVPVLARAEQDDPDGGAAFVRRLVTLATTLLLGATTLSVLAAPLLVRLMLGTNPQVNEPLTTAFAYLLLPQVLVYGLS
SVFMAILNTRNVFGPPAWAPVVNNVVAIATLAVYLAVPGELSVDPVRMGNAKLLVLGIGTTAGVFAQTAVLLVAIRREHI
SLRPLWGIDQRLKRFGAMAAAMVLYVLISQLGLVVGNRIASTAAASGPAIYNYTWLVLMLPFGMIGVTVLTVVMPRLSRN
AAADDTPAVLADLSLATRLTMITLIPTVAFMTVGGPAIGSALFAYGNFGDVDAGYLGAAIALSAFTLIPYALVLLQLRVF
YAREQPWTPITIIVVITGVKILGSLLAPHITGDPQLVAAYLGLANGLGFLAGTIVGYYILRRALRPDGGQLIGVGEARTV
LVTVAASLLAGLLAHVADRLLGLSELTAHAGSVGSLLRLSVLALIMLPILAAVTLCARVPEARAALDAVRARIRSRRLKT
GPQTQNVLDQSSRPGPVTYPERRRLAPPRGKSVVHEPIRRRPPEQVARAGRAKGPEVIDRPSENASFGAASGAELPRPVA
DELQLDAPAGRDPGPVSRPHPSDLQNGDLPADAARGPIAFDALREPDRESSAPPDDVQLVPGARIANGRYRLLIFHGGVP
PLQFWQALDTALDRQVALTFVDPQGVLPDDVLQETLSRTLRLSRIDKPGVARVLDVVHTRAGGLVVAEWIRGGSLQEVAD
TSPSPVGAIRAMQSLAAAADAAHRAGVALSIDHPSRVRVSIDGDVVLAYPATMPDANPQDDIRGIGASLYALLVNRWPLP
EAGVRSGLAPAERDTAGQPIEPADIDRDIPFQISAVAARSVQGDGGIRSASTLLNLMQQATAVADRTEVLGPIDEAPVSA
APRTSAPNSETYTRRRRNLLIGIGAGAAVLMVALLVLASVLSRIFGDVSGGLNKDELGLNAPTASTSAASSAPPGSVVKP
TKVTVFSPDGGADNPGEADLAIDGNPATSWKTDIYTDPVPFPSFKNGVGLMLQLPQATVVGTVAIDVASTGTKVEIRSAS
TPTPATLEDTAVLTSATALRPGHNTISVEAAAPTSNLLVWISTLGTTDGKSQADISEITIYAAS
>Q9RHG4 3.2.1.-~~~mvl~~~Lectin MVL~~~
MASYKVNIPAGPLWSNAEAQQVGPKIAAAHQGNFTGQWTTVVESAMSVVEVELQVENTGIHEFKTDVLAGPLWSNDEAQK
LGPQIAASYGAEFTGQWRTIVEGVMSVIQIKYTF
>P0A1I5 ~~~mxiA~~~Protein MxiA~~~
MIQSFLKQVSTKPELIILVLMVMIIAMLIIPLPTYLVDFLIGLNIVLAILVFMGSFYIERILSFSTFPSVLLITTLFRLA
LSISTSRLILVDADAGKIITTFGQFVIGDSLAVGFVIFSIVTVVQFIVITKGSERVAEVAARFSLDGMPGKQMSIDADLK
AGIIDAAGAKERRSILERESQLYGSFDGAMKFIKGDAIAGIIIIFVNLIGGISVGMSQHGMSLSGALSTYTILTIGDGLV
SQIPALLISISAGFIVTRVNGDSDNMGRNIMSQIFGNPFVLIVTSALALAIGMLPGFPFFVFFLIAVTLTALFYYKKVVE
KEKSLSESDSSGYTGTFDIDNSHDSSLAMIENLDAISSETVPLILLFAENKINANDMEGLIERIRSQFFIDYGVRLPTIL
YRTSNELKVDDIVLLINEVRADSFNIYFDKVCITDENGDIDALGIPVVSTSYNERVISWVDVSYTENLTNIDAKIKSAQD
EFYHQLSQALLNNINEIFGIQETKNMLDQFENRYPDLLKEVFRHVTIQRISEVLQRLLGENISVRNLKLIMESLALWAPR
EKDVITLVEHVRASLSRYICSKIAVSGEIKVVMLSGYIEDAIRKGIRQTSGGSFLNMDIEVSDEVMETLAHALRELRNAK
KNFVLLVSVDIRRFVKRLIDNRFKSILVISYAEIDEAYTINVLKTI
>Q04640 ~~~mxiC~~~Protein MxiC~~~
MLDVKNTGVFSSAFIDRLNAMTNSDDGDETADAELDSGLANSKYIDSSDEMASALSSFINRRDLEKLKGTNSDSQERILD
GEEDEINHKIFDLKRTLKDNLPLDRDFIDRLKRYFKDPSDQVLALRELLNEKDLTAEQVELLTKIINEIISGSEKSVNAG
INSAIQAKLFGNKMKLEPQLLRACYRGFIMGNISTTDQYIEWLGNFGFNHRHTIVNFVEQSLIVDMDSEKPSCNAYEFGF
VLSKLIAIKMIRTSDVIFMKKLESSSLLKDGSLSAEQLLLTLLYIFQYPSESEQILTSVIEVSRASHEDSVVYQTYLSSV
NESPHDIFKSESEREIAINILRELVTSAYKKELSR
>P0A221 ~~~mxiG~~~Protein MxiG~~~
MSEAKNSNLAPFRLLVKLTNGVGDEFPLYYGNNLIVLGRTIETLEFGNDNFPENIIPVTDSKSDGIIYLTISKDNICQFS
DEKGEQIDINSQFNSFEYDGISFHLKNMREDKSRGHILNGMYKNHSVFFFFAVIVVLIIIFSLSLKKDEVKEIAEIIDDK
RYGIVNTGQCNYILAETQNDAVWASVALNKTGFTKCRYILVSNKEINRIQQYINQRFPFINLYVLNLVSDKAELLVFLSK
ERNSSKDTELDKLKNALIVEFPYIKNIKFNYLSDHNARGDAKGIFTKVNVQYKEICENNKVTYSVREELTDEKLELINRL
ISEHKNIYGDQYIEFSVLLIDDDFKGKSYLNSKDSYVMLNDKHWFFLDKNK
>P0A225 ~~~mxiI~~~Protein MxiI~~~
MNYIYPVNQVDIIKASDFQSQEISSLEDVVSAKYSDIKMDTDIQVSQIMEMVSNPESLNPESLAKLQTTLSNYSIGVSLA
GTLARKTVSAVETLLKS
>Q06081 ~~~mxiJ~~~Lipoprotein MxiJ~~~
MIRYKGFILFLLLMLIGCEQREELISNLSQRQANEIISVLERHNITARKVDGGKQGISVQVEKGTFASAVDLMRMYDLPN
PERVDISQMFPTDSLVSSPRAEKARLYSAIEQRLEQSLVSIGGVISAKIHVSYDLEEKNISSKPMHISVIAIYDSPKESE
LLVSNIKRFLKNTFSDVKYENISVILTPKEEYVYTNVQPVKEVKSEFLTNEVIYLFLGMAVLVVILLVWAFKTGWFKRNK
I
>P16946 ~~~ennX~~~Virulence factor-related M protein~~~
MARQQTKKNYSLRKLKTGTASVAVALTVLGAGFANQTEVRAEGVNATTSLTEKAKYDALKDENTGLRGDQTKLVKKLEEE
QEKSKNLEKEKQKLENQALNFQDVIETQEKEKEDLKTTLAKATKENEISEASRKGLSRDLEASRAAKKELEAKHQKLEAE
NKKLTEANQVSEASRKGLSNDLEASRAAKKELEAKYQKLETDHQALEAKHQKLEADLPKFQRPSRKGLSRDLEASREANK
KVTSELTQAKAQLSALEESKKLSEKEKAELQAKLDAQGKALKEQLAKQTEELAKLRAEKAAGSKTPATKPANKERSGRAA
QTATRPSQNKGMRSQLPSTGEAANPFFTAAAATVMVSAGMLALKRKEEN
>Q9R9J1 2.3.1.-~~~mycA~~~Mycosubtilin synthase subunit A~~~
MYTSQFQTLVDVIRNRSNISDRGIRFIESDKIETFVSYRQLFDEAQGFLGYLQHIGIQPKQEIVFQIQENKSFVVAFWAC
LLGGMIPVPVSIGEDNDHKLKVWRIWNILNNPFLLASETVLDKMKKFAADHDLQDFHHQLIEKSDIIQDRIYDHPASQYE
PEADELAFIQFSSGSTGDPKGVMLTHHNLIHNTCAIRNALAIDLKDTLLSWMPLTHDMGLIACHLVPALAGINQNLMPTE
LFIRRPILWMKKAHEHKASILSSPNFGYNYFLKFLKDNKSYDWDLSHIRVIANGAEPILPELCDEFLTRCAAFNMKRSAI
LNVYGLAEASVGATFSNIGERFVPVYLHRDHLNLGERAVEVSKEDQNCASFVEVGKPIDYCQIRICNEANEGLEDGFIGH
IQIKGENVTQGYYNNPESTNRALTPDGWVKTGDLGFIRKGNLVVTGREKDIIFVNGKNVYPHDIERVAIELEDIDLGRVA
ACGVYDQETRSREIVLFAVYKKSADRFAPLVKDIKKHLYQRGGWSIKEILPIRKLPKTTSGKVKRYELAEQYESGKFALE
STKIKEFLEGHSTEPVQTPIHEIETALLSIFSEVMDGKKIHLNDHYFDMGATSLQLSQIAERIEQKFGCELTVADLFTYP
SIADLAAFLVENHSEIKQTDTAKPSRSSSKDIAIIGMSLNVPGASNKSDFWHLLENGEHGIREYPAPRVKDAIDYLRSIK
SERNEKQFVRGGYLDEIDRFDYSFFGLAPKTAKFMDPNQRLFLQSAWHAIEDAGYAGDTISGSQLGVYVGYSKVGYDYER
LLSANYPEELHHYIVGNLPSVLASRIAYFLNLKGPAVTVDTACSSSLVAVHMACKALLTGDCEMALAGGIRTSLLPMRIG
LDMESSDGLTKTFSKDSDGTGSGEGVAAVLLKPLQAAIRDGDHIYGVIKGSAINQDGTTVGITAPSPAAQTEVIEMAWKD
AGIAPETLSFIEAHGTGTKLGDPVEFNGLCKAFEKVTEKKQFCAIGSVKANIGHLFEAAGIVGLIKSALMLNHKKIPPLA
HFNKPNPLIPFHSSPFYVNQEVMDFTPEDRPLRGGISSFGFSGTNAHVVLEEYTPESEYAPEDGNDPHLFVLSAHTEASL
YELTHQYRQYISDDSQSSLRSICYTASTGRAHLDYCLAMIVSSNQELIDKLTSLIQGERNLPQVHFGYKNIKEMQPAEKD
NLSKQISDLMQHRPCTKDERITWLNRIAELYVQRAVIDWRAVYSNEVVQKTPLPLYPFERNRCWVEAVYESAKERKEKGE
VALDINHTKTHIESFLKTVISNASGIRADEIDSNAHFIGFGLDSIMLTQVKKAIADEFNVDIPMERFFDTMNNIESVVDY
LAENVPSAASTPPQESVTAQEELVISGAQPELEHQEHMLDKIIASQNQLIQQTLQAQLDSFNLLRNNSHFVSKESEISQD
KTSLSPKSVTAKKNSAQEAKPYIPFQRQTLNEQVNYTPQQRQYLESFIEKYVDKTKGSKQYTDETRFAHANNRNLSSFRS
YWKEMVYPIIAERSDGSRMWDIDGNEYIDITMGFGVNLFGHHPSFITQTVVDSTHSALPPLGPMSNVAGEVADRIRACTG
VERVAFYNSGTEAVMVALRLARAATGRTKVVVFAGSYHGTFDGVLGVANTKGGAEPANPLAPGIPQSFMNDLIILHYNHP
DSLDVIRNLGNELAAVLVEPVQSRRPDLQPESFLKELRAITQQSGTALIMDEIITGFRIGLGGAQEWFDIQADLVTYGKI
IGGGQPLGIVAGKAEFMNTIDGGTWQYGDDSYPTDEAKRTFVAGTFNTHPLTMRMSLAVLRYLQAEGETLYERLNQKTTY
LVDQLNSYFEQSQVPIRMVQFGSLFRFVSSVDNDLFFYHLNYKGVYVWEGRNCFLSTAHTSDDIAYIIQAVQETVKDLRR
GGFIPEGPDSPNDGGHKEPETYELSPEQKQLAVVSQYGNDASAALNQSIMLKVKGAVQHTLLKQAVRNIVKRHDALRTVI
HVDDEVQQVQARINVEIPIIDFTGYPNEQRESEVQKWLTEDAKRPFHFHEQKPLFRVHVLTSKQDEHLIVLTFHHIIADG
WSIAVFVQELESTYAAIVQGSPLPSHEVVSFRQYLDWQQAQIENGHYEEGIRYWRQYLSEPIPQAILTSMSSSRYPHGYE
GDRYTVTLDRPLSKAIKSLSIRMKNSVFATILGAFHLFLQQLTKQAGLVIGIPTAGQLHMKQPMLVGNCVNMVPVKNTAS
SESTLADYLGHMKENMDQVMRHQDVPMTLVASQLPHDQMPDMRIIFNLDRPFRKLHFGQMEAELIAYPIKCISYDLFLNV
TEFDQEYVLDFDFNTSVISSEIMNKWGTGFVNLLKKMVEGDSASLDSLKMFSKEDQHDLLELYADHQLRISSTLDHKGVR
AVYEEPENETELQIAQIWAELLGLEKVGRSDHFLSLGGNSLKATLMLSKIQQTFNQKVSIGQFFSHQTVKELANFIRGEK
NVKYPPMKPVEQKAFYRTSPAQQRVYFLHQMEPNQVSQNMFGQISIIGKYDEKALIASLQQVMQRHEAFRTSFHIIDGEI
VQQIAGELDFNVRVHSMDREEFEAYADGYVKPFRLEQAPLVRAELIKVDNEQAELLIDMHHIISDGYSMSILTNELFALY
HGNPLPEIPFEYKDFAEWQNQLLIGEVMEQQEEYWLEQFKQEVPILQLPADGSRAMEWSSEGQRVTCSLQSSLIRSLQEM
AQQKGTTLYMVLLAAYNVLLHKYTGQEDIVVGTPVSGRNQPNIESMIGIFIQTMGIRTKPQANKRFTDYLDEVKRQTLDA
FENQDYPFDWLVEKVNVQRETTGKSLFNTMFVYQNIEFQEIHQDGCTFRVKERNPGVSLYDLMLTIEDAEKQLDIHFDFN
PNQFEQETIEQIIRHYTSLLDSLVKEPEKSLSSVPMLSDIERHQLLMGCNDTETPFPHNDTVCQWFETQAEQRPDDEAVI
FGNERCTYGQLNERVNQLARTLRTKGVQADQFVAIICPHRIELIVGILAVLKAGGAYVPIDPEYPEDRIQYMLKDSEAKI
VLAQLDLHKHLTFDADVVLLDEESSYHEDRSNLEPTCGANDLAYMIYTSGSTGNPKGVLIEHRGLANYIEWAKEVYVNDE
KTNFPLYSSISFDLTVTSIFTPLVTGNTIIVFDGEDKSAVLSTIMQDPRIDIIKLTPAHLHVLKEMKIADGTTIRKMIVG
GENLSTRLAQSVSEQFKGQLDIFNEYGPTEAVVGCMIYRYDTKRDRREFVPIGSPAANTSIYVLDASMNLVPVGVPGEMY
IGGAGVARGYWNRPDLTAEKFVHNPFAPGTIMYKTGDLAKRLRDGNLIYLGRIDEQVKIRGHRIELGEVEAAMHKVEAVQ
KAVVLAREEEDGLQQLCAYYVSNKPITIAEIREQLSLELPDYMVPSHYIQLEQLPLTSNGKINRKALPAPEVSLEQIAEY
VPPGNEVESKLAVLWQEMLGIHRVGIKHNFFDLGGNSIRATALAARIHKELDVNLSVKDIFKFPTIEQLANMALRMEKIR
YVSIPSAQKISYYPVSSAQKRMYLLSHTEGGELTYNMTGAMSVEGAIDLERLTAAFQKLIERHEVLRTSFELYEGEPAQR
IHPSIEFTIEQIQAREEEVEDHVLDFIKSFDLAKPPLMRVGLIELTPEKHVLLVDMHHIISDGVSMNILMKDLNQFYKGI
EPDPLPIQYKDYAVWQQTEAQRQNIKKQEAYWLNRFHDEIPVLDMPTDYERPAIRDYEGESFEFLIPIELKQRLSQMEEA
TGTTLYMILMAAYTILLSKYSGQEDIVVGTPVSGRSHMDVESVVGMFVNTLVIRNHPAGRKIFEDYLNEVKENMLNAYQN
QDYPLEELIQHVHLLKDSSRNPLFDTMFVLQNLDQVELNLDSLRFTPYKLHHTVAKFDLTLSIQTDQDKHHGLFEYSKKL
FKKSRIEALSKDYLHILSVISQQPSIQIEHIELSGSTAEDDNLIHSIELNF
>Q83WF5 1.14.-.-~~~mycCI~~~Mycinamicin VIII C21 methyl hydroxylase~~~
MVVWPMDRTCAWALPEQYAEFRQRATLVPAKVWDGSPTWLVSRYEHVRALLVDPRVTVDPTRQPRLSEADGDGDGFRSML
MLDPPEHTRLRRMFISAFSVRQVETMRPEIEKIVDGILDRLLALEPPVDILTHLALPMSTQVICHLLGVPYEDREFFQER
SELASRPNDDRSMPALIELVEYLDGLVRTKTAHPDTGLLGTAVTERLLKGEITHQELVNNAVLLLAAGHETSANQVTLSV
LTLLRHPETAAELREQPELMPNAVDELLRYHSIADGLRRAATADIVLGDHTIRAGDGLIILLSSANHDGNTFGAEATFDI
HRPARHHVAFGYGPHQCLGQNLARLEMEVTLGKLFRRVPALRLAQEPDALRVRQGSPIFGIDELLVEW
>Q83WF2 2.1.1.238~~~mycE~~~Mycinamicin VI 2''-O-methyltransferase~~~
MTAQTEFDEATVQDVVRLAGGHDSELRELTQKYDPAMISRLLVAEILSRCPPPSNDTPVLVELAIVHGSERFRHFLRVVR
DSPIRPVGADEGFVGMLVEYELTELLRELFGVTHERPAGVRGTKLFPYLTDDEEAVEQIGTYLLAAQQGTEAVLAGCGSR
KPDLSELSSRYFTPKFGFLHWFTPHYDRHFRDYRNQQVRVLEIGVGGYKHPEWGGGSLRMWKSFFPRGQIYGLDIMDKSH
VDELRIRTIQGDQNDAEFLDRIARRYGPFDIVIDDGSHINAHVRTSFAALFPHVRPGGLYVIEDMWTAYWPGFGGQADPQ
ECSGTSLGLLKSLIDAIQHQELPSDPNRSPGYVDRNIVGLHVYHNVAFVEKGRNDEGGIPTWIPRDFESLVQASSGGAT
>Q49492 2.1.1.237~~~mycF~~~Mycinamicin III 3''-O-methyltransferase~~~
MSPSTGVELYLDLLKRTVSNFIYQDATHVAGLITEAAFVEEARESGEDYPTVAHTMIGMKRLNNLQHCVESALRDGVPGD
VLETGVWRGGACIFARGILKAYDVRDRTVWVADSFQGFPKITDDDHPMDAEMNLHQYNEAVDLPTSLATVQRNFSRYGLL
DDQVRFLPGWFKDTMPTAPFERLAVLRMDGDSYGATMDVLTHAYPRLSPGGFAIIDDYCIPACREAVHEYRDRHGISDEI
VEIDRQGVYWRRSA
>Q59523 1.14.-.-~~~mycG~~~Mycinamicin IV hydroxylase/epoxidase~~~
MTSAEPRAYPFNDVHGLTLAGRYGELQETEPVSRVRPPYGEEAWLVTRYEDVRAVLGDGRFVRGPSMTRDEPRTRPEMVK
GGLLSMDPPEHSRLRRLVVKAFTARRAESLRPRAREIAHELVDQMAATGQPADLVAMFARQLPVRVICELLGVPSADHDR
FTRWSGAFLSTAEVTAEEMQEAAEQAYAYMGDLIDRRRKEPTDDLVSALVQARDQQDSLSEQELLDLAIGLLVAGYESTT
TQIADFVYLLMTRPELRRQLLDRPELIPSAVEELTRWVPLGVGTAFPRYAVEDVTLRGVTIRAGEPVLASTGAANRDQAQ
FPDADRIDVDRTPNQHLGFGHGVHHCLGAPLARVELQVALEVLLQRLPGIRLGIPETQLRWSEGMLLRGPLELPVVW
>P20910 3.4.24.31~~~npr~~~Mycolysin~~~
MPMFRIRLPKPAALIAAGGIGACIATVAVPSAYAAAPAPADSRLGVTASLDRLPSIGERSTLTVNITAETDVKRAGLSLQ
LPPALRIVDRNPSLAAPTTDPFGQRTSRTLTLTPGTRTIELEVKAVATGPAQIQADISDIDRPDPRRAGHASVELTIGKA
KGSTAKGMSVTKADATPVSGPTTPGRPVRPGAAAPAAPSTPPQAAAPGKRYAPVCVSGTLRNRFQSSEGGSWNSKASRDL
PVSNANVTLWGRATAGGGTQKLAAGLTGNGDGTFKLCYTPSTSVTSQVWAEFQTQAGTMWSVVDGYNRRYSTTSNALSNV
SGNKSLGTVYANSGQSRAWHAFDTLNKLWWDRGSTSTCWTSNQRDGRCTPITVQWYPGSTDGTYWTNRDDKVHLADSDPD
SGHTTVHEAGHSLMGKLYNGWWPYVTNCSPHYINRTSSTTCGWTEGFADAVAFHTFKDTVMTWGNGSSMNLANDRGTRGM
DWGDACEARVGTALVDLWSQVDGGWTKSNTMMSRERQSTFREYFLTDRPAYGLDSGPKARNILYGHTIQY
>A0QNL1 3.4.21.-~~~mycP1~~~Mycosin-1~~~COG1404
MQRVAVMVLAVLLALFSAPPAWAIDPPVIDAGAVPPDETGPDQPTEQRKICATPTVMPNSNFADRPWANDYLRIQEAQKF
ATGAGVTVAVIDTGVNGSPRVPAEPGGDFVDAAGNGMSDCDAHGTMTAAIIGGRPSPTDGFVGMAPDVRLLSLRQTSVAF
QPKGARQDPNDPNTTQTAGSIRSLARSVVHAANLGAQVINISEAACYKVTRRIDETSLGAAINYAVNVKGAVIVVAAGNT
GQDCSQNPPPDPSVPSDPRGWREVQTIVSPAWYDPLVLTVGSIGQNGQPSNFSMSGPWVGAAAPGENLTSLGYDGQPVNA
TPGEDGPVPLNGTSFSAAYVSGLAALVKQRFPDLTPAQIINRITATARHPGGGVDNYVGAGVIDPVAALTWEIPDGPEKA
PFRVKEVPPPVYIPPPDRGPITAVVIAGATLAFALGIGALARRALRRKQ
>O05461 3.4.21.-~~~mycP1~~~Mycosin-1~~~COG1404
MHRIFLITVALALLTASPASAITPPPIDPGALPPDVTGPDQPTEQRVLCASPTTLPGSGFHDPPWSNTYLGVADAHKFAT
GAGVTVAVIDTGVDASPRVPAEPGGDFVDQAGNGLSDCDAHGTLTASIIAGRPAPTDGFVGVAPDARLLSLRQTSEAFEP
VGSQANPNDPNATPAAGSIRSLARAVVHAANLGVGVINISEAACYKVSRPIDETSLGASIDYAVNVKGVVVVVAAGNTGG
DCVQNPAPDPSTPGDPRGWNNVQTVVTPAWYAPLVLSVGGIGQTGMPSSFSMHGPWVDVAAPAENIVALGDTGEPVNALQ
GREGPVPIAGTSFAAAYVSGLAALLRQRFPDLTPAQIIHRITATARHPGGGVDDLVGAGVIDAVAALTWDIPPGPASAPY
NVRRLPPPVVEPGPDRRPITAVALVAVGLTLALGLGALARRALSRR
>O05458 3.4.21.-~~~mycP2~~~Mycosin-2~~~COG1404
MASPLNRPGLRAAAASAALTLVALSANVPAAQAIPPPSVDPAMVPADARPGPDQPMRRSNSCSTPITVRNPDVAQLAPGF
NLVNISKAWQYSTGNGVPVAVIDTGVSPNPRLPVVPGGDYIMGEDGLSDCDAHGTVVSSIIAAAPLGILPMPRAMPATAA
FPPPAGPPPVTAAPAPPVEVPPPMPPPPPVTITQTVAPPPPPPEDAGAMAPSNGPPDPQTEDEPAVPPPPPGAPDGVVGV
APHATIISIRQSSRAFEPVNPSSAGPNSDEKVKAGTLDSVARAVVHAANMGAKVINISVTACLPAAAPGDQRVLGAALWY
AATVKDAVIVAAAGNDGEAGCGNNPMYDPLDPSDPRDWHQVTVVSSPSWFSDYVLSVGAVDAYGAALDKSMSGPWVGVAA
PGTHIMGLSPQGGGPVNAYPPSRPGEKNMPFWGTSFSAAYVSGVAALVRAKFPELTAYQVINRIVQSAHNPPAGVDNKLG
YGLVDPVAALTFNIPSGDRMAPGAQSRVITPAAPPPPPDHRARNIAIGFVGAVATGVLAMAIGARLRRAR
>O53695 3.4.21.-~~~mycP3~~~Mycosin-3~~~COG1404
MIRAAFACLAATVVVAGWWTPPAWAIGPPVVDAAAQPPSGDPGPVAPMEQRGACSVSGVIPGTDPGVPTPSQTMLNLPAA
WQFSRGEGQLVAIIDTGVQPGPRLPNVDAGGDFVESTDGLTDCDGHGTLVAGIVAGQPGNDGFSGVAPAARLLSIRAMST
KFSPRTSGGDPQLAQATLDVAVLAGAIVHAADLGAKVINVSTITCLPADRMVDQAALGAAIRYAAVDKDAVIVAAAGNTG
ASGSVSASCDSNPLTDLSRPDDPRNWAGVTSVSIPSWWQPYVLSVASLTSAGQPSKFSMPGPWVGIAAPGENIASVSNSG
DGALANGLPDAHQKLVALSGTSYAAGYVSGVAALVRSRYPGLNATEVVRRLTATAHRGARESSNIVGAGNLDAVAALTWQ
LPAEPGGGAAPAKPVADPPVPAPKDTTPRNVAFAGAAALSVLVGLTAATVAIARRRREPTE
>I6YC58 3.4.21.-~~~mycP4~~~Mycosin-4~~~COG1404
MTTSRTLRLLVVSALATLSGLGTPVAHAVSPPPIDERWLPESALPAPPRPTVQREVCTEVTAESGRAFGRAERSAQLADL
DQVWRLTRGAGQRVAVIDTGVARHRRLPKVVAGGDYVFTGDGTADCDAHGTLVAGIIAAAPDAQSDNFSGVAPDVTLISI
RQSSSKFAPVGDPSSTGVGDVDTMAKAVRTAADLGASVINISSIACVPAAAAPDDRALGAALAYAVDVKNAVIVAAAGNT
GGAAQCPPQAPGVTRDSVTVAVSPAWYDDYVLTVGSVNAQGEPSAFTLAGPWVDVAATGEAVTSLSPFGDGTVNRLGGQH
GSIPISGTSYAAPVVSGLAALIRARFPTLTARQVMQRIESTAHHPPAGWDPLVGNGTVDALAAVSSDSIPQAGTATSDPA
PVAVPVPRRSTPGPSDRRALHTAFAGAAICLLALMATLATASRRLRPGRNGIAGD
>O53945 3.4.21.-~~~mycP5~~~Mycosin-5~~~COG1404
MQRFGTGSSRSWCGRAGTATIAAVLLASGALTGLPPAYAISPPTIDPGALPPDGPPGPLAPMKQNAYCTEVGVLPGTDFQ
LQPKYMEMLNLNEAWQFGRGDGVKVAVIDTGVTPHPRLPRLIPGGDYVMAGGDGLSDCDAHGTLVASMIAAVPANGAVPL
PSVPRRPVTIPTTETPPPPQTVTLSPVPPQTVTVIPAPPPEEGVPPGAPVPGPEPPPAPGPQPPAVDRGGGTVTVPSYSG
GRKIAPIDNPRNPHPSAPSPALGPPPDAFSGIAPGVEIISIRQSSQAFGLKDPYTGDEDPQTAQKIDNVETMARAIVHAA
NMGASVINISDVMCMSARNVIDQRALGAAVHYAAVDKDAVIVAAAGDGSKKDCKQNPIFDPLQPDDPRAWNAVTTVVTPS
WFHDYVLTVGAVDANGQPLSKMSIAGPWVSISAPGTDVVGLSPRDDGLINAIDGPDNSLLVPAGTSFSAAIVSGVAALVR
AKFPELSAYQIINRLIHTARPPARGVDNQVGYGVVDPVAALTWDVPKGPAEPPKQLSAPLVVPQPPAPRDMVPIWVAAGG
LAGALLIGGAVFGTATLMRRSRKQQ
>P33406 ~~~myfA~~~Fimbrial protein MyfA~~~
MNMKKFVKKPLAIAVLMLASGGMVNMVHAEPTVINSKDISATKTVKEGGSFSVEFKATENEIVSGKLDADTPAFHLVMSD
SGEHKGWNVRPTGASEGGQMVSADGTRVDLHTNELSWDNDHWWIDDGSERVEATFFLAAGDEVKAGEYQFTGRVEEYVE
>P9WNF7 1.14.13.-~~~mymA~~~Putative FAD-containing monooxygenase MymA~~~COG2072
MNQHFDVLIIGAGLSGIGTACHVTAEFPDKTIALLERRERLGGTWDLFRYPGVRSDSDMFTFGYKFRPWRDVKVLADGAS
IRQYIADTATEFGVDEKIHYGLKVNTAEWSSRQCRWTVAGVHEATGETRTYTCDYLISCTGYYNYDAGYLPDFPGVHRFG
GRCVHPQHWPEDLDYSGKKVVVIGSGATAVTLVPAMAGSNPGSAAHVTMLQRSPSYIFSLPAVDKISEVLGRFLPDRWVY
EFGRRRNIAIQRKLYQACRRWPKLMRRLLLWEVRRRLGRSVDMSNFTPNYLPWDERLCAVPNGDLFKTLASGAASVVTDQ
IETFTEKGILCKSGREIEADIIVTATGLNIQMLGGMRLIVDGAEYQLPEKMTYKGVLLENAPNLAWIIGYTNASWTLKSD
IAGAYLCRLLRHMADNGYTVATPRDAQDCALDVGMFDQLNSGYVKRGQDIMPRQGSKHPWRVLMHYEKDAKILLEDPIDD
GVLHFAAAAQDHAAA
>P9WK09 ~~~mymT~~~Metallothionein~~~
MRVIRMTNYEAGTLLTCSHEGCGCRVRIEVPCHCAGAGDAYRCTCGDELAPVK
>P42615 ~~~mzrA~~~Modulator protein MzrA~~~
MQIPRMSLRQLAWSGAVLLLVGTLLLAWSAVRQQESTLAIRAVHQGTTMPDGFSIWHHLDAHGIPFKSITPKNDTLLITF
DSSDQSAAAKAVLDRTLPHGYIIAQQDNNSQAMQWLTRLRDNSHRFG
>A6TED5 ~~~mzrA~~~Modulator protein MzrA~~~
MMVMKRPSLRQFSWLLGGSLLLGALFWLWLAVQQQEATLAIRPVGQGIGMPDGFSVWHHLDANGIRFKSITPQKDGLLIK
FDSTAQGAAAKEVLGRALPHGYIIALLEDDNSPTAWLSRLRDAPHRLG
>D3WZ85 3.5.99.8~~~naaA~~~5-nitroanthranilic acid aminohydrolase~~~
MAGSNDVAKVMKTLDGMREGLIQTAVELGSIEAPTGREGAAGDYVYEWMARNGFGPERVGVFDDRFNVVGRLRGTGGGAS
LSFNSHLDTIMAREDTARFADANDRIYHEAWHEEGRIYGYSVVNCKGPMACWLIAAKALKEAGAALKGDVVLTAVCGEID
CEPVDEFQGHDYLAEDIGARYAISHGAISDYALVAEATNFKPAWVEAGKVFLKVTVFAGPSRYTPYVPRPVAALDSPNAI
VRMAKLVEALEEWADNYEKRYTREYGGGTVVPKVAIGAIRGGVPYKIYRFPELCSIYMDIRLNPDTNPLVVQREVEAVVS
KLGLKAEVKPFLFRRGYEAQGIEPLQNALEVAHREVVGRPTERPGSPECSMWRDTNPYNELGIPSLTYGCGGGAGGGNTY
FLVDDMLKAAKVYAMTAMDLCNRTP
>D3WZ86 1.13.11.64~~~naaB~~~5-nitrosalicylic acid 1,2-dioxygenase~~~
MKWSNKDGYPWSKIIHAEKFFDKVIQNDTRPGKWEWADVVSGLRDLDKDPRMNSERRYVAIVNEDVGLGETKGIGITPGL
FCGCQLIHPGEEVTSHRHNSVALYFIVEGTGELEVEGEVYSYKPFDIMTCPAWSYHAWRATGDKDTLMYVIHDMALLAYM
RALFWEEPKGSENIRHMVKGSTHTWSNTKAPEVSKTQAAKELLKQGE
>Q9JXK7 ~~~nadA~~~Neisseria adhesin A~~~
MKHFPSKVLTTAILATFCSGALAATSDDDVKKAATVAIVAAYNNGQEINGFKAGETIYDIGEDGTITQKDATAADVEADD
FKGLGLKKVVTNLTKTVNENKQNVDAKVKAAESEIEKLTTKLADTDAALADTDAALDETTNALNKLGENITTFAEETKTN
IVKIDEKLEAVADTVDKHAEAFNDIADSLDETNTKADEAVKTANEAKQTAEETKQNVDAKVKAAETAAGKAEAAAGTANT
AADKAEAVAAKVTDIKADIATNKADIAKNSARIDSLDKNVANLRKETRQGLAEQAALSGLFQPYNVGRFNVTAAVGGYKS
ESAVAIGTGFRFTENFAAKAGVAVGTSSGSSAAYHVGVNYEW
>P0DV44 ~~~nadA~~~Neisseria adhesin A~~~
MKHFPSKVLTTAILATFCSGALAATNDDDVKKAATVAIAAAYNNGQEINGFKAGETIYDIDEDGTITKKDATAADVEADD
FKGLGLKKVVTNLTKTVNENKQNVDAKVKAAESEIEKLTTKLADTDAALADTDAALDATTNALNKLGENITTFAEETKTN
IVKIDEKLEAVADTVDKHAEAFNDIADSLDETNTKADEAVKTANEAKQTAEETKQNVDAKVKAAETAAGKAEAAAGTANT
AADKAEAVAAKVTDIKADIATNKDNIAKKANSADVYTREESDSKFVRIDGLNATTEKLDTRLASAEKSIADHDTRLNGLD
KTVSDLRKETRQGLAEQAALSGLFQPYNVGRFNVTAAVGGYKSESAVAIGTGFRFTENFAAKAGVAVGTSSGSSAAYHVG
VNYEW
>A0ELI2 ~~~nadA~~~Neisseria adhesin A~~~
MKHFQSKVLTAAILAALSGSAMADNPPPSTDEIAKAALVNSYNNTQDINGFKVGDTIYDINGNGKITRKTATEDDVKADD
FGGLGLKEVLAQHDQSLADLTGTVDENSEALVKTAEVVNDISADVKANTAAIGENKAAIAKKADQTALDAVSEKVTANET
AIGKKANSADVYTKAEVYTKQESDNRFVKIGDRIGNLNTTANGLETRLADAEKSVADHGTRLASAEKSITEHGTRLNGLD
RTVSDLRKETRQGLAEQAALSGLFQPYNVGRFNVTAAVGGYKSESAVAIGTGFRFTENFAAKAGVAVGTSSGSSAAYHVG
VNYEW
>Q9KWZ1 2.5.1.72~~~nadA~~~Quinolinate synthase~~~COG0379
MSILDVIKQSNDMMPESYKELSRKDMETRVAAIKKKFGSRLFIPGHHYQKDEVIQFADQTGDSLQLAQVAEKNKEADYIV
FCGVHFMAETADMLTSEQQTVVLPDMRAGCSMADMADMQQTNRAWKKLQHIFGDTIIPLTYVNSTAEIKAFVGKHGGATV
TSSNAKKVLEWAFTQKKRILFLPDQHLGRNTAYDLGIALEDMAVWDPMKDELVAESGHTNVKVILWKGHCSVHEKFTTKN
IHDMRERDPDIQIIVHPECSHEVVTLSDDNGSTKYIIDTINQAPAGSKWAIGTEMNLVQRIIHEHPDKQIESLNPDMCPC
LTMNRIDLPHLLWSLEQIEKGEPSGVIKVPKAIQEDALLALNRMLSIT
>P11458 2.5.1.72~~~nadA~~~Quinolinate synthase~~~COG0379
MSVMFDPDTAIYPFPPKPTPLSIDEKAYYREKIKRLLKERNAVMVAHYYTDPEIQQLAEETGGCISDSLEMARFGAKHPA
STLLVAGVRFMGETAKILSPEKTILMPTLQAECSLDLGCPVEEFNAFCDAHPDRTVVVYANTSAAVKARADWVVTSSIAV
ELIDHLDSLGEKIIWAPDKHLGRYVQKQTGGDILCWQGACIVHDEFKTQALTRLQEEYPDAAILVHPESPQAIVDMADAV
GSTSQLIAAAKTLPHQRLIVATDRGIFYKMQQAVPDKELLEAPTAGEGATCRSCAHCPWMAMNGLQAIAEALEQEGSNHE
VHVDERLRERALVPLNRMLDFAATLRG
>P9WJK1 2.5.1.72~~~nadA~~~Quinolinate synthase~~~COG0379
MTVLNRTDTLVDELTADITNTPLGYGGVDGDERWAAEIRRLAHLRGATVLAHNYQLPAIQDVADHVGDSLALSRVAAEAP
EDTIVFCGVHFMAETAKILSPHKTVLIPDQRAGCSLADSITPDELRAWKDEHPGAVVVSYVNTTAAVKALTDICCTSSNA
VDVVASIDPDREVLFCPDQFLGAHVRRVTGRKNLHVWAGECHVHAGINGDELADQARAHPDAELFVHPECGCATSALYLA
GEGAFPAERVKILSTGGMLEAAHTTRARQVLVATEVGMLHQLRRAAPEVDFRAVNDRASCKYMKMITPAALLRCLVEGAD
EVHVDPGIAASGRRSVQRMIEIGHPGGGE
>P24519 2.5.1.72~~~nadA~~~Quinolinate synthase~~~
MSVMFDPQAAIYPFPPKPTPLNDDEKQFYREKIKRLLKERNAVMVAHYYTDPEIQQLAEETGGCISDSLEMARFGTKHAA
STLLVAGVRFMGETAKILSPEKTILMPTLAAECSLDLGCPIDEFSAFCDAHPDRTVVVYANTSAAVKARADWVVTSSIAV
ELIEHLDSLGEKIIWAPDRHLGNYVQKQTGADVLCWQGACIVHDEFKTQALTRLKKIYPDAALLVHPESPQSIVEMADAV
GSTSQLIKAAKTLPHRQLIVATDRGIFYKMQQAVPEKELLEAPTAGEGATCRSCAHCPWMAMNGLKAIAEGLEQGGAAHE
IQVDAALREGALLPLNRMLDFAATLRA
>Q9X1X7 2.5.1.72~~~nadA~~~Quinolinate synthase~~~COG0379
MVDEILKLKKEKGYIILAHNYQIPELQDIADFVGDSLQLARKAMELSEKKILFLGVDFMAELVKILNPDKKVIVPDRSAT
CPMANRLTPEIIREYREKFPDAPVVLYVNSTSECKTLADVICTSANAVEVVKKLDSSVVIFGPDRNLGEYVAEKTGKKVI
TIPENGHCPVHQFNAESIDAVRKKYPDAKVIVHPECPKPVRDKADYVGSTGQMEKIPEKDPSRIFVIGTEIGMIHKLKKK
FPDREFVPLEMAVCVNMKKNTLENTLHALQTESFEVILPKEVIEKAKKPILRMFELMG
>P38032 1.4.3.16~~~nadB~~~L-aspartate oxidase~~~COG0029
MSKKTIAVIGSGAAALSLAAAFPPSYEVTVITKKSVKNSNSVYAQGGIAAAYAKDDSIEAHLEDTLYAGCGHNNLAIVAD
VLHDGKMMVQSLLERGFPFDRNERGGVCLGREGAHSYNRIFHAGGDATGRLLIDYLLKRINSKIKLIENETAADLLIEDG
RCIGVMTKDSKGRLKVRHADEVVLAAGGCGNLFLHHTNDLTVTGDGLSLAYRAGAELTDLEFTQFHPTLLVKNGVSYGLV
SEAVRGEGGCLVDENGRRIMAERHPLGDLAPRDIVSRVIHEEMAKGNRVYIDFSAISDFETRFPTITAICEKAGIDIHSG
KIPVAPGMHFLMGGVSVNRWGETTVPGLYAIGETACSGLHGANRLASNSLLEALVFGKRAAEHIIQKPVYNRQYQSGLET
SVFYEVPDIEGHELQSKMTSHMSILREQSSLIELSIWLHTLPFQEVNVKDITIRQMELSHLWQTAKLMTFSALLREESRG
AHFRTDFPHAEVSWQGRQIVHTKKGTKIRKNEGIWNNESFTAEKITESLFS
>P10902 1.4.3.16~~~nadB~~~L-aspartate oxidase~~~COG0029
MNTLPEHSCDVLIIGSGAAGLSLALRLADQHQVIVLSKGPVTEGSTFYAQGGIAAVFDETDSIDSHVEDTLIAGAGICDR
HAVEFVASNARSCVQWLIDQGVLFDTHIQPNGEESYHLTREGGHSHRRILHAADATGREVETTLVSKALNHPNIRVLERS
NAVDLIVSDKIGLPGTRRVVGAWVWNRNKETVETCHAKAVVLATGGASKVYQYTTNPDISSGDGIAMAWRAGCRVANLEF
NQFHPTALYHPQARNFLLTEALRGEGAYLKRPDGTRFMPDFDERGELAPRDIVARAIDHEMKRLGADCMFLDISHKPADF
IRQHFPMIYEKLLGLGIDLTQEPVPIVPAAHYTCGGVMVDDHGRTDVEGLYAIGEVSYTGLHGANRMASNSLLECLVYGW
SAAEDITRRMPYAHDISTLPPWDESRVENPDERVVIQHNWHELRLFMWDYVGIVRTTKRLERALRRITMLQQEIDEYYAH
FRVSNNLLELRNLVQVAELIVRCAMMRKESRGLHFTLDYPELLTHSGPSILSPGNHYINR
>P9WJJ9 1.4.3.16~~~nadB~~~L-aspartate oxidase~~~COG0029
MAGPAWRDAADVVVIGTGVAGLAAALAADRAGRSVVVLSKAAQTHVTATHYAQGGIAVVLPDNDDSVDAHVADTLAAGAG
LCDPDAVYSIVADGYRAVTDLVGAGARLDESVPGRWALTREGGHSRRRIVHAGGDATGAEVQRALQDAAGMLDIRTGHVA
LRVLHDGTAVTGLLVVRPDGCGIISAPSVILATGGLGHLYSATTNPAGSTGDGIALGLWAGVAVSDLEFIQFHPTMLFAG
RAGGRRPLITEAIRGEGAILVDRQGNSITAGVHPMGDLAPRDVVAAAIDARLKATGDPCVYLDARGIEGFASRFPTVTAS
CRAAGIDPVRQPIPVVPGAHYSCGGIVTDVYGQTELLGLYAAGEVARTGLHGANRLASNSLLEGLVVGGRAGKAAAAHAA
AAGRSRATSSATWPEPISYTALDRGDLQRAMSRDASMYRAAAGLHRLCDSLSGAQVRDVACRRDFEDVALTLVAQSVTAA
ALARTESRGCHHRAEYPCTVPEQARSIVVRGADDANAVCVQALVAVC
>Q8ZMX9 1.4.3.16~~~nadB~~~L-aspartate oxidase~~~
MMTTPELSCDVLIIGSGAAGLSLALRLAEKHKVIVLSKGPVSEGSTFYAQGGIAAVFDETDSIASHVEDTLIAGAGICDR
HAVEFVASNARTCVQWLIDQGVLFDTHVQPNGKESYHLTREGGHSHRRILHAADATGKEVETTLVSRAQNHPNIQVLERS
NAVDLIISDKMGLPGPRRVVGAWIWNRNKEWVETCHAKSVVLATGGASKVYQYTTNPDISSGDGIAMAWRAGCRVANLEF
NQFHPTALYHPQARNFLLTEALRGEGAYLKRPDGSRFMPDVDERGELAPRDIVARAIDHEMKQLGADCMFLDISHKPDDF
VRQHFPMIYAKLLDLGMDLTKEPIPVVPAAHYTCGGVVVDDYGRTDVDGLYAIGEVSYTGLHGANRMASNSLLECLVYGW
SAAMDIDRRMPSVHSVDALPAWDESRVENADERVVIQHNWHELRLLMWDYVGIVRTTKRLERALRRITMLQQEIDEYYAN
FRVSNNLLELRNLVQVAELIVRCAMMRKESRGLHFTLDYPQQLAESGPSILSPLTPHINR
>P39666 2.4.2.19~~~nadC~~~Probable nicotinate-nucleotide pyrophosphorylase [carboxylating]~~~COG0157
MNHLQLKKLLNHFFLEDIGTGDLTSQSIFGEQSCEAEIVAKSEGIFAGAAIIKEGFSLLDENVQSILHKKDGDMLHKGEV
IAELHGPAAALLSGERVVLNLIQRLSGIATMTREAVRCLDDEQIKICDTRKTTPGLRMLEKYAVRAGGGYNHRFGLYDGI
MIKDNHIAACGSILEACKKARQAAGHMVNIEVEIETEEQLREAIAAGADVIMFDNCPPDTVRHFAKLTPANIKTEASGGI
TLESLPAFKGTGVNYISLGFLTHSVKSLDISMDVTLSNESVEECCYVNS
>P30011 2.4.2.19~~~nadC~~~Nicotinate-nucleotide pyrophosphorylase [carboxylating]~~~COG0157
MPPRRYNPDTRRDELLERINLDIPGAVAQALREDLGGTVDANNDITAKLLPENSRSHATVITRENGVFCGKRWVEEVFIQ
LAGDDVTIIWHVDDGDVINANQSLFELEGPSRVLLTGERTALNFVQTLSGVASKVRHYVELLEGTNTQLLDTRKTLPGLR
SALKYAVLCGGGANHRLGLSDAFLIKENHIIASGSVRQAVEKASWLHPDAPVEVEVENLEELDEALKAGADIIMLDNFET
EQMREAVKRTNGKALLEVSGNVTDKTLREFAETGVDFISVGALTKHVQALDLSMRFR
>O25909 2.4.2.19~~~nadC~~~Probable nicotinate-nucleotide pyrophosphorylase [carboxylating]~~~COG0157
MEIRTFLERALKEDLGHGDLFERVLEKDFKATAFVRAKQEGVFSGEKYALELLEMTGIECVQTIKDKERFKPKDALMEIR
GDFSMLLKVERTLLNLLQHSSGIATLTSRFVEALNSHKVRLLDTRKTRPLLRIFEKYSVLNGGASNHRLGLDDALMLKDT
HLRHVKDLKSFLTHARKNLPFTAKIEIECESFEEAKNAMNAGADIVMCDNLSVLETKEIAAYRDAHYPFVLLEASGNISL
ESINAYAKSGVDAISVGALIHQATFIDMHMKMA
>P9WJJ7 2.4.2.19~~~nadC~~~Nicotinate-nucleotide pyrophosphorylase [carboxylating]~~~COG0157
MGLSDWELAAARAAIARGLDEDLRYGPDVTTLATVPASATTTASLVTREAGVVAGLDVALLTLNEVLGTNGYRVLDRVED
GARVPPGEALMTLEAQTRGLLTAERTMLNLVGHLSGIATATAAWVDAVRGTKAKIRDTRKTLPGLRALQKYAVRTGGGVN
HRLGLGDAALIKDNHVAAAGSVVDALRAVRNAAPDLPCEVEVDSLEQLDAVLPEKPELILLDNFAVWQTQTAVQRRDSRA
PTVMLESSGGLSLQTAATYAETGVDYLAVGALTHSVRVLDIGLDM
>P30012 2.4.2.19~~~nadC~~~Nicotinate-nucleotide pyrophosphorylase [carboxylating]~~~
MPPRRYNPDDRRDALLERINLDIPAAVAQALREDLGGEVDAGNDITAQLLPADTQAHATVITREDGVFCGKRWVEEVFIQ
LAGDDVRLTWHVDDGDAIHANQTVFELNGPARVLLTGERTALNFVQTLSGVASEVRRYVGLLAGTQTQLLDTRKTLPGLR
TALKYAVLCGGGANHRLGLTDAFLIKENHIIASGSVRQAVEKAFWLHPDVPVEVEVENLDELDDALKAGADIIMLDNFNT
DQMREAVKRVNGQARLEVSGNVTAETLREFAETGVDFISVGALTKHVRALDLSMRFC
>C3L5T6 2.7.7.18~~~nadD~~~Probable nicotinate-nucleotide adenylyltransferase~~~
MRKIGIIGGTFDPPHYGHLLIANEVYHALNLEEVWFLPNQIPPHKQGRNITSVESRLQMLELATEAEEHFSICLEELSRK
GPSYTYDTMLQLTKKYPDVQFHFIIGGDMVEYLPKWYNIEALLDLVTFVGVARPGYKLRTPYPITTVEIPEFAVSSSLLR
ERYKEKKTCKYLLPEKVQVYIERNGLYES
>P54455 2.7.7.18~~~nadD~~~Nicotinate-nucleotide adenylyltransferase~~~COG1057
MKKIGIFGGTFDPPHNGHLLMANEVLYQAGLDEIWFMPNQIPPHKQNEDYTDSFHRVEMLKLAIQSNPSFKLELVEMERE
GPSYTFDTVSLLKQRYPNDQLFFIIGADMIEYLPKWYKLDELLNLIQFIGVKRPGFHVETPYPLLFADVPEFEVSSTMIR
ERFKSKKPTDYLIPDKVKKYVEENGLYES
>P0A752 2.7.7.18~~~nadD~~~Nicotinate-nucleotide adenylyltransferase~~~COG1057
MKSLQALFGGTFDPVHYGHLKPVETLANLIGLTRVTIIPNNVPPHRPQPEANSVQRKHMLELAIADKPLFTLDERELKRN
APSYTAQTLKEWRQEQGPDVPLAFIIGQDSLLTFPTWYEYETILDNAHLIVCRRPGYPLEMAQPQYQQWLEDHLTHNPED
LHLQPAGKIYLAETPWFNISATIIRERLQNGESCEDLLPEPVLTYINQQGLYR
>P9WJJ5 2.7.7.18~~~nadD~~~Probable nicotinate-nucleotide adenylyltransferase~~~COG1057
MGVMGGTFDPIHYGHLVAASEVADLFDLDEVVFVPSGQPWQKGRQVSAAEHRYLMTVIATASNPRFSVSRVDIDRGGPTY
TKDTLADLHALHPDSELYFTTGADALASIMSWQGWEELFELARFVGVSRPGYELRNEHITSLLGQLAKDALTLVEIPALA
ISSTDCRQRAEQSRPLWYLMPDGVVQYVSKCRLYCGACDAGARSTTSLAAGNGL
>Q9HX21 2.7.7.18~~~nadD~~~Probable nicotinate-nucleotide adenylyltransferase~~~
MGKRIGLFGGTFDPVHIGHMRSAVEMAEQFALDELRLLPNARPPHRETPQVSAAQRLAMVERAVAGVERLTVDPRELQRD
KPSYTIDTLESVRAELAADDQLFMLIGWDAFCGLPTWHRWEALLDHCHIVVLQRPDADSEPPESLRDLLAARSVADPQAL
KGPGGQITFVWQTPLAVSATQIRALLGAGRSVRFLVPDAVLNYIEAHHLYRAPH
>Q5HFG7 2.7.7.18~~~nadD~~~Probable nicotinate-nucleotide adenylyltransferase~~~
MKKIVLYGGQFNPIHTAHMIVASEVFHELQPDEFYFLPSFMSPLKKHHDFIDVQHRLTMIQMIIDELGFGDICDDEIKRG
GQSYTYDTIKAFKEQHKDSELYFVIGTDQYNQLEKWYQIEYLKEMVTFVVVNRDKNSQNVENAMIAIQIPRVDISSTMIR
QRVSEGKSIQVLVPKSVENYIKGEGLYEH
>Q6F8K4 6.3.5.1~~~nadE~~~Glutamine-dependent NAD(+) synthetase~~~COG0171
MKSFKIALAQFSPHIGNIDSNAQRMVEQANEAKKQNADLIIFPELSVIGYPAEDLLLRPNLNKRMQKAFQQLKEVKDIVM
VFGFVHQTEEGHRYNSAAVMKDGVVLGVYNKHNLPNYSVFDEKRYFSPGHQHLVFEYLGHKFGVLICEDIWSINTVKQLS
KLNVETVLVLNASPYEVGKPQHRVQTLTELSKQLNVHLVYLNQVGGQDDLIFDGSSFIINHDGEVAFQAPSFKEELYYSE
FDIEQKRYKKIDPAPALDTIAEIYQSLVMATRDYVQRSGFSGVILGLSGGIDSALTLAIAADAIGADKVQAVMMPYTYTS
QISVEDATEQARRMGVTFGIAEIHPIVNSFMQTLYPFFGNAPADATEENLQARARGTLLMGLSNKFGNLVLSTGNKSELA
VGYCTLYGDMVGGFAVLKDVYKTIVFELAKYRNTLSETPVIPERVITRPPSAELRPDQKDQDSLPAYDILDAILYAYIEE
DQSQSDIIAKGFDKEVVEKVIRLVDRNEYKRRQGAIGPRISSRAFSRERRYPIVNGWRPDD
>Q5DZX4 6.3.1.5~~~nadE~~~NH(3)-dependent NAD(+) synthetase~~~COG0171
MQQQIVEEMKVKVSIDPVEEIKKRVDFIKGKLLEAHCKSLILGISGGVDSTTCGRLAQLAVNELNLETQSSDYQFIAVRL
PYGIQQDEDEAQLALQFIQPTHSISINIKNGVDGLHSANHIALKDTGLLPTDSAKIDFVKGNVKARARMIAQYEVAGYVG
GLVLGTDHSAENITGFYTKFGDGACDLAPLFGLNKRQVREVAAQLGAPEQLVKKVPTADLEELAPQKADEDALSVSYDQI
DDFLEGKKIDADAEDRLIKIYQMSQHKRKPIPTIYD
>Q81RP3 6.3.1.5~~~nadE~~~NH(3)-dependent NAD(+) synthetase~~~COG0171
MTLQEQIMKALHVQPVIDPKAEIRKRVDFLKDYVKKTGAKGFVLGISGGQDSTLAGRLAQLAVEEIRNEGGNATFIAVRL
PYKVQKDEDDAQLALQFIQADQSVAFDIASTVDAFSNQYENLLDESLTDFNKGNVKARIRMVTQYAIGGQKGLLVIGTDH
AAEAVTGFFTKFGDGGADLLPLTGLTKRQGRALLQELGADERLYLKMPTADLLDEKPGQADETELGITYDQLDDYLEGKT
VPADVAEKIEKRYTVSEHKRQVPASMFDDWWK
>P08164 6.3.1.5~~~nadE~~~NH(3)-dependent NAD(+) synthetase~~~COG0171
MSMQEKIMRELHVKPSIDPKQEIEDRVNFLKQYVKKTGAKGFVLGISGGQDSTLAGRLAQLAVESIREEGGDAQFIAVRL
PHGTQQDEDDAQLALKFIKPDKSWKFDIKSTVSAFSDQYQQETGDQLTDFNKGNVKARTRMIAQYAIGGQEGLLVLGTDH
AAEAVTGFFTKYGDGGADLLPLTGLTKRQGRTLLKELGAPERLYLKEPTADLLDEKPQQSDETELGISYDEIDDYLEGKE
VSAKVSEALEKRYSMTEHKRQVPASMFDDWWK
>Q3JL79 6.3.1.5~~~nadE~~~NH(3)-dependent NAD(+) synthetase~~~
MSRPDQAARRRAIAAELHVSPTFDARDEAERRIGFVADYLRTAGLRACVLGISGGIDSSTAGRLAQLAVERLRASGYDAR
FVAMRLPYGAQHDEADARRALAFVRADETLTVDVKPAADAMLAALAAGGLAYLDHAQQDFVLGNIKARERMIAQYAVAGA
RNGVVIGTDHAAESVMGFFTKFGDGGADVLPLAGLTKRRVRALARMLGADEPLVLKTPTADLETLRPQRPDEHAYGITYE
QIDDFLEGKPMDDAVAETVLRFYDATRHKRALPYTMFDWPGHPA
>Q9PPB0 6.3.1.5~~~nadE~~~NH(3)-dependent NAD(+) synthetase~~~COG0171
MDWQKITEKMCDFIQEKVKNSQSQGVVLGLSGGIDSALVATLCKRALKENVFALLMPTQISNKANLEDALRLCADLNLEY
KIIEIQSILDAFIKQSENTTLVSLGNFAARIRMSLLYDYSALKNSLVIGTSNKSELLLGYGTIYGDLACAFNPIGSLYKS
EIYALAKYLNLHENFIKKAPSADLWENQSDEADLGFSYTKIDEGLKALETNDEKLLRTLDPSLIAMLKNRMQKNAFKGKM
PEILEI
>Q9RYV5 6.3.1.5~~~nadE~~~NH(3)-dependent NAD(+) synthetase~~~COG0171
MTPSPLPLSPLRSHIIRELHVQPDIDPGAEVERRVAFLCDYLQSTPTKGFVLGISGGQDSTLAGRLCQLAVERRRSQGHG
ATFLAVRLPYGVQADEADAQQALDFIQADREVTVNIKEAADASVAAAQAALGSEVRDFVRGNVKARERMVAQYALAGQEN
LLVVGTDHAAEALTGFYTKYGDGGVDLTPLSGLTKRQGAQLLAHLGAPEGTWRKVPTADLEDDRPGLPDEVALGVTYAQI
DAYLEGREVSDEAAARLERLFLNSRHKRALPVTPFDGWWQPGEQKQS
>P18843 6.3.1.5~~~nadE~~~NH(3)-dependent NAD(+) synthetase~~~COG0171
MTLQQQIIKALGAKPQINAEEEIRRSVDFLKSYLQTYPFIKSLVLGISGGQDSTLAGKLCQMAINELRLETGNESLQFIA
VRLPYGVQADEQDCQDAIAFIQPDRVLTVNIKGAVLASEQALREAGIELSDFVRGNEKARERMKAQYSIAGMTSGVVVGT
DHAAEAITGFFTKYGDGGTDINPLYRLNKRQGKQLLAALACPEHLYKKAPTADLEDDRPSLPDEVALGVTYDNIDDYLEG
KNVPQQVARTIENWYLKTEHKRRPPITVFDDFWKK
>Q830Y9 6.3.1.5~~~nadE~~~NH(3)-dependent NAD(+) synthetase~~~COG0171
MTTLQEKIIQELGVLPTIDPKEEVRKSIDFLKAYLTKHPFLKTFVLGISGGQDSTLAGRLAQLAMTEMREETGDMSYQFI
AIRLPYGEQADEADAQAALAFIQPDVSLRVDIKPAVDAMVGSLENAGVQISDFNKGNMKARQRMITQYAVAGENAGAVIG
TDHAAENVTAFFTKYGDGGADILPLFRLNKRQGKALLKELGAPEALYLKIPTADLEDDKPLVADEVALGVTYDAIDDYLE
GKKVSETDQQTIENWYKKGQHKRHLPITIFDDFWK
>O25096 6.3.1.5~~~nadE~~~NH(3)-dependent NAD(+) synthetase~~~COG0171
MQKDYQKLIVYLCDFLEKEVQKRGFKKVVYGLSGGLDSAVVGVLCQKVFKENAHALLMPSSVSMPENKTDALNLCEKFSI
PYTEYSIAPYDAIFSSHFKDASLTRKGNFCARLRMAFLYDYSLKSDSLVIGTSNKSERMLGYGTLFGDLACAINPIGELF
KTEVYELARRLNIPKKILNKPPSADLFVGQSDEKDLGYPYSVIDPLLKDIEALFQTKPIDTETLAQLGYDEILVKNITSR
IQKNAFKLELPAIAKRFNPE
>P9WJJ3 6.3.5.1~~~nadE~~~Glutamine-dependent NAD(+) synthetase~~~COG0171
MNFYSAYQHGFVRVAACTHHTTIGDPAANAASVLDMARACHDDGAALAVFPELTLSGYSIEDVLLQDSLLDAVEDALLDL
VTESADLLPVLVVGAPLRHRHRIYNTAVVIHRGAVLGVVPKSYLPTYREFYERRQMAPGDGERGTIRIGGADVAFGTDLL
FAASDLPGFVLHVEICEDMFVPMPPSAEAALAGATVLANLSGSPITIGRAEDRRLLARSASARCLAAYVYAAAGEGESTT
DLAWDGQTMIWENGALLAESERFPKGVRRSVADVDTELLRSERLRMGTFDDNRRHHRELTESFRRIDFALDPPAGDIGLL
REVERFPFVPADPQRLQQDCYEAYNIQVSGLEQRLRALDYPKVVIGVSGGLDSTHALIVATHAMDREGRPRSDILAFALP
GFATGEHTKNNAIKLARALGVTFSEIDIGDTARLMLHTIGHPYSVGEKVYDVTFENVQAGLRTDYLFRIANQRGGIVLGT
GDLSELALGWSTYGVGDQMSHYNVNAGVPKTLIQHLIRWVISAGEFGEKVGEVLQSVLDTEITPELIPTGEEELQSSEAK
VGPFALQDFSLFQVLRYGFRPSKIAFLAWHAWNDAERGNWPPGFPKSERPSYSLAEIRHWLQIFVQRFYSFSQFKRSALP
NGPKVSHGGALSPRGDWRAPSDMSARIWLDQIDREVPKG
>Q9HUP3 6.3.1.5~~~nadE~~~NH(3)-dependent NAD(+) synthetase~~~
MQQIQRDIAQALQVQPPFQSEADVQAQIARRIAFIQQCLKDSGLKTLVLGISGGVDSLTAGLLAQRAVEQLREQTGDQAY
RFIAVRLPYQVQQDEADAQASLATIRADEEQTVNIGPSVKALAEQLEALEGLEPAKSDFVIGNIKARIRMVAQYAIAGAR
GGLVIGTDHAAEAVMGFFTKFGDGACDLAPLSGLAKHQVRALARALGAPENLVEKIPTADLEDLRPGHPDEASHGVTYAE
IDAFLHGQPLREEAARVIVDTYHKTQHKRELPKAP
>Q03638 6.3.5.1~~~nadE~~~Glutamine-dependent NAD(+) synthetase~~~
MTDRFRITLAQLNPTVGALAANAEKAMAAWQAGRAAGADLVALPEMFLTGYQTQDLVLKPAFLRDAMAAMAALAAQVVDG
PALGIGGPYVDETGSYNAWWVLKDGRVIARALKHHLPHDDVFDEMRLFDQGPVSDPLRLGPVALGVPVCEDAWHPDVAGA
LAAAGAEVLMVPNGSPYRRGKLDLRRQVTGARVAETGLPLLYLNMVGGQDDQLFDGASFVLNPDGSVAVQLPAFEEAVVH
VDLERGAADWRAVPADIVAPPGDIEQDYRAMVLGLQDYLRKSGFSRVVLGLSGGIDSALVAVIAADALGAGNVHCVMLPS
RYTSQGSLDDAADLARRLGARLDTVEIEGPRAAVEGALAHVLAGTAPDVTEENIQSRLRGVILMAISNKFGAMLLTTGNK
SEVAVGYCTIYGDMAGGYNPLKDLYKTRVFETCRWRNATHRPWMQAPAGEIIPVAIIDKPPSAELRENQTDQDSLPPYEV
LDAILERLVEGDQSVDQIVAAGFDRATVKRIEHLLYISEWKRFQSAPGPRLTTRAFWLDRRYPMVNRWRDQS
>Q8ZPU5 6.3.1.5~~~nadE~~~NH(3)-dependent NAD(+) synthetase~~~
MTLQQEIIQALGAKPHINPEEEIRRSVDFLKAYLKTYPFLKSLVLGISGGQDSTLAGKLSQMAIAELREETGDNALQFIA
VRLPYGVQADEQDCQDAIAFIQPDRVLTVNIKGAVLASEQALREAGIELSDFVRGNEKARERMKAQYSIAGMTHGVVVGT
DHAAEAITGFFTKYGDGGTDINPLHRLNKRQGKQLLAALGCPEHLYKKVPTADLEDDRPSLPDEAALGVTYDNIDDYLEG
KTLDPAIAKTIEGWYVKTEHKRRLPITVFDDFWKR
>Q2G236 6.3.1.5~~~nadE~~~NH(3)-dependent NAD(+) synthetase~~~COG0171
MSKLQDVIVQEMKVKKRIDSAEEIMELKQFIKNYVQSHSFIKSLVLGISGGQDSTLVGKLVQMSVNELREEGIDCTFIAV
KLPYGVQKDADEVEQALRFIEPDEIVTVNIKPAVDQSVQSLKEAGIVLTDFQKGNEKARERMKVQFSIASNRQGIVVGTD
HSAENITGFYTKYGDGAADIAPIFGLNKRQGRQLLAYLGAPKELYEKTPTADLEDDKPQLPDEDALGVTYEAIDNYLEGK
PVTPEEQKVIENHYIRNAHKRELAYTRYTWPKS
>P99150 6.3.1.5~~~nadE~~~NH(3)-dependent NAD(+) synthetase~~~
MSKLQDVIVQEMKVKKRIDSAEEIMELKQFIKNYVQSHSFIKSLVLGISGGQDSTLVGKLVQMSVNELREEGIDCTFIAV
KLPYGVQKDADEVDQALRFIEPDEIVTVNIKPAVDQSVQSLKEAGIVLTDFQKGNEKARERMKVQFSIASNRQGIVVGTD
HSAENITGFYTKYGDGAADIAPIFGLNKRQGRQLLAYLGAPKELYEKTPTADLEDDKPQLPDEDALGVTYEAIDNYLEGK
PVTPEEQKVIENHYIRNAHKRELAYTRYTWPKS
>Q9KMW1 6.3.1.5~~~nadE~~~NH(3)-dependent NAD(+) synthetase~~~COG0171
MEHKIREEMRVLPSIDPQFEIERRVAFIKRKLTEARCKSLVLGISGGVDSTTCGRLAQLAVEELNQQHNTTEYQFIAVRL
PYGEQKDEDEAQLALSFIRPTHSVSVNIKAGVDGLHAASHHALANTGLIPSDPAKVDFIKGNVKARARMVAQYEIAGYVG
GLVLGTDHSAENITGFYTKFGDGACDLAPLFGLNKRQVRLLAKTLGAPEQLVYKTPTADLEELAPQKADEAALNLTYEQI
DDFLEGKAVPAEVSQRLVAIYHATQHKRQPIPTIYD
>O31612 2.7.1.23~~~ppnKA~~~NAD kinase 1~~~COG0061
MKFAVSSKGDQVSDTLKSKIQAYLLDFDMELDENEPEIVISVGGDGTLLYAFHRYSDRLDKTAFVGVHTGHLGFYADWVP
HEIEKLVLAIAKTPYHTVEYPLLEVIVTYHENEREERYLALNECTIKSIEGSLVADVEIKGQLFETFRGDGLCLSTPSGS
TAYNKALGGAIIHPSIRAIQLAEMASINNRVFRTVGSPLLLPSHHDCMIKPRNEVDFQVTIDHLTLLHKDVKSIRCQVAS
EKVRFARFRPFPFWKRVQDSFIGKGE
>Q8Y8D7 2.7.1.23~~~nadK1~~~NAD kinase 1~~~COG0061
MKYMITSKGDEKSDLLRLNMIAGFGEYDMEYDDVEPEIVISIGGDGTFLSAFHQYEERLDEIAFIGIHTGHLGFYADWRP
AEADKLVKLLAKGEYQKVSYPLLKTTVKYGIGKKEATYLALNESTVKSSGGPFVVDVVINDIHFERFRGDGLCMSTPSGT
TAYNKSLGGALMHPSIEAMQLTEMASINNRVYRTIGSPLVFPKHHVVSLQPVNDKDFQISVDHLSILHRDVQEIRYEVSA
KKIHFARFRSFPFWRRVHDSFIED
>P73955 2.7.1.23~~~nadK1~~~NAD kinase 1~~~COG0061
MELKQVIIAHKAGHNESKTYAERCARELEARGCKVLMGPSGIKDNPYPVFLASATEKIDLALVLGGDGTTLAAARHLSPE
GIPILSVNVGGHLGFLTEPFDVFQDTQKVWDRLNQDRYAVSQRMMLAASLFEGDRRDPQMVGETYYCLNEMCIKPASIDR
MPTAIIEVEVDGELIDQYQCDGLLVATPTGSTCYTSSANGPILHPGMDAIVITPICPLSLSSRPIVIPPGSSVNIWPLGD
FELNTKLWTDGSLATGVWPGQRVGVWMAHRAAQFILLRESYSFYKTLRDKLQWAGARFLYDGNNKVN
>P74430 2.7.1.23~~~nadK2~~~NAD kinase 2~~~COG0061
MPKVGIIFNDDKPTACSVAQELQEQLQQSGFTVAMETGSGGLLGYSQPDRPICHTRIEHLTPPHFDESMPFAIVLGGDGT
VLSAFRQLAPLGIPLLTINTGHMGFLTEIYLNQLPTAIEQLINGDYQIESRSMMTVRLMREENLLWEALSLNEMVLHREP
LTSMCHFEIQVGYHASVDIAADGIIVSTPTGSTAYSLSAGGPVVTPDVPVFQLAPICPHSLASRALVFSDLEPVTIFPAT
PNRMVLVVDGNGGCYVLPEDRVHLSKSPYPAKFIRLQTPEFFRILREKLGWGLPHIAKPTSVELP
>P0A7B3 2.7.1.23~~~nadK~~~NAD kinase~~~COG0061
MNNHFKCIGIVGHPRHPTALTTHEMLYRWLCTKGYEVIVEQQIAHELQLKNVKTGTLAEIGQLADLAVVVGGDGNMLGAA
RTLARYDIKVIGINRGNLGFLTDLDPDNAQQQLADVLEGHYISEKRFLLEAQVCQQDCQKRISTAINEVVLHPGKVAHMI
EFEVYIDEIFAFSQRSDGLIISTPTGSTAYSLSAGGPILTPSLDAITLVPMFPHTLSARPLVINSSSTIRLRFSHRRNDL
EISCDSQIALPIQEGEDVLIRRCDYHLNLIHPKDYSYFNTLSTKLGWSKKLF
>P44497 2.7.1.23~~~nadK~~~NAD kinase~~~COG0061
MNHLYRSFKTIALVGKPRNDINLQMHKNLFHWLMERGYQVLVEKEVAITLELPFEHLATLEEIGHRAQLAIVIGGDGNML
GRARVLAKYDIPLIGINRGNLGFLTDIDPKNAYSQLEACLERGEFFVEERFLLEAKIERASEIVSTSNAVNEAVIHPAKI
AHMIDFHVYINDKFAFSQRSDGLIVSTPTGSTAYSLSAGGPILTPNLNAIALVPMFPHTLTSRPLVVDGDSKISIRFAEH
NTSQLEVGCDSQITLPFTPDDVVHIQKSEHKLRLLHLKIIIITMC
>P9WHV7 2.7.1.23~~~ppnK~~~NAD kinase~~~COG0061
MTAHRSVLLVVHTGRDEATETARRVEKVLGDNKIALRVLSAEAVDRGSLHLAPDDMRAMGVEIEVVDADQHAADGCELVL
VLGGDGTFLRAAELARNASIPVLGVNLGRIGFLAEAEAEAIDAVLEHVVAQDYRVEDRLTLDVVVRQGGRIVNRGWALNE
VSLEKGPRLGVLGVVVEIDGRPVSAFGCDGVLVSTPTGSTAYAFSAGGPVLWPDLEAILVVPNNAHALFGRPMVTSPEAT
IAIEIEADGHDALVFCDGRREMLIPAGSRLEVTRCVTSVKWARLDSAPFTDRLVRKFRLPVTGWRGK
>Q9HZC0 2.7.1.23~~~nadK~~~NAD kinase~~~
MEPFRNIGIIGRLGSTQVLDTIRRLKKFLIDRHLHVILEDTIAEVLPGHGLQTCSRKIMGEICDLVVVVGGDGSMLGAAR
ALARHKVPVLGINRGSLGFLTDIRPDELEAKVGEVLDGQYIVESRFLLDAQVRRGIDSMGQGDALNDVVLHPGKSTRMIE
FELYIDGQFVCSQKADGLIVATPTGSTAYALSAGGPIMHPKLDAIVIVPMYPHMLSSRPIVVDGNSELKIVVSPNMQIYP
QVSCDGQNHFTCAPGDTVTISKKPQKLRLIHPIDHNYYEICRTKLGWGSRLGGGD
>P65774 2.7.1.23~~~nadK~~~NAD kinase~~~
MNNHFKCIGIVGHPRHPTALTTHEMLYRWLCDQGYEVIVEQQIAHELQLKNVPTGTLAEIGQQADLAVVVGGDGNMLGAA
RTLARYDINVIGINRGNLGFLTDLDPDNALQQLSDVLEGRYISEKRFLLEAQVCQQDRQKRISTAINEVVLHPGKVAHMI
EFEVYIDETFAFSQRSDGLIISTPTGSTAYSLSAGGPILTPSLDAITLVPMFPHTLSARPLVINSSSTIRLRFSHRRSDL
EISCDSQIALPIQEGEDVLIRRCDYHLNLIHPKDYSYFNTLSTKLGWSKKLF
>P65777 2.7.1.23~~~nadK~~~NAD kinase~~~
MRYTILTKGDSKSNALKHKMMNYMKDFRMIEDSENPEIVISVGGDGTLLQAFHQYSHMLSKVAFVGVHTGHLGFYADWLP
HEVEKLIIEINNSEFQVIEYPLLEIIMRYNDNGYETRYLALNEATMKTENGSTLVVDVNLRGKHFERFRGDGLCVSTPSG
STAYNKALGGALIHPSLEAMQITEIASINNRVFRTVGSPLVLPKHHTCLISPVNHDTIRMTIDHVSIKHKNVNSIQYRVA
NEKVRFARFRPFPFWKRVHDSFISSDEER
>P65779 2.7.1.23~~~nadK~~~NAD kinase~~~COG0061
MKNTGKRIDLIANRKPQSQRVLYELRDRLKRNQFILNDTNPDIVISIGGDGMLLSAFHKYENQLDKVRFIGLHTGHLGFY
TDYRDFELDKLVTNLQLDTGARVSYPVLNVKVFLENGEVKIFRALNEASIRRSDRTMVADIVINGVPFERFRGDGLTVST
PTGSTAYNKSLGGAVLHPTIEALQLTEIASLNNRVYRTLGSSIIVPKKDKIELIPTRNDYHTISVDNSVYSFRNIERIEY
QIDHHKIHFVATPSHTSFWNRVKDAFIGEVDE
>Q9X255 2.7.1.23~~~NADK~~~NAD kinase~~~COG0061
MKIAILYREEREKEGEFLKEKISKEHEVIEFGEANAPGRVTADLIVVVGGDGTVLKAAKKAADGTPMVGFKAGRLGFLTS
YTLDEIDRFLEDLRNWNFREETRWFIQIESELGNHLALNDVTLERDLSGKMVEIEVEVEHHSSMWFFADGVVISTPTGST
AYSLSIGGPIIFPECEVLEISPIAPQFFLTRSVVIPSNFKVVVESQRDINMLVDGVLTGKTKRIEVKKSRRYVRILRPPE
YDYVTVIRDKLGYGRRIE
>Q8ZH09 2.7.1.23~~~nadK~~~NAD kinase~~~COG0061
MNNRRFDCIGIVGHPRHPAALATHEILYHWLKARGYAVMVEQQIAHDLNLTDAITGSLADIGQKADLAVVVGGDGNMLGA
ARVLARYDIKVIGVNRGNLGFLTDLDPDNALQQLSDVLEGEYLSEQRFLLETHVRRTNQQSRISTAINEVVLHPGKVAHM
IEFEVYIDDRFAFSQRSDGLIIATPTGSTAYSLSAGGPILTPTLDAIVLVPMFPHTLTARPLVISSSSTIRLKFSHITSD
LEISCDSQIALPIQEGEEVLIRRSDFHLNLIHPKDYSYFNTLSTKLGWSKKLF
>Q6F999 2.7.7.1~~~nadM~~~Nicotinamide/nicotinic acid mononucleotide adenylyltransferase~~~COG1056
MYKFDYLVFIGRFQPFHFAHLQTIQIALQQSREVIIALGSAQPERNIKNPFLAEERQKMILANFSAEDQARIHFVNIIDV
YNDQKWVEQVKQLVNAIIESRSHVGLIGHFKDESSYYLKLFPEWTMVELESLKESMSATPMREAYYEGKIIESAFPEGTI
QFLKTFQDSEIYKQLQQKYRAQDSSNLI
>Q55928 ~~~~~~Bifunctional NMN adenylyltransferase/Nudix hydrolase~~~COG1051
MQTKYQYGIYIGRFQPFHLGHLRTLNLALEKAEQVIIILGSHRVAADTRNPWRSPERMAMIEACLSPQILKRVHFLTVRD
WLYSDNLWLAAVQQQVLKITGGSNSVVVLGHRKDASSYYLNLFPQWDYLETGHYPDFSSTAIRGAYFEGKEGDYLDKVPP
AIADYLQTFQKSERYIALCDEYQFLQAYKQAWATAPYAPTFITTDAVVVQAGHVLMVRRQAKPGLGLIALPGGFIKQNET
LVEGMLRELKEETRLKVPLPVLRGSIVDSHVFDAPGRSLRGRTITHAYFIQLPGGELPAVKGGDDAQKAWWMSLADLYAQ
EEQIYEDHFQIIQHFVSKV
>P32382 1.-.-.-~~~~~~NADH oxidase~~~
MTHFPNLFSEGRIGNLVIRNRIVMPPMATNLANEDGSVSQRLIDYYVARARGGVGLIILENVQVDYPQGKNVACQLRLDD
DKYMAGFFELAEAVHSYGAKIFMQIHHAGRQTTPGITEGLQPVAPSPVPCSFLGTQPRELTINEIEEIIQKFVDAAVRAK
GAMFDGIELHGAHGYLIGQFMSPRTNRRVDKYGGSFERRMRFPLEIIRRIKEAVGEDYPISFRFSADEFVEGGNTLEEGK
QIAKMLEEAGVHVLHVSAGIYESMPTLLEPSRFEQGWRVYLAEEIKKVVNIPVITVGVIREPEFAEKIIAEGRADFVAVG
RGLIADPEWPKKAKEGRQNEIRKCISCNIGCIGGRVFQNLRLRCTVNPVAGREGVYSEIKQAPVKKKVVVVGGGPAGMQA
AITAAKRGHQVILYEKKQHLGGQLEIASASPGKAKIKWFRDWLEAELSRAGVEVRSGVTADAETIAALSPDYVILATGSE
PVTPRIKGAEKENTFVFQAWDVLAGKVSFDKDEEVVVIGGGLVGCETAHYLAEKGAKVTIVEMLSDIAIDMEPISRFDMM
QQFTKLGISARTGKVVTEILPRGVAAVGKEGKQDFIRAHKVVLAIGQSPVGNELKKTLEDKGIDVRVIGDAYNVGKIIDA
VSSGFQVAWQI
>P39667 ~~~nadR~~~Transcription repressor NadR~~~COG1827
MTEELKLMGANRRDQLLLWLKESKSPLTGGELAKKANVSRQVIVQDISLLKAKNVPIIATSQGYVYMDAAAQQHQQAERI
IACLHGPERTEEELQLIVDEGVTVKDVKIEHPVYGDLTAAIQVGTRKEVSHFIKKINSTNAAYLSQLTDGVHLHTLTAPD
EHRIDQACQALEEAGILIKD
>P27278 ~~~nadR~~~Trifunctional NAD biosynthesis/regulator protein NadR~~~COG1056
MSSFDYLKTAIKQQGCTLQQVADASGMTKGYLSQLLNAKIKSPSAQKLEALHRFLGLEFPRQKKTIGVVFGKFYPLHTGH
IYLIQRACSQVDELHIIMGFDDTRDRALFEDSAMSQQPTVPDRLRWLLQTFKYQKNIRIHAFNEEGMEPYPHGWDVWSNG
IKKFMAEKGIQPDLIYTSEEADAPQYMEHLGIETVLVDPKRTFMSISGAQIRENPFRYWEYIPTEVKPFFVRTVAILGGE
SSGKSTLVNKLANIFNTTSAWEYGRDYVFSHLGGDEIALQYSDYDKIALGHAQYIDFAVKYANKVAFIDTDFVTTQAFCK
KYEGREHPFVQALIDEYRFDLVILLENNTPWVADGLRSLGSSVDRKEFQNLLVEMLEENNIEFVRVEEEDYDSRFLRCVE
LVREMMGEQR
>P44308 ~~~nadR~~~Bifunctional NAD biosynthesis protein NadR~~~COG3172
MGFTTGREFHPALRMRAKYNAKYLGTKSEREKYFHLAYNKHTQFLRYQEQIMSKTKEKKVGVIFGKFYPVHTGHINMIYE
AFSKVDELHVIVCSDTVRDLKLFYDSKMKRMPTVQDRLRWMQQIFKYQKNQIFIHHLVEDGIPSYPNGWQSWSEAVKTLF
HEKHFEPSIVFSSEPQDKAPYEKYLGLEVSLVDPDRTFFNVSATKIRTTPFQYWKFIPKEARPFFAKTVAILGGESSGKS
VLVNKLAAVFNTTSAWEYGREFVFEKLGGDEQAMQYSDYPQMALGHQRYIDYAVRHSHKIAFIDTDFITTQAFCIQYEGK
AHPFLDSMIKEYPFDVTILLKNNTEWVDDGLRSLGSQKQRQQFQQLLKKLLDKYKVPYIEIESPSYLDRYNQVKAVIEKV
LNEEEISELQNTTFPIKGTSQ
>P24518 ~~~nadR~~~Trifunctional NAD biosynthesis/regulator protein NadR~~~
MSSFDYLKTAIKQQGCTLQQVADASGMTKGYLSQLLNAKIKSPSAQKLEALHRFLGLEFPRRQKNIGVVFGKFYPLHTGH
IYLIQRACSQVDELHIIMGYDDTRDRGLFEDSAMSQQPTVSDRLRWLLQTFKYQKNIRIHAFNEEGMEPYPHGWDVWSNG
IKAFMAEKGIQPSWIYTSEEADAPQYLEHLGIETVLVDPERTFMNISGAQIRENPFRYWEYIPTEVKPFFVRTVAILGGE
SSGKSTLVNKLANIFNTTSAWEYGRDYVFSHLGGDEMALQYSDYDKIALGHAQYIDFAVKYANKVAFIDTDFVTTQAFCK
KYEGREHPFVQALIDEYRFDLVILLENNTPWVADGLRSLGSSVDRKAFQNLLVEMLKENNIEFVHVKEADYDGRFLRCVE
LVKEMMGEQG
>Q7WUL3 3.2.1.21~~~nag3~~~Beta-N-acetylglucosaminidase/beta-glucosidase~~~
MIDLTAAPFSLDDDGIAWVRTTLAEMGEDEKLGQLFCLITYTSDPEYLGYLTRGLHVGGVMLRTMTAADAAATVTTLQST
ATVPLLISANLEGGASQTVQEATHVGSNMALAATGSTDHVRRAATVIGREARALGINWAFTPVVDIDLNFRNPITNTRTF
GADAATVAAMGAEYVEAIQAQGLAASAKHFPGDGVDERDQHLLASVNTMSVEEWDDSFGVVYRAAIAAGVKTVMVGHIML
PAYSRALRPGVADRDILPGVVAEELLNDLLRDRLGFNGLVVSDSTTMAGLASVLPRSQAVPRVIAAGCDMFLFTKNLDED
FGYMRAGIRDGVITPERLDEAVTRILALKASLGLHRGTNLPAQGAAGVLADPDHSATAREVAASSITLVKEEPGVLPITR
ERYPRVLVYDLQNGGSPIGQGARAGAVEQFVDALVEAGHDVTRFEPGGGWEGMAAPTTDVTERHDLVLYLANLSTRSNQT
VVRIEWAEPMGANVPAYVHSVPTVFVSFENPYHLFDVPRVRTLINTYGSSPVVLETLLAALQGKAPFAGSSPVDAFCGQW
DTHL
>O52378 1.18.1.7~~~nagAa~~~Naphthalene 1,2-dioxygenase/salicylate 5-hydroxylase systems, ferredoxin--NAD(P)(+), reductase component~~~
MELVVEPLNLHLNAETGSTLLDVLRSNEVPISYSCMSGRCGTCRCRVIAGHLRDNGPETGRPQAGKGTYVLACQAVLTED
CTIEIPESDEIVVHPARIVKGTVTAIDEATHDIRRLRIKLAKPLEFSPGQYATVQFTPECVRPYSMAGLPSDAEMEFQIR
AVPGGHVSNYVFNELSVGASVRISGPLGTAYLRRTHTGPMLCVGGGTGLAPVLSIVRGALESGMSNPIHLYFGVRSEQDI
YDEERLHALAARFPNLKVNVVVATGPAGPGRRSGLVTDLIGRDLPNLAGWRAYLCGAPAMVEALNLLVARLGIVPGHIHA
DAFYPSGV
>O52381 ~~~nagAb~~~Naphthalene 1,2-dioxygenase/salicylate 5-hydroxylase systems, ferredoxin component~~~
MTQNWIDAACLDDIPEGDVVGVKVNGKEIALYEVEGEIYATDNLCTHGAARMSDGFLEGREIECPLHQGRFDVCTGKALC
TPLTKDIKTYPVKIENMRVMLKME
>O34450 3.5.1.25~~~nagA~~~N-acetylglucosamine-6-phosphate deacetylase~~~COG1820
MAESLLIKDIAIVTENEVIKNGYVGINDGKISTVSTERPKEPYSKEIQAPADSVLLPGMIDIHIHGGYGADTMDASFSTL
DIMSSRLPEEGTTSFLATTITQEHGNISQALVNAREWKAAEESSLLGAELLGIHLEGPFVSPKRAGAQPKEWIRPSDVEL
FKKWQQEAGGLIKIVTLAPEEDQHFELIRHLKDESIIASMGHTDADSALLSDAAKAGASHMTHLYNAMSPFHHREPGVIG
TALAHDGFVTELIADGIHSHPLAAKLAFLAKGSSKLILITDSMRAKGLKDGVYEFGGQSVTVRGRTALLSDGTLAGSILK
MNEGARHMREFTNCSWTDIANITSENAAKQLGIFDRKGSVTVGKDADLVIVSSDCEVILTICRGNIAFISKEADQI
>P0AF19 3.5.1.25~~~nagA~~~N-acetylglucosamine-6-phosphate deacetylase~~~COG1820
MYALTQGRIFTGHEFLDDHAVVIADGLIKSVCPVAELPPEIEQRSLNGAILSPGFIDVQLNGCGGVQFNDTAEAVSVETL
EIMQKANEKSGCTNYLPTLITTSDELMKQGVRVMREYLAKHPNQALGLHLEGPWLNLVKKGTHNPNFVRKPDAALVDFLC
ENADVITKVTLAPEMVPAEVISKLANAGIVVSAGHSNATLKEAKAGFRAGITFATHLYNAMPYITGREPGLAGAILDEAD
IYCGIIADGLHVDYANIRNAKRLKGDKLCLVTDATAPAGANIEQFIFAGKTIYYRNGLCVDENGTLSGSSLTMIEGVRNL
VEHCGIALDEVLRMATLYPARAIGVEKRLGTLAAGKVANLTAFTPDFKITKTIVNGNEVVTQ
>P0AF18 3.5.1.25~~~nagA~~~N-acetylglucosamine-6-phosphate deacetylase~~~COG1820
MYALTQGRIFTGHEFLDDHAVVIADGLIKSVCPVAELPPEIEQRSLNGAILSPGFIDVQLNGCGGVQFNDTAEAVSVETL
EIMQKANEKSGCTNYLPTLITTSDELMKQGVRVMREYLAKHPNQALGLHLEGPWLNLVKKGTHNPNFVRKPDAALVDFLC
ENADVITKVTLAPEMVPAEVISKLANAGIVVSAGHSNATLKEAKAGFRAGITFATHLYNAMPYITGREPGLAGAILDEAD
IYCGIIADGLHVDYANIRNAKRLKGDKLCLVTDATAPAGANIEQFIFAGKTIYYRNGLCVDENGTLSGSSLTMIEGVRNL
VEHCGIALDEVLRMATLYPARAIGVEKRLGTLAAGKVANLTAFTPDFKITKTIVNGNEVVTQ
>Q84F86 3.5.1.25~~~nagA~~~N-acetylglucosamine-6-phosphate deacetylase~~~
MFFLSLIIRNITVVNASGRDEQMDVWMKDGKIAQIAQHIHAQGVDQLEGSGKFLLPGFIDMHIHGSAQMDTMDASDEGLH
IHGPITIKEGTTSFLATTMTQSFDWFDRAQRQCGNNFSPKSDEAEVLGLHIEGPFVSKQRAGAQPLDYIVQPDMEVIKKW
QALSGQKIKQITLAPEEPNGMAAVQSLSESGVIVSIGHSDATFEQMQEAVQLGASQGTHLYNQMRPFHHRDPGVVGGVLL
VDAIKAELIVDFIHMHEGAVEMAYRLKGADGIILITDAMRAKGMPYGEYDLGGQLVHVTESGAHLSNGSLAGSILTMDQA
VRNMRQITNCTLEELVKMSSYNAAQQLKLTNKGQLTEGYDADAVIVDEHLLLHQTIKAGRIRVQTNN
>O32445 3.5.1.25~~~nagA~~~N-acetylglucosamine-6-phosphate deacetylase~~~COG1820
MYALTNCKIYTGNDVLVKHAVIINGDKIEAVCPIESLPSEMNVVDLNGANLSPGFIDLQLNGCGGVMFNDEITAETIDTM
HKANLKSGCTSFLPTLITSSDENMRQAIAAAREYQAKYPNQSLGLHLEGPYLNVMKKGIHSVDFIRPSDDTMIDTICANS
DVIAKVTLAPENNKPEHIEKLVKAGIVVSIGHTNATYSEARKSFESGITFATHLFNAMTPMVGREPGVVGAIYDTPEVYA
GIIADGFHVDYANIRIAHKIKGEKLVLVTDATAPAGAEMDYFIFVGKKVYYRDGKCVDENGTLGGSALTMIEAVQNTVEH
VGIALDEALRMATLYPAKAIGVDEKLGRIKKGMIANLTVFDRDFNVKATVVNGQYEQN
>O35000 3.5.99.6~~~nagB~~~Glucosamine-6-phosphate deaminase 1~~~COG0363
MKVMECQTYEELSQIAARITADTIKEKPDAVLGLATGGTPEGTYRQLIRLHQTENLSFQNITTVNLDEYAGLSSDDPNSY
HFYMNDRFFQHIDSKPSRHFIPNGNADDLEAECRRYEQLVDSLGDTDIQLLGIGRNGHIGFNEPGTSFKSRTHVVTLNEQ
TRQANARYFPSIDSVPKKALTMGIQTILSSKRILLLISGKSKAEAVRKLLEGNISEDFPASALHLHSDVTVLIDREAASL
RP
>O30564 3.5.99.6~~~nagB~~~Glucosamine-6-phosphate deaminase~~~
MRLIIRPTYEDISKWAANHVAQKINEFSPTKENPFILGLPTGSSPIGMYKNLIELNKNKKISFQNVITFNMDEYIGIEEN
HPESYHSFMWNNFFSHIDIKKENINILNGNASNLKKECEEYEKKIKSFGGIMLFVGGIGPDGHIAFNEPGSSLTSRTRIK
TLTQDTIIANSRFFEGDVNKVPKNALTVGIGTIMDSQEVLIIVNGHNKARALKHAIEKGVNHMWTISALQLHKNAIIVSD
KNATYELKVGTVEYFNDIERKNFNNDLK
>P0A760 3.5.99.6~~~nagB~~~Glucosamine-6-phosphate deaminase~~~COG0363
MRLIPLTTAEQVGKWAARHIVNRINAFKPTADRPFVLGLPTGGTPMTTYKALVEMHKAGQVSFKHVVTFNMDEYVGLPKE
HPESYYSFMHRNFFDHVDIPAENINLLNGNAPDIDAECRQYEEKIRSYGKIHLFMGGVGNDGHIAFNEPASSLASRTRIK
TLTHDTRVANSRFFDNDVNQVPKYALTVGVGTLLDAEEVMILVLGSQKALALQAAVEGCVNHMWTISCLQLHPKAIMVCD
EPSTMELKVKTLRYFNELEAENIKGL
>P0A759 3.5.99.6~~~nagB~~~Glucosamine-6-phosphate deaminase~~~COG0363
MRLIPLTTAEQVGKWAARHIVNRINAFKPTADRPFVLGLPTGGTPMTTYKALVEMHKAGQVSFKHVVTFNMDEYVGLPKE
HPESYYSFMHRNFFDHVDIPAENINLLNGNAPDIDAECRQYEEKIRSYGKIHLFMGGVGNDGHIAFNEPASSLASRTRIK
TLTHDTRVANSRFFDNDVNQVPKYALTVGVGTLLDAEEVMILVLGSQKALALQAAVEGCVNHMWTISCLQLHPKAIMVCD
EPSTMELKVKTLRYFNELEAENIKGL
>Q4QP46 3.5.99.6~~~nagB~~~Glucosamine-6-phosphate deaminase~~~
MRFIPLQTEQQVSCWAAQHIINRINDFKPTAERPFVLGLPTGGTPLKTYQELIRLYQAGKVSFKHVVTFNMDEYVALPEE
HPESYHSFMYNNFFNHIDILPENINILNGNTDDHNAECHRYEEKIKSYGKIHLFMGGVGVDGHIAFNEPASSLSSRTRIK
TLTQDTLIANSRFFNNDVTQVPKYALTIGVGTLLDAEEVMILATGHQKALAVQAAVEGSINHLWTVSALQMHRHFVLVCD
EAAQQELKVKTVKYFTELEGSVAGTDYQDK
>P59686 3.5.99.6~~~nagB~~~Glucosamine-6-phosphate deaminase~~~
MHEPLPSLRLLNNGSTTFGLATGGTMEPLYAKICKTDIDFSNCISFNLDEYVGLEANHEQSYAYYMHQHLFHEKPFQASY
LPNGLATNPLEEAARYEALLQQHSLDFQLLGIGQNGHIGFNEPGTSFESLTHLVTLEESTRQANARFFSSINEVPTQAFT
MGIQSIMRAKCILLIAVGETKREVLERVLASDYTEEIPASALTKHPNVIILTDLQVEENKS
>A0QU88 3.5.99.6~~~nagB~~~Glucosamine-6-phosphate deaminase~~~COG0363
MEVIILPDPGRIGSLAADAITALITRKPDAVLGLATGSSPLAVYDELVSRYEAGQISFRQARGFTLDEYVGLPADHPERY
RNVIDTAFAARVDFAPGAVQGPDGLADDIPAACAAYEAAIRDAGGVDLQILGIGTDGHIAFNEPGSSLASRTRIKTLTRQ
TRVDNARFFGGDLDQVPTHCLTQGLGTIMEARHLILIAMGRSKAEAVHHLVEGAVSAMWPATVLQMHPHVTVLLDDAAAQ
RLQLVDYYRETYRAKPAWQGI
>Q9CMF4 3.5.99.6~~~nagB~~~Glucosamine-6-phosphate deaminase~~~
MRLIPLHNVDQVAKWSARYIVDRINQFQPTEARPFVLGLPTGGTPLKTYEALIELYKAGEVSFKHVVTFNMDEYVGLPKE
HPESYHSFMYKNFFDHVDIQEKNINILNGNTEDHDAECQRYEEKIKSYGKIHLFMGGVGVDGHIAFNEPASSLSSRTRIK
TLTEDTLIANSRFFDNDVNKVPKYALTIGVGTLLDAEEVMILVTGYNKAQALQAAVEGSINHLWTVTALQMHRRAIIVCD
EPATQELKVKTVKYFTELEASAIRSVK
>P99125 3.5.99.6~~~nagB~~~Glucosamine-6-phosphate deaminase~~~
MKVLNLGSKKQASFYVACELYKEMAFNQHCKLGLATGGTMTDLYEQLVKLLNKNQLNVDNVSTFNLDEYVGLTASHPQSY
HYYMDDMLFKQYPYFNRKNIHIPNGDADDMNAEASKYNDVLEQQGQRDIQILGIGENGHIGFNEPGTPFDSVTHIVDLTE
STIKANSRYFKNEDDVPKQAISMGLANILQAKRIILLAFGEKKRAAITHLLNQEISVDVPATLLHKHPNVEIYLDDEACP
KNVAKIHVDEMD
>Q9K487 3.5.99.6~~~nagB~~~Glucosamine-6-phosphate deaminase~~~COG0363
MEVVIVPDAKAGGELIAEAMAQLLRRKPDALLGVATGSTPLPVYEALAAKVRSGAVDTAQARIAQLDEYVGLPAEHPESY
RSVLRREVLEPLGIDMDAFMGPDGTAADVQAACEAYDTALGGSGGVDLQLLGIGTDGHIGFNEPCSSLASRTRIKTLTEQ
TRIDNARFFDGDIEQVPHHVITQGIGTILEARHVVLLATGEGKADAVAASVEGPVAAVCPASALQLHPHATVVVDEAAAS
KLKLADYFRHTYAHKPDWQGI
>Q8DV70 3.5.99.6~~~nagB~~~Glucosamine-6-phosphate deaminase~~~COG0363
MKTIKVKNKTEGSKVAFRMLEEEITFGAKTLGLATGSTPLELYKEIRESHLDFSDMVSINLDEYVGLSADDKQSYAYFMK
QNLFAAKPFKKSYLPNGLAADLAKETEYYDQILAQYPIDLQILGIGRNAHIGFNEPGTAFSSQTHLVDLTPSTIAANSRF
FEKAEDVPKQAISMGLASIMSAKMILLMAFGEEKAEAVAAMVKGPVTEEIPASILQTHPKVILIVDEKAGAGI
>Q9KKS5 3.5.99.6~~~nagB~~~Glucosamine-6-phosphate deaminase~~~COG0363
MRLIPLKAAAQVGKWAAAHIVKRINEFQPTAERPFVLGLPTGGTPLATYKALIEMHKAGEVSFKHVVTFNMDEYVGLAAD
HPESYRSFMYNNFFNHIDIQEENINLLNGNTDDHEAECKRYEDKIKSYGKINLFMGGVGNDGHIAFNEPASSLSSRTRIK
TLTEDTRIANSRFFDGDINQVPKYALTIGVGTLLDAQEIMILVTGHNKALALQAAVEGSVNHLWTVSALQLHPKAVIVCD
EPSTQELKVKTVKYFTELEAKNIVGF
>P0AF20 ~~~nagC~~~N-acetylglucosamine repressor~~~COG1846
MTPGGQAQIGNVDLVKQLNSAAVYRLIDQYGPISRIQIAEQSQLAPASVTKITRQLIERGLIKEVDQQASTGGRRAISIV
TETRNFHAIGVRLGRHDATITLFDLSSKVLAEEHYPLPERTQQTLEHALLNAIAQFIDSYQRKLRELIAISVILPGLVDP
DSGKIHYMPHIQVENWGLVEALEERFKVTCFVGHDIRSLALAEHYFGASQDCEDSILVRVHRGTGAGIISNGRIFIGRNG
NVGEIGHIQVEPLGERCHCGNFGCLETIAANAAIEQRVLNLLKQGYQSRVPLDDCTIKTICKAANKGDSLASEVIEYVGR
HLGKTIAIAINLFNPQKIVIAGEITEADKVLLPAIESCINTQALKAFRTNLPVVRSELDHRSAIGAFALVKRAMLNGILL
QHLLEN
>P0AF24 3.1.3.5~~~nagD~~~Ribonucleotide monophosphatase NagD~~~COG0647
MTIKNVICDIDGVLMHDNVAVPGAAEFLHGIMDKGLPLVLLTNYPSQTGQDLANRFATAGVDVPDSVFYTSAMATADFLR
RQEGKKAYVVGEGALIHELYKAGFTITDVNPDFVIVGETRSYNWDMMHKAAYFVANGARFIATNPDTHGRGFYPACGALC
AGIEKISGRKPFYVGKPSPWIIRAALNKMQAHSEETVIVGDNLRTDILAGFQAGLETILVLSGVSSLDDIDSMPFRPSWI
YPSVAEIDVI
>O52379 1.14.13.172~~~nagG~~~Salicylate 5-hydroxylase, large oxygenase component~~~
MSEPQRLKPVFPQDPKWPGEGSSRVPFWAYTREDLYKRELERLFYANHWCYVGLEAEIPNPGDFKRTVIGERSVIMVRDP
DGGINVVENVCAHRGMRFCRERHGNAKDFFCPYHQWNYSLKGDLQGVPFRRGVKQDGKVNGGMPKDFKLEEHGLTKLKVA
ARGGAVFASFDHDVEPFEEFLGPTILHYFDRVFNGRKLKILGYRRQRIPGNWKLMQENIKDPYHPGLLHTWFSTFGLWRA
DNKSELKMDAKFRHAAMISTRGQGGKNEEVVSGVDSFKEQMKVNDPRLLDIVPEPWWGGPTAVMTTIFPSVIIQQQVNSV
STRHIQPNGHGSFDFVWTHFGFEDDNEEWTQRRLIQANLFGPAGFVSADDGEVIEWSQEGFEQKPTHRTVIEMGGHEIGD
TDHMVTETLIRGMYDYWRKVMGE
>P26831 3.2.1.35~~~nagH~~~Hyaluronoglucosaminidase~~~
MNKNIRKIITSTVLAAMTISVLPSNLVVFATDGITENFYEIYPKPQEISYSGGEFQISDEINIVYDDGIDTYTKKRVDEV
LEASNLEATVSNEIVPGKTNFLVGINESGGVVDNYFNKNIPHDESFFDEKMDANIVSVKDGVIGVIGEDTDSAFYGVTTL
KHVFNQLEEGNKIQSFRADDYAEVAHRGFIEGYYGNPWSNEDRAELMKFGGDYKLNQYVFAPKDDPYHNSKWRDLYPEEK
LSEIKKLAQVGNETKNRYVYALHPFMNNPVRFDTEENYQNDLGVIKAKFTQLLENDVRQFAILADDASAPAQGASMYVKL
LTDLTRWLEEQQSTYPDLKTDLMFCPSDYYGNGSSAQLKELNKAEDNVSIVMTGGRIWGEVDENFANNFMNNISTEGHPG
RAPFFWINWPCSDNSKQHLIMGGNDTFLHPGVDPSKIDGIVLNPMQQAEANKSALFAIADYAWNIWDNKEEADENWNDSF
KYMDHGTAEETNSSLALREISKHMINQNMDGRVRPLQESVELAPKLEAFKQKYDSGASIKEDALELIAEFTNLQKAADYY
KNNPGNERTRDQIIYWLNCWEDTMDAAIGYLKSAIAIEEGDDEAAWANYSEAQGAFEKSKTYGFHYVDHTEYAEVGVQHI
VPFIKSMGQNLSVVIGSIVDPNRIIATYISNRQDAPTGNPDNIFDNNASTELVYKNPNRIDVGTYVGVKYSNPITLNNVE
FLMGANSNPNDTMQKAKIQYTVDGREWIDLEEGVEYTMPGAIKVENLDLKVRGVRLIATEARENTWLGVRDINVNKKEDS
NSGVEFNPSLIRSESWQVYEGNEANLLDGDDNTGVWYKTLNGDTSLAGEFIGLDLGKEIKLDGIRFVIGKNGGGSSDKWN
KFKLEYSLDNESWTTIKEYDKTGAPAGKDVIEESFETPISAKYIRLTNMENINKWLTFSEFAIISDELENAGNKENVYTN
TELDLLSLAKEDVTKLIPTDDISLNHGEYIGVKLNRIKDLSNINLEISNDTGLKLQSSMNGVEWTEITDKNTLEDGRYVR
LINTSNEAVNFNLTKFEVNSNEVYEPSLVDAYVGDDGAKKAVDGDLKTRVKFLGAPSTGDTIVYDLGQEILVDNLKYVVL
DTEVDHVRDGKIQLSLDGETWTDAITIGDGVENGVDDMFSTPLKNGYKHGNQSGGIVPIDSAYVEGDNLNQKARYVRILF
TAPYRHRWTVINELMINNGEYISTVNDPTYISNPIEERGFAPSNLRDGNLTTSYKPNTNNGEISEGSITYRLSEKTDVRK
VTIVQSGSSISNAKVMARVGDGSENVTDQWVQLGTLSNSLNEFINRDYNNIYEIKIEWTDVAPNIYEIITLNQEFEFPVN
DSLKAKYDELINLSGDEYTLSSFETLKEALNEAKSILDDSNSSQKKIDKALEKLNKAEERLDLRATDFEDFNKVLTLGNS
LVEEEYTAESWALFSEVLEAANEANKNKADYTQDQINQIVIDLDASIKALVKETPEVDKTNLGELINQGKSLLDESVEGF
NVGEYHKGAKDGLTVEINKAEEVFNKEDATEEEINLAKESLEGAIARFNSLLIEESTGDFNGNGKIDIGDLAMVSKNIGS
TTNTSLDLNKDGSIDEYEISFINHRILN
>O52380 1.14.13.172~~~nagH~~~Salicylate 5-hydroxylase, small oxygenase component~~~
MVDFKTYFELLNLYSDYAMVCDSANWEKWPDFFIETGTYRLQPRENFEQGLPLCLLALESKAMIRDRVYGVKETMYHDPY
YQRHIVGTPRVLSVERDADGERITAEASYAVIRTKYDGDSTIFNAGYYRDVIVRTPEGLKLKSRLCVYDSEMIPNSVIYP
I
>P75959 2.7.1.59~~~nagK~~~N-acetyl-D-glucosamine kinase~~~COG1940
MYYGFDIGGTKIALGVFDSGRQLQWEKRVPTPRDSYDAFLDAVCELVAEADQRFGCKGSVGIGIPGMPETEDGTLYAANV
PAASGKPLRADLSARLDRDVRLDNDANCFALSEAWDDEFTQYPLVMGLILGTGVGGGLIFNGKPITGKSYITGEFGHMRL
PVDALTMMGLDFPLRRCGCGQHGCIENYLSGRGFAWLYQHYYHQPLQAPEIIALYDQGDEQARAHVERYLDLLAVCLGNI
LTIVDPDLVVIGGGLSNFPAITTQLADRLPRHLLPVARVPRIERARHGDAGGMRGAAFLHLTD
>O86042 3.7.1.20~~~nagK~~~Fumarylpyruvate hydrolase~~~
MGRPVDKSVEQAFYFTKSPQTLVESGATVAYPPRTSNYHYEMELVLAIGKPGFRVSEDQAHELIYGYAAGLDMTRRDLQL
VARDKGRPWDTGKDIEEGSVCSEIVPMQGVVVEQGAIALEVNGQTKQSSNVDKLIWNVREIIADLSTYYHLQPGDLIYTG
TPEGVGAVVAGDKIIGRVEGIAEISLTVGPAE
>Q8ZPZ9 2.7.1.59~~~nagK~~~N-acetyl-D-glucosamine kinase~~~
MYYGFDIGGTKIALGVFDSTRRLQWEKRVPTPHTSYSAFLDAVCELVEEADQRFGVKGSVGIGIPGMPETEDGTLYAANV
PAASGKPLRADLSARLDRDVRLDNDANCFALSEAWDDEFTQYPLVMGLILGTGVGGGLVLNGKPITGQSYITGEFGHMRL
PVDALTLMGFDFPLRRCGCGQMGCIENYLSGRGFAWLYQHYYDQSLQAPEIIALWEQGDEQAHAHVERYLDLLAVCLGNI
LTIVDPDLLVIGGGLSNFTAITTQLAERLPRHLLPVARAPRIERARHGDAGGMRGAAFLHLTD
>Q8D9M7 2.7.1.59~~~nagK~~~N-acetyl-D-glucosamine kinase~~~
MYYGFDVGGTKIEFGAFNEKLERVATERVPTPTDDYPLLLETIAGLVAKYDQEFACEGKIGLGLPGMEDADDATVLTVNV
PAAKGKPLRADLEAKIGRSVKIENDANCFALSEAWDEELQDAPSVMGLILGTGFGGGLIYEGKVFSGRNNVAGELGHMRL
PLDAWFHLGDNAPLLGCGCGKKGCLDSYLSGRGFELLYAHYYGEEKKAIDIIKANAAGDEKAAEHVERFMELLAICFGNI
FTANDPHVVALGGGLSNFELIYEEMPKRVPKYLLSVAKCPKIIKAKHGDSGGVRGAAFLNIKG
>O86043 5.2.1.4~~~nagL~~~Maleylpyruvate isomerase~~~
MKLYNFWRSGTSHRLRIALNLKGVPYEYLAVHLGKEEHLKDAFKALNPQQLVPALDTGAQVLIQSPAIIEWLEEQYPTPA
LLPADADGRQRVRALAAIVGCDIHPINNRRILEYLRKTFGADEAAINAWCGTWISAGFDAYEALLAVDPKRGRYSFGDTP
TLADCYLVPQVESARRFQVDLTPYPLIRAVDAACGELDAFRRAAPAAQPDSA
>O34817 ~~~nagR~~~HTH-type transcriptional repressor NagR~~~COG2188
MNINKQSPIPIYYQIMEQLKTQIKNGELQPDMPLPSEREYAEQFGISRMTVRQALSNLVNEGLLYRLKGRGTFVSKPKME
QALQGLTSFTEDMKSRGMTPGSRLIDYQLIDSTEELAAILGCGHPSSIHKITRVRLANDIPMAIESSHIPFELAGELNES
HFQSSIYDHIERYNSIPISRAKQELEPSAATTEEANILGIQKGAPVLLIKRTTYLQNGTAFEHAKSVYRGDRYTFVHYMD
RLS
>P40406 3.2.1.52~~~nagZ~~~Beta-hexosaminidase~~~COG1472
MRPVFPLILSAVLFLSCFFGARQTEASASKRAIDANQIVNRMSLDEKLGQMLMPDFRNWQKEGESSPQALTKMNDEVASL
VKKYQFGGIILFAENVKTTKQTVQLTDDYQKASPKIPLMLSIDQEGGIVTRLGEGTNFPGNMALGAARSRINAYQTGSII
GKELSALGINTDFSPVVDINNNPDNPVIGVRSFSSNRELTSRLGLYTMKGLQRQDIASALKHFPGHGDTDVDSHYGLPLV
SHGQERLREVELYPFQKAIDAGADMVMTAHVQFPAFDDTTYKSKLDGSDILVPATLSKKVMTGLLRQEMGFNGVIVTDAL
NMKAIADHFGQEEAVVMAVKAGVDIALMPASVTSLKEEQKFARVIQALKEAVKNGDIPEQQINNSVERIISLKIKRGMYP
ARNSDSTKEKIAKAKKIVGSKQHLKAEKKLAEKAVTVLKNEQHTLPFKPKKGSRILIVAPYEEQTASIEQTIHDLIKRKK
IKPVSLSKMNFASQVFKTEHEKQVKEADYIITGSYVVKNDPVVNDGVIDDTISDSSKWATVFPRAVMKAALQHNKPFVLM
SLRNPYDAANFEEAKALIAVYGFKGYANGRYLQPNIPAGVMAIFGQAKPKGTLPVDIPSVTKPGNTLYPLGYGLNIKTGR
PL
>P75949 3.2.1.52~~~nagZ~~~Beta-hexosaminidase~~~COG1472
MGPVMLDVEGYELDAEEREILAHPLVGGLILFTRNYHDPAQLRELVRQIRAASRNRLVVAVDQEGGRVQRFREGFTRLPA
AQSFAALSGMEEGGKLAQEAGWLMASEMIAMDIDISFAPVLDVGHISAAIGERSYHADPQKALAIASRFIDGMHEAGMKT
TGKHFPGHGAVTADSHKETPCDPRPQAEIRAKDMSVFSSLIRENKLDAIMPAHVIYSDVDPRPASGSPYWLKTVLRQELG
FDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPIKAERVTRLYHKGSFSRQELMDSARW
KAISTRLNQLHERWQEEKAGH
>Q5FA94 3.2.1.52~~~nagZ~~~Beta-hexosaminidase~~~
MTVPHIPRGPVMADIAAFRLTEEEKQRLLDPAIGGIILFRRNFQNIEQLKTLTAEIKALRTPELIIAVDHEGGRVQRFIE
GFTRLPAMNVLGQIWDKDGASAAETAAGQVGRVLATELSACGIDLSFTPVLDLDWGNCAVIGNRSFHRNPEAVARLALAL
QKGLAKGGMKSCGKHFPGHGFVEGDSHLVLPEDGRSLDELEAADLAPFRIMSREGMAAVMPAHVVYPQVDTKPAGFSEIW
LKQILRRDIGFKGVIFSDDLTMEGACGAGGIKERARISFEAGCDIVLVCNRPDLVDELRDGFTIPDNQDLAGRWQYMENS
LGHEAVQAVMQTMGFQAAQAFVAGLASPQDTAGGVKVGEAF
>Q9HZK0 3.2.1.52~~~nagZ~~~Beta-hexosaminidase~~~
MQGSLMLDIGGTWLTAEDRQILRHPEVGGLIIFARNIEHPAQVRELCAAIRAIRPDLLLAVDQEGGRVQRLRQGFVRLPA
MRAIADNPNAEELAEHCGWLMATEVQAVGLDLSFAPVLDLDHQRSAVVGSRAFEGDPERAALLAGAFIRGMHAAGMAATG
KHFPGHGWAEADSHVAIPEDARSLEEIRRSDLVPFARLAGQLDALMPAHVIYPQVDPQPAGFSRRWLQEILRGELKFDGV
IFSDDLSMAGAHVVGDAASRIEAALAAGCDMGLVCNDRASAELALAALQRLKVTPPSRLQRMRGKGYANTDYRQQPRWLE
ALSALRAAQLID
>Q8ZQ06 3.2.1.52~~~nagZ~~~Beta-hexosaminidase~~~
MGPVMLNVEGCELDAEEREILAHPLVGGLILFTRNYHDPEQLRELVRQIRAASRNHLVVAVDQEGGRVQRFREGFTRLPA
AQSFFALHGLEEGGRLAQEAGWLMASEMIAMDIDISFAPVLDVGHISAAIGERSYHADPAKALAMATRFIDGMHDAGMKT
TGKHFPGHGAVTADSHKETPCDPRPETDIRGKDMSVFRTLISENKLDAIMPAHVIYRAIDPRPASGSPYWLKTVLRQELG
FDGVIFSDDLSMEGAAIMGSYAERAQASLDAGCDMILVCNNRKGAVSVLDNLSPIKAERVTRLYHKGSFSRRELMDSARW
KTASAQLNQLHERWQEEKAGH
>Q9KU37 3.2.1.52~~~nagZ~~~Beta-hexosaminidase~~~COG1472
MGPLWLDVAGYELSAEDREILQHPTVGGVILFGRNYHDNQQLLALNKAIRQAAKRPILIGVDQEGGRVQRFREGFSRIPP
AQYYARAENGVELAEQGGWLMAAELIAHDVDLSFAPVLDMGFACKAIGNRAFGEDVQTVLKHSSAFLRGMKAVGMATTGK
HFPGHGAVIADSHLETPYDERETIAQDMAIFRAQIEAGVLDAMMPAHVVYPHYDAQPASGSSYWLKQVLREELGFKGIVF
SDDLSMEGAAVMGGPVERSHQALVAGCDMILICNKREAAVEVLDNLPIMEVPQAEALLKKQQFSYSELKRLERWQQASAN
MQRLIEQFSE
>P96157 3.2.1.52~~~nagZ~~~Beta-hexosaminidase~~~
MGPLWLDVEGCELTAEDREILAHPTVGGVILFARNYHDNQQLLALNTAIRQAAKRPILIGVDQEGGRVQRFRDGFSKIPA
AQLYARSDNGTQLAEDGGWLMAAELIAHDIDLSFAPVLDKGFDCRAIGNRAFGDDVQTVLTYSSAYMRGMKSVGMATTGK
HFPGHGAVIADSHLETPYDERDSIADDMTIFRAQIEAGILDAMMPAHVIYPHYDAQPASGSPYWLKQVLRQELGFQGIVF
SDDLSMEGAAIMGGPAERAQQSLDAGCDMVLMCNKRESAVAVLDQLPISVVPQAQSLLKQQQFTYRELKATERWKQAYQA
LQRLIDAHS
>P11861 1.13.11.56~~~nahC~~~1,2-dihydroxynaphthalene dioxygenase~~~
MSKQAAVIELGYMGISVKDPDAWKSFAMNMLGLQVLDEGEKDRFYLRMDYWHHRIVVHHSAEDDLEYLGWRVAGKPEFEA
LGQKLIDAGYKIRVCDKVEAQERMVLGLMKTEDPGGNPTEIFWGPRIDMSNPFHPGRPLHGKFVTGDQGLGHCIVRQTDV
AAAHKFYSLLGFRGDVEYRIPLPNGMTAELSFMHCNARDHSIAFGAMPAAKRLNHLMLEYTHMEDLGYTHQQFVKNEIDI
ALQLGIHANDKALTFYGATPSGWLIEPGWRGATAIDEAEYYVGDIFGHGVEAPGYGLDVKLS
>P0A108 1.13.11.56~~~doxG~~~1,2-dihydroxynaphthalene dioxygenase~~~
MSKQAAVIELGYMGISVKDPDAWKSFATDMLGLQVLDEGEKDRFYLRMDYWHHRIVVHHNGQDDLEYLGWRVAGKPEFEA
LGQKLIDAGYKIRICDKVEAQERMVLGLMKTEDPGGNPTEIFWGPRIDMSNPFHPGRPLHGKFVTGDQGLGHCIVRQTDV
AEAHKFYSLLGFRGDVEYRIPLPNGMTAELSFMHCNARDHSIAFGAMPAAKRLNHLMLEYTHMEDLGYTHQQFVKNEIDI
ALQLGIHANDKALTFYGATPSGWLIEPGWRGATAIDEAEYYVGDIFGHGVEATGYGLDVKLS
>Q51948 5.99.1.4~~~nahD~~~2-hydroxychromene-2-carboxylate isomerase~~~
MIVDFYFDFLSPFSYLANQRLSKLAQDYGLTIRYNAIDLARVKIAIGNVGPSNRDLKVKLDYLKVDLQRWAQLYGIPLVF
PANYNSRRMNIGFYYSGAEAQAAAYVNVVFNAVWGEGIAPDLESLPALVSEKLGWDRSAFEHFLSSNAATERYDEQTHAA
IERKVFGVPTMFLGDEMWWGNDRLFMLESAMGRLCRQNADLSS
>Q51947 4.1.2.45~~~nahE~~~Trans-O-hydroxybenzylidenepyruvate hydratase-aldolase~~~
MLNKVIKTTRLTAEDINGAWTIMPTPSTPDASDWRSTNTVDLDETARIVEELIAAGVNGILSMGTFGECATLTWEEKRDY
VSTVVETIRGRVPYFCGTTALNTREVIRQTRELIDIGANGTMLGVPMWVKMDLPTAVQFYRDVAGAVPEAAIAIYANPEA
FKFDFPRPFWAEMSKIPQVVTAKYLGIGMLDLDLKLAPNIRFLPHEDDYYAAARINPERITAFWSSGAMCGPATAIMLRD
EVERAKSTGDWIKAKAISDDMRAADSTLFPRGDFSEFSKYNIGLEKARMDAAGWLKAGPCRPPYNLVPEDYLVGAQKSGK
AWAALHAKYSK
>E8MF12 2.7.1.162~~~nahK~~~N-acetylhexosamine 1-kinase~~~
MTESNEDLFGIASHFALEGAVTGIEPYGDGHINTTYLVTTDGPRYILQQMNTSIFPDTVNLMRNVELVTSTLKAQGKETL
DIVPTTSGATWAEIDGGAWRVYKFIEHTVSYNLVPNPDVFREAGSAFGDFQNFLSEFDASQLTETIAHFHDTPHRFEDFK
AALAADKLGRAAACQPEIDFYLSHADQYAVVMDGLRDGSIPLRVTHNDTKLNNILMDATTGKARAIIDLDTIMPGSMLFD
FGDSIRFGASTALEDEKDLSKVHFSTELFRAYTEGFVGELRGSITAREAELLPFSGNLLTMECGMRFLADYLEGDIYFAT
KYPEHNLVRTRTQIKLVQEMEQKASETRAIVADIMEAAR
>Q9JXM7 3.4.21.-~~~nalP~~~Neisserial autotransporter lipoprotein NalP~~~
MRTTPTFPTKTFKPTAMALAVATTLSACLGGGGGGTSAPDFNAGGTGIGSNSRATTAKSAAVSYAGIKNEMCKDRSMLCA
GRDDVAVTDRDAKINAPPPNLHTGDFPNPNDAYKNLINLKPAIEAGYTGRGVEVGIVDTGESVGSISFPELYGRKEHGYN
ENYKNYTAYMRKEAPEDGGGKDIEASFDDEAVIETEAKPTDIRHVKEIGHIDLVSHIIGGRSVDGRPAGGIAPDATLHIM
NTNDETKNEMMVAAIRNAWVKLGERGVRIVNNSFGTTSRAGTADLFQIANSEEQYRQALLDYSGGDKTDEGIRLMQQSDY
GNLSYHIRNKNMLFIFSTGNDAQAQPNTYALLPFYEKDAQKGIITVAGVDRSGEKFKREMYGEPGTEPLEYGSNHCGITA
MWCLSAPYEASVRFTRTNPIQIAGTSFSAPIVTGTAALLLQKYPWMSNDNLRTTLLTTAQDIGAVGVDSKFGWGLLDAGK
AMNGPASFPFGDFTADTKGTSDIAYSFRNDISGTGGLIKKGGSQLQLHGNNTYTGKTIIEGGSLVLYGNNKSDMRVETKG
ALIYNGAASGGSLNSDGIVYLADTDQSGANETVHIKGSLQLDGKGTLYTRLGKLLKVDGTAIIGGKLYMSARGKGAGYLN
STGRRVPFLSAAKIGQDYSFFTNIETDGGLLASLDSVEKTAGSEGDTLSYYVRRGNAARTASAAAHSAPAGLKHAVEQGG
SNLENLMVELDASESSATPETVETAAADRTDMPGIRPYGATFRAAAAVQHANAADGVRIFNSLAATVYADSTAAHADMQG
RRLKAVSDGLDHNGTGLRVIAQTQQDGGTWEQGGVEGKMRGSTQTVGIAAKTGENTTAAATLGMGRSTWSENSANAKTDS
ISLFAGIRHDAGDIGYLKGLFSYGRYKNSISRSTGADEHAEGSVNGTLMQLGALGGVNVPFAATGDLTVEGGLRYDLLKQ
DAFAEKGSALGWSGNSLTEGTLVGLAGLKLSQPLSDKAVLFATAGVERDLNGRDYTVTGGFTGATAATGKTGARNMPHTR
LVAGLGADVEFGNGWNGLARYSYAGSKQYGNHSGRVGVGYRF
>E6MVD9 3.4.21.-~~~nalP~~~Neisserial autotransporter lipoprotein NalP~~~
MRTTPTFPTKTFKPTAMALAVATTLSACLGGGGGGTSAPDFNAGGTGIGSNSRATTAKSAAVSYAGIKNEMCKDRSMLCA
GRDDVAVTDRDAKINRPPPPNLHTGDFPNPNDAYKNLINLKPAIEAGYTGRGVEVGIVDTGESVGSISFPELYGRKEHGY
NENYKNYTAYMRKEAPEDGGGKDIEASFDDEAVIETEAKPTDIRHVKEIGHIDLVSHIIGGRSVDGRPAGGIAPDATLHI
MNTNDGTKNEMMVAAIRNAWVKLGERGVRIVNNSFGTTSRAGTADLFQIANSEEQYRQALLDYSGGDKTDEGIRLMQQSD
YGNLSYHIRNKNMLFIFSTGNDAQAQPNTYALLPFYEKDAQKGIITVAGVDRSGEKFKREMYGEPGTEPLEYGSNHCGIT
AMWCLSAPYEASVRFTRTNPIQIAGTSFSAPIVTGTAALLLQKYPWMSNDNLRTTLLTTAQDIGAVGVDSKFGWGLLDAG
KAMNGPASFPFGDFTADTKGTSDIAYSFRNDISGTGGLIKKGGSQLQLHGNNTYTGKTIIEGGSLVLYGNNKSDMRVETK
GALIYNGAASGGSLNSDGIVYLADTDQSGANETVHIKGSLQLDGKGTLYTRLGKLLKVDGTAIIGGKLYMSARGKGAGYL
NSTGRRVPFLSAAKIGQDYSFFTNIETDGGLLASLDSVEKTAGSEGDTLSYYVRRGNAARTASAAAHSAPAGLKHAVEQG
GSNLENLMVELDASESSATPETVETAAADRTDMPGIRPYGATFRAAAAVQHANAADGVRIFNSLAATVYADSTAAHADMQ
GRRLKAVSDGLDHNGTGLRVIAQTQQDGGTWEQGGVEGKMRGSTQTVGIAAKTGENTTAAATLGMGRSTWSENSANAKTD
SISLFAGIRHDAGDIGYLKGLFSYGRYKNSISRSTGADEHAEGSVNGTLMQLGALGGVNVPFAATGDLTVEGGLRYDLLK
QDAFAEKGSALGWSGNSLTEGTLVGLAGLKLSQPLSDKAVLFATAGVERDLNGRDYTVTGGFTGATAATGKTGARNMPHT
RLVAGLGADVEFGNGWNGLARYSYAGSKQYGNHSGRVGVGYRF
>P54550 1.6.99.1~~~namA~~~NADPH dehydrogenase~~~COG1902
MARKLFTPITIKDMTLKNRIVMSPMCMYSSHEKDGKLTPFHMAHYISRAIGQVGLIIVEASAVNPQGRITDQDLGIWSDE
HIEGFAKLTEQVKEQGSKIGIQLAHAGRKAELEGDIFAPSAIAFDEQSATPVEMSAEKVKETVQEFKQAAARAKEAGFDV
IEIHAAHGYLIHEFLSPLSNHRTDEYGGSPENRYRFLREIIDEVKQVWDGPLFVRVSASDYTDKGLDIADHIGFAKWMKE
QGVDLIDCSSGALVHADINVFPGYQVSFAEKIREQADMATGAVGMITDGSMAEEILQNGRADLIFIGRELLRDPFFARTA
AKQLNTEIPAPVQYERGW
>Q5KXG9 1.6.99.1~~~namA~~~NADPH dehydrogenase~~~COG1902
MNTMLFSPYTIRGLTLKNRIVMSPMCMYSCDTKDGAVRTWHKIHYPARAVGQVGLIIVEATGVTPQGRISERDLGIWSDD
HIAGLRELVGLVKEHGAAIGIQLAHAGRKSQVPGEIIAPSAVPFDDSSPTPKEMTKADIEETVQAFQNGARRAKEAGFDV
IEIHAAHGYLINEFLSPLSNRRQDEYGGSPENRYRFLGEVIDAVREVWDGPLFVRISASDYHPDGLTAKDYVPYAKRMKE
QGVDLVDVSSGAIVPARMNVYPGYQVPFAELIRREADIPTGAVGLITSGWQAEEILQNGRADLVFLGRELLRNPYWPYAA
ARELGAKISAPVQYERGWRF
>Q6FDK2 2.4.2.12~~~nadV~~~Nicotinamide phosphoribosyltransferase~~~COG1488
MSFRINPLNAIDFYKADHRRQYPEGTEYVYANFTPRSSRLANMLHDFDDKIVFFGLQGFIQHFLIETWNEGFFNQDKATV
VSHYKRRMDTSLGEGAVSVEHIEALHDLGYLPLKIKALPEGSRVNMRVPVLTVINTQAEFFWLTNYIETVLSAELWKSST
TATIAFEYKRLLTQYAVKTGASIENVVVQGHDFSSRGMSGIYDAAQSGVGHLTSFIGTDAVTAIDYAEQYYAASGVVGVS
VPATEHSVMCMGSEENELETFRRLICELYPSGIVSIVSDTWDFWRVLSEFSVKLKQDILNRTPNALGLAKVVFRPDSGDP
VKIICGDPDAEKDTPAYKGAVQLLWEIFGGTYTAQGYKVLHERVGLIYGDSITLQRAEAILQGLEAQGFASSNLVFGIGS
YTYNYMTRDTFGFAVKATWGQVNGVGRELFKDPVTDSGTKKSAQGLLRVERSQDGFTLFDRQSKQQENQGELHTVFENGQ
LCLKSTLDEIRQRLAQQLESFKDSN
>P40407 3.2.1.92~~~namZ~~~Peptidoglycan beta-N-acetylmuramidase NamZ~~~COG3876
MRKTIFAFLTGLMMFGTITAASASPDSKNQTAKKPKVQTGIDTLLPDYKKQLKGKRIGLITNPAGVNTSLKSSVDILYEN
PDIKLTALFGPEHGVRGDAQAGDEVGSYIDEKTGVPVYSLYGKTKKPTPEMLKNVDILMFDIQDVGTRFYTYIYTMAYAM
EAAKENGIPFMVLDRPNPQGGNHIEGPILEPEYASFVGLYPIPLKHGMTIGELASLFNKEFSIDADLTVVKMKHWKRKMD
FDDTRLPFVLPSPNMPTVESTFVYPATGLIEGTNISEGRGTTKPFELIGAPFIKSTELEETLNSLHLPGVTFRAASFTPT
FSKHQGTLCHGVQLYVTDRDKFEAVKTGLSVIKTIHDLYPEDFEFLSTGSFDKLAGNGWIRTKIENGTSVENIINSYEKT
LQQFSKTRKKYLIY
>Q9S4K9 4.1.3.3~~~nanA~~~N-acetylneuraminate lyase~~~
MKGIYSALLVSFDKDGNINEKGLREIIRHNIDVCKIDGLYVGGSTGENFMLSTDEKKRIFEIAMDEAKGQVKLIAQVGSV
NLKEAVELAKFTTDLGYDAISAVTPFYYKFDFNEIKHYYETIINSVDNKLIIYSIPFLTGVNMSIEQFAELFENDKIIGV
KFTAADFYLLERMRKAFPDKLIFAGFDEMMLPATVLGVDGAIGSTFNVNGVRARQIFEAAQKGDIETALEVQHVTNDLIT
DILNNGLYQTIKLILQEQGVDAGYCRQPMKEATEEMIAKAKEINKKYF
>P0A6L4 4.1.3.3~~~nanA~~~N-acetylneuraminate lyase~~~COG0329
MATNLRGVMAALLTPFDQQQALDKASLRRLVQFNIQQGIDGLYVGGSTGEAFVQSLSEREQVLEIVAEEAKGKIKLIAHV
GCVSTAESQQLAASAKRYGFDAVSAVTPFYYPFSFEEHCDHYRAIIDSADGLPMVVYNIPALSGVKLTLDQINTLVTLPG
VGALKQTSGDLYQMEQIRREHPDLVLYNGYDEIFASGLLAGADGGIGSTYNIMGWRYQGIVKALKEGDIQTAQKLQTECN
KVIDLLIKTGVFRGLKTVLHYMDVVSVPLCRKPFGPVDEKYLPELKALAQQLMQERG
>Q8RDN6 4.1.3.3~~~nanA~~~N-acetylneuraminate lyase~~~COG0329
MKGIYSALMVPYNEDGSINEKGLREIIRYNIDKMKVDGLYVGGSTGENFMISTEEKKRVFEIAIDEAKDSVNLIAQVGSI
NLNEAVELGKYVTKLGYKCLSAVTPFYYKFDFSEIKDYYETIVRETGNYMIIYSIPFLTGVNMSLSQFGELFENEKIIGV
KFTAGDFYLLERVRKAFPDKLIFAGFDEMLLPATVLGVDGAIGSTYNINGIRAKQIFELAKNSKIDEALKIQHTTNDLIE
GILSNGLYQTIKEILKLEGVDAGYCRKPMKKISQKQIEFAKELHKKFLKN
>P44539 4.1.3.3~~~nanA~~~N-acetylneuraminate lyase~~~COG0329
MRDLKGIFSALLVSFNEDGTINEKGLRQIIRHNIDKMKVDGLYVGGSTGENFMLSTEEKKEIFRIAKDEAKDQIALIAQV
GSVNLKEAVELGKYATELGYDCLSAVTPFYYKFSFPEIKHYYDTIIAETGNNMIVYSIPFLTGVNMGIEQFGELYKNPKV
LGVKFTAGDFYLLERLKKAYPNHLIWAGFDEMMLPAASLGVDGAIGSTFNVNGVRARQIFELTKAGKLAEALEIQHVTND
LIEGILANGLYLTIKELLKLEGVDAGYCREPMTSKATEEQLAKAKDLKAKFLS
>P59407 4.1.3.3~~~nanA~~~N-acetylneuraminate lyase~~~COG0329
MSKKLLYAAQMTAFDKDGNINLDGIRALVRYNIDVNKVDGLYVCGSTGEAFMLNTDEKKQVMETVYDEANGAIDLVAQVG
SLNLKEAKELAKFATDLGYPKLSAVTPFYYNFTFEQIKDYYNEILKDVDNKLLIYSIPALTGVALTTDQFAELFENPKII
GIKYTNADFYLLERVRNAFPDKLILSGFDEMLLPALALNVDGCIGSTYNLNAPRVREEMDAFEAGDIDKARQLQNISNDM
ITDLIANDIYPTLKLVMKHMGVDAGYVKKPMSHPTPEMEAGATAIYEKYFKN
>Q9CKB0 4.1.3.3~~~nanA~~~N-acetylneuraminate lyase~~~
MKNLKGIFSALLVSFNADGSINEKGLRQIVRYNIDKMKVDGLYVGGSTGENFMLSTEEKKEIFRIAKDEAKDEIALIAQV
GSVNLQEAIELGKYATELGYDSLSAVTPFYYKFSFPEIKHYYDSIIEATGNYMIVYSIPFLTGVNIGVEQFGELYKNPKV
LGVKFTAGDFYLLERLKKAYPNHLIWAGFDEMMLPAASLGVDGAIGSTFNVNGVRARQIFELTQAGKLKEALEIQHVTND
LIEGILANGLYLTIKELLKLDGVEAGYCREPMTKELSPEKVAFAKELKAKYLS
>Q2G160 4.1.3.3~~~nanA~~~N-acetylneuraminate lyase~~~COG0329
MNKDLKGLYAALLVPFDENGQVNEQGLKQIAQNAIETEELDGLYVNGSSGENFLLNTEQKKQVFKVAKEAVGDKVKLIAQ
VGSLDLNEAIELGKYATELGYDALSAVTPFYYPFTFEEIRDYYFDIIEATQNNMIIYAIPDLTGVNISIEQFSELFNHEK
IVGVKYTAPNFFLLERIRKAFPDKLILSGFDEMLVQATISGVDGAIGSTYNVNGRRARKIFDLARQGQIQEAYQLQHDSN
DIIETVLSMGIYPTLKEILRHRGIDAGLPKRPFKPFNEAHRQTLDQLIAKYDL
>P99123 4.1.3.3~~~nanA~~~N-acetylneuraminate lyase~~~
MNKDLKGLYAALLVPFDENGQVNEQGLKQIAQNAIETEELDGLYVNGSSGENFLLNTEQKKQVFKVAKEAVGDKVKLIAQ
VGSLDLNEAIELGKYATELGYDALSAVTPFYYPFTFEEIRDYYFDIIEATQNNMIIYAIPDLTGVNISIEQFSELFNHEK
IVGVKYTAPNFFLLERIRKAFPDKLILSGFDEMLVQATISGVDGAIGSTYNVNGRRARKIFDLARQGQIQEAYQLQHDSN
DIIETVLSMGIYPTLKEILRHRDIDAGLPKRPFKPFNEAHRQTLDQLIAKYDL
>Q6GK01 4.1.3.3~~~nanA~~~N-acetylneuraminate lyase~~~
MNKDLKGLYAALLVPFDENGQVNEQGLKQIAQNAIETEELDGLYVNGSSGENFLLNTEQKKQVFKVAKEAVGDKVKLIAQ
VGSLDLNEAIELGKYATELGYDALSAVTPFYYPFTFEEIRDYYFDIIEATQNNMIIYAIPDLTGVNISIEQFSELFNHEK
IVGVKYTAPNFFLLERIRKAFPDKLILSGFDEMLVQATISGVDGAIGSTYNVNGRRARKIFDLARQGQIQEAYQLQHDSN
DIIETVLSMGIYPTLKEILRHRGIDAGLPKRPFKPFNEAHRQTLDQLIAKYDL
>P62575 3.2.1.18~~~nanA~~~Sialidase A~~~
MSYFRNRDIDIERNSMNRSVQERKCRYSIRKLSVGAVSMIVGAVVFGTSPVLAQEGASEQPLANETQLSGESSTLTDTEK
SQPSSETELSGNKQEQERKDKQEEKIPRDYYARDLENVETVIEKEDVETNASNGQRVDLSSELDKLKKLENATVHMEFKP
DAKAPAFYNLFSVSSATKKDEYFTMAVYNNTATLEGRGSDGKQFYNNYNDAPLKVKPGQWNSVTFTVEKPTAELPKGRVR
LYVNGVLSRTSLRSGNFIKDMPDVTHVQIGATKRANNTVWGSNLQIRNLTVYNRALTPEEVQKRSQLFKRSDLEKKLPEG
AALTEKTDIFESGRNGKPNKDGIKSYRIPALLKTDKGTLIAGADERRLHSSDWGDIGMVIRRSEDNGKTWGDRVTITNLR
DNPKASDPSIGSPVNIDMVLVQDPETKRIFSIYDMFPEGKGIFGMSSQKEEAYKKIDGKTYQILYREGEKGAYTIRENGT
VYTPDGKATDYRVVVDPVKPAYSDKGDLYKGNQLLGNIYFTTNKTSPFRIAKDSYLWMSYSDDDGKTWSAPQDITPMVKA
DWMKFLGVGPGTGIVLRNGPHKGRILIPVYTTNNVSHLNGSQSSRIIYSDDHGKTWHAGEAVNDNRQVDGQKIHSSTMNN
RRAQNTESTVVQLNNGDVKLFMRGLTGDLQVATSKDGGVTWEKDIKRYPQVKDVYVQMSAIHTMHEGKEYIILSNAGGPK
RENGMVHLARVEENGELTWLKHNPIQKGEFAYNSLQELGNGEYGILYEHTEKGQNAYTLSFRKFNWDFLSKDLISPTEAK
VKRTREMGKGVIGLEFDSEVLVNKAPTLQLANGKTARFMTQYDTKTLLFTVDSEDMGQKVTGLAEGAIESMHNLPVSVAG
TKLSNGMNGSEAAVHEVPEYTGPLGTSGEEPAPTVEKPEYTGPLGTSGEEPAPTVEKPEYTGPLGTAGEEAAPTVEKPEF
TGGVNGTEPAVHEIAEYKGSDSLVTLTTKEDYTYKAPLAQQALPETGNKESDLLASLGLTAFFLGLFTLGKKREQ
>P62576 3.2.1.18~~~nanA~~~Sialidase A~~~COG4409
MSYFRNRDIDIERNSMNRSVQERKCRYSIRKLSVGAVSMIVGAVVFGTSPVLAQEGASEQPLANETQLSGESSTLTDTEK
SQPSSETELSGNKQEQERKDKQEEKIPRDYYARDLENVETVIEKEDVETNASNGQRVDLSSELDKLKKLENATVHMEFKP
DAKAPAFYNLFSVSSATKKDEYFTMAVYNNTATLEGRGSDGKQFYNNYNDAPLKVKPGQWNSVTFTVEKPTAELPKGRVR
LYVNGVLSRTSLRSGNFIKDMPDVTHVQIGATKRANNTVWGSNLQIRNLTVYNRALTPEEVQKRSQLFKRSDLEKKLPEG
AALTEKTDIFESGRNGKPNKDGIKSYRIPALLKTDKGTLIAGADERRLHSSDWGDIGMVIRRSEDNGKTWGDRVTITNLR
DNPKASDPSIGSPVNIDMVLVQDPETKRIFSIYDMFPEGKGIFGMSSQKEEAYKKIDGKTYQILYREGEKGAYTIRENGT
VYTPDGKATDYRVVVDPVKPAYSDKGDLYKGNQLLGNIYFTTNKTSPFRIAKDSYLWMSYSDDDGKTWSAPQDITPMVKA
DWMKFLGVGPGTGIVLRNGPHKGRILIPVYTTNNVSHLNGSQSSRIIYSDDHGKTWHAGEAVNDNRQVDGQKIHSSTMNN
RRAQNTESTVVQLNNGDVKLFMRGLTGDLQVATSKDGGVTWEKDIKRYPQVKDVYVQMSAIHTMHEGKEYIILSNAGGPK
RENGMVHLARVEENGELTWLKHNPIQKGEFAYNSLQELGNGEYGILYEHTEKGQNAYTLSFRKFNWDFLSKDLISPTEAK
VKRTREMGKGVIGLEFDSEVLVNKAPTLQLANGKTARFMTQYDTKTLLFTVDSEDMGQKVTGLAEGAIESMHNLPVSVAG
TKLSNGMNGSEAAVHEVPEYTGPLGTSGEEPAPTVEKPEYTGPLGTSGEEPAPTVEKPEYTGPLGTAGEEAAPTVEKPEF
TGGVNGTEPAVHEIAEYKGSDSLVTLTTKEDYTYKAPLAQQALPETGNKESDLLASLGLTAFFLGLFTLGKKREQ
>Q54727 3.2.1.18~~~nanB~~~Sialidase B~~~COG4409
MNKRGLYSKLGISVVGISLLMGVPTLIHANELNYGQLSISPIFQGGSYQLNNKSIDISSLLLDKLSGESQTVVMKFKADK
PNSLQALFGLSNSKAGFKNNYFSIFMRDSGEIGVEIRDAQKGINYLFSRPASLWGKHKGQAVENTLVFVSDSKDKTYTMY
VNGIEVFSETVDTFLPISNINGIDKATLGAVNREGKEHYLAKGSIDEISLFNKAISDQEVSTIPLSNPFQLIFQSGDSTQ
ANYFRIPTLYTLSSGRVLSSIDARYGGTHDSKSKINIATSYSDDNGKTWSEPIFAMKFNDYEEQLVYWPRDNKLKNSQIS
GSASFIDSSIVEDKKSGKTILLADVMPAGIGNNNANKADSGFKEINGHYYLKLKKNGDNDFRYTVRENGVVYNETTNKPT
NYTINDKYEVLEGGKSLTVEQYSVDFDSGSLRERHNGKQVPMNVFYKDSLFKVTPTNYIAMTTSQNRGESWEQFKLLPPF
LGEKHNGTYLCPGQGLALKSSNRLIFATYTSGELTYLISDDSGQTWKKSSASIPFKNATAEAQMVELRDGVIRTFFRTTT
GKIAYMTSRDSGETWSKVSYIDGIQQTSYGTQVSAIKYSQLIDGKEAVILSTPNSRSGRKGGQLVVGLVNKEDDSIDWKY
HYDIDLPSYGYAYSAITELPNHHIGVLFEKYDSWSRNELHLSNVVQYIDLEINDLTK
>P69856 ~~~nanC~~~Probable N-acetylneuraminic acid outer membrane channel protein NanC~~~COG1452
MKKAKILSGVLLLCFSSPLISQAATLDVRGGYRSGSHAYETRLKVSEGWQNGWWASMESNTWNTIHDNKKENAALNDVQV
EVNYAIKLDDQWTVRPGMLTHFSSNGTRYGPYVKLSWDATKDLNFGIRYRYDWKAYRQQDLSGDMSRDNVHRWDGYVTYH
INSDFTFAWQTTLYSKQNDYRYANHKKWATENAFVLQYHMTPDITPYIEYDYLDRQGVYNGRDNLSENSYRIGVSFKL
>Q8ZLQ7 5.1.3.9~~~nanE2~~~Putative N-acetylmannosamine-6-phosphate 2-epimerase 2~~~
MSLLEQLDKNIAASGGLIVSCQPVPGSPLDKPEIVAAMALAAEQAGAVAVRIEGIDNLRMTRSLVSVPIIGIIKRDLDES
PVRITPFLDDVDALAQAGAAIIAVDGTARQRPVAVEALLARIHHHHLLAMADCSSVDDGLACQRLGADIIGTTMSGYTTP
DTPEEPDLPLVKALHDAGCRVIAEGRYNSPALAAEAIRYGAWAVTVGSAITRLEHICGWYNDALKKAAS
>Q0TUP9 5.1.3.9~~~nanE~~~Putative N-acetylmannosamine-6-phosphate 2-epimerase~~~COG3010
MLDVVKGNLIVSCQALSDEPLHSSFIMGRMAIAAKQGGAAAIRAQGVNDINEIKEVTKLPIIGIIKRNYDDSEIYITPTM
KEVDELLKTDCEMIALDATKRKRPNGENVKDLVDAIHAKGRLAMADISTLEEGIEAEKLGFDCVSTTLSGYTPYSKQSNS
VDFELLEELVKTVKIPVICEGRINTPEELKKALDLGAYSAVVGGAITRPQQITKRFTDILK
>Q8XNZ3 5.1.3.9~~~nanE~~~Putative N-acetylmannosamine-6-phosphate 2-epimerase~~~
MLDVVKGNLIVSCQALSDEPLHSSFIMGRMAIAAKQGGAAAIRAQGVNDINEIKEVTKLPIIGIIKRNYDDSEIYITPTM
KEVDELLKTDCEMIALDATKRKRPNGENVKDLVDAIHAKGRLAMADISTLEEGIEAEKLGFDCVSTTLSGYTPYSKQSNS
VDFELLEELVKTVKIPVICEGRINTPEELKKALDLGAYSAVVGGAITRPQQITKRFTDILK
>P0A761 5.1.3.9~~~nanE~~~Putative N-acetylmannosamine-6-phosphate 2-epimerase~~~COG3010
MSLLAQLDQKIAANGGLIVSCQPVPDSPLDKPEIVAAMALAAEQAGAVAIRIEGVANLQATRAVVSVPIIGIVKRDLEDS
PVRITAYIEDVDALAQAGADIIAIDGTDRPRPVPVETLLARIHHHGLLAMTDCSTPEDGLACQKLGAEIIGTTLSGYTTP
ETPEEPDLALVKTLSDAGCRVIAEGRYNTPAQAADAMRHGAWAVTVGSAITRLEHICQWYNTAMKKAVL
>Q8RDN5 5.1.3.9~~~nanE~~~Putative N-acetylmannosamine-6-phosphate 2-epimerase~~~COG3010
MNKILESIRGKLIVSCQALEDEPLHSSFIMGRMAYAAYSGGAAGIRANTVEDIKEIKKNVSLPIIGIIKKVYNNSDVYIT
PTIKEVEDLINEGVQIIAIDATKRERPDRKDLKNFIAEIKEKYPNQLFMADISSVDEALYAEKIGFDIVGTTLVGYTDYT
KNYKALEELKKVVKVVKIPVIAEGNIDTPLKAKKALEIGAFAVVVGGAITRPQQITKKFVDEMK
>P60668 5.1.3.9~~~nanE~~~Putative N-acetylmannosamine-6-phosphate 2-epimerase~~~COG3010
MSLLARLEQSVHENGGLIVSCQPVPGSPMDKPEIVAAMAQAAASAGAVAVRIEGIENLRTVRPHLSVPIIGIIKRDLTGS
PVRITPYLQDVDALAQAGADIIAFDASFRSRPVDIDSLLTRIRLHGLLAMADCSTVNEGISCHQKGIEFIGTTLSGYTGP
ITPVEPDLAMVTQLSHAGCRVIAEGRYNTPALAANAIEHGAWAVTVGSAITRIEHICQWFSHAVKR
>Q2G157 5.1.3.9~~~nanE~~~Putative N-acetylmannosamine-6-phosphate 2-epimerase~~~COG3010
MLPHGLIVSCQALPDEPLHSSFIMSKMALAAYEGGAVGIRANTKEDILAIKETVDLPVIGIVKRDYDHSDVFITATSKEV
DELIESQCEVIALDATLQQRPKETLDELVSYIRTHAPNVEIMADIATVEEAKNAARLGFDYIGTTLHGYTSYTQGQLLYQ
NDFQFLKDVLQSVDAKVIAEGNVITPDMYKRVMDLGVHCSVVGGAITRPKEITKRFVQIMED
>P65517 5.1.3.9~~~nanE~~~Putative N-acetylmannosamine-6-phosphate 2-epimerase~~~
MLPHGLIVSCQALADEPLHSSFIMSKMALAAYEGGAVGIRANTKEDILAIKETVDLPVIGIVKRDYDHSDVFITATSKEV
DELIESQCEVIALDATLQQRPKETLDELVSYIRTHAPNVEIMADIATVEEAKNAARLGFDYIGTTLHGYTSYTQGQLLYQ
NDFQFLKDVLQSVDAKVIAEGNVITPDMYKRVMDLGVHCSVVGGAITRPKEITKRFVQVMED
>P65522 5.1.3.9~~~nanE~~~Putative N-acetylmannosamine-6-phosphate 2-epimerase~~~
MPDKPTKEKLMEQLKGGIIVSCQALPGEPLYSETGGIMPLMAKAAQEAGAVGIRANSVRDIKEIQAITDLPIIGIIKKDY
PPQEPFITATMTEVDQLAALNIAVIAMDCTKRDRHDGLDIASFIRQVKEKYPNQLLMADISTFDEGLVAHQAGIDFVGTT
LSGYTPYSRQEAGPDVALIEALCKAGIAVIAEGKIHSPEEAKKINDLGVAGIVVGGAITRPKEIAERFIEALKS
>Q9KR62 5.1.3.9~~~nanE~~~Putative N-acetylmannosamine-6-phosphate 2-epimerase~~~COG3010
MRPVVRKNFLNIEELKRFLNGQTVVSIQPVTGSPLDKTDFIVAMAIAVEQAGAKALRIEGVNNVAAVSAAVTIPIIGIVK
RDLPDSPIRITPFVSDVDGLANAGATVIAFDATDRTRPESRERIAQAIKNTGCFAMADCSTFEDGLWANSQGVEIVGSTL
SGYVGDIEPTVPDFQLVKAFSEAGFFTMAEGRYNTPELAAKAIESGAVAVTVGSALTRLEVVTQWFNNATQAAGERKCAH
>P10481 3.2.1.18~~~nanH~~~Sialidase~~~
MCNKNNTFEKNLDISHKPEPLILFNKDNNIWNSKYFRIPNIQLLNDGTILTFSDIRYNGPDDHAYIDIASARSTDFGKTW
SYNIAMKNNRIDSTYSRVMDSTTVITNTGRIILIAGSWNTNGNWAMTTSTRRSDWSVQMIYSDDNGLTWSNKIDLTKDSS
KVKNQPSNTIGWLGGVGSGIVMDDGTIVMPAQISLRENNENNYYSLIIYSKDNGETWTMGNKVPNSNTSENMVIELDGAL
IMSTRYDYSGYRAAYISHDLGTTWEIYEPLNGKILTGKGSGCQGSFIKATTSNGHRIGLISAPKNTKGEYIRDNIAVYMI
DFDDLSKGVQEICIPYPEDGNKLGGGYSCLSFKNNHLGIVYEANGNIEYQDLTPYYSLINKQ
>Q02834 3.2.1.18~~~nedA~~~Sialidase~~~
MTANPYLRRLPRRRAVSFLLAPALAAATVAGASPAQAIAGAPVPPGGEPLYTEQDLAVNGREGFPNYRIPALTVTPDGDL
LASYDGRPTGIDAPGPNSILQRRSTDGGRTWGEQQVVSAGQTTAPIKGFSDPSYLVDRETGTIFNFHVYSQRQGFAGSRP
GTDPADPNVLHANVATSTDGGLTWSHRTITADITPDPGWRSRFAASGEGIQLRYGPHAGRLIQQYTIINAAGAFQAVSVY
SDDHGRTWRAGEAVGVGMDENKTVELSDGRVLLNSRDSARSGYRKVAVSTDGGHSYGPVTIDRDLPDPTNNASIIRAFPD
APAGSARAKVLLFSNAASQTSRSQGTIRMSCDDGQTWPVSKVFQPGSMSYSTLTALPDGTYGLLYEPGTGIRYANFNLAW
LGGICAPFTIPDVALEPGQQVTVPVAVTNQSGIAVPKPSLQLDASPDWQVQGSVEPLMPGRQAKGQVTITVPAGTTPGRY
RVGATLRTSAGNASTTFTVTVGLLDQARMSIADVDSEETAREDGRASNVIDGNPSTFWHTEWSRADAPGYPHRISLDLGG
THTISGLQYTRRQNSANEQVADYEIYTSLNGTTWDGPVASGRFTTSLAPQRAVFPARDARYIRLVALSEQTGHKYAAVAE
LEVEGQR
>P15698 3.2.1.18~~~~~~Sialidase~~~COG4409
MKKFIKILKVLSMAIVLSACNINGIFASNLNTTNEPQKTTVFNKNDNTWNAQYFRIPSLQTLADGTMLAFSDIRYNGAED
HAYIDIGAAKSTDNGQTWDYKTVMENDRIDSTFSRVMDSTTVVTDTGRIILIAGSWNKNGNWASSTTSLRSDWSVQMVYS
DDNGETWSDKVDLTTNKARIKNQPSNTIGWLAGVGSGIVMSDGTIVMPIQIALRENNANNYYSSVIYSKDNGETWTMGNK
VPDPKTSENMVIELDGALIMSSRNDGKNYRASYISYDMGSTWEVYDPLHNKISTGNGSGCQGSFIKVTAKDGHRLGFISA
PKNTKGGYVRDNITVYMIDFDDLSKGIRELCSPYPEDGNSSGGGYSCLSFNDGKLSILYEANGNIEYKDLTDYYLSIENN
KKLK
>P29768 3.2.1.18~~~nanH~~~Sialidase~~~
MTVEKSVVFKAEGEHFTDQKGNTIVGSGSGGTTKYFRIPAMCTTSKGTIVVFADARHNTASDQSFIDTAAARSTDGGKTW
NKKIAIYNDRVNSKLSRVMDPTCIVANIQGRETILVMVGKWNNNDKTWGAYRDKAPDTDWDLVLYKSTDDGVTFSKVETN
IHDIVTKNGTISAMLGGVGSGLQLNDGKLVFPVQMVRTKNITTVLNTSFIYSTDGITWSLPSGYCEGFGSENNIIEFNAS
LVNNIRNSGLRRSFETKDFGKTWTEFPPMDKKVDNRNHGVQGSTITIPSGNKLVAAHSSAQNKNNDYTRSDISLYAHNLY
SGEVKLIDDFYPKVGNASGAGYSCLSYRKNVDKETLYVVYEANGSIEFQDLSRHLPVIKSYN
>A5F7A4 3.2.1.18~~~nanH~~~Sialidase~~~COG4409
MRFKNVKKTALMLAMFGMATSSNAALFDYNATGDTEFDSPAKQGWMQDNTNNGSGVLTNADGMPAWLVQGIGGRAQWTYS
LSTNQHAQASSFGWRMTTEMKVLSGGMITNYYANGTQRVLPIISLDSSGNLVVEFEGQTGRTVLATGTAATEYHKFELVF
LPGSNPSASFYFDGKLIRDNIQPTASKQNMIVWGNGSSNTDGVAAYRDIKFEIQGDVIFRGPDRIPSIVASSVTPGVVTA
FAEKRVGGGDPGALSNTNDIITRTSRDGGITWDTELNLTEQINVSDEFDFSDPRPIYDPSSNTVLVSYARWPTDAAQNGD
RIKPWMPNGIFYSVYDVASGNWQAPIDVTDQVKERSFQIAGWGGSELYRRNTSLNSQQDWQSNAKIRIVDGAANQIQVAD
GSRKYVVTLSIDESGGLVANLNGVSAPIILQSEHAKVHSFHDYELQYSALNHTTTLFVDGQQITTWAGEVSQENNIQFGN
ADAQIDGRLHVQKIVLTQQGHNLVEFDAFYLAQQTPEVEKDLEKLGWTKIKTGNTMSLYGNASVNPGPGHGITLTRQQNI
SGSQNGRLIYPAIVLDRFFLNVMSIYSDDGGSNWQTGSTLPIPFRWKSSSILETLEPSEADMVELQNGDLLLTARLDFNQ
IVNGVNYSPRQQFLSKDGGITWSLLEANNANVFSNISTGTVDASITRFEQSDGSHFLLFTNPQGNPAGTNGRQNLGLWFS
FDEGVTWKGPIQLVNGASAYSDIYQLDSENAIVIVETDNSNMRILRMPITLLKQKLTLSQN
>P0C6E9 3.2.1.18~~~nanH~~~Sialidase~~~COG4409
MRFKNVKKTALMLAMFGMATSSNAALFDYNATGDTEFDSPAKQGWMQDNTNNGSGVLTNADGMPAWLVQGIGGRAQWTYS
LSTNQHAQASSFGWRMTTEMKVLSGGMITNYYANGTQRVLPIISLDSSGNLVVEFEGQTGRTVLATGTAATEYHKFELVF
LPGSNPSASFYFDGKLIRDNIQPTASKQNMIVWGNGSSNTDGVAAYRDIKFEIQGDVIFRGPDRIPSIVASSVTPGVVTA
FAEKRVGGGDPGALSNTNDIITRTSRDGGITWDTELNLTEQINVSDEFDFSDPRPIYDPSSNTVLVSYARWPTDAAQNGD
RIKPWMPNGIFYSVYDVASGNWQAPIDVTDQVKERSFQIAGWGGSELYRRNTSLNSQQDWQSNAKIRIVDGAANQIQVAD
GSRKYVVTLSIDESGGLVANLNGVSAPIILQSEHAKVHSFHDYELQYSALNHTTTLFVDGQQITTWAGEVSQENNIQFGN
ADAQIDGRLHVQKIVLTQQGHNLVEFDAFYLAQQTPEVEKDLEKLGWTKIKTGNTMSLYGNASVNPGPGHGITLTRQQNI
SGSQNGRLIYPAIVLDRFFLNVMSIYSDDGGSNWQTGSTLPIPFRWKSSSILETLEPSEADMVELQNGDLLLTARLDFNQ
IVNGVNYSPRQQFLSKDGGITWSLLEANNANVFSNISTGTVDASITRFEQSDGSHFLLFTNPQGNPAGTNGRQNLGLWFS
FDEGVTWKGPIQLVNGASAYSDIYQLDSENAIVIVETDNSNMRILRMPITLLKQKLTLSQN
>P45425 2.7.1.60~~~nanK~~~N-acetylmannosamine kinase~~~COG1940
MTTLAIDIGGTKLAAALIGADGQIRDRRELPTPASQTPEALRDALSALVSPLQAHAQRVAIASTGIIRDGSLLALNPHNL
GGLLHFPLVKTLEQLTNLPTIAINDAQAAAWAEFQALDGDITDMVFITVSTGVGGGVVSGCKLLTGPGGLAGHIGHTLAD
PHGPVCGCGRTGCVEAIASGRGIAAAAQGELAGADAKTIFTRAGQGDEQAQQLIHRSARTLARLIADIKATTDCQCVVVG
GSVGLAEGYLALVETYLAQEPAAFHVDLLAAHYRHDAGLLGAALLAQGEKL
>Q4QP43 2.7.1.60~~~nanK~~~N-acetylmannosamine kinase~~~
MRCLALDIGGTKIAAAIVKNGEIEQRQQIHTPRENVVEGMHQALGKLLADYEGQFDYVAVASTGIINNGILSALNPKNLG
GLAEFPLKASIAKHTDKPIGLLNDAQAATYAEYQLQNFEQVSNFVFITVSTGVGGGIVLNQILQTGSRGIAGHIGHTLAD
PNGAICGCGRRGCVEAIASGRAIEAVSSQWEDPCDPKEVFERFRKNDEKATALVERSAKAIANLIADLVISLDIQKIAIG
GSVGLAEGYLSLVEKYLQDFPSIYCCEIETAKFGQDAGLIGAAYWVKDVLLDKPEGTIYG
>P44541 2.7.1.60~~~nanK~~~N-acetylmannosamine kinase~~~COG1940
MRCLALDIGGTKIAAAIVKNGEIEQRQQIHTPRENVVEGMHQALGKLLADYEGQFDYVAVASTGIINNGILSALNPKNLG
GLAEFPLKASIAKHTDKPIGLLNDAQAATYAEYQLQNSEQVSNFVFITVSTGVGGGIVLNQILQTGSRGIAGHIGHTLAD
PNGAICGCGRRGCVEAIASGRAIEAVSSQWEEPCDPKEVFERFRKNDEKATALVERSAKAIANLIADLVISLDIQKIAIG
GSVGLAEGYLSLVEKYLQDFPSIYCCEIETAKFGQDAGLIGAAYWVKDVLLDKPEGTIYG
>Q9CKB3 2.7.1.60~~~nanK~~~N-acetylmannosamine kinase~~~
MRCLALDIGGTKIASAIVTDGKIEQRQQIATPQADAANAMHDTLANILALYAGQFDYVAVASTGIINHGVLTALNPKNLG
GLAEFPLKESIARHTDKPIGLLNDVQAAACAEYKDEDKNAVQNFVFITVSTGVGGGIILERRLLTEPNGVAGHIGHTLAD
PNGPVCGCGRVGCVEAVAAGRAIEAVSSQWNPPCTPKQAFELFRKNDEKATALIQRSASAIANLIADLVIGLDVQKVVVG
GSVGLAEGYLPLVKQYLNMMPHFYHCTVEQARHGQDAGLLGAAWWVADCLKQGVHLK
>P39371 5.1.3.24~~~nanM~~~N-acetylneuraminate epimerase~~~COG3055
MNKTITALAIMMASFAANASVLPETPVPFKSGTGAIDNDTVYIGLGSAGTAWYKLDTQAKDKKWTALAAFPGGPRDQATS
AFIDGNLYVFGGIGKNSEGLTQVFNDVHKYNPKTNSWVKLMSHAPMGMAGHVTFVHNGKAYVTGGVNQNIFNGYFEDLNE
AGKDSTAIDKINAHYFDKKAEDYFFNKFLLSFDPSTQQWSYAGESPWYGTAGAAVVNKGDKTWLINGEAKPGLRTDAVFE
LDFTGNNLKWNKLAPVSSPDGVAGGFAGISNDSLIFAGGAGFKGSRENYQNGKNYAHEGLKKSYSTDIHLWHNGKWDKSG
ELSQGRAYGVSLPWNNSLLIIGGETAGGKAVTDSVLITVKDNKVTVQN
>P44544 5.1.3.24~~~nanM~~~N-acetylneuraminate epimerase~~~COG3055
MKLTKTALCTALFATFTFSANAQTYPDLPVGIKGGTGALIGDTVYVGLGSGGDKFYTLDLKDPSAQWKEIATFPGGERNQ
PVAAAVDGKLYVFGGLQKNEKGELQLVNDAYRYNPSDNTWMKLPTRSPRGLVGSSGASHGDKVYILGGSNLSIFNGFFQD
TVAAGEDKAKKDEIAAAYFDQRPEDYFFTTELLSYEPSTNKWRNEGRIPFSGRAGAAFTIQGNELVVVNGEIKPGLRTAE
THQGKFTAKGVQWKNLPDLPAPKGKSQDGLAGALSGYSNGHYLVTGGANFPGSIKQFKEGKLHAHKGLSKAWHNEVYTLN
NGKWRIVGELPMNIGYGFSVSYNNKVLLIGGETDGGKALTSVKAISYDGKKLTIE
>P45424 ~~~nanQ~~~N-acetylneuraminate anomerase NanQ~~~COG2731
MMMGEVQSLPSAGLHPALQDALTLALAARPQEKAPGRYELQGDNIFMNVMTFNTQSPVEKKAELHEQYIDIQLLLNGEER
ILFGMAGTARQCEEFHHEDDYQLCSTIDNEQAIILKPGMFAVFMPGEPHKPGCVVGEPGEIKKVVVKVKADLMA
>P44583 ~~~nanQ~~~N-acetylneuraminate anomerase~~~COG2731
MIISSLTNPNFKVGLPKVIAEVCDYLNTLDLNALENGRHDINDQIYMNVMEPETAEPSSKKAELHHEYLDVQVLIRGTEN
IEVGATYPNLSKYEDYNEADDYQLCADIDDKFTVTMKPKMFAVFYPYEPHKPCCVVNGKTEKIKKLVVKVPVKLI
>P0A8W0 ~~~nanR~~~HTH-type transcriptional repressor NanR~~~COG2186
MGLMNAFDSQTEDSSPAIGRNLRSRPLARKKLSEMVEEELEQMIRRREFGEGEQLPSERELMAFFNVGRPSVREALAALK
RKGLVQINNGERARVSRPSADTIIGELSGMAKDFLSHPGGIAHFEQLRLFFESSLVRYAAEHATDEQIDLLAKALEINSQ
SLDNNAAFIRSDVDFHRVLAEIPGNPIFMAIHVALLDWLIAARPTVTDQALHEHNNVSYQQHIAIVDAIRRHDPDEADRA
LQSHLNSVSATWHAFGQTTNKKK
>P39370 3.1.1.-~~~nanS~~~Probable 9-O-acetyl-N-acetylneuraminic acid deacetylase~~~
MNAIISPDYYYVLTVAGQSNAMAYGEGLPLPDREDAPHPRIKQLARFAHTHPGGPPCHFNDIIPLTHCPHDVQDMQGYHH
PLATNHQTQYGTVGQALHIARKLLPFIPDNAGILIVPCCRGGSAFTAGSEGTYSERHGASHDACRWGTDTPLYQDLVSRT
RAALAKNPQNKFLGACWMQGEFDLMTSDYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWYWKENFPHSYE
AIYGNYQNNVLANIIFVDFQQQGERGLTNAPDEDPDDLSTGYYGSAYRSPENWTTALRSSHFSTAARRGIISDRFVEAIL
QFWRER
>P41036 ~~~nanT~~~Sialic acid transporter NanT~~~COG2814
MSTTTQNIPWYRHLNRAQWRAFSAAWLGYLLDGFDFVLIALVLTEVQGEFGLTTVQAASLISAAFISRWFGGLMLGAMGD
RYGRRLAMVTSIVLFSAGTLACGFAPGYITMFIARLVIGMGMAGEYGSSATYVIESWPKHLRNKASGFLISGFSVGAVVA
AQVYSLVVPVWGWRALFFIGILPIIFALWLRKNIPEAEDWKEKHAGKAPVRTMVDILYRGEHRIANIVMTLAAATALWFC
FAGNLQNAAIVAVLGLLCAAIFISFMVQSAGKRWPTGVMLMVVVLFAFLYSWPIQALLPTYLKTDLAYNPHTVANVLFFS
GFGAAVGCCVGGFLGDWLGTRKAYVCSLLASQLLIIPVFAIGGANVWVLGLLLFFQQMLGQGIAGILPKLIGGYFDTDQR
AAGLGFTYNVGALGGALAPIIGALIAQRLDLGTALASLSFSLTFVVILLIGLDMPSRVQRWLRPEALRTHDAIDGKPFSG
AVPFGSAKNDLVKTKS
>P37061 1.6.3.4~~~nox~~~NADH oxidase~~~COG0446
MKVVVVGCTHAGTSAVKSILANHPEAEVTVYERNDNISFLSCGIALYVGGVVKNAADLFYSNPEELASLGATVKMEHNVE
EINVDDKTVTAKNLQTGATETVSYDKLVMTTGSWPIIPPIPGIDAENILLCKNYSQANVIIEKAKDAKRVVVVGGGYIGI
ELVEAFVESGKQVTLVDGLDRILNKYLDKPFTDVLEKELVDRGVNLALGENVQQFVADEQGKVAKVITPSQEFEADMVIM
CVGFRPNTELLKDKVDMLPNGAIEVNEYMQTSNPDIFAAGDSAVVHYNPSQTKNYIPLATNAVRQGMLVGRNLTEQKLAY
RGTQGTSGLYLFGWKIGSTGVTKESAKLNGLDVEATVFEDNYRPEFMPTTEKVLMELVYEKGTQRIVGGQLMSKYDITQS
ANTLSLAVQNKMTVEDLAISDFFFQPHFDRPWNYLNLLAQAALENM
>A2RIB7 1.6.3.4~~~noxE~~~NADH oxidase~~~COG0446
MKIVVIGTNHAGIATANTLLEQYPGHEIVMIDRNSNMSYLGCGTAIWVGRQIEKPDELFYAKAEDFEAKGVKILTETEVS
EIDFANKKVYAKTKSDDEIIEAYDKLVLATGSRPIIPNLPGKDLKGIHFLKLFQEGQAIDAEFAKEKVKRIAVIGAGYIG
TEIAEAAKRRGKEVLLFDAENTSLASYYDEEFAKGMDENLAQHGIELHFGELAKEFKANEEGYVSQIVTNKATYDVDLVI
NCIGFTANSALASDKLATFKNGAIKVDKHQQSSDPDVYAVGDVATIYSNALQDFTYIALASNAVRSGIVAGHNIGGKELE
SVGVQGSNGISIFGYNMTSTGLSVKAAKKLGLEVSFSDFEDKQKAWFLHENNDSVKIRIVYETKSRRIIGAQLASKSEII
AGNINMFSLAIQEKKTIDELALLDLFFLPHFNSPYNYMTVAALNAK
>O84925 1.6.3.4~~~nox~~~NADH oxidase~~~
MSKIVVVGANHAGTACINTMLDNFGNENEIVVFDQNSNISFLGCGMALWIGEQIDGAEGLFYSDKEKLEAKGAKVYMNSP
VLSIDYDNKVVTAEVEGKEHKESYEKLIFATGSTPILPPIEGVEIVKGNREFKATLENVQFVKLYQNAEEVINKLSDKSQ
HLDRIAVVGGGYIGVELAEAFERLGKEVVLVDIVDTVLNGYYDKDFTQMMAKNLEDHNIRLALGQTVKAIEGDGKVERLI
TDKESFDVDMVILAVGFRPNTALADGKIELFRNGAFLVDKKQETSIPGVYAVGDCATVYDNARKDTSYIALASNAVRTGI
VGAYNACGHELEGIGVQGSNGISIYGLHMVSTGLTLEKAKAAGYNATETGFNDLQKPEFMKHDNHEVAIKIVFDKDSREI
LGAQMVSHDIAISMGIHMFSLAIQEHVTIDKLALTDLFFLPHFNKPYNYITMAALTAEK
>Q5XC60 1.6.3.4~~~~~~NADH oxidase~~~
MSKIVVVGANHAGTACIKTMLTNYGDANEIVVFDQNSNISFLGCGMALWIGEQIAGPEGLFYSDKEELESLGAKVYMESP
VQSIDYDAKTVTALVDGKNHVETYDKLIFATGSQPILPPIKGAEIKEGSLEFEATLENLQFVKLYQNSADVIAKLENKDI
KRVAVVGAGYIGVELAEAFQRKGKEVVLIDVVDTCLAGYYDRDLTDLMAKNMEEHGIQLAFGETVKEVAGNGKVEKIITD
KNEYDVDMVILAVGFRPNTTLGNGKIDLFRNGAFLVNKRQETSIPGVYAIGDCATIYDNATRDTNYIALASNAVRTGIVA
AHNACGTDLEGIGVQGSNGISIYGLHMVSTGLTLEKAKRLGFDAAVTEYTDNQKPEFIEHGNFPVTIKIVYDKDSRRILG
AQMAAREDMSMGIHMFSLAIQEGVTIEKLALTDIFFLPHFNKPYNYITMAALGAKD
>Q8DP70 1.6.3.4~~~nox~~~NADH oxidase~~~COG0446
MSKIVVVGANHAGTACINTMLDNFGNENEIVVFDQNSNISFLGCGMALWIGEQIDGAEGLFYSDKEKLEAKGAKVYMNSP
VLSIDYDAKVVTAEVEGKEHKESYEKLIFATGSTPILPPIEGVEIVKGNREFKATLENVQFVKLYQNAEEVINKLSDKSQ
HLDRIAVVGGGYIGVELAEAFERLGKEVVLVDIVDTVLNGYYDKDFTQMMAKNLEDHNIRLALGQTVKAIEGDGKVERLI
TDKESFDVDMVILAVGFRPNTALADGKIELFRNGAFLVDKKQETSIPGVYAVGDCATVYDNARKDTSYIALASNAVRTGI
VGAYNACGHELEGIGVQGSNGISIYGLHMVSTGLTLEKAKAAGYNATETGFNDLQKPEFMKHDNHEVAIKIVFDKDSREI
LGAQMVSHDIAISMGIHMFSLAIQEHVTIDKLALTDLFFLPHFNKPYNYITMAALTAEK
>Q53176 1.9.6.1~~~napA~~~Periplasmic nitrate reductase~~~
MTLTRRDLIKAQAAATAAAAAGLPVSALAQPVTGGAEALRIRWSKAPCRFCGTGCGVMVGTRDGQVVATHGDTQAEVNRG
LNCVKGYFLSKIMYGEDRLTTPLLRMKDGVYHKEGEFAPVSWDEAFDVMAAQAKRVLKEKGPKAVGMFGSGQWTIWEGYA
ASKLMRAGFRSNNLDPNARHCMASAATAFMRTFGMDEPMGCYDDFEAADAFVLWGSNMAEMHPILWSRLTDRRLSHEHVR
VAVLSTFTHRSMDLADTPIIFRPGTDLAILNYIAHHIISTGRVNRDFVDRHTNFALGATDIGYGLRPEHQLQLAAKGAAD
AGAMTPTDFETFAALVSEYTLEKAAEISGVEPALLEELAELYADPDRKVMSLWTMGFNQHVRGVWANHMVYNLHLLTGKI
SEPGNSPFSLTGQPSACGTAREVGTFAHRLPADMVVTNPEHRAHAEEIWKLPAGLLPDWVGAHAVEQDRKLHDGEINFYW
VQVNNNMQAAPNIDQETYPGYRNPENFIVVSDAYPTVTGRCADLVLPAAMWVEKEGAYGNAERRTHFWHQLVEAPGEARS
DLWQLMEFSKRFTTDEVWPEEILSAAPAYRGKTLFEVLFANGSVDRFPASDVNPDHANHEAALFGFYPQKGLFEEYAAFG
RGHGHDLAPFDTYHEVRGLRWPVVEGEETRWRYREGFDPYVKPGEGLRFYGKPDGRAVILGVPYEPPAESPDEEFGFWLV
TGRVLEHWHSGSMTLRVPELYKAFPGAVCFMHPEDARSRGLNRGSEVRVISRRGEIRTRLETRGRNRMPRGVVFVPWFDA
SQLINKVTLDANDPISRQTDFKKCAVKIEAV
>P39185 1.9.6.1~~~napA~~~Periplasmic nitrate reductase~~~COG0243
MKISRRDFIKQTAITATASVAGVTLPAGAANFVTDSEVTKLKWSKAPCRFCGTGCGVTVAVKDNKVVATQGDPQAEVNKG
LNCVKGYFLSKIMYGQDRLTRPLMRMKNGKYDKNGDFAPVTWDQAFDEMERQFKRVLKEKGPTAVGMFGSGQWTVWEGYA
AAKLYKAGFRSNNIDPNARHCMASAAAGFMRTFGMDEPMGCYDDFEAADAFVLWGSNMAEMHPILWTRVTDRRLSHPKTR
VVVLSTFTHRCFDLADIGIIFKPQTDLAMLNYIANYIIRNNKVNKDFVNKHTVFKEGVTDIGYGLRPDHPLQKAAKNASD
PGAAKVITFDEFAKFVSKYDADYVSKLSAVPKAKLDQLAELYADPNIKVMSLWTMGFNQHTRGTWANNMVYNLHLLTGKI
ATPGNSPFSLTGQPSACGTAREVGTFSHRLPADMVVTNPKHREEAERIWKLPPGTIPDKPGYDAVLQNRMLKDGKLNAYW
VQVNNNMQAAANLMEEGLPGYRNPANFIVVSDAYPTVTALAADLVLPSAMWVEKEGAYGNAERRTQFWHQLVDAPGEARS
DLWQLVEFAKRFKVEEVWPPELIAKKPEYKGKTLYDVLYRNGQVDKFPLKDVNAEYHNAEAKAFGFYLQKGLFEEYATFG
RGHGHDLAPFDAYHEARGLRWPVVNGKETRWRYREGSDPYVKAGTGFQFYGNPDGKAVIFALPYEPPAESPDKEYPYWLV
TGRVLEHWHSGSMTRRVPELYRSFPNAVVFMHPEDAKALGLRRGVEVEVVSRRGRMRSRIETRGRDAPPRGLVFVPWFDA
SQLINKVTLDATCPISLQTDFKKCAVKIVKV
>P81186 1.9.6.1~~~napA~~~Periplasmic nitrate reductase~~~COG0243
MSTSRRDFLKYFAMSAAVAAASGAGFGSLALAADNRPEKWVKGVCRYCGTGCGVLVGVKDGKAVAIQGDPNNHNAGLLCL
KGSLLIPVLNSKERVTQPLVRRHKGGKLEPVSWDEALDLMASRFRSSIDMYGPNSVAWYGSGQCLTEESYVANKIFKGGF
GTNNVDGNPRLCMASAVGGYVTSFGKDEPMGTYADIDQATCFFIIGSNTSEAHPVLFRRIARRKQVEPGVKIIVADPRRT
NTSRIADMHVAFRPGTDLAFMHSMAWVIINEELDNPRFWQRYVNFMDAEGKPSDFEGYKAFLENYRPEKVAEICRVPVEQ
IYGAARAFAESAATMSLWCMGINQRVQGVFANNLIHNLHLITGQICRPGATSFSLTGQPNACGGVRDGGALSHLLPAGRA
IPNAKHRAEMEKLWGLPEGRIAPEPGYHTVALFEALGRGDVKCMIICETNPAHTLPNLNKVHKAMSHPESFIVCIEAFPD
AVTLEYADLVLPPAFWCERDGVYGCGERRYSLTEKAVDPPGQCRPTVNTLVEFARRAGVDPQLVNFRNAEDVWNEWRMVS
KGTTYDFWGMTRERLRKESGLIWPCPSEDHPGTSLRYVRGQDPCVPADHPDRFFFYGKPDGRAVIWMRPAKGAAEEPDAE
YPLYLTSMRVIDHWHTATMTGKVPELQKANPIAFVEINEEDAARTGIKHGDSVIVETRRDAMELPARVSDVCRPGLIAVP
FFDPKKLVNKLFLDATDPVSREPEYKICAARVRKA
>P33937 1.9.6.1~~~napA~~~Periplasmic nitrate reductase~~~COG0243
MKLSRRSFMKANAVAAAAAAAGLSVPGVARAVVGQQEAIKWDKAPCRFCGTGCGVLVGTQQGRVVACQGDPDAPVNRGLN
CIKGYFLPKIMYGKDRLTQPLLRMKNGKYDKEGEFTPITWDQAFDVMEEKFKTALKEKGPESIGMFGSGQWTIWEGYAAS
KLFKAGFRSNNIDPNARHCMASAVVGFMRTFGMDEPMGCYDDIEQADAFVLWGANMAEMHPILWSRITNRRLSNQNVTVA
VLSTYQHRSFELADNGIIFTPQSDLVILNYIANYIIQNNAINQDFFSKHVNLRKGATDIGYGLRPTHPLEKAAKNPGSDA
SEPMSFEDYKAFVAEYTLEKTAEMTGVPKDQLEQLAQLYADPNKKVISYWTMGFNQHTRGVWANNLVYNLHLLTGKISQP
GCGPFSLTGQPSACGTAREVGTFAHRLPADMVVTNEKHRDICEKKWNIPSGTIPAKIGLHAVAQDRALKDGKLNVYWTMC
TNNMQAGPNINEERMPGWRDPRNFIIVSDPYPTVSALAADLILPTAMWVEKEGAYGNAERRTQFWRQQVQAPGEAKSDLW
QLVQFSRRFKTEEVWPEDLLAKKPELRGKTLYEVLYATPEVSKFPVSELAEDQLNDESRELGFYLQKGLFEEYAWFGRGH
GHDLAPFDDYHKARGLRWPVVNGKETQWRYSEGNDPYVKAGEGYKFYGKPDGKAVIFALPFEPAAEAPDEEYDLWLSTGR
VLEHWHTGSMTRRVPELHRAFPEAVLFIHPLDAKARDLRRGDKVKVVSRRGEVISIVETRGRNRPPQGLVYMPFFDAAQL
VNKLTLDATDPLSKETDFKKCAVKLEKV
>P26235 ~~~napA~~~Na(+)/H(+) antiporter~~~
MEFIGILCLILVATTIGSHISRRFGIPAVIGQLLVGVLLGQAGLGWVHPNILVHDFSEIGVILLMFLAGLESDLSLLKKY
FKPGMFVALLGILFPVFFGWLTGEAFQVANNEAIFFGIILAATSVSISVEVLKELNVVNTKEGSTILGASVVDDILVVLV
LSFSLSFLTGKSTSNLPLPLLLLEQLFYFLFIFLLVKWIAPFLMSLAEKIYANSAIIIMSLVICLGMSYLADLIGLSSVI
GAFFAGIAVSQTKVKHEVYNNVEALGYAVFIPVFFVSVGLEVDFSKFSEQILFILILTLVAILTKLIGGYIGAKFSSFSS
NSALMVGAGMISRGEMALIILQIGQQSNLIENHYYSPLVIVVLLSTLISPLILKYFTKKVYAN
>Q56350 1.9.6.1~~~napA~~~Periplasmic nitrate reductase~~~COG0243
MTISRRDLLKAQAAGIAAMAANIPLSSQAPAVPGGVESLQITWSKAPCRFCGTGCGVMVGVKEGRVVATHGDLLAEVNRG
LNCVKGYFLSKIMYGADRLTQPLLRKKDGVYAKDGEFTPVSWEEAFDTMAAQAKRVLRDKGPTALGMFGSGQWTIFEGYA
ATKLMRAGFRSNNLDPNARHCMASAAYAFMRTFGMDEPMGCYDDFEAADAFVLWGSNMAEMHPILWTRVADRRLGHPHVK
VAVLSTFTHRSSDLADIPIVFKPGTDLAILNYIANHIIQTGRVNRDFVDRHTTFVAGATGIGYGLRDDDPREMAARTAED
PAATTPSTFEEFAELVSEYTLDKVSELSGVEPAFLEQLAELYADPDRKVMSLWTMGFNQHVRGVWANQMVYNLHLLTGKI
SEPGNSPFSLTGQASACGTARQVGTFRHRLPSDMTVTNPERRQDAEEIWRIPHGVIPEQPGLHAVAQDRALHDGTLNFYW
IQVNNNLQASPNNDGEAWPGYRNPDNFIVVSDAYPTVTALAADLILPAAMWVEKEGAYGNAERRTHVWHQLVEAPGEARS
DLWQMMEFSTRFTTDEVWPEEILAANPNYRGQSLFDVLFRNGSVDRFDLSELNPVTPTAESNAFGFYVQKGLFEEYAPFG
RGHGHDLAPYDTYHEVRGLRWPVVDGKETLWRYREGLDPYVEPGAGVQFYGNPDGKARIIAVPYEPPAEPPDEEYNIWLV
TGRVLEHWHSGSMTMRVPELYRAFPGARCFMNPEDARDMGFNQGAEVRIVSRRGEIRSRVETRGRNRMPRGVVFVPWFDA
SQLINKVTLDATDPISKQTDFKKCAVKILPV
>Q8KY07 ~~~napB~~~Periplasmic nitrate reductase, electron transfer subunit~~~COG3043
MRRAHRAGERVMMKRFGIALLAVAIAAGASSLTAQTVTSGLHGPAPLNDEGPAPPMLPNRNTSEREVRNYPEQPPVIPHT
IDGYQVDLNGNKCLSCHARARTAESQAPMVSITHFMDRDGQFWPSISPRRFFCTECHVPQNTATPPVSNDFTDIDTLLSR
ASPGGRR
>Q53177 ~~~napB~~~Periplasmic nitrate reductase, electron transfer subunit~~~
MSVHPTLRFLATALVALGAGAALAQDAPRLTGADRPMSEVAAPPLPETITDDRRVGRNYPEQPPVIPHSIEGYQLSVNAN
RCLECHRRQYSGLVAAPMISITHFQDREGQMLADVSPRRYFCTACHVPQTNAQPLVTNEFRDMLTLMPASNEAE
>O88160 ~~~napB~~~Periplasmic nitrate reductase, electron transfer subunit~~~
MSMHPALRLLATVLVALGAGPAFTQDAPRLTGADRPMSEVAAPPLPETITDDRRVGRNYPEQPPVIPHAIEGYQLSVNAN
RCLECHRRQYSGLVAAPMISITHFQDREGQMLADVSPRRYFCTACHVPQTNAQPLVTNEFRDMLTLTPASNEAE
>P39186 ~~~napB~~~Periplasmic nitrate reductase, electron transfer subunit~~~COG3043
MKPSRSWASLLAVCAVLLAALAMQAIFFPAPARAQGLVDAMRGPTAIANEPRAPLLYPTENKDIRRTRNYTMQPPTIPHK
IDGYQLDKDFNRCMFCHARTRTEETQAIPVSITHYMDRDNNVLADVSPRRYFCTQCHVPQADTKPLIGNNFVDVDTILKR
RPGAKGAAK
>P0ABL3 ~~~napB~~~Periplasmic nitrate reductase, electron transfer subunit~~~COG3043
MKSHDLKKALCQWTAMLALVVSGAVWAANGVDFSQSPEVSGTQEGAIRMPKEQDRMPLNYVNQPPMIPHSVEGYQVTTNT
NRCLQCHGVESYRTTGAPRISPTHFMDSDGKVGAEVAPRRYFCLQCHVPQADTAPIVGNTFTPSKGYGK
>P44654 ~~~napB~~~Periplasmic nitrate reductase, electron transfer subunit~~~COG3043
MINMTKQVSKILAGLFTALFAGSLMASDAPAVGKDLTQAAENIPPAFHNAPRQGELPALNYVNQPPMVPHSVANYQVTKN
VNQCLNCHSPENSRLSGATRISPTHFMDRDGKVGSSSSPRRYFCLQCHVSQANVDPIVPNDFKPMKGYGN
>Q56351 ~~~napB~~~Periplasmic nitrate reductase, electron transfer subunit~~~COG3043
MRGQDPSRLIRPAAMAGLLLFALVGAALPQAEPAVQIVPALDRLGRADVRGQIPPLGRPITDDVRRMRNYPEQPPVIPHS
IDGYQLTVNTNRCMDCHKPQFTEGSGAPMISVTHFQDRDGQVLTDVTPRRYFCTACHVQQTDVQPLVPNQFRDGYRHAGG
P
>Q8EIJ4 ~~~napB~~~Periplasmic nitrate reductase, electron transfer subunit~~~COG3043
MKKILTLAAIVLAIGGCSGQQAETQATPVNIKSLAGDSAVTDIRPADAMPVYPARGKALERSFTDQPPLIPHKDDYKITL
DKNGCLTCHSWDKAARMKATPVAKSHVIDDKGTLNGHNYFCTQCHVAQAENKAPLVENKFSTQ
>Q53178 ~~~napC~~~Cytochrome c-type protein NapC~~~
MRLPSFLRRFWSIATSPSSFLSVGFLTLGGFVGGVLFWGGFNTALEATNTEAFCTSCHEMQSNVFEELTRTVHYTNRSGV
RAGCPDCHVPHEWTDKIARKMQASKEVWGHLFGTIDTRRKFLDNRLRLAEHEWARLKANDSLECRNCHSEVAMDFTRQTD
RAAQIHTQYLIQTEGYTCIDCHKGIAHELPDMRGIDPGWLPPADLRASLPDHGSSFDLEGARAYVAD
>P0ABL5 ~~~napC~~~Cytochrome c-type protein NapC~~~COG3005
MGNSDRKPGLIKRLWKWWRTPSRLALGTLLLIGFVGGIVFWGGFNTGMEKANTEEFCISCHEMRNTVYQEYMDSVHYNNR
SGVRATCPDCHVPHEFVPKMIRKLKASKELYGKIFGVIDTPQKFEAHRLTMAQNEWRRMKDNNSQECRNCHNFEYMDTTA
QKSVAAKMHDQAVKDGQTCIDCHKGIAHKLPDMREVEPGF
>P0A9I5 ~~~napD~~~Chaperone NapD~~~COG3062
MHTNWQVCSLVVQAKSERISDISTQLNAFPGCEVAVSDAPSGQLIVVVEAEDSETLIQTIESVRNVEGVLAVSLVYHQQE
EQGEETP
>P37062 1.11.1.1~~~npr~~~NADH peroxidase~~~COG0446
MKVIVLGSSHGGYEAVEELLNLHPDAEIQWYEKGDFISFLSCGMQLYLEGKVKDVNSVRYMTGEKMESRGVNVFSNTEIT
AIQPKEHQVTVKDLVSGEERVENYDKLIISPGAVPFELDIPGKDLDNIYLMRGRQWAIKLKQKTVDPEVNNVVVIGSGYI
GIEAAEAFAKAGKKVTVIDILDRPLGVYLDKEFTDVLTEEMEANNITIATGETVERYEGDGRVQKIVTDKNAYDADLVVV
AVGVRPNTAWLKGTLELHPNGLIKTDEYMRTSEPDVFAVGDATLIKYNPADTEVNIALATNARKQGRFAVKNLEEPVKPF
PGVQGSSGLAVFDYKFASTGINEVMAQKLGKETKAVTVVEDYLMDFNPDKQKAWFKLVYDPETTQILGAQLMSKADLTAN
INAISLAIQAKMTIEDLAYADFFFQPAFDKPWNIINTAALEAVKQER
>P0AAL0 ~~~napF~~~Ferredoxin-type protein NapF~~~COG1145
MKIDASRRGILTGRWRKASNGIRPPWSGDESHFLTHCTRCDACINACENNILQRGAGGYPSVNFKNNECSFCYACAQACP
ESLFSPRHTRAWDLQFTIGDACLAYQSVECRRCQDSCEPMAIIFRPTLSGIYQPQLNSQLCNGCGACAASCPVSAITAEY
LHAH
>P0AAL3 ~~~napG~~~Ferredoxin-type protein NapG~~~COG0437
MSRSAKPQNGRRRFLRDVVRTAGGLAAVGVALGLQQQTARASGVRLRPPGAINENAFASACVRCGQCVQACPYDTLKLAT
LASGLSAGTPYFVARDIPCEMCEDIPCAKVCPSGALDREIESIDDARMGLAVLVDQENCLNFQGLRCDVCYRECPKIDEA
ITLELERNTRTGKHARFLPTVHSDACTGCGKCEKVCVLEQPAIKVLPLSLAKGELGHHYRFGWLEGNNGKS
>P33934 ~~~napH~~~Ferredoxin-type protein NapH~~~COG0348
MANRKRDAGREALEKKGWWRSHRWLVLRRLCQFFVLGMFLSGPWFGVWILHGNYSSSLLFDTVPLTDPLMTLQSLASGHL
PATVALTGAVIITVLYALAGKRLFCSWVCPLNPITDLANWLRRRFDLNQSATIPRHIRYVLLVVILVGSALTGTLIWEWI
NPVSLMGRSLVMGFGSGALLILALFLFDLLVVEHGWCGHICPVGALYGVLGSKGVITVAATDRQKCNRCMDCFHVCPEPH
VLRAPVLDEQSPVQVTSRDCMTCGRCVDVCSEDVFTITTRWSSGAKS
>P96688 3.1.1.1~~~nap~~~Uncharacterized carboxylesterase nap~~~COG0596
MSNHSSSIPELSDNGIRYYQTYNESLSLWPVRCKSFYISTRFGQTHVIASGPEDAPPLVLLHGALFSSTMWYPNIADWSS
KYRTYAVDIIGDKNKSIPENVSGTRTDYANWLLDVFDNLGIEKSHMIGLSLGGLHTMNFLLRMPERVKSAAILSPAETFL
PFHHDFYKYALGLTASNGVETFLNWMMNDQNVLHPIFVKQFKAGVMWQDGSRNPNPNADGFPYVFTDEELRSARVPILLL
LGEHEVIYDPHSALHRASSFVPDIEAEVIKNAGHVLSMEQPTYVNERVMRFFNAETGISR
>P39458 1.7.5.1~~~narB~~~Nitrate reductase~~~COG0243
MFDLSKFLPVITPLMIDTAKTLCPYCGVGCGLEAVPPAQPGRATVRDREGTPIWQIRGDRQHPSSQGMVCVKGATVAESV
SKSRLKYPMFRASLDDPFTEISWDEALDRLCDRIQQTQADYGKDGICFYGSGQFQTEDYYIAQKLVKGCLGTNNFDTNSR
LCMSSAVSAYSLCLGSDGPPACYEDLDLADCLLIVGSNTAECHPILFNRYRKRHKQGGTNLIVVDPRCTPTAEVADLHLA
LKPGSDVALLNGLGWLLYQMGYVKKDFIANQTEGFEDWLAIIEDYPPQRTAELTGLAVAELVRAADLIASAQRWLSLWSM
GVNQSIQGTAKATSLINLHLLTRQIGLPGCGPFSLTGQPNAMGGRETGGLAHLLPGYRKVIDPQHRADVETIWGLPMGSI
SPQPGRTAWQMIEGLEQGAVGFLWVAATNPAVSLPDVKRAQAALKRSPFTVLQDAYHPTETTTYAHLLLPAAQWSEKTGT
MTNSERRVTLCQAFRQPPGEARADWQIFAEVGRRLGFAFDYTDAAAVFAEYVQVTAGRLCDLSGLSHELLAQAGPQQWPF
PAGAEPTTESKRLYTKHHFAYADGRARFQPFHHLGVAEPPDDRYPLVLTVGRLYGHWHTQTRTGRIDKINKLHPSAFVEI
HPRDADRYNISEGQAVVIRSRRGEGCFPAKVTTAISPGVLFVPMHWGALWGDRTEANALTHPAACPISGEPELKACAVQI
EAASSTFTI
>P09152 1.7.5.1~~~narG~~~Respiratory nitrate reductase 1 alpha chain~~~COG5013
MSKFLDRFRYFKQKGETFADGHGQLLNTNRDWEDGYRQRWQHDKIVRSTHGVNCTGSCSWKIYVKNGLVTWETQQTDYPR
TRPDLPNHEPRGCPRGASYSWYLYSANRLKYPMMRKRLMKMWREAKALHSDPVEAWASIIEDADKAKSFKQARGRGGFVR
SSWQEVNELIAASNVYTIKNYGPDRVAGFSPIPAMSMVSYASGARYLSLIGGTCLSFYDWYCDLPPASPQTWGEQTDVPE
SADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKTVAVTPDYAEIAKLCDLWLAPKQGTDAAMALAMGHVMLREFHL
DNPSQYFTDYVRRYTDMPMLVMLEERDGYYAAGRMLRAADLVDALGQENNPEWKTVAFNTNGEMVAPNGSIGFRWGEKGK
WNLEQRDGKTGEETELQLSLLGSQDEIAEVGFPYFGGDGTEHFNKVELENVLLHKLPVKRLQLADGSTALVTTVYDLTLA
NYGLERGLNDVNCATSYDDVKAYTPAWAEQITGVSRSQIIRIAREFADNADKTHGRSMIIVGAGLNHWYHLDMNYRGLIN
MLIFCGCVGQSGGGWAHYVGQEKLRPQTGWQPLAFALDWQRPARHMNSTSYFYNHSSQWRYETVTAEELLSPMADKSRYT
GHLIDFNVRAERMGWLPSAPQLGTNPLTIAGEAEKAGMNPVDYTVKSLKEGSIRFAAEQPENGKNHPRNLFIWRSNLLGS
SGKGHEFMLKYLLGTEHGIQGKDLGQQGGVKPEEVDWQDNGLEGKLDLVVTLDFRLSSTCLYSDIILPTATWYEKDDMNT
SDMHPFIHPLSAAVDPAWEAKSDWEIYKAIAKKFSEVCVGHLGKETDIVTLPIQHDSAAELAQPLDVKDWKKGECDLIPG
KTAPHIMVVERDYPATYERFTSIGPLMEKIGNGGKGIAWNTQSEMDLLRKLNYTKAEGPAKGQPMLNTAIDAAEMILTLA
PETNGQVAVKAWAALSEFTGRDHTHLALNKEDEKIRFRDIQAQPRKIISSPTWSGLEDEHVSYNAGYTNVHELIPWRTLS
GRQQLYQDHQWMRDFGESLLVYRPPIDTRSVKEVIGQKSNGNQEKALNFLTPHQKWGIHSTYSDNLLMLTLGRGGPVVWL
SEADAKDLGIADNDWIEVFNSNGALTARAVVSQRVPAGMTMMYHAQERIVNLPGSEITQQRGGIHNSVTRITPKPTHMIG
GYAHLAYGFNYYGTVGSNRDEFVVVRKMKNIDWLDGEGNDQVQESVK
>P9WJQ3 1.7.5.1~~~narG~~~Nitrate reductase alpha subunit~~~COG5013
MTVTPHVGGPLEELLERSGRFFTPGEFSADLRTVTRRGGREGDVFYRDRWSHDKVVRSTHGVNCTGSCSWKIYVKDGIIT
WETQQTDYPSVGPDRPEYEPRGCPRGASFSWYSYSPTRVRYPYARGVLVEMYREAKTRLGDPVLAWADIQADPERRRRYQ
QARGKGGLVRVSWAEASEMVAAAHVHTIKTYGPDRVAGFSPIPAMSMVSHAAGSRFVELIGGVMTSFYDWYADLPVASPQ
VFGDQTDVPESGDWWDASYLVMWGSNVPITRTPDAHWMAEARYRGAKVVVVSPDYADNTKFADEWVRCAAGTDTALAMAM
GHVILSECYVRNQVPFFVDYVRRYTDLPFLIKLEKRGDLLVPGKFLTAADIGEESENAAFKPALLDELTNTVVVPQGSLG
FRFGEDGVGKWNLDLGSVVPALSVEMDKAVNGDRSAELVTLPSFDTIDGHGETVSRGVPVRRAGKHLVCTVFDLMLAHYG
VARAGLPGEWPTGYHDRTQQNTPAWQESITGVPAAQAIRFAKEFARNATESGGRSMIIMGGGICHWFHSDVMYRSVLALL
MLTGSMGRNGGGWAHYVGQEKVRPLTGWQTMAMATDWSRPPRQVPGASYWYAHTDQWRYDGYGADKLASPVGRGRFAGKH
TMDLLTSATAMGWSPFYPQFDRSSLDVADEARAAGRDVGDYVAEQLAQHKLKLSITDPDNPVNWPRVLTVWRANLIGSSG
KGGEYFLRHLLGTDSNVQSDPPTDGVHPRDVVWDSDIPEGKLDLIMSIDFRMTSTTLVSDVVLPAATWYEKSDLSSTDMH
PYVHSFSPAIDPPWETRSDFDAFAAIARAFSALAKRHLGTRTDVVLTALQHDTPDEMAYPDGTERDWLATGEVPVPGRTM
SKLTVVERDYTAIYDKWLTLGPLIDQFGMTTKGYTVHPFREVSELAANFGVMNSGVAVGRPAITTAKRMADVILALSGTC
NGRLAVEGFLELEKRTGQRLAHLAEGSEERRITYADTQARPVPVITSPEWSGSESGGRRYAPFTINIEHLKPFHTLTGRM
HFYLAHDWVEELGEQLPVYRPPLDMARLFNQPELGPTDDGLGLTVRYLTPHSKWSFHSTYQDNLYMLSLSRGGPTMWMSP
GDAAKINVRDNDWVEAVNANGIYVCRAIVSHRMPEGVVFVYHVQERTVDTPRTETNGKRGGNHNALTRVRIKPSHLAGGY
GQHAFAFNYLGPTGNQRDEVTVVRRRSQEVRY
>P11349 1.7.5.1~~~narH~~~Respiratory nitrate reductase 1 beta chain~~~COG1140
MKIRSQVGMVLNLDKCIGCHTCSVTCKNVWTSREGVEYAWFNNVETKPGQGFPTDWENQEKYKGGWIRKINGKLQPRMGN
RAMLLGKIFANPHLPGIDDYYEPFDFDYQNLHTAPEGSKSQPIARPRSLITGERMAKIEKGPNWEDDLGGEFDKLAKDKN
FDNIQKAMYSQFENTFMMYLPRLCEHCLNPACVATCPSGAIYKREEDGIVLIDQDKCRGWRMCITGCPYKKIYFNWKSGK
SEKCIFCYPRIEAGQPTVCSETCVGRIRYLGVLLYDADAIERAASTENEKDLYQRQLDVFLDPNDPKVIEQAIKDGIPLS
VIEAAQQSPVYKMAMEWKLALPLHPEYRTLPMVWYVPPLSPIQSAADAGELGSNGILPDVESLRIPVQYLANLLTAGDTK
PVLRALKRMLAMRHYKRAETVDGKVDTRALEEVGLTEAQAQEMYRYLAIANYEDRFVVPSSHRELAREAFPEKNGCGFTF
GDGCHGSDTKFNLFNSRRIDAIDVTSKTEPHP
>P11350 1.7.5.1~~~narI~~~Respiratory nitrate reductase 1 gamma chain~~~COG2181
MQFLNMFFFDIYPYIAGAVFLIGSWLRYDYGQYTWRAASSQMLDRKGMNLASNLFHIGILGIFVGHFFGMLTPHWMYEAW
LPIEVKQKMAMFAGGASGVLCLIGGVLLLKRRLFSPRVRATTTGADILILSLLVIQCALGLLTIPFSAQHMDGSEMMKLV
GWAQSVVTFHGGASQHLDGVAFIFRLHLVLGMTLFLLFPFSRLIHIWSVPVEYLTRKYQLVRARH
>P0AF26 ~~~narJ~~~Nitrate reductase molybdenum cofactor assembly chaperone NarJ~~~COG2180
MIELVIVSRLLEYPDAALWQHQQEMFEAIAASKNLPKEDAHALGIFLRDLTTMDPLDAQAQYSELFDRGRATSLLLFEHV
HGESRDRGQAMVDLLAQYEQHGLQLNSRELPDHLPLYLEYLAQLPQSEAVEGLKDIAPILALLSARLQQRESRYAVLFDL
LLKLANTAIDSDKVAEKIADEARDDTPQALDAVWEEEQVKFFADKGCGDSAITAHQRRFAGAVAPQYLNITTGGQH
>P9WJY6 ~~~narK2~~~Probable nitrate/nitrite transporter NarK2~~~
MRGQAANLVLATWISVVNFWAWNLIGPLSTSYARDMSLSSAEASLLVATPILVGALGRIVTGPLTDRFGGRAMLIAVTLA
SILPVLAVGVAATMGSYALLVFFGLFLGVAGTIFAVGIPFANNWYQPARRGFSTGVFGMGMVGTALSAFFTPRFVRWFGL
FTTHAIVAAALASTAVVAMVVLRDAPYFRPNADPVLPRLKAAARLPVTWEMSFLYAIVFGGFVAFSNYLPTYITTIYGFS
TVDAGARTAGFALAAVLARPVGGWLSDRIAPRHVVLASLAGTALLAFAAALQPPPEVWSAATFITLAVCLGVGTGGVFAW
VARRAPAASVGSVTGIVAAAGGLGGYFPPLVMGATYDPVDNDYTVGLLLLVATALVACTYTALHAREPVSEEASR
>P9WJY7 ~~~narK2~~~Probable nitrate/nitrite transporter NarK2~~~COG2223
MRGQAANLVLATWISVVNFWAWNLIGPLSTSYARDMSLSSAEASLLVATPILVGALGRIVTGPLTDRFGGRAMLIAVTLA
SILPVLAVGVAATMGSYALLVFFGLFLGVAGTIFAVGIPFANNWYQPARRGFSTGVFGMGMVGTALSAFFTPRFVRWFGL
FTTHAIVAAALASTAVVAMVVLRDAPYFRPNADPVLPRLKAAARLPVTWEMSFLYAIVFGGFVAFSNYLPTYITTIYGFS
TVDAGARTAGFALAAVLARPVGGWLSDRIAPRHVVLASLAGTALLAFAAALQPPPEVWSAATFITLAVCLGVGTGGVFAW
VARRAPAASVGSVTGIVAAAGGLGGYFPPLVMGATYDPVDNDYTVGLLLLVATALVACTYTALHAREPVSEEASR
>P10903 ~~~narK~~~Nitrate/nitrite antiporter NarK~~~COG2223
MSHSSAPERATGAVITDWRPEDPAFWQQRGQRIASRNLWISVPCLLLAFCVWMLFSAVAVNLPKVGFNFTTDQLFMLTAL
PSVSGALLRVPYSFMVPIFGGRRWTAFSTGILIIPCVWLGFAVQDTSTPYSVFIIISLLCGFAGANFASSMANISFFFPK
QKQGGALGLNGGLGNMGVSVMQLVAPLVVSLSIFAVFGSQGVKQPDGTELYLANASWIWVPFLAIFTIAAWFGMNDLATS
KASIKEQLPVLKRGHLWIMSLLYLATFGSFIGFSAGFAMLSKTQFPDVQILQYAFFGPFIGALARSAGGALSDRLGGTRV
TLVNFILMAIFSGLLFLTLPTDGQGGSFMAFFAVFLALFLTAGLGSGSTFQMISVIFRKLTMDRVKAEGGSDERAMREAA
TDTAAALGFISAIGAIGGFFIPKAFGSSLALTGSPVGAMKVFLIFYIACVVITWAVYGRHSKK
>P0AF28 ~~~narL~~~Nitrate/nitrite response regulator protein NarL~~~COG2197
MSNQEPATILLIDDHPMLRTGVKQLISMAPDITVVGEASNGEQGIELAESLDPDLILLDLNMPGMNGLETLDKLREKSLS
GRIVVFSVSNHEEDVVTALKRGADGYLLKDMEPEDLLKALHQAAAGEMVLSEALTPVLAASLRANRATTERDVNQLTPRE
RDILKLIAQGLPNKMIARRLDITESTVKVHVKHMLKKMKLKSRVEAAVWVHQERIF
>P9WGM5 ~~~narL~~~Probable transcriptional regulatory protein NarL~~~COG2197
MSNPQPEKVRVVVGDDHPLFREGVVRALSLSGSVNVVGEADDGAAALELIKAHLPDVALLDYRMPGMDGAQVAAAVRSYE
LPTRVLLISAHDEPAIVYQALQQGAAGFLLKDSTRTEIVKAVLDCAKGRDVVAPSLVGGLAGEIRQRAAPVAPVLSARER
EVLNRIACGQSIPAIAAELYVAPSTVKTHVQRLYEKLGVSDRAAAVAEAMRQRLLD
>P27896 2.7.13.3~~~narQ~~~Nitrate/nitrite sensor protein NarQ~~~COG3850
MIVKRPVSASLARAFFYIVLLSILSTGIALLTLASSLRDAEAINIAGSLRMQSYRLGYDLQSGSPQLNAHRQLFQQALHS
PVLTNLNVWYVPEAVKTRYAHLNANWLEMNNRLSKGDLPWYQANINNYVNQIDLFVLALQHYAERKMLLVVAISLAGGIG
IFTLVFFTLRRIRHQVVAPLNQLVTASQRIEHGQFDSPPLDTNLPNELGLLAKTFNQMSSELHKLYRSLEASVEEKTRDL
HEAKRRLEVLYQCSQALNTSQIDVHCFRHILQIVRDNEAAEYLELNVGENWRISEGQPNPELPMQILPVTMQETVYGELH
WQNSHVSSSEPLLNSVSSMLGRGLYFNQAQKHFQQLLLMEERATIARELHDSLAQVLSYLRIQLTLLKRSIPEDNATAQS
IMADFSQALNDAYRQLRELLTTFRLTLQQADLPSALREMLDTLQNQTSAKLTLDCRLPTLALDAQMQVHLLQIIREAVLN
AMKHANASEIAVSCVTAPDGNHTVYIRDNGIGIGEPKEPEGHYGLNIMRERAERLGGTLTFSQPSGGGTLVSISFRSAEG
EESQLM
>O53857 2.7.13.3~~~narS~~~Sensor histidine kinase NarS~~~COG4585
MPSYGNLGRLGGRHEYGVLVAMTSSAELDRVRWAHQLRSYRIASVLRIGVVGLMVAAMVVGTSRSEWPQQIVLIGVYAVA
ALWALLLAYSASRRFFALRRFRSMGRLEPFAFTAVDVLILTGFQLLSTDGIYPLLIMILLPVLVGLDVSTRRAAVVLACT
LVGFAVAVLGDPVMLRAIGWPETIFRFALYAFLCATALMVVRIEERHTRSVAGLSALRAELLAQTMTASEVLQRRIAEAI
HDGPLQDVLAARQELIELDAVTPGDERVGRALAGLQSASERLRQATFELHPAVLEQVGLGPAVKQLAASTAQRSGIKIST
DIDYPIRSGIDPIVFGVVRELLSNVVRHSGATTASVRLGITDEKCVLDVADDGVGVTGDTMARRLGEGHIGLASHRARVD
AAGGVLVFLATPRGTHVCVELPLKR
>O33854 ~~~narT~~~Probable nitrate transporter NarT~~~COG2223
MNKSKGGLQLTVQTLSLVAGFMVWSIIAPLMPMISQDIKITSSQISIVLAIPVILGSVLRIPFGYLTNIIGAKWVFFSSF
IILLFPIFLLSQAQSVNMLMLAGFFLGVGGAVFSVGVTSIPKYFPKDKVGLANGIYGMGNLGTAVSSFLAPPIAGAIGWQ
STVRLYLIVMAVFAIVMFFLGDAKEPKVKIPLVAQTKDLLKDLRTYYLSFWYFITFGSFVAFGIFLPKYLVDHYELTTVD
AGIRAGIFIAIATFLRPLGGIIGDKIDAVKALKVDFLFMIIGAIILGIANDMILFTVGCLTVSVCAGIGNGLVFKLVPQY
FQKEAGVANGIVSMMGGLGGFFPPLVITYVTSITGTSHLAFIFLALFGVLALVTMWHLSKKNRSLAYK
>P37758 ~~~narU~~~Nitrate/nitrite transporter NarU~~~COG2223
MALQNEKNSRYLLRDWKPENPAFWENKGKHIARRNLWISVSCLLLAFCVWMLFSAVTVNLNKIGFNFTTDQLFLLTALPS
VSGALLRVPYSFMVPIFGGRRWTVFSTAILIIPCVWLGIAVQNPNTPFGIFIVIALLCGFAGANFASSMGNISFFFPKAK
QGSALGINGGLGNLGVSVMQLVAPLVIFVPVFAFLGVNGVPQADGSVMSLANAAWIWVPLLAIATIAAWSGMNDIASSRA
SIADQLPVLQRLHLWLLSLLYLATFGSFIGFSAGFAMLAKTQFPDVNILRLAFFGPFIGAIARSVGGAISDKFGGVRVTL
INFIFMAIFSALLFLTLPGTGSGNFIAFYAVFMGLFLTAGLGSGSTFQMIAVIFRQITIYRVKMKGGSDEQAHKEAVTET
AAALGFISAIGAVGGFFIPQAFGMSLNMTGSPVGAMKVFLIFYIVCVLLTWLVYGRRKFSQK
>P0AF32 1.7.5.1~~~narV~~~Respiratory nitrate reductase 2 gamma chain~~~COG2181
MIQYLNVFFYDIYPYICATVFFLGSWLRYDYGQYTWRASSSQMLDKRGMVIWSNLFHIGILGIFFGHLFGMLTPHWMYAW
FLPVAAKQLMAMVLGGICGVLTLIGGAGLLWRRLTNQRVRATSTTPDIIIMSILLIQCLLGLSTIPFSAQYPDGSEMMKL
VGWAQSIVTFRGGSSEMLNGVAFVFRLHLVLGMTIFLLFPFTRLVHVWSAPFEYFTRRYQIVRSRR
>P19317 ~~~narW~~~Probable nitrate reductase molybdenum cofactor assembly chaperone NarW~~~COG2180
MQILKVIGLLMEYPDELLWECKEDALALIRRDAPMLTDFTHNLLNAPLLDKQAEWCEVFDRGRTTSLLLFEHVHAESRDR
GQAMVDLLAEYEKVGLQLDCRELPDYLPLYLEYLSVLPDDQAKEGLLNVAPILALLGGRLKQREAPWYALFDALLQLAGS
SLSSDSVTKQVNSEERDDTRQALDAVWEEEQVKFIEDNATACDSSPLNQYQRRFSQDVAPQYVDISAGGGK
>P0AFA2 2.7.13.3~~~narX~~~Nitrate/nitrite sensor protein NarX~~~COG3850
MLKRCLSPLTLVNQVALIVLLSTAIGLAGMAVSGWLVQGVQGSAHAINKAGSLRMQSYRLLAAVPLSEKDKPLIKEMEQT
AFSAELTRAAERDGQLAQLQGLQDYWRNELIPALMRAQNRETVSADVSQFVAGLDQLVSGFDRTTEMRIETVVLVHRVMA
VFMALLLVFTIIWLRARLLQPWRQLLAMASAVSHRDFTQRANISGRNEMAMLGTALNNMSAELAESYAVLEQRVQEKTAG
LEHKNQILSFLWQANRRLHSRAPLCERLSPVLNGLQNLTLLRDIELRVYDTDDEENHQEFTCQPDMTCDDKGCQLCPRGV
LPVGDRGTTLKWRLADSHTQYGILLATLPQGRHLSHDQQQLVDTLVEQLTATLALDRHQERQQQLIVMEERATIARELHD
SIAQSLSCMKMQVSCLQMQGDALPESSRELLSQIRNELNASWAQLRELLTTFRLQLTEPGLRPALEASCEEYSAKFGFPV
KLDYQLPPRLVPSHQAIHLLQIAREALSNALKHSQASEVVVTVAQNDNQVKLTVQDNGCGVPENAIRSNHYGMIIMRDRA
QSLRGDCRVRRRESGGTEVVVTFIPEKTFTDVQGDTHE
>P9WJQ0 ~~~narX~~~Nitrate reductase-like protein NarX~~~
MTVTPRTGSRIEELLARSGRFFIPGEISADLRTVTRRGGRDGDVFYRDRWSHDKVVRSTHGVNCTGSCSWKIYVKDDIIT
WETQETDYPSVGPDRPEYEPRGCPRGAAFSWYTYSPTRVRHPYARGVLVEMYREAKARLGDPVAAWADIQADPRRRRRYQ
RARGKGGLVRVSWAEATEMIAAAHVHTISTYGPDRVAGFSPIPAMSMVSHAAGSRFVELIGGVMTSFYDWYADLPVASPQ
VFGDQTDVPESGDWWDVVWQCASVLLTYPNSRQLGTAEELLAHIDGPAADLLGRTVSELRRADPLTAATRYVDTFDLRGR
ATLYLTYWTAGDTRNRGREMLAFAQTYRSTDVAPPRGETPDFLPVVLEFAATVDPEAGRRLLSGYRVPIAALCNALTEAA
LPYAHTVAAVCRTGDMMGELFWTVVPYVTMTIVAVGSWWRYRYDKFGWTTRSSQLYESRLLRIASPMFHFGILVVIVGHG
IGLVIPQSWTQAAGLSEGAYHVQAVVLGSIAGITTLAGVTLLIYRRRTRGPVFMATTVNDKVMYLVLVAAIVAGLGATAL
GSGVVGEAYNYRETVSVWFRSVWVLQPRGDLMAEAPLYYQIHVLIGLALFALWPFTRLVHAFSAPIGYLFRPYIIYRSRE
ELVLTRPRRRGW
>P9WJQ1 ~~~narX~~~Nitrate reductase-like protein NarX~~~COG2180
MTVTPRTGSRIEELLARSGRFFIPGEISADLRTVTRRGGRDGDVFYRDRWSHDKVVRSTHGVNCTGSCSWKIYVKDDIIT
WETQETDYPSVGPDRPEYEPRGCPRGAAFSWYTYSPTRVRHPYARGVLVEMYREAKARLGDPVAAWADIQADPRRRRRYQ
RARGKGGLVRVSWAEATEMIAAAHVHTISTYGPDRVAGFSPIPAMSMVSHAAGSRFVELIGGVMTSFYDWYADLPVASPQ
VFGDQTDVPESGDWWDVVWQCASVLLTYPNSRQLGTAEELLAHIDGPAADLLGRTVSELRRADPLTAATRYVDTFDLRGR
ATLYLTYWTAGDTRNRGREMLAFAQTYRSTDVAPPRGETPDFLPVVLEFAATVDPEAGRRLLSGYRVPIAALCNALTEAA
LPYAHTVAAVCRTGDMMGELFWTVVPYVTMTIVAVGSWWRYRYDKFGWTTRSSQLYESRLLRIASPMFHFGILVVIVGHG
IGLVIPQSWTQAAGLSEGAYHVQAVVLGSIAGITTLAGVTLLIYRRRTRGPVFMATTVNDKVMYLVLVAAIVAGLGATAL
GSGVVGEAYNYRETVSVWFRSVWVLQPRGDLMAEAPLYYQIHVLIGLALFALWPFTRLVHAFSAPIGYLFRPYIIYRSRE
ELVLTRPRRRGW
>P19318 1.7.5.1~~~narY~~~Respiratory nitrate reductase 2 beta chain~~~COG1140
MKIRSQVGMVLNLDKCIGCHTCSVTCKNVWTGREGMEYAWFNNVETKPGIGYPKNWEDQEEWQGGWVRDVNGKIRPRLGN
KMGVITKIFANPVVPQIDDYYEPFTFDYEHLHSAPEGKHIPTARPRSLIDGKRMDKVIWGPNWEELLGGEFEKRARDRNF
EAMQKEMYGQFENTFMMYLPRLCEHCLNPSCVATCPSGAIYKREEDGIVLIDQDKCRGWRLCISGCPYKKIYFNWKSGKS
EKCIFCYPRIESGQPTVCSETCVGRIRYLGVLLYDADRIEEAASTEREVDLYERQCEVFLDPHDPSVIEEALKQGIPQNV
IDAAQRSPVYKMAMDWKLALPLHPEYRTLPMVWYVPPLSPIQSYADAGGLPKSEGVLPAIESLRIPVQYLANMLSAGDTG
PVLRALKRMMAMRHYMRSQTVEGVTDTRAIDEVGLSVAQVEEMYRYLAIANYEDRFVIPTSHREMAGDAFAERNGCGFTF
GDGCHGSDSKFNLFNSSRIDAINITEVRDKAEGE
>P19319 1.7.5.1~~~narZ~~~Respiratory nitrate reductase 2 alpha chain~~~COG5013
MSKLLDRFRYFKQKGETFADGHGQVMHSNRDWEDSYRQRWQFDKIVRSTHGVNCTGSCSWKIYVKNGLVTWEIQQTDYPR
TRPDLPNHEPRGCPRGASYSWYLYSANRLKYPLIRKRLIELWREALKQHSDPVLAWASIMNDPQKCLSYKQVRGRGGFIR
SNWQELNQLIAAANVWTIKTYGPDRVAGFSPIPAMSMVSYAAGTRYLSLLGGTCLSFYDWYCDLPPASPMTWGEQTDVPE
SADWYNSSYIIAWGSNVPQTRTPDAHFFTEVRYKGTKTIAITPDYSEVAKLCDQWLAPKQGTDSALAMAMGHVILKEFHL
DNPSDYFINYCRRYSDMPMLVMLEPRDDGSYVPGRMIRASDLVDGLGESNNPQWKTVAVNTAGELVVPNGSIGFRWGEKG
KWNLESIAAGTETELSLTLLGQHDAVAGVAFPYFGGIENPHFRSVKHNPVLVRQLPVKNLTLVDGNTCPVVSVYDLVLAN
YGLDRGLEDENSAKDYAEIKPYTPAWGEQITGVPRQYIETIAREFADTAHKTHGRSMIILGAGVNHWYHMDMNYRGMINM
LIFCGCVGQSGGGWAHYVGQEKLRPQTGWLPLAFALDWNRPPRQMNSTSFFYNHSSQWRYEKVSAQELLSPLADASKYSG
HLIDFNVRAERMGWLPSAPQLGRNPLGIKAEADKAGLSPTEFTAQALKSGDLRMACEQPDSSSNHPRNLFVWRSNLLGSS
GKGHEYMQKYLLGTESGIQGEELGASDGIKPEEVEWQTAAIEGKLDLLVTLDFRMSSTCLFSDIVLPTATWYEKDDMNTS
DMHPFIHPLSAAVDPAWESRSDWEIYKGIAKAFSQVCVGHLGKETDVVLQPLLHDSPAELSQPCEVLDWRKGECDLIPGK
TAPNIVAVERDYPATYERFTSLGPLMDKLGNGGKGISWNTQDEIDFLGKLNYTKRDGPAQGRPLIDTAIDASEVILALAP
ETNGHVAVKAWQALGEITGREHTHLALHKEDEKIRFRDIQAQPRKIISSPTWSGLESDHVSYNAGYTNVHELIPWRTLSG
RQQLYQDHPWMRAFGESLVAYRPPIDTRSVSEMRQIPPNGFPEKALNFLTPHQKWGIHSTYSENLLMLTLSRGGPIVWIS
ETDARELTIVDNDWVEVFNANGALTARAVVSQRVPPGMTMMYHAQERIMNIPGSEVTGMRGGIHNSVTRVCPKPTHMIGG
YAQLAWGFNYYGTVGSNRDEFIMIRKMKNVNWLDDEGRDQVQEAKK
>P42432 ~~~nasA~~~Nitrate transporter~~~COG2223
MKLSELKTSGHPLTLLCSFLYFDVSFMIWVMLGALGVYISQDFGLSPFEKGLVVAVPILSGSVFRIILGILTDRIGPKKT
AVIGMLVTMIPLLWGTFGGRSLTELYAIGILLGVAGASFAVALPMASRWYPPHLQGLAMGIAGAGNSGTLFATLFGPRLA
EQFGWHIVMGIALIPLLIVFILFVSMAKDSPAQPSPQPLKSYLHVFGQKETWFFCLLYSVTFGGFVGLSSFLSIFFVDQY
QLSKIHAGDFVTLCVAAGSFFRPVGGLISDRVGGTKVLSVLFVIVALCMAGVSSLPSLSMVIVLLFVGMMGLGMGNGAVF
QLVPQRFRKEIGMVTGIVGAAGGIGGFFLPNILGSLKQMTGTYAIGFITFSCIALLAFALVLAAGYYWRKSWSAESSPAD
V
>Q06457 1.7.-.-~~~nasA~~~Nitrate reductase~~~COG0243
MTETRTTCPYCGVGCGVIASRAPHGQVSVRGDEQHPANFGRLCVKGAALGETVGLEGRMLFPEVDGERATWPQALAAAGS
RLREIIDRHGPQAVAFYASGQLLTEDYYAANKLMKGFIGAANIDTNSRLCMSSAVTGYKRALGADVVPCSYEDVENSDLV
VLVGSNAAWAHPVLYQRLAQAKRDNPQMRVVVIDPRRTATCDIADRHLALAPGSDGGLFVGLLNAIAASGAISDDFNDAQ
RALTIAQDWDLDKVAQFCGLPRQQIADFYREFIAAPRAITLYTMGINQSASGSDKCNAIINVHLACGKYGRPGCGPFSLT
GQPNAMGGREVGGLATMLAAHMNFEPDDLRRLARFWGSERLAQTPGLTGVELFAAIGRGEVKAVWIMGTNPVVSLPDSHA
VSEALARCPLVIISDVVADTDTGRFAHIRFPALAWGEKSGTVTNSERRISRQRAFMPPPGEARADWWIVARVAEALGFGS
AFAWQHPHEVFSEHAALSGYENDGQRAFDIGGLADLSREAWDALEPVRWPVSRSEAAWSVHKGWHRDGKLRMVPVAPQPT
RATTDAFYPLILNSGRIRDQWHTMTRTGAVPRLMQHINEPVVEVAPADAQRYHLLEGELARVRSPKGVMVAKVTIGDGQR
PGSLFVPMHWNNQFARQGRVNNLLAAVTDPHSGQPESKQTAVAIATWLPAWKGELFSRQPVPLPASLHWRRRAAQGIIHL
SLAGDTRSRDWLVEWCQRQGWQMQVAEGGKVWNLLAWRAGELMLGWWSDASEPAIDADWIHAAFRVPPQNAARRHALLSG
RKGGVEMPRGRIICSCFSVGERAIGEAIAGGCRTPGALGGKLKCGTNCGSCIPELKALLAAKLAQA
>P42433 ~~~nasB~~~Assimilatory nitrate reductase electron transfer subunit~~~COG1251
MKKQRLVLAGNGMAGIRCIEEVLKLNRHMFEIVIFGSEPHPNYNRILLSSVLQGEASLDDITLNSKDWYDKHGITLYTGE
TVIQIDTDQQQVITDRKRTLSYDKLIVATGSSPHILPIPGADKKGVYGFRTIEDCQALMNMAQHFQKAAVIGAGLLGLEA
AVGLQHLGMDVSVIHHSAGIMQKQLDQTAARLLQTELEQKGLTFLLEKDTVSISGATKADRIHFKDGSSLKADLIVMAAG
VKPNIELAVSAGIKVNRGIIVNDFMQTSEPNIYAVGECAEHNGTVYGLVAPLYEQGKALASHICGVPCEEYQGSAPSAAL
KIAGIDVWSAGKIQEDERTTSIKIYDEQAGVYKKALFVDDKLAGVILFGDTRDKQRLLDSLLKQRDISIAKKQIIEPETS
GPLFESMPSSETICQCNTVTKGAIEDAVHTNSLTTVEEVKHCTKATGSCGGCKPLVEDLLRYMTNSEYTKPASTPSFCSC
TDFTEDDIIAELQRRPFTNPAEVMNQLDWKTKNGCSTCVPAIQYYLEMLYPGFVQPEPATEETCILIPQMYGGRTNAEQL
RTIANIIEAYSIPDVSITHGQRLKLSGIKPADLPNMKKDLKMPVYTNEHRHALQSIKACTCGQNRSIQQLAAQIERQLEM
LPLPAPISISLSCETDCTEAALQDVGAIRTQAGWDIHIGGVRGTHARSGALFCVTENEDSTAGMIKGLIQYYRETAHYLE
GVHQWIDRLGIVHIREVLFEEDLRAQLLESLQTDLSLIQNPTVETGAYKKG
>P42434 1.7.-.-~~~nasC~~~Assimilatory nitrate reductase catalytic subunit~~~COG0243
MTERLLRYFRDKQQDVQSEKTYDTQCPFCSMQCKMQLVEQTIVTRKKYTAIGIDNPTTQGRLCIKGMNAHQHALNSSRIT
RPLLKKNGEFMPVSWEEALNHIKDQVTMIQTEHGHDAMAVYGSASITNEEAYLLGKFARVGLQTKYIDYNGRLCMSAAAT
AANQTFGADRGLTNPLSDIPHTRVIILAGTNIAECQPTIMPYFEKAKENGAYFIAIDPRETATTKIADLHLKIKPGTDAA
LANGLVKIIIDEQLINEDFIQSRTNGFEELKQHTDSLDLNDIAEQTSVSLVDIRKAAVKFAKETSGMLFTARGIEQQTDG
TAAVKGFLNMVLITGKIGKPYSGYGAITGQGNGQGAREHGQKADQLPGYRSIENEEHRAHIAKVWGIHQDELPRKGVSAY
EMMEKINDGDIKGLFLMCSNPAVSSPNANLVKKALRRLTFFVAIDLFISETAKYADVILPASSYLEDEGTMTNVEGRVTL
REASRPCPGEAKHDWQIICDLASALGKGRYFSYTSAEDIFNELREASRGGIADYSGISYGRLRREGGIHWPCPESDHPGT
GRLFTESFAHPDQKAALSVIPNEPPVPKEKPTADYPLYLTTGRVMSHYLTGVQTRKSAALAARHFESFMEIHPQTAATYN
IEDRVLVKIESPRGSITVRSKLSEQIRKDTVFVPIHWADAQNVNDLIGEALDPACKMPGFKVCAVRIIPI
>P42435 1.7.1.4~~~nasD~~~Nitrite reductase [NAD(P)H]~~~COG1251
MGKKQLVLVGNGMAGVRAIEEILSVAKDEFQITIFGAEPHPNYNRILLSKVLQGDTDIKDITLNDWDWYEENNIQLYTNE
TVIKVDTENKTVITDADRIQPYDELILATGSVPFILPIPGADKKGVTAFRDIKDTDTMLAASKQYKKAAVIGGGLLGLEA
ARGLLNLGMDVSVIHLAPFLMERQLDATAGRLLQNELEKQGMTFLLEKQTEEIVGDDRVEGLRFKDGTSIEADLVVMAVG
IRPNTTLGAESGIPVNRGIIVNDYMQTEIPHIYAVGECAEHRGIAYGLVAPLYEQAKVLAKHMCGIETKPYEGSVLSTQL
KVSGVEVFSAGDFNESEEKKAIKVFDEQDGIYKKIVLRGNQIVGAVLFGDSSEGNRLFSMIQKEADISETSKISILQPLS
QEAGTSITAAMSDDEIICGCNGVSKGAIIQAIQEKGCSSTDEIKACTGASRSCGGCKPLVEEILQHTLGSDFDASAQKEA
ICGCTTLSRDEVVEEIKAKGLSHTREVMNVLGWKTPEGCSKCRPALNYYLGMINPTKYEDDRTSRFVNERMHANIQKDGT
YSVVPRMYGGVTNSTDLRKIADVVDKYEIPLVKMTGGQRIDLIGVKKEDLPKVWEDLDMPSGYAYGKTLRTVKTCVGEQF
CRFGTQDSMALGIALEKKFEGLNTPHKVKMAVSACPRNCAESGIKDLGVVGIDGGWELYVGGNGGTHLRAGDLLMKVKTN
EEVLEYAGAYLQYYRETANYLERTSAWLERVGLSHVQSVLNDPEKRQELNGRMNETLSVHKDPWKDFLEDKQTSKELFEN
VVTTS
>P42436 1.7.1.4~~~nasE~~~Assimilatory nitrite reductase [NAD(P)H] small subunit~~~COG2146
MVNKDVTKVCIGKIEELPEQLGKTVYIEDKELAVFKLSDGSIRAIENRCPHKGGVLAEGIVSGQYVFCPMHDWKISLEDG
IVQEPDHGCVKTYETLIEGEHVYLVY
>P42437 2.1.1.107~~~nasF~~~Uroporphyrinogen-III C-methyltransferase~~~COG0007
MIMKNGIVYFVGAGPGDPGLLTIKGKQALKEADVILYDRLANPKLLEFASPDCQFIYCGKLPNRHFMKQKEINALLVEKA
LNGLTVVRLKGGDPSVFGRVGEEADALHEHGIRYEMVPGITSGIAAPLYAGIPVTHRDFASSFAMITAHDKSLKGTPNLD
WEGLARSVQTLVFYMGVKNLSYICQQLISYGKSPSVPVIVIQWGTWGRQRSVKGTLENIQQKVQEHQITNPAIIVIGDIV
NFQTHSWFESKPLIGRHLMVVTHGEDEDPLADKLRDSGADLIEWPKWRTENMPVNEEILRKIGTFEDVFFTSRRAVCEFF
RALASQKIDIRQLTAKLSAASEQAKTELEKRGFLVTAIQPDSEKRLVVGSRHAVENMQKHESCSFYITHENVIDDRFTHM
IQRTISESPLHMVICPNKLSVQQLINGGEQIGILPEPSASRPPIVCIGDDSAAGIYGFTAVQEQDELLAFIHNQHAEKKL
LHT
>A0A0H3JXA8 2.5.1.152~~~cntL~~~D-histidine 2-aminobutanoyltransferase~~~
MNNFNNEIKLILQQYLEKFEAHYERVLQDDQYIEALETLMDDYSEFILNPIYEQQFNAWRDVEEKAQLIKSLQYITAQCV
KQVEVIRARRLLDGQASTTGYFDNIEHCIDEEFGQCSITSNDKLLLVGSGAYPMTLIQVAKETGASVIGIDIDPQAVDLG
RRIVNVLAPNEDITITDQKVSELKDIKDVTHIIFSSTIPLKYSILEELYDLTNENVVVAMRFGDGIKAIFNYPSQETAED
KWQCVNKHMRPQQIFDIALYKKAAIKVGITDV
>A0A0H2ZHV3 2.5.1.-~~~cntL~~~L-histidine 2-aminobutanoyltransferase~~~
MQGRTPLLETLRELECEIRLLTVYARECCGCYEILRRKLDRLSGLIGEDCSRAQWQADSDDPALQALGLRLRDAAVQALC
ELEKHLCQGVLHEPGEMGRYLGSLLESIRGELDSAGIDADARVLFVGSGALPTSALVLAREVGAHLCCLDIDEEALGYAR
EIARCQGLEARMQFSSLPPAELAFSRDATHFLIASLVQQKSAVLAQIRQVMRADAKVLLRHGSGIKGLFNYPVEPAELEG
WQVCAERVSQPLYDTLILEKAGR
>Q9HUX4 2.5.1.-~~~cntL~~~L-histidine 2-aminobutanoyltransferase~~~
MQGRTPLLETLRELECEIRLLTVYARECCGCYEILRRKLDRLSGLIGEDCSRAQWQADSDDPALQALGLRLRDAAVQALC
ELEKHLCQGVLHEPGEMGRYLGSLLESIRGELDSAGIDADARVLFVGSGALPTSALVLAREVGAHLCCLDIDEEALGCAR
EIARCQGLEARMQFSSLPPAELAFSRDATHFLIASLVQQKSAVLAQIRQVMRADAKVLLRHGSGIKGLFNYPVEPAELDG
WRVCAERVSQPLYDTLILEKAGR
>Q48468 ~~~nasR~~~Nitrate regulatory protein~~~COG3707
MNNMAGNTPEVVDWFARARRLQKQQLHQLAQQGTLAGQISALVHMLQCERGASNIWLCSGGRLYAAECRAGAALVDEQLT
RFYAALEPARDAASSALCWRIACAVWYLPQLAALRKRVRDREIAAEEATGQFSRIIRHLLNIVPQLNDSIDDPQIAGRMV
ALYSFMQGKELAGQERALGALGFARGQFSDELRQQLVDRIDGQQPCFDSFQALAQPPQTALFAEQCQASLEIEQLRRVAC
TRQPPADEGETALRWFCAQTQRLEQLRGVEELLIVDLLNAADALLEGEEPEAQLPPADWQEDSIALRLDKQLLPLVRQQA
HELQQLSGQLASLKDALEERKLIEKAKSVLMTYQGMQEEQAWQALRKMAMDKNQRMVEIARALLTVKALWRVTPKE
>P46903 7.2.2.4~~~natA~~~ABC transporter ATP-binding protein NatA~~~COG4555
MITLTDCSRRFQDKKKVVKAVRDVSLTIEKGEVVGILGENGAGKTTMLRMIASLLEPSQGVITVDGFDTVKQPAEVKQRI
GVLFGGETGLYDRMTAKENLQYFGRLYGLNRHEIKARIEDLSKRFGMRDYMNRRVGGFSKGMRQKVAIARALIHDPDIIL
FDEPTTGLDITSSNIFREFIQQLKREQKTILFSSHIMEEVQALCDSVIMIHSGEVIYRGALESLYESERSEDLNYIFMSK
LVRGIS
>P46904 7.2.2.4~~~natB~~~ABC transporter permease protein NatB~~~COG1668
MLSHIYKKEMIDALRDRKTILLTILVPMIMMLGLVFFYESMLSDKGEQYTLAVGHSLPPALESKLNEIDEISVKTFAKPE
EAVDEGKADAYLNVPKEFDSYVNSMTPFKVDVYGNSIDQGSSNAMQLVQSALDQYKNEIVQQRLTNKHIDQSVIQPFTIQ
QKEADEEKGTSAIMLSAILPMLILTSIVSGAMPIALDIMAGEKDRKSIEALLLTPVSRNKVLVGKWLAVSTFGVASGVFA
LVFLILSTVLFTENLKTAFQLGDHMWSVIGASALIIVLSALLISAMELFISIMSSSVKEAQSYMSLVVFLPVFPMFFIFS
KAPNQFDLSYFLIPFLNLHALFKQLLFGMVDPATILSTSGTIAVLIAIFFLLARACFLKDKWVLPK
>P70955 ~~~natR~~~Transcriptional regulatory protein NatR~~~COG3279
MVKVGLVDDYRVDLEKLEAIVSRMQDVEIVFSTDSAKEAYRRVKNGDIDLLLADIEMPHMSGYELADLIKSHSLDVDVIF
VTGHGGYAVHAFDLNVHDYIMKPYYADRLAASFDRYLKKKTETSLNGRILIKQKSEMHVLQKKDIIFAERTGRSTTIVTT
AEEVQTYQTLNDIKGDLPEKDFLRSHRSFIINIHYIKHFSAYTKHSFTVSFEGTSKKAMITKQQLDYFQNYYF
>O86309 2.3.1.5~~~nat~~~Arylamine N-acetyltransferase~~~
MAMDLGGYLTRIGLDGRPRPDLGTLHAIVAAHNRSIPFENLDPLLGIPVADLSAEALFAKLVDRRRGGYCYEHNGLLGYV
LEELGFEVERLSGRVVWMRADDAPLPAQTHNVLSVAVPGADGRYLVDVGFGGQTLTSPIRLEAGPVQQTRHEPYRLTRHG
DDHTLAAQVRGEWQPLYTFTTEPRPRIDLEVGSWYVSTHPGSHFVTGLTVAVVTDDARYNLRGRNLAVHRSGATEHIRFD
SAAQVLDAIVNRFGIDLGDLAGRDVQARVAEVLDT
>P9WJI5 2.3.1.5~~~nat~~~Arylamine N-acetyltransferase~~~COG2162
MALDLTAYFDRINYRGATDPTLDVLQDLVTVHSRTIPFENLDPLLGVPVDDLSPQALADKLVLRRRGGYCFEHNGLMGYV
LAELGYRVRRFAARVVWKLAPDAPLPPQTHTLLGVTFPGSGGCYLVDVGFGGQTPTSPLRLETGAVQPTTHEPYRLEDRV
DGFVLQAMVRDTWQTLYEFTTQTRPQIDLKVASWYASTHPASKFVTGLTAAVITDDARWNLSGRDLAVHRAGGTEKIRLA
DAAAVVDTLSERFGINVADIGERGALETRIDELLARQPGADAP
>Q9RAH6 2.5.1.38~~~nat~~~Isonocardicin synthase~~~
MTALSRVSGQAPDRVVETYAEGKPYDLFFLDVAGVRLVGRKTEAAYPGPDRDGLPAERLKCALVEARMLLGVVERDQVAE
DHVAVFHRPLGEAEKAELFAAAVADPTTDLYYPYAQLGDRVRETEEGGWEVTDESARELDHAEEVLRDHVPDRLAELGFR
GGVAYDAACSTGAFLQAVGRRFPGTRTIGQDLSPAMVARARTRLDEAHCGDGIRPAIPEASADLVVCRHLNAFVVGTGQA
HDLLAAAASRCREGGLVVLLGHTPVLVSSQWCEMSGLTPLQRSGATPSGHALFQCYVLRKG
>P9WFG7 5.99.-.-~~~~~~Peroxynitrite isomerase 1~~~COG4044
MTRDLAPALQALSPLLGSWAGRGAGKYPTIRPFEYLEEVVFAHVGKPFLTYTQQTRAVADGKPLHSETGYLRVCRPGCVE
LVLAHPSGITEIEVGTYSVTGDVIELELSTRADGSIGLAPTAKEVTALDRSYRIDGDELSYSLQMRAVGQPLQDHLAAVL
HRQR
>P9WFG9 5.99.-.-~~~~~~Peroxynitrite isomerase 2~~~COG4044
MSSGAGSDATGAGGVHAAGSGDRAVAAAVERAKATAARNIPAFDDLPVPADTANLREGADLNNALLALLPLVGVWRGEGE
GRGPDGDYRFGQQIVVSHDGGDYLNWESRSWRLTATGDYQEPGLREAGFWRFVADPYDPSESQAIELLLAHSAGYVELFY
GRPRTQSSWELVTDALARSRSGVLVGGAKRLYGIVEGGDLAYVEERVDADGGLVPHLSARLSRFVG
>P35087 ~~~nblA~~~Phycobilisome degradation protein NblA~~~
MLPPLPDFSLSVEQQFDLQKYRQQVRDISREDLEDLFIEVVRQKMAHENIFKGMIRQGS
>Q93NF6 1.1.1.328~~~nboR~~~Nicotine blue oxidoreductase~~~
MGDTDLPCVTGVLLAAGAGKRLGRGPKALLPYRGRTLVEDAAETMLVGGCHEVVIVLGANAQAVCARANLEPYRIVVNHD
WSSGMGSSYLAGDAAAHTKNHILVALVDQPGLSVTTVGRLLVSHRPGRISSAAYSSLDSPRVLRRGHPMVIDAGLRPAVA
STVSGDAGARVFLRQKPWLVDLIDCSDESTGEDVDTVEQMYRLL
>Q6DLR9 1.7.1.16~~~nbzA~~~Nitrobenzene nitroreductase~~~
MPTSPFIDDLIRDRRAKRGFLDQPVSIEMVKDILSAAKYAPSSSNTQPWRCYVITGEARERITTAAVEAYRAAPEGLPPE
YPFFPQPLHEPYATRFNSFRGQLGDALGIPRSDITLRRRDVERQFRFFDAPVGLIFTMDRRLEWASFICYGCFLQNIMLA
AKGRGLDTCTQVFWSMQHPVLRTELNLPDDQMVVAGMSLGWADNSLPENQMSISKMELEEFTTFVHE
>O06769 3.5.1.23~~~~~~Neutral ceramidase~~~
MLSVGRGIADITGEAADCGMLGYGKSDQRTAGIHQRLRSRAFVFRDDSQDGDARLLLIVAELPLPMQNVNEEVLRRLADL
YGDTYSEQNTLITATHTHAGPGGYCGYLLYNLTTSGFRPATFAAIVDGIVESVEHAHADVAPAEVSLSHGELYGASINRS
PSAFDRNPPADKAFFPKRVDPHTTLVRIDRGEATVGVIHFFATHGTSMTNRNHLISGDNKGFAAYHWERTVGGADYLAGQ
PDFIAAFAQTNPGDMSPNVDGPLSPEAPPDREFDNTRRTGLCQFEDAFTQLSGATPIGAGIDARFTYVDLGSVLVRGEYT
PDGEERRTGRPMFGAGAMAGTDEGPGFHGFRQGRNPFWDRLSRAMYRLARPTAAAQAPKGIVMPARLPNRIHPFVQEIVP
VQLVRIGRLYLIGIPGEPTIVAGLRLRRMVASIVGADLADVLCVGYTNAYIHYVTTPEEYLEQRYEGGSTLFGRWELCAL
MQTVAELAEAMRDGRPVTLGRRPRPTRELSWVRGAPADAGSFGAVIAEPSATYRPGQAVEAVFVSALPNNDLRRGGTYLE
VVRREGASWVRIADDGDWATSFRWQRQGRAGSHVSIRWDVPGDTTPGQYRIVHHGTARDRNGMLTAFSATTREFTVV
>Q9I596 3.5.1.23~~~~~~Neutral ceramidase~~~
MSRSAFTALLLSCVLLALSMPARADDLPYRFGLGKADITGEAAEVGMMGYSSLEQKTAGIHMRQWARAFVIEEAASGRRL
VYVNTDLGMIFQAVHLKVLARLKAKYPGVYDENNVMLAATHTHSGPGGFSHYAMYNLSVLGFQEKTFNAIVDGIVRSIER
AQARLQPGRLFYGSGELRNANRNRSLLSHLKNPDIVGYEDGIDPQMSVLSFVDANGELAGAISWFPVHSTSMTNANHLIS
PDNKGYASYHWEHDVSRKSGFVAAFAQTNAGNLSPNLNLKPGSGPFDNEFDNTREIGLRQFAKAYEIAGQAQEEVLGELD
SRFRFVDFTRLPIRPEFTDGQPRQLCTAAIGTSLAAGSTEDGPGPLGLEEGNNPFLSALGGLLTGVPPQELVQCQAEKTI
LADTGNKKPYPWTPTVLPIQMFRIGQLELLGAPAEFTVMAGVRIRRAVQAASEAAGIRHVVFNGYANAYASYVTTREEYA
AQEYEGGSTLYGPWTQAAYQQLFVDMAVALRERLPVETSAIAPDLSCCQMNFQTGVVADDPYIGKSFGDVLQQPRESYRI
GDKVTVAFVTGHPKNDLRTEKTFLEVVNIGKDGKQTPETVATDNDWDTQYRWERVGISASKATISWSIPPGTEPGHYYIR
HYGNAKNFWTQKISEIGGSTRSFEVLGTTP
>Q44582 ~~~nccX~~~Nickel-cobalt-cadmium resistance protein NccX~~~
MMKSRTFRLSVSTLVGALVGVLMAIVGVYVTHSTEEPHTSLHEMLHDAVPLDSNEREILELKEEEFTARRREIESRLRAA
NGKLAESIAKNPQWSPEVEEATREVERAAADLQRATLVHVFEMRAGLKPEHRAAYDRVLVDALKRGSQ
>A0A1X9WEP1 2.1.1.351~~~ncmP~~~Nocamycin O-methyltransferase~~~
MVFDRLAGIYDATGVEFFRPVARRLLDLVDPRPGVDLLDVGCGRGAVLFPAAERVGPGGTVVGIDIAEPMVRATAAEAAE
RGLGTVSVRLGDGADPAFPAGSFDVVTASMSAALFPDLPAVAARYARLLRPDGRIGLTGPVPPPSLREWALGPLRVGAVV
DAIAPEAVAATHPRIAALLGAHPFGAPGAVADALRAAGFVEVRELHEDLELRAPSAEALVGWTWSNGLRVYWELVEPDRR
AAVAAELVRDLTAHAAGGPITATYPVAYVTGRLRPAGGTP
>P39411 3.6.1.73~~~yjjX~~~Inosine/xanthosine triphosphatase~~~COG1986
MHQVVCATTNPAKIQAILQAFHEIFGEGSCHIASVAVESGVPEQPFGSEETRAGARNRVANARRLLPEADFWVAIEAGID
GDSTFSWVVIENASQRGEARSATLPLPAVILEKVREGEALGPVMSRYTGIDEIGRKEGAIGVFTAGKLTRASVYHQAVIL
ALSPFHNAVY
>P39432 3.6.1.73~~~yjjX~~~Inosine/xanthosine triphosphatase~~~
MHQVISATTNPAKIQAILQAFEEIFGEGSCHITPVAVESGVPEQPFGSEETRAGARNRVDNARRLHPQADFWVAIEAGID
DDATFSWVVIDNGVQRGEARSATLPLPAVILDRVRQGEALGPVMSQYTGIDEIGRKEGAIGVFTAGKLTRSSVYYQAVIL
ALSPFHNAVYR
>Q9KU27 3.6.1.73~~~~~~Inosine/xanthosine triphosphatase~~~COG1986
MPPIIKRRVMRKIIIASQNPAKVNAVRSAFSTVFPDQEWEFIGVSVPSEVADQPMSDEETKQGALNRVRNAKQRHPGAEY
YVGLEAGIEENKTFAWMIVESDQQRGESRSACLMLPPLVLERLRQAKELGDVMDEVFGTENIKQKGGAIGLLTRHHLTRS
TVYHQALILALIPFINPEHYPSA
>Q84HC8 2.1.1.303~~~ncsB1~~~2,7-dihydroxy-5-methyl-1-naphthoate 7-O-methyltransferase~~~
MGKRAAHIGLRALADLATPMAVRVAATLRVADHIAAGHRTAAEIASAAGAHADSLDRLLRHLVAVGLFTRDGQGVYGLTE
FGEQLRDDHAAGKRKWLDMNSAVGRGDLGFVELAHSIRTGQPAYPVRYGTSFWEDLGSDPVLSASFDTLMSHHLELDYTG
IAAKYDWAALGHVVDVGGGSGGLLSALLTAHEDLSGTVLDLQGPASAAHRRFLDTGLSGRAQVVVGSFFDPLPAGAGGYV
LSAVLHDWDDLSAVAILRRCAEAAGSGGVVLVIEAVAGDEHAGTGMDLRMLTYFGGKERSLAELGELAAQAGLAVRAAHP
ISYVSIVEMTAL
>Q84HC5 6.2.1.43~~~ncsB2~~~2-hydroxy-7-methoxy-5-methyl-1-naphthoate--CoA ligase~~~
MHETAAAPAPAGFVPWPDDVAARYTAAGHWEGRSLGTHLAEAARKVPEAVCLVDGPVRMSYSELMARADGAAVRMRGLGI
RPADRVVVQLPNCWEHVVVTMACLRLGALPIWALPQYRHRELSGVVTHARASALVVPDVYREFDHQALAHEVAEAQPTVR
HVLVAGSDVRPDSVDLRALCEPLDADEAARVAAELDRSAPRGEEVAMLKLSGGTTGLPKLVARTHNDLSYMIKRAAQVCG
FGRDTVYLAVLPLGHGFPNTGPGVLGTLLAGGRVVISGSPAPEAAFALMERERVTATSVVPAIVMRWLQYRDERPGADLG
SLELMQVGASRLEPEVARQVGPKLGCRLQQVFGMAEGLLCLTRLDDPDDVVHYTQGRPISPDDEIRVVDPEGRTVGVGEP
GALLTRGPYTPRGYYDSPSANARAFTPDGWYRTGDLVRRTPDGNLIVVGREKDLINRGGEKINAEEVEGFAVQVDGVLQA
AAVGLPDSELGERICLFVVLADGTRVELADVRKVMENAETASFKLPERLITLPSLPTTPMGKIDKKALRAAAGRMSET
>Q84HB6 1.14.15.31~~~ncsB3~~~2-hydroxy-5-methyl-1-naphthoate 7-hydroxylase~~~
MCPYRLDPEGADTHGETARLREQGPIARVELQDGVLAWSVHDYAVAKQIMADERFSKNPRKNWPAYINGEISNGWPLITW
VAMDTMATQDGADHARLRKLLLKAFTERRVESMRPHIEKTVKELLDNMAAKADDEIVDIKEMFHAELPTRLMCDLFGVPE
ERRAEVLAGGHKNIDTRISSEAAEANLGQWQEAISDLVEYKRHHPGDDLTSALIEARDEGSRLSDSELIGTLHLLLGAGS
ETLVNALAHSSLALLVDADLRKKVTSGEIPWVNVWEETLRVESPVAHLPFRYATEDFEIGGVKISKGDPLLVDFAGIGRD
PAVHSDAPDEFDALRPDKTHLSFGHGVHYCLGARLAKHAWMIGIPALFERFPDMELAVRRDELKGQGSFVVNGHASLPVH
LKGRAAALAR
>Q84HC6 2.3.1.237~~~ncsB~~~Neocarzinostatin naphthoate synthase~~~
MSCRYAPDLDSPDKFWDFLVNGRSTVGDMPDKRWEPYASGSPQATAAMRDTVRRGAFLDDIEGFDAEFFGISPREADFLD
PQQRFMLELAWEALADAGVPPLTLRNTDTGVYAAANSNDYGRRLLEDIPRTGAYAVNGTTLYGIANRVSYFLDLHGPSMA
VDTACAGALTALHLARQSLLTGETPLAIVGGLNIMSTPSLNVALNDAGAMSPDGRSKAFDEDADGYGRGEGAGVLVLKRL
SDARRDNDPVHAVIRGSGVFQDGRSDGMMAPDGDAQEHMLRQAYHRAGVDPATVDYVEAHGTGTPTGDREEITALAKVFG
AGRSPHAPCLIGSVKPNVGHVEGGSGITGVIKTVLALRNELIPPTLHDRPRTDVDWDAWGVRLVGQVQEWPSCGRPRRAG
VSSYGVGGTISHVILEESPVPAATSSADASTGVRTPALFPLSAASEAGLRALAGEAAGWVASRPDTPLPSVGHTLTQRRS
HLAQRAAVVADSAEQLVDRLREVAEGRNGPGIVSARTSAGRADDAVWVFSGHGAQWSGMGRRLLASEPVFAATLDALDEV
FREELGWTPREAVTEGGPWTAAHVQALTFAVQIGLADVWRSKGLRPGAVIGHSVGEIAAAVVAGSLDRDEAARFACRRAA
ALQRLDGRGAMVMVGLPFEEAALRLGDRRDVEAAISAGPHSTVLSGDRSAVLRVAEEWQASEVWTRTVDSDIAFHSVHVD
EVTGDIESAARLLTPRPPTVPLYTTALSDPRSRAPRDSGYWAANLRKPVRFTEAVRAAAEDGHRLFLEVSSHPVVAHSVS
ETLLDLGIEDAAVAGTLRRDTDEVESLLENLAELHCHGVAVDWARHHTDGELVGLPAAVWQHRPYWIFPETTADAGLGRG
HDPASHSLLGGRMTVSGSPTRQVWQTRLDMDSRPYPQSHGLVGVEVTPAASIINTFAAAVEEDGPSALTDIVLRTPLAVE
PPRVVQVVREGRSLSLATRVAEDADADGSEWITHTTAAVTPGVRPAGGRLDTEAIRSRLPEGSLTRADEMFERMGVEGYA
FPWDLEELRHDDHEQLAVLQIEPSPAQRATSWAHVIDGALTISAMVVSPGDATVLWMSRSIDQVTWSGEPPARLTVHSTR
SLRSPHDTVDVRVADERGDVVCEVVGLRFAAVEHIGAAVLPSELVHEIVWRPWDPEDHEGAEDAPVEQVILVGDPEATVP
LAEQLESAGMTCVQVGDSPETGLRPDLFARPGAVVVAPALAGSDTAPEEEAERVSWLLVRTVQRVAEIRADVTGDAPAQR
VWCVTSDVRRARDERSVAHGPLWGLARIVAGDHPELWGGAVDIGPSADGIGARLVALLDGAAGTEDVISLTGEGAEVARL
SRIDRSADGTPLQCSPSGTVLITGGLGALGLEVARWLVDRGARRLVLVSRRALPNRTEWPAVTDAETRRRIDGVLALEAL
GVTVRVLALDITDVDQVSAALAPEALGLPPVRGVVHAAGVVNNALVDKVDLEGLREVLAPKVRGAMVLHRLFPPGDLDFF
VLFSSCGQFARLSGQASYAAANSFLDTLASHRNAGEHTETVSLGWTAWRGLGMSSNIDTTMFEANSRGLEAVSATEAFGA
WSFGDRFQSDYQAILRVVPTPVHTPRLPVFRDLPVSGETDGPTGDQLFTTTLEGLPEQEARERITADVREQVAGVLNFDP
SEVEVKRPLVELGVDSVMTVALRVRLQRRYGLELPPTILWAKPTVAALSEHVCDSLRWDGEEHGDLAAPTAAA
>P0A3R9 ~~~ncsA~~~Neocarzinostatin~~~
MVPISIIRNRVAKVAVGSAAVLGLAVGFQTPAVAAAPTATVTPSSGLSDGTVVKVAGAGLQAGTAYDVGQCAWVDTGVLA
CNPADFSSVTADANGSASTSLTVRRSFEGFLFDGTRWGTVDCTTAACQVGLSDAAGNGPEGVAISFN
>P0A3S0 ~~~ncsA~~~Neocarzinostatin~~~
AAPTATVTPSSGLSDGTVVKVAGAGLQAGTAYDVGQCAWVDTGVLACNPADFSSVTADANGSASTSLTVRRSFEGFLFDG
TRWGTVDCTTAACQVGLSDAAGNGPEGVAISFN
>P72349 3.5.1.81~~~dan~~~D-aminoacylase~~~COG3653
MSQSDSQPFDLLLAGGTLIDGSNTPGRRADLGVRGDRIAAIGDLSDAAAHTRVDVSGLVVAPGFIDSHTHDDNYLLRRRD
MTPKISQGVTTVVTGNCGISLAPLAHANPPAPLDLLDEGGSYRFERFADYLDALRATPAAVNAACMVGHSTLRAAVMPDL
QRAATDEEIAAMRDLAEEAMASGAIGISTGAFYPPAARATTEEIIEVCRPLSAHGGIYATHMRDEGEHIVAALEETFRIG
RELDVPVVISHHKVMGQPNFGRSRETLPLIEAAMARQDVSLDAYPYVAGSTMLKQDRVLLAGRTIITWCKPFPELSGRDL
DEVAAERGKSKYDVVPELQPAGAIYFMMDEPDVQRILAFGPTMIGSDGLPHDERPHPRLWGTFPRVLGHYARDLGLFPLE
TAVWKMTGLTAARFGLAGRGQLQAGYFADLVVFDPATVADTATFEHPTERAAGIHSVYVNGAPVWQEQAFTGQHAGRVLA
RTAA
>P73735 1.6.5.12~~~ndbB~~~Demethylphylloquinone reductase NdbB~~~COG1252
MTDARPRICILGGGFGGLYTALRLGQLSWEGHTPPEIVLVDQRDRFLFAPFLYELVTEEMQTWEIAPPFVELLAESGVIF
RQAEVTAIDFDHQKVLLNDQDKGTESLAFDQLVIALGGQTPLPNLPGLKDYGLGFRTLEDAYKLKQKLKSLEQADAEKIR
IAIVGGGYSGVELAAKLGDRLGERGRIRIIERGKEILAMSPEFNRQQAQASLSAKGIWVDTETTVTAITATDVTLQFREQ
EDVIPVDLVLWTVGTTVSPLIRNLALPHNDQGQLRTNAQLQVEGKTNIFALGDGAEGRDASGQLIPTTAQGAFQQTDYCA
WNIWANLTGRPLLPCRYQPLGEMLALGTDGAVLSGLGIKLSGPAALLARRLVYLYRFPTWQHQLTVGLNWLTRPLGDWLK
NEPS
>P94212 3.5.1.83~~~~~~N-acyl-D-aspartate deacylase~~~COG3653
MTDRSTLDDAPAQADFIIAGATLIDGGGGPARQGDLAVRGGRIVALGDFAHAPGVPVIDARGLALAPGFIDSHTHDDGYL
LAHPEMLPKVSQGITTVVTGNCGISLAPLSRRQIPQPLDLLGPPELFRFATFRDWLRALAETPAAVNVIPLVGHTTLRVA
VMDDTGRAATDAERAAMRALLDEALQAGAFGVSTGTFYPPASAAPTDEIIDVCQPLRGRAGAIYATHLRDEADHIVPAME
EALLIGRELDCRVVFSHHKLAGERNHGRSRETLDMISRAAATQRVCLDCHPYPATSTMLRLDRARLASRTLITWSKGYPE
ATGRDFSEVMAELGLDDEAAIARLAPAGAIYFLMDQADVNRIFSHPLTTVGSDGLPFDPHPHPRQWGTFTNVLRTMVREQ
RLLSLETAIHKMTGLAAAQYGLTERGLLRQGYHADLVLFDPANVTDTATFSAPIQVSQGIHAVWVNGRQVWDGERTGAER
PGQVLAPGDAIPWSQQSE
>P94211 3.5.1.82~~~dag~~~N-acyl-D-glutamate deacylase~~~COG3653
MQEKLDLVIEGGWVIDGLGGPRRRADVGIRGERIAAIGDLSAAPADRRLDAGGRIVAPGFIDTHGHDDLMFVEKPGLEWK
TSQGITSVVVGNCGISGAPAPLPGNTAAALALLGDSPLFADMAMYFGALEAQRPMINVAALVGHANLRLAAMRDPAAQPS
AKEQRAMERMLADALEAGAVGFSTGLAYQPGGVAEQAELDGLARVAAARGALHTSHIRNEGDAVEAAVDEVLAVGRRTGC
ATVLSHHKCMMPANWGKSAATLANIDRARAAGVDVALDIYPYPGSSTILIPERADQIDDIRITWSTPHPECGGQSLAEIA
ARWGCDAVTAARRLCPAGAIYFAMDENEVRRIFQHECCMVGSDGLPNDAHPHPRLWGSFTRVLGRYVREAELLTLEAAVA
KMTALPARVFGLADRGRLAVGAWADVVVFDADTVCDRATWDAPTLASAGIEHVLVNGCAVFPQAPPSHRPGRILRRDASI
AGAPEFSR
>Q0QLF4 1.17.1.5~~~ndhF~~~Nicotinate dehydrogenase FAD-subunit~~~
MKDFEFFAPKTLEEAKGLLHQYKDVPPAIIAGGTDLVIEINDRWEKPDVVIDIKKLKELEYIRVEENTIHIGALSTFTQI
ENHPFIRSHVRALYKAASQVGSPQIRNLGTIGGNLSTSSVAGDGVSAMTTLDATVVLESVRGTRQMKLTDFFDGEGFKRR
NALEADEIMTEVIIDRPDAHSASAFYKLAKRKSLAISVIGGGMAVKVDDAGVCTWASMRGGCIGRYPLHFKQAEEMLVGA
PLTMETMEATLPILHDTVYDMARARPSVLYKKESVQGVFKKLFVDILDQLEGGCNE
>P95200 1.6.5.9~~~ndhA~~~Type II NADH:quinone oxidoreductase NdhA~~~COG1252
MTLSSGEPSAVGGRHRVVIIGSGFGGLNAAKALKRADVDITLISKTTTHLFQPLLYQVATGILSEGDIAPTTRLILRRQK
NVRVLLGEVNAIDLKAQTVTSKLMDMTTVTPYDSLIVAAGAQQSYFGNDEFATFAPGMKTIDDALELRGRILGAFEAAEV
STDHAERERRLTFVVVGAGPTGVEVAGQIVELAERTLAGAFRTITPSECRVILLDAAPAVLPPMGPKLGLKAQRRLEKMD
VEVQLNAMVTAVDYKGITIKEKDGGERRIECACKVWAAGVAASPLGKMIAEGSDGTEIDRAGRVIVEPDLTVKGHPNVFV
VGDLMFVPGVPGVAQGAIQGARYATTVIKHMVKGNDDPANRKPFHYFNKGSMATISRHSAVAQVGKLEFAGYFAWLAWLV
LHLVYLVGYRNRIAALFAWGISFMGRARGQMAITSQMIYARLVMTLMEQQAQGALAAAEQAEHAEQEAAG
>P27724 7.1.1.-~~~ndhH~~~NAD(P)H-quinone oxidoreductase subunit H~~~COG0649
MTKIETRTEPMVLNMGPHHPSMHGVLRLIVTLDGEDVVDCEPVIGYLHRGMEKIAESRTNIMYVPYVSRWDYAAGMFNEA
ITVNAPEKLADIEVPKRAQYIRVIMLELNRIANHLLWLGPFMADVGAQTPFFYIFREREMIYDLWEAASGMRLINNNYFR
VGGVAVDLPYGWNDKCEDFCDYFLPKVDEYEKLITNNPIFRRRVEGVGTVTREEAINWGLSGPMLRGSGVKWDLRKVDHY
ECYDELDWEVQYETAGDCFARYLVRIREMRESVKIIRQALKAMPGGPYENLEAKRLQEGKKSEWNDFQYQYIAKKVAPTF
KIPAGEHYVRLESGKGELGIFIQGNDDVFPWRWKIRSADFNNLQILPHILKGVKVADIMAILGSIDIIMGSVDR
>Q8DJD9 7.1.1.-~~~ndhH~~~NAD(P)H-quinone oxidoreductase subunit H~~~COG0649
MPKIETRTEPMVINMGPHHPSMHGVLRLMVTLDGEDVIDCEPVIGYLHRGMEKIAENRTNIMFIPYVSRWDYAAGMFNEA
VTVNAPEKLAGIPVPKRASYIRVIMLELNRIANHLLWLGPFLADVGAQTPFFYIFREREYIYDLFEAATGMRFINNNYFR
IGGVAADLTYGWVTKCRDFCDYFLPKVDEYERLITNNPIFVRRLQGVGKISREEAINWGLSGPMLRASGVKWDLRKVDHY
ECYDDFDWDVPVATEGDCLARYIVRIQEMRESVKIIRQALDGLPGGPYENLEAKRMLEGAKSEWNGFDYQYIGKKLSPTF
KIPKGEHYVRVESGKGELGIYLIGDDNVFPWRWKIRPPDFNNLQVLPQLLKGMKVADIVAILGSIDVIMGSVDR
>P26525 7.1.1.-~~~ndhI~~~NAD(P)H-quinone oxidoreductase subunit I~~~COG1143
MFNNILKQVGDYAKESLQAAKYIGQGLAVTFDHMSRRPITVQYPYEKLIPSERFRGRIHFEFDKCIACEVCVRVCPINLP
VVDWEFNKAVKKKELKHYSIDFGVCIFCGNCVEYCPTNCLSMTEEYELAAYDRHDLNYDNVALGRLPYKVTEDPMVTPLR
ELGYLPKGVIEPHNLPKGSQRAGQHPEDLVKAE
>Q8DL31 7.1.1.-~~~ndhI~~~NAD(P)H-quinone oxidoreductase subunit I~~~COG1143
MKFLNQITNYAKEAVQSAKYIGQGLSVTFDHMRRRPITVQYPYEKLIPSERFRGRIHFEFDKCIACEVCVRVCPINLPVV
DWVFNKELKKKELKHYSIDFGVCIFCANCVEYCPTNCLSVTEEYELATYDRHELNYDSVAMGRIPYKVTQDPMVTPIREF
AYLPAGVMSGHDLPAGAQRAGERPEAIANTAKSSEN
>P19125 7.1.1.-~~~ndhJ~~~NAD(P)H-quinone oxidoreductase subunit J~~~COG0852
MAEEVNSPNEAVNLQEETAIAPVGPVSTWLTTNGFEHQSLTADHLGVEMVQVEADLLLPLCTALYAYGFNYLQCQGAYDE
GPGKSLVSFYHLVKLTEDTRNPEEVRLKVFLPRENPVVPSVYWIWKAADWQERECYDMFGIVYEGHPNLKRILMPEDWVG
WPLRKDYISPDFYELQDAY
>Q8DJ01 7.1.1.-~~~ndhJ~~~NAD(P)H-quinone oxidoreductase subunit J~~~COG0852
MSDTPEAPIVEAGPVGRLLQSQNLSVESLGRDASGVEMIKVDRDRLLAVCQTLYADGFNYLRCQAAYDSGPGQDLVSTYH
LIKLSDNADRPPEVRIKVFVPRDDPRVPSVYWIWKTADWQERESYDMFGIVYEGHPNLKRILMPEDWVGWPLRKDYITPD
FYELQEAY
>P19050 7.1.1.-~~~ndhK1~~~NAD(P)H-quinone oxidoreductase subunit K 1~~~COG0377
MSPNPANPTDLERVATAKILNPASRSQVTQDLSENVILTTVDDLYNWAKLSSLWPLLYGTACCFIEFAALIGSRFDFDRF
GLVPRSSPRQADLIITAGTITMKMAPALVRLYEEMPEPKYVIAMGACTITGGMFSSDSTTAVRGVDKLIPVDVYIPGCPP
RPEAIFDAIIKLRKKVANESIQERAITQQTHRYYSTSHQMKVVAPILDGKYLQQGTRSAPPRELQEAMGMPVPPALTTSQ
QKEQLNRG
>Q8DKZ4 7.1.1.-~~~ndhK~~~NAD(P)H-quinone oxidoreductase subunit K~~~COG0377
MTNTTSPAILNPIARPEVPQELAENIILTSLNDVYDWARLSSLWPLMYGTACCFIEFAAMIGSRFDFDRFGLVPRNSPRQ
ADLIITSGTITMKMAPALVRLYEQMPSPKYVIAMGACTITGGMFSSDSYSAVRGVDKLIPVDVYLPGCPPRPEAIMDAIV
KLRKKIANEHINERGNLAQTHRLFTAKHKMKPVPPILTGQYLNAPSRQAPPPALAAAMGIAVPALGEAVSETTSVAE
>P27372 7.1.1.-~~~ndhL~~~NAD(P)H-quinone oxidoreductase subunit L~~~
MEDLLGLLLSETGLLAIIYLGLSLAYLLVFPALLYWYLQKRWYVASSVERLVMYFLVFLFFPGLLVLSPVLNLRPRRQAA
>Q8DKZ3 7.1.1.-~~~ndhL~~~NAD(P)H-quinone oxidoreductase subunit L~~~
MAVSTELLVLGVYGALAGLYLLVVPAIVYAYLNARWYVASSFERAFMYFLVTFFFPGLLLLAPFINFRPQPRSLNS
>P74338 7.1.1.-~~~ndhM~~~NAD(P)H-quinone oxidoreductase subunit M~~~
MLVKSTTRHVRIFSAEVQGNELIPSNNVLTMDVDPDNEFVWNEDALQQVYRRFDELVESYSGEDLTDYNLRRIGSDLEHF
IRDLLQAGKVSYNLDCRVLNYSMGLPKVENQETAGKYWLDN
>Q8DLN5 7.1.1.-~~~ndhM~~~NAD(P)H-quinone oxidoreductase subunit M~~~
MLLKSTTRHVHIYAGHVVDGEVHPDTETLTLNVDPDNELEWNEAALAKVEAKFRELVANAAGEDLTEYNLRRIGSDLEHF
IRSLLMQGEIGYNLNSRVRNYSLGIPRVNHS
>P74069 7.1.1.-~~~ndhN~~~NAD(P)H-quinone oxidoreductase subunit N~~~
MLPLPLIANGKGFIRALENDGALAVYAPLEGGYEGRYQRRLRANGYASISLSARGLGDVEAYLMQVHGVRPAHLGKKNIA
QEGAVGPIYFAQPIAGYQLENLPAQSKGLVLWILEGYILSQTEIQDLISLTKRVPKLKVVLEMGGDRVFRWQPLLDCLQA
A
>Q8DJU2 7.1.1.-~~~ndhN~~~NAD(P)H-quinone oxidoreductase subunit N~~~
MGLLAGYQFVKDLESAGALALFVPPEGGFEGRYQRRLRSKGYTTLPMSAPGLGDLAAYLTQEHGIRPAHTGKEDIRVYFQ
PPLVTYHLENLPPNAKGLVLWLIDGKRLSKQEFAYLAQLTQTLPKFKVVVEVGGDRVVRWEPLADWVAAA
>P74771 7.1.1.-~~~ndhO~~~NAD(P)H-quinone oxidoreductase subunit O~~~
MAAKMKKGSLVRVIRAQLENSLEAQASDRRLPDYLFHSKGEVLDLNEEYALVRFYVPTPNVWLRLDQIEALA
>Q8DMU4 7.1.1.-~~~ndhO~~~NAD(P)H-quinone oxidoreductase subunit O~~~
MAIKKGDLVKVVAEKLANSLEALASDHRYPPYLFEGRGEVVDIRGDYAQIKFPVPTPTVWLRLDQLEVAQ
>P00393 1.6.5.9~~~ndh~~~Type II NADH:quinone oxidoreductase~~~COG1252
MTTPLKKIVIVGGGAGGLEMATQLGHKLGRKKKAKITLVDRNHSHLWKPLLHEVATGSLDEGVDALSYLAHARNHGFQFQ
LGSVIDIDREAKTITIAELRDEKGELLVPERKIAYDTLVMALGSTSNDFNTPGVKENCIFLDNPHQARRFHQEMLNLFLK
YSANLGANGKVNIAIVGGGATGVELSAELHNAVKQLHSYGYKGLTNEALNVTLVEAGERILPALPPRISAAAHNELTKLG
VRVLTQTMVTSADEGGLHTKDGEYIEADLMVWAAGIKAPDFLKDIGGLETNRINQLVVEPTLQTTRDPDIYAIGDCASCP
RPEGGFVPPRAQAAHQMATCAMNNILAQMNGKPLKNYQYKDHGSLVSLSNFSTVGSLMGNLTRGSMMIEGRIARFVYISL
YRMHQIALHGYFKTGLMMLVGSINRVIRPRLKLH
>P95160 1.6.5.9~~~ndh~~~Type II NADH:quinone oxidoreductase Ndh~~~COG1252
MSPQQEPTAQPPRRHRVVIIGSGFGGLNAAKKLKRADVDIKLIARTTHHLFQPLLYQVATGIISEGEIAPPTRVVLRKQR
NVQVLLGNVTHIDLAGQCVVSELLGHTYQTPYDSLIVAAGAGQSYFGNDHFAEFAPGMKSIDDALELRGRILSAFEQAER
SSDPERRAKLLTFTVVGAGPTGVEMAGQIAELAEHTLKGAFRHIDSTKARVILLDAAPAVLPPMGAKLGQRAAARLQKLG
VEIQLGAMVTDVDRNGITVKDSDGTVRRIESACKVWSAGVSASRLGRDLAEQSRVELDRAGRVQVLPDLSIPGYPNVFVV
GDMAAVEGVPGVAQGAIQGAKYVASTIKAELAGANPAEREPFQYFDKGSMATVSRFSAVAKIGPVEFSGFIAWLIWLVLH
LAYLIGFKTKITTLLSWTVTFLSTRRGQLTITDQQAFARTRLEQLAELAAEAQGSAASAKVAS
>Q2FZV7 1.6.5.9~~~~~~Type II NADH:quinone oxidoreductase~~~COG1252
MAQDRKKVLVLGAGYAGLQTVTKLQKAISTEEAEITLINKNEYHYEATWLHEASAGTLNYEDVLYPVESVLKKDKVNFVQ
AEVTKIDRDAKKVETNQGIYDFDILVVALGFVSETFGIEGMKDHAFQIENVITARELSRHIEDKFANYAASKEKDDNDLS
ILVGGAGFTGVEFLGELTDRIPELCSKYGVDQNKVKITCVEAAPKMLPMFSEELVNHAVSYLEDRGVEFKIATPIVACNE
KGFVVEVDGEKQQLNAGTSVWAAGVRGSKLMEESFEGVKRGRIVTKQDLTINGYDNIFVIGDCSAFIPAGEERPLPTTAQ
IAMQQGESVAKNIKRILNGESTEEFEYVDRGTVCSLGSHDGVGMVFGKPIAGKKAAFMKKVIDTRAVFKIGGIGLAFKKG
KF
>Q7A6J4 1.6.5.9~~~~~~Type II NADH:quinone oxidoreductase~~~
MAQDRKKVLVLGAGYAGLQTVTKLQKAISTEEAEITLINKNEYHYEATWLHEASAGTLNYEDVLYPVESVLKKDKVNFVQ
AEVTKIDRDAKKVETNQGIYDFDILVVALGFVSETFGIEGMKDHAFQIENVITARELSRHIEDKFANYAASKEKDDNDLS
ILVGGAGFTGVEFLGELTDRIPELCSKYGVDQNKVKITCVEAAPKMLPMFSEELVNHAVSYLEDRGVEFKIATPIVACNE
KGFVVEVDGEKQQLNAGTSVWAAGVRGSKLMEESFEGVKRGRIVTKQDLTINGYDNIFVIGDCSAFIPAGEERPLPTTAQ
IAMQQGESVAKNIKRILNGESTEEFEYVDRGTVCSLGSHDGVGMVFGKPIAGKKAAFMKKVIDTRAVFKIGGIGLAFKKG
KF
>B0VKS3 2.7.4.6~~~ndk~~~Nucleoside diphosphate kinase~~~
MAIERTLSIVKPDAVSKNHIGEIFARFEKAGLKIVATKMKHLSQADAEGFYAEHKERGFFGDLVAFMTSGPVVVSVLEGE
NAVLAHREILGATNPKEAAPGTIRADFAVSIDENAAHGSDSVASAEREIAYFFADNEICPRTR
>O67528 2.7.4.6~~~ndk~~~Nucleoside diphosphate kinase~~~COG0105
MAVERTLIIVKPDAMEKGALGKILDRFIQEGFQIKALKMFRFTPEKAGEFYYVHRERPFFQELVEFMSSGPVVAAVLEGE
DAIKRVREIIGPTDSEEARKVAPNSIRAQFGTDKGKNAIHASDSPESAQYEICFIFSGLEIV
>Q81SV8 2.7.4.6~~~ndk~~~Nucleoside diphosphate kinase~~~COG0105
MEKTFLMVKPDGVQRAFIGEIVARFEKKGFQLVGAKLMQVTPEIAGQHYAEHTEKPFFGELVDFITSGPVFAMVWQGEGV
VDTARNMMGKTRPHEAAPGTIRGDFGVTVAKNIIHGSDSLESAEREIGIFFKEEELVDYSKLMNEWIY
>P31103 2.7.4.6~~~ndk~~~Nucleoside diphosphate kinase~~~COG0105
MMEKTFIMVKPDGVQRQLIGDILSRFERKGLQLAGAKLMRVTEQMAEKHYAEHQGKPFFGELVEFITSGPVFAMVWEGEN
VIEVTRQLIGKTNPKEALPGTIRGDYGMFVGKNIIHGSDSLESAEREINIFFKNEELVSYQQLMAGWIY
>O51419 2.7.4.6~~~ndk~~~Nucleoside diphosphate kinase~~~
MLLQKTLCIVKPDGVRRGLIGDVVSRFERVGLKMVAAKMLIVDESLAKKHYLYDDIVFRHSEAVWNSLIKFISNSPVFTF
VVEGVESIEVVRKLCGATEPKLAIPGTIRGDFSYHSFKYSNEKGFSIYNVIHASANEADAMREIPIWFKDNEILNYKRDD
ECEHYYC
>Q2SWE7 2.7.4.6~~~ndk~~~Nucleoside diphosphate kinase~~~
MALERTLSIIKPDAVAKNVIGQIYSRFENAGLKIVAARMAHLSRADAEKFYAVHAERPFFKDLVEFMISGPVMIQVLEGE
DAILKNRDLMGATDPKKAEKGTIRADFADSIDANAVHGSDAPETARVEIAFFFPEMNVYSR
>Q9PIG7 2.7.4.6~~~ndk~~~Nucleoside diphosphate kinase~~~COG0105
MEKTLSIIKPDAVKKGVIGKILDRFESNGLRIAAMKKVQLSKEQAENFYAVHKERPFFKDLVEFMISGPVVVSILEGEGA
VLKNRDLMGATNPKEAKAGTIRADFAESIDANAVHGSDSLENAKIEIEFFFKPNEIC
>B7UGW4 2.7.4.6~~~ndk~~~Nucleoside diphosphate kinase~~~
MAIERTFSIIKPNAVAKNVIGSIFARFETAGFKIVGTKMLHLTVEQARGFYAEHDGKPFFDGLVEFMTSGPIVVSVLEGE
NAVQRHRDLLGATNPANALAGTLRADYADSLTENGTHGSDSVESAAREIAYFFGEGEVCPRTR
>P0A763 2.7.4.6~~~ndk~~~Nucleoside diphosphate kinase~~~COG0105
MAIERTFSIIKPNAVAKNVIGNIFARFEAAGFKIVGTKMLHLTVEQARGFYAEHDGKPFFDGLVEFMTSGPIVVSVLEGE
NAVQRHRDLLGATNPANALAGTLRADYADSLTENGTHGSDSVESAAREIAYFFGEGEVCPRTR
>B5Z9W9 2.7.4.6~~~ndk~~~Nucleoside diphosphate kinase~~~
MKQRTLSIIKPDALKKKVVGKIIDRFESNGLEVIAMKRLHLSVKDAENFYAIHRERPFFKDLIEFMVSGPVVVMVLEGKD
AVAKNRDLMGATDPKLAQKGTIRADFAESIDANAVHGSDSLENAHNEIAFFFAARDL
>O85501 2.7.4.6~~~ndk~~~Nucleoside diphosphate kinase~~~COG0105
MTERTLVLIKPDGVKRQLVGEILSRIERKGLTLAALELKNVSDDLARQHYAEHADKPFFGSLLEFITSGPLVAAIVEGPR
AVAAFRQIAGGTDPVEKAVPGTIRGDFALITQDNLVHGSDSPESAAREIALWFPGEATA
>P9WJH7 2.7.4.6~~~ndkA~~~Nucleoside diphosphate kinase~~~COG0105
MTERTLVLIKPDGIERQLIGEIISRIERKGLTIAALQLRTVSAELASQHYAEHEGKPFFGSLLEFITSGPVVAAIVEGTR
AIAAVRQLAGGTDPVQAAAPGTIRGDFALETQFNLVHGSDSAESAQREIALWFPGA
>P15266 2.7.4.6~~~ndk~~~Nucleoside diphosphate kinase~~~
MAIERTLSIIKPDGLEKGVIGKIISRFEEKGLKPVAIRLQHLSQAQAEGFYAVHKARPFFKDLVQFMISGPVVLMVLEGE
NAVLANRDIMGATNPAQAAEGTIRKDFATSIDKNTVHGSDSLENAKIEIAYFFRETEIHSYPYQK
>B4RMG0 2.7.4.6~~~ndk~~~Nucleoside diphosphate kinase~~~
MAIERTISIIKPDAVGKNVIGKIYSRFEENGLKIVAAKMKQLTLKEAQEFYAVHKDRPFYAGLVEFMTGGPVMIQVLEGE
NAVLKNRELMGATNPTEAAEGTIRADFATSVSINAVHGSDSVENAALEIAYFFSQTEICPR
>Q59636 2.7.4.6~~~ndk~~~Nucleoside diphosphate kinase~~~
MALQRTLSIIKPDAVSKNVIGEILTRFEKAGLRVVAAKMVQLSEREAGGFYAEHKERPFFKDLVSFMTSGPVVVQVLEGE
DAIAKNRELMGATDPKKADAGTIRADFAVSIDENAVHGSDSEASAAREIAYFFAATEVCERIR
>Q5HFV4 2.7.4.6~~~ndk~~~Nucleoside diphosphate kinase~~~
MERTFLMIKPDAVQRNLIGEVISRIERKGLKLVGGKLMQVPMELAETHYGEHQGKPFYNDLISFITSAPVFAMVVEGEDA
VNVSRHIIGSTNPSEASPGSIRGDLGLTVGRNIIHGSDSLESAEREINLWFNENEITSYASPRDAWLYE
>P99068 2.7.4.6~~~ndk~~~Nucleoside diphosphate kinase~~~
MERTFLMIKPDAVQRNLIGEVISRIERKGLKLVGGKLMQVPMELAETHYGEHQGKPFYNDLISFITSAPVFAMVVEGEDA
VNVSRHIIGSTNPSEASPGSIRGDLGLTVGRNIIHGSDSLESAEREINLWFNENEITSYASPRDAWLYE
>P74494 2.7.4.6~~~ndk~~~Nucleoside diphosphate kinase~~~COG0105
MERTFIMIKPDGVQRQLIGEIVGRFEKKGFKLVAMKVMTVSQELAEKHYEALNDKPFFSGLVNFICSSPVVAMVWEGNSI
VSTSRQMIGATDPHAAAPGTIRGDYGVSVGRNIIHGSDAIETAKREISLWFKDEEVNEWDATLNPWLYE
>Q5SLV5 2.7.4.6~~~ndk~~~Nucleoside diphosphate kinase~~~COG0105
MERTFVMIKPDGVRRGLVGEILARFERKGFRIAALKLMQISQELAERHYAEHREKPFFPGLVRFITSGPVVAMVLEGPGV
VAEVRKMMGATHPKDALPGTIRGDFATTIDENVIHGSATLEDAQREIALFFRPEELL
>Q9KTX4 2.7.4.6~~~ndk~~~Nucleoside diphosphate kinase~~~COG0105
MALERTFSIIKPDAVKRNLIGEIYHRIEKAGLQIIAAKMVHLSEEQASGFYAEHEGKPFFEPLKEFMTSGPIMVQVLEGE
NAIARYRELMGKTNPEEAACGTLRADYALSMRYNSVHGSDSPASAAREIEFFFPESEICPRP
>C3LT09 2.7.4.6~~~ndk~~~Nucleoside diphosphate kinase~~~
MALERTFSIIKPDAVKRNLIGEIYHRIEKAGLQIIAAKMVHLSEEQASGFYAEHEGKPFFEPLKEFMTSGPIMVQVLEGE
NAIARYRELMGKTNPEEAACGTLRADYALSMRYNSVHGSDSPASAAREIEFFFPESEICPRP
>Q0QLF2 1.17.1.5~~~ndhL~~~Nicotinate dehydrogenase large molybdopterin subunit~~~
MGKDYQVLGKNKVKVDSLEKVMGTAKFAADYSFPDMLYAGVFRSTVPHARIVSLDLSKARAIDGVEAVLDYHAIPGKNRF
GIIIKDEPCLVDDKVRRYGDAIAVVAAQTPDLVQEALDAITIEYEELEGIFTMERALEEDSPAIHGDTNIHQVKHLEYGD
VDAAFKQCDIVVEDTYSTHRLTHMFIEPDAGVSYYDNEGMLTVVVSTQNPHYDRGEVAGMLALPNSKVRIIQATTGGGFG
GKLDLSVQCHCALLTYHTKKPVKMVRSREESTTVSSKRHPMTMHCKTGATKDGRLQAVQVEMFGDTGAYASYGPAVITRA
TVHCMGPYVVPNVRVDAKFVYTNNPMSGAFRGFGVPQASVCHEGQMNALAKALGMDPIDIRILNAHQVGAKLATGQVLEN
SVGLIETLEKAREKAVEVMGYEKTR
>H9N289 1.14.13.178~~~ndmA~~~Methylxanthine N1-demethylase NdmA~~~
MEQAIINDEREYLRHFWHPVCTVTELEKAHPSSLGPLAVKLLNEQLVVAKLGDEYVAMRDRCAHRSAKLSLGTVSGNRLQ
CPYHGWQYDTHGACQLVPACPNSPIPNKAKVDRFDCEERYGLIWIRLDSSFDCTEIPYFSAANDPRLRIVIQEPYWWDAT
AERRWENFTDFSHFAFIHPGTLFDPNNAEPPIVPMDRFNGQFRFVYDTPEDMAVPNQAPIGSFSYTCSMPFAINLEVSKY
SSSSLHVLFNVSCPVDSHTTKNFLIFAREQSDDSDYLHIAFNDLVFAEDKPVIESQWPKDAPADEVSVVADKVSIQYRKW
LRELKEAHKEGSQAFRSALLDPVIESDRSYI
>H9N290 1.14.13.179~~~ndmB~~~Methylxanthine N3-demethylase NdmB~~~
MKEQLKPLLEDKTYLRHFWHPVCTLNEFERANASGHGPMGVTLLGEKLVLARLNSKIIAAADRCAHRSAQLSIGRVCSNA
GKDYLECPYHGWRYDEAGACQLIPACPDKSISPRAKISSFDCEVKYDIVWVRLDNSFDCTQIPYLSDFDNPDMQVIVADS
YIWETVAERRWENFTDFSHFAFVHPGTLYDPFFASHPTVYVNRVDGELQFKLAPPREMKGIPPEAPMGDFTYRCTMPYSV
NLEIKLWKDDSRFVLWTTASPVDNKSCRNFMIIVREKDNQPDHMHLAFQKRVLDEDQPVIESQWPLEIQTSEVSVATDKI
SVQFRKWHKELSLSAVEGREAFRDSVLTNVIEEEQ
>F0E1K6 1.14.13.128~~~~~~Probable methylxanthine N7-demethylase NdmC~~~
MHAENSFVIDDWYPVGALAETVSGRKYHTRILGTEIWYQLADGTVSAGLADNTAELASKSIYGLLWVSLSDNPRDVIAIP
EFAEADRRVVSAGSIRVATSGLRVIENFLDMAHFPFVHTDILGAEPLTEVAAYDVEIDEAADEIRAVNCRFPQPKGSAAA
SEPVEMQYVYRIARPFIAILYKTCVIDANRLDVLGLFVQPVDQESSIAHTIMCYLDDINTDKQLRDFQQRIFGQDIMILI
NQVPKALPLNPRHETPVRADALSSAYRRWLNDRNVTFGTTRG
>H9N291 1.-.-.-~~~ndmD~~~Oxidoreductase NdmD~~~
MNKLDVNQWFPIATTEDLPKRHVFHATLLGQEMAIWRDDSGSVNAWENRCPHRGLRLTLGANTGNELRCQYHGWTYESGT
GGCTFVPAHRDAPPPNAARVNTFPVREKHGFIWTTLGQPPGEPISILDDAQLVNAVKTNLHSVVIDADIDGVVSVLRQNL
SAFIDVFGAASAEDLHLKSMLQDRGILVTRSGSIAIHFYMQRSTISKCVVHAQVLTPGRPGYELQKNYSYAMNVIRRAAE
AVATDLISITDISDQTIEKLEVVRENMTKAPPTHYICEVVTRTQETGDINSYWLKPIGYPLPAFSPGMHISITTPEGSIR
QYSLVNGPDERESFIIGVKKEIQSRGGSRSMHEDVKVGTQLKVTLPRNGFPLVQTRKHPILVAGGIGITPILCMAQALDQ
QGSSYEIHYFARAFEHVPFQDRLTALGDRLNVHLGLGPDETRAKLPDIMEIHNAQDVDVYTCGPQPMIETVSAVALAHGI
AEESIRFEFFSKKNDVPVSDEEYEVELKKTGQIFTVSPGSTLLQACLDNDVRIEASCEQGVCGTCITPVVSGDLEHHDTY
LSKKERESGKWIMPCVSRCKSKKIVLDL
>Q0QLF1 1.17.1.5~~~ndhM~~~Nicotinate dehydrogenase medium molybdopterin subunit~~~
MKKRGKGVGSMWYGIGNTGLPNPAAAFVEIHGDGSANVMFGAADIGQGSGTAMAQIAAEELGLDYEKIHVTWGDTMVTPD
GGATSASRQTLITGNAVILACRQAKETLAKTAAEKLDCAPEELSFRDNTVFITADPERSMTYGELMAAMKAAGRMAVGAG
SYNPNTTGLAPENMSGIPFEVYSYATTIAEVEVDTETGEVDVLKVVSAHDVGTPINRSMVEGQIEGGVTMGQGFVLMEEI
EVNTKNGAIKNPSMSKYIIPSNRDVPEIHSILVESEGGPGPFGAKGVGEPALIPMIPAVVAAIEDALGTRFTHTPIMPKD
IVAAVKAQEK
>P0A185 ~~~ndoA~~~Naphthalene 1,2-dioxygenase system, ferredoxin component~~~
MTVKWIEAVALSDILEGDVLGVTVEGKELALYEVEGEIYATDNLCTHGSARMSDGYLEGREIECPLHQGRFDVCTGKALC
APVTQNIKTYPVKIENLRVMIDLS
>P0A186 ~~~doxA~~~Naphthalene 1,2-dioxygenase system, ferredoxin component~~~
MTVKWIEAVALSDILEGDVLGVTVEGKELALYEVEGEIYATDNLCTHGSARMSDGYLEGREIECPLHQGRFDVCTGKALC
APVTQNIKTYPVKIENLRVMIDLS
>P0A110 1.14.12.12~~~ndoB~~~Naphthalene 1,2-dioxygenase system, large oxygenase component~~~
MNYNNKILVSESGLSQKHLIHGDEELFQHELKTIFARNWLFLTHDSLIPAPGDYVTAKMGIDEVIVSRQNDGSIRAFLNV
CRHRGKTLVSVEAGNAKGFVCSYHGWGFGSNGELQSVPFEKDLYGESLNKKCLGLKEVARVESFHGFIYGCFDQEAPPLM
DYLGDAAWYLEPMFKHSGGLELVGPPGKVVIKANWKAPAENFVGDAYHVGWTHASSLRSGESIFSSLAGNAALPPEGAGL
QMTSKYGSGMGVLWDGYSGVHSADLVPELMAFGGAKQERLNKEIGDVRARIYRSHLNCTVFPNNSMLTCSGVFKVWNPID
ANTTEVWTYAIVEKDMPEDLKRRLADSVQRTFGPAGFWESDDNDNMETASQNGKKYQSRDSDLLSNLGFGEDVYGDAVYP
GVVGKSAIGETSYRGFYRAYQAHVSSSNWAEFEHASSTWHTELTKTTDR
>P0A111 1.14.12.12~~~doxB~~~Naphthalene 1,2-dioxygenase system, large oxygenase component~~~
MNYNNKILVSESGLSQKHLIHGDEELFQHELKTIFARNWLFLTHDSLIPAPGDYVTAKMGIDEVIVSRQNDGSIRAFLNV
CRHRGKTLVSVEAGNAKGFVCSYHGWGFGSNGELQSVPFEKDLYGESLNKKCLGLKEVARVESFHGFIYGCFDQEAPPLM
DYLGDAAWYLEPMFKHSGGLELVGPPGKVVIKANWKAPAENFVGDAYHVGWTHASSLRSGESIFSSLAGNAALPPEGAGL
QMTSKYGSGMGVLWDGYSGVHSADLVPELMAFGGAKQERLNKEIGDVRARIYRSHLNCTVFPNNSMLTCSGVFKVWNPID
ANTTEVWTYAIVEKDMPEDLKRRLADSVQRTFGPAGFWESDDNDNMETASQNGKKYQSRDSDLLSNLGFGEDVYGDAVYP
GVVGKSAIGETSYRGFYRAYQAHVSSSNWAEFEHASSTWHTELTKTTDR
>O52382 1.14.12.12~~~nagAc~~~Naphthalene 1,2-dioxygenase system, large oxygenase component~~~
MIYENLVSEAGLTQKHLIHGDKELFQHELKTIFARNWLFLTHDSLIPSPGDYVTAKMGVDEVIVSRQNDGSVRAFLNVCR
HRGKTLVHAEAGNAKGFVCSYHGWGFGSNGELQSVPFEKELYGDTIKKKCLGLKEVPRIESFHGFIYGCFDAEAPTLVDY
LGDAAWYLEPIFKHSGGLELVGPPGKVVIKANWKAPAENFVGDAYHVGWTHASSLRSGQSIFTPLAGNAMLPPEGAGLQM
TSKYGSGMGVLWDGYSGVHSADLVPEMMAFGGAKQEKLAKEIGDVRARIYRSHLNCTVFPNNSILTCSGVFKVWNPIDEN
TTEVWTYAIVEKDMPEDLKRRLADAVQRTFGPAGFWESDDNDNMETESQNAKKYQSSNSDLIANLGFGKDVYGDECYPGV
VAKSAIGETSYRGFYRAYQAHISSSNWAEFENTSRNWHTELTKTTDR
>P0A112 ~~~ndoC~~~Naphthalene 1,2-dioxygenase system, small oxygenase component~~~
MMINIQEDKLVSAHDAEEILRFFNCHDSALQQEATTLLTQEAHLLDIQAYRAWLEHCVGSEVQYQVISRELRAASERRYK
LNEAMNVYNENFQQLKVRVEHQLDPQNWGNSPKLRFTRFITNVQAAMDVNDKELLHIRSNVILHRARRGNQVDVFYAARE
DKWKRGEGGVRKLVQRFVDYPERILQTHNLMVFL
>P0A113 ~~~doxD~~~Naphthalene 1,2-dioxygenase system, small oxygenase component~~~
MMINIQEDKLVSAHDAEEILRFFNCHDSALQQEATTLLTQEAHLLDIQAYRAWLEHCVGSEVQYQVISRELRAASERRYK
LNEAMNVYNENFQQLKVRVEHQLDPQNWGNSPKLRFTRFITNVQAAMDVNDKELLHIRSNVILHRARRGNQVDVFYAARE
DKWKRGEGGVRKLVQRFVDYPERILQTHNLMVFL
>O52383 ~~~nagAd~~~Naphthalene 1,2-dioxygenase system, small oxygenase component~~~
MMINTQEDKLVSAHDAEEFHRFFVGHDSDLQQEVTTLLTREAHLLDIQAYNAWLEHCVAPEIKYQVISREFRSTSERRYQ
LNDAVNIYNENYQHLKVRVEHQMDPQNWANSPKIRFTRFVTNVTAAKDKIVPDLLHVRSNLILHRARRGNQVDVFYATRE
DKWKRIEGGGIQLVERLVDYPERILQTHNLMTFL
>Q52126 1.18.1.7~~~ndoR~~~Naphthalene 1,2-dioxygenase system ferredoxin--NAD(P)(+), reductase component~~~
MELLIQPNNRIIPFSAGANLLEVLRENGVAISYSCLSGRCGTCRCRVIDGSVIDSGAENGQSNLTDKQYVLACQSVLTGN
CAIEVPEADEIVTHPARIIKGTVVAVESPTHDIRRLRVRLSKPFEFSPGQYATLQFSPEHARPYSMAGLPDDQEMEFHIR
KVPGGRVTEYVFEHVREGTSIKLSGPLGTAYLRQKHTGPMLCVGGGTGLAPVLSIVRGALKSGMTNPILLYFGVRSQQDL
YDAERLHKLAADHPQLTVHTVIATGPINEGQRAGLITDVIEKDILSLAGWRAYLCGAPAMVEALCTVTKHLGISPEHIYA
DAFYPGGI
>P33920 ~~~yejK~~~Nucleoid-associated protein YejK~~~COG3081
MSLDINQIALHQLIKRDEQNLELVLRDSLLEPTETVVEMVAELHRVYSAKNKAYGLFSEESELAQTLRLQRQGEEDFLAF
SRAATGRLRDELAKYPFADGGFVLFCHYRYLAVEYLLVAVLSNLSSMRVNENLDINPTHYLDINHADIVARIDLTEWETN
PESTRYLTFLKGRVGRKVADFFMDFLGASEGLNAKAQNRGLLQAVDDFTAEAQLDKAERQNVRQQVYSYCNEQLQAGEEI
ELKSLSKELAGVSEVSFTEFAAEKGYELEESFPADRSTLRQLTKFAGSGGGLTINFDAMLLGERIFWDPATDTLTIKGTP
PNLRDQLQRRTSGGN
>Q65JC2 2.4.1.384~~~yjiC~~~NDP-glycosyltransferase YjiC~~~COG1819
MGHKHIAIFNIPAHGHINPTLALTASLVKRGYRVTYPVTDEFVKAVEETGAEPLNYRSTLNIDPQQIRELMKNKKDMSQA
PLMFIKEMEEVLPQLEALYENDKPDLILFDFMAMAGKLLAEKFGIEAVRLCSTYAQNEHFTFRSISEEFKIELTPEQEDA
LKNSNLPSFNFEDMFEPAKLNIVFMPRAFQPYGETFDERFSFVGPSLAKRKFQEKETPIISDSGRPVMLISLGTAFNAWP
EFYHMCIEAFRDTKWQVIMAVGTTIDPESFDDIPENFSIHQRVPQLEILKKAELFITHGGMNSTMEGLNAGVPLVAVPQM
PEQEITARRVEELGLGKHLQPEDTTAASLREAVSQTDGDPHVLKRIQDMQKHIKQAGGAEKAADEIEAFLAPAGVK
>O34539 2.4.1.384~~~yjiC~~~NDP-glycosyltransferase YjiC~~~COG1819
MKKYHISMINIPAYGHVNPTLALVEKLCEKGHRVTYATTEEFAPAVQQAGGEALIYHTSLNIDPKQIREMMEKNDAPLSL
LKESLSILPQLEELYKDDQPDLIIYDFVALAGKLFAEKLNVPVIKLCSSYAQNESFQLGNEDMLKKIREAEAEFKAYLEQ
EKLPAVSFEQLAVPEALNIVFMPKSFQIQHETFDDRFCFVGPSLGERKEKESLLIDKDDRPLMLISLGTAFNAWPEFYKM
CIKAFRDSSWQVIMSVGKTIDPESLEDIPANFTIRQSVPQLEVLEKADLFISHGGMNSTMEAMNAGVPLVVIPQMYEQEL
TANRVDELGLGVYLPKEEVTVSSLQEAVQAVSSDQELLSRVKNMQKDVKEAGGAERAAAEIEAFMKKSAVPQ
>A0R2N3 2.3.1.286~~~sir2~~~NAD-dependent protein deacylase Sir2~~~COG0846
MQVTVLSGAGISAESGVPTFRDAETGLWAQVDPYEISSTDGWQRNPEKVWAWYLWRHYMMARVAPNEAHRTVAAWEDHLD
VRVVTQNIDDLHERAGSTNVYHLHGSLFEFRCDACGSAFEGNLPEMPEPVETIDPPVCPCSGLIRPSVVWFGEPLPDAAW
NRSVLAVSSADVVIVVGTSSIVYPAAGLPEAALAAGKPVIEVNPERTPLSDSATVSLRETASEALPTLLQRLPELLNRSA
>Q0QLF3 1.17.1.5~~~ndhS~~~Nicotinate dehydrogenase small FeS subunit~~~
MNKITINLNLNGEARSIVTEPNKRLLDLLREDFGLTSVKEGCSEGECGACTVIFNGDPVTTCCMLAGQADESTIITLEGV
AEDGKPSLLQQCFLEAGAVQCGYCTPGMILTAKALLDKNPDPTDEEITVAMSGNLCRCTGYIKIHAAVRYAVERCAN
>Q2YQ73 7.5.2.3~~~ndvA~~~Beta-(1-->2)glucan export ATP-binding/permease protein NdvA~~~
MSLLKIYWRAMQYLAVERTATITMCVASVLVALVTLAEPVLFGRVIQSISDKGDIFSPLLMWAALGGFNIMAAVFVARGA
DRLAHRRRLGVMIDSYERLITMPLAWHQKRGTSNALHTLIRATDSLFTLWLEFMRQHLTTVVALATLIPVAMTMDMRMSL
VLIVLGVIYVMIGQLVMRKTKDGQAAVEKHHHKLFEHVSDTISNVSVVQSYNRIASETQALRDYAKNLENAQFPVLNWWA
LASGLNRMASTFSMVVVLVLGAYFVTKGQMRVGDVIAFIGFAQLMIGRLDQISAFINQTVTARAKLEEFFQMEDATADRQ
EPENVADLNDVKGDIVFDNVTYEFPNSGQGVYDVSFEVKPGQTVAIVGPTGAGKTTLINLLQRVFDPAAGRIMIDGTDTR
TVSRRSLRHAIATVFQDAGLFNRSVEDNIRVGRANATHEEVHAAAKAAAAHDFILAKSEGYDTFVGERGSQLSGGERQRL
AIARAILKDSPILVLDEATSALDVETEEKVTQAVDELSHNRTTFIIAHRLSTVRSADLVLFMDKGHLVESGSFNELAERG
GRFSDLLRAGGLKLEDKQPKQPVVEGSNVMPFPVKGAVA
>P18767 7.5.2.3~~~ndvA~~~Beta-(1-->2)glucan export ATP-binding/permease protein NdvA~~~COG1132
MSLFQVYARALQYLAVHKFRVGAIVIANIVLAAITIAEPILFGRIIDAISSQKDVAPMLLLWAGFGVFNTIAFVLVSREA
DRLAHGRRASLLTEAFGRIVSMPLSWHSQRGTSNALHTLLRACETLFGLWLEFMRQHLATAVALMLLIPTAFAMDVRLSL
ILVVLGAAYVMISKVVMSRTKEGQAAVEGHYHTVFSHVSDSISNVSVVHSYNRIEAETRELKKFTQRLLSAQYPVLDWWA
LASGLNRIASTISMMAILVIGTVLVQRGELGVGEVIAFIGFANLLIGRLDQMKAFATQIFEARAKLEDFFQLEDSVQDRE
EPADAGELKGVVGEVEFRDISFDFANSAQGVRNVSFKAKAGQTIAIVGPTGAGKTTLVNLLQRVHEPKHGQILIDGVDIA
TVTRKSLRRSIATVFQDAGLMNRSIGENIRLGREDASLDEVMAAAEAAAASDFIEDRLNGYDTVVGERGNRLSGGERQRV
AIARAILKNAPILVLDEATSALDVETEARVKDAIDALRKDRTTFIIAHRLSTVREADLVIFMDQGRVVEMGGFHELSQSN
GRFAALLRASGILTDEDVRKSLTAA
>P0A2V1 7.5.2.3~~~ndvA~~~Beta-(1-->2)glucan export ATP-binding/permease protein NdvA~~~COG1132
MTLFQVYTRALRYLTVHKWRVAVVVIANVILAAITIAEPVLFGRIIDAISSGTNVTPILILWAGFGVFNTVAYVAVAREA
DRLAHGRRATLLTEAFGRIISMPLSWHHLRGTSNALHTLLRASETLFGLWLEFMRTHLATFVALVLLIPTAMAMDLRLSF
VLIGLGIVYWFIGKWVMGRTKDGQASVEEHYHSVFAHVSDSISNVSVLHSYNRIEAETKALKSFTEKLLSAQYPVLDWWA
FASALNRTASTVSMMIILVIGTVLVKNGELRVGDVIAFIGFANLLIGRLDQMRQFVTQIFEARAKLEDFFVLEDAVKERE
EPGDARELSNVSGTVEFRNINFGFANTKQGVHDVSFTAKAGETVAIVGPTGAGKTTLINLLQRVYDPDSGQILIDGTDIS
TVTKNSLRNSIATVFQDAGLLNRSIRENIRLGRETATDAEVVEAAAAAAATDFIDSRINGYLTQVGERGNRLSGGERQRI
AIARAILKNAPILVLDEATSALDVETEARVKAAVDALRKNRTTFIIAHRLSTVRDADLVLFLDQGRIIEKGTFDELTQRG
GRFTSLLRTSGLLTEDEGQQPRPKAIAS
>P20471 2.4.1.-~~~ndvB~~~Cyclic beta-(1,2)-glucan synthase NdvB~~~COG3459
MLQNTTQSNLPREPEAKQIDYNDSIRSTYFSIDDLRACGASLAEKGTSALPGFFPFEFRARHRENEKEILRVYRATAADV
EAGASITPAAEWLLDNHHVVEEAIQEVRRDFPRRFYRQLPTLSVSGTVIPRTMALAWLYVAHTHSTVTRESITAMVEGFQ
EHETLKIGELWALPSILRFVLIENLRRIAIRVERSRGMRRKANEVADQLIRLNDPEGCRTLLVESEALAADNTFIAQLLY
RMRDGSQSSGAVIAWIEERLERRGTDVEEALVAEQNRLSSGNATMSNIIRSLREIDDTDWAVWFESVSKIDATLREGSDY
AALDFGSRNTYRDTIEKLARRSGHSEHEVTEIAIEMVEEAKAAAAVEAPLQEPNVGSFLVGKQRLALEKRIGYSPSIFQH
LIRSVRKLDWFAIAGPNILLTILAMIVVYAFVSPMDIPSGAKLIMLLLFALPASEGAMGLFNTVFTLFAKPSRLVGYEFL
DGIPEDARTLVVVPCLIAKRDHVDELVRNLEVHYLANPRGEIYFALLSDWADSKSEEAPADTDVLEYAKREIASLSARYA
YDGKTRFFLLHRRRLYNEAEGVWMGWERKRGKLHELNLLLRGDRDTSFLQGANMVPEGVQYVMTLDSDTRLMRDAVTKLV
GKLYHPINRPVVNPRTQEVVTGYSLLQPRVTPSLTTGSEASAFQRIFTINRGIDPYVFTVSDVYQDIAGEGSFTGKGLYH
VDAFEAALKSRIEENAVLSHDLLEGSYARCALVTDIELVEDFPIRYEVEMSRQHRWARGDWQLLPYIFNPKNGLSMLGRW
KMYDNLRRSLIPVAWLAASVMGWYYMEPTPALIWQLVLIFSLFVAPTLSLISGIMPRRNDIVARAHLHTVLSDIRAANAQ
VALRIVFIAHNAAMMADAIVRSLYRTFVSRKLMLEWRTAAQVQSAGHGSIGDYFRAMWTAPALALVSLALAAISDTGLPF
IGLPFALIWAASPAVAWFVSQSAETEDQLVVSEEAIEEMRKIARRTWRYFEAFVTAEQNFLPPDNFQETPQPVLAERTSP
TNIGVYLLSVMSARSFGWIGFEETITRLEQTIATIDRMPKYRGHLFNWYRTRGLEPMEPRYVSSVDSGNLAGHLIAVSSM
CREWAEAPSAHVQGNLDGIGDVAAILKEALNELPDDRKTVRPLRRLVEERIAGFQNALAAVKRERELASIRVINLAVLAR
DMHKLTVNLDHEVRTVQSGEVATWAGSLVAACEAHIADGVFDLGAIEALRQRLLVLKERARDIAFSMDFSFLFRPERRLL
SIGYRVNANELDEACYDLLASEARLTSLFAIAKGDLPTEHWYKLGRPIVPIGARGALVSWSGSMFEYLMPPLVMQERQGG
ILNQTNNLVVQEQINHGRRLGTPWGISEAAFNARDHELTYQYTNFGVPTLGLKRGLGQNAVIAPYASILACMYDPKSALA
NLARLREVGALGAYGYHDAVDFTPTRVPEGQKCAVVRNYYAHHHGMSVAAVANVVFNGQLREWFHADPVIEAAELLLQEK
APRDIPVMAAKREPEALGKGQADLLRPEVRVVEDPINQDRETVLLSNGHYSVMLTATGAGYARWNGQSVTRWTPDPVEDR
TGTFIFLRDTVTGDWWSATAEPRRAPGEKTVTRFGDDKAEFVKTVGDLTSEVECIVATEHDAEGRRVILLNTGTEDRFIE
VTSYAEPVLAMDDADSSHPTFSKMFLRTEISRHGDVIWVSRNKRSPGDPDIEVAHLVTDNAGSERHTQAETDRRRFLGQG
RTLAEAAAFDPGATLSGTDGFTLDPIVSLRRVVRVPAGKKVSVIFWTIAAPDREGVDRAIDRYRHPETFNHELIHAWTRS
QVQMRHVGITSKEAASFQMLGRYLVYPDMHLRADAETVKTGLASQSALWPLAISGDFPIFCLRINDDGDLGIAREALRAQ
EYLRARGITADLVVVNERASSYAQDLQHTLDSMCENLRLRGLSDGPRQHIFAVRRDLMEPETWSTLISASRAVFHARNGT
ISDQIARATSLYSKSSEKKEEGAEMLLPVIREADARTAVELDGGDLDFWNGFGGFAEDGREYAVRLRGGEATPQPWINVI
SNEQFGFHVSAEGAAFSWSRNSRDYQLTPWTNDAVVNRPGEAIFVRDMASGAVLTPYAALSRRKSALFETRHGLGYSRFL
STQDELEIEAMHTVHRTLPAKLVRLTIRNRSSAARKLRVYGYAEWVLGNNRSRTAPFVLSEWDESAKTLVATNPYSIDYP
GRCAFFASDGDIAGYTASRREFLGRAGGILAPQAVISGAELTGSTDVDGDACAALATDITVEAGVERQVTFFLGDADNPD
QVRAVLEELRADSFGAALEAAKAFWGDFTGVVKVETPDRAFNHMINHWLPYQALGCRIMARSAFYQASGAFGFRDQLQDT
LAFLIHRPALARAQILNAAARQFVEGDVQHWWLPGTDAGVRTMISDDVVWLAHAVAHYCAVTGEEDILKEKVPFITGPAL
EEGQHDSFYKPDVADEVGDVYEHCARALDLAIHRTGANGLPLILGGDWNDGMNRVGEAGEGTSVWLGWFLAGTLRAFLPY
ARARKDKPRVALWERHLEALKDALEQAGWDGDYYRRGYYDDDTPLGSAENGECRIDSIAQSWSTLSGEGDKERSLRAMDA
VMAELVDPEKRIVRLFTPPLETTKQDPGYIKAYPPGVRENGGQYTHAATWVVLAFAAQERAEEAWRTFRMLNPVSHALSQ
VDAEHYRVEPYVVAADIYGEGALAGRGGWTWYTGSAGWLYRAGVEGILGIRKRGDKLLIRPVLPSEWPGYSAEVRVNGTT
HRISVSRDSKSGEPVVSVNNSVTKNAHEGVLL
>Q75UV1 3.6.1.61~~~ndx1~~~Diadenosine hexaphosphate hydrolase~~~
MELGAGGVVFNAKREVLLLRDRMGFWVFPKGHPEPGESLEEAAVREVWEETGVRAEVLLPLYPTRYVNPKGVEREVHWFL
MRGEGAPRLEEGMTGAGWFSPEEARALLAFPEDLGLLEVALERLPL
>P77258 1.3.1.-~~~nemA~~~N-ethylmaleimide reductase~~~COG1902
MSSEKLYSPLKVGAITAANRIFMAPLTRLRSIEPGDIPTPLMAEYYRQRASAGLIISEATQISAQAKGYAGAPGIHSPEQ
IAAWKKITAGVHAENGHMAVQLWHTGRISHASLQPGGQAPVAPSALSAGTRTSLRDENGQAIRVETSMPRALELEEIPGI
VNDFRQAIANAREAGFDLVELHSAHGYLLHQFLSPSSNHRTDQYGGSVENRARLVLEVVDAGIEEWGADRIGIRVSPIGT
FQNTDNGPNEEADALYLIEQLGKRGIAYLHMSEPDWAGGEPYTDAFREKVRARFHGPIIGAGAYTVEKAETLIGKGLIDA
VAFGRDWIANPDLVARLQRKAELNPQRAESFYGGGAEGYTDYPTL
>P67430 ~~~nemR~~~HTH-type transcriptional repressor NemR~~~COG1309
MNKHTEHDTREHLLATGEQLCLQRGFTGMGLSELLKTAEVPKGSFYHYFRSKEAFGVAMLERHYAAYHQRLTELLQSGEG
NYRDRILAYYQQTLNQFCQHGTISGCLTVKLSAEVCDLSEDMRSAMDKGARGVIALLSQALENGRENHCLTFCGEPLQQA
QVLYALWLGANLQAKISRSFEPLENALAHVKNIIATPAV
>Q53U18 2.4.1.283~~~neoD~~~2-deoxystreptamine N-acetyl-D-glucosaminyltransferase~~~
MRVLRLTPFFHHDCVTAWPAEFDAVGGMQVQILRLSRELADRGVEQLVMTVGFPGLPRERVDRPGLRVRVTRAPLPRLRS
ELTGLVGLNQAWLAAVLTACAPLRRTWRPDLVHVHADGQLWALLAGPLVSRLVGAPYCLTLHCSRLASYEPMSRFDRLQH
RLVAAAERYALRRARRVSTLTSRTADTVARLLPLDRALVDVLPDSVGDVRPVARPEAEEYVRSLGVPAGRPVVGWVGRVA
HEKGWRDFVAMAERWDAGSGAPGAVFAVVGDGPQRERMREAVEAAGLADRFVFTGFLPHDAVPSVMTALDVLVMPSAHEE
LGGSALEAMVCGTPVAGYAVGGLRDTVGSVTPSLLVPRGDVAALTRAAGDAVTDAERHRKTVAAAVPDLLGRYGADTVER
ALEHYRLAVGRASGGGAGWAP
>Q53U14 5.1.3.-~~~neoN~~~Neomycin C epimerase~~~
MTTDIVWPPPVRQVRAYRNIVVDGACNIRCTYCEVKKTKVDQPATIRSLDRIFAEYEPDAVLFRVESDGEITLYPKIVDH
LQKRAAEGYRVEVLSNGTKLPRALEGRPDLLWVFSVDGHTEAMNAKRGLKQPQIDRILDAAVELGAELQTVYWGQPVEEV
NAYIDLLESRGYRGLLHFMPLLAFKGRPLTVNLRYQDLHPADFLAPPEYFRRWNHIFETGRRDAVCDQITNGYNYQVSGD
EIRMVKCDCYSVPKHLVHGFGPIREFDDWPCGTCIANQEFNNSRERMRVPQGRIPLPLV
>Q53U15 1.1.3.43~~~neoG~~~Paromamine 6'-oxidase~~~
MKRLRGTLPSDARHAWHPEPLGPAHRDGWDTRDDDRVWDVVVIGSGASGSVAADRLVRQGLDVLMIEEGFRLSPDLGNPE
LDDMCRTALARDGQGGWTDEGWPWTTSNLGGGTVFYGGASWRYRPFDFDPSELVDAGGLDVRWPYGLAELAPYYDVLERR
LGVCGGEEGEGSRGPAHPPTAAAEVLYEAGTALGYEPFPTPLAINRHAHGGRSACERNSLCVSHQCSTGAKGDAVAVFLA
PLAAHPNFTLRTGVRALRLNQDRPDAVGSVTCLDRLGRTTHRVRARSFVVACNAIQSAALLLRSRGGRAPDGVGNHSGLV
GRGLTMKLSEYVSGVVDAPSAATLADWRAHAGPFSTIAFLDHYLDADCPTGVGGMIYESKNDMPSRIRDDVLELRIETIL
ADHPNLDNRVRLSSHADEDGVPAVVIDYTPDPRDLRRLAYMTDVCERLLRKAGATGIAHEESGFAQGSCHLHGTCRAGDD
PATSVVDGWGRVHSAPNVYVVDGGFMPYPGGLNPTLTIQAHALRSAKAVAGDLVSRHTAHV
>Q53U11 2.4.1.285~~~neoK~~~UDP-GlcNAc:ribostamycin N-acetylglucosaminyltransferase~~~
MAEAPAGRALFEIYDEGFDSPSWGGVETALWHLSRSLREAGTEAEFYRSSEGADLDALAARVERDRVDAVFPLVESDLFE
GAAWRRLPALHARTVRVWHDVSRLSADLSAPPPCPVHARVPALPGAPVAEGCPARGAHPEGPMREVFLGEWPWTRCFPRR
SVIPWAADHVPAKDLCDPSGPVVLQLGKIDTVDAERCLRRLTGAGVALRVVFATWSRRGREARELVRAHQGAGRRVEVLD
AYDIRTDWERVFGGASLFLLPSVFHETYNFAAAEAVQLGVPVAALGEGGNLPRFASLTAPTPDALVDRLLAGGGAVAPRP
RPAAGWRDVAARYAEVIREHPAAGAGPAVPAGAGEGRGGREEEHGG
>Q53U10 3.5.1.112~~~neoL~~~2'-N-acetylparomamine deacetylase~~~
MGEPTWEAAEDPDRTLRERLRRGRTLLVSPHPDDVAYSCGGLLAAVGRPAHATLLTVFTRSAWALPRRLRRAGARVVSER
RREEELRYCRLRGLAEYRPLGFADAGLRGYDDETELSSPAEADGVRGAVEEAVAEAIRDAGADTVLAPAAVGGHVDHLLV
HGAVRGAVGPGGPLTLFYEDLPYAGQRDAVDVERTLREARGLVPFASVDISGVVQQKVRGMYVYGSQTDDECVRETLRHA
RRGAPRRWTGGTAGAGHAAGRRGAPHTERVWTPAPAGAR
>Q53U08 2.6.1.93~~~neoN~~~Neamine transaminase NeoN~~~
MTKNSSLLAEFPTCPRDEKDRPRVFTAASGAWLTDESGFRWIDFDNARGSILLGHGDPVVAEAVARAATGADGTATGWSR
RVDAVLERLHALCGGEVVGLFRSGTAAVRAAVLAVREATGRPLLLSAGYHGYDPMWYPSEAPLEPNADGVVDFFFDLGLL
RELLRAPERVAAVVVSPDHMHLSPGWYRELRRLCSAAGVVLVADEVKVGLRYAPGLSTAELLAPDVWVVAKGMANGHAVS
AVGGSRRLLKPLKEVSFTSFFEPTILAAADAALARVATGEPQRAVREAGDRFLRHARKALDDASLPVEIAGDGTFFQFVP
ATEELEEALYGAANAEGLLFYAGDNQGVSAAFDEAVLGEAERRFARVCERLAPYAGGEPVGDAARYRVAWNVMDGLRQAP
RDREETTGLLARLLDD
>Q8GAI5 ~~~nepA~~~Nicotine metabolites export pump subunit NepA~~~
MQTRYGRRALTIWPLLLLAIAAEVAATSLLPQTNGFRKLKPTVAVACLYTVAFALLAQILKFTDIGIAYALWAGLGTASV
AVIGVLFRNERFSWKHAIGLALVVTGVVTLNLQAGQ
>Q8GAI6 ~~~nepB~~~Nicotine metabolites export pump subunit NepB~~~
MSSYARRTPVRTVLNFCTAIRQIITGEAGSVAADKGNRGQKRAPLLHRHRLHAWLYLGSAITTEVTGTVILDFSEGFQLP
AQTTAAMALYAFSFFLLTRALRAVPLSVAYATWSGLGTVAVAFAGAIIHGEAVTLGRITAITAVIGGIVILNLATTRQHS
ARRKDV
>P0ADL1 ~~~nepI~~~Purine ribonucleoside efflux pump NepI~~~COG2814
MSEFIAENRGADAITRPNWSAVFSVAFCVACLIIVEFLPVSLLTPMAQDLGISEGVAGQSVTVTAFVAMFASLFITQTIQ
ATDRRYVVILFAVLLTLSCLLVSFANSFSLLLIGRACLGLALGGFWAMSASLTMRLVPPRTVPKALSVIFGAVSIALVIA
APLGSFLGELIGWRNVFNAAAVMGVLCIFWIIKSLPSLPGEPSHQKQNTFRLLQRPGVMAGMIAIFMSFAGQFAFFTYIR
PVYMNLAGFGVDGLTLVLLSFGIASFIGTSLSSFILKRSVKLALAGAPLILAVSALVLTLWGSDKIVATGVAIIWGLTFA
LVPVGWSTWITRSLADQAEKAGSIQVAVIQLANTCGAAIGGYALDNIGLTSPLMLSGTLMLLTALLVTAKVKMKKS
>Q8XGS2 ~~~nepI~~~Purine ribonucleoside efflux pump NepI~~~COG2814
MNENIAEKFRADGVARPNWSAVFAVAFCVACLITVEFLPVSLLTPMAQDLGISEGVAGQSVTVTAFVAMFSSLFITQIIQ
ATDRRYIVILFAVLLTASCLMVSFANSFTLLLLGRACLGLALGGFWAMSASLTMRLVPARTVPKALSVIFGAVSIALVIA
APLGSFLGGIIGWRNVFNAAAVMGVLCVIWVVKSLPSLPGEPSHQKQNMFSLLQRPGVMAGMIAIFMSFAGQFAFFTYIR
PVYMNLAGFDVDGLTLVLLSFGIASFVGTSFSSYVLKRSVKLALAGAPLLLALSALTLIVWGSDKTVAAAIAIIWGLAFA
LVPVGWSTWITRSLADQAEKAGSIQVAVIQLANTCGAAVGGYALDNFGLLSPLALSGGLMLLTALVVAAKVRITPMS
>Q60053 3.2.1.135~~~tvaI~~~Neopullulanase 1~~~
MIKLLKPMSLSILLVFILSFSFPFPTAKAAANDNNVEWNGLFHDQGPLFDNAPEPTSTQSVTLKLRTFKGDITSANIKYW
DTADNAFHWVPMVWDSNDPTGTFDYWKGTIPASPSIKYYRFQINDGTSTAWYNGNGPSSTEPNADDFYIIPNFKTPDWLK
NGVMYQIFPDRFYNGDSSNDVQTGSYTYNGTPTEKKAWGSSVYADPGYDNSLVFFGGDLAGIDQKLGYIKKTLGANILYL
NPIFKAPTNHKYDTQDYMAVDPAFGDNSTLQTLINDIHSTANGPKGYLILDGVFNHTGDSHPWFDKYNNFSSQGAYESQS
SPWYNYYTFYTWPDSYASFLGFNSLPKLNYGNSGSAVRGVIYNNSNSVAKTYLNPPYSVDGWRLDAAQYVDANGNNGSDV
TNHQIWSEFRNAVKGVNSNAAIIGEYWGNANPWTAQGNQWDAATNFDGFTQPVSEWITGKDYQNNSASISTTQFDSWLRG
TRANYPTNVQQSMMNFLSNHDITRFATRSGGDLWKTYLALIFQMTYVGTPTIYYGDEYGMQGGADPDNRRSFDWSQATPS
NSAVALTQKLITIRNQYPALRTGSFMTLITDDTNKIYSYGRFDNVNRIAVVLNNDSVSHTVNVPVWQLSMPNGSTVTDKI
TGHSYTVQNGMVTVAVDGHYGAVLAQ
>Q08751 3.2.1.135~~~tvaII~~~Neopullulanase 2~~~
MLLEAIFHEAKGSYAYPISETQLRVRLRAKKGDVVRCEVLYADRYASPEEELAHALAGKAGSDERFDYFEALLECSTKRV
KYVFLLTGPQGEAVYFGETGFSAERSKAGVFQYAYIHRSEVFTTPEWAKEAVIYQIFPERFANGDPSNDPPGTEQWAKDA
RPRHDSFYGGDLKGVIDRLPYLEELGVTALYFTPIFASPSHHKYDTADYLAIDPQFGDLPTFRRLVDEAHRRGIKIILDA
VFNHAGDQFFAFRDVLQKGEQSRYKDWFFIEDFPVSKTSRTNYETFAVQVPAMPKLRTENPEVKEYLFDVARFWMEQGID
GWRLDVANEVDHAFWREFRRLVKSLNPDALIVGEIWHDASGWLMGDQFDSVMNYLFRESVIRFFATGEIHAERFDAELTR
ARMLYPEQAAQGLWNLLDSHDTERFLTSCGGNEAKFRLAVLFQMTYLGTPLIYYGDEIGMAGATDPDCRRPMIWEEKEQN
RGLFEFYKELIRLRHRLASLTRGNVRSWHADKQANLYAFVRTVQDQHVGVVLNNRGEKQTVLLQVPESGGKTWLDCLTGE
EVHGKQGQLKLTLRPYQGMILWNGR
>P38940 3.2.1.135~~~nplT~~~Neopullulanase~~~
MRKEAIYHRPADNFAYAYDSETLHLRLRTKKDDIDRVELLHGDPYDWQNGAWQFQMMPMRKTGSDELFDYWFAEVKPPYR
RLRYGFVLYSGEEKLVYTEKGFYFEVPTDDTAYYFCFPFLHRVDLFEAPDWVKDTVWYQIFPERFANGNPSISPEGSRPW
GSEDPTPTSFFGGDLQGIIDHLDYLVDLGITGIYLTPIFRSPSNHKYDTADYFEVDPHFGDKETLKTLIDRCHEKGIRVM
LDAVFNHCGYEFAPFQDVWKNGESSKYKDWFHIHEFPLQTEPRPNYDTFRFVPQMPKLNTANPEVKRYLLDVATYWIREF
DIDGWRLDVANEIDHEFWREFRQEVKALKPDVYILGEIWHDAMPWLRGDQFDAVMNYPFTDGVLRFFAKEEISARQFANQ
MMHVLHSYPNNVNEAAFNLLGSHDTSRILTVCGGDIRKVKLLFLFQLTFTGSPCIYYGDEIGMTGGNDPECRKCMVWDPM
QQNKELHQHVKQLIALRKQYRSLRRGEISFLHADDEMNYLIYKKTDGDETVLVIINRSDQKADIPIPLDARGTWLVNLLT
GERFAAEAETLCTSLPPYGFVLYAIEHW
>Q0P8S7 2.7.7.82~~~legF~~~CMP-N,N'-diacetyllegionaminic acid synthase~~~COG1083
MAEILCTICARGGSKGVKNKNIRKINKLEMIAYSIIQAQNSKLFKHIVISTDSDEIASVAQKYGAEVFFKREAHLANDRT
AKLPVMRDALLRSEEHFKTCFETLIDLDASAPLRSSLDIKKAYESFVENDNSNLITAVPARRNPYFNLVEIQNNKVVKSK
EGNFTTRQSAPKCYDMNASIYIFKRDYLLENDSVFGKNTGLFVMDESTAFDIDSELDFKIVEFLISLKNLSPKDF
>Q5ZXI0 2.7.7.82~~~neuA~~~CMP-N,N'-diacetyllegionaminic acid synthase~~~COG1083
MRILAVIPARAGSKRLPGKNTRLLAGKPLIAHTIVAALQSSCCEEIVVSTDSKQIADVAVQYGASVPWLRSEDLATDTSD
VIHTVIDLLFKFQQMDVFFDSVLLLQPTSPFRKPETIRHAVEIHKVTGKSVVSVSPISLKPSWCRSIDSQGNLVKPELFQ
DLEIYCNENPIYKLNGSIYIATAKQIIENKSFYSEPTKPLLLNSISESIDIDTPIDWALTEKLMELNQEALV
>P13266 2.7.7.43~~~neuA~~~N-acylneuraminate cytidylyltransferase~~~
MRTKIIAIIPARSGSKGLRNKNALMLIDKPLLAYTIEAALQSEMFEKVIVTTDSEQYGAIAESYGADFLLRPEELATDKA
SSFEFIKHALSIYTDYESFALLQPTSPFRDSTHIIEAVKLYQTLEKYQCVVSVTRSNKPSQIIRPLDDYSTLSFFDLDYS
KYNRNSIVEYHPNGAIFIANKQHYLHTKHFFGRYSLAYIMDKESSLDIDDRMDFELAITIQQKKNRQKIDLYQNIHNRIN
EKRNEFDSVSDITLIGHSLFDYWDVKKINDIEVNNLGIAGINSKEYYEYIIEKELIVNFGEFVFIFFGTNDIVVSDWKKE
DTLWYLKKTCQYIKKKNAASKIYLLSVPPVFGRIDRDNRIINDLNSYLRENVDFAKFISLDHVLKDSYGNLNKMYTYDGL
HFNSNGYTVLENEIAEIVK
>P0A0Z7 2.7.7.43~~~neuA~~~N-acylneuraminate cytidylyltransferase~~~
MEKQNIAVILARQNSKGLPLKNLRKMNGISLLGHTINAAISSKCFDRIIVSTDGGLIAEEAKNFGVEVVLRPAELASDTA
SSISGVIHALETIGSNSGTVTLLQPTSPLRTGAHIREAFSLFDEKIKGSVVSACPMEHHPLKTLLQINNGEYAPMRHLSD
LEQPRQQLPQAFRPNGAIYINDTASLIANNCFFIAPTKLYIMSHQDSIDIDTELDLQQAENILNHKES
>P0A0Z8 2.7.7.43~~~neuA~~~N-acylneuraminate cytidylyltransferase~~~
MEKQNIAVILARQNSKGLPLKNLRKMNGISLLGHTINAAISSKCFDRIIVSTDGGLIAEEAKNFGVEVVLRPAELASDTA
SSISGVIHALETIGSNSGTVTLLQPTSPLRTGAHIREAFSLFDEKIKGSVVSACPMEHHPLKTLLQINNGEYAPMRHLSD
LEQPRQQLPQAFRPNGAIYINDTASLIANNCFFIAPTKLYIMSHQDSIDIDTELDLQQAENILNHKES
>P0A4V0 2.7.7.43~~~neuA~~~N-acylneuraminate cytidylyltransferase~~~COG1083
MKPICIIPARSGSKGLPDKNMLFLAGKPMIFHTIDAAIESGMFDKKDIFVSTDSELYREICLERGISVVMRKPELSTDQA
TSYDMLKDFLSDYEDNQEFVLLQVTSPLRKSWHIKEAMEYYSSHDVDNVVSFSEVEKHPGLFTTLSDKGYAIDMVGADKG
YRRQDLQPLYYPNGAIFISNKETYLREKSFFTSRTYAYQMAKEFSLDVDTRDDFIHVIGHLFFDYAIREKENKVFYKEGY
SRLFNREASKIILGDSKTISISLENYHNYSQGGVTLATMLENLPNFLTANVTEAFVSIGVNDLITGYSVEEIFSNFQKLY
SLLAENKIKMRFTTIAYTLFRETVNNADIEKINQWLTEFCYQNQIPLLDINRFLSKDGNLNYHLTSDGLHFTQEANDLLQ
SQYQLFVDEVKTL
>Q0P8T1 2.5.1.101~~~legI~~~N,N'-diacetyllegionaminic acid synthase~~~COG2089
MKKTLIIAEAGVNHNGDLNLAKKLIEIAADSGADFVKFQSFKAKNCISTKAKKAPYQLKTTANDESQLQMVQKLELDLKA
HKELILHAKKCNIAFLSTPFDLESVDLLNELGLKIFKIPSGEITNLPYLKKIAKLNKKIILSTGMANLGEIEEALNVLCK
NGAKRQNITLLHCTTEYPAPFNEVNLKAMQSLKDAFKLDVGYSDHTRGIHISLAAVALGACVIEKHFTLDKNMSGPDHKA
SLEPQELKMLCTQIRQIQKAMGDGIKKASKSEQKNINIVRKSLVAKKDIKKGEIFSEGNLTTKRPANGISAMRYEEFLGK
IATKNYKEDELIRE
>Q5ZXH9 2.5.1.101~~~neuB~~~N,N'-diacetyllegionaminic acid synthase~~~COG2089
MGSNRKINGIKPRGSSMTCFIIAEAGVNHNGDLQLAKELVYAAKESGADAVKFQTFKADTLVNKTVEKAEYQKNNAPESS
TQYEMLKALELSEEDHYLLSELANSLGIEFMSTGFDEQSIDFLISLGVKRLKIPSGEITNVPYLQHCASKKLPLIISTGM
CDLQEVRVAIDTVKPYYGNSLSDYLVLLHCTSNYPASYQDVNLKAMQTLADEFQLPVGYSDHTLGILVPTLAVGMGACVI
EKHFTMDKSLPGPDHLASMDPEEMKNLVQSIRDAETVLGSGEKKPSDNELPIRALVRRSITLRRDLVKGAQISKEDLILL
RPGTGIAPSEISNIVGSRLSMNLSAGTTLLWEHIEA
>Q0P8T0 3.2.1.184~~~legG~~~GDP/UDP-N,N'-diacetylbacillosamine 2-epimerase (hydrolyzing)~~~COG0381
MSKRKICIVSATRAEWYLLRNLCHEIQNDKDLSLQIIATGAHLSPEFGLTYKEIEKEFKITKKIPILLANDDKISLCKSM
SLAFSAFSDAFEDLKPDMVVILGDRYEMLSVASVCLLMHIPLVHLCGGELTLGAIDDSIRHSISKMSHLHFVSHEIYKKR
LLQLGEEEKRVFNIGSLASTIIKNMNFLNKKDLEKALEMKLDKELYLITYHPLTLNVKNTQKEIKTLLKKLDTLKNASLI
FTKANADENGLLINEILQNYCQKNSHKAKLFDNLGSQKYLSLMKIAKAMIGNSSSGISESPFFKTPCINIGDRQKGRLRT
QNIIDSEINDLDQAFEKLESKEFKQNLKNFKNPYDNGKNPNKIIKTCLKNVNLDTILHKNFIDL
>Q5ZXH8 3.2.1.184~~~~~~UDP-N,N'-diacetylbacillosamine 2-epimerase (hydrolyzing)~~~COG0381
MIRKIIYVTGTRADYGLMREVLKRLHQSEDIDLSICVTGMHLDALYGNTVNEIKADQFSICGIIPVDLANAQHSSMAKAI
GHELLGFTEVFESETPDVVLLLGDRGEMLAAAIAAIHLNIPVVHLHGGERSGTVDEMVRHAISKLSHYHFVATEASKQRL
IRMGEKEETIFQVGAPGLDEIMQYKTSTRDVFNQRYGFDPDKKICLLIYHPVVQEVDSIKIQFQSVIQAALATNLQIICL
EPNSDTGGHLIREVIQEYIDHPDVRIIKHLHRPEFIDCLANSDVMLGNSSSGIIEAASFNLNVVNVGSRQNLRERSDNVI
DVDVTYDAILTGLREALNKPKIKYSNCYGDGKTSERCYQLLKTIPLHSQILNKCNAY
>Q47400 ~~~neuC~~~Polysialic acid biosynthesis protein P7~~~
MKKILYVTGSRAEYGIVRRLLTMLRETPEIQLDLAVTGMHCDNAYGNTIHIIEQDNFNIIKVVDININTTSHTHILHSMS
VCLNSFGDFFSNNTYDAVMVLGDRYEIFSVAIAASMHNIPLIHIHGGEKTLANYDEFIRHSITKMSKLHLTSTEEYKKRV
IQLGEKPGSVFNIGSLGAENALSLHLPNKQELELKYGSLLKRYFVVVFHPETLSTQSVNDQIDELLSAISFFKNTHDFIF
IGSNADTGSDIIQRKVKYFCKEYKFRYLISIRSEDYLAMIKYSCGLIGNSSSGLIEVPSLKVATINIGDRQKGRVRGASV
IDVPVEKNAIVRGINISQDEKFISVVQSSSNPYFKENALINAVRIIKDFIKSKNKDYKDFYDIPECTTSYD
>A1ADJ6 2.3.1.136~~~neuO~~~Polysialic acid O-acetyltransferase~~~
MLRLKTQDSRLKTQDSRLKTQDSRLKTQDSRLKTQDSRLKTQDSRLKTQDSRLKTQDSRLKTQDSRLKTQDSRLKTQDSR
LKTQDSRLKTQDSFSVDDNGSGNVFVCGDLVNSKENKVQFNGNNNKLIIEDDVECRWLTVIFRGDNNYVRIHKNSKIKGD
IVATKGSKVIIGRRTTIGAGFEVVTDKCNVTIGHDCMIARDVILRASDGHPIFDIHSKKRINWAKDIIISSYVWVGRNVS
IMKGVSVGSGSVIGYGSIVTKDVPSMCAAAGNPAKIIKRNIIWARTDKAELISDDKRCSSYHAKLTQ
>P46739 ~~~nfaA~~~Non-fimbrial adhesin 1~~~
MKAKKYENQIYNENGRRCQRHGRRLAIADANGLNTVNAGDGKNLGTATATITTLQSCSVDLNLVTPNATVNRAGMLANRE
ITKFSVGSKDCPSDTYAVWFKEIDGEGQGVAQGTTVTNKFYLKMTSADGTASVGDINIGTKSGKGLSGQLVGGKFDGKIT
VAYDSATAPADVYTYDLMAAVYVQ
>A4IT49 1.5.1.45~~~~~~NAD(P)H-dependent FAD/FMN reductase GTNG_3158~~~COG0431
MKLLGISGTLVGTKTCILVEQVLVEAKRICPEVDIQLLDLKDYQVEFCDGRQQSSYNEDTQKVIELVSVADCYVIGTPIF
QGSITGALKNLFDLISPQALRHKVMGFVANGGTYQHYLVIENQLKPIASFFRAFVAPGSVYAHTDHFNEKNELVDPEVRE
RVAQLAWEVVHMHWSLKSGGVHAHR
>Q68AP4 3.5.1.91~~~nfdA~~~N-substituted formamide deformylase~~~
MTQMRDLMIINANVRTVDARNSCAQAVLVSGGRIAIVGTETEVRGAAAPDAEVLDVSGKTVVPGFIDAHNHLSVAAFAPD
SVDCSTPPLATLDEVLEVIERHCRNIPPGQWVRGINFHASHIREQRNPTRYELDEVAPNNPFFLIDASCHAGFANSAALD
LVGIGAHTPEPWGGEIERDLSGKPTGTLLEAAANLLHSASWNDYAERDWDRAVELLHSKMNDYLAVGLTGVGDAMVTAKS
AELYRRADAAGKMPFTLQQLHGGDHFFSMQDLGRSDTVDRIMEPESYLLRGGAMKIFVDRAYPSPAIDQIHDGCKTHVGA
NFYSKSEVHDLAVRASKLGINLAIHGMGNCAIDIVLDAYEAVRRQSNADTVLRLEHAFIAETGQGQRMADLGIDLVANPG
LAFGWGEVFNMWRGENQEHLKLFPVRSMLDAGVRVSLASDHPCGTYSPAEIMWTAVARETMAGAPLEPDEAVTADEALRM
YTINPAHASGRGSEEGSIEAGKRANLLVLDRDPVDCATGELRELQVLRTYVDGVLRYERTGS
>O32077 ~~~nfeD2~~~Membrane protein NfeD2~~~COG1585
MELFGVPIQTMYLYTLIIAGSLTLLFLFFGDVFSGLSEGIPFLNPTLVLSFFTCFSAGGYIGELVLPLSSLLIALLSCIL
SIMLVVLLHIFVLVPLSSAEESLAYREDDLRGRLGKVITAVPVDGFGEVVIEGIGGTISKSAVSFDNQQISYGTTVLVVD
INNGVLSVTPHEPI
>P96724 3.1.21.7~~~nfi~~~Endonuclease V~~~COG1515
MKVFDVHKFDMKKEQDFLQVQFNLKNRINLSPTIHPDSINTCAGVDLAYWEQDGEPYGVCCIIVIDADTKEVIEKVHSMG
RISVPYVSGFLAFRELPLIIEAAKKLETEPDVFLFDGNGYLHYNHMGVATHAAFFLGKPTIGIAKTYLKIKGCDFVTPEI
EVGAYTDIIIDGEVYGRALRTRRDVKPIFLSCGNYIDLDSSYQITMSLINQESRLPIPVRLADLETHVLRTFYQKNHV
>P68739 3.1.21.7~~~nfi~~~Endonuclease V~~~COG1515
MDLASLRAQQIELASSVIREDRLDKDPPDLIAGADVGFEQGGEVTRAAMVLLKYPSLELVEYKVARIATTMPYIPGFLSF
REYPALLAAWEMLSQKPDLVFVDGHGISHPRRLGVASHFGLLVDVPTIGVAKKRLCGKFEPLSSEPGALAPLMDKGEQLA
WVWRSKARCNPLFIATGHRVSVDSALAWVQRCMKGYRLPEPTRWADAVASERPAFVRYTANQP
>Q82MH6 3.1.21.7~~~nfi~~~Endonuclease V~~~COG1515
MTTVRIPAGWPATEEEARAVQDELRGRVILDEPGPPPGTGRVTGVDVAYDDERDVVVAAAVVLDAATLDVVAEATAVGEV
SFPYVPGLLAFREIPTVLAALDALPCPPGLIVCDGYGVAHPRRFGLASHLGVLTGLPTIGVAKNPFTFSYEDPGAPRGSA
APLLAGADEVGRALRTQSGVKPVFVSVGHRVDLDHACAHTLALTPKYRIPETTRRADSLCRRALKEATA
>Q9X2H9 3.1.21.7~~~nfi~~~Endonuclease V~~~COG1515
MDYRQLHRWDLPPEEAIKVQNELRKKIKLTPYEGEPEYVAGVDLSFPGKEEGLAVIVVLEYPSFKILEVVSERGEITFPY
IPGLLAFREGPLFLKAWEKLRTKPDVVVFDGQGLAHPRKLGIASHMGLFIEIPTIGVAKSRLYGTFKMPEDKRCSWSYLY
DGEEIIGCVIRTKEGSAPIFVSPGHLMDVESSKRLIKAFTLPGRRIPEPTRLAHIYTQRLKKGLF
>A0R6D0 1.-.-.-~~~nfnB~~~Nitroreductase NfnB~~~COG0778
MSVPTLPTGPTVDLAQAAERLIKGRRAVRAFRPDEVPEETMRAVFELAGHAPSNSNTQPWHVEVVSGAARDRLAEALVTA
HAEERVTVDFPYREGLFQGVLQERRADFGSRLYAALGIARDQTDLLQGYNTESLRFYGAPHVAMLFAPNNTEARIAGDMG
IYAQTLMLAMTAHGIASCPQALLSFYADTVRAELGVENRKLLMGISFGYADDTAAVNGVRIPRAGLSETTRFSR
>Q74HL7 1.5.1.36~~~nfr1~~~NADH-dependent flavin reductase subunit 1~~~COG0431
MKLFAIVGSNADHSYNRDLLNFIKKHFTDRYDIELGEVKDLPMFKEGVKEPAAVASFAKKVADADAVLISTPEQQHSVPS
SLKSALEWLSSAEHPFKDKPVVIVGTSVLPQGSARGQSHLKLVLSSPGFGAKVFNGDEFMMGTAPEQFDENGNLPAGTVK
FLDHFFDEFDSFYAEVSK
>Q74HL8 1.5.1.36~~~nfr2~~~NADH-dependent flavin reductase subunit 2~~~COG0431
MKLLAIVGTNADFSYNRFLDQFMAKRYKDQAEIEVYEIADLPRFKKEAQPDSKVEEFKNKIREADGVIFATPEYDHGIPS
ALKSAMEWTGSHAQGNADVMKMKPAMVLGTSYGIQGASRAQEEMREILLSPDQSANVLPGNEVLIGHAADKFDKNTGDLL
DQETIHAIDLAFNNFVKFVEQAQK
>P39605 1.5.1.38~~~nfrA1~~~FMN reductase (NADPH)~~~COG0778
MNNTIETILNHRSIRSFTDQLLTAEEIDTLVKSAQAASTSSYVQAYSIIGVSDPEKKRELSVLAGNQPYVEKNGHFFVFC
ADLYRHQQLAEEKGEHISELLENTEMFMVSLIDAALAAQNMSIAAESMGLGICYIGGIRNELDKVTEVLQTPDHVLPLFG
LAVGHPANLSGKKPRLPKQAVYHENTYNVNTDDFRHTMNTYDKTISDYYRERTNGKREETWSDQILNFMKQKPRTYLNDY
VKEKGFNKN
>P94424 1.5.1.39~~~nfrA2~~~FMN reductase [NAD(P)H]~~~COG0778
MNEVIKSLTDHRSIRSYTDEPVAQEQLDQIIEAVQSAPSSINGQQVTVITVQDKERKKKISELAGGQPWIDQAPVFLLFC
ADFNRAKIALEDLHDFKMEITNGLESVLVGAVDAGIALGTATAAAESLGLGTVPIGAVRGNPQELIELLELPKYVFPLSG
LVIGHPADRSAKKPRLPQEAVNHQETYLNQDELTSHIQAYDEQMSEYMNKRTNGKETRNWSQSIASYYERLYYPHIREML
EKQGFKVEK
>P31600 ~~~nfrA~~~Bacteriophage adsorption protein A~~~COG4783
MKENNLNRVIGWSGLLLTSLLSTSALADNIGTSAEELGLSDYRHFVIYPRLDKALKAQKNNDEATAIREFEYIHQQVPDN
IPLTLYLAEAYRHFGHDDRARLLLEDQLKRHPGDARLERSLAAIPVEVKSVTTVEELLAQQKACDAAPTLRCRSEVGQNA
LRLAQLPVARAQLNDATFAASPEGKTLRTDLLQRAIYLKQWSQADTLYNEARQQNTLSAAERRQWFDVLLAGQLDDRILA
LQSQGIFTDPQSYITYATALAYRGEKARLQHYLIENKPLFTTDAQEKSWLYLLSKYSANPVQALANYTVQFADNRQYVVG
ATLPVLLKEGQYDAAQKLLATLPANEMLEERYAVSVATRNKAEALRLARLLYQQEPANLTRLDQLTWQLMQNEQSREAAD
LLLQRYPFQGDARVSQTLMARLASLLESHPYLATPAKVAILSKPLPLAEQRQWQSQLPGIADNCPAIVRLLGDMSPSYDA
AAWNRLAKCYRDTLPGVALYAWLQAEQRQPSAWQHRAVAYQAYQVEDYATALAAWQKISLHDMSNEDLLAAANTAQAAGN
GAARDRWLQQAEKRGLGSNALYWWLHAQRYIPGQPELALNDLTRSINIAPSANAYVARATIYRQRHNVPAAVSDLRAALE
LEPNNSNTQAALGYALWDSGDIAQSREMLEPAHKGLPDDPALIRQLAYVNQRLDDMPATQHYARLVIDDIDNQALITPLT
PEQNQQRFNFRRLHEEVGRRWTFSFDSSIGLRSGAMSTANNNVGGAAPGKSYRSYGQLEAEYRIGRNMLLEGDLLSVYSR
VFADTGENGVMMPVKNPMSGTGLRWKPLRDQIFFIAVEQQLPLNGQNGASDTMLRASASFFNGGKYSDEWHPNGSGWFAQ
NLYLDAAQYIRQDIQAWTADYRVSWHQKVANGQTIEPYAHVQDNGYRDKGTQGAQLGGVGVRWNIWTGETHYDAWPHKVS
LGVEYQHTFKAINQRNGERNNAFLTIGVHW
>P17117 1.-.-.-~~~nfsA~~~Oxygen-insensitive NADPH nitroreductase~~~COG0778
MTPTIELICGHRSIRHFTDEPISEAQREAIINSARATSSSSFLQCSSIIRITDKALREELVTLTGGQKHVAQAAEFWVFC
ADFNRHLQICPDAQLGLAEQLLLGVVDTAMMAQNALIAAESLGLGGVYIGGLRNNIEAVTKLLKLPQHVLPLFGLCLGWP
ADNPDLKPRLPASILVHENSYQPLDKGALAQYDEQLAEYYLTRGSNNRRDTWSDHIRRTIIKESRPFILDYLHKQGWATR
>P38489 1.-.-.-~~~nfsB~~~Oxygen-insensitive NAD(P)H nitroreductase~~~COG0778
MDIISVALKRHSTKAFDASKKLTPEQAEQIKTLLQYSPSSTNSQPWHFIVASTEEGKARVAKSAAGNYVFNERKMLDASH
VVVFCAKTAMDDVWLKLVVDQEDADGRFATPEAKAANDKGRKFFADMHRKDLHDDAEWMAKQVYLNVGNFLLGVAALGLD
AVPIEGFDAAILDAEFGLKEKGYTSLVVVPVGHHSVEDFNATLPKSRLPQNITLTEV
>Q01234 1.-.-.-~~~nfsB~~~Oxygen-insensitive NAD(P)H nitroreductase~~~COG0778
MDIISVALKRHSTKAFDASKKLTAEEAEKIKTLLQYSPSSTNSQPWHFIVASTEEGKARVAKSAAGTYVFNERKMLDASH
VVVFCAKTAMDDAWLERVVDQEEADGRFNTPEAKAANHKGRTYFADMHRVDLKDDDQWMAKQVYLNVGNFLLGVGAMGLD
AVPIEGFDAAILDEEFGLKEKGFTSLVVVPVGHHSVEDFNATLPKSRLPLSTIVTEC
>P15888 1.-.-.-~~~nfsB~~~Oxygen-insensitive NAD(P)H nitroreductase~~~
MDIVSVALQRYSTKAFDPSKKLTAEEADKIKTLLQYSPSSTNSQPWHFIVASTEEGKARVAKSAAGNYTFNERKMLDASH
VVVFCAKTAMDDAWLERVVDQEDADGRFATPEAKAANDKGRRFFADMHRVSLKDDHQWMAKQVYLNVGNFLLGVAAMGLD
AVPIEGFDAEVLDAEFGLKEKGYTSLVVVPVGHHSVEDFNAGLPKSRLPLETTLTEV
>P63020 ~~~nfuA~~~Fe/S biogenesis protein NfuA~~~COG0316
MIRISDAAQAHFAKLLANQEEGTQIRVFVINPGTPNAECGVSYCPPDAVEATDTALKFDLLTAYVDELSAPYLEDAEIDF
VTDQLGSQLTLKAPNAKMRKVADDAPLMERVEYMLQSQINPQLAGHGGRVSLMEITEDGYAILQFGGGCNGCSMVDVTLK
EGIEKQLLNEFPELKGVRDLTEHQRGEHSYY
>P31774 ~~~nfuA~~~Fe/S biogenesis protein NfuA~~~COG0316
MEQATQQIAISDAAQAHFRKLLDTQEEGTHIRIFVVNPGTPNAECGVSYCPPNAVEESDIEMKYNTFSAFIDEVSLPFLE
EAEIDYVTEELGAQLTLKAPNAKMRKVADDAPLIERVEYVIQTQINPQLANHGGRITLIEITEDGYAVLQFGGGCNGCSM
VDVTLKDGVEKQLVSLFPNELKGAKDITEHQRGEHSYY
>O50499 ~~~ngcE~~~Diacetylchitobiose binding protein NgcE~~~COG1653
MTIRAGSLDRRTLLRGAIATAAMGSFAVACSSPSSEDKESDSGPKGEKSANNPFGAAANSTVEAAIFDGGYGTDYVDYAN
QVLGSQVKGLKVQVKPVVDIAPQLQPRFVGGNPPDLIDNSGEDQIGFLGILDQLEELDDLFEASTYEGKKIADIVYPGVK
DPGTFKDKFVALNYVMTVYGVWYSKTLFEENGWTPPKTWDEALDLGQEAKKKGKYLFVHGKEAATYYRTLLIDSAIKEGG
DEVRLALENLEKGCWSHPAVQGVIKVMETMVKQKMFVPGGSGTQFQKAQAIWSNDQKALLYPSGGWIENEMKKATKADFQ
MTGIPSMTLTDKPALPYEALRAAAGEPFIVPKQGKNPAGGKEVLRAMLSEKAAANFSKTKLAPTIVKGTVPADGYGSTAL
VSQTKMLEAAGTNIFNYMFVETYGLNTDQLVPWNSFLAGDLDGKGLTSALQKISDKVREDDSVDKVKVS
>O50500 ~~~ngcF~~~Diacetylchitobiose uptake system permease protein NgcF~~~COG1175
MKDTIPTAETASRRPEPAARGGRPRRRKLTFDRVTFFLAFLGVPLAIFVIFVLIPFGQAIFWGMTDWRGFSPDYNFVGFD
NFTKMFQDDIFLKALRNVALLAAFVPLVTLTLALGVAVAITLGGPSKGPVRGIRGASFYRIISFFPYVVPAIIVGLIWAQ
MYDPNAGLLNGVLTGLGLDQFDTFAWLGEKAAAMPAVMFVIVWGLVGFYAVLFIAAIKGVPGELYEAAKIDGAGRFRTTI
SITLPAIRDSVQTAYIYLGIAALDAFVYVQAMVPNGGPDNSTLTISQRLFNVAFAKQQFGYATAMGVVLAAVTLVFAALV
FLVNRLTGGGEGESKRKAPGSRARRAAAKGGAR
>O50501 ~~~ngcG~~~Diacetylchitobiose uptake system permease protein NgcG~~~COG0395
MSVIAADRKPASDRKPVSDRKLARAAASSDRRFAAISHALLILWSVIVIVPMLWVLMSSFKSTGEILSSPFSLPDHWRFE
NYANAWTDANIGKYFLNSVIVVVSALILVMLLGAMCAYVLARFEFPGRRLIYYVMLAGLTFPVFLAIVPLFFQLQNFGLL
NTRPGLILTYVAFALPFTMFFLYSFFRSLPHDVYEAALIDGAGDWRAFFQVMLPMARPGMAAVAIFNFLGLWNQFLLPVA
LNTDQDKWVLTQGMAAYASSQVYDIDYGALFAAIVVTVVPVLLVYCVFQRRIAGSVSQGTFR
>A3N2T3 2.4.1.-~~~~~~UDP-glucose:protein N-beta-glucosyltransferase~~~COG3914
MENENKPNVANFEAAVAAKDYEKACSELLLILSQLDSNFGGIHEIEFEYPAQLQDLEQEKIVYFCTRMATAITTLFSDPV
LEISDLGVQRFLVYQRWLALIFASSPFVNADHILQTYNREPNRKNSLEIHLDSSKSSLIKFCILYLPESNVNLNLDVMWN
ISPELCASLCFALQSPRFVGTSTAFNKRATILQWFPRHLDQLKNLNNIPSAISHDVYMHCSYDTSVNKHDVKRALNHVIR
RHIESEYGWKDRDVAHIGYRNNKPVMVVLLEHFHSAHSIYRTHSTSMIAAREHFYLIGLGSPSVDQAGQEVFDEFHLVAG
DNMKQKLEFIRSVCESNGAAIFYMPSIGMDMTTIFASNTRLAPIQAIALGHPATTHSDFIEYVIVEDDYVGSEECFSETL
LRLPKDALPYVPSALAPEKVDYLLRENPEVVNIGIASTTMKLNPYFLEALKAIRDRAKVKVHFHFALGQSNGITHPYVER
FIKSYLGDSATAHPHSPYHQYLRILHNCDMMVNPFPFGNTNGIIDMVTLGLVGVCKTGAEVHEHIDEGLFKRLGLPEWLI
ANTVDEYVERAVRLAENHQERLELRRYIIENNGLNTLFTGDPRPMGQVFLEKLNAFLKEN
>B3H2N2 2.4.1.-~~~~~~UDP-glucose:protein N-beta-glucosyltransferase~~~
MENENKPNVANFEAAVAVKDYEKACSELLLILSQLDSNFGGIQEIEFEYPVQLQDLEQEKIVYFCTRMATAITTLFSDPV
LEISDLGVQRFLVYQRWLALIFASSPFVNADHILQTYNREPNRKNSLEIHLDSSKSSLIKFCILYLPESNVNLNLDVMWN
ISPELCASLCFALQSPRFIGTSTAFNKRATILQWFPRHLDQLKNLNNIPSAISHDVYMHCSYDTSVNKHDVKRALNHVIR
RHIESEYGWKDRYVAHIGYRNNKPVMVVLLEHFHSAHSIYRTHSTSMIAAREHFYLIGLGSPSVDQAGQEVFDEFHLVAG
DNMKQKLEFIRSVCESNGAAIFYMPSIGMDMTTIFASNTRLAPIQAIALGHPATTHSDFIEYVIVEDDYVGSEACFSETL
LRLPKDALPYVPSALAPEKVDYLLRENPEVVNIGIASTTMKLNPYFLEALKAIRDRAKVKVHFHFALGQSNGITHPYVER
FIKSYLGDSATAHPHSPYHQYLRILHNCDMMVNPFPFGNTNGIIDMVTLGLVGVCKTGAEVHEHIDEGLFKRLGLPEWLI
ANTVDEYVERAVRLAENHQERLELRRYIIENNGLNTLFTGDPRPMGQVFLEKLNAFLKEN
>P21219 4.2.1.84~~~nhhA~~~High-molecular weight cobalt-containing nitrile hydratase subunit alpha~~~
MSEHVNKYTEYEARTKAIETLLYERGLITPAAVDRVVSYYENEIGPMGGAKVVAKSWVDPEYRKWLEEDATAAMASLGYA
GEQAHQISAVFNDSQTHHVVVCTLCSCYPWPVLGLPPAWYKSMEYRSRVVADPRGVLKRDFGFDIPDEVEVRVWDSSSEI
RYIVIPERPAGTDGWSEEELTKLVSRDSMIGVSNALTPQEVIV
>P29378 4.2.1.84~~~~~~Low-molecular weight cobalt-containing nitrile hydratase subunit alpha~~~
MTAHNPVQGTLPRSNEEIAARVKAMEAILVDKGLISTDAIDHMSSVYENEVGPQLGAKIVARAWVDPEFKQRLLTDATSA
CREMGVGGMQGEEMVVLENTGTVHNMVVCTLCSCYPWPVLGLPPNWYKYPAYRARAVRDPRGVLAEFGYTPDPDVEIRIW
DSSAELRYWVLPQRPAGTENFTEEQLADLVTRDSLIGVSVPTTPSKA
>P13738 ~~~nhaA~~~Na(+)/H(+) antiporter NhaA~~~COG3004
MKHLHRFFSSDASGGIILIIAAILAMIMANSGATSGWYHDFLETPVQLRVGSLEINKNMLLWINDALMAVFFLLVGLEVK
RELMQGSLASLRQAAFPVIAAIGGMIVPALLYLAFNYADPITREGWAIPAATDIAFALGVLALLGSRVPLALKIFLMALA
IIDDLGAIIIIALFYTNDLSMASLGVAAVAIAVLAVLNLCGARRTGVYILVGVVLWTAVLKSGVHATLAGVIVGFFIPLK
EKHGRSPAKRLEHVLHPWVAYLILPLFAFANAGVSLQGVTLDGLTSILPLGIIAGLLIGKPLGISLFCWLALRLKLAHLP
EGTTYQQIMVVGILCGIGFTMSIFIASLAFGSVDPELINWAKLGILVGSISSAVIGYSWLRVRLRPSV
>P27764 4.2.1.84~~~nthA~~~Nitrile hydratase subunit alpha~~~
MSTSISTTATPSTPGERAWALFQVLKSKELIPEGYVEQLTQLMAHDWSPENGARVVAKAWVDPQFRALLLKDGTAACAQF
GYTGPQGEYIVALEDTPGVKNVIVCSLCSCTNWPVLGLPPEWYKGFEFRARLVREGRTVLRELGTELPSDTVIKVWDTSA
ESRYLVLPQRPEGSEHMSEEQLQQLVTKDVLIGVALPRVG
>Q7SID2 4.2.1.84~~~~~~Cobalt-containing nitrile hydratase subunit alpha~~~
MTENILRKSDEEIQKEITARVKALESMLIEQGILTTSMIDRMAEIYENEVGPHLGAKVVVKAWTDPEFKKRLLADGTEAC
KELGIGGLQGEDMMWVENTDEVHHVVVCTLCSCYPWPVLGLPPNWFKEPQYRSRVVREPRQLLKEEFGFEVPPSKEIKVW
DSSSEMRFVVLPQRPAGTDGWSEEELATLVTRESMIGVEPAKAV
>P13448 4.2.1.84~~~nthA~~~Nitrile hydratase subunit alpha~~~
MSVTIDHTTENAAPAQAPVSDRAWALFRALDGKGLVPDGYVEGWKKTFEEDFSPRRGAELVARAWTDPEFRQLLLTDGTA
AVAQYGYLGPQGEYIVAVEDTPTLKNVIVCSLCSCTAWPILGLPPTWYKSFEYRARVVREPRKVLSEMGTEIASDIEIRV
YDTTAETRYMVLPQRPAGTEGWSQEQLQEIVTKDCLIGVAIPQVPTV
>Q8ZRZ3 ~~~nhaA~~~Na(+)/H(+) antiporter NhaA~~~
MKHLHRFFSSDASGGIILIIAAALAMLMANMGATSGWYHDFLETPVQLRVGALEINKNMLLWINDALMAVFFLLIGLEVK
RELMQGSLASLRQAAFPVIAAIGGMIVPALLYLAFNYSDPVTREGWAIPAATDIAFALGVLALLGSRVPLALKIFLMALA
IIDDLGAIVIIALFYTSDLSIVSLGVAAFAIAVLALLNLCGVRRTGVYILVGAVLWTAVLKSGVHATLAGVIVGFFIPLK
EKHGRSPAKRLEHVLHPWVAYLILPLFAFANAGVSLQGVTIDGLTSMLPLGIIAGLLIGKPLGISLFCWLALRFKLAHLP
QGTTYQQIMAVGILCGIGFTMSIFIASLAFGNVDPELINWAKLGILIGSLLSAVVGYSWLRARLNAPA
>O85187 ~~~nhaA~~~Na(+)/H(+) antiporter NhaA~~~COG3004
MSDMIRDFFKMESAGGILLVIAAAIAMVIANSAMGEGYQAFLHTYVFGMSVSHWINDGLMAVFFLLIGLEVKRELLEGAL
KSRETAIFPAIAAVGGMLAPALIYVAFNFNDPAAIQGWAIPAATDIAFALGIMALLGKRVPVSLKVFLLALAIIDDLGVV
VIIALFYSSDLSTIALTIGFIMTGVLFMLNAKHVTKLSIYLVAGLILWIAVLKSGVHATLAGVVIGFAIPLKGNKGEHSP
LKHLEHALHPYVAFAILPVFAFANAGISLQGVSLAGLTSMLPLGVALGLFLGKPLGIFSFSWAAVKLGVAKLPEGINFKH
IFAVSVLCGIGFTMSIFISSLAFGQANEAYDTYARLGILMGSTTAALLGYSLLRLSLPLKKA
>Q56725 ~~~nhaA~~~Na(+)/H(+) antiporter NhaA~~~COG3004
MNDVIRDFFKMESAGGILLVIAAAIAMTIANSPLGETYQSLLHTYVFGMSVSHWINDGLMAVFFLLIGLEVKRELLEGAL
KSKETAIFPAIAAVGGMLAPALIYVAFNANDPEAISGWAIPAATDIAFALGIMALLGKRVPVSLKVFLLALAIIDDLGVV
VIIALFYTGDLSSMALLVGFVMTGVLFMLNAKEVTKLTPYMIVGAILWFAVLKSGVHATLAGVVIGFAIPLKGKQGEHSP
LKHMEHALHPYVAFGILPLFAFANAGISLEGVSMSGLTSMLPLGIALGLLIGKPLGIFSFSWAAVKLGVAKLPEGINFKH
IFAVSVLCGIGFTMSIFISSLAFGNVSPEFDTYARLGILMGSTTAAVLGYALLHFSLPKKAQD
>P0AFA7 ~~~nhaB~~~Na(+)/H(+) antiporter NhaB~~~COG3067
MEISWGRALWRNFLGQSPDWYKLALIIFLIVNPLIFLISPFVAGWLLVAEFIFTLAMALKCYPLLPGGLLAIEAVFIGMT
SAEHVREEVAANLEVLLLLMFMVAGIYFMKQLLLFIFTRLLLSIRSKMLLSLSFCVAAAFLSAFLDALTVVAVVISVAVG
FYGIYHRVASSRTEDTDLQDDSHIDKHYKVVLEQFRGFLRSLMMHAGVGTALGGVMTMVGEPQNLIIAKAAGWHFGDFFL
RMSPVTVPVLICGLLTCLLVEKLRWFGYGETLPEKVREVLQQFDDQSRHQRTRQDKIRLIVQAIIGVWLVTALALHLAEV
GLIGLSVIILATSLTGVTDEHAIGKAFTESLPFTALLTVFFSVVAVIIDQQLFSPIIQFVLQASEHAQLSLFYIFNGLLS
SISDNVFVGTIYINEAKAAMESGAITLKQYELLAVAINTGTNLPSVATPNGQAAFLFLLTSALAPLIRLSYGRMVWMALP
YTLVLTLVGLLCVEFTLAPVTEWFMQMGWIATL
>Q9I2S5 ~~~nhaB~~~Na(+)/H(+) antiporter NhaB~~~
MSSPLSQAFAQNFLGHSPRWYKLTVLAFLLLNPLLLWLAGPVTSAWVLVGEFIFTLAMALKCYPLQPGGLLVLEALLLGL
ATPEALYAELQHNFPVLLLLMFMVAGIYFMKDLLLLLFSRLLLGVRSKTLLSLLFCLLAALLSAFLDALTVTAVVISVAV
AFFAVYHRVASGQRASEDYDPATDRQVPELHRAHLEEFRAFLRSLLMHAAVGTALGGVCTLVGEPQNLLIGHEAGWHFVE
FFRQVAPVSMPVLAAGLLTCVLLEKSRRFGYGAQLPAAVRQVLAEYAASESRKRGAQQKAALLVQALAALVLIVGLALHV
AEVGLIGLLVIVLITAFTGVTDEHQIGRAFQEALPFTALLVVFFAVVAVIHQQHLFTPIIQAVLALPAERQPGMLFIANG
LLSAISDNVFVATIYITEVKQALDAGHMSREHFDTLAVAINTGTNLPSVATPNGQAAFLFLLTSSIAPLVRLSYGRMVWM
ALPYTLVMGGLGWWAVSHWL
>P27763 4.2.1.84~~~nthB~~~Nitrile hydratase subunit beta~~~
MDGFHDLGGFQGFGKVPHTINSLSYKQVFKQDWEHLAYSLMFVGVDQLKKFSVDEVRHAVERLDVRQHVGTQYYERYIIA
TATLLVETGVITQAELDQALGSHFKLANPAHATGRPAITGRPPFEVGDRVVVRDEYVAGHIRMPAYVRGKEGVVLHRTSE
QWPFPDAIGHGDLSAAHQPTYHVEFRVKDLWGDAADDGYVVVDLFESYLDKAPGAQAVNA
>P97052 4.2.1.84~~~nthB~~~Nitrile hydratase subunit beta~~~
MNGIHDTGGAHGYGPVYREPNEPVFRYDWEKTVMSLLPALLANANFNLDEFRHSIERMGPAHYLEGTYYEHWLHVFENLL
VEKGVLTATEVATGKAASGKTATRVLTPAIVDDSSAPGLLRPGGGFSFFPVGDKVRVLNKNPVGHTRMPRYTRAKWGQWS
STMVCFVTPDTAAHGKGEQPQHVYTVSFTSVELWGQDASSPKDTIRVDLWDDYLEPA
>Q7SID3 4.2.1.84~~~~~~Cobalt-containing nitrile hydratase subunit beta~~~
MNGVYDVGGTDGLGPINRPADEPVFRAEWEKVAFAMFPATFRAGFMGLDEFRFGIEQMNPAEYLESPYYWHWIRTYIHHG
VRTGKIDLEELERRTQYYRENPDAPLPEHEQKPELIEFVNQAVYGGLPASREVDRPPKFKEGDVVRFSTASPKGHARRAR
YVRGKTGTVVKHHGAYIYPDTAGNGLGECPEHLYTVRFTAQELWGPEGDPNSSVYYDCWEPYIELVDTKAAAA
>P13449 4.2.1.84~~~nthB~~~Nitrile hydratase subunit beta~~~
MDGVHDLAGVQGFGKVPHTVNADIGPTFHAEWEHLPYSLMFAGVAELGAFSVDEVRYVVERMEPRHYMMTPYYERYVIGV
ATLMVEKGILTQDELESLAGGPFPLSRPSESEGRPAPVETTTFEVGQRVRVRDEYVPGHIRMPAYCRGRVGTISHRTTEK
WPFPDAIGHGRNDAGEEPTYHVKFAAEELFGSDTDGGSVVVDLFEGYLEPAA
>Q53117 4.2.1.84~~~nthB~~~Nitrile hydratase subunit beta~~~
MNGVFDLGGTDGIGPVDPPAEEPVFRADWEKAAFTMFSALFRAGWFGIDEFRHGVEKMDPALYLKSPYYKHWIASFEYHG
KRTGKLDLAELDRRTQYYLANPDAPLPEHGPNQELIDFANAVVPSGAPAIRPTDKEPRFKIGDVVRMSSDVPFGHTRIAG
YVRGKVGRVISHHGSFVYPDSAGNGRGDDPQHLYTLQFDATELWGEQYAEPNVTTTFDAWDPYLTLVTAPEGAAA
>Q56577 ~~~nhaB~~~Na(+)/H(+) antiporter NhaB~~~COG3067
MPISLGNAFIKNFLGKAPDWYKVAIIAFLIINPIVFFLINPFVAGWLLVAEFIFTLAMALKCYPLQPGGLLAIEAIAIGM
TSPAQVKHELVANIEVLLLLVFMVAGIYFMKHLLLFIFTKILLGIRSKTLLSLAFCFAAAFLSAFLDALTVIAVVISVAI
GFYSIYHKVASGNPIGDHDHTQDDTITELTRDDLENYRAFLRSLLMHAGVGTALGGVTTMVGEPQNLIIADQAGWLFGEF
LIRMSPVTLPVFFCGLITCALVEKLKVFGYGAKLPNNVRQILVDFDNEERKTRTNQDVAKLWVQGLIAVWLIVALALHLA
AVGLIGLSVIILATAFTGVIEEHSMGKAFEEALPFTALLAVFFSIVAVIIDQELFKPVIDAVLAVEDKGTQLALFYVANG
LLSMVSDNVFVGTVYINEVKTALIEGLITREQFDLLAVAINTGTNLPSVATPNGQAAFLFLLTSALAPLIRLSYGRMVIM
ALPYTIVLAIVGLMGIMFFLEPATASFYDAGWILPHSGDLTPVVSGGH
>Q9KQU7 ~~~nhaB~~~Na(+)/H(+) antiporter NhaB~~~COG3067
MPMSLGNAFIKNFLGKAPDWYKVAIIAFLIINPIVFFFINPFLAGWLLVVEFIFTLAMALKCYPLQPGGLLAIEAIAIGM
TSPGQVKHELVANIEVLLLLVFMVAGIYFMKQLLLFIFTKILLGIRSKVLLSIAFSVTAAFLSAFLDALTVIAVIISVAV
GFYSIYHKVASGQSVHSSHDHTHDEGISELTRDDLENYRAFLRSLLMHAGVGTALGGVTTMVGEPQNLIIADQAGWQFGE
FLLRMAPVTVPVFIAGMLTCMLVEKFRIFGYGARLPANVRQILLDFDSEERKNRTNQDVAKLWVQGAIAVWLIVGLALHL
AAVGLIGLSVIILATAFTGVIEEHSLGKAFEEALPFTALLAVFFSIVAVIIDQELFKPVIDAVLNVEDHGTQLALFYVAN
GLLSMVSDNVFVGTVYITEVKTALMEGMISRDQFDLLAVAINTGTNLPSVATPNGQAAFLFLLTSALAPLIRLSYGRMVI
MAFPYTLALSLVGFIGIMFLLEPMTEVFYSLGWISHHVAPAADALLQSGH
>Q87N04 ~~~nhaB~~~Na(+)/H(+) antiporter NhaB~~~COG3067
MPISLGNAFIKNFLGKAPDWYKVAIIAFLIINPIVFFLINPFVAGWLLVAEFIFTLAMALKCYPLQPGGLLAIEAIAIGM
TSPAQVKHELVANIEVLLLLVFMVAGIYFMKQLLLFIFTKILLGIRSKTLLSLAFCFAAAFLSAFLDALTVIAVVISVAV
GFYSIYHKVASGNPIGDHDHTQDDTITELTRDDLENYRAFLRSLLMHAGVGTALGGVTTMVGEPQNLIIADQAGWLFGEF
LIRMSPVTLPVFICGLITCALVEKLKVFGYGAKLPDNVRQILVDFDREERKTRTNQDVAKLWVQGIIAVWLIVALALHLA
AVGLIGLSVIILATSFTGVIEEHSMGKAFEEALPFTALLAVFFSIVAVIIDQELFKPVIDAVLAVEDKGTQLALFYVANG
LLSMVSDNVFVGTVYINEVKSALMEGLITREQFDLLAVAINTGTNLPSVATPNGQAAFLFLLTSALAPLIRLSYGRMVVM
ALPYTVVLAIVGLMGIMFFLEPATASFYDAGWIAPHTGDLTPVVSGGH
>O07553 ~~~nhaC~~~Na(+)/H(+) antiporter NhaC~~~COG1757
MDSQKKLTFPLAVGLFIFMLCIIISCLFLLHVEPHIPLFLSVVMLSAAALWFGFPWKSIEKGIVDGIKNGVQPIIVLALI
GILIGAWMYSGAIPTMTVYALSFIEPSHLLLTALFSCMIISTLVGSSLTTVSTIGVALIGVASAAGVPLEWTAGAVICGA
CFGDKMSPMSDTTNFAAGIGEIPIFEHIRHMMGTTIPALLITVVLFYFLGSSVSADAASTDNIQQVITGIKDAANVTPWA
LLSPLLVVLLAMKRVSVIPVLTAGIISSGILTAIFVPYSSLQAFMTALQNGTTFETDNEAAAKIINRGGLQSMMGSVSLI
MIAFALGGLMEKIGLISALLEGVMKGIRSKGRLVAATVCSSIGVNLATGEQYLSILIPGQSFKSLYDKRNIQRKFLTRSL
EDGGTLINPLIPWGVSGAFMASALGVPVIDYIPFTFFLYISPMISILIGFVKK
>Q56EB3 ~~~nhaD~~~Na(+)/H(+) antiporter NhaD~~~
MQSLRCVSWLAGLLCLLFSTPVFAASAAPLDLTSSLVGFVCIAIFVVAYVLVMGEEKLHMRKSKPVLVAAGLIWILIGWV
YISRDIPDVTEAAFRHNLLEFAELMLFLLVAMTYINALEERRLFDALRAWMIRKGFSYQNLFWITGFLSFFISPIADNLT
TALLMCAVVMKVAEGDKRFINLCCVNIVIAANAGGAFSPFGDITTLMVWQAGLVRIDEFLVLFFPALVNYLIPAAVMSFF
VEKRQPSAVYEDVELKRGALRILTLFLLTVATAVLCHSLLHLPPVLGMMMGLGYLQFFGYFLRMTLPGSLARKRAMAERE
GDQEKLKRLGGVVPFDVFSRVSRAEWDTLLFFYGIVMCVGGLGFLGYLGLMSDLLYEGWNPTSANILLGVISAVIDNIPV
MFAVLAMQPEMSHGHWLLITLTAGVGGSLLSIGSAAGVALMGQARGYYTFFGHLKWAPVIFIGYIASIAVHLWLNADLFH
IYD
>E1VBT7 ~~~nhaD~~~Na(+)/H(+) antiporter NhaD~~~COG1055
MLTNHRSPHWLRHSARWPGFLVLALPILLFSPLAQAASAGELDLTSSLPGFIAVTLFLAAYVLVMAEEKLHMRKSKPVLV
AAGLIWAMIGWVYVHAGLPDASEEAFSETLLEYSELLLFLLVAMTYINAMEERRVFDKLRAWLVEKGFSYRSLFWITGIL
AFWISTIANNLTTAMLMCAVVLKVAEGDKRFINLCCINVVVASNAGGAFSPFGDITTLMVWQAKLVEFQEFFELLGPALV
NYLVPAIVMSLFIKNRKPAALEEHIWLKRGARRIVLLFLVTIVISVLCHTMLNLPPALGMMTGLGFLQFFGYYLRQSLPR
SLERKRTRYSQRGDWRKLESLGSVVPFDVFTRIARSEWDTLLFFYGVVMSVGGLGFMGYLALLSETLYTGWDPVWANIVL
GLVSSVVDNIPVMFAVISMEPDMSMGNWLLITLTAGVGGSLLSVGSAAGVALMGQARGIYTFASHMRWAPVIALGYVASV
VVHLMINADSFAIFH
>A5F120 ~~~nhaD~~~Na(+)/H(+) antiporter NhaD~~~COG1055
MTGRIALLSLTLFSPLSLASTPDGQALDFTHSTIGYAALLIFAIAYTLVMLEEYLQLRKSKPVLLAAGLIWAMIGYVYQQ
TGSTEVARQALEHNLLEYAELLLFLLVAMTYISAMEERRLFDALKAWMINRGFNFHTLFWITGWLAFFISPIADNLTTAL
LMCAVVMKVGGENPKFVSLACINIVIAANAGGAFSPFGDITTLMVWQAGHVSFLEFMDLFLPSLANYLVPALVMSLFVPH
QTPSSIQEVVELKRGAKRIVVLFLFTILSAIGFHAFFHFPPVIGMMMGLAYLQFFGYFLRKTLARSLAKKTAIAMAKNDE
AALKRIGSVVPFDVFRSISHAEWDTLLFFYGVVMCVGGLSLLGYLGLVSEILYTEWNPIWANVLVGLLSSVVDNIPVMFA
VLSMQPEMSLGNWLLVTLTAGVGGSLLSIGSAAGVALMGAAHGKYTFLSHLKWTPVILLGYVVSIVLHLLLNHQSFT
>O66163 ~~~nhaD~~~Na(+)/H(+) antiporter NhaD~~~
MRKSKPVLLAAGLIWILIGYTFAQHHQQDVAKAALEHNLLEYAELLLFLLVAMTYINAMEERKLFDALQAWMVGKGFGFK
KLFWLTGFLAFVISPIADNLTTALLMCAVVMKVSGDNPRFVNLACINIVIAANAGGAFSPFGDITTLMVWQAGHVRFSEF
MPLFVPSLINYVVPAFLMALFVPNTKPNTIHEHVELKRGARRIVLLFVLTIATAVSFHAVLHFPPVVGMMMGLAYLQFFG
YFLRKTLKHSLAKKAAMAIANGDDHALKRLGSVVPFDVFHRVSRAEWDTLLFFYGVVMCVGGLSLLGYLELVSNVMYTQW
NPVWANVMVGVLSAIVDNIPVMFAVLTMDPSMSTGNWLLVTLTAGVGGSLLSIGSAAGVALMGAARGQYTFFGHLKWTPV
IALGYAPVLPLICG
>Q9LCB5 ~~~nhaG~~~Na(+)/H(+) antiporter NhaG~~~
MEHLDLHHIFELGFFVVMIAAGITAIAKKCRQPYPIALVIVGTIIGLVHIPLFEPLKEFITEGEVFNFVIITLFLPALLG
EAALKLPFSHLRENKRPVLALFGGTLISFLIVGFSSMWLMHLAIPAAFVFAALMSATDPVSVLSIFKSVGAPKKLSIVVE
GESLFNDGLAVVLFNISAFYLMTYLDLGIQGAGLGLWEFVKVISLGLIIGGVLGYVFSQLTKYFDDYPLEIIFSIILFYS
SFLLAEMAGASGVIAVVVAALIFGNYGAKIGMSPTTKLNINNFWDVAALLANSLVFLMVGLEITRIDLTDKWGLAIMAIV
IVLIARSAAVYISLAFIKKFPVTWKHTINWGGLKGSLSIALVLSLPRDFPGREDILVFAFSVVLFSLVVQGLTIKPLLER
LGVNQKEEGNQEYEELLAKGHRLETAIKEVQQVKHNLLIHEAVSSELTDQYKKEVSQLHQQTNKLFETYPELKNKQQTIL
KKHSLYAQYQAIENLSREDIISNEVAELEQARIIDEIVRLQNDH
>B0LJC9 ~~~nhaH~~~Na(+)/H(+) antiporter NhaH~~~
MHGFHDVFIQILLLLAISVTVIAIAKLLKEPDSVALVLVGLVLGLTQLPFIEEAESYITQSEVFQATVISLFLPILLGDA
TLKLPFHHLFSQKKTVMGLAFLGTFISSLTIGAASYFLLDLPLAVAFTFAALMSATDPISVLSIFKSLGVKQKMSTIMEG
ESLFNDGIAVVLFKIASIYLLTYIEMGWAGLGSGVFMFLKFAVGGALVGLILGYVFSQVIRVYDDYPLEVAFSALLFFGS
YFIAEHFHTSGVIAVVVGGFVFGDYGAKIGMSEETKTNLNTFWDSVTLLANALIFLMVGLEIRNIDLAGNWGYIVGAIAI
VLVGRTIAVYIGTGWIKELSSKERILINWGGLRGSLSIALALSLPMDFDGREQVLLLTFSVVLFSLIVQGLTLKPVIKKL
GLA
>Q2XWL3 ~~~nhaH~~~Na(+)/H(+) antiporter NhaH~~~COG0025
MHGFHDVFIQILLLLAISVSVIAIAKLLKEPDSIALVLVGLVLGLTELPIIEDAERYITQSEVFQATIISLFLPILLGDA
TLKLPFHHLFSQKKTVLGLAFVGTFVSSICIGTAAYFLLDLPLAVAFTFAALMSATDPISVLSIFKSLGVPQKMSTVMEG
ESLFNDGIAVVLFKIASIYLLTYMEMGWAGLGSGVFLFLKFAIGGALVGLVLGYFFSQVIRVFDDYPLEVAFSALLFFGS
YFIAEHFHTSGVIAVVVGGFVFGDYGAKIGMSKETKTNINTFWDSVTLIANALIFLMVGLEIRNIDLAGNWGVIVGAILI
VLVGRTIAVYLGTGWVQELSSKERLLINWGGLRGSLSVALALSLPMDFAGRDQVLLLTFSVVLFSLIVQGLTLKPLIKKL
GMI
>O32212 ~~~nhaK~~~Sodium, potassium, lithium and rubidium/H(+) antiporter~~~COG0025
MDIFLVVLVLLTIIAISNIVNRFIPFIPVPLIQVALGILAASFPQGLHFELNTELFFVLFIAPLLFNDGKRTPRAELWNL
RAPILLLALGLVFATVIVGGYTIHWMIPAIPLAAAFGLAAILSPTDVVAVSALSGRVKMPKGILRLLEGEGLMNDASGLV
AFKFAIAAAVTGAFSLAQAAVSFVFISLGGLLCGVVISFLIIRFRLFLRRLGMQDVTMHMLIQILTPFVIYLAAEEIGVS
GILAVVAGGITHAVEQDRLESTMIKLQIVSSSTWNIILFILNGLVFVILGTQIPDVISVIFNDTAISNMKVIGYILVITF
TLMLLRFLWVLFFWNGKWFFNKDQNIYKPGLRSTLLISISGVRGAVTLAGSFSIPYFLEDGTPFPERNLILFLAAGVILC
TLVIATVVLPILTEKEEEDEERNKKLLTARRKLIKTALQTIKEDMNETNKTASLAVIAEYNEKMKNLRFQQYTSSNRIKK
HERKVRAQGVKAEQEALMKMLERGDIPEETANVLQERFNELEILYANPFKVGLSKTRLKRLMYWIFFGEHKKPEMSILNE
AGLIRATRVKTAKAAIEYLEKHKTDEHKEVFLSVITFYKQLIFRLEHSHHELKSSAHFENQKLEVKLKAVQAIRNEIQTL
FEEREISRDISHELRQYINDVEAAMLEGGE
>P76007 ~~~cvrA~~~K(+)/H(+) antiporter NhaP2~~~COG3263
MDATTIISLFILGSILVTSSILLSSFSSRLGIPILVIFLAIGMLAGVDGVGGIPFDNYPFAYMVSNLALAIILLDGGMRT
QASSFRVALGPALSLATLGVLITSGLTGMMAAWLFNLDLIEGLLIGAIVGSTDAAAVFSLLGGKGLNERVGSTLEIESGS
NDPMAVFLTITLIAMIQHHESNISWMFIVDILQQFGLGIVIGLGGGYLLLQMINRIALPAGLYPLLALSGGILIFSLTTA
LEGSGILAVYLCGFLLGNRPIRNRYGILQNFDGLAWLAQIAMFLVLGLLVNPSDLLPIAIPALILSAWMIFFARPLSVFA
GLLPFRGFNLRERVFISWVGLRGAVPIILAVFPMMAGLENARLFFNVAFFVVLVSLLLQGTSLSWAAKKAKVVVPPVGRP
VSRVGLDIHPENPWEQFVYQLSADKWCVGAALRDLHMPKETRIAALFRDNQLLHPTGSTRLREGDVLCVIGRERDLPALG
KLFSQSPPVALDQRFFGDFILEASAKYADVALIYGLEDGREYRDKQQTLGEIVQQLLGAAPVVGDQVEFAGMIWTVAEKE
DNEVLKIGVRVAEEEAES
>A5F4U3 ~~~nhaP2~~~K(+)/H(+) antiporter NhaP2~~~COG3263
MDAVTINSFFMIGALLIGISVLLSPVSSKLGIPILLVFLAVGMLAGEDGIGQIAFDNYPVAYLVSNLALAIILLDGGMRT
RVASFRVAFWPSVSLATLGVAVTTLLTGLLAMWLFNLSLLQGVLVGAIVGSTDAAAVFSLLKGRSLNERVGATLEIESGT
NDPMAVFLTVTLIAVLGSAETNLSAGFLLLSFAQQFGVGALLGLAGGWILWWLINRNQLPEGLYSILAVSGGLMIFALSN
ALGGSGILSIYLTGLLLGNRPTRSRHAILNVLDGMTWLAQIGMFLVLGLLVTPSELMEIALPGLALAVGMILFARPIAVW
IGLAPFKSFTAREKWFVSWVGLRGAVPIILAVFPMMAGLPNAQLYFNLAFFVVMVSLVVQGGTLTKAMSLAKVELPPKPE
PISRTGVEIYPTSEWELFIYKLKADKWCIGEPLRNLFMPEGTRIAAVFRDNQLLHPSGSTELCEGDTLCVMAQERDLESL
SRLFSEAPEKASLARFFGDFFLDIEAKLQDVALLYGLDLGELEADAKLKDLVLEHLGETPVLGDYFEWHGLQWVVADVVD
WKVTKIGLRLPPEEELQEGAE
>Q87KV8 ~~~nhaP2~~~K(+)/H(+) antiporter NhaP2~~~COG3263
MDADTINSFFLIGALLIALSVLLSPVSSKLGIPILLVFLAVGMLAGEDGLGGILFDNYSIAYLVSNLALAIILLDGGMRT
RVASFRVALWPSVSLATIGVAITTLLTGLMATWLFDLDLLQGILVGAIVGSTDAAAVFSLLKGRSLNERVGSTLEIESGT
NDPMAVFLTVTLIAILSSTGTGLSAGFLALSFVKQFGIGALLGFAGGWVLWKVINRNQLPDGLYSILTVSGGLIIFALSN
SLGGSGILSIYLVGLLLGNRPTRSRHSILHVLDGMTWLAQIGMFLVLGLLVTPSNLLSIAVPGLALAFGMILFARPISVW
IGLLPFKSFTPREKWFVSWVGLRGAVPIILAVFPMMAGLPDAQLYFNLAFFVVMVSLIVQGGTLTKAMSLAKVELPPKPE
PISRTGVEIYPTSEWELFIYRLKADKWCIGEPLRSLSMPEGTRIAAVFRNQELLHPSGSTRLEEDDTLCVLAQEKDLAAL
SLLFSEAPEKASLTRFFGDFFLDIEVKLADVAMMYGLNLGYELQDKTLSNIVEEQLGSTPVLGDQFEWQGLQWVIADVVD
HQVTKVGLRLPNEEEEGEEED
>Q0ZAH6 ~~~nhaP~~~K(+)/H(+) antiporter NhaP~~~
MEAINLTILVIGVLFLISIVATLISSRIGAPILLVFLIIGMLAGEQGLGGITFNNPQVAFLIGSIALVIILFDGGMRTHP
ERFRVALAPAAMLATLGVVVTCTVTGLAAAWILGLHWLQGLLLGAILSSTDAAAVFSIFQSRGIRIKDRVASTLEIESGS
NDPMAVMLTITLVGVLAEYTALDWSVLIVFLKQAIIGGAVGYGAGRLFVFLCRKLPLSFAFFPLMAVACCISVYAVTTQF
EGSGFLAVYLMGYFVGNARLPQVLYILRVHDGLAWLSQIVMFLMLGLLVVPSQLLDHLLPALAIAGVLIFIARPLAVLLS
LIPFHFPAKDQLFISWVGLRGAVPIILALFPWLAGVPDEHLYFNVAFVIVIVSLVFQGWSISPVARWLKLEVPKESGPDQ
TMPLDAIASNEVIEVVSFTLKGDSPMLDKQWQDFTVPHSAEFLGVIRDGEWLLSRDNPVFKLKDSVLVLCKMADVPDIST
VLASAASSRTMTASDFFGDFVLNAQITLDELDAFYSITLPEHESHVTLADYITERFHRRVVVGDQVKLDALVLTVRQLDD
HGNVKLVGIKPSDS
>Q93HU4 ~~~apnhaP~~~Na(+)/H(+) antiporter ApNhaP~~~
MTIEAAMGEEAIKENLEQFLIVLSVSLGVATLSQISSFFRQIPYTLLLVIVGLGLAFVDIRLVNLSPELILEIFLPPLLF
EAAWNIRWRNLKKNLFPVVLLAIIGVVISVVGIGFSLNYFSGLSLPIALLVGAILAATDPVSVIALFRELGVGERLTVLM
EGESLFNDGVAVVAFSLLVGIPLGTQEFSVTNTLIQFVTLQGIGIGCGGVIGFGISYLTQRFDLPLVEQSLTLVSAYGTY
LITEELGGSGVIGVVTVGLILGNFGSRIGMNPRTRLLVSEFWEFIAFFVNSIVFLLIGDQINIRGLADNGQLILITIIAL
VIIRAISIYGLGTISNLITKQDISWQEETVLWWGGLRGSVSIALALSVPVMLDGRQDIIEAVFGVVLFTLLVQGLTMQTV
IEKLGLIGDRAQRRTYSELIARRSALERVLAHLNAVPPSPSIRMKSLKTTKEASKGQLESRQTKKLRSYNNSYPQLRSLE
QEQLRELTFKVEADTYAELIRAGKLNNNLSPLLQEVLAKPE
>G3XD29 ~~~nhaP~~~Na(+)/H(+) antiporter NhaP~~~
MLDLVAAFIALTTLLTYVNYRFIRLPPTIGVMATALVFSLIVQGLSELGYPILEVEMQEIIRRIDFSEVLMTWFLPALLF
AGALHVDLSDLRSYKWPIGLLATAGVLIATFVIGGLAYYTFPLFGWQVDFIYCLLFGALISPTDPIAVLGILKSAGAPKP
LATTIVGESLFNDGTAVVVFAIILGILQLGEAPTVSATAILFVQEAIGGVVFGAVLGYGVFVMMRGIDQYQVEVMLTLAL
VIGGAALAARLHVSAPIAMVVAGLIIGNHGRHYAMSDETRRYVDKFWELIDEILNALLFALIGLELLLLPFSWLHVAAAF
ALGGAVLVSRLLTVGPAILVLRRFRGANRQVPAGTIRILVWGGLRGGVSVALALSLPLGPERDLILSLTYIVVLVSILLQ
GLSIGPLVRRIYAGQPLEKSEGAH
>P0A9G2 ~~~nhaR~~~Transcriptional activator protein NhaR~~~COG0583
MSMSHINYNHLYYFWHVYKEGSVVGAAEALYLTPQTITGQIRALEERLQGKLFKRKGRGLEPSELGELVYRYADKMFTLS
QEMLDIVNYRKESNLLFDVGVADALSKRLVSSVLNAAVVEGEPIHLRCFESTHEMLLEQLSQHKLDMIISDCPIDSTQQE
GLFSVRIGECGVSFWCTNPPPEKPFPACLEERRLLIPGRRSMLGRKLLNWFNSQGLNVEILGEFDDAALMKAFGAMHNAI
FVAPTLYAYDFYADKTVVEIGRVENVMEEYHAIFAERMIQHPAVQRICNTDYSALFSPAVR
>P73863 ~~~nhaS1~~~Low-affinity Na(+)/H(+) antiporter NhaS1~~~COG0025
MDTAVNESLSISYNLEQFLIVLSVSLSIATLSKTVPILRKIPYTLLLVIVGMALAFVDVKLINLSPELIMEIFLPPLLFE
AAWNLQWRNLKENWFPITLFATLGVVICVVGIAFPLSYWGGMELAIAFLAAAALSATDPVSVIALFKELGASKKLNTLME
GESLFNDGVAVVVFLILVGIPLGTSTFDLSVTLARFVTVIGIGVGCGLVIGFSLSLLTQRFDLPFVEQSLTLVSAYGAYI
LAENLGGSGVIGVVVVGMVLGNYGSRIGMNPRTRLIVSIFWEFVAFFVNSIIFLLIGDQIGLSSLSDHLNLILIAIAAVV
VTRLVSVFGLSLISNKVSDQISSTHITLQEQTVLWWGGLRGSVAIAVALSVPQAIAERQAIIDIVFGVVLFTLLVQGLTT
QFVLKGLDLIGDQPQRLEYAELVSRQIALRRVLAELEKTDEFPDINPERLRYKQELVQGQLQSVTDKLKLLLQEYPLLQE
VANKKFDQTVLDIEAETYADLIRMGRLEENIMPLLVTLEGENVAEPS
>P74393 ~~~nhaS2~~~Na(+)/H(+) antiporter NhaS2~~~COG0025
MIKLPVLLADINIQSLPTEPELILNNLAITTLVENLIILLLVATLVALVARWLKIPYVIGLVLAGLAIPRGLSVGLNPEL
ILNFFLPILIFEAAINTDISRLRSTIKPITVLAGPGVVISAAITAVLLKIGLGLAWVTAAGVSVILTITDTVSVIAAFRS
VPVPRRLATIVEGESMLNDGVAMVLLSVITTIHIQGGFSAGEGIRQIFVAFVGGGLVGLGLGYLCVGLFRQLNDDLSDIL
LTVSVSLGTFQIGQMLGVSSAIAVVVAGLVIGNLALKQTSASIKVTLLSFWEYAGFGVNTLIFLLVGIEVYPSILLSTIP
AALIAIVAYQIGRVFSIYPLLYLLSFFDRPLPLRWQHVLIAGNVKGSLSMALALALPLTLPGRDQVVTLVFSTVMVSLIG
QGLSLPWVVKKLQLSKPSPLAQKIAMLQLNLVTAKAAQGELKYLLEAGSLPKFLYEELFADYQARIANSEQELREFYNQR
NLIFSEGEVEKKYIDGLYRRLYIAEKSAINDALAKGILADDISDEYLQVLNEKLLALQDD
>Q55190 ~~~nhaS3~~~High-affinity Na(+)/H(+) antiporter NhaS3~~~COG0475
MFMNPLLPPLWPMIATAVETETEIAPLVLAGVLLSLVVIYFASKLGGEVCLRLNLPPVLGELVGGVLVGVSALKLLLFPE
GGLAPEDSLVIQLLMGSADLSPEAAQSVFSAQSEVISVISELGVIILLFEIGLESNLKELIRVGPQAAIVAVVGVVTPFS
LGTIGLMTIFGVAAIPAIFAGAALTATSIGITAKVLAEINRLSSNEGQIIIGAAVLDDILGIIVLAVVGSLVKTGEIQIS
NIIYLILSATGFVVGSILIGRLLSPFYVSLVNRMKTRGQLLLVSICVAFVLSYIAQIVQLEAILGSFAAGLILAETEKRE
DLEEQILPLADFFVPVFFVCVGAKTDVSVLNPAVPANREGLIIAAFLILVAIVGKVVTGFTLFGKSELNKLAIGVGMIPR
GEVGLVFAGVGAASGALDPATDAAIIVMVIVTTFVAPPWLRAVFEGAKKEEAPEKPVPTPD
>O07552 ~~~nhaX~~~Stress response protein NhaX~~~COG0589
MFHADRIIVAFDGSENSKKALLTAIDLAKTVNAAITVAHSHDMKDNQTVIDPPRPAAEASYISGGMTSVPDPLISDVTSP
EPMIYEDRTEEVIAEARMMLNEQQADGDIDILEGDPAESIIEHANRISADMIVTGSRDQNRLKKLIFGSVSEKLSAKSDI
PVLIVK
>P21220 4.2.1.84~~~nhhB~~~High-molecular weight cobalt-containing nitrile hydratase subunit beta~~~
MDGIHDTGGMTGYGPVPYQKDEPFFHYEWEGRTLSILTWMHLKGISWWDKSRFFRESMGNENYVNEIRNSYYTHWLSAAE
RILVADKIITEEERKHRVQEILEGRYTDRKPSRKFDPAQIEKAIERLHEPHSLALPGAEPSFSLGDKIKVKSMNPLGHTR
CPKYVRNKIGEIVAYHGCQIYPESSSAGLGDDPRPLYTVAFSAQELWGDDGNGKDVVCVDLWEPYLISA
>P29379 4.2.1.84~~~~~~Low-molecular weight cobalt-containing nitrile hydratase subunit beta~~~
MDGIHDLGGRAGLGPIKPESDEPVFHSDWERSVLTMFPAMALAGAFNLDQFRGAMEQIPPHDYLTSQYYEHWMHAMIHHG
IEAGIFDSDELDRRTQYYMDHPDDTTPTRQDPQLVETISQLITHGADYRRPTDTEAAFAVGDKVIVRSDASPNTHTRRAG
YVRGRVGEVVATHGAYVFPDTNALGAGESPEHLYTVRFSATELWGEPAAPNVVNHIDVFEPYLLPA
>Q7DD37 ~~~nhba~~~Neisserial heparin binding antigen~~~
MFKRSVIAMACIFALSACGGGGGGSPDVKSADTLSKPAAPVVSEKETEAKEDAPQAGSQGQGAPSAQGSQDMAAVSEENT
GNGGAVTADNPKNEDEVAQNDMPQNAAGTDSSTPNHTPDPNMLAGNMENQATDAGESSQPANQPDMANAADGMQGDDPSA
GGQNAGNTAAQGANQAGNNQAAGSSDPIPASNPAPANGGSNFGRVDLANGVLIDGPSQNITLTHCKGDSCSGNNFLDEEV
QLKSEFEKLSDADKISNYKKDGKNDKFVGLVADSVQMKGINQYIIFYKPKPTSFARFRRSARSRRSLPAEMPLIPVNQAD
TLIVDGEAVSLTGHSGNIFAPEGNYRYLTYGAEKLPGGSYALRVQGEPAKGEMLAGAAVYNGEVLHFHTENGRPYPTRGR
FAAKVDFGSKSVDGIIDSGDDLHMGTQKFKAAIDGNGFKGTWTENGSGDVSGKFYGPAGEEVAGKYSYRPTDAEKGGFGV
FAGKKEQD
>Q9JPP1 ~~~nhba~~~Neisserial heparin binding antigen~~~
MFERSVIAMACIFALSACGGGGGGSPDVKSADTLSKPAAPVVAEKETEVKEDAPQAGSQGQGAPSTQGSQDMAAVSAENT
GNGGAATTDKPKNEDEGPQNDMLQNSAESANQTGNNQPADSSDSAPASNPAPANGGSNFGRVDLANGVLIDGPSQNITLT
HCKGDSCNGDNLLDEEAPSKSEFENLNESERIEKYKKDGKSDKFTNLVATAVQANGTNKYVIIYKDKSASSSFARFRRSA
RSRRSLPAEMPLIPVNQADTLIVDGEAVSLTGHSGNIFAPEGNYRYLTYGAEKLPGGSYALRVQGEPAKGEMLAGTAVYN
GEVLHFHTENGRPYPTRGRFAAKVDFGSKSVDGIIDSGDDLHMGTQKFKAAIDGNGFKGTWTENGGGDVSGRFYGPAGEE
VAGKYSYRPTDAEKGGFGVFAGKKEQD
>E6MZW7 ~~~nhba~~~Neisserial heparin binding antigen~~~
MKEMMMFKRSVIAMACIFALSACGGGGGGSPDVKSADTLSKPAAPVVSEKETEAKEDAPQAGSQGQGAPSAQGSQDMAAV
SEENTGNGGAVTADNPKNEDEVAQNDMPQNAAGTDSSTPNHTPDPNMLAGNMENQATDAGESSQPANQPDMANAADGMQG
DDPSAGGQNAGNTAAQGANQAGNNQAAGSSDPIPASNPAPANGGSNFGRVDLANGVLIDGPSQNITLTHCKGDSCSGNNF
LDEEVQLKSEFEKLSDADKISNYKKDGKNDKFVGLVADSVQMKGINQYIIFYKPKPTSFARFRRSARSRRSLPAEMPLIP
VNQADTLIVDGEAVSLTGHSGNIFAPEGNYRYLTYGAEKLPGGSYALRVQGEPAKGEMLAGAAVYNGEVLHFHTENGRPY
PTRGRFAAKVDFGSKSVDGIIDSGDDLHMGTQKFKAAIDGNGFKGTWTENGSGDVSGKFYGPAGEEVAGKYSYRPTDAEK
GGFGVFAGKKEQD
>P23262 1.14.13.1~~~nahG~~~Salicylate hydroxylase~~~
MKNNKLGLRIGIVGGGISGVALALELCRYSHIQVQLFEAAPAFGEVGAGVSFGPNAVRAIVGLGLGEAYLQVADRTSEPW
EDVWFEWRRGSDASYLGATIAPGVGQSSVHRADFIDALVTHLPEGIAQFGKRATQVEQQGGEVQVLFTDGTEYRCDLLIG
ADGIKSALRSHVLEGQGLAPQVPRFSGTCAYRGMVDSLHLREAYRAHGIDEHLVDVPQMYLGLDGHILTFPVRNGGIINV
VAFISDRSEPKPTWPADAPWVREASQREMLDAFAGWGDAARALLECIPAPTLWALHDLAELPGYVHGRVVLIGDAAHAML
PHQGAGAGQGLEDAYFLARLLGDTQADAGNLAELLEAYDDLRRPRACRVQQTSWETGELYELRDPVVGANEQLLGENLAT
RFDWLWNHDLDTDLAEARARLGWEHGGGGALRQG
>Q7DDJ2 ~~~nhhA~~~Autotransporter adhesin NhhA~~~
MNKIYRIIWNSALNAWVVVSELTRNHTKRASATVKTAVLATLLFATVQASANNEEQEEDLYLDPVQRTVAVLIVNSDKEG
TGEKEKVEENSDWAVYFNEKGVLTAREITLKAGDNLKIKQNGTNFTYSLKKDLTDLTSVGTEKLSFSANGNKVNITSDTK
GLNFAKETAGTNGDTTVHLNGIGSTLTDTLLNTGATTNVTNDNVTDDEKKRAASVKDVLNAGWNIKGVKPGTTASDNVDF
VRTYDTVEFLSADTKTTTVNVESKDNGKKTEVKIGAKTSVIKEKDGKLVTGKDKGENGSSTDEGEGLVTAKEVIDAVNKA
GWRMKTTTANGQTGQADKFETVTSGTNVTFASGKGTTATVSKDDQGNITVMYDVNVGDALNVNQLQNSGWNLDSKAVAGS
SGKVISGNVSPSKGKMDETVNINAGNNIEITRNGKNIDIATSMTPQFSSVSLGAGADAPTLSVDGDALNVGSKKDNKPVR
ITNVAPGVKEGDVTNVAQLKGVAQNLNNRIDNVDGNARAGIAQAIATAGLVQAYLPGKSMMAIGGGTYRGEAGYAIGYSS
ISDGGNWIIKGTASGNSRGHFGASASVGYQW
>P96454 ~~~nhlF~~~Cobalt transport protein NhlF~~~
MTSTTITPHHIGGAWTRTERRRLASVVGAIVILHVLGVALYLGYSGNPAAAGGLAGSGVLAYVLGVRHAFDADHIAAIDD
TTRLMLLRGRRPVGVGFFFAMGHSTVVVVLALVVALGASALTTTELEGVQEIGGLVATVVAVTFLSIVAGLNSVVLRNLL
CLSRQVRAGSDITGDLESRLSERGLFTRLLGNRWRGLVRSSWHMYPVGLLMGLGLETASEVTLLTLTASAATGGTLSIAA
VLSLPLLFAAGMSTFDTADSLFMTRAYSWSYQDPQRRLNFNIATTGATVVIGLFVAGIYVCALLAHLPMFAALSPIGDIS
ENFEFLGYAVAAAFILTWTGALLFNHLKPQRN
>P77567 2.3.1.5~~~nhoA~~~Arylamine N-acetyltransferase~~~COG2162
MTPILNHYFARINWSGAAAVNIDTLRALHLKHNCTIPFENLDVLLPREIQLDNQSPEEKLVIARRGGYCFEQNGVFERVL
RELGFNVRSLLGRVVLSNPPALPPRTHRLLLVELEEEKWIADVGFGGQTLTAPIRLVSDLVQTTPHGEYRLLQEGDDWVL
QFNHHQHWQSMYRFDLCEQQQSDYVMGNFWSAHWPQSHFRHHLLMCRHLPDGGKLTLTNFHFTHYENGHAVEQRNLPDVA
SLYAVMQEQFGLGVDDAKHGFTVDELALVMAAFDTHPEAGK
>Q00267 2.3.1.118~~~nhoA~~~Arylamine N-acetyltransferase / N-hydroxyarylamine O-acetyltransferase~~~
MTSFLHAYFTRLHCQPLGVPTVEALRTLHLAHNCAIPFENLDVLLPREIQLDETALEEKLLYARRGGYCFELNGLFERAL
RDIGFNVRSLLGRVILSHPASLPPRTHRLLLVDVEDEQWIADVGFGGQTLTAPLRLQAEIAQQTPHGEYRLMQEGSTWIL
QFRHHEHWQSMYCFDLGVQQQSDHVMGNFWSAHWPQSHFRHHLLMCRHLPDGGKLTLTNFHFTRYHQGHAVEQVNVPDVP
SLYQLLQQQFGLGVNDVKHGFTEAELAAVMAAFDTHPEAGK
>Q6FFF7 ~~~niaP~~~Niacin transporter NiaP~~~COG2814
MDLVSRIENLPIGKFHYTLLWVVGLGWMFDALDTGIIAFIMTTLVKDWALTPAESGWIVSIGFIGMALGAVFSGGLADRF
GRKTVFATTLLIYSLATAACAFAPNLTWLLAFRFIVGLGLGGQLPVAVTLVSEYIPAHVRGRFIVLLESFWGLGWLVAAL
VSYFVIPHFGWHIAFLIGGLPAIYVYVIIKKVPESIPYLINRGRIDEAHELVQQIERHAGVPVIDTIVVKPVAQKQQVSF
RQLWSGRFARRSLMLWLVWFGIVFSYYGIFTWLPSLLVKQGYSVVQSFEYVLIMILAQLPGYISAAWLVERLGRKATLAG
FIGACAISAYFFGQADTVFNIMVWGCLLSFFNLGAWGVLYTYTPEQYPANIRAFGAGWASAVGRMGGIAAPIVVTHMMVA
HDGFHQVFMMFTLVLLAVAAVIVILGEETQGKTLESIGL
>O34691 ~~~niaP~~~Putative niacin/nicotinamide transporter NiaP~~~COG0477
MGKQQPISQRKLLGVAGLGWLFDAMDVGILSFIIAALHVEWNLSPEEMKWIGSVNSIGMAAGAFLFGLLADRIGRKKVFI
ITLLCFSIGSGISAFVTSLSAFLILRFVIGMGLGGELPVASTLVSEAVVPEKRGRVIVLLESFWAVGWLAAALISYFVIP
SFGWQAALLLTALTAFYALYLRTSLPDSPKYESLSAKKRSMWENVKSVWARQYIRPTVMLSIVWFCVVFSYYGMFLWLPS
VMLLKGFSMIQSFEYVLLMTLAQLPGYFSAAWLIEKAGRKWILVVYLIGTAGSAYFFGTADSLSLLLTAGVLLSFFNLGA
WGVLYAYTPEQYPTAIRATGSGTTAAFGRIGGIFGPLLVGTLAARHISFSVIFSIFCIAILLAVACILIMGKETKQTELE
>Q9X1T8 ~~~niaR~~~Probable transcription repressor NiaR~~~COG1827
MHMKTVRQERLKSIVRILERSKEPVSGAQLAEELSVSRQVIVQDIAYLRSLGYNIVATPRGYVLAGGKSGVSRLVAVKHA
PEEIKEELLCVVRNGGRIVDVIVEHPVYGEIRGIIDVSSEEEVLKFVNLMEMAKTEPLLTLSGGVHLHTIEAPDEETMER
IMRELKKKGFLIEEG
>A2RKV5 ~~~niaX~~~Niacin transporter NiaX~~~
MRNHDCPTVRFFKARVYKEKETQMTQTKKAKVRNLIIAAMLTALGILIPMMMPVKLIIGPASFTLAAHVPVMAAMFFSPL
MTAFVALGTTLGFMISIPVPTIWLRALMHLPVMTVGAYVLKKYPEFVHQKVKIQIFNFILGIFHAGLETLVVYAFYSLGF
ANIEQGALLNFLLLIALGGLVHSMIDFNLALGLGNVLSKAFPIDIFDKAKNLVNKKKVKAEI
>Q88FX9 1.17.2.1~~~nicA~~~Nicotinate dehydrogenase subunit A~~~COG2080
MQTTISLQVNGQPVEVSAMPDTPLLLILRNDLCLNGPKYGCGLGECGACTVIIDGVAARSCVIPLAGAAGRNITTLEGLG
SKAAPHPVQQAFIDEQAAQCGYCMNGMIMTAKALLDRIPEPSDEQIRNELSANLCRCGTHVEILRAVRRAAETRRKP
>Q88FX8 1.17.2.1~~~nicB~~~Nicotinate dehydrogenase subunit B~~~COG1529
MNHSQQVPSRDQLLAKTGVLLIVDQITPPSGPVAKGVTPTVKERELALFIAVSDDGMVYAFNGHVDLGTGIRTSLAQIVA
EELDLRMDQVHMVLGDTERAPNQGATIASATLQISAVPLRKAAATARRYLLQQAALRLGCPPEMLRIEDGTVIASNGSTL
SFAELVQGKNHQLHIADDAPLKAIEDYRLVGRSAPRVDIPGKATGELTYVHDMRLPNMLHGRVIRPPYAGHDSGDFVGNS
LLAVDESSIAHLPGVVAVVVIRDFVGVVAEREEQAIRAAHELKVSWKPFTGKLPDLSDVAQAIRDNPRVQRTVLDQGDVD
GGIANASQRLSRSYLWPYQLHASIGPSCALADFTAGQIRVWSGTQNPHLLRADLAWLLACDEARIEIIRMEAAGCYGRNC
ADDVCADAVLLSRAVQRPVRVQLTREQEHVWEPKGTAQLMEIDGGLNADGSVAAYDFQTSYPSNGAPTLALLLTGAVEPV
PALFEMGDRTSIPPYDYEHMRVTINDMTPLVRASWMRGVSAMPNSFAHESYIDELAFAAGVDPVEYRLKHLSDPRAIDLV
KATAERAQWQPHTRPMQTQAEGDVLRGRGFAYARYIHSKFPGFGAAWAAWVADVAVDRRTGEVAVTRVVIGHDAGMMVNP
EGVRHQIHGNVIQSTSRVLKEQVSFEESTVASKEWGGYPILTFPELPAIDVMMLPRQHEPPMGSGESASVPSAAAIANAI
FDATGIRFRELPITAERVRAALGGEGQGPDAPAPAQPSTKRSKWWFGSLAGVFGAALGMLATALPWRAEIAPVTPPGVGS
WSAAMLERGRQVAAAGDCAVCHTVSGGKANAGGLAMDTPFGTLYSTNITPDPETGIGRWSFAAFERAMREGISRDGRHLY
PAFPYTSFRNINDADMQALYAYLMSQTPVRQEAPANQMRFPFNQRPLMAGWNARFLQRGEYQPDPQRSAQWNRGAYLVDG
LGHCTACHSPRNLMGAEKGGSSYLAGGMVDGWEAPALNALGKSSTPWSEDELFNYLSTGFSEKHGVAAGPMGPVVSELAT
LPKSDVRAIAHYLSSLEGEPQALAANAAPQVDTHVSLSNGERVFKGACLGCHSDGLGPKLFGVSPSMAVNSNVHSDLPDN
LLRVVLHGIPTPATRDLGYMPGFKDSLSDRQVADLAAYLRHRFAADKPAWQGLASKAAQVRANPGSH
>Q88FY3 3.5.1.106~~~nicD~~~N-formylmaleamate deformylase~~~COG0596
MSTFVAGGNVSANGIRQHYLRYGGKGHALILVPGITSPAITWGFVAERLGHYFDTYVLDVRGRGLSSSGPDLDYGTDACA
ADIPAFAAALGLDSYHLLGHSMGARFAIRAAAQGAPGLQRLVLVDPPVSGPGRRAYPSKLPWYVDSIRQATVGMSGDDMR
AFCATWSDEQLALRAEWLHTCYEPAIVRAFDDFHEVDIHQYLPAVRQPALLMVAGRGGVIEPRDIAEMRELKPDIQVAYV
DNAGHMIPWDDLDGFFAAFGDFLDHPLV
>Q88FY5 3.5.1.107~~~nicF~~~Maleamate amidohydrolase~~~COG1335
MSDAQSARDNYQGVWGQRIGFGRKPALLMIDFMQGYTTPGAPLYAPGVVAAVEQAAGLLALARDCGTLVVHTNIRYQPPH
FADGGVWVRKAPVMKDMVEGNPLAAFCEAVAPQAGEVVLSKQYASAFFATSLAPLLHAQGVDTVVLAGCSTSGCIRASAV
DAMQHGFRTIVVRECVGDRHSDPHEANLFDIDSKYGDVVTRQDAMQQLRHLAG
>Q8YS92 ~~~~~~DNA nickase~~~COG5592
MVSTLDDTKRNAIAEKLADAKLLQELIIENQERFLRESTDNEISNRIRDFLEDDRKNLGIIETVIVQYGIQKEPRQTVRE
MVDQVRQLMQGSQLNFFEKVAQHELLKHKQVMSGLLVHKAAQKVGADVLAAIGPLNTVNFENRAHQEQLKGILEILGVRE
LTGQDADQGIWGRVQDAIAAFSGAVGSAVTQGSDKQDMNIQDVIRMDHNKVNILFTELQQSNDPQKIQEYFGQIYKDLTA
HAEAEEEVLYPRVRSFYGEGDTQELYDEQSEMKRLLEQIKAISPSAPEFKDRVRQLADIVMDHVRQEESTLFAAIRNNLS
SEQTEQWATEFKAAKSKIQQRLGGQATGAGV
>Q88FY7 ~~~nicP~~~Porin-like protein NicP~~~
MQGFTAPRFSSSARLAGGCIAAALCAEATAEEAGFIEGARATLQARNYFFSRDYADIKGASTQSRTQEWAQGFILNASSG
YTQGTLGVGVDVTGLLGFKLDSSPEHARSGLLPSLDSGKSADEYSRLGAAIKFKVSQTELKVGELMPNLPVLLFSDLRLL
PPTYQGAMLESREFAGLTLSAGQFRSTSLRDSSNSQKMYALVNDPINPARLARFTSDRFNYVGADYAFNDNRTSVGVWQA
QLEDIYQQRFYSFKHAEPLGSWTLGVNAGYFDAREDGSKVAGDYDNHALFSLFSAKTGGHTFYVGYQQIGGDDGFIQVGA
NTNPMGNTLPTYEFSAPGERSWQVRHDYNFVALGLPGLTSTLRYVKGRDVETGLGFEGRDRERDLDLAYVVQSGPLAGLG
IRVRNVMARSNYRTDIDENRLILSYTIKVF
>Q88FY0 ~~~nicR~~~HTH-type transcriptional repressor NicR~~~COG1846
MSKKTTPSSAPLDNTPYDVTEQVGHLLRKAYQRHTAIFQQQACDPQLTSIQFVTLCALRDHGPSSQAELIKATAVDQATI
RGIVERLKARELVQLSPDPGDRRKVIVELTESGAALLDAMIPCARQISELSMGSLNAGERVAILYLLRKMIDSDENAG
>Q88FX7 ~~~nicS~~~HTH-type transcriptional repressor NicS~~~COG1309
MQTRKTGVRAQQADRTRDNILKAAVKVFSKEGFTGGRIEQISTLAKSNDRMIYYYFGSKEKLFISVLEHIYASFNQAEAK
LRLDLADPEQALRELVAFIWDYYVRHPEFVTILATENLHQGLHARKSQNLRALSGEAVGVLRPIIEAGQAKGLFRDDICI
THAYLMIASLCYFYNSNRHTLSSFLAVDLADKQAKADWLTFISDLALRGLRR
>I6YEJ7 ~~~nicT~~~Nickel transporter NicT~~~COG3376
MASSQLDRQRSRSAKMNRALTAAEWWRLGLMFAVIVALHLVGWLTVTLLVEPARLSLGGKAFGIGVGLTAYTLGLRHAFD
ADHIAAIDNTTRKLMSDGHRPLAVGFFFSLGHSTVVFGLAVMLVTGLKAIVGPVENDSSTLHHYTGLIGTSISGAFLYLI
GILNVIVLVGIVRVFAHLRRGDYDEAELEQQLDNRGLLIRFLGRFTKSLTKSWHMYPVGFLFGLGFDTATEIALLVLAGT
SAAAGLPWYAILCLPVLFAAGMCLLDTIDGSFMNFAYGWAFSSPVRKIYYNITVTGLSVAVALLIGSVELLGLIANQLGW
QGPFWDWLGGLDLNTVGFVVVAMFALTWAIALLVWHYGRVEERWTPAPDRTT
>Q88FY6 ~~~nicT~~~Putative metabolite transport protein NicT~~~COG2271
MPIANATTVHSDIDHGTKALYSKITWRLIPFLCFCYLAAYLDRINVGFAKLQMLEDLQFSTAAYGLGAGLFFVGYIIFEV
PSNLILQRVGAKLWIARIMITWGLLSACTMFVTSTTQFYILRFLLGAAEAGFLPGVLYYLTMWYPTYRRGRIIALFMIGL
PLSSVIGGPISGWIMGHFDQVQGLHGWQWLFLLEAIPSVLLGILTFWALPNHFQQAKWLSADDKAQLAADLAADDAEGKD
SKHSFRDGFFNLKVWMLGGIDFSILLSAYAMGFWMPTFIRDAGVSDTFHIGLLTAIPSLAALAGMLMIGASSDRHRERRW
HIIVPFIIGAIAMASSTLFSQNLVMTVVLFAIASAAIIGAVPVFFSLPATFLKGTAAATGFALACSVANIAGLVSNSLMG
VVTDLTGTSHAALWVFAGCLILSCFLVIALPAKLVNR
>Q88FY1 1.13.11.9~~~nicX~~~2,5-dihydroxypyridine 5,6-dioxygenase~~~COG2309
MPVSNAQLTQMFEHVLKLSRVDETQSVAVLKSHYSDPRTVNAAMEAAQRLKAKVYAVELPAFNHPTAMGNDMTAYCGDTA
LTGNLAAQRALEAADLVVDTMMLLHSPEQEQILKTGTRILLAVEPPEVLARMLPTEDDKRRVLAAETLLKQARSLHVRSK
AGSDFHAPLGQYPAVTEYGYADEPGRWDHWPSGFLFTWPNEDSAEGTLVLDVGDIILPFKNYCRERITLEIEKGFITGIH
GGFEAEYLRDYMKYFNDPEVYGISHIGWGLQPRAQWTAMGLHDRNDGMCMDARAFYGNFLFSTGPNTEVGGKRKTPCHLD
IPLRNCDIYLDDKAVVLAGDVVAPEESRAR
>P30667 ~~~nifA~~~Nif-specific regulatory protein~~~
MPGAMRQSTSNLELLTIYEVSKILGSSLDLQQTLREVLRALAYQLQMHRGRVYLVGEDNVLRLVAANGLSNEAAAQIEFR
DGEGITGRILKTGMPAVVPNLAEEPLFLNRTGGREDLDEQVASLVGVPIKAAGVVVGVLTIDRISDEGPQGHFGSDVRFL
TMVANLIGQTVRLHRTVAEERRFMMRETFRMQKELRPVAAPINDVVCTSPNMLEVMAQVHRVAPFKSTVLIRGESGTGKE
LIARAIHNMSPRKDAPFIRVNCAALPESLLESELFGHEKGAFTGAQKDHKGRFELASGGTLFLDEIGDISPNFQAKLLRV
LQEQEFERVGGSKTIKTDVRLICATNLNLEEAVGHGKFRADLYFRINVVTIHLPPLRERRQDIGPLARHFVAKFAKDNGM
ALVMEDEALEVLNRCTWPGNVRELENCIERAATQSRDGIIRTESLSCSLNLCNSSVLFQYRTLGASVGGLAPSMGPGAIN
RVPPGRPGGPAAANAPKTPAMPAPVPEPAGAAAARGRPARRVVPRPLAGLRRRPAGGSGPPDPACPCPSRAPLPPQAPPP
SPAAAPPPAAEVPLDEPESGSLRDRLLWAMERTGWVQAKAARLLGMTTRQVSYALRKYNIEIKRF
>P09133 ~~~nifA~~~Nif-specific regulatory protein~~~COG3604
MPMTDAFQVRVPRVSSSTAGDIAASSITTRGALPRPGGMPVSMSRGTSPEVALIGVYEISKILTAPRRLEVTLANVVNVL
SSMLQMRHGMICILDSEGDPDMVATTGWTPEMAGQIRAHVPQKAIDQIVATQMPLVVQDVTADPLFAGHEDLFGPPEEAT
VSFIGVPIKADHHVMGTLSIDRIWDGTARFRFDEDVRFLTMVANLVGQTVRLHKLVASDRDRLIAQTHRLEKALREEKSG
AEPEVAEAANGSAMGIVGDSPLVKRLIATAQVVARSNSTVLLRGESGTGKELFARAIHELSPRKGKPFVKVNCAALPESV
LESELFGHEKGAFTGALNMRQGRFELAHGGTLFLDEIGEITPAFQAKLLRVLQEGEFERVGGNRTLKVDVRLVCATNKNL
EEAVSKGEFRADLYYRIHVVPLILPPLRERPGDIPKLAKNFLDRFNKENKLHMMLSAPAIDVLRRCYFPGNVRELENCIR
RTATLAHDAVITPHDFACDSGQCLSAMLWKGSAPKPVMPHVPPAPTPLTPLSPAPLATAAPAAASPAPAADSLPVTCPGT
EACPAVPPRQSEKEQLLQAMERSGWVQAKAARLLNLTPRQVGYALRKYDIDIKRF
>P05407 ~~~nifA~~~Nif-specific regulatory protein~~~COG3604
MLHIPSSSERPASQPEPERAPPGEPSHESALAGIYEISKILNAPGRLEVTLANVLGLLQSFVQMRHGLVSLFNDDGVPEL
TVGAGWSEGTDERYRTCVPQKAIHEIVATGRSLMVENVAAETAFSAADREVLGASDSIPVAFIGVPIRVDSTVVGTLTID
RIPEGSSSLLEYDARLLAMVANVIGQTIKLHRLFAGDREQSLVDKDRLEKQTVDRGPPARERKQLQAHGIIGDSPALSAL
LEKIVVVARSNSTVLLRGESGTGKELVAKAIHESSVRAKRPFVKLNCAALPETVLESELFGHEKGAFTGAVSARKGRFEL
ADKGTLFLDEIGEISPPFQAKLLRVLQEQEFERVGSNHTIKVDVRVIAATNRNLEEAVARSEFRADLYYRISVVPLLLPP
LRERRSDIPLLAREFLRKFNSENGRSLTLEASAIDVLMSCKFPGNVRELENCIERTATLSAGTSIVRSDFACSQGQCLST
TLWKSTSYGKTDPAAPMQPVPAKSIIPLAETAPPPQAVCEPGSLAPSGTVLVSGARMADRERVVAAMEKSGWVQAKAARL
LGLTPRQVGYALRKYGIEIKRF
>P03027 ~~~nifA~~~Nif-specific regulatory protein~~~
MIHKSDSDTTVRRFDLSQQFTAMQRISVVLSRATEASKTLQEVLSVLHNDAFMQHGMICLYDSQQEILSIEALQQTEDQT
LPGSTQIRYRPGEGLVGTVLAQGQSLVLPRVADDQRFLDRLSLYDYDLPFIAVPLMGPHSRPIGVLAAHAMARQEERLPA
CTRFLETVANLIAQTIRLMILPTSAAQAPQQSPRIERPRACTPSRGFGLENMVGKSPAMRQIMDIIRQVSRWDTTVLVRG
ESGTGKELIANAIHHNSPRAAAAFVKFNCAALPDNLLESELFGHEKGAFTGAVRQRKGRFELADGGTLFLDEIGESSASF
QAKLLRILQEGEMERVGGDETLRVNVRIIAATNRHLEEEVRLGHFREDLYYRLNVMPIALPPLRERQEDIAELAHFLVRK
IAHSQGRTLRISDGAIRLLMEYSWPGNVRELENCLERSAVLSESGLIDRDVILFNHRDNPPKALASSGPAEDGWLDNSLD
ERQRLIAALEKAGWVQAKAARLLGMTPRQVAYRIQIMDITMPRL
>Q8KC85 4.-.-.-~~~nifB~~~FeMo cofactor biosynthesis protein NifB~~~COG0535
MTLNIKNHPCFNDSSRHTYGRIHLPVAPKCNIQCNYCNRKFDCMNENRPGITSKVLSPRQALYYLDNALKLSPNISVVGI
AGPGDPFANPEETMETLRLVREKYPEMLLCVATNGLDMLPYIEELAELQVSHVTLTINAIDPEIGQEIYAWVRYQKKMYR
DRQAAELLLENQLAALQKLKRYGVTAKVNSIIIPGVNDQHVIEVARQVASMGADILNALPYYNTTETVFENIPEPDPMMV
RKIQEEAGKLLPQMKHCARCRADAVGIIGEINSDEMMAKLAEAALMPKNPDEHRPYIAVASLEGVLINQHLGEADRFLVY
ALDEEKKSCTLVDSRQAPPPGGGKLRWEALAAKLSDCRAVLVNSAGDSPQSVLKASGIDVMSIEGVIEEAVYGVFTGQNL
KHLMKSSQIHACKTSCGGDGNGCD
>P27714 4.-.-.-~~~nifB~~~FeMo cofactor biosynthesis protein NifB~~~
MQPTQYVGIQDIKSLGTLLDKVAEHKGCGTSSEGGKASCGSSDGPADMAPEVWEKVKNHPCYSEEAHHHYARMHVAVAPA
CNIQCNYCNRKYDCANESRPGVVSEKLTPEQAAKKVFAVASTIPQMTVLGIAGPGDPLANPAKTFKTFELISQTAPDIKL
CLSTNGLALPDHIDTIAAFNVDHVTITTNMVDPEIGQHIYPWIYYQNKRWTGIDAARILHERQMLGLEMLTARGILCKVN
SVMIPGINDQHLVEVNRAVKSRGAFLHNIMPLISAPEHGTVFGLNGQRGPSAQELKALQDACEGEMNMMRHCRQCRADAV
GLLGEDRSAEFTTEKIEAMEVAYDGATRKAYQELVEQERQAKSAAKAAEQQELAQMADQSGLSLLVAVATKGQGRVNEHF
GHVSEFQIYEVSSAGSKFVGHRRVDQYCQGGYGEEDALETVIRAINDCHAVLVAKIGGCPKDDLQKVGIEPVDRYAHEFI
EQSVIAYFMDYLERVRSGQIEHRPRGDADIRQGAYTSVQSTSAAA
>P07328 1.18.6.1~~~nifD~~~Nitrogenase molybdenum-iron protein alpha chain~~~
MTGMSREEVESLIQEVLEVYPEKARKDRNKHLAVNDPAVTQSKKCIISNKKSQPGLMTIRGCAYAGSKGVVWGPIKDMIH
ISHGPVGCGQYSRAGRRNYYIGTTGVNAFVTMNFTSDFQEKDIVFGGDKKLAKLIDEVETLFPLNKGISVQSECPIGLIG
DDIESVSKVKGAELSKTIVPVRCEGFRGVSQSLGHHIANDAVRDWVLGKRDEDTTFASTPYDVAIIGDYNIGGDAWSSRI
LLEEMGLRCVAQWSGDGSISEIELTPKVKLNLVHCYRSMNYISRHMEEKYGIPWMEYNFFGPTKTIESLRAIAAKFDESI
QKKCEEVIAKYKPEWEAVVAKYRPRLEGKRVMLYIGGLRPRHVIGAYEDLGMEVVGTGYEFAHNDDYDRTMKEMGDSTLL
YDDVTGYEFEEFVKRIKPDLIGSGIKEKFIFQKMGIPFREMHSWDYSGPYHGFDGFAIFARDMDMTLNNPCWKKLQAPWE
ASEGAEKVAASA
>P00467 1.18.6.1~~~nifD~~~Nitrogenase molybdenum-iron protein alpha chain~~~
MSENLKDEILEKYIPKTKKTRSGHIVIKTEETPNPEIVANTRTVPGIITARGCAYAGCKGVVMGPIKDMVHITHGPIGCS
FYTWGGRRFKSKPENGTGLNFNEYVFSTDMQESDIVFGGVNKLKDAIHEAYEMFHPAAIGVYATCPVGLIGDDILAVAAT
ASKEIGIPVHAFSCEGYKGVSQSAGHHIANNTVMTDIIGKGNKEQKKYSINVLGEYNIGGDAWEMDRVLEKIGYHVNATL
TGDATYEKVQNADKADLNLVQCHRSINYIAEMMETKYGIPWIKCNFIGVDGIVETLRDMAKCFDDPELTKRTEEVIAEEI
AAIQDDLDYFKEKLQGKTACLYVGGSRSHTYMNMLKSFGVDSLVAGFEFAHRDDYEGREVIPTIKIDADSKNIPEITVTP
DEQKYRVVIPEDKVEELKKAGVPLSSYGGMMKEMHDGTILIDDMNHHDMEVVLEKLKPDMFFAGIKEKFVIQKGGVLSKQ
LHSYDYNGPYAGFRGVVNFGHELVNGIYTPAWKMITPPWKKASSESKVVVGGEA
>P00466 1.18.6.1~~~nifD~~~Nitrogenase molybdenum-iron protein alpha chain~~~
MMTNATGERNLALIQEVLEVFPETARKERRKHMMVSDPKMKSVGKCIISNRKSQPGVMTVRGCAYAGSKGVVFGPIKDMA
HISHGPAGCGQYSRAERRNYYTGVSGVDSFGTLNFTSDFQERDIVFGGDKKLSKLIEEMELLFPLTKGITIQSECPVGLI
GDDISAVANASSKALDKPVIPVRCEGFRGVSQSLGHHIANDVVRDWILNNREGQPFETTPYDVAIIGDYNIGGDAWASRI
LLEEMGLRVVAQWSGDGTLVEMENTPFVKLNLVHCYRSMNYIARHMEEKHQIPWMEYNFFGPTKIAESLRKIADQFDDTI
RANAEAVIARYEGQMAAIIAKYRPRLEGRKVLLYMGGLRPRHVIGAYEDLGMEIIAAGYEFAHNDDYDRTLPDLKEGTLL
FDDASSYELEAFVKALKPDLIGSGIKEKYIFQKMGVPFRQMHSWDYSGPYHGYDGFAIFARDMDMTLNNPAWNELTAPWL
KSA
>P00459 1.18.6.1~~~nifH1~~~Nitrogenase iron protein 1~~~
MAMRQCAIYGKGGIGKSTTTQNLVAALAEMGKKVMIVGCDPKADSTRLILHSKAQNTIMEMAAEAGTVEDLELEDVLKAG
YGGVKCVESGGPEPGVGCAGRGVITAINFLEEEGAYEDDLDFVFYDVLGDVVCGGFAMPIRENKAQEIYIVCSGEMMAMY
AANNISKGIVKYANSGSVRLGGLICNSRNTDREDELIIALANKLGTQMIHFVPRDNVVQRAEIRRMTVIEYDPKAKQADE
YRALARKVVDNKLLVIPNPITMDELEELLMEFGIMEVEDESIVGKTAEEV
>P00456 1.18.6.1~~~nifH1~~~Nitrogenase iron protein 1~~~
MRQVAIYGKGGIGKSTTTQNLTSGLHAMGKTIMVVGCDPKADSTRLLLGGLAQKSVLDTLREEGEDVELDSILKEGYGGI
RCVESGGPEPGVGCAGRGIITSINMLEQLGAYTDDLDYVFYDVLGDVVCGGFAMPIREGKAQEIYIVASGEMMALYAANN
ISKGIQKYAKSGGVRLGGIICNSRKVANEYELLDAFAKELGSQLIHFVPRSPMVTKAEINKQTVIEYDPTCEQAEEYREL
ARKVDANELFVIPKPMTQERLEEILMQYGLMDL
>P00458 1.18.6.1~~~nifH~~~Nitrogenase iron protein~~~
MTMRQCAIYGKGGIGKSTTTQNLVAALAEMGKKVMIVGCDPKADSTRLILHAKAQNTIMEMAAEVGSVEDLELEDVLQIG
YGDVRCAESGGPEPGVGCAGRGVITAINFLEEEGAYEDDLDFVFYDVLGDVVCGGFAMPIRENKAQEIYIVCSGEMMAMY
AANNISKGIVKYAKSGKVRLGGLICNSRQTDREDELIIALAEKLGTQMIHFVPRDNIVQRAEIRRMTVIEYDPACKQANE
YRTLAQKIVNNTMKVVPTPCTMDELESLLMEFGIMEEEDTSIIGKTAAEENAA
>P22921 1.18.6.1~~~nifH~~~Nitrogenase iron protein~~~
MSALRQIAFYGKGGIGKSTTSQNTLAALVEMGQRILIVGCDPKADSTRLILNTKLQDTVLHLAAEAGSVEDLDVADVVKI
GYKGIKCTESGGPEPGVGCAGRGVITAINFLEENGAYDDLDYVSYDVLGDVVCGGFAMPIRENKAQEIYIVMSGEMMALY
AANNIAKGILKYAHTGGVRLGGLICNERQTDKEVELAEALAGRLGCRLIHFVPRDNGVQHAELRRQTVIQYAPDSKQAGE
YRTLATKIHNNSGQGVVPTPITMEDLEEMLMEFGIMKSDEEALAELEAKESAAAN
>Q06879 1.2.7.-~~~nifJ~~~Pyruvate-flavodoxin oxidoreductase~~~COG0674
MSQTFATIDGNEAVARVAYKLNEVIAIYPITPSSAMGEWADAWMAEGRPNLWGTVPSVVQMQSEGGAAGAVHGALQTGSL
STTFTASQGLLLMIPNLYKIGGELTSMVVHVAARSLATHALSIFGDHSDVMAARGTGFAMLCSASVQESHDFALIAHAAT
LDTRVSFLHFFDGFRTSHEVQKVELLADDDVRSLINEDKIFAHRARALTPDSPLLRGTAQNPDVFFQAREGANPYYNACP
AIVQGIMDKFGERTGRYYQIYEYHGASDADRLIIIMGSGCETVHETVDYLNARGEKVGVLKVRLFRPWDVERFVQALPHS
VQAIAVLDRTKEPGSAGEPLYQDVVTAIHEGWVNKNNSPVPSPQSPVPKIIGGRYGLSSKEFTPAMVKAVFDNLAQATPK
NHFTIGINDDVTHTSLEYDPSFSTEPDNVVRAMFYGLGSDGTVGANKNSIKIIGEGTDNYAQGYFVYDSKKSGSMTVSHL
RFGSQPIRSTYLIDQANFIGCHHWGFLERIEVLNAAAHGATILLNSPYNAATVWENLPLKVRLQILDKQLKLYVINANQV
ARDSGMGGRINTIMQVCFFALAGVLPEVQAIAKIKQAIEKTYGKKGVEVVRMNLQAVDQTLENLHEVKIPIEEKGKWIDE
EALLSNQSPFSTSAPKFVRDVLGKIMVWQGDDLPVSTLPPDGTFPTGTAKWEKRNVAQEIPVWDTDICVQCSKCVMVCPH
AAIRAKVYQPSELENAPPTFKSVDAKDRDFANQKFTIQVAPEDCTGCAICVNVCPAKNKSEPSLKAINMANQLPLREQER
DNWDFFLNLPNPDRRNLKLNQIRQQQLQEPLFEFSGACAGCGETPYVKLLTQLFGDRSVIANATGCSSIYGGNLPTTPWT
KNNDGRGPAWSNSLFEDNAEFGFGYRLSLDKQAEFAAELLQQFSTEVGDNLVDSILKAPQKTEADIWEQRQRIELLKQQL
DKIPTFDPNLKSKIQNLKSLADYLVKKSVWIIGGDGWAYDIDFGGIDHVIASGRNVNILVMDTEVYSNTGGQSSKATPKA
AVAKFAASGKPAQKKDMGLMAMNYGNVYVASVALGAKDDQTLKAFLEAEAFDGPSIIIAYSHCIAHGINMTTGMNQQKAL
VESGRWLLYRYNPLLQEQGKNPLQLDMRSPTQSVEQSMYQENRFKMLTKSKPEVAKQLLEQAQAEVDARWQMYQYLASR
>P07329 1.18.6.1~~~nifK~~~Nitrogenase molybdenum-iron protein beta chain~~~
MSQQVDKIKASYPLFLDQDYKDMLAKKRDGFEEKYPQDKIDEVFQWTTTKEYQELNFQREALTVNPAKACQPLGAVLCAL
GFEKTMPYVHGSQGCVAYFRSYFNRHFREPVSCVSDSMTEDAAVFGGQQNMKDGLQNCKATYKPDMIAVSTTCMAEVIGD
DLNAFINNSKKEGFIPDEFPVPFAHTPSFVGSHVTGWDNMFEGIARYFTLKSMDDKVVGSNKKINIVPGFETYLGNFRVI
KRMLSEMGVGYSLLSDPEEVLDTPADGQFRMYAGGTTQEEMKDAPNALNTVLLQPWHLEKTKKFVEGTWKHEVPKLNIPM
GLDWTDEFLMKVSEISGQPIPASLTKERGRLVDMMTDSHTWLHGKRFALWGDPDFVMGLVKFLLELGCEPVHILCHNGNK
RWKKAVDAILAASPYGKNATVYIGKDLWHLRSLVFTDKPDFMIGNSYGKFIQRDTLHKGKEFEVPLIRIGFPIFDRHHLH
RSTTLGYEGAMQILTTLVNSILERLDEETRGMQATDYNHDLVR
>P11347 1.18.6.1~~~nifK~~~Nitrogenase molybdenum-iron protein beta chain~~~
MLDATPKEIVERKALRINPAKTCQPVGAMYAALGIHNCLPHSHGSQGCCSYHRTVLSRHFKEPAMASTSSFTEGASVFGG
GSNIKTAVKNIFSLYNPDIIAVHTTCLSETLGDDLPTYISQMEDAGSIPEGKLVIHTNTPSYVGSHVTGFANMVQGIVNY
LSENTGAKNGKINVIPGFVGPADMREIKRLFEAMDIPYIMFPDTSGVLDGPTTGEYKMYPEGGTKIEDLKDTGNSDLTLS
LGSYASDLGAKTLEKKCKVPFKTLRTPIGVSATDEFIMALSEATGKEVPASIEEERGQLIDLMIDAQQYLQGKKVALLGD
PDEIIALSKFIIELGAIPKYVVTGTPGMKFQKEIDAMLAEAGIEGSKVKVEGDFFDVHQWIKNEGVDLLISNTYGKFIAR
EENIPFVRFGFPIMDRYGHYYNPKVGYKGAIRLVEEITNVILDKIERECTEEDFEVVR
>P09772 1.18.6.1~~~nifK~~~Nitrogenase molybdenum-iron protein beta chain~~~
MSQTIDKINSCYPLFEQDEYQELFRNKRQLEEAHDAQRVQEVFAWTTTAEYEALNFRREALTVDPAKACQPLGAVLCSLG
FANTLPYVHGSQGCVAYFRTYFNRHFKEPIACVSDSMTEDAAVFGGNNNMNLGLQNASALYKPEIIAVSTTCMAEVIGDD
LQAFIANAKKDGFVDSSIAVPHAHTPSFIGSHVTGWDNMFEGFAKTFTADYQGQPGKLPKLNLVTGFETYLGNFRVLKRM
MEQMAVPCSLLSDPSEVLDTPADGHYRMYSGGTTQQEMKEAPDAIDTLLLQPWQLLKSKKVVQEMWNQPATEVAIPLGLA
ATDELLMTVSQLSGKPIADALTLERGRLVDMMLDSHTWLHGKKFGLYGDPDFVMGLTRFLLELGCEPTVILSHNANKRWQ
KAMNKMLDASPYGRDSEVFINCDLWHFRSLMFTRQPDFMIGNSYGKFIQRDTLAKGKAFEVPLIRLGFPLFDRHHLHRQT
TWGYEGAMNIVTTLVNAVLEKLDSDTSQLGKTDYSFDLVR
>P30663 2.7.13.3~~~nifL~~~Nitrogen fixation regulatory protein~~~
MTPANPTLSNEPQAPHAESDELLPEIFRQTVEHAPIAISITDLKANILYANRAFRTITGYGSEEVLGKNESILSNGTTPR
LVYQALWGRLAQKKPWSGVLVNRRKDKTLYLAELTVAPVLNEAGETIYYLGMHRDTSELHELEQRVNNQRLMIEAVVNAA
PAAMVVLDRQHRVMLSNPSFCRLARDLVEDGSSESLVALLRENLAAPFETLENQGSAFSGKEISFDLGGRSPRWLSCHGR
AIHIENEQAHVFFAPTEERYLLLTINDISELRQKQQDSRLNALKALMAEEELLEGMRETFNAAIHRLQGPVNLISAAMRM
LERRLGDKAGNDPVLSAMREASTAGMEALENLSGSIPVRMAESKMPVNLNQLIREVITLCTDQLLAQGIVVDWQPALRLP
WVMGGESSQRSMIKHLVDNAIESMSQNQVSRRELFISTRVENHLVRMEITDSGPGIPPDLVLKVFEPFFSTKPPHRVGRG
MGLPVVQEIVAKHAGMVHVDTDYREGCRIVVELPFSAST
>P56267 ~~~nifL~~~Nitrogen fixation regulatory protein~~~COG2205
MTLNMMLDNAVPEAIAGALTQQHPGLFFTMVEQASVAISLTDARANIIYANPAFCRQTGYSLAQLLNQNPRLLASSQTPR
EIYQEMWQTLLQRQPWRGQLINQRRDGGLYLVDIDITPVLNPQGELEHYLAMQRDISVSYTLEQRLRNHMTLMEAVLNNI
PAAVVVVDEQDRVVMDNLAYKTFCADCGGKELLVELQVSPRKMGPGAEQILPVVVRGAVRWLSVTCWALPGVSEEASRYF
VDSAPARTLMVIADCTQQRQQQEQGRLDRLKQQMTAGKLLAAIRESLDAALIQLNCPINMLAAARRLNGEGSGNVALDAA
WREGEEAMARLQRCRPSLELESNAVWPLQPFFDDLYALYRTRFDDRARLQVDMASPHLVGFGQRTQLLACLSLWLDRTLA
LAAELPSVPLEIELYAEEDEGWLSLYLNDNVPLLQVRYAHSPDALNSPGKGMELRLIQTLVAYHRGAIELASRPQGGTSL
VLRFPLFNTLTGGEQ
>P06772 ~~~nifL~~~Nitrogen fixation regulatory protein~~~
MTLNMMLDNAVPEAIAGALTQQHPGLFFTMVEQASVAISLTDARANITYANPAFCRQTGYSLAQLLNQNPRLLASSQTPR
EIYQEMWQTLLQRQPWRGQLINQARDGGLYLVDIDITPVLNPQGELEHYLAMQRDISVSYTLEQRLRNHMTLMEAVLNNI
PAAVVVVDEQDRVVMDNLAYKTFCADCGGKELLVELQVSPRKMGPGAEQILPVVVRGAVRWLSVTCWALPGVSEEASRYF
VDSAPARTLMVIADCTQQRQQQEQGRLDRLKQQMTAGKLLAAIRESLDAALIQLNCPINMLAAARRLNGEGSGNVALDAA
WREGEEAMARLQRCRPSLELESNAVWPLQPFFDDLYALYRTRFDDRARLQVDMASPHLVGFGQRTQLLACLSLWLDRTLA
LAAELPSVPLEIELYAEEDEGWLSLYLNDNVPLLQVRYAHSPDALNSPGKGMELRLIQTLVAYHRGAIELASRPQGGTSL
VLRFPLFNTLTGGEQ
>P05341 2.8.1.7~~~nifS~~~Cysteine desulfurase NifS~~~
MADVYLDNNATTRVDDEIVQAMLPFFTEQFGNPSSLHSFGNQVGMALKKARQSVQKLLGAEHDSEILFTSCGTESDSTAI
LSALKAQPERKTVITTVVEHPAVLSLCDYLASEGYTVHKLPVDKKGRLDLEHYASLLTDDVAVVSVMWANNETGTLFPIE
EMARLADDAGIMFHTDAVQAVGKVPIDLKNSSIHMLSLCGHKLHAPKGVGVLYLRRGTRFRPLLRGGHQERGRRAGTENA
ASIIGLGVAAERALQFMEHENTEVNALRDKLEAGILAVVPHAFVTGDPDNRLPNTANIAFEYIEGEAILLLLNKVGIAAS
SGSACTSGSLEPSHVMRAMDIPYTAAHGTVRFSLSRYTTEEEIDRVIREVPPIVAQLRNVSPYWSGNGPVEDPGKAFAPV
YG
>P38033 2.8.1.7~~~nifS~~~Putative cysteine desulfurase NifS~~~COG1104
MIYLDYAATTPICEEALTVYQKLSMDMYGNASSLHDAGGKAKHILEYCREKIANIIGGEASGIYFTSGGTESNFLAIQSL
LNGLPKTKRHFITTAMEHQSIHNCAAFLEQQGYDVTVIEPNEYGLITEEILLTHIRPETGLVSIQHANSETGIIQPIQHL
SSYLHNKGILLHCDAVQTFGKIPINTKNLGVDALSMSSHKIHGPKGVGAVYIRPDVPWKPVYPLTTHEYGFRAGTVNVPG
IGAFTAAAELIVSEMEKQISRNEALRTYFLDQIRIRSLPVTLAADTSKAECLPHIIGCFFHSFEGQYVMLECNRSNICIS
TGSACSAGYHGPSETMKALRKTEQEALQFIRISFGRHTTAEQLEQLLHTFTVLWEQKKGEFDIDRRIKANGRQQA
>P46054 ~~~nifW2~~~Nitrogenase-stabilizing/protective protein NifW 2~~~
MTWDIEQFNKLVSAEEYFEFFQLPYDPRVVQVSRLHILKQFSQSIQEIDANNSQASQAEKLDLYCTALKQAYEVFLSSTP
LEQKLFKVFKQKPKNIVMLTEIATS
>P14888 ~~~nifW~~~Nitrogenase-stabilizing/protective protein NifW~~~
MTVQPFSPDSDLTLDEAMDELVSAEDFLEFFGVPFDQDVVHVNRLHIMQRYHDYLSKAGDLDEHDDQARYAVFQKLLARA
YLDFVESDALTEKVFKVFRMHEPQKTFVSIDQLLS
>A9KT32 2.4.1.279~~~~~~Nigerose phosphorylase~~~COG1554
MNWTLTNSSLDKDSITSNGNRFLIGNGYLGIRGTLEEYRKEYFPAINLAGIYDQVGEGWREPLNAPNALYTRIEVDEVEY
QLPKIEPRYHELSLDYRHGILDRQTVWASNKGTIIVKSSRFASMKEKHLVVLNYSITADYDCEIIVYTGIDGSVWDIHGP
HYDKVEFQKQLLRHNETEILNEKYLLESESKWNGHRLSIAAQTHENKDIVYVTEDIICNKEAKIELIESDKECLRKLTFH
GKAKEEINFTKYITVFTSKDCVDYKEQSIKIVNHAKDTGYERLQEEHKNVWEQLWNISEVTIEGDDEANDALNYSLYHLH
CIAPRHSKSLSIAARGLSGQTYKGAVFWDTEMFMLDFFLYTQPEVAKTLLRYRIDTLEGAKKKAKLYGYEGAFYAWESQE
GGYDACSDYNVTDVFTKRPMRTYFKDKQVHISSAIVYGIRSYLNYTNDFSILAEGGAETILECAKFYYSLLEKKIGKEYY
EIHDVIGPDEYHERVNNNAYTNRMAKLTFETAIDILDHEKNKDEEFYIKLLKQYEIKDLLDKLKDACNKLYIPKPKDNSD
LIEQFDGFFELEDVSLEEVRSRLLHEKEYWGGAYGVASHTQVIKQADVVTMLVLFKEEYQREVLQQNLNYYEPRTEHGSS
LSACMYSLLYCMCDQPQYAYPFFMKSALADWNGKGKEWAGLVYIGGTHPAAAGGAYMTAIKGFGGFQIENGVIKATPRLP
KHWVRLKYRVLYQGAIYEIDASKEQVSISKIEM
>P30820 ~~~ngr~~~Nigerythrin~~~COG1592
MKVRAQVPTVKNATNFNMVADSKTAVGSTLENLKAAIAGETGAHAKYTAFAKAAREQGYEQIARLFEATAAAELIHIGLE
YALVAEMEPGYEKPTVAAPSAYSCDLNLISGANGEIYETSDMYPAFIRKAQEEGNSKAVHVFTRAKLAESVHAERYLAAY
NDIDAPDDDKFHLCPICGYIHKGEDFEKCPICFRPKDTFTAY
>P33590 ~~~nikA~~~Nickel-binding periplasmic protein~~~COG0747
MLSTLRRTLFALLACASFIVHAAAPDEITTAWPVNVGPLNPHLYTPNQMFAQSMVYEPLVKYQADGSVIPWLAKSWTHSE
DGKTWTFTLRDDVKFSNGEPFDAEAAAENFRAVLDNRQRHAWLELANQIVDVKALSKTELQITLKSAYYPFLQELALPRP
FRFIAPSQFKNHETMNGIKAPIGTGPWILQESKLNQYDVFVRNENYWGEKPAIKKITFNVIPDPTTRAVAFETGDIDLLY
GNEGLLPLDTFARFSQNPAYHTQLSQPIETVMLALNTAKAPTNELAVREALNYAVNKKSLIDNALYGTQQVADTLFAPSV
PYANLGLKPSQYDPQKAKALLEKAGWTLPAGKDIREKNGQPLRIELSFIGTDALSKSMAEIIQADMRQIGADVSLIGEEE
SSIYARQRDGRFGMIFHRTWGAPYDPHAFLSSMRVPSHADFQAQQGLADKPLIDKEIGEVLATHDETQRQALYRDILTRL
HDEAVYLPISYISMMVVSKPELGNIPYAPIATEIPFEQIKPVKP
>Q2G2P5 ~~~nikA~~~Nickel-binding protein NikA~~~COG0747
MKFKRLATIFSAVLVLSGCGSMHSSGKDLNISLPLKTKSIAPYETDVPVKIGAAESLFKTNDQGKIEKALVKSYHQPNDT
TLDIELKDNIKFQNGQKLTAEKVKSSLENSMKKSDLVKYSLPISSITAKGQKLTIKTNSAYPELVSELANPFMAIYDTDA
KSDVNQTPVGTGPYQIKDYKQSRKISLSNFKDYWQGKPKLDHITVTYQEDGNNRVRNLESQKDDLITDVPVNKVQDIENN
QNLKVSKESGFRTSLLMYNHTNKKMTKSVREALDHIIDRQGIADHIYQGYAKPATSPFNDKIPYIKEPKLTKQNIEQAKM
LLAKDGYTKEHPLKIKLITYDGRPELSKIAQVLQSDAKKANIEIDIKSVDDIEGYLKDRSAWDATMYSFGTIPRGDTGYF
FNQAYKKDGAINKGDYNNSNVDDLINQLNHTVDVKERHNISNDIIKLSSRDVPNSYIAYNDQIVAANSKVKNYKVTPEGI
YLIDYRTTIER
>P33591 ~~~nikB~~~Nickel transport system permease protein NikB~~~COG0601
MLRYVLRRFLLLIPMVLAASVIIFLMLRLGTGDPALDYLRLSNLPPTPEMLASTRTMLGLDQPLYVQYGTWLWKALHLDF
GISFASQRPVLDDMLNFLPATLELAGAALVLILLTSVPLGIWAARHRDRLPDFAVRFIAFLGVSMPNFWLAFLLVMAFSV
YLQWLPAMGYGGWQHIILPAVSIAFMSLAINARLLRASMLDVAGQRHVTWARLRGLNDKQTERRHILRNASLPMITAVGM
HIGELIGGTMIIENIFAWPGVGRYAVSAIFNRDYPVIQCFTLMMVVVFVVCNLIVDLLNAALDPRIRRHEGAHA
>Q2FYQ5 ~~~nikB~~~Nickel import system permease protein NikB~~~COG0601
MFIIKSMLYRLMQMIVVLFVISTLTFILMKLSPGNPVDKILHLDVAQVSTEQINATKDKLGLNDSLLVQWWHWMNHLLHF
NLGKSFESKEPVTQILFNYAPITLLISFSTLVVSLCISIPLGIIAAKRFHKWTDKVIRVISTLSISLPAFFIGIILLFIV
TNLMNIDSVILSQFILPVITLSLGMCAYIIRLVRSNLLMLLQSNIVQASRLRGMNERYILIHDLLKPTILPIIPLLGISL
GSLIGGTVVIENLFDIPGIGYLLMDSIKSRDYPVIQGCVLFIGFFVVIINTIADLLTLLLDPKQRLQLGNPKIKTNTPLI
SESSDRHA
>P0AFA9 ~~~nikC~~~Nickel transport system permease protein NikC~~~COG1173
MNFFLSSRWSVRLALIIIALLALIALTSQWWLPYDPQAIDLPSRLLSPDAQHWLGTDHLGRDIFSRLMAATRVSLGSVMA
CLLLVLTLGLVIGGSAGLIGGRVDQATMRVADMFMTFPTSILSFFMVGVLGTGLTNVIIAIALSHWAWYARMVRSLVISL
RQREFVLASRLSGAGHVRVFVDHLAGAVIPSLLVLATLDIGHMMLHVAGMSFLGLGVTAPTAEWGVMINDARQYIWTQPL
QMFWPGLALFISVMAFNLVGDALRDHLDPHLVTEHAH
>Q2FYQ6 ~~~nikC~~~Nickel import system permease protein NikC~~~COG1173
MHKIFSKNNLIFFVFVAFIFVVIVLQFFVSSENATKVNLSQTFEPISWLHLLGTDDYGRDLFTRIIIGARSTLFVTVLTL
IAIVVIGVTLGLFAGYKKGWIERLVLRFIDVGLSIPEFIIMIALASFFQPSLWNLVISITLIKWMNYTRLTRSIVNSEMN
KPYIKMAQLFHVPTRTILIRHLTPKIIPAIIVLMVVDFGKIILYISSLSFIGLGAQPPTPEWGAMLQQGRDFISSHPIML
IAPASVIAITILIFNLTGDALRDRLLKQRGEYDESH
>Q8FVM9 7.2.2.11~~~nikD~~~Nickel import ATP-binding protein NikD~~~
MTRKTLAIEGLTATTVIDGQQRVLVDNLSLGVQRGRILALVGASGSGKSMTCSAALGVLPPGVTASRGRVTIDGVPYAAN
ALRGRHVATIMQNPRGAFNPVRTMRDHAIETLQALGKLSSNPQDQIVHCMRAAGLEDVKTILSLHPFEMSGGMLQRMMIA
LALLSEAPFLFADEPTTDLDLVVQLRVLELLEKLVEERDLGILLVTHDMGVVARLAHDVAVLDHGRLIEQAPVMDIFQTP
GHEVTRMLVSAHLSLYGMELNA
>Q2FYQ7 7.2.2.11~~~nikD~~~Nickel import system ATP-binding protein NikD~~~COG0444
MSLIDIQNLTIKNTSEKSLIKGIDLKIFSQQINALIGESGAGKSLIAKALLEYLPFDLSCTYDSYQFDGENVSRLSQYYG
HTIGYISQNYAESFNDHTKLGKQLTAIYRKHYKGSKEEALSKVDKALSWVNLQSKDILNKYSFQLSGGQLERVYIASVLM
LEPKLIIADEPVASLDALNGNQVMDLLQHIVLEHGQTLFIITHNLSHVLKYCQYIYVLKEGQIIERGNINHFKYEHLHPY
TERLIKYRTQLKRDYYD
>Q8FVN0 7.2.2.11~~~nikE~~~Nickel import ATP-binding protein NikE~~~
MSLISADNIVKIYQSHSLVGASARKTVLHDISISIGQGETVALLGRSGCGKSTLARLLVGLERPTSGEVRFRGVPLTKLD
RSGMKAFRREVQLIFQDSPGAVNARSSVRAIIGEPLRHLTSLDETRREERIQELLRLVELPPEIADRLPAQVSGGQLQRI
CIARALAVNPKLIILDEAVSNLDIHLQASALALLTKLQQEGGIAYLFVTHDLRLVQKFAARCLVMDEGQIVEEIKTADLD
SMRHPASRLLREAVLPPLPVRAVETN
>Q2FYQ8 7.2.2.11~~~nikE~~~Nickel import system ATP-binding protein NikE~~~COG4608
MIELKHVTFGYNKKQMVLQDINITIPDGENVGILGESGCGKSTLASLVLGLFKPVKGEIYLSDNAVLPIFQHPLTSFNPD
WTIETSLKEALYYYRGLTDNTAQDQLLLQHLSTFELNAQLLTKLPSEVSGGQLQRFNVMRSLLAQPRVLICDEITSNLDV
IAEQNVINILKAQTITNLNHFIVISHDLSVLQRLVNRIIVLKDGMIVDDFAIEELFNVDRHPYTKELVQAFSY
>D5AQY8 ~~~nikMN~~~Fused nickel transport protein NikMN~~~COG0310
MHIPDGYLSPVTCAVTFAATVPFWYVSMRKLDRDLNGQHLPLVALVAAFSFVIMMFNLPIPGGTTAHAAGIGIAAVLLGP
WAAVPAISVALLIQAIFFGDGGITAFGANCLNMAVVGPMVAAAVYALGTRGAAIGSRRRVIMAGLASYAGLNAAALLAAV
EFGVQPLFFHDAAGAPLYAPYPLSVAVPAMALTHLTIAGAAEFIVTAGLVAWLQRSNPELLAPRRAPAAPERHLRLWAGI
GALVVLCPLGLIAAGTAWGEWGAEDFTSEAGRAAMAGASGGVAPPAGLPGGFARLAELWSAPLPDYAPAFVQNAPLGYVL
SALLGVALIVAGIGLSAGLRALTRRAG
>D5AQY6 3.6.3.-~~~nikO~~~Nickel import ATP-binding protein NikO~~~COG1122
MTPAFELQGVQFAYKGVPALNGLDLTLPLGRRTALLGANGSGKSTLLRLLDGLQFPAAGRISAFGTPLTEAMFTDEAAAI
AFRRRVGFVFQNPEVQLFCPSVFDELAFGPLQLHWPKERIRARVARAIAQFGLGPLAGRPPHRLSGGEKKRVALASVLIL
DPEVLLLDEPTAALDPQATDDIAALLETEFGARNPGRTLIFSSHDLDLVARIADHVVVLEAGKVAAAGPAAEVLARTALL
RRARLLPGFDGTAP
>D5AQY7 ~~~nikQ~~~Nickel transport protein NikQ~~~COG0619
MTDPSVVHARAPAPDRIGRLGAGVRGLMQHAEEAAGLAQRPGLLQGLDPRAKVAGAFALILAAVATRSLLVLLALFVLAT
ALAAASQISPARLARQVWIVVLGFTGMIALPALILVPGTPVLSLPFGLAITEQGLRAAAFLTGRSETTATLALALVLTTP
WPQVLKALRCLGVPRAAVMILGMTHRYIFVLADLALDLFEARRSRLVGRLSPAEARRLATGIAGALFERALALSSEVHLA
MLARGWRGEVHLIDDFRFRPRDGGALVLAAAILAGVVWAGSVWP
>P0A6Z6 ~~~nikR~~~Nickel-responsive regulator~~~COG0864
MQRVTITLDDDLLETLDSLSQRRGYNNRSEAIRDILRSALAQEATQQHGTQGFAVLSYVYEHEKRDLASRIVSTQHHHHD
LSVATLHVHINHDDCLEIAVLKGDMGDVQHFADDVIAQRGVRHGHLQCLPKED
>B5Z8Y5 ~~~~~~Putative nickel-responsive regulator~~~
MDTPNKDDSIIRFSVSLQQNLLDELDNRIIKNGYSSRSELVRDMIREKLVEDNWAEDNPNDESKIAVLVVIYDHHQRELN
QRMIDIQHASGTHVLCTTHIHMDEHNCLETIILQGNSFEIQRLQLEIGGLRGVKFAKLTKASSFEHNE
>O25896 ~~~~~~Putative nickel-responsive regulator~~~COG0864
MDTPNKDDSIIRFSVSLQQNLLDELDNRIIKNGYSSRSELVRDMIREKLVEDNWAEDNPNDESKIAVLVVIYDHHQRELN
QRMIDIQHASGTHVLCTTHIHMDEHNCLETIILQGNSFEIQRLQLEIGGLRGVKFAKLTKASSFEYNE
>P76241 ~~~nimR~~~HTH-type transcriptional regulator NimR~~~COG2207
MMHRLNLNGYEPDRHHEAAVAFCIHAGTDELTSPVHQHRKGQLILALHGAITCTVENALWMVPPQYAVWIPGGVEHSNQV
TANAELCFLFIEPSAVTMPTTCCTLKISPLCRELILTLANRTTTQRAEPMTRRLIQVLFDELPQQPQQQLHLPVSSHPKI
RTMVEMMAKGPVEWGALGQWAGFFAMSERNLARLIVKETGLSFRQWRQQLQLIMALQGLVKGDTVQKVAHTLGYDSTTAF
ITMFKKGLGQTPGRYIARLTTVSPQSAKPDPRQ
>P76242 ~~~nimT~~~2-nitroimidazole transporter~~~COG2807
MTCSTSLSGKNRIVLIAGILMIATTLRVTFTGAAPLLDTIRSAYSLTTAQTGLLTTLPLLAFALISPLAAPVARRFGMER
SLFAALLLICAGIAIRSLPSPYLLFGGTAVIGGGIALGNVLLPGLIKRDFPHSVARLTGAYSLTMGAAAALGSAMVVPLA
LNGFGWQGALLMLMCFPLLALFLWLPQWRSQQHANLSTSRALHTRGIWRSPLAWQVTLFLGINSLVYYVIIGWLPAILIS
HGYSEAQAGSLHGLLQLATAAPGLLIPLFLHHVKDQRGIAAFVALMCAVGAVGLCFMPAHAITWTLLFGFGSGATMILGL
TFIGLRASSAHQAAALSGMAQSVGYLLAACGPPLMGKIHDANGNWSVPLMGVAILSLLMAIFGLCAGRDKEIR
>P12669 ~~~nin~~~DNA-entry nuclease inhibitor~~~
MIKSWKPQELSISYHQFTVFQKDSTPPVMDWTDEAIEKGYAAADGAISFEAQRNTKAFILFRLNSSETVNSYEKKVTVPF
HVTENGIHIESIMSKRLSFDLPKGDYQLTCWTVPAEMSDLHADTYIIDAVSV
>P0AC26 ~~~nirC~~~Nitrite transporter NirC~~~COG2116
MFTDTINKCAANAARIARLSANNPLGFWVSSAMAGAYVGLGIILIFTLGNLLDPSVRPLVMGATFGIALTLVIIAGSELF
TGHTMFLTFGVKAGSISHGQMWAILPQTWLGNLVGSVFVAMLYSWGGGSLLPVDTSIVHSVALAKTTAPAMVLFFKGALC
NWLVCLAIWMALRTEGAAKFIAIWWCLLAFIASGYEHSIANMTLFALSWFGNHSEAYTLAGIGHNLLWVTLGNTLSGAVF
MGLGYWYATPKANRPVADKFNQTETAAG
>Q51479 ~~~nirC~~~Cytochrome c55X~~~
MNAPPDFRRAASHALWLALALTFACPLPGLADEHPDARRQAQLRHLLLQDCGSCHGLRLTGGLGPALTPEALRGKPRESL
VATVLMGRPQTPMPPWAGLLSEDDAGWLVDRLIEGEIAP
>D3DFS4 4.1.1.111~~~nirDL~~~Siroheme decarboxylase~~~COG1522
MGNEFDKILKIIQKDIPLVKEPFSVLAQEVGIEEGKLLKTIEKLVEDGIVRHIAPIYDSRLLGYDSALIAFKVDRQKLEE
VANFVNACPGVSHNYERTHDFNLWFTLAVPPEISELEDVVRLMAERERVKDYLVLRVVRLFKIGVKLDYESPAEKESVDT
KVYTYTPLTEEEKRIVSITQGSFPLVERPFLEYAKRLRMSEEELLEKLSALKERGVLRRISAVLYHRRAGYVANAMSVWE
VPEDAIEEVGRYIAGFKGVSHCYQRTTSEKFRYNLFAMMHGKGQEEIKLLAETISREKALSKYALLFSTREFKKVRIKYF
SEEFERWFKELISA
>I6UH61 4.1.1.111~~~nirDL~~~Siroheme decarboxylase NirDL subunit~~~
MTMDDLDLRLLDGFQRDLPLETRPFAAIANRLNTSEAEVIARLARLRDEGLIARIGATCRPNTAGASTLAALRVPVRRID
KVAALVGAEPGVNHSYLREGSDWNLWFVATAPDAEALEESLVRIETATGLVPLSLPLVRAFNIDLGFPLIGPRRAMALDR
PTDLDVLRPRDKALMQALTTGLALVPRPFVALGQALGRSEAEVISRIRALAAARILTRVGVIVRHRALGWCENAMVVWRL
PEPAVEAAGTALAAVPGVTLCYQRRTVPGLWNWPLFCMIHARSRAEAMEVLVQARALPELQGVPHRILFSTRCFRQRGAV
IAEVAA
>P0A9I8 1.7.1.15~~~nirD~~~Nitrite reductase (NADH) small subunit~~~COG2146
MSQWKDICKIDDILPETGVCALLGDEQVAIFRPYHSDQVFAISNIDPFFESSVLSRGLIAEHQGELWVASPLKKQRFRLS
DGLCMEDEQFSVKHYEARVKDGVVQLRG
>Q51480 ~~~nirF~~~Protein NirF~~~
MNLRPLAPLLLTLLAGCSQQPPLRGSGDLGVLIERADGSVQILDGTAKTSLARVEGLGDLSHASLVFSRDQRYAYVFGRD
GGLTKLDLLAQRIDKRLIQGGNSIGGAISQDGRLVAVSNYEPGGVKVFDSRTLELVAEIPATRLPGQDRNSRVVGLVDAP
GQRFVFSLFDSGEIWIADFSQGDTPHLTRFRDIGKQPYDALISPDGRYYMAGLFGEDGMAQLDLWHPERGVRRVLGDYGR
GQRKLPVYKMPHLEGWTIASDQAFVPAVGHHQVLVLDARDWKQTDAIDVAGQPVFVMTRPDDRQIWVNFAYPDNDKVQVI
DSETHEVIETLRPGPGVLHMEFSGRGDQVWISVRDADQLQVWDPYRLKRIGSLPARSPSGIFFSHRAQHIGL
>I6TCK3 4.1.1.111~~~nirG~~~Siroheme decarboxylase NirG subunit~~~
MRRQTLPPEALDDTDRAILNRLQEGFPLTPRPFDDAGAALGLTGAQLIERLERLRAIGAITRFGPFYDAAAMGGAFCLCA
LSAPQADFDRIAALVNAHPEVAHNYARDHALNMWFVLATATPEGIAETAGRIEAETGLTVWRFPKLREFFIGFRVAA
>I6UF18 4.1.1.111~~~nirH~~~Siroheme decarboxylase NirH subunit~~~
MTAPDDTDRRLIAATQAGLPLDEAPYARIAAELGLTETQVITRLSILHAQGVIRRIAIAPNHYALGMIANGMSVWDVDDA
QAEALGERIGALDFVTHCYLRPRAPVWRYNLFAMLHGQSRAEVEQKRAQVRALLGAACRADDILYSTRILKKTGLRLKDG
>A8LLZ7 ~~~nirJ~~~Pre-heme d1 synthase~~~COG0535
MFRLSHYLDQLYHPTPPRIARGTPKPVVIWNLTRRCNLKCKHCYTVSADVDFPGELTAAQARETLEDIGRFKVPALILSG
GEPLLRDDLFALAKRARALTRVLALSTNGTGVIGSKADRVAEIGFDYVGISIDGIGATNDAFRGVIGAYEQALAGVRSCK
RRGIKVGLRFTITEQNESQLPELLKLCDDEGVDKFYLSHLVYAGRGNKNRGEDADHARTRRAMDLLIARALESAEGRGHP
LEIVTGNNDADAVYFLNWAKANFPAAQVAHLRKHLEAWGGNASGVGVANIDTQGDVHPDTYWSEYTVGSVKQTPFSELWT
GPDPMLAELRRRPRPLKGRCGACAHQAVCGGNTRIRALQLTGDPWAEDPACYLTAAETGTATDIDRLTVRPFIGDRHDPK
PAFV
>A0A1I5E523 ~~~nirJ~~~Pre-heme d1 synthase~~~
MFRLTQYMHQLLDPSPPRRRSRPDAVRPVVIWNLTRSCNLKCRHCYTVSADRPFPGELSHDQAMAVLRDLSDFRIPALIL
SGGEPMSRFDFWELAEEARRLDFRHLSLSTNGTKIDAGNVERLAGLGFDYVGISLDGIGAVNDWFRGVEGAFDQALAGVR
ACKAQGVKVGLRFTITEGNAHHLPAMLDLCRDEGVDKFYLSHLVYAGRGDKHRGEDTEHARTRRAMDLLIARAWQAVERG
EPLEVVTGNNDADAVYFLRWAETRFAPAAVAHLRAHLQAWGGNSSGLGVGNIDPQGRVHPDTYWSDYTLGSVKERPFSAI
WTGDDPILATLRTRPRPLKGRCGACAYQAVCGGNTRIRALQLTGDPWAEDPACYLSGSEIGAEGADLDRLAVTPFRGKSH
DPAHRFL
>Q92Z29 1.7.2.1~~~nirK~~~Copper-containing nitrite reductase~~~
MSEQFQMTRRSMLAGAAIAGAVTPLIGAVSAHAEEAVAKTAHINVASLPRVKVDLVKPPFVHAHTQKAEGGPKVVEFTLT
IEEKKIVIDEQGTELHAMTFNGSVPGPLMVVHQDDYVELTLINPDTNTLQHNIDFHSATGALGGGALTVVNPGDTTVLRF
KASKAGVFVYHCAPPGMVPWHVTSGMNGAIMVLPREGLTDGKGNSITYDKVYYVGEQDFYVPRDANGKFKKYESVGEAYA
DTLEVMRTLTPSHIVFNGAVGALTGDSALKAAVGEKVLIVHSQANRDTRPHLIGGHGDYVWATGKFRNAPDVDQETWFIP
GGTAGAAFYTFEQPGIYAYVNHNLIEAFELGAAAHFAVTGDWNDDLMTSVRAPSGT
>Q9I609 1.3.-.-~~~nirN~~~Dihydro-heme d1 dehydrogenase~~~
MRLIGLALGLLLGALAQAGEAPGEALYRQHCQACHGAGRLGGSGPTLLPESLSRLKPAQAREVILHGRPATQMAGFAGQL
DDAAADALVAYLYQAPPREPQWSAEDIRASQVQPHPLATLPSRPRFEADPLNLFVVVESGDHHVTILDGDRFEPIARFPS
RYALHGGPKFSPDGRLVYFASRDGWVTLYDLYNLKVVAEVRAGLNTRNLAVSDDGRWVLVGNYLPGNLVLLDARDLSLVQ
VIPAADAQGQASRVSAVYTAPPRHSFVVALKDVHELWELPYANGKPVAPKRLAVADYLDDFSFSPDYRYLLGSSRQARGG
EVIELDSGARVASIPLSGMPHLGSGIYWKRDGRWVFATPNISRGVISVIDLQNWKPLKEIVTDGPGFFMRSHADSPYAWT
DTFLGKKHDEILLIDKQTLEIAHRLRPSPGKVAGHVEFTRDGRYALLSVWDRDGALVVYDAHSLEEVKRLPMNKPSGKYN
VGNKIGYAEGTSH
>Q51481 ~~~nirQ~~~Denitrification regulatory protein NirQ~~~
MRDATPFYEATGHEIEVFERAWRHGLPVLLKGPTGCGKTRFVQYMARRLELPLYSVACHDDLGAADLLGRHLIGADGTWW
QDGPLTRAVREGGICYLDEVVEARQDTTVAIHPLADDRRELYLERTGETLQAPPSFMLVVSYNPGYQNLLKGLKPSTRQR
FVALRFDYPAAQQEARILVGESGCAETLAQRLVQLGQALRRLEQHDLEEVASTRLLIFAARLIGDGMDPREACRVALAEP
LSDDPATVAALMDIVDLHVA
>Q02441 ~~~nirQ~~~Denitrification regulatory protein NirQ~~~COG0714
MRYLPVNAIEIPTTAGTPDAPFYQPLGNEEQLFQQAWQHGMPVLIKGPTGCGKTRFVQHMAHRLNLPLYTVACHDDLSAA
DLVGRHLIGAQGTWWQDGPLTRAVREGGICYLDEVVEARQDTAVVLHPLADDRRELFIERTGEALKAPPGFMLVVSYNPG
YQNLLKGMKPSTRQRFVAMRFDYPPTAEEERIVANEAQVDAALAAQVVKLGQALRRLEQHDLEEVASTRLLIFTARMIRS
GMTPRQACLACLAEPLSDDPQTVAALMDVVYVHFG
>P72181 1.7.2.1~~~nirS~~~Nitrite reductase~~~COG2010
MRQRTPFARPGLLASAALALVLGPLAASAQEQVAPPKDPAAALEDHKTRTDNRYEPSLDNLAQQDVAAPGAPEGVSALSD
AQYNEANKIYFERCAGCHGVLRKGATGKALTPDLTRDLGFDYLQSFITYGSPAGMPNWGTSGELSAEQVDLMANYLLLDP
AAPPEFGMKEMRESWKVHVAPEDRPTQQENDWDLENLFSVTLRDAGQIALIDGATYEIKSVLDTGYAVHISRLSASGRYL
FVIGRDGKVNMIDLWMKEPTTVAEIKIGSEARSIETSKMEGWEDKYAIAGAYWPPQYVIMDGETLEPKKIQSTRGMTYDE
QEYHPEPRVAAILASHYRPEFIVNVKETGKILLVDYTDLDNLKTTEISAERFLHDGGLDGSHRYFITAANARNKLVVIDT
KEGKLVAIEDTGGQTPHPGRGANFVHPTFGPVWATSHMGDDSVALIGTDPEGHPDNAWKILDSFPALGGGSLFIKTHPNS
QYLYVDATLNPEAEISGSVAVFDIKAMTGDGSDPEFKTLPIAEWAGITEGQPRVVQGEFNKDGTEVWFSVWNGKDQESAL
VVVDDKTLELKHVIKDERLVTPTGKFNVYNTMTDTY
>P24474 1.7.2.1~~~nirS~~~Nitrite reductase~~~
MPFGKPLVGTLLASLTLLGLATAHAKDDMKAAEQYQGAASAVDPAHVVRTNGAPDMSESEFNEAKQIYFQRCAGCHGVLR
KGATGKPLTPDITQQRGQQYLEALITYGTPLGMPNWGSSGELSKEQITLMAKYIQHTPPQPPEWGMPEMRESWKVLVKPE
DRPKKQLNDLDLPNLFSVTLRDAGQIALVDGDSKKIVKVIDTGYAVHISRMSASGRYLLVIGRDARIDMIDLWAKEPTKV
AEIKIGIEARSVESSKFKGYEDRYTIAGAYWPPQFAIMDGETLEPKQIVSTRGMTVDTQTYHPEPRVAAIIASHEHPEFI
VNVKETGKVLLVNYKDIDNLTVTSIGAAPFLHDGGWDSSHRYFMTAANNSNKVAVIDSKDRRLSALVDVGKTPHPGRGAN
FVHPKYGPVWSTSHLGDGSISLIGTDPKNHPQYAWKKVAELQGQGGGSLFIKTHPKSSHLYVDTTFNPDARISQSVAVFD
LKNLDAKYQVLPIAEWADLGEGAKRVVQPEYNKRGDEVWFSVWNGKNDSSALVVVDDKTLKLKAVVKDPRLITPTGKFNV
YNTQHDVY
>P24040 1.7.2.1~~~nirS~~~Nitrite reductase~~~COG2010
MSNVGKPILAGLIAGLSLLGLAVAQAAAPEMTAEEKEASKQIYFERCAGCHGVLRKGATGKNLEPHWSKTEADGKKTEGG
TLNLGTKRLENIIAYGTEGGMVNYDDILTKEEINMMARYIQHTPDIPPEFSLQDMKDSWNLIVPVEKRVTKQMNKINLQN
VFAVTLRDAGKLALIDGDTHKIWKVLESGYAVHISRMSASGRYVYTTGRDGLTTIIDLWPEEPMTVATVRFGSDMRSVDV
SKFEGYEDKYLIGGTYWPPQYSIVDGLTLEPIKVVSTRGQTVDGEYHPEPRVASIVASHIKPEWVVNVKETGQIILVDYT
DLKNLKTTTIESAKFLHDGGWDYSKRYFMVAANASNKVAAVDTKTGKLAALIDTAKIPHPGRGANFVHPQFGPVWSTGHL
GDDVVSLISTPSEESKYAKYKEHNWKVVQELKMPGAGNLFVKTHPKSKHFWADAPMNPEREVAESVYVFDMNDLSKAPIQ
LNVAKDSGLPESKAIRRAVQPEYNKAGDEVWISLWGGKTDQSAIVIYDDKTLKLKRVITDPAVVTPTGKFNVFNTMNDVY
>P24038 ~~~nirT~~~Denitrification system component NirT~~~
MTDKDGNKQQKGGILALLRRPSTRYSLGGILIVGIVAGIVFWGGFNTALEATNTETFCISCHEMGDNVYPEYKETIHYAN
RTGVRATCPDCHVPRDWTHKMVRKVEASKELWGKIVGTIDTAEKFEAKRLTLARREWARMRASDSRECRNCHSLESMSSD
MQKQRARKQHEMAREDNLTCIACHKGIAHHLPEGMTEEDED
>P25006 1.7.2.1~~~nirK~~~Copper-containing nitrite reductase~~~
MTEQLQMTRRTMLAGAALAGAVAPLLHTAQAHAAGAAAAAGAAPVDISTLPRVKVDLVKPPFVHAHDQVAKTGPRVVEFT
MTIEEKKLVIDREGTEIHAMTFNGSVPGPLMVVHENDYVELRLINPDTNTLLHNIDFHAATGALGGGALTQVNPGEETTL
RFKATKPGVFVYHCAPEGMVPWHVTSGMNGAIMVLPRDGLKDEKGQPLTYDKIYYVGEQDFYVPKDEAGNYKKYETPGEA
YEDAVKAMRTLTPTHIVFNGAVGALTGDHALTAAVGERVLVVHSQANRDTRPHLIGGHGDYVWATGKFRNPPDLDQETWL
IPGGTAGAAFYTFRQPGVYAYVNHNLIEAFELGAAGHFKVTGEWNDDLMTSVVKPASM
>P38501 1.7.2.1~~~nirK~~~Copper-containing nitrite reductase~~~
MAEQMQISRRTILAGAALAGALAPVLATTSAWGQGAVRKATAAEIAALPRQKVELVDPPFVHAHSQVAEGGPKVVEFTMV
IEEKKIVIDDAGTEVHAMAFNGTVPGPLMVVHQDDYLELTLINPETNTLMHNIDFHAATGALGGGGLTEINPGEKTILRF
KATKPGVFVYHCAPPGMVPWHVVSGMNGAIMVLPREGLHDGKGKALTYDKIYYVGEQDFYVPRDENGKYKKYEAPGDAYE
DTVKVMRTLTPTHVVFNGAVGALTGDKAMTAAVGEKVLIVHSQANRDTRPHLIGGHGDYVWATGKFNTPPDVDQETWFIP
GGAAGAAFYTFQQPGIYAYVNHNLIEAFELGAAAHFKVTGEWNDDLMTSVLAPSGT
>P81445 1.7.2.1~~~nirK~~~Copper-containing nitrite reductase~~~
GLPRVAVDLVAPPLVHPHSQVAAGAPKVVQFRMSIEEKKMVADDDGTTAQAMTFNGSVPGPTLVVHEGDYIELTLVNPAT
NSMPHNVDFHAATGALGGAGLTQVVPGQEAVLRFKADRSGTFVYHCAPAGMVPWHVVSGMNGALMVLPRDGLRDAAGAAL
AYDRVYTIGESDLYVPKAADGNYSDYPALASAYADTVAVMRTLTPSHAVFNGAVGALTGANALTAAVGESVLIIHSQANR
DSRPHLIGGHGDWVWTTGKFANPPQLNMETWFIPGGSAAAALYTFKQPGTYAYLSHNLIEAMELGAAAQASVEGQWDDDL
MTSVAAPGPA
>Q53239 1.7.2.1~~~nirK~~~Copper-containing nitrite reductase~~~COG2132
MFTRRAALVGAAALASAPLVIRTAGAEEAPAQLASAAPVDLSNLPRVKHTLVPPPFAHAHEQVAASGPVINEFEMRIIEK
EVQLDEDAYLQAMTFDGSIPGPLMIVHEGDYVELTLINPPENTMPHNIDFHAATGALGGGGLTLINPGEKVVLRFKATRA
GAFVYHCAPGGPMIPWHVVSGMAGCIMVLPRDGLKDHEGKPVRYDTVYYIGESDHYIPKDEDGTYMRFSDPSEGYEDMVA
VMDTLIPSHIVFNGAVGALTGEGALKAKVGDNVLFVHSQPNRDSRPHLIGGHGDLVWETGKFHNAPERDLETWFIRGGSA
GAALYKFLQPGVYAYVNHNLIEAVHKGATAHVLVEGEWDNDLMEQVVAPVGLTG
>Q06006 1.7.2.1~~~nirK~~~Copper-containing nitrite reductase~~~
MSVFRSVLGACVLLGSCASSLALAGGAEGLQRVKVDLVAPPLVHPHEQVVSGPPKVVQFRMSIEEKKMVIDDQGTTLQAM
TFNGSMPGPTLVVHEGDYIELTLVNPATNSMPHNVDFHAATGALGGAGLTQVVPGQEVVLRFKADRSGTFVYHCAPQGMV
PWHVVSGMNGALMVLPRDGLRDPQGKLLHYDRVYTIGESDLYIPKDKDGHYKDYPDLASSYQDTRAVMRTLTPSHVVFNG
RVGALTGANALTSKVGESVLFIHSQANRDSRPHLIGGHGDWVWTTGKFANPPQRNMETWFIPGGSAVAALYTFKQPGTYV
YLSHNLIEAMELGALAQIKVEGQWDDDLMTQVKAPGPIVEPKQ
>L0DSL2 1.7.2.2~~~nir~~~Cytochrome c-552~~~COG3303
MNDLNRLGRVGRWIAGAACLFLASAAHAEPGENLKPVDAMQCFDCHTQIEDMHTVGKHATVNCVHCHDATEHVETASSRR
MGERPVTRMDLEACATCHTAQFNSFVEVRHESHPRLEKATPTSRSPMFDKLIAGHGFAFEHAEPRSHAFMLVDHFVVDRA
YGGRFQFKNWQKVTDGMGAVRGAWTVLTDADPESSDQRRFLSQTATAANPVCLNCKTQDHILDWAYMGDEHEAAKWSRTS
EVVEFARDLNHPLNCFMCHDPHSAGPRVVRDGLINAVVDRGLGTYPHDPVKSEQQGMTKVTFQRGREDFRAIGLLDTADS
NVMCAQCHVEYNCNPGYQLSDGSRVGMDDRRANHFFWANVFDYKEAAQEIDFFDFRHATTGAALPKLQHPEAETFWGSVH
ERNGVACADCHMPKVQLENGKVYTSHSQRTPRDMMGQACLNCHAEWTEDQALYAIDYIKNYTHGKIVKSEYWLAKMIDLF
PVAKRAGVSEDVLNQARELHYDAHLYWEWWTAENSVGFHNPDQARESLMTSISKSKEAVSLLNDAIDAQVASR
>P20103 ~~~nisB~~~Nisin biosynthesis protein NisB~~~
MIKSSFKAQPFLVRNTILCPNDKRSFTEYTQVIETVSKNKVFLEQLLLANPKLYDVMQKYNAGLLKKKRVKKLFESIYKY
YKRSYLRSTPFGLFSETSIGVFSKSSQYKLMGKTTKGIRLDTQWLIRLVHKMEVDFSKKLSFTRNNANYKFGDRVFQVYT
INSSELEEVNIKYTNVYQIISEFCENDYQKYEDICETVTLCYGDEYRELSEQYLGSLIVNHYLISNLQKDLLSDFSWNTF
LTKVEAIDEDKKYIIPLKKVQKFIQEYSEIEIGEGIEKLKEIYQEMSQILENDNYIQIDLISDSEINFDVKQKQQLEHLA
EFLGNTTKSVRRTYLDDYKDKFIEKYGVDQEVQITELFDSTFGIGAPYNYNHPRNDFYESEPSTLYYSEEEREKYLSMYV
EAVKNHNVINLDDLESHYQKMDLEKKSELQGLELFLNLAKEYEKDIFILGDIVGNNNLGGASGRFSALSPELTSYHRTIV
DSVERENENKEITSCEIVFLPENIRHANVMHTSIMRRKVLPFFTSTSHNEVLLTNIYIGIDEKEKFYARDISTQEVLKFY
ITSMYNKTLFSNELRFLYEISLDDKFGNLPWELIYRDFDYIPRLVFDEIVISPAKWKIWGRDVNSKMTIRELIQSKEIPK
EFYIVNGDNKVYLSQKNPLDMEILESAIKKSSKRKDFIELQEYFEDENIINKGEKGRVADVVVPFIRTRALGNEGRAFIR
EKRVSVERREKLPFNEWLYLKLYISINRQNEFLLSYLPDIQKIVANLGGNLFFLRYTDPKPHIRLRIKCSDLFLAYGSIL
EILKRSRKNRIMSTFDISIYDQEVERYGGFDTLELSEAIFCADSKIIPNLLTLIKDTNNDWKVDDVSILVNYLYLKCFFQ
NDNKKILNFLNLVSTKKVKENVNEKIEHYLKLLKVNNLGDQIFYDKNFKELKHAIKNLFLKMIAQDFELQKVYSIIDSII
HVHNNRLIGIERDKEKLIYYTLQRLFVSEEYMK
>Q03202 ~~~nisC~~~Nisin biosynthesis protein NisC~~~
MRIMMNKKNIKRNVEKIIAQWDERTRKNKENFDFGELTLSTGLPGIILMLAELKNKDNSKIYQKKIDNYIEYIVSKLSTY
GLLTGSLYSGAAGIALSILHLREDDEKYKNLLDSLNRYIEYFVREKIEGFNLENITPPDYDVIEGLSGILSYLLLINDEQ
YDDLKILIINFLSNLTKENNGLISLYIKSENQMSQSESEMYPLGCLNMGLAHGLAGVGCILAYAHIKGYSNEASLSALQK
IIFIYEKFELERKKQFLWKDGLVADELKKEKVIREASFIRDAWCYGGPGISLLYLYGGLALDNDYFVDKAEKILESAMQR
KLGIDSYMICHGYSGLIEICSLFKRLLNTKKFDSYMEEFNVNSEQILEEYGDESGTGFLEGISGCILVLSKFEYSINFTY
WRQALLLFDDFLKGGKRK
>P42708 ~~~nisI~~~Nisin immunity protein~~~
MRRYLILIVALIGITGLSGCYQTSHKKVRFDEGSYTNFIYDNKSYFVTDKEIPQENVNNSKVKFYKLLIVDMKSEKLLSS
SNKNSVTLVLNNIYEASDKSLCMGINDRYYKILPESDKGAVKALRLQNFDVTSDISDDNFVIDKNDSRKIDYMGNIYSIS
DTTVSDEELGEYQDVLAEVRVFDSVSGKSIPRSEWGRIDKDGSNSKQSRTEWDYGEIHSIRGKSLTEAFAVEINDDFKLA
TKVGN
>Q07596 3.4.21.-~~~nisP~~~Nisin leader peptide-processing serine protease NisP~~~
MKKILGFLFIVCSLGLSATVHGETTNSQQLLSNNINTELINHNSNAILSSTEGSTTDSINLGAQSPAVKSTTRTELDVTG
AAKTLLQTSAVQKEMKVSLQETQVSSEFSKRDSVTNKEAVPVSKDELLEQSEVVVSTSSIQKNKILDNKKKRANFVTSSP
LIKEKPSNSKDASGVIDNSASPLSYRKAKEVVSLRQPLKNQKVEAQPLLISNSSEKKASVYTNSHDFWDYQWDMKYVTNN
GESYALYQPSKKISVGIIDSGIMEEHPDLSNSLGNYFKNLVPKGGFDNEEPDETGNPSDIVDKMGHGTEVAGQITANGNI
LGVAPGITVNIYRVFGENLSKSEWVARAIRRAADDGNKVINISAGQYLMISGSYDDGTNDYQEYLNYKSAINYATAKGSI
VVAALGNDSLNIQDNQTMINFLKRFRSIKVPGKVVDAPSVFEDVIAVGGIDGYGNISDFSNIGADAIYAPAGTTANFKKY
GQDKFVSQGYYLKDWLFTTANTGWYQYVYGNSFATPKVSGALALVVDKYGIKNPNQLKRFLLMNSPEVNGNRVLNIVDLL
NGKNKAFSLDTDKGQDDAINHKSMENLKESRDTMKQEQDKEIQRNTNNNFSIKNDFHNISKEVISVDYNINQKMANNRNS
RGAVSVRSQEILPVTGDGEDFLPALGIVCISILGILKRKTKN
>P0DP66 3.5.1.128~~~nit1~~~Deaminated glutathione amidase~~~
MKPYLAAALQMTSRPNLTENLQEAEELIDLAVRQGAELVGLPENFAFLGNETEKLEQATAIATATEKFLQTMAQRFQVTI
LAGGFPFPVAGEAGKAYNTATLIAPNGQELARYHKVHLFDVNVPDGNTYWESATVMAGQKYPPVYHSDSFGNLGLSICYD
VRFPELYRYLSRQGADVLFVPAAFTAYTGKDHWQVLLQARAIENTCYVIAPAQTGCHYERRHTHGHAMIIDPWGVILADA
GEKPGLAIAEINPDRLKQVRQQMPSLQHRVFV
>P0DP68 3.5.1.128~~~nit1~~~Deaminated glutathione amidase~~~COG0388
MKNANVALLQLCSGENTRDNLAQIEQQIKQLNSGIQLVMTPENALLFANAASYRHHAEQHNDGPLQQEVREMARRYGVWI
QVGSMPMISRESPDLITTSSLLFDSQGELKARYDKIHMFDVDIKDIHGRYRESDTYQPGEHLTVADTPVGRLGMTVCYDL
RFPGLFQALRAQGAEIISVPAAFTKVTGEAHWEILLRARAIENQCVILAAAQVGRHGATRRTWGHTMAVDAWGKIIGQNP
DAVSALKVKIETTGLKTIRNQMPVLQHNRFVSSLVPRLSDSKQSSK
>Q89GE3 3.5.5.1~~~~~~Nitrilase bll6402~~~COG0388
MQDTKFKVAVVQAAPVFMDAPASVAKAIGFIAEAGAAGAKLLAFPEVWIPGYPWWLWLGTPAWGMQFVPRYHANSLRADG
PDILALCAAAAEAKINVVMGFSEIDGGTLYLSQVFISDAGEIIFKRRKLKPTHVERTLYGEGDGSDFRVVESSVGRLGAL
CCAEHIQPLSKYAMYSMNEQVHVASWPSFTLYRDKAYALGHEVNLAASQIYALEGGCFVLHASAITGQDMFDMLCDTPEK
ADLLNAEGAKPGGGYSMIFGPDGQPMCEHLPQDKEGILYADVDLSMIAIAKAAYDPTGHYARGDVVRLMVNRSPRRTSVS
FSEDENAAVTFTET
>Q89PT3 3.5.5.7~~~~~~Nitrilase blr3397~~~COG0388
MMDSNRPNTYKAAVVQAASDPTSSLVSAQKAAALIEKAAGAGARLVVFPEAFIGGYPKGNSFGAPVGMRKPEGREAFRLY
WEAAIDLDGVEVETIAAAAAATGAFTVIGCIEREQGTLYCTALFFDGARGLVGKHRKLMPTAGERLIWGFGDGSTMPVFE
TSLGNIGAVICWENYMPMLRMHMYSQGISIYCAPTADDRDTWLPTMQHIALEGRCFVLTACQHLKRGAFPADYECALGAD
PETVLMRGGSAIVNPLGKVLAGPCFEGETILYADIALDEVTRGKFDFDAAGHYSRPDVFQLVVDDRPKRAVSTVSAVRAR
N
>B2JQY2 3.5.5.1~~~~~~Nitrilase~~~COG0388
MSDQRVIRAAAVQIAPDFERPGGTLDRVCAAIDEAASKGVQLIVFPETFVPYYPYFSFVRPPVASGADHMRLYEQAVVVP
GPVTHAVSERARRHAMVVVLGVNERDHGSLYNTQLVFDIDGCQVLKRRKITPTFHERMIWGQGDAAGLKVARTGIARVGA
LACWEHYNPLARYALMTQHEEIHCSQFPGSLVGPIFAEQIEVTIRHHALESGCFVVNSTGWLSDAQIESVTTDPKLQKAL
RGGCMTAIVSPEGQHLAEPLREGEGMVVADLDMALITKRKRMMDSVGHYARPELLSLAINDRPAMPVVPMSMSFERAGAD
VAPEIISGGQDECQHEPVAG
>Q500U1 3.5.5.1~~~~~~Nitrilase~~~COG0388
MKEPLKVACVQAAPVFLDLDATVDKTITLMEQAAAAGAGLIAFPETWIPGYPWFLWLDAPAWNMPLVQRYHQQSLVLDSV
QARRISDAARHLGLYVVLGYSERNKASLYIGQWIIDDHGETVGVRRKLKATHVERTMFGEGDGASLRTFETPVGVLGALC
CWEHLQPLSKYAMYAQNEQIHVAAWPSFSLYRNATSALGPEVNTAASRVYAAEGQCFVLAPCAIVSPEMIEMLCDSDAKR
SLLQAGGGHARIFGPDGSDLATPLGEHEEGLLYATLDPAALTLAKVAADPAGHYSRPDVTRLMFNPNPTPCVVDLPDLPI
SSESIELLRPDIALEV
>C4RPA1 1.14.13.187~~~evdC~~~L-evernosamine nitrososynthase~~~COG1960
MSNLTEERWVAADLRAPLTPAGRTVVDLLAGVIPRISAEAADRDRTGTFPVEAFEQFAKLGLMGATVPAELGGLGLTRLY
DVATALMRLAEADASTALAWHVQLSRGLTLTYEWQHGTPPVRAMAERLLRAMAEGEAAVCGALKDAPGVVTELHSDGAGG
WLLSGRKVLVSMAPIATHFFVHAQRRDDDGSVFLAVPVVHRDAPGLTVLDNWDGLGMRASGTLEVVFDRCPVRADELLER
GPVGARRDAVLAGQTVSSITMLGIYAGIAQAARDIAVGFCAGRGGEPRAGARALVAGLDTRLYALRTTVGAALTNADAAS
VDLSGDPDERGRRMMTPFQYAKMTVNELAPAVVDDCLSLVGGLAYTAGHPLSRLYRDVRAGGFMQPYSYVDAVDYLSGQA
LGLDRDNDYMSVRALRSRTSA
>A0A0R4I990 1.14.13.-~~~dnmZ~~~Amino sugar nitrososynthase DnmZ~~~
MTKPSVHEHPGVLADNGLCEPKTPAGRRLLDLLERYLPALEAESRDNDREATLPVHLFDRMRKEGVLGATVPEDLGGLGV
HSLHDVALALARIAGRDAGVALALHMQFSRGLTLDFEWRHGAPSTRPLAEDLLRQMGAGEAVICGAVKDVRGTTVLTRAT
DGSYRLNGRKTLVSMAGIATHYVVSTRLEEAGAPVRLAAPVVARTTPGLTVLDNWDGMGMRSSGSVDIVFDGCPVDRDRV
LPRGEPGVRDDAALAGQTVSSIAMLGIYVGIAEAARRIALTELRRRGGAPAGVRTTVAEIDARLFALHTAVASALTTADR
LADDLSGDLAARGRAMMTPFQYAKLLVNRHSVGVVDDCLMLVGGAGYSNSHPLARLYRDVRAGGFMHPYNFTDGVDYLSE
VALGR
>Q2PC69 1.14.13.-~~~rubN8~~~Amino sugar nitrososynthase RubN8~~~
METEQAPRPAEPPGDLTTAITAPGEQLLTLLDRHLPRIRAQAAPNDRDSTFPAATFHGFARDGVLGATVPAELGGMGVSR
LHDVAVALLRVAEADASTALALHAQFSRGITLTYEWLHGPPPTRKLAERLLRAMARGEAVIGGAVKDHGRETTRLRPDGS
GGWLLSGRKTLVTMAPIATHFVVSAQAPAAGGTTLLYAPIVARDTPGLSIVDGWTGLGMRASGTLDVAFDDCPVPAGNLL
ARGSVGAHSDAALAGQAVSSVAMLGIYVGVAQAARDLAVETMARRSATPPAASRTLVAETEARLYALRATASAALVNVDE
LSPRHDMDPDERGRRMMTPFQCAKVMVNQLAAAVVDDCLTVVGGATYAAEHPLARLSRDVRAGRFMQPYTYADGVDYLSA
QALGLERDNNYVSLRATRPVDSR
>Q93NG1 3.5.1.111~~~~~~2-oxoglutaramate amidase~~~
MNLMEVRELAPSRGQLDVAAVQVKFDSTELLEDRISRIQDLVSGVGKADLIVLPELWLHGGFSYDSWRKNAISLESEVFT
FLSEVARDKKAWFHAGSFMVTEPSSAASDMWNTSVLFDPTGSLRATYKKIHRFGFSDGEPKLIAAGDEPRVVELQTERAT
AITGLSTCYDLRFPELYRHISAEGTALNVIPACWPLTRIQHWQTLGRARAIENQSFVVQCNMTGVDQEVELGGHSQIVDG
NGDILAQADKEEAVLRATLNFDSLNELRSSFPVLNDRRADIWAAKGKTVIASHL
>G9AIU0 3.5.5.1~~~nit~~~Aliphatic nitrilase~~~
MTKFRAAVVQAAPVPNDVEATIEKTINLIREAAARGANVAVFPEAFIGGYPKGANFNIHIGARTPEGRQEFADYRAGAIA
VPGSETEQLAQAAHEAGLYLTIGVIERDGGTLYCTALYFTPDGLAGKHRKLMPTGAERLCWGFGDGSTLDTVQTPWGSMG
AVICWENYMPLMRTAMYGKGIALYCAPTADDRDSWAATMRHIALEGRCFVLSACQYLTRKDFPESMGNRITDEPDAVLMR
GGAIIVDPLGRVVAGPDYSGETILTADLDTDDIPRAQFDFDVVGHYARPDVFKLVVDEEPKSAVVTRA
>Q48262 ~~~nixA~~~High-affinity nickel-transport protein NixA~~~COG3376
MKLWFPYFLAIVFLHALGLALLFMANNASFYAAASMAYMLGAKHAFDADHIACIDNTIRKLTQQGKNAYGVGFYFSMGHS
SVVILMTIISAFAIAWAKEHTPMLEEIGGVVGTLVSGLFLLIIGLLNAIILLDLLKIFKKSHSNESLSQQQNEEIERLLT
SRGLLNRFFKPLFNFVSKSWHIYPIGFLFGLGFDTASEIALLALSSSAIKVSMVGMLSLPILFAAGMSLFDTLDGAFMLK
AYDWAFKTPLRKIYYNISITALSVFIALFIGLIELFQVVSEKLHLKFENRLLRALQSLEFTDLGYYLVGLFVIAFLGSFF
LWKIKFSKLES
>Q2FUR9 ~~~nixA~~~Nickel transporter NixA~~~COG3376
MTVFKNERLSWLPYIAIVILLHVIGFSFLWIAGKDHHILFGMGILAYTLGLRHAFDADHIAAIDNTVRKLLQQRKDPSGV
GFYFSIGHSSVVFLMAVFLGVSVKWAKDELPHFQDIGGTIGTLVSGFFLVLIGVLNLIILISLINLFAKLRREHIEEAEV
DALLESRGLVSRFVGPYFKLITRSWHVLPLGFLFGLGFDTASEIALLALSSGASQQAISFIGILSLPILFASGMSLLDTL
DGVVMKYAYNWAFFNPIRKIYYNITITAISVMAALVIGMIELLQILADKLDLHGAFWAFIGSIEFDYLGYILVALFLITW
LISSLIWKFGRIEHKWSR
>B7UI21 2.4.1.-~~~nleB1~~~Protein-arginine N-acetylglucosaminyltransferase NleB1~~~
MLSSLNVLQSSFRGKTALSNSTLLQKVSFAGKEYSLEPIDERTPILFQWFEARPERYEKGEVPILNTKEHPYLSNIINAA
KIENERIIGVLVDGNFTYEQKKEFLNLENEHQNIKIIYRADVDFSMYDKKLSDIYLENIHKQESYPASERDNYLLGLLRE
ELKNIPEGKDSLIESYAEKREHTWFDFFRNLAILKAGSLFTETGKTGCHNISPCSGCIYLDADMIITDKLGVLYAPDGIA
VHVDCNDEIKSLENGAIVVNRSNHPALLAGLDIMKSKVDAHPYYDGLGKGIKRHFNYSSLHNYNAFCDFIEFKHENIIPN
TSMYTSSSW
>Q8XBX8 2.4.1.-~~~nleB1~~~Protein-arginine N-acetylglucosaminyltransferase NleB1~~~
MLSSLNVLQSSFRGKTALSNSTLLQKVSFAGKEYPLEPIDEKTPILFQWFEARPERYEKGEVPILNTKEHPYLSNIINAA
KIENERIIGVLVDGNFTYEQKKEFLSLENEYQNIKIIYRADVDFSMYDKKLSDIYLENIHKQESYPASERDNYLLGLLRE
ELKNIPEGKDSLIESYAEKREHTWFDFFRNLAMLKAGSLFTETGKTGCHNISPCSGCIYLDADMIITDKLGVLYAPDGIA
VHVDCNDEIKSLENGAIVVNRSNHPALLAGLDIMKSKVDAHPYYDGLGKGIKRHFNYSSLHDYNAFCDFIEFKHENIIPN
TSMYTCSSW
>B7UNX3 2.4.1.-~~~nleB2~~~Protein-arginine N-acetylglucosaminyltransferase NleB2~~~
MLSPIRTTFHNSVNIVQSSPCQTVSFAGKEYELKVIDEKTPILFQWFEPNPERYKKDEVPIVNTKQHPYLDNVTNAARIE
SDRMIGIFVDGDFSVNQKTAFSKLERDFENVMIIYREDVDFSMYDRKLSDIYHDIICEQRLRTEDKRDEYLLNLLEKELR
EISKAQDSLISMYAKKRNHAWFDFFRNLALLKAGEIFRCTYNTKNHGISFGEGCIYLDMDMILTGKLGTIYAPDGISMHV
DRRNDSVNIENSAIIVNRSNHPALLEGLSFMHSKVDAHPYYDGLGKGVKKYFNFTPLHNYNHFCDFIEFNHPNIIMNTSQ
YTCSSW
>A0A023YYV9 2.4.1.-~~~nleB2~~~Protein-arginine N-acetylglucosaminyltransferase NleB2~~~
MLSPIRTTFHNSVNIVQSSPCQTVSFAGKEYELKVIDEKTPILFQWFEPNPERYKKDEVPIVNTKQHPYLDNVTNAARIE
SDRMIGIFVDGDFSVNQKTAFSKLERDFENVMIIYREDVDFSMYDRKLSDIYHDIICEQRLRTEDKRDEYLLNLLEKELR
EISKAQDSLISMYAKKRNHAWFDFFRNLALLKAGEIFRCTYNTKNHGISFGEGGIYLDMDMILTGKLGTIYAPDGISMHV
DRRNDSVNIENSAIIVNRSNHPALLEGLSFMHSKVDAHPYYDGLGKGVKKYFNFTPLHNYNHFCDFIEFNHPNIIMNTSQ
YTCSSW
>A0A482PDI9 2.4.1.-~~~nleB~~~Protein-arginine N-acetylglucosaminyltransferase NleB~~~
MLSPLNVLQFNFRGETALSDSAPLQTVSFAGKDYSMEPIDEKTPILFQWFEARPERYGKGEVPILNTKEHPYLSNIINAA
KIENERVIGVLVDGDFTYEQRKEFLSLEDEHQNIKIIYRENVDFSMYDKKLSDIYLENIHEQESYPASERDNYLLGLLRE
ELKNIPYGKDSLIESYAEKRGHTWFDFFRNLAVLKGGGLFTETGKTGCHNISPCGGCIYLDADMIITDKLGVLYAPDGIA
VYVDCNDNRKSLENGAIVVNRSNHPALLAGLDIMKSKVDAHPYYDGVGKGLKRHFNYSSLQDYNVFCNFIEFKHKNIIPN
TSMYTNSSW
>B7UI22 2.1.1.-~~~nleE~~~Cysteine S-methyltransferase NleE~~~
MINPVTNTQGVSPINTKYAEHVVKNIYPKIKHDYFNESPNIYDKKYISGITRGVAELKQEEFVNEKARRFSYMKTMYSVC
PEAFEPISRNEASTPEGSWLTVISGKRPMGQFSVDSLYNPDLHALCELPDICCKIFPKENNDFLYIVVVYRNDSPLGEQR
ANRFIELYNIKRDIMQELNYELPELKAVKSEMIIAREMGEIFSYMPGEIDSYMKYINNKLSKIE
>Q7DBA6 2.1.1.-~~~nleE~~~Cysteine S-methyltransferase NleE~~~
MINPVTNTQGVSPINTKYAEHVVKNIYPEIKHDYFNESPNIYDKKYISGITRGVAELKQEEFVNEKARRFSYMKTMYSVC
PEAFEPISRNEASTPEGSWLTVISGKRPMGQFSVDSLYNPDLHALCELPDICCKIFPKENNDFLYIVVVYRNDSPLGEQR
ANRFIELYNIKRDIMQELNYELPELKAVKSEMIIAREMGEIFSYMPGEIDSYMKYINNKLSKIE
>Q8XAL7 ~~~nleF~~~Effector protein NleF~~~
MLPTSGSSANLYSWMYVSGRGNPSTPESVSELNHNHFLSPELQDKLDVMVSIYSCARNNNELEEIFQELSAFVSGLMDKR
NSVFEVRNENTDEVVGALRAGMTIEDRDSYIRDLFFLHSLKVKIEESRQGKEDSKCKVYNLLCPHHSSELYGDLRAMKCL
VEGCSDDFNPFDIIRVPDLTYNKGSLQCG
>A0A1W5X0D5 ~~~~~~Sodium-lithium/proton antiporter~~~
MFRYLSKRQWILLLLGILFVVAGYFILPVSVPLIIALITALFLNPAVRWMQFRFRLNRKMAVTIVFLLFVIMIGLLGTYA
VTRAVTQLVELADNAPSYINQINNVLINWQNNMNSFTQNMPSEFVDKVSVELQNTIDTTTQTLSQKLQLSNIAAFAAKIP
EYLISFLVYLIALFLFMLELPRLKDKMHGNFTESTSEKVKFMNARLSYVVFGFLKAQFLVSIVIFVVCLIGLFWITPEVA
IVMSLIIWIVDFVPIIGSIVILGPWALYMLIVGDIAMGGQLAMLAIILLAIRRTVEPKVMGRHIGLSPLATLIAMYIGLQ
LIGLMGFILGPLLVIAFNSAKEAGIIRWNFKL
>P9WK87 3.1.1.1~~~nlhH~~~Carboxylesterase NlhH~~~COG0657
MTEPTVARPDIDPVLKMLLDTFPVTFTAADGVEVARARLRQLKTPPELLPELRIEERTVGYDGLTDIPVRVYWPPVVRDN
LPVVVYYHGGGWSLGGLDTHDPVARAHAVGAQAIVVSVDYRLAPEHPYPAGIDDSWAALRWVGENAAELGGDPSRIAVAG
DSAGGNISAVMAQLARDVGGPPLVFQLLWYPTTMADLSLPSFTENADAPILDRDVIDAFLAWYVPGLDISDHTMLPTTLA
PGNADLSGLPPAFIGTAEHDPLRDDGACYAELLTAAGVSVELSNEPTMVHGYVNFALVVPAAAEATGRGLAALKRALHA
>P04846 ~~~nlpA~~~Lipoprotein 28~~~COG1464
MKLTTHHLRTGAALLLAGILLAGCDQSSSDAKHIKVGVINGAEQDVAEVAKKVAKEKYGLDVELVGFSGSLLPNDATNHG
ELDANVFQHRPFLEQDNQAHGYKLVAVGNTFVFPMAGYSKKIKTVAQIKEGATVAIPNDPTNLGRALLLLQKEKLITLKE
GKGLLPTALDITDNPRHLQIMELEGAQLPRVLDDPKVDVAIISTTYIQQTGLSPVHDSVFIEDKNSPYVNILVAREDNKN
AENVKEFLQSYQSPEVAKAAETIFNGGAVPGW
>P0ADA3 ~~~nlpD~~~Murein hydrolase activator NlpD~~~COG1388
MSAGSPKFTVRRIAALSLVSLWLAGCSDTSNPPAPVSSVNGNAPANTNSGMLITPPPKMGTTSTAQQPQIQPVQQPQIQA
TQQPQIQPVQPVAQQPVQMENGRIVYNRQYGNIPKGSYSGSTYTVKKGDTLFYIAWITGNDFRDLAQRNNIQAPYALNVG
QTLQVGNASGTPITGGNAITQADAAEQGVVIKPAQNSTVAVASQPTITYSESSGEQSANKMLPNNKPTATTVTAPVTVPT
ASTTEPTVSSTSTSTPISTWRWPTEGKVIETFGASEGGNKGIDIAGSKGQAIIATADGRVVYAGNALRGYGNLIIIKHND
DYLSAYAHNDTMLVREQQEVKAGQKIATMGSTGTSSTRLHFEIRYKGKSVNPLRYLPQR
>P39700 ~~~nlpD~~~Murein hydrolase activator NlpD~~~
MSAGSPKFTVSRIAALSLVSLWLAGCTSSSNPPAPVTSVDSGSSSNTNSGMLITPPPKMGATTQQTPQQAPQIQPVQRPV
TQPMQTQPVTEQPVQMENGRIVYNRQYGNIPKGSYTGGSTYTVKKGDTLFYIAWITGNDFRDLAQRNSISAPYSLNVGQT
LQVGNASGMPITGGNAITQADAAQQGVVTRSAQNSTVAVASQPTITYSEGSGEQSANKMLPNNKPAGTVVTAPVTAPTVS
TTEPNASSTSTSAPISAWRWPTDGKVIENFGASEGGNKGIDIAGSKGQAIVATADGRVVYAGNALRGYGNLIIIKHNDDY
LSAYAHNDTMLVREQQEVKAGQKIATMGSTGTSSTRLHFEIRYKGKSVNPLRYLPQR
>P40710 ~~~nlpE~~~Lipoprotein NlpE~~~COG3015
MVKKAIVTAMAVISLFTLMGCNNRAEVDTLSPAQAAELKPMPQSWRGVLPCADCEGIETSLFLEKDGTWVMNERYLGARE
EPSSFASYGTWARTADKLVLTDSKGEKSYYRAKGDALEMLDREGNPIESQFNYTLEAAQSSLPMTPMTLRGMYFYMADAA
TFTDCATGKRFMVANNAELERSYLAARGHSEKPVLLSVEGHFTLEGNPDTGAPTKVLAPDTAGKFYPNQDCSSLGQ
>P0AFB3 ~~~nlpI~~~Lipoprotein NlpI~~~COG4785
MKPFLRWCFVATALTLAGCSNTSWRKSEVLAVPLQPTLQQEVILARMEQILASRALTDDERAQLLYERGVLYDSLGLRAL
ARNDFSQALAIRPDMPEVFNYLGIYLTQAGNFDAAYEAFDSVLELDPTYNYAHLNRGIALYYGGRDKLAQDDLLAFYQDD
PNDPFRSLWLYLAEQKLDEKQAKEVLKQHFEKSDKEQWGWNIVEFYLGNISEQTLMERLKADATDNTSLAEHLSETNFYL
GKYYLSLGDLDSATALFKLAVANNVHNFVEHRYALLELSLLGQDQDDLAESDQQ
>C4ZSQ4 ~~~nlpI~~~Lipoprotein NlpI~~~
MKPFLRWCFVATALTLAGCSNTSWRKSEVLAVPLQPTLQQEVILARMEQILASRALTDDERAQLLYERGVLYDSLGLRAL
ARNDFSQALAIRPDMPEVFNYLGIYLTQAGNFDAAYEAFDSVLELDPTYNYAHLNRGIALYYGGRDKLAQDDLLAFYQDD
PNDPFRSLWLYLAEQKLDEKQAKEVLKQHFEKSDKEQWGWNIVEFYLGNISEQTLMERLKADATDNTSLAEHLSETNFYL
GKYYLSLGDLDSATALFKLAVANNVHNFVEHRYALLELSLLGQDQDDLAESDQQ
>P0AFB1 ~~~nlpI~~~Lipoprotein NlpI~~~COG4785
MKPFLRWCFVATALTLAGCSNTSWRKSEVLAVPLQPTLQQEVILARMEQILASRALTDDERAQLLYERGVLYDSLGLRAL
ARNDFSQALAIRPDMPEVFNYLGIYLTQAGNFDAAYEAFDSVLELDPTYNYAHLNRGIALYYGGRDKLAQDDLLAFYQDD
PNDPFRSLWLYLAEQKLDEKQAKEVLKQHFEKSDKEQWGWNIVEFYLGNISEQTLMERLKADATDNTSLAEHLSETNFYL
GKYYLSLGDLDSATALFKLAVANNVHNFVEHRYALLELSLLGQDQDDLAESDQQ
>P44585 ~~~nlpI~~~Lipoprotein NlpI homolog~~~COG4785
MRCFRLSRHFIVYLFSLCAILLLAGCVQSRGGFVSKNHVVLAEQNPNTHFEQEVMIVRLSQVLLVGKMSNEERASLHFER
GVLYDSLGLWGLARYDLTQALALQPKMASVYNYLGLYLLLEEDYDGALDAFNTVFELDSGYDYTHLNRGLNFYYVGRYHL
AQQDFLQFYQADKKDPYRVLWLYLNEQKLKPQEAQTNLVERAKGLSEDFWGTHIVQYYLGHISVEELQQRASEFAENSQQ
YAEILTETYFYLAKQKLNVGLVDEAAALFKLAMANQVYNFVEYRFAAFELMKLKPVQTEDEKEEKSAVTKAIVF
>O50258 1.15.1.1~~~nlr~~~Neelaredoxin~~~
MKMCDMFQTADWKTEKHVPAIECDDAVAADAFFPVTVSLGKEIAHPNTTEHHIRWIRCYFKPEGDKFSYEVGSFEFTAHG
ECAKGPNEGPVYTNHTVTFQLKIKTPGVLVASSFCNIHGLWESSKAVALK
>Q7DDR9 2.7.7.108~~~~~~Protein adenylyltransferase NmFic~~~
MPSENPIGKTMKSIDEQSLHNARRLFESGDIDRIEVGTTAGLQQIHRYLFGGLYDFAGQIREDNISKGGFRFANAMYLKE
ALVKIEQMPERTFEEIIAKYVEMNIAHPFLEGNGRSTRIWLDLVLKKNLKKVVNWQNVSKTLYLQAMERSPVNDLELRFL
LKDNLTDDVDNREIIFKGIEQSYYYEGYEKG
>B2TEK6 1.13.12.-~~~~~~Nitronate monooxygenase~~~COG2070
MTDQTGKRALLPLLGIDKPIIQAPMAGVSTPALAAAVCNAGGLGSLGVGAMNADGARKVIRETRALTDKPFNINVFCHRP
AQADAAVEQQWLSWLAPHFEKYGATPPAKLSDIYTSFLADPAMLAVFLEEKPAIVSFHFGLPSADVIAELKKAGIRLLAS
ATNLQEAAQVEAAGVDAIVAQGIEAGGHRGVFDPDAFDDRLGTFALTRLLAKECRLPVIAAGGIMDGAGIAAALALGAQA
AQLGTAFVACTETSIDEGYRRALLGEAARRTTFTAAISGRLARSMANSFTALGADPRSPEPATYPIAYDAGKALNAAAKA
KGEFGYGAHWAGQAAALARSLPAAELVAQLERELKQSIEQLRQFAN
>Q9HWH9 1.13.12.-~~~nmoA~~~Nitronate monooxygenase~~~
MTDRFTRLLGIQQPIIQAPMLGVSTPALAAAVSNAGGLGSIAITGSAAEKGRALIREVRGLTDKPFNVNLFCHRPGQADP
ARERAWLDYLKPLFAEFGAEPPVRLKNIYLSFLEDPTLLPMLLEERPAAVSFHFGAPPRDQVRALQAVGIRVLVCATTPE
EAALVEAAGADAVVAQGIEAGGHRGVFEPERGDAAIGTLALVRLLAARGSLPVVAAGGIMDGRGIRAALELGASAVQMGT
AFVLCPESSANAAYREALKGPRAARTALTVTMSGRSARGLPNRMFFDAAAPGVPPLPDYPFVYDATKALQTAALARGNHD
FAAQWAGQGAALARELPAAELLRTLVEELRG
>D0V3Y4 1.13.12.-~~~pnoA~~~Nitronate monooxygenase~~~
MSNSLLSLLNIELPIIQSPMVGVSTPRLAAAVSNAGGLGSIGIGASNVEQARAMLRETAALTDRPFNVNLFCHVPARADA
AREAQWLAFLAPLFAEFESPAPAALREIYTSFVEDFAMLEMLLEEKPAVVSFHFGLPPQSSIDALKDAGIVLLACVTNLA
EAQQAEHAGVHALVAQGYEAGGHRGVFDPQQDSEMGTFALVRVLTDACQLPVIAAGGIMDGAGIKAVMQLGASAAQLGTA
FILCPESSANPAYRDALQGPRAHQTRVTSAISGRPARGMVNRNYIDLETNAPALPDYPIAYDANKALNAAAANKANTDFA
VQWAGQGAPLARSMPAAALVNLLAAEMKA
>O69711 ~~~nmtR~~~HTH-type transcriptional regulator NmtR~~~COG0640
MGHGVEGRNRPSAPLDSQAAAQVASTLQALATPSRLMILTQLRNGPLPVTDLAEAIGMEQSAVSHQLRVLRNLGLVVGDR
AGRSIVYSLYDTHVAQLLDEAIYHSEHLHLGLSDRHPSAG
>F4ZCI3 3.5.99.9~~~nnhA~~~2-nitroimidazole nitrohydrolase~~~
MTTVDKRPSSRGYGDWRLSDIPQYKDGISTYEFVRATHEADYRTHQAEPVAGRTFGFNGIGRLTEVALHMPTKYTLHDQS
SQYKESPSFFQGLMGVPDRGPVDLAAFQRETEELATAFENNGIKVHWVDYPEEPANPYGPLMGHVFLSWGSIWRGGSVIS
RFGFLPGMVGVSEYLAKWAWNTLNIPPLVAITEGAMEPGACNMIADEVLVTCLSASYDQRGTDQLVAAISKTSGTEEFHN
LQLRPAVEGFFNKATGACAHPDININAIDVGKLVVSPAALDWDARTWLYDNNFELIEADPDEQREFLAPCNVLLLEPGKV
IAHADCHKTNQKIRDAGVEVIEVTGTEIRKACGGIKCRVMQINREPGPTLADVRNRVWR
>P94368 4.2.1.136~~~nnrD~~~ADP-dependent (S)-NAD(P)H-hydrate dehydratase~~~COG0063
MNVPFWTEEHVRATLPERDAESHKGTYGTALLLAGSDDMPGAALLAGLGAMRSGLGKLVIGTSENVIPLIVPVLPEATYW
RDGWKKAADAQLEETYRAIAIGPGLPQTESVQQAVDHVLTADCPVILDAGALAKRTYPKREGPVILTPHPGEFFRMTGVP
VNELQKKRAEYAKEWAAQLQTVIVLKGNQTVIAFPDGDCWLNPTGNGALAKGGTGDTLTGMILGMLCCHEDPKHAVLNAV
YLHGACAELWTDEHSAHTLLAHELSDILPRVWKRFE
>Q833Y3 4.2.1.136~~~nnrD~~~ADP-dependent (S)-NAD(P)H-hydrate dehydratase~~~COG0063
MRYLSKDILEEVITQRPSDSYKSNFGRVVLIGGNRQYGGAIIMSTEACINSGAGLTTVITDVKNHGPLHARCPEAMVVGF
EETVLLTNVVEQADVILIGPGLGLDATAQQILKMVLAQHQKQQWLIIDGSAITLFSQGNFSLTYPEKVVFTPHQMEWQRL
SHLPIEQQTLANNQRQQAKLGSTIVLKSHRTTIFHAGEPFQNTGGNPGMATGGTGDTLAGIIAGFLAQFKPTIETIAGAV
YLHSLIGDDLAKTDYVVLPTKISQALPTYMKKYAQPHTAPDSELLEQKRSR
>P31806 ~~~nnr~~~Bifunctional NAD(P)H-hydrate repair enzyme Nnr~~~COG0062
MTDHTMKKNPVSIPHTVWYADDIRRGEREAADVLGLTLYELMLRAGEAAFQVCRSAYPDARHWLVLCGHGNNGGDGYVVA
RLAKAVGIEVTLLAQESDKPLPEEAALAREAWLNAGGEIHASNIVWPESVDLIVDALLGTGLRQAPRESISQLIDHANSH
PAPIVAVDIPSGLLAETGATPGAVINADHTITFIALKPGLLTGKARDVTGQLHFDSLGLDSWLAGQETKIQRFSAEQLSH
WLKPRRPTSHKGDHGRLVIIGGDHGTAGAIRMTGEAALRAGAGLVRVLTRSENIAPLLTARPELMVHELTMDSLTESLEW
ADVVVIGPGLGQQEWGKKALQKVENFRKPMLWDADALNLLAINPDKRHNRVITPHPGEAARLLGCSVAEIESDRLHCAKR
LVQRYGGVAVLKGAGTVVAAHPDALGIIDAGNAGMASGGMGDVLSGIIGALLGQKLSPYDAACAGCVAHGAAADVLAARF
GTRGMLATDLFSTLQRIVNPEVTDKNHDESSNSAP
>P56176 ~~~nnr~~~Bifunctional NAD(P)H-hydrate repair enzyme Nnr~~~COG0062
MLSVYEKVNALDKRAIEELFLSEDILMENAAMALERAVLQNASLGAKVIILCGSGDNGGDGYALARRLVGRFKTLVFEMK
LAKSPMCQLQQERAKKAGVVIKAYEENALNQNLECDVLIDCVIGSHFKGKLEPFLNFESLSQKARFKIACDIPSGIDSKG
RVDKGAFKADLTISMGAIKSCLLSDRAKDYVGELKVGHLGVFNQIYEIPTDTFLLEKSDLKLPLRDRKNAHKGDYGHAHV
LLGKHSGAGLLSALSALSFGSGVVSVQALECEITSNNKPLELVFCENFPNLLSAFALGMGLENIPKDFNKWLELAPCVLD
AGVFYHKEVLQALEKEVILTPHPKEFLSLLKLVGINISMLELLDNKLEIARDFSQKYPKVVLLLKGANTLIAHQGQVFIN
ILGSVALAKAGSGDVLAGLILSLLSQNYTPLDAAINASSAHALASLEFKNNYALTPLDLIEKIKQL
>P9WF11 ~~~nnr~~~Bifunctional NAD(P)H-hydrate repair enzyme Nnr~~~COG0062
MRHYYSVDTIRAAEAPLLASLPDGALMRRAAFGLATEIGRELTARTGGVVGRRVCAVVGSGDNGGDALWAATFLRRRGAA
ADAVLLNPDRTHRKALAAFTKSGGRLVESVSAATDLVIDGVVGISGSGPLRPAAAQVFAAVQAAAIPVVAVDIPSGIDVA
TGAITGPAVHAALTVTFGGLKPVHALADCGRVVLVDIGLDLAHTDVLGFEATDVAARWPVPGPRDDKYTQGVTGVLAGSS
TYPGAAVLCTGAAVAATSGMVRYAGTAHAEVLAHWPEVIASPTPAAAGRVQAWVVGPGLGTDEAGAAALWFALDTDLPVL
VDADGLTMLADHPDLVAGRNAPTVLTPHAGEFARLAGAPPGDDRVGACRQLADALGATVLLKGNVTVIADPGGPVYLNPA
GQSWAATAGSGDVLSGMIGALLASGLPSGEAAAAAAFVHARASAAAAADPGPGDAPTSASRISGHIRAALAAL
>Q9X024 ~~~nnr~~~Bifunctional NAD(P)H-hydrate repair enzyme Nnr~~~COG0062
MKEIDELTIKEYGVDSRILMERAGISVVLAMEEELGNLSDYRFLVLCGGGNNGGDGFVVARNLLGVVKDVLVVFLGKKKT
PDCEYNYGLYKKFGGKVVEQFEPSILNEFDVVVDAIFGTGLRGEITGEYAEIINLVNKSGKVVVSVDVPSGIDSNTGKVL
RTAVKADLTVTFGVPKIGHILFPGRDLTGKLKVANIGHPVHLINSINRYVITREMVRSLLPERPRDSHKGTYGKVLIIAG
SRLYSGAPVLSGMGSLKVGTGLVKLAVPFPQNLIATSRFPELISVPIDTEKGFFSLQNLQECLELSKDVDVVAIGPGLGN
NEHVREFVNEFLKTLEKPAVIDADAINVLDTSVLKERKSPAVLTPHPGEMARLVKKTVGDVKYNYELAEEFAKENDCVLV
LKSATTIVTDGEKTLFNITGNTGLSKGGSGDVLTGMIAGFIAQGLSPLEASTVSVYLHGFAAELFEQDERGLTASELLRL
IPEAIRRLKE
>Q8DTM1 2.6.1.14~~~aspB~~~Asparagine--oxo-acid transaminase~~~COG0436
MTKLSRRVLEMEESVTLATSARAKTLKAQGRDVLELSLGQPDFVTPKNIQEAAMKSIRDGRASFYTIASGLPELKDAISQ
YFEKFYGYSVERKQIVVGTGAKFILYALFAAVINPKDEVIIPTPFWVSYADQIKMNDGVPVFIRTSEENHFKATVEQLEA
ARTNKTKMIVLNSPSNPTGMIYSKKELEAIGNWAVKHDILILSDDIYGRLVYNGARFTPISTISQPICQQTIVINGVSKT
YSMTGWRVGYAVGDPEIIGAMSKIVSQTTSNLTTAAQYAAIEALIGNQDTVEVMRQAFEERLNTIYPLLAKVPGFHVVKP
EGAFYFFPNVKKAMEMKGYTDVTEFTTALLEETGVALVTGAGFGAPENVRLSYATDMVTLKEAINRIQAFMEK
>Q5J1R2 5.1.1.14~~~nocJ~~~Nocardicin C-9' epimerase~~~
MGALPRVPLITAPTRLHPVDGLAPRRVLVKRDDENSPVFGGCKTRALEFVLGAARAAGATAVLTSGTAGSNHVAATALHA
GRLGFRVTALVLPQEPGALVARNLRLAAGAGARLEPVPDGVSVHPDRERHRAAVAELRERGERVHVIPFGGADPVAGVAH
ALAGLELAEQARGLPGPLRVHLPAASTLTAAGIAAGLALSGLPFQVTAVDVVGSSSTLGPGLLGRAREVAALLGGPADAV
RPEHVRHVGYAGAPYGVPDPEAGRCADLLREAADVRVDECYGAKAFHHLLGEVGDADGTHLFWHTGSTREAGEVFGPVPP
ELLCYVE
>P35120 ~~~nocT~~~Nopaline-binding periplasmic protein~~~
MKFFNLNALAAVVTGVLLAAGPTQAKDYKSITIATEGSYAPYNFKDAGGKLIGFDIDLGNDLCKRMNIECKFVEQAWDGI
IPSLTAGRYDAIMAAMGIQPAREKVIAFSRPYLLTPMTFLTTADSPLLKTQVAIENLPLDNITPEQKAELDKFTKIFEGV
KFGVQAGTSHEAFMKQMMPSVQISTYDTIDNVVMDLKAGRIDASLASVSFLKPLTDKPDNKDLKMFGPRMTGGLFGKGVG
VGIRKEDADLKALFDKAIDAAIADGTVQKLSQQWFGYDASPKQ
>P37524 ~~~noc~~~Nucleoid occlusion protein~~~COG1475
MKHSFSRFFGLGEKEQEPEIAEHDTNKEEILEIPVNAIVPNRFQPRTIFSDEKIKELAMTIHTHGIIQPIVVRHTEEEGQ
YELIAGERRWRAVQSLEWEKIPAIIKDFSDTETASVALIENLQREELSSIEEAHAYARLLELHDLTQEALAQRLGKGQST
IANKLRLLKLPQPVQEAIMEKKITERHARALIPLKQPELQVTLLTEIIEKSLNVKQTEDRVVKMLEQGQRKPKPRRKAFS
RDTRIAMNTIRQSLSMVEDSGVKLNTEEEEFEEYIQLTIRIPK
>P02962 2.3.1.-~~~nodA~~~Nodulation protein A~~~
MSLKVQWKLCWENQLERADHQELSEFFRKSYGPTGAFHAKPFEGGRSWAGARPERRAIAYDSVGIASHMGVLRRFIKVGE
TDLLVAELGLYAVRPDLERMGIAHSVGALTPTLRELGVPFAFGTVRHAMRNHVERYCQNGMASILTGVRVRSSIAEVNAD
LPSTRTEDPLVVIFPVGRPLNEWPPGTLIERNGSEL
>P55700 ~~~nodD2~~~Nodulation protein D 2~~~COG0583
MRFKGLDLNLLVALDALMTKRSVTAAARSINLSQPAMSAAIARLRTYFGDDLFTMRGRELIPTPRAIALAPAVRDALLHI
QFSIISWDMFNPVQSERRFRIRLSDVMMLVFFERVVKRLAREAPGIGFELLPLTEDPDELLRYGDVDFVILPELFASSDH
PKAKLLDDTLVCVGCPTNKQLKRQLSFENYGSMGHIAAKFGRTLKPSIENWLLLEHGLKRRIEVVVPGFSLIPPLLSGTD
RIATMPLRLVEHFAKTTPLRVAELPLALPPFAQAVQWPSLHNRDQASIWMRQVLLQEALHMTAPRDSVEYRP
>P04685 ~~~nodF~~~Nodulation protein F~~~
MADQLTLEIISAINKLVKAENGERTSVALGEITTDTELTSLGIDSLGLADVLWDLEQLYGIKIEMNTADAWSNLNNIGDV
VEAVRGLLTKEV
>P25195 2.6.1.16~~~nodM~~~Glutamine--fructose-6-phosphate aminotransferase [isomerizing]~~~
MCGIVGIVGHQPVSERLVEALEPLEYRGYDSAGVATMDAGTLQRRRAEGKLGNLREKLKEAPLSGTIGIAHTRWATHGAP
TERNAHPHFTEGVAVVHNGIIENFAELKDELAAGGAEFQTETDTEVVAHLLAKYRRDGLGRREAMHAMLKRVKGAYALAV
LFEDDPSTIMAARTGPLAIGHGNGEMFLGSDAIALAPFTNEITYLIDGDWAVIGKTGVHIFDFDGNVVERPRQISTAAAF
LVDKGNHRHFMEKEIYEQPEVIAIALGHYVNVIDKSCRSDSDAIDFAGVESLAISCCGTAYLAGLIGKYWFERYARLPVE
IAVASEFRYREIPLSPQSALFISQSGETADTLASLRYCKAHGLRIGAVVNARESTMARESDAVFPILAGPEIGVARTKAF
TCQLAVLAALRAGAGKARGTISGDEEQALIKSLAEMPAIMGQVLNSIQPEIEVLSRELSNCRDVLYLGRGTSFPLAMEGA
LKLKEISYIQPKSYAAGQLKHGPYALIDENMPVIVIAPHDRFFDKTVTNMQEVARGGRIILITDEKGAAASKLDTMHTIV
LPEVDEIIAPMIFSLPLQLLAYHTAVFMGTDVDQPRNLAKSVTVE
>P15728 ~~~nodO~~~Nodulation protein O~~~
MNIKGSDNGSFIKGSPENDIIDGGKKNDWIDAGNGDDRIKAGDGQDSITAGPGHDIVWAGKGSDVIHADGGDDLLYSDAS
YPLYVTDPHRVIPHSGEGDDVLYAGPGSDILVAGDGADVLTGGDDGDAFVFRFHDPMVGTTHCYTSVMDFDTKQDRFVLD
AADFGGDRNLFDANFINHSKGFPGEFVDTFYNGAAEGAHGEHVVVITDRGFASAAAAATAIDHEARGDIIVFHDQKTLGQ
DGETHGATLAYVDSANHAHAFAHVDNLHDMSDLTSLTAENFGFI
>P55473 2.1.1.-~~~noeI~~~2-O-methyltransferase NoeI~~~COG0275
MEVGRYLKEGLLISLQQRLREFAEGIRKPTSSLKVHRQIVSLLEKPDPVILDIGCNDGSDARRFLQLRPKAQLFCFEPDP
RAAARCRENMGPLDRMRLFEVAISDRNGRIDFHPSNGDGDAKEWDLSGSIRQPKNHLSEYQWVRFDGPISVETRRLDDWC
SEAGLESIDLIWMDVQGAESDVIAGGKETLTKTRFIYTEYSDQELYEGQLPLRAILDLLPSFELVAQFPRGVEGDVLLRN
TKL
>P31061 ~~~nohA~~~Prophage DNA-packing protein NohA~~~COG4220
MEVNKKQLADIFGASIRTIQNWQEQGMPVLRGGGKGNEVLYDSAAVIRWYAERDAEIENEKLRREVEELRQASETDLQPG
TIEYERHRLTRAQADAQELKNARDSAEVVETAFCTFVLSRIAGEIASILDGIPLSVQRRFPELENRHVDFLKRDIIKAMN
KAAALDELIPGLLSEYNRADRQYAGGSRS
>P22537 ~~~nolA~~~Nodulation protein NolA~~~COG0789
MNRATPRRRRWRIGELAEATGVTVRTLHHYEHTGLLAATERTEGGHRMYDRESGQRVHQIRALRELGFSLVEIRKAMEGT
TSLTDLLRKHLERIEVQVARTTLLRDRLRNMTIDSEAQVSVDELPATLNAMSRAETRSQTSRCTCNLAAEREDRWRRIRD
DLRDCMDGGEHPCGERAKAVAVAARLLISEIAGDDSRVSMILKVLARLSAPRSLAGWDPCLMQYLDLALGGLEDQPY
>P33208 ~~~nolB~~~Nodulation protein NolB~~~
MMLPVTSISNSLPRVASSSFGEQAQFERALAQAADSMKNDTASTPVRTAPISPPMDVHRAAPTSPLKDRVLQTISAICPD
SIVAAAAPNHKAALISGAPPGPSQKLPVEDGAGTERLGIPQGGHDFDVMVAGLRDLYNGVTQVALVSKGISGITSSVNKL
LKEG
>P55713 ~~~nolB~~~Nodulation protein NolB~~~
MMLPVTSISNSLPRVASSSFGEQAQFERALAQAADSMKNDTASTPVRTAPISPPMDVHRAAPTSPLEDRVLQTISSICPD
SIVAAAAPNHKAALISGAPPGPSQKLPVEGGAGTERLGIPQGGHDFDVMVAGLRDLYNGVTQVALVSKGISGITSSVNKL
LKEG
>P12780 ~~~nolJ~~~Nodulation protein NolJ~~~
MDIRLDSRRAIQAGPLVTTGRIAGAALRMARPFLGIARRLQFKAKAFELALRSVALQLMNDAMADADEAMEETEEDADAL
GGPRQEVRAVSDGTIVTEREMTCPQNRYCHECSWTGAAAGQNVAAKGAGAAVPGPNRRTGSGERRGYG
>P33209 ~~~nolT~~~Nodulation protein NolT~~~
MFGSAHGDTTSSDTSGRRPLRLVVLPLLLALSSCKVDLYTQLQEREANEIVALLMDNGVDAVRVAGKDGTSTIQVDEKLL
AFSIKLLNGKGLPRQSFKNLGEIFQGSGLIASPTEERARYVYALSEELSHTISDIDGVFSARVHVVLPHNDLLRAGDTPS
SASVFIRHDAKTNLPALLPKIKMLVAESIEGLAYDKVEVVLVPVERSAQEQRSLLEPDLAQASRPIPVPLLAVAVGVGAA
VFAVTCYLLFIVLGHRRRQLTGELSRVQERPGVSALAAIRKKIPALGRR
>P33210 ~~~nolU~~~Nodulation protein NolU~~~
MKLPMPIAIQNTQPDVSFQSHSVPRSELAASTHPTRLAARLDPELSAATVVQLQKCARLQPRLAELLLGNDMDWNRIGWG
PDLLRGHDPRRAALLAGSIWHARSLLKVVSQRDLARLVERIGADAHAFGIRHLAHAIADKLISDPEKLALQIEHDGHACL
GAWLNIRPALERNRVLLRLPLGTAAENPAPEHDGASSGLFSLVIAHFEMESP
>P33213 ~~~nolX~~~Nodulation protein NolX~~~
MSASNLLPMISSNPAQFAQASLAKAFAPRVAQGQQSVLSFEAMLSTNMLDRIGPLASREDLLPPDAESTLEDLQKDPLAL
LPPHMRAAIESMDQTPQSAVAIDDHYVAPAPIQSSRITWNGGSLTKPELQIVAVLNRHKDLCPLSWESLEAKVNDPSTPP
DLKAAIEALLQDPELFYAIGSQGDGRCGGKITAKDLSEFSKHHPQVAAFQESQAQSYAQNYIPSDSAENAQPSVMTENDA
LRELYRYSEYLPKNLSLADFKQIVDGEAKTGKCPPQVIAAAQYFVSHPEEWKRLYGGNIDKVHKEDFLEVASSSMSLTQA
ELDTLKTINSHQELFFGSGDLTRDKLASMADDKSLDPKVREAASQLLSDPLLFGLLNNAITGYKTHHGFFDFGGGHTVDS
GNVSKEDFGRFYTNMTTANRTVQQPKFHVPETEAAQNAVADMKMGLADQPDIKSPKKNGGASCMSSTPCCG
>P55704 ~~~nopL~~~Nodulation outer protein L~~~
MDINSTSPLNASPQPDSPPPANASAFAHQLSGFQYSPPHAADSLLPQVEADSPYLDTRHPYSQYLDSAYPYPSPCEWQHD
LYTRTRERSPHPSEQRPHARVLQGAPEHDQDQHLEAAGPREGSWQVGPSRSGPSQAGLSPSATPLNPSPPPHATDLETKH
PYSQYLDWANPSLLDWQQDLHTRATASPAPLTAERGRSPQPSEQQPHARALQVPEYDQDLIWQRVDAAGPQAGPWQVGPS
HSGPSQARPSHAWPSSSAGAEPAELSDFVMDSGVRAWDHWFLAPHMASEDQMSMLRATGLMPTAEVPTTTFLMMGMRHVA
EFRGEGVIRIRPSVDFDI
>P55724 ~~~nopP~~~Effector protein NopP~~~
MYGRIDSSSDFHYTQSASKQMDAETQEFADTFARMHLDRSNGGSSSAARYTLDHEPPVVPIDLETFRREIRKFHGKEITD
IANNPQEYSDFVSAKARRTADVAQQYGIRRDSENARYFSYQLGNQCVGLMRTEGGFSMEEEFESKSWRDQFPGHQEITST
VDLQVAHPLVENAGDILLEHQLRRDGERPLLNWRAENPEAKARAAMMGFVEVDDCDMVLDPKQHPDKWTQTSAAEWRRKD
KPPLYLRKFEDAETAQCSTSCSYETYEDDFM
>P55711 ~~~nopX~~~Nodulation outer protein X~~~
MSASNLLPMISSNPAQFAQASLAKAFAPRVAQGQQSVLSFEAMLSTNMLDRIGPLASREDLPPPDAESTLEDLQKDPLAL
LPPHMRAAIESMDQTPQSAVVIDDHYVAPAPIQSSRITWNGGSLTKPELQIVAVLNRHKDLCPLSWESLEAKANDPSTPP
DLKAAIEALLQDPELFYAIGSQGDGRCGGKISAKDLSEFSKHHPQVAAFQESQAQSYAQNYIPSDSAENAQPSVMTENDA
LRELYRYSEYLPKNLSLADFKQIVDGEAKTGKCPPQVIAAAQYFVSHPEEWKQLYGGNIDKVHKEDFLQVASSSMSLTQA
ELDTLKTINSHQELFFGSGDLTRDKLASMADDKSLDPKVREAASQLLSDPLLFGLLNNAITGYKTHHGFFDFGGGHTVDS
GNVSKEDFGRFYTNMTTANRTVQQPKFHVPETEAAQNAVADMKMGLADQPDIKSPKKNGGALMHVVDSVLRVGSKVLDWA
ATAVGVLSFIPGIGQVADLVSMTLACEAQAANLLRTAITGGNMKQALIEAGIGVAAQAVGLVSGPGVKLAIRNGLARKAI
EEAATAGINLPLSMAQHYAEGYLNDLKARLAADHPA
>P0A0J7 ~~~norA~~~Quinolone resistance protein NorA~~~
MNKQIFVLYFNIFLIFLGIGLVIPVLPVYLKDLGLTGSDLGLLVAAFALSQMIISPFGGTLADKLGKKLIICIGLILFSV
SEFMFAVGHNFSVLMLSRVIGGMSAGMVMPGVTGLIADISPSHQKAKNFGYMSAIINSGFILGPGIGGFMAEVSHRMPFY
FAGALGILAFIMSIVLIHDPKKSTTSGFQKLEPQLLTKINWKVFITPVILTLVLSFGLSAFETLYSLYTADKVNYSPKDI
SIAITGGGIFGALFQIYFFDKFMKYFSELTFIAWSLLYSVVVLILLVFANGYWSIMLISFVVFIGFDMIRPAITNYFSNI
AGERQGFAGGLNSTFTSMGNFIGPLIAGALFDVHIEAPIYMAIGVSLAGVVIVLIEKQHRAKLKEQNM
>Q59647 1.7.2.5~~~norB~~~Nitric oxide reductase subunit B~~~
MMSPNGSLKFASQAVAKPYFVFALILFVGQILFGLIMGLQYVVGDFLFPAIPFNVARMVHTNLLIVWLLFGFMGAAYYLV
PEESDCELYSPKLAWILFWVFAAAGVLTILGYLLVPYAGLARLTGNELWPTMGREFLEQPTISKAGIVIVALGFLFNVGM
TVLRGRKTAISMVLMTGLIGLALLFLFSFYNPENLTRDKFYWWWVVHLWVEGVWELIMGAILAFVLVKITGVDREVIEKW
LYVIIAMALISGIIGTGHHYFWIGVPGYWLWLGSVFSALEPLPFFAMVLFAFNTINRRRRRDYPNRAVALWAMGTTVMAF
LGAGVWGFMHTLAPVNYYTHGTQLTAAHGHMAFYGAYAMIVMTIISYAMPRLRGIGEAMDNRSQVLEMWGFWLMTVAMVF
ITLFLSAAGVLQVWLQRMPADGAAMTFMATQDQLAIFYWLREGAGVVFLIGLVAYLLSFRRGKAAA
>Q2FYJ5 ~~~norB~~~Quinolone resistance protein NorB~~~COG0477
MEKPSREAFEGNNKLLIGIVLSVITFWLFAQSLVNVVPILEDSFNTDIGTVNIAVSITALFSGMFVVGAGGLADKYGRIK
LTNIGIILNILGSLLIIISNIPLLLIIGRLIQGLSAACIMPATLSIIKSYYIGKDRQRALSYWSIGSWGGSGVCSFFGGA
VATLLGWRWIFILSIIISLIALFLIKGTPETKSKSISLNKFDIKGLVLLVIMLLSLNILITKGSELGVTSLLFITLLAIA
IGSFSLFIVLEKRATNPLIDFKLFKNKAYTGATASNFLLNGVAGTLIVANTFVQRGLGYSSLQAGSLSITYLVMVLIMIR
VGEKLLQTLGCKKPMLIGTGVLIVGECLISLTFLPEIFYVICCIIGYLFFGLGLGIYATPSTDTAIANAPLEKVGVAAGI
YKMASALGGAFGVALSGAVYAIVSNMTNIYTGAMIALWLNAGMGILSFVIILLLVPKQNDTQL
>A6QGY6 ~~~norB~~~Quinolone resistance protein NorB~~~
MEKPSREAFEGNNKLLIGIVLSVITFWLFAQSLVNVVPILEDSFNTDIGTVNIAVSITALFSGMFVVGAGGLADKYGRIK
LTNIGIILNILGSLLIIISNIPLLLIIGRLIQGLSAACIMPATLSIIKSYYIGKDRQRALSYWSIGSWGGSGVCSFFGGA
VATLLGWRWIFILSIIISLIALFLIKGTPETKSKSISLNKFDIKGLVLLVIMLLSLNILITKGSELGVTSLLFITLLAIA
IGSFSLFIVLEKRATNPLIDFKLFKNKAYTGATASNFLLNGVAGTLIVANTFVQRGLGYSSLQAGSLSITYLVMVLIMIR
VGEKLLQTLGCKKPMLIGTGVLIVGECLISLTFLPEIFYVICCIIGYLFFGLGLGIYATPSTDTAIANAPLEKVGVAAGI
YKMASALGGAFGVALSGAVYAIVSNMTNIYTGAMIALWLNAGMGILSFVIILLLVPKQNDTQL
>Q8NWQ5 ~~~norB~~~Quinolone resistance protein NorB~~~
MEKPSREAFEGNNKLLIGIVLSVITFWLFAQSLVNVVPILEDSFNTDIGTVNIAVSITALFSGMFVVGAGGLADKYGRIK
LTNIGIILNILGSLLIIISNIPLLLIIGRLIQGLSAACIMPATLSIIKSYYIGKDRQRALSYWSIGSWGGSGVCSFFGGA
VATLLGWRWIFILSIIISLIALFLIKGTPETKSKSISLNKFDIKGLVLLVIMLLSLNILITKGSELGVSSLLFITLLAIA
IGSFSLFIVLEKRATNPLIDFKLFKNKAYTGATASNFLLNGVAGTLIVANTFVQRGLGYSSLQAGSLSITYLVMVLIMIR
VGEKLLQTLGCKKPMLIGTGVLIVGECLISLTFLPEILYVICCIIGYLFFGLGLGIYATPSTDTAIANAPLEKVGVAAGI
YKMASALGGAFGVALSGAVYAIVSNMTNIYTGAMIALWLNAGMGILSFVIILLLVPKQNDTQL
>P98008 1.7.2.5~~~norB~~~Nitric oxide reductase subunit B~~~COG3256
MSSFNPHLKFQSQAVAKPYFVFALILFVGQVLFGLIMGLQYVVGDFLFPLLPFNVARMVHTNLLIVWLLFGFMGAAYYLI
PEESDCELHSPKLAIILFWVFAAAGVLTILGYLFVPYAALAEMTRNDLLPTMGREFLEQPTITKIGIVVVALGFLYNIGM
TMLKGRKTVVSTVMMTGLIGLAVFFLFAFYNPENLSRDKFYWWFVVHLWVEGVWELIMGAMLAFVLIKVTGVDREVIEKW
LYVIIAMALITGIIGTGHHFFWIGAPTVWLWVGSIFSALEPLPFFAMVLFALNMVNRRRREHPNKAASLWAIGTTVTAFL
GAGVWGFMHTLAPVNYYTHGSQLTAAHGHLAFYGAYAMIVMTMISYAMPRLRGLGEAPDARAQRIEVWGFWLMTISMIAI
TLFLTAAGVVQIWLQRIPADGAAMSFMNTADQLAIFFWLRFIAGVFFLIGLVCYLYSFRQRGRVPVVVAAPAAA
>Q59646 ~~~norC~~~Nitric oxide reductase subunit C~~~
MSETFTKGMARNIYFGGSVFFILLFLALTYHTEKTLPERTNEAAMSAAVVRGKLVWEQNNCVGCHTLLGEGAYFAPELGN
VVGRRGGEEGFNTFLQAWMNIQPLNVPGRRAMPQFHLSEGQVDDLAEFLKWSSKIDTNQWPPNKEG
>Q52527 ~~~norC~~~Nitric oxide reductase subunit C~~~COG2010
MSETFTKGMARNIYFGGSVFFFLVFLGLTYHTEQTFPERTNESEMTEAVVRGKEVWENNNCIGCHSLLGEGAYFAPELGN
VFVRRGGEETFKPFLHAWMKAQPLGAPGRRAMPQFNLSEQQVDDMAEFLKWTSKIDTNDWPPNKEG
>Q2G1P1 ~~~norG~~~HTH-type transcriptional regulator NorG~~~
MKIPSHRQLAIQYNVNRVTIIKSIELLEAEGFIYTKVGSGTYVNDYLNEAHITNKWSEMMLWSSQQRSQYTVQLINKIET
DDSYIHISKGELGISLMPHIQLKKAMSNTASHIEDLSFGYNNGYGYIKLRDIIVERMSKQGINVGRENVMITSGALHAIQ
LLSIGFLGQDAIIISNTPSYIHSTNVFEQLNFRHIDVPYNQINEIDTIIDRFINFKNKAIYIEPRFNNPTGRSLTNEQKK
NIITYSERHNIPIIEDDIFRDIFFSDPTPSIKTYDKLGKVIHISSFSKTIAPAIRIGWIVASEKIIEQLADVRMQIDYGS
SILSQMVVYEMLKNKSYDKHLVKLRYVLKDKRDFMLNILNNLFKDIAHWEVPSGGYFVWLVFKIDIDIKYLFYELLSKEK
ILINPGYIYGSKEKSIRLSFAFESNENIKHALYKIYTYVKKV
>Q9KRU4 ~~~norM~~~Multidrug resistance protein NorM~~~COG0534
MHRYKKEASNLIKLATPVLIASVAQTGMGFVDTIMAGGVSAIDMAAVSIAASIWLPSILFGVGLLMALVPVVAQLNGAGR
QHKIPFEVHQGLILALLVSVPIIAVLFQTQFIIRFMDVEEAMATKTVGYMHAVIFAVPAYLLFQALRSFTDGMSLTKPAM
VIGFIGLLLNIPLNWIFVYGKFGAPELGGVGCGVATAIVYWIMLLLLLFYIVTSKRLAHVKVFETFHKPQPKELIRLFRL
GFPVAAALFFEVTLFAVVALLVAPLGSTVVAAHQVALNFSSLVFMFPMSIGAAVSIRVGHKLGEQDTKGAAIAANVGLMT
GLATACITALLTVLFREQIALLYTENQVVVALAMQLLLFAAIYQCMDAVQVVAAGSLRGYKDMTAIFHRTFISYWVLGLP
TGYILGMTNWLTEQPLGAKGFWLGFIIGLSAAALMLGQRLYWLQKQSDDVQLHLAAK
>O82855 ~~~norM~~~Multidrug resistance protein NorM~~~COG0534
MHRYKEEASSLIKLATPVLIASVAQTGMGFVDTVMAGGVSATDMAAVSVASSIWLPSILFGIGLLMALVPVVAQLNGSAR
REKIPFEIQQGVVLALLISIPIIGVLLQTQFILQLMDVEAVMAGKTVGYIHAVIFAVPAFLLFQTLRSFTDGMSLTKPAM
VIGFIGLLLNIPLNWIFVYGKFGAPELGGVGCGVATTIVYWVMFALLLAYVMTSSRLKSINVFGEYHKPQWKAQVRLFKL
GFPVAAALFFEVTLFAVVALLVSPLGPIIVAAHQVAINFSSLVFMLPMSVGAAVSIRVGHRLGEENVDGARVASRVGIMV
GLALATITAIITVLSRELIAELYTNNPEVISLAMQLLLFAAVYQCTDAVQVIAAGALRGYKDMRAIFNRTFIAYWILGLP
TGYILGRTDWIVEPMGAQGFWLGFIIGLTAAALMLGVRLRWMHRQEPDVQLNFSLQ
>Q9K4V0 ~~~norR1~~~Nitric oxide reductase transcription regulator NorR1~~~COG3604
MTPMYPELLTDLVTDLPHAVRLQRLVSGLRTHFRCGAVALLRLEEEHLRPVAVDGLVRDTLGRRFAVSLHPRLAAILARR
DVTCFHHDSMLPDPYDGLIDEHVGEPLPVHDCMGTSLHLDGRPWGVLTLDALTVGTFDAAAQAELQRLTVIVEAAIRTTR
LEAEIRALQLARGKQPDGEGPADDGEIIGQSQAIAGLLHELEVVADTDLPVLLLGETGVGKELFAHRLHRHSRRRGHPLV
HVNCAALPESLAESELFGHARGAFSGATGERPGRFEAAAGGTLFLDEVGELPLSIQAKLLRTLQNGEIQRLGSDRPRRVN
VRVIAATNRNLREHVRDGSFRADLFHRLSVYPIPIPPLRERGNDVLLLAGRFLELNRARLGMRSLRLSAAAQDALRRYRW
PGNVRELEHVISRAALRCVSRGADRNDIVTLEAELLDLDGLELPAGSAHHAAEAAIAHPALPTGATLREAVEQTQRACIE
QALRAHDGSWAKAARQLGMDASNLHKLAKRLGSK
>Q9K4U8 ~~~norR2~~~Nitric oxide reductase transcription regulator NorR2~~~COG3604
MLPSAMYPELLADLVTDLPHAVRLQRLVSMLRTHFRCGAVALLRLEEDHLRPVAVDGLVRDALGRRFAVGLHPRLAAILA
RRGVTCFHHDSMLPDPYDGLIDEHVGEPLPVHDCMGTSLAVDGQPWGALTLDALAIGTFDAAAQAELQRLTVIVEAAIRT
TRLEGEIRALQLARGTPEADEGTAQHGDIGGEIIGQSEAIANLLHELEVVADTDLPVLLLGETGVGKELFAHRLHRQSRR
RAQPLVHVNCAALPESLAESELFGHARGAFSGATGERPGRFEAADGGTLFLDEVGELPLAIQAKLLRTLQNGEIQRLGSD
RPRRVNVRVIAATNRNLREHVRDGSFRADLYHRLSVYPIPIPPLRERGNDVLLLAGRFLELNRARLGLRSLRLSGGAQDA
LRSYRWPGNVRELEHVISRAALRAVSRGAGRNDIVTLEPELLDLDGLEVPAAHHAGAGMASAFAAPALAAGITLRDAVEQ
TQRACIEQALKDQGGNWAQAARQLGIDASNLHKLARRLGCK
>B1XCN6 ~~~norR~~~Anaerobic nitric oxide reductase transcription regulator NorR~~~
MSFSVDVLANIAIELQRGIGHQDRFQRLITTLRQVLECDASALLRYDSRQFIPLAIDGLAKDVLGRRFALEGHPRLEAIA
RAGDVVRFPADSELPDPYDGLIPGQESLKVHACVGLPLFAGQNLIGALTLDGMQPDQFDVFSDEELRLIAALAAGALSNA
LLIEQLESQNMLPGDATPFEAVKQTQMIGLSPGMTQLKKEIEIVAASDLNVLISGETGTGKELVAKAIHEASPRAVNPLV
YLNCAALPESVAESELFGHVKGAFTGAISNRSGKFEMADNGTLFLDEIGELSLALQAKLLRVLQYGDIQRVGDDRCLRVD
VRVLAATNRDLREEVLAGRFRADLFHRLSVFPLSVPPLRERGDDVILLAGYFCEQCRLRQGLSRVVLSAGARNLLQHYSF
PGNVRELEHAIHRAVVLARATRSGDEVILEAQHFAFPEVTLPTPEVAAVPVVKQNLREATEAFQRETIRQALAQNHHNWA
ACARMLETDVANLHRLAKRLGLKD
>P37013 ~~~norR~~~Anaerobic nitric oxide reductase transcription regulator NorR~~~COG3604
MSFSVDVLANIAIELQRGIGHQDRFQRLITTLRQVLECDASALLRYDSRQFIPLAIDGLAKDVLGRRFALEGHPRLEAIA
RAGDVVRFPADSELPDPYDGLIPGQESLKVHACVGLPLFAGQNLIGALTLDGMQPDQFDVFSDEELRLIAALAAGALSNA
LLIEQLESQNMLPGDATPFEAVKQTQMIGLSPGMTQLKKEIEIVAASDLNVLISGETGTGKELVAKAIHEASPRAVNPLV
YLNCAALPESVAESELFGHVKGAFTGAISNRSGKFEMADNGTLFLDEIGELSLALQAKLLRVLQYGDIQRVGDDRCLRVD
VRVLAATNRDLREEVLAGRFRADLFHRLSVFPLSVPPLRERGDDVILLAGYFCEQCRLRQGLSRVVLSAGARNLLQHYSF
PGNVRELEHAIHRAVVLARATRSGDEVILEAQHFAFPEVTLPTPEVAAVPVVKQNLREATEAFQRETIRQALAQNHHNWA
ACARMLETDVANLHRLAKRLGLKD
>Q46877 ~~~norV~~~Anaerobic nitric oxide reductase flavorubredoxin~~~COG0426
MSIVVKNNIHWVGQRDWEVRDFHGTEYKTLRGSSYNSYLIREEKNVLIDTVDHKFSREFVQNLRNEIDLADIDYIVINHA
EEDHAGALTELMAQIPDTPIYCTANAIDSINGHHHHPEWNFNVVKTGDTLDIGNGKQLIFVETPMLHWPDSMMTYLTGDA
VLFSNDAFGQHYCDEHLFNDEVDQTELFEQCQRYYANILTPFSRLVTPKITEILGFNLPVDMIATSHGVVWRDNPTQIVE
LYLKWAADYQEDRITIFYDTMSNNTRMMADAIAQGIAETDPRVAVKIFNVARSDKNEILTNVFRSKGVLVGTSTMNNVMM
PKIAGLVEEMTGLRFRNKRASAFGSHGWSGGAVDRLSTRLQDAGFEMSLSLKAKWRPDQDALKLCREHGREIARQWALAP
LPQSTVNTVVKEETSATTTADLGPRMQCSVCQWIYDPAKGEPMQDVAPGTPWSEVPDNFLCPECSLGKDVFEELASEAK
>P37596 1.18.1.-~~~norW~~~Nitric oxide reductase FlRd-NAD(+) reductase~~~COG0446
MSNGIVIIGSGFAARQLVKNIRKQDATIPLTLIAADSMDEYNKPDLSHVISQGQRADDLTRQTAGEFAEQFNLHLFPQTW
VTDIDAEARVVKSQNNQWQYDKLVLATGASAFVPPVPGRELMLTLNSQQEYRACETQLRDARRVLIVGGGLIGSELAMDF
CRAGKAVTLIDNAASILASLMPPEVSSRLQHRLTEMGVHLLLKSQLQGLEKTDSGIQATLDRQRNIEVDAVIAATGLRPE
TALARRAGLTINRGVCVDSYLQTSNTDIYALGDCAEINGQVLPFLQPIQLSAMVLAKNLLGNNTPLKLPAMLVKIKTPEL
PLHLAGETQRQDLRWQINTERQGMVARGVDDADQLRAFVVSEDRMKEAFGLLKTLPM
>P19843 ~~~nosD~~~Probable ABC transporter binding protein NosD~~~
MFKAQATFSRYSAAVSLLLLFSGAAQAAPQSITTLPLQPDGENRWRLPAGEYQGQFTIEQPMQLRCEPGAVIQSQGQGSS
LLISAPDVLVEGCTLYEWGSDLTAMDSAVFILPAAERAQISNNRMRGPGFGVFVDGTRDVQVIGNEIDGDAGVRSQDRGN
GIHLFAVSGARVLHNHVRNARDGIYIDTSNGNHLEGNVIEDVRYGVHYMFANENSLIDNVTRRTRTGYALMQSRKLTVTG
NRSEQDQNYGILMNYITYSTITGNFVSDVQRGDTGGDSMISGGEGKALFIYNSLFNTIENNHFEKSSLGIHLTAGSEDNR
ISGNAFVGNQQQVKYVASRTQEWSVDGRGNYWSDYLGWDRNNDGLGDIAYEPNDNVDRLLWLYPQVRLLMNSPSIEVLRW
VQRAFPVIKSPGVQDSHPLMKLPTEKLLTEKQEPTS
>P19844 ~~~nosF~~~Probable ABC transporter ATP-binding protein NosF~~~
MNAVEIQGVSQRYGSMTVLHDLNLNLGEGEVLGLFGHNGAGKTTSMKLILGLLSPSEGQVKVLGRAPNDPQVRRQLGYLP
ENVTFYPQLSGRETLRHFARLKGAALTQVDELLEQVGLAHAADRRVKTYSKGMRQRLGLAQALLGEPRLLLLDEPTVGLD
PIATQDLYLLIDRLRQRGTSIILCSHVLPGVEAHINRAAILAKGCLQAVGSLSQLRAEAGLPVRIRASGISERDSWLQRW
TDAGHSARGLSESSIEVVAVNGHKLVLLRQLLGEGEPEDIEIHQPSLEDLYRYYMERAGDVRAQEGRL
>O68481 ~~~nosL~~~Copper-binding lipoprotein NosL~~~
MRTRLRFVLVAAALALLSACKEDVAQSIVPQDMTPETLGHYCQMNLLEHPGPKAQIFLEGSPAPLFFSQVRDAIAYARGP
EQIAPILVIYVNDMGAAGATWDQPGDGNWIAADKAFYVVGSARRGGMGAPEAVPFSSRDEAAAFVLAEGGQVLALADITD
AMVLTPVETGSEPRADDEDYLGRLRALPHPAGG
>Q52529 ~~~nosL~~~Copper-binding lipoprotein NosL~~~
MNALHRIGAGTLLAVLLAFGLTGCGEKEEVQQSLEPVAFHDSDECHVCGMIITDFPGPKGQAVEKRGVKKFCSTAEMLGW
WLQPENRLLDAKLYVHDMGRSVWEKPDDGHLIDATSAYYVVGTSLKGAMGASLASFAEEQDAKALAGMHGGRVLRFEEID
QALLQEAASMQHGGMHDHAPNGAHNAHAGH
>C6FX52 ~~~nosM~~~Nosiheptide precursor~~~
MDAAHLSDLDIDALEISEFLDESRLEDSEVVAKVMSASCTTCECCCSCSS
>O34453 1.14.14.47~~~nos~~~Nitric oxide synthase oxygenase~~~COG4362
MEEKEILWNEAKAFIAACYQELGKEEEVKDRLADIKSEIDLTGSYVHTKEELEHGAKMAWRNSNRCIGRLFWNSLNVIDR
RDVRTKEEVRDALFHHIETATNNGKIRPTITIFPPEEKGEKQVEIWNHQLIRYAGYESDGERIGDPASCSLTAACEELGW
RGERTDFDLLPLIFRMKGDEQPVWYELPRSLVIEVPITHPDIEAFSDLELKWYGVPIISDMKLEVGGIHYNAAPFNGWYM
GTEIGARNLADEKRYDKLKKVASVIGIAADYNTDLWKDQALVELNKAVLHSYKKQGVSIVDHHTAASQFKRFEEQEEEAG
RKLTGDWTWLIPPISPAATHIFHRSYDNSIVKPNYFYQDKPYE
>Q9RR97 1.14.14.47~~~nos~~~Nitric oxide synthase oxygenase~~~COG4362
MSCPAAAVLTPDMRAFLRRFHEEMGEPGLPARLRAVEEAGLWWPTSAELTWGAKVAWRNSTRCVGRLYWEALSVRDLREL
NTAQAVYEALLQHLDDAFCGGHIRPVISVFGPGVRLHNPQLIRYADDPINADFVDKLRRFGWQPRGERFEVLPLLIEVNG
RAELFSLPPQAVQEVAITHPVCLGIGELGLRWHALPVISDMHLDIGGLHLPCAFSGWYVQTEIAARDLADVGRYDQLPAV
ARALGLDTSRERTLWRDRALVELNVAVLHSFDAAGVKLADHHTVTAHHVRFEEREARAGREVRGKWSWLVPPLSPATTPL
WSRRYRAREESPRFVRARCPFHTPTVHASTGHAPTG
>P0A093 1.14.14.47~~~nos~~~Nitric oxide synthase oxygenase~~~
MLFKEAQAFIENMYKECHYETQIINKRLHDIELEIKETGTYTHTEEELIYGAKMAWRNSNRCIGRLFWDSLNVIDARDVT
DEASFLSSITYHITQATNEGKLKPYITIYAPKDGPKIFNNQLIRYAGYDNCGDPAEKEVTRLANHLGWKGKGTNFDVLPL
IYQLPNESVKFYEYPTSLIKEVPIEHNHYPKLRKLNLKWYAVPIISNMDLKIGGIVYPTAPFNGWYMVTEIGVRNFIDDY
RYNLLEKVADAFEFDTLKNNSFNKDRALVELNYAVYHSFKKEGVSIVDHLTAAKQFELFERNEAQQGRQVTGKWSWLAPP
LSPTLTSNYHHGYDNTVKDPNFFYKKKESNANQCPFHH
>P0A004 1.14.14.47~~~nos~~~Nitric oxide synthase oxygenase~~~
MLFKEAQAFIENMYKECHYETQIINKRLHDIELEIKETGTYTHTEEELIYGAKMAWRNSNRCIGRLFWDSLNVIDARDVT
DEASFLSSITYHITQATNEGKLKPYITIYAPKDGPKIFNNQLIRYAGYDNCGDPAEKEVTRLANHLGWKGKGTNFDVLPL
IYQLPNESVKFYEYPTSLIKEVPIEHNHYPKLRKLNLKWYAVPIISNMDLKIGGIVYPTAPFNGWYMVTEIGVRNFIDDY
RYNLLEKVADAFEFDTLKNNSFNKDRALVELNYAVYHSFKKEGVSIVDHLTAAKQFELFERNEAQQGRQVTGKWSWLAPP
LSPTLTSNYHHGYDNTVKDPNFFYKKKESNANQCPFHH
>P19845 ~~~nosY~~~Probable ABC transporter permease protein NosY~~~COG1277
MNQVWNIARKELSDGLRNRWLLAISLLFAVLAVGIAWLGAAASGQLGFTSIPATIASLASLATFLMPLIALLLAYDAIVG
EDEGGTLMLLLTYPLGRGQILLGKFVGHGLILALAVLIGFGCAALAIALLVEGVELGMLFWAFGRFMISSTLLGWVFLAF
AYVLSGKVNEKSSAAGLALGVWFLFVLVFDLVLLALLVLSEGKFNPELLPWLLLLNPTDIYRLINLSGFEGSGSAMGVLS
LGADLPVPAAVLWLCLLAWIGVSLLLAYAIFRRRLT
>P94127 1.7.2.4~~~nosZ~~~Nitrous-oxide reductase~~~
MESKEHKGLSRRALFSATAGSAILAGTVGPAALSLGAAGLATPARAATGADGSVAPGKLDDYYGFWSSGQTGEMRILGIP
SMRELMRVPVFNRCSATGWGQTNESIRIHQRTMTEKTKKQLAANGKKIHDNGDLHHVHMSFTDGKYDGRYLFMNDKANTR
VARVRCDVMKTDAILEIPNAKGIHGMRPQKWPRSNYVFCNGEDEAPLVNDGSTMTDVATYVNIFTAVDADKWEVAWQVKV
SGNLDNCDADYEGKWAFSTSYNSEMGMTLEEMTKSEMDHVVVFNIAEIEKAIKAGQYEEINGVKVVDGRKEAKSLFTRYI
PIANNPHGCNMAPDRKHLCVAGKLSPTVTVLDVTKFDALFYDNAEPRSAVVAEPELGLGPLHTAFDGRGNAYTSLFLDSQ
VVKWNIDEAIRAYAGEKINPIKDKLDVQYQPGHLKTVMGETLDAANDWLVCLCKFSKDRFLNVGPLKPENDQLIDISGDK
MVLVHDGPTFAEPHDAIAVSPSILPNIRSVWDRKDPLWAETRKQAEADEVDIDEWTEAVIRDGNKVRVYMTSVAPSFSQP
SFTVKEGDEVTVIVTNLDEIDDLTHGFTMGNHGVAMEVGPQQTSSVTFVAANPGVYWYYCQWFCHALHMEMRGRMFVEPK
GA
>Q51705 1.7.2.4~~~nosZ~~~Nitrous-oxide reductase~~~
MESKQEKGLSRRALLGATAGGAAVAGAFGGRLALGPAALGLGTAGVATVAGSGAALAASGDGSVAPGQLDDYYGFWSSGQ
SGEMRILGIPSMRELMRVPVFNRCSATGWGQTNESVRIHERTMSERTKKFLAANGKRIHDNGDLHHVHMSFTEGKYDGRF
LFMNDKANTRVARVRCDVMKCDAILEIPNAKGIHGLRPQKWPRSNYVFCNGEDETPLVNDGTNMEDVANYVNVFTAVDAD
KWEVAWQVLVSGNLDNCDADYEGKWAFSTSYNSEKGMTLPEMTAAEMDHIVVFNIAEIEKAIAAGDYQELNGVKVVDGRK
EASSLFTRYIPIANNPHGCNMAPDKKHLCVAGKLSPTATVLDVTRFDAVFYENADPRSAVVAEPELGLGPLHTAFDGRGN
AYTSLFLDSQVVKWNIEDAIRAYAGEKVDPIKDKLDVHYQPGHLKTVMGETLDATNDWLVCLSKFSKDRFLNVGPLKPEN
DQLIDISGDKMVLVHDGPTFAEPHDAIAVHPSILSDIKSVWDRNDPMWAETRAQAEADGVDIDNWTEEVIRDGNKVRVYM
SSVAPSFSIESFTVKEGDEVTVIVTNLDEIDDLTHGFTMGNYGVAMEIGPQMTSSVTFVAANPGVYWYYCQWFCHALHME
MRGRMLVEPKEA
>P19573 1.7.2.4~~~nosZ~~~Nitrous-oxide reductase~~~COG4263
MSDKDSKNTPQVPEKLGLSRRGFLGASAVTGAAVAATALGGAVMTRESWAQAVKESKQKIHVGPGELDDYYGFWSGGHQG
EVRVLGVPSMRELMRIPVFNVDSATGWGLTNESRHIMGDSAKFLNGDCHHPHISMTDGKYDGKYLFINDKANSRVARIRL
DIMKCDKMITVPNVQAIHGLRLQKVPHTKYVFANAEFIIPHPNDGKVFDLQDENSYTMYNAIDAETMEMAFQVIVDGNLD
NTDADYTGRFAAATCYNSEKAFDLGGMMRNERDWVVVFDIHAVEAAVKAGDFITLGDSKTPVLDGRKKDGKDSKFTRYVP
VPKNPHGCNTSSDGKYFIAAGKLSPTCSMIAIDKLPDLFAGKLADPRDVIVGEPELGLGPLHTTFDGRGNAYTTLFIDSQ
VVKWNMEEAVRAYKGEKVNYIKQKLDVHYQPGHLHASLCETNEADGKWLVALSKFSKDRFLPVGPLHPENDQLIDISGDE
MKLVHDGPTFAEPHDCIMARRDQIKTKKIWDRNDPFFAPTVEMAKKDGINLDTDNKVIRDGNKVRVYMTSMAPAFGVQEF
TVKQGDEVTVTITNIDQIEDVSHGFVVVNHGVSMEISPQQTSSITFVADKPGLHWYYCSWFCHALHMEMVGRMMVEPA
>Q9L9G1 ~~~novG~~~Transcriptional regulator NovG~~~
MTNSGDEEITPASLKATRKGERVSIGSLLPPSELVRSGESTEHIRVLAETDEDLPPIVVHRGTRRVVDGMHRLWAARFRG
DESIEVVFVDGSPADVFVLAVELNRAHGLPLTLDERKSAAAQIMDSHPHWSDRKIARTTGLAASTVASLRSSSTAGTVGR
RTGQDGRSRPNDGTDGRQRAAALLARNPNASLREVTRAAGISVGTASDVRARLRRGEPALTARQQAVMKLRPAAAQRSGP
DYGRVLENLRKDPSLRFTDLGRRLLRLLDGSVPGSVEQIAQIADGVPEHCRTVVVDMARECAAAWQHLADQLADRDTA
>Q9L9G0 6.-.-.-~~~novH~~~Novobiocin biosynthesis protein H~~~
MFNTRANKASDQSPTIPTESATLAELWERTVRSRPSSPAIVTNGETLSYDEVNARANRLARLLLDEGAGPGRLVALALPR
SSHLVISVLAVAKAGAVFLPLDVNHPRERLSYQLADARPALLCTVRSAAARLPDGIEMPRVLLDSPERTAVLDALPDTDL
TDDERGGPLAATDLAYVIYTSGSTGRPKGVALTGAGLPALAAAKVAAMRVTGDSRVLQFASPGFDAYLTELLAAFTAGAT
LVVPGTDTLAGDPLRRALRDGRVSHAVLPPAAVATMSPDAVPDLRVLVVAGEACPAGLVERWAPGRLLINAYGPTECTVC
ATMTGPLTPTDEVTIGRPIPGVSVYILDAERRPAAPGEIGELYLSGAGLAQGYLNSPDLTAQMFVPNPFAADGERMYRTG
DLASRRADGDILFHGRIDDQVELRGFRVELGEVESVLSQHPDVAQAVAALWTDPAEGPQLVTYVVPAPGTTPSAGELREH
AGRFLPDFMVPSAFTTIDAVPLTPGGKTDRAGLPDPVKATQPAGLGPRTPAEKVLCDIFRDLFDLVEIDVRSNFFEMGGN
SILAVDLIQRAQEAGLTLMPRTVIDHPTIEQLAAIATLEE
>Q9L9F8 1.1.1.-~~~novJ~~~Short-chain reductase protein NovJ~~~
MTSPADATTEVAVSQESVAMVTGAGRGIGAATAERLAAEGMAVIVVDRTEQDTRATVAAIRTAGGRARGIGCDVAVAQAV
TAAVATAVEEFGRIDVLVNCAGINRDRLLLTMGDQEWDTVLDVNLGGTMRCSFAVGRHMRRQGHGRIINFSSVAARGNAG
QTNYATAKGAIAGFTRTLAAELGPHGVTVNAIAPGFVATPMVDELAERLGGDRDSVMSEAAKSSAVGRIGTPEEIAATVV
FVARPESGYLTGETVHVDGGRP
>Q9L9F7 ~~~novK~~~Short-chain dehydrogenase/reductase family member NovK~~~
MTRVAVVAGGGEHTGPAVALRLAAGGFDVALLGSEFTSADKTVRRVEEYGRQCVTVRAELSDARSVAVAFGRVRTALSGP
AVLVTCVGPQPLPDGLPEDESADEQRYTAVRRALRPVFVCCQAGAGQLLRHRWGRIIIVTEPADADGNTWRTSRPVLDGL
IGFTRSAALELARSGTTVNLVAPADRAADSRAPAHRAAGDDSAGSYADGVAHVTEFLVDERAVGITGQAIRVAARADVPL
LRER
>Q9L9F6 6.3.1.15~~~novL~~~8-demethylnovobiocic acid synthase~~~
MANKDHAPEHYVTRILAEATLDGARPVVRWRDTVITGTQLDRSVRRVVTALREAGVARDHAVAVLTQVNSPWMLIVRYAA
HLVGASVVYITGANHGIVTHELPVATRVRMLREAGASVLVFDESNAQLAETVDETVRDKLVLCGLGHPASGTVSVDGRPV
DDVSVDFTPEAPELAMVLYTSGTTGQPKGVCRSFGSWNAAALRGAAYPRPVFLTMTAVSQTVAMIVDTVLAAGGSVLLRE
RFDPADFLRDVGEHRVTETFMGVAQLYAILGHPDARTADLSSLRHVLYLGCPASPERLREAAALLPGVLAQSYGSTEAGR
ITVLRAADHERPELLATVGRAVPGVTIAIRDPETGHDLPVGEIGEVVVHGPEVMAGYVADPEHTARVIRDGWVHTGDFGS
VDERGYVRLFGRMREVVKVQDTRVSPTEVEKVLVGCPGVVDACVYGHRGPDLIEELHAAVVLGTEGAPSFDTLRDHVARA
MTPTHAPIRFVRWRRFPINNTGKVNRLRVREVSAEARGDSPDVLVDR
>Q9L9F5 2.4.1.302~~~novM~~~L-demethylnoviosyl transferase~~~
MRVLLTSLPGIGHLFPMVPLGWALQAAGHTVLVATDREFLPVVTGAGLSATAVLDPVDPVELFKPVEPFGDPLSPAERTG
HRCAEAGVRALPAMRALVDVWHPDLVIAEPMELAGPAAATNAGVPWVRHSYGLIPPGPLLSVAAEVLDAELAVLGLSALA
KPARTIDVCPDSLRPSDGVATVPMRYVPYNGPAGVPDWLLAGPPARPRVCLTLGTSLPRRDPHVAPLWRLLLDELVALGQ
EVVIAIDESHRPLLGHLPDGVRAARIPLCDLLPTCTAIVHHGGSGSTMAAASFGVPQLVVPHFADHFTNAERLTAVGAGL
SLPHDTDDLARISAACELITGDGPHRAISRRLADENARRPTPAVVAEGLAAEQRSMTPA
>Q9L9F4 2.1.3.12~~~novN~~~Decarbamoylnovobiocin carbamoyltransferase~~~
MLILGLNGNVSAAGADVVPNLNELYMHDAAATLVRDGVLVAAVEEERLNRIKKTTKFPSNAIRECLAAAGAKPADVDAIA
YYFPEDFFDDIFQQLYTEHPSVPTRYSREMILERLRVDLGWAAPPDILHYVPHHLAHAMSTYYRSGMREALVVVMDGAGE
RNCTTIYRSDGAELFEIASYPVPKSLGMFYLYGTRHLGYGFGDEYKVMGLAPYGDPSTYRDVFSTLYSLGPNGTYELIPR
GGVVFRMTTILREHGLQPRRRGEPFTQAHMDFAASVQETTEQIAMHVIGYWARSTGLRNLAFGGGVAHNSTLNGRILTSG
LFDEVFVHPASHDAGSSEGAALVVARERGERVWPLPRLTNASLGPDLGDVDSLERTLKSWSPLVDVERPDDIVEATAHLL
AAGEAIGWAHGRSEFGPRALGNRSILADARPKENQTRINAMVKKRESFRPFAPVVTAEAAGDYFDLPETVGHHDFMSFVV
QVRADRRELLGAVTHVDGSARVQVVTEETNPRFHRLVTRFGELTGTPVLLNTSFNNHAEPIVQSVDDVLTSYLTTSLDVL
VIEDFVVRRRTELPLALEDFTIGFRPVTRLVRRLADVSAGRPGAPEVSHEIYLDHTSGPRATISAAMYELLTHADGVTPL
GSLGIELTGELLTELYDLWQGRFVTVAPVGDGAGSAP
>Q9L9F3 2.1.1.284~~~novO~~~8-demethylnovobiocic acid C(8)-methyltransferase~~~
MKIEAITGSEAEAFHRMGSQASHRYDEFVDLLVGAGIADGQTVVDLCCGSGELEVILSSRFPSLNLVGVDLSEDMVRIAR
EYAAEQGKALEFRHGDAQLLAGMEDLAGKADLVVSRNAFHRLTRLPAAFDTMLRLAKPGGAVLNCSFIHPSDFDESGFRA
WVTFLNQRPWDSEMQIVWALAHHYAPRLDDYREALAQAARETPVSEQRVWIDDQGYGVPTVKCFARRAAA
>Q9L9F2 2.1.1.285~~~novP~~~Demethyldecarbamoylnovobiocin O-methyltransferase~~~
MAPIVETAKETNSDSSLYLDLMIKVLAGTVYEDPAHRENFSHRDSTYREEVRNEGRDWPANAHTMIGIKRLENIRQCVED
VIGNNVPGDLVETGVWRGGACILMRGILRAHDVRDRTVWVADSFQGIPDVGEDGYAGDRKMALHRRNSVLAVSEEEVRRN
FRNYDLLDEQVRFLPGWFKDTLPTAPIDTLAVLRMDGDLYESTWDTLTNLYPKVSVGGYVIVDDYMMCPPCKDAVDEYRA
KFDIADELITIDRDGVYWQRTR
>Q9L9F1 2.5.1.111~~~novQ~~~4-hydroxyphenylpyruvate 3-dimethylallyltransferase~~~
MPALPMNQEFDRERFRVDLRATAAAIGAPVTPRVTDTVLETFRDNFAQGATLWKTTSQPGDQLSYRFFSRLKMDTVGRAV
DAGLLDGTHPTVPIVEDWSDLYGGTPVQSADFDAGRGMAKTWLYFGGLRPAEDILSVPALPAPVQARLKDFLGLGLAHVR
FAAVDWRHRSANVYFRGQGPLDTAQFARVHALSGGTPPAADVVAEVLAYVPEDYCVAITLDLHTGAIDRVCFYALKVPKD
ARPRVPARIATFLEVAPSHDPEECNVIGWSFGRSGDYVKAERSYTGNMTEILSGWNCFFHGEEGRDHDLRALQDTGSITG
GAR
>Q9L9F0 4.1.-.-~~~novR~~~Decarboxylase NovR~~~
MSEALANMPGDDYFRQPPVFDTYAEHRAYLKFRHAVALRHFARLGFDQDGLAGLITVADPEHADTYWANPLAHPFSTITP
ADLIRVDGDSAETVEGQRRVNIAAFNIHAEIHRARPDVQAVIHLHTVYGRAFSAFARKLPPLTQDACPFFEDHEVFDDFT
GLVLAKDDGRRIAKQLRGHKAILLKNHGLVTVGETLDAAAWWFTLLDTCCHVQLLADAAGKPEEIPAEVARLTGRQLGSH
LLGWNSYQPLHEAALARDPDLATMEPALPS
>Q9L9E5 5.1.3.-~~~novW~~~dTDP-4-dehydrorhamnose 3-epimerase~~~
MRLRPLGIEGVWEITPEQRADPRGVFLDWYHVDRFAEAIGRPLRLAQANLSVSVRGVVRGIHFVDVPPGQAKYVTCVRGA
VFDVVVDLRVGSPTYGCWEGTRLDDVSRRAVYLSEGIGHGFCAISDEATLCYLSSGTYDPATEHGVHPLDPELAIDWPTG
TPLLSPRDQDALLLAEARDAGLLPTYATCQAVTVPSPAPGSVGDPGP
>A0A0N9HP11 1.8.1.20~~~nox~~~4,4'-dithiodibutanoate disulfide reductase~~~
MTTAPTPSIFEPARLGPLTLRNRIVKAATFEGVMPRGAVSDDLINFHAEVARGGAAMTTVAYCAVSPGGRVHRDTLVMDE
RALPGLRRLTDAVHAEGALAAAQIGHAGLVANTLSNKTKTLAPSTRLSPPAMGLVKGATLAELDGVVSDFERTARVAVDA
GFDAIEVHLGHNYLLSSFMSPNLNKRHDRYGGSVAKRAEYPRRVIEAVRVAAGSSVAVTAKFNMSDGVPKGLWLDQSLPI
AQILEADGHLDAMQLTGGSSLLNGMYFFRGEVPLAEFVASQPKLVGYGLKFYGPKIFPTYPFEEGFFLPFARQFRQALRM
PLILLGGINRVDTIEHALDEGFEFVAMARALLRDPQLVNKFQAESVDQGLCIHCNKCMPTIYTGTRCVVRDALVVREAPR
LGQ
>Q60049 7.1.1.2~~~nox~~~NADH dehydrogenase~~~COG0778
MEATLPVLDAKTAALKRRSIRRYRKDPVPEGLLREILEAALRAPSAWNLQPWRIVVVRDPATKRALREAAFGQAHVEEAP
VVLVLYADLEDALAHLDEVIHPGVQGERREAQKQAIQRAFAAMGQEARKAWASGQSYILLGYLLLLLEAYGLGSVPMLGF
DPERVRAILGLPSHAAIPALVALGYPAEEGYPSHRLPLERVVLWR
>Q6F4M8 1.14.13.166~~~npcA~~~4-nitrophenol 4-monooxygenase/4-nitrocatechol 2-monooxygenase, oxygenase component~~~
MRTGQQYLESLRDGRQVYVGGELIDDVTTHPKTSGYAKAIAEYYDLHLDPEHQDVLTFVDDDGVRKSMHWFLPRSKADAA
RRRAYHEFWFRHFQGGIFTRPPAGMHVVMYAQIDDPEPWGDNAVVAGGRTISFADNIRSQWQRVTTDDVALSPMFVDVQF
DRGRDDALVETPMLSIVEQNDQGIVVRGWKAMGTSLPFVNELLVGNLWRPGQTSDQTVYAIVPVNTPGLSLVCRQSNATP
DADPYDHPLSTIGDELDGMAYFDDVFIPWENVQHIGNPDHAKWYPQRQFDWVHIETQIRHAVHAELIVGLALLLTNALGT
NNNPIVQSQLADLVRFRETCKAFAIAAEETGFTTAGGLFKPNNIYVDLGRAHYLENIHNAVNQLIEFCGRGVVMSPTKAD
FDHPFLGPKLEEALRGTSISARDRVSIFRQISERYLTQWGARHEMFEKFNGTPLYLVRLLTMQRTEYQVDGPLTDLARQV
LGFGDTEALAARAAEVEKNSNWASVAYQPEYAREQDVRDGYYKETEKV
>Q6F4M9 1.14.13.166~~~npcB~~~4-nitrophenol 4-monooxygenase/4-nitrocatechol 2-monooxygenase, reductase component~~~
MLEDPMKQNVLQPLDKAEFRNVVGHFASGVTIVTAAHDGVPYGATISAVTSLCDTPPMVLVCLNQKLGTHAAIRKARHFT
INILGEDQASLAHTFATPGADKFADVAVHHRQHGPRLAEALAYLTCRVVDDLEGGTHRIFVAEVVEAQAGTGNPLSYYRG
RFGHFVPYRNAMWRTTQADNAVSPH
>Q6F4M7 1.13.11.37~~~npcC~~~Hydroxyquinol 1,2-dioxygenase~~~
MHTTDTETFEEQFAIEQRLVDSVVASFDSTTDPRLKELMQSLTRHLHAFIREVRLSEDEWSNAIAFLTAVGNITDDRRQE
FILLSDVLGVSMQTIAVSNPAYEDATESTVFGPFFVEDAPEVTLGGDIAGGATGQPCWIEGTVTDTAGNPVPEARIEVWQ
NDEDGFYDVQYSDGRVSGRAHLFSDAHGRYRFWGMTPVPYPIPSDGPVGKMLAATNRSPMRVAHLHFMVTADGLRTLVTH
IFVAGDPQLERGDSVFGVKDSLIKDFVEQPPGTPTPDGRHIGDRNWARCEFDIVLAPEQI
>P75960 2.3.1.286~~~cobB~~~NAD-dependent protein deacylase~~~COG0846
MLSRRGHRLSRFRKNKRRLRERLRQRIFFRDKVVPEAMEKPRVLVLTGAGISAESGIRTFRAADGLWEEHRVEDVATPEG
FDRDPELVQAFYNARRRQLQQPEIQPNAAHLALAKLQDALGDRFLLVTQNIDNLHERAGNTNVIHMHGELLKVRCSQSGQ
VLDWTGDVTPEDKCHCCQFPAPLRPHVVWFGEMPLGMDEIYMALSMADIFIAIGTSGHVYPAAGFVHEAKLHGAHTVELN
LEPSQVGNEFAEKYYGPASQVVPEFVEKLLKGLKAGSIA
>P9WGG3 2.3.1.286~~~cobB~~~NAD-dependent protein deacylase~~~COG0846
MRVAVLSGAGISAESGVPTFRDDKNGLWARFDPYELSSTQGWLRNPERVWGWYLWRHYLVANVEPNDGHRAIAAWQDHAE
VSVITQNVDDLHERAGSGAVHHLHGSLFEFRCARCGVPYTDALPEMPEPAIEVEPPVCDCGGLIRPDIVWFGEPLPEEPW
RSAVEATGSADVMVVVGTSAIVYPAAGLPDLALARGTAVIEVNPEPTPLSGSATISIRESASQALPGLLERLPALLK
>B4EVF5 2.3.1.286~~~npdA~~~NAD-dependent protein deacylase~~~COG0846
MMKLKLRHRRLRKFRKIKSLRRQHSRCRYFHLTHKTEHEMNLPKVVVLTGAGISAESGIKTFRSEDGLWEEHRVEDVATP
EGYHRNPKLVQQFYNERRRQLQQPSIQPNEAHYALAKLEQYLGKDNFLLVTQNIDNLHEKAGSKHILHMHGELLKVRCPQ
SGQVFEWKGDLSTTDYCHCCQFPSPLRPHIVWFGEMPIGMDEIYHALAQADLFIAIGTSGNVYPAAGFVHEARLTGAHTV
ELNLEPSLVESEFEEKHYGPASQVVDAYVHKLFDLINDPKADLTQ
>P0A2F2 2.3.1.286~~~cobB~~~NAD-dependent protein deacylase~~~
MQSRRFHRLSRFRKNKRLLRERLRQRIFFRDRVVPEMMENPRVLVLTGAGISAESGIRTFRAADGLWEEHRVEDVATPEG
FARNPGLVQTFYNARRQQLQQPEIQPNAAHLALAKLEEALGDRFLLVTQNIDNLHERAGNRNIIHMHGELLKVRCSQSGQ
ILEWNGDVMPEDKCHCCQFPAPLRPHVVWFGEMPLGMDEIYMALSMADIFIAIGTSGHVYPAAGFVHEAKLHGAHTVELN
LEPSQVGSEFEEKHYGPASQVVPEFVDKFLKGL
>P66815 2.3.1.286~~~cobB~~~NAD-dependent protein deacetylase~~~
MRNDLETLKHIIDSSNRITFFTGAGVSVASGVPDFRSMGGLFDEISKDGLSPEYLLSRDYLEDDPEGFINFCHKRLLFVD
TKPNIVHDWIAKLERNQQSLGVITQNIDGLHSDAGSQHVDELHGTLNRFYCNACHKSYTKSDVIDRTLKHCDNCGGAIRP
DIVLYGEMLDQPTIIRALNKIEHADTLVVLGSSLVVQPAAGLISHFKGDNLIIINKDRTPYDSDATLVIHDDMVSVVKSL
MTE
>Q9WYW0 2.3.1.286~~~cobB~~~NAD-dependent protein deacetylase~~~COG0846
MKMKEFLDLLNESRLTVTLTGAGISTPSGIPDFRGPNGIYKKYSQNVFDIDFFYSHPEEFYRFAKEGIFPMLQAKPNLAH
VLLAKLEEKGLIEAVITQNIDRLHQRAGSKKVIELHGNVEEYYCVRCEKKYTVEDVIKKLESSDVPLCDDCNSLIRPNIV
FFGENLPQDALREAIGLSSRASLMIVLGSSLVVYPAAELPLITVRSGGKLVIVNLGETPFDDIATLKYNMDVVEFARRVM
EEGGIS
>Q8RQQ0 1.14.13.29~~~nphA1~~~4-nitrophenol 2-monooxygenase, oxygenase component~~~
MTTSAFVDDRVGVPNDVRPMTGDEYLESLRDGREVYFRGERVDDVTTHPAFRNSARSVARMYDALHQPEQEGVLAVPTDT
GNGGFTHPFFKTARSADDLVLSRDAIVAWQREVYGWLGRSPDYKASFLGTLGANADFYGPYRDNALRWYKHAQERMLYLN
HAIVNPPIDRDKPADETADVCVHVVEETDAGLIVSGAKVVATGSAITNANFIAHYGLLRKKEYGLIFTVPMDSPGLKLFC
RTSYEMNAAVMGTPFDYPLSSRFDENDAIMVFDRVLVPWENVFAYDTDTANGFVMKSGFLSRFMFHGCARLAVKLDFIAG
CVMKGVEMTGSAGFRGVQMQIGEILNWRDMFWGLSDAMAKSPEQWVNGAVQPNLNYGLAYRTFMGVGYPRVKEIIQQVLG
SGLIYLNSHASDWANPAMRPYLDQYVRGSNGVAAIDRVQLLKLLWDAVGTEFGGRHELYERNYGGDHEAVRFQTLFAYQA
TGQDLALKGFAEQCMSEYDVDGWTRPDLIGNDDLRIVRG
>Q8RQP9 1.5.1.36~~~nphA2~~~NADH-dependent flavin reductase~~~
MTETAGELDPEVTPLHLRKALGRFASGVTIVTTAECEDEDSVHGMTANAFTSVSLDPPLVLVSISTRAKMDTKIRETGTY
GISILAGDQEPVSLHFAGAAHEPDRVRFVWRRGVPLLEGALVHLACTVVASHPAGDHTLHVGRVEQLWYDDGHPLVFYTG
SFRSLELLGRDEPWGF
>B1Q2A8 ~~~nphR~~~Transcriptional activator NphR~~~
MAEREQSNDSARTDVPAIVSLRTRELDTGEGRMQWASTLERLYCETDVAWPEPRRHFDAEWGGRPFGDLHVSTIRADAHT
VVRSPAMIQSDSGEGYLVCLVTDGSVEVRQSGRATVVEPGSFALLDCAAPFVFHSPAPFRQVVVRSPREVLTSRLPGRIV
EHGTARSIHGDTGAGGLVGRLFVDIADMDAPMSQGAAVSFASSAVDMLATALTEGLLATSAADLHRTEDLTRVQRVIEQN
LHDADITLSDIAAAAGMSLRTVHKLFNAEGTTTRAWLYQARLEAARRYLLTTDLSVADVSECAGFRDVSHFSRLFRSTFG
SSPGLYRKEHARIGS
>D7URV0 2.3.1.194~~~nphT7~~~Acetoacetyl CoA synthase NphT7~~~
MTDVRFRIIGTGAYVPERIVSNDEVGAPAGVDDDWITRKTGIRQRRWAADDQATSDLATAAGRAALKAAGITPEQLTVIA
VATSTPDRPQPPTAAYVQHHLGATGTAAFDVNAVCSGTVFALSSVAGTLVYRGGYALVIGADLYSRILNPADRKTVVLFG
DGAGAMVLGPTSTGTGPIVRRVALHTFGGLTDLIRVPAGGSRQPLDTDGLDAGLQYFAMDGREVRRFVTEHLPQLIKGFL
HEAGVDAADISHFVPHQANGVMLDEVFGELHLPRATMHRTVETYGNTGAASIPITMDAAVRAGSFRPGELVLLAGFGGGM
AASFALIEW
>A8C927 2.1.1.180~~~npmA~~~16S rRNA (adenine(1408)-N(1))-methyltransferase~~~
MLILKGTKTVDLSKDELTEIIGQFDRVHIDLGTGDGRNIYKLAINDQNTFYIGIDPVKENLFDISKKIIKKPSKGGLSNV
VFVIAAAESLPFELKNIADSISILFPWGTLLEYVIKPNRDILSNVADLAKKEAHFEFVTTYSDSYEEAEIKKRGLPLLSK
AYFLSEQYKAELSNSGFRIDDVKELDNEYVKQFNSLWAKRLAFGRKRSFFRVSGHVSKH
>O06428 2.5.1.85~~~grcC1~~~Nonaprenyl diphosphate synthase~~~COG0142
MRTPATVVAGVDLGDAVFAAAVRAGVARVEQLMDTELRQADEVMSDSLLHLFNAGGKRFRPLFTVLSAQIGPQPDAAAVT
VAGAVIEMIHLATLYHDDVMDEAQVRRGAPSANAQWGNNVAILAGDYLLATASRLVARLGPEAVRIIADTFAQLVTGQMR
ETRGTSENVDSIEQYLKVVQEKTGSLIGAAGRLGGMFSGATDEQVERLSRLGGVVGTAFQIADDIIDIDSESDESGKLPG
TDVREGVHTLPMLYALRESGPDCARLRALLNGPVDDDAEVREALTLLRASPGMARAKDVLAQYAAQARHELALLPDVPGR
RALAALVDYTVSRHG
>P43130 ~~~nprA~~~Transcriptional activator NprA~~~
MADGIISVSYLSKIENNQVVPSEEVLRLLCQRLGINNILKNRQDELTSKLLLWYKTITDKNRQEAARMYEEIKRTFDDVQ
GAESIAYFLLFEMRYHLLLKDIHTVEALLIKLRELYDTFDDVMKYYYYKFLGLLYYCKEKYEDALEYYKKAEQRFRSQSF
EKWEEADLHYLLALVYSRLWRILGCINYAQHALAIYQSEYDLKRSAECHILLGICYRRYGEVDQAIECYSLAHKIAQIIN
DTELLGTIEHNLGYLMSMKHEHYEAIQHYKKSLLYKRNSSLQARFITLFSLIKEYYVSKNYKKALANVEESLQLLKREKD
GMTTYYEYYLHFTVYQYLLSEDISENEFETFMKDRVLPYFQRFKKYEDVAQYAEYLAIYYEKRHKYKLASKFYKMSYQFL
KNMINI
>P39899 3.4.24.-~~~nprB~~~Neutral protease B~~~COG3227
MRNLTKTSLLLAGLCTAAQMVFVTHASAEESIEYDHTYQTPSYIIEKSPQKPVQNTTQKESLFSYLDKHQTQFKLKGNAN
SHFRVSKTIKDPKTKQTFFKLTEVYKGIPIYGFEQAVAMKENKQVKSFFGKVHPQIKDVSVTPSISEKKAIHTARRELEA
SIGKIEYLDGEPKGELYIYPHDGEYDLAYLVRLSTSEPEPGYWHYFIDAKNGKVIESFNAIHEAAGTGIGVSGDEKSFDV
TEQNGRFYLADETRGKGINTFDAKNLNETLFTLLSQLIGYTGKEIVSGTSVFNEPAAVDAHANAQAVYDYYSKTFGRDSF
DQNGARITSTVHVGKQWNNAAWNGVQMVYGDGDGSKFKPLSGSLDIVAHEITHAVTQYSAGLLYQGEPGALNESISDIMG
AMADRDDWEIGEDVYTPGIAGDSLRSLEDPSKQGNPDHYSNRYTGTEDYGGVHINSSIHNKAAYLLAEGGVHHGVQVEGI
GREASEQIYYRALTYYVTASTDFSMMKQAAIEAANDLYGEGSKQSASVEKAYEAVGIL
>P06832 3.4.24.28~~~npr~~~Bacillolysin~~~COG3227
MGLGKKLSVAVAASFMSLTISLPGVQAAENPQLKENLTNFVPKHSLVQSELPSVSDKAIKQYLKQNGKVFKGNPSERLKL
IDQTTDDLGYKHFRYVPVVNGVPVKDSQVIIHVDKSNNVYAINGELNNDVSAKTANSKKLSANQALDHAYKAIGKSPEAV
SNGTVANKNKAELKAAATKDGKYRLAYDVTIRYIEPEPANWEVTVDAETGKILKKQNKVEHAATTGTGTTLKGKTVSLNI
SSESGKYVLRDLSKPTGTQIITYDLQNREYNLPGTLVSSTTNQFTTSSQRAAVDAHYNLGKVYDYFYQKFNRNSYDNKGG
KIVSSVHYGSRYNNAAWIGDQMIYGDGDGSFFSPLSGSMDVTAHEMTHGVTQETANLNYENQPGALNESFSDVFGYFNDT
EDWDIGEDITVSQPALRSLSNPTKYGQPDNFKNYKNLPNTDAGDYGGVHTNSGIPNKAAYNTITKIGVNKAEQIYYRALT
VYLTPSSTFKDAKAALIQSARDLYGSQDAASVEAAWNAVGL
>P05806 3.4.24.28~~~npr~~~Bacillolysin~~~COG3227
MKKKSLALVLATGMAVTTFGGTGSAFADSKNVLSTKKYNETVQSPEFISGDLTEATGKKAESVVFDYLNAAKGDYKLGEK
SAQDSFKVKQVKKDAVTDSTVVRMQQVYEGVPVWGSTQVAHVSKDGSLKVLSGTVAPDLDKKEKLKNKNKIEGAKAIEIA
QQDLGVTPKYEVEPKADLYVYQNGEETTYAYVVNLNFLDPSPGNYYYFIEADSGKVLNKFNTIDHVTNDDKSPVKQEAPK
QDAKAVVKPVTGTNKVGTGKGVLGDTKSLNTTLSGSSYYLQDNTRGATIFTYDAKNRSTLPGTLWADADNVFNAAYDAAA
VDAHYYAGKTYDYYKATFNRNSINDAGAPLKSTVHYGSNYNNAFWNGSQMVYGDGDGVTFTSLSGGIDVIGHELTHAVTE
NSSNLIYQNESGALNEAISDIFGTLVEFYDNRNPDWEIGEDIYTPGKAGDALRSMSDPTKYGDPDHYSKRYTGSSDNGGV
HTNSGIINKQAYLLANGGTHYGVTVTGIGKDKLGAIYYRANTQYFTQSTTFSQARAGAVQAAADLYGANSAEVAAVKQSF
SAVGVN
>P23384 3.4.24.28~~~npr~~~Bacillolysin~~~
MNKRAMLGAIGLAFGLMAWPFGASAKGKSMVWNEQWKTPSFVSGSLLGRCSQELVYRYLDQEKNTFQLGGQARERLSLIG
NKLDELGHTVMRFEQAIAASLCMGAVLVAHVNDGELSSLSGTLIPNLDKRTLKTEAAISIQQAEMIAKQDVADRVTKERP
AAEEGKPTRLVIYPDEETPRLAYEVNVRFLTPVPGNWIYMIDAADGKVLNKWNQMDEAKPGGAQPVAGTSTVGVGRGVLG
DQKYINTTYSSYYGYYYLQDNTRGSGIFTYDGRNRTVLPGSLWADGDNQFFASYDAAAVDAHYYAGVVYDYYKNVHGRLS
YDGSNAAIRSTVHYGRGYNNAFWNGSQMVYGDGDGQTFLPFSGGIDVVGHELTHAVTDYTAGLVYQNESGAINEAMSDIF
GTLVEFYANRNPDWEIGEDIYTPGVAGDALRSMSDPAKYGDPDHYSKRYTGTQDNGGVHTNSGIINKAAYLLSQGGVHYG
VSVTGIGRDKMGKIFYRALVYYLTPTSNFSQLRAACVQAAADLYGSTSQEVNSVKQAFNAVGVY
>P68734 3.4.24.28~~~nprE~~~Neutral protease NprE~~~
AAATGSGTTLKGATVPLNISYEGGKYVLRDLSKPTGTQIITYDLQNRQSRLPGTLVSSTTKTFTSSSQRAAVDAHYNLGK
VYDYFYSNFKRNSYDNKGSKIVSSVHYGTQYNNAAWTGDQMIYGDGDGSFFSPLSGSLDVTAHEMTHGVTQETANLIYEN
QPGALNESFSDVFGYFNDTEDWDIGEDITVSQPALRSLSNPTKYNQPDNYANYRNLPNTDEGDYGGVHTNSGIPNKAAYN
TITKLGVSKSQQIYYRALTTYLTPSSTFKDAKAALIQSARDLYGSTDAAKVEAAWNAVGL
>P68735 3.4.24.28~~~nprE~~~Bacillolysin~~~
MGLGKKLSVAVAASFMSLSISLPGVQAAEGHQLKENQTNFLSKNAIAQSELSAPNDKAVKQFLKKNSNIFKGDPSKRLKL
VESTTDALGYKHFRYAPVVNGVPIKDSQVIVHVDKSDNVYAVNGELHNQSAAKTDNSQKVSSEKALALAFKAIGKSPDAV
SNGAAKNSNKAELKAIETKDGSYRLAYDVTIRYVEPEPANWEVLVDAETGSILKQQNKVEHAAATGSGTTLKGATVPLNI
SYEGGKYVLRDLSKPTGTQIITYDLQNRQSRLPGTLVSSTTKTFTSSSQRAAVDAHYNLGKVYDYFYSNFKRNSYDNKGS
KIVSSVHYGTQYNNAAWTGDQMIYGDGDGSFFSPLSGSLDVTAHEMTHGVTQETANLIYENQPGALNESFSDVFGYFNDT
EDWDIGEDITVSQPALRSLSNPTKYNQPDNYANYRNLPNTDEGDYGGVHTNSGIPNKAAYNTITKLGVSKSQQIYYRALT
TYLTPSSTFKDAKAALIQSARDLYGSTDAAKVEAAWNAVGL
>P68736 3.4.24.28~~~nprE~~~Bacillolysin~~~COG3227
MGLGKKLSVAVAASFMSLSISLPGVQAAEGHQLKENQTNFLSKNAIAQSELSAPNDKAVKQFLKKNSNIFKGDPSKRLKL
VESTTDALGYKHFRYAPVVNGVPIKDSQVIVHVDKSDNVYAVNGELHNQSAAKTDNSQKVSSEKALALAFKAIGKSPDAV
SNGAAKNSNKAELKAIETKDGSYRLAYDVTIRYVEPEPANWEVLVDAETGSILKQQNKVEHAAATGSGTTLKGATVPLNI
SYEGGKYVLRDLSKPTGTQIITYDLQNRQSRLPGTLVSSTTKTFTSSSQRAAVDAHYNLGKVYDYFYSNFKRNSYDNKGS
KIVSSVHYGTQYNNAAWTGDQMIYGDGDGSFFSPLSGSLDVTAHEMTHGVTQETANLIYENQPGALNESFSDVFGYFNDT
EDWDIGEDITVSQPALRSLSNPTKYNQPDNYANYRNLPNTDEGDYGGVHTNSGIPNKAAYNTITKLGVSKSQQIYYRALT
TYLTPSSTFKDAKAALIQSARDLYGSTDAAKVEAAWNAVGL
>P43263 3.4.24.28~~~npr~~~Bacillolysin~~~
MKKSYLATSLTLSIAVGVSGFTSVPAFAKTKIDYHKQWDTPQYIGEVWEPEGAKGDDVVWSYLEKYKDEFRIQGNVEDHF
EIVNEARNKETDTKHYRLQEVYNGIPIYGFQQTVHIDADGNVTSFLGQFIPDLDSNKQLKKKPKLNEQKAVKQAIKDVEG
EVGEKPDFIQDPEAKLYIYVHEDESYLAYAVELNFLDPEPGRWMYFIDAHSGDVINKYNMLDHVTATGKGVLGDTKQFET
TKQGSTYMLKDTTRGKGIETYTANNRTSLPGTLMTDSDNYWTDGAAVDAHAHAQKTYDYFRNVHNRNSYDGNGAVIRSTV
HYSTRYNNAFWNGSQMVYGDGDGTTFLPLSGGLDVVAHELTHAVTERTAGLVYQNESGALNESMSDIFGAMVDNDDWLMG
EDIYTPGRSGDALRSLQDPAAYGDPDHYSKRYTGSQDNGGVHTNSGINNKAAYLLAEGGTHYGVRVNGIGRTDTAKIYYH
ALTHYLTPYSNFSAMRRAAVLSATDLFGANSRQVQAVNAAYDAVGVK
>P29148 3.4.24.28~~~npr~~~Bacillolysin~~~COG3227
MKKVWFSLLGGAMLLGSVASGASAESSVSGPAQLTPTFHTEQWKAPSSVSGDDIVWSYLNRQKKSLLGVDSSSVREQFRI
VDRTSDKSGVSHYRLKQYVNGIPVYGAEQTIHVGKSGEVTSYLGAVINEDQQEEATQGTTPKISASEAVYTAYKEAAARI
EALPTSDDTISKDAEEPSSVSKDTYAEAANNDKTLSVDKDELSLDKASVLKDSKIEAVEAEKSSIAKIANLQPEVDPKAE
LYYYPKGDDLLLVYVTEVNVLEPAPLRTRYIIDANDGSIVFQYDIINEATGKGVLGDSKSFTTTASGSSYQLKDTTRGNG
IVTYTASNRQSIPGTLLTDADNVWNDPAGVDAHAYAAKTYDYYKSKFGRNSIDGRGLQLRSTVHYGSRYNNAFWNGSQMT
YGDGDGDGSTFIAFSGDPDVVGHELTHGVTEYTSNLEYYGESGALNEAFSDVIGNDIQRKNWLVGDDIYTPNICGDALRS
MSNPTLYDQPHHYSNLYKGSSDNGGVHTNSGIINKAYYLLAQGGTFHGVTVNGIGRDAAVQIYYSAFTNYLTSSSDFSNA
RAAVIQAAKDLYGANSAEATAAAKSFDAVG
>P0CH29 3.4.24.28~~~nprM~~~Bacillolysin~~~
MKKKKQALKVLLSVGILSSSFAFAHTSSAAPNNVLSTEKYNKEIKSPEFISGKLSGPSSQKAQDVVFHYMNTNKDKYKLG
NESAQNSFKVTEVVKDPVEQATVVRLQQVYNNIPVWGSTQLAHVAKDGTLKVVSGTVAPDLDKKEKLKGQKQVDSKKAIQ
AAEKDLGFKPTYEKSPSSELYVYQNASDTTYAYVVNLNFLSPEPGNYYYFVDAISGKVLDKYNTIDSVAGPKADVKQAAK
PAAKPVTGTNTIGSGKGVLGDTKSLKTTLSSSTYYLQDNTRGATIYTYDAKNRTSLPGTLWADTDNTYNATRDAAAVDAH
YYAGVTYDYYKNKFNRNSYDNAGRPLKSTVHYSSGYNNAFWNGSQMVYGDGDGTTFVPLSGGLDVIGHELTHALTERSSN
LIYQYESGALNEAISDIFGTLVEYYDNRNPDWEIGEDIYTPGTSGDALRSMSNPAKYGDPDHYSKRYTGSSDNGGVHTNS
GIINKAAYLLANGGTHYGVTVTGIGGDKLGKIYYRANTLYFTQSTTFSQARAGLVQAAADLYGSGSQEVISVGKSFDAVG
VQ
>P06874 3.4.24.-~~~nprT~~~Thermostable neutral protease NprT~~~
MNKRAMLGAIGLAFGLLAAPIGASAKGESIVWNEQWKTPSFVSGSLLNGGEQALEELVYQYVDRENGTFRLGGRARDRLA
LIGKQTDELGHTVMRFEQRHHGIPVYGTMLAAHVKDGELIALSGSLIPNLDGQPRLKKAKTVTVQQAEAIAEQDVTETVT
KERPTTENGERTRLVIYPTDGTARLAYEVNVRFLTPVPGNWVYIIDATDGAILNKFNQIDSRQPGGGQPVAGASTVGVGR
GVLGDQKYINTTYSSYYGYYYLQDNTRGSGIFTYDGRNRTVLPGSLWTDGDNQFTASYDAAAVDAHYYAGVVYDYYKNVH
GRLSYDGSNAAIRSTVHYGRGYNNAFWNGSQMVYGDGDGQTFLPFSGGIDVVGHELTHAVTDYTAGLVYQNESGAINEAM
SDIFGTLVEFYANRNPDWEIGEDIYTPGVAGDALRSMSDPAKYGDPDHYSKRYTGTQDNGGVHTNSGIINKAAYLLSQGG
VHYGVSVNGIGRDKMGKIFYRALVYYLTPTSNFSQLRAACVQAAADLYGSTSQEVNSVKQAFNAVGVY
>Q00971 3.4.24.25~~~nprV~~~Neutral protease~~~
MNKTQRHINWLLAVSAATALPVTAAEMINVNDGSLLNQALKAQSQSVAPVETGFKQMKRVVLPNGKVKVRYQQTHHGLPV
FNTSVVATESKSGSSEVFGVMAQGIADDVSTLTPSVEMKQAISIAKSRFQQQEKMVAEPATENEKAELMVRLDDNNQAQL
VYLVDFFVAEDHPARPFFFIDAQTGEVLQTWDGLNHAQADGTGPGGNTKTGRYEYGSDFPPFVIDKVGTKCSMNNSAVRT
VDLNGSTSGNTTYSYTCNDSTNYNDYKAINGAYSPLNDAHYFGKVVFDMYKDWMNTTPLTFQLTMRVHYGNNYENAFWNG
SSMTFGDGYSTFYPLVDINVSAHEVSHGFTEQNSGLVYENMSGGMNEAFSDIAGEAAEFYMKGSVDWVVGADIFKSSGGL
RYFDQPSRDGRSIDHASDYYNGLNVHYSSGVFNRAFYLLANKAGWDVRKGFEVFTLANQLYWTANSTFDEGGCGVVKAAS
DMGYSVADVEDAFNTVGVNASCGATPPPSGDVLEIGKPLANLSGNRNDMTYYTFTPSSSSSVVIKITGGTGDADLYVKAG
SKPTTTSYDCRPYKYGNEEQCSISAQAGTTYHVMLRGYSNYAGVTLRAD
>Q56225 7.1.1.-~~~~~~NADH-quinone oxidoreductase subunit 10~~~COG0839
MSLLEGLALFLLLLSGVLVVTLRNAIHAALALILNFLVLAGVYVALDARFLGFIQVIVYAGAIVVLFLFVIMLLFAAQGE
IGFDPLVRSRPLAALLALGVAGILAAGLWGLDLAFTQDLKGGLPQALGPLLYGDWLFVLLAVGFLLMAATVVAVALVEPG
KASRAKEAEKREEVAR
>P29923 7.1.1.-~~~~~~NADH-quinone oxidoreductase subunit 11~~~
MIGLTHYLVVGAILFVTGIFGIFVNRKNVIVILMSIELMLLAVNINFVAFSTHLGDLAGQVFTMFVLTVAAAEAAIGLAI
LVVFFRNRGTIAVEDVNVMKG
>Q56226 7.1.1.-~~~~~~NADH-quinone oxidoreductase subunit 11~~~COG0713
MSYLLTSALLFALGVYGVLTRRTAILVFLSIELMLNAANLSLVGFARAYGLDGQVAALMVIAVAAAEVAVGLGLIVAIFR
HRESTAVDDLSELRG
>Q56227 7.1.1.-~~~~~~NADH-quinone oxidoreductase subunit 12~~~COG1009
MALLGTILLPLLGFALLGLFGKRMREPLPGVLASGLVLASFLLGAGLLLSGGARFQAEWLPGIPFSLLLDNLSGFMLLIV
TGVGFLIHVYAIGYMGGDPGYSRFFAYFNLFIAMMLTLVLADSYPVMFIGWEGVGLASFLLIGFWYKNPQYADSARKAFI
VNRIGDLGFMLGMAILWALYGTLSISELKEAMEGPLKNPDLLALAGLLLFLGAVGKSAQIPLMVWLPDAMAGPTPVSALI
HAATMVTAGVYLIARSSFLYSVLPDVSYAIAVVGLLTAAYGALSAFGQTDIKKIVAYSTISQLGYMFLAAGVGAYWVALF
HVFTHAFFKALLFLASGSVIHALGGEQDVRKMGGLWKHLPQTRWHALIGALALGGLPLLSGFWSKDAILAATLTYPFGGV
GFYVGALLVAVLTAMYAMRWFVLVFLGEERGHHHPHEAPPVMLWPNHLLALGSVLAGYLALPHPLPNVLEPFLKPALAEV
EAHHLSLGAEWGLIALSAAVALLGLWAGFVFFQRKVFPAWYLAFEAASREAFYVDRAYNALIVNPLKALAEALFYGDRGL
LSGYFGLGGAARSLGQGLARLQTGYLRVYALLFVLGALLLLGVMRW
>Q56228 7.1.1.-~~~~~~NADH-quinone oxidoreductase subunit 13~~~COG1008
MVVLAVLLPVVFGALLLLGLPRALGVLGAGLSFLLNLYLFLTHPGGVAHAFQAPLLPGAGVYWAFGLDGLSALFFLTIAL
TVFLGALVARVEGRFLGLALLMEGLLLGLFAARDLLVFYVFFEAALIPALLMLYLYGGEGRTRALYTFVLFTLVGSLPML
AAVLGARLLSGSPTFLLEDLLAHPLQEEAAFWVFLGFALAFAIKTPLFPLHAWLPPFHQENHPSGLADALGTLYKVGVFA
FFRFAIPLAPEGFAQAQGLLLFLAALSALYGAWVAFAAKDFKTLLAYAGLSHMGVAALGVFSGTPEGAMGGLYLLAASGV
YTGGLFLLAGRLYERTGTLEIGRYRGLAQSAPGLAALALILFLAMVGLPGLSGFPGEFLTLLGAYKASPWLAALAFLSVI
ASAAYALTAFQKTFWEEGGSGVKDLAGAEWGFALLSVLALLLMGVFPGYFARGLHPLAEAFAKLLGGGA
>Q56229 7.1.1.-~~~~~~NADH-quinone oxidoreductase subunit 14~~~COG1007
MTLAILAVFSVALTLLGFVLPPQGVKRATLLGLALALASLLLTWGKPFAFGPYAVDGVSQVFTLLALLGALWTVGLVRSG
RFEFYLLVLYAALGMHLLASTRHLLLMLVALEALSLPLYALATWRRGQGLEAALKYFLLGALAAAFFLYGAALFYGATGS
LVLGAPGEGPLYALALGLLLVGLGFKAALAPFHFWTPDVYQGSPTPVVLFMATSVKAAAFAALLRVAAPPEALALLVALS
VVVGNLAALAQKEAKRLLAYSSIAHAGYMALALYTGNAQALGFYLLTYVLATGLAFAVLSQISPDRVPLEALRGLYRKDP
LLGLAFLVAMLSLLGLPPLAGFWGKYLAFAEAARAGAWGVLVLALVTSAVSAYYYLGLGLAVFARPEETPFRPGPPWARA
AVVAAGVLLLALGLLPGLVLPALAAGG
>Q5SKZ7 7.1.1.-~~~~~~NADH-quinone oxidoreductase subunit 15~~~
MSASSERELYEAWVELLSWMREYAQAKGVRFEKEADFPDFIYRMERPYDLPTTIMTASLSDGLGEPFLLADVSPRHAKLK
RIGLRLPRAHIHLHAHYEPGKGLVTGKIPLTKERFFALADRAREALAFA
>P29913 7.1.1.-~~~nqo1~~~NADH-quinone oxidoreductase chain 1~~~
MLNDQDRIFTNLYGMGDRSLAGAKKRGHWDGTAAIIQRGRDKIIDEMKASGLRGRGGAGFPTGMKWSFMPKESDGRPSYL
VINADESEPATCKDREIMRHDPHTLIEGALIASFAMGAHAAYIYIRGEFIREREALQAAIDECYDAGLLGRNAAGSGWDF
DLYLHHGAGAYICGEETALLESLEGKKGMPRMKPPFPAGAGLYGCPTTVNNVESIAVVPTILRRGAEWFASFGRPNNAGV
KLFGLTGHVNTPCVVEEAMSIPMRELIEKHGGGIRGGWKNLKAVIPGGASCPVLTAEQCENAIMDYDGMRDVRSSFGTAC
MIVMDQSTDVVKAIWRLSKFFKHESCGQCTPCREGTGWMMRVMERLVRGDAEVEEIDMLFDVTKQVEGHTICALGDAAAW
PIQGLIRNFREEIEDRIKAKRTGRMGAMAAE
>Q56222 7.1.1.-~~~nqo1~~~NADH-quinone oxidoreductase subunit 1~~~COG1894
MTGPILSGLDPRFERTLYAHVGKEGSWTLDYYLRHGGYETAKRVLKEKTPDEVIEEVKRSGLRGRGGAGFPTGLKWSFMP
KDDGKQHYLICNADESEPGSFKDRYILEDVPHLLIEGMILAGYAIRATVGYIYVRGEYRRAADRLEQAIKEARARGYLGK
NLFGTDFSFDLHVHRGAGAYICGEETALMNSLEGLRANPRLKPPFPAQSGLWGKPTTINNVETLASVVPIMERGADWFAQ
MGTEQSKGMKLYQISGPVKRPGVYELPMGTTFRELIYEWAGGPLEPIQAIIPGGSSTPPLPFTEEVLDTPMSYEHLQAKG
SMLGTGGVILIPERVSMVDAMWNLTRFYAHESCGKCTPCREGVAGFMVNLFAKIGTGQGEEKDVENLEALLPLIEGRSFC
PLADAAVWPVKGSLRHFKDQYLALAREKRPVPRPSLWR
>P29914 7.1.1.-~~~nqo2~~~NADH-quinone oxidoreductase chain 2~~~
MLRRLSPIQPDSFEFTPANLEWARAQMTKYPEGRQQSAIIPVLWRAQEQEGWLSRPAIEYCADLLGMPYIRALEVATFYF
MFQLQPVGSVAHIQICGTTTCMICGAEDLIRVCKEKIAPEPHALSADGRFSWEEVECLGACTNAPMAQIGKDFYEDLTVE
KLAALIDRFAAGEVPVPGPQNGRFSAEALGGPTALADLKGGEAHNASVARALRLGDSIKRIDGTEVPITTPWLATQNGV
>Q56221 7.1.1.-~~~nqo2~~~NADH-quinone oxidoreductase subunit 2~~~COG1905
MGFFDDKQDFLEETFAKYPPEGRRAAIMPLLRRVQQEEGWIRPERIEEIARLVGTTPTEVMGVASFYSYYQFVPTGKYHL
QVCATLSCKLAGAEELWDYLTETLGIGPGEVTPDGLFSVQKVECLGSCHTAPVIQVNDEPYVECVTRARLEALLAGLRAG
KRLEEIELPGKCGHHVHEVEV
>P29915 7.1.1.-~~~nqo3~~~NADH-quinone oxidoreductase chain 3~~~
MADLRKIKIDDTIIEVDPNMTLIQACEMAGIEVPRFCYHERLSIAGNCRMCLVEVVGGPPKPAASCAMQVKDLRPGPEGA
PSEIRTNSPMVKKAREGVMEFLLINHPLDCPICDQGGECDLQDQAMAYGVDFSRYREPKRATEDLNLGPLVETHMTRCIS
CTRCVRFTTEVAGITQMGQTGRGEDSEITSYLNQTLESNMQGNIIDLCPVGALVSKPYAFTARPWELTKTESIDVMDALG
SSIRIDTKGREVMRILPRNHDGVNEEWISDKTRFVWDGLRRQRLDRPYIRENGRLRPASWPEALEAAARAMKGKKIAGLI
GDLVPAEAAFSLKQLVEGLGGKVECRVDGARLPAGNRSAYVGTARIEDIDDAEMIQLIGTNPRDEAPVLNARIRKAWSKG
AKVGLVGEPVDLTYDYAHVGTDRAALESLSSREISDETKARPSIVIVGQGAIARRDGEAVLAHAMKLAENSNSGLLILHT
AAGRVGAMDVGAVTEGGLLAAIDGAEVVYNLGADEVDIDQGPFVIYQGSHGDRGAHRDIILPGACYTEESGLFVNTEGRP
QLAMRANFAPGEGKENWAILRALSAELGATQPWDSLAGLRRKLVEAVPHLAQIDQVPQNEWQPLGRFDLGQASFRYAIRD
FYLTNPIARSSPLMGELSAMAAARKAPAPLAAE
>Q56223 7.1.1.-~~~nqo3~~~NADH-quinone oxidoreductase subunit 3~~~COG1034
MVRVKVNDRIVEVPPGTSVMDAVFHAGYDVPLFCSEKHLSPIGACRMCLVRIGLPKKGPDGKPLLNEKGEPEIQWQPKLA
ASCVTAVADGMVVDTLSDVVREAQAGMVEFTLLNHPLDCPTCDKGGACELQDRTVEYGLYEKYYQKGPLELPVYTRFEFT
RRHVDKHHPLSPFVILDRERCIHCKRCVRYFEEVPGDEVLDFIERGVHTFIGTMDFGLPSGFSGNITDICPVGALLDLTA
RFRARNWEMEETPTTCALCPVGCGITADTRSGELLRIRAREVPEVNEIWICDAGRFGHEWADQNRLKTPLVRKEGRLVEA
TWEEAFLALKEGLKEARGEEVGLYLAHDATLEEGLLASELAKALKTPHLDFQGRTAAPASLFPPASLEDLLQADFALVLG
DPTEEAPILHLRLSEFVRDLKPPHRYNHGTPFADLQIKERMPRRTDKMALFAPYRAPLMKWAAIHEVHRPGEEREILLAL
LGDKEGSEMVAKAKEAWEKAKNPVLILGAGVLQDTVAAERARLLAERKGAKVLAMTPAANARGLEAMGVLPGAKGASWDE
PGALYAYYGFVPPEEALKGKRFVVMHLSHLHPLAERYAHVVLPAPTFYEKRGHLVNLEGRVLPLSPAPIENGEAEGALQV
LALLAEALGVRPPFRLHLEAQKALKARKVPEAMGRLSFRLKELRPKERKGAFYLRPTMWKAHQAVGKAQEAARAELWAHP
ETARAEALPEGAQVAVETPFGRVEARVVHREDVPKGHLYLSALGPAAGLRVEGRVLVPAGGEA
>P29916 7.1.1.-~~~nqo4~~~NADH-quinone oxidoreductase subunit 4~~~
MDGDIRKNSLDDGSMDALTGEQSIRNFNINFGPQHPAAHGLLRMVLELDGEIVERADPHIGLLHRGTEKLMESRTYLQNL
PYLDRLDYVAPMNQEHAWCLAIERLTGTVIPRRASLIRVLYSEIGRILNHLMGVTTGAMDVGALTPPLWGFEAREELMIF
YERACGARLHAAYFRPGGVHQDLPPDLLDDIEEWCERFPKLVDDLDTLLTENRIFKQRLVDIGIVTEADALDWGYTGVMV
RGSGLAWDLRRSQPYECYDEFDFQIPVGRNGDCYDRYLCRMAEMRESCKIMQQAVQKLRAEPAGDVLARGKLTPPRRAEM
KRDMESLIHHFKLYTEGFKVPAGEVYAAVEGPKGEFGVYLVADGTNKPWRAKLRAPGFAHLQSIDWMSRGHMLADVPAII
ATLDIVFGEVDR
>Q56220 7.1.1.-~~~nqo4~~~NADH-quinone oxidoreductase subunit 4~~~COG0649
MREEFLEEIPLDAPPEEAKELRTEVMTLNVGPQHPSTHGVLRLMVTLSGEEVLEVVPHIGYLHTGFEKTMEHRTYLQNIT
YTPRMDYLHSFAHDLAYALAVEKLLGAVVPPRAETIRVILNELSRLASHLVFLGTGLLDLGALTPFFYAFRERETILDLF
EWVTGQRFHHNYIRIGGVKEDLPEEFVPELKKLLEVLPHRIDEYEALFAESPIFYERARGVGVIPPEVAIDLGLTGGSLR
ASGVNYDVRKAYPYSGYETYTFDVPLGERGDVFDRMLVRIREMRESVKIIKQALERLEPGPVRDPNPQITPPPRHLLETS
MEAVIYHFKHYTEGFHPPKGEVYVPTESARGELGYYIVSDGGSMPYRVKVRAPSFVNLQSLPYACKGEQVPDMVAIIASL
DPVMGDVDR
>P29917 7.1.1.-~~~nqo5~~~NADH-quinone oxidoreductase chain 5~~~
MSEALSDEALLELAEHIAVRRENDVISTQAVGELTVNATLSGVIGLIEFLRNDPNCRFSTLIDITAVDNPARPARFDVVY
HLLSMYQNQRIRVKVQVREDELVPSLIGVFPGANWYEREVFDLFGILFSGHSDLRRILTDYGFRGHPLRKDFPTTGYVEV
RWSDIEKRVVYEPVNLVQEYRQFDFLSPWEGAKYVLPGDEKAPEAKK
>Q56219 7.1.1.-~~~nqo5~~~NADH-quinone oxidoreductase subunit 5~~~COG0852
MRLERVLEEARAKGYPIEDNGLGNLWVVLPRERFKEEMAHYKAMGFNFLADIVGLDYLTYPDPRPERFAVVYELVSLPGW
KDGDGSRFFVRVYVPEEDPRLPTVTDLWGSANFLEREVYDLFGIVFEGHPDLRKILTPEDLEGHPLRKDYPLGETPTLFR
EGRYIIPAEFRAALTGKDPGLTFYKGGSRKGYRSLWADLKKAREVKG
>Q56218 7.1.1.-~~~nqo6~~~NADH-quinone oxidoreductase subunit 6~~~COG0377
MALKDLFERDVQELEREGILFTTLEKLVAWGRSNSLWPATFGLACCAIEMMASTDARNDLARFGSEVFRASPRQADVMIV
AGRLSKKMAPVMRRVWEQMPDPKWVISMGACASSGGMFNNYAIVQNVDSVVPVDVYVPGCPPRPEALIYAVMQLQKKVRG
QAYNERGERLPPVAAWKRTRG
>Q56217 7.1.1.-~~~nqo7~~~NADH-quinone oxidoreductase subunit 7~~~COG0838
MAPIQEYVGTLIYVGVALFIGVAALLVGALLGPKKPGRAKLMPYESGNDPAGEVKRFPVHFYVVAMLFILFDVEVAFLWP
YAVSAGGLGLYGFLGVLAFTLLLFVGFLYEWWKGVMRWH
>Q60019 7.1.1.-~~~nqo8~~~NADH-quinone oxidoreductase subunit 8~~~COG1005
MTWSYPVDPYWMVALKALLVVVGLLTAFAFMTLIERRLLARFQVRMGPNRVGPFGLLQPLADAIKSIFKEDIVVAQADRF
LFVLAPLISVVFALLAFGLIPFGPPGSFFGYQPWVINLDLGILYLFAVSELAVYGIFLSGWASGSKYSLLGSLRSSASLI
SYELGLGLALLAPVLLVGSLNLNDIVNWQKEHGWLFLYAFPAFLVYLIASMAEAARTPFDLPEAEQELVGGYHTEYSSIK
WALFQMAEYIHFITASALIPTLFLGGWTMPVLEVPYLWMFLKIAFFLFFFIWIRATWFRLRYDQLLRFGWGFLFPLALLW
FLVTALVVALDLPRTYLLYLSALSFLVLLGAVLYTPKPARKGGGA
>Q56224 7.1.1.-~~~nqo9~~~NADH-quinone oxidoreductase subunit 9~~~COG1143
MTLKALAQSLGITLKYLFSKPVTVPYPDAPVALKPRFHGRHVLTRHPNGLEKCIGCSLCAAACPAYAIYVEPAENDPENP
VSAGERYAKVYEINMLRCIFCGLCEEACPTGAIVLGYDFEMADYEYSDLVYGKEDMLVDVVGTKPQRREAKRTGKPVKVG
YVVPYVRPELEGFKAPTEGGKR
>Q2YQ23 1.6.5.2~~~~~~NAD(P)H dehydrogenase (quinone)~~~
MVKMLVLYYSAYGYMEQMAKAAAEGAREGGAEVTLKRVPELVPEEVAKASHYKIDQEVPIATPGELADYDAIIIGTATRY
GMMASQMKNFLDQTGGLWAKGALINKVGSVMVSTATQHGGAELALISTQWQMQHHGMIIVPLSYAYREQMGNDVVRGGAP
YGMTTTADGDGSRQPSAQELDGARFQGRRVAEITAKLHG
>Q9RYU4 1.6.5.2~~~~~~NAD(P)H dehydrogenase (quinone)~~~COG0655
MTAPVKLAIVFYSSTGTGYAMAQEAAEAGRAAGAEVRLLKVRETAPQDVIDGQDAWKANIEAMKDVPEATPADLEWAEAI
VFSSPTRFGGATSQMRAFIDTLGGLWSSGKLANKTFSAMTSAQNVNGGQETTLQTLYMTAMHWGAVLTPPGYTDEVIFKS
GGNPYGASVTANGQPLLENDRASIRHQVRRQVELTAKLLG
>A1A9Q9 1.6.5.2~~~~~~NAD(P)H dehydrogenase (quinone)~~~
MAKVLVLYYSMYGHIETMARAVAEGASKVDGAEVVVKRVPETMPPQLFEKAGGKTQTAPVATPQELADYDAIIFGTPTRF
GNMSGQMRTFLDQTGGLWASGALYGKLASVFSSTGTGGGQEQTITSTWTTLAHHGMVIVPIGYAAQELFDVSQVRGGTPY
GATTIAGGDGSRQPSQEELSIARYQGEYVAGLAVKLNG
>P0A8G6 1.6.5.2~~~wrbA~~~NAD(P)H dehydrogenase (quinone)~~~COG0655
MAKVLVLYYSMYGHIETMARAVAEGASKVDGAEVVVKRVPETMPPQLFEKAGGKTQTAPVATPQELADYDAIIFGTPTRF
GNMSGQMRTFLDQTGGLWASGALYGKLASVFSSTGTGGGQEQTITSTWTTLAHHGMVIVPIGYAAQELFDVSQVRGGTPY
GATTIAGGDGSRQPSQEELSIARYQGEYVAGLAVKLNG
>Q9I509 1.6.5.2~~~~~~NAD(P)H dehydrogenase (quinone)~~~
MSSPYILVLYYSRHGATAEMARQIARGVEQGGFEARVRTVPAVSTECEAVAPDIPAEGALYATLEDLKNCAGLALGSPTR
FGNMASPLKYFLDGTSSLWLTGSLVGKPAAVFTSTASLHGGQETTQLSMLLPLLHHGMLVLGIPYSEPALLETRGGGTPY
GASHFAGADGKRSLDEHELTLCRALGKRLAETAGKLGS
>Q8ZQ40 1.6.5.2~~~~~~NAD(P)H dehydrogenase (quinone)~~~
MAKILVLYYSMYGHIETMAHAVAEGAKKVDGAEVIIKRVPETMPPEIFAKAGGKTQNAPVATPQELADYDAIIFGTPTRF
GNMSGQMRTFLDQTGGLWASGSLYGKLGSVFSSTGTGGGQEQTITSTWTTLAHHGMVIVPIGYAAQELFDVSQVRGGTPY
GATTIAGGDGSRQPSQEELSIARYQGEYVAGLAVKLNG
>Q66BP3 1.6.5.2~~~~~~NAD(P)H dehydrogenase (quinone)~~~
MAKILVLYYSMYGHIETLAGAIAEGARKVSGVDVTIKRVPETMPAEAFAKAGGKTNQQAPVATPHELADYDGIIFGTPTR
FGNMSGQMRTFLDQTGGLWASGALYGKVASVFASTGTGGGQEHTITSTWTTLAHHGFIIVPIGYGAKELFDVSQTRGGTP
YGATTIAGGDGSRQPSAEELAIARFQGEHVAKITAKLKG
>P43955 7.2.1.1~~~nqrA~~~Na(+)-translocating NADH-quinone reductase subunit A~~~COG1726
MITIKKGLDLPIAGKPAQVIHSGNAVNQVAILGEEYVGMRPSMKVREGDVVKKGQVLFEDKKNPGVIFTAPASGTITAIN
RGEKRVLQSVVINVEGDEKITFAKYSTEQLNTLSSEQVKQNLIESGLWTALRTRPFSKVPSIESEASSIFVNAMDTNPLA
ADPSVVLKEYSQDFTNGLTVLSRLFPSKPLHLCKAGDSNIPTADLENLQIHDFTGVHPAGLVGTHIHFIDPVGIQKTVWH
INYQDVIAVGKLFTTGELYSERVISLAGPQVKEPRLVRTTIGANLSQLTQNELSAGKNRVISGSVLCGQIAKDSHDYLGR
YALQVSVIAEGNEKEFFGWIMPQANKYSVTRTVLGHFSKKLFNFTTSENGGERAMVPIGSYERVMPLDILPTLLLRDLIV
GDTDGAQELGCLELDEEDLALCSFVCPGKYEYGSILRQVLDKIEKEG
>Q56586 7.2.1.1~~~nqrA~~~Na(+)-translocating NADH-quinone reductase subunit A~~~COG1726
MITIKKGLDLPIAGTPSQVINDGKTIKKVALLGEEYVGMRPTMHVRVGDEVKKAQVLFEDKKNPGVKFTAPAAGKVIEVN
RGAKRVLQSVVIEVAGEEQVTFDKFEAAQLSGLDREVIKTQLVDSGLWTALRTRPFSKVPAIESSTKAIFVTAMDTNPLA
AKPELIINEQQEAFIAGLDILSALTEGKVYVCKSGTSLPRSSQSNVEEHVFDGPHPAGLAGTHMHFLYPVNAENVAWSIN
YQDVIAFGKLFLTGELYTDRVVSLAGPVVNNPRLVRTVIGASLDDLTDNELMPGEVRVISGSVLTGTHATGPHAYLGRYH
QQVSVLREGREKELFGWAMPGKNKFSVTRSFLGHVFKGQLFNMTTTTNGSDRSMVPIGNYERVMPLDMEPTLLLRDLCAG
DTDSAQALGALELDEEDLALCTFVCPGKYEYGTLLRECLDTIEKEG
>A5F5X1 7.2.1.1~~~nqrA~~~Na(+)-translocating NADH-quinone reductase subunit A~~~COG1726
MITIKKGLDLPIAGTPSQVISDGKAIKKVALLGEEYVGMRPTMHVRVGDEVKKAQILFEDKKNPGVKFTSPVSGKVVEIN
RGAKRVLQSVVIEVAGDDQVTFDKFEANQLASLNRDAIKTQLVESGLWTAFRTRPFSKVPAIDSTSEAIFVTAMDTNPLA
AEPTVVINEQSEAFVAGLDVLSALTTGKVYVCKKGTSLPRSQQPNVEEHVFDGPHPAGLAGTHMHFLYPVSADHVAWSIN
YQDVIAVGQLFLTGELYTQRVVSLAGPVVNKPRLVRTVMGASLEQLVDSEIMPGEVRIISGSVLSGTKATGPHAYLGRYH
LQVSVLREGRDKELFGWAMPGKNKFSVTRSFLGHLFKGQVYNMTTTTNGSDRSMVPIGNYEKVMPLDMEPTLLLRDLCAG
DSDSAVRLGALELDEEDLALCTFVCPGKYEYGQLLRECLDKIEKEG
>Q9KPS1 7.2.1.1~~~nqrA~~~Na(+)-translocating NADH-quinone reductase subunit A~~~COG1726
MITIKKGLDLPIAGTPSQVISDGKAIKKVALLGEEYVGMRPTMHVRVGDEVKKAQILFEDKKNPGVKFTSPVSGKVVEIN
RGAKRVLQSVVIEVAGDDQVTFDKFEANQLASLNRDAIKTQLVESGLWTAFRTRPFSKVPAIDSTSEAIFVTAMDTNPLA
AEPTVVINEQSEAFVAGLDVLSALTTGKVYVCKKGTSLPRSQQPNVEEHVFDGPHPAGLAGTHMHFLYPVSADHVAWSIN
YQDVIAVGQLFLTGELYTQRVVSLAGPVVNKPRLVRTVMGASLEQLVDSEIMPGEVRIISGSVLSGTKATGPHAYLGRYH
LQVSVLREGRDKELFGWAMPGKNKFSVTRSFLGHLFKGQVYNMTTTTNGSDRSMVPIGNYEKVMPLDMEPTLLLRDLCAG
DSDSAVRLGALELDEEDLALCTFVCPGKYEYGQLLRECLDKIEKEG
>Q56587 7.2.1.1~~~nqrB~~~Na(+)-translocating NADH-quinone reductase subunit B~~~COG1805
MALKKFLEDIEHHFEPGGKHEKWFALYEAVATVFYTPGIVTNKSSHVRDSVDLKRIMIMVWFAVFPAMFWGMYNAGGQAI
AALNHMYAGDQLATVISGNWHYWLTEMLGGTIAADAGVGSKMLLGATYFLPIYATVFLVGGFWEVLFCMVRKHEVNEGFF
VTSILFALIVPPTLPLWQAALGITFGVVVAKEIFGGTGRNFLNPALAGRAFLFFAYPAQISGDVVWTAADGFSGATALSQ
WAQGGNGALVNTVTGSPITWMDAFIGNIPGSIGEVSTLALMIGAAMIVYMRIASWRIIAGVMIGMIAVSTLFNVIGSDTN
PMFNMPWHWHLVLGGFAFGMFFMATDPVSASFTNKGKWWYGILIGAMCVMIRVVNPAYPEGMMLAILFANLFAPLFDHVV
IEKNIKRRLARYGK
>A5F5X0 7.2.1.1~~~nqrB~~~Na(+)-translocating NADH-quinone reductase subunit B~~~COG1805
MGLKKFLEDIEHHFEPGGKHEKWFALYEAAATLFYTPGLVTKRSSHVRDSVDLKRIMIMVWLAVFPAMFWGMYNAGGQAI
AALNHLYSGDQLAAIVAGNWHYWLTEMLGGTMSSDAGWGSKMLLGATYFLPIYATVFIVGGFWEVLFCMVRKHEVNEGFF
VTSILFALIVPPTLPLWQAALGITFGVVVAKEVFGGTGRNFLNPALAGRAFLFFAYPAQISGDLVWTAADGYSGATALSQ
WAQGGAGALINNATGQTITWMDAFIGNIPGSIGEVSTLALMIGAAFIVYMGIASWRIIGGVMIGMILLSTLFNVIGSDTN
AMFNMPWHWHLVLGGFAFGMFFMATDPVSASFTNSGKWAYGILIGVMCVLIRVVNPAYPEGMMLAILFANLFAPLFDHVV
VERNIKRRLARYGKQ
>Q9KPS2 7.2.1.1~~~nqrB~~~Na(+)-translocating NADH-quinone reductase subunit B~~~COG1805
MGLKKFLEDIEHHFEPGGKHEKWFALYEAAATLFYTPGLVTKRSSHVRDSVDLKRIMIMVWLAVFPAMFWGMYNAGGQAI
AALNHLYSGDQLAAIVAGNWHYWLTEMLGGTMSSDAGWGSKMLLGATYFLPIYATVFIVGGFWEVLFCMVRKHEVNEGFF
VTSILFALIVPPTLPLWQAALGITFGVVVAKEVFGGTGRNFLNPALAGRAFLFFAYPAQISGDLVWTAADGYSGATALSQ
WAQGGAGALINNATGQTITWMDAFIGNIPGSIGEVSTLALMIGAAFIVYMGIASWRIIGGVMIGMILLSTLFNVIGSDTN
AMFNMPWHWHLVLGGFAFGMFFMATDPVSASFTNSGKWAYGILIGVMCVLIRVVNPAYPEGMMLAILFANLFAPLFDHVV
VERNIKRRLARYGKQ
>Q56582 7.2.1.1~~~nqrC~~~Na(+)-translocating NADH-quinone reductase subunit C~~~COG2869
MASNNDSIKKTLGVVIGLSLVCSIIVSTAAVGLRDKQKANAVLDKQSKIVEVAGIDANGKKVPELFAEYIEPRLVDLETG
NFTEGNASTYDQREASKDAERSIALTPEEDVADIRRRANTAVVYLVKDQDEVQKVILPMHGKGLWSMMYAFVAVETDGNT
VSAITYYEQGETPGLGGEVENPSWRDQFIGKKLYNEDHQPAIKVVKGGAPQGSEHGVDGLSGATLTSNGVQHTFDFWLGD
KGFGPFLAKVRDGELN
>Q9RFV9 7.2.1.1~~~nqrC~~~Na(+)-translocating NADH-quinone reductase subunit C~~~
MASNNDSIKKTLGVVVGLSLVCSIIVSTAAVGLRDQQKANAVLDKQSKIVEVAGIDAEGKKVPELFAEYIEPRLVDFKTG
DFVEKAEDGSTAANYDQRKAAKDPAESIKLTADEDKAKILRRANTGIVYLVKNGDDISKVIIPVHGNGLWSMMYAFVAVE
TDGNTVSGITYYEQGETPGLGGEVENPVWRAQFVGKKLFDENHKPAIKIVKGGAPEGSEHGVDGLSGATLTGNGVQGTFD
FWLGDMGFGPFLAKVRDGGLN
>A5F5Y7 7.2.1.1~~~nqrC~~~Na(+)-translocating NADH-quinone reductase subunit C~~~COG2869
MASNNDSIKKTLFVVIALSLVCSIIVSAAAVGLRDKQKENAALDKQSKILQVAGIEAKGSKQIVELFNKSIEPRLVDFNT
GDFVEGDAANYDQRKAAKEASESIKLTAEQDKAKIQRRANVGVVYLVKDGDKTSKVILPVHGNGLWSMMYAFVAVETDGN
TVSGLTYYEQGETPGLGGEVENPAWRAQWVGKKLFDENHKPAIKIVKGGAPQGSEHGVDGLSGATLTSNGVQNTFDFWLG
DMGFGPFLTKVRDGGLN
>P0C6E0 7.2.1.1~~~nqrC~~~Na(+)-translocating NADH-quinone reductase subunit C~~~COG2869
MASNNDSIKKTLFVVIALSLVCSIIVSAAAVGLRDKQKENAALDKQSKILQVAGIEAKGSKQIVELFNKSIEPRLVDFNT
GDFVEGDAANYDQRKAAKEASESIKLTAEQDKAKIQRRANVGVVYLVKDGDKTSKVILPVHGNGLWSMMYAFVAVETDGN
TVSGLTYYEQGETPGLGGEVENPAWRAQWVGKKLFDENHKPAIKIVKGGAPQGSEHGVDGLSGATLTSNGVQNTFDFWLG
DMGFGPFLTKVRDGGLN
>Q57095 7.2.1.1~~~nqrD~~~Na(+)-translocating NADH-quinone reductase subunit D~~~COG1347
MSSAQNVKKSILAPVLDNNPIALQVLGVCSALAVTTKLETAFVMTLAVTFVTALSNFSVSLIRNHIPNSVRIIVQMAIIA
SLVIVVDQVLKAYLYDISKQLSVFVGLIITNCIVMGRAEAFAMKSAPVPSLIDGIGNGLGYGFVLITVGFFRELFGSGKL
FGLEVLPLVSNGGWYQPNGLMLLAPSAFFLIGFLIWVIRILKPEQVEAKE
>A5F5Y6 7.2.1.1~~~nqrD~~~Na(+)-translocating NADH-quinone reductase subunit D~~~COG1347
MSSAKELKKSVLAPVLDNNPIALQVLGVCSALAVTTKLETAFVMTLAVMFVTALSNFFVSLIRNHIPNSVRIIVQMAIIA
SLVIVVDQILKAYLYDISKQLSVFVGLIITNCIVMGRAEAFAMKSEPIPSFIDGIGNGLGYGFVLMTVGFFRELLGSGKL
FGLEVLPLISNGGWYQPNGLMLLAPSAFFLIGFMIWAIRTFKPEQVEAKE
>Q9X4Q6 7.2.1.1~~~nqrD~~~Na(+)-translocating NADH-quinone reductase subunit D~~~COG1347
MSSAKELKKSVLAPVLDNNPIALQVLGVCSALAVTTKLETAFVMTLAVMFVTALSNFFVSLIRNHIPNSVRIIVQMAIIA
SLVIVVDQILKAYLYDISKQLSVFVGLIITNCIVMGRAEAFAMKSEPIPSFIDGIGNGLGYGFVLMTVGFFRELLGSGKL
FGLEVLPLISNGGWYQPNGLMLLAPSAFFLIGFMIWAIRTFKPEQVEAKE
>Q9I4V0 1.6.5.9~~~~~~NADH:quinone reductase~~~
MGVFRTRFTETFGVEHPIMQGGMQWVGRAEMAAAVANAGGLATLSALTQPSPEALAAEIARCRELTDRPFGVNLTLLPTQ
KPVPYAEYRAAIIEAGIRVVETAGNDPGEHIAEFRRHGVKVIHKCTAVRHALKAERLGVDAVSIDGFECAGHPGEDDIPG
LVLLPAAANRLRVPIIASGGFADGRGLVAALALGADAINMGTRFLATRECPIHPAVKAAIRAADERSTDLIMRSLRNTAR
VARNAISQEVLAIEARGGAGYADIAALVSGQRGRQVYQQGDTDLGIWSAGMVQGLIDDEPACAELLRDIVEQARQLVRQR
LEGMLAGV
>Q56589 7.2.1.1~~~nqrE~~~Na(+)-translocating NADH-quinone reductase subunit E~~~COG2209
MEHYISLLVKSIFIENMALSFFLGMCTFLAVSKKVKTSFGLGVAVVVVLTIAVPVNNLVYNLVLRENALVEGVDLSFLNF
ITFIGVIAALVQILEMVLDRFFPPLYNALGIFLPLITVNCAIFGGVSFMVQRDYNFAESIVYGFGSGVGWMLAIVALAGI
REKMKYSDVPPGLRGLGITFITVGLMALGFMSFSGVQL
>A5F5Y5 7.2.1.1~~~nqrE~~~Na(+)-translocating NADH-quinone reductase subunit E~~~COG2209
MEHYISLLVKSIFIENMALSFFLGMCTFLAVSKKVKTSFGLGIAVIVVLTISVPVNNLVYNLVLKPDALVEGVDLSFLNF
ITFIGVIAALVQILEMILDRFFPPLYNALGIFLPLITVNCAIFGGVSFMVQRDYSFAESVVYGFGSGVGWMLAIVALAGI
REKMKYSDVPPGLRGLGITFITAGLMALGFMSFSGVQL
>Q9X4Q7 7.2.1.1~~~nqrE~~~Na(+)-translocating NADH-quinone reductase subunit E~~~COG2209
MEHYISLLVKSIFIENMALSFFLGMCTFLAVSKKVKTSFGLGIAVIVVLTISVPVNNLVYNLVLKPDALVEGVDLSFLNF
ITFIGVIAALVQILEMILDRFFPPLYNALGIFLPLITVNCAIFGGVSFMVQRDYSFAESVVYGFGSGVGWMLAIVALAGI
REKMKYSDVPPGLRGLGITFITAGLMALGFMSFSGVQL
>A6T526 7.2.1.1~~~nqrF~~~Na(+)-translocating NADH-quinone reductase subunit F~~~
MEIILGVVMFTLIVLVLSGLILAARSKLVNAGDVVIEINNEADKQIRTPAGDKLLNTLSSNGIFVSSACGGGGSCGQCRV
TVKEGGGDILPTELSHITKRDAKAGCRLACQVAVKQNMKIELPEEIFGVKKWECEVISNDNKATFIKELKLRIPEGEVVP
FRAGGYIQIECPPHKVAYADFDVPDEYRSDWDKFNLFRYVSEVKEPTLRAYSMANYPEEKGIIMLNVRIATPPPKVPDAP
PGIMSSYIWSLKPGDKVTISGPFGEFFAKETDAEMVFIGGGAGMAPMRSHIFDQLKRLHSTRKISFWYGARSLREMFYDE
EFEQLARDNPNFTFHVALSDPLPEDNWTGHTGFIHNVLYENYLRDHPAPEDCEFYMCGPPVMNAAVIKMLKDLGVEDENI
LLDDFGG
>Q02PF8 7.2.1.1~~~nqrF~~~Na(+)-translocating NADH-quinone reductase subunit F~~~
MIGFEIFLAIGMFTAIVLGLVAIILVARAKLVSSGDVTIQINGEHSLTVPAGGKLLQTLATNNVFLSSACGGGGTCAQCK
CVVVEGGGEMLPTEESHFTRRQAKEGWRLSCQTPVKQDMQIRVPEEVFGVKKWECTVESNPNVATFIKELTLRLPDGESV
DFRAGGYVQLECPPHVVEYKDFDIQPEYRGDWDKFNMWRYVSKVDETVIRAYSMANYPEEQGVVKFNIRIASPPPGSDLP
PGQMSSWVFNLKPGDKVTVYGPFGEFFAKDTEAEMVFIGGGAGMAPMRSHIFDQLRRLKSNRKISFWYGARSLREAFYTE
EYDQLQAENPNFQWHLALSDPQPEDNWTGLTGFIHNVLFENYLKDHPAPEDCEFYMCGPPMMNAAVIKMLTDLGVERENI
LLDDFGG
>Q56584 7.2.1.1~~~nqrF~~~Na(+)-translocating NADH-quinone reductase subunit F~~~COG2871
MDIILGVVMFTLIVLALVLVILFAKSKLVPTGDITISVNDDPSLAIVTQPGGKLLSALAGAGVFVSSACGGGGSCGQCRV
KVKSGGGDILPTELDHITKGEAREGERLACQVAMKTDMDIELPEEIFGVKKWECTVISNDNKATFIKELKLQIPDGESVP
FRAGGYIQIEAPAHHVKYADYDIPEEYREDWEKFNLFRYESKVNEETIRAYSMANYPEEHGIIMLNVRIATPPPNNPDVP
PGIMSSYIWSLKEGDKCTISGPFGEFFAKDTDAEMVFVGGGAGMAPMRSHIFDQLKRLHSKRKMSFWYGARSKREMFYVE
DFDMLQAENDNFVWHCALSDPLPEDNWDGYTGFIHNVLYENYLRDHEAPEDCEYYMCGPPMMNAAVIGMLKDLGVEDENI
LLDDFGG
>A5F5Y4 7.2.1.1~~~nqrF~~~Na(+)-translocating NADH-quinone reductase subunit F~~~COG2871
MSTIIFGVVMFTLIILALVLVILFAKSKLVPTGDITISINGDPEKAIVTQPGGKLLTALAGAGVFVSSACGGGGSCGQCR
VKIKSGGGDILPTELDHISKGEAREGERLACQVAVKADMDLELPEEIFGVKKWECTVISNDNKATFIKELKLAIPDGESV
PFRAGGYIQIEAPAHHVKYADFDVPEKYRGDWDKFNLFRYESKVDEPIIRAYSMANYPEEFGIIMLNVRIATPPPNNPNV
PPGQMSSYIWSLKAGDKCTISGPFGEFFAKDTDAEMVFIGGGAGMAPMRSHIFDQLKRLKSKRKMSYWYGARSKREMFYV
EDFDGLAAENDNFVWHCALSDPQPEDNWTGYTGFIHNVLYENYLKDHEAPEDCEYYMCGPPMMNAAVINMLKNLGVEEEN
ILLDDFGG
>Q9X4Q8 7.2.1.1~~~nqrF~~~Na(+)-translocating NADH-quinone reductase subunit F~~~COG2871
MSTIIFGVVMFTLIILALVLVILFAKSKLVPTGDITISINGDPEKAIVTQPGGKLLTALAGAGVFVSSACGGGGSCGQCR
VKIKSGGGDILPTELDHISKGEAREGERLACQVAVKADMDLELPEEIFGVKKWECTVISNDNKATFIKELKLAIPDGESV
PFRAGGYIQIEAPAHHVKYADFDVPEKYRGDWDKFNLFRYESKVDEPIIRAYSMANYPEEFGIIMLNVRIATPPPNNPNV
PPGQMSSYIWSLKAGDKCTISGPFGEFFAKDTDAEMVFIGGGAGMAPMRSHIFDQLKRLKSKRKMSYWYGARSKREMFYV
EDFDGLAAENDNFVWHCALSDPQPEDNWTGYTGFIHNVLYENYLKDHEAPEDCEYYMCGPPMMNAAVINMLKNLGVEEEN
ILLDDFGG
>P28903 1.1.98.6~~~nrdD~~~Anaerobic ribonucleoside-triphosphate reductase~~~COG1328
MTPHVMKRDGCKVPFKSERIKEAILRAAKAAEVDDADYCATVAAVVSEQMQGRNQVDINEIQTAVENQLMSGPYKQLARA
YIEYRHDRDIEREKRGRLNQEIRGLVEQTNASLLNENANKDSKVIPTQRDLLAGIVAKHYARQHLLPRDVVQAHERGDIH
YHDLDYSPFFPMFNCMLIDLKGMLTQGFKMGNAEIEPPKSISTATAVTAQIIAQVASHIYGGTTINRIDEVLAPFVTASY
NKHRKTAEEWNIPDAEGYANSRTIKECYDAFQSLEYEVNTLHTANGQTPFVTFGFGLGTSWESRLIQESILRNRIAGLGK
NRKTAVFPKLVFAIRDGLNHKKGDPNYDIKQLALECASKRMYPDILNYDQVVKVTGSFKTPMGCRSFLGVWENENGEQIH
DGRNNLGVISLNLPRIALEAKGDEATFWKLLDERLVLARKALMTRIARLEGVKARVAPILYMEGACGVRLNADDDVSEIF
KNGRASISLGYIGIHETINALFGGEHVYDNEQLRAKGIAIVERLRQAVDQWKEETGYGFSLYSTPSENLCDRFCRLDTAE
FGVVPGVTDKGYYTNSFHLDVEKKVNPYDKIDFEAPYPPLANGGFICYGEYPNIQHNLKALEDVWDYSYQHVPYYGTNTP
IDECYECGFTGEFECTSKGFTCPKCGNHDASRVSVTRRVCGYLGSPDARPFNAGKQEEVKRRVKHLGNGQIG
>P0A9N8 1.97.1.-~~~nrdG~~~Anaerobic ribonucleoside-triphosphate reductase-activating protein~~~COG0602
MNYHQYYPVDIVNGPGTRCTLFVSGCVHECPGCYNKSTWRVNSGQPFTKAMEDQIINDLNDTRIKRQGISLSGGDPLHPQ
NVPDILKLVQRIRAECPGKDIWVWTGYKLDELNAAQMQVVDLINVLVDGKFVQDLKDPSLIWRGSSNQVVHHLR
>P0AC65 ~~~nrdH~~~Glutaredoxin-like protein NrdH~~~COG0695
MRITIYTRNDCVQCHATKRAMENRGFDFEMINVDRVPEAAEALRAQGFRQLPVVIAGDLSWSGFRPDMINRLHPAPHAAS
A
>Q48708 ~~~nrdH~~~Glutaredoxin-like protein NrdH~~~COG0695
MVTVYSKNNCMQCKMVKKWLSEHEIAFNEINIDEQPEFVEKVIEMGFRAAPVITKDDFAFSGFRPSELAKLA
>Q97T03 ~~~~~~Putative NrdI-like protein~~~COG1780
MKTISLVYISLSGNTESFVTRLKDYLLSQYKGIEVQKIHIKDLVKEGKNFYEMDHPYVAFLPTYLEGGNGVDNGDVEILT
TPVGDFIAYGNNASKCFGVVGSGNRNFNNQYCLTAKQYSQRFGFPVLADFEMRGMLEDIKHVAAIIADLYELEKEN
>P50618 ~~~nrdI~~~Protein NrdI~~~COG1780
MVQIIFDSKTGNVQRFVNKTGFQQIRKVDEMDHVDTPFVLVTYTTNFGQVPASTQSFLEKYAHLLLGVAASGNKVWGDNF
AKSADTISRQYQVPILHKFELSGTSKDVELFTQEVERVVTKSSAKMDPVK
>P0A772 ~~~nrdI~~~Protein NrdI~~~COG1780
MSQLVYFSSSSENTQRFIERLGLPAVRIPLNERERIQVDEPYILIVPSYGGGGTAGAVPRQVIRFLNDEHNRALLRGVIA
SGNRNFGEAYGRAGDVIARKCGVPWLYRFELMGTQSDIENVRKGVTEFWQRQPQNA
>P9WIZ3 ~~~nrdI~~~Protein NrdI~~~COG1780
MDIAGRSLVYFSSVSENTHRFVQKLGIPATRIPLHGRIEVDEPYVLILPTYGGGRANPGLDAGGYVPKQVIAFLNNDHNR
AQLRGVIAAGNTNFGAEFCYAGDVVSRKCSVPYLYRFELMGTEDDVAAVRTGLAEFWKEQTCHQPSLQSL
>O54196 1.17.4.1~~~nrdJ~~~Vitamin B12-dependent ribonucleotide reductase~~~COG0209
MTETTSGPARGSRTKGTKATKGLRIERVHTTPGVHPYDEVVWERRDVVMTNWRDGSVNFEQRGVEFPDFWSVNAVNIVTS
KYFRGAVGTPQRETGLKQLIDRIVKTYRKAGEEYKYFASPADAEIFEHELAYALLHQIFSFNSPVWFNVGTPQPQQVSAC
FILSVDDSMESILDWYKEEGMIFKGGSGAGLNLSRIRSSKELLSSGGNASGPVSFMRGADASAGTIKSGGATRRAAKMVI
LDVDHPDIEGFIETKVKEEEKIRALRDAGFDMDLGGDDITSVQYQNANNSVRVNDEFMRAVESGSAFGLRARMTGEIIEQ
VDAKALFRKMAQAAWACADPGIQYDDTINRWHTCPESGRINGSNPCSEYMHLDNTSCNLASLNLMKFLTDDGEGNQSFDV
ERFAKVVELVITAMDISICFADFPTQKIGENTRAFRQLGIGYANLGALLMATGHAYDSDGGRAIAGAISSLMTGTSYRRS
AELAAVVGPYDGYARNAAPHNQVMRQHADANDTAVRMDDLDTPIWAAATETWQDVLRLGEKNGFRNAQASVIAPTGTIGL
AMSCDTTGLEPDLALVKFKKLVGGGSMQIVNGTVPQALRRLGYQAEQIEAIVEHIAEHGNVLDAPGLKTEHYKVFDCAMG
ERSISAMGHVRMMAAIQPWISGALSKTVNMPESATVEEVEEIYFEAWKMGVKALAIYRDNCKVGQPLSAKTKEKEQDGIA
EKTEDTIRAAVEKVIEYRPVRKRLPKGRPGITTSFTVGGAEGYMTANSYPDDGLGEVFLKMSKQGSTLAGMMDAFSIAVS
VGLQYGVPLETYVSKFTNMRFEPAGMTDDPDVRMAQSIVDNIFRRLALDFLPFETRSALGIHSAEERQRHLDTGSYEQVI
EEDELDVEGLAQSAPRQQIPAVPAAPAEIPAPKQAHTSAELVEMQLGISADAPLCFSCGTKMQRAGSCYICEGCGSTSGC
S
>O69981 1.17.4.1~~~nrdJ~~~Vitamin B12-dependent ribonucleotide reductase~~~COG0209
MTETASGPARSSRAKGTKAGKGLRVERVHTTPGVHPYDEVAWERRDVVMTNWRDGSVNFEQRGVEFPEFWSVNAVNIVTS
KYFRGAVGTPQREVSLKQLIDRIVKTYRKAGEDNKYFASPADAEIFEHELAYALLHQIFSFNSPVWFNVGTPQPQQVSAC
FILSVDDSMESILDWYKEEGMIFKGGSGAGLNLSRIRSSKELLSSGGNASGPVSFMRGADASAGTIKSGGATRRAAKMVI
LDVDHPDIEDFIQTKVKEEEKIRALRDAGFDMDLGGDDITSVQYQNANNSVRVNDTFMKAVQDGGKFGLTSRMTGEVIEE
VDAKALFRKMAEAAWACADPGIQYDDTINAWHTCPESGRINGSNPCSEYMHLDNTSCNLASLNLMKFLKDDGKGNQSFDA
ERFSKVVELVITAMDISICFADFPTQKIGENTRAFRQLGIGYANLGALLMATGHAYDSDGGRALAGAITSLMTGTSYRRS
AELAAIVGPYDGYARNAKPHLRVMKQHSDENAKAVRMDDLDTPIWAAATEAWQDVLRLGEKNGFRNSQASVIAPTGTIGL
AMSCDTTGLEPDLALVKFKKLVGGGSMQIVNGTVPQALRRLGYQEEQIEAIVAHIAENGNVIDAPGLKPEHYEVFDCAMG
ERSISAMGHVRMMAAIQPWISGALSKTVNLPESATVEDVEEVYFEAWKMGVKALAIYRDNCKVGQPLSAKTKTVKDTEKA
EITEKTEAAIRETVEKVVEYRPVRKRLPKGRPGITTSFTVGGAEGYMTANSYPDDGLGEVFLKMSKQGSTLAGMMDAFSI
AVSVGLQYGVPLETYVSKFTNMRFEPAGMTDDPDVRMAQSIVDYIFRRLALDFLPFETRSALGIHSAEERQRHLETGSYE
PSDDELDVEGLAQSAPRAQELVAVATPKAEAEAAKPAPQQAHTSAELVEMQLGIQADAPLCFSCGTKMQRAGSCYICEGC
GSTSGCS
>P0A8D0 ~~~nrdR~~~Transcriptional repressor NrdR~~~COG1327
MHCPFCFAVDTKVIDSRLVGEGSSVRRRRQCLVCNERFTTFEVAELVMPRVVKSNDVREPFNEEKLRSGMLRALEKRPVS
SDDVEMAINHIKSQLRATGEREVPSKMIGNLVMEQLKKLDKVAYIRFASVYRSFEDIKEFGEEIARLED
>P9WIZ1 ~~~nrdR~~~Transcriptional repressor NrdR~~~COG1327
MHCPFCRHPDSRVIDSRETDEGQAIRRRRSCPECGRRFTTVETAVLAVVKRSGVTEPFSREKVISGVRRACQGRQVDDDA
LNLLAQQVEDSVRAAGSPEIPSHDVGLAILGPLRELDEVAYLRFASVYRSFSSADDFAREIEALRAHRNLSAHS
>O69980 ~~~nrdR~~~Transcriptional repressor NrdR~~~COG1327
MHCPFCRHPDSRVVDSRTTDDGTSIRRRRQCPDCSRRFTTVETCSLMVVKRSGVTEPFSRTKVINGVRKACQGRPVTEDA
LAQLGQRVEEAVRATGSAELTTHDVGLAILGPLQELDLVAYLRFASVYRAFDSLEDFEAAIAELRETTGHPGEEDDTGAG
SQENDRGPTGAGQVPEPAGAAD
>P67318 ~~~nrdR~~~Transcriptional repressor NrdR~~~COG1327
MRCPKCGATKSSVIDSRQAEEGNTIRRRRECDECQHRFTTYERVEERTLVVVKKDGTREQFSRDKIFNGIIRSAQKRPVS
SDEINMVVNRIEQKLRGRNENEIQSEDIGSLVMEELAELDEITYVRFASVYRSFKDVSELESLLQQITQSSKKKKER
>P9WH76 1.17.4.1~~~nrdZ~~~Vitamin B12-dependent ribonucleoside-diphosphate reductase~~~
MGVSWPAKVRRRDGTLVPFDIARIEAAVTRAAREVACDDPDMPGTVAKAVADALGRGIAPVEDIQDCVEARLGEAGLDDV
ARVYIIYRQRRAELRTAKALLGVRDELKLSLAAVTVLRERYLLHDEQGRPAESTGELMDRSARCVAAAEDQYEPGSSRRW
AERFATLLRNLEFLPNSPTLMNSGTDLGLLAGCFVLPIEDSLQSIFATLGQAAELQRAGGGTGYAFSHLRPAGDRVASTG
GTASGPVSFLRLYDSAAGVVSMGGRRRGACMAVLDVSHPDICDFVTAKAESPSELPHFNLSVGVTDAFLRAVERNGLHRL
VNPRTGKIVARMPAAELFDAICKAAHAGGDPGLVFLDTINRANPVPGRGRIEATNPCGEVPLLPYESCNLGSINLARMLA
DGRVDWDRLEEVAGVAVRFLDDVIDVSRYPFPELGEAARATRKIGLGVMGLAELLAALGIPYDSEEAVRLATRLMRRIQQ
AAHTASRRLAEERGAFPAFTDSRFARSGPRRNAQVTSVAPTGTISLIAGTTAGIEPMFAIAFTRAIVGRHLLEVNPCFDR
LARDRGFYRDELIAEIAQRGGVRGYPRLPAEVRAAFPTAAEIAPQWHLRMQAAVQRHVEAAVSKTVNLPATATVDDVRAI
YVAAWKAKVKGITVYRYGSREGQVLSSAAPKPLLAQADTEFSGGCAGRSCEF
>P9WH77 1.17.4.1~~~nrdZ~~~Vitamin B12-dependent ribonucleoside-diphosphate reductase~~~COG0209
MPGERRRFAQATKSVGVSWPAKVRRRDGTLVPFDIARIEAAVTRAAREVACDDPDMPGTVAKAVADALGRGIAPVEDIQD
CVEARLGEAGLDDVARVYIIYRQRRAELRTAKALLGVRDELKLSLAAVTVLRERYLLHDEQGRPAESTGELMDRSARCVA
AAEDQYEPGSSRRWAERFATLLRNLEFLPNSPTLMNSGTDLGLLAGCFVLPIEDSLQSIFATLGQAAELQRAGGGTGYAF
SHLRPAGDRVASTGGTASGPVSFLRLYDSAAGVVSMGGRRRGACMAVLDVSHPDICDFVTAKAESPSELPHFNLSVGVTD
AFLRAVERNGLHRLVNPRTGKIVARMPAAELFDAICKAAHAGGDPGLVFLDTINRANPVPGRGRIEATNPCGEVPLLPYE
SCNLGSINLARMLADGRVDWDRLEEVAGVAVRFLDDVIDVSRYPFPELGEAARATRKIGLGVMGLAELLAALGIPYDSEE
AVRLATRLMRRIQQAAHTASRRLAEERGAFPAFTDSRFARSGPRRNAQVTSVAPTGTISLIAGTTAGIEPMFAIAFTRAI
VGRHLLEVNPCFDRLARDRGFYRDELIAEIAQRGGVRGYPRLPAEVRAAFPTAAEIAPQWHLRMQAAVQRHVEAAVSKTV
NLPATATVDDVRAIYVAAWKAKVKGITVYRYGSREGQVLSYAAPKPLLAQADTEFSGGCAGRSCEF
>Q7WZY5 2.7.13.3~~~nreB~~~Oxygen sensor histidine kinase NreB~~~COG4585
MKSISNRDKLQDLLTQYYLNTNEKMVFLNSTGEVIALNEAAEEVFADDNDYSQMTNAVCRRCEGYSNEYDIMSCENCFLE
ALEIGKGSFQVFIRTKDNKIQPYTASYELIDHEKGIYAFTLHNVSPQIQRQERMYQRKMMQKTISAQENERKRISRELHD
GIVQELINVDVELRLLKYQQDKDELIDNSKRIEGIMSRLIDDVRNLSVELRPSSLDDLGLDAAFRSYFKQFEKNYGIHVN
YHTNFSAQRFDNEIETVVYRVVQEALFNALKYAQVDIVEVSLQLNENNIIAEVSDRGVGFKRGDDPKGTGLGLFGMNERA
ELVNGTVNIDSQINRGTIVTLEVPITD
>Q7WZY4 ~~~nreC~~~Oxygen regulatory protein NreC~~~COG2197
MKIVIADDHAVVRTGFSMILNFQDDMEVVDTAADGVEAYQKVMQHQPDVLIMDLSMPPGESGLIATSKIVESFPDTKILI
LTMYDDEEYLFHVLRNGAKGYILKNAPDEQLISAVRTVYRGDTYIDPKMTTSLVNEFVNNTGQDANSTNDPFRILSKREL
EILPLIAKGYGNKEIAEKLFVSVKTVEAHKTHIMQKLNLKSKPELVEYALKKKLLDF
>Q72EF3 1.7.2.2~~~nrfA~~~Cytochrome c nitrite reductase subunit NrfA~~~COG3303
MNNQKTFKGLRLAALGLVAVAAFTAGCSDVSTELKTPVYKTKLTAEEIRNSAFKPEFPKQYASYERNDETTVMTEYKGSV
PFNKNDNVNPLPEGYRHAQPYLKNLWLGYPFMYEYREARGHTYAIQDFLHIDRINRYAEKGGLPATCWNCKTPKMMEWVK
ESGDGFWAKDVNEFRDKIDMKDHTIGCATCHDPQTMELRITSVPLTDYLVSQGKDPKKLPRNEMRALVCGQCHVEYYFNG
PTMGVNKKPVFPWAEGFDPADMYRYYDKHGDLQVKGFEGKFADWTHPASKTPMIKAQHPEYETWINGTHGAAGVTCADCH
MSYTRSDDKKKISSHWWTSPMKDPEMRACRQCHSDKTPDYLKSRVLFTQKRTFDLLLAAQEVSVKAHEAVRLANEYQGAK
AAGYDDLMIQAREMVRKGQFFWDYVSAENSVGFHNPAKALDTLAQSQQFSQKAIDLAMEATQYGIGKDLSGDIKTIVPPI
LKMNRKLQQDPEFMKTHKWFQYLPVLPKADQVWDGQKRLVSAKQ
>P0ABK9 1.7.2.2~~~nrfA~~~Cytochrome c-552~~~COG3303
MTRIKINARRIFSLLIPFFFFTSVHAEQTAAPAKPVTVEAKNETFAPQHPDQYLSWKATSEQSERVDALAEDPRLVILWA
GYPFSRDYNKPRGHAFAVTDVRETLRTGAPKNAEDGPLPMACWSCKSPDVARLIQKDGEDGYFHGKWARGGPEIVNNLGC
ADCHNTASPEFAKGKPELTLSRPYAARAMEAIGKPFEKAGRFDQQSMVCGQCHVEYYFDGKNKAVKFPWDDGMKVENMEQ
YYDKIAFSDWTNSLSKTPMLKAQHPEYETWTAGIHGKNNVTCIDCHMPKVQNAEGKLYTDHKIGNPFDNFAQTCANCHTQ
DKAALQKVVAERKQSINDLKIKVEDQLVHAHFEAKAALDAGATEAEMKPIQDDIRHAQWRWDLAIASHGIHMHAPEEGLR
MLGTAMDKAADARTKLARLLATKGITHEIQIPDISTKEKAQQAIGLNMEQIKAEKQDFIKTVIPQWEEQARKNGLLSQ
>Q8EAC7 1.7.2.2~~~nrfA~~~Cytochrome c-552~~~COG3303
MMKKMTGKTFALSALVAASFMAAGAMASDKTEPRNEVYKDKFKNQYNSWHDTAKSEELVDALEQDPNMVILWAGYAFAKD
YKAPRGHMYAVTDVRNTLRTGAPKNAEDGPLPMACWSCKSPDVPRLIEEQGEDGYFKGKWAKGGPEVTNTIGCSDCHEKG
SPKLRISRPYVDRALDAIGTPFSKASKQDKESMVCAQCHVEYYFEKKEDKKGFVKFPWDMGVTVDQMEVYYDGIEFSDWT
HALSKTPMLKAQHPEYETWKMGIHGKNNVSCVDCHMPKVTSPEGKKFTDHKVGNPFDRFEETCATCHSQTKEFLVGVTNE
RKAKVKEMKLKAEEQLVKAHFEAAKAWELGATEAEMKPILTDIRHAQWRWDLAIASHGVAAHAPEEALRVLGTSVNKAAD
ARVKLAQLLAKKGLTDPVAIPDISTKAKAQAVLGMDMEKMNAEKEAFKKDMLPKWDAEAKKREATYK
>Q2G0Z5 1.6.-.-~~~nfrA~~~NADPH-dependent oxidoreductase~~~COG0778
MSEHVYNLVKKHHSVRKFKNKPLSEDVVKKLVEAGQSASTSSFLQAYSIIGIDDEKIKENLREVSGQPYVVENGYLFVFV
IDYYRHHLVDQHAETDMENAYGSTEGLLVGAIDAALVAENIAVTAEDMGYGIVFLGSLRNDVERVREILDLPDYVFPVFG
MAVGEPADDENGAAKPRLPFDHVFHHNKYHADKETQYAQMADYDQTISEYYDQRTNGNRKETWSQQIEMFLGNKARLDML
EQLQKSGLIQR
>Q9Z4P4 1.7.2.2~~~nrfA~~~Cytochrome c-552~~~
MKFKLLLAGSLVAVGAMALLASNINEKEKQRVELAKAPSEAGIAGKEKSEEWAKYYPRQFDSWKKTKEYDSFTDMLAKDP
ALVIAWSGYAFSKDYNSPRGHYYALQDNVNSLRTGAPVDAKTGPLPTACWTCKSPDVPRLIEEDGELEYFTGKWAKYGSQ
IVNVIGCANCHDDKTAELKVRVPHLNRGLQAAGLKTFEESTHQDKRTLVCAQCHVEYYFKKTEWKDAKGADKTAMVVTLP
WANGVGKDGNAGVEGMIKYYDEINFSDWTHNISKTPMLKAQHPGFEFWKSGIHGQKGVSCADCHMPYTQEGSVKYSDHQV
KENPLDSMDQSCMNCHRESESKLRGIVHQKYERKEFLNKVAFDNIGKAHLETGKAIEAGASDEELKEVRKLIRHGQFKAD
MAIAAHGNYFHAPEETLRLLAAGSDDAQKARLLLVKILAKHGVMDYIAPDFDTKDKAQKLAKVDIAALAAEKMKFKQTLE
QEWKKEAKAKGRANPELYKDVDTINDGKSSWNKK
>Q9S1E5 1.7.2.2~~~nrfA~~~Cytochrome c-552~~~COG3303
MTKFKLLLAGSLVAIVSMGLLASNINEREKERVALNKTAHSQGIEGKAMSEEWARYYPRQFDSWKKTKESDNITDMLKEK
PALVVAWAGYPFSKDYNAPRGHYYALQDNINTLRTGAPVDGKTGPLPSACWTCKSPDVPRIIEQDGELEYFTGKWAKYGD
EIVNTIGCYNCHDDKSAELKSKVPYLDRGLSAAGFKTFAESTHQEKRSLVCAQCHVEYYFKKTEWKDDKGVDKTAMVVTL
PWSKGISTEQMEAYYDEINFADWTHGISKTPMLKAQHPDWELYKTGIHGQKGVSCADCHMPYTQEGAVKYSDHKVGNPLD
NMDKSCMNCHRESEQKLKDIVKQKFERKEFLQDIAFDNIGKAHLETGKAMELGATDAELKEIRTHIRHAQWRADMAIAGH
GSFFHAPEEVLRLLASGNEEAQKARIKLVKVLAKYGAIDYVAPDFETKEKAQKLAKVDMEAFIAEKLKFKQTLEQEWKKQ
AIAKGRLNPESLKGVDEKSSYYDKTKK
>P0ABL1 ~~~nrfB~~~Cytochrome c-type protein NrfB~~~COG3303
MSVLRSLLTAGVLASGLLWSLNGITATPAAQASDDRYEVTQQRNPDAACLDCHKPDTEGMHGKHASVINPNNKLPVTCTN
CHGQPSPQHREGVKDVMRFNEPMYKVGEQNSVCMSCHLPEQLQKAFWPHDVHVTKVACASCHSLHPQQDTMQTLSDKGRI
KICVDCHSDQRTNPNFNPASVPLLKEQP
>Q8X5S3 ~~~nrfG~~~Formate-dependent nitrite reductase complex subunit NrfG~~~COG4235
MKQPQIPVKMLTTLTILMVFLCIGSYLLSPKWQAVRAEYQRQRDPLHQFASQQNPEAQLQALQDKIRANPQNSEQWALLG
EYYLWQNDYSNSLLAYRQALQLRGENAELYAALATVLYYQASQHMTAQTRAMIDKALALDSNEITALMLLASDAFMQANY
AQAIELWQKVMDLNSPRINRTQLVESINMAKLLQRRSD
>Q72EF4 ~~~~~~Cytochrome c nitrite reductase subunit NrfH~~~COG3005
MSEEKSRNGPARLKLVLGGATLGVVALATVAFGMKYTDQRPFCTSCHIMNPVGVTHKLSGHANISCNDCHAPHNLLAKLP
FKAIAGARDVYMNTLGHPGDLILAGMETKEVVNANCKACHTMTNVEVASMEAKKYCTDCHRNVQHMRMKPISTREVADE
>Q9S1E6 ~~~nrfH~~~Cytochrome c-type protein NrfH~~~COG3005
MNKSKFLVYSSLVVFAIALGLFVYLVNASKALSYLSSDPKACINCHVMNPQYATWQHSSHAERASCVECHLPTGNMVQKY
ISKARDGWNHSVAFTLGTYDHSMKISEDGARRVQENCISCHASLSSTLLENADRNHQFNDPKGASERLCWECHKSVPHGK
VRSLTATPDNLGVREVK
>Q07428 ~~~nrgB~~~Nitrogen regulatory PII-like protein~~~COG0347
MSGQMFKVEIVTRPANFEKLKQELGKIGVTSLTFSNVHGCGLQKAHTELYRGVKIESNVYERLKIEIVVSKVPVDQVTET
AKRVLKTGSPGDGKIFVYEISNTINIRTGEEGPEAL
>Q02068 3.5.5.7~~~~~~Aliphatic nitrilase~~~
MSSNPELKYTGKVKVATVQAEPVILDADATIDKAIGFIEEAAKNGAEFLAFPEVWIPGYPYWAWIGDVKWAVSDFIPKYH
ENSLTLGDDRMRRLQLAARQNNIALVMGYSEKDGASRYLSQVFIDQNGDIVANRRKLKPTHVERTIYGEGNGTDFLTHDF
GFGRVGGLNCWEHFQPLSKYMMYSLNEQIHVASWPAMFALTPDVHQLSVEANDTVTRSYAIEGQTFVLASTHVIGKATQD
LFAGDDDAKRALLPLGQGWARIYGPDGKSLAEPLPEDAEGLLYAELDLEQIILAKAAADPAGHYSRPDVLSLKIDTRNHT
PVQYITADGRTSLNSNSRVENYRLHQLADIEKYENAEAATLPLDAPAPAPAPEQKSGRAKAEA
>Q03217 3.5.5.7~~~nitA~~~Aliphatic nitrilase~~~
MVEYTNTFKVAAVQAQPVWFDAAKTVDKTVSIIAEAARNGCELVAFPEVFIPGYPYHIWVDSPLAGMAKFAVRYHENSLT
MDSPHVQRLLDAARDHNIAVVVGISERDGGSLYMTQLVIDADGQLVARRRKLKPTHVERSVYGEGNGSDISVYDMPFARL
GALNCWEHFQTLTKYAMYSMHEQVHVASWPGMSLYQPEVPAFGVDAQLTATRMYALEGQTFVVCTTQVVTPEAHEFFCDN
DEQRKLIGRGGGFARIIGPDGRDLATPLAEDEEGILYADIDLSAITLAKQAADPVGHYSRPDVLSLNFNQRHTTPVNTAI
STIHATHTLVPQSGALDGVRELNGADEQRALPSTHSDETDRATASI
>P20960 3.5.5.1~~~~~~Nitrilase, arylacetone-specific~~~COG0388
MQTRKIVRAAAVQAASPNYDLATGVDKTIELARQARDEGCDLIVFGETWLPGYPFHVWLGAPAWSLKYSARYYANSLSLD
SAEFQRIAQAARTLGIFIALGYSERSGGSLYLGQCLIDDKGQMLWSRRKLKPTHVERTVFGEGYARDLIVSDTELGRVGA
LCCWEHLSPLSKYALYSQHEAIHIAAWPSFSLYSEQAHALSAKVNMAASQIYSVEGQCFTIAASSVVTQETLDMLEVGEH
NASLLKVGGGSSMIFAPDGRTLAPYLPHDAEGLIIADLNMEEIAFAKAINDPVGHYSKPEATRLVLDLGHREPMTRVHSK
SVIQEEAPEPHVQSTAAPVAVSQTQDSDTLLVQEPS
>P10045 3.5.5.1~~~bxn~~~Nitrilase, bromoxynil-specific~~~
MDTTFKAAAVQAEPVWMDAAATADKTVTLVAKAAAAGAQLVAFPELWIPGYPGFMLTHNQTETLPFIIKYRKQAIAADGP
EIEKIRCAAQEHNIALSFGYSERAGRTLYMSQMLIDADGITKIRRRKLKPTRFERELFGEGDGSDLQVAQTSVGRVGALN
CAENLQSLNKFALAAEGEQIHISAWPFTLGSPVLVGDSIGAINQVYAAETGTFVLMSTQVVGPTGIAAFEIEDRYNPNQY
LGGGYARIYGPDMQLKSKSLSPTEEGIVYAEIDLSMLEAAKYSLDPTGHYSRPDVFSVSINRQRQPAVSEVIDSNGDEDP
RAACEPDEGDREVVISTAIGVLPRYCGHS
>P82605 3.5.5.1~~~nit~~~Nitrilase~~~
MSNYPKYRVAAVQASPVLLDLDATIDKTCRLVDEAAANGAKVIAFPEAFIPGYPWWIWLGNADYGMKYYIQLYKNSVEIP
SLAVQKLSSAGTNKVYFCVSVTEKDGGSLYLTQLWFDPNGDLIGKHRKLKATNAEKTIWGDGDGSMMPVFETEFGNLGGL
QCWEHFLPLNVAAMASMNEQVHVASWPIGMPQEGHLFGPEQCVTATKYYAISNQVFCLLSSQIWTEEQRDKICETEEQRN
FMKVGHGFSKIIAPNGMEIGNKLAHDEEGITYADIDLEQIIPGKFLIDSAGHYSTPGFLSLSFDRTEKKPIKHIGESAQE
TVTYEEIQYGNKANVKVHS
>O34600 3.1.-.-~~~nrnA~~~Bifunctional oligoribonuclease and PAP phosphatase NrnA~~~COG0618
MKTELIRTISLYDTIILHRHVRPDPDAYGSQCGLTEILRETYPEKNIFAVGTPEPSLSFLYSLDEVDNETYEGALVIVCD
TANQERIDDQRYPSGAKLMKIDHHPNEDPYGDLLWVDTSASSVSEMIYELYLEGKEHGWKLNTKAAELIYAGIVGDTGRF
LFPNTTEKTLKYAGELIQYPFSSSELFNQLYETKLNVVKLNGFIFQNVSLSENGAASVFIKKDTLEKFGTTASEASQLVG
TLGNISGIRAWVFFVEEDDQIRVRFRSKGPVINGLARKYNGGGHPLASGASIYSWDEADRILADLETLCKEHE
>P75144 3.1.-.-~~~nrnA~~~Bifunctional oligoribonuclease and PAP phosphatase NrnA~~~
MNSQVHRKGSIAEAVSAIQAHDKIVIFHHIRPDGDCLGAQHGLARLIQTNFPHKQVFCVGDPKHNFPWLEMVFTPKEQIT
PELMQQALAVIVDANYKERIECRDLLDQNQFKAVLRIDHHPNEDDLNTTHNFVDASYIAAAEQVVDLAVQAKWKLSPPAA
TALYLGIYTDSNRFLYSNTSWRTLYLGSMLYRAQANIAKIHDELNHTSLKDIQFKQYVFKNFQTFQNVIYFVADKKFQKK
LKVTPLECARVNILANIEQFHIWLFFIEEGKNHYRVEFRSNGINVREVALKYGGGGHIQASGAVLKSKRDIIRVVQDCQK
QIAV
>P71615 3.1.-.-~~~nrnA~~~Bifunctional oligoribonuclease and PAP phosphatase NrnA~~~COG0618
MTTIDPRSELVDGRRRAGARVDAVGAAALLSAAARVGVVCHVHPDADTIGAGLALALVLDGCGKRVEVSFAAPATLPESL
RSLPGCHLLVRPEVMRRDVDLVVTVDIPSVDRLGALGDLTDSGRELLVIDHHASNDLFGTANFIDPSADSTTTMVAEILD
AWGKPIDPRVAHCIYAGLATDTGSFRWASVRGYRLAARLVEIGVDNATVSRTLMDSHPFTWLPLLSRVLGSAQLVSEAVG
GRGLVYVVVDNREWVAARSEEVESIVDIVRTTQQAEVAAVFKEVEPHRWSVSMRAKTVNLAAVASGFGGGGHRLAAGYTT
TGSIDDAVASLRAALG
>Q8DTN6 3.1.-.-~~~nrnA~~~Probable bifunctional oligoribonuclease and PAP phosphatase NrnA~~~COG0618
MTAFKTILAKIKAYDTIIIHRHMKPDPDALGSQVGLKEMITSNFPQKTVKVTGYNEPSLSWLAQMDDVSDKDYEGALVIV
VDTANRPRIDDQRYLNGNFLIKIDHHPDEDHYGDLSYVDTKASSASEIITDFALQNQLKLSDQAARLLYAGILGDTGRFL
YPATTSKTFIIASELLKYDFDFAALARQMDSFPYKIAKLQAYVFENLEIDKNGAARIILSQKILKKFNLTDAETSAIVSS
PGKIDTVQVWAIFVEQADGHYRVRLRSKSTVINEVAKRHAGGGHPLASGANSYSLAENEDIYQELKNLLK
>Q5SM25 3.1.-.-~~~nrnA~~~Bifunctional oligoribonuclease and PAP phosphatase NrnA~~~COG0618
MDGNAPEPRYWEKMRLVAEVLKAVEGPIYIATHVDPDGDAIGSSLGLYRALKALGKEAYWVADPPRFLRFLPKEEEYSDP
VEKLPPGATLVALDSAEPSRVVGVPVEGFVINIDHHGTNPRFGHLHVVDPSKAATAQMVKDLIDLLGVEWTAEIATPVLT
GILTDTGNFRFANTTPEVLRVAAELLGYGVKLAELTDRLQFRPPSYFRLMGQVLSTVAFHFGGLLVTAHLPEDAGAEEDS
DDFVGLIRYVEGSVVSVFLRKREEGVKVSIRSRGGVSAQNIALKLGGGGHVPAAGATLKGLDLDQAYERVLEAVREELTR
AGYL
>O31824 3.1.-.-~~~nrnB~~~Oligoribonuclease NrnB~~~COG2404
MYHLYSHNDLDGVGCGIVAKLAFGKDVEIRYNSVNGLNAQVQYFLEKAKESNRQDALFITDLAVNEENEERLNEYVHAGG
KVKLIDHHKTALHLNEHEWGFVQVEYDDGRLTSATSLLYGYLIENGFMKPTNALDQFTELVRQYDTWEWERYDQKQAKRL
NDLFFLLSIDEFEAKMIQRLSTHDEFFFDDFEEKLLDLEDEKIERYLRRKKREMVQTFVHEHCVGIVHAESYHSELGNRL
GKDNPHLDYIAILSMGSKRVSLRTIHDYIDVSEIAGRYGGGGHAKASGCSITDEVYELFVAEAFRIDPVRPDAFRNIYNL
KGSANGSLYENRAQMRFFLFPLDNEWNIQINGETQDETFAAFEEAEWFIKRNYAASLVRDEVFVAFLAENLKLANQHRK
>Q9AL95 1.18.1.1~~~nroR~~~NADH-rubredoxin oxidoreductase~~~COG1251
MKSTKILILGAGPAGFSAAKAALGKCDDITMINSEKYLPYYRPRLNEIIAKNKSIDDILIKKNDWYEKNNIKVITSEFAT
SIDPNNKLVTLKSGEKIKYEKLIIASGSIANKIKVPHADEIFSLYSYDDALKIKDECKNKGKAFIIGGGILGIELAQAII
DSGTPASIGIILEYPLERQLDRDGGLFLKDKLDRLGIKIYTNSNFEEMGDLIRSSCVITAVGVKPNLDFIKDTEIASKRG
ILVNDHMETSIKDIYACGDVAEFYGKNPGLINIANKQGEVAGLNACGEDASYSEIIPSPILKVSGISIISCGDIENNKPS
KVFRSTQEDKYIVCMLKENKIDAAAVIGDVSLGTKLKKAIDSSKSFDNISSLDAILNNL
>G8QM60 ~~~nrsf~~~Probable anti-sigma-F factor NrsF~~~COG4944
MKTEDLITMLAAGAGAVEAPSAAQRYALAIGWGAAGATLLMLALLQVRHDLGLALLLPMFWVKVGFVTCLAAGSLFAVLR
LSRPGAKTNWVPAALGLPVLGMWAIAAFTLIEAEPMERSNLFFGDTWKSCPLLIAMLSVPVFAAVLRSMKDLAPTRPRLA
GFAAGLLAGAVAAVVYCLHCPELGAPFIGFWYLLGMLIPAAVGVLLGNSMLRW
>A0A0H3CC47 ~~~nrsF~~~Anti-sigma-F factor NrsF~~~
MRTDDLIDALAADAGRGTEPAPPRRLALVAGLGGVAALLLVLGWLQARPDLGQAILGPMFWVKAIYTGLLGLAGYLAVER
LSRPGGSGRRGWIIGAVVFGACAVAGIYQAITSPDVQAALKLLHGYSWRSCSPRILVLGLPMLALGLWALRGMAPTRPGL
AGFAMGLFSGGVVATLYGLHCPEHTFTFLALWYSLGVLALGLIGGWAGRWLLRW
>P38043 ~~~nrtA~~~Nitrate/nitrite binding protein NrtA~~~COG0715
MSQFSRRKFLLTAGGTAAAALWLNACGSNNSSTDTTGSTSTPAPSGTSGGDAPEVKGVTLGFIALTDAAPVIIALEKGLF
AKYGLPDTKVVKQTSWAVTRDNLELGSDRGGIDGAHILSPMPYLLTAGTITKSQKPLPMYILARLNTQGQGISLSNEFLA
EKVQIKDPKLKAIADQKKASGKLLKAAVTFPGGTHDLWMRYWLAANGIDPNNDADLVVIPPPQMVANMQTGTMDTFCVGE
PWNARLVNKKLGYTAAVTGELWKFHPEKALTIRADWADKNPKATMALLKAVQEAQIWCEDPANLDELCQITAQDKYFKTS
VEDIKPRLQGDIDYGDGRSVKNSDLRMRFWSENASFPYKSHDLWFLTEDIRWGYLPASTDTKALIEKVNRSDLWREAAKA
IGREQDIPASDSRGVETFFDGVTFDPENPQAYLDGLKFKAIKA
>P73452 ~~~nrtA~~~Nitrate/nitrite binding protein NrtA~~~COG0715
MSNFSRSTRRKFMFTAGAAAIGGVVLHGCTSPTTTSTGTGTGSSTDQAISPLVEGENAPEVTTAKLGFIALTDAAPLIIA
KEKGFYAKYGMPDVEVLKQASWGTTRDNLVLGSASGGIDGAHILTPMPYLITMGTVTDGKPTPMYILARLNVNGQGIQLG
NNYKDLKVGTDAAPLKEAFAKVTDPKVAMTFPGGTHDMWIRYWLAAGGMEPGKDFSTIVVPPAQMVANVKVNAMESFCVG
EPWPLQTVNQGVGYQALTTGQLWKDHPEKAFGMRADWVDQNPKAAKALLMAVMEAQQWCDQAENKEEMCQILSKREWFKV
PFEDIIDRSKGIYNFGNGQETFEDQEIMQKYWVDNASYPYKSHDQWFLTENIRWGYLPASTDTKAIVDKVNREDLWREAA
QALEVPADQIPSSPSRGIETFFDGITFDPENPQAYLDSLKIKSIKA
>P38044 ~~~nrtB~~~Nitrate import permease protein NrtB~~~COG0600
MTVTLRPPSSVRRSAWVKNPKLKPFLPYVVCLPIFLAIWQVISAILGQDRLPGPINVVANTWMPYIVEPFFDNGGTSKGL
GLQILISLQRVAIGYLLAACTGILVGGVLGMSKFLGKGLDPVIQVLRTVPPLAWFPISLMVFQDANTSAIFVIFITAIWP
IIINTAVGINQIPDDYNNVARVLKLSKKDYILNILIPSTVPYVFAGLRIAVGLAWLAIVAAEMLKADGGIGYFIWDAYNA
GGDGSSSQIILAIFYVGLVGLSLDRLVAWVGRLVSPVSR
>P38045 7.3.2.4~~~nrtC~~~Nitrate import ATP-binding protein NrtC~~~COG0715
MSVFLAVDHVHQVFDLPGGGQYIALKDVSLNIRPGEFISLIGHSGCGKSTLLNLIAGLAQPSSGGIILEGRQVTEPGPDR
MVVFQNYSLLPWRTVRQNIALAVDSVLHDRNRTERRTIIEETIDLVGLRAAADKYPHEISGGMKQRVAIARGLAIRPKLL
LLDEPFGALDALTRGNLQEQLMRICQEAGVTAVMVTHDVDEALLLSDRVVMLTNGPAAQIGQILEVDFPRPRQRLEMMET
PHYYDLRNELINFLQQQRRAKRRAKAAAPAPAVAASQQKTVRLGFLPGNDCAPLAIAQELGLFQDLGLSVELQSFLTWEA
LEDSIRLGQLEGALMMAAQPLAMTMGLGGHRPFAIATPLTVSRNGGAIALSRRYLNAGVRSLEDLCQFLAATPQRLRLAI
PDPIAMPALLLRYWLASAGLNPEQDVELVGMSPYEMVEALKAGDIDGFAAGEMRIALAVQAGAAYVLATDLDIWAGHPEK
VLGLPEAWLQVNPETAIALCSALLKAGELCDDPRQRDRIVEVLQQPQYLGSAAGTVLQRYFDFGLGDEPTQILRFNQFHV
DQANYPNPLEGTWLLTQLCRWGLTPLPKNRQELLDRVYRRDIYEAAIAAVGFPLITPSQRGFELFDAVPFDPDSPLRYLE
QFEIKAPIQVAPIPLATSA
>P38046 7.3.2.4~~~nrtD~~~Nitrate import ATP-binding protein NrtD~~~COG1116
MTAILPSTAATVNTGFLHFDCVGKTFPTPRGPYVAIEDVNLSVQQGEFICVIGHSGCGKSTLLNLVSGFSQPTSGGVYLD
GQPIQEPGPDRMVVFQNYSLLPWKSARDNIALAVKAARPHLSTSEQRQVVDHHLELVGLTEAQHKRPDQLSGGMKQRVAI
ARALSIRPEVLILDEPFGALDAITKEELQEELLNIWEEARPTVLMITHDIDEALFLADRVVMMTNGPAATIGEVLEIPFD
RPREREAVVEDPRYAQLRTEALDFLYRRFAHDDD
>B2IZT6 ~~~nrtP~~~Nitrate/nitrite transporter NrtP~~~COG2223
MLKKLFSFSDRYRILHQTWFAFFLTFVCWFNFAPFATTIGKELHLAPEQIKTLGICNLALTIPARLIIGMLLDRFGPRIT
YSILLMFAVVPCLATALAQDFNQLVISRLLMGIVGSGFVVGIRMVAEWFQPKEMGIAQGIYGGWGNFGAFGAEFALPILA
ISTSFFSGGASNWRLAIALVGIITAIYGVIYYNTVQDTPRGKVYKKPKKNGSLEVTSIKSFWAMMISNFGLIFALGLLAW
RLEQKKIHFLTLSQMYLTWLVLAGLFAYQSYKAWQVNRELLTGKKTYPVSERFQFGQVALLEFTYITNFGSELAAVSMLP
AFFEKTFGLEHVVAGMIAATYPFLNLVSRPSGGLISDKFGSRKWTMTIISVGIGVSYLMAHFINSNWPIPVAIAVTMFAA
YFAQAGCGATYSIVPMIKKEATGQIAGNVGAYGNFGGVVYLTIFSLTDAPTLFSTMGIAALICAFMCAFFLKEPKGSFAP
AYEGEASETATKSSVFLTEE
>B1XLL7 ~~~nrtP~~~Nitrate/nitrite transporter NrtP~~~COG2223
MLGEMWSFNGRYKILHMTWFAFFLSFVVWFNFPPFATTIAQDFGLDKAQLGTIGLCNVALTVPARIIIGMLLDKYGPRLT
YSLLLIYAAVPCLIFATAQSFNQLVLGRLLMGIVGAGFVIGIRMVAEWFPPKDVGTAEGIYGGWGNFGSAFSAFTMVIFG
IILAFLPGAFNFGQPESFKILFFPEFNTAILNWRAAIAGTGIIAALYGMLYYFSVSDTPPGKTYHRPKSARGMEVTTKKD
FWFLLAMNLPLTLILMVLAWRLQKVNFLNGTGFAIAILALVGLYLFQTYNCWTVNKDLMTGKKRYAPEDRYEFSQVAILE
LTYIVNFGSELAVVTMLPAFFEGTFSLDKATAGIIASSYAFMNLMSRPGGGLISDKMGSRKWTMVVLTVGMGVGYLLMSS
VAGTWPLAIAVLLTMACSFFVQAAEGSTFAIVPLVKRRITGQIAGNVGAYGNVGAVAYLTVLLLLTEASAGANGGEPVMA
TVNAGFFQVLGITGLIVAFLCAFFLKEPKGSFAEFHEGETEMTATPPIEEEATY
>Q6FDK3 ~~~~~~ADPR responsive transcriptional repressor NtrR~~~COG1051
MSLRVNFSSEQAFLAQYQKSDYPSPLMTVDMAIFSVDQGQLQILLIQRSNYPQKSYWALPGGFVDLEQDQNLMACAHRKL
LEKTGIDSPYLEQVASIGNAKRDPRGWSVTVLYFALINFKAYQQQIQHSEHSEWVTLEQALKLDLAFDHHDLLQQAFARL
NNKTRYTALPISLMPPLFTLTELQNIYEIILGHNLEKKAFRRRMIESGVVEETDQSKIAGKRPAQLYRFALQDYDFNFPR
MLEYPRHHED
>P74836 1.13.11.56~~~nsaC~~~1,2-dihydroxynaphthalene dioxygenase~~~
MSSVSELGYLGMSVTDLDAWRAYAAEVAGMEVVDEGESDRIYLRMDLWHHRIALIKGDTDDLAYMGWRLGDPTEFESMVE
KLTNAGIAVTVASDAEARERRVLGLAKLTDPGGNPTEIFYGPQVDAHKPFHPGRPMFGKFVTGSEGIGHCILRQDDVEAA
AAFYRLLGLRGSVEYQLHLPNGMVAMPYFMHCNERQHSVAFGLGPMEKRINHLMFEYTELDDLGLAHDIVRERQIDVALQ
LGKHANDLALTFYCANPSGWLWEFGWGARKAPAQQEFYTRDIFGHGNEAQGYGMDVPL
>Q9X9Q7 5.99.1.4~~~nsaD~~~2-hydroxychromene-2-carboxylate isomerase~~~
MTKTIDFYFDFISPFSYLAQVKLPDLARRTGCVIEYRPIDIPEAKIAAGNYGPSNREVVPKIKVMMADLERWAAKYEVPL
TFPASFACSDWNCAALYARGQDQAEAFVTAAYHRIWGIGIDPRDQNELRGCAEDVGLDADALCEFVRSPAGQGEYRKART
QAYQRGVFGAPMMFVDDQIFWGNDRLDFLESYLLD
>Q9X9Q6 4.1.2.45~~~nsaE~~~Trans-O-hydroxybenzylidenepyruvate hydratase-aldolase~~~
MARTLMKPDDVKGAWAIIPTPAKDDASDWRATKTVDLDETARVVNGLIDAGINGILSMGTLGEAATMTHDEKLDFIKALV
DAAAGRVPIFVGTTCLNTRDTIALTRQALDIGADGTMLGVPMWCAPSVDVAVQFYKDLAEAVPEMNIAIYANPEAFKFDF
PRSFWAQVAEIPQVVTAKYIGVAHLLPDLAAIRGRIKLLPIDFDYYGAARMDESIDAFWSSGAVCDPLVTTTLRDLVSQA
RATGDWSAARAFMGRLGPTAAPLFPNGSFKEFSTYNIALEKARMNAGGWMNAGPVRPPYHLCPEPYLEGARLSGRMWAEL
GKALAAEK
>Q44244 5.1.1.-~~~Aaar~~~N-succinylamino acid racemase~~~
MKLSGVELRRVQMPLVAPFRTSFGTQSVRELLLLRAVTPAGEGWGECVTMAGPLYSSEYNDGAEHVLRHYLIPALLAAED
ITAAKVTPLLAKFKGHRMAKGALEMAVLDAELRAHERSFAAELGSVRDSVPCGVSVGIMDTIPQLLDVVGGYLDEGYVRI
KLKIEPGWDVEPVRAVRERFGDDVLLQVDANTAYTLGDAPQLARLDPFGLLLIEQPLEEEDVLGHAELARRIQTPICLDE
SIVSARAAADAIKLGAVQIVNIKPGRVGGYLEARRVHDVCAAHGIPVWCGGMIETGLGRAANVALASLPNFTLPGDTSAS
DRFYKTDITEPFVLSGGHLPVPTGPGLGVAPIPELLDEVTTAKVWIGS
>Q81IL5 5.1.1.-~~~~~~N-succinyl-L-Arg/Lys racemase~~~
MKITAIHLYAIRLPLRNPFVISYGSYSDMPSIIVKMETDEGIIGYGEGVADDHVTGESWESTFHTLKHTLTPALIGQNPM
NIEKIHDMMDNTIYGVPTAKAAIDIACFDIMGKKLNQPVYQLIGGRYHEEFPVTHVLSIADPENMAEEAASMIQKGYQSF
KMKVGTNVKEDVKRIEAVRERVGNDIAIRVDVNQGWKNSANTLTALRSLGHLNIDWIEQPVIADDIDAMAHIRSKTDLPL
MIDEGLKSSREMRQIIKLEAADKVNIKLMKCGGIYPAVKLAHQAEMAGIECQVGSMVESSVASSAGFHVAFSKKIITSVE
LTGPLKFTKDIGNLHYDVPFIRLNEKPGLGIEINEDTLQELTVFQDIVR
>Q9RYA6 5.1.1.-~~~~~~N-succinylamino acid racemase~~~COG4948
MAHTGRMFKIEAAEIVVARLPLKFRFETSFGVQTHKVVPLLILHGEGVQGVAEGTMEARPMYREETIAGALDLLRGTFLP
AILGQTFANPEAVADALGSYRGNRMARAMVEMAAWDLWARTLGVPLGTLLGGHKEQVEVGVSLGIQAGEQATVDLVRKHV
EQGYRRIKLKIKPGWDVQPVRATREAFPDIRLTVDANSAYTLADAGRLRQLDEYDLTYIEQPLAWDDLVDHAELARRIRT
PLCLDESVASAADARKALALGAGGVINLKVARVGGHAESRRVHDVAQSFGAPVWCGGMLESGIGRAHNIHLSTLPNFRLP
GDTSSASRYWERDLIQEPLEAVDGLMPVPQGPGTGVTLDREFLATVTEAQEEHRA
>Q5SJX8 5.1.1.-~~~~~~N-succinylamino acid racemase~~~COG4948
MRIEAAELRILELPLKFRFETSFGVQTKRTILLLRLFGEGLEGLGEGVMERLPLYREETVAGARYLLEEVFLPRVLGRDL
PNPEALREALAPFRGNPMAKAVLEMAFFDLWAKALGRPLWQVLGGVRQAVEVGVSLGIQPSVEDTLRVVERHLEEGYRRI
KLKIKPGWDYEVLKAVREAFPEATLTADANSAYSLANLAQLKRLDELRLDYIEQPLAYDDLLDHAKLQRELSTPICLDES
LTGAEKARKAIELGAGRVFNVKPARLGGHGESLRVHALAESAGIPLWMGGMLEAGVGRAHNLHLATLPGFTKPGDVSSAS
RYWEEDIVEEALEAKDGLMPVPEGVGIGVHLKLPFVERVTLWQRYMSAS
>P52391 2.1.1.230~~~nshR~~~23S rRNA (adenosine(1067)-2'-O)-methyltransferase~~~
MTEPAIITNASDPAVQRIIDVTKHSRASIKTTLIEDTEPLMECIRAGVQFIEVYGSSGTPLDPALLDLCRQREIPVRLID
VSIVNQLFKAERKAKVFGIARVPRPARLADIAERGGDVVVLDGVKIVGNIGAIVRTSLALGAAGIVLVDSDLATIADRRL
LRASRGYVFSLPVVLADREEAVSFLRDNDIALMVLDTDGDLGVKDLGDRADRMALVFGSEKGGPSGLFQEASAGTVSIPM
LSSTESLNVSVSVGIALHERSARNFAVRRAAAQA
>A8FNH9 4.1.1.96~~~nspC~~~Carboxynorspermidine/carboxyspermidine decarboxylase~~~
MFYEKIQTPAYILEEDKLRKNCELLASVGEKSGAKVLLALKGFAFSGAMKIVGEYLKGCTCSGLWEAKFAKEYMDKEIHT
YSPAFKEDEIGEIASLSHHIVFNSLAQFHKFQSKTQKNSLGLRCNVEFSLAPKELYNPCGRYSRLGIRAKDFENVDLNAI
EGLHFHALCEESADALEAVLKVFKEKFGKWIGQMKWVNFGGGHHITKKGYDVEKLIALCKNFSDKYGVQVYLEPGEAVGW
QTGNLVASVVDIIENEKQIAILDTSSEAHMPDTIIMPYTSEVLNARILATRENEKISDLKENEFAYLLTGNTCLAGDVMG
EYAFDKKLKIGDKIVFLDQIHYTIVKNTTFNGIRLPNLMLLDHKNELQMIREFSYKDYSLRN
>Q5QCP2 4.1.1.96~~~nspC~~~Carboxynorspermidine/carboxyspermidine decarboxylase~~~COG0019
MISTPYYLIDKSALLRNLQVIDQVRERSGAKVLLALKCFATWSVFDLMQQYMDGTTSSSLYEVKLGHQKFGGETHAYSVA
FADHEIDEVVAHCDKIIFNSISQFQRFSSHAGNKPKGLRLNPGVSCASFDLADPARPFSRLGESDPARILSIIDQLDGVM
IHNNCENRDFERFDALLTEVEQRYGEILHRLSWVSLGGGISFTTPGYSIDAFCERLRRFAQTYDVQVYLEPGEATVRDTT
TLEVSVVDIGFNGKNLAVVDSSTEAHMLDLLIYRETAPIKNAQGDHAYQICGKTCLAGDIFGEARFEQPLQIGDRISIGD
AGGYTMVKKNWFNGVHMPAIAIKEADGSVRAVREFSFDDYVSSLS
>Q56575 4.1.1.96~~~nspC~~~Carboxynorspermidine/carboxyspermidine decarboxylase~~~COG0019
MQQNELKTPYFSINEDKLIENLEKAKQLKDISGVKLVLALKCFSTWGVFDIIKPYLDGTTSSGPFEVKLGYETFGGETHA
YSVGYSEDDVRDVADICDKMIFNSQSQLAAYRHIVEGKASIGLRLNPGVSYAGQDLANPARQFSRLGVQADHIKPEIFDG
IDGVMFHMNCENKDVDAFIGLLDAISAQFGEYLDKLDWVSMGGGVFFTWPGYDIEKLGLALKAFAEKHGVQMYLEPGERI
ITKTTDLVVTVVDIVENVKKTAIVDSATEAHRLDTLIYNEPASILEASENGEHEYVIGSCSCLAGDQFCVANFEQPLEIG
QRLHILDSAGYTMVKLNWFNGLRMPSVYCERSNGDIQKLNEFDYSDFKRSLSQWSVI
>Q8D8D2 4.1.1.96~~~nspC~~~Carboxynorspermidine/carboxyspermidine decarboxylase~~~
MNKEQLKTPFFMIDEAKLIQNLEIAKQLKEISGVKLVLALKCFSTWGVFDIIKPYLDGTTSSGPFEVKLGYEKFGGETHA
YSVGYSEDDVREVADLCDKIIFNSQSQLAAHRHIVEGKASIGLRLNPGVSYASQDLANPARQFSRLGVQADHIDPAVFDS
INGVMFHMNCENKDVDAFIALLDSISERFGAYLNKLDWVSMGGGVFFTWPGYDVEKLGLALKAFSEKHGVQMYLEPGEAI
ITKTTDLVVTVVDLVENGMKTAIVDSATEAHRLDTLIYKEPASVLEASENGEHEYVIGSCSCLAGDQFCVAKFDQPLHVG
QRLHILDSAGYTMVKLNWFNGLKMPSVYCERTNGEIQKLNEFGYEDFKRSLSLWSVQ
>Q9L132 ~~~nsrR~~~HTH-type transcriptional repressor NsrR~~~COG1959
MRLTKFTDLALRSLMRLAVVRDGDEPLATREVAEVVGVPYTHAAKAITRLQHLGVVEARRGRGGGLTLTDLGRRVSVGWL
VRELEGEAEVVDCEGDNPCPLRGACRLRRALRDAQEAFYAALDPLTVTDLVAAPTGPVLLGLTDRPSG
>P54989 1.14.14.10~~~ntaA~~~Nitrilotriacetate monooxygenase component A~~~
MGANKQMNLGFLFQISGVHYGGWRYPSAQPHRATDIQYYAEIVRTAERGKLDFCFLADSIAAYEGSADQQDRSKDALMAA
EPKRLLEPFTLLAALAMVTEHIGLVTTATTTYNEPYTMARLFASLDHITNGRAGWNVVTSANLAEAHNFGRDGHVEHGDR
YARAEEFINVVFKLWDSIEDGAYLRDKLAGRYGLSEKIHFINHIGEHFKVRGPLNVPRPPQGHPVIVQAGSSHPGKELAA
RTAEVVFTAQQTLADGKAFYSDVKGRMAKYGRSSENLKVLPGVVVYVAETESEAKAKYETVSNLVPPDFGLFMLSDLLGE
IDLKQFDIDGPLPEDLPEAKGSQSRREVIINLARRENLTIRQLYQRVSGASGHRSIWGTPKQIADQFEQWVYEEAADGFN
ILPPYLPESMNDFVNFVVPELQRRGIFRTEYEGSTLRDHLGLARPKNSVAKPS
>P54990 1.5.1.42~~~ntaB~~~FMN reductase (NADH) NtaB~~~
MADQIRSATEGGDPTSDPKGFRRALGTFPTGVTIVTAPGVDGPAGVTANSFASVSLDPPLVLWSIGHTSRSHSKFQQSAT
FAINILADDQVGVSQVFAGGSADKFSLVDWHTGRTGAPLIDNALAYFDCVCEARHEGGDHTIMIGRVVDFGRAEGSPLAF
SQGRYGVTLDHPEAAKARDHKSEEYGLDDLPFLSLIAKAHYKEDADLEEQRSAAGCTPVGSKILAGLYGSAPLTADELAR
RMYLDRREVVDSLNEFVADGHVESCDSGRFALTESGKQRRRRMIEYVSRYQDEQLASISRSDLGVATRVLQAFLAGPGRG
SS
>P0A4U6 ~~~ntcA~~~Global nitrogen regulator~~~COG0664
MIVTQDKALANVFRQMATGAFPPVVETFERNKTIFFPGDPAERVYFLLKGAVKLSRVYEAGEEITVALLRENSVFGVLSL
LTGNKSDRFYHAVAFTPVELLSAPIEQVEQALKENPELSMLMLRGLSSRILQTEMMIETLAHRDMGSRLVSFLLILCRDF
GVPCADGITIDLKLSHQAIAEAIGSTRVTVTRLLGDLREKKMISIHKKKITVHKPVTLSRQFT
>P29283 ~~~ntcA~~~Global nitrogen regulator~~~COG0664
MLANENSLLTMFRELGSGKLPLQIEQFERGKTIFFPGDPAERVYLLVKGAVKLSRVYESGEEITVALLRENSVFGVLSLL
TGQRSDRFYHAVAFTPVQLFSVPIEFMQKALIERPELANVMLQGLSSRILQTEMMIETLAHRDMGSRLVSFLLILCRDFG
IPSPDGITIDLKLSHQAIAEAIGSTRVTVTRLLGDLRESKLIAIHKKRITVFNPVALSQQFS
>O07566 2.6.1.104~~~ntdA~~~3-oxo-glucose-6-phosphate:glutamate aminotransferase~~~COG0399
MQKQVKISGKSKENMSLLKHLKGDVQGKELVIEDSIVNERWKQVLKEKIDIEHDLFNYQKNREISKVPFLPVDRLITNDE
VEDILNTLTEVLPTGKFTSGPYLEQFEKVLSTYLHKRYVIATSSGTDAIMIGLLALGLNPGDEVIMPANSFSATENAVLA
SGGVPIYVDINPQTFCIDPDKIEEAITPYTKFILPVHLYGKHSDMQHIRQIANRYKLKVIEDACQGIGLTDLGKYADITT
LSFNPYKNFGVCGKAGAIATDNEELAKKCIQFSYHGFEVNVKNKKVINFGFNSKMDNLQAAIGLERMKYLSLNNFKRLFL
ADRYITQLAELQNKGYIELPELSEDHVWHLFPIKVRTEDRADIMTKLNEDFGVQTDVYYPILSHMQKTPLVQDKYAGLQL
VHTEKAHSQVLHLPLYPSFTLEEQDRVMEGLFHVIKQEIGV
>O07565 3.1.3.92~~~ntdB~~~Kanosamine-6-phosphate phosphatase~~~COG0561
MLLSKKSEYKTLSTVEHPQYIVFCDFDETYFPHTIDEQKQQDIYELEDYLEQKSKDGELIIGWVTGSSIESILDKMGRGK
FRYFPHFIASDLGTEITYFSEHNFGQQDNKWNSRINEGFSKEKVEKLVKQLHENHNILLNPQTQLGKSRYKHNFYYQEQD
EINDKKNLLAIEKICEEYGVSVNINRCNPLAGDPEDSYDVDFIPIGTGKNEIVTFMLEKYNLNTERAIAFGDSGNDVRML
QTVGNGYLLKNATQEAKNLHNLITDSEYSKGITNTLKKLIGS
>O07564 1.1.1.361~~~ntdC~~~Glucose-6-phosphate 3-dehydrogenase~~~COG0673
MKKIGIIGAGGIARAHATALSTIKNAELVGVYDINQQNAESFVKTFGGKSFENVDELIDASEGLIVASPNFCHKEHALQA
LGKHKHVLCEKPMAISLEEASIMKDTAERLSVRASMGFNYRYLSYVNILKSLIINNELGNILSIKVHFKKNSALRRKKFT
WRDDANSKKTSGSLGDLGIHLIDMVWYLFESDFITESVRAKMNTNVKTKEDKQVLVDDYAEIYGQLKNKVFVNIITSKCS
VPEDCGFSIEVVGHKKEFKYHTGNPHVYKLIDGLNVVDCPVPQSLLNDPPNEFYGWADSFRSELINWIASTQNDWVEIPS
FSDGFRSQEVLEMFFEKDSNSQPMSVSAVN
>Q9R5V5 2.4.2.6~~~ntd~~~Nucleoside deoxyribosyltransferase~~~
MPKKTIYFGAGWFTDRQNKAYKEAMEALKENPTIDLENSYVPLDNQYKGIRVDEHPEYLHDKVWATATYNNDLNGIKTND
IMLGVYIPDEEDVGLGMELGYALSQGKYVLLVIPDEDYGKPINLMSWGVSDNVIKMSQLKDFNFNKPRFDFYEGAVY
>A0A5P3XKL4 ~~~ntnh~~~Non-toxic nonhemagglutinin~~~
MDIIDNVDITLPENGEDIVIVGGRRYDYNGDLAKFKAFKVAKHIWVVPGRYYGEKLDIQDGEKINGGIYDKDFLSQNQEK
QEFMDGVILLLKRINNTLEGKRLLSLITSAVPFPNEDDGIYKQNNFILSDKTFKAYTSNIIIFGPGANLVENKVIAFNSG
DAENGLGTISEICFQPLLTYKFGDYFQDPALDLLKCLIKSLYYLYGIKVPEDFTLPYRLTNNPDKTEYSQVNMEDLLISG
GDDLNAAGQRPYWLWNNYFIDAKDKFDKYKEIYENQMKLDPNLEINLSNHLEQKFNINISELWSLNISNFARTFNLKSPR
SFYKALKYYYRKKYYKIHYNEIFGTNYNIYGFIDGQVNASLKETDLNIINKPQQIINLIDNNNILLIKSYIYDDELNKID
YNFYNNYEIPYNYGNSFKIPNITGILLPSVNYELIDKIPKIAEIKPYIKDSTPLPDSEKTPIPKELNVGIPLPIHYLDSQ
IYKGDEDKDFILSPDFLKVVSTKDKSLVYSFLPNIVSYFDGYDKTKISTDKKYYLWIREVLNNYSIDITRTENIIGIFGV
DEIVPWMGRALNILNTENTFETELRKNGLKALLSKDLNVIFPKTKVDPIPTDNPPLTIEKIDEKLSDIYIKNKFFLIKNY
YITIQQWWICCYSQFLNLSYMCREAIINQQNLIEKIILNQLSYLARETSINIETLYILSVTTEKTIEDLREISQKSMNNI
CNFFERASVSIFHTDIYNKFIDHMKYIVDDANTKIINYINSNSNITQEEKNYLINKYMLTEEDFNFFNFDKLINLFNSKI
QLTIKNEKPEYNLLLSINQNESNENITDISGNNVKISYSNNINILDGRNEQAIYLDNDSQYVDFKSKNFENGVTNNFTIS
FWMRTLEKVDTNSTLLTSKLNENSAGWQLDLRRNGLVWSMKDHNKNEINIYLNDFLDISWHYIVVSVNRLTNILTVYIDG
ELSVNRNIEEIYNLYSDVGTIKLQASGSKVRIESFSILNRDIQRDEVSNRYINYIDNVNLRNIYGERLEYNKEYEVSNYV
YPRNLLYKVNDIYLAIERGSNSSNRFKLILININEDKKFVQQKDIVIIKDVTQNKYLGISEDSNKIKLVDRNNALELILD
NHLLNPNYTTFSTKQEEYLRLSNIDGIYNWVIKDVSRLNDIYSWTLI
>Q08636 7.2.2.1~~~ntpA~~~V-type sodium ATPase catalytic subunit A~~~COG1155
MQIGKIIKVSGPLVMAENMSEASIQDMCLVGDLGVIGEIIEMRQDVASIQVYEETSGIGPGEPVRSTGEALSVELGPGII
SQMFDGIQRPLDTFMEVTQSNFLGRGVQLPALDHEKQWWFEATIEEGTEVSAGDIIGYVDETKIIQHKIMVPNGIKGTVQ
KIESGSFTIDDPICVIETEQGLKELTMMQKWPVRRGRPIKQKLNPDVPMITGQRVIDTFFPVTKGGAAAVPGPFGAGKTV
VQHQIAKWSDVDLVVYVGCGERGNEMTDVVNEFPELIDPNTGESLMERTVLIANTSNMPVAAREASIYTGITIAEYFRDM
GYDVAIMADSTSRWAEALREMSGRLEEMPGDEGYPAYLGSRLAEYYERSGRVIALGSDQREGSITAISAVSPSGGDISEP
VTQNTLRVVKVFWGLDSSLAQKRHFPSINWIQSYSLYSTEVGRYMDQILQQDWSDMVTEGMRILQEEEQLNEIVRLVGID
SLSDNDRLTLEVAKSIREDYLQQNAFDDVDTFTSREKQFNMLKVILTFGKEARKALSLGAYFNEIMEGTVAVRERISRSK
YIPEEELAKISSINEEIKETIQLIVSEGGMTDD
>Q08637 ~~~ntpB~~~V-type sodium ATPase subunit B~~~COG1156
MIKEYRTIKEVVGPLMAVEKVSGVKYEELIEVRMQNGEIRRGQVLEVQEDKAMVQIFEGTSGINLKNSSVRFLGHPLQLG
VSEDMIGRVFDGLGRPKDNGPEILPEKYLDINGEVINPIARDYPDEFIQTGISAIDHLNTLVRGQKLPVFSGSGLPHKEL
AAQIARQATVLDSSDDFAVVFAAIGITFEEAEFFMEDFRQTGAIDRSVMFMNLANDPAIERIATPRMALTAAEYLAYEKG
MHVLVIMTDMTNYAEALREISAARREVPGRRGYPGYLYTNLATLFERAGRIRGLKGSVTQIPILTMPEDDKTHPIPDLTG
YITEGQIILTRELYKSGIQPPIDVLPSLSRLKDKGTGAGKTREDHAATMNQLFAAYAQGKQAKELAVVLGESALSDIDKI
YAKFAERFENEYVNQGFYTNRTITETLDLGWELLAMLPRTELKRIKDDLLDKYLPEGK
>P43456 ~~~ntpC~~~V-type sodium ATPase subunit C~~~COG1527
MEYHELNPLIRGRELELISKDTFEQMIQTDSIDSLGEILQSTIYQPYIYDGFDKDFEANLSQERSKLFQWLKESAPEPEI
VWIYTMRYTFHNLKVLTKAEITGQNLDHLYIHDGFYSLEVLKDAIHTQVSVELPDSLMDYIREVHEYCEESTILQGIDVI
YDRCFLTEQRRLGEQLGYPELLEEIIAFIDLTNITTTARGILQHRSAGFMTTVISSSGSIPKDTLLSFVRGEMASFTQFL
LTTDYSELLKQVIHEEQIDLVSLEQLKDDYLSSFYQVAQTQAFGPLPLLAFLNAKEVESKNLRLLIIGKRNHFSLEQLKE
RMRQVYDL
>P43435 ~~~ntpD~~~V-type sodium ATPase subunit D~~~COG1394
MRLNVNPTRMELTRLKKQLTTATRGHKLLKDKQDELMRQFILLIRKNNELRQAIEKETQTAMKDFVLAKSTVEEAFIDEL
LALPAENVSISVVEKNIMSVKVPLMNFQYDETLNETPLEYGYLHSNAELDRSIDGFTQLLPKLLKLAEVEKTCQLMAEEI
EKTRRRVNALEYMTIPQLEETIYYIKMKLEENERAEVTRLIKVKNMGTEE
>O34313 ~~~yfkN~~~Trifunctional nucleotide phosphoesterase protein YfkN~~~COG0737
MRIQKRRTHVENILRILLPPIMILSLILPTPPIHAEESAAPQVHLSILATTDIHANMMDYDYYSDKETADFGLARTAQLI
QKHREQNPNTLLVDNGDLIQGNPLGEYAVKYQKDDIISGTKTHPIISVMNALKYDAGTLGNHEFNYGLDFLDGTIKGADF
PIVNANVKTTSGENRYTPYVINEKTLIDENGNEQKVKVGYIGFVPPQIMTWDKKNLEGQVQVQDIVESANETIPKMKAEG
ADVIIALAHTGIEKQAQSSGAENAVFDLATKTKGIDAIISGHQHGLFPSAEYAGVAQFNVEKGTINGIPVVMPSSWGKYL
GVIDLKLEKADGSWKVADSKGSIESIAGNVTSRNETVTNTIQQTHQNTLEYVRKPVGKTEADINSFFAQVKDDPSIQIVT
DAQKWYAEKEMKDTEYKNLPILSAGAPFKAGGRNGANYYTNIPAGDLAIKNVGDLYLYDNTVQIVKLTGSEVKDWLEMSA
GQFNQIDPAKGGDQALLNENFRSYNFDVIDGVTYQVDVTKPAKYNENGKVINADSSRIINLSYEGKPISPSQEFLVVTNN
YRASGGGGFPHLTSDKIVHGSAVENRQVLMDYIIEQKTVNPKADNNWSIAPVSGTNLTFESSLLAKPFADKADDVAYVGK
SANEGYGVYKLQFDDDSNPDPPKDGLWDLTVMHTNDTHAHLDDAARRMTKINEVRSETNHNILLDAGDVFSGDLYFTKWN
GLADLKMMNMMGYDAMTFGNHEFDKGPTVLSDFLSGNSATVDPANRYHFEAPEFPIVSANVDVSNEPKLKSFVKKPQTFT
AGEKKEAGIHPYILLDVDGEKVAVFGLTTEDTATTSSPGKSIVFNDAFETAQNTVKAIQEEEKVNKIIALTHIGHNRDLE
LAKKVKGIDLIIGGHTHTLVDKMEVVNNEEPTIVAQAKEYGQFLGRVDVAFDEKGVVQTDKSNLSVLPIDEHTEENPEAK
QELDQFKNELEDVKNEKVGYTDVALDGQREHVRTKETNLGNFIADGMLAKAKEAAGARIAITNGGGIRAGIDKGDITLGE
VLNVMPFGNTLYVADLTGKQIKEALEQGLSNVENGGGAFPQVAGIEYTFTLNNKPGHRVLEVKIESPNGDKVAINTDDTY
RVATNNFVGAGGDGYSVFTEASHGEDLGYVDYEIFTEQLKKLGNKVSPKVEGRIKEVFLPTKQKDGSWTLDEDKFAIYAK
NANTPFVYYGIHEGSQEKPINLKVKKDQVKLLKERESDPSLTMFNYWYSMKMPMANLKTADTAIGIKSTGELDVSLSDVY
DFTVKQKGKEIKSFKEPVQLSLRMFDIEEAHNPAIYHVDRKKKAFTKTGHGSVDDDMVTGYTNHFSEYTILNSGSNNKPP
AFPSDQPTGGDDGNHGGGSDKPGGKQPTDGNGGNDTPPGTQPTNGSGGNGSGGSGTDGPAGGLLPDTATSMYSILLAGFL
ISALGTAMYLHQRRKQNRANQA
>P43455 ~~~ntpG~~~V-type sodium ATPase subunit G~~~COG1436
MTYKIGVVGDKDSVSPFRLFGFDVQHGTTKTEIRKTIDEMAKNEYGVIYITEQCANLVPETIERYKGQLTPAIILIPSHQ
GTLGIGLEEIQNSVEKAVGQNIL
>P43440 ~~~ntpJ~~~Potassium/sodium uptake protein NtpJ~~~COG0168
MTIMKKRVRKRLSPVQLIAAGFFILILFGGSLLTLPFFSRSGESTHFIDALFTATSAVCVTGLTTLNTAEHWNSAGQFLI
MTLIEIGGLGFMMIPILFFAIAKKKISFSMRIVLKEALNLEEMSGVIKLMIYILKFAVVIQVIGAVALSVVFIPEFGWAK
GIWFSIFHAVSSFCNAGFDLLGDSLLADQTNVYLIMVVSALIIAGGLGFIVWRDILSYHRVKKITLHSKVALSVTALLLI
GGFILFLITERNGLTLVKGTFTERLANTFFMSVTPRTAGYYSIDYLQMSHAGLILTMFLMYIGGTSGSTAGGLKTTTLGI
LLIQMHAMFKGKTRAEAFGRTIRQAAVLRALTLFFVTLSLCVVAIMVLSVTETIPKTSGIEYIAFEVFSAFGTVGLTMGL
TPDLTLIGKLVIISLMYIGRVGIMTVVFSLLVKANRAEANYKYPEESIMLG
>P43457 ~~~ntpK~~~V-type sodium ATPase subunit K~~~COG0636
MMDYLITQNGGMVFAVLAMATATIFSGIGSAKGVGMTGEAAAALTTSQPEKFGQALILQLLPGTQGLYGFVIAFLIFINL
GSDMSVVQGLNFLGASLPIAFTGLFSGIAQGKVAAAGIQILAKKPEHATKGIIFAAMVETYAILGFVISFLLVLNA
>Q02169 3.6.1.9~~~maf~~~dTTP/UTP pyrophosphatase~~~COG0424
MTKPLILASQSPRRKELLDLLQLPYSIIVSEVEEKLNRNFSPEENVQWLAKQKAKAVADLHPHAIVIGADTMVCLDGECL
GKPQDQEEAASMLRRLSGRSHSVITAVSIQAENHSETFYDKTEVAFWSLSEEEIWTYIETKEPMDKAGAYGIQGRGALFV
KKIDGDYYSVMGLPISKTMRALRHFDIRA
>P25536 3.6.1.9~~~yhdE~~~dTTP/UTP pyrophosphatase~~~COG0424
MTSLYLASGSPRRQELLAQLGVTFERIVTGIEEQRQPQESAQQYVVRLAREKARAGVAQTAKDLPVLGADTIVILNGEVL
EKPRDAEHAAQMLRKLSGQTHQVMTAVALADSQHILDCLVVTDVTFRTLTDEDIAGYVASDEPLDKAGAYGIQGLGGCFV
RKINGSYHAVVGLPLVETYELLSNFNALREKRDKHDG
>P0A729 3.6.1.-~~~yceF~~~7-methyl-GTP pyrophosphatase~~~COG0424
MPKLILASTSPWRRALLEKLQISFECAAPEVDETPRSDESPRQLVLRLAQEKAQSLASRYPDHLIIGSDQVCVLDGEITG
KPLTEENARLQLRKASGNIVTFYTGLALFNSANGHLQTEVEPFDVHFRHLSEAEIDNYVRKEHPLHCAGSFKSEGFGITL
FERLEGRDPNTLVGLPLIALCQMLRREGKNPLMG
>P58627 3.6.1.-~~~yceF~~~7-methyl-GTP pyrophosphatase~~~
MPRLILASTSPWRRALLEKLTIPFECAAPDVDETPMPGEAPRQLVLRLAQAKAQSLAARFPNHLIIGSDQICVLDGEITG
KPLTEEKARQQLAKASGNIVTFYTGLALYNSASGHLQTEVEPFDVHFRHLSEAEIDDYVRKEHPLHCAGSFKSEGLGIAL
FERLEGRDPNTLIGLPLIALCQMLRREGFNPLQQ
>P9WK27 3.6.1.9~~~~~~Nucleoside triphosphate pyrophosphatase~~~COG0424
MTRLVLGSASPGRLKVLRDAGIEPLVIASHVDEDVVIAALGPDAVPSDVVCVLAAAKAAQVATTLTGTQRIVAADCVVVA
CDSMLYIEGRLLGKPASIDEAREQWRSMAGRAGQLYTGHGVIRLQDNKTVYRAAETAITTVYFGTPSASDLEAYLASGES
LRVAGGFTLDGLGGWFIDGVQGNPSNVIGLSLPLLRSLVQRCGLSVAALWAGNAGGPAHKQQ
>O67322 3.6.1.15~~~~~~Nucleoside-triphosphatase THEP1~~~COG1618
MKIIITGEPGVGKTTLVKKIVERLGKRAIGFWTEEVRDPETKKRTGFRIITTEGKKKIFSSKFFTSKKLVGSYGVNVQYF
EELAIPILERAYREAKKDRRKVIIIDEIGKMELFSKKFRDLVRQIMHDPNVNVVATIPIRDVHPLVKEIRRLPGAVLIEL
TPENRDVILEDILSLLER
>P0AFB5 2.7.13.3~~~glnL~~~Sensory histidine kinase/phosphatase NtrB~~~COG3852
MATGTQPDAGQILNSLINSILLIDDNLAIHYANPAAQQLLAQSSRKLFGTPLPELLSYFSLNIELMQESLEAGQGFTDNE
VTLVIDGRSHILSVTAQRMPDGMILLEMAPMDNQRRLSQEQLQHAQQVAARDLVRGLAHEIKNPLGGLRGAAQLLSKALP
DPSLLEYTKVIIEQADRLRNLVDRLLGPQLPGTRVTESIHKVAERVVTLVSMELPDNVRLIRDYDPSLPELAHDPDQIEQ
VLLNIVRNALQALGPEGGEIILRTRTAFQLTLHGERYRLAARIDVEDNGPGIPPHLQDTLFYPMVSGREGGTGLGLSIAR
NLIDQHSGKIEFTSWPGHTEFSVYLPIRK
>P0AFB8 ~~~glnG~~~DNA-binding transcriptional regulator NtrC~~~COG2204
MQRGIVWVVDDDSSIRWVLERALAGAGLTCTTFENGAEVLEALASKTPDVLLSDIRMPGMDGLALLKQIKQRHPMLPVII
MTAHSDLDAAVSAYQQGAFDYLPKPFDIDEAVALVERAISHYQEQQQPRNVQLNGPTTDIIGEAPAMQDVFRIIGRLSRS
SISVLINGESGTGKELVAHALHRHSPRAKAPFIALNMAAIPKDLIESELFGHEKGAFTGANTIRQGRFEQADGGTLFLDE
IGDMPLDVQTRLLRVLADGQFYRVGGYAPVKVDVRIIAATHQNLEQRVQEGKFREDLFHRLNVIRVHLPPLRERREDIPR
LARHFLQVAARELGVEAKLLHPETEAALTRLAWPGNVRQLENTCRWLTVMAAGQEVLIQDLPGELFESTVAESTSQMQPD
SWATLLAQWADRALRSGHQNLLSEAQPELERTLLTTALRHTQGHKQEAARLLGWGRNTLTRKLKELGME
>P03029 ~~~ntrC~~~DNA-binding transcriptional regulator NtrC~~~
MQRGIAWIVDDDSSIRWVLERALTGAGLSCTTFESGNEVLDALTTKTPDVLLSDIRMPGMDGLALLKQIKQRHPMLPVII
MTAHSDLDAAVSAYQQGAFDYLPKPFDIDEAVALVDRAISHYQEQQQPRNAPINSPTADIIGEAPAMQDVFRIIGRLSRS
SISVLINGESGTGKELVAHALHRHSPRAKAPFIALNMAAIPKDLIESELFGHEKGAFTGANTVRQGRFEQADGGTLFLDE
IGDMPLDVQTRLLRVLADGQFYRVGGYAPVKVDVRIIAATHQNLELRVQEGKFREDLFHRLNVIRVHLPPLRERREDIPR
LARHFLQIAARELGVEAKQLHPETEMALTRLAWPGNVRQLENTCRWLTVMAAGQEVLTQDLPSELFETAIPDNPTQMLPD
SWATLLGQWADRALRSGHQNLLSEAQPEMERTLLTTALRHTQGHKQEAARLLGWGRNTLTRKLKELGME
>P10577 ~~~ntrC~~~DNA-binding transcriptional regulator NtrC~~~COG2204
MTGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDLLPRIKKARPDLPVLV
MSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTD
LTLMITGESGTGKELVARALHDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLDEI
GDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYRLNVVPLRLPPLRDRAEDIPDL
VRHFVQQAEKEGLDVKRFDQEALELMKAHPWPGNVRELENLVRRLTALYPQDVITREIIENELRSEIPDSPIEKAAARSG
SLSISQAVEENMRQYFASFGDALPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRELGVSVYRS
SRSA
>P41789 ~~~glnG~~~DNA-binding transcriptional regulator NtrC~~~
MQRGIVWVVDDDSSIRWVLERALAGAGLTCTTFENGNEVLAALASKTPDVLLSDIRMPGMDGLALLKQIKQRHPMLPVII
MTAHSDLDAAVSAYQQGAFDYLPKPFDIDEAVALVERAISHYQEQQQPRNIEVNGPTTDMIGEAPAMQDVFRIIGRLSRS
SISVLINGESGTGKELVAHALHRHSPRAKAPFIALNMAAIPKDLIESELFGHEKGAFTGANTIRQGRFEQADGGTLFLDE
IGDMPLDVQTRLLRVLADGQFYRVGGYAPVKVDVRIIAATHQNLERRVQEGKFREDLFHRLNVIRIHLPPLRERREDIPR
LARHFLQVAARELGVEAKLLHPETETALTRLAWPGNVRQLENTCRWLTVMAAGQEVLIQDLPGELFEASTPDSPSHLPPD
SWATLLAQWADRALRSGHQNLLSEAQPELERTLLTTALRHTQGHKQEAARLLGWGRNTLTRKLKELGME
>Q8DL32 7.1.1.-~~~ndhA~~~NAD(P)H-quinone oxidoreductase subunit 1~~~COG1005
MESGIDLQGQFISALQSLGLSHDLAKLLWLPLPMLMMLIVATVGVLVAVWLERKISAAVQQRIGPEYIGPLGILAPLADG
LKLIFKEDVLPANSDRWLFTLGPAVVVIPVFLSYIIVPFGQNLLISNLAMGVFLWIALSSIAPIGLLMAGYASNNKYSLL
GGLRAAAQSISYEIPLALAVLAVAMMSNGLGTVEIVEQQSQYGILSWNVWRQPIGFLVFWIAALAECERLPFDLPEAEEE
LVAGYQTEYAGMKFALFYLGAYVNLVLSALLVSVLYFGGWSFPIPLETIANLLGVSETNPFLQIAFAVLGITMTLIKAYF
FVFLAILLRWTVPRVRIDQLLDLGWKFLLPVGLVNLLLTAGLKLAFPVAFGG
>Q8DMR6 7.1.1.-~~~ndhB~~~NAD(P)H-quinone oxidoreductase subunit 2~~~COG1007
MDLVTLAGQLNAGTILPETILIVTLLVVLLADLIQGRQADRWTPYFAIVGLGGAIATMIPLWTQPATISFFGSFISDHLS
LFFRGLIALSALGTILMSIRYVEQTGSSLGEFMTILLTATVGGMFIAGAQELVFIFVALETLSIASYLLTGYTKRDSRSN
EAALKYLLIGAASSAIFLYGSSLLYGLSGGHTQLPAIAQALSSESLGLVVALVFVIAGISFKISAVPFHQWTPDVYEGAP
TPVVAFLSVGSKAAGFALAIRFLTLAFPSVTDQWQLIFTVLAILSMILGNVVALAQTSMKRMLAYSSIGQAGFVMIGFVV
GTEAGYASMLFYLLVYLFMNLGAFTCVILFSLRTGTDQISEYAGLYQKDPLLTLGLSLCLLSLGGIPPLAGFFGKIYLFW
AGWQAGAYGLVLLGLLTSVISIYYYIRVVKMMVVKEPQEMSEAVRNYPEVSWSSFGLRPLQVGLVMTVIATSLAGILANP
LFNLVNTAVWDVPQLANQPTVMEVAYQALSPAGKS
>P19045 7.1.1.-~~~ndhC~~~NAD(P)H-quinone oxidoreductase subunit 3~~~COG0838
MFVLTGYEYFLGFLFICSLVPVLALTASKLLRPRDGGPERQTTYESGMEPIGGAWIQFNIRYYMFALVFVVFDVETVFLY
PWAVAFNQLGLLAFVEALIFIAILVVALVYAWRKGALEWS
>Q8DJ02 7.1.1.-~~~ndhC~~~NAD(P)H-quinone oxidoreductase subunit 3~~~COG0838
MVAIPRLRDTATVFVLSGYEYFLGFLIICSLVPVLALAASALLRPKSGRMIRLTTYESGMEPIGGAWIQFNVRYYMFALV
FVIFDVETVFLYPWAVAFHQLGLLAFIEALIFIAILVVALVYAWRKRALEWS
>Q8DKY0 7.1.1.-~~~ndhD1~~~NAD(P)H-quinone oxidoreductase chain 4 1~~~COG1008
MSTFPWLTTIILFPIVAALAIPFIPDPTGKGRPIRWYALAVGLIDFALIVYAFTNFYDLNTPGMQLWESYDWIPEIGLRW
SVGADGLSMPLILLTGFITTLAILAAWPVTLKPRLFYFLMLAMYGGQIAVFAVQDMLVFFLAWELELIPVYLLLAIWGGH
KRQYAATKFILYTAGSSLFILVAGLAMAFYGDTVSFDMQTLAAKDYALGFQLLVYAGFLVAYGVKLPIVPLHTWLPDAHG
EATAPVHMLLAGILLKMGGYALIRMNVDMLPAAHAKFAPVLVILGVVNIIYAALTSYAQRNLKRKIAYSSISHIGFVLIG
IASFTNLGMSGAVLQMVSHGLIGASLFFLVGATYDRTHTLILEEMGGVGQKMKKIFAMFTACSLASLALPGMSGFVAELM
VFIGFATSDAYSLPFRVIVVFLAAVGVILTPIYLLSMLREIFYGPENKELVEHEALVDAEPREVFIIACLLVPIIGIGLY
PKLLTQIYDATTGQVIARAREVLPTLAQQTEQPLGILPMVAPQLKANAQ
>P38446 3.1.30.-~~~nucA~~~Nuclease~~~
MGICGKLGVAALVALIVGCSPVQSQVPPLTELSPSISVHLLLGNPSGATPTKLTPDNYLMVKNQYALSYNNSKGTANWVA
WQLNSSWLGNAERQDNFRPDKTLPAGWVRVTPSMYSGSGYDRGHIAPSADRTKTTEDNAATFLMTNMMPQTPDNNRNTWG
NLEDYCRELVSQGKELYIVAGPNGSLGKPLKGKVTVPKSTWKIVVVLDSPGSGLEGITANTRVIAVNIPNDPELNNDWRA
YKVSVDELESLTGYDFLSNVSPNIQTSIESKVDN
>P13717 3.1.30.2~~~nucA~~~Nuclease~~~
MRFNNKMLALAALLFAAQASADTLESIDNCAVGCPTGGSSNVSIVRHAYTLNNNSTTKFANWVAYHITKDTPASGKTRNW
KTDPALNPADTLAPADYTGANAALKVDRGHQAPLASLAGVSDWESLNYLSNITPQKSDLNQGAWARLEDQERKLIDRADI
SSVYTVTGPLYERDMGKLPGTQKAHTIPSAYWKVIFINNSPAVNHYAAFLFDQNTPKGADFCQFRVTVDEIEKRTGLIIW
AGLPDDVQASLKSKPGVLPELMGCKN
>P42983 3.-.-.-~~~nucB~~~Sporulation-specific extracellular nuclease~~~COG3209
MKKWMAGLFLAAAVLLCLMVPQQIQGASSYDKVLYFPLSRYPETGSHIRDAIAEGHPDICTIDRDGADKRREESLKGIPT
KPGYDRDEWPMAVCEEGGAGADVRYVTPSDNRGAGSWVGNQMSSYPDGTRVLFIVQ
>D7Y2H5 3.1.-.-~~~nucC~~~Endodeoxyribonuclease NucC~~~
MSDWSLSQLFASLHEDIQLRLGTARKAFQHPGAKGDASEGVWIEMLDTYLPKRYQAANAFVVDSLGNFSDQIDVVVFDRQ
YSPFIFKFNEQIIVPAESVYAVFEAKQSASADLVAYAQRKVASVRRLHRTSLPIPHAGGTYPAKPLIPILGGLLTFESDW
SPALGMSFDKALNGDLSDGRLDMGCVASHGHFYFNNIDSKFNFEHGNKPATAFLFRLIAQLQFSGTVPMIDIDAYGKWLA
N
>P0DTF8 3.1.-.-~~~nucC~~~Endodeoxyribonuclease NucC~~~
MSQWSLSQLLSSLHEDIQQRLSVVRKTFGHPGTKGDASENVWIDMLDTYLPKRYQAAKAHVVDSLGNFSQQIDVVVFDRQ
YSPFIFTYENETIIPAESVYAVFEAKQTADAGLVAYAQEKVASVRRLHRTSLPIPHAGGTYPAKPLIPILGGLLTFESEW
SPALGPSMDKALNANLTEGRLDIGCVAAHGHFFYDQASGAYSYTNENKPATAFLFKLIAQLQFSGTVPMIDVEAYGQWLT
K
>P0DUD6 3.1.-.-~~~nucC~~~CRISPR-associated endodeoxyribonuclease NucC~~~
MAQDWQLSELLENLHADVQHKLTTVRKSFKHSVVKGDGAENVWVDLFNQYLPERYRASRAFVVDSENQFSEQIDVVIYDR
QYSPFIFHYAEQLIIPAESVYAVFEVKQTLNKQHIDAARKKVASVRALHRTSLPIPHAGGVHSPRELIGIIGGLLTLENE
LKIPDTLMGHLDHDKADKGMLNIGCAADDCFFYYDNDHQRMQVMQHKKATTAFLFELLSQLQKCGTVPMIDIHAYGKWLT
PRISE
>O67335 7.1.1.-~~~nuoC2~~~NADH-quinone oxidoreductase subunit C/D 2~~~COG0649
MKWVNKGTVERVKQEFKDEVKYYETKHTKGFEVSHDFLKPLLKFLKERERFLHFVDMTCIDFPEHPNRFQGVYILYNPEE
NERVIVKSWAKDGKLPTVEDLWPGAKWAEREAYDMFGVVFEGHENLRRMFMWEGYEHYPLRKDFPLQGIPEVELPSLTEV
LHGRTDPPSHDFELVHTKLPTLEDLERTEKARLKKKAELVLNWGPLHPGTHGTIWFLFDLEGEKVVQSDVILGQLHRGME
KLAENLHYFQFIPYTDRMDYISAICNELAYVETVERLLGVEVPEKARYIRTMFAELQRINSHLLWLGTGALDLGALTVFL
YAFREREKIMDIIEGNAGYRLTSCFLRIGGVHYDLAEGTLDVVKHFIKDFPNRLKEYHTLLTRNRIWLRRTKDVGVITRE
DVHNYGLSGPVARGSGVPYDLRKLQPYAAYDEVEFDIPVGEVGDVYDRYLVRMEEMAQSVRIIEQCVQKLEKLPKDAPYL
NKEHPAVIPPKEDVFHDLESMVKSFRVVVHGEDAPPGEVYFAGENPRGELGFFIYSKGGGKPYRTRIRSGALYNLSIFPK
LIQGRTIADAIALLGSLDPVVGETDR
>P0A3S3 3.1.30.-~~~endA~~~DNA-entry nuclease~~~
MNKKTRQTLIGLLVLLLLSTGSYYIKQMPSAPNSPKTNLSQKKQASEAPSQALAESVLTDAVKSQIKGSLEWNGSGAFIV
NGNKTNLDAKVSSKPYADNKTKTVGKETVPTVANALLSKATRQYKNRKETGNGSTSWTPPGWHQVKNLKGSYTHAVDRGH
LLGYALIGGLDGFDASTSNPKNIAVQTAWANQAQAEYSTGQNYYESKVRKALDQNKRVRYRVTLYYASNEDLVPSASQIE
AKSSDGELEFNVLVPNVQKGLQLDYRTGEVTVTQ
>A0A2A5JY22 ~~~~~~Nucleobase transporter PlUacP~~~
MRKSKVLTLGFQHVLAMYAGAVIVPLIVGSSLKLNAEQLAYLVSIDLLTCGIATLLQVWRNKFFGIGLPVMLGCTFTAVG
PMIAIGSEYGMPAIYGSVLASGLFLALFAGFFGKLARFFPPIVTGSVVTIIGITLIPVAVQDMGGGQGSADFGSLSNLAL
SFGVLLFIILANRFFTGFIRAISILLGLIFGTIAGAFMGKVDIGPLLDASWFHGIHPFYFGFPTFHLPSILTMTLVAIVS
VMESTGVFVALGKITEKELTADDLKRGYRSEGLASILGSIMNSFPYTTYSQNVGLIQISKVKSRDVVITAGFILVILGFM
PKIAALTLLIPTAVLGGAMIAMFGMVVSSGIKMLGAIDLNNHENLLIIACSVSVGLGVTVAPNLFDHLPDSIKILTSNGI
VAGSLTAILMNFLFTVGRKKQDESHASAENVHAA
>P37994 3.1.21.-~~~nucM~~~Nuclease NucM~~~COG2356
MLRNLVIFAVLGAGLTTLAAAGQDINNFTQAKAAAAKIHQDAPGTFYCGCKINWQGKKGTPDLASCGYQVRKDANRASRI
EWEHVVPAWQFGHQRQCWQDGGRKNCTKDDVYRQIETDLHNLQPAIGEVNGDRGNFMYSQWNGGERQYGQCEMKIDFKSQ
LAEPPERARGAIARTYFYMRDRYNLNLSRQQTQLFDAWNKQYPATTWECTREKRIAAVQGNHNPYVQQACQP
>P9WIY5 3.1.-.-~~~nucS~~~Endonuclease NucS~~~COG1637
MSRVRLVIAQCTVDYIGRLTAHLPSARRLLLFKADGSVSVHADDRAYKPLNWMSPPCWLTEESGGQAPVWVVENKAGEQL
RITIEGIEHDSSHELGVDPGLVKDGVEAHLQALLAEHIQLLGEGYTLVRREYMTAIGPVDLLCSDERGGSVAVEIKRRGE
IDGVEQLTRYLELLNRDSVLAPVKGVFAAQQIKPQARILATDRGIRCLTLDYDTMRGMDSGEYRLF
>Q5HHM4 3.1.31.1~~~nuc~~~Thermonuclease~~~
MTEYLLSAGICMAIVSILLIGMAISNVSKGQYAKRFFYFATSCLVLTLVVVSSLSSSANASQTDNGVNRSGSEDPTVYSA
TSTKKLHKEPATLIKAIDGDTVKLMYKGQPMTFRLLLVDTPETKHPKKGVEKYGPEASAFTKKMVENAKKIEVEFDKGQR
TDKYGRGLAYIYADGKMVNEALVRQGLAKVAYVYKPNNTHEQLLRKSEAQAKKEKLNIWSEDNADSGQ
>P00644 3.1.31.1~~~nuc~~~Thermonuclease~~~
MLVMTEYLLSAGICMAIVSILLIGMAISNVSKGQYAKRFFFFATSCLVLTLVVVSSLSSSANASQTDNGVNRSGSEDPTV
YSATSTKKLHKEPATLIKAIDGDTVKLMYKGQPMTFRLLLVDTPETKHPKKGVEKYGPEASAFTKKMVENAKKIEVEFDK
GQRTDKYGRGLAYIYADGKMVNEALVRQGLAKVAYVYKPNNTHEQHLRKSEAQAKKEKLNIWSEDNADSGQ
>Q8NXI6 3.1.31.1~~~nuc~~~Thermonuclease~~~
MTEYLLSAGICMAIVSILLIGMAISNVSKGQYAKRFFFFATSCLVLTLVVVSSLSSSANASQTDNGVNRSGSEHPTVYSA
TSTKKLHKEPATLIKAIDGDTVKLMYKGQPMTFRLLLVDTPETKHPKKGVEKYGPEASAFTKKMVENAKKIEVEFDKGQR
TDKYGRGLAYIYADGKMVNEALVRQGLAKVAYVYKPNNTHEQLLRKSEAQAKKEKLNIWSEDNADSGQ
>P43270 3.1.31.1~~~nucH~~~Thermonuclease~~~
MKKITTGLIIVVAAIIVLSIQFMTESGPFKSAGLSNANEQTYKVIRVIDGDTIIVDKDGKQQNLRMIGVDTPETVKPNTP
VQPYGKEASDFTKRHLTNQKVRLEYDKQEKDRYGRTLAYVWLGKEMFNEKLAKEGLARAKFYRPNYKYQERIEQAQKQAQ
KLKKNIWSN
>P0AFC0 3.6.1.67~~~nudB~~~Dihydroneopterin triphosphate diphosphatase~~~COG0494
MKDKVYKRPVSILVVIYAQDTKRVLMLQRRDDPDFWQSVTGSVEEGETAPQAAMREVKEEVTIDVVAEQLTLIDCQRTVE
FEIFSHLRHRYAPGVTRNTESWFCLALPHERQIVFTEHLAYKWLDAPAAAALTKSWSNRQAIEQFVINAA
>P32664 3.6.1.-~~~nudC~~~NAD-capped RNA hydrolase NudC~~~COG2816
MDRIIEKLDHGWWVVSHEQKLWLPKGELPYGEAANFDLVGQRALQIGEWQGEPVWLVQQQRRHDMGSVRQVIDLDVGLFQ
LAGRGVQLAEFYRSHKYCGYCGHEMYPSKTEWAMLCSHCRERYYPQIAPCIIVAIRRDDSILLAQHTRHRNGVHTVLAGF
VEVGETLEQAVAREVMEESGIKVKNLRYVTSQPWPFPQSLMTAFMAEYDSGDIVIDPKELLEANWYRYDDLPLLPPPGTV
ARRLIEDTVAMCRAEYE
>P45799 3.6.1.-~~~nudE~~~ADP compounds hydrolase NudE~~~COG0494
MSKSLQKPTILNVETVARSRLFTVESVDLEFSNGVRRVYERMRPTNREAVMIVPIVDDHLILIREYAVGTESYELGFSKG
LIDPGESVYEAANRELKEEVGFGANDLTFLKKLSMAPSYFSSKMNIVVAQDLYPESLEGDEPEPLPQVRWPLAHMMDLLE
DPDFNEARNVSALFLVREWLKGQGRV
>P77788 3.6.1.65~~~nudG~~~CTP pyrophosphohydrolase~~~COG0494
MKMIEVVAAIIERDGKILLAQRPAQSDQAGLWEFAGGKVEPDESQRQALVRELREELGIEATVGEYVASHQREVSGRIIH
LHAWHVPDFHGTLQAHEHQALVWCSPEEALQYPLAPADIPLLEAFMALRAARPAD
>P52006 3.6.1.9~~~nudI~~~Nucleoside triphosphatase NudI~~~COG0494
MRQRTIVCPLIQNDGAYLLCKMADDRGVFPGQWAISGGGVEPGERIEEALRREIREELGEQLLLTEITPWTFSDDIRTKT
YADGRKEEIYMIYLIFDCVSANREVKINEEFQDYAWVKPEDLVHYDLNVATRKTLRLKGLL
>Q8ZNF5 3.6.1.9~~~nudI~~~Nucleoside triphosphatase NudI~~~
MRQRTIVCPLIQNDGCYLLCKMADNRGVFPGQWALSGGGVEPGERIEEALRREIREELGEQLILSDITPWTFRDDIRIKT
YADGRQEEIYMIYLIFDCVSANRDICINDEFQDYAWVKPEELALYDLNVATRHTLALKGLL
>A1AA28 3.6.1.-~~~nudJ~~~Phosphatase NudJ~~~
MFKPHVTVACVVHAEGKFLVVEETINGKALWNQPAGHLEADETLVEAAARELWEETGISAQPQHFIRMHQWIAPDKTPFL
RFLFAIELEQICPTQPHDSDIDCCRWVSAEEILQASNLRSPLVAESIRCYQSGQRYPLEMIGDFNWPFTKGVI
>P0AEI6 3.6.1.-~~~nudJ~~~Phosphatase NudJ~~~COG1051
MFKPHVTVACVVHAEGKFLVVEETINGKALWNQPAGHLEADETLVEAAARELWEETGISAQPQHFIRMHQWIAPDKTPFL
RFLFAIELEQICPTQPHDSDIDCCRWVSAEEILQASNLRSPLVAESIRCYQSGQRYPLEMIGDFNWPFTKGVI
>P37128 3.6.1.-~~~nudK~~~GDP-mannose pyrophosphatase~~~COG0494
MTQQITLIKDKILSDNYFTLHNITYDLTRKDGEVIRHKREVYDRGNGATILLYNTKKKTVVLIRQFRVATWVNGNESGQL
IESCAGLLDNDEPEVCIRKEAIEETGYEVGEVRKLFELYMSPGGVTELIHFFIAEYSDNQRANAGGGVEDEDIEVLELPF
SQALEMIKTGEIRDGKTVLLLNYLQTSHLMD
>P0AFC3 7.1.1.-~~~nuoA~~~NADH-quinone oxidoreductase subunit A~~~COG0838
MSMSTSTEVIAHHWAFAIFLIVAIGLCCLMLVGGWFLGGRARARSKNVPFESGIDSVGSARLRLSAKFYLVAMFFVIFDV
EALYLFAWSTSIRESGWVGFVEAAIFIFVLLAGLVYLVRIGALDWTPARSRRERMNPETNSIANRQR
>P9WIW7 7.1.1.-~~~nuoA~~~NADH-quinone oxidoreductase subunit A~~~COG0838
MNVYIPILVLAALAAAFAVVSVVIASLVGPSRFNRSKQAAYECGIEPASTGARTSIGPGAASGQRFPIKYYLTAMLFIVF
DIEIVFLYPWAVSYDSLGTFALVEMAIFMLTVFVAYAYVWRRGGLTWD
>O67334 7.1.1.-~~~nuoB~~~NADH-quinone oxidoreductase subunit B~~~COG0377
MVAINSNGFVTTTVEELLRWGRRNSLWPVTIGLACCAIEMMHTAASRFDLDRLGVIFRASPRQADVLIVAGTVVNKVAPM
LKLIWDQMPDPKWCISMGGCASAGGPFPTYSTLQGVDRIIPVDVYIPGCPPTPQGLIYGILQLQRKIKEQGITKYDKLFA
DFNREIEKEGIFVPRELKV
>P0AFC7 7.1.1.-~~~nuoB~~~NADH-quinone oxidoreductase subunit B~~~COG0377
MDYTLTRIDPNGENDRYPLQKQEIVTDPLEQEVNKNVFMGKLNDMVNWGRKNSIWPYNFGLSCCYVEMVTSFTAVHDVAR
FGAEVLRASPRQADLMVVAGTCFTKMAPVIQRLYDQMLEPKWVISMGACANSGGMYDIYSVVQGVDKFIPVDVYIPGCPP
RPEAYMQALMLLQESIGKERRPLSWVVGDQGVYRANMQSERERKRGERIAVTNLRTPDEI
>A0QU35 7.1.1.-~~~nuoB~~~NADH-quinone oxidoreductase subunit B~~~COG0377
MGLEERLPGGILLSTVETVAGYVRKGSLWPATFGLACCAIEMMSTAGPRFDIARFGMERFSATPRQADLMIVAGRVSQKM
APVLRQIYDQMVEPKWVLAMGVCASSGGMFNNYAVVQGVDHVVPVDIYLPGCPPRPEMLLHAILKLHDKIQQMPLGVNRE
EAIREAEQAALAVPPTIELKGLLR
>P9WJH1 7.1.1.-~~~nuoB~~~NADH-quinone oxidoreductase subunit B~~~COG0377
MGLEEQLPGGILLSTVEKVAGYVRKNSLWPATFGLACCAIEMMATAGPRFDIARFGMERFSATPRQADLMIVAGRVSQKM
APVLRQIYDQMAEPKWVLAMGVCASSGGMFNNYAIVQGVDHVVPVDIYLPGCPPRPEMLLHAILKLHEKIQQMPLGINRE
RAIAEAEEAALLARPTIEMRGLLR
>A1B497 7.1.1.-~~~nuoB~~~NADH-quinone oxidoreductase subunit B~~~COG0377
MMTGLNTAGADRDLATAELNRELQDKGFLLTTTEDIINWARNGSLHWMTFGLACCAVEMMQTSMPRYDLERFGTAPRASP
RQSDLMIVAGTLTNKMAPALRKVYDQMPEPRYVISMGSCANGGGYYHYSYSVVRGCDRIVPVDIYVPGCPPTAEALLYGI
LQLQRRIRRTGTLVR
>P33599 7.1.1.-~~~nuoC~~~NADH-quinone oxidoreductase subunit C/D~~~COG0649
MTDLTAQEPAWQTRDHLDDPVIGELRNRFGPDAFTVQATRTGVPVVWIKREQLLEVGDFLKKLPKPYVMLFDLHGMDERL
RTHREGLPAADFSVFYHLISIDRNRDIMLKVALAENDLHVPTFTKLFPNANWYERETWDLFGITFDGHPNLRRIMMPQTW
KGHPLRKDYPARATEFSPFELTKAKQDLEMEALTFKPEEWGMKRGTENEDFMFLNLGPNHPSAHGAFRIVLQLDGEEIVD
CVPDIGYHHRGAEKMGERQSWHSYIPYTDRIEYLGGCVNEMPYVLAVEKLAGITVPDRVNVIRVMLSELFRINSHLLYIS
TFIQDVGAMTPVFFAFTDRQKIYDLVEAITGFRMHPAWFRIGGVAHDLPRGWDRLLREFLDWMPKRLASYEKAALQNTIL
KGRSQGVAAYGAKEALEWGTTGAGLRATGIDFDVRKARPYSGYENFDFEIPVGGGVSDCYTRVMLKVEELRQSLRILEQC
LNNMPEGPFKADHPLTTPPPKERTLQHIETLITHFLQVSWGPVMPANESFQMIEATKGINSYYLTSDGSTMSYRTRVRTP
SFAHLQQIPAAIRGSLVSDLIVYLGSIDFVMSDVDR
>A0QU34 7.1.1.-~~~nuoC~~~NADH-quinone oxidoreductase subunit C~~~COG0852
MSTSNGSANGTNGVGLPRGDEPEIIAVRRGMFGNRDTGDTSGYGRLVRPVALPGSTPRPYGGYFDAVMDRLAEVLGEERY
AMSIERVVVYRDQLTIEVSRVQLPAVASVLRDDPDLRFELCLGVSGVHYPEDTGRELHAVYPLMSITHNRRIQLEVAAPD
ADPHIPSLYAVYPTTDWHERETYDFFGIIFDGHPSLTRIEMPDDWEGHPQRKDYPLGGIPVEYHGAQIPPPDQRRSYS
>P9WJH3 7.1.1.-~~~nuoC~~~NADH-quinone oxidoreductase subunit C~~~COG0852
MSPPNQDAQEGRPDSPTAEVVDVRRGMFGVSGTGDTSGYGRLVRQVVLPGSSPRPYGGYFDDIVDRLAEALRHERVEFED
AVEKVVVYRDELTLHVRRDLLPRVAQRLRDEPELRFELCLGVSGVHYPHETGRELHAVYPLQSITHNRRLRLEVSAPDSD
PHIPSLFAIYPTNDWHERETYDFFGIIFDGHPALTRIEMPDDWQGHPQRKDYPLGGIPVEYKGAQIPPPDERRGYN
>A0QU33 7.1.1.-~~~nuoD~~~NADH-quinone oxidoreductase subunit D~~~COG0649
MSTSTVPPDGGEKVVVVGGNDWHEVVAAARAGAAAQAGERIVVNMGPQHPSTHGVLRLILEIEGEIITEARCGIGYLHTG
IEKNLEYRNWTQGVTFVTRMDYLSPFFNETAYCLGVEKLLGITDDIPERASVIRVMLMELNRISSHLVALATGGMELGAM
SAMFYGFREREEILRVFESITGLRMNHAYIRPGGLAADLPDDAITQVRRLVEILPKRLKDLEDLLNENYIWKARTVGVGY
LDLTGCMALGITGPILRSTGLPHDLRKAQPYCGYENYEFDVITDDRCDSYGRYIIRVKEMHESVKIVEQCLARLKPGPVM
ISDKKLAWPADLKLGPDGLGNSPEHIAKIMGRSMEGLIHHFKLVTEGIRVPPGQVYVAVESPRGELGVHMVSDGGTRPYR
VHYRDPSFTNLQAVAATCEGGMVADAIAAVASIDPVMGGVDR
>P9WJH5 7.1.1.-~~~nuoD~~~NADH-quinone oxidoreductase subunit D~~~COG0649
MTAIADSAGGAGETVLVAGGQDWQQVVDAARSADPGERIVVNMGPQHPSTHGVLRLILEIEGETVVEARCGIGYLHTGIE
KNLEYRYWTQGVTFVTRMDYLSPFFNETAYCLGVEKLLGITDEIPERVNVIRVLMMELNRISSHLVALATGGMELGAMTP
MFVGFRAREIVLTLFEKITGLRMNSAYIRPGGVAQDLPPNAATEIAEALKQLRQPLREMGELLNENAIWKARTQGVGYLD
LTGCMALGITGPILRSTGLPHDLRKSEPYCGYQHYEFDVITDDSCDAYGRYMIRVKEMWESMKIVEQCLDKLRPGPTMIS
DRKLAWPADLQVGPDGLGNSPKHIAKIMGSSMEALIHHFKLVTEGIRVPAGQVYVAVESPRGELGVHMVSDGGTRPYRVH
YRDPSFTNLQSVAAMCEGGMVADLIAAVASIDPVMGGVDR
>A1B495 7.1.1.-~~~nuoD~~~NADH-quinone oxidoreductase subunit D~~~COG0649
MDGDIRKNSYDDGSMDALTGEQSIRNFNINFGPQHPAAHGVLRMVLELDGEIVERADPHIGLLHRGTEKLMESRTYLQNL
PYLDRLDYVAPMNQEHAWCLAIERLTGTVIPRRASLIRVLYSEIGRILNHLMGVTTGAMDVGALTPPLWGFEAREELMIF
YERACGARLHAAYFRPGGVHQDLPPDLLDDIEEWCERFPKLVDDLDTLLTENRIFKQRLVDIGIVTEADALDWGYTGVMV
RGSGLAWDLRRSQPYECYDEFDFQIPVGRNGDCYDRYLCRMAEMRESCKIMQQAVQKLRAEPAGDVLARGKLTPPRRAEM
KRDMESLIHHFKLYTEGFKVPAGEVYAAVEAPKGEFGVYLVADGTNKPWRAKLRAPGFAHLQSIDWMSRGHMLADVPAII
ATLDIVFGEVDR
>O66842 7.1.1.-~~~nuoE~~~NADH-quinone oxidoreductase subunit E~~~COG1905
MFKTEFEFPEELKTKLQEHINYFPKKRQAILLCLHEIQNYYGYIPPESLKPLADMLELPLNHVEGVVAFYDMFDREDKAK
YRIRVCVSIVCHLMGTNKLLKALENILGIKPGEVTPDGKFKIVPVQCLGACSEAPVFMVNDDEYKFESEVQLNEILSRYT
>P0AFD1 7.1.1.-~~~nuoE~~~NADH-quinone oxidoreductase subunit E~~~COG1905
MHENQQPQTEAFELSAAEREAIEHEMHHYEDPRAASIEALKIVQKQRGWVPDGAIHAIADVLGIPASDVEGVATFYSQIF
RQPVGRHVIRYCDSVVCHINGYQGIQAALEKKLNIKPGQTTFDGRFTLLPTCCLGNCDKGPNMMIDEDTHAHLTPEAIPE
LLERYK
>P9WIV5 7.1.1.-~~~nuoE~~~NADH-quinone oxidoreductase subunit E~~~COG1905
MTQPPGQPVFIRLGPPPDEPNQFVVEGAPRSYPPDVLARLEVDAKEIIGRYPDRRSALLPLLHLVQGEDSYLTPAGLRFC
ADQLGLTGAEVSAVASFYTMYRRRPTGEYLVGVCTNTLCAVMGGDAIFDRLKEHLGVGHDETTSDGVVTLQHIECNAACD
YAPVVMVNWEFFDNQTPESARELVDSLRSDTPKAPTRGAPLCGFRQTSRILAGLPDQRPDEGQGGPGAPTLAGLQVARKN
DMQAPPTPGADE
>O66841 7.1.1.-~~~nuoF~~~NADH-quinone oxidoreductase subunit F~~~COG1894
MRSYPAIPRIYAETTLNMLLKRAKKPRVHSIDEYLKDGGYQALEKALNMSPEEIIDWVDKSTLRGRGGAGFPTGKKWKFA
VQNPGPRYFICNADESEPGTFKDRIIIERDPHLLIEGIIISSYAIGANEAYIYIRGEYPAGYYILRDAIEEAKKKGFLGK
NILGSGFDLEIYVARGAGAYICGEETALIESLEGKRGHPRLKPPYPVQKGLWGKPTVVNNVETIANVPFIISMGWEEYRY
IGPSDYAGPKLFPVSGKVKKPGVYELPMNTTLREVIFKYAGGTLGNKKVKAVFSGALDCFSSEELDIPMDYSPLGFGGTG
TVIVLTEEDDIVEAALKIAEFYEHETCGQCTPCRVGCYEQANLLEKIYKGEATEQDWEGFDFVNRNIQPTSICGLGAVAG
RLIRQTLEKFPEEWEKYRKKSASLPL
>P31979 7.1.1.-~~~nuoF~~~NADH-quinone oxidoreductase subunit F~~~COG1894
MKNIIRTPETHPLTWRLRDDKQPVWLDEYRSKNGYEGARKALTGLSPDEIVNQVKDAGLKGRGGAGFSTGLKWSLMPKDE
SMNIRYLLCNADEMEPGTYKDRLLMEQLPHLLVEGMLISAFALKAYRGYIFLRGEYIEAAVNLRRAIAEATEAGLLGKNI
MGTGFDFELFVHTGAGRYICGEETALINSLEGRRANPRSKPPFPATSGAWGKPTCVNNVETLCNVPAILANGVEWYQNIS
KSKDAGTKLMGFSGRVKNPGLWELPFGTTAREILEDYAGGMRDGLKFKAWQPGGAGTDFLTEAHLDLPMEFESIGKAGSR
LGTALAMAVDHEINMVSLVRNLEEFFARESCGWCTPCRDGLPWSVKILRALERGEGQPGDIETLEQLCRFLGPGKTFCAH
APGAVEPLQSAIKYFREEFEAGIKQPFSNTHLINGIQPNLLKERW
>P9WIV7 7.1.1.-~~~nuoF~~~NADH-quinone oxidoreductase subunit F~~~COG1894
MTTQATPLTPVISRHWDDPESWTLATYQRHDRYRGYQALQKALTMPPDDVISIVKDSGLRGRGGAGFATGTKWSFIPQGD
TGAAAKPHYLVVNADESEPGTCKDIPLMLATPHVLIEGVIIAAYAIRAHHAFVYVRGEVVPVLRRLHNAVAEAYAAGFLG
RNIGGSGFDLELVVHAGAGAYICGEETALLDSLEGRRGQPRLRPPFPAVAGLYGCPTVINNVETIASVPSIILGGIDWFR
SMGSEKSPGFTLYSLSGHVTRPGQYEAPLGITLRELLDYAGGVRAGHRLKFWTPGGSSTPLLTDEHLDVPLDYEGVGAAG
SMLGTKALEIFDETTCVVRAVRRWTEFYKHESCGKCTPCREGTFWLDKIYERLETGRGSHEDIDKLLDISDSILGKSFCA
LGDGAASPVMSSIKHFRDEYLAHVEGGGCPFDPRDSMLVANGVDA
>A8GQT6 7.1.1.-~~~nuoF~~~NADH-quinone oxidoreductase subunit F~~~
MLKEEDKIFTNLHGQQSHDLKSSKKRGDWENTKALLDKGRDFIVEEVKKSGLRGRGGAGFSTGMKWSFMPKNLEKSCYLV
VNADESEPGTCKDRDILRFEPHKLIEGCLLASFAIGANNCYIYIRGEFYNEASNIQRALDEAYKEGLIGKNSCGSGFDCN
IYLHRGAGAYICGEETALLESLEGKKGMPRLKPPFPAGFGLYGCPTTINNVESIAVVPTILRRGASWFAGIGKPNNTGTK
IFCISGHVNKPCNVEEAMGISLKELIEKYAGGVRGGWDNLKAIIPGGSSVPLLPKSLCEVDMDFDSLRTAGSGLGTGGII
VMDQSTDIIYAIARLSKFYMHESCGQCTPCREGTGWMWRVMMRLVKGNVTKSEIDELLNVTKAIEGHTICALGDAAAWPI
QGLIRHFRSEIEARIKSYSVV
>P33602 7.1.1.-~~~nuoG~~~NADH-quinone oxidoreductase subunit G~~~COG1034
MATIHVDGKEYEVNGADNLLEACLSLGLDIPYFCWHPALGSVGACRQCAVKQYQNAEDTRGRLVMSCMTPASDGTFISID
DEEAKQFRESVVEWLMTNHPHDCPVCEEGGNCHLQDMTVMTGHSFRRYRFTKRTHRNQDLGPFISHEMNRCIACYRCVRY
YKDYADGTDLGVYGAHDNVYFGRPEDGTLESEFSGNLVEICPTGVFTDKTHSERYNRKWDMQFAPSICQQCSIGCNISPG
ERYGELRRIENRYNGTVNHYFLCDRGRFGYGYVNLKDRPRQPVQRRGDDFITLNAEQAMQGAADILRQSKKVIGIGSPRA
SVESNFALRELVGEENFYTGIAHGEQERLQLALKVLREGGIYTPALREIESYDAVLVLGEDVTQTGARVALAVRQAVKGK
AREMAAAQKVADWQIAAILNIGQRAKHPLFVTNVDDTRLDDIAAWTYRAPVEDQARLGFAIAHALDNSAPAVDGIEPELQ
SKIDVIVQALAGAKKPLIISGTNAGSLEVIQAAANVAKALKGRGADVGITMIARSVNSMGLGIMGGGSLEEALTELETGR
ADAVVVLENDLHRHASAIRVNAALAKAPLVMVVDHQRTAIMENAHLVLSAASFAESDGTVINNEGRAQRFFQVYDPAYYD
SKTVMLESWRWLHSLHSTLLSREVDWTQLDHVIDAVVAKIPELAGIKDAAPDATFRIRGQKLAREPHRYSGRTAMRANIS
VHEPRQPQDIDTMFTFSMEGNNQPTAHRSQVPFAWAPGWNSPQAWNKFQDEVGGKLRFGDPGVRLFETSENGLDYFTSVP
ARFQPQDGKWRIAPYYHLFGSDELSQRAPVFQSRMPQPYIKLNPADAAKLGVNAGTRVSFSYDGNTVTLPVEIAEGLTAG
QVGLPMGMSGIAPVLAGAHLEDLKEAQQ
>P9WIV9 7.1.1.-~~~nuoG~~~NADH-quinone oxidoreductase subunit G~~~COG1034
MTQAADTDIRVGQPEMVTLTIDGVEISVPKGTLVIRAAELMGIQIPRFCDHPLLEPVGACRQCLVEVEGQRKPLASCTTV
ATDDMVVRTQLTSEIADKAQHGVMELLLINHPLDCPMCDKGGECPLQNQAMSNGRTDSRFTEAKRTFAKPINISAQVLLD
RERCILCARCTRFSDQIAGDPFIDMQERGALQQVGIYADEPFESYFSGNTVQICPVGALTGTAYRFRARPFDLVSSPSVC
EHCASGCAQRTDHRRGKVLRRLAGDDPEVNEEWNCDKGRWAFTYATQPDVITTPLIRDGGDPKGALVPTSWSHAMAVAAQ
GLAAARGRTGVLVGGRVTWEDAYAYAKFARITLGTNDIDFRARPHSAEEADFLAARIAGRHMAVSYADLESAPVVLLVGF
EPEDESPIVFLRLRKAARRHRVPVYTIAPFATGGLHKMSGRLIKTVPGGEPAALDDLATGAVGDLLATPGAVIIVGERLA
TVPGGLSAAARLADTTGARLAWVPRRAGERGALEAGALPTLLPGGRPLADEVARAQVCAAWHIAELPAAAGRDADGILAA
AADETLAALLVGGIEPADFADPDAVLAALDATGFVVSLELRHSTVTERADVVFPVAPTTQKAGAFVNWEGRYRTFEPALR
GSTLQAGQSDHRVLDALADDMGVHLGVPTVEAAREELAALGIWDGKHAAGPHIAATGPTQPEAGEAILTGWRMLLDEGRL
QDGEPYLAGTARTPVVRLSPDTAAEIGAADGEAVTVSTSRGSITLPCSVTDMPDRVVWLPLNSAGSTVHRQLRVTIGSIV
KIGAGS
>Q9KGW3 7.1.1.-~~~nuoG~~~NADH-quinone oxidoreductase subunit G~~~COG1034
MATIHVDGKELEVDGADNLLQACLSLGLDIPYFCWHPALGSVGACRQCAVKQYTDENDKRGRIVMSCMTPATDGSWISID
DEEAKVFRASVVEWLMTNHPHDCPVCEEGGHCHLQDMTVMTGHNERRYRFTKRTHQNQDLGPFISHEMNRCIACYRCVRF
YKDYAGGTDLGVFGAHDNVYFGRVEDGTLESEFSGNLTEVCPTGVFTDKTHSERYNRKWDMQFSPSICHGCSSGCNISPG
ERYGELRRIENRFNGSVNQYFLCDRGRFGYGYVNRKDRPRQPLLANGAKLSLDQALDKAAELLRGRNIVGIGSPRASLES
NYALRELVGAEHFYSGIEAGELERIRLVLQVLKDSPLPVPNMRDIEDHDAVFVLGEDLTQTAARMALALRQSVKGKAEDM
ADAMRVQPWLDAAVKNIGQHALNPLFIASLAETKLDDVAEECVHAAPDDLARIGFAVAHALDASAPAVDGLDSEAAALAQ
RIADALLAAKRPLIIAGTSLGSKALIEAAANIAKALKLREKNGSISLIVPEANSLGLAMLGGESVDAALQAVIDGSADAI
VVLENDLYTRTDKAKVDAALNAAKVLIVADHQKTATTDRAHLVLPAASFAEGDGTLVSQEGRAQRFFQVFDPQYLDASIL
VHEGWRWLHALRATLLDQPIDWTQLDHVTAAVASSSPQLAAIVDAAPSASFRIKGLKLAREPLRYSGRTAMRADISVHEP
RTSQDNDTAFSFSMEGYSGSTEPRSQVPFAWSPGWNSPQAWNKFQDEVGGHLRAGDPGTRLIESQGDHLSWFASVPRAFN
PAPGTWQVVPFHHLFGSEENSSKAAPVQERIPAAYVSLAKSEADRLGVNDGALLSLNVAGQTLRLPLRINEELGAGLVAL
PAGLAGIPPAIFGKTVDGLQEAAQ
>P0AFD4 7.1.1.-~~~nuoH~~~NADH-quinone oxidoreductase subunit H~~~COG1005
MSWISPELIEILLTILKAVVILLVVVTCGAFMSFGERRLLGLFQNRYGPNRVGWGGSLQLVADMIKMFFKEDWIPKFSDR
VIFTLAPMIAFTSLLLAFAIVPVSPGWVVADLNIGILFFLMMAGLAVYAVLFAGWSSNNKYSLLGAMRASAQTLSYEVFL
GLSLMGVVAQAGSFNMTDIVNSQAHVWNVIPQFFGFITFAIAGVAVCHRHPFDQPEAEQELADGYHIEYSGMKFGLFFVG
EYIGIVTISALMVTLFFGGWQGPLLPPFIWFALKTAFFMMMFILIRASLPRPRYDQVMSFGWKICLPLTLINLLVTAAVI
LWQAQ
>A0QU29 7.1.1.-~~~nuoH~~~NADH-quinone oxidoreductase subunit H~~~COG1005
MTHPDPTLFGHDPWWLMLAKAVAIFVFLLLTVLSAILIERKLLGRMQMRFGPNRVGPAGLLQSLADGIKLALKEGLVPAG
VDKPIYLLAPVISVIPAFVAFSVIPLGGAVSVFGHRTPLQLTDLPVAVLFILAATSIGVYGIVLAGWASGSTYPLLGGLR
SSAQVVSYEIAMGLSFVAVFLYAGTMSTSGIVAAQDRTWFVFLLLPSFLVYVVSMVGETNRAPFDLPEAEGELVGGFHTE
YSSLKFAMFMLAEYVNMTTVSALATTMFLGGWHAPFPFNLIDGANSGWWPLLWFTAKVWTFMFLYFWLRATLPRLRYDQF
MALGWKVLIPVSLLWIMVVAITRSLRQHGEGTWAAWLLTAAVVVVVALIWGLATSLRRRTVQPPPPQSTGAYPVPPLPSV
GTKETADA
>P9WIX1 7.1.1.-~~~nuoH~~~NADH-quinone oxidoreductase subunit H~~~COG1005
MTTFGHDTWWLVAAKAIAVFVFLMLTVLVAILAERKLLGRMQLRPGPNRVGPKGALQSLADGIKLALKESITPGGIDRFV
YFVAPIISVIPAFTAFAFIPFGPEVSVFGHRTPLQITDLPVAVLFILGLSAIGVYGIVLGGWASGSTYPLLGGVRSTAQV
ISYEVAMGLSFATVFLMAGTMSTSQIVAAQDGVWYAFLLLPSFVIYLISMVGETNRAPFDLPEAEGELVAGFHTEYSSLK
FAMFMLAEYVNMTTVSALAATLFFGGWHAPWPLNMWASANTGWWPLIWFTAKVWGFLFIYFWLRATLPRLRYDQFMALGW
KLLIPVSLVWVMVAAIIRSLRNQGYQYWTPTLVFSSIVVAAAMVLLLRKPLSAPGARASARQRGDEGTSPEPAFPTPPLL
AGATKENAGG
>A1B487 7.1.1.-~~~nuoH~~~NADH-quinone oxidoreductase subunit H~~~COG1005
MAEFWASPYGFALSMLLQGLAVIAFVMGSLIFMVYGDRKIWAAVQMRRGPNVVGPWGLLQTFADALKYIVKEIVIPAGAD
KFVYFLAPFLSMMLALFAFVVIPFDEGWVMANINVGILFIFAASSLEVYGVIMGGWASNSKYPFLASLRSAAQMISYEVS
LGLIIIGIIISTGSMNLTAIVEAQRGDYGLLNWYWLPHLPMVVLFFVSALAECNRPPFDLVEAESELVAGFMTEYSSTPY
LLFMAGEYIAMYLMCALLSLLFFGGWLSPVPFIADGWWWMVIKMWFWFYMFAMVKAIVPRYRYDQLMRIGWKVFLPLSLG
WVVLVAILARYEILGGFWARFAVGG
>P42032 7.1.1.-~~~nuoH~~~NADH-quinone oxidoreductase subunit H~~~
MADFWATSLGQTLILLAQGLGIIAFVMIGLLLLVWGDRKIWAAVQMRKGPNVVGAFGLLQSVADAAKYVFKEIVVPAGVD
KPVYFLAPMLSLVLALLAWVVVPFNEGWVMADINVAVLFVFAVSSLEVYGVIMGGWASNSKYPFLGSLRSAAQMISYEVS
MGLIIVGVIISTGSMNLSAIVEAQRGDFGLLNWYWLPHLPMVALFFISALAETNRPPFDLPEAESELVAGFMVEYSSTPY
LLFMAGEYIAVWLMCALTSVLFFGGWLSPIPGVPDGVLWMVAKMAAVFFVFAMVKAIVPRYRYDQLMRIGWKVFLPLSLA
WVVVVAFLAKFEVLGGFWARWSIGA
>O67337 7.1.1.-~~~nuoI~~~NADH-quinone oxidoreductase subunit I~~~COG1143
MGVKKLSRKDYLNILESILFIDFLKGLSVTLKNLLRRPITTEYPKEKLTPPKRFRGAHGHYVWDGTEPDSLKAIEKFMSY
EKAKSRCVACYMCQTACPMPTLFRIEAVQLPNGKKKVVRFDMNLLNCLFCGLCVDACPVGCLTMTDIFELANYSRRNEVL
RMEDLEKFAIDFKQRRGNEPDRIWPNDEEREKLWGKIEWSG
>P0AFD6 7.1.1.-~~~nuoI~~~NADH-quinone oxidoreductase subunit I~~~COG1143
MTLKELLVGFGTQVRSIWMIGLHAFAKRETRMYPEEPVYLPPRYRGRIVLTRDPDGEERCVACNLCAVACPVGCISLQKA
ETKDGRWYPEFFRINFSRCIFCGLCEEACPTTAIQLTPDFEMGEYKRQDLVYEKEDLLISGPGKYPEYNFYRMAGMAIDG
KDKGEAENEAKPIDVKSLLP
>A0QU28 7.1.1.-~~~nuoI~~~NADH-quinone oxidoreductase subunit I~~~COG1143
MPKFLDALAGFAVTLGSMFKKPITEGYPEKPGPVAPRYHGRHQLNRYPDGLEKCIGCELCAWACPADAIYVEGADNTADE
RYSPGERYGRVYQINYLRCIGCGLCIEACPTRALTMTTEYEMADDNRADLIWGKDKLLAPLQEGMQAPPHDMAPGKTDDD
YYLGNVTPITPVPSGTEDAR
>P9WJG9 7.1.1.-~~~nuoI~~~NADH-quinone oxidoreductase subunit I~~~COG1143
MANTDRPALPHKRAVPPSRADSGPRRRRTKLLDAVAGFGVTLGSMFKKTVTEEYPERPGPVAARYHGRHQLNRYPDGLEK
CIGCELCAWACPADAIYVEGADNTEEERFSPGERYGRVYQINYLRCIGCGLCIEACPTRALTMTYDYELADDNRADLIYE
KDRLLAPLLPEMAAPPHPRTPGATDKDYYLGNVTAEGLRGVRESQTTGDSR
>A1B486 7.1.1.-~~~nuoI~~~NADH-quinone oxidoreductase subunit I~~~COG1143
MAFDFARATKYFLMWDFIKGFGLGMRYFVSPKPTLNYPHEKGPLSPRFRGEHALRRYPNGEERCIACKLCEAVCPAQAIT
IDAEPREDGSRRTTRYDIDMTKCIYCGFCQEACPVDAIVEGPNFEYATETREELFYDKQKLLANGERWEAEIARNLQLDA
PYR
>Q8RQ74 7.1.1.-~~~nuoI~~~NADH-quinone oxidoreductase subunit I~~~COG1143
MFKYIGDIVKGTGTQLRSLVMIFGHGFRKRDTLQYPEEPVYLAPRYRGRIVLTRDPDGEERCVACNLCAVACPVGCISLQ
KAETEDGRWYPDFFRINFSRCIFCGLCEEACPTTAIQLTPDFEMAEFKRQDLVYEKEDLLISGPGKNPDYNFYRVAGMAI
AGKPKGAAQNEAEPINVKSLLP
>P0AFE0 7.1.1.-~~~nuoJ~~~NADH-quinone oxidoreductase subunit J~~~COG0839
MEFAFYICGLIAILATLRVITHTNPVHALLYLIISLLAISGVFFSLGAYFAGALEIIVYAGAIMVLFVFVVMMLNLGGSE
IEQERQWLKPQVWIGPAILSAIMLVVIVYAILGVNDQGIDGTPISAKAVGITLFGPYVLAVELASMLLLAGLVVAFHVGR
EERAGEVLSNRKDDSAKRKTEEHA
>C5W716 7.1.1.-~~~nuoK~~~NADH-quinone oxidoreductase subunit K~~~COG0713
MIPLQHGLILAAILFVLGLTGLVIRRNLLFMLIGLEIMINASALAFVVAGSYWGQTDGQVMYILAISLAAAEASIGLALL
LQLHRRRQNLNIDSVSEMRG
>P0AFE4 7.1.1.-~~~nuoK~~~NADH-quinone oxidoreductase subunit K~~~COG0713
MIPLQHGLILAAILFVLGLTGLVIRRNLLFMLIGLEIMINASALAFVVAGSYWGQTDGQVMYILAISLAAAEASIGLALL
LQLHRRRQNLNIDSVSEMRG
>A0QU26 7.1.1.-~~~nuoK~~~NADH-quinone oxidoreductase subunit K~~~COG0713
MNPDNYLYLSALLFTIGAAGVLLRRNAIVMFMCVELMLNAANLAFVNFSRMHGQLDGQVVAFFTMVVAACEVVVGLAIIM
AIFRTRRSASVDDANLLKH
>A1B482 7.1.1.-~~~nuoK~~~NADH-quinone oxidoreductase subunit K~~~COG0713
MIGLTHYLVVGAILFVTGIFGIFVNRKNVIVILMSIELMLLAVNINFVAFSTHLGDLAGQVFTMFVLTVAAAEAAIGLAI
LVVFFRNRGTIAVEDVNVMKG
>P33607 7.1.1.-~~~nuoL~~~NADH-quinone oxidoreductase subunit L~~~COG1009
MNMLALTIILPLIGFVLLAFSRGRWSENVSAIVGVGSVGLAALVTAFIGVDFFANGEQTYSQPLWTWMSVGDFNIGFNLV
LDGLSLTMLSVVTGVGFLIHMYASWYMRGEEGYSRFFAYTNLFIASMVVLVLADNLLLMYLGWEGVGLCSYLLIGFYYTD
PKNGAAAMKAFVVTRVGDVFLAFALFILYNELGTLNFREMVELAPAHFADGNNMLMWATLMLLGGAVGKSAQLPLQTWLA
DAMAGPTPVSALIHAATMVTAGVYLIARTHGLFLMTPEVLHLVGIVGAVTLLLAGFAALVQTDIKRVLAYSTMSQIGYMF
LALGVQAWDAAIFHLMTHAFFKALLFLASGSVILACHHEQNIFKMGGLRKSIPLVYLCFLVGGAALSALPLVTAGFFSKD
EILAGAMANGHINLMVAGLVGAFMTSLYTFRMIFIVFHGKEQIHAHAVKGVTHSLPLIVLLILSTFVGALIVPPLQGVLP
QTTELAHGSMLTLEITSGVVAVVGILLAAWLWLGKRTLVTSIANSAPGRLLGTWWYNAWGFDWLYDKVFVKPFLGIAWLL
KRDPLNSMMNIPAVLSRFAGKGLLLSENGYLRWYVASMSIGAVVVLALLMVLR
>P9WIW1 7.1.1.-~~~nuoL~~~NADH-quinone oxidoreductase subunit L~~~COG1009
MTTSLGTHYTWLLVALPLAGAAILLFGGRRTDAWGHLLGCAAALAAFGVGAMLLADMLGRDGLERAIHQQVFTWIPAGGL
QVDFGLQIDQLSMCFVLLISGVGSLIHIYSVGYMAEDPDRRRFFGYLNLFLASMLLLVVADNYVLLYVGWEGVGLASYLL
IGFWYHKPSAATAAKKAFVMNRVGDAGLAVGMFLTFSTFGTLSYAGVFAGVPAASRAVLTAIGLLMLLGACAKSAQVPLQ
AWLGDAMEGPTPVSALIHAATMVTAGVYLIVRSGPLYNLAPTAQLAVVIVGAVTLLFGAIIGCAKDDIKRALAASTISQI
GYMVLAAGLGPAGYAFAIMHLLTHGFFKAGLFLGSGAVIHAMHEEQDMRRYGGLRAALPVTFATFGLAYLAIIGVPPFAG
FFSKDAIIEAALGAGGIRGSLLGGAALLGAGVTAFYMTRVMLMTFFGEKRWTPGAHPHEAPAVMTWPMILLAVGSVFSGG
LLAVGGTLRHWLQPVVGSHEEATHALPTWVATTLALGVVAVGIAVAYRMYGTAPIPRVAPVRVSALTAAARADLYGDAFN
EEVFMRPGAQLTNAVVAVDDAGVDGSVNALATLVSQTSNRLRQMQTGFARNYALSMLVGAVLVAAALLVVQLW
>P0AFE8 7.1.1.-~~~nuoM~~~NADH-quinone oxidoreductase subunit M~~~COG1008
MLLPWLILIPFIGGFLCWQTERFGVKVPRWIALITMGLTLALSLQLWLQGGYSLTQSAGIPQWQSEFDMPWIPRFGISIH
LAIDGLSLLMVVLTGLLGVLAVLCSWKEIEKYQGFFHLNLMWILGGVIGVFLAIDMFLFFFFWEMMLVPMYFLIALWGHK
ASDGKTRITAATKFFIYTQASGLVMLIAILALVFVHYNATGVWTFNYEELLNTPMSSGVEYLLMLGFFIAFAVKMPVVPL
HGWLPDAHSQAPTAGSVDLAGILLKTAAYGLLRFSLPLFPNASAEFAPIAMWLGVIGIFYGAWMAFAQTDIKRLIAYTSV
SHMGFVLIAIYTGSQLAYQGAVIQMIAHGLSAAGLFILCGQLYERIHTRDMRMMGGLWSKMKWLPALSLFFAVATLGMPG
TGNFVGEFMILFGSFQVVPVITVISTFGLVFASVYSLAMLHRAYFGKAKSQIASQELPGMSLRELFMILLLVVLLVLLGF
YPQPILDTSHSAIGNIQQWFVNSVTTTRP
>P9WIW5 7.1.1.-~~~nuoM~~~NADH-quinone oxidoreductase subunit M~~~COG1008
MNNVPWLSVLWLVPLAGAVLIILLPPGRRRLAKWAGMVVSVLTLAVSIVVAAEFKPSAEPYQFVEKHSWIPAFGAGYTLG
VDGIAVVLVLLTTVLIPLLLVAGWNDATDADDLSPASGRYPQRPAPPRLRSSGGERTRGVHAYVALTLAIESMVLMSVIA
LDVLLFYVFFEAMLIPMYFLIGGFGQGAGRSRAAVKFLLYNLFGGLIMLAAVIGLYVVTAQYDSGTFDFREIVAGVAAGR
YGADPAVFKALFLGFMFAFAIKAPLWPFHRWLPDAAVESTPATAVLMMAVMDKVGTFGMLRYCLQLFPDPSTYFRPLIVT
LAIIGVIYGAIVAIGQTDMMRLIAYTSISHFGFIIAGIFVMTTQGQSGSTLYMLNHGLSTAAVFLIAGFLIARRGSRSIA
DYGGVQKVAPILAGTFMVSAMATVSLPGLAPFISEFLVLLGTFSRYWLAAAFGVTALVLSAVYMLWLYQRVMTGPVAEGN
ERIGDLVGREMIVVAPLIALLLVLGVYPKPVLDIINPAVENTMTTIGQHDPAPSVAHPVPAVGASRTAEGPHP
>P0AFF0 7.1.1.-~~~nuoN~~~NADH-quinone oxidoreductase subunit N~~~COG1007
MTITPQNLIALLPLLIVGLTVVVVMLSIAWRRNHFLNATLSVIGLNAALVSLWFVGQAGAMDVTPLMRVDGFAMLYTGLV
LLASLATCTFAYPWLEGYNDNKDEFYLLVLIAALGGILLANANHLASLFLGIELISLPLFGLVGYAFRQKRSLEASIKYT
ILSAAASSFLLFGMALVYAQSGDLSFVALGKNLGDGMLNEPLLLAGFGLMIVGLGFKLSLVPFHLWTPDVYQGAPAPVST
FLATASKIAIFGVVMRLFLYAPVGDSEAIRVVLAIIAFASIIFGNLMALSQTNIKRLLGYSSISHLGYLLVALIALQTGE
MSMEAVGVYLAGYLFSSLGAFGVVSLMSSPYRGPDADSLFSYRGLFWHRPILAAVMTVMMLSLAGIPMTLGFIGKFYVLA
VGVQAHLWWLVGAVVVGSAIGLYYYLRVAVSLYLHAPEQPGRDAPSNWQYSAGGIVVLISALLVLVLGVWPQPLISIVRL
AMPLM
>P9WIW9 7.1.1.-~~~nuoN~~~NADH-quinone oxidoreductase subunit N~~~COG1007
MILPAPHVEYFLLAPMLIVFSVAVAGVLAEAFLPRRWRYGAQVTLALGGSAVALIAVIVVARSIHGSGHAAVLGAIAVDR
ATLFLQGTVLLVTIMAVVFMAERSARVSPQRQNTLAVARLPGLDSFTPQASAVPGSDAERQAERAGATQTELFPLAMLSV
GGMMVFPASNDLLTMFVALEVLSLPLYLMCGLARNRRLLSQEAAMKYFLLGAFSSAFFLYGVALLYGATGTLTLPGIRDA
LAARTDDSMALAGVALLAVGLLFKVGAVPFHSWIPDVYQGAPTPITGFMAAATKVAAFGALLRVVYVALPPLHDQWRPVL
WAIAILTMTVGTVTAVNQTNVKRMLAYSSVAHVGFILTGVIADNPAGLSATLFYLVAYSFSTMGAFAIVGLVRGADGSAG
SEDADLSHWAGLGQRSPIVGVMLSMFLLAFAGIPLTSGFVSKFAVFRAAASAGAVPLVIVGVISSGVAAYFYVRVIVSMF
FTEESGDTPHVAAPGVLSKAAIAVCTVVTVVLGIAPQPVLDLADQAAQLLR
>A1B479 7.1.1.-~~~nuoN~~~NADH-quinone oxidoreductase subunit N~~~COG1007
MTSLDFSTILPEVVLAGYALAALMAGAYLGKDRLARTLLWVTVAAFLVVAAMVGLGNHVDGAAFHGMFIDDGFSRFAKVV
TLVAAAGVLAMSADYMQRRNMLRFEFPIIVALAVLGMMFMVSAGDLLTLYMGLELQSLALYVVAAMRRDSVRSSEAGLKY
FVLGSLSSGLLLYGASLVYGFAGTTGFEGIISTIEAGHLSLGVLFGLVFMLVGLSFKVSAVPFHMWTPDVYEGSPTPVTA
FFATAPKVAAMALIARLVFDAFGHVIGDWSQIVAALAVMSMFLGSIAGIGQTNIKRLMAYSSIAHMGFALVGLAAGTAIG
VQNMLLYMTIYAVMNIGTFAFILSMERDGVPVTDLAALNRFAWTDPVKALAMLVLMFSLAGVPPTLGFFAKFGVLTAAVD
AGMGWLAVLGVIASVIGAFYYLRIVYYMYFGGESEGMTSRMGAVQYLALMVPALAMLVGAISMFGVDSAAGRAAETLVGP
VAAIEQPAEAAQAEPVQGE
>A2RKA7 7.6.2.-~~~nupA~~~Nucleoside import ATP-binding protein NupA~~~COG3845
MANETVIQMIDVTKRFGDFVANDKVNLELKKGEIHALLGENGAGKSTLMNILSGLLEPSEGEVHVKGKLENIDSPSKAAN
LGIGMVHQHFMLVDAFTVTENIILGNEVTKGINLDLKTAKKKILELSERYGLSVEPDALIRDISVGQQQRVEILKTLYRG
ADILIFDEPTAVLTPAEITELMQIMKNLIKEGKSIILITHKLDEIRAVADRITVIRRGKSIDTVELGDKTNQELAELMVG
RSVSFITEKAAAQPKDVVLEIKDLNIKESRGSLKVKGLSLDVRAGEIVGVAGIDGNGQTELVKAITGLTKVDSGSIKLHN
KDITNQRPRKITEQSVGHVPEDRHRDGLVLEMTVAENIALQTYYKPPMSKYGFLDYNKINSHARELMEEFDVRGAGEWVS
ASSLSGGNQQKAIIAREIDRNPDLLIVSQPTRGLDVGAIEYIHKRLIQARDEGKAVLVISFELDEILNVSDRIAVIHDGQ
IQGIVSPETTTKQELGILMVGGNINE
>A2RKA6 ~~~nupB~~~Nucleoside ABC transporter permease protein NupB~~~COG4603
MNNKTRKVLVPLIAIVFGFLLGAIIMLAFGYNPIWGYEDLFISALGSARSIGETLQTMGPLILTALSFAVAMKVGLFNIG
MSGQALAGWISSMWFALSFPDIPRLLMIPLVVIIGMVFGAFMGFIPGILRALLGTSEVITTIMLNYIMLFFSTFMIHSMF
QKNILMDNTTDQTKLISANASFRTNWMSSLTDNSTLNIGLIIAIIALVIMAIIFTKTTLGFEIKAVGLNPDASEYAGISA
KRTLILSMVVAGALAGLGGVVYGFGYMQNFVSQSASLDIGFYGMAVALLGGNSPIGILFAALLFSVLQTGAPGMTNDGIP
PEIVKVVTAAIIFFIAVKFIIEVMLPKAKAIKASEATKKKGEKA
>P39141 ~~~nupC~~~Nucleoside permease NupC~~~COG1972
MKYLIGIIGLIVFLGLAWIASSGKKRIKIRPIVVMLILQFILGYILLNTGIGNFLVGGFAKGFGYLLEYAAEGINFVFGG
LVNADQTTFFMNVLLPIVFISALIGILQKWKVLPFIIRYIGLALSKVNGMGRLESYNAVASAILGQSEVFISLKKELGLL
NQQRLYTLCASAMSTVSMSIVGAYMTMLKPEYVVTALVLNLFGGFIIASIINPYEVAKEEDMLRVEEEEKQSFFEVLGEY
ILDGFKVAVVVAAMLIGFVAIIALINGIFNAVFGISFQGILGYVFAPFAFLVGIPWNEAVNAGSIMATKMVSNEFVAMTS
LTQNGFHFSGRTTAIVSVFLVSFANFSSIGIIAGAVKGLNEKQGNVVARFGLKLLYGATLVSFLSAAIVGLIY
>P0AFF2 ~~~nupC~~~Nucleoside permease NupC~~~COG1972
MDRVLHFVLALAVVAILALLVSSDRKKIRIRYVIQLLVIEVLLAWFFLNSDVGLGFVKGFSEMFEKLLGFANEGTNFVFG
SMNDQGLAFFFLKVLCPIVFISALIGILQHIRVLPVIIRAIGFLLSKVNGMGKLESFNAVSSLILGQSENFIAYKDILGK
ISRNRMYTMAATAMSTVSMSIVGAYMTMLEPKYVVAALVLNMFSTFIVLSLINPYRVDASEENIQMSNLHEGQSFFEMLG
EYILAGFKVAIIVAAMLIGFIALIAALNALFATVTGWFGYSISFQGILGYIFYPIAWVMGVPSSEALQVGSIMATKLVSN
EFVAMMDLQKIASTLSPRAEGIISVFLVSFANFSSIGIIAGAVKGLNEEQGNVVSRFGLKLVYGSTLVSVLSASIAALVL
>A2RKA5 ~~~nupC~~~Nucleoside ABC transporter permease protein NupC~~~COG1079
MNVVNTLQIIVANMLIYSTPLIFTSIGGVFSERGGIVNVGLEGIMTIGAFSSVVFNLTTAGMFGSMTPWLSILFGALIGA
LFSSLHAVATVNLRADHIVSGTVLNLMAPALGVFLLQVFYQQGQININEQIGYWNVPLLSNIPVIGKIFFTQTSLPGFLA
IVVAILAWYVLFKTRFGLRLRSVGENPQAADTLGINVYAYRWAGVLLSGVLGGVGGAIYAQAISGNFSVSTIAGQGFISL
AAMIFGKWNPIGAMLSSLLFGLFTSLAVVGGQIPGIKEIPSSFLQMAPYVFTIIVLALFLGKAIAPKADGVNYIKSK
>P42312 ~~~nupG~~~Purine nucleoside transport protein NupG~~~COG1972
MYFLLNLVGLIVIMAVVFLCSPQKKKIKWRPIITLIVLELLITWFMLGTKVGSWAIGKIGDFFTWLIACASDGIAFAFPS
VMANETVDFFFSALLPIIFIVTFFDILTYFGILPWLIDKIGWVISKASRLPKLESFFSIQMMFLGNTEALAVIRQQLTVL
SNNRLLTFGLMSMSSISGSIIGSYLSMVPATYVFTAIPLNCLNALIIANLLNPVHVPEDEDIIYTPPKEEKKDFFSTISN
SMLVGMNMVIVILAMVIGYVALTSAVNGILGVFVHGLTIQTIFAYLFSPFAFLLGLPVHDAMYVAQLMGMKLATNEFVAM
LDLKNNLKSLPPHTVAVATTFLTSFANFSTVGMIYGTYNSILDGEKSTVIGRNVWKLLVSGIAVSLLSAAIVGLFVW
>P0AFF4 ~~~nupG~~~Nucleoside permease NupG~~~COG2211
MNLKLQLKILSFLQFCLWGSWLTTLGSYMFVTLKFDGASIGAVYSSLGIAAVFMPALLGIVADKWLSAKWVYAICHTIGA
ITLFMAAQVTTPEAMFLVILINSFAYMPTLGLINTISYYRLQNAGMDIVTDFPPIRIWGTIGFIMAMWVVSLSGFELSHM
QLYIGAALSAILVLFTLTLPHIPVAKQQANQSWTTLLGLDAFALFKNKRMAIFFIFSMLLGAELQITNMFGNTFLHSFDK
DPMFASSFIVQHASIIMSISQISETLFILTIPFFLSRYGIKNVMMISIVAWILRFALFAYGDPTPFGTVLLVLSMIVYGC
AFDFFNISGSVFVEKEVSPAIRASAQGMFLMMTNGFGCILGGIVSGKVVEMYTQNGITDWQTVWLIFAGYSVVLAFAFMA
MFKYKHVRVPTGTQTVSH
>O05252 ~~~nupN~~~ABC transporter guanosine-binding protein NupN~~~COG1744
MNKRKIGLAMSLVIAAGTILGACGNSEKSSGSGEGKNKFSVAMVTDVGGVDDKSFNQSAWEGIQAFGKENGLKKGKNGYD
YLQSKSDADYTTNLNKLARENFDLIYGVGYLMEDSISEIADQRKNTNFAIIDAVVDKDNVASITFKEQEGSFLVGVAAAL
SSKSGKIGFVGGMESELIKKFEVGFRAGVQAVNPKAVVEVKYAGGFDKADVGKATAESMYKSGVDVIYHSAGATGTGVFT
EAKNLKKEDPKRDVWVIGVDKDQYAEGQVEGTDDNVTLTSMVKKVDTVVEDVTKKASDGKFPGGETLTYGLDQDGVGISP
SKQNLSDDVIKAVDKWKKKIIDGLEIPATEKELKTFKAE
>O05253 7.6.2.-~~~nupO~~~Guanosine import ATP-binding protein NupO~~~COG3845
MEYVIEMLNIRKAFPGIVANDNINLQVKKGEIHALLGENGAGKSTLMNVLFGLYQPERGEIRVRGEKVHINSPNKANDLG
IGMVHQHFMLVDTFTVAENIILGKEPKKFGRIDRKRAGQEVQDISDRYGLQIHPEAKAADISVGMQQRAEILKTLYRGAD
ILIFDEPTAVLTPHEIKELMQIMKNLVKEGKSIILITHKLKEIMEICDRVTVIRKGKGIKTLDVRDTNQDELASLMVGRE
VSFKTEKRAAQPGAEVLAIDGITVKDTRGIETVRDLSLSVKAGEIVGIAGVDGNGQSELIEAVTGLRKTDSGTITLNGKQ
IQNLTPRKITESGIGHIPQDRHKHGLVLDFPIGENILLQSYYKKPYSALGVLHKGEMYKKARSLITEYDVRTPDEYTHAR
ALSGGNQQKAIIGREIDRNPDLLIAAQPTRGLDVGAIEFVHKKLIEQRDAGKAVLLLSFELEEIMNLSDRIAVIFEGRII
ASVNPQETTEQELGLLMAGSTQKEAGKANG
>O05254 ~~~nupP~~~Guanosine ABC transporter permease protein NupP~~~COG4603
MVKRLSHLLVPLIAIILGLAAGALIMLVSGYSVASGYSALWNGIFGEIYYVGETIRQITPYILSGLAVAFAFRTGLFNIG
VEGQLLVGWTAAVWVGTAFDGPAYIHLPLALITAAAAGGLWGFIPGILKARFYVHEVIVTIMMNYIALHMTNYIISNVLT
DHQDKTGKIHESASLRSPFLEQITDYSRLHLGIIVALLAAVIMWFIINKSTKGFELRAVGFNQHASQYAGMSVRKNIMTS
MLISGAFAGLAGAMEGLGTFEYAAVKGAFTGVGFDGIAVALLGGNTAVGVVLAACLLGGLKIGALNMPIESGVPSEVVDI
VIAIIILFVASSYAIRFVMGKLKKKGAN
>O05255 ~~~nupQ~~~Guanosine ABC transporter permease protein NupQ~~~COG1079
MDIVQILSIIVPATLVYAAPLILTALGGVFSERSGVVNIGLEGLMIIGAFTSVLFNLFFGQELGAAAPWLSLLAAMAAGA
LFSLIHAAAAISFRADQTVSGVAINMLALGATLFIVKLIYGKAQTDKIPEPFYKTKIPGLGDIPVLGKIFFSDVYYTSIL
AIALAFISWFILFKTPFGLRIRSVGEHPMAADTMGINVYKMRYIGVMISGLFGGLGGGVYASTIALDFTHSTISGQGFIA
LAALVFGKWHPIGALGAALFFGFAQSLSIIGSLLPLFKDIPNVYMLMAPYILTILALTGFIGRADAPKANGVPYIKGKR
>P33021 ~~~nupX~~~Putative nucleoside permease NupX~~~COG1972
MDVMRSVLGMVVLLTIAFLLSVNKKKISLRTVGAALVLQVVIGGIMLWLPPGRWVAEKVAFGVHKVMAYSDAGSAFIFGS
LVGPKMDTLFDGAGFIFGFRVLPAIIFVTALVSILYYIGVMGILIRILGGIFQKALNISKIESFVAVTTIFLGQNEIPAI
VKPFIDRLNRNELFTAICSGMASIAGSTMIGYAALGVPVEYLLAASLMAIPGGILFARLLSPATESSQVSFNNLSFTETP
PKSIIEAAATGAMTGLKIAAGVATVVMAFVAIIALINGIIGGVGGWFGFEHASLESILGYLLAPLAWVMGVDWSDANLAG
SLIGQKLAINEFVAYLNFSPYLQTAGTLDAKTVAIISFALCGFANFGSIGVVVGAFSAVAPHRAPEIAQLGLRALAAATL
SNLMSATIAGFFIGLA
>P32727 ~~~nusA~~~Transcription termination/antitermination protein NusA~~~COG0195
MSSELLDALTILEKEKGISKEIIIEAIEAALISAYKRNFNQAQNVRVDLNRETGSIRVFARKDVVDEVYDQRLEISIEEA
QGIHPEYMVGDVVEIEVTPKDFGRIAAQTAKQVVTQRVREAERGVIYSEFIDREEDIMTGIVQRLDNKFIYVSLGKIEAL
LPVNEQMPNESYKPHDRIKVYITKVEKTTKGPQIYVSRTHPGLLKRLFEIEVPEIYDGTVELKSVAREAGDRSKISVRTD
DPDVDPVGSCVGPKGQRVQAIVNELKGEKIDIVNWSSDPVEFVANALSPSKVLDVIVNEEEKATTVIVPDYQLSLAIGKR
GQNARLAAKLTGWKIDIKSETDARELGIYPRELEEDDEPLFTEPETAESDE
>Q83BS0 ~~~nusA~~~Transcription termination/antitermination protein NusA~~~COG0195
MNKDILLIVDSMSNERGVSKEVIFEAIEAALAAVTAKRYEEDDVKIRVAIDQKTGDYESFRCWTVVEDTNESLEFPNQEM
TLKQAREIDSDLEVGDVIEEPVESVKFGRIAVQQAKQVIVQKVREAERAKIIRQYEKRVGELVIGVVKRVTRESIILDMG
ENAEALLLREEMIPREAFRINDRLRAYLYSVCQDKKRGPQLLVSRTRPEFLVELFKIEVPEIGEEVIEIKGAARDPGSRA
KIAVKTNDGRIDPIGACVGMRGSRVQAVSNELGGERIDIVLWDDNPAQLVINAMAPAEVASIVVDEDSHTMDIAVNKDQL
SQAIGRSGQNVRLASELTGWTLNVMSEAEMAQKHEKEAGKIKTAFMEKLDVDEEVADALVQAGFMNLEEVAYVPKEELQG
VEGFDEDISAELQRRAGDVLLTQEIAKQELDEKKPAEDLLTLPGMTTELARQLVENEVLTRDDLAEKSVLDLKEIIEIDD
EAAANLIMAARAHWFAEEESEKS
>P0AFF6 ~~~nusA~~~Transcription termination/antitermination protein NusA~~~COG0195
MNKEILAVVEAVSNEKALPREKIFEALESALATATKKKYEQEIDVRVQIDRKSGDFDTFRRWLVVDEVTQPTKEITLEAA
RYEDESLNLGDYVEDQIESVTFDRITTQTAKQVIVQKVREAERAMVVDQFREHEGEIITGVVKKVNRDNISLDLGNNAEA
VILREDMLPRENFRPGDRVRGVLYSVRPEARGAQLFVTRSKPEMLIELFRIEVPEIGEEVIEIKAAARDPGSRAKIAVKT
NDKRIDPVGACVGMRGARVQAVSTELGGERIDIVLWDDNPAQFVINAMAPADVASIVVDEDKHTMDIAVEAGNLAQAIGR
NGQNVRLASQLSGWELNVMTVDDLQAKHQAEAHAAIDTFTKYLDIDEDFATVLVEEGFSTLEELAYVPMKELLEIEGLDE
PTVEALRERAKNALATIAQAQEESLGDNKPADDLLNLEGVDRDLAFKLAARGVCTLEDLAEQGIDDLADIEGLTDEKAGA
LIMAARNICWFGDEA
>P75591 ~~~nusA~~~Transcription termination/antitermination protein NusA~~~
MNNQSNHFTNPLLQLIKNVAETKNLAIDDVVLCLKTALAQTYKKHLNYVNVEVNIDFNKGLMQIEQLFDVVDDNNEDYDD
FLEMPLSEAKKLNPNLEVGGVLRKPVSLKDIKGDLISKMVLLFNQKINETAFKTVMSDFINEVGQVIEARVEDIDTNKDG
GLKGYIVNLETTKGYMPKRELSKGEKLDIGKKYLFVIKEIQKQSSMWPITLSRSDSRLLEFLLNSNTPEIANGTIEIKKM
ERSPGTKSKVAVISKDPVVDPIAAILGPKGERIRGISEEFNGEIIDIVIWNEDKLKFLVNAVLPAEVVGYNILQDDERDT
SIEIVVPANQIANVFGFKGINIRLISNLTGWSSVDVYTEKDAAEQGIEFTRVNFQPQGIFGIKKRRDKISNNPRNNNQQL
ASDKVFYTSKANVVDDEIIVDLAKQAEAKRVKQIKQEATKPELQLQQELNLEATPKVAAPTPTPAPQPTPAPTKVEPVPP
PVSVTPKPIPKVNKPKPVVKPKSVFSITVEADDSKTKPEKSSAKTNTPQTKQTFDNFDDL
>P9WIV3 ~~~nusA~~~Transcription termination/antitermination protein NusA~~~COG0195
MNIDMAALHAIEVDRGISVNELLETIKSALLTAYRHTQGHQTDARIEIDRKTGVVRVIARETDEAGNLISEWDDTPEGFG
RIAATTARQVMLQRFRDAENERTYGEFSTREGEIVAGVIQRDSRANARGLVVVRIGTETKASEGVIPAAEQVPGESYEHG
NRLRCYVVGVTRGAREPLITLSRTHPNLVRKLFSLEVPEIADGSVEIVAVAREAGHRSKIAVRSNVAGLNAKGACIGPMG
QRVRNVMSELSGEKIDIIDYDDDPARFVANALSPAKVVSVSVIDQTARAARVVVPDFQLSLAIGKEGQNARLAARLTGWR
IDIRGDAPPPPPGQPEPGVSRGMAHDR
>O66530 ~~~nusB~~~Transcription antitermination protein NusB~~~COG0781
MRYRKGARDTAFLVLYRWDLRGENPGELFKEVVEEKNIKNKDAYEYAKKLVDTAVRHIEEIDSIIEKHLKGWSIDRLGYV
ERNALRLGVAELIFLKSKEPGRVFIDIVDLVKKYADEKAGKFVNGVLSAIYKAYITSSKEEKPSLKSE
>Q2SYC5 ~~~nusB~~~Transcription antitermination protein NusB~~~
MKKSARRQSRELATQGLYQWLLSNAAPGEIDAQLRGALGYDKADKTLLDTILHGVIREHATLAEAISPSLDRPIDQLSPV
ERAVLLIATYELTHQIETPYRVIINEAVELAKTFGGSDGYKYVNGVLDKLAVKLRPAETQARRGA
>P0A780 ~~~nusB~~~Transcription antitermination protein NusB~~~COG0781
MKPAARRRARECAVQALYSWQLSQNDIADVEYQFLAEQDVKDVDVLYFRELLAGVATNTAYLDGLMKPYLSRLLEELGQV
EKAVLRIALYELSKRSDVPYKVAINEAIELAKSFGAEDSHKFVNGVLDKAAPVIRPNKK
>P9WIV1 ~~~nusB~~~Transcription antitermination protein NusB~~~COG0781
MSDRKPVRGRHQARKRAVALLFEAEVRGISAAEVVDTRAALAEAKPDIARLHPYTAAVARGVSEHAAHIDDLITAHLRGW
TLDRLPAVDRAILRVSVWELLHAADVPEPVVVDEAVQLAKELSTDDSPGFVNGVLGQVMLVTPQLRAAAQAVRGGA
>P65578 ~~~nusB~~~Transcription antitermination protein NusB~~~
MSRKESRVQAFQTLFQLEMKDSDLTINEAISFIKDDNPDLDFEFIHWLVSGVKDHEPVLDETISPYLKDWTIARLLKTDR
IILRMATYEILHSDTPAKVVMNEAVELTKQFSDDDHYKFINGVLSNIKK
>P65582 ~~~nusB~~~Transcription antitermination protein NusB~~~COG0781
MTSPLLESRRQLRKCAFQALMSLEFGTDVETACRFAYTHDREDTDVQLPAFLIDLVSGVQAKKEELDKQITQHLKAGWTI
ERLTLVERNLLRLGVFEITSFDTPQLVAVNEAIELAKDFSDQKSARFINGLLSQFVTEEQ
>Q9X286 ~~~nusB~~~Transcription antitermination protein NusB~~~COG0781
MKTPRRRMRLAVFKALFQHEFRRDEDLEQILEEILDETYDKKAKEDARRYIRGIKENLSMIDDLISRYLEKWSLNRLSVV
DRNVLRLATYELLFEKDIPIEVTIDEAIEIAKRYGTENSGKFVNGILDRIAKEHAPKEKFEL
>O67757 ~~~nusG~~~Transcription termination/antitermination protein NusG~~~COG0250
MSEQQVQELEKKWYALQVEPGKENEAKENLLKVLELEGLKDLVDEVIVPAEEKVVIRAQGKEKYRLSLKGNARDISVLGK
KGVTTFRIENGEVKVVESVEGDTCVNAPPISKPGQKITCKENKTEAKIVLDNKIFPGYILIKAHMNDKLLMAIEKTPHVF
RPVMVGGKPVPLKEEEVQNILNQIKRGVKPSKVEFEKGDQVRVIEGPFMNFTGTVEEVHPEKRKLTVMISIFGRMTPVEL
DFDQVEKI
>Q06795 ~~~nusG~~~Transcription termination/antitermination protein NusG~~~COG0250
MEKNWYVVHTYSGYENKVKANLEKRVESMGMQDKIFRVVVPEEEETDIKNGKKKVVKKKVFPGYVLVEIVMTDDSWYVVR
NTPGVTGFVGSAGSGSKPTPLLPGEAETILKRMGMDERKTDIDFELKETVKVIDGPFANFTGSIEEIDYDKSKVKVFVNM
FGRETPVELEFTQIDKL
>P0AFG1 ~~~nusG~~~Transcription termination/antitermination protein NusG~~~COG0250
MSEAPKKRWYVVQAFSGFEGRVATSLREHIKLHNMEDLFGEVMVPTEEVVEIRGGQRRKSERKFFPGYVLVQMVMNDASW
HLVRSVPRVMGFIGGTSDRPAPISDKEVDAIMNRLQQVGDKPRPKTLFEPGEMVRVNDGPFADFNGVVEEVDYEKSRLKV
SVSIFGRATPVELDFSQVEKA
>P0AFG0 ~~~nusG~~~Transcription termination/antitermination protein NusG~~~COG0250
MSEAPKKRWYVVQAFSGFEGRVATSLREHIKLHNMEDLFGEVMVPTEEVVEIRGGQRRKSERKFFPGYVLVQMVMNDASW
HLVRSVPRVMGFIGGTSDRPAPISDKEVDAIMNRLQQVGDKPRPKTLFEPGEMVRVNDGPFADFNGVVEEVDYEKSRLKV
SVSIFGRATPVELDFSQVEKA
>P75049 ~~~nusG~~~Transcription termination/antitermination protein NusG~~~
MEQVELIPQWYVAPVSVKDEAVVRNLKAKVKALGFDNEILDVRVLKEREVIEEVFSLKSGKLPRSLKNTAFTKWFVLDED
RYLKVKISEKNLLGRYIYIKMIYSEDAWRIIRNFPGITGIVGSSGRGALPTPLDQADADNLEQMLKGISVNPKKRVLVTN
TAIVEMDADKFDEKCQYILKHKQVKPEAIAQVNESGEIIDTNQFAQALMEANKAEQDEWNEDVAIVKSEANKVDPSVLIP
YLGKYEIVEGDTKVDQLQQFSVGNLVEVHLTGAIHIQGQIKALYQGTINKAVVEVELTTKTQLINLPLENLSFIEVEQSH
>P9WIU9 ~~~nusG~~~Transcription termination/antitermination protein NusG~~~COG0250
MTTFDGDTSAGEAVDLTEANAFQDAAAPAEEVDPAAALKAELRSKPGDWYVVHSYAGYENKVKANLETRVQNLDVGDYIF
QVEVPTEEVTEIKNGQRKQVNRKVLPGYILVRMDLTDDSWAAVRNTPGVTGFVGATSRPSALALDDVVKFLLPRGSTRKA
AKGAASTAAAAEAGGLERPVVEVDYEVGESVTVMDGPFATLPATISEVNAEQQKLKVLVSIFGRETPVELTFGQVSKI
>P0A096 ~~~nusG~~~Transcription termination/antitermination protein NusG~~~
MSEEVGAKRWYAVHTYSGYENKVKKNLEKRVESMNMTEQIFRVVIPEEEETQVKDGKAKTTVKKTFPGYVLVELIMTDES
WYVVRNTPGVTGFVGSAGAGSKPNPLLPEEVRFILKQMGLKEKTIDVELEVGEQVRIKSGPFANQVGEVQEIETDKFKLT
VLVDMFGRETPVEVEFDQIEKL
>Q5XE43 ~~~nusG~~~Transcription termination/antitermination protein NusG~~~
MLDSFDKGWFVLQTYSGYENKVKENLLQRAQTYNMLDNILRVEIPTQTVNVEKNGQTKEIEENRFPGYVLVEMVMTDEAW
FVVRNTPNVTGFVGSHGNRSKPTPLLEEEIRAILLSMGQTIDVFDTNIKEGDVVQIIDGAFMGQEGRVVEIENNKVKLML
NMFGSETVAEVELYQIAEL
>P27309 ~~~nusG~~~Transcription termination/antitermination protein NusG~~~
MSDPNLNASHDSVESVEDELDIVEAADAVDPDEAELADAEAGAPAEEAALHVESDEDEDEADVEVDAAVEEAADDAEVAE
EEAEEAAPVEPAEPVDPIQALREELRLLPGEWYVIHTYAGYEKRVKANLEQRAVSLNVEEFIYQAEVPEEEIVQIKNGER
KNVRQNKLPGYVLVRMDLTNESWGVVRNTPGVTGFVGNAYDPYPLTLDEIVKMLAPEAQEKAAKAAAEEAGLPAPAVKRT
IEVLDFEVGDSVTVTDGPFATLQATINEINPDSKKVKGLVEIFGRETPVELSFDQIQKN
>P29397 ~~~nusG~~~Transcription termination/antitermination protein NusG~~~COG0250
MKKKWYIVLTMSGYEEKVKENIEKKVEATGIKNLVGRIVIPEEVVLDATSPSERLILSPKAKLHVNNGKDVNKGDLIAEE
PPIYARRSGVIVDVKNVRKIVVETIDRKYTKTYYIPESAGIEPGLRVGTKVKQGLPLSKNEEYICELDGKIVEIERMKKV
VVQTPDGEQDVYYIPLDVFDRDRIKKGKEVKQGEMLAEARKFFAKVSGRVEVVDYSTRKEIRIYKTKRRKLFPGYVFVEM
IMNDEAYNFVRSVPYVMGFVSSGGQPVPVKDREMRPILRLAGLEEYEEKKKPVKVELGFKVGDMVKIISGPFEDFAGVIK
EIDPERQELKVNVTIFGRETPVVLHVSEVEKIE
>P35872 ~~~nusG~~~Transcription termination/antitermination protein NusG~~~COG0250
MSIEWYAVHTLVGQEEKAKANLEKRIKAFGLQDKIFQVLIPTEEVVELREGGKKEVVRKKLFPGYLFIQMDLGDEEEPNE
AWEVVRGTPGITGFVGAGMRPVPLSPDEVRHILEVSGLLGKKEAPKAQVAFREGDQVRVVSGPFADFTGTVTEINPERGK
VKVMVTIFGRETPVELDFSQVVKA
>P13398 3.5.2.12~~~nylA~~~6-aminohexanoate-cyclic-dimer hydrolase~~~
MSKVDLWQDATAQAELVRSGEISRTELLEATIAHVQAVNPEINAVIIPLFEKARRESELASGPFAGVPYLLKDLTVVSQG
DINTSSIKGMKESGYRADHDAYFVQRMRAAGFVLLGKTNTPEMGNQVTTEPEAWGATRNPWNLGRSVGGSSGGSGAAVAA
ALSPVAHGNDAAGSVRIPASVCGVVGLKPTRGRISPGPLVTDSDNVAGAAHEGLFARSVRDIAALLDVVSGHRPGDTFCA
PTASRPYAQGISENPGSLRVGVLTHNPVGDFALDPECAAAARGAAAALAALGHDVNDAYPEALGDRSFLKDYSTICDVAI
AREIERNGELIGRPLTEDDVEWTSWEMVKRADQVTGRAFAACVDELRYYAGKVERWWEAGWDLLILPTVTRQTPEIGELM
LAKGTDLEGRQSAFISGSLQMLAFTVPFNVSGQPAISLPIGMSSDGMPIGVQIVAAYGREDLLLQVAAQLEGALPWVARR
PQLLNPSRKIPAA
>P07062 3.5.1.46~~~nylB'~~~6-aminohexanoate-dimer hydrolase~~~
MNTPTTGSHPARYPSAAAGEPTLDSWQEPPHNRWAFAHLGEMVPSAAVSRRPVNAPGHALARLGAIAAQLPDLEQRLEQT
YTDAFLVLRGTEVVAEYYRAGFAPDDRHLLMSVSKSLCGTVVGALVDEGRIDPAQPVTEYVPELAGSVYDGPSVLQVLDM
QISIDYNEDYVDPASEVQTHGRSAGWRTRATGDPADTYEFLTTLRGDGSTGEFQYCSANTDVLAWIVERVTGLRYVEALS
TYLWAKLDADRDATITVDTTGFGFAHGGVSCTARDLARVGRMMLDGGVAPGGRVVSEDWVRRVLAGGSHEAMTDKGFTNT
FPDGSYTRQWWCTGNERGNVSGIGIHGQNLWLDPLTDSVIVKLSSWPDPDTEHWHRLQNGILLDVSRALDAV
>P07061 3.5.1.46~~~nylB~~~6-aminohexanoate-dimer hydrolase~~~
MNARSTGQHPARYPGAAAGEPTLDSWQEAPHNRWAFARLGELLPTAAVSRRDPATPAEPVVRLDALATRLPDLEQRLEET
CTDAFLVLRGSEVLAEYYRAGFAPDDRHLLMSVSKSLCGTVVGALIDEGRIDPAQPVTEYVPELAGSVYDGPSVLQVLDM
QISIDYNEDYVDPASEVQTHDRSAGWRTRRDGDPADTYEFLTTLRGDGGTGEFQYCSANTDVLAWIVERVTGLRYVEALS
TYLWAKLDADRDATITVDQTGFGFANGGVSCTARDLARVGRMMLDGGVAPGGRVVSQGWVESVLAGGSREAMTDEGFTSA
FPEGSYTRQWWCTGNERGNVSGIGIHGQNLWLDPRTDSVIVKLSSWPDPDTRHWHGLQSGILLDVSRALDAV
>Q1EPR5 3.5.1.117~~~nylC~~~6-aminohexanoate-oligomer endohydrolase~~~
MNTTPVHALTDIDGGIAVDPAPRLAGPPVFGGPGNDAFDLAPVRSTGREMLRFDFPGVSIGAAHYEEGPTGATVIHIPAG
ARTAVDARGGAVGLSGGYDFNHAICLAGGASYGLEAGAGVSGALLERLEYRTGFAEAQLVSSAVIYDFSARSTAVYPDKA
LGRAALEFAVPGEFPQGRAGAGMSASAGKVDWDRTEITGQGAAFRRLGDVRILAVVVPNPVGVIMDRAGTVVRGNYDAQT
GVRRHPVFDYQEAFAEQVPPVTEAGNTTISAIVTNVRMSPVELNQFAKQVHSSMHRGIQPFHTDMDGDTLFAVTTDEIDL
PTTPGSSRGRLSVNATALGAIASEVMWDAVLEAGK
>Q79F77 3.5.1.117~~~nylC~~~6-aminohexanoate-oligomer endohydrolase~~~
MNTTPVHALTDIDGGIAVDPAPRLAGPPVFGGPGNDAFDLAPVRSTGREMLRFDFPGVSIGAAHYEEGPTGATVIHIPAG
ARTAVDARGGAVGLSGGYDFNHAICLAGGAGYGLEAGAGVSDALLERLEHRTGFAELQLVSSAVIYDFSARSTAVYPDKA
LGRAALEFAVPGEFPQGRAGAGMSASAGKVDWDRTEITGQGAAFRRLGDVRILAVVVPNPVGVIVDRAGTVVRGNYDAQT
GVRRHPVFDYQEAFAEQVPPVTEAGNTTISAIVTNVRMSPVELNQFAKQVHSSMHRGIQPFHTDMDGDTLFAVTTDEIDL
PTTPGSSRGRLSVNATALGAIASEVMWDAVLEAGK
>Q1EPR4 3.5.1.117~~~nylC~~~6-aminohexanoate-oligomer endohydrolase~~~
MNTTPVHALTDIDGGIAVDPAPRLAGPPVFGGPGNAAFDLVPVRSTGRETLRFDFPGVSVGSAHYEEGPTGATVIHIPAG
ARTAVDARGGAVGLSGGYDFNHAICLAGGASYGLEAGAGVSGALLERLEYRTGFAEAQLVSSAVIYDFSARSTAVYPDKA
LGRAALEFAVPGEFPQGRAGAGMSASAGKVDWDRTEITGQGAAFRRLGDVRILAVVVPNPVGVIMDRAGGIVRGNYDAQT
GVRRHPVFDYQEAFAEQLPPVTQAGNTTISAIVTNVRMSPVELNQFAKQVHSSMHRGIQPFHTDMDGDTLFAVTTDEIDL
PTTPGSSRGRLSVNATALGAIASEVMWDAVLEAAK
>O06994 3.2.1.10~~~malL~~~Oligo-1,6-glucosidase 1~~~COG0366
MSEWWKEAVVYQIYPRSFYDANGDGFGDLQGVIQKLDYIKNLGADVIWLSPVFDSPQDDNGYDISDYKNMYEKFGTNEDM
FQLIDEVHKRGMKIVMDLVVNHTSDEHAWFAESRKSKDNPYRDYYLWKDPKPDGSEPNNWGSIFSGSAWTYDEGTGQYYL
HYFSKKQPDLNWENEAVRREVYDVMRFWMDRGVDGWRMDVIGSISKYTDFPDYETDHSRSYIVGRYHSNGPRLHEFIQEM
NREVLSHYDCMTVGEANGSDIEEAKKYTDASRQELNMIFTFEHMDIDKEQNSPNGKWQIKPFDLIALKKTMTRWQTGLMN
VGWNTLYFENHDQPRVISRWGNDRKLRKECAKAFATVLHGMKGTPFIYQGEEIGMVNSDMPLEMYDDLEIKNAYRELVVE
NKTMSEKEFVKAVMIKGRDHARTPMQWDAGKHAGFTAGDPWIPVNSRYQDINVKESLEDQDSIFFYYQKLIQLRKQYKIM
IYGDYQLLQENDPQVFSYLREYRGEKLLVVVNLSEEKALFEAPPELIHERWKVLISNYPQERADLKSISLKPYEAVMGIS
I
>O34364 3.2.1.10~~~ycdG~~~Probable oligo-1,6-glucosidase 2~~~COG0366
MKTDWWKDAVVYQIYPRSFQDSNGDGIGDLRGIISRLDYIKELGADVIWICPIYPSPNVDYGYDVTNHKAIMDSYGTMDD
FHELLDQVHQRGLKLVMDFVLNHTSVEHPWFKEAELDKNSKYRSYYYWRPGTKNGPPTDWLSNYGCPVWQYEEHTGEYYL
HMNAVKQADLNWENPEVRQAVYDMMKFWLDKGVDGLRIDQLHLISKKEYLPSYEDYINQQAEPKPFQPNGERIHDYLKEI
TDEVFSHYDVMSVGEVGSVTPEEGLKYTGTDKHELNMIFHFQHMELDQQPGKEHWDLKPLELSDLKSVLTKWQKKLEHQG
WNTLFWCNHDQPRIVSRFGDDGEYRKASAKMLAAVIYFMKGTPYIYQGEEIGMTNAPFTRIEDYKDIQTINMYHKRVFEK
GYDPNDVMRSILAKSRDHARTPMQWNSGKNAGFTDGTPWLKVNPNFTAINVEEAQGDPDSVLNYYKKLISLRKQYADLMK
GSFDLLLPDDPQLFVYMRENSKQQLLSVNNFSKEQAVFQWPKNCGKAQASLLLSNYNNDDLDDEMVFRPYESRVYLLDKT
N
>P21332 3.2.1.10~~~malL~~~Oligo-1,6-glucosidase~~~COG0366
MEKQWWKESVVYQIYPRSFMDSNGDGIGDLRGIISKLDYLKELGIDVIWLSPVYESPNDDNGYDISDYCKIMNEFGTMED
WDELLHEMHERNMKLMMDLVVNHTSDEHNWFIESRKSKDNKYRDYYIWRPGKEGKEPNNWGAAFSGSAWQYDEMTDEYYL
HLFSKKQPDLNWDNEKVRQDVYEMMKFWLEKGIDGFRMDVINFISKEEGLPTVETEEEGYVSGHKHFMNGPNIHKYLHEM
NEEVLSHYDIMTVGEMPGVTTEEAKLYTGEERKELQMVFQFEHMDLDSGEGGKWDVKPCSLLTLKENLTKWQKALEHTGW
NSLYWNNHDQPRVVSRFGNDGMYRIESAKMLATVLHMMKGTPYIYQGEEIGMTNVRFESIDEYRDIETLNMYKEKVMERG
EDIEKVMQSIYIKGRDNARTPMQWDDQNHAGFTTGEPWITVNPNYKEINVKQAIQNKDSIFYYYKKLIELRKNNEIVVYG
SYDLILENNPSIFAYVRTYGVEKLLVIANFTAEECIFELPEDISYSEVELLIHNYDVENGPIENITLRPYEAMVFKLK
>P29093 3.2.1.10~~~malL~~~Oligo-1,6-glucosidase~~~
MSQWWKEAVVYQIYPRSFYDSNGDGFGDLQGVIQKLDYIKRLGADVIWLCPVFDSPQDDNGYDISDYRSIYEKFGTNDDM
FQLIDEVHKRGMKIIMDLVVNHSSDEHAWFAESRKSKDNPYRDYYFWKDPKADGSEPNNWGAIFSGPAWSAMSTAQYYLH
YFSKKQPDLNWENEAVRREVYDLMTFWMDRGVDGWRMDVIGSISKFVDFPDYETDDSRPYVVGRYHSNGPRLHEFIQEMN
REVLSRYDCMTVGEAGGSDVEEAKKYTDPSRHELNMIFTFEHMDIDTKQHSPNGKWQMKPFDPIALKKTMTRWQTALMNV
GWNTLYFENHDQPRVISAGAMTRELRKQSRQSISNSSARHEGNPFIYQGEEIGMTNSEMPLEMYDDLEIKNAYRELVIEN
KTMTEEDFRKAVAKKGRDHARTPMQWDDGKYAGFTDGEAWLAVNPRYQEINVKESLADEDSIFYYYQKLIGLRKQNKVIV
YGDYRLLLEEDPRIFAYIREYRGEKLLVP
>P29094 3.2.1.10~~~malL~~~Oligo-1,6-glucosidase~~~COG0366
MERVWWKEAVVYQIYPRSFYDSNGDGIGDIRGIIAKLDYLKELGVDVVWLSPVYKSPNDDNGYDISDYRDIMDEFGTMAD
WKTMLEEMHKRGIKLVMDLVVNHTSDEHPWFIESRKSKDNPYRDYYIWRPGKNGKEPNNWESVFSGSAWEYDEMTGEYYL
HLFSKKQPDLNWENPKVRREVYEMMKFWLDKGVDGFRMDVINMISKVPELPDGEPQSGKKYASGSRYYMNGPRVHEFLQE
MNREVLSKYDIMTVGETPGVTPKEGILYTDPSRRELNMVFQFEHMDLDSGPGGKWDIRPWSLADLKKTMTKWQKELEGKG
WNSLYLNNHDQPRAVSRFGDDGKYRVESAKMLATFLHMMQGTPYIYQGEEIGMTNVRFPSIEDYRDIETLNMYKERVEEY
GEDPQEVMEKIYYKGRDNARTPMQWDDSENAGFTAGTPWIPVNPNYKEINVKAALEDPNSVFHYYKKLIQLRKQHDIIVY
GTYDLILEDDPYIYRYTRTLGNEQLIVITNFSEKTPVFRLPDHIIYKTKELLISNYDVDEAEELKEIRLRPWEARVYKIR
LP
>O06334 4.1.1.112~~~mhpE~~~Oxaloacetate decarboxylase~~~COG0119
MLMTATHREPIVLDTTVRDGSYAVNFQYTDDDVRRIVGDLDAAGIPYIEIGHGVTIGAAAAQGPAAHTDEEYFRAARSVV
RNARLGAVIVPALARIETVDLAGDYLDFLRICVIATEFELVMPFVERAQSKGLEVSIQLVKSHLFEPDVLAAAGKRARDV
GVRIVYVVDTTGTFLPEDARRYVEALRGASDVSVGFHGHNNLAMAVANTLEAFDAGADFLDGTLMGFGRGAGNCQIECLV
AALQRRGHLAAVDLDRIFDAARSDMLGRSPQSYGIDPWEISFGFHGLDSLQVEHLRAAAQQAGLSVSHVIRQTAKSHAGQ
WLSPQDIDRVVVGMRA
>Q8ZRY4 7.2.4.2~~~oadB1~~~Oxaloacetate decarboxylase beta chain 1~~~
MESLNALLQGMGLMHLGAGQAIMLLVSLLLLWLAIAKKFEPLLLLPIGFGGLLSNIPEAGMALTALESLLAHHDAGQLAV
IAAKLNCAPDVHAIKEALALALPSVQGQMENLAVDMGYTPGVLALFYKVAIGSGVAPLVIFMGVGAMTDFGPLLANPRTL
LLGAAAQFGIFATVLGALTLNYFGLISFTLPQAAAIGIIGGADGPTAIYLSGKLAPELLGAIAVAAYSYMALVPLIQPPI
MKALTSETERKIRMVQLRTVSKREKILFPVVLLMLVALLLPDAAPLLGMFCFGNLMRESGVVERLSDTVQNGLINIVTIF
LGLSVGAKLVADKFLQPQTLGILLLGVIAFGIGTAAGVLMAKLLNLCSKNKINPLIGSAGVSAVPMAARVSNKVGLESDP
QNFLLMHAMGPNVAGVIGSAIAAGVMLKYVLAM
>P13156 7.2.4.2~~~oadB~~~Oxaloacetate decarboxylase beta chain~~~
MESLNALIQGLGLMHLGAGQAIMLLVSLLLLWLAIAKKFEPLLLLPIGFGGLLSNIPEAGLALTALESLLAHRDPAQLAV
IAAKLHCAPDVHAIKAALALALPSVQGQMESLAVDMGYSAGVLAIFYKVAIGSGIAPLVIFMGVGAMTDFGPLLANPRTL
LLGAAAQFGIFATVLGALTLNYFGIISFTLPQAAAIGIIGGADGPTAIYLSGKLAPELLGAIAVAAYSYMALVPLIQPPI
MKALTTDKERKIRMVQLRTVSKREKILFPAVLLLLVALLLPDAAPLLGMFCFGNLMRESGVVERLSDTVQNALINIVTIF
LGLSVGAKLVADKFLQPQTLGILVLGVIAFCVGTAAGVLMAKLMNVFSRHKINPLIGSAGVSAVPMAARVSNKVGLEADG
QNFLLMHAMGPNVAGVIGSAIAAGVMLKYVLAM
>Q9HUU1 4.1.1.112~~~~~~Oxaloacetate decarboxylase~~~
MHRASHHELRAMFRALLDSSRCYHTASVFDPMSARIAADLGFECGILGGSVASLQVLAAPDFALITLSEFVEQATRIGRV
ARLPVIADADHGYGNALNVMRTVVELERAGIAALTIEDTLLPAQFGRKSTDLICVEEGVGKIRAALEARVDPALTIIART
NAELIDVDAVIQRTLAYQEAGADGICLVGVRDFAHLEAIAEHLHIPLMLVTYGNPQLRDDARLARLGVRVVVNGHAAYFA
AIKATYDCLREERGAVASDLTASELSKKYTFPEEYQAWARDYMEVKE
>Q03032 7.2.4.2~~~oadG2~~~Oxaloacetate decarboxylase gamma chain 2~~~
MTNAALLLGEGFTLMFLGMGFVLAFLFLLIFAIRGMSAAVNRFFPEPAPAPKAAPAAAAPVVDDFTRLKPVIAAAIHHHH
RLNA
>P13155 7.2.4.2~~~oadG~~~Oxaloacetate decarboxylase gamma chain~~~
MTDNAVLLGEGFTLMCLGMGFVLVFLLLLIFAIRGMSLAVNRLFPEPPAAPKPAPAAVAPADDFARLKPAIVAAIHHHRR
LHP
>E3PY95 5.4.3.5~~~oraE~~~D-ornithine 4,5-aminomutase subunit beta~~~COG5012
MEKDLQLRVNEKLDVENILKDLDKYTPKRRGWTWRQPAENLQMGPFIYKDASTPLENSVALPSAKYFGDIDPQPLPVITT
EIASGRFEDDIRRMRMAAWHGADHIMVIRTAGQSHYDGLIEGTPQGIGGVPITRKQVRAQRKALDLIEEEVGRPINYHSY
VSGVAGPDIAVMFAEEGVNGAHQDPQYNVLYRNINMIRSFIDACESKTIMAWADMAQIDGAHNANATAREAWKVMPELMV
QHALNSIFSLKVGMKKSNICLSTVPPTAPPAPSMYLDLPYAVALREMFEGYRMRAQMNTKYMEASTREATVTHVLNLLIS
KLTRADIQSTITPDEGRNVPWHIYNIEACDTAKQALIGMDGLMDMVQLKREGVLGDTVRELKERAVLFMEEIIEAGGYFN
AVEQGFFVDSGYYPERNGDGIARQINGGIGAGTVFERDEDYMAPVTAHFGYNNVKQYDEALVSEPSKLIDGCTLEVPEKI
VYIDELDENDNVNVRMEETKEFRHSSMIKPEVEWQADGTVLLTMFLPTSKRVAEFAAIEFAKKMNLEEVEVINREVMQEA
EGTRIELKGRVPFSIDINSLVIPPEPEILSEDEIREDIEKTPLKIVAATVGEDEHSVGLREVIDIKHGGIEKYGVEVHYL
GTSVPVEKLVDAAIELKADAILASTIISHDDIHYKNMKRIHELAVEKGIRDKIMIGCGGTQVTPEVAVKQGVDAGFGRGS
KGIHVATFLVKKRREMREGK
>E3PY96 5.4.3.5~~~oraS~~~D-ornithine 4,5-aminomutase subunit alpha~~~
MKRADDFQQRRAHLANLSDEELQTRFWEMAEKIVDPLLDLGKKNTTPSIERSVLLRMGFSSLEAKAIVDKTMDRGLMGKG
AGHIVYKIAKEKNISVREAGLALSEGKYWDDAIQIFKGGVK
>P44415 ~~~oapA~~~Opacity-associated protein OapA~~~COG3061
MNSMDKNQQSSQNELDLGLNQEPITPKKTIQPSSSILGKAKGLFAKKNHVQTNFQQRKEPTFGDSSTQENDPLIPSENLK
KVQKPVLQTSSTEENISAVDEEISAENNADEPVEKAEKPILAQPEKWKILQVLPAKHRRLFMAIFVLVILLIIFFALKPS
SDTVESFTQSNSNEVPVQFQSLDQSQPLETTILDNPPAQNQMAVEQANQSEFAPKAEEAANNTTAQNPLVENAPMQQNVV
QSPSQMPNEMAAASVAPMQPAQAEQPKATVPVQPMKKAVEPQVAHKDTVKKEVKVAEKAQAPAKATEQNVAKTAGNAPIV
EAKPVQAKKEKKVQIVDAKPVSKSTASRLSAKTLTVPKGVSLMQLFRDNQLNISDVNAMSKATGAGNVLSSFKSGDKVTV
SVNNQGRVNEMRLSNGARFVRQSDGSYQYKK
>P28269 2.6.1.18~~~~~~Omega-amino acid--pyruvate aminotransferase~~~
NMPEHAGASLASQLKLDAHWMPYTANRNFLRDPRLIVAAEGSWLVDDKGRKVYDSLSGLWTCGAGHTRKEIQEAVAKQLS
TLDYSPGFQYGHPLSFQLAEKITDLTPGNLNHVFFTDSGSECALTAVKMVRAYWRLKGQATKTKMIGRARGYHGVNIAGT
SLGGVNGNRKLFGQPMQDVDHLPHTLLASNAYSRGMPKEGGIALADELLKLIELHDASNIAAVFVEPLAGSAGVLVPPEG
YLKRNREICNQHNILLVFDEVITGFGRTGSMFGADSFGVTPDLMCIAKQVTNGAIPMGAVIASTEIYQTFMNQPTPEYAV
EFPHGYTYSAHPVACAAGLAALCLLQKENLVQSVAEVAPHFEKALHGIKGAKNVIDIRNFGLAGAIQIAPRDGDAIVRPF
EAGMALWKAGFYVRFGGDTLQFGPTFNSKPQDLDRLFDAVGEVLNKLLD
>P38370 ~~~oar~~~TonB-dependent transporter Oar~~~
MHLNRVLRETGVVVAAGLLYGSAAFAQSSTIIGTVIDAQSRQPAADVVVTATSPNLQGEQTVVTDAQGNYRIPQLPPGDY
TLRFEKEQFKPYARSAIQLRLNRTIRVNVELLPEALGEVVEIVGAPPTIDVGSTTMGVNVDQEFIKRIAVARPGGKGGAT
RSFESLAELAPGAQNDNYGVSINGSTSPENGYVVDGLSTNDPAFGVNASPLSIEFVQDVNIITGGYMPEFGRSTGGVINA
VTRSGSNEFHGSVFANWTPGTLEGTRKQIREEGTVITGQNQLQNLGDFGATLGGPILKDKLWFFAGFAPSFTRYQHTRTL
NALRVDDEGNTIKDETDFTVADAIPGSARKYYADSRTIQYMGKLTYLINQDHNVSFALNGTPTSTGGLGKLSVNPQSGGL
PGVLATRPGDFGLTETKANTTSLALKYAGAFADKKVLVDANLGWFHQTASTLPGDGSNLGDRTGLAGYSRMVYTTPRALT
LFEALPEGQEGACGSTPEEQLVRSPVTGYGVGGPGFMSDQTLDRYQANAKATYLLNALGTHVFKAGVDVELLSFDQVKAY
GGGVFFQEGSNYGVAGQGPAVHDARRYGYQTGPDSAVTQFTQVAKTTSTTVGGFLQDSWSIANRVTLNLGVRYDVQALYG
GNGDLSLLLGNQWSPRIGAIVDPFANGRAKVFVNFARYYEQVPLNLMDRAFPGENRISARRSLAEPGQGTATSCDPSSFE
SQQATCNTDSNLLAIPESSRNVNRFYTGGTVGGTPVDPDIKAQSSDEIVVGAEYEVLANTRLGASYTHKDMNSVIEDMSR
DDGNTYFLGNPGSGFAGEFPTPVRNYDNVTVYLNRTFADGWLAQANYTWSRLYGNYPGLFRPETGQLDPNILSDFDLIEL
LENRTGLLPFDRTHQIKVFGAKEFNISNALSASVGVSYRGSSGTPINYWGSHWAYLQDESFVLPRGAGGRTPWINTIDSN
IGVNYRVSKDSVVSFTLDVFNLFNFQGVNTVDQTYTLRDIKPIPGGTPADLENLPGRVEFQDQAPRDEPFGSVDGDVNKN
FKNPLSYQAPRQVRFGIRYTF
>Q59327 1.3.99.5~~~~~~3-oxo-5-alpha-steroid 4-dehydrogenase~~~
MSNVKKHVSTINPVGEVLDVGSADEVQWSDASDVVVVGWGGAGASAAIEAREQGAEVLVIERFSGGGASVLSGGVVYAGA
VPATRRKPASRFTEAMTAYLKHEVNGVVSDETLARFSRDSVTNLNWLEKQGATFASTMPGYKTSYPADGMYLYYSGNEVV
PAYGNPQLLKKPPPRGHRVVAKGQSGAMFFAALQKSTLAHGARTLTQARVQRLVREKDSGRVLGVEVMVLPEGDPRTERH
KKLDELVAKSACIRRRVPRRVAVNVRRSRARSARSATSVPAKVWCCPLAAISSIRNCWSMRRYKPGWLTGAAGCDGSGLR
LGQSVGGIAQDLNNISAWRFITPPSVWPKGLVVNIQGERFCNEQVYGAKLGYEMMEKQGGQAWLIIDSNVRRQAAWQCLF
GGLWAFQSMPALALMYKVAIKGKSVDDLAKKLRMDAAVLQLQFDRANAPARGEIEDPLGKSQDMRHEFKGGSLFAIDISI
SQKMFPLAVLSLGGLKVNEDNGAVIDGAGYDIPGLYAAGVPPLVWLPRVT
>P60298 2.6.1.13~~~rocD2~~~Ornithine aminotransferase 2~~~
MTKSEKIIELTNHYGAHNYLPLPIVISEAEGVWVKDPEGNKYMDMLSAYSAVNQGHRHPKIIQALKDQADKVTLVSRAFH
SDNLGEWYEKICKLAGKDKALPMNTGAEAVETALKAARRWAYDVKGIEPNKAEIIAFNGNFHGRTMAPVSLSSEAEYQRG
YGPLLDGFRKVDFGDVDALKAAINENTAAVLVEPIQGEAGINIPPEGYLKAIRELCDEHNVLFIADEIQAGLGRSGKLFA
TDWDNVKPDVYILGKALGGGVFPISVVLADKEVLDVFTPGSHGSTFGGNPLACAASIAALDVIVDEDLPGRSLELGDYFK
EQLKQIDHPSIKEVRGRGLFIGVELNESARPYCEALKEEGLLCKETHDTVIRFAPPLIITKEELDLALEKIRHVFQ
>Q8Y7I6 2.3.1.-~~~oatA~~~Peptidoglycan O-acetyltransferase OatA~~~COG1835
MKRTTRYSRKYVPSIDGLRALAVIAVIAYHLNFSWAKGGFIGVDIFFVLSGYLITNILLTQWEKNQSLQLKQFWIRRFRR
LIPAVYVMIVVVVIYSVFFHPEILKNLRGDAIASFFYVSNWWFIFHNVSYFDSFGLPSPLKNLWSLAIEEQFYLIWPAFL
LVFLKWVKNPKLLLKIVIGLGLLSAVWMTILYVPGTDPSRVYYGTDTRAFDLLSGCALAFVWPFNRLSPVVPRKSKAVLN
IAGTISILCFILFTAFVSEYQPFLYRGGLLFVAILGVIMIATISHPASYLSKIFSFKPLRWIGTRSYGIYLWHYPIITLT
TPVLEITQPNIWRAILQVAATFIIAELSFRFIETPIRKNGFINYFKGFKDKNYFIWKNKPVGKWLSIAGVVAVLAIFTLG
MSNVLSVNTNAEKQQTSVKTTTSTPDEKKDDKKEDKATKDKEADSNKASEQKETQKPDNKNKSAATPKTIITQTVAIGDS
VMLDIEPYLKEAVPNITIDGLVGRQLRDAITTATGYKKFNSENSSVILELGTNGPFTEDQLNDLLDQFDKATIYLVNTRV
PRGWQSDVNKSIANAASRPNVTVVDWYSRSSGQSQYFAPDGVHLTKAGAQAYVAMLTSVMNK
>Q2FV54 2.3.1.-~~~oatA~~~O-acetyltransferase OatA~~~COG1835
MDTKDFKRLEKMYSPRYLPGLDGLRAFAVIGIIIYHLNAQWLSGGFLGVDTFFVISGYLITSLLISEYYRTQKIDLLEFW
KRRLKRLIPAVLFLICVVLTFTLIFKPELIIQMKRDAIAAIFYVSNWWYISQNVDYFNQFAIEPLKHLWSLAIEEQFYLL
FPLVITFLLHRFKPRNIIQTLFIVSLISLGLMIVIHFITGDNSRVYFGTDTRLQTLLLGCILAFIWPPFALKKDISKKIV
VSLDIIGISGFAVLMTLFFIVGDQDQWIYNGGFYIISFATLFIIAIAVHPSSLFAKFLSMKPLLIIGKRSYSLYLWHYPI
IVFVNSYYVQGQIPVYVYIIEILLTALMAEISYRFIETPIRKKGFKAFAFLPKKKGQFARTVLVILLLVPSIVVLSGQFD
ALGKQHEAEKKEKKTEFKTTKKKVVKKDKQEDKQTANSKEDIKKSSPLLIGDSVMVDIGNVFTKKIPNAQIDGKVGRQLV
DATPIVKSQYKDYAKKGQKVVVELGTNGAFTKDQLNELLDSFGKADIYLVSIRVPRDYEGRINKLIYEAAEKRSNVHLVD
WYKASAGHPEYFAYDGIHLEYAGSKALTDLIVKTMETHATNKK
>Q7A3D6 2.3.1.-~~~oatA~~~O-acetyltransferase OatA~~~
MDTKDFKRLEKMYSPRYLPGLDGLRAFAVIGIIIYHLNAQWLSGGFLGVDTFFVISGYLITSLLISEYYRTQKIDLLEFW
KRRLKRLIPAVLFLICVVLTFTLIFKPELIIQMKRDAIAAIFYVSNWWYISQNVDYFNQFAIEPLKHLWSLAIEEQFYLL
FPLVITFLLHRFKPRNIIQTLFIVSLISLGLMIVIHFITGDNSRVYFGTDTRLQTLLLGCILAFIWPPFALKKDISKKIV
VSLDIIGISGFAVLMTLFFIVGDQDQWIYNGGFYIISFATLFIIAIAVHPSSLFAKFLSMKPLLIIGKRSYSLYLWHYPI
IVFVNSYYVQGQIPVYVYIIEILLTALMAEISYRFIETPIRKKGFKAFAFLPKKKGQFARTVLVILLLVPSIVVLSGQFD
ALGKQHEAEKKEKKTEFKTTKKKVVKKDKQEDKQTANSKEDIKKSSPLLIGDSVMVDIGNVFTKKIPNAQIDGKVGRQLV
DATPIVKSQYKDYAKKGQKVVVELGTNGAFTKDQLNELLDSFGKADIYLVSIRVPRDYEGRINKLIYEAAEKRSNVHLVD
WYKASAGHPEYFAYDGIHLEYAGSKALTDLIVKTMETHATNKK
>Q93S40 2.3.1.-~~~oatWY~~~Polysialic acid O-acetyltransferase~~~
MGTHMYSEQGINNTINISTTSLTNATQLTVIGNNNSVYIGNNCKIVSSNIRLKGNNITLFIADDVENMGLVCSLHSDCSL
QIQAKTTMGNGEITIAEKGKISIGKDCMLAHGYEIRNTDMHPIYSLENGERINHGKDVIIGNHVWLGRNVTILKGVCIPN
NVVVGSHTVLYKSFKEPNCVIAGSPAKIVKENIVWGRKMYHSTMYDDPTLNEFYK
>Q81TV3 2.6.1.13~~~rocD~~~Ornithine aminotransferase~~~COG4992
MIQTKDIIELTDTYGANNYHPLPIVISKAEGVWVEDPEGNRYMDLLSAYSAVNQGHRHPKIINALIDQANRVTLTSRAFH
SDQLGPWYEKVAKLTNKEMVLPMNTGAEAVETAIKTARRWAYDVKKVEANRAEIIVCEDNFHGRTMGAVSMSSNEEYKRG
FGPMLPGIIVIPYGDLEALKAAITPNTAAFILEPIQGEAGINIPPAGFLKEALEVCKKENVLFVADEIQTGLGRTGKVFA
CDWDNVTPDMYILGKALGGGVFPISCAAANRDILGVFEPGSHGSTFGGNPLACAVSIAALEVLEEEKLTERSLQLGEKLV
GQLKEIDNPMITEVRGKGLFIGIELNEPARPYCEQLKAAGLLCKETHENVIRIAPPLVISEEDLEWAFQKIKAVLS
>P38021 2.6.1.13~~~rocD~~~Ornithine aminotransferase~~~COG4992
MTALSKSKEIIDQTSHYGANNYHPLPIVISEALGAWVKDPEGNEYMDMLSAYSAVNQGHRHPKIIQALKDQADKITLTSR
AFHNDQLGPFYEKTAKLTGKEMILPMNTGAEAVESAVKAARRWAYEVKGVADNQAEIIACVGNFHGRTMLAVSLSSEEEY
KRGFGPMLPGIKLIPYGDVEALRQAITPNTAAFLFEPIQGEAGIVIPPEGFLQEAAAICKEENVLFIADEIQTGLGRTGK
TFACDWDGIVPDMYILGKALGGGVFPISCIAADREILGVFNPGSHGSTFGGNPLACAVSIASLEVLEDEKLADRSLELGE
YFKSELESIDSPVIKEVRGRGLFIGVELTEAARPYCERLKEEGLLCKETHDTVIRFAPPLIISKEDLDWAIEKIKHVLRN
A
>P20964 3.6.5.-~~~obg~~~GTPase Obg~~~COG0536
MFVDQVKVYVKGGDGGNGMVAFRREKYVPKGGPAGGDGGKGGDVVFEVDEGLRTLMDFRYKKHFKAIRGEHGMSKNQHGR
NADDMVIKVPPGTVVTDDDTKQVIADLTEHGQRAVIARGGRGGRGNSRFATPANPAPQLSENGEPGKERYIVLELKVLAD
VGLVGFPSVGKSTLLSVVSSAKPKIADYHFTTLVPNLGMVETDDGRSFVMADLPGLIEGAHQGVGLGHQFLRHIERTRVI
VHVIDMSGLEGRDPYDDYLTINQELSEYNLRLTERPQIIVANKMDMPEAAENLEAFKEKLTDDYPVFPISAVTREGLREL
LFEVANQLENTPEFPLYDEEELTQNRVMYTMENEEVPFNITRDPDGVFVLSGDSLERLFKMTDFSRDESVKRFARQMRGM
GVDEALRERGAKDGDIIRLLEFEFEFID
>B8GYI7 3.6.5.-~~~cgtA~~~GTPase Obg/CgtA~~~
MKFLDQCKIYIRSGNGGGGSVSFRREKYIEYGGPDGGDGGRGGDVWIEAVEGLNTLIDYRYQQHFKAGTGVHGMGRARHG
AAGEDVVLKVPVGTEVLEEDKETLIADLDHAGMRLLLAKGGNGGWGNLHFKGPVNQAPKYANPGQEGEERWIWLRLKLIA
DVGLVGLPNAGKSTFLAAASAAKPKIADYPFTTLTPNLGVVDLSSSERFVLADIPGLIEGASEGAGLGTRFLGHVERSAT
LIHLIDATQDDVAGAYETIRGELEAYGDELADKAEILALNKIDALDEETLAEKVAELEAVSGIKPRLVSGVSGQGVTELL
RAAYKQVRIRRGDLEEEIDDDEDHVDETPGGWTP
>P42641 3.6.5.-~~~obgE~~~GTPase ObgE/CgtA~~~COG0536
MKFVDEASILVVAGDGGNGCVSFRREKYIPKGGPDGGDGGDGGDVWMEADENLNTLIDYRFEKSFRAERGQNGASRDCTG
KRGKDVTIKVPVGTRVIDQGTGETMGDMTKHGQRLLVAKGGWHGLGNTRFKSSVNRTPRQKTNGTPGDKRELLLELMLLA
DVGMLGMPNAGKSTFIRAVSAAKPKVADYPFTTLVPSLGVVRMDNEKSFVVADIPGLIEGAAEGAGLGIRFLKHLERCRV
LLHLIDIDPIDGTDPVENARIIISELEKYSQDLATKPRWLVFNKIDLLDKVEAEEKAKAIAEALGWEDKYYLISAASGLG
VKDLCWDVMTFIIENPVVQAEEAKQPEKVEFMWDDYHRQQLEEIAEEDDEDWDDDWDEDDEEGVEFIYKR
>P9WMT1 3.6.5.-~~~obg~~~GTPase Obg~~~COG0536
MPRFVDRVVIHTRAGSGGNGCASVHREKFKPLGGPDGGNGGRGGSIVFVVDPQVHTLLDFHFRPHLTAASGKHGMGNNRD
GAAGADLEVKVPEGTVVLDENGRLLADLVGAGTRFEAAAGGRGGLGNAALASRVRKAPGFALLGEKGQSRDLTLELKTVA
DVGLVGFPSAGKSSLVSAISAAKPKIADYPFTTLVPNLGVVSAGEHAFTVADVPGLIPGASRGRGLGLDFLRHIERCAVL
VHVVDCATAEPGRDPISDIDALETELACYTPTLQGDAALGDLAARPRAVVLNKIDVPEARELAEFVRDDIAQRGWPVFCV
STATRENLQPLIFGLSQMISDYNAARPVAVPRRPVIRPIPVDDSGFTVEPDGHGGFVVSGARPERWIDQTNFDNDEAVGY
LADRLARLGVEEELLRLGARSGCAVTIGEMTFDWEPQTPAGEPVAMSGRGTDPRLDSNKRVGAAERKAARSRRREHGDG
>Q5F5D9 3.6.5.-~~~obg~~~GTPase Obg~~~
MKFIDEAKIEVAAGKGGNGATSFRREKFVPRGGPDGGDGGKGGSVWAEADENTNTLVEYRFVKRYQAKNGEKGHGSDRYG
AGADDIVLKMPVGTLIRDLDTDEIVADLTYHGQRVCLAKGGKGGLGNIHFKSSVNRAPKQSTPGEEGETRSLQLELKVLA
DVGLLGMPNAGKSTLITAVSAARPKIANYPFTTLHPNLGVVRIDENHSFVMADIPGLIEGAAEGAGLGHRFLKHLSRTGL
LLHVVDLAPFDETVNPAEEALAIINELRKYDEELYGKPRWLVLNKLDMLDEEEARARTAAFLEAVGWDYPEPDDRFQFDM
ETPRLFQISALTHQGTQELVHQINQYLAEKKRIEAEKAEAEKAAANVEIIEQQPKTDTGVFKPE
>Q02GB1 3.6.5.-~~~obg~~~GTPase Obg~~~
MKFVDEVSIHVKAGDGGNGLMSFRREKFIEKGGPNGGDGGDGGSIYLEADVNLNTLVDYRYTRRFDAQRGENGGSKDCTG
AKGDDLILPVPVGTTVIDANTQEIIGDLTEPGQRLMVAQGGWHGLGNTRFKSSTNRAPRQTTPGKPGEARDLKLELKVLA
DVGLLGLPNAGKSTFIRAVSAAKPKVADYPFTTLVPNLGVVSVGRYKSFVVADIPGLIEGAAEGAGLGIRFLKHLARTRI
LLHLVDMAPLDESDPADAAEVIVRELGRFSPALTERERWLVLNKMDQILDPAEREARKQAVIERLGWEGPVYVISALERD
GTEALSQDIMRYLDERTLRLEEDPQYAEELAELDRRIEDEARARLQALDDARALRRSGLKNAGAVDDDDFDDEEDDGDGP
EIFYVP
>Q8ZLS5 3.6.5.-~~~obg~~~GTPase Obg~~~
MKFVDEASILVVAGDGGNGCVSFRREKYIPKGGPDGGDGGDGGDVWMEADENLNTLIDYRFEKSFRAERGQNGASRDCTG
KRGKDVTIKVPVGTRVIDQGTGETMGDMTKHGQRLLVAKGGWHGLGNTRFKSSVNRTPRQKTNGTPGDKRDLLLELMLLA
DVGMLGMPNAGKSTFIRAVSAAKPKVADYPFTTLVPSLGVVRMDSEKSFVVADIPGLIEGAAEGAGLGIRFLKHLERCRV
LLHLIDIDPIDGSDPVENARIIIGELEKYSQDLAAKPRWLVFNKIDLMDKTEAEEKAKAIAEALGWEGKYYLISAASQLG
VKDLCWDVMTFIIENPIAQAEEAKQPEKVEFMWDDYHRQQLAEVEEDADDDWDDDWDEDDEEGVEFIYKR
>Q7A584 3.6.5.-~~~obg~~~GTPase Obg~~~
MFVDQVKISLKAGDGGNGITAYRREKYVPFGGPAGGDGGKGASVVFEVDEGLRTLLDFRYQRHFKASKGENGQSSNMHGK
NAEDLVLKVPPGTIIKNVETDEVLADLVEDGQRAVVAKGGRGGRGNSRFATPRNPAPDFSEKGEPGEELDVSLELKLLAD
VGLVGFPSVGKSTLLSIVSKAKPKIGAYHFTTIKPNLGVVSTPDQRSFVMADLPGLIEGASDGVGLGHQFLRHVERTKVI
VHMIDMSGSEGREPIEDYKVINQELAAYEQRLEDRPQIVVANKMDLPESQDNLNLFKEEIGEDVPVIPVSTITRDNIDQL
LYAIADKLEEYKDVDFTVEEEESVGINRVLYKHTPSQDKFTISRDDDGAYVVSGNAIERMFKMTDFNSDPAVRRFARQMR
SMGIDDALRERGCKNGDIVRILGGEFEFVE
>P95722 3.6.5.-~~~obg~~~GTPase Obg~~~COG0536
MTTFVDRVELHVAAGNGGHGCASVHREKFKPLGGPDGGNGGRGGDVILTVDQSVTTLLDYHHSPHRKATNGKPGEGGNRS
GKDGQDLVLPVPDGTVVLDGAGNVLADLVGHGTSYVAAQGGRGGLGNAALASARRKAPGFALLGEPGDLQDIHLELKTVA
DVALVGYPSAGKSSLISVLSAAKPKIADYPFTTLVPNLGVVTAGETVYTVADVPGLIPGASQGKGLGLEFLRHVERCSVL
VHVLDTATLESERDPLSDLDVIETELREYGGLDNRPRIVVLNKIDVPDGKDLAEMVRPDLEARGYRVFEVSAVAHMGLRE
LSFALAELVATARAARPKEEATRIVIRPKAVDDAGFTVTREEDGLFRVRGEKPERWVRQTDFNNDEAVGYLSDRLNRLGV
EDKLMKAGARNGDGVAIGPEDNAVVFDWEPSVTAGAEMLGRRGEDHRFEAPRPAAQRRRDRDAERDEAQQEFDGFEPF
>P95758 3.6.5.-~~~obg~~~GTPase Obg~~~
MTTFVDRVELHAAAGNGGHGCASVHREKFKPLGGPDGGNGGRGGDVILVVEQSVTTLLDYHHSPHRKATNGQPGAGDNRS
GKDGQDLVLPVPDGTVVLDKAGNVLADLVGQGTTFVAGQGGRGGLGNAALASARRKAPGFALLGEPGESRDIVLELKTVA
DVALVGYPSAGKSSLISVLSAAKPKIADYPFTTLVPNLGVVTAGSTVYTIADVPGLIPGASQGKGLGLEFLRHVERCSVL
VHVLDTATLESDRDPVSDLDMIEEELRLYGGLENRPRIVALNKVDIPDGQDLADMIRPDLEARGYRVFEVSAIAHKGLKE
LSFALAGIIAEARATKPKEEATRIVIRPRAVDDAGFTVTLEDDGIYRVRGEKPERWVRQTDFNNDEAVGYLADRLNRLGV
EDSLMKAGARAGDGVAIGPEENAVVFDWEPTVTAGAEMLGRRGEDHRLEEPRPAAQRRRERDAERDDAEKEYDEFDPF
>Q5SHE9 3.6.5.-~~~obg~~~GTPase Obg~~~COG0536
MFQDVLVITVAAGRGGDGAVSFRREKFVPKGGPDGGDGGRGGSVYLRARGSVDSLSRLSKRTYKAEDGEHGRGSQQHGRG
GEDLVIEVPRGTRVFDADTGELLADLTEEGQTVLVARGGAGGRGNMHFVSPTRQAPRFAEAGEEGEKRRLRLELMLIADV
GLVGYPNAGKSSLLAAMTRAHPKIAPYPFTTLSPNLGVVEVSEEERFTLADIPGIIEGASEGKGLGLEFLRHIARTRVLL
YVLDAADEPLKTLETLRKEVGAYDPALLRRPSLVALNKVDLLEEEAVKALADALAREGLAVLPVSALTGAGLPALKEALH
ALVRSTPPPEMPKPVPRKEVQAGVEVVPVAEGVYEVRAPEVERYLARIKGDLMEAAGYLQEVFRRQGVEAALRAKGVRAG
DLVRIGGLEFEYIPEV
>Q9KUS8 3.6.5.-~~~cgtA~~~GTPase Obg/CgtA~~~COG0536
MKFVDEAVIKVQAGDGGNGVVSFWREKFVTNGGPDGGDGGDGGDVYMVADENLNTLIDYRFQRFYEAERGKNGGGGNCTG
KSGKDKELRVPVGTRAVDIHTNEIIGEVAEHGKKVMIAKGGWHGLGNARFKSSVNRSPRQKTLGTKGELRDIRLELLLLA
DVGMLGMPNAGKSTFIRAVSAAKPKVADYPFTTLVPSLGVVSVLPEKSFVVADIPGLIEGAAEGAGLGIRFLKHLERCRV
LLHMIDIMPADQSDPAHNALTIIDELEQYSEKLAKKPRWLVFNKVDLMSEEEADEIIQNIIDALAWEGDYFKISAANRQG
TKELCMKLAEFMDTLPREAEEKTEAEKVDFTWDYNHKDGLAGREVITEDDDDWDDWDDEEDDGHVIYVRD
>Q6E0U3 3.6.5.-~~~cgtA~~~GTPase Obg/CgtA~~~
MKFVDEAVVKVQAGDGGSGVVSFWREKFITKGGPDGGDGGDGGDVYIQADENLNTLIDYRFQRFYEAERGENGRGGNCTG
KRGKDITLRVPVGTRAVDIHTNEIVAEVAEHGKKVMVAKGGWHGLGNTRFKSSVNRAPRQRTLGTKGEIREIRLELLLLA
DVGMLGLPNAGKSTFIRAVSAAKPKVADYPFTTLIPSLGVVSVVPEKSFVVADIPGLIEGAADGAGLGIRFLKHLERCRV
LLHMIDIMPIDQSDPIQNALTIIDELEQYSEKLAGKPRWLVFNKTDLMPEEEANEKIQEILDALGWEDEYFKISAINRNG
TKELCYKLADFMENLPREEEEVAEEDKVNFMWDDYHKDAIAGKDVITEEDDDDWDDCDDEDDDGHVVYVRD
>P35114 ~~~occM~~~Octopine transport system permease protein OccM~~~
MPFDPAFLWQTFVALLSGIPLALQLAVFSVALGTVLAFGLALMRVSRLWWLDLPARFYIFAFRGTPLLVQIYIIYYGLSQ
FPDVRHSFIWPFLRDAYWCAMAALALNTAAYTAEIMRGGLLSVPAGQIEAAKACGMGRVKLFRRIVIPQAIRQMLPGYSN
EVILMVKSTSLASTITIMEITGIAAKLISESYRTVEVFSCAGAIYLILNFIVARLFTLLEWALWPERRNNRLTTDPVDRK
GELHA
>P0A2V2 ~~~occP~~~Octopine permease ATP-binding protein P~~~
MPNPVRPAVQLKDIRKNFGNLEVLHGVSLSANEGEVISILGSSGSGKSTLLRCVNMLEVPNAGSVAIMGEEIALEHRAGR
LARPKDLKQVNRLRERAAMVFQGFNLWSHQTILQNVMEAPVHVQGRDRKACRDEAEALLERVGIASKRDAYPSELSGGQQ
QRAAIARALAMRPDVMLFDEPTSALDPELVGEVLKVMRDLAAEGRTMLIVTHEMDFARDVSSRTVFLHQGVIAEEGPSSE
MFAHPRTDRFRQFLRRDGGTSH
>P0A4N5 ~~~occQ~~~Octopine transport system permease protein OccQ~~~
MDYSQLMGFGPDGWGYDMLRATAMTMAVAFSGFTIGLVFGCLGAAASLSSSGALQAAASGYTTALRGIPDLLVIYLFYFG
SSSVISNVASLFGSSGFVGASTFLIGALAIGVVSGAYQTQVLRGAVLALNKGEIEAGRAYGMGALLLFRRIVLPQAARYA
LPGVGNVWQLVLKESALISVIGLVELMRQAQVGSGSTRQPFSFYLTAAALYLLITFVSGQVFRLAETRSMRGLQRGV
>P0A4T4 ~~~occR~~~Octopine catabolism/uptake operon regulatory protein OccR~~~
MNLRQVEAFRAVMLTGQMTAAAELMLVTQPAISRLIKDFEQATKLQLFERRGNHIIPTQEAKTLWKEVDRAFVGLNHIGN
LAADIGRQAAGTLRIAAMPALANGLLPRFLAQFIRDRPNLQVSLMGLPSSMVMEAVASGRADIGYADGPQERQGFLIETR
SLPAVVAVPMGHRLAGLDRVTPQDLAGERIIKQETGTLFAMRVEVAIGGIQRRPSIEVSLSHTALSLVREGAGIAIIDPA
AAIEFTDRIVLRPFSIFIDAGFLEVRSAIGAPSTIVDRFTTEFWRFHDDLMKQNGLME
>P0A4T3 ~~~occR~~~Octopine catabolism/uptake operon regulatory protein OccR~~~
MNLRQVEAFRAVMLTGQMTAAAELMLVTQPAISRLIKDFEQATKLQLFERRGNHIIPTQEAKTLWKEVDRAFVGLNHIGN
LAADIGRQAAGTLRIAAMPALANGLLPRFLAQFIRDRPNLQVSLMGLPSSMVMEAVASGRADIGYADGPQERQGFLIETR
SLPAVVAVPMGHRLAGLDRVTPQDLAGERIIKQETGTLFAMRVEVAIGGIQRRPSIEVSLSHTALSLVREGAGIAIIDPA
AAIEFTDRIVLRPFSIFIDAGFLEVRSAIGAPSTIVDRFTTEFWRFHDDLMKQNGLME
>P0A4F8 ~~~occT~~~Octopine-binding periplasmic protein~~~
MKLKTILCAALLLVAGQAAAQEKSITIATEGGYAPWNFSGPGGKLDGFEIDLANALCEKMKAKCQIVAQNWDGIMPSLTG
KKYDAIMAAMSVTPKRQEVIGFSIPYAAGINGFAVMGDSKLAEMPGLGETYSLDSQADAAKKAIADISSFLNGTTVGVQG
STTASTFLDKYFKGSVDIKEYKSVEEHNLDLTSGRLDAVLANATVLAAAIEKPEMKGAKLVGPLFSGGEFGVVAVGLRKE
DTALKADFDAAIKAASEDGTIKTLSLKWFKVDVTPQ
>P09773 4.3.1.12~~~ocd~~~Ornithine cyclodeaminase~~~
MPALANLNIVPFISVENMMDLAVSTGIENFLVQLAGYIEEDFRRWESFDKIPRIASHSRDGVIELMPTSDGTLYGFKYVN
GHPKNTKSGRQTVTAFGVLSDVDSGYPLLLSEMTILTALRTAATSAIAAKYLARKDSRTMALIGNGAQSEFQALAFKALI
GVDRIRLYDIDPEATARCSRNLQRFGFQIEACTSAEQAVEGADIITTATADKHNATILSDNMIGPGVHINGVGGDCPGKT
EMHRDILLRSDIFVEFPPQTRIEGEIQQLAPDHPVTELWRVMTGQDVGRKSDKQITLFDSVGFAIEDFSALRYVRDRVEG
SSHSSPLDLLADPDEPRDLFGMLLRRQAFRRLGG
>Q59701 4.3.1.12~~~ocd~~~Ornithine cyclodeaminase~~~
MPIDPKLNVVPFISVDHMMKLVLKVGIDTFLTELAAEIEKDFRRWPIFDKKPRVGSHSQDGVIELMPTSDGSLYGFKYVN
GHPKNTHQGRQTVTAFGVLSDVGNGYPLLLSEMTILTALRTAATSALAAKYLARPNSKTMAIIGNGAQSEFQARAFRAIL
GIQKLRLFDIDTSATRKCARNLTGPGFDIVECGSVAEAVEGADVITTVTADKQFATILSDNHVGPGVHINAVGGDCPGKT
EISMEVLLRSDIFVEYPPQTWIEGDIQQLPRTHPVTELWQVMTGEKTGRVGDRQITMFDSVGFAIEDFSALRYVRAKITD
FEMFTELDLLADPDEPRDLYGMLLRCEKKLEPTAVG
>Q88H32 4.3.1.12~~~ocd~~~Ornithine cyclodeaminase~~~COG2423
MTYFIDVPTMSDLVHDIGVAPFIGELAAALRDDFKRWQAFDKSARVASHSEVGVIELMPVADKSRYAFKYVNGHPANTAR
NLHTVMAFGVLADVDSGYPVLLSELTIATALRTAATSLMAAQALARPNARKMALIGNGAQSEFQALAFHKHLGIEEIVAY
DTDPLATAKLIANLKEYSGLTIRRASSVAEAVKGVDIITTVTADKAYATIITPDMLEPGMHLNAVGGDCPGKTELHADVL
RNARVFVEYEPQTRIEGEIQQLPADFPVVDLWRVLRGETEGRQSDSQVTVFDSVGFALEDYTVLRYVLQQAEKRGMGTKI
DLVPWVEDDPKDLFSHTRGRAGKRRIRRVA
>P83689 ~~~~~~Orange carotenoid-binding protein~~~
MPFTIDTARSIFPETLAADVVPATIARFKQLSAEDQLALIWFAYLEMGKTITIAAPGAANMQFAENTLQEIRQMTPLQQT
QAMCDLANRTDTPICRTYASWSPNIKLGFWYELGRFMDQGLVAPIPEGYKLSANANAILVTIQGIDPGQQITVLRNCVVD
MGFDTSKLGSYQRVAEPVVPPQEMSQRTKVQIEGVTNSTVLQYMDNLNANDFDNLISLFAEDGALQPPFQKPIVGKENTL
RFFREECQNLKLIPERGVSEPTEDGYTQIKVTGKVQTPWFGGNVGMNIAWRFLLNPENKVFFVAIDLLASPKELLNL
>P74102 ~~~~~~Orange carotenoid-binding protein~~~COG3631
MPFTIDSARGIFPNTLAADVVPATIARFSQLNAEDQLALIWFAYLEMGKTLTIAAPGAASMQLAENALKEIQAMGPLQQT
QAMCDLANRADTPLCRTYASWSPNIKLGFWYRLGELMEQGFVAPIPAGYQLSANANAVLATIQGLESGQQITVLRNAVVD
MGFTAGKDGKRIAEPVVPPQDTASRTKVSIEGVTNATVLNYMDNLNANDFDTLIELFTSDGALQPPFQRPIVGKENVLRF
FREECQNLKLIPERGVTEPAEDGFTQIKVTGKVQTPWFGGNVGMNIAWRFLLNPEGKIFFVAIDLLASPKELLNFAR
>K5BJH8 2.3.1.273~~~octT~~~Diglucosylglycerate octanoyltransferase~~~COG2755
MSGRRPTLLVFCDSLSYYGPRGGLPADDPRIWPNIVASQLDWDVELIGRVGWTSRDVWWAATQDPRAWAALPRAGAVIFA
TGGMDSLPSPLPTALRELIRYIRPPWLRRRVRDLYGWLQPRLSPVSRNALPPHLTAEYLEMTRGAIDFNRPGIPVVAALP
SVHIADSYGRAHHGREATARAITEWARQHGVVLVDLKAAVADQVLNGRGNPDGIHWNFEAHQAVAELMLKALAEAGVPCR
>A0R109 2.3.1.273~~~octT~~~Diglucosylglycerate octanoyltransferase~~~COG2755
MSSETSSESTGHRPVLLVFADSLSYFGPTGGLPADDPRIWPNIVGEQLGWDVELIGRIGWTCRDVWWAATQDPRSWAALP
RAGAVVFATSGMDSLPSPLPTALREMIRYVRPPWLRRWVRDGYGWVQPRLSPIARSALPPHVTVEYLEMTRNAIDFNRPG
IPVVASLPSVHIAETYGRAHHGREPTVRAITAWAEEHHVPLVDLKAAVADEVFGGRGNPDGIHWSFEAHRAVAELMLKGL
AEAGVTQRDSAT
>P71725 2.3.1.273~~~octT~~~Diglucosylglycerate octanoyltransferase~~~COG2755
MSSRRGRRPALLVFADSLAYYGPTGGLPADDPRIWPNIVASQLDWDLELIGRIGWTCRDVWWAATQDPRAWAALPRAGAV
IFATGGMDSLPSVLPTALRELIRYVRPSWLRRWVRDGYAWVQPRLSPVARAALPPHLTAEYLEKTRGAIDFNRPGIPIIA
SLPSVHIAETYGKAHHGRAGTVAAITEWAQHHDIPLVDLKAAVAEQILSGYGNRDGIHWNFEAHQAVAELMLKALAEAGV
PNEKSRG
>Q9I1M0 2.3.1.168~~~bkdB~~~Lipoamide acyltransferase component of branched-chain alpha-keto acid dehydrogenase complex~~~
MGTHVIKMPDIGEGIAEVELVEWHVQVGDSVNEDQVLAEVMTDKATVEIPSPVAGRILALGGQPGQVMAVGGELIRLEVE
GAGNLAESPAAATPAAPVAATPEKPKEAPVAAPKAAAEAPRALRDSEAPRQRRQPGERPLASPAVRQRARDLGIELQFVQ
GSGPAGRVLHEDLDAYLTQDGSVARSGGAAQGYAERHDEQAVPVIGLRRKIAQKMQDAKRRIPHFSYVEEIDVTDLEALR
AHLNQKWGGQRGKLTLLPFLVRAMVVALRDFPQLNARYDDEAEVVTRYGAVHVGIATQSDNGLMVPVLRHAESRDLWGNA
SEVARLAEAARSGKAQRQELSGSTITLSSLGVLGGIVSTPVINHPEVAIVGVNRIVERPMVVGGNIVVRKMMNLSSSFDH
RVVDGMDAAAFIQAVRGLLEHPATLFLE
>P37940 1.2.4.4~~~bfmBAA~~~2-oxoisovalerate dehydrogenase subunit alpha~~~COG1071
MSTNRHQALGLTDQEAVDMYRTMLLARKIDERMWLLNRSGKIPFVISCQGQEAAQVGAAFALDREMDYVLPYYRDMGVVL
AFGMTAKDLMMSGFAKAADPNSGGRQMPGHFGQKKNRIVTGSSPVTTQVPHAVGIALAGRMEKKDIAAFVTFGEGSSNQG
DFHEGANFAAVHKLPVIFMCENNKYAISVPYDKQVACENISDRAIGYGMPGVTVNGNDPLEVYQAVKEARERARRGEGPT
LIETISYRLTPHSSDDDDSSYRGREEVEEAKKSDPLLTYQAYLKETGLLSDEIEQTMLDEIMAIVNEATDEAENAPYAAP
ESALDYVYAK
>P09060 1.2.4.4~~~bkdA1~~~2-oxoisovalerate dehydrogenase subunit alpha~~~COG1071
MNEYAPLRLHVPEPTGRPGCQTDFSYLRLNDAGQARKPPVDVDAADTADLSYSLVRVLDEQGDAQGPWAEDIDPQILRQG
MRAMLKTRIFDSRMVVAQRQKKMSFYMQSLGEEAIGSGQALALNRTDMCFPTYRQQSILMARDVSLVEMICQLLSNERDP
LKGRQLPIMYSVREAGFFTISGNLATQFVQAVGWAMASAIKGDTKIASAWIGDGATAESDFHTALTFAHVYRAPVILNVV
NNQWAISTFQAIAGGESTTFAGRGVGCGIASLRVDGNDFVAVYAASRWAAERARRGLGPSLIEWVTYRAGPHSTSDDPSK
YRPADDWSHFPLGDPIARLKQHLIKIGHWSEEEHQATTAEFEAAVIAAQKEAEQYGTLANGHIPSAASMFEDVYKEMPDH
LRRQRQELGV
>Q5SLR4 1.2.4.4~~~~~~2-oxoisovalerate dehydrogenase subunit alpha~~~COG1071
MVKETHRFETFTEEPIRLIGEEGEWLGDFPLDLEGEKLRRLYRDMLAARMLDERYTILIRTGKTSFIAPAAGHEAAQVAI
AHAIRPGFDWVFPYYRDHGLALALGIPLKELLGQMLATKADPNKGRQMPEHPGSKALNFFTVASPIASHVPPAAGAAISM
KLLRTGQVAVCTFGDGATSEGDWYAGINFAAVQGAPAVFIAENNFYAISVDYRHQTHSPTIADKAHAFGIPGYLVDGMDV
LASYYVVKEAVERARRGEGPSLVELRVYRYGPHSSADDDSRYRPKEEVAFWRKKDPIPRFRRFLEARGLWNEEWEEDVRE
EIRAELERGLKEAEEAGPVPPEWMFEDVFAEKPWHLLRQEALLKEEL
>P37941 1.2.4.4~~~bfmBAB~~~2-oxoisovalerate dehydrogenase subunit beta~~~COG0022
MSVMSYIDAINLAMKEEMERDSRVFVLGEDVGRKGGVFKATAGLYEQFGEERVMDTPLAESAIAGVGIGAAMYGMRPIAE
MQFADFIMPAVNQIISEAAKIRYRSNNDWSCPIVVRAPYGGGVHGALYHSQSVEAIFANQPGLKIVMPSTPYDAKGLLKA
AVRDEDPVLFFEHKRAYRLIKGEVPADDYVLPIGKADVKREGDDITVITYGLCVHFALQAAERLEKDGISAHVVDLRTVY
PLDKEAIIEAASKTGKVLLVTEDTKEGSIMSEVAAIISEHCLFDLDAPIKRLAGPDIPAMPYAPTMEKYFMVNPDKVEAA
MRELAEF
>P09061 1.2.4.4~~~bkdA2~~~2-oxoisovalerate dehydrogenase subunit beta~~~COG0022
MATTTMTMIQALRSAMDVMLERDDNVVVYGQDVGYFGGVFRCTEGLQTKYGKSRVFDAPISESGIVGTAVGMGAYGLRPV
VEIQFADYFYPASDQIVSEMARLRYRSAGEFIAPLTLRMPCGGGIYGGQTHSQSPEAMFTQVCGLRTVMPSNPYDAKGLL
IASIECDDPVIFLEPKRLYNGPFDGHHDRPVTPWSKHPHSAVPDGYYTVPLDKAAITRPGNDVSVLTYGTTVYVAQVAAE
ESGVDAEVIDLRSLWPLDLDTIVESVKKTGRCVVVHEATRTCGFGAELVSLVQEHCFHHLEAPIERVTGWDTPYPHAQEW
AYFPGPSRVGAALKKVMEV
>Q5SLR3 1.2.4.4~~~~~~2-oxoisovalerate dehydrogenase subunit beta~~~COG0022
MALMTMVQALNRALDEEMAKDPRVVVLGEDVGKRGGVFLVTEGLLQKYGPDRVMDTPLSEAAIVGAALGMAAHGLRPVAE
IQFADYIFPGFDQLVSQVAKLRYRSGGQFTAPLVVRMPSGGGVRGGHHHSQSPEAHFVHTAGLKVVAVSTPYDAKGLLKA
AIRDEDPVVFLEPKRLYRSVKEEVPEEDYTLPIGKAALRREGKDLTLIGYGTVMPEVLQAAAELAKAGVSAEVLDLRTLM
PWDYEAVMNSVAKTGRVVLVSDAPRHASFVSEVAATIAEDLLDMLLAPPIRVTGFDTPYPYAQDKLYLPTVTRILNAAKR
ALDY
>Q8NQJ3 ~~~odhI~~~Oxoglutarate dehydrogenase inhibitor~~~COG1716
MSDNNGTPEPQVETTSVFRADLLKEMESSTGTAPASTGAENLPAGSALLVVKRGPNAGARFLLDQPTTTAGRHPESDIFL
DDVTVSRRHAEFRINEGEFEVVDVGSLNGTYVNREPRNAQVMQTGDEIQIGKFRLVFLAGPAE
>Q44297 1.5.1.28~~~odh~~~Opine dehydrogenase~~~
MIESKTYAVLGLGNGGHAFAAYLALKGQSVLAWDIDAQRIKEIQDRGAIIAEGPGLAGTAHPDLLTSDIGLAVKDADVIL
IVVPAIHHASIAANIASYISEGQLIILNPGATGGALEFRKILRENGAPEVTIGETSSMLFTCRSERPGQVTVNAIKGAMD
FACLPAAKAGWALEQIGSVLPQYVAVENVLHTSLTNVNAVMHPLPTLLNAARCESGTPFQYYLEGITPSVGSLAEKVDAE
RIAIAKAFDLNVPSVCEWYKESYGQSPATIYEAVQGNPAYRGIAGPINLNTRYFFEDVSTGLVPLSELGRAVNVPTPLID
AVLDLISSLIDTDFRKEGRTLEKLGLSGLTAAGIRSAVE
>A0A0H2ZH12 1.5.1.-~~~cntM~~~Pseudopaline synthase~~~
MNAADESLGNVLLVGLGAVAIQVALDLRRHGAGRLGALNHPGRRSQRIAEALARGACLQLEGQGQHRWLSGNAALDVFHQ
DPAELRDDWQTLVLCVPADSYLDVVRGLPWERLGGVRTLLLVSAFIGANLLVRSALPAGCQATVLSLSSYYAATKVIDET
QPLRALTKAVKRRVYLGSSRPDCPARETWRRVLAGSGVEVVPLATPEAAEGRNVTTYVHSPFFLGEFALARILSEQGPPG
FMYKLYPEGPITPGAIGAMRRLWCELSELLRRMGAEPLNLLRFLNDDNYPVHETMLPRAAIDGFAEAGAERQEYLLFVRY
AALLVDPFSPADEQGRHFDFSAVPFRRVSRDEDGLWRLPRVPLEDYRKLALIVALAAHFDLAMPQARSLLASYENAVSRF
IDCQGASQCHPSLYPIDSRPAADAIYRQWCSTC
>Q9HUX5 1.5.1.-~~~cntM~~~Pseudopaline synthase~~~
MNAADESLGNVLLVGLGAVAIQVALDLRRHGAGRLGALNHPGRRSQRIAEALARGACLQLEGQGQHRWLSGNAALDVFHQ
DPAELRDDWQTLVLCVPADSYLDVVRGLPWERLGGVRTLLLVSAFIGANLLVRSALPAGCQATVLSLSSYYAATKVIDET
QPLRALTKAVKRRVYLGSSRPDCPARETWRRVLAGSGVEVVPLATPEAAEGRNVTTYVHSPFFLGEFALARILSEQGPPG
FMYKLYPEGPITPGAIGAMRRLWCELSELLRRMGAEPLNLLRFLNDDNYPVHETMLPRASIDGFAEAGAERQEYLLFVRY
AALLVDPFSPADEQGRHFDFSAVPFRRVSRDEDGLWRLPRVPLEDYRKLALIVALAAHFDLAMPQARSLLASYENAVSRF
IDCQGASQCHPSLYPIDSRPAADAIYRQWCSTC
>A0A0H3JT80 1.5.1.52~~~cntM~~~Staphylopine synthase~~~
MSKLLMIGTGPVAIQLANICYLKSDYEIDMVGRASTSEKSKRLYQAYKKEKQFEVKIQNEAHQHLEGKFEINRLYKDVKN
VKGEYETVVMACTADAYYDTLQQLSLETLQSVKHVILISPTFGSQMIVEQFMSKFSQDIEVISFSTYLGDTRIVDKEAPN
HVLTTGVKKKLYMGSTHSNSTMCQRISALAEQLKIQLEVVESPLHAETRNSSLYVHPPLFMNDFSLKAIFEGTDVPVYVY
KLFPEGPITMTLIREMRLMWKEMMAILQAFRVPSVNLLQFMVKENYPVRPETLDEGDIEHFEILPDILQEYLLYVRYTAI
LIDPFSQPDENGHYFDFSAVPFKQVYKNEQDVVQIPRMPSEDYYRTAMIQHIGKMLGIKTPMIDQFLTRYEASCQAYKDM
HQDQQLSSQFNTNLFEGDKALVTKFLEINRTLS
>Q8CKU7 1.5.1.-~~~~~~Yersinopine synthase~~~
MHNTLPTLILGAGPAAIQLAVDISATGDARLGLYNRPSTKGERLKQYLALTPTLYLQGTGKAQATQKESSVTIDCYIDQL
AQAVGDWQRLILAVPADHYYAVLQQIPWAALPQLKSVILLSSSMGSGLMVQNLLNAAGKRDVEVISLSSYYADTKYIRAE
TQDISANTQDINAGTQDIGAIQPYRAYTKAFKQRIYLANQWGNAGSAEMSWLTAVLARHHIDTLPCSNLLAAERFSITNY
VHPPLALADTTLQALFYPEQRSQYLYKTQPEGPVCPAVIADLAGLADDYKRLLNRLGVEEINLLRFLNDDNYPVPASMVS
RRWIDEFPQLPPLEQQYALFVRYTALLVDPYSTPDEQGRFYDFSAVKVATVYQDANALWHLPRVPLEDVHKLRTLLLLAG
ALDVVMPTAQRLLQRFQQALKAFIDRVGEEHCHPSLLGDDCDRQAAIIEQQWRSQT
>Q8NRC3 ~~~odhA~~~2-oxoglutarate dehydrogenase E1/E2 component~~~COG0508
MSSASTFGQNAWLVDEMFQQFQKDPKSVDKEWRELFEAQGGPNTTPATTEAQPSAPKESAKPAPKAAPAAKAAPRVETKP
ADKTAPKAKESSVPQQPKLPEPGQTPIRGIFKSIAKNMDISLEIPTATSVRDMPARLMFENRAMVNDQLKRTRGGKISFT
HIIGYAMVKAVMAHPDMNNSYDVIDGKPTLIVPEHINLGLAIDLPQKDGSRALVVAAIKETEKMNFSEFLAAYEDIVARS
RKGKLTMDDYQGVTVSLTNPGGIGTRHSVPRLTKGQGTIIGVGSMDYPAEFQGASEDRLAELGVGKLVTITSTYDHRVIQ
GAVSGEFLRTMSRLLTDDSFWDEIFDAMNVPYTPMRWAQDVPNTGVDKNTRVMQLIEAYRSRGHLIADTNPLSWVQPGMP
VPDHRDLDIETHNLTIWDLDRTFNVGGFGGKETMTLREVLSRLRAAYTLKVGSEYTHILDRDERTWLQDRLEAGMPKPTQ
AEQKYILQKLNAAEAFENFLQTKYVGQKRFSLEGAEALIPLMDSAIDTAAGQGLDEVVIGMPHRGRLNVLFNIVGKPLAS
IFNEFEGQMEQGQIGGSGDVKYHLGSEGQHLQMFGDGEIKVSLTANPSHLEAVNPVMEGIVRAKQDYLDKGVDGKTVVPL
LLHGDAAFAGLGIVPETINLAKLRGYDVGGTIHIVVNNQIGFTTTPDSSRSMHYATDYAKAFGCPVFHVNGDDPEAVVWV
GQLATEYRRRFGKDVFIDLVCYRLRGHNEADDPSMTQPKMYELITGRETVRAQYTEDLLGRGDLSNEDAEAVVRDFHDQM
ESVFNEVKEGGKKQAEAQTGITGSQKLPHGLETNISREELLELGQAFANTPEGFNYHPRVAPVAKKRVSSVTEGGIDWAW
GELLAFGSLANSGRLVRLAGEDSRRGTFTQRHAVAIDPATAEEFNPLHELAQSKGNNGKFLVYNSALTEYAGMGFEYGYS
VGNEDSIVAWEAQFGDFANGAQTIIDEYVSSGEAKWGQTSKLILLLPHGYEGQGPDHSSARIERFLQLCAEGSMTVAQPS
TPANHFHLLRRHALSDLKRPLVIFTPKSMLRNKAAASAPEDFTEVTKFQSVINDPNVADAAKVKKVMLVSGKLYYELAKR
KEKDGRDDIAIVRIEMLHPIPFNRISEALAGYPNAEEVLFVQDEPANQGPWPFYQEHLPELIPNMPKMRRVSRRAQSSTA
TGVAKVHQLEEKQLIDEAFEA
>P0AFG3 1.2.4.2~~~sucA~~~2-oxoglutarate dehydrogenase E1 component~~~COG0567
MQNSALKAWLDSSYLSGANQSWIEQLYEDFLTDPDSVDANWRSTFQQLPGTGVKPDQFHSQTREYFRRLAKDASRYSSTI
SDPDTNVKQVKVLQLINAYRFRGHQHANLDPLGLWQQDKVADLDPSFHDLTEADFQETFNVGSFASGKETMKLGELLEAL
KQTYCGPIGAEYMHITSTEEKRWIQQRIESGRATFNSEEKKRFLSELTAAEGLERYLGAKFPGAKRFSLEGGDALIPMLK
EMIRHAGNSGTREVVLGMAHRGRLNVLVNVLGKKPQDLFDEFAGKHKEHLGTGDVKYHMGFSSDFQTDGGLVHLALAFNP
SHLEIVSPVVIGSVRARLDRLDEPSSNKVLPITIHGDAAVTGQGVVQETLNMSKARGYEVGGTVRIVINNQVGFTTSNPL
DARSTPYCTDIGKMVQAPIFHVNADDPEAVAFVTRLALDFRNTFKRDVFIDLVCYRRHGHNEADEPSATQPLMYQKIKKH
PTPRKIYADKLEQEKVATLEDATEMVNLYRDALDAGDCVVAEWRPMNMHSFTWSPYLNHEWDEEYPNKVEMKRLQELAKR
ISTVPEAVEMQSRVAKIYGDRQAMAAGEKLFDWGGAENLAYATLVDEGIPVRLSGEDSGRGTFFHRHAVIHNQSNGSTYT
PLQHIHNGQGAFRVWDSVLSEEAVLAFEYGYATAEPRTLTIWEAQFGDFANGAQVVIDQFISSGEQKWGRMCGLVMLLPH
GYEGQGPEHSSARLERYLQLCAEQNMQVCVPSTPAQVYHMLRRQALRGMRRPLVVMSPKSLLRHPLAVSSLEELANGTFL
PAIGEIDELDPKGVKRVVMCSGKVYYDLLEQRRKNNQHDVAIVRIEQLYPFPHKAMQEVLQQFAHVKDFVWCQEEPLNQG
AWYCSQHHFREVIPFGASLRYAGRPASASPAVGYMSVHQKQQQDLVNDALNVE
>Q99U74 1.2.4.2~~~odhA~~~2-oxoglutarate dehydrogenase E1 component~~~
MTNERKEVSEAPVNFGANLGLMLDLYDDFLQDPSSVPEDLQVLFSTIKRVMRLIDNIRQYGHLKADIYPVNPPKRKHVPK
LEIEDFDLDQQTLEGISAGIVSDHFADIYDNAYEAILRMEKRYKGPIAFEYTHINNNTERGWLKRRIETPYKVTLNNNEK
RALFKQLAYVEGFEKYLHKNFVGAKRFSIEGVDALVPMLQRTITIAAKEGIKNIQIGMAHRGRLNVLTHVLEKPYEMMIS
EFMHTDPMKFLPEDGSLQLTAGWTGDVKYHLGGIKTTDSYGTMQRIALANNPSHLEIVAPVVEGRTRAAQDDTQRAGAPT
TDHHKAMPIIIHGDAAYPGQGINFETMNLGNLKGYSTGGSLHIITNNRIGFTTEPIDARSTTYSTDVAKGYDVPIFHVNA
DDVEATIEAIDIAMEFRKEFHKDVVIDLVGYRRFGHNEMDEPSITNPVPYQNIRKHDSVEYVFGKKLVNEGVISEDEMHS
FIEQVQKELRQAHDKINKADKMDNPDMEKPAELALPLQADEQSFTFDHLKEINDALLTYPDGFNILKKLNKVLEKRHEPF
NKEDGLVDWAQAEQLAFATILQDGTPIRLTGQDSERGTFSHRHAVLHDEQTGETYTPLHHVPDQKATFDIHNSPLSEAAV
VGFEYGYNVENKKSFNIWEAQYGDFANMSQMIFDNFLFSSRSKWGERSGLTLFLPHAYEGQGPEHSSARLERFLQLAAEN
NCTVVNLSSSSNYFHLLRAQAASLDSEQMRPLVVMSPKSLLRNKTVAKPIDEFTSGGFEPILTESYQADKVTKVILATGK
MFIDLKEALAKNPDESVLLVAIERLYPFPEEEIEALLAQLPKLEEVSWVQEEPKNQGAWLYVYPYVKVLVADKYDLSYHG
RIQRAAPAEGDGEIHKLVQNKIIENALKNN
>P20708 2.3.1.61~~~sucB~~~Dihydrolipoyllysine-residue succinyltransferase component of 2-oxoglutarate dehydrogenase complex~~~
MAIDIKAPTFPESIADGTVATWHKKPGEPVKRDELIVDIETDKVVMEVLAEADGVIAEIVKNEGDTVLSGELLGKLTEGG
AATAAPAAAPAPAAAAPAAAEAPILSPAARKIAEENAIAADSITGTGKGGRVTKEDAVAAAEAKKSAPAGQPAPAATAAP
LFAAGDRVEKRVPMTRLRAKVAERLVEAQSSMAMLTTFNEVNMKPVMELRAKYKDLFEKTHNGVRLGFMSFFVKAAVEAL
KRQPGVNASIDGNDIVYHGYQDIGVAVSSDRGLVVPVLRNAEFMSLAEIEGGINEFGKKAKAGKLTIEEMTGGTFTISNG
GVFGSLLSTPIVNPPQTAILGMHKIQERPMAVNGQVVILPMMYLALSYDHRLIDGKEAVTFLVTMKDLLEDPARLLLDV
>P0AFG7 2.3.1.61~~~sucB~~~Dihydrolipoyllysine-residue succinyltransferase component of 2-oxoglutarate dehydrogenase complex~~~COG0508
MSSVDILVPDLPESVADATVATWHKKPGDAVVRDEVLVEIETDKVVLEVPASADGILDAVLEDEGTTVTSRQILGRLREG
NSAGKETSAKSEEKASTPAQRQQASLEEQNNDALSPAIRRLLAEHNLDASAIKGTGVGGRLTREDVEKHLAKAPAKESAP
AAAAPAAQPALAARSEKRVPMTRLRKRVAERLLEAKNSTAMLTTFNEVNMKPIMDLRKQYGEAFEKRHGIRLGFMSFYVK
AVVEALKRYPEVNASIDGDDVVYHNYFDVSMAVSTPRGLVTPVLRDVDTLGMADIEKKIKELAVKGRDGKLTVEDLTGGN
FTITNGGVFGSLMSTPIINPPQSAILGMHAIKDRPMAVNGQVEILPMMYLALSYDHRLIDGRESVGFLVTIKELLEDPTR
LLLDV
>P0AFG6 2.3.1.61~~~sucB~~~Dihydrolipoyllysine-residue succinyltransferase component of 2-oxoglutarate dehydrogenase complex~~~COG0508
MSSVDILVPDLPESVADATVATWHKKPGDAVVRDEVLVEIETDKVVLEVPASADGILDAVLEDEGTTVTSRQILGRLREG
NSAGKETSAKSEEKASTPAQRQQASLEEQNNDALSPAIRRLLAEHNLDASAIKGTGVGGRLTREDVEKHLAKAPAKESAP
AAAAPAAQPALAARSEKRVPMTRLRKRVAERLLEAKNSTAMLTTFNEVNMKPIMDLRKQYGEAFEKRHGIRLGFMSFYVK
AVVEALKRYPEVNASIDGDDVVYHNYFDVSMAVSTPRGLVTPVLRDVDTLGMADIEKKIKELAVKGRDGKLTVEDLTGGN
FTITNGGVFGSLMSTPIINPPQSAILGMHAIKDRPMAVNGQVEILPMMYLALSYDHRLIDGRESVGFLVTIKELLEDPTR
LLLDV
>Q7A5N4 2.3.1.61~~~odhB~~~Dihydrolipoyllysine-residue succinyltransferase component of 2-oxoglutarate dehydrogenase complex~~~
MPEVKVPELAESITEGTIAEWLKNVGDSVEKGEAILELETDKVNVEVVSEEAGVLSEQLASEGDTVEVGQAIAIIGEGSG
NASKENSNDNTPQQNEETNNKKEETTNNSVDKAEVNQANDDNQQRINATPSARRYARENGVNLAEVSPKTNDVVRKEDID
KKQQAPASTQTTQQAPAKEEKKYNQYPTKPVIREKMSRRKKTAAKKLLEVSNNTAMLTTFNEVDMTNVMELRKRKKEQFM
KDHDGTKLGFMSFFTKASVAALKKYPEVNAEIDGDDMITKQYYDIGVAVSTDDGLLVPFVRDCDKKNFAEIEAEIANLAV
KAREKKLGLDDMVNGSFTITNGGIFGSMMSTPIINGNQAAILGMHSIITRPIAIDQDTIENRPMMYIALSYDHRIIDGKE
AVGFLKTIKELIENPEDLLLES
>Q8NNF6 1.2.4.1~~~aceE~~~Pyruvate dehydrogenase E1 component~~~COG2609
MADQAKLGGKPSDDSNFAMIRDGVASYLNDSDPEETNEWMDSLDGLLQESSPERARYLMLRLLERASAKRVSLPPMTSTD
YVNTIPTSMEPEFPGDEEMEKRYRRWIRWNAAIMVHRAQRPGIGVGGHISTYAGAAPLYEVGFNHFFRGKDHPGGGDQIF
FQGHASPGMYARAFMEGRLSEDDLDGFRQEVSREQGGIPSYPHPHGMKDFWEFPTVSMGLGPMDAIYQARFNRYLENRGI
KDTSDQHVWAFLGDGEMDEPESRGLIQQAALNNLDNLTFVVNCNLQRLDGPVRGNTKIIQELESFFRGAGWSVIKVVWGR
EWDELLEKDQDGALVEIMNNTSDGDYQTFKANDGAYVREHFFGRDPRTAKLVENMTDEEIWKLPRGGHDYRKVYAAYKRA
LETKDRPTVILAHTIKGYGLGHNFEGRNATHQMKKLTLDDLKLFRDKQGIPITDEQLEKDPYLPPYYHPGEDAPEIKYMK
ERRAALGGYLPERRENYDPIQVPPLDKLRSVRKGSGKQQIATTMATVRTFKELMRDKGLADRLVPIIPDEARTFGLDSWF
PTLKIYNPHGQNYVPVDHDLMLSYREAPEGQILHEGINEAGSVASFIAAGTSYATHGKAMIPLYIFYSMFGFQRTGDSIW
AAADQMARGFLLGATAGRTTLTGEGLQHMDGHSPVLASTNEGVETYDPSFAYEIAHLVHRGIDRMYGPGKGEDVIYYITI
YNEPTPQPAEPEGLDVEGLHKGIYLYSRGEGTGHEANILASGVGMQWALKAASILEADYGVRANIYSATSWVNLARDGAA
RNKAQLRNPGADAGEAFVTTQLKQTSGPYVAVSDFSTDLPNQIREWVPGDYTVLGADGFGFSDTRPAARRFFNIDAESIV
VAVLNSLAREGKIDVSVAAQAAEKFKLDDPTSVSVDPNAPEE
>P0AFG9 1.2.4.1~~~aceE~~~Pyruvate dehydrogenase E1 component~~~COG2609
MSERFPNDVDPIETRDWLQAIESVIREEGVERAQYLIDQLLAEARKGGVNVAAGTGISNYINTIPVEEQPEYPGNLELER
RIRSAIRWNAIMTVLRASKKDLELGGHMASFQSSATIYDVCFNHFFRARNEQDGGDLVYFQGHISPGVYARAFLEGRLTQ
EQLDNFRQEVHGNGLSSYPHPKLMPEFWQFPTVSMGLGPIGAIYQAKFLKYLEHRGLKDTSKQTVYAFLGDGEMDEPESK
GAITIATREKLDNLVFVINCNLQRLDGPVTGNGKIINELEGIFEGAGWNVIKVMWGSRWDELLRKDTSGKLIQLMNETVD
GDYQTFKSKDGAYVREHFFGKYPETAALVADWTDEQIWALNRGGHDPKKIYAAFKKAQETKGKATVILAHTIKGYGMGDA
AEGKNIAHQVKKMNMDGVRHIRDRFNVPVSDADIEKLPYITFPEGSEEHTYLHAQRQKLHGYLPSRQPNFTEKLELPSLQ
DFGALLEEQSKEISTTIAFVRALNVMLKNKSIKDRLVPIIADEARTFGMEGLFRQIGIYSPNGQQYTPQDREQVAYYKED
EKGQILQEGINELGAGCSWLAAATSYSTNNLPMIPFYIYYSMFGFQRIGDLCWAAGDQQARGFLIGGTSGRTTLNGEGLQ
HEDGHSHIQSLTIPNCISYDPAYAYEVAVIMHDGLERMYGEKQENVYYYITTLNENYHMPAMPEGAEEGIRKGIYKLETI
EGSKGKVQLLGSGSILRHVREAAEILAKDYGVGSDVYSVTSFTELARDGQDCERWNMLHPLETPRVPYIAQVMNDAPAVA
STDYMKLFAEQVRTYVPADDYRVLGTDGFGRSDSRENLRHHFEVDASYVVVAALGELAKRGEIDKKVVADAIAKFNIDAD
KVNPRLA
>P0AFG8 1.2.4.1~~~aceE~~~Pyruvate dehydrogenase E1 component~~~COG2609
MSERFPNDVDPIETRDWLQAIESVIREEGVERAQYLIDQLLAEARKGGVNVAAGTGISNYINTIPVEEQPEYPGNLELER
RIRSAIRWNAIMTVLRASKKDLELGGHMASFQSSATIYDVCFNHFFRARNEQDGGDLVYFQGHISPGVYARAFLEGRLTQ
EQLDNFRQEVHGNGLSSYPHPKLMPEFWQFPTVSMGLGPIGAIYQAKFLKYLEHRGLKDTSKQTVYAFLGDGEMDEPESK
GAITIATREKLDNLVFVINCNLQRLDGPVTGNGKIINELEGIFEGAGWNVIKVMWGSRWDELLRKDTSGKLIQLMNETVD
GDYQTFKSKDGAYVREHFFGKYPETAALVADWTDEQIWALNRGGHDPKKIYAAFKKAQETKGKATVILAHTIKGYGMGDA
AEGKNIAHQVKKMNMDGVRHIRDRFNVPVSDADIEKLPYITFPEGSEEHTYLHAQRQKLHGYLPSRQPNFTEKLELPSLQ
DFGALLEEQSKEISTTIAFVRALNVMLKNKSIKDRLVPIIADEARTFGMEGLFRQIGIYSPNGQQYTPQDREQVAYYKED
EKGQILQEGINELGAGCSWLAAATSYSTNNLPMIPFYIYYSMFGFQRIGDLCWAAGDQQARGFLIGGTSGRTTLNGEGLQ
HEDGHSHIQSLTIPNCISYDPAYAYEVAVIMHDGLERMYGEKQENVYYYITTLNENYHMPAMPEGAEEGIRKGIYKLETI
EGSKGKVQLLGSGSILRHVREAAEILAKDYGVGSDVYSVTSFTELARDGQDCERWNMLHPLETPRVPYIAQVMNDAPAVA
STDYMKLFAEQVRTYVPADDYRVLGTDGFGRSDSRENLRHHFEVDASYVVVAALGELAKRGEIDKKVVADAIAKFNIDAD
KVNPRLA
>A0R0B0 1.2.4.1~~~aceE~~~Pyruvate dehydrogenase E1 component~~~COG2609
MTTEFVRQDLAQNSSTAAEPDRVRVIREGVASYLPDIDTEETAEWLESFDELLERSGPARARYLMLRLLERAGEQRVAIP
ALTSTDYVNTIPTELEPWFPGDEDVERRYRAWIRWNAAIMVHRAQRPGVGVGGHISTYASSATLYEVGFNHFFRGKSHPG
GGDHVFIQGHASPGIYARAFLEGRLTTDQLDGFRQEHSHSGGGLPSYPHPRLMPDFWEFPTVSMGLGPMNAIYQARFNHY
LHDRGIKDTSDQHVWAFLGDGEMDEPESRGLIQVAANEALDNLTFVINCNLQRLDGPVRGNGKIIQELESFFRGAGWNVI
KVVWGREWDVLLHADRDGALVNLMNSTPDGDYQTYKANDGAYVRDHFFGRDPRTKALVADMSDQEIWNLKRGGHDYRKVY
AAYRAAMEHKGQPTVILAKTIKGYTLGQHFEGRNATHQMKKLALEDLKNFRDVTRVPVSDAQLEEDPYLPPYYHPGPEAP
EIRYLLERRRALGGFVPSRRTKSKPLALPGSDTYKALKKGSGSQAVATTMATVRTFKELLRDKNIGPRIVPIIPDEARTF
GMDSWFPSLKIYNRNGQLYTSVDSELMLAYKESEVGQILHEGINEAGSTSSFTAVGTSYSTHDEPMIPIYIFYSMFGFQR
TGDGLWAAADQMARGFVLGATAGRTTLTGEGLQHADGHSLLLASTNPAAVTYDPAFAYEIAHIIESGLQRMYGEDPENVF
FYLTIYNEPYQQPAEPENLDVEALLKGLYLYRPAPEKRAKSAQILASGVAMPEALRAADLLASDWDVAADVWSVTSWGEL
NREGVAIEKHRLRHPDEPAGTPHVTSALADAAGPVIAVSDWMRAVPEQIRPWVPGTYVTLGTDGFGFSDTRPAARRYFNT
DAESVVVAVLQGLARDGEIDASVAAQAAEQYRIDDVSAAGVSYADTGSA
>P9WIS9 1.2.4.1~~~aceE~~~Pyruvate dehydrogenase E1 component~~~COG2609
MTTDFARHDLAQNSNSASEPDRVRVIREGVASYLPDIDPEETSEWLESFDTLLQRCGPSRARYLMLRLLERAGEQRVAIP
ALTSTDYVNTIPTELEPWFPGDEDVERRYRAWIRWNAAIMVHRAQRPGVGVGGHISTYASSAALYEVGFNHFFRGKSHPG
GGDQVFIQGHASPGIYARAFLEGRLTAEQLDGFRQEHSHVGGGLPSYPHPRLMPDFWEFPTVSMGLGPLNAIYQARFNHY
LHDRGIKDTSDQHVWCFLGDGEMDEPESRGLAHVGALEGLDNLTFVINCNLQRLDGPVRGNGKIIQELESFFRGAGWNVI
KVVWGREWDALLHADRDGALVNLMNTTPDGDYQTYKANDGGYVRDHFFGRDPRTKALVENMSDQDIWNLKRGGHDYRKVY
AAYRAAVDHKGQPTVILAKTIKGYALGKHFEGRNATHQMKKLTLEDLKEFRDTQRIPVSDAQLEENPYLPPYYHPGLNAP
EIRYMLDRRRALGGFVPERRTKSKALTLPGRDIYAPLKKGSGHQEVATTMATVRTFKEVLRDKQIGPRIVPIIPDEARTF
GMDSWFPSLKIYNRNGQLYTAVDADLMLAYKESEVGQILHEGINEAGSVGSFIAAGTSYATHNEPMIPIYIFYSMFGFQR
TGDSFWAAADQMARGFVLGATAGRTTLTGEGLQHADGHSLLLAATNPAVVAYDPAFAYEIAYIVESGLARMCGENPENIF
FYITVYNEPYVQPPEPENFDPEGVLRGIYRYHAATEQRTNKAQILASGVAMPAALRAAQMLAAEWDVAADVWSVTSWGEL
NRDGVAIETEKLRHPDRPAGVPYVTRALENARGPVIAVSDWMRAVPEQIRPWVPGTYLTLGTDGFGFSDTRPAARRYFNT
DAESQVVAVLEALAGDGEIDPSVPVAAARQYRIDDVAAAPEQTTDPGPGA
>P35489 2.3.1.12~~~pdhC~~~Dihydrolipoyllysine-residue acetyltransferase component of pyruvate dehydrogenase complex~~~
MYEFKFADIGEGIHEGTVLQWNFKVGDKVKEGETLVIVETDKVNAELPSPVDGTIVSLGAKEGEEIHVGQIIVTIDDGTG
TPAAAPAPAQVSAPTPAPAAAPQVAAPAASGDIYDFKFADIGEGIHEGTILQWNFKVGDKVKEGETLVVVETDKVNAELP
SPVDGTILKLGKAEGEVIHVGETVVLIGQNGATLEQAQAPKAEAPVSEPKKGAGVVGEIEVSDDIIGGSEEVHVVATTGK
VLASPVARKLASDLGVDIATIKGSGEQGRVMKDDVQNSKAPAEAQAPVQQTQAPAQAAASVAPSFAAAGKPQGDVEVVKI
TRLRKAVSNAMTRSKSIIPETVLMDEINVDALVNFRNEAKGLAESKGIKLTYMAFIAKAVLIALKEFPMFNASFNHDTDE
VYIKKFINLGMAVDTPDGLIVPNIKNADRLSVFELASQVRSLADDTIARKISMDQQTGGTFTITNFGSAGIAFGTPVINY
PELAILGIGKIDRKPWVVGNEIKIAHTLPLSLAVDHRIIDGADGGRFLMRVKELLTNPTLLLLS
>P10802 2.3.1.12~~~~~~Dihydrolipoyllysine-residue acetyltransferase component of pyruvate dehydrogenase complex~~~
MSEIIRVPDIGGDGEVIELLVKTGDLIEVEQGLVVLESAKASMEVPSPKAGVVKSVSVKLGDKLKEGDAIIELEPAAGAA
AAPAEAAAVPAAPTQAVDEAEAPSPGASATPAPAAASQEVRVPDIGSAGKARVIEVLVKAGDQVQAEQSLIVLESDKASM
EIPSPASGVVESVAIQLNAEVGTGDLILTLRTTGAQAQPTAPAAAAAASPAPAPLAPAAAGPQEVKVPDIGSAGKARVIE
VLVKAGDQVQAEQSLIVLESDKASMEIPSPAAGVVESVAVQLNAEVGTGDQILTLRVAGAAPSGPRARGSPGQAAAAPGA
APAPAPVGAPSRNGAKVHAGPAVRQLAREFGVELAAINSTGPRGRILKEDVQAYVKAMMQKAKEAPAAGAASGAGIPPIP
PVDFAKYGEIEEVPMTRLMQIGATNLHRSWLNVPHVTQFESADITELEAFRVAQKAVAEKAGVKLTVLPLLLKACAYLLK
ELPDFNSSLAPSGQALIRKKYVHIGFAVDTPDGLLVPVIRNVDQKSLLQLAAEAAELAEKARSKKLGADAMQGACFTISS
LGHIGGTAFTPIVNAPEVAILGVSKASMQPVWDGKAFQPRLMLPLSLSYDHRVINGAAAARFTKRLGDLLADIRAILL
>P21883 2.3.1.12~~~pdhC~~~Dihydrolipoyllysine-residue acetyltransferase component of pyruvate dehydrogenase complex~~~COG0508
MAFEFKLPDIGEGIHEGEIVKWFVKPNDEVDEDDVLAEVQNDKAVVEIPSPVKGKVLELKVEEGTVATVGQTIITFDAPG
YEDLQFKGSDESDDAKTEAQVQSTAEAGQDVAKEEQAQEPAKATGAGQQDQAEVDPNKRVIAMPSVRKYAREKGVDIRKV
TGSGNNGRVVKEDIDSFVNGGAQEAAPQETAAPQETAAKPAAAPAPEGEFPETREKMSGIRKAIAKAMVNSKHTAPHVTL
MDEVDVTNLVAHRKQFKQVAADQGIKLTYLPYVVKALTSALKKFPVLNTSIDDKTDEVIQKHYFNIGIAADTEKGLLVPV
VKNADRKSVFEISDEINGLATKAREGKLAPAEMKGASCTITNIGSAGGQWFTPVINHPEVAILGIGRIAEKAIVRDGEIV
AAPVLALSLSFDHRMIDGATAQNALNHIKRLLNDPQLILMEA
>Q8NNJ2 2.3.1.12~~~aceF~~~Dihydrolipoyllysine-residue acetyltransferase component of pyruvate dehydrogenase complex~~~COG0508
MAFSVEMPELGESVTEGTITQWLKSVGDTVEVDEPLLEVSTDKVDTEIPSPVAGVILEIKAEEDDTVDVGGVIAIIGDAD
ETPANEAPADEAPAPAEEEEPVKEEPKKEAAPEAPAATGAATDVEMPELGESVTEGTITQWLKAVGDTVEVDEPLLEVST
DKVDTEIPSPVAGTIVEILADEDDTVDVGAVIARIGDANAAAAPAEEEAAPAEEEEPVKEEPKKEAAPEAPAATGAATDV
EMPELGESVTEGTITQWLKAVGDTVEVDEPLLEVSTDKVDTEIPSPVAGTIVEILADEDDTVDVGAVIARIGDANAAAAP
AEEEAAPAEEEEPVKEEPKKEEPKKEEPKKEAATTPAAASATVSASGDNVPYVTPLVRKLAEKHGVDLNTVTGTGIGGRI
RKQDVLAAANGEAAPAEAAAPVSAWSTKSVDPEKAKLRGTTQKVNRIREITAMKTVEALQISAQLTQLHEVDMTRVAELR
KKNKPAFIEKHGVNLTYLPFFVKAVVEALVSHPNVNASFNAKTKEMTYHSSVNLSIAVDTPAGLLTPVIHDAQDLSIPEI
AKAIVDLADRSRNNKLKPNDLSGGTFTITNIGSEGALSDTPILVPPQAGILGTGAIVKRPVVITEDGIDSIAIRQMVFLP
LTYDHQVVDGADAGRFLTTIKDRLETANFEGDLQL
>P06959 2.3.1.12~~~aceF~~~Dihydrolipoyllysine-residue acetyltransferase component of pyruvate dehydrogenase complex~~~COG0508
MAIEIKVPDIGADEVEITEILVKVGDKVEAEQSLITVEGDKASMEVPSPQAGIVKEIKVSVGDKTQTGALIMIFDSADGA
ADAAPAQAEEKKEAAPAAAPAAAAAKDVNVPDIGSDEVEVTEILVKVGDKVEAEQSLITVEGDKASMEVPAPFAGTVKEI
KVNVGDKVSTGSLIMVFEVAGEAGAAAPAAKQEAAPAAAPAPAAGVKEVNVPDIGGDEVEVTEVMVKVGDKVAAEQSLIT
VEGDKASMEVPAPFAGVVKELKVNVGDKVKTGSLIMIFEVEGAAPAAAPAKQEAAAPAPAAKAEAPAAAPAAKAEGKSEF
AENDAYVHATPLIRRLAREFGVNLAKVKGTGRKGRILREDVQAYVKEAIKRAEAAPAATGGGIPGMLPWPKVDFSKFGEI
EEVELGRIQKISGANLSRNWVMIPHVTHFDKTDITELEAFRKQQNEEAAKRKLDVKITPVVFIMKAVAAALEQMPRFNSS
LSEDGQRLTLKKYINIGVAVDTPNGLVVPVFKDVNKKGIIELSRELMTISKKARDGKLTAGEMQGGCFTISSIGGLGTTH
FAPIVNAPEVAILGVSKSAMEPVWNGKEFVPRLMLPISLSFDHRVIDGADGARFITIINNTLSDIRRLVM
>P11961 2.3.1.12~~~pdhC~~~Dihydrolipoyllysine-residue acetyltransferase component of pyruvate dehydrogenase complex~~~
MAFEFKLPDIGEGIHEGEIVKWFVKPGDEVNEDDVLCEVQNDKAVVEIPSPVKGKVLEILVPEGTVATVGQTLITLDAPG
YENMTFKGQEQEEAKKEEKTETVSKEEKVDAVAPNAPAAEAEAGPNRRVIAMPSVRKYAREKGVDIRLVQGTGKNGRVLK
EDIDAFLAGGAKPAPAAAEEKAAPAAAKPATTEGEFPETREKMSGIRRAIAKAMVHSKHTAPHVTLMDEADVTKLVAHRK
KFKAIAAEKGIKLTFLPYVVKALVSALREYPVLNTSIDDETEEIIQKHYYNIGIAADTDRGLLVPVIKHADRKPIFALAQ
EINELAEKARDGKLTPGEMKGASCTITNIGSAGGQWFTPVINHPEVAILGIGRIAEKPIVRDGEIVAAPMLALSLSFDHR
MIDGATAQKALNHIKRLLSDPELLLMEA
>P75392 2.3.1.12~~~pdhC~~~Dihydrolipoyllysine-residue acetyltransferase component of pyruvate dehydrogenase complex~~~
MANEFKFTDVGEGLHEGKVTEILKKVGDTIKVDEALFVVETDKVTTELPSPYAGVITAITTNVGDVVHIGQVMAVIDDGA
GAAAPAAPQPVSAPAPAPTPTFTPTPAPVTTEPVVEEAGASVVGEIKVSNSVFPIFGVQPSAPQPTPAPVVQPTSAPTPT
PAPASAAAPSGEETIAITTMRKAIAEAMVKSHENIPATILTFYVNATKLKQYRESVNGLALSKYNMKISFFAFFVKAIVN
ALKKFPVFNGRYDKERNLIVLNKDVNVGIAVDTPDGLIVPNIKQAQTKSVVDIAKDIVDLANRARSKQIKLPDLSKGTIS
VTNFGSLGAAFGTPIIKHPEMCIVATGNMEERVVRAEGGVAVHTILPLTIAADHRWVDGADVGRFGKEIAKQIEELIDLE
VA
>P9WIS7 2.3.1.12~~~dlaT~~~Dihydrolipoyllysine-residue acetyltransferase component of pyruvate dehydrogenase complex~~~COG0508
MAFSVQMPALGESVTEGTVTRWLKQEGDTVELDEPLVEVSTDKVDTEIPSPAAGVLTKIIAQEDDTVEVGGELAVIGDAK
DAGEAAAPAPEKVPAAQPESKPAPEPPPVQPTSGAPAGGDAKPVLMPELGESVTEGTVIRWLKKIGDSVQVDEPLVEVST
DKVDTEIPSPVAGVLVSISADEDATVPVGGELARIGVAADIGAAPAPKPAPKPVPEPAPTPKAEPAPSPPAAQPAGAAEG
APYVTPLVRKLASENNIDLAGVTGTGVGGRIRKQDVLAAAEQKKRAKAPAPAAQAAAAPAPKAPPAPAPALAHLRGTTQK
ASRIRQITANKTRESLQATAQLTQTHEVDMTKIVGLRARAKAAFAEREGVNLTFLPFFAKAVIDALKIHPNINASYNEDT
KEITYYDAEHLGFAVDTEQGLLSPVIHDAGDLSLAGLARAIADIAARARSGNLKPDELSGGTFTITNIGSQGALFDTPIL
VPPQAAMLGTGAIVKRPRVVVDASGNESIGVRSVCYLPLTYDHRLIDGADAGRFLTTIKHRLEEGAFEADLGL
>Q59638 2.3.1.12~~~aceF~~~Dihydrolipoyllysine-residue acetyltransferase component of pyruvate dehydrogenase complex~~~
MSELIRVPDIGNGEGEVIELLVKPGDKVEADQSLLTLESDKASMEIPSPKAGVVKSIKAKVGDTLKEGDEILELEVEGGE
QPAEAKAEAAPAQPEAPKAEAPAPAPSESKPAAPAAASVQDIKVPDIGSAGKANVIEVMVKAGDTVEADQSLITLESDKA
SMEIPSPASGVVESVSIKVGDEVGTGDLILKLKVEGAAPAAEEQPAAAPAQAAAPAAEQKPAAAAPAPAKADTPAPVGAP
SRDGAKVHAGPAVRMLAREFGVELSEVKASGPKGRILKEDVQVFVKEQLQRAKSGGAGATGGAGIPPIPEVDFSKFGEVE
EVAMTRLMQVGAANLHRSWLNVPHVTQFDQSDITDMEAFRVAQKAAAEKAGVKLTVLPILLKACAHLLKELPDFNSSLAP
SGKALIRKKYVHIGFAVDTPDGLLVPVIRDVDRKSLLQLAAEAADLADKARNKKLSADAMQGACFTISSLGHIGGTGFTP
IVNAPEVAILGVSKATMQPVWDGKAFQPRLMLPLSLSYDHRVINGAAAARFTKRLGELLADIRTLLL
>P65636 2.3.1.12~~~pdhC~~~Dihydrolipoyllysine-residue acetyltransferase component of pyruvate dehydrogenase complex~~~
MAFEFRLPDIGEGIHEGEIVKWFVKAGDTIEEDDVLAEVQNDKSVVEIPSPVSGTVEEVMVEEGTVAVVGDVIVKIDAPD
AEDMQFKGHDDDSSSKEEPAKEEAPAEQAPVATQTEEVDENRTVKAMPSVRKYAREKGVNIKAVSGSGKNGRITKEDVDA
YLNGGAPTASNESAASATSEEVAETPAAPAAVSLEGDFPETTEKIPAMRRAIAKAMVNSKHTAPHVTLMDEIDVQALWDH
RKKFKEIAAEQGTKLTFLPYVVKALVSALKKYPALNTSFNEEAGEIVHKHYWNIGIAADTDRGLLVPVVKHADRKSIFQI
SDEINELAVKARDGKLTADEMKGATCTISNIGSAGGQWFTPVINHPEVAILGIGRIAQKPIVKDGEIVAAPVLALSLSFD
HRQIDGATGQNAMNHIKRLLNNPELLLMEG
>Q4MTG0 1.2.4.1~~~pdhA~~~Pyruvate dehydrogenase E1 component subunit alpha~~~COG1071
MGTKTKKTLFNVDEQMKAIAAQFETLQILNEKGEVVNEAAMPELSDDQLKELMRRMVYTRVLDQRSISLNRQGRLGFYAP
TAGQEASQLASHFALEAEDFILPGYRDVPQLVWHGLPLYQAFLFSRGHFMGNQMPENVNALAPQIIIGAQIIQTAGVALG
MKLRGKKSVAITYTGDGGASQGDFYEGMNFAGAFKAPAIFVVQNNRYAISTPVEKQSAAKTVAQKAVAAGIYGIQVDGMD
PLAVYAATAFARERAVNGEGPTLIETLTFRYGPHTMAGDDPTRYRTKDIENEWEQKDPIVRFRAFLENKGLWSQEVEEKV
IEEAKEDIKQAIAKADQAPKQKVTDLMEIMYEKMPYNLAEQYEIYKEKESK
>P21881 1.2.4.1~~~pdhA~~~Pyruvate dehydrogenase E1 component subunit alpha~~~COG1071
MAAKTKKAIVDSKKQFDAIKKQFETFQILNEKGEVVNEAAMPDLTDDQLKELMRRMVFTRVLDQRSISLNRQGRLGFYAP
TAGQEASQIATHFALEKEDFVLPGYRDVPQLIWHGLPLYQAFLFSRGHFRGNQMPDDVNALSPQIIIGAQYIQTAGVALG
LKKRGKKAVAITYTGDGGASQGDFYEGINFAGAYKAPAIFVVQNNRYAISTPVEKQSAAETIAQKAVAAGIVGVQVDGMD
PLAVYAATAEARERAINGEGPTLIETLTFRYGPHTMAGDDPTKYRTKEIENEWEQKDPLVRFRAFLENKGLWSEEEEAKV
IEDAKEEIKQAIKKADAEPKQKVTDLMKIMYEKMPHNLEEQFEIYTQKESK
>P21873 1.2.4.1~~~pdhA~~~Pyruvate dehydrogenase E1 component subunit alpha~~~
MGVKTFQFPFAEQLEKVAEQFPTFQILNEEGEVVNEEAMPELSDEQLKELMRRMVYTRILDQRSISLNRQGRLGFYAPTA
GQEASQIASHFALEKEDFILPGYRDVPQIIWHGLPLYQAFLFSRGHFHGNQIPEGVNVLPPQIIIGAQYIQAAGVALGLK
MRGKKAVAITYTGDGGTSQGDFYEGINFAGAFKAPAIFVVQNNRFAISTPVEKQTVAKTLAQKAVAAGIPGIQVDGMDPL
AVYAAVKAARERAINGEGPTLIETLCFRYGPHTMSGDDPTRYRSKELENEWAKKDPLVRFRKFLEAKGLWSEEEENNVIE
QAKEEIKEAIKKADETPKQKVTDLISIMFEELPFNLKEQYEIYKEKESK
>P75390 1.2.4.1~~~pdhA~~~Pyruvate dehydrogenase E1 component subunit alpha~~~
MAILIKNKVPTTLYQVYDNEGKLMDPNHKITLSNEQLKHAFYLMNLSRIMDKKMLVWQRAGKMLNFAPNLGEEALQVGMG
MGLNENDWFCPTFRSGALMLYRGVKPEQLLLYWNGNENGSKIEAKYKTLPINITIGAQYSHAAGLGYMLHYKKLPNVAVT
MIGDGGTAEGEFYEAMNIASIHKWNSVFCINNNQFAISTRTKLESAVSDLSTKAIAVNIPRIRVDGNDLIASYEAMHEAA
NYARSGNGPVLIEFFSWRQGPHTTSDDPSIYRTKEEEAEAMKSDPVKRLRNFLFDRGILTPQQEEEMVAKIEQEVQAAYE
VMVSKTPVTLDEVFDYNYEKLTPDLARQKAEAKKYFKD
>Q820A6 1.2.4.1~~~pdhA~~~Pyruvate dehydrogenase E1 component subunit alpha~~~
MAPKLQAQFDAVKVLNDTQSKFEMVQILDENGNVVNEDLVPDLTDEQLVELMERMVWTRILDQRSISLNRQGRLGFYAPT
AGQEASQLASQYALEKEDYILPGYRDVPQIIWHGLPLTEAFLFSRGHFKGNQFPEGVNALSPQIIIGAQYIQAAGVAFAL
KKRGKNAVAITYTGDGGSSQGDFYEGINFAAAYKAPAIFVIQNNNYAISTPRSKQTAAETLAQKAIAVGIPGIQVDGMDA
LAVYQATKEARDRAVAGEGPTLIETMTYRYGPHTMAGDDPTRYRTSDEDAEWEKKDPLVRFRKFLENKGLWNEDKENEVI
ERAKADIKAAIKEADNTEKQTVTSLMEIMYEDMPQNLAEQYEIYKEKESK
>P35488 1.2.4.1~~~pdhB~~~Pyruvate dehydrogenase E1 component subunit beta~~~
MAIITLLEAINQAIDQAMEKDESIVVFGEDAGFEGGVFRVTAGLQKKYGETRVFDTPIAESAIVGSAVGMAINGLKPIAE
IQFDGFIFPGYTDLVTHAARMRNRSRGQFTVPMVLRLPHGGGIRALEHHSEALEVLFGSIPGLKVVTPSTPYDAKGLLLA
AINDPDPVVFLEPKRIYRAGKQEVPAEMYEIPIGKAKVVKQGTDMTVVAWGSIVREVEKAVKLVEAEGISVEIIDLRTIS
PIDEETILNSVKKTGKFMVVTEAVKSYGPAAELITMVNEKAFFHLEAAPVRFTGFDITVPLARGEHYHFPQPEKIAAYIR
KLAKARP
>P21874 1.2.4.1~~~pdhB~~~Pyruvate dehydrogenase E1 component subunit beta~~~
MAQMTMVQAITDALRIELKNDPNVLIFGEDVGVNGGVFRATEGLQAEFGEDRVFDTPLAESGIGGLAIGLALQGFRPVPE
IQFFGFVYEVMDSICGQMARIRYRTGGRYHMPITIRSPFGGGVHTPELHSDSLEGLVAQQPGLKVVIPSTPYDAKGLLIS
AIRDNDPVIFLEHLKLYRSFRQEVPEGEYTIPIGKADIKREGKDITIIAYGAMVHESLKAAAELEKEGISAEVVDLRTVQ
PLDIETIIGSVEKTGRAIVVQEAQRQAGIAANVVAEINERAILSLEAPVLRVAAPDTVYPFAQAESVWLPNFKDVIETAK
KVMNF
>P75391 1.2.4.1~~~pdhB~~~Pyruvate dehydrogenase E1 component subunit beta~~~
MSKTIQANNIEALGNAMDLALERDPNVVLYGQDAGFEGGVFRATKGLQKKYGEERVWDCPIAEAAMAGIGVGAAIGGLKP
IVEIQFSGFSFPAMFQIFTHAARIRNRSRGVYTCPIIVRMPMGGGIKALEHHSETLEAIYGQIAGLKTVMPSNPYDTKGL
FLAAVESPDPVVFFEPKKLYRAFRQEIPADYYTVPIGQANLISQGNNLTIVSYGPTMFDLINMVYGGELKDKGIELIDLR
TISPWDKETVFNSVKKTGRLLVVTEAAKTFTTSGEIIASVTEELFSYLKAAPQRVTGWDIVVPLARGEHYQFNLNARILE
AVNQLLK
>P99063 1.2.4.1~~~pdhB~~~Pyruvate dehydrogenase E1 component subunit beta~~~
MAQMTMVQAINDALKTELKNDQDVLIFGEDVGVNGGVFRVTEGLQKEFGEDRVFDTPLAESGIGGLAMGLAVEGFRPVME
VQFLGFVFEVFDAIAGQIARTRFRSGGTKTAPVTIRSPFGGGVHTPELHADNLEGILAQSPGLKVVIPSGPYDAKGLLIS
SIRSNDPVVYLEHMKLYRSFREEVPEEEYTIDIGKANVKKEGNDISIITYGAMVQESMKAAEELEKDGYSVEVIDLRTVQ
PIDVDTIVASVEKTGRAVVVQEAQRQAGVGAAVVAELSERAILSLEAPIGRVAAADTIYPFTQAENVWLPNKNDIIEKAK
ETLEF
>P0A0A3 1.2.4.1~~~pdhB~~~Pyruvate dehydrogenase E1 component subunit beta~~~
MAQMTMVQAINDALKTELKNDQDVLIFGEDVGVNGGVFRVTEGLQKEFGEDRVFDTPLAESGIGGLAMGLAVEGFRPVME
VQFLGFVFEVFDAIAGQIARTRFRSGGTKTAPVTIRSPFGGGVHTPELHADNLEGILAQSPGLKVVIPSGPYDAKGLLIS
SIRSNDPVVYLEHMKLYRSFREEVPEEEYTIDIGKANVKKEGNDISIITYGAMVQESMKAAEELEKDGYSVEVIDLRTVQ
PIDVDTIVASVEKTGRAVVVQEAQRQAGVGAAVVAELSERAILSLEAPIGRVAAADTIYPFTQAENVWLPNKNDIIEKAK
ETLEF
>Q89ZI2 3.2.1.169~~~~~~O-GlcNAcase BT_4395~~~COG3525
MKNNKIYLLGACLLCAVTTFAQNVSLQPPPQQLIVQNKTIDLPAVYQLNGGEEANPHAVKVLKELLSGKQSSKKGMLISI
GEKGDKSVRKYSRQIPDHKEGYYLSVNEKEIVLAGNDERGTYYALQTFAQLLKDGKLPEVEIKDYPSVRYRGVVEGFYGT
PWSHQARLSQLKFYGKNKMNTYIYGPKDDPYHSAPNWRLPYPDKEAAQLQELVAVANENEVDFVWAIHPGQDIKWNKEDR
DLLLAKFEKMYQLGVRSFAVFFDDISGEGTNPQKQAELLNYIDEKFAQVKPDINQLVMCPTEYNKSWSNPNGNYLTTLGD
KLNPSIQIMWTGDRVISDITRDGISWINERIKRPAYIWWNFPVSDYVRDHLLLGPVYGNDTTIAKEMSGFVTNPMEHAES
SKIAIYSVASYAWNPAKYDTWQTWKDAIRTILPSAAEELECFAMHNSDLGPNGHGYRREESMDIQPAAERFLKAFKEGKN
YDKADFETLQYTFERMKESADILLMNTENKPLIVEITPWVHQFKLTAEMGEEVLKMVEGRNESYFLRKYNHVKALQQQMF
YIDQTSNQNPYQPGVKTATRVIKPLIDRTFATVVKFFNQKFNAHLDATTDYMPHKMISNVEQIKNLPLQVKANRVLISPA
NEVVKWAAGNSVEIELDAIYPGENIQINFGKDAPCTWGRLEISTDGKEWKTVDLKQKESRLSAGLQKAPVKFVRFTNVSD
EEQQVYLRQFVLTIEKK
>Q0TR53 3.2.1.169~~~nagJ~~~O-GlcNAcase NagJ~~~COG3291
MKRKMLKRLLTSAFACMFIANGLITTTVRAVGPKTGEENQVLVPNLNPTPENLEVVGDGFKITSSINLVGEEEADENAVN
ALREFLTANNIEINSENDPNSTTLIIGEVDDDIPELDEALNGTTAENLKEEGYALVSNDGKIAIEGKDGDGTFYGVQTFK
QLVKESNIPEVNITDYPTVSARGIVEGFYGTPWTHQDRLDQIKFYGENKLNTYIYAPKDDPYHREKWREPYPESEMQRMQ
ELINASAENKVDFVFGISPGIDIRFDGDAGEEDFNHLITKAESLYDMGVRSFAIYWDDIQDKSAAKHAQVLNRFNEEFVK
AKGDVKPLITVPTEYDTGAMVSNGQPRAYTRIFAETVDPSIEVMWTGPGVVTNEIPLSDAQLISGIYNRNMAVWWNYPVT
DYFKGKLALGPMHGLDKGLNQYVDFFTVNPMEHAELSKISIHTAADYSWNMDNYDYDKAWNRAIDMLYGDLAEDMKVFAN
HSTRMDNKTWAKSGREDAPELRAKMDELWNKLSSKEDASALIEELYGEFARMEEACNNLKANLPEVALEECSRQLDELIT
LAQGDKASLDMIVAQLNEDTEAYESAKEIAQNKLNTALSSFAVISEKVAQSFIQEALSFDLTLINPRTVKITASSEETSG
ENAPASFASDGDMNTFWHSKWSSPAHEGPHHLTLELDNVYEINKVKYAPRQDSKNGRITGYKVSVSLDGENFTEVKTGTL
EDNAAIKFIEFDSVDAKYVRLDVTDSVSDQANGRGKFATAAEVNVHGKLKENAEVTGSVSLEALEEVQVGENLEVGVGID
ELVNAEAFAYDFTLNYDENAFEYVEAISDDGVFVNAKKIEDGKVRVLVSSLTGEPLPAKEVLAKVVLRAEAKAEGSNLSV
TNSSVGDGEGLVHEIAGTEKTVNIIEGTSPEIVVNPVRDFKASEINKKNVTVTWTEPETTEGLEGYILYKDGKKVAEIGK
DETSYTFKKLNRHTIYNFKIAAKYSNGEVSSKESLTLRTAR
>Q8XL08 3.2.1.169~~~nagJ~~~O-GlcNAcase NagJ~~~
MKRKMLKRLLTSAFACMFIANGLITTTVRAVGPKTGEENQVLVPNLNPTPENLEVVGDGFKITSSINLVGEEEADENAVN
ALREFLTANNIEINSENDPNSTTLIIGEVDDDIPELDEALNGTTAENLKEEGYALVSNDGKIAIEGKDGDGTFYGVQTFK
QLVKESNIPEVNITDYPTVSARGIVEGFYGTPWTHKDRLDQIKFYGENKLNTYIYAPKDDPYHREKWREPYPENEMQRMQ
ELIDASAENKVDFVFGISPGIDIRFDGEAGEEDFNHLIAKAESLYDMGVRSFAIYWDDIQDKSAAKHAQVLNRFNEEFVK
AKGDVKPLITVPTEYDTGAMVSNGQPRTYTRIFAETVDPSIEVMWTGPGVVTNEIPLSDAQLISGIYNRNMAVWWNYPVT
DYFKGKLALGPMHGLDKGLNQYVDFFTVNPMEHAELSKISIHTAADYSWNMDNYDYDKAWNRAIDMLYGDLAEDMKVFAN
HSTRMDNKTWAKSGREDAPELRAKMDELWNKLSSKEDASALIEELYGEFARMEEACNNLKANLPEVALEECSRQLDELIT
LAQGDKASLDMIVAQLNEDTEAYESAKEIAQNKLNTALSSFAVISEKVAQSFIQEALSFDLTLINPRTVKITASSEETSG
ENAPASFASDGDMNTFWHSKWSSPAHEGPHHLTLELDNVYEINKVKYAPRQDSKNGRITGYKVSVSLDGENFTEVKTGTL
EDNAAIKFIEFDSVDAKYVRLDVTDSVSDQANGRGKFATAAEVNVHGKLKEAAEVTGSVSLEALEEVQVGENIEVGVGID
ELVNAEAFAYDFTLNYDENAFEYVEAISDDGVFVNAKKIEDGKVRVLVSSLTGEPLPAKEVLAKVVLRAEAKTEGSNLSV
TNSSVGDGEGLVHEIAGTEKTVNIIEGTSPEIVVNPVRDFKASEINKKNVTVTWTEPETTEGLEGYILYKDGKKVAEIGK
DETSYTFKKLNRHTIYNFKIAAKYSNGEVSSKESLTLRTAR
>Q2CEE3 3.2.1.169~~~~~~Protein O-GlcNAcase~~~COG3525
MLTGVIEGFYGRDWRRDERATVMDWIAAAGMNTYIYGPKDDVHVRARWRVPYDAAGLARLTELRDAAAARGMVFYVSLAP
CLDVTYSDPQDRAALLARVDQLARAGLRNLVLLFDDIPSVLPEADRHRFDSFAEAQADLSNMVLRHLRGAGHVVFCPTEY
CGRMAGGDPRGSAYLQRLGSTLDPAIDIFWTGPEIVSEEIVAAHLAAVGEVLRRRPVIWDNFHANDYDIRRVFAGPLGGR
SRDILPLVAGWITNPNNEAEANFPAIHTTGAYLADPDYAPERAIAAAVAAWQPRFRLAFGDGAVPSDLVALLCDLFWQPF
ALGPETTRILSALRAALTVPRPDPSDPAWRAALEDLRDLKRRINKLFTLMTEIENRDLFHTFHNYLWEAQEEVGHLVAYC
DWLDEAPPPGAVFPATDRIHNFYRRGFGVAVQDILQRDRQGRYHHGV
>Q9X2E1 ~~~ogg~~~8-oxoguanine DNA glycosylase/AP lyase~~~COG1059
MEELLKELERIREEAKPLVEQRFEEFKRLGEEGTEEDLFCELSFCVLTANWSAEGGIRAQKEIGKGFVHLPLEELAEKLR
EVGHRYPQKRAEFIVENRKLLGKLKNLVKGDPFQSREFLVRNAKGIGWKEASHFLRNTGVEDLAILDKHVLRLMKRHGLI
QEIPKGWSKKRYLYVEEILRKVAEAFGESPGKFDLYLWYLVKGKVDK
>P21258 4.2.2.6~~~ogl~~~Oligogalacturonate lyase~~~COG0823
MAKGKKLSFSFHTYQDSVTGTEVVRLTPPDVICHRNYFYQKCFSNDGSKLLFGGAFDGPWNYYLLDLKTQQATQLTEGTG
DNTFGGFLSPDDDALYYVKNVRNLMRVDLNTLEETNIYQVPDDWVGYGTWVANSDCTKMVGIEIKKEDWKPLTDWKKFQE
FYFTNPCCRLIRIDLKTGEATTILKENQWLGHPIYRPGDDNTVAFCHEGPHDLVDARMWFINEDGSNMRKVKEHAPGESC
THEFWVPNGSALAYVSYLKGSTNRFICSVDPVTLENRQLTEMPPCSHLMSNYDGTLMVGDGCNAPVDVKDDGGYKIENDP
FLYVFNMKTGKHFQVAQHNTSWEVLEGDRQVTHPHPSFTPDDKHILFTSDVDGKPALYLAKVPDSVWQ
>P11742 2.1.1.63~~~ogt~~~Methylated-DNA--protein-cysteine methyltransferase, constitutive~~~COG0350
MNYYTTAETPLGELIIAEEEDRITRLFLSQEDWVDWKETVQNTEHKETPNLAEAKQQLQEYFAGERKTFSLPLSQKGTPF
QQKVWQALERIPYGESRSYADIAAAVGSPKAVRAVGQANKRNDLPIFVPCHRVIGKNSALTGYAGSKTEIKAFLLNIERI
SYKEK
>P0AFH0 2.1.1.63~~~ogt~~~Methylated-DNA--protein-cysteine methyltransferase~~~COG0350
MLRLLEEKIATPLGPLWVICDEQFRLRAVEWEEYSERMVQLLDIHYRKEGYERISATNPGGLSDKLREYFAGNLSIIDTL
PTATGGTPFQREVWKTLRTIPCGQVMHYGQLAEQLGRPGAARAVGAANGSNPISIVVPCHRVIGRNGTMTGYAGGVQRKE
WLLRHEGYLLL
>P9WJW5 2.1.1.63~~~ogt~~~Methylated-DNA--protein-cysteine methyltransferase~~~COG0350
MIHYRTIDSPIGPLTLAGHGSVLTNLRMLEQTYEPSRTHWTPDPGAFSGAVDQLNAYFAGELTEFDVELDLRGTDFQQRV
WKALLTIPYGETRSYGEIADQIGAPGAARAVGLANGHNPIAIIVPCHRVIGASGKLTGYGGGINRKRALLELEKSRAPAD
LTLFD
>O34762 ~~~ohrA~~~Organic hydroperoxide resistance protein OhrA~~~COG1764
MSQPLFTATVSAVGGREGKVISSDRVLELDVAMPGTPRAKKLEKATNPEQLFAAGYAACFDSALQLVARTERVKVETEVT
ANVSLLKDEADQGYKLGVTLQVKGEGVSASELEALVKKAHGVCPYSKATSGNIDVTLEVAE
>P80242 ~~~ohrB~~~Organic hydroperoxide resistance protein OhrB~~~COG1764
MALFTAKVTARGGRAGHITSDDGVLDFDIVMPNAKKEGQTGTNPEQLFAAGYAACFGGALEHVAKEQNIEIDSEIEGQVS
LMKDESDGGFKIGVTLVVNTKDLDREKAQELVNAAHEFCPYSKATRGNVDVKLELK
>Q7A6M9 ~~~~~~Organic hydroperoxide resistance protein-like~~~
MAIHYETKATNVGGRKGHVYTDDRALDIDIVPPAQADGKATNPEQLFAAGYASCFNGAFDLILKQNKVRDAHPEVTLTVR
LEDDSDSESPKLSVSIDATIKNVISQEEAEKYLQMAHEFCPYSKATQGNINVDLNVNVVD
>O34777 ~~~ohrR~~~Organic hydroperoxide resistance transcriptional regulator~~~COG1846
MENKFDHMKLENQLCFLLYASSREMTKQYKPLLDKLNITYPQYLALLLLWEHETLTVKKMGEQLYLDSGTLTPMLKRMEQ
QGLITRKRSEEDERSVLISLTEDGALLKEKAVDIPGTILGLSKQSGEDLKQLKSALYTLLETLHQKN
>Q6D8V4 4.1.1.120~~~oiaC~~~3-oxo-isoapionate decarboxylase~~~COG1082
MAIGLSTYAFFWRASSRVPNPLGLAAMLEQTAESGAGVFQICDYAAVEALSPAELEKLRQRAVDLGIQLELGTRGLATDH
LTRYLTMARALDVRFIRTMFNSATHKPTQDEALALLRCVLPEFEQYNIQLGLETYEQVKTRDVLAVVDAIDSPALGICLD
PGNCVAALEYPHEVIELTASRVVNLHIKDFAFARQEGWVGFTYSGCLLGTGLLDYDALHQTIRPNERNINQIVEHWLPWQ
ASAEETCRLEDAWTRHSLNYLYTRNPYANRSSHIL
>B1G889 2.7.1.231~~~oiaK~~~3-oxo-isoapionate kinase~~~
MNGTEPAEPTNGTNATAWPAGLLLAYYGDDFTGSTDAMEAMQAAGVPTVLCLQKPTPELLARFPEVRCVGMAGSSRGRSS
AWMDDELPDVLASLAALGAPILQYKVCSTFDSSPEVGSIGRAIDIGVRHMPGNWSPMVIGAPRLKRYQMFGNLFAAVDGV
GYRLDRHPTMSRHPVTPMNEADLRLHLARQTARRIELIDMLELRGADVATRVRALCAPDMPVVLIDVLDEETLAEAGRLV
WEQRGEGIFTASSSGLQYALAAHWRARGLLPPTPSLPAADPVQAIAAVSGSCSPVTAAQIGWARAHGFHTERLDLPRALD
SRDGAAEIERVVTAATQALTRGISVIVHSAEGPDDPAVTGFDAIASAAGFARHDAARKVGRALAEVMRRLLDSVELTRVV
VAGGDSSGEVASVLGIDALSVMAGLVPGAPLCRAWSAEPRRDGLQIVLKGGQIGDATFFGMVREGRLAGA
>A7IJG7 3.7.1.28~~~oiaT~~~3-oxo-isoapionate-4-phosphate transcarboxylase/hydrolase~~~COG1850
MSERVYATYWMETGGDPARTAEVIAGEQSSGTFVALATETAELKERSGARVERLDILDTADIPSLPGGMASDRYTRAILE
LSWPVENFGPSLPNLMSTIAGNLFELHQVSGLRLIDLKLPPSFTNAFAGPAFGIAGTRKLAGVAQGPIIGTIIKPSIGLT
PEETAQQVRELIAGDIDFIKDDELQADGARCPFEARVKAVMRVVNDAADRRGRKVMVAFNITGDLDEMRRRHDLVLAEGG
TCVMVCLNSIGLVGVREIRRHTQLPIHGHRAGWGYLYRCPSLGWDYAPWQQLWRLAGVDHLHVNGLDNKFSEANASVIAA
ARAVLSPLNHAAPMGAMPVFSSGQTGRQAAETYAAIGCADLIHTAGGGIFGHPAGVPAGVEALRAAWRAAMAGASLEDEA
TRSPALRSALGFWR
>B9JK73 4.1.1.121~~~oiaX~~~3-oxo-isoapionate-4-phosphate decarboxylase~~~COG1850
MSITITYRIETPGSIEAMADKIASDQSTGTFVPVPGETEELKSRVAARVLGIRQLEDAKRPTWPEVAEGHGPLRRADVDI
AFPLDAIGTDLSALMTIAIGGVFSIKGMTGIRIVDMKLPNAFRGAHPGPQFGVAGSKRLTGVEGRPIIGTIVKPALGLRP
VETAELVGELINSGVDFIKDDEKLMSPAYSPLKERVAAIMPRILDHEQKTGKKVMYAFGISHADPDEMMRNHDLVLEAGG
NCAVVNINSIGFGGMSFLRKRSGLVLHAHRNGWDVLTRDPGAGMDFKVYQQFWRLLGVDQFQINGIRVKYWEPDESFIES
FKAVSTPLFDPSDCPLPVAGSGQWGGQAPETYQRTGRTTDLLYLCGGGIVSHPSGPAAGVRAVQQAWEAAVADIPLANYA
KDHPELAASIAKFSDGKGA
>Q2JZQ0 4.1.1.121~~~oiaX~~~3-oxo-isoapionate-4-phosphate decarboxylase~~~
MITLTYRIETPGSVETMADKIASDQSTGTFVPVPGETEELKSRVAARVLAIRPLENARHPTWPESAPDTLLHRADVDIAF
PLEAIGTDLSALMTIAIGGVYSIKGMTGIRIVDMKLPEAFRSAHPGPQFGIAGSRRLTGVEGRPIIGTIVKPALGLRPHE
TAELVGELIGSGVDFIKDDEKLMSPAYSPLKERVAAIMPRILDHEQKTGKKVMYAFGISHADPDEMMRNHDIVAAAGGNC
AVVNINSIGFGGMSFLRKRSSLVLHAHRNGWDVLTRDPGAGMDFKVYQQFWRLLGVDQFQINGIRIKYWEPDESFVSSFK
AVSTPLFDAADCPLPVAGSGQWGGQAPETYERTGRTIDLLYLCGGGIVSHPGGPAAGVRAVQQAWQAAVAGIPLEVYAKD
HPELAASIAKFSDGKGA
>B3PEE6 2.4.1.161~~~~~~Oligosaccharide 4-alpha-D-glucosyltransferase~~~COG1501
MFRRIAGFSPIFLMLFGSSLPTMGNPVKREIHPDAVFYKEHKLRNDGLVITTNQGNIRLQFKSEAAIEVLYRADSKQLPS
FALAQPESAIKAQLTETENHLQFSGGTLTARIQKRPFAISYYRDSELLLAEESGFQVNTDKINFRFYLSPGEKILGGGQR
ILGMDRRGQRFPLYNRAHYGYSDHSGQMYFGLPAIMSSKQYILVFDNSASGAMDIGKTESDILQLEAKSGRSAYILVAGN
SYPSLIENFTQVTGRQPLPPRWALGSFASRFGYRSEAETRATVQKYKTEDFPLDTIVLDLYWFGKDIKGHMGNLDWDKEN
FPTPLDMMADFKQQGVKTVLITEPFVLTSSKRWDDAVKAKALAKDPQGQPKAFELYFGNGGIIDVFSKEGSRWFSSIYKD
LSKQGVAGWWGDLGEPEMHPEDTQHAIGDADTVHNAYGHRWAEMLYQQQLDQFPELRPFIMMRAGFVGSQRYGMIPWTGD
VSRTWGGLASQVELALQMSLLGFGYIHSDLGGFADGETLDKEMYIRWLQYGVFQPVYRPHGQDHIPSEPVFQDEETKAIL
RPLVKLRYRMLPYIYTAAYQNTLTGMPLMRPLFFSDEKNPALIDNKTSYFWGDSLLVTPITQAGVESVSIPAPKGVWFDF
WKDTRYQTDGAPLTLPTDLHTIPVLVKAGAFMPYVPAVSTTEDYRSDSLEIHYYADASVPLAQGEIFEDDGKDPNSIKRN
QFDLLTLQATHTDNQLHFQLARTGKGYRGMPERRATTLVIHNASDQYQHLDINGKTIAIAQADCASTPALACYDQERRQL
QLVFTWGREALNLRLH
>P0DV58 3.1.-.-~~~old~~~Retron Eco8 OLD nuclease~~~
MTIESIRVKNLLSFDDVILRDFRDINCIIGRNNVGKSNLLKVIRYFYAKLENKKVIPLDFHTNYNAVGEITFTFDTTRIK
KIVTSRKNNGRFHKHIYNTLFKSSSVKLNFEELIARKNSTNKSFFSLTLTICKDDSVMWSVDDPKVRSLLATLYPFLYIE
TRHIDLYDWNPIWKLISNLNSFNFDDVDHDELVNFLDEKISSRKGDYKKYIDRVVSVIDTKPYTYKEKVINYIKVAIKGD
SFVNAGEELFTQSDGTNSNKFLETLLHLLITLTRTEFISPIVYIDEPEVGLHPKLAESFVSNLNKIYSKFKKTSELSGPG
RYKTPYPNIFYSTHSPSILKQTIKLFGKDQQVLHFSKKKDGSTRVNKINSTYSDERFLNIFSDNEARLFFSEYIVFVEGA
TELELFRNLSLLNLYPAFSLADIYDANEVILANINPGYSKASIPFVIIKDIDTLIDYSIKTEKFSLRPLFEKMIKELTKE
FDYYDTGFGRVRKEIDLFSDIQSSTKKHMDSGLFFKRFSLHNLSSRINKVSRKLNRYFMTTTIEGALINEQSLPYFFNWI
GDVILTQMTINNPNPDKFIEAMRRRYNIKSQVVPLFKSVFCIGLNHPVYSSAVDKQALRIKLSFLNYLKRKVYSDFNNEK
EIVLALRLAFGGKTETQYTLDKLRKDGEAELFREKIKNYKNNELFFLEPQMTKTSGWVTTFLNYTIEKITSEESDDDRIR
QKLSFIFPEIISIIEQASSSIEAEESSLTG
>E8PLM2 3.1.11.3~~~old~~~OLD nuclease~~~COG1196
MLKRLQVKNFRCLEDIDLPLGPLTAIVGPNGAGKTTILRAIDLVLGDVWPSLRSFRIPQDFINFDTTRAIEITVHFDPPY
TQGSFNITAFRLTCKGEDADFHVDLEPLDEGGNVPRYPSGNPLRVGTDMRNHARVLFLDHRRNLAQHLPSIRGSILGRLL
QPVRREFKLQDNFKQVYEQAMDLLRTEQVKQIEKTIAETAKQMLGFLGKDAMKSMEIGFGFADPANPFNSLRLQYRESDL
TLPGDELGLGIQSAIVVGIFEAFRQLGEKIGTVIIEEPEMYLHPQAQRYFYRLLCEMADKDQCQIIYSTHSPIFADVNRF
EALRLVRKDRDDRVVVSYVREEDKSALDNVRNRFKLGGRFDTARNEVLFAKRALLVEGYGDRVAALQLFNQLEVDPDAEC
IAVVDCGGKAGIELIVGVCKALDIPFVVVHDEDVWPIDERADEETRRKQEQENKAEQEKNQRIQACAGAERVFVVQPSLE
AALGIGRNASDKPYRIAEILKTVDVGQPPDALRPFVEAIRQVTRPMEE
>Q8EG66 2.3.3.20~~~oleA~~~Acyl-CoA:acyl-CoA alkyltransferase~~~COG0332
MKYSRVFINSLAYELAPVVVSSSELESRLAPLYQKFRIPMGQLAALTGITERRWWPKGHQLSDGAINAAHKAIAETGIDV
AELGAVVYTGVCRDQHEPATACRIAAALGVSKDTAIYDISNACLGVLSGILDIANRIELGQIKAGMVVSCESARDIVDVT
IDNMLADPTMQNFAQSLATLTGGSGAVAVILTDGSLPLTNVRKHQLLGASHLSAPQHHQLCQWGLQEVGHNIYREFMRTD
AVTLLKEGVELAKHTWEHFLAQRNWLVEQVDKVICHQVGASNRKQVLSALNIPPEKEFPTYQLLGNMGTVSLPVTAAMAH
DQGFLRPGDQVSFLGIGSGLNCMMLGIKW
>Q8PDX2 2.3.3.20~~~oleA~~~Acyl-CoA:acyl-CoA alkyltransferase~~~COG0332
MLFQNVSIAGLAHIDAPHTLTSKEINERLQPTYDRLGIKTDVLGDVAGIHARRLWDQDVQASDAATQAARKALIDANIGI
EKIGLLINTSVSRDYLEPSTASIVSGNLGVSDHCMTFDVANACLAFINGMDIAARMLERGEIDYALVVDGETANLVYEKT
LERMTSPDVTEEEFRNELAALTLGCGAAAMVMARSELVPDAPRYKGGVTRSATEWNKLCRGNLDRMVTDTRLLLIEGIKL
AQKTFVAAKQVLGWAVEELDQFVIHQVSRPHTAAFVKSFGIDPAKVMTIFGEHGNIGPASVPIVLSKLKELGRLKKGDRI
ALLGIGSGLNCSMAEVVW
>Q8EG65 4.1.1.114~~~oleB~~~Cis-3-alkyl-4-alkyloxetan-2-one decarboxylase~~~COG0596
MLDTLLPFKRHFLSRNGNKLHYINEGQGEPVVMVHGNPSWSFYYRNLVSALKDTHQCIVPDHIGCGLSDKPDDSGYDYTL
KNRIDDLEALLDSLNVKENITLVVHDWGGMIGMGYAARYPERIKRLVILNTGAFHLPDTKPLPLALWICRNTLLGTVLVR
GFNAFSSIASYVGVKRQPMSKYIREAYVAPFNSWANRISTLRFVQDIPLKPGDRNYQLVSDIAASLPKFAKVPTLICWGL
QDFVFDKHFLVKWREHMPHAQVHEFADCGHYILEDASDEVITHIKHFMTETETLATQVNPADSITEFESASQAPQAER
>Q8PDW8 4.1.1.114~~~oleB~~~Cis-3-alkyl-4-alkyloxetan-2-one decarboxylase~~~COG0596
MTYPGYSFTPKRLDVRPGIAMSYLDEGPSDGEVVVMLHGNPSWGYLWRHLVSGLSDRYRCIVPDHIGMGLSDKPDDAPDA
QPRYDYTLQSRVDDLDRLLQHLGITGPITLAVHDWGGMIGFGWALSHHAQVKRLVITNTAAFPLPPEKPMPWQIAMGRHW
RLGEWFIRTFNAFSSGASWLGVSRRMPAAVRRAYVAPYDNWKNRISTIRFMQDIPLSPADQAWSLLERSAQALPSFADRP
AFIAWGLRDICFDKHFLAGFRRALPQAEVMAFDDANHYVLEDKHEVLVPAIRAFLERNPL
>Q8EG64 6.1.3.1~~~oleC~~~Olefin beta-lactone synthetase~~~COG0318
MTKVDDALFEHGASAVAVEQNNGRDNPTKPKDANICRHLKLAAHHIPHHLAVAVQQGKGKSFANLTYQELDFISLNKQSD
AIAFALNAYGLTRGMKAVLMVTPSLDFFALTFALFKAGIIPVLVDPGMGIKNLKQCFIEAAPDAFIGIPKAHIARRLLGW
GKASVKRLINVDANQSGVTDTLSRLLTGAPSLASMLSFTTKSSSAKLPEQVEYPMALLEHDEMAAILFTSGSTGTPKGVV
YSHGMFEAQIQALKQDYGIAHGERDLATFPLFSLFGPALGMTSIVPEMDASKPITANPEFLFAAIEKYQCSNIFVNPALL
ERLGRAGEQTDSKNQHKLSSVKRVISAGAPATIASIARFSKMLSDGVPVLNSYGATESLPISMIASDELFTTTQVTDNGG
GICVGRAIDGVKIEIIAITEADIPEWDNRLCLNAGEIGEIVVTGQMVSQSYYHREKATAASKIWDSERQTFRHRMGDLGY
LDDSGRLWMCGRKAHRVDATQGGQFAKRYYSIPCERIFNTHPNVKRSALVGVTVKGQHGVGEIKPLICIELDQSLVCNKS
AQLYQELMVIAEQYSQTQGIRRFLIHPDFPVDVRHNAKIFREKLAVWAQSQTKG
>B2FI28 6.1.3.1~~~oleC~~~Olefin beta-lactone synthetase~~~COG0318
MNRPCNIAARLPELARERPDQIAIRCPGRRGAGNGMAAYDVTLDYRQLDARSDAMAAGLAGYGIGRGVRTVVMVRPSPEF
FLLMFALFKLGAVPVLVDPGIDRRALKQCLDEAQPEAFIGIPLAHVARLVLRWAPSAARLVTVGRRLGWGGTTLAALERA
GAKGGPMLAATDGEDMAAILFTSGSTGVPKGVVYRHRHFVGQIQLLGSAFGMEAGGVDLPTFPPFALFDPALGLTSVIPD
MDPTRPAQADPVRLHDAIQRFGVTQLFGSPALMRVLAKHGRPLPTVTRVTSAGAPVPPDVVATIRSLLPADAQFWTPYGA
TECLPVAVVEGRELERTRAATEAGAGTCVGSVVAPNEVRIIAIDDAPLADWSQARVLAVGEVGEITVAGPTATDSYFNRP
QATAAAKIRETLADGSTRVVHRMGDVGYFDAQGRLWFCGRKTQRVETARGPLYTEQVEPVFNTVAGVARTALVGVGAAGA
QVPVLCVELLRGQSDSPALQEALRAHAAARTPEAGLQHFLVHPAFPVDIRHNAKIGREKLAVWASAELEKRA
>Q8PDW6 6.1.3.1~~~oleC~~~Olefin beta-lactone synthetase~~~COG0318
MGDNGRMTTLCNIAASLPRLARERPDQIAIRCPGGRGANGMAAYDVTLSYAELDARSDAIAAGLALHGIGRGVRAVVMVR
PSPEFFLLMFALFKAGAVPVLVDPGIDKRALKQCLDEAQPQAFIGIPLAQLARRLLRWAPSATQIVTVGGRYCWGGVTLA
RVERDGAGAGSQLADTAADDVAAILFTSGSTGVPKGVVYRHRHFVGQIELLRNAFDMQPGGVDLPTFPPFALFDPALGLT
SVIPDMDPTRPATADPRKLHDAMTRFGVTQLFGSPALMRVLADYGQPLPNVRLATSAGAPVPPDVVAKIRALLPADAQFW
TPYGATECLPVAAIEGRTLDATRTATEAGAGTCVGQVVAPNEVRIIAIDDAAIPEWSGVRVLAAGEVGEITVAGPTTTDT
YFNRDAATRNAKIRERCSDGSERVVHRMGDVGYFDAEGRLWFCGRKTHRVETATGPLYTEQVEPIFNVHPQVRRTALVGV
GTPGQQQPVLCVELQPGVAASAFAEVETALRAVGAAHPHTAGIARFLRHSGFPVDIRHNAKIGREKLAIWAAQQRV
>Q8EG63 1.1.1.412~~~oleD~~~2-alkyl-3-oxoalkanoate reductase~~~COG0451
MTDNSSISLTPADLEHVPLQPTRLKQVGGDQACIKLSLDAREQTALDALAAKVSHAFVTGAGGFLGKAICQRLIAAGIKV
TGFARGRYLELEALGVTMVQGDLVNPEQVKQAMQGCDIVFHVASKAGVWGDRDSYFCPNVKGAANVIAACKALKINKLVY
TSTPSVTFAGEDESGINESTPYASRFLNYYAHSKAIAEKMMLDANQSSSTNAAYVLKTVALRPHLIWGPNDPHLVPRVLA
RGRLGKLKLVGREDKLVDTIYIDNAAYAHVLAALELCQATPKCQGKAYFISNDEPVTMAKMLNMILACDGLPPVTQRVPQ
MLAYAVGAVLETAYRLLNKQEEPIMTRFVAKQLSCSHYFDISAAKQDFGYSALVSIEEGMKRLKASL
>Q53685 2.4.1.-~~~oleD~~~Oleandomycin glycosyltransferase~~~
MTTQTTPAHIAMFSIAAHGHVNPSLEVIRELVARGHRVTYAIPPVFADKVAATGPRPVLYHSTLPGPDADPEAWGSTLLD
NRRTFLNDAIQALPQLADAYADDIPDLVLHDITSYPARVLARRWGVPAVSLSPNLVAWKGYEEEVAEPMWREPRQTERGR
AYYARFEAWLKENGITEHPDTFASHPPRSLVLIPKALQPHADRVDEDVYTFVGACQGDRAEEGGWQRPAGAEKVVLVSLG
SAFTKQPAFYRECVRAFGNLPGWHLVLQIGRKVTPAELGELPDNVEVHDWVPQLAILRQADLFVTHAGAGGSQEGLATAT
PMIAVPQAVDQFGNADMLQGLGVARKLATEEATADLLRETALALVDDPEVARRLRRIQAEMAQEGGTRRAADLIEAELPA
RHERQEPVGDRPNVGDRPAGVRSDRQRSAL
>B2FI29 1.1.1.412~~~oleD~~~2-alkyl-3-oxoalkanoate reductase~~~COG0451
MKILVTGGGGFLGQALCRGLVERGHQVLAFNRSHYPELQVMGVGQIRGDLADPQAVLHAVAGVDAVFHNGAKAGAWGSYD
SYHQANVIGTDNVIAACRAHGIGRLVYTSTPSVTHRATHPVEGLGADEVPYGEDFQAPYAATKAIAEQRVLAANDASLAT
VALRPRLIWGPGDQQLVPRLAERARQGRLRLVGDGSNKVDTTYIDNAALAHFLAFEALAPGAACAGKAYFISNGEPLPMR
ELVNQLLAAVGAPRVDKAISFKTAYRIGAICERLWPLLRLRGEPPLTRFLAEQLCTPHWYSMEPARRDFGYVPQVSIEEG
LRRLKASSAA
>Q8PDW5 1.1.1.412~~~oleD~~~2-alkyl-3-oxoalkanoate reductase~~~COG0451
MMKILVTGGGGFLGQALCRGLVARGHEVVSFQRGDYPVLHTLGVGQIRGDLADPQAVRHALAGIDAVFHNAAKAGAWGSY
DSYHQANVVGTQNVLDACRANGVPRLIYTSTPSVTHRATNPVEGLGADEVPYGEDLRAPYAATKAIAERAVLAANDAQLA
TVALRPRLIWGPGDNHLLPRLAARARAGRLRMVGDGSNLVDSTYIDNAAQAHFDAFAHLAPGAACAGKAYFISNGEPLPM
RELLNRLLAAVDAPAVTRSLSFKTAYRIGAVCETLWPLLRLPGEVPLTRFLVEQLCTPHWYSMEPARRDFGYVPQISIEE
GLQRLRSSSSRDISITR
>Q9RR31 4.2.1.159~~~oleV~~~dTDP-4-dehydro-6-deoxy-alpha-D-glucopyranose 2,3-dehydratase~~~
MIWGIPAMSEAMGSVPTAGSEVSSTCAFLSWLDARRRANRLTVEHVPFRELSGWQFDENTGNLRHTSGRFFSIEGLRVRT
DHCWFGSWTQPIIVQPEIGILGLLVKRFDGILHVLVQAKNEPGNIGGLQLSPTVQATRSNYTRVHRGGGVRYLEYFASPR
GRGRVLADVLQSEQGSWFLHKRNRNMVVEALDDVPLDDDFHWISLGGLRKLLLRPHLVNMDTRTVLSCLPPDPAPDGRQP
PAPAAPFAAAVTRSLTRGATALHTMGEILGWLTDERSRRELVQQRVPLEETAFSGWRRDDHAIAHKDGDYFRVIGVSVRA
SSREVSSWSQPLLAPVGPGLAAFVTRRIRGVLHVLLHARTEAGLLNGPEMAPTVQCRPLNYRAVPAEYRPAYLDYVLSAD
PGRIRYDTLQSEEGGRFHHAENRYVVVEAEDDFPVEVPRDFRWLTLHQILALLHHSNYVNVEARSLVACIQALS
>Q9RR32 1.1.1.384~~~oleW~~~dTDP-3,4-didehydro-2,6-dideoxy-alpha-D-glucose 3-reductase~~~
MPSPRLRFGVLGAADIALRRTVPALLAHPDVTVVAVSSRDTARAARFAAAFGCEAVPGHQALLDRDDIDALYVPLPVMVH
TPWVEAALLRGRHVLVEKPLTATRSGAEDLIALARSRGLVLMENFTSLHHAQHGTVTDLLRDGTIGELRSLSAAFTIPPK
PEGDIRYQPDVGGGALLDIGIYPLRAALHFLGPDLHAAGAVLRRERRRNVVVSGHVLLTTPHGVVAELAFGMEHAYRSEY
TLFGTAGRLRLDRAFTPPETHRPRVEIHRQDALDIVDLPPDAQFANLVRDFVLAVREGPGRLTQHHADAVRQADLVERVM
AVARVRWC
>O87833 2.1.1.239~~~oleY~~~L-olivosyl-oleandolide 3-O-methyltransferase~~~
MSYDDHAVLEAILRCAGGDERFLLNTVEEWGAAEITAALVDELLFRCEIPQVGGEAFIGLDVLHGADRISHVLQVTDGKP
VTSAEPAGQELGGRTWSSRSATLLRELFGPPSGRTAGGFGVSFLPDLRGPRTMEGAALAARATNVVLHATTNETPPLDRL
ALRYESDKWGGVHWFTGHYDRHLRAVRDQAVRILEIGIGGYDDLLPSGASLKMWKRYFPRGLVFGVDIFDSRRATSRVSR
RSAARQDDPEFMRRVAEEHGPFDVIIDDGSHINAHMRTSFSVMFPHLRNGGFYVIEDTFTSYWPGYGGPSGARCPSGTTA
LEMVKGLIDSVHYEERPDGAATADYIARNLVGLHAYQTTSSSSRRAINKEGGIPHTVPREPFWNDN
>C7DLJ6 4.2.1.53~~~ohyA~~~Oleate hydratase~~~COG4716
MNPITSKFDKVLNASSEYGHVNHEPDSSKEQQRNTPQKSMPFSDQIGNYQRNKGIPVQSYDNSKIYIIGSGIAGMSAAYY
FIRDGHVPAKNITFLEQLHIDGGSLDGAGNPTDGYIIRGGREMDMTYENLWDMFQDIPALEMPAPYSVLDEYRLINDNDS
NYSKARLINNKGEIKDFSKFGLNKMDQLAIIRLLLKNKEELDDLTIEDYFSESFLKSNFWTFWRTMFAFENWHSLLELKL
YMHRFLHAIDGLNDLSSLVFPKYNQYDTFVTPLRKFLQEKGVNIHLNTLVKDLDIHINTEGKVVEGIITEQDGKEVKIPV
GKNDYVIVTTGSMTEDTFYGNNKTAPIIGIDNSTSGQSAGWKLWKNLAAKSEIFGKPEKFCSNIEKSAWESATLTCKPSA
LIDKLKEYSVNDPYSGKTVTGGIITITDSNWLMSFTCNRQPHFPEQPDDVLVLWVYALFMDKEGNYIKKTMLECTGDEIL
AELCYHLGIEDQLENVQKNTIVRTAFMPYITSMFMPRAKGDRPRVVPEGCKNLGLVGQFVETNNDVVFTMESSVRTARIA
VYKLLNLNKQVPDINPLQYDIRHLLKAAKTLNDDKPFVGEGLLRKVLKGTYFEHVLPAGAAEEEEHESFIAEHVNKFREW
VKGIRG
>B9E972 4.2.1.53~~~~~~Oleate hydratase~~~COG4716
MYYSNGNYEAFARPKKPEGVDNKSAYLVGSGLASLAAASFLIRDGQMKGENIHILEELDLPGGSLDGILNPERGYIMRGG
REMENHFECLWDLFRSVPSLEVEDASVLDEFYWLNKEDPNYSKCRVIENRGQRLESDGKMTLTKKANKEIIQLCLMKEEQ
LNDVKISDVFSKDFLDSNFWIYWKTMFAFEPWHSAMEMRRYLMRFIHHIGGLADFSALKFTKFNQFESLVMPLIEHLKAK
NVTFEYGVTVKNIQVECSKESKVAKAIDIVRRGNEESIPLTENDLVFVTNGSITESTTYGDNDTPAPPTSKPGGAWQLWE
NLSTQCEEFGNPAKFYKDLPEKSWFVSATATTNNKEVIDYIQKICKRDPLSGRTVTGGIVTVDDSNWQLSFTLNRQQQFK
NQPDDQVSVWIYALYSDERGERTNKTIVECSGKEICEEWLYHMGVPEEKISALAAECNTIPSYMPYITAYFMPRKEGDRP
LVVPHGSKNIAFIGNFAETERDTVFTTEYSVRTAMEAVYKLLEVDRGVPEVFASVYDVRILLHALSVLNDGKKLDEIDMP
FYERLVEKRLLKKASGTFIEELLEEANLI
>B5XK69 4.2.1.53~~~sph~~~Oleate hydratase~~~
MYYTSGNYEAFATPRKPEGVDQKSAYIVGTGLAGLAAAVFLIRDGHMAGERIHLFEELPLAGGSLDGIEKPHLGFVTRGG
REMENHFECMWDMYRSIPSLEIPGASYLDEFYWLDKDDPNSSNCRLIHKRGNRVDDDGQYTLGKQSKELIHLIMKTEESL
GDQTIEEFFSEDFFKSNFWVYWATMFAFEKWHSAVEMRRYAMRFIHHIDGLPDFTSLKFNKYNQYDSMVKPIIAYLESHD
VDIQFDTKVTDIQVEQTAGKKVAKTIHMTVSGEAKAIELTPDDLVFVTNGSITESSTYGSHHEVAKPTKALGGSWNLWEN
LAAQSDDFGHPKVFYQDLPAESWFVSATATIKHPAIEPYIERLTHRDLHDGKVNTGGIITITDSNWMMSFAIHRQPHFKE
QKENETTVWIYGLYSNSEGNYVHKKIEECTGQEITEEWLYHLGVPVDKIKDLASQEYINTVPVYMPYITSYFMPRVKGDR
PKVIPDGSVNLAFIGNFAESPSRDTVFTTEYSIRTAMEAVYSFLNGERGIPQGFNSAYDIRELLKAFYYLNDKKAIKDMD
LPIPALIEKIGHKKIKDTFIEELLKDANLM
>Q2YQS9 2.3.1.270~~~olsA~~~Lyso-ornithine lipid O-acyltransferase~~~
MIGTIRIFLVVAAMVALSLSLIPFQYLFLKLKNGWKRRLPNFFHRIVARLFGFRIRTVGKLHEGCPLLLVSNHTSWSDIV
VLSAVGQVSFIAKSEVRDWPVFGMFAVLQRTVFVERARRGKTVHQTSEIANRLIAGDAMVLFAEGTTSDGNRVLPFKTAL
FGAAHAAIREAGVAEVAVQPVAIAYTRVHGMAMGRYFRPLVSWPGDVELMPHLKGILREGAIDVEVRFGEPVFVTAETDR
KALARTMENRVRALLQSALLGREIPEA
>Q9HW50 2.3.1.270~~~olsA~~~Lyso-ornithine lipid O-acyltransferase~~~
MARLRLLLRSARLLGLVALGLGLAAWVSLRERLPGADVTPLRQRLTRWWLARLCAALPFEVRVSGEAPRQPMLWVANHVS
WTDIPLLGALAPLTFLSKAEVRAWPLAGWLAEKAGTLFIRRGSGDSRLINQRLAEQLHRGRNLLIFPEGTTTNGESLRTF
HGRLMASALEAGVAVQPVAISYRRDGVPDAQAPFIGDDDLLSHLGRLLRGERGSVHIQLLEPIPSQGLDRAELARQAQQA
VRLALFGTAAPTQTRRAA
>Q7APG1 2.3.1.270~~~olsA~~~Lyso-ornithine lipid O-acyltransferase~~~COG0204
MINWVRVALCGMLLVMVSLVLMPVQILCLWLDLKPRRWLPRHWHRVACLLLGLRVRVHGELDRRRPLLLSANHVSWKDIL
VLSSVADVVFVAKSDVKSWPIFGLLARLQASVFVEREQKRTTGHQVNDIGRRLADGEIVVLFPEGTTSDGNRLLDIKTSL
FGAAASAVPQSPTGVVHVQPLAISYTGIHGMPMGRYHRPIAAWPGDIGLVPHLLGVLREGALEVDVDFGEAVDYDRHANR
KEVSRLIGQRIRKMLSDRLRGRSRSAAKGEPAPACSAAPDIPSDAQRSRLAP
>D5AQD5 2.3.1.270~~~olsA~~~Lyso-ornithine lipid O-acyltransferase~~~COG0204
MTRPIWMGDEPPPDRPPTLAGRGRLALRGGAMALVLMAGLTLHLAVRLIERPLHGGHRPWTPAITPVVCRICVAILGLRY
SVRGRPMQHIGAVVANHTGWLDIFTLNACQRLYFVSKDEVADWPFIGWLARATGTVFIRRDPREAKAQQALLEDRIRDGH
HLLFFPEGTSTDGLQVLPFKTTLFAAFYTHGLDKVMQIQPVTVNYTAPEGEDPRFYGWWRDMPFATHLAKVLSVARQGAA
EVVFHPPLDVSDFPSRKDLAAACEAAVRSGMGQRSR
>Q2YNY9 2.3.2.30~~~olsB~~~L-ornithine N(alpha)-acyltransferase~~~
MSGLEAQQALFASNGDAIILGRIGSLEVRLANSRAAIQAAQELRFRVFFEEMGARKETIEAVEQRDADRFDTICDHLLVY
DTALPVPEHQQIVGTYRLMRNEQAEKALGFYSADEYDVQRLKLSRPNLRLLELGRSCVKPEYRSKRTVELLWQGAWAYCR
RHSIDVMFGCASFHGAVPAAHALGLSFLHHNCRATDDWDVRALPHRYQAMDLMPKEAINNKVALFSMPPLVKGYLRLGAM
IGDGAVIDEAFGTTDVFIILPIERISSRYISYYGAEANRFV
>Q9HW51 2.3.2.30~~~olsB~~~L-ornithine N(alpha)-acyltransferase~~~
MTQTAITREPVAGRRLKAERLNGARALREAQALRYRVFSAEFDAKLEGAEDGLDRDDYDRHCAHIGVRDLDSGALVATTR
LLDHRAAERLGRFYSEEEFHLSGLDALHGPVLEIGRTCVAPEYRNGATIAVLWGELAEVLNEGGYRYLMGCASIPMRDGG
MQAKAVMQRLRERYLCTDYLQAEPKNPLPPLDVPENLTAELPPLLKAYMRLGAKICGEPCWDPDFQVADVFILLKRDELC
PRYARHFKAAV
>Q92SJ1 2.3.2.30~~~olsB~~~L-ornithine N(alpha)-acyltransferase~~~COG3176
MTIELLDSMGVVDTSNAYIRKAVAAPASDVLGRIANLETRLARSAAEIDAAQAVRYRVFVEEMKAQVAPEAGRRKRDIDS
WDAICDHLLVLDTSIEGDAEEQIVGTYRLLRQDVAERTGGFYSASEFAIGELLSRHPGKRFMELGRSCVLPEYRTKRTVE
LLWQGNWAYALKHGIDAMFGCGSFPGVVPEEHALALSFLHHNVRVRDEWAVSARPELYRTMDLMPPEAINPKKALAALPP
LIKGYMRLGAMVGDGAVVDQAFRTTDVLIVLPIGKISGRYLNYYGADAGRFSSPVS
>Q4VFY5 1.14.11.58~~~olsC~~~Ornithine lipid ester-linked acyl 2-hydroxylase~~~
MTESPLSAPAPTSNQSPAPEQTFGTAGIAPMDRPSAMTRFFMSIVAWAESLNLKYAKLGNPPVYDTATFPWAAEIEKDYP
AIRAELEKVLLRQSELPTFQDISTDVKTISTDTRWKTFFLLGFGVKSEQNIKACPNTWAAVQKIPGLTTAMFSIFEPGKH
LPAHRGPYNGVLRLHLGLIVPEPNDKLAIRVDNQVCHWQEGKALIFDDAYEHEAWNHTDKTRVVLFVDFVKPLKSPARFV
NWALMNLAIFTPFIKEGLDNHKEWEKKFYAEAEAFRNRPKP
>L0D9B6 2.1.1.344~~~olsG~~~Ornithine lipid N-methyltransferase~~~COG3963
MKDFFLFLGKFFKHGTAIASLAPSSPWLSRTTVRNIPWENARVVVELGAGTGPITKVIADRVHPDCRVIVLERDPDFARL
LRERFANRANFDVVEGDVRDLTQILQQRGIEQADFVVSGLPVPSFPKELQRDLFRVVKQVLAPSGTFNQITEMPWVYQRF
YRRFFDEVTFVFEPRNLPPAGAYFCRGVKESF
>P80604 ~~~~~~24 kDa outer membrane protein~~~
MKNKSKLLACCLMALPISSFSIGNNNLIGVGVSAGNSIYQVKKKTAVEPFLMLDLSFGNFYMRGAAGLSELGYQHVFTPS
FSTSLFLSPFDGAPIKRKDLKPGYDSIQDRKTQVAVGLGLDYDLSDLFNLPNTNISLEMKKGRRGFNSDITLTRTFMLTD
KLSISPSFGLSYYSAKYTNYYFGIKKAELNKTKLKSVYHPKKAYSGHIALNSHYAITDHIGMGLSFSWETYSKAIKKSPI
VKRSGEISSALNFYYMF
>Q51922 ~~~skp~~~Outer membrane p25~~~
MKKAVKVTALSLALAFTSSLAMATENIAFISGDYLFQNHPDRKMVAEKLESEFKARVEKLTANKKSIDEKIAASQKKVEA
KVAALQKDAPKLRSADIKKREEEINKLGNSEQEAINKLVTAHDEEVSKYQDDYAKREREETAKLVDSIQNAVNTVAREKN
YTLVLNEGAVVFAADAKNITEDVLKVIPATQAK
>Q57483 ~~~~~~Outer membrane protein 26~~~COG2825
MKNIAKVTALALGIALASGYASAEEKIAFINAGYIFQHHPDRQAVADKLDAEFKPVAEKLAASKKEVDDKIAAARKKVEA
KVAALEKDAPRLRQADIQKRQQEINKLGAAEDAELQKLMQEQDKKVQEFQAQNEKRQAEERGKLLDSIQTATNNLAKAKG
YTYVLDANSIVFAVEGKDITEEVLKSIPASEKAQEKK
>Q05811 ~~~ropA~~~Outer membrane protein IIIA~~~
MNIRMVLLASAAAFAASTPVLAADAIVAAEPEPVEYVRVCDAYGTGYFYIPGTETCLKIEGYIRFQVNVGDNPGGDNDSD
WDAVTAVRFSSRKSDTEYGPLTGVIVMQFNADNASDQDAILDSAYLDVAGFRAGLFYSWWDDGLSGETDDIGSVVTLHNS
IRYQYESGTFYAGLSVDELEDGVYQGTFTPGVIPGTTDFTADDGPNNVGVAFGIGGTAGAFSYQVTGGWDVDNEDGAIRA
MGTVEIGPGTFGLAGVYSSGPNSYYSSAEWAVAAEYAIKATDKLKITPGRWHGHVPEDFDGLGDAWKVGLTVDYQIVENF
YAKASVQYLDPQDGEDSTSGYFACSVRSNHLVDAPGLRIGSTTISF
>Q9PJU9 ~~~omcA~~~Small cysteine-rich outer membrane protein OmcA~~~
MKKTALLAALCSVVSLSSCCRIVDCCFEDPCAPIKCSPCESKKREVNGGCNSCNGYVPSCKPCGGELDHETKQGPQARGI
QADGRCRQ
>F0T376 ~~~omcA~~~Small cysteine-rich outer membrane protein omcA~~~
MKKAVLLATVFCGVVGLTSCCRIVDCCFEDPCAPKPCNPCGNKKDKGCSPCGVYTPSCSKPCGSECNPGVQGPQAKGCTS
LDGRCKQ
>Q9Z7Z5 ~~~omcA~~~Small cysteine-rich outer membrane protein OmcA~~~
MKKAVLIAAMFCGVVSLSSCCRIVDCCFEDPCAPSSCNPCEVIRKKERSCGGNACGSYVPSCSNPCGSTECNSQSPQVKG
CTSPDGRCKQ
>P0CZ19 ~~~omcA~~~Small cysteine-rich outer membrane protein omcA~~~
MKKAVLLATVFCGVVGLTSCCRIVDCCFEDPCAPKPCNPCGNKKDKGCSPCGVYTPSCSKPCGSECNPGVQGPQAKGCTS
LDGRCKQ
>B0B816 ~~~omcA~~~Small cysteine-rich outer membrane protein OmcA~~~
MKKTALLAALCSVVSLSSCCRIVDCCFEDPCAPIQCSPCESKKKDVDGGCNSCNGYVPACKPCGGDTHQDAEHGPQAREI
PVDGKCRQ
>P0DJI1 ~~~omcA~~~Small cysteine-rich outer membrane protein OmcA~~~
MKKTALLAALCSVVSLSSCCRIVDCCFEDPCAPIQCSPCESKKKDVDGGCNSCNGYVPACKPCGGDTHQDAEHGPQAREI
PVDGKCRQ
>P0CC05 ~~~omcA~~~Small cysteine-rich outer membrane protein OmcA~~~
MKKTALLAALCSVVSLSSCCRIVDCCFEDPCAPIQCSPCESKKKDVDGGCNSCNGYVPACKPCGGDTHQDAKHGPQARGI
PVDGKCRQ
>P26758 ~~~omcB~~~Large cysteine-rich periplasmic protein OmcB, serovar C~~~
MNKLIRRAVTIFAVTSVASLFASGVLETSMAESLSTNVISLADTKAKDNTSHKSKKARKNHSKETPVDRKEVAPVHESKA
TGPKQDSCFGRMYTVKVNDDRNVEITQAVPEYATVGSPYPIEITATGKRDCVDVIITQQLPCEAEFVRSDPATTPTADGK
LVWKIDRLGQGEKSKITVWVKPLKEGCCFTAATVCACPEIRSVTKCGQPAICVKQEGPENACLRCPVVYKINVVNQGTAT
ARNVVVENPVPDGYAHSSGQRVLTFTLGDMQPGEHRTITVEFCPLKRGRATNIATVSYCGGHKNTASVTTVINEPCVQVS
IAGADWSYVCKPVEYVISVSNPGDLVLRDVVVEDTLSPGVTVLEAAGAQISCNKVVWTVKELNPGESLQYKVLVRAQTPG
QFTNNVVVKSCSDCGTCTSCAEATTYWKGVAATHMCVVDTCDPVCVGENTVYRICVTNRGSAEDTNVSLMLKFSKELQPV
SFSGPTKGTITGNTVVFDSLPRLGSKETVEFSVTLKAVSAGDARGEAILSSDTLTVPVSDTENTHIY
>P0CC04 ~~~omcB~~~Large cysteine-rich periplasmic protein OmcB~~~
MNKLIRRAVTIFAVTSVASLFASGVLETSMAESLSTNVISLADTKAKDNTSHKSKKARKNHSKETPVDRKEVAPVHESKA
TGPKQDSCFGRMYTVKVNDDRNVEITQAVPEYATVGSPYPIEITATGKRDCVDVIITQQLPCEAEFVRSDPATTPTADGK
LVWKIDRLGQGEKSKITVWVKPLKEGCCFTAATVCACPEIRSVTKCGQPAICVKQEGPENACLRCPVVYKINIVNQGTAT
ARNVVVENPVPDGYAHSSGQRVLTFTLGDMQPGEHRTITVEFCPLKRGRATNIATVSYCGGHKNTASVTTVINEPCVQVS
IAGADWSYVCKPVEYVISVSNPGDLVLRDVVVEDTLSPGVTVLEAAGAQISCNKVVWTVKELNPGESLQYKVLVRAQTPG
QFTNNVVVKSCSDCGTCTSCAEATTYWKGVAATHMCVVDTCDPVCVGENTVYRICVTNRGSAEDTNVSLMLKFSKELQPV
SFSGPTKGTITGNTVVFDSLPRLGSKETVEFSVTLKAVSAGDARGEAILSSDTLTVPVSDTENTHIY
>P23603 ~~~omcB~~~Large cysteine-rich periplasmic protein OmcB, serovar E~~~COG1361
MNKLIRRAVTIFAVTSVASLFASGVLETSMAESLSTNVISLADTKAKDNTSHKSKKARKNHSKETLVDRKEVAPVHESKA
TGPKQDSCFGRMYTVKVNDDRNVEITQAVPEYATVGSPYPIEITATGKRDCVDVIITQQLPCEAEFVRSDPATTPTADGK
LVWKIDRLGQGEKSKITVWVKPLKEGCCFTAATVCACPEIRSVTKCGQPAICVKQEGPENACLRCPVVYKINVVNQGTAI
ARNVVVENPVPDGYAHSSGQRVLTFTLGDMQPGEHRTITVEFCPLKRGRATNIATVSYCGGHKNTASVTTVINEPCVQVS
IAGADWSYVCKPVEYVISVSNPGDLVLRDVVVEDTLSPGVTVLEAAGAQISCNKVVWTVKELNPGESLQYKVLVRAQTPG
QFTNNVVVKSCSDCGTCTSCAEATTYWKGVAATHMCVVDTCDPVCVGENTVYRICVTNRGSAEDTNVSLMLKFSKELQPV
SFSGPTKGTITGNTVVFDSLPRLGSKETVEFSVTLKAVSAGDARGEAILSSDTLTVPVSDTENTHIY
>F0T377 ~~~omcB~~~Large cysteine-rich periplasmic protein omcB~~~
MSKLIRRVVTVLALTSMASSFASGKIEAAAAESLATRFIASTENSDDNVFQATAKKVRFGRNKNQRQEQKHTGAFCDKEF
YPCEGGQCQPVDATQESCYGKMYCVRVNDDCNVEISQSVPEYATVGSPYPIEILAVGKKDCVNVVITQQLPCEVEFVSSD
PATTPTSDSKLIWTIDRLGQGEKCKITVWVKPLKEGCCFTAATVCACPELRSYTKCGQPAICIKQEGPECACLRCPVCYK
IEVCNTGSAIARNVVVDNPVPDGYTHASGQRVLSFNLGDMRPGDSKCFCVEFCPQKRGKVTNVATVSYCGGHKCSANVTT
VVNEPCVQVNISGADWSYVCKPVEYTIVVSNPGDLKLYDVVIEDTAPSGATILEAAGAEICCNKAVWCIKEMCPGETLQF
KVVAKAQSPGKFTNQVVVKTNSDCGTCTSCAEVTTHWKGLAATHMCVIDTNDPICVGENTVYRICVTNRGSAEDTNVSLI
LKFSKELQPVSSSGPTKGTITGNTVVFDALPKLGSKESVEFSVTLKGIAPGDARGEAILSSDTLTVPVADTENTHVY
>P23700 ~~~omcB~~~Large cysteine-rich periplasmic protein OmcB~~~COG1361
MSKLIRRVVTVLALTSMASCFASGGIEAAVAESLITKIVASAETKPAPVPMTAKKVRLVRRNKQPVEQKSRGAFCDKEFY
PCEEGRCQPVEAQQESCYGRLYSVKVNDDCNVEICQSVPEYATVGSPYPIEILAIGKKDCVDVVITQQLPCEAEFVSSDP
ETTPTSDGKLVWKIDRLGAGDKCKITVWVKPLKEGCCFTAATVCACPELRSYTKCGQPAICIKQEGPDCACLRCPVCYKI
EVVNTGSAIARNVTVDNPVPDGYSHASGQRVLSFNLGDMRPGDKKVFTVEFCPQRRGQITNVATVTYCGGHKCSANVTTV
VNEPCVQVNISGADWSYVCKPVEYSISVSNPGDLVLHDVVIQDTLPSGVTVLEAPGGEICCNKVVWRIKEMCPGETLQFK
LVVKAQVPGRFTNQVAVTSESNCGTCTSCAETTTHWKGLAATHMCVLDTNDPICVGENTVYRICVTNRGSAEDTNVSLIL
KFSKELQPIASSGPTKGTISGNTVVFDALPKLGSKESVEFSVTLKGIAPGDARGEAILSSDTLTSPVSDTENTHVY
>P0CZ18 ~~~omcB~~~Large cysteine-rich periplasmic protein omcB~~~COG1361
MSKLIRRVVTVLALTSMASSFASGKIEAAAAESLATRFIASTENADDNVFQATAKKVRFGRNKNQRQEQKHTEAFCDKEF
YPCEGGQCQPVDATQESCYGKMYCVRVNDDCNVEISQSVPEYATVGSPYPIEILAVGKKDCVNVVITQQLPCEVEFVSSD
PATTPTSDSKLIWTIDRLGQGEKCKITVWVKPLKEGCCFTAATVCACPELRSYTKCGQPAICIKQEGPECACLRCPVCYK
IEVCNTGSAIARNVVVDNPVPDGYTHASGQRVLSFNLGDMRPGDSKCFCVEFCPQKRGKVTNVATVSYCGGHKCSANVTT
VVNEPCVQVNISGADWSYVCKPVEYTIVVSNPGDLKLYDVVIEDTAPSGATILEAAGAEICCNKAVWCIKEMCPGETLQF
KVVAKAQSPGKFTNQVVVKTNSDCGTCTSCAEVTTHWKGLAATHMCVIDTNDPICVGENTVYRICVTNRGSAEDTNVSLI
LKFSKELQPVSSSGPTKGTITGNTVVFDALPKLGSKESVEFSVTLKGIAPGDARGEAILSSDTLTVPVADTENTHVY
>B0B815 ~~~omcB~~~Large cysteine-rich periplasmic protein OmcB~~~
MNKLIRRAVTIFAVTSVASLFASGVLETSMAEFISTNVISLADTKAKDNTSHKSKKARKNHSKETPVNRKKVAPVHESKA
TGPKQDSCFGRMYTVKVNDDRNVEITQAVPKYATVGSPYPVEITATGKRDCVDVIITQQLPCEAEFVRSDPATTPTADGK
LVWKIDRLGQGEKSKITVWVKPLKEGCCFTAATVCACPEIRSVTKCGQPAICVKQEGPENACLRCPVVYKINVVNQGTAT
ARNVVVENPVPDSYAHSSGQRVLTFTLGDMQPGEHRTITVEFCPLKRGRATNIAMVSYCGGHKNTASVTTVINEPCVQVS
IAGADWSYVCKPVEYVISVSNPGDLVLRDVVVKDTLSPGVTVLEAAGAQISCNKVVWTVKELNPGESLQYKVLVRAQTPG
QFTNNVVVKSCSDCGTCTSCAEATTYWKGVAATHMCVVDTCDPVCVGENTVYRICVTNRGSAEDTNVSLMLKFSKELQPV
SFSGPTKGTITGNTVVFDSLPRLGSKETVEFSVTLKAVSAGDARGEAILSSDTLTVPVSDTENTHIY
>P0DJI2 ~~~omcB~~~Large cysteine-rich periplasmic protein OmcB, serovars L1/L3~~~
MNKLIRRAVTIFAVTSVASLFASGVLETSMAEFISTNVISLADTKAKDNTSHKSKKARKNHSKETPVNRKKVAPVHESKA
TGPKQDSCFGRMYTVKVNDDRNVEITQAVPKYATVGSPYPVEITATGKRDCVDVIITQQLPCEAEFVRSDPATTPTADGK
LVWKIDRLGQGEKSKITVWVKPLKEGCCFTAATVCACPEIRSVTKCGQPAICVKQEGPENACLRCPVVYKINVVNQGTAT
ARNVVVENPVPDSYAHSSGQRVLTFTLGDMQPGEHRTITVEFCPLKRGRATNIAMVSYCGGHKNTASVTTVINEPCVQVS
IAGADWSYVCKPVEYVISVSNPGDLVLRDVVVKDTLSPGVTVLEAAGAQISCNKVVWTVKELNPGESLQYKVLVRAQTPG
QFTNNVVVKSCSDCGTCTSCAEATTYWKGVAATHMCVVDTCDPVCVGENTVYRICVTNRGSAEDTNVSLMLKFSKELQPV
SFSGPTKGTITGNTVVFDSLPRLGSKETVEFSVTLKAVSAGDARGEAILSSDTLTVPVSDTENTHIY
>Q74A86 ~~~omcS~~~C-type cytochrome OmcS~~~
MKKGMKVSLSVAAAALLMSAPAAFAFHSGGVAECEGCHTMHNSLGGAVMNSATAQFTTGPMLLQGATQSSSCLNCHQHAG
DTGPSSYHISTAEADMPAGTAPLQMTPGGDFGWVKKTYTWNVRGLNTSEGERKGHNIVAGDYNYVADTTLTTAPGGTYPA
NQLHCSSCHDPHGKYRRFVDGSIATTGLPIKNSGSYQNSNDPTAWGAVGAYRILGGTGYQPKSLSGSYAFANQVPAAVAP
STYNRTEATTQTRVAYGQGMSEWCANCHTDIHNSAYPTNLRHPAGNGAKFGATIAGLYNSYKKSGDLTGTQASAYLSLAP
FEEGTADYTVLKGHAKIDDTALTGADATSNVNCLSCHRAHASGFDSMTRFNLAYEFTTIADASGNSIYGTDPNTSSLQGR
SVNEMTAAYYGRTADKFAPYQRALCNKCHAKD
>P0A3P0 ~~~~~~Outer membrane lipoprotein omp10~~~
MKRFRIVAPLALMSLALAACETTGPGSGNAPIIAHTPAGIEGSWVDPNGIASSFNGGIFETRTTDTNEKLAEGNYLYLSP
QLVEINMRSIVRGTTSKVNCALVSPTQLNCTSSAGSRFSLTRRNAG
>P0A3N8 ~~~~~~Outer membrane lipoprotein omp10~~~
MKRFRIVAPLALMSLALAACETTGPGSGNAPIIAHTPAGIEGSWVDPNGIASSFNGGIFETRTTDTNEKLAEGNYLYLSP
QLVEINMRSIVRGTTSKVNCALVSPTQLNCTSSAGSRFSLTRRNAG
>P0A3N9 ~~~~~~Outer membrane lipoprotein omp10~~~
MKRFRIVAPLALMSLALAACETTGPGSGNAPIIAHTPAGIEGSWVDPNGIASSFNGGIFETRTTDTNEKLAEGNYLYLSP
QLVEINMRSIVRGTTSKVNCALVSPTQLNCTSSAGSRFSLTRRNAG
>Q2YLR6 ~~~~~~Outer membrane lipoprotein omp19~~~
MGISKASLLSLAAAGIVLAGCQSSRLGNLDNVSPPPPPAPVNAVPAGTVQKGNLDSPTQFPNAPSTDMSAQSGTQVASLP
PASAPDLTPGAVAGVWNASLGGQSCKIATPQTKYGQGYRAGPLRCPGELANLASWAVNGKQLVLYDANGGTVASLYSSGQ
GRFDGQTTGGQAVTLSR
>Q83EK8 ~~~ompP1~~~Outer membrane protein P1~~~COG3637
METTTKLAIGVSALCCLASAAFAGGPDIPMIDMNGFHIGLGFGYKSYTYDQVGTVTVTTNGGTVLSVLHPVSASITQFGP
VGELGYTFASDWWIAGVKAQYQYDNVRSVHIMDAPLVGSNYSYRTRLGSHLTAMLLAGIKVNEANAVYLEAGYSTVWGKT
TLFGPGPVAVSMKNRLNGGIAGIGWRHYFMNNVFLDLSYDYALYRSKSNSVTLSSATASAEGTAIGVSGTVQNPKRVAIN
GITATVNYLFNI
>B2SAB9 ~~~omp2a~~~Porin Omp2a~~~
MNIKSLLLGSAAALVAASGAQAADAIVAPEPEAVEYVRVCDAYGAGYFYIPGTETCLRVHGYVRYDVKGGDDVYSGTDRN
GWDKGARFALMFNTNSETELGTLGTYTQLRFNYTSNNSRHDGQYGDFSDDRDVADGGVSTGKIAYTFTGGNGFSAVIALE
QGGEDVDNDYTIDGYMPHVVGGLKYAGGWGSIAGVVAYDSVIEEWATKVRGDVNITDRFSVWLQGAYSSAATPNQNYGQW
GGDWAVWGGAKFIAPEKATFNLQAAHDDWGKTAVTANVAYQLVPGFTITPEVSYTKFGGEWKDTVAEDNAWGGIVRFQRS
F
>Q44620 ~~~omp2a~~~Porin Omp2a~~~
MNIKSLLLGSAAALVAASGAQAADAIVAPEPEAVEYVRVCDAYGAGYFYIPGTETCLRVHGYVRYDVKGGDDVYSGTDRN
GWDKGARFALMFNTNSETELGTLGTYTQLRFNYTSNNSRHDGQYGDFSDDRDVADGGVSTGKIAYTFTGGNGFSAVIALE
QGGEDVDNDYTIDGYMPHVVGGLKYAGGWGSIAGVVAYDSVIEEWATKVRGDVNITDRFSVWLQGAYSSAATPNQNYGQW
GGDWAVWGGAKFIAPEKATFNLQAAHDDWGKTAVTANVAYQLVPGFTITPEVSYTKFGGEWKDTVAEDNAWGGIVRFQRS
F
>Q7CNU3 ~~~omp2a~~~Porin Omp2a~~~COG3203
MNIKSLLLGSAAALVAASGAQAADAIVAPEPEAVEYVRVCDAYGAGYFYIPGTETCLRVHGYVRYDVKGGDDVYSGTDRN
GWDKGARFALMFNTNSETELGTLGTYTQLRFNYTSNNSRHDGQYGDFSDDRDVADGGVSTGTDLQFAYITLGGFKVGIDE
SEFHTFTGYLGDVINDDVVAAGSYRTGKIAYTFTGGNGFSAVIALEQGGEDVDNDYTIDGYMPHVVGGLKYAGGWGSIAG
VVAYDSVIEEWATKVRGDVNITDRFSVWLQGAYSSAATPNQNYGQWGGDWAVWGGAKFIAPEKATFNLQAAHDDWGKTAV
TANVAYQLVPGFTITPEVSYTKFGGEWKDTVAEDNAWGGIVRFQRSF
>Q44665 ~~~omp2b~~~Porin Omp2b~~~
MNIKSLLLGSAAALVAASGAQAADAIVAPEPEAVEYVRVCDAYGAGYFYIPGTETCLRVHGYVRYDVKGGDDVYSGTDRN
GWDKSARFALRVSTGSETELGTLKTFTELRFNYAANNSGVDGKYGNETSSGTVMEFAYIQLGGLRVGIDESEFHTFTGYL
GDVINDDVISAGSYRTGKISYTFTGGNGFSAVIALEQGGDNDGGYTGTTNYHIDGYMPDVVGGLKYAGGWGSIAGVVAYD
SVIEEWAAKVRGDVNITDQFSVWLQGAYSSAATPDQNYGQWGGDWAVWGGLKYQATQKAAFNLQAAHDDWGKTAVTANVA
YELVPGFTVTPEVSYTKFGGEWKNTVAEDNAWGGIVRFQRSF
>Q44619 ~~~omp2b~~~Porin Omp2b~~~
MNIKSLLLGSAAALVAASGAQAADAIVAPEPEAVEYVRVCDAYGAGYFYIPGTETCLRVHGYVRYDVKGGDDVYSGTDRN
GWDKSARFALRVSTGSETELGTLKTFTELRFNYAANNSGVDGKYGNETSSGTVMEFAYIQLGGLRVGIDESEFHTFTGYL
GDVINDDVISAGSYRTGKISYTFTGGNGFSAVIALEQGGDNDGGYTGTTNYHIDGYMPDVVGGLKYAGGWGSIAGVVAYD
SVIEEWAAKVRGDVNITDQFSVWLQGAYSSAATPDQNYGQWGGDWAVWGGLKYQATQKAAFNLQAAHDDWGKTAVTANVA
YELVPGFTVTPEVSYTKFGGEWKNTVAEDNAWGGIVRFQRSF
>Q8YG56 ~~~omp2b~~~Porin Omp2b~~~COG3203
MNIKSLLLGSAAALVAASGAQAADAIVAPEPEAVEYVRVCDAYGAGYFYIPGTEICLRVHGYVRYDVKGGDDVYSGTDRN
GWDKGARFALRVSTGSETELGTLKTFTELRFNYAANNSGVDGKYGNETSSGTVMEFAYIQLGGLRVGIDESEFHTFTGYL
GDVINDDVISAGSYRTGKISYTFTGGNGFSAVIALEQGGDNDGGYTGTTNYHIDGYMPDVVGGLKYAGGWGSIAGVVAYD
SVIEEWAAKVRGDVNITDQFSVWLQGAYSSAATPDQNYGQWGGDWAVWGGLKYQATQKAAFNLQAAHDDWGKTAVTANVA
YELVPGFTVTPEVSYTKFGGEWKNTVAEDNAWGGIVRFQRSF
>Q45330 ~~~omp2b~~~Porin Omp2b~~~
MNIKSLLLGSAAALVAASGAQAADAIVAPEPEAVEYVRVCDAYGAGYFYIPGTETCLRISGYVRYDVKGGDDVYTGSDRK
GWDKVLVLHSMFNTNSETELGTLGTYTTLRFNYTSNNSRHDGQYGDFSDDRDVADGSVSTGTDLQFAYITLGGFKVGIDE
SEFHTFTGYLGDIINDDVISAGSYRTGKIAYTFTGGNGFSAVIALEQGGEDVDNDYTIDGYMPHVVGGLKYAGGWGSIAG
VVAYDSVIEEWATKVRGDVNITDRFSVWLQGAYSSAATPNQNYGQWGGDWAVWGGLKYQATQKAAFNLQAAHDDWGKTAV
TANVAYELVPGFTVTPEVSYTKFGGEWKNTVAEDNAWGGIVRFQRSF
>P0DI96 ~~~omp2b~~~Porin Omp2b~~~
MNIKSLLLGSAAALVAASGAQAADAIVAPEPEAVEYVRVCDAYGAGYFYIPGTETCLRVHGYVRYDVKGGDDVYSGTDRK
GWDKSARFALRVSTGSETELGTLKTFTELRFNYSASNSREDGYYGTNSDGTVMQFAYIQLGGLRVGIDESEFQTFTGYLG
DVINDDVISAGTYRTGKISYTFTGGNGFSAVIALEQGGDNDGGYTPVFKDSQGREINGRGYQIDGYMPDVVGGLKYAGGW
GSIAGVVAYDSVIEEWATKVRGDVNITDQFSVWLQGAYSSAATPDQNYGQWGGDWAVWGGLKYQATQKAAFNLQAAHDDW
GKTAVTANVAYELVPGFTITPEVSYTKFSNEWKRELGNDTLDDAWGGIVRFQRSF
>P24305 ~~~~~~Outer membrane porin protein 32~~~
MKKSLIALAVLAASGAAMAQSSVTLFGIVDTNVAYVNKDAAGDSRYGLGTSGASTSRLGLRGTEDLGGGLKAGFWLEGEI
FGDDGNASGFNFKRRSTVSLSGNFGEVRLGRDLVPTSQKLTSYDLFSATGIGPFMGFRNWAAGQGADDNGIRANNLISYY
TPNFGGFNAGFGYAFDEKQTIGTADSVGRYIGGYVAYDNGPLSASLGLAQQKTAVGGLATDRDEITLGASYNFGVAKLSG
LLQQTKFKRDIGGDIKTNSYMLGASAPVGGVGEVKLQYALYDQKAIDSKAHQITLGYVHNLSKRTALYGNLAFLKNKDAS
TLGLQAKGVYAGGVQAGESQTGVQVGIRHAF
>Q6RYW5 ~~~~~~Outer membrane protein Omp38~~~COG2885
MKLSRIALATMLVAAPLAAANAGVTVTPLLLGYTFQDSQHNNGGKDGNLTNGPELQDDLFVGAALGIELTPWLGFEAEYN
QVKGDVDGASAGAEYKQKQINGNFYVTSDLITKNYDSKIKPYVLLGAGHYKYDFDGVNRGTRGTSEEGTLGNAGVGAFWR
LNDALSLRTEARATYNADEEFWNYTALAGLNVVLGGHLKPAAPVVEVAPVEPTPVAPQPQELTEDLNMELRVFFDTNKSN
IKDQYKPEIAKVAEKLSEYPNATARIEGHTDNTGPRKLNERLSLARANSVKSALVNEYNVDASRLSTQGFAWDQPIADNK
TKEGRAMNRRVFATITGSRTVVVQPGQEAAAPAAAQ
>A0A160PAH3 ~~~~~~Outer membrane protein Omp38~~~
MKLSRIALAMLVAAPLAAANAGVTVTPLMLGYTFQDTQHNNGGDKGELTAGPELQDDLFVGAALGVELTPWLGFEAEYNQ
VKGDLDGTGVPGSEYKQTQINGNFYATSDLITKNYDSKIKPYVLLGAGHYKYKLDDMSYHNNEEGTLGNAGVGAFWRLND
ALSLRTEARGTYNFDEKFWNYTALAGLNVVLGGHLKPAAPVVEVAPVVPVAPTPVEPAPQVLTEDLNMELRVFFDTNKSN
IKDQYKPEVAKVADALNTFPNATARIEGHTDNTGPRKLNERLSLARANSVKSALVSEYNIDASRLSTQGFAWDQPIADNK
TKEGRAMNRRVFATITGSRTVTK
>Q9S3R8 ~~~~~~Outer membrane protein 40~~~COG2885
MKAKSLLLALAGLACTFSATAQEATTQNKAGMHTAFQRDKASDHWFIDIAGGAGMALSGWNNDVDFVDRLSIVPTFGIGK
WHEPYFGTRLQFTGFDIYGFPQGSKERNHNYFGNAHLDFMFDLTNYFGVYRPNRVFHIIPWAGIGFGYKFHSENANGEKV
GSKDDMTGTVNVGLMLKFRLSRVVDFNIEGQAFAGKMNFIGTKRGKADFPVMATAGLTFNLGKTEWTEIVPMDYALVNDL
NNQINSLRGQVEELSRRPVSCPECPEPTQPTVTRVVVDNVVYFRINSAKIDRNQEINVYNTAEYAKTNNAPIKVVGYADE
KTGTAAYNMKLSERRAKAVAKMLEKYGVSADRITIEWKGSSEQIYEENAWNRIVVMTAAE
>Q9S3R9 ~~~~~~Outer membrane protein 41~~~COG2885
MKVKYLMLTLVGAIALNASAQENTVPATGQLPAKNVAFARNKAGSNWFVTLQGGVAAQFLNDNNNKDLMDRLGAIGSLSV
GKYHSPFFATRLQINGGQAHTFLGKNGEQEINTNFGAAHFDFMFDVVNYFAPYRENRFFHLIPWVGVGYQHKFIGSEWSK
DNVESLTANVGVMMAFRLGKRVDFVIEAQAAHSNLNLSRAYNAKKTPVFEDPAGRYYNGFQGMATAGLNFRLGAVGFNAI
EPMDYALINDLNGQINRLRSEVEELSKRPVSCPECPEVTPVTKTENILTEKAVLFRFDSHVVDKDQLINLYDVAQFVKET
NEPITVVGYADPTGNTQYNEKLSERRAKAVVDVLTGKYGVPSELISVEWKGDSTQPFSKKAWNRVVIVRSK
>P80603 ~~~~~~47 kDa outer membrane protein~~~
MAKTSKFTQTLLASALAVVAGSASAAAFQLAEVSTSGLGRAYAGEAAIADNAAVVATNPALMSLLKQPEISVGAIYVDPN
INLTSPMPGFAYKNIAPNALVPTVYGVYPINEKFAVGGGLNVNYGLATEFDDKYAGGFLGGKTDLTAINFNLSGAYRVTE
KFSVGLGLNAVHAKAKLERYAGVALKLKVPNVAQLAALPANTVISKLQGDKWGFGWNAGLVYEFNERNRIGIAYHSQVDI
NFKGQYSNHFPLAAAALLQTKGITATGGKEIPGTLHLPLPAYWEISGYHKMTDRFAMHYSYKYTQWSKFKELRAKGTDGK
TLFSKTEEFRDSSRIALGASYDVTDALTVRTGIAYDESAADEHNTISIPDTDRTWFSVGATYRFTPNVSIDAGFAHLKGK
KNTFKEEGVPFTSKASANLYGLNVNYRF
>P0A0V3 ~~~rmpM~~~Outer membrane protein class 4~~~
MTKQLKLSALFVALLASGTAVAGEASVQGYTVSGQSNEIVRNNYGECWKNAYFDKASQGRVECGDAVAAPEPEPEPEPAP
APVVVVEQAPQYVDETISLSAKTLFGFDKDSLRAEAQDNLKVLAQRLSRTNVQSVRVEGHTDFMGSDKYNQALSERRAYV
VANNLVSNGVPVSRISAVGLGESQAQMTQVCEAEVAKLGAKVSKAKKREALIACIEPDRRVDVKIRSIVTRQVVPAHNHH
QH
>P38368 ~~~ompA~~~Outer membrane protein P5~~~
MKKTAIALVVAGLAAASVAQAAPQENTFYAGVKAGQASFHDGLRALAREKNVGYHRNSFTYGVFGGYQILNQNNLGLAVE
LGYDDFGRAKGREKGKTVAKHTNHGAHLSLKGSYEVLDGLDVYGKAGVALVRSDYKFYEDANGTRDHKKGRHTARASGLF
AVGAEYAVLPELAVRLEYQWLTRVGKYRPQDKPNTAINYNPWIGSINAGISYRFGQGAAPVVAAPEVVSKTFSLNSDVTF
AFGKANLKPQAQATLDSIYGEMSQVKSAKVAVAGYTDRIGSDAFNVKLSQERADSVANYFVAKGVAADAISATGYGKANP
VTGATCDQVKGRKALIACLAPDRRVEIAVNGTK
>P45996 ~~~ompA~~~Outer membrane protein P5~~~
MKKTAIALVVAGLAAASVAQAAPQENTFYAGVKAGQGSFHDGINNNGAIKKGLSSSNYGYRRNTFTYGVFGGYQILNQDN
FGLAAELGYDDFGRAKLREAGKPKAKHTNHGAYLSLKGSYEVLDGLDVYGKAGVALVRSDYKFYEDANGTRDHKKGRHTA
RASGLFAVGAEYAVLPELAVRLEYQWLTRVGKYRPQDKPNTAINYNPWIGCINAGISYRFGQGEAPVVAAPEMVSKTFSL
NSDVTFAFGKANLKPQAQATLDSVYGEISQVKSRKVAVAGYTNRIGSDAFNVKLSQERADSVANYFVAKGVAADAISATG
YGEANPVTGATCDQVKGRKALIACLAPDRRVEIAVNGTK
>A6QIG2 ~~~~~~65 kDa membrane protein~~~
MKFKSLITTTLALGVLASTGANFNNNEASAAAKPLDKSSSSLHHGYSKVHVPYAITVNGTSQNILSSLTFNKNQNISYKD
LEDRVKSVLKSDRGISDIDLRLSKQAKYTVYFKNGTKKVIDLKAGIYTADLINTSEIKAININVDTKKQVEDKKKDKANY
QVPYTITVNGTSQNILSNLTFNKNQNISYKDLEDKVKSVLESNRGITDVDLRLSKQAKYTVNFKNGTKKVIDLKSGIYTA
NLINSSDIKSININVDTKKHIENKAKRNYQVPYSINLNGTSTNILSNLSFSNKPWTNYKNLTSQIKSVLKHDRGISEQDL
KYAKKAYYTVYFKNGGKRILQLNSKNYTANLVHAKDVKRIEITVKTGTKAKADRYVPYTIAVNGTSTPILSDLKFTGDPR
VGYKDISKKVKSVLKHDRGIGERELKYAKKATYTVHFKNGTKKVININSNISQLNLLYVQDIKKIDIDVKTGTKAKADSY
VPYTIAVNGTSTPILSKLKISNKQLISYKYLNDKVKSVLKSERGISDLDLKFAKQAKYTVYFKNGKKQVVNLKSDIFTPN
LFSAKDIKKIDIDVKQYTKSKKNK
>P13415 ~~~porA~~~Major outer membrane protein P.IA~~~
MRKKLTALVLSALPLAAVADVSLYGEIKAGVEGRNIQAQLTEQPQVTNGVQGNQVKVTKAKSRIRTKISDFGSFIGFKGS
EDLGEGLKAVWQLEQDVSVAGGGASQWGNRESFIGLAGEFGTLRAGRVANQFDDASQAINPWDSNNDVASQLGIFKRHDD
MPVSVRYDSPEFSGFSGSVQFVPAQNSKSAYKPAYYTKDTNNNLTLVPAVVGKPGSDVYYAGLNYKNGGFAGNYAFKYAR
HANVGRNAFELFLIGSATSDEAKGTDPLKNHQVHRLTGGYEEGGLNLALAAQLDLSENGDKAKTKNSTTEIAATASYRFG
NAVPRISYAHGFDLIERGKKGENTSYDQIIAGVDYDFSKRTSAIVSGAWLKRNTGIGNYTQINAASVGLRHKF
>P0A910 ~~~ompA~~~Outer membrane protein A~~~COG2885
MKKTAIAIAVALAGFATVAQAAPKDNTWYTGAKLGWSQYHDTGFINNNGPTHENQLGAGAFGGYQVNPYVGFEMGYDWLG
RMPYKGSVENGAYKAQGVQLTAKLGYPITDDLDIYTRLGGMVWRADTKSNVYGKNHDTGVSPVFAGGVEYAITPEIATRL
EYQWTNNIGDAHTIGTRPDNGMLSLGVSYRFGQGEAAPVVAPAPAPAPEVQTKHFTLKSDVLFNFNKATLKPEGQAALDQ
LYSQLSNLDPKDGSVVVLGYTDRIGSDAYNQGLSERRAQSVVDYLISKGIPADKISARGMGESNPVTGNTCDNVKQRAAL
IDCLAPDRRVEIEVKGIKDVVTQPQA
>P24017 ~~~ompA~~~Outer membrane protein A~~~
MKAIFVLNAAPKDNTWYAGGKLGWSQYHDTGFYGNGFQNNNGPTRNDQLGAGAFGGYQVNPYLGFEMGYDWLGRMAYKGS
VDNGAFKAQGVQLTAKLGYPITDDLDIYTRLGGMVWRADSKGNYASTGVSRSEHDTGVSPVFAGGVEWAVTRDIATRLEY
QWVNNIGDAGTVGTRPDNGMLSLGVSYRFGQEDAAPVVAPAPAPAPEVATKHFTLKSDVLFNFNKATLKPEGQQALDQLY
TQLSNMDPKDGSAVVLGYTDRIGSEAYNQQLSEKRAQSVVDYLVAKGIPAGKISARGMGESNPVTGNTCDNVKARAALID
CLAPDRRVEIEVKGYKEVVTQPQA
>P57041 ~~~porA~~~Major outer membrane protein P.IA~~~
MRKKLTALVLSALPLAAVADVSLYGEIKAGVEGRNYQLQLTEAQAANGGASGQVKVTKVTKAKSRIRTKISDFGSFIGFK
GSEDLGEGLKAVWQLEQDVSVAGGGATQWGNRESFIGLAGEFGTLRAGRVANQFDDASQAIDPWDSNNDVASQLGIFKRH
DDMPVSVRYDSPEFSGFSGSVQFVPAQNSKSAYKPAYWTTVNTGSATTTTFVPAVVGKPGSDVYYAGLNYKNGGFAGNYA
FKYARHANVGRDAFELFLLGSGSDQAKGTDPLKNHQVHRLTGGYEEGGLNLALAAQLDLSENGDKTKNSTTEIAATASYR
FGNAVPRISYAHGFDFIERGKKGENTSYDQIIAGVDYDFSKRTSAIVSGAWLKRNTGIGNYTQINAASVGLRHKF
>P0DH58 ~~~porA~~~Major outer membrane protein P.IA~~~
MRKKLTALVLSALPLAAVADVSLYGEIKAGVEGRNYQLQLTEAQAANGGASGQVKVTKVTKAKSRIRTKISDFGSFIGFK
GSEDLGDGLKAVWQLEQDVSVAGGGATQWGNRESFIGLAGEFGTLRAGRVANQFDDASQAIDPWDSNNDVASQLGIFKRH
DDMPVSVRYDSPEFSGFSGSVQFVPIQNSKSAYTPAYYTKNTNNNLTLVPAVVGKPGSDVYYAGLNYKNGGFAGNYAFKY
ARHANVGRNAFELFLIGSGSDQAKGTDPLKNHQVHRLTGGYEEGGLNLALAAQLDLSENGDKTKNSTTEIAATASYRFGN
AVPRISYAHGFDFIERGKKGENTSYDQIIAGVDYDFSKRTSAIVSGAWLKRNTGIGNYTQINAASVGLRHKF
>E6MXW0 ~~~porA~~~Major outer membrane protein P.IA~~~
MRKKLTALVLSALPLAAVADVSLYGEIKAGVEGRNYQLQLTEAQAANGGASGQVKVTKVTKAKSRIRTKISDFGSFIGFK
GSEDLGDGLKAVWQLEQDVSVAGGGATQWGNRESFIGLAGEFGTLRAGRVANQFDDASQAIDPWDSNNDVASQLGIFKRH
DDMPVSVRYDSPEFSGFSGSVQFVPIQNSKSAYTPAYYTKDTNNNLTLVPAVVGKPGSDVYYAGLNYKNGGFAGNYAFKY
ARHANVGRNAFELFLIGSGSDQAKGTDPLKNHQVHRLTGGYEEGGLNLALAAQLDLSENGDKTKNSTTEIAATASYRFGN
AVPRISYAHGFDFIERGKKGENTSYDQIIAGVDYDFSKRTSAIVSGAWLKRNTGIGNYTQINAASVGLRHKF
>A0A2S4N3N0 ~~~ompA~~~Outer membrane protein A~~~
MKKTAIAIAVALAGFATVAQAAPKDNTWYTGAKLGWSQYHDTGFIPNNGPTHENQLGAGAFGGYQVNPYVGFEMGYDWLG
RMPYKGDNINGAYKAQGVQLTAKLGYPITDDLDIYTRLGGMVWRADTKANVPGGASFKDHDTGVSPVFAGGVEYAITPEI
ATRLEYQWTNNIGDANTIGTRPDNGLLSLGVSYRFGQGEAAPVVAPAPAPEVQTKHFTLKSDVLFNFNKATLKPEGQAAL
DQLYSQLSNLDPKDGSVVVLGYTDRIGSDAYNQGLSERRAQSVVDYLISKGIPADKISARGMGESNPVTGNTCDNVKQRA
ALIDCLAPDRRVEIEVKGIKDVVTQPQA
>Q01969 ~~~omp-alpha~~~Outer membrane protein alpha~~~COG1196
MKRVLLTVAMLSVFFSAMFAFFPDVPKDHWAYEYVWKLWQRGIFIGYPDGEFKGDRYITRYEAATAVSRLLDFIEQKMLA
GASGDLAQVVGNLSDKYMALEEKVNGLTGILDTLASQIGTTQANLTETERELLEKIDAVKEEIEMEFDKEISLNREVVNN
IGLKLGNLSRDYERYKENVDAKISEVNEKLAALEKDLGNKIADLEGIVNLHEKDIINIYNKISSVNEELNNKIAATEEKL
SRKDEEISAMVELHEKDIINLYNKVAALNEDLNKKILDTKAELSAKIESQEKTLNMVYTKLLDTESKLNDEISALKEKDA
EIQKTVDLHEQDIVNLYGKTSSLEEDLNMKYNETNEKIDQVKAELEDKIESVKAYNRNLSILTGAFFGILGLILIAISGK
>Q8ZG77 ~~~ompA~~~Outer membrane protein A~~~COG2885
MKKTAIALAVALVGFATVAQAAPKDNTWYTGGKLGWSQYQDTGSIINNDGPTHKDQLGAGAFFGYQANQYLGFEMGYDWL
GRMPYKGDINNGAFKAQGVQLAAKLSYPVAQDLDVYTRLGGLVWRADAKGSFDGGLDRASGHDTGVSPLVALGAEYAWTK
NWATRMEYQWVNNIGDRETVGARPDNGLLSVGVSYRFGQEDAAAPIVAPTPAPAPIVDTKRFTLKSDVLFGFNKANLKPE
GQQALDQLYAQLSSIDPKDGSVVVLGFADRIGQPAPNLALSQRRADSVRDYLVSKGIPADKITARGEGQANPVTGNTCDN
VKPRAALIECLAPDRRVEIEVKGYKEVVTQPQA
>P38399 ~~~ompA~~~Outer membrane protein A~~~
MKKTAIALAVALVGFATVAQAAPKDNTWYTGGKLGWSQYQDTGSIINNDGPTHKDQLGAGAFFGYQANQYLGFEMGYDWL
GRMPYKGDINNGAFKAQGVQLAAKLSYPVAQDLDVYTRLGGLVWRADAKGSFDGGLDRASGHDTGVSPLVALGAEYAWTK
NWATRMEYQWVNNIGDRETVGARPDNGLLSVGVSYRFGQEDAAAPIVAPTPAPAPIVDTKRFTLKSDVLFGFNKANLKPE
GQQALDQLYAQLSSIDPKDGSVVVLGFADRIGQPAPNLALSQRRADSVRDYLVSKGIPADKITARGEGQANPVTGNTCDN
VKPRAALIECLAPDRRVEIEVKGYKEVVTQPQA
>P30690 ~~~porB~~~Major outer membrane protein P.IB~~~
MKKSLIALTLAALPVAAMADVTLYGTIKAGVETSRSVFHQNGQVTEVTTATGIVDLGSKIGFKGQEDLGNGLKAIWQVEQ
KASIAGTDSGWGNRQSFIGLKGGFGKLRVGRLNSVLKDTGDINPWDSKSDYLGVNKIAEPEARLISVRYDSPEFAGLSGS
VQYALNDNAGRHNSESYHAGFNYKNGGFFVQYGGAYKRHHQVQEGLNIEKYQIHRLVSGYDNDALYASVAVQQQDAKLTD
ASNSHNSQTEVAATLAYRFGNVTPRVSYAHGFKGLVDDADIGNEYDQVVVGAEYDFSKRTSALVSAGWLQEGKGENKFVA
TAGGVGLRHKF
>E6MZM0 ~~~porB~~~Major outer membrane protein P.IB~~~
MKKSLIALTLAALPVAAMADVTLYGTIKAGVETSRSVFHQNGQVTEVTTATGIVDLGSKIGFKGQEDLGNGLKAIWQVEQ
KASIAGTDSGWGNRQSFIGLKGGFGKLRVGRLNSVLKDTGDINPWDSKSDYLGVNKIAEPEARLISVRYDSPEFAGLSGS
VQYALNDNAGRHNSESYHAGFNYKNGGFFVQYGGAYKRHHQVQEGLNIEKYQIHRLVSGYDNDALYASVAVQQQDAKLTD
ASNSHNSQTEVAATLAYRFGNVTPRVSYAHGFKGLVDDADIGNEYDQVVVGAEYDFSKRTSALVSAGWLQEGKGENKFVA
TAGGVGLRHKF
>Q9KKA3 ~~~ompB~~~Outer membrane protein B~~~
MAQKPNFLKKLISAGLVTASTATIVASFAGSAMGAAIQQNRTTNAVATTVDGVGFDQTAVPANVAVPLNAVITAGVNKGI
TLNTPAGSFNGLFLNTANNLDVTVREDTTLGFITNVVNNANHFNLMLNAGKTLTITGQGITNVQAAATKNANNVVAQVNN
GAAIDNNDLQGVGRIDCGAAASTLVFNLANPTTQKAPLILGDNAVIVNGANGTLNVTNGFIKVSSKSFATVNVINIGDGQ
GIMFNTDADNVNTLNLQANGATITFNGTDGTGRLVLLSKNAAATDFNVTGSLGGNLKGIIEFNTVAVNGQLKANAGANAA
VIGTNNGAGRAAGFVVSVDNGKVATIDGQVYAKDMVIQSANAVGQVNFRHIVDVGTDGTTAFKTAASKVAITQNSNFGTT
DFGNLAAQIIVPNTMTLNGNFTGDASNPGNTAGVITFDANGTLASASADANVAVTNNITAIEASGAGVVQLSGTHAAELR
LGNAGSVFKLADGTVINGKVNQTALVGGALAAGTITLDGSATITGDIGNAGGAAALQGITLANDATKTLTLGGANIIGAN
GGTINFQANGGTIKLTSTQNNIVVDFDLAIATDQTGVVDASSLTNAQTLTINGKIGTVGANNKTLGQFNIGSSKTVLSDG
DVAINELVIGNNGAVQFAHNTYLITRTTNAAGQGKIIFNPVVNNNTTLATGTNLGSATNPLAEINFGSKGAANVDTVLNV
GKGVNLYATNITTTDANVGSFIFNAGGTNIVSGTVGGQQGNKFNTVALDNGTTVKFLGNATFNGNTTIAANSTLQIGGNY
TADFVASADGTGIVEFVNTGPITVTLNKQAAPVNALKQITVSGPGNVVINEIGNAGNYHGAVTDTIAFENSSLGAVVFLP
RGIPFNDAGNRIPLTIKSTVGNKTATGFDVPSVIVLGVDSVIADGQVIGDQNNIVGLGLGSDNDIIVNATTLYAGIGTIN
NNQGTVTLSGGIPNTPGTVYGLGTGIGASKFKQVTFTTDYNNLGNIIATNATINDGVTVTTGGIAGIGFDGKITLGSVNG
NGNVRFVDGILSHSTSMIGTTKANNGTVTYLGNAFVGNIGDSDTPVASVRFTGSDGGAGLQGNIYSQVIDFGTYNLGISN
SNVILGGGTTAINGKINLRTNTLTFASGTSTWGNNTSIETTLTLANGNIGNIVILEGAQVNATTTGTTTIKVQDNANANF
SGTQTYTLIQGGARFNGTLGGPNFVVTGSNRFVNYGLIRAANQDYVITRTNNAENVVTNDIANSSFGGAPGVGQNVTTFV
NATNTAAYNNLLLAKNSANSANFVGAIVTDTSAAITNAQLDVAKDIQAQLGNRLGALRYLGTPETAEMAGPEAGAIPAAV
AAGDEAVDNVAYGIWAKPFYTDAHQSKKGGLAGYKAKTTGVVIGLDTLANDNLMIGAAIGITKTDIKHQDYKKGDKTDVN
GFSFSLYGAQQLVKNFFAQGSAIFSLNQVKNKSQRYFFDANGNMSKQIAAGHYDNMTFGGNLTVGYDYNAMQGVLVTPMA
GLSYLKSSDENYKETGTTVANKQVNSKFSDRTDLIVGAKVAGSTMNITDLAVYPEVHAFVVHKVTGRLSKTQSVLDGQVT
PCISQPDRTAKTSYNLGLSASIRSDAKMEYGIGYDAQISSKYTAHQGTLKVRVNF
>Q53020 ~~~ompB~~~Outer membrane protein B~~~COG4625
MAQKPNFLKKIISAGLVTASTATIVAGFSGVAMGAAMQYNRTTNAAATTFDGIGFDQAAGANIPVAPNSVITANANNPIT
FNTPNGHLNSLFLDTANDLAVTINEDTTLGFITNIAQQAKFFNFTVAAGKILNITGQGITVQEASNTINAQNALTKVHGG
AAINANDLSGLGSITFAAAPSVLEFNLINPTTQEAPLTLGANSKIVNGGNGTLNITNGFIQVSDNTFAGIKTINIDDCQG
LMFNSTPDAANTLNLQVGGNTINFNGIDGTGKLVLVSKNGAATEFNVTGTLGGNLKGIIELNTAAVAGKLISQGGAANAV
IGTDNGAGRAAGFIVSVDNGNAATISGQVYAKNMVIQSANAGGQVTFEHIVDVGLGGTTNFKTADSKVIITENSNFGSTN
FGNLDTQIVVPDTKILKGNFIGDVKNNGNTAGVITFNANGALVSASTDPNIAVTNINAIEAEGAGVVELSGIHIAELRLG
NGGSIFKLADGTVINGPVNQNALMNNNALAAGSIQLDGSAIITGDIGNGGVNAALQHITLANDASKILALDGANIIGANV
GGAIHFQANGGTIKLTNTQNNIVVNFDLDITTDKTGVVDASSLTNNQTLTINGSIGTVVANTKTLAQLNIGSSKTILNAG
DVAINELVIENNGSVQLNHNTYLITKTINAANQGQIIVAADPLNTNTTLADGTNLGSAENPLSTIHFATKAANADSILNV
GKGVNLYANNITTNDANVGSLHFRSGGTSIVSGTVGGQQGHKLNNLILDNGTTVKFLGDTTFNGGTKIEGKSILQISNNY
TTDHVESADNTGTLEFVNTDPITVTLNKQGAYFGVLKQVIISGPGNIVFNEIGNVGIVHGIAANSISFENASLGTSLFLP
SGTPLDVLTIKSTVGNGTVDNFNAPIVVVSGIDSMINNGQIIGDKKNIIALSLGSDNSITVNANTLYSGIRTTKNNQGTV
TLSGGMPNNPGTIYGLGLENGSPKLKQVTFTTDYNNLGSIIANNVTINDYVTLTTGGIAGTDFDAKITLGSVNGNANVRF
VDSTFSDPRSMIVATQANKGTVTYLGNALVSNIGSLDTPVASVRFTGNDSGAGLQGNIYSQNIDFGTYNLTILNSNVILG
GGTTAINGEIDLLTNNLIFANGTSTWGDNTSISTTLNVSSGNIGQVVIAEDAQVNATTTGTTTIKIQDNANANFSGTQAY
TLIQGGARFNGTLGAPNFAVTGSNIFVKYELIRDSNQDYVLTRTNDVLNVVTTAVGNSAIANAPGVSQNISRCLESTNTA
AYNNMLLAKDPSDVATFVGAIATDTSAAVTTVNLNDTQKTQDLLSNRLGTLRYLSNAETSDVAGSATGAVSSGDEAEVSY
GVWAKPFYNIAEQDKKGGIAGYKAKTTGVVVGLDTLASDNLMIGAAIGITKTDIKHQDYKKGDKTDINGLSFSLYGSQQL
VKNFFAQGNAIFTLNKVKSKSQRYFFESNGKMSKQIAAGNYDNMTFGGNLIFGYDYNAMPNVLVTPMAGLSYLKSSNENY
KETGTTVANKRINSKFSDRVDLIVGAKVAGSTVNITDIVIYPEIHSFVVHKVNGKLSNSQSMLDGQTAPFISQPDRTAKT
SYNIGLSANIKSDAKMEYGIGYDFNSASKYTAHQGTLKVRVNF
>Q53047 ~~~ompB~~~Outer membrane protein B~~~
MAQKPNFLKKLISAGLVTASTATIVASFAGSAMGAAIQQNRTTNGAATTVDGAGFDQTAAPANVGVALNAVITANANNGI
NFNTPAGSFNGLLLNTANNLAVTVSEDTTLGFITNVVHNAHSFNLTLNAGKTLTITGQGVTNAQAAATKNAQNVVVQFNN
GAAIDNNDLKGVGRIDFGAPASTLVFNLANPTTQKAPLILGDNAVIANGVNGTLNVTNGFIQVSNKSFATVKAINIADGQ
GIIFNTDANNANTLNLQAGGTTINFTGTDGTGRLVLLSKHAAATNFNITGSLGGNLKGVIEFNTVAVDGQLTANAGAANA
VIGTNNGAGRAAGFVVSVDNGKVATIDGQVYAKDMVIQSANATGQVNFRHIVDVGADGTTAFKTAASKVTITQDSNFGNT
DFGNLAAQIKVPNAITLTGNFTGDASNPGNTAGVITFDANGTLESASADANVAVTNNITAIEASGAGVVQLSGTHAAELR
LGNAGSIFKLADGTVINGKVNQTALVGGALAAGTITLDGSATITGDIGNAGGAAALQRITLANDAKKTLTLGGANIIGAG
GGTIDLQANGGTIKLTSTQNNIVVDFDLAIATDQTGVVDASSLTNAQTLTINGKIGTIGANNKTLGQFNIGSSKTVLSNG
NVAINELVIGNDGAVQFAHDTYLITRTTNAAGQGKIIFNPVVNNGTTLAAGTNLGSATNPLAEINFGSKGVNVDTVLNVG
EGVNLYATNITTTDANVGSFVFNAGGTNIVSGTVGGQQGNKFNTVALENGTTVKFLGNATFNGNTTIAANSTLQIGGNYT
ADCVASADGTGIVEFVNTGPITVTLNKQAAPVNALKQITVSGPGNVVINEIGNAGNHHGAVTDTIAFENSSLGAVVFLPR
GIPFNDAGNTMPLTIKSTVGNKTAKGFDVPSVVVLGVDSVIADGQVIGDQNNIVGLGLGSDNGIIVNATTLYAGISTLNN
NQGTVTLSGGVPNTPGTVYGLGTGIGASKFKQVTFTTDYNNLGNIIATNATINDGVTVTTGGIAGIGFDGKITLGSVNGN
GNVRFADGILSNSTSMIGTTKANNGTVTYLGNAFVGNIGDSDTPVASVRFTGSDSGAGLQGNIYSQVIDFGTYNLGIVNS
NIILGGGTTAINGKIDLVTNTLTFASGTSTWGNNTSIETTLTLANGNIGHIVILEGAQVNTTTTGTTTIKVQDNANANFS
GTQTYTLIQGGARFNGTLGSPNFAVTGSNRFVNYSLIRAANQDYVITRTNNAENVVTNDIANSPFGGAPGVDQNVTTFVN
ATNTAAYNNLLLAKNSANSANFVGAIVTDTSAAITNVQLDLAKDIQAQLGNRLGALRYLGTPETAEMAGPEAGAISAAVA
AGDEAIDNVAYGIWAKPFYTDAHQSKKGGLAGYKAKTTGVVIGLDTLANDNLMIGAAIGITKTDIKHQDYKKGDKTDVNG
FSFSLYGAQQLVKNFFAQGSAIFSLNQVKNKSQRYFFDANGNMSKQIAAGHYDNMTFGGNLTVGYDYNAMQGVLVTPMAG
LSYLKSSDENYKETGTTVANKQVNSKFSDRTDLIVGAKVAGSTMNITDLAVYPEVHAFVVHKVTGRLSKTQSVLDGQVTP
CINQPDRTTKTSYNLGLSASIRSDAKMEYGIGYDAQISSKYTAHQGTLKVRVNF
>P96989 ~~~ompB~~~Outer membrane protein B~~~COG4625
MAQKPNFLKKIISAGLVTASTATIVAGFSGVAMGAVMQYNRTTNAAATTVDGAGFDQTGAGVNLPVATNSVITANSNNAI
TFNTPNGNLNSLFLDTANTLAVTINENTTLGFVTNVTKQGNFFNFTIGAGKSLTITGHGITAQQAATTKSAQNVVSKVNA
GAAINDNDLSGVGSIDFTAAPSVLEFNLINPTTQEAPLTLGDNAKIVNGANGILNITNGFVKVSDKTFAGIKTINIGDNQ
GLMFNTTPDAANALNLQGGGNTINFNGRDGTGKLVLVSKNGNATEFNVTGSLGGNLKGVIEFDTTAAAGKLIANGGAANA
VIGTDNGAGRAAGFIVSVDNGNAATISGQVYAKDIVIQSANAGGQVTFEHLVDVGLGGKTNFKTADSKVIITENASFGST
DFGNLAVQIVVPNNKILTGNFIGDAKNNGNTAGVITFNANGTLVSGNTDPNIVVTNIKAIEVEGAGIVQLSGIHGAELRL
GNAGSIFKLADGTVINGPVNQNPLVNNNALAAGSIQLDGSAIITGDIGNGAVNAALQDITLANDASKILTLSGANIIGAN
AGGAIHFQANGGTIQLTSTQNNILVDFDLDVTTDQTGVVDASSLTNNQTLTINGSIGTIGANTKTLGRFNVGSSKTILNA
GDVAINELVMENDGSVHLTHNTYLITKTINAANQGKIIVAADPINTDTALADGTNLGSAESPLSNIHFATKAANGDSILH
IGKGVNLYANNITTTDANVGSLHFRSGGTSIVSGTVGGQQGLKLNNLILDNGTTVKFLGDITFNGGTKIEGKSILQISSN
YITDHIESADNTGTLEFVNTDPITVTLNKQGAYFGVLKQVMVSGPGNIAFNEIGNGVAHAIAVDSISFENASLGASLFLL
SGTPLDVLTIKSTVGNGTVDNFNAPILVVSGIDSMINNGQVIGDQKNIIALSLGSDNSITVNSNTLYAGIRTTKTNQGTV
TLSGGIPNNPGTIYGLGLENGDPKLKQVTFTTDYNNLGSIIATNVTINDDVTLTTGGIAGTDFDGKITLGSINGNANVKF
VDRTFSHPTSMIVSTKANQGTVTYLGNALVGNIGSSDIPVASVRFTGNDSGVGLQGNIHSQNIDFGTYNLTILNSDVILG
GGTTAINGEIDLLTNNLIFANGTSTWGNNTSLSTTLNVSNGNVGQIVIAEGAQVNATTTGTTTIKIQDNANANFSGTQTY
TLIQGGARFNGTLGAPNFDVTGNNIFVKYELIRDANQDYVLTRTNDVLNVVTTAVGNSAIANAPGVHQNIAICLESTDTA
AYNNMLLAKDSSDVATFIGAIATDTGAAVATVNLNDTQKTQDLLGNRLGALRYLSNSETADVGGSETGAVSSGDEAIDQV
SYGVWAKPFYNIAEQDKKGGLAGYKAKTAGVVVGLDTLANDNLMIGAAIGITKTDIKHQDYKKGDKTDIKGLSFSLYGAQ
QLVKNFFAQGSAIFTLNKVKSKSQRYFFDANGKMNKQIAAGNYDNITFGGNLMFGYDYNALQGVLVTPMAGLSYLKSSNE
NYKETGTTVANKRIHSKFSDRIDLIVGAKVTGSAMNINDIVIYPEIHSFVVHKVNGKLSKAQSMLDGQTAPFISQPDRTA
KTSYNIGLSANIRSDAKMEYGIGYDFNAASKYTAHQGTLKVRINF
>A0A4P7TKC8 ~~~ompC1~~~Outer membrane porin C 1~~~
MKVKVLSLLVPALLVAGAANAAEVYNKDGNKLDLYGKVDGLHYFSDDKSVDGDQTYMRLGFKGETQVTDQLTGYGQWEYQ
IQGNSAENENNSWTRVAFAGLKFQDVGSFDYGRNYGVVYDVTSWTDVLPEFGGDTYGSDNFMQQRGNGFATYRSTDFFGL
VDGLNFAVQYQGKNGSPEGEGMTNNGREALRQNGDGVGGSITYDYEGFGIGAAVSSSKRTDDQNFGLNRYDERYIGNGDR
AETYTGGLKYDANNIYLAAQYTQTYNATRVGNLGWANKAQNFEAVAQYQFDFGLRPSLAYLQSKGKNLGVINGRNYDDED
ILKYVDVGATYYFNKNMSTYVDYKINLLDDNQFTRDAGINTDNIVALGLVYQF
>A0A4P7TME1 ~~~ompC2~~~Outer membrane porin C 2~~~
MKLKIVAVVVTGLLAANVAHAAEVYNKDGNKLDLYGKVTALRYFTDDKRDDGDKTYARLGFKGGTQINDQMIGFGHWEYD
FKGYNDEANGSRGNKTRLAYAGLKISEFGSLDYGRNYGVGYDIGSWTDMLPEFGGDTWSQKDVFMTYRTTGLATYRNYDF
FGLIEGLNFAAQYQGKNERTDNSHLYGADYTRANGDGFGISSTYVYDGFGIGAVYTKSDRTNAQERAAANPLNASGKNAE
LWATGIKYDANNIYFAANYDETLNMTTYGDGYISNKAQSFEVVAQYQFDFGLRPSLAYLKSKGRDLGRYADQDMIEYIDV
GATYFFNKNMSTYVDYKINLIDESDFTRAVDIRTDNIVATGITYQF
>P0DQH0 ~~~ompC~~~Outer membrane porin C~~~
MKVKVLSLLVPALLVAGAANAAEVYNKDGNKLDLYGKVDGLHYFSDDKSVDGDQTYMRLGFKGETQVTDQLTGYGQWEYQ
IQGNAPESENNSWTRVAFAGLKFQDIGSFDYGRNYGVVYDVTSWTDVLPEFGGDTYGSDNFMQQRGNGFATYRNTDFFGL
VDGLNFAVQYQGQNGSVSGENDPDFTGHGITNNGRKALRQNGDGVGGSITYDYEGFGVGAAVSSSKRTDAQNTAAYIGNG
DRAETYTGGLKYDANNIYLAAQYTQTYNATRVGSLGWANKAQNFEAVAQYQFDFGLRPSVAYLQSKGKNLGTIGTRNYDD
EDILKYVDVGATYYFNKNMSTYVDYKINLLDDNQFTRDAGINTDNIVALGLVYQF
>Q8CVW1 ~~~ompC~~~Outer membrane porin C~~~COG3203
MKVKVLSLLVPALLVAGAANAAEVYNKDGNKLDLYGKVDGLHYFSDDKSVDGDQTYMRLGFKGETQVTDQLTGYGQWEYQ
IQGNAPESENNSWTRVAFAGLKFQDIGSFDYGRNYGVVYDVTSWTDVLPEFGGDTYGSDNFMQQRGNGFATYRNTDFFGL
VDGLNFAVQYQGQNGSVSGENDPDFTGHGITNNGRKALRQNGDGVGGSITYDYEGFGVGAAVSSSKRTWDQNNTGLIGTG
DRAETYTGGLKYDANNIYLAAQYTQTYNATRVGSLGWANKAQNFEAVAQYQFDFGLRPSVAYLQSKGKNLGVVAGRNYDD
EDILKYVDVGATYYFNKNMSTYVDYKINLLDDNQFTRAAGINTDDIVALGLVYQF
>P06996 ~~~ompC~~~Outer membrane porin C~~~COG3203
MKVKVLSLLVPALLVAGAANAAEVYNKDGNKLDLYGKVDGLHYFSDNKDVDGDQTYMRLGFKGETQVTDQLTGYGQWEYQ
IQGNSAENENNSWTRVAFAGLKFQDVGSFDYGRNYGVVYDVTSWTDVLPEFGGDTYGSDNFMQQRGNGFATYRNTDFFGL
VDGLNFAVQYQGKNGNPSGEGFTSGVTNNGRDALRQNGDGVGGSITYDYEGFGIGGAISSSKRTDAQNTAAYIGNGDRAE
TYTGGLKYDANNIYLAAQYTQTYNATRVGSLGWANKAQNFEAVAQYQFDFGLRPSLAYLQSKGKNLGRGYDDEDILKYVD
VGATYYFNKNMSTYVDYKINLLDDNQFTRDAGINTDNIVALGLVYQF
>P84838 ~~~~~~Outer membrane protein~~~COG2885
MRLRTALLATTLMAAAPVAANATIITGPYVDLGGGYNLVQNQHGHFSNDPANASMLTKSSSQYRHDAGFTGFGAVGWGFG
NGLRLEAEGLYNYSEINHRAPTAATGVTSGHDQSYGGMLNVLYDIDLKQFGIDVPVTPFVGVGAGYLWQNVSPTTTRYSN
GNVSRLGGTNGGFAYQGIVGAAYDIPNMPGLQLTAQYRMVGQAFSDGPFTMTSYTNGVGKSVGHAFFDNRFNHQFILGLR
YAFNTAPPPPPPAPVVVPPAPTPARTYLVFFDWDRSDLTARAREIVAEAAQASTHVQTTRIEVNGYTDNSAAHPGPRGEK
YNMGLSIRRAQSVKAELIRDGVPTGAIDIHGYGEQHPLVPTGPNTREPQNRRVEIILH
>Q48473 ~~~ompC~~~Outer membrane porin C~~~
MKVKVLSLLVPALLVAGAANAAEIYNKDGNKLDLYGKIDGLHYFSDDKDVDGDQTYMRLGVKGETQINDQLTGYGQWEYN
VQANNTESSSDQAWTRLAFAGLKFGDAGSFDYGRNYGVVYDVTSWTDVLPEFGGDTYGSDNFLQSRANGVATYRNSDFFG
LVDGLNFALQYQGKNGSVSGEGATNNGRGALKQNGDGFGTSVTYDIFDGISAGFAYANSKRTDDQNQLLLGEGDHAETYT
GGLKYDANNIYLATQYTQTYNATRAGSLGFANKAQNFEVAAQYQFDFGLRPSVAYLQSKGKDLNGYGDQDILKYVDVGAT
YYFNKNMSTYVDYKINLLDDNSFTRSAGISTDDVVALGLVYQF
>P09888 ~~~piiC~~~Outer membrane protein P.IIC~~~
MQPAKNLLFSSLLFSSLLFSSAARAASEDGGRGPYVQADLAYAAERITHDYPKPTGTGKNKISTVSDYFRNIRTHSVHPR
VSVGYDFGSWRIAADYARYRKWNNNKYSVSIKELLRNDNSASGVRGHLNIQTQKTEHQENGTFHAVSSLGLSTIYDFDTG
SRFKPYIGMRVAYGHVRHQVRSVEQETEIITTYPSNGGGKVSLSSKMPPKSAHHQSNSIRRVGLGVIAGVGFDITPNLTL
DTGYRYHNWGRLENTRFKTHEASLGMRYRF
>P0A264 ~~~ompC~~~Outer membrane porin C~~~COG3203
MKVKVLSLLVPALLVAGAANAAEIYNKDGNKLDLFGKVDGLHYFSDDKGSDGDQTYMRIGFKGETQVNDQLTGYGQWEYQ
IQGNQTEGSNDSWTRVAFAGLKFADAGSFDYGRNYGVTYDVTSWTDVLPEFGGDTYGADNFMQQRGNGYATYRNTDFFGL
VDGLDFALQYQGKNGSVSGENTNGRSLLNQNGDGYGGSLTYAIGEGFSVGGAITTSKRTADQNNTANARLYGNGDRATVY
TGGLKYDANNIYLAAQYSQTYNATRFGTSNGSNPSTSYGFANKAQNFEVVAQYQFDFGLRPSVAYLQSKGKDISNGYGAS
YGDQDIVKYVDVGATYYFNKNMSTYVDYKINLLDKNDFTRDAGINTDDIVALGLVYQF
>A0A0H3NJI9 ~~~ompC~~~Outer membrane porin C~~~
MKVKVLSLLVPALLVAGAANAAEIYNKDGNKLDLFGKVDGLHYFSDDKGSDGDQTYMRIGFKGETQVNDQLTGYGQWEYQ
IQGNQTEGSNDSWTRVAFAGLKFADAGSFDYGRNYGVTYDVTSWTDVLPEFGGDTYGADNFMQQRGNGYATYRNTDFFGL
VDGLDFALQYQGKNGSVSGENTNGRSLLNQNGDGYGGSLTYAIGEGFSVGGAITTSKRTADQNNTANARLYGNGDRATVY
TGGLKYDANNIYLAAQYSQTYNATRFGTSNGSNPSTSYGFANKAQNFEVVAQYQFDFGLRPSVAYLQSKGKDISNGYGAS
YGDQDIVKYVDVGATYYFNKNMSTYVDYKINLLDKNDFTRDAGINTDDIVALGLVYQF
>A0A2S4MYF8 ~~~ompC~~~Outer membrane protein C~~~
MKVKVLSLLVPALLVAGAANAAEVYNKDGNKLDLYGKVDGLHYFSDDKSVDGDQTYMRLGFKGETQVTDQLTGYGQWEYQ
IQGNSAENENNSWTRVAFAGLKFQDVGSFDYGRNYGVVYDVTSWTDVLPEFGGDTYGSDNFMQQRGNGFATYRSTDFFGL
VDGLNFAVQYQGKNGSPEGEGMTNNGREALRQNGDGVGGSITYDYEGFGIGAAVSSSKRTDDQNFGLNRYDERYIGNGDR
AETYTGGLKYDANNIYLAAQYTQTYNATRVGNLGWANKAQNFEAVAQYQFDFGLRPSLAYLQSKGKNLGVINGRNYDDED
ILKYVDVGATYYFNKNMSTYVDYKINLLDDNQFTRDAGINTDNIVALGLVYQF
>D0ZXQ1 ~~~ompD~~~Outer membrane porin protein OmpD~~~
MKLKLVAVAVTSLLAAGVVNAAEVYNKDGNKLDLYGKVHAQHYFSDDNGSDGDKTYARLGFKGETQINDQLTGFGQWEYE
FKGNRTESQGADKDKTRLAFAGLKFADYGSFDYGRNYGVAYDIGAWTDVLPEFGGDTWTQTDVFMTGRTTGVATYRNTDF
FGLVEGLNFAAQYQGKNDRDGAYESNGDGFGLSATYEYEGFGVGAAYAKSDRTNNQVKAASNLNAAGKNAEVWAAGLKYD
ANNIYLATTYSETLNMTTFGEDAAGDAFIANKTQNFEAVAQYQFDFGLRPSIAYLKSKGKNLGTYGDQDLVEYIDVGATY
YFNKNMSTFVDYKINLLDDSDFTKAAKVSTDNIVAVGLNYQF
>A0A0H3NBQ0 ~~~ompD~~~Outer membrane porin OmpD~~~
MRKHAKKIIRIIKMKLKLVAVAVTSLLAAGVVNAAEVYNKDGNKLDLYGKVHAQHYFSDDNGSDGDKTYARLGFKGETQI
NDQLTGFGQWEYEFKGNRTESQGADKDKTRLAFAGLKFADYGSFDYGRNYGVAYDIGAWTDVLPEFGGDTWTQTDVFMTG
RTTGVATYRNTDFFGLVEGLNFAAQYQGKNDRDGAYESNGDGFGLSATYEYEGFGVGAAYAKSDRTNNQVKAASNLNAAG
KNAEVWAAGLKYDANNIYLATTYSETLNMTTFGEDAAGDAFIANKTQNFEAVAQYQFDFGLRPSIAYLKSKGKNLGTYGD
QDLVEYIDVGATYYFNKNMSTFVDYKINLLDDSDFTKAAKVSTDNIVAVGLNYQF
>P37592 ~~~ompD~~~Outer membrane porin protein OmpD~~~
MKLKLVAVAVTSLLAAGVVNAAEVYNKDGNKLDLYGKVHAQHYFSDDNGSDGDKTYARLGFKGETQINDQLTGFGQWEYE
FKGNRTESQGADKDKTRLAFAGLKFADYGSFDYGRNYGVAYDIGAWTDVLPEFGGDTWTQTDVFMTGRTTGVATYRNTDF
FGLVEGLNFAAQYQGKNDRDGAYESNGDGFGLSATYEYEGFGVGAAYAKSDRTNNQVKAASNLNAAGKNAEVWAAGLKYD
ANNIYLATTYSETLNMTTFGEDAAGDAFIANKTQNFEAVAQYQFDFGLRPSIAYLKSKGKNLGTYGDQDLVEYIDVGATY
YFNKNMSTFVDYKINLLDDSDFTKAAKVSTDNIVAVGLNYQF
>A0RZH5 ~~~omp-EA~~~Outer membrane protein Omp-EA~~~
MKRNILAVLIPALLAAGAANAAEIYNKDGNKLDLYGKVKAMRYLSDADSNASNNADKSYTRIGFKGQTLINDQLTGYGQW
EYNFSLSNSESSSDAQSGNKTRLGFAGLKLKDYGSVDYGRNYGVIYDVEAFTDMMPEFGATGYTRTDTYMLTRGNSMLTW
RNSDFFGLVDGLKIALQYQGKNEGSGTRATNVSNGDGYGASLSYKIVEGLTINGAMSSSNRLNANSASSTTSQKMAAYGS
GGRAEAWATGLKYDANGVYLAGTYAETRNTNPFSGASYTFAGNSTATAVSGYANKVQNTELVAQYQFDSGLRPSLAYVQT
KAKDIENGIGDADLSKFVDVAATYYFNKNMSAFVDYKVNLLSDSNKLHLNTDDIVAVGLVYQF
>P0DSD9 ~~~ompF~~~Outer membrane porin F~~~
MMKRNILAVIVPALLVAGTANAAEIYNKDGNKVDLYGKAVGLHYFSKGNGENSYGGNGDMTYARLGFKGETQINSDLTGY
GQWEYNFQGNNSEGADAQTGNKTRLAFAGLKYADVGSFDYGRNYGVVYDALGYTDMLPEFGGDTAYSDDFFVGRVGGVAT
YRNSNFFGLVDGLNFAVQYLGKNERDTARRSNGDGVGGSISYEYEGFGIVGAYGAADRTNLQEESSLGKGKKAEQWATGL
KYDANNIYLAANYGETRNATPITNKFTNTSGFANKTQDVLLVAQYQFDFGLRPSIAYTKSKAKDVEGIGDVDLVNYFEVG
ATYYFNKNMSTYVDYIINQIDSDNKLGVGSDDTVAVGIVYQF
>P02931 ~~~ompF~~~Outer membrane porin F~~~COG3203
MMKRNILAVIVPALLVAGTANAAEIYNKDGNKVDLYGKAVGLHYFSKGNGENSYGGNGDMTYARLGFKGETQINSDLTGY
GQWEYNFQGNNSEGADAQTGNKTRLAFAGLKYADVGSFDYGRNYGVVYDALGYTDMLPEFGGDTAYSDDFFVGRVGGVAT
YRNSNFFGLVDGLNFAVQYLGKNERDTARRSNGDGVGGSISYEYEGFGIVGAYGAADRTNLQEAQPLGNGKKAEQWATGL
KYDANNIYLAANYGETRNATPITNKFTNTSGFANKTQDVLLVAQYQFDFGLRPSIAYTKSKAKDVEGIGDVDLVNYFEVG
ATYYFNKNMSTYVDYIINQIDSDNKLGVGSDDTVAVGIVYQF
>A0A1J0CL75 ~~~~~~Outer membrane porin F~~~
MRVIIMMKRNILAVVIPALMFAGAANAAEMYNKDGNKVDIYGKVDVRHYFADADTKGDHESGDASRARIGFKGETQINKD
LTGFGRFEYEVKTNTTEGENKAKTRLAYAGLKFADFGSLDYGRNYGVIYDTNAWTDVLPLWGGDSMAQTDVYMTSRTTGV
LTYRNTDMFGYVDGLSFALQYQGKNNENTNNKRKDNAEANGDGFGFSTAYNLGWGVTLGGGYSSSARTNWQEHKSDATGK
RAEAWNVGGKFEANNVYLAAMYGETRNMNYYGDGAIANKTQNIELTAQYDFADLGIKPSLGYVQSKGKDLNNGVGGLKDN
NHDLVKYISVGSFYKFNKNMTAVVDYKINLLDEDDFTKANGVATDNVVGLGLTYQF
>Q56113 ~~~ompF~~~Outer membrane porin F~~~COG3203
MMKRKILAAVIPALLAAATANAAEIYNKDGNKLDLYGKAVGRHVWTTTGDSKNADQTYAQIGFKGETQINTDLTGFGQWE
YRTKADRAEGEQQNSNLVRLAFAGLKYAEVGSIDYGRNYGIVYDVESYTDMAPYFSGETWGGAYTDNYMTSRAGGLLTYR
NSDFFGLVDGLSFGIQYQGKNQDNHSINSQNGDGVGYTMAYEFDGFGVTAAYSNSKRTNDQQDRDGNGDRAESWAVGAKY
DANNVYLAAVYAETRNMSIVENTVTDTVEMANKTQNLEVVAQYQFDFGLRPAISYVQSKGKQLNGADGSADLAKYIQAGA
TYYFNKNMNVWVDYRFNLLDENDYSSSYVGTDDQAAVGITYQF
>A0A0H3N9T8 ~~~ompF~~~Outer membrane porin F~~~
MMKRKILAAVIPALLAAATANAAEIYNKDGNKLDLYGKAVGRHVWTTTGDSKNADQTYAQIGFKGETQINTDLTGFGQWE
YRTKADRAEGEQQNSNLVRLAFAGLKYAEVGSIDYGRNYGIVYDVESYTDMAPYFSGETWGGAYTDNYMTSRAGGLLTYR
NSDFFGLVDGLSFGIQYQGKNQDNHSINSQNGDGVGYTMAYEFDGFGVTAAYSNSKRTNDQQDRDGNGDRAESWAVGAKY
DANNVYLAAVYAETRNMSIVENTVTDTVEMANKTQNLEVVAQYQFDFGLRPAISYVQSKGKQLNGAGGSADLAKYIQAGA
TYYFNKNMNVWVDYRFNLLDENDYSSSYVGTDDQAAVGITYQF
>P37432 ~~~ompF~~~Outer membrane porin F~~~
MMKRKILAAVIPALLAAATANAAEIYNKDGNKLDLYGKAVGRHVWTTTGDSKNADQTYAQIGFKGETQINTDLTGFGQWE
YRTKADRAEGEQQNSNLVRLAFAGLKYAEVGSIDYGRNYGIVYDVESYTDMAPYFSGETWGGAYTDNYMTSRAGGLLTYR
NSDFFGLVDGLSFGIQYQGKNQDNHSINSQNGDGVGYTMAYEFDGFGVTAAYSNSKRTNDQQDRDGNGDRAESWAVGAKY
DANNVYLAAVYAETRNMSIVENTVTDTVEMANKTQNLEVVAQYQFDFGLRPAISYVQSKGKQLNGAGGSADLAKYIQAGA
TYYFNKNMNVWVDYRFNLLDENDYSSSYVGTDDQAAVGITYQF
>A0A4P7TN82 ~~~ompF~~~Outer membrane porin F~~~
MMKRNILAVIVPALLVAGTANAAEIYNKDGNKVDLYGKAVGLHYFSKGNGENSYGGNGDMTYARLGFKGETQINSDLTGY
GQWEYNFQGNNSEGADAQTGNKTRLAFAGLKYADVGSFDYGRNYGVVYDALGYTDMLPEFGGDTAYSDDFFVGRVGGVAT
YRNSNFFGLVDGLNFAVQYLGKNERDTARRSNGDGVGGSISYEYEGFGIVGAYGAADRTNLQEAQPLGNGKKAEQWATGL
KYDANNIYLAANYGETRNATPITNKFTNTSGFANKTQDVLLVAQYQFDFGLRPSIAYTKSKAKDVEGIGDVDLVNYFEVG
ATYYFNKNMSTYVDYIINQIDSDNKLGVGSDDTVAVGIVYQF
>P76045 ~~~ompG~~~Outer membrane porin G~~~
MKKLLPCTALVMCAGMACAQAEERNDWHFNIGAMYEIENVEGYGEDMDGLAEPSVYFNAANGPWRIALAYYQEGPVDYSA
GKRGTWFDRPELEVHYQFLENDDFSFGLTGGFRNYGYHYVDEPGKDTANMQRWKIAPDWDVKLTDDLRFNGWLSMYKFAN
DLNTTGYADTRVETETGLQYTFNETVALRVNYYLERGFNMDDSRNNGEFSTQEIRAYLPLTLGNHSVTPYTRIGLDRWSN
WDWQDDIEREGHDFNRVGLFYGYDFQNGLSVSLEYAFEWQDHDEGDSDKFHYAGVGVNYSF
>P29739 ~~~ompH~~~Porin-like protein H~~~COG3203
MKKTLVALAILTAAGSANAGINLYDADGVKTDLSGAAEVQYRQTFKEDSDAELRMDDGDLAVNTTVAISDSLNAVAAVAF
EFEDGKVTNDELWVGVAGDFGTLTAGRQYMLADDAGVGKDYELGGDGIDFVQANGDQVVKYVFDNGQFYGGVGALITETN
PDNNADEASVYEGRLGARFGDFDVRAYLYSGEDVNTDNFDVFGDDKVNVDIDGYQIEAEYIVNAFAFAASFGQVDYELAS
DSSQKIEADTAALAGSYTMNKTTFAVGYTYWSPEAKGTVKKMEEANVFYANVTQQLHSNVKVYGEIGSSDTDNSEFGYVA
GMEVTF
>P76773 ~~~ompL~~~Porin OmpL~~~COG1452
MKKINAIILLSSLTSASVFAGAYVENREAYNLASDQGEVMLRVGYNFDMGAGIMLTNTYNFQREDELKHGYNEIEGWYPL
FKPTDKLTIQPGGLINDKSIGSGGAVYLDVNYKFVPWFNLTVRNRYNHNNYSSTDLSGELDNNDTYEIGTYWNFKITDKF
SYTFEPHYFMRVNDFNSSNGKDHHWEITNTFRYRINEHWLPYFELRWLDRNVEPYHREQNQIRIGTKYFF
>Q52581 ~~~ompL~~~Porin-like protein L~~~COG3203
MNKKLIALAVAAASISSVATAAEVYSDETSSLAVGGRFEARAVLADVNKDENVTNTASSEVSDKSRVRINVAGKTDITED
FYGVGFFEKEFSSADSDNDETRYAYAGVGSQYGQLVYGKADGSLGMLTDFTDIMAYHGNEAGNKLAAADRTDNNLSYVGS
FDLNGDNLTVKANYVFGGSDENEGYSAAAMYAMDMGLGFGAGYGEQDGQSSKNGNEDKTGKQAFGAISYTISDFYFSGLY
QDSRNTVVNNDLIDESTGYEFAAAYTYGKAVFITTYNFLEDSNASGDASDLRDSIAIDGTYYFNKNFRTYASYKFNLLDA
NSSTTKAQASDEFVLGARYDF
>P77747 ~~~ompN~~~Outer membrane porin N~~~COG3203
MKSKVLALLIPALLAAGAAHAAEVYNKDGNKLDLYGKVDGLHYFSDNSAKDGDQSYARLGFKGETQINDQLTGYGQWEYN
IQANNTESSKNQSWTRLAFAGLKFADYGSFDYGRNYGVMYDIEGWTDMLPEFGGDSYTNADNFMTGRANGVATYRNTDFF
GLVNGLNFAVQYQGNNEGASNGQEGTNNGRDVRHENGDGWGLSTTYDLGMGFSAGAAYTSSDRTNDQVNHTAAGGDKADA
WTAGLKYDANNIYLATMYSETRNMTPFGDSDYAVANKTQNFEVTAQYQFDFGLRPAVSFLMSKGRDLHAAGGADNPAGVD
DKDLVKYADIGATYYFNKNMSTYVDYKINLLDEDDSFYAANGISTDDIVALGLVYQF
>P34210 3.4.23.-~~~ompP~~~Outer membrane protease OmpP~~~
MQTKLLAIMLAAPVVFSSQEASASDFFGPEKISTEINLGTLSGKTKERVYEPEEGGRKVSQLDWKYSNAAILKGAVNWEL
NPWLSVGAAGWTTLNSRGGNMVDQDWMDSGTPGTWTDESRHPDTRLNYANEFDLNVKGWFLKESDYRLAIMAGYQESRYS
FNATGGTYIYSENGGFRNETGALPDKIKVIGYKQHFKIPYVGLTGNYRYDNFEFGGAFKYSGWVRGSDNDEHYVRQTTFR
SKVINQNYYSVAVNAGYYITPEAKVYIEGVWSRLTNKKGDTSLYDRSDNTSEHNNNGAGIENYNFITTAGLKYTF
>P0AA16 ~~~ompR~~~DNA-binding dual transcriptional regulator OmpR~~~COG0745
MQENYKILVVDDDMRLRALLERYLTEQGFQVRSVANAEQMDRLLTRESFHLMVLDLMLPGEDGLSICRRLRSQSNPMPII
MVTAKGEEVDRIVGLEIGADDYIPKPFNPRELLARIRAVLRRQANELPGAPSQEEAVIAFGKFKLNLGTREMFREDEPMP
LTSGEFAVLKALVSHPREPLSRDKLMNLARGREYSAMERSIDVQISRLRRMVEEDPAHPRYIQTVWGLGYVFVPDGSKA
>P0AA20 ~~~ompR~~~DNA-binding dual transcriptional regulator OmpR~~~COG0745
MQENYKILVVDDDMRLRALLERYLTEQGFQVRSVANAEQMDRLLTRESFHLMVLDLMLPGEDGLSICRRLRSQSNPMPII
MVTAKGEEVDRIVGLEIGADDYIPKPFNPRELLARIRAVLRRQANELPGAPSQEEAVIAFGKFKLNLGTREMFREDEPMP
LTSGEFAVLKALVSHPREPLSRDKLMNLARGREYSAMERSIDVQISRLRRMVEEDPAHPRYIQTVWGLGYVFVPDGSKA
>A0A4P7TS68 ~~~ompR~~~DNA-binding dual transcriptional regulator OmpR~~~
MQENYKILVVDDDMRLRALLERYLTEQGFQVRSVANAEQMDRLLTRESFHLMVLDLMLPGEDGLSICRRLRSQSNPMPII
MVTAKGEEVDRIVGLEIGADDYIPKPFNPRELLARIRAVLRRQANELPGAPSQEEAVIAFGKFKLNLGTREMFREDEPMP
LTSGEFAVLKALVSHPREPLSRDKLMNLARGREYSAMERSIDVQISRLRRMVEEDPAHPRYIQTVWGLGYVFVPDGSKA
>P09169 3.4.23.49~~~ompT~~~Protease 7~~~COG4571
MRAKLLGIVLTTPIAISSFASTETLSFTPDNINADISLGTLSGKTKERVYLAEEGGRKVSQLDWKFNNAAIIKGAINWDL
MPQISIGAAGWTTLGSRGGNMVDQDWMDSSNPGTWTDESRHPDTQLNYANEFDLNIKGWLLNEPNYRLGLMAGYQESRYS
FTARGGSYIYSSEEGFRDDIGSFPNGERAIGYKQRFKMPYIGLTGSYRYEDFELGGTFKYSGWVESSDNDEHYDPGKRIT
YRSKVKDQNYYSVAVNAGYYVTPNAKVYVEGAWNRVTNKKGNTSLYDHNNNTSDYSKNGAGIENYNFITTAGLKYTF
>A5F934 ~~~ompU~~~Outer membrane protein U~~~COG3203
MNKTLIALAVSAAAVATGAYADGINQSGDKAGSTVYSAKGTSLEVGGRAEARLSLKDGKAQDNSRVRLNFLGKAEINDSL
YGVGFYEGEFTTNDQGKNASNNSLDNRYTYAGIGGTYGEVTYGKNDGALGVITDFTDIMSYHGNTAAEKIAVADRVDNML
AYKGQFGDLGVKASYRFADRNAVDAMGNVVTETNAAKYSDNGEDGYSLSAIYTFGDTGFNVGAGYADQDDQNEYMLAASY
RMENLYFAGLFTDGELAKDVDYTGYELAAGYKLGQAAFTATYNNAETAKKTSADNFAIDATYYFKPNFRSYISYQFNLLD
SDKASKVASEDELAIGLRYDF
>P0C6Q6 ~~~ompU~~~Outer membrane protein U~~~COG3203
MNKTLIALAVSAAAVATGAYADGINQSGDKAGSTVYSAKGTSLEVGGRAEARLSLKDGKAQDNSRVRLNFLGKAEINDSL
YGVGFYEGEFTTNDQGKNASNNSLDNRYTYAGIGGTYGEVTYGKNDGALGVITDFTDIMSYHGNTAAEKIAVADRVDNML
AYKGQFGDLGVKASYRFADRNAVDAMGNVVTETNAAKYSDNGEDGYSLSAIYTFGDTGFNVGAGYADQDDQNEYMLAASY
RMENLYFAGLFTDGELAKDVDYTGYELAAGYKLGQAAFTATYNNAETAKETSADNFAIDATYYFKPNFRSYISYQFNLLD
SDKVGKVASEDELAIGLRYDF
>P06111 ~~~ompV~~~Outer membrane protein OmpV~~~COG3713
MKKIALFITASLIAGNALAAQTYIRNGNIYTHEGQWAAEVGAFGSTDLLKDQDKSYGALLNFGYHGEDFNADLSGLNYRF
FGNTGDIVNLGTYLTGSGVAYDQDSANSVKGMDKRKATVDLGLNADIALGDGTVSTYFQHDILNENKGYKTGVNYFHIID
LGVADLVPFAGISYQSSDYNNYYFGVKDKEATAQRKAYHAGGDFSYNLGYKLVYPINDRWEITQTSAYTRLGSDIAHSPI
VDSANQWLVGATVAYHF
>P0A915 ~~~ompW~~~Outer membrane protein W~~~COG3047
MKKLTVAALAVTTLLSGSAFAHEAGEFFMRAGSATVRPTEGAGGTLGSLGGFSVTNNTQLGLTFTYMATDNIGVELLAAT
PFRHKIGTRATGDIATVHHLPPTLMAQWYFGDASSKFRPYVGAGINYTTFFDNGFNDHGKEAGLSDLSLKDSWGAAGQVG
VDYLINRDWLVNMSVWYMDIDTTANYKLGGAQQHDSVRLDPWVFMFSAGYRF
>P0A917 ~~~ompX~~~Outer membrane protein X~~~COG3637
MKKIACLSALAAVLAFTAGTSVAATSTVTGGYAQSDAQGQMNKMGGFNLKYRYEEDNSPLGVIGSFTYTEKSRTASSGDY
NKNQYYGITAGPAYRINDWASIYGVVGVGYGKFQTTEYPTYKHDTSDYGFSYGAGLQFNPMENVALDFSYEQSRIRSVDV
GTWIAGVGYRF
>P25253 ~~~ompX~~~Outer membrane protein X~~~COG3637
MKKIACLSALAAVLAVSAGTAVAATSTVTGGYAQSDMQGVMNKTNGFNLKYRYEQDNNPLGVIGSFTYTEKDRTENGSYN
KGQYYGITAGPAYRLNDWASIYGVVGVGYGKFQQTENQGLNRTASNSDYGFSYGAGMQFNPIENVALDFSYEQSRIRNVD
VGTWIAGVGYRF
>Q04064 ~~~~~~Outer membrane porin protein BP0840~~~COG3203
MKKTLLAAALLAGFAGAAQAETSVTLYGIIDTGIGYNDVDFKVKGANADDSDFKYNHSRFGMINGVQNGSRWGLRGTEDL
GDGLQAVFQLESGFNSGNGNSAQDGRLFGRQATIGLQSESWGRLDFGRQTNIASKYFGSIDPFGAGFGQANIGMGMSAMN
TVRYDNMVMYQTPSYSGFQFGIGYSFSANDKDADAVNRVGFATADNVRAITTGLRYVNGPLNVALSYDQLNASNNQAQGE
VDATPRSYGLGGSYDFEVVKLALAYARTTDGWFGGQGYPVAVTLPSGDKFGGFGVNTFADGFKANSYMVGLSAPIGGASN
VFGSWQMVDPKLTGGDEKMNVFSLGYTYDLSKRTNLYAYGSYAKNFAFLEDAKSTAVGVGIRHRF
>Q2RI41 1.2.7.10~~~~~~Oxalate oxidoreductase subunit alpha~~~COG0674
MGKVRNISGCVAVAHGVRLADVDVICSYPIRPYTGIMSELARMVADGELDAEFVHGEGEHAQLSVVYGASAAGARVFTGS
SGVGVTYAMEVYSPISGERLPVQMAIADRTLDPPGDFGEEHTDAECCRDQGWIQGWASTPQEALDNTLIYYRVGEDQRVL
LPQYACLDGYFVSHILGPVDIPDEAQVKEFLPPYKNHHVLDPRKPQIIGPQIEPAMGPPLQYQRYQAVKGVHKVLEEACD
EFARIFGRKYDPYLDEYLTDDAEVIIFGQGAHMETAKAVARRLRNLGEKVGVARLRTFRPFPTEQIKERLSKFKAIGVLD
VSANFGISCSGGVLLSELRAALYDYGDKVKTVGFVAGLGGEVVTHDEFYRMFQKLKEIAKTGKVEQTSYWIPFEL
>Q2RI42 1.2.7.10~~~~~~Oxalate oxidoreductase subunit beta~~~COG1013
MLDRIASIKKAPDEEYYVPGHRTCAGCGPALTYRLVAKAAGPNTIFIGPTGCMYVANTSYGCGPWRVPWIHAQITNGGAV
ASGIEAAYKAMIRKKKTDAEFPNIIVMAGDGGAVDIGLQALSAMLYRGHDVLFICYDNESYANTGIQTSPTTPYGANTTF
TPPGEVVPEGKKLFPKDNPKVIAHGHPELKYVATASIGWPVDLMNKVRKGLNQEGPAYIHIHAPCPKGWQFPADKTIEMA
KLAVQTGMFQLYEYENGEYKLSVKVDKRKPVSEYMKLQKRFAHLKPEHIAKMQAFVDARCAEVGITVPVVASNA
>Q2RI40 1.2.7.10~~~~~~Oxalate oxidoreductase subunit delta~~~COG1014
MSTKDLFAEPNLKQITVWARGVVMNKDARDIVVALTEAAAKEGKYVQAWENYVDLPDRIYVPVRAYARISSDPIESKYIY
ENETPDIVVLVEESLIKGVPILKGIRPGSTLVVNTKRSIDTILEFLGDTGNLAQIVTVDANSMAEAVMTLSGAEGATDAT
GIGAGIAAPIAGAVVKATGIVDVENLAAVVKNPAAMRRGYAEAQVRQLPPHEAVEEAAVSATELLRQMPFAGTVPSPVTE
NEGMVTGNWRIQRPIIDREACTECYTCWIYCPDSCITRTEEGPVFNMKYCKGCGLCTAVCPSGALTNVPELDFKD
>O34709 ~~~opcR~~~HTH-type transcriptional repressor OpcR~~~COG1510
MKKTALDIIEHAEEHLIEKIAENMHTFGMPSTVGRVLGIIYMNRKPMTLSELSEATGMSKTRMSQVVREMIDANIAEKVF
EKGVRKDLYDVEQDYYQTFISLFAANWTKVVSKNKVLYKKLNRELSDLLQRDGLTPEAEEKVNQLLNELKEWLHYYDWLS
RLIEFFESEEVFRYVPKTKECSSLK
>P27237 3.4.24.70~~~prlC~~~Oligopeptidase A~~~
MTNPLLTSFSLPPFSAIKPEHVVPAVTKALADCRAAVEGVVAHGAPYSWENLCQPLAEADDVLGRIFSPISHLNSVKNSP
ELREAYEQTLPLLSEYSTWVGQHEGLYNAYRDLRDGDHYATLNTAQKKAVDNALRDFELSGIGLPKEKQQRYGEIATRLS
ELGNQYSNNVLDATMGWTKLITDEAELAGMPESALAAAKAQAEAKEQEGYLLTLDIPSYLPVMTYCDNQALREEMYRAYS
TRASDQGPNAGKWDNSPVMEEILALRHELAQLLGFENYAHESLATKMAENPQQVLDFLTDLAKRARPQGEKELAQLRAFA
KAEFGVEELQPWDIAYYSEKQKQHLYSISDEQLRPYFPENKAVNGLFEVVKRIYGITAKERTDVDVWHPEVRFFELYDEN
NELRGSFYLDLYAREHKRGGAWMDDCVGQMRKADGTLQKPVAYLTCNFNRPVNGKPALFTHDEVITLFHEFGHGLHHMLT
RIETAGVSGISGVPWDAVELPSQFMENWCWEPEALAFISGHYETGEPLPKELLDKMLAAKNYQAALFILRQLEFGLFDFR
LHAEFNPQQGAKILETLFEIKKQVAVVPSPTWGRFPHAFSHIFAGGYAAGYYSYLWADVLAADAYSRFEEEGIFNRETGQ
SFLDNILTRGGSEEPMELFKRFRGREPQLDAMLEHYGIKG
>P0A434 3.1.8.1~~~opd~~~Parathion hydrolase~~~
MQTRRVVLKSAAAAGTLLGGLAGCASVAGSIGTGDRINTVRGPITISEAGFTLTHEHICGSSAGFLRAWPEFFGSRKALA
EKAVRGLRRARAAGVRTIVDVSTFDIGRDVSLLAEVSRAADVHIVAATGLWFDPPLSMRLRSVEELTQFFLREIQYGIED
TGIRAGIIKVATTGKATPFQELVLKAAARASLATGVPVTTHTAASQRDGEQQAAIFESEGLSPSRVCIGHSDDTDDLSYL
TALAARGYLIGLDHIPHSAIGLEDNASASALLGIRSWQTRALLIKALIDQGYMKQILVSNDWLFGFSSYVTNIMDVMDRV
NPDGMAFIPLRVIPFLREKGVPQETLAGITVTNPARFLSPTLRAS
>P0A433 3.1.8.1~~~opd~~~Parathion hydrolase~~~
MQTRRVVLKSAAAAGTLLGGLAGCASVAGSIGTGDRINTVRGPITISEAGFTLTHEHICGSSAGFLRAWPEFFGSRKALA
EKAVRGLRRARAAGVRTIVDVSTFDIGRDVSLLAEVSRAADVHIVAATGLWFDPPLSMRLRSVEELTQFFLREIQYGIED
TGIRAGIIKVATTGKATPFQELVLKAAARASLATGVPVTTHTAASQRDGEQQAAIFESEGLSPSRVCIGHSDDTDDLSYL
TALAARGYLIGLDHIPHSAIGLEDNASASALLGIRSWQTRALLIKALIDQGYMKQILVSNDWLFGFSSYVTNIMDVMDRV
NPDGMAFIPLRVIPFLREKGVPQETLAGITVTNPARFLSPTLRAS
>P75920 2.1.-.-~~~mdoC~~~Glucans biosynthesis protein C~~~COG1835
MNPVPAQREYFLDSIRAWLMLLGIPFHISLIYSSHTWHVNSAESSLWLTLFNDFIHSFRMQVFFVISGYFSYMLFLRYPL
KKWWKVRVERVGIPMLTAIPLLTLPQFIMLQYVKGKAESWPGLSLYDKYNTLAWELISHLWFLLVLVVMTTLCVWIFKRI
RNNLENSDKTNKKFSMVKLSVIFLCLGIGYAVIRRTIFIVYPPILSNGMFNFIVMQTLFYLPFFILGALAFIFPHLKALF
TTPSRGCTLAAALAFVAYLLNQRYGSGDAWMYETESVITMVLGLWMVNVVFSFGHRLLNFQSARVTYFVNASLFIYLVHH
PLTLFFGAYITPHITSNWLGFLCGLIFVVGIAIILYEIHLRIPLLKFLFSGKPVVKRENDKAPAR
>P40120 ~~~mdoD~~~Glucans biosynthesis protein D~~~COG3131
MDRRRFIKGSMAMAAVCGTSGIASLFSQAAFAADSDIADGQTQRFDFSILQSMAHDLAQTAWRGAPRPLPDTLATMTPQA
YNSIQYDAEKSLWHNVENRQLDAQFFHMGMGFRRRVRMFSVDPATHLAREIHFRPELFKYNDAGVDTKQLEGQSDLGFAG
FRVFKAPELARRDVVSFLGASYFRAVDDTYQYGLSARGLAIDTYTDSKEEFPDFTAFWFDTVKPGATTFTVYALLDSASI
TGAYKFTIHCEKSQVIMDVENHLYARKDIKQLGIAPMTSMFSCGTNERRMCDTIHPQIHDSDRLSMWRGNGEWICRPLNN
PQKLQFNAYTDNNPKGFGLLQLDRDFSHYQDIMGWYNKRPSLWVEPRNKWGKGTIGLMEIPTTGETLDNIVCFWQPEKAV
KAGDEFAFQYRLYWSAQPPVHCPLARVMATRTGMGGFSEGWAPGEHYPEKWARRFAVDFVGGDLKAAAPKGIEPVITLSS
GEAKQIEILYIEPIDGYRIQFDWYPTSDSTDPVDMRMYLRCQGDAISETWLYQYFPPAPDKRQYVDDRVMS
>P75785 2.7.-.-~~~opgE~~~Phosphoethanolamine transferase OpgE~~~COG2194
MNLTLKESLVTRSRVFSPWTAFYFLQSLLINLGLGYPFSLLYTAAFTAILLLLWRTLPRVQKVLVGVSSLVAACYFPFAQ
AYGAPNFNTLLALHSTNMEESTEILTIFPWYSYLVGLFIFALGVIAIRRKKENEKARWNTFDSLCLVFSVATFFVAPVQN
LAWGGVFKLKDTGYPVFRFAKDVIVNNNEVIEEQERMAKLSGMKDTWTVTAVKPKYQTYVVVIGESARRDALGAFGGHWD
NTPFASSVNGLIFADYIAASGSTQKSLGLTLNRVVDGKPQFQDNFVTLANRAGFQTWWFSNQGQIGEYDTAIASIAKRAD
EVYFLKEGNFEADKNTKDEALLDMTAQVLAQEHSQPQLIVLHLMGSHPQACDRTQGKYETFVQSKETSCYLYTMTQTDDL
LRKLYDQLRNSGSSFSLVYFSDHGLAFKERGKDVQYLAHDDKYQQNFQVPFMVISSDDKAHRVIKARRSANDFLGFFSQW
TGIKAKEINIKYPFISEKKAGPIYITNFQLQKVDYNHLGTDIFDPKP
>P33136 ~~~mdoG~~~Glucans biosynthesis protein G~~~COG3131
MMKMRWLSAAVMLTLYTSSSWAFSIDDVAKQAQSLAGKGYETPKSNLPSVFRDMKYADYQQIQFNHDKAYWNNLKTPFKL
EFYHQGMYFDTPVKINEVTATAVKRIKYSPDYFTFGDVQHDKDTVKDLGFAGFKVLYPINSKDKNDEIVSMLGASYFRVI
GAGQVYGLSARGLAIDTALPSGEEFPRFKEFWIERPKPTDKRLTIYALLDSPRATGAYKFVVMPGRDTVVDVQSKIYLRD
KVGKLGVAPLTSMFLFGPNQPSPANNYRPELHDSNGLSIHAGNGEWIWRPLNNPKHLAVSSFSMENPQGFGLLQRGRDFS
RFEDLDDRYDLRPSAWVTPKGEWGKGSVELVEIPTNDETNDNIVAYWTPDQLPEPGKEMNFKYTITFSRDEDKLHAPDNA
WVQQTRRSTGDVKQSNLIRQPDGTIAFVVDFTGAEMKKLPEDTPVTAQTSIGDNGEIVESTVRYNPVTKGWRLVMRVKVK
DAKKTTEMRAALVNADQTLSETWSYQLPANE
>P62517 2.4.1.-~~~mdoH~~~Glucans biosynthesis glucosyltransferase H~~~COG2943
MNKTTEYIDAMPIAASEKAALPKTDIRAVHQALDAEHRTWAREDDSPQGSVKARLEQAWPDSLADGQLIKDDEGRDQLKA
MPEAKRSSMFPDPWRTNPVGRFWDRLRGRDVTPRYLARLTKEEQESEQKWRTVGTIRRYILLILTLAQTVVATWYMKTIL
PYQGWALINPMDMVGQDLWVSFMQLLPYMLQTGILILFAVLFCWVSAGFWTALMGFLQLLIGRDKYSISASTVGDEPLNP
EHRTALIMPICNEDVNRVFAGLRATWESVKATGNAKHFDVYILSDSYNPDICVAEQKAWMELIAEVGGEGQIFYRRRRRR
VKRKSGNIDDFCRRWGSQYSYMVVLDADSVMTGDCLCGLVRLMEANPNAGIIQSSPKASGMDTLYARCQQFATRVYGPLF
TAGLHFWQLGESHYWGHNAIIRVKPFIEHCALAPLPGEGSFAGSILSHDFVEAALMRRAGWGVWIAYDLPGSYEELPPNL
LDELKRDRRWCHGNLMNFRLFLVKGMHPVHRAVFLTGVMSYLSAPLWFMFLALSTALQVVHALTEPQYFLQPRQLFPVWP
QWRPELAIALFASTMVLLFLPKLLSILLIWCKGTKEYGGFWRVTLSLLLEVLFSVLLAPVRMLFHTVFVVSAFLGWEVVW
NSPQRDDDSTSWGEAFKRHGSQLLLGLVWAVGMAWLDLRFLFWLAPIVFSLILSPFVSVISSRATVGLRTKRWKLFLIPE
EYSPPQVLVDTDRFLEMNRQRSLDDGFMHAVFNPSFNALATAMATARHRASKVLEIARDRHVEQALNETPEKLNRDRRLV
LLSDPVTMARLHFRVWNSPERYSSWVSYYEGIKLNPLALRKPDAASQ
>P20401 2.4.1.-~~~opgH~~~Glucans biosynthesis glucosyltransferase H~~~
MSNSLPVPMSLNEYLAHLPMSDEQRAELAGCTTFAELHERLSAQPVTDPAEAAQASVGRRLTLTPRDQLEDAEMLGVDAS
GRLCLKATPPIRRTKVVPEPWRTNILVRGWRRLTGKGNPPKPEHDDLPRDLPKARWRTVGSIRRYILLILMLGQTIVAGW
YMKGILPYQGWSLVSLDEITRQTFVQTALQVMPYALQTSILLLFGILFCWVSAGFWTALMGFLELLTGRDKYRISGASAG
NEPIEKGARTALVMPICNEDVPRVFAGLRATFESVAATGDLDRFDFFVLSDTNETDIAVAEQQAWLDVCRETKGFGKIFY
RRRRRRVKRKSGNLDDFCRRWGGDYRYMVVLDADSVMSGECLTSLVRLMEATPDAGIIQTAPRASGMDTLYARMQQFATR
VYGPLFTAGLHFWQLGESHYWGHNAIIRMKPFIEHCALAPLPGKGAFAGAILSHDFVEAALMRRAGWGVWIAYDLPGSYE
ELPPNLLDELKRDRRWCHGNLMNFRLFLVKGMHPVHRAVFLTGVMSYLSAPLWFFFLVLSTALLAVNTLMEPTYFLEPRQ
LYPLWPQWHPEKAVALFSTTIVLLFLPKLLSVILIWAKGAKGFGGKFKVTVSMLLEMLFSVLLAPVRMLFHTRFVLAAFL
GWAATWNSPQRDDDSTPWIEAVKRHGPQTLLGACWALLVFWLNPSFLWWLAPIVVSLMLSIPVSVISSRTNLGVKARDEK
FFLIPEEFEPPQELISTDRYTYENRWHALKQGFIRAVVDPRQNALACALATSRHVRLSRLKWCVWSVSIRHSRSVRQNSA
IRNA
>Q9LCQ7 3.7.1.7~~~pvaB~~~Oxidized polyvinyl alcohol hydrolase~~~
MNQSLGVLRLTRGVIALALASVASGCSSTGADRTAATPAAANPAATEPVKWECPAGYEVKEGLNVDFPHKGMKRAFIVYP
AKNVSGPAPVWVPMTGSVESTNDNLTVARSGANSILADHGYTVIAPVRACANQDPNIRGERCNGPGSNGWNWNPWFEGRA
ADPSGEHWKNDEGPDSSFFVAMVQCVGTKYKLDARRLFLGGISSGGTMTNRALLFRSNFWAGGLPISGEWYVTSDDGTPL
SFDDARAAVAAAPTKIHQGRVGPYPLPAKVGPLIVMTVWGGEKDLWNCTRPDGSRFLCADYRPSTQAGSNFFSAQPDVVH
VACSSTHGHMWPQLNTQEFNRWALDTLASHPKGSDPRSFKLTQPPEGYTCHVGPFTGLY
>Q588Z2 3.7.1.7~~~oph~~~Oxidized polyvinyl alcohol hydrolase~~~
MFKPVVKSRSSRSFCYLAGCLAMVAATLSSTAQAKSEWACPEGFTPKAGLNTDFPSDGKKRAFVVVPPKDSAGGAPVWVP
MVGTVEATNWNLNVPRSGNNAKLAEHGYMVISPVRQCAEQDPNLGAGACNGVGKDGWTWNPWNDGRAPDASGDKYKTDAG
DDVRFLEAMVRCVGTKWKLDRKRLFLGGISAGGTMTNRALLFDSEFWAGGMPISGEWYSTKDDGSTVPFQETRKMVAAAP
AKIWQGRVGPYPLPSKLDPMVVITVWGGEKDLWDCGPPLGLCSDYRPTTQASSNYFSSISNVVHVACSATHGHMWPQVNT
DAFNLWALNTMASHPKGSSPKDFKLTAPPEGYSCKIGRFTDHYK
>P43838 ~~~ompP1~~~Outer membrane protein P1~~~COG2067
MKKFNQSLLATAMLLAAGGANAAAFQLAEVSTSGLGRAYAGEAAIADNASVVATNPALMSLFKTAQFSTGGVYVDSRINM
NGDVTSHATIITSSSGIKAIEGGSASARNVVPGAFVPNLYFVAPVNDKFALGAGMNVNFGLKSEYDDSYDAGIFGGKTDL
SAINLNLSGAYRVTEGLSLGLGVNAVYAKAQVERNAGIIADSVKDNQVKTALTVQQEPLKFLDKYLPSKDTSVVSLQDRA
AWGFGWNAGVMYQFNEANRIGLAYHSKVDIDFTDRTATSVEANVIKAGKKGDLTLTLPDYLELSGFHQLTDKLAVHYSYK
YTHWSRLTKLNASFEDGKKAFDKELQYSNNSRVALGASYNLDEKLTLRAGIAYDQAASRHQRSAAIPDTDRTWYSLGATY
KFTPNLSVDLGYAYLKGKKVHFKEVKTIGDERSLTLNTTANYTSQAHANLYGLNLNYSF
>P10641 ~~~ompP1~~~Outer membrane protein P1~~~
MKKFNQSLLATAMLLAAGGANAAAFQLAEVSTSGLGRAYAGEAAIADNASVVATNPALMSLFKTAQFSTGGVYIDSRINM
NGDVTSYAQIITNQIGMKAIKDGSASQRNVVPGAFVPNLYFVAPVNDKFALGAGMNVNFGLKSEYDDSYDAGVFGGKTDL
SAINLNLSGAYRVTEGLSLGLGVNAVYAKAQVERNAGLIADSVKDNQITSALSTQQEPFRDLKKYLPSKDKSVVSLQDRA
AWGFGWNAGVMYQFNEANRIGLAYHSKVDIDFADRTATSLEANVIKEGKKGNLTFTLPDYLELSGFHQLTDKLAVHYSYK
YTHWSRLTKLHASFEDGKKAFDKELQYSNNSRVALGASYNLYEKLTLRAGIAYDQAASRHHRSAAIPDTDRTWYSLGATY
KFTPNLSVDLGYAYLKGKKVHFKEVKTIGDKRTLTLNTTANYTSQAHANLYGLNLNYSF
>P24141 ~~~oppA~~~Oligopeptide-binding protein OppA~~~COG4166
MKKRWSIVTLMLIFTLVLSACGFGGTGSNGEGKKDSKGKTTLNINIKTEPFSLHPGLANDSVSGGVIRQTFEGLTRINAD
GEPEEGMASKIETSKDGKTYTFTIRDGVKWSNGDPVTAQDFEYAWKWALDPNNESQYAYQLYYIKGAEAANTGKGSLDDV
AVKAVNDKTLKVELNNPTPYFTELTAFYTYMPINEKIAEKNKKWNTNAGDDYVSNGPFKMTAWKHSGSITLEKNDQYWDK
DKVKLKKIDMVMINNNNTELKKFQAGELDWAGMPLGQLPTESLPTLKKDGSLHVEPIAGVYWYKFNTEAKPLDNVNIRKA
LTYSLDRQSIVKNVTQGEQMPAMAAVPPTMKGFEDNKEGYFKDNDVKTAKEYLEKGLKEMGLSKASDLPKIKLSYNTDDA
HAKIAQAVQEMWKKNLGVDVELDNSEWNVYIDKLHSQDYQIGRMGWLGDFNDPINFLELFRDKNGGNNDTGWENPEFKKL
LNQSQTETDKTKRAELLKKAEGIFIDEMPVAPIYFYTDTWVQDENLKGVIMPGTGEVYFRNAYFK
>P23843 ~~~oppA~~~Periplasmic oligopeptide-binding protein~~~COG4166
MTNITKRSLVAAGVLAALMAGNVALAADVPAGVTLAEKQTLVRNNGSEVQSLDPHKIEGVPESNISRDLFEGLLVSDLDG
HPAPGVAESWDNKDAKVWTFHLRKDAKWSDGTPVTAQDFVYSWQRSVDPNTASPYASYLQYGHIAGIDEILEGKKPITDL
GVKAIDDHTLEVTLSEPVPYFYKLLVHPSTSPVPKAAIEKFGEKWTQPGNIVTNGAYTLKDWVVNERIVLERSPTYWNNA
KTVINQVTYLPIASEVTDVNRYRSGEIDMTNNSMPIELFQKLKKEIPDEVHVDPYLCTYYYEINNQKPPFNDVRVRTALK
LGMDRDIIVNKVKAQGNMPAYGYTPPYTDGAKLTQPEWFGWSQEKRNEEAKKLLAEAGYTADKPLTINLLYNTSDLHKKL
AIAASSLWKKNIGVNVKLVNQEWKTFLDTRHQGTFDVARAGWCADYNEPTSFLNTMLSNSSMNTAHYKSPAFDSIMAETL
KVTDEAQRTALYTKAEQQLDKDSAIVPVYYYVNARLVKPWVGGYTGKDPLDNTYTRNMYIVKH
>P06202 ~~~oppA~~~Periplasmic oligopeptide-binding protein~~~
MSNITKKSLIAAGILTALIAASAATAADVPAGVQLADKQTLVRNNGSEVQSLDPHKIEGVPESNVSRDLFEGLLISDVEG
HPSPGVAEKWENKDFKVWTFHLRENAKWSDGTPVTAHDFVYSWQRLADPNTASPYASYLQYGHIANIDDIIAGKKPATDL
GVKALDDHTFEVTLSEPVPYFYKLLVHPSVSPVPKSAVEKFGDKWTQPANIVTNGAYKLKNWVVNERIVLERNPQYWDNA
KTVINQVTYLPISSEVTDVNRYRSGEIDMTYNNMPIELFQKLKKEIPNEVRVDPYLCTYYYEINNQKAPFNDVRVRTALK
LALDRDIIVNKVKNQGDLPAYSYTPPYTDGAKLVEPEWFKWSQQKRNEEAKKLLAEAGFTADKPLTFDLLYNTSDLHKKL
AIAVASIWKKNLGVNVNLENQEWKTFLDTRHQGTFDVARAGWCADYNEPTSFLNTMLSDSSNNTAHYKSPAFDKLIADTL
KVADDTQRSELYAKAEQQLDKDSAIVPVYYYVNARLVKPWVGGYTGKDPLDNIYVKNLYIIKH
>P24138 ~~~oppB~~~Oligopeptide transport system permease protein OppB~~~COG0601
MLKYIGRRLVYMIITLFVIVTVTFFLMQAAPGGPFSGEKKLPPEIEANLNAHYGLDKPLFVQYVSYLKSVAMWDFGPSFK
YKGQSVNDLISSGFPVSFTLGAEAILLALALGVLFGVIAALYHNKWQDYTVAILTIFGISVPSFIMAAVLQYVFSMKLGL
FPVAGWDSWAYTFLPSIALASMPMAFIARLSRSSMIEVLNSDYIRTAKAKGLSRPAVTVRHAIRNALLPVVTYMGPMAAQ
VLTGSFIIETIFGIPGLGAHFVNSITNRDYTVIMGVTVFFSVILLLCVLIVDVLYGIIDPRIKLSKAKKGA
>P0AFH2 ~~~oppB~~~Oligopeptide transport system permease protein OppB~~~COG0601
MLKFILRRCLEAIPTLFILITISFFMMRLAPGSPFTGERTLPPEVMANIEAKYHLNDPIMTQYFSYLKQLAHGDFGPSFK
YKDYSVNDLVASSFPVSAKLGAAAFFLAVILGVSAGVIAALKQNTKWDYTVMGLAMTGVVIPSFVVAPLLVMIFAIILHW
LPGGGWNGGALKFMILPMVALSLAYIASIARITRGSMIEVLHSNFIRTARAKGLPMRRIILRHALKPALLPVLSYMGPAF
VGIITGSMVIETIYGLPGIGQLFVNGALNRDYSLVLSLTILVGALTILFNAIVDVLYAVIDPKIRY
>P08005 ~~~oppB~~~Oligopeptide transport system permease protein OppB~~~
MLKFILRRCLEAIPTLFILITISFFMMRLAPGSPFTGERALPPEVLANIEAKYHLNDPIMTQYFSYLKQLAHGDFGPSFK
YKDYTVNDLVAASFPVSAKLGAAAFLLAVIIGVSAGVIAALKQNTRWDYTVMGFAMTGVVIPSFVVAPLLVMVFAITLQW
LPGGGWNGGALKFMILPMVALSLAYIASIARITRGSMIEVLHSNFIRTARAKGLPMRRIIFRHALKPALLPVLSYMGPAF
VGIITGSMVIETIYGLPGIGQLFVNGALNRDYSLVLSLTILVGALTILFNAIVDVLYAVIDPKIRY
>P24139 ~~~oppC~~~Oligopeptide transport system permease protein OppC~~~COG1173
MQNIPKNMFEPAAANAGDAEKISKKSLSLWKDAMLRFRSNKLAMVGLIIIVLIILMAIFAPMFSRYDYSTTNLLNADKPP
SKDHWFGTDDLGRDIFVRTWVGARISIFIGVAAAVLDLLIGVIWGSISGFRGGRTDEIMMRIADILWAVPSLLMVILLMV
VLPKGLFTIIIAMTITGWINMARIVRGQVLQLKNQEYVLASQTLGAKTSRLLFKHIVPNAMGSILVTMTLTVPTAIFTEA
FLSYLGLGVPAPLASWGTMASDGLPALTYYPWRLFFPAGFICITMFGFNVVGDGLRDALDPKLRK
>P0AFH6 ~~~oppC~~~Oligopeptide transport system permease protein OppC~~~COG1173
MMLSKKNSETLENFSEKLEVEGRSLWQDARRRFMHNRAAVASLIVLVLIALFVILAPMLSQFAYDDTDWAMMSSAPDMES
GHYFGTDSSGRDLLVRVAIGGRISLMVGVAAALVAVVVGTLYGSLSGYLGGKVDSVMMRLLEILNSFPFMFFVILLVTFF
GQNILLIFVAIGMVSWLDMARIVRGQTLSLKRKEFIEAAQVGGVSTSGIVIRHIVPNVLGVVVVYASLLVPSMILFESFL
SFLGLGTQEPLSSWGALLSDGANSMEVSPWLLLFPAGFLVVTLFCFNFIGDGLRDALDPKDR
>P08006 ~~~oppC~~~Oligopeptide transport system permease protein OppC~~~
MMLSKKNSETLENFSEKLEVEGRSLWQDARRRFMHNRAAVASLIVLFLIALFVTVAPMLSQFTYFDTDWGMMSSAPDMAS
GHYFGTDSSGRDLLVRVAIGGRISLMVGIAAALVAVIVGTLYGSLSGYLGGKIDSVMMRLLEILNSFPFMFFVILLVTFF
GQNILLIFVAIGMVSWLDMARIVRGQTLSLKRKEFIEAAQVGGVSTASIVIRHIVPNVLGVVVVYASLLVPSMILFESFL
SFLGLGTQEPLSSWGALLSDGANSMEVSPWLLLFPAGFLVVTLFCFNFIGDGLRDALDPKDR
>P24136 ~~~oppD~~~Oligopeptide transport ATP-binding protein OppD~~~COG0444
MIRVTRLLEVKDLAISFKTYGGEVQAIRGVNFHLDKGETLAIVGESGSGKSVTSQAIMKLIPMPPGYFKRGEILFEGKDL
VPLSEKEMQNVRGKEIGMIFQDPMTSLNPTMKVGKQITEVLFKHEKISKEAAKKRAVELLELVGIPMPEKRVNQFPHEFS
GGMRQRVVIAMALAANPKLLIADEPTTALDVTIQAQILELMKDLQKKIDTSIIFITHDLGVVANVADRVAVMYAGQIVET
GTVDEIFYDPRHPYTWGLLASMPTLESSGEEELTAIPGTPPDLTNPPKGDAFALRSSYAMKIDFEQEPPMFKVSDTHYVK
SWLLHPDAPKVEPPEAVKAKMRKLANTFEKPVLVREVE
>P76027 ~~~oppD~~~Oligopeptide transport ATP-binding protein OppD~~~COG0444
MSVIETATVPLAQQQADALLNVKDLRVTFSTPDGDVTAVNDLNFSLRAGETLGIVGESGSGKSQTAFALMGLLAANGRIG
GSATFNGREILNLPEHELNKLRAEQISMIFQDPMTSLNPYMRVGEQLMEVLMLHKNMSKAEAFEESVRMLDAVKMPEARK
RMKMYPHEFSGGMRQRVMIAMALLCRPKLLIADEPTTALDVTVQAQIMTLLNELKREFNTAIIMITHDLVVVAGICDKVL
VMYAGRTMEYGNARDVFYQPVHPYSIGLLNAVPRLDAEGETMLTIPGNPPNLLRLPKGCPFQPRCPHAMEICSSAPPLEE
FTPGRLRACFKPVEELL
>P24137 ~~~oppF~~~Oligopeptide transport ATP-binding protein OppF~~~COG4608
MTEKLLEIKHLKQHFVTPRGTVKAVDDLSFDIYKGETLGLVGESGCGKSTTGRSIIRLYEATDGEVLFNGENVHGRKSRK
KLLEFNRKMQMIFQDPYASLNPRMTVADIIAEGLDIHKLAKTKKERMQRVHELLETVGLNKEHANRYPHEFSGGQRQRIG
IARALAVDPEFIIADEPISALDVSIQAQVVNLMKELQKEKGLTYLFIAHDLSMVKYISDRIGVMYFGKLVELAPADELYE
NPLHPYTKSLLSAIPLPDPDYERNRVRQKYDPSVHQLKDGETMEFREVKPGHFVMCTEAEFKAFS
>O33361 ~~~OprA~~~Osmosensory protein A~~~COG1366
MTTTIPTSKSACSVTTRPGNAAVDYGGAQIRAYLHHLATVVTIRGEIDAANVEQISEHVRRFSLGTNPMVLDLSELSHFS
GAGISLLCILDEDCRAAGVQWALVASPAVVEQLGGRCDQGEHESMFPMARSVHKALHDLADAIDRRRQLVLPLISRSA
>Q51397 ~~~oprJ~~~Outer membrane protein OprJ~~~
MRKPAFGVSALLIALTLGACSMAPTYERPAAPVADSWSGAAAQRQGAAIDTLDWKSFIVDAELRRLVDMALDNNRSLRQT
LLDIEAARAQYRIQRADRVPGLNAAATGNRQRQPADLSAGNRSEVASSYQVGLALPEYELDLFGRVKSLTDAALQQYLAS
EEAARAARIALVAEVSQAYLSYDGALRRLALTRQTLVSREYSFALIDQRRAAGAATALDYQEALGLVEQARAEQERNLRQ
KQQAFNALVLLLGSDDAAQAIPRSPGQRPKLLQDIAPGTPSELIERRPDILAAEHRLRARNADIGAARAAFFPRISLTGS
FGTSSAEMSGLFDGGSRSWSFLPTLTLPIFDGGRNRANLSLAEARKDSAVAAYEGTIQTAFREVADALAASDTLRREEKA
LRALANSSNEALKLAKARYESGVDNHLRYLDAQRSSFLNEIAFIDGSTQRQIALVDLFRALGGGWDEGRSLVVHRGGRS
>Q51487 ~~~oprM~~~Outer membrane protein OprM~~~
MKRSFLSLAVAAVVLSGCSLIPDYQRPEAPVAAAYPQGQAYGQNTGAAAVPAADIGWREFFRDPQLQQLIGVALENNRDL
RVAALNVEAFRAQYRIQRADLFPRIGVDGSGTRQRLPGDLSTTGSPAISSQYGVTLGTTAWELDLFGRLRSLRDQALEQY
LATEQAQRSAQTTLVASVATAYLTLKADQAQLQLTKDTLGTYQKSFDLTQRSYDVGVASALDLRQAQTAVEGARATLAQY
TRLVAQDQNALVLLLGSGIPANLPQGLGLDQTLLTEVPAGLPSDLLQRRPDILEAEHQLMAANASIGAARAAFFPSISLT
ANAGTMSRQLSGLFDAGSGSWLFQPSINLPIFTAGSLRASLDYAKIQKDINVAQYEKAIQTAFQEVADGLAARGTFTEQL
QAQRDLVKASDEYYQLADKRYRTGVDNYLTLLDAQRSLFTAQQQLITDRLNQLTSEVNLYKALGGGWNQQTVTQQQTAKK
EDPQA
>P46920 7.6.2.9~~~opuAA~~~Glycine betaine transport ATP-binding protein OpuAA~~~COG0517
MSVDEKPIKIKVEKVSKIFGKQTKKAVQMLANGKTKKEILKATGSTVGVNQADFEVYDGEIFVIMGLSGSGKSTLVRMLN
RLIEPTAGNIYIDGDMITNMSKDQLREVRRKKISMVFQKFALFPHRTILENTEYGLELQGVDKQERQQKALESLKLVGLE
GFEHQYPDQLSGGMQQRVGLARALTNDPDILLMDEAFSALDPLIRKDMQDELLDLHDNVGKTIIFITHDLDEALRIGDRI
VLMKDGNIVQIGTPEEILMNPSNEYVEKFVEDVDLSKVLTAGHIMKRAETVRIDKGPRVALTLMKNLGISSIYAVDKQKK
LLGVIYASDAKKAAESDLSLQDILNTEFTTVPENTYLTEIFDVVSDANIPIAVVDEKQRMKGIVVRGALIGALAGNNEYI
NAEGTNEQTQDPSAQEVK
>P46921 ~~~opuAB~~~Glycine betaine transport system permease protein OpuAB~~~COG4176
MDRLPRIPLADIIDRFVDWITMTFGGFFDGIANGLAAFVNGIVTGLGFIPSILLTIIFAALAWWISTRGIALFTLIGFLL
IDYLGYWDPMLQTLALVLTSVIISIVVGVPIGIWASQKETVRRIVTPILDLMQTMPAFVYLLPAIFFFNIGVVPGVVASV
IFAMPPTIRMTVLGIKQVPADLIEATEAFGSTTAQRLFKVQLPLATKTILAGINQSIMLALSMVVIAAMVGAPGLGSEVY
SAVTQLKTGVGVEAGIAIVIVAITLDRITQNIKVKKKSRGNA
>P46922 ~~~opuAC~~~Glycine betaine-binding protein OpuAC~~~COG2113
MLKKIIGIGVSAMLALSLAACGSENDENASAAEQVNKTIIGIDPGSGIMSLTDKAMKDYDLNDWTLISASSAAMTATLKK
SYDRKKPIIITGWTPHWMFSRYKLKYLDDPKQSYGSAEEIHTITRKGFSKEQPNAAKLLSQFKWTQDEMGEIMIKVEEGE
KPAKVAAEYVNKHKDQIAEWTKGVQKVKGDKINLAYVAWDSEIASTNVIGKVLEDLGYEVTLTQVEAGPMWTAIATGSAD
ASLSAWLPNTHKAYAAKYKGKYDDIGTSMTGVKMGLVVPQYMKNVNSIEDLKK
>Q45460 ~~~opuBA~~~Choline transport ATP-binding protein OpuBA~~~COG0517
MLTLENVSKTYKGGKKAVNNVNLKIAKGEFICFIGPSGCGKTTTMKMINRLIEPSAGKIFIDGENIMDQDPVELRRKIGY
VIQQIGLFPHMTIQQNISLVPKLLKWPEQQRKERARELLKLVDMGPEYVDRYPHELSGGQQQRIGVLRALAAEPPLILMD
EPFGALDPITRDSLQEEFKKLQKTLHKTIVFVTHDMDEAIKLADRIVILKAGEIVQVGTPDDILRNPADEFVEEFIGKER
LIQSSSPDVERVDQIMNTQPVTITADKTLSEAIQLMRQERVDSLLVVNDERVLQGYVDVEIIDQCRKKANLVSEVLHEDI
YTVLGGTLLRDTVRKILKRGVKYVPVVDEDRRLIGIVTRASLVDIVYDSLWGEEKQLAALS
>Q45461 ~~~opuBB~~~Choline transport system permease protein OpuBB~~~COG1174
MHHIVQFLQTNGGELLYKTYEHITISLIAVILGVLVAVPLGVVLTRMKKGAGTIIGIVNIIQTLPSLAILAFFIPLLGVG
KVPAIVALFFYSVLPILRNTYTGIRGVNKNLLESGKGIGMTPAEQVRLVELPLAAPVIMAGIRTSTIYLIGWATLASFIG
GGGLGDYIFIGLNLYQPEYIIGGAVPVTILAIVIDYVLAVAERKLTPAGMQRLKELS
>Q45462 ~~~opuBC~~~Choline-binding protein~~~COG1732
MKRKYLKLMIGLALAATLTLSGCSLPGLSAAADQTIKIGAQSMSESEIIASMLGQLIEHHTDLKTTTIKNLGSNAVQQQA
LMNGEIDIAATRYTGDALTGTLRMEPEKDPDKALALTQREFKKRYDLKWYDSYGFDNTYAFTVSKELADQYHLETVSDVK
KWAPQLKLGVDNYWMKLKGNGYQDFTKTYGMTFGGTYPMQIGLVYDAVKSGKMDIVLAYSTDGRIKSYGLKMLKDDKQFF
PPYDCSPVVPEKVLKEHPELEGIIKKMLGKIDTATMQELNYEVDGNLKEPSVVAKEYLEKHRYFES
>P39775 ~~~opuBD~~~Choline transport system permease protein OpuBD~~~COG1174
MNVLEQLMTYYAQNGSYVMDEFGRHFLMSAYGVLFAAVVGVPAGILIAHFRRLSAWVFAVTNVIQTIPALAMLAVLMLVM
GLGANTVILSLFLYSLLPIIRNTYTGIISIEHAYLESGKAMGMTKFQVLRMVELPLALSVIMAGLRTALVIAIGITAIGT
FVGAGGLGDMIVRGSNATNGTAIILAGAIPTAVMAVGADLLMAWLERALSPVKKKRTGAKHVQSAA
>O34992 ~~~opuCA~~~Glycine betaine/carnitine/choline transport ATP-binding protein OpuCA~~~COG0517
MLKLEQVSKVYKGGKKAVNSIDLDIAKGEFICFIGPSGCGKTTTMKMINRLIEPSSGRIFIDGENIMEQDPVELRRKIGY
VIQQIGLFPHMTIQQNISLVPKLLKWPEEKRKERARELLKLVDMGPEYLDRYPHELSGGQQQRIGVLRALAAEPPLILMD
EPFGALDPITRDSLQEEFKKLQRTLNKTIVFVTHDMDEAIKLADRIVILKAGEIVQVGTPDEILRNPANEFVEEFIGKER
LIQSRPDIERVEQMMNRTPVTVSADKTLSQAIQLMREKRVDSLLVVDRQNVLKGYVDVEMIDQNRKKASIVGDVYRSDIY
TVQKGALLRDTVRKILKQGIKYVPVVDEQNHLAGIVTRASLVDIVYDSIWGDEENQLMTI
>G2JZ44 7.6.2.9~~~opuCA~~~Carnitine transport ATP-binding protein OpuCA~~~
MLKFEHVTKTYKGGKKAVNDLTLNIDKGEFVCFIGPSGCGKTTTMKMINRLIEPTEGKIFINDKDIMAEDPVKLRRSIGY
VIQQIGLMPHMTIRENIVLVPKLLKWSEEKKQERAKELIKLVDLPEEFLDRYPYELSGGQQQRIGVLRALAAEQNLILMD
EPFGALDPITRDSLQEEFKNLQKELGKTIIFVTHDMDEAIKLADRIVIMKDGEIVQFDTPDEILRNPANSFVEDFIGKDR
LIEAKPDVTQVAQIMNTNPVSITADKSLQAAITVMKEKRVDTLLVVDEGNVLKGFIDVEQIDLNRRTATSVMDIIEKNVF
YVYEDTLLRDTVQRILKRGYKYIPVVDKDKRLVGIVTRASLVDIVYDSIWGTLEDATENQEEQADSKTTEPEMKQEG
>Q9KHT9 7.6.2.9~~~opuCA~~~Carnitine transport ATP-binding protein OpuCA~~~COG0517
MLKFEHVTKTYKGGKKAVNDLTLNIDKGEFVCFIGPSGCGKTTTMKMINRLIEPTEGKIFINDKDIMAEDPVKLRRSIGY
VIQQIGLMPHMTIRENIVLVPKLLKWSEEKKQERAKELIKLVDLPEEFLDRYPYELSGGQQQRIGVLRALAAEQNLILMD
EPFGALDPITRDSLQEEFKNLQKELGKTIIFVTHDMDEAIKLADRIVIMKDGEIVQFDTPDEILRNPANSFVEDFIGKDR
LIEAKPDVTQVAQIMNTNPVSITADKSLQAAITVMKEKRVDTLLVVDEGNVLKGFIDVEQIDLNRRTATSVMDIIEKNVF
YVYEDTLLRDTVQRILKRGYKYIPVVDKDKRLVGIVTRASLVDIVYDSIWGTLEDATENQEEQADSKTTEPEMKQEG
>O34878 ~~~opuCB~~~Glycine betaine/carnitine/choline transport system permease protein OpuCB~~~COG1174
MNQMMTFLQTNGGELLYKTGEHLYISLIAVVLGIIVAVPLGVALTRMKKGAGAVIGFVNIVQTLPSLAILAFFIPLLGVG
KVPAIVALFFYSVLPILRNTYTGIKGVNKNLLESGKGIGMTGWEQIRLVEIPLAIPIIMAGIRTSTIYLIGWATLASFIG
GGGLGDYIFIGLNLYQPEYIIGGAVPVTILAIIIDYVLAVTERKVTPKGLQGMKEVS
>G2JZ43 ~~~opuCB~~~Carnitine transport permease protein OpuCB~~~
MDAIVTFFQENGHNLLVQTWQHLFISLSAVILGIAVAVPTGILLTRSPKVANFVIGVVSVLQTVPSLAILAFIIPFLGVG
TLPAIIALFIYALLPILRNTFIGVRGVDKNLIESGRGMGMTNWQLIVNVEIPNSISVIMAGIRLSAVYVIAWATLASYIG
AGGLGDFIFNGLNLYRPDLILGGAIPVTILALVVEFALGKLEYRLTPKAIREAREGGE
>Q9KHT8 ~~~opuCB~~~Carnitine transport permease protein OpuCB~~~COG1174
MDAIVTFFQENGHNLLVQTWQHLFISLSAVILGIAVAVPTGILLTRSPKVANFVIGVVSVLQTVPSLAILAFIIPFLGVG
TLPAIIALFIYALLPILRNTFIGVRGVDKNLIESGRGMGMTNWQLIVNVEIPNSISVIMAGIRLSAVYVIAWATLASYIG
AGGLGDFIFNGLNLYRPDLILGGAIPVTILALVVEFALGKLEYRLTPKAIREAREGGE
>O32243 ~~~opuCC~~~Glycine betaine/carnitine/choline-binding protein OpuCC~~~COG1732
MTKIKWLGAFALVFVMLLGGCSLPGLGGASDDTIKIGAQSMTESEIVANMIAQLIEHDTDLNTALVKNLGSNYVQHQAML
GGDIDISATRYSGTDLTSTLGKEAEKDPKKALNIVQNEFQKRFSYKWFDSYGFDNTYAFTVTKKFAEKEHINTVSDLKKN
ASQYKLGVDNAWLKRKGDGYKGFVSTYGFEFGTTYPMQIGLVYDAVKNGKMDAVLAYSTDGRIKAYDLKILKDDKRFFPP
YDCSPVIPEKVLKEHPELEGVINKLIGQIDTETMQELNYEVDGKLKEPSVVAKEFLEKHHYFD
>G2JZ42 ~~~opuCC~~~Carnitine transport binding protein OpuCC~~~
MKKKFIALFSVLLLTSSLFLSSCSLPGLGGSSKDTIRIGAMATTESQIVSNILKELIEHDTGLKVEIVNNLGSTIVQHQA
MLNGDVDITATRYTGTDLVGPLGEEAIKDPEKALAAVKKGFEERFHQTWFDSYGFANTYVFMVRQDTAKKYNLNTVSDMR
KVENELTAGVDNSWMEREGDGYKAFSKAYDIEFKKIFPMQIGLIYTALKNNQMDVALGYSTDGRIPTYNLKLLKDDKKFF
PPYDASALATDEILKKHPELKTTINKLKGKISTEEMQKLNYEADGKLKEPSIVAQEFLQKNNYFEGKN
>Q9KHT7 ~~~opuCC~~~Carnitine transport binding protein OpuCC~~~COG1732
MKKKFIALFSVLLLTSSLFLSSCSLPGLGGSSKDTIRIGAMATTESQIVSNILKELIEHDTGLKVEIVNNLGSTIVQHQA
MLNGDVDITATRYTGTDLVGPLGEEAIKDPEKALAAVKKGFEERFHQTWFDSYGFANTYVFMVRQDTAKKYNLNTVSDMR
KVENELTAGVDNSWMEREGDGYKAFSKAYDIEFKKIFPMQIGLIYTALKNNQMDVALGYSTDGRIPTYNLKLLKDDKKFF
PPYDASALATDEILKKHPELKTTINKLKGKISTEEMQKLNYEADGKLKEPSIVAQEFLQKNNYFEGKN
>O34742 ~~~opuCD~~~Glycine betaine/carnitine/choline transport system permease protein OpuCD~~~COG1174
MEVLQQLGTYYSQNGGYVLQEFYRHFLMSVYGVLFAAIVGIPLGILIARYRRLSGWVFAVTNVIQTIPALAMLAVLMLVM
GLGANTVILSLFLYSLLPIIRNTYTGIISIEHAYLESGKAMGMTKFQVLRMVELPLALSVIMAGLRTALVIAIGITAIGT
FVGAGGLGDIIVRGSNATNGTAIILAGAIPTALMAVIADLVMGWLERALSPIKKKKGNFIIADRKTTSI
>G2JZ41 ~~~opuCD~~~Carnitine transport permease protein OpuCD~~~
MDTLKQLIDYYQTNGSYVMEEFWRHFLMSAYGVIFAAIIAIPLGVYIARKKRLAGWVIQIANIIQTIPALAMLAVLMLIM
GLGTNTVVLSLFLYSLLPILKNTYTGIRNVDGALLESGKAMGMTKWQVLRLIEMPLALSVIMAGIRNALVIAIGVAAIGT
FVGAGGLGDIIVRGTNATNGTAIILAGAIPTAVMAILADVLLGWVERTLNPVKNKRKPLTEAL
>Q9KHT6 ~~~opuCD~~~Carnitine transport permease protein OpuCD~~~COG1174
MDTLKQLIDYYQTNGSYVMEEFWRHFLMSAYGVIFAAIIAIPLGVYIARKKRLAGWVIQIANIIQTIPALAMLAVLMLIM
GLGTNTVVLSLFLYSLLPILKNTYTGIRNVDGALLESGKAMGMTKWQVLRLIEMPLALSVIMAGIRNALVIAIGVAAIGT
FVGAGGLGDIIVRGTNATNGTAIILAGAIPTAVMAILADVLLGWVERTLNPVKNKRKPLTEAL
>P54417 ~~~opuD~~~Glycine betaine transporter OpuD~~~COG1292
MLKHISSVFWIVIAITAAAVLWGVISPDSLQNVSQSAQAFITDSFGWYYLLVVSLFVGFCLFLIFSPIGKIKLGKPDEKP
EFGLLSWFAMLFSAGMGIGLVFYGAAEPISHYAISSPSGETETPQAFRDALRYTFFHWGLHAWAIYAIVALCIAYFQFRK
GAPGLISSTLSPILGDKVNGPIGKAIDCIAVFATVVGVSTSLGLGATQINGGLNYLFGIPNAFIVQLVLIIIVTVLFLLS
AWSGLGKGIKYLSNTNMVLAGLLMLFMLVVGPTVLIMNSFTDSIGQYIQNIVQMSFRLTPNDPEKREWINSWTIFYWAWW
ISWSPFVGIFIARVSRGRTIREFLIGVLVTPCILTFLWFSIFGVSAMDLQQKGAFNVAKLSTETMLFGTLDHYPLTMVTS
ILALILIAVFFITSADSATFVLGMQTSYGSLNPANSVKLSWGIIQSAMAAVLLYSGGLAALQNTAILAALPFSIVILLMI
ASLYQSLSKERREIKKAEKLDKPRSPRVKKAY
>O06493 ~~~opuE~~~Osmoregulated proline transporter OpuE~~~COG0591
MSIEIIISLGIYFIAMLLIGWYAFKKTTDINDYMLGGRGLGPFVTALSAGAADMSGWMLMGVPGAMFATGLSTLWLALGL
TIGAYSNYLLLAPRLRAYTEAADDAITIPDFFDKRFQHSSSLLKIVSALIIMIFFTLYTSSGMVSGGRLFESAFGADYKL
GLFLTTAVVVLYTLFGGFLAVSLTDFVQGAIMFAALVLVPIVAFTHVGGVAPTFHEIDAVNPHLLDIFKGASVISIISYL
AWGLGYYGQPHIIVRFMAIKDIKDLKPARRIGMSWMIITVLGSVLTGLIGVAYAHKFGVAVKDPEMIFIIFSKILFHPLI
TGFLLSAILAAIMSSISSQLLVTASAVTEDLYRSFFRRKASDKELVMIGRLSVLVIAVIAVLLSLNPNSTILDLVGYAWA
GFGSAFGPAILLSLYWKRMNEWGALAAMIVGAATVLIWITTGLAKSTGVYEIIPGFILSMIAGIIVSMITKRPAKASYRL
FGVMEKLLKRKK
>E3PY99 1.4.1.12~~~ord~~~2,4-diaminopentanoate dehydrogenase~~~COG3804
MEKVKVIIMGLGAMGGGMADMLLKKQGVEIVGVVGRGKMLGTSMYDHISTPRGDREDVIVGAMEDVITEKAADVVLLCTD
SFTRKAFDKIKFIVEKKINVISSAEEMAYPMAQEPELAKEIDRLAKENGVSVLGTGINPGLIMDLLVILMTGCCEEVHSI
LSRRVNSLSPFGPAVMEEQGIGITVEEFNKGVQEGTLAGHVGFHESIGMIADAIGWKLSAPITQSMEPIVTDVDRKSPYG
FAKAGNVAGCAMKGFGYVDGELKIEMDHPQQIEPEQVGVQTGDYVIINGVPNINMVNSPEVPGGIGTIAMCVNMIPQIIN
ARPGLHTMIDLPVPRAIMGDFRDLISEEAKIVK
>C1FW05 1.4.1.26~~~ord~~~2,4-diaminopentanoate dehydrogenase~~~
MGKNVKTLIWGFGAMGSGMASILLEKGGYELVSVIDTHPQKAGKDVGELLGRAPLGVKVTMDHLKAFGSHPDVALIATSS
FVEEVSPQIEFALENDANVITIAEEMSYPWIDSKEIAERLDALALNRGKTVLGTGINPGFVLDTLVVSLSGVCKEVRHIH
AKRVNDLAPFGPTVMRTQGVGTTPEKFEEGIRSGNIVGHIGFRQSIMLIAKALGWTIEDIVEERQPIITNVRRKTNYVDV
TPGNVAGCRHTARAYSCGREVIFLEHPQQVCPEAEGVRTGDYVVIDGDPPVNLRIEPEIPGGTGTMAMAVNMIPLVVNAP
PGLLTMIDLPVPRSISNGF
>C1FUH5 ~~~orfX1~~~Protein OrfX1~~~
MNQTFSFNFDDTLSNSSGLINLEKINQNCSPNYQYFKIKFIGGYLHIKNKSGDILEKYDLKDLISLIALKKDYLKLSSPN
NKKPNEFTNIKNKHLENRFNLYVINEDINGKITKNGILEEIILNRLLLSILLGNEENLLQIA
>A0A5P3XKM0 ~~~orfX1~~~Protein OrfX1~~~
MNREFPFHFNDGNVSMNGLFCLKKIKTQYHPNYDYFKIKFCEGFLSIKNKVKDDLCEYDLKNIESVIALKREYSKENNLK
NKESAIFMNIGNKGIHNKYDLYVVNVDINNILDENYMLKGILNDKLKILFLGNERKLLRIKN
>C1FUH4 ~~~orfX2~~~Protein OrfX2~~~
MNNLKPFIYYDWKKTILKNAKESYSINEIIPKTFFMELHGTKITNSTLNGTWKAWNLTDEGEGSHPVLKCIIDDGYLDMN
FGASSEKIPLKNVWIKLCMKINPNSDGTYSIPEKSSSFYIKDNSLKISKDNLILDKYLNKLMLSYFKNNIKNIEMFINKS
RIQTKVVGDLSLLGWNTENSVSFRTMNEFIKKDNLYPKDFKAVYSYRKMTFTATGTFDSWEMTTGADGRNIRFKCPIKSA
AYDLDGDVFNSSTENFLLIQVDLTYFDSKTTINDPTGENDGKQFNLKVKTNDDKLKNVLIVTYNLTDTDGSMSSEDKDFL
SLAFRNWFNDNIQQFEQIFAYILLDETAKIPEYQWLKPTQISYGSASVETANDEPDLDASIFSAMSMVENNTNSTPSHAV
DNRMLQLTKTQAAFGISFPLFIEHFLKQALLSSQFISVDDIVADINTLTITNNKQIIFGKVENSDGKNVDSSLKPGKLKL
SLQNNLIVLELFDLTWEQGRGVTGHFDFRQEYELTLESKSEKQIPILKVHDEPEIEYYVEEAQWKANEDMIVSAVVGTVF
SMILGAGMKLAGSALSKAGKLIRSKATTIKGRKKIYINRSNVRQLRKDSGVTEMELQRINRRNSSIASEDARFISNNGTT
SIQTLGDMKKKPMSTGQRIAIGVKKITGTAVMFGAVGLGMNFGEMLINYINAMENNDYSAIPGINSFMQQCIGAMQWPDK
DSELKVTFGKLQGIYLLGGTLEKNNKPNSK
>A0A5P3XKJ3 ~~~orfX2~~~Protein OrfX2~~~
MSKKPLDFLRIYDWHKTEAMNKISKLDFERIIPKHFSKEIKNKHLSVKITGNWKIWKLTDEGEGQYPIFKCIVEDGFLKI
KNECGNKKYSLDNAWIKICTKIKYDNENGKDIYSIDEKNLTLYSVNNSFNSKYKNNIVDAFLDNLLIACIEDNIKDLNKF
FKLYKVKTAIKEDLSLLGWDTGYSTSFTHVNKTIENQQNYPKQFKYESEGPYNIDISGEFDSWRLTTGSDGQNVNFICPI
KNGEFNFLGTEYKFSQGEQVNIQLKLKYLNIEEPTFEDSTSLNDGNQVDLIVKTDEDENENPPVTIIKVVLLGEIDAIGK
MLLEGTFREWFNENIDAFKQIFSSFLLEDTSKNPDFQWLKPTKAYYGVASAEPIDGKPDLDSSVFSVMSMVEDNKNDKPS
HTVDGRILDAVNNESAFGIRTPLFVKKWLIAGLEMMQIGKLEDFDLINNGMGFINNKKLLFGTFENADGEDVPAYVEKDN
FRLEITNNQLKIEITDIYWQQSRRLTGHVMYSQYFDLELRSGTDITGAEYKNILIPVENSEPTLVVNISQDEFDIWGDIV
GEIVGGIVVGIVTGYLGSILGKGVGKYLEKFLTKTSGGRWVLKMNKEMYDYLNNLFKGDRRVFNEVAIDEIELISTLGTS
QAISTIANTPTNFASKIWVNKSKFIGGLIGGSVGSVIPSVIIKSIDAWDKQNYSVLPSINAFVASSVGSVKWPDTSEFKI
ESAELNGIFLLGGKLERYEK
>C1FUH3 ~~~orfX3~~~Protein OrfX3~~~
MQTTTLNWDTVYAVPIDIVNEAIKFKHPTPEQFELLDGKYGDCKGSFQEWQIISGGDGGNIRLKIPIKNFKANVIGKYLS
GTGGFESANLEIQVKLKYLPHFPKSKNKNEELVDLKIRTKSNNAEDPAIIIIPTSEDVKGFYFNEDVRSLLMTEDDQFIM
NYFHRLIKEWLERNLYFFNYVFNTVNLNLYISNNEKWKWTKPSYVDYAYSEIDEDLSKSILGVLCMTDGRKGSKNQQQKI
DPNAIPKTSQSGFLISEDRLLKNILLPTIPKKFPKSKGDEFEVVNQSAQGGTYSYILKLKEGKKINLDNINACGYTCTPY
IQEMKVSLLGNYLRLESTTRVDLPLGVSSICETMCEYRFKLDKNDKGEQTIAYEQIGSPTNKQYTEKTQDVSFEIIKGLL
IATLGFVLELVPGIGSFLAVALIGGTLVGSISLIPNFIENYNVNTAPSIDLSLENSVSEITWNSSDIFNLNYVALAGPLQ
LGGTLQVQNT
>A0A5P3XKL3 ~~~orfX3~~~Protein OrfX3~~~
MIGKRQTSTLNWDTVFAVPISVVNKAIKDKKSSPENFEFEDSSGSKCKGDFGDWQIITGGDGSNIRMKIPIYNFKAELVD
DKYGIFNGNGGFESGEMNIQVKLKYFPHDKISKYKDVELVDLKVRSESADPIDPVVVMLSLKNLNGFYFNFLNEFGEDLQ
DIIEMFFIELVKQWLTENISLFNHIFSVVNLNLYIDQYSQWSWSRPSYVSYAYTDIEGDLDKSLLGVLCMTGGRNPDLRQ
QKVDPHAVPESSQCGFLIYEERVLRDLLLPTLPMKFKNSTVEDYEVINASGESGQYQYILRLKKGRSVSLDRVEANGSKY
DPYMTEMSISLSNDVLKLEATTETSVGMGGKVGCDTINWYKLVLAKNGNGEQTISYEEVGEPTVINYVIKEGENWVWDVI
AAIIAILATAVLAIFTGGAAFFIGGIVIAIITGFIAKTPDIILNWNLETSPSIDMMLENSTSQIIWNARDIFELDYVALN
GPLQLGGELTV
>E1WAB5 ~~~orgA~~~Oxygen-regulated invasion protein OrgA~~~
MIRRNRQMNRQPLPIIWQRIIFDPLSYIHPQRLQIAPEMIVRPAARAAANELILAAWRLKNGEKECIQNSLTQLWLRQWR
RLPQVAYLLGCHKLRADLARQGALLGLPDWAQAFLAMHQGTSLSVCNKAPNHRFLLSVGYAQLNALNEFLPESLAQRFPL
LFPPFIEEALKQDAVEMSILLLALQYAQKYPNTVPAFAC
>P0CL44 ~~~orgA~~~Oxygen-regulated invasion protein OrgA~~~
MIRRNRQMNRQPLPIIWQRIIFDPLSYIHPQRLQIAPEMIVRPAARAAANELILAAWRLKNGEKECIQNSLTQLWLRQWR
RLPQVAYLLGCHKLRADLARQGALLGLPDWAQAFLAMHQGTSLSVCNKAPNHRFLLSVGYAQLNALNEFLPESLAQRFPL
LFPPFIEEALKQDAVEMSILLLALQYAQKYPNTVPAFAC
>Q47VZ4 3.1.15.-~~~orn~~~Oligoribonuclease~~~COG1949
MAGNDSNLIWLDLEMTGLEPVEDVILEIAIIITDSELNILAQGPIFAISQTDDVLDNMNPWCIEHHGKSGLTQRCRDSEV
SLAHATKESLAFVQEWVPQGKSPMCGNSIGQDRRFINKYMPDFEDHFHYRNLDVSTIKELAKRWKPEVLESVVKTGAHLA
LDDIKESIAELKVYRELFFKL
>Q83C93 3.1.15.-~~~orn~~~Oligoribonuclease~~~COG1949
MDFSDDNLIWLDLEMTGLDPERDRIIEIATIVTNSHLDILAEGPAFAIHQPDKLLTAMDNWNTSHHTASGLLERVKNSSV
DEVEAETLTLAFLEKYVSAGKSPLCGNSVCQDRRFLSRYMPRLNQFFHYRHLDVTTLKILAQRWAPQIAAAHIKESQHLA
LQDIRDSIEELRYYRAHLLNLSK
>P0A784 3.1.15.-~~~orn~~~Oligoribonuclease~~~COG1949
MSANENNLIWIDLEMTGLDPERDRIIEIATLVTDANLNILAEGPTIAVHQSDEQLALMDDWNVRTHTASGLVERVKASTM
GDREAELATLEFLKQWVPAGKSPICGNSIGQDRRFLFKYMPELEAYFHYRYLDVSTLKELARRWKPEILDGFTKQGTHQA
MDDIRESVAELAYYREHFIKL
>P45340 3.1.15.-~~~orn~~~Oligoribonuclease~~~COG1949
MSFDKQNLIWIDLEMTGLDPEKERIIEIATIVTDKNLNILAEGPVLAVHQSDELLNKMNDWCQKTHSENGLIERIKASKL
TERAAELQTLDFLKKWVPKGASPICGNSIAQDKRFLVKYMPDLADYFHYRHLDVSTLKELAARWKPEILEGFKKENTHLA
LDDIRESIKELAYYREHFMKLD
>P9WIU1 3.1.15.-~~~orn~~~Oligoribonuclease~~~COG1949
MQDELVWIDCEMTGLDLGSDKLIEIAALVTDADLNILGDGVDVVMHADDAALSGMIDVVAEMHSRSGLIDEVKASTVDLA
TAEAMVLDYINEHVKQPKTAPLAGNSIATDRAFIARDMPTLDSFLHYRMIDVSSIKELCRRWYPRIYFGQPPKGLTHRAL
ADIHESIRELRFYRRTAFVPQPGPSTSEIAAVVAELSDGAGAQEETDSAEAPQSG
>P57665 3.1.15.-~~~orn~~~Oligoribonuclease~~~
MQNPQNLIWIDLEMTGLDPDRDVIIEMATIVTDSDLNTLAEGPVIAIHQPEEILAGMDEWNTRQHGQSGLTQRVRESTVS
MAEAEAQTLAFLEQWVPKRSSPICGNSICQDRRFLYRHMPRLEGYFHYRNLDVSTLKELAARWAPQVRESFKKGNTHLAL
DDIRESIAELRHYRDHFIKL
>P57666 3.1.15.-~~~orn~~~Oligoribonuclease~~~COG1949
MNDRMVWIDCEMTGLSLSDDALIEVAALVTDSELNILGEGVDIVIRPPERALETMPEVVREMHTASGLLAELDGGTTLAD
AEAQVLAYVREHVKEPGKAPLCGNSVGTDRGFLLRDMATLEGYLHYRIVDVSSIKELARRWYPRAYFNSPEKNGNHRALA
DIRESIAELRYYREAVFVPQPGPDSDTARAIAAKHVVSAG
>P57667 3.1.15.-~~~orn~~~Oligoribonuclease~~~
MNDRMVWIDCEMTGLSLADDALIEVAALVTDSELNVLGEGVDIVIRPPDAALETMPEVVRQMHTASGLLDELAGGTTLAD
AEEQVLAYVREHVKEPGKAPLCGNSVGTDRGFLARDMRELEGYLHYRIVDVSSVKELARRWYPRAYFNSPAKNGNHRALA
DIRDSITELRYYREAVFVPQPGPDSERAKEIAARLSAPAAP
>A5F3M7 3.1.15.-~~~orn~~~Oligoribonuclease~~~COG1949
MSFSDQNLIWIDLEMTGLDPEMHKIIEMATIVTDSELNILAEGPVIAIHQPESELAKMDEWCTTTHTASGLVARVRQSQV
SEEEAIDQTLAFLKQWVPEGKSPICGNSIGQDRRFLYKHMPRLEAYFHYRYIDVSTIKELTRRWQPEVLKEFSKTGSHLA
LDDIRESIAELQFYRKAVFKI
>Q9KV17 3.1.15.-~~~orn~~~Oligoribonuclease~~~COG1949
MSFSDQNLIWIDLEMTGLDPEMHKIIEMATIVTDSELNILAEGPVIAIHQPESELAKMDEWCTTTHTASGLVARVRQSQV
SEEEAIDQTLAFLKQWVPEGKSPICGNSIGQDRRFLYKHMPRLEAYFHYRYIDVSTIKELTRRWQPEVLKEFSKTGSHLA
LDDIRESIAELQFYRKAVFKI
>Q8P8S1 3.1.15.-~~~orn~~~Oligoribonuclease~~~COG1949
MADNVAGNDRLIWIDLEMTGLDTDRDSIIEIATIVTDAQLNVLAEGPELAIAHSLETLEAMDEWNRNQHRRSGLWQRVLD
SQVTHAQAEAQTVAFLSEWIRAGASPMCGNSICQDRRFLHRQMSRLERYFHYRNLDVSTIKELARRWAPAVASGFAKSSA
HTALSDVRDSIDELRHYRQFMGTLGGDNGGGVQN
>C7Q942 1.14.11.-~~~~~~L-ornithine/L-arginine 3-hydroxylase~~~COG2175
MHRLALTAQDNLAVAPMLADLAGRYPDIEDPELIRSAPVLAAKGLPPHLLAFLDDFRLREPSALCVISGLDVDQDRLGPT
PEHWRDSQIGSRSLNLEIFFLLCGAALGDVFGWATQQDGRIMHDVLPIKGHEHYELGSNSLQHLSWHTEDSFHPCRGDYV
ALMCLKNPYEAETMVCDAGDLDWPNLDVDALFEPVFTQMPDNSHLPQNTAESTGDPTKDRLRARSFELIKSWNENPVRRA
VLYGDRQNPYMALDPYHMKMDDWSERSLEAFQALCEEIEAKMQDVVLHPGDIAFIDNFRAVHGRRSFRARYDGSDRWLKR
LNITRNLRGSRAWRPAPDDRVIY
>C1FW08 5.1.1.12~~~orr~~~Ornithine racemase~~~COG3457
MYPKITIDINKLRDNATFIKNLCEKGGCKTALVVKSMCANHDIVKELDSVEVDYFADSRIQNLKKLKDLKTKKMLLRIPM
LCEVEDVVKYADISMNSELDTLKALNKAAKTLNKVHSVIIMVDLGDLREGYFEAEDLKENIKEIIKLENIEIKGIGVNLT
CYGAVIPKNDNLSRLCDIADELRTEFNLELPIVSGGNSSSIYLIDKGELPEGITNLRVGESMLLGRETAYGEDIIGMNND
VFELKCQIVELKEKPSLPIGEIGVDAFGNKPYYEDKGIRKRAILAIGQQDTDISSLMPIDDKLEILGASSDHLIVDVSDS
NTSYKVGDIITFRMGYGALLKGFTSEYIEKELL
>E3PY98 2.3.1.263~~~ortA~~~2-amino-4-ketopentanoate thiolase alpha subunit~~~
MAKKGDWVLIHKIVLSPEERAPQVPDDTKKVPLEMWIKGYLNEDAQIGDQVSITTRTKRVEEGKLLEVNPYYTHDFGKFV
PELLKISEQVREITFGGEGNE
>C1FW06 2.3.1.263~~~ortA~~~2-amino-4-ketopentanoate thiolase alpha subunit~~~
MSEKCKKDDWVEIHYVVLEARERTGDIPEDTRKVPLECWIKGWAEKEGTVGEEVTIRTPANRYVKGTLTRINPEYTHTFG
PCASELSAIGQELRATLKEGK
>E3PY97 2.3.1.263~~~ortB~~~2-amino-4-ketopentanoate thiolase beta subunit~~~COG0031
MSKDMSYSGVMSRRNEIMKNAIGIDYTTFESGSLSFDYEKMMRETGYTLEEMQKIQYSTGVGRTPVLELRNITALARKYA
APGKGARIFIKDEAGNPSGSFKARRAANAVYHAKKLGYKGVIAATSGNYGAAVASQAAMAGLKCIIVQECYDSKGVGQPE
IIEKARKCEALGAEVVQLSVGPELFYKFLSLLEETGYFNASLYTPFGIAGVETLGYELAVEFREAYGKDPDVVVCSNAGG
GNLTGTARGLIKAGAQSEVVAASVNLQGLHMASDTQFNKKSFTTSHTGFGMPFATWPDRSDVPRSAARPLRYMDRYVTVN
QGEVFYITETLASLEGLEKGPAGNTALAAAFSLAQEMDKDQIIVVQETEYTGAGKHIQPQLAFARDNGIDIKFGNPKEEV
AGINIILPENPGMIKAVDHEMNKLRKSLIKNALANYPDAKLDDSDIDFLVKETKSDTEFVKATIKEIKG
>C1FW07 2.3.1.263~~~ortB~~~2-amino-4-ketopentanoate thiolase beta subunit~~~
MAKDTSYDAVMDRRAEIMSRALGLNYDEFIISDIAFDYEGMMAKAGYSLEEVRQIQSESGVGNTPLLELRNITDLARKTS
KRGFGARILIKDEAANPSGSFKDRRASVSIHHAKKLGYPGVLAATSGNYGAAVASQAAMKNLGCIIVQEVFDSRHVGQPE
IIEKSRKCEAYGAEVVQLTVGPELFYVSLILLEETGYFNASLYSPFGISGVETLGYELVEQIRARYDKDPAYIVVTHAGG
GNVTGTARGALKAGAKNSTIIGASVDLSGLHMASDNDFNKKSFTTGHTGFGVPFATWPDRTDVPKNAARPLRYLDRYVTV
TQGEVFYVTEALAQVEGMERGPAGNTSLAAAIALARELPEDEIVVVQETEYTGAGKHPWAQLDFAKQNGIAVKRGAPKEN
KPGKTIVIPQNFSQITATEMDLNRMRRSYIRNALERNKVKNVTEEDIRFLAEDTKTDADFVTSVIRDLGVRL
>P64453 ~~~ortT~~~Orphan toxin OrtT~~~
MSLYQHMLVFYAVMAAIAFLITWFLSHDKKRIRFLSAFLVGATWPMSFPVALLFSLF
>P29772 ~~~osa~~~Protein osa~~~
MLMLLRRRCRAWLEIRRLDKELAQSSGLPLELPQIVPNAWNEVVWRLPVPNHPDAFMTASNAAQSDFIVYVNGLAFYRAW
LALGVEDSQACPLKQDMPKDRKYPSSAAHFAVGIDSPVPLADVSPTMILGHFAVCFTDGMTRSMWLLAHEVAVFPVLSRD
EASAVMLAEHVGVAAPIQVSKLREQCRKI
>P0ADA7 ~~~osmB~~~Osmotically-inducible lipoprotein B~~~
MFVTSKKMTAAVLAITLAMSLSACSNWSKRDRNTAIGAGAGALGGAVLTDGSTLGTLGGAAVGGVIGHQVGK
>P37723 ~~~osmB~~~Osmotically-inducible lipoprotein B~~~
MFMTSKKMAAAVLAITVAMSLSACSNWSKRDRNTAIGAGAGALGGAVLTDGSTLGTLGGAAVGGVIGHQVGK
>P0C0L2 1.11.1.-~~~osmC~~~Peroxiredoxin OsmC~~~COG1764
MTIHKKGQAHWEGDIKRGKGTVSTESGVLNQQPYGFNTRFEGEKGTNPEELIGAAHAACFSMALSLMLGEAGFTPTSIDT
TADVSLDKVDAGFAITKIALKSEVAVPGIDASTFDGIIQKAKAGCPVSQVLKAEITLDYQLKS
>P0ADB1 ~~~osmE~~~Osmotically-inducible putative lipoprotein OsmE~~~COG2913
MNKNMAGILSAAAVLTMLAGCTAYDRTKDQFVQPVVKDVKKGMSRAQVAQIAGKPSSEVSMIHARGTCQTYILGQRDGKA
ETYFVALDDTGHVINSGYQTCAEYDTDPQAAK
>Q8ZPK4 7.6.2.-~~~osmV~~~Osmoprotectant import ATP-binding protein OsmV~~~
MIKLENLTKQFVQKKGQPLKAVDNVNLNVPEGEMCVLLGPSGCGKTTTLKMINRLIAPSSGNILINGENTNDMDAVTLRR
NIGYVIQQIGLFPNMTIEENITVVPRMLGWDKARCKQRAEELMDMVALDARKFLHRYPKEMSGGQQQRIGVIRALAADPP
VLLMDEPFGAVDPINREVIQNQFLDMQRKLKKTVMLVSHDIDEALKLGDRIAVFRQGRIVQCASPDELLAKPANEFVGSF
VGQDRTLKRLLLVSAGDVTDQQPTITARPSTPLSEAFGIMDDHDIRAITVIDNDGKPLGFVKRREARNASGICADITHPF
RITGKAEDNLRIVLSRLYESNTSWMPIVDEDGRYNGEISQDYIADYLSSGRTRRALNIHENS
>Q8ZPK3 ~~~osmW~~~Osmoprotectant import permease protein OsmW~~~
MDTIHYMLDNAGYLASLTFQHLWLVALAVGLAIIIGVPLGVLIVRHKWLATPVLGAATLLLTIPSIALFGLMIPLFSLIG
HGIGVLPAVTAVFLYSLLPIVRNTHTALDSLPPGLREAGRGIGMTFWQRLRWVEIPMALPVIFGGIRTAVVMNIGVMAIA
AVIGAGGLGLLLLNGISGSDIRMLIAGALMICLLAIVLDWLLHRLQVVLTPKGIR
>Q8ZPK2 ~~~osmX~~~Osmoprotectant-binding protein OsmX~~~
MRFKKHLLGWLAATLLFSSQTQAAPLVLATKSFTEQHILSAMTVQYLQKKGFQVQPQTNIAAVISRNAMVNKQIDITWEY
TGTSLIIFNRIDKRMSPQETYDTVKRLDAKLGLVWLKPADMNNTYAFAMQRKRAESENITTISQMVAKIEQVRQNDPDHN
WMLGLDLEFAGRSDGMKPLQQAYQMQLDRPQIRQMDPGLVYNAVRDGLVDAGLVYTTDGRVKGFDLKVLEDDKGFFPSYA
VTPVVRKEVLEANPGLDDALNTLSGLLNNDVISTLNAQVDIEHRTPQQVAHQFLQDKGLL
>P0AFH8 ~~~osmY~~~Osmotically-inducible protein Y~~~COG2823
MTMTRLKISKTLLAVMLTSAVATGSAYAENNAQTTNESAGQKVDSSMNKVGNFMDDSAITAKVKAALVDHDNIKSTDISV
KTDQKVVTLSGFVESQAQAEEAVKVAKGVEGVTSVSDKLHVRDAKEGSVKGYAGDTATTSEIKAKLLADDIVPSRHVKVE
TTDGVVQLSGTVDSQAQSDRAESIAKAVDGVKSVKNDLKTK
>Q8ZPK1 ~~~osmY~~~Osmoprotectant import permease protein OsmY~~~
MHTLTLKRVLGFTIVILLLLALFIWGIGLETLKARQVDLLYLGQRHLMLVFTSMFFALLVGIPSGILLSRPAAKGFAEYV
MQIFNVGNTLPPLAVLALAMVIIGIGDTPAIVALFLASLLPIVRNTYAGLCSVPASLIEAANGIGMTKWQRLRQVELPNA
WPVMLSGIRIATAINVGTAPLAFLIGASSYGELIFPGIYLNDFPTLILGATATALFALILDTLLAWFGRRLSPHTV
>Q09086 ~~~ospA~~~Outer surface protein A~~~
MKKYLLGIGLILALIACKQNVSSLDEKNSVSVDLPGGMTVLVSKEKDKDGKYSLDATVDKLELKGTSDKNNGSGTLEGEK
TDKSKVKLTIADDLSQTKFEIFKEDGKTLVSKKVTLKDKSSTEEKFNEKGETSEKTIVRANGTRLEYTDIKSDGSGKAKE
VLKDFTLEGTLAADGKTTLKVTEGTVVLSKNILKSGEITVALDDSDTTQATKKTGNWDSKSSTLTISVNSQKTKNLVFTK
EDTITVQKYDSAGTNLEGKAVEITTLKELKAALK
>P0CL66 ~~~ospA~~~Outer surface protein A~~~
MKKYLLGIGLILALIACKQNVSSLDEKNSVSVDLPGEMKVLVSKEKNKDGKYDLIATVDKLELKGTSDKNNGSGVLEGVK
ADKSKVKLTISDDLGQTTLEVFKEDGKTLVSKKVTSKDKSSTEEKFNEKGEVSEKIITRADGTRLEYTGIKSDGSGKAKE
VLKGYVLEGTLTAEKTTLVVKEGTVTLSKNISKSGEVSVELNDTDSSAATKKTAAWNSGTSTLTITVNSKKTKDLVFTKE
NTITVQQYDSNGTKLEGSAVEITKLDEIKNALK
>B7IZU3 ~~~ospA~~~Outer surface protein A~~~
MKKYLLGIGLILALIACKQNVSSLDEKNSVSVDLPGEMNVLVSKEKNKDGKYDLIATVDKLELKGTSDKNNGSGVLEGVK
ADKSKVKLTISDDLGQTTLEVFKEDGKTLVSKKVTSKDKSSTEEKFNEKGEVSEKIITRADGTRLEYTEIKSDGSGKAKE
VLKSYVLEGTLTAEKTTLVVKEGTVTLSKNISKSGEVSVELNDTDSSAATKKTAAWNSGTSTLTITVNSKKTKDLVFTKE
NTITVQQYDSNGTKLEGSAVEITKLDEIKNALK
>P17739 ~~~ospB~~~Outer surface protein B~~~
MRLLIGFALALALIGCAQKGAESIGSQKENDLNLEDSSKKSHQNAKQDLPAVTEDSVSLFNGNKIFVSKEKNSSGKYDLR
ATIDQVELKGTSDKNNGSGTLEGSKPDKSKVKLTVSADLNTVTLEAFDASNQKISSKVTKKQGSITEETLKANKLDSKKL
TRSNGTTLEYSQITDADNATKAVETLKNSIKLEGSLVGGKTTVEIKEGTVTLKREIEKDGKVKVFLNDTAGSNKKTGKWE
DSTSTLTISADSKKTKDLVFLTDGTITVQQYNTAGTSLEGSASEIKNLSELKNALK
>P0DW54 ~~~ospC~~~Outer surface protein C~~~
MKKNTLSAILMTLFLFISCNNSGKGGDSASTNPADESAKGPNLTEISKKITDSNAFVLAVKEVETLVLSIDELAKKAIGQ
KIDNNNGLAALNNQNGSLLAGAYAISTLITEKLSKLKNLEELKTEIAKAKKCSEEFTNKLKSGHADLGKQDATDDHAKAA
ILKTHATTDKGAKEFKDLFESVEGLLKAAQVALTNSVKELTSPVVAESPKKP
>Q8VSJ7 4.3.99.-~~~ospC1~~~Arginine ADP-riboxanase OspC1~~~
MNISETLNSANTQCNIDSMDNRLHTLFPKVTSVRNAAQQTMPDEKNLKDSANIIKSFFRKTIAAQSYSRMFSQGSNFKSL
NIAIDAPSDAKASFKAIEHLDRLSKHYISEIREKLHPLSAEELNLLSLIINSDLIFRHQSNSDLSDKILNIKSFNKIQSE
GICTKRNTYADDIKKIANHDFVFFGVEISNHQKKHPLNTKHHTVDFGANAYIIDHDSPYGYMTLTDHFDNAIPPVFYHEH
QSFLDKFSEVNKEVSRYVHGSKGIIDVPIFNTKDMKLGLGLYLIDFIRKSEDQSFKEFCYGKNLAPVDLDRIINFVFQPE
YHIPRMVSTENFKKVKIREISLEEAVTASNYEEINKQVTNKKIALQALFLSITNQKEDVALYILSNFEITRQDVISIKHE
LYDIEYLLSAHNSSCKVLEYFINKGLVDVNTKFKKTNSGDCMLDNAIKYENAEMIKLLLKYGATSDNKYI
>Q44977 ~~~ospC~~~Outer surface protein C~~~
MKKNTLSAILMTLFLFISCNNSGKDGNTSANSADESVKGPNLTEISKKITDSNAVLLAVKEVEALLSSIDELAKAIGKKI
KNDGSLGDEANHNESLLAGAYTISTLITQKLSKLNGSEGLKEKIAAAKKCSEEFSTKLKDNHAQLGIQGVTDENAKKAIL
KANAAGKDKGVEELEKLSGSLESLSKAAKEMLANSVKELTSPVVAESPKKP
>Q8VSL8 4.3.99.-~~~ospC2~~~Arginine ADP-riboxanase OspC2~~~
MKIPEAVNHINVQNNIDLVDGKINPNKDTKALQKNISCVTNSSSSGISEKHLDHCADTVKSFLRKSIAAQSYSKMFSQGT
SFKSLNLSIEAPSGARSSFRSLEHLDKVSRHYLSEIIQKTHPLSSDERHLLSIIINSDFNFRHQSNANLSNNTLNIKSFD
KIKSENIQTYKNTFSEDIEEIANHDFVFFGVEISNHQETLPLNKTHHTVDFGANAYIIDHDSPYGYMTLTDHFDNAIPPV
FYHEHQSFFLDNFKEVVDEVSRYVHGNQGKTDVPIFNTKDMRLGIGLHLIDFIRKSKDQRFREFCYNKNIDPVSLDRIIN
FVFQLEYHIPRMLSTDNFKKIRLRDISLEDAIKASNYEEINNKVTDKKMAHQALAYSLGNAKSDMALYLLSKFNFTKQDI
AEMEKMNNNMYCELYDVEYLLSEDSANYKVLEYFISNGLVDVNKRFQKANSGDTMLDNAMKSKDSKTIDFLLKNGAVSGK
RFGR
>A0A2H5DV25 4.3.99.-~~~ospC3~~~Arginine ADP-riboxanase OspC3~~~
MRVETHSPSFTNPNPAEACSGDPTEMGSRLSGVSRAPLPHAAAGRDGEAAAAGKIGAFLRKAVAAQSYGLMFANGKLFEA
TGDALEKREQYGFSALKRLDGLSRRNLDAVAARLGALDSAEQALKQHIMAGAWHFRHQSNAALDDGEKATIASNHLLSRQ
ARSSGGNTFADDKALLSNHDFVFFGVEFSGRDKNDKPLNHKHSTMDFGANAYVVSDALPACRNGYLTLTDHFFNRVPGGR
EAEHQDFVGRFAQMGRESGRWIHEGKYRQNAPLFCYRDMKAAVALHLIEFMRNSQDEAFKAYVFDQATQSGPALDRVLNS
VFQAEFHIPRLMATTDYAKHPLRPMLLKEAVDSVNLPALSDLVSNRGDAVTAMWHAINKGKDEVVAYLLGNWQFEAKDFS
HAPAGFYHELNYALSESGASVYILDQFLSRGWAAVNAPFEHVNRGDTMLDNAVKYGNREMVAALIKHGADRNLLSKWHKS
DLDALLA
>P0DV36 4.3.99.-~~~ospC3~~~Arginine ADP-riboxanase OspC3~~~
MKIPEAVNHINVQNNIDLVDGKTNPNKATKALQKNILRVTNSSSSGISEKHLDHCANTVKNFLRKSIAAQSYSKMFSQGT
SFKSLNLSIEAPSGARSSFRSLEHLDKVSRHYISEIIQKVHPLSSDERLLLSIIINSNFNFRHQSNSNLSNNILNIKSFD
KMQSENIQTHKNTYSEDIKEISNHDFVFFGVEISNHQEKLPLNKTHHTVDFGANAYIIDHDSPYGYMTLTDHFDNAIPPV
FYHEHQSFFLDNFKEVVDEVSRYVHGNQGKTDVPIFNTKDMRLGIGLHLIDFIRKSKDQGFREFCYNKNIDPVSLDRIIN
FVFQLEYHIPRMLSTDNFKKIKLRDISLEDAIKASNYEEINNKVTDKKMAHQALAYSLGNKKADIALYLLSKFNFTKQDV
AEMEKMNNNRYCNLYDVEYLLSKDGANYKVLEYFINNGLVDVNKKFQKANSGDTMLDNAMKSKDSKMIDFLLKNGAILGK
RFEI
>A0A0H2US87 4.3.99.-~~~ospC3~~~Arginine ADP-riboxanase OspC3~~~
MKIPEAVNHINVQNNIDLVDGKINPNKDTKALQKNISCVTNSSSSGISEKHLDHCADTVKSFLRKSIAAQSYSKMFSQGT
SFKSLNLSIEAPSGARSSFRSLEHLDKVSRHYLSEIIQKTHPLSSDERHLLSIIINSDFNFRHQSNANLSNNTLNIKSFD
KIKSENIQTYKNTFSEDIEEIANHDFVFFGVEISNHQETLPLNKTHHTVDFGANAYIIDHDSPYGYMTLTDHFDNAIPPV
FYHEHQSFFLDNFKEVVDEVSRYVHGNQGKTDVPIFNTKDMRLGIGLHLIDFIRKSKDQRFREFCYNKNIDPVSLDRIIN
FVFQLEYHIPRMLSTDNFKKIKLRDISLEDAIKASNYEEINNKVTDKKMAHQALAYSLGNKKADIALYLLSKFNFTKQDV
AEMEKMKNNRYCNLYDVEYLLSKDGANYKVLEYFINNGLVDVNKKFQKVNSGDTMLDNAMKSKDSKMIDFLLKNGAILGK
RFEI
>A0A0H2USP8 4.3.99.-~~~ospC4~~~Arginine ADP-riboxanase OspC4~~~
MKNFLRKSIAAQSYSKMFSQGTSFKSLNLSLEAPSGARSSFRSLEHLDKVSRHYISEIIQKVHPLSSDERHLLSIIINSN
FNFRHQSNSNLSNIILNIKSFDKIQSENIQTHKNTYSEDIKEISNHDFVFFGVEISNHQEKLPLNKTHHTVDFGANAYII
DHDSPYGYMTLTDHFDNAIPPVFYHEHQSFFLDNFKEVVDEVSRYVHGNQGKTDVPIFNTKDMRLGIGLHLIDFIRKSKD
QGFREFCYNKNIDPVSLDRIINFVFQLEYHIPRMLSTDNFKKIKLRDISLEDAIKASNYEEINNKVTDKKMAHQALAYSL
GNKKADIALYLLSKFNFTKQDVAEMEKMKNNRYCNLYDVEYLLSKDGANYKVLEYFINNGLVDVNKKFQKANSGDTMLDN
AMKSKDSKMIDFFIKKWSGIRQTI
>Q0SL36 ~~~ospC~~~Outer surface protein C~~~
MKKNTLSAILMTLFLFISCNNSGKGGDSASTNPADESAKGPNLTEISKKITDSNAFVLAVKEVETLVLSIDELAKKAIGQ
KIDNNNGLAALNNQNGSLLAGAYAISTLITEKLSKLKNLEELKTEIAKAKKCSEEFTNKLKSGHADLGKQDATDDHAKAA
ILKTHATTDKGAKEFKDLFESVEGLLKAAQVALTNSVKELTSPVVAESPKKP
>Q07337 ~~~ospC~~~Outer surface protein C~~~
MKKNTLSAILMTLFLFISCNNSGKDGNTSANSADESVKGPNLTEISKKITDSNAVLLAVKEVEALLSSIDEIAAKAIGKK
IHQNNGLDTENNHNGSLLAGAYAISTLIKQKLDGLKNEGLKEKIDAAKKCSETFTNKLKEKHTDLGKEGVTDADAKEAIL
KTNGTKTKGAEELGKLFESVEVLSKAAKEMLANSVKELTSPVVAESPKKP
>Q99Q01 3.4.-.-~~~ospD3~~~Effector protease OspD3~~~
MPSVNLIPSRKICLQNMINKDNVSVETIQSLLHSKQLPYFSDKRSFLLNLNCQVTDHSGRLIVCRHLASYWIAQFNKSSG
HVDYHHFAFPDEIKNYVSVSEEEKAINVPAIIYFVENGSWGDIIFYIFNEMIFHSEKSRALEISTSNHNMALGLKIKETK
NGGDFVIQLYDPNHTATHLRAEFNKFNLAKIKKLTVDNFLDEKHQKCYGLISDGMSIFVDRHTPTSMSSIIRWPDNLLHP
KVIYHAMRMGLTELIQKVTRVVQLSDLSDNTLELLLAAKNDDGLSGLLLALQNGHSDTILAYGELLETSGLNLDKTVELL
TAEGMGGRISGLSQALQNGHAETIKTYGRLLKKRAINIEYNKLKNLLTAYYYDEVHRQIPGLMFALQNGHADAIRAYGEL
ILSPPLLNSEDIVNLLASRRYDNVPGLLLALNNGQADAILAYGDILNEAKLNLDKKAELLEAKDSNGLSGLFVALHNGCV
ETIIAYGKILHTADLTPHQASKLLAAEGPNGVSGLIIAFQNRNFEAIKTYMGIIKNENITPEEIAEHLDKKNGSDFLEIM
KNIKS
>Q8VSP9 4.2.3.-~~~ospF~~~Phosphothreonine lyase OspF~~~
MPIKKPCLKLNLDSLNVVRSEIPQMLSANERLKNNFNILYNQIRQYPAYYFKVASNVPTYSDICQSFSVMYQGFQIVNHS
GDVFIHACRENPQSKGDFVGDKFHISIAREQVPLAFQILSGLLFSEDSPIDKWKITDMNRVSQQSRVGIGAQFTLYVKSD
QECSQYSALLLHKIRQFIMCLESNLLRSKIAPGEYPASDVRPEDWKYVSYRNELRSDRDGSERQEQMLREEPFYRLMIE
>Q99PZ6 2.7.-.-~~~ospG~~~Protein kinase OspG~~~
MKITSTIIQTPFPFENNNSHAGIVTEPILGKLIGQGSTAEIFEDVNDSSALYKKYDLIGNQYNEILEMAWQESELFNAFY
GDEASVVIQYGGDVYLRMLRVPGTPLSDIDTADIPDNIESLYLQLICKLNELSIIHYDLNTGNMLYDKESESLFPIDFRN
IYAEYYAATKKDKEIIDRRLQMRTNDFYSLLNRKYL
>Q3YTH2 2.7.-.-~~~ospG~~~Protein kinase OspG~~~
MKITSTIIQTPFPFENNNSHAGIVTEPILGKLIGQGSTAEIFEDVNDSSALYKKYDLIGNQYNEILEMAWQESELFNAFY
GDEASVVIQYGGDVYLRMLRVPGTPLSDIDTADIPDNIESLYLQLICKLNELSIIHYDLNTGNMLYDKESESLFPIDFRN
IYAEYYAATKKDKEIIDRRLQMRTNDFYSLLNRKYL
>A0A3T2V133 2.1.1.-~~~ospZ~~~Cysteine S-methyltransferase OspZ~~~
MISPIKNIKNVFPINTANTEYIVRNIYPRVEHGYFNESPNIYDKKYISGITRSMAQLKIEEFINEKSRRLNYMKTMYSPC
PEDFQPISRDEASIPEGSWLTVISGKRPMGQFSVDSLYHPDLHALCELPEISCKIFPKENSDFLYIIVVFRNDSPQGELR
ANRFIELYDIKREIMQVLRDESPELKSIKSEIIIAREMGELFSYASEEIDSYIKQMNDRFSQIKARMSVT
>Q02047 2.1.3.3~~~argF~~~Ornithine carbamoyltransferase 1, anabolic~~~
MNARHFLSMMDYTPDELLGLIRRGVELKDLRIRGELFEPLKNRVLGMIFEKSSTRTRLSFEAGMIQLGGQAIFLSHRDTQ
LGRGEPIADSAKVMSRMLDAVMIRTYAHSNLTEFAANSRVPVINGLSDDLHPCQLLADMQTFLEHRGSIKGKTVAWIGDG
NNMCNSYIEAAIQFDFQLRVACPAGYEPNPEFLALAGERVTIVRDPKAAVAGAHLVSTDVWTSMGQEEETARRMALFAPF
QVTRASLDLAEKDVLFMHCLPAHRGEEISVDLLDDSRSVAWDQAENRLHAQKALLEFLVAPSHQRA
>P04391 2.1.3.3~~~argI~~~Ornithine carbamoyltransferase subunit I~~~COG0078
MSGFYHKHFLKLLDFTPAELNSLLQLAAKLKADKKSGKEEAKLTGKNIALIFEKDSTRTRCSFEVAAYDQGARVTYLGPS
GSQIGHKESIKDTARVLGRMYDGIQYRGYGQEIVETLAEYASVPVWNGLTNEFHPTQLLADLLTMQEHLPGKAFNEMTLV
YAGDARNNMGNSMLEAAALTGLDLRLVAPQACWPEAALVTECRALAQQNGGNITLTEDVAKGVEGADFIYTDVWVSMGEA
KEKWAERIALLREYQVNSKMMQLTGNPEVKFLHCLPAFHDDQTTLGKKMAEEFGLHGGMEVTDEVFESAASIVFDQAENR
MHTIKAVMVATLSK
>P68747 2.1.3.3~~~argK~~~Ornithine carbamoyltransferase 2, anabolic~~~
MKITSLKNRNLLTMNEFNQSELSHLIDRAIECKRLKKDRIFNLGLNHLNICGIFLKPSGRTSTSFVVASYDEGAHFQFFP
ADNIRFGHKESIKDFARVVGRLFDGIAFRGFEHEVAEELAKHSGIPVWNALTDTHHPTQVLADVMTVKEEFGRIEGVTIA
YVGDGRNNMVTSLAIGALKFGYNLRIIAPNALHPTDAVLAGIYEQTPERNGSIEIFTEVAAGVHQADVIYTDVWISMGES
VSVEERIALLKPYKVTEKMMALTGKADTIFMHCLPAFHDLDTEVARETPDLVEVEDSVFEGPQSRVFDQGENRMHTIKAL
MLETVVP
>P06960 2.1.3.3~~~argF~~~Ornithine carbamoyltransferase subunit F~~~COG0078
MSDLYKKHFLKLLDFTPAQFTSLLTLAAQLKADKKNGKEVQKLTGKNIALIFEKDSTRTRCSFEVAAFDQGARVTYLGPS
GSQIGHKESIKDTARVLGRMYDGIQYRGHGQEVVETLAQYAGVPVWNGLTNEFHPTQLLADLMTMQEHLPGKAFNEMTLV
YAGDARNNMGNSMLEAAALTGLDLRLLAPKACWPEESLVAECSALAEKHGGKITLTEDVAAGVKGADFIYTDVWVSMGEA
KEKWAERIALLRGYQVNAQMMALTDNPNVKFLHCLPAFHDDQTTLGKQMAKEFDLHGGMEVTDEVFESAASIVFDQAENR
MHTIKAVMMATLGE
>Q9PNU6 2.1.3.3~~~argF~~~Ornithine carbamoyltransferase, anabolic~~~COG0078
MKHFLTLRDFSKEEILSLVNHASELKKEPKKLLQDKTLAMIFEKNSTRTRMAFELAITELGGKALFLSSNDLQLSRGEPV
KDTARVIGAMVDFVMMRVNKHETLLEFARYSKAPVINALSELYHPTQVLGDLFTIKEWNKMQNGIAKVAFIGDSNNMCNS
WLITAAILGFEISIAMPKNYKISPEIWEFAMKQALISGAKISLGYDKFEALKDKDVVITDTWVSMGEENEKERKIKEFEG
FMIDEKAMSVANKDAILLHCLPAYRGYEVSEEIFEKHADVIFEEARNRLYVVKALLCFLDNQRGRE
>P11724 2.1.3.3~~~argF~~~Ornithine carbamoyltransferase, anabolic~~~
MSVRHFLSFMDYSPEELIGLIRRGSELKDLRNRGVLYEPLKSRVLGMVFEKASTRTRLSFEAGMIQLGGQAIFLSPRDTQ
LGRGEPIGDSARVMSRMLDGVMIRTFAHATLTEFAAHSKVPVINGLSDDLHPCQLLADMQTFHEHRGSIQGKTVAWIGDG
NNMCNSYIEAALKFDFQLRVACPEGYEPKAEFVALAGDRLRVVRDPREAVAGAHLVSTDVWASMGQEDEAAARIALFRPY
QVNAALLDGAADDVLFMHCLPAHRGEEISEELLDDPRSVAWDQAENRLHAQKALLELLIEHAHYA
>Q8G998 2.1.3.3~~~arcB~~~Ornithine carbamoyltransferase, catabolic~~~
MTKDFRQNVFQGRSVLAEKDFSAAELEYLIDFGLHLKALKKAGIPHHYLEGKNIALLFEKSSTRTRSAFTTASIDLGAHP
EYLGQNDIQLGKKESTSDTAKVLGSMFDGIEFRGFKQSDAEILARDSGVPVWNGLTDEWHPTQMLADFMTVKENFGKLQG
LTLTFMGDGRNNVANSLLVTGAILGVNIHIVAPKALFPTEETQNIAKGFAEKSGAKLVITDDLDEGLKGSNVVYTDVWVS
MGESNWEERVKELTPYQVNMEAMKKTGTPDDQLIFMHCLPAFHNTDTQYGKEIKEKYGITEMEVTDEVFTSKYARQFEEA
ENRMHSIKAMMAATLGNLFIPRV
>Q8EVF5 2.1.3.3~~~arcB~~~Ornithine carbamoyltransferase, catabolic~~~COG0078
MPVNLKGRSLDSLLNFTTEEVQHLIDLSIDLKKAKYQGLHINNRPLVGKNIAILFQKDSTRTRCAFEVAASDLGAGVTYI
GPSGSNMGKKESIEDTAKVLGRFYDGIEFRGFAQSDVDALVKYSGVPVWNGLTDDEHPTQIIADFMTMKEKFGNLKNKKI
VFIGDYKNNVGVSTMIGAAFNGMHVVMCGPDNYKNEIDKNVLAKCIELFKRNGGSLRFSTDKILAAQDADVIYTDVWVSL
GEPFELFDKRIGELKNFQVDMNMIKAAKNDVIFLHCLPAFHDDHTSFSKEVATTLGAKYPIVAKGEMEVTDEVFQSLHNK
AFDQAENRMHSIKAIILSTIGY
>P08308 2.1.3.3~~~arcB~~~Ornithine carbamoyltransferase, catabolic~~~
MAFNMHNRNLLSLMHHSTRELRYLLDLSRDLKRAKYTGTEQQHLKRKNIALIFEKTSTRTRCAFEVAAYDQGANVTYIDP
NSSQIGHKESMKDTARVLGRMYDAIEYRGFKQEIVEELAKFAGVPVFNGLTDEYHPTQMLADVLTMREHSDKPLHDISYA
YLGDARNNMGNSLLLIGAKLGMDVRIAAPKALWPHDEFVAQCKKFAEESGAKLTLTEDPKEAVKGVDFVHTDVWVSMGEP
VEAWGERIKELLPYQVNMEIMKATGNPRAKFMHCLPAFHNSETKVGKQIAEQYPNLANGIEVTEDVFESPYNIAFEQAEN
RMHTIKAILVSTLADI
>P65602 2.1.3.3~~~arcB~~~Ornithine carbamoyltransferase, catabolic~~~
MTEIQKPYDLKGRSLLKESDFTKAEFEGLIDFAITLKEYKKNGIKHHYLSGKNIALLFEKNSTRTRAAFTVASIDLGAHP
EFLGKNDIQLGKKESVEDTAKVLGRMFDGIEFRGFSQQAVEDLAKFSGVPVWNGLTDDWHPTQMLADFMTIKENFGYLEG
INLTYVGDGRNNIAHSLMVAGAMLGVNVRICTPKSLNPKEAYVDIAKEKASQYGGSIMITDNIAEAVENTDAIYTDVWVS
MGEESEFEQRINLLKDYQVNQQMFDLTGKDSTIFLHCLPAFHDTNTLYGQEIYEKYGLAEMEVTDQIFRSEHSKVFDQAE
NRMHTIKAVMAATLGS
>Q5XAY4 2.1.3.3~~~arcB~~~Ornithine carbamoyltransferase, catabolic~~~
MTQVFQGRSFLAEKDFTRAELEYLIDFSAHLKDLKKRGVPHHYLEGKNIALLFEKTSTRTRAAFTTAAIDLGAHPEYLGA
NDIQLGKKESTEDTAKVLGRMFDGIEFRGFSQRMVEELAEFSGVPVWNGLTDEWHPTQMLADYLTVKENFGKLEGLTLVY
CGDGRNNVANSLLVTGAILGVNVHIFSPKELFPEEEIVTLAEGYAKESGARILITEDADEAVKGADVLYTDVWVSMGEED
KFKERVELLQPYQVNMDLVQKAGNDKLIFLHCLPAFHDTNTVYGKDVAEKFGVKEMEVTDEVFRSKYARHFDQAENRMHT
IKAVMAATLGNLFIPKV
>Q58PK7 1.14.13.38~~~otcC~~~Anhydrotetracycline monooxygenase~~~
MRYDVVIAGAGPTGLMLACELRLAGARTLVLERLAERVDFSKALGVHARTVELLDMRGLGRGFQAEAPKLRGGNFASLGV
PLDFSSFDTRHPYALFVPQVRTETLLTGRALELGAELRRGHAVTALEQDADGVTVSVTGPEGPYEVECAYLVGCDGGGIT
VRKLLGIDFPGQDPHMFAVIADARFREELPHGEGMGPMRPYGVMRHDLRAWFAAFPLEPDVYRATVAFFDRPYADRRAPV
TEEDVRAALTEVAGSDFGMHDVRWLSRLTDTSRQAERYRDGRVLLAGDACHIHLPAGGQGLNLGFQDAVNLGWKLGATIA
GTAPPELLDTYEAERRPIAAGVLRNTRAQAVLIDPDPRYEGLRELMIELLHVPETNRYLAGLISALDVRYPMAGEHPLLG
RRVPDLPLVTEDGTRQLSTYFHAARGVLLTLGCDQPLADEAAAWKDRVDLVAAEGVADPGSAVDGLTALLVRPDGYICWT
AAPETGTDGLTDALRTWFGPPAM
>Q81M99 2.1.3.3~~~argF~~~Ornithine carbamoyltransferase~~~COG0078
MSTVQVPKLNTKDLLTLEELTQEEIISLIEFAIYLKKNKQEPLLQGKILGLIFDKHSTRTRVSFEAGMVQLGGHGMFLNG
KEMQMGRGETVSDTAKVLSHYIDGIMIRTFSHADVEELAKESSIPVINGLTDDHHPCQALADLMTIYEETNTFKGIKLAY
VGDGNNVCHSLLLASAKVGMHMTVATPVGYKPNEEIVKKALAIAKETGAEIEILHNPELAVNEADFIYTDVWMSMGQEGE
EEKYTLFQPYQINKELVKHAKQTYHFLHCLPAHREEEVTGEIIDGPQSIVFEQAGNRLHAQKALLVSLFKNVEELS
>P18186 2.1.3.3~~~argF~~~Ornithine carbamoyltransferase~~~COG0078
MHTVTQTSLYGKDLLTLKDLSEEDINALLAEAGELKQNKIQPIFHGKTLAMIFEKSSTRTRVSFEAGMAQLGGSALFLSQ
KDLQLGRGETVADTAKVLSGYVDAIMIRTFEHEKVEELAKEADIPVINGLTDKYHPCQALADLLTIKEIKGKLKGVKVAY
IGDGNNVAHSLMIGCAKMGCDISIASPKGYEVLDEAAEAAKTYALQSGSSVTLTDDPIEAVKDADVIYSDVFTSMGQEAE
EQERLAVFAPYQVNAALVSHAKPDYTFLHCLPAHREEEVTAEIIDGPNSAVFQQAENRLHVQKALLKAILYKGESSKNC
>Q7NGR7 2.1.3.3~~~argF~~~Ornithine carbamoyltransferase~~~COG0078
MSASLGATRFRPDLLSLDDLDEAQLHALLTLAHQLKRGERVANLHGKVLGLVFLKASTRTRVSFTVAMYQLGGQVIDLSP
SNTQVGRGEPVRDTARVLGRYVDGLAIRTFAQTELEEYAHYAGIPVINALTDHEHPCQVVADLLTIRENFGRLAGLKLAY
VGDGNNVAHSLLLGCAKVGMSIAVATPEGFTPDPAVSARASEIAGRTGAEVQILRDPFEAARGAHILYTDVWTSMGQEAE
TQHRLQLFEQYQINAALLNCAAAEAIVLHCLPAHRGEEITDEVMEGPRSRIWDEAENRLHAQKAVLAALMGGR
>Q9K4Y9 2.1.3.3~~~argF~~~Ornithine carbamoyltransferase~~~
MENLLSVKDLSKQQILDLLALAKAVKANPAEYSQALAGKSIVTIYEKPSLRTRVTFDIGIHKLGGHAVYLDAQNGAIGER
ETVKDFAANISRWADAIVARVVSHKTLEGLVEHGSVPVVNSLCDLYHPCQALADFLTISEHYEDVSKVKLAYVGEGNNVT
HSLMLTGAILGAEVTAVCPRGSSPDAQIVKQAMALAEISGGKINVTDNLDDIVDYDVIYGDTWVSMGDDTPLAQVKEKYM
PYQINKALLMRTGIKHVLHCQPAHRELEITSEVMDGEHSLIFDQAENRMHAQNAVLLTLLK
>P0A5M9 2.1.3.3~~~argF~~~Ornithine carbamoyltransferase~~~
MIRHFLRDDDLSPAEQAEVLELAAELKKDPVSRRPLQGPRGVAVIFDKNSTRTRFSFELGIAQLGGHAVVVDSGSTQLGR
DETLQDTAKVLSRYVDAIVWRTFGQERLDAMASVATVPVINALSDEFHPCQVLADLQTIAERKGALRGLRLSYFGDGANN
MAHSLLLGGVTAGIHVTVAAPEGFLPDPSVRAAAERRAQDTGASVTVTADAHAAAAGADVLVTDTWTSMGQENDGLDRVK
PFRPFQLNSRLLALADSDAIVLHCLPAHRGDEITDAVMDGPASAVWDEAENRLHAQKALLVWLLERS
>A0QYS8 2.1.3.3~~~argF~~~Ornithine carbamoyltransferase~~~COG0078
MIRHFLRDDDLSPEEQAEVLTLAADLKKTPFSRRPLEGPRGVAVIFEKNSTRTRFSFEMGIAQLGGHAIVVDGRSTQLGR
EETLEDTGAVLSRYVDAIVWRTFAQERLTAMASGASVPIVNALSDEFHPCQVLADLQTLAERKGKLAGLRMTYFGDGANN
MAHSLMLGGVTAGVHVTIAAPDGFEPDPRFVDAARRRAAETGATVALTKDAKAGADGADVLVTDTWTSMGQENDGLDRVR
PFRPFQVNADLLELADPAAVVLHCLPAHRGHEITDEVIDGPQSAVFDEAENRLHAQKALLVWLLEKR
>P9WIT9 2.1.3.3~~~argF~~~Ornithine carbamoyltransferase~~~COG0078
MIRHFLRDDDLSPAEQAEVLELAAELKKDPVSRRPLQGPRGVAVIFDKNSTRTRFSFELGIAQLGGHAVVVDSGSTQLGR
DETLQDTAKVLSRYVDAIVWRTFGQERLDAMASVATVPVINALSDEFHPCQVLADLQTIAERKGALRGLRLSYFGDGANN
MAHSLLLGGVTAGIHVTVAAPEGFLPDPSVRAAAERRAQDTGASVTVTADAHAAAAGADVLVTDTWTSMGQENDGLDRVK
PFRPFQLNSRLLALADSDAIVLHCLPAHRGDEITDAVMDGPASAVWDEAENRLHAQKALLVWLLERS
>P99073 2.1.3.3~~~argF~~~Ornithine carbamoyltransferase~~~
MKNLRNRSFLTLLDFSRQEVEFLLTLSEDLKRAKYIGTEKPMLKNKNIALLFEKDSTRTRCAFEVAAHDQGANVTYLGPT
GSQMGKKETTKDTARVLGGMYDGIEYRGFSQRTVETLAEYSGVPVWNGLTDEDHPTQVLADFLTAKEVLKKDYADINFTY
VGDGRNNVANALMQGAAIMGMNFHLVCPKELNPTDELLNRCKNIAAENGGNILITDDIDQGVKGSDVIYTDVWVSMGEPD
EVWKERLELLKPYQVNKEMMDKTGNPNVIFEHCLPSFHNADTKIGQQIFEKYGIREMEVTDEVFESKASVVFQEAENRMH
TIKAVMVATLGEF
>P96108 2.1.3.3~~~argF~~~Ornithine carbamoyltransferase~~~COG0078
MSVNLKGRSLLTLLDFSPEEIRYLLDISKQVKMENRSKLRTERFKGMTLAMIFEKRSTRTRLAFETAFAEEGGHPIFLSP
NDIHLGAKESLEDTARVLGRMVDAIMFRGYKQETVEKLAEYSGVPVYNGLTDEFHPTQALADLMTIEENFGRLKGVKVVF
MGDTRNNVATSLMIACAKMGMNFVACGPEELKPRSDVFKRCQEIVKETDGSVSFTSNLEEALAGADVVYTDVWASMGEED
KEKERMALLKPYQVNERVMEMTGKSETIFMHCLPAVKGQEVTYEVIEGKQSRVWDEAENRKHTIKAVMIATLL
>P96134 2.1.3.3~~~argF~~~Ornithine carbamoyltransferase~~~COG0078
MGGEALTLPKDLLDFSGYGPKELQALLDLAERLKRERYRGEDLKGKVLALLFEKPSLRTRTTLEVAMVHLGGHAVYLDQK
QVGIGEREPVRDVAKNLERFVEGIAARVFRHETVEALARHAKVPVVNALSDRAHPLQALADLLTLKEVFGGLAGLEVAWV
GDGNNVLNSLLEVAPLAGLKVRVATPKGYEPDPGLLKRANAFFTHDPKEAALGAHALYTDVWTSMGQEAERAKRLRDFQG
FQVNGELLKLLRPEGVFLHCLPAHYGEETTEEAVHGPRSRVFDQAENRLHTAKAVLLTLLK
>Q8DCF5 2.1.3.3~~~argF~~~Ornithine carbamoyltransferase~~~
MAFNLRNRNFLKLLDFSTKEIQFLIDLSADLKKAKYAGTEQKKLLGKNIALIFEKASTRTRCAFEVAAFDQGAQVTYIGP
SGSQIGDKESMKDTARVLGRMYDGIQYRGFGQAIVEELGAFAGVPVWNGLTDEFHPTQILADFLTMLEHSQGKALADIQF
AYLGDARNNVGNSLMVGAAKMGMDIRLVGPQAYWPDEELVAACQAIAKQTGGKITLTENVAEGVQGCDFLYTDVWVSMGE
SPEAWDERVALMKPYQVNMNVLKQTGNPNVKFMHCLPAFHNDETTIGKQVADKFGMKGLEVTEEVFESEHSIVFDEAENR
MHTIKAVMVATLGS
>H3JQW0 1.14.13.160~~~otemo~~~2-oxo-Delta(3)-4,5,5-trimethylcyclopentenylacetyl-CoA monooxygenase~~~
MSNRAKSPALDAVVIGAGVTGIYQAFLINQAGMKVLGIEAGEDVGGTWYWNRYPGCRLDTESYAYGYFALKGIIPEWEWS
ENFASQPEMLRYVNRAADAMDVRKHYRFNTRVTAARYVENDRLWEVTLDNEEVVTCRFLISATGPLSASRMPDIKGIDSF
KGESFHSSRWPTDAEGAPKGVDFTGKRVGVIGTGATGVQIIPIAAETAKELYVFQRTPNWCTPLGNSPMSKEKMDSLRNR
YPTILEYVKSTDTAFPYHRDPRKGTDVSESERDAFFEELYRQPGYGIWLSGFRDLLLNKESNKFLADFVAKKIRQRVKDP
VVAEKLIPKDHPFGAKRVPMETNYYETYNRDNVHLVDIREAPIQEVTPEGIKTADAAYDLDVIIYATGFDAVTGSLDRID
IRGKDNVRLIDAWAEGPSTYLGLQARGFPNFFTLVGPHNGSTFCNVGVCGGLQAEWVLRMISYMKDNGFTYSEPTQAAEN
RWTEEVYADFSRTLLAEANAWWVKTTTKPDGSVVRRTLVHVSGGPEYRKRCEQVAYNNYNGFELA
>Q0KBC9 4.1.1.104~~~otnC~~~3-oxo-tetronate 4-phosphate decarboxylase~~~COG0235
MSTESKLREEICRIGASLYQRGYTVGSAGNISARLDDGWLITPTDACLGMMDPAAVAKVATDGSWVSGDKPSKTLMLHRA
IYDNNREAHAVVHTHSTHLVALTLAGVWQPDDVLPPLTPYYVMKVGHIPLIPYHRPGDPAVAARVATLAAQVRGVLLERL
GPVVWESSVSRAAFALEELEETAKLWMTMKDTPGFAARAALPDGALTELRDAFQARW
>A0A0H2VA12 4.1.1.104~~~otnC~~~3-oxo-tetronate 4-phosphate decarboxylase~~~COG0235
MSDFAKVEQSLREEMTRIASSFFQRGYATGSAGNLSLLLPDGNLLATPTGSCLGNLDPQRLSKVTADGEWLSGDKPSKEV
LFHLALYRNNPRCKAVVHLHSTWSTALSCLEGLDSNNVIRPFTPYVVMRMGNVPLVPYYRPGDKRIAQDLAELAADNQAF
LLANHGPVVCGESLQEAANNMEELEETAKLIFILGDRPIRYLTAGEIAELRS
>Q46890 4.1.1.104~~~otnC~~~3-oxo-tetronate 4-phosphate decarboxylase~~~COG0235
MSDFAKVEQSLREEMTRIASSFFQRGYATGSAGNLSLLLPDGNLLATPTGSCLGNLDPQRLSKVAADGEWLSGDKPSKEV
LFHLALYRNNPRCKAVVHLHSTWSTALSCLQGLDSSNVIRPFTPYVVMRMGNVPLVPYYRPGDKRIAQDLAELAADNQAF
LLANHGPVVCGESLQEAANNMEELEETAKLIFILGDRPIRYLTAGEIAELRS
>Q57199 4.1.1.104~~~otnC~~~3-oxo-tetronate 4-phosphate decarboxylase~~~COG0235
MTDLAQKELMVQLGRSFYERGYTVGGAGNLSVRLDDNRVLVTPTGSSLGRLSVERLSVLDMEGNLLGGDKPSKEAVFHLA
MYKKNPECKAIVHLHSTYLTALSCLDNLDPNNAIEPFTPYYVMRVGKMQVIPYYRPGSPKIAEELSNRALTGKAFLLANH
GVVVTGSDLLDAADNTEELEETAKLFFTLQGQKIRYLTDTEVKDLENRGK
>Q6CZ24 4.1.1.104~~~otnC~~~3-oxo-tetronate 4-phosphate decarboxylase~~~COG0235
MSEHHNGTEASLSSEQRARAEMVKLGASFFQRGYATGSAGNLSLLLDDGTLLATPTGSCLGELDAERLSKVSLSGEWISG
DKPSKEVSFHLSIYRNDPECKAIVHLHSTYLTALSCLEGLDTQDAIKPFTPYVVMRVGKVPVVPYYRPGDARLGEDLAKL
ASRYKAFLLANHGPVVTGKNLRAAADNMEELEETAKLIFILGDRKIRYLTADDIAELS
>Q0KBD1 5.3.1.35~~~otnI~~~2-oxo-tetronate isomerase~~~COG3622
MPRFAANLSMMYNEHAFLDRFAAAAADGFRAVEFLFPYEHAAAELRARLDANGLTQALFNAAPGDWAAGERGLAALPGRE
ADFRGTIGRALEYAGVIGNDRIHVMAGLIPADADRARCRATYLENLAFAANAAAAQGVTVLIEPINTRDMPGYFLNRQDD
GQAICKEVGAANLKVQFDCYHCQIVEGDVAMKLKRDIAGIGHIQIAGVPERHEPDVGELNYPYLFEVMDTLGYDGWIGCE
YRPRAGTSAGLGWLKPYLGR
>Q46891 5.3.1.35~~~otnI~~~2-oxo-tetronate isomerase~~~COG3622
MPRFAANLSMMFTEVPFIERFAAARKAGFDAVEFLFPYNYSTLQIQKQLEQNHLTLALFNTAPGDINAGEWGLSALPGRE
HEAHADIDLALEYALALNCEQVHVMAGVVPAGEDAERYRAVFIDNIRYAADRFAPHGKRILVEALSPGVKPHYLFSSQYQ
ALAIVEEVARDNVFIQLDTFHAQKVDGNLTHLIRDYAGKYAHVQIAGLPDRHEPDDGEINYPWLFRLFDEVGYQGWIGCE
YKPRGLTEEGLGWFDAWR
>Q57151 5.3.1.35~~~otnI~~~2-oxo-tetronate isomerase~~~COG3622
MPKFAANLTMMFNEVPFLDRFEAAAKAGFKYVEFLWPYDYPAQELKAILDKHGLKVVLFNTPAGDVNKGEWGGSAIPGRE
ADSHRDIDLALEYALALGCPNVHIMSAVVPEGASREEYKQTFIKNVRYASDKYKPYGIKIQLEALSPEVKPNYLLKSQFD
TLEVVELVDRDNVFVQLDYFHAQNVDGNLARLTDKLNGKFAHVQIASVPDRHEPDEGEINYQYIFDKLDEIGYTGYVGCE
YKPRGETVTGLDWFQKYK
>Q6CZ23 5.3.1.35~~~otnI~~~2-oxo-tetronate isomerase~~~COG3622
MPKFAANLSMLFTDVPFLDRFKAAADAGFTSVEYLFPYEYPAPLLAEKLRENGLKQVLFNTAPGNIAAGEWGVSALPDRI
EDARRDIDNALEYALALNCPSVLVMGGVVPPGEDRDAYQQTFIDNLRYAADKFAPHGINIMIEALSAKVKPNYLFASQYQ
ALELANLIDRPNIYIQVDFFHAQIVDGNLTQIIHDLDGRIGHIQIASVPARHEPDEGEINYPFIFAELDRVNYSGWIGCE
YNPRGKTEDGLGWAEPWLEKKH
>A6VKK5 2.7.1.217~~~otnK~~~3-oxo-tetronate kinase~~~COG3395
MMLGVIADDFTGASDIASFLVENGLSAVQMNGVPKQPLNSRVDAIVISLKSRSNPANEAVEQSLNAYNWLQENGCTQFYF
KYCSTFDSTAKGNIGPVTDALLEALNEDFTVITPALPVNGRTIFNGYLFVGEQLLSESGMKNHPITPMTDANLMRLMDAQ
AKGKTGLVAYADVIQGAARVKERFAELKAQGYRYAVVDAADNSQLEVLAEAVAGLKLVTGGSGLGAYIAARLSGGKKGTN
AFTPTKGKTVVLSGSCSVMTNKQVEKYCEKAPHFQLDAAQAINNPNYAEELYQWVTANLDAPLAPMVYATVPPEALKAIQ
NEFGADKASHAIENTFAQLAAKLKRYGVTNFINAGGETSSIVVQQLGFSGFHIGKQIAPGVPWLKAVEEDIYLALKSGNF
GKEDFFEYAQGMFV
>Q8YB10 2.7.1.217~~~otnK~~~3-oxo-tetronate kinase~~~COG3395
MPASNFPTGVHDMRLGVIADDFTGATDIAGFLVGNGLRTIQLNGVPADDLAVDADAVVISLKARSCPTGQAIAESLAALK
WLQKNNCQQFFFKYCSTFDSTPKGNIGPVTDALLEALGEEFTVICPALPVNGRTIYNGYLFVNGVLLSETGMRNHPVTPM
TDSNIMRVMESQSRGRAGNISSTIVDQGSDAVRDALRKLQSEGIRYAVLDALNDQHIETLGRAVSQMKLVTGGSGLADGM
ARAWTQLRGKNVAAAEAAGAPVKGRTVILSGSCSQMTNAQVAAYKAKAPALAMDVEKAINDAAYIDVLAEWVLAQSGDAL
PPLVYATMPPEALKAVQERFGGERASAAIEDLFGQLAKRLEAEGFTRFIVAGGETSGAVTQALAIDGFTIGPQIAPGVPW
VRGIGKPLSLALKSGNFGTEAFFFEAQKIANQEGDK
>A0A0H3KP73 2.7.1.217~~~otnK~~~3-oxo-tetronate kinase~~~COG3395
MTASASRPLLGCIADDFTGATDLANMLVKSGMRTVQTIGVPAESASIDADAIVVALKSRTIPAADAVAQSLAAYEWLRAQ
GCRQFFFKYCSTFDSTDAGNIGPVADALLDAAGGGFTIACPAFPENGRTIYRGHLFVGDVLLNESGMENHPLTPMKDANL
VRVLQRQTSSKVGLIRYDTIARGAADVRACIAQLRADGVRIAIADALSDRDLYVLGEACAALPLVTGGSGIALGLPENFR
RAAELAARDNAASLPRIDGTATVLAGSASKATNAQVAAWRATRPSFRIDPLAAARGEPVVDQALAFARSHLPEPVLIYAT
ATPDEVKAVQQALGVEAAGELVERTLAAIAHGLRALGVRKFVVAGGETSGAVVQALGVKSLQIGAQIDPGVPATATIDTE
PLGLALKSGNFGAVDFFDKALRALDGAA
>Q0KBC8 2.7.1.217~~~otnK~~~3-oxo-tetronate kinase~~~COG3395
MTAGTLAHRPLLGCIADDFTGATDLANTLVRNGMRTVQTIGLPDVGAVQDIGEADALVVALKSRTIPAVEAVAQSLAALQ
WLRAQGCRQFVFKYCSTFDSTDAGNIGPVAEALLAALDSDFTIACPAFPENGRTIFRGHLFVGDALLNESGMEHHPLTPM
TDASLVRVLQRQSKNKVGLLRYDAVARGAHATAERIAALRSDGVRMAIADAVSDADLFTLGEACANLPLITGGSGIALGL
PENFRRAGLLPQRGDAASVPAIDGPGVVLAGSASRATNGQVARWLEQGRPALRIDPLALARGEAVADAALAFAAGHGEPV
LIYATSSPDEVKAVQAELGVERAGHLVEQCLATVAAGLLARGTRRFVVAGGETSGAVVQALGVRALRIGAQIAPGVPATV
TLDAKPLALALKSGNFGGPDFFDEALRQLGGH
>P44093 2.7.1.217~~~otnK~~~3-oxo-tetronate kinase~~~COG3395
MLGVIADDFTGASDIASFLVENGLSTVQMNGVPTQSLNSKVDAIVISLKSRSNPVNEAIEQSLRAYQWLKENGCTQFYFK
YCSTFDSTAKGNIGPVTDALLDELNEDFTVITPALPVNGRTIFNGYLFVGDVLLSESGMKNHPITPMVDANLMRLMDAQA
KGKTGLVAYADVIKGASRVQECFAELKAQGYRYAVVDAVDNSQLEVLAEAVADFKLVTGGSGLGAYMAARLSGGKKGTNA
FTPTKGKTVVLSGSCSVMTNKQVEKYREKAPHFQLDVEQAIHNENYIEQLYQWVIANLDSEFAPMVYATVPPDALKAIQH
QFGVDQASHAIENTFAKLAAKLKQYGVTNFITAGGETSSIVVQELGFTGFHIGKQIAPGVPWLKAVEEDIFLALKSGNFG
KEDFFEYAQGMFL
>B1M1V6 2.7.1.217~~~otnK~~~3-oxo-tetronate kinase~~~COG3395
MSLALGCVADDYTGASDLANTLTKAGLRTIQTIGVPEAGRALPEADAVVVALKSRSIPADQAVARSREAERWLRARGAAH
VMFKVCSTFDSTDAGNIGPVMDALRADAGETVALVTPAFPETGRSVYQGNLFVGSVPLNESPLKDHPLNPMRDANLVRVL
GRQSRSPVGLIDTATVARGAEAVAARLDALAQEGKGAAIADAIFDSDLEVLGRAILDRKFSVGASGLGLGLARALAADGR
GTRDAAGAAVGEPVGGASACLAGSCSQATLQQVAAAEAIMPVLRLDPARLLAGDDVVAEALAFAEERLASGPVLIATSAP
PEAVRALQAAHGVDAAGHAIEAALAAIAEGLVARGVRRLVVAGGETSGAVVDRLGLTAFLLGPEIAAGVPVLRTAGRPEP
MLLALKSGNFGGADFFGRALDMMA
>Q6CZ25 2.7.1.217~~~otnK~~~3-oxo-tetronate kinase~~~COG3395
MIKLGVIADDFTGATDIASFLVNNGLPTVQLNGVPPSDFKVDTQAVVISLKSRSCSAEQAVADSLNALAWLQQQGCQQFY
FKYCSTFDSTAKGNIGPVTDALLEQLGETQTIISPALPVNGRTVYQGHLFVMDQLLSESGMRHHPVTPMTDSNLMRVMEQ
QAAGQCGLVPYAVMDQGADAVKQRLAQLKEQGMRYVVLDTLNEQHLLTQGEALRDMKLVTGGSGLAIGLARQWADSTKQT
SSATEAGKPQSGAGVVLSGSCSVMTNKQVAHYLKQAAGRAIDVARCLESDDAQQSYAQELADWVKAHRDDALAPLLYATS
SPDELAQIQQRWGAEASSHAVEKLFAAVARQLQEDGFQRFIIAGGETSSIVVQTLGIHAFHIGPSISPGVPWVRSTTHPL
SLALKSGNFGDEDFFARAQKEFAA
>Q48PB0 2.7.1.217~~~otnK~~~3-oxo-tetronate kinase~~~COG3395
MSLNNARPLLGCIADDFTGATDLANMLVRGGMRTVQSIGIPSAEMAAGLDADAIVIALKSRTTPSADAVAESLAALEWLR
ERGCEQIFFKYCSTFDSTAAGNIGQVSEALLEQLDSDFTLACPAFPENGRTIFRGHLFVQDQLLSESGMQNHPLTPMTDA
NLVRVLQAQTRHKVGLLRYDSIAQGVEGVRNRIAELRAEGVSMAIADALSDADLYTLGEACADLPLLTGGSGLALGLPGN
FRKAGKLRDIDAAKQVAISGGEVVLAGSASVATNGQVAAWLEDNRPALRINPLDLAAGKPVVEQALTFARDAGQTVLIYA
TSTPDEVKAVQKELGVERSGAMVEAALGEIAKGLLNAGVRRFVVAGGETSGAVVQALGVQLLQIGAQIDPGVPATVSSGA
QPLALALKSGNFGARDFFAKALKQLAGAA
>Q4KBD3 2.7.1.217~~~otnK~~~3-oxo-tetronate kinase~~~COG3395
MTISNPRPLLGCIADDFTGATDLANMLVRGGMRTVQSIGIPSAEVAAGLDADAVVIALKSRTTAASEAVAESLAALQWLR
DQGCEQIFFKYCSTFDSTAAGNIGQVSEALLEALGSDFTLACPAFPENGRTIFRGHLFVQDQLLSESGMQHHPLTPMTDA
NLVRVLQSQTRLPVGLLRYDSIAQGVEAVRSRIAELRGQGVALAIADALSDADLYTLGAACADLPLLTGGSGLALGLPEN
FRRAGKLRDLDAASLPKVAGGEVVLAGSASLATNAQVDAWLEAERPAWRIDPLALAAGEAVVEQALAFAREQQGTVLIYA
TSTPEEVKAVQRQLGAERAGALVENALGEIARGLRDSGVRRFVVAGGETSGAVVKALDVRLLQIGAQIDPGVPATVSSGG
EPLALALKSGNFGGRDFFSKALGQLAGGQA
>Q8ZMG5 2.7.1.217~~~otnK~~~3-oxo-tetronate kinase~~~
MLKIGVIADDFTGATDIASFLVENGMPTVQINDVPTGTQPEGCDAVVISLKTRSCPAQEAIKQSLAALVWLKKQGCQQVY
FKYCSTFDSTAEGNIGPVTDALMVALDTSFTVISPALPVNGRTVYQGYLFVMNHLLAESGMRHHPINPMTDSYLPRLMEA
QAQGRCGVIPAQTLDEGVAATRAALSRLQQEGYRYAVLDALNERHLEIQGEVLRDAPLVTGGSGLAMGLARQWAKHGVSQ
ARSAGYPLSGRAVVLSGSCSQMTNQQVAFYRQHAPTRDVDVARCLSSETREAYAEALAQWVLSQDSELAPMISATASTQA
LAAIQQQYGATEASHAVEALFSLLAARLAEGGITRFIVAGGETSGVVTQSLGITGFHIGPCISPGVPWVNALHAPVSLAL
KSGNFGDESFFIRAQREFQV
>P31677 2.4.1.15~~~otsA~~~Trehalose-6-phosphate synthase~~~COG0380
MSRLVVVSNRIAPPDEHAASAGGLAVGILGALKAAGGLWFGWSGETGNEDQPLKKVKKGNITWASFNLSEQDLDEYYNQF
SNAVLWPAFHYRLDLVQFQRPAWDGYLRVNALLADKLLPLLQDDDIIWIHDYHLLPFAHELRKRGVNNRIGFFLHIPFPT
PEIFNALPTYDTLLEQLCDYDLLGFQTENDRLAFLDCLSNLTRVTTRSAKSHTAWGKAFRTEVYPIGIEPKEIAKQAAGP
LPPKLAQLKAELKNVQNIFSVERLDYSKGLPERFLAYEALLEKYPQHHGKIRYTQIAPTSRGDVQAYQDIRHQLENEAGR
INGKYGQLGWTPLYYLNQHFDRKLLMKIFRYSDVGLVTPLRDGMNLVAKEYVAAQDPANPGVLVLSQFAGAANELTSALI
VNPYDRDEVAAALDRALTMSLAERISRHAEMLDVIVKNDINHWQECFISDLKQIVPRSAESQQRDKVATFPKLA
>A0R4M9 2.4.1.15~~~otsA~~~Trehalose-6-phosphate synthase~~~COG0380
MSPESGHETISGTSDFVVVANRLPVDLERLPDGTTRWKRSPGGLVTALEPLLRKRRGSWIGWAGVADSDEEPIVQDGLQL
HPVRLSADDVAKYYEGFSNATLWPLYHDLIVKPEYHREWWDRYVEVNRRFAEATARAAAEGATVWIQDYQLQLVPKMLRM
LRPDVTIGFFLHIPFPPVELFMQMPWRTEIVEGLLGADLVGFHLPGGAQNFLVLSRRLVGANTSRASIGVRSRFGEVQVG
FRTVKVGAFPISIDSAELDGKARNRAIRQRARQIRAELGNPRKIMLGVDRLDYTKGIDVRLRALSELLEEKRIKRDDTVL
VQLATPSRERVESYIAMREDIERQVGHINGEYGEVGHPIVHYLHRPIPRDELIAFFVAADVMLVTPLRDGMNLVAKEYVA
CRSDLGGALVLSEFTGAAAELRQAYLVNPHDLEGVKDKIEAAVNQNPEEGKRRMRALRRQVLAHDVDRWARSFLDALAAT
GETGDSGVTGESTPAPESDSGSF
>P9WN11 2.4.1.15~~~otsA~~~Trehalose-6-phosphate synthase~~~COG0380
MAPSGGQEAQICDSETFGDSDFVVVANRLPVDLERLPDGSTTWKRSPGGLVTALEPVLRRRRGAWVGWPGVNDDGAEPDL
HVLDGPIIQDELELHPVRLSTTDIAQYYEGFSNATLWPLYHDVIVKPLYHREWWDRYVDVNQRFAEAASRAAAHGATVWV
QDYQLQLVPKMLRMLRPDLTIGFFLHIPFPPVELFMQMPWRTEIIQGLLGADLVGFHLPGGAQNFLILSRRLVGTDTSRG
TVGVRSRFGAAVLGSRTIRVGAFPISVDSGALDHAARDRNIRRRAREIRTELGNPRKILLGVDRLDYTKGIDVRLKAFSE
LLAEGRVKRDDTVVVQLATPSRERVESYQTLRNDIERQVGHINGEYGEVGHPVVHYLHRPAPRDELIAFFVASDVMLVTP
LRDGMNLVAKEYVACRSDLGGALVLSEFTGAAAELRHAYLVNPHDLEGVKDGIEEALNQTEEAGRRRMRSLRRQVLAHDV
DRWAQSFLDALAGAHPRGQG
>P31678 3.1.3.12~~~otsB~~~Trehalose-6-phosphate phosphatase~~~COG1877
MTEPLTETPELSAKYAWFFDLDGTLAEIKPHPDQVVVPDNILQGLQLLATASDGALALISGRSMVELDALAKPYRFPLAG
VHGAERRDINGKTHIVHLPDAIARDISVQLHTVIAQYPGAELEAKGMAFALHYRQAPQHEDALMTLAQRITQIWPQMALQ
QGKCVVEIKPRGTSKGEAIAAFMQEAPFIGRTPVFLGDDLTDESGFAVVNRLGGMSVKIGTGATQASWRLAGVPDVWSWL
EMITTALQQKRENNRSDDYESFSRSI
>P9WFZ5 3.1.3.12~~~otsB~~~Trehalose-phosphate phosphatase~~~COG0561
MRKLGPVTIDPRRHDAVLFDTTLDATQELVRQLQEVGVGTGVFGSGLDVPIVAAGRLAVRPGRCVVVSAHSAGVTAARES
GFALIIGVDRTGCRDALRRDGADTVVTDLSEVSVRTGDRRMSQLPDALQALGLADGLVARQPAVFFDFDGTLSDIVEDPD
AAWLAPGALEALQKLAARCPIAVLSGRDLADVTQRVGLPGIWYAGSHGFELTAPDGTHHQNDAAAAAIPVLKQAAAELRQ
QLGPFPGVVVEHKRFGVAVHYRNAARDRVGEVAAAVRTAEQRHALRVTTGREVIELRPDVDWDKGKTLLWVLDHLPHSGS
APLVPIYLGDDITDEDAFDVVGPHGVPIVVRHTDDGDRATAALFALDSPARVAEFTDRLARQLREAPLRAT
>E1WGG9 3.1.3.12~~~otsB~~~Trehalose-phosphate phosphatase~~~
MAEPLTVSPELTANYAYFFDLDGTLAEIKPHPDQVVVPHKILQLLDRLAAHNAGALALISGRSMTELDALAKPFRFPLAG
VHGAERRDINGKTHIVRLPEAVVREVEALLRSTLVALPGTELESKGMAFALHYRQAPEHEAALLALAQHVTQHWPQLALQ
LGKCVVEIKPKGTNKGEAIAAFMQEAPFAGRIPVFVGDDLTDEAGFGVVNHAGGISVKVGVGATQAAWRLESVPDVWRWL
EQINYPQQEQQVMNNRRDGYESFSRSI
>Q47421 ~~~ousA~~~Glycine betaine/proline/ectoine/pipecolic acid transporter OusA~~~COG0477
MKLKRKRVKPIALDDVTIIDDGRLRKAITAAALGNAMEWFDFGVYGFVAYALGQVFFPGADPGVQMIAALATFSVPFLIR
PLGGVFFGALGDKYGRQKILAITIIIMSISTFCIGLIPSYERIGIWAPILLLLAKMAQGFSVGGEYTGASIFVAEYSPDR
KRGFMGSWLDFGSIAGFVLGAGVVVLISTLIGEQAFLAWGWRLPFFLALPLGLIGLYLRHALEETPAFRQHVEKLEQNDR
DGLKAGPGVSFREIATHHWKSLLVCIGLVIATNVTYYMLLTYMPSYLSHSLHYSENHGVLIIIAIMIGMLFVQPVMGLLS
DRFGRKPFVVIGSVAMFFLAVPSFMLINSDIIGLIFLGLLMLAVILNAFTGVMASTLPALFPTHIRYSALASAFNISVLI
AGLTPTVAAWLVESSQNLYMPAYYLMVIAVIGLLTGLFMKETANKPLKGATPAASDLSEAKEILQEHHDNIEHKIEDITQ
QIAELEAKRQLLVAQHPRIND
>E0SCY1 ~~~ousV~~~Glycine betaine/choline transport system ATP-binding protein OusV~~~COG4175
MAIKLEVTHLYKIFGEHPERAFRLLEQGLSKDQIFEKTGLTVGVKDASLAIEEGEIFVIMGLSGSGKSTLVRLLNRLIEP
TRGQVLIDGEDISRLPDGALRAVRRKQISMVFQSFALMPHLNILDNTAFGMDLAGVPRAEREQKALNALQQVGLETYAHA
YPDELSGGMRQRVGLARALANDPDILLMDEAFSALDPLIRTEMQDELIKLQARRQRTIVFISHDLDEAMRIGDRIAIMHS
GEVIQVGTPDEILNNPANDYVRTFFRGVDISHVFTAKDIARRRPVAVIRKTPGVGPRSALRILQEEDREYGYVLERGRKF
IGVVSIDSLKQALRGQQPLEQALLPAPAPVPASMSLNELISQVAQAPCAVPVVDENHEYLGIISKGMLLQALDKESTLND
>E0SCY2 ~~~ousW~~~Glycine betaine/choline transport system permease protein OusW~~~COG4176
MTDTTQNPWEEDQAPDQAAAANHSHAAATSGEHAAAAGSSGTPAQTDPWAASSSAPAGNTPAPDNAADAWSNAPPPAASD
VHQSAADWLNSTPTPTQEHFNLMDPFRHTLVPLDRWVTEGIDWLVLHFRPLFQGIRVPVDMILTSFQQLLTGLPAPVAIL
VFSLLAWQVSSFGMGVATLLSLVAIGAIGAWSQAMVTLALVLTALFFCVIIGLPLGIWLAHSDRAARIVRPLLDAMQTTP
AFVYLIPIVMLFGIGNVPGVVVTIIFALPPIIRLTILGIRQVPADLVEAAQSFGASPRQMLFKVQLPLAMPTIMAGINQT
LMLALSMVVIASMIAVGGLGQMVLRGIGRLDMGLASIGGVGIVILAIILDRLTQSLGRDARSRGNRHWYHHGPLGLLARP
FIKSRA
>E0SCY3 ~~~ousX~~~Glycine betaine-binding periplasmic protein OusX~~~COG2113
MRNISMATLALTTVLSTGLFAADDLPGKGITVKPVQSTISEETFQTLLVSKALEKLGYTVDKPSEVDYNVGYTSIANGDA
TFTAVNWQPLHDDMYQAAGGDAKFYRQGVYVSGAAQGYLIDKKTAERYHITRLDQLKDPKLAKLFDTNGDGKADLTGCNP
GWGCDSVINHQIQAYGLGDTVNHNQGNYAALIADTIARYKQGKSVIFFTWTPYWVSDVLVPGRDVVWLQVPFSSLPGKQK
GTDTKLPNGANYGFPVNNMRIVANKDWAEKNPAAAKLFAIMKLPLADINAQNLRMHQGEASQQDIERHVNGWINAHQAQF
DGWLNAARAAAK
>Q01567 ~~~outS~~~Pilotin OutS~~~
MHVSSLKVVLFGVCCLSLAACQTPAPVKNTASRSAASVPANEQISQLASLVAASKYLRVQCERSDLPDDGTILKTAVNVA
VQKGWDTGRYQSLPQLSENLYQGLLKDGTPKATQCSSFNRTMTPFLDAMRTVR
>Q9X113 ~~~~~~Oxalate-binding protein~~~COG0662
MKEGTGMVVRSSEITPERISNMRGGKGEVEMAHLLSKEAMHNKARLFARMKLPPGSSVGLHKHEGEFEIYYILLGEGVFH
DNGKDVPIKAGDVCFTDSGESHSIENTGNTDLEFLAVIILL
>P0AFI0 4.1.1.8~~~oxc~~~Oxalyl-CoA decarboxylase~~~COG0028
MSDQLQMTDGMHIIVEALKQNNIDTIYGVVGIPVTDMARHAQAEGIRYIGFRHEQSAGYAAAASGFLTQKPGICLTVSAP
GFLNGLTALANATVNGFPMIMISGSSDRAIVDLQQGDYEELDQMNAAKPYAKAAFRVNQPQDLGIALARAIRVSVSGRPG
GVYLDLPANVLAATMEKDEALTTIVKVENPSPALLPCPKSVTSAISLLAKAERPLIILGKGAAYSQADEQLREFIESAQI
PFLPMSMAKGILEDTHPLSAAAARSFALANADVVMLVGARLNWLLAHGKKGWAADTQFIQLDIEPQEIDSNRPIAVPVVG
DIASSMQGMLAELKQNTFTTPLVWRDILNIHKQQNAQKMHEKLSTDTQPLNYFNALSAVRDVLRENQDIYLVNEGANTLD
NARNIIDMYKPRRRLDCGTWGVMGIGMGYAIGASVTSGSPVVAIEGDSAFGFSGMEIETICRYNLPVTIVIFNNGGIYRG
DGVDLSGAGAPSPTDLLHHARYDKLMDAFRGVGYNVTTTDELRHALTTGIQSRKPTIINVVIDPAAGTESGHITKLNPKQ
VAGN
>P40149 4.1.1.8~~~oxc~~~Oxalyl-CoA decarboxylase~~~
MSNDDNVELTDGFHVLIDALKMNDIDTMYGVVGIPITNLARMWQDDGQRFYSFRHEQHAGYAASIAGYIEGKPGVCLTVS
APGFLNGVTSLAHATTNCFPMILLSGSSEREIVDLQQGDYEEMDQMNVARPHCKASFRINSIKDIPIGIARAVRTAVSGR
PGGVYVDLPAKLFGQTISVEEANKLLFKPIDPAPAQIPAEDAIARAADLIKNAKRPVIMLGKGAAYAQCDDEIRALVEET
GIPFLPMGMAKGLLPDNHPQSAAATRAFALAQCDVCVLIGARLNWLMQHGKGKTWGDELKKYVQIDIQANEMDSNQPIAA
PVVGDIKSAVSLLRKALKGAPKADAEWTGALKAKVDGNKAKLAGKMTAETPSGMMNYSNSLGVVRDFMLANPDISLVNEG
ANALDNTRMIVDMLKPRKRLDSGTWGVMGIGMGYCVAAAAVTGKPVIAVEGDSAFGFSGMELETICRYNLPVTVIIMNNG
GIYKGNEADPQPGVISCTRLTRGRYDMMMEAFGGKGYVANTPAELKAALEEAVASGKPCLINAMIDPDAGVESGRIKSLN
VVSKVGKK
>O34714 4.1.1.2~~~oxdC~~~Oxalate decarboxylase OxdC~~~COG2140
MKKQNDIPQPIRGDKGATVKIPRNIERDRQNPDMLVPPETDHGTVSNMKFSFSDTHNRLEKGGYAREVTVRELPISENLA
SVNMRLKPGAIRELHWHKEAEWAYMIYGSARVTIVDEKGRSFIDDVGEGDLWYFPSGLPHSIQALEEGAEFLLVFDDGSF
SENSTFQLTDWLAHTPKEVIAANFGVTKEEISNLPGKEKYIFENQLPGSLKDDIVEGPNGEVPYPFTYRLLEQEPIESEG
GKVYIADSTNFKVSKTIASALVTVEPGAMRELHWHPNTHEWQYYISGKARMTVFASDGHARTFNYQAGDVGYVPFAMGHY
VENIGDEPLVFLEIFKDDHYADVSLNQWLAMLPETFVQAHLDLGKDFTDVLSKEKHPVVKKKCSK
>O34767 4.1.1.2~~~oxdD~~~Oxalate decarboxylase OxdD~~~COG2140
MLLEQQPINHEDRNVPQPIRSDGAGAIDTGPRNIIRDIQNPNIFVPPVTDEGMIPNLRFSFSDAPMKLDHGGWSREITVR
QLPISTAIAGVNMSLTAGGVRELHWHKQAEWAYMLLGRARITAVDQDGRNFIADVGPGDLWYFPAGIPHSIQGLEHCEFL
LVFDDGNFSEFSTLTISDWLAHTPKDVLSANFGVPENAFNSLPSEQVYIYQGNVPGSVASEDIQSPYGKVPMTFKHELLN
QPPIQMPGGSVRIVDSSNFPISKTIAAALVQIEPGAMRELHWHPNSDEWQYYLTGQGRMTVFIGNGTARTFDYRAGDVGY
VPSNAGHYIQNTGTETLWFLEMFKSNRYADVSLNQWMALTPKELVQSNLNAGSVMLDSLRKKKVPVVKYPGT
>P82604 4.8.1.4~~~oxd~~~Phenylacetaldoxime dehydratase~~~
MKNMPENHNPQANAWTAEFPPEMSYVVFAQIGIQSKSLDHAAEHLGMMKKSFDLRTGPKHVDRALHQGADGYQDSIFLAY
WDEPETFKSWVADPEVQKWWSGKKIDENSPIGYWSEVTTIPIDHFETLHSGENYDNGVSHFVPIKHTEVHEYWGAMRDRM
PVSASSDLESPLGLQLPEPIVRESFGKRLKVTAPDNICLIRTAQNWSKCGSGERETYIGLVEPTLIKANTFLRENASETG
CISSKLVYEQTHDGEIVDKSCVIGYYLSMGHLERWTHDHPTHKAIYGTFYEMLKRHDFKTELALWHEVSVLQSKDIELIY
VNCHPSTGFLPFFEVTEIQEPLLKSPSVRIQ
>Q7WSJ4 4.8.1.2~~~oxdA~~~Aliphatic aldoxime dehydratase~~~
MESAIDTHLKCPRTLSRRVPEEYQPPFPMWVARADEQLQQVVMGYLGVQYRGEAQREAALQAMRHIVSSFSLPDGPQTHD
LTHHTDSSGFDNLMVVGYWKDPAAHCRWLRSAEVNDWWTSQDRLGEGLGYFREISAPRAEQFETLYAFQDNLPGVGAVMD
STSGEIEEHGYWGSMRDRFPISQTDWMKPTNELQVVAGDPAKGGRVVIMGHDNIALIRSGQDWADAEAEERSLYLDEILP
TLQDGMDFLRDNGQPLGCYSNRFVRNIDLDGNFLDVSYNIGHWRSLEKLERWAESHPTHLRIFVTFFRVAAGLKKLRLYH
EVSVSDAKSQVFEYINCHPHTGMLRDAVVAPT
>Q4W7T3 4.8.1.2~~~oxd~~~Aliphatic aldoxime dehydratase~~~
MESAIDTHLKCPRTLSRRVPDEYQPPFAMWMARADEHLEQVVMAYFGVQYRGEAQRAAALQAMRHIVESFSLADGPQTHD
LTHHTDNSGFDNLIVVGYWKDPAAHCRWLRSAPVNAWWASEDRLNDGLGYFREISAPRAEQFETLYAFQDNLPGVGAVMD
RISGEIEEHGYWGSMRDRFPISQTDWMKPTSELQVIAGDPAKGGRVVVLGHGNLTLIRSGQDWADAEAEERSLYLDEILP
TLQDGMDFLRDNGQPLGCYSNRFVRNIDLDGNFLDVSYNIGHWRSVEKLERWTESHPTHLRIFVTFFRVAAGLKKLRLYH
EVSVSDAKSQIFGYINCHPQTGMLRDAQVSPA
>Q76K71 4.8.1.2~~~oxd~~~Aliphatic aldoxime dehydratase~~~
MESAIGEHLQCPRTLTRRVPDTYTPPFPMWVGRADDALQQVVMGYLGVQFRDEDQRPAALQAMRDIVAGFDLPDGPAHHD
LTHHIDNQGYENLIVVGYWKDVSSQHRWSTSTPIASWWESEDRLSDGLGFFREIVAPRAEQFETLYAFQEDLPGVGAVMD
GISGEINEHGYWGSMRERFPISQTDWMQASGELRVIAGDPAVGGRVVVRGHDNIALIRSGQDWADAEADERSLYLDEILP
TLQSGMDFLRDNGPAVGCYSNRFVRNIDIDGNFLDLSYNIGHWASLDQLERWSESHPTHLRIFTTFFRVAAGLSKLRLYH
EVSVFDAADQLYEYINCHPGTGMLRDAVTIAEH
>Q76EV4 4.8.1.2~~~oxd~~~Aliphatic aldoxime dehydratase~~~
MESAIGEHLQCPRTLTRRVPDTYTPPFPMWVGRADDTLHQVVMGYLGVQFRGEDQRPAALRAMRDIVAGFDLPDGPAHHD
LTHHIDNQGYENLIVVGYWKDVSSQHRWSTSPPVSSWWESEDRLSDGLGFFREIVAPRAEQFETLYAFQDDLPGVGAVMD
GVSGEINEHGYWGSMRERFPISQTDWMQASGELRVVAGDPAVGGRVVVRGHDNIALIRSGQDWADAEADERSLYLDEILP
TLQSGMDFLRDNGPAVGCYSNRFVRNIDIDGNFLDLSYNIGHWASLDQLERWSESHPTHLRIFTTFFRVAEGLSKLRLYH
EVSVFDAADQLYEYINCHPGTGMLRDAVITAEH
>Q8VPD4 1.4.3.2~~~~~~L-amino acid oxidase~~~
MAFTRRSFMKGLGATGGAGLAYGAMSTLGLAPSTAAPARTFQPLAAGDLIGKVKGSHSVVVLGGGPAGLCSAFELQKAGY
KVTVLEARTRPGGRVWTARGGSEETDLSGETQKCTFSEGHFYNVGATRIPQSHITLDYCRELGVEIQGFGNQNANTFVNY
QSDTSLSGQSVTYRAAKADTFGYMSELLKKATDQGALDQVLSREDKDALSEFLSDFGDLSDDGRYLGSSRRGYDSEPGAG
LNFGTEKKPFAMQEVIRSGIGRNFSFDFGYDQAMMMFTPVGGMDRIYYAFQDRIGTDNIVFGAEVTSMKNVSEGVTVEYT
AGGSKKSITADYAICTIPPHLVGRLQNNLPGDVLTALKAAKPSSSGKLGIEYSRRWWETEDRIYGGASNTDKDISQIMFP
YDHYNSDRGVVVAYYSSGKRQEAFESLTHRQRLAKAIAEGSEIHGEKYTRDISSSFSGSWRRTKYSESAWANWAGSGGSH
GGAATPEYEKLLEPVDKIYFAGDHLSNAIAWQHGALTSARDVVTHIHERVAQEA
>Q51330 ~~~oxlT~~~Oxalate:formate antiporter~~~
MNNPQTGQSTGLLGNRWFYLVLAVLLMCMISGVQYSWTLYANPVKDNLGVSLAAVQTAFTLSQVIQAGSQPGGGYFVDKF
GPRIPLMFGGAMVLAGWTFMGMVDSVPALYALYTLAGAGVGIVYGIAMNTANRWFPDKRGLASGFTAAGYGLGVLPFLPL
ISSVLKVEGVGAAFMYTGLIMGILIILIAFVIRFPGQQGAKKQIVVTDKDFNSGEMLRTPQFWVLWTAFFSVNFGGLLLV
ANSVPYGRSLGLAAGVLTIGVSIQNLFNGGCRPFWGFVSDKIGRYKTMSVVFGINAVVLALFPTIAALGDVAFIAMLAIA
FFTWGGSYALFPSTNSDIFGTAYSARNYGFFWAAKATASIFGGGLGAAIATNFGWNTAFLITAITSFIAFALATFVIPRM
GRPVKKMVKLSPEEKAVH
>Q9I6Z0 3.5.4.32~~~~~~8-oxoguanine deaminase~~~
MSRTWIRNPLAIFTANGLDAAGGLVVEDGRIVELLGAGQQPAQPCASQFDASRHVVLPGLVNTHHHFYQTLTRAWAPVVN
QPLFPWLKTLYPVWARLTPEKLELATKVALAELLLSGCTTAADHHYLFPGGLEQAIDVQAGVVEELGMRAMLTRGSMSLG
EKDGGLPPQQTVQEAETILADSERLIARYHQRGDGARVQIALAPCSPFSVTPEIMRASAEVAARHDVRLHTHLAETLDEE
DFCLQRFGLRTVDYLDSVGWLGPRTWLAHGIHFNAEEIRRLGEAGTGICHCPSSNMRLASGICPTVELEAAGAPIGLGVD
GSASNDASNMILEARQALYLQRLRYGAERITPELALGWATRGSARLLGRSDIGELAPGKQADLALFKLDELRFSGSHDPL
SALLLCAADRADRVMVGGAWRVVDGAVEGLDLAALIARHRAAASALIAG
>Q3S8R0 1.14.13.232~~~oxyE~~~6-methylpretetramide 4-monooxygenase~~~
MTGHPRPPADGAHTDVCVVGGGPAGLTLALLMLRSGARVTLVERSRSLDRAYRGEILQPGGQALLDALGVLEGARRRGCH
EHDGFRLEERGRTLINGDYRRLPGPYNCLLSLPQQHLLTDLLERCRAHPRFTCLTGTKVNGLVEEGGVVRGVVCGGGADG
LVVRADCVVGADGRYSTVRKLAGIPYDRIELFDQDVLWCKLTAPATRTVRIFRAGGNPVLAYTSFPDCVQLGWTLPHKGY
QALAERGFAHVKERIRAAVPEYADTVDQQLNSFKDVSLLDVFAGSARRWARDGLLLIGDSAHTHSPIGAQGINLAIQDAV
AAHPVLCEGLRRRDLSERFLDAVAARRRPETERATRVQVMQSRMMLSTGRVSAAVRPKAAMLVSRTPAYRSVLRRIAYGD
QTLRVRSDLFEEGEPATV
>Q3S8Q4 1.14.13.232~~~oxyL~~~6-methylpretetramide 4-monooxygenase~~~
MPPEADGPQVLIAGAGPVGLTLAHELTRRRVRVRVIDRADGPATTSRALAVHPRTLEACHQMGLADALVARGRPVVHFTV
HLRGRQLIRFDTNYGRLPTAYPFSLMLDQVRTEEILRERLAGLGVGIEWGVELADCAPCGDRVNAELRRDGRSEQVTVPW
LVGADGSRSTVRERLGLRLVGDATQTWLNADVVLDADLSRDSNHLVHTGSGTVLLVPFPDPGKWRAVDTGYAGQGADPET
VRRRLAGSLARGLGRPVAVSEPTWVSVFRVQQRMITAMRSGRCFVAGDAAHVHSPASGQGMNTGMQDAYNLAWKLADVVR
GHAREELLDTYAAERIPVGGRLLSSTRTATALVALRNAVAPVAMPVGLSFLKAVRPLKRRVEHRIMAGMSGLALHYADSP
LTYGTGDGAAGVHPGHLVACTEQDVARHPGLRALRQALTDPRWLLLLFADDGGAAELALRYGRAVQIRTVIPHEDEDGPA
LADPDDALRQTLGVPPGGWALIRPDGYLAAKGQRSGTTTLTARLQALHLLPEDTAPGAGDSAGRPAPDGTRRGVTTE
>P0ACQ4 ~~~oxyR~~~Hydrogen peroxide-inducible genes activator~~~COG0583
MNIRDLEYLVALAEHRHFRRAADSCHVSQPTLSGQIRKLEDELGVMLLERTSRKVLFTQAGMLLVDQARTVLREVKVLKE
MASQQGETMSGPLHIGLIPTVGPYLLPHIIPMLHQTFPKLEMYLHEAQTHQLLAQLDSGKLDCVILALVKESEAFIEVPL
FDEPMLLAIYEDHPWANRECVPMADLAGEKLLMLEDGHCLRDQAMGFCFEAGADEDTHFRATSLETLRNMVAAGSGITLL
PALAVPPERKRDGVVYLPCIKPEPRRTIGLVYRPGSPLRSRYEQLAEAIRARMDGHFDKVLKQAV
>L8EYU3 1.3.98.4~~~oxyR~~~5a,11a-dehydrotetracycline/5a,11a-dehydrooxytetracycline reductase~~~
MPFTQKEITYLRAQGYGRLATVGAHGEPHNVPVSFEIDEERGTIEITGRDMGRSRKFRNVAKNDRVAFVVDDVPCRDPEV
VRAVVIHGTAQALPTGGRERRPHCADEMIRIHPRRIVTWGIEGDLSTGVHARDITAEDGGRR
>Q9X5P2 ~~~oxyR~~~Probable hydrogen peroxide-inducible genes activator~~~
MRKRRRQPSLAQLRAFAAVAEHLHFRDAAAAIGMSQPALSGAVSALEESLGVTLLERTTRKVLLSPAGERLAARAKSVLA
EVGALVEEADALQAPFTGVLRLGVIPTVAPYVLPTVLRLVHDRYPRLDLQVHEEQTASLLDGLTGGRLDLLLLAVPLGVP
GVVELPLFDEDFVLVTPLEHGLGGREGIPRKALRELNLLLLDEGHCLRDQALDICREAGSAGVAATTTAAGLSTLVQLVA
GGLGVTLLPHTAVQVETTRSGRLLTGRFADPAPGRRIALAMRTGAARAAEYEELAAALREAMRDLPVRIVRD
>L8EUQ6 1.14.13.234~~~oxyS~~~12-dehydrotetracycline 5-monooxygenase/anhydrotetracycline 6-monooxygenase~~~
MRYDVVIAGAGPTGLMLACELRLAGARTLVLERLAEPVDFSKALGVHARTVELLDMRGLGEGFQAEAPKLRGGNFASLGV
PLDFSSFDTRHPYALFVPQVRTEELLTGRALELGAELRRGHAVTALEQDADGVTVSVTGPEGPYEVECAYLVGCDGGGST
VRKLLGIDFPGQDPHMFAVIADARFREELPHGEGMGPMRPYGVMRHDLRAWFAAFPLEPDVYRATVAFFDRPYADRRAPV
TEEDVRAALTEVAGSDFGMHDVRWLSRLTDTSRQAERYRDGRVLLAGDACHIHLPAGGQGLNLGFQDAVNLGWKLGATIA
GTAPPELLDTYEAERRPIAAGVLRNTRAQAVLIDPDPRYEGLRELMIELLHVPETNRYLAGLISALDVRYPMAGEHPLLG
RRVPDLPLVTEDGTRQLSTYFHAARGVLLTLGCDQPLADEAAAWKDRVDLVAAEGVADPGSAVDGLTALLVRPDGYICWT
AAPETGTDGLTDALRTWFGPPAM
>Q3S8P6 2.1.1.335~~~oxyT~~~N,N-dimethyltransferase OxyT~~~
MTTTPLAPVAQARSLLQLTTAYHQAKALHSAVELGLFDLLADGPATAEEVKDRLRIVHPLAKEFLDALVALELLEADGDR
YRNSPAAQAFLVSGASEYLGGTVLQHARKHYHVWAGLTTALQEGEAGSGAEAHGPEAYPKHYEDPERARQVMAHFDTFSS
FTAEELARRVDWSGYGSFIDIGGARGNLATRVALAHPHLHGAVFDLPALAPLAGELIRERGLEGRVRFHGGDFLTDPLPS
ADAVVTGHVLPDWPVPQRRKLLARIHEALPSGGALVVYDLMTDPATTTVHDVLQRLNHGLIRGDSSSSSVEEYRAEIEEA
GFRVRQAERIDNLLGDWLIVAVKP
>P0DN73 1.-.-.-~~~~~~Uncharacterized oxidoreductase SpyM50865~~~
MKERFSPLFEPLTLPNGSQLDNRFVLSPMVTNSSTKDGYVTQDDVSYALRRAASAPLQITGAAYVDPYGQLFEYGFSVTK
DADISGLKELAQAMKAKGAKAVLQLTHAGRFASHALTKYGFVYGPSYMQLRSPQPHEVKPLTGQQIEELIAAYAQATRRA
IQAGFDGVEVSSAQRLLIQTFFSTFSNKRTDSYGCQTLFNRSKLTLAVLQAVQQVINQEAPDGFIFGFRATPEETRGNDI
GYSIDEFLQLMDWVLNIAKLDYLAIASWGRHVFRNTVRSPGPYYGRRVNQVVRDYLRNKLPVMATGGMNTPDKAIEALAH
ADFIGVSTPFVVDPEFAHKIKEGCEESIHLRIRPADLKSLAIPQASFKDIVPLMDYGESLPKESRTLFRSLTHNYKEIK
>Q9X0Y1 3.1.3.-~~~~~~Phosphorylated carbohydrates phosphatase TM_1254~~~COG0637
MEAVIFDMDGVLMDTEPLYFEAYRRVAESYGKPYTEDLHRRIMGVPEREGLPILMEALEIKDSLENFKKRVHEEKKRVFS
ELLKENPGVREALEFVKSKRIKLALATSTPQREALERLRRLDLEKYFDVMVFGDQVKNGKPDPEIYLLVLERLNVVPEKV
VVFEDSKSGVEAAKSAGIERIYGVVHSLNDGKALLEAGAVALVKPEEILNVLKEVL
>P75211 ~~~~~~Protein P200~~~
MPKTIKKQNPSNTTLQYKKYLEQSKEKTAKAKNKDVSIDDLLKKPFLEEIKTNVLKKNKTTRASTATRGTSKVKKQIVES
SIDFFDEKKRGVFIVPPAGTSVINDDRDDNKAVEETVSKTAISQNQLAHYANSELVETEQFELKPVALEHNQVLTSTRHS
QERESIFEKAQLFWQIFVGDVRFGFWKNHTWIWLGFFDQHQNWYYFEVVETVELPQEHTAFIKRKQIDSCFWKPLVGNPN
YGYIQNNIWVWKGFFDTKLNWIPDPVRFTLPMVEKATTTTPVVQIELPAPPTVTVVDQTSPPTAAVTVSTSQPVIEEQTT
VFNQTTQLEQLSVSAPLLDQSEVETEMVEVPFVAPSTTTTQPQVVTVQAQPASSSIQFQEPIIKVEFVNESFDFKKPSQT
AAAASQAPSQAINIALNEADLIDELVAVGTTATTALPQSELIQEVVVIDNGQPQQAGFHYVVDFLTSTAPLTVAEIELQE
QELVNEFVTTTSRETTTFASTPVFEPVVIPTVESEEQLLENEFVESTVVSATSNEPNVASTPVVETVELTETPVSLEPLE
TVQLETAPVVTETVTVTEKAVEPEVLAVVEEAPLAVEPIVETSTTLAAETVEEAQVEQESTAVAVEPAIETESKATSEAQ
AELDWEALIGNSEYGYFDAEQNWIWTGYFDEDNKWVSTATAQTEANAEEVVLTADAETSELNTESDPSFEPEVEIQPEPE
PNFDLETIPEPESIETTEPEPNFEPEVELEPEIEPNFESETEVQQELAQESSFESEPEPNFETEVEVQPESEIESKFEAE
VQSEPKVSLNSDFETKPEAQAEVTPETLEVEATSEAPELQPETEATKVVDDVEEEQLDWELLIGNSNYGHYEPSGEWVWA
GYFDDNQIWTPDASVEWARESDYTDLIGDEIYGRYNRKGEWIWYGYYDETGEWVLVDEHYQNHQPRISEAPRFWEQLIGN
EDYGYYEDNEWKWYDGEFDSEGNWLVFHSSNAEDAKNIDIAKDIPVFESFDVDSIDADEWLDQFSDSDAKEVFGED
>P0CL67 ~~~~~~Outer surface 22 kDa lipoprotein~~~
MYKNGFFKNYLSLFLIFLVIACTSKDSSNEYVEEQEAENSSKPDDSKIDEHTIGHVFHAMGVVHSKKDRKSLGKNIKVFY
FSEEDGHFQTIPSKENAKLIVYFYDNVYAGEAPISISGKEAFIFVGITPDFKKIINSNLHGAKSDLIGTFKDLNIKNSKL
EITVDENNSDAKTFLESVNYIIDGVEKISPMLTN
>Q9I6N5 4.1.1.93~~~hudA~~~Pyrrole-2-carboxylic acid decarboxylase~~~
MNRSALDFRHFVDHLRRQGDLVDVHTEVDANLEIGAITRRVYERRAPAPLFHNIRDSLPGARVLGAPAGLRADRARAHSR
LALHFGLPEHSGPRDIVAMLRAAMRAEPIAPRRLERGPVQENVWLGEQVDLTRFPVPLLHEQDGGRYFGTYGFHVVQTPD
GSWDSWSVGRLMLVDRNTLAGPTIPTQHIGIIREQWRRLGKPTPWAMALGAPPAALAAAGMPLPEGVSEAGYVGALVGEP
VEVVRTQTNGLWVPANTEIVLEGEISLDETALEGPMGEYHGYSFPIGKPQPLFHVHALSFRDQPILPICVAGTPPEENHT
IWGTMISAQLLDVAQNAGLPVDMVWCSYEAATCWAVLSIDVQRLAALGTDAAAFAARVAETVFGSHAGHLVPKLILVGND
IDVTEIDQVVWALATRAHPLHDHFAFPQIRDFPMVPYLDAEDKARGSGGRLVINCLYPEQFAGQMRAATASFRHAYPTAL
RRRVEERWSDYGFGDA
>Q02470 3.4.21.96~~~prtP~~~PII-type proteinase~~~
MQRKKKGLSILLAGTVALGALAVLPVGEIQAKAAISQQTKVSSLANTVKAATAKQAATDTTAATTNQAIATQLAAKGIDY
NKLNKVQQQDTYVDVIVQMSAAPASENGTLRTDYSSTAEIQQETNKVIAAQASVKAAVEQVTQQTAGESYGYVVNGFSTK
VRVVDIPKLKQIAGVKTVTLAKVYYPTDAKANSMANVQAVWSNYKYKGEGTVVSVIDTGIDPTHKDMRLSDDKDVKLTKY
DVEKFTDTAKHGRYFTSKVPYGFNYADNNDTITDDTVDEQHGMHVAGIIGANGTGDDPTKSVVGVAPEAQLLAMKVFTNS
DTSATTGSATLVSAIEDSAKIGADVLNMSLGSDSGNQTLEDPEIAAVQNANESGTAAVISAGNSGTSGSATQGVNKDYYG
LQDNEMVGTPGTSRGATTVASAENTDVISQAVTITDGKDLQLGPETIQLSSNDFTGSFDQKKFYVVKDASGDLSKGAAAD
YTADAKGKIAIVKRGELNFADKQKYAQAAGAAGLIIVNNDGTATPLTSIRLTTTFPTFGLSSKTGQKLVDWVTAHPDDSL
GVKIALTLLPNQKYTEDKMSDFTSYGPVSNLSFKPDITAPGGNIWSTQNNNGYTNMSGTSMASPFIAGSQALLKQALNNK
NNPFYADYKQLKGTALTDFLKTVEMNTAQPINDINYNNVIVSPRRQGAGLVDVKAAIDALEKNPSTVVAENGYPAVELKD
FTSTDKTFKLTFTNRTTHELTYQMDSNTDTNAVYTSATDPNSGVLYDKKIDGAAIKAGSDITVPAGKTAQIEFTLSLPKS
FDQQQFVEGFLNFKGSDGSRLNLPYMGFFGDWNDGKIVDSLNGITYSPAGGNYGTVPLLTNKNTGHQYYGGMVTDADGKQ
TVDDQAIAFSSDKNALYNDISMQYYLLRNISNVQVDILDGQGNKVTTLSSSTNQTKTYYDAHSQKYIYYNAPAWDGTYYD
QRDGNIKTADDGSYTYRISGVPEGGDKRQVFDVPFKLDSKAPTVRHVALSAKTENGKTQYYLTAEAKDDLSGLDATKSVK
TAINEVTNLDATFTDAGTTADGYTKIETPLSDEQAQALGNGDNSAELYLTDNASNATNQDASVQKPGSTSFDLIVNGGGI
PDKISSTTTGYEANTQGGGTYTFSGTYPAAVDGTYTDAQGKKHDLNTTYDAATNSFTASMAVTNADYAAQVDLYADKAHT
QLLKHFDTKVRLTAPTFTDLKFNNGSDQTSEATIKVTGTVSSDTKTVNVGDTVAALDAQHHFSVDVPVNYGDNTIKVTAT
DEDGNTTTEQKTITSSYDPDVLKNAVTFDQGVKFGANEFNATSAKFYDPKTGIATITGKVKHPTTTLQVDGKQISIKNDL
TFSFTLDLGTLGQKPFGVVVGDTTQNKTFQEALTFILDAVAPTLSLDSSTDAPVYTNDPNFQITGTATDNAQYLSLAING
SHVASQYADININSGKPGHMAIDQPVKLLEGKNVLTVAVTDSENNTTTKKITVYYEPKKTLAAPTVTPSTTEPAKTVTLT
ANAAATGETVQYSADGGKTYQDVPAAGVTVTANGTFKFKSTDLYGNESPAVDYVVTNIKADDPAQLQTAKQALTNLIASA
KTLSASGKYDDATTTALAAATQKAQTALDQTDASVDSLTGANRDLQTAINQLAAKLPADKKTSLLNQLQSVKAALGTDLG
NQTDPSTGKTFTAALDDLVAQAQAGTQTADQLQASLAKVLDAVLAKLAEGIKAATPAEVGNAKDAATGKTWYADIADTLT
SGQASADASDKLAHLQALQSLKTKVAAAVEAAKTAGKGDDTTGTSDKGGGQGTPAPAPGDTGKDKGDEGSQPSSGGNIPT
KPATTTSTSTDDTTDRNGQHTSGKGALPKTAETTERPAFGFLGVIVVSLMGVLGLKRKQREE
>P0DJ01 ~~~~~~Lipoprotein p35~~~
MKIKKIKLLKALALTGAFGIVATVPVIVSSCSSTSENNGNGNGNGGTDGNTQQTEVTPAIKSEVSLTGALSKIYDTKTGT
DRETTSQLIVKDIKANPENYFTNGEALKDVIASATVTVDGGFTESTFTGEAYSVWSAKADVKKGTYSQASKQLDIKSIND
LQTVLGDSAAIKGICDLIPNLKLNNGTDYKVTNNGLSLSEDLLHINVTAKDGQTDVSMDLAIPVSDLNLKIDGLKISVSG
TGIKTSELTTNYKFNIGIDNTVKTLTPAAVTLAEADRTNAEKVLEKLGYATVSGSTYTLDQDKLADALGLYNCKFEAVKS
EKDSTNNNKYTVTLKATPNDGYFWEDGTNGAKEEISFVATFS
>P15363 ~~~~~~High affinity transport system protein p37~~~
MLKKLKNFILFSSIFSPIAFAISCSNTGVVKQEDVSVSQGQWDKSITFGVSEAWLNKKKGGEKVNKEVINTFLENFKKEF
NKLKNANDKTKNFDDVDFKVTPIQDFTVLLNNLSTDNPELDFGINASGKLVEFLKNNPGIITPALETTTNSFVFDKEKDK
FYVDGTDSDPLVKIAKEINKIFVETPYASWTDENHKWNGNVYQSVYDPTVQANFYRGMIWIKGNDETLAKIKKAWNDKDW
NTFRNFGILHGKDNSFSKFKLEETILKNHFQNKFTTLNEDRSAHPNAYKQKSADTLGTLDDFHIAFSEEGSFAWTHNKSA
TKPFETKANEKMEALIVTNPIPYDVGVFRKSVNQLEQNLIVQTFINLAKNKQDTYGPLLGYNGYKKIDNFQKEIVEVYEK
AIK
>Q49410 ~~~~~~High affinity transport system protein p37~~~
MLFKKFTWVIPSLFLTIISTSLLISCATKSDNTLIFNISLDHNADTSIEKFFTVFSKKLSGKLNKKINVNFNIVDDSFTK
INNIQANKADFAFVNSQAIASNNWFGYTPLIQTLTTAFKEDLELDYYEDGNLQKKAEKTNLLFLSPPYKEWDDIKQKWTG
NRYDFLYEPSKLVSFYRSMILITGSASEITAIKKAWNEKNWNQFMKFGIGHGQTNSASRFELPDLLFRKHFAKNYPGLQN
AINSDPDKFAVVRGREIGINKNIKIVFDDANSFSWTQNIKGSKRPFYTPIDPNDRLEILTYSDPLLYDIGIVSNNLSRIY
QKAIGEIFIELAQSSEDLYGPSIGYNGYKMINDFEKEVVEIIEKTYGK
>P96010 1.14.11.28~~~~~~L-proline cis-3-hydroxylase 1~~~
MRSHILGRIELDQERLGRDLEYLATVPTVEEEYDEFSNGFWKNIPLYNASGGSEDRLYRDLEGSPAQPTKHAEQVPYLNE
IITTVYNGERLQMARTRNLKNAVVIPHRDFVELDRELDQYFRTHLMLEDSPLAFHSDDDTVIHMRAGEIWFLDAAAVHSA
VNFAEFSRQSLCVDLAFDGAFDEKEAFADATVYAPNLSPDVRERKPFTKEREAGILALSGVIGRENFRDILFLLSKVHYT
YDVHPGETFEWLVSVSKGAGDDKMVEKAERIRDFAIGARALGERFSLTTW
>O09345 1.14.11.28~~~~~~L-proline cis-3-hydroxylase 2~~~
MRSHILGKIELDQTRLAPDLAYLAAVPTVEEEYDEFSNGFWKHVPLWNASGDSEDRLYRDLKDAAAQPTAHVEHVPYLKE
IVTTVFDGTHLQMARSRNLKNAIVIPHRDFVELDREVDRYFRTFMVLEDSPLAFHSNEDTVIHMRPGEIWFLDAATVHSA
VNFSEISRQSLCVDFAFDGPFDEKEIFADATLYAPGSTPDLPERRPFTAEHRRRILSLGQVIERENFRDILFLLSKVHYK
YDVHPSETYDWLIEISKQAGDEKMVVKAEQIRDFAVEARALSERFSLTSW
>P15292 3.4.21.96~~~prtP~~~PIII-type proteinase~~~
MQRKKKGLSILLAGTVALGALAVLPVGEIQAKAAISQQTKGSSLANTVTAATAKQAATDTTAATTNQAIATQLAAKGIDY
NKLNKVQQQDIYVDVIVQMSAAPASENGTLRTDYSSTAEIQQETNKVIAAQASVKAAVEQVTQQTAGESYGYVVNGFSTK
VRVVDIPKLKQIAGVKTVTLAKVYYPTDAKANSMANVQAVWSNYKYKGEGTVVSVIDSGIDPTHKDMRLSDDKDVKLTKS
DVEKFTDTAKHGRYFNSKVPYGFNYADNNDTITDDTVDEQHGMHVAGIIGANGTGDDPAKSVVGVAPEAQLLAMKVFTNS
DTSATTGSATLVSAIEDSAKIGADVLNMSLGSDSGNQTLEDPELAAVQNANESGTAAVISAGNSGTSGSATEGVNKDYYG
LQDNEMVGSPGTSRGATTVASAENTDVITQAVTITDGTGLQLGPETIQLSSHDFTGSFDQKKFYIVKDASGNLSKGALAD
YTADAKGKIAIVKRGEFSFDDKQKYAQAAGAAGLIIVNTDGTATPMTSIALTTTFPTFGLSSVTGQKLVDWVTAHPDDSL
GVKITLAMLPNQKYTEDKMSDFTSYGPVSNLSFKPDITAPGGNIWSTQNNNGYTNMSGTSMASPFIAGSQALLKQALNNK
NNPFYAYYKQLKGTALTDFLKTVEMNTAQPINDINYNNVIVSPRRQGAGLVDVKAAIDALEKNPSTVVAENGYPAVELKD
FTSTDKTFKLTFTNRTTHELTYQMDSNTDTNAVYTSATDPNSGVLYDKKIDGAAIKAGSNITVPAGKTAQIEFTLSLPKS
FDQQQFVEGFLNFKGSDGSRLNLPYMGFFGDWNDGKIVDSLNGITYSPAGGNFGTVPLLKNKNTGTQYYGGMVTDADGNK
TVDDQAIAFSSDKNALYNEISMKYYLLRNISNVQVDILDGQGNKVTTLSSSTNRKKTYYNAHSQQYIYYNAPAWDGTYYD
QRDGNIKTADDGSYTYRISGVPEGGDKRQVFDVPFKLDSKAPTVRHVALSAKTENGKTQYYLTAEAKDDLSGLDATKSVK
TEINEVTNLDATFTDAGTTADGYTKIETPLSDEQAQALGNGDNSAELYLTDNASNATDQDASVQKPGSTSFDLIVNGGGI
PDKISSTTTGYEANTQGGGTYTFSGTYPAAVDGTYTDAQGKKHDLNTTYDAATNSFTASMPVTNADYAAQVDLYADKAHT
QLLKHFDTKVRLTAPTFTDLKFNNGSDQTSEATIKVTGTVSADTKTVNVGDTVAALDAQHHFSVDVPVNYGDNTIKVTAT
DEDGNTTTEQKTITSSYDPDMLKNSVTFDQGVTFGANEFNATSAKFYDPKTGIATITGKVKHPTTTLQVDGKQIPIKDDL
TFSFTLDLGTLGQKPFGVVVGDTTQNKTFQEALTFILDAVAPTLSLDSSTDAPVYTNDPNFQITGTATDNAQYLSLSING
SSVASQYVDININSGKPGHMAIDQPVKLLEGKNVLTVAVTDSEDNTTTKNITVYYEPKKTLAAPTVTPSTTEPAKTVTLT
ANSAATGETVQYSADGGKTYQDVPAAGVTVTANGTFKFKSTDLYGNESPAVDYVVTNIKADDPAQLQAAKQELTNLIASA
KTLSASGKYDDATTTALAAATQKAQTALDQTNASVDSLTGANRDLQTAINQLAAKLPADKKTSLLNQLQSVKAALGTDLG
NQTDSSTGKTFTAALDDLVAQAQAGTQTDDQLQATLAKVLDAVLAKLAEGIKAATPAEVGNAKDAATGKTWYADIADTLT
SGQASADASDKLAHLQALQSLKTKVAAAVEAAKTVGKGDGTTGTSDKGGGQGTPAPAPGDTGKDKGDEGSQPSSGGNIPT
KPATTTSTTTDDTTDRNGQLTSGTSDKGGGQGTPAPAPGDIGKDKGDEGSQPSSGGNIPTNPATTTSTTTDDTTDRNGQL
TSGKGALPKTGETTERPAFGFLGVIVVSLMGVLGLKRKQREE
>P0C0J8 ~~~~~~46 kDa surface antigen~~~COG4213
MLRKKFLYSSAIYATSLASIIAFVAAGCGQTESGSTSDSKPQAETLKHKVSNDSIRIALTDPDNPRWISAQKDIISYVDE
TEAATSTITKNQDAQNNWLTQQANLSPAPKGFIIAPENGSGVGTAVNTIADKGIPIVAYDRLITGSDKYDWYVSFDNEKV
GELQGLSLAAGLLGKEDGAFDSIDQMNEYLKSHMPQETISFYTIAGSQDDNNSQYFYNGAMKVLKELMKNSQNKIIDLSP
EGENAVYVPGWNYGTAGQRIQSFLTINKDPAGGNKIKAVGSKPASIFKGFLAPNDGMAEQAITKLKLEGFDTQKIFVTGQ
DYNDKAKTFIKDGDQNMTIYKPDKVLGKVAVEVLRVLIAKKNKASRSEVENELKAKLPNISFKYDNQTYKVQGKNINTIL
VSPVIVTKANVDNPDA
>C1FUH7 ~~~~~~Protein P47~~~
MNTYGWDIVYGCSKRVVNKHLKEYITKNNIQFLYSNIDKKQEIKMVFDNWEIINGGSSNFLRIKTPIKEGYFKVRNTTVD
LSGINPVLEIKLDFFNDISNPNIKELKFNFGSESNDDIKIIVSDLNGNLQEEDEFYFNKLLINAFIQNEKQISYIFASLN
VTSDIEWMNPKQFKFVYYSPTDNSDGYLFILSVVTNRDISKLSANVDGNILGNNSEVGLLISEKLFLQNMVLSRLSSNMG
SNINKNNFEVISTSDTTGRIVNNSTLNWYGLKVAALYYYPKINNFSMQLFEGNKLKISLRGLVRLTGLEAVYSDFEIQSI
NKFVYNSTNKKAYFEVDKNPTSSYKYHLFPGDLISLAVLSSVTHWSIKSIEGALGFELINNFVDLINNTIKWNNLKISQV
TNVTLNVGFCIQGNAN
>Q989T9 1.14.11.56~~~~~~L-proline cis-4-hydroxylase~~~COG3555
MTTRILGVVQLDQRRLTDDLAVLAKSNFSSEYSDFACGRWEFCMLRNQSGKQEEQRVVVHETPALATPLGQSLPYLNELL
DNHFDRDSIRYARIIRISENACIIPHRDYLELEGKFIRVHLVLDTNEKCSNTEENNIFHMGRGEIWFLDASLPHSAGCFS
PTPRLHLVVDIEGTRSLEEVAINVEQPSARNATVDTRKEWTDETLESVLGFSEIISEANYREIVAILAKLHFFHKVHCVD
MYGWLKEICRRRGEPALIEKANSLERFYLIDRAAGEVMTY
>Q92LF6 1.14.11.56~~~~~~L-proline cis-4-hydroxylase~~~COG3555
MSTHFLGKVKFDEARLAEDLSTLEVAEFSSAYSDFACGKWEACVLRNRTGMQEEDIVVSHNAPALATPLSKSLPYLNELV
ETHFDCSAVRYTRIVRVSENACIIPHSDYLELDETFTRLHLVLDTNSGCANTEEDKIFHMGLGEIWFLDAMLPHSAACFS
KTPRLHLMIDFEATAFPESFLRNVEQPVTTRDMVDPRKELTDEVIEGILGFSIIISEANYREIVSILAKLHFFYKADCRS
MYDWLKEICKRRGDPALIEKTASLERFFLGHRERGEVMTY
>Q9HTX3 ~~~~~~Probable binding protein component of ABC iron transporter PA5217~~~
MQASKALLAALALGITGLAQAADEVVVYSSRIDELIKPVFDAYTSKTGVKVKFITDKEAPLMARIKAEGANTPADLLLTV
DAGNLWQAEQMGLLQPFKSATIERNIPSQYRSSTDSWTGLSLRARTIVYSTERVKPEELSTYEALADKQWEGRLCLRTAK
KVYNQSLTGTLIETHGAQKTEEILQGWVNNLATDVFADDNAVIQAVDAGQCDVGIVNTYYYGRLHKQNPNLRVKLFWPNQ
ADRGVHVNLSGIGLTRHAPHPEAAKALVEWMTGPDAQALFAGINQEFPANPQVAPSAEVASWGSFKADSIPVEVAGKRQA
EAIRLMDRAGWN
>P0A9L8 1.5.1.2~~~proC~~~Pyrroline-5-carboxylate reductase~~~COG0345
MEKKIGFIGCGNMGKAILGGLIASGQVLPGQIWVYTPSPDKVAALHDQFGINAAESAQEVAQIADIIFAAVKPGIMIKVL
SEITSSLNKDSLVVSIAAGVTLDQLARALGHDRKIIRAMPNTPALVNAGMTSVTPNALVTPEDTADVLNIFRCFGEAEVI
AEPMIHPVVGVSGSSPAYVFMFIEAMADAAVLGGMPRAQAYKFAAQAVMGSAKMVLETGEHPGALKDMVCSPGGTTIEAV
RVLEEKGFRAAVIEAMTKCMEKSEKLSKS
>P9WHU7 1.5.1.2~~~proC~~~Pyrroline-5-carboxylate reductase~~~COG0345
MLFGMARIAIIGGGSIGEALLSGLLRAGRQVKDLVVAERMPDRANYLAQTYSVLVTSAADAVENATFVVVAVKPADVEPV
IADLANATAAAENDSAEQVFVTVVAGITIAYFESKLPAGTPVVRAMPNAAALVGAGVTALAKGRFVTPQQLEEVSALFDA
VGGVLTVPESQLDAVTAVSGSGPAYFFLLVEALVDAGVGVGLSRQVATDLAAQTMAGSAAMLLERMEQDQGGANGELMGL
RVDLTASRLRAAVTSPGGTTAAALRELERGGFRMAVDAAVQAAKSRSEQLRITPE
>P22008 1.5.1.2~~~proC~~~Pyrroline-5-carboxylate reductase~~~
MSTPRIAFIGAGNMAASLIGGLRAQGVPAAQIRASDPGAEQRAKIAGEFAIDVVESNAEAVADADVVVLSVKPQAMKAVC
QALAPALKPEQLIVSIAAGIPCASLEAWLGQPRPVVRCMPNTPALLRQGASGLYANAQVSAAQCEQAGQLLSAVGIALWL
DDEAQIDAVTAVSGSGPAYFFLLMQAMTDAGEKLGLSRETASRLTLQTALGAAQMALSSEVEPAELRRRVTSPNGTTEAA
IKSFQANGFEALVEQALNAASQRSAELAEQLGQ
>Q7A5G8 1.5.1.2~~~proC~~~Pyrroline-5-carboxylate reductase~~~
MKLVFYGAGNMAQAIFTGIINSSNLDANDIYLTNKSNEQALKAFAEKLGVNYSYDDATLLKDADYVFLGTKPHDFDALAT
RIKPHITKDNCFISIMAGIPIDYIKQQLECQNPVARIMPNTNAQVGHSVTGISFSNNFDPKSKDEINDLVKAFGSVIEVS
EDHLHQVTAITGSGPAFLYHVFEQYVKAGTKLGLEKEQVEESIRNLIIGTSKMIERSDLSMAQLRKNITSKGGTTQAGLD
TLSQYDLVSIFEDCLNAAVDRSIELSNVEDQ
>P27771 1.5.1.2~~~proC~~~Pyrroline-5-carboxylate reductase~~~COG0345
MNVGFLGFGAMGRALAEGLVHAGALQAAQVYACALNQEKLRAQCTSLGIGACASVQELVQKSEWIFLAVKPSQISTVLRD
RQSFQGKVLISLAAGMSCAAYEALFAADPHQGIRHLSLLPNLPCQVARGVIIAEARHTLHHDEHAALLAVLRTVAQVEVV
DTAYFAIAGVIAGCAPAFAAQFIEALADAGVRYGLARDQAYRLAAHMLEGTAALIQHSGVHPAQLKDRVCSPAGSTIRGV
LALEEQGLRRAVIHAVRAALSSS
>P21171 3.4.-.-~~~iap~~~Probable endopeptidase p60~~~COG0791
MNMKKATIAATAGIAVTAFAAPTIASASTVVVEAGDTLWGIAQSKGTTVDAIKKANNLTTDKIVPGQKLQVNNEVAAAEK
TEKSVSATWLNVRSGAGVDNSIITSIKGGTKVTVETTESNGWHKITYNDGKTGFVNGKYLTDKAVSTPVAPTQEVKKETT
TQQAAPAAETKTEVKQTTQATTPAPKVAETKETPVVDQNATTHAVKSGDTIWALSVKYGVSVQDIMSWNNLSSSSIYVGQ
KLAIKQTANTATPKAEVKTEAPAAEKQAAPVVKENTNTNTATTEKKETATQQQTAPKAPTEAAKPAPAPSTNTNANKTNT
NTNTNTNTNNTNTNTPSKNTNTNSNTNTNTNSNTNANQGSSNNNSNSSASAIIAEAQKHLGKAYSWGGNGPTTFDCSGYT
KYVFAKAGISLPRTSGAQYASTTRISESQAKPGDLVFFDYGSGISHVGIYVGNGQMINAQDNGVKYDNIHGSGWGKYLVG
FGRV
>Q05097 ~~~lecA~~~PA-I galactophilic lectin~~~
MAWKGEVLANNEAGQVTSIIYNPGDVITIVAAGWASYGPTQKWGPQGDREHPDQGLICHDAFCGALVMKIGNSGTIPVNT
GLFRWVAPNNVQGAITLIYNDVPGTYGNNSGSFSVNIGKDQS
>P0A921 3.1.1.32~~~pldA~~~Phospholipase A1~~~COG2829
MRTLQGWLLPVFMLPMAVYAQEATVKEVHDAPAVRGSIIANMLQEHDNPFTLYPYDTNYLIYTQTSDLNKEAIASYDWAE
NARKDEVKFQLSLAFPLWRGILGPNSVLGASYTQKSWWQLSNSEESSPFRETNYEPQLFLGFATDYRFAGWTLRDVEMGY
NHDSNGRSDPTSRSWNRLYTRLMAENGNWLVEVKPWYVVGNTDDNPDITKYMGYYQLKIGYHLGDAVLSAKGQYNWNTGY
GGAELGLSYPITKHVRLYTQVYSGYGESLIDYNFNQTRVGVGVMLNDLF
>Q9K0U7 3.1.1.32~~~~~~Putative phospholipase A1~~~
MPTMGAEMNTRNMRYILLTGLLPMASAFGETALQCAALTDNVTRLACYDRIFAAQLPSSAGQEGQESKAVLNLTETVRSS
LDKGEAVIVVEKGGDALPADSAGETADIYTPLSLMYDLDKNDLRGLLGVREHNPMYLMPLWYNNSPNYAPGSPTRGTTVQ
EKFGQQKRAETKLQVSFKSKIAEDLFKTRADLWFGYTQRSDWQIYNQGRKSAPFRNTDYKPEIFLTQPVKADLPFGGRLR
MLGAGFVHQSNGQSRPESRSWNRIYAMAGMEWGKLTVIPRVWVRAFDQSGDKNDNPDIADYMGYGDVKLQYRLNDRQNVY
SVLRYNPKTGYGAIEAAYTFPIKGKLKGVVRGFHGYGESLIDYNHKQNGIGIGLMFNDLDGI
>P0A232 3.1.1.32~~~pldA~~~Phospholipase A1~~~COG2829
MRAILRGLLPATLLPLAAYAQEATIKEVHDAPAVRGSIIANMLQEHDNPFTLYPYDTNYLIYTNTSDLNKEAISTYNWSE
NARKDEVKFQLSLAFPLWRGILGPNSVLGASYTQKSWWQLSNSKESSPFRETNYEPQLFLGFATDYRFAGWTLRDVEMGY
NHDSNGRSDPTSRSWNRLYTRLMAENGNWLVEVKPWYVIGSTDDNPDITKYMGYYQLKIGYHLGEAVLSAKGQYNWNTGY
GGAEVGLSYPVTKHVRLYTQVYSGYGESLIDYNFNQTRVGVGVMLNDIF
>P18952 3.1.1.32~~~phlA~~~Extracellular phospholipase A1~~~
MSMPLSFTSAVSPVAAIPTPRAAAETRTAASLRHAGKSGPVASPSQNTLNAQNLLNTLVGDISAAAPTAAAAPGVTRGQQ
SQEGDYALALLAKDVYSLNGQGAAGFNRLSDSALLGFGIDPASLHDAGSGFQAGIYSNDKQYVLAFAGTNDWRDWLSNVR
QATGYDDVQYNQAVAAAKSAKAAFGDALVIAGHSLGGGLAATAALATGTVAVTFNAAGVSDYTLNRLGIDPAAAKKDAEA
GGIRRYSEQYDMLTSTQESTSLIPDAIGHNITLANNDTLTGIDDWRPSKHLDRSLTAHGIDKVISSMAEQKPWEAKANA
>P76077 1.14.13.149~~~paaA~~~1,2-phenylacetyl-CoA epoxidase, subunit A~~~COG3396
MTQEERFEQRIAQETAIEPQDWMPDAYRKTLIRQIGQHAHSEIVGMLPEGNWITRAPTLRRKAILLAKVQDEAGHGLYLY
SAAETLGCAREDIYQKMLDGRMKYSSIFNYPTLSWADIGVIGWLVDGAAIVNQVALCRTSYGPYARAMVKICKEESFHQR
QGFEACMALAQGSEAQKQMLQDAINRFWWPALMMFGPNDDNSPNSARSLTWKIKRFTNDELRQRFVDNTVPQVEMLGMTV
PDPDLHFDTESGHYRFGEIDWQEFNEVINGRGICNQERLDAKRKAWEEGTWVREAALAHAQKQHARKVA
>P76078 ~~~paaB~~~1,2-phenylacetyl-CoA epoxidase, subunit B~~~COG3460
MSNVYWPLYEVFVRGKQGLSHRHVGSLHAADERMALENARDAYTRRSEGCSIWVVKASEIVASQPEERGEFFDPAESKVY
RHPTFYTIPDGIEHM
>P76079 ~~~paaC~~~1,2-phenylacetyl-CoA epoxidase, subunit C~~~COG3396
MNQLTAYTLRLGDNCLVLSQRLGEWCGHAPELEIDLALANIGLDLLGQARNFLSYAAELAGEGDEDTLAFTRDERQFSNL
LLVEQPNGNFADTIARQYFIDAWHVALFTRLMESRDPQLAAISAKAIKEARYHLRFSRGWLERLGNGTDVSGQKMQQAIN
KLWRFTAELFDADEIDIALSEEGIAVDPRTLRAAWEAEVFAGINEATLNVPQEQAYRTGGKKGLHTEHLGPMLAEMQYLQ
RVLPGQQW
>P76080 ~~~paaD~~~Putative 1,2-phenylacetyl-CoA epoxidase, subunit D~~~COG2151
MQRLATIAPPQVHEIWALLSQIPDPEIPVLTITDLGMVRNVTQMGEGWVIGFTPTYSGCPATEHLIGAIREAMTTNGFTP
VQVVLQLDPAWTTDWMTPDARERLREYGISPPAGHSCHAHLPPEVRCPRCASVHTTLISEFGSTACKALYRCDSCREPFD
YFKCI
>P76081 1.-.-.-~~~paaE~~~1,2-phenylacetyl-CoA epoxidase, subunit E~~~COG1018
MTTFHSLTVAKVESETRDAVTITFAVPQPLQEAYRFRPGQHLTLKASFDGEELRRCYSICRSYLPGEISVAVKAIEGGRF
SRYAREHIRQGMTLEVMVPQGHFGYQPQAERQGRYLAIAAGSGITPMLAIIATTLQTEPESQFTLIYGNRTSQSMMFRQA
LADLKDKYPQRLQLLCIFSQETLDSDLLHGRIDGEKLQSLGASLINFRLYDEAFICGPAAMMDDAETALKALGMPDKTIH
LERFNTPGTRVKRSVNVQSDGQKVTVRQDGRDREIVLNADDESILDAALRQGADLPYACKGGVCATCKCKVLRGKVAMET
NYSLEPDELAAGYVLSCQALPLTSDVVVDFDAKGMA
>P76082 4.2.1.17~~~paaF~~~2,3-dehydroadipyl-CoA hydratase~~~COG1024
MSELIVSRQQRVLLLTLNRPAARNALNNALLMQLVNELEAAATDTSISVCVITGNARFFAAGADLNEMAEKDLAATLNDT
RPQLWARLQAFNKPLIAAVNGYALGAGCELALLCDVVVAGENARFGLPEITLGIMPGAGGTQRLIRSVGKSLASKMVLSG
ESITAQQAQQAGLVSDVFPSDLTLEYALQLASKMARHSPLALQAAKQALRQSQEVALQAGLAQERQLFTLLAATEDRHEG
ISAFLQKRTPDFKGR
>P77467 5.3.3.18~~~paaG~~~1,2-epoxyphenylacetyl-CoA isomerase~~~COG1024
MMEFILSHVEKGVMTLTLNRPERLNSFNDEMHAQLAECLKQVERDDTIRCLLLTGAGRGFCAGQDLNDRNVDPTGPAPDL
GMSVERFYNPLVRRLAKLPKPVICAVNGVAAGAGATLALGGDIVIAARSAKFVMAFSKLGLIPDCGGTWLLPRVAGRARA
MGLALLGNQLSAEQAHEWGMIWQVVDDETLADTAQQLARHLATQPTFGLGLIKQAINSAETNTLDTQLDLERDYQRLAGR
SADYREGVSAFLAKRSPQFTGK
>P76083 1.1.1.-~~~paaH~~~3-hydroxyadipyl-CoA dehydrogenase~~~COG1250
MMINVQTVAVIGSGTMGAGIAEVAASHGHQVLLYDISAEALTRAIDGIHARLNSRVTRGKLTAETCERTLKRLIPVTDIH
ALAAADLVIEAASERLEVKKALFAQLAEVCPPQTLLTTNTSSISITAIAAEIKNPERVAGLHFFNPAPVMKLVEVVSGLA
TAAEVVEQLCELTLSWGKQPVRCHSTPGFIVNRVARPYYSEAWRALEEQVAAPEVIDAALRDGAGFPMGPLELTDLIGQD
VNFAVTCSVFNAFWQERRFLPSLVQQELVIGGRLGKKSGLGVYDWRAEREAVVGLEAVSDSFSPMKVEKKSDGVTEIDDV
LLIETQGETAQALAIRLARPVVVIDKMAGKVVTIAAAAVNPDSATRKAIYYLQQQGKTVLQIADYPGMLIWRTVAMIINE
ALDALQKGVASEQDIDTAMRLGVNYPYGPLAWGAQLGWQRILRLLENLQHHYGEERYRPCSLLRQRALLESGYES
>P76084 3.1.2.-~~~paaI~~~Acyl-coenzyme A thioesterase PaaI~~~COG2050
MSHKAWQNAHAMYENDACAKALGIDIISMDEGFAVVTMTVTAQMLNGHQSCHGGQLFSLADTAFAYACNSQGLAAVASAC
TIDFLRPGFAGDTLTATAQVRHQGKQTGVYDIEIVNQQQKTVALFRGKSHRIGGTITGEA
>P0C7L2 2.3.1.174~~~paaJ~~~3-oxoadipyl-CoA/3-oxo-5,6-dehydrosuberyl-CoA thiolase~~~COG0183
MREAFICDGIRTPIGRYGGALSSVRADDLAAIPLRELLVRNPRLDAECIDDVILGCANQAGEDNRNVARMATLLAGLPQS
VSGTTINRLCGSGLDALGFAARAIKAGDGDLLIAGGVESMSRAPFVMGKAASAFSRQAEMFDTTIGWRFVNPLMAQQFGT
DSMPETAENVAELLKISREDQDSFALRSQQRTAKAQSSGILAEEIVPVVLKNKKGVVTEIQHDEHLRPETTLEQLRGLKA
PFRANGVITAGNASGVNDGAAALIIASEQMAAAQGLTPRARIVAMATAGVEPRLMGLGPVPATRRVLERAGLSIHDMDVI
ELNEAFAAQALGVLRELGLPDDAPHVNPNGGAIALGHPLGMSGARLALAASHELHRRNGRYALCTMCIGVGQGIAMILER
V
>Q9L9C1 6.2.1.30~~~paaK~~~Phenylacetate-coenzyme A ligase~~~
MPVKTPSPGDLEPIEKASQDELRALQLERLKWSVRHAYENVPHYRKAFDAKGVHPDDLKSLADLAKFPFTAKGDLRDNYP
FGMFAVPREKVARVHASSGTTGKPTVVGYTLKDIDTWATVVARSIRASGGRAGDMVHIAYGYGLFTGGLGAHYGAEKLGC
TVVPMSGGQTEKQIQLIQDFKPDIIMVTPSYMLTVLDEMERMGIDPHQTSLKVGIFGAEPWTQAMRAAMEARAGIDAVDI
YGLSEVMGPGVANECIEAKDGPVIWEDHFYPEIIDPHTGEVLPDGSEGELVFTTLTKEAMPVIRYRTRDLTRLLPPTARS
MRRMAKITGRSDDMLIIRGVNLFPTQVEELICKNPKLAPQYLLEVDKDGHMDTLTVKVEINPEANVGRHPEQKEALAKEL
QHDIKTFIGVSAKVHVCEPFAIERVTIGKAKRVVDRRPKE
>P76085 6.2.1.30~~~paaK~~~Phenylacetate-coenzyme A ligase~~~COG1541
MITNTKLDPIETASVDELQALQTQRLKWTLKHAYENVPMYRRKFDAAGVHPDDFRELSDLRKFPCTTKQDLRDNYPFDTF
AVPMEQVVRIHASSGTTGKPTVVGYTQNDIDNWANIVARSLRAAGGSPKDKIHVAYGYGLFTGGLGAHYGAERLGATVIP
MSGGQTEKQAQLIRDFQPDMIMVTPSYCLNLIEELERQLGGDASGCSLRVGVFGAEPWTQAMRKEIERRLGITALDIYGL
SEVMGPGVAMECLETTDGPTIWEDHFYPEIVNPHDGTPLADGEHGELLFTTLTKEALPVIRYRTRDLTRLLPGTARTMRR
MDRISGRSDDMLIIRGVNVFPSQLEEEIVKFEHLSPHYQLEVNRRGHLDSLSVKVELKESSLTLTHEQRCQVCHQLRHRI
KSMVGISTDVMIVNCGSIPRSEGKACRVFDLRNIVGA
>O33469 6.2.1.30~~~paaK~~~Phenylacetate-coenzyme A ligase~~~
MNMYHDADRALLDPMETASVDALRQHQLERLRWSLKHAYDNVPLYRQRFAECGAHPDDLTCLEDLAKFPFTGKNDLRDNY
PYGMFAVPQEEVVRLHASSGTTGKPTVVGYTQNDINTWANVVARSIRAAGGRKGDKVHVSYGYGLFTGGLGAHYGAERLG
CTVIPMSGGQTEKQVQLIRDFQPDIIMVTPSYMLNLADEIERQGIDPHDLKLRLGIFGAEPWTDELRRSIEQRLGINALD
IYGLSEIMGPGVAMECIETKDGPTIWEDHFYPEIIDPVTGEVLPDGQLGELVFTSLSKEALPMVRYRTRDLTRLLPGTAR
PMRRIGKITGRSDDMLIIRGVNVFPTQIEEQVLKIKQLSEMYEIHLYRNGNLDSVEVHVELRAECQHLDEGQRKLVIGEL
SKQIKTYIGISTQVHLQACGTLKRSEGKACHVYDKRLAS
>Q72K16 6.2.1.30~~~~~~Phenylacetate-coenzyme A ligase~~~COG1541
MMYQPELETLPREKLRALQEERLKRLVAYVYERVPFYRRLLDEAGVDPKGFRGLEDLPRIPFTKKTDLRDHYPFGLFAVP
REEVARVHASSGTTGKPTVVGYTKNDLKVFAEVVARSLAAAGARPGMMLHNAYGYGLFTGGLGLHGGAEALGMTVVPVSG
GMTERQVMLIQDFRPEVISCTPSYAQTLAEEFRKRGVSPEELSLEYAVLGAEPWTEAIRKQVDEGLGVKSTNIYGLSEII
GPGVSNECVEERQGSHIWEDHFLPEVVDPDTGEPLPEGKVGVLVFTTLTKEAMPLLRYWTGDLTFLTYEACTCGRTHVRM
GPILGRTDDMLIIRGVNVYPTQVEAVLLAIPEVVPHYQIVVRREGTLDEAELKVEVSEPFFREIGQEVLSDEVVEADHRL
HALRERIARKIKDNVGVTLKVTLLPPGQAPRSEGGKLRRVLDLRK
>P76086 ~~~paaX~~~Transcriptional repressor PaaX~~~COG3327
MSKLDTFIQHAVNAVPVSGTSLISSLYGDSLSHRGGEIWLGSLAALLEGLGFGERFVRTALFRLNKEGWLDVSRIGRRSF
YSLSDKGLRLTRRAESKIYRAEQPAWDGKWLLLLSEGLDKSTLADVKKQLIWQGFGALAPSLMASPSQKLADVQTLLHEA
GVADNVICFEAQIPLALSRAALRARVEECWHLTEQNAMYETFIQSFRPLVPLLKEAADELTPERAFHIQLLLIHFYRRVV
LKDPLLPEELLPAHWAGHTARQLCINIYQRVAPAALAFVSEKGETSVGELPAPGSLYFQRFGGLNIEQEALCQFIR
>P77455 ~~~paaZ~~~Bifunctional protein PaaZ~~~COG1012
MQQLASFLSGTWQSGRGRSRLIHHAISGEALWEVTSEGLDMAAARQFAIEKGAPALRAMTFIERAAMLKAVAKHLLSEKE
RFYALSAQTGATRADSWVDIEGGIGTLFTYASLGSRELPDDTLWPEDELIPLSKEGGFAARHLLTSKSGVAVHINAFNFP
CWGMLEKLAPTWLGGMPAIIKPATATAQLTQAMVKSIVDSGLVPEGAISLICGSAGDLLDHLDSQDVVTFTGSAATGQML
RVQPNIVAKSIPFTMEADSLNCCVLGEDVTPDQPEFALFIREVVREMTTKAGQKCTAIRRIIVPQALVNAVSDALVARLQ
KVVVGDPAQEGVKMGALVNAEQRADVQEKVNILLAAGCEIRLGGQADLSAAGAFFPPTLLYCPQPDETPAVHATEAFGPV
ATLMPAQNQRHALQLACAGGGSLAGTLVTADPQIARQFIADAARTHGRIQILNEESAKESTGHGSPLPQLVHGGPGRAGG
GEELGGLRAVKHYMQRTAVQGSPTMLAAISKQWVRGAKVEEDRIHPFRKYFEELQPGDSLLTPRRTMTEADIVNFACLSG
DHFYAHMDKIAAAESIFGERVVHGYFVLSAAAGLFVDAGVGPVIANYGLESLRFIEPVKPGDTIQVRLTCKRKTLKKQRS
AEEKPTGVVEWAVEVFNQHQTPVALYSILTLVARQHGDFVD
>P28819 2.6.1.85~~~pabA~~~Aminodeoxychorismate/anthranilate synthase component 2~~~COG0512
MILMIDNYDSFTYNLVQYLGELGEELVVKRNDSITIDEIEELSPDFLMISPGPCSPDEAGISLEAIKHFAGKIPIFGVCL
GHQSIAQVFGGDVVRAERLMHGKTSDIEHDGKTIFEGLKNPLVATRYHSLIVKPETLPSCFTVTAQTKEGEIMAIRHNDL
PIEGVQFHPESIMTSFGKEMLRNFIETYRKEVIA
>P00903 2.6.1.85~~~pabA~~~Aminodeoxychorismate synthase component 2~~~COG0512
MILLIDNYDSFTWNLYQYFCELGADVLVKRNDALTLADIDALKPQKIVISPGPCTPDEAGISLDVIRHYAGRLPILGVCL
GHQAMAQAFGGKVVRAAKVMHGKTSPITHNGEGVFRGLANPLTVTRYHSLVVEPDSLPACFDVTAWSETREIMGIRHRQW
DLEGVQFHPESILSEQGHQLLANFLHR
>P28820 2.6.1.85~~~pabB~~~Aminodeoxychorismate synthase component 1~~~COG0147
MAQRRPAGKKIPFQKDSFLQQFEKLAQSRKHHVLLESARGGRYSIAGLDPIATVKGKDGITTIKHGDEMLFKEGDPLRAF
HSWFKTLETETNHEFPDFQGGAIGFLSYDYARYIENFKMLSLDDLETPDIYFLVFDDIAVYDHQEESLWLITHVNGSDQE
TADVKLSELEQMWLTELPAVTSREMKPETAGSFAAPFTEDGFSQAVEKIKQYIASGDVFQVNLSIRQSQSLSVHPYQIYK
TLREVNPSPYMAYLETPDFQIICGSPELLVSKKGKLLETRPIAGTRSRGKTNEEDEALANELIHNEKERAEHVMLVDLER
NDLGRVSRYGSVRVNEFMAIEKYSHVMHIVSNVQGELQDGYDAVDIIHAVFPGGTITGAPKVRTMEIIEELEPTRRGLYT
GSIGWFGYNHDLQFNIVIRTIYATGGQAFMQSGAGVVIDSVPKHEYKESFKKAFAMQRALELSEEETKIR
>P05041 2.6.1.85~~~pabB~~~Aminodeoxychorismate synthase component 1~~~COG0147
MKTLSPAVITLLWRQDAAEFYFSRLSHLPWAMLLHSGYADHPYSRFDIVVAEPICTLTTFGKETVVSESEKRTTTTDDPL
QVLQQVLDRADIRPTHNEDLPFQGGALGLFGYDLGRRFESLPEIAEQDIVLPDMAVGIYDWALIVDHQRHTVSLLSHNDV
NARRAWLESQQFSPQEDFTLTSDWQSNMTREQYGEKFRQVQEYLHSGDCYQVNLAQRFHATYSGDEWQAFLQLNQANRAP
FSAFLRLEQGAILSLSPERFILCDNSEIQTRPIKGTLPRLPDPQEDSKQAVKLANSAKDRAENLMIVDLMRNDIGRVAVA
GSVKVPELFVVEPFPAVHHLVSTITAQLPEQLHASDLLRAAFPGGSITGAPKVRAMEIIDELEPQRRNAWCGSIGYLSFC
GNMDTSITIRTLTAINGQIFCSAGGGIVADSQEEAEYQETFDKVNRILKQLEK
>P28305 4.1.3.38~~~pabC~~~Aminodeoxychorismate lyase~~~COG0115
MFLINGHKQESLAVSDRATQFGDGCFTTARVIDGKVSLLSAHIQRLQDACQRLMISCDFWPQLEQEMKTLAAEQQNGVLK
VVISRGSGGRGYSTLNSGPATRILSVTAYPAHYDRLRNEGITLALSPVRLGRNPHLAGIKHLNRLEQVLIRSHLEQTNAD
EALVLDSEGWVTECCAANLFWRKGNVVYTPRLDQAGVNGIMRQFCIRLLAQSSYQLVEVQASLEESLQADEMVICNALMP
VMPVCACGDVSFSSATLYEYLAPLCERPN
>Q51911 ~~~pab~~~Peptostreptococcal albumin-binding protein~~~
MKLNKKLLMAALAGAIVVGGGVNTFAADEPGAIKVDKAPEAPSQELKLTKEEAEKALKKEKPIAKERLRRLGITSEFILN
QIDKATSREGLESLVQTIKQSYLKDHPIKEEKTEETPKYNNLFDKHELGGLGKDKGPGRFDENGWENNEHGYETRENAEK
AAVKALGDKEINKSYTISQGVDGRYYYVLSREEAETPKKPEEKKPEDKRPKMTIDQWLLKNAKEDAIAELKKAGITSDFY
FNAINKAKTVEEVNALKNEILKAHAGKEVNPSTPEVTPSVPQNHYHENDYANIGAGEGTKEDGKKENSKEGIKRKTAREE
KPGKEEKPAKEDKKENKKKENTDSPNKKKKEKAALPEAGRRKAEILTLAAASLSSVAGAFISLKKRK
>P15557 ~~~acyI~~~Acylase ACY 1 proenzyme~~~
MNAPVPVPRVADFTCEKKPASGSRGMVVTNHPLASAAGAQILLAGGNAIDAAVASLFALTVAEPMMVGILGGGLSHIRLA
DGRHVVIDNLSTAPGKATAEMYECLSDEIGKQRDTRDRQNVVGAKAVAVPGALKGWCEALARFGTLPLAEVLQPAIGLAE
RGFVVTPYLSNCITDNAGDLARDPGLAAMLLPGGKPLQPGMRLVQSDYAASLKLIAAEGPDALYGGKLGRALTDYMAANG
GLIDQADLANYRIELREPIRGSYRGYEIIGPPPPSSSGVHITQMLNILEGYDIGSLGFGSTDAVHLLAEALKIAFADRAV
ATADPAFVKVPVARLIDKAYADERRALIEMEQAKSWTAGLSGGESADTTHVTVADAMGNVVSATQTINGLFGACVQIPGT
GMIANNYMYNFDPHPGRALSIAPGKRVFTSMAPMMALKEGRIAFALGLPGALRIFPSALQAIVNLIDHRMSLQEAVEAPR
VWTEGGVLELEEAIPEAVAQALIARGHKVVRSPRVAGGMNAIAFNPDGTLTGAACWRADGTPVAISGGLARAGARFTI
>Q05053 ~~~acyI~~~Acylase ACY 1 proenzyme~~~
MNAPVPVPRVADFTCEKKPATGSRGMVVTNHPLASAAGAQILLAGGNAIDAAVASLFALTVAEPMMVGILGGGLSHIRLA
DGRHVVIDNLSTAPGKATADMYECLSDEIGKQRDTRDRENVVGAKAVAVPGALKGWCEALARFGTLPLAEVLQPAIGLAE
RGFVVTPYLSNCITDNAADLARDPGLAAMLLPGGQPLQPGMRLIQSDYAASLKLIAAEGPEALYGGKLGRALTDYMAANG
GLIDQADLSNYRIELREPIRGSYRGYEIIGPPPPSSSGVHIAQMLNILEGYDIGALGFGSTDAVHLLAEALKIAFADRAV
ATADPAFVKVPVARLIDKAYADERRALIAMEQAKSWTAGLSGGESADTTHVTVADAMGNVVSATQTINGLFGACVQTPGT
GMIANNYMYNFDPHPGRALSIAPGKRVFTSMAPMMAVKEGRLAFALGLPGALRIFPSALQAIVNLIDHRMSLQEAVEAPR
VWTEGGVLELEEAIPESVAQALIARGHKVVRSPRVAGGMNAIAFNPDGTLTGAACWRADGTPVAISGGLARAGARFTI
>P15558 3.5.1.11~~~acyII~~~Penicillin acylase 2 proenzyme~~~
MTMAAKTDREALQAALPPLSGSLSIPGLSAPVRVQRDGWGIPHIKASGEADAYRALGFVHAQDRLFQMELTRRKALGRAA
EWLGAEAAEADILVRRLGMEKVCRRDFEALGAEAKDMLRAYVAGVNAFLASGAPLPIEYGLLGAEPEPWEPWHSIAVMRR
LGLLMGSVWFKLWRMLALPVVGAANALKLRYDDGGQDLLCIPPGVEAERLEADLAALRPAVDALLKAMGGDASDAAGGGS
NNWAVAPGRTATGRPILAGDPHRVFEIPGMYAQHHLACDRFDMIGLTVPGVPGFPHFAHNGKVAYCVTHAFMDIHDLYLE
QFAEDGRTARFGNEFEPVAWRRDRIAVRGGADREFDIVETRHGPVIAGDPLEGAALTLRSVQFAETDLSFDCLTRMPGAS
TVAQLYDATRGWGLIDHNLVAGDVAGSIGHLVRARVPSRPRENGWLPVPGWSGEHEWRGWIPHEAMPRVIDPPGGLIVTA
NNRVVADDHPDYLCTDCHPPYRAERIMERLVASPAFAVDDAAAIHADTLSPHVGLLRARLEALGIQGSLPAEELRQTLIA
WDGRMDAGSQAASAYNAFRRALTRLVTARSGLEQAIAHPFAAVPPGVSPQGQVWWAVPTLLRNDDAGMLKGWSWDEALSE
ALSVATQNLTGRGWGEEHRPRFTHPLSAQFPAWAALLNPVSRPIGGDGDTVLANGLVPSAGPEATYGALSRYVFDVGNWD
NSRWVVFHGASGHPASPHYADQNAPWSDCAMVPMLYSWDRIAAEAVTSQELVPA
>Q6FAW0 ~~~aceI~~~Short-chain diamines transporter~~~COG4125
MLISKRRMIHALSYEVILLVIIAIALSFIFDVPLEVTGTLGIVMAVTSVFWNMIFNHFFEKFERKHQLERTVKIRILHAI
GFEGGLMLVTIPMVAYAMNMSLWQAIVLDFGLTMCILVYTFIFQWCYDTIEKRLGYTPRHS
>P0DUT9 ~~~aceI~~~Short-chain diamines transporter~~~
MLISKRRLIHAISYEGILLVIIAIALSFIFNMPMEVTGTLGVFMAVVSVFWNMIFNHYFEKVEHKYNWERTIPVRILHAI
GFEGGLLIATVPMIAYMMQMTVIDAFILDIGLTLCILVYTFIFQWCYDHIEDKFFPNAKAASLH
>P06875 3.5.1.11~~~pac~~~Penicillin G acylase~~~
MKNRNRMIVNCVTASLMYYWSLPALAEQSSSEIKIVRDEYGMPHIYANDTWHLFYGYGYVVAQDRLFQMEMARRSTQGTV
AEVLGKDFVKFDKDIRRNYWPDAIRAQIAALSPEDMSILQGYADGMNAWIDKVNTNPETLLPKQFNTFGFTPKRWEPFDV
AMIFVGTMANRFSDSTSEIDNLALLTALKDKYGVSQGMAVFNQLKWLVNPSAPTTIAVQESNYPLKFNQQNSQTAALLPR
YDLPAPMLDRPAKGADGALLALTAGKNRETIAAQFAQGGANGLAGYPTTSNMWVIGKSKAQDAKAIMVNGPQFGWYAPAY
TYGIGLHGAGYDVTGNTPFAYPGLVFGHNGVISWGSTAGFGDDVDIFAERLSAEKPGYYLHNGKWVKMLSREETITVKNG
QAETFTVWRTVHGNILQTDQTTQTAYAKSRAWDGKEVASLLAWTHQMKAKNWQEWTQQAAKQALTINWYYADVNGNIGYV
HTGAYPDRQSGHDPRLPVPGTGKWDWKGLLPFEMNPKVYNPQSGYIANWNNSPQKDYPASDLFAFLWGGADRVTEIDRLL
EQKPRLTADQAWDVIRQTSRQDLNLRLFLPTLQAATSGLTQSDPRRQLVETLTRWDGINLLNDDGKTWQQPGSAILNVWL
TSMLKRTVVAAVPMPFDKWYSASGYETTQDGPTGSLNISVGAKILYEAVQGDKSPIPQAVDLFAGKPQQEVVLAALEDTW
ETLSKRYGNNVSNWKTPAMALTFRANNFFGVPQAAAEETRHQAEYQNRGTENDMIVFSPTTSDRPVLAWDVVAPGQSGFI
APDGTVDKHYEDQLKMYENFGRKSLWLTKQDVEAHKESQEVLHVQR
>P07941 3.5.1.11~~~pac~~~Penicillin G acylase~~~
MKNRNRMIVNGIVTSLICCSSLSALAASPPTEVKIVRDEYGMPHIYADDTYRLFYGYGYVVAQDRLFQMEMARRSTQGTV
SEVLGKAFVSFDKDIRQNYWPDSIRAQIASLSAEDKSILQGYADGMNAWIDKVNASPDKLLPQQFSTFGFKPKHWEPFDV
AMIFVGTMANRFSDSTSEIDNLALLTAVKDKYGNDEGMAVFNQLKWLVNPSAPTTIAARESSYPLKFDLQNTQTAALLVP
RYDQPAPMLDRPAKGTDGALLAVTAIKNRETIAAQFANGANGLAGYPTTSNMWVIGKNKAQDAKAIMVNGPQFGWYAPAY
TYGIGLHGAGYDVTGNTPFAYPGLVFGHNGTISWGSTAGFGDDVDIFAEKLSAEKPGYYQHNGEWVKMLSRKETIAVKDG
QPETFTVWRTLDGNVIKTDTRTQTAYAKARAWAGKEVASLLAWTHQMKAKNWPEWTQQAAKQALTINWYYADVNGNIGYV
HTGAYPDRQPGHDPRLPVPDGKWDWKGLLSFDLNPKVYNPQSGYIANWNNSPQKDYPASDLFAFLWGGADRVTEIDTILD
KQPRFTADQAWDVIRQTSLRDLLRLFLPALKDATANLAENDPRRQLVDKLASWDGENLVNDDGKTYQQPGSAILNAWLTS
MLKRTVVAAVPAPFGKWYSASGYETTQDGPTGSLNISVGAKILYEALQGDKSPIPQAVDLFGGKPEQEVILAALDDAWQT
LSKRYGNDVTGWKTPAMALTFRANNFFGVPQAAAKEARHQAEYQNRGTENDMIVFSPTSGNRPVLAWDVVAPGQSGFIAP
DGKADKHYDDQLKMYESFGRKSLWLTPQDVDEHKESQEVLQVQR
>P12256 3.5.1.11~~~~~~Penicillin acylase~~~
MLGCSSLSIRTTDDKSLFARTMDFTMEPDSKVIIVPRNYGIRLLEKENVVINNSYAFVGMGSTDITSPVLYDGVNEKGLM
GAMLYYATFATYADEPKKGTTGINPVYVISQVLGNCVTVDDVIEKLTSYTLLNEANIILGFAPPLHYTFTDASGESIVIE
PDKTGITIHRKTIGVMTNSPGYEWHQTNLRAYIGVTPNPPQDIMMGDLDLTPFGQGAGGLGLPGDFTPSARFLRVAYWKK
YTEKAKNETEGVTNLFHILSSVNIPKGVVLTNEGKTDYTIYTSAMCAQSKNYYFKLYDNSRISAVSLMAENLNSQDLITF
EWDRKQDIKQLNQVNVMS
>Q60136 3.5.1.11~~~pac~~~Penicillin G acylase~~~
MKTKWLISVIILFVFIFPQNLVFAGEDKNEGVKVVRDNFGVPHLYAKNKKDLYEAYGYVMAKDRLFQLEMFRRGNEGTVS
EIFGEDYLSKDEQSRRDGYSNKEIKKMIDGLDRQPKELIAKFAEGISRYVNEALKDPDDKLSKEFHEYQFLPQKWTSTDV
VRVYMVSMTYFMDNHQELKNAEILAKLEHEYGTEVSRKMFDDLVWKNDPSAPTSIVSEGKPKRDSSSQSLQILSSAVIKA
SEKVGKERENFVQTSEELGLPLKIGSNAAIVGSEKSATGNALLFSGPQVGFVAPGFLYEVGLHAPGFDMEGSGFIGYPFI
MFGANNHFALSATAGYGNVTDIFEEKLNAKNSSQYLYKGKWRDMEKRKESFTVKGDNGEKKTVEKIYYRTVHGPVISRDE
TNKVAYSKSWSFRGTEAQSMSAYMKANWAKNLKEFENAASEYTMSLNWYYADKKGDIAYYHVGRYPVRNSKIDERIPTPG
TGEYEWKGFIPFKENPHVINPKNGYVVNWNNKPSKEWVNGEYSFYWGEDNRVQQYINGMEARGKVTLEDINEINYTASFA
QLRANLFKQLLIDVLDKNKSTNGNYIYLIEKLEEWNNLKEDENKDGYYDAGIAAFFDEWWNNLHDKLFMDELGDFYGITK
EITDHRYGASLAYKILNKESTNYKWVNVDQEKIIMESTNEVLAKLQSEKGLKAEKWRMPIKTMTFGEKSLIGIPHGYGSM
TPIIEMNRGSENHYIEMTPTGPSGFNITPPGQIGFVKKDGTISDHYDDQLVMFAEWKFKPYLFNKKDINKAAKNVSALNM
SK
>P31956 3.5.1.11~~~pac~~~Penicillin G acylase~~~
MKMKWLISVIILFVFIFPQNLVFAGEDKNEGVKVVRDNFGVPHLYAKNKKDLYEAYGYVMAKDRLFQLEMFRRGNEGTVS
EIFGEDYLSKDEQSRRDGYSNKEIKKMIDGLDRQPRELIAKFAEGISRYVNEALKDPDDKLSKEFHEYQFLPQKWTSTDV
VRVYMVSMTYLWIITRELKNAEILAKLEHEYGTEVSRKMFDDLVWKNDPSAPTSIVSEGKPKRESSSQSLQKLSSAVIKA
SEKVGKERENFVQSSEELGLPLKIGSNAAIVGSEKSATGNALLFSGPQVGFVAPGFLYEVGLHAPGFDMEGSGFIGYPFI
MFGANNHFALSATAGYGNVTDIFEEKLNTKNSSQYLYKGKWRDMEKRKESFTVKGDNGEKKTVEKIYYRTVHGPVISRDE
TNKVAYSKYVSFRGTEAQSMSAYMKANWAKNLKEFENAASEYTMSLNWYYADKKGDIAYYHVGRYPVRNNKIDERIPTPG
TGEYEWKGFIPFKENPHVINPKNGYVVNWNNKPSKEWVNGEYSYYWGEDNRVQQYINGMEARGKVTLEDINEINYTASFA
QLRANLFKPLLIDVLDKNKSTNGNYTYLIEKLEEWNNLKEDENKDGYYDAGIAAFFDEWWNNLHDKLFMDELGDFYGITK
EITDHRYGASLAYKNISKESTNYKWVNVDQEKIIMESTNEVLAKLQSEKGLKAEKWRMPIKTMTFGEKSLIGIPHGYGSM
TPIIEMNRGSENHYIEMTPKGPSGFNITPPGQIGFVKKDGTISDHYDDQLVMFAEWKFKPYLFNKKDIYKAATNVSALNM
SK
>P11657 ~~~pac~~~Major cell-surface adhesin PAc~~~COG3064
MKVKKTYGFRKSKISKTLCGAVLGTVAAVSVAGQKVFADETTTTSDVDTKVVGTQTGNPATNLPEAQGSASKEAEQSQTK
LERQMVHTIEVPKTDLDQAAKDAKSAGVNVVQDADVNKGTVKTPEEAVQKETEIKEDYTKQAEDIKKTTDQYKSDVAAHE
AEVAKIKAKNQATKEQYEKDMAAHKAEVERINAANAASKTAYEAKLAQYQADLAAVQKTNAANQAAYQKALAAYQAELKR
VQEANAAAKAAYDTAVAANNAKNTEIAAANEEIRKRNATAKAEYETKLAQYQAELKRVQEANAANEADYQAKLTAYQTEL
ARVQKANADAKATYEAAVAANNAKNAALTAENTAIKQRNENAKATYEAALKQYEADLAAVKKANAANEADYQAKLTAYQT
ELARVQKANADAKAAYEAAVAANNAANAALTAENTAIKKRNADAKADYEAKLAKYQADLAKYQKDLADYPVKLKAYEDEQ
TSIKAALAELEKHKNEDGNLTEPSAQNLVYDLEPNANLSLTTDGKFLKASAVDDAFSKSTSKAKYDQKILQLDDLDITNL
EQSNDVASSMELYGNFGDKAGWSTTVSNNSQVKWGSVLLERGQSATATYTNLQNSYYNGKKISKIVYKYTVDPKSKFQGQ
KVWLGIFTDPTLGVFASAYTGQVEKNTSIFIKNEFTFYHEDEKPINFDNALLSVTSLNREHNSIEMAKDYSGKFVKISGS
SIGEKNGMIYATDTLNFKQGEGGSRWTMYKNSQAGSGWDSSDAPNSWYGAGAIKMSGPNNHVTVGATSATNVMPVSDMPV
VPGKDNTDGKKPNIWYSLNGKIRAVNVPKVTKEKPTPPVKPTAPTKPTYETEKPLKPAPVAPNYEKEPTPPTRTPDQAEP
NKPTPPTYETEKPLEPAPVEPSYEAEPTPPTRTPDQAEPNKPTPPTYETEKPLEPAPVEPSYEAEPTPPTPTPDQPEPNK
PVEPTYEVIPTPPTDPVYQDLPTPPSDPTVHFHYFKLAVQPQVNKEIRNNNDINIDRTLVAKQSVVKFQLKTADLPAGRD
ETTSFVLVDPLPSGYQFNPEATKAASPGFDVTYDNATNTVTFKATAATLATFNADLTKSVATIYPTVVGQVLNDGATYKN
NFTLTVNDAYGIKSNVVRVTTPGKPNDPDNPNNNYIKPTKVNKNENGVVIDGKTVLAGSTNYYELTWDLDQYKNDRSSAD
TIQKGFYYVDDYPEEALELRQDLVKITDANGNEVTGVSVDNYTNLEAAPQEIRDVLSKAGIRPKGAFQIFRADNPREFYD
TYVKTGIDLKIVSPMVVKKQMGQTGGSYENQAYQIDFGNGYASNIVINNVPKINPKKDVTLTLDPADTNNVDGQTIPLNT
VFNYRLIGGIIPANHSEELFEYNFYDDYDQTGDHYTGQYKVFAKVDITLKNGVIIKSGTELTQYTTAEVDTTKGAITIKF
KEAFLRSVSIDSAFQAESYIQMKRIAVGTFENTYINTVNGVTYSSNTVKTTTPEDPADPTDPQDPSSPRTSTVIIYKPQS
TAYQPSSVQETLPNTGVTNNAYMPLLGIIGLVTSFSLLGLKAKKD
>A8AZP4 ~~~padA~~~Platelet adherence protein A~~~COG2304
MKDFLKKVLILFTVLLMSMPSSVLNLGTSVVRADDPLNIETRRIDEHTTITQNGCYRKIEKTDATDWTVPRKPIDLVILQ
DASGSFRTTIPSVKNALKRLTTYVSPEQYDENDPHLVKTDDPRTTDRVFVASYQGLDQVRYFENNDFSGNPANVYTDANS
TGKNYTYGNSGLTSDQNKVHNFIDNIAVDGGTPTVPAIDDTIAQYNRVKGNMENGRKTVFLLVTDGVANGYRLPGTNTVV
MDKSWTRTDAIQKAWRVDSYPEAAQDIIGRANELKAAGNQLKAAVGSEGSVVVGFWERVDNFTEKYYQYGPAYLNGFGNT
INIGDNRSVQAIFHDALQSMASPDKVVNGKNVSFYVNEQNNIDVFSQKILESVAAALVKDDITGEFDITEGYKVDAIRIN
GKKIVPKVTDPSKEIRGTITQTGNKVKISVPDSVFNPGKNSFDYDLSKEARAPETDEDSEVDPPENYVPEKEEITVPELT
GKFKAGDFETRQIGGRNQTVEVQKLEYCYPSATKTVKDADASNDIGVIPDPLELTKKPSYSAQLSKKDEEFTYTVDYNFN
NVPYEFEKNVMLTDPIDYRLEVVSHSAQGPDGQSWPTRVVTQQDAGGNSQSVVVADVPPQGKDYNYLIMKKAKLKMTVRL
KEEYRKNQASKAYLAILQNNNGYGLVNQGNIMWNGEDDSPNQDAHAKTKDKASTIRRSNPIYVKPPLDTEVDKKVNEKEH
EGLQADGEEFEYKVTAPWPGIADKFTLTDTVVDELEIVPNSAKVTVAGKSYNALTKAISINGQTIDITLDKAQLTSLNRL
ISRRGGSEVQEIELIFKAKIRPGADLSKYKKNGAVNIPNTADVILNDKKKTSKEVTVTPPKPKEPTVSKKINNTLDSLVT
FDGQPYTYNITTAVPSDVAGYKKFVISDKLDADLEFDGQASISGPLADVFEIQTNGQTVTATVKEGKFKELAKYSFVELT
IPAKVKAGVTGKTIENKAKISFTNENNVAKEVESNPVTVTPPPVTKKINENLDHLDIATGQPYKYNVKTTLPSDITSYKE
FVITDTLEDELSVINEGTDKPVISGPAAEFFDVTVSGQKVTATMKNFAGASALAGQEIELVIPAKINDGVTRSNIPNKAT
FSFKDKNDHKGEKETIPVTVTPPTEPNVSKKINGDQDNATIAAETDFTYNIKTTLPNDIDTYKSFAITDTLDENLGVVNP
EPSISEEAKKFFDITVSGNTVTATMKDFAKASALANKEIELVIHAKVKKESVLPEIPNTAKITYTNKNNESKEKETEPVK
VTPPPITKKVNGKDQEDLASLTSTFKYTVDSKVPIVADKFVLSDTLEEVLTFDGDATVTIDGQTVTDVTVAKKDQKLTVT
FDKDQVKKYAGKAVQVAFDAKIKSGYTVDQLVAKYPNGDKAAIPNKASFVVNDNPETEKFSNPVTVTPPPPNTPEIEKKV
NGADSYNLQTRLEEFTYSLNTAMPTNATEFTVTDELKSVLEFAGKKGDVQVKIDGKAANDQATISTDKNTLTVAFAEKAV
KANAGKSIEVTFKAKIREGANLLDYLVPGQGIRIPNKASYDIDHNPKFHKDSNEVPVTPPSPEQPPIEKDVNDKAEATLE
ARDEEFTYHVKTKIPYEATAFNITDTLKEVLDFSGEKGQAEATVDGKKLSDDHIAINGQTITVTLNQEELKANADKEIKL
TFKAKIRPNANLAAYVVGDKVVINNQASYNVDLPDNPGVHKDSNIVPVTPPSPEKPEIEKTVNDAKEATLANRDEIFTYK
VKTKVPFDATAFSIDDTIKDVLEFADAGSATLNGEALEADRISIADQKITLTLTEDQVKNNGGKEVVLTFKAKIRQGANL
SGYIEKGKTVINNQASYNAAFPNDPNFHKDSNIVPVTPPNPENPPIEKKVNEAESANLGARDEEFTYTIDTTVPLDVTGF
AVYDTIEKVLEFSGENGQASATVDGQPLDASHITIKGQKITVKLTEDEAKALGGKAVHVSFKAKIKAGANLSDYIEKDGT
TRIYNTAKYNFNNDPGTEQSSKPVPVIPPTPTEPELKKEVNGKEAETLANRDDVFTYTVKTTVPQDATAFSISDSLVPVL
EFAGEDAEASLTLNGEKLDAKQIKLKDQTISAELTEAQVKANGGKEVVLNFKAKIREGANLADYIEADGVTRVPNKASYV
ANFPHRPKVEKDSNIVPVTPPSPENPPVEKKVNNKPSATLDSRDEEFTYTIDTKVPVDATGFKITDELKDVLEFSGKKGQ
AEVTVDGDKDVIEDSQITVDKQVLTVTLTKDQVKKYGNKAVHVSFKAKIRKNVSLAGYIEADGVTRIPNIAKYIINDDPK
TEKSTEPVPVIPPSPEEPGIKKEVNGQPEATLKERYEEFTYKVTTSVPQDATAFSVSDTLVPVLEFSGEKGQATATLDGQ
EIDANRINVADQTISMALTEDEVKANGGKEVTLTFKAKIREGANLSAYIEKGKTSIPNTASYTAGFPNRPEIHKDSNRVP
VTPPTPEEPEIKKDVNGKEEETLANRNDEFTYHINTKVPFDATAFSINDELKDVLEFADGTGRATASLNGQALDADRISI
NGQTITVNLTEEQVKNNGGKDVNLTFTAKIRQGVNLSGYIKDGKTSIPNKASYRVDFPNNPGVTKDSNEVPVTPPSPENP
PIEKKVNEAESANLGARDEEFTYTIDTTVPLDVTGFAVYDTIEKVLEFSGENGQASATVDGQPLDASHITIKGQKITVKL
TEDEAKALGGKAVHVSFKAKIKAGANLSDYIEKDGTTRIYNTAKYNFNNDPGTEQSSKPVPVIPPTPTEPELKKEVNGKE
AETLANRDDVFTYTVKTTVPQDATAFSISDKLEDVLEFAGESSATLAGEDLKADQITTDGQIIKLTLTEDQVKANGGKEV
VLNFKAKIREGANLSAYMKADKAEVPNKASYTVGFPNKPAVTKDSNEVPVTPPSPEQPPIEKDVNSKPSETIADRTEEFT
YNIHTTMPQDATGFTVTDELKDVLEFAGDVQVTLGGKKADAAVAKNGQTLEVTFPEETVKANGGKKVQVTFKAKIKADAD
LTPYETANSYSVPNTASYLINNNPTSKKETKPVTVEVPKQPGPEVTKKINRTLDHLDVDRDVPYMYNVNTQIPKDIRLYK
EFTVTDTLEPVLEITGTPVAYVDGYATDAVETKVEGNTVTVTVKDFARISGYKEIQLYIPAKLKADSDLSAYENQTVPNK
ATIAFKDSNGKNGTKESNPVTVRPRDPEKPEEPKPNEPAKTVGPADGSNPSTAYRLKELKEGFRFDVTAKVPTDPVDESG
NPIKDAQGRDVKTELNSFTVTDELEKVLKVDRVAVKVEENKVAEAIAKITAKIEKAESDLKELEGKETNGTFAKKLAEAE
KKVEELTAQLAAAKEKAAAAPATPAPASDSDAGNATATPAPADNNAEVAALEESLKAAQAELEQLKADGAKAGNLATPEE
QKVEQDKLNKNLEQLKESKEKLEKALEAFTTVNDKGEITDEALAKIAKVTVEGQKVTVEVTDKAVLEALKGSTFRVIIYS
SIKDGADLSSYLNKENNETKIPNKATVTFNDKPKVTNTVNVYPPEPTTPPQTPPHTPPTTPGTPPPTTPDTPPAPKGDLP
PAPTPEPEKPKNILPKTGTSATMVNEVIIGMILVLMGLLLRRKPKH
>O07006 4.1.1.102~~~padC~~~Phenolic acid decarboxylase PadC~~~COG3479
MENFIGSHMIYTYENGWEYEIYIKNDHTIDYRIHSGMVAGRWVRDQEVNIVKLTEGVYKVSWTEPTGTDVSLNFMPNEKR
MHGIIFFPKWVHEHPEITVCYQNDHIDLMKESREKYETYPKYVVPEFAEITFLKNEGVDNEEVISKAPYEGMTDDIRAGR
L
>Q8L3B3 1.2.1.58~~~padE~~~NADH-dependent phenylglyoxylate dehydrogenase subunit gamma~~~
MYEVRFHGRGGQGSVMASGMLAAAMVEEGKYAVSIPSFGFERRGAPVVSFLRMSDREIRQLTNIYQPDCIVCVDPTLTKS
VDIFAGMKAGGTLVQATHHPLSELALPDCVSTVGLLDAVKIALEIFKRPITNTLMLGAFAKTTGVVSLESLKRALEDSEF
RDAGLAQNMTALERGYAEVAVHHIERRAAA
>Q8L3B2 1.2.1.58~~~padF~~~NADH-dependent phenylglyoxylate dehydrogenase subunit delta~~~
MSRHQSYPLFNLEQAGVPDDLCPVATVVSPMLPGDWRSMRPVVDRDKCVKCAVCWLYCPVQCVEEHAAWFDFNLKTCKGC
GICANECPQRRSR
>Q8KQL6 2.4.1.208~~~dgs~~~Processive diacylglycerol alpha-glucosyltransferase~~~
MKVLLYSQKQSMLKKSGIGRAFYHQKRALEAVGIEYTTDPKDTYDLVHVNIAHSNKIKKFRKKYPVIVHGHSTVQDFRRS
FAFWRVIAPFFYKHLQNIYGIADLIITPTRYSKFLIESMHVVKSPVVALSNGIDLDAYEYKQENVDAFRKHFDLEPNQKV
VIGVGLLFERKGIHDFIEVARTMPNVTFIWFGNLSKLATTHFIRKRIKNKPKNMIMPGYVDGAVIKGAFSGADCVFFPSY
EETEGIVVLEGLASKTPVVLRDIPVYYDWLFHKEHVLKGHNNFEFSKLIEKVLHEDQTEMIENGYKIVQDRSIEKIGEGL
KQAYQEVIKIKR
>Q8L3B1 1.2.1.58~~~padG~~~NADH-dependent phenylglyoxylate dehydrogenase subunit alpha~~~
MSTATLEKAVAAKAPRKQKVILAEGNEAAALGVALARPDMVSVYPITPQSSLVEHVAKLIADGRMDADIVDAEGEHSVLS
VLQGGALAGARTYTATCGPGLAFMFEPYFRTSGMRLPIVLTIVTRDGITPQSVWGGHQDAMTVREAGWIQVYCESVQEVL
DTTVMAFKIAEHHDVMLPVNVCLDGNYLSYGASRVELPDQAVVDEFMGEKNVNWHVALDPLRPMAVDPLTGGTTGKGPQT
FVRYRKGQCRGMQNALSVIEEVHADWAKRIGRSFAPLVEEYRLDDAEFAIMTLGSMTGAAKDAVDEAREAGKKIGLIKIK
TFSPFPVEALKKALGKVKALGVIDRSVGFRWNCGPMYQETLGVLYRLGRHIPSISYIGGLAGADITIPHVHRVIDETEAL
LNGAVAPTEPVWLNEKD
>Q8L3B0 1.2.1.58~~~padH~~~NADH-dependent phenylglyoxylate dehydrogenase subunit epsilon~~~
MDRAIEHTKYLIAGSSHAALEAINAIRMHDAEGPITVVTRDAHLPYSPTVLPYVVSGKSAPERIFLRDDDFFARNKVAYR
PKAALKALHADRNTAELADGSSVVYEKLLLATGASPAIPPIPGIDTVSYHVLRTLDDALKLRGAIAESKQAVVLGAGLVG
MHAAENLVKAGATVTIVEMSEQLTSGYFDKVAADMIEQAFRDAGGKIMTGSRVVRLEPTAAGAKLTLENGTTLEADLLLV
ATGVKPEMDYLNGSGVEHAQGILVDDRMQTTAENVWAAATAQARGFFTGTKVMNAILPDATIQGRVAGMAMAGDPGVKDY
AGAVPLNTYHFFGRHAISVGSSTVPEGGEVVTRFDEKTGRYLKAIFAADGPLTGIFGVNEFFDGGVMAQLILRRTDLTPL
RSRFVANPLAVGREIMSQTWR
>Q8L3A9 1.2.1.58~~~padI~~~NADH-dependent phenylglyoxylate dehydrogenase subunit beta~~~
MGRAYSTIAFDPAKCDGCGDCMTACAQAKTGTDDIARSRIQIYGREGAADKTFELALCRQCADPKCVTVCPAGALNKDGT
SGVIGWDATKCVDCLLCTVGCAYAGIALDEATGHVAKCDTCDGNPACVPACPHGALKHITTANIYNEVGDWEDLFAPGLA
GCQGCNTELLMRHTLRRVGPDTVLATPPGCVPGMGSVGFNGTTGTKVPVFHPLLTNTAAMLAGIKRQYKRVGRDVQALAI
AGDGGASDVGFQSLSGRAERGEQMLFMVVDNEGYMNTGMQRSSCTPYGAWTSTTPVGETSRGKTQDAKNLPLIMVNHRCA
YVATASTAYMEDLYDKLDKAIAASKNGFAYLHVYSPCTTAWRFPSNLNMEVARKAVETNFVMLWEYTPQDGLHFTKPVDD
PLPVTDYLKAMGRFRHLTPEQVEHIQKKVVENQKFVERMTEHAHVG
>P94404 2.5.1.129~~~bsdB~~~Probable UbiX-like flavin prenyltransferase~~~COG0163
MKAEFKRKGGGKVKLVVGMTGATGAIFGVRLLQWLKAAGVETHLVVSPWANVTIKHETGYTLQEVEQLATYTYSHKDQAA
AISSGSFDTDGMIVAPCSMKSLASIRTGMADNLLTRAADVMLKERKKLVLLTRETPLNQIHLENMLALTKMGTIILPPMP
AFYNRPRSLEEMVDHIVFRTLDQFGIRLPEAKRWNGIEKQKGGA
>P69772 2.5.1.129~~~ecdB~~~Probable UbiX-like flavin prenyltransferase~~~COG0163
MKLIVGMTGATGAPLGVALLQALREMPNVETHLVMSKWAKTTIELETPYSARDVAALADFSHNPADQAATISSGSFRTDG
MIVIPCSMKTLAGIRAGYADGLVGRAADVVLKEGRKLVLVPREMPLSTIHLENMLALSRMGVAMVPPMPAFYNHPETVDD
IVHHVVARVLDQFGLEHPYARRWQGLPQARNFSQENE
>Q4R101 2.5.1.129~~~shdB~~~Probable UbiX-like flavin prenyltransferase~~~
MRLVIGISGASGVVLGYHMLKVLRFFPECETHLVISEGAKLTFGLETDLKIEDVEKLADFVYSNTNLAASISSGSFKTDG
MIVIPCSMKTLSGIATGYAENLLIRAADVCLKENRKVVLVPREMPFGKLHIRNMKEASDLGCVIIPPLLTFYNNPQTIEE
QINHIIGKILMQFGLEHEKFKAWEGTKDD
>Q9X696 2.5.1.129~~~vdcB~~~Probable UbiX-like flavin prenyltransferase~~~
MRLVVGMTGATGAPFGVRLLENLRQLPGVETHLVLSRWARTTIEMETGLSVAEVSALADVTHHPEDQGATISSGSFRTDG
MVIVPCSMKTLAGIRTGYAEGLVARAADVVLKERRRLVLVPRETPLSEIHLQNMLELARMGVQLVPPMPAFYNNPQTVDD
IVDHVVARILDQFDLPAPAARRWAGMRAARAAARSFGDAA
>P94443 ~~~padR~~~Negative transcription regulator PadR~~~COG1695
MRVLKYAILGLLRKGELSGYDITSYFKEELGQFWSAKHSQIYPELKKLTDEGFITFRTTIQGTKLEKKMYTLTDSGKQEL
HDWLIRHQPIPETVKDEFMLKAYFISCLSRQEASDLFKDQLQKRQAKLSDLQGSYEKLMASAEPMSFSSPDFGHYLVLTK
ALEREKNYVSWLESILAMIDKD
>Q9RQJ2 3.5.3.-~~~~~~Peptidylarginine deiminase~~~COG2957
MKKLLQAKALILALGLFQLPAIAQTQMQADRTNGQFATEEMQRAFQETNPPAGPVRAIAEYERSAAVLVRYPFGIPMELI
KELAKNDKVITIVASESQKNTVITQYTQSGVNLSNCDFIIAKTDSYWTRDYTGWFAMYDTNKVGLVDFIYNRPRPNDDEF
PKYEAQYLGIEMFGMKLKQTGGNYMTDGYGSAVQSHIAYTENSSLSQAQVNQKMKDYLGITHHDVVQDPNGEYINHVDCW
GKYLAPNKILIRKVPDNHPQHQALEDMAAYFAAQTCAWGTKYEVYRALATNEQPYTNSLILNNRVFVPVNGPASVDNDAL
NVYKTAMPGYEIIGVKGASGTPWLGTDALHCRTHEVADKGYLYIKHYPILGEQAGPDYKIEADVVSCANATISPVQCYYR
INGSGSFKAADMTMESTGHYTYSFTGLNKNDKVEYYISAADNSGRKETYPFIGEPDPFKFTCMNETNTCTVTGAAKALRA
WFNAGRSELAVSVSLNIAGTYRIKLYNTAGEEVAAMTKELVAGTSVFSMDVYSQAPGTYVLVVEGNGIRETMKILK
>P0AE45 ~~~paeA~~~Polyamine export protein~~~COG1253
MLNSILVILCLIAVSAFFSMSEISLAASRKIKLKLLADEGNINAQRVLNMQENPGMFFTVVQIGLNAVAILGGIVGDAAF
SPAFHSLFSRYMSAELSEQLSFILSFSLVTGMFILFADLTPKRIGMIAPEAVALRIINPMRFCLYVCTPLVWFFNGLANI
IFRIFKLPMVRKDDITSDDIYAVVEAGALAGVLRKQEHELIENVFELESRTVPSSMTPRENVIWFDLHEDEQSLKNKVAE
HPHSKFLVCNEDIDHIIGYVDSKDLLNRVLANQSLALNSGVQIRNTLIVPDTLTLSEALESFKTAGEDFAVIMNEYALVV
GIITLNDVMTTLMGDLVGQGLEEQIVARDENSWLIDGGTPIDDVMRVLDIDEFPQSGNYETIGGFMMFMLRKIPKRTDSV
KFAGYKFEVVDIDNYRIDQLLVTRIDSKATALSPKLPDAKDKEESVA
>Q8NQE1 6.3.1.19~~~pafA~~~Pup--protein ligase~~~COG0638
MSTVESALTRRIMGIETEYGLTFVDGDSKKLRPDEIARRMFRPIVEKYSSSNIFIPNGSRLYLDVGSHPEYATAECDNLT
QLINFEKAGDVIADRMAVDAEESLAKEDIAGQVYLFKNNVDSVGNSYGCHENYLVGRSMPLKALGKRLMPFLITRQLICG
AGRIHHPNPLDKGESFPLGYCISQRSDHVWEGVSSATTRSRPIINTRDEPHADSHSYRRLHVIVGDANMAEPSIALKVGS
TLLVLEMIEADFGLPSLELANDIASIREISRDATGSTLLSLKDGTTMTALQIQQVVFEHASKWLEQRPEPEFSGTSNTEM
ARVLDLWGRMLKAIESGDFSEVDTEIDWVIKKKLIDRFIQRGNLGLDDPKLAQVDLTYHDIRPGRGLFSVLQSRGMIKRW
TTDEAILAAVDTAPDTTRAHLRGRILKAADTLGVPVTVDWMRHKVNRPEPQSVELGDPFSAVNSEVDQLIEYMTVHAESY
RS
>A0QZ42 6.3.1.19~~~pafA~~~Pup--protein ligase~~~COG0638
MQRRIMGIETEFGVTCTFHGHRRLSPDEVARYLFRRVVSWGRSSNVFLRNGARLYLDVGSHPEYATAECDNLIQLVTHDR
AGERVLEDLLIDAEQRLADEGIGGDIYLFKNNTDSAGNSYGCHENYLIVRAGEFSRISDVLLPFLVTRQLICGAGKVLQT
PKAATFCLSQRAEHIWEGVSSATTRSRPIINTRDEPHADAEKYRRLHVIVGDSNMCEATTMLKVGTASLVLEMIEAGVPF
RDFSLDNPIRAIREVSHDLTGRRPVRLAGGRQASALDIQREYYSRAVEYLQSREPNTQIEQVVDLWGRQLDAVESQDFAK
VDTEIDWVIKRKLFQRYQDRYNMELSDPKISQLDLAYHDIKRGRGVFDLLQRKGLAARITTDEEIDAAVTTPPQTTRAKL
RGEFISAAQEAGRDFTVDWVHLKLNDQAQRTVLCKDPFRSVDERVKRLIASM
>P9WNU7 6.3.1.19~~~pafA~~~Pup--protein ligase~~~COG0638
MQRRIMGIETEFGVTCTFHGHRRLSPDEVARYLFRRVVSWGRSSNVFLRNGARLYLDVGSHPEYATAECDSLVQLVTHDR
AGEWVLEDLLVDAEQRLADEGIGGDIYLFKNNTDSAGNSYGCHENYLIVRAGEFSRISDVLLPFLVTRQLICGAGKVLQT
PKAATYCLSQRAEHIWEGVSSATTRSRPIINTRDEPHADAEKYRRLHVIVGDSNMSETTTMLKVGTAALVLEMIESGVAF
RDFSLDNPIRAIREVSHDVTGRRPVRLAGGRQASALDIQREYYTRAVEHLQTREPNAQIEQVVDLWGRQLDAVESQDFAK
VDTEIDWVIKRKLFQRYQDRYDMELSHPKIAQLDLAYHDIKRGRGIFDLLQRKGLAARVTTDEEIAEAVDQPPQTTRARL
RGEFISAAQEAGRDFTVDWVHLKLNDQAQRTVLCKDPFRAVDERVKRLIASM
>P9WIM1 ~~~pafB~~~Protein PafB~~~COG2378
MATSKVERLVNLVIALLSTRGYITAEKIRSSVAGYSDSPSVEAFSRMFERDKNELRDLGIPLEVGRVSALEPTEGYRINR
DAYALSPVELTPDEAAAVAVATQLWESPELITATQGALLKLRAAGVDVDPLDTGAPVAIASAAAVSGLRGSEDVLGILLS
AIDSGQVVQFSHRSSRAEPYTVRTVEPWGVVTEKGRWYLVGHDRDRDATRVFRLSRIGAQVTPIGPAGATTVPAGVDLRS
IVAQKVTEVPTGEQATVWVAEGRATALRRAGRSAGPRQLGGRDGEVIELEIRSSDRLAREITGYGADAIVLQPGSLRDDV
LARLRAQAGALA
>P9WIL9 ~~~pafC~~~Protein PafC~~~COG2378
MSALSTRLVRLLNMVPYFQANPRITRAEAAAELGVTAKQLEEDLNQLWMCGLPGYSPGDLIDFEFCGDTIEVTFSAGIDR
PLKLTSPEATGLLVALRALADIPGVVDPQAARSAIAKIAAAAGAVAAVAEQAPTESPAAAAVRAAVRNSRALTIDYYAAS
HDTLTTRIVDPIRVLLIGGHSYLEAWSREAEGVRLFRFDRIVDAAELGEPAVPPESARQAPPDTSLFDGDLSLPSATLRV
APSASWMLEYYPIRELRQLPDGSCEVAMTYASEDWMTRLLLGFGSDVRVLAPESLAQRVRDAATAALDAYQAAAPP
>Q9I0P5 ~~~~~~Putrescine/agmatine-binding protein~~~
MKKVCALALSILTTIGATAADSAWAAQTSVHLYNWYDFIAPETPKAFQKETGTRVVLDTFDSAETAQGKLMVGRSGYDVV
VITSNILPGLIKAGVLQELDRDRLPHWKNLDADILGKLQANDPGNRYAVPYLWGTTGIAYDVDKVRKLLGPDAPVDSWDL
VFKEENISRLSQCGVATLDSSTELVSIALNYLGLPHNSQNPEDYQKAQELLLKVRPYIRYFDSSRVDTDLSNGNVCVVVG
WQGTAYMAQVNNEQAGNGRHIAYSIPREGSLVWAENMVLLKDAPHPQQGYALIDYLLRPEVIARTSNYVGYPNGNQAALP
LVERKLRENPAVYLSKETMATLFPLETLPLKVERIRTRVWSRVKTGS
>Q03C44 3.2.1.122~~~simA~~~6-phospho-alpha-glucosidase 1~~~
MDDRKFSVLIAGGGSTYTPGIVLTLLDHIQKFPLRKLKFYDIDGERQQRVADACEILVKERAPEVEFLATTDPEEAFTDV
DFVMAQIRVGKYAMRSLDEKIPLKHGVVGQETTGPGGIAYGLRSIPGVIGLVDYMEKYSPNAWMLNYSNPAAIVAEATRR
LRPHSRIINICDMPIGIMDRMAQIVGLKDRNDLVFRYYGLNHFGWWTDVRDKTGKDLMPALKQYVAKNGYWLGDKDKDTE
ASWVSTFKKAADVYALDPSTLPNTYLKYYLYPKYVVEHSDPNYTRTDEVEAYREKHVFDECDRIIAAGTAADTHFKSDDH
ATYIVDLCTAIAYDTKQRMLAIVPNDGAIENIDPEAMVEVPCLFGANGAERLAMGKAATFQKGLITEQNCVEKLTVDAFE
QQSYTKLWEAMSLCKIVPDASVAKEILDEMVVANKDYWPELK
>Q7WD07 3.1.1.77~~~pagL~~~Lipid A deacylase PagL~~~COG3637
MQFLKKNKPLFGIVTLALACATAQAQPTQGGVSLHYGIGDHYQRVTLNYETPTLWSHQFGGNWGRLDLTPELGASYWWAD
GSRSPGHVWQASAIPMFRWWTGERFYIEAGIGATVFSSTSFADKRIGSAFQFGDHIGLGFLLTPSNRIGLRYSHFSNAGI
KEPNPGLDIVQLTYTYQF
>Q97DP6 3.2.1.-~~~pagL~~~Phospho-alpha-glucosidase PagL~~~COG1486
MKKYSICIVGGGSRYTPDMLAMLCNQKERFPLRKIVLYDNESERQETVGNYAKILFKEYYPELEEVIWTTDEKEAFEDID
FALMQIRAGRLKMREKDEKISLKHGCLGQETCGAGGFAYGLRSVPAVIDLIKSIRTYSPKCWILNYSNPAAIVAEATKRV
FPNDYRIINICDMPIAIMDIYAAVLGLKRRDLEPKYFGLNHFGWFTHILDKKTGENYLPKLREILKTPVDVQTEPLFQEK
SWKSTFEFMSQMINDYDEYLPNTYLQYYLYPAKMRNKENPEYTRANEVMDGNEKETYERMHKIISLGKIHGTKYELTSDV
GCHAEYIVDLATAIANNTNEIFLIITENKGTINNVSKDMMVEVPCRVGSNGVEPLVVGSIPAFYKGLMENQYAYEKLSVD
ACLEGSYQKALQALVLNRTVVNTDVAKELLKDLIEANKGYWNELH
>C7NB67 3.2.1.-~~~pagL~~~6-phospho-alpha-glucosidase~~~COG1486
MKKFSIVVAGGGSTFTPGIVLMLLENLDKFPIRQIKFYDNDAQRQEVIAKACDIIIKEKAPDINFVYTTDPETAFTDIDF
VMAHIRVGKYAMREKDEKIPLRHGVLGQETCGPGGISYGMRSIGGVIELVDYMEKYSPNAWMLNYSNPAAIVAEATRRLR
PNSKILNICDMPIGIEIRMAEMLGLKSRKDMVIRYFGLNHFGWWTDIRDKKGNDLMPALREKVAKIGYNVEIEGENTEAS
WNDTFTKARDVFAIDPTTMPNTYLKYYFFPDYVVEHSNPNHTRANEVMEGREKFVFGECRAIAEKGTAKDSKLHVDDHAS
YIVDLARAIAYDTKERMLLIVENDGAISNFDPTAMVEVPCIVGSNGPEKIVQGKIPQFQKGLMEQQVSVEKLTVEAWMEG
SYQKLWQAITLSRTVPSASVAKAILDDLIEANKDFWPVLK
>Q9HVD1 3.1.1.77~~~pagL~~~Lipid A deacylase PagL~~~
MKKLLPLAVLAALSSVHVASAQAADVSAAVGATGQSGMTYRLGLSWDWDKSWWQTSTGRLTGYWDAGYTYWEGGDEGAGK
HSLSFAPVFVYEFAGDSIKPFIEAGIGVAAFSGTRVGDQNLGSSLNFEDRIGAGLKFANGQSVGVRAIHYSNAGLKQPND
GIESYSLFYKIPI
>Q8ZRJ9 ~~~pagN~~~Outer membrane protein PagN~~~
MKNFFAVCIIPLVVAWSATASAKEGIYITGKAGTSVVNVYGINSTFSQDEIVNGHATLPDRTKGVFGGGVAIGYDFYDPF
QLPVRLELDTTFRGETDAKGGQDIIAFGDPVHINVKNQVRMTTYMVNGYYDFHNSTAFTPYISAGVGLAHVKLSNNTIPV
GFGINETLSASKNNFAWGAGIGAKYAVTDNIMIDASYKYINAGKVSISKNHYAGDEHTAYDADTKAASNDFMLGITYAF
>Q7WFT9 2.3.1.251~~~pagP~~~Lipid A acyltransferase PagP~~~
MTQYFRALAFFLLLVPATAMACDGWPSWARGACQRVDQIWNEGGNDLYLTGYSWHNRAMYSSDKIRSFNELAWGGGLGKS
IYDEDGDWQGLYAMAFLDSHSDIEPIAGYGFQKIGRIGADTRLGIGYTVFLTSRSDIMSRVPFPGILPLVSAGYRDATLY
ATYIPGGKGNGNVLFMFGRWEF
>Q7W4D1 2.3.1.251~~~pagP~~~Lipid A acyltransferase PagP~~~
MTQYFRALAFFLLLVPATAMACDDWPSWARGACQRVDQIWNEGGNDLYLTGYSWHNRAMYSSDKIRSFNELAWGGGLGKS
IYDEDGDWQGLYAMAFLDSHSDIEPIAGYGFQKIGRIGADTRLGIGYTVFLTSRSDIMSRVPFPGILPLVSAGYRDATLY
ATYIPGGKGNGNVLFMFGRWEF
>Q8XBR9 2.3.1.251~~~pagP~~~Lipid A palmitoyltransferase PagP~~~
MNVSKYVAIFSFVFIQLISVGKVFANADEWMTTFRENIVQTWQQPEHYDLYIPAITWHARFAYDKEKTDRYNERPWGGGF
GLSRWDEKGNWHGLYAMAFKDSWNKWEPIAGYGWESTWRPLADENFHLGLGFTAGVTARDNWNYIPLPVLLPLASVGYGP
VTFQMTYIPGTYNNGNVYFAWMRFQF
>P37001 2.3.1.251~~~pagP~~~Lipid A palmitoyltransferase PagP~~~
MNVSKYVAIFSFVFIQLISVGKVFANADEWMTTFRENIAQTWQQPEHYDLYIPAITWHARFAYDKEKTDRYNERPWGGGF
GLSRWDEKGNWHGLYAMAFKDSWNKWEPIAGYGWESTWRPLADENFHLGLGFTAGVTARDNWNYIPLPVLLPLASVGYGP
VTFQMTYIPGTYNNGNVYFAWMRFQF
>Q93K12 2.3.1.251~~~pagP~~~Lipid A acyltransferase PagP~~~
MKRLISCLTIICALNRSAAAETTSSPCSRWISLLKPVCQRIHQTWTEGHDDMYFSGYAWHNRYTYRPEKIKSYNEAAWGG
GLGKSLFDEKGNWHGLYAIAFLDSHRHIEPAVGYAYLKTASVNKDIKAGLGYSVLVTSRVDYDNVPFPGALPWVALFYKR
TTVAATYIPGSAGAGNVLYILGKISL
>Q7N3D3 2.3.1.251~~~pagP~~~Lipid A acyltransferase PagP~~~
MKFDLTAACTLSATLLVSSGTVFATTANTANKSLTTIESHTPISYGNNSSSLWEKFNNNVALTWDAPNNELYLPVITWHN
RHTYDKEKTDRYNERPWGFGYGKYRYDEDNDWHSLYAMAFMDSHNRLEPIVGYGFQKMWIPGDLEGFRMGIGFTLSVTAR
HDYYYVPIPLPLPLFSIEYDRLSFQGTYIPGTYNNGNVLFAWLRWQW
>Q8ZR06 2.3.1.251~~~pagP~~~Lipid A acyltransferase PagP~~~
MYVAMIIRKYFLIIALLLMPWLAIPSVSAADKGGFNTFTDNVAETWRQPEHYDLYVPAITWHARFAYDKEKTDRYNERPW
GVGFGQSRWDDKGNWHGLYMMAFKDSFNKWEPIGGYGWEKTWRPLEDDNFRLGLGFTAGVTARDNWNYIPIPVLLPLASI
GYGPATFQMTYIPGSYNNGNVYFAWMRFQF
>A1JM47 2.3.1.251~~~pagP~~~Lipid A acyltransferase PagP~~~
MSYKHLISACIFSSLCLGQVNAVLAEDKLPPSNTSTGQHSELSVDNDNLWQRLLRNISLAWDSPNQELYIPLNTWHNRWT
YDDDKIESYNERPWGIGYGKYRYDENNNWHAVYAMAFMDSHNEVEPIIGYGYQKMWIPAEMDGWRFGVGFTASITARHEY
HYIPIPLPLPLISIEYNKFSLQTTYIPGTYNNGNVLFTWMRWQF
>O31178 ~~~pagR~~~Transcriptional repressor PagR~~~
MTVFVDHKIEYMSLEDDAELLKTMAHPMRLKIVNELYKHKALNVTQIIQILKLPQSTVSQHLCKMRGKVLKRNRQGLEIY
YSINNPKVEGIIKLLNPIQ
>P13423 ~~~pagA~~~Protective antigen~~~
MKKRKVLIPLMALSTILVSSTGNLEVIQAEVKQENRLLNESESSSQGLLGYYFSDLNFQAPMVVTSSTTGDLSIPSSELE
NIPSENQYFQSAIWSGFIKVKKSDEYTFATSADNHVTMWVDDQEVINKASNSNKIRLEKGRLYQIKIQYQRENPTEKGLD
FKLYWTDSQNKKEVISSDNLQLPELKQKSSNSRKKRSTSAGPTVPDRDNDGIPDSLEVEGYTVDVKNKRTFLSPWISNIH
EKKGLTKYKSSPEKWSTASDPYSDFEKVTGRIDKNVSPEARHPLVAAYPIVHVDMENIILSKNEDQSTQNTDSQTRTISK
NTSTSRTHTSEVHGNAEVHASFFDIGGSVSAGFSNSNSSTVAIDHSLSLAGERTWAETMGLNTADTARLNANIRYVNTGT
APIYNVLPTTSLVLGKNQTLATIKAKENQLSQILAPNNYYPSKNLAPIALNAQDDFSSTPITMNYNQFLELEKTKQLRLD
TDQVYGNIATYNFENGRVRVDTGSNWSEVLPQIQETTARIIFNGKDLNLVERRIAAVNPSDPLETTKPDMTLKEALKIAF
GFNEPNGNLQYQGKDITEFDFNFDQQTSQNIKNQLAELNATNIYTVLDKIKLNAKMNILIRDKRFHYDRNNIAVGADESV
VKEAHREVINSSTEGLLLNIDKDIRKILSGYIVEIEDTEGLKEVINDRYDMLNISSLRQDGKTFIDFKKYNDKLPLYISN
PNYKVNVYAVTKENTIINPSENGDTSTNGIKKILIFSKKGYEIG
>P0DJQ3 3.5.3.22~~~pah~~~Proclavaminate amidinohydrolase~~~COG0010
MERIDSHVSPRYAQIPTFMRLPHDPQPRGYDVVVIGAPYDGGTSYRPGARFGPQAIRSESGLIHGVGIDRGPGTFDLINC
VDAGDINLTPFDMNIAIDTAQSHLSGLLKANAAFLMIGGDHSLTVAALRAVAEQHGPLAVVHLDAHSDTNPAFYGGRYHH
GTPFRHGIDEKLIDPAAMVQIGIRGHNPKPDSLDYARGHGVRVVTADEFGELGVGGTADLIREKVGQRPVYVSVDIDVVD
PAFAPGTGTPAPGGLLSREVLALLRCVGDLKPVGFDVMEVSPLYDHGGITSILATEIGAELLYQYARAHRTQL
>P21340 2.3.1.57~~~paiA~~~Spermidine/spermine N(1)-acetyltransferase~~~COG0456
MSVKMKKCSREDLQTLQQLSIETFNDTFKEQNSPENMKAYLESAFNTEQLEKELSNMSSQFFFIYFDHEIAGYVKVNIDD
AQSEEMGAESLEIERIYIKNSFQKHGLGKHLLNKAIEIALERNKKNIWLGVWEKNENAIAFYKKMGFVQTGAHSFYMGDE
EQTDLIMAKTLI
>P21341 ~~~paiB~~~Protease synthase and sporulation protein PAI 2~~~COG2808
MYIPKYFKVTNAEEIWNFVQENSFGTVVTTEQGKPIATHLPLGFNKKDDHYYITGHFAYGNPQWRTFEACEDVLVMFQGP
HAYISSSWYSRENVPTWNYQAVHMYGKASMLEKDELAEELTIMLEKYEKHRDNPVLWDKLSPKLLESELKGIVGFKIKVE
DIQAAYKLSQNRNETDYMNVIEQLQNEENPNAKQMAELMEDKLKKQI
>Q9AI65 3.2.1.20~~~palH~~~Alpha-glucosidase~~~
MATKIVLVGAGSAQFGYGTLGDIFQSRALYGSEIILHDINPVALAVTEKTAKDFLAKEDLPFIVSATTDRRTALRGAEFV
IISIEVGDRFALWDLDWQIPQQYGIQQVYGENGGPGGLFHSLRIIPPILDICADVADICPDAWIFNYSNPMSRICTTVHR
RFPELNFVGMCHEIASLERYLPEMLNTSFDNLSLRAGGLNHFSVLLDARYKDSGKDAYADVRAKAPDYFASLPGYSDILA
YTRQHGKLVDTEGSTERHALGGKDSSYPWADRTLFKEILEKFHCMPITVDSHFGEYISWAGEVSDHRGILDFYTFYRNYL
GGVQPKIELKLKERVVSIMEGILTDSGYEEAAVNIPNRGFIKQLPEFIAVEVPAIIDRKGVHGIQVDIPPGIGGLLSNQI
AIHDLTAEAIIAGSRDLVIQALLVDSVNNQCRAIPELVDVMISRQQPWLNYLK
>P0A3S9 ~~~pal~~~Peptidoglycan-associated lipoprotein~~~
MRRIQSIARSPIAIALFMSLAVAGCASKKNLPNNAGDLGLGAGAATPGSSQDFTVNVGDRIFFDLDSSLIRADAQQTLSK
QAQWLQRYPQYSITIEGHADERGTREYNLALGQRRAAATRDFLASRGVPTNRMRTISYGNERPVAVCDADTCWSQNRRAV
TVLNGAGR
>P0A3S7 ~~~pal~~~Peptidoglycan-associated lipoprotein~~~COG2885
MRRIQSIARSPIAIALFMSLAVAGCASKKNLPNNAGDLGLGAGAATPGSSQDFTVNVGDRIFFDLDSSLIRADAQQTLSK
QAQWLQRYPQYSITIEGHADERGTREYNLALGQRRAAATRDFLASRGVPTNRMRTISYGNERPVAVCDADTCWSQNRRAV
TVLNGAGR
>P0A3S8 ~~~pal~~~Peptidoglycan-associated lipoprotein~~~
MRRIQSIARSPIAIALFMSLAVAGCASKKNLPNNAGDLGLGAGAATPGSSQDFTVNVGDRIFFDLDSSLIRADAQQTLSK
QAQWLQRYPQYSITIEGHADERGTREYNLALGQRRAAATRDFLASRGVPTNRMRTISYGNERPVAVCDADTCWSQNRRAV
TVLNGAGR
>P0A912 ~~~pal~~~Peptidoglycan-associated lipoprotein~~~COG2885
MQLNKVLKGLMIALPVMAIAACSSNKNASNDGSEGMLGAGTGMDANGGNGNMSSEEQARLQMQQLQQNNIVYFDLDKYDI
RSDFAQMLDAHANFLRSNPSYKVTVEGHADERGTPEYNISLGERRANAVKMYLQGKGVSADQISIVSYGKEKPAVLGHDE
AAYSKNRRAVLVY
>P10324 ~~~pal~~~Peptidoglycan-associated lipoprotein~~~COG2885
MNKFVKSLLVAGSVAALAACSSSNNDAAGNGAAQTFGGYSVADLQQRYNTVYFGFDKYDITGEYVQILDAHAAYLNATPA
AKVLVEGNTDERGTPEYNIALGQRRADAVKGYLAGKGVDAGKLGTVSYGEEKPAVLGHDEAAYSKNRRAVLAY
>B2J528 4.3.1.24~~~~~~Phenylalanine ammonia-lyase~~~COG2986
MNITSLQQNITRSWQIPFTNSSDSIVTVGDRNLTIDEVVNVARHGTQVRLTDNADVIRGVQASCDYINNAVETAQPIYGV
TSGFGGMADVVISREQAAELQTNLIWFLKSGAGNKLSLADVRAAMLLRANSHLYGASGIRLELIQRIETFLNAGVTPHVY
EFGSIGASGDLVPLSYITGALIGLDPSFTVDFDGKEMDAVTALSRLGLPKLQLQPKEGLAMMNGTSVMTGIAANCVYDAK
VLLALTMGVHALAIQGLYGTNQSFHPFIHQCKPHPGQLWTADQMFSLLKDSSLVREELDGKHEYRGKDLIQDRYSLRCLA
QFIGPIVDGVSEITKQIEVEMNSVTDNPLIDVENQVSYHGGNFLGQYVGVTMDRLRYYIGLLAKHIDVQIALLVSPEFSN
GLPPSLVGNSDRKVNMGLKGLQISGNSIMPLLSFYGNSLADRFPTHAEQFNQNINSQGYISANLTRRSVDIFQNYMAIAL
MFGVQAVDLRTYKMKGHYDARTCLSPNTVQLYTAVCEVVGKPLTSVRPYIWNDNEQCLDEHIARISADIAGGGLIVQAVE
HIFSSLKST
>Q3M5Z3 4.3.1.24~~~~~~Phenylalanine ammonia-lyase~~~COG2986
MKTLSQAQSKTSSQQFSFTGNSSANVIIGNQKLTINDVARVARNGTLVSLTNNTDILQGIQASCDYINNAVESGEPIYGV
TSGFGGMANVAISREQASELQTNLVWFLKTGAGNKLPLADVRAAMLLRANSHMRGASGIRLELIKRMEIFLNAGVTPYVY
EFGSIGASGDLVPLSYITGSLIGLDPSFKVDFNGKEMDAPTALRQLNLSPLTLLPKEGLAMMNGTSVMTGIAANCVYDTQ
ILTAIAMGVHALDIQALNGTNQSFHPFIHNSKPHPGQLWAADQMISLLANSQLVRDELDGKHDYRDHELIQDRYSLRCLP
QYLGPIVDGISQIAKQIEIEINSVTDNPLIDVDNQASYHGGNFLGQYVGMGMDHLRYYIGLLAKHLDVQIALLASPEFSN
GLPPSLLGNRERKVNMGLKGLQICGNSIMPLLTFYGNSIADRFPTHAEQFNQNINSQGYTSATLARRSVDIFQNYVAIAL
MFGVQAVDLRTYKKTGHYDARACLSPATERLYSAVRHVVGQKPTSDRPYIWNDNEQGLDEHIARISADIAAGGVIVQAVQ
DILPCLH
>Q47PU3 1.14.13.92~~~pamO~~~Phenylacetone monooxygenase~~~COG2072
MAGQTTVDSRRQPPEEVDVLVVGAGFSGLYALYRLRELGRSVHVIETAGDVGGVWYWNRYPGARCDIESIEYCYSFSEEV
LQEWNWTERYASQPEILRYINFVADKFDLRSGITFHTTVTAAAFDEATNTWTVDTNHGDRIRARYLIMASGQLSVPQLPN
FPGLKDFAGNLYHTGNWPHEPVDFSGQRVGVIGTGSSGIQVSPQIAKQAAELFVFQRTPHFAVPARNAPLDPEFLADLKK
RYAEFREESRNTPGGTHRYQGPKSALEVSDEELVETLERYWQEGGPDILAAYRDILRDRDANERVAEFIRNKIRNTVRDP
EVAERLVPKGYPFGTKRLILEIDYYEMFNRDNVHLVDTLSAPIETITPRGVRTSEREYELDSLVLATGFDALTGALFKID
IRGVGNVALKEKWAAGPRTYLGLSTAGFPNLFFIAGPGSPSALSNMLVSIEQHVEWVTDHIAYMFKNGLTRSEAVLEKED
EWVEHVNEIADETLYPMTASWYTGANVPGKPRVFMLYVGGFHRYRQICDEVAAKGYEGFVLT
>Q3JP15 2.1.2.11~~~panB~~~3-methyl-2-oxobutanoate hydroxymethyltransferase~~~
MTYLQESSRPAVTVPKLQAMREAGEKIAMLTSYDASFAALLDRANVDVQLIGDSLGNVLQGQATTLPVTLDDIAYHTACV
ARAQPRGLVVADLPFGTYGTPADAFASAVKLMRAGAQMVKLEGGEWLAETVRFLVERAVPVCAHVGLTPQSVHAFGGFKV
QGKTEAGAAQLLRDARAVEEAGAQLIVLEAVPTLVAAEVTRELSIPTIGIGAGAECSGQVLVLHDMLGVFPGKRPRFVKD
FMQGQPSIFAAVEAYVRAVKDGSFPGPEHSF
>Q2SYZ1 2.1.2.11~~~panB~~~3-methyl-2-oxobutanoate hydroxymethyltransferase~~~
MTYLQESSRPAVTVPKLQAMREAGEKIAMLTCYDASFAALLDRANVDVQLIGDSLGNVLQGQTTTLPVTLDDIAYHTACV
ARAQPRALIVADLPFGTYGTPADAFASAVKLMRAGAQMVKFEGGEWLAETVRFLVERAVPVCAHVGLTPQSVHAFGGFKV
QGKTEAGAAQLLRDARAVEEAGAQLIVLEAVPTLVAAEVTRELSIPTIGIGAGAECSGQVLVLHDMLGVFPGKRPRFVKD
FMQGQPSIFAAVEAYVRAVKDGSFPGPEHSF
>Q9X712 2.1.2.11~~~panB~~~3-methyl-2-oxobutanoate hydroxymethyltransferase~~~COG0413
MSGIDAKKIRTRHFREAKVNGQKVSVLTSYDALSARIFDEAGVDMLLVGDSAANVVLGRDTTLSITLDEMIVLAKAVTIA
TKRALVVVDLPFGTYEVSPNQAVESAIRVMRETGAAAVKIEGGVEIAQTIRRIVDAGIPVVGHIGYTPQSEHSLGGHVVQ
GRGASSGKLIADARALEQAGAFAVVLEMVPAEAAREVTEDLSITTIGIGAGNGTDGQVLVWQDAFGLNRGKKPRFVREYA
TLGDSLHDAAQAYIADIHAGTFPGEAESF
>P31057 2.1.2.11~~~panB~~~3-methyl-2-oxobutanoate hydroxymethyltransferase~~~COG0413
MKPTTISLLQKYKQEKKRFATITAYDYSFAKLFADEGLNVMLVGDSLGMTVQGHDSTLPVTVADIAYHTAAVRRGAPNCL
LLADLPFMAYATPEQAFENAATVMRAGANMVKIEGGEWLVETVQMLTERAVPVCGHLGLTPQSVNIFGGYKVQGRGDEAG
DQLLSDALALEAAGAQLLVLECVPVELAKRITEALAIPVIGIGAGNVTDGQILVMHDAFGITGGHIPKFAKNFLAETGDI
RAAVRQYMAEVESGVYPGEEHSFH
>P9WIL7 2.1.2.11~~~panB~~~3-methyl-2-oxobutanoate hydroxymethyltransferase~~~COG0413
MSEQTIYGANTPGGSGPRTKIRTHHLQRWKADGHKWAMLTAYDYSTARIFDEAGIPVLLVGDSAANVVYGYDTTVPISID
ELIPLVRGVVRGAPHALVVADLPFGSYEAGPTAALAAATRFLKDGGAHAVKLEGGERVAEQIACLTAAGIPVMAHIGFTP
QSVNTLGGFRVQGRGDAAEQTIADAIAVAEAGAFAVVMEMVPAELATQITGKLTIPTVGIGAGPNCDGQVLVWQDMAGFS
GAKTARFVKRYADVGGELRRAAMQYAQEVAGGVFPADEHSF
>Q9JZW6 2.1.2.11~~~panB~~~3-methyl-2-oxobutanoate hydroxymethyltransferase~~~
MITVNTLQKMKAAGEKIAMLTAYESSFAALMDDAGVEMLLVGDSLGMAVQGRKSTLPVSLRDMCYHTECVARGAKNAMIV
SDLPFGAYQQSKEQAFAAAAELMAAGAHMVKLEGGVWMAETTEFLQMRGIPVCAHIGLTPQSVFAFGGYKVQGRGGKAQA
LLNDAKAHDDAGAAVVLMECVLAELAKKVTETVSCPTIGIGAGADCDGQVLVMHDMLGIFPGKTAKFVKNFMQGHDSVQA
AVRAYVAEVKAKTFPAAEHIFAD
>P65656 2.1.2.11~~~panB~~~3-methyl-2-oxobutanoate hydroxymethyltransferase~~~
MKTVSQLIDMKQKQTKISMVTAYDFPSAKQVEAAGIDMILVGDSLGMTVLGYESTVQVTLADMIHHGRAVRRGAPNTFVV
VDMPIGAVGISMTQDLNHALKLYQETNANAIKAEGAHITPFIEKATAIGIPVVAHLGLTPQSVGVMGYKLQGATKEAAEQ
LILDAKNVEQAGAVALVLEAIPNDLAEEISKHLTIPVIGIGAGKGTDGQVLVYHDMLNYGVEHKAKFVKQFADFSVGVDG
LKQYDQEVKSGAFPSEEYTYKKKIMNEVNNND
>Q6G456 6.3.2.1~~~panC~~~Pantothenate synthetase~~~COG0414
MRVLKTIPEVRQSITEERRLGFSIGLVPTMGALHNGHIALVRRARAMCDRVLVSIFVNPKQFGPDEDFDKYPRDLKGDCA
LLEEAGVEYLFTPSVEEMWPPGNETIVNVEKLSRMLIGKLRPGHFCGVTSVVAKLFNIVQPDKAFFGEKDFQQILIVRRM
VEDLAFPIEVVGVPVLREADGVASSSRNQFLTLEDRKAAKIIPESGKAAENLYRQGERSVDKLCKIVRDILQQESRAIIE
SIDLRDMETLSVVKGRLDKSAVLLLTVRFGEIRLIDQYILQEKG
>Q8YFC9 6.3.2.1~~~panC~~~Pantothenate synthetase~~~COG0414
MQIIHTIEELRQALAPARQQGKKIGFVPTMGYLHKGHLELVRRARVENDVTLVSIFVNPLQFGANEDLGRYPRDLERDAG
LLHDAQVDYLFAPTVSDMYPRPMQTVVDVPPLGNQMEGEARPGHFAGVATVVSKLFNIVGPDAAYFGEKDFQQLVIIRRM
VDDMAIPVRIVGVETVREDDGLACSSRNVYLTPEQRRAAIIVPQALDEADRLYRSGMDDPDALEAAIRTFIGRQPLAVPE
VIAIRDPETLERLPALQGRPILVALFVRVGATRLLDNRVIGHAAPQITQERAA
>Q2T095 6.3.2.1~~~panC~~~Pantothenate synthetase~~~
MKVISSIQELRDQLRGQNRTAFVPTMGNLHEGHLSLMRLARQHGDPVVASIFVNRLQFGPNEDFDKYPRTLQEDIEKLQK
ENVYVLFAPTERDMYPEPQEYRVQPPHDLGDILEGEFRPGFFTGVCTVVTKLMACVQPRVAVFGKKDYQQLMIVRRMCQQ
LALPVEIVAAETVRDADGLALSSRNRYLSEAERAEAPELAKTLARVRDAVLDGERDLAAIERRAVAHLSARGWQPDYVSI
RRRENLVAPSAAQIEAGDPLVVLTAAKLGATRLIDNLEI
>Q9PIK2 6.3.2.1~~~panC~~~Pantothenate synthetase~~~COG0414
MQVITSVKEAKQIVKDWKSHQLSIGYVPTMGFLHDGHLSLVKHAKTQDKVIVSIFVNPMQFGPNEDFSSYPRDLERDIKM
CQDNGVDMVFIPDATQMYLKNFSTYVDMNTITDKLCGAKRPGHFRGVCTVLTKFFNILNPDIVYMGQKDAQQCVVVRHMV
DDLNFDLKIQICPIIREEDGLAKSSRNVYLSKEERKASLAISQSIFLAEKLVREGEKNTSKIIQAMKDILEKEKLIKIDY
IELVDFNTMENIENITDNVLGAVAAFVGKTRLIDNFLVQGLK
>Q9X713 6.3.2.1~~~panC~~~Pantothenate synthetase~~~COG0414
MQVATTKQALIDALLHHKSVGLVPTMGALHSGHASLVKAARAENDTVVASIFVNPLQFEALGDCDDYRNYPRQLDADLAL
LEEAGVDIVFAPDVEEMYPGGLPLVWARTGSIGTKLEGASRPGHFDGVATVVAKLFNLVRPDRAYFGQKDAQQVAVIRRL
VADLDIPVEIRPVPIIRGADGLAESSRNQRLSADQRAQALVLPQVLSGLQRRKAAGEALDIQGARDTLASADGVRLDHLE
IVDPATLEPLEIDGLLTQPALVVGAIFVGPVRLIDNIEL
>P31663 6.3.2.1~~~panC~~~Pantothenate synthetase~~~COG0414
MLIIETLPLLRQQIRRLRMEGKRVALVPTMGNLHDGHMKLVDEAKARADVVVVSIFVNPMQFDRPEDLARYPRTLQEDCE
KLNKRKVDLVFAPSVKEIYPNGTETHTYVDVPGLSTMLEGASRPGHFRGVSTIVSKLFNLVQPDIACFGEKDFQQLALIR
KMVADMGFDIEIVGVPIMRAKDGLALSSRNGYLTAEQRKIAPGLYKVLSSIADKLQAGERDLDEIITIAGQELNEKGFRA
DDIQIRDADTLLEVSETSKRAVILVAAWLGDARLIDNKMVELA
>Q5NF57 6.3.2.1~~~panC~~~Pantothenate synthetase~~~COG0414
MIIADNIKQFHSIRNSLIKQQKIGFVPTMGALHNGHISLIKKAKSENDVVIVSIFVNPTQFNNPNDYQTYPNQLQQDIQI
LASLDVDVLFNPSEKDIYPDGNLLRIEPKLEIANILEGKSRPGHFSGMLTVVLKLLQITKPNNLYLGEKDYQQVMLIKQL
VKDFFINTKIIVCPTQRQPSGLPLSSRNKNLTSTDIEIANKIYEILRQDDFSNLEELTNKINSTGAKLQYIQKLNNRIFL
AFYIGKVRLIDNFLKETGPSC
>A0R580 6.3.2.1~~~panC~~~Pantothenate synthetase~~~COG0414
MTISRTPKFSAGELNVYSAPADVAAVTRALRTAGRRIVLVPTMGALHEGHLTLVRAAKRTPGAVVVVSIFVNPLQFGPNE
DLNAYPRTLEDDLTALRAEGVEIVFTPTGSDMYPDGTRTSVHPGPLGDDLEGSSRPGHFAGVLTVVLKLFSIVRPDRAYF
GEKDYQQLTLLRQMVADLNVDVQIVGVPTVRESDGLALSSRNRYLDKDQREQAGALSAALLAGKYAAAGGAEAALDAARA
VLDEVPALEVDYLQVRDPMLGPAPAEGQARLLVAARLGRTRLIDNIAIDVGASAGIDGHPRVGNDQNHELPWRN
>P9WIL4 6.3.2.1~~~panC~~~Pantothenate synthetase~~~
MTIPAFHPGELNVYSAPGDVADVSRALRLTGRRVMLVPTMGALHEGHLALVRAAKRVPGSVVVVSIFVNPMQFGAGEDLD
AYPRTPDDDLAQLRAEGVEIAFTPTTAAMYPDGLRTTVQPGPLAAELEGGPRPTHFAGVLTVVLKLLQIVRPDRVFFGEK
DYQQLVLIRQLVADFNLDVAVVGVPTVREADGLAMSSRNRYLDPAQRAAAVALSAALTAAAHAATAGAQAALDAARAVLD
AAPGVAVDYLELRDIGLGPMPLNGSGRLLVAARLGTTRLLDNIAIEIGTFAGTDRPDGYRAILESHWRN
>P9WIL5 6.3.2.1~~~panC~~~Pantothenate synthetase~~~COG0414
MTIPAFHPGELNVYSAPGDVADVSRALRLTGRRVMLVPTMGALHEGHLALVRAAKRVPGSVVVVSIFVNPMQFGAGEDLD
AYPRTPDDDLAQLRAEGVEIAFTPTTAAMYPDGLRTTVQPGPLAAELEGGPRPTHFAGVLTVVLKLLQIVRPDRVFFGEK
DYQQLVLIRQLVADFNLDVAVVGVPTVREADGLAMSSRNRYLDPAQRAAAVALSAALTAAAHAATAGAQAALDAARAVLD
AAPGVAVDYLELRDIGLGPMPLNGSGRLLVAARLGTTRLLDNIAIEIGTFAGTDRPDGYRAILESHWRN
>Q8ZRR1 6.3.2.1~~~panC~~~Pantothenate synthetase~~~
MLIIETLPLLRQHIRRLRQEGKRVALVPTMGNLHDGHMKLVDEAKARADVVIVSIFVNPMQFDRPDDLVRYPRTLQEDCE
KLNKRKVDYVFAPAVEEIYPHGLEGQTYVDVPGLSTMLEGASRPGHFRGVSTIVSKLFNLIQPDIACFGEKDFQQLALIR
KMVADMSYDIEIVGVPIIRAKDGLALSSRNAYLTAEQRKIAPGLYNVMNSIAEKLIAGNRELQEIIAIAEQELNEKGFRA
DDIQIRDADTLLELTETSKRAVILAAAWLGQARLIDNQSVTLAQ
>Q2FV22 6.3.2.1~~~panC~~~Pantothenate synthetase~~~COG0414
MTKLITTVKEMQHIVKAAKRSGTTIGFIPTMGALHDGHLTMVRESVSTNDITIVSVFVNPLQFGPNEDFDAYPRQIDKDL
ELVSEVGADIVFHPAVEDMYPGELGIDVKVGPLADVLEGAKRPGHFDGVVTVVNKLFNIVMPDYAYFGKKDAQQLAIVEQ
MVKDFNHAVEIIGIDIVREADGLAKSSRNVYLTEQERQEAVHLSKSLLLAQALYQDGERQSKVIIDRVTEYLESHISERI
EEVAVYSYPQLVEQHEITGRIFISLAVKFSKARLIDNIIIGAE
>P65659 6.3.2.1~~~panC~~~Pantothenate synthetase~~~
MTKLITTVKEMQHIVKAAKRSGTTIGFIPTMGALHDGHLTMVRESVSTNDITVVSVFVNPLQFGPNEDFDAYPRQIDKDL
ELVSEVGADIVFHPAVEDIYPGELGIDVKVGPLADVLEGAKRPGHFDGVVTVVNKLFNIVMPDYAYFGKKDAQQLAIVEQ
MVKDFNHAVEIIGIDIVREADGLAKSSRNVYLTEQERQEAVHLSKSLLLAQALYQDGERQSKVIIDRVTEYLESHISGRI
EEVAVYSYPQLVEQHEITGRIFISLAVKFSKARLIDNIIIGAE
>Q6GDK5 6.3.2.1~~~panC~~~Pantothenate synthetase~~~
MTKLITTVKEMQHIVKAAKRSGTTIGFIPTMGALHDGHLTMVRESVSTNDITVVSVFVNPLQFGPNEDFDAYPRQIDKDL
ELVSEVGADIVFHPAVEDMYPGELGIDVKVGPLADVLEGAKRPGHFDGVVTVVNKLFNIVMPDYAYFGKKDAQQLAIVEQ
MVKDFNHAVEIIGIDIVREADGLAKSSRNVYLTEQERQEAVHLSKSLLLAQALYQDGERQSKVIIDKVTQYLESHISGRI
EEVAVYSYPQLVEQHEITGRIFISLAVKFSKARLIDNIIIGAE
>Q9X0G6 6.3.2.1~~~panC~~~Pantothenate synthetase~~~COG0414
MRIIETIEEMKKFSEEMREKKKTIGFVPTMGYLHEGHLSLVRRARAENDVVVVSIFVNPTQFGPNEDYERYPRDFERDRK
LLEKENVDCIFHPSVEEMYPPDFSTYVEETKLSKHLCGRSRPGHFRGVCTVVTKLFNIVKPHRAYFGQKDAQQFRVLRRM
VRDLNMDVEMIECPIVREPDGLAMSSRNVYLSPEERQQALSLYQSLKIAENLYLNGERDAEKIKEEMIKHLSRFDKVKID
YVEIVDEETLEPVEKIDRKVIVAVAAWVGNARLIDNTILG
>Q5SHF5 6.3.2.1~~~panC~~~Pantothenate synthetase~~~COG0414
MRTVSTVAELRAALPREGVGFVPTMGYLHRGHLALVERARRENPFVVVSVFVNPLQFGPGEDYHRYPRDLERDRALLQEA
GVDLLFAPGVEEMYPEGFATRVQVEGPLTALWEGAVRPGHFQGVATVVARLFLLVQPQRAYFGEKDYQQLLVVRRMVRDL
GFPVEVVGVPTVREEDGLALSSRNVYLSPETRKKAPVLYRALLAMREVAGQGGSVAEALRAGEEALRAVPEFRKDYLAIV
HPETLLPLSDWVAGARGIVAGRFPEARLIDNLEVYP
>Q8ZBK7 6.3.2.1~~~panC~~~Pantothenate synthetase~~~COG0414
MLIIETLPLLRQQIRRWRQEGKRIALVPTMGNLHEGHMTLVDEAKTRADVVVVTIFVNPLQFERPDDLAHYPRTLQEDCE
KLTRHGADLVFAPAAADIYPAGLEKQTYVDVPALSTILEGASRPGHFRGVSTIVSKLFNLIQPDVACFGEKDYQQLALIR
KMVADMGYDINIVGVPTVRAKDGLALSSRNGYLTEEERQIAPQLSKIMWALAEKMALGERQIDALLEEAAAQLLRVGFTP
DELFIRDAETLQPLTVDSQQAVILMAAWLGKARLIDNQLVDLRH
>Q9PIK3 4.1.1.11~~~panD~~~Aspartate 1-decarboxylase~~~COG0853
MNITLLKSKIHRASVTEARLDYIGSISIDEKLLQASGILEYEKVQVVNVNNGARFETYTIATQEEGVVCLNGAAARLAEV
GDKVIIMSYADFNEEEAKTFKPKVVFVDENNTATKITNYEKHGAIF
>Q9X4N0 4.1.1.11~~~panD~~~Aspartate 1-decarboxylase~~~COG0853
MLRTILGSKIHRATVTQADLDYVGSVTIDADLVHAAGLIEGEKVAIVDITNGARLETYVIVGDAGTGNICINGAAAHLIN
PGDLVIIMSYLQATDAEAKAYEPKIVHVDADNRIVALGNDLAEALPGSGLLTSRSI
>P0A790 4.1.1.11~~~panD~~~Aspartate 1-decarboxylase~~~COG0853
MIRTMLQGKLHRVKVTHADLHYEGSCAIDQDFLDAAGILENEAIDIWNVTNGKRFSTYAIAAERGSRIISVNGAAAHCAS
VGDIVIIASFVTMPDEEARTWRPNVAYFEGDNEMKRTAKAIPVQVA
>P0C7I4 4.1.1.11~~~panD~~~Aspartate 1-decarboxylase~~~
MIRTMLQGKLHRVKVTHADLHYEGSCAIDQDFLDAAGILENEAIDIWNVTNGKRFSTYAIAAERGSRIISVNGAAAHCAS
VGDIVIIASFVTMPDEEARTWRPNVAYFEGDNEMKRTAKAIPVQVA
>Q5NF56 4.1.1.11~~~panD~~~Aspartate 1-decarboxylase~~~COG0853
MLISVLKSKISYATVTGKDLFYVGSITIDSEIMKQANIIENEKVQVVNLNNGERLETYVIKGEPNSKTIALNGPAARRCE
IGDQLFIISYTQVDPTRENIKPKLVDLKTGD
>P56065 4.1.1.11~~~panD~~~Aspartate 1-decarboxylase~~~COG0853
MTFEMLYSKIHRATITDANLNYIGSITIDEDLAKLAKLREGMKVEIVDVNNGERFSTYVILGKKRGEICVNGAAARKVAI
GDVVIILAYASMNEDEINAHKPSIVLVDEKNEILEKG
>A5U8S6 4.1.1.11~~~panD~~~Aspartate 1-decarboxylase~~~COG0853
MLRTMLKSKIHRATVTCADLHYVGSVTIDADLMDAADLLEGEQVTIVDIDNGARLVTYAITGERGSGVIGINGAAAHLVH
PGDLVILIAYATMDDARARTYQPRIVFVDAYNKPIDMGHDPAFVPENAGELLDPRLGVG
>P9WIL3 4.1.1.11~~~panD~~~Aspartate 1-decarboxylase~~~COG0853
MLRTMLKSKIHRATVTCADLHYVGSVTIDADLMDAADLLEGEQVTIVDIDNGARLVTYAITGERGSGVIGINGAAAHLVH
PGDLVILIAYATMDDARARTYQPRIVFVDAYNKPIDMGHDPAFVPENAGELLDPRLGVG
>P65662 4.1.1.11~~~panD~~~Aspartate 1-decarboxylase~~~
MIRTMLQGKLHRVKVTQADLHYEGSCAIDQDFLDASGILENEAIDIWNVTNGKRFSTYAIAAERGSRIISVNGAAAHCAE
VGDIVIIASFVTMSDEEARTWRPKVAYFEGDNEMKRTAKAIPVQVA
>Q5SKN7 4.1.1.11~~~panD~~~Aspartate 1-decarboxylase~~~COG0853
MKRVMFHAKIHRATVTQADLHYVGSVTVDQDLLDAAGILPFEQVDIYDITNGARLTTYALPGERGSGVIGINGAAAHLVK
PGDLVILVAYGVFDEEEARNLKPTVVLVDERNRILEVRKG
>O34661 1.1.1.169~~~panE~~~2-dehydropantoate 2-reductase~~~COG1893
MKIGIIGGGSVGLLCAYYLSLYHDVTVVTRRQEQAAAIQSEGIRLYKGGEEFRADCSADTSINSDFDLLVVTVKQHQLQS
VFSSLERIGKTNILFLQNGMGHIHDLKDWHVGHSIYVGIVEHGAVRKSDTAVDHTGLGAIKWSAFDDAEPDRLNILFQHN
HSDFPIYYETDWYRLLTGKLIVNACINPLTALLQVKNGELLTTPAYLAFMKLVFQEACRILKLENEEKAWERVQAVCGQT
KENRSSMLVDVIGGRQTEADAIIGYLLKEASLQGLDAVHLEFLYGSIKALERNTNKVF
>P0A9J4 1.1.1.169~~~panE~~~2-dehydropantoate 2-reductase~~~COG1893
MKITVLGCGALGQLWLTALCKQGHEVQGWLRVPQPYCSVNLVETDGSIFNESLTANDPDFLATSDLLLVTLKAWQVSDAV
KSLASTLPVTTPILLIHNGMGTIEELQNIQQPLLMGTTTHAARRDGNVIIHVANGITHIGPARQQDGDYSYLADILQTVL
PDVAWHNNIRAELWRKLAVNCVINPLTAIWNCPNGELRHHPQEIMQICEEVAAVIEREGHHTSAEDLRDYVMQVIDATAE
NISSMLQDIRALRHTEIDYINGFLLRRARAHGIAVPENTRLFEMVKRKESEYERIGTGLPRPW
>P9WIL1 1.1.1.169~~~~~~2-dehydropantoate 2-reductase~~~COG1893
MATGIALVGPGAVGTTVAALLHKAGYSPLLCGHTPRAGIELRRDGADPIVVPGPVHTSPREVAGPVDVLILAVKATQNDA
ARPWLTRLCDERTVVAVLQNGVEQVEQVQPHCPSSAVVPAIVWCSAETQPQGWVRLRGEAALVVPTGPAAEQFAGLLRGA
GATVDCDPDFTTAAWRKLLVNALAGFMVLSGRRSAMFRRDDVAALSRRYVAECLAVARAEGARLDDDVVDEVVRLVRSAP
QDMGTSMLADRAAHRPLEWDLRNGVIVRKARAHGLATPISDVLVPLLAAASDGPG
>Q9HW09 1.1.1.169~~~panE~~~2-dehydropantoate 2-reductase~~~
MTWHILGAGSLGSLWAARLGRAGLPVRLILRDRQRLRRYQQAGGLSLVEDGQASLYPIAAETPDGGQPIQRLLLACKAYD
AEEAASSVAHRLAGNAELLLLQNGLGSQQAVAARLPRSRCLFASSTEGAFRDGDFRVVFAGRGHTWLGDPRDTNAPAWLT
QLSQAGIPHSWSDDILERLWRKLALNCAINPLTVLHDCRNGGLRQHPEEIAALCDELGQLLHASGYDAAARSLLEDVRAV
IDATAANYSSMHQDVTRGRRTEIGYLLGYACQHGQRLGLPLPRLGTLLARLQAHLRQRGLPDR
>P37402 1.1.1.169~~~panE~~~2-dehydropantoate 2-reductase~~~
MKITVLGCGALGQLWLSALCKHGHDVQGWLRVPQPYCSVNLIDTDGSFFNESLTANDPDFLAKSELLLVTLKAWQVSDAV
RTLASTLPVTSPILLIHNGMGTIEELQNIQQPMLMGTITHAARRDGNIIIHVANGTTHIGPAREQDGDYSYLADILQGVL
PDVAWHNNIRAEMWRKLAVNCVINPLTALWNCPNGELRHHTDEINAICEEVAAVIEREGYHTSADDLRYYVEQVIDSTAE
NISSMLQDVRAMRHTEIDYITGYLLKRARVHGLAVPENSRLFEMVKRKESEYERSGTGMPRPW
>P16256 ~~~panF~~~Sodium/pantothenate symporter~~~COG4145
MQLEVILPLVAYLVVVFGISVYAMRKRSTGTFLNEYFLGSRSMGGIVLAMTLTATYISASSFIGGPGAAYKYGLGWVLLA
MIQLPAVWLSLGILGKKFAILARRYNAVTLNDMLFARYQSRLLVWLASLSLLVAFVGAMTVQFIGGARLLETAAGIPYET
GLLIFGISIALYTAFGGFRASVLNDTMQGLVMLIGTVVLLIGVVHAAGGLSNAVQTLQTIDPQLVTPQGADDILSPAFMT
SFWVLVCFGVIGLPHTAVRCISYKDSKAVHRGIIIGTIVVAILMFGMHLAGALGRAVIPDLTVPDLVIPTLMVKVLPPFA
AGIFLAAPMAAIMSTINAQLLQSSATIIKDLYLNIRPDQMQNETRLKRMSAVITLVLGALLLLAAWKPPEMIIWLNLLAF
GGLEAVFLWPLVLGLYWERANAKGALSAMIVGGVLYAVLATLNIQYLGFHPIVPSLLLSLLAFLVGNRFGTSVPQATVLT
TDK
>Q5E6F9 4.1.1.11~~~panP~~~Aspartate 1-decarboxylase~~~COG0076
MVTDNKTADASFESLLRIFTVPEAPDSTLGIIEKELSQNLNQFLREHIVAEEKPLTEIEKDFTDSSMPESPTYVSEHTEH
LLDTLVSQSVHTSAPSFIGHMTSALPYFLMPLSKIMIALNQNLVKIETSKAFTPLERQVLGMLHRLIFGQKDSFYQHWMH
SADHSLGAFCSGGTIANITALWVARNRLLKPEGDFEGIAKQGLFAALMHYKCNGLAIFVSERGHYSLKKAADVLGIGQDG
VIAVKTDNNNRVCLDDLELKIAQAKAKNIKPLAIVGVAGTTETGSIDPLRELANVAQREGCHFHVDAAWGGATLMSNTYR
HLLDGIDLADSVTIDAHKQLYVPMGAGMVIFKDPELMSSIQHHAEYILRKGSKDLGRHTLEGSRSGMAMLLYSCFNVISR
PGYELLINQSIEKAHYFADLIQQQDDFELITEPELCLLTYRYVPSNVKAALAIATDEQKIEIYEHLDNLTKYIQKTQRET
GKSFVSRTRLTPEAYQHQPTIVFRVVLANPLTTKEILQNVLIEQREIASSSEISLPLLNQIVGNILH
>Q8ZKL0 ~~~panS~~~Pantothenate precursors transporter PanS~~~
MLAVITRLFPLWALLLSLTAYYTPSTFTPVGPWVATLLMLIMFGMGVHLNVDDFKRVLSRPAPVAAGIFLHYLVMPLAAW
LLALLFKMPPDLSAGMVLVGSVASGTASNVMIYLAKGDVALSVTISSVSTLVGVVATPLLTRLYVDAHIQVDVMGMLLSI
LQIVVIPITLGLVIHHLFPRVVKVVKPYLPAFSMVCILAIISAVVAGSASHIASVGFMVIIAVILHNTLGLLGGYWGGRL
FGFDESTCRTLAIEVGMQNSGLAAALGKIYFGPLAALPGALFSVWHNLSGSLLAGYWSGKPIVEKSGETAKVN
>A2RIQ0 ~~~panT~~~Pantothenic acid transporter PanT~~~COG4684
MKKSKASDVAILAIFIAIMVVVQLFTQFVINVWPFPVKPTLLHLPVIIGSIILGWRKGAFLGLVWGLISFVTATIVTTPT
SFLFSPFQPVIGTHHGSPWGLFIAFIPRILVGILPYFVYKIANNRLGAGLAAFAGTATNTVLVLTSIFLFFGSTLKWSLS
YLLGAIVATNSLTEVIIAVILTTAIVPALTKARNNS
>Q03YI6 ~~~panT~~~Pantothenate transporter PanT~~~COG4684
MSNNKTKYLVITTFFMAIILLQVLIPWLGYIPLGAVIVGAQPTIIQFTVAIAAILLGARRGAFIGGFWGLLTLWQAWSTP
GSIGSLMFQNPFTAFIPRILVGLIIGMAFNKWLRNKNFGFRTLGLGFLGGLAALINTVGVVLLTVIGFTVMRTNFTGIPN
HNLLGWLIGIVSFNSIFEIITGIILVAAIGNVLVPIAERAGIKG
>P37613 ~~~panZ~~~PanD regulatory factor~~~COG0456
MKLTIIRLEKFSDQDRIDLQKIWPEYSPSSLQVDDNHRIYAARFNERLLAAVRVTLSGTEGALDSLRVREVTRRRGVGQY
LLEEVLRNNPGVSCWWMADAGVEDRGVMTAFMQALGFTAQQGGWEKC
>Q7CPJ9 ~~~panM~~~PanD regulatory factor~~~
MKLTILRLEHFSAQDQIDLGKIWPEYSASSLSVDETHRIYAARFNERLLGAVRVTLSGTQGALDSLRVREITRRRGVGQY
LVEEVIRDNPNVSSWWMADVGVEDRSVMAAFMQALGFTAQHDGWEKR
>P77165 1.2.99.6~~~paoA~~~Aldehyde oxidoreductase iron-sulfur-binding subunit PaoA~~~COG2080
MSNQGEYPEDNRVGKHEPHDLSLTRRDLIKVSAATAATAVVYPHSTLAASVPAATPAPEIMPLTLKVNGKTEQLEVDTRT
TLLDTLRENLHLIGTKKGCDHGQCGACTVLVNGRRLNACLTLAVMHQGAEITTIEGLGSPDNLHPMQAAFIKHDGFQCGY
CTSGQICSSVAVLKEIQDGIPSHVTVDLVSAPETTADEIRERMSGNICRCGAYANILAAIEDAAGEIKS
>P77324 1.2.99.6~~~paoB~~~Aldehyde oxidoreductase FAD-binding subunit PaoB~~~COG1319
MKAFTYERVNTPAEAALSAQRVPGAKFIAGGTNLLDLMKLEIETPTHLIDVNGLGLDKIEVTDAGGLRIGALVRNTDLAA
HERVRRDYAVLSRALLAGASGQLRNQATTAGNLLQRTRCPYFYDTNQPCNKRLPGSGCAALEGFSRQHAVVGVSEACIAT
HPSDMAVAMRLLDAVVETITPEGKTRSITLADFYHPPGKTPHIETALLPGELIVAVTLPPPLGGKHIYRKVRDRASYAFA
LVSVAAIIQPDGSGRVALGGVAHKPWRIEAADAQLSQGAQAVYDTLFASAHPTAENTFKLLLAKRTLASVLAEARAQA
>P77489 1.2.99.6~~~paoC~~~Aldehyde oxidoreductase molybdenum-binding subunit PaoC~~~COG1529
MKFDKPAGENPIDQLKVVGRPHDRIDGPLKTTGTARYAYEWHEEAPNAAYGYIVGSAIAKGRLTALDTDAAQKAPGVLAV
ITASNAGALGKGDKNTARLLGGPTIEHYHQAIALVVAETFEQARAAASLVQAHYRRNKGAYSLADEKQAVNQPPEDTPDK
NVGDFDGAFTSAAVKIDATYTTPDQSHMAMEPHASMAVWDGNKLTLWTSNQMIDWCRTDLAKTLKVPVENVRIISPYIGG
GFGGKLFLRSDALLAALAARAVKRPVKVMLPRPSIPNNTTHRPATLQHLRIGADQSGKITAISHESWSGNLPGGTPETAV
QQSELLYAGANRHTGLRLATLDLPEGNAMRAPGEAPGLMALEIAIDELAEKAGIDPVEFRILNDTQVDPADPTRCFSRRQ
LIECLRTGADKFGWKQRNATPGQVRDGEWLVGHGVAAGFRNNLLEKSGARVHLEQNGTVTVETDMTDIGTGSYTILAQTA
AEMLGVPLEQVAVHLGDSSFPVSAGSGGQWGANTSTSGVYAACMKLREMIASAVGFDPEQSQFADGKITNGTRSATLHEA
TAGGRLTAEESIEFGTLSKEYQQSTFAGHFVEVGVHSATGEVRVRRMLAVCAAGRILNPKTARSQVIGAMTMGMGAALME
ELAVDDRLGYFVNHDMAGYEVPVHADIPKQEVIFLDDTDPISSPMKAKGVGELGLCGVSAAIANAVYNATGIRVRDYPIT
LDKLLDKLPDVV
>P77183 ~~~paoD~~~Molybdenum cofactor insertion chaperone PaoD~~~COG1975
MSYPLFDKDEHWHKPEQAFLTDDHRTILRFAVEALMSGKGAVLVTLVEIRGGAARPLGAQMVVREDGRYCGFVSGGCVEA
AAAFEALEMMGSGRDREIRYGEGSPWFDIVLPCGGGITLTLHKLRSAQPLLAVLNRLEQRKPVGLRYDPQAQSLVCLPTQ
TRTGWNLNGFEVGFRPCVRLMIYGRSLEAQATASLAAATGYDSHIFDLFPASASAQIDTDTAVILLCHDLNRELPVLQAA
REAKPFYLGALGSYRTHTLRLQKLHELGWSREETTQIRAPVGIFPKARDAHTLALSVLAEVASVRLHQEEDSCLPPSS
>P46881 1.4.3.21~~~~~~Phenylethylamine oxidase~~~
MTPSTIQTASPFRLASAGEISEVQGILRTAGLLGPEKRIAYLGVLDPARGAGSEAEDRRFRVFIHDVSGARPQEVTVSVT
NGTVISAVELDTAATGELPVLEEEFEVVEQLLATDERWLKALAARNLDVSKVRVAPLSAGVFEYAEERGRRILRGLAFVQ
DFPEDSAWAHPVDGLVAYVDVVSKEVTRVIDTGVFPVPAEHGNYTDPELTGPLRTTQKPISITQPEGPSFTVTGGNHIEW
EKWSLDVGFDVREGVVLHNIAFRDGDRLRPIINRASIAEMVVPYGDPSPIRSWQNYFDTGEYLVGQYANSLELGCDCLGD
ITYLSPVISDAFGNPREIRNGICMHEEDWGILAKHSDLWSGINYTRRNRRMVISFFTTIGNYDYGFYWYLYLDGTIEFEA
KATGVVFTSAFPEGGSDNISQLAPGLGAPFHQHIFSARLDMAIDGFTNRVEEEDVVRQTMGPGNERGNAFSRKRTVLTRE
SEAVREADARTGRTWIISNPESKNRLNEPVGYKLHAHNQPTLLADPGSSIARRAAFATKDLWVTRYADDERYPTGDFVNQ
HSGGAGLPSYIAQDRDIDGQDIVVWHTFGLTHFPRVEDWPIMPVDTVGFKLRPEGFFDRSPVLDVPANPSQSGSHCHG
>Q5W9R9 1.13.12.9~~~~~~Phenylalanine 2-monooxygenase precursor~~~
MGVTVIPRLLGLKDEKKIATTVGEARLSGINYRHPDSALVSYPVAAAAPLGRLPAGNYRIAIVGGGAGGIAALYELGRLA
ATLPAGSGIDVQIYEADPDSFLHDRPGIKAIKVRGLKAGRVSAALVHNGDPASGDTIYEVGAMRFPEIAGLTWHYASAAF
GDAAPIKVFPNPGKVPTEFVFGNRVDRYVGSDPKDWEDPDSPTLKVLGVVAGGLVGNPQGENVAMYPIANVDPAKIAAIL
NAATPPADALERIQTKYWPEFIAQYDGLTLGAAVREIVTVAFEKGTLPPVDGVLDVDESISYYVELFGRFGFGTGGFKPL
YNISLVEMMRLILWDYSNEYTLPVTENVEFIRNLFLKAQNVGAGKLVVQVRQERVANACHSGTASARAQLLSYDSHNAVH
SEAYDFVILAVPHDQLTPIVSRSGFEHAASQNLGDAGLGLETHTYNQVYPPLLLSDSSPAANARIVTAIGQLHMARSSKV
FATVKTAALDQPWVPQWRGEPIKAVVSDSGLAASYVVPSPIVEDGQAPEYSSLLASYTWEDDSTRLRHDFGLYPQNPATE
TGTADGMYRTMVNRAYRYVKYAGASNAQPWWFYQLLAEARTADRFVFDWTTNKTAGGFKLDMTGDHHQSNLCFRYHTHAL
AASLDNRFFIASDSYSHLGGWLEGAFMSALNAVAGLIVRANRGDVSALSTEARPLVIGLRPVVKVPAAELATSQ
>P9WIK9 2.3.1.283~~~papA1~~~2'-acyl-2-O-sulfo-trehalose (hydroxy)phthioceranyltransferase PapA1~~~COG1020
MRIGPVELSAVKDWDPAPGVLVSWHPTPASCAKALAAPVSAVPPSYVQARQIRSFSEQAARGLDHSRLLIASVEVFGHCD
LRAMTYVINAHLRRHDTYRSWFELRDTDHIVRHSIADPADIEFVPTTHGEMTSADLRQHIVATPDSLHWDCFSFGVIQRA
DSFTFYASIDHLHADGQFVGVGLMEFQSMYTALIMGEPPIGLSEAGSYVDFCVRQHEYTSALTVDSPEVRAWIDFAEINN
GTFPEFPLPLGDPSVRCGGDLLSMMLMDEQQTQRFESACMAANARFIGGMLACIAIAIHELTGADTYFGITPKDIRTPAD
LMTQGWFTGQIPVTVPVAGLSFNEIARIAQTSFDTGADLAKVPFERVVELSPSLRRPQPLFSLVNFFDAQVGPLSAVTKL
FEGLNVGTYSDGRVTYPLSTMVGRFDETAASVLFPDNPVARESVTAYLRAIRSVCMRIANGGTAERVGNVVALSPGRRNN
IERMTWRSCRAGDFIDICNLKVANVTVDREA
>P9WIK7 2.3.1.288~~~papA2~~~Trehalose-2-sulfate acyltransferase PapA2~~~COG1020
MFSITTLRDWTPDPGSIICWHASPTAKAKARQAPISEVPPSYQQAQHLRRYRDHVARGLDMSRLMIFTWDLPGRCNIRAM
NYAINAHLRRHDTYHSWFEFDNAEHIVRHTIADPADIEVVQAEHQNMTSAELRHHIATPQPLQWDCFLFGIIQSDDHFTF
YASIAHLCVDPMIVGVLFIEIHMMYSALVGGDPPIELPPAGRYDDHCVRQYADTAALTLDSARVRRWVEFAANNDGTLPH
FPLPLGDLSVPHTGKLLTETLMDEQQGERFEAACVAAGARFSGGVFACAALAERELTNCETFDVVTTTDTRRTPTELRTT
GWFTGLVPITVPVASGLFDSAARVAQISFDSGKDLATVPFDRVLELARPETGLRPPRPGNFVMSFLDASIAPLSTVANSD
LNFRIYDEGRVSHQVSMWVNRYQHQTTVTVLFPDNPIASESVANYIAAMKSIYIRTADGTLATLKPGT
>P9WIK5 2.3.1.278~~~papA3~~~Acyltransferase PapA3~~~COG1020
MLRVGPLTIGTLDDWAPSTGSTVSWRPSAVAHTKASQAPISDVPVSYMQAQHIRGYCEQKAKGLDYSRLMVVSCQQPGQC
DIRAANYVINAHLRRHDTYRSWFQYNGNGQIIRRTIQDPADIEFVPVHHGELTLPQIREIVQNTPDPLQWGCFRFGIVQG
CDHFTFFASVDHVHVDAMIVGVTLMEFHLMYAALVGGHAPLELPPAGSYDDFCRRQHTFSSTLTVESPQVRAWTKFAEGT
NGSFPDFPLPLGDPSKPSDADIVTVMMLDEEQTAQFESVCTAAGARFIGGVLACCGLAEHELTGTTTYYGLTPRDTRRTP
ADAMTQGWFTGLIPITVPIAGSAFGDAARAAQTSFDSGVKLAEVPYDRVVELSSTLTMPRPNFPVVNFLDAGAAPLSVLL
TAELTGTNIGVYSDGRYSYQLSIYVIRVEQGTAVAVMFPDNPIARESVARYLATLKSVFQRVAESGQQQNVA
>P9WIN5 2.3.1.282~~~papA5~~~Phthiocerol/phthiodiolone dimycocerosyl transferase~~~COG1020
MFPGSVIRKLSHSEEVFAQYEVFTSMTIQLRGVIDVDALSDAFDALLETHPVLASHLEQSSDGGWNLVADDLLHSGICVI
DGTAATNGSPSGNAELRLDQSVSLLHLQLILREGGAELTLYLHHCMADGHHGAVLVDELFSRYTDAVTTGDPGPITPQPT
PLSMEAVLAQRGIRKQGLSGAERFMSVMYAYEIPATETPAVLAHPGLPQAVPVTRLWLSKQQTSDLMAFGREHRLSLNAV
VAAAILLTEWQLRNTPHVPIPYVYPVDLRFVLAPPVAPTEATNLLGAASYLAEIGPNTDIVDLASDIVATLRADLANGVI
QQSGLHFGTAFEGTPPGLPPLVFCTDATSFPTMRTPPGLEIEDIKGQFYCSISVPLDLYSCAVYAGQLIIEHHGHIAEPG
KSLEAIRSLLCTVPSEYGWIME
>P04127 ~~~papA~~~Pap fimbrial major pilin protein~~~
MIKSVIAGAVAMAVVSFGVNNAAPTIPQGQGKVTFNGTVVDAPCSISQKSADQSIDFGQLSKSFLEAGGVSKPMDLDIEL
VNCDITAFKGGNGAKKGTVKLAFTGPIVNGHSDELDTNGGTGTAIVVQGAGKNVVFDGSEGDANTLKDGENVLHYTAVVK
KSSAVGAAVTEGAFSAVANFNLTYQ
>P04744 ~~~papB~~~Major pilu subunit operon regulatory protein PapB~~~
MAHHEVISRSGNAFLLNIRESVLLPGSMSEMHFFLLIGISSIHSDRVILAMKDYLVGGHSRKEVCEKYQMNNGYFSTTLG
RLIRLNALAARLAPYYTDESSAFD
>P07110 ~~~papC~~~Outer membrane usher protein PapC~~~
MKDRIPFAVNNITCVILLSLFCNAASAVEFNTDVLDAADKKNIDFTRFSEAGYVLPGQYLLDVIVNGQSISPASLQISFV
EPALSGDKAEKKLPQACLTSDMVRLMGLTAESLDKVVYWHDGQCADFHGLPGVDIRPDTGAGVLRINMPQAWLEYSDATW
LPPSRWDDGIPGLMLDYNLNGTVSRNYQGGDSHQFSYNGTVGGNLGPWRLRADYQGSQEQSRYNGEKTTNRNFTWSRFYL
FRAIPRWRANLTLGENNINSDIFRSWSYTGASLESDDRMLPPRLRGYAPQITGIAETNARVVVSQQGRVLYDSMVPAGPF
SIQDLDSSVRGRLDVEVIEQNGRKKTFQVDTASVPYLTRPGQVRYKLVSGRSRGYGHETEGPVFATGEASWGLSNQWSLY
GGAVLAGDYNALAAGAGWDLGVPGTLSADITQSVARIEGERTFQGKSWRLSYSKRFDNADADITFAGYRFSERNYMTMEQ
YLNARYRNDYSSREKEMYTVTLNKNVADWNTSFNLQYSRQTYWDIRKTDYYTVSVNRYFNVFGLQGVAVGLSASRSKYLG
RDNDSAYLRISVPLGTGTASYSGSMSNDRYVNMAGYTDTFNDGLDSYSLNAGLNSGGGLTSQRQINAYYSHRSPLANLSA
NIASLQKGYTSFGVSASGGATITGKGAALHAGGMSGGTRLLVDTDGVGGVPVDGGQVVTNRWGTGVVTDISSYYRNTTSV
DLKRLPDDVEATRSVVESALTEGAIGYRKFSVLKGKRLFAILRLADGSQPPFGASVTSEKGRELGMVADEGLAWLSGVTP
GETLSVNWDGKIQCQVNVPETAISDQQLLLPCTPQK
>P15319 ~~~papD~~~Chaperone protein PapD~~~
MIRKKILMAAIPLFVISGADAAVSLDRTRAVFDGSEKSMTLDISNDNKQLPYLAQAWIENENQEKIITGPVIATPPVQRL
EPGAKSMVRLSTTPDISKLPQDRESLFYFNLREIPPRSEKANVLQIALQTKIKLFYRPAAIKTRPNEVWQDQLILNKVSG
GYRIENPTPYYVTVIGLGGSEKQAEEGEFETVMLSPRSEQTVKSANYNTPYLSYINDYGGRPVLSFICNGSRCSVKKEK
>P08407 ~~~papE~~~Fimbrial protein PapE~~~
MKKIRGLCLPVMLGAVLMSQHVHAVDNLTFRGKLIIPACTVSNTTVDWQDVEIQTLSQNGNHEKEFTVNMRCPYNLGTMK
VTITATNTYNNAILVQNTSNTSSDGLLVYLYNSNAGNIGTAITLGTPFTPGKITGNNADKTISLHAKLGYKGNMQNLIAG
PFSATATLVASYS
>P08408 ~~~papF~~~Fimbrial adapter PapF~~~
MIRLSLFISLLLTSVAVLADVQINIRGNVYIPPCTINNGQNIVVDFGNINPEHVDNSRGEVTKTISISCPYKSGSLWIKV
TGNTMGGGQNNVLATNITHFGIALYQGKGMSTPLILGNGSGNGYGVTAGLDTARSTFTFTSVPFRNGSGILNGGDFQTTA
SMSMIYN
>P13720 ~~~papGI~~~Fimbrial adhesin PapGI~~~
MKKWFPAFLFLSLSGGNDALAGWHNVMFYAFNDYLTTNAGNVKVIDQPQLYIPWNTGSATATYYSCSGPEFASGVYFQEY
LAWMVVPKHVYTNEGFNIFLDVQSKYGWSMENENDKDFYFFVNGYEWDTWTNNGARICFYPGNMKQLNNKFNDLVFRVLL
PVDLPKGHYNFPVRYIRGIQHHYYDLWQDHYKMPYDQIKQLPATNTLMLSFDNVGGCQPSTQVLNIDHGSIVIDRANGNI
ASQTLSIYCDVPVSVKISLLRNTPPIYNNNKFSVGLGNGWDSIISLDGVEQSEEILRWYTAGSKTVKIESRLYGEEGKRK
PGELSGSMTMVLSFP
>Q47450 ~~~papGII~~~Fimbrial adhesin PapGII~~~
MKKWFPALLFSLCVSGESSAWNNIVFYSLGDVNSYQGGNVVITQRPQFITSWRPGIATVTWNQCNGPEFADGFWAYYREY
IAWVVFPKKVMTQNGYPLFIEVHNKGSWSEENTGDNDSYFFLKGYKWDERAFDAGNLCQKPGEITRLTEKFDDIIFKVAL
PADLPLGDYSVKIPYTSGMQRHFASYLGARFKIPYNVAKTLPRENEMLFLFKNIGGCRPSAQSLEIKHGDLSINSANNHY
AAQTLSVSCDVPANIRFMLLRNTTPTYSHGKKFSVGLGHGWDSIVSVNGVDTGETTMRWYKAGTQNLTIGSRLYGESSKI
QPGVLSGSATLLMILP
>P07111 ~~~papH~~~PAP fimbrial minor pilin protein~~~
MRLRFSVPLFFFGCVFVHGVFAGPFPPPGMSLPEYWGEEHVWWDGRAAFHGEVVRPACTLAMEDAWQIIDMGETPVRDLQ
NGFSGPERKFSLRLRNCEFNSQGGNLFSDSRIRVTFDGVRGETPDKFNLSGQAKGINLQIADVRGNIARAGKVMPAIPLT
GNEEALDYTLRIVRNGKKLEAGNYFAVLGFRVDYE
>P86242 ~~~pi~~~Papain inhibitor~~~
MREFRRVRRVRFAACALVAAATGITLAAGPASADIPIGQKMTGKMTYYTDKGYGACGTPIDASSQDLVAIPAAWWTTPNP
NNDPLCRGVSVEVSYNGRTIRVPVRDKCPSCDRTHIDLSQAAFAKLAPLDRGVVNGITWKFVR
>P62532 ~~~papK~~~Fimbrial adapter PapK~~~
MIKSTGALLLFAALSAGQAIASDVAFRGNLLDRPCHVSGDSLNKHVVFKTRASRDFWYPPGRSPTESFVIRLENCHATAV
GKIVTLTFKGTEEAALPGHLKVTGVNAGRLGIALLDTDGSSLLKPGTSHNKGQGEKVTGNSLELPFGAYVVATPEALRTK
SVVPGDYEATATFELTYR
>P72542 2.1.1.-~~~papM~~~4-amino-L-phenylalanine/4-methylamino-L-phenylalanine methyltransferase~~~
MTAAAPTLAQALDEATGQLTGAGITADAARADTRLLAAHACQVAPGDLDTCLAGPVPPRFWHYVRRRLTREPAERIVGHA
YFMGHRFDLAPGVFVPKPETEEITRDAIARLEALVRRGTTAPLVVDLCAGPGTMAVTLARHVPAARVLGIELSQAAARAA
RRNARGTGARIVQGDARDAFPELSGTVDLVVTNPPYIPIGLRTSAPEVLEHDPPLALWAGEEGLGMIRAMERTAARLLAP
GGVLLLEHGSYQLASVPALFRATGRWSHASSRPTCNDGCLTAVRNHTCAPPA
>P39155 3.9.1.2~~~ywlE~~~Protein-arginine-phosphatase~~~COG0394
MDIIFVCTGNTCRSPMAEALFKSIAEREGLNVNVRSAGVFASPNGKATPHAVEALFEKHIALNHVSSPLTEELMESADLV
LAMTHQHKQIIASQFGRYRDKVFTLKEYVTGSHGDVLDPFGGSIDIYKQTRDELEELLRQLAKQLKKDRR
>S0F332 3.9.1.2~~~ywle~~~Protein-arginine-phosphatase~~~
MPYRILFVCTGNTCRSPMAAALLENKQLPGVEVKSAGVFAAEGSEASVHAKMVLKEKGIEAAHRSSQLKKEHIDWATHVL
AMTSGHKDMIVERFPEAKDKTFTLKQFVSGTDGDIADPFGGPIEVYRAARDELETLIDRLAEKLQTEQ
>B8GW31 ~~~parA~~~Chromosome partitioning protein ParA~~~
MSANPLRVLAIANQKGGVGKTTTAINLGTALAACGERVLLIDADPQGNCSTGLGIGRTQRRTTLYDVLMGEAPVVDAAVK
TELPGLDVIPADADLSGVEIELGQTARRSYRLRDALEAIRANGPYTYVLIDCPPSLNVLTVNAMTAADAVFVPLQCEFFA
LEGLTQLMRTIERVRGSLNPRLEIQGVVLTMYDRRNSLSEQVAKDVRAHFGDKVYDAVIPRNVRVSEAPSFGKPVLLYDL
KCAGSQAYLKLAREVISRERDRQAKAA
>P07620 ~~~parA~~~Plasmid partition protein A~~~
MSDSSQLHKVAQRANRMLNVLTEQVQLQKDELHANEFYQVYAKAALAKLPLLTRANVDYAVSEMEEKGYVFDKRPAGSSM
KYAMSIQNIIDIYEHRGVPKYRDRYSEAYVIFISNLKGGVSKTVSTVSLAHAMRAHPHLLMEDLRILVIDLDPQSSATMF
LSHKHSIGIVNATSAQAMLQNVSREELLEEFIVPSVVPGVDVMPASIDDAFIASDWRELCNEHLPGQNIHAVLKENVIDK
LKSDYDFILVDSGPHLDAFLKNALASANILFTPLPPATVDFHSSLKYVARLPELVKLISDEGCECQLATNIGFMSKLSNK
ADHKYCHSLAKEVFGGDMLDVFLPRLDGFERCGESFDTVISANPATYVGSADALKNARIAAEDFAKAVFDRIEFIRSN
>P22997 3.1.-.-~~~parB~~~Protein ParB~~~
MKRRSYAMLRAAAALAVLVVASPAWAELRGEVVRIIDGDTIDVLVDKQPVRVRLVDIDAPEKRQAFGERARQALAGMVFR
RHVLVDEKDTDRYGRTLGTVWVNMELASRPPQPRNVNAAMVHQGMAWAYRFHGRAADPEMLRLEQEARGKRVGLWSDPHA
VEPWKWRRESNNRRDEG
>P0CAV8 ~~~parB~~~Chromosome-partitioning protein ParB~~~COG1475
MESVVVGEPGMSEGRRGLGRGLSALLGEVDAAPAQAPGEQLGGSREAPIEILQRNPDQPRRTFREEDLEDLSNSIREKGV
LQPILVRPSPDTAGEYQIVAGERRWRAAQRAGLKTVPIMVRELDDLAVLEIGIIENVQRADLNVLEEALSYKVLMEKFER
TQENIAQTIGKSRSHVANTMRLLALPDEVQSYLVSGELTAGHARAIAAAADPVALAKQIIEGGLSVRETEALARKAPNLS
AGKSKGGRPPRVKDTDTQALESDLSSVLGLDVSIDHRGSTGTLTITYATLEQLDDLCNRLTRGI
>B8GW30 ~~~parB~~~Chromosome-partitioning protein ParB~~~
MESVVVGEPGMSEGRRGLGRGLSALLGEVDAAPAQAPGEQLGGSREAPIEILQRNPDQPRRTFREEDLEDLSNSIREKGV
LQPILVRPSPDTAGEYQIVAGERRWRAAQRAGLKTVPIMVRELDDLAVLEIGIIENVQRADLNVLEEALSYKVLMEKFER
TQENIAQTIGKSRSHVANTMRLLALPDEVQSYLVSGELTAGHARAIAAAADPVALAKQIIEGGLSVRETEALARKAPNLS
AGKSKGGRPPRVKDTDTQALESDLSSVLGLDVSIDHRGSTGTLTITYATLEQLDDLCNRLTRGI
>Q83AH2 ~~~parB~~~Probable chromosome-partitioning protein ParB~~~COG1475
MSMTQKRGLGRGLSDLGLNELLTEINDASLADSKTELKKLTIDVIQPGRYQPRRQMDKDALEELANSIRAQGIIQPIVVR
PVGQRYEIIAGERRWRAAQLAGLKEVPAVIRPITDEAAITMSLIENIQRQNLNAIEEAAALQRLLDEFKMTHEEIAEAVG
KSRTSVTNSLRLLKLNPDVKALLEQGHLDMGHARALLALEGFQQSEAANIIIKRALSVRETEKLIQHWQSEGKSSANRPS
MDPDVARLQHHLSDKLGAAVTIRHGAKGKGKLIIHYNSADELEGILDRIR
>O25758 ~~~parB~~~Probable chromosome-partitioning protein ParB~~~COG1475
MAKNKVLGRGLADIFPEINEVYEQGLYERANRVVELGIDEVMPNPYQPRKVFSEDSLEELAQSIKEHGLLQPVLVVSENG
RYHLIAGERRLRASKLAKMPTIKAIVVDIEQEKMREVALIENIQREDLNPLELARSYKELLESYQMTQEELSKIVKKSRA
HVANIMRLLTLSSKVQNALLEEKITSGHAKVLVGLDGEKQELILNSIIGQKLSVRQTEDLARDFKINANFDNKKHGFKQT
QTLIAGDELERLNQSLWDHYKLKAALKGNKIVLRCYENSLLEAFMKKMMS
>P9WIJ9 ~~~parB~~~Probable chromosome-partitioning protein ParB~~~COG1475
MTQPSRRKGGLGRGLAALIPTGPADGESGPPTLGPRMGSATADVVIGGPVPDTSVMGAIYREIPPSAIEANPRQPRQVFD
EEALAELVHSIREFGLLQPIVVRSLAGSQTGVRYQIVMGERRWRAAQEAGLATIPAIVRETGDDNLLRDALLENIHRVQL
NPLEEAAAYQQLLDEFGVTHDELAARIGRSRPLITNMIRLLKLPIPVQRRVAAGVLSAGHARALLSLEAGPEAQEELASR
IVAEGLSVRATEETVTLANHEANRQAHHSDATTPAPPRRKPIQMPGLQDVAERLSTTFDTRVTVSLGKRKGKIVVEFGSV
DDLARIVGLMTTDGRDKGLHRDAL
>P0AFI2 5.6.2.2~~~parC~~~DNA topoisomerase 4 subunit A~~~COG0188
MSDMAERLALHEFTENAYLNYSMYVIMDRALPFIGDGLKPVQRRIVYAMSELGLNASAKFKKSARTVGDVLGKYHPHGDS
ACYEAMVLMAQPFSYRYPLVDGQGNWGAPDDPKSFAAMRYTESRLSKYSELLLSELGQGTADWVPNFDGTLQEPKMLPAR
LPNILLNGTTGIAVGMATDIPPHNLREVAQAAIALIDQPKTTLDQLLDIVQGPDYPTEAEIITSRAEIRKIYENGRGSVR
MRAVWKKEDGAVVISALPHQVSGARVLEQIAAQMRNKKLPMVDDLRDESDHENPTRLVIVPRSNRVDMDQVMNHLFATTD
LEKSYRINLNMIGLDGRPAVKNLLEILSEWLVFRRDTVRRRLNYRLEKVLKRLHILEGLLVAFLNIDEVIEIIRNEDEPK
PALMSRFGLTETQAEAILELKLRHLAKLEEMKIRGEQSELEKERDQLQGILASERKMNNLLKKELQADAQAYGDDRRSPL
QEREEAKAMSEHDMLPSEPVTIVLSQMGWVRSAKGHDIDAPGLNYKAGDSFKAAVKGKSNQPVVFVDSTGRSYAIDPITL
PSARGQGEPLTGKLTLPPGATVDHMLMESDDQKLLMASDAGYGFVCTFNDLVARNRAGKALITLPENAHVMPPVVIEDAS
DMLLAITQAGRMLMFPVSDLPQLSKGKGNKIINIPSAEAARGEDGLAQLYVLPPQSTLTIHVGKRKIKLRPEELQKVTGE
RGRRGTLMRGLQRIDRVEIDSPRRASSGDSEE
>P26973 5.6.2.2~~~parC~~~DNA topoisomerase 4 subunit A~~~
MSDMAERLALHEFTENAYLNYSMYVIMDRALPFIGDGLKPVQRRIVYAMSELGLNATAKFKKSARTVGDVLGKYHPHGDS
ACYEAMVLMAQPFSYRYPLVDGQGNWGAPDDPKSFAAMRYTESRLSKYAELLLSELGQGTADWVPNFDGTMQEPKMLPAR
LPNILLNGTTGIAVGMATDIPPHNLREVAKAAITLIEQPKTTLDQLLDIVQGPDYPTEAEIITPRAEIRKIYENGRGSVR
MRAVWTKEDGAVVISALPHQVSGAKVLEQIAAQMRNKKLPMVDDLRDESDHENPTRLVIVPRSNRVDMEQVMNHLFATTD
LEKSYRINLNMIGLDGRPAVKNLLEILTEWLAFRRDTVRRRLNYRLEKVLKRLHILEGLLVAFLNIDEVIEIIRSEDEPK
PALMSRFGISETQAEAILELKLRHLAKLEEMKIRGEQDELEKERDQLQGILASERKMNTLLKKELQADADAYGDDRRSPL
REREEAKAMSEHDMLPSEPVTIVLSQMGWVRSAKGHDIDAPGLNYKAGDSFKAAVKGKSNQPVVFIDTTGRSYAIDPITL
PSARGQGEPLTGKLTLPPGATVEHMLMEGDDQKLLMASDAGYGFVCTFNDLVARNRAGKTLITLPENAHVMPPLVIEDEH
DMLLAITQAGRMLMFPVDSLPQLSKGKGNKIINIPSAEAAKGDDGLAHLYVLPPQSTLTIHVGKRKIKLRPEELQKVVGE
RGRRGTLMRGLQRIDRIEIDSPHRVSHGDSEE
>Q2FYS4 5.6.2.2~~~parC~~~DNA topoisomerase 4 subunit A~~~COG0188
MSEIIQDLSLEDVLGDRFGRYSKYIIQERALPDVRDGLKPVQRRILYAMYSSGNTHDKNFRKSAKTVGDVIGQYHPHGDS
SVYEAMVRLSQDWKLRHVLIEMHGNNGSIDNDPPAAMRYTEAKLSLLAEELLRDINKETVSFIPNYDDTTLEPMVLPSRF
PNLLVNGSTGISAGYATDIPPHNLAEVIQATLKYIDNPDITVNQLMKYIKGPDFPTGGIIQGIDGIKKAYESGKGRIIVR
SKVEEETLRNGRKQLIITEIPYEVNKSSLVKRIDELRADKKVDGIVEVRDETDRTGLRIAIELKKDVNSESIKNYLYKNS
DLQISYNFNMVAISDGRPKLMGIRQIIDSYLNHQIEVVANRTKFELDNAEKRMHIVEGLIKALSILDKVIELIRSSKNKR
DAKENLIEVYEFTEEQAEAIVMLQLYRLTNTDIVALEGEHKELEALIKQLRHILDNHDALLNVIKEELNEIKKKFKSERL
SLIEAEIEEIKIDKEVMVPSEEVILSMTRHGYIKRTSIRSFNASGVEDIGLKDGDSLLKHQEVNTQDTVLVFTNKGRYLF
IPVHKLADIRWKELGQHVSQIVPIEEDEVVINVFNEKDFNTDAFYVFATQNGMIKKSTVPLFKTTRFNKPLIATKVKEND
DLISVMRFEKDQLITVITNKGMSLTYNTSELSDTGLRAAGVKSINLKAEDFVVMTEGVSENDTILMATQRGSLKRISFKI
LQVAKRAQRGITLLKELKKNPHRIVAAHVVTGEHSQYTLYSKSNEEHGLINDIHKSEQYTNGSFIVDTDDFGEVIDMYIS
>Q93KF4 5.6.2.2~~~parC~~~DNA topoisomerase 4 subunit A~~~
MSEIIQDLSLEDVLGDRFGRYSKYIIQERALPDVRDGLKPVQRRILYAMYSSGNTHDKNFRKSAKTVGDVIGQYHPHGDS
SVYEAMVRLSQDWKLRHVLIEMHGNNGSIDNDPPAAMRYTEAKLSLLAEELLRDINKETVSFIPNYDDTTLEPMVLPSRF
PNLLVNGSTGISAGYATDIPPHNLAEVIQATLKYIDNPDITVNQLMKYIKGPDFPTGGIIQGIDGIKKAYESGKGRIIVR
SKVEEETLRNGRKQLIITEIPYEVNKSSLVKRIDELRADKKVDGIVEVRDETDRTGLRIAIELKKDVNSESIKNYLYKNS
DLQISYNFNMVAISDGRPKLMGIRQIIDSYLNHQIEVVANRTKFELDNAEKRMHIVEGLIKALSILDKVIELIRSSKNKR
DAKENLIEVFEFTEEQAEAIVMLQLYRLTNTDIVALEGEHKELEALIKQLRHILDNHDALLNVIKEELNEIKKKFKSERL
SLIEAEIEEIKIDKEVMVPSEEVILSMTRHGYIKRTSIRSFNASGVEDIGLKDGDSLLKHQEVNTQDTVLVFTNKGRYLF
IPVHKLADIRWKELGQHVSQIVPIEEDEVVINVFNEKDFNTDAFYVFATQNGMIKKSTVPLFKTTRFNKPLIATKVKEND
DLISVMRFEKDQLITVITNKGMSLTYNTSELSDTGLRAAGVKSINLKAEDFVVMTEGVSENDTILMATQRGSLKRISFKI
LQVAKRAQRGITLLKELKKNPHRIVAAHVVTGEHSQYTLYSKSNEEHGLINDIHKSEQYTNGSFIVDTDDFGEVIDMYIS
>Q6G9K4 5.6.2.2~~~parC~~~DNA topoisomerase 4 subunit A~~~
MSEIIQDLSLEDVLGDRFGRYSKYIIQERALPDVRDGLKPVQRRILYAMYSSGNTHDKNFRKSAKTVGDVIGQYHPHGDS
SVYEAMVRLSQDWKLRHVLIEMHGNNGSIDNDPPAAMRYTEAKLSLLAEELLRDINKETVSFIPNYDDTTLEPMVLPSRF
PNLLVNGSTGISAGYATDIPPHNLAEVIQATLKYIDNPDITVNQLMKYIKGPDFPTGGIIQGIDGIKKAYESGKGRIIVR
SKVEEETLRNGRKQLIITEIPYEVNKSSLVKRIDELRADKKVDGIVEVRDETDRTGLRIAIELKKDVNSESIKNYLYKNS
DLQISYNFNMVAISDGRPKLMGIRQIIDSYLNHQIEVVANRTKFELDNAEKRMHIVEGLIKALSILDKVIELIRSSKNKR
DAKENLIEVYEFTEEQAEAIVMLQLYRLTNTDIVALEGEHKELEALIKQLRHILDNHDALLNVIKEELNEIKKKFKSERL
SLIEAEIEEIKIDKEVMVPSEEVILSMTRHGYIKRTSIRSFNASGVEDIGLKDGDSLLKHQEVNTQDTVLVFTNKGRYLF
IPVHKLADIRWKELGQHVSQIVPIEEDEVVINVFNEKDFNTDAFYVFATQNGMIKKSTVPLFKTTRFNKPLIATKVKEND
DLISVMRFEKDQLITVITNKGMSLTYNTSELSDTGLRAAGVKSINLKAEDFVVVTEGVSENDTILMATQRGSLKRISFKI
LQVAKRAQRGITLLKELKKNPHRIVAAHVVTGEHSQYTLYSKSNEEHGLINDIHKSEQYTNGSFIVDTDDFGEVIDMYIS
>P72525 5.6.2.2~~~parC~~~DNA topoisomerase 4 subunit A~~~COG0188
MSNIQNMSLEDIMGERFGRYSKYIIQDRALPDIRDGLKPVQRRILYSMNKDSNTFDKSYRKSAKSVGNIMGNFHPHGDSS
IYDAMVRMSQNWKNREILVEMHGNNGSMDGDPPAAMRYTEARLSEIAGYLLQDIEKKTVPFAWNFDDTEKEPTVLPAAFP
NLLVNGSTGISAGYATDIPPHNLAEVIDAAVYMIDHPTAKIDKLMEFLPGPDFPTGAIIQGRDEIKKAYETGKGRVVVRS
KTEIEKLKGGKEQIVIIEIPYEINKANLVKKIDDVRVNNKVAGIAEVRDESDRDGLRIAIELKKDANTELVLNYLFKYTD
LQINYNFNMVAIDNFTPRQVGIVPILSSYIAHRREVILARSRFDKEKAEKRLHIVEGLIRVISILDEVIALIRASENKAD
AKENLKVSYDFTEEQAEAIVTLQLYRLTNTDVVVLQEEEAELREKIAMLAAIIGDERTMYNLMKKELREVKKKFATPRLS
SLEDTAKAIEIDTASLIAEEDTYVSVTKAGYIKRTSPRSFAASTLEEIGKRDDDRLIFVQSAKTTQHLLMFTSLGNVIYR
PIHELADIRWKDIGEHLSQTITNFETNEEILYVEVLDQFDDATTYFAVTRLGQIKRVERKEFTPWRTYRSKSVKYAKLKD
DTDQIVAVAPIKLDDVVLVSQNGYALRFNIEEVPVVGAKAAGVKAMNLKEDDVLQSGFICNTSSFYLLTQRGSLKRVSIE
EILATSRAKRGLQVLRELKNKPHRVFLAGAVAEQGFVGDFFSTEVDVNDQTLLVQSNKGTIYESRLQDLNLSERTSNGSF
ISDTISDEEVFDAYLQEVVTEDK
>P58091 ~~~parD1~~~Antitoxin ParD1~~~COG3609
MASKNTSVVLGDHFQAFIDSQVADGRYGSASEVIRAGLRLLEENEAKLAALRAALIEGEESGFIEDFDFDAFIEERSRAS
APQGFHEE
>P9WIJ7 ~~~parD1~~~Antitoxin ParD1~~~COG3609
MGKNTSFVLDEHYSAFIDGEIAAGRYRSASEVIRSALRLLEDRETQLRALREALEAGERSGSSTPFDFDGFLGRKRADAS
RGR
>P9WJ75 ~~~parD2~~~Antitoxin ParD2~~~
MVVNRALLASVDALSRDEQIELVEHINGNLAEGMHISEANQALIEARANDTDDAHWSTIDDFDKRIRARLG
>P0CW74 ~~~parD3~~~Antitoxin ParD3~~~
MNKPAKPAADDVDDLFGRPLTPAEEDTWFEHNREAIGQLVDEAWAEFERGEYDERSFAEIIAQGVAEHNAKR
>Q9A458 ~~~parD4~~~Antitoxin ParD4~~~COG3609
MPSNGIVRSWRRAMATMNVSLPDAMREWVEGQTQSGRYHNASEYVRDLIRRDQERADKIAHLQRLIDEGLDSGVGERSLH
EIRAEARRRAGVDHEL
>P22995 ~~~parD~~~Antitoxin ParD~~~
MSRLTIDMTDQQHQSLKALAALQGKTIKQYALERLFPGDADADQAWQELKTMLGNRINDGLAGKVSTKSVGEILDEELSG
DRA
>P58093 ~~~parD~~~Antitoxin ParD~~~
MAKNTSITLGEHFDGFITSQIQSGRYGSASEVIRSALRLLENQETKLQSLRQLLIEGEQSGDADYDLDSFINELDSENIR
>Q9A9T8 ~~~parE1~~~Toxin ParE1~~~COG3668
MKPYRLSRRAKADLDDIWTYSEQRWGVEQAADYARELQATIEMIAEHPGMGQPDENLRAGYRRCASGSHVVFYRVGVRVE
IIRVLHQSMNARAHLG
>P9WHG7 ~~~parE1~~~Toxin ParE1~~~COG3668
MSSRYLLSPAAQAHLEEIWDCTYDRWGVDQAEQYLRELQHAIDRAAANPRIGRACDEIRPGYRKLSAGSHTLFYRVTGEG
TIDVVRVLHQRMDVDRNL
>P9WHG5 ~~~parE2~~~Toxin ParE2~~~COG3668
MTRRLRVHNGVEDDLFEAFSYYADAAPDQIDRLYNLFVDAVTKRIPQAPNAFAPLFKHYRHIYLRPFRYYVAYRTTDEAI
DILAVRHGMENPNAVEAEISGRTFE
>Q9A4S4 ~~~parE3~~~Toxin ParE3~~~COG3668
MGRVIRTRPVSGDLDRVFRDVCENNGVKVASAQLNRIESVFHRLSAFPRLGRDRSDLRPGLRTFSVKPWQVLYRLNGEDV
VILRILDGRMNLAAQLGKKT
>Q9A459 ~~~parE4~~~Toxin ParE4~~~COG3668
MWIMSYRLSRKAEQDLIDIYVAGVGLFGVAQAERYQDTLEAAFGAIAAFPHIGRERPELRPPVRVHPCKSHIILYVLDER
GALIVRVRHAGEDWVGEAGG
>P20083 5.6.2.2~~~parE~~~DNA topoisomerase 4 subunit B~~~COG0187
MTQTYNADAIEVLTGLEPVRRRPGMYTDTTRPNHLGQEVIDNSVDEALAGHAKRVDVILHADQSLEVIDDGRGMPVDIHP
EEGVPAVELILCRLHAGGKFSNKNYQFSGGLHGVGISVVNALSKRVEVNVRRDGQVYNIAFENGEKVQDLQVVGTCGKRN
TGTSVHFWPDETFFDSPRFSVSRLTHVLKAKAVLCPGVEITFKDEINNTEQRWCYQDGLNDYLAEAVNGLPTLPEKPFIG
NFAGDTEAVDWALLWLPEGGELLTESYVNLIPTMQGGTHVNGLRQGLLDAMREFCEYRNILPRGVKLSAEDIWDRCAYVL
SVKMQDPQFAGQTKERLSSRQCAAFVSGVVKDAFILWLNQNVQAAELLAEMAISSAQRRMRAAKKVVRKKLTSGPALPGK
LADCTAQDLNRTELFLVEGDSAGGSAKQARDREYQAIMPLKGKILNTWEVSSDEVLASQEVHDISVAIGIDPDSDDLSQL
RYGKICILADADSDGLHIATLLCALFVKHFRALVKHGHVYVALPPLYRIDLGKEVYYALTEEEKEGVLEQLKRKKGKPNV
QRFKGLGEMNPMQLRETTLDPNTRRLVQLTIDDEDDQRTDAMMDMLLAKKRSEDRRNWLQEKGDMAEIEV
>Q79EC5 ~~~parE~~~Toxin ParE~~~
MTAYILTAEAEADLRGIIRYTRREWGAAQVRRYIAKLEQGIARLAAGEGPFKDMSELFPALRMARCEHHYVFCLPRAGEP
ALVVAILHERMDLMTRLADRLKG
>H7C794 5.6.2.2~~~parE~~~DNA topoisomerase 4 subunit B~~~COG0187
MAKKINNEYNDASIQVLEGLEAVRKRPGMYIGSTDSRGLHHLVYEIVDNAVDEALSGYGNEINVTIQKDNSICVADSGRG
MPTGMHASGIPTVEVIFTVLHAGGKFGQGGYKTSGGLHGVGASVVNALSKWLEVHIVRDGVEYMERFEDGGKPVGTLKKI
GKTKKRNGTSVTFLPDDTIFSTTNFSYEILAERLRESAFLLKGVKITLTDERGEEPKEEVFHYEEGIKEFVAYLNEEKDT
LTPVVYFSGAKEGIEVELAYQYNDGYSENVLSFVNNVRTKDGGTHEVGMKTSMTKAYNEYARKVGLLKEKDKNLEGSDFR
EGLAAVLSIRVPENLLQFEGQTKGKLGTPLARTVVDNVVGEQMGFYLQENSEMSQSLIRKAIKAREAREAARKAREESRN
GKKRKKGESLLSGKLTPAQSRNPKKNELYLVEGDSAGGSAKQGRDRKFQAILPLRGKVINTEKAKMQDILKNEEINTMIY
TIGAGVGPEFSIEDCNYDKIIIMTDADTDGAHIQVLLLTFFYRYMKPLIEAGKVYIALPPLYKVSKGTGKKSVIEYAWTD
GELAEVIDKVGKGYMLQRYKGLGEMNAEQLWETTMDPETRTLIRVRIDDAAQAERRVTTLMGDKVEPRRKWIEQHVQFTL
EEDGSILDRSEEDTSAPTGESLLDAEKTKEAEQTDDTEISLFDIE
>A0A0J9WZF0 5.6.2.2~~~parE~~~DNA topoisomerase 4 subunit B~~~
MQNYNAKSIEVLTGLDPVKKRPGMYTNIENPNHLIQEIIDNSVDEVLAGFASKINITLYEDNSIEVADDGRGMPVDIHPE
HKMSGIELIMTKLHSGGKFSNKNYTHSGGLHGVGVSVVNALSTRLEAEIKRDGNVYHIVFEDGFKTKDLEIIDNVGKKNT
GTKIRFWPNKKYFDDIKVNFKALKNLLEAKAILCKALTIKYSNEIKKEKLTWHFETGLKGYLDHKLEAETLPAEPFIIDN
FSNGDSYLDAVFCWCEDLSESIKNSYVNLIPTPQDGTHVTGLKNGIYDAIKAYIEKNSLSVKNIKITANDSFAQLNYVIS
VKITNPQFAGQTKEKLSNKDVTNFVATAVKDLLTIWLNQNPDEARQIVENISKVAQKRINADKKTTRKRIMNTTIRLPGK
LTDCISSDVNSTELFIVEGDSAGGSAKQARDKNFQAVLPLKGKILNSWELDADTIMNSQEIHNIATAIGVDPDSDDISAL
RYNKICILADADSDGLHIATLLCAMFLKHFRKLIENGHIYIAQPPLFRIDIGKSTFYALDENERDTILTKNSKLPGKVNI
MRFKGLGEMNPAQLRESAMDVSSRRLLQLTISDVYDDTEMLDMLLAKKRAKDRRDWLENYGDRASVE
>P0A2I5 5.6.2.2~~~parE~~~DNA topoisomerase 4 subunit B~~~
MTQTYNADAIEVLTGLEPVRRRPGMYTDTTRPNHLGQEVIDNSVDEALAGHAKRVDVILHADQSLEVIDDGRGMPVDIHP
EEGVPAVELILCRLHAGGKFSNKNYQFSGGLHGVGISVVNALSKRVEVTVRRDGQVYNIAFENGEKVQDLQVVGTCGKRN
TGTSVHFWPDESFFDSPRFSVSRLMHVLKAKAVLCPGVEITFKDEVNNSEQRWCYQDGLNDYLGEAVNGLPTLPEKPFIG
NFNGETEAVDWALLWLPEGGELLTESYVNLIPTMQGGTHVNGLRQGLLDAMREFCEYRNILPRGVKLSAEDIWDRCAYVL
SVKMQDPQFAGQTKERLSSRQCAAFVSGVVKDAFSLWLNQNVQAAEQLAEMAIASAQRRLRAAKKVVRKKLTSGPALPGK
LADCTAQDLNRTELFLVEGDSAGGSAKQARDREYQAIMPLKGKILNTWEVSSDEVLASQEVHDISVAIGIDPDSDDLSQL
RYGKICILADADSDGLHIATLLCALFVRHFRALVKNGHVYVALPPLYRIDLGKEVYYALTEEEKAGVLEQLKRKKGKPNV
QRFKGLGEMNPMQLRETTLDPNTRRLVQLTISDEDDQRTNAMMDMLLAKKRSEDRRNWLQEKGDLADLDV
>P66939 5.6.2.2~~~parE~~~DNA topoisomerase 4 subunit B~~~
MNKQNNYSDDSIQVLEGLEAVRKRPGMYIGSTDKRGLHHLVYEIVDNSVDEVLNGYGNEIDVTINKDGSISIEDNGRGMP
TGIHKSGKPTVEVIFTVLHAGGKFGQGGYKTSGGLHGVGASVVNALSEWLEVEIHRDGSIYHQSFKNGGSPSSGLVKKGK
TKKTGTKVTFKPDDTIFKASTSFNFDVLSERLQESAFLLKNLKITLNDLRSGKERQEHYHYEEGIKEFVSYVNEGKEVLH
DVATFSGEANGIEVDVAFQYNDQYSESILSFVNNVRTKDGGTHEVGFKTAMTRVFNDYARRINELKTKDKNLDGNDIREG
LTAVVSVRIPEELLQFEGQTKSKLGTSEARSAVDSVVADKLPFYLEEKGQLSKSLVKKAIKAQQAREAARKAREDARSGK
KNKRKDTLLSGKLTPAQSKNTEKNELYLVEGDSAGGSAKLGRDRKFQAILPLRGKVINTEKARLEDIFKNEEINTIIHTI
GAGVGTDFKIEDSNYNRVIIMTDADTDGAHIQVLLLTFFFKYMKPLVQAGRVFIALPPLYKLEKGKGKTKRVEYAWTDEE
LNKLQKELGKGFTLQRYKGLGEMNPEQLWETTMNPETRTLIRVQVEDEVRSSKRVTTLMGDKVQPRREWIEKHVEFGMQE
DQSILDNSEVQVLENDQFDEEEI
>Q59961 5.6.2.2~~~parE~~~DNA topoisomerase 4 subunit B~~~COG0187
MSKKEININNYNDDAIQVLEGLDAVRKRPGMYIGSTDGAGLHHLVWEIVDNAVDEALSGFGDRIDVTINKDGSLTVQDHG
RGMPTGMHAMGIPTVEVIFTILHAGGKFGQGGYKTSGGLHGVGSSVVNALSSWLEVEITRDGAVYKQRFENGGKPVTTLK
KIGTAPKSKTGTKVTFMPDATIFSTTDFKYNTISERLNESAFLLKNVTLSLTDKRTNEAIEFHYENGVQDFVSYLNEDKE
ILTPVLYFEGEDNGFQVEVALQYNDGFSDNILSFVNNVRTKDGGTHETGLKSAITKVMNDYARKTGLLKEKDKNLEGSDY
REGLAAVLSILVPEEHLQFEGQTKDKLGSPLARPVVDGIVADKLTFFLMENGELASNLIRKAIKARDAREAARKARDESR
NGKKNKKDKGLLSGKLTPAQSKNPAKNELYLVEGDSAGGSAKQGRDRKFQAILPLRGKVVNTAKAKMADILKNEEINTMI
YTIGAGVGADFSIEDANYDKIIIMTDADTDGAHIQTLLLTFFYRYMRPLVEAGHVYIALPPLYKMSKGKGKKEEVAYAWT
DGELEELRKQFGKGATLQRYKGLGEMNADQLWETTMNPETRTLIRVTIEDLARAERRVNVLMGDKVEPRRKWIEDNVKFT
LEETTVF
>P11904 ~~~parM~~~Plasmid segregation protein ParM~~~
MLVFIDDGSTNIKLQWQESDGTIKQHISPNSFKREWAVSFGDKKVFNYTLNGEQYSFDPISPDAVVTTNIAWQYSDVNVV
AVHHALLTSGLPVSEVDIVCTLPLTEYYDRNNQPNTENIERKKANFRKKITLNGGDTFTIKDVKVMPESIPAGYEVLQEL
DELDSLLIIDLGGTTLDISQVMGKLSGISKIYGDSSLGVSLVTSAVKDALSLARTKGSSYLADDIIIHRKDNNYLKQRIN
DENKISIVTEAMNEALRKLEQRVLNTLNEFSGYTHVMVIGGGAELICDAVKKHTQIRDERFFKTNNSQYDLVNGMYLIGN
>A0A0C5XKJ0 ~~~parS~~~Prs ADP-ribosylating antitoxin~~~
MSELETGARKAQGKGTPPFAYKALYRASPVDRIGVIKKGVAASDLKRFIAALHVDQKVMFDALNLKTATVNKKAANNQPL
STEDSERVLGLAKLVGQLEDMVEESGETDGFDAPEWLSSWLRQPLPALGGVNPIDLLDTMEGQAVVSRALAQIQSGAFA
>A0A0C5XL88 2.4.2.-~~~parT~~~Prs ADP-ribosylating toxin~~~
MTTSFWRIATDARTYEADDLSGAGAKITGGRWNEVGVAIVYAASSRALACLETVVHLNSGGLPLNRYLVEIEVPDEVLAS
AEVATPGNLPVGWDAEPAGRVSISFGSQWAQSQRTALLLVPSVIVPEETNLLINPAHPDAKGIKARKVRKWLYDPRMIRK
A
>Q8FEY5 ~~~pasI~~~Persistence and stress-resistance antitoxin PasI~~~COG2914
MPGKIAVEVAYALPEKQYLQRVTLQEGATVEEAIRASGLLELRTDIDLTKNKVGIYSRPAKLSDIVHDGDRVEIYRPLIA
DPKELRRQRAEKSANK
>C7G1H4 ~~~bactA1~~~Bacteriocin plantaricin ASM1~~~
MSKLVKTLTVDEISKIQTNGGKPAWCWYTLAMCGAGYDSGTCDYMYSHCFGVKHSSGGGGSYHC
>Q8FEY4 ~~~pasT~~~Persistence and stress-resistance toxin PasT~~~COG2867
MILFVGFLLMEIVMPQISRTALVPYSAEQMYQLVNDVQSYPQFLPGCTGSRILESTPGQMTAAVDVSKAGISKTFTTRNQ
LTSNQSILMSLVDGPFKKLIGGWKFTPLSQDACRIEFHLDFEFTNKLIELAFGRVFKELAANMVQAFTVRAKEVYSAR
>Q9KW51 ~~~pas~~~Probable cell-surface antigen I/II~~~
MKKRKEVFGFRKSKVAKTLCGAVLGAALLAMVDQQVAAEETTKSNSTTNVAVTATGNPATNLPEAQGSASKEAEQSQNQA
GETNGSIPVEVPKTDLDQAAKEAKSAGVNVVQDADVNKGTVKTAGEAAQKETEIKEDYTKQAEDIKKTTDQYKSDVAAHE
AEVAKIKAKNQATKEQYEKDMAAHKAEVERINAANAASKAAYETKLAQYQAELKRVQEANATNEANYQAKLTAYKTELAR
VQKANADAKAAYEAAVAANNAANAALTAENTAIKKRNADAKADYEAKLAKYQADLAKYQKDLAEYPQKLKEYNEEQAKIK
EALKKLEQDKNKDGHLTEPSAQSLVYDSEPDAKLSLTTEDGTLLKSSVVDEAFSKSTSKAKYDQKILQLDDLDIRGLEKA
DSATSTVELYGNIGNKSTWTTNVGNNTEVKWGSVLLKRGQSVTATYTNLQKTYYNGKKVSKIVYKYTVDKDSKFQNPSGN
VWLGVFSDPTLGVFASAYTGQVEKDTSIFIKNEFTFYDENDQPINFDNALLSVASLNRENNSIEMAKDYTGKFVRISGSS
IDEKDGKIYATKTLNFKKGQGGSRWTMYPNGQEGSGWDSSDAPNSWYGAGAVKISGQHNSITLGAISATLVVPSDSVMAV
ETGKKPNIWYSLNGKIRAVNVPKITKENPTPPVEPTAPQAPTYEVEKPLNPAPVAPNYEKEPTPPTPTPDQPEPNKPVEP
TYEAIPTPPTDPVYQNLPTPPAVPTVHFRYYKLAVQPQVNKEIKNNNDVDIDKTLVAKQSVVKFQLKTADLPAGRAKTTS
FVLVDPLPSGYQFNLEATKAASPGFDVSYDKATNTVTFKATAATLATFNADLTKSVATIYPTVVGQVLNDGATYKNNFTL
TVNDAYGIKSNVVRVTTPGKPNDPDNPNNNYIKPTKVNKNENGVVIDGKTVLAGSTNYYELTWDLDQYKNDRSSADTIQK
GFYYVDDYPEEALELRQDLVKITDANGNKVTGVSVDHYTSLEAAPQEVRDVLSKAGIRPKGAFQIFRADNPREFYDTYVK
NGIDLKIVSPMVVKKQMGQTGGSYENQAYQIDFGNGYASNIVINNVPKINPKKDVTLTLDPADTNNVDGQTIPLNTVFNY
RLIGGIIPADHSEELFEYNFYDDYDQTGDRYTGQYKVFAKVDITLKDGSVIKSGTDLTQHTTAEVDATKGALTIKFKEDF
LRSVSIDSAFQAESYIQMKRIAAGTFENTYVNTVNKVAYASNTVRTTTPQAPRSEVPKRPTPVSYLTNKLSSAPATLPQT
GTTDSSYMPYLGVIALLSVLGLGQLKRNEK
>O25526 2.3.1.-~~~patA~~~Peptidoglycan O-acetyltransferase~~~COG1696
MLASIISILRVFVLLFNTPLFIFAFLPVGFLGYFILQAYAKNPLFPKLWLVLASLFFYAFWNVKYLPLLVGSIVFNYFVA
LKIHQTQPNAYKRLWLILGLIANVSLLGFFKYTDFFLTNFNLIWKSHFETLHLILPLAISFFTLQQIAYLMDTYKQNQIM
QPKMRERVSENAPILLNPPTSFFSLSHFLDYALFVSFFPQLIAGPIVHHSEMMPQFKDKNNQYLNYRNIALGLFIFSIGL
FKKVVIADNTAHFADFGFDKATSLSFIQAWMTSLSYSFQLYFDFSGYCDMAIGIGLFFNIKLPINFNSPYKALNIQDFWR
RWHITLSRFLKEYLYIPLGGNRVKELIVYRNLILVFLIGGFWHGAGWTFIIWGLLHGIALSVHRAYSHATRKFHFTMPKI
LAWLITFNFINLAWVFFRAKNLESALKVLKGMVGLNGVSLCHLSKEASEFLNRVNDNMIMHTIMYASPTFKMCVLMIIIS
FCLKNSSHLYQSNQMDWIKTTSACLLLSIGFLFIFASSQSVFLYFNF
>P39048 ~~~patA~~~Protein PatA~~~COG0784
MKTLPITRYRFFQKIQPLSLLKKITGKTITGCLQVFSTSGTWSIYVEEGKLIYACYSERMFEPLYRHLGNLSPQIATLPK
EINEQLRAIFETGIENQAIPNPDYLAICWLVNQKYISSSQAAVLIEQLALEVVESFLMLEEGSYEFIPESFLDDLPKFCY
LNVRLLVEQCQQHGRVPEAFRREASSQEISSSTEHNQIPVNNRRSTKFTSPPHTQPKPEPRLPQINTNKSTEYSKRYASQ
PNTVNHGYSQTSATSTDKKIYTIFCIDENPIVLNNIKNFLDDQIFAVIGVTDSLKALMEILCTKPDIILINVDMPDLDGY
ELCSLLRKHSYFKNTPVIMVTEKAGLVDRARAKIVRASGHLTKPFNQGDLLKVIFKHIT
>C7BKP9 ~~~~~~Toxin PAU_02230~~~COG3774
MKGIEGVIMLSHDILPEKLLVSEKKHENVGSYFSDDIGEQSEQTEVSHFNLSLDDAFDIYADISIENQQELKNKDNNTNI
WSSLGRGDDDHNLKKIINDAFKEKLPQLMEYRRKGYNVIGLDKEGIKKLEGMLKAVPPEIQQPTMKNLYSAAQELLNTLK
QHPLLPENQDMIQQSNLVIRNLSDALEAINAVSKVNQVEWWEEVHKTNKAQSDRLIAATLEELFFKVKDKRLPGSNDDYC
QQEREETERKIKDLLLYDGYQLTAEHFKFGRLRKSLLAESRVTRLKLAEYLEKKSVGILTAARDAKMYAMKILLAQTRNN
GFNAKDLINAGQVNDRLLSFQQYARHIRAVDGEIDGIILSNPLVVACIKETNDEPAHIKIARAILPVSEELGTVSKVLRE
TKEKVQPSKPKEELNHPHQDWWNRGDELWKYIKKTSWNIKETSVHVTQMVGYEASKTASRAKHKLKESSYSESINGAVKG
TALLLLDEIQQAENRIRQIPQFAWDVQEAVEQHSSVIQRTAYPDELPELSELLNEQLKHEEARWQAVKKQSRDKLQELIA
PITRLAQEKWAQDLYFQLGEELRKERQDRWKDIQQFDEIMAEAVGQFAEMARELDSEAVRLAEHGHSGGKELQEKVAKWL
RDLSKLKGKVKAGVAKITGTSLDNFSRSGMLARGMSEWAEDLKQSYLQETLQEGSAVAAELFERTLMEVVEENRTHFAKE
SDPEAERFLKRLALALKHAAENTTVYPPTPEEILAGSRSLPEDIRHWAEKKVVSGAISAAFRGGFKLVTGTFSLPVRVVI
RGAKTGGTLYRGVRAINRSVRLGQGPATQVKSKFINQELSKTAFRLTLSLSPLVAWGMAASITAGRLYNEKDYPEKIIKN
IVIDLPEELLWIGGYAGINAAIRAHAEKAIQQAIQHALDEQADKLALRINKEIAGKSADVNVEIIPQETSVSPAETAQST
PEPLSDFASTSQLTMPELIDIQDNNSAQQPKVRRKRDVSVESEISIDNLNIINANTREDKVNSEIKSELRSELKRFENSD
ANSPMSDVERAIFIDLFLYKNKYEVSESQQDYKNTWLKFRRELESQENKEIKEYLRFRSIIEAYEIYDKKRLDDDTIPEA
GTIIKEVIDFFQKLKKENPITFMKLAEAMVKFQYYYEEEDENEDRYFKMAEIYYFLNKTENEKKSKTFHLDIIDKYPNEN
NRLLDEFFLNKNNNNPDLDEIIYKLQSMQEKYRESYEMLSKVENIHQVLSDDSKNEENIFLDNRIIAAQVFDGSINISLQ
DKKKWLNRYDQIRNEEGSDGWKLMHIESILINLRRINTAINLTAMKSESALLLIDKLLNFQKKARENILHISETPHEDFT
SYSQFKTRKELGNDDSKYYAQFDNYKDNHDAEKEAKEILSQVVARASLSFSELFDKVESIKLFSFVYKNRDGGAPLAAPG
RTVVIKFPGKDTGGLVISNLFLRNHVKRISTKEMEDLKPLTEGMYTRATQHRSLGSYYHIGSQSEHTNALEILSGMNKEE
LKTHLKKQGIWFGEPALFSNEYPKQENTGHLENTTLKNAIIGVSTIQNNAAANYLRSTMYESTGWEKLGDRFIPFYEIGR
RKHYDREYEINSEQLTLDIITSIAIAYPAARGIVATIRSSAIPSILKSGLRGSALFKSLSLELGKMGFNASKVFGGAVYE
LIEPYPINSHLNRHNVFNKVKDTAWEFHTDVGLKGGGLKDFIDRFTKEPKEITISGYKFKRIKYNQENFDTMQRMALDYA
YNPDSKGKIAQAQQAYKTGKEDYNAPQYDNFNGLSLDKKIERYISPDTDATTKGVLAGKMNESIKDINAFQTAKDAQSWK
KSANKANKVVLTPQNLYLKGKPSECLPESVLMGWALQSSQDAKLSKMLMGIYSSNDITSNPLYKSLKELHANGNASKFNA
SATSISNINVSNLATSETKLFPTEISSVRVDAPKHTMLISKIKNRENKIKYVFYDPNYGMAYFDKHSDMAAFFQKKMQQY
DFPDDSVSFHPLDYSNVSDIKISGRNLNEIIDGEIPLLYKQEGVQLEGITPRDGIYRVPPKNTLGVQETKHYIIVNNDIY
QVEWDQTNNTWRVFDPSNTNRSRPTVPVKQDTNGEWFKHSETGLKGGGPIDDIRKYIARKSAIKIFNQSINYSATKWPPE
PIDKNIHMIWIGTKNISEKNIKLSIDTAKKNPDYNTSIIYDSGISGHEGAKKFMLEKFQDSNVNIIDFRKKSYFSQLKQE
PSFAYYEQVIAENKYAQASDILRLLVLKYEGGIYKDIDDIQVKGFGSLTFPKGIGVMREYAPEAGKATAFPNTPIAVTKN
NPIINKTLDLAVSNYQRGEKNVLKLAGPDVFTQALYQEIPGLDSKVLNAQLYQLELAKRQALGVPLEKPKNFADEQLTSA
EKEKINRPYQSIRGLSGYVENGADHSWAVDTNIPSTSTQTSTIVTPLAPKTEMLPPVPSSSTKSSTSAPVLQEKISYNLA
TDIDATDYLNQLKQKTNINNKISSPAGQCESLMKPVSDFMRENGFTDIRYRGMFIWNNATEQIPMNHFVVVGKKVGKDYV
FDVSAHQFENKGMPDLNGPLILAAEDWAKKYRGATTRKLIYYSDFKNASTATNTYNALPRELVLESMEGKTFITSPNWYQ
TFKRTHNIHPEVTVSDPATFSLNYSVNPTAENLSPPPPPPIPSHGQVPKTVTPPPPPMRSPLSLSQPLERLPANKTKPIG
FNPGENKASFSKLEEAGKHYYKDDKSRQAAPVNTMSDFDNRYLSHTTEAPAPSNVAHLAPGNIYNTKVTAKGAEKPAYDI
YISKDGESLITSSSYKVDDITTDSKFGKPLPYSEIMFNSLKKSGVDPKNLKRSVQASIENKVTQDVISAIGTRIQRGQVI
RVSPTENPDAFYTLLGTDNCKATLHMLNQHAEEFGHKVVTSIEFKGTGYLVMNIGTSTQTSTIVTPPPMPGTSQLVQ
>A0R5X8 2.6.1.-~~~pat~~~Putative phenylalanine aminotransferase~~~COG0079
MSIRLRAEMADLPAYAPGKTVPGAIKIASNETVHGPLPSVREAILKATDLINRYPDNGYLDLRERLAKHVNFAPENISVG
CGSVSLCQQLIQITSSVGDEVLFAWRSFEIYPLQVRTAGATPVAVALRDHTHDLDAMLAAITDRTRLIFVCNPNNPTSTV
VDPGELARFVAAVPPHILVVLDEAYVEYIRDGLLPDSLGLVREHRNVVVLRTFSKAYGLAGLRVGYAVADPEIVTALGKV
YVPFSATSVSQAAAIACLDAADELLARTDAVVAERTRVSDALRAAGYTLPPSQANFVWLPLAERTLDFVARAADNRIIVR
PYGEDGVRVTIGAPHENDAFLDFAQRWIAPGGAGPRTGDSA
>P9WML5 2.6.1.-~~~pat~~~Putative phenylalanine aminotransferase~~~COG0079
MTARLRPELAGLPVYVPGKTVPGAIKLASNETVFGPLPSVRAAIDRATDTVNRYPDNGCVQLKAALARHLGPDFAPEHVA
VGCGSVSLCQQLVQVTASVGDEVVFGWRSFELYPPQVRVAGAIPIQVPLTDHTFDLYAMLATVTDRTRLIFVCNPNNPTS
TVVGPDALARFVEAVPAHILIAIDEAYVEYIRDGMRPDSLGLVRAHNNVVVLRTFSKAYGLAGLRIGYAIGHPDVITALD
KVYVPFTVSSIGQAAAIASLDAADELLARTDTVVAERARVSAELRAAGFTLPPSQANFVWLPLGSRTQDFVEQAADARIV
VRPYGTDGVRVTVAAPEENDAFLRFARRWRSDQ
>P42588 2.6.1.82~~~patA~~~Putrescine aminotransferase~~~COG4992
MNRLPSSASALACSAHALNLIEKRTLDHEEMKALNREVIEYFKEHVNPGFLEYRKSVTAGGDYGAVEWQAGSLNTLVDTQ
GQEFIDCLGGFGIFNVGHRNPVVVSAVQNQLAKQPLHSQELLDPLRAMLAKTLAALTPGKLKYSFFCNSGTESVEAALKL
AKAYQSPRGKFTFIATSGAFHGKSLGALSATAKSTFRKPFMPLLPGFRHVPFGNIEAMRTALNECKKTGDDVAAVILEPI
QGEGGVILPPPGYLTAVRKLCDEFGALMILDEVQTGMGRTGKMFACEHENVQPDILCLAKALGGGVMPIGATIATEEVFS
VLFDNPFLHTTTFGGNPLACAAALATINVLLEQNLPAQAEQKGDMLLDGFRQLAREYPDLVQEARGKGMLMAIEFVDNEI
GYNFASEMFRQRVLVAGTLNNAKTIRIEPPLTLTIEQCELVIKAARKALAAMRVSVEEA
>A0R3F9 2.3.1.-~~~~~~Acetyltransferase Pat~~~COG0664
MAELTEVRAADLAALEFFTGCRPSALEPLATQLRPLKAEPGQVLIRQGDPALTFMLIESGRVQVSHAVADGPPIVLDIEP
GLIIGEIALLRDAPRTATVVAAEPVIGWVGDRDAFDTILHLPGMFDRLVRIARQRLAAFITPIPVQVRTGEWFYLRPVLP
GDVERTLNGPVEFSSETLYRRFQSVRKPTRALLEYLFEVDYADHFVWVMTEGALGPVIADARFVREGHNATMAEVAFTVG
DDYQGRGIGSFLMGALIVSANYVGVQRFNARVLTDNMAMRKIMDRLGAVWVREDLGVVMTEVDVPPVDTVPFEPELIDQI
RDATRKVIRAVSQ
>O05581 2.3.1.-~~~~~~Acetyltransferase Pat~~~COG0664
MDGIAELTGARVEDLAGMDVFQGCPAEGLVSLAASVQPLRAAAGQVLLRQGEPAVSFLLISSGSAEVSHVGDDGVAIIAR
ALPGMIVGEIALLRDSPRSATVTTIEPLTGWTGGRGAFATMVHIPGVGERLLRTARQRLAAFVSPIPVRLADGTQLMLRP
VLPGDRERTVHGHIQFSGETLYRRFMSARVPSPALMHYLSEVDYVDHFVWVVTDGSDPVADARFVRDETDPTVAEIAFTV
ADAYQGRGIGSFLIGALSVAARVDGVERFAARMLSDNVPMRTIMDRYGAVWQREDVGVITTMIDVPGPGELSLGREMVDQ
INRVARQVIEAVG
>P21861 2.3.1.183~~~bar~~~Phosphinothricin N-acetyltransferase~~~COG1247
MPGTAEVQVRPGVEEDLKPLTDLYNHYVRETPITFDTEPFTPEERRPWLLSHPEDGPYRLRVATDAESQEILGYATSSPY
RAKPAYATSVETTVYVAPGAGGRGIGSLLYASLFDALAAEDLHRAYAGIAQPNEASARLHARFGFRHVGTYREVGRKFGR
YWDVAWYERPL
>P16426 2.3.1.183~~~bar~~~Phosphinothricin N-acetyltransferase~~~
MSPERRPADIRRATEADMPAVCTIVNHYIETSTVNFRTEPQEPQEWTDDLVRLRERYPWLVAEVDGEVAGIAYAGPWKAR
NAYDWTAESTVYVSPRHQRTGLGSTLYTHLLKSLEAQGFKSVVAVIGLPNDPSVRMHEALGYAPRGMLRAAGFKHGNWHD
VGFWQLDFSLPVPPRPVLPVTEI
>Q57146 2.3.1.183~~~pat~~~Phosphinothricin N-acetyltransferase~~~COG1247
MSPERRPVEIRPATAADMAAVCDIVNHYIETSTVNFRTEPQTPQEWIDDLERLQDRYPWLVAEVEGVVAGIAYAGPWKAR
NAYDWTVESTVYVSHRHQRLGLGSTLYTHLLKSMEAQGFKSVVAVIGLPNDPSVRLHEALGYTARGTLRAAGYKHGGWHD
VGFWQRDFELPAPPRPVRPVTQI
>Q9ZB73 2.4.1.-~~~~~~Processive diacylglycerol beta-glycosyltransferase~~~COG0463
MDKLVSILVPCYKSKPFLKRFFNSLLKQDLNQAKIIFFNDNVADETYEVLQKFKKEHNNLAIEVYCDKQNEGIGKVRDKL
VNLVTTPYFYFIDPDDCFNNKNVIKEIVESIKKEDFDLGVLKSMVYLCFLKHDFIIKFLPLKGIFQGRVKLINNNNVNKL
NYIKNNDQYIWNIVINTDFFRKLNLTFESRLFEDIPIWYPMFFSSQKIVFIDVIGTNYFIRNDSLSTTISAPRYLNLIQC
YEKLYVNLSQNGSLASFIDPNHKIEARFWRRQMFVWFALFSFEYFKKNFSESKKILEKLFVFLEKNGVYERVFQTKNQGI
YYIWVQRLKYFKHVLESKSDN
>P75302 2.4.1.-~~~~~~Processive diacylglycerol beta-glycosyltransferase~~~
MNKLISILVPCYQSQPFLDRFFKSLLKQDWNGVKVIFFNDNKPDPTYEILKQFQQAHPQLAIEVHCGEKNVGVGGSRDQL
INYVDTPYFYFVDPDDEFSDPNCFKAIVETIQGENFDIAVLNSIVYLQMLKNDFLIKHIPLKNIFQGKVKLNPDNTVNHL
HYIQNNDQYIWNIVINTAFFKALDLQFVNRFIEDIAVWFPIMFKAQKVLWIDVNGVNYYLRPNSASTQKNSIKLLSFIEA
YERLYFHLKKVGKLADFIDPNNKIESRFWRRQAFIWFSFINVSWMKAEFEQTKSVLQKLFDFMEANGIYDRVFTNKHHGI
YLLWVNRLKHFKKLVQAQPHL
>A3UZK3 4.3.1.-~~~pbfA~~~(R)-1-hydroxy-2-aminoethylphosphonate ammonia-lyase~~~
MTTTNQEPILKATHFRSEGDVNTTPAREKWNESLNDDATQAMLKRDSDVFLHQAMSTPCLDTLTAAEGIYIQDATGKKYM
DFHGNNVHQLGYGHPHIINKVTQQMASLPFSPRRFTNETAVQCAEKLTQICGGDLNRVLFAPGGTSVIGMALKLARHVTN
NFKVVSLWDSFHGASLDAISVGGEACFREGMGPLMAGVERIPPAVSYRGAFPLRDSLSLRGQNSGDANETACDVHYADYL
EYVIEKEGGIGAFIAEAVRNTDVQVPSKAYWKRIREICDKHNVMLIIDDIPNGMGRSGEWFTHQAFDIEPDILCIGKGFG
GGLVPIAAMITKDKYNTAAQVSLGHYTHEKSPIGCAAALATMEVIEQENLLEKVQADSAFVREQLLQMKEEYPVIGDIRG
IGLLWGVELVTDHITKTRAFDEAEAVLYQCLNEGLSFKVSQGNVIQLSPPLIISRNELEVALSVFEKAIAKVCKDFEYL
>P71707 ~~~ponA1~~~Penicillin-binding protein 1A~~~COG0744
MNSDGRHHQSSSGAPRGPANPGQRGQVPPDDRLTAILPPVTDDRSAPHADSIEAVKAALDGAPPMPPPRDPLEEVTAALA
APPGKPPRGDQLGGRRRPPGPPGPPGSSGQPAGRLPQPRVDLPRVGQINWKWIRRSLYLTAAVVILLPMVTFTMAYLIVD
VPKPGDIRTNQVSTILASDGSEIAKIVPPEGNRVDVNLSQVPMHVRQAVIAAEDRNFYSNPGFSFTGFARAVKNNLFGGD
LQGGSTITQQYVKNALVGSAQHGWSGLMRKAKELVIATKMSGEWSKDDVLQAYLNIIYFGRGAYGISAASKAYFDKPVEQ
LTVAEGALLAALIRRPSTLDPAVDPEGAHARWNWVLDGMVETKALSPNDRAAQVFPETVPPDLARAENQTKGPNGLIERQ
VTRELLELFNIDEQTLNTQGLVVTTTIDPQAQRAAEKAVAKYLDGQDPDMRAAVVSIDPHNGAVRAYYGGDNANGFDFAQ
AGLQTGSSFKVFALVAALEQGIGLGYQVDSSPLTVDGIKITNVEGEGCGTCNIAEALKMSLNTSYYRLMLKLNGGPQAVA
DAAHQAGIASSFPGVAHTLSEDGKGGPPNNGIVLGQYQTRVIDMASAYATLAASGIYHPPHFVQKVVSANGQVLFDASTA
DNTGDQRIPKAVADNVTAAMEPIAGYSRGHNLAGGRDSAAKTGTTQFGDTTANKDAWMVGYTPSLSTAVWVGTVKGDEPL
VTASGAAIYGSGLPSDIWKATMDGALKGTSNETFPKPTEVGGYAGVPPPPPPPEVPPSETVIQPTVEIAPGITIPIGPPT
TITLAPPPPAPPAATPTPPP
>P54488 3.4.16.4~~~pbpA~~~Penicillin-binding protein 2A~~~COG0768
MRRNKPKKQNHKEKKKSLPIRLNILFLAAFVIFTWIIVELGIKQIVQGDDYKNQANKQEQSEVSSAVPRGKIYDRNFNAI
VTNKALNAITYTRSKSTTQEQRLKIAKKLSDMIKVDTKKVTERDKKDYWILTRPKEAKKLISSKERQQVEDKKISDDDLY
QLQLKRITDKQLNELTDKDMQILAIKRQMDSGYALTPQYIKNEDVSAKEMAVVSEHLDELPGVDVTSDWEREYPYKNLLR
SVLGSVSSSNEGLPSNLLDHYLSLGYSRNDRVGKSYLEYQYESLLQGQKAKVENITDSKGNVTGTKTVSEGKAGKDLVLT
IDIDLQKSVEKIIEKKLKAAKARPSTELLDRAFVVMMDPRNGEVLTMAGKQIKRENGAYKFDDYALGAMTSSYAMGSAVK
GATVLTGLQTGAINLNTVFKDEPLYIGQDKRGKKSWQNLGPVGIQTALEKSSNVFMFKTAIAVGKGEYKPHQALPLDTSA
FDTFRNYFSQFGLGVKTGIDLPNEMTGYKGTSRLSGFLLDFAIGQYDTYTPLELAQYVSTIANGGYRMKPQLVKEVRDSN
AKKGIGAVVDSVQPEVLNKVDMKSSYIEEVQAGFRRVATKGTAAGQLASASYKPAAKTGTAQSFYDGPDKSKTGTDTYNT
TLVAYAPADNPEIAISVVVPWTYIDYNQRYSITNEIGREVMDKYFELKSKQDKEGTQQKNKDKIEENAENTTSSDN
>A0A0H2ZMF9 ~~~pbp2a~~~Penicillin-binding protein 2a~~~COG0744
MKLDKLFEKFLSLFKKETSELEDSDSTILRRSRSDRKKLAQVGPIRKFWRRYHLTKIILILGLSAGLLVGIYLFAVAKST
NVNDLQNALKTRTLIFDREEKEAGALSGQKGTYVELTDISKNLQNAVIATEDRSFYKNDGINYGRFFLAIVTAGRSGGGS
TITQQLAKNAYLSQDQTVERKAKEFFLALELSKKYSKEQILTMYLNNAYFGNGVWGVEDASKKYFGVSASEVSLDQAATL
AGMLKGPELYNPLNSVEDSTNRRDTVLQNMVAAGYIDKNQETEAAEVDMTSQLHDKYEGKISDYRYPSYFDAVVNEAVSK
YNLTEEEIVNNGYRIYTELDQNYQANMQIVYENTSLFPRAEDGTFAQSGSVALEPKTGGVRGVVGQVADNDKTGFRNFNY
ATQSKRSPGSTIKPLVVYTPAVEAGWALNKQLDNHTMQYDSYKVDNYAGIKTSREVPMYQSLAESLNLPAVATVNDLGVD
KAFEAGEKFGLNMEKVDRVLGVALGSGVETNPLQMAQAYAAFANEGLMPEAHFISRIENASGQVIASHKNSQKRVIDKSV
ADKMTSMMLGTFTNGTGISSSPADYVMAGKTGTTEAVFNPEYTSDQWVIGYTPDVVISHWLGFPTTDENHYLAGSTSNGA
AHVFRNIANTILPYTPGSTFTVENAYKQNGIAPANTKRQVQTNDNSQTDDNLSDIRGRAQSLVDEASRAISDAKIKEKAQ
TIWDSIVNLFR
>Q8DNB6 ~~~pbp2a~~~Penicillin-binding protein 2a~~~COG0744
MKLDKLFEKFLSLFKKETSELEDSDSTILRRSRSDRKKLAQVGPIRKFWRRYHLTKIILILGLSAGLLVGIYLFAVAKST
NVNDLQNALKTRTLIFDREEKEAGALSGQKGTYVELTDISKNLQNAVIATEDRSFYKNDGINYGRFFLAIVTAGRSGGGS
TITQQLAKNAYLSQDQTVERKAKEFFLALELSKKYSKEQILTMYLNNAYFGNGVWGVEDASKKYFGVSASEVSLDQAATL
AGMLKGPELYNPLNSVEDSTNRRDTVLQNMVAAGYIDKNQETEAAEVDMTSQLHDKYEGKISDYRYPSYFDAVVNEAVSK
YNLTEEEIVNNGYRIYTELDQNYQANMQIVYENTSLFPRAEDGTFAQSGSVALEPKTGGVRGVVGQVADNDKTGFRNFNY
ATQSKRSPGSTIKPLVVYTPAVEAGWALNKQLDNHTMQYDSYKVDNYAGIKTSREVPMYQSLAESLNLPAVATVNDLGVD
KAFEAGEKFGLNMEKVDRVLGVALGSGVETNPLQMAQAYAAFANEGLMPEAHFISRIENASGQVIASHKNSQKRVIDKSV
ADKMTSMMLGTFTNGTGISSSPADYVMAGKTGTTEAVFNPEYTSDQWVIGYTPDVVISHWLGFPTTDENHYLAGSTSNGA
AHVFRNIANTILPYTPGSTFTVENAYKQNGIAPANTKRQVQTNDNSQTDDNLSDIRGRAQSLVDEASRAISDAKIKEKAQ
TIWDSIVNLFR
>Q07868 3.4.16.4~~~pbpB~~~Penicillin-binding protein 2B~~~COG0768
MPKKNKFMNRGAAILSICFALFFFVILGRMAYIQITGKANGEVLATKATEQHEKKRTIEASRGSILDRKGKVIAEDTATY
KLIAILDKKMTTDVKHPQHVVNKEKTAEALSKVINLDKADILDILNKDAKQVEFGSAGRDITYSQKQKIEKMKLPGISFL
RDTKRYYPNGVFASNLIGYAEVDEETNEISGAMGLEKVLDKYLKERDGYVTYESDKSGWELPNSKNKITAPKNGDNVYLT
IDQKIQTFLEDSMTKVAQKYNPKKIMAAVVDPKTGKVLAMGQRPSFDPNKRDVTNYYNDLISYAYEPGSTMKIFTLAAAM
QENVFNANEKYKSGTFEVGGAPVKDHNNGVGWGPTTYHDGVLRSSNVAFAKLAKEKLGYDRLNQYLHKFNFYQKTGIDLP
GEVSSKINFKYEFDKASTAYGQASAVTPIQQIQAATAIANDGKMMKPYVIDHIVDPDKDKTIYQNKPESAGTPISASTAK
KVRDILGEVVTSKIGTGQAYKIEGFDVAGKTGTAQIAGKGGYLDGTDNYIFSFMGMAPKDDPELLIYVAVQQPQLKAGQS
SSDPVSEIFNPTMKNSLHYLNIEPTEKSDSDKEETKAQTMPDLTDQTVAAAQKKAKEENLTPIVIGSDVAVKEQYPKADE
EVLTNQKVFLKTGGKIKMPDMTGWSRREVLQYGELAGIHIEVSGQGYAVSQSVKKDKEIKDKTVIKVKFKNPD
>P08149 3.4.16.4~~~penA~~~Probable peptidoglycan D,D-transpeptidase PenA~~~
MLIKSEYKPRMLPKEEQVKKPMTSNGRISFVLMAMAVLFACLIARGLYLQTVTYNFLKEQGDNRIVRTQALPATRGTVSD
RNGAVLALSAPTESLFAVPKDMKEMPSAAQLERLSELVDVPVDVLRNKLEQKGKSFIWIKRQLDPKVAEEVKALGLENFV
FEKELKRHYPMGNLFAHVIGFTDIDGKGQEGLELSLEDSLYGEDGAEVVLRDRQGNIVDSLDSPRNKAPQNGKDIILSLD
QRIQTLAYEELNKAVEYHQAKAGTVVVLDARTGEILALANTPAYDPNRPGRADSEQRRNRAVTDMIEPGSAIKPFVIAKA
LDAGKTDLNERLNTQPYKIGPSPVRDTHVYPSLDVRGIMQKSSNVGTSKLSARFGAEEMYDFYHELGIGVRMHSGFPGET
AGLLRNWRRWRPIEQATMSFGYGLQLSLLQLARAYTALTHDGVLLPLSFEKQAVAPQGKRIFKESTAREVRNLMVSVTEP
GGTGTAGAVDGFDVGAKTGTARKFVNGRYADNKHVATFIGFAPAKNPRVIVAVTIDEPTAHGYYGGVVAGPPFKKIMGGS
LNILGISPTKPLTAAAVKTPS
>P0A3M6 ~~~penA~~~Penicillin-binding protein 2B~~~COG0768
MRKFNSHSIPIRLNLLFSIVILLFMTIIGRLLYMQVLNKDFYEKKLASASQTKITSSSARGEIYDASGKPLVENTLKQVV
SFTRSNKMTATDLKETAKKLLTYVSISSPNLTERQLADYYLADPEIYKKIVEALPSEKRLDSDGNRLSESELYNNAVDSV
QTSQLNYTEDEKKEIYLFSQLNAVGNFATGTIATDPLNDSQVAVIASISKEMPGISISTSWDRKVLETSLSSIVGSVSSE
KAGLPAEEAEAYLKKGYSLNDRVGTSYLEKQYEETLQGKRSVKEIHLDKYGNMESVDTIEEGSKGNNIKLTIDLAFQDSV
DALLKSYFNSELENGGAKYSEGVYAVALNPKTGAVLSMSGIKHDLKTGELTPDSLGTVTNVFVPGSVVKAATISSGWENG
VLSGNQTLTDQSIVFQGSAPINSWYTQAYGSFPITAVQALEYSSNTYMVQTALGLMGQTYQPNMFVGTSNLESAMEKLRS
TFGEYGLGTATGIDLPDESTGFVPKEYSFANYITNAFGQFDNYTPMQLAQYVATIANNGVRVAPRIVEGIYGNNDKGGLG
DLIQQLQPTEMNKVNISDSDMSILHQGFYQVAHGTSGLTTGRAFSNGALVSISGKTGTAESYVADGQQATNTNAVAYAPS
DNPQIAVAVVFPHNTNLTNGVGPSIARDIINLYQKYHPMN
>P0AFI5 3.4.21.-~~~pbpG~~~D-alanyl-D-alanine endopeptidase~~~COG1686
MPKFRVSLFSLALMLAVPFAPQAVAKTAAATTASQPEIASGSAMIVDLNTNKVIYSNHPDLVRPIASISKLMTAMVVLDA
RLPLDEKLKVDISQTPEMKGVYSRVRLNSEISRKDMLLLALMSSENRAAASLAHHYPGGYKAFIKAMNAKAKSLGMNNTR
FVEPTGLSVHNVSTARDLTKLLIASKQYPLIGQLSTTREDMATFSNPTYTLPFRNTNHLVYRDNWNIQLTKTGFTNAAGH
CLVMRTVINNKPVALVVMDAFGKYTHFADASRLRTWIETGKVMPVPAAALSYKKQKAAQMAAAGQTAQND
>Q97H19 ~~~~~~Putative polysaccharide biosynthesis protein with aminopeptidase-like domain~~~COG4310
MEEINKYIQNSSETGGEIYNLIEELFPICRSITGNGVRKTMDIIRKHIPLEIHEVKSGTKVFDWTVPKEWNIKDAYVRNS
KGEKVIDFKENNLHVMSYSVPVHKTMTLDELKPYLHTIPGNKDRIPYLTSYYKENWGFSLTQNKFDELCDDDYEVVIDSS
LEDGSLTYGEYYIRGELEEEILLTTYTCHPSMCNDNLSGVALITFIAKALSKLKTKYSYRFLFAPETIGSITWLSRNEDK
LKNIKMGLVATCVGDAGIKNYKRTKFGDAEIDKIVEKVLMHCGSEYYVADFFPWGSDERQFSSPGINLPVGSLMRSCYGF
DGYHTSADNLCYMNKDGLADSYKTYLEVIYTIENNRTYLNLNPKCEPQLGKRGIYRMIGGGSDYPFDEFAMFWVLNMSDG
KNSLLDIAYKSGMEFRRIKYAADALYRVELLKLV
>O66874 ~~~mrcA~~~Penicillin-binding protein 1A~~~COG5009
MKKLVIGILGIVIALFVGLLVFLIPIYKNLPDPKLLESWTPPQASEVYDAKGRLYGTIGIQKRFYVSIDKIPEHVINAFV
ATEDRNFWHHFGIDPVAIVRAAIVNYRAGRIVQGGSTITQQLAKNLFLTRERTLERKIKEALLAIKIERTFDKKKIMELY
LNQIYLGSGAYGVEAAAQVYFGKHVWELSLDEAALLAALPKAPAKYNPFYHPERALQRRNLVLKRMLEEGYITPEQYEEA
VNKPLTVKKENKYKFSDYFLDMVKSYVFNKYGEIAYKGRLKIYTTIDLDYQKIAQKSLEEGLKRVAKIIGLPFLPKSEED
MELAYEKEAQLKRLKRGKIYVAKILKYDGNFMKVEIHGKKLKGEIKGLNTEGHKYVFVKYLGGNRAEIIPDLEGSLVSID
VKTGEIKAIVGGRSYAYSQFNRAVKALRQPGSAIKPVIYLSALLKGMTQISTIDASSKPYYDPSKGEDWIPKNYDEKEYG
NVTLRYALAHSINTAAVNLLDKVGFELVLEVGKKVGLDNLKPYYSLALGTVEVTPLQLTAAYQVFANLGTECKPFFIKKI
VDENGEVLEENVPECEEVLPKPETRVPVDMLRAVVLEGTARRASVLDRIVAGKTGTTDDFQDAWFVGFSPYIVTGVWVGY
DVKKSLGKHMSGSRVALPIWIDYMKVVTRMYPNEDFELPPENIVVNINPKDLVLADETCEGVPMVFVKGTEPHITCSDLN
AILGLR
>P39793 ~~~ponA~~~Penicillin-binding protein 1A/1B~~~COG0744
MSDQFNSREARRKANSKSSPSPKKGKKRKKGGLFKKTLFTLLILFVLGVVGGAVTFAVMVSDAPSLDESKLKTPYSSTIY
DKNGKEIAEVGAEKRTYVSIDEIPDVVKEAFIATEDARFYEHHGIDPVRIGGALVANFKDGFGAEGGSTITQQVVKNSLL
SHQKTLKRKVQEVWLSIQLERNYSKDEILEMYLNRIYFSPRAYGIGKAAEEFFGVTDLSKLTVEQAATLAGMPQSPTAYN
PVKNPDKAEKRRNIVLSLMKKQGFISDSQYNKAKKVAVKDEGVVSQKEYEKASTNKYSAFVEEVMKEIDEKSDVDPSADG
LKIYTTLDTKAQDKLDELMDGDTVGFTEGMQGGVTLLDTKNGEVRAIGAGRNQPVGGFNYATQTKAQPGSTIKPILDYGP
VIENKKWSTYEQIDDSAYTYSNGKPIRDWDRKYLGPISMRYALAQSRNIPALKAFQAVGKDTAVDFANGLGLGLTKDNVT
EAYSIGGFGGNDGVSPLTMAGAYSAFGNNGTYNEPHFVKSIEFNDGTKLDLTPKSKSAMSDYTAFMITDMLKTAVKTGTG
QLAQVPGVEVAGKTGTTNFDDNEVKRYNIASGGARDSWFVGYTPQYTAAVWTGMGENEAGKKSLSAEEQKVAKRIFAQLI
ADVDDGSGSFEKPDSVVEATVEKGSNPAKLAGPNTPSDKKLTEYFVKGTAPSTVSKTYEKEEKEETAKLSGLNVKYDKDN
QSLTLSWNYDGDATFAVKQSVDGGSYSEIQNSSAKEAVISGVQPGSVYKFEVTAVSDDGKSTASTSYEVPKAEDDEDKKD
QQQTDDEKQDDEKTQDDTQTDDSQKDDGQTDQDQTDDSTNDQDKKQDNTNTNPSDNNNQDQSNDNDNDNSNNQDTSDGDS
NSGKNDSTGSDTNKNKTDTSNKTQTNSSSIEKTN
>P02918 ~~~mrcA~~~Penicillin-binding protein 1A~~~COG5009
MKFVKYFLILAVCCILLGAGSIYGLYRYIEPQLPDVATLKDVRLQIPMQIYSADGELIAQYGEKRRIPVTLDQIPPEMVK
AFIATEDSRFYEHHGVDPVGIFRAASVALFSGHASQGASTITQQLARNFFLSPERTLMRKIKEVFLAIRIEQLLTKDEIL
ELYLNKIYLGYRAYGVGAAAQVYFGKTVDQLTLNEMAVIAGLPKAPSTFNPLYSMDRAVARRNVVLSRMLDEGYITQQQF
DQTRTEAINANYHAPEIAFSAPYLSEMVRQEMYNRYGESAYEDGYRIYTTITRKVQQAAQQAVRNNVLDYDMRHGYRGPA
NVLWKVGESAWDNNKITDTLKALPTYGPLLPAAVTSANPQQATAMLADGSTVALSMEGVRWARPYRSDTQQGPTPRKVTD
VLQTGQQIWVRQVGDAWWLAQVPEVNSALVSINPQNGAVMALVGGFDFNQSKFNRATQALRQVGSNIKPFLYTAAMDKGL
TLASMLNDVPISRWDASAGSDWQPKNSPPQYAGPIRLRQGLGQSKNVVMVRAMRAMGVDYAAEYLQRFGFPAQNIVHTES
LALGSASFTPMQVARGYAVMANGGFLVDPWFISKIENDQGGVIFEAKPKVACPECDIPVIYGDTQKSNVLENNDVEDVAI
SREQQNVSVPMPQLEQANQALVAKTGAQEYAPHVINTPLAFLIKSALNTNIFGEPGWQGTGWRAGRDLQRRDIGGKTGTT
NSSKDAWFSGYGPGVVTSVWIGFDDHRRNLGHTTASGAIKDQISGYEGGAKSAQPAWDAYMKAVLEGVPEQPLTPPPGIV
TVNIDRSTGQLANGGNSREEYFIEGTQPTQQAVHEVGTTIIDNGEAQELF
>P31776 ~~~mrcA~~~Penicillin-binding protein 1A~~~COG5009
MRIAKLILNTLLTLCILGLVAGGMLYFHLKSELQQPMQIYTADGKLIGEVGEQRRIPVKLADVPQRLIDAFLATEDSRFY
DHHGLDPIGIARALFVAVSNGGASQGASTITQQLARNFFLTSEKTIIRKAREAVLAVEIENTLNKQEILELYLNKIFLGY
RSYGVAAAAQTYFGKSLNELTLSEMAIIAGLPKAPSTMNPLYSLKRSEERRNVVLSRMLDEKYISKEEYDAALKEPIVAS
YHGAKFEFRADYVTEMVRQEMVRRFGEENAYTSGYKVFTTVLSKDQAEAQKAVRNNLIDYDMRHGYRGGAPLWQKNEAAW
DNDRIVGFLRKLPDSEPFIPAAVIGIVKGGADILLASGEKMTLSTNAMRWTGRSNPVKVGEQIWIHQRANGEWQLGQIPA
ANSALVSLNSDNGAIEAVVGGFSYEQSKFNRATQSLVQVGSSIKPFIYAAALEKGLTLSSVLQDSPISIQKPGQKMWQPK
NSPDRYDGPMRLRVGLGQSKNIIAIRAIQTAGIDFTAEFLQRFGFKRDQYFASEALALGAASFTPLEMARAYAVFDNGGF
LIEPYIIEKIQDNTGKDLFIANPKIACIECNDIPVIYGETKDKINGFANIPLGENALKPTDDSTNGEELDQQPETVPELP
ELQSNMTALKEDAIDLMAAAKNASSKIEYAPRVISGELAFLIRSALNTAIYGEQGLDWKGTSWRIAQSIKRSDIGGKTGT
TNSSKVAWYAGFGANLVTTTYVGFDDNKRVLGRGEAGAKTAMPAWITYMKTALSDKPERKLSLPPKIVEKNIDTLTGLLS
PNGGRKEYFIAGTEPTRTYLSEMQERGYYVPTELQQRLNNEGNTPATQPQELF
>P9WKD1 3.4.16.4~~~pbpA~~~Peptidoglycan D,D-transpeptidase PbpA~~~COG0768
MNASLRRISVTVMALIVLLLLNATMTQVFTADGLRADPRNQRVLLDEYSRQRGQITAGGQLLAYSVATDGRFRFLRVYPN
PEVYAPVTGFYSLRYSSTALERAEDPILNGSDRRLFGRRLADFFTGRDPRGGNVDTTINPRIQQAGWDAMQQGCYGPCKG
AVVALEPSTGKILALVSSPSYDPNLLASHNPEVQAQAWQRLGDNPASPLTNRAISETYPPGSTFKVITTAAALAAGATET
EQLTAAPTIPLPGSTAQLENYGGAPCGDEPTVSLREAFVKSCNTAFVQLGIRTGADALRSMARAFGLDSPPRPTPLQVAE
STVGPIPDSAALGMTSIGQKDVALTPLANAEIAATIANGGITMRPYLVGSLKGPDLANISTTVGYQQRRAVSPQVAAKLT
ELMVGAEKVAQQKGAIPGVQIASKTGTAEHGTDPRHTPPHAWYIAFAPAQAPKVAVAVLVENGADRLSATGGALAAPIGR
AVIEAALQGEP
>O05131 ~~~mrcA~~~Penicillin-binding protein 1A~~~
MIKKILTTCFGLFFGFCVFGVGLVAIAILVTYPKLPSLDSLQHYQPKMPLTIYSADGEVIGMYGEQRREFTKIGDFPEVL
RNAVIAAEDKRFYRHWGVDVWGVARAAVGNVVSGSVQSGASTITQQVAKNFYLSSEKTFTRKFNEVLLAYKIEQSLSKDK
ILELYFNQIYLGQRAYGFASAAQIYFNKNVRDLTLAEAAMLAGLPKAPSAYNPIVNPERAKLRQKYILNNMLEEKMITVQ
QRDQALNEELHYERFVRKIDQSALYVAEMVRRELYEKYGEDAYTQGFKVYTTVRTDHQKAATEALRKALRNFDRGSSYRG
AENYIDLSKSEDVEETVSQYLSGLYTVDKMVPAVVLDVTKKKNVVIQLPGGRRVALDRRALGFAARAVDNEKMGEDRIRR
GAVIRVKNNGGRWAVVQEPLLQGALVSLDAKTGAVRALVGGYDFHSKTFNRAVQAMRQPGSTFKPFVYSAALSKGMTAST
VVNDAPISLPGKGPNGSVWTPKNSDGRYSGYITLRQALTASKNMVSIRILMSIGVGYAQQYIRRFGFRPSELPASLSMAL
GTGETTPLKVAEAYSVFANGGYRVSSHVIDKIYDRDGRLRAQMQPLVAGQNAPQAIDPRNAYIMYKIMQDVVRVGTARGA
AALGRTDIAGKTGTTNDNKDAWFVGFNPDVVTAVYIGFDKPKSMGRAGYGGTIAVPVWVDYMRFALKGKQGKGMKMPEGV
VSSNGEYYMKERMVTDPGLMLDNSGIAPQPSRRAKEDDEAAVENEQQGRSDETRQDVQETPVLPSNTDSKQQQLDSLF
>Q07806 ~~~mrcA~~~Penicillin-binding protein 1A~~~
MRLLKFLWWTCVTLICGVLLSFSGAYLYLSPSLPSVEALRNVQLQIPLKVYSEDGKLISEFGEMRRTPIRFADIPQDFIH
ALLSAEDDNFANHYGVDVKSLMRAAAQLLKSGHIQTGGSTITMQVAKNYFLTNERSFSRKINEILLALQIERQLTKDEIL
ELYVNKIYLGNRAYGIEAAAQVYYGKPIKDLSLAEMAMIAGLPKAPSRYNPLVNPTRSTERRNWILERMLKLGFIDQQRY
QAAVEEPINASYHVQTPELNAPYIAEMARAEMVGRYGSEAYTEGYKVITTVRSDLQNAASQSVRDGLIDYDQRHGYRGPE
TRLPGQTRDAWLKHLGQQRSIGGLEPAIVTQVEKSGIMVMTRDGKEEAVTWDSMKWARPFLSNNSMGPMPRQPADVAQAG
DQIRVQRQEDGTLRFVQIPAAQSALISLDPKDGAIRSLVGGFSFEQSNYNRAIQAKRQPGSSFKPFIYSAALDNGFTAAS
LVNDAPIVFVDEYLDKVWRPKNDTNTFLGPIPLREALYKSRNMVSIRVLQGLGIERAISYITKFGFQRDELPRNFSLALG
TATVTPMEIAGAWSVFANGGYKVNPYVIERIESRDGQVLYQANPPRVPVEEQVAADAEDAGNPGDPEHPESAEGEGSIEA
QQVAAKAQTTFEPTPAERIIDARTAYIMTSMLQDVIKRGTGRRALALKRTDLAGKTGTTNDSKDGWFSGYNSDYVTSVWV
GFDQPETLGRREYGGTVALPIWIRYMGFALKDKPMHTMAEPPGIVSLRIDPVTGRSAAPGTPGAYFEMFKNEDTPPSVNE
LPPGSFPGSPLPDDEGAPIDLF
>Q04707 ~~~ponA~~~Penicillin-binding protein 1A~~~COG0744
MNKPTILRLIKYLSISFLSLVIAAIVLGGGVFFYYVSKAPSLSESKLVATTSSKIYDNKNQLIADLGSERRVNAQANDIP
TDLVKAIVSIEDHRFFDHRGIDTIRILGAFLRNLQSNSLQGGSTLTQQLIKLTYFSTSTSDQTISRKAQEAWLAIQLEQK
ATKQEILTYYINKVYMSNGNYGMQTAAQNYYGKDLNNLSLPQLALLAGMPQAPNQYDPYSHPEAAQDRRNLVLSEMKNQG
YISAEQYEKAVNTPITDGLQSLKSASNYPAYMDNYLKEVINQVEEETGYNLLTTGMDVYTNVDQEAQKHLWDIYNTDEYV
AYPDDELQVASTIVDVSNGKVIAQLGARHQSSNVSFGINQAVETNRDWGSTMKPITDYAPALEYGVYDSTATIVHDEPYN
YPGTNTPVYNWDRGYFGNITLQYALQQSRNVPAVETLNKVGLNRAKTFLNGLGIDYPSIHYSNAISSNTTESDKKYGASS
EKMAAAYAAFANGGTYYKPMYIHKVVFSDGSEKEFSNVGTRAMKETTAYMMTDMMKTVLTYGTGRNAYLAWLPQAGKTGT
SNYTDEEIENHIKTSQFVAPDELFAGYTRKYSMAVWTGYSNRLTPLVGNGLTVAAKVYRSMMTYLSEGSNPEDWNIPEGL
YRNGEFVFKNGARSTWNSPAPQQPPSTESSSSSSDSSTSQSSSTTPSTNNSTTTNPNNNTQQSNTTPDQQNQNPQPAQP
>Q8DR59 ~~~pbpA~~~Penicillin-binding protein 1A~~~COG0744
MNKPTILRLIKYLSISFLSLVIAAIVLGGGVFFYYVSKAPSLSESKLVATTSSKIYDNKNQLIADLGSERRVNAQANDIP
TDLVKAIVSIEDHRFFDHRGIDTIRILGAFLRNLQSNSLQGGSALTQQLIKLTYFSTSTSDQTISRKAQEAWLAIQLEQK
ATKQEILTYYINKVYMSNGNYGMQTAAQNYYGKDLNNLSLPQLALLAGMPQAPNQYDPYSHPEAAQDRRNLVLSEMKNQG
YISAEQYEKAVNTPITDGLQSLKSASNYPAYMDNYLKEVINQVEEETGYNLLTTGMDVYTNVDQEAQKHLWDIYNTDEYV
AYPDDELQVASTIVDVSNGKVIAQLGARHQSSNVSFGINQAVETNRDWGSTMKPITDYAPALEYGVYESTATIVHDEPYN
YPGTNTPVYNWDRGYFGNITLQYALQQSRNVPAVETLNKVGLNRAKTFLNGLGIDYPSIHYSNAISSNTTESDKKYGASS
EKMAAAYAAFANGGTYYKPMYIHKVVFSDGSEKEFSNVGTRAMKETTAYMMTDMMKTVLSYGTGRNAYLAWLPQAGKTGT
SNYTDEEIENHIKTSQFVAPDELFAGYTRKYSMAVWTGYSNRLTPLVGNGLTVAAKVYRSMMTYLSEGSNPEDWNIPEGL
YRNGEFVFKNGARSTWSSPAPQQPPSTESSSSSSDSSTSQSSSTTPSTNNSTTTNPNNNTQQSNTTPDQQNQNPQPAQP
>P02919 ~~~mrcB~~~Penicillin-binding protein 1B~~~COG0744
MAGNDREPIGRKGKPTRPVKQKVSRRRYEDDDDYDDYDDYEDEEPMPRKGKGKGKGRKPRGKRGWLWLLLKLAIVFAVLI
AIYGVYLDQKIRSRIDGKVWQLPAAVYGRMVNLEPDMTISKNEMVKLLEATQYRQVSKMTRPGEFTVQANSIEMIRRPFD
FPDSKEGQVRARLTFDGDHLATIVNMENNRQFGFFRLDPRLITMISSPNGEQRLFVPRSGFPDLLVDTLLATEDRHFYEH
DGISLYSIGRAVLANLTAGRTVQGASTLTQQLVKNLFLSSERSYWRKANEAYMALIMDARYSKDRILELYMNEVYLGQSG
DNEIRGFPLASLYYFGRPVEELSLDQQALLVGMVKGASIYNPWRNPKLALERRNLVLRLLQQQQIIDQELYDMLSARPLG
VQPRGGVISPQPAFMQLVRQELQAKLGDKVKDLSGVKIFTTFDSVAQDAAEKAAVEGIPALKKQRKLSDLETAIVVVDRF
SGEVRAMVGGSEPQFAGYNRAMQARRSIGSLAKPATYLTALSQPKIYRLNTWIADAPIALRQPNGQVWSPQNDDRRYSES
GRVMLVDALTRSMNVPTVNLGMALGLPAVTETWIKLGVPKDQLHPVPAMLLGALNLTPIEVAQAFQTIASGGNRAPLSAL
RSVIAEDGKVLYQSFPQAERAVPAQAAYLTLWTMQQVVQRGTGRQLGAKYPNLHLAGKTGTTNNNVDTWFAGIDGSTVTI
TWVGRDNNQPTKLYGASGAMSIYQRYLANQTPTPLNLVPPEDIADMGVDYDGNFVCSGGMRILPVWTSDPQSLCQQSEMQ
QQPSGNPFDQSSQPQQQPQQQPAQQEQKDSDGVAGWIKDMFGSN
>A0R022 ~~~pbpB~~~Penicillin-binding protein PbpB~~~COG0768
MSRRGDRPRTPAQPRKKARVDQPRSARTRRTRVSEAEAGLRSSSFVFRHRTGNLAILAVLVIAAVQLFMLQVPRAAGLRA
EAASQLKVTDITPAIRGSIIDRNNDKLAFTIEARALTFQPTRVRKQLDEAWRKAQEAGSSTSDDVPNPDERLNEIAKEIA
ARLNNTPDAKTVLKKLKSNETFVYLARAVDPAIANAITDKFPEVGSERQDLRQYPGGSLAANIVGGIDWDGHGLLGLEDS
LDAVLAGTDGSVTYDRGSDGVVIPGSYRNRHDAVDGSTVQLTIDDDIQYHVQQQVQMAKDASGAKNVSAVVLDAKTGEVL
AMSNDNTFDPSQDIGRQADRQMGNPSVSSPFEPGSVNKIVTAAAAIENGLTNPDEVLQVPGSIHMGGVTVRDAWNHGVMP
YTTTGVFGKSSNVGTLMLAQRVGPERFYEMLRKFGLGQRTNVGLPGESSGLLPPIDQWSGSSFSNLPIGQGLSMTLLQMA
AMYQTVANDGVRVPPRIIKSTIAPDGTVTEEERPEGIRVISPETARTLRSMFRSVVQRDPMGVQQGTGPQAAVEGYQIAG
KTGTAQQINPACGCYYDDVYWITFAGIAPADDPRYVIGIMMDAPQRAADGSPGSSAAPLFHEIASWLLQRHNVPLSPDPG
PPLTLQAT
>L0T911 ~~~pbpB~~~Penicillin-binding protein PbpB~~~COG0768
MSRAAPRRASQSQSTRPARGLRRPPGAQEVGQRKRPGKTQKARQAQEATKSRPATRSDVAPAGRSTRARRTRQVVDVGTR
GASFVFRHRTGNAVILVLMLVAATQLFFLQVSHAAGLRAQAAGQLKVTDVQPAARGSIVDRNNDRLAFTIEARALTFQPK
RIRRQLEEARKKTSAAPDPQQRLRDIAQEVAGKLNNKPDAAAVLKKLQSDETFVYLARAVDPAVASAICAKYPEVGAERQ
DLRQYPGGSLAANVVGGIDWDGHGLLGLEDSLDAVLAGTDGSVTYDRGSDGVVIPGSYRNRHKAVHGSTVVLTLDNDIQF
YVQQQVQQAKNLSGAHNVSAVVLDAKTGEVLAMANDNTFDPSQDIGRQGDKQLGNPAVSSPFEPGSVNKIVAASAVIEHG
LSSPDEVLQVPGSIQMGGVTVHDAWEHGVMPYTTTGVFGKSSNVGTLMLSQRVGPERYYDMLRKFGLGQRTGVGLPGESA
GLVPPIDQWSGSTFANLPIGQGLSMTLLQMTGMYQAIANDGVRVPPRIIKATVAPDGSRTEEPRPDDIRVVSAQTAQTVR
QMLRAVVQRDPMGYQQGTGPTAGVPGYQMAGKTGTAQQINPGCGCYFDDVYWITFAGIATADNPRYVIGIMLDNPARNSD
GAPGHSAAPLFHNIAGWLMQRENVPLSPDPGPPLVLQAT
>P42971 3.4.16.4~~~pbpC~~~Penicillin-binding protein 3~~~COG0768
MLKKCILLVFLCVGLIGLIGCSKTDSPEDRMEAFVKQWNDQQFDDMYQSLTKDVKKEISKKDFVNRYKAIYEQAGVKNLK
VTAGEVDKDDQDNKTMKHIPYKVSMNTNAGKVSFKNTAVLKLEKTDDEESWNIDWDPSFIFKQLADDKTVQIMSIEPKRG
QIYDKNGKGLAVNTDVPEIGIVPGELGDKKEKVIKELAKKLDLTEDDIKKKLDQGWVKDDSFVPLKKVKPDQEKLVSEAT
SLQGVTRTNVSSRYYPYGEKTAHLTGYVRAITAEELKKKKEGTYSDTSNIGIAGLENVYEDKLRGTTGWKIYVPQTGEVI
AEKKAKDGEDLHLTIDIKTQMKLYDELKDDSGAAVALQPKTGETLALVSAPSYDPNGFIFGWSDKEWKKLNKDKNNPFSA
KFNKTYAPGSTIKPIAAAIGIKNGTLKADEKKTIKGKEWQKDSSWGGYSVTRVSERLQQVDLENALITSDNIYFAQNALD
MGADTFTKGLKTFGFSEDVPYEFPIQKSSIANDKLDSDILLADTGYGQGQMQMSPLHLATAYTPFVDNGDLVKPTLIKKD
SQTADVWHKQVVTKEGAADITKGLKGVVEDERGSAYQPVVKGITVAGKTGTAELKTSKDDKDGTENGWFVGYDYENKDLL
VAMMIQNVQDRGGSHYVVEKAKKQFQSN
>P76577 ~~~pbpC~~~Penicillin-binding protein 1C~~~COG4953
MPRLLTKRGCWITLAAAPFLLFLAAWGADKLWPLPLHEVNPARVVVAQDGTPLWRFADADGIWRYPVTIEDVSPRYLEAL
INYEDRWFWKHPGVNPFSVARAAWQDLTSGRVISGGSTLTMQVARLLDPHPKTFGGKIRQLWRALQLEWHLSKREILTLY
LNRAPFGGTLQGIGAASWAYLGKSPANLSYSEAAMLAVLPQAPSRLRPDRWPERAEAARNKVLERMAVQGVWSREQVKES
REEPIWLAPRQMPQLAPLFSRMMLGKSKSDKITTTLDAGLQRRLEELAQNWKGRLPPRSSLAMIVVDHTDMRVRGWVGSV
DLNDDSRFGHVDMVNSIRSPGSVLKPFVYGLALDEGLIHPASLLQDVPRRTGDYRPGNFDSGFHGPISMSEALVRSLNLP
AVQVLEAYGPKRFAAKLRNVGLPLYLPNGAAPNLSLILGGAGAKLEDMAAAYTAFARHGKAGKLRLQPDDPLLERPLMSS
GAAWIIRRIMADEAQPLPDSALPRVAPLAWKTGTSYGYRDAWAIGVNARYVIGIWTGRPDGTPVVGQFGFASAVPLLNQV
NNILLSRSANLPEDPRPNSVTRGVICWPGGQSLPEGDGNCRRRLATWLLDGSQPPTLLLPEQEGINGIRFPIWLDENGKR
VAADCPQARQEMINVWPLPLEPWLPASERRAVRLPPASTSCPPYGHDAQLPLQLTGVRDGAIIKRLPGAAEATLPLQSSG
GAGERWWFLNGEPLTERGRNVTLHLTDKGDYQLLVMDDVGQIATVKFVMQ
>P40750 ~~~pbpD~~~Penicillin-binding protein 4~~~COG0744
MTMLRKIIGWILLLCIIPLFAFTVIASGKEVKQMKSLDQVLDKNIDLKDISLVQNSYMYDRDGSLVSEIVSDHENRVLVP
FNKIPEEVKQIFLTSEDRHFYEHKGFDFMGMVRATASNVKDKKIDQGASTITQQLSRNLYLSHERSFSRKLTELAYSYQL
EKKYTKNEILEAYLNTIYFNNGVYGVGSAAQFYFSKPLKSLTVGEMAFICAIPNNPTLYDPLKHFDYTKSRQERLLKGLK
DAGVITDKELKKAVKQKIKLDVEKREDKYPDYVSYVNDEFTQLVSESEGFDKRLQKASGKQKEKIENELSARVSTLMKDG
VKIYTALDPYMQNQVVAQMNSKLPYADVQGGAAVINHQTHQIIALSGGKNYQKYDFNRAYQAYRQPGSSIKPLLDYGPYI
EQTGATTSSTIDASKFCSKDYCPQNYNNRTYGTVTLDTAFKNSYNTPAIRMLDRVGIQKAFSYIEPYHFAKLVDSDYLLP
AALGGFTNGMTPLEMTKAYTTFGNSGSYTPSHAITKVTDLKGKTLYKWNDKATQIFSVRTNMQLKKLMSSVVKSGTGKKA
YFNAPYIGGKTGTSNDYHDMWFVGLTDTYTMGVWVGKDTPTSVEYLHSISPQLSIWKGTLQAAY
>P32959 ~~~pbpE~~~Penicillin-binding protein 4*~~~COG1680
MKQNKRKHLQTLFETLGEKHQFNGTVLAAEGGDILYHHSFGYAEMTEKRPLKTNSLFELASLSKPFTALGIILLEEKGIL
GYEDKVDRWLPGFPYQGVTIRHLLNHTSGLPDYMGWFFANWDSHKIAVNQDIVDMLMNEGLSGYFEPNEGWMYSNTGYVL
LAVIIEKASGMSYADFIKTSIFLPAGMNETRVYNRRLSPERIDHYAYGYVYDVHSETYVLPDELEETNYVVYLDGIQGDG
TVNSVTSDLFRFDQALYQDDFISKASKESAFSPVRLNNGETIDYGFGWVLQNSPEKGRIVSHSGGWPGYSTMMIRYIDHR
KTLIYLSNKEEDTEYEQAILKAAEHILFGQPYDVPERPADKKKKAIDTAIYSRYVGSYLLQDGTAAQVTTENERLYLEIA
GQLRLELFPSSETRFFLRALSVEVEFTLGEDAAKSFILYEDGSEEEAVRTK
>P38050 ~~~pbpF~~~Penicillin-binding protein 1F~~~COG0744
MFKIKKKKLFIPIIILVLTAFLALIGYISIIFLGHYVIDEKKLILHASSKIVDQNGDEVASLYTENREPVSINEIPKQVR
EAFIAVEDKRFYEHHGIDAKSVGRAVYRDILAGGKVEGGSTITQQLAKNIFLTHDKTFLRKTKEVIIAINLERDYSKDKL
LEMYLNQLYFGHGVYGIQAASHYYFNKEVKDLTVSEGAVLAAIPKAPSTYSPILHPDKNKERRDTILGMMNDQGYISAKE
AVTAQGRTLGLHVKKQSETPWFDSYIDLVIKEAEDKYSISGEQLLQGGYTIKVPLDSKLQKTAYQVMKEGSYYPGTDQNA
EGSAVFINNKTGGVEAAIGGRDYTSKGYNRVTAVRQPGSTFKPLAVYGPAMQEKKFKPYSLLKDELQSYGDYTPKNYDSR
YEGEVTMSDAITYSKNAPAVWTLNEIGVETGKSYLKANGIDIPDEGLALALGGLEKGVSPLQLAGAFHTFAANGTYTEPF
FISSIIDEDGETIADHKEEGKRVFSKQTSWNMTRMLQQVVKKGTATSGTYHGDLAGKTGSTSYTGVSGATKDAWFAGYTP
KITGAVWMGYDKTDQNHYLKAGSSYPTRLFKDILTQAGETGHVFTKPKNVKELESPIELKPVKTLTADYTFKAAGLFTIE
LKWDAQEDDRAVYRIYVNKDGEETLLDSVEGKGSYEIPYANLFSGASYKIVPYNTQTKREGEGTDYVQPKLFSS
>P70997 ~~~pbpG~~~Penicillin-binding protein 2D~~~COG0744
MDAMTNKRLRLTLKTVRAFIFLGAFAALAAAAVFMTVILIAKYQGAPSVQVPQSTILYASDGSKLGETNYGEKRYWVPLK
DMNPTIVKATVAIEDQNFYDHHGFDYKRMAGAALADLKAFAKVQGASTITQQYARNLYLEHDKTWKRKWNEAFYTIRLEQ
NYSKDEILEGYLNTIYYGHGAYGIEAASRLYFGKHAKNLTDAEAALLAGIPKGPSGYSPYVNETKAKERQKTIVRMMEKQ
QMISQKKADELIKEPLSYQPLNKQVSKRKAPYFYDNAMRELEKKLGMTREQIETSGLNVYTTVDKRMQRIAEETITETVN
AGSDIQVGFSAIDPRTGNVLALVGGRDYQKSPFDRTTQAKRQPASTIKPLLYYKAIQSGFTPVTLMKSEETEFQIDAKGE
TYSPSNYNGYYANKPITLLQALALSDNIYAVKTHLFLGTNKLVKTAKEFGITAHLQALPSLALGTEPVRPIEMVNAYAML
ANGGKKIEPTFISRVTDAAGHVLYENPNQHKQVLDEKAAFVTASMMTGMFDIDLNGYTSVTGRTIANRLTRTYAGKSGTT
SADSWMIGFNPKLAAGVWTGYDKNSTIDSVEEKSYAKTIWADFMEDALKGEPETAFKPPKGVTGVYIDPETGYSSGPGCA
AKHYTYFVKGTEPANVCYGAEPAKQTKDRLPSKEKPASEKKWWDKWLGRHH
>Q796K8 3.4.16.4~~~pbpH~~~Penicillin-binding protein H~~~COG0768
MTEIGREPKKKSKGNRAIRMNLFFLAVFVLFTALIFKLGVVQIVEGEQHEEDAEKSNAKTAYYPAPRGKMYDRNQKVAVD
NQSVPEIVYVSTSSTKTEDKIKTAKRLASFIHIDTEFLKERDLRDYWIAAHPKKAAALLKESESNLKGDQAYKLQIERVP
DQELKAIQQDDEEMETAAIYTRFSSGNAYEPQIVKAMNPNKSNSNGKNGALLDEKKNSSQRPKNDLTYDEISIVSEHLEE
LPGIDIVNDWTRKYPYDKTLYSVFGGVTTPDQGLLSDRKDFYLTRGYANNDRVGKSYLEYQYEEYLNSHKEKVEYVEDNK
GNVVSQKTIDKGSRGYDLQLSFDMELQAKVEKIIEEEVRNSRARGNYMLDRAFAVMMDPNNGDILSMAGKKIDLKTNKIE
DYAIGAFTTQYEMGSAVKGATVLAGYQDGIPHYKYYIDAPMLLGTNLIKKSYTNMGTINELTALQKSSNVYMFNVAMHIA
GVTYKPHGSLPADQNDLLKMRNYYSQFGLGVKTGIDLPQESAGMQTTPKTVGGLILDLAIGQYDTYTPLQMAQYISVIAN
GGYRVQPRIVTSIHKPGKKDQLGKAIEHRKPKVLNKINNSESDLKQVQTGMKLVTSSGTAKNTFTEDVSGKTGTAETFYY
GTNRNWWGKKTYNLTFVGYYPSKKPKVAFSVVVPSVDDDDKINKIIAKRAIHAYAELEKKHSKK
>O32032 3.4.16.4~~~pbpI~~~Penicillin-binding protein 4B~~~COG0768
MKISKRMKLAVIAFLIVFFLLLLRLAEIQLFFTESFSKKKINLIQESVKQRTEEVLISDGRGSFLDRNGRALTGQSEPAV
VLFPFLLTQDWPIKKVADILGMSEDDLRQTLGQAKKPVILQQKKIKTLSKQSITKINSLKYPGIYGVYMENEDKPSLASH
TIGSTNQDPALLRKKYPDKESLPITTEIGTTGLERTFDEFLLPEQDTKLLYHVDGKGNPLFGMDVKYTAEANTFYPLQIK
TTIDQSIQKAMEEVLDEQGLKKGGAVLLDIENSSVLGIVSKPDADVSRQNTLQNYMLTPIYPGSVFKTVIAAAAIENNMV
KPSQTFNCNLNLYGEPGDDKGTLSFDESFAQSCNYTFTSLAEQLMKKDSSVIEDMSEKLALTDRAGWEGKLYHETDFRQL
YNEKSGVIWGDEKDKSVKKAIAQTAIGQKNVKVTPLEVANMMATIARGGEKRQVKIAEQIEYKNGTTLVTFKDQKLKGET
IDKYTSQQLQKILRRVVESPSGTGRRFQDLPYTVAGKSGTAQTGKLSKEKETLYEKWFAGYFPADKPKYALVVLHMDTPG
DKALTNSVFYDIVKKVHEIEINQK
>O31773 ~~~pbpX~~~Putative penicillin-binding protein PbpX~~~COG1680
MTSPTRRRTAKRRRRKLNKRGKLLFGLLAVMVCITIWNALHRNSEENEPSQETAAVSNTDQKKEVKKKTAKKSEEQIKTV
DRNQKISNYLKEIGFSGTAMIVRNGEIVTNKGFGYADRKHYIQNNPLTSFYVGSSQKALIATAILQLEEKGKLQTSDPVS
TYLPHFPNGQTITLKNLLTHTSGINGHIEGNGAITPDDLIKDIELQGIKRQPGVWDYKDSNYSVLAYIIAEVSGEPYEQY
IKNHIFKPAGMTHAGFYKTYEKEPYPAVGYKMEGSKTVTPYIPDLSQLYGAGDIYMSAIDMYKFDQALIDGKLYSQKSYE
KMFTPGSSSTYGMGFYVAPGSYSNHGVMPGFNILNSFSKSGQTIVILFSNIQNNAKLGQVNNKIYQLLNQE
>P14677 ~~~pbpX~~~Penicillin-binding protein 2x~~~COG0768
MKWTKRVIRYATKNRKSPAENRRRVGKSLSLLSVFVFAIFLVNFAVIIGTGTRFGTDLAKEAKKVHQTTRTVPAKRGTIY
DRNGVPIAEDATSYNVYAVIDENYKSATGKILYVEKTQFNKVAEVFHKYLDMEESYVREQLSQPNLKQVSFGAKGNGITY
ANMMSIKKELEAAEVKGIDFTTSPNRSYPNGQFASSFIGLAQLHENEDGSKSLLGTSGMESSLNSILAGTDGIITYEKDR
LGNIVPGTEQVSQRTMDGKDVYTTISSPLQSFMETQMDAFQEKVKGKYMTATLVSAKTGEILATTQRPTFDADTKEGITE
DFVWRDILYQSNYEPGSTMKVMMLAAAIDNNTFPGGEVFNSSELKIADATIRDWDVNEGLTGGRTMTFSQGFAHSSNVGM
TLLEQKMGDATWLDYLNRFKFGVPTRFGLTDEYAGQLPADNIVNIAQSSFGQGISVTQTQMIRAFTAIANDGVMLEPKFI
SAIYDPNDQTARKSQKEIVGNPVSKDAASLTRTNMVLVGTDPVYGTMYNHSTGKPTVTVPGQNVALKSGTAQIADEKNGG
YLVGLTDYIFSAVSMSPAENPDFILYVTVQQPEHYSGIQLGEFANPILERASAMKDSLNLQTTAKALEQVSQQSPYPMPS
VKDISPGDLAEELRRNLVQPIVVGTGTKIKNSSAEEGKNLAPNQQVLILSDKAEEVPDMYGWTKETAETLAKWLNIELEF
QGSGSTVQKQDVRANTAIKDIKKITLTLGD
>P59676 ~~~pbpX~~~Penicillin-binding protein 2X~~~COG0768
MKWTKRVIRYATKNRKSPAENRRRVGKSLSLLSVFVFAIFLVNFAVIIGTGTRFGTDLAKEAKKVHQTTRTVPAKRGTIY
DRNGVPIAEDATSYNVYAVIDENYKSATGKILYVEKTQFNKVAEVFHKYLDMEESYVREQLSQPNLKQVSFGAKGNGITY
ANMMSIKKELEAAEVKGIDFTTSPNRSYPNGQFASSFIGLAQLHENEDGSKSLLGTSGMESSLNSILAGTDGIITYEKDR
LGNIVPGTEQVSQRTMDGKDVYTTISSPLQSFMETQMDAFQEKVKGKYMTATLVSAKTGEILATTQRPTFDADTKEGITE
DFVWRDILYQSNYEPGSTMKVMMLAAAIDNNTFPGGEVFNSSELKIADATIRDWDVNEGLTGGRMMTFSQGFAHSSNVGM
TLLEQKMGDATWLDYLNRFKFGVPTRFGLTDEYAGQLPADNIVNIAQSSFGQGISVTQTQMIRAFTAIANDGVMLEPKFI
SAIYDPNDQTARKSQKEIVGNPVSKDAASLTRTNMVLVGTDPVYGTMYNHSTGKPTVTVPGQNVALKSGTAQIADEKNGG
YLVGLTDYIFSAVSMSPAENPDFILYVTVQQPEHYSGIQLGEFANPILERASAMKDSLNLQTTAKALEQVSQQSPYPMPS
VKDISPGDLAEELRRNLVQPIVVGTGTKIKNSSAEEGKNLAPNQQVLILSDKAEEVPDMYGWTKETAETLAKWLNIELEF
QGSGSTVQKQDVRANTAIKDIKKITLTLGD
>Q2YKI6 ~~~~~~Purine-binding protein BAB2_0673~~~
MVIATVAGFMLGGAAHAEEKLKVGFIYIGPPGDFGWTYQHDQARKELVEALGDKVETTFLENVAEGADAERSIKRIARAG
NKLIFTTSFGYMDPTVKVAKKFPDVKFEHATGYKTADNMSAYNARFYEGRYVQGVIAAKMSKKGIAGYIGSVPVPEVVQG
INSFMLGAQSVNPDFRVKVIWVNSWFDPGKEADAAKALIDQGVDIITQHTDSTAAIQVAHDRGIKAFGQASDMIKFAPDT
QLTAVVDEWGPYYIDRAKAVLDGTWKSQNIWWGMKEGLVKMAPFTNMPDDVKKLAEETEARIKSGELNPFTGPIKKQDGS
EWLKAGEKADDQTLLGMNFYVAGVDDKLPQ
>P07944 ~~~pbp~~~Beta-lactam-inducible penicillin-binding protein~~~
MKKIKIVPLILIVVVVGFGIYFYASKDKEINNTIDAIEDKNFKQVYKDSSYISKSDNGEVEMTERPIKIYNSLGVKDINI
QDRKIKKVSKNKKRVDAQYKIKTNYGNIDRNVQFNFVKEDGMWKLDWDHSVIIPGMQKDQSIHIENLKSERGKILDRNNV
ELANTGTHMRLGIVPKNVSKKDYKAIAKELSISEDYINNKWIKIGYKMIPSFHFKTVKKMDEYLSDFAKKFHLTTNETES
RNYPLGKATSHLLGYVGPINSEELKQKEYKGYKDDAVIGKKGLEKLYDKKLQHEDGYRVTIVRVDDNSNTIAHTLIEKKK
KDGKDIQLTIDAKVQKSIYNNMKNDYGSGTAIHPQTGELLALVSTPSYDVYPFMYGMSNEEYNKLTEDKKEPLLNKFQIT
TSPGSTQKILTAMIGLNNKTLDDKTSYKIDGKGWQKDKSWGGYNVTRYEVVNGNIDLKQAIESSDNIFFARVALELGSKK
FEKGMKKLGVGEDIPSDYPFYNAQISNKNLDNEILLADSGYGQGEILINPVQILSIYSALENNGNINAPHLLKDTKNKVW
KKNIISKENINLLNDGMQQVVNKTHKEDIYRSYANLIGKSGTAELKMKQGETGRQIGWFISYDKDNPNMMMAINVKDVQD
KGMASYNAKISGKVYDELYENGNKKYDIDE
>Q0GQS6 ~~~pbuE~~~Purine efflux pump PbuE~~~COG2814
MNFKVFLLAASTIAVGLVELIVGGILPQIASDLDISIVSAGQLISVFALGYAVSGPLLLAVTAKAERKRLYLIALFVFFL
SNLVAYFSPNFAVLMVSRVLASMSTGLIVVLSLTIAPKIVAPEYRARAIGIIFMGFSSAIALGVPVGIIISNAFGWRVLF
LGIGVLSLVSMLIISVFFEKIPAEKMIPFREQIKTIGNAKIASAHLVTLFTLAGHYTLYAYFAPFLETTLHLSSVWVSVC
YFLFGLSAVCGGPFGGWLYDRLGSFKSIMLVTVSFALILFILPLSTVSLIVFLPAMVIWGLLSWSLAPAQQSYLIKIAPE
SSDIQQSFNTSALQIGIALGSAIGGGVIGQTGSVTATAWCGGLIVIIAVSLAVFSLTRPALKRKSA
>Q797E3 ~~~pbuE~~~Purine efflux pump PbuE~~~COG2814
MNFKVFLLAASTIAVGLVELIVGGILPQIANDLDISIVSAGQLISVFALGYAVSGPLLLALTAKIERKRLYLIALFVFFL
SNLVAYFSPNFATLMVSRVLAAMSTGLIVVLSLTIAPKIVAPEYRARAIGIIFMGFSSAIALGVPLGILISDSFGWRILF
LGIGLLALISMLIISIFFERIPAEKMIPFREQLKTIGNLKIASSHLVTMFTLAGHYTLYAYFAPFLEETLHLSSFWVSIC
YFLFGISAVCGGPFGGALSDRLGSFKSILLVTGSFAIIMFLLPLSTSSMIFFLPVMVIWGLLSWSLAPAQQSYLIEIAPD
SSDIQQSFNTSALQVGIALGSAIGGVVLDQTGTVVSTAWCGGSIVIIAVLFAFISLTRPVQTAKKSSL
>O34987 ~~~pbuG~~~Guanine/hypoxanthine permease PbuG~~~COG2252
MKTFFQFDELGTSYRNEIIGGLTTFLSMAYILFVNPITLALESVKDFPEALRIDQGAVFTATALASAAGCILMGLIARYP
IAIAPGMGLNAFFAFSVVLGMGISWQAALSGVFISGLIFVALSLTGFREKIINAIPPELKLAVGAGIGLFITFVGLQGSG
IITANPSTLVTIGNIHSGPVLLTIFGVIVTVILMVLRVNAGVFIGMLLTAVAGMIFGLVPVPTQIIGSVPSLAPTFGQAW
IHLPDIFSVQMLIVILTFLFVGFFDTAGTLVAVATQAGLMKENKLPRAGRALLADSSSIVIGAVLGTSTTTSYVESSSGV
AAGARSGFAAIVTGILFLLATFFSPLLSVVTSNVTAPALIIVGALMVAPLGKIAWDKFEVAVPAFLTMIMMPLTYSIATG
IAIGFIFYPITMVCKGKAKEVHPIMYGLFVVFILYFIFLK
>O34978 ~~~pbuO~~~Guanine/hypoxanthine permease PbuO~~~COG2252
MFHLKEQQTSIKQEIIAGLTTFFTMVYIVVVNPVILANAGVPFDQVFTATIIASIVGTLWMALAANYPIAIAPGMGLNAY
LAFHVVSASDGGITYATAFSAVFTAGVLFIILSLTPLRKQLIEAIPNNLKYGITTGIGLFIAFIGLRQAGIVAADESNLV
TLGNLHSPGVILTLVGLLISVVLMVLNVSGALFIGMAATALIAFFTGQLHFSKGFMSLPHLPEGLMISNPFTAFGDVIHH
GLYAVVFSFLLVTIFDTTGTMIGVAEQAGLMKNNKLPNVRKALLADSTATTVGAVFGTSPTTAFIESSAGVAAGGRTGLT
ALTVAVMFAASMFFSPLVSALSGIAAITSPALIIVGSLMMGSVSNMNWKEMDEAFPAFLVILAMPLTSSISTGIALGFIS
YPIVKAARGKWREIHPLVIVFAILFFIQLFIL
>Q9HY16 ~~~potD~~~Putrescine/cadaverine-binding protein~~~
MMKKLLLVATLMAGAAQATAAEKLYLFNWNDYIAEDTLKRFEQQCGCELVQEFYSGTEEMMAKLAAGASGYDVIIPTQNA
VEALIRKGDLLELDKSRLANLSNEAAGYLDKDFDKGNRYSLPYAFTTTLVGYNKTELDKLGIDPADWSVIFDPAVLEKIK
GRVTVMDDPQELFGAALKYLGHSANDTDPQHWKEAQALILAAKPYWAAFNSSSYIKELTLGNIWVAHGYSSDMYQARADA
EAAGRAFKVDFALPRQGAVLAIDNMVIHKGSKNPDLAYRFIDFMLDGRNASELTNQIGTGTPNAAALPFIKPEIKTLAAL
FPDATTQARLEPLKDLNSRQRRALNKLWTEIKLR
>Q59092 5.5.1.2~~~pcaB~~~3-carboxy-cis,cis-muconate cycloisomerase~~~COG0015
MSQLYASLFYQRDVTEIFSDRALVSYMVEAEVALAQAQAQVGVIPQSAATVIERAAKTAIDKIDFDALATATGLAGNIAI
PFVKQLTAIVKDADEDAARYVHWGATSQDILDTACILQCRDALAIVQNQVQQCYETALSQAQTYRHQVMMGRTWLQQALP
ITLGHKLARWASAFKRDLDRINAIKARVLVAQLGGAVGSLASLQDQGSIVVEAYAKQLKLGQTACTWHGERDRIVEIASV
LGIITGNVGKMARDWSLMMQTEIAEVFEPTAKGRGGSSTMPHKRNPVAAASVLAAANRVPALMSSIYQSMVQEHERSLGA
WHAEWLSLPEIFQLTAGALERTLDVLKGMEVNAENMHQNIECTHGLIMAEAVMMALAPHMGRLNAHHVVEAACKTAVAEQ
KHLKDIISQVDEVKQYFNPSQLDEIFKPESYLGNIQDQIDAVLQEAKGEAK
>Q43974 2.3.1.174~~~pcaF~~~Beta-ketoadipyl-CoA thiolase~~~COG0183
MKHAYIVDAIRTPFGRYAGGLAAVRADDLGAIPIAALIERNPSVNWAQVDDVIYGCANQAGEDNRNVGRMSALLAGLPVE
VPATTVNRLCGSSLDAIAMAARAIKAGEAHLIIAGGVESMSRAPYVMGKSEGAFGRTQKIEDTTMGWRFINPKLKAMYGV
DTMPQTAENVAEQFGIQREDQDQFAYTSQQRTAAAQAKGYFAKEIVPVTIPQRKGEPVVIDTDEHPRASTTLEGLAKLKG
VVKPEGSVTAGNASGINDGAAAVLIASDEAVAQYQLKARAKIIASTTVGIEPRIMGFAPAPAIKKLLKQANLTLDQMDVI
ELNEAFAAQALACTRDLGLADDDARVNPNGGAIALGHPLGASGARLVTTALNQLEQSGGKYALCSMCIGVGQGIALIIER
V
>Q8VPF1 2.3.1.174~~~pcaF~~~Beta-ketoadipyl-CoA thiolase~~~COG0183
MSREVYICDAVRTPIGRFGGSLAAVRADDLAAVPVKALVERNPQVDWSQLDEVYLGCANQAGEDNRNVARMALLLAGLPD
SVPGVTLNRLCASGMDAVGTAFRAIASGEAELVIAGGVESMSRAPYVMGKADSAFGRGQKIEDTTIGWRFINPLMKAQYG
VDAMPETADNVADDYKVSRADQDAFALRSQQLAGRAQAAGYFAEEIVPVVIKGKKGETVVDADEHLRPDTTLEALAKLKP
VNGPDKTVTAGNASGVNDGSVALILASAEAVKKHGLKARAKVLGMASAGVAPRVMGIGPVPAVRKLLERLNLSVADFDVI
ELNEAFAAQGLAVTRELGIADDDARVNPNGGAIALGHPLGASGARLVLTAVHQLEKSGGQRGLCTMCVGVGQGVALAVER
V
>Q43973 2.8.3.6~~~pcaI~~~3-oxoadipate CoA-transferase subunit A~~~COG1788
MIDKSAATLTEALSQIHDGATILIGGFGTAGQPAELIDGLIELGRKNLTIVSNNAGNGDYGLAKLLKTGAVKKIICSFPR
QADSYVFDELYRAGKIELEIVPQGNLACRIQAAGMGLGPIYTPTGFGTLLAEGKPTLNFDGKDYVLENPIKADFALIKAY
KGDRWGNLVYRKSARNFGPIMAMAANVTIAQVSEVVALGELDPENVVTPGIFVQHVVPVQSTPASAAP
>Q01103 2.8.3.6~~~pcaI~~~3-oxoadipate CoA-transferase subunit A~~~
MINKTYESIASAVEGITDGSTIMVGGFGTAGMPSELIDGLIATGARDLTIISNNAGNGEIGLAALLMAGSVRKVVCSFPR
QSDSYVFDELYRAGKIELEVVPQGNLAERIAAAGSGIGAFFSPTGYGTLLAEGKETREIDGRMYVLEMPLHADFALIKAH
KGDRWGNLTYRKAARNFGPIMAMAAKTAIAQVDQVVELGELDPEHIITPGIFVQRVVAVSGAAASSIAKAI
>Q59091 2.8.3.6~~~pcaJ~~~3-oxoadipate CoA-transferase subunit B~~~COG2057
MSYHKLTRDQIAQRVAQDIPEGSYVNLGIGLPTKIASYLPADKDVFLHSENGLLAFGPPPAAGEEDPELINAGKEYVTML
EGGCFFHHGDSFAMMRGGHLDICVLGAFQIAANGDLANWHTGAPDAIPSVGGAMDLAVGAKKVFVTTDHVTKKGEPKIVA
ELTYPATGQKCVDRIYTDLCIIDVVPEGLKVIEKVEGLSFEELQRLTGATLIDATQG
>P0A102 2.8.3.6~~~pcaJ~~~3-oxoadipate CoA-transferase subunit B~~~
MTITKKLSRTEMAQRVAADIQEGAYVNLGIGAPTLVANYLGDKEVFLHSENGLLGMGPSPAPGEEDDDLINAGKQHVTLL
TGGAFFHHADSFSMMRGGHLDIAVLGAFQVSVKGDLANWHTGAEGSIPAVGGAMDLATGARQVFVMMDHLTKTGESKLVP
ECTYPLTGIACVSRIYTDLAVLEVTPEGLKVVEICADIDFDELQKLSGVPLIK
>Q43975 ~~~pcaK~~~4-hydroxybenzoate transporter PcaK~~~COG2271
MPKEANMASQDYATQRSSLDAQALINDAPLSRYQWLIAIVCFLIVFVDGIDTAAMGFIAPALAQDWGVDRSQLGPVMSAA
LGGMIIGALVSGPTADRFGRKIVLSMSMLVFGGFTLACAYSTNLDSLVIFRFLTGIGLGAAMPNATTLFSEYCPARIRSL
LVTCMFCGYNLGMAIGGFISSWLIPAFGWHSLFLLGGWAPLILMLLVIFFLPESYRFLIVKGKNTKKVRQILSRIAPQKV
QGVTEFHVPEEKVEAGTKKGVFGMLFSAKYVKGTVLLWVTYFMGLVMIYLLTSWLPTLMRETGASLERAAFLGGLFQFGG
VLSALFIGWAMDRFNPNRIIAGFYLAAGIFAVIVGQSLSNPTLLALFILCAGIAVNGAQSSMPVLSARFYPTQCRATGVA
WMSGIGRFGAVFGAWIGAVLLGNNWSFTMILSMLIIPAAAAAIAIFVKSLVAHTDAT
>Q51955 ~~~pcaK~~~4-hydroxybenzoate transporter PcaK~~~COG2814
MNQAQNSVGKSLDVQSFINQQPLSRYQWRVVLLCFLIVFLDGLDTAAMGFIAPALSQEWGIDRASLGPVMSAALIGMVFG
ALGSGPLADRFGRKGVLVGAVLVFGGFSLASAYATNVDQLLVLRFLTGLGLGAGMPNATTLLSEYTPERLKSLLVTSMFC
GFNLGMAGGGFISAKMIPAYGWHSLLVIGGVLPLLLALVLMVWLPESARFLVVRNRGTDKIRKTLSPIAPQVVAEAGSFS
VPEQKAVAARSVFAVIFSGTYGLGTMLLWLTYFMGLVIVYLLTSWLPTLMRDSGASMEQAAFIGALFQFGGVLSAVGVGW
AMDRYNPHKVIGIFYLLAGVFAYAVGQSLGNITVLATLVLIAGMCVNGAQSAMPSLAARFYPTQGRATGVSWMLGIGRFG
AILGAWSGATLLGLGWNFEQVLTALLVPAALATVGVIVKGLVSHADAT
>Q52154 ~~~pcaR~~~Pca regulon regulatory protein~~~COG1414
MSDETLVNDPVNPEPARPASAAMAPPIVASPAKRIQAFTGDPDFMTSLARGLAVIQAFQERKRHLTIAQISHRTEIPRAA
VRRCLHTLIKLGYATSDGRTYSLLPKVLTLGHAYLSSTPLAISAQPYLDRISDQLHEAANMATLEGDDILYIARSATVER
LISVDLSVGGRLPAYCTSMGRILLAAMDDTSLREYLGRADLKARTSRTLHDPESLFACIQQVRAQGWCVVDQELEQGLRS
IAVPIYDASGQVLAALNVSTHVGRVTRSELEQRFLPILLAASRDLCHQLFG
>O83046 ~~~pcaU~~~Pca operon regulatory protein~~~COG1414
MWSNMDDKKVKEEKILHNSTNKKIIRHEDFVAGISKGMAILDSFGTDRHRLNITMAAEKTGMTRAAARRHLLTLEYLGYL
ESDGHYFYLTPKILKFSGSYLGGAQLPKISQPLLNLLTTQTSLIYSVMVLDGYEAITIARSAAHQQTDRVNPYGLHLGNR
LPAHATSAGKILLAYLDDHAQQEWLNQYPLQRLTKYTYTNNIDFLRLLSEIKEQGWCYSSEEHELGVHALAVPIYGQQSR
VVAALNIVSPTMRTTKEYLIQHILPLLQETARELRNIL
>A5W2C8 ~~~pcaY~~~Methyl-accepting chemotaxis protein PcaY~~~COG0840
MLANLKIRTGMFWVLSLFSLTLLFSTASAWWAAVGSDQQITELDQTAHQSDRLNNALLMAIRSSANVSSGFIEQLGGHDE
SAGKRMALSVELNNKSQTLVDEFVENAREPALRVLATELQATFAEYAKAVAGQREATRQRSLEQYFKVNSDAGNAMGRLQ
TLRQQLVTTLSERGQQIMLESDRRLARAQLLSLCLLGMTVVLAVLCWAFIAQRVLHPLREAGGHFRRIASGDLSVPVQGQ
GNNEIGQLFHELQRMQQSQRDTLGQINNCARQLDAAASALNAVTEESANNLRQQGQELEQAATAVTEMTTAVEEVARNAI
TTSQTTSESNQLAAQSRRQVSENIDGTEAMTREIQTSSAHLQQLVGQVRDIGKVLEVIRSVSEQTNLLALNAAIEAARAG
EAGRGFAVVADEVRTLAYRTQQSTQEIEQMIGSVQAGTEAAVASMQASTNRAQSTLDVTLASGQVLEGIYSAIGEINERN
LVIASAAEEQAQVAREVDRNLLNIRELSNHSAAGAQQTSEASKALSGLVGEMTALVGRFKV
>Q88JK6 ~~~pcaY~~~Methyl-accepting chemotaxis protein PcaY~~~COG0840
MVPTRSTARMLANLKIRTGMFWVLSLFSLTLLFSTASAWWAALGSDQQITELDQTAHQSDRLNNALLMAIRSSANVSSGF
IEQLGGHDESAGKRMALSVELNNKSQALVDEFVENAREPALRGLATELQATFAEYAKAVAGQREATRQRSLEQYFKVNSD
AGNAMGRLQTLRQQLVTTLSERGQQIMLESDRRLARAQLLSLCLLGVTVVLAVLCWAFIAQRVLHPLREAGGHFRRIASG
DLSVPVQGQGNNEIGQLFHELQRMQQSQRDTLGQINNCARQLDAAATALNAVTEESANNLRQQGQELEQAATAVTEMTTA
VEEVARNAITTSQTTSESNQLAAQSRRQVSENIDGTEAMTREIQTSSAHLQQLVGQVRDIGKVLEVIRSVSEQTNLLALN
AAIEAARAGEAGRGFAVVADEVRTLAYRTQQSTQEIEQMIGSVQAGTEAAVASMQASTNRAQSTLDVTLASGQVLEGIYS
AIGEINERNLVIASAAEEQAQVAREVDRNLLNIRELSNHSAAGAQQTSEASKALSGLVGEMTALVGRFKV
>P95503 ~~~pcbA~~~Chlorophyll a/b light-harvesting protein PcbA~~~
MATTATPEYGWWAGNSRFALQSGKWLSAHIAQYALITFWAGGITLFELARYNPDVSMGEQGLILIPHLATLGWGIGSGGQ
VVDTYPYFVIGVIHLVASAVFGAGALYHALKGPEDLSQSDFEFAKNFHFEWDDAAKLGNILGHHLLTLGYAALLFVIWLR
FHGVYDSTIGEVRVVTNPGATILSVLFEYGWFTPDHNPYFVNNLEDLASGHAYIAVVLLAGGFWHINQAPFPWAQRLLAS
LFSPEGLLSASLAGLSMAGFAAAYFSAVNTLAYPVEFFGPPLELKFSVAPYFVDTIDLPNGAHTARAWLCNVHFFLAFFV
LQGHLWHALRALGFDFKRIPQALGSLSGEA
>Q7VCF7 ~~~pcbA~~~Divinyl chlorophyll a/b light-harvesting protein PcbA~~~
MQTYGNPDVTYGWWAGNSGVTNRSGKFIAAHAAHTGLIAFWAGAFTLFELARFDPSVPMGHQPLIALPHLATLGIGFDEA
GTFVGGTTVTAIAIVHLVLSMVYGAGGLLHSLTFPGDMQDSEVLQARKFKLEWDNPDNQTFILGHHLIFLGVANIQFVEW
ARIHGIWDAAAGSIRQVEYNLNLSSIWNHQFDFLTINNLEDVMGGHAFLAFFMITGGAFHIATKQVGEYTKFKGSGLLSA
EAILSWSLAGIGWMAIVAAFWCATNTTVYPVDFFGEVLDLKFGIAPYWVDTVDLPNGAHTSRAWLTNVHYFLGFFYIQGH
LWHALRAMGFDFKRVSSAVSNIGTASVTLND
>Q7V872 ~~~pcbA~~~Divinyl chlorophyll a/b light-harvesting protein PcbA~~~
MQTYGKTDVTYAWYAGNSGVTNRSGRFIASHIGHTGLICFGAGANTLFELARYDSALPIGDQGFVVLPHLAGLGIGGIEN
GVITDSYGMLVVAVFHLIFSAVYAGGAMLHSFRYKEDLGEYPQGSRPNKFDFKWDDPDRLTFILGHHLLFLGLGCVQFVE
WAKYHGIYDPAMGVVRKVEYNLDLSMVWNHQIDFLTINSLEDVMGGHAFLAFFLSAGAIWHIFSKPFGEYTEFKGKGLLS
AEFVLSTSLAGAAFIAFVAAFWASMNTTIYPTDLYGGPLNIELNFAPYFSDTDPLFGGDLHSARSWLSNFHFYLGFFYLQ
GHFWHGLRAMGFDFKRVEKLFDQLESNEISLNPAKSTTVPSTSTDSAT
>O07295 ~~~pcbA~~~Divinyl chlorophyll a/b light-harvesting protein PcbA~~~
MQTYGNPDTTYGWWAGNSGVANRSGKFIAAHVAHAGLIVFWAGAFTLFELSRFDPSVPMGQQPLIALPHLATLGIGFDAD
GVLMGDTKPVLAIAIVHLVSSMVLAAGGLLHSLLLPGNLEESEVAKARKFNIEWDNPDKLTFILGHHLIILGFAVILLVE
WARVHGVYDPAIGAVRQVEYDLNLAEIWNHQTDFLLIDDLEDVMGGHAFLAFVLITGGAWHIATKQVGEYTKFKGKGLLS
AEAVLSWSLAGIGWMAIIAAFWSASNTTVYPVEFFGEPLELKFSISPYWIDTVDLPDGVYTSRAWLANVHYYFGFFFIQG
HLWHALRALGFDFKRVTNAISNIDSATVTLKD
>Q9L8M5 ~~~pcbB~~~Divinyl chlorophyll a/b light-harvesting protein PcbB~~~
MQTYGNPNVTYGWWAGNSGVTNRSGKFIAAHAAHTGLIAFGCGAATLVELAGFDPSLPMGHQSSLFLAHLASVGIGFDDA
GVWTGVGVANIAILHLILSMVYGGGGLLHSVYFTGDMQDSQVPQARKFKLEWDNPDNQTFILGHHLLFFGVANIWFVEWA
RIHGIYDPAIEAIRQVNYNLDLTQIWNHQFDFLAIDSLEDVMGGHAFLAFFQLGGGAFHIATKQIGTYTKFKGKELLSAE
AILSWSLAGIGWMACVAAFWAATNTTVYPEAWYGEVLQIKFGVSPYWIDTVPGGTAFLGHTTRAALVNVHYYLGFFFIQG
HLWHALRAMGFDFKRLLDKTGPFGIPRTL
>Q7V6U4 ~~~pcbB~~~Divinyl chlorophyll a/b light-harvesting protein PcbB~~~
MQTYGNPNPTYGWWAGNAGTTNRSGKFLAAHIAHTGLMAFWAGSFTLFELSRYDPSVPMGHQPLVALPHLATLGIGVGDG
GVITDTYPIVVTAVLHLVLSMVYAAGGLMHSLLFNGDIGEMGVKWARKFDFKWDDPDKLTFILGHHLFLLGLGNVQFVEW
AKYYGLYDNAEGVVRTVVPNLNIGMVWNAQFNFLAINSLEDVMGGHAFLALFMMSGGLWHIVTKQAGEYTTFKGKGILSA
EAQLSWALAGVGWMALVAAFWCASNTTIYPDTFFGEVLDLKFSISPYWVDTANLPEGTYTSRAWLTNIHYYLGFFYIQGH
LWHALRALGFDFKRVSNAIGNADSATITLN
>P95505 ~~~pcbC~~~Chlorophyll a/b light-harvesting protein PcbC~~~
MEECSCDNRFRRGNNEPAGFSLDEQWWAGNIRLVDLSGQLLGAHIAHAGLIAFWAGSITVLEVARYVPDVPFYEQGLGLL
PHLATLGFGIGPDGTVVDTYPYFVIGILHLVTSAVLGAGGLFHTFKGPAILAEGGALAPKFHYDWGDTKQLSLILGHHLL
LLGILCLAFVAKAMFWGGVYDASLGTVHTVSPNLNPADIFGYVFGFNHGQFNGLGMSSVDNLPDIIGGHVYIGILELIGG
TWHILTKPFAIGAKPFSFSGEAILSYSLGAVGWMGLLSGFFVRYCDAAYPPQFYGPERSGAAAVQYILGVLLLVGHVWHA
TRARAGGEPVPYTPPAPQRGRFGMTRVAPAPARTFIGRGKPQPEPPKKKGLFGRG
>Q7VC57 ~~~pcbC~~~Divinyl chlorophyll a/b light-harvesting protein PcbC~~~
MQTYGNPNVTYAWYAGNSGTTNRSGKFIAAHAAHAGLMMFWAGAFTLFELARYDSSIPMGNQNLICLPHLAGLGIGGVSN
GVITEPYGCTVIAVLHLIFSGVLGAGGLLHSMRYEGDLGNYPEGSRAKKFDFEWDDPDRLTFILGHHLIFLGLGNIQFVE
WARIHGIYDSAQGITRTVNYNLDLGMIWNHQADFLTINSLEDVMGGHAFLAFFLIIGGAFHIATKQYGQYTEFKGKGLLS
AESVLSYSLAGVAYCAFVAAFWCATNTTIYPTDLYGEVLSLKFEFAPYFVDTADLPADAHTARAWLSNVHFYLGFFFLQG
HLWHALRGMGFDFKRVGKAFDNMESAKITAG
>Q7VBC8 ~~~pcbD~~~Divinyl chlorophyll a/b light-harvesting protein PcbD~~~
MQTYGNPEVTYGWWAGNSVVTNRSGRFIASHVGHTGLICFAAGGSTLWELARYNPEIPMGHQSSLFLAHLASIGIGFDEA
GAWTGVGVATIAIVHLILSMVYGGGGLLHGILFDENVEDSEVLQAKKFKLEWNNPDNQTFILGHHLIFMGVACAWFVEWA
RIHGIYDPALGAIRQVNYNLDLSMIWQRQFDFITIDSLEDVMGGHAFLAFAEITGGAFHIVAGSTPWEDKKLGEWSKFKG
SELLSAEAVLSWSLAGIGWMAIVAAFWCASNTTVYPEAWYGEPLQFKFAISPYWVDTGDLSDATAFWGHSARAALTNVHY
YLGFFFLQGHFWHALRALGFNFKNVTASIGNEQKATFTIKS
>Q9L8M2 ~~~pcbE~~~Divinyl chlorophyll a/b light-harvesting protein PcbE~~~
MQTYGNPDPTYGWWVGNSVVTNKSSRFIGSHVAHTGLIAFTAGANTLWELARFNPDIPMGHQGMVSIPHLASLGIGFDQA
GAWTGQDVAFVGIFHLICSFVYALAGLLHSVIFSEDTQNSSGLFADGRPEHRQAARFKLEWDNPDNQTFILGHHLVFFGV
ANIWFVEWARVHGIYDPAIEAIRQVNYNLDLTQIWNHQFDFIQIDSLEDVMGGHAFLAFFQIGGGAFHIATKQIGTYTNF
KGAGLLSAEAVLSWSLAGIGWMAIIAAFWCATNTTVYPEAWYGETLQLKFGISPYWIDTGNMDGVVTGHTSRAWLSNVHY
YLGFFFIQGHLWHAIRAMGFDFRKVTSAVANLDNSRITLSD
>Q9L8M1 ~~~pcbF~~~Divinyl chlorophyll a/b light-harvesting protein PcbF~~~
MQTYGNPDVTYDYWAGNASVTNRSGRFIASHAAHTGMIAFGAGSNTLFELSRFDSSLPMGDQGFVFLPHLASVGIGFDEA
GVWTGAGVVTLAILHLILSMVYGAGGLLHAIYFPDDMQKSNVPQARKFKLEWDNPDNQTFILGHHLILFGLACAWFVEWA
RIHGIYDPAIGAVRQVNYNLDLSMIWERQVNFLNIDSLEDVMGGHAFLAFAEITGGCFHAIAGSTKWEDKRLGEYDRLKG
AGLLSAEAILSFSLAGIGWMAIVAAFWCSQNTTVYPIEFYGEPLNRAFVIAPAFVDSIDYSNGIAPLGHSGRCYTANFHY
IAGFFAFQGHLWHALRAMGYNFKDLRAKLNPSAA
>Q7VC50 ~~~pcbG~~~Divinyl chlorophyll a/b light-harvesting protein PcbG~~~
MQTYGDPNVSYAWYAANAGAVTNKSGRFISSHIAHTGLICFGAGANTLFELARYNPDLPMGSQGLVVLPHLAGLGLGGIS
NGVFTDTYQLLVVAILHLILSGVYGGGGMLHAFRYEEKLESYPATSRANKFKFDWNDPDRLTFILGHHLLFLAAGNIQFV
EWARVHGIYDPVAGAVRQVEYNLDLGMIWNHQFDFLSISSLEDIMGGHAFLAFFMAAGGVFHILTKNYGEYNSFKGADLL
SAEFVLSTSLAGAAYTAFVAALWCASNTTIYPVDLYGDVLQFKLGIAPYWIDTDSSLAADAHTGRAWLTNVHFFIGFFYL
QGHFFHGLRALGFDFKSIGKLFDNLETSETTLN
>Q7VBC2 ~~~pcbH~~~Divinyl chlorophyll a/b light-harvesting protein PcbH~~~
MQTYGNPDVTYGWWVGNSVVTNRAGRFIGSHVGHTGIICFATGASCLWELSRFDSSVPMGHQSSIYLSHLASLGIGFDEA
GVWTGAGVATIAIFHLIFSMVYGGAGLAHSLFFDPDLNEGPIGRVDKFKLEWDNPNNLTFILGHHLIFLGVANIWFVEWA
RVHGIYDPALGEVRTIFPGYGDFGMVWGHQFDFIKIDSLEDVMSGHAFLAFLQISGGAFHIATRQIGEYTKFKGDGLLSA
EAVLSWSLAGLFLMGVVAAFWAASNTTVYPTEWYGEPLEFKFGISPYWADTGDTSDCKYFFGHTSRAALVNVQYYFAFFC
LQGHLWHALRALGFDFRRIAKAIGGLTESTSS
>Q9F487 ~~~pcb~~~Chlorophyll a/b light-harvesting protein Pcb~~~
MAISTDKTFAANTNSPWLIGNARLIDLSGQLLGAHIAHAGLIMFWAGSITISEVTRFVPGIPMYEQQMTLLPHLATLGWG
VGAGGEVINTYPYFVIGILHLVASAVLGAGGLFHVFRSPAILYNSGGQVAKFHYEWNDPKKLGLILGHHLIILGFGAFLL
VLKAMFFGGIYDTHIENVRLITNPTFDPMTIFSYLVGIKDSHWTLLGIASVDNLEDVIGGHIWIGSILILGGIWHILVPP
FAWVRQILPIVNGEEILSYSLLGLALMAFISAVFVGYNDTVFPKEFYGENRIVIATIQWVLGILALVGYFWHSWRSRELN
SNLS
>Q6Q972 ~~~pcb~~~Chlorophyll a/b light-harvesting protein Pcb~~~
MGMQTYGNPDVEYGWWAGNSRLAGFSGKWLAAHVAQAALIVFWAGAICLFEVARYTADVPLGEQNLILIPHMASLGLGIG
EGGQIVDTFPYFAVGVVHLVSSAVIGAGGLYHSLRGPAILKEGPARAPKFDFDWGDGKRLGFILGHHLILLGLGALFLVL
WAVFFGIYDPVIGEVRTVTSPTLNPFTIFGYQTHFVETNTLEDLIGGHVYVAIIEISGGLWHIFCPPFKWAQRLIIYSGE
GLLAYALGGLAIMGFTAAVYCAFNTLAYPVEFYGPPLDFRFSFAPYFIDTADLPSGQYTARAWLCNVHFFLAFFVLQGHL
WHALRTLGFDFKRIPAALGSLSEDVVDAKA
>Q5LUF3 6.4.1.3~~~pccA~~~Propionyl-CoA carboxylase alpha chain~~~COG4770
MFNKILIANRGEIACRVIKTARKMGISTVAIYSDADKQALHVQMADEAVHIGPPPANQSYIVIDKVMAAIRATGAQAVHP
GYGFLSENSKFAEALEAEGVIFVGPPKGAIEAMGDKITSKKIAQEANVSTVPGYMGLIEDADEAVKISNQIGYPVMIKAS
AGGGGKGMRIAWNDQEAREGFQSSKNEAANSFGDDRIFIEKFVTQPRHIEIQVLCDSHGNGIYLGERECSIQRRNQKVVE
EAPSPFLDEATRRAMGEQAVALAKAVGYASAGTVEFIVDGQKNFYFLEMNTRLQVEHPVTELITGVDLVEQMIRVAAGEP
LSITQGDVKLTGWAIENRLYAEDPYRGFLPSIGRLTRYRPPAETAAGPLLVNGKWQGDAPSGEAAVRNDTGVYEGGEISM
YYDPMIAKLCTWAPTRAAAIEAMRIALDSFEVEGIGHNLPFLSAVMDHPKFISGDMTTAFIAEEYPEGFEGVNLPETDLR
RVAAAAAAMHRVAEIRRTRVSGRMDNHERRVGTEWVVTLQGADFPVTIAADHDGSTVSFDDGSSMRVTSDWTPGDQLANL
MVDGAPLVLKVGKISGGFRIRTRGADLKVHVRTPRQAELARLMPEKLPPDTSKMLLCPMPGLIVKVDVEVGQEVQEGQAL
CTIEAMKMENILRAEKKGVVAKINASAGNSLAVDDVIMEFE
>Q3J4E3 6.4.1.3~~~pccB~~~Propionyl-CoA carboxylase beta chain~~~COG4799
MKDILQELENRRAIARAGGGQRRVEAQHKRGKLTARERIELLLDEGSFEEFDMFVRHRCTDFGMQDDRPAGDGVVTGWGT
INGRMVYVFSQDFTVFGGSLSETHAQKICKIMDMAMQNGAPVIGLNDSGGARIQEGVASLAGYADVFQRNIMASGVIPQI
SVIMGPCAGGAVYSPAMTDFIFMVRDTSYMFVTGPDVVKTVTNEVVTAEELGGASTHTKKSSVADGAFENDVEALYEIRR
LVDFLPLSNRTPAPVRPFFDDVARIEDSLDTLIPDNPNQPYDMKELILKIADEADFYEIQKDFAANIITGFIRLEGQTVG
VVANQPMVLAGCLDIDSSRKAARFVRFCDAFNIPILTLVDVPGFLPGTGQEYGGVIKHGAKLLFAYGEATVPKVTVITRK
AYGGAYDVMASKHLRGDFNYAWPTAEIAVMGAKGATEILYRSELGDKEKIAARAKEYEDRFANPFVAAERGFIDEVIMPH
STRRRVSKAFASLRNKKLANPWKKHDNIPL
>Q168G2 6.4.1.3~~~pccB~~~Propionyl-CoA carboxylase beta chain~~~COG4799
MKDILEQLEDRRAAARLGGGQKRIDAQHGRGKLTARERVDLLLDEGSFEEFDMFVTHRCTDFNMQDQKPAGDGVVTGWGT
INGRVVYVFSQDFTVLGGSVSETHSKKICKIMDMAMQNGAPVIGINDSGGARIQEGVDSLAGYGEVFQRNIMASGVVPQI
SMIMGPCAGGAVYSPAMTDFIFMVKDSSYMFVTGPDVVKTVTNEQVSAEELGGATTHTRKSSVADAAFENDVEALAEVRR
LVDFLPLNNREKPPVRPFFDDPDRIEPSLDTLVPDNPNTPYDMKELIHKLADEGDFYEIQEEFAKNIITGFIRLEGRTVG
VVANQPLVLAGCLDIDSSRKAARFVRFCDAFEIPLLTLIDVPGFLPGTSQEYGGVIKHGAKLLYAYGEATVPMVTVITRK
AYGGAYVVMSSKHLRADFNYAWPTAEVAVMGAKGATEIIHRGDLGDPEKIAQHTADYEERFANPFVASERGFVDEVIQPR
STRKRVARAFASLRNKSVQMPWKKHDNIPL
>Q3J4E6 ~~~pccR~~~Propionyl-CoA carboxylase regulator~~~COG1396
MAQKLYAGAKLRELRVKLGLTQKVFAERLGASLPYLNQMENNHRPVSATVVLALAQEFGVDVTKLTTSEAERIVTDMREA
LADPVFTDSPPLADLRLVASNAPAFARAFLDLHRAYRQTHERLASLDEALGRDEADLRPSPWEEVRDFFHYCDNYLDAVD
RAAEHYAAPGGVRRDVFSAAMETLTRAGLDLQISDMPAIRSREGNALRLSARAAAPTQRFQLLHQVALLTQNDLLEATLD
LARFQTAEAREIAKIGLANYFAGAALLPYRPFLQAAAETRHDLERLADLFGASIEQVAHRLSTLQRPGAKGVPFFFVRVD
QAGTITKRHSATRFQFARFGGACPLWNVHRAFETPGRFLRQLAQTPDGVRYLLLARDVSKPGGSFTAPVRRYAIGLGCEV
QHADALVYADGLDLKGSFEPIGISCRICDRQECHQRSVPPLEKRLRVDPDRRGLLPYEIVD
>Q01470 3.1.1.-~~~pcd~~~Phenmedipham hydrolase~~~
MITRPIAHTTAGDLGGCLEDGLYVFRGVPYAEPPVGDLRWRAARPHAGWTGVRDASAYGPSAPQPVEPGGSPILGTHGDP
PFDEDCLTLNLWTPNLDGGSRPVLVWIHGGGLLTGSGNLPNYATDTFARDGDLVGISINYRLGPLGFLAGMGDENVWLTD
QVEALRWIADNVAAFGGDPNRITLVGQSGGAYSIAALAQHPVARQLFHRAILQSPPFGMQPHTVEESTARTKALARHLGH
DDIEALRHEPWERLIQGTIGVLMEHTKFGEWPLAFYPVFDEATIPRHPIESIIDSDIEIIIGWTRDEGTFPFAFDPQVSQ
ADRDQVESWLQKRFGDHAASAYEAHAGDGTSPWTVIANVVGDELFHSAGYRVADERATRRPVRAYQFDVVSPLSDGALGA
VHCIEMPFTFANLDRWTGKPFVDGLDPDVVARVTNVLHQAWIAFVRTGDPTHDQLPVWPTFRADDPAVLVVGDEGAEVAR
DLARPDHVSVRTL
>Q8GJ31 1.21.99.5~~~pceA~~~Tetrachloroethene reductive dehalogenase~~~
MGEINRRNFLKVSILGAAAAAVASASAVKGMVSPLVADAADIVAPITETSEFPYKVDAKYQRYNSLKNFFEKTFDPEANK
TPIKFHYDDVSKITGKKDTGKDLPTLNAERLGIKGRPATHTETSILFHTQHLGAMLTQRHNETGWTGLDEALNAGAWAVE
FDYSGFNATGGGPGSVIPLYPINPMTNEIANEPVMVPGLYNWDNIDVESVRQQGQQWKFESKEEASKIVKKATRLLGADL
VGIAPYDERWTYSTWGRKIYKPCKMPNGRTKYLPWDLPKMLSGGGVEVFGHAKFEPDWEKYAGFKPKSVIVFVLEEDYEA
IRTSPSVISSATVGKSYSNMAEVAYKIAVFLRKLGYYAAPCGNDTGISVPMAVQAGLGEAGRNGLLITQKFGPRHRIAKV
YTDLELAPDKPRKFGVREFCRLCKKCADACPAQAISHEKDPKVLQPEDCEVAENPYTEKWHLDSNRCGSFWAYNGSPCSN
CVAVCSWNKVETWNHDVARIATQIPLLQDAARKFDEWFGYNGPVNPDERLESGYVQNMVKDFWNNPESIKQ
>Q8GJ27 1.21.99.5~~~pceA~~~Tetrachloroethene reductive dehalogenase~~~
MGEINRRNFLKASMLGAAAAAVASASAVKGMVSPLVADAADIVAPITETSEFPYKVDAKYQRYNSLKNFFEKTFDPEANK
TPIKFHYDDVSKITGKKDTGKDLPTLNAERLGIKGRPATHTETSILFQTQHLGAMLTQRHNETGWTGLDEALNAGAWAVE
FDYSGFNAAGGGPGSVIPLYPINPMTNEIANEPVMVPGLYNWDNIDVESVRQQGQQWKFESKEEASKMVKKATRLLGADL
VGIAPYDERWTYSTWGRKILKPCKMPNGRTKYLPWDLPKMLSGGGVEVFGHAKFEPDWEKYAGFKPKSVIVFVLEEDYEA
IRTSPSVISSATVGKSYSNMAEVAYKIAVFLRKLGYYAAPCGNDTGLSVPMAVQAGLGEAGRNGLLITQKFGPRHRIAKV
YTDLELAPDKPRKFGVREFCRLCKKCADACPAQAISHEKDPKVLQPEDCEVAENPYTEKWHLDSNRCGSFWAYNGSPCAN
CVAVCSWNKVETWNHDVARIATQIPLLQDAARKFDEWFGYNGPVNPDERLESGYVQNMVKDFWNNPESIKQ
>Q848J2 1.21.99.5~~~pceA~~~Tetrachloroethene reductive dehalogenase~~~
MGEINRRNFLKASMLGAAAAAVASASVVKGVVSPLVADAADIVAPITETSEFPYKVDAKYQRYNSLKNFFEKTFDPEENK
TPIKFHYDDVSKITGKKDTGKDLPMLNAERLGIKGRPATHTETSILFHTQHLGAMLTQRHNETGWTGLDEALNAGAWAVE
FDYSGFNAAGGGPGSAIPLYPINPMTNEIANEPVMVPGLYNWDNIDVESVRQQGQQWKFESKEEASKILKKATRLLGADL
VGIAPYDERWTYSTWGRKIQKPCKMPNGRTKYLPWDLPKMLSGGGVEVFGHAKFEPDWEKYAGFKPKSVIVFVLEEDYEA
IRTSPSVISSATVGKSYSNMAEVAYKIAVFLRKLGYYAAPCGNDTGISVPMAVQAGLGEAGRNGLLITQKFGPRHRIAKV
YTDLELAPDKPRKFGVREFCRLCKKCADACPAQAISHEKDPKVLQPEDCEASENPYTEKWHVDSERCGSFWAYNGSPCSN
CVAVCSWNKVETWNHDVARVATQIPLLQDAARKFDEWFGYSGPVNPDERLESGYVQNMVKDFWNNPESIKQ
>Q8L172 1.21.99.5~~~pceA~~~Tetrachloroethene reductive dehalogenase~~~COG1600
MGEINRRNFLKVSILGAAAAAVASASAVKGMVSPLVADAADIVAPITETSEFPYKVDAKYQRYNSLKNFFEKTFDPEANK
TPIKFHYDDVSKITGKKDTGKDLPTLNAERLGIKGRPATHTETSILFHTQHLGAMLTQRHNETGWTGLDEALNAGAWAVE
FDYSGFNATGGGPGSVIPLYPINPMTNEIANEPVMVPGLYNWDNIDVESVRQQGQQWKFESKEEASKIVKKATRLLGADL
VGIAPYDERWTYSTWGRKIYKPCKMPNGRTKYLPWDLPKMLSGGGVEVFGHAKFEPDWEKYAGFKPKSVIVFVLEEDYEA
IRTSPSVISSATVGKSYSNMAEVAYKIAVFLRKLGYYAAPCGNDTGISVPMAVQAGLGEAGRNGLLITQKFGPRHRIAKV
YTDLELAPDKPRKFGVREFCRLCKKCADACPAQAISHEKDPKVLQPEDCEVAENPYTEKWHLDSNRCGSFWAYNGSPCSN
CVAVCSWNKVETWNHDVARVATQIPLLQDAARKFDEWFGYNGPVNPDERLESGYVQNMVKDFWNNPESIKQ
>O68252 1.21.99.5~~~pceA~~~Tetrachloroethene reductive dehalogenase~~~
MEKKKKPELSRRDFGKLIIGGGAAATIAPFGVPGANAAEKEKNAAEIRQQFAMTAGSPIIVNDKLERYAEVRTAFTHPTS
FFKPNYKGEVKPWFLSAYDEKVRQIENGENGPKMKAKNVGEARAGRALEAAGWTLDINYGNIYPNRFFMLWSGETMTNTQ
LWAPVGLDRRPPDTTDPVELTNYVKFAARMAGADLVGVARLNRNWVYSEAVTIPADVPYEQSLHKEIEKPIVFKDVPLPI
ETDDELIIPNTCENVIVAGIAMNREMMQTAPNSMACATTAFCYSRMCMFDMWLCQFIRYMGYYAIPSCNGVGQSVAFAVE
AGLGQASRMGACITPEFGPNVRLTKVFTNMPLVPDKPIDFGVTEFCETCKKCARECPSKAITEGPRTFEGRSIHNQSGKL
QWQNDYNKCLGYWPESGGYCGVCVAVCPFTKGNIWIHDGVEWLIDNTRFLDPLMLGMDDALGYGAKRNITEVWDGKINTY
GLDADHFRDTVSFRKDRVKKS
>Q8GJ30 ~~~pceB~~~Probable tetrachloroethene reductive dehalogenase membrane anchor protein~~~
MNIYDVLIWMALGMTALLIQYGIWRYLKGKGKDTIPLQICGFLANFFFIFALAWGYSSFSEREYQAIGMGFIFFGGTALI
PAIITYRLANHPAKKIRESSDSISA
>Q8L171 ~~~pceB~~~Probable tetrachloroethene reductive dehalogenase membrane anchor protein~~~
MNIYDVLIWMALGMTALLIQYGIWRYLKGKGKDTIPLQICGFLANFFFIFALAWGYSSFSEREYQAIGMGFIFFGGTALI
PAIITYRLANHPAKKIRESSDSISA
>Q51508 5.4.4.2~~~pchA~~~Salicylate biosynthesis isochorismate synthase~~~
MSRLAPLSQCLHALRGTFERAIGQAQALDRPVLVAASFEIDPLDPLQVFGAWDDRQTPCLYWEQPELAFFAWGCALELQG
HGEQRFARIEENWQLLCADAVVEGPLAPRLCGGFRFDPRGPREEHWQAFADASLMLAGITVLREGERYRVLCQHLAKPGE
DALALAAYHCSALLRLRQPARRRPSGPTAGAQGDASAQERRQWEAKVSDAVSSVRQGRFGKVVLARTQARPLGDIEPWQV
IEHLRLQHADAQLFACRRGNACFLGASPERLVRIRAGEALTHALAGTIARGGDAQEDARLGQALLDSAKDRHEHQLVVEA
IRTALEPFSEVLEIPDAPGLKRLARVQHLNTPIRARLADAGGILRLLQALHPTPAVGGYPRSAALDYIRQHEGMDRGWYA
APLGWLDGEGNGDFLVALRSALLTPGRGYLFAGCGLVGDSEPAHEYRETCLKLSAMREALSAIGGLDEVPLQRGVA
>Q59702 1.2.1.96~~~pchA~~~4-hydroxybenzaldehyde dehydrogenase (NADP(+))~~~
MSQRLAAYENMSLQLIAGEWRVGKAGRDLDVLDPFTQEKLLQIPLANREDLDEAYRSARQAQVAWAACGPSERAQVMLNA
VRIFDERRDEIIDWIIRESGSTRIKAQIEWGAARAITQESASLPSRVHGRILASDVPGKESRVYREPLGVIGIISPWNFP
LHLTARSLAPALALGNACVIKPASDTPVTGGLLLAHIFEEAGLPKGVLSVVVGSGSEIGDAFVEHEVPGFISFTGSTQVG
RNIGRIAAGGEHLKHVALELGGNSPFVVLADADLDQAVNAAVVGKFLHQGQICMAINRIIVEDSVYDEFVNRYAERVKSL
PYGDPSKPETVVGPVINAKQLAGLQDKIATAKSEGARVMVEGEAQGNVLPPHVFADVTADMEIAREEIFGPLVGIQRARD
EAHALELANSSEYGLSSAVFTSSLERGVKFARGIRAGMTHINDIPVNDEPNAPFGGEKNSGLGRFNGDWAIEEFTTDHWI
TVQHAPRRYPF
>Q51507 4.2.99.21~~~pchB~~~Isochorismate pyruvate lyase~~~
MKTPEDCTGLADIREAIDRIDLDIVQALGRRMDYVKAASRFKASEAAIPAPERVAAMLPERARWAEENGLDAPFVEGLFA
QIIHWYIAEQIKYWRQTRGAA
>A0A0H2ZF83 6.2.1.61~~~pchD~~~Pyochelin synthase PchD~~~
MTSSPATPSAVDDAPDWPAAFVRRYLDAGHWQDQNFAEALAASAARHPRRIALCDDDQRLSYADLLQRCRRLAAGLRQAG
LAHGDTVVLHLPNGIAFVETCFALFQLGVRPVLALPAHRQHEISGFCRFAEAKAYIGAERIDGFDPRPMARELLASGACR
MALIHGEAEAPLQALAPLYQADALEDCAARAEDIACFQLSGGTTGTPKLIPRRHREYLYNVRASAEVCGFDEHTVYLTGL
PMAHNFTLCCPGVIGTLLAGGRVVVSQRADPEHCFALIARERVTHTALVPPLAMLWLDAQESRRADLSSLRLLQVGGSRL
GSSAAQRVEPVLGCQLQQVLGMAEGLICYTRLDDPPERVLHTQGRPLSPDDEVRVVDAEGREVGPGEVGELTVRGPYTIR
GYYRLPEHNAKAFSADGFYRTGDRVSRDKDGYLVVEGRDKDQINRGGEKIAAEEVENLLIAHPQVHDATVVAMPDSLLGE
RTCAFVIPRQPAPSALKLKQYLHACGLAAFKVPDRIELVPAFPQTGIGKISKKDLRERLRRELEARA
>Q9HWG3 6.2.1.61~~~pchD~~~Pyochelin synthase PchD~~~
MTSSPVTPSAVDDAPDWPAAFVRRYLDAGHWQDQSFAEALATSAARHPRRIALCDDDQRLSYADLLQRCRRLAAGLRQAG
LAHGDTVVLHLPNGIAFVETCFALFQLGVRPVLALPAHRQHEISGFCRFAEAKAYIGAERIDGFDPRPMARELLASGACR
MALIHGEAEAPLQALAPLYQADALEDCAARAEDIACFQLSGGTTGTPKLIPRRHREYLYNVRASAEVCGFDEHTVYLTGL
PMAHNFTLCCPGVIGTLLASGRVVVSQRADPEHCFALIARERVTHTALVPPLAMLWLDAQESRRADLSSLRLLQVGGSRL
GSSAAQRVEPVLGCQLQQVLGMAEGLICYTRLDDPPERVLHTQGRPLSPDDEVRVVDAEGREVGPGEVGELTVRGPYTIR
GYYRLPEHNAKAFSADGFYRTGDRVSRDKDGYLVVEGRDKDQINRGGEKIAAEEVENLLIAHPQVHDATVVAMPDSLLGE
RTCAFVIPRQPAPSALKLKQYLHACGLAAFKVPDRIELVPAFPQTGIGKISKKDLRERLRRELEARA
>A0A0H2ZGB9 6.2.1.69~~~pchE~~~Pyochelin synthase PchE~~~
MDLPPDSRTALRDWLTEQLADLLGEPLADVRALADDDDLLGCGLDSIRLMYLQERLRARGSTLDFAQLAQRPCLGAWLDL
LACADRLSAPATVALPTVQDRDQPFELSSVQQAYWLGRGAGEVLGNVSCHAFLEFRTRDVDPQRLAAAAECVRQRHPMLR
ARFFDGRQQILPTPPLSCFDLQDWRTLQVDEAERDWQALRDWRAHECLAVERGQVFLLGLVRMPGGEDRLWLSLDLLAAD
VESLRLLLAELGVAYLAPERLAEPPALHFADYLARRAAQRAEAAARARDYWLERLPRLPDAPALPLACAPESIRQPRTRR
LAFQLSAGESRRLERLAAQHGVTLSSVFGCAFALVLARWSESAEFLLNVPLFDRHADDPRIGEVIADFTTLLLLECRMQA
GVSFAEAVKSFQRNLHGAIDHAAFPALEVLREARRQGQPRSAPVVFASNLGEEGFVPAAFRDAFGDLHDMLSQTPQVWLD
HQLYRVGDGILLAWDSVVGLFPEGLPETMFEAYVGLLQRLCDSTWEQPADLPLPWAQQARRALLNGQPACATARTLHRDF
FLRAAEAPDADALLYRDQRVTRGELAERALRIAGGLREAGVRPGDAVEVSLPRGPQQVAAVFGVLAAGACYVPLDIDQPP
ARRRLIEEAAGVCLAITEEDDPQALPPRLDVQRLLRGPALAAPVPLAPQASAYVIYTSGSTGVPKGVEVSHAAAINTIDA
LLDLLRVDAADRLLAVSALDFDLSVFDLFGGLGAGASLVLPAQEQARDAAAWAEAIQRHAVSLWNSAPALLEMALSLPAS
QADYRSLRAVLLSGDWVALDLPGRLRPRCAEGCRLHVLGGATEAGIWSNLQSVDTVPPHWRSIPYGRPLPGQAYRVVDAH
GRDVPDLVVGELWIGGASLARGYRNDPELSARRFVHDAQGRWYRTGDRGRYWDDGTLEFLGRVDQQVKVRGQRIELGEVE
AALCAQAGVESACAAVLGGGVASLGAVLVPRLAPRAEGSMELPAAQPFAGLAEAEAVLTREILGALLEAPLELDDGLRRR
WLDWLADSAASALPPLDEALRRLGWQAAGLTAMGNALRGLLAGEQAPAALLLDPWLAPQAVAARLPDGREALARLLEALP
TPAAGERLRVAVLDTRAGLWLDQGMASLLRPGLELTLFERSRVLLDAAATRLPERIVVQALDDGLLPAEHLGRYDRVISF
AALHAYAASREGLALAAALLRPQGRLLLVDLLCESPLALLGAALLDDRPLRLAELPSLLADLAAAGLAPRCLWRSERIAL
VEALAPGLGLDAAALQAGLEQRLPQAMRPERLWCLPSLPLNGNGKVDRRRLAESMTRALGECRDEPSAEEPLEAHEQALA
ECWEAVLKRPVRRREASFFSLGGDSLLATRLLAGIRERFGVRLGMADFYRQPTLAGLARHLQAQTVEIEETQLEEGVL
>G3XCV2 6.2.1.69~~~pchE~~~Pyochelin synthase PchE~~~
MDLPPDSRTALRDWLTEQLADLLGEPLADVRALADDDDLLGCGLDSIRLMYLQERLRARGSTLDFAQLAQRPCLGAWLDL
LACADRLSAPATVALPTAQDRDQPFELSSVQQAYWLGRGAGEVLGNVSCHAFLEFRTRDVDPQRLAAAAECVRQRHPMLR
ARFLDGRQQILPTPPLSCFDLQDWRTLQVDEAERDWQALRDWRAHECLAVERGQVFLLGLVRMPGGEDRLWLSLDLLAAD
VESLRLLLAELGVAYLAPERLAEPPALHFADYLAHRAAQRAEAAARARDYWLERLPRLPDAPALPLACAPESIRQPRTRR
LAFQLSAGESRRLERLAAQHGVTLSSVFGCAFALVLARWSESAEFLLNVPLFDRHADDPRIGEVIADFTTLLLLECRMQA
GVSFAEAVKSFQRNLHGAIDHAAFPALEVLREARRQGQPRSAPVVFASNLGEEGFVPAAFRDAFGDLHDMLSQTPQVWLD
HQLYRVGDGILLAWDSVVGLFPEGLPETMFEAYVGLLQRLCDSAWGQPADLPLPWAQQARRALLNGQPACATARTLHRDF
FLRAAEAPDADALLYRDQRVTRGELAERALRIAGGLREAGVRPGDAVEVSLPRGPQQVAAVFGVLAAGACYVPLDIDQPP
ARRRLIEEAAGVCLAITEEDDPQALPPRLDVQRLLRGPALAAPVPLAPQASAYVIYTSGSTGVPKGVEVSHAAAINTIDA
LLDLLRVNASDRLLAVSALDFDLSVFDLFGGLGAGASLVLPAQEQARDAAAWAEAIQRHAVSLWNSAPALLEMALSLPAS
QADYRSLRAVLLSGDWVALDLPGRLRPRCAEGCRLHVLGGATEAGIWSNLQSVDTVPPHWRSIPYGRPLPGQAYRVVDTH
GRDVPDLVVGELWIGGASLARGYRNDPELSARRFVHDAQGRWYRTGDRGRYWGDGTLEFLGRVDQQVKVRGQRIELGEVE
AALCAQAGVESACAAVLGGGVASLGAVLVPRLAPRAEGSMDLPAAQPFAGLAEAEAVLTREILGALLEAPLELDDGLRRR
WLDWLADSAASALPSLDEALRRLGWQAAGLTAMGNALRGLLAGEQAPAALLLDPWLAPQAVAARLPDGREALARLLEALP
TPAAGERLRVAVLDTRAGLWLDQGMASLLRPGLELTLFERSRVLLDAAATRLPERIVVQALDDGLLPAEHLGRYDRVISF
AALHAYEASREGLALAAALLRPQGRLLLVDLLCESPLALLGAALLDDRPLRLAELPSLLADLAAAGLAPRCLWRSERIAL
VEALAPGLGLDAAALQAGLEQRLPQAMRPERLWCLPSLPLNGNGKVDRRRLAESMTRALGECRHEPSAEEPLEAHEQALA
ECWEAVLKRPVRRREASFFSLGGDSLLATRLLAGIRERFGVRLGMADFYRQPTLAGLARHLQVQTVEIEETQLEEGVL
>A0A0H2ZGJ4 6.2.1.69~~~pchF~~~Pyochelin synthase PchF~~~
MSLGELLETCRSRRIELWSEAGRLRYRAPQGALDAGLAERLRAEREALLEHLEGGPGWRAEPDLAHQRFPLTPVQAAYVL
GRQAAFDYGGNACQLYAEYDWPADTDPARLEAAWNAMVERHPMLRAVIEDNAWQRVLPEVPWQRLTVHACAGLDEAAFQA
HLERVRERLDHACAALDQWPVLRPELSIGRDDCVLHCSVDFTLVDYASLQLLLGEWRRRYLDPQWTAEPLEATFRDYVGV
EQRRRQSPAWQRDRDWWLARLDALPGRPDLPLRAQPDTRSTRFRHFHARLDEAAWQALGARAGEHGLSAAGVALAAFAET
IGRWSQAPAFCLNLTVLNRPPLHPQLAQVLGDFTALSLLAVDSRHGDSFVERARRIGEQMFDDLDHPTFSGVDLLRELAR
RRGRGADLMPVVFTSGIGSVQRLLGDGEAPRAPRYMISQTPQVWLDCQVTDQFGGLEIGWDVRLGLFPEGQAEAMFDDFV
GLLRRLAQSPRAWTDGDATEPVEAPPQALPGSARSIAAGFAERALLTPDATVIHDAAGSYSYRQVAQHASALRRVLEAHG
AGRGRRVAVMLPKSAAQLVAVIGILQAGAAYVPVDIRQPPLRRQAILASPEVVAWVCLESDVPNAGCACVAIDRLAADSA
WPPPPAAEVAADDLAYVIYTSGSTGTPKGVMLSHAAVSNTLLDINQRYGVDANDRVLGLAELSFDLSVYDFFGATAAGAQ
VVLPDPARGSDPSHWAELLERHAITLWNSVPAQGQMLIDYLESEPQRHLPGPRCVLWSGDWIPVSLPTRWWRRWPDSALF
SLGGATEAAIWSIEQPIRPQHTELASIPYGRALRGQSVEVLDARGRRCPPGVRGEIHIGGVGLALGYAGDPQRTAERFVR
HPDGRRLYRTGDLGRYLADGSIEFLGREDDQVKIRGHRIELAELDAALCAHPQVNLAATVVLGETHERSLASFVTLHAPA
EAGEDPRTALDTVRQRAAQALRRDWGSEEGIAAAVAALDRACLASLAAWLAGSGLFASATPLDFATLCQRLGIAEARQRL
LRHWLRQLEEGGYLRAEGEGWLGCAERPAQSPEDAWTAFAGCAPAALWPAELVAYLRDSAQSLGEQLAGRISPAALMFPQ
GSARIAEAMYSQGLHAQALHEAMAEAIAAIVERQPQRRWRLLELGAGTAAASRAVIARLAPLVQRGTEVDYLFTDVSSYF
LAAARERFADQPWVRFGRFDMNGDLLDQGVAPHSVDILLSSGALNNALDTPALLAGLRELLSADAWLVIQELTREHNEIS
VSQSLMMENPRDLRDERRQLFVHTGQWLEWLAAQGGDLACGVVPPGSALDLLGYDVLLARCKTDRARLEPAELLAFVEAR
VPRYMLPAQLRVLERLPVTGNGKIDRKALTGFARQPQADLRHGVAQAPADELESALLALWREVLDNPSLGVEQDFFGAGG
DSLLIAQLIARLRERLESARRHPFDRLLRWALSQPTPRGLAERLRSAPEEGRGPALAAARGVAPAQTGMSRAPLAEGAVA
LDPLVRLVPGEGVPRVLVHEGLGTLLPYRPLLRALGEGRPLLGLAVHDSDAYLAIPAEYLNACLGRRYAEALHGAGLREV
DLLGYCSGGLVALETAKSLLQRGVRVRQLDIVSSYRIPYRVDDERLLLFSFAATLGLDTAALGFPAPERLGQAVQAALAQ
TPERLGAEALAGLPGLADLVALRGRVLQAASGSADAASVERDTLYRLFCHSVRASQAEALEPYVGALRLFVPDAGNPLVP
RYAEALETQWRAAALGACGIHEVPGGHFDCLGEALAQSLSKPMPEEASQ
>Q9HWG4 6.2.1.69~~~pchF~~~Pyochelin synthase PchF~~~
MSLGELLETCRSRRIELWSEAGRLRYRAPQGALDAGLAERLRAEREALLEHLEGGPGWRAEPDMAHQRFPLTPVQAAYVL
GRQAAFDYGGNACQLYAEYDWPADTDPARLEAAWNAMVERHPMLRAVIEDNAWQRVLPEVPWQRLTVHACAGLDEAAFQA
HLERVRERLDHACAALDQWPVLRPELSIGRDACVLHCSVDFTLVDYASLQLLLGEWRRRYLDPQWTAEPLEATFRDYVGV
EQRRRQSPAWQRDRDWWLARLDALPGRPDLPLRVQPDTRSTRFRHFHARLDEAAWQALGARAGEHGLSAAGVALAAFAET
IGRWSQAPAFCLNLTVLNRPPLHPQLAQVLGDFTALSLLAVDSRHGDSFVERARRIGEQMFDDLDHPTFSGVDLLRELAR
RRGRGADLMPVVFTSGIGSVQRLLGDGEAPRAPRYMISQTPQVWLDCQVTDQFGGLEIGWDVRLGLFPEGQAEAMFDDFV
GLLRRLAQSPRAWTDGDATEPVEAPPQALPGSARSIAAGFAERALLTPDATAIHDAAGSYSYRQVAQHASALRRVLEAHG
AGRGRRVAVMLPKSAAQLVAVIGILQAGAAYVPVDIRQPPLRRQAILASAEVVALVCLESDVPDVGCACVAIDRLAADSA
WPPPPAAEVAADDLAYVIYTSGSTGTPKGVMLSHAAVSNTLLDINQRYGVDANDRVLGLAELSFDLSVYDFFGATAAGAQ
VVLPDPARGSDPSHWAELLERHAITLWNSVPAQGQMLIDYLESEPQRHLPGPRCVLWSGDWIPVSLPTRWWRRWPDSALF
SLGGATEAAIWSIEQPIRPQHTELASIPYGRALRGQSVEVLDARGRRCPPGVRGEIHIGGVGLALGYAGDPQRTAERFVR
HPDGRRLYRTGDLGRYLADGSIEFLGREDDQVKIRGHRIELAELDAALCAHPQVNLAATVVLGETHERSLASFVTLHAPV
EAGEDPRTALDAVRQRAAQALRRDWGSEEGIAAAVAALDRACLASLAAWLAGSGLFASATPLDLATLCQRLGIAEARQRL
LRHWLRQLEEGGYLRAEGEGWLGCAERPAQSPEDAWTAFAGCAPAALWPAELVAYLRDSAQSLGEQLAGRISPAALMFPQ
GSARIAEAMYSQGLHAQALHEAMAEAIAAIVERQPQRRWRLLELGAGTAAASRTVIARLAPLVQRGAEVDYLFTDVSSYF
LAAARERFADQPWVRFGRFDMNGDLLDQGVAPHSVDILLSSGALNNALDTPALLAGLRELLSADAWLVIQELTREHNEIS
VSQSLMMENPRDLRDERRQLFVHTGQWLEWLAAQGGDLACGVVPPGSALDLLGYDVLLARCKTDRARLEPAELLAFVEAR
VPRYMLPAQLRVLERLPVTGNGKIDRKALTGFARQPQADLRHGVAQAPADELENALLALWREVLDNPSLGVEQDFFGAGG
DSLLIAQLIARLRERLESARRHPFDRLLRWALSQPTPRGLAERLRSAPEEGRGPALAAARGVAPAPAGMSRAPLAEGAVA
LDPLVRLVPGEGVPRVLVHEGLGTLLPYRPLLRALGEGRPLLGLAVHDSDAYLAIPAEHLNACLGRRYAEALHRAGLREV
DLLGYCSGGLVALETAKSLVQRGVRVRQLDIVSSYRIPYRVDDERLLLFSFAATLGLDTAALGFPAPERLGQAVQAALAQ
TPERLVAEALAGLPGLADLVALRGRVLQAASGSADAVSVERDTLYRLFCHSVRASQAEAPEPYVGALRLFVPDAGNPLVP
RYAEALETQWRAAALGACGIHEVPGGHFDCLGEALAQSLSKPMPEEASR
>Q9HTR2 3.1.3.75~~~pchP~~~Phosphorylcholine phosphatase~~~
MTFAKGILAALALAAAVGQASATELEHWPAPAARQLNALIEANANKGAYAVFDMDNTSYRYDLEESLLPYLEMKGVLTRD
RLDPSLKLIPFKDQAGHKESLFSYYYRLCEIDDMVCYPWVAQVFSGFTLRELKGYVDELMAYGKPIPATYYDGDKLATLD
VEPPRVFSGQRELYNKLMENGIEVYVISAAHEELVRMVAADPRYGYNAKPENVIGVTTLLKNRKTGELTTARKQIAEGKY
DPKANLDLEVTPYLWTPATWMAGKQAAILTYIDRWKRPILVAGDTPDSDGYMLFNGTAENGVHLWVNRKAKYMEQINGMI
KQHSAAQAKAGLPVTADRNWVIVTPEQIQ
>P40762 ~~~pchR~~~HTH-type transcriptional regulator PchR~~~COG1846
MSDLTKQMIYDIYVRLLHLNEQKANTSLQQFFKEAAEEDVAEIPKNMTSIHVIDCIGQHEPINNAGIARKMNLSKANVTK
ISTKLIKEEFINSYQLTDNKKEVYFKLTRKGRRIFDLHEKLHKKKELAFYQFLDSFSQEEQKAVLKFLEQLTSTLEAEQT
DGTPDKPVK
>P40883 ~~~pchR~~~Regulatory protein PchR~~~
MTITIIAPPQADAAAPAPGNRPGVAHIDPNMKLVTGTFCSASEDWFEEPLERGLRLILVQSGQLRCRIPGQPEHLIEGPS
LCTIANDGDFTSAQIYGTDKPLRYTIVQLGVEALDSRLGWLPEQLIRRPGGDPRIMSCPAPRAMQALASQIATCQMLGPT
RDLYLGGKALELAALSAQFLSGEGRPVEEPRITCSEVERIHAARDLLVGALQEPPSLDTLASRVGMNPRKLTAGFRKVFG
ASVFGYLQEYRLREAHRMLCDEEANVSTVAYRVGYSPAHFSIAFRKRYGISPSEIR
>A6VKV4 4.1.1.49~~~pckA~~~Phosphoenolpyruvate carboxykinase (ATP)~~~COG1866
MTDLNKLVKELNDLGLTDVKEIVYNPSYEQLFEEETKPGLEGFDKGTLTTLGAVAVDTGIFTGRSPKDKYIVCDETTKDT
VWWNSEAAKNDNKPMTQETWKSLRELVAKQLSGKRLFVVEGYCGASEKHRIGVRMVTEVAWQAHFVKNMFIRPTDEELKN
FKADFTVLNGAKCTNPNWKEQGLNSENFVAFNITEGIQLIGGTWYGGEMKKGMFSMMNYFLPLKGVASMHCSANVGKDGD
VAIFFGLSGTGKTTLSTDPKRQLIGDDEHGWDESGVFNFEGGCYAKTINLSQENEPDIYGAIRRDALLENVVVRADGSVD
FDDGSKTENTRVSYPIYHIDNIVRPVSKAGHATKVIFLTADAFGVLPPVSKLTPEQTEYYFLSGFTAKLAGTERGVTEPT
PTFSACFGAAFLSLHPIQYADVLVERMKASGAEAYLVNTGWNGTGKRISIKDTRGIIDAILDGSIEKAEMGELPIFNLAI
PKALPGVDPAILDPRDTYADKAQWQVKAEDLANRFVKNFVKYTANPEAAKLVGAGPKA
>O09460 4.1.1.49~~~pckA~~~Phosphoenolpyruvate carboxykinase (ATP)~~~
MSLSESLAKYGITGATNIVHNPSHEELFAAETQASLEGFEKGTVTEMGAVNVMTGVYTGRSPKDKFIVKNEASKEIWWTS
DEFKNDNKPVTEEAWAQLKALAGKELSNKPLYVVDLFCGANENTRLKIRFVMEVAWQAHFVTNMFIRPTEEELKGFEPDF
VVLNASKAKVENFKELGLNSETAVVFNLAEKMQIILNTWYGGEMKKGMFSMMNFYLPLQGIAAMHCSANTDLEGKNTAIF
FGLSGTGKTTLSTDPKRLLIGDDEHGWDDDGVFNFEGGCYAKVINLSKENEPDIWGAIKRNALLENVTVDANGKVDFADK
SVTENTRVSYPIFHIKNIVKPVSKAPAAKRVIFLSADAFGVLPPVSILSKEQTKYYFLSGFTAKLAGTERGITEPTPTFS
SCFGAAFLTLPPTKYAEVLVKRMEASGAKAYLVNTGWNGTGKRISIKDTRGIIDAILDGSIDTANTATIPYFNFTVPTEL
KGVDTKILDPRNTYADASEWEVKAKDLAERFQKNFKKFESLGGDLVKAGPQL
>P22259 4.1.1.49~~~pckA~~~Phosphoenolpyruvate carboxykinase (ATP)~~~COG1866
MRVNNGLTPQELEAYGISDVHDIVYNPSYDLLYQEELDPSLTGYERGVLTNLGAVAVDTGIFTGRSPKDKYIVRDDTTRD
TFWWADKGKGKNDNKPLSPETWQHLKGLVTRQLSGKRLFVVDAFCGANPDTRLSVRFITEVAWQAHFVKNMFIRPSDEEL
AGFKPDFIVMNGAKCTNPQWKEQGLNSENFVAFNLTERMQLIGGTWYGGEMKKGMFSMMNYLLPLKGIASMHCSANVGEK
GDVAVFFGLSGTGKTTLSTDPKRRLIGDDEHGWDDDGVFNFEGGCYAKTIKLSKEAEPEIYNAIRRDALLENVTVREDGT
IDFDDGSKTENTRVSYPIYHIDNIVKPVSKAGHATKVIFLTADAFGVLPPVSRLTADQTQYHFLSGFTAKLAGTERGITE
PTPTFSACFGAAFLSLHPTQYAEVLVKRMQAAGAQAYLVNTGWNGTGKRISIKDTRAIIDAILNGSLDNAETFTLPMFNL
AIPTELPGVDTKILDPRNTYASPEQWQEKAETLAKLFIDNFDKYTDTPAGAALVAAGPKL
>Q9ZNH4 4.1.1.49~~~pckA~~~Phosphoenolpyruvate carboxykinase (ATP)~~~COG1866
MQETGVHNGAYGTDKFGLKNLKGVYWNFGAPQLYEHALKNGEAVLSSDGALVADTGVFTGRSPKDKFTVRDATTENTMWW
GGNQSITAEQFEALYQDFLKHAEGMTLFAQDLYGGADPTFRIKTRVYTELAWHSLFIRTLLRRPERAELENFVPELTLID
LPSFRADPKRHGCRSENVVAIDFARKIVLIGGTQYAGEMKKSVFTTLNYYLPEKGVLPMHCSANVGPNGDTAIFFGLSGT
GKTTLSADPNRTLIGDDEHGWGKDGVFNFEGGCYAKCIKLSSENEPEIYAASTRFGAVLENVVLGELDRKPDFDDGSKTE
NTRSAYPLESIPNASLTGRAGQPKNVVMLAADAFGVMPPIAKLTPAQAMYHFLSGYTAKVAGTERGVTEPTPEFSTCFGS
PFLPRDPSVYGNMLRELIAKHNVDCWLVNTGWTGGIYGTGHRMPIKVTRALLTAALDGSLRNVEFRTDPYFGFAVPTALP
GVPSEILDPVKTWADKAAFDTTARKLVGMFQKNFAKFEAQVDAEVRAAAPDVKMAVE
>P99128 4.1.1.49~~~pckA~~~Phosphoenolpyruvate carboxykinase (ATP)~~~
MSVDTYTETTKIDKLLKKPTSHFQLSTTQLYNKILDNNEGVLTELGAVNASTGKYTGRSPKDKFFVSEPSYRDNIDWGEI
NQPIDEETFLKLYHKVLDYLDKKDELYVFKGYAGSDKDTMLKLTVINELAWHNLFAKNMFIRPESKEEATKIKPNFTIVS
APHFKADPEVDGTKSETFVIISFKHKVILIGGTEYAGEMKKGIFSVMNYLLPMQDIMSMHCSANVGEKGDVALFFGLSGT
GKTTLSADPHRKLIGDDEHGWNKNGVFNIEGGCYAKAINLSKEKEPQIFDAIKYGAILENTVVAEDGSVDFEDNRYTENT
RAAYPINHIDNIVVPSKAAHPNTIIFLTADAFGVIPPISKLNKDQAMYHFLSGFTSKLAGTERGVTEPEPSFSTCFGAPF
FPLHPTVYADLLGELIDLHDVDVYLVNTGWTGGKYGVGRRISLHYTRQMVNQAISGKLKNAEYTKDSTFGLSIPVEIEDV
PKTILNPINAWSDKEKYKAQAEDLIQRFEKNFEKFGEKVEHIAEKGSFNK
>Q5SLL5 4.1.1.49~~~pckA~~~Phosphoenolpyruvate carboxykinase (ATP)~~~COG1866
MQRLEALGIHPKKRVFWNTVSPVLVEHTLLRGEGLLAHHGPLVVDTTPYTGRSPKDKFVVREPEVEGEIWWGEVNQPFAP
EAFEALYQRVVQYLSERDLYVQDLYAGADRRYRLAVRVVTESPWHALFARNMFILPRRFGNDDEVEAFVPGFTVVHAPYF
QAVPERDGTRSEVFVGISFQRRLVLIVGTKYAGEIKKSIFTVMNYLMPKRGVFPMHASANVGKEGDVAVFFGLSGTGKTT
LSTDPERPLIGDDEHGWSEDGVFNFEGGCYAKVIRLSPEHEPLIYKASNQFEAILENVVVNPESRRVQWDDDSKTENTRS
SYPIAHLENVVESGVAGHPRAIFFLSADAYGVLPPIARLSPEEAMYYFLSGYTARVAGTERGVTEPRATFSACFGAPFLP
MHPGVYARMLGEKIRKHAPRVYLVNTGWTGGPYGVGYRFPLPVTRALLKAALSGALENVPYRRDPVFGFEVPLEAPGVPQ
ELLNPRETWADKEAYDQQARKLARLFQENFQKYASGVAKEVAEAGPRTE
>Q9AEM1 4.1.1.32~~~pckG~~~Phosphoenolpyruvate carboxykinase [GTP]~~~COG1274
MTTAAIRGLQGEAPTKNKELLNWIADAVELFQPEAVVFVDGSQAEWDRMAEDLVEAGTLIKLNEEKRPNSYLARSNPSDV
ARVESRTFICSEKEEDAGPTNNWAPPQAMKDEMSKHYAGSMKGRTMYVVPFCMGPISDPDPKLGVQLTDSEYVVMSMRIM
TRMGIEALDKIGANGSFVRCLHSVGAPLEPGQEDVAWPCNDTKYITQFPETKEIWSYGSGYGGNAILAKKCYALRIASVM
AREEGWMAEHMLILKLINPEGKAYHIAAAFPSACGKTNLAMITPTIPGWTAQVVGDDIAWLKLREDGLYAVNPENGFFGV
APGTNYASNPIAMKTMEPGNTLFTNVALTDDGDIWWEGMDGDAPAHLIDWMGNDWTPESDENAAHPNSRYCVAIDQSPAA
APEFNDWEGVKIDAILFGGRRADTVPLVTQTYDWEHGTMVGALLASGQTAASAEAKVGTLRHDPMAMLPFIGYNAGEYLQ
NWIDMGNKGGDKMPSIFLVNWFRRGEDGRFLWPGFGDNSRVLKWVIDRIEGHVGADETVVGHTAKAEDLDLDGLDTPIED
VKEALTAPAEQWANDVEDNAEYLTFLGPRVPAEVHSQFDALKARISAAHA
>A1KF31 4.1.1.32~~~pckG~~~Phosphoenolpyruvate carboxykinase [GTP]~~~
MTSATIPGLDTAPTNHQGLLSWVEEVAELTQPDRVVFTDGSEEEFQRLCDQLVEAGTFIRLNPEKHKNSYLALSDPSDVA
RVESRTYICSAKEIDAGPTNNWMDPGEMRSIMKDLYRGCMRGRTMYVVPFCMGPLGAEDPKLGVEITDSEYVVVSMRTMT
RMGKAALEKMGDDGFFVKALHSVGAPLEPGQKDVAWPCSETKYITHFPETREIWSYGSGYGGNALLGKKCYSLRIASAMA
HDEGWLAEHMLILKLISPENKAYYFAAAFPSACGKTNLAMLQPTIPGWRAETLGDDIAWMRFGKDGRLYAVNPEFGFFGV
APGTNWKSNPNAMRTIAAGNTVFTNVALTDDGDVWWEGLEGDPQHLIDWKGNDWYFRETETNAAHPNSRYCTPMSQCPIL
APEWDDPQGVPISGILFGGRRKTTVPLVTEARDWQHGVFIGATLGSEQTAAAEGKVGNVRRDPMAMLPFLGYNVGDYFQH
WINLGKHADESKLPKVFFVNWFRRGDDGRFLWPGFGENSRVLKWIVDRIEHKAGGATTPIGTVPAVEDLDLDGLDVDAAD
VAAALAVDADEWRQELPLIEEWLQFVGEKLPTGVKDEFDALKERLG
>A0QP32 4.1.1.32~~~pckG~~~Phosphoenolpyruvate carboxykinase [GTP]~~~COG1274
MTSATIPGLDTAPTKHQGLLAWVQEVAELTQPDRVVFADGSDEEYERLCAHLVEAGTFQKLNPEKQPNSYLALSDPSDVA
RVESRTFICTEREIDAGPTNNWMDPAEMRGIMTDLYRGSMRGRTLYVVPFCMGPLDAEDPKLGVEITDSEYVVVSMRTMT
RMGRAALDKLGDDGFFVKALHSIGAPLEPGQKDVPWPCNDTKYITHFPETREIWSFGSGYGGNALLGKKCYSLRIASAMA
HDEGWLAEHMLILKLISPENKAYFIAAAFPSACGKTNLAMLQPTIEGWRAETVGDDIAWMRFGKDGRLYATNPEFGFFGV
APGTNWSSNPNAMKTIAAGNTVFTNVAKTDDGDVWWEGLEGDPQHLIDWKGNDWTPESGEKAAHPNSRYCTPISQCPTLA
PEWDDPQGVPISAILFGGRRKTTVPLITEARDWQHGVFIGATLGSEQTAAAEGKVGTVRRDPMAMLPFLGYNVGDYFAHW
INVGKNADESKLPKVFFVNWFRRGDDGRFLWPGFGENSRVLKWAVERIEHKADGKSTPIGIVPTAADLDLEGLDVDPADV
DEALAVKPEEWRAELPLIEEWFEFVGEKLPTGLKDEFDALKHRLSEEG
>Q9AGJ6 4.1.1.32~~~pckG~~~Phosphoenolpyruvate carboxykinase [GTP]~~~
MTSATIPGLDTAPTKHQGLLAWVQEVAELTQPDRVVFADGSDEEYERLCAHLVEAGTFQKLNPEKQPNSYLALSDPSDVA
RVESRTFICTEREIDAGPTNNWMDPAEMRGIMTDLYRGSMRGRTLYVVPFCMGPLDAEDPKLGVEITDSEYVVVSMRTMT
RMGRAALDKLGDDGFFVKALHSIGAPLEPGQKDVPWPCNDTKYITHFPETREIWSFGSGYGGNALLGKKCYSLRIASAMA
HDEGWLAEHMLILKLISPENKAYFIAAAFPSACGKTNLAMLQPTIEGWRAETVGDDIAWMRFGKDGRLYATNPEFGFFGV
APGTNWSSNPNAMKTIAAGNTVFTNVAKTDDGDVWWEGLEGDPQHLIDWKGNDWTPESGEKAAHPNSRYCTPISQCPTLA
PEWDDPQGVPISAILFGGRRKTTVPLITEARDWQHGVFIGATLGSEQTAAAEGKVGTVRRDPMAMLPFLGYNVGDYFAHW
INVGKNADESKLPKVFFVNWFRRGDDGRFLWPGFGENSRVLKWAVERIEHKADGKSTPIGIVPTAADLDLEGLDVDPADV
DEALAVKPEEWRAELPLIEEWFEFVGEKLPTGLKDEFDALKERLG
>A5TYT6 4.1.1.32~~~pckG~~~Phosphoenolpyruvate carboxykinase [GTP]~~~COG1274
MTSATIPGLDTAPTNHQGLLSWVEEVAELTQPDRVVFTDGSEEEFQRLCDQLVEAGTFIRLNPEKHKNSYLALSDPSDVA
RVESRTYICSAKEIDAGPTNNWMDPGEMRSIMKDLYRGCMRGRTMYVVPFCMGPLGAEDPKLGVEITDSEYVVVSMRTMT
RMGKAALEKMGDDGFFVKALHSVGAPLEPGQKDVAWPCSETKYITHFPETREIWSYGSGYGGNALLGKKCYSLRIASAMA
HDEGWLAEHMLILKLISPENKAYYFAAAFPSACGKTNLAMLQPTIPGWRAETLGDDIAWMRFGKDGRLYAVNPEFGFFGV
APGTNWKSNPNAMRTIAAGNTVFTNVALTDDGDVWWEGLEGDPQHLIDWKGNDWYFRETETNAAHPNSRYCTPMSQCPIL
APEWDDPQGVPISGILFGGRRKTTVPLVTEARDWQHGVFIGATLGSEQTAAAEGKVGNVRRDPMAMLPFLGYNVGDYFQH
WINLGKHADESKLPKVFFVNWFRRGDDGRFLWPGFGENSRVLKWIVDRIEHKAGGATTPIGTVPAVEDLDLDGLDVDAAD
VAAALAVDADEWRQELPLIEEWLQFVGEKLPTGVKDEFDALKERLG
>P9WIH3 4.1.1.32~~~pckG~~~Phosphoenolpyruvate carboxykinase [GTP]~~~COG1274
MTSATIPGLDTAPTNHQGLLSWVEEVAELTQPDRVVFTDGSEEEFQRLCDQLVEAGTFIRLNPEKHKNSYLALSDPSDVA
RVESRTYICSAKEIDAGPTNNWMDPGEMRSIMKDLYRGCMRGRTMYVVPFCMGPLGAEDPKLGVEITDSEYVVVSMRTMT
RMGKAALEKMGDDGFFVKALHSVGAPLEPGQKDVAWPCSETKYITHFPETREIWSYGSGYGGNALLGKKCYSLRIASAMA
HDEGWLAEHMLILKLISPENKAYYFAAAFPSACGKTNLAMLQPTIPGWRAETLGDDIAWMRFGKDGRLYAVNPEFGFFGV
APGTNWKSNPNAMRTIAAGNTVFTNVALTDDGDVWWEGLEGDPQHLIDWKGNDWYFRETETNAAHPNSRYCTPMSQCPIL
APEWDDPQGVPISGILFGGRRKTTVPLVTEARDWQHGVFIGATLGSEQTAAAEGKVGNVRRDPMAMLPFLGYNVGDYFQH
WINLGKHADESKLPKVFFVNWFRRGDDGRFLWPGFGENSRVLKWIVDRIEHKAGGATTPIGTVPAVEDLDLDGLDVDAAD
VAAALAVDADEWRQELPLIEEWLQFVGEKLPTGVKDEFDALKERLG
>A7IQE5 5.4.99.-~~~~~~Pivalyl-CoA mutase large subunit~~~COG1884
MNQAAVQLPLPGFEQASGQWRSDYSRQVAGEKPVRNRSGIEVQPLYSPRDWAGERYLDDLGFPGQYPFTRGIYPSMHRGR
TWTQRQLIGLGTPQDYNVRVRRIIDAGATAISLLPCCSGFRGIDCDEVDPVLLGTCGTVVNTTDHMDAALDGVPLGTIST
AMNDPSPFTLLAFTLGVARRRGIDWRSITGTSNQSDYISHFIANHQFYRLSLPGSRRVLLDHIEFCRRALPNWNPLSVVG
QHMQQAGATPAETMGFTLSSAIQYAQDCIERGMDVDDVLRRFTFFFDISISFFEEIAKFRAGRRIWARIARERLGAKDPA
CWRFKFHGQTSGVDLTQQQPLNNIARVSVQAMAGILSGLQSMHTDAYDEAIACPSEETARIAVATQNILRDEAQLCAVID
PLGGSYYVERLTDQMEAEIEAVIARIDAAGGMYKAAEVGLVQTMIGESALAFQEQLETGERKIVGVNCYQVEEDPTIPPA
ERPDPEAMERHVERFKVFKRERSQDAVARALDALARAANSERENVFEKVVEAAEAGVTHGEMVGCLRRELGFGHPLIIA
>A7IQE6 5.4.99.-~~~~~~Pivalyl-CoA mutase small subunit~~~COG2185
MIHAGTRPLRVLVTKIGLDGHDRGSRIVAAYLRDAGMEVIYTPPWQTIPGVVKLATEEDVDVIGISSLATDHLIVPKMME
ALRAAGLGHVGVVVGGIVPEAEQSALAAAGVSRVFGPGAAREEIVECVTALGQKSRAERVDDYSEANP
>P0ABF1 2.7.7.19~~~pcnB~~~Poly(A) polymerase I~~~COG0617
MFTRVANFCRKVLSREESEAEQAVARPQVTVIPREQHAISRKDISENALKVMYRLNKAGYEAWLVGGGVRDLLLGKKPKD
FDVTTNATPEQVRKLFRNCRLVGRRFRLAHVMFGPEIIEVATFRGHHEGNVSDRTTSQRGQNGMLLRDNIFGSIEEDAQR
RDFTINSLYYSVADFTVRDYVGGMKDLKDGVIRLIGNPETRYREDPVRMLRAVRFAAKLGMRISPETAEPIPRLATLLND
IPPARLFEESLKLLQAGYGYETYKLLCEYHLFQPLFPTITRYFTENGDSPMERIIEQVLKNTDTRIHNDMRVNPAFLFAA
MFWYPLLETAQKIAQESGLTYHDAFALAMNDVLDEACRSLAIPKRLTTLTRDIWQLQLRMSRRQGKRAWKLLEHPKFRAA
YDLLALRAEVERNAELQRLVKWWGEFQVSAPPDQKGMLNELDEEPSPRRRTRRPRKRAPRREGTA
>Q47453 ~~~pcoB~~~Copper resistance protein B~~~COG3667
MKRNLKAIPVLVAGLFTSQLSIAAGSVSADPHAGHDMSAMQMPADENFTEMTSMEPIVTESRTPIPPVTDADRKAAFGNL
QGHAIHDSAINYLVLLDQLEWQRSDNTNNFSWSVNSWIGGDTDRIWLKSEGERSNGETEAAEAQLLWGHAVGPWWDLVAG
VRQDFRPASARTWAAVGFQGLALYNFESEITGFVSNGGKAALRLGGEYDVLLTNRLILQPSYEVNFYSQDDESRGRGRGL
TDTELGLRLRYEIRREFAPYIGVSWNQLYGKTSDMAKREGEKDHQVVFLAGARIWF
>Q47454 ~~~pcoC~~~Copper resistance protein C~~~COG2372
MSILNKAILTGGLVMGVAFSAMAHPELKSSVPQADSAVAAPEKIQLNFSENLTVKFSGAKLTMTGMKGMSSHSPMPVAAK
VAPGADPKSMVIIPREPLPAGTYRVDWRAVSSDTHPITGNYTFTVK
>Q47459 ~~~pcoE~~~Probable copper-binding protein PcoE~~~
MKKILVSFVAIMAVASSAMAAETMNMHDQVNNAQAPAHQMQSSAEKSAVQGDSMTMMDMSSHDQAAMSHDMMQNGNSAAH
QDMAEMHKKMMKSKPAASNETAKSFSEMNEHEKSAVVHEKANNGQSSVIHQQQAEKHRSQITQN
>P42535 1.14.13.50~~~pcpB~~~Pentachlorophenol 4-monooxygenase~~~COG0654
MSTYPINAPGQSADAAVLIVGGGPTGLIAANELLRRGVSCRMIDRLPVAHQTSKSCTIHARSMEMMEHIGIAARYIETGV
RSNGFTFNFENTDANALLDFSVLPGRYPFITIYNQNETERVLRHDLEATYSFQPEWGTQLLALNQDENGIRADLRLKDGT
KQTISPRWVIGADGVRSRVRECLGIAYEGEDYEENVLQMMDVGIQDFEAGDDWIHYFIGQDKFVFVTKLPGSNYRVIISD
LGGANKSNLEETREAFQGYLSSFDDHATLDEPRWATKWRVWKRMATAYRKGNVFLAGDAAHCHSPSGGSGMNVGMQDAFN
LGWKIAMVERGEAKPDLLDTYHTERTPVAQQLLEGTHAMHEIIMGHGKGLTDRIELTQAPGWHDAATYRVSGMSYNYRDQ
LVSFNDDRLAGPSAGDRIPDAELAPRIRLFDLVRNTRPTLLVAPATEAEVAEAEKLRDLIREQWPLVKPVLVRPQGSEES
IEGDVHVDSYGQLKREWGDNAKGWAALLRPDNYIHARAGLDRGDLLVQAIDAMLVRCA
>Q03520 1.21.4.5~~~pcpC~~~Tetrachloro-P-hydroquinone reductive dehalogenase~~~COG0625
MPEVSLYNYTMSICSMKTRLAMEEFGVDYDDKQVDIGFALENFEPDYVRLNEKAVVPTLVVGDRVVTNSYNIVLEAAKLG
KVGIPADPVENKAALDWFQKGDQVNFQVITYGHKGVPRGDELLIARRERAKEYAEKYPELRSIYQAAHDRIVEHGNCAYD
ADTVAQAEVDLQKRLDELDAHLADKPFIAGSNYSIADIMWTVLLARIEMLNMTAWISERPNLLAYYQRMKARRSFETARV
MPNWKGGI
>Q47914 1.1.1.404~~~pcpD~~~Tetrachlorobenzoquinone reductase~~~
MTNPVSTIDMTVTQITRVAKDINSYELRPEPGVILPEFTAGAHIGVSLPNGIQRSYSLVNPQGERDRYVITVNLDRNSRG
GSRYLHEQLRVGQRLSIVPPANNFALVETAPHSVLFAGGIGITPIWSMIQRLRELGSTWELHYACRGKDFVAYRQELEQA
AAEAGARFHLHLDEEADGKFLDLAGPVAQAGQDSIFYCCGPEAMLQAYKAATADLPSERVRFEHFGAALTGEPADDVFTV
VLARRSGQEFTVEPGMTILETLLQNGISRNYSCTQGVCGTCETKVLEGEPDHRDWVLSDEKKASNSTMLICCSLSKSPRL
VLDI
>Q8KN33 1.8.5.7~~~pcpF~~~Glutathionyl-hydroquinone reductase PcpF~~~COG0435
MGLLIDGVWRDAWYDTKSSGGRFVRKESQYRGGLDAGFRGEPGRYHLYAGFACPWAHRVLIMRALKGLEEMISVSMVNAY
MGENGWTFLPGDDVVPDSINGADYLYQVYTAADPTYTGRVTIPILWDKVEKRILNNESSEIIRILNSAFDDVGALPGDYY
PAEFRPEIDRINARVYETLNNGVYRSGFATTQEAYEEAFYPLFDTLDWLEEHLTGREWLVGDRLTEADIRLFPTLVRFDA
IYHGHFKCNLRRIADYPNLSRLVGKLASHERVAPTINLRHAKAHYYGSHPSVNPTGIVPVGPAQPLPGLTLQS
>P52679 ~~~pcpR~~~PCP degradation transcriptional activation protein~~~COG0583
MNDSVLPLGHLMVFDALYRHGSAGKAAHALSMPQPTLSRWLAQLRTHFDDPLFVRTRSGMEPTPLAARAAPHIAEMIAIY
RQHVRSELRFDPGTSNRNFRIAASDFGQALMLPRLYATLEETAPQVRVTGVNLRHGPLVEELESGSIDIAFGGFPTLSAG
IKTQTLFREEYVCVMRQSHPALTHGLDLEAFRQCRHIIVTAHEFNHVHEQVEARLLELLPPESIRFTTENFLVSAVIAEE
TDVILTIPSRLARWFANRGGLTIFPVPIELPSIEVKQYWHERYDKDPGNIWLRRVIAKIGFQNPPAE
>P46107 3.4.19.3~~~pcp~~~Pyrrolidone-carboxylate peptidase~~~COG2039
MEKKVLLTGFDPFGGETVNPSWEAVKRLNGAAEGPASIVSEQVPTVFYKSLAVLREAIKKHQPDIIICVGQAGGRMQITP
ERVAINLNEARIPDNEGNQPVGEDISQGGPAAYWTGLPIKRIVEEIKKEGIPAAVSYTAGTFVCNHLFYGLMDEISRHHP
HIRGGFIHIPYIPEQTLQKSAPSLSLDHITKALKIAAVTAAVHEDDIETGGGELH
>Q81NT5 3.4.19.3~~~pcp~~~Pyrrolidone-carboxylate peptidase~~~COG2039
MKTVLLTGFDPFGGESINPAWEVAKSLHEKTIGEYKIISKQVPTVFHKSISVLKEYIEELAPEFIICIGQAGGRPDITIE
RVAINIDDARIADNEGNQPVDVPVVEEGPAAYWSTLPMKAIVKKLQEEGIPASVSQTAGTFVCNHLFYGLMHELEKHDTK
MKGGFIHIPFLPEQASNYPGQPSMSLSTIRKGIELAVEVTTTVEVDIVEVGGTTH
>P28618 3.4.19.3~~~pcp~~~Pyrrolidone-carboxylate peptidase~~~COG2039
MRKKVLITGFDPFDKETVNPSWEAAKRLNGFETEEAIITAEQIPTVFRSALDTLRQAIQKHQPDIVICVGQAGGRMQITP
ERVAINLADARIPDNEGHQPIDEEISPDGPAAYWTRLPVKRMTAKMKEHGIPAAVSYTAGTFVCNYLFYGLMDHISRTSP
HIRGGFIHIPYIPQQTIDKTAPSLSLDTIVRALRIAAVTAAQYDEDVKSPGGTLH
>Q9RX25 3.4.19.3~~~pcp~~~Pyrrolidone-carboxylate peptidase~~~COG2039
MPTLLLTGFEPFHTHPDNPSAQAAQELHGLELPGGWGVHSALLPVEPHAAGAALTRLLSEQDPGAVLLTGLAAGRPQVTL
ERVGVGVMDFQIPDNAGQTYRDQPIEPDAPAAYLATLPLRAILAAWREAEIPGDISNSAGLYVCNFVLYHALHWLREHGR
GAVPCGFLHVPANAAVALAVPADRPPLPYLPQSEITRAVRVAAEAITAQSSVLQMGKM
>P10325 ~~~pcp~~~Outer membrane lipoprotein pcp~~~COG3133
MKKTNMALALLVAFSVTGCANTDIFSGDVYSASQAKEARSITYGTIVSVRPVKIQADNQGVVGTLGGGALGGIAGSTIGG
GRGQAIAAVVGAIGGAIAGSKIEEKMSQVNGAELVIKKDDGQEIVVVQKADSSFCSLVAEFVFVGGGSSLNVSVL
>P9WIJ5 3.4.19.3~~~pcp~~~Pyrrolidone-carboxylate peptidase~~~COG2039
MSKVLVTGFGPYGVTPVNPAQLTAEELDGRTIAGATVISRIVPNTFFESIAAAQQAIAEIEPALVIMLGEYPGRSMITVE
RLAQNVNDCGRYGLADCAGRVLVGEPTDPAGPVAYHATVPVRAMVLAMRKAGVPADVSDAAGTFVCNHLMYGVLHHLAQK
GLPVRAGWIHLPCLPSVAALDHNLGVPSMSVQTAVAGVTAGIEAAIRQSADIREPIPSRLQI
>Q5HCK7 3.4.19.3~~~pcp~~~Pyrrolidone-carboxylate peptidase~~~
MHILVTGFAPFDNQNINPSWEAVTQLEDIIGTHTIDKLKLPTSFKKVDNIINKTLASNHYDVVLAIGQAGGRNAITPERV
AINIDDARIPDNDDFQPIDQAIHLDGAPAYFSNLPVKAMTQSIINQGLPGALSNSAGTFVCNHTLYHLGYLQDKHYPHLR
FGFIHVPYIPEQVIGKPDTPSMPLEKIVAGLTAAIEAISNDEDLHLALGTTE
>Q5XDD4 3.4.19.3~~~pcp~~~Pyrrolidone-carboxylate peptidase~~~
MKILVTGFDPFGGEAINPALEAIKKLPATIHGAEIKCIEVPTVFQKSADVLQQHIESFQPDAVLCIGQAGGRTGLTPERV
AINQDDARIPDNEGNQPIDTPIRADGKAAYFSTLPIKAMVAAIHQAGLPASVSNTAGTFVCNHLMYQALYLVDKYCPNAK
AGFMHIPFMMEQVVDKPNTAAMNLDDITRGIEAAIFAIVDFKDRSDLKRVGGATH
>O34580 5.6.2.4~~~pcrA~~~ATP-dependent DNA helicase PcrA~~~COG0210
MNYISNQLLSGLNPVQQEAVKTTDGPLLLMAGAGSGKTRVLTHRIAYLMAEKHVAPWNILAITFTNKAAREMKERVESIL
GPGADDIWISTFHSMCVRILRRDIDRIGINRNFSILDTADQLSVIKGILKERNLDPKKFDPRSILGTISSAKNELTEPEE
FSKVAGGYYDQVVSDVYADYQKKLLKNQSLDFDDLIMTTIKLFDRVPEVLEFYQRKFQYIHVDEYQDTNRAQYMLVKQLA
ERFQNLCVVGDSDQSIYRWRGADITNILSFEKDYPNASVILLEQNYRSTKRILRAANEVIKNNSNRKPKNLWTENDEGIK
ISYYRGDNEFGEGQFVAGKIHQLHSTGKRKLSDIAILYRTNAQSRVIEETLLKAGLNYNIVGGTKFYDRKEIKDILAYLR
LVSNPDDDISFTRIVNVPKRGVGATSLEKIASYAAINGLSFFQAIQQVDFIGVSAKAANALDSFRQMIENLTNMQDYLSI
TELTEEILDKTEYREMLKAEKSIEAQSRLENIDEFLSVTKNFEQKSEDKTLVAFLTDLALIADIDQLDQKEEESGGKDAI
TLMTLHAAKGLEFPVVFLMGLEEGVFPHSRSLMEEAEMEEERRLAYVGITRAEQELYLTNAKMRTLFGRTNMNPESRFIA
EIPDDLLENLNEKKETRATSARKMQPRRGPVSRPVSYASKTGGDTLNWAVGDKAGHKKWGTGTVVSVKGEGEGTELDIAF
PSPVGVKRLLAAFAPIEKQ
>Q47CW6 1.97.1.-~~~pcrA~~~Perchlorate reductase subunit alpha~~~COG5013
MVQMTRRGFLLASGATLLGSSLSFRTLAAAADLSGAFEYSGWENFHRAQWSWDKKTRGAHLINCTGACPHFVYSKEGVVI
REEQSKDIAPMTGIPEYNPRGCNKGECAHDYMYGPHRLKYPLIRVGERGEGKWRRASWDEALDMIADKVVDTIKNHAPDC
ISVYSPVPAVAPVSFSAGHRFAHYIGAHTHTFFDWYGDHPTGQTQTCGVQGDTAETADWFNSKYIILWGANPTQTRIPDA
HFLSEAQLNGTKIVSIAPDFNSSAIKVDKWIHPQPGTDGALALSMAHVIIKEKLYDAHNLKEQTDLSYLVRSDTKRFLRE
ADVVAGGSKDKFYLWDVRTGKPVIPKGCWGDQPEQKAPPVAFMGRNTNTFPKGYIDLGDIDPALEGKFKIQLLDGKSIEV
RPVFEILKSRIMADNTPEKAAKITGVPAKSITELAREYATAKPSMIICGGGTQHWYYSDVLLRAMHLLTALTGSEGKNGG
GLNHYIGQWKPTFLPGLVALAFPEGPAKQRFCQTTIWTYIHAEVNDQILNSDVDTEKYLREAFASRQMPNLPRDGRDPKV
FIIYRGNWLNQAKGQKYVLRNLWPKLELVVDINIRMDSTALYSDVVLPSAHWYEKLDLNVTEEHTFINMTEPAIKPMWES
KTDWQIFLALSKRVEMAANRKGYQKFNDEQFKWVRNLSNLWNQMTMDGKLAEDAAAAQYILDNAPHSKGITLDMLREKPQ
RFKANWTSSMKEGVPYTPFQNFVVDKKPWPTLTGRQQFYLDHETFFDMGVELPVYKAPIDADKYPFRFNSPHSRHSIHST
FKDSVLMLRLQRGGPSIDISSIDAKTLGIKDNDWVEVWNDHGKVICRVKIRSGEQRGRVSMWHTPELYMDLIEGGSQSVC
PVRITPTHLVGNYGHLVFRPNYYGPGGTQRDVRVNMKRYIGATPMSF
>P56255 5.6.2.4~~~pcrA~~~ATP-dependent DNA helicase PcrA~~~
MNFLSEQLLAHLNKEQQEAVRTTEGPLLIMAGAGSGKTRVLTHRIAYLMAEKHVAPWNILAITFTNKAAREMRERVQSLL
GGAAEDVWISTFHSMCVRILRRDIDRIGINRNFSILDPTDQLSVMKTILKEKNIDPKKFEPRTILGTISAAKNELLPPEQ
FAKRASTYYEKVVSDVYQEYQQRLLRNHSLDFDDLIMTTIQLFDRVPDVLHYYQYKFQYIHIDEYQDTNRAQYTLVKKLA
ERFQNICAVGDADQSIYRWRGADIQNILSFERDYPNAKVILLEQNYRSTKRILQAANEVIEHNVNRKPKRIWTENPEGKP
ILYYEAMNEADEAQFVAGRIREAVERGERRYRDFAVLYRTNAQSRVMEEMLLKANIPYQIVGGLKFYDRKEIKDILAYLR
VIANPDDDLSLLRIINVPKRGIGASTIDKLVRYAADHELSLFEALGELEMIGLGAKAAGALAAFRSQLEQWTQLQEYVSV
TELVEEVLDKSGYREMLKAERTIEAQSRLENLDEFLSVTKHFENVSDDKSLIAFLTDLALISDLDELDGTEQAAEGDAVM
LMTLHAAKGLEFPVVFLIGMEEGIFPHNRSLEDDDEMEEERRLAYVGITRAEEELVLTSAQMRTLFGNIQMDPPSRFLNE
IPAHLLETASRRQAGASRPAVSRPQASGAVGSWKVGDRANHRKWGIGTVVSVRGGGDDQELDIAFPSPIGIKRLLAKFAP
IEKV
>Q53727 5.6.2.4~~~pcrA~~~ATP-dependent DNA helicase PcrA~~~COG0210
MNALLNHMNTEQSEAVKTTEGPLLIMAGAGSGKTRVLTHRIAYLLDEKDVSPYNVLAITFTNKAAREMKERVQKLVGDQA
EVIWMSTFHSMCVRILRRDADRIGIERNFTIIDPTDQKSVIKDVLKNENIDSKKFEPRMFIGAISNLKNELKTPADAQKE
ATDYHSQMVATVYSGYQRQLSRNEALDFDDLIMTTINLFERVPEVLEYYQNKFQYIHVDEYQDTNKAQYTLVKLLASKFK
NLCVVGDSDQSIYGWRGADIQNILSFEKDYPEANTIFLEQNYRSTKTILNAANEVIKNNSERKPKGLWTANTNGEKIHYY
EAMTERDEAEFVIREIMKHQRNGKKYQDMAILYRTNAQSRVLEETFMKSNMPYTMVGGQKFYDRKEIKDLLSYLRIIANS
NDDISLQRIINVPKRGVGPSSVEKVQNYALQNNISMFDALGEADFIGLSKKVTQECLNFYELIQSLIKEQEFLEIHEIVD
EVLQKSGYREMLERENTLESRSRLENIDEFMSVPKDYEENTPLEEQSLINFLTDLSLVADIDEADTENGVTLMTMHSAKG
LEFPIVFIMGMEESLFPHIRAIKSEDDHEMQEERRICYVAITRAEEVLYITHATSRMLFGRPQSNMPSRFLKEIPESLLE
NHSSGKRQTIQPKAKPFAKRGFSQRTTSTKKQVLSSDWNVGDKVMHKAWGEGMVSNVNEKNGSIELDIIFKSQGPKRLLA
QFAPIEKKED
>P64319 5.6.2.4~~~pcrA~~~ATP-dependent DNA helicase PcrA~~~
MNALLNHMNTEQSEAVKTTEGPLLIMAGAGSGKTRVLTHRIAYLLDEKDVSPYNVLAITFTNKAAREMKERVQKLVGDQA
EVIWMSTFHSMCVRILRRDADRIGIERNFTIIDPTDQKSVIKDVLKNENIDSKKFEPRMFIGAISNLKNELKTPADAQKE
ATDYHSQMVATVYSGYQRQLSRNEALDFDDLIMTTINLFERVPEVLEYYQNKFQYIHVDEYQDTNKAQYTLVKLLASKFK
NLCVVGDSDQSIYGWRGADIQNILSFEKDYPEANTIFLEQNYRSTKTILNAANEVIKNNSERKPKGLWTANTNGEKIHYY
EAMTERDEAEFVIREIMKHQRNGKKYQDMAILYRTNAQSRVLEETFMKSNMPYTMVGGQKFYDRKEIKDLLSYLRIIANS
NDDISLQRIINVPKRGVGPSSVEKVQNYALQNNISMFDALGEADFIGLSKKVTQECLNFYELIQSLIKEQEFLEIHEIVD
EVLQKSGYREMLERENTLESRSRLENIDEFMSVPKDYEENTPLEEQSLINFLTDLSLVADIDEADTENGVTLMTMHSAKG
LEFPIVFIMGMEESLFPHIRAIKSEDDHEMQEERRICYVAITRAEEVLYITHATSRMLFGRPQSNMPSRFLKEIPESLLE
NHSSGKRQTIQPKAKPFAKRGFSQRTTSTKKQVSSSDWNVGDKVMHKAWGEGMVSNVNEKNGSIELDIIFKSQGPKRLLA
QFAPIEKKED
>Q81ZG3 2.5.1.n9~~~pcrB~~~Heptaprenylglyceryl phosphate synthase~~~COG1646
MYDISGWKHVFKLDPNKELSDEHLEMICESGTDAVIVGGSDGVTIDNVLHMLVSIRRYAVPCVLEVSDVEAITPGFDFYY
IPSVLNSRKVEWVTGVHHEALKEFGDIMDWDEIFMEGYCVLNPEAKVAQLTDAKCDVTEDDVIAYARLADKLLRLPIFYL
EYSGTYGDVELVKNVKAELKQAKLYYGGGISNAEQAKEMAQHADTVVVGNIIYDDIKAALKTVKAVKGE
>O34790 2.5.1.n9~~~pcrB~~~Heptaprenylglyceryl phosphate synthase~~~COG1646
MYDVTEWKHVFKLDPNKDLPDEQLEILCESGTDAVIIGGSDGVTEDNVLRMMSKVRRFLVPCVLEVSAIEAIVPGFDLYF
IPSVLNSKNADWIVGMHQKAMKEYGELMSMEEIVAEGYCIANPDCKAAALTEADADLNMDDIVAYARVSELLQLPIFYLE
YSGVLGDIEAVKKTKAVLETSTLFYGGGIKDAETAKQYAEHADVIVVGNAVYEDFDRALKTVAAVKGE
>Q47CW7 ~~~pcrB~~~Perchlorate reductase subunit beta~~~COG1140
MANVMKAPKRQLTYVTDLNKCIGCQTCTVACKKLWTTGPGQDFMYWRNVETTPGLGYPRNWQTKGGGYKNGELQKGKIPP
MIDYGIPFEFDYAGRLFEGKKERVRPSPTPRSAPNWDEDQGAGEYPNNSFFYLPRMCNHCTKPACLEACPNEAIYKREQD
GIVVIHQDKCKGAQACVQSCPYAKPYFNPVANKANKCIGCFPRIEQGVAPGCVAQCVGRAMHVGFIDDTNSSVHKLIRLY
KVALPLHPEFGTEPNVFYVPPVLGPRMELPNGELSTDPKIPLAQLEGLFGKQVRDVLAILQTEREKKMKGLASDLMDVLI
GRRSADMMISPLT
>Q5L3C1 2.5.1.n9~~~pcrB~~~Heptaprenylglyceryl phosphate synthase~~~COG1646
MEEIRAWRHVFKLDPNKPIDDERLERLCESGTDAVIVGGTDGVTIDNVLDLLARIRRFSVPCALEVTDVEALTPGFDVYL
VPIVLNSRQAEWIIGRHHEAVKQYGDMMNWDEIAAEGYCILNPECKAAKLTRADTELDVDDIVAYARLAEHLYKLPIFYL
EYSGVYGDPSVVEKVKQALDQTQLFYGGGITTPEQAEHMARYADTVVVGNAIYDAFEQALATVAAVKQMAGQRNGDDGK
>Q8Y6C8 2.5.1.n9~~~pcrB~~~Heptaprenylglyceryl phosphate synthase~~~COG1646
MKHLFKLDPAKNLPTNDVTKLIHSGTDGFIIGGTDNVQIEAVQNLYELLVETDLPIFLEISNESMILPEADHFLIPVVLN
TENSKWTHGLHKELIKEMGEFIPWKRVTSEGYVILNKDAKVAHLTEAKTDLTDEDIVAYARLAENIFHLPIFYVEYSGMY
GDPEVVRKASAALSNTKFWYGGGIRSKEQAAEMAKYADTIIVGNIIYEDLEKALETATIFRKKTV
>A7X435 2.5.1.n9~~~pcrB~~~Heptaprenylglyceryl phosphate synthase~~~
MYDIKKWRHIFKLDPAKHISDDDLDAICMSQTDAIMIGGTDDVTEDNVIHLMSKIRRYPLPLVLEISNIESVMPGFDFYF
VPTVLNSTDVAFHNGTLLEALKTYGHSIDFEEVIFEGYVVCNADSKVAKHTKANTDLTTEDLEAYAQMVNHMYRLPVMYI
EYSGIYGDVSKVQAVSEHLTETQLFYGGGISSEQQATEMAAIADTIIVGDIIYKDIKKALKTVKIKESSK
>Q53726 2.5.1.n9~~~pcrB~~~Heptaprenylglyceryl phosphate synthase~~~COG1646
MYDIKKWRHIFKLDPAKHISDDDLDAICMSQTDAIMIGGTDDVTEDNVIHLMSRVRRYPLPLVLEISNIESVMPGFDFYF
VPTVLNSTDVVFHNGTLLEALKTYGHSIDFEEVIFEGYVVCNADSKVAKHTKANTDLTTEDLEAYAQMVNHMYRLPVMYI
EYSGIYGDVSKVQAVSEHLTETQLFYGGGISSEQQATEMAAIADTIIVGDIIYKDIKKALKTVKIKESSK
>Q47CW8 ~~~pcrC~~~Perchlorate reductase subunit gamma~~~COG0737
MIKILALATLLISGFLPGVTVAQQAEYLGFRACTKCHDSQGETWRASAHAKAFDSLKPNAKSEAKTKAKLDPKKDYTQDK
NCVGCHVTGYGEPGGFVSGASLDDMKTLVGVTCESCHGAGGKFRNLHGEASDRLKNQGETSERKQLVTAGQNFDMEKACA
RCHLNFEGSTKHDAKAPFTPFSPSVGSKYQFDFQKSVMTTGAGNPIHTHFKLRGVFKGDPVPAVRAKLQEDAPEPE
>A9CIM3 2.7.8.24~~~pcs~~~Phosphatidylcholine synthase~~~COG1183
MKIFNYKRVPYAEIRAFSVHILTASGSFLAFLGVVAASEHRFVDMFWWLGLALLVDGIDGPIARKVRVKEVLPNWSGDTL
DNIIDYVTYVLLPAFALYQSGMIGEPLSFVAAGMIVVSSAIYYADMGMKTDEYFFSGFPVVWNMVVFTLFVMDASATTAM
TVVTVSVFLTFLPINFLHPVRVKRLRPLNLLVVAIWCALGGYALLMHFETPTWAVIAFVASGIYLYCIGGILQFFPSLGA
K
>O51265 2.7.8.24~~~pcs~~~Phosphatidylcholine synthase~~~
MKNINLILAWLVHIFTASGLIVGLYSIISIVNGNYSLLLKLTVIGLIIDGIDGTMARKLKVKELIPEIDGTLLDNITDYI
NYTFIPVIFFYLGEFIEEKYKVAICIGILLSSAYQFSRTDAKTNDNYFRGFPSLWNLFVILNIIFKMEQITNLITMSICI
ITSFIPIKFIYPSKTKELRKITIPITIISCLIFVVSIFSELSTTALKMAKTVLILYFAYLTLASIYLTYKTRNR
>Q89LF9 2.7.8.24~~~pcs~~~Phosphatidylcholine synthase~~~COG1183
MILWRIVRPGAAMAYVQTGLVLIAEAMDTQQDSLKPRPAMRAAAFSVHVFTAFGAAIALLAMLEAVREHWAAMFQWLGVA
LIIDAIDGPIARRLDVKNVQPNWSGDVLDLVVDFVTYVFVPAYAIVASGLLLPVAAPLLGVAIIVTSALYFADLRMKADD
NHFRGFPALWNAAAFYLFLLHWPPLWSTLLVAALVVLTFVPFHVLHPVRVVRLRWLTMSLIGIWAVLSLYTLDMDFRVGP
GVTLALCAIALWISFSDALIRFARSFA
>D0B707 2.7.8.24~~~pcs~~~Phosphatidylcholine synthase~~~COG1183
MGGQKEMADSVKTKLTGKLKAKKVTAPQAKAFSVHLLTASGSFLAFLSVVAASDGRYTAMWWWLGLALFVDGIDGPIARK
LEVKYVLPNWSGELLDSIIDYVTYVLIPAFALYQSGFMGTNLSFISGAIIVVSSAIYYADTGMKTKENFFKGFPVVWNMV
VFTLFIVRPGEWVAFGTVVASAILSFLPINFLHPVRVVRLRPLNLTIFLLWCAFGVIALYYMLDAPLWVRIGISVTGLYI
YFIGAIMQLFPSLGREAALAKARKLVEKQQKSGEAP
>Q5ZV56 2.7.8.24~~~pcsA~~~Phosphatidylcholine synthase~~~COG1183
MNPIKPPFTLNQYFAAWFVHVFTASAACIGVFSLYKIYQHDYVFALWLMAITVFIDAVDGSLARLVHVKSVLPKIDGALL
DNIVDYLNYVITPCFFLLVKPGMLPADYVVPITAAITITSAYQFCQDDAKTPDHFFKGFPCYWNITVFYMYIFNTSMIVN
TVLLSLFCVLIFIPVKYVYPSRLDYLTESRVLKILMHCCSALYGISSFCLLVNYPETNKLWVSLSLGYVGMYLFLSFYRT
YYPMFKAKITANNKD
>Q9HXE9 2.7.8.24~~~pcs~~~Phosphatidylcholine synthase~~~
MPVNLSMTPINKAKAWGVHAVTASGVILALLALLALVDNKPQACLLWLGLALLVDGLDGTLARKYEVKEMLPHFDGSVLD
LVIDYLTYVFIPAIFIYRYIPLPEHFELLAVGVILVSSLFCFCNVNMKSTDNYFVGFPAAWNVVAVYFYVLDLHPWVNLA
TVLVLAALTLTRMKFLHPFRVRQFMPLNIAVTFVWLISSGLLIVQQPADLPILLGLWFAASAYFVGICLWRSAREWFG
>Q1MGQ9 2.7.8.24~~~pcs~~~Phosphatidylcholine synthase~~~COG1183
MKIFNYKRVPYAEMRAFSVHILTASGSFLAFLGVVAAAEHRFIDMFWWLGLALLVDGIDGPIARKVRVKEVLPNWSGDTL
DNIIDYVTYVLLPAFALYQSGMIGEPWSFVAAGMIVVSSAIYYADMGMKTDEYFFSGFPVVWNMIVFTLFVIDASATTAL
TVVIVSVVLTFLPINFLHPVRVKRLRPLNLGVFFLWSALGIFSLLMHFDTPEWALILFIVTGAYLYVIGAVLQFFPALGR
ET
>Q98MN3 2.7.8.24~~~pcs~~~Phosphatidylcholine synthase~~~COG1183
MAARKAAKKLTDRIPRPKKKVTWPQARAFSVHLLTASGSFLAFLSLVAASEERWTAMFWWLGLALFVDGIDGPIARKLEV
KEILPTWSGELLDNIIDYVTYVLIPAFALYQRGFMGEGLSFLSAAIIVVSSAIYYADTGMKTKENFFKGFPVVWNMVVFT
LFVIEPGQWVSFAVVVVAGILTFVPINFIHPVRVVRLRPFNLTMTLLWCAFGALALAQAALAAFYDQIGVLGAQVSTFIK
IGITITGLYLACIGGIMQFFPNLGAKKA
>Q9KJY8 2.7.8.24~~~pcs~~~Phosphatidylcholine synthase~~~COG1183
MKFFNYRRVPYAEIRAFSVHILTASGSFLAFLGVVAAAEHRFVDMFWWLGLALLVDGIDGPIARKVQVKEVLPNWSGDTL
DNVIDYVTYVLLPAFALYQSGMIGEPWSFVAAGAIVVSSAIYYADMGMKTDEYFFSGFPVVWNMVVFTLFVIQASEVTAS
IVVFLSVILTFLPINFLHPVRVKRLRPLNLGIFLVWSVLGMYALLLHFETPPWVVVGVVATGLYLYVIGFILQIFPKLGR
A
>G3XD24 ~~~pctA~~~Methyl-accepting chemotaxis protein PctA~~~
MIKSLKFSHKILLAASLVVFAAFALFTLYNDYLQRNAIREDLESYLREMGDVTSSNIQNWLGGRLLLVEQTAQTLARDHS
PETVSALLEQPALTSTFSFTYLGQQDGVFTMRPDSPMPAGYDPRSRPWYKDAVAAGGLTLTEPYVDAATQELIITAATPV
KAAGNTLGVVGGDLSLKTLVQIINSLDFSGMGYAFLVSGDGKILVHPDKEQVMKTLSEVYPQNTPKIATGFSEAELHGHT
RILAFTPIKGLPSVTWYLALSIDKDKAYAMLSKFRVSAIAAALISIVAILVLLGLLIRLLMQPLHLMGRAMQDIAQGEGD
LTKRLAVTSRDEFGVLGDAFNQFVERIHRSIREVAGTAHKLHDVSQLVVNASNSSMANSDEQSNRTNSVAAAINELGAAA
QEIARNAADASHHASDANHQAEDGKQVVEQTIRAMNELSEKISASCANIEALNSRTVNIGQILEVIKGISEQTNLLALNA
AIEAARAGEAGRGFAVVADEVRNLAHRAQESAQQIQKMIEELQVGAREAVATMTESQRYSLESVEIANRAGESLSSVTRR
IGEIDGMNQSVATATEEQTAVVDSLNMDITEINTLNQEGVENLQATLRACGELETQAGRLRQLVDSFKI
>Q9HW91 ~~~pctB~~~Methyl-accepting chemotaxis protein PctB~~~
MIKSLKFSHKILLAAALVVIATFSLFTLYNDSLQRASIREDLEDYLHEMGEITASNVQNWLSGRILLIENLAQTLARDHS
PETTQALLEQPLLGSTFLFTYLGQTDGTYTARPTSDLPADYDPRRRPWYNAATSAGQTTLTEPYMEPAIHELVLTIASPA
RQGGQPFGVVGGDLSLQTVVKIINSLDFGGMGYAFLVSGDGKILVHPDKDQVMKSLSDVYPRNTPKIGSGFSEAELHGNT
RILSFSPVKGLSGLDWYIGISVDKDKAYAMLTKLRTSAIVAALIAVVAIVLLLGMLIRVLMQPLTDMGRAMQDIAQGEGD
LTKRLKVTSNDEFGALAISFNRFVERIHESIREVAGTARQLHDVAQLVVNASNSSMANSDEQSNRTNSVAAAINELGAAA
QEIARNAADASHHASDANHQAEDGKQVVEQTIRAMNELSEKISASCANIEALNSRTVNIGQILEVIKGISEQTNLLALNA
AIEAARAGEAGRGFAVVADEVRNLAHRAQESAQQIQKMIEELQIGAQEAVSTMTESQRYSLESVEIANRAGERLSSVTGR
IAEIDGMNQSVATATEEQTAVVDSLNMDITEINTLNQEGVENLQATLRACGELETQAGRLRQLVDSFKI
>Q9HW93 ~~~pctC~~~Methyl-accepting chemotaxis protein PctC~~~
MLRSLSFAKKILLAAALVVVFAFSCFILYNDYRQREAVRTDTENYLGEIGTLTASNIQSWLEGRMHLVEGLASQLALLDQ
PDEANIARQLEQPVFSRNFASVYLGEAASGTFTMRPYDAMPEGYDPRTRAWYKDALAADRLIVTEPFVDAGTGEQILAMS
LPVRHAGQLLGVAAGDMKLETLTAILNSLKFDGAGYAFLVSDAGKILLHPDSGLVLKTLAEAYPKGAPNIVPGVHEVELD
GSSQFVSFTPVKGLPGVTWYVALVLDRDTAYSMLSEFRTSAIVATLIAVVGIMLLLGMLIRVLMQPLTDMGRAMQDIAQG
EGDLTKRLKVTSNDEFGTLANAFNRFVERIHESIREVAGTARQLHDVAQLVVNASNSSMANSDEQSNRTNSVAAAINELG
AAAQEIARNAADASHHASDANHQAEDGKQVVEQTIRAMNELSEKISASCANIEALNSRTVNIGQILEVIKGISEQTNLLA
LNAAIEAARAGEAGRGFAVVADEVRNLAHRAQESAQQIQKMIEELQVGAREAVATMTESQRYSLESVEIANRAGERLGSV
TSRIGEIDSMNQSVATATEEQTAVVDSLNMDITEINTLNQEGVENLQATLRACGELETQAGRLRHLVDSFKI
>A0A0J5WTU0 3.2.2.5~~~pycTIR~~~Pycsar effector protein BcPycTIR~~~
MIERFSGEAGKRLRVEALTGQKLVGGDKGLAAELADMAELISVKAGDVIIQQDGTDNDLYFIITGAFDIVVNATPIRRRF
PGDSVGEMAAVEPVQKRSATVSAAADSLVAKITEQQLSELGSRYPDIWRRMAKELSKRLIERNQFVNAKREKIRVFVISS
AEALGVAHLLQSMFAHDKFLTVPWNQGVFKVANYTLDDIERELDQCDFAVAIAHGDDVTNARGTEWPAPRDNVVFELGLF
MGRLGRKRAILMEPRGEGVKLPSDMAGVTTIPYVYDEKNDTEAKFGPAATALRKHIMSLGTIS
>P0DV29 3.2.2.5~~~pycTIR~~~Pycsar effector protein XpPycTIR~~~
MIERFQGDEGRRRLVATLTEHRLVANRQELAERLVAVGELMEAPAGTTFINQGDQTSEVFFIIAGKVEVRVNGKVVANRF
PGDSVGEMAAIEPSQPRAASVIPVEDTVLIKVSEAEFSAAAEQFPDVWRRIAATLARRLAERNHLVTAQRERVRVFIMSS
VEALPIVDLLIKQFAHDPFLAVAWKNGVFRASQYTLDELEAELDDSDFAVAVAHGDDVLITRDDEWPTIRDNVILEFGLF
MGRLGRRRAFLMEPRDVDLKLPSDLAGLTTIPYRYVKGKDAEHYIAPACARLRELILAAGAKD
>P20371 1.13.11.3~~~pcaG~~~Protocatechuate 3,4-dioxygenase alpha chain~~~COG3485
MNGWNFQELKETPSQTGGPYVHIGLLPKQANIEVFEHNLDNNLVQDNTQGQRIRLEGQVFDGLGLPLRDVLIEIWQADTN
GVYPSQADTQGKQVDPNFLGWGRTGADFGTGFWSFNTIKPGAVPGRKGSTQAPHISLIIFARGINIGLHTRVYFDDEAEA
NAKDPVLNSIEWATRRQTLVAKREERDGEVVYRFDIRIQGENETVFFDI
>P00436 1.13.11.3~~~pcaG~~~Protocatechuate 3,4-dioxygenase alpha chain~~~COG3485
MPIELLPETPSQTAGPYVHIGLALEAAGNPTRDQEIWNRLAKPDAPGEHILLLGQVYDGNGHLVRDSFLEVWQADANGEY
QDAYNLENAFNSFGRTATTFDAGEWTLHTVKPGVVNNAAGVPMAPHINISLFARGINIHLHTRLYFDDEAQANAKCPVLN
LIEQPQRRETLIAKRCEVDGKTAYRFDIRIQGEGETVFFDF
>P75028 ~~~pcxA~~~Proton extrusion protein PcxA~~~
MDLTNWWQGATQWFGRSSQKSLEQAFRSALKIKEIEDQYFQGKKIGPENCDYSADTVTYFANQIQRHLRKIEQEIYHLNS
DQEFVKILSLDPAVKQDPQTEYVLNQLQFIDDILQRYDGELPQVSPPKQIANGGVLDLPAITANKQRQINKKRRDGFQYI
RREDTQQKVDTATQKSGVLPRSFLRTIDRLKREMDPQSSDTEQKVLKQYRNSRYKTALSIKFVLTLIIVPLLAHQLTKTF
FLLPSVESFFERNSEVVFINQSMETEAYEELSHFEESLRFRELLGFGEKLSPEAKEEKLAEKAKEISESYRRVSTNAIAN
IFADIFSLVAFSLVLVNSQREIEVLKEFIDEIVYGLSDSAKAFLIILFTDMFVGFHSPHGWEVILASIARHFGLPENQDF
NFLFIATFPVILDTVFKYWIFRYLNSISPSAVATYRNMNE
>P20372 1.13.11.3~~~pcaH~~~Protocatechuate 3,4-dioxygenase beta chain~~~COG3485
MSQIIWGAYAQRNTEDHPPAYAPGYKTSVLRSPKNALISIAETLSEVTAPHFSADKFGPKDNDLILNYAKDGLPIGERVI
VHGYVRDQFGRPVKNALVEVWQANASGRYRHPNDQYIGAMDPNFGGCGRMLTDDNGYYVFRTIKPGPYPWRNRINEWRPA
HIHFSLIADGWAQRLISQFYFEGDTLIDSCPILKTIPSEQQRRALIALEDKSNFIEADSRCYRFDITLRGRRATYFENDL
T
>P00437 1.13.11.3~~~pcaH~~~Protocatechuate 3,4-dioxygenase beta chain~~~COG3485
MPAQDNSRFVIRDRNWHPKALTPDYKTSIARSPRQALVSIPQSISETTGPNFSHLGFGAHDHDLLLNFNNGGLPIGERII
VAGRVVDQYGKPVPNTLVEMWQANAGGRYRHKNDRYLAPLDPNFGGVGRCLTDSDGYYSFRTIKPGPYPWRNGPNDWRPA
HIHFGISGPSIATKLITQLYFEGDPLIPMCPIVKSIANPEAVQQLIAKLDMNNANPMDCLAYRFDIVLRGQRKTHFENC
>Q93TN0 1.3.7.5~~~pcyA~~~Phycocyanobilin:ferredoxin oxidoreductase~~~
MSLTSIPSLREQQHPLIRQLADCIEEVWHQHLDLSPYHLPAELGYVEGRLEGEKLTIENRCYQTPQFRKMHLELAKVGNM
LDILHCVMFPRPEYDLPMFGCDLVGGRGQISAAIADLSPVHLDRTLPESYNSALTSLNTLNFSQPRELPEWGNIFSDFCI
FVRPSSPEEEAMFLGRVREFLQVHCQGAIAASPVSAEQKQQILAGQHNYCSKQQQNDKTRRVLEKAFGVDWAENYMTTVL
FDLPE
>P22635 1.13.11.8~~~ligA~~~Protocatechuate 4,5-dioxygenase alpha chain~~~
MTEKKERIDVHAYLAEFDDIPGTRVFTAQRARKGYNLNQFAMSLMKAENRERFKADESAYLDEWNLTPAAKAAVLARDYN
AMIDEGGNVYFLSKLFSTDGKSFQFAAGSMTGMTQEEYAQMMIDGGRSPAGVRSIKGGY
>Q55891 1.3.7.5~~~pcyA~~~Phycocyanobilin:ferredoxin oxidoreductase~~~
MAVTDLSLTNSSLMPTLNPMIQQLALAIAASWQSLPLKPYQLPEDLGYVEGRLEGEKLVIENRCYQTPQFRKMHLELAKV
GKGLDILHCVMFPEPLYGLPLFGCDIVAGPGGVSAAIADLSPTQSDRQLPAAYQKSLAELGQPEFEQQRELPPWGEIFSE
YCLFIRPSNVTEEERFVQRVVDFLQIHCHQSIVAEPLSEAQTLEHRQGQIHYCQQQQKNDKTRRVLEKAFGEAWAERYMS
QVLFDVIQ
>P22636 1.13.11.8~~~ligB~~~Protocatechuate 4,5-dioxygenase beta chain~~~
MARVTTGITSSHIPALGAAIQTGTSDNDYWGPVFKGYQPIRDWIKQPGNMPDVVILVYNDHASAFDMNIIPTFAIGCAET
FKPADEGWGPRPVPDVKGHPDLAWHIAQSLILDEFDMTIMNQMDVDHGCTVPLSMIFGEPEEWPCKVIPFPVNVVTYPPP
SGKRCFALGDSIRAAVESFPEDLNVHVWGTGGMSHQLQGPRAGLINKEFDLNFIDKLISDPEELSKMPHIQYLRESGSEG
VELVMWLIMRGALPEKVRDLYTFYHIPASNTALGAMILQPEETAGTPLEPRKVMSGHSLAQA
>O34928 3.5.1.-~~~pdaA~~~Peptidoglycan-N-acetylmuramic acid deacetylase PdaA~~~COG0726
MKWMCSICCAAVLLAGGAAQAEAVPNEPINWGFKRSVNHQPPDAGKQLNSLIEKYDAFYLGNTKEKTIYLTFDNGYENGY
TPKVLDVLKKHRVTGTFFVTGHFVKDQPQLIKRMSDEGHIIGNHSFHHPDLTTKTADQIQDELDSVNEEVYKITGKQDNL
YLRPPRGVFSEYVLKETKRLGYQTVFWSVAFVDWKINNQKGKKYAYDHMIKQAHPGAIYLLHTVSRDNAEALDDAITDLK
KQGYTFKSIDDLMFEKEMRLPSL
>O34798 3.5.1.-~~~pdaC~~~Peptidoglycan-N-acetylmuramic acid deacetylase PdaC~~~COG0726
MLAKRIKWFHVLIAVVCVVGLIGFFHNHSLKKETVMNKVRTDSQYGNVEIATLVNDGKTFNYAVNYPVFKNEKMDSALKR
FAEKEVRQFQKETKDVDQEHTTKRNELNVDYKIVHYAKQTVAIVFNEYKYIGGAHGQTVKKTFNYDFSKQAFLSIDDIFK
EDADYLHKLSLIAYHELKKNKDIAADDALLKEGTAPKKENFSRFAIKEDYIELYFDTYQVAAGYLGEQSIAIKKSLLKDI
LKEQYIDKAKNKNKIKEQKPKHEVISLPKEETVDPNQKVIALTFDDGPNPATTNQILDSLKKYKGHATFFVLGSRVQYYP
ETLIRMLKEGNEVGNHSWSHPLLTRLSVKEALKQINDTQDIIEKISGYRPTLVRPPYGGINDELRSQMKMDVALWDVDPE
DWKDRNKKTIVDRVMNQAGDGRTILIHDIYRTSADAADEIIKKLTDQGYQLVTVSQLEEVKKQREAK
>B9J8S0 3.5.4.11~~~codAch2~~~Pterin deaminase~~~COG0402
MSYSFMSPPNAARFVLSNATVPAVTVVGFTGPSSEGLMKADIVVADGLIKDILPAGTAPAELAKADMRDGMVWPTFADMH
THLDKGHIWERRANPDGSFMGALDAVRSDREANWSAADVRKRMEFSLRAAYAHGTSLIRTHLDSLAPQHRISFEVFSEVR
EAWKDKIALQAVALFPLDFMVDDAFFADLTTVVREAGGLLGGVTQMNPDIDAQLDKLIRAAAANGLDIDLHVDETEDREV
LTLKAIAAAVLRNGFTGKVTAGHCCSLARQDENVAAATIDLVAKAGISIVALPMCNMYLQDRHPGRTPRWRGVTLLHELA
AAGVPTAVASDNTRDPFYAYGDLDPVEVFREAVRILHLDHPLDTAARVVTTSPASILGRPDIGRIAVGGPADLVLFSARR
WSEFLSRPQSDRVVLRKGKVIDRSLPDYRELDTVIGA
>P06672 4.1.1.1~~~pdc~~~Pyruvate decarboxylase~~~COG3961
MSYTVGTYLAERLVQIGLKHHFAVAGDYNLVLLDNLLLNKNMEQVYCCNELNCGFSAEGYARAKGAAAAVVTYSVGALSA
FDAIGGAYAENLPVILISGAPNNNDHAAGHVLHHALGKTDYHYQLEMAKNITAAAEAIYTPEEAPAKIDHVIKTALREKK
PVYLEIACNIASMPCAAPGPASALFNDEASDEASLNAAVEETLKFIANRDKVAVLVGSKLRAAGAEEAAVKFADALGGAV
ATMAAAKSFFPEENPHYIGTSWGEVSYPGVEKTMKEADAVIALAPVFNDYSTTGWTDIPDPKKLVLAEPRSVVVNGIRFP
SVHLKDYLTRLAQKVSKKTGALDFFKSLNAGELKKAAPADPSAPLVNAEIARQVEALLTPNTTVIAETGDSWFNAQRMKL
PNGARVEYEMQWGHIGWSVPAAFGYAVGAPERRNILMVGDGSFQLTAQEVAQMVRLKLPVIIFLINNYGYTIEVMIHDGP
YNNIKNWDYAGLMEVFNGNGGYDSGAGKGLKAKTGGELAEAIKVALANTDGPTLIECFIGREDCTEELVKWGKRVAAANS
RKPVNKLL
>O51338 3.1.4.52~~~pdeA~~~Cyclic di-GMP phosphodiesterase PdeA~~~
MNKVNNYQNINSVIISKKEVGLRDLIKLKSIFNLIQIVKSEKALYSEYVKQNNIKFAIIYNYEKPIDFSINIANELKNAN
KIHSIIISKEKFDEEYLKLDHIEIIKDISELEYKQSLIHQKKLFCDNKNTTLDFFLNLSELIKEIVIITNTKNEIIYINE
KGSKNLNLPMKTSGNIIKVTDIDIRDWEKLEKINLSYHTNSIPEFKNILITDCLLTLKNNKKLHVDIFISTIAQNNIDKL
ITIKEISNSNKIENYKYLEIIDSQDEIQNAKEIEKLLVNHMDIYKKKSIYLLNLDVSLTAEYEYKEDQEKLNAKILKIMY
SKIMSLYSEYIFKLKHNNLIVIISTSGGEKRIISIAKKIKKTIALAFKKEDIIIFKFNIGIIEVNLKENLEFKIPKLMMA
TKISSEYKESNPTIYKEELPEAVILKNQNKIFQYILKAIKNDFFTLYYQKINPLKKNLKPKIEILTRLFDHMGKPIPNNQ
IFNLIDKYNLTVEVDTLVVKKALREYKSFVSKNGIHIFSINISPYSLKSQNFRIFLRDTLLKSQIPLQNICLEITETGIL
ENFEIINKYFQELKSFGIKLALDDFGSGHTSLSYIKTLPIDLLKIDGSFIKAINSSEIDFVIIKSIKKIADTKNIKIIAE
FVYNEEILKKIIELEIDYGQGFLWHKPEPI
>P23842 3.1.4.52~~~pdeA~~~Probable cyclic di-GMP phosphodiesterase PdeA~~~COG2199
MFVEHNLIKNIKIFTLAFTLTVVLIQLSRFISPLAIIHSSYIFLAWMPLCVMLSILFIFGWRGVVPVLCGMFCTNLWNFH
LSFLQTAVMLGSQTFVVLCACAILRWQLGTRWRYGLTSRYVWQRLFWLGLVTPIGIKCSMYLVGSFFDFPLKISTFFGDA
DAIFTVVDLLSLFTAVLIYNMLFYYLTRMIVSPHFAQILWRRDIAPSLGKEKRAFTLSWLAALSVLLLLLCTPYENDFIA
GYLVPVFFIIFTLGVGKLRYPFLNLTWAVSTLCLLNYNQNFLQGVETEYSLAFILAVLISFSVCLLYMVRIYHRSEWLNR
RWHLQALTDPLTLLPNFRALEQAPEQEAGKSFCCLRIDNLEFMSRHYGLMMRVHCIRSICRTLLPLMQENEKLYQLPGSE
LLLVLSGPETEGRLQHMVNILNSRQIHWNNTGLDMGYGAAWGRFDGNQETLQPLLGQLSWLAEQSCAHHHVLALDSREEM
VSGQTTKQVLLLNTIRTALDQGDLLLYAQPIRNKEGEGYDEILARLKYDGGIMTPDKFLPLIAQFNLSARFDLQVLESLL
KWLATHPCDKKGPRFSVNLMPLTLLQKNIAGRIIRLFKRYHISPQAVILEITEEQAFSNAESSMYNIEQLHKFGFRIAID
DFGTGYANYERLKRLQADIIKIDGVFVKDIVTNTLDAMIVRSITDLAKAKSLSVVAEFVETQQQQALLHKLGVQYLQGYL
IGRPQPLAD
>O50161 3.1.4.-~~~PdeB~~~Cyclic di-GMP phosphodiesterase PdeB~~~
MQNSESIIKNIKNSSYLIDKEFLVWPENAFIGDKNIELIEKWNLKSYIKERKNFFSDDSVKKEYEEIHKKFNEEAISSYH
VIISNLEEIYENCKRNKKIYYQDIMPTVKKVIEFYKKQKKIFIKYFRIPKLSANYHIIHSVNTAILTVALGNEMGLNNYK
TVELCSIALLHKIGFLFIPSKISEKKEALTEEELEIIKKYPIISYKIASTSNLSRSICLTLLTHKENLDGTGYPKGLTSE
NISIESNIIGAASAYSAIILDKAYKKSFNSGASIIELIKDADKKFDKRVLKLIINAISSCPLDFIVELNDNSIAKIVDID
ESNPNLPYINYIIKNGKVIDKNEQSSVQSIPNTNTGIKKILNQNEIELIKNKYSLIDII
>P77473 3.1.4.52~~~pdeB~~~Probable cyclic di-GMP phosphodiesterase PdeB~~~COG4943
MRTRHLVGLISGVLILSVLLPVGLSIWLAHQQVETSFIEELDTYSSRVAIRANKVATQGKDALQELERWQGAACSEAHLM
EMRRVSYSYRYIQEVAYIDNNVPQCSSLEHESPPDTFPEPGKISKDGYRVWLTSHNDLGIIRYMVAMGTAHYVVMIDPAS
FIDVIPYSSWQIDAAIIGNAHNVVITSSDEIAQGIITRLQKTPGEHIENNGIIYDILPLPEMNISIITWASTKMLQKGWH
RQVFIWLPLGLVIGLLAAMFVLRILRRIQSPHHRLQDAIENRDICVHYQPIVSLANGKIVGAEALARWPQTDGSWLSPDS
FIPLAQQTGLSEPLTLLIIRSVFEDMGDWLRQHPQQHISINLESPVLTSEKIPQLLRDMINHYQVNPRQIALELTEREFA
DPKTSAPIISRYREAGHEIYLDDFGTGYSSLSYLQDLDVDILKIDKSFVDALEYKNVTPHIIEMAKTLKLKMVAEGIETS
KQEEWLRQHGVHYGQGWLYSKALPKEDFLRWAEQHL
>Q8EJM6 3.1.4.52~~~pdeB~~~Cyclic di-GMP phosphodiesterase PdeB~~~COG5001
MRIGNKILVFIVGFCLPAVVLVSYCLGVWFDHRVELLRQDNVRHELANIQQQFRIDVDRLGFLTNIYASPLSHLDSEQLK
SLESSWLESSMSGNLSWFILRDGNLQNVFQNEQPIAEANRQEIAKAITTQAKPEFASAYLIGDKGYVVTAVASHLGEYVL
LVRQLTERDLLEYAQTSLVARVSMSNVVTAHHSSHSSSVALPSLISQQPIYLHVEFSDDPFRDVKLSLDWVSLAVILLGI
LIVALGYVWLRACLLQPFKSLMQQLALVDPMASVYRPVTSEGNEELSVLANRVNSLLARIYQQKERGKITLESIAEAVIL
TDIEAKVIYMNPKAETLLEVASSNAVGESLASLLKAGEQLNQAVFHCIRLGETMPQVAKIKLLTTMPRIIERSISNVLNH
EKEIVGTVVVLRDITQEELLKHQLQKRANFDGITGLLNRQAFEEQLPEFASQARSLAVCYLDLEQFKLINDSCGHTAGDR
MLAMVARAIQSCLGPQELLARIGGDEFGLVICDRTALAVAQLLKQIIAQVSLQVLHDKNCNYKVGLSIGVAFGRAPYINA
QELLKDADIACIAAKAKGTNQIHIYDDKDKELTYQRNAPKWAVRIAQAIEENELLLYYQPIRGLGASSKRQRMEVLLRIQ
EPCGRILAPAQFIAAAERFKLMPEIDKEVIRKAFLWLSLNSQLWQDHCISINLSGNSLGAEGMVEYIAKQQQIFDIPSQC
VCFEITETTAIQNRHRGMEMLRQLRKLGFSFALDDFGSGFASYGYLRELPVDYVKIDGCFVKNLAVNAKDYAIVKSIQDV
CRVMGIETVAEFVENQEIIDRLQTIGINYAQGYAIGRPQPLASYCEQFETRLAQRA
>P32701 3.1.4.52~~~pdeC~~~Probable cyclic di-GMP phosphodiesterase PdeC~~~COG4943
MSHRARHQLLALPGIIFLVLFPIILSLWIAFLWAKSEVNNQLRTFAQLALDKSELVIRQADLVSDAAERYQGQVCTPAHQ
KRMLNIIRGYLYINELIYARDNHFLCSSLIAPVNGYTIAPADYKREPNVSIYYYRDTPFFSGYKMTYMQRGNYVAVINPL
FWSEVMSDDPTLQWGVYDTVTKTFFSLSKEASAATFSPLIHLKDLTVQRNGYLYATVYSTKRPIAAIVATSYQRLITHFY
NHLIFALPAGILGSLVLLLLWLRIRQNYLSPKRKLQRALEKHQLCLYYQPIIDIKTEKCIGAEALLRWPGEQGQIMNPAE
FIPLAEKEGMIEQITDYVIDNVFRDLGDYLATHADRYVSINLSASDFHTSRLIARINQKTEQYAVRPQQIKFEVTEHAFL
DVDKMTPIILAFRQAGYEVAIDDFGIGYSNLHNLKSLNVDILKIDKSFVETLTTHKTSHLIAEHIIELAHSLGLKTIAEG
VETEEQVNWLRKRGVRYCQGWFFAKAMPPQVFMQWMEQLPARELTRGQ
>P76261 3.1.4.52~~~pdeD~~~Probable cyclic di-GMP phosphodiesterase PdeD~~~COG2200
MQKAQRIIKTYRRNRMIVCTICALVTLASTLSVRFISQRNLNQQRVVQFANHAVEELDKVLLPLQAGSEVLLPLIGLPCS
VAHLPLRKQAAKLQTVRSIGLVQDGTLYCSSIFGYRNVPVVDILAELPAPQPLLRLTIDRALIKGSPVLIQWTPAAGSSN
AGVMEMINIDLLTAMLLEPQLPQISSASLTVDKRHLLYGNGLVDSLPQPEDNENYQVSSQRFPFTINVNGPGATALAWHY
LPTQLPLAVLLSLLVGYIAWLATAYRMSFSREINLGLAQHEFELFCQPLLNARSQQCIGVEILLRWNNPRQGWISPDVFI
PIAEEHHLIVPLTRYVMAETIRQRHVFPMSSQFHVGINVAPSHFRRGVLIKDLNQYWFSAHPIQQLILEITERDALLDVD
YRIARELHRKNVKLAIDDFGTGNSSFSWLETLRPDVLKIDKSFTAAIGSDAVNSTVTDIIIALGQRLNIELVAEGVETQE
QAKYLRRHGVHILQGYLYAQPMPLRDFPKWLAGSQPPPARHNGHITPIMPLR
>P77172 3.1.4.52~~~pdeF~~~Cyclic di-GMP phosphodiesterase PdeF~~~COG2200
MKLNATYIKIRDKWWGLPLFLPSLILPIFAHINTFAHISSGEVFLFYLPLALMISMMMFFSWAALPGIALGIFVRKYAEL
GFYETLSLTANFIIIIILCWGGYRVFTPRRNNVSHGDTRLISQRIFWQIVFPATLFLILFQFAAFVGLLASRENLVGVMP
FNLGTLINYQALLVGNLIGVPLCYFIIRVVRNPFYLRSYYSQLKQQVDAKVTKKEFALWLLALGALLLLLCMPLNEKSTI
FSTNYTLSLLLPLMMWGAMRYGYKLISLLWAVVLMISIHSYQNYIPIYPGYTTQLTITSSSYLVFSFIVNYMAVLATRQR
AVVRRIQRLAYVDPVVHLPNVRALNRALRDAPWSALCYLRIPGMEMLVKNYGIMLRIQYKQKLSHWLSPLLEPGEDVYQL
SGNDLALRLNTESHQERITALDSHLKQFRFFWDGMPMQPQIGVSYCYVRSPVNHIYLLLGELNTVAELSIVTNAPENMQR
RGAMYLQRELKDKVAMMNRLQQALEHNHFFLMAQPITGMRGDVYHEILLRMKGENDELISPDSFLPVAHEFGLSSSIDMW
VIEHTLQFMAENRAKMPAHRFAINLSPTSVCQARFPVEVSQLLAKYQIEAWQLIFEVTESNALTNVKQAQITLQHLQELG
CQIAIDDFGTGYASYARLKNVNADLLKIDGSFIRNIVSNSLDYQIVASICHLARMKKMLVVAEYVENEEIREAVLSLGID
YMQGYLIGKPQPLIDTLNEIEPIRESA
>P75995 3.1.4.52~~~pdeG~~~Probable cyclic di-GMP phosphodiesterase PdeG~~~COG2200
MRNTLIPILVAICLFITGVAILNIQLWYSAKAEYLAGARYAANNINHILEEASQATQTAVNIAGKECNLEEQYQLGTEAA
LKPHLRTIIILKQGIVWCTSLPGNRVLLSRIPVFPDSNLLLAPAIDTVNRLPILLYQNQFADTRILVTISDQHIRGALNV
PLKGVRYVLRVADDIIGPTGDVMTLNGHYPYTEKVHSTKYHFTIIFNPPPLFSFYRLIDKGFGILIFILLIACAAAFLLD
RYFNKSATPEEILRRAINNGEIVPFYQPVVNGREGTLRGVEVLARWKQPHGGYISPAAFIPLAEKSGLIVPLTQSLMNQV
ARQMNAIASKLPEGFHIGINFSASHIISPTFVDECLNFRDSFTRRDLNLVLEVTEREPLNVDESLVQRLNILHENGFVIA
LDDFGTGYSGLSYLHDLHIDYIKIDHSFVGRVNADPESTRILDCVLDLARKLSISIVAEGVETKEQLDYLNQNYITFQQG
YYFYKPVTYIDLVKIILSKPKVKVVVE
>P37646 3.1.4.52~~~pdeH~~~Cyclic di-GMP phosphodiesterase PdeH~~~COG2200
MIRQVIQRISNPEASIESLQERRFWLQCERAYTWQPIYQTCGRLMAVELLTVVTHPLNPSQRLPPDRYFTEITVSHRMEV
VKEQIDLLAQKADFFIEHGLLASVNIDGPTLIALRQQPKILRQIERLPWLRFELVEHIRLPKDSTFASMCEFGPLWLDDF
GTGMANFSALSEVRYDYIKIARELFVMLRQSPEGRTLFSQLLHLMNRYCRGVIVEGVETPEEWRDVQNSPAFAAQGWFLS
RPAPIETLNTAVLAL
>P75800 3.1.4.52~~~pdeI~~~Probable cyclic di-GMP phosphodiesterase PdeI~~~COG2200
MLSLYEKIKIRLIILFLLAALSFIGLFFIINYQLVSERAVKRADSRFELIQKNVGYFFKDIERSALTLKDSLYLLKNTEE
IQRAVILKMEMMPFLDSVGLVLDDNKYYLFSRRANDKIVVYHQEQVNGPLVDESGRVIFADFNPSKRPWSVASDDSNNSW
NPAYNCFDRPGKKCISFTLHINGKDHDLLAVDKIHVDLNWRYLNEYLDQISANDEVLFLKQGHEIIAKNQLAREKLIIYN
SEGNYNIIDSVDTEYIEKTSAVPNNALFEIYFYYPGGNLLNASDKLFYLPFAFIIIVLLVVYLMTTRVFRRQFSEMTELV
NTLAFLPDSTDQIEALKIREGDAKEIISIKNSIAEMKDAEIERSNKLLSLISYDQESGFIKNMAIIESNNNQYLAVGIIK
LCGLEAVEAVFGVDERNKIVRKLCQRIAEKYAQCCDIVTFNADLYLLLCRENVQTFTRKIAMVNDFDSSFGYRNLRIHKS
AICEPLQGENAWSYAEKLKLAISSIRDHMFSEFIFCDDAKLNEIEENIWIARNIRHAMEIGELFLVYQPIVDINTRAILG
AEALCRWVSAERGIISPLKFITIAEDIGFINELGYQIIKTAMGEFRHFSQRASLKDDFLLHINVSPWQLNEPHFHERFTT
IMKENGLKANSLCVEITETVIERINEHFYLNIEQLRKQGVRISIDDFGTGLSNLKRFYEINPDSIKVDSQFTGDIFGTAG
KIVRIIFDLARYNRIPVIAEGVESEDVARELIKLGCVQAQGYLYQKPMPFSAWDKSGKLVKE
>P37649 3.1.4.52~~~pdeK~~~Probable cyclic di-GMP phosphodiesterase PdeK~~~COG2199
MRVSRSLTIKQMAMVAAVVLVFVFIFCTVLLFHLVQQNRYNTATQLESIARSVREPLSSAILKGDIPEAEAILASIKPAG
VVSRADVVLPNQFQALRKSFIPERPVPVMVTRLFELPVQISLGVYSLERPANPQPIAYLVLQADSFRMYKFVMSTLSTLV
TIYLLLSLILTVAISWCINRLILHPLRNIARELNAIPAKELVGHQLALPRLHQDDEIGMLVRSYNLNQQLLQRHYEEQNE
NAMRFPVSDLPNKALLMEMLEQVVARKQTTALMIITCETLRDTAGVLKEAQREILLLTLVEKLKSVLSPRMILAQISGYD
FAVIANGVQEPWHAITLGQQVLTIMSERLPIERIQLRPHCSIGVAMFYGDLTAEQLYSRAISAAFTARHKGKNQIQFFDP
QQMEAAQKRLTEESDILNALENHQFAIWLQPQVEMTSGKLVSAEVLLRIQQPDGSWDLPDGLIDRIECCGLMVTVGHWVL
EESCRLLAAWQERGIMLPLSVNLSALQLMHPNMVADMLELLTRYRIQPGTLILEVTESRRIDDPHAAVAILRPLRNAGVR
VALDDFGMGYAGLRQLQHMKSLPIDVLKIDKMFVEGLPGDSSMIAAIIMLAQSLNLQMIAEGVETEAQRDWLAKAGVGIA
QGFLFARPLPIEIFEESYLEEK
>P21514 3.1.4.52~~~pdeL~~~Cyclic di-GMP phosphodiesterase PdeL~~~COG2200
MNSCDFRVFLQEFGTTVHLSLPGSVSEKERLLLKLLMQGMSVTEISQYRNRSAKTISHQKKQLFEKLGIQSDITFWRDIF
FQYNPEIISATGSNSHRYINDNHYHHIVTPEAISLALENHEFKPWIQPVFCAQTGVLTGCEVLVRWEHPQTGIIPPDQFI
PLAESSGLIVIMTRQLMKQTADILMPVKHLLPDNFHIGINVSAGCFLAAGFEKECLNLVNKLGNDKIKLVLELTERNPIP
VTPEARAIFDSLHQHNITFALDDFGTGYATYRYLQAFPVDFIKIDKSFVQMASVDEISGHIVDNIVELARKPGLSIVAEG
VETQEQADLMIGKGVHFLQGYLYSPPVPGNKFISEWVMKAGG
>P76446 3.1.4.52~~~pdeN~~~Probable cyclic di-GMP phosphodiesterase PdeN~~~COG2200
MFIRAPNFGRKLLLTCIVAGVMIAILVSCLQFLVAWHKHEVKYDTLITDVQKYLDTYFADLKSTTDRLQPLTLDTCQQAN
PELTARAAFSMNVRTFVLVKDKKTFCSSATGEMDIPLNELIPALDINKNVDMAILPGTPMVPNKPAIVIWYRNPLLKNSG
VFAALNLNLTPSLFYSSRQEDYDGVALIIGNTALSTFSSRLMNVNELTDMPVRETKIAGIPLTVRLYADDWTWNDVWYAF
LLGGMSGTVVGLLCYYLMSVRMRPGREIMTAIKREQFYVAYQPVVDTQALRVTGLEVLLRWRHPVAGEIPPDAFINFAES
QKMIVPLTQHLFELIARDAAELEKVLPVGVKFGINIAPDHLHSESFKADIQKLLTSLPAHHFQIVLEITERDMLKEQEAT
QLFAWLHSVGVEIAIDDFGTGHSALIYLERFTLDYLKIDRGFINAIGTETITSPVLDAVLTLAKRLNMLTVAEGVETPEQ
ARWLSERGVNFMQGYWISRPLPLDDFVRWLKKPYTPQW
>P77334 3.1.4.52~~~pdeR~~~Cyclic di-GMP phosphodiesterase PdeR~~~COG5001
MKTVRESTTLYNFLGSHNPYWRLTESSDVLRFSTTETTEPDRTLQLSAEQAARIREMTVITSSLMMSLTVDESDLSVHLV
GRKINKREWAGNASAWHDTPAVARDLSHGLSFAEQVVSEAHSAIVILDSRGNIQRFNRLCEDYTGLKEHDVIGQSVFKLF
MSRREAAASRRNNRVFFRSGNAYEVELWIPTCKGQRLFLFRNKFVHSGSGKNEIFLICSGTDITEERRAQERLRILANTD
SITGLPNRNAMQDLIDHAINHADNNKVGVVYLDLDNFKKVNDAYGHLFGDQLLRDVSLAILSCLEHDQVLARPGGDEFLV
LASNTSQSALEAMASRILTRLRLPFRIGLIEVYTSCSVGIALSPEHGSDSTAIIRHADTAMYTAKEGGRGQFCVFTPEMN
QRVFEYLWLDTNLRKALENDQLVIHYQPKITWRGEVRSLEALVRWQSPERGLIPPLDFISYAEESGLIVPLGRWVILDVV
RQVAKWRDKGINLRVAVNISARQLADQTIFTALKQVLQELNFEYCPIDVELTESCLIENDELALSVIQQFSQLGAQVHLD
DFGTGYSSLSQLARFPIDAIKLDQVFVRDIHKQPVSQSLVRAIVAVAQALNLQVIAEGVESAKEDAFLTKNGINERQGFL
FAKPMPAVAFERWYKRYLKRA
>P0ACL9 ~~~pdhR~~~Pyruvate dehydrogenase complex repressor~~~COG2186
MAYSKIRQPKLSDVIEQQLEFLILEGTLRPGEKLPPERELAKQFDVSRPSLREAIQRLEAKGLLLRRQGGGTFVQSSLWQ
SFSDPLVELLSDHPESQYDLLETRHALEGIAAYYAALRSTDEDKERIRELHHAIELAQQSGDLDAESNAVLQYQIAVTEA
AHNVVLLHLLRCMEPMLAQNVRQNFELLYSRREMLPLVSSHRTRIFEAIMAGKPEEAREASHRHLAFIEEILLDRSREES
RRERSLRRLEQRKN
>Q57BR6 2.7.13.3~~~pdhS~~~Cell-division control histidine kinase PdhS~~~
MSGSYPFIDIAALDSVREGFARGDAQLVLAHDLSTVLWVNGPGAKLFGYNRVEDLIEGQLDLPVATRRQIAAFSSENTSA
PSAVAVRLGGGLRSELTHLHVSNIKLPDGVAALLVATQMPDNSAEAAISGLGDDSTHIALVDAVGKVVAASPRFALLDIS
ASTLEDLIVEAGDATDRIVKRRIRTGSHSVPGAIARLTDTPALHLLCIVGDAPAQFQTAAEAVPLPDNAEAVLEEILPEQ
GDAPAQQAQKTHAEQPRPKTFAFDHDAPPARFIWKVGPDGTFSEISPNLAAVVGPNSADIVGRRFSDVANVFGFDTDGSI
AALLLERDTWSGKRLLWPVEGTRLRVPVELAALPVYSRDREFLGFRGFGIVRPAEAEADPEEIGLALAGGIPQNRKPRKE
PAETARMVGEDDVLALSEEVANDDQPAAVLPKPPLDITPTPGRRDSDKVISLLNSCAQEKVAADQAKFLKEKERATRPEG
GLTKTERNAFREIAERLRKQGLANTRAESETPVSETSSIEPVEPTPPVKTRSEPIQPDETALLANLPVPVIIHSGDAIHY
VNQALLDITGYESLDDIRSAGGVDVLFNSESDDGETRQSMLLRHADGSEEPVDAHLNAIAWRGGRALMLSLMPVTAADLP
APAELPAANDEEKQALEAHVEELKTILDTATDGVVLIDPEGRIRSMNHSASALFGYERDEAEGKFFSMLFAIESQRAAMD
YLHGLSGNGVLSVLNDGREVIGREAKGGFIPLFMTIGKLPHTRGFCAVLRDITQWKRTEEELTNARKEAERASNQKTEFL
ARISHEIRTPLNAIIGFSELMADEKFGPIGNDRYRDYLRDINRSGNHVLALVNDLLDISKIEAGALDMQFEAVSLNDAIG
EAIALMQPQANRERVIIRSSFQSNLPDIVADSRSIKQVALNLLSNAVRFTAPGGQVIVSTSYELNGDVVMRVRDTGIGMS
KSEVEQALKPFRQINALERRKAESAKDWRNEGTGLGLPLTKAMVEANRAQFAIDSNPGQGTVVEIVFPPTRVLAD
>Q988B9 3.1.1.27~~~~~~4-pyridoxolactonase~~~COG0491
MSDTKVYLLDGGSLVLDGYHVFWNRGPGGEVRFPVYSILIEHAEGRFLIDTGYDYDHVMKVLPFEKPIQEKHQTIPGALG
LLGLEPRDIDVVVNSHFHFDHCGGNKYFPHAKKICHRSEVPQACNPQPFEHLGYSDLSFSAEAAEARGATAQLLEGTTRA
NSTFEGIDGDVDLARGVKLISTPGHSIGHYSLLVEFPRRKPILFTIDAAYTQKSLETLCQAAFHIDPVAGVNSMRKVKKL
AEDHGAELMYSHDMDNFKTYRTGTQFYG
>P39142 2.4.2.2~~~pdp~~~Pyrimidine-nucleoside phosphorylase~~~COG0213
MRMVDIIIKKQNGKELTTEEIQFFVNGYTDGSIPDYQASALAMAIFFQDMSDRERADLTMAMVNSGETIDLSAIEGIKVD
KHSTGGVGDTTTLVLAPLVAALDVPVAKMSGRGLGHTGGTIDKLEAIMGFHVELTKDEFIKLVNRDKVAVIGQSGNLTPA
DKKLYALRDVTGTVNSIPLIASSIMSKKIAAGADAIVLDVKTGAGAFMKTEEDAAELAKAMVRIGNNVGRQTMAVISDMS
QPLGFAIGNALEVKEAIDTLKGEGPEDLHELVLTLGSQMVVLAKKADTLDEARAKLEEVMKNGKALEKFKDFLKNQGGDS
SIVDDPSKLPQAAYQIDVPAKEAGVVSEIVADEIGVAAMLLGAGRATKEDEIDLAVGIMLRKKVGDKVEKGEPLVTLYAN
RENVDEVIAKVYDNIRIAAEAKAPKLIHTLITE
>P77836 2.4.2.2~~~pdp~~~Pyrimidine-nucleoside phosphorylase~~~
MRMVDLIEKKRDGHALTKEEIQFIIEGYTKGDIPDYQMSALAMAIFFRGMNEEETAELTMAMVHSGDTIDLSRIEGIKVD
KHSTGGVGDTTTLVLGPLVASVGVPVAKMSGRGLGHTGGTIDKLESVPGFHVEITNDEFIDLVNKNKIAVVGQSGNLTPA
DKKLYALRDVTATVNSIPLIASSIMSKKIAAGADAIVLDVKTGVGAFMKDLNDAKALAKAMVDIGNRVGRKTMAIISDMS
QPLGYAIGNALEVKEAIDTLKGEGPEDFQELCLVLGSHMVYLAEKASSLEEARHMLEKAMKDGSALQTFKTFLAAQGGDA
SVVDDPSKLPQAKYIIELEAKEDGYVSEIVADAVGTAAMWLGAGRATKESTIDLAVGLVLRKKVGDAVKKGESLVTIYSN
REQVDDVKQKLYENIRISATPVQAPTLIYDKIS
>Q5HE64 2.4.2.2~~~pdp~~~Pyrimidine-nucleoside phosphorylase~~~
MRMIDIIEKKRDGHTLTTEEINFFIGGYVKGDIPDYQASSLAMAIYFQDMNDDERAALTMAMVNSGDMIDLSDIKGVKVD
KHSTGGVGDTTTLVLAPLVAAVDVPVAKMSGRGLGHTGGTIDKLEAIDGFHVEIDEATFVKLVNENKVAVVGQSGNLTPA
DKKLYALRDVTGTVNSIPLIASSIMSKKIAAGADAIVLDVKTGSGAFMKTLEDAEALAHAMVRIGNNVGRNTMAIISDMN
QPLGRAIGNALELQEAIDTLKGQGPKDLTELVLTLGSQMVVLANKAETLEEARALLIEAINSGAALEKFKTFIKNQGGDE
TVIDHPERLPQAQYQIEYKAKKSGYVTELVSNDIGVASMMLGAGRLTKEDDIDLAVGIVLNKKIGDKVEEGESLLTIHSN
RQDVDDVVKKLDSSITIADHVVSPTLIHKIITE
>Q7A4D0 2.4.2.2~~~pdp~~~Pyrimidine-nucleoside phosphorylase~~~
MRMIDIIEKKRDGHTLTTEEINFFIDGYVKGDIPDYQASSLAMAIYFQDMNDDERAALTMAMVNSGDMIDLSDIKGVKVD
KHSTGGVGDTTTLVLAPLVAAVDVPVAKMSGRGLGHTGGTIDKLEAIDGFHVEIDEATFVKLVNENKVAVVGQSGNLTPA
DKKLYALRDVTGTVNSIPLIASSIMSKKIAAGADAIVLDVKTGSGAFMKTLEDAEALAHAMVRIGNNVGRNTMAIISDMN
QPLGRAIGNALELQEAIDTLKGQGPKDLTELVLTLGSQMVVLANKAETLEEARALLIEAINSGAALEKFKTFIKNQGGDE
TVIDHPERLPQAQYQIEYKAKKSGYVTELVSNDIGVASMMLGAGRLTKEDDIDLAVGIVLNKKIGDKVEEGESLLTIHSN
RQDVDDVVKKLDSSITIADHVVSPTLIHKIITE
>P54470 2.7.11.32~~~yqfL~~~Putative pyruvate, phosphate dikinase regulatory protein~~~COG1806
MNNRIIYVVSDSVGETAELVVKAALSQFNGSADDTHVRRIPYVEDIGTINEVISLAKADGGIICFTLVVPEIREYLIAEA
EKANVLYYDIIGPLIDKMETAYGLTAKYEPGRVRQLDEDYFKKVEAIEFAVKYDDGRDPRGILKADIVLIGVSRTSKTPL
SQYLAHKRLKVANVPIVPEVDPPEELFNVDPKKCIGLKISPDKLNHIRKERLKSLGLNDKAIYANINRIKEELEYFEKIV
DRIGCQVVDVSNKAVEETANIIHHLKTKNI
>P33164 1.-.-.-~~~ophA1~~~Phthalate dioxygenase reductase~~~
MTTPQEDGFLRLKIASKEKIARDIWSFELTDPQGAPLPPFEAGANLTVAVPNGSRRTYSLCNDSQERNRYVIAVKRDSNG
RGGSISFIDDTSEGDAVEVSLPRNEFPLDKRAKSFILVAGGIGITPMLSMARQLRAEGLRSFRLYYLTRDPEGTAFFDEL
TSDEWRSDVKIHHDHGDPTKAFDFWSVFEKSKPAQHVYCCGPQALMDTVRDMTGHWPSGTVHFESFGATNTNARENTPFT
VRLSRSGTSFEIPANRSILEVLRDANVRVPSSCESGTCGSCKTALCSGEADHRDMVLRDDEKGTQIMVCVSRAKSAELVL
DL
>P26294 1.3.5.5~~~pds~~~15-cis-phytoene desaturase~~~COG0493
MRVAIAGAGLAGLSCAKYLADAGHTPIVYERRDVLGGKVAAWKDEDGDWYETGLHIFFGAYPNMLQLFKELNIEDRLQWK
SHSMIFNQPTKPGTYSRFDFPDIPAPINGVAAILSNNDMLTWEEKIKFGLGLLPAMIRGQSYVEEMDQYSWTEWLRKQNI
PERVNDEVFIAMAKALNFIDPDEISATVVLTALNRFLQEKKGSMMAFLDGAPPERLCQPIVEHVQARGGDVLLNAPLKEF
VLNDDSSVQAFRIAGIKGQEEQLIEADAYVSALPVDPLKLLLPDAWKAMPYFQQLDGLQGVPVINIHLWFDRKLTDIDHL
LFSRSPLLSVYADMSNTCREYEDPDRSMLELVFAPAKDWIGRSDEDILAATMAEIEKLFPQHFSGENPARLRKYKIVKTP
LSVYKATPGRQQYRPDQASPIANFFLTGDYTMQRYLASMEGAVLSGKLTAQAIIARQDELQRRSSGRPLAASQA
>P9WGM3 ~~~pdtaR~~~Transcriptional regulatory protein PdtaR~~~COG3707
MTGPTTDADAAVPRRVLIAEDEALIRMDLAEMLREEGYEIVGEAGDGQEAVELAELHKPDLVIMDVKMPRRDGIDAASEI
ASKRIAPIVVLTAFSQRDLVERARDAGAMAYLVKPFSISDLIPAIELAVSRFREITALEGEVATLSERLETRKLVERAKG
LLQTKHGMTEPDAFKWIQRAAMDRRTTMKRVAEVVLETLGTPKDT
>P9WGL5 2.7.13.3~~~pdtaS~~~Sensor histidine kinase PdtaS~~~COG3920
MSTLGDLLAEHTVLPGSAVDHLHAVVGEWQLLADLSFADYLMWVRRDDGVLVCVAQCRPNTGPTVVHTDAVGTVVAANSM
PLVAATFSGGVPGREGAVGQQNSCQHDGHSVEVSPVRFGDQVVAVLTRHQPELAARRRSGHLETAYRLCATDLLRMLAEG
TFPDAGDVAMSRSSPRAGDGFIRLDVDGVVSYASPNALSAYHRMGLTTELEGVNLIDATRPLISDPFEAHEVDEHVQDLL
AGDGKGMRMEVDAGGATVLLRTLPLVVAGRNVGAAILIRDVTEVKRRDRALISKDATIREIHHRVKNNLQTVAALLRLQA
RRTSNAEGREALIESVRRVSSIALVHDALSMSVDEQVNLDEVIDRILPIMNDVASVDRPIRINRVGDLGVLDSDRATALI
MVITELVQNAIEHAFDPAAAEGSVTIRAERSARWLDVVVHDDGLGLPQGFSLEKSDSLGLQIVRTLVSAELDGSLGMRDA
RERGTDVVLRVPVGRRGRLML
>A1AY86 3.2.2.6~~~~~~NAD(+) hydrolase PdTIR~~~COG4916
MSANDRAIETLRREIAKLQTDGAAIARKDAGIRAKLASAMAAQAKAKTAPALRLKQAEASRLEKELMATSKSQADIATKI
AKKQSSLSAKLVVQANEAKKADAKAKKNQERVSKTQEEATRKLEAGYRKLTLENQSLEQRLQRELSAMKPTAGPTTNADL
TSAPPHDIFISHAWEDKADFVEALAHTLRAAGAEVWYDDFSLRPGDSLRRSIDKGLGSSRFGIVVLSTHFFKKEWPQKEL
DGLFQLESSGRSRILPIWHKVSKDEVASFSPTMADKLAFNTSTKSVDEIVADLMAIIRD
>P0DUM6 ~~~pduA~~~Bacterial microcompartment shell protein PduA~~~
MQQEALGMVETKGLTAAIEAADAMVKSANVMLVGYEKIGSGLVTVIVRGDVGAVKAATDAGAAAARNVGEVKAVHVIPRP
HTDVEKILPKGIS
>P0A1C7 ~~~pduA~~~Bacterial microcompartment shell protein PduA~~~
MQQEALGMVETKGLTAAIEAADAMVKSANVMLVGYEKIGSGLVTVIVRGDVGAVKAATDAGAAAARNVGEVKAVHVIPRP
HTDVEKILPKGISQ
>B1VB63 ~~~pduB~~~Bacterial microcompartment shell protein PduB~~~
MSSNELVDQIMAQVIARVATPEQQAIPENNPPTRETAMAEKSCSLTEFVGTAIGDTVGLVIANVDSALLDAMKLEKRYRS
IGILGARTGAGPHIMAADEAVKATNTEVVSIELPRDTKGGAGHGSLIILGGNDVSDVKRGIEVALKELDRTFGDVYANEA
GHIEMQYTARASYALEKAFGAPIGRACGVIVGAPASVGVLMADTALKSANVEVVAYSSPAHGTSFSNEAILVISGDSGAV
RQAVISAREIGKTVLGTLGSEPKNDRPSYI
>A5VMB3 ~~~pduB~~~Bacterial microcompartment shell protein PduB~~~COG4816
MNDFLNSTSTVPEFVGASEIGDTIGMVIPRVDQQLLDKLHVTKQYKTLGILSDRTGAGPQIMAMDEGIKATNMECIDVEW
PRDTKGGGGHGCLIIIGGDDPADARQAIRVALDNLHRTFGDVYNAKAGHLELQFTARAAGAAHLGLGAVEGKAFGLICGC
PSGIGVVMGDKALKTAGVEPLNFTSPSHGTSFSNEGCLTITGDSGAVRQAVMAGREVGLKLLSQFGEEPVNDFPSYIK
>P37449 ~~~pduB~~~Bacterial microcompartment shell protein PduB~~~
MSSNELVEQIMAQVIARVATPEQQAIPGQPQPIRETAMAEKSCSLTEFVGTAIGDTLGLVIANVDTALLDAMKLEKRYRS
IGILGARTGAGPHIMAADEAVKATNTEVVSIELPRDTKGGAGHGSLIILGGNDVSDVKRGIEVALKELDRTFGDVYGNEA
GHIELQYTARASYALEKAFGAPIGRACGIIVGAPASVGVLMADTALKSANVEVVAYSSPAHGTSFSNEAILVISGDSGAV
RQAVTSAREIGKTVLATLGSEPKNDRPSYI
>P0DUM7 4.2.1.28~~~pduC~~~Propanediol dehydratase large subunit~~~
MRSKRFEALAKRPVNQDGFVKEWIEEGFIAMESPNDPKPSIKIVNGTVTELDGKSASEFDLIDHFIARYGINLARAEEVM
AMDSVKLANMLCDPNVKRKDIVPLTTAMTPAKIVEVVSHMNVVEMMMAMQKMRARRTPSQQAHVTNVKDNPVQIAADAAE
GAWRGFDEQETTVAVARYAPFNAIALLVGSQVGRPGVLTQCSLEEATELKLGMLGHTCYAETISVYGTEPVFTDGDDTPW
SKGFLASSYASRGLKMRFTSGSGSEVQMGYAEGKSMLYLEARCIYITKAAGVQGLQNGSVSCIGVPSAVPSGIRAVLAEN
LICSSLDLECASSNDQTFTHSDMRRTARLLMQFLPGTDFISSGYSAVPNYDNMFAGSNEDAEDFDDYNVLQRDLKVDGGL
RPVREEDVIAIRNKAARALQAVFAGMGLPPITDEEVEAATYAHGSKDMPERNIVEDIKFAQEIINKNRNGLEVVKALAQG
GFTDVAQDMLNIQKAKLTGDYLHTSAIIVGDGQVLSAVNDVNDYAGPATGYRLQGERWEEIKNIPGALDPNELG
>P37450 4.2.1.28~~~pduC~~~Propanediol dehydratase large subunit~~~
MRSKRFEALAKRPVNQDGFVKEWIEEGFIAMESPNDPKPSIKIVNGAVTELDGKPVSEFDLIDHFIARYGINLNRAEEVM
AMDSVKLANMLCDPNVKRSEIVPLTTAMTPAKIVEVVSHMNVVEMMMAMQKMRARRTPSQQAHVTNVKDNPVQIAADAAE
GAWRGFDEQETTVAVARYAPFNAIALLVGSQVGRPGVLTQCSLEEATELKLGMLGHTCYAETISVYGTEPVFTDGDDTPW
SKGFLASSYASRGLKMRFTSGSGSEVQMGYAEGKSMLYLEARCIYITKAAGVQGLQNGSVSCIGVPSAVPSGIRAVLAEN
LICSSLDLECASSNDQTFTHSDMRRTARLLMQFLPGTDFISSGYSAVPNYDNMFAGSNEDAEDFDDYNVIQRDLKVDGGL
RPVREEDVIAIRNKAARALQAVFAGMGLPPITDEEVEAATYAHGSKDMPERNIVEDIKFAQEIINKNRNGLEVVKALAQG
GFTDVAQDMLNIQKAKLTGDYLHTSAIIVGDGQVLSAVNDVNDYAGPATGYRLQGERWEEIKNIPGALDPNEID
>P0DUM8 4.2.1.28~~~pduD~~~Propanediol dehydratase medium subunit~~~
MEINEKLLRQIIEDVLSEMQTSDKPVSFRASTAASAPQAAAAQGDSFLTEIGEAKQGQQQDEVIIAVGPAFGLSQTVNIV
GIPHKNILREVIAGIEEEGIKARVIRCFKSSDVAFVAVEGNRLSGSGISIGIQSKGTTVIHQQGLPPLSNLELFPQAPLL
TLETYRQIGKNAARYAKRESPQPVPTLNDQMARPKYQAKSAILHIKETKYVVTGKNPQELRVAL
>O31041 4.2.1.28~~~pduD~~~Propanediol dehydratase medium subunit~~~
MEINEKLLRQIIEDVLRDMKGSDKPVSFNAPAASTAPQTAAPAGDGFLTEVGEARQGTQQDEVIIAVGPAFGLAQTVNIV
GLPHKSILREVIAGIEEEGIKARVIRCFKSSDVAFVAVEGNRLSGSGISIGIQSKGTTVIHQQGLPPLSNLELFPQAPLL
TLETYRQIGKNAARYAKRESPQPVPTLNDQMARPKYQAKSAILHIKETKYVVTGKNPQELRVAL
>P0DUM9 4.2.1.28~~~pduE~~~Propanediol dehydratase small subunit~~~
MNTDAIESMVRDVLSRMNSLQGESATPVAASSSAHTAKVTDYPLANKHPEWVKTATNKTLDDFTLENVLSNKVTAQDMRI
TPETLRLQAEIAKDAGRDRLAMNFERAAELTAVPDDRILEIYNALRPYRSTKDELMAIADDLENRYQAKICAAFVREAAA
LYVERKKLKGDD
>O31042 4.2.1.28~~~pduE~~~Propanediol dehydratase small subunit~~~
MNTDAIESMVRDVLSRMNSLQGDAPAAAPAAGGTSRSAKVSDYPLANKHPEWVKTATNKTLDDFTLENVLSNKVTAQDMR
ITPETLRLQASIAKDAGRDRLAMNFERAAELTAVPDDRILEIYNALRPYRSTKEELLAIADDLENRYQAKICAAFVREAA
GLYVERKKLKGDD
>P37451 ~~~pduF~~~Propanediol uptake facilitator PduF~~~
MNDSLKAQCGAEFLGTGLFLFFGIGCLSALKVAGASLGLWEICIIWGLGISLAVYLTAGISGGHLNPAVTIALWLFACFP
KQKVLPYIIAQFAGAFGGALLAYVLYSSLFTEFETAHHMVRGSVESLQLASIFSTYPAAALNVWQAALVEVVITSILMGM
IMALTDDGNGIPKGPLAPLLIGILVAVIGASTGPLTGFAMNPARDFGPKLFTWLAGWGNMAMSGGREIPYFIVPIVAPVI
GACAGAAIYRYFIGKNLPCNRCEL
>O31043 ~~~pduG~~~Propanediol dehydratase-reactivating factor large subunit~~~
MRYIAGIDIGNSSTEVALARQDETGALTITHSALAETTGIKGTLRNVFGIQEALALVAKRAGINVRDISLIRINEATPVI
GDVAMETITETIITESTMIGHNPKTPGGAGLGVGITITPEELLTRPADSSYILVVSSAFDFADIANVINASMRAGYQITG
VILQRDDGVLVSNRLEKSLPIVDEVLYIDRIPLGMLAAIEVAVPGKVIETLSNPYGIATVFNLNADETKNIVPMARALIG
NRSAVVVKTPSGDVKARAIPAGNLELQAQGRTVRVDVAAGAEAIMKAVDGCGKLDNVTGEAGTNIGGMLEHVRQTMAELT
NKPSSEIFIQDLLAVDTSVPVSVTGGLAGEFSLEQAVGIASMVKSDRLQMAMIAREIEQKLNIDVQIGGAEAEAAILGAL
TTPGTTRPLAILDLGAGSTDASIINPKGEIIATHLAGAGDMVTMIIARELGLEDRYLAEEIKKYPLAKVESLFHLRHEDG
SVQFFPTPLPPAVFARVCVVKPDELVPLPGDLALEKVRAIRRSAKERVFVTNALRALRQVSPTGNIRDIPFVVLVGGSSL
DFEVPQLVTDALAHYRLVAGRGNIRGSEGPRNAVATGLILSWHKEFAHGQ
>Q8ZNR6 ~~~pduH~~~Propanediol dehydratase-reactivating factor small subunit~~~
MDSNHSAPAIVITVINDCASLWHEVLLGIEEEGIPFLLQHHPAGDIVDSAWQAARSSPLLVGIACDRHSLVVHYKNLPAS
APLFTLMHHQDSQAQRNTGNNAARLVKGIPFRDLHA
>P0DUV5 ~~~pduJ~~~Bacterial microcompartment shell protein PduJ~~~
MNNALGLVETKGLVGAIEAADAMVKSANVQLVGYEKIGSGLITVMVRGDVGAVKAAVDAGSAAASAVGEVKSCHVIPRPH
SDVEAILPKSA
>H9L478 ~~~pduJ~~~Bacterial microcompartment shell protein PduJ~~~
MNNALGLVETKGLVGAIEAADAMVKSANVQLVGYEKIGSGLVTVMVRGDVGAVKAAVDAGSAAASVVGEVKSCHVIPRPH
SDVEAILPKSA
>B1VB70 ~~~pduK~~~Bacterial microcompartment shell protein PduK~~~
MKQSLGLLEVSGLALAISCADVMAKAASITLVGLEKTNGSGWMVIKIIGDVASVQAAISTGVSFADQRDGLVAHKVISRP
GDGILSHSVTPESESEPAPAPTPVVPHEEIPEDHAAPEAPQDAELISCNLCLDPACPRQKGEPRSLCLHSGKRGEA
>Q9XDN6 ~~~pduK~~~Bacterial microcompartment shell protein PduK~~~
MANKEHRVKQSLGLLEVCGLALAISCADIMAKSASITLLALEKTNGSGWMVIKITGDVASVQAAITTGAHFAEQWNGLVA
HKVIARPGEGILLAETPSPSVIEPEPEASEIADVVSEAPAEEAPQESELVSCNLCLDPKCPRQKGEPRTLCIHSGKRGEA
>D5SXM0 2.3.1.222~~~pduL~~~Phosphate propanoyltransferase~~~COG4869
MSSDSVVAGNISRADVERVVRAVLTRQLAGATTSSGSVSSAGGPPNPLVVNISARHVHLTEEHVEVLFGKGVKLEPMKWL
YQDGYYAAKQTVTIFGPRRRMIPDVRVLGPCRNASQVELAFTDGISLGIDLPVRISGDHHDTVGCVLVGPAGVVELKSGV
IRAMRHVHMSPADCAYYGVKNGDEMDLKIHSGPCTTTLEHVTVREDKDVKLEVHIDTDEGNAVDLSHATKVELVKPVGCG
CHSK
>Q21A54 2.3.1.222~~~pduL~~~Phosphate propanoyltransferase~~~COG4869
MDMQQETIERIIRQVLGQVGPAGGSIATLSGDAGVDPFQVAVGVSNRHIHLSRTDMDTLFGPGAELQRKKAMKQPGQFAA
EETVTLKGPKGSLSKVRVLGPLRRETQVEVSVADGFALGITPPLRQSGQLDDTPGLTIIGPQGSVTKDHGVIVAQRHIHM
HPSTAAKLGLRNGDEVDVEAGGERGGVMHRVLIRVAEASADEMHIDVEEANALCLKNDDVVRICKK
>Q9XDN5 2.3.1.222~~~pduL~~~Phosphate propanoyltransferase~~~
MDKELLQSTVRKVLDEMRQRPIPLGVSNRHIHLSAQDYERLFPGHPISEKKALLQPGQYAAEQTVTLVGPKGQLKNVRLL
GPLRSVSQVEISRTDARTLGIAAPLRMSGNLKGTPGIRLVSPFAELELPSGVIVAQRHIHMSPLDALILRVSHGDMVSVA
IEGDDRGLIFNNVAIRVSPDMRLEMHIDTDEANAAGADNPHAFARLVGPR
>B1VB72 ~~~pduM~~~Bacterial microcompartment assembly protein PduM~~~
MNNELLQRIIEEVVSRLKKRAESTLSLSVAQLREIEPRTLCCQYSSLHLLQADLPLLEQIAEGCADNMSVVTIHEALACG
VRVKISLQHRLLPAIPVRKLARLPLEFSDELGRIIVLHPDKLLSYADVAQLKGGVLVLRRRCVVTALAQDAVGTRNVQLI
KQE
>Q9XDN4 ~~~pduM~~~Bacterial microcompartment assembly protein PduM~~~
MNGETLQRIVEEIVSRLHRRAQSTATLSVTQLRDADCPALFCQHASLRILLIDLPLLGQLADAETGDAAARKIHDALAFG
IRVQLSLHSQLLPVIPVKKLARLPLVFTDEHGLPLVLHAGSVLSYRDVALLSRGRVVVHRKCIVTAMARDAANARNIQLI
KQE
>P0DUV6 ~~~pduN~~~Bacterial microcompartment shell vertex protein PduN~~~
MHLARVTGVVVSTQKSPSLVGKKLLLVRRVSADGELPASPVSGDEVAVDSVGAGTGELVLLSSGSSARHVFSGPNEAIDL
AIVGIVDTLSR
>Q9XDN3 ~~~pduN~~~Bacterial microcompartment shell vertex protein PduN~~~
MHLARVTGAVVSTQKSPSLIGKKLLLVRRVSADGELPASPTSGDEVAVDSVGAGVGELVLLSGGSSARHVFSGPNEAIDL
AVVGIVDTLSC
>O34899 2.5.1.17~~~yvqK~~~Corrinoid adenosyltransferase~~~COG2096
MKLYTKTGDKGQTGLVGGRTDKDSLRVESYGTIDELNSFIGLALAELSGQPGFEDLTAELLTIQHELFDCGGDLAIVTER
KDYKLTEESVSFLETRIDAYTAEAPELKKFILPGGSKCASLLHIARTITRRAERRVVALMKSEEIHETVLRYLNRLSDYF
FAAARVVNARSGIGDVEYERSAIVFRDRNSSES
>B1VB74 ~~~pduO~~~Corrinoid adenosyltransferase PduO~~~
MAIYTRTGDAGTTALFTGQRVSKTHPRVEAYGTLDELNAALSLCVCAAKNPQHRQLLENIQLQLFWFSAELASESEQPAP
EQRYISSEEIAALEAAIDTAMGRVPPLRSFILPGRSEAASRLHFARTLARRAERRLVELSTEISVRHVLMRYINRLSDCL
YALARAEDHDAHQNNIIQKVAERYLAAIRTSATREPAMSLSFQELHQLTRAAVMRAEELQVPVVISIVDANGTQTVTWRM
PDALLVSSELAPKKAWTAVAMKTATHELTSAVQPGAALYGLESHMQGKVVTFGGGYALWREGLLLGGLGISGGSVEQDMD
IAETAIAAINVRTHQ
>P9WP99 2.5.1.17~~~~~~Corrinoid adenosyltransferase~~~COG2096
MAVHLTRIYTRTGDDGTTGLSDMSRVAKTDARLVAYADCDEANAAIGAALALGHPDTQITDVLRQIQNDLFDAGADLSTP
IVENPKHPPLRIAQSYIDRLEGWCDAYNAGLPALKSFVLPGGSPLSALLHVARTVVRRAERSAWAAVDAHPEGVSVLPAK
YLNRLSDLLFILSRVANPDGDVLWRPGGDRTAS
>Q8ZNR5 2.5.1.-~~~pduO~~~Corrinoid adenosyltransferase PduO~~~
MAIYTRTGDAGTTSLFTGQRVSKTHPRVEAYGTLDELNAALSLCACAAADENHRTLLEAIQQQLFWFSAELASDSEQPSP
KQRYISSEEISALEAAIDRAMARVEPLHSFILPGRCEAASRLHFARTLARRAERRLVELATEVNVRQVLMRYINRLSDCL
YALARAEDSDAHQANIIREVSKRYLAACQPPHSKETTPVALSFHDLHQLTRAAVERAQQLQVPVVVSIVDAHGTETVTWR
MPDALLVSSELAPKKAWTAVAMKTATHELSDVVQPGAALYGLESHLQGKVVTFGGGYALWRDGILIGGLGISGGSVEQDM
DIAQTAIAAINVGTHQ
>B1VB75 1.2.1.87~~~pduP~~~Propanal dehydrogenase (CoA-propanoylating)~~~
MNTSELETLIRNILSEQLAPAKAEVKGNGIFPSVSEAIDAAHQAFLRYQQCPLKTRSAIINALREELTPHLASLAAESAA
ETGMGNKEDKFLKNKAALDNTPGIEDLTTTALTGDGGMVLFEYSPFGVIGSVAPSTNPTETIINNSISMLAAGNSVYFSP
HPGAKAVSLKLITMIEDIAFRCCGIRNLVVTVTEPTFEATQQMMAHPKIAVLAITGGPGIVAMGMKSGKKVIGAGAGNPP
CIVDETADLVKAAEDIINGASFDFNLPCIAEKSLIVVDAVAERLVQQMQSFGAMRLNSEEIDKLRAVCLPEGIANKQLVG
KSPATLLEAAGIPVPAKAPRLLIGIVKADDPWVTSEQLMPMLPIVTVSDFDSALTLALKVEEGLHHTAIMHSQNVSRLNL
AARTLQTSIFVKNGPSYAGIGVGGEGFTTFTIATPTGEGTTSARTFARSRRCVLTNGFSIR
>Q9XDN1 1.2.1.87~~~pduP~~~Propanal dehydrogenase (CoA-propanoylating)~~~
MNTSELETLIRTILSEQLTTPAQTPVQPQGKGIFQSVSEAIDAAHQAFLRYQQCPLKTRSAIISAMRQELTPLLAPLAEE
SANETGMGNKEDKFLKNKAALDNTPGVEDLTTTALTGDGGMVLFEYSPFGVIGSVAPSTNPTETIINNSISMLAAGNSIY
FSPHPGAKKVSLKLISLIEEIAFRCCGIRNLVVTVAEPTFEATQQMMAHPRIAVLAITGGPGIVAMGMKSGKKVIGAGAG
NPPCIVDETADLVKAAEDIINGASFDYNLPCIAEKSLIVVESVAERLVQQMQTFGALLLSPADTDKLRAVCLPEGQANKK
LVGKSPSAMLEAAGIAVPAKAPRLLIALVNADDPWVTSEQLMPMLPVVKVSDFDSALALALKVEEGLHHTAIMHSQNVSR
LNLAARTLQTSIFVKNGPSYAGIGVGGEGFTTFTIATPTGEGTTSARTFARSRRCVLTNGFSIR
>Q9XDN0 1.1.-.-~~~pduQ~~~1-propanol dehydrogenase PduQ~~~
MNTFSLQTRLYSGQGSLAVLKRFTNKHIWIICDGFLAHSPLLDTLRNALPADNRISVFSEITPDPTIHTVVQGIAQMQAL
QPQVVIGFGGGSAMDAAKAIVWFSQQSGINIETCVAIPTTSGTGSEVTSACVISDPDKGIKYPLFNNALYPDMAILDPEL
VVSVPPQITANTGMDVLTHALEAWVSPRASDFTDALAEKAAKLVFQYLPTAVEKGDCVATRGKMHNASTLAGMAFSQAGL
GLNHAIAHQLGGQFHLPHGLANALLLTTVIRFNAGVPRAAKRYARLAKACGFCPAEANDIAAINALIQQIELLKQRCVLP
SLAVALKEGRSDFSARIPAMVQAALADVTLRTNPRPANAEAIRELLEELL
>B1VB77 ~~~pduS~~~Cobalamin reductase PduS~~~
MKTAMTAESTLYDAQTIRERVRAAGVVGAGGAGFPAHVKLQAQVDTFLVNAAECEPMLKVDQQLMAVQAERLIRGVQYAM
TATGARAGIIALKEKYQRAINALTPLLPAGIRLHILPDVYPAGDEVLTIWMATGRRVPPAALPVSVGVVVNNVQTVLNIT
RAVEQQYPVTRRTLTVNGAVARPITLTVPIGMSLREVLALAGGATVDDPGFINGGPMMGGLITSLDTPVSKTTGGLLVLP
KSHALIQRRMQDERTVLSVAKTVCEQCRLCTDLCPRHLIGHELSPHLLVRAVNYQQAATPQLLLTALTCSECNVCESVAC
PVGISPMRINRMLKRELRALNHRYEGPLNPEDEMAKYRLIPVKRLITKLGLSDWYHDAPLTETDYPTDKTTLLLRQHIGA
SAIPCVLQGEHVVRGQCVADVPSGALGAPVHASIDGIVSEITEQSITVIRG
>Q9XDM9 ~~~pduS~~~Cobalamin reductase PduS~~~
MSTAINSVEMSLSADEIRERVRAAGVVGAGGAGFPAHVKLQAQVEIFLVNAAECEPMLKVDQQLMWQQAARLVRGVQYAM
TATGAREGVIALKEKYRRAIDALTPLLPDGIRLHILPDVYPAGDEVLTIWMATGRRVAPAALPASVGVVVNNVQTVLNIA
RAVEQRFPVTRRTLTVNGAVARPLTVTVPIGMSLREVLALAGGATVDDPGFINGGPMMGGLITSLDNPVTKTTGGLLVLP
KSHPLIQRRMQDERTVLSVARTVCEQCRLCTDLCPRHLIGHELSPHLLVRAVNFHQAATPQLLLSALTCSECNVCESVAC
PVGISPMRINRMLKRELRAQNQRYEGPLNPADEMAKYRLVPVKRLIAKLGLSPWYQEAPLVEEEPSVEKVTLQLRQHIGA
SAVANVAVGERVTRGQCVADVPPGALGAPIHASIDGVVSAISEQAITVVRG
>P0DUV7 ~~~pduT~~~Bacterial microcompartment shell protein PduT~~~
MSQAIGILELTSIAKGMEAGDAMLKSANVNLLVSKTICPGKFLLMLGGDVGAVQQAIATGTSLAGDMLVDSLVLPNIHAS
VLPAISGLNSVDKRQAVGIVETWSVAACICAADRAVKASNVTLVRVHMAFGIGGKCYMVVAGDVSDVNNAVTVASESAGE
KGLLVYRSVIPRPHESMWRQMVEG
>Q9XDM8 ~~~pduT~~~Bacterial microcompartment shell protein PduT~~~
MSQAIGILELTSIAKGMELGDAMLKSANVDLLVSKTICPGKFLLMLGGDIGAIQQAIETGTSQAGEMLVDSLVLANIHPS
VLPAISGLNSVDKRQAVGIVETWSVAACISAADRAVKGSNVTLVRVHMAFGIGGKCYMVVAGDVSDVNNAVTVASESAGE
KGLLVYRSVIPRPHEAMWRQMVEG
>P0DUV8 ~~~pduU~~~Bacterial microcompartment shell protein PduU~~~
MERQPTTDRMIQEYVPGKQVTLAHLIANPGKDLFKKLGLPESVSAIGILTITPSEASIIACDIATKSGAVEIGFLDRFTG
AVVLTGDVSAVEYALKQVTRTLGEMMRFTACPITRT
>P0A1D1 ~~~pduU~~~Bacterial microcompartment shell protein PduU~~~
MERQPTTDRMIQEYVPGKQVTLAHLIANPGKDLFKKLGLQDAVSAIGILTITPSEASIIACDIATKSGAVEIGFLDRFTG
AVVLTGDVSAVEYALKQVTRTLGEMMQFTTCSITRT
>B1VB80 ~~~pduV~~~Propanediol utilization protein PduV~~~
MKRIMLIGPSQCGKTSLTQCMRGEALHYQKTQAIVWSPTTIDTPGEYLENRCLYSALLASACEADIIALVLNADAPWSPF
SPGFTGPMNRPVIGLVTKADLASPQRISLVESWLVQAGAQKVFFTSALENTGVDEMFIFLNAKESSCLTK
>Q9XDM6 ~~~pduV~~~Propanediol utilization protein PduV~~~
MKRLMFIGPSQCGKTSLTQSLRGEALHYKKTQAIEWSPMAIDTPGEYLENRCLYSALLTSACEADVIALVLNADAQWSPF
SPGFTAPMNRPTIGLVTKADLAEPQRISLVAEWLTQAGAQQIFITSALNNSGLDAVLDFLNSKEPLCLTK
>A6TDE9 2.7.2.15~~~pduW~~~Propionate kinase~~~
MTYKIMAINAGSSSLKFQLLNMPQGALLCQGLIERIGLPEARFTLKTSAQKWQETLPIADHHEAVTLLLEALTGRGILSS
LQEIDGVGHRVAHGGERFKDAALVCDDTLREIERLAELAPLHNPVNALGIRLFRQLLPAVPAVAVFDTAFHQTLAPEAWL
YPLPWRYYAELGIRRYGFHGTSHHYVSSALAEKLGVPLSALRVVSCHLGNGCSVCAIKGGQSVNTSMGFTPQSGVMMGTR
SGDLDPSILPWLVEKEGKSAQQLSQLLNNESGLLGVSGVSSDYRDVEQAADAGNERAALALSLFAERIRATIGSYIMQMG
GLDALIFTGGIGENSARARAAICRNLHFLGLALDDEKNQRSATFIQADNALVKVAVINTNEELMIARDVMRLALPQAREL
AVSA
>P74879 2.7.2.15~~~pduW~~~Propionate kinase PduW~~~
MSYKIMAINAGSSSLKFQLLEMPQGDMLCQGLIERIGMADAQVTIKTHSQKWQETVPVADHRDAVTLLLEKLLGYQIINS
LRDIDGVGHRVAHGGEFFKDSTLVTDETLAQIERLAELAPLHNPVNALGIHVFRQLLPDAPSVAVFDTAFHQTLDEPAYI
YPLPWHYYAELGIRRYGFHGTSHKYVSGVLAEKLGVPLSALRVICCHLGNGSSICAIKNGRSVNTSMGFTPQSGVMMGTR
SGDIDPSILPWIAQRESKTPQQLNQLLNNESGLLGVSGVSSDYRDVEQAANTGNRQAKLALTLFAERIRATIGSYIMQMG
GLDALVFTGGIGENSARARSAVCHNLQFLGLAVDEEKNQRNATFIQTENALVKVAVINTNEELMIAQDVMRIALPATEGL
CVPA
>Q9XDM4 2.7.1.177~~~pduX~~~L-threonine kinase~~~
MRAHYSYLKGDNVAVAQCPASCGELIQGWILGSEKLVSCPVDWYSTVAVTAAPPLINERPLSRAMVERVLAHWQYPAHWS
NEIRVDVRSSIPVAKGMASSTADIAATAVATAHHLGHSLDETTLAQLCVSIEPTDSTVFHQLTLFDHNNAATQIACEPPP
PIDLLVLESPVTLRTQDYHRLPRQQKLIASSATLQQAWNLVQEACITQNPLRLGEAATLSAIASQTLLPKPGFTALLSLV
EECDLYGLNVAHSGSVVGLMLDRKRHDIARLKGKLAEKKLTRHWPKQHLLKMVTGGVKLQ
>A0A0H3LQK8 1.1.1.408~~~pdxA2~~~D-threonate 4-phosphate dehydrogenase~~~COG1995
MTQDATPSRIPTLAVTLGDVAGIGPEITAKMLLGHDELRQRARLLVVGDAAVLAQAVQAVGGDPARVRVIATPAEATNQP
GSIEVIQAGPSLAHVPPGQLSAEAGDGSVRYVTTACALARDGLIDGIVTAPLNKAAMHMAGHKWPGHTELLAHEFGVKTF
SLVLSAGDLYIFHATTHVSLRQAIEDVNPQRMRAVLRLAGSFARALGRADHPVAVAGLNPHAGENGIFGTEDAEILAPAV
AQANAEGILAAGPIPADALFPQAVRGKWKFVIACYHDQGHAPFKSVYGDDGVNITVGLPVVRVSVDHGTAFDIAGKGIAR
EDSLVLAAERAAQLAPGWHQVWETARSTTGG
>Q0K4F5 1.1.1.408~~~pdxA2~~~D-threonate 4-phosphate dehydrogenase~~~COG1995
MSDYLPVIGITMGDATGIGPEVIVKSLAHDSVRAQCRPLVIGDVRRLEVAGRLVGSPLKLRAIQAPEEARFQSGTIDCID
LGLIPEGLPFGKLSAVAGDAAFRYIERAVALTRDEKIDAICTAPLNKEALHAGGHKFPGHTEMLAYLTGTPEVSMMLVAP
KLRVIHVTTHIGLLDAIRKIEPGLVQRTIERGHQTLQRAGIAAPRIGVCGINPHAGENGLFGHGEEEEKIIPAVEALRAR
GRDVEGPLPADTLFYRAGRGDFDLVVAMYHDQGHGPVKVLGLEAGVNITVGLPVIRTSVDHGTAFDIAGKGIADERSLLE
ALRQGAELATRRA
>B0TBI8 1.1.1.409~~~pdxA2~~~D-erythronate 4-phosphate dehydrogenase~~~COG1995
MQRPIIAIPMGDPAGVGPEIVVKALANEEMYRIARPLVIGDAGVLRQAMAFCGLELAVHTVTEPAMGKFEPGVIDLIDLA
NVELKQLKMGAVQAMAGNAAYECIEKSVSLAMAGQVDAIATTPINKEALKAAGIPHIGHTEILGHLSGANDPLTMFQVFE
LRVFFLSRHVSLRKACDMVTTERALDYLVRCTEALRRLGVDSPKFAVAGLNPHSGEHGLFGDEEDEQIAPAIAAARERGI
NVVGPVPADSVFYFGLKGAYDAILSLYHDQGHIATKMVDFERTVAVTNGLPFLRTSVDHGTAFDIAGSGKASSVSMEEAI
KLAVRYAPSFRRY
>Q6D0N8 1.1.1.408~~~pdxA2~~~D-threonate 4-phosphate dehydrogenase~~~COG1995
MSKIIAVTMGDPAGIGPEIIIKSLAEGELSGASAVVVGCVQTMRRILALNVVPTVELKIIDKPADAVFAPGVINIIDEPL
EDPQALKPGIVQAQAGDLAYRCIKKATALAMAGEVHAIATAPLNKEALHSAGHLYPGHTELLAKLTNSRDYAMVLYTDKL
KVIHVSTHIALRKFLDTLNRDRVETVIEMADVFLKRVGFTHPRIAVAGVNPHAGENGLFGDEEIKIVSPSVEAMKAKGID
VYGPCPPDTVYLQAYEGQYDMVVAMYHDQGHIPLKLLGFYDGVNITAGLPFIRTSADHGTAFDIAWTGKAKPESMAISIQ
LAMQLA
>P58718 1.1.1.408~~~pdxA2~~~D-threonate 4-phosphate dehydrogenase~~~
METKTVAITMGDPAGIGPEIIVKALSEDGLNGAPLVVIGCLATLKRLQAKGITPNVELRAIERVAEARFAPGIIHVIDEP
LAQPEALEAGKVQAQAGDLAYRCVKRATELALRGDVQAIATAPLNKEALHLAGHNYPGHTELLATLTHSRDYAMVLYTDK
LKVIHVSTHIALRKFLDTLSTARVETVIGIADTFLKRVGYVKPRIAVAGVNPHAGENGLFGDEETRILTPAITDARAKGM
DVYGPCPPDTVFLQAYEGQYDMVVAMYHDQGHIPLKLLGFYDGVNITAGLPFIRTSADHGTAFDIAWTGKAKSESMAVSI
KLAMQLA
>Q9PN58 1.1.1.262~~~pdxA~~~4-hydroxythreonine-4-phosphate dehydrogenase~~~COG1995
MKKLAISIGDINSIGLEILVRSHEELSKICTPFYFIHESLLNKALKLLNLKLFNAKIVAFKDDKDYEFNFIKKENSLEIY
SFCLPLGFKVDENFEIQAGEIDAKSGLYGFLSFKAASYFVYEKHAHALLTLPIHKKAWEDAGLKYKGHTDALRDFFKKNA
IMMLGCKELFVGLFSEHIPLAKVSKKITFKNLSIFLKDFYKETHFKKMGLLGFNPHAGDYGVIGGEEEKIMEKAIAFVNA
FLHSKKDEKFFKKALKDENLQKELLLNFKGKGVYLPYPLVADTAFTKTGLKNCNRLVAMYHDLALAPLKALYFDKSINVS
LNLPIIRVSVDHGTAFDKAYKNAKINTKSYFEAAKFAINLHSKA
>P19624 1.1.1.262~~~pdxA~~~4-hydroxythreonine-4-phosphate dehydrogenase~~~COG1995
MVKTQRVVITPGEPAGIGPDLVVQLAQREWPVELVVCADATLLTNRAAMLGLPLTLRPYSPNSPAQPQTAGTLTLLPVAL
RAPVTAGQLAVENGHYVVETLARACDGCLNGEFAALITGPVHKGVINDAGIPFTGHTEFFEERSQAKKVVMMLATEELRV
ALATTHLPLRDIADAITPALLHEVIAILHHDLRTKFGIAEPRILVCGLNPHAGEGGHMGTEEIDTIIPVLNELRAQGMKL
NGPLPADTLFQPKYLDNADAVLAMYHDQGLPVLKYQGFGRGVNITLGLPFIRTSVDHGTALELAGRGKADVGSFITALNL
AIKMIVNTQ
>Q9I5U4 1.1.1.262~~~pdxA~~~4-hydroxythreonine-4-phosphate dehydrogenase~~~
MSLRFALTPGEPAGIGPDLCLLLARSAQPHPLIAIASRTLLQERAGQLGLAIDLKDVSPAAWPERPAKAGQLYVWDTPLA
APVRPGQLDRANAAYVLETLTRAGQGCLDGHFAGMITAPVHKGVINEAGIPFSGHTEFLADLTHTAQVVMMLATRGLRVA
LATTHLPLREVADAISDERLTRVARILHADLRDKFGIAHPRILVCGLNPHAGEGGHLGREEIEVIEPCLERLRGEGLDLI
GPLPADTLFTPKHLEHCDAVLAMYHDQGLPVLKYKGFGAAVNVTLGLPIIRTSVDHGTALDLAGSGRIDSGSLQVALETA
YQMAASRC
>P58717 1.1.1.262~~~pdxA~~~4-hydroxythreonine-4-phosphate dehydrogenase~~~
MSSAQRVVITPGEPAGSGPDLVVQLAQRAWPIELVVCADGALLTERAAMLGLPLSLLPYSPDVPAAPQPAGTLTLLPVSL
RAPAISGQLTVENGPYVVETLARACDGCLNGEFAALITGPVHKGVINDAGISFTGHTEFFEERSQAKKVVMMLATEELRV
ALATTHLPLRAIADAITPALLHEVIAILHHDLRTKFGIAEPRILVCGLNPHAGEGGHMGTEEIDTIIPVLDELRAQGMKL
NGPLPADTLFQPKYLDNADAVLAMYHDQGLPVLKYQGFGRGVNITLGLPFIRTSVDHGTALELAGRGKADVGSFITALNL
AIKMIVNTQ
>P58719 1.1.1.262~~~pdxA~~~4-hydroxythreonine-4-phosphate dehydrogenase~~~COG1995
MHNHNNRLVITPGEPAGVGPDLAITLAQQDWPVELVVCADPALLLARASQLNLPLQLREYQADQPAIAQQAGSLTILPVK
TAVNVVPGKLDVGNSHYVVETLAKACDGAISGEFAALVTGPVQKSIINDAGIPFIGHTEFFADRSHCQRVVMMLATEELR
VALATTHLPLLAVPGAITQASLHEVITILDNDLKTKFGITQPQIYVCGLNPHAGEGGHMGHEEIDTIIPALNTLRQQGIN
LIGPLPADTLFQPKYLQHADAVLAMYHDQGLPVLKYQGFGRAVNITLGLPFIRTSVDHGTALELAATGTADVGSFITALN
LAIKMINNSNE
>P05459 1.1.1.290~~~pdxB~~~Erythronate-4-phosphate dehydrogenase~~~COG0111
MKILVDENMPYARDLFSRLGEVTAVPGRPIPVAQLADADALMVRSVTKVNESLLAGKPIKFVGTATAGTDHVDEAWLKQA
GIGFSAAPGCNAIAVVEYVFSSLLMLAERDGFSLYDRTVGIVGVGNVGRRLQARLEALGIKTLLCDPPRADRGDEGDFRS
LDELVQRADILTFHTPLFKDGPYKTLHLADEKLIRSLKPGAILINACRGAVVDNTALLTCLNEGQKLSVVLDVWEGEPEL
NVELLKKVDIGTSHIAGYTLEGKARGTTQVFEAYSKFIGHEQHVALDTLLPAPEFGRITLHGPLDQPTLKRLVHLVYDVR
RDDAPLRKVAGIPGEFDKLRKNYLERREWSSLYVICDDASAASLLCKLGFNAVHHPAR
>Q9I3W9 1.1.1.290~~~pdxB~~~Erythronate-4-phosphate dehydrogenase~~~
MRILADENIPVVDAFFADQGSIRRLPGRAIDRAALAEVDVLLVRSVTEVSRAALAGSPVRFVGTCTIGTDHLDLDYFAEA
GIAWSSAPGCNARGVVDYVLGCLLAMAEVRGADLAERTYGVVGAGQVGGRLVEVLRGLGWKVLVCDPPRQAREPDGEFVS
LERLLAEADVISLHTPLNRDGEHPTRHLLDEPRLAALRPGTWLVNASRGAVVDNQALRRLLEGGADLEVALDVWEGEPQA
DPELAARCLIATPHIAGYSLEGKLRGTAQIYQAYCAWRGIAERVSLQDVLPETWLAGLQLNPGCDPAWALATLCRAVYDP
RSDDAAFRRSLTGDSATRRAAFDALRKHYPPRREITGLRVATGGQAELQRVVRALGAQLV
>P60802 1.1.1.290~~~pdxB~~~Erythronate-4-phosphate dehydrogenase~~~
MKILVDENMPYARELFSRLGEVKAVPGRPIPVEELNHADALMVRSVTKVNESLLSGTPINFVGTATAGTDHVDEAWLKQA
GIGFSAAPGCNAIAVVEYVFSALLMLAERDGFSLRDRTIGIVGVGNVGSRLQTRLEALGIRTLLCDPPRAARGDEGDFRT
LDELVQEADVLTFHTPLYKDGPYKTLHLADETLIRRLKPGAILINACRGPVVDNAALLARLNAGQPLSVVLDVWEGEPDL
NVALLEAVDIGTSHIAGYTLEGKARGTTQVFEAYSAFIGREQRVALETLLPAPEFGRITLHGPLDQPTLKRLAHLVYDVR
RDDAPLRKVAGIPGEFDKLRKNYLERREWSSLYVMCDDETAAALLCKLGFNAVHHPAH
>Q9KQ92 1.1.1.290~~~pdxB~~~Erythronate-4-phosphate dehydrogenase~~~COG0111
MKILIDENMPYAQALFSQLGEVILKPGRTLTADDLIDVDALMIRSVTKVNDALLAKANRLKFVGTATAGMDHVDQALLRE
RGIFFTAAPGCNKVGVAEYVFSVLMVLAQQQGFSVFDKTVGIIGAGQVGSYLAKCLSGIGMKVLLNDPPKQAQGDEREFT
ELETLLKQADVITLHTPITRGGEWPTHHLIDAAILEQLRSDQILINAARGPVVDNAALKARLQQGDGFTAVLDVFEFEPQ
VDMELLPLLAFATPHIAGYGLEGKARGTTMIFNSYCEFLGSAHCANPASLLPKAPVPKVYLERAWDEETLRTLTQIIYDV
RKDDAQFRREIHQPGAFDLMRKHYWDRREYSAVTLAGGADCHLAPLAKLGFQVEVCDEPTI
>P0AFI7 1.4.3.5~~~pdxH~~~Pyridoxine/pyridoxamine 5'-phosphate oxidase~~~COG0259
MSDNDELQQIAHLRREYTKGGLRRRDLPADPLTLFERWLSQACEAKLADPTAMVVATVDEHGQPYQRIVLLKHYDEKGMV
FYTNLGSRKAHQIENNPRVSLLFPWHTLERQVMVIGKAERLSTLEVMKYFHSRPRDSQIGAWVSKQSSRISARGILESKF
LELKQKFQQGEVPLPSFWGGFRVSLEQIEFWQGGEHRLHDRFLYQRENDAWKIDRLAP
>P9WIJ1 1.4.3.5~~~pdxH~~~Pyridoxine 5'-phosphate oxidase~~~COG0259
MDDDAQMVAIDKDQLARMRGEYGPEKDGCGDLDFDWLDDGWLTLLRRWLNDAQRAGVSEPNAMVLATVADGKPVTRSVLC
KILDESGVAFFTSYTSAKGEQLAVTPYASATFPWYQLGRQAHVQGPVSKVSTEEIFTYWSMRPRGAQLGAWASQQSRPVG
SRAQLDNQLAEVTRRFADQDQIPVPPGWGGYRIAPEIVEFWQGRENRMHNRIRVANGRLERLQP
>P21159 1.4.3.5~~~pdxH~~~Pyridoxine/pyridoxamine 5'-phosphate oxidase~~~
MRTLTCVPDESTAKVHTCRAPFMLHRVMIPPDPIQRFAELFERAKQAIAVDPNAMVVATVGDDGRPSARVVLLKDFDARG
FVFYTNHESRKGREARAHPYAALCFYWQPLNEQVRVEGRVERVTDAEADAYFQSRARGSQVGAWASLQSQPLATREELEA
RVAEVEQKYAGQPVPRPPHWSGFRVVPDRIEFWHAQESRLHDRHVYLREDGGWRTQMLYP
>P25906 1.1.1.65~~~pdxI~~~Pyridoxine 4-dehydrogenase~~~COG0667
MSSNTFTLGTKSVNRLGYGAMQLAGPGVFGPPRDRHVAITVLREALALGVNHIDTSDFYGPHVTNQIIREALYPYSDDLT
IVTKIGARRGEDASWLPAFSPAELQKAVHDNLRNLGLDVLDVVNLRVMMGDGHGPAEGSIEASLTVLAEMQQQGLVKHIG
LSNVTPTQVAEARKIAEIVCVQNEYNIAHRADDAMIDALAHDGIAYVPFFPLGGFTPLQSSTLSDVAASLGATPMQVALA
WLLQRSPNILLIPGTSSVAHLRENMAAEKLHLSEEVLSTLDGISRE
>Q3JQ80 2.6.99.2~~~pdxJ~~~Pyridoxine 5'-phosphate synthase~~~
MSFFLTTPAAIDLGVNIDHVATLRNARGTAYPDPVRAALAAEDAGADAITLHLREDRRHIVDADVRTLRPRVKTRMNLEC
AVTPEMLDIACEIRPHDACLVPEKRSELTTEGGLDVVGHFDAVRAACKQLADAGVRVSLFIDPDEAQIRAAHETGAPVIE
LHTGRYADAHDAAEQQREFERIATGVDAGIALGLKVNAGHGLHYTNVQAIAALPGIAELNIGHAIVAHAVFVGWDNAVRE
MKAIMVAARVAALHGGR
>Q9PN59 2.6.99.2~~~pdxJ~~~Pyridoxine 5'-phosphate synthase~~~COG0854
MLLGVNIDHIAVLRQARMVNDPDLLEAAFIVARHGDQITLHVREDRRHAQDFDLENIIKFCKSPVNLECALNDEILNLAL
KLKPHRVTLVPEKREELTTEGGLCLNHAKLKQSIEKLQNANIEVSLFINPSLEDIEKSKILKAQFIELHTGHYANLHNAL
FSNISHTAFALKELDQDKKTLQAQFEKELQNLELCAKKGLELGLKVAAGHGLNYKNVKPVVKIKEICELNIGQSIVARSV
FTGLQNAILEMKELIKR
>Q3V891 2.6.99.2~~~pdxJ~~~Pyridoxine 5'-phosphate synthase~~~COG0854
MPILVVNVDHVATLRQQRLGIEPDPVTAAHMAELAGARGIIVHLREDRRHIQDRDVSLLRQTLKTRLHLEMAATEEMQGI
ALAEKPHMVCLVPEKREELTTEGGLVVAGRVDFLQAYVKPFHEIGIATSLFIEADPDQIRAAARVGVTHVELHTGHFADA
PDAAERKRQRDAIVAGIGLARTLGLKVNLGHGLNYDNIFDFEAVPGICEFSIGHSIVSRAVLTGFGPAVRDMVDIINRFP
G
>P0A794 2.6.99.2~~~pdxJ~~~Pyridoxine 5'-phosphate synthase~~~COG0854
MAELLLGVNIDHIATLRNARGTAYPDPVQAAFIAEQAGADGITVHLREDRRHITDRDVRILRQTLDTRMNLEMAVTEEML
AIAVETKPHFCCLVPEKRQEVTTEGGLDVAGQRDKMRDACKRLADAGIQVSLFIDADEEQIKAAAEVGAPFIEIHTGCYA
DAKTDAEQAQELARIAKAATFAASLGLKVNAGHGLTYHNVKAIAAIPEMHELNIGHAIIGRAVMTGLKDAVAEMKRLMLE
ARG
>Q02HS5 2.6.99.2~~~pdxJ~~~Pyridoxine 5'-phosphate synthase~~~
MTEATRILLGVNIDHVATLRQARGTRYPDPVKAALDAEEAGADGITVHLREDRRHIQERDVRVLKEVLQTRMNFEMGVTE
EMLAFAEEIRPAHSCLVPERREELTTEGGLDVAGQEQRIRDAVRRLAAVGSEVSLFIDPDPRQIEASARVGAPAIELHTG
RYADAEDPEEQARELQRVREGVALGRSLGLIVNAGHGLHYHNVEPVAAIDGINELNIGHAIVAHALFVGFRQAVAEMKAL
MLAAATKR
>Q8ZCP4 2.6.99.2~~~pdxJ~~~Pyridoxine 5'-phosphate synthase~~~COG0854
MADLLLGVNIDHIATLRNARGTIYPDPVQAAFIAEQAGADGITVHLREDRRHITDRDVRILRQTIQTRMNLEMAVTDEMV
DIACDIKPHFCCLVPEKRQEVTTEGGLDVAGQVDKMTLAVGRLADVGILVSLFIDADFRQIDAAVAAGAPYIEIHTGAYA
DASTVLERQAELMRIAKAATYAAGKGLKVNAGHGLTYHNVQPIAALPEMHELNIGHAIIGQAVMTGLAAAVTDMKVLMRE
ARR
>P39610 2.7.1.35~~~pdxK~~~Pyridoxine kinase~~~COG0351
MSMHKALTIAGSDSSGGAGIQADLKTFQEKNVYGMTALTVIVAMDPNNSWNHQVFPIDTDTIRAQLATITDGIGVDAMKT
GMLPTVDIIELAAKTIKEKQLKNVVIDPVMVCKGANEVLYPEHAQALREQLAPLATVITPNLFEASQLSGMDELKTVDDM
IEAAKKIHALGAQYVVITGGGKLKHEKAVDVLYDGETAEVLESEMIDTPYTHGAGCTFSAAVTAELAKGAEVKEAIYAAK
EFITAAIKESFPLNQYVGPTKHSALRLNQQS
>P40191 2.7.1.35~~~pdxK~~~Pyridoxine/pyridoxal/pyridoxamine kinase~~~COG2240
MSSLLLFNDKSRALQADIVAVQSQVVYGSVGNSIAVPAIKQNGLNVFAVPTVLLSNTPHYDTFYGGAIPDEWFSGYLRAL
QERDALRQLRAVTTGYMGTASQIKILAEWLTALRKDHPDLLIMVDPVIGDIDSGIYVKPDLPEAYRQYLLPLAQGITPNI
FELEILTGKNCRDLDSAIAAAKSLLSDTLKWVVVTSASGNEENQEMQVVVVTADSVNVISHSRVKTDLKGTGDLFCAQLI
SGLLKGKALTDAVHRAGLRVLEVMRYTQQHESDELILPPLAEA
>P40192 2.7.1.35~~~pdxK~~~Pyridoxine/pyridoxal/pyridoxamine kinase~~~
MGQESDIQSVLFDDNHRALQTDIVAVQSQVVYGSVGNSIAVPAIKAQGLRVTAVPTVLFSNTPHYKTFYGGIIPAEWFAG
YLTALNERDALRELKAITTGYMGSADQIVLLSKWLMAIRASHPEVCILVDPVIGDTDSGMYVQAEIPQAYRTHLLPQAQG
LTPNVFELEMLSGKPCRTLEEAVAAAQSLLSDTLKWVVITSAPGESLETITVAVVTAQVVEVFAHPRVATELKGTGDLFC
AELVSGIVQGKKLTTAAKDAAQRVLEVMTWTQQCGCDELILPPAGEAR
>P37527 4.3.3.6~~~pdxS~~~Pyridoxal 5'-phosphate synthase subunit PdxS~~~COG0214
MAQTGTERVKRGMAEMQKGGVIMDVINAEQAKIAEEAGAVAVMALERVPADIRAAGGVARMADPTIVEEVMNAVSIPVMA
KARIGHIVEARVLEAMGVDYIDESEVLTPADEEFHLNKNEYTVPFVCGCRDLGEATRRIAEGASMLRTKGEPGTGNIVEA
VRHMRKVNAQVRKVVAMSEDELMTEAKNLGAPYELLLQIKKDGKLPVVNFAAGGVATPADAALMMQLGADGVFVGSGIFK
SDNPAKFAKAIVEATTHFTDYKLIAELSKELGTAMKGIEISNLLPEQRMQERGW
>P82134 4.3.3.6~~~pdxS~~~Pyridoxal 5'-phosphate synthase subunit PdxS~~~COG0214
MTESGSTASPLCGVGSSVMTETQETYQATTRVKRGLADMLKGGVIMDVVTPEQARIAEDAGASAVMALERVPADIRSQGG
VARMSDPDLIEGIVNAVSIPVMAKARIGHFVEAQVLEALGVDFIDESEVLSPADYTHHINKWKFDVPFVCGATNLGEALR
RITEGAAMIRSKGEAGTGDVSEAVRHLRTIRGDINRLRSLDEDELFVAAKEFQAPYDLVREVASTGKLPVVTFVAGGVAT
PADAALVRQMGAEGVFVGSGIFKSGNPAARAAAIVKAATLFDDPSVIADVSRGLGEAMVGINVSDVPAPHRLAERGW
>Q5L3Y2 4.3.3.6~~~pdxS~~~Pyridoxal 5'-phosphate synthase subunit PdxS~~~COG0214
MALTGTDRVKRGMAEMQKGGVIMDVVNAEQAKIAEAAGAVAVMALERVPADIRAAGGVARMADPTVIEEVMNAVSIPVMA
KVRIGHYVEARVLEALGVDYIDESEVLTPADEEFHIDKRQFTVPFVCGCRDLGEAARRIAEGASMLRTKGEPGTGNIVEA
VRHMRKVNAQIRKVVNMSEDELVAEAKQLGAPVEVLREIKRLGRLPVVNFAAGGVATPADAALMMHLGADGVFVGSGIFK
SENPEKYARAIVEATTHYEDYELIAHLSKGLGGAMRGIDIATLLPEHRMQERGW
>P45293 4.3.3.6~~~pdxS~~~Pyridoxal 5'-phosphate synthase subunit PdxS~~~COG0214
MAENRYELNKNLAQMLKGGVIMDVQNPEQARIAEAAGAAAVMALERIPADIRAVGGVSRMSDPKMIKEIQGAVSIPVMAK
VRIGHFVEAQILEAIEIDYIDESEVLSPADNRFHVDKKEFQVPFVCGAKDLGEALRRIAEGASMIRTKGEPGTGDIVQAV
RHMRMMSQEIRRIQNLREDELYVAAKDLQVPVELVQYVHKHGKLPVVNFAAGGIATPADAALMMQLGAEGVFVGSGIFKS
GDPIKRASAIVKAVTNYRNPQILAQISEDLGEAMVGINENEIQILMAERGK
>P9WII9 4.3.3.6~~~pdxS~~~Pyridoxal 5'-phosphate synthase subunit PdxS~~~COG0214
MDPAGNPATGTARVKRGMAEMLKGGVIMDVVTPEQARIAEGAGAVAVMALERVPADIRAQGGVSRMSDPDMIEGIIAAVT
IPVMAKVRIGHFVEAQILQTLGVDYIDESEVLTPADYAHHIDKWNFTVPFVCGATNLGEALRRISEGAAMIRSKGEAGTG
DVSNATTHMRAIGGEIRRLTSMSEDELFVAAKELQAPYELVAEVARAGKLPVTLFTAGGIATPADAAMMMQLGAEGVFVG
SGIFKSGAPEHRAAAIVKATTFFDDPDVLAKVSRGLGEAMVGINVDEIAVGHRLAQRGW
>P60798 4.3.3.6~~~pdxS~~~Pyridoxal 5'-phosphate synthase subunit PdxS~~~
MSKIIGSDRVKRGMAEMQKGGVIMDVVNAEQARIAEEAGAVAVMALERVPSDIRAAGGVARMANPKIVEEVMNAVSIPVM
AKARIGHITEARVLEAMGVDYIDESEVLTPADEEYHLRKDQFTVPFVCGCRNLGEAARRIGEGAAMLRTKGEPGTGNIVE
AVRHMRQVNSEVSRLTVMNDDEIMTFAKDIGAPYEILKQIKDNGRLPVVNFAAGGVATPQDAALMMELGADGVFVGSGIF
KSEDPEKFAKAIVQATTHYQDYELIGRLASELGTAMKGLDINQLSLEERMQERGW
>Q9WYU4 4.3.3.6~~~pdxS~~~Pyridoxal 5'-phosphate synthase subunit PdxS~~~COG0214
MEIKKGTWIIKKGFAEMFKGGVIMDVTSAEQAKIAEEAGAVAVMALERVPADIRKEGGVARMASIAKIREIMEAVSIPVM
AKVRIGHIAEAKILEELGVDFIDESEVLTPADDRFHINKHEFKVPFVCGARDLGEALRRIAEGAAMIRTKGEAGTGNVVE
AVKHMRRVMEQIKQVTKMEDEELVAYGKEIGAPVELLREVKRLGRLPVVNFAAGGVATPADAALMMMLGADGVFVGSGIF
KSKDPRKMAKAMVLAVTYWDNPRILLKISEDIGEPMRGLDVEELEVRMQERGW
>Q5SKD9 4.3.3.6~~~pdxS~~~Pyridoxal 5'-phosphate synthase subunit PdxS~~~COG0214
MEKGTFQIKTGFAEMFKGGVIMDVTTPEQAVIAEEAGAVAVMALERVPADIRAQGGVARMSDPKIIKEIMAAVSIPVMAK
VRIGHFVEAMILEAIGVDFIDESEVLTPADEEHHIDKWKFKVPFVCGARNLGEALRRIAEGAAMIRTKGEAGTGNVVEAV
RHARTMWKEIRYVQSLREDELMAYAKEIGAPFELVKWVHDHGRLPVVNFAAGGIATPADAALMMHLGMDGVFVGSGIFKS
GDPRKRARAIVRAVAHYNDPEVLAEVSEDLGEPMVGINLDQLKEEERLAKRGW
>P37528 4.3.3.6~~~pdxT~~~Pyridoxal 5'-phosphate synthase subunit PdxT~~~COG0311
MLTIGVLGLQGAVREHIHAIEACGAAGLVVKRPEQLNEVDGLILPGGESTTMRRLIDTYQFMEPLREFAAQGKPMFGTCA
GLIILAKEIAGSDNPHLGLLNVVVERNSFGRQVDSFEADLTIKGLDEPFTGVFIRAPHILEAGENVEVLSEHNGRIVAAK
QGQFLGCSFHPELTEDHRVTQLFVEMVEEYKQKALV
>Q5L3Y1 4.3.3.6~~~pdxT~~~Pyridoxal 5'-phosphate synthase subunit PdxT~~~COG0311
MKIGVLGLQGAVREHVRAIEACGAEAVIVKKPEQLEGLDGLVLPGGESTTMRRLIDRYGLMEPLKQFAAAGKPMFGTCAG
LILLAKRIVGYDEPHLGLMDITVERNSFGRQRESFEAELSIKGVGDGFVGVFIRAPHIVEAGDGVDVLATYNDRIVAARQ
GQFLGCSFHPELTDDHRLMQYFLNMVKEAKMASSLK
>P83813 4.3.3.6~~~pdxT~~~Pyridoxal 5'-phosphate synthase subunit PdxT~~~
MKIGVLGLQGAVREHVRAIEACGAEAVIVKKSEQLEGLDGLVLPGGESTTXRRLIDRYGLXEPLKQFAAAGKPXFGTCAG
LILLAKRIVGYDEPHLGLXDITVERNSFGRQRESFEAELSIKGVGDGFVGVFIRAPHIVEAGDGVDVLATYNDRIVAARQ
GQFLGCSFHPELTDDHRLXQYFLNXVKEAKXASSLK
>A0QWH0 4.3.3.6~~~pdxT~~~Pyridoxal 5'-phosphate synthase subunit PdxT~~~COG0311
MTAHVGVLALQGDTREHLAALREAGAEASTVRRLSELAAVDALVIPGGESTAISHLLREFDLLEPLRARIAEGMPCYGSC
AGMILLATEIADAGVPGRAAVPLKGIDMTVRRNAFGRQVDSFEGDIDFVGLDTPVHAVFIRAPWVERIGPDVEVLARADD
HIVAVRQGPMFATAFHPEVTGDRRIHKLFVDSL
>P9WII7 4.3.3.6~~~pdxT~~~Pyridoxal 5'-phosphate synthase subunit PdxT~~~COG0311
MSVPRVGVLALQGDTREHLAALRECGAEPMTVRRRDELDAVDALVIPGGESTTMSHLLLDLDLLGPLRARLADGLPAYGS
CAGMILLASEILDAGAAGRQALPLRAMNMTVRRNAFGSQVDSFEGDIEFAGLDDPVRAVFIRAPWVERVGDGVQVLARAA
GHIVAVRQGAVLATAFHPEMTGDRRIHQLFVDIVTSAA
>Q8L1A7 4.3.3.6~~~pdxT~~~Pyridoxal 5'-phosphate synthase subunit PdxT~~~
MKVGVLALQGAVAEHIRLIEAVGGEGVVVKRAEQLAELDGLIIPGGESTTIGKLMRRYGFIEAIRDFSNQGKAVFGTCAG
LIVIADKIAGQEEAHLGLMDMTVQRNAFGRQRESFETDLPVKGIDRPVRAVFIRAPLIDQVGNGVDVLSEYNGQIVAARQ
GHLLAASFHPELTDDSSMHAYFLDMIREAR
>Q7A7A1 4.3.3.6~~~pdxT~~~Pyridoxal 5'-phosphate synthase subunit PdxT~~~
MKIGVLALQGAVREHIRHIELSGHEGIAVKKVEQLEEIEGLILPGGESTTLRRLMNLYGFKEALQNSTLPMFGTCAGLIV
LAQDIVGEEGYLNKLNITVQRNSFGRQVDSFETELDIKGIATDIEGVFIRAPHIEKVGQGVDILCKVNEKIVAVQQGKYL
GVSFHPELTDDYRVTDYFINHIVKKA
>Q9WYU3 4.3.3.6~~~pdxT~~~Pyridoxal 5'-phosphate synthase subunit PdxT~~~COG0311
MKIGVLGVQGDVREHVEALHKLGVETLIVKLPEQLDMVDGLILPGGESTTMIRILKEMDMDEKLVERINNGLPVFATCAG
VILLAKRIKNYSQEKLGVLDITVERNAYGRQVESFETFVEIPAVGKDPFRAIFIRAPRIVETGKNVEILATYDYDPVLVK
EGNILACTFHPELTDDLRLHRYFLEMVK
>Q5SKD6 4.3.3.6~~~pdxT~~~Pyridoxal 5'-phosphate synthase subunit PdxT~~~COG0311
MRGVVGVLALQGDFREHKEALKRLGIEAKEVRKKEHLEGLKALIVPGGESTTIGKLAREYGIEDEVRKRVEEGSLALFGT
CAGAIWLAKEIVGYPEQPRLGVLEAWVERNAFGRQVESFEEDLEVEGLGSFHGVFIRAPVFRRLGEGVEVLARLGDLPVL
VRQGKVLASSFHPELTEDPRLHRYFLELAGV
>P77150 2.7.1.35~~~pdxY~~~Pyridoxal kinase PdxY~~~COG2240
MMKNILAIQSHVVYGHAGNSAAEFPMRRLGANVWPLNTVQFSNHTQYGKWTGCVMPPSHLTEIVQGIAAIDKLHTCDAVL
SGYLGSAEQGEHILGIVRQVKAANPQAKYFCDPVMGHPEKGCIVAPGVAEFHVRHGLPASDIIAPNLVELEILCEHAVNN
VEEAVLAARELIAQGPQIVLVKHLARAGYSRDRFEMLLVTADEAWHISRPLVDFGMRQPVGVGDVTSGLLLVKLLQGATL
QEALEHVTAAVYEIMVTTKAMQEYELQVVAAQDRIAKPEHYFSATKL
>Q141E8 2.7.1.35~~~pdxY~~~Pyridoxal kinase PdxY~~~COG2240
MTKNVLSIQSHVVFGHAGNSAAVFPMRRLGVNVWPLNTVQFSNHTQYGHWTGGAIDATQMVELVDGIGAIGMLPRCDAVL
SGYLGTPEQAQSVLEIVKAVKAANPRAWYFCDPVMGAVSGCKVEPGIQEFLVRTMPGVADAMAPNHTELQRLVGREIETL
EEAVTACRELIARGPKLVLVKHLLDRNSPADRFNMLVVTEREAWMGQRPLYPFARQPVGVGDLTSAVFVARTLLGDSIRA
AFEHTLAAVNAVVKATWQAGRYELELVAAQSEIAQPREWFDAWVGDTA
>Q9HT57 2.7.1.35~~~pdxY~~~Pyridoxal kinase PdxY~~~
MPRTPHLLAIQSHVVFGHAGNAAAVFPMQRIGINVWPLNTVQFSNHTQYGRWTGQVLPPEQIPALVDGIAGIGELGNCDA
VLSGYLGSAAQGRAILDVVARIKQANPRALYLCDPVMGHPEKGCIVAPEVSDFLLEEAAAVADYLCPNQLELDSFCDRQP
NSLADCVEMARSLLARGPRAILVKHLNYPGKAGDTFEMLLVAADQAWHLQRPLLAFPRQPVGVGDLASGLFLSRLLLGDD
LRNAFEFTGAAVHEVLLETQACGSYELELVRAQDRIAHPRVRFDAVRL
>Q7CIR8 2.7.1.35~~~pdxY~~~Pyridoxal kinase PdxY~~~COG2240
MKNILSIQSHVVFGHAGNSAAEFPMRRMGVNVWPLNTVQFSNHTQYGHWTGCVMPASHLTDIVQGIADIDRLKDCDAVLS
GYIGSPEQGSHILAAVAQVKQANPDAWYFCDPVMGHPEKGCIVAPGVAEFFCNEALPASDMIAPNLLELEQLSGERVENV
EQAVQVARSLCARGPKVVLVKHLSRAGYHADCFEMLLVTADDAWHICRPLVDFGKRQPVGVGDLTSGLLLVNLLKGEPLD
KALEHVTAAVYEVMLKTQEMGEYELQVVAAQETIVTPICQFTAVRL
>Q79G04 ~~~PE3~~~PE family protein PE3~~~COG0657
MSYVIAAPEMLATTAADVDGIGSAIRAASASAAGPTTGLLAAAADEVSSAAAALFSEYARECQEVLKQAAAFHGEFTRAL
AAAGAAYAQAEASNTAAMSGTAGSSGALGSVGMLSGNPLTALMMGGTGEPILSDRVLAIIDSAYIRPIFGPNNPVAQYTP
EQWWPFIGNLSLDQSIAQGVTLLNNGINAELQNGHDVVVFGYSQSAAVATNEIRALMALPPGQAPDPSRLAFTLIGNINN
PNGGVLERYVGLYLPFLDMSFNGATPPDSPYQTYMYTGQYDGYAHNPQYPLNILSDLNAFMGIRWVHNAYPFTAAEVANA
VPLPTSPGYTGNTHYYMFLTQDLPLLQPIRAIPFVGTPIAELIQPDLRVLVDLGYGYGYADVPTPASLFAPINPIAVASA
LATGTVQGPQAALVSIGLLPQSALPNTYPYLPSANPGLMFNFGQSSVTELSVLSGALGSVARLIPPIA
>L7N695 ~~~PE5~~~PE family immunomodulator PE5~~~
MTLRVVPEGLAAASAAVEALTARLAAAHASAAPVITAVVPPAADPVSLQTAAGFSAQGVEHAVVTAEGVEELGRAGVGVG
ESGASYLAGDAAAAATYGVVGG
>Q79FS8 ~~~PE9~~~PE family protein PE9~~~COG0657
MSYMIATPAALTAAATDIDGIGSAVSVANAAAVAATTGVLAAGGDEVLAAIARLFNANAEEYHALSAQVAAFQTLFVRTL
TGGCGVFRRRRGRQCVTAAEHRAAGAGRRQRRRRSGDGQWRLRQQRHFGCGGQPEFRQHSEHRR
>Q79FR5 3.1.1.6~~~~~~Esterase PE11~~~
MSFVTTRPDSIGETAANLHEIGVTMSAHDDGVTPLITNVESPAHDLVSIVTSMLFSMHGELYKAIARQAHVIHESFVQTL
QTSKTSYWLTELANRAGTST
>Q79FR3 ~~~~~~PE family protein PE13~~~COG0657
MSFVMAYPEMLAAAADTLQSIGATTVASNAAAAAPTTGVVPPAADEVSALTAAHFAAHAAMYQSVSARAAAIHDQFVATL
ASSASSYAATEVANAAAAS
>P9WIH1 ~~~~~~PE family immunomodulator PE15~~~
MTLRVVPESLAGASAAIEAVTARLAAAHAAAAPFIAAVIPPGSDSVSVCNAVEFSVHGSQHVAMAAQGVEELGRSGVGVA
ESGASYAARDALAAASYLSGGL
>L7N697 3.1.1.-~~~~~~Esterase PE16~~~COG3391
MSFVFAVPEMVAATASDLASLGAALSEATAAAAIPTTQVLAAAADEVSAAIAELFGAHGQEFQALSAQASAFHDRFVRAL
SAAAGWYVDAEAANAALVDTAATGASELGSGGRTALILGSTGTPRPPFDYMQQVYDRYIAPHYLGYAFSGLYTPAQFQPW
TGIPSLTYDQSVAEGAGYLHTAIMQQVAAGNDVVVLGFSQGASVATLEMRHLASLPAGVAPSPDQLSFVLLGNPNNPNGG
ILARFPGLYLQSLGLTFNGATPDTDYATTIYTTQYDGFADFPKYPLNILADVNALLGIYYSHSLYYGLTPEQVASGIVLP
VSSPDTNTTYILLPNEDLPLLQPLRGIVPEPLLDLIEPDLRAIIELGYDRTGYADVPTPAALFPVHIDPIAVPPQIGAAI
GGPLTALDGLLDTVINDQLNPVVTSGIYQAGAELSVAAAGYGAPAGVTNAIFIGQQVLPILVEGPGALVTADTHYLVDAI
QDLAAGDLSGFNQNLQLIPATNIALLVFAAGIPAVAAVAILTGQDFPV
>P9WIG9 ~~~~~~Uncharacterized PE family protein PE23~~~COG0657
MQFLSVIPEQVESAAQDLAGIRSALSASYAAAAGPTTAVVSAAEDEVSTAIASIFGAYGRQCQVLSAQASAFHDEFVNLL
KTGATAYRNTEFANAQSNVLNAVNAPARSLLGHPSAAESVQNSAPTLGGGHSTVTAGLAAQAGRAVATVEQQAAAAVAPL
PSAGAGLAQVVNGVVTAGQGSAAKLATALQSAAPWLAKSGGEFIVAGQSALTGVALLQPAVVGVVQAGGTFLTAGTSAAT
GLGLLTLAGVEFSQGVGNLALASGTAATGLGLLGSAGVQLFSPAFLLAVPTALGGVGSLAIAVVQLVQGVQHLSLVVPNV
VAGIAALQTAGAQFAQGVNHTMLAAQLGAPGIAVLQTAGGHFAQGIGHLTTAGNAAVTVLIS
>I6X486 ~~~~~~PE-PGRS family protein PE25~~~
MSFVITNPEALTVAATEVRRIRDRAIQSDAQVAPMTTAVRPPAADLVSEKAATFLVEYARKYRQTIAAAAVVLEEFAHAL
TTGADKYATAEADNIKTFS
>P9WIG7 ~~~~~~PE family immunomodulator PE35~~~
MEKMSHDPIAADIGTQVSDNALHGVTAGSTALTSVTGLVPAGADEVSAQAATAFTSEGIQLLASNASAQDQLHRAGEAVQ
DVARTYSQIDDGAAGVFAE
>Q0P9X8 ~~~peb1A~~~Major cell-binding factor~~~COG0834
MVFRKSLLKLAVFALGACVAFSNANAAEGKLESIKSKGQLIVGVKNDVPHYALLDQATGEIKGFEVDVAKLLAKSILGDD
KKIKLVAVNAKTRGPLLDNGSVDAVIATFTITPERKRIYNFSEPYYQDAIGLLVLKEKKYKSLADMKGANIGVAQAATTK
KAIGEAAKKIGIDVKFSEFPDYPSIKAALDAKRVDAFSVDKSILLGYVDDKSEILPDSFEPQSYGIVTKKDDPAFAKYVD
DFVKEHKNEIDALAKKWGL
>Q02189 1.3.7.2~~~pebA~~~15,16-dihydrobiliverdin:ferredoxin oxidoreductase~~~
MFDSFLNELHSDITKRGGSPLPLPEGLEECRSSKSSSVIQSWLWDVPGFRRWRVTRLDAGDSLQVFNSVAYPDYNYDHPL
MGVDLLWFGARQKLVAVLDFQPLVQDKDYLDRYFSGLKELNQRFPDLNGEETMRSFDPNQYFSSWLLFCRGGAEQADLSL
PKAFSAFLKAYWDLHDNAKSIPSTIPPEEVKNLQDKYDIYSAERDPAHGLFTSHFGKDWSNRFLHEFLFPASSSHK
>B2HE92 ~~~pecA~~~PE cleavage protein A~~~COG3391
MSLLVVAPEWLTSAAAELQSIESALSAANAAAAVPTTGLAAAAADEVSTAVATLFAGFGQEYQAISTQLSAFQQQFALTL
NSSAGSYSAAEAQSVSILDTLGQDVFGAINAPTEALLGRPLIGNGANGTATSPNGGAGGLLFGNGGIGYSQTGAGIVGGA
GGSAGLIGNGGAGGTGGAGATGGAGGNGGWLFGSGGIGGTGGANALGTGGTGGLGGSAGLFGGGGNGGAGGLGISGDLGT
GGAGGTGGFLLGDYGVSGAGGDGRTVPLEVVNVTEPVVNVNVNGGHSTPVLIDTGSAGLVMQVKDVGGPLGLLRMGLPSG
ISMSAYSGGLTYLFATYPTTVDFGNGIVTSTTGVDVVLFSIPTSPYALTTWLNALWSNPLTTPFDAYFQSAGVDGVLGVG
PNAVGPGPSIPTQALGGGLGQGLLIDMKGGELVFGPNPLTPEFSISGAPIATLWVSVNGGAPVAVPSIIDSGGVMGTIPS
SVIGGSTLPANTNITVYTDNTMTTEVYHYSTNDYQPTVISSGLMNTGFLPFWNQPVYIDYSPAGTGTTVFDMP
>P9WIF1 ~~~pecA~~~PE cleavage protein A~~~COG0657
MSFLVVVPEFLTSAAADVENIGSTLRAANAAAAASTTALAAAGADEVSAAVAALFARFGQEYQAVSAQASAFHQQFVQTL
NSASGSYAAAEATIASQLQTAQHDLLGAVNAPTETLLGRPLIGDGAPGTATSPNGGAGGLLYGNGGNGYSATASGVGGGA
GGSAGLIGNGGAGGAGGPNAPGGAGGNGGWLLGNGGIGGPGGASSIPGMSGGAGGTGGAAGLLGWGANGGAGGLGDGVGV
DRGTGGAGGRGGLLYGGYGVSGPGGDGRTVPLEIIHVTEPTVHANVNGGPTSTILVDTGSAGLVVSPEDVGGILGVLHMG
LPTGLSISGYSGGLYYIFATYTTTVDFGNGIVTAPTAVNVVLLSIPTSPFAISTYFSALLADPTTTPFEAYFGAVGVDGV
LGVGPNAVGPGPSIPTMALPGDLNQGVLIDAPAGELVFGPNPLPAPNVEVVGSPITTLYVKIDGGTPIPVPSIIDSGGVT
GTIPSYVIGSGTLPANTNIEVYTSPGGDRLYAFNTNDYRPTVISSGLMNTGFLPFRFQPVYIDYSPSGIGTTVFDHPA
>P29729 ~~~pecE~~~Bilin biosynthesis protein PecE~~~
MTAACCAQPILSPEAAIAALAGEDNQIRYYAAWWLGKHQVQAGCAALCEALFDERYRIPSGGYPLRRQAARALGQLKNPQ
AVPALIAALACEEDLGLREAVIQALAMIGDRRAVHPLVQLLQSQQPQPYEALIEALATLQVWSARPQIEPFLYHSSERVQ
CAAARYLYLLTKQPQYLERIVQNLNHDNMYLRWAAIFDLAALGHRQAVDAILAAKVPNSLKLLNLKRILETLLDGDRSQL
NNGEFCQNQSDRETIELLFQAIDDLLIQL
>P29730 ~~~pecF~~~Bilin biosynthesis protein PecF~~~
MIAARFTNSKQLISQLNCALSPADALCAIAAISDSETTEVAVISALLQLLSRHHHHSSVATAAVEVLVKLAPASVEPLLA
AFRSCSDQGFQAWIIQALAMIGDAKAFDLLAEVVGTEVANHCQGNVRRIAARGLGKIGSTVKDREVTDRAIEKLHWALVT
PQDWGLRYAAVVSLQEIATPQAHAVLSAAVAGESDWVVRSRMKKALEQIPAC
>A9KNI6 ~~~~~~HTH-type transcriptional regulator Cphy_2742~~~COG1609
MNIYDVSQKAGVSIATVSRVINGNPNVSEKTKQKVLDVMKEIGYTPNVFARGLGLNTMKTIGIMCSDSSDTFLANAVYHL
EQNLRKNGYDSFLCCTGYELHTKQKYLKLLLSKRVDAIVMVGSSFLEANKKDNAYIMEAADEVPIMLINGYLSHPRIYCT
LCDDHQAVYDAVSKLITQGRKEILYLYNSKSYSGLQKLSGYKAALAAHELPLDENLMQMCPNNLADTKNLLNSLVKQGLN
YDAVVTAEDLLAIGTIKFIKDSGRQIPQDVSIIGYNNSLLTTCCDPELTSIDNHVETLCISTISTLMRVLNGNDVPNKTT
ISNDLIVRKTTNF
>Q5P5I4 1.1.1.311~~~ped~~~(S)-1-Phenylethanol dehydrogenase~~~COG1028
MTQRLKDKLAVITGGANGIGRAIAERFAVEGADIAIADLVPAPEAEAAIRNLGRRVLTVKCDVSQPGDVEAFGKQVISTF
GRCDILVNNAGIYPLIPFDELTFEQWKKTFEINVDSGFLMAKAFVPGMKRNGWGRIINLTSTTYWLKIEAYTHYISTKAA
NIGFTRALASDLGKDGITVNAIAPSLVRTATTEASALSAMFDVLPNMLQAIPRLQVPLDLTGAAAFLASDDASFITGQTL
AVDGGMVRH
>P15922 3.2.1.82~~~pehX~~~Exo-poly-alpha-D-galacturonosidase~~~
MKVITFSRRSALASIVATCLMSTPALAATAQAPQKLQIPTLSYDDHSVMLVWDTPEDTSNITDYQIYQNGQLIGLASQNN
DKNSPAKPYISAFYKSDAANFHHRIVLQNAKVDGLKAGTDYQFTVRTVYADGTTSNDSNTVTTTTTAVPKVINITQYGAK
GDGTTLNTSAIQKAIDACPTGCRIDVPAGVFKTGALWLKSDMTLNLLQGATLLGSDNAADYPDAYKIYSYVSQVRPASLL
NAIDKNSSAVGTFKNIRIVGKGIIDGNGWKRSADAKDELGNTLPQYVKSDNSKVSKDGILAKNQVAAAVATGMDTKTAYS
QRRSSLVTLRGVQNAYIADVTIRNPANHGIMFLESENVVENSVIHQTFNANNGDGVEFGNSQNIMVFNSVFDTGDDSINF
AAGMGQDAQKQEPSQNAWLFNNFFRHGHGAVVLGSHTGAGIVDVLAENNVITQNDVGLRAKSAPAIGGGAHGIVFRNSAM
KNLAKQAVIVTLSYADNNGTIDYTPAKVPARFYDFTVKNVTVQDSTGSNPAIEITGDSSKDIWHSQFIFSNMKLSGVSPT
SISDLSDSQFNNLTFSNLRSGSSPWKFGTVKNVTVDGKTVTP
>P94449 4.2.2.10~~~pelB~~~Pectin lyase~~~
MKRFCLWFAVFSLLLVLLPGKAFGAVDFPNTSTNGLLGFAGNAKNEKGISKASTTGGKNGQIVYIQSVNDLKTHLGGSTP
KILVLQNDISASSKTTVTIGSNKTLVGSYAKKTLKNIYLTTSSASGNVIFQNLTFEHSPQINGNNDIQLYLDSGINYWID
HVTFSGHSYSASGSDLDKLLYVGKSADYITISNSKFANHKYGLILGYPDDSQHQYDGYPHMTIANNYFENLYVRGPGLMR
YGYFHVKNNYSNNFNQAITIATKAKIYSEYNYFGKGSEKGGILDDKGTGYFKDTGSYPSLNKQTSPLTSWNPGSNYSYRV
QTPQYTKDFVTKYAGSQSTTLVFGY
>P24112 4.2.2.10~~~pnl~~~Pectin lyase~~~
MAYPTTNLTGLIGFAKAAKVTGGTGGKVVTVNSLADFKSAVSGSAKTIVVLGSSLKTSALTKVVFGSNKTIVGSFGGANV
LTNIHLRAESNSSNVIFQNLVFKHDVAIKDNDDIQLYLNYGKGYWVDHCSWPGHTWSDNDGSLDKLIYIGEKADYITISN
CLFSNHKYGCIFGHPADDNNSAYNGYPRLTICHNYYENIQVRAPGLMRYGYFHVFNNYVNKFQLAFTVAQNANVISERNV
FGSGAEKKGMVDDKGNGSTFTDNGSSPAAVASKSPAAKWTASSNYSYSLMTTAAAQSWVVSNAGAQNSALKFPS
>P27027 4.2.2.10~~~pnl~~~Pectin lyase~~~
MSYPESKLTGLTGFALAAKVTGGWAGPVVSITNLDQLKANIGTVTPQVLVINSNISASSLTKVNMGANKTLIGSFQNRTL
ENIHLRATAQSQNIILQNLIFKHSANIKANDDIQVYLNYGSKYWIDHCSFVGHSWSTTDGSEDKLLYIGEKADYATISNC
FFGSHKYGLIFGHPADDNNAAFNGYPRLTLCHNRFDNMEVRAPGLMRYGYFHVYNNYINKFHLGFTLAQNANILSESNYF
GEGSQNNGMLDDKGSGTFTDTNSVPPITNQKSPKAQWTATSNYAYTLKTAAQAKDFTQKNAGAQAAALVFGS
>P13975 ~~~pemI~~~Antitoxin PemI~~~
MHTTRLKRVGGSVMLTVPPALLNALSLGTDNEVGMVIDNGRLIVEPYRRPQYSLAELLAQCDPNAEISAEEREWLDAPAT
GQEEI
>P13976 3.1.-.-~~~pemK~~~Endoribonuclease PemK~~~
MLKYQLKNENGWMHRRLVRRKSDMERGEIWLVSLDPTAGHEQQGTRPVLIVTPAAFNRVTRLPVVVPVTSGGNFARTAGF
AVSLDGVGIRTTGVVRCDQPRTIDMKARGGKRLERVPETIMNEVLGRLSTILT
>Q02940 3.5.2.6~~~penA~~~Beta-lactamase~~~
MQRIGVTDYTILGTVKGAELELVRFTHPFMGFDVPAILGDHVTRMPVPVPFTPRLPRPGRLCDRSEIRPGKPLTRLARTA
LICRALIRRWMARTSYSDSVNCHTQPISAIFDYKDLRFEPPSNRISPAGQTSVDRLLQLSQGQAVEGQSAVARLTGEKKN
HPGAQYANRLSPRIANNHPATQQTLFELGSGAKERNAINVSYLTALGTPGFTLMLPARMLCGIVSDNNFTQKQLCPSPAR
CTRGPAEPAKRGPWLEPGLVIRKDGLRTGKLLSSLRGLCLTVLRFQPTVPCFCRLALSSSVAISSTGLVNFSR
>Q55012 4.2.3.7~~~penA~~~Pentalenene synthase~~~
MPQDVDFHIPLPGRQSPDHARAEAEQLAWPRSLGLIRSDAAAERHLRGGYADLASRFYPHATGADLDLGVDLMSWFFLFD
DLFDGPRGENPEDTKQLTDQVAAALDGPLPDTAPPIAHGFADIWRRTCEGMTPAWCARSARHWRNYFDGYVDEAESRFWN
APCDSAAQYLAMRRHTIGVQPTVDLAERAGRFEVPHRVFDSAVMSAMLQIAVDVNLLLNDIASLEKEEARGEQNNMVMIL
RREHGWSKSRSVSHMQNEVRARLEQYLLLESCLPKVGEIYQLDTAEREALERYRTDAVRTVIRGSYDWHRSSGRYDAEFA
LAAGAQGYLEELGSSAH
>E3VWK3 1.14.13.170~~~penE~~~Pentalenolactone D synthase~~~
MREKYRQERDKRSVGRTYQFARGDFSRYARDPYTERQEREPLTDEVDVAVVGAGIGGLLTGAHLRKETGLERIRLIDGAG
DVGGTWYWNRFPGVRCDVESYIYMPLLEETGTIPREKYSTGPEIFAHLQQIAHRYDLYRDALFQTTVTELRWDEAAGRWL
VSTDRGDLIRARYVAMSIGLMHRPKLPGLPGLETFAGHSFHTSRWDFDYTGGDSTGGLTKLKDKKVGVIGTGSTTIQLAP
HLAEWAEQLILFQRTPAAVDVRGNRPTPPEWAAGLAPGWQQRRMENFHALTSGVPQDEDLVQDRWTQTTAKLAAAILPTG
DTGGDPKERALAAERADFLKMEELRARIDSVVTDPATAAALKPYYRVYCKRPCFHDGYLQTFNRPNVTLVDTQGQGVERL
TPAGVVANGREYPLDCLIFATGYEHEFAVPYTERAGYDIVGRDGLRLSEKWADGARTLHGLQVNGFPNCFILSKVQAGRH
VNIAYMLGEQTRHLAHIVKCVEERGHQVVEASEAGEKEWVEEILRLATNDIDFLENCTPGLYNNEGDPSGLPLLNSSYGG
GSVEFVNILRRWREAGDLAGLELR
>E3VWJ9 1.14.19.8~~~penM~~~Pentalenolactone synthase~~~
MNELPRLPFDNPAILGIAPQMRALQKEGPIVRVRTAGEDAWLITRYDEVKALLSDRRLGLSDPKPERAAKSTARITMMAL
MAGDDYDREATEHPQMRELLVPRFSTRRMRVMKARIEQHVDELLDQLAASVAPVDLHRALSFPLPTMVVCDLLGVPLADR
ERIGQWARGTFDQSDSLHSVNTFQQVVDYMMELVQRKRTEPGDDILSELIAEKDGTLSDEYIAHLGCAVLLFGYETTIVR
IDMGVLLMLRNPAQRALLAENPALAPAAVEEILRLAVGGKGSNALIPRYAHSDITVGETVIRTGDAVMLAIGAANIDGHA
FPHADLFDLSREKPKAHMAFGHGTRHCIGRVLARIELTAVFERLFRRLPNLQLAVPEESLRWQEHRITGGFDEIPVTF
>I6Y4D2 3.5.1.28~~~~~~N-acetylmuramoyl-L-alanine amidase Rv3717~~~COG0860
MIVGVLVAAATPIISSASATPANIAGMVVFIDPGHNGANDASIGRQVPTGRGGTKNCQASGTSTNSGYPEHTFTWETGLR
LRAALNALGVRTALSRGNDNALGPCVDERANMANALRPNAIVSLHADGGPASGRGFHVNYSAPPLNAIQAGPSVQFARIM
RDQLQASGIPKANYIGQDGLYGRSDLAGLNLAQYPSILVELGNMKNPADSALMESAEGRQKYANALVRGVAGFLATQGQA
R
>Q48677 3.4.11.7~~~pepA~~~Glutamyl aminopeptidase~~~COG1363
MELFDKVKALTEIQATSGFEGPVRDYLKARMVELGYQPEFDGLGGIFVTKASKVENAPRIMVAAHMDEVGFMVSSIKADG
TFRVVPLGGWNPLVVSGQRFTLFTRTGKKIPVVTGGLPPHLLRGTGVTPQIPAISDIIFDGAFENAAEAAEFGIAQGDLI
IPETETILSANGKNIISKAWDNRYGCLMILELLEFLADKELPVTLIIGANVQEEVGLRGAKVSTTKFNPDLFFAVDCSPA
SDTFGDDNGRLGEGTTLRFFDPGHIMLPGMKNFLLDTANHAKVKTQVYMAKGGTDAGAAHLANGGVPSTTIGVVARYIHS
HQTIFNIDDFLQAQTFLRAIITSLNTEKVAEIKNY
>P37095 3.4.11.23~~~pepB~~~Peptidase B~~~COG0260
MTEAMKITLSTQPADARWGEKATYSINNDGITLHLNGADDLGLIQRAARKIDGLGIKHVQLSGEGWDADRCWAFWQGYKA
PKGTRKVVWPDLDDAQRQELDNRLMIIDWVRDTINAPAEELGPSQLAQRAVDLISNVAGDRVTYRITKGEDLREQGYMGL
HTVGRGSERSPVLLALDYNPTGDKEAPVYACLVGKGITFDSGGYSIKQTAFMDSMKSDMGGAATVTGALAFAITRGLNKR
VKLFLCCADNLISGNAFKLGDIITYRNGKKVEVMNTDAEGRLVLADGLIDASAQKPEMIIDAATLTGAAKTALGNDYHAL
FSFDDALAGRLLASAAQENEPFWRLPLAEFHRSQLPSNFAELNNTGSAAYPAGASTAAGFLSHFVENYQQGWLHIDCSAT
YRKAPVEQWSAGATGLGVRTIANLLTA
>Q9RF52 3.4.11.23~~~pepB~~~Peptidase B~~~
MTEAMKITLSTQPADARWGDKATYSINNDGITLHLNGKDDLGLIQRAARKIDGLGIKQVALTGEGWDTERCWAFWAGYKG
PKGVRTVMWPDLDDAQRQELDNRLTIIDWVRDTINAPAEELGPEQLAQRAVDLLCSVACDSVTYRITKGEDLREQNYMGL
HTVGRGSERPPVLLALDYNPTGDKDAPVYACLVGKGITFDSGGYSIKQSAFMDSMKSDMGGAATVTGALAFAITRGLNKR
VKLFLCCADNLISGNAFKLGDIIRYRNGKNVEVMNTDAEGRLVLADGLIDASAQHPQLIIDMATLTGAAKTALGNDYHAL
FSFDDTLAGRLLTSAAQENEPFWRLPLAEFHRNQLPSNFAELNNTGSAAYPAGASTAAGFLSHFVENYREGWLHIDCSAT
YRKAPVEQWAAGATGLGVRTIANLLTA
>P58475 3.4.11.23~~~pepB~~~Peptidase B~~~COG0260
MTTEIMQISLSHNPADARWGEKALISTNDQGVTIHLTSHDQLGGIQRAARKIDGQGIKQVKLAGEGWGLEQSWAFWQGFR
GPKGQRSVVWAELPANEKTELEQRLKIIDWVRDTINAPAEDLGPEQLAKNAIDLLCAVSCDAVSYRITKGEDLREQNYAG
IYTVGRGSDRAPVLLALDYNPTGNPDAPVMACLVGKGITFDSGGYSLKQSAFMDSMKSDMGGAATLTGALALAAARGLKE
RVKLYLCCADNMVSGNAFKLGDIIRYRNGKTVEIMNTDAEGRLVLADGLIDASEQNAPLIIDAATLTGAAKTALGNDYHA
LFSFDDELAQALLNSAHSEHELFWRLPLAEFHRSQLPSNFAELNNVAGGAYSAGASTAAAFLSHFVKNYQQGWLHIDCSA
TYRKSAVDQWSAGATGLGVRTVANLLLAQAKQ
>A0A0C5URS1 4.1.1.38~~~~~~PPi-type phosphoenolpyruvate carboxykinase~~~
MSVVERRQINAAINLRLSLLGLPHPDSNAESPDAILVEPLLARQRELSRRLKDRLSAPDLRIQRFLDDYLADCDEHPQLP
RTTLVLDEPGLARGLSLPVDGDEFHSDIVASYRLVNGVLHNPKHDRRTTAGVFHISTGGLPIPQDKVEVDKNVYARILAR
AFQAPDEELALPYTANLPEQAHCWASLLMRPTVLPAVPGRTTEKSYEVHFIVPGGLMCNLDFVEGIFGNAGDPYLPENDA
SLDPDSWTGHTGCVILAPHLTTMTKKSLGMPHYDDATERQRRDGQCWRHEDDLYNDGKAFKVCARDERGVIVTVIADNYF
GYCKKEVKTQISYSANLLGGAEEEHSGGAEVYPAWNLNQDFTDRTPDDFTLADVISTNRELLDVRPEGYAVYKPEPNIVF
IPEHSHYSMRTQTISWTAHGAEQTIKLLAGKHYLSPDGYRIHAKHREMDATQWHLIGTSSRAVTCHKPATVSGGGKSEIS
KSISDAFVFGNAFSHDIDSAMDQVQALFDTDFTNRFADASRNGTDHRPVLSIDRSLGSVIKLLTPSIQYNDEYNAFLEGI
EPDVKELAFTVKRYYLPEWGEDWRSHFTVGIMNGRHGNMVRLDGKKIITNMLRVGFREDGSWRLFTLRPDYSPAVKVQTE
DDITASTVTPPWEDAEGLPRKYVTNCEHLLFQRPDDAIHRGYDKQAEFDLASGTDTFISNFEPLTHEQARDLLTDVQAYS
EFTKPVRKLIERVAAMPDDQSPEFWVCSDDPRHLPDGGRSKNPRYLQVRPTDSNPELTTVADVAGKLARKLPLAGHAPQP
IDVVAAGRRNNPPEDKVPALCAYNPLHYMELPELFMEYISSMTGKSPSTTGAGSEGALTKGPFNALPAVYDLNAALLSYA
LTDYDGWLSSAGYIGPNARVDHDISMLIPELFSHMGPNDRNTKRLISEGYLEKMQDFDFDGHRVLASRLGYRINDRFVTH
YFGRIFLHPDVVFSEEMLRPELQDEKIFADSIDVIVKTHQRVAQMYFDDGTVSLACPPIRALLEIMAHGASAEGWTLDSP
EFRKLFERESVLASDWYAQRLDAKQAEDVKQAEEGVERLKEYIGRSDSGSVTGRLHLADRLRELEAQLTYERSPEYRQSL
VGTLGRQPRFV
>Q04723 3.4.22.40~~~pepC~~~Aminopeptidase C~~~
MTVTSDFTQKLYENFAENTKLRAVENAVTKNGLLSSLEVRGSHAANLPEFSLDLTKDPVTNQKQSGRCWMFAALNTFRHK
FINEFKTEDFEFSQAYTFFWDKYEKSNWFMEQIIGDVAMDDRRLKFLLQTPQQDGGQWDMMVAIFDKYGIVPKAVYPESQ
ASSSSRELNQYLNKLLRQDAEILRYTIEQDGDVQAVKEELLQEVFNFLAVTLGLPPQNFEFAFRNKDNEYKKFVGTPKEF
YNEYVGIDLNNYVSVINAPTADKPYNKSYTVEFLGNVVGGKEVKHLNVEMDRFKKLAIAQMQAGETVWFGCDVGQESNRS
AGLLTMDSYDFKSSLDIEFTQSKAGRLDYGESLMTHAMVLAGVDLDADGNSTKWKVENSWGKDAGQKGYFVASDEWMDEY
TYQIVVRKDLLSEEELAAYEAKPQVLLPWDPMGALA
>Q48558 3.4.13.19~~~pepDA~~~Dipeptidase A~~~COG4690
MKQTECTTILVGKKASIDGSTMIARSEDGGRVIIPEGFKVVNPEDQPKHYTSVISKQKIDDEDLAETPLRYTSAPDVSGK
NGIWGAAGINADNVAMTATETITTNSRIQGVDPILDPSEGGLGEEDFVTLTLPYLHSAFDGVKRVGYLVEKYGTYEMNGM
AFSDKDNIWYLETIGGHHWIARRIPDDAYVIAPNRLNIDTFDFDDSENFATASDLKDLIDEYHLNPDREGYNMRHIFGSS
TIKDAHYNNPRAWYIHNYFDPDFGGTPADQDQPFICRANRLISIEDIKWAESSHYQDTPYDAYGDQGTPEQKKTFRPIGI
NRNFETHILQIRNDVPAEIAGVQWLAFGPNTFNSMLPFYTNVTTTPEAWQTTPKFNLNKIFWLNKLTAQLGDTNYRVYGE
LEDAFEQKSLAQCHKIQHETDKEVKNLSGKELQDKLIAANQKMSDTVYNNTVELLGQMVDEGHGLMTLKYDLLD
>Q8G6Z9 3.4.13.19~~~pepD~~~Dipeptidase~~~
MACTTILVGKDASYDGSTIIARNEDSANGEFNPKRFIVVKPEEQPREYKSVISHLTITLPDDPLQYTAVPNADLKEGIWG
EAGVNEANVAMSATETLTTNERVLGADPFVEYTPAKGDEPEVPGGIGEEDFLTIVLPYVKTAREGVQRLGALLEEFGTYE
MNGVAFSDSNEIWWLETVGGHHWIAKRVPDEAYVTMPNQLGIDEFDLEDALGDQEAHMCSEDLAEFIETNHLDLAVENTT
PFNPRDAFGSHSDSDHVYNTPRAWYMQRFLNPYDEVWDGPDADHKPTSDDIPWARQPERKVTIEDIKYVLSSHYQGTPFD
PYGQLGDERTRHMYRTIGINRQSQLAVMQIRPYRPQASRAIQWMAYGSNPFNTLVPFFPNVDTTPAYLEDTTTRVTSENF
YWANRIIAALCDGAFRSTSNAVERYQEKTGAMGHRLVAATDEQIARLGLTAAEEAAQSAAEEEFEADNVDGDVQPMTPDE
TIAALRNPEVREILAAANQTMADQLKEETEKLLDSVLYTRSMEMKNGFHMSDF
>P15288 3.4.13.18~~~pepD~~~Cytosol non-specific dipeptidase~~~COG2195
MSELSQLSPQPLWDIFAKICSIPHPSYHEEQLAEYIVGWAKEKGFHVERDQVGNILIRKPATAGMENRKPVVLQAHLDMV
PQKNNDTVHDFTKDPIQPYIDGEWVKARGTTLGADNGIGMASALAVLADENVVHGPLEVLLTMTEEAGMDGAFGLQGNWL
QADILINTDSEEEGEIYMGCAGGIDFTSNLHLDREAVPAGFETFKLTLKGLKGGHSGGEIHVGLGNANKLLVRFLAGHAE
ELDLRLIDFNGGTLRNAIPREAFATIAVAADKVDVLKSLVNTYQEILKNELAEKEKNLALLLDSVANDKAALIAKSRDTF
IRLLNATPNGVIRNSDVAKGVVETSLNVGVVTMTDNNVEIHCLIRSLIDSGKDYVVSMLDSLGKLAGAKTEAKGAYPGWQ
PDANSPVMHLVRETYQRLFNKTPNIQIIHAGLECGLFKKPYPEMDMVSIGPTITGPHSPDEQVHIESVGHYWTLLTELLK
EIPAK
>O53896 3.4.21.107~~~pepD~~~Serine protease PepD~~~COG0265
MAKLARVVGLVQEEQPSDMTNHPRYSPPPQQPGTPGYAQGQQQTYSQQFDWRYPPSPPPQPTQYRQPYEALGGTRPGLIP
GVIPTMTPPPGMVRQRPRAGMLAIGAVTIAVVSAGIGGAAASLVGFNRAPAGPSGGPVAASAAPSIPAANMPPGSVEQVA
AKVVPSVVMLETDLGRQSEEGSGIILSAEGLILTNNHVIAAAAKPPLGSPPPKTTVTFSDGRTAPFTVVGADPTSDIAVV
RVQGVSGLTPISLGSSSDLRVGQPVLAIGSPLGLEGTVTTGIVSALNRPVSTTGEAGNQNTVLDAIQTDAAINPGNSGGA
LVNMNAQLVGVNSAIATLGADSADAQSGSIGLGFAIPVDQAKRIADELISTGKASHASLGVQVTNDKDTLGAKIVEVVAG
GAAANAGVPKGVVVTKVDDRPINSADALVAAVRSKAPGATVALTFQDPSGGSRTVQVTLGKAEQ
>P0A7C6 3.4.13.21~~~pepE~~~Peptidase E~~~COG3340
MELLLLSNSTLPGKAWLEHALPLIAEQLQGRRSAVFIPFAGVTQTWDDYTAKTAAVLAPLGVSVTGIHSVVDPVAAIENA
EIVIVGGGNTFQLLKQCRERGLLAPITDVVKRGALYIGWSAGANLACPTIRTTNDMPIVDPQGFDALNLFPLQINPHFTN
ALPEGHKGETREQRIRELLVVAPELTIIGLPEGNWITVSKGHATLGGPNTTYVFKAGEEAVPLEAGHRF
>P94870 3.4.22.-~~~pepE~~~Aminopeptidase E~~~COG3579
MAHELTVQELEKFSADFNKNPKNKVVARAAQRSGVLEASYNDRVQSELTRVFSTELDTDNVTNQKHSGRCWLFATLNVLR
HEFGKKYKAKDFTFSQAYNFFWDKIERANMFYNRILDSADMPLDSRQVKTDLDFAGTDGGQFQMAAALVEKYGVVPSYAM
PETFNTNDTTGFATALGDKLKKDALVLRKLKQEGKDDEIKKTREKFLSEVYQMTAIAVGEPPKKFDLEYRDDDKKYHLEK
DLTPLEFLHKYLGGVDFDDYVVLTNAPDHEYDKLYGLPAEDNVSGSIRIKLLNVPMEYLTAASIAQLKDGEAVWFGNDVL
RQMDRKTGYLDTNLYKLDDLFGVDLKMSKADRLKTGVGEVSHAMTLVGVDEDNGEVRQWKVENSWGDKSGAKGYYVMNNE
WFNDYVYEVVVHKKYLTDKQKELAEGPITDLPAWDSLA
>P9WHS7 3.4.13.-~~~pepE~~~Probable dipeptidase PepE~~~COG0006
MGSRRFDAEVYARRLALAAAATADAGLAGLVITPGYDLCYLIGSRAETFERLTALVLPAAGAPAVVLPRLELAALKQSAA
AELGLRVCDWVDGDDPYGLVSAVLGGAPVATAVTDSMPALHMLPLADALGVLPVLATDVLRRLRMVKEETEIDALRKAGA
AIDRVHARVPEFLVPGRTEADVAADIAEAIVAEGHSEVAFVIVGSGPHGADPHHGYSDRELREGDIVVVDIGGTYGPGYH
SDSTRTYSIGEPDSDVAQSYSMLQRAQRAAFEAIRPGVTAEQVDAAARDVLAEAGLAEYFVHRTGHGIGLCVHEEPYIVA
GNDLVLVPGMAFSIEPGIYFPGRWGARIEDIVIVTEDGAVSVNNCPHELIVVPVS
>P36936 3.4.13.21~~~pepE~~~Peptidase E~~~
MELLLLSNSTLPGKAWLEHALPLIANQLNGRRSAVFIPFAGVTQTWDEYTDKTAEVLAPLGVNVTGIHRVADPLAAIEKA
EIIIVGGGNTFQLLKESRERGLLAPMADRVKRGALYIGWSAGANLACPTIRTTNDMPIVDPNGFDALDLFPLQINPHFTN
ALPEGHKGETREQRIRELLVVAPELTVIGLPEGNWIQVSNGQAVLGGPNTTWVFKAGEEAVALEAGHRF
>P54124 3.4.24.-~~~pepF1~~~Oligoendopeptidase F, plasmid~~~
MAKNRNEIPEKLTWDLTTIYKTDKEWEAELTRIKSELSLVEETDPGHLLDSAESLLTITEKMLSISQQVEKLYVYASMKN
DQDTREAKYQEYQSKATALYVKFGEVYAFYEPEFLKISKEVYNKWLGELQKLKNYDHMFERLFAKKAHILSQKEEKLLAA
AGEIFESPSETFEIFDNADIKLPMVKNESDEMIQLTHGNYSSLMESKNRGVRKAAYKALYSNYEQYQHTYAKTLQTNVKV
HNLNAQIRSYDSARQAALANNFVPEKVYDVLMEAIHQHLPLLHRYIELRKKILGITDLKMYGIYTPLSNLGYKFNYEDGV
KKAEEVLAIFGKEYKGKVKAAFEQRWIDVEENIGKRSGAYSGGSYDTNAFMLLNWQETLDDLFTLVHETGHSMHSAFTRE
NQPYVYGNYPIFLAEIASTTNENILTETLLKESKDDKERFALLNHWLDSFRGTVFRQSQFAEFEQKIHEADAAGEVLTSE
YLNSLYGEINEKYYNLAVKGNPEIQYEWARIPHFYYNFYVFQYATGFAAATFLAEKVVHGSTEDRQKYLEYLKAGSSAYP
LEVIAKAGVDMESTDYLDAAFELFENRLSELEKLVEKGVHL
>Q07744 3.4.24.-~~~pepO~~~Neutral endopeptidase~~~COG3590
MTRIQDDLFATVNAEWLENAEIPADKPRISAFDELVLKNEKNLAKDLADLSQNLPTDNPELLEAIKFYNKAGDWQTREKA
DFSAVKNELAKVETLNTFEDFKNNLTQLVFHSQAPLPFSFSVEPDMKDAIHYSLGFSGPGLILPDTTYYNDEHPRKKELL
DFWAKNTSEILKTFDVENAEEIAKSALKFDALLVPSANTSEEWAKYAELYHPISTDSFVSKVKNLDLKSLIKDLVKTEPD
KVIVYEDRFYESFDSLINEENWSLIKAWMLTKIARGATSFFNEDLRILGGAYGRFLSNVQEARSQEKHQLDLTESYFSQV
IGLFYGKKYFGEAAKADVKRMVTAMIKVYQARLSKNEWLSQETAEKAIEKLDAITPFIGFPDKLPEIYSRLKTTSGSLYE
DALKFDEILTARTFEKFSEDVDKTSWHMPAHMVNAYYSPDSNTIVFPAAILQAPFYSLEQSSSQNYGGIGTVIAHEISHA
FDNNGAQFDKEGNLNKWWLDEDYEAFEEKQKEMIALFDGVETEAGPANGKLIVSENIADQGGITAALTAAKDEKDVDLKA
FFSQWAKIWRMKASKEFQQMLLSMDVHAPAKLRANIPPTNLEEFYETFDVKETDKMYRAPENRLKIW
>P0C2B4 3.4.24.-~~~pepO~~~Neutral endopeptidase~~~
MTRIQDDLFATVNAEWLENAEIPADKPRISAFDELVLKNEKNLAKDLADLSQNLPTDNPELLEAIKFYNKAGDWQAREKA
DFSAVKNELAKVETLNTFEDFKNNLTQLVFHSQAPLPFSFSVEPDMKDAIHYSLGFSGPGLILPDTTYYNDEHPRKKELL
DFWAKNTSEILKTFDVENAEEIAKSALKFDALLVPSANTSEEWAKYAELYHPISTDSFVSKVKNLDLKSLIKDLVKTEPD
KVIVYEDRFYESFDSLINEENWSLIKAWMLTKIARGATSFFNEDLRILGGAYGRFLSNVQEARSQEKHQLDLTESYFSQV
IGLFYGKKYFGEAGKADVKRMVTAMIKVYQARLSKNEWLSQETAEKAIEKLDAITPFIGFPDKLPEIYSRLKTTSGSLYE
DALKFDEILTARTFEKFSEDVDKTSWHMPAHMVNAYYSPDSNTIVFPAAILQAPFYSLEQSSSQNYGGIGTVIAHEISHA
FDNNGAQFDKEGNLNKWWLDEDYEAFEEKQKEMIALFDGVETEAGPANGKLIVSENIADQGGITAALTAAKDEKDVDLKA
FFSQWAKIWRMKASKEFQQMLLSMDFHAPAKLRANIPPTNLEEFYDTFDVKETDKMYRAPENRLKIW
>Q02VB0 3.4.24.-~~~pepO~~~Neutral endopeptidase~~~
MTRIQDDLFATVNAEWLENAEIPADKPRISAFDELVLKNEKNLAKDLADLSQNLPTDNPELLEAIKFYNKAGDWQAREKA
DFSAVKNELAKVETLNTFEDFKNNLTQLVFHSQAPLPFSFSVEPDMKDAIHYSLGFSGPGLILPDTTYYNDEHPRKKELL
DFWAKNTSEILKTFDVENAEEIAKSALKFDALLVPSANTSEEWAKYAELYHPISTDSFVSKVKNLDLKSLIKDLVKTEPD
KVIVYEDRFYESFDSLINEENWSLIKAWMLTKIARGATSFFNEDLRILGGAYGRFLSNVQEARSQEKHQLDLTESYFSQV
IGLFYGKKYFGEAGKADVKRMVTAMIKVYQARLSKNEWLSQETAEKAIEKLDAITPFIGFPDKLPEIYSRLKTTSGSLYE
DALKFDEILTARTFEKFSEDVDKTSWHMPAHMVNAYYSPDSNTIVFPAAILQAPFYSLEQSSSQNYGGIGTVIAHEISHA
FDNNGAQFDKEGNLNKWWLDEDYEAFEEKQKEMIALFDGVETEAGPANGKLIVSENIADQGGITAALTAAKDEKDVDLKA
FFSQWAKIWRMKASKEFQQMLLSMDFHAPAKLRANIPPTNLEEFYDTFDVKETDKMYRAPENRLKIW
>Q44238 3.4.13.9~~~pepQ~~~Xaa-Pro dipeptidase~~~
MNKLAVLYAEHIATLQKRTREIIERENLDGVVFHSGQAKRQFLDDMYYPFKVNPQFKAWLPVIDNPHCWIVANGTDKPKL
IFYRPVDFWHKVPDEPNEYWADYFDIELLVKPDQVEKLLPYDKARFAYIGEYLEVAQALGFELMNPEPVMNFYHYHRAYK
TQYELACMREANKIAVQGHKAARDAFFQGKSEFEIQQAYLLATQHSENDNAYGNIVALNENCAILHYTHFDRVAPATHRS
FLIDAGANFNGYAADITRTYDFTGEGEFAELVATMKQHQIALCNQLAPGKLYGELHLDCHQRVAQTLSDFNIVDLSADEI
VAKGITSTFFPHGLGHHIGLQVHDVGGFMADEQGAHQEPPEGHPFLRCTRKIEANQVFTIEPGLYFIDSLLGDLAATDNN
QHINWDKVAELKPFGGIRIEDNIIVHEDSLENMTRELRARLTTHSLRGLSAPQFSINDPAVMSEYSYPSEPLSYEEEIKK
STFIVHVRTRRILVRRRTLSPILIAVTPMPAITAGLM
>P21165 3.4.13.9~~~pepQ~~~Xaa-Pro dipeptidase~~~COG0006
MESLASLYKNHIATLQERTRDALARFKLDALLIHSGELFNVFLDDHPYPFKVNPQFKAWVPVTQVPNCWLLVDGVNKPKL
WFYLPVDYWHNVEPLPTSFWTEDVEVIALPKADGIGSLLPAARGNIGYIGPVPERALQLGIEASNINPKGVIDYLHYYRS
FKTEYELACMREAQKMAVNGHRAAEEAFRSGMSEFDINIAYLTATGHRDTDVPYSNIVALNEHAAVLHYTKLDHQAPEEM
RSFLLDAGAEYNGYAADLTRTWSAKSDNDYAQLVKDVNDEQLALIATMKAGVSYVDYHIQFHQRIAKLLRKHQIITDMSE
EAMVENDLTGPFMPHGIGHPLGLQVHDVAGFMQDDSGTHLAAPAKYPYLRCTRILQPGMVLTIEPGIYFIESLLAPWREG
QFSKHFNWQKIEALKPFGGIRIEDNVVIHENNVENMTRDLKLA
>Q9S6S1 3.4.13.9~~~pepQ~~~Xaa-Pro dipeptidase~~~
MNLDKLQNWLQENGMDVAYVSSPTTINYFTGFITDPEERIFKLFAFKDAEPFLFCPALNYEEAKASAWDGDVVGYLDSED
PWGKIAEEIKQRTKDYQNWAVEKNGLTVAHYQALHAQFPDSDFSKDLSDFIAHIRLFKTESELVKLRKAGEEADFAFQIG
FEALRNGVTERAVVSQIEYQLKLQKGVMQTSFDTIVQAGKNAANPHQGPSMNTVQPNELVLFDLGTMHEGYASDSSRTVA
YGEPTDKMREIYEVNRTAQQAAIDAAKPGMTASELDGVARKIITDAGYGEYFIHRLGHGIGMEVHEFPSIANGNDVVLEE
GMCFSIEPGIYIPGFAGVRIEDCGVLTKEGFKPFTHTSKELKVLPVKE
>P77814 3.4.13.9~~~pepQ~~~Xaa-Pro dipeptidase~~~COG0006
MEKLAVLYAEHIATLQQRTRTICEQEGLEGLVIHSGQAKRQFLDDMYYPFKVNPHFKAWLPVIDNPHCWIVVNGSDKPKL
IFYRPIDFWHKVPDEPRDFWAEYFDIELLLQPDQVEKLLPYDKAKFAYIGEYLEVAQALGFSIMNPEPVLNYIHYHRAYK
TQYELECLRNANRIAVDGHKAARDAFFNGGSEFDIQQAYLMATRQSENEMPYGNIVALNENCAILHYTHFEPKAPQTHNS
FLIDAGANFNGYAADITRTYDFKKQGEFADLVNAMTAHQIELGKSLKPGLLYGDLHIDCHNRIAQLLSDFDIVKLPAAEI
VERQITSTFFPHGLGHHLGAQVHDVGGFMRDETGAHQAPPEGHPFLRCTRLIEKNQVFTIEPGLYFIDSLLGDLAQTDNK
QFINWEKVEAFKPFGGIRIEDNIIVHEDSLENMTRNLLLD
>Q9X4A7 3.4.11.-~~~pepS~~~Aminopeptidase PepS~~~COG2309
MVLPNFKENLEKYAKLLVTNGINVQPGHTVALSIDVEQAELAHLLVKEAYALGAAEVIVQWSDDTINRERFLHAEMNRIE
EVPAYKKAEMEYLLEKKASRLGVRSSDPDAFNGVAPERLSAHAKAIGAAFKPMQVATQSNKVSWTVAAAAGKEWAKKVFP
NASSDEEAVDLLWNQIFKTCRVYEKDPVRAWKEHADRLDAKARILNEAQFSALHYTAPGTDLTLGLPKNHVWESAGAINA
QGESFLPNMPTEEVFTAPDFRRAYGYVSSTKPLSYNGNIIEGIKVTFKDGEIVDITADQGEKVMKNLVFNNNGARALGEC
ALVPDSSPISQSGITFFNTLFDENASNHLAIGAAYATSVEGGADMTEEELKAAGLNRSDVHVDFIIGSNQMNIDGIHHDG
SRVPIFRNGDWVI
>Q76HM7 3.4.11.4~~~pepT~~~Peptidase T~~~
MKYEKLLPRFLEYVKVNTRSDENSTTTPSTQALVEFAHKMGEDMKALGLKDVHYLESNGYVIGTIPANTDKKVRKIGLLA
HLDTADFNAEGVNPQILENYDGESVIQLGDTEFTLDPKDFPNLKNYKGQTLVHTDGTTLLGSDDKSGVAEIMTLADYLLN
INPDFEHGEIRVGFGPDEEIGVGADKFDVADFDVDFAYTVDGGPLGELQYETFSAAGAVIEFQGKNVHPGTAKNMMVNAL
QLAIDYHNALPEFDRPEKTEGREGFFHLLKLDGTPEEARAQYIIRDHEEGKFNERKALMQEIADKMNAELGQNRVKPVIK
DQYYNMAQIIEKDMSIIDIAKKAMENLDIAPIIEPIRGGTDGSKISFMGLPTPNLFAGGENMHGRFEFVSVQTMEKAVDT
LLEIIRLNNEVAK
>Q81WU4 3.4.11.4~~~pepT~~~Peptidase T~~~COG2195
MKEELIERFTRYVKIDTQSNEDSHTVPTTPGQIEFGKLLVEELKEVGLTEVTMDDNGYVMATLPANTDKDVPVIGFLAHL
DTATDFTGKNVKPQIHENFDGNAITLNEELNIVLTPEQFPELPSYKGHTIITTDGTTLLGADDKAGLTEIMVAMNYLIHN
PQIKHGKIRVAFTPDEEIGRGPAHFDVEAFGASFAYMMDGGPLGGLEYESFNAAGAKLTFNGTNTHPGTAKNKMRNATKL
AMEFNGHLPVEEAPEYTEGYEGFYHLLSLNGDVEQSKAYYIIRDFDRKNFEARKNTIENIVKQMQEKYGQDAVVLEMNDQ
YYNMLEKIEPVREIVDIAYEAMKSLNIEPNIHPIRGGTDGSQLSYMGLPTPNIFTGGENYHGKFEYVSVDVMEKAVQVII
EIARRFEEQA
>P29745 3.4.11.4~~~pepT~~~Peptidase T~~~COG2195
MDKLLERFLNYVSLDTQSKAGVRQVPSTEGQWKLLHLLKEQLEEMGLINVTLSEKGTLMATLPANVPGDIPAIGFISHVD
TSPDCSGKNVNPQIVENYRGGDIALGIGDEVLSPVMFPVLHQLLGQTLITTDGKTLLGADDKAGIAEIMTALAVLQQKKI
PHGDIRVAFTPDEEVGKGAKHFDVDAFDARWAYTVDGGGVGELEFENFNAASVNIKIVGNNVHPGTAKGVMVNALSLAAR
IHAEVPADESPEMTEGYEGFYHLASMKGTVERADMHYIIRDFDRKQFEARKRKMMEIAKKVGKGLHPDCYIELVIEDSYY
NMREKVVEHPHILDIAQQAMRDCDIEPELKPIRGGTDGAQLSFMGLPCPNLFTGGYNYHGKHEFVTLEGMEKAVQVIVRI
AELTAQRK
>Q9L4G1 3.4.11.4~~~pepT~~~Peptidase T~~~COG2195
MEYPNLLPRFLKYVKVNSRSDENSDRFPSTEREENFQKNVIMKDLEELGLSDIHYNQKAGSVIAEIPSNVDYDVPVMGFL
AHSDTADFNSENVKPQIHKNYDGESKIQLGDSEFYLDPEVYPNLRKYKGQTIITASGDTLLGADDKCGISELMTFAEYLM
NHPEVKHGKIRLAFTPDEEIGTGAEQFDVKDFGADFAFTVDGEAPGKLGDCTFSAAQFTLDIQGVNVHPAVAKGQMINAV
QVGIDFHNQLPEHDRPEHTDGREGFFHLLSFDGTVDHAHLAYIIRDFERDGLEERKNLVKSIVKKMNDEFGTERIKLQMN
DQYYNMADELKKHMDIVDLARDAYKAEGLEVNEDPVRGGTDGSQLTYMGLPCPNIFAGEENMHGRYEYTVLESMYKTVDV
MIKMAELNAERAK
>P0C2T7 3.4.11.4~~~pepT~~~Peptidase T~~~
MKYEKLLPRFLEYVKVNTRSDENSTTTPSTQALVEFAHKMGEDMKALGLKDVHYLESNGYVIGTIPANTDKKVRKIGLLA
HLDTADFNAEGVNPQILENYDGESVIQLGDTEFTLDPKDFPNLKNYKGQTLVHTDGTTLLGSDDKSGVAEIMTLADYLLN
INPDFEHGEIRVGFGPDEEIGVGADKFDVADFDVDFAYTVDGGPLGELQYETFSAAGAVIEFQGKNVHPGTAKNMMVNAL
QLAIDYHNALPEFDRPEKTEGREGFFHLLKLDGTPEEARAQYIIRDHEEGKFNERKALMQEIADKMNAELGQNRVKPLIK
DQYYNMAQIIEKDMSIIDIAKKAMENLDIAPIIEPIRGGTDGSKISFMGLPTPNLFAGGENMHGRFEFVSVQTMEKAVDT
LLEIIRLNNEVAK
>Q84BV2 3.4.11.4~~~pepT~~~Peptidase T~~~
MKYEKLLPRFLEYVKVNTRSDENSTTTPSTQALVEFAHKMGEDMKALGLKDVHYLESNGYVIGTIPANTDKKVRKIGLLA
HLDTADFNAEGVNPQILENYDGESVIKLGDTEFTLDPKDFPSLKNYKGQTLVHTDGTTLLGSDDKSGVAEIMTLAEYLLN
INPDFEHGEIRVGFGPDEEIGVGADKFDVADFDVDFAYTVDGGPLGELQYETFSAAGAVIEFQGKNVHPGTAKNTMVNAL
QLAIDYHNALPEFDRPEKTEGREGFFHLLKLDGTPEEARAQYIIRDHEEGKFNERKALMQEIADKMNAEFGQNRVKPVIK
DQYYNMAQIIEKDMSIIDIAKKAMENLDIVPIIEPIRGGTDGSKISFMGLPTPNLFAGGENMHGRFEFVSVQTMEKAVDT
LLEIIRLNNEVVK
>Q76HM5 3.4.11.4~~~pepT~~~Peptidase T~~~
MKYEKLLPRFLEYVKVNTRSDENSTTTPSTQALVEFAHKMGEDMKALGLKDVHYLESNGYVIGTIPANTDKKVRKIGLLA
HLDTADFNAEGVNPQILENYDGESVIKLGDTEFTLDPKDFPNLKNYKGQTLVHTDGTTLLGSDDKSGVAEIMTLADYLLN
INPDFEHGEIRVGFGPDEEIGVGADKFDVADFDVDFAYTVDGGPLGELQYETFSAAGAVIEFQGKNVHPGTAKNTMVNAL
QLAIDYHNALPEFDRPEKTEGREGFFHLLKLDGTPEEARAQYIIRDHEEGKFNERKALMQEIADKMNAELGQNRVKPVIK
DQYYNMAQIIEKDMSIIDIAKKAMENLDIVPIIEPIRGGTDGSKISFMGLPTPNLFAGGENMHGRFEFVSVQTMEKAVDT
LLEIIRLNNEVVK
>P26311 3.4.11.4~~~pepT~~~Peptidase T~~~
MDKLLERFLHYVSLDTQSKSGVRQVPSTEGQWKLLRLLKQQLEEMGLVNITLSEKGTLMATLPANVEGDIPAIGFISHVD
TSPDFSGKNVNPQIVENYRGGDIALGIGDEVLSPVMFPVLHQLLGQTLITTDGKTLLGADDKAGVAEIMTALAVLKGNPI
PHGDIKVAFTPDEEVGKGAKHFDVEAFGAQWAYTVDGGGVGELEFENFNAASVNIKIVGNNVHPGTAKGVMVNALSLAAR
IHAEVPADEAPETTEGYEGFYHLASMKGTVDRAEMHYIIRDFDRKQFEARKRKMMEIAKKVGKGLHPDCYIELVIEDSYY
NMREKVVEHPHILDIAQQAMRDCHITPEMKPIRGGTDGAQLSFMGLPCPNLFTGGYNYHGKHEFVTLEGMEKAVQVIVRI
AELTAKRGQ
>A0A2R9TD79 ~~~~~~Peptide transporter YePEPT~~~
MQTSTNTPGGRTFFGHPYPLSGLFLSEMWERFSFYGIRPLLILFMAATVFDGGMGLPREQASAIVGIFAGSMYLAALPGG
LLADNWLGQQRAVWYGSILIALGHLSIALSAFFGNDLFFIGLVFIVLGTGLFKTCISVMVGTLYKPGDARRDGGFSLFYM
GINMGSFIAPLLSGWLLRTHGWHWGFGIGGIGMLVALLIFRGFAIPAMKRYDAEVGLDSSWNKPTNQRQGVGRWVTAIMA
VVVVIIALISQGVIPINPVMIASLLVYVIAASVTLYFIYLFAFAKMSRKDRARLLVCFILLVSAAFFWSAFEQKPTSFNL
FANDYTDRMVMGFEIPTVWFQSINALFIILLAPVFSWAWPALAKKKIQPSSITKFVIGILCAAAGFAVMMYAAQHVLSSG
GAGVSPLWLVMSILLLTLGELCLSPIGLATMTLLAPDRMRGQVMGLWFCASSLGNLAAGLIGGHVKADQLDMLPTLFARC
SIALVICAAVLILLIVPIRRLMNNTQGQQTA
>Q5HF23 3.4.13.-~~~~~~Putative dipeptidase SACOL1801~~~
MWKEKVQQYEDQIINDLKGLLAIESVRDDAKASEDAPVGPGPRKALDYMYEIAHRDGFTTHDVDHIAGRIEAGKGNDVLG
ILCHVDVVPAGDGWDSNPFEPVVTEDAIIARGTLDDKGPTIAAYYAIKILEDMNVDWKKRIHMIIGTDEESDWKCTDRYF
KTEEMPTLGFAPDAEFPCIHGEKGITTFDLVQNKLTEDQDEPDYELITFKSGERYNMVPDHAEARVLVKENMTDVIQDFE
YFLEQNHLQGDSTVDSGILVLTVEGKAVHGMDPSIGVNAGLYLLKFLASLNLDNNAQAFVAFSNRYLFNSDFGEKMGMKF
HTDVMGDVTTNIGVITYDNENAGLFGINLRYPEGFEFEKAMDRFANEIQQYGFEVKLGKVQPPHYVDKNDPFVQKLVTAY
RNQTNDMTEPYTIGGGTYARNLDKGVAFGAMFSDSEDLMHQKNEYITKKQLFNATSIYLEAIYSLCVEE
>Q7A522 3.4.13.-~~~~~~Putative dipeptidase SA1572~~~
MWKEKVQQYEDQIINDLKGLLAIESVRDDAKASEDAPVGPGPRKALDYMYEIAHRDGFTTHDVDHIAGRIEAGKGNDVLG
ILCHVDVVPAGDGWDSNPFEPVVTEDAIIARGTLDDKGPTIAAYYAIKILEDMNVDWKKRIHMIIGTDEESDWKCTDRYF
KTEEMPTLGFAPDAEFPCIHGEKGITTFDLVQNKLTEDQDEPDYELITFKSGERYNMVPDHAEARVLVKENMTDVIQDFE
YFLEQNHLQGDSTVDSGILVLTVEGKAVHGMDPSIGVNAGLYLLKFLASLNLDNNAQAFVAFSNRYLFNSDFGEKMGMKF
HTDVMGDVTTNIGVITYDNENAGLFGINLRYPEGFEFEKAMDRFANEIQQYGFEVKLGKVQPPHYVDKNDPFVQKLVTAY
RNQTNDMTEPYTIGGGTYARNLDKGVAFGAMFSDSEDLMHQKNEYITKKQLFNATSIYLEAIYSLCVEE
>P45494 3.4.13.-~~~pepV~~~Beta-Ala-Xaa dipeptidase~~~
MDLNFKELAEAKKDAILKDLEELIAIDSSEDLENATEEYPVGKGPVDAMTKFLSFAKRDGFDTENFANYAGRVNFGAGDK
RLGIIGHMDVVPAGEGWTRDPFKMEIDEEGRIYGRGSADDKGPSLTAYYGMLLLKEAGFKPKKKIDFVLGTNEETNWVGI
DYYLKHEPTPDIVFSPDAEYPIINGEQGIFTLEFSFKNDDTKGDYVLDKFKAGIATNVTPQVTRATISGPDLEAVKLAYE
SFLADKELDGSFEINDESADIVLIGQGAHASAPQVGKNSATFLALFLDQYAFAGRDKNFLHFLAEVEHEDFYGKKLGIFH
HDDLMGDLASSPSMFDYEHAGKASLLNNVRYPQGTDPDTMIKQVLDKFSGILDVTYNGFEEPHYVPGSDPMVQTLLKVYE
KQTGKPGHEVVIGGGTYGRLFERGVAFGAQPENGPMVMHAANEFMMLDDLILSIAIYAEAIYELTKDEEL
>P40334 3.4.14.11~~~pepX~~~Xaa-Pro dipeptidyl-peptidase~~~
MKYNQYAYVETSPEKATEELLAINFLPENYSSLSFSELLAVLTGNVLAEATTRQAKDAKLAEFAVDDQTDLAAFLLDTPT
AITASQFANVALQLLGYHPNYDYSLTDPLTCGKKHALPAFKDLTSKEELIFTFYRLLNTRSKNGQILLDVMAGKGYFTQF
WGEGKFMFFNGKSLPVFDTSQVIREVVYVQSDLDTDGDGKGDLLPVTVFRPVESQDQLKVPALYTASPYFGGIIDNVKTN
HNVDENLTDATTWTNPKYVAKPLVKSPAPSDQDVPATELATGQSSYGLNEYLLARGFASVFSGAIGNRHGDGIRITGSPE
ETISQKEVIEWLTGDRVAYTDRTRRFETKASWCSGNVGMTGRSYLGTLQIAIATTGVKGLKTVVSEAAISSWYDYYREHG
LVVAPSECQGEDMDKLAEVCQSNLWDGGNFTAKKAYEAEQAELLAAQDRATGQYSDFWESRNYRHHTDGIKCSWISVHGL
NDWNVKPKNVYKIWQKVKQLPVKSHLFLHQGPHYNMNNLVSIDFTDLMNLWFVHELLEVENGAYEQWPKVMIQDNLEADE
WHAESDWASDLGQASLYLPTADGDLSTVENGTGQLTFTDLGGTEFKKAGISETDWEYQFISGEEKWAKASLRFESEEFLH
PTTLVGRPKVRVRVAANKTVGQLSVALVDLGTRQRLTATPKIFARGNQPFGYRFGADSLQEFVPDKATKAKLITKAHMNL
QNYQDMKQPSKLEAGQFVDLEFELQPTYYTLPAGAKLGLIIYSTDQGMTKRPLETEDYTVDLAGTALLLYRK
>Q59485 3.4.14.11~~~pepX~~~Xaa-Pro dipeptidyl-peptidase~~~COG2936
MKYNQYAYVETDFQQQVKELIDINFLPKNYQVWDFSSLLAKLVKNAIAEAKTDAAKNAKLAEFAVSDHQTLADFLKEKPT
EIGTKQFYNVALQLLGYHVHYDYDFADPTGFMQRNALPFLQDISDNQKLISAFYRLLNTRAKNGQILLDVMAGKGYFTQF
WGQNKFKFFNGKSIPVFDTNKVIREVVYVETDLDTDHDGKSDLIQVTVFRPEETNKGLKVPALYTASPYFGGIIANEKRN
HNVDENLSDSTEWNDPQYVHSPIVKAEKPDGSSRPATEEAVHKSSYPLNEYMLARGFASVFAGAIGTRGSDGVRITGAPE
ETESAAAVIEWLHGDRVAYTDRTRTVQTTADWCNGNIGMTGRSYLGTLQIAIATTGVKGLKTVVSEAAISSWYDYYREHG
LVIAPEACQGEDLDLLAETCQSNLWDAGSYLKIKPEYDKMQKQLREKEDRNTGQYSDFWEARNYRHHADGIKCSWISVHG
LNDWNVKPKNVYKIWQLVKKMPMKHHLFLHQGPHYNMNNLVSIDFTDLMNLWFVHELLGIENNAYNQWPTVMIQDNLQAD
KWHEEPDWSNDLGQEKIYYPTDEGELFQDGNGKAQKSFTDVGGIEFKKAGISESDWQYKFICGDEKWAKPSLRFETDEFT
HPTTIVGRPEVKVRVSASLPKGEISVALVELGERQRLTATPKFLMHGGQELGYRFGTDTLQEFVPDKKTKAKLITKAHMN
LQNFKDMKKPEAIDADKFYDLDFLLQPTYYTIPSGSKLALIIYSTDQGMTKRPLEDETYTIDLANTEIKFYEK
>P22346 3.4.14.11~~~pepX~~~Xaa-Pro dipeptidyl-peptidase~~~
MRFNHFSIVDKNFDEQLAELDQLGFRWSVFWDEKKILKDFLIQSPSDMTALQATAELDVIEFLKSSIELDWEIFWNIALQ
LLDFVPNFDFEIGKAFEYAKNSNLPQIEAEMTTENIISAFYYLLCTRRKTGMILVEHWVSEGLLPLDNHYHFFNDKSLAT
FDSSLLEREVLWVESPVDSEQRGENDLIKIQIIRPKSTEKLPVVMTASPYHLGINDKANDLALHDMNVELEEKTSHEIHV
EQKLPQKLSAKAKELPIVDKAPYRFTHGWTYSLNDYFLTRGFASIYVAGVGTRSSDGFQTSGDYQQIYSMTAVIDWLNGR
ARAYTSRKKTHEIKASWANGKVAMTGKSYLGTMAYGAATTGVEGLELILAEAGISSWYNYYRENGLVRSPGGFPGEDLDV
LAALTYSRNLDGADFLKGNAEYEKRLAEMTAALDRKSGDYNQFWHDRNYLINTDKVKADVLIVHGLQDWNVTPEQAYNFW
KALPEGHAKHAFLHRGAHIYMNSWQSIDFSETINAYFVAKLLDRDLNLNLPPVILQENSKDQVWTMMNDFGANTQIKLPL
GKTAVSFAQFDNNYDDETFKKYSKDFNVFKKDLFENKANEAVIDLELPSMLTINGPVELELRLKLNDTKGFLSAQILDFG
QKKRLEDKVRVKDFKVLDRGRNFMLDDLVELPLVESPYQLVTKGFTNLQNQSLLTVSDLKADEWFTIKFELQPTIYHLEK
ADKLRVILYSTDFEHTVRDNRKVTYEIDLSQSKLIIPIESVKN
>Q93M42 3.4.14.11~~~pepX~~~Xaa-Pro dipeptidyl-peptidase~~~
MRYNQYSYTKASEEVMLDELARLGFTIQTTNSPKENLHHFLQKILFRYQDVNYVLSSWVADQKTDLLTFFQSDKQLTEEV
FYTVALQVLGFAPFVDFDDVTAFCKEIHFPITYGNILENLYQLLNTRTKLGNTLIDQLVSEGFIPESNDYHFFNGKSLAT
FSSHEAIREVVYVESRVDTDGDGKPDLVKVSIIRPSYEGQVPAVMTASPYHQGTNDKASDKALHNMNVDLSCKNPRTITV
QESSIQTIEPQGQASLVEKAEEKLGHIGSYTLNDYLLPRGFANLYVSGVGTKDSEGMMTSGDYQQIEAYKNVIDWLNGRC
RAFTDHTRQREIKATWSNGKVATTGISYLGTMSNGLATTGVDGLEVIIAEAGISSWYNYYRENGLVTSPGGYPGEDFESL
TELTYSRNLLAGEYLRHNQAYQAYLDQQRKDLERETGDYNQFWHDRNYLIHADKVKAEVVFTHGSQDWNVKPLHVYNMFH
ALPAHIKKHLFFHNGAHVYINNWQSIDFRESMNALLSKKLLGHSSDFDLPPVIWQDNSQAQNWMSLDDFGNQEDYSHFHL
GKGSQEIRNRYSDEDYNRFAKSYQVFKNELFEGKTQQITLDWTLEQDLFINGPAKLKLRLKSSTNKGLISAQLLDYGPAK
RLTPIPSLLEPRVMDNGRYYMLDNLMELPFADTPHRVITKGFLNLQNRTDLLTVEEVVPNQWMELSFELQPTIYKLKKGD
QLRLVLYTTDFEHTVRDKTDYHLSVDMEHSSLSLPHKKS
>Q7DBF7 2.3.1.227~~~perB~~~GDP-perosamine N-acetyltransferase~~~
MNLYGIFGAGSYGRETIPILNQQIKQECGSDYALVFVDDVLAGKKVNGFEVLSTNCFLKAPYLKKYFNVAIANDKIRQRV
SESILLHGVEPITIKHPNSVVYDHTMIGSGAIISPFVTISTNTHIGRFFHANIYSYVAHDCQIGDYVTFAPGAKCNGYVV
IEDNAYIGSGAVIKQGVPNRPLIIGAGAIIGMGAVVTKSVPAGITVCGNPAREMKRSPTSI
>P71086 ~~~perR~~~Peroxide operon regulator~~~COG0735
MAAHELKEALETLKETGVRITPQRHAILEYLVNSMAHPTADDIYKALEGKFPNMSVATVYNNLRVFRESGLVKELTYGDA
SSRFDFVTSDHYHAICENCGKIVDFHYPGLDEVEQLAAHVTGFKVSHHRLEIYGVCQECSKKENH
>Q97FU2 ~~~perR~~~Transcriptional regulator PerR~~~COG0735
MNDISTIFKEKKLKLTPQRIAVYKYLKSTHEHPSAETIYKAIQSDYPTMSLATVYKALKTLAEVHLIQELNVGEGNFRYD
ANSSSHPHIQCLSCGKVDDIMGITFDNLNKDVSSHTDYDVISNKLYFYGICKDCKDKA
>Q57083 ~~~perR~~~HTH-type transcriptional regulator PerR~~~COG0583
MKLLAKAPLNLLRAFEAAGRTGAFALAASELELSPSAISHAIRKLENLLDVRLFQRSTREITLTKEGEILLEHIQRGFNE
LQQGLALVTADESRPLRLHTAPSFAHQWLLPRLGKFIRENPSIDLRLSASTEYARFEQDDFDLDIVYGEPRPSPYEKIPL
AVEELTPLCSPQLAERLKKPEDLYALTLIQCDVQLYQWKGWFEANKMTPPNNYGLRFDRSFMAIAAAVDGLGVVLESKLL
AEREIASGKLVCPLVNSTSEIHYIGHYLVFPQHQHMHSALDVFKTWLLNELNLGKIR
>Q2G282 ~~~perR~~~Peroxide-responsive repressor PerR~~~COG0735
MSVEIESIEHELEESIASLRQAGVRITPQRQAILRYLISSHTHPTADEIYQALSPDFPNISVATIYNNLRVFKDIGIVKE
LTYGDSSSRFDFNTHNHYHIICEQCGKIVDFQYPQLNEIERLAQHMTDFDVTHHRMEIYGVCKECQDK
>Q03035 ~~~prn~~~Pertactin autotransporter~~~COG3468
MNMSLSRIVKAAPLRRTTLAMALGALGALGAAPAAHADWNNQSIIKAGERQHGIHIKQSDGAGVRTATGTTIKVSGRQAQ
GVLLENPAAELRFQNGSVTSSGQLFDEGVRRFLGTVTVKAGKLVADHATLANVSDTRDDDGIALYVAGEQAQASIADSTL
QGAGGVRVERGANVTVQRSTIVDGGLHIGTLQPLQPEDLPPSRVVLGDTSVTAVPASGAPAAVSVFGANELTVDGGHITG
GRAAGVAAMDGAIVHLQRATIRRGDAPAGGAVPGGAVPGGAVPGGFGPLLDGWYGVDVSDSTVDLAQSIVEAPQLGAAIR
AGRGARVTVSGGSLSAPHGNVIETGGGARRFPPPASPLSITLQAGARAQGRALLYRVLPEPVKLTLAGGAQGQGDIVATE
LPPIPGASSGPLDVALASQARWTGATRAVDSLSIDNATWVMTDNSNVGALRLASDGSVDFQQPAEAGRFKVLMVDTLAGS
GLFRMNVFADLGLSDKLVVMRDASGQHRLWVRNSGSEPASANTMLLVQTPRGSAATFTLANKDGKVDIGTYRYRLAANGN
GQWSLVGAKAPPAPKPAPQPGPQPGPQPPQPPQPPQRQPEAPAPQPPAGRELSAAANAAVNTGGVGLASTLWYAESNALS
KRLGELRLNPDAGGAWGRGFAQRQQLDNRAGRRFDQKVAGFELGADHAVAVAGGRWHLGGLAGYTRGDRGFTGDGGGHTD
SVHVGGYATYIANSGFYLDATLRASRLENDFKVAGSDGYAVKGKYRTHGVGASLEAGRRFAHADGWFLEPQAELAVFRVG
GGAYRAANGLRVRDEGGSSVLGRLGLEVGKRIELAGGRQVQPYIKASVLQEFDGAGTVRTNGIAHRTELRGTRAELGLGM
AAALGRGHSLYASYEYSKGPKLAMPWTFHAGYRYSW
>P14283 ~~~prn~~~Pertactin autotransporter~~~COG3468
MNMSLSRIVKAAPLRRTTLAMALGALGAAPAAHADWNNQSIVKTGERQHGIHIQGSDPGGVRTASGTTIKVSGRQAQGIL
LENPAAELQFRNGSVTSSGQLSDDGIRRFLGTVTVKAGKLVADHATLANVGDTWDDDGIALYVAGEQAQASIADSTLQGA
GGVQIERGANVTVQRSAIVDGGLHIGALQSLQPEDLPPSRVVLRDTNVTAVPASGAPAAVSVLGASELTLDGGHITGGRA
AGVAAMQGAVVHLQRATIRRGDAPAGGAVPGGAVPGGAVPGGFGPGGFGPVLDGWYGVDVSGSSVELAQSIVEAPELGAA
IRVGRGARVTVSGGSLSAPHGNVIETGGARRFAPQAAPLSITLQAGAHAQGKALLYRVLPEPVKLTLTGGADAQGDIVAT
ELPSIPGTSIGPLDVALASQARWTGATRAVDSLSIDNATWVMTDNSNVGALRLASDGSVDFQQPAEAGRFKVLTVNTLAG
SGLFRMNVFADLGLSDKLVVMQDASGQHRLWVRNSGSEPASANTLLLVQTPLGSAATFTLANKDGKVDIGTYRYRLAANG
NGQWSLVGAKAPPAPKPAPQPGPQPPQPPQPQPEAPAPQPPAGRELSAAANAAVNTGGVGLASTLWYAESNALSKRLGEL
RLNPDAGGAWGRGFAQRQQLDNRAGRRFDQKVAGFELGADHAVAVAGGRWHLGGLAGYTRGDRGFTGDGGGHTDSVHVGG
YATYIADSGFYLDATLRASRLENDFKVAGSDGYAVKGKYRTHGVGASLEAGRRFTHADGWFLEPQAELAVFRAGGGAYRA
ANGLRVRDEGGSSVLGRLGLEVGKRIELAGGRQVQPYIKASVLQEFDGAGTVHTNGIAHRTELRGTRAELGLGMAAALGR
GHSLYASYEYSKGPKLAMPWTFHAGYRYSW
>Q82W83 ~~~petC~~~Ammonia monooxygenase gamma subunit~~~COG2857
MRMIKFLLLAILLAPFVAHSSGQEVKLDKAPIDRADKESLQRGAKGFVEYCLTCHGANFMRFNRHHDIGMSEDDIRADLI
HTGQKTGDLMEAAMRKKEAEGWFGVVPPDLSVIARARGADWLYTYLRTFYQDTSTYSGWNNLIFDKVAMPHVLHHLQGWQ
VLEPGTGNLVQTKPGTMTKEEYDRFVADLVNYMVYLGEPHAPYRRELGITVLLFLFGMLGLTYLLKKEYWRDIH
>P83792 ~~~petD~~~Cytochrome b6-f complex subunit 4~~~
MATLKKPDLSDPKLRAKLAKGMGHNYYGEPAWPNDLLYVFPVVIMGTFACIVALSVLDPAMVGEPADPFATPLEILPEWY
LYPVFQILRSVPNKLLGVLLMASVPLGLILVPFIENVNKFQNPFRRPVATTIFLFGTLVTIWLGIGATFPLDKTLTLGLF
>Q93SX1 ~~~petD~~~Cytochrome b6-f complex subunit 4~~~COG1290
MATHKKPDLSDPTLRAKLAKGMGHNYYGEPAWPNDLLYVFPIVIMGSFACIVALAVLDPAMTGEPANPFATPLEILPEWY
LYPVFQILRSLPNKLLGVLAMASVPLGLILVPFIENVNKFQNPFRRPVATTVFLFGTLVTLWLGIGAALPLDKSLTLGLF
>P27589 ~~~petD~~~Cytochrome b6-f complex subunit 4~~~COG1290
MSIIKKPDLSDPDLRAKLAKGMGHNYYGEPAWPNDILYMFPICILGALGLIAGLAILDPAMIGEPADPFATPLEILPEWY
LYPTFQILRILPNKLLGIAGMAAIPLGLMLVPFIESVNKFQNPFRRPIAMTVFLFGTAAALWLGAGATFPIDKSLTLGLF
>P83797 ~~~petG~~~Cytochrome b6-f complex subunit 5~~~
MVEPLLDGLVLGLVFATLGGLFYAAYQQYKRPNELGG
>P58246 ~~~petG~~~Cytochrome b6-f complex subunit 5~~~
MVEPLLSGIVLGLIVVTLAGLFYAAYKQYKRPNELGG
>P74149 ~~~petG~~~Cytochrome b6-f complex subunit 5~~~
MIEPLLLGIVLGLIPVTLAGLFVAAYLQYKRGNQFNLD
>P0DX29 3.1.1.74~~~cut1~~~Cutinase~~~
MSALTSQPTSSGSSEKIPRLRGWRAKAAGVVLAALALTTGVAAPAPAAANPYERGPDPTTASIEATSGSFATSTVTVSRL
AVSGFGGGTIYYPTTTTAGTFGALSIAPGFTATQSSIAWLGPRLASQGFVVFTIDTLTTSDQPDSRGRQLLASLDYLTQQ
SSVRSRIDSTRLGVVGHSMGGGGTLEAARSRPTLQAAVPLTAWDLTKNWSTLQVPTLVVGAQSDTVAPVASHSIPFYTSL
PSTLDRAYLELRGASHFAPNSPNTTIAKYTLSWLKRFIDNDTRYEQFLCPIPSTSLSISDYRGNCPHNG
>D4Q9N1 3.1.1.74~~~est1~~~Cutinase est1~~~
MSVTTPRREASLLSRAVAVAAAAAATVALAAPAQAANPYERGPNPTESMLEARSGPFSVSEERASRLGADGFGGGTIYYP
RENNTYGAIAISPGYTGTQSSIAWLGERIASHGFVVIAIDTNTTLDQPDSRARQLNAALDYMLTDASSSVRNRIDASRLA
VMGHSMGGGGTLRLASQRPDLKAAIPLTPWHLNKSWRDITVPTLIIGADLDTIAPVSSHSEPFYNSIPSSTDKAYLELNN
ATHFAPNITNKTIGMYSVAWLKRFVDEDTRYTQFLCPGPRTGLLSDVDEYRSTCPF
>G8GER6 3.1.1.74~~~cut_1~~~Cutinase cut1~~~
MPPHAARPGPAQNRRGRAMAVITPRRERSSLLSRALRFTAAAATALVTAVSLAAPAHAANPYERGPNPTDALLEARSGPF
SVSEERASRFGADGFGGGTIYYPRENNTYGAVAISPGYTGTQASVAWLGERIASHGFVVITIDTNTTLDQPDSRARQLNA
ALDYMINDASSAVRSRIDSSRLAVMGHSMGGGGTLRLASQRPDLKAAIPLTPWHLNKNWSSVRVPTLIIGADLDTIAPVL
THARPFYNSLPTSISKAYLELDGATHFAPNIPNKIIGKYSVAWLKRFVDNDTRYTQFLCPGPRDGLFGEVEEYRSTCPF
>Q47RJ7 3.1.1.74~~~~~~Cutinase~~~COG4188
MPPHAARPGPAQNRRGRAMAVITPRRERSSLLSRALRFTAAAATALVTAVSLAAPAHAANPYERGPNPTDALLEARSGPF
SVSEERASRFGADGFGGGTIYYPRENNTYGAVAISPGYTGTQASVAWLGERIASHGFVVITIDTNTTLDQPDSRARQLNA
ALDYMINDASSAVRSRIDSSRLAVMGHSMGGGGTLRLASQRPDLKAAIPLTPWHLNKNWSSVRVPTLIIGADLDTIAPVL
THARPFYNSLPTSISKAYLELDGATHFAPNIPNKIIGKYSVAWLKRFVDNDTRYTQFLCPGPRDGLFGEVEEYRSTCPF
>F7IX06 3.1.1.74~~~est2~~~Cutinase est2~~~
MSVTTPRRETSLLSRALRATAAAATAVVATVALAAPAQAANPYERGPNPTESMLEARSGPFSVSEERASRFGADGFGGGT
IYYPRENNTYGAIAISPGYTGTQSSIAWLGERIASHGFVVIAIDTNTTLDQPDSRARQLNAALDYMLTDASSAVRNRIDA
SRLAVMGHSMGGGGTLRLASQRPDLKAAIPLTPWHLNKSWRDITVPTLIIGAEYDTIASVTLHSKPFYNSIPSPTDKAYL
ELDGASHFAPNITNKTIGMYSVAWLKRFVDEDTRYTQFLCPGPRTGLLSDVEEYRSTCPF
>Q6A0I4 3.1.1.74~~~cut2~~~Cutinase cut2~~~
MAVMTPRRERSSLLSRALQVTAAAATALVTAVSLAAPAHAANPYERGPNPTDALLEASSGPFSVSEENVSRLSASGFGGG
TIYYPRENNTYGAVAISPGYTGTEASIAWLGERIASHGFVVITIDTITTLDQPDSRAEQLNAALNHMINRASSTVRSRID
SSRLAVMGHSMGGGGTLRLASQRPDLKAAIPLTPWHLNKNWSSVTVPTLIIGADLDTIAPVATHAKPFYNSLPSSISKAY
LELDGATHFAPNIPNKIIGKYSVAWLKRFVDNDTRYTQFLCPGPRDGLFGEVEEYRSTCPF
>Q47RJ6 3.1.1.74~~~TfH~~~Cutinase~~~COG4188
MAVMTPRRERSSLLSRALQVTAAAATALVTAVSLAAPAHAANPYERGPNPTDALLEASSGPFSVSEENVSRLSASGFGGG
TIYYPRENNTYGAVAISPGYTGTEASIAWLGERIASHGFVVITIDTITTLDQPDSRAEQLNAALNHMINRASSTVRSRID
SSRLAVMGHSMGGGGTLRLASQRPDLKAAIPLTPWHLNKNWSSVTVPTLIIGADLDTIAPVATHAKPFYNSLPSSISKAY
LELDGATHFAPNIPNKIIGKYSVAWLKRFVDNDTRYTQFLCPGPRDGLFGEVEEYRSTCPF
>A0A0K8P6T7 3.1.1.101~~~~~~Poly(ethylene terephthalate) hydrolase~~~
MNFPRASRLMQAAVLGGLMAVSAAATAQTNPYARGPNPTAASLEASAGPFTVRSFTVSRPSGYGAGTVYYPTNAGGTVGA
IAIVPGYTARQSSIKWWGPRLASHGFVVITIDTNSTLDQPSSRSSQQMAALRQVASLNGTSSSPIYGKVDTARMGVMGWS
MGGGGSLISAANNPSLKAAAPQAPWDSSTNFSSVTVPTLIFACENDSIAPVNSSALPIYDSMSRNAKQFLEINGGSHSCA
NSGNSNQALIGKKGVAWMKRFMDNDTRYSTFACENPNSTRVSDFRTANCS
>G9BY57 3.1.1.74~~~~~~Leaf-branch compost cutinase~~~
MDGVLWRVRTAALMAALLALAAWALVWASPSVEAQSNPYQRGPNPTRSALTADGPFSVATYTVSRLSVSGFGGGVIYYPT
GTSLTFGGIAMSPGYTADASSLAWLGRRLASHGFVVLVINTNSRFDYPDSRASQLSAALNYLRTSSPSAVRARLDANRLA
VAGHSMGGGGTLRIAEQNPSLKAAVPLTPWHTDKTFNTSVPVLIVGAEADTVAPVSQHAIPFYQNLPSTTPKVYVELDNA
SHFAPNSNNAAISVYTISWMKLWVDNDTRYRQFLCNVNDPALSDFRTNNRHCQ
>P83795 ~~~petL~~~Cytochrome b6-f complex subunit 6~~~
MILGAVFYIVFIALFFGIAVGIIFAIKSIKLI
>Q8YVQ2 ~~~petL~~~Cytochrome b6-f complex subunit 6~~~
MLAIVAYIGFLALFTGIAAGLLFGLRSAKIL
>P83796 ~~~petM~~~Cytochrome b6-f complex subunit 7~~~
MTEEMLYAALLSFGLIFVGWGLGVLLLKIQGAEKE
>P0A3Y1 ~~~petM~~~Cytochrome b6-f complex subunit 7~~~
MSGELLNAALLSFGLIFVGWALGALLLKIQGAEE
>P74810 ~~~petM~~~Cytochrome b6-f complex subunit 7~~~
MTAESMLANGAFIMIGLTLLGLAWGFVIIKLQGSEE
>P83798 ~~~petN~~~Cytochrome b6-f complex subunit 8~~~
MEIDVLGWVALLVVFTWSIAMVVWGRNGL
>P61048 ~~~petN~~~Cytochrome b6-f complex subunit 8~~~
MAILTLGWVSLLVVFTWSIAMVVWGRNGL
>P72717 ~~~petN~~~Cytochrome b6-f complex subunit 8~~~
MDILTLGWVSVLVLFTWSISMVVWGRNGF
>O68900 3.4.21.-~~~pet~~~Serine protease pet autotransporter~~~
MNKIYSIKYSAATGGLIAVSELAKKVICKTNRKISAALLSLAVISYTNIIYAANMDISKAWARDYLDLAQNKGVFQPGST
HVKIKLKDGTDFSFPALPVPDFSSATANGAATSIGGAYAVTVAHNAKNKSSANYQTYGSTQYTQINRMTTGNDFSIQRLN
KYVVETRGADTSFNYNENNQNIIDRYGVDVGNGKKEIIGFRVGSGNTTFSGIKTSQTYQADLLSASLFHITNLRANTVGG
NKVEYENDSYFTNLTTNGDSGSGVYVFDNKEDKWVLLGTTHGIIGNGKTQKTYVTPFDSKTTNELKQLFIQNVNIDNNTA
TIGGGKITIGNTTQDIEKNKNNQNKDLVFSGGGKISLKENLDLGYGGFIFDENKKYTVSAEGNNNVTFKGAGIDIGKGST
VDWNIKYASNDALHKIGEGSLNVIQAQNTNLKTGNGTVILGAQKTFNNIYVAGGPGTVQLNAENALGEGDYAGIFFTENG
GKLDLNGHNQTFKKIAATDSGTTITNSNTTKESVLSVNNQNNYIYHGNVDGNVRLEHHLDTKQDNARLILDGDIQANSIS
IKNAPLVMQGHATDHAIFRTTKTNNCPEFLCGVDWVTRIKNAENSVNQKNKTTYKSNNQVSDLSQPDWETRKFRFDNLNI
EDSSLSIARNADVEGNIQAKNSVINIGDKTAYIDLYSGKNITGAGFTFRQDIKSGDSIGESKFTGGIMATDGSISIGDKA
IVTLNTVSSLDRTALTIHKGANVTASSSLFTTSNIKSGGDLTLTGATESTGEITPSMFYAAGGYELTEDGANFTAKNQAS
VTGDIKSEKAAKLSFGSADKDNSATRYSQFALAMLDGFDTSYQGSIKAAQSSLAMNNALWKVTGNSELKKLNSTGSMVLF
NGGKNIFNTLTVDELTTSNSAFVMRTNTQQADQLIVKNKLEGANNLLLVDFIEKKGNDKNGLNIDLVKAPENTSKDVFKT
ETQTIGFSDVTPEIKQQEKDGKSVWTLTGYKTVANADAAKKATSLMSGGYKAFLAEVNNLNKRMGDLRDINGEAGAWARI
MSGTGSAGGGFSDNYTHVQVGADNKHELDGLDLFTGVTMTYTDSHAGSDAFSGETKSVGAGLYASAMFESGAYIDLIGKY
VHHDNEYTATFAGLGTRDYSSHSWYAGAEVGYRYHVTDSAWIEPQAELVYGAVSGKQFSWKDQGMNLTMKDKDFNPLIGR
TGVDVGKSFSGKDWKVTARAGLGYQFDLFANGETVLRDASGEKRIKGEKDGRMLMNVGLNAEIRDNVRFGLEFEKSAFGK
YNVDNAINANFRYSF
>Q97QZ2 ~~~pezA~~~Antitoxin PezA~~~COG1396
MIGKNIKSLRKTHDLTQLEFARIVGISRNSLSRYENGTSSVSTELIDIICQKFNVSYVDIVGEDKMLNPVEDYELTLKIE
IVKERGANLLSRLYRYQDSQGISIDDESNPWILMSDDLSDLIHTNIYLVETFDEIERYSGYLDGIERMLEISEKRMVA
>Q97QZ1 2.7.1.176~~~pezT~~~Toxin PezT~~~COG0542
MEIQDYTDSEFKHALARNLRSLTRGKKSSKQPIAILLGGQSGAGKTTIHRIKQKEFQGNIVIIDGDSFRSQHPHYLELQQ
EYGKDSVEYTKDFAGKMVESLVTKLSSLRYNLLIEGTLRTVDVPKKTAQLLKNKGYEVQLALIATKPELSYLSTLIRYEE
LYIINPNQARATPKEHHDFIVNHLVDNTRKLEELAIFERIQIYQRDRSCVYDSKENTTSAADVLQELLFGEWSQVEKEML
QVGEKRLNELLEK
>Q2M5K2 2.3.1.-~~~pE~~~Acyltransferase PE~~~COG5651
MRRRLLAFGTAFTTIGTAGFLGFGVAAADDTKPVDPAPGGAHAEAPSMGTPGRGYALGGAHVLGIPYDEYIMRTGADWFP
GLDRQIVDYPAGQVQGHTLERLFPGIGALGERFMPGLGLDGPSYGESIDVGAPNLINAIRQGGPGTVIGLSEGASVLDEV
QARLAYDPAAPPPDSLSFATYGNPVGKHAFGESFLTQMFPVGSVVPSLDYRIPAPVESQYDTYQFVSAYDSIADWPDRPD
NWISVANAIVGLATGHTAVAFTNPSMVPPQNIRTTVNSRGAKDTTIMIPEEHLPLVLPFKYLGVDKDTLNKLDGVLKPYV
DAGYSRNDDPLTAPITVDPVNGYDPAAVTAPATQAAFGGGTDPVSQLLAGLQYVVNNQPAPKP
>Q8CYC9 ~~~pfbA~~~Plasmin and fibronectin-binding protein A~~~COG5434
MLKIVKKLEVLMKYFVPNEVFSIRKLKVGTCSVLLAISILGSQGILSDEVVTSSSPMATKESSNAITNDLDNSPTVNQNR
SAEMIASNSTTNGLDNSLSVNSISSNGTIRSNSQLDNRTVESTVTSTNENKSYKEDVISDRIIKKEFEDTALSVKDYGAV
GDGIHDDRQAIQDAIDAAAQGLGGGNVYFPEGTYLVKEIVFLKSHTHLELNEKATILNGINIKNHPSIVFMTGLFTDDGA
QVEWGPTEDISYSGGTIDMNGALNEEGTKAKNLPLINSSGAFAIGNSNNVTIKNVTFKDSYQGHAIQIAGSKNVLVDNSR
FLGQALPKTMKDGQIISKESIQIEPLTRKGFPYALNDDGKKSENVTIQNSYFGKSDKSGELVTAIGTHYQTLSTQNPSNI
KILNNHFDNMMYAGVRFTGFTDVLIKGNRFDKKVKGESVHYRESGAALVNAYSYKNTKDLLDLNKQVVIAENIFNIADPK
TKAIRVAKDSAEYLGKVSDITVTKNVINNNSKETEQPNIELLRVSDNLVVSENSIFGGKEGIVIEDSKGKITVLNNQFYN
LSGKYISFIKSNANGKEPVIRDSDGNFNIVTENGLYKIVTNNLSDKNEKEKNKEEKQSNSNNVIDSNQKNGEFNSSKDNR
QMNDKIDNKQDNKTEEVNYKIVGDGRETENHINKSKEIVDVKQKLPKTGSNKIMELFLTVTGIGLLLTLKGLKYYGKDK
>Q05098 ~~~pfeA~~~Ferric enterobactin receptor~~~
MSSRALPAVPFLLLSSCLLANAVHAAGQGDGSVIELGEQTVVATAQEETKQAPGVSIITAEDIAKRPPSNDLSQIIRTMP
GVNLTGNSSSGQRGNNRQIDIRGMGPENTLILVDGKPVSSRNSVRYGWRGERDSRGDTNWVPADQVERIEVIRGPAAARY
GNGAAGGVVNIITKQAGAETHGNLSVYSNFPQHKAEGASERMSFGLNGPLTENLSYRVYGNIAKTDSDDWDINAGHESNR
TGKQAGTLPAGREGVRNKDIDGLLSWRLTPEQTLEFEAGFSRQGNIYTGDTQNTNSNNYVKQMLGHETNRMYRETYSVTH
RGEWDFGSSLAYLQYEKTRNSRINEGLAGGTEGIFDPNNAGFYTATLRDLTAHGEVNLPLHLGYEQTLTLGSEWTEQKLD
DPSSNTQNTEEGGSIPGLAGKNRSSSSSARIFSLFAEDNIELMPGTMLTPGLRWDHHDIVGDNWSPSLNLSHALTERVTL
KAGIARAYKAPNLYQLNPDYLLYSRGQGCYGQSTSCYLRGNDGLKAETSVNKELGIEYSHDGLVAGLTYFRNDYKNKIES
GLSPVDHASGGKGDYANAAIYQWENVPKAVVEGLEGTLTLPLADGLKWSNNLTYMLQSKNKETGDVLSVTPRYTLNSMLD
WQATDDLSLQATVTWYGKQKPKKYDYHGDRVTGSANDQLSPYAIAGLGGTYRLSKNLSLGAGVDNLFDKRLFRAGNAQGV
VGIDGAGAATYNEPGRTFYTSLTASF
>Q9I0F2 3.1.1.108~~~pfeE~~~Iron(III) enterobactin esterase~~~
MRTSLLVAALGLALAAALPGGAPLAQPDPEATMDRSLLQRQDLPYRFSAVDLDSVDGQRHYRLWLGRPLQAPPAAGYPVV
WMLDGNAAVGALDESTLRRLADGDAPLLVAIGYRTPLRIDRAGRTFDYTPASPGQADQRDPLNGLPSGGADAFLDLLRDG
MRPAVAAQAPLDTARQTLWGHSYGGLLVLHALFTRPGEFARYAAASPSLWWRDGAILGERAGLEQRLRGKRAELLLWRGS
AEPASPRGSLKAEPGQAMARLVDDLRRVAGLTLDFQPLDGLGHGETLGASLRLLLARPAVERQR
>Q04804 2.7.13.3~~~pfeS~~~Sensor protein PfeS~~~
MRRHPLLWKLALLQVGFCLLLTWLIYTWGLSVERSTYFLAPADRHYLADYARQAEDAWRREGAAGAERFRKELSAKEDTW
VALVGPHLESLGSTPLSAEESSHLTFMRKLDWPMSRRLQDELPYVSIEFPGHPEQGRLVIQLPERLLPGGLTPWTHLVTH
GIVPTLLAALLGLLLYRHLVVPLNRLRDRADALRADELESTPLAAPLAARRDELGELAQALEHMAERLRLSLAQQRLLLR
TLSHELRTPLARLRIAHDSELPPEQLRQRLDREIGDMQRLLEDTLDLAWMDTERPQLPTEPVLALSVWEALRDDACFESG
WDPARLPCRLGVDCRVEVHLDSLAQAMENLLRNAIRHSPEDGTVSLDGEREGDFWHLRLQDQGPGVAEDQLERIFLPYQR
LDDSAGEGFGLGLAIARRAIELQGGRLWASNGKPGLCLHLWLPAAA
>P0DOB6 2.7.1.11~~~pfkA~~~ATP-dependent 6-phosphofructokinase~~~
MKRIAVLTSGGDAPGMNAAIRAVVRKAISEGIEVYGINHGYAGMVAGDIFPLTSASVGDKIGRGGTFLYSARYPEFAQVE
GQLAGIEQLKKFGIEGVVVIGGDGSYHGAMRLTEHGFPAVGLPGTIDNDIVGTDFTIGFDTAVSTVVDALDKIRDTSSSH
NRTFVVEVMGRNAGDIALNAGIAAGADDISIPELEFKFENVVNNINKGYEKGKNHHIIIVAEGVMTGEEFATKLKEAGYK
GDLRVSVLGHIQRGGSPTARDRVLASRMGARAVELLRDGIGGVAVGIRNEELVESPILGTAEEGALFSLTTEGGIKVNNP
HKAGLELYRLNSALNNLNLN
>P21777 2.7.1.11~~~pfkA~~~ATP-dependent 6-phosphofructokinase 1~~~COG0205
MKRIGVFTSGGDAPGMNAAIRAVVRQAHALGVEVIGIRRGYAGMIQGEMVPLGVRDVANIIQRGGTILLTARSQEFLTEE
GRAKAYAKLQAAGIEGLVAIGGDGTFRGALCLVEEHGMPVVGVPGTIDNDLYGTDYTIGFDTAVNTALEAIDRIRDTAAS
HERVFFIEVMGRHAGFIALDVGLAGGAEVIAVPEEPVDPKAVAEVLEASQRRGKKSSIVVVAEGAYPGGAAGLLAAIREH
LQVEARVTVLGHIQRGGSPTAKDRILASRLGAAAVEALVGGASGVMVGEVEGEVDLTPLKEAVERRKDINRALLRLSQVL
AL
>Q9L1L8 2.7.1.11~~~pfkA2~~~ATP-dependent 6-phosphofructokinase 2~~~COG0205
MRIGVLTAGGDCPGLNAVIRSVVHRAVDNYGDEVIGFEDGYAGLLDGRYRALDLNAVSGILARGGTILGSSRLERDRLRE
ACENAGDMIQNFGIDALIPIGGEGTLTAARMLSDAGLPVVGVPKTIDNDISSTDRTFGFDTAVGVATEAMDRLKTTAESH
QRVMVVEVMGRHAGWIALESGMAAGAHGICLPERPFDPADLVKMVEERFSRGKKFAVVCVAEGAHPAEGSMDYGKGAIDK
FGHERFQGIGTALAFELERRLGKEAKPVILGHVQRGGVPTAYDRVLATRFGWHAVEAAHRGDFGRMTALRGTDVVMVPLA
EAVTELKTVPKDRMDEAESVF
>Q8VU09 2.7.1.11~~~pfkA~~~ATP-dependent 6-phosphofructokinase~~~
MTLHLDDLRVRLLGECRYDSPFAEVLSTKRTSPHYVAEGDRVLLEDTVAMLAEHSLPSVQAPSFEAAGPRRKIYFDPARV
TAGIVTCGGLCPGLNNVIRGLVQELSVHYRVKRIVGFRNGPGLTAAHRDDTVELTPEVVRDIHNLGGTILGSSRGGQDAD
EMVETLALHGVDVMFVIGGDGGMRAATFLSGAIRARGLDIAVIGVPKTIDNDLPFTDQSFGFQSAFARATDFISAVSVEA
AASPNGVGIVKLMGRHSGFIAAYAALAANSADVVLIPEVPFALDGDDGLLAHVERLVRAKGFAVVVVAEGAGQDLFDAHG
LPQLNGRGTDASGNVKLGNIGELLRTSIEAHLTAAGLAPTMRYIDPSYAIRSIPANAYDSVYCLRLAHAAVHAAMAGRTE
AAVARWRRRFVHVPFSLMTRRRNQVDPDGDLWMSVLETTCQPAEFGAVAARERISSGFC
>Q4MVY3 2.7.1.11~~~pfkA~~~ATP-dependent 6-phosphofructokinase~~~COG0205
MKRIGVLTSGGDSPGMNAAIRAVVRKAIFHDIEVYGIYHGYAGLISGHIEKLELGSVGDIIHRGGTKLYTARCPEFKDPE
VRLKGIEQLKKHGIEGLVVIGGDGSYQGAKKLTEQGFPCVGVPGTIDNDIPGTDFTIGFDTALNTVIDAIDKIRDTATSH
ERTYVIEVMGRHAGDIALWAGLADGAETILIPEEEYDMEDVIARLKRGSERGKKHSIIVVAEGVGSAIDIGKHIEEATNF
DTRVTVLGHVQRGGSPSAQDRVLASRLGARAVELLIAGKGGRCVGIQDNKLVDHDIIEALAQKHTIDKDMYQLSKELSI
>O34529 2.7.1.11~~~pfkA~~~ATP-dependent 6-phosphofructokinase~~~COG0205
MKRIGVLTSGGDSPGMNAAVRAVVRKAIYHDVEVYGIYNGYAGLISGKIEKLELGSVGDIIHRGGTKLYTARCPEFKTVE
GREKGIANLKKLGIEGLVVIGGDGSYMGAKKLTEHGFPCVGVPGTIDNDIPGTDFTIGFDTALNTVIDAIDKIRDTATSH
ERTYVIEVMGRHAGDIALWAGLAGGAESILIPEADYDMHEIIARLKRGHERGKKHSIIIVAEGVGSGVEFGKRIEEETNL
ETRVSVLGHIQRGGSPSAADRVLASRLGAYAVELLLEGKGGRCVGIQNNKLVDHDIIEILETKHTVEQNMYQLSKELSI
>P0A796 2.7.1.11~~~pfkA~~~ATP-dependent 6-phosphofructokinase isozyme 1~~~COG0205
MIKKIGVLTSGGDAPGMNAAIRGVVRSALTEGLEVMGIYDGYLGLYEDRMVQLDRYSVSDMINRGGTFLGSARFPEFRDE
NIRAVAIENLKKRGIDALVVIGGDGSYMGAMRLTEMGFPCIGLPGTIDNDIKGTDYTIGFFTALSTVVEAIDRLRDTSSS
HQRISVVEVMGRYCGDLTLAAAIAGGCEFVVVPEVEFSREDLVNEIKAGIAKGKKHAIVAITEHMCDVDELAHFIEKETG
RETRATVLGHIQRGGSPVPYDRILASRMGAYAIDLLLAGYGGRCVGIQNEQLVHHDIIDAIENMKRPFKGDWLDCAKKLY
>P00512 2.7.1.11~~~pfkA~~~ATP-dependent 6-phosphofructokinase~~~
MKRIGVLTSGGDSPGMNAAIRSVVRKAIYHGVEVYGVYHGYAGLIAGNIKKLEVGDVGDIIHRGGTILYTARCPEFKTEE
GQKKGIEQLKKHGIEGLVVIGGDGSYQGAKKLTEHGFPCVGVPGTIDNDIPGTDFTIGFDTALNTVIDAIDKIRDTATSH
ERTYVIEVMGRHAGDIALWSGLAGGAETILIPEADYDMNDVIARLKRGHERGKKHSIIIVAEGVGSGVDFGRQIQEATGF
ETRVTVLGHVQRGGSPTAFDRVLASRLGARAVELLLEGKGGRCVGIQNNQLVDHDIAEALANKHTIDQRMYALSKELSI
>P80019 2.7.1.11~~~pfkA~~~ATP-dependent 6-phosphofructokinase~~~
MKRIGILTSGGDAPGMNAAVRAVTRVAIANGLEVFGIRYGFAGLVAGDIFPLESEDVAHLINVSGTFLYSARYPEFAEEE
GQLAGIEQLKKHGIDAVVVIGGDGSYHGALQLTRHGFNSIGLPGTIDNDIPYTDATIGYDTACMTAMDAIDKIRDTASSH
HRVFIVNVMGRNCGDIAMRVGVACGADAIVIPERPYDVEEIANRLKQAQESGKDHGLVVVAEGVMTADQFMAELKKYGDF
DVRANVLGHMQRGGTPTVSDRVLASKLGSEAVHLLLEGKGGLAVGIENGKVTSHDILDLFDESHRGDYDLLKLNADLSR
>P9WID7 2.7.1.11~~~pfkA~~~ATP-dependent 6-phosphofructokinase~~~COG0205
MRIGVLTGGGDCPGLNAVIRAVVRTCHARYGSSVVGFQNGFRGLLENRRVQLHNDDRNDRLLAKGGTMLGTARVHPDKLR
AGLPQIMQTLDDNGIDVLIPIGGEGTLTAASWLSEENVPVVGVPKTIDNDIDCTDVTFGHDTALTVATEAIDRLHSTAES
HERVMLVEVMGRHAGWIALNAGLASGAHMTLIPEQPFDIEEVCRLVKGRFQRGDSHFICVVAEGAKPAPGTIMLREGGLD
EFGHERFTGVAAQLAVEVEKRINKDVRVTVLGHIQRGGTPTAYDRVLATRFGVNAADAAHAGEYGQMVTLRGQDIGRVPL
ADAVRKLKLVPQSRYDDAAAFFG
>Q2FXM8 2.7.1.11~~~pfkA~~~ATP-dependent 6-phosphofructokinase~~~COG0205
MKKIAVLTSGGDSPGMNAAVRAVVRTAIYNEIEVYGVYHGYQGLLNDDIHKLELGSVGDTIQRGGTFLYSARCPEFKEQE
VRKVAIENLRKRGIEGLVVIGGDGSYRGAQRISEECKEIQTIGIPGTIDNDINGTDFTIGFDTALNTIIGLVDKIRDTAS
SHARTFIIEAMGRDCGDLALWAGLSVGAETIVVPEVKTDIKEIADKIEQGIKRGKKHSIVLVAEGCMTAQDCQKELSQYI
NVDNRVSVLGHVQRGGSPTGADRVLASRLGGYAVDLLMQGETAKGVGIKNNKIVATSFDEIFDGKDHKFDYSLYELANKL
SI
>P99165 2.7.1.11~~~pfkA~~~ATP-dependent 6-phosphofructokinase~~~
MKKIAVLTSGGDSPGMNAAVRAVVRTAIYNEIEVYGVYHGYQGLLNDDIHKLELGSVGDTIQRGGTFLYSARCPEFKEQE
VRKVAIENLRKRGIEGLVVIGGDGSYRGAQRISEECKEIQTIGIPGTIDNDINGTDFTIGFDTALNTIIGLVDKIRDTAS
SHARTFIIEAMGRDCGDLALWAGLSVGAETIVVPEVKTDIKEIADKIEQGIKRGKKHSIVLVAEGCMTAQDCQKELSQYI
NVDNRVSVLGHVQRGGSPTGADRVLASRLGGYAVDLLMQGETAKGVGIKNNKIVATSFDEIFDGKDHKFDYSLYELANKL
SI
>Q9WY52 2.7.1.11~~~pfkA~~~ATP-dependent 6-phosphofructokinase~~~COG0205
MKKIAVLTSGGDAPGMNAAVRAVVRYGVRQGLEVIGVRRGYSGLIDGDFVKLEYKDVAGITEKGGTILRTSRCEEFKTEE
GRELAAKQIKKHGIEGLVVIGGEGSLTGAHLLYEEHKIPVVGIPATIDNDIGLTDMCIGVDTCLNTVMDAVQKLKDTASS
HERAFIVEVMGRHSGYIALMAGLVTGAEAIIVPEIPVDYSQLADRILEERRRGKINSIIIVAEGAASAYTVARHLEYRIG
YETRITILGHVQRGGSPTAFDRRLALSMGVEAVDALLDGEVDVMIALQGNKFVRVPIMEALSTKKTIDKKLYEIAHMLS
>P06999 2.7.1.11~~~pfkB~~~ATP-dependent 6-phosphofructokinase isozyme 2~~~COG1105
MVRIYTLTLAPSLDSATITPQIYPEGKLRCTAPVFEPGGGGINVARAIAHLGGSATAIFPAGGATGEHLVSLLADENVPV
ATVEAKDWTRQNLHVHVEASGEQYRFVMPGAALNEDEFRQLEEQVLEIESGAILVISGSLPPGVKLEKLTQLISAAQKQG
IRCIVDSSGEALSAALAIGNIELVKPNQKELSALVNRELTQPDDVRKAAQEIVNSGKAKRVVVSLGPQGALGVDSENCIQ
VVPPPVKSQSTVGAGDSMVGAMTLKLAENASLEEMVRFGVAAGSAATLNQGTRLCSHDDTQKIYAYLSR
>P9WID2 2.7.1.11~~~pfkB~~~ATP-dependent 6-phosphofructokinase isozyme 2~~~
MTEPAAWDEGKPRIITLTMNPALDITTSVDVVRPTEKMRCGAPRYDPGGGGINVARIVHVLGGCSTALFPAGGSTGSLLM
ALLGDAGVPFRVIPIAASTRESFTVNESRTAKQYRFVLPGPSLTVAEQEQCLDELRGAAASAAFVVASGSLPPGVAADYY
QRVADICRRSSTPLILDTSGGGLQHISSGVFLLKASVRELRECVGSELLTEPEQLAAAHELIDRGRAEVVVVSLGSQGAL
LATRHASHRFSSIPMTAVSGVGAGDAMVAAITVGLSRGWSLIKSVRLGNAAGAAMLLTPGTAACNRDDVERFFELAAEPT
EVGQDQYVWHPIVNPEASP
>P9WID3 2.7.1.11~~~pfkB~~~ATP-dependent 6-phosphofructokinase isozyme 2~~~COG1105
MTEPAAWDEGKPRIITLTMNPALDITTSVDVVRPTEKMRCGAPRYDPGGGGINVARIVHVLGGCSTALFPAGGSTGSLLM
ALLGDAGVPFRVIPIAASTRESFTVNESRTAKQYRFVLPGPSLTVAEQEQCLDELRGAAASAAFVVASGSLPPGVAADYY
QRVADICRRSSTPLILDTSGGGLQHISSGVFLLKASVRELRECVGSELLTEPEQLAAAHELIDRGRAEVVVVSLGSQGAL
LATRHASHRFSSIPMTAVSGVGAGDAMVAAITVGLSRGWSLIKSVRLGNAAGAAMLLTPGTAACNRDDVERFFELAAEPT
EVGQDQYVWHPIVNPEASP
>D9TT10 2.7.1.11~~~pfkB~~~ATP-dependent 6-phosphofructokinase~~~COG0524
MFNFNDKIVFDDKKYDVLTVGEMLVDMISTDYGDDFECDTYKKYFGGSPANIAINSKMLGINSIIVSSVGNDGLGKFLLK
KLQEHHIEIKYVRQVDYSTSMVLVTKSKSSPTPIFYRDADYHIEYSDELKYLIENTKIVHFSSWPISRNPSRSTVEILID
ECKKYDVLVCYDPNYHSMIWERGHDGREYIKSLIAKVDIIKPSEDDAERIFGKDTPENQLKKFLDLGAKLVILTLGKDGA
IVSNGEETIRFNTLADEVVDTTGAGDAFWSGFYSGLIKGYTLKKSLELGFAVSAYKLRYVGAIVDLPDIDTIKSMYDLKK
LR
>A5IZ80 3.1.1.104~~~~~~Phospho-furanose lactonase~~~
MAKDKFVRTVLGDVPAESIGITDCHDHLIKNGGPEMHEHPDFLMIDVEAAKKEVQEYVDHGGKTIVTMDPPNVGRDVYRM
LEIAEAFKGKANIVMSTGFHKAAFYDKYSSWLACVPTDDIVKMMVAEVEEGMDEYNYNGPVVKRSKAKAGIIKAGTGYAA
IDRLELKALEVAARTSITTGCPILVHTQLGTMALEVAQHLIGFGANPRKIQLSHLNKNPDRYYYEKIIKETGVTICFDGP
DRVKYYPDSLLADHIKYLVDKGLQKHITLSLDAGRILYQRNYGLTKGKETFGLSYLFERFIPLLKQVGVSQEAIDDILIN
NPREILAFDEPRVYDASKVSSEVVQLKKDLKLL
>Q4A724 3.1.1.104~~~~~~Phospho-furanose lactonase~~~COG1735
MENKFARTVLGDIPVEKLGITDCHDHFIKNGGPEVEEHIDFLMLNVDASIKEFKEFIDRGGSTIVTMDPPNVGRDVLKTL
EIANAVKNLGGNVIMSTGFHKAKFYDKYSSWLAVVPTEEIVKMCVAEIEEGMDEYNYNGPVVKRSKAKAGIIKAGTGYGA
IDRLELKALEVAARTSILTGCPILVHTQLGTMALEVAKHLIGFGANPDKIQISHLNKNPDKYYYEKVIKETGVTLCFDGP
DRVKYYPDSLLAENIKYLVDKGLQKHITLSLDAGRILYQRNYGLTKGKQTFGLAYLFDRFLPLLKQVGVSKEAIFDILVN
NPKRVLAFDEKRNFDPLKVSKEVLELKKELNLN
>P0A9N4 1.97.1.4~~~pflA~~~Pyruvate formate-lyase 1-activating enzyme~~~COG1180
MSVIGRIHSFESCGTVDGPGIRFITFFQGCLMRCLYCHNRDTWDTHGGKEVTVEDLMKEVVTYRHFMNASGGGVTASGGE
AILQAEFVRDWFRACKKEGIHTCLDTNGFVRRYDPVIDELLEVTDLVMLDLKQMNDEIHQNLVGVSNHRTLEFAKYLANK
NVKVWIRYVVVPGWSDDDDSAHRLGEFTRDMGNVEKIELLPYHELGKHKWVAMGEEYKLDGVKPPKKETMERVKGILEQY
GHKVMF
>Q7A7X5 1.97.1.4~~~pflA~~~Pyruvate formate-lyase-activating enzyme~~~
MLKGHLHSVESLGTVDGPGLRYILFTQGCLLRCLYCHNPDTWKISEPSREVTVDEMVNEILPYKPYFDASGGGVTVSGGE
PLLQMPFLEKLFAELKENGVHTCLDTSAGCANDTKAFQRHFEELQKHTDLILLDIKHIDNDKHIRLTGKPNTHILNFARK
LSDMKQPVWIRHVLVPGYSDDKDDLIKLGEFINSLDNVEKFEILPYHQLGVHKWKTLGIAYELEDVEAPDDEAVKAAYRY
VNFKGKIPVEL
>P09373 2.3.1.54~~~pflB~~~Formate acetyltransferase 1~~~COG1882
MSELNEKLATAWEGFTKGDWQNEVNVRDFIQKNYTPYEGDESFLAGATEATTTLWDKVMEGVKLENRTHAPVDFDTAVAS
TITSHDAGYINKQLEKIVGLQTEAPLKRALIPFGGIKMIEGSCKAYNRELDPMIKKIFTEYRKTHNQGVFDVYTPDILRC
RKSGVLTGLPDAYGRGRIIGDYRRVALYGIDYLMKDKLAQFTSLQADLENGVNLEQTIRLREEIAEQHRALGQMKEMAAK
YGYDISGPATNAQEAIQWTYFGYLAAVKSQNGAAMSFGRTSTFLDVYIERDLKAGKITEQEAQEMVDHLVMKLRMVRFLR
TPEYDELFSGDPIWATESIGGMGLDGRTLVTKNSFRFLNTLYTMGPSPEPNMTILWSEKLPLNFKKFAAKVSIDTSSLQY
ENDDLMRPDFNNDDYAIACCVSPMIVGKQMQFFGARANLAKTMLYAINGGVDEKLKMQVGPKSEPIKGDVLNYDEVMERM
DHFMDWLAKQYITALNIIHYMHDKYSYEASLMALHDRDVIRTMACGIAGLSVAADSLSAIKYAKVKPIRDEDGLAIDFEI
EGEYPQFGNNDPRVDDLAVDLVERFMKKIQKLHTYRDAIPTQSVLTITSNVVYGKKTGNTPDGRRAGAPFGPGANPMHGR
DQKGAVASLTSVAKLPFAYAKDGISYTFSIVPNALGKDDEVRKTNLAGLMDGYFHHEASIEGGQHLNVNVMNREMLLDAM
ENPEKYPQLTIRVSGYAVRFNSLTKEQQQDVITRTFTQSM
>Q5HJF4 2.3.1.54~~~pflB~~~Formate acetyltransferase~~~
MLETNKNHATAWQGFKNGRWNRHVDVREFIQLNYTLYEGNDSFLAGPTEATSKLWEQVMQLSKEERERGGMWDMDTKVAS
TITSHDAGYLDKDLETIVGVQTEKPFKRSMQPFGGIRMAKAACEAYGYELDEETEKIFTDYRKTHNQGVFDAYSREMLNC
RKAGVITGLPDAYGRGRIIGDYRRVALYGVDFLMEEKMHDFNTMSTEMSEDVIRLREELSEQYRALKELKELGQKYGFDL
SRPAENFKEAVQWLYLAYLAAIKEQNGAAMSLGRTSTFLDIYAERDLKAGVITESEVQEIIDHFIMKLRIVKFARTPDYN
ELFSGDPTWVTESIGGVGIDGRPLVTKNSFRFLHSLDNLGPAPEPNLTVLWSVRLPDNFKTYCAKMSIKTSSIQYENDDI
MRESYGDDYGIACCVSAMTIGKQMQFFGARANLAKTLLYAINGGKDEKSGAQVGPNFEGINSEVLEYDEVFKKFDQMMDW
LAGVYINSLNVIHYMHDKYSYERIEMALHDTEIVRTMATGIAGLSVAADSLSAIKYAQVKPIRNEEGLVVDFEIEGDFPK
YGNNDDRVDDIAVDLVERFMTKLRSHKTYRDSEHTMSVLTITSNVVYGKKTGNTPDGRKAGEPFAPGANPMHGRDQKGAL
SSLSSVAKIPYDCCKDGISNTFSIVPKSLGKEPEDQNRNLTSMLDGYAMQCGHHLNINVFNRETLIDAMEHPEEYPQLTI
RVSGYAVNFIKLTREQQLDVISRTFHESM
>Q7A7X6 2.3.1.54~~~pflB~~~Formate acetyltransferase~~~
MLETNKNHATAWQGFKNGRWNRHVDVREFIQLNYTLYEGNDSFLAGPTEATSKLWEQVMQLSKEERERGGMWDMDTKVAS
TITSHDAGYLDKDLETIVGVQTEKPFKRSMQPFGGIRMAKAACEAYGYELDEETEKIFTDYRKTHNQGVFDAYSREMLNC
RKAGVITGLPDAYGRGRIIGDYRRVALYGVDFLMEEKMHDFNTMSTEMSEDVIRLREELSEQYRALKELKELGQKYGFDL
SRPAENFKEAVQWLYLAYLAAIKEQNGAAMSLGRTSTFLDIYAERDLKAGVITESEVQEIIDHFIMKLRIVKFARTPDYN
ELFSGDPTWVTESIGGVGIDGRPLVTKNSFRFLHSLDNLGPAPEPNLTVLWSVRLPDNFKTYCAKMSIKTSSIQYENDDI
MRESYGDDYGIACCVSAMTIGKQMQFFGARANLAKTLLYAINGGKDEKSGAQVGPNFEGINSEVLEYDEVFKKFDQMMDW
LAGVYINSLNVIHYMHDKYSYERIEMALHDTEIVRTMATGIAGLSVAADSLSAIKYAQVKPIRNEEGLVVDFEIEGDFPK
YGNNDDRVDDIAVDLVERFMTKLRSHKTYRDSEHTMSVLTITSNVVYGKKTGNTPDGRKAGEPFAPGANPMHGRDQKGAL
SSLSSVAKIPYDCCKDGISNTFSIVPKSLGKEPEDQNRNLTSMLDGYAMQCGHHLNINVFNRETLIDAMEHPEEYPQLTI
RVSGYAVNFIKLTREQQLDVISRTFHESM
>P94692 1.2.7.1~~~por~~~Pyruvate:ferredoxin oxidoreductase~~~
MGKKMMTTDGNTATAHVAYAMSEVAAIYPITPSSTMGEEADDWAAQGRKNIFGQTLTIREMQSEAGAAGAVHGALAAGAL
TTTFTASQGLLLMIPNMYKISGELLPGVFHVTARAIAAHALSIFGDHQDIYAARQTGFAMLASSSVQEAHDMALVAHLAA
IESNVPFMHFFDGFRTSHEIQKIEVLDYADMASLVNQKALAEFRAKSMNPEHPHVRGTAQNPDIYFQGREAANPYYLKVP
GIVAEYMQKVASLTGRSYKLFDYVGAPDAERVIVSMGSSCETIEEVINHLAAKGEKIGLIKVRLYRPFVSEAFFAALPAS
AKVITVLDRTKEPGAPGDPLYLDVCSAFVERGEAMPKILAGRYGLGSKEFSPAMVKSVYDNMSGAKKNHFTVGIEDDVTG
TSLPVDNAFADTTPKGTIQCQFWGLGADGTVGANKQAIKIIGDNTDLFAQGYFSYDSKKSGGITISHLRFGEKPIQSTYL
VNRADYVACHNPAYVGIYDILEGIKDGGTFVLNSPWSSLEDMDKHLPSGIKRTIANKKLKFYNIDAVKIATDVGLGGRIN
MIMQTAFFKLAGVLPFEKAVDLLKKSIHKAYGKKGEKIVKMNTDAVDQAVTSLQEFKYPDSWKDAPAETKAEPMTNEFFK
NVVKPILTQQGDKLPVSAFEADGRFPLGTSQFEKRGVAINVPQWVPENCIQCNQCAFVCPHSAILPVLAKEEELVGAPAN
FTALEAKGKELKGYKFRIQINTLDCMGCGNCADICPPKEKALVMQPLDTQRDAQVPNLEYAARIPVKSEVLPRDSLKGSQ
FQEPLMEFSGACSGCGETPYVRVITQLFGERMFIANATGCSSIWGASAPSMPYKTNRLGQGPAWGNSLFEDAAEYGFGMN
MSMFARRTHLADLAAKALESDASGDVKEALQGWLAGKNDPIKSKEYGDKLKKLLAGQKDGLLGQIAAMSDLYTKKSVWIF
GGDGWAYDIGYGGLDHVLASGEDVNVFVMDTEVYSNTGGQSSKATPTGAVAKFAAAGKRTGKKDLARMVMTYGYVYVATV
SMGYSKQQFLKVLKEAESFPGPSLVIAYATCINQGLRKGMGKSQDVMNTAVKSGYWPLFRYDPRLAAQGKNPFQLDSKAP
DGSVEEFLMAQNRFAVLDRSFPEDAKRLRAQVAHELDVRFKELEHMAATNIFESFAPAGGKADGSVDFGEGAEFCTRDDT
PMMARPDSGEACDQNRAGTSEQQGDLSKRTKK
>Q2RMD6 1.2.7.1~~~~~~Pyruvate:ferredoxin oxidoreductase~~~COG0674
MPKQTLDGNTAAAHVAYAMSEVATIYPITPSSPMAEIADEWAAHGRKNIFGKTLQVAEMQSEAGAAGAVHGSLAAGALTT
TFTASQGLLLMIPNMYKIAGELLPCVFHVAARALSTHALSIFGDHADVMAARQTGFAMLSSASVQEVMDLALVAHLATLK
ARVPFVHFFDGFRTSHEVQKIDVIEYEDMAKLVDWDAIRAFRQRALNPEHPHQRGTAQNPDIYFQSREAANPYYLATPGI
VAQVMEQVAGLTGRHYHLFDYAGAPDAERVIVSMGSSCEVIEETVNYLVEKGEKVGLIKVRLFRPFSAEHFLKVLPASVK
RIAVLDRTKEPGSLGEPLYEDVQTVLAEHGKNILVVGGRYGLGSKEFNPSMVKAVFDNLAATTPKNKFTVGITDDVTHTS
LEIKEHIDTSPKGTFRCKFFGLGSDGTVGANKNSIKIIGDHTDMYAQGYFVYDSKKSGGVTISHLRFGKQPIQSAYLIDQ
ADLIACHNPSYVGRYNLLEGIKPGGIFLLNSTWSAEEMDSRLPADMKRTIATKKLKFYNIDAVKIAQEIGLGSRINVIMQ
TAFFKIANVIPVDEAIKYIKDSIVKTYGKKGDKILNMNFAAVDRALEALEEIKYPASWADAVDEAAATVTEEPEFIQKVL
RPINALKGDELPVSTFTPDGVFPVGTTKYEKRGIAVNIPQWQPENCIQCNQCSLVCPHAAIRPYLAKPADLAGAPETFVT
KDAIGKEAAGLKFRIQVSPLDCTGCGNCADVCPAKVKALTMVPLEEVTAVEEANYNFAEQLPEVKVNFNPATVKGSQFRQ
PLLEFSGACAGCGETPYVKLVTQLFGDRMIIANATGCSSIWGGSAPACPYTVNRQGHGPAWASSLFEDNAEFGYGMALAV
AKRQDELATAISKALEAPVSAAFKAACEGWLAGKDDADRSREYGDRIKALLPGEISQASGEVKDLLLDIDRQKDYLTKKS
IWIIGGDGWAYDIGYGGLDHVLASGANVNVLVLDTEVYSNTGGQSSKATQTGAVARFAAGGKFTKKKDLGLMAMSYGYVY
VASVAMGASHSQLMKALIEAEKYDGPSLIIAYAPCINHGINMTYSQREAKKAVEAGYWPLYRYNPQLAQEGKNPFILDYK
TPTASFRDFLMGEIRYTSLKKQFPEKAEQLFAKAEADAKARLEQYKKLAEG
>Q59126 2.7.1.90~~~pfp~~~Pyrophosphate--fructose 6-phosphate 1-phosphotransferase~~~
MRVGVLTGGGDCPGLNAVIRAVVRKGIEAHGWEIVGFRSGWRGPLTGDSRPLGLDDVEEILIRGGTILGSSRTNPYKEEG
GVEKIRAVLADQGVDALIAIGGEDTLGVAKKLTDDGIGVVGVPKTIDNDLAATDYTFGFDTAVHIATEAIDRLRTTAESH
YRAMVVEVMGRHAGWIALHAGLAGGANVILVPERPFSVEQVVEWVERRFEKMYAPIIVVAEGAVPEGGAEVLRTGEKDAF
GHVQLGGVGTWLADEIAERTGKESRAVVLGHTQRGGTPTAYDRVLATRFGLHAVDAVADGDFGTMVALRGTDIVRVKLAE
ATAELKTVPPERYEEAEVFFG
>P70826 2.7.1.90~~~pfp~~~Pyrophosphate--fructose 6-phosphate 1-phosphotransferase~~~
MNTSLFKQERQKYIPKLPNILKKDFNNISLVYGENTEAIQDRQALKEFFKNTYGLPIISFTEGESSLSFSKALNIGIILS
GGPAPGGHNVISGVFDAIKKFNPNSKLFGFKGGPLGLLENDKIELTESLINSYRNTGGFDIVSSGRTKIETEEHYNKALF
VAKENNLNAIIIIGGDDSNTNAAILAEYFKKNGENIQVIGVPKTIDADLRNDHIEISFGFDSATKIYSELIGNLCRDAMS
TKKYWHFVKLMGRSASHVALECALKTHPNICIVSEEVLAKKKTLSEIIDEMVSVILKRSLNGDNFGVVIVPEGLIEFIPE
VKSLMLELCDIFDKNEGEFKGLNIEKMKEIFVAKLSDYMKGVYLSLPLFIQFELIKSILERDPHGNFNVSRVPTEKLFIE
MIQSRLNDMKKRGEYKGSFTPVDHFFGYEGRSAFPSNFDSDYCYSLGYNAVVLILNGLTGYMSCIKNLNLKPTDWIAGGV
PLTMLMNMEERYGEKKPVIKKALVDLEGRPFKEFVKNRDKWALNNLYLYPGPVQYFGSSEIVDEITETLKLELLK
>Q9KH71 2.7.1.90~~~pfp~~~Pyrophosphate--fructose 6-phosphate 1-phosphotransferase~~~
MSKMRIGVLTGGGDCPGLNPAIRGIVMRALDYGDEVIGLKYGWAGLLKADTMPLSLEMVEDILEIGGTILGSSRTNPFKK
EEDVQKCVENFKKLNLDALIAIGGEDTLGVASKFSKLGLPMIGVPKTIDKDLEETDYTLGFDTAVEVVVDAIKRLRDTAR
SHARVIVVEIMGRHAGWLALYGGLAGGADYILIPEVEPNLEDLYNHIRKLYARGRNHAVVAIAEGVQLPGFTYQKGQEGM
VDAFGHIRLGGVGNVLAEEIQKNLGIETRAVILSHLQRGGSPSIRDRIMGLLLGKKAVDLVHEGKSGLFVAVKGNELVPV
DITLIEGKTKNVDPAFYESVKTFFNK
>E1VB09 2.7.1.90~~~pfp~~~Pyrophosphate--fructose 6-phosphate 1-phosphotransferase~~~COG0205
MAQHNAFYAQSGGVTAVINASACGVIEACRRHDDRIGKVYAGHNGIIGALTEDLIDVSQESDEAIAALRHTPAGAFGSCR
YKLKDIETHRTQYERLIEVFRAHDIRYFFYNGGGDSADTCLKVSQLSEKMGYPLTAIHVPKTVDNDLPITDNSPGFGSVA
KYIATSTLEASLDIASMCATSTKVFVLEVMGRHAGWIAAAGALAGQGEGDPPHLVIFPEIDFDRAAVMARVEESVKKCGY
CVIVVSEGARYEDGTFLADSGNTDAFGHRQLGGVAPTLAGMIKQDLGYKYHWAVADYLQRAARHLASKTDVDQAYAVGEK
AVELALDGQNAKMPAIKRISDEPYAWTVEAAPLADVANREKFMPRDFIREDGFGITEQCRRYLAPLIQGEDFPPFENGLP
KVAKLAKHRVERKLPEFKL
>G4STG9 2.7.1.90~~~pfp~~~Pyrophosphate--fructose 6-phosphate 1-phosphotransferase~~~
MNKPKKVAILTAGGLAPCLSSAIGSLIERYTEIDPSIEIICYRSGYKGLLLGDSYAVTPKIRENAALLHKFGGSPIGNSR
VKLTNVKDCIKRGLVQEGQDPQKVAADQLVKDGVDVLHTIGGDDTNTAAADLAAFLAKNDYGLTVIGLPKTIDNDVFPIK
QSLGAWTAAEQGAQYFQNVVAEYNANPRMLIVHEVMGRNCGWLTAATAMEYRKLLDRSEWLPEIGLDRAAYEVHGVFVPE
MEIDLAAEAKRLREVMDKVDCVNIFVSEGAGVDAIVAEMQAKGQEVPRDAFGHIKLDAVNPGKWFGEQFAEMIGAEKTLI
QKSGYFARASASNVDDIRLIKSCADLAVECAFRRESGVIGHDEDNGNVLRAIEFPRIKGGKPFDIDTPWFVQMLAGIGQS
KGARVEVSH
>Q609I3 2.7.1.90~~~pfp~~~Pyrophosphate--fructose 6-phosphate 1-phosphotransferase~~~COG0205
MAARNAFYAQSGGVTAVINASACGVLETARQYPDRIGTVYAGRNGIVGALTEDLIDTGQESAEAIAALRHTPSGAFGSCR
YKLKGLEENRAQYERLIEVFRAHDIGYFFYNGGGDSADTCLKVSQLSEKLGYPLQAVHIPKTVDNDLPITDCCPGFGSVA
KYIAVSVREASFDVRSMAATSTCIFVLEVMGRHAGWIAAAGGLASDERHELALVILFPEQVFDPERFLRAVDEKVRSHGY
CSVVVSEGIRGADGRFVAESGSRDVFGHARLGGVAPVIADLIKERLGYKYHWAVADYLQRAARHIASRTDVEQAYAVGKA
GVEMALKGLSAVMPAIVRTSDSPYRWEITAASLAEVANVEKKMPLEFISADGFGITEACRRYLRPLIEGEDYPPYAGGLP
DYVTLCNVAVPKKLAASFSV
>Q3KSV5 2.7.1.90~~~pfp~~~Pyrophosphate--fructose 6-phosphate 1-phosphotransferase~~~
MNKPKKVAILTAGGLAPCLNSAIGSLIERYTEIDPSIEIICYRGGYKGLLLGDSYPVTAEVRKKAGVLQRFGGSVIGNSR
VKLTNVKDCVKRGLVKEGEDPQKVAADQLVKDGVDILHTIGGDDTNTAAADLAAFLARNNYGLTVIGLPKTVDNDVFPIK
QSLGAWTAAEQGARYFMNVVAENNANPRMLIVHEVMGRNCGWLTAATAQEYRKLLDRAEWLPELGLTRESYEVHAVFVPE
MAIDLEAEAKRLREVMDKVDCVNIFVSEGAGVEAIVAEMQAKGQEVPRDAFGHIKLDAVNPGKWFGEQFAQMIGAEKTLV
QKSGYFARASASNVDDMRLIKSCADLAVECAFRRESGVIGHDEDNGNVLRAIEFPRIKGGKPFNIDTDWFNSMLSEIGQP
KGGKVEVSH
>Q9FAF8 2.7.1.90~~~pfp~~~Pyrophosphate--fructose 6-phosphate 1-phosphotransferase~~~
MAKSALHLARMSYQPKMPASLQGAVKIIEGKATEAVSDKEEIAAIFPRTYGLPLISFAPGGERTEYPPTNVGVILSGGQA
PGGHNVIAGLFDEMKLLNPDSRLFGFLMGPDGLIEHKYRELTAEVIDEYRNTGGFDMIGSGRTKLDKPEQFEAGLEILRE
LDIKALVIIGGDDSNTNACILAEYYASIDAGIQVIGCPKTIDGDLKNKQIETSFGFDTAAKVYSELIGNIQRDCNSARKY
WHFIKLMGRSASHITLECALQTHPNICIVSEEVEANNYYLDDVVTYIAETVVRRSEAGMNFGTVLIPEGLIEFLPAMKRL
IKELNEFLSQNDAEFKLIKRSAQRQYIKNKLSPENSRLYDSLPVDVARQLIADRDPHGNVQVSLIATEKLLADMTAQKLA
EWAEEGRFQGRFSTLTHFFGYEGRCAMPSNFDANYCYCLGRAASILIAAGKTGYMAAIKNTADPVSEWEAGGVPMTMMMN
MERRSGKMKPVIRKALVDMDGEPYRALREMRREWALSTEYVYPGPIQFFGPEHVCDSPTMTLRLEKNDR
>P29495 2.7.1.90~~~pfp~~~Pyrophosphate--fructose 6-phosphate 1-phosphotransferase~~~COG0205
MVKKVALLTAGGFAPCLSSAIAELIKRYTEVSPETTLIGYRYGYEGLLKGDSLEFSPAVRAHYDRLFSFGGSPIGNSRVK
LTNVKDLVARGLVASGDDPLKVAADQLIADGVDVLHTIGGDDTNTTAADLAAYLAQHDYPLTVVGLPKTIDNDIVPIRQS
LGAWTAADEGARFAANVIAEHNAAPRELIIHEIMGRNCGYLAAETSRRYVAWLDAQQWLPEAGLDRRGWDIHALYVPEAT
IDLDAEAERLRTVMDEVGSVNIFISEGAGVPDIVAQMQATGQEVPTDAFGHVQLDKINPGAWFAKQFAERIGAGKTMVQK
SGYFSRSAKSNAQDLELIAATATMAVDAALAGTPGVVGQDEEAGDKLSVIDFKRIAGHKPFDITLDWYTQLLARIGQPAP
IAAA
>Q2RNU4 2.7.1.90~~~pfp~~~Pyrophosphate--fructose 6-phosphate 1-phosphotransferase~~~COG0205
MFKGKVVVAQGGGPTAVINQSMVGAVLESRKFRNVELVYGAVHGVRGIVDEHFLDLTQETTHNLEMVAETPSSALGSTRE
KPDLKYCQEIFKVLKAHEIGYFFYIGGNDSSDTVRIVSEEAAKADYGLRCIHIPKTIDNDLVVNDHTPGFPSAARFVAQA
FSGVNLDNQALPGVYIGVVMGRHAGFLTAASALGKKFQDDGPHLIYLPERTFDVDTFVSDVKEVYDRTGRCIVAVSEGIH
DASGEPIITKLAEEVERDAHGNVQLSGTGALADLLVSVVKKKSGIKRVRGDTLGYLQRSFVGCVSDVDQREAREVGEKAV
QYAMWGQTNGSVTIHRTGFYSVDYQLTPLLDVAGKTRTMPDSFIAANGHDVTTDFLMYLRPLLGRGMPDAYRLRDNRVAK
VLNR
>Q9EZ02 2.7.1.90~~~pfp~~~Pyrophosphate--fructose 6-phosphate 1-phosphotransferase~~~COG0205
MTRISPLQKARYGYVPKLPPVLQDEIARIQAELGQPTEAVADREDLKRLFANTYGKPIATLTRGSNPEAGRRRTVGVILS
GGPAPGGHNVICGLFDALKKANRESTLIGFKGGPSGILDDEWIEFTDSLINQYRNTGGFDIIGSGRTKIETPEQFAKALE
NAKKHGLDALVIIGGDDSNTNAALLAEYFVQQGAPIQVIGIPKTIDGDLKNEYIEASFGFDTATKVYAELIGNIARDAIS
SRKYWHFIRLMGRSASHIALECALQTHPNVCIVSEEVREKNMTLSQIVDQIVDAVVKRAAKGENFGVVLVPEGLIEFIPE
VGALIDELNTLLAKEAEVFNRIDDPRERISWVKGKLSGNNQHVFSSLPETIQAQLLMDRDPHGNVQVSRIETEKLLIEMV
SSRLKALKEEGAYKGKFSALNHFFGYEARCAFPSNFDADYCYALGFTAFVLIANGLTGYIAAIKNLARPAVEWKPMGIPL
TMMMNMEKRHGKMKPVIRKALVDLEGAPFKRFATQRDAWATESAYVFPGAIQYYGPDDVCNQPTMTLKLEQGLE
>Q9WYC5 2.7.1.90~~~pfp~~~Pyrophosphate--fructose 6-phosphate 1-phosphotransferase~~~COG0205
MAERLGILVGGGPAPGINSVISSVTIEAINNGLEVIGIYDGFKHLVEGKTNMVKKLSIEDVSRIHIEGGSILRTSRVNPA
KSEETLEKTVQTLKKLGIKYLVTIGGDDTAFSASKVCERSKGEIKVVHVPKTIDNDLPLPENMPTFGFETARHVATELVY
NLMQDSRTTNRWYFVAMMGREAGHLALGVGKAASATITIIPEEFKEGVTLEEVCDVLDGAILKRKLMGRDDGVAVIGEGI
AEKMDPEELANIPGVIVEKDPHGHLRLAEIPLATILKRAIERRYAERGERIHIVDVTIGYELRSARPIPFDIVYTRTLGY
GAVRFLLGDYSDLPGGMVCVVGGRIKILPFDAFMDPKTGRTKVRVVDVRSEDYRVARKYMIRLEKKDLEDPETLEKLAKL
AKMEPEEFKKKYWHTTELP
>O83553 2.7.1.90~~~pfp~~~Pyrophosphate--fructose 6-phosphate 1-phosphotransferase~~~COG0205
MSISLLQQERHRYLPKVPDLLRGDFRRVCARRGLSTTAVADYDALRSLFARTYGQPLVNFVNASEKNEDSPMETAPEPRG
LRVAIVLSGGQAPGGHNVIAGLFDGLKRWHADSVLIGFLGGPAGVLSGDHIEICADRVDAYRNTGGFDLIGSGRTKIESE
SQFAAAAQTVTRMALDALVVVGGDDSNTNAALLAEHFVNSGISTKVIGVPKTIDGDLKNEAIETSFGFDTATKTYSELIG
NIARDACSARKYWHFIKLMGRSASHIALECALKTQPNVCLISEEVAAQSLTLAQIVQSLCDTIATRAQHGEHFGIVLVPE
GLIEFIPEMKALITELNEVMARRAQEFEALDTPDAQRVWIEQALSASARAVFNALPAEISTQLLADRDPHGNVQVSRIDT
ERLLILQVTERLAQMKQEGTYTGVFSSIAHFFGYEGRCAFPSNFDADYCYTLGLTACLLAVHRFTGYVASVRNLTSSVAE
WAVGGVPLTMLMNMERRHGSQKPVIKKALVDLEGMPFRVFSRRRASWALKTSYVYPGAVQYYGPPAVCDEPSVTIRLERP
APAANSSFGHRSS
>B0RP51 2.7.1.90~~~pfp~~~Pyrophosphate--fructose 6-phosphate 1-phosphotransferase~~~
MTNGNLLYAQSGGVTAVINATAAGVIGEARARKIKVLAARNGILGALREELIDTSKESAAAIAALAQTPGGAFGSCRYKL
KSLEQDRAKYERLLEVLRAHDVRWFLYNGGNDSADTALKVSQLAKAFGYPLHCIGVPKTIDNDLAVTDTCPGFGSAAKYT
AVSVREAALDVAAMADTSTKVFIYEAMGRHAGWLAAAAGLAGQGPDDAPQIILLPERAFDQAAFLAKVRQMVERVGWCVV
VASEGIQDAQGKFVADAGGATDSFGHAQLGGVASFLAAQVKQELGYKVHWTLPDYLQRSARHLASKTDWEQAQAVGKAAV
QYALKGMNAVIPVIERVSDAPYRWKIVPAPLHKVANHEKKMPPSFLRKDGFGITERARRYFAPLIKGEAPLAYGSDGLPK
YVSLKNVAVAKKLPAWEG
>P9WIG3 ~~~PE_PGRS3~~~PE-PGRS family protein PE_PGRS3~~~COG3391
MSFVIAAPEVIAAAATDLASLGSSISAANAAAAANTTALMAAGADEVSTAIAALFGAHGQAYQALSAQAQAFHAQFVQAL
TSGGGAYAAAEAAAVSPLLDPINEFFLANTGRPLIGNGANGAPGTGANGGDGGWLIGNGGAGGSGAAGVNGGAGGNGGAG
GNGGAGGLIGNGGAGGAGGVASSGIGGSGGAGGNAMLFGAGGAGGAGGGVVALTGGAGGAGGAGGNAGLLFGAAGVGGAG
GFTNGSALGGAGGAGGAGGLFATGGVGGSGGAGSSGGAGGAGGAGGLFGAGGTGGHGGFADSSFGGVGGAGGAGGLFGAG
GEGGSGGHSLVAGGDGGAGGNAGMLALGAAGGAGGIGGDGGTLTAGGIGGAGGAGGNAGLLFGSGGSGGAGGFGFADGGQ
GGPGGNAGTVFGSGGAGGNGGVGQGFAGGIGGAGGTPGLIGNGGNGGNGGASAVTGGNGGIGGTGVLIGNGGNGGSGGIG
AGKAGVGGVSGLLLGLDGFNAPASTSPLHTLQQNVLNVVNEPFQTLTGRPLIGNGANGTPGTGADGGAGGWLFGNGANGT
PGTGAAGGAGGWLFGNGGNGGHGATNTAATATGGAGGAGGILFGTGGNGGTGGIATGAGGIGGAGGAGGVSLLIGSGGTG
GNGGNSIGVAGIGGAGGRGGDAGLLFGAAGTGGHGAAGGVPAGVGGAGGNGGLFANGGAGGAGGFNAAGGNGGNGGLFGT
GGTGGAGTNFGAGGNGGNGGLFGAGGTGGAAGSGGSGITTGGGGHGGNAGLLSLGASGGAGGSGGASSLAGGAGGTGGNG
ALLFGFRGAGGAGGHGGAALTSIQQGGAGGAGGNGGLLFGSAGAGGAGGSGANALGAGTGGTGGDGGHAGVFGNGGDGGC
RRVWRRYRRQRWCRRQRRADRQRRQRRQRRQSRGHARCRRHRRAAARRERTQRLAIAGRPATTRGVEGISCSPQMMP
>L0T4W6 ~~~PE_PGRS4~~~PE-PGRS family protein PE_PGRS4~~~COG0657
MSFVIAAPEVIAAAATDLASLESSIAAANAAAAANTTALLAAGADEVSTAVAALFGAHGQAYQALSAQAQAFHAQFVQAL
TSGGGAYAAAEAAATSPLLAPINEFFLANTGRPLIGNGTNGAPGTGANGGDGGWLIGNGGAGGSGAAGVNGGAGGNGGAG
GLIGNGGAGGAGGRASTGTGGAGGAGGAAGMLFGAAGVGGPGGFAAAFGATGGAGGAGGNGGLFADGGVGGAGGATDAGT
GGAGGSGGNGGLFGAGGTGGPGGFGIFGGGAGGDGGSGGLFGAGGTGGSGGTSIINVGGNGGAGGDAGMLSLGAAGGAGG
SGGSNPDGGGGAGGIGGDGGTLFGSGGAGGVCGLGFDAGGAGGAGGKAGLLIGAGGAGGAGGGSFAGAGGTGGAGGAPGL
VGNAGNGGNGGASANGAGAAGGAGGSGVLIGNGGNGGSGGTGAPAGTAGAGGLGGQLLGRDGFNAPASTPLHTLQQQILN
AINEPTQALTGRPLIGNGANGTPGTGADGGAGGWLFGNGGNGGHGATGADGGDGGSGGAGGILSGIGGTGGSGGIGTTGQ
GGTGGTGGAALLIGSGGTGGSGGFGLDTGGAGGRGGDAGLFLGAAGTGGQAALSQNFIGAGGTAGAGGTGGLFANGGAGG
AGGFGANGGTGGNGLLFGAGGTGGAGTLGADGGAGGHGGLFGAGGTGGAGGSSGGTFGGNGGSGGNAGLLALGASGGAGG
SGGSALNVGGTGGVGGNGGSGGSLFGFGGAGGTGGSSGIGSSGGTGGDGGTAGVFGNGGDGGAGGFGADTGGNSSSVPNA
VLIGNGGNGGNGGKAGGTPGAGGTSGLIIGENGLNGL
>Q6MX50 ~~~PE_PGRS5~~~PE-PGRS family protein PE_PGRS5~~~COG3391
MSFVIAQPEMIAAAAGELASIRSAINAANAAAAAQTTGVMSAAADEVSTAVAALFSSHAQAYQAASAQAAAFHAQVVRTL
TVDAGAYASAEAANAGPNMLAAVNAPAQALLGRPLIGNGANGAPGTGQAGGDGGLLFGNGGNGGSGAPGQAGGAGGAAGF
FGNGGNGGDGGAGANGGAGGTAGWFFGFGGNGGAGGIGVAGINGGLGGAGGDGGNAGFFGNGGNGGMGGAGAAGVNAVNP
GLATPVTPAANGGNGLNLVGVPGTAGGGADGANGSAIGQAGGAGGDGGNASTSGGIGIAQTGGAGGAGGAGGDGAPGGNG
GNGGSVEHTGATGSSASGGNGATGGNGGVGAPGGAGGNGGHVSGGSVNTAGAGGKGGNGGTGGAGGPGGHGGSVLSGPVG
DSGNGGAGGDGGAGVSATDIAGTGGRGGNGGHGGLWIGNGGDGGAGGVGGVGGAGAAGAIGGHGGDGGSVNTPIGGSEAG
DGGKGGLGGDGGGRGIFGQFGAGGAGGAGGVGGAGGAGGTGGGGGNGGAIFNAGTPGAAGTGGDGGVGGTGAAGGKGGAG
GSGGVNGATGADGAKGLDGATGGKGNNGNPG
>Q79FW5 5.4.2.12~~~~~~PE-PGRS family protein PE_PGRS11~~~COG0406
MSFVIVARDALAAAAADLAQIGSAVNAGNLAAANPTTAVAAAAADEVSAALAALFGAHAREYQAAAAQAAAYHEQFVHRL
SAAATSYAVTEVTIATSLRGALGSAPASVSDGFQAFVYGPIHATGQQWINSPVGEALAPIVNAPTNVLLGRDLIGNGVTG
TAAAPNGGPGGLLFGDGGAGYTGGNGGSAGLIGNGGTGGAGFAGGVGGMGGTGGWLMGNGGMGGAGGVGGNGGAGGQALL
FGNGGLGGAGGAGGVDGAIGRGGWFIGTGGMATIGGGGNGQSIVIDFVRHGQTPGNAAMLIDTAVPGPGLTALGQQQAQA
IANALAAKGPYAGIFDSQLIRTQQTAAPLANLLGMAPQVLPGLNEIHAGIFEDLPQISPAGLLYLVGPIAWTLGFPIVPM
LAPGSTDVNGIVFNRAFTGAVQTIYDASLANPVVAADGNITSVAYSSAFTIGVGTMMNVDNPHPLLLLTHPVPNTGAVVV
QGNPEGGWTLVSWDGIPVGPASLPTALFVDVRELITAPQYAAYDIWESLFTGDPAAVINAVRDGADEVGAAVVQFPHAVA
DDVIDATGHPYLSGLPIGLPSLIP
>Q79FU3 ~~~~~~PE-PGRS family protein PE_PGRS16~~~COG0657
MSFVVTAPPVLASAASDLGGIASMISEANAMAAVRTTALAPAAADEVSAAIAALFSSYARDYQTLSVQVTAFHVQFAQTL
TNAGQLYAVVDVGNGVLLKTEQQVLGVINAPTQTLVGRPLIGDGTHGAPGTGQNGGAGGILWGNGGNGGSGAPGQPGGRG
GDAGLFGHGGHGGVGGPGIAGAAGTAGLPGGNGANGGSGGIGGAGGAGGNGGLLFGNGGAGGQGGSGGLGGSGGTGGAGM
AAGPAGGTGGIGGIGGIGGAGGVGGHGSALFGHGGINGDGGTGGMGGQGGAGGNGWAAEGITVGIGEQGGQGGDGGAGGA
GGIGGSAGGIGGSQGAGGHGGDGGQGGAGGSGGVGGGGAGAGGDGGAGGIGGTGGNGSIGGAAGNGGNGGRGGAGGMATA
GSDGGNGGGGGNGGVGVGSAGGAGGTGGDGGAAGAGGAPGHGYFQQPAPQGLPIGTGGTGGEGGAGGAGGDGGQGDIGFD
GGRGGDGGPGGGGGAGGDGSGTFNAQANNGGDGGAGGVGGAGGTGGTGGVGADGGRGGDSGRGGDGGNAGHGGAAQFSGR
GAYGGEGGSGGAGGNAGGAGTGGTAGSGGAGGFGGNGADGGNGGNGGNGGFGGINGTFGTNGAGGTGGLGTLLGGHNGNI
GLNGATGGIGSTTLTNATVPLQLVNTTEPVVFISLNGGQMVPVLLDTGSTGLVMDSQFLTQNFGPVIGTGTAGYAGGLTY
NYNTYSTTVDFGNGLLTLPTSVNVVTSSSPGTLGNFLSRSGAVGVLGIGPNNGFPGTSSIVTAMPGLLNNGVLIDESAGI
LQFGPNTLTGGITISGAPISTVAVQIDNGPLQQAPVMFDSGGINGTIPSALASLPSGGFVPAGTTISVYTSDGQTLLYSY
TTTATNTPFVTSGGVMNTGHVPFAQQPIYVSYSPTAIGTTTFN
>Q79FU2 ~~~~~~PE-PGRS family protein PE_PGRS17~~~COG3391
MSFVNVAPQLVSTAAADAARIGSAINTANTAAAATTQVLAAAQDEVSTAIAALFGSHGQHYQAISAQVAAYQQRFVLALS
QAGSTYAVAEAASATPLQNVLDAINAPVQSLTGRPLIGDGANGIDGTGQAGGNGGWLWGNGGNGGSGAPGQAGGAGGAAG
LIGNGGAGGTGGAVSLARAGTAGGAGRGPVGGIGGAGGVGGAGGAAGAVTTITHASFNDPHGVAVNPGGNVYVTNFGSGT
VSVINPATNTVTGSPITIGNGPSGVAVSPVTGLVFVTNFDSNTVSVIDPTTNTVTGSPITVGTAPTGVAVNPVTGEVYVT
NFAGDTVSVIS
>Q79FU0 ~~~~~~PE-PGRS family protein PE_PGRS18~~~COG3391
MSFVNVAPQLVSTAAADAARIGSAINTANTAAAATTQVLAAAHDEVSTAIAALFGSHGQHYQAISAQVAAYQERFVLALS
QASSTYAVAEAASATPLQNVLDAINAPVQSLTGRPLIGDGANGIDGTGQAGGNGGWLWGNGGNGGSGAPGQAGGAGGAAG
LIGNGGAGGAGGQGLPFEAGANGGAGGAGGWLFGNGGAGGVGGAGGAGTTFGVAGGDGGTGGVGGHGGLIGVGGHGGDGG
TGGTGGAVSLARAGTAGGAGGGPAGGIGGAGGVGGAGGAAGAVTTITHASFNDPHGVAVNPGGNIYVTNQGSNTVSVIDP
VTNTVTGSITDGNGPSGVAVSPVTGLVFVTNFDSNTVSVIDPNTNTVTGSIPVGTGAYGVAVNPGGNIYVTNQFSNTVSV
IDPATNTVTGSPIPVGLDPTGVAVNPVTGVVYVTNSLDDTVSVITGEPARSVCSAAI
>P9WIF7 ~~~~~~Uncharacterized PE-PGRS family protein PE_PGRS24~~~COG0657
MSFVIAAPETLVRAASDLANIGSTLGAANAAALGPTTELLAAGADEVSAAIASLFAAHGQAYQAVSAQMSAFHAQFVQTF
TAGAGAYASAEAAAAAPLEGLLNIVNTPTQLLLGRPLIGNGANGAPGTGQAGGAGGLLYGNGGAGGSGAPGQAGGPGGAA
GLFGNGGAGGAGGDGPGNGAAGGAGGAGGLLFGSGGAGGPGGVGNTGTGGLGGDGGAAGLFGAGGIGGAGGPGFNGGAGG
AGGRSGLFEVLAAGGAGGTGGLSVNGGTGGTGGTGGGGGLFSNGGAGGAGGFGVSGSAGGNGGTGGDGGIFTGNGGTGGT
GGTGTGNQLVGGEGGAGGAGGNAGILFGAGGIGGTGGTGLGAPDPGGTGGKGGVGGIGGAGALFGPGGAGGTGGFGASSA
DQMAGGIGGSGGSGGAAKLIGDGGAGGTGGDSVRGAAGSGGTGGTGGLIGDGGAGGAGGTGIEFGSVGGAGGAGGNAAGL
SGAGGAGGAGGFGETAGDGGAGGNAGLLNGDGGAGGAGGLGIAGDGGNGGKGGKAGMVGNGGDGGAGGASVVANGGVGGS
GGNATLIGNGGNGGNGGVGSAPGKGGAGGTAGLLGLNGSPGLS
>Q79FP3 ~~~~~~PE-PGRS family protein PE_PGRS26~~~COG0657
MSNVMVVPGMLSAAAADVASIGAALSAANGAAAPTTAGVLAAGADEVSAAIASLFSGYARDYQALSAQMARFHQQFVQAL
TASVGSYAAAEAANASPLQALEQQVLAAINAPTQTLLGRPLIGNGADGLPGQNGGAGGLLWGNGGNGGAGDAAHPNGGNG
GDAGMFGNGGAGGAGYSPAAGTGAAGGAGGAGGAGGWLSGNGGAGGNGGTGASGADGGGGLPPVPASPGGNGGGGDAGGA
AGMFGTGGAGGTGGDGGAGGAGDSPNSGANGARGGDGGNGAAGGAGGRLFGNGGAGGNGGTAGQGGDGGTALGAGGIGGD
GGTGGAGGTGGTAGIGGSSAGAGGAGGDGGAGGTGGGSSMIGGKGGTGGNGGVGGTGGASALTIGNGSSAGAGGAGGAGG
TGGTGGYIESLDGKGQAGNGGNGGNGAAGGAGGGGTGAGGNGGAGGNGGDGGPSQGGGNPGFGGDGGTGGPGGVGVPDGI
GGANGAQGKHG
>Q79FP0 ~~~~~~Ubiquitin-binding protein Rv1468c~~~COG3391
MSFVVANTEFVSGAAGNLARLGSMISAANSAAAAQTTAVAAAGADEVSAAVAALFGAHGQTYQVLSAQAAAFHSQFVQAL
SGGAQAYAAAEATNFGPLQPLFDVINAPTLALLNRPLIGNGADGTAANPNGQAGGLLIGNGGNGFSPAAGPGGNGGAAGL
LGHGGNGGVGALGANGGAGGTGGWLFGNGGAGGNSGGGGGAGGIGGSAVLFGAGGAGGISPNGMGAGGSGGNGGLFFGNG
GAGASSFLGGGGAGGRAFLFGDGGAGGAALSAGSAGRGGDAGFFYGNGGAGGSGAGGASSAHGGAGGQAGLFGNGGEGGD
GGALGGNGGNGGNAQLIGNGGDGGDGGGAGAPGLGGRGGLLLGLPGANGT
>Q79FL8 ~~~~~~PE-PGRS family protein PE_PGRS30~~~COG3391
MSFLLVEPDLVTAAAANLAGIRSALSEAAAAASTPTTALASAGADEVSAAVSRLFGAYGQQFQALNARAATFHAEFVSLL
NGGAAAYTGAEAASVSSMQALLDAVNAPTQTLLGRPLIGNGADGVAGTGSNAGGNGGPGGILYGNGGNGGAGGNGGAAGL
IGNGGAGGAGGAGGAGGAGGAGGTGGLLYGNGGAGGNGGSAAAAGGAGGNALLFGNGGNGGSGASGGAAGHAGTIFGNGG
NAGAGSGLAGADGGLFGNGGDGGSSTSKAGGAGGNALFGNGGDGGSSTVAAGGAGGNTLVGNGGAGGAGGTSGLTGSGVA
GGAGGSVGLWGSGGAGGDGGAATSLLGVGMNAGAGGAGGNAGLLYGNGGAGGAGGNGGDTTVPLFDSGVGGAGGAGGNAS
LFGNGGTGGVGGKGGTSSDLASATSGAGGAGGAGGVGGLLYGNGGNGGAGGIGGAAINILANAGAGGAGGAAGSSFIGNG
GNGGAGGAGGAAALFSSGVGGAGGSGGTALLLGSGGAGGNGGTGGANSGSLFASPGGTGGAGGHGGAGGLIWGNGGAGGN
GGNGGTTADGALEGGTGGIGGTGGSAIAFGNGGQGGAGGTGGDHSGGNGIGGKGGASGNGGNAGQVFGDGGTGGTGGAGG
AGSGTKAGGTGSDGGHGGNATLIGNGGDGGAGGAGGAGSPAGAPGNGGTGGTGGVLFGQSGSSGPPGAAALAFPSLSSSV
PILGPYEDLIANTVANLASIGNTWLADPAPFLQQYLANQFGYGQLTLTALTDATRDFAIGLAGIPPSLQSALQALAAGDV
SGAVTDVLGAVVKVFVSGVDASDLSNILLLGPVGDLFPILSIPGAMSQNFTNVVMTVTDTTIAFSIDTTNLTGVMTFGLP
LAMTLNAVGSPITTAIAFAESTTAFVSAVQAGNLQAAAAALVGAPANVANGFLNGEARLPLALPTSATGGIPVTVEVPVG
GILAPLQPFQATAVIPVIGPVTVTLEGTPAGGIVPALVNYAPTQLAQAIAP
>P9WIF5 ~~~~~~PE-PGRS family protein PE_PGRS33~~~COG0657
MSFVVTIPEALAAVATDLAGIGSTIGTANAAAAVPTTTVLAAAADEVSAAMAALFSGHAQAYQALSAQAALFHEQFVRAL
TAGAGSYAAAEAASAAPLEGVLDVINAPALALLGRPLIGNGANGAPGTGANGGDGGILIGNGGAGGSGAAGMPGGNGGAA
GLFGNGGAGGAGGNVASGTAGFGGAGGAGGLLYGAGGAGGAGGRAGGGVGGIGGAGGAGGNGGLLFGAGGAGGVGGLAAD
AGDGGAGGDGGLFFGVGGAGGAGGTGTNVTGGAGGAGGNGGLLFGAGGVGGVGGDGVAFLGTAPGGPGGAGGAGGLFGVG
GAGGAGGIGLVGNGGAGGSGGSALLWGDGGAGGAGGVGSTTGGAGGAGGNAGLLVGAGGAGGAGALGGGATGVGGAGGNG
GTAGLLFGAGGAGGFGFGGAGGAGGLGGKAGLIGDGGDGGAGGNGTGAKGGDGGAGGGAILVGNGGNGGNAGSGTPNGSA
GTGGAGGLLGKNGMNGLP
>O53553 ~~~~~~Uncharacterized PE-PGRS family protein PE_PGRS54~~~COG0657
MSFVLIAPEFVTAAAGDLTNLGSSISAANASAASATTQVLAAGADEVSARIAALFGGFGLEYQAISAQVAAYHQRFVQAL
STGAGAYASAEAAAAEQIVLGVINAPTQALLGRPLIGDGANATTPGGAGGAGGLLFGNGGAGAAGAPGQAGGPGGPAGLW
GNGGPGGAGGSGGGTGGAGGAGGWLFGVGGAGGVGGAGGGTGGAGGPGGLIWGGGGAGGVGGAGGGTGGAGGRAELLFGA
GGAGGAGTDGGPGATGGTGGHGGVGGDGGWLAPGGAGGAGGQGGAGGAGSDGGALGGTGGTGGTGGAGGAGGRGALLLGA
GGQGGLGGAGGQGGTGGAGGDGVLGGVGGTGGKGGVGGVAGLGGAGGAAGQLFSAGGAAGAVGVGGTGGQGGAGGAGAAG
ADAPASTGLTGGTGFAGGAGGVGGQGGNAIAGGINGSGGAGGTGGQGGAGGMGGSGADNASGIGADGGAGGTGGNAGAGG
AGGAAGTGGTGGVVGAAGKAGIGGTGGQGGAGGAGSAGTDATATGATGGTGFSGGAGGAGGAGGNTGVGGTNGSGGQGGT
GGAGGAGGAGGVGADNPTGIGGTGGTGGKGGAGGAGGQGGSSGAGGTNGSGGAGGTGGQGGAGGAGGAGADNPTGIGGAG
GTGGTGGAAGAGGAGGAIGTGGTGGAVGSVGNAGIGGTGGTGGVGGAGGAGAAAAAGSSATGGAGFAGGAGGEGGAGGNS
GVGGTNGSGGAGGAGGKGGTGGAGGSGADNPTGAGFAGGAGGTGGAAGAGGAGGATGTGGTGGVVGATGSAGIGGAGGRG
GDGGDGASGLGLGLSGFDGGQGGQGGAGGSAGAGGINGAGGAGGNGGDGGDGATGAAGLGDNGGVGGDGGAGGAAGNGGN
AGVGLTAKAGDGGAAGNGGNGGAGGAGGAGDNNFNGGQGGAGGQGGQGGLGGASTTSINANGGAGGNGGTGGKGGAGGAG
TLGVGGSGGTGGDGGDAGSGGGGGFGGAAGKAGGGGNGGRGGDGGDGASGLGLGLSGFDGGQGGQGGAGGSAGAGGINGA
GGAGGNGGDGGDGATGAAGLGDNGGVGGDGGAGGAAGNGGNAGVGLTAKAGDGGAAGNGGNGGAGGAGGAGDNNFNGGQG
GAGGQGGQGGLGGASTTSINANGGAGGNGGTGGKGGAGGAGTLGVGGSGGTGGDGGDAGSGGGGGFGGAAGKAGGGGNGG
VGGDGGEGASGLGLGLSGFDGGQGGQGGAGGSAGAGGINGAGGAGGTGGAGGDGAPATLIGGPDGGDGGQGGIGGDGGNA
GFGAGVPGDGGDGGNAGFGAGVPGDGGIGGTGGAGGAGGAGADGDPSIDGGQGGAGGHGGQGGKGGLNSTGLASAASGDG
GNGGAGGAGGNGGDGDGFIGGSGGTGGTGGDAGVGGLANTGGTAGNAGIGGAGGRGGDGGAGDSGALSQDGNGFAGGQGG
QGGVGGNAGAGGINGAGGTGGTGGAGGDGQNGTTGVASEGGAGGQGGDGGQGGIGGAGGNAGFGAGVPGDGGIGGTGGAG
GAGGAGADGDPSIDGGQGGAGGHGGQGGKGGLNSTGLASAASGDGGNGGAGGAGGNGGDGDGFIGGSGGTGGTGGDAGVG
GLANTGGTAGNAGIGGAGGRGGDGGAGDSGALSQDGNGFAGGQGGQGGVGGNAGAGGINGAGGTGGTGGAGGDGQNGTTG
VASEGGAGGQGGDGGQGGIGGAGGNAGFGAGVPGDGGIGGTGGAGGAGGAGADGDPSIDGGQGGAGGHGGQGGKGGLNST
GLASAASGDGGNGGAGGAGGNGGAGGLGGGGGTGGTNGNGGLGGGGGNGGAGGAGGTPTGSGTEGTGGDGGDAGAGGNGG
SATGVGNGGNGGDGGNGGDGGNGAPGGFGGGAGAGGLGGSGAGGGTDGDDGNGGSPGTDGS
>Q6MWV0 ~~~~~~PE-PGRS family protein PE_PGRS61~~~COG0657
MLNAPTQALLGRPLVGNGANGAPGTGANGGDGGILFGSGGAGGSGAAGMAGGNGGAAGLFGNGGAGGAGGSATAGAAGAG
GNGGAGGLLFGTAGAGGNGGLSLGLGVAGGAGGAGGSGGSDTAGHGGTGGAGGLLFGAGEDGTTPGGNGGAGGVAGLFGD
GGNGGNAGVGTPAGNVGAGGTGGLLLGQDGMTGLT
>L7N680 ~~~~~~PE-PGRS family protein PE_PGRS62~~~COG0657
MSFVVTVPEAVAAAAGDLAAIGSTLREATAAAAGPTTGLAAAAADDVSIAVSQLFGRYGQEFQTVSNQLAAFHTEFVRTL
NRGAAAYLNTESANGGQLFGQIEAGQRAVSAAAAAAPGGAYGQLVANTATNLESLYGAWSANPFPFLRQIIANQQVYWQQ
IAAALANAVQNFPALVANLPAAIDAAVQQFLAFNAAYYIQQIISSQIGFAQLFATTVGQGVTSVIAGWPNLAAELQLAFQ
QLLVGDYNAAVANLGKAMTNLLVTGFDTSDVTIGTMGTTISVTAKPKLLGPLGDLFTIMTIPAQEAQYFTNLMPPSILRD
MSQNFTNVLTTLSNPNIQAVASFDIATTAGTLSTFFGVPLVLTYATLGAPFASLNAIATSAETIEQALLAGNYLGAVGAL
IDAPAHALDGFLNSATVLDTPILVPTGLPSPLPPTVGITLHLPFDGILVPPHPVTATISFPGAPVPIPGFPTTVTVFGTP
FMGMAPLLINYIPQQLALAIKPAA
>P69434 ~~~pgaA~~~Poly-beta-1,6-N-acetyl-D-glucosamine export protein~~~COG0457
MYSSSRKRCPKTKWALKLLTAAFLAASPAAKSAVNNAYDALIIEARKGNTQPALSWFALKSALSNNQIADWLQIALWAGQ
DKQVITVYNRYRHQQLPARGYAAVAVAYRNLQQWQNSLTLWQKALSLEPQNKDYQRGQILTLADAGHYDTALVKLKQLNS
GAPDKANLLAEAYIYKLAGRHQDELRAMTESLPENASTQQYPTEYVQALRNNQLAAAIDDANLTPDIRADIHAELVRLSF
MPTRSESERYAIADRALAQYAALEILWHDNPDRTAQYQRIQVDHLGALLTRDRYKDVISHYQRLKKTGQIIPPWGQYWVA
SAYLKDHQPKKAQSIMTELFYHKETIAPDLSDEELADLFYSHLESENYPGALTVTQHTINTSPPFLRLMGTPTSIPNDTW
LQGHSFLSTVAKYSNDLPQAEMTARELAYNAPGNQGLRIDYASVLQARGWPRAAENELKKAEVIEPRNINLEVEQAWTAL
TLQEWQQAAVLTHDVVEREPQDPGVVRLKRAVDVHNLAELRIAGSTGIDAEGPDSGKHDVDLTTIVYSPPLKDNWRGFAG
FGYADGQFSEGKGIVRDWLAGVEWRSRNIWLEAEYAERVFNHEHKPGARLSGWYDFNDNWRIGSQLERLSHRVPLRAMKN
GVTGNSAQAYVRWYQNERRKYGVSWAFTDFSDSNQRHEVSLEGQERIWSSPYLIVDFLPSLYYEQNTEHDTPYYNPIKTF
DIVPAFEASHLLWRSYENSWEQIFSAGVGASWQKHYGTDVVTQLGYGQRISWNDVIDAGATLRWEKRPYDGDREHNLYVE
FDMTFRF
>P75906 3.5.1.-~~~pgaB~~~Poly-beta-1,6-N-acetyl-D-glucosamine N-deacetylase~~~COG0726
MLRNGNKYLLMLVSIIMLTACISQSRTSFIPPQDRESLLAEQPWPHNGFVAISWHNVEDEAADQRFMSVRTSALREQFAW
LRENGYQPVSIAQIREAHRGGKPLPEKAVVLTFDDGYQSFYTRVFPILQAFQWPAVWAPVGSWVDTPADKQVKFGDELVD
REYFATWQQVREVARSRLVELASHTWNSHYGIQANATGSLLPVYVNRAYFTDHARYETAAEYRERIRLDAVKMTEYLRTK
VEVNPHVFVWPYGEANGIAIEELKKLGYDMFFTLESGLANASQLDSIPRVLIANNPSLKEFAQQIITVQEKSPQRIMHID
LDYVYDENLQQMDRNIDVLIQRVKDMQISTVYLQAFADPDGDGLVKEVWFPNRLLPMKADIFSRVAWQLRTRSGVNIYAW
MPVLSWDLDPTLTRVKYLPTGEKKAQIHPEQYHRLSPFDDRVRAQVGMLYEDLAGHAAFDGILFHDDALLSDYEDASAPA
ITAYQQAGFSGSLSEIRQNPEQFKQWARFKSRALTDFTLELSARVKAIRGPHIKTARNIFALPVIQPESEAWFAQNYADF
LKSYDWTAIMAMPYLEGVAEKSADQWLIQLTNQIKNIPQAKDKSILELQAQNWQKNGQHQAISSQQLAHWMSLLQLNGVK
NYGYYPDNFLHNQPEIDLIRPEFSTAWYPKND
>P75905 2.4.1.-~~~pgaC~~~Poly-beta-1,6-N-acetyl-D-glucosamine synthase~~~COG1215
MINRIVSFFILCLVLCIPLCVAYFHSGELMMRFVFFWPFFMSIMWIVGGVYFWVYRERHWPWGENAPAPQLKDNPSISII
IPCFNEEKNVEETIHAALAQRYENIEVIAVNDGSTDKTRAILDRMAAQIPHLRVIHLAQNQGKAIALKTGAAAAKSEYLV
CIDGDALLDRDAAAYIVEPMLYNPRVGAVTGNPRIRTRSTLVGKIQVGEYSSIIGLIKRTQRIYGNVFTVSGVIAAFRRS
ALAEVGYWSDDMITEDIDISWKLQLNQWTIFYEPRALCWILMPETLKGLWKQRLRWAQGGAEVFLKNMTRLWRKENFRMW
PLFFEYCLTTIWAFTCLVGFIIYAVQLAGVPLNIELTHIAATHTAGILLCTLCLLQFIVSLMIENRYEHNLTSSLFWIIW
FPVIFWMLSLATTLVSFTRVMLMPKKQRARWVSPDRGILRG
>P69432 ~~~pgaD~~~Biofilm PGA synthesis protein PgaD~~~COG3658
MNNLIITTRQSPVRLLVDYVATTILWTLFALFIFLFAMDLLTGYYWQSEARSRLQFYFLLAVANAVVLIVWALYNKLRFQ
KQQHHAAYQYTPQEYAESLAIPDELYQQLQKSHRMSVHFTSQGQIKMVVSEKALVRA
>O25249 ~~~pgbA~~~Plasminogen-binding protein PgbA~~~
MLRLLIGLLLMSFISLQSASWQEPLRVSIEFVDLPKKIIRFPAHDLQVGEFGFVVTKLSDYEIVNSEVVIIAVENGVATA
KFRAFESMKQRHLPTPRMVARKGDLVYFRQFNNQAFLIAPNDELYEQIRATNTDINFISSDLLVTFLNGFDPKIANLRKA
CNVYSVGVIYIVTTNTLNILSCESFEILEKRELDTSGVTKTSTPFFSRVEGIDAGTLGKLFSGSQSKNYFAYYDALVKKE
KRKEVRIKKREEKIDSREIKREIKQEAIKEPKKANQGTQNAPTLEEKNYQKAERKLDAKEERRYLRDERKKAKATKKAME
FEEREKEHDERDEQETEGRRKALEMDKGDKKEERVKPKENEREIKQEAIKEPSDGNNATQQGEKQNAPKENNAQKEENKP
NSKEEKRRLKEEKKKAKAEQRAREFEQRAREHQERDEKELEERRKALEAGKK
>O25534 ~~~pgbB~~~Plasminogen-binding protein PgbB~~~
MNKPFLILLIALIVFSGCNMRKYFKPAKHQIKGEAYFPNHLQESIVSSNRYGAILKNGAVIGDKGLTQLRIGKNFNYESS
FLNESQGFFILAQDCLNKIDKKTNKSKVAKTEETELKLKGVEAEVQDKVCHQVELISNNPNASQQSIVIPLETFALSASV
KGNLLAVVLADNSANLYDITSQKLLFSEKGSPSTTINSLMAMPIFMDTVVVFPMLDGRLLVVDYVHGNPTPIRNIVISSD
KFFNNITYLIVDGNNMIASTGKRILSVVSGQEFNYDGDIVDLLYDKGTLYVLTLDGQILQMDKSLRELNSVKLPSSLNTI
VLNHNKLYSLEKRGYVIEVDLNDFDSYNVYKTPTIGSFKFFSSNRLDKGVFYDKNRVYYDRYYLDYNDFKPKLYPVVEKS
ASKKSQKGEKGNAPIYLQERHKAKENKQPLEENKVKPRNSGFEEEEVKTRRPEPIRDQNNATQQGETKNNESKNAPVLKE
NAAKKEVPKPNSKEEKRRLKEEKKKAKAEQRAREFEQRAREHQERDEKELEERRKALEMNKK
>P18159 5.4.2.2~~~pgcA~~~Phosphoglucomutase~~~COG1109
MTWRKSYERWKQTEHLDLELKERLIELEGDEQALEDCFYKDLEFGTGGMRGEIGAGTNRMNIYTVRKASAGFAAYISKQG
EEAKKRGVVIAYDSRHKSPEFAMEAAKTLATQGIQTYVFDELRPTPELSFAVRQLNAYGGIVVTASHNPPEYNGYKVYGD
DGGQLPPKEADIVIEQVNAIENELTITVDEENKLKEKGLIKIIGEDIDKVYTEKLTSISVHPELSEEVDVKVVFTPLHGT
ANKPVRRGLEALGYKNVTVVKEQELPDSNFSTVTSPNPEEHAAFEYAIKLGEEQNADILIATDPDADRLGIAVKNDQGKY
TVLTGNQTGALLLHYLLSEKKKQGILPDNGVVLKTIVTSEIGRAVASSFGLDTIDTLTGFKFIGEKIKEYEASGQYTFQF
GYEESYGYLIGDFARDKDAIQAALLAVEVCAFYKKQGMSLYEALINLFNEYGFYREGLKSLTLKGKQGAEQIEAILASFR
QNPPQKMAGKQVVTAEDYAVSKRTLLTESKEEAIDLPKSNVLKYFLEDGSWFCLRPSGTEPKVKFYFAVKGSSLEDSEKR
LAVLSEDVMKTVDEIVESTAK
>Q2FVC1 5.4.2.2~~~pgcA~~~Phosphoglucomutase~~~COG1109
MKGCLATMDKELWIERANDSLVKHFYEQQSDIEQREGFESKLTFGTAGIRGKFGLGEGRLNKFTIEKLALGLARYLNAQT
NSPTIVIHYDIRHLSTEFAQIIANVLANHQITVYLPDTYKTTPELSFAVRNLNTTAGIMITASHNPKDYNGIKVYGSDGA
QLSTDASELASRYIEEVGDPLQIDIPISKQNTSYIKPFPKSVTDDYMKHIQNMIGYIPKSDLQVVFTSLHGTSVPIVPEL
LQSLNFNQFNLVEAQCKPDPNFSSVQSANPEDHRAFDQAVELANKSHADLLISTDPDADRLGIAERDAHGHITYFNGNQI
GALLLNYRIQQTSQLRHRLMIQSIVSSELTKSLARYNNVEYKEVLTGFKFIAQEIRQLDDHQNMIFAFEESYGFLSEPFV
RDKDAVQIVPLIIKYASELKLYGKTLKDELEQIYQTVGRHEDTLFSHTLEGFEGKKKINAIMTKFRSNPPQEIQGLKVKA
IEDYLTSEVYHLDKDTTSQINSPKSNVIRVLFDEGFIALRPSGTEPKIKLYVSLKCPNFDDVAQKINAMIFS
>Q7A3K7 5.4.2.2~~~pgcA~~~Phosphoglucomutase~~~
MKGCLATMDKELWIERANDSLVKHFYEQQSDIEQREGFESKLTFGTAGIRGKFGLGEGRLNKFTIEKLALGLARYLNAQT
NSPTIVIHYDIRHLSTEFAQIIANVLANHQITVYLPDTYKTTPELSFAVRNLNTTAGIMITASHNPKDYNGIKVYGSDGA
QLSTDASELASRYIEEVGDPLQIDIPISKQNTSYIKPFPKSVTDDYMKHIQNMIGYIPKSDLQVVFTSLHGTSVPIVPEL
LKSLNFNQFNLVDAQCKPDPNFSSVQSANPEDHRAFDQAVELANKSHADLLISTDPDADRLGIAERDAHGHITYFNGNQI
GALLLNYRIQQTSQLRHRLMIQSIVSSELTKSLARYNNVKYKEVLTGFKFIAQEIRQLDDHQNMIFAFEESYGFLSEPFV
RDKDAVQIVPLIIKYASELKLYGKTLKDELEQIYQTVGRHEDTLFSHTLEGLEGKKKIESIMTHFRSNPPQEIQGLKVKA
IEDYLTSEVYHLDKDTTSQINSSKSNVIRVLFDEGFIALRPSGTEPKIKLYVSLKCPDFDDVAQKINAMIFS
>Q81EK9 3.5.1.104~~~~~~Peptidoglycan-N-acetylglucosamine deacetylase BC_1960~~~
MYYFYSPEMFAPYQWGLERDVSYAYMPYNSFYYGDYINSLPYAYIPQNYEVQMKADDRGSWTPFSWVEKYAYAFSGPYNK
AEVALTFDDGPDLEFTPKILDKLKQHNVKATFFLLGENAEKFPNIVKRIANEGHVIGNHTYSHPNLAKVNEDEYRNQIIK
TEEILNRLAGYAPKFIRPPYGEILENQLKWATEQNFMIVQWSVDTVDWKGVSADTITNNVLGNSFPGSVILQHSTPGGHL
QGSVDALDKIIPQLKTKGARFVTLPSMFQTSKERK
>Q81EJ6 3.5.1.104~~~~~~Peptidoglycan-N-acetylglucosamine deacetylase BC_1974~~~
MEKALKIKQIVVVLIAIAAVAIGYYMFQSITSPAKAVAKQENVVQLASEQPKVEMNKTAPSRFNGKERKVAYLTFDDGPG
KYTAELLNTLKQHDAKATFFLIGANVKEFPDLVKRENAEGHYVGMHSMTHNFAKLYKNGEYVNEMKEDQGLIANIIGKSP
KLTRPPYGSMPGLNEGLRNKVVEGGFKVWDWTIDSLDWRYNKMPVDAAAAQIAQNVLTNATKPQEVILMHDIHPQSVAAV
PAILKGLKEKGYEFEAYHEESHFPVNFWHDNRM
>Q81AF4 3.5.1.104~~~~~~Peptidoglycan-N-acetylglucosamine deacetylase BC_3618~~~
MLLRKELEPTGYVTWEVPNNEKIIAITFDDGPDPTYTPQVLDLLRQYKAEATFFMIGFRVQRNPYLVKQVLKEGHEIGNH
TMNHLYASNSSDEKLENDILDGKKFFEKWVKEPLLFRPPGGYINDAVFKTAKEAGYQTVLWSWHQDPRDWANPGVESIVN
HVVKNAKSGDIVLLHDGGNDRSQTVAALAKILPELKKQGYRFVTVSELLRYKH
>B5ZA76 3.5.1.-~~~pgdA~~~Peptidoglycan deacetylase~~~
MAKEILVAYGVDIDAVAGWLGSYGGEDSPDDISRGLFAGEVGIPRLLKLFKKYHLPATWFVPGHSIETFPEQMKMIVDAG
HEVGAHGYSHENPIAMSTKQEEDVLLKSVELIKDLTGKAPTGYVAPWWEFSNITNELLLKHGFKYDHSLMHNDFTPYYVR
VGDSWSKIDYSLEAKDWMKPLIRGVETNLVEIPANWYLDDLPPMMFIKKSPNSFGFVSPRDIGQMWIDQFDWVYREMDYA
VFSMTIHPDVSARPQVLLMHEKIIEHINKHEGVRWVTFNEIADDFLKRNPRKK
>O25080 3.5.1.-~~~pgdA~~~Peptidoglycan deacetylase~~~COG0726
MAKEILVAYGVDIDAVAGWLGSYGGEDSPDDISRGLFAGEVGIPRLLKLFKKYHLPATWFSPGHSIETFSEQMKMIVDAG
HEVGAHGYSHENPIAMTAKQEEDVLLKSVELIKDLTGKAPTGYVAPWWEFSNITNELLLKHGFKYDHSLMHNDFTPYYVR
VGDSWSKIDYSLEAKDWMKPLIRGVETDLVEIPANWYLDDLPPMMFIKKSPNSFGFVSPHDIGQMWIDQFDWVYREMDYA
VFSMTIHPDVSARPQVLLMHEKIIEHINKHEGVRWVTFNEIADDFLKRNPRKK
>A0A0H3GDH9 3.5.1.104~~~pgdA~~~Peptidoglycan-N-acetylglucosamine deacetylase PgdA~~~
MKIRWIRLSLVAILIIAVVFIGVIGFQKYQFSKSRNKVIMQMDRLMKDQDGGNFRRLDKKENGVEIISYIPKTTEKKDNE
IIQKEIGKATDAEVKKLNRDKETQGIIFYTYQKHRMAEQAISYKAVQSEYVKEGRTKFVLKDKKDICKNIVTDAETGALL
TLGEVLIKSNQTKLNLKTAVEEELIKTGDFSLKDVGNLGKIKSLVKWNQTDFEITNSEIILPVKIPGAPEPKKVKVKLAD
IASSVNKRYLPSSVKVPEVPKAKTNKRIALTFDDGPSSSVTPGVLDTLKRHNVKATFFVLGSSVIQNPGLVKRELEEGHQ
VGSHSWDHPQLTKQSTQEVYNQILKTQKAVFDQTGYFPTTMRPPYGAVNKQVAEEIGLPIIQWSVDTEDWKYRNAGIVTK
KVLAGATDGAIVLMHDIHKTTAASLDTTLTKLKSQGYEFVTIDELYGEKLQIGKQYFDKTDSRMVK
>A0A3Q0NBH7 3.5.1.104~~~pgdA~~~Peptidoglycan-N-acetylglucosamine deacetylase PgdA~~~
MKIRWIRLSLVAILIIAVVFIGVIGFQKYQFSKSRNKVIMQMDRLMKDQDGGNFRRLDKKENGVEIISYIPKTTEKKDNE
IIQKEIGKATDAEVKKLNRDKETQGIIFYTYQKHRMAEQAISYKAVQSEYVKEGRTKFVLKDKKDICKNIVTDAETGALL
TLGEVLIKSNQTKLNLKTAVEEELIKTGDFSLKDVGNLGKIKSLVKWNQTDFEITNSEIILPVKIPGAPEPKKVKVKLAD
IASSVNKRYLPSSVKVPEVPKAKTNKRIALTFDDGPSSSVTPGVLDTLKRHNVKATFFVLGSSVIQNPGLVKRELEEGHQ
VGSHSWDHPQLTKQSTQEVYNQILKTQKAVFDQTGYFPTTMRPPYGAVNKQVAEEIGLPIIQWSVDTEDWKYRNAGIVTK
KVLAGATDGAIVLMHDIHKTTAASLDTTLTKLKSQGYEFVTIDELYGEKLQIGKQYFDKTDSRMVK
>Q8Y9V5 3.5.1.104~~~pgdA~~~Peptidoglycan-N-acetylglucosamine deacetylase PgdA~~~COG0726
MKIRWIRLSLVAILIIAVVFIGVIGFQKYQFSKSRNKVIMQMDRLMKDQDGGNFRRLDKKENGVEIISYIPKTTEKKDNE
IIQKEIGKATDAEVKKLNRDKETQGIIFYTYQKHRMAEQAISYKAVQSEYVKEGRTKFVLKDKKDICKNIVTDAETGALL
TLGEVLIKSNQTKLNLKTAVEEELIKTGDFSLKDVGNLGKIKSLVKWNQTDFEITNSEIILPVKIPGAPEPKKVKVKLAD
IASSVNKRYLPSSVKVPEVPKAKTNKRIALTFDDGPSSSVTPGVLDTLKRHNVKATFFVLGSSVIQNPGLVKRELEEGHQ
VGSHSWDHPQLTKQSTQEVYNQILKTQKAVFDQTGYFPTTMRPPYGAVNKQVAEEIGLPIIQWSVDTEDWKYRNAGIVTK
KVLAGATDGAIVLMHDIHKTTAASLDTTLTKLKSQGYEFVTIDELYGEKLQIGKQYFDKTDSRMVK
>Q8DP63 3.5.1.104~~~pgdA~~~Peptidoglycan-N-acetylglucosamine deacetylase~~~COG0726
MNKSRLGRGRHGKTRHVLLALIGILAISICLLGGFIAFKIYQQKSFEQKIESLKKEKDDQLSEGNQKEHFRQGQAEVIAY
YPLQGEKVISSVRELINQDVKDKLESKDNLVFYYTEQEESGLKGVVNRNVTKQIYDLVAFKIEETEKTSLGKVHLTEDGQ
PFTLDQLFSDASKAKEQLIKELTSFIEDKKIEQDQSEQIVKNFSDQDLSAWNFDYKDSQIILYPSPVVENLEEIALPVSA
FFDVIQSSYLLEKDAALYQSYFDKKHQKVVALTFDDGPNPATTPQVLETLAKYDIKATFFVLGKNVSGNEDLVKRIKSEG
HVVGNHSWSHPILSQLSLDEAKKQITDTEDVLTKVLGSSSKLMRPPYGAITDDIRNSLDLSFIMWDVDSLDWKSKNEASI
LTEIQHQVANGSIVLMHDIHSPTVNALPRVIEYLKNQGYTFVTIPEMLNTRLKAHELYYSRDE
>P96740 3.4.19.-~~~pgdS~~~Gamma-DL-glutamyl hydrolase~~~COG0791
MNTLANWKKFLLVAVIICFLVPIMTKAEIAEADTSSELIVSEAKNLLGYQYKYGGETPKEGFDPSGLIQYVFSKADIHLP
RSVNDQYKIGTAVKPENLKPGDILFFKKEGSTGTVPTHDALYIGDGQMVHSTQSKGVIITNYKKSSYWSGTYIGARRIAA
DPATADVPVVQEAEKYIGVPYVFGGSTPSEGFDCSGLVQYVFQQALGIYLPRSAEQQWAVGEKVAPQNIKPGDVVYFSNT
YKTGISHAGIYAGAGRFIQASRSEKVTISYLSEDYWKSKMTGIRRFDNLTIPKENPIVSEATLYVGEVPYKQGGVTPETG
FDTAGFVQYVYQKAAGISLPRYATSQYNAGTKIEKADLKPGDIVFFQSTSLNPSIYIGNGQVVHVTLSNGVTITNMNTST
YWKDKYAGSIRVQ
>P36204 ~~~pgk/tpi~~~Bifunctional PGK/TIM~~~COG0126
MEKMTIRDVDLKGKRVIMRVDFNVPVKDGVVQDDTRIRAALPTIKYALEQGAKVILLSHLGRPKGEPSPEFSLAPVAKRL
SELLGKEVKFVPAVVGDEVKKAVEELKEGEVLLLENTRFHPGETKNDPELAKFWASLADIHVNDAFGTAHRAHASNVGIA
QFIPSVAGFLMEKEIKFLSKVTYNPEKPYVVVLGGAKVSDKIGVITNLMEKADRILIGGAMMFTFLKALGKEVGSSRVEE
DKIDLAKELLEKAKEKGVEIVLPVDAVIAQKIEPGVEKKVVRIDDGIPEGWMGLDIGPETIELFKQKLSDAKTVVWNGPM
GVFEIDDFAEGTKQVALAIAALTEKGAITVVGGGDSAAAVNKFGLEDKFSHVSTGGGASLEFLEGKELPGIASIADKKKI
TRKLILAGNWKMHKTISEAKKFVSLLVNELHDVKEFEIVVCPPFTALSEVGEILSGRNIKLGAQNVFYEDQGAFTGEISP
LMLQEIGVEYVIVGHSERRRIFKEDDEFINRKVKAVLEKGMTPILCVGETLEEREKGLTFCVVEKQVREGFYGLDKEEAK
RVVIAYEPVWAIGTGRVATPQQAQEVHAFIRKLLSEMYDEETAGSIRILYGGSIKPDNFLGLIVQKDIDGGLVGGASLKE
SFIELARIMRGVIS
>Q81X75 2.7.2.3~~~pgk~~~Phosphoglycerate kinase~~~COG0126
MNKKSIRDVDLKGKRVFCRVDFNVPMKEGKITDETRIRAALPTIQYLVEQGAKVILASHLGRPKGQAVEELRLTPVAARL
GELLGKDVKKADEAFGPVAQEMVAAMNEGDVLVLENVRFYAGEEKNDAELAKEFAALADIFVNDAFGAAHRAHASTAGIA
DYLPAVSGLLMEKELEVLGKALSNPERPFTAIIGGAKVKDKIGLIRHLLDKVDNLIIGGGLAYTFVKALGHEIGLSLCED
DKIELAKEFMQLAKEKGVNFYMPVDVVITEEFSETATTKIVGIDSIPSNWEGVDIGPKTREIYADVIKNSKLVVWNGPMG
VFEMTPFAEGTKAVGQALADAEGTYSVIGGGDSAAAVEKFGMADKMSHISTGGGASLEFMEGKELPGVVCLNDK
>P40924 2.7.2.3~~~pgk~~~Phosphoglycerate kinase~~~COG0126
MNKKTLKDIDVKGKVVFCRVDFNVPMKDGEVTDDTRIRAALPTIKHLADQGAKVLLASHLGRPKGEVVEELRLTPVAARL
GELLGKEVKKADEAYGDAVKAQISEMKDGDVLVLENVRFYPGEEKNDPELAKAFAELADVYVNDAFGAAHRAHASTAGIA
EHLPAVAGFLMEKELDVLGKAVSNPDRPFTAIIGGAKVKDKIGVIESLLDKVDNLIIGGGLAYTFVKALGYEVGKSLLEE
DKIELAKSFMDRAKEKGVNFYMPEDVLVADDFSNDANVKIVPISEIPSDLEAIDIGTKTRETYADVIKNSKLVVWNGPMG
VFEIDLFAQGTKAVAEALAEAKDTYSVIGGGDSAAAVEKFGLADKMSHISTGGGASLEFMEGKELPGVAALNDK
>Q9PMQ5 2.7.2.3~~~pgk~~~Phosphoglycerate kinase~~~COG0126
MSDIISIKDIDLAKKKVFIRCDFNVPQDDFLNITDDRRIRSAIPTIRYCLDNGCSVILASHLGRPKEISSKYSLEPVAKR
LARLLDKEIVMAKDVIGEDAKTKAMNLKAGEILLLENLRFEKGETKNDENLAKELASMVQVYINDAFGVCHRAHSSVEAI
TKFFDEKHKGAGFLLQKEIDFASNLIKHPARPFVAVVGGSKVSGKLQALTNLLPKVDKLIIGGGMAFTFLKALGYDIGNS
LLEEELLEEANKILTKGKNLGVKIYLPVDVVAAPACSQDVPMKFVPAQEIPNGWMGLDIGPASVRLFKEVISDAQTIWWN
GPMGVFEIDKFSKGSIKMSHYISEGHATSVVGGGDTADVVARAGDADEMTFISTGGGASLELIEGKELPGVKALRSKENE
>Q83AU6 2.7.2.3~~~pgk~~~Phosphoglycerate kinase~~~COG0126
MSNLNLHNKRVMIREDLNVPMKNGKITNDERIVRALPTIQKAIEQKARVMILSHLGRPEEGKFEKEFSLAPVARLLSKKL
NQKVPLINDWLKGVAVEPGQAILCENVRFNKGENENNTELAKRMAELCDIFVMDAFATAHRAQASTAGVAAYAKLACAGP
LLISEVEALSRALENPQKPLVAVVGGSKVSTKIHLLENLLDKVDQLIVGGGIANTFLKAQGYSIGKSLCENEWLDAAQQF
WEKAAEKNVSLPLPVDVIVADELSEDAKATVKNIDAVTSNESIFDVGPNTSATYAKLMAQAGTIVWNGPIGVFEIEAFSQ
GTRALAQAVAKSTAYSIVGGGDTLAALDKFNLTDQMSYVSTAGGAFLEFLEGKILPAIKILTQRAKEYEQK
>P0A799 2.7.2.3~~~pgk~~~Phosphoglycerate kinase~~~COG0126
MSVIKMTDLDLAGKRVFIRADLNVPVKDGKVTSDARIRASLPTIELALKQGAKVMVTSHLGRPTEGEYNEEFSLLPVVNY
LKDKLSNPVRLVKDYLDGVDVAEGELVVLENVRFNKGEKKDDETLSKKYAALCDVFVMDAFGTAHRAQASTHGIGKFADV
ACAGPLLAAELDALGKALKEPARPMVAIVGGSKVSTKLTVLDSLSKIADQLIVGGGIANTFIAAQGHDVGKSLYEADLVD
EAKRLLTTCNIPVPSDVRVATEFSETAPATLKSVNDVKADEQILDIGDASAQELAEILKNAKTILWNGPVGVFEFPNFRK
GTEIVANAIADSEAFSIAGGGDTLAAIDLFGIADKISYISTGGGAFLEFVEGKVLPAVAMLEERAKK
>Q5NF76 2.7.2.3~~~pgk~~~Phosphoglycerate kinase~~~COG0126
MSFLTLKDVDLKDKKVLVRVDFNVPVKDGKVTSKVRIEAAIPTIQYILDQGGAVILMSHLGRPTEGEYDSQFSLEPVAKA
LSEIINKPVKFAKDWLDGVDVKAGEIVMCENVRFNSGEKKSTDDLSKKIASLGDVFVMDAFATAHRAQASTYGVAKYIPV
ACAGILLTNEIQALEKALKSPKKPMAAIVGGSKVSTKLSVLNNLLDKVEILIVGGGIANTFIKAEGFDVGNSLYEQDLVA
EATEILAKAKALGVNIPVPVDVRVAKEFSENAQAIIKKVSDVVADEMILDIGPESQKIIAELLKSANTILWNGPVGVFEF
DNFAEGTKALSLAIAQSHAFSVAGGGDTIAAIEKFGIKDQVSYISTAGGAFLEFLEGKKLPAIEILKEKAIR
>P18912 2.7.2.3~~~pgk~~~Phosphoglycerate kinase~~~
MNKKTIRDVDVRGKRVFCRVDFNVPMEQGAITDDTRIRAALPTIRYLIEHGAKVILASHLGRPKGKVVEELRLDAVAKRL
GELLERPVAKTNEAVGDEVKAAVDRLNEGDVLLLENVRFYPGEEKNDPELAKAFAELADLYVNDAFGAAHRAHASTEGIA
HYLPAVAGFLMEKELEVLGKALSNPDRPFTAIIGGAKVKDKIGVIDNLLEKVDNLIIGGGLAYTFVKALGHDVGKSLLEE
DKIELAKSFMEKAKEKGVRFYMPVDVVVADRFANDANTKVVPIDAIPADWSALDIGPKTRELYRDVIRESKLVVWNGPMG
VFEMDAFAHGTKAIAEALAEALDTYSVIGGGDSAAAVEKFGLADKMDHISTGGGASLEFMEGKQLPGVVALEDK
>P43726 2.7.2.3~~~pgk~~~Phosphoglycerate kinase~~~COG0126
MSVIKMTDLDLAGKRVFIRADLNVPVKDGKVTSDARIRATIPTLKLALEKGAKVMVTSHLGRPTEGEFKPEDSLQPVVDY
LKNAGFNVRLEQDYLNGVDVKDGEIVVLENVRVNKGEKKNDPELGKKYAALCDVFVMDAFGTAHRAQASTYGVAEFAPIA
CAGPLLAAELDALGKALKEPARPMVAIVGGSKVSTKLEVLNSLSKIADQIIVGGGIANTFIAAAGHNVGKSLYEADLIPV
AKELAANTDIPVPVDVRVGLEFSETAAATEKVVNEVKDDESIFDIGDKSAEQLAEIIKNAKTVLWNGPVGVFEFPHFRKG
TEIISHAIANSDAFSIAGGGDTLAAIDLFGIADKISYISTGGGAFLEFVEGKVLPAVEILEKRAKN
>A0QHY4 2.7.2.3~~~pgk~~~Phosphoglycerate kinase~~~
MAVHNLKDLLAEGVSGRGVLVRSDLNVPLDSDGEQGRITDPGRITASVPTLSALVEAGAKVVVAAHLGRPKNGPDPALSL
APVAAALGEQLGRHVQLASDVVGTDALARAEGLTDGDVLLLENIRFDARETSKDDAERLALARQLAELVGPTGAFVSDGF
GVVHRKQASVYDVATLLPHYAGTLVAEEIAVLEQLTGSTKRPYAVVLGGSKVSDKLGVIESLATKADSIVIGGGMCFTFL
AAQGFSVGKSLLETEMVDTCRRLLDTYVDVLRLPVDIVAADRFAADAAPQTVPADAIPDDLMGLDIGPGSVKRFTALLSN
AETIFWNGPMGVFEFPAFAAGTKGLAEAIAAATGKGAFSVVGGGDSAAAVRALGIPESGFSHISTGGGASLEYLEGKALP
GIEVLGRPQPTGGAA
>A0QWW3 2.7.2.3~~~pgk~~~Phosphoglycerate kinase~~~COG0126
MSVKTLDDLLAEGVQGRGVLVRSDLNVPLDDDGNITDPGRVIASVPTLQALAEAGAKVIVTAHLGRPKGEPDPKLSLAPV
AAALGEKLGRHVQLAGDVVGTDALARAEGLTDGDVLLLENIRFDARETSKDDSERLSLAKALAALVEGPDGSPGVFVSDG
FGVVHRKQASVYDVATLLPHYAGTLVAAEVKVLQQLTSSTDRPYAVVLGGSKVSDKLAVIENLATKADSLIIGGGMCFTF
LAAQGFSVGSSLLQEEMVDTCRRLLDEYADVIHLPVDIVVADKFAADAEAETVAADRIPDGKMGLDIGPGSVERFTALLS
NAKTVFWNGPMGVFEFPAFAAGTKGVAEAIIGATGKGAFSVVGGGDSAAAVRRLGLPEDGFSHISTGGGASLEYLEGKEL
PGIQVLES
>P9WID1 2.7.2.3~~~pgk~~~Phosphoglycerate kinase~~~COG0126
MSVANLKDLLAEGVSGRGVLVRSDLNVPLDEDGTITDAGRIIASAPTLKALLDADAKVVVAAHLGRPKDGPDPTLSLAPV
AVALGEQLGRHVQLAGDVVGADALARAEGLTGGDILLLENIRFDKRETSKNDDDRRALAKQLVELVGTGGVFVSDGFGVV
HRKQASVYDIATLLPHYAGTLVADEMRVLEQLTSSTQRPYAVVLGGSKVSDKLGVIESLATKADSIVIGGGMCFTFLAAQ
GFSVGTSLLEDDMIEVCRGLLETYHDVLRLPVDLVVTEKFAADSPPQTVDVGAVPNGLMGLDIGPGSIKRFSTLLSNAGT
IFWNGPMGVFEFPAYAAGTRGVAEAIVAATGKGAFSVVGGGDSAAAVRAMNIPEGAFSHISTGGGASLEYLEGKTLPGIE
VLSREQPTGGVL
>Q8YPR1 2.7.2.3~~~pgk~~~Phosphoglycerate kinase~~~COG0126
MSKKTVASLSAADISGKRALVRVDFNVPLDDQGNITDDTRIRAALPTIQDLTQKGAKVILASHFGRPKGVDEKLRLTPVA
KRLSELLGQEVIKTDDSIGDEVAAKVATLQNGQVLLLENVRFYKEEEKNDPEFAKKLAANADFYVNDAFGTAHRAHASTE
GVTKFLSPSVAGYLVEKELQYLQSAIENPQRPLAAIIGGSKVSSKIGVIETLLEKCDKLIIGGGMIFTFYKARGLNVGKS
LVEEDKLELAKSLEAKAKERGVSLLLPTDVVLADNFAPDANSQTVSIENIPDGWMGLDIGPDSVKVFQAALADTKTVIWN
GPMGVFEFDKFAAGTEAIAHTLAEIGKTGTTTIIGGGDSVAAVEKVGLADQMSHISTGGGASLELLEGKVLPGIAALDEA
>P99135 2.7.2.3~~~pgk~~~Phosphoglycerate kinase~~~
MAKKIVSDLDLKGKTVLVRADFNVPLKDGEITNDNRIVQALPTIQYIIEQGGKIVLFSHLGKVKEESDKAKLTLRPVAED
LSKKLDKEVVFVPETRGEKLEAAIKDLKEGDVLLVENTRYEDLDGKKESKNDPELGKYWASLGDVFVNDAFGTAHREHAS
NVGISTHLETAAGFLMDKEIKFIGGVVNDPHKPVVAILGGAKVSDKINVIKNLVNIADKIIIGGGMAYTFLKAQGKEIGI
SLLEEDKIDFAKDLLEKHGDKIVLPVDTKVAKEFSNDAKITVVPSDSIPADQEGMDIGPNTVKLFADELEGAHTVVWNGP
MGVFEFSNFAQGTIGVCKAIANLKDAITIIGGGDSAAAAISLGFENDFTHISTGGGASLEYLEGKELPGIKAINNK
>Q6GIL7 2.7.2.3~~~pgk~~~Phosphoglycerate kinase~~~
MAKKIVSDLDLKGKTVLVRADFNVPLKDGEITNDNRIVQALPTIQYIIEQGGKIVLFSHLGKVKEESDKAKLTLRPVAED
LSKKLDKEVVFVPETRGEKLEAAIKDLKEGDVLLVENTRYEDLDGKKESKNDPELGKYWASLGDVFVNDAFGTAHREHAS
NVGISTHLETAAGFLMDKEIKFIGGVVNDPHKPVVAILGGAKVSDKINVIKNLVNIADKIIIGGGMAYTFLKAQGKEIGI
SLLEEDKIDFAKDLLEKHGDKIVLPVDTKVAKEFSNDAKITVVPSDSIPADQEGMDIGPNTVKLFADELEGAHTVVWNGP
MGVFEFSNFAQGTIGVCKAIANLKDAITIIGGGDSAAAAISLGFENDFTHISTGGGASLEYLEGKELPGIKAINNK
>Q04LZ5 2.7.2.3~~~pgk~~~Phosphoglycerate kinase~~~COG0126
MAKLTVKDVDLKGKKVLVRVDFNVPLKDGVITNDNRITAALPTIKYIIEQGGRAILFSHLGRVKEESDKAGKSLAPVAAD
LAAKLGQDVVFPGVTRGAELEAAINALEDGQVLLVENTRYEDVDGKKESKNDPELGKYWASLGDGIFVNDAFGTAHRAHA
SNVGISANVEKAVAGFLLENEIAYIQEAVETPERPFVAILGGSKVSDKIGVIENLLEKADKVLIGGGMTYTFYKAQGIEI
GNSLVEEDKLDVAKALLEKANGKLILPVDSKEANAFAGYTEVRDTEGEAVSEGFLGLDIGPKSIAKFDEALTGAKTVVWN
GPMGVFENPDFQAGTIGVMDAIVKQPGVKSIIGGGDSAAAAINLGRADKFSWISTGGGASMELLEGKVLPGLAALTEK
>Q5XA18 2.7.2.3~~~pgk~~~Phosphoglycerate kinase~~~
MAKLTVKDVDLKGKKVLVRVDFNVPLKDGVITNDNRITAALPTIKYIIEQGGRAILFSHLGRVKEEADKEGKSLAPVAAD
LAAKLGQDVVFPGVTRGSKLEEAINALEDGQVLLVENTRFEDVDGKKESKNDEELGKYWASLGDGIFVNDAFGTAHRAHA
SNVGISANVEKAVAGFLLENEIAYIQEAVETPERPFVAILGGSKVSDKIGVIENLLEKADKVLIGGGMTYTFYKAQGIEI
GNSLVEEDKLDVAKDLLEKSNGKLILPVDSKEANAFAGYTEVRDTEGEAVSEGFLGLDIGPKSIAEFDQALTGAKTVVWN
GPMGVFENPDFQAGTIGVMDAIVKQPGVKSIIGGGDSAAAAINLGRADKFSWISTGGGASMELLEGKVLPGLAALTEK
>Q8DQX8 2.7.2.3~~~pgk~~~Phosphoglycerate kinase~~~COG0126
MAKLTVKDVDLKGKKVLVRVDFNVPLKDGVITNDNRITAALPTIKYIIEQGGRAILFSHLGRVKEESDKAGKSLAPVAAD
LAAKLGQDVVFPGVTRGAELEAAINALEDGQVLLVENTRYEDVDGKKESKNDPELGKYWASLGDGIFVNDAFGTAHRAHA
SNVGISANVEKAVAGFLLENEIAYIQEAVETPERPFVAILGGSKVSDKIGVIENLLEKADKVLIGGGMTYTFYKAQGIEI
GNSLVEEDKLDVAKALLEKANGKLILPVDSKEANAFAGYTEVRDTEGEAVSEGFLGLDIGPKSIAKFDEALTGAKTVVWN
GPMGVFENPDFQAGTIGVMDAIVKQPGVKSIIGGGDSAAAAINLGRADKFSWISTGGGASMELLEGKVLPGLAALTEK
>P09403 2.7.2.3~~~pgk~~~Phosphoglycerate kinase~~~COG0126
MRTLLDLDPKGKRVLVRVDYNVPVQDGKVQDETRILESLPTLRHLLAGGASLVLLSHLGRPKGPDPKYSLAPVGEALRAH
LPEARFAPFPPGSEEARREAEALRPGEVLLLENVRFEPGEEKNDPELSARYARLGEAFVLDAFGSAHRAHASVVGVARLL
PAYAGFLMEKEVRALSRLLKDPERPYAVVLGGAKVSDKIGVIESLLPRIDRLLIGGAMAFTFLKALGGEVGRSLVEEDRL
DLAKDLLGRAEALGVRVYLPEDVVAAERIEAGVETRVFPARAIPVPYMGLDIGPKTREAFARALEGARTVFWNGPMGVFE
VPPFDEGTLAVGQAIAALEGAFTVVGGGDSVAAVNRLGLKERFGHVSTGGGASLEFLEKGTLPGLEVLEG
>Q0P9C9 2.4.1.290~~~pglA~~~N,N'-diacetylbacillosaminyl-diphospho-undecaprenol alpha-1,3-N-acetylgalactosaminyltransferase~~~COG0438
MRIGFLSHAGASIYHFRMPIIKALKDRKDEVFVIVPQDEYTQKLRDLGLKVIVYEFSRASLNPFVVLKNFFYLAKVLKNL
NLDFIQSAAHKSNTFGILAAKWAKIPYRFALVEGLGSFYIDQGFKANLVRFVINSLYKLSFKFAHQFIFVNESNAEFMRN
LGLKENKICVIKSVGINLKKFFPIYVESEKKELFWKNLNIDKKPIVLMIARALWHKGVKEFYESATMLKDKANFVLVGGR
DENPSCASLEFLNSGAVHYLGARSDIVELLQNCDIFVLPSYKEGFPVSVLEAKACGKAIVVSDCEGCVEAISNAYDGLWA
KTKNAKDLSEKISLLLEDEKLRLNLAKNAAQDALQYDENIIAQRYLKLYDRVIKNV
>Q0P9C8 2.4.99.19~~~pglB~~~Undecaprenyl-diphosphooligosaccharide--protein glycotransferase~~~COG1287
MLKKEYLKNPYLVLFAMIVLAYVFSVFCRFYWVWWASEFNEYFFNNQLMIISNDGYAFAEGARDMIAGFHQPNDLSYYGS
SLSTLTYWLYKITPFSFESIILYMSTFLSSLVVIPIILLANEYKRPLMGFVAALLASVANSYYNRTMSGYYDTDMLVIVL
PMFILFFMVRMILKKDFFSLIALPLFIGIYLWWYPSSYTLNVALIGLFLIYTLIFHRKEKIFYIAVILSSLTLSNIAWFY
QSAIIVILFALFALEQKRLNFMIIGILGSATLIFLILSGGVDPILYQLKFYIFRSDESANLTQGFMYFNVNQTIQEVENV
DFSEFMRRISGSEIVFLFSLFGFVWLLRKHKSMIMALPILVLGFLALKGGLRFTIYSVPVMALGFGFLLSEFKAILVKKY
SQLTSNVCIVFATILTLAPVFIHIYNYKAPTVFSQNEASLLNQLKNIANREDYVVTWWDYGYPVRYYSDVKTLVDGGKHL
GKDNFFPSFSLSKDEQAAANMARLSVEYTEKSFYAPQNDILKSDILQAMMKDYNQSNVDLFLASLSKPDFKIDTPKTRDI
YLYMPARMSLIFSTVASFSFINLDTGVLDKPFTFSTAYPLDVKNGEIYLSNGVVLSDDFRSFKIGDNVVSVNSIVEINSI
KQGEYKITPIDDKAQFYIFYLKDSAIPYAQFILMDKTMFNSAYVQMFFLGNYDKNLFDLVINSRDAKVFKLKI
>Q5HTX9 2.4.99.19~~~pglB~~~Undecaprenyl-diphosphooligosaccharide--protein glycotransferase~~~
MLKKEYLKNPYLVLFAMIILAYVFSVLCRFYWIWWASEFNEYFFNNQLMIISNDGYAFAEGARDMIAGFHQPNDLSYYGS
SLSTLTYWLYKITPFSFESIILYMSTFLSSLVVIPIILLANEYKRPLMGFVAALLASVANSYYNRTMSGYYDTDMLVIVL
PMFILFFMVRMILKKDFFSLIALPLFIGIYLWWYPSSYTLNVALIGLFLIYTLIFHRKEKIFYIAVILSSLTLSNIAWFY
QSAIIVILFALFALEQKRLNFMIIGILGSATLIFLILSGGVDPILYQLKFYIFRNDESANLTQGFMYFNVNQTIQEVENV
DFSEFMRRISGSEIVFLFSLFGFVWLLRKHKSMIMALPILVLGFLALKGGLRFTIYSVPVMALGFGFLLSEFKAILVKKY
SQLTSNVCIVFATILTLAPVFIHIYNYKAPTVFSQNEASLLNQLKNIANREDYVVTWWDYGYPVRYYSDVKTLVDGGKHL
GKDNFFPSFALSKDEQAAANMARLSVEYTEKSFYAPQNDILKSDILQAMMKDYNQSNVDLFLASLSKPDFKIDTPKTRDI
YLYMPARMSLIFSTVASFSFINLDTGVLDKPFTFSTAYPLDVKNGEIYLSNGVVLSDDFRSFKIGDNVVSVNSIVEINSI
KQGEYKITPIDDKAQFYIFYLKDSAIPYAQFILMDKTMFNSAYVQMFFLGNYDKNLFDLVINSRDAKVFKLKI
>B9KDD4 2.4.99.19~~~pglB~~~Undecaprenyl-diphosphooligosaccharide--protein glycotransferase~~~COG1287
MKLQQNFTDNNSIKYTCILILIAFAFSVLCRLYWVAWASEFYEFFFNDQLMITTNDGYAFAEGARDMIAGFHQPNDLSYF
GSSLSTLTYWLYSILPFSFESIILYMSAFFASLIVVPIILIAREYKLTTYGFIAALLGSIANSYYNRTMSGYYDTDMLVL
VLPMLILLTFIRLTINKDIFTLLLSPVFIMIYLWWYPSSYSLNFAMIGLFGLYTLVFHRKEKIFYLTIALMIIALSMLAW
QYKLALIVLLFAIFAFKEEKINFYMIWALIFISILILHLSGGLDPVLYQLKFYVFKASDVQNLKDAAFMYFNVNETIMEV
NTIDPEVFMQRISSSVLVFILSFIGFILLCKDHKSMLLALPMLALGFMALRAGLRFTIYAVPVMALGFGYFLYAFFNFLE
KKQIKLSLRNKNILLILIAFFSISPALMHIYYYKSSTVFTSYEASILNDLKNKAQREDYVVAWWDYGYPIRYYSDVKTLI
DGGKHLGKDNFFSSFVLSKEQIPAANMARLSVEYTEKSFKENYPDVLKAMVKDYNKTSAKDFLESLNDKDFKFDTNKTRD
VYIYMPYRMLRIMPVVAQFANTNPDNGEQEKSLFFSQANAIAQDKTTGSVMLDNGVEIINDFRALKVEGASIPLKAFVDI
ESITNGKFYYNEIDSKAQIYLLFLREYKSFVILDESLYNSSYIQMFLLNQYDQDLFEQITNDTRAKIYRLKR
>Q0P9D0 2.7.8.36~~~pglC~~~Undecaprenyl phosphate N,N'-diacetylbacillosamine 1-phosphate transferase~~~COG2148
MYEKVFKRIFDFILALVLLVLFSPVILITALLLKITQGSVIFTQNRPGLDEKIFKIYKFKTMSDERDEKGELLSDELRLK
AFGKIVRSLSLDELLQLFNVLKGDMSFVGPRPLLVEYLPLYNKEQKLRHKVRPGITGWAQVNGRNAISWQKKFELDVYYV
KNISFLLDLKIMFLTALKVLKRSGVSKEGHVTTEKFNGKN
>Q0P9D1 2.3.1.203~~~pglD~~~UDP-N-acetylbacillosamine N-acetyltransferase~~~COG0110
MARTEKIYIYGASGHGLVCEDVAKNMGYKECIFLDDFKGMKFESTLPKYDFFIAIGNNEIRKKIYQKISENGFKIVNLIH
KSALISPSAIVEENAGILIMPYVVINAKAKIEKGVILNTSSVIEHECVIGEFSHVSVGAKCAGNVKIGKNCFLGINSCVL
PNLSLADDSILGGGATLVKNQDEKGVFVGVPAKRM
>Q0P9D3 2.6.1.34~~~pglE~~~UDP-N-acetylbacillosamine transaminase~~~COG0399
MRFFLSPPHMGGNELKYIEEVFKSNYIAPLGEFVNRFEQSVKAYSKSENALALNSATAALHLALRVAGVKQDDIVLASSF
TFIASVAPICYLKAKPVFIDCDETYNIDVDLLKLAIKECEKKPKALILTHLYGNAAKMDEIVEICKENEIVLIEDAAEAL
GSFYKNKALGTFGEFGAYSYNGNKIITTSGGGMLIGKNKEKIEKARFYSTQARENCLHYEHLDYGYNYRLSNVLGAIGVA
QMEVLEQRVLKKREIYEWYKEFLGECFSFLDELENSRSNRWLSTALIDFDKNELNSCQKDINISQKNITLHPKISKLIED
LKNEQIETRPLWKAMHAQEVFKGAKAYLNGNSELFFQKGICLPSGTAMSKDDVYEISKLILKSIKA
>Q0P9D4 4.2.1.135~~~pglF~~~UDP-N-acetyl-alpha-D-glucosamine C6 dehydratase~~~COG1086
MIFYKSKRLAFFLTSDIVLILLSVYLAFSLRFSGDIPSIFYHGMMVSAIILLVLKLSFLFVFRIYKVAWRFFSLNEARKI
FIALLLAEFCFFLIFYFFSDFFNPFPRSAIVIDFVLSYMFIGTLRISKRMLVDFKPSRMKEEETPCIVVGATSKALHLLK
GAKEGSLGLFPVGVVDARKELIGTYCDKFIVEEKEKIKSYVEQGVKTAIIALRLEQEELKKLFEELVAYGICDVKIFSFT
RNEARDISIEDLLARKPKDLDDSAVAAFLKDKVVLVSGAGGTIGSELCKQCIKFGAKHLIMVDHSEYNLYKINDDLNLYK
EKITPILLSILDKQSLDEVLKTYKPELILHAAAYKHVPLCEQNPHSAVINNILGTKILCDSAKENKVAKFVMISTDKAVR
PTNIMGCTKRVCELYTLSMSDENFEVACVRFGNVLGSSGSVIPKFKAQIANNEPLTLTHPDIVRYFMLVAEAVQLVLQAG
AIAKGGELFVLDMGKPVKIIDLAKKMLLLSNRNDLEIKITGLRKGEKLYEELLIDENDAKTQYESIFVAKNEKVDLDWLN
KEIENLQICEDISEALLKIVPEFKHNKEGV
>Q0P9C5 2.4.1.292~~~pglH~~~GalNAc-alpha-(1->4)-GalNAc-alpha-(1->3)-diNAcBac-PP-undecaprenol alpha-1,4-N-acetyl-D-galactosaminyltransferase~~~COG0438
MMKISFIIATLNSGGAERALVTLANALCKEHEVSIIKFHAGESFYKLENEVKVTSLEQFRFDTLYHKIASRFKKFFALRK
ALKESKSDVFISFLDTTNIACIAAKIGLKTPLIISEHSNEAYLKPKIWRFLRRVSYPFCDALSVLGSSDKVYYERFVKRV
KLLLNPCHFSDEISFDSSFEKENLVLFIGRLDHNKNPVMFLKAIAHLDKNLQENYKFVIAGDGQLRQELEYKVKSLGIKV
DFLGRVENVKALYEKAKVLCLCSFVEGLPTVLIESLYFEVCRISSSYYNGAKDLIKDNHDGLLVGCDDEIALAKKLELVL
NDENFRKELVNNAKQRCKDFEISHIKEEWLKLIAEVKNA
>Q0P9C6 2.4.1.293~~~pglI~~~GalNAc(5)-diNAcBac-PP-undecaprenol beta-1,3-glucosyltransferase~~~COG1215
MPKLSVIVPTFNRQVLLEKAIKSIQNQDFKDLEIIVSDDNSSDDTKSVVQNLQKDDDRIKYFLNQNYKQGPNGNKNNGLD
QASGEFVTFLDDDDELLSGALSTLMQKANEGYAHVFGNCLIEKEGNLSKEFSGKGLEKDSEISKKDFLMAKFSGEFFSVF
KKSLLENKRFNEEFYGNEATLWVNLYKEKSFYIHKAFRIYRIFRQDSVTLGASKNAYRVYLGYLELAKILENELRMSKDK
DYKKTCASYYKMAAYYAKLAKNYKALYKCLFKSLSIKINAPALILLILSIIPNNMIEKLSKIRVALCKN
>Q0P9C7 2.4.1.291~~~pglJ~~~N-acetylgalactosamine-N,N'-diacetylbacillosaminyl-diphospho-undecaprenol 4-alpha-N-acetylgalactosaminyltransferase~~~COG0438
MQKLGIFIYSLGSGGAERVVATLLPILSLKFEVHLILMNDKISYEIPECQIHFLECSKPSENPILKFLKLPFLALKYKKL
CRNLGIDTEFVFLNRPNYIALMARMFGNKTRLVINECTTPSVMYMKNNFNSLVNKFLISLLYPKADLILPNSKGNLEDLV
QNFSISPKKCEILYNAIDLENIGQKALEDIALKDKFILSVGRLDKGKNHALLIRAYARLKTDLKLVILGEGVLKDELLAL
IKELNLEEKVLLLGFDNNPYKYMAKCEFFAFASVFEGFSNVLIESLACSCAVVCTDHKSGARELFGDDEFGLLVEVDNEN
SMFQGLKTMLEDDKLRKAYKNKAKTRAKAFDKVKIARDALKYLLG
>Q0P9C4 7.5.2.5~~~pglK~~~Protein glycosylation K~~~COG1132
MLKKLFFILSKEDKNFLFFLLVFSVFISFIETFAISLVMPFITLASDFSYFDRNKYLISLKEYLNIPVFEIIVYFGVGLI
VFYVFRALLNAYYFHLLARFSKGRYHAIAYKVFSKFLNINYEKFTQKNQSEILKSITGEVYNLSTMISSFLLLMSEIFVV
LLLYALMLLINYKITLFLSIFMVLNAFILVKILSPIIKKAGVRREEAMKNFFEILNTNLNNFKFIKLKTKEDGVLSLFKA
QSEAFSKANITNESVAAVPRIYLEGIGFCVLVFIVVFLVLKNESDISGILSTISIFVLALYRLMPSANRIITSYHDLLYY
HSSLDIIYQNLRQEEENLGEEKLSFNQELKICNLSFGYEGKKYLFKNLNLNIKKGEKIAFIGESGCGKSTLVDLIIGLLK
PKEGQILIDEQELNANNTKNYRQKIGYIPQNIYLFNDSIAKNITFGDAVDEEKLNRVIKQANLEHFIKNLPQGVQTKVGD
GGSNLSGGQKQRIAIARALYLEPEMLVLDEATSALDTQSEAKIMDEIYKISKDKTMIIIAHRLSTITQCDKVYRLEHGKL
KEEK
>P26509 3.2.1.15~~~pehA~~~Endo-polygalacturonase~~~COG5434
MEYQSGKRVLSLSLGLIGLFSASAWASDSRTVSEPKTPSSCTTLKADSSTATSTIQKALNNCDQGKAVRLSAGSTSVFLS
GPLSLPSGVSLLIDKGVTLRAVNNAKSFENAPSSCGVVDKNGKGCDAFITAVSTTNSGIYGPGTIDGQGGVKLQDKKVSW
WELAADAKVKKLKQNTPRLIQINKSKNFTLYNVSLINSPNFHVVFSDGDGFTAWKTTIKTPSTARNTDGIDPMSSKNITI
AYSNIATGDDNVAIKAYKGRAETRNISILHNDFGTGHGMSIGSETMGVYNVTVDDLKMNGTTNGLRIKSDKSAAGVVNGV
RYSNVVMKNVAKPIVIDTVYEKKEGSNVPDWSDITFKDVTSETKGVVVLNGENAKKPIEVTMKNVKLTSDSTWQIKNVNV
KK
>P20041 3.2.1.15~~~pglA~~~Polygalacturonase~~~
MNHRYTLLALAAAALSAGAHATGTSVTAPWGEVAEPSLPADSAVCKTLSASITPIKGSVDSVDGNPANSQPDASRIQSAI
DNCPAGQAVKLVKGSAGESGFLSGSLKLKSGVTLWIDTGVTLFASRNPADYDNGLGTCGTATTSNDKSCNALIVARDTAG
SGIVGAGAIDGRGGSLVTSGPNANRLTWWDIAYLNKTKGLNQQNPRLIQTYNGSAFTLYGVTVQNSPNFHIVTTGTSGVT
AWGIKIVTPSLAYAVAGYKCPSGSTPDKVTPATCFTPETVKNTDGFDPGQSTNVVLAYSYINTGDDHVAVKASSGPTRNL
LFAHNHFYYGHGLSIGSETNTGVSNMLVTDLTMDGNDSSAGNGLRIKSDASRGGKVTNIVYDGICMRNVKEPLVFDPFYS
SVKGSLYPNFTNIVVKNFHDLGSAKSIKRTMTFLGYKANKQKNPLTITLDNVVFDGTLPAFEGSHYGGPASPNGVHFTFG
GTGPVSFADAIVTSSTTDVTVTGTPGTAAAVDCSKAFVPLKSVAPTSPI
>P27644 3.2.1.15~~~pgl~~~Polygalacturonase~~~
MALATRATGGAGRRKPVRARCARGLHLVSCHKTQLLGFTIRNAASWTIHPQGCEDLTAAASTIIAPHDSPNTDGFNPESC
RNVMISGVRFSVGDDCIAVKAGKRGPDGEDDHLAETRGITVRHCLMQPGHGGLVIGSEMSGGVHDVTVEDCDMIGTDRGL
RLKTGARSGGGMVGNITMRRVLLDGVQTALSANAHYHCDADGHDDWVQSRNPAPVNDGTPFVDGITVEDVEIRNLAHAAG
VFLGLPDVPSATSLSATSPIVSHDPSAVATPPIMADRVRPMRMRLVFEQADVVCDDPALLNDAPVSISSYFD
>O86560 ~~~pglW~~~Probable kinase PglW~~~COG0515
MREGRWVTVTESEFEHERRGLEAIRQKLPDGDPWRAWSNFTFTANTGHVREVDLLVVAPGGLCMVELKDWHGSVTSENGT
WVQTTPGGRRRTHGNPLHLVNRKAKELAGLLAQPGAKRVWVAEAVCFTDNGLRVRLPAHDQNGVYTVDELVDMLKQAPSD
ERRRVTAIGSREVAAALKNIGIRKSDAQYKVGPYELERKSFDSGPTWADYLARHSDLPEAARVRIYLSERGSDASLRQSV
ENAARREAAVLGRFKHPGAVQLKQYFPSGHAAGPALIFDYHPHTQKLDEYLVQYGEKLDILGRMALVRQLAETVRSAHAS
RIHHRALAARSVLVVPRSRGGKGRAVGEEAAWLTPQLQISDWQIATQRSGDSSQGQGMTRFAPTALSAMHLADDADAYLA
PELTALNPDPVYLDVYGLGVLTYLLVTGKAPAASQAELLARLEAGEGLRPSSLVDGLSEDVDELVQAATAYRPGQRLSSV
DEFLELLEVVEDSLTAPAAALDGPAEDETGASADKDPLEVVAGDLLAGRWEVRRRLGTGSTSRAFLVRDLEAETRRTRPL
AVLKVALSDSRGEILVREAEAMRRLRPHSGIIRLAEPEPLHIGGRTVLALEYVGDERDDDGPGAEGATRPRRREETVARQ
LREHGRLPVDQLEAYGDYLFGAVDFLEGEGIWHRDIKPDNIAVRIRPNRTRELVLIDFSLAGYPAKNTDAGTDGYLDPFV
DVITRGSYDSHAERYAVAVTLHQMASGELPKWGDGSVLPRMTDPKEWPYPTIAAEAFDPAVRDGLVAFFQKALHRDAGKR
FPELKPMRDAWRKVFLDASQTVPSSHRTRPAAPADGAAPAEGAAAGIADAEPETAEQQRDRLAAEVTRDTPLTVSGLTPA
AQSFLYGLGITTVGELLDYSRRKLVNAPGLGAKTRNEVQQRQREWGERLREAPVSPLTPKGRAEAKEELEQLTAAESALV
GQLATGESAGALSARTLRSVSLDTLATVLVPAVNNNGSNRNKAEMVRLLLRLPDEHGVLPGIGVWPKQKDVADALGLSHG
RIPQMLKDERKRWKAEPAVQALRDEIIELLASMGRVASAVEIADALAVRRGTHLAGREQRRAMALAAVRAVVEVEQLVPQ
EVEFQHQPNRKATDESLGAGLLALDVREDDAPDTPTAPGLLDYATRLGKTADRLARLDTLPTAATVLAELGALTVPPGAV
DWDERRMVELAAAASVNAAATPRLEIYPRDLSLVRALRLTQAGLVRWIPGVPEGRQPGLTGEDVHERVRARFPELVVPDG
RGGTAHELPTAGPLTKALRDAGFELSLSMREDTGTLRYLPTRVDEASSYLTTGAWRQSTRTGTVTRYADDPQLAGAVRAE
ERLLASAHRDGYRVLTVRQQLVRDAVRELGAERLGGQAVSVTELFLEALHGQVTPGTKPTWETLLKADAAEPGSKGAVRF
AEYARTAWGSVEPRIAELLGDGGGGAGPVLLTEAGVFARYDAMGVLDRLASAARRGGRGLWLLVPQSDPSREPRLGQVAV
PYQAGLGEWIQLPDTWVGNRHRGSGEVVASGVEGDAK
>P0DUF3 2.1.1.72~~~pglX~~~Adenine-specific methyltransferase PglX~~~
MNKTAIKNFAVSARMKLIDAVKQKAYELGITETEIKEPETYEDGFRINNKFFRSYELEQRKKLIQKIEEKDYKQVIEEVS
YTWFNRFIAIRFMEVNEYLPTGVRVLSSTQEGKNEPDVIGEVTNIAEDLDLNLDIVYRLQDENKTEDLFKYILIKQCNKL
GEIMPMMFETIQDYTELLLPDNLLGEGSVVRNLVSMIDEEDWKEQVEIIGWLYQYYISEKKDEVFADLKKNKKITKENIP
AATQLFTPKWIVKYMVENSLGRLWLESHPNEDVQQQWKYYLEEAEQEPNVQEQLEKLKNNELSPEDITVLDPCMGSGHIL
VYAFDVLYNIYQHAGYSEREIPQLILEKNLYGLDIDDRAAQLAYFALMMKARSYNRRIFRRPLELNVCSIQESNGIPQEA
IDYFVGDSLDRDEITYLIRVFEDSKEYGSILEVLSIDFVSIEKRMIEIQEMQEVKDIFSLQYSDILLIKIPQLIKQAKIM
SSKYHVVLTNPPYMGSKGMNEQLKKYCTKFFKKGKRDLFAVLMLKCFNFTKEGGYISNINQQSWMFLSSYEEIRKYFLSE
STICSMIHLGSGSFEEINGEVVQSTSFVLKKQVVKNYQTPFIRLVEYNDSQEKKLSFLEKKERYIVAADLFLNVPGYQIA
YWATQKMLVAFSENKVLGDLYDPRQGLATGDNDRFLRLWHEVDKVRINWGALSISDAHESKIKWFPHNKGGRFRKWYGNN
EYLISFNIESYNTLAKQGNCLPSKQLYFKSGITWSRTTSNLLGVRLHNQGTIFDCEGCFLSAGNTRDDYYLMAFLNSNVA
FVFLEKINPTLHFQVGDVRKIPLMTFNNLLEAKMDLITSCISISKRDWDSFETSWGFIRHPMLTHKNSSRRLSEVLDYWS
VFAEQQFNQLKANEEELNRIFIEIYDLQDDLTPEVKEKDIVIRKADKERDIKSFISYAVGCMLGRYSLDQEGLVFAGGEF
DESKYKTFKADTDNIIPITDDEYFEDDIVSRFIEFVRVTFSEETLEENLDFIAEALNKKANETSRQCIRRYFLKDFFKDH
VKMYQKRPIYWLFDSGKNDGFKALVYMHRYDVGTVAEVRTDYLHTLQRKYEAEIARRDVLLESDASTKDKTRAKKEKEKL
QKQLLECQQYDQVIAHIANQKITIDLDDGVKVNYAKFQNVELPQGEGKKPLKANLLAKI
>P0DUF9 2.1.1.72~~~pglX~~~Adenine-specific methyltransferase BrxX~~~
MNTNNIKKYAPQARNDFRDAVIQKLTTLGIAADKKGNLQIAEAETIGETVRYGQFDYPLSTLPRRERLVKRAREQGFEVL
VEHCAYTWFNRLCAIRYMELHGYLDHGFRMLSHPETPTAFEVLDHVPEVAEALLPESKAQLVEMKLSGNQDEALYRELLL
GQCHALHHAMPFLFEAVDDEAELLLPDNLTRTDSILRGLVDDIPEEDWEQVEVIGWLYQFYISEKKDAVIGKVVKSEDIP
AATQLFTPNWIVQYLVQNSVGRQWLQTYPDSPLKDKMEYYIEPAEQTPEVQAQLAAITPASIEPESIKVLDPACGSGHIL
TEAYNVLKAIYEERGYRTRDIPQLILENNIFGLDIDDRAAQLSGFAMLMLARQDDRRILGRGVRLNIVSLQESKLDIAEV
WTKLNFHQHMQRGSMGDMFTQGTALANTDSAEYKLLMRTLALFTSAKTLGSLIQVPQEDEAALKAFLERLYRLAVEGDIQ
QKEAAAELIPYIQQAWILAQRYDAVVANPPYMGGKGMNGDLKEFAKKQFPDSKSDLFAMFMQHAFSLLKENGFNAQVNMQ
SWMFLSSYEALRGWLLDNKTFITMAHLGARAFGQISGEVVQTTAWVIKNNHSGFYKPVFFRLVDDNEEHKKNNLLNRMNC
FKNTLQNDFKKIPGSPIAYWATLAFINSFLKLPALGTRAVKGLDTNGSIDVFLRRWPEVSINSFDALGKGNSKWFPIAKG
GELRKWFGNHEYIINYENDGIELRKNKANLRNKDMYFQEGGTWTVVSTTGFSMRYMPKGFLFDQGGSAVFCENNDELSIY
NILACMNSKYINYSASLICPTLNFTTGDVRKFPVIKNNHLEDLAKKAIEISKADWNQFETSWEFSKNKLIEHKGNVAYSY
ASYCNFQDKLYEQLVNIEKNINNIIEEILGFKIETTENSELITLNSNKIYRYGQSETNDTFLNRHRSDTISELISYSVGC
QMGRYSLDREGLVYAHEGNKGFAELAAEGAYKTFPADNDGILPLMDDEWFEDDVTSRVKEFVRTVWGEEHLQENLEFIAE
SLCLYAIKPKKGESALETIRRYLSTQFWKDHMKMYKKRPIYWLFSSGKEKAFECLVYLHRYNDATLSRMRTEYVVPLLAR
YQANIDRLNDQLDEASGGEATRLKRERDSLIKKFSELRSYDDRLRHYADMRISIDLDDGVKVNYGKFGDLLADVKAITGN
APEAI
>Q8CJM2 2.1.1.72~~~pglX~~~Adenine-specific methyltransferase PglX~~~COG1002
MIDRKALLNDLKQQVKAVEADLGKQVKALDEVGARLRAEYDQARKLGRTAATWNSWLDERVTQVAVAWVLGTVFVRFCED
NRLIPEPYVTGPDNYRRDLAETRYDVYVEADDDPTYRGWLRRAFAELGDGQAGRLLFDSDHNPLYQIPLSHDGARELVEF
WRQRDEEGALVHDFTDPLSADGTEGWGTRFLGDLYQDLSEAARKTYALLQTPEFVEEFILDRTMNPAVREFGYEELKMID
PTCGSGHFVLGAFRRLVRLGGENQPGKDVHQRVRAALDSVHGVDINPFAVAIARFRLLVAAMAASGVRTLDEASKYEWPV
HLAVGDSLIKSGSQQGSLFGESDDDLTDELAEFKYATEDVGEHPEMLRPGRYHVVVGNPPYITVKDKSLNALYRELYPAC
AGKYALSVPFAQRFFELAKREDAEGSGYGMVGQITANSFMKREFGTKLIEGYFGHAVELTEVIDTSGAYIPGHGTPTVIL
VGTRRGGDGRSPVIRTVRSVQGEPVAPENAEEGLVWRAIVEQIDKPGSVSQWVSVDDLDREKYFSKQPWVLADGGQEMLE
QINAASHAILKRDLHRIGFYGIMGADDAMSAVPRTFRRNNAESEYVRRLVVGDEVRDFRIADGDDAFHPYGSQRDLVGPD
AFPNLAAWLWPYRTELGGRATFSGGTYFADGRPWWEWHQLPKDVGAHAWSLNFAFVATHNHVVLDRSGCAFTRTAPVIKL
REGASEEEHLRLLGLLNSSTAGFWLKMVSYPKGGDPVGDEGARVSVHPWSDRYEFTGTKLQEFPLPSEYPTGLGTALDAL
AQRLAAASPAAVAAEAVPIAGLLREARTRWEAIRSRMIALQEEMDWQVYSLYKLHSEDLRVSEDPDDTNIPELTLGGRAF
EIVLARRVAAGEASDEWFKRHNSTPITEIPAHWPAPYREIVQKRIDAIESNRAIGMVERPEYKRRWATEGWDALQEKALR
SWLLDRMENRDLWCDENGQPTILTLARLTDALSRDEDFASVAKLYAPRKELAKVVAELITDEHVPFLSALRYKPSGLKKR
ADWEEVWDLQRKEDAAPDEPAKRKIRDSIPVPPKYTSADFLRPSYWKARGKLDVPKERFVSYGQTNAATPELYGWAGWDH
REQAQALATYFTNTALSTKEITPFLAGLLELQPWLSQWHNEFDMLYSGSPADFFAGYRQQKQGEHGLTDDDLRGWRPPAA
TRRRRAAAKQ
>O86682 ~~~pglY~~~ATPase PglY~~~COG1483
MAQPPLLRDVIDIKESISTSDFVLSLAEATTPAGAQHALRDYVVTERLLENFDEALALIKSSLDGHRSKAAYLHGSFGSG
KSHFMAVLYALLSGDQAARARTEFDPVLTKHQWLSTDGKKFLLVPYHMLGAKALEQRVLGGYVTHVKKLCPEAPTPQVYR
TDSLFADIRAMRANMGDEAVIRALGTSGADDAEEDEWGEGFAWTPQLLDTALAAEESHEAGVHLNLTNPSTPAELRAKLV
NDAGTNLLPGFTQNAAEDEHGFISLDAGLSVIAEHAKSLGYDGLILFMDELILWLATLIHDQKFVAREASKITNFVEGGD
ARRAIPVMSFIARQRDLRELVGEEVSGAAESSIQDTLNLASGRFDKITLEDRKLPQIAHARLLKPKDAEAAQQVDAAFEQ
TKRVGPQVWDTLLGSEKGTTGADAESFRLTYPFSPAFMDTLVHISSALQRSRTGLKLMGQLLADHRDELRLGQLVPVGDL
YPVIAQGGDKPFTDSLKVVFEAADKLYKTKLRPYLLSSYDITEDDVEQYRNRPESLTDPKKLNGCRMFTGDNRLVCTLLL
SALAPSVPALSELTIRRLGALNHGSVLAPIPGAEVGIIKNKVAEWAARFPEIKETGTDANPGVRLELSGVDVDSVIANAQ
VNDNPGNRVALARRLLSEELGVEHGQLSDQLGFTWRGTARTAEIVFGNVADEDELPDHDLMPQEEGRWRIAIDLPFDEGE
WGPVEDVNRVQRLRERQQGERSRTIAWLPAHLSATRFADFRRLVVIDKALADEHRFDTQYAGHLNADNRSRAKGLLETQR
EALLKQAKGAFKQAYGLAQKQAADVVPDFDDHLVALPDVDGLTLSFGQSLHDGIRHVAGKLLTHQYPAHPDLDPDATGTA
VKPADTKKVFAHVRAAAEARDGRIEVPAADRKLMQRIAGPLRLGQQKEAYFELSRYWGDHFRQLARSQGVTGDLSLITLT
DWTDRPDPRGLPDFLARLVVAAFAEMDDRVWVRGGTVLDPAPELAAIKDHDALRSQPLPAESDWDTARQRFETVFGAKPP
ALRRGRMVNQFARQIIEAARDYRDHAADLVHQLEAHASFLGLDQTADTGRLALARRSLQLLDALTAEAGKGAAGAKKTVE
ALASFDLGETSADRYGTSIKKARAVAEAVASAPWSTLELAAGLGPEGEALLDSLRNVARDDQRTADLRDALARTQREVVA
LIKRTQAAATPPPAPAASQPTAGDLSLDTPTSDPRIPYTSQETPTSSGGAGTARTSGGRRTTAARAVTDLQAELSDLAVR
HPEATIEITWRVVE
>P0DUF4 ~~~pglZ~~~Alkaline phosphatase-like protein PglZ~~~
MNTKEINRVLQDTFNKELTEGKKRHIVFWYDEAGEFIEDIDELNLEGVRIWKLTPHNMFATKLEVEKNDVNSNFLIYANM
AKPSAREDWLLDVYKYSQEFATDKMTVMMRELGITNDAALRDVFKKYTKFFKSKEREALFKSFSVPEYTEEQIDLVVLAS
LCKCNIVNLDEVIKALFREQLKETNKYWENIRKFGNEETFWNLVEKTYGYNLQDKSINSLLIFFLLTNVSETLSGDIPKT
WQPYISAIPMNAIVFMNQFMNHSADEVIYNELANTVEKQVKVTEHLQSKEIKDYITSDTFCCFDTNIITYITKQLMNAIH
DYTSYIEIIAARRKLHWFSVFRNEYEALYQAIQLFQQIYEMGNAITESQPFDLFKAYESKYHNIDTAYRKFYVAFDQIED
KDGFRALRDKVENIYTNVYINDLAIKWSDALEGEQEEYWPIAGLESQHTFYRSFVQPFVNKEERVFVIISDALRYEVAKE
LSNMLNVERKASTDIVAMQGVLPSYTDLGMATLLPYKSITFNENAEVYVNDYKASSTENRATILSKHYKDSTAIQYKDLA
AMNRQQFRDVFSGKKVSYIYHNVIDARGDHAATEHEVFHAVEQTLKDIRSLVDQLINTVSASNIVITADHGFIYNRDTLQ
ASDKVKKDFLNTDIEKRRFIISSESNSIEGTMNFSMDYVLGEGSGKYVKVPRGANRFAVQGTGANYVHGGAMLQEIVVPV
IKFKNDRSKSSKNDVRKVEVKLTSLTRKITNSITYLEFFQTEKIEGKKTPLRLKVYFTDEEGNRISNENIIIADSQSSKP
EDRTFKEKFVLKSMTYDKTKKYYLVLEDEEEAVENIYEKVAFPIDIAITNDFGF
>P0DUG0 ~~~pglZ~~~Alkaline phosphatase-like protein BrxZ~~~
MQNQDFIAGLKAKFAEHRIVFWHDPDKRFIEELEQLKLENVTLINMTHASQLAVKKRIEMDEPEQQFLLWFPHDAPPHEQ
DWLLDIRLYSSEFHADFAAITLNTLGIPQLGLREHIQRRKAFFSTKRTQALKNLVTEQEDETSLDKKMIAVIAGAKTAKT
EDILFNLITQYVNQQTEDDSELENTQAMLKRHGLDPVLWEMLNHEMGYQAEEPSLENLLLKLFCTDLSAQADRQQRAWLE
KNVLLTPSGRASALAFMVTWRADRRYKEAYDYCAQQMQAALRPEDHYRLSSPYDLHECETTLSIEQTIIHALVTQLLEES
TTLDREAFKKLLSERQSKYWCQTQPEYYAIYDALRQAERLLNLRNRHIDGFHYQDSATFWKAYCEELFRFDQAYRLFNEY
ALLVHSKGAMILKSLDDYIEALYSNWYLAELSRNWNEVLEAENRMQAWQIPGVPRQKNFFNEVVKPQFQNPQIKRVFVII
SDALRYEVAEELGNQINTEKRFTAELRSQLGVLPSYTQLGMAALLPHEQLCYQPSNGDIVYADGLSTSGIPNRDTILKNY
KGMAIKSKDLLELKNQEGRDLIRDYEVVYIWHNTIDATGDTASTEDKTFEACRTAVTELKDLVTKVINRLHGTRIFVTAD
HGFLFQQQALSVQDKTTLQIKPENTIKNHKRFIIGHQLPADDFCWKGKVADTAGVSDNSEFLIPKGIQRFHFSGGARFVH
GGTMLQEVCVPVLQIKALQKTAAEKQPQRRPVDIVAYHPMIKLVNNIDKVSLLQTHPVGELYEPRILNIYIVDNANNVVS
GKERISFDSDNSTMEKRVREVTLKLIGANFNRRNEYWLILEDAQTETGYQKYPVIIDLAFQDDFF
>O86683 ~~~pglZ~~~Alkaline phosphatase-like protein PglZ~~~COG1524
MTDTTVAVPGAVRLNTATVTQYLSSQSSLVASLTGDGGGRRRAVLLRSAPQWDGPAEPAWGEGRTAGIAVAPSPLAVHEL
VLDHLAARRPGPPVLVVLTDREQHELDPAILARVHKLRIDTVDGWDVVREAFGARQIDPRLKDVNWAAEALLDATPPGSW
PAVPGGWLSRQYALTALAQRRLRLGRYDTEGGPRRPGDDRLDAQALLHWSTRPGAPERLLTLRGPERAGLTAFLGEEDQA
GLAGRTLLALVDAERGEDAAAFGLVCAALWQHAEPAPETYRARGRAERYFGDRPPATGDQLDALVTVFGRATEEHVTTLL
AAGHRTAGTDADQAREARRTTGTVLDRAAALARQFGAEEAVAASPVLRGGLEARFTAVGRALAAGDTTAVADAVRRLENH
RLAAEPEESARIERARMGQRLARWLATDPPTDALTVADALRRHVAETGWADLALEHIEAGGDQGPVLKAAYDTLGTRVRD
RRRQIDASFARSLAAWTQSGTQPGSMLTVETFLDRVVGPLVRRGEERRTLMLVLDGMSAAIANELGEELRRSWAEFDPLP
EGDTPYRRAMAAALPTVTAVSRTSLFAGTLTKGTQADEKRLFPALKLWGGAPAAVFHKDDLRTETAGETFGPALTEALAD
GRTHVAVVLNAIDDRLAKEQKLGDGAWRIDDVPGLRDLLRVAATQGMAVVLTSDHGHVVDRHGTKVDPAAAPESARHRLI
GGGPLAEREITLSGPRVIWPEPGASIVALWDADSRYTALKAGYHGGASLAEVTIPALAFLPFGAEPPKGWRELGDQRPVW
WAPEETGKAPLPDEYTARPVAATASAPKKPTAKAKKDQAEVARMHHGALFDVALTTEGDDALLTPTVVSRTETLVTALLD
SETYQAQLGGLARKPQQEQVHKALTTLLDSGGTLPVTALAQRVGMPVTRGVGFAAVLGQLLNYDGVQVLETLPDGRTLRL
HAALLREQFALGAG
>O06995 5.4.2.6~~~yvdM~~~Beta-phosphoglucomutase~~~COG0637
MKAVIFDLDGVITDTAEYHFLAWKHIAEQIDIPFDRDMNERLKGISREESLESILIFGGAETKYTNAEKQELMHRKNRDY
QMLISKLTPEDLLPGIGRLLCQLKNENIKIGLASSSRNAPKILRRLAIIDDFHAIVDPTTLAKGKPDPDIFLTAAAMLDV
SPADCAAIEDAEAGISAIKSAGMFAVGVGQGQPMLGADLVVRQTSDLTLELLHEEWEQYRIRESIP
>P77366 5.4.2.6~~~ycjU~~~Beta-phosphoglucomutase~~~COG0637
MKLQGVIFDLDGVITDTAHLHFQAWQQIAAEIGISIDAQFNESLKGISRDESLRRILQHGGKEGDFNSQERAQLAYRKNL
LYVHSLRELTVNAVLPGIRSLLADLRAQQISVGLASVSLNAPTILAALELREFFTFCADASQLKNSKPDPEIFLAACAGL
GVPPQACIGIEDAQAGIDAINASGMRSVGIGAGLTGAQLLLPSTESLTWPRLSAFWQNV
>P71447 5.4.2.6~~~pgmB~~~Beta-phosphoglucomutase~~~COG0637
MFKAVLFDLDGVITDTAEYHFRAWKALAEEIGINGVDRQFNEQLKGVSREDSLQKILDLADKKVSAEEFKELAKRKNDNY
VKMIQDVSPADVYPGILQLLKDLRSNKIKIALASASKNGPFLLEKMNLTGYFDAIADPAEVAASKPAPDIFIAAAHAVGV
APSESIGLEDSQAGIQAIKDSGALPIGVGRPEDLGDDIVIVPDTSYYTLEFLKEVWLQKQK
>P36938 5.4.2.2~~~pgm~~~Phosphoglucomutase~~~COG0033
MAIHNRAGQPAQQSDLINVAQLTAQYYVLKPEAGNAEHAVKFGTSGHRGSAARHSFNEPHILAIAQAIAEERAKNGITGP
CYVGKDTHALSEPAFISVLEVLAANGVDVIVQENNGFTPTPAVSNAILVHNKKGGPLADGIVITPSHNPPEDGGIKYNPP
NGGPADTNVTKVVEDRANALLADGLKGVKRISLDEAMASGHVKEQDLVQPFVEGLADIVDMAAIQKAGLTLGVDPLGGSG
IEYWKRIGEYYNLNLTIVNDQVDQTFRFMHLDKDGAIRMDCSSECAMAGLLALRDKFDLAFANDPDYDRHGIVTPAGLMN
PNHYLAVAINYLFQHRPQWGKDVAVGKTLVSSAMIDRVVNDLGRKLVEVPVGFKWFVDGLFDGSFGFGGEESAGASFLRF
DGTPWSTDKDGIIMCLLAAEITAVTGKNPQEHYNELAKRFGAPSYNRLQAAATSAQKAALSKLSPEMVSASTLAGDPITA
RLTAAPGNGASIGGLKVMTDNGWFAARPSGTEDAYKIYCESFLGEEHRKQIEKEAVEIVSEVLKNA
>P39671 5.4.2.2~~~pgm~~~Phosphoglucomutase~~~COG0033
MIKTIKTTPYQDQKPGTSGLRKKVPVFAQENYAENFIQSIFDALEGFEGQTLVIGGDGRYYNREVIQKAIKMAAAAGFGK
VLVGQGGILSTPAASNVIRKYKAFGGIVLSASHNPGGPTEDFGIKYNIGNGGPAPEKITDAIYARSKVIDSYKISDAADI
DLDKIGSFKVDELTVDVIDPVADYAALMEELFDFGAIRSLIAGGFKVVVDSMSAVTGPYAVEILEKRLGAPKGSVRNATP
LPDFGGHHPDPNLVHAKELYDDVMSPEGPDFGAASDGDGDRNMVVGKGMFVTPSDSLAIIAANAKLAPGYAAGISGIARS
MPTSAAADRVAEKLGLGMYETPTGWKFFGNLMDAGKVTICGEESFGTGSNHVREKDGLWAVLYWLNIVAARKESVKDIVT
KHWAEYGRNYYSRHDYEEVDSDAANTLVAILREKLATLPGTSYGNLKVAAADDFAYHDPVDQSVSKNQGIRILFEGGSRI
VLRLSGTGTAGATLRLYVERYEPDAARHGIETQSALADLISVADTIAGIKAHTADSEPTVIT
>P56601 1.3.3.4~~~pgoX~~~Protoporphyrinogen oxidase~~~
MHHMPRTTGMNVAVVGGGISGLAVAHHLRSRGTDAVLLESSARLGGAVGTHALAGYLVEQGPNSFLDREPATRALAAALN
LEGRIRAADPAAKRRYVYTRGRLRSVPASPPAFLASDILPLGARLRVAGELFSRRAPEGVDESLAAFGRRHLGHRATQVL
LDAVQTGIYAGDVEQLSVAATFPMLVKMEREHRSLILGAIRAQKAQRQAALPAGTAPKLSGALSTFDGGLQVLIDALAAS
LGDAAHVGARVEGLAREDGGWRLIIEEHGRRAELSVAQVVLAAPAHATAKLLRPLDDALAALVAGIAYAPIAVVHLGFDA
GTLPAPDGFGFLVPAEEQRRMLGAIHASTTFPFRAEGGRVLYSCMVGGARQPGLVEQDEDALAALAREELKALAGVTARP
SFTRVFRWPLGIPQYNLGHLERVAAIDAALQRLPGLHLIGNAYKGVGLNDCIRNAAQLADALVAGNTSHAP
>P18200 3.1.3.27~~~pgpA~~~Phosphatidylglycerophosphatase A~~~COG1267
MTILPRHKDVAKSRLKMSNPWHLLAVGFGSGLSPIVPGTMGSLAAIPFWYLMTFLPWQLYSLVVMLGICIGVYLCHQTAK
DMGVHDHGSIVWDEFIGMWITLMALPTNDWQWVAAGFVIFRILDMWKPWPIRWFDRNVHGGMGIMIDDIVAGVISAGILY
FIGHHWPLGILS
>O34349 3.1.3.27~~~pgpB~~~Phosphatidylglycerophosphatase B~~~COG0671
MYKPVSLFLFFLILAAAIHTNAVQSADEAISKAAVLIRQPWLNEVMTGITHLGASSFLLPLIVIIGAGMFFYRKTWDGLL
MLLVFGTDRLLNKVLKEWIERVRPDFAPLVHESSFSFPSGHSMNAACVYPVIAYFLVKHLPFLSKHKKMVYIIAGVIAVL
VGISRVYLGVHFVTDVLGGFSLGLLLFFLVKGFDEKIKRFRQK
>P0A924 3.1.3.27~~~pgpB~~~Phosphatidylglycerophosphatase B~~~COG0671
MRSIARRTAVGAALLLVMPVAVWISGWRWQPGEQSWLLKAAFWVTETVTQPWGVITHLILFGWFLWCLRFRIKAAFVLFA
ILAAAILVGQGVKSWIKDKVQEPRPFVIWLEKTHHIPVDEFYTLKRAERGNLVKEQLAEEKNIPQYLRSHWQKETGFAFP
SGHTMFAASWALLAVGLLWPRRRTLTIAILLVWATGVMGSRLLLGMHWPRDLVVATLISWALVAVATWLAQRICGPLTPP
AEENREIAQREQES
>P0AD42 3.1.3.27~~~pgpC~~~Phosphatidylglycerophosphatase C~~~COG0560
MATHERRVVFFDLDGTLHQQDMFGSFLRYLLRRQPLNALLVLPLLPIIAIALLIKGRAARWPMSLLLWGCTFGHSEARLQ
TLQADFVRWFRDNVTAFPLVQERLTTYLLSSDADIWLITGSPQPLVEAVYFDTPWLPRVNLIASQIQRGYGGWVLTMRCL
GHEKVAQLERKIGTPLRLYSGYSDSNQDNPLLYFCQHRWRVTPRGELQQLE
>A0A0H3GGY3 3.1.4.59~~~pgpH~~~Cyclic-di-AMP phosphodiesterase PgpH~~~
MKLAKKWRDWYIESGKKYLFPLLLVCFAVIAYFLVCQMTKPESYNVKLFQVAEKTIRSPQTVEDTEKTKEERTKASDAVE
DVYVYNRETGQNRVALIQSLFAYVNEVNAEAQEKDTKNKEKAKKENKPAPAPTSTEDKLKNLKDKLSSNVSEKITSNISD
EVFTTLIEAKSKDFNVMEDVVTTEVEKSMENKIRDENLNSVKIRARDDIELSAIPAYYKNVSKALVSYAIVPNEVYDEEQ
TDARRKEAAQSVVPVKILQGQVIVQEGQIVDRETYRQLKMLHLLDQKMPVKQYAGFAIFIIALAAILFLYTKKQTQPKAK
KMQTMLIFSSVYLVSLFMLFIILFLETQNIANIAFLFPAAFAPMILKILLNEKYAFLSVIFIAVTSLLTFQNDATSGITI
FILLSGATSVVMLRDYSRRSAIMLSGFMVGLINMIYVLLLLLINNSTLLQVSTLMALGYAFLGGFGAFILGVGVIPLFET
IFGLLTTSRLVELANPNHPLLKKILMKAPGTYHHSMMVANLAEACADKIGANSLLVRVGCFYHDIGKTLRPPYFVENQLQ
GINPHDRLTPEQSRDIILSHTKDGAEILKENHMPQPIIDIALQHHGTTLLKYFYFKAKETNPDVKEADYRYSGPKPQTKE
IAIINISDSVEAAVRSSTEPTMAKITEIIDGIIKDRFLDGQFTECDITIQEIKIIRDTLIATLNGIYHQRIQYPDDKD
>P77333 ~~~pgrR~~~HTH-type transcriptional regulator PgrR~~~COG0583
MKREEIADLMAFVVVAEERSFTRAAARLSMAQSALSQIVRRIEERLGLRLLTRTTRSVVPTEAGEHLLSVLGPMLHDIDS
AMASLSDLQNRPSGTIRITTVEHAAKTILLPAMRTFLKSHPEIDIQLTIDYGLTDVVSERFDAGVRLGGEMDKDMIAIRI
GPDIPMAIVGSPDYFSRRSVPTSVSQLIDHQAINLYLPTSGTANRWRLIRGGREVRVRMEGQLLLNTIDLIIDAAIDGHG
LAYLPYDQVERAIKEKKLIRVLDKFTPDLPGYHLYYPHRRHAGSAFSLFIDRLKYKGAV
>P0ABF8 2.7.8.5~~~pgsA~~~CDP-diacylglycerol--glycerol-3-phosphate 3-phosphatidyltransferase~~~COG0558
MQFNIPTLLTLFRVILIPFFVLVFYLPVTWSPFAAALIFCVAAVTDWFDGFLARRWNQSTRFGAFLDPVADKVLVAIAMV
LVTEHYHSWWVTLPAATMIAREIIISALREWMAELGKRSSVAVSWIGKVKTTAQMVALAWLLWRPNIWVEYAGIALFFVA
AVLTLWSMLQYLSAARADLLDQ
>P63756 2.7.8.5~~~pgsA~~~CDP-diacylglycerol--glycerol-3-phosphate 3-phosphatidyltransferase~~~
MNIPNQITVFRVVLIPVFILFALVDFGFGNVSFLGGYEIRIELLISGFIFILASLSDFVDGYLARKWNLVTNMGKFLDPL
ADKLLVASALIVLVQLGLTNSVVAIIIIAREFAVTGLRLLQIEQGFVSAAGQLGKIKTAVTMVAITWLLLGDPLATLIGL
SLGQILLYIGVIFTILSGIEYFYKGRDVFKQK
>P37433 2.7.13.3~~~pgtB~~~Phosphoglycerate transport system sensor protein PgtB~~~
MKGRLLQRLRQLSISNSLRGAFLTGALLTLIVSMVSLYSWHEQSSQVRYSLDEYFPRIHSAFLIEGNLNLAVDQLNEFLL
APNTTVRLQLRTQIIQHLDKIERLSQGLQLAERRQLAVILQDSRTLLAELDNALYNMFLVREKVSELSARIDWLHDDFTT
ELNSLVQDFTWQQGTLLDQIEANQGDAAQYLQRSREVQNEQQQVYTLARIENQIVDDLRDRLNELKSGNNDGMLVETHIR
YLENLKKTADENIRALDDWPSTITLRQTIDELLEIGMVKNKMPDTMRDYVAAQKALLDASRAREATLGRFRTLLEAQLGS
SHQQMQTFNQRLEQIVRVSGGLILVATLLALLLAWGLNHYFIRSRLVKRFTALNQAVVQIGLGRTDSTIPVYGRDELGRI
ARLLRHTLGQLNMQRRQLEQEVAERKEIEADLRAMQDELIQTAKLAVVGQTMTTLAHEINQPLNALSMYLFTAGRAIEQG
QSGQARNTLTKAEGLINRIDAIIRSLRQFTRRAELETPLYPVDLRQTFVAAWELLAMRHQSRQGALSLPTDTVWVSGDEV
RIQQVLVNVLANALDACSHDAVIAVTWQTQGEALEVYIADNGPGWPVALLPSLLKPFTTSKAVGLGIGLSISVSLMAQMK
GDLRLASTLTRNACVVLQFSVTDVDDVE
>P0A233 ~~~pgtC~~~Phosphoglycerate transport regulatory protein PgtC~~~
MFGSCQAYSRELVMATTFSPSATAWIIQRWQTEPGSVMIRTLNRTSGSLEQLLDTANAENVDLILTSSPMLLQHLQEHQK
LALLDSAPAASQKLVPRSIRSTSVAVAVSGFGLLINRSALAARHLPPPADWQDMGLPSYQGALLMSSPSRSDTNHLMVES
LLQQKGWTAGWATLLAISGNLVTISSRSFGVADKIKSGLGVAGPVIDNYANLLLNDPNLAFTYFPYSAVSPTYVAVLKNS
RHADEARAFIHYLLSPKGQRILADANTGKYPVAPLSADNPRAAQQQRLMAQPPLNYRLILKRQQLVQRMFDTAISFRLAQ
LKDAWRALHSAETRLKRPLPEIRALLTSVPVDAASSEDETWLAQFDNKSFAEQKMMEWQIWFLNNQRLAIHKLEELK
>P80563 1.97.1.2~~~athL~~~Pyrogallol hydroxytransferase large subunit~~~
MGEVVRLTNSSTGGPVFVYVKDGKIIRMTPMDFDDAVDAPSWKIEARGKTFTPPRKTSIAPYTAGFKSMIYSDLRIPYPM
KRKSFDPNGERNPQLRGAGLSKQDPWSDYERISWDEATDIVVAEINRIKHAYGPSAILSTPSSHHMWGNVGYRHSTYFRF
MNMMGFTYADHNPDSWEGWHWGGMHMWGFSWRLGNPEQYDLLEDGLKHAEMIVFWSSDPETNSGIYAGFESNIRRQWLKD
LGVDFVFIDPHMNHTARLVADKWFSPKIGTDHALSFAIAYTWLKEDSYDKEYVAANAHGFEEWADYVLGKTDGTPKTCEW
AEEESGVPACEIRALARQWAKKNTYLAAGGLGGWGGACRASHGIEWARGMIALATMQGMGKPGSNMWSTTQGVPLDYEFY
FPGYAEGGISGDCENSAAGFKFAWRMFDGKTTFPSPSNLNTSAGQHIPRLKIPECIMGGKFQWSGKGFAGGDISHQLHQY
EYPAPGYSKIKMFWKYGGPHLGTMTATNRYAKMYTHDSLEFVVSQSIWFEGEVPFADIILPACTNFERWDISEFANCSGY
IPDNYQLCNHRVISLQAKCIEPVGESMSDYEIYRLFAKKLNIEEMFSEGKDELAWCEQYFNATDMPKYMTWDEFFKKGYF
VVPDNPNRKKTVALRWFAEGREKDTPDWGPRLNNQVCRKGLQTTTGKVEFIATSLKNFEEQGYIDEHRPSMHTYVPAWES
QKHSPLAVKYPLGMLSPHPRFSMHTMGDGKNSYMNYIKDHRVEVDGYKYWIMRVNSIDAEARGIKNGDLIRAYNDRGSVI
LAAQVTECLQPGTVHSYESCAVYDPLGTAGKSADRGGCINILTPDRYISKYACGMANNTALVEIEKWDGDKYEIY
>P12681 ~~~pgtP~~~Phosphoglycerate transporter protein~~~
MLTILKTGQSAHKVPPEKVQATYGRYRIQALLSVFLGYLAYYIVRNNFTLSTPYLKEQLDLSATQIGLLSSCMLIAYGIS
KGVMSSLADKASPKVFMACGLVLCAIVNVGLGFSSAFWIFAALVVFNGLFQGMGVGPSFITIANWFPRRERGRVGAFWNI
SHNVGGGIVAPIVGAAFAILGSEHWQSASYIVPACVAVIFALIVLVLGKGSPRKEGLPSLEQMMPEEKVVLKTKNTAKAP
ENMSAWQIFCTYVLRNKNAWYISLVDVFVYMVRFGMISWLPIYLLTVKHFSKEQMSVAFLFFEWAAIPSTLLAGWLSDKL
FKGRRMPLAMICMALIFVCLIGYWKSESLLMVTIFAAIVGCLIYVPQFLASVQTMEIVPSFAVGSAVGLRGFMSYIFGAS
LGTSLFGVMVDKLGWYGGFYLLMGGIVCCILFCYLSHRGALELERQRQNALHNQDSLQLADAQ
>P80564 1.97.1.2~~~bthL~~~Pyrogallol hydroxytransferase small subunit~~~
MEQYYMVIDVAKCQDCNNCFMGCMDEHELNEWPGYTASMQRGHRWMNIERRERGTYPRNDINYRPTPCMHCENAPCVAKG
NGAVYQREDGIVLIDPEKAKGKKELLDTCPYGVMYWNEEENVAQKCTMCAHLLDDESWAPKMPRCAHNCGSFVYEFLKTT
PEAMAKKVEEEGLEVIKPELGTKPRVYYKNLYRFEKNYVTAGILVQGDCFEGAKVVLKSGGKEVASAETNFFGEFKFDAL
DNGEYTVEIDADGKSYSDTVVIDDKSVDLGFIKL
>F5BFC8 1.14.16.7~~~pacX~~~Phenylalanine 3-hydroxylase~~~
MQGPHAQMTDAAYEIRRSEIAALSTDLAPEDPIPVVEYTEWEHEVWRTVCVDLTARHRTDAAAEYLESAEQLAVPLDHVP
QLRDVSGRLGSISGFTFQSAPALVPLREFCGGLANSVFHSTQYLRHPRSPFYTEDPDLLHDLVGHGNVLASDRFARLYRL
AGNAAARVHSTEALQFIGKVFWFTLECGVVRERGERKAYGATLVSSYGELDHFRSADFRPLDIKSLADVEYDISTYQPIL
FEADSMDEVEDTVGSFWDTCDDDSIAALLGGTSRSVTPH
>P30967 1.14.16.1~~~phhA~~~Phenylalanine-4-hydroxylase~~~COG3186
MNDRADFVVPDITTRKNVGLSHDANDFTLPQPLDRYSAEDHATWATLYQRQCKLLPGRACDEFMEGLERLEVDADRVPDF
NKLNQKLMAATGWKIVAVPGLIPDDVFFEHLANRRFPVTWWLREPHQLDYLQEPDVFHDLFGHVPLLINPVFADYLEAYG
KGGVKAKALGALPMLARLYWYTVEFGLINTPAGMRIYGAGILSSKSESIYCLDSASPNRVGFDLMRIMNTRYRIDTFQKT
YFVIDSFKQLFDATAPDFAPLYLQLADAQPWGAGDVAPDDLVLNAGDRQGWADTEDV
>P16570 ~~~apcA1~~~Allophycocyanin alpha chain 1~~~
MSIVTKSIVNADAEARYLSPGELDRIKSFVSGGERRLRIAQILTENRERLVKQAGEQVFQKRPDVVSPGGNAYGQELTAT
CLRDLDYYLRLVTYGIVSGDVTPIEEIGVIGAREMYKSLGTPIEGITEGIRALKSGASSLLSGEDAAEAGSYFDYVVGAL
S
>P80555 ~~~apcA1~~~Allophycocyanin subunit alpha 1~~~
MSIVTKSIVNADAEARYLSPGELDRIKSFVAGGQQRLRIAQALTDNRERLVKQAGDQLFQKRPDVVSPGGNAYGQEMTAT
CLRDLDYYLRLVTYGIVAGDVTPIEEIGVIGVREMYKSLGTPIEAVGEGVRALKNAASTLLSAEDAAEAGSYFDYVVGAL
Q
>P07325 ~~~apcA~~~Allophycocyanin alpha chain~~~
SIVTKAIVNADAEARYLSPGELDRIKSFVAGGASRLRIAQVLTENRERIVKQAGDQLFQKRPDVVSPGGNAYGQEMTATC
LRDLDYYLRLVTYGIVSGDVTPIEEIGIVGVREMYKSLGTPIDAVAGGVAAMKNVAATLLSAEDSSEAGSYFDYVVGAMQ
>P72504 ~~~apcA~~~Allophycocyanin alpha chain~~~
MSIVTKSIVNADAEARYLSPGELDRIKSFVTSGERRVRIAETMTGARERIIKEAGNQLFQKRPDVVSPGGNAYGEEMTAT
CLRDLDYYLRLITYGIVAGDVTPIEEIGVVGVREMYKSLGTPIEAVAEGVRAMKSVATSLLSGEDAAEAGAYFDYLIGAM
S
>P00315 ~~~apcA~~~Allophycocyanin alpha chain~~~
SIVTKSIVNADAEARYLSPGELDRIKSFVSSGEKRLRIAQILTDNRERIVKQAGDQLFQKRPDVVSPGGNAYGQEMTATC
LRDLDYYLRLITYGIVAGDVTPIEEIGIVGVREMYKSLGTPIDAVAAGVSAMKNVASSILSAEDAAEAGAYFDYVAGALA
>B1XQM2 ~~~apcA~~~Allophycocyanin alpha subunit~~~
MSIVTKSIVNADAEARYLSPGELDRIKAFVTSGESRLRIAETLTGSRERIIKSAGDALFQKRPDVVSPGGNAYGEEMTAT
CLRDMDYYLRLITYGVVAGDVTPIEEIGLVGVREMYKSLGTPVDAVAQAVREMKAVATGMMSGDDAAEAGAYFDYVIGAM
E
>Q01951 ~~~apcA~~~Allophycocyanin alpha chain~~~
MSIVTKSIVNADAEARYLSPGELDRIKAFVTGGAARLRIAETLTGSRETIVKQAGDRLFQKRPDIVSPGGNAYGEEMTAT
CLRDMDYYLRLVTYGVVSGDVTPIEEIGLVGVREMYRSLGTPIEAVAQSVREMKEVASGLMSSDDAAEASAYFDFVIGKM
S
>P50030 ~~~apcA~~~Allophycocyanin alpha chain~~~
MSVVTKSIVNADAEARYLSPGELDRIKNFVSTGERRLRIAQTLTENRERIVKQAGDQLFQKRPDVVSPGGNAYGEEMTAT
CLRDLDYYLRLVTYGIVAGDVTPIEEIGLVGVREMYNSLGTPIPAVAEGIRAMKNVACSLLSAEDAAEAGSYFDFVIGAM
Q
>P14697 1.1.1.36~~~phaB~~~Acetoacetyl-CoA reductase~~~COG1028
MTQRIAYVTGGMGGIGTAICQRLAKDGFRVVAGCGPNSPRREKWLEQQKALGFDFIASEGNVADWDSTKTAFDKVKSEVG
EVDVLINNAGITRDVVFRKMTRADWDAVIDTNLTSLFNVTKQVIDGMADRGWGRIVNISSVNGQKGQFGQTNYSTAKAGL
HGFTMALAQEVATKGVTVNTVSPGYIATDMVKAIRQDVLDKIVATIPVKRLGLPEEIASICAWLSSEESGFSTGADFSLN
GGLHMG
>P73826 1.1.1.36~~~phaB~~~Acetoacetyl-CoA reductase~~~COG1028
MLSLGLEDKVIVVTGGNRGIGAAIVKLLQEMGAKVAFTDLATDGGNTEALGVVANVTDLESMTAAAAEITDKLGPVYGVV
ANAGITKDNFFPKLTPADWDAVLNVNLKGVAYSIKPFIEGMYERKAGSIVAISSISGERGNVGQTNYSATKAGVIGMMKS
LAREGARYGVRANAVAPGFIDTEMTLAIREDIREKITKEIPFRRFGKPEEIAWAVAFLLSPVASSYVTGEVLRVNGAHHT
>P45370 2.3.1.-~~~phaC~~~Poly(3-hydroxyalkanoate) polymerase subunit PhaC~~~COG3243
MFPIDIRPDKLTQEMLDYSRKLGQGMENLLNAEAIDTGVSPKQAVYSEDKLVLYRYDRPEGAPEAQPVPLLIVYALVNRP
YMTDIQEDRSTIKGLLATGQDVYLIDWGYPDQADRALTLDDYINGYIDRCVDYLREAHGVDKVNLLGICQGGAFSLMYSA
LHPDKVRNLVTMVTPVDFKTPDNLLSAWVQNVDIDLAVDTMGNIPGELLNWTFLSLKPFSLTGQKYVNMVDLLDDPDKVK
NFLRMEKWIFDSPDQAGETFRQFIKDFYQNNGFLNGGVVLGGQEVDLKDITCPVLNIFALQDHLVPPDASRALKGLTSSP
DYTELAFPGGHIGIYVSGKAQKEVTPAIGKWLNER
>P23608 2.3.1.-~~~phaC~~~Poly(3-hydroxyalkanoate) polymerase subunit PhaC~~~COG3243
MATGKGAAASTQEGKSQPFKVTPGPFDPATWLEWSRQWQGTEGNGHAAASGIPGLDALAGVKIAPAQLGDIQQRYMKDFS
ALWQAMAEGKAEATGPLHDRRFAGDAWRTNLPYRFAAAFYLLNARALTELADAVEADAKTRQRIRFAISQWVDAMSPANF
LATNPEAQRLLIESGGESLRAGVRNMMEDLTRGKISQTDESAFEVGRNVAVTEGAVVFENEYFQLLQYKPLTDKVHARPL
LMVPPCINKYYILDLQPESSLVRHVVEQGHTVFLVSWRNPDASMAGSTWDDYIEHAAIRAIEVARDISGQDKINVLGFCV
GGTIVSTALAVLAARGEHPAASVTLLTTLLDFADTGILDVFVDEGHVQLREATLGGGAGAPCALLRGLELANTFSFLRPN
DLVWNYVVDNYLKGNTPVPFDLLFWNGDATNLPGPWYCWYLRHTYLQNELKVPGKLTVCGVPVDLASIDVPTYIYGSRED
HIVPWTAAYASTALLANKLRFVLGASGHIAGVINPPAKNKRSHWTNDALPESPQQWLAGAIEHHGSWWPDWTAWLAGQAG
AKRAAPANYGNARYRAIEPAPGRYVKAKA
>P73390 2.3.1.-~~~phaC~~~Poly(3-hydroxyalkanoate) polymerase subunit PhaC~~~COG3243
MFLLFFIVHWLKIMLPFFAQVGLEENLHETLDFTEKFLSGLENLQGLNEDDIQVGFTPKEAVYQEDKVILYRFQPVVENP
LPIPVLIVYALVNRPYMVDLQEGRSLVANLLKLGLDVYLIDWGYPSRGDRWLTLEDYLSGYLNNCVDIICQRSQQEKITL
LGVCQGGTFSLCYASLFPDKVKNLVVMVAPVDFEQPGTLLNARGGCTLGAEAVDIDLMVDAMGNIPGDYLNLEFLMLKPL
QLGYQKYLDVPDIMGDEAKLLNFLRMEKWIFDSPDQAGETYRQFLKDFYQQNKLIKGEVMIGDRLVDLHNLTMPILNLYA
EKDHLVAPASSLALGDYLPENCDYTVQSFPVGHIGMYVSGKVQRDLPPAIAHWLSERQ
>P45372 ~~~phaE~~~Poly(3-hydroxyalkanoate) polymerase subunit PhaE~~~
MSNTNFFNDDWLELQRKYWDNWTDMSRKAMGLDSASSSATTPWEAAIDQWWKAMAPAAPDLSRSFMEKMMEQGKNFFRLA
DTFAKRADEGNAGNGLELWTKTLEDMQKRFSGSLDDGGNTMQRLMSFWELPLDNWQRMMSSMSPMPGDMLRNMPHEQFKD
SLDRALSAPGLGYTREEQSQYQELMRSAMEYQAALQEYTNVYTKLGMKSVEHMGSYIQGVIDSGKTIDSARALYDNWVAC
CEGAYADEVATPEYARIHGRLVNAQMALKKRMSILVDENLGALNMPTRSELRTLQDRLQETRRENKALRHSLHSLERRVA
ALAGEEPATKPATALRSPAPAAKAPARRRTTKTNPAD
>P73389 ~~~phaE~~~Poly(3-hydroxyalkanoate) polymerase subunit PhaE~~~
MESTNKTWTELMTPLSQFWLESSSQAWKNWFDLMAKGGAGAMMGSAPQSFESLPQQFLQSQQFYGELLKLSFEAWQSLWP
KLDNGSAPGAVQGYLKQLQTQIEQYTATTQALQGDMDGLWQCYIKEVQRFSQLWLSTWQSSVAPLGKLPTGDIHAWLDLN
NLYGDALYNKNLSSFMRSPLLGPSREMNGKLLRAFDDWVKLSQAMADYQLLEADIQYRGFAALMEDLLARAKEDKPVKTW
KEFQQRWAIAADQVFEEAFCEEKNLKVRGKFINALNRYRIQQQEILEAWLKMLNLPTRSEVDEIHQTIYQLRKEVKSLKK
RLGETEANPG
>O32472 4.2.1.119~~~phaJ~~~(R)-specific enoyl-CoA hydratase~~~
MSAQSLEVGQKARLSKRFGAAEVAAFAALSEDFNPLHLDPAFAATTAFERPIVHGMLLASLFSGLLGQQLPGKGSIYLGQ
SLSFKLPVFVGDEVTAEVEVTALREDKPIATLTTRIFTQGGALAVTGEAVVKLP
>Q2RQ36 4.2.1.119~~~phaJ~~~(R)-specific enoyl-CoA hydratase~~~COG2030
MSADDLILHYFEDIKEGQSASLAKTISESDIYLFAGLSMDTNPAHVNEDYAQTTVFKTRIAHGMLSAGFISAVLGTRLPG
PGAIYVNQSLKFKAPVRIGDTVTATVTVTGLVPEKKFVTFRTTCTVAGKVVIEGEATVMVPARG
>P0C2Y0 3.5.1.79~~~~~~o-phthalyl amidase~~~
MQAPSVHQHVAFTEEIGDLPDGSSYMIRVPENWNGVLIRDLDLVSGTSNSNAARYETMLKEGFAVAGTARHPLRQWQYDP
AHEIENLNHVLDTFEENYGSPERVIQYGCSGGAHVSLAVAEDFSDRVDGSVALAAHTPVWIMNSFLDGWFSLQSLIGEYY
VEAGHGPLSDLAITKLPNDGSSNSSGHGMEGDLPAAWRNAFTAANATPEGRARMALAFALGQWSPWLADNTPQPDLDDPE
AIADSVYESAMRLAGSPGGEARIMFENAARGQQLSWNDDIDYADFWENSNPAMKSAVQELYDTAGLDLQSDIETVNSQPR
IEASQYALDYWNTPGRNVIGDPEVPVLRLHMIGDYQIPYSLVQGYSDLISENNNDDLYRTAFVQSTGHCNFTAAESSAAI
EVMMQRLDTGEWPSTEPDDLNAIAEASNTGTEARFMALDGWEIPEYNRTWKPE
>P73545 ~~~phaP~~~Phasin PhaP~~~
MNTQFFEEYQTQLLDWQKKFFSTWMESLPKGTAEIKLTDTFETSLKLQEEMVKSYLEAQEKSATMMIDAQKQFWDNYFQA
LRQEPVSAN
>P9WQE9 2.3.1.287~~~pks2~~~Phthioceranic/hydroxyphthioceranic acid synthase~~~COG0604
MGLGSAASGTGADRGAWTLAEPRVTPVAVIGMACRLPGGIDSPELLWKALLRGDDLITEVPPDRWDCDEFYDPQPGVPGR
TVCKWGGFLDNPADFDCEFFGIGEREAIAIDPQQRLLLETSWEAMEHAGLTQQTLAGSATGVFAGVTHGDYTMVAADAKQ
LEEPYGYLGNSFSMASGRVAYAMRLHGPAITVDTACSSGLTAVHMACRSLHEGESDVALAGGVALMLEPRKAAAGSALGM
LSPTGRCRAFDVAADGFVSGEGCAVVVLKRLPDALADGDRILAVIRGTSANQDGHTVNIATPSQPAQVAAYRAALAAGGV
DAATVGMVEAHGPGTPIGDPIEYASVSEVYGVDGPCALASVKTNFGHTQSTAGVLGLIKVVLALKHGVVPRNLHFTRLPD
EIAGITTNLFVPEVTTPWPTNGRQVPRRAAVSSYGFSGTNVHAVVEQAPQTEAQPHAASTPPTGTPALFTLSASSADALR
QTAQRLTDWIQQHADSLVLSDLAYTLARRRTHRSVRTAVIASSVDELIAGLGEVADGDTVYQPAVGQDDRGPVWLFSGQG
SQWAAMGADLLTNESVFAATVAELEPLIAAESGFSVTEAMTAPETVTGIDRVQPTIFAMQVALAATMAAYGVRPGAVIGH
SMGESAAAVVAGVLSAEDGVRVICRRSKLMATIAGSAAMASVELPALAVQSELTALGIDDVVVAVVTAPQSTVIAGGTES
VRKLVDIWERRDVLARAVAVDVASHSPQVDPILDELIAALADLNPKAPEIPYYSATLFDPREAPACDARYWADNLRHTVR
FSAAVRSALDDGYRVFAELSPHPLLTHAVDQIAGSVGMPVAALAGMRREQPLPLGLRRLLTDLHNAGAAVDFSVLCPQGR
LVDAPLPAWSHRFLFYDREGVDNRSPGGSTVAVHPLLGAHVRLPEEPERHAWQADVGTATLPWLGDHRIHNVAALPGAAY
CEMALSAARAVLGEQSEVRDMRFEAMLLLDDQTPVSTVATVTSPGVVDFAVEALQEGVGHHLRRASAVLQQVSGECEPPA
YDMASLLEAHPCRVDGEDLRRQFDKHGVQYGPAFTGLAVAYVAEDATATMLAEVALPGSIRSQQGLYAIHPALLDACFQS
VGAHPDSQSVGSGLLVPLGVRRVRAYAPVRTARYCYTRVTKVELVGVEADIDVLDAHGTVLLAVCGLRIGTGVSERDKHN
RVLNERLLTIEWHQRELPEMDPSGAGKWLLISDCAASDVTATRLADAFREHSAACTTMRWPLHDDQLAAADQLRDQVGSD
EFSGVVVLTGSNTGTPHQGSADRGAEYVRRLVGIARELSDLPGAVPRMYVVTRGAQRVLADDCVNLEQGGLRGLLRTIGA
EHPHLRATQIDVDEQTGVEQLARQLLATSEEDETAWRDNEWYVARLCPTPLRPQERRTIVADHQQSGMRLQIRTPGDMQT
IELAAFHRVPPGPGQIEVAVRASSVNFADVLIAFGRYPSFEGHLPQLGTDFAGVVTAVGPGVTDHKVGDHVGGMSPNGCW
GTFVTCDARLAATLPPGLGDAQAAAVTTAHATAWYGLHELARIRAGDTVLIHSGTGGVGQAAIAIARAAGAEIFATAGTP
QRRELLRNMGIEHVYDSRSIEFAEQIRRDTNGRGVDVVLNSVTGAAQLAGLKLLAFRGRFVEIGKRDIYGDTKLGLFPFR
RNLSFYAVDLGLLSATHPEELRDLLGTVYRLTAAGELPMPQSTHYPLVEAATAIRVMGNAEHTGKLVLHIPQTGKSLVTL
PPEQAQVFRPDGSYIITGGLGGLGLFLAEKMAAAGCGRIVLNSRTQPTQKMRETIEAIAAMGSEVVVECGDIAQPGTAER
LVATAVATGLPVRGVLHAAAVVEDATLANITDELLARDWAPKVHGAWELHEATSGQPLDWFCLFSSAAALTGSPGQSAYS
AANSWLDAFAHWRQAQGLPATAIAWGAWSDIGQLGWWSASPARASALEESNYTAITPDEGAYAFEALLRHNRVYTGYAPV
IGAPWLVAFAERSRFFEVFSSSNGSGTSKFRVELNELPRDEWPARLRQLVAEQVSLILRRTVDPDRPLPEYGLDSLGALE
LRTRIETETGIRLAPKNVSATVRGLADHLYEQLAPDDAPAAALSSQ
>Q51718 3.1.1.76~~~phaZ~~~Poly(3-hydroxyoctanoate) depolymerase~~~
MPLRTLLCGLLLAVCLGQHALAASRCSERPRTLLRPAEVSCSYQSTWLDSGLVGQRKIIYQTPLGTPPAGGWPVVLIYQG
SFFPLNDFSYHSNLPFGGYYEGKLVQNLLDHGYAVIAPSAPADLFWQTNIPGLAQAYELSTDYDFLGNVLAAIASGHFGP
LNAQRQYATGISSGGYNTSRMAVSFPGKFRALAVQSGSYATCSGPLCVVPDQLPADHPPTLFLHGFVDAVVPWWSMDLYY
DRLLHQGIETARYTEPLGGHEWFAASPGKVLAWFNAHP
>P07553 ~~~apcD~~~Phycobiliprotein beta chain~~~
MRDAVTSLIKNYDVAGRYFDRNAIESLKSYFESGTQRVQAAKAINANAAAIVKQTGSKLFDEQPELIRPGGNAYTTRRYA
ACLRDLDYYLRYATYAIVAGSMDVLDERVLQGLRETYNSLGVPIGPTVRGIQIMKEIVKEQLGAAGIPNTSFVDEPFDYM
TRELGEKDI
>P85173 ~~~~~~Phosphate-binding protein~~~
DINGGGATLPQKLYLTPDVLTAGFAPYIGVGSGKGKIAFLENKYNQFGTDTTKNVHWAGSDSKLTATELATYAADKEPGW
GKLIQVPSVATSVAIPFRKAGANAVDLSVKELCGVFSGRIADWSGITGAGRSGPIQVVYRAESSGTTELFTRFLNAKCTT
EPGTFAVTTTFANSYSLGLTPLAGAVAATGSDGVMAALNDTTVAEGRITYISPDFAAPTLAGLDDATKVARVGKGVVNGV
AVEGKSPAAANVSAAISVVPLPAAADRGNPDVWVPVFGATTGGGVVAYPDSGYPILGFTNLIFSQCYANATQTGQVRDFF
TKHYGTSANNDAAIEANAFVPLPSNWKAAVRASFLTASNALSIGNTNVCNGKGRPQ
>P9WIC5 4.1.3.40~~~~~~Chorismate pyruvate-lyase~~~COG3161
MTECFLSDQEIRKLNRDLRILIAANGTLTRVLNIVADDEVIVQIVKQRIHDVSPKLSEFEQLGQVGVGRVLQRYIILKGR
NSEHLFVAAESLIAIDRLPAAIITRLTQTNDPLGEVMAASHIETFKEEAKVWVGDLPGWLALHGYQNSRKRAVARRYRVI
SGGQPIMVVTEHFLRSVFRDAPHEEPDRWQFSNAITLAR
>O05527 3.1.1.75~~~phaZCac~~~Poly(3-hydroxybutyrate) depolymerase~~~
MAFNFIRAAAAGAAMALCGVGSVHAAVNLPALKIDKTQTTVSGLSSGGFMAVQLHVAYSATFAKGAGVVAGGPFYCAEGS
IVNATGRCMASPAGIPTSTLVSTTNTWASQGVIDPVANLQNSKVYLFSGTLDSVVKTGVMDALRTYYNSFVPAANVVYKK
DIAAEHAMVTDDYGNACSTKGAPYISDCNFDLAGAMLQHLYGTLNARNNATLPTGNYIEFNQSEFITNHGMATTGWAYVP
QACQAGGTATCKLHVVLHGCKQNIGDVQQQYVRNTGYNRWADTNNIVMLYPQTSTAATNSCWDWWGYDSANYSKKSGPQM
AAIKAMVDRVSSGTGGTTPPDPVALPAPTGVSTSGATASSMAIGWAAVMGAASYNVYRNANKVNALPVTATSYTDTGLAA
STTYSWTVRAADANGAEGATSAAASGTTLAASGGGTATCTTASNYAHTLAGRAYAAGGYTYALGSNQNMGLWNVFVTNTL
KQTSTNYYVIGTCP
>P07122 ~~~cpcA1~~~C-phycocyanin-1 alpha subunit~~~
MKTPLTEAVAAADSQGRFLSSTEIQTAFGRFRQASASLAAAKALTEKASSLASGAANAVYSKFPYTTSQNGPNFASTQTG
KDKCVRDIGYYLRMVTYCLVVGGTGPLDDYLIGGIAEINRTFDLSPSWYVEALKYIKANHGLSGDPAVEANSYIDYAINA
LS
>P00308 ~~~cpcA1~~~C-phycocyanin-1 alpha subunit~~~
MSKTPLTEAVAAADSQGRFLSSTELQVAFGRFRQAASGLAAAKALANNADSLVNGAANAVYSKFPYTTSTPGNNFASTPE
GKAKCARDIGYYLRIVTYALVAGGTGPIDEYLLAGLDEINKTFDLAPSWYVEALKYIKANHGLSGDSRDEANSYIDYLIN
ALS
>P08040 ~~~cpcA2~~~C-phycocyanin-2 alpha subunit~~~
MKTPLTEAVATADSQGRFLSSTELQVAFGRFRQASASLDAAKALSSKANSLAQGAVNAVYQKFPYTTQMQGKNFASDQRG
KDKCARDIGYYIRIVTYCLVAGGTGPLDDYLIGGLAEINRTFDLSPSWYVEALKYIKANHGLSGDPAVEANSYIDYAINA
LS
>P14876 ~~~cpcA3~~~C-phycocyanin-3 alpha subunit~~~
MTKTPLTEAVVSADSQGRFLSTELQVAFGRFRQAGSSLEAAKALSKKASSLAEAAANAVYQKFPYTTTTSGPNYASTQTG
KDKCVRDIGYYLRIVTYGLVVGGTGPIDDYLIGGLAEINRTFELSPSWYIEALKYIKANHGLSGDPAVEANSYIDYIINA
LS
>P72509 ~~~cpcA~~~C-phycocyanin alpha subunit~~~
MKTPLTEAVSIADSQGRFLSSTEIQVAFGRFRQAKAGLEAAKALTSKADSLISGAAQAVYNKFPYTTQMQGPNYAADQRG
KDKCARDIGYYLRMVTYCLIAGGTGPMDEYLIAGIDEINRTFELSPSWYIEALKYIKANHGLSGDAATEANSYLDYAINA
LS
>P00307 ~~~cpcA~~~C-phycocyanin alpha subunit~~~
MVKTPITDAIAAADTQGRFLSNTELQAVNGRYQRAAASLEAARALTANAQRLIDGAAQAVYQKFPYTTQTSGPNYAADAR
GKSKCARDIGHYLRIITYSLVAGGTGPLDEYLIAGLNEINDAFELSPSWYIEALKYIKANHGLSGQAANEANTYIDYVIN
ALS
>P07121 ~~~cpcA~~~C-phycocyanin alpha subunit~~~
MVKTPITEAIAAADTQGRFLGNTELQSARGRYERAAASLEAARGLTSNAQRLIDGATQAVYQKFPYTTQTPGPQFAADSR
GKSKCARDVGHYLRIITYSLVAGGTGPLDEYLIAGLAEINSTFDLSPSWYVEALKHIKANHGLSGQAANEANTYIDYAIN
ALS
>P13530 ~~~cpcA1~~~C-phycocyanin alpha subunit~~~
MSKTPLTEAVAAADSQGRFLSSTELQVAFGRFRQAASGLAAAKALANNADSLVNGAANAVYSKFPYTTSTPGNNFASTPE
GKAKCARDIGYYLRIVTYALVAGGTGPIDEYLLAGLDEINKTFDLAPSWYVEALKYIKANHGLSGDSRDEANSYIDYLIN
ALS
>P03943 ~~~cpcA~~~C-phycocyanin subunit alpha~~~
MKTPLTEAVALADSQGRFLSNTELQYLYGRLRQGAFALEAAQTLTAKADTLVNGAAQAVYSKFPYTTSTPGNNFAADQRG
KDKCARDIGYYLRMVTYCLVAGGTGPMDEYLIAGVDEINRTFDLSPSWYVEALKHIKANHGLTGDAATETNNYIDYAINA
LS
>Q54715 ~~~cpcA~~~C-phycocyanin alpha subunit~~~
MKTPLTEAVSTADSQGRFLSSTELQIAFGRLRQANAGLQAAKALTDNAQSLVNGAAQAVYNKFPYTTQTQGNNFAADQRG
KDKCARDIGYYLRIVTYCLVAGGTGPLDEYLIAGIDEINRTFDLSPSWYVEALKYIKANHGLSGDARDEANSYLDYAINA
LS
>P50032 ~~~cpcA~~~C-phycocyanin alpha subunit~~~
MKTPITEAIAAADTQGRFLSNTELQAVDGRFKRAVASMEAARALTNNAQSLIDGAAQAVYQKFPYTTTMQGSQYASTPEG
KAKCARDIGYYLRMVTYCLVAGGTGPMDEYLIAGLSEINSTFDLSPSWYIEALKYIKANHGLTGQAAVEANAYIDYAINA
LS
>P07119 ~~~cpcB1~~~C-phycocyanin-1 beta subunit~~~
MLDAFAKVVSQADARGEYLSGSQIDALSALVADGNKRMDVVNRITGNSSTIVANAARSLFAEQPQLIAPGGNAYTSRRMA
ACLRDMEIILRYVTYAIFAGDASVLDDRCLNGLKETYLALGTPGSSVAVGVQKMKDAALAIAGDTNGITRGDCASLMAEV
ASYFDKAASAVA
>P00312 ~~~cpcB1~~~C-phycocyanin-1 beta subunit~~~
MTFDAFTKVVAQADARGEFLSDAQLDALSRLVAEGNKRIDTVNRITGNASSIVANAARALFAEQPSLIAPGGNAYTNRRM
AACLRDMEIILRYVTYAVFTGDASILDDRCLNGLRETYLALGVPGASVAEGVRKMKDAAVAIVSDRNGITQGDCSAIISE
LGSYFDKAAAAVA
>P08039 ~~~cpcB2~~~C-phycocyanin-2 beta subunit~~~
MLDAFTKVVSQADTRGAYISDAEIDALKTMVAAGSKRMDVVNRITGNASTIVANAARALFEEQPQLIAPGGNAYTNRRMA
ACLRDMEIILRYVTYAVFAGDASVLDDRCLNGLRETYQALGVPGASVSTGVQKMKEAAIAIANDPSGVTRGDCSSLMSEL
GSYFDRAAAAVG
>P14877 ~~~cpcB3~~~C-phycocyanin-3 beta subunit~~~
MVQDAFSKVVSQADARGEYLSDGQLDALINLVKEGNKRVDVVNRISSNASSIVRNAARSLFAEQPQLIAPGGNAYTSRRA
AACVRDLEIILRYVTYAIFAGDASVLDDRALNGLRETYLALGTPGASVAVGIQKLKESSIAIANDPNGITRGDCSSLIAE
VSGYFDRAAAAVA
>P72508 ~~~cpcB~~~C-phycocyanin beta subunit~~~
MFDAFTKVVSQADTRGEMLSTAQIDALSQMVAESNKRLDAVNRITSNASTIVSNAARSLFAEQPQLIAPGGNAYTSRRMA
ACLRDMEIILRYVTYAVFAGDASVLEDRCLNGLRETYLALGTPGSSVAVGVGKMKEAALAIVNDPAGITPGDCSALASEI
ASYFDRACAAVS
>P00310 ~~~cpcB~~~C-phycocyanin beta subunit~~~
MAYDVFTKVVSQADSRGEFLSNEQLDALANVVKEGNKRLDVVNRITSNASTIVTNAARALFEEQPQLIAPGGNAYTNRRM
AACLRDMEIILRYITYAILAGDASILDDRCLNGLRETYQALGTPGSSVAVGIQKMKEAAINIANDPNGITKGDCSALISE
VASYFDRAAAAVA
>P07120 ~~~cpcB~~~C-phycocyanin beta subunit~~~
MTLDVFTKVVSQADSRGEFLSNEQLDALANVVKEGNKRLDVVNRITSNASAIVTNAARALFEEQPQLIAPGGNAYTNRRM
AACLRDMEIILRYVTYAILAGDASVLDDRCLNGLRETYQALGTPGSSVAVGVQKMKDAAVGIANDPNGITKGDCSQLISE
VASYFDRAAAAVG
>P06539 ~~~cpcB1~~~C-phycocyanin beta subunit~~~
MTFDAFTKVVAQADARGEFLSDAQLDALSRLVAEGNKRIDTVNRITGNASSIVANAARALFAEQPSLIAPGGNAYTNRRM
AACLRDMEIILRYVTYAVFTGDASILDDRCLNGLRETYLALGVPGASVAEGVRKMKDAAVAIVSDRNGITQGDCSAIISE
LGSYFDKAAAAVA
>P03944 ~~~cpcB~~~C-phycocyanin subunit beta~~~
MFDIFTRVVSQADARGEFISSDKLEALKKVVAEGTKRSDAVSRMTNNASSIVTNAARQLFADQPQLIAPGGNAYTNRRMA
ACLRDMEIILRYVTYATFTGDASVLNDRCLNGLRETYVALGVPGASVAAGVRAMGKAAVAIVMDPAGVTSGDCSSLQQEI
ELYFETAAKAVE
>Q54714 ~~~cpcB~~~C-phycocyanin beta subunit~~~
MFDVFTRVVSQADARGEYLSGSQLDALSATVAEGNKRIDSVNRITGNASAIVSNAARALFAEQPQLIQPGGNAYTSRRMA
ACLRDMEIILRYVTYATFTGDASVLEDRCLNGLRETYVALGVPGASVAAGVQKMKEAALDIVNDPNGITRGDCSAIVAEI
AGYFDRAAAAVA
>P50033 ~~~cpcB~~~C-phycocyanin beta subunit~~~
MLDAFAKVVAQADARGEFLTNAQFDALSNLVKEGNKRLDAVNRITSNASTIVANAARALFAEQPQLIQPGGNAYTNRRMA
ACLRDMEIILRYVTYAILAGDSSVLDDRCLNGLRETYQALGTPGSSVAVAIQKMKDAAIAIANDPNGITPGDCSALMSEI
AGYFDRAAAAVA
>O24721 1.13.11.38~~~phdI~~~1-hydroxy-2-naphthoate 1,2-dioxygenasee~~~
MNSSNTGAPEAAQAATLEAFDRRAAEQYLRGQWIAEEHLMRAIGGPRPAGIPYRWEWKSVEVALDEATIALGPVDTARRH
LTFVNPGLMDRGSATTHTISAGFQLVKPGEVCWSHRHTMSAVRFVTKGHPDAFTAVDGERLPMEDFDLLITPRFSWHDHH
NSGDADVVWLDGLDIGLLQSLGAVFYEPYGDDSQNVRPSSSEGIGTRSHWLRPTWERGRESRLPIRYPWKEVNARLDVYD
LDAGTPYDGLALRYANPVTGGPTMATMDCWVQRLAPGFDGKSHRRSSSAITYVISGSGTMVTEDETITFNRGDVISLPNW
TNFRWTNDSEIEPVLLFSMHDIPALEAFGLLYEEPEAILNATPAPINPTPSLNPIYRPGAFYDQDEL
>Q79EM8 4.1.2.34~~~phdJ~~~Trans-2'-carboxybenzalpyruvate hydratase-aldolase~~~
MTSPAVTSADITGLVGIVPTPSKPGSEAPDAVDTVDLDETARMVELIVASGVDVLLTNGTFGEVATLTYEELLAFNDTVI
RTVANRIPVFCGASTLNTRDTIARSLALMGLGANGLFVGRPMWLPLDDEQLVSYYAAVCDAVPAAAVVVYDNTGVFKGKI
SSAAYAALAEIPQIVASKHLGVLSGSDAYASDLAAVKGRFPLLPTADNWLPSLEAFPGEVPAAWSGDVACGPEPVMALRR
AIAEGLWDDARAVHEDIAWATEPLFPGGDISKFMPYSIQIDRAEFEAAGYIVPGPSRHPYGTAPAAYLEGGAEVGRRWAG
IRQKYVATLAEP
>Q79EM7 1.2.1.78~~~phdK~~~2-formylbenzoate dehydrogenase~~~
MTTPRKFDEYRWNVLVDGVPLNVESRYPISDPSTGRYLTQVPDCAEADVDRAVQASRQAQAEWGALPPRARAAKLRELIT
LLREHREEFAMLDAIDGGFPISMMRNDVDAALELMDIFADMALDLGGKTIPVSTNLHFTTHEPFGVVARIGAFNHPFFFA
ASKVAAPLMAGNSVILKAPDQTPLSSLRLAEVAAEVLPQNLLITISGRGRVAGRAIVRHPQIKRIGFIGSTDTGRSIQRD
AAEVAVKHISLELGGKNAQIVFADADLEQAALGAVNGMNFTWTAGQSCGSTSRLLVHESVADQVIARVVELVSAIAVGPP
LDENAQMGPLVSQAQYDKSVHAIGEGIREGAKVVAGGGRPEGVGEGGWYLAPTVLADVRPGSFIEQNEIFGPVLSVIIFA
TDDEAVAIANGVEYGLTASVWTSDITRAHLIARRVEAGYVLVNGGSRHYWGLPFGGVKSSGVGSEESMEELISYTETKTT
TVVLG
>A0QRX9 ~~~phd~~~Antitoxin Phd~~~
MPSLNIDFDEAEMEQIRAAARADDLSLKKFAHAAVMERASAHKRRVAEAARLVAERSAELNRRLA
>Q02179 ~~~cpeA~~~C-phycoerythrin class 1 subunit alpha~~~
MKSVVTTVVTAADAAGRFLSQNDLEAVQGNIQRAAARLEAAEKLAAGLDKVTREAGDACFNKYSYLKQPGEAGDSQVKVD
KCYRDLGHYLRLINYCLVVGGTGPLDEWGIAGAREVYRSLSLPTGPYVEALTYTRDRACAPRDMSPQALNEFKSYLDYVI
NALS
>P27646 ~~~mpeA~~~C-phycoerythrin class 2 subunit alpha~~~
MKSVITTVVGAADSASRFPSASDMESVQGSIQRAAARLEAAEKLSANYDAIAQRAVDAVYAQYPNGATGRQPRQCATEGK
EKCKRDFVHYLRLINYCLVTGGTGPLDELAINGQKEVYKALSIDAGTYVAGFSNMRNDGCSPRDMSAQALTAYNTLLDYV
INSLG
>P0A317 ~~~mpeA~~~C-phycoerythrin class 2 subunit alpha~~~
MKSVITTVVGAADSASRFPSASDMESVQGSIQRAAARLEAAEKLAGNYDQVAQEAVDAVYNQYPNGATGRQPRKCATEGK
EKCKRDFVHYLRLINYCLVTGGTGPLDELAINGQKEVYKALSIDAGTYVAGFSHLRSRGCAPRDMSAQALTAYNQLLDYV
INSLG
>P00309 ~~~pccA~~~Phycoerythrocyanin alpha chain~~~
MKTPLTEAIAAADLRGSYLSNTELQAVFGRFNRARAGLEAARAFANNGKKWAEAAANHVYQKFPYTTQMQGPQYASTPEG
KAKCVRDIDHYLRTISYCCVVGGTGPLDDYVVAGLKEFNSALGLSPSWYIAALEFVRDNHGLTGDVAGEANTYINYAINA
LS
>P05098 ~~~cpeA~~~C-phycoerythrin alpha chain~~~
MKSVVTTVIAAADAAGRFPSTSDLESVQGSIQRAAARLEAAEKLANNIDAVATEAYNACIKKYPYLNNSGEANSTDTFKA
KCARDIKHYLRLIQYSLVVGGTGPLDEWGIAGQREVYRALGLPTAPYVEALSFARNRGCAPRDMSAQALTEYNALLDYAI
NSLS
>Q7TVJ6 4.2.1.51~~~pheA~~~Prephenate dehydratase~~~
MVRIAYLGPEGTFTEAALVRMVAAGLVPETGPDALQRMPVESAPAALAAVRDGGADYACVPIENSIDGSVLPTLDSLAIG
VRLQVFAETTLDVTFSIVVKPGRNAADVRTLAAFPVAAAQVRQWLAAHLPAADLRPAYSNADAARQVADGLVDAAVTSPL
AAARWGLAALADGVVDESNARTRFVLVGRPGPPPARTGADRTSAVLRIDNQPGALVAALAEFGIRGIDLTRIESRPTRTE
LGTYLFFVDCVGHIDDEAVAEALKAVHRRCADVRYLGSWPTGPAAGAQPPLVDEASRWLARLRAGKPEQTLVRPDDQGAQ
A
>P9WIC3 4.2.1.51~~~pheA~~~Prephenate dehydratase~~~COG0077
MVRIAYLGPEGTFTEAALVRMVAAGLVPETGPDALQRMPVESAPAALAAVRDGGADYACVPIENSIDGSVLPTLDSLAIG
VRLQVFAETTLDVTFSIVVKPGRNAADVRTLAAFPVAAAQVRQWLAAHLPAADLRPAYSNADAARQVADGLVDAAVTSPL
AAARWGLAALADGVVDESNARTRFVLVGRPGPPPARTGADRTSAVLRIDNQPGALVAALAEFGIRGIDLTRIESRPTRTE
LGTYLFFVDCVGHIDDEAVAEALKAVHRRCADVRYLGSWPTGPAAGAQPPLVDEASRWLARLRAGKPEQTLVRPDDQGAQ
A
>P35796 ~~~pecA~~~Phycoerythrocyanin alpha chain~~~
MKTPLTEAISAADVRGSYLSNTEMQAVFGRFNRARAGLAAAQAFSNNGKKWAEAAANHVYQKFPYTTQMSGPQYASTPEG
KSKCVRDIDHYLRTISYCCVVGGTGPLDEYVVSGLSELNSALGLSPSWYVAALEFVRDNHGLNGDVAGEANIYLNYAINA
LS
>Q02180 ~~~cpeB~~~C-phycoerythrin class 1 subunit beta~~~
MLDAFSRSVVSADAKTAPVGGSDLAGLRSYVRDGNKRLDAVNAITSNASCIVSDAVTGMICENTGLIQAGGNCYPNRRMA
ACLRDGEIVLRYISYALLAGDASVLDDRCLNGLKETYIALGVPTQSAGRAVAIMKASATAHIGETNTPGLGGKRFRKMET
TQGDCAALVAEAGAYFDRVIGAIS
>P27647 ~~~mpeB~~~C-phycoerythrin class 2 subunit beta~~~
MLDAFSRKAVSADSSGAFIGGGELASLKSFIADGNKRLDAVNALSSNAACIVSDAVAGICCENTGLTAPNGGVYTNRKMA
ACLRDGEIVLRYVSYALLAGDASVLQDRCLNGLRETYAALGVPTGSAARAVAIMKAASAALITNTNSQPKKAAVTQGDCS
SLAGEAGSYFDAVISAIS
>P0A319 ~~~mpeB~~~C-phycoerythrin class 2 subunit beta~~~
MLDAFSRAAVSADSSGSFIGGGELASLKSFIADGNKRLDAVNAITSNASCIVSDAVAGICCENTGLTAPNGGVYTNRKMA
ACLRDGEIVLRYVSYALLAGDASVLQDRCLNGLRETYAALGVPTGSASRAVAIMKAAAGALITNTNSQPKKMPVTTGDCS
NIAGEAASYFDMVISAIS
>P00313 ~~~pccB~~~Phycoerythrocyanin beta chain~~~
MLDAFSRVVEQADKKGAYLSNDEINALQAIVADSNKRLDVVNRLTSNASSIVANAYRALVAERPQVFNPGGPCFHHRNQA
ACIRDLGFILRYVTYSVLAGDTSVMDDRCLNGLRETYQALGTPGDAVASGIKKMKEAALKIANDPNGITKGDCSQLMSEL
ASYFDRAAAAVA
>P05097 ~~~cpeB~~~C-phycoerythrin beta chain~~~
MLDAFSRAVVSADASTSTVSDIAALRAFVASGNRRLDAVNAIASNASCMVSDAVAGMICENQGLIQAGGNCYPNRRMAAC
LRDAEIVLRYVTYALLAGDASVLDDRCLNGLKETYAALGVPTTSTVRAVQIMKAQAAAHIQDTPSEARAGAKLRKMGTPV
VEDRCASLVAEASSYFDRVISALS
>P35797 ~~~pecB~~~Phycoerythrocyanin subunit beta~~~
MLDAFSKVVEQADRKGNYLSGDEINALSALVADSNKRLDIVNRLTSNASSIVANAYRALVAERPQIFNAGGACFHNRNQA
ACIRDLGFILRYVTYSVLAGDGSVMDDRCLNGLRETYQALGTPGDAVASGIQKMKDAAIAIANDSKGITKGDCSQLIAEL
ASYFDRAASAVV
>Q7VDN2 ~~~cpeB~~~R-phycoerythrin subunit beta~~~
MLDAFSRAVVSADSKGATIGSAELSSLRKYVADANKRIDATLAITQNVSCIAADAISGMVCENTGLTQPGGHCYPTRRMA
ACLRDGEIILRYVSYALLAGDPSVLDDRCINGLKETYIALGVPLSNAIRAIEIMKIATVAIMTETNSGRKMFEGINSGSG
AECKDIASEAASYFDRVIDALN
>Q7V2Z3 ~~~cpeB~~~R-phycoerythrin subunit beta~~~
MTVSKSNQILSNDRDLENISNKNIEDIKEFINTANSRLDAIDSITNNSHAIAADAVTAMICENQDSVNTKISLDTTNKMS
VCLRDGEIILRIVSYLLISDDESVLSKNCLKDLKNTYLALGVPLKNAIRVFELMRDATISDLKSTVNSMKGEKEFLSDLI
SNTEFQFERIINLLR
>Q01269 ~~~pheC~~~Cyclohexadienyl dehydratase~~~
MPKSFRHLVQALACLALLASASLQAQESRLDRILESGVLRVATTGDYKPFSYRTEEGGYAGFDVDMAQRLAESLGAKLVV
VPTSWPNLMRDFADDRFDIAMSGISINLERQRQAYFSIPYLRDGKTPITLCSEEARFQTLEQIDQPGVTAIVNPGGTNEK
FARANLKKARILVHPDNVTIFQQIVDGKADLMMTDAIEARLQSRLHPELCAVHPQQPFDFAEKAYLLPRDEAFKRYVDQW
LHIAEQSGLLRQRMEHWLEYRWPTAHGK
>Q02181 ~~~mpeC~~~Phycoerythrin class 2 subunit gamma, linker polypeptide~~~
MLGAETSLQALTSATRTGPAAFSTKSKAGKNTVPRTVAGAIAEYKRQHCAAMGIGIGPRLLSECPFAVTFDRYSPDSSAA
LERVIVAAYRQVLGNLPPTDNQRETSLEVRLMNGEITVRDFVNGLAKSDFYKDNFFHAVGAQRGIELNFKHLLGRAPLNQ
QEVQNHIKLQAEEGFDALIDTLTDSAEYTEVFGADIVPYDRTKDSYAGMNTRSFNLMRDLGGMKVAISDNAQGRQSKTVN
ALASASRESTKPQPFSYVSVTQIPVKLPQQQYTGHNVPAMSDYVPFRPFGIFF
>P24207 ~~~pheP~~~Phenylalanine-specific permease~~~COG1113
MKNASTVSEDTASNQEPTLHRGLHNRHIQLIALGGAIGTGLFLGIGPAIQMAGPAVLLGYGVAGIIAFLIMRQLGEMVVE
EPVSGSFAHFAYKYWGPFAGFLSGWNYWVMFVLVGMAELTAAGIYMQYWFPDVPTWIWAAAFFIIINAVNLVNVRLYGET
EFWFALIKVLAIIGMIGFGLWLLFSGHGGEKASIDNLWRYGGFFATGWNGLILSLAVIMFSFGGLELIGITAAEARDPEK
SIPKAVNQVVYRILLFYIGSLVVLLALYPWVEVKSNSSPFVMIFHNLDSNVVASALNFVILVASLSVYNSGVYSNSRMLF
GLSVQGNAPKFLTRVSRRGVPINSLMLSGAITSLVVLINYLLPQKAFGLLMALVVATLLLNWIMICLAHLRFRAAMRRQG
RETQFKALLYPFGNYLCIAFLGMILLLMCTMDDMRLSAILLPVWIVFLFMAFKTLRRK
>P29166 1.12.7.2~~~~~~Iron hydrogenase 1~~~
MKTIIINGVQFNTDEDTTILKFARDNNIDISALCFLNNCNNDINKCEICTVEVEGTGLVTACDTLIEDGMIINTNSDAVN
EKIKSRISQLLDIHEFKCGPCNRRENCEFLKLVIKYKARASKPFLPKDKTEYVDERSKSLTVDRTKCLLCGRCVNACGKN
TETYAMKFLNKNGKTIIGAEDEKCFDDTNCLLCGQCIIACPVAALSEKSHMDRVKNALNAPEKHVIVAMAPSVRASIGEL
FNMGFGVDVTGKIYTALRQLGFDKIFDINFGADMTIMEEATELVQRIENNGPFPMFTSCCPGWVRQAENYYPELLNNLSS
AKSPQQIFGTASKTYYPSISGLDPKNVFTVTVMPCTSKKFEADRPQMEKDGLRDIDAVITTRELAKMIKDAKIPFAKLED
SEADPAMGEYSGAGAIFGATGGVMEAALRSAKDFAENAELEDIEYKQVRGLNGIKEAEVEINNNKYNVAVINGASNLFKF
MKSGMINEKQYHFIEVMACHGGCVNGGGQPHVNPKDLEKVDIKKVRASVLYNQDEHLSKRKSHENTALVKMYQNYFGKPG
EGRAHEILHFKYKK
>P07598 1.12.7.2~~~hydA~~~Periplasmic [Fe] hydrogenase large subunit~~~COG1145
MSRTVMERIEYEMHTPDPKADPDKLHFVQIDEAKCIGCDTCSQYCPTAAIFGEMGEPHSIPHIEACINCGQCLTHCPENA
IYEAQSWVPEVEKKLKDGKVKCIAMPAPAVRYALGDAFGMPVGSVTTGKMLAALQKLGFAHCWDTEFTADVTIWEEGSEF
VERLTKKSDMPLPQFTSCCPGWQKYAETYYPELLPHFSTCKSPIGMNGALAKTYGAERMKYDPKQVYTVSIMPCIAKKYE
GLRPELKSSGMRDIDATLTTRELAYMIKKAGIDFAKLPDGKRDSLMGESTGGATIFGVTGGVMEAALRFAYEAVTGKKPD
SWDFKAVRGLDGIKEATVNVGGTDVKVAVVHGAKRFKQVCDDVKAGKSPYHFIEYMACPGGCVCGGGQPVMPGVLEAMDR
TTTRLYAGLKKRLAMASANKA
>P07603 1.12.7.2~~~hydB~~~Periplasmic [Fe] hydrogenase small subunit~~~COG4624
MQIASITRRGFLKVACVTTGAALIGIRMTGKAVAAVKQIKDYMLDRINGVYGADAKFPVRASQDNTQVKALYKSYLEKPL
GHKSHDLLHTHWFDKSKGVKELTTAGKLPNPRASEFEGPYPYE
>P20586 1.14.13.2~~~pobA~~~p-hydroxybenzoate hydroxylase~~~
MKTQVAIIGAGPSGLLLGQLLHKAGIDNVILERQTPDYVLGRIRAGVLEQGMVDLLREAGVDRRMARDGLVHEGVEIAFA
GQRRRIDLKRLSGGKTVTVYGQTEVTRDLMEAREACGATTVYQAAEVRLHDLQGERPYVTFERDGERLRLDCDYIAGCDG
FHGISRQSIPAERLKVFERVYPFGWLGLLADTPPVSHELIYANHPRGFALCSQRSATRSRYYVQVPLSEKVEDWSDERFW
TELKARLPSEVAEKLVTGPSLEKSIAPLRSFVVEPMQHGRLFLAGDAAHIVPPTGAKGLNLAASDVSTLYRLLLKAYREG
RGELLERYSAICLRRIWKAERFSWWMTSVLHRFPDTDAFSQRIQQTELEYYLGSEAGLATIAENYVGLPYEEIE
>P00438 1.14.13.2~~~pobA~~~p-hydroxybenzoate hydroxylase~~~
MKTQVAIIGAGPSGLLLGQLLHKAGIDNVILERQTPDYVLGRIRAGVLEQGMVDLLREAGVDRRMARDGLVHEGVEIAFA
GQRRRIDLKRLSGGKTVTVYGQTEVTRDLMEAREACGATTVYQAAEVRLHDLQGERPYVTFERDGERLRLDCDYIAGCDG
FHGISRQSIPAERLKVFERVYPFGWLGLLADTPPVSHELIYANHPRGFALCSQRSATRSRYYVQVPLTEKVEDWSDERFW
TELKARLPAEVAEKLVTGPSLEKSIAPLRSFVVEPMQHGRLFLAGDAAHIVPPTGAKGLNLAASDVSTLYRLLLKAYREG
RGELLERYSAICLRRIWKAERFSWWMTSVLHRFPDTDAFSQRIQQTELEYYLGSEAGLATIAENYVGLPYEEIE
>P42404 5.3.1.27~~~hxlB~~~3-hexulose-6-phosphate isomerase~~~COG0794
MKTTEYVAEILNELHNSAAYISNEEADQLADHILSSHQIFTAGAGRSGLMAKSFAMRLMHMGFNAHIVGEILTPPLAEGD
LVIIGSGSGETKSLIHTAAKAKSLHGIVAALTINPESSIGKQADLIIRMPGSPKDQSNGSYKTIQPMGSLFEQTLLLFYD
AVILKLMEKKGLDSETMFTHHANLE
>Q9S0X3 5.3.1.27~~~rmpB~~~3-hexulose-6-phosphate isomerase~~~
MNKYQELVVSKLTNVINNTAEGYDDKILSLVDAAGRTFIGGAGRSLLVSRFFAMRLVHAGYQVSMVGEVVTPSIQAGDLF
IVISGSGSTETLMPLVKKAKSQGAKIIVISMKAQSPMAELADLVVPVGGNDANAFDKTHGMPMGTIFELSTLWFLEATIA
KLVDQKGLTEEGMRAIHANLE
>Q9LBW5 5.3.1.27~~~rmpB~~~3-hexulose-6-phosphate isomerase~~~
MTQAAEADGAVKVVGDDITNNLSLVRDEVADTAAKVDPEQVAVLARQIVQPGRVFVAGAGRSGLVLRMAAMRLMHFGLTV
HVAGDTTTPAISAGDLLLVASGSGTTSGVVKSAETAKKAGARIAAFTTNPDSPLAGLADAVVIIPAAQKTDHGSHISRQY
AGSLFEQVLFVVTEAVFQSLWDHTEVEAEELWTRHANLE
>P9WIB7 1.2.-.-~~~~~~Phthiodiolone/phenolphthiodiolone dimycocerosates ketoreductase~~~COG2141
MGGLRFGFVDALVHSRLPPTLPARSSMAAATVMGADSYWVGDHLNALVPRSIATSEYLGIAAKFVPKIDANYEPWTMLGN
LAFGLPSRLRLGVCVTDAGRRNPAVTAQAAATLHLLTRGRAILGIGVGEREGNEPYGVEWTKPVARFEEALATIRALWNS
NGELISRESPYFPLHNALFDLPPYRGKWPEIWVAAHGPRMLRATGRYADAWIPIVVVRPSDYSRALEAVRSAASDAGRDP
MSITPAAVRGIITGRNRDDVEEALESVVVKMTALGVPGEAWARHGVEHPMGADFSGVQDIIPQTMDKQTVLSYAAKVPAA
LMKEVVFSGTPDEVIDQVAEWRDHGLRYVVLINGSLVNPSLRKTVTAVLPHAKVLRGLKKL
>P11889 3.1.4.12~~~sph~~~Sphingomyelinase C~~~
MKGKLLKGVLSLGVGLGALYSGTSAQAEASTNQNDTLKVMTHNVYMLSTNLYPNWGQTERADLIGAADYIKNQDVVILNE
VFDNSASDRLLGNLKKEYPNQTAVLGRSSGSEWDKTLGNYSSSTPEDGGVAIVSKWPIAEKIQYVFAKGCGPDNLSNKGF
VYTKIKKNDRFVHVIGTHLQAEDSMCGKTSPASVRTNQLKEIQDFIKNKNIPNNEYVLIGGDMNVNKINAENNNDSEYAS
MFKTLNASVPSYTGHTATWDATTNSIAKYNFPDSPAEYLDYIIASKDHANPSYIENKVLQPKSPQWTVTSWFQKYTYNDY
SDDYPVEATISMK
>P9WIB4 3.1.4.3~~~plcA~~~Phospholipase C A~~~
MSASPLLGMSRREFLTKLTGAGAAAFLMDWAAPVIEKAYGAGPCPGHLTDIEHIVLLMQENRSFDHYFGTLSSTNGFNAA
SPAFQQMGWNPMTQALDPAGVTIPFRLDTTRGPFLDGECVNDPEHQWVGMHLAWNGGANDNWLPAQATTRAGPYVPLTMG
YYTRQDIPIHYLLADTFTICDGYHCSLLTGTLPNRLYWLSANIDPAGTDGGPQLVEPGFLPLQQFSWRIMPENLEDAGVS
WKVYQNKGLGRFINTPISNNGLVQAFRQAADPRSNLARYGIAPTYPGDFAADVRANRLPKVSWLVPNILQSEHPALPVAL
GAVSMVTALRILLSNPAVWEKTALIVSYDENGGFFDHVTPPTAPPGTPGEFVTVPNIDAVPGSGGIRGPLGLGFRVPCIV
ISPYSRGPLMVSDTFDHTSQLKLIRARFGVPVPNMTAWRDGVVGDMTSAFNFATPPNSTRPNLSHPLLGALPKLPQCIPN
VVLGTTDGALPSIPYRVPYPQVMPTQETTPVRGTPSGLCS
>P9WIB5 3.1.4.3~~~plcA~~~Phospholipase C A~~~COG3511
MSASPLLGMSRREFLTKLTGAGAAAFLMDWAAPVIEKAYGAGPCPGHLTDIEHIVLLMQENRSFDHYFGTLSSTNGFNAA
SPAFQQMGWNPMTQALDPAGVTIPFRLDTTRGPFLDGECVNDPEHQWVGMHLAWNGGANDNWLPAQATTRAGPYVPLTMG
YYTRQDIPIHYLLADTFTICDGYHCSLLTGTLPNRLYWLSANIDPAGTDGGPQLVEPGFLPLQQFSWRIMPENLEDAGVS
WKVYQNKGLGRFINTPISNNGLVQAFRQAADPRSNLARYGIAPTYPGDFAADVRANRLPKVSWLVPNILQSEHPALPVAL
GAVSMVTALRILLSNPAVWEKTALIVSYDENGGFFDHVTPPTAPPGTPGEFVTVPNIDAVPGSGGIRGPLGLGFRVPCIV
ISPYSRGPLMVSDTFDHTSQLKLIRARFGVPVPNMTAWRDGVVGDMTSAFNFATPPNSTRPNLSHPLLGALPKLPQCIPN
VVLGTTDGALPSIPYRVPYPQVMPTQETTPVRGTPSGLCS
>P9WIB2 3.1.4.3~~~plcB~~~Phospholipase C B~~~
MGSEHPVDGMTRRQFFAKAAAATTAGAFMSLAGPIIEKAYGAGPCPGHLTDIEHIVLLMQENRSFDHYFGTLSDTRGFDD
TTPPVVFAQSGWNPMTQAVDPAGVTLPYRFDTTRGPLVAGECVNDPDHSWIGMHNSWNGGANDNWLPAQVPFSPLQGNVP
VTMGFYTRRDLPIHYLLADTFTVCDGYFCSLLGGTTPNRLYWMSAWIDPDGTDGGPVLIEPNIQPLQHYSWRIMPENLED
AGVSWKVYQNKLLGALNNTVVGYNGLVNDFKQAADPRSNLARFGISPTYPLDFAADVRNNRLPKVSWVLPGFLLSEHPAF
PVNVGAVAIVDALRILLSNPAVWEKTALIVNYDENGGFFDHVVPPTPPPGTPGEFVTVPDIDSVPGSGGIRGPIGLGFRV
PCLVISPYSRGPLMVHDTFDHTSTLKLIRARFGVPVPNLTAWRDATVGDMTSTFNFAAPPNPSKPNLDHPRLNALPKLPQ
CVPNAVLGTVTKTAIPYRVPFPQSMPTQETAPTRGIPSGLC
>P9WIB3 3.1.4.3~~~plcB~~~Phospholipase C B~~~COG3511
MGSEHPVDGMTRRQFFAKAAAATTAGAFMSLAGPIIEKAYGAGPCPGHLTDIEHIVLLMQENRSFDHYFGTLSDTRGFDD
TTPPVVFAQSGWNPMTQAVDPAGVTLPYRFDTTRGPLVAGECVNDPDHSWIGMHNSWNGGANDNWLPAQVPFSPLQGNVP
VTMGFYTRRDLPIHYLLADTFTVCDGYFCSLLGGTTPNRLYWMSAWIDPDGTDGGPVLIEPNIQPLQHYSWRIMPENLED
AGVSWKVYQNKLLGALNNTVVGYNGLVNDFKQAADPRSNLARFGISPTYPLDFAADVRNNRLPKVSWVLPGFLLSEHPAF
PVNVGAVAIVDALRILLSNPAVWEKTALIVNYDENGGFFDHVVPPTPPPGTPGEFVTVPDIDSVPGSGGIRGPIGLGFRV
PCLVISPYSRGPLMVHDTFDHTSTLKLIRARFGVPVPNLTAWRDATVGDMTSTFNFAAPPNPSKPNLDHPRLNALPKLPQ
CVPNAVLGTVTKTAIPYRVPFPQSMPTQETAPTRGIPSGLC
>P18954 ~~~phlB~~~Protein PhlB~~~
MPEGRRLRRALAIALLALVAVTGLLMMAKEQQMGQEISPFDGHSNLALAQAVARGDTQGIHAQATQDRLRERGDRQVTLL
QWAVLSQQPDSVQALLDLGADPAAAGLDGNSALHTAAMLQDAQYLRLLLAEGAQMNVRNAVTGATPLAAAVLAGREEQLR
LLLAAGADTTLSDRLGDTPLHLAAKINRRTWRCCCCRPGPMPGRATSRASRSSFTSRKRRRICRMTN
>P0C216 3.1.4.3~~~plc~~~Phospholipase C~~~
MKRKICKALICATLATSLWAGASTKVYAWDGKIDGTGTHAMIVTQGVSILENDLSKNEPESVRKNLEILKENMHELQLGS
TYPDYDKNAYDLYQDHFWDPDTDNNFSKDNSWYLAYSIPDTGESQIRKFSALARYEWQRGNYKQATFYLGEAMHYFGDID
TPYHPANVTAVDSAGHVKFETFAEERKEQYKINTAGCKTNEDFYADILKNKDFNAWSKEYARGFAKTGKSIYYSHASMSH
SWDDWDYAAKVTLANSQKGTAGYIYRFLHDVSEGNDPSVGKNVKELVAYISTSGEKDAGTDDYMYFGIKTKDGKTQEWEM
DNPGNDFMTGSKDTYTFKLKDENLKIDDIQNMWIRKRKYTAFPDAYKPENIKIIANGKVVVDKDINEWISGNSTYNIK
>P09598 3.1.4.3~~~plc~~~Phospholipase C~~~
MKKKVLALAAAITVVAPLQSVAFAHENDGGSKIKIVHRWSAEDKHKEGVNSHLWIVNRAIDIMSRNTTLVKQDRVAQLNE
WRTELENGIYAADYENPYYDNSTFASHFYDPDNGKTYIPFAKQAKETGAKYFKLAGESYKNKDMKQAFFYLGLSLHYLGD
VNQPMHAANFTNLSYPQGFHSKYENFVDTIKDNYKVTDGNGYWNWKGTNPEEWIHGAAVVAKQDYSGIVNDNTKDWFVKA
AVSQEYADKWRAEVTPMTGKRLMDAQRVTAGYIQLWFDTYGDR
>Q46150 3.1.4.3~~~plc~~~Phospholipase C~~~
MKKKFLKGLCCAFVISITCLGASSKAYGWDGKKDGTGTHSMIVTQAVKVLENDMSKDEPEIVKQNFKILQDNMHKFQLGS
TYPDYDPNAYKLFQDHFWDPDTDHNFSKDNLWYLSYSIKDTAESQVRKFTALARNEWEKGNYEKATWYFGQAMHYFGDLN
TPYHAANVTAVDSIGHTKYEGFAEKRKDQYRINTTGIKTNEGFYADALKNSNFDSWSKEYCKGWAKQAKNLYYSHSTMKH
TNEDWDYSASHALKNAQMGTAGCIYRFLYDVSKDLLPTENHKINGLMVVIKTANEIAAGTDDYVYFGIERKDGTVQEWTL
DNPGNDFEANQEDTYILKIKKPSIKFSDINRMWIRKANFTPVSDDWKVKGIKVIADGSVQYEKQINKWIHGNEKYYIN
>Q0TV31 3.1.4.3~~~plc~~~Phospholipase C~~~
MKRKICKALICAALATSLWAGASTKVYAWDGKIDGTGTHAMIVTQGVSILENDLSKNEPESVRKNLEILKENMHELQLGS
TYPDYDKNAYDLYQDHFWDPDTDNNFSKDNSWYLAYSIPDTGESQIRKFSALARYEWQRGNYKQATFYLGEAMHYFGDID
TPYHPANVTAVDSAGHVKFETFAEERKEQYKINTAGCKTNEAFYTDILKNKDFNAWSKEYARGFAKTGKSIYYSHASMSH
SWDDWDYAAKVTLANSQKGTAGYIYRFLHDVSEGNDPSVGKNVKELVAYISTSGEKDAGTDDYMYFGIKTKDGKTQEWEM
DNPGNDFMTGSKDTYTFKLKDENLKIDDIQNMWIRKRKYTAFSDAYKPENIKIIANGKVVVDKDINEWISGNSTYNIK
>Q9RF12 3.1.4.3~~~plc~~~Phospholipase C~~~
MKRKIYKLLICATIATSLWAVRTTKVYAWDGKADGTGTHAMIATQGVTILENDLSSNEPEVIRNNLEILKQNMHDLQLGS
TYPDYDKNAYDLYQDHFWDPDTDNNFTKDSKWYLSYSIPDTAESQIRKFSALARYEWKRGNYKQATFYLGEAMHYFGDAD
TPYHAANVTAVDSPGHVKFETFAEDRKDQYKINTTGSKTNDAFYSNILTNEDFNSWSKEFARSFAKTAKDLYYSHANMSC
SWDEWDYAAKVALANSQKGTSGYIYRFLHDVSDGKDSSANKNVNELVAYITTGGEKYAGTDDYMYFGIKTKDGQTQEWTM
DNPGNDFMTGSQDTYTFKLKDKNLKIDDIQNMWIRKSKYTEFGDDYKPANIKVIANGNVVLNKDINEWISGNSTYNIK
>Q9RLV9 3.1.4.12~~~smcL~~~Sphingomyelinase C~~~
MEKFKIIKTIPKICGAFIFLLFFTFLFGHYGELKTQASDEYPGNFKITSHNVYLFSRNIYPNWGQMHRADLIAQADYMKN
NDVVILNEAFDTSASHRLLNNLREMYPHQTPVIGRSKHGWDKTEGNYSNFALEDGGVAVVSQWPIVEKSQHIFQRGGGAD
RLSNKGFAYVKIMKNGKPYHIIGTHTPADDSLISKDTSRAIRAEQMQEIQTFIAKKNIPKDEIIFIGGDLNVNYGTDEYH
DMFKLLNVSSPANFNGQMATWDPTTNSMLKESYPKAAPEYLDYIFVENGHARPHSWHNKVLHTKSPQWSVKSWFKTYTYQ
DFSDHYPVVGFTDNN
>P33378 3.1.4.3~~~plcB~~~Phospholipase C~~~
MKFKKVVLGMCLIASVLVFPVTIKANACCDEYLQTPAAPHDIDSKLPHKLSWSADNPTNTDVNTHYWLFKQAEKILAKDV
NHMRANLMNELKKFDKQIAQGIYDADHKNPYYDTSTFLSHFYNPDRDNTYLPGFANAKITGAKYFNQSVTDYREGKFDTA
FYKLGLAIHYYTDISQPMHANNFTAISYPPGYHCAYENYVDTIKHNYQATEDMVAKRFCSDDVKDWLYENAKRAKADYPK
IVNAKTKKSYLVGNSEWKKDTVEPTGARLRDSQQTLAGFLEFWSKKTNE
>P9WIB0 3.1.4.3~~~plcC~~~Phospholipase C C~~~
MVSQGAFAGMSRRAFLAKAAGAGAAAVLTDWAAPVIEKAYGAGPCSGHLTDIEHIVLCLQENRSFDHYFGTLSAVDGFDT
PTPLFQQKGWNPETQALDPTGITLPYRINTTGGPNGVGECVNDPDHQWIAAHLSWNGGANDGWLPAQARTRSVANTPVVM
GYYARPDIPIHYLLADTFTICDQYFSSLLGGTMPNRLYWISATVNPDGDQGGPQIVEPAIQPKLTFTWRIMPQNLSDAGI
SWKVYNSKLLGGLNDTSLSRNGYVGSFKQAADPRSDLARYGIAPAYPWDFIRDVINNTLPQVSWVVPLTVESEHPSFPVA
VGAVTIVNLIRVLLRNPAVWEKTALIIAYDEHGGFFDHVTPLTAPEGTPGEWIPNSVDIDKVDGSGGIRGPIGLGFRVPC
FVISPYSRGGLMVHDRFDHTSQLQLIGKRFGVPVPNLTPWRASVTGDMTSAFNFAAPPDPSPPNLDHPVRQLPKVAKCVP
NVVLGFLNEGLPYRVPYPQTTPVQESGPARPIPSGIC
>P9WIB1 3.1.4.3~~~plcC~~~Phospholipase C C~~~COG3511
MVSQGAFAGMSRRAFLAKAAGAGAAAVLTDWAAPVIEKAYGAGPCSGHLTDIEHIVLCLQENRSFDHYFGTLSAVDGFDT
PTPLFQQKGWNPETQALDPTGITLPYRINTTGGPNGVGECVNDPDHQWIAAHLSWNGGANDGWLPAQARTRSVANTPVVM
GYYARPDIPIHYLLADTFTICDQYFSSLLGGTMPNRLYWISATVNPDGDQGGPQIVEPAIQPKLTFTWRIMPQNLSDAGI
SWKVYNSKLLGGLNDTSLSRNGYVGSFKQAADPRSDLARYGIAPAYPWDFIRDVINNTLPQVSWVVPLTVESEHPSFPVA
VGAVTIVNLIRVLLRNPAVWEKTALIIAYDEHGGFFDHVTPLTAPEGTPGEWIPNSVDIDKVDGSGGIRGPIGLGFRVPC
FVISPYSRGGLMVHDRFDHTSQLQLIGKRFGVPVPNLTPWRASVTGDMTSAFNFAAPPDPSPPNLDHPVRQLPKVAKCVP
NVVLGFLNEGLPYRVPYPQTTPVQESGPARPIPSGIC
>P20419 3.1.4.3~~~plc~~~Phospholipase C~~~
MKALKKVSNILCVLGLCTLMGGTSYAWDGKKDGTGTHSLIAEHGLSMLNNDLSGNESQQVKDNIKILNEYLGDLKLGSTY
PDYDPNAYDLYQDHFYDPDTGNNFTIDNSWYASYPIYDTSRNSVRKFATLAKNEWEKGNFKEATFLLGQGLHYLGDLNTP
YHASNVTAVDSPGHVKYETFVEERKDNYALNTSGNDTTSGVYKEAMENPSFNKWMTQNSIKYAKIAKDLYYSHSTMSHSW
DDWDYSGREAIKNSQVCTAGFLYRFMNEVSNGNTGDNDSLTNEFNIVLKTADNKYAGTDDNVYFGFETNEGKKFEWKLDN
AGNDFERNQVDNYILKTKDGEEVDINNISNYWIRKERLTSISDDWELSNFKLIANGKVIQQQDVNKVFTGNETYYINK
>Q2FWP1 3.1.4.3~~~hlb~~~Phospholipase C~~~COG3568
MVKKTKSNSLKKVATLALANLLLVGALTDNSAKAESKKDDTDLKLVSHNVYMLSTVLYPNWGQYKRADLIGQSSYIKNND
VVIFNEAFDNGASDKLLSNVKKEYPYQTPVLGRSQSGWDKTEGSYSSTVAEDGGVAIVSKYPIKEKIQHVFKSGCGFDND
SNKGFVYTKIEKNGKNVHVIGTHTQSEDSRCGAGHDRKIRAEQMKEISDFVKKKNIPKDETVYIGGDLNVNKGTPEFKDM
LKNLNVNDVLYAGHNSTWDPQSNSIAKYNYPNGKPEHLDYIFTDKDHKQPKQLVNEVVTEKPKPWDVYAFPYYYVYNDFS
DHYPIKAYSK
>P09978 3.1.4.3~~~hlb~~~Phospholipase C~~~
MVKKTKSNSLKKVATLALANLLLVGALTDNSAKAESKKDDTDLKLVSHNVYMLSTVLYPNWGQYKRADLIGQSSYIKNND
VVIFNEAFDNGASDKLLSNVKKEYPYQTPVLGRSQSGWDKTEGSYSSTVAEDGGVAIVSKYPIKEKIQHVFKSGCGFDND
SNKGFVYTKIEKNGKNVHVIGTHTQSEDSRCGAGHDRKIRAEQMKEISDFVKKKNIPKDETVYIGGDLNVNKGTPEFKDM
LKNLNVNDVLYAGHNSTWDPQSNSIAKYNYPNGKPEHLDYIFTDKDHKQPKQLVNEVVTEKPKPWDVYAFPYYYVYNDFS
DHYPIKAYSK
>P9WIA8 3.1.4.3~~~plcD~~~Phospholipase C D~~~
MSQSHIGGVSRREFLAKVAAGGAGALMSFAGPVIEKAYGAGPCSGHLTDIEHFVFFMQENRSFDHYFGTLSGTDGFNTVS
PLFQQKGWNPMTQALDATGVTMPYRFDTTRGPFLDGACVNDPDHSWVAMHESWNGGVNDNWLPAQAKTRSAAHTPTVMGY
YTRQDIPIHYLLADAFTVCDRYFCSVLGPTLPNRLYWLSATIDPDGQNGGPELQSPTFQPVRRFGWRIMPQNLSDAGVSW
KVYRNKTLGPISSVLTYGSLVTSFKQSADPRSDLVRFGVAPSYPASFAADVLANRLPRVSWVIPNVLESEHPAVPAAAGA
FAIVNILRILLANPAVWEKTALIVSYDENGGFFDHVVPATAPAGTPGEYVTVPDIDQVPGSGGIRGPIGLGFRVPCFVIS
PYSRGPQMVHDTFDHTSQLRLLETRFGVPVPNLTAWRRSVTGDMTSTFNFAVPPNSSWPNLDYPGLHALSTVPQCVPNAA
LGTINRGIPYRVPDPQIMPTQETTPTRGIPSGPC
>Q4K418 2.3.1.253~~~phlD~~~Phloroglucinol synthase~~~COG3424
MSTLCLPHVMFPQHKITQQQMVDHLENLHADHPRMALAKRMIANTEVNERHLVLPIDELAVHTGFTHRSIVYEREARQMS
SAAARQAIENAGLQISDIRMVIVTSCTGFMMPSLTAHLINDLALPTSTVQLPIAQLGCVAGAAAINRANDFARLDARNHV
LIVSLEFSSLCYQPDDTKLHAFISAALFGDAVSACVLRADDQAGGFKIKKTESYFLPKSEHYIKYDVKDTGFHFTLDKAV
MNSIKDVAPVMERLNYESFEQNCAHNDFFIFHTGGRKILDELVMHLDLASNRVSQSRSSLSEAGNIASVVVFDVLKRQFD
SNLNRGDIGLLAAFGPGFTAEMAVGEWTA
>Q4K423 3.7.1.24~~~phlG~~~2,4-diacetylphloroglucinol hydrolase~~~
MEARNMTPFTYFSLPMQKLFLRNQAAVRNKPYAKYFRSEMRVPLSAVRKIQQGPMALEDTLTPSIEDINRLLEPDFVSEE
SGYALLPGPMAYVQSRKFFPGCTAQMFKWWFIWHPAESERYTLWFPYAHVSNPCVHHQRLRDESLSFEERLYGNTFCASE
YVGDRLMHLHIDFQQPASLGLNTDLYREAKIDGSVSALMSLADHPEVPVSLMVHLFKEVPDGMYLTSRYWVGAHPSMARF
PGAEKAASLLKENGFGEAELETLAYEFAVHDMCEFNHLASFLPDLYREFGTPAA
>A0A2K9M484 3.7.1.24~~~phlG~~~2,4-diacetylphloroglucinol hydrolase~~~
MEARVMTPFTYFSLPMQKQFLANQQAVQGKPYAEFFRSKISVPLSAVEKIQQGPMPLADTLTPSIEDLNRMLAADFISEE
AGYALLPGPMAYVQSRKFFPNCTAEMLKWWFMWHPLEAERYTLWFPYAHVENPCVHHERLSDTTLSFEESLYGNTFCASE
YVGDRLMHLHIHFRDPCELGFCPDLYRESKIDGSVSALMSLAHEPQVPVSLMAHLFKECPEGLYLTSRYWVGSHPAMQRF
PGAERAAQLLEESGLGEVELETLAYEFAVHDMCEFNHLASILPSLHAQFSGAK
>A0A2C9EVE6 3.7.1.24~~~phlG~~~2,4-diacetylphloroglucinol hydrolase~~~
MAAICQFTPKDISMEARNMTPFTYFSLPMQKLFLRNQAAVRNKPYAKYFRSEMRVPLSAVRKIQQGPMALEDTLTPSIED
INRLLEPDFVSEESGYALLPGPMAYVQSRKFFPGCTAQMFKWWFIWHPAESERYTLWFPYAHVSNPCVHHQRLCDESLSF
EERLYGNTFCASEYVGDRLMHLHIDFQQPASLGLNTDLYREAKIDGSVSALMSLADHPEVPVSLMVHLFKEVPDGMYLTS
RYWVGAHPSMARFPGAEKAASLLKENGFGEAELETLAYEFAVHDMCEFNHLASFLPDLYREFGTPAA
>F7J5X9 3.7.1.24~~~phlG~~~2,4-diacetylphloroglucinol hydrolase~~~
MEARNMTPFTYFSLPMQKLFLRNQAAVRNKPYAKYFRTEMRVPLSAVRKIQQGPMALEDTLTPSIEDINRLLEPDFVSEE
SGYALLPGPMAYVQSRKFFPGCTAQMFKWWFIWHPAESERYTLWFPYAHVSNPCVHHQRLCDESLSFEERLYGNTFCASE
YVGDRLMHLHIDFQQPASLGLNTDLYREAKIDGSVSALMSLADHPEVPVSLMVHLFKEVPGGMYLTSRYWVGAHPSMARF
PGAEKAASLLKENGFGEAELETLAYEFAVHDMCEFNHLASFLPDLYREFGTPAA
>Q9RGS8 3.1.4.3~~~plcN~~~Non-hemolytic phospholipase C~~~COG3511
MTNQNRRDFLRLAAGTAGAAALQLFPPVIREALAIPANRRTGTIRDVEHIVILMQENRSFDHYFGKLRGVRGFGDPRPLA
LQNGKSVFHQPVLLGPAELLPFHPDASNLGMQFLQDLPHGWQDMHGAWNKGRYDRWIANKGTTTMAYLERDDIPFHYQLA
DAFTICDAYHCSIPSSTDPNRYYMWTGYVGNDGAGGGPVLGNEEAGYGWSTYPETLEQAGVSWKIYQDIGTGLDAAGSWG
WTQNPYIGNYGDNSLLYFNQYRNAQPGSPLYDKARTGTNVSAGGTLFDVLQQDVKNGTLPQVSWICAPEAYSEHPNWPAN
YGAWYVEQVLKALTSNPDVWSKTALFITYDENDGFFDHVAPPFAPQSRENGLSTVSTAGEIFAGDATHMAGPYGLGPRVP
MLVVSPWTKGGWVCSQTFDHTSLLQFIEARFNDRYSVRAPNVTPWRRAVCGDLTSAFNFSSPDGSWPQLPDTSGYAPPDR
NRHPSYVPVPPAAQSMPKQEAGLRAARALPYELFVLGRIDQSTGKFKLTFANTGRAGAAFQVTAGNRLDGPWAYTVEARK
RLSDEWSTALTLSIYDLTVYGPNGFLCQFRGSTAAALGLSANPEVIYGYDVANGNITLRLSNRGRAAVRLTVTNAYGNAA
PRVYELKPGQRINDYWDLRDSHSWYDLSVSDGAPNGFLRRFAGHVETGRPSTSDPLIATA
>P9WIN3 2.1.1.-~~~~~~Phthiotriol/phenolphthiotriol dimycocerosates methyltransferase~~~COG2226
MAFSRTHSLLARAGSTSTYKRVWRYWYPLMTRGLGNDEIVFINWAYEEDPPMDLPLEASDEPNRAHINLYHRTATQVDLG
GKQVLEVSCGHGGGASYLTRTLHPASYTGLDLNQAGIKLCKKRHRLPGLDFVRGDAENLPFDDESFDVVLNVEASHCYPH
FRRFLAEVVRVLRPGGYFPYADLRPNNEIAAWEADLAATPLRQLSQRQINAEVLRGIGNNSQKSRDLVDRHLPAFLRFAG
REFIGVQGTQLSRYLEGGELSYRMYCFTKD
>P09785 4.1.3.27~~~phnA~~~Anthranilate synthase component 1, pyocyanine specific~~~
MGARRWLVSGVGYRLEESLEYRTLVPEALSIWRMAGANRMLFDCFDVDSKAARRSVAILSSCLRIECWGRDVVLRALNSN
GRALLAPLSEDCPAQVTCLRDGDTLHWRFPQEESHADEWRRLHGLSSLEALRRVLGTLGDAEGPVLLGGLFSFDLAEQFE
PLPAPAEPARHCPDYLFLVPELLLDIDHLARRTSLQAFVHDPAGHDRLAASLRQCADEFHGAVEEASESPVAGVRAGNYQ
VDLDDASFARQVERLQAHVRAGDVFQIVPSRSFSMPCADPWRAYRQLCLRNPSPYRFFLDAGDFCLFGASPESALKYDAE
SREVELYPIAGTRPRGRDARGAIDAELDNRLEAELRLDAKEIAEHMMLVDLARNDLARVCRSGTRQVRDMLKVDRYSHVM
HLVSRVAGELHGELDALHAYRACLNMGTLVGAPKVRAMQLLRQYEDGYRGSYGGAIGILDSAGNLDTSIVIRSAEVREGI
ARVRAGAGVVLDSDPRLEAEETRNKALAVLTAVAAAERERGERDAHHAVG
>P09786 4.1.3.27~~~phnB~~~Anthranilate synthase component 2, pyocyanine specific~~~
MRITLLDNFDSFTYNLVEQFCLLGAEVRVMRNDTPLPTIQAALLADGCELLVLSPGPGRPEDAGCMLELLAWARGRLPVL
GVCLGHQALALAAGGAVGEARKPLHGKSTSLRFDQRHPLFDGIADLRVARYHSLVVSRLPEGFDCLADADGEIMAMADPR
NRQLGLQFHPESILTTHGQRLLENALLWCGALAVRERLRA
>A0QQ70 7.3.2.1~~~phnC~~~Phosphate-import ATP-binding protein PhnC~~~COG3638
MNPVAGDDVVVIARDVTKRFGDTLALDHVSLDVHRSELLVLLGLSGSGKSTLLRCLNGLHPVTSGTVDVGGTRVDQASGA
QLRALRRRVGFVFQHFNLVGRLSCLENVLIGGLGRLRLPRYGALTYPRHMRAEALAHLDRVGLADYADRRADTLSGGQQQ
RVAIARTLMQKPALLLADEPVASLDPENAGVVMDLLFRVCIEEKLTVVCTLHQVDLALGWAHRLVGLQGGRKVLDRPAVG
MTRDDVMAVYQRVEPAVTPARRV
>A3PC74 ~~~phnD1~~~Probable ABC transporter phosphite binding protein PhnD1~~~COG3221
MFNLKYFLVSSSLLFSVFSSPVFSNPKVLKVGAIPDQNQDVLDKRFNLFSKELSKQLDVEVKYIPVINYIAAVTGFRTKD
LDLVWFGGLSGVQARLQTPNSIVIAQRDIDKEFKSVFVVNKNLELNSISNIKGLKKLKNLRFTFGSENSTSGRLMPEYFL
NQAGVEIKHFKGKKAGFSGSHDATIALVNSGAFDAGALNKQVWENNLKNNPKRTSNLELFWITPEYVDYHWVAQGDLENR
FGEGFTKELKSVILNLDIKQKSHKQILDMFNAKRFIKAESKQYKNIEEIGRKLNKIR
>A3PDP9 ~~~phnD2~~~Probable ABC transporter phosphonate/phosphite binding protein PhnD2~~~COG3221
MKLKSLLSVFTISIVALTSACSTKNAGPSADPDKLIVALIPDENAATVIQDNQGLKDYLTEAFDKEIELVVTTDYSSMIE
AARNDRLDLAYFGPLSYVLAKAVSDIEPFAARIKGGTKTYNSCIIGNTKKGVTSFDDIKGTTFALGDPASTSSRLFPELT
LAENGLTKGKDFQGVFLGSHDAVALAVQNGNAQAGGMACPILKSLKKKGVIDPSKVTTIAQSSPIPQYPWTMRSTLSPEL
KEKIRFTFLDLDSDKVLKPFNADGFASITDSDYDGIRKAGKLLGLDLSKFVK
>A0QQ71 ~~~phnD~~~Phosphate-import protein PhnD~~~COG3221
MKIRAHHKIATAAACVALLASACSGSDKPQSTTAEGFPETITLAAIPAENSSDLKASYDPLIKMLEKQTGSKVEFVQASD
YAGVVEGMIAGNVDLAFFGPFAYVVAGVNGAKMTPLGAVIKDEGGAPGYQSYGLARADEDNINGLKDFAGKKVCFVDPGS
TSGFLYPTAGLIEEGVVKSGSEADISAAMSPIFAGGHDSSALAIANGDCDAGFAFDTMVDKTMIDKGDLKPGQLKTVWKS
DMIAGSVFAANDALGPEVIDKLKTMFAQDANVKSFEEEGFCEGDACRITDERAWGVVPVTDADYDGVRHVCDVTGSEKCK
G
>A0QQ68 ~~~phnE~~~Phosphate-import permease protein PhnE~~~COG3639
MTTEITRPPAPPSRPSESRKPSLPGLLHLVAIAAVLATIVSAWAIDFVPTALIDGSDNIVALLQRMIPPRLDDPARIGML
AVETLLMAVLGTTLAAIASVPLAFLAARNTTPHPAVQAVARAVITFCRAMPDLLFAVLFVRALGIGVLPGVLALALHSIG
MLGKVFADAIEQTDAGPREAVRSTGVGYFRELLNAVVPQVVPSWIAMFVYRIDINLRMSVVLGFVGAGGIGFALQDALRG
LIYPRALGIVCVILVIIAGMELLAIAIRRILLDPSRSNPLRDRIARFGLSGVLVGSCVAAFVLLKINPLALFTWVFPSVG
IFTRMVPPNFDALGVDLFTAAAQTVAIGVVATAIGIALSIPAGILAARNVSPHPALYWPARAWILVVRAVPELILAVVFV
AALGLGPIAGTCALAIGSIGFLAKLVADAVEEIDPGPMEAVRSVGGGWWKTLFAAVLPQSMPALVGSSLYLFDVNVRTST
ILGIVGAGGVGYLLFESIRTLNFDVAGAIVIVIFVIVYAIERLSGWIRSRLV
>P16684 ~~~phnF~~~Probable transcriptional regulator PhnF~~~COG2188
MHLSTHPTSYPTRYQEIAAKLEQELRQHYRCGDYLPAEQQLAARFEVNRHTLRRAIDQLVEKGWVQRRQGVGVLVLMRPF
DYPLNAQARFSQNLLDQGSHPTSEKLLSVLRPASGHVADALGITEGENVIHLRTLRRVNGVALCLIDHYFADLTLWPTLQ
RFDSGSLHDFLREQTGIALRRSQTRISARRAQAKECQRLEIPNMSPLLCVRTLNHRDGESSPAEYSVSLTRADMIEFTME
H
>A0QQ72 ~~~phnF~~~HTH-type transcriptional repressor PhnF~~~COG2188
MTAGAAPRILKHQVVRAELDRMLDGMRIGDPFPAEREIAEQFEVARETVRQALRELLIDGRVERRGRTTVVARPKIRQPL
GMGSYTEAAKAQGLSAGRILVAWSDLTADEVLAGVLGVDVGAPVLQLERVLTTDGVRVGLETTKLPAQRYPGLRETFDHE
ASLYAEIRSRGIAFTRTVDTIDTALPDAREAALLGADARTPMFLLNRVSYDQDDVAIEQRRSLYRGDRMTFTAVMHAKNS
AIVS
>P16685 2.7.8.37~~~phnG~~~Alpha-D-ribose 1-methylphosphonate 5-triphosphate synthase subunit PhnG~~~COG3624
MHADTATRQHWMSVLAHSQPAELAARLNALNITADYEVIRAAETGLVQIQARMGGTGERFFAGDATLTRAAVRLTDGTLG
YSWVQGRDKQHAERCALIDALMQQSRHFQNLSETLIAPLDADRMARIAARQAEVNASRVDFFTMVRGDNA
>Q51782 3.11.1.2~~~phnA~~~Phosphonoacetate hydrolase~~~
MTQLISVNSRSYRLSSAPTIVICVDGCEQEYINQAIQAGQAPFLAELTGFGTVLTGDCVVPSFTNPNNLSIVTGAPPSVH
GICGNFFFDQETQEEVLMNDAKYLRAPTILAEMAKAGQLVAVVTAKDKLRNLLGHQLKGICFSAEKADQVNLEEHGVENI
LARVGMPVPSVYSADLSEFVFAAGLSLLTNERPDFMYLSTTDYVQHKHAPGTPEANAFYAMMDSYFKRYHEQGAIVAITA
DHGMNAKTDAIGRPNILFLQDLLDAQYGAQRTRVLLPITDPYVVHHGALGSYATVYLRDAVPQRDAIDFLAGIAGVEAVL
TRSQACQRFELPEDRIGDLVVLGERLTVLGSAADKHDLSGLTVPLRSHGGVSEQKVPLIFNRKLVGLDSPGRLRNFDIID
LALNHLA
>P16686 2.7.8.37~~~phnH~~~Alpha-D-ribose 1-methylphosphonate 5-triphosphate synthase subunit PhnH~~~COG3625
MTLETAFMLPVQDAQHSFRRLLKAMSEPGVIVALHQLKRGWQPLNIATTSVLLTLADNDTPVWLSTPLNNDIVNQSLRFH
TNAPLVSQPEQATFAVTDEAISSEQLNALSTGTAVAPEAGATLILQVASLSGGRMLRLTGAGIAEERMIAPQLPECILHE
LTERPHPFPLGIDLILTCGERLLAIPRTTHVEVC
>P16687 2.7.8.37~~~phnI~~~Alpha-D-ribose 1-methylphosphonate 5-triphosphate synthase subunit PhnI~~~COG3626
MYVAVKGGEKAIDAAHALQESRRRGDTDLPELSVAQIEQQLNLAVDRVMTEGGIADRELAALALKQASGDNVEAIFLLRA
YRTTLAKLAVSEPLDTTGMRLERRISAVYKDIPGGQLLGPTYDYTHRLLDFTLLANGEAPTLTTADSEQQPSPHVFSLLA
RQGLAKFEEDSGAQPDDITRTPPVYPCSRSSRLQQLMRGDEGYLLALAYSTQRGYGRNHPFAGEIRSGYIDVSIVPEELG
FAVNVGELLMTECEMVNGFIDPPGEPPHFTRGYGLVFGMSERKAMAMALVDRALQAPEYGEHATGPAQDEEFVLAHADNV
EAAGFVSHLKLPHYVDFQAELELLKRLQQEQNHG
>P16688 4.7.1.1~~~phnJ~~~Alpha-D-ribose 1-methylphosphonate 5-phosphate C-P lyase~~~COG3627
MANLSGYNFAYLDEQTKRMIRRAILKAVAIPGYQVPFGGREMPMPYGWGTGGIQLTASVIGESDVLKVIDQGADDTTNAV
SIRNFFKRVTGVNTTERTDDATVIQTRHRIPETPLTEDQIIIFQVPIPEPLRFIEPRETETRTMHALEEYGVMQVKLYED
IARFGHIATTYAYPVKVNGRYVMDPSPIPKFDNPKMDMMPALQLFGAGREKRIYAVPPFTRVESLDFDDHPFTVQQWDEP
CAICGSTHSYLDEVVLDDAGNRMFVCSDTDYCRQQSEAKNQ
>P16678 ~~~phnK~~~Putative phosphonates utilization ATP-binding protein PhnK~~~COG4107
MNQPLLSVNNLTHLYAPGKGFSDVSFDLWPGEVLGIVGESGSGKTTLLKSISARLTPQQGEIHYENRSLYAMSEADRRRL
LRTEWGVVHQHPLDGLRRQVSAGGNIGERLMATGARHYGDIRATAQKWLEEVEIPANRIDDLPTTFSGGMQQRLQIARNL
VTHPKLVFMDEPTGGLDVSVQARLLDLLRGLVVELNLAVVIVTHDLGVARLLADRLLVMKQGQVVESGLTDRVLDDPHHP
YTQLLVSSVLQN
>P21852 1.12.2.1~~~hydB~~~Periplasmic [NiFe] hydrogenase large subunit~~~COG0374
MSGCRAQNAPGGIPVTPKSSYSGPIVVDPVTRIEGHLRIEVEVENGKVKNAYSSSTLFRGLEIILKGRDPRDAQHFTQRT
CGVCTYTHALASTRCVDNAVGVHIPKNATYIRNLVLGAQYLHDHIVHFYHLHALDFVDVTAALKADPAKAAKVASSISPR
KTTAADLKAVQDKLKTFVESGQLGPFTNAYFLGGHPAYYLDPETNLIATAHYLEALRLQVKAARAMAVFGAKNPHTQFTV
VGGVTCYDALTPQRIAEFEALWKETKAFVDEVYIPDLLVVAAAYKDWTQYGGTDNFITFGEFPKDEYDLNSRFFKPGVVF
KRDFKNIKPFDKMQIEEHVRHSWYEGAEARHPWKGQTQPKYTDLHGDDRYSWMKAPRYMGEPMETGPLAQVLIAYSQGHP
KVKAVTDAVLAKLGVGPEALFSTLGRTAARGIETAVIAEYVGVMLQEYKDNIAKGDNVICAPWEMPKQAEGVGFVNAPRG
GLSHWIRIEDGKIGNFQLVVPSTWTLGPRCDKNKLSPVEASLIGTPVADAKRPVEILRTVHSFDPCIACGVHVIDGHTNE
VHKFRIL
>P16679 2.7.8.37~~~phnL~~~Alpha-D-ribose 1-methylphosphonate 5-triphosphate synthase subunit PhnL~~~COG4778
MINVQNVSKTFILHQQNGVRLPVLNRASLTVNAGECVVLHGHSGSGKSTLLRSLYANYLPDEGQIQIKHGDEWVDLVTAP
ARKVVEIRKTTVGWVSQFLRVIPRISALEVVMQPLLDTGVPREACAAKAARLLTRLNVPERLWHLAPSTFSGGEQQRVNI
ARGFIVDYPILLLDEPTASLDAKNSAAVVELIREAKTRGAAIVGIFHDEAVRNDVADRLHPMGASS
>P12944 1.12.2.1~~~hydB~~~Periplasmic [NiFe] hydrogenase large subunit~~~
MSEMQGNKIVVDPITRIEGHLRIEVEVEGGKIKNAWSMSTLFRGLEMILKGRDPRDAQHFTQRACGVCTYVHALASVRAV
DNCVGVKIPENATLMRNLTMGAQYMHDHLVHFYHLHALDWVNVANALNADPAKAARLANDLSPRKTTTESLKAVQAKVKA
LVESGQLGIFTNAYFLGGHPAYVLPAEVDLIATAHYLEALRVQVKAARAMAIFGAKNPHTQFTVVGGCTNYDSLRPERIA
EFRKLYKEVREFIEQVYITDLLAVAGFYKNWAGIGKTSNFLTCGEFPTDEYDLNSRYTPQGVIWGNDLSKVDDFNPDLIE
EHVKYSWYEGADAHHPYKGVTKPKWTEFHGEDRYSWMKAPRYKGEAFEVGPLASVLVAYAKKHEPTVKAVDLVLKTLGVG
PEALFSTLGRTAARGIQCLTAAQEVEVWLDKLEANVKAGKDDLYTDWQYPTESQGVGFVNAPRGMLSHWIVQRGGKIENF
QHVVPSTWNLGPRCAERKLSAVEQALIGTPIADPKRPVEILRTVHSYDPCIACGVHVIDPESNQVHKFRIL
>P18188 1.12.2.1~~~hydB~~~Periplasmic [NiFe] hydrogenase large subunit~~~
MAESKPTPQSTFTGPIVVDPITRIEGHLRIMVEVENGKVKDAWSSSQLFRGLEIILKGRDPRDAQHFTQRACGVCTYVHA
LASSRCVDDAVKVSIPANARMMRNLVMASQYLHDHLVHFYHLHALDWVDVTAALKADPNKAAKLAASIAPARPGNSAKAL
KAVQDKLKAFVESGQLGIFTNAYFLGGHKAYYLPPEVDLIATAHYLEALHMQVKAASAMAILGGKNPHTQFTVVGGCSNY
QGLTKDPLANYLALSKEVCQFVNECYIPDLLAVAGFYKDWGGIGGTSNYLAFGEFATDDSSPEKHLATSQFPSGVITGRD
LGKVDNVDLGAIYEDVKYSWYAPGGDGKHPYDGVTDPKYTKLDDKDHYSWMKAPRYKGKAMEVGPLARTFIAYAKGQPDF
KKVVDMVLGKLSVPATALHSTLGRTAARGIETAIVCANMEKWIKEMADSGAKDNTLCAKWEMPEESKGVGLADAPRGALS
HWIRIKGKKIDNFQLVVPSTWNLGPRGAQGDKSPVEEALIGTPIADPKRPVEILRTVHAFDPCIACGVHVIEPETNEILK
FKVC
>P16689 3.6.1.63~~~phnM~~~Alpha-D-ribose 1-methylphosphonate 5-triphosphate diphosphatase~~~COG3454
MIINNVKLVLENEVVSGSLEVQNGEIRAFAESQSRLPEAMDGEGGWLLPGLIELHTDNLDKFFTPRPKVDWPAHSAMSSH
DALMVASGITTVLDAVAIGDVRDGGDRLENLEKMINAIEETQKRGVNRAEHRLHLRCELPHHTTLPLFEKLVQREPVTLV
SLMDHSPGQRQFANREKYREYYQGKYSLTDAQMQQYEEEQLALAARWSQPNRESIAALCRARKIALASHDDATHAHVAES
HQLGSVIAEFPTTFEAAEASRKHGMNVLMGAPNIVRGGSHSGNVAASELAQLGLLDILSSDYYPASLLDAAFRVADDQSN
RFTLPQAVKLVTKNPAQALNLQDRGVIGEGKRADLVLAHRKDNHIHIDHVWRQGKRVF
>P16690 2.7.4.23~~~phnN~~~Ribose 1,5-bisphosphate phosphokinase PhnN~~~COG3709
MMGKLIWLMGPSGSGKDSLLAELRLREQTQLLVAHRYITRDASAGSENHIALSEQEFFTRAGQNLLALSWHANGLYYGVG
VEIDLWLHAGFDVLVNGSRAHLPQARARYQSALLPVCLQVSPEILRQRLENRGRENASEINARLARAARYTPQDCHTLNN
DGSLRQSVDTLLTLIHQKEKHHACL
>P16691 2.3.1.280~~~phnO~~~Aminoalkylphosphonate N-acetyltransferase~~~COG0456
MPACELRPATQYDTDAVYALICELKQAEFDHHAFRVGFNANLRDPNMRYHLALLDGEVVGMIGLHLQFHLHHVNWIGEIQ
ELVVMPQARGLNVGSKLLAWAEEEARQAGAEMTELSTNVKRHDAHRFYLREGYEQSHFRFTKAL
>Q8ZKE6 2.3.1.280~~~phnO~~~Aminoalkylphosphonate N-acetyltransferase~~~
MILMRRREDVMPVCELRHATTEDTDSVYALICELLKNELDYQAFRDGFAANLLDPNVHYRLALRNGEVVGMISLHMQFHL
HHANWIGEIQELVVLPPMRGQKIGSQLLAWAEEEARQAGAELTELSTNIKRRDAHRFYLREGYKQSHFRFTKAL
>C8WJZ5 3.1.4.57~~~phnPP~~~Phosphoribosyl 1,2-cyclic phosphate 1,2-diphosphodiesterase~~~COG0613
MIEDLHVHSTMSDGSDTFEQVLEQAAQRGVERLAFTNHDTTAGLTAARELGERLGVQVVGGIEVSAYDFERGRKVHILGL
GVEEGAPALAALCGSTLERRHANSLWQLDRLVEAGYEVDVERALELGRASTCLYKQHLMAALTSEPYPSAAYRTLYRSLF
KNGGICDRDIDYVDARDAVRVVVEDGGLAVLAHPGQLDSYDLLPDLVECGLGGIERFHPDHTLADHARCAELAVRYRLVC
TGGSDYHGKFGRVPHVGFRVPA
>P16692 3.1.4.55~~~phnP~~~Phosphoribosyl 1,2-cyclic phosphate phosphodiesterase~~~COG1235
MSLTLTLTGTGGAQGVPAWGCECAACARARRSPQYRRQPCSGVVKFNDAITLIDAGLHDLADRWSPGSFQQFLLTHYHMD
HVQGLFPLRWGVGDPIPVYGPPDEQGCDDLFKHPGLLDFSHTVEPFVVFDLQGLQVTPLPLNHSKLTFGYLLETAHSRVA
WLSDTAGLPEKTLKFLRNNQPQVMVMDCSHPPRADAPRNHCDLNTVLALNQVIRSPRVILTHISHQFDAWLMENALPSGF
EVGFDGMEIGVA
>Q7CR30 ~~~phnR~~~Putative transcriptional regulator of 2-aminoethylphosphonate degradation operons~~~
MKSIPGDIPQYLLIKAQLQARIQSGALKSGDKLPSERELCAIFNTTRITIRESLAQLESSGLIWRADRRGWFVTPERLWL
DPTQNTNFHKLCREQGREPKTALLSGVLTTVPVEVMEPLQLQPFDQIYLLTRLRYADGRAVCYCENHCLPARVPELLQYD
LNGSLTEVYESHYNLVYTSMHLSFYPTAMPAQAAQALGVMEGRPALLLRRLNYDQHGRVLDLDIEYWRHDSLRIEVDTH
>Q06173 1.12.2.1~~~hynB1~~~Periplasmic [NiFe] hydrogenase small subunit 1~~~COG1740
MRFSVGLGKEGAEERLARRGVSRRDFLKFCTAIAVTMGMGPAFAPEVARALTGSRRPSVVYLHNAECTGCSESVLRAFQP
YLDELILDTISLDYHETIMAAAGDAAEAALHQAVANPDGFICIVEGAIPTADNGIYGKVANHTMLSICSDIVPKAKAVIA
YGTCATFGGVQAAKPNPTGAKGLNDALKHLGVNAINLAGCPPNPYNLVGTLVYYLKNNAAPEMDEFNRPLMFFGQSVHDN
CPRLKHFDAGEFAPSFESEEARKGWCLYELGCKGPSTMNNCPKIKFNQTNWPVEAGHPCIGCSEPDFWDEKSPFYES
>P13061 1.12.2.1~~~hydA~~~Periplasmic [NiFe] hydrogenase small subunit~~~COG1740
MRIAVGLGKEGGEERLERQGISRRDFMKFCTAVAVAMGMGPAFATDVAAALTGRRPSVVYLHAAECTGCSEALLRTYQPF
IDTLILDTISLDYHETIMAAAGEAAEEALQAAVNGPDGFICLVEGAIPTGMDNKYGYIAGHTMYDICKNILPKAKAVVSI
GTCACYGGIQAAKPNPTAAKGINDCYADLGVKAINVPGCPPNPLNMVGTLVAFLKGQKIELDEVGRPVMFFGQSVHDLCE
RRKHFDAGEFAPSFNSEEARKGWCLYDVGCKGPETYNNCPKVLFNETNWPVAAGHPCIGCSEPNFWDDMTPFYQN
>P21853 1.12.2.1~~~hydA~~~Periplasmic [NiFe] hydrogenase small subunit~~~COG1740
MKISIGLGKEGVEERLAERGVSRRDFLKFCTAIAVTMGMGPAFAPEVARALMGPRRPSVVYLHNAECTGCSESVLRAFEP
YIDTLILDTLSLDYHETIMAAAGDAAEAALEQAVNSPHGFIAVVEGGIPTAANGIYGKVANHTMLDICSRILPKAQAVIA
YGTCATFGGVQAAKPNPTGAKGVNDALKHLGVKAINIAGCPPNPYNLVGTIVYYLKNKAAPELDSLNRPTMFFGQTVHEQ
CPRLPHFDAGEFAPSFESEEARKGWCLYELGCKGPVTMNNCPKIKFNQTNWPVDAGHPCIGCSEPDFWDAMTPFYQN
>P12943 1.12.2.1~~~hydA~~~Periplasmic [NiFe] hydrogenase small subunit~~~
MKFCTAVAVAMGMGPAFAPKVAEALTAKKRPSVVYLHNAECTGCSESLLRTVDPYVDELILDVISMDYHETLMAGAGHAV
EEALHEAIKGDFVCVIEGGIPMGDGGYWGKVGRRNMYDICAEVAPKAKAVIAIGTCATYGGVQAAKPNPTGTVGVNEALG
KLGVKAINIAGCPPNPMNFVGTVVHLLTKGMPELDKQGRPVMFFGETVHDNCPRLKHFEAGEFATSFGSPEAKKGYCLYE
LGCKGPDTYNNCPKQLFNQVNWPVQAGHPCIACSEPNFWDLYSPFYSA
>P96062 ~~~phnS~~~Putative 2-aminoethylphosphonate-binding periplasmic protein~~~
MKLSRLALLSVFALASAPSWAESVVTVYSIDGLHDGDNSWYQVQFDAFTKATGITVRYVEGGGGVVVERLAKERTNPQAD
VLVTAPPFIQRAAAEKLLANFNTDTASAIPDANNLYSPLVKNYLSFIYNSKLLKTAPASWQDLLDGKFKNKLQYSTPGQA
ADGTAVMLQAFHSFGSKDAGFAYLGKLQANNVGPSASTGKLTALVNKGEIYVANGDLQMNLAQMERNPNVKIFWPANDKG
ERSALAIPYVIGLVQGAPQSENGKKLINFLLSKEAQTRVSELSWGMPVRSDVTPSDEHYKAATAALEGVQSWQPNWDDVA
VSLSADISRWHKVTESE
>P18187 1.12.2.1~~~hydA~~~Periplasmic [NiFe] hydrogenase small subunit~~~
MNFSVGLGRDDAEKRLVQNGVSRRDFMKFCATVAAAMGMGPAFAPKVAEALTAKHRPSVVWLHNAECTGCTEAAIRTIKP
YIDALILDTISLDYQETIMAAAGEAAEAALHQALEGKDGYYLVVEGGLPTIDGGQWGMVAGHPMIETTKKAAAKAKGIIC
IGTCSAYGGVQKAKPNPSQAKGVSEALGVKTINIPGCPPNPINFVGAVVHVLTKGIPDLDENGRPKLFYGELVHDNCPRL
PHFEASEFAPSFDSEEAKKGFCLYELGCKGPVTYNNCPKVLFNQVNWPVQAGHPCLGCSEPDFWDTMTPFYEQG
>P96063 ~~~phnT~~~Putative 2-aminoethylphosphonate import ATP-binding protein PhnT~~~
MLMKTTTVHAPASQGTSGIVLDSLRVAYHGNVVLKPLSLTIEPGEVLALIGPSGSGKTTVLRAVAGFVQPAGGRILIGDT
DVTHLPPYKRGLAMVVQNYALFPHLKVEDNVAFGLRAQKQPKALINERVTQALKTVGMSDYAARYPHQLSGGQQQRVAIA
RAIAVRPRVLLLDEPLSALDAQIRHNMVEEIARLHRELPELTILYVTHDQTEALTLADKIGIMKDGSLIAHGETRALYQH
PPNRFAAEFLGRANILSAIALGITEAPGLVDVSCGGAVIRAFSRGSHHGYNKLLCIRPQHLSLTPRSAYSNRFNATLQSV
HWQGDLTHLLCDVAGETVRMVLTHVNPLPRVGDKLALWFEPDDAVLIEV
>P96064 ~~~phnU~~~Putative 2-aminoethylphosphonate transport system permease protein PhnU~~~
MSLILPLEKPALNLRPLLWLLLPLLVLATLFFWPLSLIVEQALRGANGEIGLETFRQVVDSKRFVGALLNTLQIAFFATA
GCLLLGSVMSLILVFIPFPGSELIGRVVDTFIALPTFLITLAFTFIYGSAGLLNGTLMSLFAFELPPVDFLYSMQGVILA
EITVFTPLVMRPLMAALRQIDKSQLEAASILGAHPLRVIGQVIFPAALPALMAGGSLCLLLTTNEFGIVLFIGAKGVNTL
PMMVYSKAILESDYTVACMIALINIVLSLGLFSLYRLAASRTGVRS
>P96065 ~~~phnV~~~Putative 2-aminoethylphosphonate transport system permease protein PhnV~~~
MLIWSPKGRAAAGVVASVLFIVFFFLPLAVILMSSLSQQWNGILPSGFTLNHFVNALHGAAWDALLASLTIGFCASLFAL
LCGVWAALALRQYGVKTQKWLSMVFYLPSAIPSVSVGLGILVAFSQGPLQMNGTLWIVLTAHFVLISAFTFSNVSTGLAR
ISADIENVASSLGASPWYRLRHVTLPLLMPWMMSALALSLSLSMGELGATMMIYPPGWTTLPVAIFSLTDRGNIADGAAL
TIVLVAITLLLMMKLERIAKRLGQK
>Q9I434 2.6.1.37~~~phnW~~~2-aminoethylphosphonate--pyruvate transaminase~~~
MSTAERAPILLTPGPLTTSYRTRRAMMVDWGSWDSDFNELTASVCQRLLKIVGGEGSHTCVPLQGSGTFAVEAAIGTLVP
RDGKVLVLINGAYGKRLAKICEVLQRPFSTLETEENVPTTAADVERLLAADPAISHVALIHCETSTGILNPLEAIAKVVE
RHGKRLIVDAMSSFGAIGIDARKVPFDALIAASGKCLEGVPGMGFVFARSAALEASAGNCHSLAMDLQDQHAYMRKTGQW
RFTPPTHVVAALHEALSQYEEEGGLPARQRRYASNCETLLGEMARLGFRSFLPAEIQAPIIVTFHAPRDPRYRFADFYQR
VREKGFILYPGKLTQVETFRVGCIGHVDAAEMRQAVAAIGEALRELEVLEI
>P96060 2.6.1.37~~~phnW~~~2-aminoethylphosphonate--pyruvate transaminase~~~
MTSRNYLLLTPGPLTTSRTVKEAMLFDSCTWDDDYNIGVVEQIRQQLTALATASEGYTSVLLQGSGSYAVEAVLGSALGP
QDKVLIVSNGAYGARMVEMAGLMGIAHHAYDCGEVARPDVQAIDAILNADPTISHIAMVHSETTTGMLNPIDEVGALAHR
YGKTYIVDAMSSFGGIPMDIAALHIDYLISSANKCIQGVPGFAFVIAREQKLAACKGHSRSLSLDLYAQWRCMEDNHGKW
RFTSPTHTVLAFAQALKELAKEGGVAARHQRYQQNQRSLVAGMRALGFNTLLDDELHSPIITAFYSPEDPQYRFSEFYRR
LKEQGFVIYPGKVSQSDCFRIGNIGEVYAADITALLTAIRTAMYWTK
>O31156 3.11.1.1~~~phnX~~~Phosphonoacetaldehyde hydrolase~~~
MKIEAVIFDWAGTTVDYGCFAPLEVFMEIFHKRGVAITAEEARKPMGLLKIDHVRALTEMPRIASEWNRVFRQLPTEADI
QEMYEEFEEILFAILPRYASPINGVKEVIASLRERGIKIGSTTGYTREMMDIVAKEAALQGYKPDFLVTPDDVPAGRPYP
WMCYKNAMELGVYPMNHMIKVGDTVSDMKEGRNAGMWTVGVILGSSELGLTEEEVENMDSVELREKIEVVRNRFVENGAH
FTIETMQELESVMEHIEKQELIIS
>Q9I433 3.11.1.1~~~phnX~~~Phosphonoacetaldehyde hydrolase~~~
MNYNQPATLQAAILDWAGTVVDFGSFAPTQIFVEAFAEFGVQVSLEEARGPMGMGKWDHIRTLCDIPAIAERYRAVFGRL
PSDDDVTAIYERFMPLQIEKIAEHSALIPGALQAIAELRGMGLKIGSCSGYPAVVMEKVVALAETNGYVADHVVATDEVP
NGRPWPAQALANVIALGIDDVAACVKVDDTWPGILEGRRAGMWTVALTCSGNALGLTYEQYQALPAAELERERTRIEQMF
EGSRPHYLIETIAELPAVVRDINARLARGEMPQGN
>Q7ZAP3 3.11.1.1~~~phnX~~~Phosphonoacetaldehyde hydrolase~~~
MNRIHAVILDWAGTTVDFGSFAPTQIFVEAFRQAFDVEITLAEARVPMGLGKWQHIEALGKLPAVDARWQAKFGRSMSAA
DIDAIYAAFMPLQIAKVVDFSSPIAGVIDTIAALRAEGIKIGSCSGYPRAVMERLVPAAAGHGYRPDHWVATDDLAAGGR
PGPWMALQNVIALGIDAVAHCVKVDDAAPGISEGLNAGMWTVGLAVSGNEFGATWDAYQTMSKEDVAVRREHAASKLYAA
GAHYVVDSLADLPGVIAHINARLAQGERP
>P0DX11 1.14.11.71~~~phnY*~~~Methylphosphonate hydroxylase~~~COG5285
MNTTLTDSQLDRWNQTGYIKLPEFLSEAETQNLREWVEEISAWPADDEKWMHHFEQTPSGVRPARTEYILAFHAGIRQLL
TQGKIPDCAGALMGEPAILYKEKINYKYPGGGGYAAHQDAPAYEFIRNHITCSIAVDAATPENGCLFFTPELHQRGLLHL
DKNGCIDREYADTLDWEPVPMQPGDALFFSSYAPHKSPPNETQQPRRTLYLTYNALAEGDLREEYYADKRRSFAQVDTTG
GEKLKISKIGHFDGKPAQQT
>Q92UV7 1.2.1.-~~~phnY~~~Phosphonoacetaldehyde dehydrogenase~~~COG1012
MTNAEVTIAVRHEPMRIAGRLVDTDDRVEVRYPWNDTVVGTVPAGRAEHAREAFAIAAAYQPKLTRYERQKILLATAEAL
AARKEEISDVITLELGISKADSLYEVGRAFDVFTLAGQMCIRDDGEIFSCDLTPHGKARKIFTMREPLTAISAITPFNHP
LNMVAHKVAPAIATNNCVVVKPTELTPMTALLLADILYEAGLPPEMLSVVTGWPADIGMEMITNPHVDLVTFTGSVPVGK
LIAANAHYKRQVLELGGNDPLIILNDLSDDDLARAADLAVAGATKNSGQRCTAVKRILCQESVADRFVPLVLERAKRLRF
GDPMDRSTDLGTVIHEKAAALFEERVMRAAEEGADILYHPGRSGALLPPIVVDRVPHQSDLVLEETFGPIIPIVRVPDDD
DATITLSNSTAFGLSSGVCTNDYRRMQKYIAGLKVGTVNIWEVPGYRIEMSPFGGIKDSGNGYKEGVIEAMKSFTNVKTF
SLPWP
>D0E8I4 1.14.11.46~~~phnY~~~2-aminoethylphosphonate dioxygenase~~~
MSYFTQEQKTQWKDNGFVHLKGFLNEALAQDIKDWTQELYEWEEAPGKWMKYFETSSDTGERLLCRVENFIDYHKGIKGF
LCGEMIYGMVSELMGEQAVLFKEKINFKYPGGAGFAYHQDAPAFTSFGQKYHITMMVSVDASNEENGCLRMAHGFSEEKT
LEQEPDGTVCKKLAAKLDWRPLETGPGDLVLFNSYVPHYSEANTSDRSRRAMFITYNRLSEGEKRLDYFKDKREKFPPEA
ERIEGKDYSSAESLYNLGNPIK
>P0DX12 1.13.11.89~~~phnZ1~~~Hydroxymethylphosphonate dioxygenase~~~COG4341
MNEKIELSLCDVLSLSKTPGERVTSLFELMQDHGQSFYDESVTQLEHALQAAHLAKTSHATMEQITAALLHDIGHFLMDE
HDEQNHFLAEDWQHETVGAQQLAPFFGKAVTEPIFLHVPAKRYLCSVNADYFNGLSRASQRSYELQGGLMTDAEIAEFEQ
NPFHHTAVLLRRWDDGAKVKGLKVPDLQEYHTEVESCLEHS
>D0E8I5 1.13.11.78~~~phnZ~~~2-amino-1-hydroxyethylphosphonate dioxygenase (glycine-forming)~~~
MSLSNSSKVSVLISLLEKSRDLDYIGEAINQLEHSLQCAYFAQRSGADNEMVLAALLHDLGHYCNDTSFEDMGGYGVWQH
EKVGADYLRGLGFSERVACLIEGHVAAKRYLVSSKASYLKNLSDASRKTLEYQGGPMDEGERRLFEEREDFKDCLKIRAW
DEKGKQTDLKVPGPEHYRKMMEEHLSENQN
>P0AFJ5 ~~~phoB~~~Phosphate regulon transcriptional regulatory protein PhoB~~~COG0745
MARRILVVEDEAPIREMVCFVLEQNGFQPVEAEDYDSAVNQLNEPWPDLILLDWMLPGGSGIQFIKHLKRESMTRDIPVV
MLTARGEEEDRVRGLETGADDYITKPFSPKELVARIKAVMRRISPMAVEEVIEMQGLSLDPTSHRVMAGEEPLEMGPTEF
KLLHFFMTHPERVYSREQLLNHVWGTNVYVEDRTVDVHIRRLRKALEPGGHDRMVQTVRGTGYRFSTRF
>P28581 3.1.3.2~~~phoC~~~Major phosphate-irrepressible acid phosphatase~~~
MKKNIIAGCLFSLFSLSALAAIPAGNDATTKPDLYYLKNEQAIDSLKLLPPPPEVGSIQFLNDQAMYEKGRMLRNTERGK
QAQADADLAAGGVATAFSGAFGYPITEKDSPELYKLLTNMIEDAGDLATRSAKEHYMRIRPFAFYGTETCNTKDQKKLST
NGSYPSGHTSIGWATALVLAEVNPANQDAILERGYQLGQSRVICGYHWQSDVDAARIVGSAAVATLHSDPAFQAQLAKAK
QEFAQKSQK
>Q01605 ~~~phoE~~~Outer membrane porin PhoE~~~
MKKSTLALVVMGITASASVQAAEVYNKNGNKLDLYGKVKAMHYMTDYDSKDGDQSYIRLGFKGETQINDELTGYGRWEAE
FAGNKAESDSNQQKTRLAFAGSKLKNLGSFDYGRNLGALYDVEAWTDMFPEFGGDSSAQTDNFMTKRASGLATYRNTDFF
GVVDGLDLTLQYQGKNQDRDVKKQNGDGFGTSVTYDFGGSDFAVSGAYTNSDRTNQQNLQTRGTGDKAEAWATGLKYDAN
DIYIATFYSETRNMTPISGGFANKTQNFEAVVQYQFDFGLRPSLGYVLSKGKDIEGVGNEDLVNYIDVGATYYFNKNMSA
FVDYKINQLDSDNKLGINNDDIVAVGMVYQF
>P02932 ~~~phoE~~~Outer membrane porin PhoE~~~COG3203
MKKSTLALVVMGIVASASVQAAEIYNKDGNKLDVYGKVKAMHYMSDNASKDGDQSYIRFGFKGETQINDQLTGYGRWEAE
FAGNKAESDTAQQKTRLAFAGLKYKDLGSFDYGRNLGALYDVEAWTDMFPEFGGDSSAQTDNFMTKRASGLATYRNTDFF
GVIDGLNLTLQYQGKNENRDVKKQNGDGFGTSLTYDFGGSDFAISGAYTNSDRTNEQNLQSRGTGKRAEAWATGLKYDAN
NIYLATFYSETRKMTPITGGFANKTQNFEAVAQYQFDFGLRPSLGYVLSKGKDIEGIGDEDLVNYIDVGATYYFNKNMSA
FVDYKINQLDSDNKLNINNDDIVAVGMTYQF
>Q47490 ~~~phoE~~~Outer membrane porin PhoE~~~COG3203
MKKSTLALVVMGVVASASVHAAEVYNKNGNKLDVYGKVKAMHYISDDDTKDGDQTYVRFGFKGETQINDQLTGYGRWEAE
FAGNKAESDSSQKTRLAFAGLKLKDFGSLDYGRNLGALYDVEAWTDMFPEFGGDSSAQTDNFMTKRASGLATYRNTDFFG
AIDGLDMTLQYQGKNENRDAKKQNGDGFGTSLTYDFGGTDFAVSGAYTNSDRTNAQNLLARAQGQKAEAWATGLKYDAND
IYLAAMYSETRNMTPISGGFANKAQNFEVVAQYQFDFGLRPSLGYVQSKGKDNEGIGDEDLVKYIDVGATYYFNKNMSAF
VDYKINQIDDDNKLGVSSDDIVAVGMTYQF
>Q01606 ~~~phoE~~~Outer membrane porin PhoE~~~COG3203
MKKSSLALMMMGLIASSATQAAEVYNKNGNKLDVYGKVKAMHYMSDYDSKDGDQTYVRFGIKGETQINDDLTGYGRWESE
FSGNKTESDSSQKTRLAFAGVKVKNYGSFDYGRNLGALYDVEAWTDMFPEFGGDSSAQTDNFMTKRASGLATYRNTDFFG
VVDGLDLTLQYQGKNEGREAKKQNGDGFGTSLSYDFGGSDFAVSAAYTSSDRTNDQNLLARGAKKAEAWATGLKYDANNI
YLATMYSETRKMTPISGGFANKAQNFEAVAQYQFDFGLRPSLGYVLSKGKDIEGVGSEDLVNYIDVGVTYYFNKNMNAFV
DYKINQLKSDNKLGINDDDIVAVGMTYQF
>P30704 ~~~phoE~~~Outer membrane porin PhoE~~~
MKKSTLALMMMGFVASTATQAAEVYNKNANKLDVYGKIKAMHYFSDYDSKDGDQTYVRFGIKGETQINEDLTGYGRWESE
FSGNKTESDSSQQKTRLAFAGVKLKNYGSFDYGRNLGALYDVEAWTDMFPEFGGDSSAQTDNFMTKRASGLATYRNTDFF
GLVDGLDLTLQYQGKNEGREAKKQNGDGVGTSLSYDFGGTDFAVSAAYTSSDRTNDQNLLARAQGSKAEAWATGLKYDAN
NIYLATMYSETRKMTPISGGFANKAQNFEAVAQYQFDFGLRPSLGYVLSKGKDIEGVGSEDLVNYIDVGLTYYFNKNMNA
FVDYKINQLKSDNKLGINDDDIVALGMTYQF
>Q56119 ~~~phoE~~~Outer membrane porin PhoE~~~COG3203
MNKSTLAIVVSIIASASVHAAEVYNKNGNKLDVYGKVKAMHYMSDYDSKDGDQSYVRFGFKGETQINDQLTGYGRWEAEF
AGNKAESDSSQQKTRLAFAGLKLKDIGSFDYGRNLGALYDVEAWTDMFPEFGGDSSAQTDNFMTKRASGLATYRNTDFFG
IVDGLDLTLQYQGKNEDRDVKKQNGDGFGTSVSYDFGGSDFAVSGAYTLSDRTREQNLQRRGTGDKAEAWATGVKYDAND
IYIATFYSETRNMTPVSGGFANKTQNFEAVIQYQFDFGLRPSLGYVLSKGKDIEGVGSEDLVNYIDVGAIYYFNKNMSAF
VDYKINQLDSDNTLGINDDDIVAIGLTYQF
>P30705 ~~~phoE~~~Outer membrane porin PhoE~~~
MNKSTLAIVVSIIASASVHAAEVYNKNGNKLDVYGKVKAMHYMSDYDSKDGDQSYVRFGFKGETQINDQLTGYGRWEAEF
ASNKAESDSSQQKTRLAFAGLKLKDIGSFDYGRNLGALYDVEAWTDMFPEFGGDSSAQTDNFMTKRASGLATYRNTDFFG
IVDGLDLTLQYQGKNEDRDVKKQNGDGFGTSVSYDFGGSDFAVSGAYTLSDRTREQNLQRRGTGDKAEAWATGVKYDAND
IYIATFYSETRNMTPVSGGFANKTQNFEAVIQYQFDFGLRPSLGYVLSKGKDIEGVGSEDLVNYIDVGATYYFNKNMSAF
VDYKINQLDSDNTLGINDDDIVAIGLTYQF
>P0A9K1 ~~~phoH~~~Protein PhoH~~~COG1702
MVTSCTGHVLDNQRATTRGVFSSGSHLVTLHFQPHPFFSCVTDAVNGARSRFSAFYPKANYGLQGSQPSDVRAHNRAANG
ACDEYKQLKVLSMGRQKAVIKARREAKRVLRRDSRSHKQREEESVTSLVQMGGVEAIGMARDSRDTSPILARNEAQLHYL
KAIESKQLIFATGEAGCGKTWISAAKAAEALIHKDVDRIIVTRPVLQADEDLGFLPGDIAEKFAPYFRPVYDVLVRRLGA
SFMQYCLRPEIGKVEIAPFAYMRGRTFENAVVILDEAQNVTAAQMKMFLTRLGENVTVIVNGDITQCDLPRGVCSGLSDA
LERFEEDEMVGIVRFGKEDCVRSALCQRTLHAYS
>P0A9K3 ~~~ybeZ~~~PhoH-like protein~~~COG1702
MNIDTREITLEPADNARLLSLCGPFDDNIKQLERRLGIEINRRDNHFKLTGRPICVTAAADILRSLYVDTAPMRGQIQDI
EPEQIHLAIKEARVLEQSAESVPEYGKAVNIKTKRGVIKPRTPNQAQYIANILDHDITFGVGPAGTGKTYLAVAAAVDAL
ERQEIRRILLTRPAVEAGEKLGFLPGDLSQKVDPYLRPLYDALFEMLGFEKVEKLIERNVIEVAPLAYMRGRTLNDAFII
LDESQNTTIEQMKMFLTRIGFNSKAVITGDVTQIDLPRNTKSGLRHAIEVLADVEEISFNFFHSEDVVRHPVVARIVNAY
EAWEEAEQKRKAALAAERKREEQEQK
>P9WIA3 ~~~~~~PhoH-like protein~~~COG1702
MTSRETRAADAAGARQADAQVRSSIDVPPDLVVGLLGSADENLRALERTLSADLHVRGNAVTLCGEPADVALAERVISEL
IAIVASGQSLTPEVVRHSVAMLVGTGNESPAEVLTLDILSRRGKTIRPKTLNQKRYVDAIDANTIVFGIGPAGTGKTYLA
MAKAVHALQTKQVTRIILTRPAVEAGERLGFLPGTLSEKIDPYLRPLYDALYDMMDPELIPKLMSAGVIEVAPLAYMRGR
TLNDAFIVLDEAQNTTAEQMKMFLTRLGFGSKVVVTGDVTQIDLPGGARSGLRAAVDILEDIDDIHIAELTSVDVVRHRL
VSEIVDAYARYEEPGSGLNRAARRASGARGRR
>P13792 ~~~phoP~~~Alkaline phosphatase synthesis transcriptional regulatory protein PhoP~~~COG0745
MNKKILVVDDEESIVTLLQYNLERSGYDVITASDGEEALKKAETEKPDLIVLDVMLPKLDGIEVCKQLRQQKLMFPILML
TAKDEEFDKVLGLELGADDYMTKPFSPREVNARVKAILRRSEIAAPSSEMKNDEMEGQIVIGDLKILPDHYEAYFKESQL
ELTPKEFELLLYLGRHKGRVLTRDLLLSAVWNYDFAGDTRIVDVHISHLRDKIENNTKKPIYIKTIRGLGYKLEEPKMNE
>P23836 ~~~phoP~~~Transcriptional regulatory protein PhoP~~~COG0745
MRVLVVEDNALLRHHLKVQIQDAGHQVDDAEDAKEADYYLNEHIPDIAIVDLGLPDEDGLSLIRRWRSNDVSLPILVLTA
RESWQDKVEVLSAGADDYVTKPFHIEEVMARMQALMRRNSGLASQVISLPPFQVDLSRRELSINDEVIKLTAFEYTIMET
LIRNNGKVVSKDSLMLQLYPDAELRESHTIDVLMGRLRKKIQAQYPQEVITTVRGQGYLFELR
>Q9I4F9 ~~~phoP~~~Two-component response regulator PhoP~~~
MKLLVVEDEALLRHHLYTRLGEQGHVVDAVPDAEEALYRVSEYHHDLAVIDLGLPGMSGLDLIRELRSQGKSFPILILTA
RGNWQDKVEGLAAGADDYVVKPFQFEELEARLNALLRRSSGFVQSTIEAGPLVLDLNRKQALVEEQPVALTAYEYRILEY
LMRHHQQVVAKERLMEQLYPDDEERDANVIEVLVGRLRRKLEACGGFKPIDTVRGQGYLFTERCR
>D0ZV90 ~~~phoP~~~Virulence transcriptional regulatory protein PhoP~~~
MMRVLVVEDNALLRHHLKVQLQDSGHQVDAAEDAREADYYLNEHLPDIAIVDLGLPDEDGLSLIRRWRSSDVSLPVLVLT
AREGWQDKVEVLSSGADDYVTKPFHIEEVMARMQALMRRNSGLASQVINIPPFQVDLSRRELSVNEEVIKLTAFEYTIME
TLIRNNGKVVSKDSLMLQLYPDAELRESHTIDVLMGRLRKKIQAQYPHDVITTVRGQGYLFELR
>E1WFA1 ~~~phoP~~~Virulence transcriptional regulatory protein PhoP~~~
MMRVLVVEDNALLRHHLKVQLQDSGHQVDAAEDAREADYYLNEHLPDIAIVDLGLPDEDGLSLIRRWRSSDVSLPVLVLT
AREGWQDKVEVLSSGADDYVTKPFHIEEVMARMQALMRRNSGLASQVINIPPFQVDLSRRELSVNEEVIKLTAFEYTIME
TLIRNNGKVVSKDSLMLQLYPDAELRESHTIDVLMGRLRKKIQAQYPHDVITTVRGQGYLFELR
>F5ZP95 ~~~phoP~~~Virulence transcriptional regulatory protein PhoP~~~
MMRVLVVEDNALLRHHLKVQLQDSGHQVDAAEDAREADYYLNEHLPDIAIVDLGLPDEDGLSLIRRWRSSDVSLPVLVLT
AREGWQDKVEVLSSGADDYVTKPFHIEEVMARMQALMRRNSGLASQVINIPPFQVDLSRRELSVNEEVIKLTAFEYTIME
TLIRNNGKVVSKDSLMLQLYPDAELRESHTIDVLMGRLRKKIQAQYPHDVITTVRGQGYLFELR
>P0DM78 ~~~phoP~~~Virulence transcriptional regulatory protein PhoP~~~
MMRVLVVEDNALLRHHLKVQLQDSGHQVDAAEDAREADYYLNEHLPDIAIVDLGLPDEDGLSLIRRWRSSDVSLPVLVLT
AREGWQDKVEVLSSGADDYVTKPFHIEEVMARMQALMRRNSGLASQVINIPPFQVDLSRRELSVNEEVIKLTAFEYTIME
TLIRNNGKVVSKDSLMLQLYPDAELRESHTIDVLMGRLRKKIQAQYPHDVITTVRGQGYLFELR
>P23837 2.7.13.3~~~phoQ~~~Sensor protein PhoQ~~~COG2205
MKKLLRLFFPLSLRVRFLLATAAVVLVLSLAYGMVALIGYSVSFDKTTFRLLRGESNLFYTLAKWENNKLHVELPENIDK
QSPTMTLIYDENGQLLWAQRDVPWLMKMIQPDWLKSNGFHEIEADVNDTSLLLSGDHSIQQQLQEVREDDDDAEMTHSVA
VNVYPATSRMPKLTIVVVDTIPVELKSSYMVWSWFIYVLSANLLLVIPLLWVAAWWSLRPIEALAKEVRELEEHNRELLN
PATTRELTSLVRNLNRLLKSERERYDKYRTTLTDLTHSLKTPLAVLQSTLRSLRSEKMSVSDAEPVMLEQISRISQQIGY
YLHRASMRGGTLLSRELHPVAPLLDNLTSALNKVYQRKGVNISLDISPEISFVGEQNDFVEVMGNVLDNACKYCLEFVEI
SARQTDEHLYIVVEDDGPGIPLSKREVIFDRGQRVDTLRPGQGVGLAVAREITEQYEGKIVAGESMLGGARMEVIFGRQH
SAPKDE
>D0ZV89 2.7.13.3~~~phoQ~~~Virulence sensor histidine kinase PhoQ~~~
MNKFARHFLPLSLRVRFLLATAGVVLVLSLAYGIVALVGYSVSFDKTTFRLLRGESNLFYTLAKWENNKISVELPENLDM
QSPTMTLIYDETGKLLWTQRNIPWLIKSIQPEWLKTNGFHEIETNVDATSTLLSEDHSAQEKLKEVREDDDDAEMTHSVA
VNIYPATARMPQLTIVVVDTIPIELKRSYMVWSWFVYVLAANLLLVIPLLWIAAWWSLRPIEALAREVRELEDHHREMLN
PETTRELTSLVRNLNQLLKSERERYNKYRTTLTDLTHSLKTPLAVLQSTLRSLRNEKMSVSKAEPVMLEQISRISQQIGY
YLHRASMRGSGVLLSRELHPVAPLLDNLISALNKVYQRKGVNISMDISPEISFVGEQNDFVEVMGNVLDNACKYCLEFVE
ISARQTDDHLHIFVEDDGPGIPHSKRSLVFDRGQRADTLRPGQGVGLAVAREITEQYAGQIIASDSLLGGARMEVVFGRQ
HPTQKEE
>E1WFA0 2.7.13.3~~~phoQ~~~Virulence sensor histidine kinase PhoQ~~~
MNKFARHFLPLSLRVRFLLATAGVVLVLSLAYGIVALVGYSVSFDKTTFRLLRGESNLFYTLAKWENNKISVELPENLDM
QSPTMTLIYDETGKLLWTQRNIPWLIKSIQPEWLKTNGFHEIETNVDATSTLLSEDHSAQEKLKEVREDDDDAEMTHSVA
VNIYPATARMPQLTIVVVDTIPIELKRSYMVWSWFVYVLAANLLLVIPLLWIAAWWSLRPIEALAREVRELEDHHREMLN
PETTRELTSLVRNLNQLLKSERERYNKYRTTLTDLTHSLKTPLAVLQSTLRSLRNEKMSVSKAEPVMLEQISRISQQIGY
YLHRASMRGSGVLLSRELHPVAPLLDNLISALNKVYQRKGVNISMDISPEISFVGEQNDFVEVMGNVLDNACKYCLEFVE
ISARQTDDHLHIFVEDDGPGIPHSKRSLVFDRGQRADTLRPGQGVGLAVAREITEQYAGQIIASDSLLGGARMEVVFGRQ
HPTQKEE
>F5ZP94 2.7.13.3~~~phoQ~~~Virulence sensor histidine kinase PhoQ~~~
MNKFARHFLPLSLRVRFLLATAGVVLVLSLAYGIVALVGYSVSFDKTTFRLLRGESNLFYTLAKWENNKISVELPENLDM
QSPTMTLIYDETGKLLWTQRNIPWLIKSIQPEWLKTNGFHEIETNVDATSTLLSEDHSAQEKLKEVREDDDDAEMTHSVA
VNIYPATARMPQLTIVVVDTIPIELKRSYMVWSWFVYVLAANLLLVIPLLWIAAWWSLRPIEALAREVRELEDHHREMLN
PETTRELTSLVRNLNQLLKSERERYNKYRTTLTDLTHSLKTPLAVLQSTLRSLRNEKMSVSKAEPVMLEQISRISQQIGY
YLHRASMRGSGVLLSRELHPVAPLLDNLISALNKVYQRKGVNISMDISPEISFVGEQNDFVEVMGNVLDNACKYCLEFVE
ISARQTDDHLHIFVEDDGPGIPHSKRSLVFDRGQRADTLRPGQGVGLAVAREITEQYAGQIIASDSLLGGARMEVVFGRQ
HPTQKEE
>P0DM80 2.7.13.3~~~phoQ~~~Virulence sensor histidine kinase PhoQ~~~
MNKFARHFLPLSLRVRFLLATAGVVLVLSLAYGIVALVGYSVSFDKTTFRLLRGESNLFYTLAKWENNKISVELPENLDM
QSPTMTLIYDETGKLLWTQRNIPWLIKSIQPEWLKTNGFHEIETNVDATSTLLSEDHSAQEKLKEVREDDDDAEMTHSVA
VNIYPATARMPQLTIVVVDTIPIELKRSYMVWSWFVYVLAANLLLVIPLLWIAAWWSLRPIEALAREVRELEDHHREMLN
PETTRELTSLVRNLNQLLKSERERYNKYRTTLTDLTHSLKTPLAVLQSTLRSLRNEKMSVSKAEPVMLEQISRISQQIGY
YLHRASMRGSGVLLSRELHPVAPLLDNLISALNKVYQRKGVNISMDISPEISFVGEQNDFVEVMGNVLDNACKYCLEFVE
ISARQTDDHLHIFVEDDGPGIPHSKRSLVFDRGQRADTLRPGQGVGLAVAREITEQYAGQIIASDSLLGGARMEVVFGRQ
HPTQKEE
>P23545 2.7.13.3~~~phoR~~~Alkaline phosphatase synthesis sensor protein PhoR~~~COG5002
MNKYRVRLFSVFVVCMILVFCVLGLFLQQLFETSDQRKAEEHIEKEAKYLASLLDAGNLNNQANEKIIKDAGGALDVSAS
VIDTDGKVLYGSNGRSADSQKVQALVSGHEGILSTTDNKLYYGLSLRSEGEKTGYVLLSASEKSDGLKGELWGMLTASLC
TAFIVIVYFYSSMTSRYKRSIESATNVATELSKGNYDARTYGGYIRRSDKLGHAMNSLAIDLMEMTRTQEMQRDRLLTVI
ENIGSGLIMIDGRGFINLVNRSYAKQFHINPNHMLRRLYHDAFEHEEVIQLVEDIFMTETKKCKLLRLPIKIERRYFEVD
GVPIMGPDDEWKGIVLVFHDMTETKKLEQMRKDFVANVSHELKTPITSIKGFTETLLDGAMEDKEALSEFLSIILKESER
LQSLVQDLLDLSKIEQQNFTLSIETFEPAKMLGEIETLLKHKADEKGISLHLNVPKDPQYVSGDPYRLKQVFLNLVNNAL
TYTPEGGSVAINVKPREKDIQIEVADSGIGIQKEEIPRIFERFYRVDKDRSRNSGGTGLGLAIVKHLIEAHEGKIDVTSE
LGRGTVFTVTLKRAAEKSA
>P08400 2.7.13.3~~~phoR~~~Phosphate regulon sensor protein PhoR~~~COG5002
MLERLSWKRLVLELLLCCLPAFILGAFFGYLPWFLLASVTGLLIWHFWNLLRLSWWLWVDRSMTPPPGRGSWEPLLYGLH
QMQLRNKKRRRELGNLIKRFRSGAESLPDAVVLTTEEGGIFWCNGLAQQILGLRWPEDNGQNILNLLRYPEFTQYLKTRD
FSRPLNLVLNTGRHLEIRVMPYTHKQLLMVARDVTQMHQLEGARRNFFANVSHELRTPLTVLQGYLEMMNEQPLEGAVRE
KALHTMREQTQRMEGLVKQLLTLSKIEAAPTHLLNEKVDVPMMLRVVEREAQTLSQKKQTFTFEIDNGLKVSGNEDQLRS
AISNLVYNAVNHTPEGTHITVRWQRVPHGAEFSVEDNGPGIAPEHIPRLTERFYRVDKARSRQTGGSGLGLAIVKHAVNH
HESRLNIESTVGKGTRFSFVIPERLIAKNSD
>O34627 ~~~pfyP~~~Blue-light photoreceptor~~~COG1366
MASFQSFGIPGQLEVIKKALDHVRVGVVITDPALEDNPIVYVNQGFVQMTGYETEEILGKNCRFLQGKHTDPAEVDNIRT
ALQNKEPVTVQIQNYKKDGTMFWNELNIDPMEIEDKTYFVGIQNDITKQKEYEKLLEDSLTEITALSTPIVPIRNGISAL
PLVGNLTEERFNSIVCTLTNILSTSKDDYLIIDLSGLAQVNEQTADQIFKLSHLLKLTGTELIITGIKPELAMKMNKLDA
NFSSLKTYSNVKDAVKVLPIM
>P9WI97 ~~~phoU1~~~Phosphate-specific transport system accessory protein PhoU homolog 1~~~COG0704
MRTVYHQRLTELAGRLGEMCSLAGIAMKRATQALLEADIGAAEQVIRDHERIVAMRAQVEKEAFALLALQHPVAGELREI
FSAVQIIADTERMGALAVHIAKITRREYPNQVLPEEVRNCFADMAKVAIALGDSARQVLVNRDPQEAAQLHDRDDAMDDL
HRHLLSVLIDREWRHGVRVGVETALLGRFFERFADHAVEVGRRVIFMVTGVLPTEDEISTY
>P9WI95 ~~~phoU2~~~Phosphate-specific transport system accessory protein PhoU homolog 2~~~COG0704
MRTAYHEQLSELSERLGEMCGLAGIAMERATQALLQADLVLAEQVISDHEKIATLSARAEESAFVLLALQAPVAGDLRAI
VSAIQMVADIDRMGALALHVAKIARRRHPQHALPEEVNGYFAEMGRVAVELGNSAQEVVLSHDPEKAAQIREEDDAMDDL
HRHLFTVLMDREWKHGVAAAVDVTLLSRFYERFADHAVEVARRVIFQATGAFP
>Q9X256 ~~~phoU2~~~Phosphate-specific transport system accessory protein PhoU homolog 2~~~COG0704
MNRLLNEKVEEFKKGVLKAGWFIEKMFRNSISSLVERNESLAREVIADEEVVDQMEVEIQEKAMEVLGLFSPIGKPLLTV
TAGIRVAELIENIADKCHDIAKNVLELMEEPPLKPLEDIPAMANQTSEMLKFALRMFADVNVEKSFEVCRMDSKVDDLYE
KVREELLLYMMESPKYVKRALLLLEIAGNIEIIADYATNIVEVSVYMVQGEAYKCYHDELLLFKKSGGVLFESSD
>O67053 ~~~phoU~~~Phosphate-specific transport system accessory protein PhoU homolog~~~COG0704
MKLFKELEETKEQVIKMAKLVQEAIDKATEALNKQNVELAEEVIKGDDTIDLLEVDIERRCIRMIALYQPEAGDLRMIMG
IYKIVSDLERMGDEAENIAERAILLAEEPPLKPYVNINFMSEIVKEMVNDSVISFIQQDTLLAKKVIEKDDTVDELYHQL
ERELMTYVLEDPRNIKRAMHLSFVARHYERIADHAENVAEAAIYLSEGEIVKHQHIKEKGE
>Q97ID9 ~~~phoU~~~Phosphate-specific transport system accessory protein PhoU homolog~~~COG0704
MTRKIFESDLEELHSELLRMGSMAEKQIYDCMEALEKQDENMAEVIIKKDDIIDDMQKEIENKVIRLIAMQQPIVAEDLR
NIFTTVKIVTDLERLGDHAVDIAKAIKRLNGEKHHDIVKEIWNMGNKVKSMIKDSLDAYVERNLDKAYEVCKRDDDVDSL
YKRIFNELLNIMSEDKSKVNQLTQFLFVCKYLERIGDRTTNVCESTIYLITGKQVDLND
>Q8FBT8 ~~~phoU~~~Phosphate-specific transport system accessory protein PhoU~~~COG0704
MIQECVMDSLNLNKHISGQFNAELESIRTQVMTMGGMVEQQLSDAITAMHNQDSDLAKRVIEGDKNVNMMEVAIDEACVR
IIAKRQPTASDLRLVMVISKTIAELERIGDVADKICRTALEKFSQQHQPLLVSLESLGRHTIQMLHDVLDAFARMDIDEA
VRIYREDKKVDQEYEGIVRQLMTYMMEDSRTIPSVLTALFCARSIERIGDRCQNICEFIFYYVKGQDFRHVGGDELDKLL
AEKDSDK
>P0A9K7 ~~~phoU~~~Phosphate-specific transport system accessory protein PhoU~~~COG0704
MDSLNLNKHISGQFNAELESIRTQVMTMGGMVEQQLSDAITAMHNQDSDLAKRVIEGDKNVNMMEVAIDEACVRIIAKRQ
PTASDLRLVMVISKTIAELERIGDVADKICRTALEKFSQQHQPLLVSLESLGRHTIQMLHDVLDAFARMDIDEAVRIYRE
DKKVDQEYEGIVRQLMTYMMEDSRTIPSVLTALFCARSIERIGDRCQNICEFIFYYVKGQDFRHVGGDELDKLLAGKDSD
K
>Q83WB2 ~~~phoU~~~Phosphate-specific transport system accessory protein PhoU~~~
MDNLNLNKHTSGQFNAELEYIRTQVMSMGGLVEQQLTDAITAMHNQDADLARRVVEGDAKVNMMEIAIDEACVKIIAKRQ
PTASDLRLVMAIIKTISELERIGDVADKICRTALEKFSQQHQPLLVSLESLGQHTVQMLHDVLDAFARMDLNEAIRIYRE
DKKVDQEYEGIVRQLMTYMMEDSRTIPSVLTALFCARSIERIGDRCQNICEFIFYYVKGQDFRHIGGDDLEQLLTDHRRV
DEA
>Q51547 ~~~phoU~~~Phosphate-specific transport system accessory protein PhoU homolog~~~
MINKDSLTHHISQQFNAELEDVRSHLLAMGGLVEKQVNDAVNALIDADSGLAQQVREIDDQINQMERNIDEECVRILARR
QPAASDLRLIISISKSVIDLERIGDEASKVARRAIQLCEEGESPRGYVEVRHIGSQVQKMVQEALDAFARFDADLALSVA
QYDKTVDREYKTALRELVTYMMEDPRAISRVLNIIWALRSLERIGDHARNIAELVIYLVRGTDVRHIGLTRMKEEVENNR
GE
>P0A3Y7 ~~~phoU~~~Phosphate-specific transport system accessory protein PhoU homolog~~~COG0704
MRNQFDLELHELEQSFLGLGQLVLETASKALLALASKDKEMAELIINKDHAINQGQSAIELTCARLLALQQPQVSDLRFV
ISIMSSCSDLERMGDHMAGIAKAVLQLKENQLAPDEEQLHQMGKLSLSMLADLLVAFPLHQASKAISIAQKDEQIDQYYY
ALSKEIIGLMKDQETSIPNGTQYLYIIGHLERFADYIANICERLVYLETGELVDLN
>D9XF45 1.1.1.309~~~phpC~~~Phosphonoacetaldehyde reductase~~~COG1454
MTAVFPGELLLAEGIHEIARVTALLSGPLRRAPRVAQVVGPGFAGRPWAPRLTDALRPLDPTVVVHDGPTTPDSVAALAR
QLRAIRADVAVAIGGGTVMDAAKAAAALADGGPPDADRVRQACAAGPAAGDTPPAVRVVAVPTTAGTGAEATPFATLWDL
KHRRKLSLTGPRVRPSAAVLAPELLAGLGRRALATGILDALCQGAEASWSIRSTPESIRWGTSAVTLAAEALDQVQDDAP
DAAARLALQRAAHHSGRAIALAQTSSCHAISYPLTLRLGLAHGHACGVTLGRLLRYNHAVPAGDCADPRGTGHVRRVLDA
LAAPLGGTPARAALRVERFITACGLTPYDALDVDHRSLAAEAVTYPRCHDNPRRLDRESLGRLLGERSEMEETCG
>Q5IW36 2.7.7.93~~~phpF~~~Phosphonoformate cytidylyltransferase~~~COG1056
MSAEQIAGTGVIHGRFQPLHLGHLEYLLAGAERCRTLVVGITNPDPWTTTEETTDPERGLPESNPCTFYERYLMVEGALT
EAGVSHERLRIVPFPHSFPERLAHYAPADARYFVTVYDDWGDAKLDRFHALGLRTEVMWRRTDKPVSGGRVRRSIAEGQP
WEHLVPPAVARVVKECGIDERIRA
>Q8KY51 3.1.3.16~~~phpP~~~Protein phosphatase PhpP~~~COG0631
MEISLLTDVGQKRTNNQDYVNHYVNRAGRTMIILADGMGGHRAGNIASEMAVTDLGVAWVDTQIDTVNEVREWFAHYLEI
ENQKIHQLGQDEAYRGMGTTLEVLAIIDNQAIYAHIGDSRIGLIRGEEYHQLTSDHSLVNELLKAGQLTPEEAEAHPQKN
IITQSIGQKDEIQPDFGTVILESGDYLLLDSDGLTNMISGSEIRDIVTSDIPLADKTETLVRFANNAGGLDNITVALVSM
NEEDAE
>Q04J42 3.1.3.16~~~phpP~~~Protein phosphatase PhpP~~~COG0631
MEISLLTDVGQKRTNNQDYVNHYVNRAGRTMIILADGMGGHRAGNIASEMAVTDLGVAWVDTQIDTVNEVREWFAHYLEI
ENQKIHQLGQDEAYRGMGTTLEVLAIIDNQAIYAHIGDSRIGLIRGEEYHQLTSDHSLVNELLKAGQLTPEEAEAHPQKN
IITQSIGQKDEIQPDFGTVILESGDYLLLNSDGLTNMISGSEIRDIVTSDIPLADKTETLVRFANNAGGLDNITVALVSM
NEEDAE
>P45548 3.1.-.-~~~php~~~Phosphotriesterase homology protein~~~COG1735
MSFDPTGYTLAHEHLHIDLSGFKNNVDCRLDQYAFICQEMNDLMTRGVRNVIEMTNRYMGRNAQFMLDVMRETGINVVAC
TGYYQDAFFPEHVATRSVQELAQEMVDEIEQGIDGTELKAGIIAEIGTSEGKITPLEEKVFIAAALAHNQTGRPISTHTS
FSTMGLEQLALLQAHGVDLSRVTVGHCDLKDNLDNILKMIDLGAYVQFDTIGKNSYYPDEKRIAMLHALRDRGLLNRVML
SMDITRRSHLKANGGYGYDYLLTTFIPQLRQSGFSQADVDVMLRENPSQFFQ
>P9WHN9 ~~~php~~~Phosphotriesterase homology protein~~~COG1735
MPELNTARGPIDTADLGVTLMHEHVFIMTTEIAQNYPEAWGDEDKRVAGAIARLGELKARGVDTIVDLTVIGLGRYIPRI
ARVAAATELNIVVATGLYTYNDVPFYFHYLGPGAQLDGPEIMTDMFVRDIEHGIADTGIKAGILKCATDEPGLTPGVERV
LRAVAQAHKRTGAPISTHTHAGLRRGLDQQRIFAEEGVDLSRVVIGHCGDSTDVGYLEELIAAGSYLGMDRFGVDVISPF
QDRVNIVARMCERGHADKMVLSHDACCYFDALPEELVPVAMPNWHYLHIHNDVIPALKQHGVTDEQLHTMLVDNPRRIFE
RQGGYQ
>A9CJC9 4.1.99.3~~~phrA~~~Deoxyribodipyrimidine photo-lyase~~~COG0415
MSLKTAPVIVWFRKDLRLSDNLALLAAVEHGGPVIPVYIREKSAGPLGGAQEWWLHHSLAALSSSLEKAGGRLVLASGDA
ERILRDLISETGADTVVWNRRYDPTGMATDKALKQKLRDDGLTVRSFSGQLLHEPSRLQTKSGGPYRVYTPFWRALEGSD
EPHAPADPPKSLTAPKVWPKSEKLSNWKLLPTKPDWAKDFSDIWTPGETGALDKLDDFIDGALKGYEEGRDFPAKPATSL
LSPHLAAGEISPAAVWHATKGLSRHIASNDISRFRKEIVWREFCYHLLFHFPELGEKNWNDSFDAFSWRDDEKSFKAWTR
GMTGYPIVDAGMRQLWQHGTMHNRVRMIVASFLIKHLLIDWRKGEKWFRDTLVDADPASNAANWQWVAGSGADASPFFRI
FNPILQGEKFDGDGDYVRRFVPELEKLERKYIHKPFEAPKDALKKAGVELGKTYPLPIVDHGKARERALAAYAAVKKTT
>Q00829 ~~~phrA~~~Phosphatase RapA inhibitor~~~
MKSKWMSGLLLVAVGFSFTQVMVHAGETANTEGKTFHIAARNQT
>P11394 ~~~rpcA~~~R-phycocyanin-2 subunit alpha~~~
MKTPLTEAVAAADSQGRFLSNTEVQAASGRFNRAKASLEAAKALTSKADSLVNGAAQAVYSKFPYTTQMEGSNYSATPEG
KAKCSRDVGYYLRMITYCLVAGGTGPMDDYLIAGLDEINRTFELSPSWYVEALKHIKANHGLSGDAATEANSYIDYAINA
LI
>A9CH39 4.1.99.13~~~phrB~~~(6-4) photolyase~~~COG3046
MSQLVLILGDQLSPSIAALDGVDKKQDTIVLCEVMAEASYVGHHKKKIAFIFSAMRHFAEELRGEGYRVRYTRIDDADNA
GSFTGEVKRAIDDLTPSRICVTEPGEWRVRSEMDGFAGAFGIQVDIRSDRRFLSSHGEFRNWAAGRKSLTMEYFYREMRR
KTGLLMNGEQPVGGRWNFDAENRQPARPDLLRPKHPVFAPDKITKEVIDTVERLFPDNFGKLENFGFAVTRTDAERALSA
FIDDFLCNFGATQDAMLQDDPNLNHSLLSFYINCGLLDALDVCKAAERAYHEGGAPLNAVEGFIRQIIGWREYMRGIYWL
AGPDYVDSNFFENDRSLPVFYWTGKTHMNCMAKVITETIENAYAHHIQRLMITGNFALLAGIDPKAVHRWYLEVYADAYE
WVELPNVIGMSQFADGGFLGTKPYAASGNYINRMSDYCDTCRYDPKERLGDNACPFNALYWDFLARNREKLKSNHRLAQP
YATWARMSEDVRHDLRAKAAAFLRKLD
>P11395 ~~~rpcB~~~R-phycocyanin-2 beta chain~~~
MFDAFTKVVAQADARGQFISTSEIDALAAMVSDSNKRLDAVNRISSNASTIVASAARQLFAQQPALIAPGGNAYTSRRMA
ACLRDMEIILRYVTYSAFTGDASVMEDRCLNGLRETYLALGTPGASVAAGVNLMKDAALAIINDKAGISAGDCASLSSEI
GTYFDRAAASVA
>P94416 ~~~phrC~~~Competence and sporulation stimulating factor~~~
MKLKSKLFVICLAAAAIFTAAGVSANAEALDFHVTERGMT
>O32025 ~~~phrE~~~Phosphatase RapE inhibitor~~~
MKSKLFISLSAVLIGLAFFGSMYNGEMKEASRNVTLAPTHEFLV
>P71001 ~~~phrF~~~RapF inhibitor~~~
MKLKSKLLLSCLALSTVFVATTIANAPTHQIEVAQRGMI
>O32295 ~~~phrG~~~RapG inhibitor~~~
MKRFLIGAGVAAVILSGWFIADHQTHSQEMKVAEKMIG
>Q59HN7 ~~~phrH~~~Phosphatase RapH inhibitor~~~
MPIKKKVMMCLAVTLVFGSMSFPTLTNSGGFKESTDRNTTYIDHSPYKLSDQKKALS
>O31492 ~~~phrI~~~Phosphatase RapI inhibitor~~~
MKISRILLAAVILSSVFSITYLQSDHNTEIKVAADRVGA
>P00914 4.1.99.3~~~phrB~~~Deoxyribodipyrimidine photo-lyase~~~COG0415
MTTHLVWFRQDLRLHDNLALAAACRNSSARVLALYIATPRQWATHNMSPRQAELINAQLNGLQIALAEKGIPLLFREVDD
FVASVEIVKQVCAENSVTHLFYNYQYEVNERARDVEVERALRNVVCEGFDDSVILPPGAVMTGNHEMYKVFTPFKNAWLK
RLREGMPECVAAPKVRSSGSIEPSPSITLNYPRQSFDTAHFPVEEKAAIAQLRQFCQNGAGEYEQQRDFPAVEGTSRLSA
SLATGGLSPRQCLHRLLAEQPQALDGGAGSVWLNELIWREFYRHLITYHPSLCKHRPFIAWTDRVQWQSNPAHLQAWQEG
KTGYPIVDAAMRQLNSTGWMHNRLRMITASFLVKDLLIDWREGERYFMSQLIDGDLAANNGGWQWAASTGTDAAPYFRIF
NPTTQGEKFDHEGEFIRQWLPELRDVPGKVVHEPWKWAQKAGVTLDYPQPIVEHKEARVQTLAAYEAARKGK
>P25078 4.1.99.3~~~phrB~~~Deoxyribodipyrimidine photo-lyase~~~
MPTHLVWFRRDLRLQDNLALAAACRDASARVLALYISTPAQWQAHDMAPRQAAFISAQLNALQTALAEKGIPLLFHEVAD
FNASIETVKNVCRQHDVSHLFYNYQYEFNERQRDAAVEKTLPSVICEGFDDSVILAPGAVMTGNHEMYKVFTPFKNAWLK
RLKEDIPPCVPAPKIRVSGALSTPLTPVSLNYPQQAFDAALFPVEENAVIAQLRQFCAQGADEYALRRDFPAVDGTSRLS
ASLATGGLSPRQCLHRLLAEQPQALDGGPGSVWLNELIWREFYRHLMTWYPALCKHQPFIRWTKRVAWQENPHYFQAWQK
GETGYPIVDAAMRQLNATGWMHNRLRMITASFLVKDLLIDWRLGERYFMSQLIDGDLAANNGGWQWAASTGTDAAPYFRI
FNPTTQGERFDRDGEFIRQWLPALRDIPGKAIHEPWRWAEKAGVVLDYPRPIVEHKQARIATLSAYEAARKGA
>P05327 4.1.99.3~~~phr~~~Deoxyribodipyrimidine photo-lyase~~~COG0415
MAAPILFWHRRDLRLSDNIGLAAARAQSAQLIGLFCLDPQILQSADMAPARVAYLQGCLQELQQRYQQAGSRLLLLQGDP
QHLIPQLAQQLQAEAVYWNQDIEPYGRDRDGQVAAALKTAGIRAVQLWDQLLHSPDQILSGSGNPYSVYGPFWKNWQAQP
KPTPVATPTELVDLSPEQLTAIAPLLLSELPTLKQLGFDWDGGFPVEPGETAAIARLQEFCDRAIADYDPQRNFPAEAGT
SGLSPALKFGAIGIRQAWRAASAAHALSRSDEARNSIRVWQQELAWREFYQHALYHFPSLADGPYRSLWQQFPWENREAL
FTAWTQAQTGYPIVDAAMRQLTETGWMHNRCWMIVASFLTKDLIIDWRRGEQFFMQHLVDGDLAANNGGWQWSASSGMDP
KPLRIFNPASQAKKFDATATYIKRWLPELRHVHPKDLISGEITPIGRRGYPAPIVNHNLRQKQFKALYNQLKAAIAEPEA
EPDS
>Q55081 4.1.99.3~~~phrA~~~Deoxyribodipyrimidine photo-lyase~~~COG0415
MGRQLILFPMSDQSDHPLILLWHRRDLRLNDHLALAKARQKTAKIVGVFCLDNKILQAEDMAPARVAYLLGCLQSLQDHY
QRLGSELLVFQADPVQLLPKLANTLGAHGVTWTLDTEPYAQKRDLAVAQALRERGLAIATEWDQLMHHPGEVLTQAGSPY
TVYTPFWKNWSQLPKTSPVPTPKDLQGLTPAEKEKLAPLEPLAIPQLADLGFIWDQPLPLTPGEEAAEQRLDWFVAHGLE
EYQQNRNFPALDGTSQLSAALKFGVISPRTLWQTTLEAWEQSRSEEARASIETWQQELAWREFYQHCLYSFPALAQGPYR
SPFQEFPWEENQDHFQAWCEGRTGYPIIDAAMAQLNQTGWMHNRCRMIVASFLIKDLILNWQWGELYFMQTLYDGDLAAN
NGGWQWSASSGMDPKPLRIFNPHTQAQKFDPEGEYIRTWLPQLARFDTGDLLTGKLTPGSRRSVNYPEPIVDHNQQQREF
KRRYQLVK
>P61496 4.1.99.3~~~phr~~~Deoxyribodipyrimidine photo-lyase~~~COG0415
MGPLLVWHRGDLRLHDHPALLEALARGPVVGLVVLDPNNLKTTPRRRAWFLENVRALREAYRARGGALWVLEGLPWEKVP
EAARRLKAKAVYALTSYTPYGRYRDAKVQEALPVPLHLLPAPHLLPPDLPRAYRVYTPFARRFLGVEAPLPAPEALPKGP
EEGEIPREDPGLPLPEPGEEAALAGLRAFLEAKLPRYAEERDRLDGEGGSRLSPYFALGVLSPRLAAWEAERRGGEGARK
WVAELLWRDFSYHLLYHFPWMAERPLDPRFQALPWQEDEALFRAWYEGRTGVPLVDAAMRELHATGFLSNRARMNAAQFA
VKHLLLPWKRCEEAFRHLLLDGDRAVNLQGWQWAGGLGVDAAPYFRVFNPVLQGERHDPEGRWLKRWAPEYPSYAPKDPV
VDLEEARRRYLRLARDLARG
>P61497 4.1.99.3~~~phr~~~Deoxyribodipyrimidine photo-lyase~~~
MGPLLVWHRGDLRLHDHPALLEALARGPVVGLVVLDPNNLKTTPRRRAWFLENVRALREAYRARGGALWVLEGLPWEKVP
EAARRLKAKAVYALTSHTPYGRYRDGRVREALPVPLHLLPAPHLLPPDLPRAYRVYTPFSRLYRGAAPPLPPPEALPKGP
EEGEIPREDPGLPLPEPGEEAALAGLRAFLEAKLPRYAEERDRLDGEGGSRLSPYFALGVLSPRLAAWEAERRGGEGARK
WVAELLWRDFSYHLLYHFPWMAERPLDPRFQAFPWQEDEALFQAWYEGKTGVPLVDAAMRELHATGFLSNRARMNAAQFA
VKHLLLPWKRCEEAFRHLLLDGDRAVNLQGWQWAGGLGVDAAPYFRVFNPVLQGERHDPEGRWLKRWAPEYPSYAPKDPV
VDLEEARRRYLRLARDLARG
>Q9KNA8 4.1.99.3~~~phrA~~~Deoxyribodipyrimidine photo-lyase~~~COG0415
MRLVWFRRDLRSFDNTALTAALNSGDPVAAMYIATPEQWHQHHLAPIQADLIWRRLAELQQELAALNVPLFYQQVADFQA
AAVAVSQLAKTLNATQVLANRDYELDEQQRDQLAQQLLSEQGIIWSAFDDKCVLPPGSVRTKQGEFFKVFTPFKRAWLTL
FQPPVIGKNRPVALWNVPSALAELVWHPEQAFDYPRIDSTPWAADFETVRAQLRDFCRERVQDYHQARDFPAREGTSSLS
PYLAIGVLSARQCVARLYHESSMGELSEGAQVWLSELIWREFYQHLVAIEPNLSKSRDFVEWGARLEWWNDNEKFQLWCE
GKTGYPIVDAAMRQLNQTGWMHNRLRMIVASFLTKDLHIDWRWGERYFMSRLIDGDYAANNGGWQWCASTGCDGQPYFRI
FNPVSQGEKFDPNGDFIRRWVPELRSVSSAYIHQPWTYPAVNSVLYPARLVDHKQEREVTLRLYKTAKG
>P37600 1.8.5.5~~~phsA~~~Thiosulfate reductase molybdopterin-containing subunit PhsA~~~
MSISRRSFLQGVGIGCSACALGAFPPGALARNPIAGINGKTTLTPSLCEMCSFRCPIQAQVVNNKTVFIQGNPSAPQQGT
RICARGGSGVSLVNDPQRIVKPMKRTGPRGDGEWQVISWQQAYQEIAAKMNAIKAQHGPESVAFSSKSGSLSSHLFHLAT
AFGSPNTFTHASTCPAGKAIAAKVMMGGDLAMDIANTRYLVSFGHNLYEGIEVADTHELMTAQEKGAKMVSFDPRLSIFS
SKADEWHAIRPGGDLAVLLAMCHVMIDEQLYDASFVERYTSGFEQLAQAVKETTPEWAAAQADVPADVIVRVTRELAACA
PHAIVSPGHRATFSQEEIDMRRMIFTLNVLLGNIEREGGLYQKKNASVYNKLAGEKVAPTLAKLNIKNMPKPTAQRIDLV
APQFKYIAAGGGVVQSIIDSALTQKPYPIKAWIMSRHNPFQTVTCRSDLVKTVEQLDLVVSCDVYLSESAAYADYLLPEC
TYLERDEEVSDMSGLHPAYALRQQVVEPIGEARPSWQIWKELGEQLGLGQYYPWQDMQTRQLYQLNGDHALAKELRQKGY
LEWGVPLLLREPESVRQFTARYPGAIATDSDNTYGEQLRFKSPSGKIELYSATLEELLPGYGVPRVRDFALKKENELYFI
QGKVAVHTNGATQYVPLLSELMWDNAVWVHPQTAQEKGIKTGDEIWLENATGKEKGKALVTPGIRPDTLFVYMGFGAKAG
AKTAATTHGIHCGNLLPHVTSPVSGTVVHTAGVTLSRA
>Q53692 1.10.3.4~~~phsA~~~O-aminophenol oxidase~~~
MTDMIEQSDDRIDPIDGVLADGVLADDVLAKEREQAPAPGELTPFAAPLTVPPVLRPASDEVTRETEIALRPTWVRLHPQ
LPPTLMWGYDGQVPGPTIEVRRGQRVRIAWTNRIPKGSEYPVTSVEVPLGPPGTPAPNTEPGRGGVEPNKDVAALPAWSV
THLHGAQTGGGNDGWADNAVGFGDAQLSEYPNDHQATQWWYHDHAMNITRWNVMAGLYGTYLVRDDEEDALGLPSGDREI
PLLIADRNLDTDEDGRLNGRLLHKTVIVQQSNPETGKPVSIPFFGPYTTVNGRIWPYADVDDGWYRLRLVNASNARIYNL
VLIDEDDRPVPGVVHQIGSDGGLLPRPVPVDFDDTLPVLSAAPAERFDLLVDFRALGGRRLRLVDKGPGAPAGTPDPLGG
VRYPEVMEFRVRETCEEDSFALPEVLSGSFRRMSHDIPHGHRLIVLTPPGTKGSGGHPEIWEMAEVEDPADVQVPAEGVI
QVTGADGRTKTYRRTAATFNDGLGFTIGEGTHEQWTFLNLSPILHPMHIHLADFQVLGRDAYDASGFDLALGGTRTPVRL
DPDTPVPLAPNELGHKDVFQVPGPQGLRVMGKFDGAYGRFMYHCHLLEHEDMGMMRPFVVMPPEALKFDHGGAHGGHGEG
HTG
>P0A1I1 ~~~phsB~~~Thiosulfate reductase electron transfer subunit PhsB~~~
MNHLTNQYVMLHDEKRCIGCQACTVACKVLNDVPEGFSRVQVQIRAPEQASNALTHFQFVRVSCQHCENAPCVSVCPTGA
SYRDENGIVQVDKSRCIGCDYCVAACPFHVRYLNPQTGVADKCNFCADTRLAAGQSPACVSVCPTDALKFGRLDESEIQR
WVGQKEVYRQQEARSGAVSLYRRKEVHQEGKA
>D9XF49 6.2.1.67~~~phsB~~~Phosphinothricin tripeptide synthase PhsB~~~COG1020
MTAPQTDDVVTGRIAAIWAGLLDRPEIGIDDNVFRLGASSVMAVRAAARIREALDTPLPLRDVFESPSPAALAKRIRASR
STAPTASGPPRTAPVDSTATAPLTFQQEPMWLFDRMQPGNATYTIHFALRHEGHLDLGVLDRCVRDVVRRHAVLRTVFPS
VDDRPAQQVLDRAHIPLGESDLRALPEAERPVAAARIAAREAQAPFDLSTGPMLRVRVLHLSDTRQRLLLTMPHIVTDAW
SDDILVRELNHLYRAHTDGIAPALPPLPVQYHDWAARQRAELAGTEAELLDWWRHHLAGVPPLLELPADRPRAAVKRHRG
GRLLFDIPESVTRRLEGLAKDEGTTPYAVLLAGFAALLHRLTGQDDLLVGSPVAGRTHTETEGLVGLFVNTVAVRCDVAG
RPSFLELVRRTRRTVVESFARQELPFHRLVEELAPVRSPAYTPLVQVMLALQNTPDRDGREPGEGPFAREESGGDTGSAM
FDLTLFVTGSASGMRGEWEYDSDLFDRERVAELGPQLVTLLDAALDRTDLPVALLPLQEPAARDRMVQAWNDTADVLPGG
PDADSLPALLSAQAHRTPDAVALRTDDGAELTYRQLHLRADRLARRLLSYGLAPESVVAVACERSFEMVVALLAVLKAGC
AYLPIDPGDPAERTAYLLRDSGARVLLTLHRHTANLPDADGTTVVTLDEPDPSGDMQDTTSALPGIAPGQLAYLIYTSGS
TGRPKGVLNEHGPVCNRIRWGMRAFPPGPGTIVLQKTPIHFDVSVWEMFWTLATGATLVLARPDGHRDPQYLAGRLVEEG
VTDVHFVPSMLAAFLDVGALPEGHSLRRVFCSGEALSPGLRDRLFARLPHVELHNLYGPTEAAIEVTHWRCRPGEPTVPI
GRPIANARCYVLDAELNPVPPGVPGELWLGGVPVARGYHGRADLTAERFLPDPYGPAGSRMYRSGDLARWRRDGVLEYLG
REDGQVKLRGQRLELGEIEATLAGHAEVADVVVDVRGTGPQDRRLVAYVRPARPGRDEQLRTTLRELAAARLPAYMRPSS
YVTLDRVPLTPSGKTDRKALPDPAAGEQPRSGRGAAPGTPAERELAGIWAELLGAGEVGGDDNFFEIGGHSLLAARMTGR
ASTAFGVDLPVSLAFEHPVLRDFALAVVTAQAATDSAATERLLAELEALADTELEALPDEDGPDGRSGE
>P37602 ~~~phsC~~~Thiosulfate reductase cytochrome B subunit PhsC~~~
MNTIWGAELHYAPDYWPLWLIYAGVVVLLMLVGLVIHALLRRMLAPKTAGGEEHRDYLYSLAIRRWHWGNALLFVLLLLS
GLFGHFSLGPVALMVQVHTWCGFALLAFWVGFVLINLTTGNGRHYRVNFSGLVTRCIRQTRFYLFGIMKGEAHPFVATEQ
NKFNPLQQLAYLAIMYALVPLLIITGLLCLYPQVAGLGPVMLVLHMALAIIGLLFICAHLYLCTLGDTPGQIFRSMVDGY
HRHRTAPRGDKSAV
>D9XF47 6.2.1.67~~~phsC~~~Phosphinothricin tripeptide synthase PhsC~~~COG1020
MEDLQTRIAALSPKQRALFESRLRAAAAPGPDPAIPRRPDDDGPVPLSFAQHRLWFLDQLEPGRPVYNVSASLRIGRPVT
TEAVRDALGALTRRHEVLRTVFPADGGEPRQHIADSLTPPLTETDLRALPDSARAAAALRLCAEDKQRPFDLSSGPLLRC
LLLRLRDDDALLFLTFHHTVFDGWSIGLLRRDLTALLHAAETGTDAGLPPLPIQYADFADWQRRMLDEKRLGELLGYWRE
RIRGAPPVIDLPFDRPRPAVATTEGARRRFALPAELTTALRDLAARSGATPFMTMLTVFAALLHRWSGERDMVIGTPVAN
RARPELDDLIGFFANTLAMRVRIEPGMSFGDLLAQVRQTVVEALARQDLPFERLVDEARTERTLTHNPLFQVAFVMEDGR
DASELDTLLPERARDTHTPDSAKFDLTLVLTDRESTYTGYFEYNTALFEPVTIDRLGERLALLARSVAADPGWELAALPV
LTRDEIRHLKEVNAPVADDPRHHRTLHGVLEDSARRHPDHTAVEAPDRQLTYRELDEAANRLAHHLLALGVRPEQPVGVA
LDGTADAIVATFAVLKAGAVLLPLDPEYPAERLEHILRRSGATLLLTQRSLAGRFAGNDVTTVLLDDDATRAALADGPAD
RPGLPIAPDRLAYVIFTSGSTGVPKGVMVPHRGIGSLTRSAEQFAQTPDSRVLRFASPSFDVSLLELLMTFDAGATLVLE
PRALLVPGEDLARLIRERRVSTVLLSPSALSTLTAGELPGLRTVVMAGEAATLELAQQWCDGRDVFNGYGPTEATVLATI
ARCAPDRVPPLGRPVAGYTVHVLDDTLRPVPFGRQGELFLGGVGLARGYLDQPDVTADRFLPDPSGTEPGARLYRTGDVV
RWGADGELEFLGRTDHQVKLRGFRIELGEIETRLEDHPGVRTAVVLVRGEGSDRRLAGYAVRAPGEERPTAAGLRQWLRD
RLPGYMVPELFLVLDALPTSPNGKLDREALPDPLAQSGDTAGNRPPLLDPVEERISGIWQEVLGIAPPGSADNFFEVGGN
SLSATRIIARVNQAFGVRLPVRSLFVEPTLSGLARSVSAERAEELP
>P39123 2.4.1.1~~~glgP~~~Glycogen phosphorylase~~~COG0058
MFSSKERFADLFLKRLEMTCGKSFKDSAKLDQYKTLGNMVREYISADWIETNEKSRSNSGKQTYYLSIEFLLGQLLEQNL
MNLGVRDVVEAGLKEIGINLEEILQIENDAGLGNGGLGRLAACFLDSLASLNLPGHGMGIRYKHGLFEQKIVDGHQVELP
EQWLKNGNVWEVRNADQAVDVPFWGEVHMTEKSGRLHFRHEQATIVTAVPYDIPIIGYETGTVNTLRLWNAEPYAHYHGG
NILSYKRETEAVSEFLYPDDTHDEGKILRLKQQYFLVCASLKSIVNNYRKTHKSLSGLHKKVSIHINDTHPALAVPELMR
ILLDEENMSWEEAWHITVHTISYTNHTTLSEALEKWPIHLFKPLLPRMYMIIEEINERFCRAVWEKYPGDWKRIENMAIT
AHGVVKMAHLAIVGSYSVNGVAKIHSDILKEREMRDFHLLFPNRFNNKTNGIAHRRWLLKANPGLSAIITEAIGDEWVKQ
PESLIRLEPYATDPAFIEQFQNNKSKKKQELADLIFCTAGVVVNPESIFDVQVKRLHAYKRQLLNVLHIMYLYNRLKEDS
GFSIYPQTFIFGAKASPSYYYAKKIIKLIHSVAEKVNYDPAVKQLIKVVFLENYRVSMAERIFPASDVSEQISTASKEAS
GTGNMKFMMNGALTIGTHDGANIEILERVGPDCIYTFGLKADEVLSYQENGGYRSREYYQHDRRIRQVADQLINGFFEGE
ADEFESIFDSLLPHNDEYFVLKDFSSYADAQERIQADYRERRKWSEHSIVNIAHSGYFSSDRTIREYAKDIWGIKPMM
>P9WMW1 2.4.1.1~~~glgP~~~Glycogen phosphorylase~~~COG0058
MKALRRFTVRAHLPERLAALDQLSTNLRWSWDKPTQDLFAAIDPALWEQCGHDPVALLGAVNPARLDELALDAEFLGALD
ELAADLNDYLSRPLWYQEQQDAGVAAQALPTGIAYFSLEFGVAEVLPNYSGGLGILAGDHLKSASDLGVPLIAVGLYYRS
GYFRQSLTADGWQHETYPSLDPQGLPLRLLTDANGDPVLVEVALGDNAVLRARIWVAQVGRVPLLLLDSDIPENEHDLRN
VTDRLYGGDQEHRIKQEILAGIGGVRAIRAYTAVEKLTPPEVFHMNEGHAGFLGIERIRELVTDAGLDFDTALTVVRSST
VFTTHTPVPAGIDRFPLEMVQRYVNDQRGDGRSRLLPGLPADRIVALGAEDDPAKFNMAHMGLRLAQRANGVSLLHGRVS
RAMFNELWAGFDPDEVPIGSVTNGVHAPTWAAPQWLQLGRELAGSDSLREPVVWQRLHQVDPAHLWWIRSQLRSMLVEDV
RARLRQSWLERGATDAELGWIATAFDPNVLTVGFARRVPTYKRLTLMLRDPDRLEQLLLDEQRPIQLIVAGKSHPADDGG
KALIQQVVRFADRPQVRHRIAFLPNYDMSMARLLYWGCDVWLNNPLRPLEACGTSGMKSALNGGLNLSIRDGWWDEWYDG
ENGWEIPSADGVADENRRDDLEAGALYDLLAQAVAPKFYERDERGVPQRWVEMVRHTLQTLGPKVLASRMVRDYVEHYYA
PAAQSFRRTAGAQFDAARELADYRRRAEEAWPKIEIADVDSTGLPDTPLLGSQLTLTATVRLAGLRPNDVTVQGVLGRVD
AGDVLMDPVTVEMAHTGTGDGGYEIFSTTTPLPLAGPVGYTVRVLPRHPMLAASNELGLVTLA
>P13065 1.12.99.6~~~~~~Periplasmic [NiFeSe] hydrogenase large subunit~~~
MSQAATPAADGKVKISIDPLTRVEGHLKIEVEVKDGKVVDAKCSGGMFRGFEQILRGRDPRDSSQIVQRICGVCPTAHCT
ASVMAQDDAFGVKVTTNGRITRNLIFGANYLQSHILHFYHLAALDYVKGPDVSPFVPRYANADLLTDRIKDGAKADATNT
YGLNQYLKALEIRRICHEMVAMFGGRMPHVQGMVVGGATEIPTADKVAEYAARFKEVQKFVIEEYLPLIYTLGSVYTDLF
ETGIGWKNVIAFGVFPEDDDYKTFLLKPGVYIDGKDEEFDSKLVKEYVGHSFFDHSAPGGLHYSVGETNPNPDKPGAYSF
VKAPRYKDKPCEVGPLARMWVQNPELSPVGQKLLKELYGIEAKKFRDLGDKAFSIMGRHVLVAEETWLTAVAVEKWLKQV
QPGAETYVKSEIPDAAEGTGFTEAPRGALLHYLKIKDKKIENYQIVSATLWNANPRDDMGQRGPIEEALIGVPVPDIKNP
VNVGRLVRSYDPULGCAVHVLHAETGEEHVVNID
>P00490 2.4.1.1~~~malP~~~Maltodextrin phosphorylase~~~COG0058
MSQPIFNDKQFQEALSRQWQRYGLNSAAEMTPRQWWLAVSEALAEMLRAQPFAKPVANQRHVNYISMEFLIGRLTGNNLL
NLGWYQDVQDSLKAYDINLTDLLEEEIDPALGNGGLGRLAACFLDSMATVGQSATGYGLNYQYGLFRQSFVDGKQVEAPD
DWHRSNYPWFRHNEALDVQVGIGGKVTKDGRWEPEFTITGQAWDLPVVGYRNGVAQPLRLWQATHAHPFDLTKFNDGDFL
RAEQQGINAEKLTKVLYPNDNHTAGKKLRLMQQYFQCACSVADILRRHHLAGRKLHELADYEVIQLNDTHPTIAIPELLR
VLIDEHQMSWDDAWAITSKTFAYTNHTLMPEALERWDVKLVKGLLPRHMQIINEINTRFKTLVEKTWPGDEKVWAKLAVV
HDKQVHMANLCVVGGFAVNGVAALHSDLVVKDLFPEYHQLWPNKFHNVTNGITPRRWIKQCNPALAALLDKSLQKEWAND
LDQLINLEKFADDAKFRQQYREIKQANKVRLAEFVKVRTGIEINPQAIFDIQIKRLHEYKRQHLNLLHILALYKEIRENP
QADRVPRVFLFGAKAAPGYYLAKNIIFAINKVADVINNDPLVGDKLKVVFLPDYCVSAAEKLIPAADISEQISTAGKEAS
GTGNMKLALNGALTVGTLDGANVEIAEKVGEENIFIFGHTVEQVKAILAKGYDPVKWRKKDKVLDAVLKELESGKYSDGD
KHAFDQMLHSIGKQGGDPYLVMADFAAYVEAQKQVDVLYRDQEAWTRAAILNTARCGMFSSDRSIRDYQARIWQAKR
>P13063 1.12.99.6~~~~~~Periplasmic [NiFeSe] hydrogenase small subunit~~~
MSLSRREFVKLCSAGVAGLGISQIYHPGIVHAMTEGAKKAPVIWVQGQGCTGCSVSLLNAVHPRIKEILLDVISLEFHPT
VMASEGEMALAHMYEIAEKFNGNFFLLVEGAIPTAKEGRYCIVGETLDAKGHHHEVTMMELIRDLAPKSLATVAVGTCSA
YGGIPAAEGNVTGSKSVRDFFADEKIEKLLVNVPGCPPHPDWMVGTLVAAWSHVLNPTEHPLPELDDDGRPLLFFGDNIH
ENCPYLDKYDNSEFAETFTKPGCKAELGCKGPSTYADCAKRRWNNGINWCVENAVCIGCVEPDFPDGKSPFYVAE
>P65722 4.2.1.96~~~phhB~~~Putative pterin-4-alpha-carbinolamine dehydratase~~~COG2154
MARNRLTESEMNEALRALDGWQKVDGREAITRSFKFKDFSTAFGFMAQAALYAEKLDHHPEWFNAYNRVDVTLATHSENG
VTELDIKMARKMNAIAG
>A0R2K7 4.2.1.96~~~~~~Putative pterin-4-alpha-carbinolamine dehydratase~~~COG2154
MAVLSNDQVDAALPNLPGWERAAGALRRSVKFPTFLDGIDAVRRVAEFAEEKDHHPDIDIRWRTVTFALVTHAAGGITEK
DVQMAEEINRILSD
>P9WI93 4.2.1.96~~~~~~Putative pterin-4-alpha-carbinolamine dehydratase~~~COG2154
MAVLTDEQVDAALHDLNGWQRAGGVLRRSIKFPTFMAGIDAVRRVAERAEEVNHHPDIDIRWRTVTFALVTHAVGGITEN
DIAMAHDIDAMFGA
>Q02I31 4.2.1.96~~~~~~Putative pterin-4-alpha-carbinolamine dehydratase~~~
MTALTQAHCEACRADAPHVSDEELPVLLRQIPDWNIEVRDGIMQLEKVYLFKNFKHALAFTNAVGEISEAEGHHPGLLTE
WGKVTVTWWSHSIKGLHRNDFIMAARTDEVAKTAEGRK
>Q05182 1.14.12.7~~~pht2~~~Phthalate 4,5-dioxygenase oxygenase reductase subunit~~~
MSSSEQLDDGFTGLKVIAKTEIAQGIFRFELAHPQGMLLPAFTAGAHLRVRVPNGSIRNYSLSNDPQERERYVIAVKRDA
NGRGGSVSMADDIEAGDLLPVATPQNEFELIENARQFIFVAGGIGITPILSMMRHLKASTDLPFKLYYCTRNPELTAFRD
ELLGAEFANTVVIHHDFGNRADAYDFWPVFDKPSSGTHVYCCGPRPLMDSVLDMTGHWPPGSIHFESFGVDQSRFAENRP
FSVTLGRSGIDLEIPVDRSILEVLRDNGIRAPSSCESGTCGSCRTRLIEGDVEHRDMVLREDEQHDQIMICVSRARNDVL
VLDL
>Q05183 1.14.12.7~~~pht3~~~Phthalate 4,5-dioxygenase oxygenase subunit~~~
MLTPEENLLLCRVEGDAPMGQMMRRHWTPVCLLEEVSEPDGTPVRARLFGEDLVVFRDTDGRVGVMDEYCPHRRVSLIYG
RNENSGLRCLYHGWKMDVDGNVVEMVSEPAASNMCQKVKHTAYKTREWGGFVWAYMGPQDAIPEFVPPAWAPHEHVRVSI
AKAIIPCNWAQILEGAIDSAHSSSLHSSDFVPARVGGAEATSKNWLRPSTDKAPRMQVERTSYGFRYAALRRPIQNAATS
EYVRSTVFVAPATALIPPNNLYNVANINVPIDDTHTAFYFMAWGNPDNTPETETWRKFLGQQVGIDLDDSYRPLRNDGNR
FFQDREAMKNGNFTGIKGFPNQDIAMWVTMGPIADRSDERLGASDLAVVEFRRVMLDALAAFQAGESAIGTGEKAIPSRI
CSFQAIVSKDIDWRDYQARYVWALDDANIVAEPDYEVHT
>Q05184 1.-.-.-~~~pht4~~~Putative 4,5-dihydroxyphthalate dehydrogenase~~~
MMASHAESTARLRLGVVGLGRAFTLMLPTFLADRRVQLVGACDPREQARRQFERDFDAPAYETIEDLAADSNVDALYIAS
PHQFHAEHTRIAAANRKHVLVEKPMALSLDECDRMIADCAEAGVKLIVGHCHSFDTPYLRTRELIGSGEFGAVKMIQALN
YTDYLQRPRRPEELSTAEGGGAVFSQAAHQVDVVRLLAGSRATRVRAAVGNWDPARPTEGAYTATLWFENGAFASITYNG
YGHFDSDEWMDWVGEMGKPKNPEAYGGARRLLQQVQTADEEARLKAEGTYGGTRYVSPSPTDDATAFQHFGPIIVSCEGG
DLRPMADAVMIYRPHSRDRMMLERPTVPRSEVIDELYLAQFYGVTPLHDGEWARDTLEICLAMLRSSEEQRDITLGVNVE
DTQWRENPSS
>Q05185 4.1.1.55~~~pht5~~~4,5-dihydroxyphthalate decarboxylase~~~
MAREPIIMNKLNLSIAVGNYVRIRPLVDGEVQIDGVDPIFMLQDPEEIFFRAFRHADYDICELSLSSYSVKTAAGTSPYI
AVPVFPSRAFRHTSIYIRNDRGIESAADLKGKRIGVPEYQLTANVWVRLFLEEDHGLKASDVTWVRGGYEETGRLEKIVL
KLPADVIVENAPETETLSGMLASGELDAVIGPRAPSCFTQGHPKVSYLYRDPQGAASDWYRALSYSRSCTCWGSGARWPS
STLGYPGPLPKHSRSPSP
>Q7N8B1 2.4.2.31~~~phxA~~~Photox toxin~~~
MEKIMPISPISGHMPLSQIQVPQHATTSPLLEQGNRLFEQSVRRGPLHFQSSSLKHLCAELRQLQNAPSSMQARRVQDAI
QHWENHHPKEVMARSTRLAELKQALAEQGTVGRTLQSKVMATGPQVILKQPMPALPQSIAAQITKAQTGCTTTLVSSATA
ELIKHNQNNQQHIKDSDGRKPVNNMPPPPPPPMADKTQKVKKWVVNTDSKQLQALRYYSAQGYNLINTYLRGGEYVKHQA
IETLLSRNYLHSNEPTPQEFDAGMRAYIQDVTEGLNELAITDHKKVYRGLKFDKSELKNLLDQYTTEGNIIAEKGFLSTS
PDKAWVNDTILVINLESGHKGRILGDAAHFKGEAEMLFPPESKMLVEKVLNRDDKEFDSHFSNLRLTDDASADTTRIKRI
INIKMLNE
>Q55168 2.7.13.3~~~cph1~~~Phytochrome-like protein Cph1~~~COG4251
MATTVQLSDQSLRQLETLAIHTAHLIQPHGLVVVLQEPDLTISQISANCTGILGRSPEDLLGRTLGEVFDSFQIDPIQSR
LTAGQISSLNPSKLWARVMGDDFVIFDGVFHRNSDGLLVCELEPAYTSDNLPFLGFYHMANAALNRLRQQANLRDFYDVI
VEEVRRMTGFDRVMLYRFDENNHGDVIAEDKRDDMEPYLGLHYPESDIPQPARRLFIHNPIRVIPDVYGVAVPLTPAVNP
STNRAVDLTESILRSAYHCHLTYLKNMGVGASLTISLIKDGHLWGLIACHHQTPKVIPFELRKACEFFGRVVFSNISAQE
DTETFDYRVQLAEHEAVLLDKMTTAADFVEGLTNHPDRLLGLTGSQGAAICFGEKLILVGETPDEKAVQYLLQWLENREV
QDVFFTSSLSQIYPDAVNFKSVASGLLAIPIARHNFLLWFRPEVLQTVNWGGDPNHAYEATQEDGKIELHPRQSFDLWKE
IVRLQSLPWQSVEIQSALALKKAIVNLILRQAEELAQLARNLERSNADLKKFAYIASHDLQEPLNQVSNYVQLLEMRYSE
ALDEDAKDFIDFAVTGVSLMQTLIDDILTYAKVDTQYAQLTFTDVQEVVDKALANLKQRIEESGAEIEVGSMPAVMADQI
QLMQVFQNLIANGIKFAGDKSPKIKIWGDRQEDAWVFAVQDNGIGIDPQFFERIFVIFQRLHTRDEYKGTGMGLAICKKI
IEGHQGQIWLESNPGEGSTFYFSIPIGN
>Q55434 ~~~cph2~~~Phytochrome-like protein cph2~~~COG2203
MNPNRSLEDFLRNVINKFHRALTLRETLQVIVEEARIFLGVDRVKIYKFASDGSGEVLAEAVNRAALPSLLGLHFPVEDI
PPQAREELGNQRKMIAVDVAHRRKKSHELSGRISPTEHSNGHYTTVDSCHIQYLLAMGVLSSLTVPVMQDQQLWGIMAVH
HSKPRRFTEQEWETMALLSKEVSLAITQSQLSRQVHQQQVQEALVQRLETTVAQYGDRPETWQYALETVGQAVEADGAVL
YIAPDLTGSVAQHYQWNLRFDWGNWLETSLWQELMRGQPSAAMEPMAAVQSTWEKPRPFTSVAPLPPTNCVPHGYTLGEL
EQRSDWIAPPESLSAENFQSFLIVPLAADQQWVGSLILLRKEKSLVKHWAGKRGIDRRNILPRLSFEAWEETQKLVPTWN
RSERKLAQVASTQLYMAITQQFVTRLITQQTAYDPLTQLPNWIIFNRQLTLALLDALYEGKMVGVLVIAMDRFKRINESF
GHKTGDGLLQEVADRLNQKLSPLAAYSPLLSRWHGDGFTILLTQISDNQEMIPLCERLLSTFQEPFFLQGQPIYLTASMG
ISTAPYDGETAESLLKFAEIALTRAKCQGKNTYQFYRPQDSAPMLDRLTLESDLRQALTNQEFVLYFQPQVALDTGKLLG
VEALVRWQHPRLGQVAPDVFIPLAEELGLINHLGQWVLETACATHQHFFRETGRRLRMAVNISARQFQDEKWLNSVLECL
KRTGMPPEDLELEITESLMMEDIKGTVVLLHRLREEGVQVAIDDFGTGYSSLSILKQLPIHRLKIDKSFVNDLLNEGADT
AIIQYVIDLANGLNLETVAEGIESEAQLQRLQKMGCHLGQGYFLTRPLPAEAMMTYLYYPQILDFGPTPPLPKVALPETE
TEAGQGNVGDRPLPNSLNRENPWTEKLHDYVLLKERLQQRNVKEKLVLKIANKIRASLNINDILYSTVTEVRQFLNTDRV
VLFKFNSQWSGQVVTESHNDFCRSIINDEIDDPCFKGHYLRLYREGRVRAVSDIEKADLADCHKELLRHYQVKANLVVPV
VFNENLWGLLIAHECKTPRYWQEEDLQLLMELATQVAIAIHQGELYEQLETANIRLQQISSLDALTQVGNRYLFDSTLER
EWQRLQRIREPLALLLCDVDFFKGFNDNYGHPAGDRCLKKIADAMAKVAKRPTDLVARYGGEEFAIILSETSLEGAINVT
EALQVEVANLAIPHTVSGTGHVTLSIGIAVYTPERHINPNALVKAADLALYEAKAKGRNQWLAYEGSQLPHVDGEV
>O31097 3.1.3.8~~~phyC~~~3-phytase~~~
MNHSKTLLLTAAAGLMLTCGAVSSQAKHKLSDPYHFTVNAAAETEPVDTAGDAADDPAIWLDPKTPQNSKLITTNKKSGL
VVYSLDGKMLHSYNTGKLNNVDIRYDFPLNGKKVDIAAASNRSEGKNTIEIYAIDGKNGTLQSMTDPDHPIATAINEVYG
FTLYHSQKTGKYYAMVTGKEGEFEQYELKADKNGYISGKKVRAFKMNSQTEGMAADDEYGRLYIAEEDEAIWKFSAEPDG
GSNGTVIDRADGRHLTRDIEGLTIYYAADGKGYLMASSQGNSSYAIYDRQGKNKYVADFRITDGPETDGTSDTDGIDVLG
FGLGPEYPFGIFVAQDGENIDHGQKANQNFKIVPWERIADQIGFRPLANEQVDPRKLTDRSGK
>Q46806 3.5.2.-~~~hyuA~~~D-phenylhydantoinase~~~COG0044
MRVLIKNGTVVNADGQAKQDLLIESGIVRQLGNNISPQLPYEEIDATGCYVFPGGVDVHTHFNIDVGIARSCDDFFTGTR
AAACGGTTTIIDHMGFGPNGCRLRHQLEVYRGYAAHKAVIDYSFHGVIQHINHAILDEIPMIVEEGLSSFKLYLTYQYKL
NDDEVLQALRRLHESGALTTVHPENDAAIASKRAEFIAAGLTAPRYHALSRPLECEAEAIARMINLAQIAGNAPLYIVHL
SNGLGLDYLRLARANHQPVWVETCPQYLLLDERSYDTEDGMKFILSPPLRNVREQDKLWCGISDGAIDVVATDHCTFSMA
QRLQISKGDFSRCPNGLPGVENRMQLLFSSGVMTGRITPERFVELTSAMPARLFGLWPQKGLLAPGSDGDVVIIDPRQSQ
QIQHRHLHDNADYSPWEGFTCQGAIVRTLSRGETIFCDGTFTGKAGRGRFLRRKPFVPPVL
>P74653 2.7.1.182~~~vte5~~~Phytol kinase~~~COG0170
MGIEQNNPMALPLWIAVGLAATYLGAVVLTAELLNRLSLSPAEVTRKIVHIGAGQVVLIAWWLSIPGWVGAIAGVFAAGI
AVLSYRLPILPSLESVGRHSYGTLFYALSIGLLVGGFFSLGLPIFAAIGILVMAWGDGLAALVGQRWGRHRYQVFGFRKS
WEGTLTMVLASFLVTVVFLSYTFGFTVIVLVVAGTVAIASAGLESFSRWGIDNLTVPLGSALIAWAGSYLWLG
>Q0PEV3 ~~~phyR~~~Phyllosphere-induced regulator PhyR~~~COG0784
MSTAQLVVQHLPYLRRYARALTGSQVAGDAYVAATLETLVNEPETLGRSTNVKADLFRVFTRIWNSLSVNGHSDQVQHDL
PAEVRLGQITPLPRQAFLLSCLEGFSEEDAGVILDVDVSKVRDLVDEAGRELAADMATEILIIEDEPLIAMDLEALVEGL
GHNVIGVARTRTEAVKIASESKRPGLILADIQLADGSSGLDAVNDLLKTFEVPVIFITAYPERFLTGERPEPAFLIAKPF
QPANVSAVISQALFFQQSARRREAHNA
>O66037 3.1.3.8~~~phy~~~3-phytase~~~
MNHSKTLLLTAAAGLMLTCGAVSSQAKHKLSDPYHFTVNAAAETEPVDTAGDAADDPAIWLDPKNPQNSKLITTNKKSGL
AVYSLEGKMLHSYHTGKLNNVDIRYDFPLNGKKVDIAAASNRSEGKNTIEIYAIDGKNGTLQSITDPNRPIASAIDEVYG
FSLYHSQKTGKYYAMVTGKEGEFEQYELNADKNGYISGKKVRAFKMNSQTEGMAADDEYGSLYIAEEDEAIWKFSAEPDG
GSNGTVIDRADGRHLTPDIEGLTIYYAADGKGYLLASSQGNSSYAIYERQGQNKYVADFQITDGPETDGTSDTDGIDVLG
FGLGPEYPFGLFVAQDGENIDHGQKANQNFKMVPWERIADKIGFHPQVNKQVDPRKMTDRSGK
>P42094 3.1.3.8~~~phy~~~3-phytase~~~COG4247
MKVPKTMLLSTAAGLLLSLTATSVSAHYVNEEHHFKVTAHTETDPVASGDDAADDPAIWVHEKHPEKSKLITTNKKSGLV
VYDLDGKQLHSYEFGKLNNVDLRYDFPLNGEKIDIAAASNRSEGKNTIEVYAIDGDKGKLKSITDPNHPISTNISEVYGF
SLYHSQKTGAFYALVTGKQGEFEQYEIVDGGKGYVTGKKVREFKLNSQTEGLVADDEYGNLYIAEEDEAIWKFNAEPGGG
SKGQVVDRATGDHLTADIEGLTIYYAPNGKGYLMASSQGNNSYAMYERQGKNRYVANFEITDGEKIDGTSDTDGIDVLGF
GLGPKYPYGIFVAQDGENIDNGQAVNQNFKIVSWEQIAQHLGEMPDLHKQVNPRKLKDRSDG
>Q715L4 3.7.1.4~~~phy~~~Phloretin hydrolase~~~
MEEDFNMSTPGVKVGVXEEEKKLSYYKYYEQDLAPVPAEKIAILQGGPIAPEKCIPFDERNKFLKGEDDEYANIGFGVAA
DGTALVCNTTYMPGVTGEMLDWWFPWHSVGSDLRYKIWDPEDHYFARAYPASYVVDPNVPMNQKTWGVDHYIMEDVGPGP
EFLKLCFKRPADFGYDESIIGTEKCESLVCAIGESSCAAAMTHKWHPYKDGVLFESRFWIGYRIDEEGNIVKAIPEGVSI
PPFVPQGLFAHNIKEFTNLAAILPTLYAEEKDTF
>B1MK49 3.7.1.4~~~~~~Phloretin hydrolase~~~
MHPITYYPVDTQRLVRSNAERIRHKPYAHYFNPDVAVPEEVFAALKAPLEPEQVLGTSSTELNRLLEPGYLEGETGYCGL
PDGAGYTSSLVRFPGATPEMFRWWFWWHSFEPERYSLWHPWCHADIWRTDPETETAPNLTDEQRYVGSTHHINEYIGQDP
LDIEITFIDPARWGFDADGFAAAGIGAHACGSVLMKGSHMRLATMVHLARITDDGFELRSRYWIADRAEPRHDPVAGIAQ
LTTVPGFSGERQAYEQLVHDQTEFNHLATFLPDIYQEFGPR
>Q9HWH1 ~~~phzA1~~~Phenazine biosynthesis protein PhzA1~~~
MNGQRYRETPLDIERLRRLNRATVERYMAMKGAERLQRHSLFVEDGCAGNWTTESGEPLVFRGHESLRRLAEWLERCFPD
WEWHNVRIFETEDPNHFWVECDGRGKALVPGYPQGYCENHYIHSFELENGRIKRNREFMNPIQKLRALGIAVPQIKRDGI
PT
>Q9I2J9 ~~~phzA2~~~Phenazine biosynthesis protein PhzA2~~~
MREYQRLKGFTDNLELRRRNRATVEHYMRMKGAERLQRHSLFVEDGCAGNWTTESGEPLVFRGHESLRRLAEWLERCFPD
WEWHNVRIFETEDPNHFWVECDGRGKALVPGYPQGYCENHYIHSFELENGRIKRNREFMNPMQKLRALGIAVPQIKRDGI
PT
>O69753 ~~~phzB1~~~Phenazine biosynthesis protein PhzB1~~~
MPDTTNPIGFTDANELREKNRATVEKYMNTKGQDRLRRHELFVEDGCGGLWTTDTGSPIVIRGKDKLAEHAVWSLKCFPD
WEWYNINIFGTDDPNHFWVECDGHGKILFPGYPEGYYENHFLHSFELEDGKIKRNREFMNVFQQLRALSIPVPQIKREGI
PT
>Q02L47 ~~~phzB2~~~Phenazine biosynthesis protein PhzB 2~~~
MLDNAIPQGFEDAVELRRKNRETVVKYMNTKGQDRLRRHELFVEDGCGGLWTTDTGSPIVIRGKDKLAEHAVWSLKCFPD
WEWYNIKVFETDDPNHFWVECDGHGKILFPGYPEGYYENHFLHSFELDDGKIKRNREFMNVFQQLRALSIPVPQIKREGI
PT
>Q9S508 ~~~phzB2~~~Phenazine biosynthesis protein PhzB2~~~
MLDNAIPQGFEDAVELRRKNRETVVKYMNTKGQDRLRRHELFVEDGCGGLWTTDTGSPIVIRGKDKLAEHAVWSLKCFPD
WEWYNIKVFETDDPNHFWVECDGHGKILFPGYPEGYYENHFLHSFELDDGKIKRNREFMNVFQQLRALSIPVPQIKREGI
PT
>P0DPB9 3.3.2.15~~~phzD1~~~Phenazine biosynthesis protein PhzD1~~~
MSGIPEITAYPLPTAQQLPANLARWSLEPRRAVLLVHDMQRYFLRPLPESLRAGLVANAARLRRWCVEQGVQIAYTAQPG
SMTEEQRGLLKDFWGPGMRASPADREVVEELAPGPDDWLLTKWRYSAFFHSDLLQRMRAAGRDQLVLCGVYAHVGVLIST
VDAYSNDIQPFLVADAIADFSEAHHRMALEYAASRCAMVVTTDEVLE
>P0DPC1 3.3.2.15~~~phzD2~~~Phenazine biosynthesis protein PhzD2~~~
MSGIPEITAYPLPTAQQLPANLARWSLEPRRAVLLVHDMQRYFLRPLPESLRAGLVANAARLRRWCVEQGVQIAYTAQPG
SMTEEQRGLLKDFWGPGMRASPADREVVEELAPGPDDWLLTKWRYSAFFHSDLLQRMRAAGRDQLVLCGVYAHVGVLIST
VDAYSNDIQPFLVADAIADFSEAHHRMALEYAASRCAMVVTTDEVLE
>Q51790 3.3.2.15~~~phzD~~~Phenazine biosynthesis protein PhzD~~~
MTGIPSIVPYALPTSRDLPANLAQWHIDPERAVLLVHDMQRYFLRPLPDALRDQVVGNAARIRQWAADNGVPVAYTAQPG
SMNEEQRGLLKDFWGPGMKASPTDREVVDALAPQPGDWLLTKWRYSAFFNSDLLQRLHASGRDQLILCGVYAHVGVLISS
VDAYSNDIQPFLVADAIADFSKEHHWMAMEYAASRCAMVITTDEVVL
>Q51792 5.3.3.17~~~phzF~~~Trans-2,3-dihydro-3-hydroxyanthranilate isomerase~~~
MHNYVIIDAFASVPLEGNPVAVFFDADDLPPAQMQRIAREMNLSESTFVLKPRNGGDALIRIFTPVNELPFAGHPLLGTA
IALGAHTDNHRLYLETQMGTIAFELERQNGSVIAASMDQPIPTWTALGRDAELLKALGISDSTFPIEIYHNGPRHVFVGL
PSIDALSALHPDHRALSNFHDMAINCFAGAGRRWRSRMFSPAYGVVEDAATGSAAGPLAIHLARHGQIEFGQPVEILQGV
EIGRPSLMFAKAEGRAEQLTRVEVSGNGVTFGRGTIVL
>Q396C5 1.10.3.16~~~phzG~~~Dihydrophenazinedicarboxylate synthase~~~
MNTSRFESLTGSVDVLFPEYDDPPSEPITLLKRWLATADVARVREPKALALATATSDGRISSRVIAFSSIDDRGVIFCTH
STSRKGRELTETGWASGLLYWRETGQQIMISGQAVPLEESENDKLWFGRSVPMHAMSSASHQSDELVDREALRAHAAELL
ALGVALPRPPRFVGYRLEPHEMEFWAASSDRLHRRLRYERDGNDWKTTQLQP
>Q51793 1.10.3.16~~~phzG~~~Dihydrophenazinedicarboxylate synthase~~~
MNGSIQGKPLLGKGMSESLTGTLDAPFPEYQTLPADPMSVLHNWLERARRVGIREPRALALATADSQGRPSTRIVVISEI
SDAGVVFSTHAGSQKGRELLHNPWASGVLYWRETSQQIILNGQAVRLPNAKADDAWLKRPYATHPMSSVSRQSEELQDVQ
AMRNAARQLAELQGPLPRPEGYCVFELRLESLEFWGNGQERLHERLRYDRSDTGWNVRRLQP
>A0A172J1V3 2.1.1.-~~~phzM~~~Phenazine O-methyltransferase PhzM~~~
MTENNRAGAVPLSSILLQMITGYWVTQSLYVAAKLGIADLVADAPKPIEELAAKTGAKAPLLKRVLRTIASIGVFTETEP
GIFGITPLAALLRSGTPDSMRPQAIMHGEEQYRAWADVLHNVQTGETAFEKEFGTSYFGYLAKHPEADRVFNEAQAGYTK
QVAHAVVDAYDFSPFKTVIDIGAGYGPLLSAILRSQPEARGILFDQPHVAQAAGKRLAEAGVGDRCGTVGGDFFVEVPAD
GDVYILSLLLHDWDDQRSIEILRNCRRAMPAHGKLLIVELVLPEGEEPFFGKWLDLHMLVLLGAQERTADEFKTLFAASG
FALERVLPTASGLSIVEARPI
>Q9HWH2 2.1.1.327~~~phzM~~~Phenazine-1-carboxylate N-methyltransferase~~~
MNNSNLAAARNLIQVVTGEWKSRCVYVATRLGLADLIESGIDSDETLAAAVGSDAERIHRLMRLLVAFEIFQGDTRDGYA
NTPTSHLLRDVEGSFRDMVLFYGEEFHAAWTPACEALLSGTPGFELAFGEDFYSYLKRCPDAGRRFLLAMKASNLAFHEI
PRLLDFRGRSFVDVGGGSGELTKAILQAEPSARGVMLDREGSLGVARDNLSSLLAGERVSLVGGDMLQEVPSNGDIYLLS
RIIGDLDEAASLRLLGNCREAMAGDGRVVVIERTISASEPSPMSVLWDVHLFMACAGRHRTTEEVVDLLGRGGFAVERIV
DLPMETRMIVAARA
>A0A172J1S0 1.14.13.-~~~phzS~~~Phenazine 1,6-dicarboxylic acid hydroxylase PhzS~~~
MTTATQTDIVIAGAGIGGLTTALALHAQGIERVVVLESANEIRPLGVGINVQPAAIAQLFALGLGEAIAATGIATRELRY
LDHAGITLWTEPRGLAAGDPYPQYAIHRGELQMLLLAAVRERLGADTVRTGLRVQDFEHTRTGIRVHAQERGNGGSTVSF
EATALVGADGLHSAVRARLHPDRCELLPARIQMWRGLTEVDEFLDGRSMIVANDDRSTRLIAYPCSARHAQHGRALINWV
CMVPDVAQDLTREASWDCSGQLKDVLPYFADWKFGWLDVPDLLSRSTQILEYPMVDRDPLPRWGIGRATLLGDAAHLMYP
VGANGASQAILDAVSLANELGDNSDTVEALQRYEAVRRPPTTAIVQANRDRDTAERAIATRPDPEKTAALAAITSSYRSI
VDRSHVQ
>Q9HWG9 1.14.13.218~~~phzS~~~5-methylphenazine-1-carboxylate 1-monooxygenase~~~
MSEPIDILIAGAGIGGLSCALALHQAGIGKVTLLESSSEIRPLGVGINIQPAAVEALAELGLGPALAATAIPTHELRYID
QSGATVWSEPRGVEAGNAYPQYSIHRGELQMILLAAVRERLGQQAVRTGLGVERIEERDGRVLIGARDGHGKPQALGADV
LVGADGIHSAVRAHLHPDQRPLSHGGITMWRGVTEFDRFLDGKTMIVANDEHWSRLVAYPISARHAAEGKSLVNWVCMVP
SAAVGQLDNEADWNRDGRLEDVLPFFADWDLGWFDIRDLLTRNQLILQYPMVDRDPLPHWGRGRITLLGDAAHLMYPMGA
NGASQAILDGIELAAALARNADVAAALREYEEARRPTANKIILANREREKEEWAAASRPKTEKSAALEAITGSYRNQVER
PR
>Q2FYD8 3.5.1.28~~~~~~Probable autolysin PH~~~COG0860
MLITKNQAEKWFDNSLGKQFNPDLFYGFQCYDYANMFFMIATGERLQGLYAYNIPFDNKARIEKYGQIIKNYDSFLPQKL
DIVVFPSKYGGGAGHVEIVESANLNTFTSFGQNWNGKGWTNGVAQPGWGPETVTRHVHYYDDPMYFIRLNFPDKVSVGDK
AKSVIKQATAKKQAVIKPKKIMLVAGHGYNDPGAVGNGTNERDFIRKYITPNIAKYLRHAGHEVALYGGSSQSQDMYQDT
AYGVNVGNNKDYGLYWVKSHGYDIVLEIHLDAAGESASGGHVIISSQFNADTIDKSIQDVIKNNLGQIRGVTPRNDLLNV
NVSAEININYRLSELGFITNKNDMDWIKKNYDLYSKLIAGAIHGKPIGGLVAGNVKTSAKNQKNPPVPAGYTLDKNNVPY
KKETGYYTVANVKGNNVRDGYSTNSRITGVLPNNATIKYDGAYCINGYRWITYIANSGQRRYIATGEVDKAGNRISSFGK
FSTI
>P0A3U7 ~~~~~~24.9 kDa protein in picA locus~~~
MGLPFQIEYGQTTGRPELIEDALRQFSAALALTADAGGLYVHGYDESRNQRWANPASGKSPAIWARAVGWLAMALVDALV
ILPDDSATAELRERTRRLLAGIIARQTQAGLWMQVLDNQGLAGNYAETSASAMFAYALLRAARLGLLRGEEAKAALSAGR
QALAALLETRLELDEQGVARLTGIVHVAGLGGFDGNYRDGTPDYYLTEPVVSDDAKGVGPLMMAYAESLLLAR
>P42790 3.4.21.100~~~pcp~~~Pseudomonalisin~~~
MKSSAAKQTVLCLNRYAVVALPLAIASFAAFGASPASTLWAPTDTKAFVTPAQVEARSAAPLLELAAGETAHIVVSLKLR
DEAQLKQLAQAVNQPGNAQFGKFLKRRQFLSQFAPTEAQVQAVVAHLRKNGFVNIHVVPNRLLISADGSAGAVKAAFNTP
LVRYQLNGKAGYANTAPAQVPQDLGEIVGSVLGLQNVTRAHPMLKVGERSAAKTLAAGTAKGHNPTEFPTIYDASSAPTA
ANTTVGIITIGGVSQTLQDLQQFTSANGLASVNTQTIQTGSSNGDYSDDQQGQGEWDLDSQSIVGSAGGAVQQLLFYMAD
QSASGNTGLTQAFNQAVSDNVAKVINVSLGWCEADANADGTLQAEDRIFATAAAQGQTFSVSSGDEGVYECNNRGYPDGS
TYSVSWPASSPNVIAVGGTTLYTTSAGAYSNETVWNEGLDSNGKLWATGGGYSVYESKPSWQSVVSGTPGRRLLPDISFD
AAQGTGALIYNYGQLQQIGGTSLASPIFVGLWARLQSANSNSLGFPAASFYSAISSTPSLVHDVKSGNNGYGGYGYNAGT
GWDYPTGWGSLDIAKLSAYIRSNGFGH
>Q7BS42 3.4.21.-~~~pic~~~Serine protease pic autotransporter~~~
MNKVYSLKYCPVTGGLIAVSELARRVIKKTCRRLTHILLAGIPAICLCYSQISQAGIVRSDIAYQIYRDFAENKGLFVPG
ANDIPVYDKDGKLVGRLGKAPMADFSSVSSNGVATLVSPQYIVSVKHNGGYRSVSFGNGKNTYSLVDRNNHPSIDFHAPR
LNKLVTEVIPSAVTSEGTKANAYKYTERYTAFYRVGSGTQYTKDKDGNLVKVAGGYAFKTGGTTGVPLISDATIVSNPGQ
TYNPVNGPLPDYGAPGDSGSPLFAYDKQQKKWVIVAVLRAYAGINGATNWWNVIPTDYLNQVMQDDFDAPVDFVSGLGPL
NWTYDKTSGTGTLSQGSKNWTMHGQKDNDLNAGKNLVFSGQNGAIILKDSVTQGAGYLEFKDSYTVSAESGKTWTGAGII
TDKGTNVTWKVNGVAGDNLHKLGEGTLTINGTGVNPGGLKTGDGIVVLNQQADTAGNIQAFSSVNLASGRPTVVLGDARQ
VNPDNISWGYRGGKLDLNGNAVTFTRLQAADYGAVITNNAQQKSQLLLDLKAQDTNVSEPTIGNISPFGGTGTPGNLYSM
ILNSQTRFYILKSASYGNTLWGNSLNDPAQWEFVGMDKNKAVQTVKDRILAGRAKQPVIFHGQLTGNMDVAIPQVPGGRK
VIFDGSVNLPEGTLSQDSGTLIFQGHPVIHASISGSAPVSLNQKDWENRQFTMKTLSLKDADFHLSRNASLNSDIKSDNS
HITLGSDRAFVDKNDGTGNYVIPEEGTSVPDTVNDRSQYEGNITLNHNSALDIGSRFTGGIDAYDSAVSITSPDVLLTAP
GAFAGSSLTVHDGGHLTALNGLFSDGHIQAGKNGKITLSGTPVKDTANQYAPAVYLTDGYDLTGDNAALEITRGAHASGD
IHASAASTVTIGSDTPAELASAETAASAFAGSLLEGYNAAFNGAITGGRADVSMHNALWTLGGDSAIHSLTVRNSRISSE
GDRTFRTLTVNKLDATGSDFVLRTDLKNADKINVTEKATGSDNSLNVSFMNNPAQGQALNIPLVTAPAGTSAEMFKAGTR
VTGFSRVTPTLHVDTSGGNTKWILDGFKAEADKAAAAKADSFMNAGYKNFMTEVNNLNKRMGDLRDTNGDAGAWARIMSG
AGSADGGYSDNYTHVQVGFDKKHELDGVDLFTGVTMTYTDSSADSHAFSGKTKSVGGGLYASALFESGAYIDLIGKYIHH
DNDYTGNFASLGTKHYNTHSWYAGAETGYRYHLTEDTFIEPQAELVYGAVSGKTFRWKDGDMDLSMKNRDFSPLVGRTGV
ELGKTFSGKDWSVTARAGTSWQFDLLNNGETVLRDASGEKRIKGEKDSRMLFNVGMNAQIKDNMRFGLEFEKSAFGKYNV
DNAVNANFRYMF
>Q8A7K2 ~~~~~~Phosphoinositol dihydroceramide synthase~~~COG0671
MPSKKETLTVIVIMALFLLLTAACIGLRSEHLLMAALYLVLFFAGLPTRKLAVALLPFAIFGISYDWMRICPNYEVNPID
VAGLYNLEKSLFGVMDNGVLVTPCEYFAVHHWAVADVFAGIFYLCWVPVPILFGLCLYFKKERKTYLRFALVFLFVNLIG
FAGYYIHPAAPPWYAINYGFEPILNTPGNVAGLGRFDEIFGVTIFDSIYGRNANVFAAVPSLHAAYMVVALVYAIIGKCR
WYVIALFSVIMAGIWGTAIYSCHHYIIDVLLGISCALLGWLFFEYGLMKIRGFRNFFDRYYQYIK
>P10030 ~~~pifC~~~Transcriptional repressor PifC~~~
MLSQLNLRFHKKLIEALKTRAGRENTSVNALAERFLDDGLKTVAPGDGYFQLIADPEATVRQLYRHIILGQTFGTSALSR
DELRFVLVHVREAFLRGHNRLATLPALDTLLDITGNLLAWQVEHDRPVDGHYLKGIFRLAGKNWTEEFEAFRAALRPVVD
QMYAEHLLRPLESDCFGLAEVPDAVLAEIFTLPRLKAVFPLMLRGLDWNTEQARTLAQELRPVISAVTETIEAGTLRLEI
RVDGQHPGERPGAWYTTPRLHLLITGQDFVVPYGWEALSELLGLFTLYARHPEALTHGHQGERVMFSPPGNVTPEGFFGI
DGLRIFMPAEAFETLVRELATRCQEGPLAEALTGLRCLYGDL
>G3XCZ8 1.14.99.58~~~pigA~~~Heme oxygenase PigA~~~
MDTLAPESTRQNLRSQRLNLLTNEPHQRLESLVKSKEPFASRDNFARFVAAQYLFQHDLEPLYRNEALARLFPGLASRAR
DDAARADLADLGHPVPEGDQSVREADLSLAEALGWLFVSEGSKLGAAFLFKKAAALELDENFGARHLAEPEGGRAQGWKS
FVAILDGIELNEEEERLAAKGASDAFNRFGDLLERTFA
>Q5W271 1.3.8.14~~~pigA~~~L-prolyl-[peptidyl-carrier protein] dehydrogenase~~~COG1960
MDFNLSNSQSDIYESAYRFACDVLDQDAQTRISQKILSTELWKKAAAYGFAHGPVSHQFGGSELGALDTALMIEALGKGS
RDIGLSFSLCAHLCACVIPLYRFGSSELKDKYLESLVTGKLIAANAATEPDAGSDIYNMQATAQPCEGGYILNGKKIFIT
NAPIADVFIIYAKTNPDHGFLGVSAFLIEKGTPGLNVGEVIPKDCLSNCPWSEIVFNDIFIPQSQRIGMEGAGGAIFHDS
MIWEKGCLSALFVGGLARLLETTLEYAKARQQFGKAIGQFQSVSNRIIDMKLRLEQCRLMLYRACWKHDQGQDAEADIAM
SKLLISEYAVQSGLDAIQTFGGAAMDQELGLVRHLLNMIPSRIFSGTNDIQKEIIARKLGLRGTSS
>Q5W252 6.4.-.-~~~pigC~~~Prodigiosin synthesizing transferase PigC~~~
MNPTLVVELSGDKTLEPHRLGGKAHSLNHLIHAGLPVPPAFCITAQAYRQFIEFAVPGALLDTGAPGNVRDMILSAAIPA
PLDLAIRHACKQLGDGASLAVRSSALEEDGLTHSFAGQYDTYLHVRGDDEVVRKVQSCWASLWAERAAQYSRTSAAQSDI
AVVLQIMVDADAAGVMFTQDPLTGDANHIVIDSCWGLGEGVVSGQVTTDSFILDKASGEIRERQIRHKPHYCQRDPQGRV
TLLQTPEARRDAPSLTPEQLQQLARLARQTRMIYGAELDIEWAVKDDRVWLLQARPITTQAKPVQMLYANPWESDPTIKE
RAFFSRMDTGEIVTGLMTPLGLSFCQFYQKHIHGPAIKTMGLADIGDWQIYMGYLQGYVYLNISGSAYMLRQCPPTRDEM
KFTTRYATADIDFSGYKNPYGPGVQGWAYLKSAWHWLKQQRHNLRSAGATVDAMIALRQRETRRFLALDLTTMTHQELER
ELSRIDGYFLDSCAAYMPFFLQSFALYDALALTCERYLKGRGNGLQNRIKASMNNLRTIEVTLGILSLVETVNRQPALKA
LFERHSAQELVTVLPTDPESRAFWQSDFSAFLFEFGARGRQEFELSLPRWNDDPSYLLQVMKMYLQHPVDLHTKLRETER
LRHEDSAALLKAMPWFGRMKLKFITKLYGVMAERREATRPTFVTETWFYRRIMLEVLRRLEAQGLVKSADLPYVDFERFR
AFMAGEQSAQEAFAADLIERNRHQHLLNLHAEEPPMAIVGGYQPRMKAPTAENAAGMLSGLAASPGKVVAKARVITDLLA
QAGELQPNEILVARFTDASWTPLFALAAGIVTDIGSALSHSCIVAREFGIPAAVNLKNATQLINSGDTLILDGDSGTVII
QRGERADG
>Q5W269 6.4.-.-~~~pigC~~~Prodigiosin synthesizing transferase PigC~~~COG0574
MNQPLVVEISGDKALEHHHLGGKGYSLNNLIHAGLPVPSAFCVTAQAYQQFIEEVVPGAELTDGDLIAVRDAILHADIPD
SLKQAIGDAYQHLGHDTTIAVRSSALDEDGQRQSFAGQYETYLHVKGSEAVLHKVQACWASLWAERAAQYRHESASHSAI
AVILQVMVDADAAGVMFTQDPLSGSTDKVVIDSCWGLGEGVVSGQVTTDSFTLDKATGELCDQQIRHKPNYCQRDEHGLV
TLLQTPEAKRDLPSLTPAQLQQLVTLARQAQLIYSTELDIEWAVKDDKVWLLQARPVTTSAKTANVIYANPWESDPAAKE
GAFFSRMDTGEIVTGLMTPLGLSFCQFYQKHIHGPAIKTMGLADISHWQIYMGYIQGYVYLNISGSAYMLRQCPPTRNEM
KFTTRYATDEIDFKDYKNPYGAGVQGWDYAKSCWYWLKQQVRNMRSAARTVEQMIALRQDETTRFLGLDLTAMTLQQLDQ
ELQRIDRFFLDSCAAYMPFFLQSFALYDALAQACERHIKDGKGLQNRIKASMNNLRTIEVTLGIIKLVATVNQQTELKAL
FEQHRADELVTLLPVHDISRAFWQGDFEDFLVEFGSRGRQEFDLSIPRWRDDPSYLLQVMKMYLQHPVDLHKKLRETELL
RQQDSEALFSAMSWSGRFKLKTLIKLYGMMAERREATRPTFITETWFYRCIMLEVLRRLDAQGIASSADLPYVDFEQFRA
YVAGTIPAEQAFSKARLDQNRHQHLFNLHAEEPPMAIVGPYTPKVKAPTQDDKTIRSLTGLAASPGNVVAKARVITDLQV
QAGEFQPDEILVARFTDASWTPLFALAAGIVTDIGSTLSHSCIVAREFGIPAVVNLKTATQIINSGDMLILDGDSGTVII
QHQEERNHDG
>Q5W251 2.2.1.12~~~pigD~~~Thiamine diphosphate dependent-3-acetyloctanal synthase PigD~~~
MRAATAACRDRRGLCRAEFARLAEAVTPFWLHKELIMTTLTGQARLTNSAAYEQVWQAERQACRTDADPDTLTVGVVVVT
RNPAFFQTGLSVLNDIRDYVFNRVHIQSEMPLKLLDLAADSLYLAAREKALHFLKGQNKAINVRIIQCASLAEATGKIIY
THALEQRPEFHLGMLFYDQTTPAGVDDSIEQIDRDLDAFYSALQRSGIPAFYTTFSTVAFIRQLRSPFRYLPQQYREIVR
SEDPAIFQTELLCLWMDFFEMNYTNRRVKPIGALALHNTLGEQLIQFFERTAAERWLVSYYTGSIISNLIGYLDRHAEAR
GALILRGPNEHAIACGAMANWQLYRMPFLGVVTSGMMDEFKGTLANLKETAAQGIIVAAENRGNQWYSFQGTLTPTEDMR
EVLIARRIPFVYIDDVEMIGAGLTEAFRLYHQGQGPVVILATQNVLESTLSLEGAVCDPSPIPVLSADDPLPMSESLAQA
IALINRGPERLVWQLGPVSDDEYALIHDIADAAGIALVDSLAHPGSAPKYYQGRRNPHYLGTLAIYGYSPRVYNFLHTND
KLNAMSEQSLFMIKSRVAQITTPFSDGRLERKVHLVQLTHDDRHLSAYADLHLHMNCLAFLRTVKAHLDVDPALRERRRA
LIAAYLDSPSDVVSQLPSLPMSANYFFCQLNRVIEELIETEGFDFTGVYDVGRCGISAARNVAKTRRGFSGWYGRALMGD
ALLATGYLAYTSPSHVMAFIGDGAKGIVPDILPAFIDNILTHPQLLNKSITVFYLCNGGLSVINTYQERILFNRTSRQMR
LVNVEQPDVEQTVNNFHIQSKTLTHFDEDVIRQALTTSHRLNLFSVVLGHNNEGDGISPGHRQRLAALIRADHDALQERK
AWAAQQPESTSTAFDQDPTQEATS
>Q5W267 2.6.1.-~~~pigE~~~Aminotransferase PigE~~~COG4992
MKFGFIAHPTSVGLKRYVKMIDLLQRNSTELHSGYKRDLWRRENLVPFMNFAKITSATGATCEGVIKYMPLVADEMLADA
RGIANRVVSGIEELVEDGAELVGLGGFTSIVGRRGEATAEKSPVPVTSGNSLTTYAGYKALMQIQSWLDIQPEQEPVAIV
GYPGSICLALSRLLLAQGFSLHLLHRAGHKDEDELLSHLPEQYRSRVTLTSDPEDLYPRCKLFVAATSAGGVIDPYKLQP
GSVFIDVALPRDINSDTRPDRDDILIIDGGCVTATDAVKLGGESLNVTIKQQLNGCMAETIVLALENRRENFSLGRYLAL
DNVLEIGELAEKHGFLVYPLASYGERIDRQRVINLKRYYHHDIYSDEPDTEQPPASQLAFIDAIIAQDPAREDTLDRYHQ
FINPMMVEFLKLQHCDNVFRRASGTQLFTADGEAFLDMVAGYGCINLGHNPQPIIDALKAYLDAQGPNFIQYISIPEQAA
KLAEVLCHFAPGNMGRVFFSNSGTEAVEAAMKLAKASTGKAGIAYLKNSYHGKTLGALSITGREKHRRHFKPLLASMIEV
PFADIEALRQTLSRDDIGALMIEPIQGEGGVHVPPPGYLRTVQEICRQTDTLLMVDEVQTGLGRTGKLFACEWEGIEPDV
LMLSKSLSGGVMPIGATLCRAIFGNGPYGTADRFLMHSSTFGGGNIAAVVALSALREILAQDLVGNAERLGTYFKQALTD
VAARYPFVAEIAGRGLMLGIQFDQTFAGAVGASAREFATRLPGDWHTTWKFLPDPVQAHLKAAMERMEQSLGEMFCMKFV
TKLCQDHNILTFITANSSTVIRIQPPLTISKAEIDRFVSAFATVCDELSTFLE
>A0A0J9X1Q5 2.6.1.-~~~pigE~~~Aminotransferase PigE~~~
MKFGFIAHPTSLGLKRYVKMLDLLQRNSTEQHSGYTRELWERQNLVPFMNFARITSATGATCEGVIKYMPLVADEMLADA
RGIAARVVQGIEELAGDGAELVGLGGFTSIVGRRGEATAEKSPVPVTSGNSLTTYAGYKALMQIQSWLEIRPEEEPVAIV
GYPGSICLALSRLLLAHGFSLHLLHRAGNHDRSELLSHLPEEYHSRVTLTSDPEDLYPRCKLFAAATSAGGVIDPARLQP
GSIFIDVALPRDIASETRPARDDILIIDGGCVTATDAVKLGGESLNVTIKQQLNGCMAETIVLALENRRENFSLGRYLAP
EKVLEIGEIAERHGFFAYPLASYGERIDRQSVTNLKRYYHHDIYAGESADAALPASRLAFIDAVIAQTPAREDTLDRYHQ
YINPMMVDFLKLQRCDNVFRSAAGTQLYDDAGEAFLDMVAGYGCLNLGHNPQPVVNALKNYLDAQGPNFIQYISIPEQTA
KLAEVLCRLAPGNMGRVFFSNSGTEAVEAAMKIAKASTGKPGIAYLRNSYHGKTLGALSITGRDKHRRYFTPLLDAMVEV
PFGDLAALREALNREDVGALMIEPIQGEGGVHIPPAGYLQAVQQLCRETGVLLMVDEVQTGLGRTGKLFACEWDGIEPDV
LMLSKSLSGGLIPIGATLCRADLWQKAYGTADRFLVHSSTYGGGNLASVVALSALREILAQDLVGHAERMGAYFKQALSE
IAARYPFVSEVRGRGLMLGIQFDQAFTGAVNASAREFATRLPGDWHTTWKFLPDPVQAHLRAAMDRMEQALGEMFCMKFV
TKLCQDHKILTFITANSSTVIRIQPPLIISKAEIDRFVGAFATVCEELSTFLD
>Q5W266 2.1.1.-~~~pigF~~~S-adenosyl-L-methionine-dependent methyl transferase PigF~~~COG1414
MTLTKQDAVNQMMGFFQSKTLITALSLKLFDHLRDQDRNAKQMAALLNCPLRSSEQLLIALQAMGYLEKQDGLYHLPQEH
RAFLVSDEPQWLGWLGRHIDTFLYPLWGELKAAVENDTHQRQTVFGDDRSWFDILYQNPDDVTDFQEFLGKFAAPFIDGF
IQDYDFSQHQAFLDIGSGIGSLPIAVANAYSGVNLAICELPQTSTFLRDKLVQQGYGQRIQVLEGDVISGDLPIGDYDLI
HLGWMLHDYAPETQLIILKNIYDAMPVGGRFIASETPLNADKSGPEFTALLSLNMLVSTDGGIESSPQEYLSRFHQAGFS
NARIMDISGPRTLIVGEKTTHNNGSSQC
>Q5W265 ~~~pigG~~~Probable acyl carrier protein PigG~~~COG0236
MLESKLINHIATQFLDGEKDGLDSQTPLFELNIVDSAAIFDLVDFLRQESKVSIGMQEIHPANFATVQSMVALVQRLKAH
PEQGGAA
>Q5W264 2.3.2.-~~~pigH~~~4-hydroxy-2,2'-bipyrrole-5-methanol synthase PigH~~~COG0156
MNDVTTETYETLKQSVLHTFAQLTGYNVSELSLTSHLENDLGVDSIALAEIAVSLSRQFQLNTPLLIQDINTIKDALDGI
LQREFQLSEKVEPAAIALSGDADLWLGNLVRQIFASHSGYDVNALALDAEIESDLGIDSVSVASAQGELFNTLQLNSETI
IANCNTLSALKQCLAARLVQEKGQDWFEQRGRGQSDSAIDHDADTTAEVTPPTATPVAINAEIGDPRTMRDFVGIEHPDI
FHKAREFHLFYQDKKKRQLYFYGMPLETPCKNRAVMFDEATGQHREFLMFGSNSYLGLSNHPEIIHAIQDAASLYGATNT
GCRIIAGSNVLHLELERKLAKLKGRDDCIVYPSGYSANLGCISALTSRHDLVFTDAINHMSIQDGCKLAGAQRKIYNHSL
TSLEKSLAKYADHPGGKLIVTDGVFSMHGDIVDLPRLMKLAERYGARVLVDDAHSTGVLGKTGAGTSEHFNMKGQVDLEL
GTMSKALSGLGGYVCGDGDVVEYLRFYSNSYVFAATIPAPVAAGVIASIDVMLREPERLAKLWDNIYYFRTRLLNAGFDL
ENSDSAIIPIVVGDDAKTLFFGRAVRARGMFCQTVVFPGVSVGDARLRISITSEHTREDLDEAYAILVASALEVGVPVNA
SAHQEENASVAEA
>Q5W263 6.2.1.53~~~pigI~~~L-proline--[L-prolyl-carrier protein] ligase~~~COG1020
MTISTPVIIDSLIRHAQRTPEQTALLCGDQHWNYRQLVTRAHVMASALRQAGLSGQAILLNLPKSLDAVAAIYATWLSGN
HYIPIDYSQPSSRIERIIAAAAPALIIDTAWLATLDSQPSFDAEQPVGRMVYHNPIAAILYTSGSTGTPKGVQISHEMLG
FFIQWAVRDTQLTARDVLSNHASFAFDLSTFDLFASAYVGAATWIIRESEQKDCAALAQGLQRHAVSVWYSVPSILAMLE
KSTLLNPTLGQSLRQVIFAGEPYPVTALKRLLPCLPQPCRVSNWYGPTETNVCVAYAIDRARLAMLKQVPIGLPLEGLTA
QLEDENGDRHPLTAQLRLSGELLISGPCVTPGYSNVVVPRQAALHPHQCHATGDWVEMTPEGLVFRGRIDDMVKINGYRV
ELGEIESVLHQHPAIDRAALCVELGDLRQTLIMVISLQTGAVPPGLLELKQFLQQKLPSYMIPNKLVITESLPVNANGKV
DRKQLAGVVAV
>Q5W262 2.3.1.-~~~pigJ~~~Beta-ketoacyl synthase PigJ~~~COG3321
MSNDKHIAPLAVVSMGCVLPGVDHFRALDTIADWETVFQSASPLAWSETSRPIQGRQMDDSGFDFKKFSIPPLFRKAVSR
ETRLALRAAEDALAGLVLPESLRDCCDQFCAIHLGSDAAYRNATKVGALRALAEKLQAQGCPAAEVRRRLDDYKQPLAES
LGCSSHDRVGEMASSIPARIAHFAHTRGKCQTLDGADKGGLRLLQLAQDCFRYHDSQMAVLTSVQCFHHRPQAYMLLEQG
VSQDACWLEGAISLVVCPLAVAHEQGWPVLTQLGDIVTTHDGSPQPEADHPAALYFAGANQVFCQIVEMVLRQHQRCEGR
SFTGGRWQVNVAQTQSLTPAVDDRVAIVDYQPITGHPLDKTQFWQTLEQGEDALREHSAAHVNAEAFVRTTQQKLSTYIH
RTMSFPAHSPSDVALKKPMMPAKKQRLDVTQLYALNSCHSWSEKIRQFERVAIIIASNLSLSADRLQAMRALWSGLPGSE
GAIPLPELPSINHWSWYGACGIGTAQLLAQYFGISADCYAVEAACASSLAAVHDAVRALQAGRYDAVIVGGIETATLERD
LVLCSAQMMLSVSRIRPFSQGADGFTPGDGGGFVMLTHHPVPRAIATIEAISGSCDSYSMTAPDPLGQALAIKKTLSLTA
IDAQTVQYLEAHGTGTELGDRSEVMSLKYSYHRDKHSPLYIGSAKYNFGHCFAGAGALSLCKVLSAFEHERIPPTPVSEL
NVDLPLGDIPAEVPQQAIPWRLSEDGQRKAAINAFGTGGINYHLVIRQSS
>Q5W259 1.-.-.-~~~pigM~~~Probable NAD(P)H nitroreductase PigM~~~COG0778
MVNDTFEQALQNAINIARLAPSSHNCQPWSVHYDAATRCGEVSIDRQRALKGLPSLEREMLMSCGIFFEYLSTLLKHSGY
PLDWQWVGARQNGSSGMLISFAPSAPCVADLVAYQQWVQRISDRHTVRTAYQPTQVNEQQQAQLYALFDRSPVTCDIKYG
EATRHDVAFLTANYASLDFADQQAWRETYHYIRFNEQQAAEDGFYLHHLFGPVSCGFRWFFRIAFHPRLSWLAKRLRLPA
SMAKGLAELVVEGPQYLALSLEHESDENLFIAGMKLGQLWLMLQSWGWSLHPLSVLVQHATARCALADTVRLTGLPVFFA
RFGQHRQSGIPTPRRAWQRILTTTQHSFSPENGADVKQP
>Q9ZGI5 2.3.1.239~~~pikAI~~~Narbonolide/10-deoxymethynolide synthase PikA1, modules 1 and 2~~~
MSSAGITRTGARTPVTGRGAAAWDTGEVRVRRGLPPAGPDHAEHSFSRAPTGDVRAELIRGEMSTVSKSESEEFVSVSND
AGSAHGTAEPVAVVGISCRVPGARDPREFWELLAAGGQAVTDVPADRWNAGDFYDPDRSAPGRSNSRWGGFIEDVDRFDA
AFFGISPREAAEMDPQQRLALELGWEALERAGIDPSSLTGTRTGVFAGAIWDDYATLKHRQGGAAITPHTVTGLHRGIIA
NRLSYTLGLRGPSMVVDSGQSSSLVAVHLACESLRRGESELALAGGVSLNLVPDSIIGASKFGGLSPDGRAYTFDARANG
YVRGEGGGFVVLKRLSRAVADGDPVLAVIRGSAVNNGGAAQGMTTPDAQAQEAVLREAHERAGTAPADVRYVELHGTGTP
VGDPIEAAALGAALGTGRPAGQPLLVGSVKTNIGHLEGAAGIAGLIKAVLAVRGRALPASLNYETPNPAIPFEELNLRVN
TEYLPWEPEHDGQRMVVGVSSFGMGGTNAHVVLEEAPGGCRGASVVESTVGGSAVGGGVVPWVVSAKSAAALDAQIERLA
AFASRDRTDGVDAGAVDAGAVDAGAVARVLAGGRAQFEHRAVVVGSGPDDLAAALAAPEGLVRGVASGVGRVAFVFPGQG
TQWAGMGAELLDSSAVFAAAMAECEAALSPYVDWSLEAVVRQAPGAPTLERVDVVQPVTFAVMVSLARVWQHHGVTPQAV
VGHSQGEIAAAYVAGALSLDDAARVVTLRSKSIAAHLAGKGGMLSLALSEDAVLERLAGFDGLSVAAVNGPTATVVSGDP
VQIEELARACEADGVRARVIPVDYASHSRQVEIIESELAEVLAGLSPQAPRVPFFSTLEGAWITEPVLDGGYWYRNLRHR
VGFAPAVETLATDEGFTHFVEVSAHPVLTMALPGTVTGLATLRRDNGGQDRLVASLAEAWANGLAVDWSPLLPSATGHHS
DLPTYAFQTERHWLGEIEALAPAGEPAVQPAVLRTEAAEPAELDRDEQLRVILDKVRAQTAQVLGYATGGQIEVDRTFRE
AGCTSLTGVDLRNRINAAFGVRMAPSMIFDFPTPEALAEQLLLVVHGEAAANPAGAEPAPVAAAGAVDEPVAIVGMACRL
PGGVASPEDLWRLVAGGGDAISEFPQDRGWDVEGLYHPDPEHPGTSYVRQGGFIENVAGFDAAFFGISPREALAMDPQQR
LLLETSWEAVEDAGIDPTSLRGRQVGVFTGAMTHEYGPSLRDGGEGLDGYLLTGNTASVMSGRVSYTLGLEGPALTVDTA
CSSSLVALHLAVQALRKGEVDMALAGGVAVMPTPGMFVEFSRQRGLAGDGRSKAFAASADGTSWSEGVGVLLVERLSDAR
RNGHQVLAVVRGSALNQDGASNGLTAPNGPSQQRVIRRALADARLTTSDVDVVEAHGTGTRLGDPIEAQALIATYGQGRD
DEQPLRLGSLKSNIGHTQAAAGVSGVIKMVQAMRHGLLPKTLHVDEPSDQIDWSAGAVELLTEAVDWPEKQDGGLRRAAV
SSFGISGTNAHVVLEEAPVVVEGASVVEPSVGGSAVGGGVTPWVVSAKSAAALDAQIERLAAFASRDRTDDADAGAVDAG
AVAHVLADGRAQFEHRAVALGAGADDLVQALADPDGLIRGTASGVGRVAFVFPGQGTQWAGMGAELLDSSAVFAAAMAEC
EAALSPYVDWSLEAVVRQAPGAPTLERVDVVQPVTFAVMVSLARVWQHHGVTPQAVVGHSQGEIAAAYVAGALPLDDAAR
VVTLRSKSIAAHLAGKGGMLSLALNEDAVLERLSDFDGLSVAAVNGPTATVVSGDPVQIEELAQACKADGFRARIIPVDY
ASHSRQVEIIESELAQVLAGLSPQAPRVPFFSTLEGTWITEPVLDGTYWYRNLRHRVGFAPAIETLAVDEGFTHFVEVSA
HPVLTMTLPETVTGLGTLRREQGGQERLVTSLAEAWVNGLPVAWTSLLPATASRPGLPTYAFQAERYWLENTPAALATGD
DWRYRIDWKRLPAAEGSERTGLSGRWLAVTPEDHSAQAAAVLTALVDAGAKVEVLTAGADDDREALAARLTALTTGDGFT
GVVSLLDGLVPQVAWVQALGDAGIKAPLWSVTQGAVSVGRLDTPADPDRAMLWGLGRVVALEHPERWAGLVDLPAQPDAA
ALAHLVTALSGATGEDQIAIRTTGLHARRLARAPLHGRRPTRDWQPHGTVLITGGTGALGSHAARWMAHHGAEHLLLVSR
SGEQAPGATQLTAELTASGARVTIAACDVADPHAMRTLLDAIPAETPLTAVVHTAGALDDGIVDTLTAEQVRRAHRAKAV
GASVLDELTRDLDLDAFVLFSSVSSTLGIPGQGNYAPHNAYLDALAARRRATGRSAVSVAWGPWDGGGMAAGDGVAERLR
NHGVPGMDPELALAALESALGRDETAITVADIDWDRFYLAYSSGRPQPLVEELPEVRRIIDARDSATSGQGGSSAQGANP
LAERLAAAAPGERTEILLGLVRAQAAAVLRMRSPEDVAADRAFKDIGFDSLAGVELRNRLTRATGLQLPATLVFDHPTPL
ALVSLLRSEFLGDEETADARRSAALPATVGAGAGAGAGTDADDDPIAIVAMSCRYPGDIRSPEDLWRMLSEGGEGITPFP
TDRGWDLDGLYDADPDALGRAYVREGGFLHDAAEFDAEFFGVSPREALAMDPQQRMLLTTSWEAFERAGIEPASLRGSST
GVFIGLSYQDYAARVPNAPRGVEGYLLTGSTPSVASGRIAYTFGLEGPATTVDTACSSSLTALHLAVRALRSGECTMALA
GGVAMMATPHMFVEFSRQRALAPDGRSKAFSADADGFGAAEGVGLLLVERLSDARRNGHPVLAVVRGTAVNQDGASNGLT
APNGPSQQRVIRQALADARLAPGDIDAVETHGTGTSLGDPIEAQGLQATYGKERPAERPLAIGSVKSNIGHTQAAAGAAG
IIKMVLAMRHGTLPKTLHADEPSPHVDWANSGLALVTEPIDWPAGTGPRRAAVSSFGISGTNAHVVLEQAPDAAGEVLGA
DEVPEVSETVAMAGTAGTSEVAEGSEASEAPAAPGSREASLPGHLPWVLSAKDEQSLRGQAAALHAWLSEPAADLSDADG
PARLRDVGYTLATSRTAFAHRAAVTAADRDGFLDGLATLAQGGTSAHVHLDTARDGTTAFLFTGQGSQRPGAGRELYDRH
PVFARALDEICAHLDGHLELPLLDVMFAAEGSAEAALLDETRYTQCALFALEVALFRLVESWGMRPAALLGHSVGEIAAA
HVAGVFSLADAARLVAARGRLMQELPAGGAMLAVQAAEDEIRVWLETEERYAGRLDVAAVNGPEAAVLSGDADAAREAEA
YWSGLGRRTRALRVSHAFHSAHMDGMLDGFRAVLETVEFRRPSLTVVSNVTGLAAGPDDLCDPEYWVRHVRGTVRFLDGV
RVLRDLGVRTCLELGPDGVLTAMAADGLADTPADSAAGSPVGSPAGSPADSAAGALRPRPLLVALLRRKRSETETVADAL
GRAHAHGTGPDWHAWFAGSGAHRVDLPTYSFRRDRYWLDAPAADTAVDTAGLGLGTADHPLLGAVVSLPDRDGLLLTGRL
SLRTHPWLADHAVLGSVLLPGAAMVELAAHAAESAGLRDVRELTLLEPLVLPEHGGVELRVTVGAPAGEPGGESAGDGAR
PVSLHSRLADAPAGTAWSCHATGLLATDRPELPVAPDRAAMWPPQGAEEVPLDGLYERLDGNGLAFGPLFQGLNAVWRYE
GEVFADIALPATTNATAPATANGGGSAAAAPYGIHPALLDASLHAIAVGGLVDEPELVRVPFHWSGVTVHAAGAAAARVR
LASAGTDAVSLSLTDGEGRPLVSVERLTLRPVTADQAAASRVGGLMHRVAWRPYALASSGEQDPHATSYGPTAVLGKDEL
KVAAALESAGVEVGLYPDLAALSQDVAAGAPAPRTVLAPLPAGPADGGAEGVRGTVARTLELLQAWLADEHLAGTRLLLV
TRGAVRDPEGSGADDGGEDLSHAAAWGLVRTAQTENPGRFGLLDLADDASSYRTLPSVLSDAGLRDEPQLALHDGTIRLA
RLASVRPETGTAAPALAPEGTVLLTGGTGGLGGLVARHVVGEWGVRRLLLVSRRGTDAPGADELVHELEALGADVSVAAC
DVADREALTAVLDAIPAEHPLTAVVHTAGVLSDGTLPSMTTEDVEHVLRPKVDAAFLLDELTSTPAYDLAAFVMFSSAAA
VFGGAGQGAYAAANATLDALAWRRRAAGLPALSLGWGLWAETSGMTGELGQADLRRMSRAGIGGISDAEGIALLDAALRD
DRHPVLLPLRLDAAGLRDAAGNDPAGIPALFRDVVGARTVRARPSAASASTTAGTAGTPGTADGAAETAAVTLADRAATV
DGPARQRLLLEFVVGEVAEVLGHARGHRIDAERGFLDLGFDSLTAVELRNRLNSAGGLALPATLVFDHPSPAALASHLDA
ELPRGASDQDGAGNRNGNENGTTASRSTAETDALLAQLTRLEGALVLTGLSDAPGSEEVLEHLRSLRSMVTGETGTGTAS
GAPDGAGSGAEDRPWAAGDGAGGGSEDGAGVPDFMNASAEELFGLLDQDPSTD
>Q9ZGI4 2.3.1.239~~~pikAII~~~Narbonolide/10-deoxymethynolide synthase PikA2, modules 3 and 4~~~
MSTVNEEKYLDYLRRATADLHEARGRLRELEAKAGEPVAIVGMACRLPGGVASPEDLWRLVAGGEDAISEFPQDRGWDVE
GLYDPNPEATGKSYAREAGFLYEAGEFDADFFGISPREALAMDPQQRLLLEASWEAFEHAGIPAATARGTSVGVFTGVMY
HDYATRLTDVPEGIEGYLGTGNSGSVASGRVAYTLGLEGPAVTVDTACSSSLVALHLAVQALRKGEVDMALAGGVTVMST
PSTFVEFSRQRGLAPDGRSKSFSSTADGTSWSEGVGVLLVERLSDARRKGHRILAVVRGTAVNQDGASSGLTAPNGPSQQ
RVIRRALADARLTTSDVDVVEAHGTGTRLGDPIEAQAVIATYGQGRDGEQPLRLGSLKSNIGHTQAAAGVSGVIKMVQAM
RHGVLPKTLHVEKPTDQVDWSAGAVELLTEAMDWPDKGDGGLRRAAVSSFGVSGTNAHVVLEEAPAAEETPASEATPAVE
PSVGAGLVPWLVSAKTPAALDAQIGRLAAFASQGRTDAADPGAVARVLAGGRAEFEHRAVVLGTGQDDFAQALTAPEGLI
RGTPSDVGRVAFVFPGQGTQWAGMGAELLDVSKEFAAAMAECESALSRYVDWSLEAVVRQAPGAPTLERVDVVQPVTFAV
MVSLAKVWQHHGVTPQAVVGHSQGEIAAAYVAGALTLDDAARVVTLRSKSIAAHLAGKGGMISLALSEEATRQRIENLHG
LSIAAVNGPTATVVSGDPTQIQELAQACEADGVRARIIPVDYASHSAHVETIESELAEVLAGLSPRTPEVPFFSTLEGAW
ITEPVLDGTYWYRNLRHRVGFAPAVETLATDEGFTHFIEVSAHPVLTMTLPETVTGLGTLRREQGGQERLVTSLAEAWTN
GLTIDWAPVLPTATGHHPELPTYAFQRRHYWLHDSPAVQGSVQDSWRYRIDWKRLAVADASERAGLSGRWLVVVPEDRSA
EAAPVLAALSGAGADPVQLDVSPLGDRQRLAATLGEALAAAGGAVDGVLSLLAWDESAHPGHPAPFTRGTGATLTLVQAL
EDAGVAAPLWCVTHGAVSVGRADHVTSPAQAMVWGMGRVAALEHPERWGGLIDLPSDADRAALDRMTTVLAGGTGEDQVA
VRASGLLARRLVRASLPAHGTASPWWQADGTVLVTGAEEPAAAEAARRLARDGAGHLLLHTTPSGSEGAEGTSGAAEDSG
LAGLVAELADLGATATVVTCDLTDAEAAARLLAGVSDAHPLSAVLHLPPTVDSEPLAATDADALARVVTAKATAALHLDR
LLREAAAAGGRPPVLVLFSSVAAIWGGAGQGAYAAGTAFLDALAGQHRADGPTVTSVAWSPWEGSRVTEGATGERLRRLG
LRPLAPATALTALDTALGHGDTAVTIADVDWSSFAPGFTTARPGTLLADLPEARRALDEQQSTTAADDTVLSRELGALTG
AEQQRRMQELVREHLAVVLNHPSPEAVDTGRAFRDLGFDSLTAVELRNRLKNATGLALPATLVFDYPTPRTLAEFLLAEI
LGEQAGAGEQLPVDGGVDDEPVAIVGMACRLPGGVASPEDLWRLVAGGEDAISGFPQDRGWDVEGLYDPDPDASGRTYCR
AGGFLDEAGEFDADFFGISPREALAMDPQQRLLLETSWEAVEDAGIDPTSLQGQQVGVFAGTNGPHYEPLLRNTAEDLEG
YVGTGNAASIMSGRVSYTLGLEGPAVTVDTACSSSLVALHLAVQALRKGECGLALAGGVTVMSTPTTFVEFSRQRGLAED
GRSKAFAASADGFGPAEGVGMLLVERLSDARRNGHRVLAVVRGSAVNQDGASNGLTAPNGPSQQRVIRRALADARLTTAD
VDVVEAHGTGTRLGDPIEAQALIATYGQGRDTEQPLRLGSLKSNIGHTQAAAGVSGIIKMVQAMRHGVLPKTLHVDRPSD
QIDWSAGTVELLTEAMDWPRKQEGGLRRAAVSSFGISGTNAHIVLEEAPVDEDAPADEPSVGGVVPWLVSAKTPAALDAQ
IGRLAAFASQGRTDAADPGAVARVLAGGRAQFEHRAVALGTGQDDLAAALAAPEGLVRGVASGVGRVAFVFPGQGTQWAG
MGAELLDVSKEFAAAMAECEAALAPYVDWSLEAVVRQAPGAPTLERVDVVQPVTFAVMVSLAKVWQHHGVTPQAVVGHSQ
GEIAAAYVAGALSLDDAARVVTLRSKSIGAHLAGQGGMLSLALSEAAVVERLAGFDGLSVAAVNGPTATVVSGDPTQIQE
LAQACEADGVRARIIPVDYASHSAHVETIESELADVLAGLSPQTPQVPFFSTLEGAWITEPALDGGYWYRNLRHRVGFAP
AVETLATDEGFTHFVEVSAHPVLTMALPETVTGLGTLRRDNGGQHRLTTSLAEAWANGLTVDWASLLPTTTTHPDLPTYA
FQTERYWPQPDLSAAGDITSAGLGAAEHPLLGAAVALADSDGCLLTGSLSLRTHPWLADHAVAGTVLLPGTAFVELAFRA
GDQVGCDLVEELTLDAPLVLPRRGAVRVQLSVGASDESGRRTFGLYAHPEDAPGEAEWTRHATGVLAARADRTAPVADPE
AWPPPGAEPVDVDGLYERFAANGYGYGPLFQGVRGVWRRGDEVFADVALPAEVAGAEGARFGLHPALLDAAVQAAGAGRG
VRRGHAAAVRLERDLLYAVGATALRVRLAPAGPDTVSVSAADSSGQPVFAADSLTVLPVDPAQLAAFSDPTLDALHLLEW
TAWDGAAQALPGAVVLGGDADGLAAALRAGGTEVLSFPDLTDLVEAVDRGETPAPATVLVACPAAGPDGPEHVREALHGS
LALMQAWLADERFTDGRLVLVTRDAVAARSGDGLRSTGQAAVWGLGRSAQTESPGRFVLLDLAGEARTAGDATAGDGLTT
GDATVGGTSGDAALGSALATALGSGEPQLALRDGALLVPRLARAAAPAAADGLAAADGLAALPLPAAPALWRLEPGTDGS
LESLTAAPGDAETLAPEPLGPGQVRIAIRATGLNFRDVLIALGMYPDPALMGTEGAGVVTATGPGVTHLAPGDRVMGLLS
GAYAPVVVADARTVARMPEGWTFAQGASVPVVFLTAVYALRDLADVKPGERLLVHSAAGGVGMAAVQLARHWGVEVHGTA
SHGKWDALRALGLDDAHIASSRTLDFESAFRAASGGAGMDVVLNSLAREFVDASLRLLGPGGRFVEMGKTDVRDAERVAA
DHPGVGYRAFDLGEAGPERIGEMLAEVIALFEDGVLRHLPVTTWDVRRARDAFRHVSQARHTGKVVLTMPSGLDPEGTVL
LTGGTGALGGIVARHVVGEWGVRRLLLVSRRGTDAPGAGELVHELEALGADVSVAACDVADREALTAVLDSIPAEHPLTA
VVHTAGVLSDGTLPSMTAEDVEHVLRPKVDAAFLLDELTSTPGYDLAAFVMFSSAAAVFGGAGQGAYAAANATLDALAWR
RRTAGLPALSLGWGLWAETSGMTGGLSDTDRSRLARSGATPMDSELTLSLLDAAMRRDDPALVPIALDVAALRAQQRDGM
LAPLLSGLTRGSRVGGAPVNQRRAAAGGAGEADTDLGGRLAAMTPDDRVAHLRDLVRTHVATVLGHGTPSRVDLERAFRD
TGFDSLTAVELRNRLNAATGLRLPATLVFDHPTPGELAGHLLDELATAAGGSWAEGTGSGDTASATDRQTTAALAELDRL
EGVLASLAPAAGGRPELAARLRALAAALGDDGDDATDLDEASDDDLFSFIDKELGDSDF
>Q9ZGI3 2.3.1.239~~~pikAIII~~~Narbonolide/10-deoxymethynolide synthase PikA3, module 5~~~
MANNEDKLRDYLKRVTAELQQNTRRLREIEGRTHEPVAIVGMACRLPGGVASPEDLWQLVAGDGDAISEFPQDRGWDVEG
LYDPDPDASGRTYCRSGGFLHDAGEFDADFFGISPREALAMDPQQRLSLTTAWEAIESAGIDPTALKGSGLGVFVGGWHT
GYTSGQTTAVQSPELEGHLVSGAALGFLSGRIAYVLGTDGPALTVDTACSSSLVALHLAVQALRKGECDMALAGGVTVMP
NADLFVQFSRQRGLAADGRSKAFATSADGFGPAEGAGVLLVERLSDARRNGHRILAVVRGSAVNQDGASNGLTAPHGPSQ
QRVIRRALADARLAPGDVDVVEAHGTGTRLGDPIEAQALIATYGQEKSSEQPLRLGALKSNIGHTQAAAGVAGVIKMVQA
MRHGLLPKTLHVDEPSDQIDWSAGTVELLTEAVDWPEKQDGGLRRAAVSSFGISGTNAHVVLEEAPAVEDSPAVEPPAGG
GVVPWPVSAKTPAALDAQIGQLAAYADGRTDVDPAVAARALVDSRTAMEHRAVAVGDSREALRDALRMPEGLVRGTSSDV
GRVAFVFPGQGTQWAGMGAELLDSSPEFAASMAECETALSRYVDWSLEAVVRQEPGAPTLDRVDVVQPVTFAVMVSLAKV
WQHHGITPQAVVGHSQGEIAAAYVAGALTLDDAARVVTLRSKSIAAHLAGKGGMISLALDEAAVLKRLSDFDGLSVAAVN
GPTATVVSGDPTQIEELARTCEADGVRARIIPVDYASHSRQVEIIEKELAEVLAGLAPQAPHVPFFSTLEGTWITEPVLD
GTYWYRNLRHRVGFAPAVETLAVDGFTHFIEVSAHPVLTMTLPETVTGLGTLRREQGGQERLVTSLAEAWANGLTIDWAP
ILPTATGHHPELPTYAFQTERFWLQSSAPTSAADDWRYRVEWKPLTASGQADLSGRWIVAVGSEPEAELLGALKAAGAEV
DVLEAGADDDREALAARLTALTTGDGFTGVVSLLDDLVPQVAWVQALGDAGIKAPLWSVTQGAVSVGRLDTPADPDRAML
WGLGRVVALEHPERWAGLVDLPAQPDAAALAHLVTALSGATGEDQIAIRTTGLHARRLARAPLHGRRPTRDWQPHGTVLI
TGGTGALGSHAARWMAHHGAEHLLLVSRSGEQAPGATQLTAELTASGARVTIAACDVADPHAMRTLLDAIPAETPLTAVV
HTAGAPGGDPLDVTGPEDIARILGAKTSGAEVLDDLLRGTPLDAFVLYSSNAGVWGSGSQGVYAAANAHLDALAARRRAR
GETATSVAWGLWAGDGMGRGADDAYWQRRGIRPMSPDRALDELAKALSHDETFVAVADVDWERFAPAFTVSRPSLLLDGV
PEARQALAAPVGAPAPGDAAVAPTGQSSALAAITALPEPERRPALLTLVRTHAAAVLGHSSPDRVAPGRAFTELGFDSLT
AVQLRNQLSTVVGNRLPATTVFDHPTPAALAAHLHEAYLAPAEPAPTDWEGRVRRALAELPLDRLRDAGVLDTVLRLTGI
EPEPGSGGSDGGAADPGAEPEASIDDLDAEALIRMALGPRNT
>Q9ZGI2 2.3.1.239~~~pikAIV~~~Narbonolide/10-deoxymethynolide synthase PikA4, module 6~~~
MTSSNEQLVDALRASLKENEELRKESRRRADRRQEPMAIVGMSCRFAGGIRSPEDLWDAVAAGKDLVSEVPEERGWDIDS
LYDPVPGRKGTTYVRNAAFLDDAAGFDAAFFGISPREALAMDPQQRQLLEASWEVFERAGIDPASVRGTDVGVYVGCGYQ
DYAPDIRVAPEGTGGYVVTGNSSAVASGRIAYSLGLEGPAVTVDTACSSSLVALHLALKGLRNGDCSTALVGGVAVLATP
GAFIEFSSQQAMAADGRTKGFASAADGLAWGEGVAVLLLERLSDARRKGHRVLAVVRGSAINQDGASNGLTAPHGPSQQH
LIRQALADARLTSSDVDVVEGHGTGTRLGDPIEAQALLATYGQGRAPGQPLRLGTLKSNIGHTQAASGVAGVIKMVQALR
HGVLPKTLHVDEPTDQVDWSAGSVELLTEAVDWPERPGRLRRAGVSAFGVGGTNAHVVLEEAPAVEESPAVEPPAGGGVV
PWPVSAKTSAALDAQIGQLAAYAEDRTDVDPAVAARALVDSRTAMEHRAVAVGDSREALRDALRMPEGLVRGTVTDPGRV
AFVFPGQGTQWAGMGAELLDSSPEFAAAMAECETALSPYVDWSLEAVVRQAPSAPTLDRVDVVQPVTFAVMVSLAKVWQH
HGITPEAVIGHSQGEIAAAYVAGALTLDDAARVVTLRSKSIAAHLAGKGGMISLALSEEATRQRIENLHGLSIAAVNGPT
ATVVSGDPTQIQELAQACEADGIRARIIPVDYASHSAHVETIENELADVLAGLSPQTPQVPFFSTLEGTWITEPALDGGY
WYRNLRHRVGFAPAVETLATDEGFTHFIEVSAHPVLTMTLPDKVTGLATLRREDGGQHRLTTSLAEAWANGLALDWASLL
PATGALSPAVPDLPTYAFQHRSYWISPAGPGEAPAHTASGREAVAETGLAWGPGAEDLDEEGRRSAVLAMVMRQAASVLR
CDSPEEVPVDRPLREIGFDSLTAVDFRNRVNRLTGLQLPPTVVFQHPTPVALAERISDELAERNWAVAEPSDHEQAEEEK
AAAPAGARSGADTGAGAGMFRALFRQAVEDDRYGEFLDVLAEASAFRPQFASPEACSERLDPVLLAGGPTDRAEGRAVLV
GCTGTAANGGPHEFLRLSTSFQEERDFLAVPLPGYGTGTGTGTALLPADLDTALDAQARAILRAAGDAPVVLLGHSGGAL
LAHELAFRLERAHGAPPAGIVLVDPYPPGHQEPIEVWSRQLGEGLFAGELEPMSDARLLAMGRYARFLAGPRPGRSSAPV
LLVRASEPLGDWQEERGDWRAHWDLPHTVADVPGDHFTMMRDHAPAVAEAVLSWLDAIEGIEGAGK
>Q9ZGI1 3.1.2.-~~~pikAV~~~Thioesterase PikA5~~~
MTDRPLNVDSGLWIRRFHPAPNSAVRLVCLPHAGGSASYFFRFSEELHPSVEALSVQYPGRQDRRAEPCLESVEELAEHV
VAATEPWWQEGRLAFFGHSLGASVAFETARILEQRHGVRPEGLYVSGRRAPSLAPDRLVHQLDDRAFLAEIRRLSGTDER
FLQDDELLRLVLPALRSDYKAAETYLHRPSAKLTCPVMALAGDRDPKAPLNEVAEWRRHTSGPFCLRAYSGGHFYLNDQW
HEICNDISDHLLVTRGAPDARVVQPPTSLIEGAAKRWQNPR
>O87605 1.14.15.33~~~pikC~~~Cytochrome P450 monooxygenase PikC~~~
MRRTQQGTTASPPVLDLGALGQDFAADPYPTYARLRAEGPAHRVRTPEGDEVWLVVGYDRARAVLADPRFSKDWRNSTTP
LTEAEAALNHNMLESDPPRHTRLRKLVAREFTMRRVELLRPRVQEIVDGLVDAMLAAPDGRADLMESLAWPLPITVISEL
LGVPEPDRAAFRVWTDAFVFPDDPAQAQTAMAEMSGYLSRLIDSKRGQDGEDLLSALVRTSDEDGSRLTSEELLGMAHIL
LVAGHETTVNLIANGMYALLSHPDQLAALRADMTLLDGAVEEMLRYEGPVESATYRFPVEPVDLDGTVIPAGDTVLVVLA
DAHRTPERFPDPHRFDIRRDTAGHLAFGHGIHFCIGAPLARLEARIAVRALLERCPDLALDVSPGELVWYPNPMIRGLKA
LPIRWRRGREAGRRTG
>P04737 ~~~traA~~~Pilin~~~
MNAVLSVQGASAPVKKKSFFSKFTRLNMLRLARAVIPAAVLMMFFPQLAMAAGSSGQDLMASGNTTVKATFGKDSSVVKW
VVLAEVLVGAVMYMMTKNVKFLAGFAIISVFIAVGMAVVGL
>P12060 ~~~traA~~~Pilin~~~
MNLSFAKGGLPAPVKNRAWQYCQMAWRGVTSKKALSRLAALSPLLLLGVGQMASATDLLAGGKDDVKATFGADSFVMMCI
IIAELIVGVAMYIRTKNLLILLGLVVVIVFTTVGLTFIK
>Q72JC0 ~~~pilA4~~~Type IV wide pilus major component PilA4~~~COG2165
MRNAKGFTLIELLIVIAIIAILAAVLIPNLLAARKRANDTVVTAYLNDAVKFQEMYQIDNNSYTSNQAALISLGLKSTPA
NVTFSIVSASANSYCMIAGHSGGTVWFAATPDKGVYKTNTAVTSSQPESCP
>Q72GL2 ~~~pilA5~~~Type IV narrow pilus major component PilA5~~~COG2165
MRAKGFTLIELAIVIVIIGILVAIAVPRFVDLTDQANQANVDATAAAVRSAYAIATVQAKGIPTCDQVFANPEGGSTSGS
TWTSSDNSTTVSCNASADTFTISRGGKTRTLNLTVN
>Q59589 ~~~pilA~~~Type IV major pilin protein PilA~~~COG4968
MRVSRFNPRNRGFTLIELMIVVAIIGILAAIAIPNFIKFQARSKQSEAKTNLKALYTAQKSFFSEKDRYSDFANEIGFAP
ERGNRYGYRVSAAAGDCEVRNAADLPVPAAGVPCISNDSFRFGANSAIDDPTPVVARFVPQGAAGWNTTLGVQPTIADCP
NCNFFAGARGNADNEATFDDWVIAGFEGSGQVGPCSEAGNVASGTPYNTRNDVACDGAAQ
>P04739 ~~~pilA~~~Type IV major pilin protein PilA~~~
MKAQKGFTLIELMIVVAIIGILAAIAIPQYQNYVARSEGASALATINPLKTTVEESLSRGIAGSKIKIGTTASTATETYV
GVEPDANKLGVIAVAIEDSGAGDITFTFQTGTSSPKNATKVITLNRTADGVWACKSTQDPMFTPKGCDN
>Q6FF45 ~~~pilB~~~Type IV pilus assembly ATPase PilB~~~COG2804
MISESQGGLMSAFTTPPKFSGFIRRLVEEGYVNAQNMQQALEKAKKFKQDIVPYLIDNFSISPLTIAEIISLEFGEPLLD
LGVFDPALFLKDKIDEKLIQKYRIMPLVHRGHVLYVATSNPTNIEAMDAIRFNSKLKVEPIIVEHDKLERLLSEHFVEET
HFNFDTEELDLDVEVDPHTTDDDDEDDKLKDEAPIVKYINKLLIDAIRMSASDLHFEPYEKSYRVRYRVDGVLRLIATPP
LQLATRLASRLKVMSQMDISEKRVPQDGRIKLKMSKSKTIDFRVNSLPTLFGEKIVLRILDPASAMLGIDALGYEPEQKA
LFMEALNKPQGMLLITGPTGSGKTVSLYTGLNILNTEHANISTAEDPVEINLEGVNQVNVNPKVGLTFAAALRSFLRQDP
DIIMVGEIRDLETAEIAIKAAQTGHLVMSTLHTNNAAETLTRLRNMGVASFNIATSVNLVIAQRLARRLCSQCKRPIQVP
ERSLLEMGFTPEDLAQPEFQIFEPVGCHDCREGYKGRVGIYEVMKITPEISKIIMEDGNALEIAATAETLGFNNLRRSGL
KKVMQGVTSLQEINRVTSE
>Q1D098 ~~~pilB~~~Type IV pilus assembly ATPase PilB~~~COG2804
MSGRLGELLVRENLISVQQLRKAQEEQQKNGTRIGTALVKTGAIEESKLTDFLSKQYGVPAINLKDFDVEPDIIKLVPKE
VAEKHLVVPVNRAGPSLIVAMCDPSNIFAVDDLKFLTGYNIETVVASEVSIREAIERYYAEKGPSLEDIVGDVGDDIEVT
KEETENIDEMAKAADDAPVVKLVNLILMDAIKKRASDIHVEPYEKDFRVRFRIDGVMYEVMRPPMKLRNAITSRLKIMAS
LDISERRLPQDGRIKIKMGGGKEMDFRVSVCPTLFGEKVVMRLLDKSNLQLDMTKLGFDAQPLAWFKEAIDRPYGMVLVT
GPTGSGKTTTLYSALSSLNGLDTNICTAEDPVEFNFAGINQVQMHDDIGLNFAAALRSFLRQDPDIIMIGEIRDFETAEI
GVKAALTGHLVLSTLHTNDAPGTVSRLLNMGIEPFLVTASLNLILAQRLARRLCPACKKPAENVDEQALIDAGVPPDKIG
TFTMYEKVGCRDCNDRGYRGRVAIYEVMPFWDGLKELVINGASAAELKQEAIRLGMSSLRMSGLRKMMDGATTLEEVVGN
TAPDRF
>P22608 ~~~pilB~~~Type IV pilus assembly ATPase PilB~~~
MNDSIQLSGLSRQLVQANLLDEKTAVQAQAQAQRNKLSLVTHLVQSKLVSGLALAELSAEQFGIAYCDLNSLDKESFPRD
AISEKLVRQHRVIPLWRRGNKLFVGISDPANHQAINDVQFSTGLTTEAILVEDDKLGLAIDKLFESATDGLAGLDDVDLE
GLDIGSADKSTQEDASAEADDAPVVRFVNKMLLDAIKGGSSDLHFEPYEKIYRVRFRTDGMLHEVAKPPIQLASRISARL
KVMAGLDISERRKPQDGRIKMRVSKTKSIDFRVNTLPTLWGEKIVMRILDSSSAQMGIDALGYEEDQKELYLAALKQPQG
MILVTGPTGSGKTVSLYTGLNILNTTDINISTAEDPVEINLEGINQVNVNPRQGMDFSQALRAFLRQDPDVIMVGEIRDL
ETAEIAIKAAQTGHMVMSTLHTNSAAETLTRLLNMGVPAFNLATSVNLIIAQRLARKLCSHCKKEHEVPRETLLHEGFPE
DKIGTFKLYSPVGCDHCKNGYKGRVGIYEVVKNTPALQRIIMEEGNSIEIAEQARKEGFNDLRTSGLLKAMQGITSLEEV
NRVTKD
>Q5SLC9 ~~~pilB~~~Type IV pilus assembly ATPase PilB~~~COG2804
MSVLTIGDKRLGAALLDAGLLTDEELQRALERHREVGGSLAEVLVDMGLLSERRIAQTIEDRFGIPLVELHRVEIPPKVK
ALLPAEKAKELKAIPFALDEEAGVVRVAFLNPLDTLSLEEVEDLTGLVVEPYQTTKSAFLYALAKHYPELGLPVPPPPSG
EGQKDLKLGELLLQKGWISREALEEALVEQEKTGDLLGRILVRKGLPEEALYRALAEQKGLEFLESTEGIVPDPSAALLL
LRSDALRYGAVPIGFQNGEVEVVLSDPRHKEAVAQLLNRPARFYLALPQAWEELFRRAYPQKNRLGEVLVQEGKLSREAL
KEALEVQKGLPRAKPLGEILVELGLARPEDVEEALQKQRRGGGRLEDTLVQSGKLRPEALAQAVATQLGYPYVDPEEDPP
DPGAPLLLPEDLCRRYGVFPHRLEGNRLVLLMKDPRNILALDDVRLALKRKGLNYEVAPAVATEAAITKLIERFYGKAEL
SEIAKEFAKKQAEEEVPSPLELDESAAQKFVKQVIREAFLQDASDIHIEPRQNDVQVRLRIDGALRPYSTLPKGALNAVI
SVVKIMGGLNIAEKRLPQDGRVRYREGAIDVDLRLSTLPTVYGEKAVMRLLKKASDIPEIEDLGFAPGVFERFKEVISKP
YGIFLITGPTGSGKSFTTFSILKRIATPDKNTQTIEDPVEYEIPGINQTQVNPQAGLTFARALRAFLRQDPDIIMVGEIR
DSETAKIATEAALTGHLVIATLHTNDAAQAITRLDEMGVEPFNISAALIGVLSQRLVRRVCEHCKVEVKPDPETLRRLGL
SEAEIQGARLYKGMGCERCGGTGYKGRYAIHELLVVDDEIRHAIVAGKSATEIKEIARRKGMKTLREDGLYKALQGITTL
EEVLARTIE
>Q1D0A0 ~~~pilC~~~Type IV pilus assembly protein PilC~~~COG1459
MAAPAVKSASTPKKATAQFLWEAKTKSGESKKGEMEAMDVEAVNARLKSLGLNPVKVRKKSMLDGDITIPGFGGVEGKDI
LVFTRQFATMIDAGLPLVQCLDILASQMDNPSFKKVLFAIKSKVEQGSTFADALKEHPKVFDELYVQLCAAGEVGGILDA
ILNRLAAYREKNEKLKSKVKSAMTYPIIVILVAIGVTAVLLLKVTPVFEKMFADFGSELPGPTQMIVNFSHMAQEYFFHV
AGSIVAVVMSFTWSYRQPRGRKFWDKVFLFMPVFGPVLRKVAVARFTRTLGTMISSGVPILDALDVTAKTAGNRTVEDAI
IYVRGKIAEGKNIAGPLAETKVFPSMVVQMIGVGEATGAMDTMLNKIADFYDDEVDAAINSLTAMIEPVLMVFLGGVVGG
FLIGMYLPIFSLAGAIQ
>P22609 ~~~pilC~~~Type IV pilus assembly protein PilC~~~
MLVKAHLRKQGINPLKVRKKGISLLGAGKKVKPMDIALFTRQMATMMGAGVPLLQSFDIIGEGFDNPNMRKLVDEIKQEV
SSGNSLANSLRKKPQYFDELYCNLVDAGEQSGALENLLDRVATYKEKTESLKAKIRKAMTYPIAVIIVALIVSAILLIKV
VPQFQSVFQGFGAELPAFTQMVVNLSEFLQEWWLAVIVGVGAIGFTFKELHKRSKKFRDTLDRTILKLPIFGGIVYKSAV
ARYARTLSTTFAAGVPLVDALDSVSGATGNIVFKNAVSKIKQDVSTGMQLNFSMRTTSVFPNMAIQMTAIGEESGSLDEM
LSKVASYYEEEVDNAVDNLTTLMEPMIMAVLGVLVGGLIVAMYLPIFQLGNVVG
>Q5SK58 ~~~~~~Type IV pilus assembly protein PilC~~~COG1459
MPVYQYKARDRQGRLVEATIEAEDLRTAARLLRDRGLFVAEIKEPGKGLQAEVRIPALERGPGLKDLAIFSRQLATMLGA
GLTLLQALAILERQTENRKFREILKQVRTDVEGGMAFSEALSKHKIFSRLYVNLVRAGETSGGLDLILDRLASFLEKELE
LRGKIRSAMTYPVIVFVFAVGVAYFLLTGIVPQFAQILTDLGSELPLLTRFLIAVSDLLRAATLPLLLLAVALFFAYRWY
YGTPQGRRVIDRLKLRLPVFGNLNRKTAVARFSRTLALLLSSGVNIVEALDITKGTAGNSVVEEIVEAAKLKIQQGDPLN
LTLAQHPFVFPPMVSSMVAIGEETGALDTMLSKVADFYEREVDEAVASLTAAIEPLMIIFLGVIVGMIVAGMFLPLFKII
GTLSVQ
>G3XD43 ~~~pilE~~~Type IV pilus non-core minor pilin PilE~~~
MRTRQKGFTLLEMVVVVAVIGILLGIAIPSYQNYVIRSNRTEGQALLSDAAARQERYYSQNPGVGYTKDVAKLGMSSANS
PNNLYNLTIATPTSTTYTLTATPINSQTRDKTCGKLTLNQLGERGAAGKTGNNSTVNDCWR
>Q9HXJ2 ~~~pilF~~~Type IV pilus assembly protein PilF~~~
MTVRAALVFLLAVGLTGCVTSGDQNPLKTDKGRDEARDAYIQLGLGYLQRGNTEQAKVPLRKALEIDPSSADAHAALAVV
FQTEMEPKLADEEYRKALASDSRNARVLNNYGGFLYEQKRYEEAYQRLLEASQDTLYPERSRVFENLGLVSLQMKKPAQA
KEYFEKSLRLNRNQPSVALEMADLLYKEREYVPARQYYDLFAQGGGQNARSLLLGIRLAKVFEDRDTAASYGLQLKRLYP
GSLEYQEFQAEK
>P46384 ~~~pilG~~~Protein PilG~~~
MEQQSDGLKVMVIDDSKTIRRTAETLLKKVGCDVITAIDGFDALAKIADTHPNIIFVDIMMPRLDGYQTCALIKNNSAFK
STPVIMLSSKDGLFDKAKGRIVGSDQYLTKPFSKEELLGAIKAHVPSFTPVDAVS
>A5U7Y7 ~~~mtp~~~Pilin~~~
MYRFACRTLMLAACILATGVAGLGVGAQSAAQTAPVPDYYWCPGQPFDPAWGPNWDPYTCHDDFHRDSDGPDHSRDYPGP
ILEGPVLDDPGAAPPPPAAGGGA
>P9WI86 ~~~mtp~~~Pilin~~~
MYRFACRTLMLAACILATGVAGLGVGAQSAAQTAPVPDYYWCPGQPFDPAWGPNWDPYTCHDDFHRDSDGPDHSRDYPGP
ILEGPVLDDPGAAPPPPAAGGGA
>P9WI87 ~~~mtp~~~Pilin~~~
MYRFACRTLMLAACILATGVAGLGVGAQSAAQTAPVPDYYWCPGQPFDPAWGPNWDPYTCHDDFHRDSDGPDHSRDYPGP
ILEGPVLDDPGAAPPPPAAGGGA
>Q9A1S2 ~~~~~~Pilin~~~
MKLRHLLLTGAALTSFAATTVHGETVVNGAKLTVTKNLDLVNSNALIPNTDFTFKIEPDTTVNEDGNKFKGVALNTPMTK
VTYTNSDKGGSNTKTAEFDFSEVTFEKPGVYYYKVTEEKIDKVPGVSYDTTSYTVQVHVLWNEEQQKPVATYIVGYKEGS
KVPIQFKNSLDSTTLTVKKKVSGTGGDRSKDFNFGLTLKANQYYKASEKVMIEKTTKGGQAPVQTEASIDQLYHFTLKDG
ESIKVTNLPVGVDYVVTEDDYKSEKYTTNVEVSPQDGAVKNIAGNSTEQETSTDKDMTITFTNKKDFEVPTGVAMTVAPY
IALGIVAVGGALYFVKKKNA
>P42257 ~~~pilJ~~~Protein PilJ~~~
MKKINAGNLFAGMRSSSVIAGLFIVLIVSIVLLFANFAYLNTQSNHDKQYIGHAGELRVLSQRIAKNATEAAAGKGEAFK
LLKDARNDFEKRWNILVNGDESTSLPPSPEAVKPQMDVVQQDWDGLRKNADSILASEQTVLSLHQVASTLAETIPQLQVE
YEEVVDILLENGAPADQVAVAQRQSLLAERILGSVNKVLAGDENSVQAADSFGRDASLFGRVLKGMQEGNAAMSISKVTN
AEAVDRLNEIAELFEFVSGSVDEILETSPDLFQVREAANNIFSVSQTLLDKASQLADGFENLAGGRSINLFAGYALGALA
LASIILIGLVMVRETNRRLAETAEKNDRNQAAILRLLDEIADLADGDLTVAATVTEDFTGAIADSINYSIDQLRELVETI
NQTAVQVAAAAQETQSTAMHLAEASEHQAQEIAGASAAINEMAVSIDQVSANASESSAVAERSVAIANKGNEVVHNTITG
MDNIREQIQDTSKRIKRLGESSQEIGDIVSLINDIADQTNILALNAAIQASMAGDAGRGFAVVADEVQRLAERSSAATKQ
IEALVKTIQTDTNEAVISMEQTTSEVVRGARLAQDAGVALEEIEKVSKTLAALIQNISNAARQQASSAGHISNTMNVIQE
ITSQTSAGTTATARSIGNLAKMASEMRNSVSGFKLPEGVEQA
>G3XD28 ~~~pilM~~~Type IV pilus inner membrane component PilM~~~
MLGLIKKKANTLLGIDISSTSVKLLELSRSGGRYKVEAYAVEPLPPNAVVEKNIVELEGVGQALSRVLVKAKTNLKSAVV
AVAGSAVITKTIEMEAGLSEDELENQLKIEADQYIPYPLEEVAIDFEVQGLSARNPERVDVLLAACRKENVEVREAALAL
AGLTAKVVDVEAYALERSYALLSSQLGADTDQLTVAVVDIGATMTTLSVLHNGRTIYTREQLFGGRQLTEEIQRRYGLSV
EEAGLAKKQGGLPDDYDSEVLRPFKDAVVQQVSRSLQFFFAAGQFNDVDYIVLAGGTASIQDLDRLIQQKIGTPTLVANP
FADMALNGKVNAGALASDAPALMIACGLALRSFD
>G3XD30 ~~~pilN~~~Type IV pilus inner membrane component PilN~~~
MARINLLPWREELREQRKQQFLVILGGVLVASAALVFLGDQYFTAAIENQNARNDFLRKEIVVLDARIKEISELKSRRQQ
LLERMKIIQDLQGNRPIIGRVFDQLVRTLPDGVYFTDLKMTGKNIAIAGAAESNNRVSNLMRNMDASEWLTAPTLNEVKA
VTQGAVDQANVFQLTVQQTQPGEEDAKAKHGVAQGAKK
>G3XD51 ~~~pilO~~~Type IV pilus inner membrane component PilO~~~
MSLASSLESLRKIDINDLDLNNIGSWPAAVKVIVCVLLTAAVLALGYNFHLSDMQAQLEQQAAEEETLKQQFSTKAFQAA
NLEAYKAQMKEMEESFGALLRQLPSDTEVPGLLEDITRTGLGSGLEFEEIKLLPEVAQQFYIELPIQISVVGGYHDLATF
VSGVSSLPRIVTLHDFEIKPVAPGSTSKLRMSILAKTYRYNDKGLKK
>G3XCX7 ~~~pilP~~~Type IV pilus inner membrane component PilP~~~
MRARLILSSLLLASLAGCGGGSDFADLQSYMDEVRARPKGTIEPLPKFQPYEAFTYSAASLRSPFQPPVKIDLTVRQKGN
KVIKPDETRVKQFLEGFNIETFEMVGTLSNAQGTFALVKGAGGVHRVRVGDYLGRNDGKVVGISEGKIDVIEIVPDGEGN
WLERPRSLTLKERS
>Q9JVW4 ~~~pilQ~~~Type IV pilus biogenesis and competence protein PilQ~~~
MNTKLTKIISGLFVATAAFQTASAGNITDIKVSSLPNKQKIVKVSFDKEIVNPTGFVTSSPARIALDFEQTGISMDQQVL
EYADPLLSKISAAQNSSRARLVLNLNKPGQYNTEVRGNKVWIFINESDDTVSAPARPAVKAAPAAPAKQQAAAPSTKSAV
SVSKPFTPAKQQAAAPFTESVVSVSAPFSPAKQQAAASAKQQTAAPAKQQAATPAKQTNIDFRKDGKNAGIIELAALGFA
GQPDISQQHDHIIVTLKNHTLPTTLQRSLDVADFKTPVQKVTLKRLNNDTQLIITTAGNWELVNKSAAPGYFTFQVLPKK
QNLESGGVNNAPKTFTGRKISLDFQDVEIRTILQILAKESGMNIVASDSVNGKMTLSLKDVPWDQALDLVMQARNLDMRQ
QGNIVNIAPRDELLAKDKAFLQAEKDIADLGALYSQNFQLKYKNVEEFRSILRLDNADTTGNRNTLVSGRGSVLIDPATN
TLIVTDTRSVIEKFRKLIDELDVPAQQVMIEARIVEAADGFSRDLGVKFGATGKKKLKNDTSAFGWGVNSGFGGDDKWGA
ETKINLPITAAANSISLVRAISSGALNLELSASESLSKTKTLANPRVLTQNRKEAKIESGYEIPFTVTSIANGGSSTNTE
LKKAVLGLTVTPNITPDGQIIMTVKINKDSPAQCASGNQTILCISTKNLNTQAMVENGGTLIVGGIYEEDNGNTLTKVPL
LGDIPVIGNLFKTRGKKTDRRELLIFITPRIMGTAGNSLRY
>Q70M91 ~~~pilQ~~~Type IV pilus biogenesis and competence protein PilQ~~~
MNTKLTKIISGLFVATAAFQTASAGNITDIKVSSLPNKQKIVKVSFDKEIVNPTGFVTSSPARIALDFEQTGISMDQQVL
EYADPLLSKISAAQNSSRARLVLNLNKPGQYNTEVRGNKVWIFINESDDTVSAPARPAVKAAPAAPAKQQAAAPSTKSAV
SVSEPFTPAKQQAAAPFTESVVSVSAPFSPAKQQAAASAKQQAAAPAKQQAAAPAKQQAAAPAKQTNIDFRKDGKNAGII
ELAALGFAGQPDISQQHDHIIVTLKNHTLPTTLQRSLDVADFKTPVQKVTLKRLNNDTQLIITTAGNWELVNKSAAPGYF
TFQVLPKKQNLESGGVNNAPKTFTGRKISLDFQDVEIRTILQILAKESGMNIVASDSVNGKMTLSLKDVPWDQALDLVMQ
ARNLDMRQQGNIVNIAPRDELLAKDKALLQAEKDIADLGALYSQNFQLKYKNVEEFRSILRLDNADTTGNRNTLISGRGS
VLIDPATNTLIVTDTRSVIEKFRKLIDELDVPAQQVMIEARIVEAADGFSRDLGVKFGATGKKKLKNDTSAFGWGVNSGF
GGDDKWGAETKINLPITAAANSISLVRAISSGALNLELSASESLSKTKTLANPRVLTQNRKEAKIESGYEIPFTVTSIAN
GGSSTNTELKKAVLGLTVTPNITPDGQIIMTVKINKDSPAQCASGNQTILCISTKNLNTQAMVENGGTLIVGGIYEEDNG
NTLTKVPLLGDIPVIGNLFKTRGKKTDRRELLIFITPRIMGTAGNSLRY
>P34750 ~~~pilQ~~~Fimbrial assembly protein PilQ~~~
MNSGLSRLGIALLAAMFAPALLAADLEKLDVAALPGDRVELKLQFDEPVAAPRGYTIEQPARIALDLPGVQNKLGTKNRE
LSVGNTRSVTVVEAKDRTRLIINLTALSSYTTRVEGNNLFVVVGNSPAGASVASAAPVKASPAPASYAQPIKPKPYVPAG
RAIRNIDFQRGEKGEGNVVIDLSDPTLSPDIQEQGGKIRLDFAKTQLPDALRVRLDVKDFATPVQFVNASAQSDRTSITI
EPSGLYDYLVYQTDNRLTVSIKPMTTEDAERRKKDNFAYTGEKLSLNFQDIDVRSVLQLIADFTDLNLVASDTVQGNITL
RLQNVPWDQALDLVLKTKGLDKRKLGNVLLVAPADEIAARERQELEAQKQIAELAPLRRELIQVNYAKAADIAKLFQSVT
SDGGQEGKEGGRGSITVDDRTNSIIAYQPQERLDELRRIVSQLDIPVRQVMIEARIVEANVGYDKSLGVRWGGAYHKGNW
SGYGKDGNIGIKDEDGMNCGPIAGSCTFPTTGTSKSPSPFVDLGAKDATSGIGIGFITDNIILDLQLSAMEKTGNGEIVS
QPKVVTSDKETAKILKGSEVPYQEASSSGATSTSFKEAALSLEVTPQITPDNRIIVEVKVTKDAPDYQNMLNGVPPINKN
EVNAKILVNDGETIVIGGVFSNEQSKSVEKVPFLGELPYLGRLFRRDTVTDRKNELLVFLTPRIMNNQAIAIGR
>Q00934 ~~~pilR~~~Response regulator protein PilR~~~
MSRQKALIVDDEPDIRELLEITLGRMKLDTRSARNVKEARELLAREPFDLCLTDMRLPDGSGLDLVQYIQQRHPQTPVAM
ITAYGSLDTAIQALKAGAFDFLTKPVDLGRLRELVATALRLRNPEAEEAPVDNRLLGESPPMRALRNQIGKLARSQAPVY
ISGESGSGKELVARLIHEQGPRIERPFVPVNCGAIPSELMESEFFGHKKGSFTGAIEDKQGLFQAASGGTLFLDEVADLP
MAMQVKLLRAIQEKAVRAVGGQQEVAVDVRILCATHKDLAAEVGAGRFRQDLYYRLNVIELRVPPLRERREDIPLLAERI
LKRLAGDTGLPAARLTGDAQEKLKNYRFPGNVRELENMLERAYTLCEDDQIQPHDLRLADAPGASQEGAASLSEIDNLED
YLEDIERKLIMQALEETRWNRTAAAQRLGLTFRSMRYRLKKLGID
>P33639 2.7.13.3~~~pilS~~~Sensor protein kinase PilS~~~
MRAERLRLSEEQGQRILRLYHLYRLTIGLVLVLLISSELEDQVLKLVHPELFHVGSWCYLVFNILVALFLPPSRQLLPIF
ILALTDVLMLCGLFYAGGGVPSGIGSLLVVAVAIANILLRGRIGLVIAAAASLGLLYLTFFLSLSSPDATNHYVQAGGLG
TLCFAAALVIQALVRRQEQTETLAEERAETVANLEELNALILQRMRTGILVVDSRQAILLANQAALGLLRQDDVQGASLG
RHSPMLMHCMKQWRLNPSLRPPTLKVVPDGPTVQPSFISLNREDDQHVLIFLEDISQIAQQAQQMKLAGLGRLTAGIAHE
IRNPLGAISHAAQLLQESEELDAPDRRLTQIIQDQSKRMNLVIENVLQLSRRRQAEPQQLDLKEWLQRFVDEYPGRLRND
SQLHLQLGAGDIQTRMDPHQLNQVLSNLVQNGLRYSAQAHGRGQVWLSLARDPESDLPVLEVIDDGPGVPADKLNNLFEP
FFTTESKGTGLGLYLSRELCESNQARIDYRNREEGGGCFRITFAHPRKLS
>P24559 ~~~pilT~~~Type IV pilus retractation ATPase PilT~~~
MDITELLAFSAKQGASDLHLSAGLPPMIRVDGDVRRINLPPLEHKQVHALIYDIMNDKQRKDFEEFLETDFSFEVPGVAR
FRVNAFNQNRGAGAVFRTIPSKVLTMEELGMGEVFKRVSDVPRGLVLVTGPTGSGKSTTLAAMLDYLNNTKYHHILTIED
PIEFVHESKKCLVNQREVHRDTLGFSEALRSALREDPDIILVGEMRDLETIRLALTAAETGHLVFGTLHTTSAAKTIDRV
VDVFPAEEKAMVRSMLSESLQSVISQTLIKKIGGGRVAAHEIMIGTPAIRNLIREDKVAQMYSAIQTGGSLGMQTLDMCL
KGLVAKGLISRENAREKAKIPENF
>G3XCX3 ~~~pilU~~~Type IV pilus ATPase PilU~~~
MEFEKLLRLMVEKGGSDLFITAGVPPSMKVNGRVMPVTKTPLSPEQTRETVLGVMNEQQRRDFAENHECNFAISARGIGR
FRVSAFYQRNLVGMVLRRIETNIPTLEELKLPEILKKLALTKRGLVIFVGATGTGKSTSLAAMIGYRNKNSTGHIISIED
PIEYIHQHQGCIVTQREVGLDTDSFEVALKNTLRQAPDVIMIGEVRSRETMDHAVAFAETGHLCLATLHANNANQALERI
IHFFPADRHGQVWMDLSLNLKAIVAQQLVPTPDGKGRRAVIEVLLNTPLAADLIRKGEVHELKPLMKRSTEQGMQTFDQA
LYQLYTQGEITYEDALAHADSANDLRLMIKLGSESDADHLSSLTQGLSLEITDDDPAGRRFR
>Q02GC2 ~~~pilY1~~~Type IV pilus biogenesis factor PilY1~~~
MIHQITRAGKSLLAAGCTLSILFASDSYAATALNVSQQPLFLTQGVAPNLLFTLDDSGSMAWAYVPDGISGNSGRAGRSS
DYNALYYNPDYAYQVPKKLTLSGDQIIVSDYPVPRFTAAWQDGYAQGSTTNLSNNYRPQWGTGWLGCIDSSCNTGRAYYY
TYKVSASCPAQPVSSSNSCYTYNALPTSQESNFAIWYSYYRNRILATKTAANLAFYSLPENVRLTWGALNTCSIGANSRS
CQNNALLQFNKQHKINFFNWLANSPASGGTPLHAALDRAGRFLQTNGTAYTTEDGKTYSCRASYHIMMTDGIWNGRNVTP
GNLDNQNQTFPDSTLYRPQPPYADSNASSLADLAFKYWTTDLRPSIDNDLKPFMAYKSGDDSKDYWDPRNNPATWQHMVN
FTVGLGLSYSLTLNSAPTWTGSTFGNYEELMAGSKAWPSVDNDAAPGNVYDLWHAAINSRGDFFSAESPDSLVQAFNKIL
TRISERNTSSSKPAMTSALQDDGTGDKLIRYSYQSSFASDKNWAGDLIRYKVESTSTGSTKTQEWSAGALLDNRAPATRN
IYIASNSGTNRLKPFTWSNIEGSQLATWLNRNPDKDNQADTKGAQRVDFIRGQQNMDGFRQRQAVLGDIVHSSPAVVGPA
QYLTYLANPIEPSGDYGTFKTEADQRSPRVYVGSNDGMLHGFNIKTGVEEFAFIPTAVFEKLNKLTGISYQGGAHQYFVD
ATPVVSDAFFDGAWHTVLIGTLGAGGRGLFALDVTKPDDVKLLWEYDSSTDSDLGYTFSKPTVARLHSGQWAVVTGNGYG
SDNDKAALLLIDLKKGTLIKKLEVQSERGIANGLSTPRLADNNSDGIADYAYAGDLQGNIWRFDLIGNTRNDDPDTNTSI
NPFKPGDVDPSAFRVSFSGAPLFRARADNNTRQPITAPPTLVRHPSRKGYIVIVGTGKYFEDDDAQADTSRAMTLYGIWD
RQTKGESANSTPTIDRNALTAQTMTTEANSTFGSVNRNIRLISQNPVKWYKDGATGTANSDVASYGWRLNLEVNSSKKGE
MMIEDMFAAGQVLLLQTLTPNDDPCDSGSTSWTYGLNPYTGGRTSFTVFDLKRAGIVDSGSDYNGSVVSAFQQDGLGGLA
ITQNEQRQSEACTGDECIIFNPSDKSNGRQTWRVVEEK
>S0HPF7 ~~~pilY1~~~Type IV pilus biogenesis factor PilY1~~~
MKSALHQIGKTSLAAALSGAVLLSAQTTHAAALSVSQQPLMLIQGVAPNMLVTLDDSGSMAYAYAPDSLVNSRNNVYFAS
NSYNPMYFDPNTQYKLPKKVTLSNGQIQVQDYSKPSFTAAWRNGFTQEGRVNLSRDYRPTVQYQGGSGAGTESSIDWYGA
PAFYYQYSGGRGCSLTTSSCYTRVEISGAAQQQNFANWYSFYRTRALATQTAANLAFYSLPENARISWQLLNSSSCLIGS
GSSNCYNNYLRDFTGQHRVNFFNWLENLSVGGGTPLRQAMTRAGEFLKKTGVNGPYAYRPGTQTSPEYSCRGSYHILMTD
GLWNNDSASVGNADSTSRSLPDGKSYSSQTPYRDAASNTLADQAFHYWATDARPDIDDNIKPYIPYPDQANPSAEYWNPR
NDPATWQHMVTYTLGLGLTTSLTSPKWEGSTYSGGYDEIAAGRLSWPNASNNHSNNVYDLWHAAVNSRGEFFSADSPDQL
VAAFQDILNRISGKDLPASRPAISSSLQEDDTGDKLTRFAYQTSFASDKNWAGDLTRYSLTTQDKATVQTKLWSAQSILD
AMPNGGAGRKIMMAGSGTSGLKEFTWGSLSADQQRQLNRDPDRNDVADTKGQDRVAFLRGDRSKENSDNFRTRNSILGDI
INSSPATVGKAQYLTYLAQPIEPSGNYSTFAEAQKTRAPRVYVGANDGMLHGFDTDGNETFAFIPSAVFEKLHKLTARGY
QGGAHQFYVDGSPVVADAFFGGAWHTVLIGSLRAGGKGLFALDVTDPANIKLLWEIGVDQEPDLGYSFPKPTVARLHNGK
WAVVTGNGYSSLNDKAALLIIDLETGAITRKLEVTGRTGVPNGLSSPRLADNNSDGVADYAYAGDLQGNLWRFDLIAGKV
NQDDPFSRANDGPAVASSFRVSFGGQPLYSAVDSAGAAQAITAAPSLVRHPTRKGYIVIFGTGKYFENADARADTSRAQT
LYGIWDQQTKGEAAGSTPRLTRGNLQQQTLDLQADSTFASTARTIRIASQNPVNWLNNDGSTKQSGWYLDFMVNGTLKGE
MLIEDMIAIGQVVLLQTITPNDDPCADGASNWTYGLDPYTGGRTSFTVFDLARQGVVDSKSDYSYNKQNVAVSGTEQKGL
GGLTLSTNEQGNPEVCSSGECLTVNPGPNTRGRQNWRPIEGKN
>A0QWG6 2.4.1.345~~~pimA~~~Phosphatidyl-myo-inositol mannosyltransferase~~~COG0438
MRIGMVCPYSFDVPGGVQSHVLQLAEVLRDAGHEVSVLAPASPHVKLPDYVVSGGKAVPIPYNGSVARLRFGPATHRKVK
KWIAEGDFDVLHIHEPNAPSLSMLALQAAEGPIVATFHTSTTKSLTLSVFQGILRPYHEKIIGRIAVSDLARRWQMEALG
SDAVEIPNGVDVASFADAPLLDGYPREGRTVLFLGRYDEPRKGMAVLLAALPKLVARFPDVEILIVGRGDEDELREQAGD
LAGHLRFLGQVDDATKASAMRSADVYCAPHLGGESFGIVLVEAMAAGTAVVASDLDAFRRVLADGDAGRLVPVDDADGMA
AALIGILEDDQLRAGYVARASERVHRYDWSVVSAQIMRVYETVSGAGIKVQVSGAANRDETAGESV
>P9WMZ5 2.4.1.345~~~pimA~~~Phosphatidyl-myo-inositol mannosyltransferase~~~COG0438
MRIGMICPYSFDVPGGVQSHVLQLAEVMRTRGHLVSVLAPASPHAALPDYFVSGGRAVPIPYNGSVARLRFGPATHRKVK
KWLAHGDFDVLHLHEPNAPSLSMLALNIAEGPIVATFHTSTTKSLTLTVFQGILRPMHEKIVGRIAVSDLARRWQMEALG
SDAVEIPNGVDVDSFASAARLDGYPRQGKTVLFLGRYDEPRKGMAVLLDALPKVVQRFPDVQLLIVGHGDADQLRGQAGR
LAAHLRFLGQVDDAGKASAMRSADVYCAPNTGGESFGIVLVEAMAAGTAVVASDLDAFRRVLRDGEVGHLVPVDPPDLQA
AALADGLIAVLENDVLRERYVAAGNAAVRRYDWSVVASQIMRVYETVAGSGAKVQVAS
>D7GDZ9 2.4.1.345~~~pimA~~~Phosphatidyl-myo-inositol mannosyltransferase~~~COG0438
MRVGLVCPYSFARPGGVQNHVLGLGGWLKEQGHDVSIIAPGQASRSLLAETGLVPSEFVSAGRAVPVTFNGSVARINFGV
GPALKVKKWLDQGNFDVVHLHEPIAPTICLLALYLTDRPVTATFHTATPELTAIRFANRVLPRMVSRIDAAIAVSSEAAD
VAHHYSGVNPVVIGNGIHLADYPLVRATSRWRGGEHPLITFLGRYDEPRKGFEVLTAALPLVRATYPDLEVVVIGSGTAR
SVEGVRFLGGLDDEERNAWLGRSDIYIAPQTGRESFGIVLLEAMACGAPVVAANLRAFLDVLTDDEGLVGHTFRVGNSAS
ASRAMLRSLSEPRDLRLERGRALAANYDWSVIGPQVVAMYTVAGQNYATSRGIKNRELKGH
>Q8NNK8 2.4.1.346~~~pimB~~~GDP-mannose-dependent monoacylated alpha-(1-6)-phosphatidylinositol monomannoside mannosyltransferase~~~COG0438
MSASRKTLVVTNDFPPRIGGIQSYLRDFIATQDPESIVVFASTQNAEEAHAYDKTLDYEVIRWPRSVMLPTPTTAHAMAE
IIREREIDNVWFGAAAPLALMAGTAKQAGASKVIASTHGHEVGWSMLPGSRQSLRKIGTEVDVLTYISQYTLRRFKSAFG
SHPTFEHLPSGVDVKRFTPATPEDKSATRKKLGFTDTTPVIACNSRLVPRKGQDSLIKAMPQVIAARPDAQLLIVGSGRY
ESTLRRLATDVSQNVKFLGRLEYQDMINTLAAADIFAMPARTRGGGLDVEGLGIVYLEAQACGVPVIAGTSGGAPETVTP
ATGLVVEGSDVDKLSELLIELLDDPIRRAAMGAAGRAHVEAEWSWEIMGERLTNILQSEPR
>A0R043 2.4.1.346~~~pimB~~~GDP-mannose-dependent alpha-(1-6)-phosphatidylinositol monomannoside mannosyltransferase~~~COG0438
MLLVTNDFPPRRGGIQSYLEAFVGELVRTHELTVYAPKWKGAEEYDEKAARSGYRVVRHPTTLMLPEPTVASRMKRLIGE
HDIETVWFGAAAPLALLGPLARRAGARRIVASTHGHEVGWSMLPVARTALRRIGNDADVVTFVSRYTRSRFASAFGPSAA
LEHLPPGVDTDRFAPDPDARARMRERYGLGDRPVVVCLSRLVPRKGQDMLIRALPELRRRVPDTALAIVGGGPYLETLQR
MASDLGVAEHVVFTRGIPAEELPAHHAMADVFAMPCRTRGAGLDVEGLGIVYLEASACGVPVVAGRSGGAPETVLDGKTG
TVVDGTDVDAITTAVGDLLADPRRAAAMGVAGRHWALDNWQWRTRGARLAELLSGRREARQA
>P9WMZ3 2.4.1.346~~~pimB~~~GDP-mannose-dependent alpha-(1-6)-phosphatidylinositol monomannoside mannosyltransferase~~~COG0438
MSRVLLVTNDFPPRRGGIQSYLGEFVGRLVGSRAHAMTVYAPQWKGADAFDDAARAAGYRVVRHPSTVMLPGPTVDVRMR
RLIAEHDIETVWFGAAAPLALLAPRARLAGASRVLASTHGHEVGWSMLPVARSVLRRIGDGTDVVTFVSSYTRSRFASAF
GPAASLEYLPPGVDTDRFRPDPAARAELRKRYRLGERPTVVCLSRLVPRKGQDTLVTALPSIRRRVDGAALVIVGGGPYL
ETLRKLAHDCGVADHVTFTGGVATDELPAHHALADVFAMPCRTRGAGMDVEGLGIVFLEASAAGVPVIAGNSGGAPETVQ
HNKTGLVVDGRSVDRVADAVAELLIDRDRAVAMGAAGREWVTAQWRWDTLAAKLADFLRGDDAAR
>P0CF99 2.4.1.-~~~pimC~~~GDP-mannose-dependent alpha-(1-6)-phosphatidylinositol dimannoside mannosyltransferase~~~
MRVVQVANFYGPRSGGLRTAVDRLGAEYCASGHEVFLIVPGARTERHLLRTGVVRITLPAKHIPYTGGYRAVMPGAVRTV
LETLRPDALEVSDRLTLRSLGRWGREHGVTTVMISHERLDRFAGQLLPRRAAQKFADFANARTAANYDTVVCTTGFAREE
FDRIGATNTVTVPLGVDLKTFHPRRRCARVRQHWATPTQILLVHCGRLSVEKHADRSIDALAALCDAGVDARLVIAGEGP
LRARLERKATGLPIDFTGFISDRHAVAGLLASADVALAPGPHETFGLAALESLACGTPAVVSRTSALTEIITADSGACAD
NRPEAIAHAVRTIVSRPERHRRRCARRRAEIFTWQRAAASMLATLGAMAVSTRCGDTQDTA
>A0R2K8 2.4.1.-~~~pimE~~~Polyprenol-phosphate-mannose-dependent alpha-(1-2)-phosphatidylinositol pentamannoside mannosyltransferase~~~COG1051
MRVNGYRGAKVGAVDTTSPPEVPASARLQRLAPMLLVVSILARLAWTYLVPNGANFVDLHVYVGGADALDGPGALYDYVY
ADQTPDFPLPFTYPPFAAIVFYPLHLLPFGVVAFIWQIGIIAALYGVVRVSQRLMGLQSQRRVAMLWTALGIWTEPLRST
FDYGQVNVVLVLAVLCAVSTTRWWLSGLLVGLAAGIKLTPAVAGLYFLGARRWAAVACSAAVFFATVGVSWLVVGAQARR
YFTELLGDADRIGPIGTSFNQSWRGGISRILGHDAGFGPLVLIGIGITAVLALLAWRAIGGAQDRLGGILVVSLFGLVLS
PISWTHHWVWLIPLMMWLLHGPLSALRGARILGWGWLALTLLGVPWLLSFAQPTIWEIGRPWYLAWAGLVYIVATLATLG
WIAFSRKGSG
>P9WN01 2.4.1.-~~~pimE~~~Polyprenol-phosphate-mannose-dependent alpha-(1-2)-phosphatidylinositol pentamannoside mannosyltransferase~~~COG1051
MCRTLIDGPVRSAIAKVRQIDTTSSTPAAARRVTSPPARETRAAVLLLVLSVGARLAWTYLAPNGANFVDLHVYVSGAAS
LDHPGTLYGYVYADQTPDFPLPFTYPPFAAVVFYPLHLVPFGLIALLWQVVTMAALYGAVRISQRLMGGTAETGHFAAML
WTAIAIWIEPLRSTFDYGQINVLLMLAALWAVYTPRWWLSGLLVGVASGVKLTPAITAVYLVGVRRLHAAAFSVVVFLAT
VGVSLLVVGDEARYYFTDLLGDAGRVGPIATSFNQSWRGAISRILGHDAGFGPLVLAAIASTAVLAILAWRALDRSDRLG
KLLVVELFGLLLSPISWTHHWVWLVPLMIWLIDGPARERPGARILGWGWLVLTIVGVPWLLSFAQPSIWQIGRPWYLAWA
GLVYVVATLATLGWIAASERYVRIRPRRMAN
>A0R036 2.4.1.-~~~~~~Polyprenol-phosphate-mannose-dependent alpha-(1-2)-phosphatidylinositol mannoside mannosyltransferase~~~COG5650
MLEMSKRQSPRGAGLAPTIAWRVFQLLTLAGVLWVGWRLLGRVPYRIDIDVYRMGGRAWLDGRPLYADGAIFHTQGGLDL
PFTYPPLAAIAFAPFAWLSLPLASSAITATTLVLLIVATTIVLTRLDVWPHTTVTSEPAWMRRAWLAAAMVAPAVIYLEP
IRSNFEFGQINVVLMTLVIADCVPRRTPWPRGLLLGLAIALKLTPAVFLLYFLLRRDIHTLLRTAATAVVASLAGFALAW
SDSVEYWTETVRNTDRIGTATLNTNQNIAGALARLGLGESPRFILWVLACFAVLALTVWAARRALRGDTADQTTEAPVLA
LVCVALFGLVVSPVSWSHHWVWMLPVLVVTAVLAYRRRSVWFTALTAAGLALTVWTPITLLPEHRETTASLWRQLAGGSY
VWWAFAVIVVIGLVSSSRTHTGDAHETDEPLVPLARGEAG
>P9WMZ9 2.4.1.-~~~~~~Polyprenol-phosphate-mannose-dependent alpha-(1-2)-phosphatidylinositol mannoside mannosyltransferase~~~COG5650
MSAWRAPEVGSRLGRRVLWCLLWLLAGVALGYVAWRLFGHTPYRIDIDIYQMGARAWLDGRPLYGGGVLFHTPIGLNLPF
TYPPLAAVLFSPFAWLQMPAASVAITVLTLVLLIASTAIVLTGLDAWPTSRLVPAPARLRRLWLAVLIVAPATIWLEPIS
SNFAFGQINVVLMTLVIVDCFPRRTPWPRGLMLGLGIALKLTPAVFLLYFLLRRDGRAALTALASFAVATLLGFVLAWRD
SWEYWTHTLHHTDRIGAAALNTDQNIAGALARLTIGDDERFALWVAGSLLVLAATIWAMRRVLRAGEPTLAVICVALFGL
VVSPVSWSHHWVWMLPAVLVIGLLGWRRRNVALAMLSLAGVVLMRWTPIDLLPQHRETTAVWWRQLAGMSYVWWALAVIV
VAGLTVTARMTPQRSLTRGLTPAPTAS
>P0A7A5 2.1.1.77~~~pcm~~~Protein-L-isoaspartate O-methyltransferase~~~COG2518
MVSRRVQALLDQLRAQGIQDEQVLNALAAVPREKFVDEAFEQKAWDNIALPIGQGQTISQPYMVARMTELLELTPQSRVL
EIGTGSGYQTAILAHLVQHVCSVERIKGLQWQARRRLKNLDLHNVSTRHGDGWQGWQARAPFDAIIVTAAPPEIPTALMT
QLDEGGILVLPVGEEHQYLKRVRRRGGEFIIDTVEAVRFVPLVKGELA
>Q56308 2.1.1.77~~~pcm~~~Protein-L-isoaspartate O-methyltransferase~~~COG2518
MREKLFWILKKYGVSDHIAKAFLEIPREEFLTKSYPLSYVYEDIVLVSYDDGEEYSTSSQPSLMALFMEWVGLDKGMRVL
EIGGGTGYNAAVMSRVVGEKGLVVSVEYSRKICEIAKRNVERLGIENVIFVCGDGYYGVPEFSPYDVIFVTVGVDEVPET
WFTQLKEGGRVIVPINLKLSRRQPAFLFKKKDPYLVGNYKLETRFITAGGNLGNLLERNRKLLREFPFNREILLVRSHIF
VELVDLLTRRLTEIDGTFYYAGPNGVVEFLDDRMRIYGDAPEIENLLTQWESCGYRSFEYLMLHVGYNAFSHISCSI
>A5F9C1 2.1.1.77~~~pcm~~~Protein-L-isoaspartate O-methyltransferase~~~COG2518
MANPKADRLIQFLTEQGITSPQVLAAIHALPREFFVAPAMMHQAYDNNALPIGQGQTISQPYIVAKMTELLALTPETKVL
EIGTGSGYQTAVLAKLVNHVFTVERIKTLQWDAKRRLKQLDIYNVSTKHGDGWQGWPARGPFDAILVTAAAAKVPQSLLD
QLAEGGRMVIPVGEDEQYLYKIVRQGGQFISERVEAVRFVPLVAGDLA
>P0ADI0 3.1.22.-~~~pinR~~~Serine recombinase PinR~~~COG1961
MSRIFAYCRISTLDQTTENQRREIESAGFKIKPQQIIEEHISGSAATSERPGFNRLLARLKCGDQLIVTKLDRLGCNAMD
IRKTVEQLTETGIRVHCLALGGIDLTSPTGKMMMQVISAVAEFERDLLLERTHSGIVRARGAGKRFGRPPVLNEEQKQVV
FERIKSGVSISAIAREFKTSRQTILRAKAKLQTPDI
>Q8ZMM8 ~~~pipB2~~~Secreted effector protein PipB2~~~
MERSLDSLAGMAKSAFGAGTSAAMRQATSPKTILEYIINFFTCGGIRRRNETQYQELIETMAETLKSTMPDRGAPLPENI
ILDDMDGCRVEFNLPGENNEAGQVIVRVSKGDHSETREIPLASFEKICRALLFRCEFSLPQDSVILTAQGGMNLKGAVLT
GANLTSENLCDADLSGANLEGAVLFMADCEGANFKGANLSGTSLGDSNFKNACLEDSIMCGATLDHANLTGANLQHASLL
GCSMIECNCSGANMDHTNLSGATLIRADMSGATLQGATIMAAIMEGAVLTRANLRKASFISTNLDGADLAEANLNNTCFK
DCTLTDLRTEDATMSTSTQTLFNEFYSENI
>Q8ZQ59 ~~~pipB~~~Secreted effector protein PipB~~~
MPITNASPENILRYLHAAGTGTKEAMKSATSPRGILEWFVNFFTCGGVRRSNERWFREVIGKLTTSLLYVNKNAFFDGNK
IFLEDVNGCTICLSCGAASENTDPMVIIEVNKNGKTVTDKVDSERFWNVCRMLKLMSKHNIQQPDSLITEDGFLNLRGVN
LAHKDFQGEDLSKIDASNADFRETTLSNVNLVGANLCCANLHAVNLMGSNMTKANLTHADLTCANMSGVNLTAAILFGSD
LTDTKLNGAKLDKIALTLAKALTGADLTGSQHTPTPLPDYNDRTLFPHPIF
>P0DPS1 2.7.7.-~~~pi-polB~~~Primer-independent DNA polymerase PolB~~~
MSNNLQDILAAASGYQSVTSEPALNRKRPKTLDDYPVIPPASKKVSVISSDLTLHIGFDTEYVFNPETRQNDILSYQSYV
VLPDNTGISNIIYPPDSQKKSRLSFKDFLCQTITPLLETGVITKWPGIINIYAHFIRADIASFANFWSDYKILLKGIRGT
VSSFKNRYGIDFDEQQERRVKTEQIMFDKRTSPPRCSNVAFIDTLLITPGGMGLAECGELLGLPKLTIPAPYSITNMREY
LLGDRAGFEAYALRDAEIAVRYALQVRNFCARELMIDRVPATIGAMAVSRFTKTLKENNMSPEVCLGTHIKTRELWLTEK
QAFRTIKNPASVPSRELFETFPINCYHGGRNECFMMGVTPSDHWYDYDLAGAYTTGLLDILTPDYGNIRLSKNPDDYCGH
VMGFALVTFRFPESVPYPSLPVRTDQYGLFFPLSGESWATAPEIELALSLGAEMTIHNGIIVPWICDTSPHNSESTSVFL
PFVQQVRENRNRHIKGSLEEKFWKEIGNSLYGKLAQGLRAKTAFDTARGLNRSLPPSSVTQPFFAAHVTGFIRAVVGELM
NALPSDSSVVSVTTDGFLTNCPLDKINMSGPLSSRFQSLCDIVDPGSSMLTCKHEVSQLIAMKTRGQLTYKAIQGKPVVH
ARAGVKPPADIPRSDYNDYMVDLYLNRLPGQTLSRSTLISTREMWLSESDLVSREQDIRLNLEFDFKRQPVRPAMNEGHL
LMFSRPWDNMEEALQQRSLFDDWRQTHTLKTLADWDDWCDFLYCRTVFSDMKLKVGSKRSDDILVRLFLRALTQCQWGLM
LKDKKSYSCKEVAEWLTSEGYSVTVTDVKNAVRAKIPQMKFSSVTPRMKSLMDIIARKYPTFCLPV
>Q8NPZ2 2.7.8.-~~~~~~Phosphatidylinositol phosphate synthase~~~COG0558
MLGLHGRKPAQVIVEPVAKLMIKLKVTPNQLTLVSAGLTVGVALLLIPTGHLIWAAVLTGLFAAFDMIDGTVARMQGGGT
KFGATLDATCDRITDGALFGAITWWLVYSYDAPQALVAASLVCLVASQVISYVKARGEASGFTMDGGLVERPERLIVSLV
GLGLTGMGVPYAIDVALWALAAGSIYTVVQRLVMAGKSPLAKEFTKAPAGAKADYSNTK
>Q6A8U1 2.7.8.-~~~~~~Phosphatidylinositol phosphate synthase~~~COG0558
MLEHFRAGWAKVMNPIADALLRAHVTPDVVTWIGTIGAVLMALICFPQGWLWQGPWLVTLFIFSDSLDGNMARKLGRHSQ
WGSFLDSTLDRFGDAAIFTGVALYFAGPGNSVLWTAMACAALVFGMATSYVRAKAESLALEAKVGIATRADRLLVSLVAI
EITGLARVGAFPHWCVVALPIALCYLTLAGAITVVQRMVAVRRACES
>B1MCK4 2.7.8.-~~~~~~Phosphatidylinositol phosphate synthase~~~
MSGLLSRETFAKITNPLASALLRAGFTPDTVTIFGTAASVVAALTLFPTGHLFWGGMAVWLFAMFDMLDGAMARARGGGT
RFGAVLDATCDRVADGAVFAGLVWWAAFGWGSTSLVVATLICMITSQVISYVKARAEASGLRADGGLIERPERLIIVLAG
AIFSGGFGVQWPLHTAMWVLAVASLVTVAQRMHAVRTSPGALDLLPNSDAGQDTAETNQP
>A0A0H3MFZ3 2.7.8.-~~~pgsA1~~~Phosphatidylinositol phosphate synthase~~~
MSKLPFLSRAAFARITTPIARGLLRVGLTPDVVTILGTTASVAGALTLFPMGKLFAGACVVWFFVLFDMLDGAMARERGG
GTRFGAVLDATCDRISDGAVFCGLLWWIAFHMRDRPLVIATLICLVTSQVISYIKARAEASGLRGDGGFIERPERLIIVL
TGAGVSDFPFVPWPPALSVGMWLLAVASVITCVQRLHTVWTSPGAIDRMAIPGKGDR
>B2HM76 2.7.8.-~~~pgsA1~~~Phosphatidylinositol phosphate synthase~~~COG0558
MSKAPFLSRAAFARVTNPLARGLLRIGLTPDAVTIIGTTASVAGALVLFPMGKLFPGACVVWFFVLFDMLDGAMARERGG
GTRFGAVLDAACDRISDGAVFGGLLWWVAFGMRDRLLVVATLICLVTSQVISYIKARAEASGLRGDGGIIERPERLIIVL
AGAGVSDFPFIAWPPALPVAMWLLAVTSVITCGQRLYTVWTSPGATDLLVPSAPVRDDDAQGHPRSGDPGKTQR
>Q9F7Y9 2.7.8.-~~~pgsA~~~Phosphatidylinositol phosphate synthase~~~COG0558
MSNVYLMTRAAYVKLSRPVAKAALRAGLTPDIVTLAGTAAAVIGALTLFPIGQLWWGAVVVSFFVLADMLDGAMAREQGG
GTRFGAVLDATCDRLGDGAVFAGLTWWAAFGLDSPSLVVATLICLVTSQVISYIKARAEASGLRGDGGIIERPERLVIVL
IGAGLSDLPFFPLPWTLHVAMWVLAVASVVTLLQRVHAVRTSPGAMEPLHPANGEKPETSEP
>P9WPG7 2.7.8.-~~~pgsA1~~~Phosphatidylinositol phosphate synthase~~~COG0558
MSKLPFLSRAAFARITTPIARGLLRVGLTPDVVTILGTTASVAGALTLFPMGKLFAGACVVWFFVLFDMLDGAMARERGG
GTRFGAVLDATCDRISDGAVFCGLLWWIAFHMRDRPLVIATLICLVTSQVISYIKARAEASGLRGDGGFIERPERLIIVL
TGAGVSDFPFVPWPPALSVGMWLLAVASVITCVQRLHTVWTSPGAIDRMAIPGKGDR
>Q827U4 2.7.8.-~~~pgsA2~~~Phosphatidylinositol phosphate synthase~~~COG0558
MGQPVASRGRAATPTIGKAMLNKYARAFFTRVLTPFAAFLIRRGVSPDTVTLLGTAGVIAGALVFYPRGEFFWGTIVITL
FVFSDLVDGNMARQLGRTSRWGAFLDSTLDRVADGAIFGGFALWYAGGGDNNVLCAVSIFCLASGQVVSYTKARGESIGL
PVAVNGLVERAERLVISLVAAGFAGLHKFGVPGIQVLLPIALWIVAVGSLVTLIQRVVTVRRESAEADAATAAENASQGS
GAAS
>P46547 3.4.11.5~~~pip~~~Proline iminopeptidase~~~
MSSPLHYVLDGIHCEPHFFTVPLDHQQPDDEETITLFGRTLCRKDRLDDELPWLLYLQGGPGFGAPRPSANGGWIKRALQ
EFRVLLLDQRGTGHSTPIHAELLAHLNPRQQADYLSHFRADSIVRDAELIREQLSPDHPWSLLGQSFGGFCSLTYLSLFP
DSLHEVYLTGGVAPIGRSADEVYRATYQRVADKNRAFFARFPHAQAIANRLATHLQRHDVRLPNGQRLTVEQLQQQGLDL
GASGAFEELYYLLEDAFIGEKLNPAFLYQVQAMQPFNTNPVFAILHELIYCEGAASHWAAERVRGEFPALAWAQGKDFAF
TGEMIFPWMFEQFRELIPLKEAAHLLAEKADWGPLYDPVQLARNKVPVACAVYAEDMYVEFDYSRETLKGLSNSRAWITN
EYEHNGLRVDGEQILDRLIRLNRDC
>O05420 3.4.11.5~~~fpaP~~~Proline iminopeptidase~~~COG2267
MIPITTPVGNFKVWTKRFGTNPKIKVLLLHGGPAMTHEYMECFETFFQREGFEFYEYDQLGSYYSDQPTDEKLWNIDRFV
DEVEQVRKAIHADKENFYVLGNSWGGILAMEYALKYQQNLKGLIVANMMASAPEYVKYAEVLSKQMKPEVLAEVRAIEAK
KDYANPRYTELLFPNYYAQHICRLKEWPDALNRSLKHVNSTVYTLMQGPSELGMSSDARLAKWDIKNRLHEIATPTLMIG
ARYDTMDPKAMEEQSKLVQKGRYLYCPNGSHLAMWDDQKVFMDGVIKFIKDVDTKSFN
>P46544 3.4.11.5~~~pepIP~~~Proline iminopeptidase~~~
MMQITEKYLPFGNWQTYCRIVGEATDRAPLLLLHGGPGSSHNYFEVLDQVAEKSGRQVIMYDQLGCGNSSIPDDQAETAY
TAQTWVKELENVREQLGLDQIHLLGQSWGGMLALIYLCDYQPEGVKSLILSSTLASAKLWSQELHRLIKYLPKGEQAAIK
EAETTGNYDSLAYQAANAHFMDQHAIKLTPDLPEPVLRKKKGGSLAYLTGWGPNEYTPIGNLHGYEYTDRLKDLHLPALI
TSGTDDLCTPLVAKSMYDNLPNARWELFAGCGHMPFVQENAKYQELLSDWLISQD
>P46542 3.4.11.5~~~pip~~~Proline iminopeptidase~~~
MQITEKYLPFGNWQTYCRIVGEATDRAPLLLLHGGPGSSHNYFEVLDQVAEKSGRQVIMYDQLGCGNSSIPDDQAETAYT
AQTWVKELENVREQLGLDQIHLLGQSWGGMLALIYLCDYQPKGVKSLILSSTLASAKLWSQELHRLIKYLPKGEQAAIKE
AETTGNYDSPAYQAANAHFMDQHAINVTPDLPEPVLRKKKGGNLAYLTGWGPNEYTPIGNLHGYEYTDRLKDLDLPALIT
SGTDDLCTPLVAKSMYDHLPNARWELFAGCGHMPFVQENAKYQELLSDWLISQD
>P52278 3.4.11.5~~~pip~~~Proline iminopeptidase~~~COG2267
MEIIEGKMPFMGYETYYRIVGERSEKPPLVLLHGGPGSSHNYFEVLDELAQKDGRRIIMYDQLGCGESSIPDDHPELYTK
ETWVKELEALREHLALRKMHLLGQSWGGMLAIIYMCDYHPEGIQSLILSSTLSSASLWSKELHRMIKYLPIEEQAAIHRA
ELTGNFNDPDYLKANEHFMNQHAIDMTKTWPECVMRKKRGGTVAYETAWGPNEYTPEGNLHDYEYTDKLSKIKVPTLITS
GTDDLCTPYVAKTMQDQIASSKWRLFEGCGHMSFVEKTDEYVALLQEWLDQHDE
>P42786 3.4.11.5~~~pip~~~Proline iminopeptidase~~~
MYEIKQPFHSGYLQVSEIHQIYWEESGNPDGVPVIFLHGGPGAGASPECRGFFNPDVFRIVIIDQRGCGRSHPYACAEDN
TTWDLVADIEKVREMLGIGKWLVFGGSWGSTLSLAYAQTHPERVKGLVLRGIFLCRPSETAWLNEAGGVSRIYPEQWQKF
VAPIAENRRNRLIEAYHGLLFHQDEEVCLSAAKAWADWESYLIRFEPEGVDEDAYASLAIARLENHYFVNGGWLQGDKAI
LNNIGKIRHIPTVIVQGRYDLCTPMQSAWELSKAFPEAELRVVQAGHCAFDPPLADALVQAVEDILPRLL
>O32449 3.4.11.5~~~pip~~~Proline iminopeptidase~~~
MEQLRGLYPPLAAYDSGWLDTGDGHRIYWELSGNPNGKPAVFIHGGPGGGISPHHRQLFDPERYKVLLFDQRGCGRSRPH
ASLDNNTTWHLVADIERLREMAGVEQWLVFGGSWGSTLALAYAQTHPERVSEMVLRGIFTLRKQRLHWYYQDGASRFFPE
KWERVLSILSDDERKDVIAAYRQRLTSADPQVQLEAAKLWSVWEGETVTLLPSRESASFGEDDFALAFARIENHYFTHLG
FLESDDQLLRNVPLIRHIPAVIVHGRYDMACQVQNAWDLAKAWPEAELHIVEGAGHSYDEPGILHQLMIATDRFAGK
>P46541 3.4.11.5~~~pip~~~Proline iminopeptidase~~~
MYTEGFIDVTGGRVSFQKFDENGGGTPVIVLHGGPGSSCYSLLGLKALAKDRPVILYDQLGCGKSDRPMDTTLWRLDRFV
EELAQIRQALNLDEVHILGHSWGTTLAAAYCLTKPSGVKSVIFSSPCLSAPLWEQDQKRNLKKLPLDVQETINRCEENGT
TDSEEFAAAIEVFGKHFVNRLEKQPEWLEQKPSGYRNADIYNIMWGPSEFTVLGNLKNFDCTTQLKEITCPSLYTCGRFD
EATPETTEYYSSLTPKSKFHVFEKSAHMPYIEEPEEYLAVIGDFLNSI
>P52279 3.4.11.5~~~pip~~~Proline iminopeptidase~~~
MRTLYPEITPYQQGSLKVDDRHTLYFEQCGNPHGKPVVMLHGGPGGGCNDKMRRFHDPAKYRIVLFDQRGSGRSTPHADL
VDNTTWDLVADIERLRTHLGVDRWQVFGGSWGSTLALAYAADPSAAGHQLVLRGIFLLRRFELEWFYQEGASRLFPDAWE
HYLNAIPPVERADLMSAFHRRLTSDDEATRLAAAKAWSVWEGATSFLHVDEDFVTGHEDAHFALAFARIENHYFVNGGFF
EVEDQLLRDAHRIADIPGVIVHGRYDVVCPLQSAWDLHKAWPKAQLQISPASGHSAFEPENVDALVRATDGFA
>P03067 ~~~pir~~~PI protein~~~
MRLKVMMDVNKKTKIRHRNELNHTLAQLPLPAKRVMYMALAPIDSKEPLERGRVFKIRAEDLAALAKITPSLAYRQLKEG
GKLLGASKISLRGDDIIALAKELNLPFTAKNSPEELDLNIIEWIAYSPDEGYLSLKFTRTIEPYISSLIGKKNKFTTQLL
TASLRLSSQYSSSLYQLIRKHYSNFKKKNYFIISVDELKEELIAYTFDKDGNIEYKYPDFPIFKRDVLNKAIAEIKKKTE
ISFVGFTVHEKEGRKISKLKFEFVVDEDEFSGDKDDEAFFMNLSEADAAFLKVFDETVPPKKAKG
>P80569 ~~~pisA~~~Bacteriocin piscicolin-126~~~
MKTVKELSVKEMQLTTGGKYYGNGVSCNKNGCTVDWSKAIGIIGNNAAANLTTGGAAGWNKG
>P0AFJ7 ~~~pitA~~~Low-affinity inorganic phosphate transporter PitA~~~COG0306
MLHLFAGLDLHTGLLLLLALAFVLFYEAINGFHDTANAVATVIYTRAMRSQLAVVMAAVFNFLGVLLGGLSVAYAIVHML
PTDLLLNMGSSHGLAMVFSMLLAAIIWNLGTWYFGLPASSSHTLIGAIIGIGLTNALMTGTSVVDALNIPKVLSIFGSLI
VSPIVGLVFAGGLIFLLRRYWSGTKKRARIHLTPAEREKKDGKKKPPFWTRIALILSAIGVAFSHGANDGQKGIGLVMLV
LIGVAPAGFVVNMNATGYEITRTRDAINNVEAYFEQHPALLKQATGADQLVPAPEAGATQPAEFHCHPSNTINALNRLKG
MLTTDVESYDKLSLDQRSQMRRIMLCVSDTIDKVVKMPGVSADDQRLLKKLKSDMLSTIEYAPVWIIMAVALALGIGTMI
GWRRVATTIGEKIGKKGMTYAQGMSAQMTAAVSIGLASYTGMPVSTTHVLSSSVAGTMVVDGGGLQRKTVTSILMAWVFT
LPAAVLLSGGLYWLSLQFL
>P43676 ~~~pitB~~~Low-affinity inorganic phosphate transporter PitB~~~COG0306
MLNLFVGLDIYTGLLLLLALAFVLFYEAINGFHDTANAVAAVIYTRAMQPQLAVVMAAFFNFFGVLLGGLSVAYAIVHML
PTDLLLNMGSTHGLAMVFSMLLAAIIWNLGTWFFGLPASSSHTLIGAIIGIGLTNALLTGSSVMDALNLREVTKIFSSLI
VSPIVGLVIAGGLIFLLRRYWSGTKKRDRIHRIPEDRKKKKGKRKPPFWTRIALIVSAAGVAFSHGANDGQKGIGLVMLV
LVGIAPAGFVVNMNASGYEITRTRDAVTNFEHYLQQHPELPQKLIAMEPPLPAASTDGTQVTEFHCHPANTFDAIARVKT
MLPGNMESYEPLSVSQRSQLRRIMLCISDTSAKLAKLPGVSKEDQNLLKKLRSDMLSTIEYAPVWIIMAVALALGIGTMI
GWRRVAMTIGEKIGKRGMTYAQGMAAQMTAAVSIGLASYIGMPVSTTHVLSSAVAGTMVVDGGGLQRKTVTSILMAWVFT
LPAAIFLSGGLYWIALQLI
>Q87B49 ~~~pilY1_1~~~Type IV pilus biogenesis factor PilY1 homolog PD_1611~~~
MLVMGRDHKLYYEAYNDASDLDGDGVLDVGYKPDKITYYGYYNSNVCYRANGTMFQAMSVANGSNGKKCTGAWSGDFLNY
LTTSRMDALRKVLFGGYREVDTLTQTILRASYTPQDAHSWGKEYASVSHDGYDISDYAPLSAPGSGHYHLFAVTTLSDNG
IPQLRVLADTTFRVWNWVSIERPVAGSDCFTPENKRVSCVSGGSRGISDYPLRVEVCSVADELRESNCKLYQNNTSYKPT
GILHDYGENDRMYFGLLTGSYQKNITGGVLHSNVSNFSREINPLTGQFCLNGNCGGGGDVKGIVHTISSFRMLDFNYKDY
TYGCGWIATRPVKEGECWMWGNPVAEMMYETLRYFGGATAPRPEYDINSSSQDIATLQLSHPGWKPPYTSVDKGGSGYSV
CAQPTMTVFSDINPSYDDKLPGSHWSNFSGSGDPASMRNLDVSAEADRIWEAEGGGSKLFFIGESNNNSDNAPTPKVVSN
LSTVRGLSPEEPSKGGTYYSAAVARYGANHKMGGSKFVRTYAVALASPLPKFEFPVGNARVSLVPFAKSVRGFGISATGN
FQPTNQIIDFYVQRVANMAGSSGADYDATINGGRPYAEFRINYEDVEQGADHDMDAIALYTIYVNAKNQLVVTLKSEYSA
GSIDQHMGYVISGTTRDGVYLEICDLADGHSNDGTRSSCAGQQPYKLNTPPNRLPGYCNATPMPGDCNGLPPVATRIFSV
ASQGANAVLLKDPLWYAAKYGHDQGVILNSSGGLANYFPVNNALYLRQQVAKAFSAIQSQAGSSGSIAVVGASVSSTSFA
VIPSYSSTHDGKDWTGEMTAYRIDANGMIGDVLWLASAGVPSGTSAIAKRVIYTALSHVDDTNRASVVRRFVAEKLVDSS
SGDVATDAAQVFGRLGYTPRGVIDDFGSSVTPNQLVNYLRGDKRMEGATLNTAPFRRRFGPLGDMINSIPVVATRRANYG
WATASGLPQVQRDSYSAFINARQNSTAAEHIFVGANDGMLHAFDDKGTERFAYVPNGVLHHLGFLANPEYQHHYYVDGKS
TLSDAYLDGSWRSVLVGGTGAGGRSMFALDVTSPSTFNESNVLWEMNSENDDDMGYTMGKPYIVPLQNGSWAAIFGNGYN
STNGRAVLFIVNLATGQLIRKIEARDGIDPDGSDPANMGYNGLGNLAVLDTDGDGLVDMVYGADLHGNLWKFNLSGKDPQ
RWGIAYKDGLGNPIPLFVARNPQGYRQPITGGLEVAVGPSAGYIIYFGSGRYFAANDNNSKDLSTLYGIWDSGSPVIAGR
AALSAQIIQASDHPTSPDTRIVTRRPLSYLSKHGWYVDLVVQGQDPQGERSIATPLLQGGRVFFSTYVPGVSVNCASGGS
NWLYVLDAASGGAALGQVNIPSRGSRSSIGNSDTGAVSTGGDAPIQSVAMTRTAPRQPVFCNPGEQGCPLVPETHAAPLD
TRCSEVIIDPNDPTRSISLFRACGRQSWRQLR
>Q87E25 ~~~pilY1_2~~~Type IV pilus biogenesis factor PilY1 homolog PD_0502~~~
MVGMSRIILNNLFFFRCVVAVFSAHSLVISGAVHAGVQISQSPLHGGGDVPGNLAIVASIEFPTVISVANLADTYTPGVR
YVGYFDSNKCYKYHYSSRELDRYFYPVASPRPQANYGCNTTGGVWAGNFLNWAATQTIDPFRSALTGGYRVRDTANETIL
EKAVMDRAYPGNFPRRTIEGLALTTSVPAQWLRFRMRIDGLGNRMRFTQFPGLSTDPLNTEGQPYDPSRHPLNSNDRGVY
EVSVRVKVCDASVGLESNCVAYPSGFYKPEGLVQEYSKRVRYSVFSYKNDDYYLDDGGVLRARQKFVGPKTYYPEQGEKT
NPHAEWHPRTGVLYDNPNPEDAKATTHRVGRTIGNSGVINYLNKFSQMETGKNTKGFDPVSELYYTAYRYFKRLGNVPEY
SVLTGSVNEKYQQADAFPVITDWDDPIRYACQSNVVLGIGDTHTNFDKNLPGNTNTTGEPVKPQAVRNDRSIDVVKRMAQ
IFQMEGMRQQDAMTSALAPSFNFLVPGAGNNSAYIAALAYDAHTKDMRPDLEGDQLLTTHWVDVVEGGDYKIPISTNQYW
LAAKYGGFQVPAGYDPDKTVNPLSEATWWTNGEYVNGDTKAKRADNFYIAADAEKMVASLKHAFSRIVAEIKGAGTGLSS
NSARLETGAVTYQAQFFSGTWRGDLIAYHVDKVTGALTPFWNANFPAWEQRVIKFANATTLQDFTKKNLGQTALASASAQ
QINYLRGDRSQEGNVPGKLRIRSGIMGDIVNSQPLYVGAPNGRLYTTANFTGASAYAAFAAQQANRVPVVYVGANDGMLH
AFDANTGKEIFAFVPRAAMPKLLEYTDQNYGHQYYVDGELTAADIYDTKLGWRSVLVGTLGRGGKGLFALDVTDPSNIRL
LWDKTSADIGGLGNTLSKPMIAQTSDGTWSVLLGNGPNSTADNAQLIVMNLLTGHAAQVPVSKTSNNGLSGVFPWSSQSN
GITDRVYAGDLLGTLWRFTFSDNAWKVAPLFTATYQGKAQPISATPLGAIERSTGRMWIFFGTGRALSSHDMDNKEVQSW
YGLIDQGTTIPGRTRLSQVQIVDEGVVNGYAVRTVSDPKNIGTDGWYMDLISPKSGKQGERMIVSNMFRGAALIGTTRIP
DNSDICKLSGSGFVMAINPFTGGRLGQWFFDLNTGGGSGGALNGNPVSGVGVSSAPNSPVFTGNIMQIGADDGTVTSLKT
PSSGGLNINRVSWREILRP
>Q87FA5 ~~~pilY1_3~~~Type IV pilus biogenesis factor PilY1 homolog PD_0023~~~
MKKTVFNPALNRAAAILIGTLVGISGVVHASVDISSSPLHGGKDVPGNLAILASVEFPTLISVANLADTYTPGVRYVGYF
DSNKCYKYHYSSRELDRYFYPVASPRPQANYGCNTTGGVWAGNFLNWAATQTIDPFRSALTGGYRVRDTVSETILEKAVM
DRDSQENFPRRNVAGRNVLATLVPTQWNNFRIRIDGLGNRMRFTQFSSLWTDPLNTEGQPYDSSRHPLNRNDRGVYEVSV
RVKVCDPSAGLESNCVAYPRGSYKPEGLIQEYSKRIRYSVFGYKNDHSYLIDGGVLRARQKFVGPQTHYPEQGKKTNSHA
EWDPQTGILYDNPDPEDAAATTRRVGRTIANSGVINYLNKSGQMDTGRISKIYDPVSELYYTAYRYFKRLGNVPEYSVLT
GSVNEKYQQADAFPVITDWDDPIRYACQSNVVLGIGDTHTNQDKNLPGNTNTMEEPSKPQTVRNDRSIDVVKRMAQIFRM
EGMRQQDAMTSAVAPKFNFHRYNSAYIAALAYDAHTKDMRPDLEGDQLLTTHWVDVVEAGDYKIPISTNQYWLAAKYGGF
QVPAGYDPDKTVNPLSEATWWTNGEYVNNDLKANAKRADNFYVAADAEKMVASLKHAFSRIVAEIKGAGTGLSSNSARLE
TGAVTYQAQFFSGTWRGDLIAYHVDKVTGALTPFWNANFPAWEQRVIKFANATTLQDFTKKNLGQTALASASAQQINYLR
GDRSQEGNVPGKLRIRSGIMGDIVNSQPLYVGAPNGRLYTTANFTGASAYAAFAAQQANRVPVVYVGANDGMLHAFDANT
GKEIFAFVPRAAMPKLLEYTDQNYGHQYYVDGELTAADIYDTKLGWRSVLVGTLGRGGKGLFALDVTDPSNIRLLWDKTS
ADIGGLGNTLSKPMIAQTSDGTWSVLLGNGPNSTADNAQLIVMNLLTGHAAQVPVSKTSNNGLSGVFPWSSQSNGITDRV
YAGDLLGTLWRFTFSDNAWKVAPLFTATYQGKAQPISATPLGAIERSTGRMWIFFGTGRALSSHDMDNKEVQSWYGLIDQ
GTTIPGRTRLSQVQIVDEGVVNGYAVRTVSDPKNIGTDGWYMDLISPKSGKQGERMIVSNMFRGAALIGTTRIPDNSDIC
KLSGSGFVMAINPFTGGRLGQWFFDLNTGGGSGGALNGNPVSGVGVSSAPNSPVFTGNIMQIGADDGTVTSLKTPSSGGL
NINRVSWREILRTE
>Q9KIG4 2.7.11.1~~~spk1~~~Serine/threonine-protein kinase PK-1~~~COG0515
MDTTLQDPLVGQVLDGRYRVDARIAVGGMATVYRAVDTRLDRVLALKVMHPSLAADASFVERFIREAKSVARLAHPNVVQ
VFDQGTDGAYVYLAMEYIAGCTLRDVLRERGALQPRAALDILEPVLAALGAAHRAGFVHRDMKPENVLIGDDGRVKVADF
GLVRAVDSVTNTTGTVLGTVSYLAPEQIEHGTADPRVDVYACGILLYEMLTGEKPHDGDSPAIVLYKHLHDDVPPPSAAV
PGMAYELDELVASATARGPEVRPHDAVALLARARDARARLGDEQLDAVPPQALASEHDNADDRTSVIPRALTVRRPLPVN
EEDEGADAAHRTSRFRSPPPLPPRGRTALRRGPMAIVIGVLLVLGLGAGVWYINSGQFTKVPPLLAKTEKEARDRLADAG
LDAGQVSEAYSDTVERGSVATDPEAGARIRTNDSVSLTLSKGPRTVRVPDLDGYPQDKARSLLEDEGLKPGMSTREFSDS
VPAGSVISTEPGKGTEVRAGSAVALTVSKGAPVDVPDVAGDDLEDARAELEEAGLEVKVATERVTSEYDAGRVARQDPGP
GGRVAEGDTVTLTLSKGPEMAEVPDVVGDSVGEAREKLEGAGFRVDEDRGLLGLFGDTVKGQSVDGGDSAPKGSTITIEI
R
>Q9I6Z1 2.7.4.34~~~ppk2~~~GDP-polyphosphate phosphotransferase~~~
MSEEPTVSPPSPEQPAAQPAKPARPAARRAPRKPATRRPRVASPAQKAREEIQAISQKPVALQVASAPHGSSEDSTSASL
PANYPYHTRMRRNEYEKAKHDLQIELLKVQSWVKETGQRVVVLFEGRDAAGKGGTIKRFMEHLNPRGARIVALEKPSSQE
QGQWYFQRYIQHLPTAGEMVFFDRSWYNRAGVERVMGFCSPLQYLEFMRQAPELERMLTNSGILLFKYWFSVSREEQLRR
FISRRDDPLKHWKLSPIDIKSLDKWDDYTAAKQAMFFHTDTADAPWTVIKSDDKKRARLNCIRHFLHSLDYPDKDRRIAH
EPDPLLVGPASRVIEEDEKVYAEAAAAPGHANLDIPA
>Q92SA6 2.7.4.-~~~~~~ADP-polyphosphate phosphotransferase 1~~~COG2326
MALDEAPAEARPGSRAVELEIDGRSRIFDIDDPDLPKWIDEEAFRSDDYPYKKKLDREEYEETLTKLQIELVKVQFWMQA
TGKRVMAVFEGRDAAGKGGAIHATTANMNPRSARVVALTKPTETERGQWYFQRYVATFPTAGEFVLFDRSWYNRAGVEPV
MGFCTPDQYEQFLKEAPRFEEMIANEGIHLFKFWINIGREMQLKRFHDRRHDPLKIWKLSPMDIAALSKWDDYTGKRDRM
LKETHTEHGPWAVIRGNDKRRSRINVIRHMLTKLDYDGKDEAAIGEVDEKILGSGPGFLR
>Q5LX16 2.7.4.-~~~~~~NDP-polyphosphate phosphotransferase 1~~~COG2326
MTHESDDPSLDWLEAELEDSLDEDFEIEFSEPMLSMEIRRIYKDQRPDLLDRQVYFRNLLRLQAELIKLQDWVQHTNSKV
LIIMEGRDAAGKGGVIKRITQRLNPRIARVVALPAPSRREQSQWYFQRYVPYLPSGGEMVLFDRSWYNRAGVERVMGFAT
EDQVEQFFQDVPEFERMLVRSGIILLKYWFSITDEEQQLRFLMRVHDPMKQWKLSPMDLESRIRWEQYTKAKEQMFSRTN
IPEAPWYIVEGNDKKRERLNCIEHLLSKIPYEDIPHEKVTLPDRRYNPDYERQVLPDELYVPKVY
>Q8NM65 2.7.4.1~~~ppk2B~~~Polyphosphate kinase PPK2B~~~COG2326
MVGKLPIMAETNENDLPVIDLAQIEGYVVDDSDEDDPVLLRPDGTPIETWREDFPYEERVTREDYEKVKRSLQIELLKWQ
NWTKETGQRHIILFEGRDAAGKGGTIKRFNEHLNPRGARTVALEKPSPRESTSWYFQRYIQHFPAAGEIVFFDRSWYNRS
GVERVMGFCTESQHAEFLREVPMLENMILGSGISLTKFWFSVTRKEQRTRFAIRQVDPVRQWKLSPMDLASLDRWDDYTR
AKEEQFRYTDTDESPWITIKSNDKKRARINAMRYVLSKFDYTDKDYELVGEPDPKVVLRGRDQIGD
>Q9I154 2.7.4.-~~~~~~ADP-polyphosphate phosphotransferase~~~
MDSYGDTSGRIGRDWLDRHDEELEQELLDDELNLDELFGPEQEDAPGELSRRRYFRELFRLQRELVKLQNWVVHTGHKVV
ILFEGRDAAGKGGVIKRITQRLNPRVCRVAALPAPNDREQTQWYFQRYVSHLPAGGEIVLFDRSWYNRAGVERVMGFCND
EQYEEFFRSVPEFEKMLARSGIQLLKYWFSISDAEQHLRFLSRIHDPLKQWKLSPMDLESRRRWEAYTKAKETMLERTHI
PEAPWWVVQADDKKRARLNCIHHLLQQMPYREVPQPPVHLPERLRHADYVRHPTPGEIIVPEVY
>Q930V2 2.7.4.-~~~~~~ADP-polyphosphate phosphotransferase 2~~~
MSNSKDEVERIDWLEAELADTIDEDYELELSEPTLSEKIREIYRKAHPPALPRMDYFRALLALQAELIKLQDWVVYHKQK
VVVIFEGRDAAGKGGVIKRITQRLNPRIVRTVALPAPSDREKTQWYFQRYVPHLPAGGEIVLFDRSWYNRCGVERVMGFA
TEEEVEQFFDDVPEFERMLVRSGVRLVKYWFSITDEEQQLRFLTRIHDPLKQWKLSPMDLQSRVRWEAYTKAKEETFART
NIREAPWHIVEANDKKRARLNCIDHLLKQIPYEDVPHEDITLPERIFNPNYERKVLPPELYVPAKY
>Q5LU04 2.7.4.-~~~ppk2~~~NDP-polyphosphate phosphotransferase 2~~~COG2326
METAKPIAPQKDSKANGVDATDPVVKVASPQDPAGDAKVEDATAPVAEVEPRTPRNRRLPTPENVRHAFESGKYPYSRKM
SRRPYEAEKAMLQAELLKVQLWAQETGERFVLLFEGRDAAGKGGTIKRFMEHLNPRQARVVALNKPTWEEKGQWYYQRYV
QELPTVGEMVFYDRSWYNRAGVERVMGFCTPNEYLEFMRQTPDLERMLVRSGIRLYKYWFSVTQEEQQRRFKSRETDPLK
QWKLSPIDKASLDKWDDYTEAKEAMFFYTDTADAPWTIIKSNDKKRARLNCMRHFLSTIDYPGKDKHVVGEPDPLIVGRA
HHVIQKSEHILGTALHPDQRARQD
>Q92ZU4 2.7.4.-~~~~~~ADP-polyphosphate phosphotransferase 3~~~
MDKHTDDRKKNNHWKAEDRKSAATEASETRSGGNYAKELARLQEEIAHLQAWVKKTGARIVIVFEGRDAAGKGGVIKRIT
ERVSPRVFRVVALPAPTDREKTQIYMQRYIQQFPAAGEVVIFDRSWYNRPGVERVMGFCSEKKAKRFLEIAPRFEAAMIE
SGIVLLKYFLDVSEEEQDRRFRQRINDPLRQWKLSPMDVESYRRWWDYTRAYDEMIRMTDTDDAPWWIVPSDNKKQARVN
CIAHILSSIPYERVKFEDPDLGKRQKRPADFEGDTRRRTVPNLF
>Q5LSN8 2.7.4.-~~~~~~NDP-polyphosphate phosphotransferase 3~~~COG2326
MNRNGSTKDPRRMTGAATGEISRYFNDKAPKDIRRAIEKADKDDILSTTYPYDAEMTAKDYRAQMEALQIELVKLQAWIK
QSGARVALLFEGRDAAGKGGTIKRFRENLNPRGARVVALSKPTEAERSQWYFQRYIQHLPSAGELVFYDRSWYNRGVVEH
VFGWCDEEQRERFFRQVMPFEHDLVDDGIHLFKFWLNVGRAEQLRRFHDRERDPLKQWKLSPVDIAGLDKWEAYTTAISQ
TLTRSHSDRAPWTVIRSDDKKRARLAAIRTVLSGIDYDNKDRAAVGQPDAAICGGPDIWDA
>A9CKA8 2.7.4.-~~~~~~ADP-polyphosphate phosphotransferase~~~COG2326
MGEEKKKRTVEITIGGKLRSFDIDDPVLPDWVEEKKLSAGNFPYDKKMKREDYDATLEALQVELVKVQFWLQATGKRVMA
VFEGRDAAGKGGAIFATHAYLNPRYARVVALTKPTETERGQWYFQRYISHFPTAGEFVLFDRSWYNRAGVEPVMGFCTPD
EHKRFLKETPRLEKMLVHDDIHLFKFWLDIGRETQIERFHDRRQSPLKCWKLSDMDIAALTKWDDYTQKRDEMLEKTHTD
AAPWTVVRANDKRRARVNLIRHILLALDYEGKDRQAIGEIDDKILGSGPDFLK
>A0QQV6 2.7.4.-~~~~~~GDP-polyphosphate phosphotransferase~~~COG2326
MLDSTGYAVRDDDDDDPELLLPGGEVVDTWREGYPYDERMHRADYEEQKRLLQIELLKLQKWSQAHGHRHVIVFEGRDAA
GKGGTIKRFMEHLNPRGARVVALEKPTERERTQWYFQRYVEHLPAAGELVLFDRSWYNRAGVERVMGYCTPKQHAEFIRQ
APLFEQMLVNDGISLTKLWFSVTRSEQLTRFTIRQVDPVRQWKLSPTDLASLDKWDDYTAAKEEMFAWTDTEIAPWTVVK
SNDKKRARINAMRYVLGKFDYDNKDHEVVGQADPLIVGRALSD
>O05877 2.7.4.-~~~ppk2~~~GDP-polyphosphate phosphotransferase~~~COG2326
MDIPSVDVSTATNDGASSRAKGHRSAAPGRRKISDAVYQAELFRLQTEFVKLQEWARHSGARLVVIFEGRDGAGKGGAIK
RITEYLNPRVARIAALPAPTDRERGQWYYQRYIAHLPAKGEIVLFDRSWYNRAGVEKVMGFCTPQEYVLFLRQTPIFEQM
LIDDGILLRKYWFSVSDAEQLRRFKARRNDPVRQWKLSPMDLESVYRWEDYSRAKDEMMVHTDTPVSPWYVVESDIKKHA
RLNMMAHLLSTIDYADVEKPKVKLPPRPLVSGNYRRPPRELSTYVDDYVATLIAR
>Q6N140 2.7.4.-~~~~~~ADP-polyphosphate phosphotransferase~~~COG2326
MKIKTKQFRVGEGEKVDLGKWPTKVDPFYESKEHYHELLRTQVERLSDLQQLLYASNRHAVLLIFQAMDAAGKDGVIRHV
LSGINPQGCQVFSFKHPSATELQHDFLWRTTRDLPERGRIGVFNRSYYEEVLIVRVHPDILQSEAVPNGENFGKSFWHKR
YRSIRNLEQHLHANGTRIVKFFLHLSKDEQRKRFLARIDEPEKNWKFSAADLEERQYWDDYMDAYEKCLSETSSEDSPWY
AVPADDKENARLIVSQVIAETMESLKMSYPETTPARRKELLQMRQQLLK
>Q83XD3 2.7.4.33~~~pap~~~Polyphosphate:AMP phosphotransferase~~~
MDTETIASAVLNEEQLSLDLIEAQYALMNTRDQSNAKSLVILVSGIELAGKGEAVKQLREWVDPRFLYVKADPPHLFNLK
QPFWQPYTRFVPAEGQIMVWFGNWYGDLLATAMHASKPLDDTLFDEYVSNMRAFEQDLKNNNVDVLKVWFDLSWKSLQKR
LDDMDPSEVHWHKLHGLDWRNKKQYDTLQKLRTRFTDDWQIIDGEDEDLRNHNFAQAILTALRHCPEHEKKAALKWQQAP
IPDILTQFEVPQAEDANYKSELKKLTKQVADAMRCDDRKVVIAFEGMDAAGKGGAIKRIVKKLDPREYEIHTIAAPEKYE
LRRPYLWRFWSKLQSDDITIFDRTWYGRVLVERVEGFATEVEWQRAYAEINRFEKNLSSSQTVLIKFWLAIDKDEQAARF
KARESTPHKRFKITEEDWRNRDKWDDYLKAAADMFAHTDTSYAPWYIISTNDKQQARIEVLRAILKQLKADRDTD
>Q9HYF1 2.7.4.33~~~~~~Polyphosphate:AMP phosphotransferase~~~
MFESAEVGHSIDKDTYEKAVIELREALLEAQFELKQQARFPVIILINGIEGAGKGETVKLLNEWMDPRLIEVQSFLRPSD
EELERPPQWRFWRRLPPKGRTGIFFGNWYSQMLYARVEGHIKEAKLDQAIDAAERFERMLCDEGALLFKFWFHLSKKQLK
ERLKALEKDPQHSWKLSPLDWKQSEVYDRFVHYGERVLRRTSRDYAPWYVVEGADERYRALTVGRILLEGLQAALATKER
AKRQPHAAPLVSSLDNRGLLDSLDLGQYLDKDAYKEQLAAEQARLAGLIRDKRFRQHSLVAVFEGNDAAGKGGAIRRVTD
ALDPRQYHIVPIAAPTEEERAQPYLWRFWRHIPARRQFTIFDRSWYGRVLVERIEGFCAPADWLRAYGEINDFEEQLSEY
GIIVVKFWLAIDKQTQMERFKEREKTPYKRYKITEEDWRNRDKWDQYVDAVGDMVDRTSTEIAPWTLVEANDKRFARVKV
LRTINDAIEAAYKKDK
>Q886D9 2.7.4.33~~~~~~Polyphosphate:AMP phosphotransferase~~~COG2326
MFESAEIGHAIDDDTYEAALPSLREALLEAQIDLHEQAKRQIIVLINGIEGAGKGETVKLLSEWMDPRLIEVRTFDQQTD
EELAHPPVWRYWRQLPAKGRMGIFFGNWYSQMLQGRVHGQYKDAVLDQAISGAERLEKMLCDEGALIFKFWFHLSKKQMK
LRLKTLKDDPLHSWRISPLDWQQSKTYDKFVRFGERVLRRTSRDYAPWHVIEGVDANYRSLTVGRLLLEGMQAALNKVEP
ESSALTIGPLAIHNNERTLLDSLDLSLHLSKEDYQHELIAEQARLSGNLRDKRMKSHALVAVFEGNDAAGKGGAIRRVAA
ALDPRQYAIVPIAAPTQDERAQPYLWRFWRQIPARGKFTIFDRSWYGRVLVERVEGFCSESDWKRAYAEINDFEEQLTEA
GVVVVKFWLAIDEQTQLERFQEREKIPFKRYKITEDDWRNRKKWPDYRQAVGDMVDRTSTEIAPWTLIEANDKRWARVKV
LRTINEALEKAFARDKKK
>M9XB82 2.7.4.-~~~~~~AMP/ADP-polyphosphate phosphotransferase~~~COG2326
MKKYRVQPDGRFELKRFDPDDTSAFEGGKQAALEALAVLNRRLEKLQELLYAEGQHKVLVVLQAMDAGGKDGTIRVVFDG
VNPSGVRVASFGVPTEQELARDYLWRVHQQVPRKGELVIFNRSHYEDVLVVRVKNLVPQQVWQKRYRHIREFERMLADEG
TTILKFFLHISKDEQRQRLQERLDNPEKRWKFRMGDLEDRRLWDRYQEAYEAAIRETSTEYAPWYVIPANKNWYRNWLVS
HILVETLEGLAMQYPQPETASEKIVIE
>P37562 2.7.11.1~~~yabT~~~Probable serine/threonine-protein kinase YabT~~~COG0515
MMNDALTSLACSLKPGTTIKGKWNGNTYTLRKQLGKGANGIVYLAETSDGHVALKVSDDSLSITSEVNVLKSFSKAQSVT
MGPSFFDTDDAYIPSANTKVSFYAMEYIKGPLLLKYVSDKGAEWIPVLMIQLLSSLSVLHQQGWIFGDLKPDNLIVTGPP
ARIRCIDVGGTTKEGRAIKEYTEFYDRGYWGYGTRKAEPSYDLFAVAMIMINSVHKKEFKKTNQPKEQLRSLIEGNPLLQ
KYKKALFSALNGDYQSADEMKKDMLDAGQKAAQRKQPIKASPQPATRQRQQKPRQGKITKTRYTPKQKPAKSGGLFETTL
IVISVLALYFAYIIFFLI
>A0A0H3MBJ2 2.7.11.1~~~pkn1~~~Serine/threonine-protein kinase Pkn1~~~
MEERAAVEYWGDYKVIAELGHGLWSRDVLAEHRFIKKRYILKILPSELSSSENFMRVFQEVIVQLAAIRHASLVAIENVS
REGDRYFVVTEENGGTISLAQYLSGRKLSEEEVVHLIQQLCDALELVHSIGLAHGQIHLHSVHVSFFNGIANIYLPEVGF
ASLLRERMFSTIMQSGSARESITRIRDLLMFEAPEEQEVFGREADVYSVGVLAYYLLVGSFPWGSFPKPSLCMPDSWYDW
DGFILSCLQQQREARPKCLREALRRKTSGEQLQVTLDSCREPLREMEIEDTPTELGPPSALIREGERLCEVKEEQHAFVL
VEAKSIDEAMVTTVDSEEELESSEGYANPLQSLLAREPVVSRYVEVEREEIKPQPLLTEMIFIEGGEFSRGSGDGQRDEL
PVHNITLPGFFLDIHPVTNEQFVRFLECVGSEQDEHYNELIRLKDSRIQRRSGRLIIEPGYAKHPVVGVTWYGASSYACW
IGKRLPSEAEWEVAASGGKLGLRYPTGEEIDKSKANFFSSDTTPVMSYPSSILGLYDMAGNVYEWCQDWYSYDFYESSAL
EPDAPLGPPQGVYRVLRGGCWKSLKDDLRCAHRHRNNPGAINSTYGFRCAKDVK
>P33973 2.7.11.1~~~pkn1~~~Serine/threonine-protein kinase Pkn1~~~
MPEVSSGGGCGACGRRHGADASCPTLVRADVRAGGTAHPRCAPVVEAQDPLVGVRCGSFRLVRRLGRGGMGAVYLGEHVS
IGSRVAVKVLHAHLTMYPELVQRFHAEARAVNLIGHENIVSIFDMDATPPRPYLIMEFLDGAPLSAWVGTPLAAGAVVSV
LSQVCDALQAAHARGIVHRDLKPDNIFLVRRNGNAPFVKVLDFGIAKLADAHMPQTHAGIIVGTPEYMAPEQSLGRGVDG
RADLYALGVIAYQLLTGRLPFNDEGLAAQLVAHQLRPPPPPSSVYPAVSAALEHVILRALAKKPEDRYASIAAFRNALQV
ALAEHVRVSARKTRPGGLAVLERAPVAPDMPTEGQSRGRLGVDARAGHVPSSLASTSQRRLAPAAPAVPRASLVEVPVQV
VLRPGESPVRLRGSGLSRGGLFLHGGRVLPPLCSRLPVVLELASGPLSVMCEVVRVVPPAQARVWGMPTGFGVQFVEATA
VLKAAVDALLQGEPVRAVPQVPLTEDPAVARLLEAWRQRSAGDAYAVLALEPDSDMGTVRLRTREAWRSLESLEQHSLTP
PQRAQVDALRVRVREAAEALGATVQRALYDAWRGNHRGVAKCLEAGLTAEQLESLRREFLARRPQAMGTARSHFQSGGAL
ERDGQLSQALDQYERGLKLAPLEVDMLQRYRRLRRVLGGRATAPTGHDRARSP
>P54737 2.7.11.1~~~pkn5~~~Serine/threonine-protein kinase pkn5~~~
MPLKVIGPYRVLETLGSGGAGTVYRALDRRTTDEVALKLLSAGPARDARAARRLAREFDTLVDLSHPNVVKVFESGVHQG
VPYLAMELIEGLTLRHYLDLSSGDRQTPPGSHTPRSPLSVLRTADDDFGPLSRSFSDSMDDSEDSPFDGTFGLEAFAEEA
PSEDLESFASSASPHVGIGSDDSLEGFDLPPPMPRPAEPEEEPGRVVREEDLNRPERMGRLKDAMLQICEALAYIHGHGL
VHRDLKPSNIMVDDDRQVRLMDFGLAKFLADDAAITEAGKLVGTYRYMAPEQILGEPLDGRADLYSLGVILYELLSGRPP
FDAKTPHELWRQVLETEPPPVLALNLHGDPQLARVAHRLIRKEPDDRFQTAEEVYEALSE
>P54738 2.7.11.1~~~pkn6~~~Serine/threonine-protein kinase pkn6~~~
MQIGKYQLVRKLASGGMAEVFLAKAAGPRGFEKTLVLKRILPHLAEDAAFVEMFLGEARLAAQLEHPNIVQIFDFGEAEG
SFFLAMEFIDGPNLRKLVKRAAEEALPPAFCAKVVAAAAEGLAYAHEFRDVETGEPLGLIHRDVSPDNILVSRQGAVKVV
DFGIAKVAGQGHRTLTGVVKGKVAYMPPEQLQAKAMDRRVDVYALGVVLYELLTGKRPFDATTDVSVMQAILFESFIPVS
ARRPDVPVALQQVLDKALAKDRERRYADCRALQDDLERFVLSTGEPVGAYQIAQRIAQWVPEVAAAPAMTPSQGGSKGAV
ASQAKADARSASMVSPPVDSTSPTTPMPRSLVAPVEVPADSTSPTTPMPVAIGGVVQALEPRSSPQQDTLQSYPVVVKTP
ALRADASARGASRPRAQSRASGVKVQAPQARDEDVVAMAAASSPPSGGASPAPTTPEDADDAVHTRSTEYAATVPSGRPG
GRIAGIVGAVVALLVGGAVTVMRGDDSEVSPVRVNPPPLTHLPREPAVPSQGGRNVPQEKPAANVNAGARDSNDGAQAKP
QVSTDVSVVPQEPHVARDATTPEAGLPTVKDAPPENGGAASKEGSAVVAKREPAKASGDPEPNPSRVRERAPTQKVAAVA
KGRLEFRIRPYAVVSLDGKVLGQTPFAAVEVPEGRHTVRLVNKELGKDVTRTVDVKAGQATVFKLNLEAE
>A5TY85 2.7.11.1~~~pknA~~~Serine/threonine-protein kinase PknA~~~COG0515
MSPRVGVTLSGRYRLQRLIATGGMGQVWEAVDNRLGRRVAVKVLKSEFSSDPEFIERFRAEARTTAMLNHPGIASVHDYG
ESQMNGEGRTAYLVMELVNGEPLNSVLKRTGRLSLRHALDMLEQTGRALQIAHAAGLVHRDVKPGNILITPTGQVKITDF
GIAKAVDAAPVTQTGMVMGTAQYIAPEQALGHDASPASDVYSLGVVGYEAVSGKRPFAGDGALTVAMKHIKEPPPPLPPD
LPPNVRELIEITLVKNPAMRYRSGGPFADAVAAVRAGRRPPRPSQTPPPGRAAPAAIPSGTTARVAANSAGRTAASRRSR
PATGGHRPPRRTFSSGQRALLWAAGVLGALAIIIAVLLVIKAPGDNSPQQAPTPTVTTTGNPPASNTGGTDASPRLNWTE
RGETRHSGLQSWVVPPTPHSRASLARYEIAQ
>P9WI83 2.7.11.1~~~pknA~~~Serine/threonine-protein kinase PknA~~~COG0515
MSPRVGVTLSGRYRLQRLIATGGMGQVWEAVDNRLGRRVAVKVLKSEFSSDPEFIERFRAEARTTAMLNHPGIASVHDYG
ESQMNGEGRTAYLVMELVNGEPLNSVLKRTGRLSLRHALDMLEQTGRALQIAHAAGLVHRDVKPGNILITPTGQVKITDF
GIAKAVDAAPVTQTGMVMGTAQYIAPEQALGHDASPASDVYSLGVVGYEAVSGKRPFAGDGALTVAMKHIKEPPPPLPPD
LPPNVRELIEITLVKNPAMRYRSGGPFADAVAAVRAGRRPPRPSQTPPPGRAAPAAIPSGTTARVAANSAGRTAASRRSR
PATGGHRPPRRTFSSGQRALLWAAGVLGALAIIIAVLLVIKAPGDNSPQQAPTPTVTTTGNPPASNTGGTDASPRLNWTE
RGETRHSGLQSWVVPPTPHSRASLARYEIAQ
>A0QNG1 2.7.11.1~~~pknB~~~Serine/threonine-protein kinase PknB~~~COG0515
MTTPQHLSDRYELGEILGFGGMSEVHLARDLRLHRDVAVKVLRADLARDPSFYLRFRREAQNAAALNHPAIVAVYDTGEA
ETPNGPLPYIVMEYVDGVTLRDIVHTDGPIAPRRAIEIIADACQALNFSHQHGIIHRDVKPANIMISKNNAVKVMDFGIA
RALADTGNSVTQTAAVIGTAQYLSPEQARGETVDARSDVYSLGCVLYEILTGEPPFIGDSPVAVAYQHVREDPVPPSRRH
ADVTPELDAVVLKALAKNPDNRYQTAAEMRADLIRVHEGQAPDAPKVLTDAERTSMLAAPPADRAGAATQDMPVPRPAGY
SKQRSTSVARWLIAVAVLAVLTVVVTVAINMVGGNPRNVQVPDVAEQSADDAQAALQNRGFKTVIDRQPDNEVPPGLVIG
TDPEAGSELGAGEQVTINVSTGPEQALVPDVAGLTPTQARQKLKDAGFEKFRESPSPSTPEQKGRVLATNPQANQTAAII
NEITIVVGAGPEDAPVLSCAGQNAESCKAILAAGGFTNTVVVEVDNPAAAGQVVGTEPADGQSVPKDTVIQIRVSKGNQF
VMPDLVGQFWSDAYPRLTALGWTGVLDKGPDVRDSGQRTNAVVTQSPSAGTPVNKDAKITLSFAA
>A5TY84 2.7.11.1~~~pknB~~~Serine/threonine-protein kinase PknB~~~COG0515
MTTPSHLSDRYELGEILGFGGMSEVHLARDLRLHRDVAVKVLRADLARDPSFYLRFRREAQNAAALNHPAIVAVYDTGEA
ETPAGPLPYIVMEYVDGVTLRDIVHTEGPMTPKRAIEVIADACQALNFSHQNGIIHRDVKPANIMISATNAVKVMDFGIA
RAIADSGNSVTQTAAVIGTAQYLSPEQARGDSVDARSDVYSLGCVLYEVLTGEPPFTGDSPVSVAYQHVREDPIPPSARH
EGLSADLDAVVLKALAKNPENRYQTAAEMRADLVRVHNGEPPEAPKVLTDAERTSLLSSAAGNLSGPRTDPLPRQDLDDT
DRDRSIGSVGRWVAVVAVLAVLTVVVTIAINTFGGITRDVQVPDVRGQSSADAIATLQNRGFKIRTLQKPDSTIPPDHVI
GTDPAANTSVSAGDEITVNVSTGPEQREIPDVSTLTYAEAVKKLTAAGFGRFKQANSPSTPELVGKVIGTNPPANQTSAI
TNVVIIIVGSGPATKDIPDVAGQTVDVAQKNLNVYGFTKFSQASVDSPRPAGEVTGTNPPAGTTVPVDSVIELQVSKGNQ
FVMPDLSGMFWVDAEPRLRALGWTGMLDKGADVDAGGSQHNRVVYQNPPAGTGVNRDGIITLRFGQ
>P9WI81 2.7.11.1~~~pknB~~~Serine/threonine-protein kinase PknB~~~COG0515
MTTPSHLSDRYELGEILGFGGMSEVHLARDLRLHRDVAVKVLRADLARDPSFYLRFRREAQNAAALNHPAIVAVYDTGEA
ETPAGPLPYIVMEYVDGVTLRDIVHTEGPMTPKRAIEVIADACQALNFSHQNGIIHRDVKPANIMISATNAVKVMDFGIA
RAIADSGNSVTQTAAVIGTAQYLSPEQARGDSVDARSDVYSLGCVLYEVLTGEPPFTGDSPVSVAYQHVREDPIPPSARH
EGLSADLDAVVLKALAKNPENRYQTAAEMRADLVRVHNGEPPEAPKVLTDAERTSLLSSAAGNLSGPRTDPLPRQDLDDT
DRDRSIGSVGRWVAVVAVLAVLTVVVTIAINTFGGITRDVQVPDVRGQSSADAIATLQNRGFKIRTLQKPDSTIPPDHVI
GTDPAANTSVSAGDEITVNVSTGPEQREIPDVSTLTYAEAVKKLTAAGFGRFKQANSPSTPELVGKVIGTNPPANQTSAI
TNVVIIIVGSGPATKDIPDVAGQTVDVAQKNLNVYGFTKFSQASVDSPRPAGEVTGTNPPAGTTVPVDSVIELQVSKGNQ
FVMPDLSGMFWVDAEPRLRALGWTGMLDKGADVDAGGSQHNRVVYQNPPAGTGVNRDGIITLRFGQ
>P0DPS9 2.7.11.1~~~pknD~~~Serine/threonine-protein kinase PknD~~~
MQRYELIRLIGKGGMGEVYLAHDKACSRRVALKRIREDLSGNALLRKRFLREAKIAADLIHPGIVPVYSICSDGEAVYYT
MPYIEGFSLKSLLKSVWQKEVLSKELEEKTSVKSFLPIFDKICATVEYIHSKGVLHRDLKPDNILLGLFGEVVIVDWGAA
IFKHAKELKLEQDDEAAVSFDERNICYSSMTIPGKIVGTPDYMAPESLLGVEASEKTDIYALGLILYQMLTLAFPYRRKK
GRKLSYRDVVLPPIEMSPYREIPPSLSQIAMKAIAINPADRFSSIQELRQALQPYLQGDPEWTVKATLMAKEKSCWKYYD
PILLSRYFPVLASSPAQWYNFMLSEVEISASTRVEYTVTKSAVHEGMGILFLPSKEAERGEFYCGYGLWFSVQNHELTVS
LIKNGIEIQKKSQEMISQQYRFAILIEKSDNRIAVFVEQALFILHIDYLPSLGNRLGVIIQDLQGMSNIAISESIGALRV
SCLAVPDAFLSEKLYDQAAIFYRKIRDSFPGRKESYEAQFRLGVTLLTQIEEQGGDLTQALSSFDYLHGGAGAPLEYLGK
ALVYQRNGSFVEEIRCLLFALKRYSQHPEIPRLEDHLCFRLYDSLHKHRSEALVFMLLILWIAPEKISVREEKRFLRIIY
HKQQATLFCQVDKAPLQFRSSKMELFLSFWTGFSLFLPELFRRAGELRDYQALADIFYVAGVSGNREAFMQFSTALANVS
DEITFPESLHNQKVAELMFFVKGVEALRNKDYQKAKKLLWKTPFTLQLYALDIFHIQAFLDEEIESFIDLLQAIYDPASE
EERDHILVYIIQTHLWNRDLERAYKLLNDRFPLDEELAEYSEAFILWGCYLALTGDRVAVKAHFSRCRYKYGKSALIGKC
VDGDIFDYLDNLVWWEKKMTLFQSYFLLRCLNESPRRYEKYRQAYLSMENNFFD
>P9WI78 2.7.11.1~~~pknD~~~Serine/threonine-protein kinase PknD~~~
MSDAVPQVGSQFGPYQLLRLLGRGGMGEVYEAEDTRKHRVVALKLISPQYSDNAVFRARMQREADTAGRLTEPHIVPIHD
YGEINGQFFVEMRMIDGTSLRALLKQYGPLTPARAVAIVRQIAAALDAAHANGVTHRDVKPENILVTASDFAYLVDFGIA
RAASDPGLTQTGTAVGTYNYMAPERFTGDEVTYRADIYALACVLGECLTGAPPYRADSVERLIAAHLMDPAPQPSQLRPG
RVPPALDQVIAKGMAKNPAERFMSAGDLAIAAHDALTTSEQHQATTILRRGDNATLLATPADTGLSQSESGIAGAGTGPP
TPGAARWSPGDSATVAGPLAADSRGGNWPSQTGHSPAVPNALQASLGHAVPPAGNKRKVWAVVGAAAIVLVAIVAAAGYL
VLRPSWSPTQASGQTVLPFTGIDFRLSPSGVAVDSAGNVYVTSEGMYGRVVKLATGSTGTTVLPFNGLYQPQGLAVDGAG
TVYVTDFNNRVVTLAAGSNNQTVLPFDGLNYPEGLAVDTQGAVYVADRGNNRVVKLAAGSKTQTVLPFTGLNDPDGVAVD
NSGNVYVTDTDNNRVVKLEAESNNQVVLPFTDITAPWGIAVDEAGTVYVTEHNTNQVVKLLAGSTTSTVLPFTGLNTPLA
VAVDSDRTVYVADRGNDRVVKLTS
>P9WI79 2.7.11.1~~~pknD~~~Serine/threonine-protein kinase PknD~~~COG0515
MSDAVPQVGSQFGPYQLLRLLGRGGMGEVYEAEDTRKHRVVALKLISPQYSDNAVFRARMQREADTAGRLTEPHIVPIHD
YGEINGQFFVEMRMIDGTSLRALLKQYGPLTPARAVAIVRQIAAALDAAHANGVTHRDVKPENILVTASDFAYLVDFGIA
RAASDPGLTQTGTAVGTYNYMAPERFTGDEVTYRADIYALACVLGECLTGAPPYRADSVERLIAAHLMDPAPQPSQLRPG
RVPPALDQVIAKGMAKNPAERFMSAGDLAIAAHDALTTSEQHQATTILRRGDNATLLATPADTGLSQSESGIAGAGTGPP
TPGAARWSPGDSATVAGPLAADSRGGNWPSQTGHSPAVPNALQASLGHAVPPAGNKRKVWAVVGAAAIVLVAIVAAAGYL
VLRPSWSPTQASGQTVLPFTGIDFRLSPSGVAVDSAGNVYVTSEGMYGRVVKLATGSTGTTVLPFNGLYQPQGLAVDGAG
TVYVTDFNNRVVTLAAGSNNQTVLPFDGLNYPEGLAVDTQGAVYVADRGNNRVVKLAAGSKTQTVLPFTGLNDPDGVAVD
NSGNVYVTDTDNNRVVKLEAESNNQVVLPFTDITAPWGIAVDEAGTVYVTEHNTNQVVKLLAGSTTSTVLPFTGLNTPLA
VAVDSDRTVYVADRGNDRVVKLTS
>A5U3A3 2.7.11.1~~~pknE~~~Serine/threonine-protein kinase PknE~~~COG0515
MDGTAESREGTQFGPYRLRRLVGRGGMGDVYEAEDTVRERIVALKLMSETLSSDPVFRTRMQREARTAGRLQEPHVVPIH
DFGEIDGQLYVDMRLINGVDLAAMLRRQGPLAPPRAVAIVRQIGSALDAAHAAGATHRDVKPENILVSADDFAYLVDFGI
ASATTDEKLTQLGNTVGTLYYMAPERFSESHATYRADIYALTCVLYECLTGSPPYQGDQLSVMGAHINQAIPRPSTVRPG
IPVAFDAVIARGMAKNPEDRYVTCGDLSAAAHAALATADQDRATDILRRSQVAKLPVPSTHPVSPGTRWPQPTPWAGGAP
PWGPPSSPLPRSARQPWLWVGVAVAVVVALAGGLGIALAHPWRSSGPRTSAPPPPPPADAVELRVLNDGVFVGSSVAPTT
IDIFNEPICPPCGSFIRSYASDIDTAVADKQLAVRYHLLNFLDDQSHSKNYSTRAVAASYCVAGQNDPKLYASFYSALFG
SDFQPQENAASDRTDAELAHLAQTVGAEPTAISCIKSGADLGTAQTKATNASETLAGFNASGTPFVWDGSMVVNYQDPSW
LARLIG
>P9WI77 2.7.11.1~~~pknE~~~Serine/threonine-protein kinase PknE~~~COG0515
MDGTAESREGTQFGPYRLRRLVGRGGMGDVYEAEDTVRERIVALKLMSETLSSDPVFRTRMQREARTAGRLQEPHVVPIH
DFGEIDGQLYVDMRLINGVDLAAMLRRQGPLAPPRAVAIVRQIGSALDAAHAAGATHRDVKPENILVSADDFAYLVDFGI
ASATTDEKLTQLGNTVGTLYYMAPERFSESHATYRADIYALTCVLYECLTGSPPYQGDQLSVMGAHINQAIPRPSTVRPG
IPVAFDAVIARGMAKNPEDRYVTCGDLSAAAHAALATADQDRATDILRRSQVAKLPVPSTHPVSPGTRWPQPTPWAGGAP
PWGPPSSPLPRSARQPWLWVGVAVAVVVALAGGLGIALAHPWRSSGPRTSAPPPPPPADAVELRVLNDGVFVGSSVAPTT
IDIFNEPICPPCGSFIRSYASDIDTAVADKQLAVRYHLLNFLDDQSHSKNYSTRAVAASYCVAGQNDPKLYASFYSALFG
SDFQPQENAASDRTDAELAHLAQTVGAEPTAISCIKSGADLGTAQTKATNASETLAGFNASGTPFVWDGSMVVNYQDPSW
LARLIG
>A5U3A6 2.7.11.1~~~pknF~~~Serine/threonine-protein kinase PknF~~~COG0515
MPLAEGSTFAGFTIVRQLGSGGMGEVYLARHPRLPRQDALKVLRADVSADGEYRARFNREADAAASLWHPHIVAVHDRGE
FDGQLWIDMDFVDGTDTVSLLRDRYPNGMPGPEVTEIITAVAEALDYAHERRLLHRDVKPANILIANPDSPDRRIMLADF
GIAGWVDDPSGLTATNMTVGTVSYAAPEQLMGNELDGRADQYALAATAFHLLTGSPPFQHANPAVVISQHLSASPPAIGD
RVPELTPLDPVFAKALAKQPKDRYQRCVDFARALGHRLGGAGDPDDTRVSQPVAVAAPAKRSLLRTAVIVPAVLAMLLVM
AVAVAVREFQRADDERAAQPARTRTTTSAGTTTSVAPASTTRPAPTTPTTTGAADTATASPTAAVVAIGALCFPLGSTGT
TKTGATAYCSTLQGTNTTIWSLTEDTVASPTVTATADPTEAPLPIEQESPIRVCMQQTGQTRRECREEIRRSNGWP
>P9WI75 2.7.11.1~~~pknF~~~Serine/threonine-protein kinase PknF~~~COG0515
MPLAEGSTFAGFTIVRQLGSGGMGEVYLARHPRLPRQDALKVLRADVSADGEYRARFNREADAAASLWHPHIVAVHDRGE
FDGQLWIDMDFVDGTDTVSLLRDRYPNGMPGPEVTEIITAVAEALDYAHERRLLHRDVKPANILIANPDSPDRRIMLADF
GIAGWVDDPSGLTATNMTVGTVSYAAPEQLMGNELDGRADQYALAATAFHLLTGSPPFQHANPAVVISQHLSASPPAIGD
RVPELTPLDPVFAKALAKQPKDRYQRCVDFARALGHRLGGAGDPDDTRVSQPVAVAAPAKRSLLRTAVIVPAVLAMLLVM
AVAVAVREFQRADDERAAQPARTRTTTSAGTTTSVAPASTTRPAPTTPTTTGAADTATASPTAAVVAIGALCFPLGSTGT
TKTGATAYCSTLQGTNTTIWSLTEDTVASPTVTATADPTEAPLPIEQESPIRVCMQQTGQTRRECREEIRRSNGWP
>A0QQK3 2.7.11.1~~~pknG~~~Serine/threonine-protein kinase PknG~~~COG0515
MTSPENPDLPDADDAYVDSGPGTQPASLEDLDMDSASTMRPMATQAVYRPEFDDTDGTSRGTVVTEAYDQVTMATRALSP
MRRLGGGLVEIPRVPERDPLTALMTNPVVAESKRFCWNCGKPVGRSTPDGRALSEGWCPHCGSPYSFLPQLSPGDIVADQ
YEIKGCIAHGGLGWVYLAFDKNVNDRPVVLKGLVHSGDAEAQAIAMAERQFLAEVTHPGIVKIYNFVEHEDKHGNPVGYI
VMEYVGGTSLKQARGAKLPVAEAIGYMLEILPALGYLHSIGLAYNDLKPENIMITEEQLKLIDLGAVSRLNSYGYLYGTP
GYQAPEIVRTGPTVATDIYTVGRTLAALTLSLRTRRGRYVDGLPSDDPVLETYDSYHRLLRRAIDPDPRRRFTSAEEMSS
QLLGVLREVVATDTGVPRPGLSTVFSPSRSTFGVDLLVAHTDVYVDGQVHSEKLTAQEIVRALPVPLVDRTDVGAPMLVA
SVLSEPVHTLDQLRAARHGALDTEGIDLNESVELPLMEVRALLDLGDVAKATRKLEDLAARVGWRWRLVWFKAVSEMLSA
DYDSATKHFTEVLDTLPGELAPKLALAATAELAGTADELKFYKTVWSTDNGVISAGFGLARAQSVAGERDMAVQTLDEVP
PTSRHFTTARLTSAVTLLSGRSTSEITEQHIRDAARRVEALPDSEPRVLQIRALVLGTALDWLADNTASSNHILGFPFTE
HGLKLGVEASLRALARIAPTQSHRYALVDLANSVRPMSTF
>P9WI73 2.7.11.1~~~pknG~~~Serine/threonine-protein kinase PknG~~~COG0515
MAKASETERSGPGTQPADAQTATSATVRPLSTQAVFRPDFGDEDNFPHPTLGPDTEPQDRMATTSRVRPPVRRLGGGLVE
IPRAPDIDPLEALMTNPVVPESKRFCWNCGRPVGRSDSETKGASEGWCPYCGSPYSFLPQLNPGDIVAGQYEVKGCIAHG
GLGWIYLALDRNVNGRPVVLKGLVHSGDAEAQAMAMAERQFLAEVVHPSIVQIFNFVEHTDRHGDPVGYIVMEYVGGQSL
KRSKGQKLPVAEAIAYLLEILPALSYLHSIGLVYNDLKPENIMLTEEQLKLIDLGAVSRINSFGYLYGTPGFQAPEIVRT
GPTVATDIYTVGRTLAALTLDLPTRNGRYVDGLPEDDPVLKTYDSYGRLLRRAIDPDPRQRFTTAEEMSAQLTGVLREVV
AQDTGVPRPGLSTIFSPSRSTFGVDLLVAHTDVYLDGQVHAEKLTANEIVTALSVPLVDPTDVAASVLQATVLSQPVQTL
DSLRAARHGALDADGVDFSESVELPLMEVRALLDLGDVAKATRKLDDLAERVGWRWRLVWYRAVAELLTGDYDSATKHFT
EVLDTFPGELAPKLALAATAELAGNTDEHKFYQTVWSTNDGVISAAFGLARARSAEGDRVGAVRTLDEVPPTSRHFTTAR
LTSAVTLLSGRSTSEVTEEQIRDAARRVEALPPTEPRVLQIRALVLGGALDWLKDNKASTNHILGFPFTSHGLRLGVEAS
LRSLARVAPTQRHRYTLVDMANKVRPTSTF
>P9WI71 2.7.11.1~~~pknH~~~Serine/threonine-protein kinase PknH~~~COG0515
MSDAQDSRVGSMFGPYHLKRLLGRGGMGEVYEAEHTVKEWTVAVKLMTAEFSKDPVFRERMKREARIAGRLQEPHVVPIH
DYGEVDGQMFLEMRLVEGTDLDSVLKRFGPLTPPRAVAIITQIASALDAAHADGVMHRDVKPQNILITRDDFAYLVDFGI
ASATTDEKLTQLGTAVGTWKYMAPERFSNDEVTYRADIYALACVLHECLTGAPPYRADSAGTLVSSHLMGPIPQPSAIRP
GIPKAFDAVVARGMAKKPEDRYASAGDLALAAHEALSDPDQDHAADILRRSQESTLPAPPKPVPPPTMPATAMAPRQPPA
PPVTPPGVQPAPKPSYTPPAQPGPAGQRPGPTGQPSWAPNSGPMPASGPTPTPQYYQGGGWGAPPSGGPSPWAQTPRKTN
PWPLVAGAAAVVLVLVLGAIGIWIAIRPKPVQPPQPVAEERLSALLLNSSEVNAVMGSSSMQPGKPITSMDSSPVTVSLP
DCQGALYTSQDPVYAGTGYTAINGLISSEPGDNYEHWVNQAVVAFPTADKARAFVQTSADKWKNCAGKTVTVTNKAKTYR
WTFADVKGSPPTITVIDTQEGAEGWECQRAMSVANNVVVDVNACGYRITNQAGQIAAKIVDKVNKE
>P9WI69 2.7.11.1~~~pknI~~~Serine/threonine-protein kinase PknI~~~COG0515
MALASGVTFAGYTVVRMLGCSAMGEVYLVQHPGFPGWQALKVLSPAMAADDEFRRRFQRETEVAARLFHPHILEVHDRGE
FDGQLWIAMDYVDGIDATQHMADRFPAVLPVGEVLAIVTAVAGALDYAHQRGLLHRDVNPANVVLTSQSAGDQRILLADF
GIASQPSYPAPELSAGADVDGRADQYALALTAIHLFAGAPPVDRSHTGPLQPPKLSAFRPDLARLDGVLSRALATAPADR
FGSCREFADAMNEQAGVAIADQSSGGVDASEVTAAAGEEAYVVDYPAYGWPEAVDCKEPSARAPAPAAPTPQRRGSMLQS
AAGVLARRLDNFSTATKAPASPTRRRPRRILVGAVAVLLLAGLFAVGIVIGRKTNTTATEVARPPTSGSAVPSAPTTTVA
VTAPVPLDGTYRIEIQRSKQTYDYTPTPQPPDVNTWWAFRTSCTPTECLAAATMLDDNDHTQAKTPPVRPFLMQFGEGQW
KSRPETVQFPCVGPNGSPSTQATTQLLALRPQPQGDLVGEMVVTVHSNECGQQGAVIRIPAVASRSGDLPPAVTVPDPAT
IPDTPDTTSTATLTPPTTTAPGPGR
>P9WI66 2.7.11.1~~~pknJ~~~Serine/threonine-protein kinase PknJ~~~
MAHELSAGSVFAGYRIERMLGAGGMGTVYLARNPDLPRSEALKVLAAELSRDLDFRARFVREADVAAGLDHPNIVAVHQR
GQFEGRLWIAMQFVDGGNAEDALRAATMTTARAVYVIGEVAKALDYAHQQGVIHRDIKPANFLLSRAAGGDERVLLSDFG
IARALGDTGLTSTGSVLATLAYAAPEVLAGQGFDGRADLYSLGCALFRLLTGEAPFAAGAGAAVAVVAGHLHQPPPTVSD
RVPGLSAAMDAVIATAMAKDPMRRFTSAGEFAHAAAAALYGGATDGWVPPSPAPHVISQGAVPGSPWWQHPVGSVTALAT
PPGHGWPPGLPPLPRRPRRYRRGVAAVAAVMVVAAAAVTAVTMTSHQPRTATPPSAAALSPTSSSTTPPQPPIVTRSRLP
GLLPPLDDVKNFVGIQNLVAHEPMLQPQTPNGSINPAECWPAVGGGVPSAYDLGTVIGFYGLTIDEPPTGTAPNQVGQLI
VAFRDAATAQRHLADLASIWRRCGGRTVTLFRSEWRRPVELSTSVPEVVDGITTMVLTAQGPVLRVREDHAIAAKNNVLV
DVDIMTPDTSRGQQAVIGITNYILAKIPG
>P9WI67 2.7.11.1~~~pknJ~~~Serine/threonine-protein kinase PknJ~~~COG0515
MAHELSAGSVFAGYRIERMLGAGGMGTVYLARNPDLPRSEALKVLAAELSRDLDFRARFVREADVAAGLDHPNIVAVHQR
GQFEGRLWIAMQFVDGGNAEDALRAATMTTARAVYVIGEVAKALDYAHQQGVIHRDIKPANFLLSRAAGGDERVLLSDFG
IARALGDTGLTSTGSVLATLAYAAPEVLAGQGFDGRADLYSLGCALFRLLTGEAPFAAGAGAAVAVVAGHLHQPPPTVSD
RVPGLSAAMDAVIATAMAKDPMRRFTSAGEFAHAAAAALYGGATDGWVPPSPAPHVISQGAVPGSPWWQHPVGSVTALAT
PPGHGWPPGLPPLPRRPRRYRRGVAAVAAVMVVAAAAVTAVTMTSHQPRTATPPSAAALSPTSSSTTPPQPPIVTRSRLP
GLLPPLDDVKNFVGIQNLVAHEPMLQPQTPNGSINPAECWPAVGGGVPSAYDLGTVIGFYGLTIDEPPTGTAPNQVGQLI
VAFRDAATAQRHLADLASIWRRCGGRTVTLFRSEWRRPVELSTSVPEVVDGITTMVLTAQGPVLRVREDHAIAAKNNVLV
DVDIMTPDTSRGQQAVIGITNYILAKIPG
>P9WI65 2.7.11.1~~~pknK~~~Serine/threonine-protein kinase PknK~~~COG0515
MTDVDPHATRRDLVPNIPAELLEAGFDNVEEIGRGGFGVVYRCVQPSLDRAVAVKVLSTDLDRDNLERFLREQRAMGRLS
GHPHIVTVLQVGVLAGGRPFIVMPYHAKNSLETLIRRHGPLDWRETLSIGVKLAGALEAAHRVGTLHRDVKPGNILLTDY
GEPQLTDFGIARIAGGFETATGVIAGSPAFTAPEVLEGASPTPASDVYSLGATLFCALTGHAAYERRSGERVIAQFLRIT
SQPIPDLRKQGLPADVAAAIERAMARHPADRPATAADVGEELRDVQRRNGVSVDEMPLPVELGVERRRSPEAHAAHRHTG
GGTPTVPTPPTPATKYRPSVPTGSLVTRSRLTDILRAGGRRRLILIHAPSGFGKSTLAAQWREELSRDGAAVAWLTIDND
DNNEVWFLSHLLESIRRVRPTLAESLGHVLEEHGDDAGRYVLTSLIDEIHENDDRIAVVIDDWHRVSDSRTQAALGFLLD
NGCHHLQLIVTSWSRAGLPVGRLRIGDELAEIDSAALRFDTDEAAALLNDAGGLRLPRADVQALTTSTDGWAAALRLAAL
SLRGGGDATQLLRGLSGASDVIHEFLSENVLDTLEPELREFLLVASVTERTCGGLASALAGITNGRAMLEEAEHRGLFLQ
RTEDDPNWFRFHQMFADFLHRRLERGGSHRVAELHRRASAWFAENGYLHEAVDHALAAGDPARAVDLVEQDETNLPEQSK
MTTLLAIVQKLPTSMVVSRARLQLAIAWANILLQRPAPATGALNRFETALGRAELPEATQADLRAEADVLRAVAEVFADR
VERVDDLLAEAMSRPDTLPPRVPGTAGNTAALAAICRFEFAEVYPLLDWAAPYQEMMGPFGTVYAQCLRGMAARNRLDIV
AALQNFRTAFEVGTAVGAHSHAARLAGSLLAELLYETGDLAGAGRLMDESYLLGSEGGAVDYLAARYVIGARVKAAQGDH
EGAADRLSTGGDTAVQLGLPRLAARINNERIRLGIALPAAVAADLLAPRTIPRDNGIATMTAELDEDSAVRLLSAGDSAD
RDQACQRAGALAAAIDGTRRPLAALQAQILHIETLAATGRESDARNELAPVATKCAELGLSRLLVDAGLA
>P9WI63 2.7.11.1~~~pknL~~~Serine/threonine-protein kinase PknL~~~COG0515
MVEAGTRDPLESALLDSRYLVQAKIASGGTSTVYRGLDVRLDRPVALKVMDSRYAGDEQFLTRFRLEARAVARLNNRALV
AVYDQGKDGRHPFLVMELIEGGTLRELLIERGPMPPHAVVAVLRPVLGGLAAAHRAGLVHRDVKPENILISDDGDVKLAD
FGLVRAVAAASITSTGVILGTAAYLSPEQVRDGNADPRSDVYSVGVLVYELLTGHTPFTGDSALSIAYQRLDADVPRASA
VIDGVPPQFDELVACATARNPADRYADAIAMGADLEAIAEELALPEFRVPAPRNSAQHRSAALYRSRITQQGQLGAKPVH
HPTRQLTRQPGDCSEPASGSEPEHEPITGQFAGIAIEEFIWARQHARRMVLVWVSVVLAITGLVASAAWTIGSNLSGLL
>P9WPF5 2.3.1.-~~~~~~Polyketide synthase-like Pks10~~~COG3424
MSVIAGVFGALPPYRYSQRELTDSFVSIPDFEGYEDIVRQLHASAKVNSRHLVLPLEKYPKLTDFGEANKIFIEKAVDLG
VQALAGALDESGLRPEDLDVLITATVTGLAVPSLDARIAGRLGLRADVRRVPLFGLGCVAGAAGVARLHDYLRGAPDGVA
ALVSVELCSLTYPGYKPTLPGLVGSALFADGAAAVVAAGVKRAQDIGADGPDILDSRSHLYPDSLRTMGYDVGSAGFELV
LSRDLAAVVEQYLGNDVTTFLASHGLSTTDVGAWVTHPGGPKIINAITETLDLSPQALELTWRSLGEIGNLSSASVLHVL
RDTIAKPPPSGSPGLMIAMGPGFCSELVLLRWH
>P9WPF3 2.3.1.-~~~~~~Methyl-branched alkylpyrone synthesis polyketide synthase-like Pks11~~~COG3424
MSVIAGVFGALPPHRYSQSEITDSFVEFPGLKEHEEIIRRLHAAAKVNGRHLVLPLQQYPSLTDFGDANEIFIEKAVDLG
VEALLGALDDANLRPSDIDMIATATVTGVAVPSLDARIAGRLGLRPDVRRMPLFGLGCVAGAAGVARLRDYLRGAPDDVA
VLVSVELCSLTYPAVKPTVSSLVGTALFGDGAAAVVAVGDRRAEQVRAGGPDILDSRSSLYPDSLHIMGWDVGSHGLRLR
LSPDLTNLIERYLANDVTTFLDAHRLTKDDIGAWVSHPGGPKVIDAVATSLALPPEALELTWRSLGEIGNLSSASILHIL
RDTIEKRPPSGSAGLMLAMGPGFCTELVLLRWR
>I6XD69 2.3.1.295~~~~~~Mycoketide-CoA synthase~~~COG0604
MVDQLQHATEALRKALVQVERLKRTNRALLERSSEPIAIVGMSCRFPGGVDSPEGLWQMVADARDVMSEFPTDRGWDLAG
LFDPDPDVRHKSYARTGGFVDGVADFDPAFFGISPSEALAMDPQHRMLLELSWEALERAGIDPTGLRGSATGVFAGLIVG
GYGMLAEEIEGYRLTGMTSSVASGRVAYVLGLEGPAVSVDTACSSSLVALHMAVGSLRSGECDLALAGGVTVNATPTVFV
EFSRHRGLAPDGRCKPYAGRADGVGWSEGGGMLVLQRLSDARRLGHPVLAVVVGSAVNQDGASNGLTAPNGPSQQRVVRA
ALANAGLSAAEVDVVEGHGTGTTLGDPIEAQALLATYGQDRGEPGEPLWLGSVKSNMGHTQAAAGVAGVIKMVLAMRHEL
LPATLHVDVPSPHVDWSAGAVELLTAPRVWPAGARTRRAGVSSFGISGTNAHVIIEAVPVVPRREAGWAGPVVPWVVSAK
SESALRGQAARLAAYVRGDDGLDVADVGWSLAGRSVFEHRAVVVGGDRDRLLAGLDELAGDQLGGSVVRGTATAAGKTVF
VFPGQGSQWLGMGIELLDTAPAFAQQIDACAEAFAEFVDWSLVDVLRGAPGAPGLDRVDVVQPVLFAVMVSLAELWKSVA
VHPDAVIGHSQGEIAAAYVAGALSLRDAARVVTLRSKLLAGLAGPGGMVSIACGADQARDLLAPFGDRVSIAVVNGPSAV
VVSGEVGALEELIAVCSTKELRTRRIEVDYASHSVEVEAIRGPLAEALSGIEPRSTRTVFFSTVTGNRLDTAGLDADYWY
RNVRQTVLFDQAVRNACEQGYRTFIESSPHPALITGVEETFAACTDGDSEAIVVPTLGRGDGGLHRFLLSAASAFVAGVA
VNWRGTLDGAGYVELPTYAFDKRRFWLSAEGSGADVSGLGLGASEHPLLGAVVDLPASGGVVLTGRLSPNVQPWLADHAV
SDVVLFPGTGFVELAIRAGDEVGCSVLDELTLAAPLLLPATGSVAVQVVVDAGRDSNSRGVSIFSRADAQAGWLLHAEGI
LRPGSVEPGADLSVWPPAGAVTVDVADGYERLATRGYRYGPAFRGLTAMWARGEEIFAEVRLPEAAGGVGGFGVHPALLD
AVLHAVVIAGDPDELALPFAWQGVSLHATGASAVRARIAPAGPSAVSVELADGLGLPVLSVASMVARPVTERQLLAAVSG
SGPDRLFEVIWSPASAATSPGPTPAYQIFESVAADQDPVAGSYVRSHQALAAVQSWLTDHESGVLVVATRGAMALPREDV
ADLAGAAVWGLVRSAQTEHPGRIVLVDSDAATDDAAIAMALATGEPQVVLRGGQVYTARVRGSRAADAILVPPGDGPWRL
GLGSAGTFENLRLEPVPNADAPLGPGQVRVAMRAIAANFRDIMITLGMFTHDALLGGEGAGVVVEVGPGVTEFSVGDSVF
GFFPDGSGTLVAGDVRLLLPMPADWSYAEAAAISAVFTTAYYAFIHLADVQPGQRVLIHAGTGGVGMAAVQLARHLGLEV
FATASKGKWDTLRAMGFDDDHISDSRSLEFEDKFRAATGGRGFDVVLDSLAGEFVDASLRLVAPGGVFLEMGKTDIRDPG
VIAQQYPGVRYRAFDLFEPGRPRMHQYMLELATLFGDGVLRPLPVTTFDVRRAPAALRYLSQARHTGKVVMLMPGSWAAG
TVLITGGTGMAGSAVARHVVARHGVRNLVLVSRRGPDAPGAAELVAELAAAGAQVQVVACDAADRAALAKVIADIPVQHP
LSGVIHTAGALDDAVVMSLTPDRVDVVLRSKVDAAWHLHELTRDLDVSAFVMFSSMAGLVGSSGQANYAAANSFLDALAA
HRRAHGLPAISLGWGLWDQASAMTGGLDAADLARLGREGVLALSTAEALELFDTAMIVDEPFLAPARIDLTALRAHAVAV
PPMFSDLASAPTRRQVDDSVAAAKSKSALAHRLHGLPEAEQHAVLLGLVRLHIATVLGNITPEAIDPDKAFQDLGFDSLT
AVEMRNRLKSATGLSLSPTLIFDYPTPNRLASYIRTELAGLPQEIKHTPAVRTTSEDPIAIVGMACRYPGGVNSPDDMWD
MLIQGRDVLSEFPADRGWDLAGLYNPDPDAAGACYTRTGGFVDGVGDFDPAFFGVGPSEALAMDPQHRMLLELSWEALER
AGIDPTGLRGSATGVFAGVMTQGYGMFAAEPVEGFRLTGQLSSVASGRVAYVLGLEGPAVSVDTACSSSLVALHMAVGSL
RSGECDLALAGGVTVNATPDIFVEFSRWRGLSPDGRCKAFAAAADGTGFSEGGGMLVLQRLSDARRLGHPVLAVVVGSAV
NQDGASNGLTAPNGPSQQRVVRAALANAGLSAAEVDVVEGHGTGTTLGDPIEAQALLATYGQDRGEPGEPLWLGSVKSNM
GHTQAAAGVAGVIKMVLAMRHELLPATLHVDVPSPHVDWSAGAVELLTAPRVWPAGARTRRAGVSSFGISGTNAHVIIEA
VPVVPRREAGWAGPVVPWVVSAKSESALRGQAARLAAYVRGDDGLDVADVGWSLAGRSVFEHRAVVVGGDRDRLLAGLDE
LAGDQLGGSVVRGTATAAGKTVFVFPGQGSQWLGMGMGLHAGYPVFAEAFNTVVGELDRHLLRPLREVMWGHDENLLNST
EFAQPALFAVEVALFRLLGSWGVRPDFVMGHSIGELSAAHVAGVLSLENAAVLVAARGRLMQALPAGGAMVAVQAAEEEV
RPLLSAEVDIAAVNGPASLVISGAQNAVAAVADQLRADGRRVHQLAVSHAFHSPLMDPMIDEFAAVAAGIAIGRPTIGVI
SNVTGQLAGDDFGSAAYWRRHIRQAVRFADSVRFAQAAGGSRFLEVGPSGGLVASIEESLPDVAVTTMSALRKDRPEPAT
LTNAVAQGFVTGMDLDWRAVVGEAQFVELPTYAFQRRRFWLSGDGVAADAAGLGLAASEHALLGAVIDLPASGGVVLTGR
LSPSVQGWLADHSVAGVTIFPGAGFVELAIRAGDEVGCGVVDELTLAAPLVLPASGSVAVQVVVNGPDESGVRGVSVYSR
GDVGTGWVLHAEGALRAGSAEPTADLAMWPPAGAVPVEVADGYQQLAERGYGYGPAFRGLTAMWRRGDEVFAEVALPADA
GVSVTGFGVHPVLLDAALHAVVLSAESAERGQGSVLVPFSWQGVSLHAAGASAVRARIAPVGPSAVSIELADGLGLPVLS
VASMLARPVTDQQLRAAVSSSGPDRLFEVTWSPQPSAAVEPLPVCAWGTTEDSAAVVFESVPLAGDVVAGVYAATSSVLD
VLQSWLTRDGAGVLVVMTRGAVALPGEDVTDLAGAAVWGLVRSAQTEHPGRIVLVDSDAPLDDSALAAVVTTGEPQVLWR
RGEVYTARVHGSRAVGGLLVPPSDRPWRLAMSTAGTFENLRLELIPDADAPLGPGQVRVAVSAIAANFRDVMIALGLYPD
PDAVMGVEACGVVIETSLNKGSFAVGDRVMGLFPEGTGTVASTDQRLLVKVPAGWSHTAAATTSVVFATAHYALVDLAAA
RSGQRVLIHAGTGGVGMAAVQLARHLGLEVFATASKGKWDTLRAMGFDDDHISDSRSLEFEDKFRAATGGRGFDVVLDSL
AGEFVDASLRLVAPGGVFLEMGKTDIRDPGVIAQQYPGVRYRAFDLFEPGPDRIAQILAELATLFGDGVLRPLPVTTFDV
RCAPAALRYLSQARHTGKVVMLMPGSWAAGTVLITGGTGMAGSAVARHVVARHGVRNLVLVSRRGPDAPGAAELVAELAA
AGAQVQVVACDAADRAALAKVIADIPVQHPLSGVIHTAGALDDAVVMSLTPDRVDVVLRSKVDAAWHLHELTRDLDVSAF
VMFSSMAGLVGSSGQANYAAANSFLDALAAHRRAHGLPAISLGWGLWDQASAMTGGLATVDFKRFARDGIVAMSSADALQ
LFDTAMIVDEPFMLPAHIDFAALKVKFDGGTLPPMFVDLINAPTRRQVDDSLAAAKSKSALLQRLEGLPEDEQHAVLLDL
VRSHIATVLGSASPEAIDPDRAFQELGFDSLTAVEMRNRLKSATGLALSPTLIFDYPNSAALAGYMRRELLGSSPQDTSA
VAAGEAELQRIVASIPVKRLRQAGVLDLLLALANETETSGQDPALAPTAEQEIADMDLDDLVNAAFRNDDE
>I6X8D2 2.3.1.-~~~~~~Polyketide synthase Pks13~~~COG0236
MADVAESQENAPAERAELTVPEMRQWLRNWVGKAVGKAPDSIDESVPMVELGLSSRDAVAMAADIEDLTGVTLSVAVAFA
HPTIESLATRIIEGEPETDLAGDDAEDWSRTGPAERVDIAIVGLSTRFPGEMNTPEQTWQALLEGRDGITDLPDGRWSEF
LEEPRLAARVAGARTRGGYLKDIKGFDSEFFAVAKTEADNIDPQQRMALELTWEALEHARIPASSLRGQAVGVYIGSSTN
DYSFLAVSDPTVAHPYAITGTSSSIIANRVSYFYDFHGPSVTIDTACSSSLVAIHQGVQALRNGEADVVVAGGVNALITP
MVTLGFDEIGAVLAPDGRIKSFSADADGYTRSEGGGMLVLKRVDDARRDGDAILAVIAGSAVNHDGRSNGLIAPNQDAQA
DVLRRAYKDAGIDPRTVDYIEAHGTGTILGDPIEAEALGRVVGRGRPADRPALLGAVKTNVGHLESAAGAASMAKVVLAL
QHDKLPPSINFAGPSPYIDFDAMRLKMITTPTDWPRYGGYALAGVSSFGFGGANAHVVVREVLPRDVVEKEPEPEPEPKA
AAEPAEAPTLAGHALRFDEFGNIITDSAVAEEPEPELPGVTEEALRLKEAALEELAAQEVTAPLVPLAVSAFLTSRKKAA
AAELADWMQSPEGQASSLESIGRSLSRRNHGRSRAVVLAHDHDEAIKGLRAVAAGKQAPNVFSVDGPVTTGPVWVLAGFG
AQHRKMGKSLYLRNEVFAAWIEKVDALVQDELGYSVLELILDDAQDYGIETTQVTIFAIQIALGELLRHHGAKPAAVIGQ
SLGEAASAYFAGGLSLRDATRAICSRSHLMGEGEAMLFGEYIRLMALVEYSADEIREVFSDFPDLEVCVYAAPTQTVIGG
PPEQVDAILARAEAEGKFARKFATKGASHTSQMDPLLGELTAELQGIKPTSPTCGIFSTVHEGRYIKPGGEPIHDVEYWK
KGLRHSVYFTHGIRNAVDSGHTTFLELAPNPVALMQVALTTADAGLHDAQLIPTLARKQDEVSSMVSTMAQLYVYGHDLD
IRTLFSRASGPQDYANIPPTRFKRKEHWLPAHFSGDGSTYMPGTHVALPDGRHVWEYAPRDGNVDLAALVRAAAAHVLPD
AQLTAAEQRAVPGDGARLVTTMTRHPGGASVQVHARIDESFTLVYDALVSRAGSESVLPTAVGAATAIAVADGAPVAPET
PAEDADAETLSDSLTTRYMPSGMTRWSPDSGETIAERLGLIVGSAMGYEPEDLPWEVPLIELGLDSLMAVRIKNRVEYDF
DLPPIQLTAVRDANLYNVEKLIEYAVEHRDEVQQLHEHQKTQTAEEIARAQAELLHGKVGKTEPVDSEAGVALPSPQNGE
QPNPTGPALNVDVPPRDAAERVTFATWAIVTGKSPGGIFNELPRLDDEAAAKIAQRLSERAEGPITAEDVLTSSNIEALA
DKVRTYLEAGQIDGFVRTLRARPEAGGKVPVFVFHPAGGSTVVYEPLLGRLPADTPMYGFERVEGSIEERAQQYVPKLIE
MQGDGPYVLVGWSLGGVLAYACAIGLRRLGKDVRFVGLIDAVRAGEEIPQTKEEIRKRWDRYAAFAEKTFNVTIPAIPYE
QLEELDDEGQVRFVLDAVSQSGVQIPAGIIEHQRTSYLDNRAIDTAQIQPYDGHVTLYMADRYHDDAIMFEPRYAVRQPD
GGWGEYVSDLEVVPIGGEHIQAIDEPIIAKVGEHMSRALGQIEADRTSEVGKQ
>P96284 ~~~~~~Putative inactive phenolphthiocerol synthesis polyketide synthase type I Pks15~~~COG3321
MIEEQRTMSVEGADQQSEKLFHYLKKVAVELDETRARLREYEQRATEPVAVVGIGCRFPGGVDGPDGLWDVVSAGRDVVS
EFPTDRGWDVEGLYDPDPDAEGKTYTRWGAFLDDATGFDAGFFGIAPSEVLAMDPQQRLMLEVSWEALEHAGIDPLSLRG
SATGVYTGIFAASYGNRDTGGLQGYGLTGTSISVASGRVSYVLGLQGPAVSVDTACSSSLVAIHWAMSSLRSGECDLALA
GGVTVMGLPSIFVGFSRQRGLAADGRCKAFAAAADGTGWGEGAGVVVLERLSDARRLGHSVLAVVRGSAVNQDGASNGLT
APNGLAQQRVIQVALANAGLSAADVDVVEAHGTATTLGDPIEAQALLSTYGQGGPAEQPLWVGSIKSNMGHTQAAAGVAG
VIKMVQAMRHGVMPATLHVDEPSPRVDWTSGAVSVLTEAREWSVDGRPRRAAVSSFGISGTNAHLILEEAPVPAPAEAPV
EASESTGGRGRRWCRG
>P9WPF1 2.3.1.-~~~~~~Alpha-pyrone synthesis polyketide synthase-like Pks18~~~COG3424
MNVSAESGAPRRAGQRHEVGLAQLPPAPPTTVAVIEGLATGTPRRVVNQSDAADRVAELFLDPGQRERIPRVYQKSRITT
RRMAVDPLDAKFDVFRREPATIRDRMHLFYEHAVPLAVDVSKRALAGLPYRAAEIGLLVLATSTGFIAPGVDVAIVKELG
LSPSISRVVVNFMGCAAAMNALGTATNYVRAHPAMKALVVCIELCSVNAVFADDINDVVIHSLFGDGCAALVIGASQVQE
KLEPGKVVVRSSFSQLLDNTEDGIVLGVNHNGITCELSENLPGYIFSGVAPVVTEMLWDNGLQISDIDLWAIHPGGPKII
EQSVRSLGISAELAAQSWDVLARFGNMLSVSLIFVLETMVQQAESAKAISTGVAFAFGPGVTVEGMLFDIIRR
>P96285 ~~~pks1~~~Putative inactive phenolphthiocerol synthesis polyketide synthase type I Pks1~~~COG0604
MISARSAEALTAQAGRLMAHVQANPGLDPIDVGCSLASRSVFEHRAVVVGASREQLIAGLAGLAAGEPGAGVAVGQPGSV
GKTVVVFPGQGAQRIGMGRELYGELPVFAQAFDAVADELDRHLRLPLRDVIWGADADLLDSTEFAQPALFAVEVASFAVL
RDWGVLPDFVMGHSVGELAAAHAAGVLTLADAAMLVVARGRLMQALPAGGAMVAVAASEDEVEPLLGEGVGIAAINAPES
VVISGAQAAANAIADRFAAQGRRVHQLAVSHAFHSPLMEPMLEEFARVAARVQAREPQLGLVSNVTGELAGPDFGSAQYW
VDHVRRPVRFADSARHLQTLGATHFIEAGPGSGLTGSIEQSLAPAEAMVVSMLGKDRPELASALGAAGQVFTTGVPVQWS
AVFAGSGGRRVQLPTYAFQRRRFWETPGADGPADAAGLGLGATEHALLGAVVERPDSDEVVLTGRLSLADQPWLADHVVN
GVVLFPGAGFVELVIRAGDEVGCALIEELVLAAPLVMHPGVGVQVQVVVGAADESGHRAVSVYSRGDQSQGWLLNAEGML
GVAAAETPMDLSVWPPEGAESVDISDGYAQLAERGYAYGPAFQGLVAIWRRGSELFAEVVAPGEAGVAVDRMGMHPAVLD
AVLHALGLAVEKTQASTETRLPFCWRGVSLHAGGAGRVRARFASAGADAISVDVCDATGLPVLTVRSLVTRPITAEQLRA
AVTAAGGASDQGPLEVVWSPISVVSGGANGSAPPAPVSWADFCAGSDGDASVVVWELESAGGQASSVVGSVYAATHTALE
VLQSWLGADRAATLVVLTHGGVGLAGEDISDLAAAAVWGMARSAQAENPGRIVLIDTDAAVDASVLAGVGEPQLLVRGGT
VHAPRLSPAPALLALPAAESAWRLAAGGGGTLEDLVIQPCPEVQAPLQAGQVRVAVAAVGVNFRDVVAALGMYPGQAPPL
GAEGAGVVLETGPEVTDLAVGDAVMGFLGGAGPLAVVDQQLVTRVPQGWSFAQAAAVPVVFLTAWYGLADLAEIKAGESV
LIHAGTGGVGMAAVQLARQWGVEVFVTASRGKWDTLRAMGFDDDHIGDSRTCEFEEKFLAVTEGRGVDVVLDSLAGEFVD
ASLRLLVRGGRFLEMGKTDIRDAQEIAANYPGVQYRAFDLSEAGPARMQEMLAEVRELFDTRELHRLPVTTWDVRCAPAA
FRFMSQARHIGKVVLTMPSALADRLADGTVVITGATGAVGGVLARHLVGAYGVRHLVLASRRGDRAEGAAELAADLTEAG
AKVQVVACDVADRAAVAGLFAQLSREYPPVRGVIHAAGVLDDAVITSLTPDRIDTVLRAKVDAAWNLHQATSDLDLSMFA
LCSSIAATVGSPGQGNYSAANAFLDGLAAHRQAAGLAGISLAWGLWEQPGGMTAHLSSRDLARMSRSGLAPMSPAEAVEL
FDAALAIDHPLAVATLLDRAALDARAQAGALPALFSGLARRPRRRQIDDTGDATSSKSALAQRLHGLAADEQLELLVGLV
CLQAAAVLGRPSAEDVDPDTEFGDLGFDSLTAVELRNRLKTATGLTLPPTVIFDHPTPTAVAEYVAQQMSGSRPTESGDP
TSQVVEPAAAEVSVHA
>A0R1E8 2.3.1.-~~~pks5~~~Mycocerosic acid synthase-like polyketide synthase~~~COG0604
MTQNCVAPVAIIGMACRLPGAINSPQQLWEALLRGDDFVTEIPTGRWDAEEYYDPEPGVPGRSVSKWGAFLDDPAAFDPE
FFGITEREAAAIDPQHRLLLETAWEAVEHSGLNPAGLAGSATGVFMGLTHNDYAHLAADAKALEGPYGFTGTSFSLASGR
IAYALGVHGPAITVDTACSSSLSAIHMACRSLHDGESDVALAGGVSVLLEPRKAAGGSAAGMLSPTGHCHAFDTAADGFV
SAEGCVVLTLKRLDDAVADGDRILAVIRGTATNQDGRTVNIATPSADAQAKVYRMALKAAGVEPGTVGLVEAHGTGTPVG
DPLEFSSLAEVYGTDGPCALGSIKTNFGHTQSAAGALGVMKAVLALQHNVIPQNLHFTRLPDQMAEIETGLFVPETITPW
PVREGQPRRAAVSAYGLSGTNVHAVLEQAPESPAETAAEAISPKAGNALVFPVSASSADALRSTAQHLADWLLRSGDGNG
RGPAIDLGDLAYTLARRRGFRAARSAVLAGDRGTLVEGLRQIADGEAMPQQAVTNDDRGPVWVFSGQGSQWASMGAELLD
REPAFAAAIAELEPLIAAESDFSVTEALTASETVTGIDRVQPTIFAVQVALAAAMRSHGVVPGAVIGHSMGEVAASVVSG
ALSLEDGVKVICRRTRLMTRIAGSGAMAMVELPAQQVLSELASRGVDDVVLSVVASPQSTVVGGATASVRELIEMWESRG
VMAREIAVDVASHSPQVDPILDDLIEALADLDPAEPEIPYYSATLYDPRDYADYDAYYWADNLRHTVRFSAAVQAALEDG
HRVFAELSPHPLLTHPVEQTARSLDMPLAVFAAMRRQQEMPHGLLGFVADLHSAGAAVDFSVLYPTGRLLDAPLPAWTHS
TLLLDRELESSAPGVPSVSVHPLLGSHVVLPQEPEEHLWQGDVGTEAHPWLSDHRVHQVAVLPGAAYCEMALAAVTPVLG
DTGEVHDLKFHDMLLLDDATPVWVSAAVTAPGTAEFGVETHQSGDRTQRATAVLRGDVDAERPAAHSIDALLAAHPNRVD
GDELRAGFGTVGIGHGAAFAGLSEAYVATAAEPTVVAAVALPGPLRSGQRGYTVHPALLDACFQSVIAHPEVQNIASGML
LPLGVRRLRAYGSTRNVRYCLSRIVKADSFGVEADLELLDADGTVLLSAMGLQLGTGNSDKAEEERLLDERLLTIEWQQR
ELPRPEGSETVDAGSWLVILAGDDDENPRAAGVVSALIGAGMPTTTMAWSHDADHDAQAAALTARLDEQPLAGVAVIVGD
SETGTDAHDVGADARRGADHVRHLVRIARTLADAVGEPPRLYVVTHRSQHVLDTDEPYLEHSGLRGLIRVVGMEHPRLRA
TQIDVDDSTAHEALVRQLLSGSPEDETAWRDGQWYAARLCPSPLRAAERRTAVADNASEGMRLVVRNPGDLESMELVTFE
RGTPGPGQIEVAVKASSINFADVLVAFGRCPSFDGRLPELGSEFGGVVTAVGPGVTTHRVGDRVGGVSANGCWSNFVTCE
ADLATKLPEGISEHEAAAVGLAYGTVWLGLTELARMSAGDKILIHSATGGVGQAAIAVARAAGAEIYATAGSEKRRQLLR
DWGIEHVYDSRTTAFADQIRTDTDGYGVDIVLNSVTGPAQRAGLELLAFGGRFVEIGKRDIYADTRLGLFPFRRNLSFYA
VDLALMTVTHPQKIRDLLATVYRLIADGTLPLPEITHYPLEEAATAIRIMGGAQHTGKLVIDIPDTGQSQVVVPPEQVPV
FRGDGAYVITGGLGGLGLFLAERMAAAGCGRIVVNSRSAPSTRSSEIIELIRATGADIVVECGDIAEPDTALRLVAAATQ
TGLPLRGVLHAAAVVEDATLANITDELVEHDWAPKVYGAWNLHQAVQSGGPATSELDWFCAFSSAAALVGSPGQGAYAAA
NSWLDAFMQWRRAQGLPATSIAWGAWGEIGRGTAMAEGDNAIAPDEGAYAFEAILRHDRVYNGYAPVLGASWLTAFAQRS
PFAELFLADTQGASETRKLRSELAALPREEWPTHLRRLIAEQVGLLLRRTVDPDRPLSEYGLDSLGHLELRTRIETETGV
RVSAMDMTTIRGLAQRLCEMLDTDDAVSAPS
>O53901 2.3.1.-~~~pks5~~~Mycocerosic acid synthase-like polyketide synthase~~~COG0604
MGKERTKTVDRTRVTPVAVIGMGCRLPGGIDSPDRLWEALLRGDDLVTEIPADRWDIDEYYDPEPGVPGRTDCKWGAYLD
NVGDFDPEFFGIGEKEAIAIDPQHRLLLETSWEAMEHGGLTPNQMASRTGVFVGLVHTDYILVHADNQTFEGPYGNTGTN
ACFASGRVAYAMGLQGPAITVDTACSSGLTAIHLACRSLHDGESDIALAGGVYVMLEPRRFASGSALGMLSATGRCHAFD
VSADGFVSGEGCVMLALKRLPDALADGDRILAVIRGTAANQDGHTVNIATPSRSAQVAAYREALDVAGVDPATVGMVEAH
GPGTPVGDPIEYASLAEVYGNDGPCALASVKTNFGHTQSAAGALGLMKAVLALQHGVVPQNLHFTALPDKLAAIETNLFV
PQEITPWPGADQETPRRAAVSSYGMTGTNVHAIVEQAPVPAPESGAPGDTPATPGIDGALLFALSASSQDALRQTAARLA
DWVDAQGPELAPADLAYTLARRRGHRPVRTAVLAATTAELTEALREVATGEPPYPPAVGQDDRGPVWVFSGQGSQWAGMG
ADLLATEPVFAATIAAIEPLIAAESGFSVTEAMTAPEVVTGIDRVQPTLFAMQVALAATMKSYGVAPGAVIGHSLGESAA
AVVAGALCLEDGVRVICRRSALMTRIAGAGAMASVELPAQQVLSELMARGVNDAVVAVVASPQSTVIGGATQTVRDLVAA
WEQRDVLAREVAVDVASHSPQVDPILDELAEALAEISPLQPEIPYYSATSFDPREEPYCDAYYWVDNLRHTVRFAAAVQA
ALEDGYRVFTELTPHPLLTHAVDQTARSLDMSAAALAGMRREQPLPHGLRALAGDLYAAGAAVDFAVLYPTGRLINAPLP
TWNHRRLLLDDTTRRIAHANTVAVHPLLGSHVRLPEEPERHVWQGEVGTVTQPWLADHQIHGAAALPGAAYCEMALAAAR
AVLGEASEVRDIRFEQMLLLDDETPIGVTATVEAPGVVPLTVETSHDGRYTRQLAAVLHVVREADDAPDQPPQKNIAELL
ASHPHKVDGAEVRQWLDKRGHRLGPAFAGLVDAYIAEGAGDTVLAEVNLPGPLRSQVKAYGVHPVLLDACFQSVAAHPAV
QGMADGGLLLPLGVRRLRSYGSARHARYCCTTVTACGVGVEADLDVLDEHGAVVLAVRGLQLGTGASQASERARVLGERL
LSIEWHERELPENSHAEPGAWLLISTCDATDLVAAQLTDALKVHDAQCTTMSWPQRADHAAQAARLRDQLGTGGFTGVFV
LTAPQTGDPDAESPVRGGELVKHVVRIAREIPEITAQEPRLYVLTHNAQAVLSGDRPNLEQGGMRGLLRVIGAEHPHLKA
SYVDVDEQTGAESVARQLLAASGEDETAWRNDQWYTARLCPAPLRPEERQTTVVDHAEAGMRLQIRTPGDLQTLEFAAFD
RVPPGPGEIEVAVTASSINFADVLVTFGRYQTLDGRQPQLGTDFAGVVSAVGPGVSELKVGDRVGGMSPNGCWATFVTCD
ARLATRLPEGLTDAQAAAVTTASATAWYGLQDLARIKAGDKVLIHSATGGVGQAAIAIARAAGAQIYATAGNEKRRDLLR
DMGIEHVYDSRSVEFAEQIRRDTAGYGVDIVLNSVTGAAQLAGLKLLALGGRFIEIGKRDIYSNTRLELLPFRRNLAFYG
LDLGLMSVSHPAAVRELLSTVYRLTVEGVLPMPQSTHYPLAEAATAIRVMGAAEHTGKLILDVPHAGRSSVVLPPEQARV
FRSDGSYIITGGLGGLGLFLAEKMANAGAGRIVLSSRSQPSQKALETIELVRAIGSDVVVECGDIAQPDTADRLVTAATA
TGLPLRGVLHAAAVVEDATLANITDELIERDWAPKAYGAWQLHRATADQPLDWFCSFSSAAALVGSPGQGAYAAANSWLD
TFTHWRRAQDLPATSIAWGAWGQIGRAIAFAEQTGDAIAPEEGAYAFETLLRHNRAYSGYAPVIGSPWLTAFAQHSPFAE
KFQSLGQNRSGTSKFLAELVDLPREEWPDRLRRLLSKQVGLILRRTIDTDRLLSEYGLDSLSSQELRARVEAETGIRISA
TEINTTVRGLADLMCDKLAADRDAPAPA
>O34769 3.-.-.-~~~pksB~~~Probable polyketide biosynthesis zinc-dependent hydrolase PksB~~~COG0491
MNLTYKVHPIKTRYQGWTNYCYIIEDIVSRSAIVVDPSWELSKITTTLSELEAELKAVALTHSHYDHVNLVDPLTKMFNA
QVYMSKKEIDYYQFRCRNLISLDDHQTISIGNTRAQCLLTPGHTAGGMCYLFSESIFTGDTVFTEGCGICEDDGSSAEEM
FDSIQRIKSEVSPHVRVYPGHSFGKSPGHSIKDLYQHNIYFQIDKKEYFVKFRTRKNQKGIFDFK
>O34825 2.3.1.39~~~pksC~~~Polyketide biosynthesis malonyl CoA-acyl carrier protein transacylase PksC~~~COG0331
MITYVFPGQGSQKQGMGSGLFDEFKELTDQADEILGYSIKRLCLENPYSNLNKTQFTQPALYVVNALSYLKKIRDEEVKP
DFVAGHSLGEYNALFAAEAFDFETGLQLVRKRGELMSLISNGGMAAVMGLNEEQVAKALKEYHLHDVDIANVNAPYQIVI
SGKKDEIEKAASLFETMTEVTMVLPLNVSGAFHSRYMNKAKEEFEEFLHAFYFSPPSIPVISNVYAKPYTYEFMKQTLAD
QINHSVKWTDSISYLMKKGHMEFEEVGPGNVLTGLIHRIKKDAEAMPR
>O34877 2.3.1.-~~~pksD~~~Polyketide biosynthesis acyltransferase homolog PksD~~~COG3321
MNEPLVFMFSGQGSQYYHMGKELFKENTVFRQSMLEMDAIAARRIGTSIVEEIYHPGKRVSDPFDSILFSHPAIFMIEYS
LYKVLEDRGIYPDYVLGSSLGEFAAAAVSGVSDAEDMLDCILEQAIIIQNSCDKGKMLAILDKPQLLNDHPQLFGNSELI
SINYDSHFVISGEEDHIRKIMEDLKEKQILCQLLPVSYAFHSSLIDPAESAYAEFLRSKSFQKPSIPIVSSLTGSCLHVM
DENFFWNAVRKPMMFREAIRYLESQHTCKFIDLGPSGTLAAFVKQLIPGDSADRCCSIITPFHQELKNLNTVEYFRTPER
KFTR
>O34787 ~~~pksE~~~Polyketide biosynthesis protein PksE~~~COG0331
MITYVFPGQGSQQKGMGQGLFEQYQHLTDQADQILGYSIEKLCTEKSYLDVNHTEYTQPALYVVNALSYLKRVEETGRKP
DFAAGHSLGEYNALMAAGAFDFETGLRLVKKRGELMGRITGGGMAAVIGLSKEQVTAVLEEHRLYDIDVANENTPQQIVI
SGPKKEIEKARAVFENTKDVKLFHPLNVSGAFHSRYMNEAKQVFKQYIDSFQFAPLAIPVISNVYAEPYHQDRLKDTLSE
QMDNTVKWTDSIRFLMGRGEMEFAEIGPGTVLTGLIHRIKNEAEPLTYIPKKNPAISAHLKEQRNVQAGITAESLGSAEF
KQDYHLTYAYLAGGMYRGIASKEMVVKLSRAGMMGFFGTGGLSLKEVEDAIHAIQGELGKGQAYGINLVHNMKHTESEEK
MIDLLLRNQVSIVEASAFLSVTPVLVRYRAKGVKRNQNGDVICSNRLIAKISRPEVAESFLSPAPENMLQKLLGENKITM
NEAELLRCIPMADDICVEADSGGHTDGGVAYSLMPAMTSLRDEMMKKYQYRKKIRVGAAGGIGTPEAAMAAFMLGADFIL
TGSINQCTVEAATSDKVKDLLQQMNVQDTAYAPAGDMFESGSKVQVLKKGVFFPARANKLYELYQRYGSIRELDAKMLAQ
LEEKYFKRSIEDIYKDIALHYPAADIEKAEQNPKHKMALIFRWYFRYSSKLAISGSEHSKVDYQIHCGPALGAFNQWVKG
SQLENWRNRHVDEIGKKLMTETAVLLHERMQSMYQPSHETDNIKIKV
>P40804 4.1.1.87~~~pksF~~~Polyketide biosynthesis malonyl-ACP decarboxylase PksF~~~COG0304
MTKCNLPEVVVTGVGVTASIGQGKEDFASSLLSGRHAFDVMKRSGRQKDSRFIGAEIASLSYPDRLSKKMLRKASFSSRA
ALVTLTEAWEEAELDDADSSRIGLVVGGSNFQQRENFEVYERYQDRSGFISPAYGLSFMDSDLCGICTDQFGITGLAYTV
GGASASGQLAVIHAIQQVLSGEVDTCIALGALMDLSYMECEALRALGAMGTDKYADEPENACRPFDQNRDGFIYGESCGA
LVIERKETALRRGLKPYAALSGWSIKLDGNRNPDPSLEGEIHVIQKALERARLLPEDIDYINPHGTGSFIGDEIELKALR
ACRLSHAYINATKSITGHGLSAAGIVEIISVLLQMKKSALHPSRNLDHPIDDSFHWVNEKSISYRIKNALSLSMGFGGMN
TAVCIQNIEKCGGES
>P40830 2.3.3.-~~~pksG~~~Polyketide biosynthesis 3-hydroxy-3-methylglutaryl-ACP synthase PksG~~~COG3425
MVSAGIEAMNVFGGTAYLDVMELAKYRHLDTARFENLLMKEKAVALPYEDPVTFGVNAAKPIIDALSEAEKDRIELLITC
SESGIDFGKSLSTYIHEYLGLNRNCRLFEVKQACYSGTAGFQMAVNFILSQTSPGAKALVIASDISRFLIAEGGDALSED
WSYAEPSAGAGAVAVLVGENPEVFQIDPGANGYYGYEVMDTCRPIPDSEAGDSDLSLMSYLDCCEQTFLEYQKRVPGANY
QDTFQYLAYHTPFGGMVKGAHRTMMRKVAKVKTSGIETDFLTRVKPGLNYCQRVGNIMGAALFLALASTIDQGRFDTPKR
IGCFSYGSGCCSEFYSGITTPQGQERQRTFGIEKHLDRRYQLSMEEYELLFKGSGMVRFGTRNVKLDFEMIPGIMQSTQE
KPRLFLEEISEFHRKYRWIS
>P40805 4.2.1.-~~~pksH~~~Probable polyketide biosynthesis enoyl-CoA hydratase PksH~~~COG1024
MDLVTYQTIKVRFQASVCYITFHRPEANNTINDTLIEECLQVLNQCETSTVTVVVLEGLPEVFCFGADFQEIYQEMKRGR
KQASSQEPLYDLWMKLQTGPYVTISHVRGKVNAGGLGFVSATDIAIADQTASFSLSELLFGLYPACVLPFLIRRIGRQKA
HYMTLMTKPISVQEASEWGLIDAFDAESDVLLRKHLLRLRRLNKKGIAHYKQFMSSLDHQVSRAKATALTANQDMFSDPQ
NQMGIIRYVETGQFPWEDQ
>P40802 4.-.-.-~~~pksI~~~Putative polyketide biosynthesis enoyl-CoA isomerase PksI~~~COG1024
MTHSVVELIEIESAIIQVKMQDRTHKNAFSQELTDDLIQAFEYIRQNPKYKAVILTGYDNYFASGGTQEGLLRIQQGLTK
FTDDNLYSLALDCEIPVIAAMQGHGIGGGFVMGLFADIVILSRESVYTANFMKYGFTPGMGATFIVPKKLGFSLAQEILL
NAGSYRGADLEKRGVPFKVLPRAEVLDYAVELAQELAEKPRNSLVTLKDHLVAPLRDQLPRVIEQELMMHEKTFHHEEVK
SRIKGLYGN
>P40806 ~~~pksJ~~~Polyketide synthase PksJ~~~COG0300
MRNNDNIRILTNPSVSHGEPLHISEKQPATIPEVLYRTATELGDTKGIIYLQPDGTEVYQSYRRLWDDGLRIAKGLRQSG
LKAKQSVILQLGDNSQLLPAFWGCVLTGVVPAPLAVPPTYAESSSGTQKLKDAWTLLDKPAVITDRGMHQEMLDWAKEQG
LEGFRAIIVEDLLSAEADTDWHQSSPEDLALLLLTSGSTGTPKAVMLNHRNIMSMVKGIIQMQGFTREDITFNWMPFDHV
GGIGMLHLRDVYLGCQEINVSSETILMEPLKWLDWIDHYRASVTWAPNFAFGLVTDFAEEIKDKKWDLSSMRYMLNGGEA
MVAKVGRRILELLEPHGLPADAIRPAWGMSETSSGVIFSHEFTRAGTSDDDHFVEIGSPIPGFSMRIVNDHNELVEEGEI
GRFQVSGLSVTSGYYQRPDLNESVFTEDGWFETGDLGFLRNGRLTITGRTKDAIIINGINYYSHAIESAVEELPEIETSY
TAACAVRLGQNSTDQLAIFFVTSAKLNDEQMSQLLRNIQSHVSQVIGVTPEYLLPVQKEEIPKTAIGKIQRTQLKTSFEN
GEFDHLLHKPNRMNDAVQDEGIQQADQVKRVREEIQKHLLTCLTEELHVSHDWVEPNANIQSLGVNSIKMMKLIRSIEKN
YHIKLTAREIHQYPTIERLASYLSEHEDLSSLSADKKGTDTYKTEPERSQATFQPLSEVQKGLWTLQKMSPEKSAYHVPL
CFKFSSGLHHETFQQAFGLVLNQHPILKHVIQEKDGVPFLKNEPALSIEIKTENISSLKESDIPAFLRKKVKEPYVKENS
PLVRVMSFSRSEQEHFLLVVIHHLIFDGVSSVTFIRSLFDTYQLLLKGQQPEKAVSPAIYHDFAAWEKNMLAGKDGVKHR
TYWQKQLSGTLPNLQLPNVSASSVDSQFREDTYTRRLSSGFMNQVRTFAKEHSVNVTTVFLSCYMMLLGRYTGQKEQIVG
MPAMVRPEERFDDAIGHFLNMLPIRSELNPADTFSSFISKLQLTILDGLDHAAYPFPKMVRDLNIPRSQAGSPVFQTAFF
YQNFLQSGSYQSLLSRYADFFSVDFVEYIHQEGEYELVFELWETEEKMELNIKYNTGLFDAASISAMFDHFVYVTEQAML
NPSQPLKEYSLLPEAEKQMILKTWNATGKTYPYITFHELFEQQAKKTPDRAAVSYEGQTLTYRELDEKSTQLAIYLQAHG
VGPDRLAGIYVDRSLDMLVGLLAILKAGGAYVPLDPSYPAERLEYMLEDSEVFITLTTSELVNTLSWNGVTTALLDQDWD
EIAQTASDRKVLTRTVTPENLAYVIYTSGSTGKPKGVMIPHKALTNFLVSMGETPGLTAEDKMLAVTTYCFDIAALELFL
PLIKGAHCYICQTEHTKDVEKLKRDIRAIKPTVMQATPATWKMLFYSGWENEESVKILCGGEALPETLKRYFLDTGSEAW
NMFGPTETTIWSAVQRINVECSHATIGRPIANTQIYITDSQLAPVPAGVPGELCIAGDGVAKGYYKKEELTDSRFIDNPF
EPGSKLYRTGDMARWLTGGRIEYIGRIDNQVKIRGFRIELGDIESRLSEHPGILECVVVADMDNLAAYYTAKHANASLTA
RELRHFVKNALPAYMVPSYFIQLDHMPLTPNGKIDRNSLKNIDLSGEQLKQRQTSPKNIQDTVFTIWQEVLKTSDIEWDD
GFFDVGGDSLLAVTVADRIKHELSCEFSVTDLFEYSTIKNISQYITEQRMGDASDHIPTDPAAHIEDQSTEMSDLPDYYD
DSVAIIGISCEFPGAKNHDEFWENLRDGKESIAFFNKEELQRFGISKEIAENADYVPAKASIDGKDRFDPSFFQISPKDA
EFMDPQLRMLLTHSWKAIEDAGYAARQIPQTSVFMSASNNSYRALLPSDTTESLETPDGYVSWVLAQSGTIPTMISHKLG
LRGPSYFVHANCSSSLIGLHSAYKSLLSGESDYALVGGATLHTESNIGYVHQPGLNFSSDGHIKAFDASADGMIGGEGVA
VVLLKKAADAVKDGDHIYALLRGIGVNNDGADKVGFYAPSVKGQADVVQQVMNQTKVQPESICYVEAHGTGTKLGDPIEL
AALTNVYRQYTNKTQFCGIGSVKTNIGHLDTAAGLAGCIKVVMSLYHQELAPSVNYKEPNPNTDLASSPFYVVDQKKTLS
REIKTHRAALSSFGLGGTNTHAIFEQFKRDSDKGKIDGTCIVPISAKNKERLQEYAEDILAYLERRGFENSQLPDFAYTL
QVGREAMEHRVVFIADHVNELKQRLTDFINGNTAIEGCFQGSKHNAREVSWLTEDEDSAELIRKWMAKGKVNKLAEMWSK
GAHIDWMQLYKGERPNRMSLPTYPFAKERYWPSQDDRKPVAQISGNQTGIGSIHPLLHQNTSDFSEQKFSSVFTGDEFFL
RDHVVRGKPVLPGVAYLEMAYAAINQAAGSEIGQDVRIRLNHTVWVQPVVVDRHSAQVDISLFPEEDGKITFDIYSTQED
GDDPVIHSQGSAELASAAETPVADLTEMSRRCGKGKMSPDQFYEEGRSRGMFHGPAFQGIKNVNIGNREVLAQLQLPEIV
SGTNEQFVLHPSIMDSALQTATICIMQELTDQKLILPFALEELEVIKGCSSSMWAYARLSDSDHSGGVVQKADIDVIDES
GTVCVRIKGFSTRVLEGEVHTSKPSTRHERLMLEPVWEKQNEEREDEDLSYTEHIIVLFETERSVTDSIASHMKDARVIT
LNEAVGHIAERYQCYMQNIFELLQSKVRKLSAGRIIIQAIVPLEKEKQLFAGVSGLFKTAEIEFSKLTAQVIEIEKPEEM
IDLHLKLKDDSRRPFDKQIRYEAGYRFVKGWREMVLPSADTLHMPWRDEGVYLITGGAGSLGLLFAKEIANRTGRSTIVL
TGRSVLSEDKENELEALRSIGAEVVYREADVSDQHAVRHLLEEIKERYGTLNGIIHGAGSSKDRFIIHKTNEEFQEVLQP
KVSGLLHVDECSKDFPLDFFIFFSSVSGCLGNAGQADYAAANSFMDAFAEYRRSLAASKKRFGSTISFNWPLWEEGGMQV
GAEDEKRMLKTTGMVPMPTDSGLKAFYQGIVSDKPQVFVMEGQLQKMKQKLLSAGSKAKRNDQRKADQDQGQTRKLEAAL
IQMVGAILKVNTDDIDVNTELSEYGFDSVTFTVFTNKINEKFQLELTPTIFFEYGSVQSLAEYVVAAYQGEWNQDATAKG
KDERTNLVHSLSSLEASLSNMVSAILKVNSEDIDVNTELSEYGFDSVTFTVFTNKINEEFQLELTPTIFFEYGSLHSLAE
YLTVEHGDTLVQEREKPEGQEELQTKSSEAPKITSRRKRRFTQPIIAKAERNKKQAADFEPVAIVGISGRFPGAMDIDEF
WKNLEEGKDSITEVPKDRWDWREHYGNPDTDVNKTDIKWGGFIDGVAEFDPLFFGISPREADYVDPQQRLLMTYVWKALE
DAGCSPQSLSGTGTGIFIGTGNTGYKDLFHRANLPIEGHAATGHMIPSVGPNRMSYFLNIHGPSEPVETACSSSLVAIHR
AVTAMQNGDCEMAIAGGVNTILTEEAHISYSKAGMLSTDGRCKTFSADANGYVRGEGVGMVMLKKLEDAERDGNHIYGVI
RGTAENHGGRANTLTSPNPKAQADLLVRAYRQADIDPSTVTYIEAHGTGTELGDPIEINGLKAAFKELSNMRGESQPDVP
DHRCGIGSVKSNIGHLELAAGISGLIKVLLQMKHKTLVKSLHCETLNPYLQLTDSPFYIVQEKQEWKSVTDRDGNELPRR
AGISSFGIGGVNAHIVIEEYMPKANSEHTATEQPNVIVLSAKNKSRLIDRASQLLEVIRNKKYTDQDLHRIAYTLQVGRE
EMDERLACVAGTMQELEEKLQAFVDGKEETDEFFRGQSHRNKETQTIFTADEDMALALDAWIRKRKYAKLADLWVKGVSI
QWNTLYGETKPRLISLPSYPFAKDHYWVPAKEHSERDKKELVNAIEDRAACFLTKQWSLSPIGSAVPGTRTVAILCCQET
ADLAAEVSSYFPNHLLIDVSRIENDQSDIDWKEFDGLVDVIGCGWDDEGRLDWIEWVQRLVEFGHKEGLRLLCVTKGLES
FQNTSVRMAGASRAGLYRMLQCEYSHLISRHMDAEEVTDHRRLAKLIADEFYSDSYDAEVCYRDGLRYQAFLKAHPETGK
ATEQSAVFPKDHVLLITGGTRGIGLLCARHFAECYGVKKLVLTGREQLPPREEWARFKTSNTSLAEKIQAVRELEAKGVQ
VEMLSLTLSDDAQVEQTLQHIKRTLGPIGGVIHCAGLTDMDTLAFIRKTSDDIQRVLEPKVSGLTTLYRHVCNEPLQFFV
LFSSVSAIIPELSAGQADYAMANSYMDYFAEAHQKHAPIISVQWPNWKETGMGEVTNQAYRDSGLLSITNSEGLRFLDQI
VSKKFGPVVLPAMANQTNWEPELLMKRRKPHEGGLQEAALQSPPARDIEEADEVSKCDGLLSETQSWLIDLFTEELRIDR
EDFEIDGLFQDYGVDSIILAQVLQRINRKLEAALDPSILYEYPTIQRFADWLIGSYSERLSALFGGRISDASAPLENKIE
AEASVPGKDRALTPQIQAPAILSPDSHAEGIAVVGLSCRFPGAETLESYWSLLSEGRSSIGPIPAERWGCKTPYYAGVID
GVSYFDPDFFLLHEEDVRAMDPQALLVLEECLKLLYHAGYTPEEIKGKPVGVYIGGRSQHKPDEDSLDHAKNPIVTVGQN
YLAANLSQFFDVRGPSVVVDTACSSALVGMNMAIQALRGGDIQSAIVGGVSLLSSDASHRLFDRRGILSKHSSFHVFDER
ADGVVLGEGVGMVMLKTVKQALEDGDIIYAVVKAASVNNDGRTAGPATPNLEAQKEVMKDALFKSGKKPEDISYLEANGS
GSIVTDLLELKAIQSVYRSGHSSPLSLGSIKPNIGHPLCAEGIASFIKVVLMLKERRFVPFLSGEKEMAHFDQQKANITF
SRALEKWTDSQPTAAINCFADGGTNAHVIVEAWEKDEKHAIKRSPISPPQLKKRMLSPGEPKLEAETSKMTAANIWDTYE
VEV
>Q05470 ~~~pksL~~~Polyketide synthase PksL~~~COG0236
MRWRSNVKKITKQLTLSLKNPFIYHHVVYGQNVLPGLAYIDIIYQIFREHGFSCSELQLRNLSIYQPLTAEQDAVIVLNI
QCAEKKEGQWQITAKGIEKRDGKEASEEKLYMKADMHADSPAIFEETLDLSQIKASAQNVVQLDDVYEQCRRQELVHSEY
MKAKGCIYEEEDGVLLELSLGSEAMLHAEGFMFHPTLIDGSGVGANHLLTSLLKGEQRLYLPLFYESFSASALLQTDCMT
RIKRSSVRREKELIYVTLEFFNASGEKVAELKNFTSKLVREAELISGKHQDAQETQMTRADTAERDKPADMVSSPVNSYS
EAEQFVSQLIAEKINKPVEQVEKQVGYYQMGLNSSGLLEVVETISDKIGESLSPTLLFEHTTIAELSAFLAEEYAEHFSA
AGSLGQNERARVSDSINDHKTVEGSRPAPIEAAGDIAIIGLAGRYPKAANIHEFWNNLKEGKDCVSEIPESRWDWQRLEG
ITSPSGKDISKWGGFIDDPDCFDPQFFRITPREAETMDPQERLFLETCWETIEDAGYTPKTLAKPKGRNKRQHVGVFAGV
MHKDYTLVGAEEASAENVFPLSLNYAQIANRVSYFCNFHGPSMAVDTVCSSSLTAVHLALESIRHGECDVALAGGVNLSL
HPNKYMTYGVWDMFSTDGHCRTFGKDGDGYVPAEGIGAVLLKPLRQAEEDGDRIYAVIKGSAVNHVGTVSGISVPSPVSQ
ADLIETCLEKTGIDPRTISYVEAHGTGTSLGDPIEIQGLVKAFRQYTQDRQFCSIGSVKSNIGHAESAAGISGLSKVALQ
LHHQKLVPSLHSEELNPYVDFEKSPFYVQHETETWKQPVIKENGEDVPYPRRAGISSFGATGSNAHIILEEHIPQAAEQD
VSLSSDSDISAVIPLSARNQERLRVYAKRLLDFLHDGIQIRDLAYTLQVGREPMEERVSFLASGIQELSDQLKAFIEGRK
AIQHCWKGRVSRGSEPSRPAESVHKLLEQRKLDQIAEQWANGSGVDWKLLYEGSKPKRISLPTYPFERVRYWVPKAEKKT
DRSKQERHILHPLLHQNVSDISGVRFRSAFTGREFFLKDHVIKGEHVLPGAALLEMVRAAVERAAADQFPTGFRLRNIVW
VRPFAVTEQQKDIDVRLYPEENGEITFEICRDPESAEESPIVYGQGSAVLCEAGENPVINIEELKASYNGRTLSPFDCYE
AYTEMGIHYGDSHRAIDSLYAGENGVLVKLTMPPVISDTEDHYILHPSMIDSAFQASIGLRLGGATSLEDRKAMLPFAIQ
DVRIFKGCEASMWARITYSEGSTAGDRMQKLDIDLCNEEGQVCVRLTSYSARVLETDQEGPSEANDTLLFEHIWEERAAE
RQELIEYDTYKVVVCDVGEQMESLQNHLDCTVLQHDTETIDERFEGYAIQLFEEIKQLMHSKTGGHTFIQVAVPALDEPQ
LLSGLTGLLKTAELENPKLTGQLIEIETGMSAGELFEILEENRRYPRDTHIRHWQGKRFVSKWKEVSGEHLSADMPWKDK
GVYLITGGAGGLGFIFATEIANQTNDAVVILTGRSPLDERKKKKLKALQKLGIQAIYRQADLADKQTVDALLKETQNVYG
DLDGIIHSAGLIKDNFIMKKKKEEVQTVLAPKVAGLIHLDEATKDIPLDFFILFSSGAGAVGSAGQADYAMANAFMNAFS
EYRNGQAELHKRYGKTLSVCWPLWKDGGMQIDAETARMLKRETGMVAMETDRGIQALYHGWTSGKPQVLVASGVTDRIRA
FLHETGHGKGQSHNIKKSSLNQEAEKADMIGEIDEEILREKAENYFKQVLSSVIKLPAGQIDAEAPLEDYGIDSIMIMHV
TGQLEKVFGSLSKTLFFEYQDIRSLTRYFIDSRREKLLDILGFETGKPSVERKSEPEKQEIPVIPRKSGFLPLQDKEQKQ
VREKETEEIAIIGISGRYPQADNIDELWEKLRDGRDCITEIPADRWDHSLYYDEDKDKPGKTYSKWGGFMKDVDKFDPQF
FHISPREAKLMDPQERLFLQCVYETMEDAGYTREHLGRKRDAELGGSVGVYVGVMYEEYQLYGAQEQVRGRSLALTGNPS
SIANRVSYYFDFHGPSIALDTMCSSSLTAIHLACQSLQRGECEAAFAGGVNVSIHPNKYLMLGQNKFMSSKGRCESFGQG
GDGYVPGEGVGAVLLKPLSKAVEDGDHIYGIIKGTAINHGGKTNGYSVPNPNAQADVIKKAFVEAKVDPRTVSYIEAHGT
GTSLGDPIEITGLSKVFTQETDDKQFCAIGSAKSNIGHCESAAGIAGVTKVLLQMKYRQLAPSLHSNVLNPNIDFLNSPF
KVQQELEEWKRPIISVNGKDIELPRIAGVSSFGAGGVNAHILIEEYAPEPVEERLPARKQPAVIVLSAKNEERLQKRAER
LLHAIREQTYVEADLHRIAYTLQVGREAMKERLAFVAETMQELEEKLYECISGTENREYVYRGQVKSNKEAIAAFAADED
MSKTIEAWLQKGKYAKVLDLWVRGLRIDWSTLYQDQKPRRISLPAYPFARDRYWIDVNAKAEEKRTEEPFAPVQPVIPKP
SVDREASGKPANITLQPLMTNQDRLERVPSDTETETITAEALCDELTAGLAEVLYMDQNEIDPDEAFIDIGMDSITGLEW
IKAINKQYGTSLNVTKVYDYPTTRDFAVYLAHELSTQAGEKKQTETYTPIRQKTVVPAAKPANISLQPLEHHQPVQEEAE
ETIQYAAAEISASRQYTVAIETLHENLRESIADVLYMEPYEVDIDEAFIDIGMDSITGLEWIKAVNKQYGTSFTVTRVYD
YPTIRDFAEMLKSELGTHLDRKIEHTDSFEAAQQKPAASSHPKPAERPLQPVQHPIKKEHEKKTVPVLQDRPEDAIAIVG
MSGRYPGARNVREYWDNLVHARNAIRDIPTSRWDVDKYYDPVLNKKGKVYCKSMGMLDDIEHFDPLFFNIPPSEAELMDP
QHRIFLQEGYKAFEDAGYNARTLNEKKCGVYLGIMSNEYGVMLNRQSRANATGNSFAIAAARIPYFLNLKGPAIPIDTAC
SSSLVGTHLARQALINKEIDMALVGGVSLYLTPESYMSMCEAGMLSPDGQCKAFDNGANGFVPGEGAGALVLKRLKDAEA
DRDHIYGIIIGSGINQDGKTNGITAPSAKSQMDLERDIYETYGIHPESISYVEMHGTGTKQGDPIELEALSTVFQEKTDK
KQFCAIGSVKSNIGHTSAAAGVAGVQKVLLCMNHKTLVPTLNFTTPNEHFEFEHSPLYVNTELKPWETADGKPRRACVSS
FGYSGTNAHIVIEEYQPEKRNDRLTKQHRSALFVLSAKKEKQLKAYAEAMKDFVTSNEDIDLEDMAYTLQTGREAMDYRM
AFLADSREMLIKALDDYLAEMPNGSIFAAHVKTKKSEIKLFETDHDAKALLQTWIEKKRLEKVAELWVKGLQIDWNKLYG
EYTPRRISLPAYPFAEEYYWLPTQEGEPETIATAMPQFELMPKRCFLRKQWQPCPIEPAEMTNQTVAILANEETMALAEE
LSAYFSTYRIFDSQELDRVSAADYEHVAGAIDLIGCGTSHEHSMGWINWLQKLIEQGRASKHHLTVLGVTKGLEAYANEG
VLLSGASRAGLYRMLQSEYSHLTSRHADMECEASHEELARLIAVEYYAKSTESEVCYRNGQRYRAYLTEQPAEAALSHKQ
VSFSTDKVLLITGGTRGLGLLCARHFVKTYGVKRLVLIGREELPPRDQWNSVKISSLAEKIKAVQELEDMGAQVQVLSLD
LTDRVAVEQSLKTIHETMGAIGGVIHCAGMVNKQNPAFIRKSLEEIGQVLEPKVEGLQTLFDLLQDEPLAFFTLFSSVSA
AIPALAAGQADYAMANAFMDYFAEAHQDKCPIVSIQWPNWKETGLGEVRSKALEQTGLISLTNDEGLQLLDQILSDRQYA
VVLPAVPDTNVWKPDKLMQPSLPVEALSHPETKEQTSTRNLFPETVDWLVTLFSDELKIAAEDFETDEPFQEYGIDSIIL
AQLVQQMNQQLNGDIDPSILFEYPTIESFAHWLISKYDISAVLQPSVPEKQTPLKPQSAMKQKLVPEQRPQQISHEKTAL
LAEDIAIIGLSCRFPGAETLEEYWDLIRDGRSAIAPVPPERFGNSSSNYAGLIDEMNRFDHDFFMMSESDVRAMDPQALA
VLEESLKLWYHAGYTEKEVKGMRAGVYIGGRSQHKPDPASLSKAKNPIVAGGQNYLAANISQFFDLKGPSIVLDTACSSA
LVGLNMAIQALRSGDIEAAVVGGVSLLDADAHRMFHERGLLCDKPSFHIFDKRADGVILGEGVGMVLVKTVNQAVEDGDS
IYAVIKAAAINNDGRTAGPSSPNLEAQKDVMLSALEKSGKKTEEISYLEANGSGSAVTDLLELKAIQSIYRSESKAPLGL
GSVKPNIGHPLCAEGIASLIKVALMLKHRQLVPFLSGNENMPYFDIEKTDLYFSRSQAEWKETTPAAAINCFADGGTNAH
LIIEGWRDSAERPIRRKPLPLPELNRQPVLIKPSAQNVQKKVHSDTGASKDMFWKTFK
>P40872 ~~~pksM~~~Polyketide synthase PksM~~~COG0236
MITEQLHISLNNPIMSNHKVYGQALLPGLAYIDLIYQVFQEHGYAYQELELKNLTIFYPLIADESYDIALTIHVSERKEG
TWSIIIDGQKQHGESLSDKRQYVTADMHRKEQTAFAESIDLNQWKSTADRILNLDEIYEQCRSQELVHTGMMKAEGQIYE
AKEGAVIDLAVGQEALRHSDAFLFHPTLIDGSGIGSSCLISDQTMYLPLYYESFSASERLQKGCTARILSSSVRQKKELT
YMTIEYFNSAGQKVAELKQFAGKSVRNMSAFHSAKEIQEERAAVSQNISRDYPAFEMYLRQLLAKQLERPAEQMDIHAGY
YELGLDSSSLLTVVQEIGDKVGADLAPTLLFEFTTIAELAAHLADHYSIGEADDAVRQSPSPIDGVTSSPEIGEDIAIIG
MAGRYPKAKNIQEFWEQLKAGTDCITEIPNSRWEWKESDGLDSPAGKPLSKWGGFIEEADCFDPQFFRISPREAEMMDPQ
ERLFLETCWEAIEDAGYTPETIASPQGENKRQHVGVFAGVMHKDYSLIGAEALSEHNPFPLSLNYAQIANRVSYYCNFHG
PSMAVDTVCSSSLTAVHLAIESIRNGECEAALAGGVNLSLHPAKYISYGSVGMHSSDGYCHTFGKGGDGYVSGEGVGTVL
LKPLRKAEQDGDRIYAVIKGSAINHVGKVSGITVPSPVAQADVIEACLEKTGIDPRTISYVEAHGTGTSLGDPIEVQGLV
KAFSRNTQDKQFCSIGSVKSNIGHAEAAAGISGLTKTVLQLHHKTLVPSLHSEELNPYLKLDQTPFFVQHETKEWEQPSF
TENGVDVTYPRRAGLSSFGASGSNAHLILEEYIPAESHSETILTKNEEIVIPLSARNKDRLQAYALKLLDFLSEDVNLLA
LAYTMQAGRVEMEERAAFIVKDIKDLTAKLRAFANGEEEIEGCWTGRAKENQEAAGLASVNALNNNLIRDSEMMEMAKAW
VQGKRVTWDDLYGDRKPLKISVPTYPFARERYWISVPEMKTSTVNHILHPLVHRNTSDFTEQRFSSVFTGTEFFLSDHVV
QGQKILPGVAYLEMAREAAEKAAGDLDGEQRVVSLKDIVWVRPITIESEPKEIHIGLFPEDNGDISFDIYSSSEHKEEAL
TIHCQGRAVISDEAETSILNLSSIQTECSLDTVTSEQCYAAFRKIGLDYGEGYQGIEKVYVGKDQLLAKISLPAFLKNDK
QHFALHPSLMDSAFHATVGFIVSSVNAAGQAQTLSLPFALQEVDIFSPCPEKIWSYIRYSSDSKAENKVRKYDIDLCDEN
GRVCVRMKGASMRALDGEQHSKPQLLTDSQLTGHTVMIPVWEPVSLEAEDNASFAGKRAVLCGAAEADRTFIKHHYPQIS
FVDIRPADDIEAIADKLQAYGSIDHVLWIAPSHRGSIGSDGQEEAVLHLFKLVKACLQLGYGEKQLEWSLVTVQAQPVTQ
HEAVQPAHASIHGLAGTMAKEYPHWKIRLLDLEKGCTWPVNHMFALPADRLGHAWAYRNQQWHQQQLIPYRSSLSGDTLY
RKGGVYVVIGGAGYIGEAWSEYMIRRYQAQIVWIGRSQLNAAIQSKIDRLSALGPEPFYIAADAADKHSLQQAYEQVKKR
HPHIHGIVHSAMVLFEQSLEKMKPEEFTAGLAAKIDVSIRMAQVFRQENVDFVLFFSSLVAHIKNVKQSHYASGCTFADA
FAHQLSQSWACPVKVMNWGYWGNSEAAEDEHYVQLMNQIGLGLIEPAEAMKALEALLSGPVSQTAFIHTTKPVAVEGVNQ
NEFITLYPEQPSADAESLMERLPTTGRFQRVTHEELDDLLYRLLLGQLQTAELFDGYTLSVERLQQYKTREFYGKWIRQS
SEFLLQHGYLKKVGDSLVRKDQAEDIELLWLEWNAKKEKWLKDSETKAMVVLAEAMLQALPDILTGKVPATDIMFPHSSM
ELVEGIYKHNQVADYFNKVLADTLLAYLDERLKHDPEASIRIMEIGAGTGGTSAGIFEKLKPYQKHINEYCYTDLSKAFL
LHAEKEYGAENPYLTYQLFDVEKPIDQQEFEAGGYDVVIAANVLHATKNIRQTLRHTKAVMKNNGMLLLNEMAGNSLFPH
ITFGLLEGWWLYEDPAVRIPGCPGLHPDSWKAALESEGFESVFFPAEAAHDLSHQIVAASSNGLVRRMMKNVILPEKVVS
QASNQEPAYIHTIDSEEAGQSKHALLREKSTEYMKKLIGETLKIPAGKIESSEPLEKYGIDSIVVVQLTNTLRKEFDHVS
STLFFEYQTIDALVEHFIKTKTEALMKLTGLDRQVQQHTPAESRTQSSQKPDQAAKRTRRFRKLGFSGEKETPTNTLASR
DVAVIGISGRYPQAETAEDFWNNLKEGRNCIEEIPKDRWDWKAYYDKEKGKEGSIYTKWGGFIKDMDKFDPLFFQISPLE
AERMDPQERLFLQTAYASIEDAGYTPDSLCSSRKIGVFAGVMNKNYPTGYGYWSIANRISYLLNFQGPSLAVDTACSSSL
TAIHLALESIYSGSSDCAIAGGVNLVVDPVHYQNLSVMNMLSASDTCKSFGDDADGFVDGEGVGAIVLKPLQQAIADGDH
IYGVIKASAINSGGKTNGYTVPNPHAQAQVIKEAIERADIPARTISYLEAHGTGTALGDPIEIAGLTKAFEKDTQEKQFC
AIGSSKSNIGHCESAAGIAGLTKILFQFKYGQIAPSLHAQRLNPNIEFSHTPFVVQQQLGEWKRPVIGGQEVPRRAGLSS
FGAGGSNAHIILEEYIPRTGAQTPKDHPPALIVLSAKNMERLQEKAEQLLTAIKQKRYCETDLIRIAYTLQTGREAMEER
LAFIAESLEDLERKLNDFIENKADSLYLDRIDDNKKALAVLSADEDTEKIIEAWMSKGKYTKLLDLWVKGLSFDWGMLYG
TQTPVRISLPAYPFAKERYWAPGAAKAPVSIEQDHDQQTEEPFKVMTFQEVWKEEPATLTSKRIKTLICFLTEREKQNAF
ASALKNVDQDTKVIFISQGEVYSKQSEYSYQIVRQEPVTFEKAFQSIKEELGEPDAILYMWPMEDKRCIKDHSCIVYLLQ
GMSAAKLHPSRLLLAGCFEDSLDRSYLESWIGFERSLGLVLPHTKVTGIFQPAEQGSMDDWTRKVWAELQASTEQTVLYQ
NLKRYVNHIEQTTIQPDNSKLKSGGTYLITGGVGGLGYLFAKHLAKNYAANLILTGRSPFNDEKQKQIKELKDLGGEAMY
AEADVSDPIAMGDCVKRGKDRFGAINGVIHAAGIESDSAIFDKKIESFQRIIEPKINGTIALDEWLKNEDLDFMCYFSSS
SAVLGDFGCCDYAIGNRFQMAYAQYRNELHNGKTFVINWPVWKDGGMKIGDEETTDMYLKSSGQRFLEAEEGIRMFEHIL
AQQDAQHLVIAGQPSRVSRFLGMTEPAIPEPATQAPLAQENKDEVKTLSIEKRLEHDLKEHIHTLLKISKDKLNLNKNWA
DFGFDSIYLAKFSNVLSKHFNIEVTPALFFGYSTLQELISFFLTDHKELIEAFYRDDASEAQKPPEAYAVIPVALEPEAS
KKSIRQVHDEPIAIIGMSGRFPQADSVHELWDNLKNGKSCISDIPGERRDWGRANRDPEKAVPRWGAFLKDIDRFDPLFF
QISPKEAESMDPRQRIFLEEAWHTFEDAGYMGDRIKGKSCGVYVGVEEGEYAHLTGDTDYINGTQNATLSARIAYALDLK
GPNMALTAACSSGLVAIHQACSALRQGDCEMALAGGVSLNISHMSFEALTRAEMLSPNGQCKVFDQDANGLVPGEAVAAV
LLKPLSKAIEDKDHIYGCIKASGVNYDGKTNGITAPNPFSQAELIENIYEKNEINPLDIQYVMAHSTGSNLGDPLEVQAL
TSVFSKYTKQKQFCMISSIKPLIGHTFAASGTVALISMLMAMKNQIIPATHHCESENPYIPFKESPFVLCKENRSWIKKN
QKPRMGTISTTGISGTNAHAVIEEYIPDDQPSTQRHQGSPQIFVISAQNDDRLQDAACRMIAYLEQNHNLSLPDVAYTLQ
VGRKAMEARLAIVANNQEQLVRKLKEYVEAMKNGGVSGQQRSLYTGYTEGILEEQDEAVLQALAKERNLENIAECWVKGY
QIPWELLHDGDDVRMVSLPGYPFARERYWISSGTQQSEAVKQHSQDMKTEIDEPNGKTHIQKIIVQFLARELGISEDRIN
FKRNFLDYGMDSILGRKLMRHIEKTTQLKMAGREILECQTVQALSDHLALKAEKQNHSAAAHHIKGTYTDEQIIGLMQEV
ALGKLDFKSVQNIIEGSKSYES
>O31782 2.3.1.-~~~pksN~~~Polyketide synthase PksN~~~COG1020
MKRQLKSPLSEGQKGLWMLQKMSPGMSAYNIPLCFRFSKPIHAETFKKALLFVQRQYPVLASVIQEENGIPFQSVQLSKD
LYFVEEDISAMKSADIMPFLKEKAKEPFQLEAGPLWRTHLFHRLEECIVLITIHHIIFDGVSMLTLISALFEAYQQLLNG
IEPLQQPSTADYYDFVDWENRMLTGREGEEHLAYWKEQLSGSLPVLDLPADRPRSSARKFKGQAYKSLLPHHLRNQIKSF
ARTNHVNESVVFLSIYKVLLHHYTKQKDIIVGVPTMGRQEDRFETLIGYFINMMAVRSKNIGSQPLTAFIRELQLTVAVG
LDHAAFPFPALVRELNVDRSAADSPVFQTAFLYQNFFQATGLQKVLEPYQTLGIEYIEDIRQEGEFELALEIYEQENETV
LHLLYNPDLYELSSIESMMENYMKLAQHMMEDPSLPLEAYSLQLNQEQTSLLEQWNATGTNIANDKCIHEVFEEKAKQTP
DAVAVMFEDRSLTYKEVDEKSTSVAVYLQHQGVRPEQPVGICAERSFDMIIGILGILKAGGAYVPLDPSFPQERLKYMLK
DSQASIVLTQPNVHDRISGLTGSHVKAINIELACRNGYTDQQSSGLKREVKPEHLAYIIYTSGSTGEPKGVMVEHRSIMN
TLNFLESHYPVTAEDAYLLKTNYVFDVSISELFGWFIGDGRLVILPPNGEKSPQLCMDYIETYKVTHINFVPAMLHVFLE
MAKDNKRFTEDGPLKYMMVAGEAFPKVLVKKAVSLFTNCRVENIYGPTEASIYAAYFGCGKGDIASHHTPIGKPVSNTKI
YIVDQHLKPVPIGKPGELCIAGAGLARGYFKKPGLTAEKFIDNPFESGTKLYKSGDSARWLPDGNIEYLGRIDSQVKIRG
FRVELGAIETKLGEFPGILDQAVVVKQLEGHQQLAAYYTEESGHASANPKDLRLHLKSSLPEYMIPSHFIRLDELPLSPS
GKVNRKELEKREIVFNRRKPNHLQLTEIEDQVLRIWEETLKVSGFGPEDGFFDAGGDSLLAVAVAERIKKEFDCEFHVTE
LFEYSTIRAISEYILEMKNSDLAGTQNEDDHDDKKDGKYPKQKIPPYFDDSVAIVGISCQFPGAKNHHDFWNHIKEGKES
IRFFSEEELRANGVPEELIQHPDYVPVQSVIEGKDLFDPGFFQISPKDAEYMDPQLRLLLLHSWKAIEDAGYVAKEIPAT
SVYMSASSNSYRTLLPKETTEGHESPDGYVSWVLAQSGTIPTMISHKLGLKGPSYFVHSNCSSSLVGLYQAYKSLTSGES
QYALVGGATLHAQSAIGYVHQNGLNFSSDGHVKAFDASADGMAGGEGVAVILLKKAVDAVKDGDHIYAIMRGIGINNDGA
EKAGFYAPSVKGQTEVIQHVLDTTKIHPETVSYIEAHGTGTKLGDPIEMSALNKVYKQYTDKTQFCGIGSVKTNIGHLDT
AAGLAGCIKVAMSLYHNELAPTINCTEPNPDIKFESSPFYVVRERKSLEKHAGVHRAALSSFGLGGTNAHAIFEQYENIS
DAGAENEGNQPYIIPISAKNSERLQVYAKEMLSYISQDEQRHFSLRDIAYTFQVGREAMDNRIVFIVNDLEEWKHQLEAF
VTGKPLAEGCIQGEKTRMTSAEQLLGNAEADDMASSRISKEELRKLAEMWANGFHVEWRRLYPNIKPRRISLPTYPFAEE
RYWPESSTGAITTIEPSRLHPLVHHNTSVLSEQRFSSIFTGQEYFIAEHIIKGMAILPAAVTLEMARAAIEQGIGGLEDH
ETGIRLKNVVWVRPVVAGSEPVQVNIGLYDEDGGHIAYRMYGDPESADAEPVVYNQGKAELIQLKREKALDLSKIKKQCD
QSKMDAASFYEGMIGADYGPGYKSVEAVYKGDGQLLAKLSLPESVAHTLGDYVLHPSVMDGALQAAEYLQNVVRAELSDT
EDFKAALPFALEELEVFRQCVSDMWVYVQFNSKNKPGDLIQKVDIHLCDEHGMICVRLKGFSTRVMEADIQTEPSKINAE
TLLLQPVWQEQKAANSLAAKKYAEHLVFLCEYDHETRKQIEAAIEDVHVYSLEARPSSVDGRFHSYTEQVFKKVQEIIRT
KPKDGILVQIVTSAEGEQQLFSGLTGLLKTACQENAKLTGQMIEVSSEESGESIAGKLLENQMSSDSYVKYQNGTRYIAD
WREIKQAKGDGSKPWKDNGVYLISGGAGGLGHIFAKEIAEQTKNATVILAGRSPLSESKSKKLKELHSKGADITYRQTDV
TNKIEVYQLIDDIQKRYGRLNGILHSAGIIKDSYLVNKQAKDLHDVLAPKVKGLVYLDEASKDLPLDFFILFSSLSGSLG
SIGQSDYAAANVFMDMYAGYRNRLADLSQRHGQTLSVNWPLWRDGGMQVDQETEKRLVQLAGIVPMRAEKGIQALYQALH
SEANQVMVIEGDVQKIKQNMLAKNASAPMEKKEAEHMTEQINSIDADSLLDKVKAMLKREIAKLLKVKLETIDDHAEMTV
YGFDSISMTEFTNHINRAYQLELTPTVFFDHPTIHAFGKHLSEEYQSVFAKTFAVRAVSAQLQPAAKQEQAVRAKAKRRR
KQQVMLPNAIQSDAGPEPIAIVGISGIFPMAKDVEAYWNILKEGKDCMTEIPKDRWDWREYEGDPAKEVNKTNVKWGGFI
DGIADFDPLFFGISPREAEQMEPQQRLLLTYAWKAIEDAGYSAKRLSGTKTGVFIGTGNTGYSSLLSKANSAIEGSAAAN
TSPSVGPNRVSYFLNLHGPSEPVDTACSSSLVAIHHAISSIEEGTCDMALAGGVNTIILPEVYISFDKAGALSKEGKCKT
FSNQADGFAHGEGAGILFLKKLKAAEEAGDHIYGVIKGSAINHGGRAASLTTPNPKAQADVIQSAYQKAGIDPKTVTYIE
AHGTGTELGDPVEINGLKSAFKALGVNEGDTSANPYCGLGSVKTNIGHLSLAAGAAGVIKILLQLKHKTLVKSLHCENVN
PYIQLKNSPFYIVRETEEWKALKNEQGEELPRRAGVSSFGIGGVNAHVIIEEYIPEASDENIPSIAPEHPGIFVLSAKNE
ARLKEHAQQLADALDKQTYSDVNLARIAYTLQAGRDAMEERLGIISGSIEDLQKKLKDFAAEKSGVEDVFKGRIDKGTLQ
MLTEDEEIQEAVEKWMERGKYAKLLELWVKGLDVDWTKLYGENLPKRISLPTYPFAKDRYWISDHIEKSGSIDANQAASR
LGGAVLHPLMHQNTSNLSEQRFSSIYTGEEFFLADHVVKGQRILPGVAHLELARAAVEQAAEVQGVPRIMKLKNAVWVRP
IVVEDQPQQVHIRLLPGENGEISYEIYGHSDVTGEQSIVYSQGSAVLNPAENLPAVDLQSLREQCQESHFSVNEVYDTYR
MIGFEYGPAYRGVKKIYTAEQFVLAKLSLHPSAADTLSQYKMHPGLMDSALQASSILTGAGDNQLTLPFAVQELEVFGAC
SSEMWVYARYSQGSKATDKVQKRDMDILDESGNVCVRMKGLSFRAAEGGSGSAESDQTLATLMFEEKWVPKDFKKESPEP
HYERHIVMLCDMNGLSKDRIESRMTGAECIVLESFREGLAERFQDYAEQALETVQGLLKSRPQGNVLIQLLTSAQRKQYS
FSGLSALLKTAGLENKKLIGQTIEIDSHENVESVIEKLKENKRHTEDQHIKYEKGKRYINDLDEMQIDDREISMPWRDKG
VYLITGGAGGLGFIFAKEIARQAEQPVLILTGRSALNADQQAELNELQQLGARAEYRQVDVTQTEAASELITSITSDYED
LNGVIHSAGLIKDNYLMSKTNEELTQVLAPKVKGLVNVDEATEHLALDFFILFSSISSVAGSAGQADYAMANAFMDSYAA
YRNALVTAMYRHGQTLSINWPLWKEGGMRANKEIENMTLKNTGVTPMRTETGIQALYKGLAFGKDQVIVMEGFKDMMREK
LTQKPSSDDVPMKTVQVRVTSEARMDQGNMFDHIQEVLKQTISQLLKIKPEEIDPDMEFNQYGFDSITLTEFANTLNEKC
KLDLTPTVFFEHATVYAFAGYLSEEYPNAFTAQTPAKAEVLMQPVEQNIKNMTFSTENRFVKPSVTPMQKEADHKPEPIA
IVGMSGVFPKAKDVEEYWKNLSSGADCITEVPKDRWDWQEYYGDPLKEANKTNVKWGGFIDEVADFDPLFFGISPLEAEQ
MEPQQRLLMTYAWKAVEEAGHSARSLAGTKTGIFIGTGNTGYSSLLSNVDIEGSAAANMSPSAGPNRVSYFLNIHGPSEP
IDTACSSSLVAIHHAVCAIENGNCEMAIAGGVNTVVTPQGHIAYDKAGALSKEGRCKTFSDKADGFAVSEGAGILFLKKL
TAAERDGDHIYGVIKGSAVNHGGRANSLTTPNPKAQADVVKTAYEKAGIDPRTVTYIEAHGTGTELGDPVEINGLKAAFK
ELYEKTGDPAVHGSHCGLGSAKTNIGHLSLAAGVAGVIKVLLQLKHKTLVKSLYSETVNPYIRLDDSPFYIVQESREWQA
LRDEAGRELPRRAGISSFGIGGVNAHVVIEEYIPKETTHPATAPAVTAQHPGIFILSAKDEDRLKDQARQLADFISKRSI
TARDLTDIAYTLQEGRDAMEERLGIIAVSTGDLLEKLNLFIEGGTNAKYMYRGRAEKGIAQTLRSDDEVQKTLNNSWEPH
IYERLLDLWVKGMEIGWSKLYDGKQPKRISLPTYPFAKERYWITDTKEEAAAHQTALKTVESAALHPLIHVNTSDLSEQR
FSSAFTGAEFFFADHKVKGKPVMPGVAYLEMVHAAVTRAVRRTEDQQSVIHIKNVVWVQPIVADGQPVQVDISLNPQQDG
EIAFNVYTEAAHNDRKIHCQGSASIRGAGDIPVQDISALQDQCSLSTLSHDQCYELFKAIGIDYGPGFQGIDRLYIGRNQ
ALAELSLPAGVTHTLNEFVLHPSMADSALQASIGLKLNSGDEQLSLPFALQELEIFSPCTNKMWVSVTSRPNEDKIQRLD
IDLCDEQGRVCVRIKGITSRLLEEGIQPPDGPTSLGNSKATLNGALLMAPIWDRVQLEKRSISPADERVVILGGDDNSRK
AVQREFPFAKELYIEPNASIHRITGQLEALGSFDHIVWMSPSRVTECEVGDEMIEAQDQGVIQMYRLIKAMLSLGYGQKE
ISWTIVTVNTQYVDQHDIVDPVDAGVHGLIGSMSKEYPNWQTKLIDVKKYEDLPLSQLLSLPADQEGNTWAYRNKIWHKL
RLIPVHNNQPVHTKYKHGGVYVVIGGAGGIGEAWSEYMIRTYQAQIVWIGRRKKDAAIQSKLDRFARLGRAPYYIQADAA
NREELERAYETMKQTHREINGIIHSAIVLQDRSLMNMSEECFRNVLAAKVDVSVRMAQVFRHEPLDFVLFFSSVQSFARA
SGQSNYAAGCSFKDAFAQRLSQVWPCTVAVMNWSYWGSIGVVSSPDYQKRMAQAGIGSIEAPEAMEALELLLGGPLKQLV
MMKMANETNDEAEQTEETIEVYPETHGSAIQKLRSYHPGDNTKIQQLL
>O31784 2.3.1.-~~~pksR~~~Polyketide synthase PksR~~~COG0236
MLNTEDILCKMLFAQLQSIGFFTESKSQPVLENFYGRWFEESQSILERHQFLKRTENGHVPTRSIGTMSELWKEWNEQKF
DLLQDNNMKAMVTLVETALKALPEILTGKASATDILFPNSSMDLVEGVYKNNQVADYFNDVLADTLTAYLQERLKQEPEA
KIRILEIGAGTGGTSAAVFQKLKAWQTHIKEYCYTDLSKAFLMHAENKYGPDNPYLTYKRFNVEEPASEQHIDAGGYDAV
IAANVLHATKNIRQTLRNAKAVLKKNGLLLLNEISNHNIYSHLTFGLLEGWWLYEDPDLRIPGCPGLYPDTWKMVLESEG
FRYVSFMAEQSHQLGQQIIAAESNGVVRQKKRTEAEEDPSHIQMNAEIDHSQESDSLIEQTAQFVKHTLAKSIKLSPERI
HEDTTFEKYGIDSILQVNFIRELEKVTGELPKTILFEHNNTKELVEYLVKGHENKLRTALLKEKTKPAKNEAPLQTERTD
PNKPFTFHTRRFVTEQEVTETQLANTEPLKIEKTSNLQGTHFNDSSTEDIAIIGVSGRYPMSNSLEELWGHLIAGDNCIT
EAPESRWRTSLLKTLSKDPKKPANKKRYGGFLQDIEAFDHQLFEVEQNRVMEMTPELRLCLETVWETFEDGGYTRTRLDK
LRDDDGVGVFIGNMYNQYFWNIPSLEQAVLSSNGGDWHIANRVSHFFNLTGPSIAVSSACSSSLNAIHLACESLKLKNCS
MAIAGGVNLTLDLSKYDSLERANLLGSGNQSKSFGTGNGLIPGEGVGAVLLKPLSKAMEDQDHIYAVIKSSFANHSGGRQ
MYTAPDPKQQAKLIVKSIQQSGIDPETIGYIESAANGSALGDPIEVIALTNAFQQYTNKKQFCAIGSVKSNLGHLEAASG
ISQLTKVLLQMKKGTLVPTINAMPVNPNIKLEHTAFYLQEQTEPWHRLNDPETGKQLPRRSMINSFGAGGAYANLIIEEY
METAPEKEHIAPRQQEFTAVFSAKTKWSLLSYLENMQLFLEKEASLDIEPVVQALHRRNHNLEHRTAFTVASTQELIEKL
KVFRTSRESSLQQGIYTSFDLQPCAESASRDREINAAEQWAQGALIAFKEADIGNRTGWVHLPHYAFDHNTSFHFDVSSI
NEKSSDVEDNINQPVIQDQFTYDEPYVQGHVFNNERVLVGATYGSLAIEAFFNLFPEENSGRISKLSYISPIVIKQGETI
ELQAKPLQKDQVIELQIMYREPSSGLWKPAAIGQCGIGSFEPKKVNIENVKHSLTKLHHIDQMYKTGNGPEWGELFKTIT
HLYRDHKSILAKIRLPQSGLANGHHYTVSPLMTNSAYLAILSFLEQFDMTGGFLPFGINDIQFTKQTIKGDCWLLITLVK
NTGDMLLFDVDVINESSETVLHYSGYSLKQLRISNQRGNQNKAIKASNLKARIRSYVTDKLAVNMADPSKLSIAKAHIMD
FGIDSSQLVALTREMEAETKIELNPTLFFEYPTIQELIDFFADKHEASFAQLFGEAHQQEERPAQIENQMKQIPAYETNT
DKTIEHAADGIAIIGMSGQFPKANSVTEFWDNLVQGKNCVSEVPKERWDWRKYAAADKEGQSSLQWGGFIEGIGEFDPLF
FGISPKEAANMDPQEFLLLIHAWKAMEDAGLTGQVLSSRPTGVFVAAGNTDTAVVPSLIPNRISYALDVKGPSEYYEAAC
SSALVALHRAIQSIRNGECEQAIVGAVNLLLSPKGFIGFDSMGYLSEKGQAKSFQADANGFVRSEGAGVLIIKPLQKAIE
DSDHIYSVIKGSGVSHGGRGMSLHAPNPAGMKDAMLKAYQGAQIDPKTVTYIEAHGIASPLADAIEIEALKSGCSQLELE
LPQEVREEAPCYISSLKPSIGHGELVSGMAALMKVSMAMKHQTIPGISGFSSLNDQVSLKGTRFRVTAENQQWRDLSDDA
GKKIPRRASINSYSFGGVNAHVILEEYIPLPKPPVSMSENGAHIVVLSAKNQDRLKAIAQQQLDYVNKQQELSLQDYAYT
LQTGREEMEDRLALVVRSKEELVIGLQACLAEKGDKLKSSVPVFSGNAENGSSDLEALLDGPLREMVIETLLSENNLEKI
AFCWTKGVQIPWEKLYQGKGARRIPLPTYPFEKRSCWNGFQAVENTPSVSQDERINNSSDHHILANVLGMAPDELQFHKP
LQQYGFDSISCIQLLQQLQSKVDPLIVLTELQACHTVQDMMDLIAKKQEDTSLQNDQARTFPELIPLNDGKRGRPVFWFH
GGVGGVEIYQQFAQKSQRPFYGIQARGFMTDSAPLHGIEQMASYYIEIIRSIQPEGPYDVGGYSLGGMIAYEVTRQLQSQ
GLAVKSMVMIDSPYRSETKENEASMKTSMLQTINTMLASIAKREKFTDVLISREEVDISLEDEEFLSELIDLAKERGLNK
PDKQIRAQAQQMMKTQRAYDLESYTVKPLPDPETVKCYYFRNKSRSFFGDLDTYFTLSNEKEPFDQAAYWEEWERQIPHF
HLVDVDSSNHFMILTEPKASTALLEFCEKLYSNRGVVNANFLKAFRKKHEAREEKETDELVKR
>O31785 1.14.-.-~~~pksS~~~Polyketide biosynthesis cytochrome P450 PksS~~~COG2124
MEKLMFHPHGKEFHHNPFSVLGRFREEEPIHRFELKRFGATYPAWLITRYDDCMAFLKDNRITRDVKNVMNQEQIKMLNV
SEDIDFVSDHMLAKDTPDHTRLRSLVHQAFTPRTIENLRGSIEQIAEQLLDEMEKENKADIMKSFASPLPFIVISELMGI
PKEDRSQFQIWTNAMVDTSEGNRELTNQALREFKDYIAKLIHDRRIKPKDDLISKLVHAEENGSKLSEKELYSMLFLLVV
AGLETTVNLLGSGTLALLQHKKECEKLKQQPEMIATAVEELLRYTSPVVMMANRWAIEDFTYKGHSIKRGDMIFIGIGSA
NRDPNFFENPEILNINRSPNRHISFGFGIHFCLGAPLARLEGHIAFKALLKRFPDIELAVAPDDIQWRKNVFLRGLESLP
VSLSK
>P49695 2.7.11.1~~~pkwA~~~Probable serine/threonine-protein kinase PkwA~~~
MIEPLQPGDPGRIGPYRLVSRLGAGGMGQVFLARSPGGRPVVVKVILPEYANDDEYRIRFAREVEAARRVGGFHTAQVID
ADPTADPPWMATAYIPGPSLRKAVTERGPLYGNNLRTLAAGLVEGLAAIHACGLVHRDFKPSNIVLAADGPRVIDFGVAR
PLDSSVMTQSGAVIGTLAYMSPEQTDGSQVGPASDVFSLGTVLAFAATGRSPFMADSIGEIIARISGPPPELPELPDDLR
ELVYACWEQNPDLRPTTAELLAQLSTDHTGDDWPPPHLSDLIGSMLPLGATTSPNPSLAIEPPPPSHGPPRPSEPLPDPG
DDADEPSAEKPSRTLPEPEPPELEEKPIQVIHEPERPAPTPPRPREPARGAIKPKNPRPAAPQPPWSPPRVQPPRWKQLI
TKKPVAGILTAVATAGLVVSFLVWQWTLPETPLRPDSSTAPSESADPHELNEPRILTTDREAVAVAFSPGGSLLAGGSGD
KLIHVWDVASGDELHTLEGHTDWVRAVAFSPDGALLASGSDDATVRLWDVAAAEERAVFEGHTHYVLDIAFSPDGSMVAS
GSRDGTARLWNVATGTEHAVLKGHTDYVYAVAFSPDGSMVASGSRDGTIRLWDVATGKERDVLQAPAENVVSLAFSPDGS
MLVHGSDSTVHLWDVASGEALHTFEGHTDWVRAVAFSPDGALLASGSDDRTIRLWDVAAQEEHTTLEGHTEPVHSVAFHP
EGTTLASASEDGTIRIWPIATE
>P0AA47 ~~~plaP~~~Low-affinity putrescine importer PlaP~~~COG0531
MSHNVTPNTSRVELRKTLTLVPVVMMGLAYMQPMTLFDTFGIVSGLTDGHVPTAYAFALIAILFTALSYGKLVRRYPSAG
SAYTYAQKSISPTVGFMVGWSSLLDYLFAPMINILLAKIYFEALVPSIPSWMFVVALVAFMTAFNLRSLKSVANFNTVIV
VLQVVLIAVILGMVVYGVFEGEGAGTLASTRPFWSGDAHVIPMITGATILCFSFTGFDGISNLSEETKDAERVIPRAIFL
TALIGGMIFIFATYFLQLYFPDISRFKDPDASQPEIMLYVAGKAFQVGALIFSTITVLASGMAAHAGVARLMYVMGRDGV
FPKSFFGYVHPKWRTPAMNIILVGAIALLAINFDLVMATALINFGALVAFTFVNLSVISQFWIREKRNKTLKDHFQYLFL
PMCGALTVGALWVNLEESSMVLGLIWAAIGLIYLACVTKSFRNPVPQYEDVA
>P0C178 ~~~petE~~~Plastocyanin~~~
ETYTVKLGSDKGLLVFEPAKLTIKPGDTVEFLNNKVPPHNVVFDAALNPAKSADLAKSLSHKQLLMSPGQSTSTTFPADA
PAGEYTFYCEPHRGAGMVGKITVAG
>P46444 ~~~petE~~~Plastocyanin~~~COG3794
MKLIAASLRRLSLAVLTVLLVVSSFAVFTPSASAETYTVKLGSDKGLLVFEPAKLTIKPGDTVEFLNNKVPPHNVVFDAA
LNPAKSADLAKSLSHKQLLMSPGQSTSTTFPADAPAGEYTFYCEPHRGAGMVGKITVAG
>O52830 ~~~petE~~~Plastocyanin~~~
MKLIAASLRRLSLAVLTVLLVVSSFAVFTPSASAETYTVKLGSDKGLLVFEPAKLTIKPGDTVEFLNNKVPPHNVVFDAA
LNPAKSADLAKSLSHKQLLMSPGQSTSTTFPADAPAGEYTFYCEPHRGAGMVGKITVAG
>Q51883 ~~~petE~~~Plastocyanin~~~
MKLIAQISRSLSLALFALVLMVGSFVAVMSPAAAETFTVKMGADSGLLQFEPANVTVHPGDTVKWVNNKLPPHNILFDDK
QVPGASKELADKLSHSQLMFSPGESYEITFSSDFPAGTYTYYCAPHRGAGMVGKITVEG
>P50057 ~~~petE~~~Plastocyanin~~~
MKFFASLSKRFAPVLSLVVLVAGTLLLSAAPASAATVQIKMGTDKYAPLYEPKALSISAGDTVEFVMNKVGPHNVIFDKV
PAGESAPALSNTKLAIAPGSFYSVTLGTPGTYSFYCTPHRGAGMVGTITVE
>P55020 ~~~petE~~~Plastocyanin~~~COG3794
MKVLASFARRLSLFAVAAVLCVGSFFLSAAPASAQTVAIKMGADNGMLAFEPSTIEIQAGDTVQWVNNKLAPHNVVVEGQ
PELSHKDLAFSPGETFEATFSEPGTYTYYCEPHRGAGMVGKIVVQ
>P21697 ~~~petE~~~Plastocyanin~~~COG3794
MSKKFLTILAGLLLVVSSFFLSVSPAAAANATVKMGSDSGALVFEPSTVTIKAGEEVKWVNNKLSPHNIVFAADGVDADT
AAKLSHKGLAFAAGESFTSTFTEPGTYTYYCEPHRGAGMVGKVVVE
>Q3M9H8 ~~~petE~~~Plastocyanin~~~COG3794
MKLIAASLRRLSLAVLTVLLVVSSFAVFTPSAAAETYTVKLGSDKGLLVFEPAKLTIKPGDTVEFLNNKVPPHNVVFDAT
LNPAKSADLAKSLSHKQLLMSPGQSTSTTFPADAPAGDYSFYCEPHRGAGMVGKITVAS
>O34705 3.1.1.-~~~ytpA~~~Phospholipase YtpA~~~COG2267
MWTWKADRPVAVIVIIHGASEYHGRYKWLIEMWRSSGYHVVMGDLPGQGTTTRARGHIRSFQEYIDEVDAWIDKARTFDL
PVFLLGHSMGGLVAIEWVKQQRNPRITGIILSSPCLGLQIKVNKALDLASKGLNVIAPSLKVDSGLSIDMATRNEDVIEA
DQNDSLYVRKVSVRWYRELLKTIESAMVPTEAFLKVPLLVMQAGDDKLVDKTMVIKWFNGVASHNKAYREWEGLYHEIFN
EPEREDVFKAARAFTDQYI
>Q47499 3.1.-.-~~~plcA~~~Extracellular phospholipase C~~~
MEKVLIWFCGTGTTKQDFLANVEISGFSAIVAIDGIGTAAMLTKTQALAKRANWGGSFVDMSETLGVLYDQVNGYDDRAG
VVTLDSLFPLVDYLKTLKEYQLVVGGHSRGAAVGLTEFLAELYHLAVQNQAPGVWANAKTIRLVVVDPVQGQQDADKDTN
AFNAILKDKTLAQILAELETKWFGGREVFDTLVYSARYDARSSFAFDSRWYRFITEQMGKQAGPAKRAKLVMAGFRHSAP
VSKEDEISALYQGKGVAPIAFLQQLVSFDPNWEQSARLLSQIENGYLDQLAAGAKTDLISQLDKQTSLLSTALPALSAAN
RCKKRCRRITRKNRNTAGSTAISTGRRRFNASYQVTSD
>P14262 4.6.1.13~~~~~~1-phosphatidylinositol phosphodiesterase~~~COG0823
MSNKKLILKLFICSTIFITFVFALHDKRVVAASSVNELENWSKWMQPIPDSIPLARISIPGTHDSGTFKLQNPIKQVWGM
TQEYDFRYQMDHGARIFDIRGRLTDDNTIVLHHGPLYLYVTLHEFINEAKQFLKDNPSETIIMSLKKEYEDMKGAEDSFS
STFEKKYFVDPIFLKTEGNIKLGDARGKIVLLKRYSGSNEPGGYNNFYWPDNETFTTTVNQNANVTVQDKYKVSYDEKVK
SIKDTMDETMNNSEDLNHLYINFTSLSSGGTAWNSPYYYASYINPEIANYIKQKNPARVGWVIQDYINEKWSPLLYQEVI
RANKSLIKE
>P08954 4.6.1.13~~~~~~1-phosphatidylinositol phosphodiesterase~~~
MSNKKLILKLFICSTIFITFVFALHDKRVVAASSVNELENWSKWMQPIPDNIPLARISIPGTHDSGTFKLQNPIKQVWGM
TQEYDFRYQMDHGARIFDIRGRLTDDNTIVLHHGPLYLYVTLHEFINEAKQFLKDNPSETIIMSLKKEYEDMKGAEGSFS
STFEKNYFVDPIFLKTEGNIKLGDARGKIVLLKRYSGSNESGGYNNFYWPDNETFTTTVNQNVNVTVQDKYKVNYDEKVK
SIKDTMDETMNNSEDLNHLYINFTSLSSGGTAWNSPYYYASYINPEIANDIKQKNPTRVGWVIQDYINEKWSPLLYQEVI
RANKSLIKE
>P34024 4.6.1.13~~~plcA~~~1-phosphatidylinositol phosphodiesterase~~~COG0823
MYKNYLQRTLVLLLCFILYFFTFPLGGKAYSLNNWNKPIKNSVTTKQWMSALPDTTNLAALSIPGTHDTMSYNGDITWTL
TKPLAQTQTMSLYQQLEAGIRYIDIRAKDNLNIYHGPIFLNASLSGVLETITQFLKKNPKETIIMRLKDEQNSNDSFDYR
IQPLINIYKDYFYTTPRTDTSNKIPTLKDVRGKILLLSENHTKKPLVINSRKFGMQFGAPNQVIQDDYNGPSVKTKFKEI
VQTAYQASKADNKLFLNHISATSLTFTPRQYAAALNNKVEQFVLNLTSEKVRGLGILIMDFPEKQTIKNIIKNNKFN
>Q2G1Q2 4.6.1.13~~~plc~~~1-phosphatidylinositol phosphodiesterase~~~COG0823
MKKCIKTLFLSIILVVMSGWYHSAHASDSLSKSPENWMSKLDDGKHLTEINIPGSHDSGSFTLKDPVKSVWAKTQDKDYL
TQMKSGVRFFDIRGRASADNMISVHHGMVYLHHELGKFLDDAKYYLSAYPNETIVMSMKKDYDSDSKVTKTFEEIFREYY
YNNPQYQNLFYTGSNANPTLKETKGKIVLFNRMGGTYIKSGYGADTSGIQWADNATFETKINNGSLNLKVQDEYKDYYDK
KVEAVKNLLAKAKTDSNKDNVYVNFLSVASGGSAFNSTYNYASHINPEIAKTIKANGKARTGWLIVDYAGYTWPGYDDIV
SEIIDSNK
>P45723 4.6.1.13~~~plc~~~1-phosphatidylinositol phosphodiesterase~~~
MSGWYHSAHASDSLSKSPENWMSKLDDGKHLTEINIPGSHDSGSFTLKDPVKSVWAKTQDKDYLTQMKSGVRFFDIRGRA
SADNMISVHHGMVYLHHELGKFLDDAKYYLSAYPNETIVMSMKKDYDSDSKVTKTFEEIFREYYYNNPQYQNLFYTGSNA
NPTLKETKGKIVLFNRMGGTYIKSGYGADTSGIQWADNATFETKINNGSLNLKVQDEYKDYYDKKVEAVKNLLAKAKTDS
NKDNVYVNFLSVASGGSAFNSTYNYASHINPEIAKTIKANGKARTGWLIVDYAGYTWPGYDDIVSEIIDSNK
>P07000 3.1.1.5~~~pldB~~~Lysophospholipase L2~~~COG2267
MFQQQKDWETRENAFAAFTMGPLTDFWRQRDEAEFTGVDDIPVRFVRFRAQHHDRVVVICPGRIESYVKYAELAYDLFHL
GFDVLIIDHRGQGRSGRLLADPHLGHVNRFNDYVDDLAAFWQQEVQPGPWRKRYILAHSMGGAISTLFLQRHPGVCDAIA
LTAPMFGIVIRMPSFMARQILNWAEAHPRFRDGYAIGTGRWRALPFAINVLTHSRQRYRRNLRFYADDPTIRVGGPTYHW
VRESILAGEQVLAGAGDDATPTLLLQAEEERVVDNRMHDRFCELRTAAGHPVEGGRPLVIKGAYHEILFEKDAMRSVALH
AIVDFFNRHNSPSGNRSTEV
>Q988B7 1.1.1.107~~~pldh-t~~~Pyridoxal 4-dehydrogenase~~~COG1028
MTERLAGKTALVTGAAQGIGKAIAARLAADGATVIVSDINAEGAKAAAASIGKKARAIAADISDPGSVKALFAEIQALTG
GIDILVNNASIVPFVAWDDVDLDHWRKIIDVNLTGTFIVTRAGTDQMRAAGKAGRVISIASNTFFAGTPNMAAYVAAKGG
VIGFTRALATELGKYNITANAVTPGLIESDGVKASPHNEAFGFVEMLQAMKGKGQPEHIADVVSFLASDDARWITGQTLN
VDAGMVRH
>P20626 3.1.4.-~~~pld~~~Phospholipase D~~~
MREKVVLFLSIIMAIMLPVGNAAAAPVVHNPASTANRPVYAIAHRVLTTQGVDDAVAIGANALEIDFTAWGRGWWADHDG
IPTSAGATAEEIFKHIADKRKQGANITFTWLDIKNPDYCRDARSVCSINALRDLARKYLEPAGVRVLYGFYKTVGGPAWK
TITADLRDGEAVALSGPAQDVLNDFARSENKILTKQKIADYGYYNINQGFGNCYGTWNRTCDQLRKSSEARDQGKLGKTF
GWTIATGQDARVNDLLGKANVDGLIFGFKITHFYRHADTENSFKAIKRWVDKHSATHHLATVADNPW
>Q76KC2 1.1.1.107~~~pld1~~~Pyridoxal 4-dehydrogenase~~~
MHLKASEKRALGRTGLTVTALGLGTAPLGGLYAPVSRADADALLEAGWDSGIRYFDSAPMYGYGRCEHLLGDMLREKPER
AVISTKVGRLMTNERAGRTLPPAPPKNPLDSGWHNGLNFREVFDYSYDGVMRSFDDSQQRLGFPEIDLLYVHDIGRVTHA
DRHEFHWNALTRGGGFRALTELRAAGNIKGFGLGVNEWQIIRDALEEADLDCSLLAGRYSLLDQVSEKEFLPLAQKRGMA
LVIAGVFNSGILAAPRGGEQKFDYADAPAEIIARTNRLHDICDEYHVPLAAAAMQFPLRHEAVSSILIGVRSPEQIRQNV
VWFEQSIPDEFWTTLRSEGLIS
>Q92G53 3.1.4.4~~~pld~~~Phospholipase D~~~
MKRKNNKFIEISIAFILGIALGLYGQNPDYFTNLISQKSLALSALQIKHYNISELSRSKVSTCFTPPAGCTKFIANQIDK
AEESIYMQAYGMSDALITTALINAQARGVKVRILLDRSNLKQKFSKLHELQRAKIDVDIDKVPGIAHNKVIIIDKKKVIT
GSFNFTAAADKRNAENVIIIEDQELAESYLQNWLNRKASN
>Q53728 3.1.4.4~~~~~~Phospholipase D~~~
MTSDQRPARLPTHKGKLLAPHRLHRLIPVSVALTTVCAALPSSTAYAADTPPTPHLDAIERSLRDTSPGLEGSVWQRTDG
NRLDAPDGDPAGWLLQTPGCWGDAGCKDRAGTRRLLDKMTRNIADARHTVDISSLAPFPNGGFEDAVVDGLKAVVAAGHS
PRVRILVGAAPIYHLNVVPSRYRDELIGKLGAAAGKVTLNVASMTTSKTSLSWNHSKLLVVDGKTAITGGINGWKDDYLD
TAHPVSDVDMALSGPAAASAGKYLDTLWDWTCRNASDPAKVWLATSNGASCMPSMEQDEAGSAPAEPTGDVPVIAVGGLG
VGIKESDPSSGYHPDLPTAPDTKCTVGLHDNTNADRDYDTVNPEENALRSLIASARSHVEISQQDLNATCPPLPRYDIRT
YDTLAGKLAAGVKVRIVVSDPANRGAVGSGGYSQIKSLDEISDTLRTRLVALTGDNEKASRALCGNLQLASFRSSDAAKW
ADGKPYALHHKLVSVDDSAFYIGSKNLYPAWLQDFGYIVESPAAAQQLKTELLDPEWKYSQQAAATPAGCPARQAG
>Q8KRU5 3.1.4.4~~~pld~~~Phospholipase D~~~
MTSRYRSSEAHQGLASFSPRRRTVVKAAAATAVLAGPLAAALPARATTGTPAFLHGVASGDPLPDGVLLWTRVTPTADAT
PGSGLGPDTEVGWTVATDKAFTNVVAKGSTTATAASDHTVKADIRGLAPATDHWFRFSAGGTDSPAGRARTAPAADAAVA
GLRFGVVSCANWEAGYFAAYRHLAARGDLDAWLHLGDYIYEYGAGEYGTRGTSVRSHAPAHEILTLADYRVRHGRYKTDP
DLQALHAAAPVVAIWDDHEIANDTWSGGAENHTEGVEGAWAARQAAAKQAYFEWMPVRPAIAGTTYRRLRFGKLADLSLL
DLRSFRAQQVSLGDGDVDDPDRTLTGRAQLDWLKAGLKSSDTTWRLVGNSVMIAPFAIGSLSAELLKPLAKLLGLPQEGL
AVNTDQWDGYTDDRRELLAHLRSNAIRNTVFLTGDIHMAWANDVPVNAGTYPLSASAATEFVVTSVTSDNLDDLVKVPEG
TVSALASPVIRAANRHVHWVDTDRHGYGVLDITAERAQMDYYVLSDRTQAGATASWSRSYRTRSGTQRVERTYDPE
>P37894 2.7.13.3~~~pleC~~~Non-motile and phage-resistance protein~~~COG2202
MGRHGGPAAAGPTAPSAVRAKAVNAPSQVFVRIAILAALLLLAVYTAFGVHRLQREAMAQPGGAPLAAKADLIAGRVDAN
LAAQRAGLSAAADLLKRDPGATMDAAETTLRAAGGEAAAVAVVSEAGVVAVAGRDDGADWKAAALAAGASGRTNWVGSVG
ETGRLYVATTTSLDRARAFVIASGDASRLVADPEKGESGALALPDGKLIAARGRGVQGAGALREAFALSIEDLGDGPAAV
RGQAADGALLDVAVRPVAQGALLAVAAAPTRSVANLDRQVMEGAFSLLVPLGVGIALALLLMIQSRKAEVAHREFIDSER
RFRLAVEAARCGIWEWDLNGDQVYLSDVTGAMFGWGGGGVVSGQDLLERISIDHRERVRQALANAAMYGAFDVSFRVPAS
EQGARSLWIDARGQGFGKPGSEGHARIIGVALDVTEERIAQARAQAAENRLRDAIESVSEAFVLWDRQGRLLMCNRNYRS
VFSLEPKILKPGAARAEVNRFAALAIKQDHPAPDGAKGVREAEMMDGRWIQISERRTAEGGLVMTAADITAIKTQEEARR
RNEEQLQNAVAGLERSQEQLAELARKYETEKVKAESANKAKSEFLANMSHELRTPLNAINGFSEIMMNEMFGPLGDQRYK
GYSQDIHSSGQHLLALINDILDMSKIEAGKMNLKFESMHLEDVAEDAVRLVRNRAEAAGLKLDIDFPQLPEIEADYRAVK
QVLLNLLSNAIKFTPRAGSVTVRAEVRRDPFGDLIKVSVTDTGIGIAKEDLARLAKPFEQVESQFSKTTQGTGLGLALTK
SLITMHDGVLEMHSTPGEGTTVSFTLPVRHSDQKITRDFVAA
>Q9A5I5 ~~~pleD~~~Response regulator PleD~~~COG0745
MSARILVVDDIEANVRLLEAKLTAEYYEVSTAMDGPTALAMAARDLPDIILLDVMMPGMDGFTVCRKLKDDPTTRHIPVV
LITALDGRGDRIQGLESGASDFLTKPIDDVMLFARVRSLTRFKLVIDELRQREASGRRMGVIAGAAARLDGLGGRVLIVD
DNERQAQRVAAELGVEHRPVIESDPEKAKISAGGPVDLVIVNAAAKNFDGLRFTAALRSEERTRQLPVLAMVDPDDRGRM
VKALEIGVNDILSRPIDPQELSARVKTQIQRKRYTDYLRNNLDHSLELAVTDQLTGLHNRRYMTGQLDSLVKRATLGGDP
VSALLIDIDFFKKINDTFGHDIGDEVLREFALRLASNVRAIDLPCRYGGEEFVVIMPDTALADALRIAERIRMHVSGSPF
TVAHGREMLNVTISIGVSATAGEGDTPEALLKRADEGVYQAKASGRNAVVGKAA
>B8GZM2 ~~~pleD~~~Response regulator PleD~~~
MSARILVVDDIEANVRLLEAKLTAEYYEVSTAMDGPTALAMAARDLPDIILLDVMMPGMDGFTVCRKLKDDPTTRHIPVV
LITALDGRGDRIQGLESGASDFLTKPIDDVMLFARVRSLTRFKLVIDELRQREASGRRMGVIAGAAARLDGLGGRVLIVD
DNERQAQRVAAELGVEHRPVIESDPEKAKISAGGPVDLVIVNAAAKNFDGLRFTAALRSEERTRQLPVLAMVDPDDRGRM
VKALEIGVNDILSRPIDPQELSARVKTQIQRKRYTDYLRNNLDHSLELAVTDQLTGLHNRRYMTGQLDSLVKRATLGGDP
VSALLIDIDFFKKINDTFGHDIGDEVLREFALRLASNVRAIDLPCRYGGEEFVVIMPDTALADALRIAERIRMHVSGSPF
TVAHGREMLNVTISIGVSATAGEGDTPEALLKRADEGVYQAKASGRNAVVGKAA
>T2KNA3 4.2.2.-~~~~~~Endo-acting ulvan lyase~~~
MGTSVRRISVVLMMLFGTNFCWSQSMQHPVIWVTQDEKQDILNLIEKYDWAKKMEHDLHAVVDKKVEAHQKKPSVILSHI
PEIPADNSLTEFEAVTVGDHAAVLTDASYAAMLYFLTDDEKYAQFSADVLWHYVTVLSDRSPKNTTICGNHFYDPRTSYA
QFALAYDFIYNFLNKPTTKVYKASANKQQTFDRDLFQKVLLNMVGSSLQEYGRPDTHGKFISNHPILTAPGVLYGILCIE
DDKERERLFDVFWEKGTAHQNSFKNTILPMFGKQGIWPESTSYSFMPAVTLVLNIIDRVYPEMQVTQNYKNIYKGNFLFD
NLRMPDGRFVRYGDSKRNHDGTEQLYRYTLNLAQRRGYSNLENQAKIALSQAYQRQGGYQSKISPATFNSSEPLKLFWGT
PIPKGIDSKIDFKKPTVLVEHAGIALQRNYVETDNELYGLCGIIGGAHYVHSHVTGITMELYGAGYVMAPNGGLPKTVKE
RRIPLHENYFRLYAGNNTVIVNGTSHGIQPGSWKDGAYVWQNTVVNIAAEPKHLEDPISEHFNFATQFLKDTINNCDQER
TLSTIRTSEKTGYYLDVFRSKSLTENKFQDYIYHNIGDATLLETENGETLRTEPTTRYKTDIGDPVQSPGWRYFEDTKST
KPIHKGVHATFKIDYDERFMHMFVPQGVNRSYTTALAPPTREAKNGYEEKPTQVLAIRQDGEAWEKPFIAVFEPSVKASS
SVQTVTPLQDADKVVGVTVISKVNGKLITDYIISLDSKDGVYENKILKIKFEGRFGIIRVEGEQKTISLYIGEGKTLKYN
NYTLDSETATTAYKVF
>T2KM04 3.1.6.-~~~~~~Ulvan-active sulfatase~~~COG3119
MKRLFIYTLGLVLTVSACKEKQQKTTVTQEQKPMNVLFIAVDDLRPELNFYGASHIKSPNLDKLASQSLVFNRSYCNVPV
CGASRASLLTGTRPTRHRFFDYKARADTDAPEAVSLPMTFKQNGYTTISNGKIYHNIDDDSLAWNTIWFPKGNIRNYQLE
KNILKNADANLAMGGASAFENADVNDEAYFDGEIARKGIADLQHLKKSKQPFFLAMGFMKPHLPFNAPKKYWDLYDREQI
KLPETYVQPESTPKKAFPNYGELRNYGNIPKKGDLPEDLAKELIHGYYACVSYVDAQIGLVLDALEKTGLADNTIVVLWG
DHGWNLGDHKLWCKHVNFETALSAPLVVKVPGKTNGERSNAITEFIDIYPTLCELTGLEVPDTAEGVSFLPLINGEKQIK
NWAVSKFKDGVTLVKDDLFYTEWTDDDGVAYERMLFDHKTDSLELDNLAEKPEFQELVKQLALELRQKWGKDFLTQNTK
>T2KN71 3.1.6.-~~~~~~Ulvan-active sulfatase~~~COG3119
MNSKKTGVIILGCIAFLHIACSGDKKTQAQDTSDSMLEKSSAISEKPNIIFYLADDQDVYDYGCYGNEKVHTPAVDALAK
DGILFTNAFTAQAICAPSRSQLFTGKYPLKNGCFANHTGTRSDIKSVTTHMKKLGYEVVLAGKSHVKPENVYQWDREWEP
VPKQGVPRDYIPLDSIAAYLKNAKKPFCMFITSKYPHGKYFDVEHPKASDIKFYPFNENKKTDKTFIKTKAGYYRSIEED
NTQLEEVLKLVDTYLTDNTLFIYSADHGVSGKFTVKDIGLKVPFVARWPKVIKPGSTSNQLIHYTDVLPTFMEIAGGKFP
EDMDGNSFLPLLQGKDVEVNNYVYGVRTNQNILNSEIFPSRMIRDKRYKYIRNFNSIEVVEQNLTGKPNVNYFIERGAKA
HKNEPFEELYDLQNDPFEQHNLASNPDYKSIKEKLIKDMFSWMKAQGDILSENMIGIPIITPKGNRGFKLDQDTPRRKIP
EARKNTLTKDDYIVIEHW
>T2KMG4 3.1.6.-~~~~~~Sulfatase~~~COG3119
MKTRYFLLLGICMLSCRTDEKKQVQKEVDKPNVLFIAVDDLNNMISPIANFSNIQTPNFDRLAAMGVTFTDAHCPAPLCG
PSRSAIMTGLRPSTTGIYGMTPDNKIRRDDNEATKDIIFLPEYFKKNGYHSMGIGKLFHNYAPDGMFDEGGGRVKGFGPF
PEKRFVWDGFGTSKSRKGQYGRTNTDWGAFPESDTLMPDHQAVNWVLERFNKNYKQPFFLALGFQRPHVPLYVPQKWFDL
YPLESIQTPPYQSDDLNDIPPVGLKINDLPMMPSTEWAINSGEWKKIIQAYLACVSFVDYELGRVLDALKNSPYAKNTII
VLWSDHGYRLGEKGTFAKHALWESATKAPLFFAGPNLPKGKKIDAPVEMLSIYPTLLELSGLQAYARNEAKSLVRMMQKN
EGLKDTYAITTYGKNNHAVKVDGYRYIQYEDGTEEFYDNASDPNEWINEANNFKFKSKIEALKALLPKTNATWDAESNYT
FQPYFVEQKTRGNVNAAKAVKVIGAER
>T2KNA8 3.2.1.31~~~~~~Putative beta-glucuronidase~~~COG3940
MPRFLKYILGLFLISISAFGQNLVPEVTELESGFNSPPNQAKARTWWHWISGNVSKSGITKDLEAMKAVGIQEAQLFNVD
LGFPAGPVDYLSEDWLDLFHFSALEAKRIGLELTFHNTAGWSSSGGPWISPEYAMQTVVYSEIIVKGGKAIKKQLPQPET
KLNFYKDIAVLAFPKPKQTMKIDDLDFKSLSGRIRNHLLPDTKIIPSEAVIQKQEIINLTAHLNDAGILEWKVPKGEWVI
LRLGHTPTGKKNHPAPKGGHGLEVDKMSTKAVDVYWEGGIQPILNKLGDLVGTTVNNCLIDSYEVGTANWTAGFDAEFET
LRGYSLVSYLPTLAGYYVESGEITERFLWDFRRTIGDLMAKNYYAHFRDLCHKNGLKFSVEPYWGPFDNMQVGATGDIVM
CEFWSGGYPFFDSPKFVSSIAHLNGSSIVGAESFTGIGGWDEHPAELKSIGDRAWAEGITRFIFHTYVHQPWDVAPGLAL
SYHGTDFNRLNTWWRQGKAFMDYIARSQFMLQQGKNVADVLVFTGESSPNTAFLLPEIKQLGYDYDLIGSNKLSDLFVKN
GKICTPVGGQYDVLMLPESDWIKPETLHKIEDLVKDGAKVIGSKPKKSPSLEHYSTCDAEVKRLSDFLWGKGLVKEISIV
DFLKGNNLLADFKIESDDVSDISFIHRKTDEADIYFIANARKESREIKVRFRVSNKQPEIWQAESGTIKKPAVWQNHADG
TTSLPLQLGMEEAVFVVFKNASKEKSQLVSAKMELENPKSEPLSNLQIIKAEYGTFLQEGLVDITDKVAAEVKDNQLHIQ
ASRAFCDCDPAMGYIKEFRMEYQIGEDIKTISAQEKEYVNINAGDKKLTVLKAVFGKFKPETKGVPKHYPVHDVTEKIKQ
EIASGNLVIPVNNQLIGGKTPEGDNTTIKITFTTDGEEQTLFVPKGRPLNLSKDRSKPEIVLNDGETQWITPYPGTLSYK
NLSGKVMATTVKSVPQPIMLAGTWDVEFPSDLVTINKVRFDELKSWSAVENEGIKYFSGTASYHKTFQVSKKLLKSNNKL
ELDLGSVAVIAEVILNGKPVGTLWKAPFRLDVTNDVKTGENKLEVKVTNLWPNRLIGDEKLPLDFERKGPKIKSVPDWLL
NNTKRPSERTTFPAWKHWDKEDELLSSGLLGPVKINVLVEKSL
>T2KM09 3.2.1.31~~~~~~Putative beta-glucuronidase~~~COG3250
MHQKIVECIPLWSRNLNSNVKGMRGEGIACLLIVLCSIIYRTEAQRIETNFNNNWHFILKDSPDFSKENLDDSSWELLNV
PHDWSFEKGVRKGGDQGQGGGYHDGGIGWYRKTFSFSKASLSKTTYINFDGVYMNSEVWINGNRLGKRPYGYISFRYDIS
KYLKVGKNTIAVRVDNGLEPSARWYHSCGIYAPVKLVEVNPTHFKPNTIFIKTPSIEKQQGVVSIDAEIKGAFKGLKYNV
ELLTANGKVIATHSEKLASAQPSVQLEVKPPKLWSPESPNLYKAKTQILDGKKVIDEKTTTFGFRTVAWKTETGFWLNGE
NVKLKGVCEHWEGGPVGGAWTKPMLRWKLQSLKDMGINAIRPSHNSTPPMFYDICDEIGLLVMDEIFDGWHKKAPEDYGK
QAFDEWWQADVKEWITRDRNHPSIFVWSLGNETHSDVAPEMVAFGKNLDPTRLFTSGAGNPEDMDIQGVNGGSETKSFIE
NNKLTKPFISTEAPHTWQTRGYYRTQTWWRDNELSGTYELPNLTEKEVFFYEGINPKNWKNRKQRFNSSYDNATVRVSAR
KYWEVMRDTPWHSGHFRWTGFDYYGEAGLVHGGLPFNLFMGGALDVAGFKKDLYYFYQSQWTEKPMIHMLPHWTHPRMKK
GTVIPVWVYANADEVELFLNGISLGKDKPGTVWNEMQCEWLVPYEEGTLEAVGYINGKVVNRTSFSTAQQPSKLKTSILK
LDAEGSFTDSFIVTSESLDTAGHLYPYGENKVYYHIQGDVKKISMENGNPIDPTSRTKSDFRALFFGKTRTFLRALPEPK
EAAVVTAAILGDKALYTSNLITIDAQHIQLLGKSKTSDLEIRYTTNGENPETHGKLYKDAFMVEDDTTVKAIVKQNGKTV
LSMEETFGKNEGLFWGDEHSADMWIGRGVDISAEEGVLTGAAKPSREAHRFKGSGFVDFKGGEGSITWYQENDGEPGDYS
IRFRYMHNNHGKLHPMKLYVNDEYVRTIEFEPTGGWEKEWKFVPTIIVLQSGANNIKLETTGESGPFIDELFID
>T2KN75 3.2.1.31~~~~~~Beta-glucuronidase~~~COG3250
MTILCSCAQQQQQQQQDTEVITSSERTTYNFNVDWKFIKSNPKQAQDINYNDATWETISCPHTFNDVDTFDDLSHGHHDG
EDNQWRGTVWYRKHFKLPKDDKGKKVFIEFESVRQIADVYINGVHLGQNQTGFIPFGFDLTPHLKFGEENIIAVKVNNDR
GDHFRENFPLVWNHEHWHPTHGGIYRNVFLHTMDPLHITLPLYDNLETVGTYVYAENISEKSADITVETEIQNEHAENKN
ITLVTQIVDNDGAVVAHSNKNVAIPSGQKMKVTTVTNIQNPQLWYTRYPYMYKVVSAIKESNKVIDTYESPLGIRNFDFN
KDSGFWINGEQIKLHGWGQKPTNAWAGLGAALPDWLRDFTFKLMDEAGGNFIRWGHCAASPAEVDMGDKYGFVTLMPGVS
GESEDEGETWDIRYKAFKDLIVYYRNHPSIFIWEGGNWAESEAHYKEILEAIKTFDPKGKRLMGNRRADVKNDSEGYVSI
EIGTEGWEREYPDLPIIESEYNREEAPRRIWDKNSPDDNFYNHPNISKNTYKLSSEEFAVRQADHWWNKMGKKAYHSGGA
NWIFSDGPHGGRCPTEVTRASGEVDAVRLPKEAFYALKAMWRPEPQVHIVGHWNYEVGTKKTMYVMSNCASVKLYVNDKL
VGTNSNPENGYVFKFDNVAWESGKIKAEGFIDDALKTTQTKETTGEPAALKLTSITGPEGWLADGSDVALIDVEVVDAQG
RRCPLAKGRVDFTISGPAIWRGGYNSGKPNSTNNLFLDIEAGINRVAVRSVLESGTVTIMAKKPGFKDVSVTLKSLPIDF
NNGLTTTLPQVYTNVLTKEPLPEHIPEMPEYIPGVKNRSELFRKFSYTGDGKAMLRTNMHWGKKAYTDLEYNYTVLPRYL
NESEYVRTPNSDNRYWARDQLQFIAGKKMHIYVLHDDTVPRPEFLLRDYEDTGDNVNVVGASMSVFHRVAEEGESIIMAG
NSDGDAPENCRMYTVMVKEFK
>T2KPK5 3.1.6.-~~~~~~Ulvan-active sulfatase~~~COG3119
MTNMNLKHKIFIMLLLVFCSSKIIAQQSQPNVLVFYVDDLRAELGCYGSKTAITPNIDKLATEGVQFNKAYVQQAICAPS
RMSTLTGLRPETLGIYSIFTPLRSVHKDVVSVPQLFKENGYKTVSIGKVYHHGTDDKNQWTNYFTKEPNTYNKPENIALL
EQFKKEGKKANGPAFENADVADEAYKDGRAAKYAVETLKKLKNDKFIMFVGFSKPHLPFNAPKKYWDLYDKNNFEIPERK
KPENMYRLALTNWGELKGYHGIPNDVEYLDDNLTRDLIHGYHASISYVDAQVGKVMEALEALGLRKNTTVIFMSDHGYKI
GEYGAWCKHSNEEIDVRVPLIVSRETSYKGRVAGKTSDALVENVDIFPTLVELCGLEGPKTDGKSILQVIDRPNTPWDQV
ATAVYARGKNIMGCTATDGEWRYTEWRDAKTQDILGAELYEHKNSLLSFKNLSGNTKYKKEEARMKGLLETQFPRNQGPF
LQHDTPRN
>T2KMG7 3.1.6.-~~~~~~Ulvan-active sulfatase~~~COG3119
MNFKQNIVYKKMAISMKITAIRPIALVISFTLLSCKDKVKTVEQQDEPTKPNIVYILTDQWRGAALGYAGDPNVKTPHLD
ALAKEAVNFTNAVSVTPVCTPHRASLLTGKYPITTGMFLNDLYLPSEELCMAEIFKAEGYNTAYWGKWHLDGHRRSAYTP
KERRQGFDYWKALECSHDYNKMPYYDNDNPEVKYWGKYSPFAIVEDANTYLEKQAKDDTPFLAVVSIATPHFPHGSAPQK
YKDMYSPESLILNPNVSPKFEARSREELQGYYAHATATDEAIGLLLKQMDALGLNENTIVVFSSDHGEMMGANDVRPFQK
QVAWDESIRVPFLIKYPGIDKQKGVTVNAPINTPDILPSLLGLSNIKIPDGIEGEDLSELIKNPDPEADREALVMNVAPF
AGGYPNLPYRAIRTKQYTYARTTEGPSMFFDNVADPYQQNNLLGKPEFETLQNELDAKLNKKLAELGDEFKSRDYYLKKY
NYVFGKNKPAIPYWEFNNGKGEVQSPIPVTQ
>T2KLZ3 3.2.1.-~~~~~~Unsaturated glucuronyl hydrolase~~~COG4225
MRKLVYLVLVLGLTFLNVRCKSETKQNKKEEQNIGKQYSSLENRFQKLVNYPVGANNFPRSMSLAPEVVHKVPSKDWTSG
FFPGNLWLIHELTGDSIYKVKAQEWTVLMEDQKENDRTHDMGFKVYCSFGEGLKQDPDNQYYKDVIIESAKTLITRYNDT
VKSIRSWDFNKDVWDFPVIIDNMMNLELLFEATKISGDNIYHNIAVQHANTTLKHQFRPDYSVFHVINYDTISGVVKTKD
THQGFDRNSTWARGQAWAIYGYTMSYRYTNNPKYLAQAEATTQFYMEHENLPKDGVPYWDFNDPEISDAPRDASAAAIVT
SALFELYTYTNNKTYLDFATQVLNTLNSEAYLLKDTVNGPFILNHSTGNWPKNDEIDEPIVYGDYYFLEALKRKQNLILK
>T2KNB2 3.2.1.40~~~~~~Alpha-L-rhamnosidase~~~COG3408
MILHKSVFKSYIYVLTYFVFFSVMSCENSSVLQEAKHLTISEGFKNPLGFYDAKPTFSWELPVVEGVISQSAYQIVVASS
PDLLPNNPDLWDSNKQSSSQSVWINYEGKPLVSRQKVFWQVKYWNQDDKASNWSPVQNFELGLLNNSDWKAKWIGLPTKE
EGVLGSQDNIIHRPQYLRKVFELSNDVANARLYITAKGVFDVAINGEDVSDDVMPPGYTPYKKRIETITYDVTDLIESGQ
NTIGVEVAAGWHSGRLGWMKSYWSDTESPKILCQLEVTMKDGSKASIISDDTWKATTQGPIRISEIYDGETYDAHLEMPH
WTTNSFDDKNWKAVQAFPVTSTIKLEPKRHTTVKSKIVLESKEIILKADAAIFDLQQNMVGVPLLKVPMKMGDTLKIRFA
EMLSPDGTFYTDNYRSAQSTDYYIAAKEGTIEWMPKFTFHGFRYVELSGFDASKTPSKNWVKGVVQYSNFNENGSFTSSH
EKLNQLQSNIVWGLRGNFFDIPTDCPQRDERMGWTGDAQVFGPTSMFNADVYKFWASWMQSVRESQYDNGGIPFVVPDVL
HNGKVSSGWGDVCTIIPWKIYYRTGDVGILEENYDMMKKWVAHHQATSKDFISHMNSFADWLQPYPENGNNKGDTSHSLI
GTAFFAHSAKLTAKTAEVLGKKEEQATYEALYKSVAKAFENAFFKNGKVKDVTATQTSYLLALAFDLLSEENKENAKQQL
LEKISEADNHLRTGFLGTPLLSEVLDETGEIDLMYKLLFNETYPSWFYSINQGATTIWERWNSYSKAEGFNPMKMNSLNH
YAYGAIGEWMYERITGIAPLQAGYKIISIAPIPKAPLTSASATLNTPYGEVASSWEIKNETLFLEVVVPPNTTAEIEIPT
DNSESLKVDNENFTNGKNLKLIKNEKRKIKILAQPGTYEFQAKYSL
>T2KM13 5.1.3.32~~~rhaM~~~L-rhamnose mutarotase~~~COG3254
MERLAFKMKLNKGQKQAYKERHDQLWPELKQLLKDNGVSEYSIFIDEETNTLFAFQKVSGHGGSQDLANNEIVKKWWDFM
ADIMQVNPDNSPVSIPLEEVFYME
>T2KN80 ~~~~~~Uncharacterized protein P22~~~
MCTTAWATAQPVIKPPKGRIAIIADGNSPDPDDLGGTAISLALLRATSLESRLVHYSHSCDLVRVNRISEAAEYERHAMM
QTACDGTARRWGGFENLTFFDAKWQLDETIKDLSKAINASSAEDPLWIIEAGEPDIIGFALAASEKEKHQYVKVVTHHPA
NDDAGDFYTWQSILDFGVEEVRIPDQNINLKVDESEWDWAKNHSDDRMKFVWLMGKMAEVDDVVKFQKGKWDCSDAGMVL
YWITGATNGGVKQGSVTQVKTILEGFLSQNNN
>T2KPK8 ~~~~~~Uncharacterized protein P23~~~COG3055
MKKSKASALLWLFSLVGFMLHAQTFNLNQPMVAQNVIFEEKDGLVAVEAEYFYKQTHTDLREWYRTTKDSVAVVGRDEDA
NHYLNASNSSYIEVLPDTRVTHSDQLVRGVNFSNKPGQLAVVSYKIKFNSPGRYYVWVRALSTGSEDNGLHVGLNGTWPE
HGQRMQWCDGKKYWMWESKQRTKDEHCGVPHAIYLDVPKAGIHEVQFSMREDGFEFDKFVLTTNSNYVPIDKGPNMTLAD
GNLPSSYKSKSEPSYFNTIARKLPENKFIASQEFPIDGTNFYKNGKNWLAINPEQYKQAKISTLFDFESGTYDVIYVGVG
ENDGRSTFRIVINNKELGTYQPPLTQMLWEEGKAFNGFWKNVKLNKGDTITVEVQVASDGNEWTRGRWAGIVFAPVGQGY
VVQESPSTYIFEK
>T2KMH0 3.2.1.-~~~~~~Beta-xylosidase~~~COG1472
MKKLWLMGLLLASFFTTVAQNNAQTKSNSDEEIDKKVATLISQMTLDEKIAEMTQDAPANERLGIPSMKYGEALHGLWLV
LDYYGNTTVYPQAVAAASTWEPELIKKMASQTAREARALGVTHCYSPNLDVYAGDARYGRVEESYGEDPYLVSRMGVAFI
EGLQGTGEEQFDENHVIATAKHFVGYPENRRGINGGFSDMSERRLREVYLPPFEAAVKEAGVGSVMPGHQDFNGVPCHMN
TWLLKDILRDELGFDGFIVSDNNDVGRLETMHFIAENRTEAAILGLKAGVDMDLVIGKNVELATYHTNILKDTILKNPAL
MKYIDQATSRILTAKYKLGLFDAKPKKIDTETVETGTDEHREFALELAEKSIIMLKNDNNLLPLDVSKIKSLAVIGPNAH
EERPKKGTYKLLGGYSGLPPYYVSVLDGLKKKVGEHVKINYAKGCDIDSFSKEGFPEAISAAKNSDAVVLVVGSSHKTCG
EGGDRADLDLYGVQKELVEAIHKTGKPVIVVLINGRPLSINYIAENIPSILETWYGGMRAGDAVANVIFGDVNPGGKLTM
SFPRDVGQVPVTYLERPDFIGSGKGQYRFSDKTPLFPFGFGLSYTTFKYGTPKLDNTSIAANGTTTVSVEVTNTGKVTGD
EVVQMYVRDDYASVGRYLKMLKGFKRITLKPGETKTVSFKLGFDELNILNQDLKKVVEPGTFTISVGASSKADDLKTVSL
TVK
>T2KNB8 ~~~~~~SusD-like protein P25~~~COG0702
MKIQNIIVYVFLIFSCFSCEEFLEEDPRALIAPETFYQSESDVRQAVVGLYSILKNNSIYGQLGLDLFYDNGADIIEPNR
STNVVEPLGNYSLNEAIADVSVQKMSVSDTWKDLYRVIYNANIILDNVDGNDAISEEAQIDIMAEVKFIRALCYWHIVNL
WGDAPFYTEPLVLEEIRVLGRTDEDTILSTVVSDLQYAQVHLASVYPEEDRGRASKWAAAIVEAKIHMQEQNWQAGLNKC
MEIISQSPHSLLGNYADVFNPNNEYNSEIIWSLDFAKDIRGQFEEGTLGADGSFPSVFGNGNWRPSMFAPRLRDEPKNSS
ERNALAAALQANGEAFNGTGLQVASKDFAGKFPRNDYRRALNIVDNYLGFDLNFPYMAKIWNLDVDNSPRFNHSDNRIVF
RLADVYLMAAECENELNGPANAFQYINKVRERAFATQTEWELKGLDQQGFREAIYDERKWELAGECHRRYDLIRWGILLD
VVQDLEYRFWTPNTNIRPYHVKLPIPLQELQVNPVLLESDATNNGYR
>T2KM18 ~~~~~~TonB-dependent receptor P26~~~COG1629
MFCYSSLHAQIISGTVSAEGQVLPGAAVIIKGSTKGTSTDFDGYYTIEAQASDVLVFSYVGYANKEVTVGTNTQIDVALE
ADNTLDEVVVIGYGTQRKSDLTGSVSSVSAEDVNVNPVSRVDQALQGRAAGVQVTQTSGAPGAASVIRVRGGNSITGSNE
PLWVIDGIVVGTNFNLNNINSNDIKSIEILKDASSIAIYGSRGANGVVLVTTKTGTGAGSSKPEVSANIYTSMQMVPELP
KMLSQAEQIAYTNESAAFRGAAIPFPNDPSTYPNNDWFDLLLGPAPIYNADVSITGASENVSYYTSLNYFNQEGIVKTSG
IEKYIFRSNLDIRLSDKLKTGFRVNYSYIDQQNGLVGYGNAIATLPTQPIYNEDGSYNGFDEVVGSPWSNPIANMALNTN
ETFRNNFLGSFYIDYSPSEKWIIRSTFSPDFDNSKQNRFTSSQSPNLLYLGEGGNASVRTVNTKGWNNENTIQYQSEIGE
NHRITALGGASFQKVSTEIVESEAFGITNDATGFNNLSNSDPTRNILTSDYSGFQIASFFGRLNYAYKDKYLLTLVGRTD
GSSVFSDDNKYEFYPSIAAAWKISEEGFMQNQETFGELKLRASYGKSGNQAIDPYRTKGLLVEANTTLNGIQQTGLTLGR
PSNPNLTWETTNSLDIALEASMFNGRVFAELNYYYKKTNDLLLDVTIPKQTGFNSQLQNVGSLENKGWEFSLNTTNVRTD
NFNWKSTLMLSSNKNKILDLGGVDFIDLVVDELLGSGNTRLIVGESVPVFTGVKFLGTWKSQEEIDASGLRDPQVVGGAK
YHDENGDGIISTDDAVVLGSPLPDLIFGFENTLSYKNLDFSFYFQGTQGNEVYNLRMRNHYFNRGEFTKFAEVADRWTPE
NPTSDIPRAGGDSVTGTPPNSAYVEDGSHIRLKTVRLAYNMPVDKMGMDGVKNATVYLTGTNLLLWSDFRLIDPEGSNFG
RNGIGNIAQGYNDGSYPNPRTITLGLNVTF
>T2KN85 3.2.1.-~~~~~~Beta-xylosidase~~~COG3507
MYLNACRALTLISVLSLLACNSEPEKKQATPSKEIAKAQTGSWGDQGDGTYINPILNADYPDSDIEQVGDTYYMITSKQH
MSPGMPILESKDMVNWTNVGHVFNSLSWAPEYNWDRMNGYSFGTWAGDLAYHEGTWYCYQIDYQHGLMVATSKDIKGPWS
KPIMMLPKSEVLDDPAVFWDEDTHKAYIIINTAGKQKEASNTIEGNENRIYEMSWDGTKILDEGKLVYTGMGAEAAKIYK
IDGTWYIFLAQWTMGDMSTKPGVKNPKNDRKQIVLRSKESIYGPYEVKTVLEKGTVFNNRSASQGALMQAPDNSWWYMHQ
LIQNDDIPFQGRPQCLEPVTWVDGWPIIGVDEDNDGIGEPVKTYKKPIDGYPVTAPRTDDDFSSPKLGFQWEWNHNPRNT
HWSLTERPGWLRLKASKVLPNEKGYGPNINEWTNNDGSDSDFWRANNTLSQRIMGITTGTAVAKFDVSGMKPHQLAGFVR
YGGVFNLLGVEVDEHGKKHLFYMEPMGEKTVGPEITVNDLYIRTSNRSNQAIYEYSFDGKNFKRFGPTFTIAFGKWTGDR
LGLFSWNDKEDAGYIDVDWFTYDYDGPKAANQ
>T2KPL4 3.2.1.40~~~~~~Alpha-L-rhamnosidase~~~COG3408
MKYNKLLFSLLLLAVFCFSCKEEQKLQTSLDFDKVLLNAKSNPIAIESESPLFSWIIKAEGFGKSQSAYHILVASSLDKL
DETHADVWNSNKVESSKSTFVKYEGKELKAATRYYWKVKVWDKSNQESNWSEPQYFQMGLLDESNWGEAKWITLTNDTRT
SEYRFREYKTGRMEQPIQVDGFAASYFRNKINLNKEVDNAQVYICGLGYYEFFLNGEKVGDHVLDPAPSNYDKQAYYVNY
DITEQLNSGENALGIILGNGFYGQNISWKNDPESDRDLAYGPPTVRVLLKLKYKDGTESEFFSDETWKESTGPIVFNNIY
GGDTYDARFELGDWTSTNYDDSSWGFAKETAPEIKNISAQQIPAIKKLQDYEPQNVFKGSDGEWIVDFGQNIAGWVKLNV
SEKEGQLIEVITTEALLTNGRDIFPGSTGGGANGMAQIYQYICKGDGQESWEPKFSYHGFRYAKIKGVSTKPDADMIKAV
LVATDIQETGSFECSDDLFNKMHNISKWTIVDNVHGIPEDCPHREKCGWLGDAHAFCEYALYNYDMYDFYKKYMEDIRTQ
MLPTKGHNNPELKFQVPTMIAPGKRTSSYAKIDWGVATMYLPWYNYLYYGDDAIVNEYYPEMKDLTNFYLNFKGENGIMQ
DGMGDWCPPRWDRRTNPEAMECDPIISANAYFYDVLGIMETFAKMNNDGAFQSEMKAEKEALKDAFNKAFLVEIPNTDFK
WYQSQTATVQALQFGMVPEEEIENVVNGLEYDIVEVKGGHHSTGIHGNRYIYTVLSKYGKADLAYRILTTPDFPSQTYIM
NSGFTTWPERQFEWETMEGPTNSLNHPMHSGFSAYFFESLGGIKSSTKEAGYKQFIVNPEFPSQITQTKVSVPTPYGDIK
NDWSFEEGKLSMTLEIPFNTEANLVLNQAELESLIINGKTFQNLQKNTKSVTLQGSNVILGSGKYKILYNKR
>T2KMH5 4.2.2.-~~~~~~Broad-specificity ulvan lyase~~~
MKRRNFIQLSSLATIGMSLPSAGIVNACSSFPEQSLEFKNLTSELLKEWCDGMLKVQINNPSNLEEHGALRCPSCSHIHG
RCMDAVYPFLYMADVSGDEKYIEAAKLVMIWAENNVSQENGAWTVIPNPKSWKGITIFGAIALAESLHYHSHILDDKTLK
AWTNRLARAGQYIYDTFTIDFTNINYGGTAIYGLDIIGDVLGNGNFKEKSKKMAEEVQAFFTKNDYLLYGECKPEADKLS
AKGLHGVDLGYNVEETLNSLVMYALKNDDQALLQIVTKSLNSHLEFMLPDGGWDNSWGNRMYKWTYWGSRTCDGSQPAFA
MMAHINPAFGTAAVKNTELLKQCTANGLLHGGPHYISAGIPPCVHHTFTHAKPLAALLDHWKHLPEINKTTALPRVTANG
IKHFKDLDVLLFSRGDWRGTVSAYDAEYHYKKDYRQATGGSLGILYHNKVGLLCAASMAVYNMVEPYNQQPQPGKDIALT
PRIETFKEDQWYTNLYDLTANLEAIDTKEVINLASVVKLKNESRKMVSGTASEFHLTYSCAKEGLTIKVSTQQDILEPTA
FVLPIASPEKEKVEFVNEHEIKISKPGGVVTIKANVPLKLKEYSGTRTFNMVPGLEALPIELFFETHIKELVLIVSVV
>T2KN63 ~~~~~~SusD-like protein P2~~~COG0702
MKKYKITFIVLLLTLVGCSDLEENPVGILAPESFFKTTADLQAAINGSYASMSTESFWGRKLTLTLLLRGDLADIGDQGT
SGRRKEVNNFTMGDDNGMVSAFWPQAYAIIGTANQAISNAGLINDDENKVNAVAAQAYFCRAFTYYHLVRLFGDIPYIDF
AVSDASEIDAISKTPENEVYEGIIADLQYAKEWLPDTQFSRALPSKATAAAYLASVYLTRGDFQKAAEEAQFVINNEARF
DLRLEPDFQNLFDANQTAGLKEPLFTIDYMGQISSSGYGQDYVASVTGIRGDATHEYGEGWSVAVPSLKVYQDWDAKDYR
RAVSLDTTATSKSGEVYPYTQFEEYSDLAVNRPHIAKYYRYAGLAGNNGRESSTNYIPMRYAEVLLIAAEALNEISAGSS
EAVSYVNRLRERARLGSGSMHPLNISEGLLQDELRNIIIEERKIELAFEFKRWYDIKRLKLGNEVFGPNGLEPQPNFDAN
RDYLLPLPGPELVRNSNLMPNNPGY
>T2KNC2 4.2.2.-~~~~~~Endo-acting ulvan lyase~~~
MLEKTTLKNIILIHFLMFLAVVTAQTAPDEDTSAITRCTAEGTNPVRETDIPNPVNVGTIDDRSCYANYKESTVYGKTWG
VYNITFDSNDFDTSLQPRIERSLSRSSETGIGSYARLTGVFRILEVGDTSGTSQDGTYLAQAKGKHTGGGGSPDPAICLY
LAKPVYGTGEDADKQVSFDIYAERILYRGGEGDGREIVFLKNVKKDEETNFELEVGFKEDPNDVSKKIQYCNAVIGGDTF
NWNIPEPERGTESGIRYGAYRVKGGRAQIRWANTTYQKVENVEVTNPGPIGDVYKLKNVATGQYLSDSGVSASAVIMSDS
GEAQNNYWTFVESGSLFNIDNETFGILRAPGAGGPGGAYVVVSTTKEGPSSDGDKVWTIHYNESNDTYRFESGSSGRFMY
QEINGNVTHISAMNTDDRSVWKAIAVESLSVDENAILASDVRVFPNPASDSFTISLKTINHVTVNIYDVLGNTIFKSEFN
GDTIQIRNKGQFKAGVYLIQLTDKNNNKYHKKLIVK
>T2KM23 3.2.1.-~~~~~~Alpha-1,4-L-rhamnosidase~~~COG3664
MKNKKRLCHILKYIITCFLFGVIFIIPIQAQIVLQTDFTDSENARQNIDYHFNVFNRITPLNGVKIKTPLGKPRVCIVRP
LGGIVKNGKPDISKDSYKWDKKSKTFYTDFTVLKNQIDGVINSGYAIHQIVLDNPSWAFQRNKNGELVADSLKVSTYGNA
EPPKDYNAWSNYLKDVLKFLVNTYGEESMLKIQFNIGREIGTPSHWSGSKEAFFEFYKISSSAIREVLPTAKVGTHFLWG
SSKNAWGTDFIKWSKANNVHYDFIGVSFYPFYNKPDRTLFKEVYAKDFAVIKDIPEWHKNAKLEMHEYALIKSLNKAGNA
FENAPKAQQNSFIVGLMKMFYEHNMQNVFQWGQGTNFEAAQEALFSIQGQTYYTSTKNGKPLLETNDVDAIFIKDVSNNI
YNIMAYNYNANPNATTDEHLNLKAKLDVPPGTKVKVRFALYNKEKDTMSWSEWKEEATQGNEKSKSVISLNAELPVFSFL
KYEVKVQ
>T2KN90 3.1.6.-~~~~~~Ulvan-active sulfatase~~~COG3119
MLFLRFKFFNNRLLFVSVLCFVICVSCKREHKEIKIKGEKATELKLPERPNILWLVTEDMGAYIPPFGDSTVVTPHLSKL
AKEGVIYPNLYSTSGVCAPSRAAIATGMYPSSIGANHMRTNSFTKERGLPAYEAVPPSNVRMLSEWLRKAGYYCTNNYKT
DYQFKAPVTAWDESSPYAHWRNRNDDQPFFAVFNFTDTHESGLFEPYGLREIETRLYRAGDTTYQWKNYGASHANNRMSE
AETPQYLSKDTKFNIPPYLPETDLVKRDMWKLYNNIGEMDNQVGAVLQQLEDDGLLENTIIFFYGDHGGPLPREKRLIYD
SGLNTPMIIRFPNKLEAETSDPQLISFVDFAPTLLSIIGEKPKEYMQGQAFLGQYKNKERSYIHAAADRFDAETDVIRAV
RDKRFKYIRNYRPEQGYYLPIDYRERIPTMQELLRLKAEGKLNEEQMQWFRDVKPEEELFDCKSDPFELKNLANNPEYQN
KLVELRKELDRWLTAIGDDANLPESELINKLWNGSNTQPVTSDPKVSINNGNITISCDTEGASVGYKIVTAGNKTSKTWH
IYNGPFKMPLGSTLEIIAHRIGFKPSKAIQISTSDL
>T2KPL9 3.2.1.-~~~~~~Unsaturated 3S-rhamnoglycuronyl hydrolase~~~COG4225
MKNQALKILTLCVLVGSAMSLKLYAQKGLNHSEIEAKMIKALEWQEAHPIFALAPTDWTEGAYYIGVSRAHKTTQDMMYM
AALKNQAYWNNWQTYSRLHHADDVAISYSYIYIGMNDKRPGFVNLEPTKKFLDAHLHEDDEWKAGTDKSASGKTILWWWC
DALFMAPPVLNLYAKHTNQPKYRDEMHKYYMETYNQLYDKEERLFARDMRFVWKGTEKDLKEPNDKKIFWSRGNGWVLGG
LALLLDDMPNDYKHRTFYENLFKDMASRILELQPKDGLWRTSLLSPETYDHGEVSGSGFYTFALAWGVNNGLLDRNKYEP
AVKKAWKALADCQHEDGRVGWVQNIGASPEPASADSWQNFGTGAFLMAGSEVLKLEE
>T2KMH9 3.2.1.-~~~~~~Putative beta-xylosidase~~~COG1472
MKKLLFTFLVSTGTIFFSCQRTYTQSKDYKNASLTIEERVDALLPKMSLEEKVAQMRIFHANIGVEAEGNGNLKLSDKVI
EKLKLGIAGIKNPGEHMDPVAAAKFNNDLQKYIIENNRWGIPALFVTESYNGVDAAGSTRFGRPLTSAASFNPQLVNRIW
DVVGREARLRGMHMCHSPEADLVRDPRFGRMSEAFGEDTYLTTQMVVNAINGVQGNYDGLGNGTHIGAVAKHFAGYGQVL
GGSNFAAIEISPRTLIDEIYPPFEAAVKEAKTLGIMASHGDINGVASHGNPELLTGVLRDQWGFKGYVVSDSNDIARLFY
FMNVAESPEEAAQMGLEAGIDIDLYAEDSYAYLPEMVKKNPNLEKLIDRSVRRVLRTKFILGLFDNPYIDIEEVKKGVRA
NSSLTLAKESDLESIILLKNENKILPLNKNKTTKIALLGPLVKDDTKSMFETVASKHISFVAEKGFHLTDEKGGAPKLLE
RDENAISKMVNMAKNSDLSILFLGGDEFTSKEAFFNNALGDRATIEPVGAQDELIEKIKALGKPVIVVLKHRRTLAINTI
SEQADAILDTWDLSEFGDESTARIIFGEVSPSGKLPVTVPRSIGQIPFHYSMKEINYKKGYLFMEDGPLYPFGYGLSYSN
FEYSDIKKSNSEMTKDSEIEVSVTIKNTGNVKAKEVVQMYIKDVKGSVIRPDKELKGFEKISLNPGESKKVSFKITPEML
KFTGLKMEKVLESGEYTVMIGTSSVDYKKTSFQLKK
>T2KNC8 1.-.-.-~~~~~~Oxidoreductase P35~~~COG0673
MESRINWGIIGCGNVAEVKSGPAFYKTENSTLVAVMRRNEDKVIDFANRHGVANWTTNAEALIQNDLINAVYIATPPSSH
LQYALRAINVGKNVYLEKPMVLNNHEANILVEAVKRSNVKVTVAHYRRELPVYLKIKELLDSNVIGNVISAEIQIKQTRN
TNLIAKTEVNWRTIPEISGGGYFHDIAPHQIDLMCHYFGEVENIKKGSCKENQVSHQDVSGEVLFKNGVQFSGTWNFNAL
EDKDECTIKGERGSISFSFYTSTITVSKNGLIESYHYENPEHVQQPMIEKTVGYFLAHNSNPCSVEEAAMVTHIMDVFCG
T
>T2KM26 ~~~~~~Bifunctional sulfatase/alpha-L-rhamnosidase~~~COG3119
MIKYKAIINLVFIAVFFNNAMSQTVKKEKPNIIFILTDDQRFDAIGYAGNKFVNTPEMDKLAQQGTYFDHAIVTTPICAA
SRASLWTGLHERSHNFNFQTGNVREEYMNNAYPKLLKNNGYYTGFYGKYGVRYDNLESQFDEFESYDRNNRYKDKRGYYY
KTINNDTVHLTRYTGQQAIDFIDKNATNTQPFMLSLSFSAPHAHDGAPEQYFWQTTTDALLQDTTLPGPDLADEKYFLAQ
PQAVRDGFNRLRWTWRYDDPEKYQHSLKGYYRMISGIDLEIKKIRDKLKEKGVDKNTVIIVMGDNGYFLGERQLAGKWLM
YDNSIRVPLIVFDPRVNKHQDISEMVLNIDVTQTIADLAGVKAPESWQGKSLLPLVKQETSTISRDTILIEHLWDFENIP
PSEGVRTEEWKYFRYVNDKTIEELYNIKKDPKEINNLIGKKKYQNVAKALREKLDELIAKNSDEFRAGPSDLTVELIRQP
ESEVKIFDLKPEFGWTVPLSSKYQSAYQLLVASSETIINANNGDVWDSGQVRSSQSTNVDFGGKPLKIGETYYWKVRIWD
EENRLVDYSKAQKFTIGESDNYIISTENKFVTDKIKPSKFENRDGVYFIDFGKAAFATMEFNYQAKTPHTLTIRVGEMID
ENGNVNRTPPAKSNIRYQELKVEVKPGQTRYRIPIQTDERNTRPNKAIPLPKGFPPLLPFRYAEIEGAQSSINANDVEQL
AYHTFWDEKASSFKSDNNILNQVWDLSKYSIKATTFNGLYVDGDRERIPYEADAYLNQLSHYTTDREYAMARRTIEYFMK
NPTWPTEWQQHVALLLYADYMYTGNTELVERYYEALKHKSLYELSNEDGLITSTKVDAEFMKKLGFPEGYKKPLTDIVDW
PGANFNGSKTPGERDGFVFQPYNTVINSFFYENMKIMAQFAKILGKTDEVLDFELRAAKAKKAVNEQMFDKKRGIYVDGI
GTDHASLHANMMPLAFGLVPQEHVDTVVEFVKSRGMACSVYGAQFLLDGLYNVGEADYALDLLASTSERSWYNMIRIGST
ITLEAWDNKYKNNLDWNHAWGAVPANAIPRGLWGIKPKTAGFGIASIKPQMGKLKSSQITVPTVRGAIHATFTHNGPRSQ
TYEIEIPGNMVAEFSLDDIDGKDLIHNGQKVPAAFGAVQLSPGKHIIELKINSF
>T2KN95 ~~~~~~Uncharacterized protein P37~~~
MKTYNINKRINTLLLLVITMLSFSGCDLEPQEKFRFDPEVDPQFTFGSMTTWEWLQTNPNDEFGFMIEAIKQTGLQDMYN
SKTETYTFFLMKDPNWTNNGPGFFSREFNLKNTADRDPKEVFEDPAVDLDIVRNYLLYLTLPIYVDQGPDHLKTLDLPYT
FETLSEDVNNQIMTIARDWNYVMQINDSPDLPTGNLGKINVPVGYHNYIFSNGNSVAHIFGLNNNGKMARRYKFGEPKMD
F
>T2KPM5 ~~~~~~SusD-like protein P38~~~COG0702
MKKFKNISITFLILISLGVLNSCESVLEVEPESSISDEQFWKTNEDAKLGLAAAYDALQKAYRTKRFYWGEFRADNYVNS
EKPQPDTQDLINNNLTPESSTEYLQWDEFYSLIFRANLAIEKIPEIPYYDTQYLGEAYALRAFAYFDAYRVWGGVPLFTK
AELTFSDDAIKPRSSAQEVLDLVLSDIEEAEKNLTVVSSDYTFSKLSLLAFKAQVHMYLNEYEAANTALTSLIASNQFSL
TTNRKQWRDLFLNDEINYPGEGQEGPELIMSIRYDFEEDGNRASGIYQVFFPGVPSYYVAPNLVEEWETKFPTDSTAWAT
KYPNVPPHVFEENEDTGELNAKYGDYRYYESIAAPGTQEEDLRISKYHKVNISPSIDDTNIILFRYADMLLLKAEALNQL
GQPTEAIELVNQIREARELPLVNSGTIPDVVNINDKDELEDFILSERRLELLAEGYRWWDLVRTNKAVEVMGPINGLTQD
RIIWPLWFRHLIDNPKLEQNVPY
>T2KMI3 ~~~~~~TonB-dependent receptor P39~~~COG4771
MFKQKLKMKPKIKRNCTFSGLAFILMLLFSSFTVNNLNAQSEVTGTIMGEDGIPIPGVNVIQKGTKNGTVTDFDGRYSVT
LVPGQLVLVYSYIGYETQEVPIKSRKVIDLTLKAELQSLDEVVVIGYGEQKRADVIGAVGSVDSEELSSVSPVDALQGIQ
GRVAGVQVTTNGGPGGDSEIIIRGISTFGAGSSPLYVVDGQQVNDITNINPADIESMDILKDGASAAIYGSKSANGVVLI
TTKQGKPGFPKMTVDYISSVSFLNNLVPVSNTRQWNKFESLRTGSTDASGQVEDSLGIRSQLVVDVQDAIKQLGVKNQVN
LAFSGGGEKSKFYWNTGYLDETGIVKGSGYNRITSNLKIDFDLNKFITAGTRMTGTYQMQDGINEGSVFRNLSYRQPNVL
LVDFDGSYIRERYARNNPLARAELQVNDNRQFSSTIFNYISVKLAPGLTFKTTLGFNYRNQKLNQFNPQETVNIDNGKIN
GRERVNTFYDFQNENFFNYNKTFNDKHTVTGLAGFSIQRWWYEYSDLNAIEFNNDYIQTFNNVKEYNLNTTGTDATTHAL
SSLYARIGYDYKSKYLITASIRRDGSSRFGENRIWGNFPAIQLGWKISEENFMKSLGFINLLKLRASYAITGNERIGDFE
SIALYNPGFFYNSVNGFAPVQLGNGDLGWEETAQQNYGIDLSLFKRRLNVSVDRYVKTTDDLLYNVPIPQETGFSNIRAN
IGSVENRGWEVSIAAKPIRNERFTWTTSFNFSYNENEVLELADEDGFETGGYLIEEGESLGNMYGYKNLGVFQYDESNAF
TPDGIRLTPNFDANQNFVNYTLNGQAYNGDIERLKFANKVLRGGDIIFQDQNGDFNIDAANDRTIIGNGLSDFAGGFSNR
FDYNGFFFSFLFNYNFGNDIYRDYDHIRDKASNAVYAPSPDRIDGAWVNPGDITKYPSLEVSRANNRSGYESNYVSSADF
ISLRNIQLGYSFNPDTLNKLGFINRLSLNASINNVFMFTNYEGYNPELGNRGNALEPGWDSLRYPNQTEIVIGLNVEF
>T2KPJ3 ~~~~~~TonB-dependent receptor P3~~~COG4771
MTTKNNKQLKSVLFMFLLLIGAYVKAQEKNVSGTVTSSEDGMMLPGVNIIVKGTASGTTSDFDGNYNIEVPDSNAILQFN
YLGFVTQEIKVGAQTNISVVLQVDQNELEEIVVIGYGTVKKSDVSGSVSSVKSAELTAYPTVSAEQALQGRAAGVQVQSN
NGGEPGAPIKVRIRGGTSINASSDALIVVDGFVGASMPAPQDIASMEVLKDASATAIYGSRGANGVIMVTTKKGTSSKPT
LELNTSYSLQHVNNTIDLLDADEFATYRQAYSENYVQGPANTDWQDEIYTTGSISNTQLAFSGGSDNSKYYISGNYFAQD
GVVINSNLERFTILSNVDVDITKRFKVGLNVFGGRSTKDGVSTQAQTGGTGGGDVISSAYRFAPDLGIYNADGTYTINSL
GDDIDNPYALATESVDERKADTYRANFYAAYEFIDGLEFKTTFGFSSENTQIGKFKPTTILAGAGVGGEATFEYRNTTNT
LSENYLTYNKSFGAHNLSLLGGYSYQKVQNEGAFAGARSFVTNEVSYRNLEGGAVTMQPSSYLNETELVSVFGRVNYEYA
SKYIFTFTARRDGSSNFSKNNKYAFFPSGAIAWNMAKENFLKDSNTITTWKWRASYGATGNPSISPYETLAKFSSVYAVV
GDQQVNGVVLTDFANDNLKWETSKQLDLGLDVALFDNRLELSFDYYTIKTEDLLFPRPLPEYSGVSSQIQNIGELENKGY
EFSINSRNITNQDFTWSTAFNFSRNKNKMVKLPDGDDLFIDSAPGHFLQRQTQILREGEAIGSFYGYEYKGVYQGGNFPE
GTATLSGDSDPGGELFADLDGNGEISTADRKIIGDPTPDFTMGFNNDLRYKNFDMNLFFQASVGGEILNYTLLELGSGAA
NSTADMVNAWSPTNTNTDVPRPAVREKRITSRYVYDGSYVRLKNLSFGYNLPESFLGKTGLQTVRLYVSGQNLLTFTDYP
GADPEANYRNDNNQRSNTNIGLDYGSYPNVRTFTMGLNMKF
>T2KN98 5.3.1.17~~~kduI~~~4-deoxy-L-threo-5-hexosulose-uronate ketol-isomerase~~~COG3717
MSTKYESRYASSPQTVKQYDTQELRNEFLIDNLMQNDTINLTYTHYDRYIAGSAVPTSSPLTLETIDPLKSEYFLERREL
GIINVGGTGSVTVDGTVYELGLKDALYVGMGNKDVVFASDDASNPAQFYLNSAPAHTNYPTKKVSKAEANKIELGTLETA
NHRTVNQMIIGGIVTTCQLQMGMTELKTGSVWNTMPAHVHNRRMEVYLYIDIPQDQAVCHFMGEPQETRHIWMQNNQAVI
SPPWSIHSGSGTSNYTFVWGMAGENLDYNDMDVAKITELR
>T2KLZ8 1.1.1.127~~~kduD~~~2-dehydro-3-deoxy-D-gluconate 5-dehydrogenase~~~COG1028
MSVDLFDVKGKIALVTGSTHGLGMAMAKGLGLAGATIVVNGNSSQDKIDSAIAEYEKEGIKAVGYKFNVAKEDEVQAAVS
KIEAEVGPIDILINNAGIIKRTPLLEMEVADFKEVVDIDLVSPFIVSKHVVKNMVERKAGKVINICSMMSELGRNSVGAY
AAAKGGLKMLTQNMATEWAKYNIQVNGIGPGYFATSQTAPIRVDGHPFNDFIINRTPAAKWGDPNDLAGAAIFLSSKASD
FVNGHVVYVDGGILATIGKPSNEE
>T2KPJ7 3.2.1.31~~~~~~Putative beta-glucuronidase~~~COG3250
MGFCMKDSKQYYKSSIGKSLKRSNGYLKLVLVLYLIMVSWSGYSKEVFNSRTKENINANWLYLEKNIKDINLALNDANWE
SINLPHTWNALDATDLNPGYRRSGSWYKKELAISNIENNKLYQLYFEGVNINSEVYVNGQKAGGHIGGYIGFTIDITEFI
KSGKNDIVIRVDNSYDPEVIPSQKSDFFIFGGITRDVWLETIPKQHLSELKITTPKVSENEAELLATVAINNLNNSNLKV
QANLLDAQGVTVVSSVFKIKNNTAKIHFRNIKNPKLWDTEHPNLYTLKVALLEKGDVIDSVQNRVGFRWFEFKDHGAFYL
NGKRVLLRGTHRHEEHAGVGAAMSNMQHRKDMELIKDMGANFVRLAHYPQDPEVYKACDELGLLIWDELPWCRGGLGNET
WKTNTKNMLTEIINQNYNHPSIIIWSLGNEMYWLPDFENGDDTDKMNSFLTELNDLAHQLDPSRKTAIRKYYEGSHIVDV
FSPSIWSGWYSGSYKSYQKAIDTYKKEYPHFLHAEYGGSSHVGRHTENPVTGEGKIQSDGWEEEIVQTDVANIAQIGDWS
ENYIVDLFDWHLRISENDENFVGNVQWAFKDFGTPLRPENAIPYMNQKGLVDRAGNPKDAFYVFKSYWSKEPFTYIESHT
WTERQGPKDLARDISVYSNCPEVELFLNGKSLGVKKRDLKVFPAAGLNWNLNFKEGKNTLVAVGKTKENKTVKDELAINY
RFTKNGKAVGLKLESELLENGNYLVTAIAYDKNGLRCLDYEDQVYFQCLSGGETLKSQGTPTGSESIAMANGKAAIEVKR
DGKNIPVVMMVLNQNFKGTYLTIE
>T2KMF9 3.1.1.31~~~pgl~~~6-phosphogluconolactonase~~~COG2706
MFVGSFTDKKPGTGIHVFDFNTKSGEAQLLSEVDSIINSSFLKLSPNGKYLYSVIESQLQTHGKIAAFKIDSNAGDLKLI
NMQDCGGRNPAHIEIDKSGKFLAVSNYTDPSLSFFEVDETGKIKKIDEFFTFTGSGIVKGNQDTAHIHSSNFSLENDYLF
LQDLGSDCIHKFKVNLDANQNMSLQKADAIKVKPGSGPRHFVFHQNGKYGYGINELSGKVSAYALLNGNLKFLADYNAYS
KKQDSYRSADIHISPDGKFLYASNRGPNEDSIVIFSINKSNGALKLIGHEPTYGEHPRNFAIDPSGQFLLVANQFSNNIV
IFRRDVETGKLQKLPQELVVNGSSSLQMFTYSH
>P76002 ~~~pliG~~~Inhibitor of g-type lysozyme~~~
MKIKSIRKAVLLLALLTSTSFAAGKNVNVEFRKGHSSAQYSGEIKGYDYDTYTFYAKKGQKVHVSISNEGADTYLFGPGI
DDSVDLSRYSPELDSHGQYSLPASGKYELRVLQTRNDARKNKTKKYNVDIQIK
>Q7N561 ~~~pllA~~~Lectin A~~~
MSDWSGSVPANAENGKSTGLILKQGDTISVVAHGWVKYGRDNVEWAAPDGPVPNNPQPSSIATLVAKIANKKFAIGNGVL
HKTVPVDGELILLFNDVPGTFGDNSGEFQVEVIIESRYSPLK
>P80214 ~~~plnA~~~Bacteriocin plantaricin-A~~~
MKIQIKGMKQLSNKEMQKIVGGKSSAYSLQMGATAIKQVKKLFKKWGW
>C0HJC0 ~~~~~~Bacteriocin plantaricin KL-1Y~~~
GRADYNFGYGLGRGTRKFFNGIGRWVRKTF
>P40415 5.2.1.8~~~~~~Probable parvulin-type peptidyl-prolyl cis-trans isomerase~~~COG0760
MKRIAMLAAACVIAVPAFAQNVATVNGKPITQKSLDEFVKLVVSQGATDSPQLREQIKQEMINRQVFVQAAEKDGVAKQA
DVQTEIELARQGILVRALMADYLQKHPVTDAQVKAEYEKIKKEQAGKMEYKVRHILVEDEKTANDLLAQVKSNKNKFDDL
AKKNSKDPGSAERGGDLGWAPATNYVQPFAEAVTKLKKGQLVDKPVQTQFGWHVIQVDDTRPVEFPAMDQVRPQLEEMLR
QQTLANYQKQLREQAKIQ
>Q7NBF9 ~~~plpA~~~Fibronectin-binding protein PlpA~~~
MDNNQNNFNQPGQQGFDQYQQQSGALVSYGYDANGNPVSDPSLAVYDANGMLINQNQYDQNQQQEYDQYGNPVGMLSGNV
YSQENDPYYQQYNQQQNQGYEQQYDEYGNPIGLLPGSENQQNNQQQYDQYGNPIGMLPGGTNDQAYDPNQMQYDQYGNPV
GMLAGVNANDQGYDYNQMQYDEYGNPVGMLPGGNANDQGYDYNQMQYDEYGNPPALYDNNQQDYYGYDQNQQYDQANNQL
AVVDENYEQEQQVESNEEPAHEQDLREFLNNNSDTELVSYYEEEEDEKKPRNKKKQRQTAQARGLLPELATVNQPDQTPI
TPHLEAPVHYDENEELSTDDLSDDINLDQNQAVHHDAEDDVINIPIEDIESALLPKFEEIQRHNLEEIQKVKLEAAENFK
VLQRANEELKSSNNELKTTNQQLKTSNEALEDSNKRIESQLQALLDSINDIKSKNNQPSQEEVDSRSKLEQRLEELAYKL
DQTKEIIDETSESSRESFQQSKDQLIENFEKKIELLTEKLNQTQESFNQSQEAKQQQDKEFAQKIERIIEQTKEANDSLQ
NRVKESSLDMESKLENKFESFADKLTEITSKKISEKMTEQQASKKEEIDSLQSSFQKALSEVVSKVENYANQSQLQSQHL
NQTLSYHQQQINNSLRQSAQMAQMAQMQHMMPQQMLMGMNNPYGFNHHTMMPPVHQYQQLPPPPPVQAPAQQLLPTINNP
LQLPNENPTLFEKLMLANMFKQTINPPQPQPQALPQPHPQPQQLPPQILALPPTVVQQPNYLVPQPPRQPDYYSNRLNER
MMLDDAYNAGYDEAVYELENQYYPPAYEYPEYEEIQPSFRRRGGRAKFDPYNNR
>P67080 ~~~yggS~~~Pyridoxal phosphate homeostasis protein~~~COG0325
MNDIAHNLAQVRDKISAAATRCGRSPEEITLLAVSKTKPASAIAEAIDAGQRQFGENYVQEGVDKIRHFQELGVTGLEWH
FIGPLQSNKSRLVAEHFDWCHTIDRLRIATRLNDQRPAELPPLNVLIQINISDENSKSGIQLAELDELAAAVAELPRLRL
RGLMAIPAPESEYVRQFEVARQMAVAFAGLKTRYPHIDTLSLGMSDDMEAAIAAGSTMVRIGTAIFGARDYSKK
>P44506 ~~~~~~Pyridoxal phosphate homeostasis protein~~~COG0325
MNIQHNLNLIQQKIETACKEENRNQNTVKLLAVSKTKPISAILSAYQAGQTAFGENYVQEGVEKIQYFESQGINLEWHFI
GPLQSNKTRLVAEHFDWMQTLDRAKIADRLNEQRPTNKAPLNVLIQINISDEESKSGIQPEEMLTLAKHIENLPHLCLRG
LMAIPAPTDNIAEQENAFRKMLELFEQLKQVLPNQQIDTLSMGMTDDMPSAIKCGSTMVRIGTAIFGARNYSTSQNK
>P9WFQ7 ~~~~~~Pyridoxal phosphate homeostasis protein~~~COG0325
MAADLSAYPDRESELTHALAAMRSRLAAAAEAAGRNVGEIELLPITKFFPATDVAILFRLGCRSVGESREQEASAKMAEL
NRLLAAAELGHSGGVHWHMVGRIQRNKAGSLARWAHTAHSVDSSRLVTALDRAVVAALAEHRRGERLRVYVQVSLDGDGS
RGGVDSTTPGAVDRICAQVQESEGLELVGLMGIPPLDWDPDEAFDRLQSEHNRVRAMFPHAIGLSAGMSNDLEVAVKHGS
TCVRVGTALLGPRRLRSP
>Q55500 2.5.1.39~~~plqA~~~4-hydroxybenzoate solanesyltransferase~~~COG0382
MVAQTPSSPPLWLTIIYLLRWHKPAGRLILMIPALWAVCLAAQGLPPLPLLGTIALGTLATSGLGCVVNDLWDRDIDPQV
ERTKQRPLAARALSVQVGIGVALVALLCAAGLAFYLTPLSFWLCVAAVPVIVAYPGAKRVFPVPQLVLSIAWGFAVLISW
SAVTGDLTDATWVLWGATVFWTLGFDTVYAMADREDDRRIGVNSSALFFGQYVGEAVGIFFALTIGCLFYLGMILMLNPL
YWLSLAIAIVGWVIQYIQLSAPTPEPKLYGQIFGQNVIIGFVLLAGMLLGWL
>P9WI59 ~~~plsB1~~~Putative acyltransferase plsB1~~~COG2937
MTAREVGRIGLRKLLQRIGIVAESMTPLATDPVEVTQLLDARWYDERLRALADELGRDPDSVRAEAAGYLREMAASLDER
AVQAWRGFSRWLMRAYDVLVDEDQITQLRKLDRKATLAFAFSHRSYLDGMLLPEAILANRLSPALTFGGANLNFFPMGAW
AKRTGAIFIRRQTKDIPVYRFVLRAYAAQLVQNHVNLTWSIEGGRTRTGKLRPPVFGILRYITDAVDEIDGPEVYLVPTS
IVYDQLHEVEAMTTEAYGAVKRPEDLRFLVRLARQQGERLGRAYLDFGEPLPLRKRLQEMRADKSGTGSEIERIALDVEH
RINRATPVTPTAVVSLALLGADRSLSISEVLATVRPLASYIAARNWAVAGAADLTNRSTIRWTLHQMVASGVVSVYDAGT
EAVWGIGEDQHLVAAFYRNTAIHILVDRAVAELALLAAAETTTNGSVSPATVRDEALSLRDLLKFEFLFSGRAQFEKDLA
NEVLLIGSVVDTSKPAAAADVWRLLESADVLLAHLVLRPFLDAYHIVADRLAAHEDDSFDEEGFLAECLQVGKQWELQRN
IASAESRSMELFKTALRLARHRELVDGADATDIAKRRQQFADEIATATRRVNTIAELARRQ
>P0A7A7 2.3.1.15~~~plsB~~~Glycerol-3-phosphate acyltransferase~~~COG2937
MSGWPRIYYKLLNLPLSILVKSKSIPADPAPELGLDTSRPIMYVLPYNSKADLLTLRAQCLAHDLPDPLEPLEIDGTLLP
RYVFIHGGPRVFTYYTPKEESIKLFHDYLDLHRSNPNLDVQMVPVSVMFGRAPGREKGEVNPPLRMLNGVQKFFAVLWLG
RDSFVRFSPSVSLRRMADEHGTDKTIAQKLARVARMHFARQRLAAVGPRLPARQDLFNKLLASRAIAKAVEDEARSKKIS
HEKAQQNAIALMEEIAANFSYEMIRLTDRILGFTWNRLYQGINVHNAERVRQLAHDGHELVYVPCHRSHMDYLLLSYVLY
HQGLVPPHIAAGINLNFWPAGPIFRRLGAFFIRRTFKGNKLYSTVFREYLGELFSRGYSVEYFVEGGRSRTGRLLDPKTG
TLSMTIQAMLRGGTRPITLIPIYIGYEHVMEVGTYAKELRGATKEKESLPQMLRGLSKLRNLGQGYVNFGEPMPLMTYLN
QHVPDWRESIDPIEAVRPAWLTPTVNNIAADLMVRINNAGAANAMNLCCTALLASRQRSLTREQLTEQLNCYLDLMRNVP
YSTDSTVPSASASELIDHALQMNKFEVEKDTIGDIIILPREQAVLMTYYRNNIAHMLVLPSLMAAIVTQHRHISRDVLME
HVNVLYPMLKAELFLRWDRDELPDVIDALANEMQRQGLITLQDDELHINPAHSRTLQLLAAGARETLQRYAITFWLLSAN
PSINRGTLEKESRTVAQRLSVLHGINAPEFFDKAVFSSLVLTLRDEGYISDSGDAEPAETMKVYQLLAELITSDVRLTIE
SATQGEG
>P9WI61 2.3.1.15~~~plsB~~~Glycerol-3-phosphate acyltransferase~~~COG2937
MTKPAADASAVLTAEDTLVLASTATPVEMELIMGWLGQQRARHPDSKFDILKLPPRNAPPAALTALVEQLEPGFASSPQS
GEDRSIVPVRVIWLPPADRSRAGKVAALLPGRDPYHPSQRQQRRILRTDPRRARVVAGESAKVSELRQQWRDTTVAEHKR
DFAQFVSRRALLALARAEYRILGPQYKSPRLVKPEMLASARFRAGLDRIPGATVEDAGKMLDELSTGWSQVSVDLVSVLG
RLASRGFDPEFDYDEYQVAAMRAALEAHPAVLLFSHRSYIDGVVVPVAMQDNRLPPVHMFGGINLSFGLMGPLMRRSGMI
FIRRNIGNDPLYKYVLKEYVGYVVEKRFNLSWSIEGTRSRTGKMLPPKLGLMSYVADAYLDGRSDDILLQGVSICFDQLH
EITEYAAYARGAEKTPEGLRWLYNFIKAQGERNFGKIYVRFPEAVSMRQYLGAPHGELTQDPAAKRLALQKMSFEVAWRI
LQATPVTATGLVSALLLTTRGTALTLDQLHHTLQDSLDYLERKQSPVSTSALRLRSREGVRAAADALSNGHPVTRVDSGR
EPVWYIAPDDEHAAAFYRNSVIHAFLETSIVELALAHAKHAEGDRVAAFWAQAMRLRDLLKFDFYFADSTAFRANIAQEM
AWHQDWEDHLGVGGNEIDAMLYAKRPLMSDAMLRVFFEAYEIVADVLRDAPPDIGPEELTELALGLGRQFVAQGRVRSSE
PVSTLLFATARQVAVDQELIAPAADLAERRVAFRRELRNILRDFDYVEQIARNQFVACEFKARQGRDRI
>O07584 2.3.1.n4~~~plsC~~~1-acyl-sn-glycerol-3-phosphate acyltransferase~~~COG0204
MYKFCANALKVILSLRGGVKVYNKENLPADSGFVIACTHSGWVDVITLGVGILPYQIHYMAKKELFQNKWIGSFLKKIHA
FPVDRENPGPSSIKTPIKLLKEGEIVGIFPSGTRTSEDVPLKRGAVTIAQMGKAPLVPAAYQGPSSGKELFKKGKMKLII
GEPLHQADFAHLPSKERLAAMTEALNQRIKELENKLDQL
>P26647 2.3.1.51~~~plsC~~~1-acyl-sn-glycerol-3-phosphate acyltransferase~~~COG0204
MLYIFRLIITVIYSILVCVFGSIYCLFSPRNPKHVATFGHMFGRLAPLFGLKVECRKPTDAESYGNAIYIANHQNNYDMV
TASNIVQPPTVTVGKKSLLWIPFFGQLYWLTGNLLIDRNNRTKAHGTIAEVVNHFKKRRISIWMFPEGTRSRGRGLLPFK
TGAFHAAIAAGVPIIPVCVSTTSNKINLNRLHNGLVIVEMLPPIDVSQYGKDQVRELAAHCRSIMEQKIAELDKEVAERE
AAGKV
>Q8DNY1 2.3.1.n4~~~plsC~~~1-acyl-sn-glycerol-3-phosphate acyltransferase~~~COG0204
MIRYNNNKKTIEGDRMFYTYLRGLVVLLLWSINGNAHYHNTDKIPNQDENYILVAPHRTWWDPVYMAFATKPKQFIFMAK
KELFTNRIFGWWIRMCGAFPIDRENPSASAIKYPINVLKKSDRSLIMFPSGSRHSNDVKGGAALIAKMAKVRIMPVTYTG
PMTLKGLISRERVDMNFGNPIDISDIKKMNDEGIETVANRIQTEFQRLDEETKQWHNDKKPNPLWWFIRIPALILAIILA
ILTIIFSFIASFIWNPDKKREELA
>P71018 2.3.1.274~~~plsX~~~Phosphate acyltransferase~~~COG0416
MRIAVDAMGGDHAPKAVIDGVIKGIEAFDDLHITLVGDKTTIESHLTTTSDRITVLHADEVIEPTDEPVRAVRRKKNSSM
VLMAQEVAENRADACISAGNTGALMTAGLFIVGRIKGIDRPALAPTLPTVSGDGFLLLDVGANVDAKPEHLVQYAIMGSV
YSQQVRGVTSPRVGLLNVGTEDKKGNELTKQTFQILKETANINFIGNVEARDLLDDVADVVVTDGFTGNVTLKTLEGSAL
SIFKMMRDVMTSTLTSKLAAAVLKPKLKEMKMKMEYSNYGGASLFGLKAPVIKAHGSSDSNAVFHAIRQAREMVSQNVAA
LIQEEVKEEKTDE
>P27247 2.3.1.274~~~plsX~~~Phosphate acyltransferase~~~COG0416
MTRLTLALDVMGGDFGPSVTVPAALQALNSNSQLTLLLVGNSDAITPLLAKADFEQRSRLQIIPAQSVIASDARPSQAIR
ASRGSSMRVALELVKEGRAQACVSAGNTGALMGLAKLLLKPLEGIERPALVTVLPHQQKGKTVVLDLGANVDCDSTMLVQ
FAIMGSVLAEEVVEIPNPRVALLNIGEEEVKGLDSIRDASAVLKTIPSINYIGYLEANELLTGKTDVLVCDGFTGNVTLK
TMEGVVRMFLSLLKSQGEGKKRSWWLLLLKRWLQKSLTRRFSHLNPDQYNGACLLGLRGTVIKSHGAANQRAFAVAIEQA
VQAVQRQVPQRIAARLESVYPAGFELLDGGKSGTLR
>Q82ZE8 2.3.1.274~~~plsX~~~Phosphate acyltransferase~~~COG0416
MKIAVDAMGGDNAPQAIVEGVMLAKQDFPDIEFQLYGKEAEIKKYITDEKNITIIHTDEKIASDDEPVKAIRRKKTASMV
LAAQAVKNGEADAIFSAGNTGALLAAGLFIVGRIKNVERPGLMSTLPVMGEPDKGFDMLDLGANADNKPEHLVQYAVLGS
FYAEKVRNVQNPRVGLLNNGTEETKGSELTKKAFELLAADETINFVGNVEARELLNGVADVVVTDGFTGNAVLKSIEGTA
MNMMSLLKTAILSEGVKGKMGALLLKNALHGMKDEMDYSKHGGAVLFGLKAPVIKTHGATGPDAVRYTIRQIHTMLETQV
VPQLVEYYEGKAE
>P65739 2.3.1.274~~~plsX~~~Phosphate acyltransferase~~~
MVKLAIDMMGGDNAPDIVLEAVQKAVEDFKNLEIMLFGDEKKYNLNHERIEFRHCSEKIEMEDEPVRAIKRKKDSSMVKM
AEAVKSGEADGCVSAGNTGALMSVGLFIVGRIKGVARPALVVTLPTIDGKGFVFLDVGANADAKPEHLLQYAQLGDIYAQ
KIRGIDNPKISLLNIGTEPAKGNSLTKKSFELLNQDHSLNFVGNIEAKTLMDGDTDVVVTDGYTGNMVLKNLEGTAKSIG
KMLKDTIMSSTKNKLAGAILKKDLAEFAKKMDYSEYGGSVLLGLEGTVVKAHGSSNAKAFYSAIRQAKIAGEQNIVQTMK
ETVGESNE
>Q8DRN3 2.3.1.274~~~plsX~~~Phosphate acyltransferase~~~COG0416
MKKIAVDAMGGDYAPQAIVEGVNQALSDFSDIEVQLYGDEAKIKQYLTATERVSIIHTDEKIDSDDEPTRAIRNKKNASM
VLAAKAVKDGEADAVLSAGNTGALLAAGFFIVGRIKNIDRPGLMSTLPTVDGKGFDMLDLGANAENTAQHLHQYAVLGSF
YAKNVRGIAQPRVGLLNNGTESSKGDPLRKETYELLAADESLNFIGNVEARDLMNGVADVVVADGFTGNAVLKSIEGTAM
GIMGLLKTAITGGGLRAKLGALLLKDSLRGLKKQLNYSDVGGAVLFGVKAPVVKTHGSSDAKAVYSTIRQIRTMLETDVV
AQTAREFSGE
>O66905 2.3.1.275~~~plsY~~~Glycerol-3-phosphate acyltransferase~~~COG0344
MKALFLVIFAYLLGSITFGEVIAKLKGVDLRNVGSGNVGATNVTRALGKKYGVLVFFLDFLKGFIPALIAVKSFGIDSWV
LTFTGLASVLGHMYPVFFGFKGGKGVATALGVVFAVSPSVALFSFLVWLGIFLWKRYVSLASITATISAFLFLFVAGYPV
NVLFMAIVIGALIIYRHRENINRLLTGREHRF
>Q45064 2.3.1.275~~~plsY~~~Glycerol-3-phosphate acyltransferase~~~COG0344
MLIALLIILAYLIGSIPSGLIVGKLAKGIDIREHGSGNLGATNAFRTLGVKAGSVVIAGDILKGTLATALPFLMHVDIHP
LLAGVFAVLGHVFPIFAKFKGGKAVATSGGVLLFYAPLLFITMVAVFFIFLYLTKFVSLSSMLTGIYTVIYSFFVHDTYL
LIVVTLLTIFVIYRHRANIKRIINKTEPKVKWL
>P60782 2.3.1.15~~~plsY~~~Probable glycerol-3-phosphate acyltransferase~~~COG0344
MSAIAPGMILIAYLCGSISSAILVCRLCGLPDPRTSGSGNPGATNVLRIGGKGAAVAVLIFDVLKGMLPVWGAYELGVSP
FWLGLIAIAACLGHIWPVFFGFKGGKGVATAFGAIAPIGWDLTGVMAGTWLLTVLLSGYSSLGAIVSALIAPFYVWWFKP
QFTFPVSMLSCLILLRHHDNIQRLWRRQETKIWTKFKRKREKDPE
>P0A4Q0 2.3.1.275~~~plsY~~~Glycerol-3-phosphate acyltransferase~~~COG0344
MITIVLLILAYLLGSIPSGLWIGQVFFQINLREHGSGNTGTTNTFRILGKKAGMATFVIDFFKGTLATLLPIIFHLQGVS
PLIFGLLAVIGHTFPIFAGFKGGKAVATSAGVIFGFAPIFCLYLAIIFFGALYLGSMISLSSVTASIAAVIGVLLFPLFG
FILSNYDSLFIAIILALASLIIIRHKDNIARIKNKTENLVPWGLNLTHQDPKK
>P61598 ~~~~~~Putative surface protein SA2285~~~
MRDKKGPVNKRVDFLSNKLNKYSIRKFTVGTASILIGSLMYLGTQQEAEAAENNIENPTTLKDNVQSKEVKIEEVTNKDT
APQGVEAKSEVTSNKDTIEHEASVKAEDISKKEDTPKEVANVAEVQPKSSVTHNAEAPKVRKARSVDEGSFDITRDSKNV
VESTPITIQGKEHFEGYGSVDIQKNPTDLGVSEVTRFNVGNESNGLIGALQLKNKIDFSKDFNFKVRVANNHQSNTTGAD
GWGFLFSKGNAEEYLTNGGILGDKGLVNSGGFKIDTGYIYTSSMDKTEKQAGQGYRGYGAFVKNDSSGNSQMVGENIDKS
KTNFLNYADNSTNTSDGKFHGQRLNDVILTYVASTGKMRAEYAGKTWETSITDLGLSKNQAYNFLITSSQRWGLNQGINA
NGWMRTDLKGSEFTFTPSAKNNNRIRKKVEEIPFKKERKFNPDLAPGTEKVTREGQKGEKTITTPTLKNPLTGEIISKGE
SKEEITKDPINELTEYGPETIAPGHRDEFDPKLPTGEKEEVPGKPGIKNPETGDVVRPPVDSVTKYGPVKGDSIVEKEEI
PFEKERKFNPDLAPGTEKVTREGQKGEKTITTPTLKNPLTGEIISKGESKEEITKDPINELTEYGPETIAPGHRDEFDPK
LPTGEKEEVPGKPGIKNPETGDVVRPPVDSVTKYGPVKGDSIVEKEEIPFEKERKFNPDLAPGTEKVTREGQKGEKTITT
PTLKNPLTGEIISKGESKEEITKDPINELTEYGPETIAPGHRDEFDPKLPTGEKEEVPGKPGIKNPETGDVVRPPVDSVT
KYGPVKGDSIVEKEEIPFEKERKFNPDLAPGTEKVTREGQKGEKTITTPTLKNPLTGEIISKGESKEEITKDPINELTEY
GPETIAPGHRDEFDPKLPTGEKEEVPGKPGIKNPETGDVVRPPVDSVTKYGPVKGDSIVEKEEIPFKKERKFNPDLAPGT
EKVTREGQKGEKTITTPTLKNPLTGEIISKGESKEEITKDPINELTEYGPETITPGHRDEFDPKLPTGEKEEVPGKPGIK
NPETGDVVRPPVDSVTKYGPVKGDSIVEKEEIPFEKERKFNPDLAPGTEKVTREGQKGEKTITTPTLKNPLTGEIISKGE
SKEEITKDPVNELTEFGGEKIPQGHKDIFDPNLPTDQTEKVPGKPGIKNPDTGKVIEEPVDDVIKHGPKTGTPETKTVEI
PFETKREFNPKLQPGEERVKQEGQPGSKTITTPITVNPLTGEKVGEGQPTEEITKQPVDKIVEFGGEKPKDPKGPENPEK
PSRPTHPSGPVNPNNPGLSKDRAKPNGPVHSMDKNDKVKKSKIAKESVANQEKKRAELPKTGLESTQKGLIFSSIIGIAG
LMLLARRRKN
>P80544 ~~~pls~~~Surface protein~~~
MNKNSKKKLDFLPNKLNKYSIRRFTVGTASILVGATLIFGVANDQAEAAENNTTQKQDDSSDASKVKGNVQTIEQSSANS
NESDIPEQVDVTKDTTEQASTEEKANTTEQASTEEKADTTEQATTEEAPKAEGTDKVETEEAPKAEETDKATTEEAPKAE
ETDKATEEAPKTEETDKATTEEAPAAEETSKAATEEAPKAEETSKAATEEAPKAEETEKTATEEAPKTEETDKVETEEAP
KAEETSKAATEKAPKAEETNKVETEEAPAAEETNKAATEETPAVEDTNAKSNSNAQPSETERTQVVDTVAKDLYKKSEVT
EAEKAEIEKVLPKDISNLSNEEIKKIALSEVLKETANKENAQPRATFRSVSSNARTTNVNYSATALRAAAQDTVTKKGTG
NFTAHGDIIHKTYKEEFPNEGTLTAFNTNFNPNTGTKGALEYNDKIDFNKDFTITVPVANNNQGNTTGADGWGFMFTQGN
GQDFLNQGGILRDKGMANASGFKIDTAYNNVNGKVDKLDADKTNNLSQIGAAKVGYGTFVKNGADGVTNQVGQNALNTKD
KPVNKIIYADNTTNHLDGQFHGQRLNDVVLNYDAATSTITATYAGKTWKATTDDLGIDKSQKYNFLITSSHMQNRYSNGI
MRTNLEGVTITTPQADLIDDVEVTKQPIPHKTIREFDPTLEPGSPDVIVQKGEDGEKTTTTPTKVDPDTGDVVERGEPTT
EVTKNPVDEIVHFTPEEVPQGHKDEFDPNLPIDGTEEVPGKPGIKNPETGEVVTPPVDDVTKHGPKAGEPEVTKEEIPFE
KKREFNPDLKPGEEKVTQEGQTGEKTTTTPTTINPLTGEKVGEGEPTTEVTKEPVDEITQFGGEEVPQGHKDEFDPNLPI
DGTEEVPGKPGIKNPETGEVVTPPVDDVTKHGPKAGEPEVTKEEIPFEKKREFNPDLKPGEEKVTQEGQTGEKTTTTPTT
INPLTGEKVGEGEPTTEVTKEPVDEITQFGGEEVPQGHKDEFDPNLPIDGTEEVPGKPGIKNPETGEVVTPPVDDVTKHG
PKAGEPEVTKEEIPFEKKREFNPDLKPGEEKVTQEGQTGEKTTTTPTTINPLTGEKVGEGEPTTEVTKEPVDEITQFGGE
EVPQGHKDEFDPNLPIDGTEEVPGKPGIKNPETGEVVTPPVDDVTKHGPKAGEPEVTKEEIPYETKRVLDPTMEPGSPDK
VAQKGENGEKTTTTPTTINPLTGEKVGEGEPTTEVTKEPIDEIVNYAPEIIPHGTREEIDPNLPEGETKVIPGKDGLKDP
ETGEIIEEPQDEVIIHGAKDDSDADSDSDADSDSDADSDSDADSDSDADSDSDSDSDSDSDSDSDADSDSDSDSDSDADS
DSDADSDSDADSDSDSDADSDSDSDADSDSDSDSDSDADSDSDSDSDSDADSDSDADSDSDSDSDSDADSDSDSDSDSDA
DSDSDADSDSDADSDSDADSDSDSDSDSDADSDSDADSDSDADSDSDADSDSDSDSDSDADSDSDSDSDSDSDADSDSDA
DSDSDSDADSDSDADSDSDADGDSDADSDSDADSDSDSDSDSDSDSDSDADSDSDSDSDSDADRDHNDKTDKPNNKELPD
TGNDAQNNGTLFGSLFAALGGLFLVGRRRKNKNNEEK
>Q4KCZ0 1.14.19.56~~~pltA~~~1H-pyrrole-2-carbonyl-[peptidyl-carrier protein] chlorinase~~~COG0644
MSDHDYDVVIIGGGPAGSTMASYLAKAGVKCAVFEKELFEREHVGESLVPATTPVLLEIGVMEKIEKANFPKKFGAAWTS
ADSGPEDKMGFQGLDHDFRSAEILFNERKQEGVDRDFTFHVDRGKFDRILLEHAGSLGAKVFQGVEIADVEFLSPGNVIV
NAKLGKRSVEIKAKMVVDASGRNVLLGRRLGLREKDPVFNQFAIHSWFDNFDRKSATQSPDKVDYIFIHFLPMTNTWVWQ
IPITETITSVGVVTQKQNYTNSDLTYEEFFWEAVKTRENLHDALKASEQVRPFKKEADYSYGMKEVCGDSFVLIGDAARF
VDPIFSSGVSVALNSARIASGDIIEAVKNNDFSKSSFTHYEGMIRNGIKNWYEFITLYYRLNILFTAFVQDPRYRLDILQ
LLQGDVYSGKRLEVLDKMREIIAAVESDPEHLWHKYLGDMQVPTAKPAF
>Q4KCY6 1.3.8.14~~~pltE~~~L-prolyl-[peptidyl-carrier protein] dehydrogenase~~~COG1960
MDFNYDDTQKKHAAMIAQVCAEQLAACGNEHSRYFTARQWAICGEAGLLGLSIPREYGGQGLGALSTAIAMHAFGLGCTD
MGLVFAAAAHQFACAMPIVEFATAETKRDVLPKLASGEFIGSNAITEPEAGSDSSNLKSRAWPQADGSYRLDGHKSFAGN
APIADIFVTYATTQPEYGALGVSGFIVHRSSAGLRVSEPLDKVCLRSCPAGEVFFDDCRVPEVNRLGEEGQGRQVFQSSM
GWERACLFAAFLGMMERQLEQTIEHARTRRQFGKPIGDNQAVSHRIAQMKLRLESARLLLFRACWGMDQGDPGQLNIALS
KLAISEGALASSIDAVRIFGGRGCLESFGIEAMLRDSIGTTIFSGTSDMQHEIIARELKL
>Q4KCY5 6.2.1.53~~~pltF~~~L-proline--[L-prolyl-carrier protein] ligase~~~COG1020
MKLLHERMMHSLARYPRQTAVVDEQDALSYEALELRIREFVAMLCALGVGQGQRILLWAHKSVDLVAVMQAALRLGVVYV
PVDPLSPVSRLEKIAGDSQAVLVLCTAARLEELAGSALAQVRSVVLDDPASAGYWRNIDTGSSVVPTLAIQPDDLAYILY
TSGSTGVPKGVALSHGNALAFVDWACERYCFQPGERFANHAPLHFDLSVLDIYCALNVGATVCLVPESIAFSPRLLTDFI
RQHEISIWYSVPSVLMMMMQDGDLLSDIQDTLRVLLFAGEPFPVKHLRDLRAAYADVRLANLFGPTETNVCTAFEVGAID
PERVLPVPIGTAASGNQVWAQKPDGSRCAVGEEGELVVQGPTVMLGYFAKPAQEGPYKTGDMVRQRPDGNYEYLGRRDDM
LKVRGNRIERGEVEAALLAHPQVSEAAVLVVGEGMNAQLWGVLVAHTRDALSLIDLKRHCAQRLPRYMIIDKVLCLDALP
RNANGKVDRFALARQVEG
>Q6CZT4 4.2.2.2~~~pel1~~~Pectate lyase 1~~~COG3866
MKYLLPSAAAGLLLLAAQPTMAANTGGYATTDGGDVSGAVKKTARSLQEIVDIIEAAKKDSSGKVVKGGAFPLVITYNGN
EDALIKAAEANICGQWSKDPRGVEIKEFTKGITILGTNGSSANFGIWVVNSSNVVVRNMRFGYMPGGAKDGDAIRIDNSP
NVWIDHNEIFAKNFECAGTPDNDTTFESAVDIKKASTNVTVSYNFIHGVKKVGLSGSSNTDTGRNLTYHHNIYSDVNSRL
PLQRGGQVHAYNNLYDGIKSSGFNVRQKGIALIESNWFENALNPVTARNDDSNFGTWELRNNNITSPSDFAKYKITWGKP
STPHINADDWKSTGKFPAVSYSYSPVSAQCVKDKLANYAGVGKNQAVLTAANCK
>P0C1C0 4.2.2.2~~~pel1~~~Pectate lyase 1~~~
MKYLLPSAAAGLLLLAAQPTMAANTGGYATTDGGDVSGAVKKTARSLQEIVDIIEAAKKDSSGKAVKGGAYPLVITYNGN
EDALIKAAEANICGQWSKDPRGVEIKEFTKGITILGTNGSSANFGIWMVNSSNVVVRNMRFGYMPGGAKDGDAIRIDNSP
NVWIDHNEIFAKNFECAGTPDNDTTFESAVDIKKGATNVTVSYNYIHGVKKVGLSGSSNTDTGRDLTYHHNIYSDVNSRL
PLQRGGKVHAYNNLYDGIKSSGFNVRQKGIALIESNWFENALNPVTARNDDSNFGTWELRNNNITSPSDFAKYKITWGKP
STPHINADDWKSTGKFPAVPYSYSPVSAQCVKDKLASYAGVGKNLAVLTAANCK
>Q6CZT3 4.2.2.2~~~pel2~~~Pectate lyase 2~~~COG3866
MKYLLPTAATGLLLLAAQPAVAANTGGYATTDGGETSGAVKKTARSLQEIVDIIEAAKVDSKGKKVKGGAYPLIITYNGN
EDSLIKAAEKNICGQWSKDARGVQIKEFTKGITILGTNGSSANFGVWIVNSSDVVVRNMRFGYMPGGAQDGDAIRVDNSP
NVWIDHNEIFAKNFECKGTPDNDTTFESAVDIKKGSTNVTVSYNYIHGIKKVGLSGASNTDTGRNLTYHHNIYRDVNSRL
PLQRGGLVHAYNNLYDGITGSGFNVRQKGIALIESNWFENALNPVTARNDSSNFGTWELRNNNVTKPADFSKYNITWGRP
STPHVNADDWKNTGKFPSISYKYSPVSAQCVKDKLANYAGVSKNLAVLTAANCK
>P0C1C2 4.2.2.2~~~pel3~~~Pectate lyase 3~~~
MKYLLPSAAAGLLLLAAQPTMAANTGGYATTDGGDVAGAVKKTARSMQDIIDIIEAAKLDSNGKKVKGGAYPLVITYNGN
EDALIKAAEANICGQWSKDARGVEIKEFTKGITIIGTNGSSANFGIWLTKSSDIVIRNMRFGYMPGGAQDGDAIRIDNTP
NVWIDHNEIFAKNFECAGTKDGDTTFESAIDIKKASTNVTISYNYIHGIKKVGLSGFSSSDTGRDLTYHHNIYDDVNARL
PLQRGGQVHAYNNLYTGITSSGLNVRQKGIALIERNWFENAKNPVTSRYDGSNFGTWELRNNNVMSPADFAKYNITWDKD
SKPYVNAEDWKSTGTFASVPYSYSPVSAQCVKDKLANYAGVNKNLAVLTAANCN
>P0C1A2 4.2.2.2~~~pelA~~~Pectate lyase A~~~
MMNKASGRSFTRSSKYLLATLIAGMMASGVSAAELVSDKALESAPTVGWASQNGFTTGGAAATSDNIYIVTNISEFTSAL
SAGAEAKIIQIKGTIDISGGTPYTDFADQKARSQINIPANTTVIGLGTDAKFINGSLIIDGTDGTNNVIIRNVYIQTPID
VEPHYEKGDGWNAEWDAMNITNGAHHVWIDHVTISDGNFTDDMYTTKDGETYVQHDGALDIKRGSDYVTISNSLIDQHDK
TMLIGHNDTNSAQDKGKLHVTLFNNVFNRVTERAPRVRYGSIHSFNNVFKGDAKDPVYRYQYSFGIGTSGSVLSEGNSFT
IANLSASKACKVVKKFNGSIFSDNGSVLNGSAVDLSGCGFSAYTSKIPYIYDVQPMTTELAQSITDNAGSGKL
>P0C1A3 4.2.2.2~~~pelA~~~Pectate lyase A~~~COG3866
MNKVSGRSFTRTSTCLLATLIAGVMTSGVSAAELVNSKALESAPAAGWASQNGSTTGGAAATSDNIYVVTNISEFTSALS
AGAVAKIIQITGTVDISGGTPYKDFADQKARSQINIPANTTVIGIGTDAKFINGSLIIDGTDGTNNVIIRNVYIQTPIDV
EPHYEKGDGWNAEWDGMNITNGAHHVWVDHVTISDGSFTDDMYTTKDGETYVQHDGALDIKRGSDYVTISNSLFDQHDKT
MLIGHSDTNSAQDKGKLHVTLFNNVFNRVTERAPRVRYGSIHSFNNVFNGDVKDPVYRYLYSFGIGTSGSVLSEGNSFTI
ANLSASKACKVVKKFNGSIFSDNGSVLNGSAADLSGCGFSAYTSAIPYVYAVQPMTTELAQSITDHAGSGKL
>D3JTC1 4.2.2.2~~~pelA~~~Pectate lyase A~~~
MKKMLTLLLSAGLVASIFGVMPAAAAPTVVNSTIVVPKGTTYDGQGKTFVANPSTLGDGSQAENQKPVFRLEAGATLKNV
IIGAPAADGVHCYGNCNISNVVWQDVGEDALTLKSSGTVNITGGAAYKAYDKVFQINAAGTINIKNFRADDIGKLVRQNG
GTTFTVNMTLDNSNISNVKDAIMRTDSSSSQGRITNTRYSKVPTLFKGFASGKTSQSGNTQY
>Q9X6Z2 4.2.2.2~~~pelA~~~Pectate lyase A~~~
MKKMLTLLLSAGLVASIFGVMPAAAAPTVVNSTIVVPKGTTYDGQGKTFVANPSTLGDGSQAENQKPVFRLEAGATLKNV
IIGAPAADGVHCYGSCNISNVVWEDVGEDALTLKSSGTVNITGGAAYKAYDKVFQMNASGTINIKNFRADDIGKLVRQNG
GTSYAVNMTLDNSNISNVKDSIMRTDSSVSQGKITNTRYSKVPTLFKGFASGKTSQSGNTQY
>D3JTC2 4.2.2.2~~~pelB~~~Pectate lyase B~~~
MKKTVRSLCSTALALTLGFTLLSGPASVQAAGNADYNLAGFSQGNTGGGIISESNTSTYKKVYNATDLALALKKNSGVKV
VEIMNDLDLGWNEIPSAAQTSPFAKHNDALTHPVLKQTGVSKITVDGFNGLTIFSANGSKIKHAAITVKRSSNVIIRNLE
FDELWEWDESTKGDYDKNDWDYITLEDSSGVWIDHCTFNKAYDGLVDSKKGTSGVTISWSTFKGDDGSANSWVTRQINEL
EANKASYPMYNYLRSSAVGLSKQDVIAISGPQKKGHLVGATSLESANANLSITLHHNLYKDIQDRMPRLRGGNAHAYNII
MDAADARSAQSRITSAMATAIASKGYKFGITSNGAISTESGAVLVEKSVIKDVQXPCTQQSDRSDQRHVHR
>O34310 4.2.2.2~~~pelC~~~Pectate lyase C~~~COG5297
MKKIVSILFMFGLVMGFSQFQPSTVFAADKVVHETIIVPKNTTYDGKGQRFVAGKELGDGSQSENQDPVFRVEDGATLKN
VVLGAPAADGVHTYGNVNIQNVKWEDVGEDALTVKKEGKVTIDGGSAQKASDKIFQINKASTFTVKNFTADNGGKFIRQL
GGSTFHVDVIIDKCTITNMKEAIFRTDSKTSTVRMTNTRYSNVGQKWIGVQHIYENNNTQF
>P11073 4.2.2.2~~~pelC~~~Pectate lyase C~~~
MKSLITPITAGLLLALSQPLLAATDTGGYAATAGGNVTGAVSKTATSMQDIVNIIDAARLDANGKKVKGGAYPLVITYTG
NEDSLINAAAANICGQWSKDPRGVEIKEFTKGITIIGANGSSANFGIWIKKSSDVVVQNMRIGYLPGGAKDGDMIRVDDS
PNVWVDHNELFAANHECDGTPDNDTTFESAVDIKGASNTVTVSYNYIHGVKKVGLDGSSSSDTGRNITYHHNYYNDVNAR
LPLQRGGLVHAYNNLYTNITGSGLNVRQNGQALIENNWFEKAINPVTSRYDGKNFGTWVLKGNNITKPADFSTYSITWTA
DTKPYVNADSWTSTGTFPTVAYNYSPVSAQCVKDKLPGYAGVGKNLATLTSTACK
>P04960 4.2.2.2~~~pelE~~~Pectate lyase E~~~
MKNTRVRSIGTKSLLAAVVTAALMATSAYAAVETDAATTGWATQNGGTTGGAKAAKAVEVKNISDFKKALNGTDSSAKII
KVTGPIDISGGKAYTSFDDQKARSQISIPSNTTIIGVGSNGKFTNGSLVIKGVKNVILRNLYIETPVDVAPHYESGDGWN
AEWDAAVIDNSTNVWVDHVTISDGSFTDDKYTTKDGEKYVQHDGALDIKKGSDYVTISYSRFELHDKTILIGHSDSNGSQ
DSGKLRVTFHNNVFDRVTERAPRVRFGSIHAYNNVYLGDVKHSVYPYLYSFGLGTSGSILSESNSFTLSNLKSIDGKNPE
CSIVKQFNSKVFSDKGSLVNGSTTTKLDTCGLTAYKPTLPYKYSAQTMTSSLATSINNNAGYGKL
>P0C1A7 4.2.2.2~~~pelL~~~Pectate lyase L~~~COG4733
MKYLNCFISTGLAAFFLVNSTSVLAADCSSDLTSGISTKRIYYVAPNGNSSNNGSSFNAPMSFSAAMAAVNPGELILLKP
GTYTIPYTQGKGNTITFNKSGKDGAPIYVAAANCGRAVFDFSFPDSQWVQASYGFYVTGDYWYFKGVEVTRAGYQGAYVI
GSHNTFENTAFHHNRNTGLEINNGGSYNTVINSDAYRNYDPKKNGSMADGFGPKQKQGPGNRFVGCRAWENSDDGFDLFD
SPQKVVIENSWAFRNGINYWNDSAFAGNGNGFKLGGNQAVGNHRITRSVAFGNVSKGFDQNNNAGGVTVINNTSYKNGIN
YGFGSNVQSGQKHYFRNNVSLSASVTVSNADAKSNSWDTGPAASASDFVSLDTSLATVSRDNDGTLPETSLFRLSANSKL
INAGTKESNISYSGSAPDLGAFERN
>E3E7F9 4.2.2.2~~~pel9A~~~Pectate lyase L~~~COG3266
MFKRNDRSKNGFNALRLGVSFVLASSCLIGTAYADVPSNNLPSTTLTEEAVSATDSAAASTTLAAGDLYVAPNGNVSNPG
TISSPTTLEAALTQIAPGKTIYLRGGNYAYSSTITIQRGNNGSNGSLKGLVAYGSEKPVLDFSAQAFGSANRGLQLNGDF
WLVKGLEVKGAGDNGIYIGGSNNRIENVETHHNRDTGLQLGRYSPNASTSEWPANNLILNSYSHDNADPDNGEDADGFAA
KLTVGSGNVFDNCLAAYNVDDGWDLYSKTETGPIGAVTILNSVAHHNGQTSDGTSTANSDGNGFKLGGDKIKVNHIVKNS
IAFQNKKHGFTYNSNPGTITLTNNTSWDNGQSNFAFDKGEHVFINNLSFEGTASDKTSGTDQDNSNVWWKNKKTTNAKGL
AASADDFVSLVPSITRGADGSIQLGDFLKLAKGSDLIGSGTPSGNIGAR
>Q76EC9 4.2.2.2~~~pel9A~~~Pectate lyase L~~~
MRNCKGLSILLCFLLVFFAMPFPAVAEEAEAMPVSEASEWSFSAFGSNTSVEKNPDPMINSDGSVKIVANGGKIASNEQG
ISFYYREVPSDANFEIKAKAEVLNFKGDDKQVSFGLMLMDQIGQHRNSEKHNSNYIAVGALDTIIKAFYMQESLTKTDML
SQTPSQGDIFELSIKKSGDNYVLTCNGTTETFTLPGLFSDTIYVGIYAARNAEIKFSDLNFTLDTKDIVDLSVDLSGMKT
SYLVDEPLNLKGLKVTAHYSDGTSEELTEEDYIVTGFDSSTPGTNTICINVGEISKTIDLEILPLTCTKLTVKYLPAKTD
YYLGDSFNPEGLKVIAEYNDGYKVTELTEDKYALYIAGKAAEDYVFSKAGTQKVEVISRENPSVKTGFEVNISDASIESL
EISRKPEKTAYFIGDEPDLTGLVVYARYSDGSKVRLDKSEYEVKGFDSSAPGEKEITVYHKGKTVAFSVVVKEKEVMGIE
VTKYPKTTYYIGETFNAEGLEVSKVYDNGDREPLTDFSVDASAFDGSTPGVYDVIISADGFDPITLKVTVREKTEYEWKA
IRFGQSTSDSKNYVNFLDNGAVEIVALEGGGKIATDHDGITFYYTEIDAKDNFVLSADIKVKEYAKNPHDGQESFGIMAR
DAIGTPGDSSIFASNIAAIGGFSGGTKSPNGTQLFIRTGVSSPDGAGSKGIRRIMIKDEKPGPDNTYPAAEYRLTLAKTN
SGFVGKLNDGEEVIFYEPDILNVQDSKIYVGFFAARLATIEVSNIELYVSSSETDAPRYIPPEAPVTPSLQILSLDKTSN
VNYSLVVKPNVNGSITVKQGAEILVRDVTVNAGEKYSVDAVLEKNSENPFTVIFIPDDTQNLSSYEKIIKNFSVTMRTYN
EGGNIYVSPNGTPYGDGTKDNPLDLDTAIAFVKEGQKIILMNGVYKRDSALVISRYNDGTAENRKYLVAEPGSRPVIDFD
KKGQGVTLSGNYWYIEGIDFARSAPNYPGFIIGGNYNIVENCRFYENGDTGLQISRTDSSENIAEWPSYNKIINCESFDN
RDPSENNADGFAAKLTCGVGNMFIGCVSHHNIDDGWDLYTKAGTGAIGPVIIDSCIAYENGTLTDGTVGKGDKNGFKLGG
EGVPVQHIIKNSIAFNNGAVGFTSNSNPSVIAINNIAYNNAKGNLVFTSYSGIETHFVVDGFVSYNTEGAPRDSATGVPA
SDDNYLFDGTKSVNKSGEELTEKEFIERLLELISKIKSIK
>Q05526 4.2.2.9~~~pelW~~~Pectate disaccharide-lyase~~~
MSIFTDLNTSRKWQIDQWLSAVNSHIEKIQQYGHSVVNPTPLLADGFEIKTQSPVVWQFPDGHDAPISNFASQQNWLRLL
ISMSVITETEKYRHLAFCQSEYFLNRFVDENSGLFYWGGHRFINLDTLASEGPESKSMVHELKHHLPYYEFLHQVNPEKT
RHFIQGFWNAHVEDWSCLDLGRHGDYARQRDPDVFLHSRHDVVTPANWPELPLTKGLTFVNAGTDLIYAAFVYARHTGDA
HAAAWGKHLYRQYVLARNPETGMPVYQFSSPLQRQPVPADDNQTQSWFGDRAQRQFGPEFGAIAREANVLFRDMRPLLID
NPLAMLDILRHQPDAEILTWVIAGLKNYYQYAYDVNSNSLRPMWNNGQDMTDYCFKRDGYYGKAGTVLKPFPLEGDYLLP
LVRAWLLSDDDDLHTLIVTMLSRLEKQGIHQSASPFLLLAITELAHAKQSAQWAEYAWQMAEILFKRYFHHGLFVRSEHH
RYVRLDDPFPAILLTLIAACRNKWSEVPAVLTQGGYIHGDYRINGESRVIYDTEFIYPEKLIH
>P22751 4.2.2.9~~~pelX~~~Pectate disaccharide-lyase~~~
MKYAASGLLSVALNSLLLLGSNQRFATQDVAPVWRGIAFGQSTDVNFATNVLPEKVGVNDVTINGKKLTVNDKADLSAPI
TIESRGGKIANTHDGLTFFYTQLPANVNFTLQSDVTVEQFGPESDAKPNAQEGAGLLVRDILGVPRQEPLKEGYEEFPAA
SNMVMNAIMTQDKKSKTEVKMQLISRNGVTQPWGNTNAEITRTSYQEKINLEQTPTFRLKLERTNDGFITAYAPKGSDQW
VSKTVKGADLVTHQDKDHYYVGFFASRNAKITISNASLTTSPANTKPSAPFKAETTAPLLQVASSSLSTSDTYPVQARVN
YNGTVEVFQNGKSLGKPQRVRAGDDFSLTTRLTQQKSDFKLVYIPSEGEDKTAKETSFSVEKITLADARNLYVSPEGKAG
NDGSKNAPLDIKTAINALPGGGTLWLMDGDYSATVIPVSATQRKGMKTLMPVGKKAVFHGLQLNASYWKVKGIEITEKSF
RIEGSHNQIERLLAHHCDNTGIQVSSSDNVGRPLWASHNLILNSESHSNQHPSKKDADGFAVKMRVGEGNVIRGAFSHDN
VDDGFDLFNKIEDGPNGAVMIENSISLNNTSNGFKLGGEGQPVAHQVKNSIAIGNHMDGFSDNFNPGALQVSNNIALDNV
RFNFIFRPSPYYGYEKQGIFKNNVSLRTQPGKYDDAVVGRLDASNYFIRIIERSTVRVRKSRRRITNPSRCQRSSAGMKK
AACNWVIFCRRSNRHKTQRHRNRYPSTPA
>P16530 4.2.2.2~~~PEL X~~~Putative pectate lyase X~~~
MKYLLPTAAAGLLLLAAQPAMAANTGGYATTDGGEVSGAVKKTARSMKEIVDIIEAAQVDSKGKKVKGGAYPLIITYSGN
EDSLIKAAEKNICGQWSKDARGVQIKEFTKGTYYPGHQWLIRQLRCLDCETLLTLWYVICALAICQAARKHGDAIRIDNS
PNVWIDHNEIFAKNFECKGTPDNDTTFESAVDIKKGSTNVTVSVSVKEVGTLVNLSRLFFPFRIQRYRAFRLPVSCLP
>P39116 4.2.2.2~~~pel~~~Pectate lyase~~~COG3866
MKKVMLATALFLGLTPAGANAADLGHQTLGSNDGWGAYSTGTTGGSKASSSNVYTVSNRNQLVSALGKETNTTPKIIYIK
GTIDMNVDDNLKPLGLNDYKDPEYDLDKYLKAYDPSTWGKKEPSGTQEEARARSQKNQKARVMVDIPANTTIVGSGTNAK
VVGGNFQIKSDNVIIRNIEFQDAYDYFPQWDPTDGSSGNWNSQYDNITINGGTHIWIDHCTFNDGSRPDSTSPKYYGRKY
QHHDGQTDASNGANYITMSYNYYHDHDKSSIFGSSDSKTSDDGKLKITLHHNRYKNIVQRAPRVRFGQVHVYNNYYEGST
SSSSYPFSYAWGIGKSSKIYAQNNVIDVPGLSAAKTISVFSGGTALYDSGTLLNGTQINASAANGLSSSVGWTPSLHGSI
DASANVKSNVINQAGAGKLN
>Q51915 4.2.2.2~~~pel~~~Pectate lyase~~~
MTKPSTFTACKLASAVFGALLFSSVPAHAADIWLDVATTGWATQNGGTKGGSRAAANDIYTVKNAAELKKALSASAGSNG
RIIKITGIIDVSEGKVYTKTADMKVRGRLDIPGKTTIVGIGSNAEIREGFFYAKENDVIIRNITVENPWDPEPIFDKDDG
ADGNWNSEYDGLTVEGANNVWVDHVTFTDGRRTDDQNGTEHERPKQHHDGALDVKNGANFVTISYSVFKSHEKNNLIGSS
DSRTTDDGKLKVTIHNTLFENISARAPRVRYGQVHLYNNYHVGSTSHKVYPFSYAHGVGKNSKIFSERNAFEIAGISGCD
KIAGDYGGSVYRDTGSTLNGSALSCSWSSSIGWTPPYSYTPLAADKVAADVKAKAGAGKL
>B2FHL8 4.2.2.-~~~~~~Polysaccharide lyase~~~
MSLPLRLALLPTLLASASAFAACPAPPPGQPDIRAIGYYTDKAGSVIDPALQQQNKDATAPLDRYAADVARMSDDYLRNG
DPAAAQCTLSWLGAWADDGAMLGQMIRVNNDQSFYMRQWMLDAVAMAYLKVHDQANPQQRARIDPWLQKLARANLAYWDN
PKRRRNNHYYWGGLGVLATGLATDDDALWQAGHAAFQKGIDDIQDDGSLPLEMARGQRALHYHDYALAPLVMMAELARLR
GQDWYASRNHAIDRLARRVIEGSRDPAWFNQHTGAAQLPLQASGWVEFYRLRSPDGGVFDAAHARGPFHSPRLGGDLTLM
ATHGIVRTPLR
>P0AFK0 3.4.-.-~~~pmbA~~~Metalloprotease PmbA~~~COG0312
MALAMKVISQVEAQRKILEEAVSTALELASGKSDGAEVAVSKTTGISVSTRYGEVENVEFNSDGALGITVYHQNRKGSAS
STDLSPQAIARTVQAALDIARYTSPDPCAGVADKELLAFDAPDLDLFHPAEVSPDEAIELAARAEQAALQADKRITNTEG
GSFNSHYGVKVFGNSHGMLQGYCSTRHSLSSCVIAEENGDMERDYAYTIGRAMSDLQTPEWVGADCARRTLSRLSPRKLS
TMKAPVIFANEVATGLFGHLVGAIAGGSVYRKSTFLLDSLGKQILPDWLTIEEHPHLLKGLASTPFDSEGVRTERRDIIK
DGILTQWLLTSYSARKLGLKSTGHAGGIHNWRIAGQGLSFEQMLKEMGTGLVVTELMGQGVSAITGDYSRGAAGFWVENG
EIQYPVSEITIAGNLKDMWRNIVTVGNDIETRSNIQCGSVLLPEMKIAGQ
>Q5KSN4 4.2.3.31~~~~~~Ent-pimara-9(11),15-diene synthase~~~
MRARHRVALKVLADLRSWAAEYPQVLEATPIEALAISTAAISPWRGANELRLSAPDVRCGPTPLDDHVEQNVRSLDELDD
LFGRCEAIVRGGDRDDGHPLLASLSGWQSALERAPHYPKLAGLWGDRFAEALRGERYDWTAGLARDRGEGPSDPQEYLTY
AASSNAWITHFPRWATSDRDDLLDGLPVLDNALEAIEVAVRLSNDLATFERERAEPGQNNILMYDTSPDWVHDELDRHSR
KAQEQLDPLATAGFPPAVELLRLLDWSVTFYSGADFRGWGSDRDLTGPSGLPSDM
>P0C1A8 3.1.1.11~~~pemA~~~Pectinesterase A~~~
MLKTISGTLALSLIIAASVHQAQAATTYNAVVSKSSSDGKTFKTIADAIASAPAGSTPFVILIKNGVYNERLTITRNNLL
LKGESRNGAVIAAATAAGTLKSDGSKWGTAGSSTITISAKDFSAQSLTIRNDFDFPANQAKSDSDSSKIKDTQAVALYVT
KSGDRAYFKDVSLVGYQDTLYVSGGRSFFSDCRISGTVDFIFGDGTALFNNCDLVSRYRADVKSGNVSGYLTAPSTNINQ
KYGLVITNSRVIRESDSVPAKSYGLGRPWHPTTTFSDGRYADPNAIGQTVFLNTSMDNHIYGWDKMSGKDKNGNTIWFNP
EDSRFFEYKSYGAGAAVSKDRRQLTDAQAAEYTQSKVLGDWTPTLP
>P0C1A9 3.1.1.11~~~pemA~~~Pectinesterase A~~~COG4677
MLKTISGTLALSLIIAASVHQAQAATTYNAVVSKSSSDGKTFKTIADAIASAPAGSTPFVILIKNGVYNERLTITRNNLH
LKGESRNGAVIAAATAAGTLKSDGSKWGTAGSSTITISAKDFSAQSLTIRNDFDFPANQAKSDSDSSKIKDTQAVALYVT
KSGDRAYFKDVSLVGYQDTLYVSGGRSFFSDCRISGTVDFIFGDGTALFNNCDLVSRYRADVKSGNVSGYLTAPSTNINQ
KYGLVITNSRVIRESDSVPAKSYGLGRPWHPTTTFSDGRYADPNAIGQTVFLNTSMDNHIYGWDKMSGKDKNGNTIWFNP
EDSRFFEYKSYGAGATVSKDRRQLTDAQAAEYTQSKVLGDWTPTLP
>Q47474 3.1.1.11~~~pemB~~~Pectinesterase B~~~COG4677
MSLTHYSGLAAAVSMSLILTACGGQTPNSARFQPVFPGTVSRPVLSAQEAGRFTPQHYFAHGGEYAKPVADGWTPTPIDT
SRVTAAYVVGPRAGVAGATHTSIQQAVNAALRQHPGQTRVYIKLLPGTYTGTVYVPEGAPPLTLFGAGDRPEQVVVSLAL
DSMMSPADYRARVNPHGQYQPADPAWYMYNACATKAGATINTTCSAVMWSQSNDFQLKNLTVVNALLDTVDSGTHQAVAL
RTDGDRVQLENVRLLSRQDTFFVNTSDRQNSYVTDHYSRAYIKDSYIEGDVDYVFGRATAVFDRVRFHTVSSRGSKEAYV
FAPDSIPSVKYGFLVINSQLTGDNGYRGAQKAKLGRAWDQGAKQTGYLPGKTANGQLVIRDSTIDSSYDLANPWGAAATT
DRPFKGNISPQRDLDDIHFNRLWEYNTQVLLHE
>Q04681 ~~~pmfA~~~Major fimbrial subunit~~~COG3539
MKLSKIALAAALVFGINSVATAENETPAPKVSSTKGEIQLKGEIVNSACGLAASSSPVIVDFSEIPTSALANLQKAGNIK
KDIELQDCDTTVAKTATVSYTPSVVNAVNKDLASFVSGNASGAGIGLMDAGSKAVKWNTATTPVQLINGVSKIPFVAYVQ
AESADAKVTPGEFQAVINFQVDYQ
>Q8GAH9 ~~~pmfR~~~Transcriptional activator PmfR~~~
MEQQVLTVRDLVTAQSLGMKVLSGAGGLDRQVLWAHSCELSDPDRWLGPHELLMTVGLCVPHSAVEQRNFIVKLDEAGLS
GVALGDHNSLPPLTRELYEEADRRSFPVLLTNQATPFAAIGRTVAAATATTQTMQVLKLSKLYQLSTYARTDPLRMMNDL
QALLRAGLSVFDVQTGLTIVEGAPLEFMPTSVRERTYALPGDSDSRLMISEYPGEEVSSFLLIHVLQVIDVALSQLLRSL
RRRSERSTQMLASIFEGRSPDGLGSILGPSGTSSGYQFVAVALEDSEKVARAASIKSLPVLAGPGSSSFFILMPEQSRND
VRNLLHGLDVRAGVSSTYLDLRDAKAAADEAAKIFSSGGTNGLWTDFTGVPVSLLTRSRKEASAIVQQVLGRLAGTDPKI
TVLRETLFAFLANDRRWNETAAALGIHRQTLSYRLTRIKEITGRDIASSADLSAFWLAFQAWPSFSDRSD
>Q9KWG3 2.7.4.2~~~~~~Phosphomevalonate kinase~~~
MTTGQRTIVRHAPGKLFVAGEYAVVDPGNPAILVAVDRHISVTVSDADADTGAADVVISSDLGPQAVGWRWHDGRLVVRD
PDDGQQARSALAHVVSAIETVGRLLGERGQKVPALTLSVSSRLHEDGRKFGLGSSGAVTVATVAAVAAFCGLELSTDERF
RLAMLATAELDPKGSGGDLAASTWGGWIAYQAPDRAFVLDLARRVGVDRTLKAPWPGHSVRRLPAPKGLTLEVGWTGEPA
STASLVSDLHRRTWRGSASHQRFVETTTDCVRSAVTALESGDDTSLLHEIRRARQELARLDDEVGLGIFTPKLTALCDAA
EAVGGAAKPSGAGGGDCGIALLDAEASRDITHVRQRWETAGVLPLPLTPALEGI
>Q9LCB4 2.3.3.18~~~Pmms~~~2-phosphinomethylmalate synthase~~~
MTVQNPQEPEYFPEVFPQDAFPQYAWDEGMRPITLPHEVWLSETTHRDGQQGGLPLSLDTSRRIYDILCEITGDSTAIRH
AEFFPYRDSDRNALIYALERHRDGAPIEPTTWIRARREDVELIKRIGVIETGLLSSSSDYHTFHKFGSGGRTQAASMYLD
AVTMALDHGIRPRVHLEDTTRSSPDFVRALVEEVLKTAERYPAELQPRFRVCDTLGIGLPYDDVSLPRSIPRWIRLLRGF
GLSPSQIELHPHNDTWLVVANCLAAIREGCGVISGTTLGTGERTGNAPLEAVMVHLLGMGYWSGARVNLPAVNKLVELYE
GIGAGPSQKYPFFGRDAYVTRAGIHADGLNKFWWMYAPFNAPLLTGRELDVALTKDSGQAGLLFVLNKRLGLQLEKGDPR
VAEVLAWMDRQWDAGRVSAVEWSELEPVVEKAFATEEGVG
>Q607G3 1.14.18.3~~~pmoA1~~~Particulate methane monooxygenase beta subunit~~~
MSAAQSAVRSHAEAVQVSRTIDWMALFVVFFVIVGSYHIHAMLTMGDWDFWSDWKDRRLWVTVTPIVLVTFPAAVQSYLW
ERYRLPWGATVCVLGLLLGEWINRYFNFWGWTYFPINFVFPASLVPGAIILDTVLMLSGSYLFTAIVGAMGWGLIFYPGN
WPIIAPLHVPVEYNGMLMSIADIQGYNYVRTGTPEYIRMVEKGTLRTFGKDVAPVSAFFSAFMSILIYFMWHFIGRWFSN
ERFLQST
>G1UBD1 1.14.18.3~~~pmoB1~~~Particulate methane monooxygenase alpha subunit~~~
MKTIKDRIAKWSAIGLLSAVAATAFYAPSASAHGEKSQAAFMRMRTIHWYDLSWSKEKVKINETVEIKGKFHVFEGWPET
VDEPDVAFLNVGMPGPVFIRKESYIGGQLVPRSVRLEIGKTYDFRVVLKARRPGDWHVHTMMNVQGGGPIIGPGKWITVE
GSMSEFRNPVTTLTGQTVDLENYNEGNTYFWHAFWFAIGVAWIGYWSRRPIFIPRLLMVDAGRADELVSATDRKVAMGFL
AATILIVVMAMSSANSKYPITIPLQAGTMRGMKPLELPAPTVSVKVEDATYRVPGRAMRMKLTITNHGNSPIRLGEFYTA
SVRFLDSDVYKDTTGYPEDLLAEDGLSVSDNSPLAPGETRTVDVTASDAAWEVYRLSDIIYDPDSRFAGLLFFFDATGNR
QVVQIDAPLIPSFM
>Q9RB65 ~~~~~~Probable outer membrane protein pmp10~~~COG3468
MKSQFSWLVLSSTLACFTSCSTVFAATAENIGPSDSFDGSTNTGTYTPKNTTTGIDYTLTGDITLQNLGDSAALTKGCFS
DTTESLSFAGKGYSLSFLNIKSSAEGAALSVTTDKNLSLTGFSSLTFLAAPSSVITTPSGKGAVKCGGDLTFDNNGTILF
KQDYCEENGGAISTKNLSLKNSTGSISFEGNKSSATGKKGGAICATGTVDITNNTAPTLFSNNIAEAAGGAINSTGNCTI
TGNTSLVFSENSVTATAGNGGALSGDADVTISGNQSVTFSGNQAVANGGAIYAKKLTLASGGGGGISFSNNIVQGTTAGN
GGAISILAAGECSLSAEAGDITFNGNAIVATTPQTTKRNSIDIGSTAKITNLRAISGHSIFFYDPITANTAADSTDTLNL
NKADAGNSTDYSGSIVFSGEKLSEDEAKVADNLTSTLKQPVTLTAGNLVLKRGVTLDTKGFTQTAGSSVIMDAGTTLKAS
TEEVTLTGLSIPVDSLGEGKKVVIAASAASKNVALSGPILLLDNQGNAYENHDLGKTQDFSFVQLSALGTATTTDVPAVP
TVATPTHYGYQGTWGMTWVDDTASTPKTKTATLAWTNTGYLPNPERQGPLVPNSLWGSFSDIQAIQGVIERSALTLCSDR
GFWAAGVANFLDKDKKGEKRKYRHKSGGYAIGGAAQTCSENLISFAFCQLFGSDKDFLVAKNHTDTYAGAFYIQHITECS
GFIGCLLDKLPGSWSHKPLVLEGQLAYSHVSNDLKTKYTAYPEVKGSWGNNAFNMMLGASSHSYPEYLHCFDTYAPYIKL
NLTYIRQDSFSEKGTEGRSFDDSNLFNLSLPIGVKFEKFSDCNDFSYDLTLSYVPDLIRNDPKCTTALVISGASWETYAN
NLARQALQVRAGSHYAFSPMFEVLGQFVFEVRGSSRIYNVDLGGKFQF
>O86164 ~~~~~~Probable outer membrane protein pmp11~~~COG3210
MKTSIPWVLVSSVLAFSCHLQSLANEELLSPDDSFNGNIDSGTFTPKTSATTYSLTGDVFFYEPGKGTPLSDSCFKQTTD
NLTFLGNGHSLTFGFIDAGTHAGAAASTTANKNLTFSGFSLLSFDSSPSTTVTTGQGTLSSAGGVNLENIRKLVVAGNFS
TADGGAIKGASFLLTGTSGDALFSNNSSSTKGGAIATTAGARIANNTGYVRFLSNIASTSGGAIDDEGTSILSNNKFLYF
EGNAAKTTGGAICNTKASGSPELIISNNKTLIFASNVAETSGGAIHAKKLALSSGGFTEFLRNNVSSATPKGGAISIDAS
GELSLSAETGNITFVRNTLTTTGSTDTPKRNAINIGSNGKFTELRAAKNHTIFFYDPITSEGTSSDVLKINNGSAGALNP
YQGTILFSGETLTADELKVADNLKSSFTQPVSLSGGKLLLQKGVTLESTSFSQEAGSLLGMDSGTTLSTTAGSITITNLG
INVDSLGLKQPVSLTAKGASNKVIVSGKLNLIDIEGNIYESHMFSHDQLFSLLKITVDADVDTNVDISSLIPVPAEDPNS
EYGFQGQWNVNWTTDTATNTKEATATWTKTGFVPSPERKSALVCNTLWGVFTDIRSLQQLVEIGATGMEHKQGFWVSSMT
NFLHKTGDENRKGFRHTSGGYVIGGSAHTPKDDLFTFAFCHLFARDKDCFIAHNNSRTYGGTLFFKHSHTLQPQNYLRLG
RAKFSESAIEKFPREIPLALDVQVSFSHSDNRMETHYTSLPESEGSWSNECIAGGIGLDLPFVLSNPHPLFKTFIPQMKV
EMVYVSQNSFFESSSDGRGFSIGRLLNLSIPVGAKFVQGDIGDSYTYDLSGFFVSDVYRNNPQSTATLVMSPDSWKIRGG
NLSRQAFLLRGSNNYVYNSNCELFGHYAMELRGSSRNYNVDVGTKLRF
>Q9Z3D6 ~~~~~~Probable outer membrane protein pmp12~~~COG4625
MTILRNFLTCSALFLALPAAAQVVYLHESDGYNGAINNKSLEPKITCYPEGTSYIFLDDVRISNVKHDQEDAGVFINRSG
NLFFMGNRCNFTFHNLMTEGFGAAISNRVGDTTLTLSNFSYLAFTSAPLLPQGQGAIYSLGSVMIENSEEVTFCGNYSSW
SGAAIYTPYLLGSKASRPSVNLSGNRYLVFRDNVSQGYGGAISTHNLTLTTRGPSCFENNHAYHDVNSNGGAIAIAPGGS
ISISVKSGDLIFKGNTASQDGNTIHNSIHLQSGAQFKNLRAVSESGVYFYDPISHSESHKITDLVINAPEGKETYEGTIS
FSGLCLDDHEVCAENLTSTILQDVTLAGGTLSLSDGVTLQLHSFKQEASSTLTMSPGTTLLCSGDARVQNLHILIEDTDN
FVPVRIRAEDKDALVSLEKLKVAFEAYWSVYDFPQFKEAFTIPLLELLGPSFDSLLLGETTLERTQVTTENDAVRGFWSL
SWEEYPPSLDKDRRITPTKKTVFLTWNPEITSTP
>Q9Z896 ~~~~~~Probable outer membrane protein pmp13~~~COG3210
MKTSIRKFLISTTLAPCFASTAFTVEVIMPSENFDGSSGKIFPYTTLSDPRGTLCIFSGDLYIANLDNAISRTSSSCFSN
RAGALQILGKGGVFSFLNIRSSADGAAISSVITQNPELCPLSFSGFSQMIFDNCESLTSDTSASNVIPHASAIYATTPML
FTNNDSILFQYNRSAGFGAAIRGTSITIENTKKSLLFNGNGSISNGGALTGSAAINLINNSAPVIFSTNATGIYGGAIYL
TGGSMLTSGNLSGVLFVNNSSRSGGAIYANGNVTFSNNSDLTFQNNTASPQNSLPAPTPPPTPPAVTPLLGYGGAIFCTP
PATPPPTGVSLTISGENSVTFLENIASEQGGALYGKKISIDSNKSTIFLGNTAGKGGAIAIPESGELSLSANQGDILFNK
NLSITSGTPTRNSIHFGKDAKFATLGATQGYTLYFYDPITSDDLSAASAAATVVVNPKASADGAYSGTIVFSGETLTATE
AATPANATSTLNQKLELEGGTLALRNGATLNVHNFTQDEKSVVIMDAGTTLATTNGANNTDGAITLNKLVINLDSLDGTK
AAVVNVQSTNGALTISGTLGLVKNSQDCCDNHGMFNKDLQQVPILELKATSNTVTTTDFSLGTNGYQQSPYGYQGTWEFT
IDTTTHTVTGNWKKTGYLPHPERLAPLIPNSLWANVIDLRAVSQASAADGEDVPGKQLSITGITNFFHANHTGDARSYRH
MGGGYLINTYTRITPDAALSLGFGQLFTKSKDYLVGHGHSNVYFATVYSNITKSLFGSSRFFSGGTSRVTYSRSNEKVKT
SYTKLPKGRCSWSNNCWLGELEGNLPITLSSRILNLKQIIPFVKAEVAYATHGGIQENTPEGRIFGHGHLLNVAVPVGVR
FGKNSHNRPDFYTIIVAYAPDVYRHNPDCDTTLPINGATWTSIGNNLTRSTLLVQASSHTSVNDVLEIFGHCGCDIRRTS
RQYTLDIGSKLRF
>Q9Z895 ~~~~~~Probable outer membrane protein pmp14~~~COG3210
MPLSFKSSSFCLLACLCSASCAFAETRLGGNFVPPITNQGEEILLTSDFVCSNFLGASFSSSFINSSSNLSLLGKGLSLT
FTSCQAPTNSNYALLSAAETLTFKNFSSINFTGNQSTGLGGLIYGKDIVFQSIKDLIFTTNRVAYSPASVTTSATPAITT
VTTGASALQPTDSLTVENISQSIKFFGNLANFGSAISSSPTAVVKFINNTATMSFSHNFTSSGGGVIYGGSSLLFENNSG
CIIFTANSCVNSLKGVTPSSGTYALGSGGAICIPTGTFELKNNQGKCTFSYNGTPNDAGAIYAETCNIVGNQGALLLDSN
TAARNGGAICAKVLNIQGRGPIEFSRNRAEKGGAIFIGPSVGDPAKQTSTLTILASEGNIAFQGNMLNTKPGIRNAITVE
AGGEIVSLSAQGGSRLVFYDPITHSLPTTSPSNKDITINANGASGSVVFTSKGLSSTELLLPANTTTILLGTVKIASGEL
KITDNAVVNVLGFATQGSGQLTLGSGGTLGLATPTGAPAAVDFTIGKLAFDPFSFLKRDFVSASVNAGTKNVTLTGALVL
DEHDVTDLYDMVSLQSPVAIPIAVFKGATVTKTGFPDGEIATPSHYGYQGKWSYTWSRPLLIPAPDGGFPGGPSPSANTL
YAVWNSDTLVRSTYILDPERYGEIVSNSLWISFLGNQAFSDILQDVLLIDHPGLSITAKALGAYVEHTPRQGHEGFSGRY
GGYQAALSMNYTDHTTLGLSFGQLYGKTNANPYDSRCSEQMYLLSFFGQFPIVTQKSEALISWKAAYGYSKNHLNTTYLR
PDKAPKSQGQWHNNSYYVLISAEHPFLNWCLLTRPLAQAWDLSGFISAEFLGGWQSKFTETGDLQRSFSRGKGYNVSLPI
GCSSQWFTPFKKAPSTLTIKLAYKPDIYRVNPHNIVTVVSNQESTSISGANLRRHGLFVQIHDVVDLTEDTQAFLNYTFD
GKNGFTNHRVSTGLKSTF
>Q9Z883 ~~~~~~Probable outer membrane protein pmp15~~~COG3210
MRFFCFGMLLPFTFVLANEGLQLPLETYITLSPEYQAAPQVGFTHNQNQDLAIVGNHNDFILDYKYYRSNGGALTCKNLL
ISENIGNVFFEKNVCPNSGGAIYAAQNCTISKNQNYAFTTNLVSDNPTATAGSLLGGALFAINCSITNNLGQGTFVDNLA
LNKGGALYTETNLSIKDNKGPIIIKQNRALNSDSLGGGIYSGNSLNIEGNSGAIQITSNSSGSGGGIFSTQTLTISSNKK
LIEISENSAFANNYGSNFNPGGGGLTTTFCTILNNREGVLFNNNQSQSNGGAIHAKSIIIKENGPVYFLNNTATRGGALL
NLSAGSGNGSFILSADNGDIIFNNNTASKHALNPPYRNAIHSTPNMNLQIGARPGYRVLFYDPIEHELPSSFPILFNFET
GHTGTVLFSGEHVHQNFTDEMNFFSYLRNTSELRQGVLAVEDGAGLACYKFFQRGGTLLLGQGAVITTAGTIPTPSSTPT
TVGSTITLNHIAIDLPSILSFQAQAPKIWIYPTKTGSTYTEDSNPTITISGTLTLRNSNNEDPYDSLDLSHSLEKVPLLY
IVDVAAQKINSSQLDLSTLNSGEHYGYQGIWSTYWVETTTITNPTSLLGANTKHKLLYANWSPLGYRPHPERRGEFITNA
LWQSAYTALAGLHSLSSWDEEKGHAASLQGIGLLVHQKDKNGFKGFRSHMTGYSATTEATSSQSPNFSLGFAQFFSKAKE
HESQNSTSSHHYFSGMCIENTLFKEWIRLSVSLAYMFTSEHTHTMYQGLLEGNSQGSFHNHTLAGALSCVFLPQPHGESL
QIYPFITALAIRGNLAAFQESGDHAREFSLHRPLTDVSLPVGIRASWKNHHRVPLVWLTEISYRSTLYRQDPELHSKLLI
SQGTWTTQATPVTYNALGIKVKNTMQVFPKVTLSLDYSADISSSTLSHYLNVASRMRF
>Q9Z882 ~~~~~~Probable outer membrane protein pmp16~~~COG3210
MSKTPPKFLFYLGNFTACMFGMTPAVYSLQTDSLEKFALERDEEFRTSFPLLDSLSTLTGFSPITTFVGNRHNSSQDIVL
SNYKSIDNILLLWTSAGGAVSCNNFLLSNVEDHAFFSKNLAIGTGGAIACQGACTITKNRGPLIFFSNRGLNNASTGGET
RGGAIACNGDFTISQNQGTFYFVNNSVNNWGGALSTNGHCRIQSNRAPLLFFNNTAPSGGGALRSENTTISDNTRPIYFK
NNCGNNGGAIQTSVTVAIKNNSGSVIFNNNTALSGSINSGNGSGGAIYTTNLSIDDNPGTILFNNNYCIRDGGAICTQFL
TIKNSGHVYFTNNQGNWGGALMLLQDSTCLLFAEQGNIAFQNNEVFLTTFGRYNAIHCTPNSNLQLGANKGYTTAFFDPI
EHQHPTTNPLIFNPNANHQGTILFSSAYIPEASDYENNFISSSKNTSELRNGVLSIEDRAGWQFYKFTQKGGILKLGHAA
SIATTANSETPSTSVGSQVIINNLAINLPSILAKGKAPTLWIRPLQSSAPFTEDNNPTITLSGPLTLLNEENRDPYDSID
LSEPLQNIHLLSLSDVTARHINTDNFHPESLNATEHYGYQGIWSPYWVETITTTNNASIETANTLYRALYANWTPLGYKV
NPEYQGDLATTPLWQSFHTMFSLLRSYNRTGDSDIERPFLEIQGIADGLFVHQNSIPGAPGFRIQSTGYSLQASSETSLH
QKISLGFAQFFTRTKEIGSSNNVSAHNTVSSLYVELPWFQEAFATSTVLAYGYGDHHLHSLHPSHQEQAEGTCYSHTLAA
AIGCSFPWQQKSYLHLSPFVQAIAIRSHQTAFEEIGDNPRKFVSQKPFYNLTLPLGIQGKWQSKFHVPTEWTLELSYQPV
LYQQNPQIGVTLLASGGSWDILGHNYVRNALGYKVHNQTALFRSLDLFLDYQGSVSSSTSTHHLQAGSTLKF
>Q9Z880 ~~~~~~Probable outer membrane protein pmp18~~~COG3210
MQNNRSLSKSSFFVGALILGKTTILLNATPLSDYFDNQANQLTTLFPLIDTLTNMTPYSHRATLFGVRDDTNQDIVLDHQ
NSIESWFENFSQDGGALSCKSLAITNTKNQILFLNSFAIKRAGAMYVNGNFDLSENHGSIIFSGNLSFPNASNFADTCTG
GAVLCSKNVTISKNQGTAYFINNKAKSSGGAIQAAIINIKDNTGPCLFFNNAAGGTAGGALFANACRIENNSQPIYFLNN
QSGLGGAIRVHQECILTKNTGSVIFNNNFAMEADISANHSSGGAIYCISCSIKDNPGIAAFDNNTAARDGGAICTQSLTI
QDSGPVYFTNNQGTWGGAIMLRQDGACTLFADQGDIIFYNNRHFKDTFSNHVSVNCTRNVSLTVGASQGHSATFYDPILQ
RYTIQNSIQKFNPNPEHLGTILFSSAYIPDTSTSRDDFISHFRNHIGLYNGTLALEDRAEWKVYKFDQFGGTLRLGSRAV
FSTTDEEQSSSSVGSVININNLAINLPSILGNRVAPKLWIRPTGSSAPYSEDNNPIINLSGPLSLLDDENLDPYDTADLA
QPIAEVPLLYLLDVTAKHINTDNFYPEGLNTTQHYGYQGVWSPYWIETITTSDTSSEDTVNTLHRQLYGDWTPTGYKVNP
ENKGDIALSAFWQSFHNLFATLRYQTQQGQIAPTASGEATRLFVHQNSNNDAKGFHMEATGYSLGTTSNTASNHSFGVNF
SQLFSNLYESHSDNSVASHTTTVALQINNPWLQERFSTSASLAYSYSNHHIKASGYSGKIQTEGKCYSTTLGAALSCSLS
LQWRSRPLHFTPFIQAIAVRSNQTAFQESGDKARKFSVHKPLYNLTVPLGIQSAWESKFRLPTYWNIELAYQPVLYQQNP
EVNVSLESSGSSWLLSGTTLARNAIAFKGRNQIFIFPKLSVFLDYQGSVSSSTTTHYLHAGTTFKF
>Q9Z813 ~~~~~~Probable outer membrane protein pmp19~~~COG4625
MKQMRLWGFLFLSSFCQVSYLRANDVLLPLSGIHSGEDLELFTLRSSSPTKTTYSLRKDFIVCDFAGNSIHKPGAAFLNL
KGDLFFINSTPLAALTFKNIHLGARGAGLFSESNVTFKGLHSLVLENNESWGGVLTTSGDLSFINNTSVLCQNNISYGPG
GALLLQGRKSKALFFRDNRGTILFLKNKAVNQDESHPGYGGAVSSISPGSPITFADNQEILFQENEGELGGAIYNDQGAI
TFENNFQTTSFFSNKASFGGAVYSRYCNLYSQWGDTLFTKNAAAKVGGAIHADYVHIRDCKGSIVFEENSATAGGAIAVN
AVCDINAQGPVRFINNSALGLNGGAIYMQATGSILRLHANQGDIEFCGNKVRSQFHSHINSTSNFTNNAITIQGAPREFS
LSANEGHRICFYDPIISATENYNSLYINHQRLLEAGGAVIFSGARLSPEHKKENKNKTSIINQPVRLCSGVLSIEGGAIL
AVRSFYQEGGLLALGPGSKLTTQGKNSEKDKIVITNLGFNLENLDSSDPAEIRATEKASIEISGVPRVYGHTESFYENHE
YASKPYTTSIILSAKKLVTAPSRPEKDIQNLIIAESEYMGYGYQGSWEFSWSPNDTKEKKTIIASWTPTGEFSLDPKRRG
SFIPTTLWSTFSGLNIASNIVNNNYLNNSEVIPLQHLCVFGGPVYQIMEQNPKQSSNNLLVQHAGHNVGARIPFSFNTIL
SAALTQLFSSSSQQNVADKSHAQILIGTVSLNKSWQALSLRSSFSYTEDSQVMKHVFPYKGTSRGSWRNYGWSGSVGMSY
AYPKGIRYLKMTPFVDLQYTKLVQNPFVETGYDPRYFSSSEMTNLSLPIGIALEMRFIGSRSSLFLQVSTSYIKDLRRVN
PQSSASLVLNHYTWDIQGVPLGKEALNITLNSTIKYKIVTAYMGISSTQREGSNLSANAHAGLSLSF
>Q9Z9G5 ~~~pmp1~~~Probable outer membrane protein pmp1~~~COG3210
MRFSLCGFPLVFSFTLLSVFDTSLSATTISLTPEDSFHGDSQNAERSYNVQAGDVYSLTGDVSISNVDNSALNKACFNVT
SGSVTFAGNHHGLYFNNISSGTTKEGAVLCCQDPQATARFSGFSTLSFIQSPGDIKEQGCLYSKNALMLLNNYVVRFEQN
QSKTKGGAISGANVTIVGNYDSVSFYQNAATFGGAIHSSGPLQIAVNQAEIRFAQNTAKNGSGGALYSDGDIDIDQNAYV
LFRENEALTTAIGKGGAVCCLPTSGSSTPVPIVTFSDNKQLVFERNHSIMGGGAIYARKLSISSGGPTLFINNISYANSQ
NLGGAIAIDTGGEISLSAEKGTITFQGNRTSLPFLNGIHLLQNAKFLKLQARNGYSIEFYDPITSEADGSTQLNINGDPK
NKEYTGTILFSGEKSLANDPRDFKSTIPQNVNLSAGYLVIKEGAEVTVSKFTQSPGSHLVLDLGTKLIASKEDIAITGLA
IDIDSLSSSSTAAVIKANTANKQISVTDSIELISPTGNAYEDLRMRNSQTFPLLSLEPGAGGSVTVTAGDFLPVSPHYGF
QGNWKLAWTGTGNKVGEFFWDKINYKPRPEKEGNLVPNILWGNAVDVRSLMQVQETHASSLQTDRGLWIDGIGNFFHVSA
SEDNIRYRHNSGGYVLSVNNEITPKHYTSMAFSQLFSRDKDYAVSNNEYRMYLGSYLYQYTTSLGNIFRYASRNPNVNVG
ILSRRFLQNPLMIFHFLCAYGHATNDMKTDYANFPMVKNSWRNNCWAIECGGSMPLLVFENGRLFQGAIPFMKLQLVYAY
QGDFKETTADGRRFSNGSLTSISVPLGIRFEKLALSQDVLYDFSFSYIPDIFRKDPSCEAALVISGDSWLVPAAHVSRHA
FVGSGTGRYHFNDYTELLCRGSIECRPHARNYNINCGSKFRF
>A0A5P3XKQ1 ~~~pmp1~~~Paraclostridial mosquitocidal protein 1~~~
MLQIRVFNYNDPIDGENIVELRYHNRSPVKAFQIVDGIWIIPERYNFTNDTKKVPDDRALTILEDEVFAVRENDYLTTDV
NEKNSFLNNITKLFKRINSSNIGNQLLNYISTSVPYPVVSTNSIKARDYNTIKFDSIDGRRITKSANVLIYGPSMKNLLD
KQTRAINGEEAKNGIGCLSDIIFSPNYLSVQTVSSSRFVEDPASSLTHELIHALHNLYGIQYPGEEKFKFGGFIDKLLGT
RECIDYEEVLTYGGKDSEIIRKKIDKSLYPDDFVNKYGEMYKRIKGSNPYYPDEKKLKQSFLNRMNPFDQNGTFDTKEFK
NHLMDLWFGLNESEFAKEKKILVRKHYITKQINPKYTELTNDVYTEDKGFVNGQSIDNQNFKIIDDLISKKVKLCSITSK
NRVNICIDVNKEDLYFISDKEGFENIDFSEPEIRYDSNVTTATTSSFTDHFLVNRTFNDSDRFPPVELEYAIEPAEIVDN
TIMPDIDQKSEISLDNLTTFHYLNAQKMDLGFDSSKEQLKMVTSIEESLLDSKKVYTPFTRTAHSVNERISGIAESYLFY
QWLKTVINDFTDELNQKSNTDKVADISWIIPYVGPALNIGLDLSHGDFTKAFEDLGVSILFAIAPEFATISLVALSIYEN
IEEDSQKEKVINKVENTLARRIEKWHQVYAFMVAQWWGMVHTQIDTRIHQMYESLSHQIIAIKANMEYQLSHYKGPDNDK
LLLKDYIYEAEIALNTSANRAMKNIERFMIESSISYLKNNLIPSVVENLKKFDADTKKNLDQFIDKNSSVLGSDLHILKS
QVDLELNPTTKVAFNIQSIPDFDINALIDRLGIQLKDNLVFSLGVESDKIKDLSGNNTNLEVKTGVQIVDGRDSKTIRLN
SNENSSIIVQKNESINFSYFSDFTISFWIRVPRLNKNDFIDLGIEYDLVNNMDNQGWKISLKDGNLVWRMKDRFGKIIDI
ITSLTFSNSFIDKYISSNIWRHITITVNQLKDCTLYINGDKIDSKSINELRGIDNNSPIIFKLEGNRNKNQFIRLDQFNI
YQRALNESEVEMLFNSYFNSNILRDFWGEPLEYNKSYYMINQAILGGPLRSTYKSWYGEYYPYISRMRTFNVSSFILIPY
LYHKGSDVEKVKIINKNNVDKYVRKNDVADVKFENYGNLILTLPMYSKIKERYMVLNEGRNGDLKLIQLQSNDKYYCQIR
IFEMYRNGLLSIADDENWLYSSGWYLYSSGWYLDNYKTLDLKKHTKTNWYFVSEDEGWKE
>Q9Z812 ~~~~~~Probable outer membrane protein pmp20~~~COG3210
MKWLPATAVFAAVLPALTAFGDPASVEISTSHTGSGDPTSDAALTGFTQSSTETDGTTYTIVGDITFSTFTNIPVPVVTP
DANDSSSNSSKGGSSSSGATSLIRSSNLHSDFDFTKDSVLDLYHLFFPSASNTLNPALLSSSSSGGSSSSSSSSSSGSAS
AVVAADPKGGAAFYSNEANGTLTFTTDSGNPGSLTLQNLKMTGDGAAIYSKGPLVFTGLKNLTFTGNESQKSGGAAYTEG
ALTTQAIVEAVTFTGNTSAGQGGAIYVKEATLFNALDSLKFEKNTSGQAGGGIYTESTLTISNITKSIEFISNKASVPAP
APEPTSPAPSSLINSTTIDTSTLQTRAASATPAVAPVAAVTPTPISTQETAGNGGAIYAKQGISISTFKDLTFKSNSASV
DATLTVDSSTIGESGGAIFAADSIQIQQCTGTTLFSGNTANKSGGGIYAVGQVTLEDIANLKMTNNTCKGEGGAIYTKKA
LTINNGAILTTFSGNTSTDNGGAIFAVGGITLSDLVEVRFSKNKTGNYSAPITKAASNTAPVVSSSTTAASPAVPAAAAA
PVTNAAKGGALYSTEGLTVSGITSILSFENNECQNQGGGAYVTKTFQCSDSHRLQFTSNKAADEGGGLYCGDDVTLTNLT
GKTLFQENSSEKHGGGLSLASGKSLTMTSLESFCLNANTAKENGGGANVPENIVLTFTYTPTPNEPAPVQQPVYGEALVT
GNTATKSGGGIYTKNAAFSNLSSVTFDQNTSSENGGALLTQKAADKTDCSFTYITNVNITNNTATGNGGGIAGGKAHFDR
IDNLTVQSNQAKKGGGVYLEDALILEKVITGSVSQNTATESGGGIYAKDIQLQALPGSFTITDNKVETSLTTSTNLYGGG
IYSSGAVTLTNISGTFGITGNSVINTATSQDADIQGGGIYATTSLSINQCNTPILFSNNSAATKKTSTTKQIAGGAIFSA
AVTIENNSQPIIFLNNSAKSEATTAATAGNKDSCGGAIAANSVTLTNNPEITFKGNYAETGGAIGCIDLTNGSPPRKVSI
ADNGSVLFQDNSALNRGGAIYGETIDISRTGATFIGNSSKHDGSAICCSTALTLAPNSQLIFENNKVTETTATTKASINN
LGAAIYGNNETSDITISLSAENGSIFFKNNLCTATNKYCSIAGNVKFTAIEASAGKAISFYDAVNVSTKETNAQELKLNE
KATSTGTILFSGELHENKSYIPQKVTFAHGNLILGKNAELSVVSFTQSPGTTITMGPGSVLSNHSKEAGGIAINNVIIDF
SEIVPTKDNATVAPPTLKLVSRTNADSKDKIDITGTVTLLDPNGNLYQNSYLGEDRDITLFNIDNSASGAVTATNVTLQG
NLGAKKGYLGTWNLDPNSSGSKIILKWTFDKYLRWPYIPRDNHFYINSIWGAQNSLVTVKQGILGNMLNNARFEDPAFNN
FWASAIGSFLRKEVSRNSDSFTYHGRGYTAAVDAKPRQEFILGAAFSQVFGHAESEYHLDNYKHKGSGHSTQASLYAGNI
FYFPAIRSRPILFQGVATYGYMQHDTTTYYPSIEEKNMANWDSIAWLFDLRFSVDLKEPQPHSTARLTFYTEAEYTRIRQ
EKFTELDYDPRSFSACSYGNLAIPTGFSVDGALAWREIILYNKVSAAYLPVILRNNPKATYEVLSTKEKGNVVNVLPTRN
AARAEVSSQIYLGSYWTLYGTYTIDASMNTLVQMANGGIRFVF
>Q9Z6U5 ~~~~~~Probable outer membrane protein pmp21~~~COG3210
MVAKKTVRSYRSSFSHSVIVAILSAGIAFEAHSLHSSELDLGVFNKQFEEHSAHVEEAQTSVLKGSDPVNPSQKESEKVL
YTQVPLTQGSSGESLDLADANFLEHFQHLFEETTVFGIDQKLVWSDLDTRNFSQPTQEPDTSNAVSEKISSDTKENRKDL
ETEDPSKKSGLKEVSSDLPKSPETAVAAISEDLEISENISARDPLQGLAFFYKNTSSQSISEKDSSFQGIIFSGSGANSG
LGFENLKAPKSGAAVYSDRDIVFENLVKGLSFISCESLEDGSAAGVNIVVTHCGDVTLTDCATGLDLEALRLVKDFSRGG
AVFTARNHEVQNNLAGGILSVVGNKGAIVVEKNSAEKSNGGAFACGSFVYSNNENTALWKENQALSGGAISSASDIDIQG
NCSAIEFSGNQSLIALGEHIGLTDFVGGGALAAQGTLTLRNNAVVQCVKNTSKTHGGAILAGTVDLNETISEVAFKQNTA
ALTGGALSANDKVIIANNFGEILFEQNEVRNHGGAIYCGCRSNPKLEQKDSGENINIIGNSGAITFLKNKASVLEVMTQA
EDYAGGGALWGHNVLLDSNSGNIQFIGNIGGSTFWIGEYVGGGAILSTDRVTISNNSGDVVFKGNKGQCLAQKYVAPQET
APVESDASSTNKDEKSLNACSHGDHYPPKTVEEEVPPSLLEEHPVVSSTDIRGGGAILAQHIFITDNTGNLRFSGNLGGG
EESSTVGDLAIVGGGALLSTNEVNVCSNQNVVFSDNVTSNGCDSGGAILAKKVDISANHSVEFVSNGSGKFGGAVCALNE
SVNITDNGSAVSFSKNRTRLGGAGVAAPQGSVTICGNQGNIAFKENFVFGSENQRSGGGAIIANSSVNIQDNAGDILFVS
NSTGSYGGAIFVGSLVASEGSNPRTLTITGNSGDILFAKNSTQTAASLSEKDSFGGGAIYTQNLKIVKNAGNVSFYGNRA
PSGAGVQIADGGTVCLEAFGGDILFEGNINFDGSFNAIHLCGNDSKIVELSAVQDKNIIFQDAITYEENTIRGLPDKDVS
PLSAPSLIFNSKPQDDSAQHHEGTIRFSRGVSKIPQIAAIQEGTLALSQNAELWLAGLKQETGSSIVLSAGSILRIFDSQ
VDSSAPLPTENKEETLVSAGVQINMSSPTPNKDKAVDTPVLADIISITVDLSSFVPEQDGTLPLPPEIIIPKGTKLHSNA
IDLKIIDPTNVGYENHALLSSHKDIPLISLKTAEGMTGTPTADASLSNIKIDVSLPSITPATYGHTGVWSESKMEDGRLV
VGWQPTGYKLNPEKQGALVLNNLWSHYTDLRALKQEIFAHHTIAQRMELDFSTNVWGSGLGVVEDCQNIGEFDGFKHHLT
GYALGLDTQLVEDFLIGGCFSQFFGKTESQSYKAKNDVKSYMGAAYAGILAGPWLIKGAFVYGNINNDLTTDYGTLGIST
GSWIGKGFIAGTSIDYRYIVNPRRFISAIVSTVVPFVEAEYVRIDLPEISEQGKEVRTFQKTRFENVAIPFGFALEHAYS
RGSRAEVNSVQLAYVFDVYRKGPVSLITLKDAAYSWKSYGVDIPCKAWKARLSNNTEWNSYLSTYLAFNYEWREDLIAYD
FNGGIRIIF
>Q9Z3A1 ~~~pmp2~~~Probable outer membrane protein pmp2~~~COG4625
MKIPLRFLLISLVPTLSMSNLLGAATTEELSASNSFDGTTSTTSFSSKTSSATDGTNYVFKDSVVIENVPKTGETQSTSC
FKNDAAAGDLNFLGGGFSFTFSNIDATTASGAAIGSEAANKTVTLSGFSALSFLKSPASTVTNGLGAINVKGNLSLLDND
KVLIQDNFSTGDGGAINCAGSLKIANNKSLSFIGNSSSTRGGAIHTKNLTLSSGGETLFQGNTAPTAAGKGGAIAIADSG
TLSISGDSGDIIFEGNTIGATGTVSHSAIDLGTSAKITALRAAQGHTIYFYDPITVTGSTSVADALNINSPDTGDNKEYT
GTIVFSGEKLTEAEAKDEKNRTSKLLQNVAFKNGTVVLKGDVVLSANGFSQDANSKLIMDLGTSLVANTESIELTNLEIN
IDSLRNGKKIKLSAATAQKDIRIDRPVVLAISDESFYQNGFLNEDHSYDGILELDAGKDIVISADSRSIDAVQSPYGYQG
KWTINWSTDDKKATVSWAKQSFNPTAEQEAPLVPNLLWGSFIDVRSFQNFIELGTEGAPYEKRFWVAGISNVLHRSGREN
QRKFRHVSGGAVVGASTRMPGGDTLSLGFAQLFARDKDYFMNTNFAKTYAGSLRLQHDASLYSVVSILLGEGGLREILLP
YVSKTLPCSFYGQLSYGHTDHRMKTESLPPPPPTLSTDHTSWGGYVWAGELGTRVAVENTSGRGFFQEYTPFVKVQAVYA
RQDSFVELGAISRDFSDSHLYNLAIPLGIKLEKRFAEQYYHVVAMYSPDVCRSNPKCTTTLLSNQGSWKTKGSNLARQAG
IVQASGFRSLGAAAELFGNFGFEWRGSSRSYNVDAGSKIKF
>Q9Z899 ~~~pmp6~~~Probable outer membrane protein pmp6~~~COG3210
MKYSLPWLLTSSALVFSLHPLMAANTDLSSSDNYENGSSGSAAFTAKETSDASGTTYTLTSDVSITNVSAITPADKSCFT
NTGGALSFVGADHSLVLQTIALTHDGAAINNTNTALSFSGFSSLLIDSAPATGTSGGKGAICVTNTEGGTATFTDNASVT
LQKNTSEKDGAAVSAYSIDLAKTTTAALLDQNTSTKNGGALCSTANTTVQGNSGTVTFSSNTATDKGGGIYSKEKDSTLD
ANTGVVTFKSNTAKTGGAWSSDDNLALTGNTQVLFQENKTTGSAAQANNPEGCGGAICCYLATATDKTGLAISQNQEMSF
TSNTTTANGGAIYATKCTLDGNTTLTFDQNTATAGCGGAIYTETEDFSLKGSTGTVTFSTNTAKTGGALYSKGNSSLTGN
TNLLFSGNKATGPSNSSANQEGCGGAILSFLESASVSTKKGLWIEDNENVSLSGNTATVSGGAIYATKCALHGNTTLTFD
GNTAETAGGAIYTETEDFTLTGSTGTVTFSTNTAKTAGALHTKGNTSFTKNKALVFSGNSATATATTTTDQEGCGGAILC
NISESDIATKSLTLTENESLSFINNTAKRSGGGIYAPKCVISGSESINFDGNTAETSGGAIYSKNLSITANGPVSFTNNS
GGKGGAIYIADSGELSLEAIDGDITFSGNRATEGTSTPNSIHLGAGAKITKLAAAPGHTIYFYDPITMEAPASGGTIEEL
VINPVVKAIVPPPQPKNGPIASVPVVPVAPANPNTGTIVFSSGKLPSQDASIPANTTTILNQKINLAGGNVVLKEGATLQ
VYSFTQQPDSTVFMDAGTTLETTTTNNTDGSIDLKNLSVNLDALDGKRMITIAVNSTSGGLKISGDLKFHNNEGSFYDNP
GLKANLNLPFLDLSSTSGTVNLDDFNPIPSSMAAPDYGYQGSWTLVPKVGAGGKVTLVAEWQALGYTPKPELRATLVPNS
LWNAYVNIHSIQQEIATAMSDAPSHPGIWIGGIGNAFHQDKQKENAGFRLISRGYIVGGSMTTPQEYTFAVAFSQLFGKS
KDYVVSDIKSQVYAGSLCAQSSYVIPLHSSLRRHVLSKVLPELPGETPLVLHGQVSYGRNHHNMTTKLANNTQGKSDWDS
HSFAVEVGGSLPVDLNYRYLTSYSPYVKLQVVSVNQKGFQEVAADPRIFDASHLVNVSIPMGLTFKHESAKPPSALLLTL
GYAVDAYRDHPHCLTSLTNGTSWSTFATNLSRQAFFAEASGHLKLLHGLDCFASGSCELRSSSRSYNANCGTRYSF
>Q9Z898 ~~~pmp7~~~Probable outer membrane protein pmp7~~~COG3210
MKSSVSWLFFSSIPLFSSLSIVAAEVTLDSSNNSYDGSNGTTFTVFSTTDAAAGTTYSLLSDVSFQNAGALGIPLASGCF
LEAGGDLTFQGNQHALKFAFINAGSSAGTVASTSAADKNLLFNDFSRLSIISCPSLLLSPTGQCALKSVGNLSLTGNSQI
IFTQNFSSDNGGVINTKNFLLSGTSQFASFSRNQAFTGKQGGVVYATGTITIENSPGIVSFSQNLAKGSGGALYSTDNCS
ITDNFQVIFDGNSAWEAAQAQGGAICCTTTDKTVTLTGNKNLSFTNNTALTYGGAISGLKVSISAGGPTLFQSNISGSSA
GQGGGGAINIASAGELALSATSGDITFNNNQVTNGSTSTRNAINIIDTAKVTSIRAATGQSIYFYDPITNPGTAASTDTL
NLNLADANSEIEYGGAIVFSGEKLSPTEKAIAANVTSTIRQPAVLARGDLVLRDGVTVTFKDLTQSPGSRILMDGGTTLS
AKEANLSLNGLAVNLSSLDGTNKAALKTEAADKNISLSGTIALIDTEGSFYENHNLKSASTYPLLELTTAGANGTITLGA
LSTLTLQEPETHYGYQGNWQLSWANATSSKIGSINWTRTGYIPSPERKSNLPLNSLWGNFIDIRSINQLIETKSSGEPFE
RELWLSGIANFFYRDSMPTRHGFRHISGGYALGITATTPAEDQLTFAFCQLFARDRNHITGKNHGDTYGASLYFHHTEGL
FDIANFLWGKATRAPWVLSEISQIIPLSFDAKFSYLHTDNHMKTYYTDNSIIKGSWRNDAFCADLGASLPFVISVPYLLK
EVEPFVKVQYIYAHQQDFYERYAEGRAFNKSELINVEIPIGVTFERDSKSEKGTYDLTLMYILDAYRRNPKCQTSLIASD
ANWMAYGTNLARQGFSVRAANHFQVNPHMEIFGQFAFEVRSSSRNYNTNLGSKFCF
>Q9Z393 ~~~pmp8~~~Probable outer membrane protein pmp8~~~COG3468
MKIPLHKLLISSTLVTPILLSIATYGADASLSPTDSFDGAGGSTFTPKSTADANGTNYVLSGNVYINDAGKGTALTGCCF
TETTGDLTFTGKGYSFSFNTVDAGSNAGAAASTTADKALTFTGFSNLSFIAAPGTTVASGKSTLSSAGALNLTDNGTILF
SQNVSNEANNNGGAITTKTLSISGNTSSITFTSNSAKKLGGAIYSSAAASISGNTGQLVFMNNKGETGGGALGFEASSSI
TQNSSLFFSGNTATDAAGKGGAIYCEKTGETPTLTISGNKSLTFAENSSVTQGGAICAHGLDLSAAGPTLFSNNRCGNTA
AGKGGAIAIADSGSLSLSANQGDITFLGNTLTSTSAPTSTRNAIYLGSSAKITNLRAAQGQSIYFYDPIASNTTGASDVL
TINQPDSNSPLDYSGTIVFSGEKLSADEAKAADNFTSILKQPLALASGTLALKGNVELDVNGFTQTEGSTLLMQPGTKLK
ADTEAISLTKLVVDLSALEGNKSVSIETAGANKTITLTSPLVFQDSSGNFYESHTINQAFTQPLVVFTAATAASDIYIDA
LLTSPVQTPEPHYGYQGHWEATWADTSTAKSGTMTWVTTGYNPNPERRASVVPDSLWASFTDIRTLQQIMTSQANSIYQQ
RGLWASGTANFFHKDKSGTNQAFRHKSYGYIVGGSAEDFSENIFSVAFCQLFGKDKDLFIVENTSHNYLASLYLQHRAFL
GGLPMPSFGSITDMLKDIPLILNAQLSYSYTKNDMDTRYTSYPEAQGSWTNNSGALELGGSLALYLPKEAPFFQGYFPFL
KFQAVYSRQQNFKESGAEARAFDDGDLVNCSIPVGIRLEKISEDEKNNFEISLAYIGDVYRKNPRSRTSLMVSGASWTSL
CKNLARQAFLASAGSHLTLSPHVELSGEAAYELRGSAHIYNVDCGLRYSF
>Q9Z398 ~~~pmp9~~~Probable outer membrane protein pmp9~~~COG4625
MKSSLHWFLISSSLALPLSLNFSAFAAVVEINLGPTNSFSGPGTYTPPAQTTNADGTIYNLTGDVSITNAGSPTALTASC
FKETTGNLSFQGHGYQFLLQNIDAGANCTFTNTAANKLLSFSGFSYLSLIQTTNATTGTGAIKSTGACSIQSNYSCYFGQ
NFSNDNGGALQGSSISLSLNPNLTFAKNKATQKGGALYSTGGITINNTLNSASFSENTAANNGGAIYTEASSFISSNKAI
SFINNSVTATSATGGAIYCSSTSAPKPVLTLSDNGELNFIGNTAITSGGAIYTDNLVLSSGGPTLFKNNSAIDTAAPLGG
AIAIADSGSLSLSALGGDITFEGNTVVKGASSSQTTTRNSINIGNTNAKIVQLRASQGNTIYFYDPITTSITAALSDALN
LNGPDLAGNPAYQGTIVFSGEKLSEAEAAEADNLKSTIQQPLTLAGGQLSLKSGVTLVAKSFSQSPGSTLLMDAGTTLET
ADGITINNLVLNVDSLKETKKATLKATQASQTVTLSGSLSLVDPSGNVYEDVSWNNPQVFSCLTLTADDPANIHITDLAA
DPLEKNPIHWGYQGNWALSWQEDTATKSKAATLTWTKTGYNPNPERRGTLVANTLWGSFVDVRSIQQLVATKVRQSQETR
GIWCEGISNFFHKDSTKINKGFRHISAGYVVGATTTLASDNLITAAFCQLFGKDRDHFINKNRASAYAASLHLQHLATLS
SPSLLRYLPGSESEQPVLFDAQISYIYSKNTMKTYYTQAPKGESSWYNDGCALELASSLPHTALSHEGLFHAYFPFIKVE
ASYIHQDSFKERNTTLVRSFDSGDLINVSVPIGITFERFSRNERASYEATVIYVADVYRKNPDCTTALLINNTSWKTTGT
NLSRQAGIGRAGIFYAFSPNLEVTSNLSMEIRGSSRSYNADLGGKFQF
>Q9PJY3 ~~~pmpA~~~Probable outer membrane protein PmpA~~~COG4625
MNQVIKTIALCYQKYISRASNKTFSIHNTLSLSLLPKCLLGSLIIYTSHAFGEMELAISGHKYGKDRDAFTMISSCPEGT
NYMINRKLILSEFSSLDKFSSGGAFKNLAGKIAFLGKHSSSSIHFKHLNINGFGSGIFSESAIEFSDLRKLVAFGSESTG
GIFTARDDISFKNNHYIAFRNNIAKGNGGVILLQGDERGTVSFTDQQGAIIFANNQALVSPSIKHSGRGGAISGDFAGSR
IIFLNNQQITFEENSAVHGGAIYNKNGVVEFLGNGGTLSFKENSTRANGGAIYTGKFKANQQTAPIIFSQNNANQKGGAI
YAQYVNLEQNQDAIRFENNSAKEGGGAISSSQCAITAHNSITFSNNFAGDLGGGAILLGGKQPSLSLVAHNGNIAFIGNT
MLPATKKASLPRNNSILVKESPYKIQFAANKNRSIIFFDPVIALSPSTSPVEINSPEHETPFFSPKGTIVFSGANLIDDA
KEDIANRTSIFNQPVLLHNGTLSIESGAHLVVQSFKQTGGRISLSPGSSLALYTTNTLFHGNVSSTDPLEINGLSLGVDT
SPSNLYSEIRAGSAPLKLSGSPDIHDPERLFYENRDSAASPYQMEILLSSDKIIDVSNFTIDAPVPNKEAGFQGSWHFSW
QPNTVNNTKHKVLRASWIPTGEYILEPSRVGNAIPNSLWSTFLLLQTASHNLGDHLCNNSDLVPTSYLGLLIGGIGAEMR
TYSAEKESFISRSGTTGTTIIRLTPTLTLSGGATHMFGDSFVTKLPEFIASEGMVQNVGLTQILGPLTVKSTLCAALDHN
AMIRLSAQKNHTRAKWDTFGIRGTLGASYSLLDYENMVRIFTFANVEATNVLQKSFTETGYNPRSFARTRLTNIAVPVGI
GYEFCLSNHSFALLGKGHIGYSRDIKRKNPVTFAQLAMNNFSWTANGCQVPTSAHTIANQLILRYKACSLYVNAVATKLE
STYLSSSLSCGGYVGF
>O84417 ~~~pmpA~~~Probable outer membrane protein PmpA~~~
MNRVIEIHAHYDQRQLSQSPNTNFLVHHPYLTLIPKFLLGALIVYAPYSFAEMELAISGHKQGKDRDTFTMISSCPEGTN
YIINRKLILSDFSLLNKVSSGGAFRNLAGKISFLGKNSSASIHFKHININGFGAGVFSESSIEFTDLRKLVAFGSESTGG
IFTAKEDISFKNNHHIAFRNNITKGNGGVIQLQGDMKGSVSFVDQRGAIIFTNNQAVTSSSMKHSGRGGAISGDFAGSRI
LFLNNQQITFEGNSAVHGGAIYNKNGLVEFLGNAGPLAFKENTTIANGGAIYTSNFKANQQTSPILFSQNHANKKGGAIY
AQYVNLEQNQDTIRFEKNTAKEGGGAITSSQCSITAHNTIIFSDNAAGDLGGGAILLEGKKPSLTLIAHSGNIAFSGNTM
LHITKKASLDRHNSILIKEAPYKIQLAANKNHSIHFFDPVMALSASSSPIQINAPEYETPFFSPKGMIVFSGANLLDDAR
EDVANRTSIFNQPVHLYNGTLSIENGAHLIVQSFKQTGGRISLSPGSSLALYTMNSFFHGNISSKEPLEINGLSFGVDIS
PSNLQAEIRAGNAPLRLSGSPSIHDPEGLFYENRDTAASPYQMEILLTSDKIVDISKFTTDSLVTNKQSGFQGAWHFSWQ
PNTINNTKQKILRASWLPTGEYVLESNRVGRAVPNSLWSTFLLLQTASHNLGDHLCNNRSLIPTSYFGVLIGGTGAEMST
HSSEEESFISRLGATGTSIIRLTPSLTLSGGGSHMFGDSFVADLPEHITSEGIVQNVGLTHVWGPLTVNSTLCAALDHNA
MVRICSKKDHTYGKWDTFGMRGTLGASYTFLEYDQTMRVFSFANIEATNILQRAFTETGYNPRSFSKTKLLNIAIPIGIG
YEFCLGNSSFALLGKGSIGYSRDIKRENPSTLAHLAMNDFAWTTNGCSVPTSAHTLANQLILRYKACSLYITAYTINREG
KNLSNSLSCGGYVGF
>Q9PJY2 ~~~pmpB~~~Probable outer membrane protein PmpB~~~COG3210
MSSMKWLSATAVFAAVLPSVSGFCFPESKELNFSRTGTSSSTTFTETIGENGTEYIVSGNSSFTNFTNIPVKKPTTDDSS
TSTPTTSSAVDPTEKIVRASSSSSPNSGDTSATPDPKGGGAFYNEHSGILSFMARSGVEGSLTLSNIKMTGDGGAIYSQG
ELLFTDLTGLTIQGNLSQLSGGGIFGGSTISFSGINQATFSSNTAEVVPEETTPNPNPGTQTTTSQPSPTSKVQSLFTYS
SSTQANGNGADSQTPSHKPGSGGAIYATGDLTISDSQEIVFSVNKASKDGGAIFAEKNVSFENITTLKVQNNGAEEKGGG
IYASGDLSIQSSKQSLFNSNTSKQGGGALYIEGNVDFKDLEEIRIKYNKSGTFETKKVTLSLPEAQTNKSSVTAASQSGP
NTTPTPTPPVTAKGGGLYTEKNLSISNITGIIEITNNKATDVGGGAYVKGTLTCKDSHRLQFQKNSSEKKGGGLYTEDTI
TLSNLTGKTLFQENTAKEEGGGLYIQGDDKTLTMTGLDSFCLIDNTSATHGGGAYVTKEISQTYTSDVEEFPGITPVHGE
TIISGNKATGGSGGGVCTKHLVLSNLQTISISENFASENGGGACTCPDNFPAPTASTPSTNQTAAPKDDKDFLIDYVVST
TIDKNKATKKGAGVYAKKAKLSRIDELNISDNAAQETGGGFCCTESLELDTIASLSVTKNLAGKEGGGLHAKTLNISNLK
SGLSFSNNTANSSSTGVATTATTSQSPTVSSFLPRATAGSSPAPAQTTPTYAGVVGGAIYGETVSFSKCSGLCQFTENSA
IDNTPSSPSLNVQGGAIYAKTSLSIEAEDPSTSYVFSKNSVSTGKAQTTGQIAGGAIYSPSVTLNCQTVFSGNSASMATT
NPPSGTSPKDTIGGAIAGTTISLSKTSHFSENTADLGAAIGTLSGGSSSNLTEKITLSNGSFTFEKNKANKRGVIYAPSV
SIKGNNITFNQNTSTHDGSAIYFTKDATIESLGSVLFTGNNVTAEQASSTATGGQTTNTTNYGAAIFGDPGTQTTDTTLK
LIASSGNITFSNNSQNTATNNPATKFCSISGYVKLTLQAAQGKTISFFDSIRTSTKKTGQAQNSYETLDINKTENSNTYA
GTVLFSSELHEVKSYVPQNVVLHNGTLVLKKNAELHVVSFEQKEGSKLIMEPGAVLSNQNIANGALAINGLTIDLSSLGA
PQTGEVFSPPELRIVATTSNSGGGGGVGGYVTASKNLSAASPTVAATNPTMADNKVFLTGALTLIDPDGNFYQNPILGTD
LTDVPLIKLPTTANQVDVSNLTLSGDLSPKKGYTGTWTLNPDPQTGKVVANWKFDMYRRWEYIPRDNHFYANSILGSQNS
MIVVKQGLINNMLHNARFDDAAYNNFWVSGVGTFLSQQGTPLSEEFSYYSRGTSVAIDAKPRPDFILGAAFSKMVGRTKA
IKKVHNYSHKGSEYSYQASVYGGKFLYFLLNKQHGWALPFLLQGVVSYGHIKHDTTTLYPSIHEKNKGDWEDLGWLVDLR
VSMDVKEPSKRSSKRVALYGELEYSSIRQKSFTEIDYDPRRFDDCAYRNLSIPMGCYFEGAIMSYDILMYNKLSLAYMPS
IYRNNPVCKYWVLSSNETSKVVCGVPTRTSARAEYSTQLYLGPFWTLYGNYTIDVGMYTLAQMTSCGARMIF
>O84418 ~~~pmpB~~~Probable outer membrane protein PmpB~~~
MSSMKWLSATAVFAAVLPSVSGFCFPEPKELNFSRVGTSSSTTFTETVGEAGAEYIVSGNASFTKFTNIPTTDTTTPTNS
NSSSSNGETASVSEDSDSTTTTPDPKGGGAFYNAHSGVLSFMTRSGTEGSLTLSEIKITGEGGAIFSQGELLFTDLTGLT
IQNNLSQLSGGAIFGESTISLSGITKATFSSNSAEVPAPVKKPTEPKAQTASETSGSSSSSGNDSVSSPSSSRAEPAAAN
LQSHFICATATPAAQTDTETSTPSHKPGSGGAIYAKGDLTIADSQEVLFSINKATKDGGAIFAEKDVSFENITSLKVQTN
GAEEKGGAIYAKGDLSIQSSKQSLFNSNYSKQGGGALYVEGDINFQDLEEIRIKYNKAGTFETKKITLPKAQASAGNADA
WASSSPQSGSGATTVSNSGDSSSGSDSDTSETVPATAKGGGLYTDKNLSITNITGIIEIANNKATDVGGGAYVKGTLTCE
NSHRLQFLKNSSDKQGGGIYGEDNITLSNLTGKTLFQENTAKEEGGGLFIKGTDKALTMTGLDSFCLINNTSEKHGGGAF
VTKEISQTYTSDVETIPGITPVHGETVITGNKSTGGNGGGVCTKRLALSNLQSISISGNSAAENGGGAHTCPDSFPTADT
AEQPAAASAATSTPESAPVVSTALSTPSSSTVSSLTLLAASSQASPATSNKETQDPNADTDLLIDYVVDTTISKNTAKKG
GGIYAKKAKMSRIDQLNISENSATEIGGGICCKESLELDALVSLSVTENLVGKEGGGLHAKTVNISNLKSGFSFSNNKAN
SSSTGVATTASAPAAAAASLQAAAAAVPSSPATPTYSGVVGGAIYGEKVTFSQCSGTCQFSGNQAIDNNPSQSSLNVQGG
AIYAKTSLSIGSSDAGTSYIFSGNSVSTGKSQTTGQIAGGAIYSPTVTLNCPATFSNNTASMATPKTSSEDGSSGNSIKD
TIGGAIAGTAITLSGVSRFSGNTADLGAAIGTLANANTPSATSGSQNSITEKITLENGSFIFERNQANKRGAIYSPSVSI
KGNNITFNQNTSTHDGSAIYFTKDATIESLGSVLFTGNNVTATQASSATSGQNTNTANYGAAIFGDPGTTQSSQTDAILT
LLASSGNITFSNNSLQNNQGDTPASKFCSIAGYVKLSLQAAKGKTISFFDCVHTSTKKIGSTQNVYETLDINKEENSNPY
TGTIVFSSELHENKSYIPQNAILHNGTLVLKEKTELHVVSFEQKEGSKLIMKPGAVLSNQNIANGALVINGLTIDLSSMG
TPQAGEIFSPPELRIVATTSSASGGSGVSSSIPTNPKRISAAAPSGSAATTPTMSENKVFLTGDLTLIDPNGNFYQNPML
GSDLDVPLIKLPTNTSDVQVYDLTLSGDLFPQKGYMGTWTLDSNPQTGKLQARWTFDTYRRWVYIPRDNHFYANSILGSQ
NSMIVVKQGLINNMLNNARFDDIAYNNFWVSGVGTFLAQQGTPLSEEFSYYSRGTSVAIDAKPRQDFILGAAFSKMVGKT
KAIKKMHNYFHKGSEYSYQASVYGGKFLYFLLNKQHGWALPFLIQGVVSYGHIKHDTTTLYPSIHERNKGDWEDLGWLAD
LRISMDLKEPSKDSSKRITVYGELEYSSIRQKQFTEIDYDPRHFDDCAYRNLSLPVGCAVEGAIMNCNILMYNKLALAYM
PSIYRNNPVCKYRVLSSNEAGQVICGVPTRTSARAEYSTQLYLGPFWTLYGNYTIDVGMYTLSQMTSCGARMIF
>Q9PJY1 ~~~pmpC~~~Probable outer membrane protein PmpC~~~COG3210
MKFLSATAVFAAALPSITSASSVESQIETKDLNSSRTGSSSSQSFTEIIPENGAEYRVSGDVSFSDFSNIPEEAETLAIS
HKEQPNNEVVLSEENHQASFQDSAQNQTENASEGNSPNSENTNQSSTTETESITTDEQVQNDNESAASVPTTVETATAMR
LPSYHLQTESLVEGATEEDQNQPNSQNTSSGGGAFYNSQQGPLSFINDPDKDSSLTLSKIRVIGEGGAIYSKGPLSITGL
KKLALKENLSQKAGGAICAESTISISSVDSIIFSKNTVTPPAANKPELPNDPSGSNGNDGSDDSNSSGNTDSNESNPNNS
ASNNTGSENELSSSTPSAQLPNPATPFLSSVSTNSQPIDTEPENAWHAESGSGGAIYSKGKLSIASSKEVVFDHNSATKN
GGAIFGEEEIALEKIASLKFDSNTTGEKGGAIHAKTVTLSDIKNTLIFVNNTAKTPEENSLKSSQLNNQNPSEEEHQDTS
EGEESQSLETSPITNQDSASSHVAIFRSIAASSSQSNSENIPNADGSTSAGGDAGSSSQPSTPGSDSSINHVIGGGAIYG
EAVKIENLSGYGTFSNNNAVDHQISGSTSDVLGGAIYAKTSLTIDSGNSSGTITFSENTTSSKSTTGQVAGGAIFSPSVT
ITTPVTFSKNSAINATTSSKKDTFGGAIGAISTVSLSKGARFSENIADLGSAIGLVPTTQDAETVQLTTGSYYFEKNKAL
KRATVYAPIVSIKAHTATFDQNISAEEGSAIYFTKEATIESLGSVLFTGNLVTPIQSTTVLTSGNTSKYGAAIFGQIANA
SGSQTDNLPLKLIASGGNISFRNNEYRPDATNTGQSTFCSIAGDIKLTMQAAEGKVISFFDAIRTSTKKTGTLASAYDTL
DINKSNDSGSINSAFTGTIMFSSELHENKSYIPQNVVLHSGSLILKANTELHVLSFDQKEGSSLIMEPGSVLSNQDIADG
SLVVNSLTIDLSSVGRNSASGDNIFMPPELRIVDTSTNSGNSSSTPPSSNTPPNSTPTAQAPISKNFAATTTTPTTPPTT
GNIVFLNGVIKLIDPNGTFFQNPALGSDQKISLLVLPSDQTKLQAQKVVLTGDISPKKGYTGTLTLDPQQLQNGVIQALW
TFKSYRQWAYIPRDNHFYANSILGSQMSMATVKQGLINDKLNLARFDEVAYNNLWISGLGTMLSQRGGQRSEEMTYYSRG
ASVALDAKPTQDLIIGAAFSKMIGRSKSLKLERNYTHKGSEYSYQASVYGGSPFYLTINKEAGRSLPLLLQGVISYGYIK
HDTVTHYPTIRELNKGEWEDLGWLTALRVSSILKTPKQGDSKRITVYGEVEYSSIRQKQFTETEYDPRYFSNCTYRNLAV
PVGLALEGEFKGNDILMYNRFSVAYMPSIYRNSPVCKYQVLSSGEGGEIVCGVPTRNSSRAEYSTQLYLGPLWTLYGSYT
LEADAHTLANMINCGARMTF
>O84419 ~~~pmpC~~~Probable outer membrane protein PmpC~~~
MKFMSATAVFAAALSSVTEASSIQDQIKNTDCNVSKLGYSTSQAFTDMMLADNTEYRAADSVSFYDFSTSSRLPRKHLSS
SSEASPTTEGVSSSSSGETDEKTEEELDNGGIIYAREKLTISESQDSLSNQSIELHDNSIFFGEGEVIFDHRVALKNGGA
IYGEKEVVFENIKSLLVEVNIAVEKGGSVYAKERVSLENVTEATFSSNGGEQGGGGIYSEQDMLISDCNNVHFQGNAAGA
TAVKQCLDEEMIVLLAECVDSLSEDTLDSTPETEQTESNGNQDGSSETEDTQVSESPESTPSPDDVLGKGGGIYTEKSLT
ITGITGTIDFVSNIATDSGAGVFTKENLSCTNTNSLQFLKNSAGQHGGGAYVTQTMSVTNTTSESITTPPLIGEVIFSEN
TAKGHGGGICTNKLSLSNLKTVTLTKNSAKESGGAIFTDLASIPITDTPESSTPSSSSPASTPEVVASAKINRFFASTAK
PAAPSLTEAESDQTDQTETSDTNSDIDVSIENILNVAINQNTSAKKGGAIYGKKAKLSRINNLELSGNSSQDVGGGLCLT
ESVEFDAIGSLLSHYNSAAKEGGAIHSKTVTLSNLKSTFTFADNTVKAIVESTPEAPEEIPPVEGEESTATEDPNSNTEG
SSANTNLEGSQGDTADTGTGDVNNESQDTSDTGNAESEEQLQDSTQSNEENTLPNSNIDQSNENTDESSDSHTEEITDES
VSSSSESGSSTPQDGGAASSGAPSGDQSISANACLAKSYAASTDSSPVSNSSGSEEPVTSSSDSDVTASSDNPDSSSSGD
SAGDSEEPTEPEAGSTTETLTLIGGGAIYGETVKIENFSGQGIFSGNKAIDNTTEGSSSKSDVLGGAVYAKTLFNLDSGS
SRRTVTFSGNTVSSQSTTGQVAGGAIYSPTVTIATPVVFSKNSATNNANNTTDTQRKDTFGGAIGATSAVSLSGGAHFLE
NVADLGSAIGLVPGTQNTETVKLESGSYYFEKNKALKRATIYAPVVSIKAYTATFNQNRSLEEGSAIYFTKEASIESLGS
VLFTGNLVTLTLSTTTEGTPATTSGDVTKYGAAIFGQIASSNGSQTDNLPLKLIASGGNICFRNNEYRPTSSDTGTSTFC
SIAGDVKLTMQAAKGKTISFFDAIRTSTKKTGTQATAYDTLDINKSEDSETVNSAFTGTILFSSELHENKSYIPQNVVLH
SGSLVLKPNTELHVISFEQKEGSSLVMTPGSVLSNQTVADGALVINNMTIDLSSVEKNGIAEGNIFTPPELRIIDTTTGG
SGGTPSTDSESNQNSDDTEEQNNNDASNQGESANGSSSPAVAAAHTSRTRNFAAAATATPTTTPTATTTTSNQVILGGEI
KLIDPNGTFFQNPALRSDQQISLLVLPTDSSKMQAQKIVLTGDIAPQKGYTGTLTLDPDQLQNGTISVLWKFDSYRQWAY
VPRDNHFYANSILGSQMLMVTVKQGLLNDKMNLARFEEVSYNNLWISGLGTMLSQVGTPTSEEFTYYSRGASVALDAKPA
HDVIVGAAFSKMIGKTKSLKRENNYTHKGSEYSYQASVYGGKPFHFVINKKTEKSLPLLLQGVISYGYIKHDTVTHYPTI
RERNKGEWEDLGWLTALRVSSVLRTPAQGDTKRITVYGELEYSSIRQKQFTETEYDPRYFDNCTYRNLAIPMGLAFEGEL
SGNDILMYNRFSVAYMLSIYRNSPTCKYQVLSSGEGGEIICGVPTRNSARGEYSTQLYLGPLWTLYGSYTIEADAHTLAH
MMNCGARMTF
>Q9PLB0 ~~~pmpD~~~Probable outer membrane protein PmpD~~~COG3210
MSSEKDKKNSCSKFSLSVVAAILASMSGLSNCSDLYAVGSSADHPAYLIPQAGLLLDHIKDIFIGPKDSQDKGQYKLIIG
EAGSFQDSNAETLPQKVEHSTLFSVTTPIIVQGIDQQDQVSSQGLVCNFSGDHSEEIFERESFLGIAFLGNGSKDGITLT
DIKSSLSGAALYSSDDLIFERIKGDIELSSCSSLERGGACSAQSILIHDCQGLTVKHCAAGVNVEGVSASDHLGFGGGAF
STTSSLSGEKSLYMPAGDIVVATCDGPVCFEGNSAQLANGGAIAASGKVLFVANEKKISFTDNQALSGGAISASSSISFQ
NCAELVFKSNLAKGVKDKCSLGGGALASLESVVLKDNLGITYEKNQSYSEGGAIFGKDCEIFENRGPVVFRDNTAALGGG
AILAQQTVAICGNKSGISFEGSKSSFGGAIACGNFSSENNSSALGSIDISNNLGDISFLRTLCTTSDLGQTDYQGGGALF
AENISLSENAGAITFKDNIVKTFASNGKMLGGGAILASGNVLISKNSGEISFVGNARAPQAIPTRSSDELSFGAQLTQTT
SGCSGGGALFGKEVAIVQNATVVFEQNRLQCGEQETHGGGGAVYGMESASIIGNSFVRFGNNYAVGNQISGGALLSKKVR
LAENTRVDFSRNIATFCGGAVQVSDGSCELINNGYVLFRDNRGQTFGGAISCLKGDVIISGNKDRVEFRDNIVTRPYFEE
NEEKVETADINSDKQEAEERSLLENIEQSFITATNQTFFLEEEKLPSEAFISAEELSKRRECAGGAIFAKRVYITDNKEP
ILFSHNFSDVYGGAIFTGSLQETDKQDVVTPEVVISGNDGDVIFSGNAAKHDKHLPDTGGGAICTQNLTISQNNGNVLFL
NNFACSGGAVRIEDHGEVLLEAFGGDIIFNGNSSFRAQGSDAIYFAGKDSRIKALNATEGHAIVFQDALVFENIEERKSS
GLLVINSQENEGYTGSVRFLGSESKVPQWIHVQQGGLELLHGAILCSYGVKQDPRAKIVLSAGSKLKILDSEQENNAEIG
DLEDSVNSEKTPSLWIGKNAQAKVPLVDIHTISIDLASFSSKAQETPEEAPQVIVPKGSCVHSGELSLELVNTTGKGYEN
HALLKNDTQVSLMSFKEENDGSLEDLSKLSVSDLRIKVSTPDIVEETYGHMGDWSEATIQDGALVINWHPTGYKLDPQKA
GSLVFNALWEEEAVLSTLKNARIAHNLTIQRMEFDYSTNAWGLAFSSFRELSSEKLVSVDGYRGSYIGASAGIDTQLMED
FVLGISTASFFGKMHSQNFDAEISRHGFVGSVYTGFLAGAWFFKGQYSLGETHNDMTTRYGVLGESNATWKSRGVLADAL
VEYRSLVGPARPKFYALHFNPYVEVSYASAKFPSFVEQGGEARAFEETSLTNITVPFGMKFELSFTKGQFSETNSLGIGC
AWEMYRKVEGRSVELLEAGFDWEGSPIDLPKQELRVALENNTEWSSYFSTALGVTAFCGGFSSMDNKLGYEANAGMRLIF
>O84818 ~~~pmpD~~~Probable outer membrane protein PmpD~~~
MSSEKDIKSTCSKFSLSVVAAILASVSGLASCVDLHAGGQSVNELVYVGPQAVLLLDQIRDLFVGSKDSQAEGQYRLIVG
DPSSFQEKDADTLPGKVEQSTLFSVTNPVVFQGVDQQDQVSSQGLICSFTSSNLDSPRDGESFLGIAFVGDSSKAGITLT
DVKASLSGAALYSTEDLIFEKIKGGLEFASCSSLEQGGACAAQSILIHDCQGLQVKHCTTAVNAEGSSANDHLGFGGGAF
FVTGSLSGEKSLYMPAGDMVVANCDGAISFEGNSANFANGGAIAASGKVLFVANDKKTSFIENRALSGGAIAASSDIAFQ
NCAELVFKGNCAIGTEDKGSLGGGAISSLGTVLLQGNHGITCDKNESASQGGAIFGKNCQISDNEGPVVFRDSTACLGGG
AIAAQEIVSIQNNQAGISFEGGKASFGGGIACGSFSSAGGASVLGTIDISKNLGAISFSRTLCTTSDLGQMEYQGGGALF
GENISLSENAGVLTFKDNIVKTFASNGKILGGGAILATGKVEITNNSEGISFTGNARAPQALPTQEEFPLFSKKEGRPLS
SGYSGGGAILGREVAILHNAAVVFEQNRLQCSEEEATLLGCCGGGAVHGMDSTSIVGNSSVRFGNNYAMGQGVSGGALLS
KTVQLAGNGSVDFSRNIASLGGGALQASEGNCELVDNGYVLFRDNRGRVYGGAISCLRGDVVISGNKGRVEFKDNIATRL
YVEETVEKVEEVEPAPEQKDNNELSFLGRAEQSFITAANQALFASEDGDLSPESSISSEELAKRRECAGGAIFAKRVRIV
DNQEAVVFSNNFSDIYGGAIFTGSLREEDKLDGQIPEVLISGNAGDVVFSGNSSKRDEHLPHTGGGAICTQNLTISQNTG
NVLFYNNVACSGGAVRIEDHGNVLLEAFGGDIVFKGNSSFRAQGSDAIYFAGKESHITALNATEGHAIVFHDALVFENLE
ERKSAEVLLINSRENPGYTGSIRFLEAESKVPQCIHVQQGSLELLNGATLCSYGFKQDAGAKLVLAAGAKLKILDSGTPV
QQGHAISKPEAEIESSSEPEGAHSLWIAKNAQTTVPMVDIHTISVDLASFSSSQQEGTVEAPQVIVPGGSYVRSGELNLE
LVNTTGTGYENHALLKNEAKVPLMSFVASGDEASAEISNLSVSDLQIHVVTPEIEEDTYGHMGDWSEAKIQDGTLVISWN
PTGYRLDPQKAGALVFNALWEEGAVLSALKNARFAHNLTAQRMEFDYSTNVWGFAFGGFRTLSAENLVAIDGYKGAYGGA
SAGVDIQLMEDFVLGVSGAAFLGKMDSQKFDAEVSRKGVVGSVYTGFLAGSWFFKGQYSLGETQNDMKTRYGVLGESSAS
WTSRGVLADALVEYRSLVGPVRPTFYALHFNPYVEVSYASMKFPGFTEQGREARSFEDASLTNITIPLGMKFELAFIKGQ
FSEVNSLGISYAWEAYRKVEGGAVQLLEAGFDWEGAPMDLPRQELRVALENNTEWSSYFSTVLGLTAFCGGFTSTDSKLG
YEANTGLRLIF
>Q9PL47 ~~~pmpE~~~Probable outer membrane protein PmpE~~~COG3210
MKKLFFFVLIGSSILGFTREVPPSILLKPILNPYHMTGLFFPKVNLLGDTHNLTDYHLDNLKCILACLQRTPYEGAAFTV
TDYLGFSDTQKDGIFCFKNLTPESGGVIGSPTQNTPTIKIHNTIGPVLFENNTCHRLWTQTDPENEGNKAREGGAIHAGD
VYISNNQNLVGFIKNFAYVQGGAISANTFAYKENKSSFLCLNNSCIQTKTGGKGGAIYVSTSCSFENNNKDLLFIQNSGC
AGGAIFSPTCSLIGNQGDIVFYSNHGFKNVDNATNESGDGGAIKVTTRLDITNNGSQIFFSDNISRNFGGAIHAPCLHLV
GNGPTYFTNNIANHTGGAIYITGTETSKISADHHAIIFDNNISANATNADGSSSNTNPPHRNAITMDNSAGGIELGAGKS
QNLIFYDPIQVTNAGVTVDFNKDASQTGCVVFSGATVLSADISQANLQTKTPATLTLSHGLLCIEDRAQLTVNNFTQTGG
IVALGNGAVLSSYQHSTTDATQTPPTTTTTDASVTLNHIGLNLPSILKDGAEMPLLWVEPISTTQGNTTTYTSDTAASFS
LNGATLSLIDEDGNSPYENTDLSRALYAQPMLAISEASDNQLQSESMDFSKVNVPHYGWQGLWTWGWAKTENPTTTPPAT
ITDPKKANQFHRTLLLTWLPAGYIPSPKHKSPLIANTLWGNILFATENLKNSSGQELLDRPFWGITGGGLGMMVYQEPRK
DHPGFHMHTSGYSAGMITGNTHTFSLRFSQSYTKLNERYAKNYVSSKNYSCQGEMLLSLQEGLMLTKLIGLYSYGNHNSH
HFYTQGEDLSSQGEFHSQTFGGAVFFDLPLKPFGRTHILTAPFLGAIGMYSKLSSFTEVGAYPRTFITETPLINVLIPIG
VKGSFMNATHRPQAWTVELAYQPVLYRQEPSISTQLLAGKGMWFGHGSPASRHALAYKISQKTQLLRFATLQLQYHGYYS
SSTFCNYLNGEVSLRF
>O84877 ~~~pmpE~~~Probable outer membrane protein PmpE~~~
MKKAFFFFLIGNSLSGLAREVPSRIFLMPNSVPDPTKESLSNKISLTGDTHNLTNCYLDNLRYILAILQKTPNEGAAVTI
TDYLSFFDTQKEGIYFAKNLTPESGGAIGYASPNSPTVEIRDTIGPVIFENNTCCRLFTWRNPYAADKIREGGAIHAQNL
YINHNHDVVGFMKNFSYVQGGAISTANTFVVSENQSCFLFMDNICIQTNTAGKGGAIYAGTSNSFESNNCDLFFINNACC
AGGAIFSPICSLTGNRGNIVFYNNRCFKNVETASSEASDGGAIKVTTRLDVTGNRGRIFFSDNITKNYGGAIYAPVVTLV
DNGPTYFINNIANNKGGAIYIDGTSNSKISADRHAIIFNENIVTNVTNANGTSTSANPPRRNAITVASSSGEILLGAGSS
QNLIFYDPIEVSNAGVSVSFNKEADQTGSVVFSGATVNSADFHQRNLQTKTPAPLTLSNGFLCIEDHAQLTVNRFTQTGG
VVSLGNGAVLSCYKNGTGDSASNASITLKHIGLNLSSILKSGAEIPLLWVEPTNNSNNYTADTAATFSLSDVKLSLIDDY
GNSPYESTDLTHALSSQPMLSISEASDNQLQSENIDFSGLNVPHYGWQGLWTWGWAKTQDPEPASSATITDPQKANRFHR
TLLLTWLPAGYVPSPKHRSPLIANTLWGNMLLATESLKNSAELTPSGHPFWGITGGGLGMMVYQDPRENHPGFHMRSSGY
SAGMIAGQTHTFSLKFSQTYTKLNERYAKNNVSSKNYSCQGEMLFSLQEGFLLTKLVGLYSYGDHNCHHFYTQGENLTSQ
GTFRSQTMGGAVFFDLPMKPFGSTHILTAPFLGALGIYSSLSHFTEVGAYPRSFSTKTPLINVLVPIGVKGSFMNATHRP
QAWTVELAYQPVLYRQEPGIAAQLLASKGIWFGSGSPSSRHAMSYKISQQTQPLSWLTLHFQYHGFYSSSTFCNYLNGEI
ALRF
>Q9PL46 ~~~pmpF~~~Probable outer membrane protein PmpF~~~COG3210
MTRRILPLSLVFIPLSCISASETDTLKLPNLTFGGREIEFIVTPPSSIAAQYITYANVSNYRGNFTISSCTQDQWFSRGL
STTNSSGAFVESMTSFTAIDNADLFFCNNYCTHQGGGGAINATGLISFKNNQNILFYNNTTIGTQFTGVALRTERNRGGA
LYGSSIELINNHSLNFINNTSGDMGGAVSTIQNLVIKNTSGIVAFENNHTTDHIPNTFATILARGGAVGCQGACEISHNT
GPVVFNSNYGGYGGAISTGGQCIFRDNKDKLIFINNSALGWHNTSAQGNGAVISAGGEFGLLNNKGPIYFENNNASYIAG
AISCNNLNFQENGPIYFLNNSALYGGAFHLFASPAANYIHTGSGDIIFNNNTELSTTGMSAGLRKLFYIPGTTNNNPITL
SLGAKKDTRIYFYDLFQWGGLKKANTPPENSPHTVTINPSDEFSGAVVFSYKNISSDLQAHMIASKTHNQIKDSPTTLKF
GTMSIENGAEFEFFNGPLTQESTSLLALGQDSILTVGKDASLTITHLGIILPGLLNDQGTTAPRIRVNPQDMTQNTNSNQ
APVSTENVATQKIFFSGLVSLVDENYESVYDSCDLSRGKANQPILHIETTNDAQLSNDWKNTLNTSLYSLPHYGYQGLWT
SNWMTTTRTVSLTNSTETQTANNSIQEQKNTSETFDSNSTTTAKIPSIRASTGGTTPLATTDVTVTRHSLVVSWTPIGYI
ADPARRGDLIANNLVSSGRNTTLYLRSLLPDDSWFALQGSAATLFTKQQKRLDYHGYSSASKGYAISSQASGAHGHKFLF
SFSQSSDTMKEKRTNNKISSRYYLSALCFEQPMFDRIALIGAAAYNYGTHKTYNFYGTKKFSKGNFHSTTLGGSLRCELR
DSMPFQSIMLTPFIQALISRTEPASIQEQGDLARLFSLKQPHTAVVSPIGIKGVYSSNKWPTVSCEMEVAYQPTLYWKRP
ILNTVLIKNNGSWETTNTPLAKHSFYGRGSSSLKFSYLKLFANYQAQVATSTVSHYMNAGGALVF
>P38008 ~~~pmpF~~~Probable outer membrane protein PmpF~~~
MIKRTSLSFACLSFFYLSTISILQANETDTLQFRRFTFSDREIQFVLDPASLITAQNIVLSNLQSNGTGACTISGNTQTQ
IFSNSVNTTADSGGAFDMVTTSFTASDNANLLFCNNYCTHNKGGGAIRSGGPIRFLNNQDVLFYNNISAGAKYVGTGDHN
EKNRGGALYATTITLTGNRTLAFINNMSGDCGGAISADTQISITDTVKGILFENNHTLNHIPYTQAENMARGGAICSRRD
LCSISNNSGPIVFNYNQGGKGGAISATRCVIDNNKERIIFSNNSSLGWSQSSSASNGGAIQTTQGFTLRNNKGSIYFDSN
TATHAGGAINCGYIDIRDNGPVYFLNNSAAWGAAFNLSKPRSATNYIHTGTGDIVFNNNVVFTLDGNLLGKRKLFHINNN
EITPYTLSLGAKKDTRIYFYDLFQWERVKENTSNNPPSPTSRNTITVNPETEFSGAVVFSYNQMSSDIRTLMGKEHNYIK
EAPTTLKFGTLAIEDDAELEIFNIPFTQNPTSLLALGSGATLTVGKHGKLNITNLGVILPIILKEGKSPPCIRVNPQDMT
QNTGTGQTPSSTSSISTPMIIFNGRLSIVDENYESVYDSMDLSRGKAEQLILSIETTNDGQLDSNWQSSLNTSLLSPPHY
GYQGLWTPNWITTTYTITLNNNSSAPTSATSIAEQKKTSETFTPSNTTTASIPNIKASAGSGSGSASNSGEVTITKHTLV
VNWAPVGYIVDPIRRGDLIANSLVHSGRNMTMGLRSLLPDNSWFALQGAATTLFTKQQKRLSYHGYSSASKGYTVSSQAS
GAHGHKFLLSFSQSSDKMKEKETNNRLSSRYYLSALCFEHPMFDRIALIGAAACNYGTHNMRSFYGTKKSSKGKFHSTTL
GASLRCELRDSMPLRSIMLTPFAQALFSRTEPASIRESGDLARLFTLEQAHTAVVSPIGIKGAYSSDTWPTLSWEMELAY
QPTLYWKRPLLNTLLIQNNGSWVTTNTPLAKHSFYGRGSHSLKFSHLKLFANYQAEVATSTVSHYINAGGALVF
>Q9PL45 ~~~pmpG~~~Probable outer membrane protein PmpG~~~COG3210
MMQTPFHKFFLLAMLSYSLLQGGHAADISMPPGIYDGTTLTAPFPYTVIGDPRGTKVTSSGSLELKNLDNSIATLPLSCF
GNLLGNFTIAGRGHSLVFENIRTSTNGAALSNHAPSGLFVIEAFDELSLLNCNSLVSVVPQTGGTTTSVPSNGTIYSRTD
LVLRDIKKVSFYSNLVSGDGGAIDAQSLMVNGIEKLCTFQENVAQSDGGACQVTKTFSAVGNKVPLSFLGNVAGNKGGGV
AAVKDGQGAGGATDLSVNFANNTAVEFEGNSARIGGGIYSDGNISFLGNAKTVFLSNVASPIYVDPAAAGGQPPADKDNY
GDGGAIFCKNDTNIGEVSFKDEGVVFFSKNIAAGKGGAIYAKKLTISDCGPVQFLGNVANDGGAIYLVDQGELSLSADRG
DIIFDGNLKRMATQGAATVHDVMVASNAISMATGGQITTLRAKEGRRILFNDPIEMANGQPVIQTLTVNEGEGYTGDIVF
AKGDNVLYSSIELSQGRIILREQTKLLVNSLTQTGGSVHMEGGSTLDFAVTTPPAANSMALTNVHFSLASLLKNNGVTNP
PTNPPVQVSSPAVIGNTAAGTVTISGPIFFEDLDETAYDNNQWLGADQTIDVLQLHLGANPPANAPTDLTLGNESSKYGY
QGSWTLQWEPDPANPPQNNSYMLKASWTKTGYNPGPERVASLVSNSLWGSILDVRSAHSAIQASIDGRAYCRGIWISGIS
NFFYHDQDALGQGYRHISGGYSIGANSYFGSSMFGLAFTETFGRSKDYVVCRSNDHTCVGSVYLSTRQALCGSCLFGDAF
VRASYGFGNQHMKTSYTFAEESNVRWDNNCVVGEVGAGLPIMLAASKLYLNELRPFVQAEFAYAEHESFTERGDQAREFK
SGHLMNLSIPVGVKFDRCSSKHPNKYSFMGAYICDAYRSISGTETTLLSHKETWTTDAFHLARHGVMVRGSMYASLTGNI
EVYGHGKYEYRDASRGYGLSIGSKIRF
>O84879 ~~~pmpG~~~Probable outer membrane protein PmpG~~~
MQTSFHKFFLSMILAYSCCSLSGGGYAAEIMIPQGIYDGETLTVSFPYTVIGDPSGTTVFSAGELTLKNLDNSIAALPLS
CFGNLLGSFTVLGRGHSLTFENIRTSTNGAALSDSANSGLFTIEGFKELSFSNCNSLLAVLPAATTNNGSQTPTTTSTPS
NGTIYSKTDLLLLNNEKFSFYSNLVSGDGGAIDAKSLTVQGISKLCVFQENTAQADGGACQVVTSFSAMANEAPIAFIAN
VAGVRGGGIAAVQDGQQGVSSSTSTEDPVVSFSRNTAVEFDGNVARVGGGIYSYGNVAFLNNGKTLFLNNVASPVYIAAE
QPTNGQASNTSDNYGDGGAIFCKNGAQAAGSNNSGSVSFDGEGVVFFSSNVAAGKGGAIYAKKLSVANCGPVQFLGNIAN
DGGAIYLGESGELSLSADYGDIIFDGNLKRTAKENAADVNGVTVSSQAISMGSGGKITTLRAKAGHQILFNDPIEMANGN
NQPAQSSEPLKINDGEGYTGDIVFANGNSTLYQNVTIEQGRIVLREKAKLSVNSLSQTGGSLYMEAGSTLDFVTPQPPQQ
PPAANQLITLSNLHLSLSSLLANNAVTNPPTNPPAQDSHPAIIGSTTAGSVTISGPIFFEDLDDTAYDRYDWLGSNQKID
VLKLQLGTQPSANAPSDLTLGNEMPKYGYQGSWKLAWDPNTANNGPYTLKATWTKTGYNPGPERVASLVPNSLWGSILDI
RSAHSAIQASVDGRSYCRGLWVSGVSNFFYHDRDALGQGYRYISGGYSLGANSYFGSSMFGLAFTEVFGRSKDYVVCRSN
HHACIGSVYLSTKQALCGSYLFGDAFIRASYGFGNQHMKTSYTFAEESDVRWDNNCLVGEIGVGLPIVITPSKLYLNELR
PFVQAEFSYADHESFTEEGDQARAFRSGHLMNLSVPVGVKFDRCSSTHPNKYSFMGAYICDAYRTISGTQTTLLSHQETW
TTDAFHLARHGVIVRGSMYASLTSNIEVYGHGRYEYRDTSRGYGLSAGSKVRF
>Q9PL44 ~~~pmpH~~~Probable outer membrane protein PmpH~~~COG3210
MPFSLRSTSFCFLACLCSYSYGLASSPQVLTPNVIIPFKGDDIYLNGDCVFASIYAGAEQGSIISANGQNLTIVGQNHTL
SFTDSQGPALQNCAFISAEEKISLRDFSSLLFSKNVSCGEKGMISGKTVSISGGDSIVFKDNSVGYSSLPSVGQTPTTPI
VGDVLKGSIFCVETGLEISGVKKELVFDNTAGNFGAVFCSRAAQGDTTFTVKDCKGKILFQDNVGSCGGGVIYKGEVLFQ
DNEGEMLFRGNSAHDDLGILDANPQPPTEVGGGGGVICTPEKTVTFKGNKGPITFDYNFAKGRGGAIQSQTFSLVADSAV
VFSNNTAEKGGGAIYALEVNVSTNGGSILFEGNRASEGGAICVSEPIAANNGGLTLHAADGDIIFSKNMTSDRPGERSAI
RILDSGTNVSLNASGASKMIFYDPVVQNNPATPPTGTSGEIKINESGSGSVVFTAETLTPSEKLNVINATSNFPGNLTVS
SGELVVTKGATLTVGNITATSGRVTLGSGASLSAVAGTAGTCTVSKLGIDLESFLVPTYETAKLGADTTVAVNNNPTLDL
VMANETEMYDNPLFMNAVTIPFVTLVSLQTTGGVTTSAVTLNNADTAHYGYQGSWSADWRRPPLAPDPSGMTPLDKSNTL
YVTWRPSSNYGVYKLDPQRRGELVPNSLWVSGSALRTFTNGLKEHYVSRDVGFIASVQALGDYVLNYKQGNRDGFLARYG
GFQAVAASHYENGGIFGVAFGQLYGQTKSRLYDSKDAGNITILSCFGRSYIDVKGTETVVYWETAYGYSVHRMHTQYFNG
KTNKFDHSKCRWHNNSYYAFVGAEHNFLEYCIPTRQLARDYDLTGFMRFEMSGGWSSGAKETGALPRHFDRGTGHNMSLP
IGVVAHAVSNGRRSPPSKLTINMGYRPDIWRVTPHCNMKIIANGVKTPIQGSPLARHAFFLEVHDTLYVRHLGRAYMNYS
LDARHRQTTHFVSLGLNRIF
>O84880 ~~~pmpH~~~Probable outer membrane protein PmpH~~~
MPFSLRSTSFCFLACLCSYSYGFASSPQVLTPNVTTPFKGDDVYLNGDCAFVNVYAGAENGSIISANGDNLTITGQNHTL
SFTDSQGPVLQNYAFISAGETLTLKDFSSLMFSKNVSCGEKGMISGKTVSISGAGEVIFWDNSVGYSPLSIVPASTPTPP
APAPAPAASSSLSPTVSDARKGSIFSVETSLEISGVKKGVMFDNNAGNFGTVFRGNSNNNAGSGGSGSATTPSFTVKNCK
GKVSFTDNVASCGGGVVYKGTVLFKDNEGGIFFRGNTAYDDLGILAATSRDQNTETGGGGGVICSPDDSVKFEGNKGSIV
FDYNFAKGRGGSILTKEFSLVADDSVVFSNNTAEKGGGAIYAPTIDISTNGGSILFERNRAAEGGAICVSEASSGSTGNL
TLSASDGDIVFSGNMTSDRPGERSAARILSDGTTVSLNASGLSKLIFYDPVVQNNSAAGASTPSPSSSSMPGAVTINQSG
NGSVIFTAESLTPSEKLQVLNSTSNFPGALTVSGGELVVTEGATLTTGTITATSGRVTLGSGASLSAVAGAANNNYTCTV
SKLGIDLESFLTPNYKTAILGADGTVTVNSGSTLDLVMESEAEVYDNPLFVGSLTIPFVTLSSSSASNGVTKNSVTINDA
DAAHYGYQGSWSADWTKPPLAPDAKGMVPPNTNNTLYLTWRPASNYGEYRLDPQRKGELVPNSLWVAGSALRTFTNGLKE
HYVSRDVGFVASLHALGDYILNYTQDDRDGFLARYGGFQATAASHYENGSIFGVAFGQLYGQTKSRMYYSKDAGNMTMLS
CFGRSYVDIKGTETVMYWETAYGYSVHRMHTQYFNDKTQKFDHSKCHWHNNNYYAFVGAEHNFLEYCIPTRQFARDYELT
GFMRFEMAGGWSSSTRETGSLTRYFARGSGHNMSLPIGIVAHAVSHVRRSPPSKLTLNMGYRPDIWRVTPHCNMEIIANG
VKTPIQGSPLARHAFFLEVHDTLYIHHFGRAYMNYSLDARRRQTAHFVSMGLNRIF
>Q9PL41 ~~~pmpI~~~Probable outer membrane protein PmpI~~~COG3210
MRPDHVNLCCLCATILSPTAILFGQDALDKSALITKNPNSIVCTFLEDCTMENFSPALLSHARQDDPLYIIGNTHNWFVS
NLHPSTNEERFLKEKGDLSIQDFRFLSFTDCSSSTEDSPSILYHKNGQLFLRNNGNMSFYRNHSEGSGGALSTDALFLQH
NYLFTNFEENSSAKNGGAIQAQTLSLSRNVSSLSFSRNRANLNGGAICCQNLICSGNVNPLFFTNNSALNGGAICCINEQ
NLSEKGCLSLAYNQETLFSGNSAKEKGGAIYTKHMVLRHNGPVSFVNNSAKLGGAIAIQSGGSLSIIAGGGSVLFQNNSC
HFSDQGTVRNAIYLEKNALLSSLEARHGDILFFDPIVQEVVSPEFSTTSALTPLRIQTNTNRAVIFSSENLSKEEKTEAN
LISKIQQPIELQSGCLVLKDRVILSAPSLSQAPQALLVMDVGTSLTTSSDLKLTTLSIPLHSIDTENSVSIQSPTLSIQK
IFLSNSEHENFYENVELLSKDQKDIPLLSLPKGLPHPDLPDGNLSSHFGYQGDWNFSWQTSDQRETLVANWTANSYIPHP
ERQSALVANTLWNTYSDMQAVQSMINTTAQGGAYLFGTWGSAVSNLFYSHGNSGKSTDNWKHRSLGYLFGISTHSLDDHS
FCLAAGQLFGKSSDSFVTSADTTSYIAAIQTQIATSLIKISAQACYNESIHELKTKYRSFSKEGFGAWHSVAVSGEIGAS
IPIVSNGSGLFSSFSIFSKLQGFSGKQDGFEESRGEARAFADSSFTNISLPVGIAFEKKSQKTRNYYHFLGAYIQDLKRC
VESGPVTLLKNSVTWDAPMANLDSRAWMFRLTNQRALHRFQTLVNMSYMLRGQSYSYSLDLGTTYRF
>O84882 ~~~pmpI~~~Probable outer membrane protein PmpI~~~
MRPDHMNFCCLCAAILSSTAVLFGQDPLGETALLTKNPNHVVCTFFEDCTMESLFPALCAHASQDDPLYVLGNSYCWFVS
KLHITDPKEALFKEKGDLSIQNFRFLSFTDCSSKESSPSIIHQKNGQLSLRNNGSMSFCRNHAEGSGGAISADAFSLQHN
YLFTAFEENSSKGNGGAIQAQTFSLSRNVSPISFARNRADLNGGAICCSNLICSGNVNPLFFTGNSATNGGAICCISDLN
TSEKGSLSLACNQETLFASNSAKEKGGAIYAKHMVLRYNGPVSFINNSAKIGGAIAIQSGGSLSILAGEGSVLFQNNSQR
TSDQGLVRNAIYLEKDAILSSLEARNGDILFFDPIVQESSSKESPLPSSLQASVTSPTPATASPLVIQTSANRSVIFSSE
RLSEEEKTPDNLTSQLQQPIELKSGRLVLKDRAVLSAPSLSQDPQALLIMEAGTSLKTSSDLKLATLSIPLHSLDTEKSV
TIHAPNLSIQKIFLSNSGDENFYENVELLSKEQNNIPLLTLSKEQSHLHLPDGNLSSHFGYQGDWTFSWKDSDEGHSLIA
NWTPKNYVPHPERQSTLVANTLWNTYSDMQAVQSMINTIAHGGAYLFGTWGSAVSNLFYAHDSSGKPIDNWHHRSLGYLF
GISTHSLDDHSFCLAAGQLLGKSSDSFITSTETTSYIATVQAQLATPLMKISAQACYNESIHELKTKYRSFSKEGFGSWH
SVAVSGEVCASIPIVSNGSGLFSSFSIFSKLQGFSGTQDGFEESSGEIRSFSASSFRNISLPMGITFEKKSQKTRNYYYF
LGAYIQDLKRDVESGPVVLLKNAVSWDAPMANLDSRAYMFRLTNQRALHRLQTLLNVSYVLRGQSHSYSLDLGTTYRF
>Q51423 ~~~pmpR~~~Transcriptional regulatory protein PmpR~~~
MAGHSKWANIKHRKERQDAKKGKIFTKLIRELTVAARQGGGVPADNPRLRLAVDKALTANMTRDTIDRAIARGVGSNDAD
NMVELSYEGYAPSGVAIIVEAMTDNRNRTAAEVRHAFSKCGGNLGTDGSVAYMFERKGQISFAPGVDEEALMDAALEAGA
DDVVVNDDGSIDVFTTFADFISVNEALAAAGFKGDEAEVTMIPSTTATLDLETAQKVLKLIDMLEDLDDVQNVYSNADIP
DDVMAQLG
>Q9HV32 ~~~pmrA~~~Response regulator protein PmrA~~~
MRILLAEDDLLLGDGIRAGLRLEGDTVEWVTDGVAAENALVTDEFDLLVLDIGLPRRSGLDILRNLRHQGLLTPVLLLTA
RDKVADRVAGLDSGADDYLTKPFDLDELQARVRALTRRTTGRALPQLVHGELRLDPATHQVTLSGQAVELAPREYALLRL
LLENSGKVLSRNQLEQSLYGWSGDVESNAIEVHVHHLRRKLGNQLIRTVRGIGYGIDQPAP
>Q9HV31 2.7.13.3~~~pmrB~~~Sensor protein kinase PmrB~~~
MSRAAVPSVRRRLLVNLLVGFVLCWLSVAALTYHLSLKQVNRLFDDDMVDFGEAALRLLDLATEDQAGEDGSITEIIERS
REAIQGLPLLRRESALGYALWRDGQPLLSSLNLPPEITAQGPGFSTVEAQGTHWRVLQLNIDGFQIWISENLIYRQHTMN
LLLFYSLFPLLLALPLLGGLVWFGVARGLAPLREVQAEVQQRSARHLQPIAVEAVPLEIRGLIDELNLLLERLRTALEAE
RRLTSDAAHEIRTPLASLRTHAQVALRSEDPKAHARGLLQVSRSVERISTLMEQILLLARLDGDALLEQFHPVNLATLAE
DVLSELARQAIDKDIELSLHQETVYVMGIDLWLKAMVGNLVGNALRYTPAGGQVEIRVENRAQHAVLRVRDNGPGVALEE
QQAIFTRFYRSPATSSGEGSGLGLPIVKRIVELHFGSIGLGKGLEGKGLEVQVFLPKTQPDATRPPARGPDSGRSHI
>P37590 ~~~pmrD~~~Signal transduction protein PmrD~~~
MEWLVKKSCCNKQDNRHVLMLCDAGGAIKMIAEVKSDFAVKVGDLLSPLQNALYCINREKLHTVKVLSASSYSPDEWERQ
CKVAGKTQ
>P37589 ~~~pmrD~~~Signal transduction protein PmrD~~~
MEWLVKKSHYVKKRACHVLVLCDSGGSLKMIAEANSMILLSPGDILSPLQDAQYCINREKHQTLKIVDARCYSCDEWQRL
TRKPS
>Q1IAL0 6.2.1.70~~~pmsD~~~Pseudomonine synthase PmsD~~~COG1020
MRELTSMQAACWIGRTAHAALGKVSAHLYAEFDGHAIDLERLRAALQQVSLLHPMLRLRIDQDGLQGIAPMEQAPRLEVD
DLRGLAEPEVAQRLLRKREAWTHQQLDLRHGCAARFSVSQLEGERSRLHVDTDMIAIDPSSLRVLMEDLARCYEAPDAPV
ATPPSFFAWWDAVRADPALKATQERDRQWWRARLDSIAPAPTLPLLDVQPAQAHSQRLTTWLDANQHIALRQLARERKVT
LSTLMLGLFASALGAQTGDRQFRLNVPSFWRAPLIDGVERIVGEFANVLIVDVDLDAAPDIAALCNQLAKTTAACMAHSA
YPGVNLMRDLSRHHGTPQLAPVVFTAALDMPGKELFSPRVKNAFGPLGWLISQGPQVALDAQIACADGGILINWDIRLDA
LPEAWVTQLFERFVDLASEVARQPATLDQALPRAPEKRPLNPLQQAYLLGRTTQMPLGGVAMQEFREYRGMMDSAVLRSR
LDAMVRRHPSLRTRIDADRRVQYVSDEVRLNLDEVDLGHLPLAEALGVIDARREDYAHALSPLDRSPWNVTVFHLAHGER
VVFVRLDALILDGRSIATLLVELFEGQLEETPQVDTGAPKADQTEQRTADAAYWNTKLAAVDGAPRLPWSVPLDQAGVAR
YERQSLVVPRETFKKFCMVGARQRLFKNTTLMAVILEVLSHWVSEGGLCVAVPVAPPSDAAFANRSSFLAINWNRAAGSF
AERSAGLQVDVLEGLQHLAYSGVDLARQLFERHGPGPVLPVVITNGFSWPVAPSDSAMRLQGGLTQTPQVAMDIRFWADA
DGALQLDIDYAREVLAPALVSDFLGTLGRAIGQIAGAGEFALAPAALIDTDHYRLNSPVEAACRDGYLARIADHLFTPGN
HKTALISGERRLSYSELGDGVARIIAALRARGIGQGQVVAICLPRSPEHTMLTLACALTGVIWVPIDVAAPAERRHYLLE
NCHPDLVVLGQAQTLEQPSTTCAALLATPAAAPGHLADLSLNEAPGYYLYTSGTTGKPKCVVLNNRATANVIGSTLAEWR
VTERDVFLSVTPLHHDMSVFDVFGSLAAGATLVLPAPGEDKDALRWNQLVAEHQVTLWCSVPAILEMLLACRGEHGLQSL
RLIAQGGDYIKPAVIAGLRELLPQARLISLGGPTETTIWSIWHEIGADDRKLIPYGRPLPGNRYFVLDAQGRHCPVGVVG
RIHTAGANLALGYLLDGALQQSDFITVDDEHGQAVRAFRTGDCGRYRVDGTLLFDSRVNGYVKVRGVRVSLPDIEMVLNQ
HPALRHVLVVDYGEPRLGEVCLGALYVCDPQAAEPSMAELRDYAREHLPHSHVPTRLLGVAALPLSQNGKPDRRRARELL
SAPATASVRDKVLAIYLQVLGHSNEAGTDSAVDFISLGLRPPHLKAVAAQLQAQFGVSLSPGQLLRCRNAQEVERLLG
>Q1IAK8 6.2.1.61~~~pmsE~~~Pseudomonine synthase PmsE~~~COG1021
MTIEFTDWPQDRAQRYRDAGYWIDQPLTEILHSRCQAQPQALAIICGERRFTYGELDTLSSILASRLAEQGLGQGDTALV
QLPNVAEFYIVLFALLKAGIVPLNALFSHRRLELTAYAKQIVPKLLIASREHEVFRDDAYVQAFAEVGAAPAVTLLLGES
DPAASLAHWIETPGSQPVAYAPTAADQVALFQLSGGSTGIPKLIPRTHNDYHYNARACADVCALNAHTRFLCAVPAAHNF
LLSSPGALGVFHAGGCVVMAASPEPLSCFALVEQHEVNTVALVPSAVALWLQAAPAHRDKLQSLAYLQVGGAVFADSLAR
QVPGVLGCQLQQVFGMAEGLINYTRLDDSDEQIFTTQGRPVSPDDEIKIVDEQGVPVAPGEPGMLATRGPYTFCGYYKAP
EQNASAFDAEGFYYSGDLVVLTPSGDLRVVGRIKDQINRGGEKVASEEIENLLVLHPEVTHAGLVAMPDEALGEKSCAFV
VSRNPSLKAPALRRHLMELGIAEYKLPDRIRLIEAMPLTAVGKIDKKQLRHLVSVENTRTWLQTRLRQLIEDSEELDPEE
NLIFYGLDSLQVMKLAAELKARGIEVSFEELASTPTLASWWALVEARQKAA
>P9WN05 2.4.1.109~~~pmt~~~Probable dolichyl-phosphate-mannose--protein mannosyltransferase~~~COG4346
MTARPPESCVLAKDRPEEPVVPVVSPGPLVPVADFGPLDRLRGWIVTGLITLLATVTRFLNLGSLTDAGTPIFDEKHYAP
QAWQVLNNHGVEDNPGYGLVVHPPVGKQLIAIGEAIFGYNGFGWRFTGALLGVVLVALVVRIVRRISRSTLVGAIAGVLL
ICDGVSFVTARTALLDGFLTFFVVAAFGALIVDRDQVRERMHIALLAGRSAATVWGPRVGVRWWRFGAGVLLGLACATKW
SGVYFVLFFGAMALAFDVAARRQYQVQRPWLGTVRRDVLPSGYALGLIPFAVYLATYAPWFASETAIDRHAVGQAVGRNS
VVPLPDAVRSLWHYTAKAFHFHAGLTNSAGNYHPWESKPWTWPMSLRPVLYAIDQQDVAGCGAQSCVKAEMLVGTPAMWW
LAVPVLAYAGWRMFVRRDWRYAVVLVGYCAGWLPWFADIDRQMYFFYAATMAPFLVMGISLVLGDILYHPGQGSERRTLG
LIVVCCYVALVVTNFAWLYPVLTGLPISQQTWNLEIWLPSWR
>H8ZPX1 1.4.2.3~~~pao~~~Pseudooxynicotine oxidase~~~
MANDKGDISKDGVSRRKFLGGAVIGAAAAAGVGSQILSLSATAQGADKERVGPLQSNVDYDAVVIGGGFAGVTAARELSR
SGLKTLVLEGRSRLGGRTFTSKLDGEKVELGGTWVHWTQPNVWTEVMHYGLEIEETVGLASPETVIWVTDNQVKRAPAAE
AFEIFGAACTEYYKEAHNIYPRPFDPFFAKKALQEMDGLSASEYLNKLSLTREQKDMMDSWLSGNGHNYPETIAYSEIMR
WFALSNFNMPTMFDSIARYKIKSGTVSLLEAMVAESDMEVQLSTPVLKVKQDSHRVLITTEEGTIAASAVVMAVPLNTMG
DVEYSPRLSDAKSEIASQGHAGKGVKGYIRIKQDVGNVMTYAPARNDVTPFTSVFTDHVGENGTLLIAFSADPKLVDIND
SKAVEKALHPLLPGVEVTSSYGYDWNLDPFSKGTWCTYRPGQTTRYLTELQKREGRLFFAGSDMANGWRGFIDGAIESGR
EVGYQVASYLKGKNSNA
>P37967 3.1.1.-~~~pnbA~~~Para-nitrobenzyl esterase~~~COG2272
MTHQIVTTQYGKVKGTTENGVHKWKGIPYAKPPVGQWRFKAPEPPEVWEDVLDATAYGSICPQPSDLLSLSYTELPRQSE
DCLYVNVFAPDTPSKNLPVMVWIHGGAFYLGAGSEPLYDGSKLAAQGEVIVVTLNYRLGPFGFLHLSSFNEAYSDNLGLL
DQAAALKWVRENISAFGGDPDNVTVFGESAGGMSIAALLAMPAAKGLFQKAIMESGASRTMTKEQAASTSAAFLQVLGIN
EGQLDKLHTVSAEDLLKAADQLRIAEKENIFQLFFQPALDPKTLPEEPEKAIAEGAASGIPLLIGTTRDEGYLFFTPDSD
VHSQETLDAALEYLLGKPLAEKVADLYPRSLESQIHMMTDLLFWRPAVAYASAQSHYAPVWMYRFDWHPKKPPYNKAFHA
LELPFVFGNLDGLERMAKAEITDEVKQLSHTIQSAWITFAKTGNPSTEAVNWPAYHEETRETLILDSEITIENDPESEKR
QKLFPSKGE
>Q6F6U3 3.5.1.19~~~pncA~~~Nicotinamidase~~~COG1335
MLELRMKQANNCALIVVDVQNGFTPGGNLAVAGADQIIPCINQLGTCFDTIVITQDWHPHNHISFASNHLGKQPFDTIQL
PYGPQVLWPSHCVQGTQDAELHPALDLPTAQLIIRKGFHRNIDSYSAFMEADRHTSTGLAGYLKERGIDTVYIVGIATDF
CVAWTAIDASKAGLNSYVIIDACKAIDMNGSLQHAWQEMLASGVQRISSRNILV
>P21369 3.5.1.19~~~pncA~~~Nicotinamidase~~~COG1335
MPPRALLLVDLQNDFCAGGALAVPEGDSTVDVANRLIDWCQSRGEAVIASQDWHPANHGSFASQHGVEPYTPGQLDGLPQ
TFWPDHCVQNSEGAQLHPLLHQKAIAAVFHKGENPLVDSYSAFFDNGRRQKTSLDDWLRDHEIDELIVMGLATDYCVKFT
VLDALQLGYKVNVITDGCRGVNIQPQDSAHAFMEMSAAGATLYTLADWEETQG
>I6XD65 3.5.1.19~~~pncA~~~Nicotinamidase/pyrazinamidase~~~COG1335
MRALIIVDVQNDFCEGGSLAVTGGAALARAISDYLAEAADYHHVVATKDFHIDPGDHFSGTPDYSSSWPPHCVSGTPGAD
FHPSLDTSAIEAVFYKGAYTGAYSGFEGVDENGTPLLNWLRQRGVDEVDVVGIATDHCVRQTAEDAVRNGLATRVLVDLT
AGVSADTTVAALEEMRTASVELVCSS
>P9WJI9 6.3.4.21~~~pncB1~~~Nicotinate phosphoribosyltransferase pncB1~~~COG1488
MGPPPAARRREGEPDNQDPAGLLTDKYELTMLAAALRDGSANRPTTFEVFARRLPTGRRYGVVAGTGRLLEALPQFRFDA
DACELLAQFLDPATVRYLREFRFRGDIDGYAEGELYFPGSPVLSVRGSFAECVLLETLVLSIFNHDTAIASAAARMVSAA
GGRPLIEMGSRRTHERAAVAAARAAYIAGFAASSNLAAQRRYGVPAHGTAAHAFTMLHAQHGGPTELAERAAFRAQVEAL
GPGTTLLVDTYDVTTGVANAVAAAGAELGAIRIDSGELGVLARQAREQLDRLGATRTRIVVSGDLDEFSIAALRGEPVDS
YGVGTSLVTGSGAPTANMVYKLVEVDGVPVQKRSSYKESPGGRKEALRRSRATGTITEELVHPAGRPPVIVEPHRVLTLP
LVRAGQPVADTSLAAARQLVASGLRSLPGDGLKLAPGEPAIPTRTIPA
>P9WJI6 6.3.4.21~~~pncB2~~~Nicotinate phosphoribosyltransferase pncB2~~~
MAIRQHVGALFTDLYEVTMAQAYWAERMSGTAVFEIFFRKLPPGRSYIMAAGLADVVEFLEAFRFDEQDLRYLRGLGQFS
DEFLRWLAGVRFTGDVWAAPEGTVIFPNEPAVQLIAPIIEAQLVETFVLNQIHLQSVLASKAARVVAAARGRPVVDFGAR
RAHGTDAACKVARTSYLAGAAGTSNLLAARQYGIPTFGTMAHSFVQAFDSEVAAFEAFARLYPATMLLVDTYDTLRGVDH
VIELAKRLGNRFDVRAVRLDSGDLDELSKATRARLDTAGLEQVEIFASSGLDENRIAALLAARCPIDGFGVGTQLVVAQD
APALDMAYKLVAYDGSGRTKFSSGKVIYPGRKQVFRKLEHGVFCGDTLGEHGENLPGDPLLVPIMTNGRRIRQHAPTLDG
ARDWARQQIDALPPELRSLEDTGYSYPVAVSDRIVGELARLRHADTAEAHPGSNVVGAKAKRP
>P9WJI7 6.3.4.21~~~pncB2~~~Nicotinate phosphoribosyltransferase pncB2~~~COG1488
MAIRQHVGALFTDLYEVTMAQAYWAERMSGTAVFEIFFRKLPPGRSYIMAAGLADVVEFLEAFRFDEQDLRYLRGLGQFS
DEFLRWLAGVRFTGDVWAAPEGTVIFPNEPAVQLIAPIIEAQLVETFVLNQIHLQSVLASKAARVVAAARGRPVVDFGAR
RAHGTDAACKVARTSYLAGAAGTSNLLAARQYGIPTFGTMAHSFVQAFDSEVAAFEAFARLYPATMLLVDTYDTLRGVDH
VIELAKRLGNRFDVRAVRLDSGDLDELSKATRARLDTAGLEQVEIFASSGLDENRIAALLAARCPIDGFGVGTQLVVAQD
APALDMAYKLVAYDGSGRTKFSSGKVIYPGRKQVFRKLEHGVFCGDTLGEHGENLPGDPLLVPIMTNGRRIRQHAPTLDG
ARDWARQQIDALPPELRSLEDTGYSYPVAVSDRIVGELARLRHADTAEAHPGSNVVGAKAKRP
>Q9HW26 6.3.4.21~~~pncB2~~~Nicotinate phosphoribosyltransferase 2~~~
MAESAFSERIVQNLLDTDFYKLTMMQAVLHNYPNAEVEWEFRCRNQEDLRLYLPAIREQLEYLAGLAISDEQLAFLERIP
FLAPDFIRFLGLFRFNPRYVQTGIENDEFFLRLKGPWLHVILFEVPLLAMISEVRNRARYPAATVEQARERLQEKFDWLR
REASAEELAGFKMADFGTRRRFSYRVHEAVVSGLKEDFPGCFVGTSNVHLARKLDLKPLGTMAHEWLMAHQQLGPRLIDS
QSAALDCWVREYRGLLGIALTDCITTDAFLRDFDLYFAKLFDGLRHDSGDPLLWAEKTIAHYLKLGIDPLTKTLVFSDGL
DLPRALKIYRALQGRINVSFGIGTHFTCDLPGVEPMNIVVKMSACNGHPVAKISDTPGKAQCRDPDFIHYLKHVFQVA
>Q6F6W1 6.3.4.21~~~pncB~~~Nicotinate phosphoribosyltransferase~~~COG1488
MLSMSPIIHSLLDTDLYKFTMLQVVLHQFPQTHSVYHFRCRNLDETQYPLTDILDDLNEQLDHLCTLKFKDDELQYLRSF
RFIKSDFVDYLELFQLKRRFITAGIDEEGRLDIWVEGPMVQAMMFEIFVLAIVNELYFRRIRSDAVLEEGERRLQAKLAL
LEQYQTQHQSDEPPFLVSDFGTRRRYSFEWQKHVIAAFHHHFPNIFRGTSNVLLAKELNITPIGTMAHEFLQAFQALDVR
LRDFQKAALETWVQEYRGDLGIALTDVVGMDAFLRDFDLYFAKLFDGLRHDSGDPYEWGDKAYAHYRKLKIDTKTKMLTF
SDGLNLEKAWELHQYFKGRFKVSFGIGTNLTNDMGQTPLNIVLKLVECNGQSVAKISDSPGKTMTDNDTFLAYLRQVFQI
AEEEPVA
>Q8UIS9 6.3.4.21~~~pncB~~~Nicotinate phosphoribosyltransferase~~~COG1488
MTKTDIATRVHNHTWKLDPIVRSLIDTDFYKLLMLQMIWKLYPEVDATFSLINRTKTVRLAEEIDEMELREQLDHARTLR
LSKKENIWLAGNTFYGRSQIFEPEFLSWLSSYQLPEYELFKRDGQYELNFHGRWMDTTLWEIPALSIINELRSRSAMRSL
GYFTLDVLYARAKAKMWEKVERLRELPGLRISDFGTRRRHSFLWQRWCVEALKEGIGPAFTGTSNVLLAMDSDLEAVGTN
AHELPMVVAALAQTNEELAAAPYQVLKDWNRLYGGNLLIVLPDAFGTAAFLRNAPEWVADWTGFRPDSAPPIEGGEKIIE
WWRKMGRDPRTKMLIFSDGLDVDAIVDTYRHFEGRVRMSFGWGTNLTNDFAGCAPKTIASLKPISIVCKVSDANGRPAVK
LSDNPQKATGDPAEVERYLKFFGEEDHKEQKVLV
>P18133 6.3.4.21~~~pncB~~~Nicotinate phosphoribosyltransferase~~~COG1488
MTQFASPVLHSLLDTDAYKLHMQQAVFHHYYDVHVAAEFRCRGDDLLGIYADAIREQVQAMQHLRLQDDEYQWLSALPFF
KADYLNWLREFRFNPEQVTVSNDNGKLDIRLSGPWREVILWEVPLLAVISEMVHRYRSPQADVAQALDTLESKLVDFSAL
TAGLDMSRFHLMDFGTRRRFSREVQETIVKRLQQESWFVGTSNYDLARRLSLTPMGTQAHEWFQAHQQISPDLANSQRAA
LAAWLEEYPDQLGIALTDCITMDAFLRDFGVEFASRYQGLRHDSGDPVEWGEKAIAHYEKLGIDPQSKTLVFSDNLDLRK
AVELYRHFSSRVQLSFGIGTRLTCDIPQVKPLNIVIKLVECNGKPVAKLSDSPGKTICHDKAFVRALRKAFDLPHIKKAS
>P22253 6.3.4.21~~~pncB~~~Nicotinate phosphoribosyltransferase~~~
MTQFASPVLHSLLDTDAYKLHMQQAVFHHYYDVQVAAEFRCRGDDLLGIYADAIREQVDAMQHLRLLEDEFQWLSGLPFF
KPDYLNWLREFRYNPAQVCVTNDNGKLNIRLTGPWREVIMWEVPLLAVISELVHHYRSPNAGVDQALDALESKLVDFTAL
TANLDMSRFHLMDFGTRRRFSREVQQAIVKRLQQESWFVGTSNYDLARRLALTPMGTQAHEWFQAHQQISPDLATSQRAA
LAAWLNEYPDQLGIALTDCITMDAFLRDFGIEFASRYQGLRHDSGDPVAWGEKAIAHYEKLGIDPLTKTLVFSDNLDLPK
AVELYRHFASRVQLSFGIGTRLTCDIPQVKPLNIVIKLVECNGKPVAKLSDSPGKTICHDKAFVRALRKAFDLPQVRKAS
>Q9KN67 6.3.4.21~~~pncB~~~Nicotinate phosphoribosyltransferase~~~COG1488
MNPRLFSPHIIRSLLDLDAYKINMMQAIHHFYPDVSVRYELIVRSEEDASGLLDAIRQEIAHLGTLRFSDADIHYLTQHA
PHLKATFLQSLRYFHFVPQEQVEMGIVKQGGKQQLRISIRGSWRDTILYETLVMAIVSEVRSRQRWAEVPADLPLKVLKT
KLDQLKAEIERRGINNFSLTEMGTRRRFSSQVQRDVLACLKQEIPQWVLGTSNYHFAREFDLKPIGTIAHEWFMGHQALV
NERDSQQVALERWLTAFDGMLAIAPTDTLTIDAFLNDFNRHLANAYDGVRHDSGCPFRWGDKMIAHYQQLGIDPTTKLFI
FSDGLDFDQALELCEYFAGRVKISFGIGTFLTNDLANWRNAAGVEYRPLSIVIKLAECQGRPVAKISDQPEKAMCEDPIF
LANLKRRFNIELDVDALIQELRHQKRSPRHYISAA
>Q8ZG93 6.3.4.21~~~pncB~~~Nicotinate phosphoribosyltransferase~~~COG1488
MTQDASPILTSLLDTDAYKLHMQQAVFHHYRHITVAAEFRCRSDELLGVYADEIRHQVTLMGQLALTSDEFIYLSSLPFF
QDDYLHWLRDFRFKPEQVSVAVHDGKLDIRIAGLWCEVIMWEVPLLAVISEIVHRRRSTQVTTDQAVQQLRTKLEQFNAL
SADIDITHFKLMDFGTRRRFSREIQHTVVSTLKDEFPYLVGTSNYDLARTLALAPVGTQAHEWFQAHQQISPTLANSQRV
ALQVWLDEYPNQLGIALTDCITMDAFLRDFDLAFANRYQGLRHDSGDPIEWGEKAIAHYEKLGIDPMKKVLVFSDNLDLE
KALFLYRHFYQRIKLVFGIGTRLTCDIPDVKPLNIVIKLVECNDKPVAKLSDSPGKTICQDPAFVDQLRKAFALPLVKKA
S
>P0A6G3 3.5.1.42~~~pncC~~~Nicotinamide-nucleotide amidohydrolase PncC~~~COG1546
MTDSELMQLSEQVGQALKARGATVTTAESCTGGWVAKVITDIAGSSAWFERGFVTYSNEAKAQMIGVREETLAQHGAVSE
PVVVEMAIGALKAARADYAVSISGIAGPDGGSEEKPVGTVWFAFATARGEGITRRECFSGDRDAVRRQATAYALQTLWQQ
FLQNT
>Q8EK32 3.5.1.42~~~pncC~~~Nicotinamide-nucleotide amidohydrolase PncC~~~COG1058
MKLEMICTGEEVLSGQIVDTNAAWFASTMMEHGIEIQRRVTVGDRLEDLIAVFQERSLHADVILVNGGLGPTSDDMSAEA
MAKAKGESLVENSEWRQRLEDWFTRNNREMPVSNLKQAMLPVSAVMVDNPVGTACGFRVKLNRAWLFFTPGVPFELKHMV
KEQFIPFIRDEFNLDAKVALKKLLTIGHGESALADKIEPLELPEGITIGYRSSMPHIEIKIFARGEKAIALLPRVAGHIK
MVLGTAVVAEDKATLAEEIHFRLLNSGLTLSAAESCTGGMITSQLVDFPGSSSYLQHGLVTYSNESKVRVLGVNPATLDD
HGAVSIPTVEEMAKGARAILDSDFALATSGIAGPDGGTEDKPVGTVAIALATRSGVYSQMIKLPRRSRDLVRSLSAAVAY
DMLRRELLSEAVIVDYQSIGRFSK
>P11902 ~~~pndA~~~Protein PndA~~~
MPQRTFLMMLIVVCVTILCFVWMVRDSLCGFRIEQGNTVLVATLAYEVKR
>P21163 3.5.1.52~~~ngl~~~Peptide-N(4)-(N-acetyl-beta-D-glucosaminyl)asparagine amidase F~~~
MRKLLIFSISAYLMAGIVSCKGVDSATPVTEDRLALNAVNAPADNTVNIKTFDKVKNAFGDGLSQSAEGTFTFPADVTTV
KTIKMFIKNECPNKTCDEWDRYANVYVKNKTTGEWYEIGRFITPYWVGTEKLPRGLEIDVTDFKSLLSGNTELKIYTETW
LAKGREYSVDFDIVYGTPDYKYSAVVPVIQYNKSSIDGVPYGKAHTLGLKKNIQLPTNTEKAYLRTTISGWGHAKPYDAG
SRGCAEWCFRTHTIAINNANTFQHQLGALGCSANPINNQSPGNWAPDRAGWCPGMAVPTRIDVLNNSLTGSTFSYEYKFQ
SWTNNGTNGDAFYAISSFVIAKSNTPISAPVVTN
>Q9AJD6 1.1.3.12~~~pno~~~Pyridoxine 4-oxidase~~~
MAQYDVAIIGAGSAGALIAARLSEDPARNVLLIEAGGRPSDPDILKPSMWPAIQHRSYDWDYKTTPQEGAAGRSFAWARG
KGLGGSSLLHAMGYMRGHPADFAAWAEATGDERWSWEGLLPSFMANEDHVSGGDGIHGKDGPMPVWIPDDEVSPLTQAFM
TAGNALGLPRIPDHNTGQMIGVTPNSLMIRDGRRVTVAEAWLTPEVCARPNLTIMTGTLTRRLKLEKSHVSAIELAGPEG
LATVTASEIILSAGSLESPALLMRSGIGRENVLREAGVTCRVKAPELGLNLMDHLLGAGNLYATKKHLPPSRLQHSESMA
YMRAGDFSAGGQPEIVVGCGVAPIVSESFTAPAPGNAYSFLFGVTHPTSRGEIRITGDAPDSPLIIDPRYLQTQNDRNLF
RAALGAAREIGHRPELAEWRDHEILPKSLAASQDIDTFIAKAVITHHHPSGTCRMGKDEMSVVDADLRLRGLDNLYVVDG
SVLPSLTAGPIHAAVQAIAENFTTGFK
>C1I201 1.14.13.167~~~pnpA~~~Para-nitrophenol 4-monooxygenase~~~
METLDGVVVVGGGPVGLLTALKLGKAGIKVVVLEAEPGVSPSPRAVAYMPPTAAALDRFGLLQDIRKRAVMCPDFAYRHG
NGELIAKMDWSVLSQDTQYPYMLLLGQNHVSNVIFQHLRELPNVEIRWNHRVEEVDQDDAYVTIETSSPGGTSRLRARWL
AATDGARSTVRQKIGLTFDGITWDERLVATNVFYDFSLHGYSRANFVHDPVDWAVVVQLDKTGLWRVCYGEDASLSDAEV
RRRLPERFKRLLPGAPTPDQYRVDHLNPYRVHQRCAAEFRRGRVVLAGDAAHATNPMGGLGLSGGVLDAEHLAEALIAVI
KNGASTKTLDEYSIDRRKVFLEFTSPTATANFTWMKESDPAQRIRDDAMFKEAGTDRAVMRQFLLDLEKLNGRRVIEKKL
KAA
>C1I202 1.6.5.6~~~pnpB~~~p-benzoquinone reductase~~~
MPTKIQIVFYSSYGHIYKMAEAIAAGAREVGDVEVTLLQVPELMPEEVQVKSGIKGYRAAFGSIPYATPEVLAEADAIIF
GTPTRFGNMCSQMRNFLDQTGGLWMSGGLIGKVGSVFTSTASQHGGQETTITSFHTTLLHHGMVIVGVPYSEPGLTNMTE
ISGGTPYGASTLAGADGSRQPSENELQIARFQGKHVATIAKRLANNK
>B0VLR7 2.7.7.8~~~pnp~~~Polyribonucleotide nucleotidyltransferase~~~
MSMFNIVRKEFQFGQHQVVLETGRVARQANTVLITMGGVTVLVAVVAAPTAKAGQDFFPLTVNYQEKQYAAGRIPGGYGK
REGRASEAETLTSRLIDRPIRPLFPEGYYNEIQVTATVVSSDKTMEADIAAMLGTSAALAIAGTPFRGPIGAARVGLING
EYVLNPNFEQMAQSDLDLVVAGTESAVLMVESEAKELSEDQMLGAVLFGHDEMQIAIQAINEFAAAAGAKPSDWVAPAHN
EELRAKLKEAFEAKISEAYTIAVKQDRYAALDALHAEAVAQFVPEEDVDGIADEVDYLFEDLKYRTVRDNILSGKPRIDG
RDTKTVRALDVQVGVLERAHGSALFTRGETQALVTTTLGNTRDALMVDTLAGTKTDNFMLHYNFPAYSVGETGRESGPKR
REIGHGRLARRGVQAVLPAADRFPYVIRIVSDITESNGSSSMASVCGASLSLMDAGVPLKAPVAGIAMGLVKEGERFAVL
SDILGDEDHLGDMDFKVAGSANGITALQMDIKIEGITEEIMEVALNQAFAGRMHILNEMNKVISRARPEISMHAPTFEVI
TINPDKIRDVIGKGGATIRQITEETKAAIDIEDNGTVRVFGETKAAAKAAIAKIQAITAEVEPGKIYDGKVIRIVEFGAF
VNIMPGTDGLLHISQISNERITNVTDVLKEGQEVKVQVQDVDNRGRIKLTMKDIEQA
>P50849 2.7.7.8~~~pnp~~~Polyribonucleotide nucleotidyltransferase~~~COG1185
MGQEKHVFTIDWAGRTLTVETGQLAKQANGAVMIRYGDTAVLSTATASKEPKPLDFFPLTVNYEERLYAVGKIPGGFIKR
EGRPSEKAVLASRLIDRPIRPLFADGFRNEVQVISIVMSVDQNCSSEMAAMFGSSLALSVSDIPFEGPIAGVTVGRIDDQ
FIINPTVDQLEKSDINLVVAGTKDAINMVEAGADEVPEEIMLEAIMFGHEEIKRLIAFQEEIVAAVGKEKSEIKLFEIDE
ELNEKVKALAEEDLLKAIQVHEKHAREDAINEVKNAVVAKFEDEEHDEDTIKQVKQILSKLVKNEVRRLITEEKVRPDGR
GVDQIRPLSSEVGLLPRTHGSGLFTRGQTQALSVCTLGALGDVQILDGLGVEESKRFMHHYNFPQFSVGETGPMRGPGRR
EIGHGALGERALEPVIPSEKDFPYTVRLVSEVLESNGSTSQASICASTLAMMDAGVPIKAPVAGIAMGLVKSGEHYTVLT
DIQGMEDALGDMDFKVAGTEKGVTALQMDIKIEGLSREILEEALQQAKKGRMEILNSMLATLSESRKELSRYAPKILTMT
INPDKIRDVIGPSGKQINKIIEETGVKIDIEQDGTIFISSTDESGNQKAKKIIEDLVREVEVGQLYLGKVKRIEKFGAFV
EIFSGKDGLVHISELALERVGKVEDVVKIGDEILVKVTEIDKQGRVNLSRKAVLREEKEKEEQQS
>Q9AC32 2.7.7.8~~~pnp~~~Polyribonucleotide nucleotidyltransferase~~~COG1185
MFDIKRKTIEWGGKTLVLETGRIARQADGAVLATMGETVVLATAVFAKSQKPGQDFFPLTVNYQEKTFAAGKIPGGFFKR
EGRPSEKETLVSRLIDRPIRPLFVKGFKNEVQVVVTVLQHDLENDPDILGMVAASAALCLSGAPFMGPIGAARVGWVDGA
YVLNPTLDEMKESKMDLVVAGTADAVMMVESEIQELSEEIVLGGVNFAHQQMQAVIDAIIDLAEHAAKEPFAFEPEDTDA
IKAKMKDLVGADIAAAYKIQKKQDRYEAVGAAKKKAIAALGLSDENPTGYDPLKLGAIFKELEADVVRRGILDTGLRIDG
RDVKTVRPILGEVGILPRTHGSALFTRGETQAIVVATLGTGDDEQFIDALEGTYKESFLLHYNFPPYSVGETGRMGSPGR
REIGHGKLAWRALRPMLPTKEDFPYTIRLVSEITESNGSSSMATVCGSSLAMMDAGVPLVRPVSGIAMGLILEQDGFAVL
SDILGDEDHLGDMDFKVAGTSEGLTSLQMDIKIAGITPAIMEQALAQAKEGRAHILGEMNKAMDAPRADVGDFAPKIETI
NIPTDKIREVIGSGGKVIREIVATTGAKVDINDDGVVKVSASDGAKIKAAIDWIKSITDEAEVGKIYDGKVVKVVDFGAF
VNFFGAKDGLVHVSQISNERVAKPSDVLKEGQMVKVKLLGFDDRGKTKLSMKVVDQETGEDLSKKEAAAEEA
>Q83D87 2.7.7.8~~~pnp~~~Polyribonucleotide nucleotidyltransferase~~~COG1185
MNKIRKTFQYGKHEVTFETGEMARQATGAVVVRMGDTVLLVSVVAKKEAEEGRDFFPLTVNYQEKTYAAGKIPGGYFKRE
GRPTEKETLTSRLIDRPLRPLFPKGFTNEVQVIATVLSVDSKVPTDIPAILGASAAIGLSGIPFNGSLGAARVGYRGGEY
LLNPSLDELKDSALDLVVAGTRDAVLMVESEAQELPESVMLGAVLHGHQAMQVAIQAIAEFIQEAGGAKWEWEPPTVNTA
LEKWVVEKSEAPLKKAYQIQEKTARQAQIQAIRDQLLADRAAEREGEENAVNEHELAVIFHELERRIVREQILTGQPRID
GRDTKTVRPITVKVGVLPRSHGSALFTRGETQALVVTTLGTERDAQSIDDLDGDRQEEFIFHYNFPPFCVGEVGFMSGPK
RREIGHGRLAKRAVVPVVPTLDKFPYVIRVVSEILESNGSSSMASVCGSSLALMDAGVPTKAPVAGIAMGLIKENDKYAV
LSDILGDEDHLGDMDFKVAGTSNGVTALQMDIKIEGITKEIMEQALDQAKEGRLHILSIMNKVLDKPRSQVSDLAPQYVT
MKINPEKIRDVIGKGGVVIREITEATNCAIDISDDGTIKIAAHTTEEGEAAKRRIEELTAEVELGKVYEGTVVKITDFGA
FVQILPNTQGLVHISQIAQERVENVRDYLEEGQVIRVKVIEIDRQGRVRLSMKQID
>Q9RSR1 2.7.7.8~~~pnp~~~Polyribonucleotide nucleotidyltransferase~~~COG1185
MIGKTFTTMLGGRELSIETGKLAKLVSGSVTVRYGDTLLLVTAQASDTQSKLDFLPLTVEFEERHYAVGKIPGSFQRREG
RPGEKAILSARITDRQIRPLFPKGYRHETQVIITVLSADGQNAPDVLGPIGAAAALSISDIPWAGPTACVRVGQIDGQYV
VNPTTEQLTRSRMDLVVAGTREAVMMVECGAQTVSEDDLVGAIEFAHAEMQGVIALIEQMRAEVGHEKFNFLAEEGPAND
YVPELTEKAKAAGLRDALLTHGKKDRSARLKALRNGLIEGYVPDPTAEGSAELTQALKDAFGKVEKRELRRLILEENLRA
DGRDSKTVRPIWIEARPLPTAHGSAVFTRGETQVLGVTTLGTERDEILIDDLTAESGDKFLLHYNFPPYSTGEVKRMGGQ
SRREIGHGNLAKRAIRAVLPSFEEFPYVIRVVGDVLESNGSSSMGTVCAGTLSLMDAGVPLKAPVAGVAMGLVMEGDNYR
VLTDILGLEDALGDMDFKVCGTAEGVTALQMDIKVGGITPQIMREALAQAKEGRLHILGKMAEVLAAPRAELSPTAPHIL
SLKINPELIGKVIGPGGKQVRELEAMGAQVTIEEDGTVRIFSASGESAEAVKARIEAVTKEAKVGEEFEGTVVKIAPFGA
FVNLFPGQDGMLHISQLSEQRVENVEDVLTVGDKLKVKIANIDDRGKIDLIRPELEGKVPLREPRAPRGGDRGPRRDSDR
GGDRGPRREFSDRGPRPEGARSERPEGQRTERPATAPATQESSQSSDAPAAPVFPRRED
>P05055 2.7.7.8~~~pnp~~~Polyribonucleotide nucleotidyltransferase~~~COG1185
MLNPIVRKFQYGQHTVTLETGMMARQATAAVMVSMDDTAVFVTVVGQKKAKPGQDFFPLTVNYQERTYAAGRIPGSFFRR
EGRPSEGETLIARLIDRPIRPLFPEGFVNEVQVIATVVSVNPQVNPDIVAMIGASAALSLSGIPFNGPIGAARVGYINDQ
YVLNPTQDELKESKLDLVVAGTEAAVLMVESEAQLLSEDQMLGAVVFGHEQQQVVIQNINELVKEAGKPRWDWQPEPVNE
ALNARVAALAEARLSDAYRITDKQERYAQVDVIKSETIATLLAEDETLDENELGEILHAIEKNVVRSRVLAGEPRIDGRE
KDMIRGLDVRTGVLPRTHGSALFTRGETQALVTATLGTARDAQVLDELMGERTDTFLFHYNFPPYSVGETGMVGSPKRRE
IGHGRLAKRGVLAVMPDMDKFPYTVRVVSEITESNGSSSMASVCGASLALMDAGVPIKAAVAGIAMGLVKEGDNYVVLSD
ILGDEDHLGDMDFKVAGSRDGISALQMDIKIEGITKEIMQVALNQAKGARLHILGVMEQAINAPRGDISEFAPRIHTIKI
NPDKIKDVIGKGGSVIRALTEETGTTIEIEDDGTVKIAATDGEKAKHAIRRIEEITAEIEVGRVYTGKVTRIVDFGAFVA
IGGGKEGLVHISQIADKRVEKVTDYLQMGQEVPVKVLEVDRQGRIRLSIKEATEQSQPAAAPEAPAAEQGE
>A0QVQ5 2.7.7.8~~~pnp~~~Polyribonucleotide nucleotidyltransferase~~~COG1185
MSVVELEDGVYESTAVIDNGSFGTRTIRFETGRLAQQAAGSAVAYLDDETMLLSATTASKNPKDHFDFFPLTVDVEERMY
AAGRIPGSFFRREGRPSTDAILTCRLIDRPLRPSFVDGLRNEIQVVVTVMSLDPKDLYDVLAINAASMSTQLAGLPFSGP
VGGARIALIDGTWVAFPTVEQLERAVFDMVVAGRIVGDGDSADVAIMMVEAEATENVVELVAGGAQAPTEAVVAEGLEAA
KPFIKALCAAQQELADRAAKPAGEYPVFPDYEADVYDAVASVATEALAEALTIAGKTERNDRTDEIKVEVLERLAEPYAG
REKEIGAAFRSLTKKLVRQRILTDHFRIDGRGITDIRALSAEVAVIPRAHGSALFERGETQILGVTTLDMIKMAQQIDSL
GPENTKRYMHHYNFPPYSTGETGRVGSPKRREIGHGALAERALVPVLPSIEEFPYAIRQVSEALGSNGSTSMGSVCASTL
ALLNAGVPLKAPVAGIAMGLVSDDVDVDGKVEKRYVALTDILGAEDAFGDMDFKVAGTKDFVTALQLDTKLDGIPSQVLA
GALSQAKDARLTILDVMAEAIDRPDEMSPYAPRITTIKVPVDKIGEVIGPKGKMINSITEETGAQISIEDDGTVFVGAAD
GLSAQAAIDKINAIANPQLPKVGERFLGTVVKTTDFGAFVSLLPGRDGLVHISKLGKGKRIAKVEDVVKVGDKLRVEIAD
IDNRGKISLVLVAEESAESAESAGDKGAEKAEGAAADVTPAEA
>P9WI57 2.7.7.8~~~pnp~~~Polyribonucleotide nucleotidyltransferase~~~COG1185
MSAAEIDEGVFETTATIDNGSFGTRTIRFETGRLALQAAGAVVAYLDDDNMLLSATTASKNPKEHFDFFPLTVDVEERMY
AAGRIPGSFFRREGRPSTDAILTCRLIDRPLRPSFVDGLRNEIQIVVTILSLDPGDLYDVLAINAASASTQLGGLPFSGP
IGGVRVALIDGTWVGFPTVDQIERAVFDMVVAGRIVEGDVAIMMVEAEATENVVELVEGGAQAPTESVVAAGLEAAKPFI
AALCTAQQELADAAGKSGKPTVDFPVFPDYGEDVYYSVSSVATDELAAALTIGGKAERDQRIDEIKTQVVQRLADTYEGR
EKEVGAALRALTKKLVRQRILTDHFRIDGRGITDIRALSAEVAVVPRAHGSALFERGETQILGVTTLDMIKMAQQIDSLG
PETSKRYMHHYNFPPFSTGETGRVGSPKRREIGHGALAERALVPVLPSVEEFPYAIRQVSEALGSNGSTSMGSVCASTLA
LLNAGVPLKAPVAGIAMGLVSDDIQVEGAVDGVVERRFVTLTDILGAEDAFGDMDFKVAGTKDFVTALQLDTKLDGIPSQ
VLAGALEQAKDARLTILEVMAEAIDRPDEMSPYAPRVTTIKVPVDKIGEVIGPKGKVINAITEETGAQISIEDDGTVFVG
ATDGPSAQAAIDKINAIANPQLPTVGERFLGTVVKTTDFGAFVSLLPGRDGLVHISKLGKGKRIAKVEDVVNVGDKLRVE
IADIDKRGKISLILVADEDSTAAATDAATVTS
>Q9K062 2.7.7.8~~~pnp~~~Polyribonucleotide nucleotidyltransferase~~~
MMFDKHVKTFQYGNQTVTLETGEIARQAAAAVKVSMGDTVVLVAVTTNKEVKEGQDFFPLTVDYLERTYAAGKIPGGFFK
REGKQSEKEILTSRLIDRPIRPLFPEGFYHDIQIVAMVVSVDPEIDSDIPAMLGASAALVLSGVPFAGPIGAARVGYVNG
VYVLNPTKAELAKSQLDLVVAGTSKAVLMVESEAKILPEDVMLGAVVYGHDQMQVAINAINEFADEVNPELWDWKAPETN
EELVAKVRGIAGETIKEAFKIRQKQARSAKLDEAWSAVKEALITEETDTLAANEIKGIFKHLEADVVRSQILDGQPRIDG
RDTRTVRPLNIQTSVLPRTHGSALFTRGETQALAVATLGTSRDEQIIDALSGEYTDRFMLHYNFPPYSTGEVGRMGAPKR
REIGHGRLAKRALLAVLPKPEDFSYTMRVVSEITESNGSSSMASVCGGCLSLLSAGVPLKAHVAGIAMGLILEGNKFAVL
TDILGDEDHLGDMDFKVAGTTEGVTALQMDIKIQGITKEIMQIALAQAKEARLHILDQMKAAVAGPQELSAHAPRLFTMK
INQDKIREVIGKGGETIRSITAETGTEINIAEDGTITIAATTQEAGDAAKKRIEQITAEVEVGKVYEGTVVKILDNNVGA
IVSVMPGKDGLVHISQIAHERVRNVGDYLQVGQVVNVKALEVDDRGRVRLSIKALLDAPAREENAAE
>Q8YP11 2.7.7.8~~~pnp~~~Polyribonucleotide nucleotidyltransferase~~~COG1185
MAEFEKSISFDGRDIRLKVGLLAPQAGGSVLIESGDTAVLVTATRSPGREGIDFLPLTVDYEERLYAAGRIPGGIMRREG
RPPEKTILTSRLIDRPLRPLFPSWLRDDLQVVALTMSMDEQVPPDVLAVTGASIATLIAKIPFNGPMAAVRVGLVGDDFI
INPTYAEIEAGDLDLVVAGSPHGVIMVEAGANQLPERDIIEAIDFGYEAVRDLIKAQLDLVAELGLEIVQEAPPEVDQTL
ENYIRDRASDEIKKILAQFELTKPERDAALDVVKDNIATAIAELPEEDPIRLAATANSKALGNTFKDITKYFMRRQIVED
NVRVDGRKLDQVRPVSSQVGVLPKRVHGSGLFNRGLTQVLSACTLGTPGDAQNLNDDLQTDQSKRYLHHYNFPPFSVGET
KPLRAPGRREIGHGALAERAILPVLPPKEQFPYVIRVVSEVLSSNGSTSMGSVCGSTLALMDAGVPILKPVSGAAMGLIK
EGDEVRVLTDIQGIEDFLGDMDFKVAGTDAGITALQMDMKISGLSLEVIAQAIHQAKDARLHILDKMLQTIDQPRTETSP
YAPRLLTIKIDPDMIGLVIGPGGKTIKGITEETGAKIDIEDDGTVTISAVDENKAKRARNIVQGMTRKLNEGDVYAGRVT
RIIPIGAFVEFLPGKEGMIHISQLADYRVGKVEDEVAVGDEVIVKVREIDNKGRINLTRLGIHPDQAAAAREAAAVNR
>P41121 2.7.7.8~~~pnp~~~Polyribonucleotide nucleotidyltransferase~~~
MLNPIVRKFQYGQHTVTIETGMMARQATAAVMVNMDDTAVFVTVVGQKKVKAGQDFFPLTVNYQERTYAAGRIPGSFFRR
EGRPGEGETLVARLIDRPLRPLFPEGFLNEVRIVATVVSVNPQINPDIVAMIGASAALALSGIPFNGPIGAARVGYINDQ
YVLNPTSDELKNSRLDLVVSGTAGAVLMVESEADLLTEEQMLGAVVFGHDQQQVVIDNINALAAEAGKEKWDWVPEPVNQ
ALHDRVAELAESRLGDAYRITEKQERYAQVDAIKDEVTAALLEQDETLEEAEIHEILGSLEKNVVRSRVLSGEPRIDGRE
KDMVRALDVRTGVLPRTHGSALFTRGETQALVTATLGTERDAQIIDELMGERTDRFLLHYNFPPYSVGETGMMGSPKRRE
IGHGRLAKRGVLAVMPKANEFPYTVRVVSEITESNGSSSMASVCGASLALMDAGVPIKAAVAGIAMGLVKEGDNFVVLSD
ILGDEDHLGDMDFKVAGSCEGISALQMDIKIEGITREIMQVALNQAKGARLHILSVMEQAITTPRDDISQFAPRIHTIKI
NPDKIKDVIGKGGSVIRALTEETGTTIEIEDDGTVKIAATDGEKAKHAISRIEEITAEIEVGRIYAGKVTRIVDFGAFVA
IGGGKEGLVHISQIADKRVEKVADYLQVGQETSVKVLEIDRQGRVRLSIKEATAGTAVEEAPPAPQSAE
>Q3IJ73 2.7.7.8~~~pnp~~~Polyribonucleotide nucleotidyltransferase~~~COG1185
MQAIIKEFQLGQHTVTLETGAIARQADGAVLASIGDTSVLVTVVGKREAQPGQDFFPLTVNYQERMYAAGRIPGGFLKRE
GRPNDGETLIARLIDRPIRPLFPSGFVNEVQVIATVVSVNPEIQPDMVALIGTSAALAISGIPFSGPIGATRVGYIDGEY
VLNPTLKELEESKLDLVVAGTDNAVLMVESEADVLAEDIMLGAVVYGHEQAQAIITAIKEFKAEAGKPTWDWTAPAKNVS
LEEKVASIAADKVGEAYRITDKVARKEALGVAKDEVVAVLTSELAEGESLDKQEIGKIFGSLEKKIVRGRIAAGEKRIDG
REPDMIRALDVMTGVLPRTHGSAIFTRGETQALVTATLGTERDSQLIDDLTGTHKNHFMLNYNFPPFCVGETGFVGSPKR
REIGHGNLAKRGIAAVMPTLTEFPYSIRVVSEITESNGSSSMASVCGTSLALMNAGVPIKASVAGIAMGLVKEDDKFVVL
SDILGDEDHLGDMDFKVAGTAGGITALQMDIKIEGITQEIMQIALKQAKAARLHILEVMDKAISAPSEELSQFAPRIYTM
KIPQKKIAEVIGKGGATIRQLTEETGTTIEIGDDGTIKIAATDGESAANAISRIEQLTAELEVGTIYEGKVVRIVDFGAF
VNILPGKDGLVHISQISTERVNNVTDHLSEGQEVKVKVLEVDRQGRVRLSIKEAMESAAPAADAPTDA
>Q8ZLT3 2.7.7.8~~~pnp~~~Polyribonucleotide nucleotidyltransferase~~~
MLNPIVRKFQYGQHTVTLETGMMARQATAAVMVSMDDTAVFVTVVGQKKAKPGQDFFPLTVNYQERTYAAGRIPGSFFRR
EGRPSEGETLIARLIDRPVRPLFPEGFVNEVQVIATVVSVNPQVNPDIVAMIGASAALSLSGIPFNGPIGAARVGYINDQ
YVLNPTQDELKESKLDLVVAGTEAAVLMVESEAELLSEDTMLGAVVFGHEQQQVVIQAINDLVKEAGKPRWDWQPEAVND
ALNARVAALAESRLSDAYRITDKQERYAQVDVIKSETIEQLIAEDETLDANELGEILHAIEKNVVRSRVLAGEPRIDGRE
KDMIRGLDVRTGVLPRTHGSALFTRGETQALVTATLGTARDAQVLDELMGERTDSFLFHYNFPPYSVGETGMVGSPKRRE
IGHGRLAKRGVLAVMPDMDKFPYTVRVVSEITESNGSSSMASVCGASLALMDAGVPIKAAVAGIAMGLVKEGDNYVVLSD
ILGDEDHLGDMDFKVAGSRDGISALQMDIKIEGITKEIMQVALNQAKGARLHILGVMEQAINAPRGDISEFAPRIHTIKI
STDKIKDVIGKGGSVIRALTEETGTTIEIEDDGTVKIAATDGEKAKYAIRRIEEITAEIEVGRIYNGKVTRIVDFGAFVA
IGGGKEGLVHISQIADKRVEKVTDYLQMGQEVPVKVLEVDRQGRVRLSIKEATEQSQPAAAPEAPASEQAE
>Q2FZ20 2.7.7.8~~~pnp~~~Polyribonucleotide nucleotidyltransferase~~~COG1185
MSQEKKVFKTEWAGRSLTIETGQLAKQANGAVLVRYGDTVVLSTATASKEPRDGDFFPLTVNYEEKMYAAGKIPGGFKKR
EGRPGDDATLTARLIDRPIRPLFPKGYKHDVQIMNMVLSADPDCSPQMAAMIGSSMALSVSDIPFQGPIAGVNVGYIDGK
YIINPTVEEKEVSRLDLEVAGHKDAVNMVEAGASEITEQEMLEAIFFGHEEIQRLVDFQQQIVDHIQPVKQEFIPAERDE
ALVERVKSLTEEKGLKETVLTFDKQQRDENLDNLKEEIVNEFIDEEDPENELLIKEVYAILNELVKEEVRRLIADEKIRP
DGRKPDEIRPLDSEVGILPRTHGSGLFTRGQTQALSVLTLGALGDYQLIDGLGPEEEKRFMHHYNFPNFSVGETGPVRAP
GRREIGHGALGERALKYIIPDTADFPYTIRIVSEVLESNGSSSQASICGSTLALMDAGVPIKAPVAGIAMGLVTREDSYT
ILTDIQGMEDALGDMDFKVAGTKEGITAIQMDIKIDGLTREIIEEALEQARRGRLEIMNHMLQTIDQPRTELSAYAPKVV
TMTIKPDKIRDVIGPGGKKINEIIDETGVKLDIEQDGTIFIGAVDQAMINRAREIIEEITREAEVGQTYQATVKRIEKYG
AFVGLFPGKDALLHISQISKNRIEKVEDVLKIGDTIEVKITEIDKQGRVNASHRALEE
>Q7A5X7 2.7.7.8~~~pnp~~~Polyribonucleotide nucleotidyltransferase~~~
MSQEKKVFKTEWAGRSLTIETGQLAKQANGAVLVRYGDTVVLSTATASKEPRDGDFFPLTVNYEEKMYAAGKIPGGFKKR
EGRPGDDATLTARLIDRPIRPLFPKGYKHDVQIMNMVLSADPDCSPQMAAMIGSSMALSVSDIPFQGPIAGVNVGYIDGK
YIINPTVEEKEVSRLDLEVAGHKDAVNMVEAGASEITEQEMLEAIFFGHEEIQRLVDFQQQIVDHIQPVKQEFIPAERDE
ALVERVKSLTEEKGLKETVLTFDKQQRDENLDNLKEEIVNEFIDEEDPENELLIKEVYAILNELVKEEVRRLIADEKIRP
DGRKPDEIRPLDSEVGILPRTHGSGLFTRGQTQALSVLTLGALGDYQLIDGLGPEEEKRFMHHYNFPNFSVGETGPVRAP
GRREIGHGALGERALKYIIPDTADFPYTIRIVSEVLESNGSSSQASICGSTLALMDAGVPIKAPVAGIAMGLVTREDSYT
ILTDIQGMEDALGDMDFKVAGTKEGITAIQMDIKIDGLTREIIEEALEQARRGRLEIMNHMLQTIDQPRTELSAYAPKVV
TMTIKPDKIRDVIGPGGKKINEIIDETGVKLDIEQDGTIFIGAVDQAMINRAREIIEEITREAEVGQTYQATVKRIEKYG
AFVGLFPGKDALLHISQISKNRIEKVEDVLKIGDTIEVKITEIDKQGRVNASHRALEE
>Q8CST1 2.7.7.8~~~pnp~~~Polyribonucleotide nucleotidyltransferase~~~COG1185
MSQEKKVFKTEWAGRSLTIETGQLAKQANGAVLVRYGDTVVLSTATASKEPRDGDFFPLTVNYEEKMYAAGKIPGGFKKR
EGRPGDEATLTARLIDRPIRPLFPKGYRHDVQIMNIVLSADPDCSPEMAAMIGSSMALSVSDIPFQGPIAGVNVGYIDGK
YVINPSVADKEISRLDLEVAGHKDAVNMVEAGASEITESEMLEAIFFGHEEIKRLVAFQQEIIDHIQPIKQEFVPVERDE
DLVEKVKSLTEDKGLKDTVLTFDKQQRDENLDALKEEVVGHFLDEEDPENETLVKEVYAILNDLIKEEVRRLIADEKIRP
DGRKVDEIRPLESEVGLLPRAHGSGLFTRGQTQALSVLTLGALGDYQLIDGLGPEVEKRFMHHYNFPNFSVGETGPVRAP
GRREIGHGALGERALRYIIPDTQDFPYTIRIVSEVLESNGSSSQASICGSTLALMDAGVPIKAPVAGIAMGLVTRDDSYT
ILTDIQGMEDALGDMDFKVAGTKDGITAIQMDIKIDGLTREVIEEALEQARQGRLAIMDHMLHTIEQPREELSAYAPKVV
TMSINPDKIRDVIGPGGKKINEIIDETGVKLDIEQDGTIFIGAVDQAMINRAKEIIEDITREAEVGQVYHAKVKRIEKYG
AFVELFPGKDALLHISQISQERINKVEDVLKIGDTIEVKITEIDKQGRVNASHKVLEQSKN
>Q53597 2.7.7.8~~~pnp~~~Polyribonucleotide nucleotidyltransferase~~~
MENETHYAEAVIDNGAFGTRTIRFETGRLAKQAAGSAVAYLDDDTMVLSATTASKNPKDQLDFFPLTVDVEERMYAAGKI
PGSFFRREGRPSEDAILTCRLIDRPLRPSFKKGLRNEIQVVATIMALNPDHLYDVVAINAASASTQLAGLPFSGPYGGVR
VALIRGQWVAFPTHTELEDAVFDMVVAGRVLEDGDVAIMMVEAEATEKTVQLVKDGAEAPTEEVVAAGLDAAKPFIKVLC
KAQADLAAKAAKPTGEFPVPSSTTRTTSEALSAAVRPELSAALTIAGKQDREAELDRVKALAAEKLLPEFEGREKEISAA
YRPWPSSSSAERVIKEKKRIDGRGVTDIRTLAAEVEAIPRVHGSALFERGETQILGVTTLNMLRMEQQLDTLSPVTRKPY
MHNYNFPPISVGETGRVGSPKRREIGHGALAERAIVPVLPTREEFPYAIRQVSEALGSNGSTSMGSVCASTMSLLNAGVP
LKAPVAGIAMGLISQEINGETHYVALTDILGAEDAFGDMDFKVAGTKEFVTALQLDTKLDGIPASVLAAALKQARDARLH
ILDVMMEAIDTPDEMSPNAPRIITVKIPVDKIGEVIGPKRQMINQIQEDTGAEITIEDDGTIYIGAADGPAAEAARATIN
GIANPTSPEVGERILGSVVKTTTFGAFVSLLPGKDGLLHISQIRKLAGGKRVENVEDVLGVGQKVQVEIAEIDSRGKLSL
IPVIEGEEAASDEKKDDAEQ
>Q8DWB2 2.7.7.8~~~pnp~~~Polyribonucleotide nucleotidyltransferase~~~COG1185
MSKQVFETIFAGKKLAVEIGQVAKQANGAALVRYGDSTVLSAAVMSKKMSTGDFFPLQINYEEKRYAAGKFPGGFNKREG
RPSTDATLTARLIDRPIRPMFAEGFRNEVQVINTVLSYDADASAPMAAMFGSSLALSISDIPFNGPIAGVQVAYLDGQYV
INPTAEEKKASLLELTVAGTKEAINMVESGAKELSEDIMLEALLKGHEAVRELIAFQEEIIAAVGKEKAEVELLQVDADL
QAEIVGKYNADLQKAVQIEEKKAREIATEAVKEHVTAEYEERYAEHEEHDRIMRDVAEILEQMEHAEVRRLITEDKVRPD
GRRVDEIRPLDAEIDFLPKVHGSGLFTRGQTQALSVLTLAPMGDTQIVDGLDEEYKKRFMHHYNFPQYSVGETGRYGAPG
RREIGHGALGERALAQVLPSLEAFPYAIRLVAEVLESNGSSSQASICAGTLALMAGGVPIKAPVAGIAMGLISDGTNYTV
LTDIQGLEDHFGDMDFKVAGTREGITALQMDIKIEGITPQILEEALAQAKKARFEILDVIEKVIPAPRLELAPTAPKIDT
IKVDVDKIKIVIGKGGETIDKIIEETGVKIDIDEDGNIAIYSSDQEAINRTKEIIASLVREAKVGEIYEAEVVRIEKFGA
FVHLFDKTDALVHISEIAWTRTNKVEDVLAVGDKVTVKVVKVDDKGRIDASMKALLPRPPRSEKSNKEDHQSVRHHGSPK
DDKGKEKYDK
>Q5X9W0 2.7.7.8~~~pnp~~~Polyribonucleotide nucleotidyltransferase~~~
MSKQTFTTTFAGNPLVVEVGQVAKQANGATVVRYGESTVLTAAVMSKKMATGDFFPLQVNYEEKMYAAGKFPGGFMKREG
RPSTDATLTARLIDRPIRPMFAEGFRNEVQVTNTVLSYDENASAPMAAMFGSSLALSISDIPFNGPIAGVQVGYIDGEFI
INPDKEQMEASLLELTVAGSKEAINMVESGAKELSEDIMLEALLKGHQAIQELIAFQEQIVAVVGKEKAEVELLQVDADL
QADIVAKYNAQLQKAVQVEEKKAREAATEAVKEMVKAEYEERYAEDENLATIMRDVAEILEQMEHAEVRRLITEDKIRPD
GRKIDEIRPLDAVVDFLPKVHGSGLFTRGQTQALSILTLAPMGETQIIDGLAPEYKKRFLHHYNFPQYSVGETGRYGAAG
RREIGHGALGERALEQVLPSLEEFPYAIRLVAEVLESNGSSSQASICAGTLALMAGGVPIKAPVAGIAMGLISDGTNYTV
LTDIQGLEDHFGDMDFKVAGTREGITALQMDIKIAGITPQILEEALAQAKKARFEILDVIEATIAEPRPELAPTAPKIDT
IKIDVDKIKVVIGKGGETIDKIIAETGVKIDIDDEGNVSIYSSDQAAINRTKEIIAGLVREAKVGEVYHAKVVRIEKFGA
FVNLFDKTDALVHISEIAWTRTTNVSDVLEVGEDVDVKVIKIDEKGRVDASMKALIPRPPKPEKKEEKHD
>P72659 2.7.7.8~~~pnp~~~Polyribonucleotide nucleotidyltransferase~~~COG1185
MQEFDKSISFDGRDIRLKMGTLAPQAGGSVLIQSGDTAVLVTATRAKGRDGIDFLPLTVDYEGRLYAAGRIPGGFLRREG
RPPEKATLISRLIDRPLRPLFPHWLRDELQIVATTLSMDEEVPPDVLAVTGASVAVILAQIPFKGPMAAVRVGLVGDDFI
INPTYREVHNGDLDLVVAGTPAGIVMVEAGANQLPEQDIIEAIDFGYEAVQDLINAQRELMTDLGITLATSEPPPVNTAV
EEFIANRASKKIITVLGQFDLGKDGRDAALDEIKATEVETAIAELPETDPVKQSVEEDPKLVGNLYKALTKKLMRKQIVD
DGVRVDGRKLEQVRPISCEVGFLPRRVHGSGLFNRGLTQVLSLATLGSPGDAQDLADDLHPEDEKRYLHHYNFPPYSVGE
ARPMRSPGRREIGHGALAERAIIPVLPPQEDFPYVVRVVSEVLSSNGSTSMGSVCGSTLALMDAGVPIKKPVSGAAMGLI
KEGDEIRILTDIQGIEDFLGDMDFKVAGTDSGITALQMDMKIDGLSMEVVSKAIMQALPARLHILDKMLATIREPRPELS
PFAPRLLTLKIEPEHIGMVIGPGGKTIKGITEQTSCKIDIADDGTVTIASSEGERAERARQMIYNMTRKLNEGEVYLGRV
TRIIPIGAFVEVLPGKEGMIHISQLTEGRVGKVEDEVGVGDEVIVKVREIDSKGRLNLTRLGIHPDEAAEARRNASRG
>Q2RSB2 7.1.1.1~~~pntAA~~~NAD(P) transhydrogenase subunit alpha part 1~~~COG3288
MKIAIPKERRPGEDRVAISPEVVKKLVGLGFEVIVEQGAGVGASITDDALTAAGATIASTAAQALSQADVVWKVQRPMTA
EEGTDEVALIKEGAVLMCHLGALTNRPVVEALTKRKITAYAMELMPRISRAQSMDILSSQSNLAGYRAVIDGAYEFARAF
PMMMTAAGTVPPARVLVFGVGVAGLQAIATAKRLGAVVMATDVRAATKEQVESLGGKFITVDDEAMKTAETAGGYAKEMG
EEFRKKQAEAVLKELVKTDIAITTALIPGKPAPVLITEEMVTKMKPGSVIIDLAVEAGGNCPLSEPGKIVVKHGVKIVGH
TNVPSRVAADASPLFAKNLLNFLTPHVDKDTKTLVMKLEDETVSGTCVTRDGAIVHPALTGQGA
>Q9ALA2 7.1.1.1~~~pntA~~~NAD(P) transhydrogenase subunit alpha~~~
MKIGAPREIFEGEARVAMTPDSALQLQKLGHHCVIETGAGMKAGFSDEAYAAAGVEVLPSAAALFEAADIVVKVRGPERA
EAERLRRGQTLISFFWPAQNAELLELCKEKGATVVAMDMVPRISRAQKMDALSSMANIAGYRAVIEAGNNFGRFFTGQVT
AAGKVPPAKVLVVGAGVAGLAAIGTATSLGAITYAFDVRPEVAEQIESMGAEFVYLEFEEAQDGAATGGYAAPSSPEFRE
KQLAKFRELAPEMDIVITTALIPGRPAPKLWTEDMVSAMKRGSVIVDLASERGGNCDLTVPDQKIVTPNGVTIVGYTDFP
SRMAAQASTLYSTNIRHMLTDLTPKKDGVIHHNMEDDVIRGATVTHDGAITFPPPPPKVAAIAAAKPREKVKELTPEEKR
AAEIATFRKQTVSQVAMLAVGTALLLFVGMYAPPSFMAHFIVFALACFVGFQVIWNVSHSLHTPLMAVTNAISGIVILGA
LLQIGSGNVLVVLLAAISVLIATINIVGGFLVTRRMLAMFQKS
>P07001 7.1.1.1~~~pntA~~~NAD(P) transhydrogenase subunit alpha~~~COG3288
MRIGIPRERLTNETRVAATPKTVEQLLKLGFTVAVESGAGQLASFDDKAFVQAGAEIVEGNSVWQSEIILKVNAPLDDEI
ALLNPGTTLVSFIWPAQNPELMQKLAERNVTVMAMDSVPRISRAQSLDALSSMANIAGYRAIVEAAHEFGRFFTGQITAA
GKVPPAKVMVIGAGVAGLAAIGAANSLGAIVRAFDTRPEVKEQVQSMGAEFLELDFKEEAGSGDGYAKVMSDAFIKAEME
LFAAQAKEVDIIVTTALIPGKPAPKLITREMVDSMKAGSVIVDLAAQNGGNCEYTVPGEIFTTENGVKVIGYTDLPGRLP
TQSSQLYGTNLVNLLKLLCKEKDGNITVDFDDVVIRGVTVIRAGEITWPAPPIQVSAQPQAAQKAAPEVKTEEKCTCSPW
RKYALMALAIILFGWMASVAPKEFLGHFTVFALACVVGYYVVWNVSHALHTPLMSVTNAISGIIVVGALLQIGQGGWVSF
LSFIAVLIASINIFGGFTVTQRMLKMFRKN
>P0AB67 7.1.1.1~~~pntB~~~NAD(P) transhydrogenase subunit beta~~~COG1282
MSGGLVTAAYIVAAILFIFSLAGLSKHETSRQGNNFGIAGMAIALIATIFGPDTGNVGWILLAMVIGGAIGIRLAKKVEM
TEMPELVAILHSFVGLAAVLVGFNSYLHHDAGMAPILVNIHLTEVFLGIFIGAVTFTGSVVAFGKLCGKISSKPLMLPNR
HKMNLAALVVSFLLLIVFVRTDSVGLQVLALLIMTAIALVFGWHLVASIGGADMPVVVSMLNSYSGWAAAAAGFMLSNDL
LIVTGALVGSSGAILSYIMCKAMNRSFISVIAGGFGTDGSSTGDDQEVGEHREITAEETAELLKNSHSVIITPGYGMAVA
QAQYPVAEITEKLRARGINVRFGIHPVAGRLPGHMNVLLAEAKVPYDIVLEMDEINDDFADTDTVLVIGANDTVNPAAQD
DPKSPIAGMPVLEVWKAQNVIVFKRSMNTGYAGVQNPLFFKENTHMLFGDAKASVDAILKAL
>Q2RSB4 7.1.1.1~~~pntB~~~NAD(P) transhydrogenase subunit beta~~~COG1282
MTHSLTMAAYIVAGVLFILALRGLSNPESARNGNRMGMVGMAIAILTTLLSPSVQAYAWIVLAIAIGGAIGTVIAKKVLM
TALPQLVAAFHSLVGMAAVLVATGALLNPEAYGIGSAGAIHAGSLVEMSLGLAVGAITFSGSVIAFGKLQGLIAGKPVTF
PMQHPLNAVLGILLVVLLVVFAATESHTAYFALMILAFALGFLLIIPIGGADMPVVISMLNSYSGWAAAGIGFTLGNPLL
IIAGALVGSSGAILSYIMCKGMNRSIFNVILGGFGSEGGVAAAGGAAGDRSVKAGSAEDAAFIMKNASKVIIVPGYGMAV
AQAQHALREMADVLKKEGVEVSYAIHPVAGRMPGHMNVLLAEANVPYDEVFELEEINSSFQTADVAFVIGANDVTNPAAK
TDPSSPIYGMPILDVEKAGTVLFIKRSMASGYAGVENELFFRNNTMMLFGDAKKMTEQIVQAMN
>P0C188 7.1.1.1~~~pntB~~~NAD(P) transhydrogenase subunit beta~~~
MTHSLTMAAYIVAGVLFILALRGLSNPESARNGNRMGMVGMAIAILTTLLSPSVQAYAWIVLAIAIGGAIGTVIAKKVLM
TALPQLVAAFHSLVGMAAVLVATGALLNPEAYGIGSAGAIHAGSLVEMSLGLAVGAITFSGSVIAFGKLQGLIAGKPVTF
PMQHPLNAVLGILLVVLLVVFAATESHTAYFALMILAFALGFLLIIPIGGADMPVVISMLNSYSGWAAAGIGFTLGNPLL
IIAGALVGSSGAILSYIMCKGMNRSIFNVILGGFGSEGGVAAAGGAAGDRSVKAGSAEDAAFIMKNASKVIIVPGYGMAV
AQAQHALREMADVLKKEGVEVSYAIHPVAGRMPGHMNVLLAEANVPYDEVFELEEINSSFQTADVAFVIGANDVTNPAAK
TDPSSPIYGMPILDVEKAGTVLFIKRSMASGYAGVENELFFRNNTMMLFGDAKKMTEQIVQAMN
>E3VWI7 1.14.13.170~~~pntE~~~Pentalenolactone D synthase~~~
MDLEAVREKYRQERDKRGVGRTYQFARGDFSRYARDPYTERREREPLTDEVDVAVVGAGIGGLLTGARLREETGLERIRL
IDEAGDVGGTWYWNRFPGVRCDVESYVYMPLLEEIGTIPTEKYSTGPEIFAHLQRIAHRYGLYRDALFQTTVTELRWDEA
AARWLVSTDRGDLFRARYVAMSIGLMHRPKLPGLPGLETFAGHSFHTSRWDFGYTGGDSTGGLTGLKDKRVGVIGTGSTT
VQLAPHLAEWAERLYIFQRTPAAVDVRGNRPTPPGWADGLDAGWQQRRMENFHALTSGIPQDEDLVQDRWTQTTAELATA
ILPTGDTGGDPKERALAAERADFRKMEELRARIDSVVTDPATAAALKPYYRVYCKRPCFHDGYLQTFNRPNVTLVDTQGQ
GVERLTASGVVANGREYPVDCLIFATGYEHEFAVPYTERAGYDIVGRGGVRLSEKWAQGAHTLHGLQVHGFPNCFILSKV
QAGRHVNIAYMLGEQTRHLAHIVKCVEERGHRVVEASEAGEKEWVEEILRLASGDLDFLENCTPGLYNNEGDPGGLPLLN
SSYGGGSVEFVNILRRWREAGDLAGLELR
>E3VWI3 1.14.19.8~~~pntM~~~Pentalenolactone synthase~~~
MTDLPRLPFDNPDIMGIAPQMLALQKEGPIARVGTAGEDAWLVTRYDEVRTLLADRRLRLSNPNPQPSAKSAARAFMVAL
MAGDDHETEPARHAQMRSLLIPRFSTRRLRLMKTRIEHHVDELLDQLAASAPPVDLHRVLSFRLPTMVVCDLLGVPLADR
ERFGQWARGTFDQSDNEHSANTFQQVVDYMLELVARKRVEPGDDILSELIAEKDGALSDADIAHLGNAVLLFGYETTIVR
IDLGTLLLLRNPVQRAQLAEDPGLAPAAVEEILRLGVGGKGSNALIPRYAHGDITVGETVIRTGDAVMLAIGAANYDDRA
FPDGGLFDLTRVRPRSHLAFGHGARHCIGRTLARIELTAVFERLFRRLPDLRLAVPEESLRWQEHRITGGFDEIPVTF
>P0AFK2 ~~~pnuC~~~Nicotinamide riboside transporter PnuC~~~COG3201
MDFFSVQNILVHIPIGAGGYDLSWIEAVGTIAGLLCIGLASLEKISNYFFGLINVTLFGIIFFQIQLYASLLLQVFFFAA
NIYGWYAWSRQTSQNEAELKIRWLPLPKALSWLAVCVVSIGLMTVFINPVFAFLTRVAVMIMQALGLQVVMPELQPDAFP
FWDSCMMVLSIVAMILMTRKYVENWLLWVIINVISVVIFALQGVYAMSLEYIILTFIALNGSRMWINSARERGSRALSH
>Q57425 ~~~pnuC~~~Nicotinamide riboside transporter PnuC~~~COG3201
MTLAARLKQEFVSGWKPFEVVWLALFIIAQIWAYVQTPDSWLAMISGISGILCVVLVSKGKISNYFFGLIFAYTYFYVAW
GSNFLGEMNTVLYVYLPSQFIGYFMWKANMQNSDGGESVIAKALTVKGWMTLIVVTTVGTLLFVQALQAAGGSSTGLDGL
TTIITVAAQILMILRYREQWLLWIGLNILSIFLWAETPAIYLMYSAYLLNSLYGYYNWTKLVKRTN
>D2ZZC1 ~~~pnuC~~~Nicotinamide riboside transporter PnuC~~~
MQYGMDSFGLRGIPHQVFIKKKEGKIMSLAWWKRELFGGWTHFEAVWLLMFLGIQAVVFVFNPDSWLASVAAVTGILCVV
FVGKGKISNYLFGLISVSLYAYVSYTFKLYGEMMLNLLVYVPVQFVGFAMWRKHMALGETAETEEVKAKALTVRQWLLVV
AASVVGTSVYIEWLHHLGSALPTLDGVTVVVSIVAQVLMILRYREQWALWIVVNILTISLWAVAWFKNGETSLPLLLMYV
MYLCNSVYGYINWTKLVKRHSGQ
>P24520 ~~~pnuC~~~Nicotinamide riboside transporter PnuC~~~
MDFFSTHNILIHIPIGAGGYDLSWIEAVGTIAGLLCIWLASLEKISNYFFGLVNVTLFAIIFFQIQLYASLLLQLFFFAA
NIYGWYAWSRQTKDNQAELKIRWLPLPKAMAWLAICVIAIGLMTRYIDPVFAVLTRVAVAIMQMLGLQVTMPVLQPDAFP
FWDSCMMVLSIVAMILMTRKYVENWLLWVIINVISVVIFALQGVYAMSLEYLILTFIAVNGSRLWINSARERGSRALSR
>Q52185 1.14.12.-~~~pobA~~~Phenoxybenzoate dioxygenase subunit alpha~~~
MSKTIPIVDAQHAGSAYQHVPGHPDPQLSAVAKGTPTGEYLRRYWQPVALSADVTDRPQMVRILGEDLVLFRDKAGRPGL
LYPRCMHRGTSLYYGHVEEAGIRCCYHGWLFAVDGTCLNQPCEPEGGLRREAARQPWYPVEERYGLVFAYMGPPEKKPVL
PRYDILEDLEEGEFIEVISGGFVSYADHVEDPNVPYHWLQNWENIMDPYHVYILHSTFSGIQFAENFKILPRVDFEAVDG
GVIYHAWRDLEDGRQLERINSALFPNISAIPMIDLSPGQGRWIGWHVAVDDQHYRGFFAARTRQPGNFAPIKMHNGKSWT
ELSEQEKQDFPGDFEAQFGQGRVTLHGEEHLATSDHGIALLRRQMKQQIAIVQQGGDPAGVHFNEADALVRIRSGNFYTT
SDKTETAAD
>Q52186 1.-.-.-~~~pobB~~~Phenoxybenzoate dioxygenase subunit beta~~~
MSAAATMAPVSLRIHAIAYGADDVLLFDLRAPARDGLAPFDAGAHIDLRLPRGITRSYSLLNDPAERHRYVIGVKREPES
RGGSAWLHADARVGALIEVDGPSNHFALDESAPHAVFIAGGIGITPLWSMVQRLEHLGTPWTLHYRARSRRGAALLDELA
GHGDRVHLSFSDEGAPSLDLAAIVAAAPEGAHFYCCGPVPMLEAFEAACVGLDPARVHLEYFAAKEAPATEGGFVVHLAR
SGRTIPIAAGCTILDALQAGGVAVPSSCQQGVCGICETAVLAGVPDHRDLVLSDQERAAGRTMMICCSGSKTAELTLDL
>Q43992 ~~~pobR~~~p-hydroxybenzoate hydroxylase transcriptional activator~~~COG1414
MEQHHQYLAHPHSSEEIRTEDYIAGLAKGLALLEAFGIDRQRLNVTQVAERTGISRTAARRYLKTLKFLGYLDTDEHYFW
LTHRVLRFSSSYLSSAHLPKVAQSFLNLLCAQTSLTFSIVVLDEHEVVPVARSYLPQQDNLRVSPYGMHLGNRLPAHATS
TGKVLLSVLDREVQIEWIEKYGLKRLTPYTITDEHTFLETLDAVRQSDYCLSTEEHELGLIAIAVPVLNAQGLTIAALNC
MSQTNRVQPQYLIDQVLPLLRNTANELRNLV
>Q05587 ~~~pocR~~~Regulatory protein PocR~~~
MISASALNSELINKIAQDFAQATGLAVVVVNIHGDEISELFNFTPFCQLMRQHPQHSTRCRMSDRCGGLEASKSDQPCIY
RCHAGLTDFSIPLVIAGHLVGFVLCGQVRLSNDVELVNILNVDDRWQADPELLNEFRNVPEMDYSRVIASADLLKLIVEN
CLKKQLNFVVIKDNPQQSEANKTTRGPTPHDSKMKKALRYIDAHLSDDLRLEDVASHVYLSPYYFSKLFKKYQGIGFNAW
VNRQRMVSARELLCHSDWSIASIARNLGFSQTSYFCKVFRQTYQVTPQAYRQQINENSHPPSL
>Q9ZG88 ~~~podJ~~~Localization factor PodJL~~~
MTAASPWSVKGIDPKAREVAKDLARRSGMTLGEWLNRMIIEGDGQTADPRLAGDDVPNRAYLEIVKDDAPPRIEIAEHPA
DEVGRVALALDRLTQRIEAAEGRNAAAITGIDHSVRDALTRLGASEREQIAVAARFEGAVDELKTEQARATERLRRIESE
AAGPRSAEALRALEGALGKVAGHLYEGEARTREAIATLEAKLNQQSSGDPSALVEAVVARLGERLEAAETRTSDALRELG
ASFQALDQRLGAVETANPATGVQEGLDSLAATLTQKMEAARLEMAAKLRESADGRFDRMERKLGEMAAHVQAAEQRSAQA
IERMGREIVGVADAFNRRVHAAESRNASAIEQVGGEVARIAASVEHKLNRADSVQAQALEKLGGEIARITEKLAERIGSA
ERRNALAIDDVGEQVARVTERLNQRHERSSQELVDRIRQSEERTLRMLEEAREKIDSRLSEAQRKLEAAPPSPPPAQAPA
PVATAQRPVPPAASPFEDNYFSQAASFSTSEDEADAFDAPPAPARSFEVAEFPAAEPEEPAFAHDDYAIADGFEPESPRY
EVEPEVSDFAPAEPSRPMSTRDIIEQARAAARAAAASEGKGGKAKSAKKEKASKASGSLFSGFGGFSTKKSKARLGATVT
TALVVFAAAGALGAGVGGLLLLNTDDGNNSPSRVAQAIAGRKADVEVNGPEADTTPGAPRAAVALTTGKVVPAEVEAPAA
PPTNEAKALFEDAVRKIESGDRSGVELLKRAANGGYPAAQFYLSKMYEGGKNGVKVDMAEARRWSERAANGGDPRAMHNL
ALYYFKGEGGPRNSTTAASWFRKAADMGLVDSQFNLAQLYESGLGVSQNPAEAYKWYVIAGRAGDSTARGRATALRSQLT
AEAQQTADRSALAFRPQTQVQTASLSSAAPAAANANLGVAQRVLSQLGYYQGPRDGVSSPALRMAIAAYQRDQGLPPTGS
VDAETLNRLSVYAR
>B8GXA0 ~~~podJ~~~Localization factor PodJL~~~
MTAASPWSVKGIDPKAREVAKDLARRSGMTLGEWLNRMIIEGDGQTADPRLAGDDVPNRAYLEIVKDDAPPRIEIAEHPA
DEVGRVALALDRLTQRIEAAEGRNAAAITGIDHSVRDALTRLGASEREQIAVAARFEGAVDELKTEQARATERLRRIESE
AAGPRSAEALRALEGALGKVAGHLYEGEARTREAIATLEAKLNQQSSGDPSALVEAVVARLGERLEAAETRTSDALRELG
ASFQALDQRLGAVETANPATGVQEGLDSLAATLTQKMEAARLEMAAKLRESADGRFDRMERKLGEMAAHVQAAEQRSAQA
IERMGREIVGVADAFNRRVHAAESRNASAIEQVGGEVARIAASVEHKLNRADSVQAQALEKLGGEIARITEKLAERIGSA
ERRNALAIDDVGEQVARVTERLNQRHERSSQELVDRIRQSEERTLRMLEEAREKIDSRLSEAQRKLEAAPPSPPPAQAPA
PVATAQRPVPPAASPFEDNYFSQAASFSTSEDEADAFDAPPAPARSFEVAEFPAAEPEEPAFAHDDYAIADGFEPESPRY
EVEPEVSDFAPAEPSRPMSTRDIIEQARAAARAAAASEGKGGKAKSAKKEKASKASGSLFSGFGGFSTKKSKARLGATVT
TALVVFAAAGALGAGVGGLLLLNTDDGNNSPSRVAQAIAGRKADVEVNGPEADTTPGAPRAAVALTTGKVVPAEVEAPAA
PPTNEAKALFEDAVRKIESGDRSGVELLKRAANGGYPAAQFYLSKMYEGGKNGVKVDMAEARRWSERAANGGDPRAMHNL
ALYYFKGEGGPRNSTTAASWFRKAADMGLVDSQFNLAQLYESGLGVSQNPAEAYKWYVIAGRAGDSTARGRATALRSQLT
AEAQQTADRSALAFRPQTQVQTASLSSAAPAAANANLGVAQRVLSQLGYYQGPRDGVSSPALRMAIAAYQRDQGLPPTGS
VDAETLNRLSVYAR
>P94544 ~~~polX~~~DNA polymerase/3'-5' exonuclease PolX~~~COG1387
MHKKDIIRLLETIAVYMELKGDNPFKVSAFRKAAAALEQDDRSLSEMDDMMSLSGIGKGTYSVIKEYIDEGKSSTLESLQ
KEVPEGLVPLLKLPGLGGKKIAKLYKELGVHDAESLKEACEQQKVQGLAGFGKKSEEKILQALGEAGKQPERFPIGYALR
IAREIEEHLSQFTHIIKFSRAGSLRRARETVKDLDYIIATDHPAEVREQLLELPNIKSVIASGDTKVSVILSFEYETSVD
FRLVTEEQFPTTLHHFTGSKDHNIKMRQIAKERGERISEYGVETVETGEIKTFPSEREFYAHFGLPLIPPEIRESGQEVE
TYSDSIELIELGQIKGDLHMHSTWSDGAFSIREMAEACIKKGYQYMAITDHSQYLKVANGLTAERLKQQAKEIDALNAEF
ENFRILKGVEMDILPDGTLDYDDDVLAEMDIVIASIHSSFNQPEHVIMKRLETALTNKHVDIIAHPTGRLIGRRAGYEID
IDQLIELARKTNTALELNANPARLDLRTEHLMKANEQGVTLVINTDAHNIEMLDDMKTGVTAARKGWTETKNVLNARSLK
DVEAFLKRND
>O06873 ~~~pomA~~~Chemotaxis protein PomA~~~COG1291
MDLATLLGLIGGFAFVIMAMVLGGSIGMFVDVTSILIVVGGSIFVVLMKFTMGQFFGATKIAGKAFMFKADEPEDLIAKI
VEMADAARKGGFLALEEMEINNTFMQKGIDLLVDGHDADVVRAALKKDIALTDERHTQGTGVFRAFGDVAPAMGMIGTLV
GLVAMLSNMDDPKAIGPAMAVALLTTLYGAILSNMVFFPIADKLSLRRDQETLNRRLIMDGVLAIQDGQNPRVIDSYLKN
YLNEGKRALEIDE
>Q1DEM0 ~~~pomZ~~~Cell division protein PomZ~~~COG1192
MEAPTYSSKQVAEMLGVSPKQIPEESRKDAYTPDDIWELRTTLDRFPARLGHRRQLFLNFKGGTGKTSLSTSYAWRLAEL
GYAVLLIDLDSQGHATKCLGYEGEDFEKTLLDVLVRKTPLAKVIQKSSLPNLDFVPSNLTMSTVDLALMPMAGREFKLRN
ALKDVEAQYDVVVFDAPPSFGLLNLNALMAANDLFVPVLADFLSFHGLKLLFETVQSLEEDLNHVLDHVFIVVNSFNATF
KLAKEALEALQTHYPEFLLPTIIRQCTKFAQASSEGRPVFVADPSSKGANDIQAMIDNILPRLVAAAAVAQTKGTQQAG
>Q9RBS0 ~~~popA~~~Protein PopA1~~~
MSVGNIQSPSNLPGLQNLNLNTNTNSQQSGQSVQDLIKQVEKDILNIIAALVQKAAQSAGGNTGNTGNAPAKDGNANAGA
NDPSKNDPSKSQGPQSANKTGNVDDANNQDPMQALMQLLEDLVKLLKAALHMQQPGGNDKGNGVGGANGAKGAGGQGGLA
EALQEIEQILAQLGGGGAGAGGAGGGVGGAGGADGGSGAGGAGGANGADGGNGVNGNQANGPQNAGDVNGANGADDGSED
QGGLTGVLQKLMKILNALVQMMQQGGLGGGNQAQGGSKGAGNASPASGANPGANQPGSADDQSSGQNNLQSQIMDVVKEV
VQILQQMLAAQNGGSQQSTSTQPM
>Q1DFT5 3.4.21.-~~~popC~~~Subtilisin-like protease PopC~~~COG1404
MKSYLLVPKESIETQARVGPRGTEQGERVLSRTTALRFAVANKAPDALFALGLRSATLPGARPPVSGQEERRRKGKGAKS
ARTGTRGADSSTPPMPGATVAEQTGAEPGSYRYMPLIGATMAHFYEDHTEKEARGELERDFEFIPDVVPLSFPGPVSAGQ
PGPRNRGMSSLAEREWPDECGVPLAHAQGIRGAGVMLGILDTGVDADHPEHAARVIQFRYVSLFPNSPHNPARDIRGFDP
DGHGTHVCGIAAGVHHGVAPEVDLYVASVIESETIRTSLGRVAAGMEWLLHQFSRPENSTRPAVVNLSLGFPLMPPPGIS
EADYNLNLRALQTMIRRLLDSNVLPVVAAGNSGPDTVGYPAAFPESLAVGAVDFERNVATFSASGTVGRRVVPDIMGYGV
NVYSSTERRCNNQAFYERMSGTSMAAPYVAGIAALYRCRAPDLTALEVRDLILSNAVKLPRSGTHKTGKGLAVFR
>Q1DFT4 ~~~popD~~~PopC secretion inhibitor~~~
MNPGSAPWERRTRERMRAMSRKNGEWGDVRVGGVPGLSARVRPLPGAAGADTQPDWIDVTVMPREEPAAASRRRTSPRPP
VRSRAEVHQAGLAESAQFHQSLMRWLEAHHLLGAVRSVSEPGSMPMLHLRCAPRVLDQLRRAPEFEAGTMMPLDLI
>P80672 ~~~porA~~~Major outer membrane protein~~~COG4773
MKLVKLSLVAALAAGAFSAANATPLEEAIKDVDVSGVLRYRYDTGNFDKNFVNNSNLNNSKQDHKYRAQVNFSAAIADNF
KAFVQFDYNAADGGYGANGIKNDQKGLFVRQLYLTYTNEDVATSVIAGKQQLNLIWTDNAIDGLVGTGVKVVNNSIDGLT
LAAFAVDSFMAAEQGADLLEHSNISTTSNQAPFKVDSVGNLYGAAAVGSYDLAGGQFNPQLWLAYWDQVAFFYAVDAAYS
TTIFDGINWTLEGAYLGNSLDSELDDKTHANGNLFALKGSIEVNGWDASLGGLYYGDKEKASTVVIEDQGNLGSLLAGEE
IFYTTGSRLNGDTGRNIFGYVTGGYTFNETVRVGADFVYGGTKTEAANHLGGGKKLEAVARVDYKYSPKLNFSAFYSYVN
LDQGVNTNESADHSTVRLQALYKF
>C0HJE6 ~~~porA~~~Porin PorA~~~
MKRTLGHALIIIGAALIVIAVLLPTFLVPRLRVIPLDTVSDTITEVRDGTLLDSSQLGKNEPTPNRKNDPRCKAETDEEK
RDLPVHCFINDKTPMQSKRHVEIEEPADEKIATLQVGTTLLRDDRKEPKNLINAILDRITVDRSTAYPVDDPISSVAINA
PQGGSDTKPPTFTRPGIQYQFPFGAQKKSYPYYDVQAMRNFEIDFVGEETQDGVKVYKYSMTIPPQNLYESLTEHFTRDG
RKLTEADKSSLASMRLSFPAHKWGLEGDDDVELDRYYTNVRTVRVEPTSGVIVNGTEEMFMFYAKDDKEAEEIASKAGRE
KEAKEQNRTAMKYTAQWDEGSKGRQMDRAKEARKSLTIGGTVAPWILGILGLVPIVIGFRIRSKSA
>B5CY96 3.2.1.178~~~~~~Beta-porphyranase A~~~COG3401
MSYKYIFLLSAFTLGVPPGIYCQGRNEVVVDYNTRRFLSGVSELDRSKYFNIHSTSDDDKDVGKFLADYQVGLGRKFWGP
YSYAYNKTHEVGKYPQMKPYSGNISVKRYIATEHPYVQHIQGGIDVQAAGAWSAEYYSNSELVPEFFEPLNEPFVHANDA
GFTVQGQAMRELMVDFYASIGKHIHNNPRLNGKMKVIGYAAAYPAWEDGNFNYWNTRMKMFIDRAGAYMDGFSVHLYDGI
NVTGTDTKRSGSNSEAVLDMVEAYSYIKFGHVKPLAISEFGGIDNSKPDDSYDDISSVRSVSSFNHFLFNLMERQDNLFI
SIPFVSDKAEWHITAANNYTSYSAALFIPDNPQNLKNTTWRLNDKKYFFELWKNVKGERVDITSSNPDIQVQAFKDGGRL
YIALDNLDDNPQTVYLNNKNSWKDVSNVTKRSLYVNYNAGIEYTEQNVPSMPESISIVPNQTIVLVADVSSSAFTNSIIR
NKYYSSEYLKPISAGSSLSFPFTGIESGSGRASLRMSIGRPVSASKKPVVKINGTAVSVPDNWKGYGQSNRNIFFGMIEV
PFDIQLLKNGDNNVDITFSDGGGHVSSMILQVEKYTVSTLQNGTFSEGLSAWQPLGNYGTVCVQTDNAGNNVACISGHAG
LMQRVDMESGRTYRFSADVKTEGACKLKVMLQDMSTGTVYTEEFSSPGNYKAVSFDFNSTVKKVVCAIVCERQNDAAWID
NIVLLPQN
>O05651 1.2.7.1~~~porA~~~Pyruvate synthase subunit PorA~~~COG0674
MERVVERVAVTGAEAVANAMRQIEPDVVAAYPITPQTPIVEYFARFVADGVVRTEMIPVESEHSAMSAVVGAAAAGARAM
TATSANGLALMHEIVYIAASYRLPIVMPVVNRALSGPINIHCDHSDAMAERDSGWIQLFAETNQEAYDFTILAVRLAEHE
DVRLPVMVNLDGFILSHGVEPVEFYPDELVKKFVGELKPMYPLLDTEHPVTWGPLDLYDYYFEHKRQQIEAMENVKKVFP
EIAKEFEETFGRKYWFVEPYRMEDAEHVMVALGSTNSTIKYVVDELREEGYKVGSLKIWMFRPFPKEQLQELLNGRKSVV
VLDRAVSFGAEAPLYEAVKSALYEVAARPMLGSYVYGLGGRDIKPEHIRKAFEDAINGNLIADEQRYLGLRE
>D7GXG0 3.2.1.178~~~porA~~~Beta-porphyranase A~~~
MKKVLLFLIFLVSANLSAQLPSPTNGKKWEKVEQLSDEFNGNSIDTNKWYDYHPFWEGRAPSNFKKGNAFVSDGFLNLRS
TLRKEPSSVQDPFKDIWVDAAAAVSKTKAQPGYYYEARFKASSLSMTSSFWFRVGQFSEIDVIEHIGNPSKENRQDDLPY
QYHVNTHYYGKHAGLQPLGTEYKMPGRGRDNFYTYGFWWKSPNELLFYFNGKQVMRIVPRVPLDEELRMIFDTEVFPFAT
AGVANIGLPKPENLRDNSKNTMKVDWVRVYKLVDGTAAEDSSDAPIGSYISLKKTQGDGKFVTGEKDGSQLVARGSTVQS
WEKFKVEKHPKGGITLKANSNGKYVQVQGSDINKPVRAAGDFQGDWEQFEWKSKGNGLVALKNVLTGKWLQAPWTENNAI
IRPKGPVDNGWETFAWKKETSPTASTALSAQLETKTVDGIRVYPSPASETLTIEGVEGENGLRVFDSTGNPVLKKEGILG
RKERLNVSGLIKGNYLLRTGSGEQTWFQKN
>B5CY92 3.2.1.178~~~~~~Beta-porphyranase B~~~COG2273
MRKTVLYLSAASLFLSSYTLKNDKEYSLAEEHIKNLPEAPEGYKWVVNEDYTDEFNGKRLNAAKWHAKSPYWTNGRPPAT
FKAENVSVKKGCLRIINTVLSPTEGLDGKPGDKYRLAGGAVASVKNQAHYGYYETRMKASLTTMSSTFWLSNRPVMKEIM
KGGKKIKTWSSQELDIIETMGIIRSVNPDNPWNKTWNMQMNSNTHYWYQEQGGKRTDNTAKRSDVVSYMTDPSAEDFHTY
GCWWVDANTVKFYYDGKYMYTIKPTTKYTDTPFDRPMFIHIVTETYDWEKQVPTAEDLKDKDKSTTYYDWVRAYKLVPIE
E
>Q51485 ~~~oprB~~~Porin B~~~
MYKNKKTRPAARTVGCLFALGALGLGSAAHAAEAFSPNSKWMLGDWGGKRTELLEKGYDFKLEYVGEAAANLDGGYDDDK
TGRYTDQFALGVHMDLEKILGWKATEFQFTVTERNGKNLSNDRIGDPRAGHISSVQEVWGRGQTWRLTQLWLKQQYFDGA
LDVKFGRFGEGEDFNSFPCDFQNLAFCGSQVGNWAGSIWYNWPVSQWALRVKYNFAPDWYVQVGAYEQNPSNLETGNGFK
MSGSGTKGALLPVELIWQPKVGAEQLPGEYRLGYYYSTAKADDVYDDVDGQPQGLTGNDFKSRGSKHGWWVVAQQQVTSH
NGDASRGLSLFANLTVHDKATNVVDNYQQLGVVYKGPFDARPKDDIGLGIARIHVNDDVKKRQRLVNQVNGIDDYDNPLY
QPLQDTEYNAELYYGVHVTDWLTVRPNLQYIKQPGGVDEVDNALVAGIKIQTVF
>Q56317 1.2.7.1~~~porB~~~Pyruvate synthase subunit PorB~~~COG1013
MPVNIKQLAQEFDKKEIGITQGHRLCPGCGAPITVKFVMMIARHLGYEPVVGLATGCLEVSTSIYPYTAWSVPYIHNAFE
NVAATMSGVETAYKALKNKGKIPEDKKYAFIAFGGDGGTYDIGLQSLSGMLERGHKVLYVLYDNEGYMNTGNQRSGSTPP
GSDTTTAPVGKKLPGKVQLKKNIVEIVAAHENVYAATASLSEPMDFFAKVEKALNFDGPSFLAVFSPCVRFWRVNDDKTV
EISKLAVETKYWPLYEVERGVYRVTRKPRQFKPVEEFLKAQGRFRKLLSRPDAKEIVDELQEYVDRRWERLLTLEEVTKD
KPIR
>D7GXF9 3.2.1.178~~~porB~~~Beta-porphyranase B~~~
MKLSNQFLITITLLITSITFAQEAPHFKPGEDPRQPHQEWKLIENMSDEFEGKKIDEKKWQISGQGWIGRAPGLFLAENI
SLNNGSLQITTTMLPEPIVKNNKTYTHGGGYVGSRNGMTYGYYECEMKANKTFMSSTFWLINEGKDRLGCDKRTTELDIQ
ESVGQITNDADWMKYFDQTMNSNTHSRNIPEGCEYEKGSSKGKAELGGKAYEDFHVYGVWWKSKDEIIFFLDGKMQSKVT
PPADFDIEMYLRMVVETYDWNPVPKDGGMTGSKEDRTTTYNWVRSWQLVDSKN
>O05650 1.2.7.1~~~porC~~~Pyruvate synthase subunit PorC~~~COG1014
MPVAKKYFEIRWHGRAGQGAKSASQMLAEAALEAGKYVQAFPEYGAERTGAPMRAFNRIGDEYIRVRSAVENPDVVVVID
ETLLSPAIVEGLSEDGILLVNTVKDFEFVRKKTGFNGKICVVDATDIALQEIKRGIPNTPMLGALVRVTGIVPLEAIEKR
IEKMFGKKFPQEVIDANKRALRRGYEEVKCSE
>P32722 3.4.21.-~~~oprD~~~Porin D~~~
MKVMKWSAIALAVSAGSTQFAVADAFVSDQAEAKGFIEDSSLDLLLRNYYFNRDGKSGSGDRVDWTQGFLTTYESGFTQG
TVGFGVDAFGYLGLKLDGTSDKTGTGNLPVMNDGKPRDDYSRAGGAVKVRISKTMLKWGEMQPTAPVFAAGGSRLFPQTA
TGFQLQSSEFEGLDLEAGHFTEGKEPTTVKSRGELYATYAGETAKSADFIGGRYAITDNLSASLYGAELEDIYRQYYLNS
NYTIPLASDQSLGFDFNIYRTNDEGKAKAGDISNTTWSLAAAYTLDAHTFTLAYQKVHGDQPFDYIGFGRNGSGAGGDSI
FLANSVQYSDFNGPGEKSWQARYDLNLASYGVPGLTFMVRYINGKDIDGTKMSDNNVGYKNYGYGEDGKHHETNLEAKYV
VQSGPAKDLSFRIRQAWHRANADQGEGDQNEFRLIVDYPLSIL
>Q56316 ~~~porD~~~Pyruvate synthase subunit PorD~~~COG1144
MSLKSWKEIPIGGVIDKPGTAREYKTGAWRVMRPILHKEKCIDCMFCWLYCPDQAIIQEGGIMKGFNYDYCKGCGLCANV
CPKQAIEMRPETEFLSEEG
>P13794 ~~~oprF~~~Outer membrane porin F~~~
MKLKNTLGVVIGSLVAASAMNAFAQGQNSVEIEAFGKRYFTDSVRNMKNADLYGGSIGYFLTDDVELALSYGEYHDVRGT
YETGNKKVHGNLTSLDAIYHFGTPGVGLRPYVSAGLAHQNITNINSDSQGRQQMTMANIGAGLKYYFTENFFAKASLDGQ
YGLEKRDNGHQGEWMAGLGVGFNFGGSKAAPAPEPVADVCSDSDNDGVCDNVDKCPDTPANVTVDANGCPAVAEVVRVQL
DVKFDFDKSKVKENSYADIKNLADFMKQYPSTSTTVEGHTDSVGTDAYNQKLSERRANAVRDVLVNEYGVEGGRVNAVGY
GESRPVADNATAEGRAINRRVEAEVEAEAK
>P37726 ~~~oprF~~~Outer membrane porin F~~~COG2885
MKLKNTLGFAIGSIIAATSFGALAQGQGAVEGELFYKKQYNDSVKHIEDGFNPGARIGYFLTDDLSLNLSYDKTNHTRSN
DGTGSQKIGGDTSSLTAQYHFGQAGVDSLRPYVEGGFGHQSRGNVKADGHSGRDQSTLAIAGAGVKYYFTNNVYARAGVE
ADYALDNGKWDYSALVGLGVNFGGNAGAAAPAPTPAPAPEPTPEPEAPVAQVVRVELDVKFDFDKSVVKPNSYGDVKNLA
DFMAQYPATNVEVAGHTDSIGPDAYNQKLSQRRADRVKQVLVKDGVAPSRITAVGYGESRPVADNATEAGRAVNRRVEAS
VEAQAQ
>P39767 ~~~opmA~~~Porin~~~
EISLNGYGRFGLQYVEDRGVGLEDTIISSRLRINIVGTTETDQGVTFGAKLRMQWDDGDAFAGTAGNAAQFWTSYNGVTV
SVGNVDTAFDSVALTYDSEMGYEASSFGDAQSSFFAYNSKYDASGALDNYNGIAVTYSISGVNLYLSYVDPDQTVDSSLV
TEEFGIAADWSNDMISLAAAYTTDAGGIVDNDIAFVGAAYKFNDAGTVGLNWYDNGLSTAGDQVTLYGNYAFGATTVRAY
VSDIDRAGADTAYGIGADYQFAEGVKVSGSVQSGFANETVADVGVRFDF
>P31243 ~~~~~~Porin~~~
EVKLSGDARMGVMYNGDDWNFSSRSRVLFTMSGTTDSGLEFGASFKAHESVGAETGEDGTVFLSGAFGKIEMGDALGASE
ALFGDLYEVGYTDLDDRGGNDIPYLTGDERLTAEDNPVLLYTYSAGAFSVAASMSDGKVGETSEDDAQEMAVAAAYTFGN
YTVGLGYEKIDSPDTALMADMEQLELAAIAKFGATNVKAYYADGELDRDFARAVFDLTPVAAAATAVDHKAYGLSVDSTF
GATTVGGYVQVLDIDTIDDVTYYGLGASYDLGGGASIVGGIADNDLPNSDMVADLGVKFKF
>P32977 ~~~oprO~~~Porin O~~~
MIRKHSLGFVASALALAVSAQAFAGTVTTDGADIVIKTKGGLEVATTDKEFSFKLGGRLQADYSRFDGFYTKNGNTADAA
YFRRAFIELGGTAYKDWKYQINFDLSHNTGSSDNGYFDEASVTYTGFNPVNLKFGRFDPDFGLEKATSSKWVTAPERNAA
YELADWINTHQDGMGAQVNSTLADMAYLSAGVSAKDADDSDGDSVKQFNFRGVFAPMHEAGNVLHVGVNYAYRDLDDTAF
DSRIRPRLGMRGIATSGGNDAGDNGNRATFGGVSNSPAGSYKDDSVWGLEGAWAMGPFSAQAEYLARKLKADDNAYKDIK
AKGYYAQLAYTLTGESRQYKLEGAKFDSVKPENKEIGAWEVFYRYDNIKVEDDNVVADTATREVGDTKAKAHNLGVNWYV
NDAVKISAAYVKAKTDKITNNNGDDDGDGFVTRLQYVF
>P05695 ~~~oprP~~~Porin P~~~
MIRRHSCKGVGSSVAWSLLGLAISAQSLAGTVTTDGADIVIKTKGGLEVATTDKEFSFKLGGRLQADYGRFDGYYTNNGN
TADAAYFRRAYLEFGGTAYRDWKYQINYDLSRNVGNDSAGYFDEASVTYTGFNPVNLKFGRFYTDFGLEKATSSKWVTAL
ERNLTYDIADWVNDNVGTGIQASSVVGGMAFLSGSVFSENNNDTDGDSVKRYNLRGVFAPLHEPGNVVHLGLQYAYRDLE
DSAVDTRIRPRMGMRGVSTNGGNDAGSNGNRGLFGGSSAVEGLWKDDSVWGLEGAWALGAFSAQAEYLRRTVKAERDRED
LKASGYYAQLAYTLTGEPRLYKLDGAKFDTIKPENKEIGAWELFYRYDSIKVEDDNIVVDSATREVGDAKGKTHTLGVNW
YANEAVKVSANYVKAKTDKISNANGDDSGDGLVMRLQYVF
>P80354 1.1.1.-~~~por~~~Polyol:NADP oxidoreductase~~~COG0246
MITRETLKSLPANVQAPPYDIDGIKPGIVHFGVGNFFRAHEAFYVEQILEHAPDWAIVGVGLTGSDRSKKKAEEFKAQDC
LYSLTETAPSGKSTVRVMGALRDYLLAPADPEAVLKHLVDPAIRIVSMTITEGGYNINETTGAFDLENAAVKADLQNPEK
PSTVFGYVVEALRRRRDAGGKAFTVMSCDNLRHNGNVARKAFLGYAKARDPELAKWIEENATFPNGMVDRITPTVSAEIA
KKLNAASGLDDDLPLVAEDFHQWVLEDRFANGRPPLEKAGVQLVDDVTDWEHVKIRMLNAGHITLCFPGILVGYENVDDA
IEDKDLRGNLENYLNKDVIPTLKAPPGMTLEGYRDSVISRFSNKAMSDQTLRIASDGCSKIQVFWTETVRRAIECKRDLS
RIAFGIASYLEMLRGRDEKGGTYESSEPTYGEAQKKLAKADDFESALKLPAFDGWRDLDTSELDQKVIALRKVIREKGVK
AAIPA
>Q59987 1.3.1.33~~~por~~~Light-dependent protochlorophyllide reductase~~~COG1028
MEQPMKPTVIITGASSGVGLYGAKALIDKGWHVIMACRNLDKTQKVADELGFPKDSYTIIKLDLGYLDSVRRFVAQFREL
GRPLKALVCNAAVYFPLLDEPLWSADDYELSVATNHLGHFLLCNLLLEDLKACPDADKRLIILGTVTANSKELGGKIPIP
APPDLGNFEGFEAGFKKPIAMINNKKFKSGKAYKDSKLCNMLTTRELHRRFHQETGIVFNSLYPGCVADTPLFRNHYSLF
RTIFPWFQKNVTKGYVSQELAGERVAMVVADDKFKDSGVHWSWGNRQQAGREAFVQELSEQGSDAQKAQRMWDLSEKLVG
LV
>P69874 7.6.2.11~~~potA~~~Spermidine/putrescine import ATP-binding protein PotA~~~COG3842
MGQSKKLNKQPSSLSPLVQLAGIRKCFDGKEVIPQLDLTINNGEFLTLLGPSGCGKTTVLRLIAGLETVDSGRIMLDNED
ITHVPAENRYVNTVFQSYALFPHMTVFENVAFGLRMQKTPAAEITPRVMEALRMVQLETFAQRKPHQLSGGQQQRVAIAR
AVVNKPRLLLLDESLSALDYKLRKQMQNELKALQRKLGITFVFVTHDQEEALTMSDRIVVMRDGRIEQDGTPREIYEEPK
NLFVAGFIGEINMFNATVIERLDEQRVRANVEGRECNIYVNFAVEPGQKLHVLLRPEDLRVEEINDDNHAEGLIGYVRER
NYKGMTLESVVELENGKMVMVSEFFNEDDPDFDHSLDQKMAINWVESWEVVLADEEHK
>Q7A679 7.6.2.11~~~potA~~~Spermidine/putrescine import ATP-binding protein PotA~~~
MEPLLSLKSVSKSYDDLNILDDIDIDIESGYFYTLLGPSGCGKTTILKLIAGFEYPDSGEVIYQNKPIGNLPPNKRKVNT
VFQDYALFPHLNVYDNIAFGLKLKKLSKTEIDQKVTEALKLVKLSGYEKRNINEMSGGQKQRVAIARAIVNEPEILLLDE
SLSALDLKLRTEMQYELRELQSRLGITFIFVTHDQEEALALSDFLFVLKDGKIQQFGTPTDIYDEPVNRFVADFIGESNI
VEGRMVRDYVVNIYGQDFECVDMGIPENKKVEVVIRPEDISLIKAEEGLFKATVDSMLFRGVHYEICCIDNKGYEWVIQT
TKKAEVGSEVGLYFDPEAIHIMVPGETEEEFDKRIESYEEVDNA
>P0AFK4 ~~~potB~~~Spermidine/putrescine transport system permease protein PotB~~~COG1176
MIVTIVGWLVLFVFLPNLMIIGTSFLTRDDASFVKMVFTLDNYTRLLDPLYFEVLLHSLNMALIATLACLVLGYPFAWFL
AKLPHKVRPLLLFLLIVPFWTNSLIRIYGLKIFLSTKGYLNEFLLWLGVIDTPIRIMFTPSAVIIGLVYILLPFMVMPLY
SSIEKLDKPLLEAARDLGASKLQTFIRIIIPLTMPGIIAGCLLVMLPAMGLFYVSDLMGGAKNLLIGNVIKVQFLNIRDW
PFGAATSITLTIVMGLMLLVYWRASRLLNKKVELE
>P0AFK9 ~~~potD~~~Spermidine/putrescine-binding periplasmic protein~~~COG0687
MKKWSRHLLAAGALALGMSAAHADDNNTLYFYNWTEYVPPGLLEQFTKETGIKVIYSTYESNETMYAKLKTYKDGAYDLV
VPSTYYVDKMRKEGMIQKIDKSKLTNFSNLDPDMLNKPFDPNNDYSIPYIWGATAIGVNGDAVDPKSVTSWADLWKPEYK
GSLLLTDDAREVFQMALRKLGYSGNTTDPKEIEAAYNELKKLMPNVAAFNSDNPANPYMEGEVNLGMIWNGSAFVARQAG
TPIDVVWPKEGGIFWMDSLAIPANAKNKEGALKLINFLLRPDVAKQVAETIGYPTPNLAARKLLSPEVANDKTLYPDAET
IKNGEWQNDVGAASSIYEEYYQKLKAGR
>P0AAF1 ~~~potE~~~Putrescine transporter PotE~~~COG0531
MSQAKSNKMGVVQLTILTMVNMMGSGIIMLPTKLAEVGTISIISWLVTAVGSMALAWAFAKCGMFSRKSGGMGGYAEYAF
GKSGNFMANYTYGVSLLIANVAIAISAVGYGTELLGASLSPVQIGLATIGVLWICTVANFGGARITGQISSITVWGVIIP
VVGLCIIGWFWFSPTLYVDSWNPHHAPFFSAVGSSIAMTLWAFLGLESACANTDVVENPERNVPIAVLGGTLGAAVIYIV
STNVIAGIVPNMELANSTAPFGLAFAQMFTPEVGKVIMALMVMSCCGSLLGWQFTIAQVFKSSSDEGYFPKIFSRVTKVD
APVQGMLTIVIIQSGLALMTISPSLNSQFNVLVNLAVVTNIIPYILSMAALVIIQKVANVPPSKAKVANFVAFVGAMYSF
YALYSSGEEAMLYGSIVTFLGWTLYGLVSPRFELKNKHG
>P31133 ~~~potF~~~Putrescine-binding periplasmic protein PotF~~~COG0687
MTALNKKWLSGLVAGALMAVSVGTLAAEQKTLHIYNWSDYIAPDTVANFEKETGIKVVYDVFDSNEVLEGKLMAGSTGFD
LVVPSASFLERQLTAGVFQPLDKSKLPEWKNLDPELLKLVAKHDPDNKFAMPYMWATTGIGYNVDKVKAVLGENAPVDSW
DLILKPENLEKLKSCGVSFLDAPEEVFATVLNYLGKDPNSTKADDYTGPATDLLLKLRPNIRYFHSSQYINDLANGDICV
AIGWAGDVWQASNRAKEAKNGVNVSFSIPKEGAMAFFDVFAMPADAKNKDEAYQFLNYLLRPDVVAHISDHVFYANANKA
ATPLVSAEVRENPGIYPPADVRAKLFTLKVQDPKIDRVRTRAWTKVKSGK
>P31134 7.6.2.16~~~potG~~~Putrescine transport ATP-binding protein PotG~~~COG3842
MNDAIPRPQAKTRKALTPLLEIRNLTKSYDGQHAVDDVSLTIYKGEIFALLGASGCGKSTLLRMLAGFEQPSAGQIMLDG
VDLSQVPPYLRPINMMFQSYALFPHMTVEQNIAFGLKQDKLPKAEIASRVNEMLGLVHMQEFAKRKPHQLSGGQRQRVAL
ARSLAKRPKLLLLDEPMGALDKKLRDRMQLEVVDILERVGVTCVMVTHDQEEAMTMAGRIAIMNRGKFVQIGEPEEIYEH
PTTRYSAEFIGSVNVFEGVLKERQEDGLVLDSPGLVHPLKVDADASVVDNVPVHVALRPEKIMLCEEPPANGCNFAVGEV
IHIAYLGDLSVYHVRLKSGQMISAQLQNAHRHRKGLPTWGDEVRLCWEVDSCVVLTV
>P31135 ~~~potH~~~Putrescine transport system permease protein PotH~~~COG1176
MSTLEPAAQSKPPGGFKLWLSQLQMKHGRKLVIALPYIWLILLFLLPFLIVFKISLAEMARAIPPYTELMEWADGQLSIT
LNLGNFLQLTDDPLYFDAYLQSLQVAAISTFCCLLIGYPLAWAVAHSKPSTRNILLLLVILPSWTSFLIRVYAWMGILKN
NGVLNNFLLWLGVIDQPLTILHTNLAVYIGIVYAYVPFMVLPIYTALIRIDYSLVEAALDLGARPLKTFFTVIVPLTKGG
IIAGSMLVFIPAVGEFVIPELLGGPDSIMIGRVLWQEFFNNRDWPVASAVAIIMLLLLIVPIMWFHKHQQKSVGEHG
>P0AFL1 ~~~potI~~~Putrescine transport system permease protein PotI~~~COG1177
MNNLPVVRSPWRIVILLLGFTFLYAPMLMLVIYSFNSSKLVTVWAGWSTRWYGELLRDDAMMSAVGLSLTIAACAATAAA
ILGTIAAVVLVRFGRFRGSNGFAFMITAPLVMPDVITGLSLLLLFVALAHAIGWPADRGMLTIWLAHVTFCTAYVAVVIS
SRLRELDRSIEEAAMDLGATPLKVFFVITLPMIMPAIISGWLLAFTLSLDDLVIASFVSGPGATTLPMLVFSSVRMGVNP
EINALATLILGAVGIVGFIAWYLMARAEKQRIRDIQRARRG
>P07003 1.2.5.1~~~poxB~~~Pyruvate dehydrogenase [ubiquinone]~~~COG0028
MKQTVAAYIAKTLESAGVKRIWGVTGDSLNGLSDSLNRMGTIEWMSTRHEEVAAFAAGAEAQLSGELAVCAGSCGPGNLH
LINGLFDCHRNHVPVLAIAAHIPSSEIGSGYFQETHPQELFRECSHYCELVSSPEQIPQVLAIAMRKAVLNRGVSVVVLP
GDVALKPAPEGATMHWYHAPQPVVTPEEEELRKLAQLLRYSSNIALMCGSGCAGAHKELVEFAGKIKAPIVHALRGKEHV
EYDNPYDVGMTGLIGFSSGFHTMMNADTLVLLGTQFPYRAFYPTDAKIIQIDINPASIGAHSKVDMALVGDIKSTLRALL
PLVEEKADRKFLDKALEDYRDARKGLDDLAKPSEKAIHPQYLAQQISHFAADDAIFTCDVGTPTVWAARYLKMNGKRRLL
GSFNHGSMANAMPQALGAQATEPERQVVAMCGDGGFSMLMGDFLSVVQMKLPVKIVVFNNSVLGFVAMEMKAGGYLTDGT
ELHDTNFARIAEACGITGIRVEKASEVDEALQRAFSIDGPVLVDVVVAKEELAIPPQIKLEQAKGFSLYMLRAIISGRGD
EVIELAKTNWLR
>P37063 1.2.3.3~~~pox5~~~Pyruvate oxidase~~~COG0028
MVMKQTKQTNILAGAAVIKVLEAWGVDHLYGIPGGSINSIMDALSAERDRIHYIQVRHEEVGAMAAAADAKLTGKIGVCF
GSAGPGGTHLMNGLYDAREDHVPVLALIGQFGTTGMNMDTFQEMNENPIYADVADYNVTAVNAATLPHVIDEAIRRAYAH
QGVAVVQIPVDLPWQQIPAEDWYASANSYQTPLLPEPDVQAVTRLTQTLLAAERPLIYYGIGARKAGKELEQLSKTLKIP
LMSTYPAKGIVADRYPAYLGSANRVAQKPANEALAQADVVLFVGNNYPFAEVSKAFKNTRYFLQIDIDPAKLGKRHKTDI
AVLADAQKTLAAILAQVSERESTPWWQANLANVKNWRAYLASLEDKQEGPLQAYQVLRAVNKIAEPDAIYSIDVGDINLN
ANRHLKLTPSNRHITSNLFATMGVGIPGAIAAKLNYPERQVFNLAGDGGASMTMQDLATQVQYHLPVINVVFTNCQYGFI
KDEQEDTNQNDFIGVEFNDIDFSKIADGVHMQAFRVNKIEQLPDVFEQAKAIAQHEPVLIDAVITGDRPLPAEKLRLDSA
TSSAADIEAFKQRYEAQDLQPLSTYLKQFGLDDLQHQIGQGGF
>Q5XAP6 3.1.3.16~~~~~~Putative protein phosphatase 2C-type~~~
MKISLKTDIGQKRSNNQDFINKFDNKKGITLVILADGMGGHRAGNIASEMTVTDLGREWVKTDFTELSQIRDWLFETIQS
ENQRIYDLGQSEDFKGMGTTVEAVALVESSAIYAHIGDSRIGLVHDGHYTLLTSDHSLVNELVKAGQITEEEAASHPQRN
IITQSIGQASPVEPDLGVRVLEPGDYLVINSDGLTNMISNDEIVTILGSKVSLDEKNQEMIDLANLRGGLDNITIALVHN
ESEDVE
>P29430 ~~~pedA~~~Bacteriocin pediocin PA-1~~~
MKKIEKLTEKEMANIIGGKYYGNGVTCGKHSCSVDWGKATTCIINNGAMAWATGGHQGNHKC
>P37487 3.6.1.1~~~ppaC~~~Manganese-dependent inorganic pyrophosphatase~~~COG1227
MEKILIFGHQNPDTDTICSAIAYADLKNKLGFNAEPVRLGQVNGETQYALDYFKQESPRLVETAANEVNGVILVDHNERQ
QSIKDIEEVQVLEVIDHHRIANFETAEPLYYRAEPVGCTATILNKMYKENNVKIEKEIAGLMLSAIISDSLLFKSPTCTD
QDVAAAKELAEIAGVDAEEYGLNMLKAGADLSKKTVEELISLDAKEFTLGSKKVEIAQVNTVDIEDVKKRQAELEAVISK
VVAEKNLDLFLLVITDILENDSLALAIGNEAAKVEKAFNVTLENNTALLKGVVSRKKQVVPVLTDAMAE
>Q9RRB7 3.6.1.1~~~ppaC~~~Probable manganese-dependent inorganic pyrophosphatase~~~COG1227
MLAVFGHLNPDTDAISAAMVYARLLTRQGTEAQAYRLGEPNFETAYVLRELGLEAPPLLTELPAGSKVALVDHNESAQSL
PALGELDVTRVVDHHKLGDLTTINPPYLRFEPVGCTGTILLKLHREAGLSVEPQDAKLMLSAILSDTLHFRSPTTTQDDR
DAVAFLAPVAGVNDVEAYALAMFAAKSDLGNTPAETLLRMDYKVFPFGDPVQPQNWGIGVIETTNPAYVFGRQQELLAAM
DQVKAEDTLSGMLLSVVDILNETNRTLVLGATEAKVLREAFGAEAEGQVADLGNRISRKKQIVPTLEKYFAPEA
>P65752 3.6.1.1~~~ppaC~~~Probable manganese-dependent inorganic pyrophosphatase~~~
MAKTYIFGHKNPDTDAISSAIIMAEFEQLRGNSGAKAYRLGDVSAETQFALDTFNVPAPELLTDDLDGQDVILVDHNEFQ
QSSDTIASATIKHVIDHHRIANFETAGPLCYRAEPVGCTATILYKMFRERGFEIKPEIAGLMLSAIISDSLLFKSPTCTQ
QDVKAAEELKDIAKVDIQKYGLDMLKAGASTTDKSVEFLLNMDAKSFTMGDYVTRIAQVNAVDLDEVLNRKEDLEKEMLA
VSAQEKYDLFVLVVTDIINSDSKILVVGAEKDKVGEAFNVQLEDDMAFLSGVVSRKKQIVPQITEALTK
>P65753 3.6.1.1~~~ppaC~~~Probable manganese-dependent inorganic pyrophosphatase~~~
MAKTYIFGHKNPDTDAISSAIIMAEFEQLRGNSGAKAYRLGDVSAETQFALDTFNVPAPELLTDDLDGQDVILVDHNEFQ
QSSDTIASATIKHVIDHHRIANFETAGPLCYRAEPVGCTATILYKMFRERGFEIKPEIAGLMLSAIISDSLLFKSPTCTQ
QDVKAAEELKDIAKVDIQKYGLDMLKAGASTTDKSVEFLLNMDAKSFTMGDYVTRIAQVNAVDLDEVLNRKEDLEKEMLA
VSAQEKYDLFVLVVTDIINSDSKILVVGAEKDKVGEAFNVQLEDDMAFLSGVVSRKKQIVPQITEALTK
>Q8DYS6 3.6.1.1~~~ppaC~~~Probable manganese-dependent inorganic pyrophosphatase~~~
MSKILVFGHQNPDSDAIGSSVAFAYLAKEAWGLDTEAVALGTPNEETAYVLDYFGVQAPRVVESAKAEGVETVILTDHNE
FQQSISDIKDVTVYGVVDHHRVANFETANPLYMRLEPVGSASSIVYRMFKENGVSVPKELAGLLLSGLISDTLLLKSPTT
HASDIPVAKELAELAGVNLEEYGLEMLKAGTNLSSKTAAELIDIDAKTFELNGEAVRVAQVNTVDINDILARQEEIEVAI
QEAIVTEGYSDFVLMITDIVNSNSEILALGSNMAKVEAAFEFTLENNHAFLAGAVSRKKQVVPQLTESYNA
>P95765 3.6.1.1~~~ppaC~~~Probable manganese-dependent inorganic pyrophosphatase~~~COG1227
MSKILVFGHQNPDSDAIGSSYAFAYLAREAYGLDTEAVALGEPNEETAFVLDYFGVAAPRVITSAKAEGAEQVILTDHNE
FQQSVADIAEVEVYGVVDHHRVANFETANPLYMRLEPVGSASSIVYRMFKEHSVAVSKEIAGLMLSGLISDTLLLKSPTT
HPTDKAIAPELAELAGVNLEEYGLAMLKAGTNLASKSAEELIDIDAKTFELNGNNVRVAQVNTVDIAEVLERQAEIEAAI
EKAIADNGYSDFVLMITDIINSNSEILAIGSNMDKVEAAFNFVLENNHAFLAGAVSRKKQVVPQLTESFNA
>O68579 3.6.1.1~~~ppaC~~~Probable manganese-dependent inorganic pyrophosphatase~~~COG1227
MSKILVFGHQNPDSDAIGSSMAYAYLKRQLGVDAQAVALGNPNEETAFVLDYFGIQAPPVVKSAQAEGAKQVILTDHNEF
QQSIADIREVEVVEVVDHHRVANFETANPLYMRLEPVGSASSIVYRLYKENGVAIPKEIAGVMLSGLISDTLLLKSPTTH
ASDPAVAEDLAKIAGVDLQEYGLAMLKAGTNLASKTAAQLVDIDAKTFELNGSQVRVAQVNTVDINEVLERQNEIEEAIK
ASQAANGYSDFVLMITDILNSNSEILALGNNTDKVEAAFNFTLKNNHAFLAGAVSRKKQVVPQLTESFNG
>P65756 3.6.1.1~~~ppaC~~~Probable manganese-dependent inorganic pyrophosphatase~~~COG1227
MSKILVFGHQNPDSDAIGSSVAFAYLAKEAYGLDTEAVALGTPNEETAFVLNYFGVEAPRVITSAKAEGAEQVILTDHNE
FQQSVSDIAEVEVYGVVDHHRVANFETASPLYMRLEPVGSASSIVYRMFKEHGVAVPKEIAGLMLSGLISDTLLLKSPTT
HPTDKIIAPELAELAGVNLEEYGLAMLKAGTNLASKSAEELIDIDAKTFELNGNNVRVAQVNTVDIAEVLERQAEIEAAM
QAANESNGYSDFVLMITDIVNSNSEILALGANMDKVEAAFNFKLENNHAFLAGAVSRKKQVVPQLTESFNA
>Q988B8 2.6.1.30~~~ppaT~~~Pyridoxamine--pyruvate transaminase~~~COG0075
MMRYPEHADPVITLTAGPVNAYPEVLRGLGRTVLYDYDPAFQLLYEKVVDKAQKAMRLSNKPVILHGEPVLGLEAAAASL
ISPDDVVLNLASGVYGKGFGYWAKRYSPHLLEIEVPYNEAIDPQAVADMLKAHPEITVVSVCHHDTPSGTINPIDAIGAL
VSAHGAYLIVDAVSSFGGMKTHPEDCKADIYVTGPNKCLGAPPGLTMMGVSERAWAKMKANPLAPRASMLSIVDWENAWS
RDKPFPFTPSVSEINGLDVALDLYLNEGPEAVWARHALTAKAMRAGVTAMGLSVWAASDSIASPTTTAVRTPDGVDEKAL
RQAARARYGVVFSSGRGETLGKLTRIGHMGPTAQPIYAIAALTALGGAMNAAGRKLAIGKGIEAALAVIDADA
>P60120 2.3.1.-~~~~~~Putative pyridoxal phosphate-dependent acyltransferase~~~
MVQSLHEFLEENINYLKENGLYNEIDTIEGANGPEIKINGKSYINLSSNNYLGLATNEDLKSAAKAAIDTHGVGAGAVRT
INGTLDLHDELEETLAKFKGTEAAIAYQSGFNCNMAAISAVMNKNDAILSDELNHASIIDGCRLSKAKIIRVNHSDMDDL
RAKAKEAVESGQYNKVMYITDGVFSMDGDVAKLPEIVEIAEEFGLLTYVDDAHGSGVMGKGAGTVKHFGLQDKIDFQIGT
LSKAIGVVGGYVAGTKELIDWLKAQSRPFLFSTSLAPGDTKAITEAVKKLMDSTELHDKLWDNAQYLKNGLSKLGYDTGE
SETPITPVIIGDEKTTQEFSKRLKDEGVYVKSIVFPTVPRGTGRVRNMPTAAHTKDMLDEAIAAYEKVGKEMKLI
>Q9JMQ2 3.6.1.1~~~ppaX~~~Pyrophosphatase PpaX~~~COG0546
MSDKQVTTILFDLDGTLINTNELIIASFLHTLEHYYPSKYKREDVLAFIGPSLFDTFSSMDPDKCEDMIAMYRAYNHDMH
DSLVTEYETVYETLDALKKAGFTLGIVTTKLRDTVNMGLKLTGIGEFFETVVTLDDVTNAKPDPEPVLLALKQLGSEPAE
AIMVGDNYHDVLAGKNAGTKTAGVAWTIKGPEMLAKHEPDFMLEKMSDLLQIVGVK
>P07102 3.1.3.-~~~appA~~~Phytase AppA~~~
MKAILIPFLSLLIPLTPQSAFAQSEPELKLESVVIVSRHGVRAPTKATQLMQDVTPDAWPTWPVKLGWLTPRGGELIAYL
GHYQRQRLVADGLLAKKGCPQSGQVAIIADVDERTRKTGEAFAAGLAPDCAITVHTQADTSSPDPLFNPLKTGVCQLDNA
NVTDAILSRAGGSIADFTGHRQTAFRELERVLNFPQSNLCLKREKQDESCSLTQALPSELKVSADNVSLTGAVSLASMLT
EIFLLQQAQGMPEPGWGRITDSHQWNTLLSLHNAQFYLLQRTPEVARSRATPLLDLIKTALTPHPPQKQAYGVTLPTSVL
FIAGHDTNLANLGGALELNWTLPGQPDNTPPGGELVFERWRRLSDNSQWIQVSLVFQTLQQMRDKTPLSLNTPPGEVKLT
LAGCEERNAQGMCSLAGFTQIVNEARIPACSL
>P19405 3.1.3.1~~~phoB~~~Alkaline phosphatase 3~~~COG1785
MKKFPKKLLPIAVLSSIAFSSLASGSVPEASAQEKKKGNQDEIKNVIVLIGDGMGVSYTSAYRYLKDNKKTKVVEPTAFD
QYLVGQQTTYPDDPEQNVTDSAAAATAMSAGIKTYNNAIAVDNDGSEAKTVLEAAKEKGKATGLVATSEITHATPASFGS
HDHSRKNMNSIADDYFDEMVNGKHKIDVLLGGGKSNFDRKDRNLIKEFKKAGYSYVDDRKDMLKNKDSQVLGLFADGGLP
KKIDRTKDIPSLKDMTNTAIKKLNKDKDGFFLMVEGSQIDWAGHDNDIVGAMSEMEDFEQAYKAAIDFAKKDKHTLVVAT
ADHSTGGYSIGADGIYNWFSEPIKAAKRTPDFMAEKIADGADVEKTLKTYIDQKKLALTKAEIQSVEEAAKSKEVLDIDN
AIENIFNKRSHTGWTTGGHTGEDVPVYAYGPSSETFAGQIDNTEIAKNVFKALQYNIKINDK
>P19406 3.1.3.1~~~phoA~~~Alkaline phosphatase 4~~~COG1785
MKKMSLFQNMKSKLLPIAAVSVLTAGIFAGAELQQTEKASAKKQDKAEIRNVIVMIGDGMGTPYIRAYRSMKNNGDTPNN
PKLTEFDRNLTGMMMTHPDDPDYNITDSAAAGTALATGVKTYNNAIGVDKNGKKVKSVLEEAKQQGKSTGLVATSEINHA
TPAAYGAHNESRKNMDQIANSYMDDKIKGKHKIDVLLGGGKSYFNRKDRNLTKEFKQAGYSYVTTKQALKKNKDQQVLGL
FADGGLAKALDRDSKTPSLKDMTVSAIDRLNQNKKGFFLMVEGSQIDWAAHDNDTVGAMSEVKDFEQAYKAAIEFAKKDK
HTLVIATADHTTGGFTIGANGEKNWHAEPILSAKKTPEFMAKKISEGKPVKDVLARYANLKVTSEEIKSVEAAAQADKSK
GASKAIIKIFNTRSNSGWTSTDHTGEEVPVYAYGPGKEKFRGLINNTDQANIIFKILKTGK
>Q8A8A4 ~~~~~~Protein ppBat~~~COG0693
MAKKVAVLAVNPVNGCGLFQYLEAFFENGISYKVFAVSDTKEIKTNSGMVLIVDDVIANLKGHEDEFDALVFSCGDAVPV
FQQYANQPYNVDLMEVIKTFGEKGKMMIGHCAGAMMFDFTGITKGKKVAVHPLAKPAIQNGIATDEKSEIDGNFFTAQDE
NTIWTMLPKVIEALK
>P42251 3.1.3.1~~~phoD~~~Alkaline phosphatase D~~~COG3540
MAYDSRFDEWVQKLKEESFQNNTFDRRKFIQGAGKIAGLSLGLTIAQSVGAFEVNAAPNFSSYPFTLGVASGDPLSDSVV
LWTRLAPDPLNGGGMPKQAVPVKWEVAKDEHFRKIVRKGTEMAKPSLAHSVHVEADGLEPNKVYYYRFKTGHELSPVGKT
KTLPAPGANVPQMTFAFASCQQYEHGYYTAYKHMAKEKLDLVFHLGDYIYEYGPNEYVSKTGNVRTHNSAEIITLQDYRN
RHAQYRSDANLKAAHAAFPWVVTWDDHEVENNYANKIPEKGQSVEAFVLRRAAAYQAYYEHMPLRISSLPNGPDMQLYRH
FTYGNLASFNVLDTRQYRDDQANNDGNKPPSDESRNPNRTLLGKEQEQWLFNNLGSSTAHWNVLAQQIFFAKWNFGTSAS
PIYSMDSWDGYPAQRERVINFIKSKNLNNVVVLTGDVHASWASNLHVDFEKTSSKIFGAEFVGTSITSGGNGADKRADTD
QILKENPHIQFFNDYRGYVRCTVTPHQWKADYRVMPFVTEPGAAISTRASFVYQKDQTGLRKVSSTTIQGGVKQSDEVEE
DRFFSHNKAHEKQMIKKRAKITN
>Q02QC9 3.1.3.1~~~phoA~~~Alkaline phosphatase H~~~
MTPGYPLALSLAVSMAVLGSALPAQARQDDPSLFNRQARGELSEYGGARRVEQDLTQALKQSLSKKKAKNVILLIGDGMG
DSEITVARNYARGAGGYFKGIDALPLTGQYTHYSLHKDSGLPDYVTDSAASATAWTTGVKSYNGAIGVDIHEQPHRNLLE
LAKLNGKATGNVSTAELQDATPAALLAHVTARKCYGPEATSKQCPSNALENGGAGSITEQWLKTRPDVVLGGGAATFAET
AKAGRYAGKTLRAQAEARGYRIVENLDELKAVRRANQKQPLIGLFAPGNMPVRWLGPTATYHGNLNQPAVSCEANPKRTA
DIPTLAQMTSKAIELLKDNPNGFFLQVEGASIDKQDHAANPCGQIGETVDLDEAVQKALAFAKADGETLVIVTADHAHSS
QIIPPETAAPGLTQLLTTKDGAPLAISYGNSEEGSQEHTGTQLRIAAYGPQAANVTGLTDQTDLFFTIRRALNLRD
>P35483 3.1.3.1~~~phoA~~~Alkaline phosphatase H~~~
MTPGYPLALSLAVSMAVLGSALPAQARQDDPSLFNRQARGELSEYGGARRVEQDLTQALKQSLSKKKAKNVILLIGDGMG
DSEITVARNYARGAGGYFKGIDALPLTGQYTHYSLHKDSGLPDYVTDSAASATAWSTGVKSYNGAIGVDIHEQPHRNLLE
LAKLNGKATGNVSTAELQDATPAALLAHVTARKCYGPEATSKQCPSNALENGGAGSITEQWLKTRPDVVLGGGAATFAET
AKAGRYAGKTLRAQAEARGYRIVENLDELKAVRRANQKQPLIGLFAPGNMPVRWLGPTATYHGNLNQPAVSCEANPKRTA
DIPTLAQMTSKAIELLKDNPNGFFLQVEGASIDKQDHAANPCGQIGETVDLDEAVQKALAFAKADGETLVIVTADHAHSS
QIIPPETAAPGLTQLLTTKDGAPLAISYGNSEESSQEHTGTQLRIAAYGPQAANVTGLTDQTDLFFTIRRALNLRD
>Q02HI0 3.1.3.1~~~phoA2~~~Alkaline phosphatase L~~~
MYKRSLIAASLSVAALVSAQAMADINGGGATLPQQLYQEPGVLTAGFAAYIGVGSGNGKAAFLNNDYTKFVAGTTNKNVH
WAGSDSKLSKTNETNPYLSAHGSAWGPLIQVPSVATSVALPFNKSGSNAVNFADVNTLCGVFSGRLTDWSQIPGSGRSGA
ITVVYRSESSGTTELFTRFLNASCSSTLEGGTFAITTSFGSSFSGGLPAGAVSAQGSQAVMNALNAAQGRITYMSPDFAA
PTLAGLDDATKVAQVRGVSPAPANVSAAIGAVTPPTTAQRSDPNNWVPVFAATANPNDPSVRPYPTSGYPILGFTNLIFS
QCYANATQTQQVRDFFTRHYGATANNDTAITNHRFVPLPASWKLAVRQSFLTSTNNLYIGHSNVCNGIGRPL
>P35482 3.1.3.1~~~lapA~~~Alkaline phosphatase L~~~
MFKRSLIAASLSVAALVSAQAMAVTGGGASLPAELYKGSADSILPANFSYAVTGSGTGKNAFLTNNSSLFGTTGTVHYAG
SDSVLSGSELTTYNSNYNGTYGPLIQIPSVATSVTVPYRKDGNTTLNLTSAQLCDAFSGAKTTWGQLLGTTDSTPIRIVY
RTGSSGTTELFTRHLNSICPTRFATNSTFTNARLPAGGTLPSNWVGVAATSTVVSTVKATNGSLGYVSPDAVNINSNAEV
SRVNGNLPTQANVSTALGSVAPPANAADRADPSKWVPVFTNPSAGYSIVGYTNFVFGQCYKDASVSTDVRAFINKHYGGT
TTNAAVAAHGFIPLTPAWKSAIVSAFYTGTSENLAIGNTNVCNTKGRP
>K4LAH1 3.1.3.1~~~phoA2~~~Alkaline phosphatase L~~~
MYKRSLIAASLSVAALVSAQAMAEINGGGATLPQQLYQEPGVLTAGFAAYIGAGSGNGKAAFLNNDYTKFVAGTTNKNVH
WAGSDSKLSKTNETNPYLSAHGSAWGPLIQVPSVATSVALPFNKSGSNAVNFADVNTLCGVFSGRLTDWSQIPGSGRSGA
ITVAYRSESSGTTELFTRFLNASCSSALEGGTFAITTSFGNSFSGGLPAGAVSAQGSQAVMNTLNAAEGRITYMSPDFAA
PTLAGLDDATKVAQVRGVSPAPANVSAAIGAVTPPTTAQRSDPNNWVPVFAATASATDPSVRPYPTTGYPILGFTNLIFS
QCYADATQTQQVRDFFTRHYGASVNNDTAITNHRFVPLPASWKLAVRQSFLTSTNNLYIGHSNVCNGIGRPL
>P00634 3.1.3.1~~~phoA~~~Alkaline phosphatase~~~COG1785
MKQSTIALALLPLLFTPVTKARTPEMPVLENRAAQGDITAPGGARRLTGDQTAALRDSLSDKPAKNIILLIGDGMGDSEI
TAARNYAEGAGGFFKGIDALPLTGQYTHYALNKKTGKPDYVTDSAASATAWSTGVKTYNGALGVDIHEKDHPTILEMAKA
AGLATGNVSTAELQDATPAALVAHVTSRKCYGPSATSEKCPGNALEKGGKGSITEQLLNARADVTLGGGAKTFAETATAG
EWQGKTLREQAQARGYQLVSDAASLNSVTEANQQKPLLGLFADGNMPVRWLGPKATYHGNIDKPAVTCTPNPQRNDSVPT
LAQMTDKAIELLSKNEKGFFLQVEGASIDKQDHAANPCGQIGETVDLDEAVQRALEFAKKEGNTLVIVTADHAHASQIVA
PDTKAPGLTQALNTKDGAVMVMSYGNSEEDSQEHTGSQLRIAAYGPHAANVVGLTDQTDLFYTMKAALGLK
>Q05205 3.1.3.1~~~phoA~~~Alkaline phosphatase~~~
MNLSPSRTPICAALAAALLGAAALAPAHAAQRILQLSEDTTHSKPVSAASALRGTPLAKAGAADRVCEAGAKWLRVGFKQ
LKLAGYDSLVLTSSGGDKLVFEGQHWNQRSFTTRPLRGECVDIQPYFSQPDSAFQLDRYDYSTVALDKATVVVAGAGDIC
DTSGNACQGTSDLIVSINPTAVFTAGDNAYNSGTLSEYNSRYAPTWGRFKALTSPSPGNHDYSTTGAKGYFDYFNGSGNQ
TGPAGDRSKGYYSWDVGDWHFVSLNTMSGGTVAQAQIDWLKADLAANTKPCTAAYFHHPLLSRGSYSGYSQVKPFWDALY
AAKADLVLVGHDHNYQRYGKMNPDKAAASDGIRQVLVGTGGRAFYGISGSHALLEASNDSTFGVLKLTLSATGYTGDFVP
RAGSSYTDHFTGTCNKGSGNPPTQTLTLNSVRDVTVKSGGSRDNGATLYADGSDGGQVLRGLMAWNVSSAAGKTLTGAQV
KLQVSDRSTGTYDLYRAGAAWTEANASYSGVSLGSKIGSVVPSATGAQSIALNAAGFSW
>Q06903 3.4.21.26~~~~~~Prolyl endopeptidase~~~COG1505
MSGKARLHYPVTRQSEQLDHYFGQAVADPYRWLEDDRSPETEAWVKAQNRVTQDYLAQIPFRDAIKGKLATSWNYAKEGA
PFREGRYHYFFKNDGLQNQNVLCGQLAGKPAEVFLDPNLLSPDGTTALDQLSFSRDGKTLAYSLSLAGSDWREIHLMDVE
SKQPLETPLRDVKFSGISWLGNEGFFYSSYDKPDGSELSARTDQHKLYFHRLGTAQEEDRLVFGAIPAQRHRYVGATVTE
DDRYLLISAADSTSGNRLYVKDLTREGAPLLTVQGDLAADVSLVDNKGSRLYLLTNRDAPNRRLVTVEADNPGPEQWRDL
IPERQQVLTVHSGGGYLFAEYMVDATARVEQFDHDGKRVREVGLPGLGSVSGFNGKQDDPALYFGFENYAQPPTLYKFEP
NSGAISLYRASAAPFKPEDYVSEQRFYRSKDGTRVPLIISYRKGLKLDGSNPTILYGYGGFDVSLTPSFSVSVANWLDLG
GVYAVANLRGGGEYGQAWHLAGTRMNKQNVFDDFIAAAEYLKAEGYTRTDRLAIRGGSNGGLLVGAVMTQRPDLMRVACQ
AVGVLDMLRYHTFTAGAGWAYDYGTSADSEAMFDYLKGYSPLHSVRAGVSYPSTLVTTADHDDRVVPAHSFKFAATLQAD
DAGPHPQLIRIETNAGHGAGTPVAKLIEQSADIYAFTLFEMGYRQLPRQP
>P27028 3.4.21.26~~~f1pep1~~~Prolyl endopeptidase~~~COG1505
MKYNKLSVAVAAFAFAAVSAQNSNVLKYPETKKVSHTDTYFGTQVSDPYRWLEDDRAEDTKAWVQQEVKFTQDYLAQIPF
RDQLKKQLMDIWNYEKISAPFKKGKYTYFSKNDGLQAQSVLYRKDAAGKTEVFLDPNKFSEKGTTSLASVSFNKKGTLVA
YSISEGGSDWNKIIILDAETKKQLDETLLDVKFSGISWLGDEGFFYSSYDKPKEGSVLSGMTDKHKVYFHKLGTKQSQDE
LIIGGDKFPRRYIGAYVTDDQRYLVVSAANATNGNELYIKDLKNKTDFIPIITGFDSNVNVADTDGDTLYLFTDKDAPNK
RLVKTTIQNPKAETWKDVIAETSEPLEINTGGGYFFATYMKDAIDQVKQYDKNGKLVRAIKLPGSGNASGFGGEKTEKDL
YYSFTNYITPPTIFKYNVTTGNSEVYQKPKVKFNPENYVSEQVFYTSSDGTKIPMMISYKKGLKKDGKNPTILYSYGGFN
ISLQPAFSVVNAIWMENGGIYAVPNIRGGGEYGKKWHDAGTKMQKKNVFNDFIAAGEYLQKNGYTSKEYMALSGRSNGGL
LVGATMTMRPDLAKVAFPGVGVLDMLRYNKFTAGAGWAYDYGTAEDSKEMFEYLKSYSPVHNVKAGTCYPSTMVITSDHD
DRVVPAHSFKFGSELQAKQSCKNPILIRIETNAGHGAGRSTEQVVAENADLLSFALYEMGIKSLK
>P27195 3.4.21.26~~~~~~Prolyl endopeptidase~~~COG1505
MKYKKLSVAVAAFAFAAVSAQNSNSLKYPETKKVNHTDTYFGNQVSDPYRWLEDDRAEDTKAWVQQEVKFTQDYLAQIPF
RGQIKKQLLDIWNYEKISAPFKKGKYTYFYKNDGLQAQSVLYRKDASGKTEVFLDPNKFSDKGTTSLANLSFNKKGTLVA
YSISEGGSDWNKIIILDAETKKQIDETLLDVKFSGISWLGDEGFFYSSYDKPKDGSVLSGMTDKHKVYFHKLGTKQSQDE
LIIGGDKFPRRYLSGYVTEDQRYLVVSAANATNGNELYIKDLKNKTDFIPIITGFESNVGLVDTDGDTLFLHTDKNAPNM
RMVKTTIQNPKPETWKDVIAETSEPMRVNSGGGYFFATYMKDALSQIKQYDKTGKLVREIKLPGSGTAGGFGGEKTEKEL
YYSFTNYITPPTIFKFSIDSGKSEVYQKPKVKFNPENYVSEQVFYTSADGTKIPMMISNKKGLKKDGKNPTILYSYGGFN
ISLQPAFSVVNAIWMENGGIYAVPNIRGGGEYGKKWHDAGTKQQKKNVFNDFIAAGEYLQKNGYTSKDYMALSGRSNGGL
LVGATMTMRPDLAKVAFPGVGVLDMLRYNKFTAGAGWAYDYGTAEDSKEMFEYLKSYSPVHNVKAGTCYPSTMVITSDHD
DRVVPAHSFKFGAELQAKQACKNPVLIRIETNAGHGAGRSTEQVVMENADLLSFALYEMGIKNLK
>P36647 ~~~ppdD~~~Prepilin peptidase-dependent protein D~~~COG4969
MDKQRGFTLIELMVVIGIIAILSAIGIPAYQNYLRKAALTDMLQTFVPYRTAVELCALEHGGLDTCDGGSNGIPSPTTTR
YVSAMSVAKGVVSLTGQESLNGLSVVMTPGWDNANGVTGWTRNCNIQSDSALQQACEDVFRFDDAN
>Q9RUV0 3.1.4.16~~~~~~Phosphatase/phosphodiesterase DR_1281~~~COG1692
MRVLFIGDVFGQPGRRVLQNHLPTIRPQFDFVIVNMENSAGGFGMHRDAARGALEAGAGCLTLGNHAWHHKDIYPMLSED
TYPIVRPLNYADPGTPGVGWRTFDVNGEKLTVVNLLGRVFMEAVDNPFRTMDALLERDDLGTVFVDFHAEATSEKEAMGW
HLAGRVAAVIGTHTHVPTADTRILKGGTAYQTDAGFTGPHDSIIGSAIEGPLQRFLTERPHRYGVAEGRAELNGVALHFE
GGKATAAERYRFIED
>P22983 2.7.9.1~~~ppdK~~~Pyruvate, phosphate dikinase~~~COG0574
MAKWVYKFEEGNASMRNLLGGKGCNLAEMTILGMPIPQGFTVTTEACTEYYNSGKQITQEIQDQIFEAITWLEELNGKKF
GDTEDPLLVSVRSGARASMPGMMDTILNLGLNDVAVEGFAKKTGNPRFAYDSYRRFIQMYSDVVMEVPKSHFEKIIDAMK
EEKGVHFDTDLTADDLKELAEKFKAVYKEAMNGEEFPQEPKDQLMGAVKAVFRSWDNPRAIVYRRMNDIPGDWGTAVNVQ
TMVFGNKGETSGTGVAFTRNPSTGEKGIYGEYLINAQGEDVVAGVRTPQPITQLENDMPDCYKQFMDLAMKLEKHFRDMQ
DMEFTIEEGKLYFLQTRNGKRTAPAALQIACDLVDEGMITEEEAVVRIEAKSLDQLLHPTFNPAALKAGEVIGSALPASP
GAAAGKVYFTADEAKAAHEKGERVILVRLETSPEDIEGMHAAEGILTVRGGMTSHAAVVARGMGTCCVSGCGEIKINEEA
KTFELGGHTFAEGDYISLDGSTGKIYKGDIETQEASVSGSFERIMVWADKFRTLKVRTNADTPEDTLNAVKLGAEGIGLC
RTEHMFFEADRIMKIRKMILSDSVEAREEALNELIPFQKGDFKAMYKALEGRPMTVRYLDPPLHEFVPHTEEEQAELAKN
MGLTLAEVKAKVDELHEFNPMMGHRGCRLAVTYPEIAKMQTRAVMEAAIEVKEETGIDIVPEIMIPLVGEKKELKFVKDV
VVEVAEQVKKEKGSDMQYHIGTMIEIPRAALTADAIAEEAEFFSFGTNDLTQMTFGFSRDDAGKFLDSYYKAKIYESDPF
ARLDQTGVGQLVEMAVKKGRQTRPGLKCGICGEHGGDPSSVEFCHKVGLNYVSCSPFRVPIARLAAAQAALNNK
>P9WI47 ~~~PPE2~~~PPE family protein PPE2~~~COG5651
MTAPIWMASPPEVHSALLSSGPGPGPLLVSAEGWHSLSIAYAETADELAALLAAVQAGTWDGPTAAVYVAAHTPYLAWLV
QASANSAAMATRQETAATAYGTALAAMPTLAELGANHALHGVLMATNFFGINTIPIALNESDYARMWIQAATTMASYQAV
STAAVAAAPQTTPAPQIVKANAPTAASDEPNQVQEWLQWLQKIGYTDFYNNVIQPFINWLTNLPFLQAMFSGFDPWLPSL
GNPLTFLSPANIAFALGYPMDIGSYVAFLSQTFAFIGADLAAAFASGNPATIAFTLMFTTVEAIGTIITDTIALVKTLLE
QTLALLPAALPLLAAPLAPLTLAPASAAGGFAGLSGLAGLVGIPPSAPPVIPPVAAIAPSIPTPTPTPAPAPAPTAVTAP
TPPPGPPPPPVTAPPPVTGAGIQSFGYLVGDLNSAAQARKAVGTGVRKKTPEPDSAEAPAAAAAPEEQVQPQRRRRPKIK
QLGRGYEYLDLDPETGHDPTGSPQGAGTLGFAGTTHKASPGQVAGLITLPNDAFGGSPRTPMMPGTWDTDSATRVE
>P9WI45 ~~~PPE3~~~Uncharacterized PPE family protein PPE3~~~COG5651
MTLWMASPPEVHSALLSSGPGPGSVLSAAGVWSSLSAEYAAVADELIGLLGAVQTGAWQGPSAAAYVAAHAPYLAWLMRA
SETSAEAAARHETVAAAYTTAVAAMPTLVELAANHTLHGVLVATNFFGINTIPIALNEADYARMWTQAASTMATYQAVAE
AAVASAPQTTPAPPILAAEAADDDHDHDHDHGGEPTPLDYLVAEILRIISGGRLIWDPAEGTMNGIPFEDYTDAAQPIWW
VVRAIEFSKDFETFVQELFVNPVEAFQFYFELLLFDYPTHIVQIVEALSQSPQLLAVALGSVISNLGAVTGFAGLSGLAG
MQPAAIPALAPVAAAPSTLPAVAMAPTMAAPGAAVASAAAPASAPAASTVASATPAPPPAPGAAGFGYPYAIAPPGIGFG
SGMSASASAQRKAPQPDSAAAAAAAAAVRDQARARRRRRVTRRGYGDEFMDMNIDVDPDWGPPPGEDPVTSTVASDRGAG
HLGFAGTARREAVADAAGMTTLAGDDFGDGPTTPMVPGSWDPDRDAPGSAEPGDRG
>P9WI43 ~~~PPE4~~~PPE family protein PPE4~~~COG5651
MAAPIWMASPPEVHSALLSNGPGPGSLVAAATAWSQLSAEYASTAAELSGLLGAVPGWAWQGPSAEWYVAAHLPYVAWLT
QASADAAGAAAQHEAAAAAYTTALAAMPTLAELAANHVIHTVLVATNFFGINTIPITLNEADYVRMWLQAAAVMGLYQAA
SGAALASAPRTVPAPTVMNPGGGAASTVGAVNPWQWLLALLQQLWNAYTGFYGWMLQLIWQFLQDPIGNSIKIIIAFLTN
PIQALITYGPLLFALGYQIFFNLVGWPTWGMILSSPFLLPAGLGLGLAAIAFLPIVLAPAVIPPASTPLAAAAVAAGSVW
PAVSMAVTGAGTAGAATPAAGAAPSAGAAPAPAAPATASFAYAVGGSGDWGPSLGPTVGGRGGIKAPAATVPAAAAAAAT
RGQSRARRRRRSELRDYGDEFLDMDSDSGFGPSTGDHGAQASERGAGTLGFAGTATKERRVRAVGLTALAGDEFGNGPRM
PMVPGTWEQGSNEPEAPDGSGRGGGDGLPHDSK
>P9WI41 ~~~~~~PPE family protein PPE10~~~COG5651
MTSPHFAWLPPEINSALMFAGPGSGPLIAAATAWGELAEELLASIASLGSVTSELTSGAWLGPSAAAMMAVATQYLAWLS
TAAAQAEQAAAQAMAIATAFEAALAATVQPAVVAANRGLMQLLAATNWFGQNAPALMDVEAAYEQMWALDVAAMAGYHFD
ASAAVAQLAPWQQVLRNLGIDIGKNGQINLGFGNTGSGNIGNNNIGNNNIGSGNTGTGNIGSGNTGSGNLGLGNLGDGNI
GFGNTGSGNIGFGITGDHQMGFGGFNSGSGNIGFGNSGTGNVGLFNSGSGNIGIGNSGSLNSGIGTSGTINAGLGSAGSL
NTSFWNAGMQNAALGSAAGSEAALVSSAGYATGGMSTAALSSGILASALGSTGGLQHGLANVLNSGLTNTPVAAPASAPV
GGLDSGNPNPGSGSAAAGSGANPGLRSPGTSYPSFVNSGSNDSGLRNTAVREPSTPGSGIPKSNFYPSPDRESAYASPRI
GQPVGSE
>P9WI31 ~~~~~~PPE family protein PPE15~~~COG5651
MDFGALPPEINSARMYAGAGAGPMMAAGAAWNGLAAELGTTAASYESVITRLTTESWMGPASMAMVAAAQPYLAWLTYTA
EAAAHAGSQAMASAAAYEAAYAMTVPPEVVAANRALLAALVATNVLGINTPAIMATEALYAEMWAQDALAMYGYAAASGA
AGMLQPLSPPSQTTNPGGLAAQSAAVGSAAATAAVNQVSVADLISSLPNAVSGLASPVTSVLDSTGLSGIIADIDALLAT
PFVANIINSAVNTAAWYVNAAIPTAIFLANALNSGAPVAIAEGAIEAAEGAASAAAAGLADSVTPAGLGASLGEATLVGR
LSVPAAWSTAAPATTAGATALEGSGWTVAAEEAGPVTGMMPGMASAAKGTGAYAGPRYGFKPTVMPKQVVV
>P9WI27 ~~~~~~PPE family protein PPE17~~~COG5651
MDFTIFPPEFNSLNIQGSARPFLVAANAWKNLSNELSYAASRFESEINGLITSWRGPSSTIMAAAVAPFRAWIVTTASLA
ELVADHISVVAGAYEAAHAAHVPLPVIETNRLTRLALATTNIFGIHTPAIFALDALYAQYWSQDGEAMNLYATMAAAAAR
LTPFSPPAPIANPGALARLYELIGSVSETVGSFAAPATKNLPSKLWTLLTKGTYPLTAARISSIPVEYVLAFVEGSNMGQ
MMGNLAMRSLTPTLKGPLELLPNAVRPAVSATLGNADTIGGLSVPPSWVADKSITPLAKAVPTSAPGGPSGTSWAQLGLA
SLAGGAVGAVAARTRSGVILRSPAAG
>L7N675 ~~~~~~PPE family protein PPE18~~~COG5651
MVDFGALPPEINSARMYAGPGSASLVAAAQMWDSVASDLFSAASAFQSVVWGLTVGSWIGSSAGLMVAAASPYVAWMSVT
AGQAELTAAQVRVAAAAYETAYGLTVPPPVIAENRAELMILIATNLLGQNTPAIAVNEAEYGEMWAQDAAAMFGYAAATA
TATATLLPFEEAPEMTSAGGLLEQAAAVEEASDTAAANQLMNNVPQALQQLAQPTQGTTPSSKLGGLWKTVSPHRSPISN
MVSMANNHMSMTNSGVSMTNTLSSMLKGFAPAAAAQAVQTAAQNGVRAMSSLGSSLGSSGLGGGVAANLGRAASVGSLSV
PQAWAAANQAVTPAARALPLTSLTSAAERGPGQMLGGLPVGQMGARAGGGLSGVLRVPPRPYVMPHSPAAG
>P9WI23 ~~~~~~Uncharacterized PPE family protein PPE20~~~COG5651
MTEPWIAFPPEVHSAMLNYGAGVGPMLISATQNGELSAQYAEAASEVEELLGVVASEGWQGQAAEAFVAAYMPFLAWLIQ
ASADCVEMAAQQHVVIEAYTAAVELMPTQVELAANQIKLAVLVATNFFGINTIPIAINEAEYVEMWVRAATTMATYSTVS
RSALSAMPHTSPPPLILKSDELLPDTGEDSDEDGHNHGGHSHGGHARMIDNFFAEILRGVSAGRIVWDPVNGTLNGLDYD
DYVYPGHAIWWLARGLEFFQDGEQFGELLFTNPTGAFQFLLYVVVVDLPTHIAQIATWLGQYPQLLSAALTGVIAHLGAI
TGLAGLSGLSAIPSAAIPAVVPELTPVAAAPPMLAVAGVGPAVAAPGMLPASAPAPAAAAGATAAGPTPPATGFGGFPPY
LVGGGGPGIGFGSGQSAHAKAAASDSAAAESAAQASARAQARAARRGRSAAKARGHRDEFVTMDMGFDAAAPAPEHQPGA
RASDCGAGPIGFAGTVRKEAVVKAAGLTTLAGDDFGGGPTMPMMPGTWTHDQGVFDEHR
>Q79FK6 ~~~~~~PPE family protein PPE26~~~COG5651
MDFGALPPEVNSVRMYAGPGSAPMVAAASAWNGLAAELSSAATGYETVITQLSSEGWLGPASAAMAEAVAPYVAWMSAAA
AQAEQAATQARAAAAAFEAAFAATVPPPLIAANRASLMQLISTNVFGQNTSAIAAAEAQYGEMWAQDSAAMYAYAGSSAS
ASAVTPFSTPPQIANPTAQGTQAAAVATAAGTAQSTLTEMITGLPNALQSLTSPLLQSSNGPLSWLWQILFGTPNFPTSI
SALLTDLQPYASFFYNTEGLPYFSIGMGNNFIQSAKTLGLIGSAAPAAVAAAGDAAKGLPGLGGMLGGGPVAAGLGNAAS
VGKLSVPPVWSGPLPGSVTPGAAPLPVSTVSAAPEAAPGSLLGGLPLAGAGGAGAGPRYGFRPTVMARPPFAG
>P9WI09 ~~~~~~Uncharacterized PPE family protein PPE29~~~COG5651
MDFGLLPPEINSGRMYTGPGPGPMLAAATAWDGLAVELHATAAGYASELSALTGAWSGPSSTSMASAAAPYVAWMSATAV
HAELAGAQARLAIAAYEAAFAATVPPPVIAANRAQLMVLIATNIFGQNTPAIMMTEAQYMEMWAQDAAAMYGYAGSSATA
SRMTAFTEPPQTTNHGQLGAQSSAVAQTAATAAGGNLQSAFPQLLSAVPRALQGLALPTASQSASATPQWVTDLGNLSTF
LGGAVTGPYTFPGVLPPSGVPYLLGIQSVLVTQNGQGVSALLGKIGGKPITGALAPLAEFALHTPILGSEGLGGGSVSAG
IGRAGLVGKLSVPQGWTVAAPEIPSPAAALQATRLAAAPIAATDGAGALLGGMALSGLAGRAAAGSTGHPIGSAAAPAVG
AAAAAVEDLATEANIFVIPAMDD
>P9WI05 ~~~~~~Uncharacterized PPE family protein PPE32~~~COG5651
MLDFGALPPEINSGRMYAGPGSGPLLAAAAAWDALAAELYSAAASYGSTIEGLTVAPWMGPSSITMAAAVAPYVAWISVT
AGQAEQAGAQAKIAAGVYETAFAATVPPPVIEANRALLMSLVATNIFGQNTPAIAATEAHYAEMWAQDAAAMYGYAGSSA
TASQLAPFSEPPQTTNPSATAAQSAVVAQAAGAAASSDITAQLSQLISLLPSTLQSLATTATATSASAGWDTVLQSITTI
LANLTGPYSIIGLGAIPGGWWLTFGQILGLAQNAPGVAALLGPKAAAGALSPLAPLRGGYIGDITPLGGGATGGIARAIY
VGSLSVPQGWAEAAPVMRAVASVLPGTGAAPALAAEAPGALFGEMALSSLAGRALAGTAVRSGAGAARVAGGSVTEDVAS
TTTIIVIPAD
>P9WI03 ~~~~~~Uncharacterized PPE family protein PPE33~~~COG5651
MDFGLQPPEITSGEMYLGPGAGPMLAAAVAWDGLAAELQSMAASYASIVEGMASESWLGPSSAGMAAAAAPYVTWMSGTS
AQAKAAADQARAAVVAYETAFAAVVPPPQIAANRSQLISLVATNIFGQNTAAIAATEAEYGEMWAQDTMAMFGYASSSAT
ASRLTPFTAPPQTTNPSGLAGQAAATGQATALASGTNAVTTALSSAAAQFPFDIIPTLLQGLATLSTQYTQLMGQLINAI
FGPTGATTYQNVFVTAANVTKFSTWANDAMSAPNLGMTEFKVFWQPPPAPEIPKSSLGAGLGLRSGLSAGLAHAASAGLG
QANLVGDLSVPPSWASATPAVRLVANTLPATSLAAAPATQIPANLLGQMALGSMTGGALGAAAPAIYTGSGARARANGGT
PSAEPVKLEAVIAQLQKQPDAVRHWNVDKADLDGLLDRLSKQPGIHAVHVSNGDKPKVALPDTQLGSH
>Q79FI9 ~~~~~~PPE family protein PPE34~~~COG5263
MNFSTLPPEINSALIFGGAGSEPMSAAAVAWDQLAMELASAAASFNSVTSGLVGESWLGPSSAAMAAAVAPYLGWLAAAA
AQAQRSATQAAALVAEFEAVRAAMVQPALVAANRSDLVSLVFSNFFGQNAPAIAAIEAAYEQMWAIDVSVMSAYHAGASA
VASALTPFTAPPQNLTDLPAQLAAAPAAVVTAAITSSKGVLANLSLGLANSGFGQMGAANLGILNLGSLNPGGNNFGLGN
VGSNNVGLGNTGNGNIGFGNTGNGNIGFGLTGDNQQGFGGWNSGTGNIGLFNSGTGNIGIGNTGTGNFGIGNSGTSYNTG
IGNTGQANTGFFNAGIANTGIGNTGNYNTGSFNLGSFNTGDFNTGSSNTGFFNPGNLNTGVGNTGNVNTGGFNSGNYSNG
FFWRGDYQGLIGFSGTLTIPAAGLDLNGLGSVGPITIPSITIPEIGLGINSSGALVGPINVPPITVPAIGLGINSTGALV
GPINIPPITLNSIGLELSAFQVINVGSISIPASPLAIGLFGVNPTVGSIGPGSISIQLGTPEIPAIPPFFPGFPPDYVTV
SGQIGPITFLSGGYSLPAIPLGIDVGGGLGPFTVFPDGYSLPAIPLGIDVGGGLGPFTVFPDGYSLPAIPLGIDVGGGLG
PFTVFPDGYSLPAIPLGIDVGGAIGPLTTPPITIPSIPLGIDVSGSLGPINIPIEIAGTPGFGNSTTTPSSGFFNSGTGG
TSGFGNVGSGGSGFWNIAGNLGNSGFLNVGPLTSGILNFGNTVSGLYNTSTLGLATSAFHSGVGNTDSQLAGFMRNAAGG
TLFNFGFANDGTLNLGNANLGDYNVGSGNVGSYNFGSGNIGNGSFGFGNIGSNNFGFGNVGSNNLGFANTGPGLTEALHN
IGFGNIGGNNYGFANIGNGNIGFGNTGTGNIGIGLTGDNQVGFGALNSGSGNIGFFNSGNGNIGFFNSGNGNVGIGNSGN
YNTGLGNVGNANTGLFNTGNVNTGIGNAGSYNTGSYNAGDTNTGDLNPGNANTGYLNLGDLNTGWGNIGDLNTGALISGS
YSNGILWRGDYQGLIGYSDTLSIPAIPLSVEVNGGIGPIVVPDITIPGIPLSLNALGGVGPIVVPDITIPGIPLSLNALG
GVGPIVVPDITIPGIPLSLNALGGVGPIVVPDITIPGIPLSLNALGGVGPIVVPDITIPGIPLSLNALGGVGPITVPGVP
ISRIPLTINIRIPVNITLNELPFNVAGIFTGYIGPIPLSTFVLGVTLAGGTLESGIQGFSVNPFGLNIPLSGATNAVTIP
GFAINPFGLNVPLSGGTSPVTIPGFAINPFGLNVPLSGGTSPVTIPGFTIPGSPLNLTANGGLGPINIPINITSAPGFGN
STTTPSSGFFNSGDGSASGFGNVGPGISGLWNQVPNALQGGVSGIYNVGQLASGVANLGNTVSGFNNTSTVGHLTAAFNS
GVNNIGQMLLGFFSPGAGP
>P9WI01 ~~~~~~Uncharacterized PPE family protein PPE36~~~COG5651
MPNFWALPPEINSTRIYLGPGSGPILAAAQGWNALASELEKTKVGLQSALDTLLESYRGQSSQALIQQTLPYVQWLTTTA
EHAHKTAIQLTAAANAYEQARAAMVPPAMVRANRVQTTVLKAINWFGQFSTRIADKEADYEQMWFQDALVMENYWEAVQE
AIQSTSHFEDPPEMADDYDEAWMLNTVFDYHNENAKEEVIHLVPDVNKERGPIELVTKVDKEGTIRLVYDGEPTFSYKEH
PKF
>Q79FH3 ~~~~~~Uncharacterized PPE family protein PPE37~~~COG5651
MTFPMWFAVPPEVPSAWLSTGMGPGPLLAAARAWHALAAQYTEIATELASVLAAVQASSWQGPSADRFVVAHQPFRYWLT
HAATVATAAAAAHETAAAGYTSALGGMPTLAELAANHAMHGALVTTNFFGVNTIPIALNEADYLRMWIQAATVMSHYQAV
AHESVAATPSTPPAPQIVTSAASSAASSSFPDPTKLILQLLKDFLELLRYLAVELLPGPLGDLIAQVLDWFISFVSGPVF
TFLAYLVLDPLIYFGPFAPLTSPVLLPAGLTGLAGLGAVSGPAGPMVERVHSDGPSRQSWPAATGVTLVGTNPAALVTTP
APAPTTSAAPTAPSTPGSSAAQGLYAVGGPDGEGFNPIAKTTALAGVTTDAAAPAAKLPGDQAQSSASKATRLRRRLRQH
RFEFLADDGRLTMPNTPEMADVAAGNRGLDALGFAGTIPKSAPGSATGLTHLGGGFADVLSQPMLPHTWDGSD
>P9WHZ9 ~~~~~~Uncharacterized PPE family protein PPE38~~~COG5651
MILDFSWLPPEINSARIYAGAGSGPLFMAAAAWEGLAADLRASASSFDAVIAGLAAGPWSGPASVAMAGAAAPYVGWLSA
AAGQAELSAGQATAAATAFEAALAATVHPAAVTANRVLLGALVATNILGQNTPAIAATEFDYVEMWAQDVGAMVGYHAGA
AAVAETLTPFSVPPLDLAGLASQAGAQLTGMATSVSAALSPIAEGAVEGVPAVVAAAQSVAAGLPVDAALQVGQAAAYPA
SMLIGPMMQLAQMGTTANTAGLAGAEAAGLAAADVPTFAGDIASGTGLGGAGGLGAGMSAELGKARLVGAMSVPPTWEGS
VPARMASSAMAGLGAMPAEVPAAGGPMGMMPMPMGMGGAGAGMPAGMMGRGGANPHVVQARPSVVPRVGIG
>P9WHZ7 ~~~~~~Uncharacterized PPE family protein PPE40~~~COG5651
MVNFSVLPPEINSGRMFFGAGSGPMLAAAAAWDGLAAELGLAAESFGLVTSGLAGGSGQAWQGAAAAAMVVAAAPYAGWL
AAAAARAGGAAVQAKAVAGAFEAARAAMVDPVVVAANRSAFVQLVLSNVFGQNAPAIAAAEATYEQMWAADVAAMVGYHG
GASAAAAALAPWQQAVPGLSGLLGGAANAPAAAAQGAAQGLAELTLNLGVGNIGSLNLGSGNIGGTNVGSGNVGGTNLGS
GNYGSLNWGSGNTGTGNAGSGNTGDYNPGSGNFGSGNFGSGNIGSLNVGSGNFGTLNLANGNNGDVNFGGGNTGDFNFGG
GNNGTLNFGFGNTGSGNFGFGNTGNNNIGIGLTGDGQIGIGGLNSGTGNIGFGNSGNNNIGFFNSGDGNIGFFNSGDGNT
GFGNAGNINTGFWNAGNLNTGFGSAGNGNVGIFDGGNSNSGSFNVGFQNTGFGNSGAGNTGFFNAGDSNTGFANAGNVNT
GFFNGGDINTGGFNGGNVNTGFGSALTQAGANSGFGNLGTGNSGWGNSDPSGTGNSGFFNTGNGNSGFSNAGPAMLPGFN
SGFANIGSFNAGIANSGNNLAGISNSGDDSSGAVNSGSQNSGAFNAGVGLSGFFR
>Q79FE1 ~~~~~~PPE family protein PPE41~~~COG5651
MHFEAYPPEVNSANIYAGPGPDSMLAAARAWRSLDVEMTAVQRSFNRTLLSLMDAWAGPVVMQLMEAAKPFVRWLTDLCV
QLSEVERQIHEIVRAYEWAHHDMVPLAQIYNNRAERQILIDNNALGQFTAQIADLDQEYDDFWDEDGEVMRDYRLRVSDA
LSKLTPWKAPPPIAHSTVLVAPVSPSTASSRTDT
>P9WHZ5 ~~~~~~Uncharacterized PPE family protein PPE42~~~COG5651
MNFAVLPPEVNSARIFAGAGLGPMLAAASAWDGLAEELHAAAGSFASVTTGLAGDAWHGPASLAMTRAASPYVGWLNTAA
GQAAQAAGQARLAASAFEATLAATVSPAMVAANRTRLASLVAANLLGQNAPAIAAAEAEYEQIWAQDVAAMFGYHSAASA
VATQLAPIQEGLQQQLQNVLAQLASGNLGSGNVGVGNIGNDNIGNANIGFGNRGDANIGIGNIGDRNLGIGNTGNWNIGI
GITGNGQIGFGKPANPDVLVVGNGGPGVTALVMGGTDSLLPLPNIPLLEYAARFITPVHPGYTATFLETPSQFFPFTGLN
SLTYDVSVAQGVTNLHTAIMAQLAAGNEVVVFGTSQSATIATFEMRYLQSLPAHLRPGLDELSFTLTGNPNRPDGGILTR
FGFSIPQLGFTLSGATPADAYPTVDYAFQYDGVNDFPKYPLNVFATANAIAGILFLHSGLIALPPDLASGVVQPVSSPDV
LTTYILLPSQDLPLLVPLRAIPLLGNPLADLIQPDLRVLVELGYDRTAHQDVPSPFGLFPDVDWAEVAADLQQGAVQGVN
DALSGLGLPPPWQPALPRLF
>P9WHY9 ~~~~~~Uncharacterized PPE family protein PPE46~~~COG5651
MTAPVWLASPPEVHSALLSAGPGPGSLQAAAAGWSALSAEYAAVAQELSVVVAAVGAGVWQGPSAELFVAAYVPYVAWLV
QASADSAAAAGEHEAAAAGYVCALAEMPTLPELAANHLTHAVLVATNFFGINTIPIALNEADYVRMWVQAATVMSAYEAV
VGAALVATPHTGPAPVIVKPGANEASNAVAAATITPFPWHEIVQFLEETFAAYDQYLSALLSELPAVAWVWFQLFVDILG
FNIIGFIITLASNAQLLTEFAINASYVAVGLLYAIAGVIDIVVEWVIGNLFGVVPLLGGPLLGALAAAVVPGVAGLAGVA
GLAALPAVGAAAGAPAALVGSVAPVSGGVVSPQARLVSAVEPAPASTSVSVLASDRGAGALGFVGTAGKESVGQPAGLTV
LADEFGDGAPVPMLPGSWGPDLVGVAGDGGLVSV
>P9WHY7 ~~~~~~Uncharacterized PPE family protein PPE47/PPE48~~~COG5651
MTAPVWLASPPEVHSALLSAGPGPGSLQAAAAGWSALSAEYAAVAQELSVVVAAVGAGVWQGPSAELFVAAYVPYVAWLV
QASADSAAAAGEHEAAAAGYVCALAEMPTLPELAANHLTHAVLVATNFFGINTIPIALNEADYVRMWVQAATVMSAYEAV
VGAALVATPHTGPAPVIVKPGANEASNAVAAATITPFPFGELAKFLEMAAQAFTEVGELIMKSAEAWAVGFVELITGLVN
FEPWLVLTGMIDMFFATVGFALGVFVLVPLLEFAVVLELAILSIGWIISNIFGAIPVLGGPLLGALAAAVVPGVAGLAGV
AGLAALPAVGAAAGAPAALVGSVAPVSGGVVSPQARLVSAVEPAPASTSVSVLASDRGAGALGFVGTAGKESVGQPAGLT
VLADEFGDGAPVPMLPGSWGPDLVGVAGDGGLVSV
>Q6MX07 ~~~~~~Uncharacterized PPE family protein PPE50~~~COG5651
MDYAFLPPEINSARMYSGPGPNSMLVAAASWDALAAELASAAENYGSVIARLTGMHWWGPASTSMLAMSAPYVEWLERTA
AQTKQTATQARAAAAAFEQAHAMTVPPALVTGIRGAIVVETASASNTAGTPP
>P9WHY3 ~~~~~~Transporter PPE51~~~COG5651
MDFALLPPEVNSARMYTGPGAGSLLAAAGGWDSLAAELATTAEAYGSVLSGLAALHWRGPAAESMAVTAAPYIGWLYTTA
EKTQQTAIQARAAALAFEQAYAMTLPPPVVAANRIQLLALIATNFFGQNTAAIAATEAQYAEMWAQDAAAMYGYATASAA
AALLTPFSPPRQTTNPAGLTAQAAAVSQATDPLSLLIETVTQALQALTIPSFIPEDFTFLDAIFAGYATVGVTQDVESFV
AGTIGAESNLGLLNVGDENPAEVTPGDFGIGELVSATSPGGGVSASGAGGAASVGNTVLASVGRANSIGQLSVPPSWAAP
STRPVSALSPAGLTTLPGTDVAEHGMPGVPGVPVAAGRASGVLPRYGVRLTVMAHPPAAG
>Q50703 ~~~~~~PPE family protein PPE57~~~COG5651
MHPMIPAEYISNIIYEGPGADSLFFASGQLRELAYSVETTAESLEDELDELDENWKGSSSDLLADAVERYLQWLSKHSSQ
LKHAAWVINGLANAYNDTRRKVVPPEEIAANREERRRLIASNVAGVNTPAIADLDAQYDQYRARNVAVMNAYVSWTRSAL
SDLPRWREPPQIYRGG
>P9WHW9 ~~~~~~PPE family immunomodulator PPE68~~~COG5651
MLWHAMPPELNTARLMAGAGPAPMLAAAAGWQTLSAALDAQAVELTARLNSLGEAWTGGGSDKALAAATPMVVWLQTAST
QAKTRAMQATAQAAAYTQAMATTPSLPEIAANHITQAVLTATNFFGINTIPIALTEMDYFIRMWNQAALAMEVYQAETAV
NTLFEKLEPMASILDPGASQSTTNPIFGMPSPGSSTPVGQLPPAATQTLGQLGEMSGPMQQLTQPLQQVTSLFSQVGGTG
GGNPADEEAAQMGLLGTSPLSNHPLAGGSGPSAGAGLLRAESLPGAGGSLTRTPLMSQLIEKPVAPSVMPAAAAGSSATG
GAAPVGAGAMGQGAQSGGSTRPGLVAPAPLAQEREEDDEDDWDEEDDW
>Q183R7 3.4.24.89~~~zmp1~~~Pro-Pro endopeptidase~~~
MRPSKKLLIAIISIFLISSVPVSAHADSTTIQQNKDTLSQIVVFPTGNYDKNEANAMVNRLANIDGKYLNALKQNNLKIK
LLSGKLTDEKEYAYLKGVVPKGWEGTGKTWDDVPGLGGSTVALRIGFSNKGKGHDAINLELHETAHAIDHIVLNDISKSA
QFKQIFAKEGRSLGNVNYLGVYPEEFFAESFAYYYLNQDTNSKLKSACPQTYSFLQNLAK
>K4ZRC1 3.4.24.89~~~~~~Pro-Pro endopeptidase~~~
MKWDKRVVALILAVMIVCPLFAAPAHAQEQSILDKLVVLPSGEYNHSEAAAMKQRLEKIPTSILDALYSKGVKIKLTQGA
ITNEPELAYLKGVVPRGWEGTGLTWDDVPGVSERVVAVRIGYSEKGKGHNSLNLEIHETLHAVDRLVLNEVSGTDEFINI
FNKEASVKYKGDGYVSAYPTEYFAEAASLYLYSDATRSDLKDSMPLTYEFMAKLFAN
>A5U654 2.7.1.63~~~ppgK~~~Polyphosphate glucokinase~~~COG1940
MTSTGPETSETPGATTQRHGFGIDVGGSGIKGGIVDLDTGQLIGDRIKLLTPQPATPLAVAKTIAEVVNGFGWRGPLGVT
YPGVVTHGVVRTAANVDKSWIGTNARDTIGAELGGQQVTILNDADAAGLAETRYGAGKNNPGLVVLLTFGTGIGSAVIHN
GTLIPNTEFGHLEVGGKEAEERAASSVKEKNDWTYPKWAKQVIRVLIAIENAIWPDLFIAGGGISRKADKWVPLLENRTP
VVPAALQNTAGIVGAAMASVADTTH
>P9WIN1 2.7.1.63~~~ppgK~~~Polyphosphate glucokinase~~~COG1940
MTSTGPETSETPGATTQRHGFGIDVGGSGIKGGIVDLDTGQLIGDRIKLLTPQPATPLAVAKTIAEVVNGFGWRGPLGVT
YPGVVTHGVVRTAANVDKSWIGTNARDTIGAELGGQQVTILNDADAAGLAETRYGAGKNNPGLVVLLTFGTGIGSAVIHN
GTLIPNTEFGHLEVGGKEAEERAASSVKEKNDWTYPKWAKQVIRVLIAIENAIWPDLFIAGGGISRKADKWVPLLENRTP
VVPAALQNTAGIVGAAMASVADTTH
>Q84G06 3.11.1.3~~~pphA~~~Phosphonopyruvate hydrolase~~~
MTKNQALRAALDSGRLFTAMAAHNPLVAKLAEQAGFGGIWGSGFELSASYAVPDANILSMSTHLEMMRAIASTVSIPLIA
DIDTGFGNAVNVHYVVPQYEAAGASAIVMEDKTFPKDTSLRTDGRQELVRIEEFQGKIAAATAARADRDFVVIARVEALI
AGLGQQEAVRRGQAYEEAGADAILIHSRQKTPDEILAFVKSWPGKVPLVLVPTAYPQLTEADIAALSKVGIVIYGNHAIR
AAVGAVREVFARIRRDGGIREVDAALPSVKEIIELQGDERMRAVEARYLK
>Q7A6I1 5.2.1.8~~~~~~Putative peptidyl-prolyl cis-trans isomerase~~~
MANYPQLNKEVQQGEIKVVMHTNKGDMTFKLFPNIAPKTVENFVTHAKNGYYDGITFHRVINDFMIQGGDPTATGMGGES
IYGGAFEDEFSLNAFNLYGALSMANSGPNTNGSQFFIVQMKEVPQNMLSQLADGGWPQPIVDAYGEKGGTPWLDQKHTVF
GQIIDGETTLEDIANTKVGPQDKPLHDVVIESIDVEE
>P73789 5.2.1.8~~~~~~Peptidyl-prolyl cis-trans isomerase slr1251~~~COG0652
MMSKVFFDITIGSDTAGRIVMELFDEVTPKTAENFRALCTGEKGVGKAGKPLHFKGSHFHRVITDFMAQGGDFTRGNGTG
GESIYGEKFADENFQLKHDRPGLLSMANAGPNTNGSQFFLTFVPCPWLDGKHVVFGEVVEGLEILEQLEANGSQSGQTKQ
AIVISDCGEIK
>P0AFL3 5.2.1.8~~~ppiA~~~Peptidyl-prolyl cis-trans isomerase A~~~COG0652
MFKSTLAAMAAVFALSALSPAAMAAKGDPHVLLTTSAGNIELELDKQKAPVSVQNFVDYVNSGFYNNTTFHRVIPGFMIQ
GGGFTEQMQQKKPNPPIKNEADNGLRNTRGTIAMARTADKDSATSQFFINVADNAFLDHGQRDFGYAVFGKVVKGMDVAD
KISQVPTHDVGPYQNVPSKPVVILSAKVLP
>P9WHW3 5.2.1.8~~~ppiA~~~Peptidyl-prolyl cis-trans isomerase A~~~COG0652
MADCDSVTNSPLATATATLHTNRGDIKIALFGNHAPKTVANFVGLAQGTKDYSTQNASGGPSGPFYDGAVFHRVIQGFMI
QGGDPTGTGRGGPGYKFADEFHPELQFDKPYLLAMANAGPGTNGSQFFITVGKTPHLNRRHTIFGEVIDAESQRVVEAIS
KTATDGNDRPTDPVVIESITIS
>Q06118 5.2.1.8~~~ppiA~~~Peptidyl-prolyl cis-trans isomerase A~~~
MTTKVYFDITIDDAPAGRITFNLFDDVVPKTAENFRALATGEKGFGYAGSSFHRVITDFMLQGGDFTRGDGTGGKSIYGE
KFADENFQLKHDRVGLLSMANAGKNTNGSQFFITTVLTPWLDGKHVVFGEVADDDSMALVRKIEALGSSSGRTSAKVTIA
ESGAL
>P35137 5.2.1.8~~~ppiB~~~Peptidyl-prolyl cis-trans isomerase B~~~COG0652
MKTGYFLLEDGNKIEFELYPEAAPGTVANFEKLANEGFYDGLTFHRVIPGFVSQGGCPHGTGTGGPGYTIKCETEGNPHT
HEAGALSMAHAGKDTGGSQFFIVHEPQPHLNGVHTVFGKVTSGLEFAKNMSNGDVMKEVRVEG
>P23869 5.2.1.8~~~ppiB~~~Peptidyl-prolyl cis-trans isomerase B~~~COG0652
MVTFHTNHGDIVIKTFDDKAPETVKNFLDYCREGFYNNTIFHRVINGFMIQGGGFEPGMKQKATKEPIKNEANNGLKNTR
GTLAMARTQAPHSATAQFFINVVDNDFLNFSGESLQGWGYCVFAEVVDGMDVVDKIKGVATGRSGMHQDVPKEDVIIESV
TVSE
>P9WHW1 5.2.1.8~~~ppiB~~~Probable peptidyl-prolyl cis-trans isomerase B~~~COG0652
MGHLTPVAAPRLACAFVPTNAQRRATAKRKLERQLERRAKQAKRRRILTIVGGSLAAVAVIVAVVVTVVVNKDDHQSTTS
ATPTDSASTSPPQAATAPPLPPFKPSANLGANCQYPPSPDKAVKPVKLPRTGKVPTDPAQVSVSMVTNQGNIGLMLANNE
SPCTVNSFVSLAQQGFFKGTTCHRLTTSPMLAVLQCGDPKGDGTGGPGYQFANEYPTDQYSANDPKLNEPVIYPRGTLAM
ANAGPNTNSSQFFMVYRDSKLPPQYTVFGTIQADGLTTLDKIAKAGVAGGGEDGKPATEVTITSVLLD
>P77949 5.2.1.8~~~cypB~~~Peptidyl-prolyl cis-trans isomerase B~~~
MAEQLYATLKTNRGDIEIRLLPNHAPKTVRNFVELATGQREWVNPETGEKSTDRLYDGTVFHRVISGFMIQGGDPLGNGT
GGPGYKFADEFHPELGFTQPYLLAMANAGPGTNGSQFFLTVSPTAWLTGKHTIFGEVSGEAGRKVVDAIAATPTNPRTDR
PLEDVVIESVVVETR
>P83221 5.2.1.8~~~~~~Peptidyl-prolyl cis-trans isomerase cyp18~~~
MSTVELNTSAGRIVLELNDAEAPKTVENFLAYVRSGHYDGTIFHRVISDFMIQGGGFTPDMQQKSTLAPIQNEADNGLRN
DNYTVAMARTNDPHSATAQFFINVKDNAFLNHTSKTPNGWGYAVFGRVTEGQDVVDAIKGVKTGSSRGHQDVPVQPVVIE
SAKILG
>O66105 5.2.1.8~~~ppiB~~~Probable peptidyl-prolyl cis-trans isomerase~~~COG0652
MNTQVWRVCVGVMLFCFVGRIGCAEEKMVREEGLAVADGIYAVMETNRGTIVLSLFFEKAPLTVCNFVGLAEGTLAVCKG
RPFYQGLTFHRVIKDFMIQGGDPQGNGTGGPGYQFPDECDPALRHDSPGVLSMANAGPGTNGSQFFITHVATPWLDGKHT
VFGKVVEGMEVVHAIIAGDTIRSLKIVRRGAAAKRFVCDQAQFDQLRKRVSAASK
>P0A9L5 5.2.1.8~~~ppiC~~~Peptidyl-prolyl cis-trans isomerase C~~~COG0760
MAKTAAALHILVKEEKLALDLLEQIKNGADFGKLAKKHSICPSGKRGGDLGEFRQGQMVPAFDKVVFSCPVLEPTGPLHT
QFGYHIIKVLYRN
>P0ADY1 ~~~ppiD~~~Periplasmic chaperone PpiD~~~COG0760
MMDSLRTAANSLVLKIIFGIIIVSFILTGVSGYLIGGGNNYAAKVNDQEISRGQFENAFNSERNRMQQQLGDQYSELAAN
EGYMKTLRQQVLNRLIDEALLDQYARELKLGISDEQVKQAIFATPAFQVDGKFDNSRYNGILNQMGMTADQYAQALRNQL
TTQQLINGVAGTDFMLKGETDELAALVAQQRVVREATIDVNALAAKQPVTEQEIASYYEQNKNNFMTPEQFRVSYIKLDA
ATMQQPVSDADIQSYYDQHQDQFTQPQRTRYSIIQTKTEDEAKAVLDELNKGGDFAALAKEKSADIISARNGGDMGWLED
ATIPDELKNAGLKEKGQLSGVIKSSVGFLIVRLDDIQPAKVKSLDEVRDDIAAKVKHEKALDAYYALQQKVSDAASNDTE
SLAGAEQAAGVKATQTGWFSKDNLPEELNFKPVADAIFNGGLVGENGAPGINSDIITVDGDRAFVLRISEHKPEAVKPLA
DVQEQVKALVQHNKAEQQAKVDAEKLLVDLKAGKGAEAMQAAGLKFGEPKTLSRSGRDPISQAAFALPLPAKDKPSYGMA
TDMQGNVVLLALDEVKQGSMPEDQKKAMVQGITQNNAQIVFEALMSNLRKEAKIKIGDALEQQ
>P0A7B1 2.7.4.1~~~ppk~~~Polyphosphate kinase~~~COG0855
MGQEKLYIEKELSWLSFNERVLQEAADKSNPLIERMRFLGIYSNNLDEFYKVRFAELKRRIIISEEQGSNSHSRHLLGKI
QSRVLKADQEFDGLYNELLLEMARNQIFLINERQLSVNQQNWLRHYFKQYLRQHITPILINPDTDLVQFLKDDYTYLAVE
IIRGDTIRYALLEIPSDKVPRFVNLPPEAPRRRKPMILLDNILRYCLDDIFKGFFDYDALNAYSMKMTRDAEYDLVHEME
ASLMELMSSSLKQRLTAEPVRFVYQRDMPNALVEVLREKLTISRYDSIVPGGRYHNFKDFINFPNVGKANLVNKPLPRLR
HIWFDKAQFRNGFDAIRERDVLLYYPYHTFEHVLELLRQASFDPSVLAIKINIYRVAKDSRIIDSMIHAAHNGKKVTVVV
ELQARFDEEANIHWAKRLTEAGVHVIFSAPGLKIHAKLFLISRKENGEVVRYAHIGTGNFNEKTARLYTDYSLLTADARI
TNEVRRVFNFIENPYRPVTFDYLMVSPQNSRRLLYEMVDREIANAQQGLPSGITLKLNNLVDKGLVDRLYAASSSGVPVN
LLVRGMCSLIPNLEGISDNIRAISIVDRYLEHDRVYIFENGGDKKVYLSSADWMTRNIDYRIEVATPLLDPRLKQRVLDI
IDILFSDTVKARYIDKELSNRYVPRGNRRKVRAQLAIYDYIKSLEQPE
>P9WHV9 2.7.4.1~~~ppk~~~Polyphosphate kinase~~~COG0855
MMSNDRKVTEIENSPVTEVRPEEHAWYPDDSALAAPPAATPAAISDQLPSDRYLNRELSWLDFNARVLALAADKSMPLLE
RAKFLAIFASNLDEFYMVRVAGLKRRDEMGLSVRSADGLTPREQLGRIGEQTQQLASRHARVFLDSVLPALGEEGIYIVT
WADLDQAERDRLSTYFNEQVFPVLTPLAVDPAHPFPFVSGLSLNLAVTVRQPEDGTQHFARVKVPDNVDRFVELAAREAS
EEAAGTEGRTALRFLPMEELIAAFLPVLFPGMEIVEHHAFRITRNADFEVEEDRDEDLLQALERELARRRFGSPVRLEIA
DDMTESMLELLLRELDVHPGDVIEVPGLLDLSSLWQIYAVDRPTLKDRTFVPATHPAFAERETPKSIFATLREGDVLVHH
PYDSFSTSVQRFIEQAAADPNVLAIKQTLYRTSGDSPIVRALIDAAEAGKQVVALVEIKARFDEQANIAWARALEQAGVH
VAYGLVGLKTHCKTALVVRREGPTIRRYCHVGTGNYNSKTARLYEDVGLLTAAPDIGADLTDLFNSLTGYSRKLSYRNLL
VAPHGIRAGIIDRVEREVAAHRAEGAHNGKGRIRLKMNALVDEQVIDALYRASRAGVRIEVVVRGICALRPGAQGISENI
IVRSILGRFLEHSRILHFRAIDEFWIGSADMMHRNLDRRVEVMAQVKNPRLTAQLDELFESALDPCTRCWELGPDGQWTA
SPQEGHSVRDHQESLMERHRSP
>P0DP44 2.7.4.1~~~ppk~~~Polyphosphate kinase~~~
MDDSSLYIHRELSQLQFNIRVLEQALDESYPLLERLKFLLIFSSNLDEFFEIRIAGLKKQITFAREQAGADGLLPHQALA
RISELVHEQVSRQYRILNETLLPELAKHQIRFIRRRHWTLKIKTWVRRFFRDEIAPIITPIGLDPTHPFPLLVNKSLNFI
VELEGMDAFGRDSGLAIIPAPRLLPRIIRLPEDVGGEGDNYVFLSSMIHAHADDLFPGMKVKGCYQFRLTRNADLSVDTE
DVEDLARALRGELFSRRYGDAVRLEVVDTCPQNLTNYLLKQFGLSESELYKVSGPVNLTRLFSVTGLESHPELQYPPFTP
AIPRLLQKKENLFNVLSKLDVLLMHPFESFTPVIDLLRQAAKDPNVLAIKQTLYRSGANSEIVDALVEAARNGKEVTAVI
ELRARFDEESNLQLASRLQQAGAVVIYGVVGFKTHAKMMLILRREDGELRRYAHLGTGNYHAGNARLYTDYSLLTADVAL
CEDLHKLFNQLIGMGKTLRMKKLLHAPFTLKKNLLEMINREAAQAALGQPAHIMAKVNSLTDPKVIRALYKASQAGVRID
LVVRGMCCLRPGIPGVSHNIHVRSIIGRFLEHSRIYYFLNGGDEKLYLSSADWMERNLDMRVETCFPVEGKKLVQRVKKE
LETYLTDNTQAWVLQADGSYQRLSPTGNQNPRNTQATLLEKLAAPVLTAR
>P0DP45 2.7.4.1~~~ppk~~~Polyphosphate kinase~~~COG0855
MDDSSLYIHRELSQLQFNIRVLEQALDESYPLLERLKFLLIFSSNLDEFFEIRIAGLKKQITFAREQAGADGLLPHQALA
RISELVHEQVSRQYRILNETLLPELAKHQIRFIRRRHWTLKIKTWVRRFFRDEIAPIITPIGLDPTHPFPLLVNKSLNFI
VELEGMDAFGRDSGLAIIPAPRLLPRIIRLPEDVGGEGDNYVFLSSMIHAHADDLFPGMKVKGCYQFRLTRNADLSVDTE
DVEDLARALRGELFSRRYGDAVRLEVVDTCPQNLTNYLLKQFGLSESELYKVSGPVNLTRLFSVTGLESHPELQYPPFTP
AIPRLLQKKENLFNVLSKLDVLLMHPFESFTPVIDLLRQAAKDPNVLAIKQTLYRSGANSEIVDALVEAARNGKEVTAVI
ELRARFDEESNLQLASRLQQAGAVVIYGVVGFKTHAKMMLILRREDGELRRYAHLGTGNYHAGNARLYTDYSLLTADVAL
CEDLHKLFNQLIGMGKTLRMKKLLHAPFTLKKNLLEMINREAAQAALGQPAHIMAKVNSLTDPKVIRALYKASQAGVRID
LVVRGMCCLRPGIPGVSHNIHVRSIIGRFLEHSRIYYFLNGGDEKLYLSSADWMERNLDMRVETCFPVEGKKLVQRVKKE
LETYLTDNTQAWVLQADGSYQRLSPTGNQNPRNTQATLLEKLAAPVLTAR
>A0QZ12 2.4.1.-~~~ppm1~~~Polyprenol monophosphomannose synthase~~~COG0463
MSVPGEREQGAGEDPATVRPTQRTLVIIPTYNERENLPLIVGRVHHACPQVHILVVDDGSPDGTGALADELALADPDRVH
VMHRTSKAGLGAAYLAGFDWGLRRGYSVLVEMDADGSHAPEELSRLLDAVDAGADLAIGSRYVPGGTVRNWPWRRLVLSK
TANTYSRFLLGVGIHDITAGYRAYRREVLEKIDLSAVDSKGYCFQIDLTWRAINNGFSVVEVPITFTERELGVSKMSGSN
IREAMFKVAEWGIRGRLDRARGVVR
>A0A0H3M5A8 ~~~lnt~~~Bifunctional apolipoprotein N-acyltransferase/polyprenol monophosphomannose synthase~~~
MKLGAWVAAQLPTTRTAVRTRLTRLVVSIVAGLLLYASFPPRNCWWAAVVALALLAWVLTHRATTPVGGLGYGLLFGLVF
YVSLLPWIGELVGPGPWLALATTCALFPGIFGLFAVVVRLLPGWPIWFAVGWAAQEWLKSILPFGGFPWGSVAFGQAEGP
LLPLVQLGGVALLSTGVALVGCGLTAIALEIEKWWRTGGQGDAPPAVVLPAACICLVLFAAIVVWPQVRHAGSGSGGEPT
VTVAVVQGNVPRLGLDFNAQRRAVLDNHVEETLRLAADVHAGLAQQPQFVIWPENSSDIDPFVNPDAGQRISAAAEAIGA
PILIGTLMDVPGRPRENPEWTNTAIVWNPGTGPADRHDKAIVQPFGEYLPMPWLFRHLSGYADRAGHFVPGNGTGVVRIA
GVPVGVATCWEVIFDRAPRKSILGGAQLLTVPSNNATFNKTMSEQQLAFAKVRAVEHDRYVVVAGTTGISAVIAPDGGEL
IRTDFFQPAYLDSQVRLKTRLTPATRWGPILQWILVGAAAAVVLVAMRQNGWFPRPRRSEPKGENDDSDAPPGRSEASGP
PALSESDDELIQPEQGGRHSSGFGRHRATSRSYMTTGQPAPPAPGNRPSQRVLVIIPTFNERENLPVIHRRLTQACPAVH
VLVVDDSSPDGTGQLADELAQADPGRTHVMHRTAKNGLGAAYLAGFAWGLSREYSVLVEMDADGSHAPEQLQRLLDAVDA
GADLAIGSRYVAGGTVRNWPWRRLVLSKTANTYSRLALGIGIHDITAGYRAYRREALEAIDLDGVDSKGYCFQIDLTWRT
VSNGFVVTEVPITFTERELGVSKMSGSNIREALVKVARWGIEGRLSRSDHARARPDIARPGAGGSRVSRADVTE
>O53493 ~~~ppm1~~~Bifunctional apolipoprotein N-acyltransferase/polyprenol monophosphomannose synthase~~~COG0463
MKLGAWVAAQLPTTRTAVRTRLTRLVVSIVAGLLLYASFPPRNCWWAAVVALALLAWVLTHRATTPVGGLGYGLLFGLVF
YVSLLPWIGELVGPGPWLALATTCALFPGIFGLFAVVVRLLPGWPIWFAVGWAAQEWLKSILPFGGFPWGSVAFGQAEGP
LLPLVQLGGVALLSTGVALVGCGLTAIALEIEKWWRTGGQGDAPPAVVLPAACICLVLFAAIVVWPQVRHAGSGSGGEPT
VTVAVVQGNVPRLGLDFNAQRRAVLDNHVEETLRLAADVHAGLAQQPQFVIWPENSSDIDPFVNPDAGQRISAAAEAIGA
PILIGTLMDVPGRPRENPEWTNTAIVWNPGTGPADRHDKAIVQPFGEYLPMPWLFRHLSGYADRAGHFVPGNGTGVVRIA
GVPVGVATCWEVIFDRAPRKSILGGAQLLTVPSNNATFNKTMSEQQLAFAKVRAVEHDRYVVVAGTTGISAVIAPDGGEL
IRTDFFQPAYLDSQVRLKTRLTPATRWGPILQWILVGAAAAVVLVAMRQNGWFPRPRRSEPKGENDDSDAPPGRSEASGP
PALSESDDELIQPEQGGRHSSGFGRHRATSRSYMTTGQPAPPAPGNRPSQRVLVIIPTFNERENLPVIHRRLTQACPAVH
VLVVDDSSPDGTGQLADELAQADPGRTHVMHRTAKNGLGAAYLAGFAWGLSREYSVLVEMDADGSHAPEQLQRLLDAVDA
GADLAIGSRYVAGGTVRNWPWRRLVLSKTANTYSRLALGIGIHDITAGYRAYRREALEAIDLDGVDSKGYCFQIDLTWRT
VSNGFVVTEVPITFTERELGVSKMSGSNIREALVKVARWGIEGRLSRSDHARARPDIARPGAGGSRVSRADVTE
>P0ADR8 3.2.2.-~~~ppnN~~~Pyrimidine/purine nucleotide 5'-monophosphate nucleosidase~~~COG1611
MITHISPLGSMDMLSQLEVDMLKRTASSDLYQLFRNCSLAVLNSGSLTDNSKELLSRFENFDINVLRRERGVKLELINPP
EEAFVDGRIIRALQANLFAVLRDILFVYGQIHNTVRFPNLNLDNSVHITNLVFSILRNARALHVGEAPNMVVCWGGHSIN
ENEYLYARRVGNQLGLRELNICTGCGPGAMEAPMKGAAVGHAQQRYKDSRFIGMTEPSIIAAEPPNPLVNELIIMPDIEK
RLEAFVRIAHGIIIFPGGVGTAEELLYLLGILMNPANKDQVLPLILTGPKESADYFRVLDEFVVHTLGENARRHYRIIID
DAAEVARQMKKSMPLVKENRRDTGDAYSFNWSMRIAPDLQMPFEPSHENMANLKLYPDQPVEVLAADLRRAFSGIVAGNV
KEVGIRAIEEFGPYKINGDKEIMRRMDDLLQGFVAQHRMKLPGSAYIPCYEICT
>Q6FF51 2.4.2.1~~~ppnP~~~Pyrimidine/purine nucleoside phosphorylase~~~COG3123
MSSAQFDHVTVIKKSNVYFGGLCISHTVQFEDGTKKTLGVILPTEQPLTFETHVPERMEIISGECRVKIADSTESELFRA
GQSFYVPGNSLFKIETDEVLDYVCHLEG
>P0C037 2.4.2.1~~~ppnP~~~Pyrimidine/purine nucleoside phosphorylase~~~COG3123
MLQSNEYFSGKVKSIGFSSSSTGRASVGVMVEGEYTFSTAEPEEMTVISGALNVLLPDATDWQVYEAGSVFNVPGHSEFH
LQVAEPTSYLCRYL
>Q9I3E3 2.4.2.1~~~ppnP~~~Pyrimidine/purine nucleoside phosphorylase~~~
MFKVNEYFDGTVKSIAFDMTAGPATIGVMAAGEYEFGTSQLEIMHVVAGALTVKLPGSDEWQEYASGSQFTVPANSKFQL
KVAQDTAYLCEYR
>Q8ZRE7 2.4.2.1~~~ppnP~~~Pyrimidine/purine nucleoside phosphorylase~~~
MLQSNEYFSGKVKSIGFTSSSTGRASVGVMAEGEYTFGTAEPEEMTVVSGALKVLLPGTVEWKVYTAGEVFNVPGHSEFH
LQVAEPASYLCRYL
>Q9KKY0 2.4.2.1~~~ppnP~~~Pyrimidine/purine nucleoside phosphorylase~~~COG3123
MIKENVYFDGNVKSLGFSQQDGESTVGVMAPGQYTFGTGAPERMTVVKGALTIKRVTDADWVTFTAGEAFEVAGNSSFDL
QVEVATAYLCEFLPA
>Q87K41 2.4.2.1~~~ppnP~~~Pyrimidine/purine nucleoside phosphorylase~~~COG3123
MSIKENSYFAGGVKSLGFNQHGQDVSVGVMLPGEYTFGTQAPERMTVVKGALVVKRVGEADWTTYSSGESFDVEGNSSFE
LQVKDATAYLCEYL
>Q46836 ~~~pppA~~~Prepilin peptidase PppA~~~COG1989
MLFDVFQQYPTAMPVLATVGGLIIGSFLNVVIWRYPIMLRQQMAEFHGEMSSAQSKISLALPRSHCPHCQQTIRIRDNIP
LFSWLMLKGRCRDCQAKISKRYPLVELLTALAFLLASLVWPESGWGLAVMILSAWLIAASVIDLDHQWLPDVFTQGVLWT
GLIAAWAQQSPLTLQDAVTGVLVGFITFYSLRWIAGIVLRKEALGMGDVLLFAALGGWVGALSLPNVALIASCCGLIYAV
ITKRGSTTLPFGPCLSLGGIATLYLQALF
>O25656 3.4.24.-~~~pqqE~~~Zinc protease PqqE~~~COG0612
MKHFSVKRLLGLSSVLLVTLGASMHAQSYLPKHESVTLKNGLQVVSVPLENKTGVIEVDVLYKVGSRNETMGKSGIAHML
EHLNFKSTKNLKAGEFDKIVKRFGGVSNASTSFDITRYFIKTSQANLDKSLELFAETMGSLNLKEDEFLPERQVVAEERR
WRTDNSPIGMLYFRFFNTAYVYHPYHWTPIGFMDDIQNWTLKDIKKFHSLYYQPKNAIVLVVGDVNSQKVFELSKKHFES
LKNLDEKAIPTPYMKEPKQDGARTAVVHKDGVHLEWVALGYKVPAFKHKDQVALDALSRLLGEGKSSWLQSELVDKKRLA
SQAFSHNMQLQDESVFLFIAGGNPNVKAEALQKEIVALLEKLKKGEITQAELDKLKINQKADFISNLESSSDVAGLFADY
LVQNDIQGLTDYQRQFLDLKVSDLVRVANEYFKDTQSTTVFLKP
>O32504 ~~~pprA~~~DNA repair protein PprA~~~
MLPLAFLICSGHNKGSMARAKAKDQTDGIYAAFDTLMSTAGVDSQIAALAASEADAGTLDAALTQSLQEAQGRWGLGLHH
LRHEARLTDDGDIEILTDGRPSARVSEGFGALAQAYAPMQALDERGLSQWAALGEGYRAPGDLPLAQLKVLIEHARDFET
DWSAGRGETFQRVWRKGDTLFVEVARPASAEAALSDAAWDVIASIKDRAFQRELMRRSEKDGMLGALLGARHAGAKANLA
QLPEAHFTVQAFVQTLSGAAARNAEEYRAALKTAAAALEEYQGVTTRQLSEVLRHGLRES
>Q9HWA7 2.7.13.3~~~pprA~~~Two-component sensor PprA~~~
MFEFSRSSSAEAERPEPFSQEGPALWSASLRSWDLCFEMDEQDRVIRVGGRQAYRLQCAHGLGEQPRPFAEYLERRAPGA
PTLAGLRRGERLDLTLRSDAAAPLTCRFQPMQPLDGLGRSLLLGMDISDLNWQSDSQQHQLQSLSLGKLILSRLRHVSHG
HLAEAVQEILESLSGAFQMQAIALLLGDGKGFCTVFASHVRPGSDSLLRPPLQLADDDLREGAGARLLRRGEGASTLLRQ
IGEDALYLVPATMRGGRLGALLVRPMSLEQLAQGPAPQDWQYLAELLANQVADRCELHEQHDSSRKLGLLQEMIGGGWWR
YWAEQELFELAPALHDSLGLTGEYRRVPLEHLQGLLQPADADELGLRLRASLRSGQALAQDLCLRQPDSRGERRWLRIEG
RPLGRGSALGLSGVLLDISEGRRQEERAQAAHARLRSLIDSAPVVIYVQRVEQGHLVPEFYSESASNLLGLDLQGQSWQA
LAERVHPDDLEAFFARGRELLREGRVKTRYRLADGQGNWHWLYDEAKLLRDAQGLPSEAVGLWLDVTEQHLAAQRIAESE
ERYRVLVEDSPALICRYTADLVLTYVNRTFADSLATSPERLVGRRLDEWLAAEDASALRARLLGSPREGASEVPELRFNL
PGQRFLWLVWAERPLFDARGELCEVQAVGRDNTPVRRAQQQLAQGAKMASLGEMVSGLAHEVKQPLHVLRMTLFNMRQRM
NSVGLDGDYLGEKLERMDAQVLRVDRLVSHLGVFSRKSALEALPFDPYAAFEGALGLLGEGLRQHAIEVECPAPTQRMVV
RGQADQLEQVIINLLANARDALLGNPGLASRRVRLEQVACREPGWVELHVHDNGGGIEPLLLERIFEPFFTTKAEGKGTG
LGLSVSHDLVRNMGGSLTAANQGEGALFVVRLPLAAPAEAGG
>Q9HWA4 ~~~pprB~~~Two-component response regulator PprB~~~
MDKPASRHFSVLIIDDEPQVTSELRELLENSGYRCVTSTHRESAIASFQADPNIGLVICDLYLGQDNGIRLIESLKEVAG
NGRFFESIILTGHDGRQEVIEAMRVGAADYYQKPVAPQELLHGLERLESRLHERVRSQLSLSHVNQRLEYLAESLNSIYR
DIHKIKYEVHGNSQPSALRSEDSQPSAPPAPVAESQVSPSNPLFGKLSPRQQAVARLVSKGLTNYQIAYELGITENTVKL
YVSQVLRLMHMHNRTQLALALSPAAMQQGSGAVVH
>P39845 2.3.1.-~~~ppsA~~~Plipastatin synthase subunit A~~~COG1020
MSEHTYSLTHAQRRVWFTELLEPDTSICNLTACVKFKGNIELDTLEGALNHSISRNDAIRFQLLEGEELEPRLHLTEYKY
YPLRIIDFSNVEMIEIEQWIQDQASIPFKLFNSPLFQFYLLRIDSHEVWLFAKFHHIIMDGISLNVMGNQIIDLYQKMKK
KDPLPDQPEPSYLSYIEKESQYLQSPRFAKDRLFWTQTFEHPLEYHSLADQTSLQKQSTSASRDTIILSPDLEQTIRIFC
EEHKINIISLFMASFYICISRITSKKDLAIGTYYGNRGSKAEKEMLGMFVSSLPIRITVDPDTDFLSFVRTIGREQLSVM
RHQRFPYNLLVNELRNEQKDLHNLIGISMQYQPLQWHNADDFDYETALYFSGYTANELSVQIQERIDNGTIQLNFDYQNT
LFSLEDIKRIQSHLLTILENALKHPHSFIRELDMTNTREKQKLLYEFNKTEAVSPKAFTLHGLFERQAAFTPERLAIRFS
GGSLTYAELDMYASRLAAHLAARGVTNESIVGVLSERSPDMLIAVLAVLKAGGAYLPLDPAYPKERLSYMLKDSGASLLL
TQPGCSAPNFSGETLEVDMTSLASEKAENHEFTPADGGSLAYVIYTSGSTGQPKGVAVEHRQAVSFLTGMQHQFPLSEDD
IVMVKTSFSFDASVWQLFWWSLSGASAYLLPPGWEKDSALIVQAIHQENVTTAHFIPAMLNSFLDQAEIERLSDRTSLKR
VFAGGEPLAPRTAARFASVLPQVSLIHGYGPTEATVDAAFYVLDPERDRDRLRIPIGKPVPGARLYVLDPHLAVQPSGVA
GELYIAGAGVARGYLNRPALTEERFLEDPFYPGERMYKTGDVARWLPDGNVEFLGRTDDQVKIRGYRIEPGEIEAALRSI
EGVREAAVTVRTDSGEPELCAYVEGLQRNEVRAQLERLLPGYMVPAYMIEMEQWPVTPSGKLDRNALPAPGGAADAETYT
APRNVTEMKLSQLWEDVLKNGPVGIHDNFFDRGGHSLKATALVSRIAKEFDVQVPLKDVFAHPTVEGLATVIREGTDSPY
EAIKPAEKQETYPVSSAQKRIYVLQQLEDGGTGYNMPAVLELEGKLNPERMERAFKELIKRHESLRTSFEQDAGGDPVQR
IHDEVPFTLQTTVLGERTEQEAAAAFIKPFDLSQAPLFRAQIVKISDERHLLLVDMHHIISDGVSVNILIREFGELYNNR
NLPALRIQYKDYAVWREGFKTGDAYKTQEAYWLKQLEGELPVLDLPADHARPPVRSFAGDKVSFTLDQEVASGLHKLARE
NGSTLYMVLLAAYTAFLSRLSGQEDIIVGSPIAGRPHKDLEPILGMFVNTLALRTRPEGGKPFVQYLQEVRETALEAFEH
QDYPFEELVDKLELTRDMSRNPVFDAMFILQNVEKQDIDLREIKVRPANFAHHISLFDITLIATEISGSICCEMEFSTEV
FLKATIERWADHFIEFLHEALSTPETSLAQINILSDKEKQKIVFEFNKTQVEFAQKDIPFHRIFEAKAEENPEHIAVIDN
ETEISYRLLNERANRLARTLQNRKGPKPTVAVLAKRSIDAIVGVLAVMKAGGVYIPIDAHYPKARIEYILRDSGADILLL
QRELKHLISNSPESEMSHIFLDDEGSFEESNCNLNLSPAPEEPVYIIYTSGTTGAPKGVIVTYQNFTHAALAWRQIYELD
RKPVRLLQIASFSFDVFSGDLARTLTNGGTLIVCPDETRLEPAEIYKIIKSQRITVMESTPALIIPVMEYVYRNQFKLPD
LDILILGSDMVKAQDFKTLTDRFGQSMRIINSYGVTEATIDSSFYETSMGGECTGDNVPIGSPLPNVHMYVLSQTDQIQP
IGVAGELCIGGAGVAKGYHHKPDLTQMKFTENPFVSGERLYRTGDRACWLPNGTIRLLGRMDYQVKINGYRIETEEIESV
LLQTGLVREAAVAVQHDKNGQAGLAAYIVPSDVNTNALRAALTKELPAYMIPAYLIPLVNMPLTLNGKLDRNALPAPNNV
LSRPYTAPVNDLQKTMAYIWEDVLSMSRVGIHDSFFELGGDSIKALQVAARLAAEGWSMTIRDLFRYSTIQELCGHITPL
ASQADQGPAEGEAELTPIQRRFFGQVHAFHYHYNQSVMLFSEKGFNANALHLALRKITEHHDAIRMIFQRDQNGHVIQFN
RGINHKDHELFGLYISDWTKASLERAHLDEKLAAEETVIQSKMNVEKGPLLQAGLFKTAEGDHLLIALHHLVIDGVSWRI
LLEDLAAAYQQALEKKEIQLPPKTDSYLSYADGLTQIAESKQLLSEKTYWQTILDAHTAFLPKDIENVPDKLQMNSDAAA
FVLSGDWTEKLLFETQQAYGTDANELLLTALGMALSEWTGHDQIVISTEGHGREGHVPNIDISRTVGWFTSIYPILLDMG
IPEPFEDQLAYRIKTTKDMLRRVPNKGTGYGLLTHIGELRHKEPEVSFNYLGQFSEEKEVETFQLSYYQPRYEIAGERER
EYELDINALITDGRLHVKAVYTQVFSKHSIECFMDRFHRHLIETIEHCSQKKAREKTLSDFSNKELTLSALSSIEDLVKD
L
>P23538 2.7.9.2~~~ppsA~~~Phosphoenolpyruvate synthase~~~COG0574
MSNNGSSPLVLWYNQLGMNDVDRVGGKNASLGEMITNLSGMGVSVPNGFATTADAFNQFLDQSGVNQRIYELLDKTDIDD
VTQLAKAGAQIRQWIIDTPFQPELENAIREAYAQLSADDENASFAVRSSATAEDMPDASFAGQQETFLNVQGFDAVLVAV
KHVFASLFNDRAISYRVHQGYDHRGVALSAGVQRMVRSDLASSGVMFSIDTESGFDQVVFITSAWGLGEMVVQGAVNPDE
FYVHKPTLAANRPAIVRRTMGSKKIRMVYAPTQEHGKQVKIEDVPQEQRDIFSLTNEEVQELAKQAVQIEKHYGRPMDIE
WAKDGHTGKLFIVQARPETVRSRGQVMERYTLHSQGKIIAEGRAIGHRIGAGPVKVIHDISEMNRIEPGDVLVTDMTDPD
WEPIMKKASAIVTNRGGRTCHAAIIARELGIPAVVGCGDATERMKDGENVTVSCAEGDTGYVYAELLEFSVKSSSVETMP
DLPLKVMMNVGNPDRAFDFACLPNEGVGLARLEFIINRMIGVHPRALLEFDDQEPQLQNEIREMMKGFDSPREFYVGRLT
EGIATLGAAFYPKRVIVRLSDFKSNEYANLVGGERYEPDEENPMLGFRGAGRYVSDSFRDCFALECEAVKRVRNDMGLTN
VEIMIPFVRTVDQAKAVVEELARQGLKRGENGLKIIMMCEIPSNALLAEQFLEYFDGFSIGSNDMTQLALGLDRDSGVVS
ELFDERNDAVKALLSMAIRAAKKQGKYVGICGQGPSDHEDFAAWLMEEGIDSLSLNPDTVVQTWLSLAELKK
>Q7TXM0 2.3.1.292~~~ppsA~~~Phenolphthiocerol/phthiocerol polyketide synthase subunit A~~~
MTGSISGEADLRHWLIDYLVTNIGCTPDEVDPDLSLADLGVSSRDAVVLSGELSELLGRTVSPIDFWEHPTINALAAYLA
APEPSPDSDAAVKRGARNSLDEPIAVVGMGCRFPGGISCPEALWDFLCERRSSISQVPPQRWQPFEGGPPEVAAALARTT
RWGSFLPDIDAFDAEFFEISPSEADKMDPQQRLLLEVAWEALEHAGIPPGTLRRSATGVFAGACLSEYGAMASADLSQVD
GWSNSGGAMSIIANRLSYFLDLRGPSVAVDTACSSSLVAIHLACQSLRTQDCHLAIAAGVNLLLSPAVFRGFDQVGALSP
TGQCRAFDATADGFVRGEGAGVVVLKRLTDAQRDGDRVLAVICGSAVTQDGRSNGLMAPNPAAQMAVLRAAYTNAGMQPS
EVDYVEAHGTGTLLGDPIEARALGTVLGRGRPEDSPLLIGSVKTNLGHTEAAAGIAGFIKTVLAVQHGQIPPNQHFETAN
PHIPFTDLRMKVVDTQTEWPATGHPRRAGVSSFGFGGTNAHVVIEQGQEVRPAPGQGLSPAVSTLVVAGKTMQRVSATAG
MLADWMEGPGADVALADVAHTLNHHRSRQPKFGTVVARDRTQAIAGLRALAAGQHAPGVVNPAEGSPGPGTVFVYSGRGS
QWAGMGRQLLADEPAFAAAVAELEPVFVEQAGFSLHDVLANGEELVGIEQIQLGLIGMQLALTELWCSYGVQPDLVIGHS
MGEVAAAVVAGALTPAEGLRVTATRSRLMAPLSGQGGMALLELDAPTTEALIADFPQVTLGIYNSPRQTVIAGPTEQIDE
LITRVRARDRFASRVNIEVAPHNPAMDALQPAMRSELADLTPRTPTIGIISTTYADLHTQPVFDAEHWATNMRNPVHFQQ
AIASAGSGADGAYHTFIEISAHPLLTQAIIDTLHSAQPGARYTSLGTLQRDTDDVVTFRTNLNKAHTIHPPHTPHPPEPH
PPIPTTPWQHTRHWITTKYPAGSVGSAPRAGTLLGQHTTVATVSASPPSHLWQARLAPDAKPYQGGHRFHQVEVVPASVV
LHTILSAATELGYSALSEVRFEQPIFADRPRLIQVVADNRAISLASSPAAGTPSDRWTRHVTAQLSSSPSDSASSLNEHH
RANGQPPERAHRDLIPDLAELLAMRGIDGLPFSWTVASWTQHSSNLTVAIDLPEALPEGSTGPLLDAAVHLAALSDVADS
RLYVPASIEQISLGDVVTGPRSSVTLNRTAHDDDGITVDVTVAAHGEVPSLSMRSLRYRALDFGLDVGRAQPPASTGPVE
AYCDATNFVHTIDWQPQTVPDATHPGAEQVTHPGPVAIIGDDGAALCETLEGAGYQPAVMSDGVSQARYVVYVADSDPAG
ADETDVDFAVRICTEITGLVRTLAERDADKPAALWILTRGVHESVAPSALRQSFLWGLAGVIAAEHPELWGGLVDLAIND
DLGEFGPALAELLAKPSKSILVRRDGVVLAPALAPVRGEPARKSLQCRPDAAYLITGGLGALGLLMADWLADRGAHRLVL
TGRTPLPPRRDWQLDTLDTELRRRIDAIRALEMRGVTVEAVAADVGCREDVQALLAARDRDGAAPIRGIIHAAGITNDQL
VTSMTGDAVRQVMWPKIGGSQVLHDAFPPGSVDFFYLTASAAGIFGIPGQGSYAAANSYLDALARARRQQGCHTMSLDWV
AWRGLGLAADAQLVSEELARMGSRDITPSEAFTAWEFVDGYDVAQAVVVPMPAPAGADGSGANAYLLPARNWSVMAATEV
RSELEQGLRRIIAAELRVPEKELDTDRPFAELGLNSLMAMAIRREAEQFVGIELSATMLFNHPTVKSLASYLAKRVAPHD
VSQDNQISALSSSAGSVLDSLFDRIESAPPEAERSV
>P9WQE7 2.3.1.292~~~ppsA~~~Phenolphthiocerol/phthiocerol polyketide synthase subunit A~~~COG1020
MTGSISGEADLRHWLIDYLVTNIGCTPDEVDPDLSLADLGVSSRDAVVLSGELSELLGRTVSPIDFWEHPTINALAAYLA
APEPSPDSDAAVKRGARNSLDEPIAVVGMGCRFPGGISCPEALWDFLCERRSSISQVPPQRWQPFEGGPPEVAAALARTT
RWGSFLPDIDAFDAEFFEISPSEADKMDPQQRLLLEVAWEALEHAGIPPGTLRRSATGVFAGACLSEYGAMASADLSQVD
GWSNSGGAMSIIANRLSYFLDLRGPSVAVDTACSSSLVAIHLACQSLRTQDCHLAIAAGVNLLLSPAVFRGFDQVGALSP
TGQCRAFDATADGFVRGEGAGVVVLKRLTDAQRDGDRVLAVICGSAVNQDGRSNGLMAPNPAAQMAVLRAAYTNAGMQPS
EVDYVEAHGTGTLLGDPIEARALGTVLGRGRPEDSPLLIGSVKTNLGHTEAAAGIAGFIKTVLAVQHGQIPPNQHFETAN
PHIPFTDLRMKVVDTQTEWPATGHPRRAGVSSFGFGGTNAHVVIEQGQEVRPAPGQGLSPAVSTLVVAGKTMQRVSATAG
MLADWMEGPGADVALADVAHTLNHHRSRQPKFGTVVARDRTQAIAGLRALAAGQHAPGVVNPADGSPGPGTVFVYSGRGS
QWAGMGRQLLADEPAFAAAVAELEPVFVEQAGFSLHDVLANGEELVGIEQIQLGLIGMQLALTELWCSYGVRPDLVIGHS
MGEVAAAVVAGALTPAEGLRVTATRSRLMAPLSGQGGMALLELDAPTTEALIADFPQVTLGIYNSPRQTVIAGPTEQIDE
LIARVRAQNRFASRVNIEVAPHNPAMDALQPAMRSELADLTPRTPTIGIISTTYADLHTQPVFDAEHWATNMRNPVRFQQ
AIASAGSGADGAYHTFIEISAHPLLTQAIIDTLHSAQPGARYTSLGTLQRDTDDVVTFRTNLNKAHTIHPPHTPHPPEPH
PPIPTTPWQHTRHWITTKYPAGSVGSAPRAGTLLGQHTTVATVSASPPSHLWQARLAPDAKPYQGGHRFHQVEVVPASVV
LHTILSAATELGYSALSEVRFEQPIFADRPRLIQVVADNRAISLASSPAAGTPSDRWTRHVTAQLSSSPSDSASSLNEHH
RANGQPPERAHRDLIPDLAELLAMRGIDGLPFSWTVASWTQHSSNLTVAIDLPEALPEGSTGPLLDAAVHLAALSDVADS
RLYVPASIEQISLGDVVTGPRSSVTLNRTAHDDDGITVDVTVAAHGEVPSLSMRSLRYRALDFGLDVGRAQPPASTGPVE
AYCDATNFVHTIDWQPQTVPDATHPGAEQVTHPGPVAIIGDDGAALCETLEGAGYQPAVMSDGVSQARYVVYVADSDPAG
ADETDVDFAVRICTEITGLVRTLAERDADKPAALWILTRGVHESVAPSALRQSFLWGLAGVIAAEHPELWGGLVDLAIND
DLGEFGPALAELLAKPSKSILVRRDGVVLAPALAPVRGEPARKSLQCRPDAAYLITGGLGALGLLMADWLADRGAHRLVL
TGRTPLPPRRDWQLDTLDTELRRRIDAIRALEMRGVTVEAVAADVGCREDVQALLAARDRDGAAPIRGIIHAAGITNDQL
VTSMTGDAVRQVMWPKIGGSQVLHDAFPPGSVDFFYLTASAAGIFGIPGQGSYAAANSYLDALARARRQQGCHTMSLDWV
AWRGLGLAADAQLVSEELARMGSRDITPSEAFTAWEFVDGYDVAQAVVVPMPAPAGADGSGANAYLLPARNWSVMAATEV
RSELEQGLRRIIAAELRVPEKELDTDRPFAELGLNSLMAMAIRREAEQFVGIELSATMLFNHPTVKSLASYLAKRVAPHD
VSQDNQISALSSSAGSVLDSLFDRIESAPPEAERSV
>Q9K0I2 2.7.9.2~~~ppsA~~~Phosphoenolpyruvate synthase~~~
MADNYVIWFENLRMTDVERVGGKNASLGEMISQLTEKGVRVPGGFATTAEAYRAFLAHNGLSERISAALAKLDVEDVAEL
ARVGKEIRQWILDTPFPEQLDAEIEAAWNKMVADAGGADISVAVRSSATAEDLPDASFAGQQETFLNINGLDNVKEAMHH
VFASLYNDRAISYRVHKGFEHDIVALSAGVQRMVRSDSGASGVMFTLDTESGYDQVVFVTSSYGLGENVVQGAVNPDEFY
VFKPTLKAGKPAILRKTMGSKHIKMIFTDKAEAGKSVTNVDVPEEDRNRFSITDEEITELAHYALTIEKHYGRPMDIEWG
RDGLDGKLYILQARPETVKSQEEGNRNLRRFAINGDKTVLCEGRAIGQKVGQGKVRLIKDASEMDSVEAGDVLVTDMTDP
DWEPVMKRASAIVTNRGGRTCHAAIIARELGIPAVVGCGNATELLKNGQEVTVSCAEGDTGFIYAGLLDVQITDVALDNM
PKAPVKVMMNVGNPELAFSFANLPSEGIGLARMEFIINRQIGIHPKALLEFDKQDDELKAEITRRIAGYASPVDFYVDKI
AEGVATLAASVYPRKTIVRMSDFKSNEYANLVGGNVYEPHEENPMLGFRGAARYVADNFKDCFALECKALKRVRDEMGLT
NVEIMIPFVRTLGEAEAVVKALKENGLERGKNGLRLIMMCELPSNAVLAEQFLQYFDGFSIGSNDMTQLTLGLDRDSGLV
SESFDERNPAVKVMLHLAISACRKQNKYVGICGQGPSDHPDFAKWLVEEGIESVSLNPDTVIETWLYLANELNK
>Q02KR1 2.7.9.2~~~ppsA~~~Phosphoenolpyruvate synthase~~~
MVEYVVSLDKLGVHDVEHVGGKNASLGEMISNLAGAGVSVPGGFATTAQAYRDFLEQSGLNDRIHAALDALDVDDVNALA
KTGAQIRQWVMEAEFPARLDSEIRQAFAALANGNDNLAVAVRSSATAEDLPDASFAGQQETFLNIRGVDNVIRAAKEVFA
SLFNDRAIAYRVHQGFDHKLVALSAGVQRMVRSETGTAGVMFTLDTESGFRDVVFITGAYGLGETVVQGAVNPDEFYVHK
PTLEAGRPAILRRNLGSKAIKMIYGDEAKAGRSVKVVDVDRADRARFALSDAEVTELAKQAMIIEKHYGRPMDIEWAKDG
DDGKLYIVQARPETVKSRASATVMERYLLKEKGTVLVEGRAIGQRIGAGPVKVINDVSEMDKVQPGDVLVSDMTDPDWEP
VMKRASAIVTNRGGRTCHAAIIARELGIPAVVGCGNATQILQDGQGVTVSCAEGDTGFIFEGELGFDVRKNSVDAMPDLP
FKIMMNVGNPDRAFDFAQLPNEGVGLARLEFIINRMIGVHPKALLNFAGLPADIKESVEKRIAGYPDPVGFYVEKLVEGI
STLAAAFWPKKVIVRLSDFKSNEYANLIGGKLYEPEEENPMLGFRGASRYISESFRDCFELECRALKKVRNEMGLTNVEI
MVPFVRTLGEASQVVELLAGNGLKRGENGLKVIMMCELPSNALLADEFLEFFDGFSIGSNDLTQLTLGLDRDSGIVAHLF
DERNPAVKKLLANAIAACNKAGKYIGICGQGPSDHPDLARWLMEQGIESVSLNPDSVLDTWFFLAEGQDQA
>P39846 2.3.1.-~~~ppsB~~~Plipastatin synthase subunit B~~~COG1020
MAQSAQIQDIYPLSHMQEGMLFHSLMDFSSKAYIEQTSFTITGNLCVDSFQKSLNLLVSRYDIFRTIFIKEVPDLTGPQQ
VVLSNRELTVYREDISRLADQEQQTLIDAFMTKDREKGFDLQKDPLMRLALFDRGDSQYTCVWTHHHIIMDGWCLGIILK
EFFSMYDSLKNNSPVQLGSTVPYSRYIEWLGEQDQEETAAYWSEYLKEYGNTASIPRIKRRTADGNYKADQVSFSLAPDM
VEKLTEAAQNWGVTLNTLFMSIWGVLLHRYNAADDAVFGSVISGRPSAIDGIESMVGLFINTVPVRIRSAEGITFSSLVK
AVQEDILSSEQHGYYPLYEIQNHSPLKQGLIDHIFVFENYPVQLHQALSVESENDEGALKLSDISMSEQTNYDFNIVIVP
GESFYIKFSYNADVYEREEMLRIQGHLKQALDCILTNPDVAVSDINIVPPEEQQVIQLFNETERPYVNKTIPQLFEEQAH
KTPEAAALKMGNECWTYRQLQVRANQIAHALIEKGVGSGDIVAVMMGRSMEMPAALLGIWKAGGAYMPLDPHFPAERLSF
LLKDSQAAQLLIEEDLISLIPPSYEGNTITIEHTESYQTEAPNMPPGDLAYLIYTSGTTGRPKGVLVDHHGIANTLQWRR
EEYSMTEQDISLHLFSYVFDGCVTSLFTPLLSGACVLLTTDDEAKDVLALKRKIARYKVSHMIIVPSLYRVLLEVMTADD
AKSLRIVTFAGEAVTPDLLELNQIICPSAELANEYGPTENSVATTILRHLNKKERITIGHPIRNTKVFVLHGNQMQPIGA
AGELCISGAGLARGYYKQQELTQKAFSDHPFLEGERLYRTGDAGRFLPDGTIEYIGRFDDQVKIRGYRIELREIETVLRQ
APGVKEAAVLARDVSAEEKELVAYIVPEKGNSLPDLYQHLAGTLPSYMIPASIINISQMPLTSSGKLDRFALPEPENNTS
VTYMAPRTLIEADLAHIWEDVLNKQHIGIRDDFFQLGGQSLKAAALVSRIHKKLNVELPLSEVFSYPTVESMAVKLMSLK
EHAFTQIEPADQRDVYPLSFSQKRLYALHQLADDSTGYNMPAVLELRGNLNRQRLRSVLTELVNRHEALRTVFVLDRDEP
VQIIYPEMAFDLKELEMESEQMLESAIETFIKPFYLSSGPLFRACVITMGNNRGFLLLDMHHIIADGVSMSTLVQEFTDL
YCGKELPALNLHYKDFAVWQQEKHPKELYKKQEAYWLGQLGGSLPTLELPLDKTRPRLPDFRGGTIEVNIDKDMADELHR
LMAETGTTLYMILLAVYSILLSKLSGQEDIVVGSPAAGRPHADLERVIGMFVNTLAMRSKPEGHKTFSSYLHDIRHLALT
AYEHQDYPFEELADKLDTNREVNRNPLFDAMLVLQSSEDFRFEVPGLSISSVTPKHDISKFDLTLHAEEHLSGIRCRFEY
STALFEEETITQWASYFIELVKGVTADTEMRISNMQLLPAAERRLLLEKMGQYAAYPRNENIVSLFEKQVAQYPEHIAVV
CGHSQLTYRDLNEKAERAAAMLIKQGVRTGDIVGLMLDRSPDMIIGVLSILKAGGAYLPIDPEYPKERISFMLNDSGAKL
LLTERGLNKPADYTGHILYIDECENNSIPADVNIEEIVTDQPAYVIYTSGTTGQPKGVIVEHRNVISLLKHQNLPFEFNH
EDVWTLFHSYCFDFSVWEMFGALLNGSTLVVVSKETARDPQAFRLLLKKERVTVLNQTPTAFYGLMLEDQNHTDHLNIRY
VIFGGEALQPGLLQSWNEKYPHTDLINMYGITETTVHVTFKKLSAADIAKNKSNIGRPLSTLQAHVMDAHMNLQPTGVPG
ELYIGGEGVARGYLNRDELTADRFVSNPYLPGDRLYRTGDLAKRLSNGELEYLGRIDEQVKVRGHRIELGEIQAALLQYP
MIKEAAVITRADEQGQTAIYAYMVIKDQQAANISDIRTYLKNALPDFMLPARMIQIDSIPVTVNGKLDQKALPEPEKQAY
TADDISPRNEIETVMAEIWEELLNVDELGVSANFFKLGGDSIKALQVCARLKQRGFETTVREMFEHQTLGELSARVRKDV
RAIDQGPVEGEITWTPIQQWFFSQSLESHHFNQSVMIYRAERFDEAALRKVLKSLVTHHDALRIVCRHEDGRQVQINRGI
DLSDEELYALELFDVKDSLTEARNTIEEAASRMQEHIRLETGPLLHAGLFRTENGDHLFLTIHHLVVDAVSWRILFEDFS
TAYKQAVSGESIKLPQKTDSYLTYSQRIADYSISRQVQREAAYWDECENRHIQPIPKDNDAASNTFKDTEVIDFELSRHH
TELLLTAAHKAYSTEMNDILLTALGLALQKWTGNNQFKISMEGHGRESYLEDIDISRTVGWFTSIYPVWLDMRDSDHKDK
EERLGHLIKQTKDMLHRIPHKGAGYGVLKYISKRWGSQKNSPEISFNYLGQFDQDIQSNAFEVSDIKPGNEISPNWERPY
ALDISGAVSSGCLNMHIIYNRFQFEEKTIQTFSRHFKQTLENIIEHCTGKENQEWSASDFTDEDLTLDELSEIMGAVNKL
>Q7TXL9 2.3.1.292~~~ppsB~~~Phenolphthiocerol/phthiocerol polyketide synthase subunit B~~~
MMRTAFSRISGMTAQQRTSLADEFDRVSRIAVAEPVAVVGIGCRFPGDVDGPESFWDFLVAGRNAISTVPADRWDAEAFY
HPDPLTPGRMTTKWGGFVPDVAGFDAEFFGITPREAAAMDPQQRMLLEVAWEALEHAGIPPDSLGGTRTAVMMGVYFNEY
QSMLAASPQNVDAYSGTGNAHSITVGRISYLLGLRGPAVAVDTACSSSLVAVHLACQSLRLRETDLALAGGVSITLRPET
QIAISAWGLLSPQGRCAAFDAAADGFVRGEGAGVVVLKRLTDAVRDGDQVLAVVRGSAVNQDGRSNGVTAPNTAAQCDVI
ADALRSGDVAPDSVNYVEAHGTGTVLGDPIEFEALAATYGHGGDACALGAVKTNIGHLEAAAGIAGFIKATLAVQRATIP
PNLHFSQWNPAIDAASTRFFVPTQNSPWPTAEGPRRAAVSSFGLGGTNAHVIIEQGSELAPVSEGGEDTGVSTLVVTGKT
AQRMAATAQVLADWMEGPGAEVAVADVAHTVNHHRARQATFGTVVARDRAQAIAGLRALAAGQHAPGVVSHQDGSPGPGT
VFVYSGRGSQWAGMGRQLLADEPAFAAAVAELEPVFVEQAGFSLRDVIATGKELVGIEQIQLGLIGMQLTLTELWRSYGV
QPDLVIGHSMGEVAAAVVAGALTPAEGLRVTATRARLMAPLSGQGGMALLGLDAAATEALIADYPQVTVGIYNSPRQTVI
AGPTEQIDELIARVRAQNRFASRVNIEVAPHNPAMDALQPAMRSELADLTPRTPTIGIISTTYADLHTQPIFDAEHWATN
MRNPVRFQQAIASAGSGADGAYHTFIEISAHPLLTQAIADTLEDAHRPTKSAAKYLSIGTLQRDADDTVTFRTNLYTADI
AHPPHTCHPPEPHPTIPTTPWQHTHHWIATTHPSTAAPEDPGSNKVVVNGQSTSESRALEDWCHQLAWPIRPAVSADPPS
TAAWLVVADNELCHELARAADSRVDSLSPPALAAGSDPAALLDALRGVDNVLYAPPVPGELLDIESAYQVFHATRRLAAA
MVASSATAISPPKLFIMTRNAQPISEGDRANPGHAVLWGLGRSLALEHPEIWGGIIDLDDSMPAELAVRHVLTAAHGTDG
EDQVVYRSGARHVPRLQRRTLPGKPVTLNADASQLVIGATGNIGPHLIRQLARMGAKTIVAMARKPGALDELTQCLAATG
TDLIAVAADATDPAAMQTLFDRFGTELPPLEGIYLAAFAGRPALLSEMTDDDVTTMFRPKLDALALLHRLSLKSPVRHFV
LFSSVSGLLGSRWLAHYTATSAFLDSFAGARRTMGLPATVVDWGLWKSLADVQKDATQISAESGLQPMADEVAIGALPLV
MNPDAAVATVVVAADWPLLAAAYRTRGALRIVDDLLPAPEDVGKGESEFRTSLRSCPAEKRRDMLFDHVGALAATVMGMP
PTEPLDPSAGFFQLGMDSLMSVTLQRALSESLGEFLPASVVFDYPTVYSLTDYLATVLPELLEIGATAVATQQATDSYHE
LTEAELLEQLSERLRGTQ
>P9WQE5 2.3.1.292~~~ppsB~~~Phenolphthiocerol/phthiocerol polyketide synthase subunit B~~~COG1020
MMRTAFSRISGMTAQQRTSLADEFDRVSRIAVAEPVAVVGIGCRFPGDVDGPESFWDFLVAGRNAISTVPADRWDAEAFY
HPDPLTPGRMTTKWGGFVPDVAGFDAEFFGITPREAAAMDPQQRMLLEVAWEALEHAGIPPDSLGGTRTAVMMGVYFNEY
QSMLAASPQNVDAYSGTGNAHSITVGRISYLLGLRGPAVAVDTACSSSLVAVHLACQSLRLRETDLALAGGVSITLRPET
QIAISAWGLLSPQGRCAAFDAAADGFVRGEGAGVVVLKRLTDAVRDGDQVLAVVRGSAVNQDGRSNGVTAPNTAAQCDVI
ADALRSGDVAPDSVNYVEAHGTGTVLGDPIEFEALAATYGHGGDACALGAVKTNIGHLEAAAGIAGFIKATLAVQRATIP
PNLHFSQWNPAIDAASTRFFVPTQNSPWPTAEGPRRAAVSSFGLGGTNAHVIIEQGSELAPVSEGGEDTGVSTLVVTGKT
AQRMAATAQVLADWMEGPGAEVAVADVAHTVNHHRARQATFGTVVARDRAQAIAGLRALAAGQHAPGVVSHQDGSPGPGT
VFVYSGRGSQWAGMGRQLLADEPAFAAAVAELEPVFVEQAGFSLRDVIATGKELVGIEQIQLGLIGMQLTLTELWRSYGV
QPDLVIGHSMGEVAAAVVAGALTPAEGLRVTATRARLMAPLSGQGGMALLGLDAAATEALIADYPQVTVGIYNSPRQTVI
AGPTEQIDELIARVRAQNRFASRVNIEVAPHNPAMDALQPAMRSELADLTPRTPTIGIISTTYADLHTQPIFDAEHWATN
MRNPVRFQQAIASAGSGADGAYHTFIEISAHPLLTQAIADTLEDAHRPTKSAAKYLSIGTLQRDADDTVTFRTNLYTADI
AHPPHTCHPPEPHPTIPTTPWQHTHHWIATTHPSTAAPEDPGSNKVVVNGQSTSESRALEDWCHQLAWPIRPAVSADPPS
TAAWLVVADNELCHELARAADSRVDSLSPPALAAGSDPAALLDALRGVDNVLYAPPVPGELLDIESAYQVFHATRRLAAA
MVASSATAISPPKLFIMTRNAQPISEGDRANPGHAVLWGLGRSLALEHPEIWGGIIDLDDSMPAELAVRHVLTAAHGTDG
EDQVVYRSGARHVPRLQRRTLPGKPVTLNADASQLVIGATGNIGPHLIRQLARMGAKTIVAMARKPGALDELTQCLAATG
TDLIAVAADATDPAAMQTLFDRFGTELPPLEGIYLAAFAGRPALLSEMTDDDVTTMFRPKLDALALLHRRSLKSPVRHFV
LFSSVSGLLGSRWLAHYTATSAFLDSFAGARRTMGLPATVVDWGLWKSLADVQKDATQISAESGLQPMADEVAIGALPLV
MNPDAAVATVVVAADWPLLAAAYRTRGALRIVDDLLPAPEDVGKGESEFRTSLRSCPAEKRRDMLFDHVGALAATVMGMP
PTEPLDPSAGFFQLGMDSLMSVTLQRALSESLGEFLPASVVFDYPTVYSLTDYLATVLPELLEIGATAVATQQATDSYHE
LTEAELLEQLSERLRGTQ
>P39847 2.3.1.-~~~ppsC~~~Plipastatin synthase subunit C~~~COG1020
MPQQPEIQDIYPLSFMQEGMLFHSLYDEQSRAYFEQASFTIHGQLDLERFQKSMDAVFDRYDIFRTAFIYKNVAKPRQVV
LKQRHCPIHIEDISHLNERDKEHCTEAFKEQDKSKGFDLQTDVLMRISILKWAPDHYVCIWSHHHILMDGWCLGIVIKDF
LHIYQALGKGQLPDLPPVQPYGTYIKWLMQQDREEAAEYWKKRLQHFEKSTPLPKRTDQIPNGTLQQITFAIPEKETAEL
QKIAAASGATLNTVFQALWGIMLQKVNRSSDAVFGSVISGRPSELKDVENMVGLFINTIPIRAQSDSLSFSDLVRRMQKD
MNEAEAYSYFPLYDIQAQSALKQELIDHIIVFENTPTQQEIEELNQAGSFDFSVKDFEMEEVTNYSCSVKVIPGRTLYVR
IHFQTSAYQPSMMSEIKDYLLHMVSDVISDPSLPVSKMTLLDEDKTRKIVSQNNRTVSVSPEAPTLHGLFERQAAVTPER
LAIRFSGGSLTYAELDMYASRLAAHLAARGVTNESIVGVLSERSPDMLIAVLAVLKAGGAYLPLDPAYPKERLSYMLKDS
GASLLLTQPGCSAPNFSGETLEVDMTSLECEEVKRHVSASVSDGSLAYVIYTSGSTGQPKGVAVEHRQAVSFLTGMQHQF
RLSEDDIVMVKTSFSFDASVWQLFWWALSGASAYLLPPGWEKDSALIVQAIHQENVTTAHFIPAMLNSFLDQAEIERLSD
RTSLKRVFAGGEPLAPRTAARFASVLPQVSLIHGYGPTEATVDAAFYVLDPERDRDRLRIPIGKPVPGARLYVLDPHLAV
QPSGVAGELYIAGAGVARGYLNRPALTEERFLEDPFYLGERMYKTGDVARWLPDGNVEFLGRTDDQVKIRGYRIEPGEIE
AALRSIEGVREAAVTVRTDSGEPELCAYVEGLQRNEVRAQLQRLLPGYMVPAYMIEMEQWPVTPSGKLDRNALPAPGGAA
DAETYTAPRNVTEMKLSQLWEDVLKNGPVGIHDNFFDRGGHSLKATALVSRIAKEFDVQVPLKDVFAHPTVEGLATVIRE
GTDSPYEAIKPAEKQETYPVSSAQKRIYVLQQLEDGGTGYNMPAVLELEGKLNLERMDRAFKELIKRHESLRTAFEQDAG
GDPVQRIHDEVPFTLQTTVLGARTEEEAAAAFIKPFDLSQAPLFRAQIVKVSDERHLLLVDMHHIISDGVSVNILIREFG
ELYNNRKLPALRIQYKDYAVWQEGFKTGDAYKTQGAYWLKQLEGELPVLDLPADHARPPMRSFAGDKVSFTLDQEVTSGL
YKLARENGSTLYMVLLAAYTAFLSRLSGQEDIIVGSPIAGRPHKDLEPILGMFVNTLALRTRPEGGKPFVQYLQEVRETA
MEAFEHQDYPFEELVDKLELTRDMSRNPLFDVMFVLQNMDQESLELDELCLKPAANNGHQTSKFDLTLYAQEQPRGLLTF
QMEFSTDLYKKKTIEKWLQYFNNMLLSIIKDNKAALGTINILNEDEAHYLIHELNRTKIDYPRNETISRLFEMQAEQTPN
AVAIVSDTQVFTYEDLNSWANQIASVLQIKGVGPDSVVALLTGRTPELIAGMLGILKAGGAYLPIDSNLPVERIAYMLSD
SRAALLLQSEKTEKRLLGIECEQIIIEDIQKQGEAKNVESSAGPHSLAYIIYTSGSTGKPKGVMIEQRSVIRLVKNSNYI
TFTPEDRLLMTSSIGFDVGSFEIFGPLLNGAALHLSDQQTFLDSHQLKRYIEHQGITTIWLTSSLFNHLTEQNEQTFSQL
KHLIIGGEALSPSHVNRIRNVCPEVSIWNGYGPTENTTFSTCLHIQKTYELSIPIGRPVGNSTAFILNQWGVLQPVGAVG
ELCVGGDGVARGYLGRPDLTKEKFVPHPFAPGDRLYRTGDLARWLSDGTIEYVGRIDDQVKVRGYRVELGEIETALRQID
GVKEAAVLARTAQTGSKELFGYISVKAGTNAEQVRSLLARSLPNYMIPAYIIEMETLPLTSNGKLNRKALPEPDVASKQT
YIPPRNELEEQLALIWQEVLGIQRIGIEDSFFELGGDSIKALQVSARLGRYGLSLQVSDLFRHPKIKDLSPFIRKSERII
EQGPIQGDVPWTPVQQWFFSQDIEERHHFNQSVMLFHSGRLSENALRPALKKLAEHHDALRMVYRNDDRRWIQINQGIHE
SQLYSLRISDLSQSESGWETKIKQEVADLQQSINLQEGPLLHAALFKTLTGDYLFLAIHHLVVDGVSWRILLEDLSAGYQ
QAAAGQTIQLPPKTDSYQEYARRIQEYAQSSKLIREEAYWRSVEEQQAAELPYEIPHHVNIDFSKRDSLSFSLTEADTAV
LLQNVNHAYGTDTQDILLTAASLAICEWTGGSKLRIAMEGHGREHILPELDISRTVGWFTSMYPALISFENHRDELGTSV
KTVKDTLGRIPNKGVGYGMLKYLTHPENKSITFSKTPEISFNYLGQFNDIERQDTFRPSSLGSGKDITHTWKREQIIEMS
AMAADKKLHFNLSYPPARFHRNTMEQLINRIEHFLLDIMKHCAGQQKAEKTLSDFSSQSLTAEDLDSISSLVEEL
>Q7TXL8 2.3.1.292~~~ppsC~~~Phenolphthiocerol/phthiocerol polyketide synthase subunit C~~~
MTAATPDRRAIITEALHKIDDLTARLEIAEKSSSEPIAVIGMGCRFPGGVNNPEQFWDLLCAGRSGIVRVPAQRWDADAY
YCDDHTVPGTICSTEGGFLTSWQPDEFDAEFFSISPREAAAMDPQQRLLIEVAWEALEDAGVPQHTIRGTQTSVFVGVTA
YDYMLTLAGRLRPVDLDAYIPTGNSANFAAGRLAYILGARGPAVVIDTACSSSLVAVHLACQSLRGRESDMALVGGTNLL
LSPGPSIACSRWGMLSPEGRCKTFDASADGYVRGEGAAVVVLKRLDDAVRDGNRILAVVRGSAVNQDGASSGVTVPNGPA
QQALLAKALTSSKLTAADIDYVEAHGTGTPLGDPIELDSLSKVFSDRAGSDQLVIGSVKTNLGHLEAAAGVAGLMKAVLA
VHNGYIPRHLNFHQLTPHASEAASRLRIAADGIDWPTTGRPRRAGVSSFGVSGTNAHVVIEQAPDPMAAAGTEPQRGPVP
AVSTLVVFGKTAPRVAATASVLADWLDGPGAAVPLADVAHTLNHHRARQTRFGTVAAVDRRQAVIGLRALAAGQSAPGVV
APREGSIGGGTVFVYSGRGSQWAGMGRQLLADEPAFAAAIAELEPEFVAQGGFSLRDVIAGGKELVGIEQIQLGLIGMQL
ALTALWRSYGVTPDAVIGHSMGEVAAAVVAGALTPAQGLRVTAVRSRLMAPLSGQGTMALLELDAEATEALIADYPEVSL
GIYASPRQTVISGPPLLIDELIDKVRQQNGFATRVNIEVAPHNPAMDALQPAMRSELADLTPQPPTIPIISTTYADLGIS
LGSGPRFDAEHWATNMRNPVRFHQAIAHAGADHHTFIEISAHPLLTHSISDTLRASYDVDNYLSIGTLQRDAHDTLEFHT
NLNTTHTTHPPQTPHPPEPHPVLPTTPWQHTQHWITATSAAYHRPDTHPLLGVGVTDPTNGTRVWESELDPDLLWLADHV
IDDLVVLPGAAYAEIALAAATDTFAVEQDQPWMISELDLRQMLHVTPGTVLVTTLTGDEQRCQVEIRTRSGSSGWTTHAT
ATVARAEPLAPLDHEGQRREVTTADLEDQLDPDDLYQRLRGAGQQHGPAFQGIVGLAVTQAGVARAQVRLPASARTGSRE
FMLHPVMMDIALQTLGATRTATDLAGGQDARQGPSSNSALVVPVRFAGVHVYGDITRGVRAVGSLAAAGDRLVGEVVLTD
ANGQPLLVVDEVEMAVLGSGSGATELTNRLFMLEWEPAPLEKTAEATGALLLIGDPAAGDPLLPALQSSLRDRITDLELA
SAADEATLRAAISRTSWDGIVVVCPPRANDESMPDEAQLELARTRTLLVASVVETVTRMGARKSPRLWIVTRGAAQFDAG
ESVTLAQTGLRGIARVLTFEHSELNTTLVDIEPDGTGSLAALAEELLAGSEADEVALRDGQRYVNRLVPAPTTTSGDLAA
EARHQVVNLDSSGASRAAVRLQIDQPGRLDALNVHEVKRGRPQGDQVEVRVVAAGLNFSDVLKAMGVYPGLDGAAPVIGG
ECVGYVTAIGDEVDGVEVGQRVIAFGPGTFGTHLGTIADLVVPIPDTLADNEAATFGVAYLTAWHSLCEVGRLSPGERVL
IHSATGGVGMAAVSIAKMIGARIYTTAGSDAKREMLSRLGVEYVGDSRSVDFADEILELTDGYGVDVVLNSLAGEAIQRG
VQILAPGGRFIELGKKDVYADASLGLAALAKSASFSVVDLDLNLKLQPARYRQLLQHILQHVADGKLEVLPVTAFSLHDA
ADAFRLMASGKHTGKIVISIPQHGSIEAIAAPPPLPLVSRDGGYLIVGGMGGLGFVVARWLAEQGAGLIVLNGRSAPSDE
VAAAIAELNASGSRIEVITGDITEPDTAERLVRAVEDAGFRLAGVVHSAMVLADEIVLNMTDSAARRVFAPKVTGSWRLH
VATAARDVDWWLTFSSAAALLGTPGQGAYAAANSWVDGLVAHRRSAGLPAVGINWGPWADVGRAQFFKDLGVEMINAEQG
LAAMQAVLTADRGRTGVFSLDARQWFQSFPAVAGSSLFAKLHDSAARKSGQRRGGGAIRAQLDALDAAERPGHLASAIAD
EIRAVLRSGDPIDHHRPLETLGLDSLMGLELRNRLEASLGITLPVALVWAYPTISDLATALCERMDYATPAAAQEISDTE
PELSDEEMDLLADLVDASELEAATRGES
>P96202 2.3.1.292~~~ppsC~~~Phenolphthiocerol/phthiocerol polyketide synthase subunit C~~~COG0604
MTAATPDRRAIITEALHKIDDLTARLEIAEKSSSEPIAVIGMGCRFPGGVNNPEQFWDLLCAGRSGIVRVPAQRWDADAY
YCDDHTVPGTICSTEGGFLTSWQPDEFDAEFFSISPREAAAMDPQQRLLIEVAWEALEDAGVPQHTIRGTQTSVFVGVTA
YDYMLTLAGRLRPVDLDAYIPTGNSANFAAGRLAYILGARGPAVVIDTACSSSLVAVHLACQSLRGRESDMALVGGTNLL
LSPGPSIACSRWGMLSPEGRCKTFDASADGYVRGEGAAVVVLKRLDDAVRDGNRILAVVRGSAVNQDGASSGVTVPNGPA
QQALLAKALTSSKLTAADIDYVEAHGTGTPLGDPIELDSLSKVFSDRAGSDQLVIGSVKTNLGHLEAAAGVAGLMKAVLA
VHNGYIPRHLNFHQLTPHASEAASRLRIAADGIDWPTTGRPRRAGVSSFGVSGTNAHVVIEQAPDPMAAAGTEPQRGPVP
AVSTLVVFGKTAPRVAATASVLADWLDGPGAAVPLADVAHTLNHHRARQTRFGTVAAVDRRQAVIGLRALAAGQSAPGVV
APREGSIGGGTVFVYSGRGSQWAGMGRQLLADEPAFAAAIAELEPEFVAQGGFSLRDVIAGGKELVGIEQIQLGLIGMQL
ALTALWRSYGVTPDAVIGHSMGEVAAAVVAGALTPAQGLRVTAVRSRLMAPLSGQGTMALLELDAEATEALIADYPEVSL
GIYASPRQTVISGPPLLIDELIDKVRQQNGFATRVNIEVAPHNPAMDALQPAMRSELADLTPQPPTIPIISTTYADLGIS
LGSGPRFDAEHWATNMRNPVRFHQAIAHAGADHHTFIEISAHPLLTHSISDTLRASYDVDNYLSIGTLQRDAHDTLEFHT
NLNTTHTTHPPQTPHPPEPHPVLPTTPWQHTQHWITATSAAYHRPDTHPLLGVGVTDPTNGTRVWESELDPDLLWLADHV
IDDLVVLPGAAYAEIALAAATDTFAVEQDQPWMISELDLRQMLHVTPGTVLVTTLTGDEQRCQVEIRTRSGSSGWTTHAT
ATVARAEPLAPLDHEGQRREVTTADLEDQLDPDDLYQRLRGAGQQHGPAFQGIVGLAVTQAGVARAQVRLPASARTGSRE
FMLHPVMMDIALQTLGATRTATDLAGGQDARQGPSSNSALVVPVRFAGVHVYGDITRGVRAVGSLAAAGDRLVGEVVLTD
ANGQPLLVVDEVEMAVLGSGSGATELTNRLFMLEWEPAPLEKTAEATGALLLIGDPAAGDPLLPALQSSLRDRITDLELA
SAADEATLRAAISRTSWDGIVVVCPPRANDESMPDEAQLELARTRTLLVASVVETVTRMGARKSPRLWIVTRGAAQFDAG
ESVTLAQTGLRGIARVLTFEHSELNTTLVDIEPDGTGSLAALAEELLAGSEADEVALRDGQRYVNRLVPAPTTTSGDLAA
EARHQVVNLDSSGASRAAVRLQIDQPGRLDALNVHEVKRGRPQGDQVEVRVVAAGLNFSDVLKAMGVYPGLDGAAPVIGG
ECVGYVTAIGDEVDGVEVGQRVIAFGPGTFGTHLGTIADLVVPIPDTLADNEAATFGVAYLTAWHSLCEVGRLSPGERVL
IHSATGGVGMAAVSIAKMIGARIYTTAGSDAKREMLSRLGVEYVGDSRSVDFADEILELTDGYGVDVVLNSLAGEAIQRG
VQILAPGGRFIELGKKDVYADASLGLAALAKSASFSVVDLDLNLKLQPARYRQLLQHILQHVADGKLEVLPVTAFSLHDA
ADAFRLMASGKHTGKIVISIPQHGSIEAIAAPPPLPLVSRDGGYLIVGGMGGLGFVVARWLAEQGAGLIVLNGRSAPSDE
VAAAIAELNASGSRIEVITGDITEPDTAERLVRAVEDAGFRLAGVVHSAMVLADEIVLNMTDSAARRVFAPKVTGSWRLH
VATAARDVDWWLTFSSAAALLGTPGQGAYAAANSWVDGLVAHRRSAGLPAVGINWGPWADVGRAQFFKDLGVEMINAEQG
LAAMQAVLTADRGRTGVFSLDARQWFQSFPAVAGSSLFAKLHDSAARKSGQRRGGGAIRAQLDALDAAERPGHLASAIAD
EIRAVLRSGDPIDHHRPLETLGLDSLMGLELRNRLEASLGITLPVALVWAYPTISDLATALCERMDYATPAAAQEISDTE
PELSDEEMDLLADLVDASELEAATRGES
>P94459 2.3.1.-~~~ppsD~~~Plipastatin synthase subunit D~~~COG1020
MTKANSIQDIYPLSYMQEGMLFHSLLQKDSQAYVEQASFTIEGKVNPQFFQNSINALVERHDIFRTIFISQNVSSPQQVV
LRERNVIVLEEDITHLNEAEQSQFIEQWKEKDRDRGFHLQKDVLMRIALIQTGESQYSCIWTFHHIMMDGWCLSIVLKEF
LHIYASYVNASPITLEPVQPYGKYIKWLMEQDKEQAVSYWDHYLSGHEQQTVLPKQKKTKGKSRQEHVTFSFSKEESSRL
SELAAREEVTLSTIFHTIWGILLQKYNNNDDAVFGSVISGRPAEIEGIEHMVGLFINTMPVRVQGAKTPFLQLIKDMQKD
RLAAEAYSYHPLYEIQSRSAVKQGLIDHILVFENYPVQQEIQMLNKQEHASDLFQIHNFTVADETNYSFYLMVAPGEEIH
IKMNYDAEQHDRSFVLSVKEHLLNAVSQILNNPNLPPEEIDITTDTEKRQLIGEITDQTPVYETIHAMFEKQAEKTPDAH
AVIDQACSLTYRELNKAANRLARHLRMKGVVRQEPVAIMMERSAAFITGVLGILKAGGAIVPVDPHYPADRIRYILHDCG
CSHVVSQAHLPSSLEDNYIITHPEDIESKVDGSNIKSVNNADDLLYMIYTSGTTGKPKGVQFEHRNMANLLKFEYTHSGI
DFEADVLQFATPSFDVCYQEIFSALLKGGTLHIVPEAIKRDVPQLFAFINKHQTNIVFLPTAFIKMIFSERELANSFPDG
VKHLIAAGEQLMISDLFQDVLRKRGIHLHNHYGPSETHVVSTYTIHPGDPIPELPPIGKPIGCTDLYILNHQKQLQPCGV
PGELYISGASVARGYVNHDKLTSDKFSSDPFKPDVIMYRTGDLARRLEDGNIEYIGRADNQVKIRGYRIEPQEIEVTLMN
HPDISEAAILIWQDQNGEHELCAYYCSVQKLNTIDLRSYMASELPEYMIPAKWIWVDSIPLTPNGKVDRAALPEPDASIS
GNPYTAPRNLLEAKLSQLFEDVLKNGHIGIQDNFFDNGGHSLKATVLMSRIAKEFHVQVSLKDIFAHPTVEGLALIIREA
EQNLYAAIEPAEKRDTYPVSSAQKRIYVLQQLDEGVAYNMPAVLELEGALDVAKLSAVCKELISRHEPLRTSFVSGADDE
PVQRIHTEVPFTLSKETTIEGFVRPFDLSQAPLFRAGLIEVSNEKHVLLVDMHHIISDGVSVQLLIREFTDLYANRQLKP
LRIQYKDYAVWQQKFKKGDSYQKQETYWQQQFSGDLPILELPTDKRRPAERQFIGGKVTFQLDKEITARIKRLAHKNRST
LYMTLLALYSAFLSRLSGQDDIVIGSPIAGRPHADLEAVLGMFVNTLALRTRPAGNKTFEEFLKEVRQTALEAYEHQDYP
FEELVDKLGVQREMSRNPLFDTTLVLQNMEQQKLKMNDVQLQWNDLEHPISKFDISLYVTEHDSELFCQFEYSTALFEKE
TIQRWASLFTTLVEHTAASPETELDNIPILTKEEERDFIESCHLFEETGYSMNQTLHYALEQQAEKTPDQAAVIFEDGVM
TYKELNEQANRIAWELIGRGVKPETTVAIIGKRSPEMLLGIYGILKAGGAYLPIDPDYPEERISFLLEDSGTNILLLQSA
GLHVPEFTGEIVYLNQTNSGLAHRLSNPNVDVLPQSLAYVIYTSGSTGMPKGVEIEHRSAVNFLNSLQSRYQLKHSDMIM
HKTSYSFDASIWELFWWPYAGASVYLLPQGGEKEPEVIAKAIEEQKITAMHFVPSMLHAFLEHIKYRSVPIKTNRLKRVF
SGGEQLGTHLVSRFYELLPNVSITNSYGPTEATVEAAFFDCPPHEKLERIPIGKPVHHVRLYLLNQNQRMLPVGCIGELY
IAGAGVARGYLNRPALTEERFLEDPFYPGERMYKTGDVARWLPDGNVEFLGRTDDQVKIRGYRIEPGEIEAALRSIEGVR
EAAVTVRTDSGEPELCAYVEGLQRNEVRAQLERLLPGYMVPAYMIEMEQWPVTPSGKLDRNALPAPGGAADAETYTAPRN
VTEMKLSQLWEDVLKNGPVGIHDNFFDRGGHSLKATALVSRITKEFDVQVPLKDVFAHPTVEGLATVIREGTDSPYEAIK
PAEKQETYPVSSAQKRIYVLQQLEDGGTGYNMPAVLELEGKLNPERMDRAFQELIKRHESLRTSFEQDEGGDPVQRIHDE
VPFTLQTTVLGARTEQEAAAAFIKPFDLSQAPLFRAQIVKVSDERHLLLVDMHHIISDGVSVNILIQEFGELYNNRKLPA
LRIQYKDYAVWQEGFKTGDAYKMQEAYWLKQLEGELPVLDLPADHARPPVRSFAGDKVSFTLEPEVASGLHKLARENGST
LYMVLLAAYTAFLSRLSGQEDIIVGSPIAGRPHKDLEPILGMFVNTLALRTRPEGGKPFVQYLQEVRETALEAFEHQNYP
FEELVDKLELTRDMSRNPVFDAMLVVQNNDYEPLHLHDLQMKPAQVSHLVSKFDLTLQASEGDGNIHFLFEYSTALFEKT
TIERWASHLTNVLSIIGKNPKVTLNHIDILTQEERHQLLNEFNTGQANQYGVQTISQLFEQQAARTPKASALVSGDKTLT
YQELDEWSNGIARALRSRGVKPDTPVGIMMHRSFSMIASILGVWKAGGCYVPIDPEYPKERKRYILSDSGTKLLMTINEA
DLGVLADFEGEILTIESVEEDDKSPLPQMSSAHHLAYIIYTSGTTGRPKGVMVEHKGIANTLQWRRNAYAFNETDTILQL
FSFSFDGFITSMFTPLLSGAKAVLLHEEEAKDILAIKHQLSRQRITHMIIVPVLYRALLDVVQPEDVKTLRVVTLAGEAA
DRELIARSLAICPHTELANEYGPTENSVATTVMRHMEKQAYVSIGQPIDGTQVLILNSNHQLQPIGVAGELCIAGTGLAR
GYVNLPELTERAFTQNPFKPEARMYRTGDAARWMADGTLEYLGRIDDQVKIRGYRVETKEIESVIRCIKGVKDAAVVAHV
TASGQTELSAYVVTKPGLSTNAVRSELQNKLPVFMHPAFIEKLDSLPLSPNGKLDRGALPKPVYNHEGERPFLPPSSKME
QILADIWKEVLGAEKIGTADSFFELGGDSIKALQVSARLHRIGKQMAVKDLFSHPTIQELAAYIRDSDTSSSQAAVEGDV
QWSPVQKWFLSQDIKEKHHFNQSVMLHRSTSVQEDALRKTLKAITCHHDALRMVFTQNEQGKWDQYNRPLSHSDDALYGL
QMIDLSAPDGTDGNRPYEPLIKRHVLDIQQKMDLKNGPLLQAGLFHTIDGDFLFLSAHHLVVDGISWRVLLEDLALGYRQ
AAGGEDIKLPPKTSSFKAYAKKLSDYAESQQLMKQLKYWREAEEYQTEALPFDQIDGTRAHEGQRSTISFTLNDKETAAL
LKDANSAYNTDTQDMLLASVILALRHWTNQSAFKLSLEGHGREDVLKGIDVSRTIGWFTAIYPLLIKLNADLPDSEESMV
HVLKTTKDTLRRVPDKGFGYGVIKYLTPPGKKDINFTGAPEISFNYLGQFESGRTAEVPEEDAFSFSPLGAGGDISTTWN
REQSLDISAIAAEGKLTVNMTYDNARFQRKTIEQLSETCRQFLLQLIEHCQNKSETEKTISDFDDQELTEDALQEIADML
SFH
>Q7TXL7 2.3.1.292~~~ppsD~~~Phenolphthiocerol/phthiocerol polyketide synthase subunit C~~~
MTSLAERAAQLSPNARAALARELVRAGTTFPTDICEPVAVVGIGCRFPGNVTGPESFWQLLADGVDTIEQVPPDRWDADA
FYDPDPSASGRMTTKWGGFVSDVDAFDADFFGITPREAVAMDPQHRILLEVAWEALEHAGIPPDSLSGTRTGVMMGLSSW
DYTIVNIERRADIDAYLSTGTPHCAAVGRIAYLLGLRGPAVAVDTACSSSLVAIHLACQSLRLRETDVALAGGVQLTLSP
FTAIALSKWSALSPTGRCNSFDANADGFVRGEGCGVVVLKRLADAVRDQDRVLAVVRGSATNSDGRSNGMTAPNALAQRD
VITSALKLADVTPDSVNYVETHGTGTVLGDPIEFESLAATYGLGKGQGESPCALGSVKTNIGHLEAAAGVAGFIKAVLAV
QRGHIPRNLHFTRWNPAIDASATRLFVPTESAPWPAAAGPRRAAVSSFGLSGTNAHVVVEQAPDTAVAAAGGMPYVSALN
VSGKTAARVASAAAVLADWMSGPGAAAPLADVAHTLNRHRARHAKFATVIARDRAEAIAGLRALAAGQPRVGVVDCDQHA
GGPGRVFVYSGQGSQWASMGQQLLANEPAFAKAVAELDPIFVDQVGFSLQQTLIDGDEVVGIDRIQPVLVGMQLALTELW
RSYGVIPDAVIGHSMGEVSAAVVAGALTPEQGLRVITTRSRLMARLSGQGAMALLELDADAAEALIAGYPQVTLAVHASP
RQTVIAGPPEQVDTVIAAVATQNRLARRVEVDVASHHPIIDPILPELRSALADLTPQPPSIPIISTTYESAQPVADADYW
SANLRNPVRFHQAVTAAGVDHNTFIEISPHPVLTHALTDTLDPDGSHTVMSTMNRELDQTLYFHAQLAAVGVAASEHTTG
RLVDLPPTPWHHQRFWVTDRSAMSELAATHPLLGAHIEMPRNGDHVWQTDVGTEVCPWLADHKVFGQPIMPAAGFAEIAL
AAASEALGTAADAVAPNIVINQFEVEQMLPLDGHTPLTTQLIRGGDSQIRVEIYSRTRGGEFCRHATAKVEQSPRECAHA
HPEAQGPATGTTVSPADFYALLRQTGQHHGPAFAALSRIVRLADGSAETEISIPDEAPRHPGYRLHPVVLDAALQSVGAA
IPDGEIAGSAEASYLPVSFETIRVYRDIGRHVRCRAHLTNLDGGTGKMGRIVLINDAGHIAAEVDGIYLRRVERRAVPLP
LEQKIFDAEWTESPIAAVPAPEPAAETTRGSWLVLADATVDAPGKAQAKSMADDFVQQWRSPMRRVHTADIHDESAVLAA
FAETAGDPEHPPVGVVVFVGGASSRLDDELAAARDTVWSITTVVRAVVGTWHGRSPRLWLVTGGGLSVADDEPGTPAAAS
LKGLVRVLAFEHPDMRTTLVDLDITQDPLTALSAELRNAGSGSRHDDVIAWRGERRFVERLSRATIDVSKGHPVVRQGAS
YVVTGGLGGLGLVVARWLVDRGAGRVVLGGRSDPTDEQCNVLAELQTRAEIVVVRGDVASPGVAEKLIETARQSGGQLRG
VVHAAAVIEDSLVFSMSRDNLERVWAPKATGALRMHEATADCELDWWLGFSSAASLLGSPGQAAYACASAWLDALVGWRR
ASGLPAAVINWGPWSEVGVAQALVGSVLDTISVAEGIEALDSLLAADRIRTGVARLRADRALVAFPEIRSISYFTQVVEE
LDSAGDLGDWGGPDALADLDPGEARRAVTERMCARIAAVMGYTDQSTVEPAVPLDKPLTELGLDSLMAVRIRNGARADFG
VEPPVALILQGASLHDLTADLMRQLGLNDPDPALNNADTIRDRARQRAAARHGAAMRRRPKPAVQGG
>P9WQE3 2.3.1.292~~~ppsD~~~Phenolphthiocerol/phthiocerol polyketide synthase subunit D~~~COG1028
MTSLAERAAQLSPNARAALARELVRAGTTFPTDICEPVAVVGIGCRFPGNVTGPESFWQLLADGVDTIEQVPPDRWDADA
FYDPDPSASGRMTTKWGGFVSDVDAFDADFFGITPREAVAMDPQHRMLLEVAWEALEHAGIPPDSLSGTRTGVMMGLSSW
DYTIVNIERRADIDAYLSTGTPHCAAVGRIAYLLGLRGPAVAVDTACSSSLVAIHLACQSLRLRETDVALAGGVQLTLSP
FTAIALSKWSALSPTGRCNSFDANADGFVRGEGCGVVVLKRLADAVRDQDRVLAVVRGSATNSDGRSNGMTAPNALAQRD
VITSALKLADVTPDSVNYVETHGTGTVLGDPIEFESLAATYGLGKGQGESPCALGSVKTNIGHLEAAAGVAGFIKAVLAV
QRGHIPRNLHFTRWNPAIDASATRLFVPTESAPWPAAAGPRRAAVSSFGLSGTNAHVVVEQAPDTAVAAAGGMPYVSALN
VSGKTAARVASAAAVLADWMSGPGAAAPLADVAHTLNRHRARHAKFATVIARDRAEAIAGLRALAAGQPRVGVVDCDQHA
GGPGRVFVYSGQGSQWASMGQQLLANEPAFAKAVAELDPIFVDQVGFSLQQTLIDGDEVVGIDRIQPVLVGMQLALTELW
RSYGVIPDAVIGHSMGEVSAAVVAGALTPEQGLRVITTRSRLMARLSGQGAMALLELDADAAEALIAGYPQVTLAVHASP
RQTVIAGPPEQVDTVIAAVATQNRLARRVEVDVASHHPIIDPILPELRSALADLTPQPPSIPIISTTYESAQPVADADYW
SANLRNPVRFHQAVTAAGVDHNTFIEISPHPVLTHALTDTLDPDGSHTVMSTMNRELDQTLYFHAQLAAVGVAASEHTTG
RLVDLPPTPWHHQRFWVTDRSAMSELAATHPLLGAHIEMPRNGDHVWQTDVGTEVCPWLADHKVFGQPIMPAAGFAEIAL
AAASEALGTAADAVAPNIVINQFEVEQMLPLDGHTPLTTQLIRGGDSQIRVEIYSRTRGGEFCRHATAKVEQSPRECAHA
HPEAQGPATGTTVSPADFYALLRQTGQHHGPAFAALSRIVRLADGSAETEISIPDEAPRHPGYRLHPVVLDAALQSVGAA
IPDGEIAGSAEASYLPVSFETIRVYRDIGRHVRCRAHLTNLDGGTGKMGRIVLINDAGHIAAEVDGIYLRRVERRAVPLP
LEQKIFDAEWTESPIAAVPAPEPAAETTRGSWLVLADATVDAPGKAQAKSMADDFVQQWRSPMRRVHTADIHDESAVLAA
FAETAGDPEHPPVGVVVFVGGASSRLDDELAAARDTVWSITTVVRAVVGTWHGRSPRLWLVTGGGLSVADDEPGTPAAAS
LKGLVRVLAFEHPDMRTTLVDLDITQDPLTALSAELRNAGSGSRHDDVIAWRGERRFVERLSRATIDVSKGHPVVRQGAS
YVVTGGLGGLGLVVARWLVDRGAGRVVLGGRSDPTDEQCNVLAELQTRAEIVVVRGDVASPGVAEKLIETARQSGGQLRG
VVHAAAVIEDSLVFSMSRDNLERVWAPKATGALRMHEATADCELDWWLGFSSAASLLGSPGQAAYACASAWLDALVGWRR
ASGLPAAVINWGPWSEVGVAQALVGSVLDTISVAEGIEALDSLLAADRIRTGVARLRADRALVAFPEIRSISYFTQVVEE
LDSAGDLGDWGGPDALADLDPGEARRAVTERMCARIAAVMGYTDQSTVEPAVPLDKPLTELGLDSLMAVRIRNGARADFG
VEPPVALILQGASLHDLTADLMRQLGLNDPDPALNNADTIRDRARQRAAARHGAAMRRRPKPEVQGG
>O31827 2.3.1.-~~~ppsE~~~Plipastatin synthase subunit E~~~COG1020
MKKGADTMNTIKKIKNIYPLSHMQEGMLFHSFLRKEEGAYVEQSLFTIKGSLSYDWFQRSIQAIIDRHDIFRTVFLPHVP
HLSGPRQVVMTEREFHLNSEDISHLPTNDQNEYIERFKEKDKQKGFDLQKDMLMRISLFKTAKDEHVCIWSHHHILMDGW
CLGIVMQEFMQIYQSIHAGKPLSLDPVRPYSTYISWLTNRDKEKAAAYWDTYLKNYSAPSPLPRVSDKETKESYHREDLI
FSLNKPLTDKLKETAKQHGVTLATLIQAVWGVMLQQYNRTDDVVFGAVVSGRPSEIPGVEQMIGLFINTIPIRIKTHQDE
TFHELLIRCQKEMLEAEPFTCQPLFDIQANTALKQELIDHIIVFENYPLQQKIADSADQTDSPLQIDQVQVSEQSGYNFN
LVVAPGEELVIKFSYNAFVYDAAWISCIKRQFTQALSTAAQHPHMPIADFSFLDATEKEQIVTQFNNTKTEYPKNHTIID
LFREQAEKTPDHTALVYGNMSISYKELDKRSNALARELIQKGFRKNETAGILAAHSPEFMISVLAVLKAGGAYLPLDAEL
PPERVSFMLEETQAKMLIVQKGLEQNAAFSGTCIISDAQGLMEENDIPINISSSPDDLAYIMYTSGSTGRPKGVMITNRN
VVSLVRNSNYTSASGDDRFIMTGSISFDAVTFEMFGALLNGASLHIIDKSTMLTPDRFGAYLLENDITVLFLTTALFNQL
AQVRADMFRGLHTLYVGGEALSPALMNAVRHACPDLALHNIYGPTENTTFSTFFEMKRDYAGPIPIGKPISNSTAYILDT
KGRLLPIGVPGELCVGGDGVAKGYLNRVDLTNAVFSPHPFLPGERIYRTGDLARWLPDGNLEYISRIDRQMKIRGKRIEP
AEIEARLLEMEGVQEAAVTLREKDGEAQLYTHYVGDHKKTDTDFRADLARVLPDYMIPQHWVRVERMPLTGNGKIDRSAL
PIPENKPAKRQNIILPRNLVEEELANIWKQVLGVNTISIDDDFFAIGGHSLRALQVIHTLKHQQNIDIPIDFLFEHPTIA
QLAEKLYSKQLTAANEQHVIKLNQHGAQNLFCFPPISGFGIYFKDLALLLNEKAAVYGFHFIEQDTRIEQYVNCMTDIQP
EGPYVLLGYSAGGNLAFEVAQAMERKGLEVSDFIIVDAYLKEQPLPIDTGNDESAAYLPEAVREKVMKKKRNYQEYWAQL
LNEGHIKASIHFIEAGIHPETSGHTGLTKWEGACGNYSEYTGFGAHKDMLEGTYAEKNADIILDILEKITSNQVILHKR
>Q7TXL6 2.3.1.292~~~ppsE~~~Phenolphthiocerol/phthiocerol polyketide synthase subunit E~~~
MSIPENAIAVVGMAGRFPGAKDVSAFWSNLRRGKESIVTLSEQELRDAGVSDKTLADPAYVRRAPLLDGIDEFDAGFFGF
PPLAAQVLDPQHRLFLQCAWHALEDAGADPARFDGSIGVYGTSSPSGYLLHNLLSHRDPNAVLAEGLNFDQFSLFLQNDK
DFLATRISHAFNLRGPSIAVQTACSSSLVAVHLACLSLLSGECDMALAGGSSLCIPHRVGYFTSPGSMVSAVGHCRPFDV
RADGTVFGSGVGLVVLKPLAAAIDAGDRIHAVIRGSAINNDGSAKMGYAAPNPAAQADVIAEAHAVSGIDSSTVSYVECH
GTGTPLGDPIEIQGLRAAFEVSQTSRSAPCVLGSVKSNIGHLEVAAGIAGLIKTILCLKNKALPATLHYTSPNPELRLDQ
SPFVVQSKYGPWECDGVRRAGVSSFGVGGTNAHVVLEEAPAEASEVSAHAEPAGPQVILLSAQTAAALGESRTALAAALE
TQDGPRLSDVAYTLARRRKHNVTMAAVVHDREHAATVLRAAEHDNVFVGEAAHDGEHGDRADAAPTSDRVVFLFPGQGAQ
HVGMAKGLYDTEPVFAQHFDTCAAGFRDETGIDLHAEVFDGTATDLERIDRSQPALFTVEYALAKLVDTFGVRAGAYIGY
STGEYIAATLAGVFDLQTAIKTVSLRARLMHESPPGAMVAVALGPDDVTQYLPPEVELSAVNDPGNCVVAGPKDQIRALR
QRLTEAGIPVRRVRATHAFHTSAMDPMLGQFQEFLSRQQLRPPRTPLLSNLTGSWMSDQQVVDPASWTRQISSPIRFADE
LDVVLAAPSRILVEVGPGGSLTGSAMRHPKWSTTHRTVRLMRHPLQDVDDRDTFLRALGELWSAGVEVDWTPRRPAVPHL
VSLPGYPFARQRHWVEPNHTVWAQAPGANNGSPAGTADGSTAATVDAARNGESQTEVTLQRIWSQCLGVSSVDRNANFFD
LGGDSLMAISIAMAAANEGLTITPQDLYEYPTLASLTAAVDASFASSGLAKPPEAQANPAVPPNVTYFLDRGLRDTGRCR
VPLILRLDPKIGLPDIRAVLTAVVNHHDALRLHLVGNDGIWEQHIAAPAEFTGLSNRSVPDGVAAGSPEERAAVLGILAE
LLEDQTDPNAPLAAVHIAAAHGGPHYLCLAIHAMVTDDSSRQILATDIVTAFGQRLAGEEITLEPVSTGWREWSLRCAAL
ATHPAALDTRSYWIENSTKATLWLADALPNAHTAHPPRADELTKLSSTLSVEQTSELDDGRRRFRRSIQTILLAALGRTI
AQTVGEGVVAVELEGEGRSVLRPDVDLRRTVGWFTTYYPVPLACATGLGALAQLDAVHNTLKSVPHYGIGYGLLRYVYAP
TGRVLGAQRTPDIHFRYAGVIPELPSGDAPVQFDSDMTLPVREPIPGMGHAIELRVYRFGGSLHLDWWYDTRRIPAATAE
ALERTFPLALSALIQEAIAAEHTEHDDSEIVGEPEAGALVDLSSMDAG
>P9WQE1 2.3.1.292~~~ppsE~~~Phenolphthiocerol/phthiocerol polyketide synthase subunit E~~~COG1020
MSIPENAIAVVGMAGRFPGAKDVSAFWSNLRRGKESIVTLSEQELRDAGVSDKTLADPAYVRRAPLLDGIDEFDAGFFGF
PPLAAQVLDPQHRLFLQCAWHALEDAGADPARFDGSIGVYGTSSPSGYLLHNLLSHRDPNAVLAEGLNFDQFSLFLQNDK
DFLATRISHAFNLRGPSIAVQTACSSSLVAVHLACLSLLSGECDMALAGGSSLCIPHRVGYFTSPGSMVSAVGHCRPFDV
RADGTVFGSGVGLVVLKPLAAAIDAGDRIHAVIRGSAINNDGSAKMGYAAPNPAAQADVIAEAHAVSGIDSSTVSYVECH
GTGTPLGDPIEIQGLRAAFEVSQTSRSAPCVLGSVKSNIGHLEVAAGIAGLIKTILCLKNKALPATLHYTSPNPELRLDQ
SPFVVQSKYGPWECDGVRRAGVSSFGVGGTNAHVVLEEAPAEASEVSAHAEPAGPQVILLSAQTAAALGESRTALAAALE
TQDGPRLSDVAYTLARRRKHNVTMAAVVHDREHAATVLRAAEHDNVFVGEAAHDGEHGDRADAAPTSDRVVFLFPGQGAQ
HVGMAKGLYDTEPVFAQHFDTCAAGFRDETGIDLHAEVFDGTATDLERIDRSQPALFTVEYALAKLVDTFGVRAGAYIGY
STGEYIAATLAGVFDLQTAIKTVSLRARLMHESPPGAMVAVALGPDDVTQYLPPEVELSAVNDPGNCVVAGPKDQIRALR
QRLTEAGIPVRRVRATHAFHTSAMDPMLGQFQEFLSRQQLRPPRTPLLSNLTGSWMSDQQVVDPASWTRQISSPIRFADE
LDVVLAAPSRILVEVGPGGSLTGSAMRHPKWSTTHRTVRLMRHPLQDVDDRDTFLRALGELWSAGVEVDWTPRRPAVPHL
VSLPGYPFARQRHWVEPNHTVWAQAPGANNGSPAGTADGSTAATVDAARNGESQTEVTLQRIWSQCLGVSSVDRNANFFD
LGGDSLMAISIAMAAANEGLTITPQDLYEYPTLASLTAAVDASFASSGLAKPPEAQANPAVPPNVTYFLDRGLRDTGRCR
VPLILRLDPKIGLPDIRAVLTAVVNHHDALRLHLVGNDGIWEQHIAAPAEFTGLSNRSVPNGVAAGSPEERAAVLGILAE
LLEDQTDPNAPLAAVHIAAAHGGPHYLCLAIHAMVTDDSSRQILATDIVTAFGQRLAGEEITLEPVSTGWREWSLRCAAL
ATHPAALDTRSYWIENSTKATLWLADALPNAHTAHPPRADELTKLSSTLSVEQTSELDDGRRRFRRSIQTILLAALGRTI
AQTVGEGVVAVELEGEGRSVLRPDVDLRRTVGWFTTYYPVPLACATGLGALAQLDAVHNTLKSVPHYGIGYGLLRYVYAP
TGRVLGAQRTPDIHFRYAGVIPELPSGDAPVQFDSDMTLPVREPIPGMGHAIELRVYRFGGSLHLDWWYDTRRIPAATAE
ALERTFPLALSALIQEAIAAEHTEHDDSEIVGEPEAGALVDLSSMDAG
>A1YCA5 2.7.8.7~~~npt~~~4'-phosphopantetheinyl transferase Npt~~~
MIETILPAGVESAELLEYPEDLKAHPAEEHLIAKSVEKRRRDFIGARHCARLALAELGEPPVAIGKGERGAPIWPRGVVG
SLTHCDGYRAAAVAHKMRFRSIGIDAEPHATLPEGVLDSVSLPPEREWLKTTDSALHLDRLLFCAKEATYKAWWPLTARW
LGFEEAHITFEIEDGSADSGNGTFHSELLVPGQTNDGGTPLLSFDGRWLIADGFILTAIAYA
>P31992 5.3.2.-~~~pptA~~~Tautomerase PptA~~~COG1942
MPHIDIKCFPRELDEQQKAALAADITDVIIRHLNSKDSSISIALQQIQPESWQAIWDAEIAPQMEALIKKPGYSMNA
>Q9F0Q6 2.7.8.7~~~svp~~~4'-phosphopantetheinyl transferase Svp~~~
MIAALLPSWAVTEHAFTDAPDDPVSLLFPEEAAHVARAVPKRLHEFATVRVCARAALGRLGLPPGPLLPGRRGAPSWPDG
VVGSMTHCQGFRGAAVARAADAASLGIDAEPNGPLPDGVLAMVSLPSEREWLAGLAARRPDVHWDRLLFSAKESVFKAWY
PLTGLELDFDEAELAVDPDAGTFTARLLVPGPVVGGRRLDGFEGRWAAGEGLVVTAIAVAAPAGTAEESAEGAGKEATAD
DRTAVP
>I6YEE1 3.1.4.14~~~pptH~~~[Acyl-carrier-protein] phosphodiesterase PptH~~~COG1409
MTWKGSGQETVGAEPTLWAISDLHTGHLGNKPVAESLYPSSPDDWLIVAGDVAERTDEIRWSLDLLRRRFAKVIWVPGNH
ELWTTNRDPMQIFGRARYDYLVNMCDEMGVVTPEHPFPVWTERGGPATIVPMFLLYDYSFLPEGANSKAEGVAIAKERNV
VATDEFLLSPEPYPTRDAWCHERVAATRARLEQLDWMQPTVLVNHFPLLRQPCDALFYPEFSLWCGTTKTADWHTRYNAV
CSVYGHLHIPRTTWYDGVRFEEVSVGYPREWRRRKPYSWLRQVLPDPQYAPGYLNDFGGHFVITPEMRTQAAQFRERLRQ
RQSR
>O33336 2.7.8.7~~~pptT~~~4'-phosphopantetheinyl transferase PptT~~~COG2977
MTVGTLVASVLPATVFEDLAYAELYSDPPGLTPLPEEAPLIARSVAKRRNEFITVRHCARIALDQLGVPPAPILKGDKGE
PCWPDGMVGSLTHCAGYRGAVVGRRDAVRSVGIDAEPHDVLPNGVLDAISLPAERADMPRTMPAALHWDRILFCAKEATY
KAWFPLTKRWLGFEDAHITFETDSTGWTGRFVSRILIDGSTLSGPPLTTLRGRWSVERGLVLTAIVL
>P9WHV4 3.6.1.11~~~ppx~~~Exopolyphosphatase 1~~~
MRLGVLDVGSNTVHLLVVDAHRGGHPTPMSSTKATLRLAEATDSSGKITKRGADKLISTIDEFAKIAISSGCAELMAFAT
SAVRDAENSEDVLSRVRKETGVELQALRGEDESRLTFLAVRRWYGWSAGRILNLDIGGGSLEVSSGVDEEPEIALSLPLG
AGRLTREWLPDDPPGRRRVAMLRDWLDAELAEPSVTVLEAGSPDLAVATSKTFRSLARLTGAAPSMAGPRVKRTLTANGL
RQLIAFISRMTAVDRAELEGVSADRAPQIVAGALVAEASMRALSIEAVEICPWALREGLILRKLDSEADGTALIESSSVH
TSVRAVGGQPADRNAANRSRGSKP
>P9WHV5 3.6.1.11~~~ppx1~~~Exopolyphosphatase 1~~~COG0248
MRLGVLDVGSNTVHLLVVDAHRGGHPTPMSSTKATLRLAEATDSSGKITKRGADKLISTIDEFAKIAISSGCAELMAFAT
SAVRDAENSEDVLSRVRKETGVELQALRGEDESRLTFLAVRRWYGWSAGRILNLDIGGGSLEVSSGVDEEPEIALSLPLG
AGRLTREWLPDDPPGRRRVAMLRDWLDAELAEPSVTVLEAGSPDLAVATSKTFRSLARLTGAAPSMAGPRVKRTLTANGL
RQLIAFISRMTAVDRAELEGVSADRAPQIVAGALVAEASMRALSIEAVEICPWALREGLILRKLDSEADGTALIESSSVH
TSVRAVGGQPADRNAANRSRGSKP
>L7N5A6 3.6.1.11~~~ppx2~~~Exopolyphosphatase 2~~~
MALTRVAAIDCGTNSIRLLIADVGAGLARGELHDVHRETRIVRLGQGVDATGRFAPEAIARTRTALTDYAELLTFHHAER
VRMVATSAARDVVNRDVFFAMTADVLGAALPGSAAEVITGAEEAELSFRGAVGELGSAGAPFVVVDLGGGSTEIVLGEHE
VVASYSADIGCVRLTERCLHSDPPTLQEVSTARRLVRERLEPALRTVPLELARTWVGLAGTMTTLSALAQSMTAYDAAAI
HLSRVPGADLLEVCQRLIGMTRKQRAALAPMHPGRADVIGGGAIVVEELARELRERAGIDQLTVSEHDILDGIALSLAG
>P96374 3.6.1.11~~~ppx2~~~Exopolyphosphatase 2~~~COG0248
MALTRVAAIDCGTNSIRLLIADVGAGLARGELHDVHRETRIVRLGQGVDATGRFAPEAIARTRTALTDYAELLTFHHAER
VRMVATSAARDVVNRDVFFAMTADVLGAALPGSAAEVITGAEEAELSFRGAVGELGSAGAPFVVVDLGGGSTEIVLGEHE
VVASYSADIGCVRLTERCLHSDPPTLQEVSTARRLVRERLEPALRTVPLELARTWVGLAGTMTTLSALAQSMTAYDAAAI
HLSRVPGADLLEVCQRLIGMTRKQRAALAPMHPGRADVIGGGAIVVEELARELRERAGIDQLTVSEHDILDGIALSLAG
>P0AFL8 3.6.1.11~~~ppx~~~Exopolyphosphatase~~~COG0248
MPIHDKSPRPQEFAAVDLGSNSFHMVIARVVDGAMQIIGRLKQRVHLADGLGPDNMLSEEAMTRGLNCLSLFAERLQGFS
PASVCIVGTHTLRQALNATDFLKRAEKVIPYPIEIISGNEEARLIFMGVEHTQPEKGRKLVIDIGGGSTELVIGENFEPI
LVESRRMGCVSFAQLYFPGGVINKENFQRARMAAAQKLETLTWQFRIQGWNVAMGASGTIKAAHEVLMEMGEKDGIITPE
RLEKLVKEVLRHRNFASLSLPGLSEERKTVFVPGLAILCGVFDALAIRELRLSDGALREGVLYEMEGRFRHQDVRSRTAS
SLANQYHIDSEQARRVLDTTMQMYEQWREQQPKLAHPQLEALLRWAAMLHEVGLNINHSGLHRHSAYILQNSDLPGFNQE
QQLMMATLVRYHRKAIKLDDLPRFTLFKKKQFLPLIQLLRLGVLLNNQRQATTTPPTLTLITDDSHWTLRFPHDWFSQNA
LVLLDLEKEQEYWEGVAGWRLKIEEESTPEIAA
>P0AFL6 3.6.1.11~~~ppx~~~Exopolyphosphatase~~~COG0248
MPIHDKSPRPQEFAAVDLGSNSFHMVIARVVDGAMQIIGRLKQRVHLADGLGPDNMLSEEAMTRGLNCLSLFAERLQGFS
PASVCIVGTHTLRQALNATDFLKRAEKVIPYPIEIISGNEEARLIFMGVEHTQPEKGRKLVIDIGGGSTELVIGENFEPI
LVESRRMGCVSFAQLYFPGGVINKENFQRARMAAAQKLETLTWQFRIQGWNVAMGASGTIKAAHEVLMEMGEKDGIITPE
RLEKLVKEVLRHRNFASLSLPGLSEERKTVFVPGLAILCGVFDALAIRELRLSDGALREGVLYEMEGRFRHQDVRSRTAS
SLANQYHIDSEQARRVLDTTMQMYEQWREQQPKLAHPQLEALLRWAAMLHEVGLNINHSGLHRHSAYILQNSDLPGFNQE
QQLMMATLVRYHRKAIKLDDLPRFTLFKKKQFLPLIQLLRLGVLLNNQRQATTTPPTLTLITDDSHWTLRFPHDWFSQNA
LVLLDLEKEQEYWEGVAGWRLKIEEESTPEIAA
>Q9ZN70 3.6.1.11~~~ppx~~~Exopolyphosphatase~~~
MDLQSMPQKPAEAFPLIAALDLGSNSFHLCLAKANIHGEVRILERLGEKVQLAAGLDEERNLSEEATQRGLDCLRRFAQF
ISGMPQGSVRVVATNALREARNRSDFIRRAEEVLGHPVEVISGREEARLIYLGVANSMPDSGGRRLVSDIGGGSTEFIIG
QGFESELRESLQMGCVSYTQRYFRDGKITPARYAQAYTAARLELMGIENSLRRLGWQQAVGASGTIRAVALAIKAGGHGN
GEISPDGLAWLKRKVLKLGDVEKLDLEGIKPDRRTIFPAGLAILEAIFDALELEQMVHSEGALREGVLYDLVGRHQHEDV
RERTISSLMQRYHVDPEQASRVEAKALKVLAEVGDAWELNGELHRDLLSWGARVHEIGLDIAHYHYHKHGAYLIEHSDLA
GFSRQDQQMLSLLVRGHRRNIPADKLAEFAEEGDKLVRLCIVLRFAILFHHIRGTQEMPSVRLKAEPKSLSVTFPEGWLE
ANPLTQADFAQEAEWLKRVGYSLNVR
>Q9S605 3.6.1.11~~~ppx~~~Exopolyphosphatase~~~COG0248
MDLQSMPQKPAEAFPLIAALDLGSNSFHLCLAKANIHGEVRILERLGEKVQLAAGLDEERNLSEEATQRGLDCLRRFAQF
ISGMPQGSVRVVATNALREARNRSDFIRRAEEVLGHPVEVISGREEARLIYLGVANSMPDSGGRRLVSDIGGGSTEFIIG
QGFESELRESLQMGCVSYTQRYFRDGKITPARYAQAYTAARLELMGIENSLRRLGWQQAVGASGTIRAVALAIKAGGHGN
GEISPDGLAWLKRKVLKLGDVEKLDLEGIKPDRRTIFPAGLAILEAIFDALELEQMVHSEGALREGVLYDLVGRHQHEDV
RERTISSLMQRYHVDPEQASRVEAKALKVLAEVGDAWELNDELHRDLLSWGARVHEIGLDIAHYHYHKHGAYLIEHSDLA
GFSRQDQQMLSLLVRGHRRNIPADKLAEFAAEGDKLVRLCIVLRFAILFHHIRGTQEMPSVRLKAEPKSLSVTFPEGWLE
ANPLTQADFAQEAEWLKRVGYSLNVR
>C4PWA1 2.5.1.121~~~ppzP~~~5,10-dihydrophenazine-1-carboxylate 9-dimethylallyltransferase~~~
MSESAELTELYSAIEETTRVVGAPCRRDTVRPILTAYEDVIAQSVISFRVQTGTSDAGDLDCRFTLLPKDMDPYATALSN
GLTAKTDHPVGSLLEEVHRQFPVDCYGIDFGAVGGFKKAWSFFRPDSLQSASDLAALPSMPSGVSENLGLFDRYGMTDTV
SVVGFDYAKRSVNLYFTGASPESFEPRGIQAILRECGLPEPSDELLRFGEEAFAIYVTLSWDSQKIERVTYSVNTPDPMA
LPVRIDTRIEQLVKDAPLGSAGHRYVYGVTATPKGEYHKIQKYFQWQSRVEKMLTADAG
>P0AFL9 ~~~pqiA~~~Intermembrane transport protein PqiA~~~COG2995
MCEHHHAAKHILCSQCDMLVALPRLEHGQKAACPRCGTTLTVAWDAPRQRPTAYALAALFMLLLSNLFPFVNMNVAGVTS
EITLLEIPGVLFSEDYASLGTFFLLFVQLVPAFCLITILLLVNRAELPVRLKEQLARVLFQLKTWGMAEIFLAGVLVSFV
KLMAYGSIGVGSSFLPWCLFCVLQLRAFQCVDRRWLWDDIAPMPELRQPLKPGVTGIRQGLRSCSCCTAILPADEPVCPR
CSTKGYVRRRNSLQWTLALLVTSIMLYLPANILPIMVTDLLGSKMPSTILAGVILLWSEGSYPVAAVIFLASIMVPTLKM
IAIAWLCWDAKGHGKRDSERMHLIYEVVEFVGRWSMIDVFVIAVLSALVRMGGLMSIYPAMGALMFALVVIMTMFSAMTF
DPRLSWDRQPESEHEES
>P43671 ~~~pqiB~~~Intermembrane transport protein PqiB~~~COG1463
MESNNGEAKIQKVKNWSPVWIFPIVTALIGAWVLFYHYSHQGPEVTLITANAEGIEGGKTTIKSRSVDVGVVESATLADD
LTHVEIKARLNSGMEKLLHKDTVFWVVKPQIGREGISGLGTLLSGVYIELQPGAKGSKMDKYDLLDSPPLAPPDAKGIRV
ILDSKKAGQLSPGDPVLFRGYRVGSVETSTFDTQKRNISYQLFINAPYDRLVTNNVRFWKDSGIAVDLTSAGMRVEMGSL
TTLLSGGVSFDVPEGLDLGQPVAPKTAFVLYDDQKSIQDSLYTDHIDYLMFFKDSVRGLQPGAPVEFRGIRLGTVSKVPF
FAPNMRQTFNDDYRIPVLIRIEPERLKMQLGENADVVEHLGELLKRGLRGSLKTGNLVTGALYVDLDFYPNTPAITGIRE
FNGYQIIPTVSGGLAQIQQRLMEALDKINKLPLNPMIEQATSTLSESQRTMKNLQTTLDSMNKILASQSMQQLPTDMQST
LRELNRSMQGFQPGSAAYNKMVADMQRLDQVLRELQPVLKTLNEKSNALVFEAKDKKDPEPKRAKQ
>P0AB10 ~~~pqiC~~~Intermembrane transport lipoprotein PqiC~~~COG3009
MKKWLVTIAALWLAGCSSGEINKNYYQLPVVQSGTQSTASQGNRLLWVEQVTVPDYLAGNGVVYQTSDVKYVIANNNLWA
SPLDQQLRNTLVANLSTQLPGWVVASQPLGSAQDTLNVTVTEFNGRYDGKVIVSGEWLLNHQGQLIKRPFRLEGVQTQDG
YDEMVKVLAGVWSQEAASIAQEIKRLP
>P27532 ~~~pqqA~~~Coenzyme PQQ synthesis protein A~~~
MQWTKPAFTDLRIGFEVTMYFEAR
>Q49149 ~~~pqqB~~~Coenzyme PQQ synthesis protein B~~~COG1235
MHVVILGSAAGGGVPQWNCRCSICSLAWAGDSRVRPRTQSSIAVSPDGERWLLLNASPDIRQQIQANPQMHPREGLRHSP
IHAVLLTNGDVDHVAGLLTLREGQPFTLYATPGILASVSDNRVFDVMAADVVKRQTIALNETFEPVPGLSVTLFSVPGKV
PLWLEDASMEIGAETETTVGTMIEAGGKRLAYIPGCARVTEDLKARIAGADALLFDGTVLEDDDMIRAGVGTKTGWRMGH
IQMNGETGSIASLADIEIGRRVFVHINNTNPVLIEDSYERASVEARGWTVAHDGLTLDL
>Q88QV5 ~~~pqqB~~~Coenzyme PQQ synthesis protein B~~~COG1235
MYIQVLGSAAGGGFPQWNCNCVNCKGYRDGTLKATARTQSSIALSDDGVHWILCNASPDIRAQLQAFAPMQPARALRDTG
INAIVLLDSQIDHTTGLLSLREGCPHQVWCTDMVHQDLTTGFPLFNMLSHWNGGLQWNRIELEGSFVIDACPNLKFTPFP
LRSAAPPYSPHRFDPHPGDNLGLMVEDTRTGGKLFYAPGLGQVDEKLLAMMHGADCLLVDGTLWEDDEMQRRGVGTRTGR
EMGHLAQNGPGGMLEVLDGFPRQRKVLIHINNTNPILDENSPERAEVLRRGVEVAFDGMSIEL
>Q49150 ~~~pqqCD~~~Bifunctional coenzyme PQQ synthesis protein C/D~~~COG5424
MTAQFPPPVPDTEQRLLSHEELEAALRDIGARRYHNLHPFHRLLHDGKLSKDQVRAWALNRYYYQAMIPVKDAALLARLP
DAQLRRIWRQRIVDHDGDHEGDGGIERWLKLAEGVGFTRDYVLSTKGILSATRFSVDAYVHFVSERSLLEAIASSLTEMF
SPTIISERVAGMLKNYDFITKDTLAYFDKRLTQAPRDADFALDYVKRHATTPEMQRAAIDALTFKCNVLWTQLDALYFAY
VAPGMVPPDAWQPGEGLVAETNSAEDSPAAAASPAATTAEPTAFSGSDVPRLPRGVRLRFDEVRNKHVLLAPERTFDLDD
NAVAVLKLVDGRNTVSQIAQILGQTYDADPAIIEADILPMLAGLAQKRVLER
>A6T9H1 1.3.3.11~~~pqqC~~~Pyrroloquinoline-quinone synthase~~~
MLITDTLSPQAFEEALRAKGDFYHIHHPYHIAMHNGNATREQIQGWVANRFYYQTTIPLKDAAIMANCPDAQTRRKWVQR
ILDHDGSHGEDGGIEAWLRLGEAVGLSRDDLLSERHVLPGVRFAVDAYLNFARRACWQEAACSSLTELFAPQIHQSRLDS
WPQHYPWIKEEGYFYFRSRLSQANRDVEHGLALAKAYCDSAEKQNRMLEILQFKLDILWSMLDAMTMAYALQRPPYHTVT
DKAAWHTTRLV
>P27505 1.3.3.11~~~pqqC~~~Pyrroloquinoline-quinone synthase~~~
MLITDTLSPQAFEEALRAKGAFYHIHHPYHIAMHNGDATRKQIQGWVANRFYYQTTIPLKDAAIMANCPDAQTRRKWVQR
ILDHDGSHGEDGGIEAWLRLGEAVGLSRDDLLSERHVLPGVRFAVDAYLNFARRACWQEAACSSLTELFAPQIHQSRLDS
WPQHYPWIKEEGYFYFRSRLSQANRDVEHGLALAKAYCDSAEKQNRMLEILQFKLDILWSMLDAMTMAYALQRPPYHTVT
DKAAWHTTRLV
>P27506 ~~~pqqD~~~PqqA binding protein~~~
MQKTSIVAFRRGYRLQWEAAQESHVILYPEGMAKLNETAAAILELVDGRRDVAAIIAMLNERFPEAGGVDDDVIEFLQIA
CQQKWITCREPE
>Q8P6M8 ~~~pqqD~~~PqqA binding protein~~~COG0535
MSTISRDSCPALRAGVRLQHDRARDQWVLLAPERVVELDDIALVVAQRYDGTQSLAQIAQTLAAEFDADASEIETDVIEL
TTTLHQKRLLRL
>P27507 1.21.98.4~~~pqqE~~~PqqA peptide cyclase~~~
MSQSKPTVNPPLWLLAELTYRCPLQCPYCSNPLDFARQDKELTTEQWIEVFRQARAMGSVQLGFSGGEPLTRKDLPELIR
AARDLGFYTNLITSGIGLTESKLDAFSEAGLDHIQISFQASDEVLNAALAGNKKAFQQKLAMAKAVKARDYPMVLNFVLH
RHNIDQLDKIIELCIELEADDVELATCQFYGWAFLNREGLLPTREQIARAEQVVADYRQKMAASGNLTNLLFVTPDYYEE
RPKGCMGGWGSIFLSVTPEGTALPCHSARQLPVAFPSVLEQSLESIWYDSFGFNRYRGYDWMPEPCRSCDEKEKDFGGCR
CQAFMLTGSADNADPVCSKSPHHHKILEARREAACSDIKVSQLQFRNRTRSQLIYQTRDL
>P71517 1.21.98.4~~~pqqE~~~PqqA peptide cyclase~~~COG0535
MNAPTPAPSPVDVIPAPVGLLAELTHRCPLRCPYCSNPLELDRRSAELDTQTWLRVLTEAAGLGVLHVHLSGGEPTARPD
IVEITAKCAELGLYSNLITSGVGGALAKLDALYDVGLDHVQLSVQGVDAANAEKIGGLKNAQPQKMQFAARVTELGLPLT
LNSVIHRGNIHEVPGFIDLAVKLGAKRLEVAHTQYYGWAYVNRAALMPDKSQVDESIRIVEAARERLKGQLVIDLVVPDY
YAKYPKACAGGWGRKLMNVTPQGKVLPCHAAETIPGLEFWYVTDHALGEIWTKSPAFAAYRGTSWMKEPCRSCDRREKDW
GGCRCQALALTGDAANTDPACSLSPLHAKMRDLAKEEAAETPPDYIYRSIGTNVQNPLSEKAPL
>P31828 3.4.24.-~~~pqqL~~~Probable zinc protease PqqL~~~COG0612
MEIIMRNLCFLLTLVATLLLPGRLIAAALPQDEKLITGQLDNGLRYMIYPHAHPKDQVNLWLQIHTGSLQEEDNELGVAH
FVEHMMFNGTKTWPGNKVIETFESMGLRFGRDVNAYTSYDETVYQVSLPTTQKQNLQQVMAIFSEWSNAATFEKLEVDAE
RGVITEEWRAHQDAKWRTSQARRPFLLANTRNLDREPIGLMDTVATVTPAQLRQFYQRWYQPNNMTFIVVGDIDSKEALA
LIKDNLSKLPANKAAENRVWPTKAENHLRFNIINDKENRVNGIALYYRLPMVQVNDEQSFIEQAEWSMLVQLFNQRLQER
IQSGELKTISGGTARSVKIAPDYQSLFFRVNARDDNMQDAANALMAELATIDQHGFSAEELDDVKSTRLTWLKNAVDQQA
ERDLRMLTSRLASSSLNNTPFLSPEETYQLSKRLWQQITVQSLAEKWQQLRKNQDAFWEQMVNNEVAAKKALSPAAILAL
EKEYANKKLAAYVFPGRNLSLTVDADPQAEISSKETLAENLTSLTLSNGARVILAKSAGEEQKLQIIAVSNKGDLSFPAQ
QKSLIALANKAVSGSGVGELSSSSLKRWSAENSVTMSSKVSGMNTLLSVSARTNNPEPGFQLINQRITHSTINDNIWASL
QNAQIQALKTLDQRPAEKFAQQMYETRYADDRTKLLQENQIAQFTAADALAADRQLFSSPADITFVIVGNVAEDKLVALI
TRYLGSIKHSDSPLAAGKPLTRATDNASVTVKEQNEPVAQVSQWKRYDSRTPVNLPTRMALDAFNVALAKDLRVNIREQA
SGAYSVSSRLSVDPQAKDISHLLAFTCQPERHDELLTLANEVMVKRLAKGISEQELNEYQQNVQRSLDIQQRSVQQLANT
IVNSLIQYDDPAAWTEQEQLLKQMTVENVNTAVKQYLSHPVNTYTGVLLPK
>P76115 ~~~pqqU~~~Pyrroloquinoline quinone transporter~~~COG4772
MKIFSVRQTVLPALLVLSPVVFAADEQTMIVSAAPQVVSELDTPAAVSVVDGEEMRLATPRINLSESLTGVPGLQVQNRQ
NYAQDLQLSIRGFGSRSTYGIRGIRLYVDGIPATMPDGQGQTSNIDLSSVQNVEVLRGPFSALYGNASGGVMNVTTQTGQ
QPPTIEASSYYGSFGSWRYGLKATGATGDGTQPGDVDYTVSTTRFTTHGYRDHSGAQKNLANAKLGVRIDEASKLSLIFN
SVDIKADDPGGLTKAEWKANPQQAPRAEQYDTRKTIKQTQAGLRYERSLSSRDDMSVMMYAGERETTQYQSIPMAPQLNP
SHAGGVITLQRHYQGIDSRWTHRGELGVPVTFTTGLNYENMSENRKGYNNFRLNSGMPEYGQKGELRRDERNLMWNIDPY
LQTQWQLSEKLSLDAGVRYSSVWFDSNDHYVTPGNGDDSGDASYHKWLPAGSLKYAMTDAWNIYLAAGRGFETPTINELS
YRADGQSGMNLGLKPSTNDTIEIGSKTRIGDGLLSLALFQTDTDDEIVVDSSSGGRTTYKNAGKTRRQGAELAWDQRFAG
DFRVNASWTWLDATYRSNVCNEQDCNGNRMPGIARNMGFASIGYVPEDGWYAGTEARYMGDIMADDENTAKAPSYTLVGL
FTGYKYNYHNLTVDLFGRVDNLFDKEYVGSVIVNESNGRYYEPSPGRNYGVGMNIAWRFE
>Q9I4X3 6.2.1.32~~~pqsA~~~Anthranilate--CoA ligase~~~
MSTLANLTEVLFRLDFDPDTAVYHYRGQTLSRLQCRTYILSQASQLARLLKPGDRVVLALNDSPSLACLFLACIAVGAIP
AVINPKSREQALADIAADCQASLVVREADAPSLSGPLAPLTLRAAAGRPLLDDFSLDALVGPADLDWSAFHRQDPAAACF
LQYTSGSTGAPKGVMHSLRNTLGFCRAFATELLALQAGDRLYSIPKMFFGYGMGNSLFFPWFSGASALLDDTWPSPERVL
ENLVAFRPRVLFGVPAIYASLRPQARELLSSVRLAFSAGSPLPRGEFEFWAAHGLEICDGIGATEVGHVFLANRPGQARA
DSTGLPLPGYECRLVDREGHTIEEAGRQGVLLVRGPGLSPGYWRASEEQQARFAGGWYRTGDLFERDESGAYRHCGREDD
LFKVNGRWVVPTQVEQAICRHLPEVSEAVLVPTCRLHDGLRPTLFVTLATPLDDNQILLAQRIDQHLAEQIPSHMLPSQL
HVLPALPRNDNGKLARAELRHLADTLYHDNLPEERAC
>Q9I4X2 ~~~pqsB~~~2-heptyl-4(1H)-quinolone synthase subunit PqsB~~~
MLIQAVGVNLPPSYVCLEGPLGGERPRAQGDEMLMQRLLPAVREALDEAAVKPEEIDLIVGLALSPDHLIENRDIMAPKI
GHPLQKVLGANRAHVFDLTDSSLARALYVVDTLASDQGYRNVLVVRGESSQGLEVDSESGFALADGALALLCRPTGKAAF
RRGALGGDPAQEWLPLSIPLNTDIRQVGDVKGHLNLPAQPGLPEAVRAGFTRLAGDFPQLNWVREEWFGQGRPDGRCLGP
FELASQLRAAQRDRLDELLLISFDPFGMVVEGVTLELAGEAHA
>Q9I4X1 2.3.1.230~~~pqsC~~~2-heptyl-4(1H)-quinolone synthase subunit PqsC~~~
MHKVKLAAITCELPARSYENDDPVFAAVPDLSESWWQFWGVNRRGYFDPRNGENEFSLVVRAAERLLRSSDTAPDSVDML
ICSASSPIMTDAGDVLPDLRGRLYPRMANVLSKQLGLSRALPLDSQMECASFLLNLRLAASMIRQGKAEKVLVVCSEYIS
NLLDFTSRTSTLFADGCAVALLTRGDDDSCDLLASAEHSDATFYEVATGRWRLPENPTGEAKPRLYFSLFSDGQNKMASF
VPTNVPIAMRRALEKAGLGSDDIDYFVFHQPAPFLVKAWAEGIGARPEQYQLTMGDTGVMISVSIPYTLMTGLREGKIRP
GDRIVMAGAATGWGFAAQVWQLGEVLVC
>P20582 2.3.1.262~~~pqsD~~~Anthraniloyl-CoA anthraniloyltransferase~~~
MGNPILAGLGFSLPKRQVSNHDLVGRINTSDEFIVERTGVRTRYHVEPEQAVSALMVPAARQAIEAAGLLPEDIDLLLVN
TLSPDHHDPSQACLIQPLLGLRHIPVLDIRAQCSGLLYGLQMARGQILAGLARHVLVVCGEVLSKRMDCSDRGRNLSILL
GDGAGAVVVSAGESLEDGLLDLRLGADGNYFDLLMTAAPGSASPTFLDENVLREGGGEFLMRGRPMFEHASQTLVRIAGE
MLAAHELTLDDIDHVICHQPNLRILDAVQEQLGIPQHKFAVTVDRLGNMASASTPVTLAMFWPDIQPGQRVLVLTYGSGA
TWGAALYRKPEEVNRPC
>P20581 3.1.2.32~~~pqsE~~~2-aminobenzoylacetyl-CoA thioesterase~~~
MLRLSAPGQLDDDLCLLGDVQVPVFLLRLGEASWALVEGGISRDAELVWADLCRWVADPSQVHYWLITHKHYDHCGLLPY
LCPRLPNVQVLASERTCQAWKSESAVRVVERLNRQLLRAEQRLPEACAWDALPVRAVADGEWLELGPRHRLQVIEAHGHS
DDHVVFYDVRRRRLFCGDALGEFDEAEGVWRPLVFDDMEAYLESLERLQRLPTLLQLIPGHGGLLRGRLAADGAESAYTE
CLRLCRRLLWRQSMGESLDELSEELHRAWGGQSVDFLPGELHLGSMRRMLEILSRQALPLD
>Q02N79 1.14.13.182~~~pqsH~~~2-heptyl-3-hydroxy-4(1H)-quinolone synthase~~~
MTVLIQGAGIAGLALAREFTKAGIDWLLVERASEIRPIGTGITLASNALTALSSTLDLDRLFRRGMPLAGINVYAHDGSM
LMSMPSSLGGSSRGGLALQRHELHAALLEGLDESRIRVGVSIVQILDGLDHERVTLSDGTVHDCSLVVGADGIRSSVRRY
VWPEATLRHSGETCWRLVVPHRLEDAELAGEVWGHGKRLGFIQISPREMYVYATLKVRREEPEDEEGFVTPQRLAAHYAD
FDGIGASIARLIPSATTLVHNDLEELAGASWCRGRVVLIGDAAHAMTPNLGQGAAMALEDAFLLARLWCLAPRAETLILF
QQQREARIEFIRKQSWIVGRLGQWESPWSVWLRNTLVRLVPNASRRRLHQRLFTGVGEMAAQ
>P9WIM9 ~~~~~~Proline-rich 28 kDa antigen~~~
MIQIARTWRVFAGGMATGFIGVVLVTAGKASADPLLPPPPIPAPVSAPATVPPVQNLTALPGGSSNRFSPAPAPAPIASP
IPVGAPGSTAVPPLPPPVTPAISGTLRDHLREKGVKLEAQRPHGFKALDITLPMPPRWTQVPDPNVPDAFVVIADRLGNS
VYTSNAQLVVYRLIGDFDPAEAITHGYIDSQKLLAWQTTNASMANFDGFPSSIIEGTYRENDMTLNTSRRHVIATSGADK
YLVSLSVTTALSQAVTDGPATDAIVNGFQVVAHAAPAQAPAPAPGSAPVGLPGQAPGYPPAGTLTPVPPR
>E3PTZ4 5.1.1.4~~~prdF~~~Proline racemase~~~COG3938
MKFSKGIHAIDSHTMGEPTRIVVGGIPQINGETMADKKKYLEDNLDYVRTALMHEPRGHNDMFGSIITSSNNKEADFGII
FMDGGGYLNMCGHGSIGAATVAVETGMVEMVEPVTNINMEAPAGLIKAKVMVENEKVKEVSITNVPSFLYMEDAKLEVPS
LNKTITFDISFGGSFFAIIHAKELGVKVETSQVDVLKKLGIEIRDLINEKIKVQHPELEHIKTVDLVEIYDEPSNPEATY
KNVVIFGQGQVDRSPCGTGTSAKLATLYKKGHLKIDEKFVYESITGTMFKGRVLEETKVGEFDAIIPEITGGAYITGFNH
FVIDPEDPLKYGFTV
>A8DEZ8 5.1.1.4~~~~~~Proline racemase~~~
MKFSRSIQAIDSHTAGEATRIVVGGIPNIKGNSMPEKKEYLEENLDYLRTAIMLEPRGHNDMFGSVMTQPCCPDADFGII
FMDGGGYLNMCGHGTIGAMTAAIETGVVPAVEPVTHVVMEAPAGIIRGDVTVVDGKAKEVSFLNVPAFLYKEGVEVDLPG
VGTVKFDISFGGSFFAIIHASQLGLKIEPQNAGKLTELAMKLRDIINEKIEIQHPTLAHIKTVDLVEIYDEPTHPEATYK
NVVIFGQGQVDRSPCGTGTSAKLATLHAKGELKVGEKFVYESILGTLFKGEIVEETKVADFNAVVPKITGSAYITGFNHF
VIDEEDPLKHGFILK
>C4TP09 1.14.13.33~~~praI~~~4-hydroxybenzoate 3-monooxygenase (NAD(P)H)~~~
MRTQVGIIGAGPAGLLLSHLLYLQGIESIIIENRTREEIEGTIRAGVLEQGTVDLMNQMGVGARMMKEGHFHEGFELRFN
GRGHRINVHELTGGKYVTVYAQHEVIKDLVAARLQTGGQIHFNVGDVSLHDVDTSSPKIRFRPNKDGELQEIECDFIAGC
DGFRGPSRPAIPQSVRKEYQKVYPFSWLGILVEAPPSAHELIYANHERGFALVSTRSPQIQRLYLQVDAQDHIDNWSDDR
IWSELHARLETRDGFKLLEGPIFQKGIVSMRSFVCDPMQHGRLFLAGDAAHIVPPTGAKGLNLAAADVQVLARGLEAYYK
AGKMEILNRCTEICLRRIWKAERFSWFMTTMLHRDQGHTPFERGIQLAELDYVTSSRAASTSLAENYIGLPMEF
>P9WIM7 ~~~pra~~~Proline-rich antigen homolog~~~COG1714
MTEQPPPGGSYPPPPPPPGPSGGHEPPPAAPPGGSGYAPPPPPSSGSGYPPPPPPPGGGAYPPPPPSAGGYAPPPPGPAI
RTMPTESYTPWITRVLAAFIDWAPYVVLVGIGWVIMLVTQTSSCVTSISEYDVGQFCVSQPSMIGQLVQWLLSVGGLAYL
VWNYGYRQGTIGSSIGKSVLKFKVVSETTGQPIGFGMSVVRQLAHFIDAIICFVGFLFPLWDAKRQTLADKIMTTVCVPI
>P23916 3.4.21.-~~~prcA~~~Calcium-dependent protease~~~COG1404
MVHVRYGGQNGEQYELAISENHIVVRTESRSSLISDRPFEAAPVSPQARNILNQFELSTRFSQAGVEVLHVKEPSHDGAL
RDTAREILNQEPEVQFAGRVLIDPVSQQPIVYTENLFVKFDHEEDVSFCQEILGRYGLTIKRQLEYARNAYFVSAPSNTG
LAIFDISERLLNEESVELCHPELVREFRQRQAFPPQWHLKQTTIGGKTINAHANVEAAWKLSDGTGTIIAIIDDGVDIDH
EEFRSSGKIVAPRDVTRKTNFPTPGNRDNHGTACAGVACGNGNFGASGVAPGAKLMPIRFVSALGSQDEADSFVWAAQNG
ADVISCSWGPPDGTWWDDKDPLHKQKVPLPDSTRLAMDYAINKGRNGKGCVILFAAGNGNESVDNDGYASYEKVIAVAAC
NDFGTRSAYSDFGTAVWCAFPSNNGNPSQTPGIWTADRTGVVGYNSGNTNLGDQAGNYTNSFGGTSSACPGAAGVAALIL
SRNPNLRWDEVRDIIKRSCDRIDPVGGNYNAEGRSPFYGYGRINALKAVELALPAQPEPVSIFTAVQDVPINDLQISQLS
LAIANTNPIKSIKVTVDIEHTYIGDLVVSLNPPAESGVLPIILHDRKGGGADDIKQTYDEVSTPGLTALKGKIPQGTWTL
EVADKAQADTGKIRSLTIELGF
>P23865 3.4.21.102~~~prc~~~Tail-specific protease~~~COG0793
MNMFFRLTALAGLLAIAGQTFAVEDITRADQIPVLKEETQHATVSERVTSRFTRSHYRQFDLDQAFSAKIFDRYLNLLDY
SHNVLLASDVEQFAKKKTELGDELRSGKLDVFYDLYNLAQKRRFERYQYALSVLEKPMDFTGNDTYNLDRSKAPWPKNEA
ELNALWDSKVKFDELSLKLTGKTDKEIRETLTRRYKFAIRRLAQTNSEDVFSLAMTAFAREIDPHTNYLSPRNTEQFNTE
MSLSLEGIGAVLQMDDDYTVINSMVAGGPAAKSKAISVGDKIVGVGQTGKPMVDVIGWRLDDVVALIKGPKGSKVRLEIL
PAGKGTKTRTVTLTRERIRLEDRAVKMSVKTVGKEKVGVLDIPGFYVGLTDDVKVQLQKLEKQNVSSVIIDLRSNGGGAL
TEAVSLSGLFIPAGPIVQVRDNNGKVREDSDTDGQVFYKGPLVVLVDRFSASASEIFAAAMQDYGRALVVGEPTFGKGTV
QQYRSLNRIYDQMLRPEWPALGSVQYTIQKFYRVNGGSTQRKGVTPDIIMPTGNEETETGEKFEDNALPWDSIDAATYVK
SGDLTAFEPELLKEHNARIAKDPEFQNIMKDIARFNAMKDKRNIVSLNYAVREKENNEDDATRLARLNERFKREGKPELK
KLDDLPKDYQEPDPYLDETVNIALDLAKLEKARPAEQPAPVK
>Q9Z4P6 1.21.4.1~~~prdA~~~D-proline reductase proprotein PrdA~~~COG0252
MSITLESAKEHANDLAVLCCRAEEGTVIGPSNLEDPAIFGDLEDSGLLTIPANCLKIGEVLGAKLVKTADSLTPLTPELL
EGVNSISEEAPKQEASAPVEAPVAEVAPAAMPVANVTGSMLKIHIGEGKDINLEIPLTIAGQMGVVAPTAAAPAGVAMPV
ASATEQVVAPAGEPKLVRTLQKKHFKIEKVEFGPETKIENNTIYIRENICEDAVKVSNLVTDIKVEIITPADYGKYSETI
MDVQPIATKEGDGKIGQGVTRVIDGAIIMVTGTDEDGVQIGEFGSSEGELDANIMWGRPGAPDKGEILIKTQVTIKAGTN
MERPGPLAAHKATDFITQEIREALKKLDDSEVVETEELAQYRRPGKKKVVIIKEIMGQGAMHDNLILPVEPVGVIGAKPN
VDLGNVPVVLSPLEVLDGGIHALTCIGPASKENSRHYWREPLVIEVMNDEEFDLAGVVFVGSPQVNAEKFYVSERLGMLV
ETMDVEGAFITTEGFGNNHIDFASHHEQVGMRGIPVVGMSFCAVQGALVVGNKYMKYMVDNNKSEQGIENEILSNNTLCP
EDAIRAVAMLKAAIAEEEVKVAERKFNKNVKENNVDLIEEQAGKEITLLPNEQVLPMSKKRKEIYEADK
>Q9Z4Q7 1.21.4.1~~~prdB~~~D-proline reductase subunit gamma~~~COG1978
MSDLTVVKGLQSEIYVPITPPPVWTPVTKELKDMTVALVTAAGVHMKADKRFNLAGDFSFRVIPGDASVNDMMVSHGGYD
NGDVNKDINCMFPIDPMRTLAKEGFIKALAPINIGFMGGGGDQKKFSEETGPEIARQLKEEGVDAVLLTAGUGTCHRSAV
IVQRAIEESGIPTIIIAALPPVVRQNGTPRAVAPLVPMGANAGEPNNPEMQKAICTDSLKQLVEIPSAGKIVPLPYEYVA
KV
>P25889 1.3.1.1~~~preA~~~NAD-dependent dihydropyrimidine dehydrogenase subunit PreA~~~COG0167
MLTKDLSITFCGVKFPNPFCLSSSPVGNCYEMCAKAYDTGWGGVVFKTIGFFIANEVSPRFDHLVKEDTGFIGFKNMEQI
AEHPLEENLAALRRLKEDYPDKVLIASIMGENEQQWEELARLVQEAGADMIECNFSCPQMTSHAMGSDVGQSPELVEKYC
RAVKRGSTLPMLAKMTPNIGDMCEVALAAKRGGADGIAAINTVKSITNIDLNQKIGMPIVNGKSSISGYSGKAVKPIALR
FIQQMRTHPELRDFPISGIGGIETWEDAAEFLLLGAATLQVTTGIMQYGYRIVEDMASGLSHYLADQGFDSLQEMVGLAN
NNIVPAEDLDRSYIVYPRINLDKCVGCGRCYISCYDGGHQAMEWSEKTRTPHCNTEKCVGCLLCGHVCPVGCIELGEVKF
KKGEKEHPVTL
>P76440 1.3.1.1~~~preT~~~NAD-dependent dihydropyrimidine dehydrogenase subunit PreT~~~COG0493
MPQQNYLDELTPAFTSLLAIKEASRCLLCHDAPCSQACPAQTDPGKFIRSIYFRNFKGAAETIRENNALGAVCARVCPTE
KLCQSGCTRAGVDAPIDIGRLQRFVTDFEQQTGMEIYQPGTKTLGKVAIIGAGPAGLQASVTLTNQGYDVTIYEKEAHPG
GWLRNGIPQFRLPQSVLDAEIARIEKMGVTIKCNNEVGNTLTLEQLKAENRAVLVTVGLSSGSGLPLFEHSDVEIAVDFL
QRARQAQGDISIPQSALIIGGGDVAMDVASTLKVLGCQAVTCVAREELDEFPASEKEFTSARELGVSIIDGFTPVAVEGN
KVTFKHVRLSGELTMAADKIILAVGQHARLDAFAELEPQRNTIKTQNYQTRDPQVFAAGDIVEGDKTVVYAVKTGKEAAE
AIHHYLEGACSC
>P13925 ~~~pre~~~Plasmid recombination enzyme~~~
MSYMVARMQKMKAGNLGGAFKHNERVFETHSNKDINPSRSHLNYELTDRDRSVSYEKQIKDYVNENKVSNRAIRKDAVLC
DEWIITSDKDFFEKLDEEQTRTFFETAKNYFAENYGESNIAYASVHLDESTPHMHMGVVPFENGKLSSKAMFDREELKHI
QEDLPRYMSDHGFELERGKLNSEAKHKTVAEFKRAMADMELKEELLEKYHAPPFVDERTGELNNDTEAFWHEKEFADMFE
VQSPIRETTNQEKMDWLRKQYQEELKKLESSKKPLEDDLSHLEELLDKKTKEYIKIDSEASERASELSKAEGYINTLENH
SKSLEAKIECLESDNLQLEKQKATKLEAKALNESELRELKPKKNFLGKEHYELSPEQFEGLKAEVYRSRTLLHHKDIELE
QAKRQVSLRASKNYFTASLERAKEKAKGESIDRLKSEIKRLKNENSILRQQNDKMLGKLRELMPDKAFKNLLSELKAIKP
IVNIIKKAIEKSLF
>P22262 ~~~prfA~~~Listeriolysin regulatory protein~~~COG0664
MNAQAEEFKKYLETNGIKPKQFHKKELIFNQWDPQEYCIFLYDGITKLTSISENGTIMNLQYYKGAFVIMSGFIDTETSV
GYYNLEVISEQATAYVIKINELKELLSKNLTHFFYVFQTLQKQVSYSLAKFNDFSINGKLGSICGQLLILTYVYGKETPD
GIKITLDNLTMQELGYSSGIAHSSAVSRIISKLKQEKVIVYKNSCFYVQNLDYLKRYAPKLDEWFYLACPATWGKLN
>P41783 ~~~prgH~~~Protein PrgH~~~
METSKEKTITSPGPYIVRLLNSSLNGCEFPLLTGRTLFVVGQSDALTASGQLPDIPADSFFIPLDHGGVNFEIQVDTDAT
EIILHELKEGNSESRSVQLNTPIQVGELLILIRPESEPWVPEQPEKLETSAKKNEPRFKNGIVAALAGFFILGIGTVGTL
WILNSPQRQAAELDSLLGQEKERFQVLPGRDKMLYVAAQNERDTLWARQVLARGDYDKNARVINENEENKRISIWLDTYY
PQLAYYRIHFDEPRKPVFWLSRQRNTMSKKELEVLSQKLRALMPYADSVNITLMDDVTAAGQAEAGLKQQALPYSRRNHK
GGVTFVIQGALDDVEILRARQFVDSYYRTWGGRYVQFAIELKDDWLKGRSFQYGAEGYIKMSPGHWYFPSPL
>P41785 ~~~prgJ~~~Protein PrgJ~~~
MSIATIVPENAVIGQAVNIRSMETDIVSLDDRLLQAFSGSAIATAVDKQTITNRIEDPNLVTDPKELAISQEMISDYNLY
VSMVSTLTRKGVGAVETLLRS
>P41786 ~~~prgK~~~Lipoprotein PrgK~~~
MIRRYLYTFLLVMTLAGCKDKDLLKGLDQEQANEVIAVLQMHNIEANKIDSGKLGYSITVAEPDFTAAVYWIKTYQLPPR
PRVEIAQMFPADSLVSSPRAEKARLYSAIEQRLEQSLQTMEGVLSARVHISYDIDAGENGRPPKPVHLSALAVYERGSPL
AHQISDIKRFLKNSFADVDYDNISVVLSERSDAQLQAPGTPVKRNSFATSWIVLIILLSVMSAGFGVWYYKNHYARNKKG
ITADDKAKSSNE
>P17888 3.6.4.-~~~priA~~~Primosomal protein N'~~~COG1198
MPVAHVALPVPLPRTFDYLLPEGMTVKAGCRVRVPFGKQQERIGIVVSVSDASELPLNELKAVVEVLDSEPVFTHSVWRL
LLWAADYYHHPIGDVLFHALPILLRQGRPAANAPMWYWFATEQGQAVDLNSLKRSPKQQQALAALRQGKIWRDQVATLEF
NDAALQALRKKGLCDLASETPEFSDWRTNYAVSGERLRLNTEQATAVGAIHSAADTFSAWLLAGVTGSGKTEVYLSVLEN
VLAQGKQALVMVPEIGLTPQTIARFRERFNAPVEVLHSGLNDSERLSAWLKAKNGEAAIVIGTRSALFTPFKNLGVIVID
EEHDSSYKQQEGWRYHARDLAVYRAHSEQIPIILGSATPALETLCNVQQKKYRLLRLTRRAGNARPAIQHVLDLKGQKVQ
AGLAPALITRMRQHLQADNQVILFLNRRGFAPALLCHDCGWIAECPRCDHYYTLHQAQHHLRCHHCDSQRPVPRQCPSCG
STHLVPVGLGTEQLEQTLAPLFPGVPISRIDRDTTSRKGALEQQLAEVHRGGARILIGTQMLAKGHHFPDVTLVALLDVD
GALFSADFRSAERFAQLYTQVAGRAGRAGKQGEVVLQTHHPEHPLLQTLLYKGYDAFAEQALAERRMMQLPPWTSHVIVR
AEDHNNQHAPLFLQQLRNLILSSPLADEKLWVLGPVPALAPKRGGRWRWQILLQHPSRVRLQHIINGTLALINTIPDSRK
VKWVLDVDPIEG
>P9WMQ9 3.6.4.-~~~priA~~~Probable primosomal protein N'~~~COG1198
MLSVPHLDRDFDYLVPAEHSDDAQPGVRVRVRFHGRLVDGFVLERRSDSDHHGKLGWLDRVVSPEPVLTTEIRRLVDAVA
ARYAGTRQDVLRLAVPARHARVEREITTAPGRPVVAPVDPSGWAAYGRGRQFLAALADSRAARAVWQALPGELWADRFAE
AAAQTVRAGRTVLAIVPDQRDLDTLWQAATALVDEHSVVALSAGLGPEARYRRWLAALRGSARLVIGTRSAVFAPLSELG
LVMVWADADDSLAEPRAPYPHAREVAMLRAHQARCAALIGGYARTAEAHALVRSGWAHDVVAPRPEVRARSPRVVALDDS
GYDDARDPAARTARLPSIALRAARSALQSGAPVLVQVPRRGYIPSLACGRCRAIARCRSCTGPLSLQGAGSPGAVCRWCG
RVDPTLRCVRCGSDVVRAVVVGARRTAEELGRAFPGTAVITSAGDTLVPQLDAGPALVVATPGAEPRAPGGYGAALLLDS
WALLGRQDLRAAEDALWRWMTAAALVRPRGAGGVVTVVAESSIPTVQSLIRWDPVGHAEAELAARTEVGLPPSVHIAALD
GPAGTVTALLEAARLPDPDRLQADLLGPVDLPPGVRRPAGIPADAPVIRMLLRVCREQGLELAASLRRGIGVLSARQTRQ
TRSLVRVQIDPLHIG
>P67675 ~~~priB~~~Primosomal replication protein N~~~
MNTLELSARVLECGAMRHTPAGLPALELLLVHESEVVEAGHPRRVELTISAVALGDLALLLADTPLGTEMQVQGFLAPAR
KDSVKVKLHLQQARRIAGSMGRDPLVG
>P67673 ~~~priB~~~Primosomal replication protein N~~~COG2965
MNTLELSARVLECGAMRHTPAGLPALELLLVHESEVVEAGHPRRVELTISAVALGDLALLLADTPLGTEMQVQGFLAPAR
KDSVKVKLHLQQARRIAGSMGRDPLVG
>P07013 ~~~priB~~~Primosomal replication protein N~~~COG2965
MTNRLVLSGTVCRAPLRKVSPSGIPHCQFVLEHRSVQEEAGFHRQAWCQMPVIVSGHENQAITHSITVGSRITVQGFISC
HKAKNGLSKMVLHAEQIELIDSGD
>Q5F924 ~~~priB~~~Primosomal replication protein N~~~
MGFTNLVSLAALIEKAFPIRYTPAGIPVLDIILKHESWQEENGQQCLVQLEIPARILGRQAEEWQYRQGDCATVEGFLAQ
KSRRSLMPMLRIQNIKEYKG
>P23862 ~~~priC~~~Primosomal replication protein N''~~~COG3923
MKTALLLEKLEGQLATLRQRCAPVSQFATLSARFDRHLFQTRATTLQACLDEAGDNLAALRHAVEQQQLPQVAWLAEHLA
AQLEAIAREASAWSLREWDSAPPKIARWQRKRIQHQDFERRLREMVAERRARLARVTDLVEQQTLHREVEAYEARLARCR
HALEKIENRLARLTR
>B5H7H3 4.2.3.182~~~~~~Pristinol synthase~~~COG0664
MAHETTSGRRLPDPTSPSDPTRRTAAIRIPFPARLNPHAERARQHTLQWVQETGLLTGDEATAEYDTLRLERLMAYFYPD
ASAGDLELAADFNAWFFIFDDQFDGGLGTRPHEIRGVVDALVGTMTTDGAPRPADVRDTPLVRAFRDIWLRSTAGAPYAW
RLRFRDHWQAYLAAHVGEAHHRNADRLPSLEQFLEVRRHSIGVQPCLDFTERCGGYALPDELYRSFPLREMREITGDVVI
FVNDIVSLVKELAAGDINNSVVIEREHKGCTLEESVEHITALANARTARFARLAASLPGTLADLGVPAPSREHVSHYVDG
MRHVMAGNLSWSLATSRYDETGIAAVSGGRRRPWDGLTTATGTASPRHPRRA
>Q81WH6 2.7.11.1~~~prkC~~~Serine/threonine-protein kinase PrkC~~~COG0515
MLIGKRLNDRYKLLKMIGGGGMANVYLAHDDILGRDVAVKILRLDYSNNEEFIKRFHREAQSVTTLSHPNIVNMYDVGEE
DGIYYLVMEYVPGQTLKQYIIERGMLPIGEALDIMEQLTSAMAHAHHFEIVHRDIKPHNILIRADGVIKVTDFGIATATS
ATTITHTNSVLGSVHYLSPEQARGGIANKQSDIYSLGIVMFELLTGRQPFSGESAVAIALKHLQSEIPSPKRWNENIPQS
VENIILKATAKDPFHRYQSANAMKRDIETALYPERINEQPFYIPEDMEATKAIPIIQQEQLFENVTDETIVLKGSKVDEQ
IRKEETDLSKKKKRSNKWLKILITTFLLLAIGITLALTVIPGFFIPKDVKVPDVAGMKYTTAVNTLVEKGFEVTEPNIVY
TDDVETGDVIKTDPVAGRVVKENSKITIYQSGGKKKSKMIDFTGKDLESIRTELEEKYKQVTVYYIEDDRPKGAIVEQIP
TSDQMVVEAEQELKIWVSKGPYQIRPGDFSRWTENSVTGYLNERKLTPDIKREYSDTVDKGLVISQSPKPGTPLKEGDKV
TIIISEGPKPKVTKTVKVDNISIPYESSIIGEKKPQTIEIYKEDMQQKMDRPIETRTISESATISLEFVIQEDTKGRYKI
VRDGVTIIDKEVPYPTQ
>O34507 2.7.11.1~~~prkC~~~Serine/threonine-protein kinase PrkC~~~COG0515
MLIGKRISGRYQILRVIGGGGMANVYLAEDIILDREVAIKILRFDYANDNEFIRRFRREAQSASSLDHPNIVSIYDLGEE
DDIYYIVMEYVEGMTLKEYITANGPLHPKEALNIMEQIVSAIAHAHQNQIVHRDIKPHNILIDHMGNIKVTDFGIATALS
STTITHTNSVLGSVHYLSPEQARGGLATKKSDIYALGIVLFELLTGRIPFDGESAVSIALKHLQAETPSAKRWNPSVPQS
VENIILKATAKDPFHRYETAEDMEADIKTAFDADRLNEKRFTIQEDEEMTKAIPIIKDEELAKAAGEKEAEVTTAQENKT
KKNGKRKKWPWVLLTICLVFITAGILAVTVFPSLFMPKDVKIPDVSGMEYEKAAGLLEKEGLQVDSEVLEISDEKIEEGL
MVKTDPKADTTVKEGATVTLYKSTGKAKTEIGDVTGQTVDQAKKALKDQGFNHVTVNEVNDEKNAGTVIDQNPSAGTELV
PSEDQVKLTVSIGPEDITLRDLKTYSKEAASGYLEDNGLKLVEKEAYSDDVPEGQVVKQKPAAGTAVKPGNEVEVTFSLG
PEKKPAKTVKEKVKIPYEPENEGDELQVQIAVDDADHSISDTYEEFKIKEPTERTIELKIEPGQKGYYQVMVNNKVVSYK
TIEYPKDE
>A6QGC0 2.7.11.1~~~prkC~~~Serine/threonine-protein kinase PrkC~~~
MIGKIINERYKIVDKLGGGGMSTVYLAEDTILNIKVAIKAIFIPPREKEETLKRFEREVHNSSQLSHQNIVSMIDVDEED
DCYYLVMEYIEGPTLSEYIESHGPLSVDTAINFTNQILDGIKHAHDMRIVHRDIKPQNILIDSNKTLKIFDFGIAKALSE
TSLTQTNHVLGTVQYFSPEQAKGEATDECTDIYSIGIVLYEMLVGEPPFNGETAVSIAIKHIQDSVPNVTTDVRKDIPQS
LSNVILRATEKDKANRYKTIQEMKDDLSSVLHENRANEDVYELDKMKTIAVPLKKEDLAKHISEHKSNQPKRETTQVPIV
NGPAHHQQFQKPEGTVYEPKPKKKSTRKIVLLSLIFSLLMIALVSFVAMAMFGNKYEETPDVIGKSVKEAEQIFNKNNLK
LGKISRSYSDKYPENEIIKTTPNTGERVERGDSVDVVISKGPEKVKMPNVIGLPKEEALQKLKSLGLKDVTIEKVYNNQA
PKGYIANQSVTANTEIAIHDSNIKLYESLGIKQVYVEDFEHKSFSKAKKALEEKGFKVESKEEYSDDIDEGDVISQSPKG
KSVDEGSTISFVVSKGKKSDSSDVKTTTESVDVPYTGKNDKSQKVKVYIKDKDNDGSTEKGSFDITSDQRIDIPLRIEKG
KTASYIVKVDGKTVAEKEVSYDDV
>P00778 3.4.21.12~~~alpha-LP~~~Alpha-lytic protease~~~
MYVSNHRSRRVARVSVSCLVAALAAMSCGAALAADQVDPQLKFAMQRDLGIFPTQLPQYLQTEKLARTQAAAIEREFGAQ
FAGSWIERNEDGSFKLVAATSGARKSSTLGGVEVRNVRYSLKQLQSAMEQLDAGANARVKGVSKPLDGVQSWYVDPRSNA
VVVKVDDGATEAGVDFVALSGADSAQVRIESSPGKLQTTANIVGGIEYSINNASLCSVGFSVTRGATKGFVTAGHCGTVN
ATARIGGAVVGTFAARVFPGNDRAWVSLTSAQTLLPRVANGSSFVTVRGSTEAAVGAAVCRSGRTTGYQCGTITAKNVTA
NYAEGAVRGLTQGNACMGRGDSGGSWITSAGQAQGVMSGGNVQSNGNNCGIPASQRSSLFERLQPILSQYGLSLVTG
>P27458 3.4.24.32~~~~~~Beta-lytic metalloendopeptidase~~~
MKKISKAGLGLALVCALATIGGNAARRATAQRRGSGVFYDEMFDFDIDAHLAKHAPHLHKHSEEISHWAGYSGISRSVDR
ADGAAERAVTPSARRIVRSASWRAPTASARRPARSRWRCASRCTSAIPTRQGAGDAGPRQSAAGAVRAFRRQRAGGRAAR
RRRVPAGLRPPVQRTAPGQGGFGPLRQGRPGRAAVSPNGLLQFPFPRGASWHVGGAHTNTGSGNYPMSSLDMSRGGGWGS
NQNGNWVSASAAGSFKRHSSCFAEIVHTGGWSTTYYHLMNIQYNTGANVSMNTAIANPANTQAQALCNGGQSTGPHEHWS
LKQNGSFYHLNGTYLSGYRITATGSSYDTNCSRFYLTKNGQNYCYGYYVNPGPN
>P00801 3.4.24.32~~~~~~Beta-lytic metalloendopeptidase~~~
SPNGLLQFPFPRGASWHVGGAHTNTGSGNYPMSSLDMSRGGGSNQNGNWVSASAAGGSFKRHSSCFAEIVHTGGWSTTYY
HLMNIQYNTGANVSMNTAIANAPNTQAQALCNGGQSTGPHQHWSLKQNGSFYHLNGTYLSGYRITATGSSYDTNCSRFYL
TKNGQNYCYGYYVNPGPN
>P15373 ~~~prlF~~~Antitoxin PrlF~~~COG2002
MPANARSHAVLTTESKVTIRGQTTIPAPVREALKLKPGQDSIHYEILPGGQVFMCRLGDEQEDHTMNAFLRFLDADIQNN
PQKTRPFNIQQGKKLVAGMDVNIDDEIGDDE
>P45558 2.1.1.-~~~prmA~~~Ribosomal protein L11 methyltransferase~~~COG2264
MDKDWFEVSVITSSEAVEAVTGILYNTPVKGVAIEDSKDVEFKKKHPGDWDYFDESLLNVKDGAVIKAYYKDDHNFDESV
KYIEESIDKLSEFGINKGEGKVFVNKVNETDWENNWKKYYKPTKIGARIVVKPLWEEYTPKDYELMLNMDPGMAFGTGTH
ETTRMCIQALERYVNEDAEVFDIGTGSGILAIAAAKLNAKKVLGVDLDSVAVKAAKENIQYNNVNNIEILHGNLMEVVQG
KADIIVANIIADVINILIPDINKFLKTDGYFISSGIIKDRAEDVIENLKKNKFEIIEVNNQGEWICIVAKL
>P0A8T1 2.1.1.-~~~prmA~~~Ribosomal protein L11 methyltransferase~~~COG2264
MPWIQLKLNTTGANAEDLSDALMEAGAVSITFQDTHDTPVFEPLPGETRLWGDTDVIGLFDAETDMNDVVAILENHPLLG
AGFAHKIEQLEDKDWEREWMDNFHPMRFGERLWICPSWRDVPDENAVNVMLDPGLAFGTGTHPTTSLCLQWLDSLDLTGK
TVIDFGCGSGILAIAALKLGAAKAIGIDIDPQAIQASRDNAERNGVSDRLELYLPKDQPEEMKADVVVANILAGPLRELA
PLISVLPVSGGLLGLSGILASQAESVCEAYADSFALDPVVEKEEWCRITGRKN
>Q768T5 1.14.13.227~~~prmA~~~Propane 2-monooxygenase, hydroxylase component large subunit~~~
MSRQSLTKAHAKITELSWEPTFATPATRFGTDYTFEKAPKKDPLKQIMRSYFPMEEEKDNRVYGAMDGAIRGNMFRQVQE
RWLEWQKLFLSIIPFPEISAARAMPMAIDAVPNPEIHNGLAVQMIDEVRHSTIQMNLKKLYMNNYIDPAGFDITEKAFAN
NYAGTIGRQFGEGFITGDAITAANIYLTVVAETAFTNTLFVAMPDEAAANGDYLLPTVFHSVQSDESRHISNGYSILLMA
LADERNRPLLERDLRYAWWNNHCVVDAAIGTFIEYGTKDRRKDRESYAEMWRRWIYDDYYRSYLLPLEKYGLTIPHDLVE
EAWNRIVDKHYVHEVARFFATGWPVNYWRIDAMTDTDFEWFEEKYPGWYNKFGKWWENYNRLAYPGKNKPIAFEDVDYEY
PHRCWTCMVPCLIREDMVTDKVDGQWRTYCSETCAWTDKVAFRPEYEGRPTPNMGRLTGFREWETLHHGKDLADIITDLG
YVRDDGKTLIPQPHLDLDPKKMWTLDDVRGIPFGSPNVALNEMSDDEREAHIAAYMANKNGAVTV
>Q04AV7 2.1.1.-~~~prmA~~~Ribosomal protein L11 methyltransferase~~~
MKLLEIKIESSYDVEDALAYFATEDLKALGTEARRRSDFEQAGWLHDSTVVDMDDIPNLPDELEFIAYFDEETDPEEMVK
CFKDKLAELAGYGLKTAPGEISVDYVADQDWNTVWKKYYHVINLSRHLAIVPEWEDYQPVFKDQEIIRLDPGLAFGTGNH
QTTQLAMLGIERAMVKPLTVADVGTGSGILAIAAHKLGAKSVLATDISDESMTAAEENAALNGIYDIALQKTSLLADVDG
KFDLIVANILAEILLDLIPQLDSHLNEDGQVIFSGIDYLQLPKIEQALAENSFQIDLKMRAGRWIGLAISRKHD
>Q0SJK9 1.14.13.227~~~prmA~~~Propane 2-monooxygenase, hydroxylase component large subunit~~~COG3350
MSRQSLTKAHAKITELSWDPTFATPATRFGTDYTFEKAPKKDPLKQIMRSYFPMEEEKDNRVYGAMDGAIRGNMFRQVQQ
RWLEWQKLFLSIIPFPEISAARAMPMAIDAVPNPEIHNGLAVQMIDEVRHSTIQMNLKKLYMNNYIDPAGFDMTEKAFAN
NYAGTIGRQFGEGFITGDAITAANIYLTVVAETAFTNTLFVAMPDEAAANGDYLLPTVFHSVQSDESRHISNGYSILLMA
LADERNRPLLERDLRYAWWNNHCVVDAAIGTFIEYGTKDRRKDRESYAEMWRRWIYDDYYRSYLIPLEKYGLTIPHDLVE
EAWKRITDKGYVHEVARFFATGWPVNYWRIDAMTDKDFEWFEHKYPGWYSKYGKWWEEYNRLAYPGRNKPIAFEEVGYQY
PHRCWTCMVPALIREDMVVEKVDDQWRTYCSETCYWTDAVAFRSEYQGRPTPNMGRLTGFREWETLHHGKDLADIVSDLG
YVRDDGKTLVGQPHLDLDDPKKMWTLDDVRGNTFQSPNVLLNEMSDAERNAHIAAYRAGGAVPA
>P0A0P5 2.1.1.-~~~prmA~~~Ribosomal protein L11 methyltransferase~~~
MNWTELSIIINHEAVELATNILENHGSNGVVIEDSDDLINQPEDKYGEIYALKKEDYPDKGVRLKAYFNEMTYDDKLRQQ
IKDELLNLDELDQHNIQFSEQIIAETDWENEWKNYFHPFRASKKFTIVPSWETYAKEADEELCIELDPGMAFGTGDHPTT
SMCLKAIETYVLPQHSVIDVGTGSGILSIASHLIGVKRIKALDIDEMAVSVAKENFRRNHCETLIEAVPGNLLKDETEKF
DIVIANILAHIIDEMIEDAYNTLNEGGYFITSGIIKEKYEGIQSHMERVGFKIISEQHDNGWVCLVGQKVSE
>Q84BQ9 2.1.1.-~~~prmA~~~Ribosomal protein L11 methyltransferase~~~COG2264
MWVYRLKGTLEALDPILPGLFDGGARGLWEREGEVWAFFPAPVDLPYEGVWEEVGDEDWLEAWRRDLKPALAPPFVVLAP
WHTWEGAEIPLVIEPGMAFGTGHHETTRLALKALARHLRPGDKVLDLGTGSGVLAIAAEKLGGKALGVDIDPMVLPQAEA
NAKRNGVRPRFLEGSLEAALPFGPFDLLVANLYAELHAALAPRYREALVPGGRALLTGILKDRAPLVREAMAGAGFRPLE
EAAEGEWVLLAYGR
>P39199 2.1.1.298~~~prmB~~~Ribosomal protein uL3 glutamine methyltransferase~~~COG2890
MDKIFVDEAVNELQTIQDMLRWSVSRFSAANIWYGHGTDNPWDEAVQLVLPSLYLPLDIPEDMRTARLTSSEKHRIVERV
IRRVNERIPVAYLTNKAWFCGHEFYVDERVLVPRSPIGELINNKFAGLISKQPQHILDMCTGSGCIAIACAYAFPDAEVD
AVDISPDALAVAEQNIEEHGLIHNVIPIRSDLFRDLPKVQYDLIVTNPPYVDAEDMSDLPNEYRHEPELGLASGTDGLKL
TRRILGNAADYLADDGVLICEVGNSMVHLMEQYPDVPFTWLEFDNGGDGVFMLTKEQLIAAREHFAIYKD
>Q768T4 1.18.1.-~~~prmB~~~Propane 2-monooxygenase, reductase component~~~
MADTHKISFEPVDIEMEVGEDETILDAAFRQGIHLMHGCREGRCSCKSYMLEGDVQMDDYSTFACNDAEEAEGYVLLCRT
YAYSDCEIELLNFDEDELLGGAPIQDVTTKVAAIEPMTPDIVSLKLDVVEPESVEFKSGQYFDLFIPGTEDKRSFSIATT
PATPDRLEFLIKKYPGGLFAGMLTDGLSVGQEIKLNGPYGSCTLRNGHVLPIVAIGGGAGMAPLLSLLRHISETGLNRPV
RFYYGARTAADLFLLDEIATLGEKIDDFSFTACLSESTDNAPEGVTVIGGNVTDIVNDNEADLARTEVYFCAPPPMVDAA
LALAEQHSVPHDQIFYDKFTSPAFDS
>Q0SJK8 1.18.1.-~~~prmB~~~Propane 2-monooxygenase, reductase component~~~COG1018
MAPRPLRRHPPLHHSFHESRRDPVADKHRINFEPVDIEMEVGEDEYILDAAFRQGIHLMHGCREGRCSACKSFVLEGDIQ
MEDYSTFACNDAEVDEGHVLLCRSTAYSDCTIELLNFDEDELLGGVPIQDVRTRVTRIEPMTKDIVSLRLAPVEPAGYEF
KPGQYSDLHIPGTEEHRSFSMATTRSTPGHVEFLIKKYPGGKFAGLLEDGISVGDEIALTGPYGSFTIKEGHVLPMVFIG
GGAGMAPLLSLLRHMSETGNTRQVHFYYGARTPQDLFYVDEILELGRGLTDFTFVACLSESMDPPPVGAIAVEDGNVTDV
VGRREPDIGRAEVYLCGPPPMVDAALELLEANGTPKDQIFYDKFTSPAFE
>B0B9D1 2.1.1.297~~~prmC~~~Release factor glutamine methyltransferase~~~
MKKLLREASEYLLSRGIRFPQREAEDILMDLLEISSRSALHQAKLSSEEQSLYWKRLRKRGDRCPTAYIHGKVHFLGVEL
QVTPQVLIPRQETEIFVEQIIGYLQMHKEKTTFYDVCCGSGCIGLAVRKHCPHVRVTLSDISPEALAIAESNARSNALAV
DFLLGDLFDPFSFPADVLVCNPPYLSYKEFFESDPEVRCHEPWKALVGGVSGLEFYHRIATHIHKILVSGGVGWLEIGST
QGEDVKQIFHAKGIRGRVLKDYAQLDRFFFLENQANDAVSSGEVSGFSER
>P0ACC1 2.1.1.297~~~prmC~~~Release factor glutamine methyltransferase~~~COG2890
MEYQHWLREAISQLQASESPRRDAEILLEHVTGRGRTFILAFGETQLTDEQCQQLDALLTRRRDGEPIAHLTGVREFWSL
PLFVSPATLIPRPDTECLVEQALARLPEQPCRILDLGTGTGAIALALASERPDCEIIAVDRMPDAVSLAQRNAQHLAIKN
IHILQSDWFSALAGQQFAMIVSNPPYIDEQDPHLQQGDVRFEPLTALVAADSGMADIVHIIEQSRNALVSGGFLLLEHGW
QQGEAVRQAFILAGYHDVETCRDYGDNERVTLGRYYQ
>Q768T3 1.14.13.227~~~prmC~~~Propane 2-monooxygenase, hydroxylase component small subunit~~~
MSAPAQPRERSFPSIEFTDAEADAREFPSSRSRKYNYYQPSKKRATIYEDVTVDVQPDPERHLTQGWIYGFGDGPGGYPK
EWTSAQSSNWHQFLDPNEEWEQSIYRNNSAVVHQVDLCLQNAKRARAYDGWNSAWLKFIERNLGAWMHAESGMGLHVFTS
IQRSAPTNMINNAVCVNAAHKLRFAQDLALFNLDLSEAEEAFDGSAHKEVWQSAPEWQPTREAVERLTAIGDWAELLFCS
NIVFEQLVGSLFRSELVMQVAARNGDYITPTIVGTGEYDYDRDLNYSRALFQMLARDEKHGIDNRKLFSRWMSEWFPGAS
TRARGLQPIWSQPADKSVTFSSSLEHAKTKFADVLAAIDVDIPEELNK
>P9WHV3 2.1.1.297~~~prmC~~~Release factor glutamine methyltransferase~~~COG2890
MTLRQAIDLAAALLAEAGVDSARCDAEQLAAHLAGTDRGRLPLFEPPGDEFFGRYRDIVTARARRVPLQHLIGTVSFGPV
VLHVGPGVFVPRPETEAILAWATAQSLPARPLIVDACTGSGALAVALAQHRANLGLKARIIGIDDSDCALDYARRNAAGT
PVELVRADVTTPRLLPELDGQVDLMVSNPPYIPDAAVLEPEVAQHDPHHALFGGPDGMTVISAVVGLAGRWLRPGGLFAV
EHDDTTSSSTVDLVSSTKLFVDVQARKDLAGRPRFVTAMRWGHLPLAGENGAIDPRQRRCRAKR
>Q0SJK7 1.14.13.227~~~prmC~~~Propane 2-monooxygenase, hydroxylase component small subunit~~~
MTATAESKQRSFPKIEFTDSEAGALEFPSSRSRTFTYYTPAKKRSTMYEDVTVDVQPDPDRHLSQGWIYGFGDGPGGYPQ
EWTAAKSSNWHAFLDPNEEWDQTIYRNNSKVVHQVELCLSNAKRARVYDGWNTPWLTFISRNLGAWMHAENGLALHVFTS
IQRSCPTNMINTAVAVNAAHKMRFAQDLALFNLDLSEATENFDGTAHKEVWQSAPEWQPTREVVERLTAVPDWCELLFGS
NIVFEQLVGTLFRSELVMQIAAGNGDYITPTIVGTGEHDYDRDLAYTRNLFRLLTRDPEHGEANKELFGTWLAIWVPRCL
DAARALQPIWSQPADKAITFATSFDAATDKFRSLLEDLGLDIPKELDQ
>Q9WYV8 2.1.1.297~~~prmC~~~Release factor glutamine methyltransferase~~~COG2890
MDTRKNVSGAERKIWSLIRDCSGKLEGVTETSVLEVLLIVSRVLGIRKEDLFLKDLGVSPTEEKRILELVEKRASGYPLH
YILGEKEFMGLSFLVEEGVFVPRPETEELVELALELIRKYGIKTVADIGTGSGAIGVSVAKFSDAIVFATDVSSKAVEIA
RKNAERHGVSDRFFVRKGEFLEPFKEKFASIEMILSNPPYVKSSAHLPKDVLFEPPEALFGGEDGLDFYREFFGRYDTSG
KIVLMEIGEDQVEELKKIVSDTVFLKDSAGKYRFLLLNRRSS
>Q768T2 ~~~prmD~~~Propane 2-monooxygenase, effector component~~~
MQFGADTEFSNMCGVTLMNTPIGRVVADVMGAKDGVELTEYPSMIRVDGVNRLDFDYDELTDALGQDFDGSIFEEISSTH
YGRMVHLDDKTILFASPEDAAEFIGFDLTAS
>Q0SJK6 ~~~prmD~~~Propane 2-monooxygenase, effector component~~~
MSMQFGSSTEFSNMCGVTLMNTPIGRVVAEVMGAKDGVELTEYPSMIRVDGQRLLDFDYEELTDALGQEFDGSIFEEISS
THYGRMVHLDEKTLLFASPEDAAEYIGFDLTAQ
>P95480 1.14.19.9~~~prnA~~~Flavin-dependent tryptophan halogenase PrnA~~~
MNKPIKNIVIVGGGTAGWMAASYLVRALQQQANITLIESAAIPRIGVGEATIPSLQKVFFDFLGIPEREWMPQVNGAFKA
AIKFVNWRKSPDPSRDDHFYHLFGNVPNCDGVPLTHYWLRKREQGFQQPMEYACYPQPGALDGKLAPCLSDGTRQMSHAW
HFDAHLVADFLKRWAVERGVNRVVDEVVDVRLNNRGYISNLLTKEGRTLEADLFIDCSGMRGLLINQALKEPFIDMSDYL
LCDSAVASAVPNDDARDGVEPYTSSIAMNSGWTWKIPMLGRFGSGYVFSSHFTSRDQATADFLKLWGLSDNQPLNQIKFR
VGRNKRAWVNNCVSIGLSSCFLEPLESTGIYFIYAALYQLVKHFPDTSFDPRLSDAFNAEIVHMFDDCRDFVQAHYFTTS
RDDTPFWLANRHDLRLSDAIKEKVQRYKAGLPLTTTSFDDSTYYETFDYEFKNFWLNGNYYCIFAGLGMLPDRSLPLLQH
RPESIEKAEAMFASIRREAERLRTSLPTNYDYLRSLRDGDAGLSRGQRGPKLAAQESL
>P95481 1.14.19.-~~~prnB~~~Monodechloroaminopyrrolnitrin synthase PrnB~~~
MERTLDRVGVFAATHAAVAACDPLQARALVLQLPGLNRNKDVPGIVGLLREFLPVRGLPCGWGFVEAAAAMRDIGFFLGS
LKRHGHEPAEVVPGLEPVLLDLARATNLPPRETLLHVTVWNPTAADAQRSYTGLPDEAHLLESVRISMAALEAAIALTVE
LFDVSLRSPEFAQRCDELEAYLQKMVESIVYAYRFISPQVFYDELRPFYEPIRVGGQSYLGPGAVEMPLFVLEHVLWGSQ
SDDQTYREFKETYLPYVLPAYRAVYARFSGEPALIDRALDEARAVGTRDEHVRAGLTALERVFKVLLRFRAPHLKLAERA
YEVGQSGPEIGSGGYAPSMLGELLTLTYAARSRVRAALDES
>P95483 1.14.13.-~~~prnD~~~Aminopyrrolnitrin oxygenase PrnD~~~
MNDIQLDQASVKKRPSGAYDATTRLAASWYVAMRSNELKDKPTELTLFGRPCVAWRGATGRAVVMDRHCSHLGANLADGR
IKDGCIQCPFHHWRYDEQGQCVHIPGHNQAVRQLEPVPRGARQPTLVTAERYGYVWVWYGSPLPLHPLPEISAADVDNGD
FMHLHFAFETTTAVLRIVENFYDAQHATPVHALPISAFELKLFDDWRQWPEVESLALAGAWFGAGIDFTVDRYFGPLGML
SRALGLNMSQMNLHFDGYPGGCVMTVALDGDVKYKLLQCVTPVSEGKNVMHMLISIKKVGGILRRATDFVLFGLQTRQAA
GYDVKIWNGMKPDGGGAYSKYDKLVLKYRAFYRGWVDRVASER
>Q2SZ88 1.2.1.41~~~proA~~~Gamma-glutamyl phosphate reductase~~~
MDIDQYMTDVGRRARRASRSIARASTAAKNAALEAVARAIERDAGALKAANARDVARAKDKGLDAAFVDRLTLSDKALKT
MVEGLRQVATLPDPIGEMSNLKYRPSGIQVGQMRVPLGVIGIIYESRPNVTIDAAALCLKSGNATILRGGSEALESNTAL
AKLIGEGLAEAGLPQDTVQVVETADRAAVGRLITMTEYVDVIVPRGGKSLIERLINEARVPMIKHLDGICHVYVDDRASV
TKALTVCDNAKTHRYGTCNTMETLLVARGIAPAVLSPLGRLYREKGVELRVDADARAVLEAAGVGPLVDATDEDWRTEYL
APVLAIKIVDGIDAAIEHINEYGSHHTDAIVTEDHDRAMRFLREVDSASVMVNASTRFADGFEFGLGAEIGISNDKLHAR
GPVGLEGLTSLKYVVLGHGEGRQ
>P07004 1.2.1.41~~~proA~~~Gamma-glutamyl phosphate reductase~~~COG0014
MLEQMGIAAKQASYKLAQLSSREKNRVLEKIADELEAQSEIILNANAQDVADARANGLSEAMLDRLALTPARLKGIADDV
RQVCNLADPVGQVIDGGVLDSGLRLERRRVPLGVIGVIYEARPNVTVDVASLCLKTGNAVILRGGKETCRTNAATVAVIQ
DALKSCGLPAGAVQAIDNPDRALVSEMLRMDKYIDMLIPRGGAGLHKLCREQSTIPVITGGIGVCHIYVDESVEIAEALK
VIVNAKTQRPSTCNTVETLLVNKNIADSFLPALSKQMAESGVTLHADAAALAQLQAGPAKVVAVKAEEYDDEFLSLDLNV
KIVSDLDDAIAHIREHGTQHSDAILTRDMRNAQRFVNEVDSSAVYVNASTRFTDGGQFGLGAEVAVSTQKLHARGPMGLE
ALTTYKWIGIGDYTIRA
>P21347 3.4.24.-~~~~~~Zinc metalloproteinase~~~COG3227
MHPNYYLSPLAVAIALGIASPVKAADPIPLQKSSFSEVTQKFQLTLPGVMKGAVVSTNSLQFIRQHTDGNKVTHVRMQQQ
YAGFPVFGGYAILHSKNATPSLATAKSDEKMNGVIYDGLQAELGQPKPSFVKNASMALQQFKDKYANKQVSEDQVTPMIY
IDEKHQAHWAYKVSVLVIHDDRIPERPTAIIDAETNKPFVQWDDVKTEKVQAKGMGFGGNRKIGEYQFGKDLPLLEITRD
SSVEMCFMENTDVKVVDMGHKYYSNNKPMQFTCKETPDTQSTKTYYTGYSADGYDRDNGAASPTNDALYAGYVIKHMYHD
WYGVEALTKSDGSPMQLVMRVHYGQGYENAYWDGKQMTFGDGDTMMYPLVSLGVGGHEVSHGFTEQHSGLEYFGQSGGMN
ESFSDMAAQAAEYYSVGKNSWQIGPEIMKEDSGYDALRYMDKPSRDGMSIDVADDYYGGLDVHYSSGVYNHLFYILANQP
NWNLRMAFDVMVKANMDYWTPYSTFDEGGCGMLSAAKDLGYNLDDIKKSLSEVTINYQSCYVD
>P9WHV1 1.2.1.41~~~proA~~~Gamma-glutamyl phosphate reductase~~~COG0014
MTVPAPSQLDLRQEVHDAARRARVAARRLASLPTTVKDRALHAAADELLAHRDQILAANAEDLNAAREADTPAAMLDRLS
LNPQRVDGIAAGLRQVAGLRDPVGEVLRGYTLPNGLQLRQQRVPLGVVGMIYEGRPNVTVDAFGLTLKSGNAALLRGSSS
AAKSNEALVAVLRTALVGLELPADAVQLLSAADRATVTHLIQARGLVDVVIPRGGAGLIEAVVRDAQVPTIETGVGNCHV
YVHQAADLDVAERILLNSKTRRPSVCNAAETLLVDAAIAETALPRLLAALQHAGVTVHLDPDEADLRREYLSLDIAVAVV
DGVDAAIAHINEYGTGHTEAIVTTNLDAAQRFTEQIDAAAVMVNASTAFTDGEQFGFGAEIGISTQKLHARGPMGLPELT
STKWIAWGAGHTRPA
>Q9WYC9 1.2.1.41~~~proA~~~Gamma-glutamyl phosphate reductase~~~COG0014
MDELLEKAKKVREAWDVLRNATTREKNKAIKKIAEKLDERRKEILEANRIDVEKARERGVKESLVDRLALNDKRIDEMIK
ACETVIGLKDPVGEVIDSWVREDGLRIARVRVPIGPIGIIYESRPNVTVETTILALKSGNTILLRGGSDALNSNKAIVSA
IREALKETEIPESSVEFIENTDRSLVLEMIRLREYLSLVIPRGGYGLISFVRDNATVPVLETGVGNCHIFVDESADLKKA
VPVIINAKTQRPGTCNAAEKLLVHEKIAKEFLPVIVEELRKHGVEVRGCEKTREIVPDVVPATEDDWPTEYLDLIIAIKV
VKNVDEAIEHIKKYSTGHSESILTENYSNAKKFVSEIDAAAVYVNASTRFTDGGQFGFGAEIGISTQRFHARGPVGLREL
TTYKFVVLGEYHVRE
>P39820 2.7.2.11~~~proB~~~Glutamate 5-kinase 1~~~COG0263
MKKQRIVVKIGSSSLTNSKGSIDEAKIREHVQAISVLKKAGHEMILITSGAVAAGFSSLGYPSRPVTIKGKQAAAAVGQT
LLMQQYMNQFKQYSLTPGQILLTRNDFSKRERYRNAYATIMELLERGVIPIINENDSTSVEELTFGDNDMLSALVSGLIH
ADQLMILTDINGLYDANPNENPEAKRFDYLPEITPELLGYAGSAGSKVGTGGMKSKLLATQTALSLGVKVFIGTGSGEQK
LADILDGRGDGTYIGDKELSSVNNTRQWIQFHSPISGEIIIDAGAEEAMIHNGSSLLPAGVVGVNGSFPKGAVVEVRGPG
GVIGKGQTHYSSEEIMEAKGKRSDELDFEKTFEVIHRNDWVNVKD
>Q2SZF9 2.7.2.11~~~proB~~~Glutamate 5-kinase~~~
MRSIIADSKRLVVKVGSSLVTNDGRGLDHDAIGRWAAQIAALRNEGKEVVLVSSGAIAEGMQRLGWSRRPREIDELQAAA
AVGQMGLAQVYESRFAEHGIRTAQILLTHADLADRERYLNARSTLLTLLRLGVVPIINENDTVVTDEIKFGDNDTLGALV
ANLIEGDALIILTDQQGLFTADPRKDPGATLVAEASAGAPELEAMAGGAGSSIGRGGMLTKILAAKRAAHSGANTVIASG
RERDVLLRLASGEAIGTQLIARTARMAARKQWMADHLQVRGHVVIDAGAVDKLTAGGKSLLPIGVVAVQGVFARGEVIAC
VNDAGREVARGITNYSSAEAKLIQRKPSGEIEAVLGYMLEPELIHRDNLVLV
>Q9PJ29 2.7.2.11~~~proB~~~Glutamate 5-kinase~~~COG0263
MKRIVVKVGSHVISEENTLSFERLKNLVAFLAKLMEKYEVILVTSAAISAGHTKLDIDRKNLINKQVLAAIGQPFLISVY
NELLAKFNKLGGQILLTGKDFDSRKATKHAKNAIDMMINLGILPIINENDATAIEEIVFGDNDSLSAYATHFFDADLLVI
LSDIDGFYDKNPSEFSDAKRLEKITHIKEEWLQATIKTGSEHGTGGIVTKLKAAKFLLEHNKKMFLASGFDLSVAKTFLL
EDKQIGGTLFE
>P0A7B5 2.7.2.11~~~proB~~~Glutamate 5-kinase~~~COG0263
MSDSQTLVVKLGTSVLTGGSRRLNRAHIVELVRQCAQLHAAGHRIVIVTSGAIAAGREHLGYPELPATIASKQLLAAVGQ
SRLIQLWEQLFSIYGIHVGQMLLTRADMEDRERFLNARDTLRALLDNNIVPVINENDAVATAEIKVGDNDNLSALAAILA
GADKLLLLTDQKGLYTADPRSNPQAELIKDVYGIDDALRAIAGDSVSGLGTGGMSTKLQAADVACRAGIDTIIAAGSKPG
VIGDVMEGISVGTLFHAQATPLENRKRWIFGAPPAGEITVDEGATAAILERGSSLLPKGIKSVTGNFSRGEVIRICNLEG
RDIAHGVSRYNSDALRRIAGHHSQEIDAILGYEYGPVAVHRDDMITR
>A0R148 2.7.2.11~~~proB~~~Glutamate 5-kinase~~~COG0263
MSEHREAVRTARSVVVKIGTTALTTPSGVFDANRLASLVEAIEGRMKAGSDVVIVSSGAIAAGIEPLGLSKRPTDLATKQ
AAASVGQVALVNAWSAAFAVYNRTVGQVLLTAHDISMRVQHNNAQRTLDRLRALHAVAIVNENDTVATNEIRFGDNDRLS
ALVAHLVGADALILLSDIDGLYDGDPRKATPDKPARFIPEVAAQGDLDGVVAGRGSSLGTGGMASKLSSALLAADAGVPV
LLAAAADAGRALDDASVGTVFAPRPERMSARKFWMRYAAESAGALTLDDGAVRAVIKQRRSLLPAGITSVTGRFHGGDVV
DLRALDGHTVARGVVAYDQAELASIIGRSTHELPVEMRRPAVHADDLVRT
>P9WHU9 2.7.2.11~~~proB~~~Glutamate 5-kinase~~~COG0263
MRSPHRDAIRTARGLVVKVGTTALTTPSGMFDAGRLAGLAEAVERRMKAGSDVVIVSSGAIAAGIEPLGLSRRPKDLATK
QAAASVGQVALVNSWSAAFARYGRTVGQVLLTAHDISMRVQHTNAQRTLDRLRALHAVAIVNENDTVATNEIRFGDNDRL
SALVAHLVGADALVLLSDIDGLYDCDPRKTADATFIPEVSGPADLDGVVAGRSSHLGTGGMASKVAAALLAADAGVPVLL
APAADAATALADASVGTVFAARPARLSARRFWVRYAAEATGALTLDAGAVRAVVRQRRSLLAAGITAVSGRFCGGDVVEL
RAPDAAMVARGVVAYDASELATMVGRSTSELPGELRRPVVHADDLVAVSAKQAKQV
>P09879 ~~~~~~Protein B~~~
DQVTTPQVVNHVNSNNQAQQMAQKLDQDSIQLRNIKDNVQGTDYEKPVNEAITSVEKLKTSLRANSETVYDLNSIGSRVE
ALTDVIEAITFSTQHLANKVSQANIDMGFGITKLVIRILDPFASVDSIKAQVNDVKALEQKVLTYPDLKPTDRATIYTKS
KLDKEIWNTRFTRDKKVLNVKEFKVYNTLNKAITHAVGVQLNPNVTVQQVDQEIVTLQAALQTALK
>Q8RMG1 1.5.5.2~~~fadM~~~Proline dehydrogenase 1~~~
MLRHVFLFLSQNKTLTKFAKAYGTRLGARRFVAGDTIESAVKTVKRLNRSGLCATIDYLGEYAASEKEANQVAEECKKAI
QAIAEHQLDSELSLKLTSIGLDLSEELALTHLRAILSVAKQYDVAVTIDMEDYSHYEQTLSIYRQCKQEFEKLGTVIQAY
LYRAAEDIKKMRDLKPNLRLVKGAYKESAAVAFPDKRGTDLHFQSLIKLQLLSGNYTAVATHDDDIIKFTKQLVAEHRIP
ASQFEFQMLYGIRPERQKELAKEGYRMRVYVPYGTDWFSYFMRRIAERPANAAFVLKGILKK
>O32179 1.5.5.2~~~fadM~~~Proline dehydrogenase 1~~~COG0506
MLRHVFLFLSQNKTLTKFAKAYGTRLGARRFVAGDTIESAVKTVKRLNRSGLCATIDYLGEYAASEKEANQVAEECKKAI
QAIAEHQLNSELSLKLTSIGLDLSEELALTHLRAILSVAKQYDVAVTIDMEDYSHYEQTLSIYRQCKQEFEKLGTVIQAY
LYRAAEDIRKMRDLKPNLRLVKGAYKESAAVAFPDKRGTDLHFQSLIKLQLLSGNYTAVATHDDDIIAFTKQLVAEHQIP
ASQFEFQMLYGIRPERQKELAKEGYRMRVYVPYGTDWFSYFMRRIAERPANAAFVLKGILKK
>P94390 1.5.5.2~~~putB~~~Proline dehydrogenase 2~~~COG0506
MITRDFFLFLSKSGFLNKMARNWGSRVAAGKIIGGNDFNSSIPTIRQLNSQGLSVTVDHLGEFVNSAEVARERTEECIQT
IATIADQELNSHVSLKMTSLGLDIDMDLVYENMTKILQTAEKHKIMVTIDMEDEVRCQKTLDIFKDFRKKYEHVSTVLQA
YLYRTEKDIDDLDSLNPFLRLVKGAYKESEKVAFPEKSDVDENYKKIIRKQLLNGHYTAIATHDDKMIDFTKQLAKEHGI
ANDKFEFQMLYGMRSQTQLSLVKEGYNMRVYLPYGEDWYGYFMRRLAERPSNIAFAFKGMTKK
>Q9RW55 1.5.5.2~~~~~~Proline dehydrogenase~~~COG0506
MIDQLYRKAVLTVAERPQVEQLARQKMWNLAERFVAGESIESAIQAVQALERDGIAGNLDLLGEFIDSPAKCTEFADDVI
KLIEAAHAAGIKPYVSIKLSSVGQGKDENGEDLGLTNARRIIAKAKEYGGFICLDMEDHTRVDVTLEQFRTLVGEFGAEH
VGTVLQSYLYRSLGDRASLDDLRPNIRMVKGAYLEPATVAYPDKADVDQNYRRLVFQHLKAGNYTNVATHDERIIDDVKR
FVLAHGIGKDAFEFQMLYGIRRDLQKQLAAEGYRVRVYLPYGRDWYAYFSRRIAETPRNAAFVVQGMLKG
>Q72IB8 1.5.5.2~~~~~~Proline dehydrogenase~~~COG0506
MNLDLAYRSFVLGVAGHPQVERLIKHRAKGLVRRYVAGETLEEALKAAEALEREGVHAILDLLGEMVRTEEEARAFQRGL
LELVWALAGKPWPKYISLKLTQLGLDLSEDLALALLREVLREAEPRGVFVRLDMEDSPRVEATLRLYRALREEGFSQVGI
VLQSYLYRTEKDLLDLLPYRPNLRLVKGAYREPKEVAFPDKRLIDAEYLHLGKLALKEGLYVAFATHDPRIIAELKRYTE
AMGIPRSRFEFQFLYGVRPEEQRRLAREGYTVRAYVPYGRDWYPYLTRRIAERPENLLLVLRSLVSG
>R9UTQ8 1.14.11.57~~~~~~L-proline trans-4-hydroxylase~~~
MSVSAPLLDAKVRYGRDGWLPLPHTLSDPDVRKLRQRIEGISREQRPEVVLEEGSSAVRALHGCHDFDEVCARLVRLPAL
VGLAEQLLGGPVYVYQFKVNMKQAHEGAAWPWHQDFAFWHHEDGMGAPDAVNIAIFLDDVTDENGPLEVIPGSQHAGIVE
DTARPGRERSHDWRHHVSAKLEYVVPDEIAGRLAGTFGVRRLTGPAGTAVAFHPSIIHSSSNNTSAQRRCVLLITYNRVT
NTPAHPVRPPFLVSRDSTPVVPVDADRL
>Q6E3K9 ~~~pcfA~~~Propionicin-F~~~
MNTKAVNLKSENTTKLVSYLTENQLDEFIRRIRIDGALVEEVSQNAKQALDNTGLNGWINTDCDEGLLSDFISKIASARW
IPLAESIRPAVTDRDKYRVSCWFYQGMNIAIYANIGGVANIIGYTEAAVATLLGAVVAVAPVVPGTPTPPKDKSSQYKEV
PLAVRLSETYHEEGVRGLFDELNYSESRMISTLRRASTDGVLINSWNDGQDTILLKKYNFQDLQLTVRSRIVGNQTIIEE
CKITDGRKTLSDETV
>Q79VC4 ~~~proP~~~Ectoine/proline transporter ProP~~~COG0477
MSPIRSKKKIKNEPRLTVDDVNVVPPKKIRPAIKGTVVGNFMEWYDFGIYGYLTVTMTAVFTQGLPQEWQLLAVMFGFAV
SYLVRPLGGLVLGPLGDKVGRQKVLYVTMAMMAVSTALIGLLPTAASIGAWALVLLYLLKMVQGFSTGGEYAGATTYVAE
FAPDRRRGFFGAFLDMGSYLGFAAGASVVAITTWVTTHFYGATAMEDFGWRIPFLTAIPLGIIAVYLRTRIPETPAFENN
QDEPNAVVEKDTEDPYARLGLAGVIRHHWRPLLIGIAIVAATNTAGYALTSYMPVYLEEQIGLHSASAAAVTVPILVVMS
LLLPFVGMWSDRVGRKPVYATAVAATLILMVPAFLIMNTGTIGAVLIALSMVAIPTGLYVALSASALPALFPTASRFSGM
GISYNISVSLFGGTTPLITQFLLQKTGLDIVPALYIMFFSAIAGVALLFMTESSQKPLLGSFPTVETKSEAVEIVKNQDE
DPNIDLSHMPFPDEENVGAEKQNA
>P0C0L7 ~~~proP~~~Proline/betaine transporter~~~COG0477
MLKRKKVKPITLRDVTIIDDGKLRKAITAASLGNAMEWFDFGVYGFVAYALGKVFFPGADPSVQMVAALATFSVPFLIRP
LGGLFFGMLGDKYGRQKILAITIVIMSISTFCIGLIPSYDTIGIWAPILLLICKMAQGFSVGGEYTGASIFVAEYSPDRK
RGFMGSWLDFGSIAGFVLGAGVVVLISTIVGEANFLDWGWRIPFFIALPLGIIGLYLRHALEETPAFQQHVDKLEQGDRE
GLQDGPKVSFKEIATKYWRSLLTCIGLVIATNVTYYMLLTYMPSYLSHNLHYSEDHGVLIIIAIMIGMLFVQPVMGLLSD
RFGRRPFVLLGSVALFVLAIPAFILINSNVIGLIFAGLLMLAVILNCFTGVMASTLPAMFPTHIRYSALAAAFNISVLVA
GLTPTLAAWLVESSQNLMMPAYYLMVVAVVGLITGVTMKETANRPLKGATPAASDIQEAKEILVEHYDNIEQKIDDIDHE
IADLQAKRTRLVQQHPRIDE
>P45577 ~~~proQ~~~RNA chaperone ProQ~~~COG3109
MENQPKLNSSKEVIAFLAERFPHCFSAEGEARPLKIGIFQDLVDRVAGEMNLSKTQLRSALRLYTSSWRYLYGVKPGATR
VDLDGNPCGELDEQHVEHARKQLEEAKARVQAQRAEQQAKKREAAATAGEKEDAPRRERKPRPTTPRRKEGAERKPRAQK
PVEKAPKTVKAPREEQHTPVSDISALTVGQALKVKAGQNAMDATVLEITKDGVRVQLNSGMSLIVRAEHLVF
>A0A0S2DNK1 3.4.21.-~~~~~~Probable serine protease FE772_23060~~~
MEQERIRQLAKLLAAGARARDAQAESMMDGGVAAAPAPAGQTLDVVEQALSTPPDDVDETQWRAAREQLLTHAHNGLVKL
QRGDLQLDADEGCAMEAVIISDGSRPSFLLCDGEIDPKDPSIETWAGNIAAAQALGIAKLAAAVGRIQPKNGHASRYVGT
GTLIDRDAGLILTNYHVIEQAQQNYGVAMTRNGDRLSVDGWLEIDFVGESCSLRTHRFRIVEVALPQGYGSTFHGIDAAV
ARIEPLPDSPALPDPVPLLSADAAYATGAISSLALIGFPARPSLQDGKDVDWSFVMRVLFGNRFGVKRLAPGQFTLPLGS
HALDQGRRAIGHDATTFGGASGSLLMSWLDDRTPSFALHFGGATGVSNYALSFAAERNALSAIGARF
>A0A0S2DN74 3.4.21.-~~~~~~Probable serine protease FE772_23065~~~
MVAVVQADYSRAEALAAWTRLSDEFIGNCYVSVRPRHAPAWEVVVASAAGSLRLEAFKRAHDHDFLDRLAVAIGNWEQKA
QRPDHEIAQMLDQVGDYGLMQGMTNPDKGFMHADILLPLLQARDACAIVVRTEPVFQQLGTAFLVRPDLILTAAHVVMDV
DAATGRWASTLKNGLAFHFREKPNQREHPTLAIRPAAVALISHALPHGRPPNLLERSLAAPADTCLDYALIRLAQRVSHL
RPVEVVDTAAVKQGKPCWAFGFPGGNALMMDVDLVTDIDPGSGRWLHRANVAAGMSGGCCINHEGQVAGLHEGTLDSEDD
AKVKRNRGISIAAIRRDQCRDGKDPLKQVASSPSLEFRDPALVDGWYRAGTALAGEAGAAQWRASVEAALRGRNPDATDS
LPAYHPWFARADVEKWIDSAAPDERLSLIHGPPGVGKSFCIHLLRGKLDPYADLVVFNPTQTNDMTWSDATGHAAAANAS
DYRTAAASVRYRAIDDFLGELRDRSAGGTRTCYVAIDFGPAGRQDRFVGTNWVELIAILAAAGWIRVMLIGLDDYERSVM
IDRMESRPETDSVKIAESELAPITAAEFRTYAKHLASARGKPAPKQAEMAKYVDSAVVGVAEPMKMVAAVRAAIELEAAL
S
>J9ZXD8 ~~~poyA~~~Polytheonamide B~~~
MADSDNTPTSRKDFETAIIAKAWKDPEYLRRLRSNPREVLQEELEALHPGAQLPDDLGISIHEEDENHVHLVMPRHPQNV
SDQTLTDDDLDQAAGGTGIGVVVAVVAGAVANTGAGVNQVAGGNINVVGNINVNANVSVNMNQTT
>P0DV49 3.4.21.-~~~~~~Roc-COR-CHAT protease~~~
MSTLNNLTKTLTQQVDDGPISAVRLVKGYDNSYALNAAGQVTALSIGNSSLKKLVLGTEAQALEYLYLSGSESLKEVVFE
VPLPHLTHLYLNNCAIKDITIPKGFRSLQQVYLQKNGLTELVFEGDCPALVLLDVSENQLKGLSFHSGFRALKYIYATNN
VLQKITFNRSMRLLNTLHLAKNQLTELAPFLSEIETMETLYLQGNQLLRIDREIWDRDLNCWDTMKGYLTSLNKGNIIRE
YLHEAKMILIGNGEVGKTSIRLKLLDINAPLPDKKDRTPGLDIVPYTIANLSSAQTGLPASTSFTFNIWDFGGQGKYREI
QQLFCSRKSLYLYVTAYDDTADYNELYVGYEYWLAMANAYNNDSGQHSPVIYVQNKNDMGEKPINEEEVKHKFGNVGAFI
KISCVDKEQFTALPKLIVKSISKISEDIFTVQYNSEWLGVKEDLNELKNQGTNYISKEEFITICTERGLSEDEIRAWLTV
LDRIGAVIYFGDNEKLKDWIVLNPIWVKDAICKVIDFEFYDDIATLKPSFLPKIWEEYSESEREKLLQLLISYHFCYKEE
ESFIVPALFTENQPKYPEHLSSFDCEIKLKYVSFLPAGTLHKFMVKLHDKIYNELRWKKGVVLQDGATNTYAEVTERWKE
HSIYIRLKDKKGQHPLWQEIQKTLQELNEELKTTKLMQLDFEVYCFYNNKWKAKPDIEDLCELDSTNIFKFMFGLSSPVK
EYISQKPTPLLGKAKLSIIFLTADPENKNPIGASVQKQRIENTISDAFEFYNNLETKYNEIGSNTVGKDIVHITVHGIPD
QLFFVHDKHTDQSNPVQSDYLCKQLEKTKQVKKLIVLIACNSETTAKKIVDGGLTKYAIGTTIDISSKAAIDFSEKFYDL
LKNSPLQIESVFEETCYELNQDKYRAKLDDGEFYDYSKVFKIFKKTT
>P0DV53 3.4.21.-~~~~~~Roc-COR-CHAT protease~~~
MPQPAIIAKLEQQLRITIAPFDAPDLKAFMRYGRKGDDNSQYFLKDGKLTGLKLRTLGLKDTSFLEERELAGLQGLYLAE
NDFSSLQLPGHLQQLRLLHLADNKELKTLEFAGSMPLLEEIDLSDSGIQTLQLPACPALQKLDVSRSKLEAFSFASACPA
LWWLDLSGNGELRKLKMPAGFKALQYLYLYKSGIQELQINGKLPKLVVLDLEGNQLKQWPEKLLLPEGLETLYLEGNPIE
NIPETIRGSGERHNSVEDVRQYLLSIIDEDKVEYLHQAKMILVGNGEVGKTSIRLKLLDSKATLPKKKDRTQGLDIVPYE
LKALSPDLTGLEDAIDFQLNIWDFGGQGKYREVQQLFFSPQSLYLFVTACDDTPEDKESYVTFDYWMSLVHALSYDREQE
RSSPVIHVVNKIDKERMDIDQTAHNRFGNVEEFHAISCKHLTNFEALRKAIPRVLPKVGQGIFRDQYNEDWLGVMEELQR
RQGEHHITYQEYRDEVCKDRLNDGEARAWLRILDRIGTVIYFGENEKLKDWIILNPNWVKQAVFEVIDSGERNPVPQWRF
EKQIWSHYSEKEREKLFELLQAYDLAYKQQNAFGEQEFVVPALLEHESPNYQDLLPQEELPLKLRFAFKPILPAGTVNKL
MVRLKDYIYRGLMWKDNVVFHHPDSNAYVQVEEDWQEHFIYLSVYGKQPSVIYETVTSTLKDINNDFKNAKFLKELEFTV
EGFDGEEWMKKRLLKKAGADFFNFLWEKPGIHYKEEDEKNIMDQVKELIAKNRVGDALEMLKSLVPAHLEAEVLQLISRY
SKLQRDSRMGILANNDENVERNRIVSSILNLASEAERDNDHTGLSDSSDQEDETFTNVTDQTKKILFICSSPSGKNLLDF
GKEFKSIGIARQLADSRDDYAEPIIKTSVEADDLLHIMTKYQPDILHISLHSSKSKGLYFENNAGQAEPISAEDFKDIIE
TYASDPDGKGRIETIVLSCCDSEAYGRAITNFADHIVVTKDLFPDKAAVVYAKDFYRMLFNNKDIGFAHRSATSAIKRKK
YPSDGFAHPIHEIPFLIKNDDK
>P14175 ~~~proV~~~Glycine betaine/proline betaine transport system ATP-binding protein ProV~~~COG4175
MAIKLEIKNLYKIFGEHPQRAFKYIEQGLSKEQILEKTGLSLGVKDASLAIEEGEIFVIMGLSGSGKSTMVRLLNRLIEP
TRGQVLIDGVDIAKISDAELREVRRKKIAMVFQSFALMPHMTVLDNTAFGMELAGINAEERREKALDALRQVGLENYAHS
YPDELSGGMRQRVGLARALAINPDILLMDEAFSALDPLIRTEMQDELVKLQAKHQRTIVFISHDLDEAMRIGDRIAIMQN
GEVVQVGTPDEILNNPANDYVRTFFRGVDISQVFSAKDIARRTPNGLIRKTPGFGPRSALKLLQDEDREYGYVIERGNKF
VGAVSIDSLKTALTQQQGLDAALIDAPLAVDAQTPLSELLSHVGQAPCAVPVVDEDQQYVGIISKGMLLRALDREGVNNG
>P17328 ~~~proV~~~Glycine betaine/proline betaine transport system ATP-binding protein ProV~~~
MAIKLEVKNLYKIFGEHPQRAFKYIEKGLSKEQILEKTGLSLGVKDASLAIEEGEIFVIMGLSGSGKSTMVRLLNRLIEP
TRGQVLIDGVDIAKISDAELREVRRKKIAMVFQSFALMPHMTVLDNTAFGMELAGIAAQERREKALDALRQVGLENYAHA
YPDELSGGMRQRVGLARALAINPDILLMDEAFSALDPLIRTEMQDELVKLQAKHQRTIVFISHDLDEAMRIGDRIAIMQN
GEVVQVGTPDEILNNPANDYVRTFFRGVDISQVFSAKDIARRSPVGLIRKTPGFGPRSALKLLQDEDREYGYVIERGNKF
VGVVSIDSLKAALSQAQGIEAALIDDPLVVDAQTPLSELLSHVGQAPCAVPVVDEEHQYVGIISKRMLLQALDREGGNNG
>P14176 ~~~proW~~~Glycine betaine/proline betaine transport system permease protein ProW~~~COG4176
MADQNNPWDTTPAADSAAQSADAWGTPTTAPTDGGGADWLTSTPAPNVEHFNILDPFHKTLIPLDSWVTEGIDWVVTHFR
PVFQGVRVPVDYILNGFQQLLLGMPAPVAIIVFALIAWQISGVGMGVATLVSLIAIGAIGAWSQAMVTLALVLTALLFCI
VIGLPLGIWLARSPRAAKIIRPLLDAMQTTPAFVYLVPIVMLFGIGNVPGVVVTIIFALPPIIRLTILGINQVPADLIEA
SRSFGASPRQMLFKVQLPLAMPTIMAGVNQTLMLALSMVVIASMIAVGGLGQMVLRGIGRLDMGLATVGGVGIVILAIIL
DRLTQAVGRDSRSRGNRRWYTTGPVGLLTRPFIK
>P17327 ~~~proW~~~Glycine betaine/proline betaine transport system permease protein ProW~~~
MADQTNPWDTAQVADTTTQTADAWGTPAGVATDGGSTDWLNSAPAPAPEHFSLLDPFHKTLIPLDSWVTEGIDWVVTHFR
PLFQGIRVPVDYILNGFQQLLLGMPAPVAIILFALIAWQVSGVGMGIATLISLIAIGAIGAWSQAMITLALVLTALLFCV
VIGLPMGIWLARSPRAAKIVRPLLDAMQTTPAFVYLVPIVMLFGIGNVPGVVVTIIFALPPIVRLTILGINQVPADLIEA
SRSFGASPRQMLFKVQLPLAMPTIMAGVNQTLMLALSMVVIASMIAVGGLGQMVLRGIGRLDMGLATVGGVGIVILAIIL
DRLTQAVGRDSRSRGNRRWYTTGPVGLITRPFVK
>P64483 3.1.1.-~~~proXp-y~~~Multifunctional Ser/Thr-tRNA deacylase ProXp-y~~~COG2606
MTEMAKGSVTHQRLIALLSQEGADFRVVTHEAVGKCEAVSEIRGTALGQGAKALVCKVKGNGVNQHVLAILAADQQADLS
QLASHIGGLRASLASPAEVDELTGCVFGAIPPFSFHPKLKLVADPLLFERFDEIAFNAGMLDKSVILKTADYLRIAQPEL
VNFRRTA
>Q9L4Q7 ~~~proX~~~Prolyl-tRNA editing protein ProX~~~COG3760
MDMDAKQAVIAKLDELKINYTLIEHDPVYTIEEMEKIDIENVDYIVKNLFLRDAKGRQHYLVVADKDQKIDLKTLQDKIG
STKLSFASEDRLQKYLKLTKGAVSPFGVLNDETAEVEVVFDKNLVGRSCVAVHPNDNSATVVLSYEDLEKIVKANGNTFK
AIEL
>P0AFM2 ~~~proX~~~Glycine betaine/proline betaine-binding periplasmic protein~~~COG2113
MRHSVLFATAFATLISTQTFAADLPGKGITVNPVQSTITEETFQTLLVSRALEKLGYTVNKPSEVDYNVGYTSLASGDAT
FTAVNWTPLHDNMYEAAGGDKKFYREGVFVNGAAQGYLIDKKTADQYKITNIAQLKDPKIAKLFDTNGDGKADLTGCNPG
WGCEGAINHQLAAYELTNTVTHNQGNYAAMMADTISRYKEGKPVFYYTWTPYWVSNELKPGKDVVWLQVPFSALPGDKNA
DTKLPNGANYGFPVSTMHIVANKAWAEKNPAAAKLFAIMQLPVADINAQNAIMHDGKASEGDIQGHVDGWIKAHQQQFDG
WVNEALAAQK
>Q8ZML1 ~~~proX~~~Glycine betaine/proline betaine-binding periplasmic protein~~~
MRHTVIFASAFATLVTASAFAADLPGKGITVQPIQSTISEETFQTLLVSRALEKLGYTVNKPSEVDYNVGYTSIASGDAT
FTAVNWQPLHDDMYAAAGGDNKFYREGVFVSGAAQGYLIDKKTAEQYNITNIAQLKDPKIAKIFDTNGDGKADMMGCSPG
WGCEAVINHQNKAFDLQKTVEVSHGNYAAMMADTITRFKEGKPVLYYTWTPYWVSDVMKPGKDVVWLQVPFSSLPGEQKN
IDTKLPNGANYGFPVNTMHIVANKAWAEKNPAAAKLFAIMKLPLADINAQNAMMHAGKSSEADVQGHVDGWINAHQQQFD
GWVKEALAAQK
>P0AAE2 ~~~proY~~~Proline-specific permease ProY~~~COG1113
MESKNKLKRGLSTRHIRFMALGSAIGTGLFYGSADAIKMAGPSVLLAYIIGGIAAYIIMRALGEMSVHNPAASSFSRYAQ
ENLGPLAGYITGWTYCFEILIVAIADVTAFGIYMGVWFPTVPHWIWVLSVVLIICAVNLMSVKVFGELEFWFSFFKVATI
IIMIVAGFGIIIWGIGNGGQPTGIHNLWSNGGFFSNGWLGMVMSLQMVMFAYGGIEIIGITAGEAKDPEKSIPRAINSVP
MRILVFYVGTLFVIMSIYPWNQVGTAGSPFVLTFQHMGITFAASILNFVVLTASLSAINSDVFGVGRMLHGMAEQGSAPK
IFSKTSRRGIPWVTVLVMTTALLFAVYLNYIMPENVFLVIASLATFATVWVWIMILLSQIAFRRRLPPEEVKALKFKVPG
GVATTIGGLIFLLFIIGLIGYHPDTRISLYVGFAWIVVLLIGWMFKRRHDRQLAENQ
>P55798 3.1.3.16~~~pphA~~~Serine/threonine-protein phosphatase 1~~~COG0639
MKQPAPVYQRIAGHQWRHIWLSGDIHGCLEQLRRKLWHCRFDPWRDLLISVGDVIDRGPQSLRCLQLLEQHWVCAVRGNH
EQMAMDALASQQMSLWLMNGGDWFIALADNQQKQAKTALEKCQHLPFILEVHSRTGKHVIAHADYPDDVYEWQKDVDLHQ
VLWSRSRLGERQKGQGITGADHFWFGHTPLRHRVDIGNLHYIDTGAVFGGELTLVQLQ
>Q8ZNY9 3.1.3.16~~~pphA~~~Serine/threonine-protein phosphatase 1~~~
MMRPEEIYQRIEAKNWRHVWVVGDIHGCFSMLMKRLRECRFDPQQDLLVSVGDLIDRGPDSLGCLALLRESWMTAVRGNH
EQMALDARASSQSTLWLMNGGDWFTRLTAEHAAQAEALFILCQRLPWILEVRCRHSTHVIAHADYPASTYQWQKKVDLHQ
VLWSRERLINKRGGISGADHFWFGHTPLRRRMDFANVHYIDTGAVFGGQLTLARIQ
>P55799 3.1.3.16~~~pphB~~~Serine/threonine-protein phosphatase 2~~~COG0639
MPSTRYQKINAHHYRHIWVVGDIHGEYQLLQSRLHQLSFFPKIDLLISVGDNIDRGPESLDVLRLLNQPWFTSVKGNHEA
MALEAFETGDGNMWLASGGDWFFDLNDSEQQEAIDLLLKFHHLPHIIEITNDNIKYAIAHADYPGSEYLFGKEIAESELL
WPVDRVQKSLNGELQQINGADYFIFGHMMFDNIQTFANQIYIDTGSPNSGRLSFYKIK
>Q8ZMH3 3.1.3.16~~~pphB~~~Serine/threonine-protein phosphatase 2~~~
MELIRYADINSDLYRHIWVVGDIHGCYSLLLTRLAQLNFSPDTDLLISTGDNIDRGKENLETLRLLNTPWFISVVGNHEA
MALDAFETQDGNFWYVNGGYWYDSVTEKDRQEATELLLTFKQRPHIIEVETSSKKYVIAHADYPDDSYDYGKQVDIDSVL
WSRDRLLGSLQGNIHPIRGADTFIFGHMIVDYTTTFANQIYIDTGSFCSGNLSFFKIK
>P76395 3.1.3.16~~~pphC~~~Serine/threonine-protein phosphatase 3~~~COG0631
MSWRLVYASTVGTSHISADLPCQDACQMQIAWLNDQQPLLSVFVADGAGSVSQGGEGAMLAVNEAMAYMSQKVQGGELGL
NDVLATNMVLTIRQRLFAEAEAKELAVRDFACTFLGLISSPDGTLIMQIGDGGVVVDLGHGLQLPLTPMAGEYANMTHFI
TDEDAVSRLETFTSTGRAHKVAAFTDGIQRLALNMLDNSPHVPFFTPFFNGLAAATQEQLDLLPELLKQFLSSPAVNERT
DDDKTLALALWAE
>Q937P0 4.1.3.30~~~prpB~~~2-methylisocitrate lyase~~~
MTYSASDLARSAGARFRQALADEHPLQVVGTINANHALLAKRAGYRAIYLSGGGVAAGSLGLPDLGISNLDDVLTDIRRI
TDVCDTPLLVDVDTGFGASAFNVARTTKSLIKFGAAAMHIEDQVGAKRCGHRPNKEIVTQGEMVDRIRAAVDARTDENFV
IMARTDALAVEGLDKAIERAVACAEAGADAIFPEAMTDLAMYRKFVDAVKVPVLANITEFGATPLFTTEELDGAGVSMVL
LPLSAFRAMNKAAENVYAAIRQDGTQKNVVDTMQTRAELYESIGYHDFEQKLDALFAQGKGK
>P77541 4.1.3.30~~~prpB~~~2-methylisocitrate lyase~~~COG2513
MSLHSPGKAFRAALTKENPLQIVGTINANHALLAQRAGYQAIYLSGGGVAAGSLGLPDLGISTLDDVLTDIRRITDVCSL
PLLVDADIGFGSSAFNVARTVKSMIKAGAAGLHIEDQVGAKRCGHRPNKAIVSKEEMVDRIRAAVDAKTDPDFVIMARTD
ALAVEGLDAAIERAQAYVEAGAEMLFPEAITELAMYRQFADAVQVPILANITEFGATPLFTTDELRSAHVAMALYPLSAF
RAMNRAAEHVYNVLRQEGTQKSVIDTMQTRNELYESINYYQYEEKLDNLFARSQVK
>Q56062 4.1.3.30~~~prpB~~~2-methylisocitrate lyase~~~
MSLHSPGQAFRAALAKENPLQIVGAINANHALLAQRAGYQAIYLSGGGVAAGSLGLPDLGISTLDDVLTDIRRITDVCPL
PLLVDADIGFGSSAFNVARTVKSIAKAGAAALHIEDQVGAKRCGHRPNKAIVSKEEMVDRIRAAVDARTDPNFVIMARTD
ALAVEGLEAALDRAQAYVDAGADMLFPEAITELSMYRRFADVAQVPILANITEFGATPLFTTDELRSAHVAMALYPLSAF
RAMNRAAEKVYTVLRQEGTQKNVIDIMQTRNELYESINYYQFEEKLDALYRNKKS
>Q8NSH7 2.3.3.5~~~prpC1~~~2-methylcitrate synthase 1~~~COG0372
MSDSQVRKGLNGVISDYTSISKVMPESNSLTYRGYAVEDLVENCSFEEVIYLLWFGELPTTEQLRTFNTTGRSYRSLDAG
LISLIHSLPNTCHPMDVLRTAVSYMGTFDPDPFTRDADHIRSIGHNLLAQLPMVVAMDIRRRSGEEIIAPDHNKGIASNF
LSMVFGNDDGSVANSADDIRDFERSLILYAEHSFNASTFSARVISSTRSDTYSAITGAIGALKGPLHGGANEFVMHTMLD
IDDPNNAADWMGKALDRKERIMGFGHRVYKNGDSRVPSMEKSMRSLAARHRGQKWVHMYESMQEVMEARTGIKPNLDFPA
GPAYYMLGFPVDFFTPLFVLARVSGWTAHIVEQFENNALIRPLSAYNGVEEREVVPISERT
>Q8NSL1 2.3.3.5~~~prpC2~~~2-methylcitrate synthase 2~~~COG0372
MSSATTTDVRKGLYGVIADYTAVSKVMPETNSLTYRGYAVEDLVENCSFEEVFYLLWHGELPTAQQLAEFNERGRSYRSL
DAGLISLIHSLPKEAHPMDVMRTAVSYMGTKDSEYFTTDSEHIRKVGHTLLAQLPMVLAMDIRRRKGLDIIAPDSSKSVA
ENLLSMVFGTGPESPASNPADVRDFEKSLILYAEHSFNASTFTARVITSTKSDVYSAITGAIGALKGPLHGGANEFVMHT
MLAIDDPNKAAAWINNALDNKNVVMGFGHRVYKRGDSRVPSMEKSFRELAARHDGEKWVAMYENMRDAMDARTGIKPNLD
FPAGPAYHLLGFPVDFFTPLFVIARVAGWTAHIVEQYENNSLIRPLSEYNGEEQREVAPIEKR
>O34002 2.3.3.5~~~gltA~~~2-methylcitrate synthase~~~
MTEPTIHKGLAGVTADVTAISKVNSDTNSLLYRGYPVQELAAKCSFEQVAYLLWNSELPNDSELKAFVNFERSHRKLDEN
VKGAIDLLSTACHPMDVARTAVSVLGANHARAQDSSPEANLEKAMSLLATFPSVVAYDQRRRRGEELIEPREDLDYSANF
LWMTFGEEAAPEVVEAFNVSMILYAEHSFNASTFTARVITSTLADLHSAVTGAIGALKGPLHGGANEAVMHTFEEIGIRK
DESLDEAATRSKAWMVDALAQKKKVMGFGHRVYKNGDSRVPTMKSALDAMIKHYDRPEMLGLYNGLEAAMEEAKQIKPNL
DYPAGPTYNLMGFDTEMFTPLFIAARITGWTAHIMEQVADNALIRPLSEYNGPEQRQVP
>O34779 3.1.3.16~~~prpC~~~Protein phosphatase PrpC~~~COG0631
MLTALKTDTGKIRQHNEDDAGIFKGKDEFILAVVADGMGGHLAGDVASKMAVKAMGEKWNEAETIPTAPSECEKWLIEQI
LSVNSKIYDHAQAHEECQGMGTTIVCALFTGKTVSVAHIGDSRCYLLQDDDFVQVTEDHSLVNELVRTGEISREDAEHHP
RKNVLTKALGTDQLVSIDTRSFDIEPGDKLLLCSDGLTNKVEGTELKDILQSDSAPQEKVNLLVDKANQNGGEDNITAVL
LELALQVEEGEDQC
>Q937N9 2.3.3.5~~~prpC~~~2-methylcitrate synthase~~~
MSEAQPLVTPKPKKSVALSGVTAGNTALCTVGRTGNDLHYRGYDILDIAETCEFEEIAHLLVHGKLPTKSELAAYKAKLK
SLRGLPANVKAALEWVPASAHPMDVMRTGVSVLGTVLPEKEDHNTPGARDIADRLMASLGSMLLYWYHYSHNGRRIEVET
DDDSIGGHFLHLLHGEKPSALWERAMNTSLNLYAEHEFNASTFTARVIAGTGSDMYSSISGAIGALRGPKHGGANEVAFE
IQKRYDNPDEAQADITRRVENKEVVIGFGHPVYTTGDPRNQVIKEVAKKLSKDAGSMKMFDIAEALETVMWDIKKMFPNL
DWFSAVSYHMMGVPTAMFTALFVIARTSGWAAHIIEQRIDNKIIRQSANYTGPENLKFVPLKDRK
>P31660 2.3.3.5~~~prpC~~~2-methylcitrate synthase~~~COG0372
MSDTTILQNSTHVIKPKKSVALSGVPAGNTALCTVGKSGNDLHYRGYDILDLAKHCEFEEVAHLLIHGKLPTRDELAAYK
TKLKALRGLPANVRTVLEALPAASHPMDVMRTGVSALGCTLPEKEGHTVSGARDIADKLLASLSSILLYWYHYSHNGERI
QPETDDDSIGGHFLHLLHGEKPSQSWEKAMHISLVLYAEHEFNASTFTSRVIAGTGSDMYSAIIGAIGALRGPKHGGANE
VSLEIQQRYETPDEAEADIRKRVENKEVVIGFGHPVYTIADPRHQVIKRVAKQLSQEGGSLKMYNIADRLETVMWESKKM
FPNLDWFSAVSYNMMGVPTEMFTPLFVIARVTGWAAHIIEQRQDNKIIRPSANYVGPEDRPFVALDKRQ
>H8F0D7 2.3.3.5~~~gltA1~~~2-methylcitrate synthase~~~
MTGPLAAARSVAATKSMTAPTVDERPDIKKGLAGVVVDTTAISKVVPQTNSLTYRGYPVQDLAARCSFEQVAFLLWRGEL
PTDAELALFSQRERASRRVDRSMLSLLAKLPDNCHPMDVVRTAISYLGAEDPDEDDAAANRAKAMRMMAVLPTIVAIDMR
RRRGLPPIAPHSGLGYAQNFLHMCFGEVPETAVVSAFEQSMILYAEHGFNASTFAARVVTSTQSDIYSAVTGAIGALKGR
LHGGANEAVMHDMIEIGDPANAREWLRAKLARKEKIMGFGHRVYRHGDSRVPTMKRALERVGTVRDGQRWLDIYQVLAAE
MASATGILPNLDFPTGPAYYLMGFDIASFTPIFVMSRITGWTAHIMEQATANALIRPLSAYCGHEQRVLPGTF
>I6Y9Q3 2.3.3.5~~~prpC~~~2-methylcitrate synthase~~~COG0372
MTGPLAAARSVAATKSMTAPTVDERPDIKKGLAGVVVDTTAISKVVPQTNSLTYRGYPVQDLAARCSFEQVAFLLWRGEL
PTDAELALFSQRERASRRVDRSMLSLLAKLPDNCHPMDVVRTAISYLGAEDPDEDDAAANRAKAMRMMAVLPTIVAIDMR
RRRGLPPIAPHSGLGYAQNFLHMCFGEVPETAVVSAFEQSMILYAEHGFNASTFAARVVTSTQSDIYSAVTGAIGALKGR
LHGGANEAVMHDMIEIGDPANAREWLRAKLARKEKIMGFGHRVYRHGDSRVPTMKRALERVGTVRDGQRWLDIYQVLAAE
MASATGILPNLDFPTGPAYYLMGFDIASFTPIFVMSRITGWTAHIMEQATANALIRPLSAYCGHEQRVLPGTF
>Q56063 2.3.3.5~~~prpC~~~2-methylcitrate synthase~~~
MTDTTILQNNTHVIKPKKSVALSGVPAGNTALCTVGKSGNDLHYRGYDILDLAEHCEFEEVAHLLIHGKLPTRDELNAYK
SKLKALRGLPANVRTVLEALPAASHPMDVMRTGVSALGCTLPEKEGHTVSGARDIADKLLASLSSILLYWYHYSHNGERI
QPETDDDSIGGHFLHLLHGEKPTQSWEKAMHISLVLYAEHEFNASTFTSRVIAGTGSDVYSAIIGAIGALRGPKHGGANE
VSLEIQQRYETPDEAEADIRKRVENKEVVIGFGHPVYTIADPRHQVIKRVAKQLSEEGGSLKMYHIADRLETVMWETKKM
FPNLDWFSAVSYNMMGVPTEMFTPLFVIARVTGWAAHIIEQRQDNKIIRPSANYTGPEDRPFVSIDDRC
>Q8NSL3 4.2.1.79~~~prpD2~~~2-methylcitrate dehydratase 2~~~COG2079
MINHEVRTHRSAEEFPYEEHLAHKIARVAADPVEVAADTQEMIINRIIDNASVQAASVLRRPVSSARAMAQVRPVTDGRG
ASVFGLPGRYAAEWAALANGTAVRELDFHDTFLAAEYSHPGDNIPPILAAAQQAGKGGKDLIRGIATGYEIQVNLVRGMC
LHEHKIDHVAHLGPSAAAGIGTLLDLDVDTIYQAIGQALHTTTATRQSRKGAISSWKAFAPAFAGKMSIEAVDRAMRGEG
APSPIWEGEDGVIAWLLSGLDHIYTIPLPAEGEAKRAILDTYTKEHSAEYQSQAPIDLARSMGEKLAAQGLDLRDVDSIV
LHTSHHTHYVIGTGSNDPQKFDPDASRETLDHSIMYIFAVALEDRAWHHERSYAPERAHRRETIELWNKISTVEDPEWTR
RYHSVDPAEKAFGARAVITFKDGTVVEDELAVADAHPLGARPFAREQYIQKFRTLAEGVVSEKEQDRFLDAAQRTHELED
LSELNIELDADILAKAPVIPEGLF
>P77243 4.2.1.79~~~prpD~~~2-methylcitrate dehydratase~~~COG2079
MSAQINNIRPEFDREIVDIVDYVMNYEISSKVAYDTAHYCLLDTLGCGLEALEYPACKKLLGPIVPGTVVPNGVRVPGTQ
FQLDPVQAAFNIGAMIRWLDFNDTWLAAEWGHPSDNLGGILATADWLSRNAVASGKAPLTMKQVLTAMIKAHEIQGCIAL
ENSFNRVGLDHVLLVKVASTAVVAEMLGLTREEILNAVSLAWVDGQSLRTYRHAPNTGTRKSWAAGDATSRAVRLALMAK
TGEMGYPSALTAPVWGFYDVSFKGESFRFQRPYGSYVMENVLFKISFPAEFHSQTAVEAAMTLYEQMQAAGKTAADIEKV
TIRTHEACIRIIDKKGPLNNPADRDHCIQYMVAIPLLFGRLTAADYEDNVAQDKRIDALREKINCFEDPAFTADYHDPEK
RAIANAITLEFTDGTRFEEVVVEYPIGHARRRQDGIPKLVDKFKINLARQFPTRQQQRILEVSLDRARLEQMPVNEYLDL
YVI
>H8F0D6 4.2.1.79~~~~~~2-methylcitrate dehydratase~~~
MVRIMLMHAVRAWRSADDFPCTEHMAYKIAQVAADPVDVDPEVADMVCNRIIDNAAVSAASMVRRPVTVARHQALAHPVR
HGAKVFGVEGSYSADWAAWANGVAARELDFHDTFLAADYSHPADNIPPLVAVAQQLGVCGAELIRGLVTAYEIHIDLTRG
ICLHEHKIDHVAHLGPAVAAGIGTMLRLDQETIYHAIGQALHLTTSTRQSRKGAISSWKAFAPAHAGKVGIEAVDRAMRG
EGSPAPIWEGEDGVIAWLLAGPEHTYRVPLPAPGEPKRAILDSYTKQHSAEYQSQAPIDLACRLRERIGDLDQIASIVLH
TSHHTHVVIGTGSGDPQKFDPDASRETLDHSLPYIFAVALQDGCWHHERSYAPERARRSDTVALWHKISTVEDPEWTRRY
HCADPAKKAFGARAEVTLHSGEVIVDELAVADAHPLGTRPFERKQYVEKFTELADGVVEPVEQQRFLAVVESLADLESGA
VGGLNVLVDPRVLDKAPVIPPGIFR
>O06582 4.2.1.79~~~prpD~~~2-methylcitrate dehydratase~~~COG2079
MPDQDTKVRFFRVFCWCPVLRMVRIMLMHAVRAWRSADDFPCTEHMAYKIAQVAADPVDVDPEVADMVCNRIIDNAAVSA
ASMVRRPVTVARHQALAHPVRHGAKVFGVEGSYSADWAAWANGVAARELDFHDTFLAADYSHPADNIPPLVAVAQQLGVC
GAELIRGLVTAYEIHIDLTRGICLHEHKIDHVAHLGPAVAAGIGTMLRLDQETIYHAIGQALHLTTSTRQSRKGAISSWK
AFAPAHAGKVGIEAVDRAMRGEGSPAPIWEGEDGVIAWLLAGPEHTYRVPLPAPGEPKRAILDSYTKQHSAEYQSQAPID
LACRLRERIGDLDQIASIVLHTSHHTHVVIGTGSGDPQKFDPDASRETLDHSLPYIFAVALQDGCWHHERSYAPERARRS
DTVALWHKISTVEDPEWTRRYHCADPAKKAFGARAEVTLHSGEVIVDELAVADAHPLGTRPFERKQYVEKFTELADGVVE
PVEQQRFLAVVESLADLESGAVGGLNVLVDPRVLDKAPVIPPGIFR
>Q8Z903 4.2.1.79~~~prpD~~~2-methylcitrate dehydratase~~~COG2079
MSAHISNVRPDFDREIVDIVDYVMNYEITSKVAYDTAHYCLLDTLGCGLEALEYPACKKLLGPIVPGTVVPNGARVPGTQ
FQLDPVQAAFNISAMIRWLDFNDTWLAAEWGHPSDNLGGILATADWLSRNAVAAGKAPLTMKQVLSGMIKAHEIQGCIAL
ENAFNRVGLDHVLLVKVASTAVVAEMLGLTRDEILNAVSLAWVDGQSLRTYRHAPNTGTRKSWAAGDATSRAVRLALMAK
TGEMGYPSALTAKTWGFYDVSFKGETFRFQRPYGSYVMENVLFKISFPAEFHSQTAVEAAMTLYEQMQAAGKTAADIEKV
TIRTHEACLRIIDKKGPLNNPADRDHCIQYMVAVPLLFGRLTAADYEDEVAQDKRIDALREKIVCYEDPAFTADYHDPEK
RAIGNAITVEFTDGSRFGEVVVEYPIGHARRRADGIPKLIEKFKINLARQFLTRQQQRILDVSLDRARLEQMPVNEYLDL
YII
>P74840 4.2.1.79~~~prpD~~~2-methylcitrate dehydratase~~~
MSTQELNIRPDFDREIVDIVDYVMNYEITSKVAYDTAHYCLLDTLGCGLEALEYPACKKLLGPIVPGTVVPNGARVPGTQ
FQLDPVQAAFNIGAMIRWLDFNDTWLAAEWGHPSDNLGGILATADWLSRNAVAAGKAPLTMKQVLSGMIKAHEIQGCIAL
ENAFNRVGLDHVLLVKVASTAVVAEMLGLTRDEILNAVSLAWVDGQSLRTYRHAPNTGTRKSWAAGDATSRAVRLALMAK
TGEMGYPSALTAKTWGFYDVSFKGETFRFQRPYGSYVMENVLFKISFPAEFHSQTAVEAAMTLYEQMQAAGKTAADIEKV
TIRTHEACLRIIDKKGPLNNPADRDHCIQYMVAVPLLFGRLTAADYEDEVAQDKRIDALREKIVCYEDPAFTADYHDPEK
RAIGNAITVEFTDGSRFGEVVVEYPIGHARRRADGIPKLIEKFKINLARQFPTRQQQRILDVSLDRARLEQMPVNEYLDL
YVI
>O31614 3.6.1.17~~~prpE~~~Bis(5'-nucleosyl)-tetraphosphatase PrpE [asymmetrical]~~~COG0639
MAYDIISDIHGCYDEMTALIQKLGYTIKNGVPVHEEGRVLVFAGDLTDRGPKSIEVIRFVAGAYEKGAVRYVPGNHCNKL
YRYLKGNPVKVMHGLETTAAELEELSKDEKKSVSEQFMKLYETAPLYDILHNGELVVAHAGIRADDIGKYTRRVKDFVLY
GDVTGETYPDGRPIRRDWAAAYNGKAWVVYGHTPVKEPRKVNRTINIDTGCVFGNQLTGFRFPEIETVSVPSSLPYDESR
FRPI
>Q937N7 5.3.3.-~~~~~~2-methyl-aconitate isomerase~~~
MTHVPQIKIPATYIRGGTSKGVFFRLQDLPETAQVPGPARDALLMRVIGSPDPYGKQIDGMGAATSSTSKTVILSKSTRP
DHDVDYLFGQVSIDQPFVDWSGNCGNLSAAVGPFAISAGLVDASRIPHNGVAVVRIWQANIGKTIIGHVPVTNGEVQETG
DFELDGVTFPAAEVQLEFMDPAAEEEGAGGAMFPTGNVVDDLEVPAVGTLKATMINAGIPTIFVNAESIGYTGTELQDAI
NSDTRALAMFEDHPCYGALRMGLIKNVDEAAKRQHTPKVAFVRQAGDYVASSGKKVAAADVDLLVRALSMGKLHHAMMGT
AAVAIGTAAAIPGTLVNLAAGGGERNAVRFGHPSGTLRVGAEAQQVDGEWAVKKAIMSRSARVLMEGWVRVPGDAF
>Q8EJW4 5.3.3.-~~~prpF~~~2-methyl-aconitate isomerase~~~COG2828
MSNKLFPPQIKVAATYMRGGTSKGVFFRLQDLPEAAQVPGPARDALLLRVIGSPDPYAKQIDGMGGATSSTSKTVILSHS
SKANHDVDYLFGQVSIDKPFVDWSGNCGNLTAAVGAFAISNGLIDAARIPRNGVCTVRIWQANIGKTIIAHVPITDGAVQ
ETGDFELDGVTFPAAEVQIEFMNPAADDDGEGGCMFPTGNLVDVLEVPGIGRFNATMINAGIPTIFINAEDLGYTGTELQ
DDINSDNAALAKFETIRAHGALRMGLIKHIDEAASRQHTPKIAFVAPPKSYASSSGKTVAAEDVDLLVRALSMGKLHHAM
MGTAAVAIGTAAAIPGTLVNLAAGGGEKEAVRFGHPSGTLRVGAQAVQENGEWTVIKAIMSRSARVLMEGFVRVPKP
>P77743 ~~~prpR~~~Propionate catabolism operon regulatory protein~~~COG3829
MAHPPRLNDDKPVIWTVSVTRLFELFRDISLEFDHLANITPIQLGFEKAVTYIRKKLANERCDAIIAAGSNGAYLKSRLS
VPVILIKPSGYDVLQALAKAGKLTSSIGVVTYQETIPALVAFQKTFNLRLDQRSYITEEDARGQINELKANGTEAVVGAG
LITDLAEEAGMTGIFIYSAATVRQAFSDALDMTRMSLRHNTHDATRNALRTRYVLGDMLGQSPQMEQVRQTILLYARSSA
AVLIEGETGTGKELAAQAIHREYFARHDARQGKKSHPFVAVNCGAIAESLLEAELFGYEEGAFTGSRRGGRAGLFEIAHG
GTLFLDEIGEMPLPLQTRLLRVLEEKEVTRVGGHQPVPVDVRVISATHCNLEEDMQQGRFRRDLFYRLSILRLQLPPLRE
RVADILPLAESFLKVSLAALSAPFSAALRQGLQASETVLLHYDWPGNIRELRNMMERLALFLSVEPTPDLTPQFMQLLLP
ELARESAKTPAPRLLTPQQALEKFNGDKTAAANYLGISRTTFWRRLKS
>O06581 ~~~prpR~~~HTH-type transcriptional regulator PrpR~~~COG1396
MTRSNVLPVARTYSRTFSGARLRRLRQERGLTQVALAKALDLSTSYVNQLENDQRPITVPVLLLLTERFDLSAQYFSSDS
DARLVADLSDVFTDIGVEHAVSGAQIEEFVARMPEVGHSLVAVHRRLRAATEELEGYRSRATAETELPPARPMPFEEVRD
FFYDRNNYIHDLDMAAERMFTESGMRTGGLDIQLAELMRDRFGISVVIDDNLPDTAKRRYHPDTKVLRVAHWLMPGQRAF
QIATQLALVGQSDLISSIVATDDQLSTEARGVARIGLANYFAGAFLLPYREFHRAAEQLRYDIDLLGRRFGVGFETVCHR
LSTLQRPRQRGIPFIFVRTDKAGNISKRQSATAFHFSRVGGSCPLWVVHDAFAQPERIVRQVAQMPDGRSYFWVAKTTAA
DGLGYLGPHKNFAVGLGCDLAHAHKLVYSTGVVLDDPSTEVPIGAGCKICNRTSCAQRAFPYLGGRVAVDENAGSSLPYS
STEQSV
>P9WGM1 ~~~prrA~~~Transcriptional regulatory protein PrrA~~~COG0745
MDTGVTSPRVLVVDDDSDVLASLERGLRLSGFEVATAVDGAEALRSATENRPDAIVLDINMPVLDGVSVVTALRAMDNDV
PVCVLSARSSVDDRVAGLEAGADDYLVKPFVLAELVARVKALLRRRGSTATSSSETITVGPLEVDIPGRRARVNGVDVDL
TKREFDLLAVLAEHKTAVLSRAQLLELVWGYDFAADTNVVDVFIGYLRRKLEAGGGPRLLHTVRGVGFVLRMQ
>P9WGK7 2.7.13.3~~~prrB~~~Sensor-type histidine kinase PrrB~~~COG2205
MNILSRIFARTPSLRTRVVVATAIGAAIPVLIVGTVVWVGITNDRKERLDRRLDEAAGFAIPFVPRGLDEIPRSPNDQDA
LITVRRGNVIKSNSDITLPKLQDDYADTYVRGVRYRVRTVEIPGPEPTSVAVGATYDATVAETNNLHRRVLLICTFAIGA
AAVFAWLLAAFAVRPFKQLAEQTRSIDAGDEAPRVEVHGASEAIEIAEAMRGMLQRIWNEQNRTKEALASARDFAAVSSH
ELRTPLTAMRTNLEVLSTLDLPDDQRKEVLNDVIRTQSRIEATLSALERLAQGELSTSDDHVPVDITDLLDRAAHDAARI
YPDLDVSLVPSPTCIIVGLPAGLRLAVDNAIANAVKHGGATLVQLSAVSSRAGVEIAIDDNGSGVPEGERQVVFERFSRG
STASHSGSGLGLALVAQQAQLHGGTASLENSPLGGARLVLRLPGPS
>Q9AFF7 ~~~~~~Blue-light absorbing proteorhodopsin~~~
MGKLLLILGSAIALPSFAAAGGDLDISDTVGVSFWLVTAGMLAATVFFFVERDQVSAKWKTSLTVSGLITGIAFWHYLYM
RGVWIDTGDTPTVFRYIDWLLTVPLQVVEFYLILAACTSVAASLFKKLLAGSLVMLGAGFAGEAGLAPVLPAFIIGMAGW
LYMIYELYMGEGKAAVSTASPAVNSAYNAMMMIIVVGWAIYPAGYAAGYLMGGEGVYASNLNLIYNLADFVNKILFGLII
WNVAVKESSNA
>Q9F7P4 ~~~~~~Green-light absorbing proteorhodopsin~~~
MKLLLILGSVIALPTFAAGGGDLDASDYTGVSFWLVTAALLASTVFFFVERDRVSAKWKTSLTVSGLVTGIAFWHYMYMR
GVWIETGDSPTVFRYIDWLLTVPLLICEFYLILAAATNVAGSLFKKLLVGSLVMLVFGYMGEAGIMAAWPAFIIGCLAWV
YMIYELWAGEGKSACNTASPAVQSAYNTMMYIIIFGWAIYPVGYFTGYLMGDGGSALNLNLIYNLADFVNKILFGLIIWN
VAVKESSNA
>Q6J4G7 ~~~~~~Green-light absorbing proteorhodopsin~~~
MGKLLLILGSVIALPTFAAGGGDLDASDYTGVSFWLVTAALLASTVFFFVERDRVSAKWKTSLTVSGLVTGIAFWHYMYM
RGVWIETGDSPTVFRYIDWLLTVPLLICEFYLILAAATNVAGSLFKKLLVGSLVMLVFGYMGEAGIMAAWPAFIIGCLAW
VYMIYELWAGEGKSACNTASPAVQSAYNTMMYIIIFGWAIYPVGYFTGYLMGDGGSALNLNLIYNLADFVNKILFGLIIW
NVAVKESSNA
>Q81U45 5.2.1.8~~~prsA1~~~Foldase protein PrsA 1~~~COG0760
MKKAMLALAATSVIALSACGTSSSDKIVTSKAGDITKDEFYEQMKTQAGKQVLNNMVMEKVLIKNYKVEDKEVDKKYDEM
KKQYGDQFDTLLKQQGIKEETLKTGVRAQLAQEKAIEKTITDKELKDNYKPEIKASHILVKDEATAKKVKEELGQGKSFE
ELAKQYSEDTGSKEKGGDLGFFGAGKMVKEFEDAAYKLKKDEVSEPVKSQFGYHIIKVTDIKEPEKSFEQSKADIKKELV
AKKSQDGEFMNDLMMKEIKKADVKVDDKDLKDLFEEKKADAKKEEKK
>Q8Y759 5.2.1.8~~~prsA1~~~Foldase protein PrsA 1~~~COG0760
MTKLKKVMISVIAATLLLLAGCGSSAVIKTDAGSVTQDELYEAMKTTYGNEVVQQLTFKKILEDKYTVTEKEVNAEYKKY
EEQYGDSFESTLSSNNLTKTSFKENLEYNLLVQKATEANMDVSESKLKAYYKTWEPDITVRHILVDDEATAKEIQTKLKN
GEKFTDLAKEYSTDTATSTNGGLLDPFGPGEMDETFEKAAYALENKDDVSGIVKSTYGYHLIQLVKKTEKGTYAKEKANV
KAAYIKSQLTSENMTAALKKELKAANIDIKDSDLKDAFADYTSTSSTSSTTTSN
>Q81TU1 5.2.1.8~~~prsA2~~~Foldase protein PrsA 2~~~COG0760
MRGKHIFIITALISILMLAACGQKNSSATVATATDSTITKSDFEKQLKDRYGKDMLYEMIAQDVITKKYKVSDDDVDKEV
QKAKSQYGDQFKNVLKNNGLKDEADFKNQIKFKLSMNKAIKQSVTEKDVKDHYKPEIKASHILVSDENEAKEIKKKLDTG
ASFEELAKQESQDLLSKEKGGDLGYFHSGAMTPEFETAAYKLKIGQISDPVQSPNGYHIIKLTGKKDLKPYDEVKNSIRK
NLEEERTADPIFGKKLLQSELKKANIKINDSELEDTFTIVSPQGN
>Q81QT1 5.2.1.8~~~prsA3~~~Foldase protein PrsA 3~~~COG0760
MKKKKLFLGTIISCVVLALSACGSSDNVVTSKVGNITEKELSKELRQKYGESTLYQMVLSKALLDKYKVSDEEAKKQVEE
AKDKMGDNFKSTLEQVGLKNEDELKEKMKPEIAFEKAIKATVTEKDVKDNYKPEMKVSHILVKDEKTAKEVKEKVNNGED
FAALAKQYSEDTGSKEQGGEITGFAPGQTVKEFEEAAYKLDAGQVSEPVKTTYGYHIIKVTDKKELKPFDEVKDSIRKDI
EQQRLQDTTGKWKQQVVNELLKDADIKVNDKEFKNTFEFLEKK
>Q81CB1 5.2.1.8~~~prsA4~~~Foldase protein PrsA 4~~~
MKRKKLVIGSILMGMTLSLSACGSSDNIVTTKSGSISESDFNKKLKENYGKQNLSEMVVEKVLHDKYKVTDEEVTKQLEE
LKDKMGDNFNTYMESNGVKNEDQLKEKLKLTFAFEKAIKATVTEKDIKDHYKPKLQVSHILVKDEKTAKEIKEKLNSGED
FAALAKQYSEDPGSKEKGGELSEFGPGMMVKEFEDAAYKLEVGQLSEPVKSSFGYHIIKLTDKKELKPYEEEKENIRKEL
EQQRIQDPQFHQQVTRDLLKNADIKVSDKDLKDTFKELEK
>P24327 5.2.1.8~~~prsA~~~Foldase protein PrsA~~~COG0760
MKKIAIAAITATSILALSACSSGDKEVIAKTDAGDVTKGELYTNMKKTAGASVLTQLVQEKVLDKKYKVSDKEIDNKLKE
YKTQLGDQYTALEKQYGKDYLKEQVKYELLTQKAAKDNIKVTDADIKEYWEGLKGKIRASHILVADKKTAEEVEKKLKKG
EKFEDLAKEYSTDSSASKGGDLGWFAKEGQMDETFSKAAFKLKTGEVSDPVKTQYGYHIIKKTEERGKYDDMKKELKSEV
LEQKLNDNAAVQEAVQKVMKKADIEVKDKDLKDTFNTSSTSNSTSSSSSNSK
>Q9CEV9 5.2.1.8~~~prsA~~~Foldase protein PrsA~~~COG0760
MKFKKLGLVMTTVFAGAALVTLSGCSSSDSASKDIITMKGDTIRVSDLYKEAKQFPSQPTNTLLQNLTFDKIFTKDFGKE
VTDKDVSKKVKSIKDQYGSQFSSALQQQGLTEASFTPYMRTQMLEQAAIDHEIKETQYTDANLKKAWESYHPDVTAYVVS
ETSKDAATKALDAAKKDDAGKASFEKTNAESKVTFNSTSTSVPTEVQTAAFKLKNGEFSDVIESTSSSTGATSYYIVEMV
KTSEKGTDMNKYKKELQNVIKTEKEQDTTFVSGVIAKYLKKNNVTVKESAFASLFSQFTQTSSSSSSK
>P0C2B5 5.2.1.8~~~prsA~~~Foldase protein PrsA~~~
MKKKMRLKVLLASTATALLLLSGCQSNQTDQTVATYSGGKVTESSLYKELKQSPTTKTMLANMLIYRALNHAYGKSVSTK
TVNDAYDSYKQQYGENFDAFLSQNGFSRSSFKESLRTNFLSEVALKKLKKVSESQLKAAWKTYQPKVTVQHILTSDEDTA
KQVISDLAAGKDFAMLAKTDSIDTATKDNGGKISFELNNKTLDATFKDAAYKLKNGDYTQTPVKVTDGYEVIKMINHPAK
GTFTSSKKALTASVYAKWSRDSSIMQRVISQVLKNQHVTIKDKDLADALDSYKKLATTN
>P60747 5.2.1.8~~~prsA~~~Foldase protein PrsA~~~
MKMINKLIVPVTASALLLGACGASATDSKENTLISSKAGDVTVADTMKKIGKDQIANASFTEMLNKILADKYKNKVNDKK
IDEQIEKMQKQYGGKDKFEKALQQQGLTADKYKENLRTAAYHKELLSDKIKISDSEIKEDSKKASHILIKVKSKKSDKEG
LDDKEAKQKAEEIQKEVSKDPSKFGEIAKKESMDTGSAKKDGELGYVLKGQTDKDFEKALFKLKDGEVSEVVKSSFGYHI
IKADKPTDFNSEKQSLKEKLVDQKVQKNPKLLTDAYKDLLKEYDVDFKDRDIKSVVEDKILNPEKLKQGGAQGGQSGMSQ
>P60748 5.2.1.8~~~prsA~~~Foldase protein PrsA~~~
MKMINKLIVPVTASALLLGACGASATDSKENTLISSKAGDVTVADTMKKIGKDQIANASFTEMLNKILADKYKNKVNDKK
IDEQIEKMQKQYGGKDKFEKALQQQGLTADKYKENLRTAAYHKELLSDKIKISDSEIKEDSKKASHILIKVKSKKSDKEG
LDDKEAKQKAEEIQKEVSKDPSKFGEIAKKESMDTGSAKKDGELGYVLKGQTDKDFEKALFKLKDGEVSEVVKSSFGYHI
IKADKPTDFNSEKQSLKEKLVDQKVQKNPKLLTDAYKDLLKEYDVDFKDRDIKSVVEDKILNPEKLKQGGAQGGQSGMSQ
>Q8CVC6 5.2.1.8~~~prsA~~~Foldase protein PrsA~~~COG0760
MKKRTIATGLVTLLSIVTLAACSKTNQNSKIATMKGDTITVADFYNEVKNSTASKQAVLSLLVSKVFEKQYGDKVSDKEV
TKAYNEAAKYYGDSFSSALASRGYTKEDYKKQIRSEKLIEYAVKEEAKKEITDASYKSAYKDYKPEVTAQVIQLDSEDKA
KSVLEEAKADGADFAKIAKDNTKGDKTEYSFDSGSTNLPSQVLSAALNLDKDGVSDVIKASDSTTYKPVYYIVKITKKTD
KNADWKAYKKRLKEIIVSQKLNDSNFRNAVIGKAFKKANVKIKDKAFSEILSQYAAASGSGSSGSTTTTTAASSAATTAA
DDQTTAAETTAAE
>B1IBE1 5.2.1.8~~~prsA~~~Foldase protein PrsA~~~
MKKKLLAGAITLLSVATLAACSKGSEGADLISMKGDVITEHQFYEQVKNNPSAQQVLLNMTIQKVFEKQYGSELDDKEVD
DTIAEEKKQYGENYQRVLSQAGMTLETRKAQIRTSKLVELAVKKVAEAELTDEAYKKAFDEYTPDVTAQIIRLNNEDKAK
EVLEKAKAEGADFAQLAKDNSTDEKTKENGGEITFDSASTEVPEQVKKAAFALDVDGVSDVITATGTQAYSSQYYIVKLT
KKTEKSSNIDDYKEKLKTVILTQKQNDSTFVQSIIGKELQAANIKVKDQAFQNIFTQYIGGGDSSSSSSTSNE
>Q7ANN4 ~~~prsD~~~Type I secretion system ATP-binding protein PrsD~~~COG4618
MATSKGRNADPAAALRDCRAAFIGVGVASALVNLLYLTGSFFMLEVYDRILPSRSIPSLIALSLLALLLYAFQGAFELIR
GRMLVRIAGALDESLNGRIYRAIVKAPLKLRMQGDGLQALRDFDQVRSFLSGVGPAALFDLPWLPFYIAICFLFHPVIGL
IAIIGGLILTLLTYLTNRGTQAPAKKASEAGGLRNVFAQASQRNAEVVHAMGMSARLTALWERRNTEFRDENRRTSDIGN
GYGALSKVFRMALQSGVLAAGAVLVIRGEASPGIIIAGSILTARALAPVELAIGNWRGLVAARQSWQRLKELLNALPEAD
APLQLPDPHERLTVEGLASGPPAAQRLVVSDVNFTVRAGGAVGVIGPSASGKSSLARAILGIWPAYRGSVRLDGAALDQW
DSDALGKHVGYLPQDVELFAGTIAQNICRFAEDATSEAIVAAAKAARVNDLILRLPNGYDTEIGDGGMTLSAGQRQRVAL
ARALYGDPFLVVLDEPNSNLDAEGEQALSEAIMSVRSRGGIVIVVAHRPSALASVDLVLMMNEGRMQAFGPKEQVLGQVL
RPQQVERQNALKVVAEGQEAKQ
>Q7ANN5 ~~~prsE~~~Type I secretion system membrane fusion protein PrsE~~~COG0845
MKGWLQQHKPTARRSLSRHLIGVSVLALALVAGVGGWAATTELSSAIVAGGVVIVDDNVKKVQHLTGGIVGELLVKEGDR
VEAGQVLIRLDGTTVRANLAIIESTLAQFYARRARLQAERMGAASFEIEEDLAEFIPGTAAAKLIEGEQRLFASRRSALS
GMKGQLDSRKAQLADEVEGLTVQLNAIEEALKLIAEELTGVDSLFGQGLVPMQRVTTLKRQRAELEGGRGRHIAARAQAR
GKSSEIDLQILQLDEDRRSEISKELTDVEAKIAEYEERRTAATDQLRRLDITAPLSGRIYQLAIHTVNGVINPGETLMLV
VPEAEDLTVEAKVATHDIDQIRVGQSVEIRFSAFNQRTTPEVEAEVVTVAPDLVTDERTGASYYPLRIRPKAESLAKLKG
LSLYPGMPAEVFIKIADRTVISYLTKPLTDQMRHAFRED
>P42187 ~~~prsF~~~Minor fimbrial protein PrsF~~~
MIRLSLFISLLLTSVAVLADVQINIRGNVYIPPCTINNGQNIVVDFGNINPEHVDNSRGEVTKNISISCPYKSGSLWIKV
TGNTMGVGQNNVLATNITHFGIALYQGKGMSTPLTLGNGSGNGYRVTAGLDTARSTFTFTSVPFRNGSGILNGGDFRTTA
SMSMIYN
>P42191 ~~~prsK~~~Protein PrsK~~~
MIKSTGALLLFAALSAGQAMASDVAFRGNLLDRPCHVSGDSLNKHVVFKTRASRDFWYPPGRSPTESFVIRLENCHATAV
GKIVTLTFKGTEEALPGHLKVTGVNSGRLAIALLDTDGSSLLKPGASHNKGQGEKVTGNSLELPFGAYVVATPEALRTKS
VVPGDYEATATFELTYR
>P50738 3.4.-.-~~~prsW~~~Protease PrsW~~~COG2339
MFAIISAGIAPGIALLSYFYLKDQYDNEPVHMVLRSFFLGVVLVFPIMFIQYVLEKENVGGGSFFVSFLSSGFLEESLKW
FILMISVYPHAHFDEHYDGIVYGASVSLGFATLENILYLIGHGVEHAFVRALLPVSCHALIGVIMGFYLGKARFSADKAR
VKWLTLSLVVPSLLHGSYDFILTALSNWIYYMLPFMVFLWWFGLRKAKKARSVNMMQV
>Q07295 3.4.24.40~~~prtA~~~Serralysin A~~~
MEKNLSSRDDDALHSLSASSSYDSVYDLLHYHERGNGLTINGKPSYSIEDAGDQITRDNVSWNGSNVFGKSANLTFKFLQ
SARSTPDGDTGFVKFNAAQVAQAKLALQSWADLANITFTEVTGNQSANVTFGNYTRDSSGRLDYGTQAYAYLPGSGSASG
TTWYNYNVDNIRSPDKMEYGRQTLTHEIGHALGLNHPGDYNAGEGNPTYREDTRQFSIMSYWSEKNTGGDFKGHYAAGPM
LDDIDAIQRLYGANMTTRTGDTVYGFNSNTDRDFYTATSSSKALIFSVWDAGGKHTFDFSGYSNNQRINLNEGSLSDVGG
LKGNVSIAHGVTIENAIGGSGNDLLIGNNADNILRGGAGDDILYGGGGADRLYGGSGRDTFVYTAVSDSKVAAPDWILDF
QTGVDKIDLSALNTGNNLHFVNQFSGSGGEILLNWDSSASVSNLYLNLDNNTSPEFQVKIVGQVSQTADFVV
>P34025 3.4.24.-~~~mpl~~~Zinc metalloproteinase~~~COG3227
MKSKLICIIMVIAFQAHFNMAVKADSVGEERLRNNIQAKRNPADLKTLPDSCEAKDFYKNFKILDMTKDKLGVTHYTLAL
SSDGYLTDNDEIKVHVTPDNKITFINGDLQQGQLRITNQIKITEKNAIEKAFEAIGQSEAHVKSYIGNPVKEKEIIINSR
TKRLVYNIKLIFAEPEVASWIIQVDAETGAILKKQNMLSEVERADTHKDFQALGKGANRLLQRPLHVMKINDLFYLVDRT
HKGLIRTFDLNHKTDASFGKVVSNKTNMFTDPEFSSAVDAHFYASEVYDYYKNVHQLESLDGKGGEIDSFVHYGLNCNNA
FWDGREILYGDGDKKNFKPFSCAKTIVGHELTHAVIQYSAGLEYEGQSGALNESFADVFGYFIAPNHWLIGEDVCVRGLR
DGRIRSIKDPDKYNQAAHMKDYESLPITEEGDWGGVHFNSGIPNKAAYNTITKLGKEKTEQLYFRALKYYLTKKAQFTDA
KKALQQAAKDLYGEDASKKVAEAWEAVGVN
>P23224 3.4.24.-~~~mpl~~~Zinc metalloproteinase~~~COG3227
MKSKLICIIMVIAFQAHFTMTVKADSVGEEKLQNNTQAKKTPADLKALPDSCEAKDFYKNFKILDMTKDKLGVTHYTLAL
SSGGYLTDNDEIKVHVTPDNKITFINGDLQQGQLRITNQIKITEKNAIEKAFEAIGQSEAHVKSYVGNPVKEKEIILNSR
TKRLVYNIKLIFAEPEVASWIVQVDVETGAILKKQNMLSEVERADTHKDFQALGKGANRLLQRPLHVMKINDLFYLVDRT
HKGLIRTFDLKHNTDTSFGKVVSNKTNMFTDPEFSSAVDAHFYASEVYEYYKNVHQLESLDGKGGEIDSFVHYGLNCNNA
FWDGQEILYGDGDKKNFKPFSCAKTIVGHELTHAVIQYSAGLEYEGQSGALNESFADVFGYFIAPNHWLIGEDVCVRGSR
DGRIRSIKDPDKYNQAAHMKDYESLPLTEEGDWGGVHYNSGIPNKAAYNTITKLGKEKTEQLYFRALKYYLTKKSQFTDA
KKALQQAAKDLYGEDASKKVAEAWEAVGVN
>P82115 3.4.24.40~~~prtA~~~Serralysin~~~
MERYMSLKKKISYSELIGSAKANELQTQLQAYVPGKDPNIVVEHEPSKNAAKELIRGDYRWGHQGDDKSETFQLTYSFLE
SEPDNMPWHITGFSAFNEEQRTAAKLSIQSWTDVANINFTETTDSDKAHITFGFFDASLTGSYAFAYLPSPESKQSGTWY
NLKSRTFSENDIGVNGYGRQTFTHEIGHTLGLEHPAAYNASDKERPTYKKSATYFEDSRAYTVMSYFGEKNTRTDFKGIY
SSAPLLNDISAIQEVYGANNTTRTDDTVYGFNSNTDRDFFTAKDENSKLLFTAWDAGGNDTFDFSGFTQDQRINLNEASF
SDVGGLKGNVSIARGVTIENAIGGSGNDILIGNDAENILKGGAGDDIIYGGLGADQLWGGEGKDTFVYLSAKESPPLERD
WIHDFVSGEDKIDVSLFDLGEAGKGGVKFVREFTGAVGEAVLRYDTVNKVNDFAINLGDKFSYDDFWVKIVGEPILESDF
ILA
>P00776 3.4.21.80~~~sprA~~~Streptogrisin-A~~~
MTFKRFSPLSSTSRYARLLAVASGLVAAAALATPSAVAAPEAESKATVSQLADASSAILAADVAGTAWYTEASTGKIVLT
ADSTVSKAELAKVSNALAGSKAKLTVKRAEGKFTPLIAGGEAITTGGSRCSLGFNVSVNGVAHALTAGHCTNISASWSIG
TRTGTSFPNNDYGIIRHSNPAAADGRVYLYNGSYQDITTAGNAFVGQAVQRSGSTTGLRSGSVTGLNATVNYGSSGIVYG
MIQTNVCAEPGDSGGSLFAGSTALGLTSGGSGNCRTGGTTFYQPVTEALSAYGATVL
>P16316 3.4.24.40~~~prtB~~~Serralysin B~~~
MQQNEKASLNTSAAAAKAGSYTGYADVYDFWYYHQRGDGLTVNGKPSYTTDKAVSEGLTRPHTTWNGDNVFGKAANLTYS
FLNTFSSTPNGHTGPVKFTPVQMQQAKLSLQSWADAANLTFTEVSPNQKANITFANYTRNADGSLNTDTQAYAAYPGTHP
VSGSAWFNYNQSSIRNPDTDEYGRHSFTHEIGHALGLSHPAEYNAGEGDISYKNSAAYAEDSRQFSIMSYWEVENTGGDF
KGHYSAGPLMDDIAAIQKLYGANMTTRTGDTVYGFNSNTDRDFYTATNSSKALVFSVWDAGGTDTFDFSGYSNNQRINLN
EGSFSDVGGLKGNVSIAHGVTIENAIGGSGNDILIGNGADNILQGGAGDDVLYGSTGADTLTGGAGRDIFVYGSGQDSTV
SAYDWITDFQTGIDKIDLSAFRNEGQLSFVQDQFTGKGQEVMLQWDAANSTTNLWLHEAGHSSVDFLVRIVGQTAQSDII
V
>P00777 3.4.21.81~~~sprB~~~Streptogrisin-B~~~
MRIKRTSNRSNAARRVRTTAVLAGLAAVAALAVPTANAETPRTFSANQLTAASDAVLGADIAGTAWNIDPQSKRLVVTVD
STVSKAEINQIKKSAGANADALRIERTPGKFTKLISGGDAIYSSTGRCSLGFNVRSGSTYYFLTAGHCTDGATTWWANSA
RTTVLGTTSGSSFPNNDYGIVRYTNTTIPKDGTVGGQDITSAANATVGMAVTRRGSTTGTHSGSVTALNATVNYGGGDVV
YGMIRTNVCAEPGDSGGPLYSGTRAIGLTSGGSGNCSSGGTTFFQPVTEALSAYGVSVY
>P16317 3.4.24.40~~~prtC~~~Serralysin C~~~
MGKNLSLRQDDAQHALSANTSSAYNSVYDFLRYHDRGDGLTVNGKTSYSIDQAAAQITRENVSWNGTNVFGKSANLTFKF
LQSVSSIPSGDTGFVKFNAEQIEQAKLSLQSWSDVANLTFTEVTGNKSANITFGNYTRDASGNLDYGTQAYAYYPGNYQG
AGSSWYNYNQSNIRNPGSEEYGRQTFTHEIGHALGLAHPGEYNAGEGDPSYNDAVYAEDSYQFSIMSYWGENETGADYNG
HYGGAPMIDDIAAIQRLYGANMTTRTGDSVYGFNSNTDRDFYTATDSSKALIFSVWDAGGTDTFDFSGYSNNQRINLNEG
SFSDVGGLKGNVSIAHGVTIENAIGGSGNDILVGNSADNILQGGAGNDVLYGGAGADTLYGGAGRDTFVYGSGQDSTVAA
YDWIADFQKGIDKIDLSAFRNEGQLSFVQDQFTGKGQEVMLQWDAANSITNLWLHEAGHSSVDFLVRIVGQAAQSDIIV
>P52321 3.4.21.-~~~sprD~~~Streptogrisin-D~~~
MCVSRRRNSGRPILRVRAPHLLRARPHRRSKLKHRRISRKRATLAGSAVVALVAAGFTFQTANASDDVPAFGAKTLSADA
AGKLATTLDRDLGADAAGSYYDATAKTLVVNVVDEAGAEQVRQAGGKARIVENSLAELKSARGTLTEKATIPGTSWAVDP
VSNKVLVTADSTVDGAAWKKLSAVVEGLGGKAELNRTAGEFTPLIAGGDAIWGSGSRCSLGFNVVKGGEPYFLTAGHCTE
SVTSWSDTQGGSEIGANEGSSFPENDYGLVKYTSDTAHPSEVNLYDGSTQAITQAGDATVGQAVTRSGSTTQVHDGEVTA
LDATVNYGNGDIVNGLIQTTVCAEPGDSGGALFAGDTALGLTSGGSGDCSSGGTTFFQPVPEALAAYGAEIG
>Q07162 3.4.24.40~~~prtG~~~Serralysin G~~~
MALYGKKTDLSSASSGGYTGVADVYQLWHYHARGNGNIGDKPSYTLEQARDQITRGNITWNGEKVFGKSAALTYSFLQSV
ADSDMPDNFKGFVKFNAAQIQQTKLALQSWADVANVTFSEAKDGERATIQFGNYTLTPDGNTDNNSQAFGFYPGNWKWAG
SAWFNYNQADNQRPDINEFGRNTLTHEIGHTLGLYHPGDYDASDGNPGYKDVTYAEDTRQFSIMSYWNEGYTGGDFHGYH
AAAPHDIAAIQKLYGANMSTRTGDTVYGFHSNSGRDFYTATDSKTPLIFSVWDAGGNDTFDFSGYSANQRISLISGTFSD
VGGLKAMVSIAAGAVIENAIGGSGHDVIVGNLSDNRIDGGAGNDVLYGDGGADILTGGAGKDIFVYAWEKDSLSSAPDTI
TDFQRGEDRIDLSAFNKNHDLRFVDNFSGKGNEVVLNWDSQSHQTNMWLHLSGHETADFLVNIVGAALQPSDVIV
>Q99405 3.4.21.-~~~aprE~~~M-protease~~~COG1404
MKKPLGKIVASTALLISVAFSSSIASAAEEAKEKYLIGFNEQEAVSEFVEQIEANDDVAILSEEEEVEIELLHEFETIPV
LSVELSPEDVDALELDPTISYIEEDAEVTTMAQSVPWGISRVQAPAAHNRGLTGSGVKVAVLDTGISTHPDLNIRGGASF
VPGEPSTQDGNGHGTHVAGTIAALNNSIGVLGVAPSAELYAVKVLGASGSGSVSSIAQGLEWAGNNGMHVANLSLGSPSP
SATLEQAVNSATSRGVLVVAASGNSGAGSISYPARYANAMAVGATDQNNNRASFSQYGAGLDIVAPGVNVQSTYPGSTYA
SLNGTSMATPHVAGVAALVKQKNPSWSNVQIRNHLKNTATGLGNTNLYGSGLVNAEAATR
>Q06553 ~~~prtR~~~HTH-type transcriptional regulator PrtR~~~
MDKSTQIPPDSFAARLKQAMAMRNLKQETLAEAAGVSQNTIHKLTSGKAQSTRKLIEIAAALGVSPVWLQTGEGAPAARS
AVSVADGSPLVLEPLHPWDSDTPLDEDEVELPLYKEVEMSAGAGRTAVREIEGRKLRFSYATLRASGVDPSAAICAQLTG
NSMEPLIMDGSTIGVDTATTHITDGEIYALEHDGMLRVKFVYRLPGGGIRLRSFNREEYPDEEYSPEDMRSRQISMIGWV
FWWSTVRHRRGPSLVR
>A9YWT8 3.4.24.-~~~prtS~~~Protease PrtS~~~
MQIQNNNYKGLIPPYILQNIYKNTSESEKDNVLMTLNHTQSLMLDSVIKTSDSIDNTDDEVVSDTLHRSIYDAKNETKLP
GTLVRDEGDPDNGDVAVDNAYKYLEATYNFYKEVFNRNSLDDKGMKLIATVHYGKEYMNAYWGRGQMVFGDGDGKVFNNF
TTSIDVIGHELSHGVIEKTADLIYFFQSGALNESIADVFGSLVRQHYLKQKADEASWVVGEELLAKGIKGVGIRSMKEPG
KAYDDPLLGKNPQPGHMDDFKDYPIYRDNGGVHVNSGIPNKAFYNLAIKLGGYAWEKAGKIWYNTLLDKDLARDTTFLSF
AKLTVKHARDLFDEDVEKATIDSWKEVGIKVKEEDKDKGKDEGKDKAETKV
>P09489 3.4.21.-~~~~~~Extracellular serine protease~~~
MILNKRLKLAYCVFLGCYGLSIHSSLAAYQDPGRLGAPDSWKTAEFNRQWGLEAISAEFAYARGYTGKGITIGVIDNAIL
SHSEFSGKLTRLDNGSYNFSYDKQDNMSFGDHGTHVAGIAAAKRDGAGMHGVAFDADIIGTKLNDYGNRNGREELIQSAA
RVINNSWGIAPDIRRDAKGDIIWLPNGRPDYVAFVKSEVIAEMMRSKSSVEWGSEQPVPTGGHSAMSTLLRAARHGKLIV
FSAGNYNNYNIPEAQKSLPYAFPDVLNNYLIVTNLSDENQLSVSSTSCGQTASYCVSAPGSDIYSTVGRLESNTGGAVNR
EAYNKGELSLNPGYGNKSGTSMAAPHVTGVAAVLMQRFPYMSADQISAVIKTTATDLGVAGIDNLFGWGRVNLRDAINGP
KMFITKEDIPQEYYVPGSYSEKQFVVNIPGLGNIVEPGTPVERRCTSSECSFDSWSNDISGHGGLTKTGAGTLALLGNNT
YRGDTWVKQGVLAIDGSVASNVYIENSGTLSGEGTVGAFRAARSGSVAPGNGIGTLHVLHDAIFDRGSQYNVEVADNGRS
DKIAARRAFLNGGSVNVSLERSQNLLSQNEAQSLLGNKYTILTTTDGVTGRFENANPSYPFVKVALDYRGNDVGLGITRT
DASFDSLASTENEKAVARAVETLNATEPVTETAKRSVAIPAAEEANLLQSDGGEAQAVNEEASIVAGHPIYESFLGFTSA
RELQQATRQLSGQIHADMASAQINESRYLRDTATERLRQAEGRRTATDIKADDNGAWAKLLGSWGHASGNDNATGYQTST
YGVLLGLDSELFGDGRLGMMTGYTRTSLDGGYQSDAHSDNYHLGLYGDKRFGALALRAGGTYTWHRIDTSRSVNYGAQSD
REKAKYNARTGQLFIESGYDWTSDAVNLEPFANLAYTHYRNEEINEQGGAAALRGDKQSQSATASTLGLRADTEWQTDSV
AIALRGELGWQHQYGKLERKTQLMFKRTDAAFDVNSVPVSRDGAILKAGVDVSINKNAVLSLGYGGQLSSNHQDNSVNAG
LTWRF
>Q9KMU6 3.4.24.-~~~prtV~~~Pre-pro-metalloprotease PrtV~~~COG3291
MKTIKKTLLAAAIASFFSSGLYAQTPIDLGVVNEDKLIEMLVRTGQIPADASDVDKRIALERYLEEKIRSGFKGDAQFGK
KALEQRAKILKVIDKQKGPHKARVFALDVGQKRTDKVLALLIDFPDLPWDDNRLTKEHTEMLYDRYEPSHYQDLLFSDKG
YTGPNGENFISMRQYYESESGNSYSVSGQAAGWYRASKNAAYYGGNSPGTNNDMNARELVREALDQLARDPNINLADYDI
EDRYDYNGNGNFREPDGVIDHLMIFHASVGEEAGGGVLGADAIWSHRFNLGRYHVLEGTKSNVPGRFNGQFAAFDYTIQP
IDAAAGVCAHEYGHDLGLPDEYDTQYTGTGEPVSYWSIMSSGSWAGKIGGTQPTAFSSWAKQFLQNSIGGRWINHEQLSI
NELEAKPRVVTLFQTTDNSRPNMVKVTLPMKRVEGIKPAEGEFSFYSNRGDDLKNRMSRPLTIPAGSQATLRFKAWFQIE
KDYDYARVLINGKPIAGNITTMDDPFKSGLVPAISGQSDGWVDAQFDLSAWAGQTVELAFDYLTDGGLAMEGLYVDDLRL
EVDGNQTLIDNAEGTSSFAFQGFTKNGGFHEANHYYLLQWRSHNDVDQGLANLKRFGQLMSFEPGLLVWYVDESYADNWV
GKHPGEGWLGVVDADQNALVWSKTGEVAQTRFQVRDATFSLFDQAPLKLVTADGNTLEDMNLTANASFSDDQDYSSPQAP
DSGRKVMPFGLKIDLLSQSKENEYGVVRLSKVTTENIAPVARFELKVEGLSVMSQNTSSDSDGNIVSYLWDFGNGQTSTE
AAPTWSYTKAGSYSVTLTVTDDKGDSDTHQQTIKVDTPNALPQASANYIHLGRWVTMWSTSTDSDGRIVDTEWTLPNGKI
KRGRMFTAIFPSYGHHDVQLKVMDDRGAVTTITIKVKL
>C3LUP3 3.4.24.-~~~prtV~~~Pre-pro-metalloprotease PrtV~~~
MKTIKKTLLAAAIASFFSSGLYAQTPIDLGVVNEDKLIEMLVRTGQIPADASDVDKRIALERYLEEKIRSGFKGDAQFGK
KALEQRAKILKVIDKQKGPHKARVFALDVGQKRTDKVLALLIDFPDLPWDDNRLTKEHTEMLYDRYEPSHYQDLLFSDKG
YTGPNGENFISMRQYYESESGNSYSVSGQAAGWYRASKNAAYYGGNSPGTNNDMNARELVREALDQLARDPNINLADYDI
EDRYDYNGNGNFREPDGVIDHLMIFHASVGEEAGGGVLGADAIWSHRFNLGRYHVLEGTKSNVPGRFNGQFAAFDYTIQP
IDAAAGVCAHEYGHDLGLPDEYDTQYTGTGEPVSYWSIMSSGSWAGKIGGTQPTAFSSWAKQFLQNSIGGRWINHEQLSI
NELEAKPRVVTLFQTTDNSRPNMVKVTLPMKRVEGIKPAEGEFSFYSNRGDDLKNRMSRPLTIPAGSQATLRFKAWFQIE
KDYDYARVLINGKPIAGNITTMDDPFKSGLVPAISGQSDGWVDAQFDLSAWAGQTVELAFDYLTDGGLAMEGLYVDDLRL
EVDGNQTLIDNAEGTSSFAFQGFTKNGGFHEANHYYLLQWRSHNDVDQGLANLKRFGQLMSFEPGLLVWYVDESYADNWV
GKHPGEGWLGVVDADQNALVWSKTGEVAQTRFQVRDATFSLFDQAPLKLVTADGNTLEDMNLTANASFSDDQDYSSPQAP
DSGRKVMPFGLKIDLLSQSKENEYGVVRLSKVTTENIAPVARFELKVEGLSVMSQNTSSDSDGNIVSYLWDFGNGQTSTE
AAPTWSYTKAGSYSVTLTVTDDKGDSDTHQQTIKVDTPNALPQASANYIHLGRWVTMWSTSTDSDGRIVDTEWTLPNGKI
KRGRMFTAIFPSYGHHDVQLKVMDDRGAVTTITIKVKL
>P19144 3.4.24.40~~~prtC~~~Serralysin C~~~
MEKNLSSRDDDALHSLSAPSSSYNSIYDLLHYHERGNGLTINGKPSYSIEDAGDQITRDNVSWNGANVFGKSANLTFKFL
QSARSTPDGDTGFVKFNAAQISQAKLALQSWADVANVTFTEVTGNQSANVTFGNYTRDSSGRLDYGTQAYAYLPGSGSAS
GTTWYNYNVDNIRSPDTMEYGRQTLTHEIGHALGLNHPGDYNAGEGNPSYSDVTYAEDTRQFSIMSYWSEKNTGGDFKGH
YAAGPMLDDIAAIQRLYGANMTTRTGDSVYGFNSNTDRDFYTATSSSKALIFSAWDAGGNDTFDFSGYSNNQRINLNDGS
LSDVGGLKGNVSIAEGVTIENAIGGSGNDLLIGNNADNTLRGGAGDDVLFGGSGADRLYGGSGRDTFVYTAASDSKVAAP
DWLLDFQTGADKIDLSALNTGNNLHFVNQFSGSGGEIMLNWDASANTSNLYLNLDNNTSPEFLVKIVGQVSQTADFVV
>P44758 1.11.1.27~~~PGdx~~~Hybrid peroxiredoxin hyPrx5~~~COG0678
MSSMEGKKVPQVTFRTRQGDKWVDVTTSELFDNKTVIVFSLPGAFTPTCSSSHLPRYNELAPVFKKYGVDDILVVSVNDT
FVMNAWKEDEKSENISFIPDGNGEFTEGMGMLVGKEDLGFGKRSWRYSMLVKNGVVEKMFIEPNEPGDPFKVSDADTMLK
YLAPQHQVQESISIFTKPGCPFCAKAKQLLHDKGLSFEEIILGHDATIVSVRAVSGRTTVPQVFIGGKHIGGSDDLEKYF
A
>P73728 1.11.1.27~~~~~~Peroxiredoxin sll1621~~~COG0678
MTPERVPSVVFKTRVRDESVPGPNPYRWEDKTTEQIFGGKKVVLFSLPGAFTPTCSSNHLPRYEQLFEEFQALGVDDIIC
LSVNDAFVMFQWGKQIGADKVKLLPDGNGEFTRKMGMLVEKSNLGFGMRSWRYSMFVNDGKIEKMFIEPEFGDNCPVDPF
ECSDADTMLAYLKGAEAPGVSEPVKAFVG
>P25026 1.11.1.-~~~cpo~~~Non-heme chloroperoxidase~~~
MPYVTTKDNVEIFYKDWGPKDAQPIVFHHGWPLSGDDWDAQMLFFVQKGYRVIAHDRRGHGRSAQVSDGHDMDHYAADAF
AVVEALDLRNAVHIGHSTGGGEVARYVANDGQPAGRVAKAVLVSAVPPLMLKTESNPEGLPIEVFDGFRKALADNRAQFF
LDVPTGPFYGFNRAGATVHQGVIRNWWRQGMEGSAKAHYDGIKAFSETDQTEDLKSITVPTLVLHGEDDQIVPIADAALK
SIKLLQNGTLKTYPGYSHGMLTVNADVLNADLLAFVQA
>O31168 1.11.1.-~~~cpo~~~Non-heme chloroperoxidase~~~
MPFITVGQENSTSIDLYYEDHGAGQPVVLIHGFPLSGHSWERQSAALLDAGYRVITYDRRGFGQSSQPTTGYDYDTFAAD
LNTVLETLDLQDAVLVGFSMGTGEVARYVSSYGTARIAKVAFLASLEPFLLKTDDNPDGAAPKEFFDGIVAAVKADRYAF
YTGFFNDFYNLDENLGTRISEEAVRNSWNTAASGGFFAAAAAPTTWYTDFRADIPRIDVPALILHGTGDRTLPIENTARV
FHKALPSAEYVEVEGAPHGLLWTHAEEVNTALLAFLAK
>O31158 1.11.1.-~~~cpo~~~Non-heme chloroperoxidase~~~
MTTFTTRDGTQIYYKDWGSGQPIVFSHGWPLNADSWESQMIFLAAQGYRVIAHDRRGHGRSSQPWSGNDMDTYADDLAQL
IEHLDLRDAVLFGFSTGGGEVARYIGRHGTARVAKAGLISAVPPLMLKTEANPGGLPMEVFDGIRQASLADRSQLYKDLA
SGPFFGFNQPGAKSSAGMVDWFWLQGMAAGHKNAYDCIKAFSETDFTEDLKKIDVPTLVVHGDADQVVPIEASGIASAAL
VKGSTLKIYSGAPHGLTDTHKDQLNADLLAFIKG
>P49323 1.11.1.-~~~cpo~~~Non-heme chloroperoxidase~~~
MGTVTTSDGTNIFYKDWGPRDGLPVVFHHGWPLSADDWDNQMLFFLSHGYRVIAHDRRGHGRSDQPSTGHDMDTYAADVA
ALTEALDLRGAVHIGHSTGGGEVARYVARAEPGRVAKAVLVSAVPPVMVKSDTNPDGLPLEVFDEFRAALAANRAQFYID
VPSGPFYGFNREGATVSQGLIDHWWLQGMMGAANAHYECIAAFSETDFTDDLKRIDVPVLVAHGTDDQVVPYADAAPKSA
ELLANATLKSYEGLPHGMLSTHPEVLNPDLLAFVKS
>Q9L3Q5 1.11.1.24~~~prxU~~~Selenocysteine-containing peroxiredoxin PrxU~~~
MVSVGKKAPDFEMAGFYKGEFKTFRLSEYLGKWVVLCFYPGDFTFVUATEVSAVAEKYPEFQKLGVEVLSVSVDSVFVHK
MWNDNELSKMVEGGIPFPMLSDGGGNVGTLYGVYDPEAGVENRGRFLIDPDGIIQGYEVLILPVGRNVSETLRQIQAFQL
VRETKGAEVAPSGWKPGKKTLKPGPGLVGNVYKEWSVKEAFED
>P55111 3.4.24.-~~~hly~~~Zinc metalloproteinase~~~
MKKYYAVTGIALAVGMLCTTQLAGATQAADPSVGSLDSSNVVTEFSAQGNVEQITFKSAIKSAPMSSARSAQTSAIIPGL
KNLFVSAPGSDFSLNDSSNNYIKRFTQNIAGIPVLGSSITEVLDGQGAVTSAIGAVTSATKGAFPADLAAGQAAALASAT
KIASAGKDASAISLVDQKAIWFDAVLIGKGATGSVAVPAYQFSFTTGFAESRVLTVAANDGAILNDRTDRKDINRVVCDA
NSKVIDLEASNADALLKCGKTQANKPTRIEGQAASSVADVNSVYNFLNDTASFYGANTKANDLTALIGNDEGDGLGKAMR
AVVRICVTDSQNGEQCPFANAFWYNGQMTYGQGVTTDDITGHELTHGVTEKTNGLVYANESGAINESMSDVFGEFIDLSN
GSSDDTAANRWAIGEGSSLGVIRSMKDPGKYGEPAIYKGSNWKPTATNPNDNNDQGGVHSNSGVGNKLAFLITDGQTFNG
QTVTGIGIAKAAQLYWAAQRQLTANATYSSLGKALNSACSANVSNNVAGTTAANCTQVANAIKAVGIK
>P23694 3.4.24.40~~~~~~Serralysin~~~
MQSTKKAIEITESSLAAATTGYDAVDDLLHYHERGNGIQINGKDSFSNEQAGLFITRENQTWNGYKVFGQPVKLTFSFPD
YKFSSTNVAGDTGLSKFSAEQQQQAKLSLQSWADVANITFTEVAAGQKANITFGNYSQDRPGHYDYGTQAYAFLPNTIWQ
GQDLGGQTWYNVNQSNVKHPATEDYGRQTFTHEIGHALGLSHPGDYNAGEGNPTYNDVTYAEDTRQFSLMSYWSETNTGG
DNGGHYAAAPLLDDIAAIQHLYGANPSTRTGDTVYGFNSNTGRDFLSTTSNSQKVIFAAWDAGGNDTFDFSGYTANQRIN
LNEKSFSDVGGLKGNVSIAAGVTIENAIGGSGNDVIVGNAANNVLKGGAGNDVLFGGGGADELWGGAGKDIFVFSAASDS
APGASDWIRDFQKGIDKIDLSFFNKEANSSDFIHFVDHFSGTAGEALLSYNASSNVTDLSVNIGGHQAPDFLVKIVGQVD
VATDFIV
>P07268 3.4.24.40~~~~~~Serralysin~~~
MQSTKKAIEITESNFAAATTGYDAVDDLLHYHERGNGIQINGKDSFSNEQAGLFITRENQTWNGYKVFGQPVKLTFSFPD
YKFSSTNVAGDTGLSKFSAEQQQQAKLSLQSWADVANITFTEVAAGQKANITFGNYSQDRPGHYDYGTQAYAFLPNTIWQ
GQDLGGQTWYNVNQSNVKHPATEDYGRQTFTHEIGHALGLSHPGDYNAGEGNPTYRDVTYAEDTRQFSLMSYWSETNTGG
DNGGHYAAAPLLDDIAAIQHLYGANLSTRTGDTVYGFNSNTGRDFLSTTSNSQKVIFAAWDAGGNDTFDFSGYTANQRIN
LNEKSFSDVGGLKGNVSIAAGVTIENAIGGSGNDVIVGNAANNVLKGGAGNDVLFGGGGADELWGGAGKDIFVFSAASDS
APGASDWIRDFQKGIDKIDLSFFNKEAQSSDFIHFVDHFSGAAGEALLSYNASNNVTDLSVNIGGHQAPDFLVKIVGQVD
VATDFIV
>Q53080 ~~~prcA1~~~Proteasome subunit alpha 1~~~
MTMPYYASAEQIMRDRSELARKGIARGRSVVVLTFRDGVLFVAENPSTALHKVSELYDRLGFAAVGKYNEFENLRRAGIV
HADMRGYSYDRRDVTGRSLANAYAQTLGTIFTEQPKPYEVEICVAEVGRVGSPKAPQLYRITYDGSIVDEQHFVVMGGTT
EPIATAMRESYRADLDLEAAVGIAVNALRQGGAGEGEKRNVDVASLEVAVLDQSRPRRAFRRIAGTALEQLVPAEPAAAS
ESAPEPKPDTETKPADTQD
>Q53084 ~~~prcA2~~~Proteasome subunit alpha 2~~~
MTMPYYASAEQIMRDRSELARKGIARGRSVVVLTYRDGVLFVAENPSRALHKVSELYDRLGFAAVGKYNEFENLRRAGIV
HADMRGYSYDRRDVTGRSLANAYAQTLGTIFTEQPKPYEVEICVAEIGRFGSSTPAQLYRITYDGSIADEQEFVVMGGTT
EPIVTAMRESYQRDLDLESAVRLAVGALQKGGPAPAGTTEAEPRTLDVSALEVAVLDSNRPRRAFKRIAGSSLEEMLPTP
AATEDAPPANGDAPS
>B0C474 1.97.1.12~~~psaA~~~Photosystem I P700 chlorophyll a apoprotein A1~~~COG2885
MTTSPGGPETKGRTAEVDINPVSASLEVAGKPGHFNKSLSKGPQTTTWIWNLHALAHDFDTQTNDLEEISRKIFSAHFGH
LSIIFVWISGMIFHAARFSNYYAWLADPLGNKPSAHVVWPIVGQDILNADVGNGFRGVQITSGLFHILRGAGMTDPGELY
SAAIGALVAAVVMMYAGYYHYHKKAPKLEWFQNAESTMTHHLIVLLGLGNLAWTGHLIHVSLPVNKLLDSGVAPQDIPIP
HEFLFDNGFMADLYPSFAQGLMPYFTLNWGAYSDFLTFKGGLDPTTGGLWMTDIAHHHLALAVMYIIAGHMYRTNWGIGH
SMKEIMESHKGPFTGEGHKGLYEVLTTSWHAQLAINLATWGSFSIIVAHHMYAMPPYPYLATDYGTQLNLFVHHMWIGGF
LIVGGAAHAAIFMVRDYDPAVNQNNVLDRMLRHRDTIISHLNWVCIFLGFHSFGLYIHNDNMRSLGRPQDMFSDTAIQLQ
PIFSQWVQNLQANVAGTIRAPLAEGASSLAWGGDPLFVGGKVAMQHVSLGTADFMIHHIHAFQIHVTVLILIKGVLYARS
SRLIPDKANLGFRFPCDGPGRGGTCQSSGWDHIFLGLFWMYNCISIVNFHFFWKMQSDVWGAANANGGVNYLTAGNWAQS
SITINGWLRDFLWAQSVQVINSYGSALSAYGILFLGAHFIWAFSLMFLFSGRGYWQELIESIVWAHSKLKIAPAIQPRAM
SITQGRAVGLGHYLLGGIVTSWSFYLARILALG
>Q7NFT6 1.97.1.12~~~psaA~~~Photosystem I P700 chlorophyll a apoprotein A1~~~COG2885
MSTTPQEREKPVRVLVDNDPVPTSTEKWGKPGWFERNLARGPKTTTWIWDLHALAHDFETHTSDKEEISRKIFSAHFGHL
AVVCVWLSGMFWHGAYFSNFTAWMENPLGLKPSAQTVWPVFGQEILNDPSTVAKGFEQGGIVITSGLFHLWRAVGFTTTG
QLAAMSIAMLIIAALFLFAGWFHYHKRAPKLEWFQNVESMLNHHLAGLFGLGSLFWTGHLIHVALPVKAQLDAGIAPAQV
NPFAGLDYGLMGQYFPKGFGPNGGLGAFFTLNWGQFTDFLTFKGGLEPATGALYLTDIAHHHLAIATLFIIAGHMYRTNW
GIGHSIKEMLEAHKGPLTGEGHRGLYEVLTTSWHAQLAINLAMAGSITIIVAHHMYAMNPYPYMGTDYATQISLFTHHMW
IGGFLIVGAGAHAAIFMVRDYDPVTNQNNLLDRVLRHRDAIISHLNWVTLFLGFHSFGLYVHNDTMQALGRPRDMFADFA
IPLQPVFAQWIQNIHAAAPGGATAPWVGGTSPTWYTGALSSAATLQANQVLALANDKISISPIHLGTADFMVHHIFALCI
HVTVLILLKGVLFARSSRLIPDKANLGFRFPCDGPGRGGTCQSSAWDHVFLGLFWMYNTISVVIFHFSWKMQSDVWGTVD
RSTGAVNHIIGNTDVLLGGQTVALSQYAASSININGWLRDFLWAQSSAVINSYGGPLSAYGLMFLGAHFIWAFSLMFLFS
GRGYWQELIESIVWAHNKLKVAPAIQPRALSITQGRAVGVAHYLLGGIATTWAFFLARFLALP
>P58576 1.97.1.12~~~psaA~~~Photosystem I P700 chlorophyll a apoprotein A1~~~COG2885
MTISPPEREEKKARVIVDKDPVPTSFEKWAQPGHFDRTLARGPKTTTWIWNLHALAHDFDTHTSDLEDISRKIFAAHFGH
LAVVTIWLSGMIFHGAKFSNYEAWLSDPLNVRPSAQVVWPIVGQDILNGDVGGGFHGIQITSGLFQVWRGWGITNSFQLY
CTAIGGLVLAGLFLFAGWFHYHKRAPKLEWFQNVESMLNHHLQVLLGCGSLGWAGHLIHVSAPINKLMDAGVAVKDIPLP
HEFILNKSLLIDLFPGFAAGLTPFFTLNWGQYADFLTFKGGLNPVTGGLWMTDIAHHHLAIAVVFIIAGHQYRTNWGIGH
SIKEILENHKGPFTGEGHKGLYENLTTSWHAQLATNLAFLGSLTIIIAHHMYAMPPYPYLATDYATQLCIFTHHIWIGGF
LIVGGAAHAAIFMVRDYDPVVNQNNVLDRVIRHRDAIISHLNWVCIFLGFHSFGLYIHNDTMRALGRPQDMFSDTAIQLQ
PVFAQWVQNLHTLAPGGTAPNALEPVSYAFGGGVLAVGGKVAMMPIALGTADFLIHHIHAFTIHVTVLILLKGVLFARSS
RLIPDKANLGFRFPCDGPGRGGTCQVSGWDHVFLGLFWMYNSLSIVIFHFSWKMQSDVWGTVDAAGNVSHITGGNFAQSA
ITINGWLRDFLWAQASQVINSYGSALSAYGLMFLGAHFVWAFSLMFLFSGRGYWQELIESIVWAHNKLKVAPAIQPRALS
ITQGRAVGVAHYLLGGIATTWAFFHAHILSVG
>Q9L4N4 1.97.1.12~~~psaA~~~Photosystem I P700 chlorophyll a apoprotein A1~~~COG2885
MTISPPERGEKAKGAAPTPYDQPVDRDHAPIDYEKLNKPGFWSSKLSKGPKTTTWIWNLHADAHDFDTHLGDLEETSRKI
FSAHFGHLAVVFIWMSAAFFHGARFSNYTGWLADPTNVKPGAQVVWPVVGQEILNADLGGNYQGLQITSGIFQMWRAWGI
TSEVQLMALAIGGVIMAALMLHGGIYHYHKAAPKLEWFRKIEPMLQHHQIALIGLGSIAWAGHLIHIGAPVAALLDAIDA
GNPLVVDGVSIASAADVTNLAPRLCDPAVASQIFPSLAGRTVENFFTLNWWAFTDILTNKGGLNPVTGSLWMTDISHHHL
AFGVFAIFGGHMWRNNVHGVGHSMKEIMDVHKGDPILFPAPKGHQGIFEFLSNSWHGQLSINLAMVGSASIVVAHHMYAL
PPYPYIAIDYPTVLGLFTHHMWIGGLFICGAAAHAGIAMIRDYDPAVHIDNVLDRILKARDAIISHLNWVCMWLGFHSFG
LYIHNDVMRALGRPKDMFSDTGIQLQPFLAQWVQNLQQSAVGTGDLVGAGNLPGSVLSEVFNGNVVEVGGKVAIAPIPLG
TADLMIHHVHAFTIHVTLLILLKGVLYARSSRLIPDKAQLGFRFPCDGPGRGGTCQVSSWDHVFLGLFWMYNSLSVVIFH
FSWKMQSDVWGLTGGNFAQSSITINGWLRDFLWAQSSQVLTSYGQPISMYGLMFLGAHFVWAFSLMFLFSGRGYWQELFE
SIIWAHNKLKVAPTIQPRALSITQGRAVGVAHFLLGGIATTWAFFHARLIGLG
>Q7V510 1.97.1.12~~~psaA~~~Photosystem I P700 chlorophyll a apoprotein A1~~~COG2885
MTISPPERGEKAKPIYDQPVDRDHVPADFEKFEQPGFFSKSLAKGPNSTTWIWNLHADAHDFDTHIGDLEETSRKIFSAH
FGHLAIVFIWMSGAFFHGARFSNYSGWLADPTHVKASAQVVWPIVGQEIMNADMGAGFNGIQITSGIFQMWRAWGITSET
ELMALATGALIMAALVLHGGIFHYHKAAPKLEWFKKIESMLQHHQIGLFGLGSLGWTGHLIHVANPTNALLDAIDAGTPM
VLDGKTIATAADIPLPHELYNADLVGQIYPGLASGVGNFFSANWWAFSDFLTNNGGVNPVTGALWSTDVAHHHLAWAVFL
MFGGHVYRSRFGIGHSMKEIMGNVKGDPLLFPAPNGHKGLFEFLSNSWHAQLAVNLACIGSGSIVVAHHMYSLPPYPYLA
TDYPTVLGLFTHHMWIGGLMICGAAAHAGIAVIRDYDVSVHVDNVLDRMFKARDAIISHLNWVCMFLGFHSFGLYIHNDS
MRALGRSQDMFSDSAIQLQPVLAQWIQSLWASSIGTSAVVGTTTGLPGAVSDVFNGSVVAVGGKVALMAIPLGTADLMIH
HIHAFTIHVTCLILLKGVLFARSSRLVPDKANLGFRFSCDGPGRGGTCQVSSWDHVFLGLFWMYNSLSMVIFYFSWKMQS
DVWGTVNSDGSVTHLVSGNFAQSAITVNGWFRDFLWAQSSQVLTSYGTGLSGYGLLFLGGHFVWAFSLMFLFSGRGYWQE
LFESIIWAHNKLKLAPTIQPRALSITQGRAVGVTHFLFGGIVTTWAFFHARLLGLG
>Q9RC08 1.97.1.12~~~psaA~~~Photosystem I P700 chlorophyll a apoprotein A1~~~COG2885
MTISPPESGEKDKKILESPVKADPRPIDFAKLDKPGFWSSKLSKGPKTTTWIWNLHADAHDFDIHTGDAEEATRKIFSAH
FGHLAVIFIWMSAAFFHGARFSNYSGWLADPTHVKPGAQQVWAIVGQEMLNGDLGANYNGIQISSGVFHMWRAWGITNES
ELMALAIGAVVMAALMLHAGIFHYHKAAPKMEWFQNIESMLNHHIAGLVGLGSLAWAGHCIHIGAPTAALLDAIDAGTPL
VINGKEIATIADMPMPHQLCDPQIIAQIFPGLASGTGNFFSLNWLAFSDFLTFKGGLNPVTGSLWMTDVSHHHLAFGVIA
IIGGHMYRTNYGIGHSMKEILDSQQGDPILFPAPKGHQGLFEFMAESRHAQLSVNLAMLGSLSILISHHMYAMPPYPYIA
TDYMTVLGLFTHHMWIGGLFIVGAGAHAGIAMVRDYDPAKHIDNVLDRILKARDALISHLNWVCMWLGFHSFGLYIHNDT
MRALGRPQDMFSDSAIQLQPIFAQWVQSIQASAVGTSILAGTAEALPHKAISEVFNGSLVEVGGKVAIAPIPLGTADLMI
HHIHAFQIHVTVLILLKGVLYARSSRLIPDKASLGFRFPCDGPGRGGTCQVSSWDHVFLGLFWMYNCLSIVIFHFSWKMQ
SDVWGLTGGNFSQSAITINGWLRDFLWAQSSQVLTSYGSAISMYGLMFLGAHFIWAFSLMFLFSGRGYWQELFESIVWAH
NKLKVAPTIQPRALSITQGRAVGVTHFLVGGIATTWAFFHARLFGLG
>Q31LJ0 1.97.1.12~~~psaA~~~Photosystem I P700 chlorophyll a apoprotein A1~~~COG2885
MTISPPEREAKVKATVDKNPVPTSFEKWGKPGHFDRTLAKGPKTTTWIWNLHANAHDFDSHTSDLEDISRKIFSAHFGHL
AVIFIWLSGAYFHGARFSNFSGWLADPTHVKPSAQVVWPIFGQEILNGDVGGGFHGIQITSGLFQLWRASGYTNEFQLYV
TAIGALVMAGLMLFAGWFHYHKAAPKLEWFQNVESMLNHHLAGLLGLGSLSWAGHQIHVSLPVNKLLDAIDAGEPLVLNG
KTIASAADIPLPHEFLDVSLISQLFPGFEAGVKAFFTLNWSAYADFLTFKGGLNPVTGGLWLTDTAHHHLAIAVLFIVAG
HMYRTNWGIGHSLKEILEAHKGPFTGQGHKGLYEILTTSWHAQLSINLAILGSISIIVAHHMYAMPPYPYLATDYPTMLS
LFTHHIWIGGFLIVGAGAHAAIFMVRDYDPAKNVDNLLDRVLRHRDAIISHLNWVCIWLGFHSFGLYIHNDTMRALGRPQ
DMFSDSAIQLQPIFAQWIQNIHALAPGNTAPNALASVSQVFGGDVVAVGGKVAAAPIVLGTADFMVHHIHAFTIHVTALI
LLKGVLYARSSRLVPDKANLGFRFPCDGPGRGGTCQVSGWDHVFLGLFWMYNSLSIVIFHYSWKMQSDVWGSVLPDGSVA
HIANGNFAQSALTINGWLRDFLWAQASQVITSYGSSTSAYGLLFLGAHFVWAFSLMFLFSGRGYWQELIESIVWAHNKLK
VAPAIQPRALSIIQGRAVGVAHYLLGGIVTTWSFFLARIIAVG
>P29254 1.97.1.12~~~psaA~~~Photosystem I P700 chlorophyll a apoprotein A1~~~COG2885
MTISPPEREAKAKVSVDNNPVPTSFEKWGKPGHFDRTLARGPKTTTWIWNLHANAHDFDSQTSDLEDVSRKIFSAHFGHL
AVVFVWLSGMYFHGAKFSNYEGWLADPTHIKPSAQVVWPIVGQGILNGDVGGGFHGIQITSGLFYLWRASGFTDSYQLYC
TAIGGLVMAALMLFAGWFHYHVKAPKLEWFQNVESMMNHHLAGLLGLGSLGWAGHQIHVSMPINKLLDAGVAPKDIPLPH
EFILEPSKMAELYPSFAQGLTPFFTLNWGVYSDFLTFKGGLNPVTGGLWLSDTAHHHLAIAVLFIIAGHMYRTNWGIGHS
MKEILEAHKGPFTGEGHKGLYEILTTSWHAQLAINLALLGSLTIIVAQHMYAMPPYPYQAIDYATQLSLFTHHMWIGGFL
IVGAGAHGAIFMVRDYDPAKNVNNLLDRMLRHRDAIISHLNWVCIFLGFHSFGLYIHNDTMRALGRPQDMFSDTAIQLQP
IFAQWVQHLHTLAPGATAPNALATASYAFGGETIAVAGKVAMMPITLGTADFMVHHIHAFTIHVTALILLKGVLYARSSR
LVPDKANLGFRFPCDGPGRGGTCQVSGWDHVFLGLFWMYNSLSIVIFHFSWKMQSDVWGTVSPDGSVTHVTLGNFAQSAI
TINGWLRDFLWAQAANVINSYGSALSAYGIMFLAGHFVFAFSLMFLFSGRGYWQELIESIVWAHNKLNVAPAIQPRALSI
IQGRAVGVAHYLLGGIVTTWAFFLARSLSIG
>P0A405 1.97.1.12~~~psaA~~~Photosystem I P700 chlorophyll a apoprotein A1~~~COG2885
MTISPPEREPKVRVVVDNDPVPTSFEKWAKPGHFDRTLARGPQTTTWIWNLHALAHDFDTHTSDLEDISRKIFSAHFGHL
AVVFIWLSGMYFHGAKFSNYEAWLADPTGIKPSAQVVWPIVGQGILNGDVGGGFHGIQITSGLFQLWRASGITNEFQLYC
TAIGGLVMAGLMLFAGWFHYHKRAPKLEWFQNVESMLNHHLAGLLGLGSLAWAGHQIHVSLPINKLLDAGVAAKDIPLPH
EFILNPSLMAELYPKVDWGFFSGVIPFFTFNWAAYSDFLTFNGGLNPVTGGLWLSDTAHHHLAIAVLFIIAGHMYRTNWG
IGHSLKEILEAHKGPFTGAGHKGLYEVLTTSWHAQLAINLAMMGSLSIIVAQHMYAMPPYPYLATDYPTQLSLFTHHMWI
GGFLVVGGAAHGAIFMVRDYDPAMNQNNVLDRVLRHRDAIISHLNWVCIFLGFHSFGLYVHNDTMRAFGRPQDMFSDTGI
QLQPVFAQWVQNLHTLAPGGTAPNAAATASVAFGGDVVAVGGKVAMMPIVLGTADFMVHHIHAFTIHVTVLILLKGVLFA
RSSRLIPDKANLGFRFPCDGPGRGGTCQVSGWDHVFLGLFWMYNCISVVIFHFSWKMQSDVWGTVAPDGTVSHITGGNFA
QSAITINGWLRDFLWAQASQVIGSYGSALSAYGLLFLGAHFIWAFSLMFLFSGRGYWQELIESIVWAHNKLKVAPAIQPR
ALSIIQGRAVGVAHYLLGGIATTWAFFLARIISVG
>P25936 1.97.1.12~~~psaA~~~Photosystem I P700 chlorophyll a apoprotein A1~~~
MTISPPEREPKVRVVVDNDPVPTSFEKWAKPGHFDRTLARGPQTTTWIWNLHALAHDFDTHTSDLEDISRKIFSAHFGHL
AVVFIWLSGMYFHGAKFSNYEAWLADPTGIKPSAQVVWPIVGQGILNGDVGGGFHGIQITSGLFQLWRASGITNEFQLYC
TAIGGLVMAGLMLFAGWFHYHKRAPKLEWFQNVESMLNHHLAGLLGLGSLSWAGHQIHVSLPINKLLDAGVAAKDIPLPH
EFILNPSLMAELYPKVDWGFFSGVIPFFTFNWAAYSDFLTFNGGLNPVTGGLWLSDTAHHHLAIAVLFIIAGHMYRTNWG
IGHSLKEILEAHKGPFTGAGHKGLYEVLTTSWHAQLAINLAMMGSLSIIVAQHMYAMPPYPYLATDYPTQLSLFTHHMWI
GGFLVVGGAAHGAIFMVRDYDPAMNQNNVLDRVLRHRDAIISHLNWVCIFLGFHSFGLYVHNDTMRAFGRPQDMFSDTGI
QLQPVFAQWVQNLHTLAPGGTAPNAAATASVAFGGDVVAVGGKVAMMPIVLGTADFMVHHIHAFTIHVTVLILLKGVLFA
RSSRLIPDKANLGFRFPCDGPGRGGTCQVSGWDHVFLGLFWMYNCISVVIFHFSWKMQSDVWGTVAPDGTVSHITGGNFA
QSAITINGWLRDFLWAQASQVIGSYGSALSAYGLLFLGAHFIWAFSLMFLFSGRGYWQELIESIVWAHNKLKVAPAIQPR
ALSIIQGRAVGVAHYLLGGIATTWAFFLARIISVG
>Q44550 1.97.1.12~~~psaA~~~Photosystem I P700 chlorophyll a apoprotein A1~~~COG2885
MTISPPEREEKKARVIVDKDPVPTSFEKWAQPGHFDRTLARGPKTTTWIWNLHALAHDFDTHTSDLEDISRKIFAAHFGH
LAVVTIWLSGMIFHGAKFSNYEAWLSDPLNVRPSAQVVWPIVGQDILNGDVGGGFHGIQITSGLFQVWRGWGITNSFQLY
CTAIGGLVLAGLFLFAGWFHYHKRAPKLEWFQNVESMLNHHLQVLLGCGSLGWAGHLIHVSAPINKLLDAGVAVKDIPLP
HEFILNKSVLIDLFPGFAAGLTPFFTLNWGQYADFLTFKGGLNPVTGGLWLTDISHHHLAIAVLFIIAGHQYRTNWGIGH
SIKEILENHKGPFTGEGHKGLYENLTTSWHAQLATNLAFLGSLTIIIAHHMYAMPPYPYLATDYATQLCIFTHHIWIGGF
LIVGGAAHAAIFMVRDYDPVVNQNNVLDRVIRHRDAIISHLNWVCIFLGFHSFGLYIHNDTMRALGRPQDMFSDTAIQLQ
PVFAQWVQNLHTLAPGGTAPNALEPVSYAFGGGVLAVGGKVAMMPIALGTADFLIHHIHAFTIHVTVLILLKGVLFARSS
RLIPDKANLGFRFPCDGPGRGGTCQVSGWDHVFLGLFWMYNSLSIVIFHFSWKMQSDVWGTVDAAGNVSHITGGNFAQSA
ITINGWLRDFLWAQASQVINSYGSALSAYGLMFLGAHFVWAFSLMFLFSGRGYWQELIESIVWAHNKLKVAPAIQPRALS
ITQGRAVGVAHYLLGGIATTWAFFHAHILSVG
>P31522 ~~~psaA~~~pH 6 antigen~~~
MKMKCFAKNALAVTTLMIAACGMANASTVINSKDVSGEVTVKQGNTFHVDFAPNTGEIFAGKQPGDVTMFTLTMGDTAPH
GGWRLIPTGDSKGGYMISADGDYVGLYSYMMSWVGIDNNWYINDDSPKDIKDHLYVKAGTVLKPTTYKFTGRVEEYVF
>P58565 1.97.1.12~~~psaB1~~~Photosystem I P700 chlorophyll a apoprotein A2 1~~~COG2885
MATKFPKFSQDLAQDPTTRRIWYAMAMGNDFESHDGMTEENLYQKIFATHFGHLAIIFLWASSLLFHVAWQGNFEQWIKD
PLHVRPIAHAIWDPHFGKPAIEAFTQAGANGPVNIAYSGVYHWWYTIGMRTNTELYTGSVFLLLFASLFLFAGWLHLQPK
FRPSLAWFKSAESRLNHHLAGLFGVSSLAWAGHLIHVAIPESRGQHVGWDNFLSTAPHPAGLQPFFTGNWGVYAQNPDTA
GHIFSTSQGAGTAILTFLGGFHPQTESLWLTDMAHHHLAIAVLFIVAGHMYRTNFGIGHSIKEMMNAKTFFGKPVEGPFN
MPHQGIYDTYNNSLHFQLGWHLACLGVVTSWVAQHMYSLPSYAFIAKDYTTQAALYTHHQYIAIFLMVGAFAHGAIFLVR
DYDPEQNKGNVLERVLQHKEAIISHLSWVSLFLGFHTLGLYVHNDVVVAFGTPEKQILIEPVFAQFIQAAHGKVLYGLDT
LLSNPDSVAYTAYPNYANVWLPGWLDAINSGTNSLFLTIGPGDFLVHHAIALGLHTTTLILVKGALDARGSKLMPDKKDF
GYAFPCDGPGRGGTCDISAWDSFYLSLFWALNTVGWVTFYWHWKHLGIWQGNVAQFNENSTYLMGWFRDYLWANSAQLIN
GYNPYGVNNLSVWAWMFLFGHLVWATGFMFLISWRGYWQELIETLVWAHERTPIANLVRWKDKPVALSIVQARVVGLAHF
TVGYVLTYAAFLIASTAGKFG
>P31088 1.97.1.12~~~psaB1~~~Photosystem I P700 chlorophyll a apoprotein A2 1~~~COG2885
MATKFPKFSQDLAQDPTTRRIWYAMAMGNDFESHDGMTEENLYQKIFATHFGHLAIIFLWASSLLFHVAWQGNFEQWIKD
PLHVRPIAHAIWDPHFGKPAIEAFTQAGANGPVNIAYSGVYHWWYTIGMRTNTELYTGSVFLLLFASLFLFAGWLHLQPK
FRPSLAWFKSAESRLNHHLAGLFGVSSLAWAGHLIHVAIPESRGQHVGWDNFLTTAPHPAGLQPFFTGNWGVYAQNPDTA
GHIFSTSQGSGTAILTFLGGFHPQTESLWLTDIAHHHLAIAVLFIVAGHMYRTNFGIGHSIKEMMNAKTFFGKPVEGPFN
MPHQGIYDTYNNSLHFQLGWHLACLGVVTSWVAQHMYSLPSYAFIAKDYTTQAALYTHHQYIAIFLMVGAFAHGAIFLVR
DYDPEQNKGNVLERVLQHKEAIISHLSWVSLFLGFHTLGLYVHNDVVVAFGTPEKQILIEPVFAQFIQAAHGKVLYGLDT
LLSNPDSVAYTAYPNYANVWLPGWLDAINSGTNSLFLTIGPGDFLVHHAIALGLHTTTLILVKGALDARGSKLMPDKKDF
GYAFPCDGPGRGGTCDISAWDSFYLSLFWALNTVGWVTFYWHWKHLGIWQGNVAQFNENSTYLMGWFRDYLWANSAQLIN
GYNPYGVNNLSVWAWMFLFGHLVWATGFMFLISWRGYWQELIETLVWAHERTPIANLIRWKDKPVALSIVQARVVGLAHF
TVGYVLTYAAFLIASTAGKFG
>Q7NFT5 1.97.1.12~~~psaB~~~Photosystem I P700 chlorophyll a apoprotein A2~~~COG2885
MATRFPKFSQDLAQDPTTRRIWYGIATAHDFESHDGMTEESLYQKLFATHFGHLAIIFLWSSGNLFHIAWQGNFEQWVSN
PTGVVPIAHAIWDPHFGKGAVEAFTPEGGAGPVNAAYSGLYYLYYTLGMRFNSDLYQGSIFLMVLATVFLIAGWLHLQPR
FRPSLAWFKNAESRLNHHLSALFGVSSLAFAGHMIHVAIPAARGQRVDWSNFLNTLPHPAGLAPFFTGNWGVYADPQAGP
PILTFIGGLNPATGTLWLTDIAHHHLAIAVIFIIAGHMYRTNFGIGHSIKEILDAHKGPLTGEGHRGLYDTINNSLHFQL
GLALASLGVVTSLVAQHTYALPAYFYMPQDHTTMAALYTHHQYIAGFLMVGAFAHGAIFFVRDYDPKANENNVLARMLEH
KEALISHLSWVSLFLGFHTLGLYVHNDVMLAFGRPEDQLLIEPVFAQFVQVQSGKIIEGIPALFGGPGVTAPGEFLTGWL
GSVNANNSPIFLPIGPGDFLVHHAIALGLHTTTLILVKGALDARGSKLMPDKKDFGFAFPCDGPGRGGTCDISAWDAFYL
AVFWMLNTIGWVTFYWHWKWISIWGDNVAQFNASSTYLMGWLRDYLWANSAPLIGGYSPSGGTNALSVWAWMFLFGHLVW
ATGFMFLIAWRGYWQELIETLVWAHERTPLANLVRWKDKPVAMSIVQGRLVGLAHFTIGYILTYAAFLIASTAALYPNGP
AAFTPAISAEQAKGVLSEFKAKPVPGGVMLLLPENIVFDFDKSSVKLDADPALNRVVGVIQFYGSEPVEILGHTDSLGED
AYNQKLSEERASAVKAFFEKKGIEAERLTAKGYGETKPVAPNAKPDGSDNPDGRQQNRRVEILIKTEVVPVS
>Q31LJ1 1.97.1.12~~~psaB~~~Photosystem I P700 chlorophyll a apoprotein A2~~~COG2885
MATKFPKFSQDLAQDPTTRRIWYGIATAHDFESHDGMTEENLYQKIFASHFGHLAIIFLWVSGNLFHVAWQGNFEQWSQD
PLHVRPIAHAIWDPHFGQGAIDAFTQAGASSPVNVAYSGVYHWWYTIGMRTNGDLYQGSIFLLILSALFLFAGWLHLQPK
FRPSLSWFKNAESRLNHHLAGLFGFSSLAWTGHLVHVAIPEARGQHVGWDNFLSTLPHPAGLAPFFTGNWSVYAENPDTA
SHAFGTAEGAGTAILTFLGGFHPQTEALWLTDIAHHHLAIAVIFIIAGHMYRTNFGIGHSIKEILEAHKPPAGGLGAGHK
GLYETLNNSLHFQLALALASLGVVTSLVAQHMYSMPPYAFIAKDYTTMAALYTHHQYIATFIMCGAFAHGAIFLIRDYDP
EANKNNVLARVLEHKEAIISHLSWVSLFLGFHTLGLYVHNDVVVAFGTPEKQILIEPVFAQFVQAASGKALYGFNVLLAN
ADSAATAASLGTYLPNWLDAINSGKTALFLPIGPGDFLVHHAIALGLHTTTLILVKGALDARGSKLMPDKKDFGYSFPCD
GPGRGGTCDISAWDAFYLAVFWALNTVGWVTFYWHWKNLTVWQGNVAQFNESSTYLMGWLRDYLWLNSSQLINGYNPFGT
NNLSVWSWMFLFGHLIWATGFMFLISWRGYWQELIETIVWAHQRTPLANIVGWKDKPVALSIVQARVVGLAHFTVGYFLT
YAAFLIASTAGKFG
>P29255 1.97.1.12~~~psaB~~~Photosystem I P700 chlorophyll a apoprotein A2~~~COG2885
MATKFPKFSQDLAQDPTTRRIWYGIATAHDFETHDGMTEENLYQKIFASHFGHIAIIFLWTSGTLFHVAWQGNFEQWIKD
PLNIRPIAHAIWDPHFGEGAVNAFTQAGASNPVNIAYSGVYHWFYTIGMTTNQELYSGAVFLLVLASLFLFAGWLHLQPK
FRPSLAWFKNAESRLNHHLAGLFGVSSLAWAGHLVHVAIPEARGQHVGWDNFLSTPPHPAGLMPFFTGNWGVYAADPDTA
GHIFGTSEGAGTAILTFLGGFHPQTESLWLTDIAHHHLAIAVIFIIAGHMYRTNWGIGHSIKEILNAHKGPLTGAGHTNL
YDTINNSLHFQLGLALASLGVITSLVAQHMYSLPSYAFIAQDHTTQAALYTHHQYIAGFLMVGAFAHGAIFFVRDYDPVA
NKDNVLARMLEHKEALISHLSWVSLFLGFHTLGLYVHNDVVVAFGTPEKQILIEPVFAQWIQATSGKALYGFDVLLSNPD
SIASTTGAAWLPGWLDAINSGTNSLFLTIGPGDFLVHHAIALGLHTTALILIKGALDARGSKLMPDKKDFGYSFPCDGPG
RGGTCDISAWDAFYLAMFWMLNTLGWLTFYWHWKHLGVWSGNVAQFNENSTYLMGWFRDYLWANSAQLINGYNPYGVNNL
SVWAWMFLFGHLVWATGFMFLISWRGYWQELIETIVWAHERTPLANLVRWKDKPVALSIVQARLVGLAHFTVGYVLTYAA
FLIASTAGKFG
>P0A407 1.97.1.12~~~psaB~~~Photosystem I P700 chlorophyll a apoprotein A2~~~COG2885
MATKFPKFSQDLAQDPTTRRIWYAIAMAHDFESHDGMTEENLYQKIFASHFGHLAIIFLWVSGSLFHVAWQGNFEQWVQD
PVNTRPIAHAIWDPQFGKAAVDAFTQAGASNPVDIAYSGVYHWWYTIGMRTNGDLYQGAIFLLILASLALFAGWLHLQPK
FRPSLSWFKNAESRLNHHLAGLFGVSSLAWAGHLIHVAIPESRGQHVGWDNFLSTMPHPAGLAPFFTGNWGVYAQNPDTA
SHVFGTAQGAGTAILTFLGGFHPQTESLWLTDMAHHHLAIAVLFIVAGHMYRTQFGIGHSIKEMMDAKDFFGTKVEGPFN
MPHQGIYETYNNSLHFQLGWHLACLGVITSLVAQHMYSLPPYAFIAQDHTTMAALYTHHQYIAGFLMVGAFAHGAIFLVR
DYDPAQNKGNVLDRVLQHKEAIISHLSWVSLFLGFHTLGLYVHNDVVVAFGTPEKQILIEPVFAQFIQAAHGKLLYGFDT
LLSNPDSIASTAWPNYGNVWLPGWLDAINSGTNSLFLTIGPGDFLVHHAIALGLHTTTLILVKGALDARGSKLMPDKKDF
GYAFPCDGPGRGGTCDISAWDAFYLAMFWMLNTIGWVTFYWHWKHLGVWEGNVAQFNESSTYLMGWLRDYLWLNSSQLIN
GYNPFGTNNLSVWAWMFLFGHLVWATGFMFLISWRGYWQELIETLVWAHERTPLANLVRWKDKPVALSIVQARLVGLAHF
SVGYILTYAAFLIASTAAKFG
>P0A409 1.97.1.12~~~psaB~~~Photosystem I P700 chlorophyll a apoprotein A2~~~
MATKFPKFSQDLAQDPTTRRIWYAIAMAHDFESHDGMTEENLYQKIFASHFGHLAIIFLWVSGSLFHVAWQGNFEQWVQD
PVNTRPIAHAIWDPQFGKAAVDAFTQAGASNPVDIAYSGVYHWWYTIGMRTNGDLYQGAIFLLILASLALFAGWLHLQPK
FRPSLSWFKNAESRLNHHLAGLFGVSSLAWAGHLIHVAIPESRGQHVGWDNFLSTMPHPAGLAPFFTGNWGVYAQNPDTA
SHVFGTAQGAGTAILTFLGGFHPQTESLWLTDMAHHHLAIAVLFIVAGHMYRTQFGIGHSIKEMMDAKDFFGTKVEGPFN
MPHQGIYETYNNSLHFQLGWHLACLGVITSLVAQHMYSLPPYAFIAQDHTTMAALYTHHQYIAGFLMVGAFAHGAIFLVR
DYDPAQNKGNVLDRVLQHKEAIISHLSWVSLFLGFHTLGLYVHNDVVVAFGTPEKQILIEPVFAQFIQAAHGKLLYGFDT
LLSNPDSIASTAWPNYGNVWLPGWLDAINSGTNSLFLTIGPGDFLVHHAIALGLHTTTLILVKGALDARGSKLMPDKKDF
GYAFPCDGPGRGGTCDISAWDAFYLAMFWMLNTIGWVTFYWHWKHLGVWEGNVAQFNESSTYLMGWLRDYLWLNSSQLIN
GYNPFGTNNLSVWAWMFLFGHLVWATGFMFLISWRGYWQELIETLVWAHERTPLANLVRWKDKPVALSIVQARLVGLAHF
SVGYILTYAAFLIASTAAKFG
>B0CB42 1.97.1.12~~~psaC~~~Photosystem I iron-sulfur center~~~COG1143
MSHTVKIYDTCIGCTQCVRACPTDVLEMVPWDGCKAGQIASSPRTEDCVGCKRCETACPTDFLSIRVYLGAETTRSMGLA
Y
>Q7NG86 1.97.1.12~~~psaC~~~Photosystem I iron-sulfur center~~~COG1143
MSHSVKIYDTCIGCTQCVRACPLDVLEMVPWDGNKAGTIASSPRTEDCVGCKRCETACPTDFLSIRVYLGAETTRSMGLA
Y
>P0A412 1.97.1.12~~~psaC~~~Photosystem I iron-sulfur center~~~
MSHTVKIYDTCIGCTQCVRACPTDVLEMVPWDGCKAAQIASSPRTEDCVGCKRCETACPTDFLSIRVYLGAETTRSMGLA
Y
>P0A410 1.97.1.12~~~psaC~~~Photosystem I iron-sulfur center~~~COG1143
MSHTVKIYDTCIGCTQCVRACPTDVLEMVPWDGCKAAQVASSPRTEDCVGCKRCETACPTDFLSIRVYLGAETTRSMGLA
Y
>Q31QV2 1.97.1.12~~~psaC~~~Photosystem I iron-sulfur center~~~COG1143
MSHSVKIYDTCIGCTQCVRACPLDVLEMVPWDGCKAGQIAASPRTEDCVGCKRCETACPTDFLSIRVYLGAETTRSMGLA
Y
>P0A416 1.97.1.12~~~psaC~~~Photosystem I iron-sulfur center~~~
MAHTVKIYDTCIGCTQCVRACPTDVLEMVPWDGCKAGQIASSPRTEDCVGCKRCETACPTDFLSIRVYLGAETTRSMGLA
Y
>P31087 1.97.1.12~~~psaC~~~Photosystem I iron-sulfur center~~~COG1143
MSHSVKIYDTCIGCTQCVRACPLDVLEMVPWDGCKAGQIASSPRTEDCVGCKRCETACPTDFLSIRVYLGAETTRSMGLA
Y
>P31085 1.97.1.12~~~psaC~~~Photosystem I iron-sulfur center~~~COG1143
MSHSVKIYDTCIGCTQCVRACPLDVLEMVPWDGCKAGQIAASPRTEDCVGCKRCETACPTDFLSIRVYLGAETTRSMGLA
Y
>P32422 1.97.1.12~~~psaC~~~Photosystem I iron-sulfur center~~~COG1143
MSHSVKIYDTCIGCTQCVRACPLDVLEMVPWDGCKAAQIASSPRTEDCVGCKRCETACPTDFLSIRVYLGAETTRSMGLA
Y
>P0A415 1.97.1.12~~~psaC~~~Photosystem I iron-sulfur center~~~COG1143
MAHTVKIYDTCIGCTQCVRACPTDVLEMVPWDGCKAGQIASSPRTEDCVGCKRCETACPTDFLSIRVYLGAETTRSMGLA
Y
>P0A417 1.97.1.12~~~psaC~~~Photosystem I iron-sulfur center~~~
MAHTVKIYDTCIGCTQCVRACPTDVLEMVPWDGCKAGQIASSPRTEDCVGCKRCETACPTDFLSIRVYLGAETTRSMGLA
Y
>P0A411 1.97.1.12~~~psaC~~~Photosystem I iron-sulfur center~~~COG1143
MSHTVKIYDTCIGCTQCVRACPTDVLEMVPWDGCKAAQVASSPRTEDCVGCKRCETACPTDFLSIRVYLGAETTRSMGLA
Y
>Q7NF26 ~~~psaD~~~Photosystem I reaction center subunit II~~~
MADVKELPFGGSTPLFGGSTGGLLRKAQIEEKYLIVWNSKEEQVFEMPTGGAATMVAGTNVLYLARKEQCHALHRQLVST
FKIRDSKIYRVYPNGEQVLIFPMDGVPSEKSNPGREVVGYVPRKIGDNPNPVDVKFTGKETFDV
>P23808 ~~~psaD~~~Photosystem I reaction center subunit II~~~
AETLSGQTPLFAGSTGGLLKKAEVEEKYAITWTSPKAQVFELPTGGAATMQQGQNLLYLARKEYGIALGGQLRKFKITDY
KIYRILPGGETTLIHPADGVFPEKVNAGREKVRFVPRRIGENPNPSAIKFSGKYTYDA
>P58573 ~~~psaD~~~Photosystem I reaction center subunit II~~~
MAETLSGKTPLFAGSTGGLLTKAVEEEKYAITWTSPKAQVFELPTGGAATMHEGENLLYIARKEYGIALGGQLRKFKITN
YKIYRILPSGETTFIHPADGVFPEKVNAGREKVRFNARSIGENPNPSQVKFSGKATYDA
>P56596 ~~~psaD~~~Photosystem I reaction center subunit II~~~
AEQLSGKTPLFAGSTGGLLTKANVEEKYAITWTSPKAQVFELPTGGAATMNQGENLLYLARKEQGIALGGQLRKFKITDY
KIYRIFPNGETTFIHPADGVFPEKVNEGREKVRFVPRRIGQNPSPAQLKFSGKYTYDA
>P0A421 ~~~psaD~~~Photosystem I reaction center subunit II~~~
MTTLTGQPPLYGGSTGGLLSAADTEEKYAITWTSPKEQVFEMPTAGAAVMREGENLVYFARKEQCLALAAQQLRPRKIND
YKIYRIFPDGETVLIHPKDGVFPEKVNKGREAVNSVPRSIGQNPNPSQLKFTGKKPYDP
>P23076 ~~~psaD~~~Photosystem I reaction center subunit II~~~
MAETLTGKTPVFGGSTGGLLKSAETEEKYAITWTSTKEQVFELPTGGAAVMHEGDNLLYFARKEQALALGTQLRTKFKPK
IESYKIYRFFPGADVGYLHPKDGVFPEKVNEGRSFAGKVDRRIGQNPNPATIKFTGKQPYTA
>P19569 ~~~psaD~~~Photosystem I reaction center subunit II~~~
MTELSGQPPKFGGSTGGLLSKANREEKYAITWTSASEQVFEMPTGGAAIMNEGENLLYLARKEQCLALGTQLRTKFKPKI
QDYKIYRVYPSGEVQYLHPADGVFPEKVNEGREAQGTKTRRIGQNPEPVTIKFSGKAPYEV
>P0A420 ~~~psaD~~~Photosystem I reaction center subunit II~~~
MTTLTGQPPLYGGSTGGLLSAADTEEKYAITWTSPKEQVFEMPTAGAAVMREGENLVYFARKEQCLALAAQQLRPRKIND
YKIYRIFPDGETVLIHPKDGVFPEKVNKGREAVNSVPRSIGQNPNPSQLKFTGKKPYDP
>P0A422 ~~~psaD~~~Photosystem I reaction center subunit II~~~
MTTLTGQPPLYGGSTGGLLSAADTEEKYAITWTSPKEQVFEMPTAGAAVMREGENLVYFARKEQCLALAAQQLRPRKIND
YKIYRIFPDGETVLIHPKDGVFPEKVNKGREAVNSVPRSIGQNPNPSQLKFTGKKPYDP
>P31089 ~~~psaD~~~Photosystem I reaction center subunit II~~~
MAETLSGKTPLFAGSTGGLLTKAVEEEKYAITWTSPKAQVFELPTGGAATMHEGENLLYIARKEYGIALGGQLRKFKITN
YKIYRILPSGETTFIHPADGVFPEKVNPGREKVRFNARSIGENPNPSQVKFSGKATYDA
>Q7NFW6 ~~~psaE~~~Photosystem I reaction center subunit IV~~~
MAIERGAKVRILRKESYWYREVGTVASVDKSEKTIYPVTVRFEKVNYSGINTNNFGVSELEEVEA
>P23809 ~~~psaE~~~Photosystem I reaction center subunit IV~~~
VQRGSKVRILRPESYWFQDVGTVASIDQSGIKYSVIVRFDKVNYSGINTNNFAEDELLEVAPPAAK
>P58575 ~~~psaE~~~Photosystem I reaction center subunit IV~~~
MVQRGSKVRILRPESYWFQDVGTVASVDQSGIKYPVIVRFDKVNYAGINTNNFAVDELIEVEAPKAKAKK
>Q9WWP1 ~~~psaE~~~Photosystem I reaction center subunit IV~~~
MVQRGSKVRILRPESYWFQDVGTVASVDQSGIKYPVIVRFEKVNYSGINTNNFAEDELVEVEAPKAKPKK
>Q31NL7 ~~~psaE~~~Photosystem I reaction center subunit IV~~~
MAIARGDKVRILRPESYWFNEVGTVASVDQSGIKYPVVVRFEKVNYNGFSGSDGGVNTNNFAEAELQVVAAAAKK
>P0A424 ~~~psaE~~~Photosystem I reaction center subunit IV~~~
MVQRGSKVKILRPESYWYNEVGTVASVDQTPGVKYPVIVRFDKVNYTGYSGSASGVNTNNFALHEVQEVAPPKKGK
>P31969 ~~~psaE~~~Photosystem I reaction center subunit IV~~~
MAIERGSKVKILRKESYWYGDVGTVASIDKSGIIYPVIVRFNKVNYNGFSGSAGGLNTNNFAEHELEVVG
>P23077 ~~~psaE~~~Photosystem I reaction center subunit IV~~~
MAIARGDKVRILRPESYWFNEVGTVASVDQSGIKYPVVVRFEKVNYNGFSGSDGGVNTNNFAEAELQVVAAAAKK
>P12975 ~~~psaE~~~Photosystem I reaction center subunit IV~~~
MALNRGDKVRIKRTESYWYGDVGTVASVEKSGILYPVIVRFDRVNYNGFSGSASGVNTNNFAENELELVQAAAK
>P0A423 ~~~psaE~~~Photosystem I reaction center subunit IV~~~
MVQRGSKVKILRPESYWYNEVGTVASVDQTPGVKYPVIVRFDKVNYTGYSGSASGVNTNNFALHEVQEVAPPKKGK
>P31090 ~~~psaE~~~Photosystem I reaction center subunit IV~~~
MVQRGSKVRILRPESYWFQDVGTVASVDQSGIKYPVIVRFDKVNYSGINTNNFAVDELIEVEAPKAKPAKK
>Q7NH05 ~~~psaF~~~Photosystem I reaction center subunit III~~~
MSNKQSRVPFGAALLGILTLLLLFETGAFAQTQVKDPLKLCKDVPAYQELKTQRLEAAQKAQADGKPVTFNEAGTKQKFE
RYDTAYCGQDGYPHLITSGQLDRAGDFLIPSVLFLWIAGALGWAGRLYLAESKGPEDEIIIDLPKAIKCLLLGLIWPVQA
IPELISGKIRVPEDRVTISPR
>O31127 ~~~psaF~~~Photosystem I reaction center subunit III~~~
MQRLFALILAIFLWFHFAPPAQALGADLVPCSESPAFQQRAQVARNTTADPESGKKRFERYSQALCGPEGLPHLIVDGRL
DHAGDFLIPSILFLYIAGWIGWVGRAYLQTIKKQGSNVEQKEIQIELPIALPIMLSGFAWPAAAIKEFLSGELTAKDEEI
TISPR
>P58564 ~~~psaF~~~Photosystem I reaction center subunit III~~~
MRRLFALILVICLSFSFAPPAKALGADLTPCAENPAFQALAKNARNTTADPQSGQKRFERYSQALCGPEGYPHLIVDGRL
DRAGDFLIPSILFLYIAGWIGWVGRAYLQAIKKDSDTEQKEIQLDLGIALPIIATGFAWPAAAVKELLSGELTAKDSEIT
VSPR
>Q9X7I4 ~~~psaF~~~Photosystem I reaction center subunit III~~~
MRRLFSILLSAFLLLGLAPIVNAAGEAVNADRAATDFTASALTTCSENTRFNERASQATTPKDIARFERYSKASCGDDGL
PHLVIAATIEPWGALANRHHEGDILIPGHIFIYVAGIIGWSGREYLRASKKTKNPAENEIIIDFALARQCLIKGAAWPVE
ANKQGRSGDLREKDENISLNGPR
>P0A402 ~~~psaF~~~Photosystem I reaction center subunit III~~~
MRRFLALLLVLTLWLGFTPLASADVAGLVPCKDSPAFQKRAAAAVNTTADPASGQKRFERYSQALCGEDGLPHLVVDGRL
SRAGDFLIPSVLFLYIAGWIGWVGRAYLIAVRNSGEANEKEIIIDVPLAIKCMLTGFAWPLAALKELASGELTAKDNEIT
VSPR
>P31083 ~~~psaF~~~Photosystem I reaction center subunit III~~~
MRRLFAVVLAACLWLGFAPQASADVAGLTPCSESPRFIQRAEAAATPQAKARFENYSQALCGADGLPHLIVDGRLDHAGD
FIIPSLLFLYIAGWIGWVGRSYLQAIKSDKDAAGKEIVIDVPLAVKFSLTGFAWPLAAFQEFSSGKLLAKADEITVSPR
>P29256 ~~~psaF~~~Photosystem I reaction center subunit III~~~
MKHLLALLLAFTLWFNFAPSASADDFANLTPCSENPAYLAKSKNFLNTTNDPNSGKIRAERYASALCGPEGYPHLIVDGR
FTHAGDFLIPSILFLYIAGWIGWVGRSYLIEIRESKNPEMQEVVINVPLAIKKMLGGFLWPLAAVGEYTSGKLVMKDSEI
PTSPR
>P0A401 ~~~psaF~~~Photosystem I reaction center subunit III~~~
MRRFLALLLVLTLWLGFTPLASADVAGLVPCKDSPAFQKRAAAAVNTTADPASGQKRFERYSQALCGEDGLPHLVVDGRL
SRAGDFLIPSVLFLYIAGWIGWVGRAYLIAVRNSGEANEKEIIIDVPLAIKCMLTGFAWPLAALKELASGELTAKDNEIT
VSPR
>P58560 ~~~psaI~~~Photosystem I reaction center subunit VIII~~~
MATAFLPSILADASFLSSIFVPVIGWVVPIATFSFLFLYIEREDVA
>Q55330 ~~~psaI~~~Photosystem I reaction center subunit VIII~~~
MDGSYAASYLPWILIPMVGWLFPAVTMGLLFIHIESEGEG
>P0A427 ~~~psaI~~~Photosystem I reaction center subunit VIII~~~
MMGSYAASFLPWIFIPVVCWLMPTVVMGLLFLYIEGEA
>P23079 ~~~psaI~~~Photosystem I reaction center subunit VIII~~~
MATAFLPSILADASFLSSIFVPVIGWVVPIATFSFLFLYIEGEDVA
>B0C7S6 ~~~psaJ~~~Photosystem I reaction center subunit IX~~~
MGDVPLKIDSEKDFMKFFSTAPVIALVFFTLTAGFLVELNRFFPDILFFPY
>P58568 ~~~psaJ~~~Photosystem I reaction center subunit IX~~~
MADKADQSSYLIKFISTAPVAATIWLTITAGILIEFNRFFPDLLFHPLP
>Q31NU0 ~~~psaJ~~~Photosystem I reaction center subunit IX~~~
MDGLKRYLSSAPILATIWFAITAGILIEFNRFFPDLLFHPL
>Q55329 ~~~psaJ~~~Photosystem I reaction center subunit IX~~~
MDGLKSFLSTAPVMIMALLTFTAGILIEFNRFYPDLLFHP
>P0A429 ~~~psaJ~~~Photosystem I reaction center subunit IX~~~
MKHFLTYLSTAPVLAAIWMTITAGILIEFNRFYPDLLFHPL
>P23075 ~~~psaJ~~~Photosystem I reaction center subunit IX~~~
MKHFLTYLSTAPVLALXXXXXXAGLLIEINRFFPDALTFPFFSF
>P23080 ~~~psaJ~~~Photosystem I reaction center subunit IX~~~
MADKADQSSYLIKFISTAPVAATIWLIITAGILIEFNRFFPDLLFHPLP
>P58583 ~~~psaK1~~~Photosystem I reaction center subunit PsaK 1~~~
MLTSTLLAAATTPLEWSPTVGIIMVIANVIAITFGRQTIKYPSAEPALPSAKFFGGFGAPALLATTAFGHILGVGLVLGL
HNLGRI
>P72712 ~~~psaK1~~~Photosystem I reaction center subunit PsaK 1~~~
MHSFLLATAVPATLSWSPKVAGVMIACNILAIAFGKLTIKQQNVGTPMPSSNFFGGFGLGAVLGTASFGHILGAGVILGL
ANMGVL
>P74564 ~~~psaK2~~~Photosystem I reaction center subunit PsaK 2~~~
MFNTALLLAQASPTTAGWSLSVGIIMCLCNVFAFVIGYFAIQKTGKGKDLALPQLASKKTFGLPELLATMSFGHILGAGM
VLGLASSGIL
>P0A426 ~~~psaK~~~Photosystem I reaction center subunit PsaK~~~
MVLATLPDTTWTPSVGLVVILCNLFAIALGRYAIQSRGKGPGLPIALPALFEGFGLPELLATTSFGHLLAAGVVSGLQYA
GAL
>P0A425 ~~~psaK~~~Photosystem I reaction center subunit PsaK~~~
MVLATLPDTTWTPSVGLVVILCNLFAIALGRYAIQSRGKGPGLPIALPALFEGFGLPELLATTSFGHLLAAGVVSGLQYA
GAL
>P23318 ~~~psaK~~~Photosystem I reaction center subunit PsaK~~~
MVLATTLPDTTWTPSVGLVVILSNLFAIALGRYAIQSRGKGPGLPIALPALFEGFGLPELLATTSFGHLLAAGVVSVGLQ
YAGAL
>P23317 ~~~psaK~~~Photosystem I reaction center subunit PsaK~~~
MLTSTLLAAATTPLEWSPTIGIIMVIANVIAITFGRQTIKYPSAEPALPSAKFFGGFGAPALLATTAFGHILGVGIILGL
HNLGRF
>Q7NIE7 ~~~psaL~~~Photosystem I reaction center subunit XI~~~
MTLARYVYTPDPQEGTLLTPVNNSTAIRWFIDNLPINRVGMDEFTRGLEIGMAHGYWLIGPFALLGPLRNTELGLVAGLV
STIGLLLISTIGLSGYASLVEDVPTEFDRKGWSRLAGGFLVGGVGGAIFAFAILQFFPLVSAIARIP
>P58577 ~~~psaL~~~Photosystem I reaction center subunit XI~~~
MAQAVDASKNLPSDPRNREVVFPAGRDPQWGNLETPVNASPLVKWFINNLPAYRPGLTPFRRGLEVGMAHGYFLFGPFAK
LGPLRDAANANLAGLLGAIGLVVLFTLALSLYANSNPPTALASVTVPNPPDAFQSKEGWNNFASAFLIGGIGGAVVAYFL
TSNLALIQGLVG
>P95822 ~~~psaL~~~Photosystem I reaction center subunit XI~~~
MAQDVIANGGTPEIGNLATPINSSPFTRTFINALPIYRRGLSSNRRGLEIGMAHGFLLYGPFSILGPLRNTETAGSAGLL
ATVGLVVILTVCLSLYGNAGSGPSAAESTVTTPNPPQELFTKEGWSEFTSGFILGGLGGAFFAFYLASTPYVQPLVKIAA
GVWSVH
>P31084 ~~~psaL~~~Photosystem I reaction center subunit XI~~~
AQDVIANGGTPEIGDLATPTNSSPFTRTFINALPIYRRGLSSNRRGLEIGMAHGFLLYGPLSILGPLRNTETAGSAGLLA
TVGLVVILTVCLSLYGNAGSGPSAAESTVTTPNPPQELFTKEGWSEFTSGFILGGLGGAFFAFYLASTPYVQPLVKIAAG
VWSVH
>P37277 ~~~psaL~~~Photosystem I reaction center subunit XI~~~
MAESNQVVQAYNGDPFVGHLSTPISDSAFTRTFIGNLPAYRKGLSPILRGLEVGMAHGYFLIGPWTLLGPLRDSEYQYIG
GLIGALALILVATAALSSYGLVTFQGEQGSGDTLQTADGWSQFAAGFFVGGMGGAFVAYFLLENLSVVDGIFRGLFN
>Q8DGB4 ~~~psaL~~~Photosystem I reaction center subunit XI~~~
MAEELVKPYNGDPFVGHLSTPISDSGLVKTFIGNLPAYRQGLSPILRGLEVGMAHGYFLIGPWVKLGPLRDSDVANLGGL
ISGIALILVATACLAAYGLVSFQKGGSSSDPLKTSEGWSQFTAGFFVGAMGSAFVAFFLLENFSVVDGIMTGLFN
>P31092 ~~~psaL~~~Photosystem I reaction center subunit XI~~~
MAQAVDASKNLPSDPRNREVVFPAGRDPQWGNLETPVNASPLVKWFINNLPAYRPGLTPFRRGLEVGMAHGYFLFGPFAK
LGPLRDAANANLAGLLGAIGLVVLFTLSLSLYANSNPPKALASVTVPNPPDAFQSKEGWNNFASAFLIGGIGGAVVAYFL
TSNFALIQGLVG
>Q7NHY3 ~~~psaM~~~Photosystem I reaction center subunit XII~~~
MAATVVSGAQVAIAFVVALIAGIAALLLSTALGK
>Q8YNB0 ~~~psaM~~~Photosystem I reaction center subunit XII~~~
MSSISDTQVYIALVVALIPGLLAWRLATELYK
>P72986 ~~~psaM~~~Photosystem I reaction center subunit XII~~~
MALSDTQILAALVVALLPAFLAFRLSTELYK
>P0A403 ~~~psaM~~~Photosystem I reaction center subunit XII~~~
MALTDTQVYVALVIALLPAVLAFRLSTELYK
>P58566 ~~~psaX~~~Photosystem I 4.8 kDa protein~~~
MAKAKISPVANTGAKPPYTFRTGWALLLLAVNFLVAAYYFHIIQ
>P23319 ~~~psaX~~~Photosystem I 4.8 kDa protein~~~
MAKAKTPAVANTGAKPPYTFRTAWALLLLGVNFLVAAYYFHIIQ
>P0C029 ~~~psaZ~~~Photosystem I reaction center subunit Z~~~
MQSYNVFPALVIITTLVVPFMAAAALLFIIERDPS
>A0QZ46 ~~~prcA~~~Proteasome subunit alpha~~~COG0638
MSFPYFISPEQAMRERSELARKGIARGRSVVALAYSEGVLFVAENPSRSLQKVSELYDRVGFAAVGRFNEFDNLRRGGIQ
FADTRGYAYDRRDVTGRQLANVYAQTLGTIFTEQAKPYEVELCVAEVAHYGETKAPELYRITYDGSIADEPHFVVMGGTT
EPIIAALNESYTENASLQDAVEIAVKALSASAEGAEPRSLGPSTLEVAILDAGRPRRAFRRITGAALEALLPEQPQQADS
GDKPTE
>P9WHU1 ~~~prcA~~~Proteasome subunit alpha~~~COG0638
MSFPYFISPEQAMRERSELARKGIARAKSVVALAYAGGVLFVAENPSRSLQKISELYDRVGFAAAGKFNEFDNLRRGGIQ
FADTRGYAYDRRDVTGRQLANVYAQTLGTIFTEQAKPYEVELCVAEVAHYGETKRPELYRITYDGSIADEPHFVVMGGTT
EPIANALKESYAENASLTDALRIAVAALRAGSADTSGGDQPTLGVASLEVAVLDANRPRRAFRRITGSALQALLVDQESP
QSDGESSG
>Q7AKQ6 ~~~prcA~~~Proteasome subunit alpha~~~COG0638
MSTPFYVSPQQAMADRAEYARKGIARGRSLVVLQYADGIVFVGENPSRALHKFSEIYDRIGFAAAGKYNEYENLRIGGVR
YADLRGYTYDRDDVTARGLANVYAQTLGTIFSSQAEKPYEVELVVAEVGDSPENDQIYRLPHDGSIVDEHGSVAVGGNAE
QISGYLDQRHRDGMTLAEALKLAVQALSRDTNGTEREIPAERLEVAVLDRTRPQQRKFKRIVGGQLSRLLESGAASADGE
AETEAETDSGSDEE
>Q53079 3.4.25.1~~~prcB1~~~Proteasome subunit beta 1~~~
MTADRPALRTGDRDTRLSFGSNLSSFTDYLRGHAPELLPENRIGHRSHSTRGGDGMESGDLAPHGTTIVALTYKGGVLLA
GDRRATQGNLIASRDVEKVYVTDEYSAAGIAGTAGIAIELVRLFAVELEHYEKIEGVPLTFDGKANRLASMVRGNLGAAM
QGLAVVPLLVGYDLDADDESRAGRIVSYDVVGGRYEERAGYHAVGSGSLFAKSALKKIYSPDSDEETALRAAIESLYDAA
DDDSATGGPDLTRGIYPTAVTITQAGAVHVSEETTSELARRIVAERTEQGGSAR
>P74367 ~~~~~~Photosystem II lipoprotein Psb27~~~
MSFLKNQLSRLLALILVVAIGLTACDSGTGLTGNYSQDTLTVIATLREAIDLPQDAPNRQEVQDTARGQINDYISRYRRK
GDAGGLKSFTTMQTALNSLAGYYTSYGARPIPEKLKKRLQLEFTQAERSIERGV
>Q8DG60 ~~~~~~Photosystem II lipoprotein Psb27~~~
MKRFWAMVCALFLSVSLLLTSCANVPTGLTGNFREDTLALISSLREAIALPENDPNKKAAQAEARKKLNDFFALYRRDDS
LRSLSSFMTMQTALNSLAGHYSSYPNRPLPEKLKARLEQEFKQVELALDREAKS
>Q55356 ~~~~~~Photosystem II reaction center Psb28 protein~~~
MAEIQFSKGVAETVVPEVRLSKSKNGQSGMAKFYFLEPTILAKESTDDITGMYLIDDEGEIITREVKGKFINGRPTAIEA
TVILNSQPEWDRFMRFMERYGAENGLGFSKSE
>Q8DLJ8 ~~~~~~Photosystem II reaction center Psb28 protein~~~
MGAMAEIQFIRGINEEVVPDVRLTRARDGSSGQAMFYFDNPKIVQEGNLEVTGMYMVDEEGEIVTRDVNAKFINGQPVAI
EATYTMRSPQEWDRFIRFMDRYAASHGLGFQKSENS
>Q53083 3.4.25.1~~~prcB2~~~Proteasome subunit beta 2~~~
MTVDRAPRITDGDTRLSFGSNLSSFSEYLRVHAPEHLPQNRFADTGGVVMGGGDVAPHGTTIVAISYAGGVLLAGDRRAT
MGNLIASRDVQKVYVTDDYSAAGIAGTAGIAIELVRLFAVELEHYEKIEGVPLTFDGKANRLSSMVRGNLGAAMQGLAVV
PLLVGYDLDAVDPSRAGRIVSYDVVGGRYEERAGYHAVGSGSLFAKSALKKLYSPGIDEDTALRFAVEALYDAADDDSAT
GGPDLTRGIYPTAVTITSAGAVELSTAKAAEIAREIVAARTATASPEGESAL
>P04996 1.10.3.9~~~psbA1~~~Photosystem II protein D1 1~~~
MTSILREQRRDNVWDRFCEWVTSTDNRIYVGWFGVLMIPTLLTATICFIVAFIAAPPVDIDGIREPVAGSLMYGNNIISG
AVVPSSNAIGLHFYPIWEAASLDEWLYNGGPYQLVVFHFLLGISCYMGRQWELSYRLGMRPWICVAYSAPLSAAFAVFLI
YPIGQGSFSDGMPLGISGTFNFMFVFQAEHNILMHPFHMLGVAGVFGGSLFSAMHGSLVTSSLVRETTETESQNYGYKFG
QEEETYNIVAAHGYFGRLIFQYASFNNSRSLHFFLGAWPVVGIWFTSMGISTMAFNLNGFNFNQSVLDSQGKVINTWADV
LNRANLGMEVMHERNAHNFPLDLAAGEATPVALTAPSIHG
>P07826 1.10.3.9~~~psbA1~~~Photosystem II protein D1 1~~~
MTTTQLGLQEQSLWSRFCCWITSTSNRLYIGWFGVLMIPTLLTATTCFIIAFIAAPPVDIDGIREPIAGSLLYGNNIITA
AVVPSSNAIGLHFYPIWEAHSLDEWLYNGGPYQLIVFHFLIGIFCYLGRQWELSYRLGMRPWICVAYSAPVAAATATLLI
YSIGQGSFSDGLPLGISGTFNFMLVLQAEHNVLMHPFHMLGVAGVFGGALFAAMHGSLVTSSLIRETTEVESQNQGYKFG
QEEETYNIVAAHGYFGRLIFQYASFNNSRALHFFLGAWPVVGIWFAALAVCCFAFNLNGFNFNQSILDAQGRPVSTWADV
INRANIGFEVMHERNVHNFPLDLASGDAQMVALNAPAIEG
>P0A444 1.10.3.9~~~psbA1~~~Photosystem II protein D1 1~~~
MTTTLQRRESANLWERFCNWVTSTDNRLYVGWFGVIMIPTLLAATICFVIAFIAAPPVDIDGIREPVSGSLLYGNNIITG
AVVPSSNAIGLHFYPIWEAASLDEWLYNGGPYQLIIFHFLLGASCYMGRQWELSYRLGMRPWICVAYSAPLASAFAVFLI
YPIGQGSFSDGMPLGISGTFNFMIVFQAEHNILMHPFHQLGVAGVFGGALFCAMHGSLVTSSLIRETTETESANYGYKFG
QEEETYNIVAAHGYFGRLIFQYASFNNSRSLHFFLAAWPVVGVWFTALGISTMAFNLNGFNFNHSVIDAKGNVINTWADI
INRANLGMEVMHERNAHNFPLDLASAESAPVAMIAPSING
>P16033 1.10.3.9~~~psbA2~~~Photosystem II protein D1 2~~~
MTTTLQQRESASLWEQFCQWVTSTNNRIYVGWFGTLMIPTLLTATTCFIIAFIAAPPVDIDGIREPVAGSLLYGNNIISG
AVVPSSNAIGLHFYPIWEAASLDEWLYNGGPYQLVVFHFLIGIFCYMGRQWELSYRLGMRPWICVAYSAPVSAATAVFLI
YPIGQGSFSDGMPLGISGTFNFMIVFQAEHNILMHPFHMLGVAGVFGGSLFSAMHGSLVTSSLVRETTEVESQNYGYKFG
QEEETYNIVAAHGYFGRLIFQYASFNNSRSLHFFLGAWPVIGIWFTAMGVSTMAFNLNGFNFNQSILDSQGRVIGTWADV
LNRANIGFEVMHERNAHNFPLDLASGEQAPVALTAPAVNG
>P0A446 1.10.3.9~~~psbA2~~~Photosystem II protein D1 2~~~
MTTVLQRRQTANLWERFCDWITSTENRLYIGWFGVIMIPTLLAATICFVIAFIAAPPVDIDGIREPVSGSLLYGNNIITA
AVVPSSNAIGLHLYPIWDAASLDEWLYNGGPYQLIIFHFLIGIFCYMGREWELSYRLGMRPWIPVAFSAPVAAATAVLLI
YPIGQGSFSDGLMLGISGTFNFMIVFQAEHNILMHPFHMLGVAGVFGGALFAAMHGSLVTSSLIRETTETESTNYGYKFG
QEEETYNIVAAHGYFGRLIFQYASFNNSRSLHFFLAAWPVVGIWFAALGISTMAFNLNGFNFNHSVVDAQGNVINTWADI
INRANIGIEVMHERNAHNFPLDLASGELAPVAMIAPSIEA
>Q8DIV4 1.10.3.9~~~psbA3~~~Photosystem II protein D1 3~~~
MTTVLQRREQLNLWEQFCSWVTSTNNRLYVGWFGVLMIPTLLAATICFVIAFIAAPPVDIDGIREPVSGSLLYGNNIITG
AVVPSSNAIGLHFYPIWEAASLDEWLYNGGPYQLIIFHFLIGVFCYMGREWELSYRLGMRPWICVAYSAPVAAATAVFLI
YPIGQGSFSDGMPLGISGTFNFMLVFQAEHNILMHPFHQLGVAGVFGGALFSAMHGSLVTSSLIRETTETESANYGYKFG
QEEETYNIVAAHGYFGRLIFQYASFNNSRALHFFLAAWPVIGIWFTALGISTMAFNLNGFNFNHSVVDAQGNVINTWADI
INRANLGMEVMHERNAHNFPLDLASAESAPVAMIAPSING
>P46895 1.10.3.9~~~psbA~~~Photosystem II protein D1~~~
MTTIRQQRSSLLKGWPQFCEWVTSTDNRIYVGWFGVLMIPCLLAAAICFVVAFIAAPPVDIDGIREPVAGSFLYGNNIIS
GAVVPSSNAIGLHFYPIWEAATLDEWLYNGGPYQLVIFHFLIGICGWMGRQWELSYRLGMRPWICVAYSAPVSAAFAVFL
VYPFGQGSFSDGMPLGISGTFNFMFVFQAEHNILMHPFHMAGVAGMFGGSLFSAMHGSLVTSSLIRETTETESQNYGYKF
GQEEETYNIVAAHGYFGRLIFQYASFNNSRSLHFFLAIFPVVCVWLTSMGICTMAFNLNGFNFNQSVVDANGKVVPTWGD
VLNRANLGMEVMHERNAHNFPLDLAAAESTSVALVAPSIG
>P14660 1.10.3.9~~~psbA~~~Photosystem II protein D1~~~
MTTTLQQRESASLWEQFCQWVTSTNNRIYVGWFGTLMIPTLLTATTCFIIAFIAAPPVDIDGIREPVAGSLLYGNNIISG
AVVPSSNAIGLHFYPIWEAASLDEWLYNGGPYQLVVFQFLIGIFCYMGRQWELSYRLGMRPWICVAYSAPVSARTAVFLI
YPIGQGSFSDGMPLGISGTFNFMIVFQAEHNILMHPFHMLGVAGVFGGSLFSAMHGSLVTSSLVRETTEVESQNYGYKFG
QEEETYNIVAAHGYFGRLIFQYASFNNSRSLHFFLGAWPVIGIWFTAMGVSTMAFNLNGFNFNQSILDSQGRVIGTWADV
LNRANIGFEVMHERNAHNFPLDLASGEQAPVALTAPAING
>P51765 1.10.3.9~~~psbA~~~Photosystem II protein D1~~~
MTTTLQRRESANLWERFCNWVTSTDNRLYVGWFGVIMIPTLLAATICFVIAFIAAPPVDIDGIREPVSGSLLYGNNIITG
AVVPSSNAIGLHFYPIWEAASLDEWLYNGGPYQLIIFHFLLGASCYMGRQWELSYRLGMRPWICVAYSAPLASAFAVFLI
YPIGQGSFSDGMPLGISGTFNFMIVFQAEHNILMHPFHQLGVAGVFGGALFCAMHGSLVTSSLIRETTETESANYGYKFG
QEEETYNIVAAHGYFGRLIFQYASFNNSRSLHFFLAAWRVVGVWFAALGISTMAFNLNGFNFNHSVIDAKGNVINTWADI
INRANLGMEVMHERNAHNFPLDLASAESAPVAMIAPSING
>P05429 ~~~psbB~~~Photosystem II CP47 reaction center protein~~~
MGLPWYRVHTVVLNDPGRLISVHLMHTALVAGWAGSMALYELAIFDSSDAVLNPMWRQGMFVLPFMARLGVTSSWNGWSV
TGETGLDPGFWSFEGVAAAHIVLSGLLFLAAVWHWVFWDLELFVDPRTGESALDLPKMFGIHLFLSGLLCFGFGAFHLTG
VWGPGMWVSDPYGLTGHVQPVAPEWGPAGFNPFNPGGVVAHHIAAGIVGIIAGLFHLTVRPPERLYKALRMGNIETVLSS
SIAAVFFAAFVVAGTMWYGNATTPIELFGPTRYQWDKGYFQEEIQRRVDSQLAEGASLSEAWSTIPEKLAFYDYVGNSPA
KGGLFRTGAMNSGDGIAQEWIGHPIFKDKEGRELEVRRMPNFFETFPVIMTDADGVVRADIPFRRSESKFSVEQTGVTVS
FYGGALDGQTFSNPSDVKKFARKAQLGEGFDFDTETFNSDGVFRTSPRGWFTFGHAVFALLFFFGHIWHGSRTLFRDVFA
GVDPGLEEQVEFGVFAKVGDLSTRKEA
>Q8DIQ1 ~~~psbB~~~Photosystem II CP47 reaction center protein~~~
MGLPWYRVHTVLINDPGRLIAAHLMHTALVAGWAGSMALYELATFDPSDPVLNPMWRQGMFVLPFMARLGVTGSWSGWSI
TGETGIDPGFWSFEGVALAHIVLSGLLFLAACWHWVYWDLELFRDPRTGEPALDLPKMFGIHLFLAGLLCFGFGAFHLTG
LFGPGMWVSDPYGLTGSVQPVAPEWGPDGFNPYNPGGVVAHHIAAGIVGIIAGLFHILVRPPQRLYKALRMGNIETVLSS
SIAAVFFAAFVVAGTMWYGSATTPIELFGPTRYQWDSSYFQQEINRRVQASLASGATLEEAWSAIPEKLAFYDYIGNNPA
KGGLFRTGPMNKGDGIAQAWKGHAVFRNKEGEELFVRRMPAFFESFPVILTDKNGVVKADIPFRRAESKYSFEQQGVTVS
FYGGELNGQTFTDPPTVKSYARKAIFGEIFEFDTETLNSDGIFRTSPRGWFTFAHAVFALLFFFGHIWHGARTLFRDVFS
GIDPELSPEQVEWGFYQKVGDVTTRRKEAV
>P09193 ~~~psbC~~~Photosystem II CP43 reaction center protein~~~
MVTLSNTSMVGGRDLPSTGFAWWSGNARLINLSGKLLGAHVAHAGLIVFWAGAMTLFEVAHFIPEKPMYEQGLILLPHIA
TLGWGVGPAGEVTDIFPFFVVGVLHLISSAVLGLGGIYHALRGPEVLEEYSSFFGYDWKDKNQMTNIIGYHLILLGCGAL
LLVFKAMFFGGVYDTWAPGGGDVRVITNPTLNPAIIFGYLLKAPFGGEGWIISVNNMEDIIGGHIWIGLICISGGIWHIL
TKPFGWARRALIWSGEAYLSYSLGALSLMGFIASVFVWFNNTAYPSEFYGPTGMEASQSQAFTFLVRDQRLGANIASAQG
PTGLGKYLMRSPSGEIIFGGETMRFWDFRGPWLEPLRGPNGLDLDKLRNDIQPWQVRRAAEYMTHAPLGSLNSVGGVITD
VNSFNYVSPRAWLATSHFVLGFFFLVGHLWHAGRARAAAAGFEKGIDRETEPTLFMPDLD
>Q8DIF8 ~~~psbC~~~Photosystem II CP43 reaction center protein~~~
MVTLSSNSIFATNRDQESSGFAWWAGNARLINLSGKLLGAHVAHAGLIVFWAGAMTLFELAHFIPEKPMYEQGLILIPHI
ATLGWGVGPGGEVVDTFPFFVVGVVHLISSAVLGFGGVYHAIRGPETLEEYSSFFGYDWKDKNKMTTILGFHLIVLGIGA
LLLVAKAMFFGGLYDTWAPGGGDVRVITNPTLDPRVIFGYLLKSPFGGEGWIVSVNNLEDVVGGHIWIGLICIAGGIWHI
LTTPFGWARRAFIWSGEAYLSYSLGALSMMGFIATCFVWFNNTVYPSEFYGPTGPEASQAQAMTFLIRDQKLGANVGSAQ
GPTGLGKYLMRSPTGEIIFGGETMRFWDFRGPWLEPLRGPNGLDLNKIKNDIQPWQERRAAEYMTHAPLGSLNSVGGVAT
EINSVNFVSPRSWLATSHFVLAFFFLVGHLWHAGRARAAAAGFEKGIDRESEPVLSMPSLD
>P09192 1.10.3.9~~~psbD~~~Photosystem II D2 protein~~~
MTIAVGRAPVERGWFDVLDDWLKRDRFVFIGWSGLLLFPCAFMALGGWLTGTTFVTSWYTHGLASSYLEGANFLTVAVSS
PADAFGHSLLFLWGPEAQGNLTRWFQIGGLWPFVALHGAFGLIGFMLRQFEISRLVGIRPYNAIAFSGPIAVFVSVFLMY
PLGQSSWFFAPSFGVAGIFRFILFLQGFHNWTLNPFHMMGVAGILGGALLCAIHGATVENTLFEDGEDSNTFRAFEPTQA
EETYSMVTANRFWSQIFGIAFSNKRWLHFFMLFVPVTGLWMSSVGIVGLALNLRAYDFVSQELRAAEDPEFETFYTKNIL
LNEGMRAWMAPQDQPHENFIFPEEVLPRGNAL
>Q8CM25 1.10.3.9~~~psbD1~~~Photosystem II D2 protein~~~
MTIAIGRAPAERGWFDILDDWLKRDRFVFVGWSGILLFPCAYLALGGWLTGTTFVTSWYTHGLASSYLEGCNFLTVAVST
PANSMGHSLLLLWGPEAQGDFTRWCQLGGLWTFIALHGAFGLIGFMLRQFEIARLVGVRPYNAIAFSAPIAVFVSVFLIY
PLGQSSWFFAPSFGVAAIFRFLLFFQGFHNWTLNPFHMMGVAGVLGGALLCAIHGATVENTLFQDGEGASTFRAFNPTQA
EETYSMVTANRFWSQIFGIAFSNKRWLHFFMLFVPVTGLWMSAIGVVGLALNLRSYDFISQEIRAAEDPEFETFYTKNLL
LNEGIRAWMAPQDQPHENFVFPEEVLPRGNAL
>P09190 ~~~psbE~~~Cytochrome b559 subunit alpha~~~
MSGTTGERPFSDIVTSIRYWVIHSITIPMLFIAGWLFVSTGLAYDAFGTPRPDEYFTQTRQELPILQERYDINQEIQEFN
Q
>Q8DIP0 ~~~psbE~~~Cytochrome b559 subunit alpha~~~
MAGTTGERPFSDIITSVRYWVIHSITIPALFIAGWLFVSTGLAYDVFGTPRPDSYYAQEQRSIPLVTDRFEAKQQVETFL
EQLK
>P12238 ~~~psbE~~~Cytochrome b559 subunit alpha~~~
MAGTTGERPFSDIITSVRYWVIHSITIPALFIAGWLFVSTGLAYDVFGTPRPDSYYAQEQRSIPLVTDRFEAKQQVETFL
EQLK
>P09191 ~~~psbF~~~Cytochrome b559 subunit beta~~~
MATQNPNQPVTYPIFTVRWLAVHTLAVPSVFFVGAIAAMQFIQR
>Q8DIN9 ~~~psbF~~~Cytochrome b559 subunit beta~~~
MTSNTPNQEPVSYPIFTVRWVAVHTLAVPTIFFLGAIAAMQFIQR
>P12239 ~~~psbF~~~Cytochrome b559 subunit beta~~~
MTSNTPNQEPVSYPIFTVRWVAVHTLAVPTIFFLGAIAAMQFIQR
>P14835 ~~~psbH~~~Photosystem II reaction center protein H~~~
MAQRTRLGDILRPLNSEYGKVVPGWGTTPVMGVFMALFLVFLLIILQIYNSSLILEGFSVDWAG
>Q8DJ43 ~~~psbH~~~Photosystem II reaction center protein H~~~
MARRTWLGDILRPLNSEYGKVAPGWGTTPLMAVFMGLFLVFLLIILEIYNSTLILDGVNVSWKALG
>Q54697 ~~~psbI~~~Photosystem II reaction center protein I~~~
MLTLKIAVYIVVGLFISLFIFGFLSSDPTRNPGRKDFE
>Q8DJZ6 ~~~psbI~~~Photosystem II reaction center protein I~~~
METLKITVYIVVTFFVLLFVFGFLSGDPARNPKRKDLE
>P12240 ~~~psbI~~~Photosystem II reaction center protein I~~~
METLKITVYIVVTFFVLLFVFGFLSGDPARNPKRKDLE
>P73070 ~~~psbJ~~~Photosystem II reaction center protein J~~~
MFAEGRIPLWVVGVVAGIGAIGVLGLFFYGAYAGLGSSM
>P59087 ~~~psbJ~~~Photosystem II reaction center protein J~~~
MMSEGGRIPLWIVATVAGMGVIVIVGLFFYGAYAGLGSSL
>Q7DGD4 ~~~psbJ~~~Photosystem II reaction center protein J~~~
MMSEGGRIPLWIVATVAGMGVIVIVGLFFYGAYAGLGSSL
>P15819 ~~~psbK~~~Photosystem II reaction center protein K~~~
METIYLLAKLPEAYQIFDPLVDVLPVIPLFFLALAFVWQAAVGFK
>Q9F1K9 ~~~psbK~~~Photosystem II reaction center protein K~~~
MIDALVLVAKLPEAYAIFDPLVDVLPVIPVLFLALAFVWQAAVGFR
>Q7M157 ~~~psbL~~~Photosystem II reaction center protein L~~~
MKNTNPNSQPVELNRTSLFLGRLLIFVLGILFSSYIFN
>Q55354 ~~~psbL~~~Photosystem II reaction center protein L~~~
MDRNSNPNRQPVELNRTSLYLGLLLVAVLGILFSSYFFN
>Q8DIN8 ~~~psbL~~~Photosystem II reaction center protein L~~~
MEPNPNRQPVELNRTSLYLGLLLILVLALLFSSYFFN
>P12241 ~~~psbL~~~Photosystem II reaction center protein L~~~
MEPNPNRQPVELNRTSLYLGLLLILVLALLFSSYFFN
>P72701 ~~~psbM~~~Photosystem II reaction center protein M~~~
MQVNNLGFIASILFVLVPTVFLLILFIQTGKQSES
>Q8DHA7 ~~~psbM~~~Photosystem II reaction center protein M~~~
MEVNQLGLIATALFVLVPSVFLIILYVQTESQQKSS
>P12312 ~~~psbM~~~Photosystem II reaction center protein M~~~
MEVNQLGFIATALFVLVPSVFLIILYVQTESQQKSS
>P11472 ~~~psbO~~~Photosystem II manganese-stabilizing polypeptide~~~
MRYRAFLAAFLAVCLGVLTACSSGPTAADLGTLTYDQIKDTGLANKCLSLKESARGTIPLEAGKKYALTDLCLEPQEFFV
KEEPGNKRQKAEFVPGKVLTRYTSSLDQVYGDLALKADGTVSFTEKGGIDFQAITVLLPGGEEVPFLFTVKGLVASTSEP
ATSINTSTDLRGGYRVPSYRTSNFLDPKARGLTTGYESAVAIPSAGDAEDLTKENVKRFVTGQGEISLAVSKVDGATGEV
AGVFTAIQPSDTDMGGKEAVDVKLVGQFYGRIEPADA
>P0A432 ~~~psbO~~~Photosystem II manganese-stabilizing polypeptide~~~
MKYRILMATLLAVCLGIFSLSAPAFAAKQTLTYDDIVGTGLANKCPTLDDTARGAYPIDSSQTYRIARLCLQPTTFLVKE
EPKNKRQEAEFVPTKLVTRETTSLDQIQGELKVNSDGSLTFVEEDGIDFQPVTVQMAGGERIPLLFTVKNLVASTQPNVT
SITTSTDFKGEFNVPSYRTANFLDPKGRGLASGYDSAIALPQAKEEELARANVKRFSLTKGQISLNVAKVDGRTGEIAGT
FESEQLSDDDMGAHEPHEVKIQGVFYASIEPA
>P10549 ~~~psbO~~~Photosystem II manganese-stabilizing polypeptide~~~
MRFRPSIVALLSVCFGLLTFLYSGSAFAVDKSQLTYDDIVNTGLANVCPEISSFTRGTIEVEPNTKYFVSDFCMEPQEYF
VKEEPVNKRQKAEYVKGKVLTRQTTSLEQIRGSIAVGADGTLTFKEKDGIDFQPITVLLPGGEEVPFFFTVKNFTGTTEP
GFTSINSSTDFVGDFNVPSYRGAGFLDPKARGLYTGYDNAVALPSAADKFRTNKKETPLGKGTLSLQVTQVDGSTGEIAG
IFESEQPSDTDLGAKEPLDVKVRGIFYGRVDTDV
>P0A431 ~~~psbO~~~Photosystem II manganese-stabilizing polypeptide~~~
MKYRILMATLLAVCLGIFSLSAPAFAAKQTLTYDDIVGTGLANKCPTLDDTARGAYPIDSSQTYRIARLCLQPTTFLVKE
EPKNKRQEAEFVPTKLVTRETTSLDQIQGELKVNSDGSLTFVEEDGIDFQPVTVQMAGGERIPLLFTVKNLVASTQPNVT
SITTSTDFKGEFNVPSYRTANFLDPKGRGLASGYDSAIALPQAKEEELARANVKRFSLTKGQISLNVAKVDGRTGEIAGT
FESEQLSDDDMGAHEPHEVKIQGVFYASIEPA
>D0VWR2 ~~~psbO~~~Photosystem II manganese-stabilizing polypeptide~~~
QTLTYDDIVGTGLANKCPTLDDTARGAYPIDSSQTYRIARLCLQPTTFLVKEEPKNKRQEAEFVPTKLVTRETTSLDQIQ
GELKVNSDGSLTFVEEDGIDFQPVTVQMAGGERIPLLFTVKNLVASTQPNVTSITTSTDFKGEFNVPSYRTANFLDPKGR
GLASGYDSAIALPQAKEEELARANVKRFSLTKGQISLNVAKVDGRTGEIAGTFESEQLSDDDMGAHEPHEVKIQGVFYAS
IEPA
>P74787 ~~~psbT~~~Photosystem II reaction center protein T~~~
MESVAYILVLTMALAVLFFAIAFREPPRIEK
>Q8DIQ0 ~~~psbT~~~Photosystem II reaction center protein T~~~
METITYVFIFACIIALFFFAIFFREPPRITKK
>P12313 ~~~psbT~~~Photosystem II reaction center protein T~~~
METITYVFIFACIIALFFFAIFFREPPRITKK
>P20094 ~~~psbU~~~Photosystem II 12 kDa extrinsic protein~~~
MKRLVGVLMILGLMLTSWGLLGSPQTAIAASLSPLSFNPSPVLAEQQFRNAMDDKLATDFGKKIDLNNTNVRAFMQYPGM
YPTLARMILKNAPFESVEDVLKMPGLTDTQKEILKNNFSNFVVSPPLDALVEGGDRFNNGIYR
>P74765 ~~~psbU~~~Photosystem II 12 kDa extrinsic protein~~~COG1555
MSRVVSALMGLVLMFGCAFFSVQPQAQALDLSNGFVSAAVLGERVNPADKVLESEYGKKIDLNNASVRLFRELRGFYPTL
AKRIIENAPYDSVEDVLNIPDLSEKQLARLEENLERFTVTPPADVFIDGDQRLNTGDY
>Q55332 ~~~psbU~~~Photosystem II 12 kDa extrinsic protein~~~COG1555
MKFISRLLVACSLLIGLMGFLGADLAQALTPNPILAELNAVDAKLTTDFGQKIDLNNSDIRDFRGLRGFYPNLASEIIKN
APYDTVEEVLDIPGLSETQKSRLEANLGSFTVTEPSIELTSGDDRINPGVY
>Q9F1L5 ~~~psbU~~~Photosystem II 12 kDa extrinsic protein~~~COG1555
MQRLGRWLALAYFVGVSLLGWINWSAPTLAATASTEEELVNVVDEKLGTAYGEKIDLNNTNIAAFIQYRGLYPTLAKLIV
KNAPYESVEDVLNIPGLTERQKQILRENLEHFTVTEVETALVEGGDRYNNGLYK
>P56152 ~~~psbU~~~Photosystem II 12 kDa extrinsic protein~~~
ATASTEEELVNVVDEKLGTAYGEKIDLNNTNIAAFIQYRGLYPTLAKLIVKNAPYESVEDVLNIPGLTERQKQILRENLE
HFTVTEVETALVEGGDRYNNGLYK
>Q8DJE2 ~~~psbV2~~~Cytochrome c-550-like protein~~~COG2010
MYQPHFWQRSIGWLCGGLLILLLGWTIAPATALAAAGVDNYVIQYLKVTDTVELPVNDRGETKTFTAVDLTRGKRLFEEN
CKNCHVGGSTLPNPLVSLSLKDLKGATPPRDTIASLVAFQRSPKSYDGSEESYSCRRVSEDWLTTEQLETLAAFILRAAA
VAPGWGVESFPDSAP
>P72575 ~~~psbX~~~Photosystem II reaction center X protein~~~
MTPSLANFLWSLVLGAAIVLIPATVGLIFISQKDKITRS
>Q9F1R6 ~~~psbX~~~Photosystem II reaction center X protein~~~
MTITPSLKGFFIGLLSGAVVLGLTFAVLIAISQIDKVQRSL
>P73676 ~~~psbY~~~Photosystem II protein Y~~~
MDWRVIVVVSPLLIAATWAAINIGAAAIRQLQDVLGREA
>Q8DKM3 ~~~psbY~~~Photosystem II protein Y~~~
MDWRVLVVLLPVLLAAGWAVRNILPYAVKQVQKLLQKAKAA
>P73528 ~~~psbZ~~~Photosystem II reaction center protein Z~~~
MSIVFQIALAALVLFSFVMVVGVPVAYASPQNWDRSKPLLYLGSGIWAILVIVVALLNFLVV
>Q8DHJ2 ~~~psbZ~~~Photosystem II reaction center protein Z~~~
MTILFQLALAALVILSFVMVIGVPVAYASPQDWDRSKQLIFLGSGLWIALVLVVGVLNFFVV
>D0VWR5 ~~~psbZ~~~Photosystem II reaction center protein Z~~~
MTILFQLALAALVILSFVMVIGVPVAYASPQDWDRSKQLIFLGSGLWIALVLVVGVLNFFVV
>A0QZ47 3.4.25.1~~~prcB~~~Proteasome subunit beta~~~COG0638
MTWRDNQSFPQPTLNTTGIPSVPVDLSSFSELLSRQAPELLPVNRVAYGTTPVGPTDAVPHGTTIVALKYPGGVLIAGDR
RSTQGNMIAGRDVQKVYITDDYTATGIAGTAAIAVEFARLYAVELEHYEKLEGVPLTFRGKVNRLAIMVRGNLGAALQGF
VALPLLVGYDLDDPHPEGAGRIVSFDAAGGWNIEEEGYQSVGSGSIFAKSSMKKLYSQVSDADSALKVAVEALYDAADDD
SATGGPDLVRGIYPTAVTIGAEGAEEVPETRIAELAREVIESRSRTDTFGPDARRGIDARGDS
>P9WHT9 3.4.25.1~~~prcB~~~Proteasome subunit beta~~~COG0638
MTWPLPDRLSINSLSGTPAVDLSSFTDFLRRQAPELLPASISGGAPLAGGDAQLPHGTTIVALKYPGGVVMAGDRRSTQG
NMISGRDVRKVYITDDYTATGIAGTAAVAVEFARLYAVELEHYEKLEGVPLTFAGKINRLAIMVRGNLAAAMQGLLALPL
LAGYDIHASDPQSAGRIVSFDAAGGWNIEEEGYQAVGSGSLFAKSSMKKLYSQVTDGDSGLRVAVEALYDAADDDSATGG
PDLVRGIFPTAVIIDADGAVDVPESRIAELARAIIESRSGADTFGSDGGEK
>Q7AKQ5 3.4.25.1~~~prcB~~~Proteasome subunit beta~~~COG0638
MEANTRSTGRLPAAFLTPGSSSFMDFLGEHQPEMLPGNRQLPPVQGVIEAPHGTTIVAVTFPGGVVLAGDRRATMGNMIA
QRDIEKVFPADEYSAVGIAGTAGLAVEMVKLFQLELEHFEKVEGAQLSLEGKANRLSTMIRSNLGMAMQGLAVVPLFAGY
DVDRGRGRIFSYDVTGGRSEERHFATTGSGSVFARGAMKKLFRDDLTEEQATTLVVQALYDAADDDSATGGPDVARRIYP
IITVITEDGFRRLGEDEAAELAGSVLQARLEQPDGPRAALL
>Q8KEP5 ~~~pscD~~~P840 reaction center 17 kDa protein~~~
MQPQLSRPQTASNQVRKAVSGPWSGNAVHKAEKYFITSAKRDRDGKLQIELVPASGRRKLSPTPEMIRRLIDGEIEIYIL
TTQPDIAIDMNKEIIDMENRYVIDFDKRGVKWTMREIPVFYHEGKGLCVELHNKIYTLDQFFK
>Q9I317 ~~~pscE~~~Type 3 secretion system chaperone PscE~~~
MMTALETRLSVADGTHAAALRQRLQAALAECRRELARGACPEHFQFLQQQARALEGGLGILSQLTED
>P95435 ~~~pscG~~~Type 3 secretion system chaperone PscG~~~
MDTSLIRELAELALAGSGQHCHEEALCIAEWLERLGQDEAARLIRISSLANQGRYQEALAFAHGNPWPALEPWFALCEWH
LGLGAALDRRLAGLGGSSDPALADFAAGMRAQVRT
>Q9I315 ~~~pscI~~~Type III inner-rod protein PscI~~~
MDISRMGAQAQITSLEELSGGPAGAAHVAEFERAMGGAGSLGGDLLSELGQIRERFSQAKQELQMELSTPGDDPNSLMQM
QWSLMRITMQEELIAKTVGRMSQNVETLMKTQ
>P39822 4.1.1.65~~~psd~~~Phosphatidylserine decarboxylase proenzyme~~~COG0688
MFNTAVKILYRSLIELTNHRLSSYLIKGFCESKISKPVIPLFSKHFRLNWDDVDGTAADYGSLSELFIRQINLERRPVSK
EAHAVVSPVDGVVQTVGIINPNQTFTVKGKDYSFAELTGCKSADHQYNGGYFVVLYLSPRHYHRFHSPISCRYQKLAELG
NRSYPVNQLGLKYGKDVLSKNYRFVYELNSGSRNVLMIPVGAMNINSIVQTNTRTELEIGEELGYFSFGSTVILVFEKDA
FQPSAHLAEGQEVQVGELIGYEE
>P0A8K1 4.1.1.65~~~psd~~~Phosphatidylserine decarboxylase proenzyme~~~COG0688
MLNSFKLSLQYILPKLWLTRLAGWGASKRAGWLTKLVIDLFVKYYKVDMKEAQKPDTASYRTFNEFFVRPLRDEVRPIDT
DPNVLVMPADGVISQLGKIEEDKILQAKGHNYSLEALLAGNYLMADLFRNGTFVTTYLSPRDYHRVHMPCNGILREMIYV
PGDLFSVNHLTAQNVPNLFARNERVICLFDTEFGPMAQILVGATIVGSIETVWAGTITPPREGIIKRWTWPAGENDGSVA
LLKGQEMGRFKLGSTVINLFAPGKVNLVEQLESLSVTKIGQPLAVSTETFVTPDAEPAPLPAEEIEAEHDASPLVDDKKD
QV
>P9WHQ5 4.1.1.65~~~psd~~~Phosphatidylserine decarboxylase proenzyme~~~COG0688
MARRPRPDGPQHLLALVRSAVPPVHPAGRPFIAAGLAIAAVGHRYRWLRGTGLLAAAACAGFFRHPQRVPPTRPAAIVAP
ADGVICAIDSAAPPAELSMGDTPLPRVSIFLSILDAHVQRAPVSGEVIAVQHRPGRFGSADLPEASDDNERTSVRIRMPN
GAEVVAVQIAGLVARRIVCDAHVGDKLAIGDTYGLIRFGSRLDTYLPAGAEPIVNVGQRAVAGETVLAECR
>Q0P8W4 4.2.1.115~~~pseB~~~UDP-N-acetylglucosamine 4,6-dehydratase (inverting)~~~COG1086
MFNKKNILITGGTGSFGKTYTKVLLENYKPNKIIIYSRDELKQFEMASVFNAPCMRYFIGDVRDKERLSAAMRDVDFVIH
AAAMKHVPIAEYNPMECIKTNIHGAQNVIDACFENGVKKCIALSTDKACNPVNLYGATKLASDKLFVAANNIAGNKQTRF
GVTRYGNVVGSRGSVVPFFKKLISEGAKELPITDTRMTRFWISLEDGVKFVLSNFERMHGGEIFIPKIPSMKITDLAHAL
APNLSHKIIGIRAGEKLHEIMISSDDSHLTYEFENYYAISPSIKFVDKDNDFSINALGEKGQKVKDGFSYSSDNNPLWAS
EKELLEIINHTEGF
>O25511 4.2.1.115~~~pseB~~~UDP-N-acetylglucosamine 4,6-dehydratase (inverting)~~~COG1086
MPNHQNMLDNQTILITGGTGSFGKCFVRKVLDTTNAKKIIVYSRDELKQSEMAMEFNDPRMRFFIGDVRDLERLNYALEG
VDICIHAAALKHVPIAEYNPLECIKTNIMGASNVINACLKNAISQVIALSTDKAANPINLYGATKLCSDKLFVSANNFKG
SSQTQFSVVRYGNVVGSRGSVVPFFKKLVQNKASEIPITDIRMTRFWITLDEGVSFVLKSLKRMHGGEIFVPKIPSMKMT
DLAKALAPNTPTKIIGIRPGEKLHEVMIPKDESHLALEFEDFFIIQPTISFQTPKDYTLTKLHEKGQKVAPDFEYSSHNN
NQWLEPDDLLKLL
>Q0P8W3 2.6.1.92~~~pseC~~~UDP-4-amino-4,6-dideoxy-N-acetyl-beta-L-altrosamine transaminase~~~COG0399
MLTYSHQNIDQSDIDTLTKALKDEILTGGKKVNEFEEALCEYMGVKHACVLNSATSALHLAYTALGVQEKIVLTTPLTFA
ATANAALMAGAKVEFIDIKNDGNIDEKKLEARLLKESENIGAISVVDFAGNSVEMDEISNLTKKYNIPLIDDASHALGAL
YKSEKVGKKADLSIFSFHPVKPITTFEGGAVVSDNEELIDKIKLLRSHGIVKKRLWDSDMVELGYNYRLSDVACALGINQ
LKKLDHNLEKREEIANFYDKEFEKNPYFSTIKIKDYKKSSRHLYPILLFPEFYCQKEELFESLLHAGIGVQVHYKPTYEF
SFYKKLLGEIKLQNADNFYKAELSIPCHQEMNLKDAKFVKDTLFSILEKVKKGYCG
>O25130 2.6.1.92~~~pseC~~~UDP-4-amino-4,6-dideoxy-N-acetyl-beta-L-altrosamine transaminase~~~COG0399
MKEFAYSEPCLDKEDKKAVLEVLNSKQLTQGKRSLLFEEALCEFLGVKHALVFNSATSALLTLYRNFSEFSADRNEIITT
PISFVATANMLLESGYTPVFAGIKNDGNIDELALEKLINERTKAIVSVDYAGKSVEVESVQKLCKKHSLSFLSDSSHALG
SEYQNKKVGGFALASVFSFHAIKPITTAEGGAVVTNDSELHEKMKLFRSHGMLKKDFFEGEVKSIGHNFRLNEIQSALGL
SQLKKAPFLMQKREEAALTYDRIFKDNPYFTPLHPLLKDKSSNHLYPILMHQKFFTCKKLILESLHKRGILAQVHYKPIY
QYQLYQQLFNTAPLKSAEDFYHAEISLPCHANLNLESVQNIAHSVLKTFESFKIE
>Q0P8U5 3.6.1.57~~~pseG~~~UDP-2,4-diacetamido-2,4,6-trideoxy-beta-L-altropyranose hydrolase~~~COG3980
MKVLFRSDSSSQIGFGHIKRDLVLAKQYSDVSFACLPLEGSLIDEIPYPVYELSSESIYELINLIKEEKFELLIIDHYGI
SVDDEKLIKLETGVKILSFDDEIKPHHCDILLNVNAYAKASDYEGLVPFKCEVRCGFSYALIREEFYQEAKENREKKYDF
FICMGGTDIKNLSLQIASELPKTKIISIATSSSNPNLKKLQKFAKLHNNIRLFIDHENIAKLMNESNKLIISASSLVNEA
LLLKANFKAICYVKNQESTATWLAKKGYEVEYKY
>O25094 2.3.1.202~~~pseH~~~UDP-4-amino-4,6-dideoxy-N-acetyl-beta-L-altrosamine N-acetyltransferase~~~COG1670
MKKNYSYKNIQAIDFTNLNDGEKLLVLEFRNHPNTALWMYSTFISLKTHLQFIEDLKNSPNHRYFLFKEEGVYLGVGSIT
KINFFHKHGYLGIYKNPFLKNGGETILKALEFIAFEEFQLHSLHLEVMENNFKAIAFYEKNHYELEGRLKGFISKDKEFI
DVLLYYKDKKGYNDQSLLKL
>Q0P8U0 2.5.1.97~~~pseI~~~Pseudaminic acid synthase~~~COG2089
MQIGNFNTDKKVFIIAELSANHAGSLEMALKSIKAAKKAGADAIKIQTYTPDSLTLNSDKEDFIIKGGLWDKRKLYELYE
SAKTPYEWHSQIFETAQNEGILCFSSPFAKEDVEFLKRFDPIAYKIASFEANDENFVRLIAKEKKPTIVSTGIATEEELF
KICEIFKEEKNPDLVFLKCTSTYPTAIEDMNLKGIVSLKEKFNVEVGLSDHSFGFLAPVMAVALGARVIEKHFMLDKSIE
SEDSKFSLDFDEFKAMVDAVRQAESALGDGKLDLDEKVLKNRVFARSLYASKDIKKGEMFSEENVKSVRPSFGLHPKFYQ
ELLGKKASKDIKFGDALKQGDFQ
>P10031 ~~~psiB~~~Protein PsiB~~~
MKTELTLNVLQTMNAQEYEDIRAAGSDERRELTHAVMRELDAPDNWTMNGEYGSEFGGFFPVQVRFTPAHERFHLALCSP
GDVSQVWVLVLVNAGGEPFAVVQVQRRFASEAVSHSLALAASLDTQGYSVNDIIHISMAEGGQV
>P0A7C8 ~~~psiE~~~Protein PsiE~~~COG3223
MTSLSRPRVEFISTILQTVLNLGLLCLGLILVVFLGKETVHLADVLFAPEQTSKYELVEGLVVYFLYFEFIALIVKYFQS
GFHFPLRYFVYIGITAIVRLIIVDHKSPLDVLIYSAAILLLVITLWLCNSKRLKRE
>P0A279 ~~~psiE~~~Protein PsiE~~~
MMPLSRSRLEFIATILQNVLNLGLLTLGLILVVFLGKETVHLADALFAPEQASKYELVEGLVIYFLYFEFIALIVKYFKS
GLHFPLRYFVYIGITAIVRLIIVDHKTPMDVLLYSAAILLLVITLWLCNSNRLRRE
>P0AFM4 ~~~psiF~~~Phosphate starvation-inducible protein PsiF~~~
MKITLLVTLLFGLVFLTTVGAAERTLTPQQQRMTSCNQQATAQALKGDARKTYMSDCLKNSKSAPGEKSLTPQQQKMREC
NNQATQQSLKGDDRNKFMSACLKKAA
>Q31KC7 2.7.1.47~~~PSK~~~D-ribulose kinase~~~COG1070
MVVALGLDFGTSGARAIACDFDSDRSVSVSVTFPKTSQNWPQVWREALWQLLTQIPADWRSRIERIAIDGTSGTVLLCDR
EGQPQTEPLLYNQACPIDLADLADWVPADHAALSSTSSLAKLWFWQQQFGALPPDWQILAQADWLSLQLHGCSQQSDYHN
ALKLGYSPDRERFSKNLLDSELGALLPVVHEPGVAIGPILPAIAQEFGLSPDCQICAGTTDSIAAFLASGAHQPGEAVTS
LGSTIVLKLLSQVAVSDRLTGVYSHKLGGYWLTGGASNCGGATLRQFFPDTELESLSCQIDPTKKSGLDYYPLPSRGERF
PIADPDRLPQLEPRPENPVQFLQGLLEGLTQVETLGYQRLQDLGATPLKRIWTAGGGAKNAVWQQLRQQAIGVPIAIAPN
TEAAFGTARLAAFGLAAFHSAGLKRT
>O06739 4.4.1.19~~~yitD~~~Phosphosulfolactate synthase~~~COG1809
MNDFSLELPVRTNKPRETGQSILIDNGYPLQFFKDAIAGASDYIDFVKFGWGTSLLTKDLEEKISTLKEHDITFFFGGTL
FEKYVSQKKVNEFHRYCTYFGCEYIEISNGTLPMTNKEKAAYIADFSDEFLVLSEVGSKDAELASRQSSEEWLEYIVEDM
EAGAEKVITEARESGTGGICSSSGDVRFQIVDDIISSDIDINRLIFEAPNKTLQQGFIQKIGPNVNLANIPFHDAIALET
LRLGLRSDTFFL
>P0C7Y0 ~~~psmA1~~~Phenol-soluble modulin alpha 1 peptide~~~
MGIIAGIIKVIKSLIEQFTGK
>P0C7Y1 ~~~psmA1~~~Phenol-soluble modulin alpha 1 peptide~~~
MGIIAGIIKVIKSLIEQFTGK
>A9JX05 ~~~psmA1~~~Phenol-soluble modulin alpha 1 peptide~~~
MGIIAGIIKVIKSLIEQFTGK
>P0C7Z2 ~~~psmA2~~~Phenol-soluble modulin alpha 2 peptide~~~
MGIIAGIIKFIKGLIEKFTGK
>A9JX06 ~~~psmA2~~~Phenol-soluble modulin alpha 2 peptide~~~
MGIIAGIIKFIKGLIEKFTGK
>P0C804 ~~~psmA3~~~Phenol-soluble modulin alpha 3 peptide~~~
MEFVAKLFKFFKDLLGKFLGNN
>P0C805 ~~~psmA3~~~Phenol-soluble modulin alpha 3 peptide~~~
MEFVAKLFKFFKDLLGKFLGNN
>A9JX07 ~~~psmA3~~~Phenol-soluble modulin alpha 3 peptide~~~
MEFVAKLFKFFKDLLGKFLGNN
>P0C817 ~~~psmA4~~~Phenol-soluble modulin alpha 4 peptide~~~
MAIVGTIIKIIKAIIDIFAK
>A9JX08 ~~~psmA4~~~Phenol-soluble modulin alpha 4 peptide~~~
MAIVGTIIKIIKAIIDIFAK
>P54617 ~~~ydjF~~~Phage shock protein A homolog~~~COG1842
MSIIGRFKDIMSANINALLDKAENPEKMVDQYLRNMNSDLAKVKAETAAVMAEEQRAKREYHECQADMEKMESYAMKALQ
AGNESDARKFLERKTSLESKLSELQAANQIAATNAAQMRKMHDKLVSDIGELEARKNMIKAKWAVAKTQERMNKLGASVS
STSQSMSAFGRMEDKVNKALDQANAMAELNSAPQDDMADLSAKYDTGGSSQVDDELAALKAKMMLDK
>Q9RUB7 ~~~~~~Phage shock protein A homolog~~~COG1842
MSIFDRLSRLLRANVNDMISKAEDPAKIIDQALRDMRSAYADARNEVAGAMAQAAKLEREAGTNSKLAAEYEKKAEEALR
GGSEDLAREALRRAQNHKDLAKGFDEQRTVQQSTVDQLKTQLRALEAKIDEMESKKTLLAARQKTAQAGETLDRVSGFSK
AGGAMDAFNEMEQKVAGMEDRNKAMGELRNDQDFDAQLKDLGRDKDVDDALAALKAKVQSSNQ
>P0AFM6 ~~~pspA~~~Phage shock protein A~~~COG1842
MGIFSRFADIVNANINALLEKAEDPQKLVRLMIQEMEDTLVEVRSTSARALAEKKQLTRRIEQASAREVEWQEKAELALL
KEREDLARAALIEKQKLTDLIKSLEHEVTLVDDTLARMKKEIGELENKLSETRARQQALMLRHQAANSSRDVRRQLDSGK
LDEAMARFESFERRIDQMEAEAESHSFGKQKSLDDQFAELKADDAISEQLAQLKAKMKQDNQ
>D3DFG8 3.1.3.3~~~pspA~~~Phosphoserine phosphatase 1~~~COG0406
MVKLILVRHAESEWNPVGRYQGLLDPDLSERGKKQAKLLAQELSREHLDVIYSSPLKRTYLTALEIAEAKNLEVIKEDRI
IEIDHGMWSGMLVEEVMEKYPEDFRRWVEEPHKVEFQGGESLASVYNRVKGFLEEVRKRHWNQTVVVVSHTVPMRAMYCA
LLGVDLSKFWSFGCDNASYSVIHMEERRNVILKLNITCHLGEFYVEAHKAI
>P9WHP5 ~~~pspA~~~PspA protein~~~COG1842
MANPFVKAWKYLMALFSSKIDEHADPKVQIQQAIEEAQRTHQALTQQAAQVIGNQRQLEMRLNRQLADIEKLQVNVRQAL
TLADQATAAGDAAKATEYNNAAEAFAAQLVTAEQSVEDLKTLHDQALSAAAQAKKAVERNAMVLQQKIAERTKLLSQLEQ
AKMQEQVSASLRSMSELAAPGNTPSLDEVRDKIERRYANAIGSAELAESSVQGRMLEVEQAGIQMAGHSRLEQIRASMRG
EALPAGGTTATPRPATETSGGAIAEQPYGQ
>P0AFM9 ~~~pspB~~~Phage shock protein B~~~
MSALFLAIPLTIFVLFVLPIWLWLHYSNRSGRSELSQSEQQRLAQLADEAKRMRERIQALESILDAEHPNWRDR
>D3DFP8 3.1.3.3~~~pspB~~~Putative phosphoserine phosphatase 2~~~COG0406
MKRLYLVRHAQSEYNEKGIFQGRLDSDLTPLGFVQARLLAREFLKKKVDIIYSSPQRRAYKTALTISDMLGTQLVVDERL
REMSFGEYEGKHFWSMLEAHKDVFLNWLSNPVKHPLPTQESMEEFEKRVRSFLEDVKSSHYQNMLIVAHGGTLHAIVCLL
TGIGLENLWNIHMDNAGITEIHMEGEKSTLVYLNKLCHTRQLT
>P0AFN2 ~~~pspC~~~Phage shock protein C~~~COG1983
MAGINLNKKLWRIPQQGMVRGVCAGIANYFDVPVKLVRILVVLSIFFGLALFTLVAYIILSFALDPMPDNMAFGEQLPSS
SELLDEVDRELAASETRLREMERYVTSDTFTLRSRFRQL
>P0AFV8 ~~~pspD~~~Phage shock protein D~~~
MNTRWQQAGQKVKPGFKLAGKLVLLTALRYGPAGVAGWAIKSVARRPLKMLLAVALEPLLSRAANKLAQRYKR
>P23857 2.8.1.1~~~pspE~~~Thiosulfate sulfurtransferase PspE~~~COG0607
MFKKGLLALALVFSLPVFAAEHWIDVRVPEQYQQEHVQGAINIPLKEVKERIATAVPDKNDTVKVYCNAGRQSGQAKEIL
SEMGYTHVENAGGLKDIAMPKVKG
>P37344 ~~~pspF~~~Psp operon transcriptional activator~~~COG1221
MAEYKDNLLGEANSFLEVLEQVSHLAPLDKPVLIIGERGTGKELIASRLHYLSSRWQGPFISLNCAALNENLLDSELFGH
EAGAFTGAQKRHPGRFERADGGTLFLDELATAPMMVQEKLLRVIEYGELERVGGSQPLQVNVRLVCATNADLPAMVNEGT
FRADLLDRLAFDVVQLPPLRERESDIMLMAEYFAIQMCREIKLPLFPGFTERARETLLNYRWPGNIRELKNVVERSVYRH
GTSDYPLDDIIIDPFKRRPPEDAIAVSETTSLPTLPLDLREFQMQQEKELLQLSLQQGKYNQKRAAELLGLTYHQFRALL
KKHQI
>P32696 ~~~pspG~~~Phage shock protein G~~~
MLELLFVIGFFVMLMVTGVSLLGIIAALVVATAIMFLGGMLALMIKLLPWLLLAIAVVWVIKAIKAPKVPKYQRYDRWRY
>P94512 3.1.3.3~~~serB~~~Phosphoserine phosphatase~~~COG1011
MKAVFFDLDDTLLWDEKSVRTTFAETCLQAEKKYGLAPEEFEAAVREAARELYMSYETYPYTVMIGINPFEGLWSNFSEP
ISEGFQKLNKIVPEYRRNAWTNGLKALGIDDPAYGEYLGEFFAAERRKRPFVYDETFAVLDQLKGKYELLLLTNGDPSLQ
KEKLAGVPELAPYFNEIVISGAFGKGKPDVSIFEHCLKLMNIEKDDAIMVGDNLNTDILGASRAGIKTVWINRTDKKNET
DVKPDYIISSLHDLFPILEK
>Q72H00 3.1.3.3~~~~~~Phosphoserine phosphatase~~~COG1011
MKLLLLDLDDTLLQDLPVSRAVLEDLGRKAGVEGFFARVKARAEALFREAPFYPWAEAIGHSALEALWARYSTPGLEALA
AWAGPFRERVFREALEEAGGAPERARELAEAFFRERRRYPLYPEAEAFLAEARRRGLALALLTNGVPDLQREKLVGAGLA
HHFSLVLISGEVGIGKPDPRLFRMALCAFGVAPEEAAMVGDNPQKDVRGARLAGVRAVWVDRGLRPEDPEASPDLRVGDL
REVFLAEAL
>P31075 ~~~psrA~~~Polysulfide reductase chain A~~~COG0243
METTMTRRDFLKSAGAAGAAGLVWSQTIPGTLGALEKQEIKGSAKFVPSICEMCTSSCTIEARVEGDKGVFIRGNPKDKS
RGGKVCARGGSGFNQLYDPQRLVKPIMRVGERGEGKWKEVSWDEAYTFIAKKLDEIKQKHGAHTVAFTARSGWNKTWFHH
LAQAYGSPNIFGHESTCPLAYNMAGRDVFGGSMNRDFAKAKYIINMGHNVFEGIVISYVRQYMEAIENGAKVVTLEPRLS
VMAQKASEWHAIKPGHDLPFVLGFMHTLIFENLYDKKFVQKYCTGFEELKASIEPCTPEKMALECDIPADTIKRLAREFA
KAAPKAIFDFGHRVTFTPQELELRRAMMMVNALVGNIERDGGMYFGKNASFYNQFLGEEDPKAKGLKKPKTPAYPKVEVP
RIDRIGEKDGEFFLANKGEGIVSLVPKATLNELPGVPCKIHGWFIVRNNPVMTQTNADTVIKALKSMDLVVCVDIQVSDT
AWFADVVLPDTTYLERDEEFTAGGGKNPSFGIGRQKVVEPLGDAKPGWKIAKELSEKMGLGEYFPWKDIEDYRLQQVDGD
LDLLAKLKKDGSASFGVPLMLQEKKSVAEFVKKFPGAASKVNEEGLIDFPKKIQLFSPKLEEVSGKGGLGYEPFKYKEED
ELYFVQGKTPVRSNSHTGNVPWLNNLMEYDAIWIHPKTASKLGIKNGDAIELYNKFSSQKSKALITEGVREDTLFGYFGF
GHVSKDLKRAYGKGVNSNALMPSFTSPNSGMDLHVFGVKVKKA
>P0A8A4 2.7.11.33~~~ppsR~~~Phosphoenolpyruvate synthase regulatory protein~~~COG1806
MDNAVDRHVFYISDGTAITAEVLGHAVMSQFPVTISSITLPFVENESRARAVKDQIDAIYHQTGVRPLVFYSIVLPEIRA
IILQSEGFCQDIVQALVAPLQQEMKLDPTPIAHRTHGLNPNNLNKYDARIAAIDYTLAHDDGISLRNLDQAQVILLGVSR
CGKTPTSLYLAMQFGIRAANYPFIADDMDNLVLPASLKPLQHKLFGLTIDPERLAAIREERRENSRYASLRQCRMEVAEV
EALYRKNQIPWINSTNYSVEEIATKILDIMGLSRRMY
>A0A0H2URK1 ~~~psrP~~~Pneumococcal serine-rich repeat protein~~~
MTETVEDKVSHSITGLDILKGIVAAGAVISGTVATQTKVFTNESAVLEKTVEKTDALATNDTVVLGTISTSNSASSTSLS
ASESASTSASESASTSASTSASTSASESASTSASTSISASSTVVGSQTAAATEATAKKVEEDRKKPASDYVASVTNVNLQ
SYAKRRKRSVDSIEQLLASIKNAAVFSGNTIVNGAPAINASLNIAKSETKVYTGEGVDSVYRVPIYYKLKVTNDGSKLTF
TYTVTYVNPKTNDLGNISSMRPGYSIYNSGTSTQTMLTLGSDLGKPSGVKNYITDKNGRQVLSYNTSTMTTQGSGYTWGN
GAQMNGFFAKKGYGLTSSWTVPITGTDTSFTFTPYAARTDRIGINYFNGGGKVVESSTTSQSLSQSKSLSVSASQSASAS
ASTSASASASTSASASASTSASASASTSASVSASTSASASASTSASASASTSASESASTSASASASTSASASASTSASAS
ASTSASESASTSASASASTSASESASTSASASASTSASASASTSASGSASTSTSASASTSASASASTSASASASISASES
ASTSASESASTSTSASASTSASESASTSASASASTSASASASTSASASASTSASASTSASESASTSASASASTSASASAS
TSASASASTSASASASTSASVSASTSASASASTSASASASTSASESASTSASASASTSASASASTSASASASTSASASAS
TSASASASTSASESASTSASASASTSASASASTSASASASTSASASASTSASASASISASESASTSASASASTSASASAS
TSASASASTSASESASTSASASASTSASASASTSASASASTSASASASTSASASASTSASASASTSASESASTSASASAS
TSASESASTSASASASTSASASASTSASASASTSASASASTSASASASTSASASASTSASASTSASESASTSASASASTS
ASASASTSASASASTSASESASTSASASASTSASASASTSASASASTSASASASTSASASASISASESASTSASASASTS
ASVSASTSASASASTSASESASTSASASASTSASESASTSASASASTSASASASISASESASTSASASASTSASASASTS
ASASASTSASESASTSTSASASTSASESASTSASASASTSASASASTSASASASTSASASASTSASASTSASESASTSAS
ASASTSASASASTSASASASTSASASASTSASASASTSASASASTSASASASTSASASASTSASESASTSASASASTSAS
ASASTSASASASTSASASASTSASVSASTSASESASTSASASASTSASASASTSASESASTSASASASTSASESASTSAS
ASASTSASASASTSASASASTSASASASTSASASASTSASASASTSASASASTSASASASTSASASASTSASASASTSAS
ASASTSASASASTSASASASISASESASTSASASASTSASASASTSASVSASTSASASASTSASASASISASESASTSAS
ASASTSASASASTSASASASTSASASASISASESASTSASASASTSASASASTSASASASTSASASASTSASASASTSAS
ASASTSASASASTSASASASTSASASASTSASESASTSASASASTSASASASTSASASASTSASVSASTSASESASTSAS
ASASTSASASASTSASASASTSASESASTSASASASTSASASASTSASESASTSASASASTSASASASTSASASASTSAS
ASASASTSASASASTSASASASTSASASASISASESASTSASESASTSTSASASTSASESASTSASASASTSASASASTS
ASASASTSASASTSASESASTSASASASTSASASASTSASASASTSASASASTSASASASTSASVSASTSASASASTSAS
ASASTSASESASTSASASASTSASASASTSASASASTSASASASTSASASASTSASESASTSASASASTSASASASTSAS
ASASTSASASASTSASASASISASESASTSASASASTSASASASTSASASASTSASESASTSASASASTSASASASTSAS
ASASTSASASASTSASASASTSASASASTSASESASTSASASASTSASESASTSASASASTSASASASTSASASASTSAS
ASASTSASASASTSASASASTSASASTSASESASTSASASASTSASASASTSASASASTSASESASTSASASASTSASAS
ASTSASASASTSASASASTSASASASISASESASTSASASASTSASVSASTSASASASTSASESASTSASASASTSASES
ASTSASASASTSASASASISASESASTSASASASTSASASASTSASASASTSASESASTSTSASASTSASESASTSASAS
ASTSASASASTSASASASTSASASASTSASASTSASESASTSASASASTSASASASTSASASASTSASASASTSASASAS
TSASASASTSASASASTSASASASTSASASASTSASASASTSASASASTSASESASTSASASASTSASASASTSASASAS
TSASVSASTSASESASTSASASASTSASASASTSASASASTSASESASTSASASASTSASASASTSASESASTSASASAS
TSASASASTSASASASTSASASASASTSASASASTSASASASTSASASASISASESASTSASASASASTSASASASTSAS
ASASTSASASASISASESASTSASESASTSTSASASTSASESASTSASASASTSASASASTSASASASTSASASTSASES
ASTSASASASTSASASASTSASASASTSASASASTSASASASTSASVSASTSASASASTSASASASTSASESASTSASAS
TSASESASTSASASASTSASASASTSASASASTSASESASTSASASASTSASASASTSASESASTSASASASTSASASAS
TSASASASTSASESASTSASASASTSASESASTSASASASTSASASASTSASGSASTSTSASASTSASASASTSASASAS
ISASESASTSASESASTSTSASASTSASESASTSASASASTSASASASTSASASASTSASASTSASESASTSASASASTS
ASASASTSASASASTSASASASTSASVSASTSASASASTSASASASTSASESASTSASASASTSASASASTSASASASTS
ASASASTSASASASTSASESASTSASASASTSASASASTSASASASTSASASASTSASASASISASESASTSASASASTS
ASASASTSASASASTSASESASTSASASASTSASASASTSASASASTSASASASTSASASASTSASASASTSASESASTS
ASASASTSASESASTSASASASTSASASASTSASASASTSASASASTSASASASTSASASASTSASASTSASESASTSAS
ASASTSASASASTSASASASTSASESASTSASASASTSASASASTSASASASTSASASASTSASASASISASESASTSAS
ASASTSASVSASTSASASASTSASESASTSASASASTSASESASTSASASASTSASASASISASESASTSASASASTSAS
ASASTSASASASTSASESASTSTSASASTSASESASTSASASASTSASASASTSASASASTSASASASTSASASTSASES
ASTSASASASTSASASASTSASASASTSASASASTSASASASTSASASASTSASASASTSASASASTSASESASTSASAS
ASTSASASASTSASASASTSASASASTSASVSASTSASESASTSASASASTSASASASTSASESASTSASASASTSASES
ASTSASASASTSASASASTSASASASTSASASASTSASASASTSASASASTSASASASTSASASASTSASASASTSASAS
ASTSASASASTSASASASTSASASASISASESASTSASASASTSASASASTSASVSASTSASASASTSASASASISASES
ASTSASASASTSASASASTSASASASTSASASASISASESASTSASASASTSASASASTSASASASTSASASASTSASAS
ASTSASASASTSASASASTSASASASTSASASASTSASESASTSASASASTSASASASISASESASTSASASASTSASAS
ASTSASASASTSASESASTSTSASASTSASESASTSASASASTSASASASTSASASASTSASASASTSASASTSASESAS
TSASASASTSASASASTSASASASTSASASASTSASASASTSASASASTSASASASTSASASASTSASESASTSASASAS
TSASASASTSASASASTSASASASTSASVSASTSASESASTSASASASTSASASASTSASESASTSASASASTSASESAS
TSASASASTSASASASTSASASASTSASASASTSASASASTSASASASTSASASASTSASASASTSASASASTSASASAS
TSASASASTSASASASTSASASASISASESASTSASASASTSASASASTSASVSASTSASASASTSASASASISASESAS
TSASASASTSASASASTSASASASTSASASASISASESASTSASASASTSASASASTSASASASTSASASASTSASASAS
TSASASASTSASASASTSASASASTSASASASTSASASASTSASASASTSASASASTSASASASTSASASASTSVSNSAN
HSNSQVGNTSGSTGKSQKELPNTGTESSIGSVLLGVLAAVTGIGLVAKRRKRDEEE
>Q9FCP2 2.5.1.98~~~pssM~~~Exopolysaccharide glucosyl ketal-pyruvate-transferase~~~
MKMFAYRGKHENFGDELNHWLWERLLPGFFDEDESQLFLGIGSILYDNFDPNMQKIVFGSGYGGYTNPPKVDGNWTFYFV
RGKKTAEVLGIDPSYAIGDSGILTRSCWDAKSIEKRYPVSFMPHYESAMYGSWDKVCELAGIHYIDPRWPVEKVLTEISA
SHKVVSEAMHGCIISDALRVPWRAIRPIAPGNRAKWYDWASALDLEIDFDPIGPSNVVEAGASLVRNNTYLLKNITFRHR
RIRQLTGNYVFGSTVKTLQRVAEKPGQLSSDESMVNAHNRMLLELDRLKQDFSKKTASVL
>Q9ABR0 2.7.8.31~~~pssY~~~UDP-glucose:undecaprenyl-phosphate glucose-1-phosphate transferase~~~COG2148
MKEQGLPAVNIIACASPLLCTTTSSDCVSVWACELEQPLSDALGKLKDHPVDLGPSTQLLTLRPGGPLPRVAVDLRKRVL
DVVAAALLTALFAPLLLLAALAIKLESPGPALFRQTRGGLGGAPFQILKLRTMHCREDGPDVAQAQRGDDRVTRVGRILR
AASIDELPQLLNVLRGDMSLVGPRPHATAHDDYYSARIPEYAARYQARPGLTGLAQVRGLRGGTETVELMRQRIAADIDY
IQTWSLWRDLKIVLRTVPSLLTTDNAY
>P23830 2.7.8.8~~~pssA~~~CDP-diacylglycerol--serine O-phosphatidyltransferase~~~COG1502
MLSKFKRNKHQQHLAQLPKISQSVDDVDFFYAPADFRETLLEKIASAKQRICIVALYLEQDDGGKGILNALYEAKRQRPE
LDVRVLVDWHRAQRGRIGAAASNTNADWYCRMAQENPGVDVPVYGVPINTREALGVLHFKGFIIDDSVLYSGASLNDVYL
HQHDKYRYDRYHLIRNRKMSDIMFEWVTQNIMNGRGVNRLDDVNRPKSPEIKNDIRLFRQELRDAAYHFQGDADNDQLSV
TPLVGLGKSSLLNKTIFHLMPCAEQKLTICTPYFNLPAILVRNIIQLLREGKKVEIIVGDKTANDFYIPEDEPFKIIGAL
PYLYEINLRRFLSRLQYYVNTDQLVVRLWKDDDNTYHLKGMWVDDKWMLITGNNLNPRAWRLDLENAILIHDPQLELAPQ
REKELELIREHTTIVKHYRDLQSIADYPVKVRKLIRRLRRIRIDRLISRIL
>P44704 2.7.8.8~~~pssA~~~CDP-diacylglycerol--serine O-phosphatidyltransferase~~~COG1502
MLINKTKRAEQNLNNLPFLALQAEQIEFLGSSAEFKTQIIELIRNAKKRIYVTALYWQKDEAGQEILDEIYRVKQENPHL
DVKVLIDWHRAQRNLLGAEKSATNADWYCEQRQTYQLPDDPNMFFGVPINTREVFGVLHVKGFVFDDTVLYSGASINNVY
LHQFEKYRYDRYQKITHAELADSMVNFINDYLLDFSAVYPLDVTNRPRTKEIRGNIRAYRKDLAQNGEYSLKSAVKLPNV
LSVSPLFGLGASGNELNQVIEDLFLQVQKKLVICTPYFNFPRTLQHKIATLLENGKRVEIIVGDKVANDFYIPPEQPFKM
AGALPYLYESNLRRFCEKFETQIESGQLVVRLWRDGDNTYHLKGVWVDDRYILLTGNNLNPRAWRLDAENGLLIYDPQQQ
LLAQVEKEQNQIRQHTKVLKHYTELEELNQYPEPVQKLLKKFARIKADKLVKMIL
>P9WG11 ~~~pstA1~~~Phosphate transport system permease protein PstA 1~~~COG0581
MSPSMSIEALDQPVKPVVFRPLTLRRRIKNSVATTFFFTSFVVALIPLVWLLWVVIARGWFAVTRSGWWTHSLRGVLPEQ
FAGGVYHALYGTLVQAGVAAVLAVPLGLMTAVYLVEYGTGRMSRVTTFTVDVLAGVPSIVAALFVFSLWIATLGFQQSAF
AVALALVLLMLPVVVRAGEEMLRLVPDELREASYALGVPKWKTIVRIVAPIAMPGIVSGILLSIARVVGETAPVLVLVGY
SHSINLDVFHGNMASLPLLIYTELTNPEHAGFLRVWGAALTLIIVVATINLAAAMIRFVATRRRRLPL
>P9WG09 ~~~pstA2~~~Phosphate transport system permease protein PstA 2~~~COG0581
MGESAESGSRQLPAMSPPRRSVAYRRKIVDALWWAACVCCLAVVITPTLWMLIGVVSRAVPVFHWSVLVQDSQGNGGGLR
NAIIGTAVLAIGVILVGGTVSVLTGIYLSEFATGKTRSILRGAYEVLSGIPSIVLGYVGYLALVVYFDWGFSLAAGVLVL
SVMSIPYIAKATESALAQVPTSYREAAEALGLPAGWALRKIVLKTAMPGIVTGMLVALALAIGETAPLLYTAGWSNSPPT
GQLTDSPVGYLTYPIWTFYNQPSKSAQDLSYDAALLLIVFLLLLIFIGRLINWLSRRRWDV
>P07654 ~~~pstA~~~Phosphate transport system permease protein PstA~~~COG0581
MAMVEMQTTAALAESRRKMQARRRLKNRIALTLSMATMAFGLFWLIWILMSTITRGIDGMSLALFTEMTPPPNTEGGGLA
NALAGSGLLILWATVFGTPLGIMAGIYLAEYGRKSWLAEVIRFINDILLSAPSIVVGLFVYTIVVAQMEHFSGWAGVIAL
ALLQVPIVIRTTENMLKLVPYSLREAAYALGTPKWKMISAITLKASVSGIMTGILLAIARIAGETAPLLFTALSNQFWST
DMMQPIANLPVTIFKFAMSPFAEWQQLAWAGVLIITLCVLLLNILARVVFAKNKHG
>P9WQL1 7.3.2.1~~~pstB1~~~Phosphate import ATP-binding protein PstB 1~~~COG1117
MAKRLDLTDVNIYYGSFHAVADVSLAILPRSVTAFIGPSGCGKTTVLRTLNRMHEVIPGARVEGAVLLDDQDIYAPGIDP
VGVRRAIGMVFQRPNPFPAMSIRNNVVAGLKLQGVRNRKVLDDTAESSLRGANLWDEVKDRLDKPGGGLSGGQQQRLCIA
RAIAVQPDVLLMDEPCSSLDPISTMAIEDLISELKQQYTIVIVTHNMQQAARVSDQTAFFNLEAVGKPGRLVEIASTEKI
FSNPNQKATEDYISGRFG
>P9WQK9 7.3.2.1~~~pstB2~~~Phosphate import ATP-binding protein PstB 2~~~COG1117
MACERLGGQSGAADVDAAAPAMAAVNLTLGFAGKTVLDQVSMGFPARAVTSLMGPTGSGKTTFLRTLNRMNDKVSGYRYS
GDVLLGGRSIFNYRDVLEFRRRVGMLFQRPNPFPMSIMDNVLAGVRAHKLVPRKEFRGVAQARLTEVGLWDAVKDRLSDS
PFRLSGGQQQLLCLARTLAVNPEVLLLDEPTSALDPTTTEKIEEFIRSLADRLTVIIVTHNLAQAARISDRAALFFDGRL
VEEGPTEQLFSSPKHAETARYVAGLSGDVKDAKRGN
>P0A2V9 7.3.2.1~~~pstB3~~~Phosphate import ATP-binding protein PstB 3~~~COG1117
MGTFSVRHLDLFYGDFQALKNISIQLPERQITALIGPSGCGKSTFLKTLNRMNDLVPSCHIEGQVLLDEQDIYSSKFNLN
QLRKRVGMVFQQPNPFAMSIYDNVAYGPRTHGIRDKKQLDALVEKSLKGAAIWEEVKDDLKKSAMSLSGGQQQRLCIARA
LAVEPDILLMDEPTSALDPISTLKIEDLIQQLKKDYTIIIVTHNMQQASRISDKTAFFLTGEICEFGDTVDVFTNPKDQR
TEDYISGRFG
>B8GYG4 7.3.2.1~~~pstB~~~Phosphate import ATP-binding protein PstB~~~
MTVQSPDDSTRAPATSTAAPATADPKIKARGVKVFYGDKQALFDVDLDIPAKSVTAFIGPSGCGKSTFLRCINRMNDTIP
SARVEGSILIDGADVNAKSVDPVVLRSRVGMVFQKPNPFPKTIFENVAYGPRIHGLATGKAELEAIVESSLKKAGLWNEV
ADRLHQPGTGLSGGQQQRLVIARAIAVSPEVILMDEPCSALDPIATAKIEELIDELRSQFCIVIVTHSMAQAARVSQRTA
FFHLGKLVESGPTEEMFTNPRDSRTQDYITGRFG
>P0AAH0 7.3.2.1~~~pstB~~~Phosphate import ATP-binding protein PstB~~~COG1117
MSMVETAPSKIQVRNLNFYYGKFHALKNINLDIAKNQVTAFIGPSGCGKSTLLRTFNKMFELYPEQRAEGEILLDGDNIL
TNSQDIALLRAKVGMVFQKPTPFPMSIYDNIAFGVRLFEKLSRADMDERVQWALTKAALWNETKDKLHQSGYSLSGGQQQ
RLCIARGIAIRPEVLLLDEPCSALDPISTGRIEELITELKQDYTVVIVTHNMQQAARCSDHTAFMYLGELIEFSNTDDLF
TKPAKKQTEDYITGRYG
>P9WG07 ~~~pstC1~~~Phosphate transport system permease protein PstC 1~~~COG0573
MLARAGEVGRAGPAIRWLGGIGAVIPLLALVLVLVVLVIEAMGAIRLNGLHFFTATEWNPGNTYGETVVTDGVAHPVGAY
YGALPLIVGTLATSAIALIIAVPVSVGAALVIVERLPKRLAEAVGIVLELLAGIPSVVVGLWGAMTFGPFIAHHIAPVIA
HNAPDVPVLNYLRGDPGNGEGMLVSGLVLAVMVVPIIATTTHDLFRQVPVLPREGAIALGMSNWECVRRVTLPWVSSGIV
GAVVLGLGRALGETMAVAMVSGAVLGAMPANIYATMTTIAATIVSQLDSAMTDSTNFAVKTLAEVGLVLMVITLLTNVAA
RGMVRRVSRTALPVGRGI
>P9WG05 ~~~pstC2~~~Phosphate transport system permease protein PstC 2~~~COG0573
MVTEPLTKPALVAVDMRPARRGERLFKLAASAAGSTIVIAILLIAIFLLVRAVPSLRANHANFFTSTQFDTSDDEQLAFG
VRDLFMVTALSSITALVLAVPVAVGIAVFLTHYAPRRLSRPFGAMVDLLAAVPSIIFGLWGIFVLAPKLEPIARFLNRNL
GWLFLFKQGNVSLAGGGTIFTAGIVLSVMILPIVTSISREVFRQTPLIQIEAALALGATKWEVVRMTVLPYGRSGVVAAS
MLGLGRALGETVAVLVILRSAARPGTWSLFDGGYTFASKIASAASEFSEPLPTGAYISAGFALFVLTFLVNAAARAIAGG
KVNG
>P0AGH8 ~~~pstC~~~Phosphate transport system permease protein PstC~~~COG0573
MAATKPAFNPPGKKGDIIFSVLVKLAALIVLLMLGGIIVSLIISSWPSIQKFGLAFLWTKEWDAPNDIYGALVPIYGTLV
TSFIALLIAVPVSFGIALFLTELAPGWLKRPLGIAIELLAAIPSIVYGMWGLFIFAPLFAVYFQEPVGNIMSNIPIVGAL
FSGPAFGIGILAAGVILAIMIIPYIAAVMRDVFEQTPVMMKESAYGIGCTTWEVIWRIVLPFTKNGVIGGIMLGLGRALG
ETMAVTFIIGNTYQLDSASLYMPGNSITSALANEFAEAESGLHVAALMELGLILFVITFIVLAASKFMIMRLAKNEGAR
>P9WHW5 3.1.3.16~~~pstP~~~Serine/threonine protein phosphatase PstP~~~COG0631
MARVTLVLRYAARSDRGLVRANNEDSVYAGARLLALADGMGGHAAGEVASQLVIAALAHLDDDEPGGDLLAKLDAAVRAG
NSAIAAQVEMEPDLEGMGTTLTAILFAGNRLGLVHIGDSRGYLLRDGELTQITKDDTFVQTLVDEGRITPEEAHSHPQRS
LIMRALTGHEVEPTLTMREARAGDRYLLCSDGLSDPVSDETILEALQIPEVAESAHRLIELALRGGGPDNVTVVVADVVD
YDYGQTQPILAGAVSGDDDQLTLPNTAAGRASAISQRKEIVKRVPPQADTFSRPRWSGRRLAFVVALVTVLMTAGLLIGR
AIIRSNYYVADYAGSVSIMRGIQGSLLGMSLHQPYLMGCLSPRNELSQISYGQSGGPLDCHLMKLEDLRPPERAQVRAGL
PAGTLDDAIGQLRELAANSLLPPCPAPRATSPPGRPAPPTTSETTEPNVTSSPASPSPTTSAPAPTGTTPAIPTSASPAA
PASPPTPWPVTSSPTMAALPPPPPQPGIDCRAAA
>A0A0H3M950 ~~~pstS1~~~Phosphate-binding protein PstS1~~~
MKIRLHTLLAVLTAAPLLLAAAGCGSKPPSGSPETGAGAGTVATTPASSPVTLAETGSTLLYPLFNLWGPAFHERYPNVT
ITAQGTGSGAGIAQAAAGTVNIGASDAYLSEGDMAAHKGLMNIALAISAQQVNYNLPGVSEHLKLNGKVLAAMYQGTIKT
WDDPQIAALNPGVNLPGTAVVPLHRSDGSGDTFLFTQYLSKQDPEGWGKSPGFGTTVDFPAVPGALGENGNGGMVTGCAE
TPGCVAYIGISFLDQASQRGLGEAQLGNSSGNFLLPDAQSIQAAAAGFASKTPANQAISMIDGPAPDGYPIINYEYAIVN
NRQKDAATAQTLQAFLHWAITDGNKASFLDQAHFQPLPPAVVKLSDALIATISS
>P9WGU1 ~~~pstS1~~~Phosphate-binding protein PstS 1~~~COG0226
MKIRLHTLLAVLTAAPLLLAAAGCGSKPPSGSPETGAGAGTVATTPASSPVTLAETGSTLLYPLFNLWGPAFHERYPNVT
ITAQGTGSGAGIAQAAAGTVNIGASDAYLSEGDMAAHKGLMNIALAISAQQVNYNLPGVSEHLKLNGKVLAAMYQGTIKT
WDDPQIAALNPGVNLPGTAVVPLHRSDGSGDTFLFTQYLSKQDPEGWGKSPGFGTTVDFPAVPGALGENGNGGMVTGCAE
TPGCVAYIGISFLDQASQRGLGEAQLGNSSGNFLLPDAQSIQAAAAGFASKTPANQAISMIDGPAPDGYPIINYEYAIVN
NRQKDAATAQTLQAFLHWAITDGNKASFLDQVHFQPLPPAVVKLSDALIATISS
>Q97Q31 ~~~pstS1~~~Phosphate-binding protein PstS 1~~~COG0226
MKKRKKLALSLIAFWLTACLVGCASWIDRGESITAVGSTALQPLVEVAADEFGTIHVGKTVNVQGGGSGTGLSQVQSGAV
DIGNSDVFAEEKDGIDASALVDHKVAVAGLALIVNKEVDVDNLTTEQLRQIFIGEVTNWKEVGGKDLPISVINRAAGSGS
RATFDTVIMEGQSAMQSQEQDSNGAVKSIVSKSPGAISYLSLTYIDDSVKSMKLNGYDLSPENISSNNWPLWSYEHMYTL
GQPNELAAEFLNFVLSDETQEGIVKGLKYIPIKEMKVEKDAAGTVTVLEGRQ
>Q8DPB1 ~~~pstS1~~~Phosphate-binding protein PstS 1~~~COG0226
MKKRKKLALSLIAFWLTACLVGCASWIDRGESITAVGSTALQPLVEVAADEFGTIHVGKTVNVQGGGSGTGLSQVQSGAV
DIGNSDVFAEEKDGIDASALVDHKVAVAGLALIVNKEVDVDNLTTEQLRQIFIGEVTNWKEVGGKDLPISVINRAAGSGS
RATFDTVIMEGQSAMQSQEQDSNGAVKSIVSKSPGAISYLSLTYIDDSVKSMKLNGYDLSPENISSNNWPLWSYEHMYTL
GQPNELAAEFLNFVLSDETQEGIVKGLKYIPIKEMKVEKDAAGTVTVLEGRQ
>A0A0H3MBL5 ~~~pstS2~~~Phosphate-binding protein PstS2~~~
MKFARSGAAVSLLAAGTLVLTACGGGTNSSSSGAGGTSGSVHCGGKKELHSSGSTAQENAMEQFVYAYVRSCPGYTLDYN
ANGSGAGVTQFLNNETDFAGSDVPLNPSTGQPDRAAERCGSPAWDLPTVFGPIAITYNIKGVSTLNLDGPTTAKIFNGTI
TVWNDPQIQALNSGTDLPPTPISVIFRSDKSGTSDNFQKYLDGASNGAWGKGASETFNGGVGVGASGNNGTSALLQTTDG
SITYNEWSFAVGKQLNMAQIITSAGPDPVAITTESVGKTIAGAKIMGQGNDLVLDTSSFYRPTQPGSYPIVLATYEIVCS
KYPDATTGTAVRAFMQAAIGPGQEGLDQYGSIPLPKSFQAKLAAAVNAIS
>P9WGT9 ~~~pstS2~~~Phosphate-binding protein PstS 2~~~COG0226
MKFARSGAAVSLLAAGTLVLTACGGGTNSSSSGAGGTSGSVHCGGKKELHSSGSTAQENAMEQFVYAYVRSCPGYTLDYN
ANGSGAGVTQFLNNETDFAGSDVPLNPSTGQPDRSAERCGSPAWDLPTVFGPIAITYNIKGVSTLNLDGPTTAKIFNGTI
TVWNDPQIQALNSGTDLPPTPISVIFRSDKSGTSDNFQKYLDGASNGAWGKGASETFNGGVGVGASGNNGTSALLQTTDG
SITYNEWSFAVGKQLNMAQIITSAGPDPVAITTESVGKTIAGAKIMGQGNDLVLDTSSFYRPTQPGSYPIVLATYEIVCS
KYPDATTGTAVRAFMQAAIGPGQEGLDQYGSIPLPKSFQAKLAAAVNAIS
>P0C2M5 ~~~pstS2~~~Phosphate-binding protein PstS 2~~~COG0226
MKFKKMLTLAAIGLSGFGLVACGNQSAASKQSASGTIEVISRENGSGTRGAFTEITGILKKDGDKKIDNTAKTAVIQNST
EGVLSAVQGNANAIGYISLGSLTKSVKALEIDGVKASRDTVLDGEYPLQRPFNIVWSSNLSKLGQDFISFIHSKQGQQVV
TDNKFIEAKTETTEYTSQHLSGKLSVVGSTSVSSLMEKLAEAYKKENPEVTIDITSNGSSAGITAVKEKTADIGMVSREL
TPEEGKSLTHDAIALDGIAVVVNNDNKASQVSMAELADVFSGKLTTWDKIK
>A0A0H3MBK5 ~~~pstS3~~~Phosphate-binding protein PstS3~~~
MKLNRFGAAVGVLAAGALVLSACGNDDNVTGGGATTGQASAKVDCGGKKTLKASGSTAQANAMTRFVNVFEQACPGQTLN
YTANGSGAGISEFNGNQTDFGGSDVPLSKDEAAAAQRRCGSPAWNLPVVFGPIAVTYNLNSVSSLNLDGPTLAKIFNGSI
TQWNNPAIQALNRDFTLPGERIHVVFRSDESGTTDNFQRYLQAASNGAWGKGAGKSFQGGVGEGARGNDGTSAAAKNTPG
SITYNEWSFAQAQHLTMANIVTSAGGDPVAITIDSVGQTIAGATISGVGNDLVLDTDSFYRPKRPGSYPIVLATYEIVCS
KYPDSQVGTAVKAFLQSTIGAGQSGLGDNGYIPIPDEFKSRLSTAVNAIA
>P9WGT7 ~~~pstS3~~~Phosphate-binding protein PstS 3~~~COG0226
MKLNRFGAAVGVLAAGALVLSACGNDDNVTGGGATTGQASAKVDCGGKKTLKASGSTAQANAMTRFVNVFEQACPGQTLN
YTANGSGAGISEFNGNQTDFGGSDVPLSKDEAAAAQRRCGSPAWNLPVVFGPIAVTYNLNSVSSLNLDGPTLAKIFNGSI
TQWNNPAIQALNRDFTLPGERIHVVFRSDESGTTDNFQRYLQAASNGAWGKGAGKSFQGGVGEGARGNDGTSAAAKNTPG
SITYNEWSFAQAQHLTMANIVTSAGGDPVAITIDSVGQTIAGATISGVGNDLVLDTDSFYRPKRPGSYPIVLATYEIVCS
KYPDSQVGTAVKAFLQSTIGAGQSGLGDNGYIPIPDEFKSRLSTAVNAIA
>O51233 ~~~pstS~~~Phosphate-binding protein PstS~~~
MKKVIILIFMLSTSLLYNCKNQDNEKIVSIGGSTTVSPILDEMILRYNKINNNTKVTYDAQGSSVGINGLFNKIYKIAIS
SRDLTKEEIEQGAKETVFAYDALIFITSPEIKITNITEENLAKILNGEIQNWKQVGGPDAKINFINRDSSSGSYSSIKDL
LLNKIFKTHEEAQFRQDGIVVKSNGEVIEKTSLTPHSIGYIGLGYAKNSIEKGLNILSVNSTYPTKETINSNKYTIKRNL
IIVTNNKYEDKSVTQFIDFMTSSTGQDIVEEQGFLGIKT
>P0AG82 ~~~pstS~~~Phosphate-binding protein PstS~~~COG0226
MKVMRTTVATVVAATLSMSAFSVFAEASLTGAGATFPAPVYAKWADTYQKETGNKVNYQGIGSSGGVKQIIANTVDFGAS
DAPLSDEKLAQEGLFQFPTVIGGVVLAVNIPGLKSGELVLDGKTLGDIYLGKIKKWDDEAIAKLNPGLKLPSQNIAVVRR
ADGSGTSFVFTSYLAKVNEEWKNNVGTGSTVKWPIGLGGKGNDGIAAFVQRLPGAIGYVEYAYAKQNNLAYTKLISADGK
PVSPTEENFANAAKGADWSKTFAQDLTNQKGEDAWPITSTTFILIHKDQKKPEQGTEVLKFFDWAYKTGAKQANDLDYAS
LPDSVVEQVRAAWKTNIKDSSGKPLY
>Q02DZ3 ~~~pstS~~~Phosphate-binding protein PstS~~~
MKLKRLMAALTFVAAGVGAASAVAAIDPALPEYQKASGVSGNLSSVGSDTLANLMTMWAEEYKRLYPNVNIQIQAAGSST
APPALTEGTANLGPMSRKMKDVELQAFEQKYGYKPTAVPVAVDALAIFVHKDNPIKGLTMQQVDAIFSATRLCGSKQDVK
TWGDLGLTGDWAKKPVQLFGRNSVSGTYGYFKEEALCKGDFRPNVNEQPGSASVVQSVSQSLNGIGYSGIGYKTASVKTV
ALAKKEGAAFVEDNEQNALNGTYPLSRFLYVYVNKAPNKPLDPLEAQFLKLVLSKTGQQVVVKDGYIPLPAKVAEKAIKE
LGL
>G3XDA8 ~~~pstS~~~Phosphate-binding protein PstS~~~
MKLKRLMAALTFVAAGVGAASAVAAIDPALPEYQKASGVSGNLSSVGSDTLANLMTMWAEEYKRLYPNVNIQIQAAGSST
APPALTEGTANLGPMSRKMKDVELQAFEQKYGYKPTAVPVAVDALAIFVHKDNPIKGLTMQQVDAIFSATRLCGSKQDVK
TWGDLGLTGDWAKKPVQLFGRNSVSGTYGYFKEEALCKGDFRPNVNEQPGSASVVQSVSQSLNGIGYSGIGYKTASVKTV
ALAKKEGAAFVEDNEQNALNGTYPLSRFLYVYVNKAPNKPLDPLEAQFLKLVLSKTGQQVVVKDGYIPLPAKVAEKAIKE
LGL
>P0DMR4 ~~~pstS~~~Phosphate-binding protein PstS~~~COG0226
MKLKRLMAALTFVAAGVGAASAVAAIDPALPEYQKASGVSGNLSSVGSDTLANLMTMWAEEYKRLYPNVNIQIQAAGSST
APPALTEGTANLGPMSRKMKDVELQAFEQKYGYKPTAVPVAVDALAIFVHKDNPIKGLTMQQVDAIFSATRLCGSKQDVK
TWGDLGLTGDWAKKPVQLFGRNSVSGTYGYFKEEALCKGDFRPNVNEQPGSASVVQSVSQSLNGIGYSGIGYKTASVKTV
ALAKKEGAAFVEDNEQNALNGTYPLSRFLYVYVNKAPNKPLDPLEAQFLKLVLSKTGQQVVVKDGYIPLPAKVAEKAIKE
LGL
>Q7A5Q2 ~~~pstS~~~Phosphate-binding protein PstS~~~
MKKWQFVGTTALGATLLLGACGGGNGGSGNSDLKGEAKGDGSSTVAPIVEKLNEKWAQDHSDAKISAGQAGTGAGFQKFI
AGDIDFADASRPIKDEEKQKLQDKNIKYKEFKIAQDGVTVAVNKENDFVDELDKQQLKAIYSGKAKTWKDVNSKWPDKKI
NAVSPNSSHGTYDFFENEVMNKEDIKAEKNADTNAIVSSVTKNKEGIGYFGYNFYVQNKDKLKEVKIKDENGKATEPTKK
TIQDNSYALSRPLFIYVNEKALKDNKVMSEFIKFVLEDKGKAAEEAGYVAAPEKTYKSQLDDLKAFIDKNQKSDDKKSDD
KKSEDKK
>P33025 4.2.1.70~~~psuG~~~Pseudouridine-5'-phosphate glycosidase~~~COG2313
MSELKISPELLQISPEVQDALKNKKPVVALESTIISHGMPFPQNAQTAIEVEETIRKQGAVPATIAIIGGVMKVGLSKEE
IELLGREGHNVTKVSRRDLPFVVAAGKNGATTVASTMIIAALAGIKVFATGGIGGVHRGAEHTFDISADLQELANTNVTV
VCAGAKSILDLGLTTEYLETFGVPLIGYQTKALPAFFCRTSPFDVSIRLDSASEIARAMVVKWQSGLNGGLVVANPIPEQ
FAMPEHTINAAIDQAVAEAEAQGVIGKESTPFLLARVAELTGGDSLKSNIQLVFNNAILASEIAKEYQRLAG
>Q9X1H5 4.2.1.70~~~psuG~~~Pseudouridine-5'-phosphate glycosidase~~~COG2313
MIIESRIEKGKPVVGMETTVFVHGLPRKEAIELFRRAKEISREKGFQLAVIGILKGKIVAGMSEEELEAMMREGADKVGT
REIPIVVAEGKNAATTVSATIFLSRRIGIEVVVTGGTGGVHPGRVDVSQDLTEMSSSRAVLVSSGIKSILDVEATFEMLE
TLEIPLVGFRTNEFPLFFSRKSGRRVPRIENVEEVLKIYESMKEMELEKTLMVLNPVPEEYEIPHDEIERLLEKIELEVE
GKEVTPFLLKKLVEMTNGRTLKANLALLEENVKLAGEIAVKLKRS
>P30235 2.7.1.83~~~psuK~~~Pseudouridine kinase~~~COG0524
MREKDYVVIIGSANIDVAGYSHESLNYADSNPGKIKFTPGGVGRNIAQNLALLGNKAWLLSAVGSDFYGQSLLTQTNQSG
VYVDKCLIVPGENTSSYLSLLDNTGEMLVAINDMNISNAITAEYLAQHGEFIQRAKVIVADCNISEEALAWILDNAANVP
VFVDPVSAWKCVKVRDRLNQIHTLKPNRLEAETLSGIALSGREDVAKVAAWFHQHGLNRLVLSMGGDGVYYSDISGESGW
SAPIKTNVINVTGAGDAMMAGLASCWVDGMPFAESVRFAQGCSSMALSCEYTNNPDLSIANVISLVENAECLN
>P33024 ~~~psuT~~~Putative pseudouridine transporter~~~COG1972
MDIMRSVVGMVVLLAIAFLLSVNKKSISLRTVGAALLLQIAIGGIMLYFPPGKWAVEQAALGVHKVMSYSDAGSAFIFGS
LVGPKMDVLFDGAGFIFAFRVLPAIIFVTALISLLYYIGVMGLLIRILGSIFQKALNISKIESFVAVTTIFLGQNEIPAI
VKPFIDRMNRNELFTAICSGMASIAGSMMIGYAGMGVPIDYLLAASLMAIPGGILFARILSPATEPSQVTFENLSFSETP
PKSFIEAAASGAMTGLKIAAGVATVVMAFVAIIALINGIIGGIGGWFGFANASLESIFGYVLAPLAWIMGVDWSDANLAG
SLIGQKLAINEFVAYLSFSPYLQTGGTLEVKTIAIISFALCGFANFGSIGVVVGAFSAISPKRAPEIAQLGLRALAAATL
SNLMSATIAGFFIGLA
>P37177 2.7.3.9~~~ptsP~~~Phosphoenolpyruvate-dependent phosphotransferase system~~~COG3605
MLTRLREIVEKVASAPRLNEALNILVTDICLAMDTEVCSVYLADHDRRCYYLMATRGLKKPRGRTVTLAFDEGIVGLVGR
LAEPINLADAQKHPSFKYIPSVKEERFRAFLGVPIIQRRQLLGVLVVQQRELRQYDESEESFLVTLATQMAAILSQSQLT
ALFGQYRQTRIRALPAAPGVAIAEGWQDATLPLMEQVYQASTLDPALERERLTGALEEAANEFRRYSKRFAAGAQKETAA
IFDLYSHLLSDTRLRRELFAEVDKGSVAEWAVKTVIEKFAEQFAALSDNYLKERAGDLRALGQRLLFHLDDANQGPNAWP
ERFILVADELSATTLAELPQDRLVGVVVRDGAANSHAAIMVRALGIPTVMGADIQPSVLHRRTLIVDGYRGELLVDPEPV
LLQEYQRLISEEIELSRLAEDDVNLPAQLKSGERIKVMLNAGLSPEHEEKLGSRIDGIGLYRTEIPFMLQSGFPSEEEQV
AQYQGMLQMFNDKPVTLRTLDVGADKQLPYMPISEENPCLGWRGIRITLDQPEIFLIQVRAMLRANAATGNLNILLPMVT
SLDEVDEARRLIERAGREVEEMIGYEIPKPRIGIMLEVPSMVFMLPHLAKRVDFISVGTNDLTQYILAVDRNNTRVANIY
DSLHPAMLRALAMIAREAEIHGIDLRLCGEMAGDPMCVAILIGLGYRHLSMNGRSVARAKYLLRRIDYAEAENLAQRSLE
AQLATEVRHQVAAFMERRGMGGLIRGGL
>P08838 2.7.3.9~~~ptsI~~~Phosphoenolpyruvate-protein phosphotransferase~~~COG1080
MQELKGIGASAGIAIAKAYRLEEPDLTVEKKNISDSEAEVSRFDEAIARSKEELEKIKEHALKELGQDKADIFSAHLLVL
SDPELLNPVKEKISTDSVNAEFALKETSSMFVTMFESMDNEYMKERAADIRDVTKRVTGHLLGVEIPNPSMISEEVIIVA
EDLTPSDTAQLNREFVKGFTTDIGGRTSHSAIMARSLEIPAVVGTKAATGTIQNGVTVIVDGINGDVIIDPSAETVKEYE
EKHNAYLAQKAEWAKLVNEPTVSKDGHHVELAANIGTPDDVKGVLENGGEAVGLYRTEFLYMGRDQLPTEDEQFDAYKTV
LERMEGKSVVVRTLDIGGDKELPYLQLPKEMNPFLGYRAIRLCLEEQEIFRTQLRALLRASTYGNLKIMFPMIATVNEFK
EAKAILLEEKEKLVKAGQAVSDDIEVGMMVEIPSTAVIADQFAKEVDFFSIGTNDLIQYTMAADRMNERVSYLYQPYNPA
ILRLITLVIEAAHKEGKWVGMCGEMAGDEIAIPILLGLGLDEFSMSATSILPARTQISKLSKQEAESFKEKILSMSTTEE
VVAFVKETFK
>P08839 2.7.3.9~~~ptsI~~~Phosphoenolpyruvate-protein phosphotransferase~~~COG1080
MISGILASPGIAFGKALLLKEDEIVIDRKKISADQVDQEVERFLSGRAKASAQLETIKTKAGETFGEEKEAIFEGHIMLL
EDEELEQEIIALIKDKHMTADAAAHEVIEGQASALEELDDEYLKERAADVRDIGKRLLRNILGLKIIDLSAIQDEVILVA
ADLTPSETAQLNLKKVLGFITDAGGRTSHTSIMARSLELPAIVGTGSVTSQVKNDDYLILDAVNNQVYVNPTNEVIDKMR
AVQEQVASEKAELAKLKDLPAITLDGHQVEVCANIGTVRDVEGAERNGAEGVGLYRTEFLFMDRDALPTEEEQFAAYKAV
AEACGSQAVIVRTMDIGGDKELPYMNFPKEENPFLGWRAIRIAMDRREILRDQLRAILRASAFGKLRIMFPMIISVEEVR
ALRKEIEIYKQELRDEGKAFDESIEIGVMVETPAAATIARHLAKEVDFFSIGTNDLTQYTLAVDRGNDMISHLYQPMSPS
VLNLIKQVIDASHAEGKWTGMCGELAGDERATLLLLGMGLDEFSMSAISIPRIKKIIRNTNFEDAKVLAEQALAQPTTDE
LMTLVNKFIEEKTIC
>P23530 2.7.3.9~~~ptsI~~~Phosphoenolpyruvate-protein phosphotransferase~~~COG1080
MSEMLKGIAASDGVAVAKAYLLVQPDLSFNKTSVEDTDAEATRLDDALAKSTEELQAIRDKAAQSLGEAEAQVFDAHLMV
LSDPEMVGQIKQNIQDNKVNAEAALKEVTDMYIGMFEAMDDNAYMQERAADIRDVAKRILAHLLGVTLPNPSMINEEVIV
VAHDLTPSDTAQLDRTYVKAFVTDIGGRTSHSAIMARSLEIPAIVGTKEITDKVKAGDILAVNGIIGDVIIDPTDAEKSE
FEAEAKAYADQKAEWDKLKNAETVTADGKHVELAANIGTPKDLEGVHKNGGEAVGLYRTEFLYMDSSDFPTEEDQYQAYK
AVLEGMEGKPVVVRTMDIGGDKELPYLTLPHEMNPFLGYRALRISLSELGDGMFRTQMRALLRASVHGNLRIMFPMVATL
KEFRAAKAIFEDEKQKLVNEGVEVSNDIQVGIMIEIPAAAVLADKFAKEVDFFSVGTNDLIQYTMAADRMNERVSYLYQP
YNPSILRLIKNVIDAAHAEGKWAGMCGEMAGDQTAVPLLLGMGLDEFSMSATSILKTRSLMKRLDTTKMAELADRALKEC
DTMEEVFALVEEYTK
>Q9ZAD8 2.7.3.9~~~ptsI~~~Phosphoenolpyruvate-protein phosphotransferase~~~
MTTMLKGIAASSGVAVAKAYLLVQPDLSFETKTIADTANEEARLDAALATSQSELQLIKDKAVTTLGEEAASVFDAHMMV
LADPDMTAQIKAVINDKKVNAESALKEVTDMFIGIFEGMTDNAYMQERAADIKDVTKRVLAHLLGVKLPSPALIDEEVII
VAEDLTPSDTAQLDKKFVKAFVTNIGGRTSHSAIMARTLEIPAVLGTNNITELVSEGQLLAVSGLTGEVILDPSTEQQSE
FHKAGDAYAAQKAEWAALKDAETVTADGRHYELAANIGTPKDVEGVNDNGAEAIGLYRTEFLYMDAQDFPTEDDQYEAYK
AVLEGMNGKPVVVRTMDIGGDKTLPYFDLPKEMNPFLGWRALRISLSTAGDGMFRTQLRALLRASVHGQLRIMFPMVALV
TEFRAAKKIYDEEKSKLIAEGVPVAEGIEVGIMIEIPAAAMLADQFAKEVDFFSIGTNDLIQYTMAADRMNEQVSYLYQP
YNPSILRLINNVIKAAHAEGKWAGMCGEMAGDQTAVPLLMGMGLDEFSMSATSVLQTRSLMKRLDSKKMEELSSKALSEC
ATMEEVIALVEEYTK
>Q84F83 2.7.3.9~~~ptsI~~~Phosphoenolpyruvate-protein phosphotransferase~~~
MTHLACIAASDGIAIAKAYRFVQPNLTFSQTTVQDVQAEQQRLAAALAKTEQELRPSKQQTLKKFSAEEAAIFEAHLLVV
KDSELIGPINQKTADEGVNAEFALHEVSSMFVALFESMDDEYMSARASDIKDVTNRILAHLLGAHIPNPSSISEQVIIVA
NDLTPSETAQLDRNFVLGFITDIGGRTSHSAIMGRSLEIPAVVGTGVATTTIQDGDILIVDGLSGQVFVNPTADVIVSYQ
EKAQSYHTQQAEWSTLVNEQTVSKDGVHVELAANIGSPGDLEGVLRHGAEGIGLYRTEFLYMGRENLPSEDEQFTAYQTV
LEGMQGKPVVIRTLDLGGDKHLPYLPLQEEMNPFLGHRAIRLCLEQQELFRTPLRALLRASVYGNLKIMFPIIATIQEFR
DAKAILLEEQEKLKAAGEEVSADIEIGMMVEIPATAVMADVFAKEVDFFSIGTNDLIQYTMAADRMNEKVSYLYQPYNPA
FLRLIQMVIHAAHQEQKWVGMCGEMAGDELAVPLLLGLGLDEFSMSATSILKTRSLLKRLSVKDMQALATEALQVATAEE
VMEKVKQAVK
>P45617 2.7.3.9~~~ptsI~~~Phosphoenolpyruvate-protein phosphotransferase~~~
MSKQIKGIAASEGISLARALVIKETKLDIQKQLISDVDQEIIKLEQAIEKSIADLKKIQQITLKKLGEEKAAIFDAHQDI
ANDPAIKEEVVELIKKEKVNAEYALFTVSNNYFEMFSQLEDPYFKERSADIKDVSLRIISHILGLEIHDLSTIDKEVIII
SDDLTPSQTAQLDKKFVKGFLTNVGGRTSHAAIMARSLEIPAILGLKNITELVKTDDLIALDGSSGIVELDLNDDDIKNY
QTKVQQYIELKEQLKKFKDEPSLTKDKIKKLIEANIGSTNDVQSVLDSGAEGIGLFRTEFLYMDNDHFPTEEEQFEAYKK
VVSQIKHLVVFRTLDIGGDKKLSYFKFDEEMNPFLGYRAIRFTLDRKDIFKDQIRALLRASAFGKLGIMFPMIATIDEFK
QAKTFVEECKIELDKEGIKYDNQVQIGMMVEIPSAAILADQFAKYADFFSIGTNDLIQYSFASDRMNQNVSYLYQPLNPS
LLRLIQLTISGAHKHNKWVGMCGEMAGDSKALPILLGLDLDAFSMSATSVLKARSLMSKIEFSKAKILANKVLECETNEQ
VNKLVEDFLNNLD
>P0A249 2.7.3.9~~~ptsI~~~Phosphoenolpyruvate-protein phosphotransferase~~~
MISGILASPGIAFGKALLLKEDEIVIDRKKISADKVDQEVERFLSGRAKASAQLEAIKTKAGETFGEEKEAIFEGHIMLL
EDEELEQEIIALIKDKHMTADAAAHEVIEGQATALEELDDEYLKERAADVRDIGKRLLRNILGLAIIDLSAIQEEVILVA
ADLTPSETAQLNLQKVLGFITDAGGRTSHTSIMARSLELPAIVGTGSVTAQVKNGDYLILDAVNNQVYVNPTNDVIEQLR
AVQEQVATEKAELAKLKDLPAITLDGHQVEVCANIGTVRDVEGAERNGAEGVGLYRTEFLFMDRDALPTEEEQFAAYKAV
AEACGSQAVIVRTMDIGGDKELPYMNFPKEENPFLGWRAVRIAMDRKEILRDQVRAILRASAFGKLRIMFPMIISVEEVR
ALRKEIEIYKQELRDEGKAFDESIEIGVMVETPAAATIARHLAKEVDFFSIGTNDLTQYTLAVDRGNDMISHLYQPMSPS
VLNLIKQVIDASHAEGKWTGMCGELAGDERATLLLLGMGLDEFSMSAISIPRIKKIIRNTNFEDAKVLAEQALAQPTTDE
LMTLVNKFIEEKTIC
>Q99V14 2.7.3.9~~~ptsI~~~Phosphoenolpyruvate-protein phosphotransferase~~~
MSKLIKGIAASDGVAIAKAYLLVEPDLTFDKNEKVTDVEGEVAKFNSAIEASKVELTKIRNNAEVQLGADKAAIFDAHLL
VLDDPELIQPIQDKIKNENANAATALTDVTTQFVTIFESMDNEYMKERAADIRDVSKRVLSHILGVELPNPSMIDESVVI
VGNDLTPSDTAQLNKEFVQGFATNIGGRTSHSAIMSRSLEIPAIVGTKSITQEVKQGDMIIVDGLNGDVIVNPTEDELIA
YQDKRECYFADKKELQKLRDADTVTVDGVHAELAANIGTPNDLPGVIENGAQGIGLYRTEFLYMGRDQMPTEEEQFEAYK
EVLEAMDGKRVVVRTLDIGGDKELSYLNLPEEMNPFLGYRAIRLCLAQQDIFRPQLRALLRASVYGKLNIMFPMVATINE
FREAKAILLEEKENLKNEGHDISDDIELGIMVEIPATAALADVFAKEVDFFSIGTNDLIQYTLAADRMSERVSYLYQPYN
PSILRLVKQVIEASHKEGKWTGMCGEMAGDETAIPLLLGLGLDEFSMSATSILKARRQINGLSKNEMTELANRAVDCATQ
EEVIELVNNYVK
>P51183 2.7.3.9~~~ptsI~~~Phosphoenolpyruvate-protein phosphotransferase~~~
MSKLIKGIAASDGVAIAKAYLLVEPDLTFDKNEKVTDVEGEVAKFNSAIEASKVELTKIRNNAEVQLGADKAAIFDAHLL
VLDDPELIQPIQDKIKNENANAATALTDVTTQFVTIFESMDNEYMKERAADIRDVSKRVLSHILGVELPNPSMIDESVVI
VGNDLTPSDTAQLNKEFVQGFATNIGGRTSHSAIMSRSLEIPAIVGTKSITQEVKQGDMIIVDGLNGDVIVNPTEDELIA
YQDKRERYFADKKELQKLRDADTVTVDGVHAELAANIGTPNDLPGVIENGAQGIGLYRTEFLYMGRDQMPTEEEQFEAYK
EVLEAMGGKRVVVRTLDIGGDKELSYLNLPEEMNPFLGYRAIRLCLAQQDIFRPQLRALLRASVYGKLNIMFPMVATINE
FREAKAILLEEKENLKNEGHDISDDIELGIMVEIPATAALADVFAKEVDFFSIGTNDLIQYTLAADRMSERVSYLYQPYN
PSILRLVKQVIEASHKEGKWTGMCGEMAGDETAIPLLLGLGLDEFSMSATSILKARRQINGLSKNEMTELANRAVDCATQ
EEVIELVNNYVK
>P23533 2.7.3.9~~~ptsI~~~Phosphoenolpyruvate-protein phosphotransferase~~~COG1080
MAKQIKGIAASDGVAIAKAYLLVEPDLSFDNESVTDTDAEVAKFNGALNKSKVELTKIRNNAEKQLGADKAAIFDAHLLV
LEDPELIQPIEDKIKNESVNAAQALTDVSNQFITIFESMDNEYMAERAADIRDVSKRVLAHILGVELPNPSIVDESVVII
GNDLTPSDTAQLNKEYVQGFVTNIGGRTSHSAIMSRSLEIPAVVGTKSITEEVEAGDTIVVDGMTGDVLINPSDEVIAEY
QEKRENFFKDKQELQKLRDAESVTADGHHVELAANIGTPNDLPGVIENGAEGIGLYRTEFLYMGRDQMPTEEEQFEAYKA
VLEAMKGKRVVVRTLDIGGDKELPYLDLPEEMNPFLGYRAIRLCLDQPEIFRPQLRALLRASVFGKLNIMFPMVATIQEF
RDAKALLEEERANLKNEGYEVADDIELGIMVEIPSTAALADIFAKEVDFFSIGTNDLIQYTMAADRMSERVSYLYQPYNP
AILRLVKQVIEASHAEGKWTGMCGEMAGDQTAIPLLLGLGLDEFSMSATSILKARRLIRSLNESEMKELSERAVQCATSE
EVVDLVEEYTKNA
>Q9WXK9 2.7.3.9~~~ptsI~~~Phosphoenolpyruvate-protein phosphotransferase~~~
MTEMLKGIAASDGVAVAKAYLLVQPDLSFETVTVEDTSAEEARLDAALKASQDELSIIREKAVETLGEEAAAVFDAHLMV
LADPEMISQIKETIRAKQTNAEAGLKEVTDMFITIFEGMEDNPYMQERAADIRDVAKRVLAHLLGAKLPNPATIDEESIV
IAHDLTPSDTAQLNKQFVKAFVTNIGGRTSHSAIMARTLEIAAVLGTNDITSRVKDGDIVAVNGITGEVIINPTDEQVAE
FKAAGEAYAKQKAEWALLKDAKTVTADGKHFELAANIGTPKDVEGVNANGAEAVGLYRTEFLYMDSQDFPTEDEQYEAYK
AVLEGMNGKPVVVRTMDIGGDKELPYLDLPKEMNPFLGFRALRISISETGNAMFRTQIRALLRASVHGQLRIMFPMVALL
KEFRAAKAIFDEEKANLKAEGVAVSDDIQVGIMIEIPAAAMLADQFAKEVDFFSIGTNDLIQYTMAADRMNEQVSYLYQP
YNPSILRLINNVIKAAHAEGKWVGMCGEMAGDQKAVPLLVEMGLDEFSMSATSILRTRSLMKKLDTAKMQEYANRALTEC
STMEEVLELSKEYVNVD
>P39646 2.3.1.8~~~pta~~~Phosphate acetyltransferase~~~COG0280
MADLFSTVQEKVAGKDVKIVFPEGLDERILEAVSKLAGNKVLNPIVIGNENEIQAKAKELNLTLGGVKIYDPHTYEGMED
LVQAFVERRKGKATEEQARKALLDENYFGTMLVYKGLADGLVSGAAHSTADTVRPALQIIKTKEGVKKTSGVFIMARGEE
QYVFADCAINIAPDSQDLAEIAIESANTAKMFDIEPRVAMLSFSTKGSAKSDETEKVADAVKIAKEKAPELTLDGEFQFD
AAFVPSVAEKKAPDSEIKGDANVFVFPSLEAGNIGYKIAQRLGNFEAVGPILQGLNMPVNDLSRGCNAEDVYNLALITAA
QAL
>P77844 2.3.1.8~~~pta~~~Phosphate acetyltransferase~~~COG0280
MSAELFENWLLKRARAEHSHIVLPEGDDDRILMAAHQLLDQDICDITILGDPVKIKERATELGLHLNTAYLVNPLTDPRL
EEFAEQFAELRKSKSVTIDEAREIMKDISYFGTMMVHNGDADGMVSGAANTTAHTIKPSFQIIKTVPEASVVSSIFLMVL
RGRLWAFGDCAVNPNPTAEQLGEIAVVSAKTAAQFGIDPRVAILSYSTGNSGGGSDVDRAIDALAEARRLNPELCVDGPL
QFDAAVDPGVARKKMPDSDVAGQANVFIFPDLEAGNIGYKTAQRTGHALAVGPILQGLNKPVNDLSRGATVPDIVNTVAI
TAIQAGGRS
>P99092 2.3.1.8~~~pta~~~Phosphate acetyltransferase~~~
MADLLNVLKDKLSGKNVKIVLPEGEDERVLTAATQLQATDYVTPIVLGDETKVQSLAQKLDLDISNIELINPATSELKAE
LVQSFVERRKGKATEEQAQELLNNVNYFGTMLVYAGKADGLVSGAAHSTGDTVRPALQIIKTKPGVSRTSGIFFMIKGDE
QYIFGDCAINPELDSQGLAEIAVESAKSALSFGMDPKVAMLSFSTKGSAKSDDVTKVQEAVKLAQQKAEEEKLEAIIDGE
FQFDAAIVPGVAEKKAPGAKLQGDANVFVFPSLEAGNIGYKIAQRLGGYDAVGPVLQGLNSPVNDLSRGCSIEDVYNLSI
ITAAQALQ
>Q6GJ80 2.3.1.8~~~pta~~~Phosphate acetyltransferase~~~
MADLLNVLKDKLSGKNVKIVLPEGEDERVLTAATQLQATDYVTPIVLGDETKVQSLAQKLNLDISNIELINPATSELKAE
LVQSFVERRKGKTTEEQAQELLNNVNYFGTMLVYAGKADGLVSGAAHSTGDTVRPALQIIKTKPGVSRTSGIFFMIKGDE
QYIFGDCAINPELDSQGLAEIAVESAKSALSFGMDPKVAMLSFSTKGSAKSDDVTKVQEAVKLAQQKAEEEKLEAIIDGE
FQFDAAIVPGVAEKKAPGAKLQGDANVFVFPSLEAGNIGYKIAQRLGGYDAVGPVLQGLNSPVNDLSRGCSIEDVYNLSF
ITAAQALQ
>Q9X0L4 2.3.1.8~~~pta~~~Phosphate acetyltransferase~~~COG0280
MFLEKLVEMARGKGKKLAVAAANDDHVIEAVYRAWRERVCEPVLFGPEEEITRIIEELVPEWKNPQIIDCPPEEAGRLAV
EAVSKGECDFLMKGKIKTGDLMKIYLDERYGLRTGKTMAMVSVMEIPDFPRPLIISDPGMLISPTLEQKVDMIEHCVRVA
NVMGLETPKVAVVGAIEVVNPKMPITMEAAILSKMNQRGQIKGCIVDGPFALDNVVSEEAAKKKGIQSPVAGKADILILP
DIEAANILYKALVFLAKAKSASTILGGKVPVVLTSRADSEETKFYSIALSAVFA
>P0A9M8 2.3.1.8~~~pta~~~Phosphate acetyltransferase~~~COG0280
MSRIIMLIPTGTSVGLTSVSLGVIRAMERKGVRLSVFKPIAQPRTGGDAPDQTTTIVRANSSTTTAAEPLKMSYVEGLLS
SNQKDVLMEEIVANYHANTKDAEVVLVEGLVPTRKHQFAQSLNYEIAKTLNAEIVFVMSQGTDTPEQLKERIELTRNSFG
GAKNTNITGVIVNKLNAPVDEQGRTRPDLSEIFDDSSKAKVNNVDPAKLQESSPLPVLGAVPWSFDLIATRAIDMARHLN
ATIINEGDINTRRVKSVTFCARSIPHMLEHFRAGSLLVTSADRPDVLVAACLAAMNGVEIGALLLTGGYEMDARISKLCE
RAFATGLPVFMVNTNTWQTSLSLQSFNLEVPVDDHERIEKVQEYVANYINADWIESLTATSERSRRLSPPAFRYQLTELA
RKAGKRIVLPEGDEPRTVKAAAICAERGIATCVLLGNPAEINRVAASQGVELGAGIEIVDPEVVRESYVGRLVELRKNKG
MTETVAREQLEDNVVLGTLMLEQDEVDGLVSGAVHTTANTIRPPLQLIKTAPGSSLVSSVFFMLLPEQVYVYGDCAINPD
PTAEQLAEIAIQSADSAAAFGIEPRVAMLSYSTGTSGAGSDVEKVREATRLAQEKRPDLMIDGPLQYDAAVMADVAKSKA
PNSPVAGRATVFIFPDLNTGNTTYKAVQRSADLISIGPMLQGMRKPVNDLSRGALVDDIVYTIALTAIQSAQQQ
>P9WHP1 2.3.1.8~~~pta~~~Phosphate acetyltransferase~~~COG0280
MADSSAIYLAAPESQTGKSTIALGLLHRLTAMVAKVGVFRPITRLSAERDYILELLLAHTSAGLPYERCVGVTYQQLHAD
RDDAIAEIVDSYHAMADECDAVVVVGSDYTDVTSPTELSVNGRIAVNLGAPVLLTVRAKDRTPDQVASVVEVCLAELDTQ
RAHTAAVVANRCELSAIPAVTDALRRFTPPSYVVPEEPLLSAPTVAELTQAVNGAVVSGDVALREREVMGVLAAGMTADH
VLERLTDGMAVITPGDRSDVVLAVASAHAAEGFPSLSCIVLNGGFQLHPAIAALVSGLRLRLPVIATALGTYDTASAAAS
ARGLVTATSQRKIDTALELMDRHVDVAGLLAQLTIPIPTVTTPQMFTYRLLQQARSDLMRIVLPEGDDDRILKSAGRLLQ
RGIVDLTILGDEAKVRLRAAELGVDLDGATVIEPCASELHDQFADQYAQLRKAKGITVEHAREIMNDATYFGTMLVHNCH
ADGMVSGAAHTTAHTVRPALEIIKTVPGISTVSSIFLMCLPDRVLAYGDCAIIPNPTVEQLADIAICSARTAAQFGIEPR
VAMLSYSTGDSGKGADVDKVRAATELVRAREPQLPVEGPIQYDAAVEPSVAATKLRDSPVAGRATVLIFPDLNTGNNTYK
AVQRSAGAIAIGPVLQGLRKPVNDLSRGALVDDIVNTVAITAIQAQGVHE
>Q9I5A5 2.3.1.8~~~pta~~~Phosphate acetyltransferase~~~
MHTFFIAPTGFGVGLTSISLGLLRALERAGLKVGFFKPIAQLHPGDLGPERSSELVARTHGLDTPKPLPLAQVERMLGDG
QLDELLEEIISLYQRAAADKDVVIVEGMVPTRHASYAARVNFHLAKSLDAEVILVSAPENETLTELTDRIEIQAQLFGGP
RDPKVLGVILNKVRGEADAANAEDGVADFARRLTEHSPLLRDDFRLIGCIPWQDELNAARTRDIADLLSARVINAGDYEQ
RRVQKIVLCARAVPNTVQLLKPGVLVVTPGDRDDIILAASLAAMNGVPLAGLLLCSDFPPDPRIMELCRGALQGGLPVLS
VATGSYDTATNLNRMNKEIPVDDRERAERVTEFVAGHIDFEWLKQRCGTPRELRLSPPAFRYQVVQRAQKAGKRIVLPEG
SEPRTVQAAAICQARGIARCVLLAKPEEVQAVAQAQGIVLPEGLEIIDPDLVRQRYVEPMVELRKGKGLNAPMAEQQLED
SVVLATMMLALDEVDGLVSGAIHTTASTIRPALQLIKTAPGYNLVSSVFFMLLPDQVLVYGDCAVNPDPSASDLAEIAVQ
SAASAQAFGIPARVAMISYSTGDSGSGVDVDKVREATRLAREQRPDLLIDGPLQYDAAAIASVGRQKAPNSPVAGQATVF
IFPDLNTGNTTYKAVQRSADCVSVGPMLQGLRKPVNDLSRGALVEDIVYTIALTAIQADAQAPA
>Q8ZND6 2.3.1.222~~~pta~~~Phosphate acetyltransferase~~~
MSRIIMLIPTGTSVGLTSVSLGVIRAMERKGVRLSVFKPIAQPRAGGDAPDQTTTIVRANSTLPAAEPLKMSHVESLLSS
NQKDVLMEEIIANYHANTKDAEVVLVEGLVPTRKHQFAQSLNYEIAKTLNAEIVFVMSQGTDTPEQLNERIELTRSSFGG
AKNTNITGVIINKLNAPVDEQGRTRPDLSEIFDDSSKAQVIKIDPAKLQESSPLPVLGAVPWSFDLIATRAIDMARHLNA
TIINEGDIKTRRVKSVTFCARSIPHMLEHFRAGSLLVTSADRPDVLVAACLAAMNGVEIGALLLTGGYEMDARISKLCER
AFATGLPVFMVNTNTWQTSLSLQSFNLEVPVDDHERIEKVQEYVANYVNAEWIESLTATSERSRRLSPPAFRYQLTELAR
KAGKRVVLPEGDEPRTVKAAAICAERGIATCVLLGNPDEINRVAASQGVELGAGIEIVDPEVVRESYVARLVELRKSKGM
TEPVAREQLEDNVVLGTLMLEQDEVDGLVSGAVHTTANTIRPPLQLIKTAPGSSLVSSVFFMLLPEQVYVYGDCAINPDP
TAEQLAEIAIQSADSAIAFGIEPRVAMLSYSTGTSGAGSDVEKVREATRLAQEKRPDLMIDGPLQYDAAVMADVAKSKAP
NSPVAGRATVFIFPDLNTGNTTYKAVQRSADLISIGPMLQGMRKPVNDLSRGALVDDIVYTIALTAIQASQQQQ
>P58255 2.3.1.19~~~ptb~~~Phosphate butyryltransferase~~~COG0280
MIKSFNEIIMKVKSKEMKKVAVAVAQDEPVLEAVRDAKKNGIADAILVGDHDEIVSIALKIGMDVNDFEIVNEPNVKKAA
LKAVELVSTGKADMVMKGLVNTATFLRSVLNKEVGLRTGKTMSHVAVFETEKFDRLLFLTDVAFNTYPELKEKIDIVNNS
VKVAHAIGIENPKVAPICAVEVINPKMPSTLDAAMLSKMSDRGQIKGCVVDGPLALDIALSEEAAHHKGVTGEVAGKADI
FLMPNIETGNVMYKTLTYTTDSKNGGILVGTSAPVVLTSRADSHETKMNSIALAALVAGNK
>Q9CIE9 ~~~ptcA~~~PTS system cellobiose-specific EIIA component~~~COG1447
MTDKYENPTSDDYMGVVMGIIMSGGNAKGLAFQAIQQAKAGEFAEAESSLNEASEQLREAHDVQTDLLTRLAQGEKIGWN
LYMVHAQDHLMNAITFKDLAVEVVGQEQRLQALENK
>A2RIE7 ~~~ptcA~~~PTS system galactose-specific EIIA component~~~COG1447
MTDKYENPTSDDYMGVVMGIIMSGGNAKGLAFQAIQQAKDGKFAEAESSLNEASEQLREAHDVQTDLLTRLAQGEKIGWN
LYMVHAQDHLMNAITFKDLAVEVVGQERRLQALENK
>Q9CIF0 2.7.1.205~~~ptcB~~~PTS system cellobiose-specific EIIB component~~~COG1440
MADKVIALACAAGMSTSLLVSKMQKAAADNGKDYEIFAKSTADIDNMLAGTGSPKPDVLLLGPQVAFMKGEVAKKAEIAG
VPMDVIKMQDYGMMRGDKVLAAAENLMN
>A2RIE6 2.7.1.204~~~ptcB~~~PTS system galactose-specific EIIB component~~~COG1440
MADKVIALACAAGMSTSLLVSKMQKAAAENGKDYEIFAKSTADIDNMLAGTGSPKPDVLLLGPQVAFMKGEVAKKAEIAG
VPMDVIKMQDYGMMRGDKVLAAAENLMN
>Q837U7 2.1.3.6~~~ptcA~~~Putrescine carbamoyltransferase~~~COG0078
MKRDYVTTETYTKEEMHYLVDLSLKIKEAIKNGYYPQLLKNKSLGMIFQQSSTRTRVSFETAMEQLGGHGEYLAPGQIQL
GGHETIEDTSRVLSRLVDILMARVERHHSIVDLANCATIPVINGMSDYNHPTQELGDLCTMVEHLPEGKKLEDCKVVFVG
DATQVCFSLGLITTKMGMNFVHFGPEGFQLNEEHQAKLAKNCEVSGGSFLVTDDASSVEGADFLYTDVWYGLYEAELSEE
ERMKVFYPKYQVNQEMMDRAGANCKFMHCLPATRGEEVTDEVIDGKNSICFDEAENRLTSIRGLLVYLMNDYEAKNPYDL
IKQAEAKKELEVFLDTQSI
>Q8DW19 2.1.3.6~~~ptcA~~~Putrescine carbamoyltransferase~~~COG0078
MMKKTDYITTEDFSKEELLKLVDLSLKIKACIKNGYYPPLLEHKSLGMIFQQTSTRTRVSFETAMSQLGGHAQYLAPGQI
QLGGHETIEDTSTVLSRLDDILMARVERHQSVVDLARCASIPVINGMSDYNHPTQELGDLCTMIEHLPAGKKLEDCKVVF
VGDATQVCFSLALITTKMGMEFVHFGPKGFQLNDMHKEKLDKICERSGGKYTVTDNEDAIEGADFLYTDVWYGLYEAELS
EEERMQIFFPKYQVDSQMMAKAGADCKFMHCLPATRGEEITDEVMDGPHSICFDEAENRLTSIRGLLVYLLRDYREKNPY
DLVKQEKAKEELETFLKPE
>O05506 ~~~gmuA~~~PTS system oligo-beta-mannoside-specific EIIA component~~~COG1447
MEQMKITNLTDEQISFQLILHSGNARSCIIQSLRAYKEGKKDEADALIAKAEQDLSAAHDIHFQMIQKESGGEATAFSLL
LMHAEDHLMSTLSMKELVKEMLDLFKTKNI
>Q45402 ~~~celD~~~PTS system cellobiose-specific EIIA component~~~
MQTYEQTVFQLILHGGNGRSYAMEAITAAKKGEFAEARRLLEQAGAELQAAHGLQTALLQQEASGGQPVVTLLMVHAQDH
LMTAITVKDLAAEFVELYEALKRQTTES
>O05505 2.7.1.205~~~gmuB~~~PTS system oligo-beta-mannoside-specific EIIB component~~~COG1440
MKKILLACSSGMSTSLLVTKMKEYAQSIGEEAEIWAVGQDKAKEDMRKADAVLIGPQMSFLKSELQKEADQYNIQVEVID
MMAYGMADGKKAYEQALSLMVNQ
>Q45399 2.7.1.205~~~celA~~~PTS system cellobiose-specific EIIB component~~~
MNILLICAAGMSTSLLVTKMKEAAKQKGIEANIWAVSADEAKSHLDQADVVLIGPQIRYKLAAFKKEGEARGIPVDVINP
ADYGRVNGAGVLDFALRLKK
>O05507 ~~~gmuC~~~PTS system oligo-beta-mannoside-specific EIIC component~~~COG1455
MFEKISQFLVPIAGRLNNNRYLQVLRDAFMLAFPLTIFGSIFVVLTNLPFLNKIMNASMLTSFQSHFGIASTATMGIMSV
FVVFGIGYYLSKSYQVEAVFGGAIALVSFLLLTPFIIQPETGDAITGVIPVDRLGAKGMFLGMITAFLSGEIYRRIVQKN
LTIKMPAGVPPAVAKSFAALIPAFITLTVFLLINVMVTLFFKTNMHDVIYHAIQAPLVGLGSGIIPTLIAVFFIQILWFF
GLHGQIIINSVMDPIWNTLQVENLSAYTAGKEIPHIISKPFMEIYTVGMGGTGMTLAIVFTILIFMKSRQMKQVSKLGLA
PGIFNVNEPIIFGLPIVMNPIIIVPWVLAPMVVTLVTYLAMSAGLVPPPTGVTVPWTVPLFINGIMATNSIMGGVMQLIN
LLIVFVIWFPFLKAMDKLNLAKEKEQAVQETAAQQNDNSIKM
>P71012 ~~~fruA~~~PTS system fructose-specific EIIABC component~~~COG1299
MKITELLTKHTIKLNIESKEKENVIDEMVTVLDKAGKLNDRQAYKEAILNRESQSSTGIGEGIAIPHAKTASVINPAIAF
GRSKDGVDYESLDGQPAHLVFMIAATEGANNTHLEALSRLSTLLMREEIRKQLLEAESEDAIIDIINQHDKDDDEEEEEE
EAAPAPAGKGKILAVTACPTGIAHTFMAADALKEKAKELGVEIKVETNGSSGIKHKLTAQEIEDAPAIIVAADKQVEMER
FKGKRVLQVPVTAGIRRPQELIEKAMNQDAPIYQGSGGGSAASNDDEEAKGKSGSGIGNTFYKHLMSGVSNMLPFVVGGG
ILVAISFFWGIHSADPNDPSYNTFAAALNFIGGDNALKLIVAVLAGFIAMSIADRPGFAPGMVGGFMATQANAGFLGGLI
AGFLAGYVVILLKKVFTFIPQSLDGLKPVLIYPLFGIFITGVLMQFVVNTPVAAFMNFLTNWLESLGTGNLVLMGIILGG
MMAIDMGGPLNKAAFTFGIAMIDAGNYAPHAAIMAGGMVPPLGIALATTIFRNKFTQRDREAGITCYFMGAAFVTEGAIP
FAAADPLRVIPAAVVGAAVAGGLTEFFRVTLPAPHGGVFVAFITNHPMLYLLSIVIGAVVMAIILGIVKKPVTEK
>P69811 ~~~fruB~~~Multiphosphoryl transfer protein~~~COG1925
MFQLSVQDIHPGEKAGDKEEAIRQVAAALVQAGNVAEGYVNGMLAREQQTSTFLGNGIAIPHGTTDTRDQVLKTGVQVFQ
FPEGVTWGDGQVAYVAIGIAASSDEHLGLLRQLTHVLSDDSVAEQLKSATTAEELRALLMGEKQSEQLKLDNEMLTLDIV
ASDLLTLQALNAARLKEAGAVDATFVTKAINEQPLNLGQGIWLSDSAEGNLRSAIAVSRAANAFDVDGETAAMLVSVAMN
DDQPIAVLKRLADLLLDNKADRLLKADAATLLALLTSDDAPTDDVLSAEFVVRNEHGLHARPGTMLVNTIKQFNSDITVT
NLDGTGKPANGRSLMKVVALGVKKGHRLRFTAQGADAEQALKAIGDAIAAGLGEGA
>P17127 ~~~fruB~~~Multiphosphoryl transfer protein~~~
MFQLSVQDIHPGEQAGNKEEAIRQIAAALAQAGNVAGGYVDGMLAREQQTSTFLGNGIAIPHGTTDTRDQVLKTGVQVFQ
FPQGVTWGEGQVAYVAIGIAASSDEHLGLLRQLTHVLSDDSVAEQLKSATTAEELRALLMGEKQSEQLKLDNETMTLDVI
ASSLVTLQALNAARLKEAGAVDAAFVAKTINDSPMNLGQGIWLNDSAEGNLRSAVAVSRATQAFDVEGEKAALLVTVAMN
DEQPIAVLKRLGDLLLNNKADRLLSADAATLLALLTSDDALTDDVLSAEFVVRNEHGLHARPGTMLVNTIKQFNSEITVT
NLDGTGKPANGRSLMKVVALGVKKGHRLRFTAQGEDAEQALKAIGDAIAAGLGEGA
>Q9KM70 ~~~fruB~~~Multiphosphoryl transfer protein~~~COG1925
MLELTTQDIQLQQHFANKQAAIQGLAHALTAKGLVAEGYAQGMLNREAQHSTYLGNGIAIPHGTTDTRELVKQTGVTAMH
FPQGLDWGDGNLVYVAIGIAAKSDEHLGILKQLTRVLSADGVEQALQQAKTAQQIIAIIKGEAQLTADFDASLIQLQFPA
SDMVQMSAVAGGLLKNTGCAENEFVADLVTKAPTHLGRGLWLVASDRAVKRTGMSIVTTANHCEYEQQAVKALIAFSVCN
DVHQPLLNTITQCVFEQKQDQLLQADVQQLLNLFSGNAEQTIAQRTIAVGTITEETIAAETVAEPDSARAHTATFRIKNS
HGLHARPGAMLVAEAKKFESNIRVSNLDGDGQVVNAKSLMKVIALGVKHNHQLQFTAEGPDAEAALQALGVAINAGLGEG
>P26379 ~~~levD~~~PTS system fructose-specific EIIA component~~~COG2893
MISVIISGHGDFPIALKESSGMIFGEENNLIAVPFFKGEGIQTLQEKYHQALKDIPEEHEVLFLVDIFGGTPYNAAASFI
AEDQRMDMAAGVNLPILLEVLSLREHLALKDLLNNLKAMSQQSFQVCSEHLEKVKTANQDTREDEL
>P69808 2.7.1.202~~~fryB~~~PTS system fructose-like EIIB component 1~~~COG1445
MSKKLIALCACPMGLAHTFMAAQALEEAAVEAGYEVKIETQGADGIQNRLTAQDIAEATIIIHSVAVTPEDNERFESRDV
YEITLQDAIKNAAGIIKEIEEMIASEQQ
>P32676 2.7.1.202~~~frwD~~~PTS system fructose-like EIIB component 3~~~COG1445
MAYLVAVTACVSGVAHTYMAAERLEKLCLLEKWGVSIETQGALGTENRLADEDIRRADVALLITDIELAGAERFEHCRYV
QCSIYAFLREPQRVMSAVRKVLSAPQQTHLILE
>P20966 ~~~fruA~~~PTS system fructose-specific EIIB'BC component~~~COG1299
MKTLLIIDANLGQARAYMAKTLLGAAARKAKLEIIDNPNDAEMAIVLGDSIPNDSALNGKNVWLGDISRAVAHPELFLSE
AKGHAKPYTAPVAATAPVAASGPKRVVAVTACPTGVAHTFMAAEAIETEAKKRGWWVKVETRGSVGAGNAITPEEVAAAD
LVIVAADIEVDLAKFAGKPMYRTSTGLALKKTAQELDKAVAEATPYEPAGKAQTATTESKKESAGAYRHLLTGVSYMLPM
VVAGGLCIALSFAFGIEAFKEPGTLAAALMQIGGGSAFALMVPVLAGYIAFSIADRPGLTPGLIGGMLAVSTGSGFIGGI
IAGFLAGYIAKLISTQLKLPQSMEALKPILIIPLISSLVVGLAMIYLIGKPVAGILEGLTHWLQTMGTANAVLLGAILGG
MMCTDMGGPVNKAAYAFGVGLLSTQTYGPMAAIMAAGMVPPLAMGLATMVARRKFDKAQQEGGKAALVLGLCFISEGAIP
FAARDPMRVLPCCIVGGALTGAISMAIGAKLMAPHGGLFVLLIPGAITPVLGYLVAIIAGTLVAGLAYAFLKRPEVDAVA
KAA
>Q9KM72 ~~~fruA~~~PTS system fructose-specific EIIB'BC component~~~COG1299
MKMKIAIVTACPSGVANSIIAAGLLQQASKTLGWEAYIECHSTVIAGHTLSEEEINKADLVILAANGKIDMQRFVGKKVY
QSPITACTSDPVGYLKQAAEQATELSSEQATRCDSPATASVSAKKIVAITACPTGVAHTFMAAEALEAEATRQGHQIKVE
TRGSVGAKNQLTEQEIAAADLVIIAADIDVPLDRFNGKKLYKTSTGLTLKKTAQELSNAFAQAKTFSSSANSATNEKAEE
KKGVYKHLMTGVSHMLPVVVAGGLIIALSFVFGIEAFKEEGTLAAALMQIGGGSAFALMIPVLAGYIAFSIADRPGLAPG
LIGGMLASSTGAGFLGGIVAGFLAGYSAKFIADKVQLPQSMAALKPILIIPFIASLFTGLVMIYVVGGPMSSIMSGMTSF
LNNMGSTNAILLGIVLGAMMCFDLGGPVNKAAYTFGVGLLASQTYAPMAAIMAAGMVPALGMGLATFIAKDKFEAGEREA
GKASFVLGLCFISEGAIPFAAKDPMRVIPACMVGGAVTGALSMLFGAKLMAPHGGLFVLLIPNAISPVLLYLVAIAVGTA
ITGFGYAMLKKSAQAKAVAA
>P23355 ~~~fruA~~~PTS system fructose-specific EIIB'BC component~~~COG1299
MSSSIVVIAAGERSTEAVLAAEALRRAATAAGRSVTIEIRSDQGVLGALPTELTNGAAHVLIVGDADADTARFGDAQLLH
LSLGAVLDDPAAAVSQLAATTAPASTSATTDASGAGGKRIVAITSCPTGIAHTFMAAEGLQQAAKKLGYQMRVETQGSVG
AQDALTDEEIRAADVVIIAADREVDLARFGGKRLFKSGTKPAINDGPALIQKALAEAGVHGGAAPVAGANATSDAKGNAR
TGAYKHLMTGVSFMLPFVTAGGLLIALAFALGGIYAGDDAHQGTLAWSLFQIGAKAGFTLMVPALAGYIAYSIADRPGIA
PGMIGGLVAANLNAGFLGGIIAGFIAGYGVAALNRYIKLPRNLEGLKPVLILPVLGTLLVGLAMMYVFGQPVADLLAWLT
AWLRGMQGSSALLLGLLLGGMMAFDMGGPVNKAAYAFSTGLIASQVYTPMAAAMVAGMTPPLGIALATWVFRNRFTVEER
GSATAAGVLGLAFVTEGAIPYAARDPLRTIPALVIGSAVAGAISMTAGAELKAPHGGIFVLLIPNAVTHLLNYVLALVVG
VVVTAVALRLLKKPVADVIA
>P26380 2.7.1.202~~~levE~~~PTS system fructose-specific EIIB component~~~COG3444
MMNIVLARIDDRFIHGQILTRWIKVHAADRIIVVSDDIAQDEMRKTLILSVAPSNVKASAVSVSKMAKAFHSPRYEGVTA
MLLFENPSDIVSLIEAGVPIKTVNVGGMRFENHRRQITKSVSVTEQDIKAFETLSDKGVKLELRQLPSDASEDFVQILRN
VTK
>P77579 ~~~fryC~~~Fructose-like permease IIC component 1~~~COG1299
MAIKKRSATVVPGASGAAAAVKNPQASKTSFWGELPQHVMSGISRMVPTLIMGGVILAFSQLIAYSWLKIPAEIGIMDAL
NSGKFSGFDLSLLKFAWLSQSFGGVLFGFAIPMFAAFVANSIGGKLAFPAGFIGGLMSTQPTQLLNFDPSTMQWATSSPV
PSTFIGALIISIVAGYLVKWMNQKIQLPDFLLAFKTTFLLPILSAIFVMLAMYYVITPFGGWINGGIRTVLTAAGEKGAL
MYAMGIAAATAIDLGGPINKAAGFVAFSFTTDHVLPVTARSIAIVIPPIGLGLATIIDRRLTGKRLFNAQLYPQGKTAMF
LAFMGISEGAIPFALESPITAIPSYMVGAIVGSTAAVWLGAVQWFPESAIWAWPLVTNLGVYMAGIALGAVITALMVVFL
RLMMFRKGKLLIDSL
>P77439 ~~~fryA~~~Multiphosphoryl transfer protein 1~~~COG1080
MLTIQFLCPLPNGLHARPAWELKEQCSQWQSEITFINHRQNAKADAKSSLALIGTGTLFNDSCSLNISGSDEEQARRVLE
EYIQVRFIDSDSVQPTQAELTAHPLPRSLSRLNPDLLYGNVLASGVGVGTLTLLQSDSLDSYRAIPASAQDSTRLEHSLA
TLAEQLNQQLRERDGESKTILSAHLSLIQDDEFAGNIRRLMTEQHQGLGAAIISNMEQVCAKLSASASDYLRERVSDIRD
ISEQLLHITWPELKPRNKLVLEKPTILVAEDLTPSQFLSLDLKNLAGMILEKTGRTSHTLILARASAIPVLSGLPLDAIA
RYAGQPAVLDAQCGVLAINPNDAVSGYYQVAQTLADKRQKQQAQAAAQLAYSRDNKRIDIAANIGTALEAPGAFANGAEG
VGLFRTEMLYMDRDSAPDEQEQFEAYQQVLLAAGDKPIIFRTMDIGGDKSIPYLNIPQEENPFLGYRAVRIYPEFAGLFR
TQLRAILRAASFGNAQLMIPMVHSLDQILWVKGEIQKAIVELKRDGLRHAETITLGIMVEVPSVCYIIDHFCDEVDFFSI
GSNDMTQYLYAVDRNNPRVSPLYNPITPSFLRMLQQIVTTAHQRGKWVGICGELGGESRYLPLLLGLGLDELSMSSPRIP
AVKSQLRQLDSEACRELARQACECRSAQEIEALLTAFTPEEDVRPLLALENIFVDQDFSNKEQAIQFLCGNLGVNGRTEH
PFELEEDVWQREEIVTTGVGFGVAIPHTKSQWIRHSSISIARLAKPIGWQSEMGEVELVIMLTLGANEGMNHVKVFSQLA
RKLVNKNFRQSLFAAQDAQSILTLLETELTF
>P20166 2.7.1.199~~~ptsG~~~PTS system glucose-specific EIICBA component~~~COG1263
MFKALFGVLQKIGRALMLPVAILPAAGILLAIGNAMQNKDMIQVLHFLSNDNVQLVAGVMESAGQIVFDNLPLLFAVGVA
IGLANGDGVAGIAAIIGYLVMNVSMSAVLLANGTIPSDSVERAKFFTENHPAYVNMLGIPTLATGVFGGIIVGVLAALLF
NRFYTIELPQYLGFFAGKRFVPIVTSISALILGLIMLVIWPPIQHGLNAFSTGLVEANPTLAAFIFGVIERSLIPFGLHH
IFYSPFWYEFFSYKSAAGEIIRGDQRIFMAQIKDGVQLTAGTFMTGKYPFMMFGLPAAALAIYHEAKPQNKKLVAGIMGS
AALTSFLTGITEPLEFSFLFVAPVLFAIHCLFAGLSFMVMQLLNVKIGMTFSGGLIDYFLFGILPNRTAWWLVIPVGLGL
AVIYYFGFRFAIRKFNLKTPGREDAAEETAAPGKTGEAGDLPYEILQAMGDQENIKHLDACITRLRVTVNDQKKVDKDRL
KQLGASGVLEVGNNIQAIFGPRSDGLKTQMQDIIAGRKPRPEPKTSAQEEVGQQVEEVIAEPLQNEIGEEVFVSPITGEI
HPITDVPDQVFSGKMMGDGFAILPSEGIVVSPVRGKILNVFPTKHAIGLQSDGGREILIHFGIDTVSLKGEGFTSFVSEG
DRVEPGQKLLEVDLDAVKPNVPSLMTPIVFTNLAEGETVSIKASGSVNREQEDIVKIEK
>Q7A807 2.7.1.199~~~ptsG~~~PTS system glucose-specific EIICBA component~~~
MRKKLFGQLQRIGKALMLPVAILPAAGLLLAIGTAIQGEALQHYLPFIQNGGVQNVAKLMTAAGSIIFENLPMIFALGVA
IGLAGGDGVAAIAAFVGYIIMNKTMGDFLQVTPKNVTDPASGYASILGIPTLQTGVFGGIIIGALAAWCYNKFYNINLPS
YLGFFAGKRFVPIMMATTSFILAFPMALIWPTIQSGLNAFSTGLLDSNTGVAVFLFGFIKRLLIPFGLHHIFHAPFWFEF
GSWKNAAGEIIHGDQRIFIEQIREGAHLTAGKFMQGEFPVMMFGLPAAALAIYHTAKPENKKVVAGLMGSAALTSFLTGI
TEPLEFSFLFVAPLLFFIHAVLDGLSFLTLYLLDVHLGYTFSGGFIDYVLLGVLPNKTQWWLVIPVGLVYAVIYYFVFRF
LIVKLKYKTPGREDKQSQAVTASATELPYAVLEAMGGKANIKHLDACITRLRVEVNDKSKVDVPGLKDLGASGVLEVGNN
MQAIFGPKSDQIKHEMQQIMNGQVVENPTTMEDDKDETVVVAEDKSATSELSHIVHAPLTGEVTPLSEVPDQVFSEKMMG
DGIAIKPSQGEVRAPFNGKIQMIFPTKHAIGLVSDSGLELLIHIGLDTVKLNGEGFTLHVEEGQEVKQGDLLINFDLDYI
RNHAKSDITPIIVTQGNITNLDFKQGEHGNISFGDQLFEAK
>Q57071 2.7.1.199~~~ptsG~~~PTS system glucose-specific EIICBA component~~~COG1263
MWKKFFGQLQRIGKALMLPVAILPAAGLLLALGNAFQGDALQSLMPFIKAEGFQNVAKMMEGAGGIIFDNLAIIFALGVA
IGLASGDGVAAIAAFVGFIVLNKTMGMFLGVTPEKAADAATGFANVLGIPTLQTGVFGGIIIGALAAWCYNKFYNISLPS
YLGFFAGKRFVPIMMATCSFILAFPMAIIWPSIQGGLNAFSEGLLASNTGLAVFLFGFIKRLLIPFGLHHIFHAPFWFEF
GSYKNAAGQIIHGDQRIFIEQIRDNVPLTAGKFMQGEFPVMMFGLPAAALAIYQTAKKENKKVVAGLMLSGALTSFLTGI
TEPLEFSFLFVAPLLFFIHAVLDGLSFLILYLLDLHLGYTFSGGFIDFFLLGILPNKTQWWLVIPVGLVYAAIYYIIFRF
LIVKFNFKTPGREDKEVKSSNVAASELPFKVLDAMGGKANIKHLDACITRLRVEVNDKAKVDVQELKDLGASGVLEVGNN
MQAIFGPKSDQIKHDMQQIMDGKITSPEETTVTEEGDKETAEIAAAGGGVVYAPIKGEVVDISEVPDKVFSEKMMGDGIA
IKPETGEVVAPFDGVVKMVFPTKHAIGLESKDGIELLIHFGLETVKLEGKGFDILVKENDNIVLGQPLMKVDLDYIKEHA
DSTITPIVVTNLNGRTMEVLQHGEVKQGDKVILVK
>P69783 ~~~crr~~~PTS system glucose-specific EIIA component~~~COG2190
MGLFDKLKSLVSDDKKDTGTIEIIAPLSGEIVNIEDVPDVVFAEKIVGDGIAIKPTGNKMVAPVDGTIGKIFETNHAFSI
ESDSGVELFVHFGIDTVELKGEGFKRIAEEGQRVKVGDTVIEFDLPLLEEKAKSTLTPVVISNMDEIKELIKLSGSVTVG
ETPVIRIKK
>P45338 ~~~crr~~~PTS system glucose-specific EIIA component~~~COG2190
MGLFDKLFGSKENKSVEVEIYARISGEIVNIEDVPDVVFSEKIVGDGVAVRPIGNKIVAPVDGVIGKIFETNHAFSMESK
EGVELFVHFGIDTVELKGEGFTRIAQEGQSVKRGDTVIEFDLALLESKAKSVLTPIVISNMDEISCIVKKSGEVVAGESV
VLALKK
>P45618 ~~~crr~~~PTS system glucose-specific EIIA component~~~
MWFFNKNLKVLAPCDGTIITLDEVEDEVFKERMLGDGFAINPKSNDFHAPVSGKLVTAFPTKHAFGIQTKSGVEILLHIG
LDTVSLDGNGFESFVTQDQEVNAGDKLVTVDLKSVAKKVPSIKSPIIFTNNGGKTLEIVKMGEVKQGDVVAILK
>P0A284 ~~~crr~~~PTS system glucose-specific EIIA component~~~COG2190
MGLFDKLKSLVSDDKKDTGTIEIVAPLSGEIVNIEDVPDVVFAEKIVGDGIAIKPTGNKMVAPVDGTIGKIFETNHAFSI
ESDSGIELFVHFGIDTVELKGEGFKRIAEEGQRVKVGDPVIEFDLPLLEEKAKSTLTPVVISNMDEIKELIKLSGSVTVG
ETPVIRIKK
>P0A283 ~~~crr~~~PTS system glucose-specific EIIA component~~~
MGLFDKLKSLVSDDKKDTGTIEIVAPLSGEIVNIEDVPDVVFAEKIVGDGIAIKPTGNKMVAPVDGTIGKIFETNHAFSI
ESDSGIELFVHFGIDTVELKGEGFKRIAEEGQRVKVGDPVIEFDLPLLEEKAKSTLTPVVISNMDEIKELIKLSGSVTVG
ETPVIRIKK
>P60857 ~~~crr~~~PTS system glucose-specific EIIA component~~~
MFKKLFGKGKEVQKDIAIYAPLTGEYVKIEDIPDPVFAQKMMGEGFGINPTEGEVVSPIAGRVDNVFPTKHAIGLKADNG
LELLVHIGLDTVQLDGEGFEVLVSSGDEVNVGDPLVRFNLEFINNNAKSVISPIIITNTDQAASINIYDENAVIKGETKV
IDVTMN
>P69786 ~~~ptsG~~~PTS system glucose-specific EIICB component~~~COG1263
MFKNAFANLQKVGKSLMLPVSVLPIAGILLGVGSANFSWLPAVVSHVMAEAGGSVFANMPLIFAIGVALGFTNNDGVSAL
AAVVAYGIMVKTMAVVAPLVLHLPAEEIASKHLADTGVLGGIISGAIAAYMFNRFYRIKLPEYLGFFAGKRFVPIISGLA
AIFTGVVLSFIWPPIGSAIQTFSQWAAYQNPVVAFGIYGFIERCLVPFGLHHIWNVPFQMQIGEYTNAAGQVFHGDIPRY
MAGDPTAGKLSGGFLFKMYGLPAAAIAIWHSAKPENRAKVGGIMISAALTSFLTGITEPIEFSFMFVAPILYIIHAILAG
LAFPICILLGMRDGTSFSHGLIDFIVLSGNSSKLWLFPIVGIGYAIVYYTIFRVLIKALDLKTPGREDATEDAKATGTSE
MAPALVAAFGGKENITNLDACITRLRVSVADVSKVDQAGLKKLGAAGVVVAGSGVQAIFGTKSDNLKTEMDEYIRNH
>P37439 ~~~ptsG~~~PTS system glucose-specific EIICB component~~~
MFKNAFANLQKVGKSLMLPVSVLPIAGILLGVGSANFSWLPAVVSHVMAEAGGSVFANMPLIFAIGVALGFTNNDGVSAL
AAVVAYGIMVKTMAVVAPLVLHLPAEEIAAKHLADTGVLGGIISGAIAAYMFNRFYRIKLPEYLGFFAGKRFVPIISGLA
AIFTGVVLSFVWPPIGTAIQAFSQWAAYQNPVVAFGIYGFIERCLVPFGLHHIWNVPFQMQIGEYTNAAGQVFHGDIPRY
MAGDPTAGMLSGGFLFKMYGLPAAAIAIWHSAKPENRAKVGGIMISAALTSFLTGITEPIEFSFMFVAPILYIIHAILAG
LAFPICILLGMRDGTSFSHGLIDFIVLSGNSSKLWLFPIVGAGYAIVYYTVFRVLIKALDLKTPGREDTTDDAKAGATSE
MAPALVAAFGGKENITNLDACITRLRVSVADVAKVDQAGLKKLGAAGVVVAGSGVQAIFGTKSDNLKTEMDEYIRNS
>P05706 ~~~srlB~~~PTS system glucitol/sorbitol-specific EIIA component~~~COG3731
MTVIYQTTITRIGASAIDALSDQMLITFREGAPADLEEYCFIHCHGELKGALHPGLQFSLGQHRYPVTAVGSVAEDNLRE
LGHVTLRFDGLNEAEFPGTVHVAGPVPDDIAPGSVLKFESVKE
>O32333 2.7.1.198~~~srlE~~~PTS system glucitol/sorbitol-specific EIIB component~~~COG3732
MEKYNAIKIVKGSGGFGGPLTVKPEEGKDTLLYITGGGAEPEIVEKIVNLTGCKAVNGFKTSVPEEQIFLVIIDCGGTLR
CGIYPQKRIPTINVMPVGKSGPLAKFITEDIYVSAVGLNQISLADSSAEPIKSTKVPEEGKREFKYSADKKVSQSLAENS
KSSIVQKIGMGAGKVVNTLYQAGRDAVQSMITTILPFMAFVAMLIGIIQGSGFGNWFAKILVPLAGNGIGLMILGFICSI
PLLSALLGPGAVIAQIVGTLIGVEIGKGTIPPSLALPALFAINTQCACDFIPVGLGLAEAEPETVEVGVPSVLYSRFMIG
VPRVAVAWVASIGLYQ
>P56580 2.7.1.198~~~srlE~~~PTS system glucitol/sorbitol-specific EIIB component~~~COG3732
MTHIRIEKGTGGWGGPLELKATPGKKIVYITAGTRPAIVDKLAQLTGWQAIDGFKEGEPAEAEIGVAVIDCGGTLRCGIY
PKRRIPTINIHSTGKSGPLAQYIVEDIYVSGVKEENITVVGDATPQPSSVGRDYDTSKKITEQSDGLLAKVGMGMGSTVA
VLFQSGRDTIDTVLKTILPFMAFVSALIGIIMASGLGDWIAHGLAPLASHPLGLVMLALICSFPLLSPFLGPGAVIAQVI
GVLIGVQIGLGNIPPHLALPALFAINAQAACDFIPVGLSLAEARQDTVRVGVPSVLVSRFLTGAPTVLIAWFVSGFIYQ
>O32522 2.7.1.198~~~srlE~~~PTS system glucitol/sorbitol-specific EIIB component~~~
MANTIEIRKGESGWGGPLSINVTAGKKIVYITAGTKPAIVDHLVALTGWEAVDGFKQGEPPAEEIGVAVIDCGGTLRCGL
YPKRRIPTINIHATGKSGPLAQFITEDIYVSGVRVADIRVANDAEAAPPEVAVADVAVNAGKGTGRDYDTSKKITEQSDG
LLAKVGMGMGSAVAILFQSGRETIDTVLKTILPFMAFVSALIGIIMASGLGDFIAHGLTPLANSPVGLVTLALICSFPLL
SPFLGPGAVIAQVIGVLVGVQIGQGTIPPHLALPALFAINAQAACDFIPVGLSLANARQETVRVGVPAVLVGRFITGAPT
VLLAWAASSFIYH
>O32332 ~~~srlA~~~PTS system glucitol/sorbitol-specific EIIC component~~~COG3730
MDAIVYFAKGFMYLFEVGGNTFVSWVTGIIPKVLLLLVFMNSIIAFIGQDKVDRFAKFASRNVILAYGVLPFLSAFMLGN
PMALSMGKFLPERMKPSYYASASYHCHTNSGIFPHINVGEIFIYLGIANGITTLGLDPTALGLRYLLVGLVMNFFAGWVT
DFTTKIVMRQQGIELSNQLKAN
>P56579 ~~~srlA~~~PTS system glucitol/sorbitol-specific EIIC component~~~COG3730
MIETITHGAEWFIGLFQKGGEVFTGMVTGILPLLISLLVIMNALINFIGQHRIERFAQRCAGNPVSRYLLLPCIGTFVFC
NPMTLSLGRFMPEKYKPSYYAAASYSCHSMNGLFPHINPGELFVYLGIASGLTTLNLPLGPLAVSYLLVGLVTNFFRGWV
TDLTTAIFEKKMGIQLEQKVHLAGATS
>O32521 ~~~srlA~~~PTS system glucitol/sorbitol-specific EIIC component~~~
MIEAITHGAEWFIGLFQKGGEVFVGMVTGILPLLISLLVIMNALIVFVGQRRIEKLAQKCAGNPVTRYLVLPFIGTFVFC
NPMTHSLGKFLPEKYKPSYYAAASYSCHSMNGLFPHINPGELFVYLGIANGLTTLGVPLGPLAVSYLLVGLITNFFRGWV
TDLTTSVFEKKMGIKLDKSVHL
>P08877 ~~~ptsH~~~Phosphocarrier protein HPr~~~COG1925
MAQKTFKVTADSGIHARPATVLVQTASKYDADVNLEYNGKTVNLKSIMGVMSLGIAKGAEITISASGADENDALNALEET
MKSEGLGE
>Q9F166 ~~~ptsH~~~Phosphocarrier protein HPr~~~
MEKIFKVTSDSGIHARPATLLVNTASKFGSDINLEYNGKNVNLKSIMGVMSLGIQQNAEIKITANGDDAAQALAAIEETM
KNEGLGE
>P0AA04 ~~~ptsH~~~Phosphocarrier protein HPr~~~COG1925
MFQQEVTITAPNGLHTRPAAQFVKEAKGFTSEITVTSNGKSASAKSLFKLQTLGLTQGTVVTISAEGEDEQKAVEHLVKL
MAELE
>P07515 ~~~ptsH~~~Phosphocarrier protein HPr~~~COG1925
MEKKEFHIVAETGIHARPATLLVQTASKFNSDINLEYKGKSVNLKSIMGVMSLGVGQGSDVTITVDGADEAEGMAAIVET
LQKEGLAE
>P42013 ~~~ptsH~~~Phosphocarrier protein HPr~~~
MAEKTFKVVSDSGIHARPATILVQTASKFNSEIQLEYNGKTVNLKSIMGVMSLGIPKGATIKITAEGADAAEAMAALTDT
LAKEGLAE
>Q9KJV3 ~~~ptsH~~~Phosphocarrier protein HPr~~~COG1925
MEKREFNIIAETGIHARPATLLVQAASKFNSDINLEYKGKSVNLKSIMGVMSLGVGQGADVTISAEGADEADAIAAITDT
MKKEGLAE
>Q9CJ83 ~~~ptsH~~~Phosphocarrier protein HPr~~~COG1925
MASKEFHIVVETGIHARPATLLVHTASKFTSEITLEYKGKSVNLKSIMGVMSLGVGQGADVTISAEGADADDAISTIAET
MTKEGLAE
>Q84F84 ~~~ptsH~~~Phosphocarrier protein HPr~~~
MKTQQFTVIDPLGIHARPASQLVAKATPFASAIEVRTEEKAANLKSILGVMGLALKQGSQFTLYVEGEDEDQAFEALATL
LTEMGLAQ
>P45611 ~~~ptsH~~~Phosphocarrier protein HPr~~~
MAKFSAIITDKVGLHARPASVLAKEASKFSSNITIIANEKQGNLKSIMNVMAMAIKTGTEITIQADGNDADQAIQAIKQT
MIDTALIQG
>P75061 ~~~ptsH~~~Phosphocarrier protein HPr~~~
MKKIQVVVKDPVGIHARPASIIAGEANKFKSELKLVSPSGVEGNIKSIINLMSLGIKQNDHITIKAEGTDEEEALNAIKA
VLEKHQVI
>O69250 ~~~ptsH~~~Phosphocarrier protein HPr~~~
MAQKTFTVTADSGIHARPATTLVQAASKFDSDINLEFNGKTVNLKSIMGVMSLGIQKGATITISAEGSDEADALAALEDT
MSKEGLGE
>P0AA07 ~~~ptsH~~~Phosphocarrier protein HPr~~~
MFQQEVTITAPNGLHTRPAAQFVKEAKGFTSEITVTSNGKSASAKSLFKLQTLGLTQGTVVTISAEGEDEQKAVEHLVKL
MAELE
>P99143 ~~~ptsH~~~Phosphocarrier protein HPr~~~
MEQNSYVIIDETGIHARPATMLVQTASKFDSDIQLEYNGKKVNLKSIMGVMSLGVGKDAEITIYADGSDESDAIQAISDV
LSKEGLTK
>P0A0E3 ~~~ptsH~~~Phosphocarrier protein HPr~~~
MEQNSYVIIDETGIHARPATMLVQTASKFDSDIQLEYNGKKVNLKSIMGVMSLGVGKDAEITIYADGSDESDAIQAISDV
LSKEGLTK
>P23534 ~~~ptsH~~~Phosphocarrier protein HPr~~~
MEQQSYTIIDETGIHARPATMLVQTASKFDSDIQLEYNGKKVNLKSIMGVMSLGVGKDAEITIYADGSDEADAIQAITDV
LSKEGLTE
>Q9EYQ9 ~~~ptsH~~~Phosphocarrier protein HPr~~~COG1925
MEQKSYVIIDETGIHARPATMLVQTASKFDSDIQLEYNGKKVNLKSIMGVMSLGVGKDAEITIYADGSDETDAIEAITDI
LSKEGLTK
>O50515 ~~~ptsH~~~Phosphocarrier protein HPr~~~COG1925
MAERRVNVGWAEGLHARPASIFVRAATATGVPVTIAKADGSPVNAASMLAVLGLGAQGGEEIVLASDAEGAEAALERLAK
LVAEGLEELPETV
>Q9WXK8 ~~~ptsH~~~Phosphocarrier protein HPr~~~
MASKDFHIVAETGIHARPATLLVQTASKFASDITLDYKGKAVNLKSIMGVMSLGVGQGADVTISAEGADADDALAAIEET
MTKEGLA
>P45596 ~~~ptsH~~~Phosphocarrier protein HPr~~~COG1925
MASKDFHIVAETGIHARPATLLVQTASKFASDITLDYKGKAVNLKSIMGVMSLGVGQGADVTITAEGADADDAIAAINET
MTKEGLA
>P24366 ~~~ptsH~~~Phosphocarrier protein HPr~~~
MASKDFHIVAETGIHARPATLLVQTASKFASDITLDYKGKAVNLKSIMGVMSLGVGQGADVTISAEGADADDAIVAIAET
MTKEGLA
>B2SU53 ~~~pthXo1~~~TAL effector protein PthXo1~~~COG2201
MDPIRSRTPSPARELLPGPQPDRVQPTADRGGAPPAGGPLDGLPARRTMSRTRLPSPPAPSPAFSAGSFSDLLRQFDPSL
LDTSLLDSMPAVGTPHTAAAPAECDEVQSGLRAADDPPPTVRVAVTAARPPRAKPAPRRRAAQPSDASPAAQVDLRTLGY
SQQQQEKIKPKVGSTVAQHHEALVGHGFTHAHIVALSRHPAALGTVAVKYQDMIAALPEATHEDIVGVGKQWSGARALEA
LLTVAGELRGPPLQLDTGQLVKIAKRGGVTAVEAVHASRNALTGAPLNLTPAQVVAIASNNGGKQALETVQRLLPVLCQA
HGLTPAQVVAIASHDGGKQALETMQRLLPVLCQAHGLPPDQVVAIASNIGGKQALETVQRLLPVLCQAHGLTPDQVVAIA
SHGGGKQALETVQRLLPVLCQAHGLTPDQVVAIASHDGGKQALETVQRLLPVLCQAHGLTPDQVVAIASNGGGKQALETV
QRLLPVLCQAHGLTPDQVVAIASNGGKQALETVQRLLPVLCQAHGLTPDQVVAIASHDGGKQALETVQRLLPVLCQTHGL
TPAQVVAIASHDGGKQALETVQQLLPVLCQAHGLTPDQVVAIASNIGGKQALATVQRLLPVLCQAHGLTPDQVVAIASNG
GGKQALETVQRLLPVLCQAHGLTPDQVVAIASNGGGKQALETVQRLLPVLCQAHGLTQVQVVAIASNIGGKQALETVQRL
LPVLCQAHGLTPAQVVAIASHDGGKQALETVQRLLPVLCQAHGLTPDQVVAIASNGGGKQALETVQRLLPVLCQAHGLTQ
EQVVAIASNNGGKQALETVQRLLPVLCQAHGLTPDQVVAIASNGGGKQALETVQRLLPVLCQAHGLTPAQVVAIASNIGG
KQALETVQRLLPVLCQDHGLTLAQVVAIASNIGGKQALETVQRLLPVLCQAHGLTQDQVVAIASNIGGKQALETVQRLLP
VLCQDHGLTPDQVVAIASNIGGKQALETVQRLLPVLCQDHGLTLDQVVAIASNGGKQALETVQRLLPVLCQDHGLTPDQV
VAIASNSGGKQALETVQRLLPVLCQDHGLTPNQVVAIASNGGKQALESIVAQLSRPDPALAALTNDHLVALACLGGRPAM
DAVKKGLPHAPELIRRVNRRIGERTSHRVADYAQVVRVLEFFQCHSHPAYAFDEAMTQFGMSRNGLVQLFRRVGVTELEA
RGGTLPPASQRWDRILQASGMKRAKPSPTSAQTPDQASLHAFADSLERDLDAPSPMHEGDQTGASSRKRSRSDRAVTGPS
AQHSFEVRVPEQRDALHLPLSWRVKRPRTRIGGGLPDPGTPIAADLAASSTVMWEQDAAPFAGAADDFPAFNEEELAWLM
ELLPQSGSVGGTI
>Q2T1B9 3.1.1.29~~~pth~~~Peptidyl-tRNA hydrolase~~~
MIKLIVGLGNPGAEYTATRHNAGFWLVDQLAREAGATLRDERRFHGFYAKARLYGEEVHLLEPQTYMNRSGQSVVALAHF
FKILPNEILVAHDELDLPPGAVKLKLGGGSGGHNGLKDISAHLSSQQYWRLRIGIGHPRDMIPESARAGAKPDVANFVLK
PPRKEEQDVIDAAIERALAVMPAVVKGETERAMMQLHRNGA
>P0A7D1 3.1.1.29~~~pth~~~Peptidyl-tRNA hydrolase~~~COG0193
MTIKLIVGLANPGAEYAATRHNAGAWFVDLLAERLRAPLREEAKFFGYTSRVTLGGEDVRLLVPTTFMNLSGKAVAAMAS
FFRINPDEILVAHDELDLPPGVAKFKLGGGHGGHNGLKDIISKLGNNPNFHRLRIGIGHPGDKNKVVGFVLGKPPVSEQK
LIDEAIDEAARCTEMWFTDGLTKATNRLHAFKAQ
>Q5NGZ6 3.1.1.29~~~pth~~~Peptidyl-tRNA hydrolase~~~COG0193
MPKIKMIIGLGNIGKEYQDTRHNVGEWFIAKIAQDNNQSFSSNPKLNCNLAKVSIDYNNVVLVFPTTYMNNSGLAVSKVA
NFYKIAPAEILVVHDELDIDSGEIRLKKGGGHGGHNGLRSINQHLGTNDYLRLRIGIGHPGHKSKVANYVLSNPSIAQKK
DIDSAIDNGICFLDDIINYKLEPVMQKLHTK
>A6TAP7 3.1.1.29~~~pth~~~Peptidyl-tRNA hydrolase~~~
MTIKLIVGLANPGAEYAATRHNAGAWYVDLLADRHRAPLREESKFFGYTSRINLAGEDVRLLVPTTFMNLSGKAVAAMAT
FYRINPDEILVAHDELDLPPGVAKFKLGGGHGGHNGLKDIISKLGNNPNFHRLRVGIGHPGDKNKVVGFVLGKPPASEQK
LIDDAVDEAARCTEIWLKDGLTKATNRLHAFKAQ
>A0R3D3 3.1.1.29~~~pth~~~Peptidyl-tRNA hydrolase~~~COG0193
MAEPLLVVGLGNPGPTYAKTRHNLGFMVADVLAGRIGSAFKVHKKSGAEVVTGRLAGTSVVLAKPRCYMNESGRQVGPLA
KFYSVPPQQIVVIHDELDIDFGRIRLKLGGGEGGHNGLRSVASALGTKNFHRVRIGVGRPPGRKDPAAFVLENFTAAERA
EVPTIVEQAADATELLIAQGLEPAQNTVHAW
>P9WHN7 3.1.1.29~~~pth~~~Peptidyl-tRNA hydrolase~~~COG0193
MAEPLLVVGLGNPGANYARTRHNLGFVVADLLAARLGAKFKAHKRSGAEVATGRSAGRSLVLAKPRCYMNESGRQIGPLA
KFYSVAPANIIVIHDDLDLEFGRIRLKIGGGEGGHNGLRSVVAALGTKDFQRVRIGIGRPPGRKDPAAFVLENFTPAERA
EVPTICEQAADATELLIEQGMEPAQNRVHAW
>B4RK78 3.1.1.29~~~pth~~~Peptidyl-tRNA hydrolase~~~
MSNTIKMVVGLGNPGKEYEQTRHNAGFWFLDELAWKWKASFKEEKKFFGEVARAALPDGDVWLLKPATFMNRSGQAVAAL
AQFYKIKPEEILVVHDELDIPCGRIKFKLGGGNGGHNGLKDIQAKLGTADYYRLRLGIGHPGDRNLVVGYVLNKPSAEHR
RQIDDAVAKSLQAVPDIISGKWEEATRFLHSK
>Q9HVC3 3.1.1.29~~~pth~~~Peptidyl-tRNA hydrolase~~~
MTAVQLIVGLGNPGPEYDQTRHNAGALFVERLAHAQGVSLVADRKYFGLVGKFSHQGKDVRLLIPTTYMNRSGQSVAALA
GFFRIAPDAILVAHDELDMPPGVAKLKTGGGHGGHNGLRDIIAQLGNQNSFHRLRLGIGHPGHSSLVSGYVLGRAPRSEQ
ELLDTSIDFALGVLPEMLAGDWTRAMQKLHSQKA
>Q2G0R9 3.1.1.29~~~pth~~~Peptidyl-tRNA hydrolase~~~COG0193
MKCIVGLGNIGKRFELTRHNIGFEVVDYILEKNNFSLDKQKFKGAYTIERMNGDKVLFIEPMTMMNLSGEAVAPIMDYYN
VNPEDLIVLYDDLDLEQGQVRLRQKGSAGGHNGMKSIIKMLGTDQFKRIRIGVGRPTNGMTVPDYVLQRFSNDEMVTMEK
VIEHAARAIEKFVETSRFDHVMNEFNGEVK
>Q6YP15 3.1.1.29~~~pth~~~Peptidyl-tRNA hydrolase~~~
MKCIVGLGNIGKRFELTRHNIGFEVVDYILEKNNFSLDKQKFKGAYTIERMNGDKVLFIEPMTMMNLSGEAVAPIMDYYN
VNPEDLIVLYDDLDLEQGQVRLRQKGSAGGHNGMKSIIKMLGTDQFKRIRIGVGRPTNGMTVPDYVLQRFSNDEMVTMEK
VIEHAARAIEKFVETSRFDHVMNEFNGEVK
>B5XIP6 3.1.1.29~~~pth~~~Peptidyl-tRNA hydrolase~~~
MVKMIVGLGNPGSKYEKTKHNIGFMAIDNIVKNLDVTFTDDKNFKAQIGSTFINHEKVYFVKPTTFMNNSGIAVKALLTY
YNIDITDLIVIYDDLDMEVSKLRLRSKGSAGGHNGIKSIIAHIGTQEFNRIKVGIGRPLKGMTVINHVMGQFNTEDNIAI
SLTLDRVVNAVKFYLQENDFEKTMQKFNG
>Q5SHZ2 3.1.1.29~~~pth~~~Peptidyl-tRNA hydrolase~~~COG0193
MFLVVGQGNPGERYARTRHNLGFMVLDRLGLSFRPRGEALVAEAEGGLFLKPLTYYNLTGRAVAPLARFYKIPPERILVV
HDEMDLPLGRIRFKAGGSAAGNRGVLSIEEALGTRAFHRLRLGIGKPPDPSRGAEYVLSPFREEELPVVERVLEAAKEAV
WCWVREGLPPCAGRFNGLDLSLG
>A5F686 3.1.1.29~~~pth~~~Peptidyl-tRNA hydrolase~~~COG0193
MSQPIKLLVGLANPGPEYAKTRHNAGAWVVEELARIHNVTLKNEPKFFGLTGRLLINSQELRVLIPTTFMNLSGKAIAAL
ANFYQIKPEEIMVAHDELDLPPGVAKFKQGGGHGGHNGLKDTISKLGNNKEFYRLRLGIGHPGHKDKVAGYVLGKAPAKE
QECLDAAVDESVRCLEILMKDGLTKAQNRLHTFKAE
>Q9KQ21 3.1.1.29~~~pth~~~Peptidyl-tRNA hydrolase~~~COG0193
MSQPIKLLVGLANPGPEYAKTRHNAGAWVVEELARIHNVTLKNEPKFFGLTGRLLINSQELRVLIPTTFMNLSGKAIAAL
ANFYQIKPEEIMVAHDELDLPPGVAKFKQGGGHGGHNGLKDTISKLGNNKEFYRLRLGIGHPGHKDKVAGYVLGKAPAKE
QECLDAAVDESVRCLEILMKDGLTKAQNRLHTFKAE
>C3LPI9 3.1.1.29~~~pth~~~Peptidyl-tRNA hydrolase~~~
MSQPIKLLVGLANPGPEYAKTRHNAGAWVVEELARIHNVTLKNEPKFFGLTGRLLINSQELRVLIPTTFMNLSGKAIAAL
ANFYQIKPEEIMVAHDELDLPPGVAKFKQGGGHGGHNGLKDTISKLGNNKEFYRLRLGIGHPGHKDKVAGYVLGKAPAKE
QECLDAAVDESVRCLEILMKDGLTKAQNRLHTFKAE
>P46319 ~~~licA~~~Lichenan-specific phosphotransferase enzyme IIA component~~~COG1447
MNEEMEQIIFQIILHGGNGRSSAMEAIAAAKSGDAEEARKKLQDAAEELSKAHHYQTELIQNEAGGEKTEMTLLMVHAQD
HLMNAMTVKDMAAEIIELYEKITEQRGASI
>P46318 2.7.1.-~~~licB~~~Lichenan-specific phosphotransferase enzyme IIB component~~~COG1440
MNILLVCAAGMSTSLLVSKMEKSAQEQGKDYTIWAVSGDSVQNHIDKADVLLLGPQVRYMLPQLKKLGESKGVPVDVINT
VHYGTCNGAEVLKSAEQLGHVS
>P46317 ~~~licC~~~Lichenan permease IIC component~~~COG1455
MNKVNQILEEKVMPIAGRIAGQRHLQALRDGIILTMPLIIIGSFFLIIGNLPIPGYAEFMAKTFGSSWSEKLAYPVDATF
EIMGLVAAFGIAYRLAEKYGVDALSAGAISLAAFLLATPYQVPFMPDGATKEIMVGGGIPLSLMGSKGLFVAMIIAMVST
EIYRLIIQRNLVFKMPDGVPPAVSKSFVALIPGFAVIFLIWAARLIVEATPFESLHNIVSVLLGTPLSILGGSLGGSLVA
EAVKMLLWACGLHGANIVGGVMAPIWYGAMDANRIAFQAGEELPKIFTQQFFDIWVNIGGSGATLALVVTMFLRARSKQM
KQLGKLAVGPAIFNINEPIIFGMPIVMNPMLLLPFIITPLVTVTLTYIGMSTGLVAKPAGIAVPWTMPPIFSGYLATGGK
VSGAVMQAINIAVSFVVYYPFFRMWDKQKLKEENDLELVQTPAATDDKEAAL
>P69828 ~~~gatA~~~PTS system galactitol-specific EIIA component~~~COG1762
MTNLFVRSGISFVDRSEVLTHIGNEMLAKGVVHDTWPQALIAREAEFPTGIMLEQHAIAIPHCEAIHAKSSAIYLLRPTN
KVHFQQADDDNDVAVSLVIALIVENPQQQLKLLRCLFGKLQQPDIVETLITLPETQLKEYFTKYVLDSDE
>P9WPI9 2.7.10.-~~~ptkA~~~Tyrosine-protein kinase PtkA~~~COG0546
MSSPRERRPASQAPRLSRRPPAHQTSRSSPDTTAPTGSGLSNRFVNDNGIVTDTTASGTNCPPPPRAAARRASSPGESPQ
LVIFDLDGTLTDSARGIVSSFRHALNHIGAPVPEGDLATHIVGPPMHETLRAMGLGESAEEAIVAYRADYSARGWAMNSL
FDGIGPLLADLRTAGVRLAVATSKAEPTARRILRHFGIEQHFEVIAGASTDGSRGSKVDVLAHALAQLRPLPERLVMVGD
RSHDVDGAAAHGIDTVVVGWGYGRADFIDKTSTTVVTHAATIDELREALGV
>P0A435 2.7.1.200~~~gatB~~~PTS system galactitol-specific EIIB component~~~COG3414
MKRKIIVACGGAVATSTMAAEEIKELCQSHNIPVELIQCRVNEIETYMDGVHLICTTARVDRSFGDIPLVHGMPFVSGVG
IEALQNKILTILQG
>P37188 2.7.1.200~~~gatB~~~PTS system galactitol-specific EIIB component~~~COG3414
MKRKIIVACGGAVATSTMAAEEIKELCQNHNIPVELIQCRVNEIETYMDGVHLICTTAKVDRSFGDIPLVHGMPFISGIG
IEALQNKILTILQG
>P69831 ~~~gatC~~~PTS system galactitol-specific EIIC component~~~COG3775
MFSEVMRYILDLGPTVMLPIVIIIFSKILGMKAGDCFKAGLHIGIGFVGIGLVIGLMLDSIGPAAKAMAENFDLNLHVVD
VGWPGSSPMTWASQIALVAIPIAILVNVAMLLTRMTRVVNVDIWNIWHMTFTGALLHLATGSWMIGMAGVVIHAAFVYKL
GDWFARDTRNFFELEGIAIPHGTSAYMGPIAVLVDAIIEKIPGVNRIKFSADDIQRKFGPFGEPVTVGFVMGLIIGILAG
YDVKGVLQLAVKTAAVMLLMPRVIKPIMDGLTPIAKQARSRLQAKFGGQEFLIGLDPALLLGHTAVVSASLIFIPLTILI
AVCVPGNQVLPFGDLATIGFFVAMAVAVHRGNLFRTLISGVIIMSITLWIATQTIGLHTQLAANAGALKAGGMVASMDQG
GSPITWLLIQVFSPQNIPGFIIIGAIYLTGIFMTWRRARGFIKQEKVVLAE
>O52788 2.7.10.-~~~ptk~~~Tyrosine-protein kinase ptk~~~
MYVMSQTTNTEDTIDLKELFFSLIAQWKLIALCIILSLICALLYLRATPDTYSVNALVQVEENKGASAALLGDLSSMIEQ
KQPAQAEIEILKSRLVLGNVIQHLNLDLKISGTENSFTDRLLSPHHYQTEYQPKSVLFKDDEKVFDIRQFNIPASFRDKK
IELRFKDGQFSLTNTQTEQVILTGKTNQSNTLRTADGLWNISIYTQDQLNDVYLIQKQSLPAAVNNILTNYSVAEKGKLT
GILGLNYQGTDKTHITQVLNAILVSYSQQNIERRSAETAQTLKFLDEQLPELKQQLDVAEREFNKFRQQYNTVDVTKESE
LFLTQSVTLETQKAQLEQQVAEAGAKYTSEHPVMKQMNAQLGAINKKIGELNATLKELPDLQRRYLQLYREVEVKQQLYT
ALLNSYQQLRIAKAGEIGNVRIVDTAVEPIEPIAPKKLQILILSIFLGGFLGTLLALLRNMMRSGIKDSTQIENELDLPV
YATVPRSPVQESRINILKKKKNIPILAVKNSDDIAIESLRSMRTAIHFALSSARNNLITISGPAPEVGKSFISTNLATIL
AQSDKRVLIIDADLRRGYLHKYFNLDTQPGLTELLNGQQSLETVIRHTEVPGLSVISRGKSPANPSELLSSNQFKNLLEQ
MSEKFDHVIIDTPPVLAVTDGIIISQYTGVNLVIARYAKTQMKELELTLNRFEQAGVKVNGFILNDIQRSSAGYGYGYGY
NYAYAYKANKESD
>Q45390 ~~~ptlA~~~Type IV secretion system protein PtlA~~~
MNPLKDLRASLPRLAFMAACTLLSATLPDLAQAGGGLQRVNHFMASIVVVLRGASVATVTIAIIWAGYKLLFRHADVLDV
VRVVLAGLLIGASAEIARYLLT
>P11502 ~~~lacF~~~PTS system lactose-specific EIIA component~~~COG1447
MMATKEEISMVGFALVAYAGDARTAAVHALDAAEAGDFDKANELVEKAQQDINEAHNQQTQLLSQEAGGAEMDVTFIMVH
GQDTLMTTMLLIDETRYMIRMFKRIKELENKQ
>P23532 ~~~lacF~~~PTS system lactose-specific EIIA component~~~
MNREEMTLLGFEIVAYAGDARSKLLEALKAAENGDFAKADSLVVEAGSCIAEAHSSQTGMLAREASGEELPYSVTMMHGQ
DHLMTTILLKDVIHHLIELYKRGAK
>P0A0D6 ~~~lacF~~~PTS system lactose-specific EIIA component~~~
MNREEVQLLGFEIVAFAGDARSKFLEALTAAQAGDFAKADALIEEGNNCIAEAHRAQTSLLAKEAQGDDIAYSVTMMHGQ
DHLMTTILLKDLMKHLLEFYKRG
>Q82IY4 4.2.3.7~~~ptlA~~~Pentalenene synthase~~~COG2124
MPQDVDFHIPFPSRRSPDFERARADHLSWPRALGLIGTDAAAERHSRGGYADLAARFYPSATGADLDLGVDLMSWFFLFD
DLFDGPRGEDPQETRKLTDAVAAALDGPLPTSAPPIAHGFADVWRRTCQGMSPAWRARSARHWRNYFSGYVDEAVSRHLN
TPYDSAGHYLAMRRQTIGVQPTVDLAERSCHCEVPQRVFDSAVLFAMLQIATDTNLILNDIASLEKEEARGELNNMVFIL
MREHGWTRGRSIAHMQDGVRTRLEQFLLLEACLPKVYDTFELTAQERESAEKYRMDGVRSVIRGSYDWHRSSGRYAADYA
IAASYQGYLEELGSTL
>Q45391 ~~~ptlB~~~Type IV secretion system protein PtlB~~~COG3702
MRDPLFKGCTRPAMLMGVPATPLAVCSGTIALLGIWFSIAFLALFPVALLAMRIMIRRDDQQFRLIWLYLRMRWLSRDRT
HAFWQSTVYAPLRYAERRRRLRKP
>P24400 ~~~lacE~~~PTS system lactose-specific EIICB component~~~COG1440
MNKVFDKLKPVFEAIAANKYISAIRDGFIACMPIIIFSSIFMMVAYVPNAWGFYWPDNVTNTLMVAYNYSMGLLALFVAG
TTAKNLTDSKNLELPKTNQINPVAVIVASEISFVILSILPLKTGVDLTYMGTQGLICAYIVGLIVPNIYYVCIKNNVTIK
LPEQVPGNIAQSFKDLIPMGLSVTAFWLFGVGFKAATGTVLPRWIIQVLSPLFQASDSYLGLALIAGAMAFFWFCGVQGP
SIVQPAVVPIMIANTAANLQQYQAGQHVSHVLAMNTMDYVMNFGGTGATLVVPFIMLFAARSAQLKAVGKAAFVPCTFGV
NEPVLFGMPIIMNPMLFIPFLATPIVNVCLFKFFVSVLGMNSMMYTMPWTVPGPIGILISTGFAPLAFVFVLLTLVLDVA
IYFPFIRVYDSTLLAEEKAKEEVIEDDGMAVLASDTVSPSIPTGLTVATATDDDATHVLPETAPSAHGEAYFKQNEVDVL
VLCAGGGTSGILANALNKLSKERGLKLSAAARAYGQDMDLIKDMNMVILAPQMESMKGNLKKITDKYGVKLVTTTGRQYI
ELTNNGDMALDFVESNL
>P23531 ~~~lacE~~~PTS system lactose-specific EIICB component~~~
MHKLIELIEKGKPFFEKISRNIYLRAIRDGFIAGMPVILFSSIFILIAYVPNAWGFHWSKDIETFLMTPYSYSMGILAFF
VGGTTAKALTDSKNRDLPATNQINFLSTMLASMVGFLLMAAEPAKEGGFLTAFMGTKGLLTAFIAAFVTVNVYKVCVKNN
VTIRMPEDVPPNISQVFKDLIPFTVSVVLLYGLELLVKGTLGVTVAESIGTLIAPLFSAADGYLGITLIFGAYAFFWFVG
IHGPSIVEPAIAAITYANIDVNLHLIQAGQHADKVITSGTQMFIATMGGTGATLIVPFLFMWICKSDRNRAIGRASVVPT
FFGVNEPILFGAPIVLNPIFFVPFIFAPIVNVWIFKFFVDTLNMNSFSANLPWVTPGPLGIVLGTNFQVLSFILAGLLVV
VDTIIYYPFVKVYDEQILEEERSGKTNDALKEKVAANFNTAKADAVLGKADVAKEDVAANNNITKETNVLVLCAGGGTSG
LLANALNKAAAEYNVPVKAAAGGYGAHREMLPEFDLVILAPQVASNFDDMKAETDKLGIKLVKTEGAQYIKLTRDGQGAL
AFVQQQFD
>Q99S77 ~~~lacE~~~PTS system lactose-specific EIICB component~~~
MMQKLIAQIEKGKPFFEKLSRNIYLRAIRDGFISAMPVILFSSIFLLIAYVPNIFGFKWDKGMEAILMKPYNYTMGLVAF
LVAGTTAKSLTDSFNRKLESTNQINFISTMLAAMCGFLFLASDPAKDGGFLSAFMGTKGLLTAFLSAFVTVIVYNFCVKR
NITIKMPKEVPPNISQVFKDLIPFSAVIIILYALDLVIRNSFKSNVAEGILKLFEPLFTAADGWIGVTIIFGAFALFWFV
GIHGPSIVEPAIAAITYANIEANFKLLQAGEHADKIITSGTQMFIVTFGGTGATLVVPFMFMWMTKSKRNKAIGRASVVP
TFFGVNEPILFGAPLVLNPVFFIPFVLAPIVNVWIFKLFVEVLGINSFSVNLPWTTPGPLGIIMGTGFGLWSFVLAITLI
VVDIIIYYPFLKVYDSEILDEEEGRKESNSDLKEKVAANFDTKKADSILAASGVSDDAAKASNITEQTNVLVLCAGGGTS
GLLANALNKAAEEYHVPVKAAAGGYGAHMDIMKEYQLIILAPQVASNYEDIKQDTDRLGIKLAKTQGAEYIKLTRDGQAA
LDFVQQQFEN
>P11162 ~~~lacE~~~PTS system lactose-specific EIICB component~~~
MMQKLIAQIEKGKPFFEKLSRNIYLRAIRDGFISAMPVILFSSIFLLIAYVPNIFGFKWDKGMEAILMKPYNYTMGLVAF
LVAGTTAKSLTDSFNRKLESTNQINFISTMQAAMCGFLFLASDPAKDGGFLSAFMGTKGLLTAFLSAFVTVIVYNFCVKR
NITIKMPKEVPPNISQVFKDLIPFSAVIIILYALDLVIRNSFKSNVAEGILKLFEPLFTAADGWIGVTIIFGAFALFWFV
GIHGPSIVEPAIAAITYANIEANFKLLQAGEHADKIITSGTQMFIVTFGGTGATLVVPFMFMWMTKSKRNKAIGRASVVP
TFFGVNEPILFGAPLVLNPVFFIPFVLAPIVNVWIFKLFVEVLGMNSFSVNLPWTTPGPLGIIMGTGFGLWSFVLAITLI
VVDIIIYYPFLKVYDSEILDEEEGRKESNSDLKEKVAANFDTKKADSILAASGVSDDAAKASNITEQTNVLVLCAGGGTS
GLLANALNKAAEEYHVPVKAAAGGYGAHMDIMKEYQLIILAPQVASNYEDIKQDTDRLGIKLAKTQGAEYIKLTRDGQAA
LDFVQQQFEN
>Q7VSX9 ~~~ptlC~~~Type IV secretion system protein PtlC~~~COG3451
MNRRGGQTAFAAIARNERAIAAFIPYSSHLTDTTLITHGADLVRTWRVQGIAFESAEPELVSQRHEQLNGLWRAISCEQV
ALWIHCIRRKTQAGLDARYENPFCRALDASYNARLNARQAMTNEFYLTLVYRPGHAALGKRAHHGQAEVRRQLLAHVRRM
DEIGSLIETTLRSHGENHEQAITVLGCETDSAGRRYSRTLTLLEFLLTGHWQPVRVPAGPVDAYLGSSRILAGAEMMELR
APTCRRYAQFIDFKEYGTHTEPGMLNALLYEDYEYVITHSFSAVGKRQALAYLQRQRAQLANVQDAAYSQIDDLAHAEDA
LVNGDFVIGEYHFSMMILGADPRQLRRDVSSAMTRIQERGFLATPVTLALDAAFYAQLPANWAYRSRKAMLTSRNFAGLC
SFHNFYGGKRDGNPWGPALSLLSTPSGQPFYFNFHHSGLDEDCRGQMMLGNTRIIGQSGSGKTVLLNFLLCQLQKFRSAD
ADGLTTIFFDKDRGAEICIRALDGQYLRIRDGEPTGFNPLQLPCTDRNVMFLDSLLAMLARAHDSPLTSAQHATLATAVR
TVLRMPASLRRMSTLLQNITQATSEQRELVRRLGRWCRDDGAGGTGMLWWVFDNPNDCLDFSRPGNYGIDGTAFLDNAET
RTPISMYLLHRMNEAMDGRRFVYLMDEAWKWIDDPAFAEFAGDQQLTIRKKNGLGVFSTQMPSSLLGARVAASLVQQCAT
EIYLPNPRADRAEYLDGFKCTETEYQLIRSMAEDSHLFLVKQGRQAVVAQLDLSGMDDELAILSGNARNLRCFEQALALT
RERDPNDWIAVFHRLRREASAGLR
>Q7VSX8 ~~~ptlD~~~Type IV secretion system protein PtlD~~~
MAGLSRILLSCTLACLLAGQAAQASVDDPTRAGGDNRVRALRADQARRDVLLTACRDDPGHRRGEPDCVNAERAQALQQW
QAAAMTSVDAAFSDLAGALRNAAPRRMEAAIVRLTRQLQPLVYSMMTLLVLLTGYALLARRDRPFEWHIRHALLVAVVTS
LALSPDRYLSTVVAGVQDVAGWLSGPWTAPDGAAGRGGLAQLDQFAAQAQAWVAQLAGQAANDANPGSAVNWLLCAMIVA
ASAGGWLCLAASLLIVPGLIVTLLLSLGPLFLVLLLFPALQRWTNAWLGALVRALVFMALGTPAVGLLSDVLAGALPAGL
PQRFATDPLRSTMLAATLCATATLMLLTLVPLASSVNAGLRRRLWPNAAHPGLAQAHRQAAARQYAPRPAAAAAAAGPHQ
AGTYAASATPAPAPARPAPSFPAHAYRQYALGGARRPPPRVRRDDRPAPAPDRRVLPRKPNLP
>Q82IY7 1.14.11.36~~~ptlD~~~Pentalenolactone F synthase~~~COG2175
MSRAMDITRIPGSAIGAVVAGADFSGTIDDTQVEEIWQALDQHLVLVFRGHKDPSNDDLLMFARRFGHVPKTGLTTGASP
DHNEILLISNILDENGQKIGVGNAEWMDWHTDYSFRPRVSRIGFLAAVELPPSGGGQTLFTDMYTAYESLPDDLRQRLHS
YRARHSLRSGYEDVIEEEYQGEVSIEGPTAKPFVAPEDGTATVHQLIARNPRTGRRAVYANPLNTKRILELDVTSSKEVL
QQLFAKPGEPELTYAHEWLPGDIVMWDQLGTVHAKRAFDPTERRLLRKVVTIFDDPAEPWHPEDAA
>Q7VSX6 ~~~ptlE~~~Type IV secretion system protein PtlE~~~COG3736
MPDPRPLTPDQTHGRGHAEAAVDWEASRLYRLAQSERRAWTVAWAALAVTALSLIAIATMLPLKTTIPYLIEVEKSSGAA
SVVTQFEPRDFTPDTLMNQYWLTRYVAARERYDWHTIQHDYDYVRLLSAPAVRHDYETSYEAPDAPDRKYGAGTTLAVKI
LSAIDHGKGVGTVRFVRTRRDADGQGAAESSIWVATVAFAYDQPRALTQAQRWLNPLGFAVTSYRVDAEAGQP
>Q82IY8 1.14.13.171~~~ptlE~~~Neopentalenolactone D synthase~~~COG2072
MVDIEAVRAKYREERDKRVQADGGRQYLSAQGEFAHYADDPHAKPIERAPVSDEVDVTIIGAGIGGLLLGARLREACAFD
TIRLVDKAGDVGGTWYWNRFPGLRCDVESYVYMPLLEELGRLPSEKYATGAEIFEHCQAIARTYDLYDEALLQTSVTELS
WDEDSSRWLVRTDRGDLVRSRFVAMAIGSLHRPKLPSIPGTEAFQGHSFHTSRWDFAYTGGDISGGLEKLGDKRVGIVGT
GATAVQCIPHLAESAAHLYVFQRTPSTVSVRNNRPTDPGWAAGLEPGWQQRRMDNFHALTSGVDQDVDLVQDGWTEITSK
LAAILPKSAADADPKDIGTAVELADFHKMEELRKRVDAIVHDKDTADALKPYYRLFCKRPCFHDGYLDTYNRPNVTLVDT
QGRGVERLTPTSVVAGGREYPVDCLIFASGYESEFGVPYTNRTGFSIVGRDGIRLSEKWAEGARTFHGLQVNGFPNCFIL
SKAQSGLHVNVPYMLNEQSKHVAYILKAVQQRGRQVVEASATGEKEWVETILRLANRNLDFTESCTPGLFNNEGNPRNVA
ILNSSYGGGSVGFVNILKRWREADDLADLELREG
>Q7VSX5 ~~~ptlF~~~Type IV secretion system protein PtlF~~~COG3504
MMAARMMAAGLAATALSAHAFRIPTPGEQDARIQTVPYHPEEVVLVRAWNGYVTRIVFDEQEKIIDVAAGFADGWQFSPE
GNVLYIKAKSFPAQGSPAQAPEPGLWNTNLLVKTDRRLYDFDLVLASADAATPQALQRSRMAYRLQFRYPAAPQAASRAS
PVGPAVPAGALNRRYAMQVGNGSDGIAPIAAYDDGRHTWLTFRPGQPFPAVFAVAPDGTETLVNLHIDNQSLVIHRVAPV
LMLRSGASVIRIVNQNGDASESPAFECHAEPAL
>Q82IY9 1.1.1.340~~~ptlF~~~1-deoxy-11-beta-hydroxypentalenate dehydrogenase~~~COG0300
MHLQPSTAVVTGAASGIGFALSARLAQAGARVVMTDIAGDGLAGAVEELAAHGADVTAVVADLTDPAAVQELADTAFGRL
GDIDVVCNNAGVVGPVGMPLWSVPLDEMHAVFDVNYWAHVHVARAFVPRLLDSGRPSHLVQTASMSAFVVGAGTASYAAS
KHADLAAARSLRADLDGTPVRVSVLCPGRVDTPMTRGLVAPRNATGNTTISADEAADAVWNALGSDRFYIFTNADAQTRL
GDQFNDVWRHLAREKYWTESSSPSVNSSRP
>Q7VSX4 ~~~ptlG~~~Type IV secretion system protein PtlG~~~COG2948
MLNRPSSPDGGEAHAWPPDPEIPVFANAEHAHRRPLRWMFALVAVALSCLLATGIWRSRAAPPHAATQTVAPAGQALPPG
RIFTVHPREPEPAPLPDMPAAPDPILPQPRPAPPVPPPPIRAPYDYDEPAPRRDSAALKSGPAMMVATAARLGQTERAGM
ADDGVSADAATLIGRNVSRATRSGGRDYRLLPGTFIDCILQTRIVTNVPGLTTCIVSRDVYSASGKRVLVPRGTTVVGEY
RADLAQGSQRIYVAWSRLFMPSGLTIELASPAVDGTGAAGLPGVVDDKFAQRFGGALLLSVLGDATSYMLARATDARHGV
NVNLTAAGTMNSLAASALNNTINIPPTLYKNHGDQIGILVARPLDFSILRGTNE
>Q7VSX3 7.4.2.8~~~ptlH~~~Type IV secretion system protein PtlH~~~COG0630
MNDAAPDRQASVDFHLQALHPWLSRQDIAEICVNRPGQLWYEDRNGWNRQESGALTLDHLHALATATARFCDRDICPERP
LLAASLPGGERVQIVVPPACEPGTLSLTIRKPARRIWPLSELLRDTLDLPGVPGASQARPDPLLDPWRRGAWDDFLRLAV
QAGKAILVAGQTGSGKTTLMNALSGEIPPRERIVTIEDVRELRLDPATNHVHLLYGTPTEGRTAAVSATELLRAALRMAP
TRILLAELRGGEAFDFLQACASGHSGGISTCHAASADMALQRLTLMCMQHPNCQMLPYSTLRALVESVIDIVVVVERRAG
QGARRRVVDIWYRDGLPAP
>Q82IZ1 1.14.11.35~~~ptlH~~~1-deoxypentalenic acid 11-beta-hydroxylase~~~COG5285
MTNVTGDYTDCTPLLGDRAALDSFYEEHGYLFLRNVLDRDLVKTVAEQMREGLVALGAADPHATLEELTIDSFESVDEVA
MHDYVKYDAFWNNPSTIKVFEQVFGEPVFVFLSTTIRYYPSQAGSEEPSFHYLTPFHQDGFYIGPNQDFRTFWIPLIRTT
RESGGVALADGSHRRGKRDHVLNESFRRFGHPVRGIPPTEVSEDEHLLHSPMEPGDILLFHAHMCHKSIPNLSKDPRLMR
MSMDTRVQPAKSHRGFNAMTPWTESAKDASKGIMAKITGTPTDVE
>Q7VSX7 ~~~ptlI~~~Type IV secretion system protein PtlI~~~
MIHAHSNARLLRWAILAIAPVTLGACAPNGPPGLPYPDGKPLIPINTAAPEQGSSCQTRAP
>Q82IY3 1.14.15.32~~~ptlI~~~Pentalenene oxygenase~~~COG2124
MSQHTFVAGTAPGAVPVVGHAWQMMRRPLHFMSSLSAHGDLVKIRIGPTSAYVPCHPELLRQVLTNDRVFDKGGVFYDRA
RDIAGNGLVTCPYRDHRRQRRLMQSAFQRTQLERYSTAMRAEIDATAARWHDGTVIDAFPELYGMALRTVARTLYSTPVT
EELAQRVEQAFDTVLNGLFRQMFLPHSLRRLPTPANLRYRNNLRFLHDTVQDLITEYRRDDTQRDDLLSALLASRDEDGG
RLGDTEIHDQVITVMAAGTETVAGTLTWIFHLLSRHPEIEARLYEEIDTVLDGKPPHWDDLPSLSLTDRIITEALRMYPP
AWIFTRLTASDVDLAGVRLPEGTTIVFSPSSVQRHSEAYDDASRFDPDRWLPDRTSAVARQAFTAFGTGARKCIGDLFAR
TEATLALATMLSQWRVTVEPDADVRPVALATVYHPRRLRLRLTARTPGQ
>Q8GCB2 4.2.2.22~~~pelA~~~Pectate trisaccharide-lyase~~~
MKKLISIIFIFVLGVVGSLTAAVSAEAASALNSGKVNPLADFSLKGFAALNGGTTGGEGGQTVTVTTGDQLIAALKNKNA
NTPLKIYVNGTITTSNTSASKIDVKDVSNVSIVGSGTKGELKGIGIKIWRANNIIIRNLKIHEVASGDKDAIGIEGPSKN
IWVDHNELYHSLNVDKDYYDGLFDVKRDAEYITFSWNYVHDGWKSMLMGSSDSDNYNRTITFHHNWFENLNSRVPSFRFG
EGHIYNNYFNKIIDSGINSRMGARIRIENNLFENAKDPIVSWYSSSPGYWHVSNNKFVNSRGSMPTTSTTTYNPPYSYSL
DNVDNVKSIVKQNAGVGKINP
>B1B6T1 4.2.2.22~~~pel~~~Pectate trisaccharide-lyase~~~
MKKLISIIFIFVLGVVGSLTAAVSAEAASALNSGKVNPLADFSLKGFAALNGGTTGGEGGQTVTVTTGDQLIAALKNKNA
NTPLKIYVNGTITTSNTSASKIDVKDVSNVSIVGSGTKGELKGIGIKIWRANNIIIRNLKIHEVASGDKDAIGIEGPSKN
IWVDHNELYHSLNVDKDYYDGLFDVKRDAEYITFSWNYVHDGWKSMLMGSSDSDNYNRTITFHHNWFENLNSRVPSFRFG
EGHIYNNYFNKIIDSGINSRMGARIRIENNLFENAKDPIVSWYSSSPGYWHVSNNKFVNSRGSMPTTSTTTYNPPYSYSL
DNVDNVKSIVKQNAGVGKINP
>Q9WYR4 4.2.2.22~~~pelA~~~Pectate trisaccharide-lyase~~~COG3866
MLMRFSRVVSLVLLLVFTAVLTGAVKASLNDKPVGFASVPTADLPEGTVGGLGGEIVFVRTAEELEKYTTAEGKYVIVVD
GTIVFEPKREIKVLSDKTIVGINDAKIVGGGLVIKDAQNVIIRNIHFEGFYMEDDPRGKKYDFDYINVENSHHIWIDHCT
FVNGNDGAVDIKKYSNYITVSWCKFVDHDKVSLVGSSDKEDPEQAGQAYKVTYHHNYFKNCIQRMPRIRFGMAHVFNNFY
SMGLRTGVSGNVFPIYGVASAMGAKVHVEGNYFMGYGAVMAEAGIAFLPTRIMGPVEGYLTLGEGDAKNEFYYCKEPEVR
PVEEGKPALDPREYYDYTLDPVQDVPKIVVDGAGAGKLVFEELNTAQ
>P00550 ~~~mtlA~~~PTS system mannitol-specific EIICBA component~~~COG2213
MSSDIKIKVQSFGRFLSNMVMPNIGAFIAWGIITALFIPTGWLPNETLAKLVGPMITYLLPLLIGYTGGKLVGGERGGVV
GAITTMGVIVGADMPMFLGSMIAGPLGGWCIKHFDRWVDGKIKSGFEMLVNNFSAGIIGMILAILAFLGIGPIVEALSKM
LAAGVNFMVVHDMLPLASIFVEPAKILFLNNAINHGIFSPLGIQQSHELGKSIFFLIEANPGPGMGVLLAYMFFGRGSAK
QSAGGAAIIHFLGGIHEIYFPYVLMNPRLILAVILGGMTGVFTLTILGGGLVSPASPGSILAVLAMTPKGAYFANIAGVC
AAMAVSFVVSAILLKTSKVKEEDDIEAATRRMQDMKAESKGASPLSAGDVTNDLSHVRKIIVACDAGMGSSAMGAGVLRK
KIQDAGLSQISVTNSAINNLPPDVDLVITHRDLTERAMRQVPQAQHISLTNFLDSGLYTSLTERLVAAQRHTANEEKVKD
SLKDSFDDSSANLFKLGAENIFLGRKAATKEEAIRFAGEQLVKGGYVEPEYVQAMLDREKLTPTYLGESIAVPHGTVEAK
DRVLKTGVVFCQYPEGVRFGEEEDDIARLVIGIAARNNEHIQVITSLTNALDDESVIERLAHTTSVDEVLELLAGRK
>C0H3V2 ~~~mtlF~~~Mannitol-specific phosphotransferase enzyme IIA component~~~COG4668
MQVLAKENIKLNQTVSSKEEAIKLAGQTLIDNGYVTEDYISKMFEREETSSTFMGNFIAIPHGTEEAKSEVLHSGISIIQ
IPEGVEYGEGNTAKVVFGIAGKNNEHLDILSNIAIICSEEENIERLISAKSEEDLIAIFNEVN
>P69824 ~~~cmtB~~~Mannitol-specific cryptic phosphotransferase enzyme IIA component~~~COG1762
MRLSDYFPESSISVIHSAKDWQEAIDFSMVSLLDKNYISENYIQAIKDSTINNGPYYILAPGVAMPHARPECGALKTGMS
LTLLEQGVYFPGNDEPIKLLIGLSAADADSHIGAIQALSELLCEEEILEQLLTASSEKQLADIISRG
>P17876 ~~~mtlF~~~Mannitol-specific phosphotransferase enzyme IIA component~~~COG4668
MTELFSNENIFLNQSFEDQNEAIEKAGQALVDAGAVTEDYIQAMKDREAVVSTFMGNGLAIPHGTDEAKSAVLQSGLTLL
QIPEGVQWGDDVAKVVVGIAGKDGEHLDLLSKIAITFSEEENVDRIVNTKSPEEIKAVFEEADV
>O65989 ~~~mtlA~~~PTS system mannitol-specific EIICB component~~~COG2213
METYKDSTQVSKSSLKKKIQGFGGFLSGMVMPNIGAFIAWGLITALFIKTGWLPNDNLSKLVDPMIHYMLPMLIGYQGGK
LVYDTRGGVVGAIATMGMIVGASIPMFLGGMIIGPLGGYVIKKFDKAIENKIPTGFEMLVNNFSAGILGAALAIISYVAV
GPVVAGASTGLGSIALAITNQGLLPLIAVVVEPAKILFLNNAINHGVFSPLGIEQVQHLGKSVFFLLEADPGPGLGILLA
YSLYGKGSAKNSAPGAVIIHFLGGIHEIYFPYVLMKPFLLLAVIAGGICADLTFVLLKAGLVAAASPGSIIAILAMSPKG
GQLPVLAGVAVGAIVSFVVASIILKGSKEKSKDNFEEAQNKMKEMKKESKNQTTANSENVKNNDELVSSDIKLIVFACDA
GMGSSAMGESILKKELKNANIDGIKVQHYSVDSIPKEADVVFVQENLSERARKSAPDANIVTIKNFLDRSTYEGFMKKIK
K
>P50852 ~~~mtlA~~~PTS system mannitol-specific EIICB component~~~
MTHTSENQAGFRVKIQRFGSYLSGMIMPNIGAFIAWGIITALFIPTGWLPNETFAKLVGPMITYLLPLLIGYTGGKMIYD
VRGGVVGATATMGVIVGSDIPMFLGAMIMGPLGGYLIKKFDQQIQGKVKQGFEMLVNNFSAGIIGGLLTLAAFKGVGPVV
SAISKTLAAGVEKIVDLHLLPLANIFIEPGKVLFLNNAINHGILSPLGIEQAAKTGKSILFLLEPNPGPGLGILLAYWLF
GKGMAKQSAPGAIIIHFLGGIHEIYFPYVLMRPILILAAIAGGVSGVLTFTIFDAGLVAVPSPGSIFALLAMTPKGNYLG
VLAGVLVATAVSFFVASIFLKSAKNNEEDITKATEKMQQLKGKKSDVVAVLKNEEKVIPAKVKKIVFACDAGMGSSAMGA
SILRNKMQKAGLNIEVTNTAINQLPEDADIVITHQNLTDRAKEKLPKAFHISVENFLNSPKYDELIEMLKK
>Q7A4B3 ~~~mtlA~~~PTS system mannitol-specific EIICB component~~~
MSQTEEKKGIGRRVQAFGSFLSSMIMPNIGAFIAWGFIAAIFIDNGWLPNKDLATLAGPMITYLIPLLIAFSGGRLIYDL
RGGIIAATATMGVIVALPDTPMLLGAMIMGPLVGWLMKKTDQLIQPRTPQGFEMLFNNFSAGILGFIMTIAGFKILAPLM
KFIMHILSVAVEALVHAHLLPLVSILVEPAKIVFLNNAINHGVFTPLGADQAAKAGQSILYTIESNPGPGLGILLAYMIF
GKGTAKATSYGAGIIHFLGGIHEIYFPYVLMRPLLFIAVILGGMTGVATYQATGFGFKSPASPGSFIVYCLNAPRGEFLH
MLLGVFLAALVSFVVAALIMKFTREPKQDLEAATAQMENTKGKKSSVASKLVSSDKNVNTEENASGNVSETSSSDDDPEA
LLDNYNTEDVDAHNYNNINHVIFACDAGMGSSAMGASMLRNKFKKAGINDITVTNTAINQLPKDAQLVITQKKLTDRAIK
QTPNAIHISVDNFLNSPRYEELLNNLKKDDQA
>P28008 ~~~mtlA~~~PTS system mannitol-specific EIICB component~~~
MDTMSNSQQNKGIGRKVQAFGSFLSSMIMPNIGAFIAWGFIAAIFIDNGWFPNKDLAQLAGPMITYLIPLLIAFSGGRLI
HDLRGGIIAATATMGVIVALPDTPMLLGAMIMGPLVGWLMKKTDEFVQPRTPQGFEMLFNNFSAGILGFIMTIFGFEVLA
PIMKFIMHILSVGVEALVHAHLLPLVSILVEPAKIVFLNNAINHGVFTPLGADQAAHAGQSILYTIESNPGPGIGVLIAY
MIFGKGTAKATSYGAGIIQFFGGIHEIYFPYVLMRPLLFVSVILGGMTGVATYSLLDFGFKTPASPGSIIVYAINAPKGE
FLHMLTGVVLAALVSFVVSALILKFTKDPKQDLAEATAQMEATKGKKSSVASKLSAKDDNKAADNKTAETTTATAASNKA
EDKDSDELLDDYNTEDVDAHNYNNVDHVIFACDAGMGSSAMGASMLRNKFKNAGLENIQVTNTAINQLPKNAQLVITQKK
LTDRAIKQSPDAIHISVENFLNSPRYEELINNLKEDQD
>O31645 ~~~manP~~~PTS system mannose-specific EIIBCA component~~~COG1299
MKLLAITSCPNGIAHTYMAAENLQKAADRLGVSIKVETQGGIGVENKLTEEEIREADAIIIAADRSVNKDRFIGKKLLSV
GVQDGIRKPEELIQKALNGDIPVYRSATKSESGNHQEKKQNPIYRHLMNGVSFMVPFIVVGGLLIAVALTLGGEKTPKGL
VIPDDSFWKTIEQIGSASFSFMIPILAGYIAYSIADKPGLVPGMIGGYIAATGSFYDSASGAGFLGGIIAGFLAGYAALW
IKKLKVPKAIQPIMPIIIIPVFASLIVGLAFVFLIGAPVAQIFASLTVWLAGMKGSSSILLALILGAMISFDMGGPVNKV
AFLFGSAMIGEGNYEIMGPIAVAICIPPIGLGIATFLGKRKFEASQREMGKAAFTMGLFGITEGAIPFAAQDPLRVIPSI
MAGSMTGSVIAMIGNVGDRVAHGGPIVAVLGAVDHVLMFFIAVIAGSLVTALFVNVLKKDITASPVLSETAPTSAPSEAA
AANEIKQPIQSQKAEMSEFKKLTDIISPELIEPNLSGETSDDIIDELIQKLSRRGALLSESGFKQAILNREQQGTTAIGM
NIAIPHGKSEAVREPSVAFGIKRSGVDWNSLDGSEAKLIFMIAVPKESGGNQHLKILQMLSRKLMDDNYRERLLSVQTTE
EAYKLLEEIE
>P69797 2.7.1.191~~~manX~~~PTS system mannose-specific EIIAB component~~~COG2893
MTIAIVIGTHGWAAEQLLKTAEMLLGEQENVGWIDFVPGENAETLIEKYNAQLAKLDTTKGVLFLVDTWGGSPFNAASRI
VVDKEHYEVIAGVNIPMLVETLMARDDDPSFDELVALAVETGREGVKALKAKPVEKAAPAPAAAAPKAAPTPAKPMGPND
YMVIGLARIDDRLIHGQVATRWTKETNVSRIIVVSDEVAADTVRKTLLTQVAPPGVTAHVVDVAKMIRVYNNPKYAGERV
MLLFTNPTDVERLVEGGVKITSVNVGGMAFRQGKTQVNNAVSVDEKDIEAFKKLNARGIELEVRKVSTDPKLKMMDLISK
IDK
>Q5XAF5 2.7.1.191~~~manX~~~PTS system mannose-specific EIIAB component~~~
MGIGIIIASHGKFAEGIHQSGSMIFGEQEKVQVVTFMPNEGPDDLYGHFNNAIQQFDADDEILVLADLWSGSPFNQASRV
AGENPDRKMAIITGLNLPMLIQAYTERLMDAGAGIEQVAANIIKESKDGIKALPEDLNPVEETAATEKVVNALQGAIPAG
TVIGDGKLKINLARVDTRLLHGQVATAWTPASKADRIIVASDEVAQDDLRKQLIKQAAPGGVKANVVPISKLIEASKDPR
FGNTHALILFQTPQDALRAVEGGVEINELNVGSMAHSTGKTMVNNVLSMDKEDVATFEKLRDLSVTFDVRKVPNDSKKNL
FELIQKANIK
>P69801 ~~~manY~~~PTS system mannose-specific EIIC component~~~COG3715
MEITTLQIVLVFIVACIAGMGSILDEFQFHRPLIACTLVGIVLGDMKTGIIIGGTLEMIALGWMNIGAAVAPDAALASII
STILVIAGHQSIGAGIALAIPLAAAGQVLTIIVRTITVAFQHAADKAADNGNLTAISWIHVSSLFLQAMRVAIPAVIVAL
SVGTSEVQNMLNAIPEVVTNGLNIAGGMIVVVGYAMVINMMRAGYLMPFFYLGFVTAAFTNFNLVALGVIGTVMAVLYIQ
LSPKYNRVAGAPAQAAGNNDLDNELD
>P69805 ~~~manZ~~~PTS system mannose-specific EIID component~~~COG3716
MVDTTQTTTEKKLTQSDIRGVFLRSNLFQGSWNFERMQALGFCFSMVPAIRRLYPENNEARKQAIRRHLEFFNTQPFVAA
PILGVTLALEEQRANGAEIDDGAINGIKVGLMGPLAGVGDPIFWGTVRPVFAALGAGIAMSGSLLGPLLFFILFNLVRLA
TRYYGVAYGYSKGIDIVKDMGGGFLQKLTEGASILGLFVMGALVNKWTHVNIPLVVSRITDQTGKEHVTTVQTILDQLMP
GLVPLLLTFACMWLLRKKVNPLWIIVGFFVIGIAGYACGLLGL
>P54715 ~~~malP~~~PTS system maltose-specific EIICB component~~~COG1263
MMQKIQRFGSAMFVPVLLFAFAGIIVGISTLFKNKTLMGPLADPDGFWYQCWYIIEQGGWTVFNQMPLLFAIGIPVALAK
KAQARACLEALTVYLTFNYFVSAILTVWGGAFGVDMNQEVGGTSGLTMIAGIKTLDTNIIGAIFISSIVVFLHNRYFDKK
LPDFLGIFQGSTYIVMISFFIMIPIALAVSYIWPMVQSGIGSLQSFLVASGAVGVWIYTFLERILIPTGLHHFIYTPFIY
GPAVAEGGIVTYWAQHLGEYSQSAKPLKELFPQGGFALHGNSKIFGIPGIALAFYVTAKKEKKKLVAGLLIPVTLTAIVA
GITEPIEFTFLFISPFLFAVHAVLAATMSTVMYMAGVVGNMGGGLIEAVTLNWIPLFGSHGMTYVYQILIGLSFTAIYFF
VFRFLILKFNIATPGREKDEQQETKLYSKKEYRERKNKDETASAAETADDTAFLYIEALGGKDNITEVTNCATRLRVSVK
DETKVEPDSVFRALGAHGVVRNGKAFQVIIGLSVPQMRERVEKILNQ
>P19642 ~~~malX~~~PTS system maltose-specific EIICB component~~~COG1263
MTAKTAPKVTLWEFFQQLGKTFMLPVALLSFCGIMLGIGSSLSSHDVITLIPVLGNPVLQAIFTWMSKIGSFAFSFLPVM
FCIAIPLGLARENKGVAAFAGFIGYAVMNLAVNFWLTNKGILPTTDAAVLKANNIQSILGIQSIDTGILGAVIAGIIVWM
LHERFHNIRLPDALAFFGGTRFVPIISSLVMGLVGLVIPLVWPIFAMGISGLGHMINSAGDFGPMLFGTGERLLLPFGLH
HILVALIRFTDAGGTQEVCGQTVSGALTIFQAQLSCPTTHGFSESATRFLSQGKMPAFLGGLPGAALAMYHCARPENRHK
IKGLLISGLIACVVGGTTEPLEFLFLFVAPVLYVIHALLTGLGFTVMSVLGVTIGNTDGNIIDFVVFGILHGLSTKWYMV
PVVAAIWFVVYYVIFRFAITRFNLKTPGRDSEVASSIEKAVAGAPGKSGYNVPAILEALGGADNIVSLDNCITRLRLSVK
DMSLVNVQALKDNRAIGVVQLNQHNLQVVIGPQVQSVKDEMAGLMHTVQA
>P9WIA1 3.1.3.48~~~ptpA~~~Low molecular weight protein-tyrosine phosphatase A~~~COG0394
MSDPLHVTFVCTGNICRSPMAEKMFAQQLRHRGLGDAVRVTSAGTGNWHVGSCADERAAGVLRAHGYPTDHRAAQVGTEH
LAADLLVALDRNHARLLRQLGVEAARVRMLRSFDPRSGTHALDVEDPYYGDHSDFEEVFAVIESALPGLHDWVDERLARN
GPS
>A0A0H3K9F2 3.1.3.48~~~ptpA~~~Low molecular weight protein-tyrosine-phosphatase PtpA~~~
MVDVAFVCLGNICRSPMAEAIMRQRLKDRNIHDIKVHSRGTGSWNLGEPPHEGTQKILNKHNIPFDGMISELFEATDDFD
YIVAMDQSNVDNIKSINPNLKGQLFKLLEFSNMEESDVPDPYYTNNFEGVYDMVLSSCDNLIDYIVKDANLKEG
>Q7A4S1 3.1.3.48~~~ptpA~~~Low molecular weight protein-tyrosine-phosphatase PtpA~~~
MVDVAFVCLGNICRSPMAEAIMRQRLKDRNIHDIKVHSRGTGSWNLGEPPHEGTQKILNKHNIPFDGMISELFEATDDFD
YIVAMDQSNVDNIKSINPNLKGQLFKLLEFSNMEESDVPDPYYTNNFEGVYDMVLSSCDNLIDYIVKDANLKEG
>P0C5D2 3.1.3.48~~~ptpA~~~Low molecular weight protein-tyrosine-phosphatase PtpA~~~
MVDVAFVCLGNICRSPMAEAIMRQRLKDRNIHDIKVHSRGTGSWNLGEPPHEGTQKILNKHNIPFDGMISELFEATDDFD
YIVAMDQSNVDNIKSINPNLKGQLFKLLEFSNMEESDVPDPYYTNNFEGVYDMVLSSCDNLIDYIVKDANLKEG
>I6WXK4 3.1.3.-~~~ptpB~~~Triple specificity protein phosphatase PtpB~~~COG2365
MAVRELPGAWNFRDVADTATALRPGRLFRSSELSRLDDAGRATLRRLGITDVADLRSSREVARRGPGRVPDGIDVHLLPF
PDLADDDADDSAPHETAFKRLLTNDGSNGESGESSQSINDAATRYMTDEYRQFPTRNGAQRALHRVVTLLAAGRPVLTHC
FAGKDRTGFVVALVLEAVGLDRDVIVADYLRSNDSVPQLRARISEMIQQRFDTELAPEVVTFTKARLSDGVLGVRAEYLA
AARQTIDETYGSLGGYLRDAGISQATVNRMRGVLLG
>P0C5D3 3.1.3.48~~~ptpB~~~Low molecular weight protein-tyrosine-phosphatase PtpB~~~
MKILFVCTGNTCRSPLAESIAKEVMPNHQFESRGIFAVNNQGVSNYVEDLVEEHHLAETTLSQQFTEADLKADIILTMSY
SHKELIEAHFGLQNHVFTLHEYVKEAGEVIDPYGGTKEMYVHTYEELVSLILKLKDIIC
>P42910 ~~~agaC~~~N-acetylgalactosamine permease IIC component 1~~~COG3715
MHEITLLQGLSLAALVFVLGIDFWLEALFLFRPIIVCTLTGAILGDIQTGLITGGLTELAFAGLTPAGGVQPPNPIMAGL
MTTVIAWSTGVDAKTAIGLGLPFSLLMQYVILFFYSAFSLFMTKADKCAKEADTAAFSRLNWTTMLIVASAYAVIAFLCT
YLAQGAMQALVKAMPAWLTHGFEVAGGILPAVGFGLLLRVMFKAQYIPYLIAGFLFVCYIQVSNLLPVAVLGAGFAVYEF
FNAKSRQQAQPQPVASKNEEEDYSNGI
>O52787 3.1.3.48~~~ptp~~~Low molecular weight protein-tyrosine-phosphatase Ptp~~~
MQFKNILVVCIGNICRSPMAEYLLKQNYPQLTIHSAGISGMIGYSADEKAQLCMERIGIDMSPHIAKKLNAELLKQADLI
LVMSQNQQKHIEQTWPFAKGKTFRLGHWQGKNIPDPYQHDQAFFDETSLLIQTCVADWTKHI
>Q7MUW6 3.4.14.12~~~ptpA~~~Prolyl tripeptidyl peptidase~~~COG0823
MKKTIFQQLFLSVCALTVALPCSAQSPETSGKEFTLEQLMPGGKEFYNFYPEYVVGLQWMGDNYVFIEGDDLVFNKANGK
SAQTTRFSAADLNALMPEGCKFQTTDAFPSFRTLDAGRGLVVLFTQGGLVGFDMLARKVTYLFDTNEETASLDFSPVGDR
VAYVRNHNLYIARGGKLGEGMSRAIAVTIDGTETLVYGQAVHQREFGIEKGTFWSPKGSCLAFYRMDQSMVKPTPIVDYH
PLEAESKPLYYPMAGTPSHHVTVGIYHLATGKTVYLQTGEPKEKFLTNLSWSPDENILYVAEVNRAQNECKVNAYDAETG
RFVRTLFVETDKHYVEPLHPLTFLPGSNNQFIWQSRRDGWNHLYLYDTTGRLIRQVTKGEWEVTNFAGFDPKGTRLYFES
TEASPLERHFYCIDIKGGKTKDLTPESGMHRTQLSPDGSAIIDIFQSPTVPRKVTVTNIGKGSHTLLEAKNPDTGYAMPE
IRTGTIMAADGQTPLYYKLTMPLHFDPAKKYPVIVYVYGGPHAQLVTKTWRSSVGGWDIYMAQKGYAVFTVDSRGSANRG
AAFEQVIHRRLGQTEMADQMCGVDFLKSQSWVDADRIGVHGWSYGGFMTTNLMLTHGDVFKVGVAGGPVIDWNRYEIMYG
ERYFDAPQENPEGYDAANLLKRAGDLKGRLMLIHGAIDPVVVWQHSLLFLDACVKARTYPDYYVYPSHEHNVMGPDRVHL
YETITRYFTDHL
>P69791 ~~~chbA~~~PTS system N,N'-diacetylchitobiose-specific EIIA component~~~COG1447
MMDLDNIPDTQTEAEELEEVVMGLIINSGQARSLAYAALKQAKQGDFAAAKAMMDQSRMALNEAHLVQTKLIEGDAGEGK
MKVSLVLVHAQDHLMTSMLARELITELIELHEKLKA
>P69795 2.7.1.196~~~chbB~~~PTS system N,N'-diacetylchitobiose-specific EIIB component~~~COG1440
MEKKHIYLFCSAGMSTSLLVSKMRAQAEKYEVPVIIEAFPETLAGEKGQNADVVLLGPQIAYMLPEIQRLLPNKPVEVID
SLLYGKVDGLGVLKAAVAAIKKAAAN
>P17334 ~~~chbC~~~PTS system N,N'-diacetylchitobiose-specific EIIC component~~~COG1455
MSNVIASLEKVLLPFAVKIGKQPHVNAIKNGFIRLMPLTLAGAMFVLINNVFLSFGEGSFFYSLGIRLDASTIETLNGLK
GIGGNVYNGTLGIMSLMAPFFIGMALAEERKVDALAAGLLSVAAFMTVTPYSVGEAYAVGANWLGGANIISGIIIGLVVA
EMFTFIVRRNWVIKLPDSVPASVSRSFSALIPGFIILSVMGIIAWALNTWGTNFHQIIMDTISTPLASLGSVVGWAYVIF
VPLLWFFGIHGALALTALDNGIMTPWALENIATYQQYGSVEAALAAGKTFHIWAKPMLDSFIFLGGSGATLGLILAIFIA
SRRADYRQVAKLALPSGIFQINEPILFGLPIIMNPVMFIPFVLVQPILAAITLAAYYMGIIPPVTNIAPWTMPTGLGAFF
NTNGSVAALLVALFNLGIATLIYLPFVVVANKAQNAIDKEESEEDIANALKF
>P05458 3.4.24.55~~~ptrA~~~Protease 3~~~COG1025
MPRSTWFKALLLLVALWAPLSQAETGWQPIQETIRKSDKDNRQYQAIRLDNGMVVLLVSDPQAVKSLSALVVPVGSLEDP
EAYQGLAHYLEHMSLMGSKKYPQADSLAEYLKMHGGSHNASTAPYRTAFYLEVENDALPGAVDRLADAIAEPLLDKKYAE
RERNAVNAELTMARTRDGMRMAQVSAETINPAHPGSKFSGGNLETLSDKPGNPVQQALKDFHEKYYSANLMKAVIYSNKP
LPELAKMAADTFGRVPNKESKKPEITVPVVTDAQKGIIIHYVPALPRKVLRVEFRIDNNSAKFRSKTDELITYLIGNRSP
GTLSDWLQKQGLVEGISANSDPIVNGNSGVLAISASLTDKGLANRDQVVAAIFSYLNLLREKGIDKQYFDELANVLDIDF
RYPSITRDMDYVEWLADTMIRVPVEHTLDAVNIADRYDAKAVKERLAMMTPQNARIWYISPKEPHNKTAYFVDAPYQVDK
ISAQTFADWQKKAADIALSLPELNPYIPDDFSLIKSEKKYDHPELIVDESNLRVVYAPSRYFASEPKADVSLILRNPKAM
DSARNQVMFALNDYLAGLALDQLSNQASVGGISFSTNANNGLMVNANGYTQRLPQLFQALLEGYFSYTATEDQLEQAKSW
YNQMMDSAEKGKAFEQAIMPAQMLSQVPYFSRDERRKILPSITLKEVLAYRDALKSGARPEFMVIGNMTEAQATTLARDV
QKQLGADGSEWCRNKDVVVDKKQSVIFEKAGNSTDSALAAVFVPTGYDEYTSSAYSSLLGQIVQPWFYNQLRTEEQLGYA
VFAFPMSVGRQWGMGFLLQSNDKQPSFLWERYKAFFPTAEAKLRAMKPDEFAQIQQAVITQMLQAPQTLGEEASKLSKDF
DRGNMRFDSRDKIVAQIKLLTPQKLADFFHQAVVEPQGMAILSQISGSQNGKAEYVHPEGWKVWENVSALQQTMPLMSEK
NE
>P37080 ~~~sorF~~~PTS system sorbose-specific EIIA component~~~
MVHAIFCAHGQLAGAMLDSVCMVYGEVNVSAVAFVPGENAADIAINLEKLVSAHTDEEWVIAVDLQCGSPWNAAAGLAMR
HPQIRVISGLSLPLALELVDNQHTLSADDLCQHLQAIASQCCVVWQQPETVEEEF
>Q9RGG5 ~~~sorA~~~PTS system sorbose-specific EIIA component~~~COG2893
MEIILVGHAHTAKAFKEAVEMIYGEVPNFHPIDFTPKEGLQSLTDKIVSAIEPGKTASTLIITDLFSGTPYNASAELVLK
KKAADVVAGMCLPMLLEVAVNANNMSVSQLVSHLMKIKEEFSTSLSEKMTTNAKEDDF
>P24555 3.4.21.83~~~ptrB~~~Protease 2~~~COG1770
MLPKAARIPHAMTLHGDTRIDNYYWLRDDTRSQPEVLDYLQQENSYGHRVMASQQALQDRILKEIIDRIPQREVSAPYIK
NGYRYRHIYEPGCEYAIYQRQSAFSEEWDEWETLLDANKRAAHSEFYSMGGMAITPDNTIMALAEDFLSRRQYGIRFRNL
ETGNWYPELLDNVEPSFVWANDSWIFYYVRKHPVTLLPYQVWRHAIGTPASQDKLIYEEKDDTYYVSLHKTTSKHYVVIH
LASATTSEVRLLDAEMADAEPFVFLPRRKDHEYSLDHYQHRFYLRSNRHGKNFGLYRTRMRDEQQWEELIPPRENIMLEG
FTLFTDWLVVEERQRGLTSLRQINRKTREVIGIAFDDPAYVTWIAYNPEPETARLRYGYSSMTTPDTLFELDMDTGERRV
LKQTEVPGFYAANYRSEHLWIVARDGVEVPVSLVYHRKHFRKGHNPLLVYGYGSYGASIDADFSFSRLSLLDRGFVYAIV
HVRGGGELGQQWYEDGKFLKKKNTFNDYLDACDALLKLGYGSPSLCYAMGGSAGGMLMGVAINQRPELFHGVIAQVPFVD
VVTTMLDESIPLTTGEFEEWGNPQDPQYYEYMKSYSPYDNVTAQAYPHLLVTTGLHDSQVQYWEPAKWVAKLRELKTDDH
LLLLCTDMDSGHGGKSGRFKSYEGVAMEYAFLVALAQGTLPATPAD
>P37081 2.7.1.206~~~sorB~~~PTS system sorbose-specific EIIB component~~~
MQITLARIDDRLIHGQVTTVWSKVANAQRIIICNDDVFNDEVRRTLLRQAAPPGMKVNVVSLEKAVAVYHNPQYQDETVF
YLFTNPHDVLTMVRQGVQIATLNIGGMAWRPGKKQLTKAVSLDPQDIQAFRELDKLGVKLDLRVVASDPSVNILDKINET
AFCE
>Q9RGG4 2.7.1.206~~~sorB~~~PTS system sorbose-specific EIIB component~~~COG3444
MIITLARVDDRLIHGQVTTVWSKESNADRIIIVSSEVYKDDIRKTLLKQAAPPGMKVNIVDVPKAIAVYNNPKYQNEKVF
YLFTNPREVVDLVKGGIPLEKLNIGGMQFKQGKTQISKAVSLDADDVAAFRELHQLGVKLDLRVVKTDPSSDILAKIDEV
FGKE
>P37082 ~~~sorA~~~PTS system sorbose-specific EIIC component~~~
MEISTLQIIAIFIFSCIAGMGSVLDEFQTHRPLIACTVIGLILGDLKTGVMLGGTLELIALGWMNVGAAQSPDSALASII
SAILVIVGHQSIAIGIAIALPVAAAGQVLTVFARTITVVFQHAADKAAEEARFRTIDLLHVSALGVQGLRVAIPALVVSL
FVSADMVSSMLSAIPEFVTRGLQIAGGFIVVVGYAMVLRMMGVKYLMPFFFLGFLAGGYLDFSLLAFGGVGVIIALIYIQ
LNPQWRKAEPAASTAPSAPALDQLDD
>Q9RGG3 ~~~sorC~~~PTS system sorbose-specific EIIC component~~~COG3715
MAISTIQIILIFIWSSVVGMGSVLDEFQTHRPLIACSIMGLILGDPKTGIILGGTLELIALGWMNIGAAQSPDSALASTI
STILVIVGNQDIQKGIAIALPVAAAGQVLTVLARTVTVAFQHAADREAEKANFTAIIWLHFTALIVQALRVSIPTTIVAV
FVSPEEIKSMLDALPEVITGGLAVAGGFIVVVGYAMILNMMSVKYLMPFFYQGFVLGGYLKLSLLAWGAVGLIFAIVYVQ
LNPKFATNHNNGTGGSGGTVAAAGDHPAALPEDELDD
>P37083 ~~~sorM~~~PTS system sorbose-specific EIID component~~~
MEQKKITQGDLVSMFLRSNLQQASFNFERIHGLGFCYDMIPAIKRLYPLKADQVAALKRHLVFFNTTPAVCGPVIAVTAA
MEEARANGAAIDDGAINGIKVGLMGPLAGVGDPLVWGTLRPITAALGASLALSGNILGPLLFFFIFNAVRLAMKWYGLQL
GFRKGVNIVSDMGGNLLQKLTEGASILGLFVMGVLVTKWTTINVPLVVSQTPGADGATVTMTVQNILDQLCPGLLALGLT
LLMVRLLNKKVNPVWLIFALFGLGIIGNALGFLS
>Q9RGG2 ~~~sorD~~~PTS system sorbose-specific EIID component~~~COG3716
MADQPVQVNKLKTKITKGDMFKTFVFENFQQASFNFERIHALAFCVDMIPTIKRVYSKKEDQVAALKRHLVFFNTTPAMC
GPIVGVTMALEEGRAAGEPIDDGTINSFKVGLMGPLAGVGDPLMWGTLRPILAALGASLALQGSWLGPILFFVAFNAVRL
SLKWYGLQLGFSRGLALVKDMSGNLLQKITEGATVLGLFIMGILVTKWTTINVPLVVSKTTVNGKTTVTTLQNILDQFCP
GLLALGWTLLCMYLLRKKVSPILLIFALFGVGIVGYWLGILK
>P40193 ~~~ptsJ~~~Vitamin B6 salvage pathway transcriptional repressor PtsJ~~~
MIDGKTANEIFDSIRQHIIAGTLRAEDSLPPVRELASELKVNRNTVAAAYKRLITAGLAQSLGRNGTVIKGSPSPVALEG
GDPHTPLHDLSGGNPDPQRLPDLSRYFARLSRTPHLYGDAPVSPELHAWAARWLRDATPVAGEIDITSGAIDAIERLLCA
HLLPGDSVAVEDPCFLSSINMLRYAGFSASPVSVDSEGMQPEKLERALNQGARAVILTPRAHNPTGCSLSARRAAALQNM
LARYPQVVVIIDDHFALLSSSPWQPVIAQTTQHWAVIRSVSKTLGPDLRLAIVASDSATSAKLRLRLNAGSQWVSHLLQD
LVYACLTDPEYQHRLTQTRLFYAARQQKLARALQQYGIAISPGDGVNAWLPLDTHSQATAFTLAKSGWLVREGEAFGVSA
PSHGLRITLSTLNDSEINTLAADIHQALNR
>P69829 ~~~ptsN~~~Nitrogen regulatory protein~~~COG1762
MTNNDTTLQLSSVLNRECTRSRVHCQSKKRALEIISELAAKQLSLPPQVVFEAILTREKMGSTGIGNGIAIPHGKLEEDT
LRAVGVFVQLETPIAFDAIDNQPVDLLFALLVPADQTKTHLHTLSLVAKRLADKTICRRLRAAQSDEELYQIITDTEGTP
DEA
>P0A9N0 ~~~ptsO~~~Phosphocarrier protein NPr~~~COG1925
MTVKQTVEITNKLGMHARPAMKLFELMQGFDAEVLLRNDEGTEAEANSVIALLMLDSAKGRQIEVEATGPQEEEALAAVI
ALFNSGFDED
>P39794 ~~~treP~~~PTS system trehalose-specific EIIBC component~~~COG1263
MGELNKSARQIVEAVGGAENIAAATHCVTRLRFALIDESKVDQEMLDQIDVVKGSFSTNGQFQVVIGQGTVNKVYAELVK
ETGIGESTKDEVKKASEKNMNPLQRAVKTLADIFIPILPAIVTAGLLMGINNILTAEGIFFSTKSIVQVYPQWADLANMI
NLIAGTAFTFLPALIGWSAVKRFGGNPLLGIVLGVMLVHPDLLNAWGYGAAEQSGEIPVWNLFGLEVQKVGYQGQVLPIL
LASYMLAKIEVFLTKRTPEGIQLLVVAPITLLLTGFASFIIIGPITFAIGNVLTSGLISVFGSFAALGGLLYGGFYSALV
ITGMHHTFLAVDLQLIGSKLGGTFLWPMLALSNIAQGSAALAMMFIVKDEKQKGLSLTSGISAYLGITEPAIFGVNLRYR
FPFIIAMVSSGLAGMYISSQGVLASSVGVGGVPGIFSIMSQYWGAFAIGMAIVLIVPFAGTYAYARFKHK
>P36672 ~~~treB~~~PTS system trehalose-specific EIIBC component~~~COG1263
MMSKINQTDIDRLIELVGGRGNIATVSHCITRLRFVLNQPANARPKEIEQLPMVKGCFTNAGQFQVVIGTNVGDYYQALI
ASTGQAQVDKEQVKKAARHNMKWHEQLISHFAVIFFPLLPALISGGLILGFRNVIGDLPMSNGQTLAQMYPSLQTIYDFL
WLIGEAIFFYLPVGICWSAVKKMGGTPILGIVLGVTLVSPQLMNAYLLGQQLPEVWDFGMFSIAKVGYQAQVIPALLAGL
ALGVIETRLKRIVPDYLYLVVVPVCSLILAVFLAHALIGPFGRMIGDGVAFAVRHLMTGSFAPIGAALFGFLYAPLVITG
VHQTTLAIDLQMIQSMGGTPVWPLIALSNIAQGSAVIGIIISSRKHNEREISVPAAISAWLGVTEPAMYGINLKYRFPML
CAMIGSGLAGLLCGLNGVMANGIGVGGLPGILSIQPSYWQVFALAMAIAIIIPIVLTSFIYQRKYRLGTLDIV
>Q7A3G4 ~~~glcB~~~PTS system glucoside-specific EIICBA component~~~
MFKKLFGQLQRIGKALMLPVAILPAAGILLAFGNAMHNEQLVEIAPWLKNDIIVMISSVMEAAGQVVFDNLPLLFAVGTA
LGLAGGDGVAALAALVGYLIMNATMGKVLHITIDDIFSYAKGAKELSQAAKEPAHALVLGIPTLQTGVFGGIIMGALAAW
CYNKFYNITLPPFLGFFAGKRFVPIVTSVVAIATGVLLSFAWPPIQDGLNSLSNFLLNKNLTLTTFIFGIIERSLIPFGL
HHIFYSPFWFEFGSYTNHAGELVRGDQRIWMAQLKDGVPFTAGAFTTGKYPFMMFGLPAAAFAIYKNARPERKKVVGGLM
LSAGLTAFLTGITEPLEFSFLFVAPVLYGIHVLLAGTSFLVMHLLGVKIGMTFSGGFIDYILYGLLNWDRSHALLVIPVG
IVYAIVYYFLFDFAIRKFKLKTPGREDEETEIRNSSVAKLPFDVLDAMGGKENIKHLDACITRLRVEVVDKSKVDVAGIK
ALGASGVLEVGNNMQAIFGPKSDQIKHDMAKIMSGEITKPSETTVTEEMSDEPVHVEALGTTDIYAPGVGQIIPLSEVPD
QVFAGKMMGDGIGFIPEKGEIVAPFDGTVKTIFPTKHAIGLESESGVEVLIHIGIDTVKLNGEGFESLINVDEKVTQAQP
LMKVNLAYLKAHAPSIVTPMIITNLENKELVIEDVQDADPGKLIMTVK
>Q53922 ~~~glcB~~~PTS system glucoside-specific EIICBA component~~~COG1263
MKNLLKKFFGQLQRIGKALMLPVAILPAAGILLTFGNAMHNEQILHFAPWMQHHYIQLISQIMEASGQVIFDNLPLLFAM
GTALGLAGGDGVAGIAALVGYLIMSATMGKIAGITIDDIFSYADGAKTLGQSAKDPAHALVLGIPTLQTGVFGGIIIGAL
AAWCYNKFYNIQLPQFLGFFAGKRFVPIITSLVAIVTGIVLSFVWPPVQDGLNNLSNFLLGKNLALTTFIFGIIERSLIP
FGLHHIFYAPFWFEFGHYVNESGNLVRGDQRIWMAQYQDGVPFTAGAFTTGKYPFMMFGLPAAAFAIYRQAKPERRKVVG
GLMLSAALTSFLTGITEPLEFSFLFVAPILYVAHVILAGTSFLIMHLLHVQIGMTFSGGFIDYILYGLLSWDRSNALLVI
PVGIAYALIYYFLFTFLIKKLNLKTPGREDKEVESKDVSVSELPFEVLEAMGNKDNIKHLDACITRLRVEVRDKGLVDVE
KLKQLGASGVLEVGNNMQAIFGPKSDQIKHDMQQIMDGKITSPAETTVTEDGDVETAEIVAEGGAVIYAPITGEAVDLSE
VPDKVFSAKMMGDGIAIKPETGEVVAPFDGKVKMIFPTKHAIGLESKDGIELLIHFGLETVKLDGEGFEILVKENDNIVL
GQPLMKVDLNYIKEHADDTITPIIITNAGSANIEVLHTGKVEQGEKLLLVNN
>P08722 ~~~bglF~~~PTS system beta-glucoside-specific EIIBCA component~~~COG1263
MTELARKIVAGVGGADNIVSLMHCATRLRFKLKDESKAQAEVLKKTPGIIMVVESGGQFQVVIGNHVADVFLAVNSVAGL
DEKAQQAPENDDKGNLLNRFVYVISGIFTPLIGLMAATGILKGMLALALTFQWTTEQSGTYLILFSASDALFWFFPIILG
YTAGKRFGGNPFTAMVIGGALVHPLILTAFENGQKADALGLDFLGIPVTLLNYSSSVIPIIFSAWLCSILERRLNAWLPS
AIKNFFTPLLCLMVITPVTFLLVGPLSTWISELIAAGYLWLYQAVPAFAGAVMGGFWQIFVMFGLHWGLVPLCINNFTVL
GYDTMIPLLMPAIMAQVGAALGVFLCERDAQKKVVAGSAALTSLFGITEPAVYGVNLPRKYPFVIACISGALGATIIGYA
QTKVYSFGLPSIFTFMQTIPSTGIDFTVWASVIGGVIAIGCAFVGTVMLHFITAKRQPAQGAPQEKTPEVITPPEQGGIC
SPMTGEIVPLIHVADTTFASGLLGKGIAILPSVGEVRSPVAGRIASLFATLHAIGIESDDGVEILIHVGIDTVKLDGKFF
SAHVNVGDKVNTGDRLISFDIPAIREAGFDLTTPVLISNSDDFTDVLPHGTAQISAGEPLLSIIR
>P39816 ~~~gamP~~~Putative PTS system glucosamine-specific EIICBA component~~~COG1263
MFKKAFQILQQLGRALMTPVAVLPAAGLLLRFGDKDLLNIPIIKDAGGVVFDNLPLIFAVGVAIGLAGGEGVAGLAAVIG
YLILTVTLDNMGKLLGLQPPYEGAEHLIDMGVFGGIIIGLLAAYLYKRFSSIELHPVLGFFSGKRFVPIITSVSSLVIGV
IFSFVWPLIQNGINAASSLIADSTVGLFFYATIYRLLIPFGLHHIFYTPFYFMMGEYTDPSTGNTVTGDLTRFFAGDPTA
GRFMMGDFPYMIFCLPAVALAIIHTARPEKKKMISGVMISAALTSMLTGITEPVEFSFLFVAPVLYLINSILAGVIFVVC
DLFHVRHGYTFSGGGIDYVLNYGLSTNGWVVIPVGIVFAFIYYYLFRFAILKWNLKTPGRETDEDGQNEEKAPVAKDQLA
FHVLQALGGQQNIANLDACITRLRVTVHQPSQVCKDELKRLGAVGVLEVNNNFQAIFGTKSDALKDDIKTIMAGGVPATA
AALDTVTDKPLKPDSDETFIYPIKGETVSLGDVPDQVFSEKMMGEGFAIIPSEGKVVAPADGEIVSIFPTKHAIGFMSAG
GTEILIHVGIDTVKLNGEGFEAHVTSGQAVKQGELLLTFDLNYIKQHAASAITPVIFTNTSEEDLKHIQMK
>P09323 ~~~nagE~~~PTS system N-acetylglucosamine-specific EIICBA component~~~COG1263
MNILGFFQRLGRALQLPIAVLPVAALLLRFGQPDLLNVAFIAQAGGAIFDNLALIFAIGVASSWSKDSAGAAALAGAVGY
FVLTKAMVTINPEINMGVLAGIITGLVGGAAYNRWSDIKLPDFLSFFGGKRFVPIATGFFCLVLAAIFGYVWPPVQHAIH
AGGEWIVSAGALGSGIFGFINRLLIPTGLHQVLNTIAWFQIGEFTNAAGTVFHGDINRFYAGDGTAGMFMSGFFPIMMFG
LPGAALAMYFAAPKERRPMVGGMLLSVAVTAFLTGVTEPLEFLFMFLAPLLYLLHALLTGISLFVATLLGIHAGFSFSAG
AIDYALMYNLPAASQNVWMLLVMGVIFFAIYFVVFSLVIRMFNLKTPGREDKEDEIVTEEANSNTEEGLTQLATNYIAAV
GGTDNLKAIDACITRLRLTVADSARVNDTMCKRLGASGVVKLNKQTIQVIVGAKAESIGDAMKKVVARGPVAAASAEATP
ATAAPVAKPQAVPNAVSIAELVSPITGDVVALDQVPDEAFASKAVGDGVAVKPTDKIVVSPAAGTIVKIFNTNHAFCLET
EKGAEIVVHMGIDTVALEGKGFKRLVEEGAQVSAGQPILEMDLDYLNANARSMISPVVCSNIDDFSGLIIKAQGHIVAGQ
TPLYEIKK
>Q9S2H6 2.7.1.193~~~nagF~~~PTS system N-acetylglucosamine-specific EIIB component~~~COG1264
MASKAEKIVAGLGGIDNIDEIEGCITRLRTEVNDPALVNEAALKAAGAHGVVKMGTAIQVVIGTDADPIAAEIEDMM
>O34521 ~~~nagP~~~PTS system N-acetylglucosamine-specific EIICB component~~~COG1263
MLSFLQKLGKSFMLPIAVLPAVGIILALGREDVFNIPFVYQAGTAVFDHLPLIFAIGIAIGISKDSNGAAGLSGAISYLM
LDAATKTIDKTNNMAVFGGIIAGLIAGYTYNRFKDTKLPEYLGFFSGRRLVPILTAIITIILAGIFGVVWPPIQSCINSF
GEWMLGLGGIGAGIFGLFNRLLIPLGLHHVLNNIFWFQFGEYNGVTGDLARFFAKDPTAGTYMTGFFPIMMFGLPAACLA
MVVTAKPSKRKATAGMMIGFALTAFITGITEPIEFAFMFLSPLLYAVHAVLTGLSLFIVNWLGIRSGFSFSAGAIDYVLS
YGIAEKPLLLLLVGICYAAVYFIVFYVLIKALNLKTPGREDDDVDEVLDENTVQDVNENIMLKGLGGKENLQTIDHCATR
LRLTVKDTALVDEALLKKAGAKGVVKSGGQSVQVIIGPNVEFAAEELRAAVK
>Q9S2H4 ~~~nagE2~~~PTS system N-acetylglucosamine-specific EIIC component~~~COG1263
MSTATDTAAPAKKRGSGLFQGLQKVGRSLQLPIAVLPAAGIMVRLGQDDIFGKDGLGWDKVAAVFNNAGGALTGSLPILF
CIGVAIGFAKKADGSTALAAVVGFLVYSKVLEAFPVTEAVVQDGADVAATYNDPGVLGGIIMGLLAAVLWQRYHRKKLVD
WLGFFNGRRLVPIIMAFVGIVVGVFFGLVWEPIGDGISNFGEWMTGLGSGGAALFGGVNRALIPVGMHQFVNTVAWFQLG
DFTNSAGDVVHGDITRFLAGDPSAGIFQAGFFPIMMFGLPAAALAMAHTARPERRKAVLGMMISLAATSFVTGVTEPIEF
SFMFIAPVLYVLHAVLTAISMAITWGLGVHAGFNFSAGFIDYALNWHLATKPWLIIPIGLVFAAIYYVTFRFAIVKFNLK
TPGREPEEEVEDLTKA
>Q2FK70 ~~~murP~~~PTS system MurNAc-GlcNAc-specific EIIBC component~~~
MTKEQQLAERIIAAVGGMDNIDSVMNCMTRVRIKVLDENKVDDQELRHIDGVMGVIHDERIQVVVGPGTVNKVANHMAEL
SGVKLGDPIPHHHNDSEKMDYKSYAADKAKANKEAHKAKQKNGKLNKVLKSIANIFIPLIPAFIGAGLIGGIAAVLSNLM
VAGYISGAWITQLITVFNVIKDGMLAYLAIFTGINAAKEFGATPGLGGVIGGTTLLTGIAGKNILMNVFTGEPLQPGQGG
IIGVIFAVWILSIVEKRLHKIVPNAIDIIVTPTIALLIVGLLTIFIFMPLAGFVSDSLVSVVNGIISIGGVFSGFIIGAS
FLPLVMLGLHHIFTPIHIEMINQSGATYLLPIAAMAGAGQVGAALALWVRCKRNTTLRNTLKGALPVGFLGIGEPLIYGV
TLPLGRPFLTACIGGGIGGAVIGGIGHIGAKAIGPSGVSLLPLISDNMYLGYIAGLLAAYAGGFVCTYLFGTTKAMRQTD
LLGD
>Q7A804 ~~~~~~PTS system MurNAc-GlcNAc-specific EIIBC component~~~
MTKEQQLAERIIAAVGGMDNIDSVMNCMTRVRIKVLDENKVDDQELRHIDGVMGVIHDERIQVVVGPGTVNKVANHMAEL
SGVKLGDPIPHHHNDSEKMDYKSYAADKAKANKEAHKAKQKNGKLNKVLKSIANIFIPLIPAFIGAGLIGGIAAVLSNLM
VAGYISGAWITQLITVFNVIKDGMLAYLAIFTGINAAKEFGATPGLGGVIGGTTLLTGIAGKNILMNVFTGEPLQPGQGG
IIGVIFAVWILSIVEKRLHKIVPNAIDIIVTPTIALLIVGLLTIFIFMPLAGFVSDSLVSVVNGIISIGGVFSGFIIGAS
FLPLVMLGLHHIFTPIHIEMINQSGATYLLPIAAMAGAGQVGAALALWVRCKRNTTLRNTLKGALPVGFLGIGEPLIYGV
TLPLGRPFLTACIGGGIGGAVIGGIGHIGAKAIGPSGVSLLPLISDNMYLGYIAGLLAAYAGGFVCTYLFGTTKAMRQTD
LLGD
>O69052 ~~~ptxB~~~Probable phosphite transport system-binding protein PtxB~~~
MKRLSALLLTCLLSAVSSLSALAADADPDVLKVALLPDENASELIKRNQPLKDYLEEHLDKKVQLIVTTDYSSMIEAMRF
GRIDLAYFGPLSYVMAKSKSDIEPFAAMVIDGKPTYRSVIIANVASGVNEYADLKGKRMAYGDRASTSSHLIPKTVLLET
ADLTGGQDYEQHFVGTHDAVAVNVANGNADAGGLSEVIFNHAAERGLIDPSKVKVLGYSGEYPQYPWAMRSNLSPELKTK
VRDVFVGIDDPEVLRNFKAEAFAPITDADYDVIRNMGSLLGLDFATM
>P31452 ~~~glvC~~~Phosphotransferase IIC component GlvC~~~COG1263
MLSQIQRFGGAMFTPVLLFPFAGIVVGLAILLQNPMFVGESLTDPNSLFAQIVHIIEEGGWTVFRNMPLIFAVGLPIGLA
KQAQGRACLAVMVSFLTWNYFINAMGMTWGSYFGVDFTQDAVAGSGLTMMAGIKTLDTSIIGAIIISGIVTALHNRLFDK
KLPVFLGIFQGTSYVVIIAFLVMIPCAWLTLLGWPKVQMGIESLQAFLRSAGALGVWVYTFLERILIPTGLHHFIYGQFI
FGPAAVEGGIQMYWAQHLQEFSLSAEPLKSLFPEGGFALHGNSKIFGAVGISLAMYFTAAPENRVKVAGLLIPATLTAML
VGITEPLEFTFLFISPLLFAVHAVLAASMSTVMYLFGVVGNMGGGLID
>O69054 1.20.1.1~~~ptxD~~~Phosphonate dehydrogenase~~~
MLPKLVITHRVHDEILQLLAPHCELMTNQTDSTLTREEILRRCRDAQAMMAFMPDRVDADFLQACPELRVVGCALKGFDN
FDVDACTARGVWLTFVPDLLTVPTAELAIGLAVGLGRHLRAADAFVRSGEFQGWQPQFYGTGLDNATVGILGMGAIGLAM
ADRLQGWGATLQYHEAKALDTQTEQRLGLRQVACSELFASSDFILLALPLNADTQHLVNAELLALVRPGALLVNPCRGSV
VDEAAVLAALERGQLGGYAADVFEMEDWARADRPRLIDPALLAHPNTLFTPHIGSAVRAVRLEIERCAAQNIIQVLAGAR
PINAANRLPKAEPAAC
>P72131 ~~~ptxR~~~HTH-type transcriptional regulator PtxR~~~
MSAALERLNHLNLNHLYAFVAVAEHNSFTAAAEALGLSKSLLSEQLRRLEADLGIQLLTRTTRRMTLTDRGELLFGVAQR
MLGELDGALSDVRDLQGEPSGRLRITAPQDFVKWHISSVSAAFIRQFPKVQVEMLADDQFSDLVGQRIDLAVRIGWPRDS
GLHASKLCDFQQVAVATPGYLAGLPPVLQPHDLARCEWIGHTRLSTPWTWTFERQRERATVQTRGRLLANNTLAVYRLVL
DGAGVSVLPSFLVAREIARGRLVRLLPGWRLPQGGIYALYPSARYMPVRVRAFIESLREHLGREPFRLAQSE
>G3XD97 ~~~ptxS~~~HTH-type transcriptional regulator PtxS~~~
MNGSVLPSRGRVTINQVAEAAGVSKASVSRYIGGDRQLLADATARRIERAIDQLDYRPNQMARGLKRGRTRLIGMLVADI
LNPYSVAVMHGVETACREHGYSLVVCNTDRDDEQERHHLAALQSYNVEGLIVNTLGHHPGELRALHRELPMVLVDRQLAE
LDTDLVGLDNADAVEQALDHLQHRGFRDILLVTEPLDGTSSRIERVQAFNASIGRRPALKGQVLQTDDFFRDGLRAFLSA
SGPGPKALFTCNGVATLCATRQLRDLGCRLFDEVGLLALDELDWYPLVGSGITALAQPTDEIGRTAFERLLARLEGDREP
ARRVTFPAQLIVRGSTHPRG
>A0A167V873 ~~~ptxS~~~HTH-type transcriptional regulator PtxS~~~
MDKTLSQARTRVTISEVAQAAGVSKATVSRYIGGDRQLLADATAQRIEAVIEQLGYRPNRMASALKRGRTRLIGMLLADI
RNPYSVAVMHGVETACREHGYSLVVCNTDCDDARERQHLQALQAYNVDGLIVNTLGHHAGELASLAQELPMVLVDRQLAE
LQTDLVGLDNADAVEQALDHLHACGYRDILAVSEPLDGTSSRQERVAAFQASIARRSGLRGQVLEVSANLPGQLAAFLAS
AGHGPQALFSCNGVATLEVMRHLHGRGEQLFQQLGLVALDDLDWYPLVGGGITALAQPTERIAAAAVQCLLERLQGSQLP
ARRLDLRAQLIVRGSTPIRN
>Q88HH7 ~~~ptxS~~~HTH-type transcriptional regulator PtxS~~~COG1609
MTDAPAHTRERVTISEVARVAGVSKATVSRYIGGDRQLLAEATAKRLEEVIERLGYRPNQMARGLKRGQTRLIGMLVADI
LNPYSVAVMHGVETACRQHGYSLVVCNTNRDDEQERHHLVALQSYNVEGLIVNTLGHHPGELLNLQRDIPMVLVDRQLPE
LNVDLVGLDNADAVEQALDHLQAQGYRDILAVSEPLDGTSSRLERVQAFGASISRRPGMRQQVLEIGAGLQGQLASFLAH
SGHGPQAIFTFNGVATLAVTRALLEAGRNLVADVGLIALDDLDWYPLVGKGITALAQPTERIGVAAFESLLGRLRGDSGA
ARRIDFKANLIIRGSTQPQ
>P77272 ~~~murP~~~PTS system N-acetylmuramic acid-specific EIIBC component~~~COG1263
MAKEISSELLNTILTRVGGPGNIASCGNCMTRLRLGVHDSSLVDPNIKTLEGVKGVILTSDQVQVVFGPGKAHRAAKAMS
ELLGEAPVQDAAEIAAQNKRQLKAKQTSGVQQFLAKFATIFTPLIPGFIAAGLLLGIATLIATVMHVPADAQGTLPDALN
FMKVFSKGLFTFLVILVGYNAAQAFGGTGVNGAIIAALFLLGYNPAATTGYYAGFHDFFGLPIDPRGNIIGVLIAAWACA
RIEGMVRRFMPDDLDMLLTSLITLLITATLAYLIIMPLGGWLFEGMSWLFMHLNSNPFGCAVLAGLFLIAVVFGVHQGFI
PVYLALMDSQGFNSLFPILSMAGAGQVGAALALYWRAQPHSALRSQVRGAIIPGLLGVGEPLIYGVTLPRMKPFVTACLG
GAAGGLFIGLIAWWGLPMGLNSAFGPSGLVALPLMTSAQGILPAMAVYAGGILVAWVCGFIFTTLFGCRNVNLD
>P13249 2.3.-.-~~~pac~~~Puromycin N-acetyltransferase~~~
MTEYKPTVRLATRDDVPRAVRTLAAAFADYPATRHTVDPDRHIERVTELQELFLTRVGLDIGKVWVADDGAAVAVWTTPE
SVEAGAVFAEIGPRMAELSGSRLAAQQQMEGLLAPHRPKEPAWFLATVGVSPDHQGKGLGSAVVLPGVEAAERAGVPAFL
ETSAPRNLPFYERLGFTVTADVEVPEGPRTWCMTRKPGA
>O32146 ~~~pucB~~~Purine catabolism protein PucB~~~COG2068
MRSPYLIGVFLAAGKSRRMGQNKLALPLKGENIGSLSLKTALSSRLDHVLVVERTEHASLEWIGAPYHAPPFQKRWSLHV
CQDAEKGQGHSVSSGVRKAESMGADGIVILLADQPQLSVDHLNALVALAPESFAVSSFLGAFTPPIYFSSTCFPYVKGLK
GDEGARRLLKSGQLGAGAVLEAKDSGELDDIDTPEEYDMVRRAMS
>P23462 ~~~pucC~~~Protein PucC~~~
MGYRAFALKNLARHAPKYLPFADVASEEVPLSRLLRLSLFQITVGMTLTLLAGTLNRVMIVELAVPASLVSVMLAMPMLF
APFRTLIGFKSDTHKSALGLRRAPWIWKGTIYQFGGFAIMPFALLVLSGFGESVDAPRWIGMSAAALAFLLVGAGVHIVQ
TAGLALATDLVAEEDQPKVVGLMYVMLLFGMVISALVYGALLADYTPGRLIQVIQGTALASVVLNMAAMWKQEAVSRDRA
RQMETAEHPTFKEAFGLLMGRPGMLALLTVIALGTFGFGMADVLLEPYGGQALHLTVGETTKLTALFALGTLAGFGTASR
VLGNGARPMRWSAGCTDRVPGFVAIIMSSLISQDGIWLFLAGTFAVGLGIGLFGHATLTATMRTAPADRIGLALGAWGAV
QATAAGLGVALAGVVRDGLVALPGTFGSGVVGPYNTVFAIEALILIVAIAFAVPLLKRGGR
>O32148 2.6.1.112~~~pucG~~~(S)-ureidoglycine--glyoxylate transaminase~~~COG0075
MSGRRELCTPLRTIMTPGPVEVDPRVLRVMSTPVVGQFDPAFTGIMNETMEMLRELFQTKNRWAYPIDGTSRAGIEAVLA
SVIEPEDDVLIPIYGRFGYLLTEIAERYGANVHMLECEWGTVFDPEDIIREIKKVKPKIVAMVHGETSTGRIHPLKAIGE
ACRTEDALFIVDAVATIGGCEVKVDEWKIDAAIGGTQKCLSVPSGMAPITYNERVADVIAARKKVERGIATQADRAALSG
NRPITSNYFDLSQLEDYWSERRLNHHTEATTMLYALREGVRLVLEEGLETRFERHRHHEAALAAGIKAMGLRLFGDDSCK
MPVVTCVEIPGGIDGESVRDMLLAQFGIEIASSFGPLAGKIWRIGTMGYSCRKENVLFVLAGLEAVLLRHNAGIEAGKAL
QAALDVYENAGRQAAV
>O32139 ~~~pucJ~~~Uric acid permease PucJ~~~COG2233
MKKRSFKVFTLSLQHVLAMYAGAILVPLLVGRALNVTTEQLSYLLAIDLLTCGVATLLQTLRGTYIGIGLPVMLGSSFVA
VTPMIAIGSNYGIHAIYGSIIAAGVFIFLFARFFGKLTVLFPPVVTGTVVTLIGLSLVPTGVKNMAGGEKINGSANPEYG
SLENLLLSVGVLVLILVLNRFLKGFARTLSVLIGIAAGTAAAAIMGKVSFSSVTEAPFFQIPKPFYFGAPAFEIGPILTM
LIVGIVIIVESTGVFYAIGKICGRPLTDKDLVKGYRAEGIAILIGGLFNAFPYNTFAQNAGLLQLTKVKTRNIVVTAGCI
LVCLGLIPKIAALASAVPAAVLGGATVVMFGMVIASGVKMLSTADLKNQYHLLTIACSIALGIGASTAPGIFAEFPAPIR
ILVSDGTITGSLTAIFLNLFFSLRDKKELTAQQTELPVLEHTLALEKEV
>O32140 ~~~pucK~~~Uric acid permease PucK~~~COG2233
MKEQHNALQLMMLGLQHMLAMYAGAILVPLIVGAAIGLNAGQLTYLIAIDLFMCGAATLLQLWRNRYFGIGLPVVLGCTF
TAVGPMISIGSTYGVPAIYGAIIAAGLIVVLAAGFFGKLVRFFPPVVTGSVVMIIGISLIPTAMNNLAGGEGSKEFGSLD
NVLLGFGVTAFILLLFYFFKGFIRSIAILLGLIAGTAAAYFMGKVDFSEVLEASWLHVPSLFYFGPPTFELPAVVTMLLV
AIVSLVESTGVYFALADITNRRLSEKDLEKGYRAEGLAILLGGLFNAFPYTAFSQNVGIVQLSKMKSVNVIAITGIILVA
IGLVPKAAALTTVIPTPVLGGAMIVMFGMVISYGIKMLSSVDLDSQGNLLIIASSVSLGLGATTVPALFSSLSGAASVLA
GSGIVIGSLTAIALHAFFQTKQPNSADIKT
>Q45697 ~~~uao~~~Uric acid degradation bifunctional protein~~~
MMRLKQLNEMSASEFIHLLGGVFENSSWVAERAEPNRPYSSFQSLYNKMVEIVETASDNEQLKLIQMHPHLGTNVKITDF
SQEEQKHAGLNELTKDEQNHLILLNQKYKDKFGFPFVMAVRGKIKQEIFRTIKERLQNNHQTEFKQALEEIKKIAMFRLQ
EIFREGENNSMTKHKERVMYYGKGDVFAYRTYLKPLTGVRTIPESPFSGRDHILFGVNVKISVGGTKLLTSFTKGDNSLV
VATDSMKNFIQKHLASYTGTTIEGFLEYVATSFLKKYSHIEKISLIGEEIPFETTFAVKNGNRAASELVFKKSRNEYATA
YLNMVRNEDNTLNITEQQSGLAGLQLIKVSGNSFVGFIRDEYTTLPEDSNRPLFVYLNIKWKYKNTEDSFGTNPENYVAA
EQIRDIATSVFHETETLSIQHLIYLIGRRILERFPQLQEVYFESQNHTWDKIVEEIPESEGKVYTEPRPPYGFQCFTVTQ
EDLPHENILMFSDEPDHKGALK
>O32141 ~~~pucL~~~Uric acid degradation bifunctional protein PucL~~~COG3195
MFTMDDLNQMDTQTLTDTLGSIFEHSSWIAERSAALRPFSSLSDLHRKMTGIVKAADRETQLDLIKKHPRLGTKKTMSDD
SVREQQNAGLGKLEQQEYEEFLMLNEHYYDRFGFPFILAVKGKTKQDIHQALLARLESERETEFQQALIEIYRIARFRLA
DIITEKGETQMKRTMSYGKGNVFAYRTYLKPLTGVKQIPESSFAGRDNTVVGVDVTCEIGGEAFLPSFTDGDNTLVVATD
SMKNFIQRHLASYEGTTTEGFLHYVAHRFLDTYSHMDTITLTGEDIPFEAMPAYEEKELSTSRLVFRRSRNERSRSVLKA
ERSGNTITITEQYSEIMDLQLVKVSGNSFVGFIRDEYTTLPEDGNRPLFVYLNISWQYENTNDSYASDPARYVAAEQVRD
LASTVFHELETPSIQNLIYHIGCRILARFPQLTDVSFQSQNHTWDTVVEEIPGSKGKVYTEPRPPYGFQHFTVTREDAEK
EKQKAAEKCRSLKA
>O32138 ~~~pucR~~~Purine catabolism regulatory protein~~~COG2508
MNILDVMKIPAFENANLIAGKAGGEREVQHVNMMDAPDIVDFLHKNELLVTTAYHLKDHPHQLSELIRQMAKRGCAGLGI
KTKRYLEDIPKEIIELADSYAFPIIELPEHIRLGDIVNATLSHILDMRSNELQQAIYAHKKFTNHIMSGKGLQSLLKKVS
DILQLPVLLLDQHAKMLSASHQISVETEKLKGTLNTVSGPFFTCFSTISDQKTYSVLPIYNHEKNCGYLLIPDMVQAGDK
GLILTIEQAANVISFELLKENALKQFSRRARNEFFNNFIERTFSSDDEIKNRAKEFKLRWDQKYMCIAGKLDRNDESISF
TENQLASDSVFEFLEGELSAFPFPPHFFMKGNVGIILIEATDSWSEMHASVISFLEQFQTQVSAQFKRTVSFGISNICQK
LIDVPDAFTEASDALQSGHLSRSTAFIQVYHAKDVPELLRLLPVEDLKKFYNSTLQSLAEKQQEDQSLLHTLSVYLETHC
QISETAKRLYVHRNTVIYRLEKCEELLGKSLKDPETTMRLRLALRMQRLIS
>P13402 ~~~pufX~~~Intrinsic membrane protein PufX~~~
MADKTIFNDHLNTNPKTNLRLWVAFQMMKGAGWAGGVFFGTLLLIGFFRVVGRMLPIQENQAPAPNITGALETGIELIKH
LV
>P26240 ~~~pufX~~~Intrinsic membrane protein PufX~~~
MSMFDKPFDYENGSKFEMGIWIGRQMAYGAFLGSIPFLLGLGLVLGSYGLGLMLPERAHQAPSPYTTEVVVQHATEVV
>C0SPA0 3.2.1.41~~~amyX~~~Pullulanase~~~COG1523
MVSIRRSFEAYVDDMNIITVLIPAEQKEIMTPPFRLETEITDFPLAVREEYSLEAKYKYVCVSDHPVTFGKIHCVRASSG
HKTDLQIGAVIRTAAFDDEFYYDGELGAVYTADHTVFKVWAPAATSAAVKLSHPNKSGRTFQMTRLEKGVYAVTVTGDLH
GYEYLFCICNNSEWMETVDQYAKAVTVNGEKGVVLRPDQMKWTAPLKPFSHPVDAVIYETHLRDFSIHENSGMINKGKYL
ALTETDTQTANGSSSGLAYVKELGVTHVELLPVNDFAGVDEEKPLDAYNWGYNPLHFFAPEGSYASNPHDPQTRKTELKQ
MINTLHQHGLRVILDVVFNHVYKRENSPFEKTVPGYFFRHDECGMPSNGTGVGNDIASERRMARKFIADCVVYWLEEYNV
DGFRFDLLGILDIDTVLYMKEKATKAKPGILLFGEGWDLATPLPHEQKAALANAPRMPGIGFFNDMFRDAVKGNTFHLKA
TGFALGNGESAQAVMHGIAGSSGWKALAPIVPEPSQSINYVESHDNHTFWDKMSFALPQENDSRKRSRQRLAAAIILLAQ
GVPFIHSGQEFFRTKQGVENSYQSSDSINQLDWDRRETFKEDVHYIRRLISLRKAHPAFRLRSAADIQRHLECLTLKEHL
IAYRLYDLDEVDEWKDIIVIHHASPDSVEWRLPNDIPYRLLCDPSGFQEDPTEIKKTVAVNGIGTVILYLASDLKSFA
>P07811 3.2.1.41~~~pulA~~~Pullulanase~~~
MLRYTCHALFLGSLVLLSGCDNSSSSSTSGSPGSPGNPGNPGTPGTPDPQDVVVRLPDVAVPGEAVQASARQAVIHLVDI
AGITSSTPADYATKNLYLWNNETCDALSAPVADWNDVSTTPTGSDKYGPYWVIPLTKESGSINVIVRDGTNKLIDSGRVS
FSDFTDRTVSVIAGNSAVYDSRADAFRAAFGVALADAHWVDKTTLLWPGGENKPIVRLYYSHSSKVAADSNGEFSDKYVK
LTPTTVNQQVSMRFPHLASYPAFKLPDDVNVDELLQGDDGGIAESDGILSLSHPGADRRRAGRYLCRRAEALSYGAQLTD
SGVTFRVWAPTAQQVELVIYSADKKVIASHPMTRDSASGAWSWQGGSDLKGAFYRYAMTVYHPQSRKVEQYEVTDPYAHS
LSTNSEYSQVVDLNDSALKPEGWDGLTMPHAQKTKADLAKMTIHESHIRDLSAWDQTVPAELRGKYLALTAQESNMVQHL
KQLSASGVTHIELLPVFDLATVNEFSDKVADIQQPFSRLCEVNSAVKSSEFAGYCDSGSTVEEVLTQLKQNDSKDNPQVQ
ALNTLVAQTDSYNWGYDPFHYTVPEGSYATDPEGTARIKEFRTMIQAIKQDLGMNVIMDVVYNHTNAAGPTDRTSVLDKI
VPWYYQRLNETTGSVESATCCSDSAPEHRMFAKLIADSLAVWTTDYKIDGFRFDLMGYHPKAQILSAWERIKALNPDIYF
FGEGWDSNQSDRFEIASQINLKGTGIGTFSDRLRDAVRGGGPFDSGDALRQNQGVGSGAGVLPNELTTLSDDQARHLADL
TRLGMAGNLADFVLIDKDGAVKRGSEIDYNGAPGGYAADPTEVVNYVSKHDNQTLWDMISYKAAQEADLDTRVRMQAVSL
ATVMLGQGIAFDQQGSELLRSKSFTRDSYDSGDWFNRVDYSLQDNNYNVGMPRSSDDGSNYDIIARVKDAVATPGETELK
QMTAFYQELTALRKSSPLFTLGDGATVMKRVDFRNTGADQQTGLLVMTIDDGMQAGRQSGQPCRRHRGGDQRRAGKPDAA
GLRRHIAPAERYSAGGGRPVAGERVQVAADGSVTLPAWSVAVLELPQASRRALACR
>P07206 3.2.1.41~~~pulA~~~Pullulanase~~~
MLRYTRNALVLGSLVLLSGCDNGSSSSSSGNPDTPDNQDVVVRLPDVAVPGEAVTAVENQAVIHLVDIAGITSSSAADYS
SKNLYLWNNETCDALSAPVADWNDVSTTPSGSDKYGPYWVIPLNKESGCINVIVRDGTDKLIDSDLRVAFGDFTDRTVSV
IAGNSAVYDSRADAFRAAFGVALAEAHWVDKNTLLWPGGQDKPIVRLYYSHSSKVAADGEGKFTDRYLKLTPTTVSQQVS
MRFPHLSSYAAFKLPDNANVDELLQGETVAIAAAEDGILISATQVQTAGVLDDAYAEAAEALSYGAQLADGGVTFRVWAP
TAQQVDVVVYSADKKVIGSHPMTRDSASGAWSWQGGSDLKGAFYRYAMTVYHPQSRKVEQYEVTDPYAHSLSTNSEYSQV
VDLNDSALKPDGWDNLTMPHAQKTKADLAKMTIHESHIRDLSAWDQTVPAELRGKYLALTAGDSNMVQHLKTLSASGVTH
VELLPVFDLATVNEFSDKVADIQQPFSRLCEVNSAVKSSEFAGYCDSGSTVEEVLNQLKQSDSQDNPQVQALNTLVAQTD
SYNWGYDPFHYTVPEGSYATDPEGTTRIKEFRTMIQAIKQDLGMNVIMDVVYNHTNAAGPTDRTSVLDKIVPWYYQRLNE
TTGSVESATCCSDSAPEHRMFAKLIADSLAVWTTDYKIDGFRFDLMGYHPKAQILSAWERIKALNPDIYFFGEGWDSNQS
DRFEIASQINLKGTGIGTFSDRLRDSVRGGGPFDSGDALRQNQGIGSGAGVLPNELASLSDDQVRHLADLTRLGMAGNLA
DFVMIDKDGAAKKGSEIDYNGAPGGYAADPTEVVNYVSKHDNQTLWDMISYKASQEADLATRVRMQAVSLATVMLGQGIA
FDQQGSELLRSKSFTRDSYDSGDWFNRVDYSLQDNNYNVGMPRISDDGSNYEVITRVKEMVATPGEAELKQMTAFYQELT
ELRKSSPLFTLGDGSAVMKRVDFRNTGSDQQAGLLVMTVDDGMKAGASLDSRLDGLVVAINAAPESRTLNEFAGETLQLS
AIQQTAGENSLANGVQIAADGTVTLPAWSVAVLELPQGEAQGAGLPVSSK
>Q9F930 3.2.1.41~~~spuA~~~Pullulanase A~~~
MRKTPSHTEKKMVYSIRSLKNGTGSVLIGASLVLLAMATPTISSDESTPTTNEPNNRNTTTLAQPLTDTAAGSGKNESDI
SSPGNANASLEKTEEKPATEPTTPAASPADPAPQTGQDRSSEPTTSTSPVTTETKAEEPIEDNYFRIHVKKLPEENKDAQ
GLWTWDDVEKPSENWPNGALSFKDAKKDDYGYYLDVKLKGEQAKKISFLINNTAGKNLTGDKSVEKLVPKMNEAWLDQDY
KVFSYEPQPAGTVRVNYYRTDGNYDKKSLWYWGDVKNPSSAQWPDGTDFTATGKYGRYIDIPLNEAAREFGFLLLDESKQ
GDDVKIRKENYKFTDLKNHSQIFLKDDDESIYTNPYYVHDIRMTGAQHVGTSSIESSFSTLVGAKKEDILKHSNITNHLG
NKVTITDVAIDEAGKKVTYSGDFSDTKHPYTVSYNSDQFTTKTSWRLKDETYSYDGKLGADLKEEGKQVDLTLWSPSADK
VSVVVYDKNDPDKVVGTVALEKGERGTWKQTLDSTNKLGITDFTGYYYQYQIERQGKTVLALDPYAKSLAAWNSDDAKID
DAHKVAKAAFVDPAKLGPQDLTYGKIHNFKTREDAVIYEAHVRDFTSDPAIAKDLTKPFGTFEAFIEKLDYLKDLGVTHI
QLLPVLSYYFVNELKNHEHLSDYASSNSNYNWGYDPQNYFSLTGMYSSDPKNPEKRIAEFKNLINEIHKRGMGAILDVVY
NHTAKVDIFEDLEPNYYHFMDADGTPRTSFGGGRLGTTHHMTKRLLVDSIKYLVDTYKVDGFRFDMMGDHDAASIEEAYK
AARALNPNLIMLGEGWRTYAGDENMPTKAADQDWMKHTDTVAVFSDDIRNNLKSGYPNEGQPAFITGGKRDVNTIFKNLI
AQPTNFEADSPGDVIQYIAAHDNLTLFDIIAQSIKKDPSKAENYAEIHRRLRLGNLMVLTAQGTPFIHSGQEYGRTKQFR
NPAYRTPVAEDKVPNKSHLLRDKDGNPFDYPYFIHDSYDSSDAVNKFDWTKATDGKAYPENVKSRDYMKGLIALRQSTDA
FRLKSLQDIKDRVHLITVPGQNGVEKEDVVIGYQITAPNGDIYAVFVNADEKAREFNLGTAFAHLRNAEVLADENQAGSV
GIANPKGLEWTEKGLKLNALTATVLRVSQNGTSHESTAEEKPDSTPSKPEHQNEASHPAHQDPAPEARPDSTKPDAKVAD
AENKPSQATADSQAEQPAQEAQASSVKEAVRKESVENSSKENISATPDRQAELPNTGIKNENKLLFAGISLLALLGLGFL
LKNKKEN
>A0A0H2UNG0 3.2.1.41~~~spuA~~~Pullulanase A~~~COG0508
MRKTPSHTEKKMVYSIRSLKNGTGSVLIGASLVLLAMATPTISSDESTPTTNEPNNRNTTTLAQPLTDTAAGSGKNESDI
SSPGNANASLEKTEEKPAASPADPAPQTGQDRSSEPTTSTSPVTTETKAEEPIEDNYFRIHVKKLPEENKDAQGLWTWDD
VEKPSENWPNGALSFKDAKKDDYGYYLDVKLKGEQAKKISFLINNTAGKNLTGDKSVEKLVPKMNEAWLDQDYKVFSYEP
QPAGTVRVNYYRTDGNYDKKSLWYWGDVKNPSSAQWPDGTDFTATGKYGRYIDIPLNEAAREFGFLLLDESKQGDDVKIR
KENYKFTDLKNHSQIFLKDDDESIYTNPYYVHDIRMTGAQHVGTSSIESSFSTLVGAKKEDILKHSNITNHLGNKVTITD
VAIDEAGKKVTYSGDFSDTKHPYTVSYNSDQFTTKTSWRLKDETYSYDGKLGADLKEEGKQVDLTLWSPSADKVSVVVYD
KNDPDKVVGTVALEKGERGTWKQTLDSTNKLGITDFTGYYYQYQIERQGKTVLALDPYAKSLAAWNSDDSKIDDAHKVAK
AAFVDPAKLGPQDLTYGKIHNFKTREDAVIYEAHVRDFTSDPAIAKDLTKPFGTFEAFIEKLDYLKDLGVTHIQLLPVLS
YYFVNELKNHERLSDYASSNSNYNWGYDPQNYFSLTGMYSSDPKNPEKRIAEFKNLINEIHKRGMGAILDVVYNHTAKVD
LFEDLEPNYYHFMDADGTPRTSFGGGRLGTTHHMTKRLLIDSIKYLVDTYKVDGFRFDMMGDHDAASIEEAYKAARALNP
NLIMLGEGWRTYAGDENMPTKAADQDWMKHTDTVAVFSDDIRNNLKSGYPNEGQPAFITGGKRDVNTIFKNLIAQPTNFE
ADSPGDVIQYIAAHDNLTLFDIIAQSIKKDPSKAENYAEIHRRLRLGNLMVLTAQGTPFIHSGQEYGRTKQFRDPAYKTP
VAEDKVPNKSHLLRDKDGNPFDYPYFIHDSYDSSDAVNKFDWTKATDGKAYPENVKSRDYMKGLIALRQSTDAFRLKSLQ
DIKDRVHLITVPGQNGVEKEDVVIGYQITAPNGDIYAVFVNADEKAREFNLGTAFAHLRNAEVLADENQAGPVGIANPKG
LEWTEKGLKLNALTATVLRVSQNGTSHESTAEEKPDSTPSKPEHQNEASHPAHQDPAPEARPDSTKPDAKVADAENKPSQ
ATADSQAEQPAQEAQASSVKEAVRNESVENSSKENIPATPDKQAELPNTGIKNENKLLFAGISLLALLGLGFLLKNKKEN
>O33840 3.2.1.41~~~pulA~~~Pullulanase~~~COG1523
MKTKLWLLLVLLLSALIFSETTIVVHYHRYDGKYDGWNLWIWPVEPVSQEGKAYQFTGEDDFGKVAVVKLPMDLTKVGII
VRLNEWQAKDVAKDRFIEIKDGKAEVWILQGVEEIFYEKPDTSPRIFFAQARSNKVIEAFLTNPVDTKKKELFKVTVDGK
EIPVSRVEKADPTDIDVTNYVRIVLSESLKEEDLRKDVELIIEGYKPARVIMMEILDDYYYDGELGAVYSPEKTIFRVWS
PVSKWVKVLLFKNGEDTEPYQVVNMEYKGNGVWEAVVEGDLDGVFYLYQLENYGKIRTTVDPYSKAVYANSKKSAVVNLA
RTNPEGWENDRGPKIEGYEDAIIYEIHIADITGLENSGVKNKGLYLGLTEENTKGPGGVTTGLSHLVELGVTHVHILPFF
DFYTGDELDKDFEKYYNWGYDPYLFMVPEGRYSTDPKNPHTRIREVKEMVKALHKHGIGVIMDMVFPHTYGIGELSAFDQ
TVPYYFYRIDKTGAYLNESGCGNVIASERPMMRKFIVDTVTYWVKEYHIDGFRFDQMGLIDKKTMLEVERALHKIDPTII
LYGEPWGGWGAPIRFGKSDVAGTHVAAFNDEFRDAIRGSVFNPSVKGFVMGGYGKETKIKRGVVGSINYDGKLIKSFALD
PEETINYAACHDNHTLWDKNYLAAKADKKKEWTEEELKNAQKLAGAILLTSQGVPFLHGGQDFCRTKNFNDNSYNAPISI
NGFDYERKLQFIDVFNYHKGLIKLRKEHPAFRLKNAEEIKKHLEFLPGGRRIVAFMLKDHAGGDPWKDIVVIYNGNLEKT
TYKLPEGKWNVVVNSQKAGTEVIETVEGTIELDPLSAYVLYRE
>P20440 ~~~pulS~~~Pullulanase secretion protein PulS~~~
MRNFILFPMMAVVLLSGCQQNRPTTLSPAVSGQAQLEQLASVAAGARYLKNKCNRSDLPADEAINRAAINVGKKRGWANI
DANLLSQRSAQLYQQLQQDSTPEATKCSQFNRQLAPFIDSLRDNK
>P46354 2.4.2.1~~~punA~~~Purine nucleoside phosphorylase 1~~~COG0005
MKDRIERAAAFIKQNLPESPKIGLILGSGLGILADEIENPVKLKYEDIPEFPVSTVEGHAGQLVLGTLEGVSVIAMQGRF
HFYEGYSMEKVTFPVRVMKALGVEALIVTNAAGGVNTEFRAGDLMIITDHINFMGTNPLIGPNEADFGARFPDMSSAYDK
DLSSLAEKIAKDLNIPIQKGVYTAVTGPSYETPAEVRFLRTMGSDAVGMSTVPEVIVANHAGMRVLGISCISNAAAGILD
QPLSHDEVMEVTEKVKAGFLKLVKAIVAQYE
>P81989 2.4.2.1~~~punA~~~Purine nucleoside phosphorylase~~~
TTTTPPSTPPLDDPATDPFLVARAAADHIAQATGVEGHDMALVLGSGWGGAAELLGEVVAEVPTHEIPGFSAPAVAGHLS
VTRSIRVERADGSVRHALVLGSRTHLYEGKGVRAVVHGVRTAAATGAETLILTNGCGGLNQEWGAGTPVLLSDHINLTAR
SPLEGPTFVDLTDVYSPRLRELAHRVDPTLPEGVYAQFPGPHYETPAEVRMAGILGADLVGMSTTLEAIAARHCGLEVLG
VSLVTNLAAGISPTPLSHAEVIEAGQAAGPRISALLADIAKR
>P77834 2.4.2.1~~~punA~~~Purine nucleoside phosphorylase 1~~~
MNRTAIEQAAQFLKEKFPTSPQIGLILGSGLGVLADEIEQAIKIPYSDIPNFPVSTVEGHAGQLVYGQLEGATVVVMQGR
FHYYEGYSFDKVTFPVRVMKALGVEQLIVTNAAGGVNESFEPGDLMIISDHINNMGGNPLIGPNDSALGVRFPDMSEAYS
KRLRQLAKDVANDIGLRVREGVYVANTGPAYETPAEIRMIRVMGGDAVGMSTVPEVIVARHAGMEVLGISCISNMAAGIL
DQPLTHDEVIETTEKVKADFLRFVKAIVRNMAKN
>P9WP01 2.4.2.1~~~punA~~~Purine nucleoside phosphorylase~~~COG0005
MADPRPDPDELARRAAQVIADRTGIGEHDVAVVLGSGWLPAVAALGSPTTVLPQAELPGFVPPTAAGHAGELLSVPIGAH
RVLVLAGRIHAYEGHDLRYVVHPVRAARAAGAQIMVLTNAAGGLRADLQVGQPVLISDHLNLTARSPLVGGEFVDLTDAY
SPRLRELARQSDPQLAEGVYAGLPGPHYETPAEIRMLQTLGADLVGMSTVHETIAARAAGAEVLGVSLVTNLAAGITGEP
LSHAEVLAAGAASATRMGALLADVIARF
>P40974 1.4.3.10~~~puo~~~Putrescine oxidase~~~
MTDQRTLGSETAIERDVVVVGAGPAGLMAARTLVAAGRTVAVLEARDRVGGRTWSKTVDGAFLEIGGQWISPDQTELLAL
VDELGLETYQRYREGESVYLAPDGTRHTYTGSMFPAGESTIVEMEKLVALLDGLVAEIGATEPWAHPAARELDTISFHHW
LRQHSDDEAACSNIGIFVAGGMLTKPAHAFSVLQAVLMAASAGSFSNLVDEDFILDRRVVGGMQSVSETMAAELGEDVVF
LDTPVRTIRWAGDGGTYAEHVPGTPVTVWSDRLTVRAKDVVVAVPPNLYSRISFEPPLPRLQHQMHQHQSLGLVIKVHAV
YETPFWRDKGLSGTGFGAHELSQEVYDNTNHGDPRGTLVGFVSDERADELFGLPAEERRRLILESLSHYLGEEALHPVVY
YESDFGSEEWTRGAYAASYDLGGLHRYGAHQRTPVGPIRWACSDLAAEGYQHVDGALRQGRLAAAEVLGAGSLTGAER
>P25184 ~~~pupA~~~Ferric-pyoverdine 358 receptor~~~
MSKPLPSALNPLAKALLIRHSLRPRHALSRIGMGLALSSALVFQVQAQEWTLDIPAQSMNSALQALAKQTDTQLLYSPED
IGGLRSSALKGRHDLQSSLRILLQGTGLRYQIDGNTVTVTASAAAKDGQIELSATNVNSAGLGETTEGTGSYTTRVTSTA
TKMNLSIRETPQTITVVTRQRMDDQHLGSMNEVLTQTPGITMSQDGGERFNIYSRGSAINIYQFDGVTTYQDNQTRNMPS
TLMDVGLYDRIEIVRGATGLMTGAGDPSAVVNVIRKRPTREFKSHIQAGVGSWDYYRAEADVSGPLTDDGRVRGRFFAAK
QDNHTFMDWYTQDRDVLYGVVEADVTDTTVARFGIDRQTYKVNGAPGVPIIYTNGQPTNFSRSTSSDARWGYDDYTTTNY
TFGLEQQLAHDWQFKLAAAYMDVDRDSFSSYYSTTTNRSYLELDGSTEISAGIVTAKQHQKGVDATLQGPFQLLGQTHEL
IVGYNYLEYENKHRGDSGPDVNINFYDWDNQTPKPGDDEIIPGIQYNISNRQSGYFVASRFNLTDDLHLILGARASNYRF
DYALWRIGNEPAPYKMVERGVVTPYAGIVYDLTNEQSVYASYTDIFKPQNNVDITGKPLDPEVGKNYELGWKGEFLEGRL
NANIALYMVKRDNLAESTNEVVPDSGGLIASRAVDGAETKGVDVELSGEVLPGWNVFTGYSHTRTEDADGKRLTPQLPMD
TFRFWNTYRLPGEWEKLTLGGGVNWNSKSTLNFARYNSHVTQDDYFVTSLMARYRINESLAATLNVNNIFDKKYYAGMAG
SYGHYGAPRNATVTLRYDF
>P38047 ~~~pupB~~~Ferric-pyoverdine BN7/BN8 receptor~~~
MNHTARKRQGWQRSVSQKLAGAVVQGIACMGASAPLLLMPAWATAAAQAQADFDIPAGPLAPALAHFGQSAHILLSYPTA
LTEGRSTSGLAGRFDIDQGLAILLAGTGLEASRGANASYSLQASASTGALELSAVSISGKAPGSTTEGTGLYTTYSSSSS
TRLNLTPRETPQSLTVMTRQRLDDQRLTNLTDALEATPGITVVRDGLGSESDSYWSRGFAIQNYEVDGVPTSTRLDNYSQ
SMAMFDRVEIVRGATGLISGMGNPSATINLIRKRPTAEAQASITGEAGNWDRYGTGFDVSGPLTETGNIRGRFVADYKTE
KAWIDRYNQQSQLMYGITEFDLSEDTLLTVGFSYLRSDIDSPLRSGLPTRFSTGERTNLKRSLNAAPDWSYNDHEQTSFF
TSIEQQLGNGWSGKIELTHAENKFDELFNFAMGELNPDGSGLSQLPVRFSGTPRQDNLDLYATGPFSLFGREHELITGMT
LSQYRENTPSWGGWRYDYAGSPAGAIDNLFNWDGKSAKPAFVESGKSSIDEDQYAAYLTSRFSVTDDLSLILGSRLINWK
RDTSDRPYGGEETEVNREENGVFIPYAGVGYDLDDTWSLYASYTKIFNPQGAWVTDESNKPLDPMEGVGYELGIKGTHLN
GKLNSSLAVFKLEQDNLAIWQHDNVYSAEQDTTSKGIELELNGELAEGWQASAGYSYSVTTDADDQRINTNLPRNSFKTF
TSYRLHGPLDKITIGGGVNWQSKVGADLHTFSQGSYAVTNLMARYDINQHLSASVNLNNVFDREYYSQSGLYGVYGTPRN
VMTSFKYSF
>A0LU49 ~~~pup~~~Prokaryotic ubiquitin-like protein Pup~~~
MPEKDTGGQHRATRRTEEHDETIDEATATSDVQERREKLDADVDAILDEIDDVLEENAEEFVRSYIQKGGE
>Q8NQE0 ~~~pup~~~Prokaryotic ubiquitin-like protein Pup~~~
MNAKQTQIMGGGGRDEDNAEDSAQASGQVQINTEGVDSLLDEIDGLLENNAEEFVRSYVQKGGE
>A0QZ48 ~~~pup~~~Prokaryotic ubiquitin-like protein Pup~~~
MAQEQTKRGGGGGEDDDLPGASAAGQERREKLTEETDDLLDEIDDVLEENAEDFVRAYVQKGGQ
>P9WHN5 ~~~pup~~~Prokaryotic ubiquitin-like protein Pup~~~
MAQEQTKRGGGGGDDDDIAGSTAAGQERREKLTEETDDLLDEIDDVLEENAEDFVRAYVQKGGQ
>P00497 2.4.2.14~~~purF~~~Amidophosphoribosyltransferase~~~COG0034
MLAEIKGLNEECGVFGIWGHEEAPQITYYGLHSLQHRGQEGAGIVATDGEKLTAHKGQGLITEVFQNGELSKVKGKGAIG
HVRYATAGGGGYENVQPLLFRSQNNGSLALAHNGNLVNATQLKQQLENQGSIFQTSSDTEVLAHLIKRSGHFTLKDQIKN
SLSMLKGAYAFLIMTETEMIVALDPNGLRPLSIGMMGDAYVVASETCAFDVVGATYLREVEPGEMLIINDEGMKSERFSM
NINRSICSMEYIYFSRPDSNIDGINVHSARKNLGKMLAQESAVEADVVTGVPDSSISAAIGYAEATGIPYELGLIKNRYV
GRTFIQPSQALREQGVRMKLSAVRGVVEGKRVVMVDDSIVRGTTSRRIVTMLREAGATEVHVKISSPPIAHPCFYGIDTS
THEELIASSHSVEEIRQEIGADTLSFLSVEGLLKGIGRKYDDSNCGQCLACFTGKYPTEIYQDTVLPHVKEAVLTK
>P0AG16 2.4.2.14~~~purF~~~Amidophosphoribosyltransferase~~~COG0034
MCGIVGIAGVMPVNQSIYDALTVLQHRGQDAAGIITIDANNCFRLRKANGLVSDVFEARHMQRLQGNMGIGHVRYPTAGS
SSASEAQPFYVNSPYGITLAHNGNLTNAHELRKKLFEEKRRHINTTSDSEILLNIFASELDNFRHYPLEADNIFAAIAAT
NRLIRGAYACVAMIIGHGMVAFRDPNGIRPLVLGKRDIDENRTEYMVASESVALDTLGFDFLRDVAPGEAIYITEEGQLF
TRQCADNPVSNPCLFEYVYFARPDSFIDKISVYSARVNMGTKLGEKIAREWEDLDIDVVIPIPETSCDIALEIARILGKP
YRQGFVKNRYVGRTFIMPGQQLRRKSVRRKLNANRAEFRDKNVLLVDDSIVRGTTSEQIIEMAREAGAKKVYLASAAPEI
RFPNVYGIDMPSATELIAHGREVDEIRQIIGADGLIFQDLNDLIDAVRAENPDIQQFECSVFNGVYVTKDVDQGYLDFLD
TLRNDDAKAVQRQNEVENLEMHNEG
>P9WHQ7 2.4.2.14~~~purF~~~Amidophosphoribosyltransferase~~~COG0034
MAVDSDYVTDRAAGSRQTVTGQQPEQDLNSPREECGVFGVWAPGEDVAKLTYYGLYALQHRGQEAAGIAVADGSQVLVFK
DLGLVSQVFDEQTLAAMQGHVAIGHCRYSTTGDTTWENAQPVFRNTAAGTGVALGHNGNLVNAAALAARARDAGLIATRC
PAPATTDSDILGALLAHGAADSTLEQAALDLLPTVRGAFCLTFMDENTLYACRDPYGVRPLSLGRLDRGWVVASETAALD
IVGASFVRDIEPGELLAIDADGVRSTRFANPTPKGCVFEYVYLARPDSTIAGRSVHAARVEIGRRLARECPVEADLVIGV
PESGTPAAVGYAQESGVPYGQGLMKNAYVGRTFIQPSQTIRQLGIRLKLNPLKEVIRGKRLIVVDDSIVRGNTQRALVRM
LREAGAVELHVRIASPPVKWPCFYGIDFPSPAELIANAVENEDEMLEAVRHAIGADTLGYISLRGMVAASEQPTSRLCTA
CFDGKYPIELPRETALGKNVIEHMLANAARGAALGELAADDEVPVGR
>P99164 2.4.2.14~~~purF~~~Amidophosphoribosyltransferase~~~
MFNYSGLNEECGVFGIWNHPEAAQLTYMGLHSLQHRGQEGAGIVVSDQNELKGERGLGLLTEAINDDQMERLKGYQHAIG
HVRYATSGNKGIENIQPFLYHFYDMSVGICHNGNLINAKSLRQNLEKQGAIFHSSSDTEVIMHLIRRSKAPTFEEALKES
LRKVKGGFTFAILTKDALYGAVDPNAIRPLVVGKMKDGTYILASETCAIDVLGAEFVQDIHAGEYVVINDKGITVKSYTH
HTTTAISAMEYIYFARPDSTIAGKNVHAVRKASGKKLAQESPVNADMVIGVPNSSLSAASGYAEEIGLPYEMGLVKNQYV
ARTFIQPTQELREQGVRVKLSAVKDIVDGKNIILVDDSIVRGTTIRRIVKMLKDSGANKVHVRIASPEFMFPSFYGIDVS
TTAELISASKSPEEIKDYIGADSLAYLSVDGLIESIGLDYDAPYSGLCVESFTGDYPAGLYDYEANYKAHLSHRQKQYIS
KNKHFFDSEGNLNV
>O66949 6.3.4.13~~~purD~~~Phosphoribosylamine--glycine ligase~~~COG0151
MKVLVVGNGGREHAIAWKVAQSPLVKELYVAKGNAGIWEIAKRVDISPTDVEKLAEFAKNEGVDFTIVGPEAPLVEGIVD
EFEKRGLKIFGPNKEAAKLEGSKAFAKTFMKKYGIPTARYEVFTDFEKAKEYVEKVGAPIVVKADGLAAGKGAVVCETVE
KAIETLDRFLNKKIFGKSSERVVIEEFLEGEEASYIVMINGDRYVPLPTSQDHKRLLDEDKGPNTGGMGAYSPTPVINEE
VEKRIREEIVERVIKGLKEEGIYYRGFLYAGLMITKEGPKVLEFNVRLGDPEAQPILMRVKNDFLETLLNFYEGKDVHIK
EDERYALDVVLASRGYPEKPETGKIIHGLDYLKSMEDVVVFHAGTKKEGNFTVTSGGRVLNVCAYGKTLKEAKERAYEAI
RYVCFEGMHYRKDIGDKAFKYLSE
>P12039 6.3.4.13~~~purD~~~Phosphoribosylamine--glycine ligase~~~COG0151
MNVLIIGKGGREHTLAWKAAQSSLVENVFAAPGNDGMAASAQLVNIEESDHAGLVSFAKQNQVGLTIVGPEVPLIEGLVD
EFEKAGLHVFGPSKAAAIIEGSKQFAKDLMKKYDIPTAEYETFTSFDEAKAYVQEKGAPIVIKADGLAAGKGVTVAMTEE
EAIACLHDFLEDEKFGDASASVVIEEYLSGEEFSLMAFVKGEKVYPMVIAQDHKRAFDGDKGPNTGGMGAYSPVPQISEE
TVRHAVETIVKPAAKAMVQEGRSFTGVLYAGLMLTENGSKVIEFNARFGDPETQVVLPRMESDLVQVLLDLLDDKEVDLR
WKDTAAVSVVLASEGYPESYAKGTPIGSLAAETEQVVVFHAGTKAEGGEFVTNGGRVANVTAFDETFEAARDRVYKAVDE
IFKPGLFFRKDIGARALKAAQK
>P15640 6.3.4.13~~~purD~~~Phosphoribosylamine--glycine ligase~~~COG0151
MKVLVIGNGGREHALAWKAAQSPLVETVFVAPGNAGTALEPALQNVAIGVTDIPALLDFAQNEKIDLTIVGPEAPLVKGV
VDTFRAAGLKIFGPTAGAAQLEGSKAFTKDFLARHKIPTAEYQNFTEVEPALAYLREKGAPIVIKADGLAAGKGVIVAMT
LEEAEAAVHDMLAGNAFGDAGHRIVIEEFLDGEEASFIVMVDGEHVLPMATSQDHKRVGDKDTGPNTGGMGAYSPAPVVT
DDVHQRTMERIIWPTVKGMAAEGNTYTGFLYAGLMIDKQGNPKVIEFNCRFGDPETQPIMLRMKSDLVELCLAACESKLD
EKTSEWDERASLGVVMAAGGYPGDYRTGDVIHGLPLEEVAGGKVFHAGTKLADDEQVVTNGGRVLCVTALGHTVAEAQKR
AYALMTDIHWDDCFCRKDIGWRAIEREQN
>P9WHM9 6.3.4.13~~~purD~~~Phosphoribosylamine--glycine ligase~~~COG0151
MRVLVIGSGAREHALLLALGKDPQVSGLIVAPGNAGTARIAEQHDVDITSAEAVVALAREVGADMVVIGPEVPLVLGVAD
AVRAAGIVCFGPGKDAARIEGSKAFAKDVMAAAGVRTANSEIVDSPAHLDAALDRFGPPAGDPAWVVKDDRLAAGKGVVV
TADRDVARAHGAALLEAGHPVLLESYLDGPEVSLFCVVDRTVVVPLLPAQDFKRVGEDDTGLNTGGMGAYAPLPWLPDNI
YREVVSRIVEPVAAELVRRGSSFCGLLYVGLAITARGPAVVEFNCRFGDPETQAVLALLESPLGQLLHAAATGKLADFGE
LRWRDGVAVTVVLAAENYPGRPRVGDVVVGSEAEGVLHAGTTRRDDGAIVSSGGRVLSVVGTGADLSAARAHAYEILSSI
RLPGGHFRSDIGLRAAEGKISV
>P65896 6.3.4.13~~~purD~~~Phosphoribosylamine--glycine ligase~~~
MNVLVIGAGGREHALAYKLNQSNLVKQEFVIPGNEAMTPIAEVHTEISESNHQGILDFAKQQNVDWVVIGPEQPLIDGLA
DILRANGFKVFGPNKQAAQIEGSKLFAKKIMKKYNIPTADYKEVERKKDALTYIENCELPVVVKKDGLAAGKGVIIADTI
EAARSAIEIMYGDEEEGTVVFETFLEGEEFSLMTFVNGDLAVPFDCIAQDHKRAFDHDEGPNTGGMGAYCPVPHISDDVL
KLTNETIAQPIAKAMLNEGYQFFGVLYIGAILTKDGPKVIEFNARFGDPEAQVLLSRMESDLMQHIIDLDEGKRTEFKWK
NESIVGVMLASKGYPDAYEKGHKVSGFDLNENYFVSGLKKQGDTFVTSGGRVILAIGKGDNVQDAQRDAYEKVSQIQSDH
LFYRHDIANKALQLK
>Q9X0X7 6.3.4.13~~~purD~~~Phosphoribosylamine--glycine ligase~~~COG0151
MKAVRVHILGSGGREHAIGWAFAKQGYEVHFYPGNAGTKRDGTNHPYEGEKTLKAIPEEDIVIPGSEEFLVEGVSNWRSN
VFGPVKEVARLEGSKVYAKRFMKKYGIRTARFEVAETPEELREKIKKFSPPYVIKADGLARGKGVLILDSKEETIEKGSK
LIIGELIKGVKGPVVIDEFLAGNELSAMAVVNGRNFVILPFVRDYKRLMDGDRGPNTGGMGSWGPVEIPSDTIKKIEELF
DKTLWGVEKEGYAYRGFLYLGLMLHDGDPYILEYNVRLGDPETEVIVTLNPEGFVNAVLEGYRGGKMEPVEPRGFAVDVV
LAARGYPDAPEKGKEITLPEEGLIFFAGVAEKDGKLVTNGGRVLHCMGTGETKEEARRKAYELAEKVHFEGKTYRRDIAL
>Q8ZAR2 6.3.4.13~~~purD~~~Phosphoribosylamine--glycine ligase~~~COG0151
MNILIIGNGGREHALGWKAAQSPLADKIYVAPGNAGTALEPTLENVDIAATDIAGLLAFAQSHDIGLTIVGPEAPLVIGV
VDAFRAAGLAIFGPTQAAAQLEGSKAFTKDFLARHNIPSAEYQNFTDVEAALAYVRQKGAPIVIKADGLAAGKGVIVAMT
QEEAETAVNDMLAGNAFGDAGHRIVVEEFLDGEEASFIVMVDGENVLPMATSQDHKRVGDGDTGPNTGGMGAYSPAPVVT
DDVHQRVMDQVIWPTVRGMAAEGNIYTGFLYAGLMISADGQPKVIEFNCRFGDPETQPIMLRMRSDLVELCLAGTQGKLN
EKTSDWDERPSLGVVLAAGGYPADYRQGDVIHGLPQQEVKDGKVFHAGTKLNGNHEVVTNGGRVLCVTALGETVAQAQQY
AYQLAEGIQWEGVFCRKDIGYRAIARGK
>P08179 2.1.2.2~~~purN~~~Phosphoribosylglycinamide formyltransferase~~~COG0299
MNIVVLISGNGSNLQAIIDACKTNKIKGTVRAVFSNKADAFGLERARQAGIATHTLIASAFDSREAYDRELIHEIDMYAP
DVVVLAGFMRILSPAFVSHYAGRLLNIHPSLLPKYPGLHTHRQALENGDEEHGTSVHFVTDELDGGPVILQAKVPVFAGD
SEDDITARVQTQEHAIYPLVISWFADGRLKMHENAAWLDGQRLPPQGYAADE
>P9WHM5 2.1.2.2~~~purN~~~Phosphoribosylglycinamide formyltransferase~~~COG0299
MQEPLRVPPSAPARLVVLASGTGSLLRSLLDAAVGDYPARVVAVGVDRECRAAEIAAEASVPVFTVRLADHPSRDAWDVA
ITAATAAHEPDLVVSAGFMRILGPQFLSRFYGRTLNTHPALLPAFPGTHGVADALAYGVKVTGATVHLVDAGTDTGPILA
QQPVPVLDGDDEETLHERIKVTERRLLVAAVAALATHGVTVVGRTATMGRKVTIG
>P99162 2.1.2.2~~~purN~~~Phosphoribosylglycinamide formyltransferase~~~
MVKIAIFASGSGSNFENIVEHVESGKLENIEVTALYTDHQNAFCIDRAKKHDIPVYINEPKQFDSKAAYEQHLVTLLNKD
KVEWIILAGYMRLIGPDLLASFEGKILNIHPSLLPKYKGIDAIGQAYHSGDTITGSTVHYVDCGMDTGEIIEQRQCDIRP
DDSKEQLEEKVKKLEYELYPSVIAKIVK
>P15254 6.3.5.3~~~purL~~~Phosphoribosylformylglycinamidine synthase~~~COG0046
MMEILRGSPALSAFRINKLLARFQAARLPVHNIYAEYVHFADLNAPLNDDEHAQLERLLKYGPALASHAPQGKLLLVTPR
PGTISPWSSKATDIAHNCGLQQVNRLERGVAYYIEAGTLTNEQWQQVTAELHDRMMETVFFALDDAEQLFAHHQPTPVTS
VDLLGQGRQALIDANLRLGLALAEDEIDYLQDAFTKLGRNPNDIELYMFAQANSEHCRHKIFNADWVIDGEQQPKSLFKM
IKNTFETTPDHVLSAYKDNAAVMEGSEVGRYFADHETGRYDFHQEPAHILMKVETHNHPTAISPWPGAATGSGGEIRDEG
ATGRGAKPKAGLVGFSVSNLRIPGFEQPWEEDFGKPERIVTALDIMTEGPLGGAAFNNEFGRPALNGYFRTYEEKVNSHN
GEELRGYHKPIMLAGGIGNIRADHVQKGEINVGAKLVVLGGPAMNIGLGGGAASSMASGQSDADLDFASVQRDNPEMERR
CQEVIDRCWQLGDANPILFIHDVGAGGLSNAMPELVSDGGRGGKFELREILSDEPGMSPLEIWCNESQERYVLAVAADQL
PLFDELCKRERAPYAVIGEATEELHLSLHDRHFDNQPIDLPLDVLLGKTPKMTRDVQTLKAKGDALAREGITIADAVKRV
LHLPTVAEKTFLVTIGDRSVTGMVARDQMVGPWQVPVANCAVTTASLDSYYGEAMAIGERAPVALLDFAASARLAVGEAL
TNIAATQIGDIKRIKLSANWMAAAGHPGEDAGLYEAVKAVGEELCPALGLTIPVGKDSMSMKTRWQEGNEEREMTSPLSL
VISAFARVEDVRHTITPQLSTEDNALLLIDLGKGNNALGATALAQVYRQLGDKPADVRDVAQLKGFYDAIQALVAQRKLL
AYHDRSDGGLLVTLAEMAFAGHCGIDADIATLGDDRLAALFNEELGAVIQVRAADREAVESVLAQHGLADCVHYVGQAVS
GDRFVITANGQTVFSESRTTLRVWWAETTWQMQRLRDNPECADQEHQAKSNDADPGLNVKLSFDINEDVAAPYIATGARP
KVAVLREQGVNSHVEMAAAFHRAGFDAIDVHMSDLLTGRTGLEDFHALVACGGFSYGDVLGAGEGWAKSILFNDRVRDEF
ATFFHRPQTLALGVCNGCQMMSNLRELIPGSELWPRFVRNTSDRFEARFSLVEVTQSPSLLLQGMVGSQMPIAVSHGEGR
VEVRDAAHLAALESKGLVALRYVDNFGKVTETYPANPNGSPNGITAVTTESGRVTIMMPHPERVFRTVSNSWHPENWGED
GPWMRIFRNARKQLG
>P74881 6.3.5.3~~~purL~~~Phosphoribosylformylglycinamidine synthase~~~
MMEILRGSPALSAFRINKLLARFQAANLQVHNIYAEYVHFADLNAPLNDSEQAQLTRLLQYGPALSSHTPAGKLLLVTPR
PGTISPWSSKATDIAHNCGLQQVDRLERGVAYYIEASTLTAEQWRQVAAELHDRMMETVFSSLTDAEKLFIHHQPAPVSS
VDLLGEGRQALIDANLRLGLALAEDEIDYLQEAFTKLGRNPNDIELYMFAQANSEHCRHKIFNADWIIDGKPQPKSLFKM
IKNTFETTPDYVLSAYKDNAAVMEGSAVGRYFADHNTGRYDFHQEPAHILMKVETHNHPTAISPWPGAATGSGGEIRDEG
ATGRGAKPKAGLVGFSVSNLRIPGFEQPWEEDFGKPERIVTALDIMTEGPLGGAAFNNEFGRPALTGYFRTYEEKVNSHN
GEELRGYHKPIMLAGGIGNIRADHVQKGEIVVGAKLIVLGGPAMNIGLGGGAASSMASGQSDADLDFASVQRDNPEMERR
CQEVIDRCWQLGDANPILFIHDVGAGGLSNAMPELVSDGGRGGKFELRDILSDEPGMSPLEIWCNESQERYVLAVAADQL
PLFDELCKRERAPYAVIGDATEEQHLSLHDNHFDNQPIDLPLDVLLGKTPKMTRDVQTLKAKGDALNRADITIADAVKRV
LHLPTVAEKTFLVTIGDRTVTGMVARDQMVGPWQVPVADCAVTTASLDSYYGEAMSIGERAPVALLDFAASARLAVGEAL
TNIAATQIGDIKRIKLSANWMAAAGHPGEDAGLYDAVKAVGEELCPQLGLTIPVGKDSMSMKTRWQEGNEQREMTSPLSL
VISAFARVEDVRHTLTPQLSTEDNALLLIDLGKGHNALGATALAQVYRQLGDKPADVRDVAQLKGFYDAMQALVAARKLL
AWHDRSDGGLLVTLAEMAFAGHCGVQVDIAALGDDHLAALFNEELGGVIQVRAEDRDAVEALLAQYGLADCVHYLGQALA
GDRFVITANDQTVFSESRTTLRVWWAETTWQMQRLRDNPQCADQEHEAKANDTDPGLNVKLSFDINEDIAAPYIATGARP
KVAVLREQGVNSHVEMAAAFHRAGFDAIDVHMSDLLGGRIGLGNFHALVACGGFSYGDVLGAGEGWAKSILFNHRVRDEF
ETFFHRPQTLALGVCNGCQMMSNLRELIPGSELWPRFVRNHSDRFEARFSLVEVTQSPSLLLQGMVGSQMPIAVSHGEGR
VEVRDDAHLAALESKGLVALRYVDNFGKVTETYPANPNGSPNGITAVTTENGRVTIMMPHPERVFRTVANSWHPENWGED
SPWMRIFRNARKQLG
>Q81ZH0 6.3.3.1~~~purM~~~Phosphoribosylformylglycinamidine cyclo-ligase~~~COG0150
MANAYKQAGVDIEAGYEAVSRMKKHVQTTMRKEVLGGLGGFGGMFDLSKFALEEPVLVSGTDGVGTKLMLAFMADKHDTI
GIDAVAMCVNDIVVQGAEPLFFLDYIACGKAEPSKIENIVKGISEGCRQAGCALIGGETAEMPGMYSTEEYDLAGFTVGI
VDKKKIVTGEKIEAGHVLIGLASSGIHSNGYSLVRKVLLEDGELSLDRIYGRLELPLGEELLKPTKIYVKPILELLKNHE
VYGMAHITGGGFIENIPRMLPEGIGAEIELGSWKIQPIFSLLQEVGKLEEKEMFNIFNMGIGMVVAVKEEDAKDIVRLLE
EQGETARIIGRTVQGAGVTFNGGKAL
>P08178 6.3.3.1~~~purM~~~Phosphoribosylformylglycinamidine cyclo-ligase~~~COG0150
MTDKTSLSYKDAGVDIDAGNALVGRIKGVVKKTRRPEVMGGLGGFGALCALPQKYREPVLVSGTDGVGTKLRLAMDLKRH
DTIGIDLVAMCVNDLVVQGAEPLFFLDYYATGKLDVDTASAVISGIAEGCLQSGCSLVGGETAEMPGMYHGEDYDVAGFC
VGVVEKSEIIDGSKVSDGDVLIALGSSGPHSNGYSLVRKILEVSGCDPQTTELDGKPLADHLLAPTRIYVKSVLELIEKV
DVHAIAHLTGGGFWENIPRVLPDNTQAVIDESSWQWPEVFNWLQTAGNVEHHEMYRTFNCGVGMIIALPAPEVDKALALL
NANGENAWKIGIIKASDSEQRVVIE
>Q5L3D0 6.3.3.1~~~purM~~~Phosphoribosylformylglycinamidine cyclo-ligase~~~COG0150
MAKAYKQAGVDIEAGYQAVALMKEHVQKTMRPEVLGGIGGFGGLFDLSALGYRQPVLISGTDGVGTKLKLAFLLDRHDTI
GIDCVAMCVNDIIVQGAEPLFFLDYIACGKAVPEKIAAIVKGVADGCVEAGCALIGGETAEMPGMYDEDEYDLAGFAVGV
AEKERLITGETIQAGDALVGLPSSGLHSNGYSLVRRIVFEQAKLSLDEIYEPLDVPLGEELLKPTRIYAKLLRSVRERFT
IKGMAHITGGGLIENIPRMLPPGIGARIQLGSWPILPIFDFLREKGSLEEEEMFSVFNMGIGLVLAVSPETAAPLVEWLS
ERGEPAYIIGEVAKGAGVSFAGGGRA
>Q5F973 6.3.3.1~~~purM~~~Phosphoribosylformylglycinamidine cyclo-ligase~~~
MSTSLSYRDAGVGIDAGDQLVEKIKPFAKRTMRPEVLGDLGGFGALVEIGKKYQNPVLVSGTDGVGTKLKLAFDWDKHDT
VGIDLVAMSVNDILVQGAEPLFFLDYFACGKLDVPRATDVIKGIAQGCEESGCALIGGETAEMPGMYPVGEYDLAGFAVG
VVEKENVITGLSIGAGDVVLGLASNGAHSNGYSLIRKIIERDNPDLDAEFDNGKTLREAVIAPTRLYVKPILAALEKFTI
KGMAHITGGGITENVPRVLPKNTVAQIDAESWELPKLFQWLQKAGNVETQEMYRTFNCGIGMVVIVAAEDADAVRSFLSG
QGETVYRLGCIRERQGNEHQTQVA
>P99163 6.3.3.1~~~purM~~~Phosphoribosylformylglycinamidine cyclo-ligase~~~
MSKAYEQSGVNIHAGYEAVERMSSHVKRTMRKEVIGGLGGFGATFDLSQLNMTAPVLVSGTDGVGTKLKLAIDYGKHDSI
GIDAVAMCVNDILTTGAEPLYFLDYIATNKVVPEVIEQIVKGISDACVETNTALIGGETAEMGEMYHEGEYDVAGFAVGA
VEKDDYVDGSEVKEGQVVIGLASSGIHSNGYSLVRKLINESGIDLASNFDNRPFIDVFLEPTKLYVKPVLALKKEVSIKA
MNHITGGGFYENIPRALPAGYAARIDTTSFPTPKIFDWLQQQGNIDTNEMYNIFNMGIGYTVIVDEKDASRALKILAEQN
VEAYQIGHIVKNESTAIELLGV
>Q97TA2 6.3.3.1~~~purM~~~Phosphoribosylformylglycinamidine cyclo-ligase~~~COG0150
MANKNAYAQSGVDVEAGYEVVERIKKHVARTERAGVMGALGGFGGMFDLSKTGVKEPVLISGTDGVGTKLMLAIKYDKHD
TIGQDCVAMCVNDIIAAGAEPLYFLDYVATGKNEPAKLEQVVAGVAEGCVQAGAALIGGETAEMPGMYGEDDYDLAGFAV
GVAEKSQIIDGSKVVEGDVLLGLASSGIHSNGYSLVRRVFADYTGEEVLPELEGKKLKEVLLEPTRIYVKAVLPLIKEEL
VNGIAHITGGGFIENVPRMFADDLAAEIDESKVPVLPIFKTLEKYGQIKHEEMFEIFNMGVGLMLAVSPENVERVKELLD
EAVYEIGRIVKKENESVIIK
>Q9KPY6 6.3.3.1~~~purM~~~Phosphoribosylformylglycinamidine cyclo-ligase~~~COG0150
MSGNNPSLSYKDAGVDIDAGNALVERIKGAVKRTRRPEVMGGLGGFGALCELPTKYKHPVLVSGTDGVGTKLRLALDMKK
HDTIGIDLVAMCVNDLIVQGAEPLFFLDYYATGKLDVDTAAEVISGIADGCLQAGCALIGGETAEMPGMYEGEDYDVAGF
CVGVVEKEEIIDGSKVQVGDALIAVGSSGPHSNGYSLVRKILEVSKADKNERLAGKTIGEHLLAPTKIYIKSGLKLIAEH
DIHAISHITGGGFWENIPRVLPEGTKAVIDGKSWEWPVIFQWLQEKGNVTTHEMYRTFNCGVGLIIALPKDQANAAVALL
QAEGETAWVIGEIAAANSNEAQVEIN
>P12046 6.3.2.6~~~purC~~~Phosphoribosylaminoimidazole-succinocarboxamide synthase~~~COG0152
MNIVKNELLYEGKAKKIYKTDDENTLYVVYKDSATAFNGEKKAEISGKGRLNNEISSLIFKHLHAKGINNHFIERISETE
QLIKKVTIVPLEVVVRNVVAGSMSKRLGIPEGTELEQPIIEFYYKDDALGDPLITEDHIWLLKAATPEQVETIKSITTIV
NEELQSIFDDCHVRLIDFKLEFGLDAEGQVLLADEISPDTCRLWDKETNEKLDKDLFRRNLGSLTDAYEEIFNRLGGIHH
V
>Q0TTB4 6.3.2.6~~~purC~~~Phosphoribosylaminoimidazole-succinocarboxamide synthase~~~COG0152
MVNQLEMLYEGKAKKIYATDKEDMVIVHYKDDATAFNGEKKAQIESKGVLNNEITSLIFEMLNKEGIKTHFVEKLNDRDQ
LCKKVEIVPLEVIVRNVAAGSMAKRLGLEEGYELKTTVFELSYKDDSLGDPLINDYHAVGIGATTFEELNKIYEITAKVN
EILKEAFKKQNINLIDFKLEFGRYNGEILLADEISPDTCRFWDATTGEKMDKDRFRRDMGNVINGYREVLNRLRN
>P0A7D7 6.3.2.6~~~purC~~~Phosphoribosylaminoimidazole-succinocarboxamide synthase~~~COG0152
MQKQAELYRGKAKTVYSTENPDLLVLEFRNDTSAGDGARIEQFDRKGMVNNKFNYFIMSKLAEAGIPTQMERLLSDTECL
VKKLDMVPVECVVRNRAAGSLVKRLGIEEGIELNPPLFDLFLKNDAMHDPMVNESYCETFGWVSKENLARMKELTYKAND
VLKKLFDDAGLILVDFKLEFGLYKGEVVLGDEFSPDGSRLWDKETLEKMDKDRFRQSLGGLIEAYEAVARRLGVQLD
>Q2GHH2 6.3.2.6~~~purC~~~Phosphoribosylaminoimidazole-succinocarboxamide synthase~~~COG0152
MENKEKIYEGKAKIIFATLNPLEVIQHFKDEITAFNNKKAAIIHEKGILNNYISSFLMKKLIDKGIKTHFISLLNQREQL
VKKITIIPIEVVIRNLAAGNFSKRFQIADGTPFKSPIIEFYYKNDELSDPMVSEGHILSFQWLTNQELEKIKILSLKINN
ILSELFFNVGIKLVDFKLEFGKLHNDEQSDLFLADEISPDTCRLWDISTNKRLDKDRYRLNLGNVIEGYREVAHKLNAIP
NL
>B1MHW4 6.3.2.6~~~purC~~~Phosphoribosylaminoimidazole-succinocarboxamide synthase~~~
MRPSLSDYQHVASGKVRELYRVDDEHLLFVATDRISAFDFVLDTPIPDKGRILTAMSVFFFGLLTVPNHLAGPPDDPRIP
EEVLGRALLVRRLDMLPVECVARGYLTGSGLLDYQRTGAVCGHVLPQGLGEASRLDPPLFTPATKADIGEHDMNVDFAAV
VGLVGAVRANQLRDETIKIYTRAAAHALHKGIILADTKFEFGVDIEGNLVLADEVFTPDSSRYWDAAHYQPGVVQDSFDK
QFVRNWLTGPESGWDRASDTPPPPLPDEVAVATRERYIEAYERISGLSFSDWIGPSA
>A0R4I0 6.3.2.6~~~purC~~~Phosphoribosylaminoimidazole-succinocarboxamide synthase~~~COG0152
MRPALSDYQHLASGKVREIYRIDDEHLLFVASDRISAYDYILDSQIPDKGRILTAMSVFFFDHLLRTAGVPNHLAGPPDD
ERIPADVLGRALVVRRLDMLPVECVARGYLTGSGLIDYEKTGTVCGIALPPGLGEASKFDEPLFTPATKAEIGEHDENIS
FAKVIELVGAELANQLRDRTLQTYTAGADHALSKGIIIADTKFEFGVDRDGTVVLADEVFTPDSSRYWRADSYQPGVVQN
SFDKQFVRNWLTGPESGWDRHGNTPPPALPDDIVAATRERYIEAYERISGLSFDDWIGA
>P9WHN1 6.3.2.6~~~purC~~~Phosphoribosylaminoimidazole-succinocarboxamide synthase~~~COG0152
MRPALSDYQHVASGKVREIYRVDDEHLLLVASDRISAYDYVLDSTIPDKGRVLTAMSAFFFGLVDAPNHLAGPPDDPRIP
DEVLGRALVVRRLEMLPVECVARGYLTGSGLLDYQATGKVCGIALPPGLVEASRFATPLFTPATKAALGDHDENISFDRV
VEMVGALRANQLRDRTLQTYVQAADHALTRGIIIADTKFEFGIDRHGNLLLADEIFTPDSSRYWPADDYRAGVVQTSFDK
QFVRSWLTGSESGWDRGSDRPPPPLPEHIVEATRARYINAYERISELKFDDWIGPGA
>P99064 6.3.2.6~~~purC~~~Phosphoribosylaminoimidazole-succinocarboxamide synthase~~~
MTLLYEGKAKRIFSTNQENELRVEYKDEVTAGNGAKKDTMAGKGRLNNQITSIIFKYLQENGIESHFIKQLSETEQLVKP
VKIIPLEVVVRNIASGSITKRLGFENGEVFREPLVEFFYKNDALNDPLITDDHVKLLNIASDEDIEILKSKALKINNVLK
QLMDAMNLKLVDFKIEFGKTETGQILLADEISPDTCRIWDKATNANFDKDVYRNNTGSLIETYQIFLNKLEDLK
>Q07296 6.3.2.6~~~purC~~~Phosphoribosylaminoimidazole-succinocarboxamide synthase~~~COG0152
MSKQLIYSGKAKDIYTTEDENLIISTYKDQATAFNGVKKEQIAGKGVLNNQISSFIFEKLNVAGVATHFVEKLSDTEQLN
KKVKIIPLEVVLRNYTAGSFSKRFGVDEGIALETPIVEFYYKNDDLDDPFINDEHVKFLQIAGDQQIAYLKEETRRINEL
LKVWFAEIGLKLIDFKLEFGFDKDGKIILADEFSPDNCRLWDADGNHMDKDVFRRGLGELTDVYEIVWEKLQELK
>Q9X0X0 6.3.2.6~~~purC~~~Phosphoribosylaminoimidazole-succinocarboxamide synthase~~~COG0152
MNYEGKTKIVKVTGDYALLEFKDDITAGDGLKHDVLTGKGSICAETTAILMKYLSEKGIKTHLVEYIPPRTLKVIPLKMF
PLEVVVRLKKAGSFVRRYGGAEGEDLPVPLVEFFIKDDERHDPMVCVDHLEILGIATKKQAEKMKEAAVKITLALKEFFE
RANFELWDIKYEFGLDKDGNVVLGDEISPDTFRLRKKGEIFDKDVYRRDLGDPLKKYREVLELCRSLNSQ
>P12047 4.3.2.2~~~purB~~~Adenylosuccinate lyase~~~COG0015
MIERYSRPEMSAIWTDENRFQAWLEVEILACEAWAELGVIPKEDVKVMRENASFDINRILEIEKDTRHDVVAFTRAVSES
LGEERKWVHYGLTSTDVVDTALSYLLKQANDILLKDLERFVDIIKEKAKEHKYTVMMGRTHGVHAEPTTFGLKLALWHEE
MKRNLERFKQAKAGIEVGKISGAVGTYANIDPFVEQYVCEKLGLKAAPISTQTLQRDRHADYMATLALIATSIEKFAVEI
RGLQKSETREVEEFFAKGQKGSSAMPHKRNPIGSENMTGMARVIRGYMMTAYENVPLWHERDISHSSAERIILPDATIAL
NYMLNRFSNIVKNLTVFPENMKRNMDRTLGLIYSQRVLLALIDTGLTREEAYDTVQPKAMEAWEKQVPFRELVEAEEKIT
SRLSPEKIADCFDYNYHLKNVDLIFERLGLA
>P0AB89 4.3.2.2~~~purB~~~Adenylosuccinate lyase~~~COG0015
MELSSLTAVSPVDGRYGDKVSALRGIFSEYGLLKFRVQVEVRWLQKLAAHAAIKEVPAFAADAIGYLDAIVASFSEEDAA
RIKTIERTTNHDVKAVEYFLKEKVAEIPELHAVSEFIHFACTSEDINNLSHALMLKTARDEVILPYWRQLIDGIKDLAVQ
YRDIPLLSRTHGQPATPSTIGKEMANVAYRMERQYRQLNQVEILGKINGAVGNYNAHIAAYPEVDWHQFSEEFVTSLGIQ
WNPYTTQIEPHDYIAELFDCVARFNTILIDFDRDVWGYIALNHFKQKTIAGEIGSSTMPHKVNPIDFENSEGNLGLSNAV
LQHLASKLPVSRWQRDLTDSTVLRNLGVGIGYALIAYQSTLKGVSKLEVNRDHLLDELDHNWEVLAEPIQTVMRRYGIEK
PYEKLKELTRGKRVDAEGMKQFIDGLALPEEEKARLKAMTPANYIGRAITMVDELK
>Q5ZXD1 4.3.2.2~~~purB~~~Adenylosuccinate lyase~~~COG0015
MTLTALNAISPIDGRYVNKTRALSPYFSEFALTYYRLMVEIKWFESLAANDTIPEVPALDNKARKFLSDLISNFNESEAE
KIKEFEKQTNHDVKAVEYYLQDKFQENEQLKSCVAFIHFACTSEDINNLAYALMIKQAIAQVIQPTIAEIMGSITLLGKQ
HADVAMLSRTHGQPATPTTMGKELVNFVARLKRPQQQLAEVLIPAKFNGAVGNYNAHVAAYPEVDWRKHCANFVTSLGLS
FNAYTTQIEPHDGIAEVSQIMVRINNILLDYTQDIWSYISLGYFKQKTIAEEVGSSTMPHKVNPIDFENAEGNLGLSNAL
FIHFANKLTQSRMQRDLSDSTVLRNLGVAFSYSLIAYHSVAKGNDKLQINKSALQKDLSENWEVLAEAIQTVMRRYNEPN
AYEQLKELTRGQMIDAENLKKFIKTLSIPEEAKAELMKLTPETYTGLATQLVKAFS
>Q9I0K9 4.3.2.2~~~purB~~~Adenylosuccinate lyase~~~
MQLSSLTAVSPVDGRYAGKTSSLRPIFSEYGLIRFRVMVEVRWLQRLAAHAGIPEVAPFSAEANALLDSLASDFQLEHAE
RIKEIERTTNHDVKAVEYLLKEQAAKLPELAAVSEFIHFACTSEDINNLSHALMLREGRDSVLLPLMRQIAEAIRELAVK
LADVPMLSRTHGQPASPTTLGKELANVVYRLERQIKQVAGIELLGKINGAVGNYNAHLSAYPEVDWEANARQFIEGDLGL
TFNPYTTQIEPHDYIAELFDAIARFNTILIDFDRDVWGYISLGYFKQKTVAGEIGSSTMPHKVNPIDFENSEGNLGIANA
LFQHLASKLPISRWQRDLTDSTVLRNLGVGIAHSIIAYEASLKGIGKLELNAQRIAEDLDACWEVLAEPVQTVMRRYGVE
NPYEKLKELTRGKGISAEALQTFIEELAIPAEAKVELKKLTPAGYVGNAAAQAKRI
>Q7A4Q3 4.3.2.2~~~purB~~~Adenylosuccinate lyase~~~
MIERYSREEMSNIWTDQNRYEAWLEVEILACEAWSELGHIPKADVQKIRQNAKVNVERAQEIEQETRHDVVAFTRQVSET
LGEERKWVHYGLTSTDVVDTALSFVIKQANDIIEKDLERFIDVLAEKAKNYKYTLMMGRTHGVHAEPTTFGVKMALWYTE
MQRNLQRFKQVREEIEVGKMSGAVGTFANIPPEIESYVCKHLGIGTAPVSTQTLQRDRHAYYIATLALIATSLEKFAVEI
RNLQKTETREVEEAFAKGQKGSSAMPHKRNPIGSENITGISRVIRGYITTAYENVPLWHERDISHSSAERIMLPDVTIAL
DYALNRFTNIVDRLTVFEDNMRNNIDKTFGLIFSQRVLLALINKGMVREEAYDKVQPKAMISWETKTPFRELIEQDESIT
SVLTKEELDECFDPKHHLNQVDTIFERAGLA
>Q7A0G9 4.3.2.2~~~purB~~~Adenylosuccinate lyase~~~
MIERYSREEMSNIWTDQNRYEAWLEVEILACEAWSELGHIPKADVQKIRQNAKVNVERAQEIEQETRHDVVAFTRQVSET
LGEERKWVHYGLTSTDVVDTALSFVIKQANDIIEKDLERFIDVLAEKAKNYKYTLMMGRTHGVHAEPTTFGVKMALWYTE
MQRNLQRFKQVREEIEVGKMSGAVGTFANIPPEIESYVCKHLGIGTAPVSTQTLQRDRHAYYIATLALIATSLEKFAVEI
RNLQKTETREVEEAFAKGQKGSSAMPHKRNPIGSENITGISRVIRGYITTAYENVPLWHERDISHSSAERIMLPDVTIAL
DYALNRFTNIVDRLTVFEDNMRNNIDKTFGLIFSQRVLLALINKGMVREEAYDKVQPKAMISWETKTPFRELIEQDESIT
SVLTKEELDECFDPKHHLNQVDTIFERAGLA
>Q9X0I0 4.3.2.2~~~purB~~~Adenylosuccinate lyase~~~COG0015
MVERYSLSPMKDLWTEEAKYRRWLEVELAVTRAYEELGMIPKGVTERIRNNAKIDVELFKKIEEKTNHDVVAFVEGIGSM
IGEDSRFFHYGLTSSDVLDTANSLALVEAGKILLESLKEFCDVLWEVANRYKHTPTIGRTHGVHAEPTSFGLKVLGWYSE
MKRNVQRLERAIEEVSYGKISGAVGNYANVPPEVEEKALSYLGLKPEPVSTQVVPRDRHAFYLSTLAIVAAGIERIAVEI
RHLQRTEVLEVEEPFRKGQRGSSAMPHKKNPITCERLTGLSRMMRAYVDPSLENIALWHERDISHSSVERYVFPDATQTL
YYMIVTATNVVRNMKVNEERMKKNIDLTKGLVFSQRVLLKLIEKGLTRKEAYDIVQRNALKTWNSEKHFLEYLLEDEEVK
KLVTKEELEELFDISYYLKHVDHIFERFEKE
>A0A0H3AL67 4.3.2.2~~~purB~~~Adenylosuccinate lyase~~~COG0015
MELSALTAVSPVDGRYGSKTIALRSIFSEFGLLKYRTIVEIRWLQKLAATAEIAEVPAFSAEANQFLDAIAANFNEADAL
RIKEIERTTNHDVKAVEYFLKEKVAAMPELHAVNEFIHFACTSEDINNTSHALMLKEARDTVILPEIRNVIDAIRKLAEE
YRDIPLLSRTHGQPASPSTMGKEMANVAYRMERQYKQIANVEILAKINGAVGNYNAHLSAYPTVDWHKFSEEFITESLGV
DWNPYTTQIEPHDYIAELFEAVARFNTILIDFDRDVWGYIALGHFKQRTIAGEIGSSTMPHKVNPIDFENSEGNLGLANA
VFTHLAQKLPISRWQRDLTDSTVLRNLGVGVGYAIIAYTSTLKGISKLEVNRDALLAELDHNWEVLAEPIQTVMRRYGIE
KPYEKLKELTRGKRVDGEAMRQFIDGLALPAEEKTRLKAMTPASYIGYAIELTDKL
>Q9PNY2 ~~~purH~~~Bifunctional purine biosynthesis protein PurH~~~COG0138
MRALLSVSDKEGIVEFGKELENLGFEILSTGGTFKLLKENGIKVIEVSDFTKSPELFEGRVKTLHPKIHGGILHKRSDEN
HIKQAKENEILGIDLVCVNLYPFKKTTIMSDDFDEIIENIDIGGPAMIRSAAKNYKDVMVLCDPLDYEKVIETLKKGQND
ENFRLNLMIKAYEHTANYDAYIANYMNERFNGGFGASKFIVGQKVFDTKYGENPHQKGALYEFDAFFSANFKALKGEASF
NNLTDINAALNLASSFDKAPAIAIVKHGNPCGFAIKENLVQSYIHALKCDSVSAYGGVVAINGTLDEALANKINEIYVEV
IIAANVDEKALAVFEGKKRIKIFTQESPFLIRSFDKYDFKHIDGGFVYQNSDEVGEDELKNAKLMSQREASKEELKDLEI
AMKIAAFTKSNNVVYVKNGAMVAIGMGMTSRIDAAKAAIAKAKEMGLDLQGCVLASEAFFPFRDSIDEASKVGVKAIVEP
GGSIRDDEVVKAADEYGMALYFTGVRHFLH
>P15639 ~~~purH~~~Bifunctional purine biosynthesis protein PurH~~~COG0138
MQQRRPVRRALLSVSDKAGIVEFAQALSARGVELLSTGGTARLLAEKGLPVTEVSDYTGFPEMMDGRVKTLHPKVHGGIL
GRRGQDDAIMEEHQIQPIDMVVVNLYPFAQTVAREGCSLEDAVENIDIGGPTMVRSAAKNHKDVAIVVKSSDYDAIIKEM
DDNEGSLTLATRFDLAIKAFEHTAAYDSMIANYFGSMVPAYHGESKEAAGRFPRTLNLNFIKKLDMRYGENSHQQAAFYI
EENVKEASVATATQVQGKALSYNNIADTDAALECVKEFAEPACVIVKHANPCGVAIGNSILDAYDRAYKTDPTSAFGGII
AFNRELDAETAQAIISRQFVEVIIAPSASEEALKITAAKQNVRVLTCGQWGERVPGLDFKRVNGGLLVQDRDLGMVGAEE
LRVVTKRQPSEQELRDALFCWKVAKFVKSNAIVYAKNNMTIGIGAGQMSRVYSAKIAGIKAADEGLEVKGSSMASDAFFP
FRDGIDAAAAAGVTCVIQPGGSIRDDEVIAAADEHGIAMLFTDMRHFRH
>P9WHM7 ~~~purH~~~Bifunctional purine biosynthesis protein PurH~~~COG0138
MSTDDGRRPIRRALISVYDKTGLVDLAQGLSAAGVEIISTGSTAKTIADTGIPVTPVEQLTGFPEVLDGRVKTLHPRVHA
GLLADLRKSEHAAALEQLGIEAFELVVVNLYPFSQTVESGASVDDCVEQIDIGGPAMVRAAAKNHPSAAVVTDPLGYHGV
LAALRAGGFTLAERKRLASLAFQHIAEYDIAVASWMQQTLAPEHPVAAFPQWFGRSWRRVAMLRYGENPHQQAALYGDPT
AWPGLAQAEQLHGKDMSYNNFTDADAAWRAAFDHEQTCVAIIKHANPCGIAISSVSVADAHRKAHECDPLSAYGGVIAAN
TEVSVEMAEYVSTIFTEVIVAPGYAPGALDVLARKKNIRVLVAAEPLAGGSELRPISGGLLIQQSDQLDAHGDNPANWTL
ATGSPADPATLTDLVFAWRACRAVKSNAIVIAADGATVGVGMGQVNRVDAARLAVERGGERVRGAVAASDAFFPFPDGLE
TLAAAGVTAVVHPGGSVRDEEVTEAAAKAGVTLYLTGARHFAH
>P67544 ~~~purH~~~Bifunctional purine biosynthesis protein PurH~~~
MKKAILSVSNKTGIVEFAKALTQLNYELYSTGGTKRILDEANVPVRSVSDLTHFPEIMDGRVKTLHPAVHGGILADRNKP
QHLNELSEQHIDLIDMVVVNLYPFQQTVANPDVTMDEAIENIDIGGPTMLRAAAKNYKHVTTIVHPADYHEVLTRLRNDS
LDESYRQSLMIKVFEHTAEYDEAIVRFFKGDKETLRYGENPQQSAYFVRTSNAKHTIAGAKQLHGKQLSYNNIKDADATL
ALVKKFDTPAAVAVKHMNPCGVGIGDTIEQAFQHAYEADSQSIFGGIVALNRAVTPELAEQLHSIFLEVIIAPKFTDEAL
DILKQKKNVRLLEIDMTIDSNEEEFVSVSGGYLVQDKDNYVVPKEEMKVVTEVAPTDEQWEAMLLGWKVVPSVKSNAIIL
SNNKQTVGIGAGQMNRVGAAKIALERAIEINDHVALVSDGFFPMGDTVELAAQHGIKAIIQPGGSIKDQDSIDMANKHGI
AMVVTGTRHFKH
>Q9X0X6 ~~~purH~~~Bifunctional purine biosynthesis protein PurH~~~COG0138
MKRILVSLYEKEKYLDILRELHEKGWEIWASSGTAKFLKSNGIEANDVSTITGFENLLGGLVKTLHPEIFAGILGPEPRW
DVVFVDLYPPPDIDIGGVALLRAAAKNWKKVKPAFDMETLKLAIEIDDEETRKYLAGMTFAFTSVYDSIRANQFVEGISL
AFKREDLQLRYGENPHEKAFVYGKPAFEILHEGKTISFNNILDAENAWFMAKNLPRMGAVVVKHQSPCGAAIGEDKVEIV
KKAIEADDESSFGGILAVNFEMDEEVAKSLKKYLEVIVAPSFTQEAIEVLSKKKVRLLKPGDYASWAGKMAFGSLVLSER
KYPEGNFELVVGEPLSEKELEDLEFAYRVVEGAKSNAVLIAKDGVTVGIGSGQPSRKRAAWIATVMAGEKAKGAVAASDA
FFPFPDSLEILAQAGVKAVVAPLGSIRDEEVIEKARELGITFYKAPSRVFRH
>Q81JI9 6.3.4.4~~~purA~~~Adenylosuccinate synthetase~~~COG0104
MSSVVVVGTQWGDEGKGKITDFLSEHAEVVARYQGGNNAGHTIVFGGVKYKLHLIPSGIFYKEKICVIGNGLVVDPKALL
EELKYLHDRGVSTDNLRVSNRAHVILPYHLKQDELEEASKGDNKIGTTKKGIGPAYMDKAARIGIRMADLLDREAFKEKL
EQNLAQKNRLFEKMYDTEGFSVDEIFEEYFEYGQQIAQYVCDTSVVLNDALDNNHRVLFEGAQGVMLDIDHGTYPFVTSS
NPIAGGVTVGTGVGPAKVTRVVGVCKAYTSRVGDGPFPTELHDEIGHQIREVGREYGTTTGRPRRVGWFDSVVVRHARRV
SGLTDLSLNSIDVLTGIPTLKICVAYKCDGKVIDEVPANLNILAKCEPVYEELPGWTEDITGVRSLDELPENARKYVERV
SELTGIQLSMFSVGPDRNQTNIVRNVYEA
>Q2SWD3 6.3.4.4~~~purA~~~Adenylosuccinate synthetase~~~
MSASAVNVTPGRNVVVVGTQWGDEGKGKIVDWLTDHAQGVVRFQGGHNAGHTLIIGGKKTILRLIPSGIMREGVACYIGN
GVVLSPEALFKEIGELEEAGLSVRERLFISEATTLILPYHIAIDQAREARKGAGKIGTTGRGIGPAYEDKVGRRALRVQD
LFDARTFADRLRENLDFHNFVLTQYLGGAAVDFQATLDTMLGYADRLRPMVADVSRRLYEENHAGRNLLFEGAQGTLLDI
DHGTYPFVTSSNCVAGAAAAGAGVGPQKLNYILGITKAYCTRVGSGPFPSELYDADNPSRQDQIGITLANVGKEFGSVTG
RPRRTGWLDAAALRRSIQINGVSGLCMTKLDVLDGLDEVKLCVGYKIDGEDADLLPRGAAEVARCEPVYETFGGWKESTV
GINSWDALPANARAYLTRVQEVAGVPIDMVSTGPDRDETILLRHPFKV
>Q9PMG4 6.3.4.4~~~purA~~~Adenylosuccinate synthetase~~~COG0104
MSKADIIVGIQWGDEGKGKVVDKLCENYDFVCRSAGGHNAGHTIWVNGVRYALHLMPSGVLHPRCINIIGNGVVVSPEVL
IAEMAQFENLKGRLYISDRAHLNLKHHSLIDIAKEKLKGKNAIGTTGKGIGPSYADKINRTGHRVGELLEPQRLCEALIK
DFEANKTFFEMLEIEIPSAEELLADLKRFNEILTPYITDTTRMLWKALDEDKRVLLEGAQGSMLDIDHGTYPYVTSSSTI
SAGTLTGLGLNPKEAGNIIGIVKAYATRVGNGAFPTEDKGEDGEKIAQIGKEIGVSTGRKRRCGWFDAVAVRYTARLNGL
DALSLMKLDVLDGFEKIKICRAYEYKGMEIDYIPSDLENVQPIYEEMDGWDKVFGIKDYDLLPENAKKYIARLEELAGVK
VKYISTSPERDDTIIL
>P0A7D4 6.3.4.4~~~purA~~~Adenylosuccinate synthetase~~~COG0104
MGNNVVVLGTQWGDEGKGKIVDLLTERAKYVVRYQGGHNAGHTLVINGEKTVLHLIPSGILRENVTSIIGNGVVLSPAAL
MKEMKELEDRGIPVRERLLLSEACPLILDYHVALDNAREKARGAKAIGTTGRGIGPAYEDKVARRGLRVGDLFDKETFAE
KLKEVMEYHNFQLVNYYKAEAVDYQKVLDDTMAVADILTSMVVDVSDLLDQARQRGDFVMFEGAQGTLLDIDHGTYPYVT
SSNTTAGGVATGSGLGPRYVDYVLGILKAYSTRVGAGPFPTELFDETGEFLCKQGNEFGATTGRRRRTGWLDTVAVRRAV
QLNSLSGFCLTKLDVLDGLKEVKLCVAYRMPDGREVTTTPLAADDWKGVEPIYETMPGWSESTFGVKDRSGLPQAALNYI
KRIEELTGVPIDIISTGPDRTETMILRDPFDA
>P56137 6.3.4.4~~~purA~~~Adenylosuccinate synthetase~~~COG0104
MADVVVGIQWGDEGKGKIVDRIAKDYDFVVRYQGGHNAGHTIVHKGVKHSLHLMPSGVLYPKCKNIISSAVVVSVKDLCE
EISAFEDLENRLFVSDRAHVILPYHAKKDAFKEKSQNIGTTKKGIGPCYEDKMARSGIRMGDLLDDKILEEKLNAHFKAI
EPFKKAYDLGENYEKDLMGYFKTYAPKICPFIKDTTSMLIEANQKGEKILLEGAQGTLLDIDLGTYPFVTSSNTTSASAC
VSTGLNPKAINEVIGITKAYSTRVGNGPFPSEDTTPMGDHLRTKGAEFGTTTKRPRRCGWLDLVALKYACALNGCTQLAL
MKLDVLDGIDAIKVCVAYERKGERLEIFPSDLKDCVPIYQTFKGWEKSVGVRKLDDLEPNVREYIRFIEKEVGVKIRLIS
TSPEREDTIFL
>Q8RNM2 6.3.4.4~~~purA~~~Adenylosuccinate synthetase~~~COG0104
MGKNVVVLGTQWGDEGKGKIVDLLTQDAQVVVRYQGGHNAGHTLKINGVKTVLRLIPSGMLRPNVTCYIANGVVLSPQAL
LSEIKELEGNGINVRERLRISLACPLILPYHIALDKARETHMGKSAIGTTGRGIGPAYEDKVARRALRVGDLFHRDRFAN
KLTELLDYHNFVLTQYFKQPAVDLESLLGESLQWAEELRPMVCDVSACLHEHRKQGENILFEGAQGVYLDIDHGTYPYVT
SSNTCVGSVINGAGFGPRYIDYVLGITKAYTTRVGGGPFPTELLDDVGKRIAERGQEFGAVTGRPRRCGWFDAVLLKRSI
ELNSISGLCVTKLDVLDGLEVLRIAVAYKDRDGNILSRPPLAADDFNDLLPVYEELPGWQESTADVTVMSDLPANARAYL
KRIEEILGIPIDMLSTGPERDSTITLRGPFL
>A0QQH7 6.3.4.4~~~purA~~~Adenylosuccinate synthetase~~~COG0104
MPAIVLIGAQWGDEGKGKATDLLGGRVQWVVRYQGGNNAGHTVVLPTGENFALHLIPSGILTPGVTNVIGNGVVVDPGVL
LTELKGLEDRGVDTSNLLISADAHLLMPYHVAIDKVVERWAGSKKIGTTGRGIGPCYQDKIARIGIRVADVLDEQVLAEK
IEAALEFKNQVLVKIYNRKALEPAEVLENLLEQAEGFKHRIADARLLLNQALENDEAVLLEGSQGTLLDVDHGTYPFVTS
SNPTAGGAAVGSGIGPTRITTVLGILKAYTTRVGSGPFPTELFDEHGAYLAKTGGEVGVTTGRARRCGWFDAVIARYATR
VNGITDYFLTKLDVLSSLETVPVCVGYTVDGKRVDEMPMTQSDIARAEPVYEELPGWWEDISGAREFEDLPAKARDYVLR
LEELAGAYVSCIGVGPGRDQTIVRRDVLAAR
>P9WHN3 6.3.4.4~~~purA~~~Adenylosuccinate synthetase~~~COG0104
MPAIVLIGAQWGDEGKGKATDLLGGRVQWVVRYQGGNNAGHTVVLPTGENFALHLIPSGVLTPGVTNVIGNGVVIDPGVL
LNELRGLQDRGVDTAKLLISADAHLLMPYHIAIDKVTERYMGSKKIGTTGRGIGPCYQDKIARIGIRVADVLDPEQLTHK
VEAACEFKNQVLVKIYNRKALDPAQVVDALLEQAEGFKHRIADTRLLLNAALEAGETVLLEGSQGTLLDVDHGTYPYVTS
SNPTAGGAAVGSGIGPTRIGTVLGILKAYTTRVGSGPFPTELFDEHGEYLSKTGREFGVTTGRRRRCGWFDAVIARYAAR
VNGITDYFLTKLDVLSSLESVPVCVGYEIDGRRTRDMPMTQRDLCRAKPVYEELPGWWEDISGAREFDDLPAKARDYVLR
LEQLAGAPVSCIGVGPGREQTIVRRDVLQDRP
>P99099 6.3.4.4~~~purA~~~Adenylosuccinate synthetase~~~
MSSIVVVGTQWGDEGKGKITDFLAEQSDVIARFSGGNNAGHTIQFGGETYKLHLVPSGIFYKDKLAVIGNGVVVDPVALL
KELDGLNERGIPTSNLRISNRAQVILPYHLAQDEYEERLRGDNKIGTTKKGIGPAYVDKVQRIGIRMADLLEKETFERLL
KSNIEYKQAYFKGMFNETCPSFDDIFEEYYAAGQRLKEFVTDTSKILDDAFVADEKVLFEGAQGVMLDIDHGTYPFVTSS
NPIAGNVTVGTGVGPTFVSKVIGVCKAYTSRVGDGPFPTELFDEDGHHIREVGREYGTTTGRPRRVGWFDSVVLRHSRRV
SGITDLSINSIDVLTGLDTVKICTAYELDGKEITEYPANLDQLKRCKPIFEELPGWTEDVTSVRTLEELPENARKYLERI
SELCNVQISIFSVGPDREQTNLLKELW
>P65887 6.3.4.4~~~purA~~~Adenylosuccinate synthetase~~~COG0104
MTSVVVVGTQWGDEGKGKITDFLSANAEVIARYQGGDNAGHTIVIDGKKFKLHLIPSGIFFPEKISVIGNGMVVNPKSLV
KELSYLHEEGVTTDNLRISDRAHVILPYHIELDRLQEEAKGDNKIGTTIKGIGPAYMDKAARVGIRIADLLDKDIFRERL
ERNLAEKNRLFEKLYDSKAIVFDDIFEEYYEYGQQIKKYVIDTSVILNDALDNGKRVLFEGAQGVMLDIDQGTYPFVTSS
NPVAGGVTIGSGVGPSKIDKVVGVCKAYTSRVGDGPFPTELFDEVGERIREVGHEYGTTTGRPRRVGWFDSVVMRHSRRV
SGITNLSLNSIDVLSGLDTVKICVAYDLDGQRIDYYPASLEQLKRCKPIYEELPGWSEDITGVRNLEDLPENARNYVRRV
SELVGVRISTFSVGPGREQTNILESVWS
>Q5SLS1 6.3.4.4~~~purA~~~Adenylosuccinate synthetase~~~COG0104
MPGIAIIGAQWGDEGKGKVVDVLAREADYVIRYQGGANAGHTVVAEGKVFKLNLLPSGVIHPHAVNVLGDGMVIDPFRFQ
EEVEGLRKEGFDPKILVSERAHLVLPHHKHVESRHNFVGTTGRGIGPAYSDRARRVGIRAGDLLDEATLRERVRRLLAEK
PNSTREAGWDTEEKALADLHRMREILSPYIADTGSLLREAWRKGKRLLFEGAQATLLDLNYGTYPYVTSSHPTVGGILVG
TGLSHKAITKVYGVAKAYTTRVGEGPFPTELQGELAHHLREKGGEYGTTTGRPRRVGWLDLVALRYACEVNGFDGLVLTK
LDVLSGLEKVKVAVEYLDGARPGEASPEAVRYLELPGWGDLSHVKRREDLPANLLRYLELVEEHTGVPVVLFSTSPRRED
TFGAVSWV
>Q8ZIV7 6.3.4.4~~~purA~~~Adenylosuccinate synthetase~~~COG0104
MGKNVVVLGTQWGDEGKGKVVDLLTERAKYVVRYQGGHNAGHTLVINGEKTVLHLIPSGILRENVISIIGNGVVLAPDAL
MKEMTELEARGVPVRERLLLSEACPLILPYHVALDNAREKARGAKAIGTTGRGIGPAYEDKVARRGLRVSDLFNKETFAI
KLKEIVEYHNFQLVHYYKEAAVDYQKVLDDVLAIADILTAMVVDVSELLDNARKQGELIMFEGAQGTLLDIDHGTYPYVT
SSNTTAGGVATGSGLGPRYVDYVLGIVKAYSTRVGAGPFPTELNDETGEFLRKQGNEYGATTGRSRRTGWLDIVAVRRAV
QINSLSGFCMTKLDVLDGLKEVKLCVGYRMPDGREVDTTPLAAEGWEGIEPIYETMPGWSETTFGVKEHSKLPQAALNYI
QRVEELTGVPIDIISTGPDRDETMILRDPFDA
>P0AG18 5.4.99.18~~~purE~~~N5-carboxyaminoimidazole ribonucleotide mutase~~~COG0041
MSSRNNPARVAIVMGSKSDWATMQFAAEIFEILNVPHHVEVVSAHRTPDKLFSFAESAEENGYQVIIAGAGGAAHLPGMI
AAKTLVPVLGVPVQSAALSGVDSLYSIVQMPRGIPVGTLAIGKAGAANAALLAAQILATHDKELHQRLNDWRKAQTDEVL
ENPDPRGAA
>P9WHM1 5.4.99.18~~~purE~~~N5-carboxyaminoimidazole ribonucleotide mutase~~~COG0041
MTPAGERPRVGVIMGSDSDWPVMADAAAALAEFDIPAEVRVVSAHRTPEAMFSYARGAAERGLEVIIAGAGGAAHLPGMV
AAATPLPVIGVPVPLGRLDGLDSLLSIVQMPAGVPVATVSIGGAGNAGLLAVRMLGAANPQLRARIVAFQDRLADVVAAK
DAELQRLAGKLTRD
>Q9WYS7 5.4.99.18~~~purE~~~N5-carboxyaminoimidazole ribonucleotide mutase~~~COG0041
MPRVGIIMGSDSDLPVMKQAAEILEEFGIDYEITIVSAHRTPDRMFEYAKNAEERGIEVIIAGAGGAAHLPGMVASITHL
PVIGVPVKTSTLNGLDSLFSIVQMPGGVPVATVAINNAKNAGILAASILGIKYPEIARKVKEYKERMKREVLEKAQRLEQ
IGYKEYLNQKE
>Q73PV9 4.1.1.21~~~purE~~~Phosphoribosylaminoimidazole carboxylase~~~COG0041
MRPLVIILMGSSSDMGHAEKIASELKTFGIEYAIRIGSAHKTAEHVVSMLKEYEALDRPKLYITIAGRSNALSGFVDGFV
KGATIACPPPSDSFAGADIYSSLRMPSGISPALVLEPKNAALLAARIFSLYDKEIADSVKSYMESNAQKIIEDDSKLKR
>O66608 6.3.4.18~~~purK~~~N5-carboxyaminoimidazole ribonucleotide synthase~~~COG0026
MLTVGILGGGQLGWMTILEGRKLGFKFHVLEDKENAPACRVADRCFRTGQISEFVDSCDIITYEFEHIKDEVLEKCESKL
IPNPQALYVKKSRIREKLFLKKHGFPVPEFLVIKRDEIIDALKSFKLPVVIKAEKLGYDGKGQYRIKKLEDANQVVKNHD
KEESFIIEEFVKFEAEISCIGVRDREGKTYFYPQPFNKHEEGILIYNYVPYAKLKEAEEITKRLMELLDIVGVFTVEFFL
LKDGRVLINEFAPRVHNTGHWTLDGAYTSQFENLLRAITEMPLGSTELKLPSGMVNILGKSYEEIPLKEILSVEGAKLYW
YGKEKKPRRKVGHVNVVGRSKEEVVEKVERVFTLLKGSREKLPAP
>P09029 6.3.4.18~~~purK~~~N5-carboxyaminoimidazole ribonucleotide synthase~~~COG0026
MKQVCVLGNGQLGRMLRQAGEPLGIAVWPVGLDAEPAAVPFQQSVITAEIERWPETALTRELARHPAFVNRDVFPIIADR
LTQKQLFDKLHLPTAPWQLLAERSEWPAVFDRLGELAIVKRRTGGYDGRGQWRLRANETEQLPAECYGECIVEQGINFSG
EVSLVGARGFDGSTVFYPLTHNLHQDGILRTSVAFPQANAQQQAQAEEMLSAIMQELGYVGVMAMECFVTPQGLLINELA
PRVHNSGHWTQNGASISQFELHLRAITDLPLPQPVVNNPSVMINLIGSDVNYDWLKLPLVHLHWYDKEVRPGRKVGHLNL
TDSDTSRLTATLEALIPLLPPEYASGVIWAQSKFG
>P9WHL9 6.3.4.18~~~purK~~~N5-carboxyaminoimidazole ribonucleotide synthase~~~COG0026
MMAVASSRTPAVTSFIAPLVAMVGGGQLARMTHQAAIALGQNLRVLVTSADDPAAQVTPNVVIGSHTDLAALRRVAAGAD
VLTFDHEHVPNELLEKLVADGVNVAPSPQALVHAQDKLVMRQRLAAAGVAVPRYAGIKDPDEIDVFAARVDAPIVVKAVR
GGYDGRGVRMARDVADARDFARECLADGVAVLVEERVDLRRELSALVARSPFGQGAAWPVVQTVQRDGTCVLVIAPAPAL
PDDLATAAQRLALQLADELGVVGVLAVELFETTDGALLVNELAMRPHNSGHWTIDGARTSQFEQHLRAVLDYPLGDSDAV
VPVTVMANVLGAAQPPAMSVDERLHHLFARMPDARVHLYGKAERPGRKVGHINFLGSDVAQLCERAELAAHWLSHGRWTD
GWDPHRASDDAVGVPPACGGRSDEEERRL
>Q7A695 6.3.4.18~~~purK~~~N5-carboxyaminoimidazole ribonucleotide synthase~~~
MNFNKLKFGATIGIIGGGQLGKMMAQSAQKMGYKVVVLDPSEDCPCRYVAHEFIQAKYDDEKALNQLGQKCDVITYEFEN
ISAQQLKLLCEKYNIPQGYQAIQLLQDRLTEKETLKSAGTKVVPFISVKESTDIDKAIETLGYPFIVKTRFGGYDGKGQV
LINNEKDLQEGFKLIETSECVAEKYLNIKKEVSLTVTRGNNNQITFFPLQENEHRNQILFKTIVPARIDKTAEAKEQVNK
IIQSIHFIGTFTVEFFIDSNNQLYVNEIAPRPHNSGHYSIEACDYSQFDTHILAVTGQSLPNSIELLKPAVMMNLLGKDL
DLLENEFNEHPEWHLHIYGKSGRKDSRKMGHMTVLTNDVNQTEQDMYAKFEGSN
>P12042 6.3.5.3~~~purL~~~Phosphoribosylformylglycinamidine synthase subunit PurL~~~COG0046
MSLLLEPSKEQIKEEKLYQQMGVSDDEFALIESILGRLPNYTEIGIFSVMWSEHCSYKNSKPILRKFPTSGERVLQGPGE
GAGIVDIGDNQAVVFKIESHNHPSALEPYQGAATGVGGIIRDVFSMGARPIAVLNSLRFGELTSPRVKYLFEEVVAGIAG
YGNCIGIPTVGGEVQFDSSYEGNPLVNAMCVGLINHEDIKKGQAKGVGNTVMYVGAKTGRDGIHGATFASEEMSDSSEEK
RSAVQVGDPFMEKLLLEACLEVIQCDALVGIQDMGAAGLTSSSAEMASKAGSGIEMNLDLIPQRETGMTAYEMMLSESQE
RMLLVIERGREQEIIDIFDKYDLEAVSVGHVTDDKMLRLTHKGEVVCELPVDALAEEAPVYHKPSQEPAYYREFLETDVP
APQIEDANEMLKALLQQPTIASKEWVYDQYDYMVRTNTVVAPGSDAGVLRIRGTKKALAMTTDCNARYLYLDPEVGGKIA
VAEAARNIICSGAEPLAVTDNLNFGNPEKPEIFWQIEKAADGISEACNVLSTPVIGGNVSLYNESNGTAIYPTPVIGMVG
LIEDTAHITTQHFKQAGDLVYVIGETKPEFAGSELQKMTEGRIYGKAPQIDLDVELSRQKALLDAIKKGFVQSAHDVSEG
GLGVAIAESVMTTENLGANVTVEGEAALLFSESQSRFVVSVKKEHQAAFEATVKDAVHIGEVTADGILAIQNQDGQQMIH
AQTKELERVWKGAIPCLLKSKA
>P9WHL7 6.3.5.3~~~purL~~~Phosphoribosylformylglycinamidine synthase subunit PurL~~~COG0046
MSPLARTPRKTSVLDTVEHAATTPDQPQPYGELGLKDDEYRRIRQILGRRPTDTELAMYSVMWSEHCSYKSSKVHLRYFG
ETTSDEMRAAMLAGIGENAGVVDIGDGWAVTFKVESHNHPSYVEPYQGAATGVGGIVRDIMAMGARPVAVMDQLRFGAAD
APDTRRVLDGVVRGIGGYGNSLGLPNIGGETVFDPCYAGNPLVNALCVGVLRQEDLHLAFASGAGNKIILFGARTGLDGI
GGVSVLASDTFDAEGSRKKLPSVQVGDPFMEKVLIECCLELYAGGLVIGIQDLGGAGLSCATSELASAGDGGMTIQLDSV
PLRAKEMTPAEVLCSESQERMCAVVSPKNVDAFLAVCRKWEVLATVIGEVTDGDRLQITWHGETVVDVPPRTVAHEGPVY
QRPVARPDTQDALNADRSAKLSRPVTGDELRATLLALLGSPHLCSRAFITEQYDRYVRGNTVLAEHADGGMLRIDESTGR
GIAVSTDASGRYTLLDPYAGAQLALAEAYRNVAVTGATPVAVTNCLNFGSPEDPGVMWQFTQAVRGLADGCADLGIPVTG
GNVSFYNQTGSAAILPTPVVGVLGVIDDVRRRIPTGLGAEPGETLMLLGDTRDEFDGSVWAQVTADHLGGLPPVVDLARE
KLLAAVLSSASRDGLVSAAHDLSEGGLAQAIVESALAGETGCRIVLPEGADPFVLLFSESAGRVLVAVPRTEESRFRGMC
EARGLPAVRIGVVDQGSDAVEVQGLFAVSLAELRATSEAVLPRYFG
>P65901 6.3.5.3~~~purL~~~Phosphoribosylformylglycinamidine synthase subunit PurL~~~
MSKFIEPSVEEIKLEKVYQDMGLSDQEYEKVCDILGRQPNFTETGIFSVMWSEHCSYKHSKPFLKQFPTSGEHVLMGPGE
GAGVVDIGDNQAVVFKVESHNHPSAIEPYQGAATGVGGIIRDIVSIGARPINLLNSLRFGELDNKQNQRLLKGVVKGIGG
YGNCIGIPTTAGEIEFDERYDGNPLVNAMCVGVINHDMIQKGTAKGVGNSVIYVGLKTGRDGIHGATFASEELTEESESK
RPSVQIGDPFVGKKLMEATLEAITFDELVGIQDMGAAGLTSSSSEMAAKGGSGLHLRLEQVPTREPGISPYEMMLSETQE
RMLLVVEKGNEQKFLDLFDKHELDSAVIGEVTDTNRFVLTYDDEVYADIPVEPLADEAPVYILEGEEKDYNTSKNDYTHI
DVKDTFFKLLKHPTIASKHYLYDQYDQQVGANTIIKPGLQASVVRVEGTNKAIASTIDGEARYVYNNPYEGGKMVVAEAY
RNLIAVGATPLAMTDCLNYGSPEKKEIYQQLIDSTKGMAEACDILKTPVVSGNVSLYNETKGTSIFPTPVVGMVGLIENV
NYLNDFEPQVGDKLYLIGDTKDDFGGSQLEKLIYGKVNHEFESLDLSSEVEKGESIKTAIREGLLSHVQTVGKGGLLITL
AKLSAHYGLGLKSSIDITNAQLFSETQGRYVVSVKSGKTLNIDNAIEIGLLTDSDNFKVTTPYTEISENVSDIKQIWEGA
IAQCLTTQD
>Q9X0X3 6.3.5.3~~~purL~~~Phosphoribosylformylglycinamidine synthase subunit PurL~~~COG0046
MKLRYLNILKEKLGREPTFVELQAFSVMWSEHCGYSHTKKYIRRLPKTGFEGNAGVVNLDDYYSVAFKIESHNHPSAIEP
YNGAATGVGGIIRDVLAMGARPTAIFDSLHMSRIIDGIIEGIADYGNSIGVPTVGGELRISSLYAHNPLVNVLAAGVVRN
DMLVDSKASRPGQVIVIFGGATGRDGIHGASFASEDLTGDKATKLSIQVGDPFAEKMLIEAFLEMVEEGLVEGAQDLGAG
GVLSATSELVAKGNLGAIVHLDRVPLREPDMEPWEILISESQERMAVVTSPQKASRILEIARKHLLFGDVVAEVIEEPVY
RVMYRNDLVMEVPVQLLANAPEEDIVEYTPGKIPEFKRVEFEEVNAREVFEQYDHMVGTDTVVPPGFGAAVMRIKRDGGY
SLVTHSRADLALQDTYWGTLIAVLESVRKTLSVGAEPLAITNCVNYGDPDVDPVGLSAMMTALKNACEFSGVPVASGNAS
LYNTYQGKPIPPTLVVGMLGKVNPQKVAKPKPSKVFAVGWNDFELEREKELWRAIRKLSEEGAFILSSSQLLTRTHVETF
REYGLKIEVKLPEVRPAHQMVLVFSERTPVVDVPVKEIGTLSR
>Q5SMH8 6.3.5.3~~~purL~~~Phosphoribosylformylglycinamidine synthase subunit PurL~~~COG0046
MEALAKEIGIPEGEYREIVQRLGREPNRVELLLFKVMWSEHCAYKNSRPLLKALPKEGEAVLQGPGENAGVVRVGEGWAV
AFKIESHNHPSAVEPFQGAATGVGGILRDIMSMGARPIALLDSLRFGPPEEARSRYLLKGVVSGIAFYGNAIGVPTVGGD
LYFHEGYRENPLVNAMCLGLLREEHLKRSRASLGRPIYYAGAKTGRDGIGGAAFASRELKEEKAEDRPAVQVGDPFLGKL
LMEATLEAIELDLVEGVQDMGAAGLTSSLSELAHKSGLGVELHLDLVPTREEGMTPEELLLSESQERMVLVPKEGKEKAL
EEVFGRWGLDCVPVARTIPERVFRVLFRGEVVAEVPTEALAEAPTYVRVGREDPEVRRLRETPIPPLEADPQEVLRRLLA
SPNLASREAVYERYDHQVGTRTALLPGKGDAAVLWIKGTRLGVAAKVDQNPRYSRLHPRLGAMHALAEACRNVSVVGAKP
LAYTDGLNLGSPETPEGYHELAETIAGLKEASEALGVPVVSGNVSLYNESGGKRIPPTAMVGVVGVLEVDKRAEMGFRRP
GEVLLLIGEERGELGASEVLYLLTGKEFGHPPRLDLGREKAVQEAIRDLIQRGLTRTAHDVAEGGLLLALAEMTFPYGVG
ATVEVREEGLEALFGEAPSRVLFTVEKTRLQEATLLLEERGLPYRVLGETGGKSLTVLTPGGVLEWSLEELLSAWKAPLR
EVLDG
>Q89ZI8 2.4.2.1~~~~~~Purine nucleoside phosphorylase BT_4389~~~COG1496
MISITKDKRMLGYESLSSYSNISHFVTTRQGGCSEGNYASFNCTPYSGDEAEKVRRNQTLLMEGMSQIPEELVIPVQTHE
TNYLLIGDAYLSASSQQRQEMLHGVDALITREPGYCLCISTADCVPVLVYDKKHGAIAAIHAGWRGTVAYIVRDTLLRME
KEFGTSGEDVVACIGPSISLASFEVGEEVYEAFQKNGFDMPRISIRKEETGKHHIDLWEANRMQILAFGVPSGQVELARI
CTYIHHDEFFSARRLGIKSGRILSGIMIHK
>P33644 2.4.2.1~~~yfiH~~~Purine nucleoside phosphorylase YfiH~~~COG1496
MSKLIVPQWPQPKGVAACSSTRIGGVSLPPYDSLNLGAHCGDNPDHVEENRKRLFAAGNLPSKPVWLEQVHGKDVLKLTG
EPYASKRADASYSNTPGTVCAVMTADCLPVLFCNRAGTEVAAAHAGWRGLCAGVLEETVSCFADNPENILAWLGPAIGPR
AFEVGGEVREAFMAVDAKASAAFIQHGDKYLADIYQLARQRLANVGVEQIFGGDRCTYTENETFFSYRRDKTTGRMASFI
WLI
>P84138 2.4.2.1~~~ylmD~~~Purine nucleoside phosphorylase YlmD~~~
MPDIFQQEARGWLRCGAPPFAGAVAGLTTKHGGESKGPFASLNMGLHVGDDRTDVVNNRRRLAEWLAFPLERWVCCEQVH
GADIQKVTKSDRGNGAQDFATAVPGVDGLYTDEAGVLLALCFADCVPIYFVAPSAGLVGLAHAGWRGTAGGIAGHMVWLW
QTREHIAPSDIYVAIGPAIGPCCYTVDDRVVDSLRPTLPPESPLPWRETSPGQYALDLKEANRLQLLAAGVPNSHIYVSE
RCTSCEEALFFSHRRDRGTTGRMLAFIGRREEWT
>P9WKD5 2.4.2.1~~~~~~Purine nucleoside phosphorylase Rv2149c~~~COG1496
MLASTRHIARGDTGNVSVRIRRVTTTRAGGVSAPPFDTFNLGDHVGDDPAAVAANRARLAAAIGLPGNRVVWMNQVHGDR
VELVDQPRNTALDDTDGLVTATPRLALAVVTADCVPVLMADARAGIAAAVHAGRAGAQRGVVVRALEVMLSLGAQVRDIS
ALLGPAVSGRNYEVPAAMADEVEAALPGSRTTTAAGTPGVDLRAGIACQLRDLGVESIDVDPRCTVADPTLFSHRRDAPT
GRFASLVWME
>A0A384KG77 2.4.2.1~~~yfiH~~~Purine nucleoside phosphorylase YfiH~~~
MSKLIVPQWPLPKGVAACSSTRIGGVSLPPYDSLNLGAHCGDNPDHVEENRKRLFAAGNLPSKPVWLEQVHGKDVLKLTG
EPYASKRADASYSNTPGTVCAVMTADCLPVLFCNRAGTEVAAVHAGWRGLCAGVLEETVSCFADKPENILAWLGPAIGPR
AFEVGAEVREAFMAVDAKASAAFIQHGDKYLADIYQLARQRLANVGVEQIFGGDRCTYTENETFFSYRRDKTTGRMASFI
WLI
>Q1EIR0 3.5.4.4~~~rl5~~~Adenosine deaminase RL5~~~
MIELEKLDFAKSVEGVEAFSTTRGQVDGRNAYSGVNLCDYVGDDALRVLDARLTLAMQLGVDLDDLVMPRQTHSCRVAVI
DERFRALDIDEQEAALEGVDALVTRLQGIVIGVNTADCVPIVLVDSQAGIVAVSHAGWRGTVGRIAKAVVEEMCRQGATV
DRIQAAMGPSICQDCFEVGDEVVEAFKKAHFNLNDIVVRNPATGKAHIDLRAANRAVLVAAGVPAANIVESQHCSRCEHT
SFFSARRLGINSGRTFTGIYRK
>P12041 6.3.5.3~~~purQ~~~Phosphoribosylformylglycinamidine synthase subunit PurQ~~~COG0047
MKFAVIVLPGSNCDIDMYHAVKDELGHEVEYVWHEETSLDGFDGVLIPGGFSYGDYLRCGAIARFANIMPAVKQAAAEGK
PVLGVCNGFQILQELGLLPGAMRRNKDLKFICRPVELIVQNDETLFTASYEKGESITIPVAHGEGNFYCDDETLATLKEN
NQIAFTYGSNINGSVSDIAGVVNEKGNVLGMMPHPERAVDELLGSADGLKLFQSIVKNWRETHVTTA
>P9WHL5 6.3.5.3~~~purQ~~~Phosphoribosylformylglycinamidine synthase subunit PurQ~~~COG0047
MTARIGVVTFPGTLDDVDAARAARQVGAEVVSLWHADADLKGVDAVVVPGGFSYGDYLRAGAIARFAPVMDEVVAAADRG
MPVLGICNGFQVLCEAGLLPGALTRNVGLHFICRDVWLRVASTSTAWTSRFEPDADLLVPLKSGEGRYVAPEKVLDELEG
EGRVVFRYHDNVNGSLRDIAGICSANGRVVGLMPHPEHAIEALTGPSDDGLGLFYSALDAVLTG
>P99166 6.3.5.3~~~purQ~~~Phosphoribosylformylglycinamidine synthase subunit PurQ~~~
MKFAVLVFPGSNCDRDMFNAAIKSGVEAEYVDYRETSLSGFDGVLIPGGFSFGDYLRSGAMASVAPIISEVKRLATEGKP
VLGVCNGFQILTEIGLLPGALLHNDSHLFISRNEELEIVNNQTAFTNLYEQGEKVIYPVAHGEGHYYCTDEIYQQLKANN
QIILKYVNNPNGSYDDIAGIVNEKGNVCGMMPHPERALETLLGTDSGVKLFEAMVKSWREQHV
>Q55843 6.3.5.3~~~purQ~~~Phosphoribosylformylglycinamidine synthase subunit PurQ~~~COG0047
MTSFGIIVFPGSNCDRDIATVTAGLLDQPTRFIWHQETDLHGVDVVVLPGGFSYGDYLRCGAIARFSPIMTAIIDHANAG
KRVLGICNGFQVLTEVGLLPGALIRNRDLHFICDRVTVRVESNQTVWTKGYQSQQVITLPIAHGEGRYFADGDTLKALED
NEQILFRYSNAQGELTTDSNPNGSLHNIAGITNVQGNVLGMMPHPERAADRLLKATDGLAMFIS
>Q9X0X2 6.3.5.3~~~purQ~~~Phosphoribosylformylglycinamidine synthase subunit PurQ~~~COG0047
MKPRACVVVYPGSNCDRDAYHALEINGFEPSYVGLDDKLDDYELIILPGGFSYGDYLRPGAVAAREKIAFEIAKAAERGK
LIMGICNGFQILIEMGLLKGALLQNSSGKFICKWVDLIVENNDTPFTNAFEKGEKIRIPIAHGFGRYVKIDDVNVVLRYV
KDVNGSDERIAGVLNESGNVFGLMPHPERAVEELIGGEDGKKVFQSILNYLKR
>P37551 ~~~purR~~~Pur operon repressor~~~COG0503
MKFRRSGRLVDLTNYLLTHPHELIPLTFFSERYESAKSSISEDLTIIKQTFEQQGIGTLLTVPGAAGGVKYIPKMKQAEA
EEFVQTLGQSLANPERILPGGYVYLTDILGKPSVLSKVGKLFASVFAEREIDVVMTVATKGIPLAYAAASYLNVPVVIVR
KDNKVTEGSTVSINYVSGSSNRIQTMSLAKRSMKTGSNVLIIDDFMKAGGTINGMINLLDEFNANVAGIGVLVEAEGVDE
RLVDEYMSLLTLSTINMKEKSIEIQNGNFLRFFKDNLLKNGETES
>P0ACP7 ~~~purR~~~HTH-type transcriptional repressor PurR~~~COG1609
MATIKDVAKRANVSTTTVSHVINKTRFVAEETRNAVWAAIKELHYSPSAVARSLKVNHTKSIGLLATSSEAAYFAEIIEA
VEKNCFQKGYTLILGNAWNNLEKQRAYLSMMAQKRVDGLLVMCSEYPEPLLAMLEEYRHIPMVVMDWGEAKADFTDAVID
NAFEGGYMAGRYLIERGHREIGVIPGPLERNTGAGRLAGFMKAMEEAMIKVPESWIVQGDFEPESGYRAMQQILSQPHRP
TAVFCGGDIMAMGALCAADEMGLRVPQDVSLIGYDNVRNARYFTPALTTIHQPKDSLGETAFNMLLDRIVNKREEPQSIE
VHPRLIERRSVADGPFRDYRR
>P12049 6.3.5.3~~~purS~~~Phosphoribosylformylglycinamidine synthase subunit PurS~~~COG1828
MYKVKVYVSLKESVLDPQGSAVQHALHSMTYNEVQDVRIGKYMELTIEKSDRDLDVLVKEMCEKLLANTVIEDYRYEVEE
VVAQ
>Q9X0X1 6.3.5.3~~~purS~~~Phosphoribosylformylglycinamidine synthase subunit PurS~~~COG1828
MPLFKFAIDVQYRSNVRDPRGETIERVLREEKGLPVKKLRLGKSIHLEVEAENKEKAYEIVKKACEELLVNPVVEEYEVR
EL
>P33221 6.3.1.21~~~purT~~~Formate-dependent phosphoribosylglycinamide formyltransferase~~~COG0027
MTLLGTALRPAATRVMLLGSGELGKEVAIECQRLGVEVIAVDRYADAPAMHVAHRSHVINMLDGDALRRVVELEKPHYIV
PEIEAIATDMLIQLEEEGLNVVPCARATKLTMNREGIRRLAAEELQLPTSTYRFADSESLFREAVADIGYPCIVKPVMSS
SGKGQTFIRSAEQLAQAWKYAQQGGRAGAGRVIVEGVVKFDFEITLLTVSAVDGVHFCAPVGHRQEDGDYRESWQPQQMS
PLALERAQEIARKVVLALGGYGLFGVELFVCGDEVIFSEVSPRPHDTGMVTLISQDLSEFALHVRAFLGLPVGGIRQYGP
AASAVILPQLTSQNVTFDNVQNAVGADLQIRLFGKPEIDGSRRLGVALATAESVVDAIERAKHAAGQVKVQG
>P46927 6.3.1.21~~~purT~~~Formate-dependent phosphoribosylglycinamide formyltransferase~~~
MTTIGTPLRTNATKVMMLGSGELGKEVVIELQPLGVEVIAVDRYDNAPAQQVAHRAYTISMLDGNALRDLVEKEKPDFIV
PEVEAIATATLVELEQEGYNVIPTAKATQLTMNREGIRRLAAEELGLKTSPYRFVDNFEQFQQAIQEIGIPCVVKPIMSS
SGHGQSVIKSEADIQQAWDYSQQGGRAGGGRVIVEGFIKFDYEITQLTVRHIHGIVFSSHRHIQVDGDYRESWQPQQMSD
IALKKAQETAEKITSALGGRGIFGVELFVCGDEIIFNEVSPRPHDTGIVTMASQELSQFALHARAILGLPIPEIYRISPA
ASKAIVVEGKSDNVRFGGVDKVLAEIGTNIRLFGKGEVNGHRRLGVILARDENTVRALETSRRAYDKLDIQL
>P37051 3.5.1.10~~~purU~~~Formyltetrahydrofolate deformylase~~~COG0788
MHSLQRKVLRTICPDQKGLIARITNICYKHELNIVQNNEFVDHRTGRFFMRTELEGIFNDSTLLADLDSALPEGSVRELN
PAGRRRIVILVTKEAHCLGDLLMKANYGGLDVEIAAVIGNHDTLRSLVERFDIPFELVSHEGLTRNEHDQKMADAIDAYQ
PDYVVLAKYMRVLTPEFVARFPNKIINIHHSFLPAFIGARPYHQAYERGVKIIGATAHYVNDNLDEGPIIMQDVIHVDHT
YTAEDMMRAGRDVEKNVLSRALYKVLAQRVFVYGNRTIIL
>P9WHM3 3.5.1.10~~~purU~~~Formyltetrahydrofolate deformylase~~~COG0788
MGKGSMTAHATPNEPDYPPPPGGPPPPADIGRLLLRCHDRPGIIAAVSTFLARAGANIISLDQHSTAPEGGTFLQRAIFH
LPGLTAAVDELQRDFGSTVADKFGIDYRFAEAAKPKRVAIMASTEDHCLLDLLWRNRRGELEMSVVMVIANHPDLAAHVR
PFGVPFIHIPATRDTRTEAEQRQLQLLSGNVDLVVLARYMQILSPGFLEAIGCPLINIHHSFLPAFTGAAPYQRARERGV
KLIGATAHYVTEVLDEGPIIEQDVVRVDHTHTVDDLVRVGADVERAVLSRAVLWHCQDRVIVHHNQTIVF
>P09546 ~~~putA~~~Bifunctional protein PutA~~~COG0506
MGTTTMGVKLDDATRERIKSAATRIDRTPHWLIKQAIFSYLEQLENSDTLPELPALLSGAANESDEAPTPAEEPHQPFLD
FAEQILPQSVSRAAITAAYRRPETEAVSMLLEQARLPQPVAEQAHKLAYQLADKLRNQKNASGRAGMVQGLLQEFSLSSQ
EGVALMCLAEALLRIPDKATRDALIRDKISNGNWQSHIGRSPSLFVNAATWGLLFTGKLVSTHNEASLSRSLNRIIGKSG
EPLIRKGVDMAMRLMGEQFVTGETIAEALANARKLEEKGFRYSYDMLGEAALTAADAQAYMVSYQQAIHAIGKASNGRGI
YEGPGISIKLSALHPRYSRAQYDRVMEELYPRLKSLTLLARQYDIGINIDAEESDRLEISLDLLEKLCFEPELAGWNGIG
FVIQAYQKRCPLVIDYLIDLATRSRRRLMIRLVKGAYWDSEIKRAQMDGLEGYPVYTRKVYTDVSYLACAKKLLAVPNLI
YPQFATHNAHTLAAIYQLAGQNYYPGQYEFQCLHGMGEPLYEQVTGKVADGKLNRPCRIYAPVGTHETLLAYLVRRLLEN
GANTSFVNRIADTSLPLDELVADPVTAVEKLAQQEGQTGLPHPKIPLPRDLYGHGRDNSAGLDLANEHRLASLSSALLNS
ALQKWQALPMLEQPVAAGEMSPVINPAEPKDIVGYVREATPREVEQALESAVNNAPIWFATPPAERAAILHRAAVLMESQ
MQQLIGILVREAGKTFSNAIAEVREAVDFLHYYAGQVRDDFANETHRPLGPVVCISPWNFPLAIFTGQIAAALAAGNSVL
AKPAEQTPLIAAQGIAILLEAGVPPGVVQLLPGRGETVGAQLTGDDRVRGVMFTGSTEVATLLQRNIASRLDAQGRPIPL
IAETGGMNAMIVDSSALTEQVVVDVLASAFDSAGQRCSALRVLCLQDEIADHTLKMLRGAMAECRMGNPGRLTTDIGPVI
DSEAKANIERHIQTMRSKGRPVFQAVRENSEDAREWQSGTFVAPTLIELDDFAELQKEVFGPVLHVVRYNRNQLPELIEQ
INASGYGLTLGVHTRIDETIAQVTGSAHVGNLYVNRNMVGAVVGVQPFGGEGLSGTGPKAGGPLYLYRLLANRPESALAV
TLARQDAKYPVDAQLKAALTQPLNALREWAANRPELQALCTQYGELAQAGTQRLLPGPTGERNTWTLLPRERVLCIADDE
QDALTQLAAVLAVGSQVLWPDDALHRQLVKALPSAVSERIQLAKAENITAQPFDAVIFHGDSDQLRALCEAVAARDGTIV
SVQGFARGESNILLERLYIERSLSVNTAAAGGNASLMTIG
>P10503 ~~~putA~~~Bifunctional protein PutA~~~
MGTTTMGVKLDDATRERIKMAASRIDRTPHWLIKQAIFSYLDKLENSDTLPELPALFVGAANESEEPVAPQDEPHQPFLE
FAEQILPQSVSRAAITAAWRRPETDAVSMLMEQARLSPPVAEQAHKLAYQLAEKLRNQKSASGRAGMVQGLLQEFSLSSQ
EGVALMCLAEALLRIPDKATRDALIRDKISNGNWQSHIGRSPSLFVNAATWGLLFTGRLVSTHNEANLSRSLNRIIGKSG
EPLIRKGVDMAMRLMGEQFVTGETIAQALANARKLEEKGFRYSYDMLGEAALTAADAQAYMVSYQQAIHAIGKASNGRGI
YEGPGISIKLSALHPRYSRAQYDRVMEELYPRLKSLTLLARQYDIGLNIDAEEADRLEISLDLLEKLCFEPELAGWNGIG
FVIQAYQKRCPLVIDYLVDLASRSRRRLMIRLVKGAYWDSEIKRAQMEGLEGYPVYTRKVYTDVSYLACAKKLLAVPNLI
YPQFATHNAHTLAAIYHLAGQNYYPGQYEFQCLHGMGEPLYEQVTGKVADGKLNRPCRIYAPVGTHETLLAYLVRRLLEN
GANTSFVNRIADATLPLDELVADPVEAVEKLAQQEGQAGIPHPKIPLPRDLYGEGRINSAGLDLANEHRLASLSSALLSN
AMQKWQAKPVLEQPVADGEMTPVINPAEPKDIVGWGREATESEVEQALQNAVNQAPVWFATPPQERAAILQRAAVLMEDQ
MQQLIGLLVREAGKTFSNAIAEVREAVDFLHYYAGQVRDDFDNETHRPLGPVVCISPWNFPLAIFTGQIAAALAAGNSVL
AKPAEQTSLIAAQGIAILLEAGVPPGVVQLLPGRGETVGAQLTADARVRGVMFTGSTEVATLLQRNIATRLDAQGRPIPL
IAETGGMNAMIVDSSALTEQVVVDVLASAFDSAGQRCSALRVLCLQDDIAEHTLKMLRGAMAECRMGNPGRLTTDIGPVI
DSEAKANIERHIQTMRAKGRPVFQAARENSDDAQEWQTGTFVMPTLIELENFAELEKEVFGPVLHVVRYNRNQLAELIEQ
INASGYGLTLGVHTRIDETIAQVTGSAHVGNLYVNRNMVGAVVGVQPFGGEGLSGTGPKAGGPLYLYRLLAHRPPNALNT
TLTRQDARYPVDAQLKTTLLAPLTALTQWAADRPALQTLCRQFADLAQAGTQRLLPGPTGERNTWTLLPRERVLCLADDE
QDALTQLAAVLAVGSQALWSDDAFHRDLAKRLPAAVAARVQFAKAETLMAQPFDAVIFHGDSDKLRTVCEAVAAREGAIV
SVQGFARGESNILLERLYIERSLSVNTAAAGGNASLMTIG
>P94392 ~~~putP~~~High-affinity proline transporter PutP~~~COG0591
MLLIGYFAYKRTSNLTDYMLGGRSLGPAVTALSAGAADMSGWLLMGLPGAMFSTGLSGAWIVIGLCLGAWANWLYVAPRL
RTYTEKAGNSITIPGFLENRFGDQTKLLRLFSGIVILVFFTFYVSSGMVSGGVLFNSILGMDYHTGLWIVTGVVVAYTLF
GGFLAVSWTDFVQGIIMFAALILVPIVTFFHTGGAGDTVAEIRSVDPDMFNIFKGTSVLGIISLFAWGLGYFGQPHIIVR
FMAITSVKEIKRARRIGMGWMILSAVGAVLTGLGGIAYYHQRGMTLKDPETIFIQLGNILFHPIITGFLISAILAAIMST
ISSQLLVTSSSLVEDLYKSMFRRSASDKELVFLGRLAVLAVSIVALVLAWEKNNTILGLVSYAWAGFGASFGPVVLLSLF
WKRMTKWGALAGMIVGAATVIIWANAGLSDFLYEMIPGFAASLLSVFFVSILTQAPSQAVTDQFNDYQDTMSQ
>P07117 ~~~putP~~~Sodium/proline symporter~~~COG0591
MAISTPMLVTFCVYIFGMILIGFIAWRSTKNFDDYILGGRSLGPFVTALSAGASDMSGWLLMGLPGAVFLSGISESWIAI
GLTLGAWINWKLVAGRLRVHTEYNNNALTLPDYFTGRFEDKSRILRIISALVILLFFTIYCASGIVAGARLFESTFGMSY
ETALWAGAAATILYTFIGGFLAVSWTDTVQASLMIFALILTPVIVIISVGGFGDSLEVIKQKSIENVDMLKGLNFVAIIS
LMGWGLGYFGQPHILARFMAADSHHSIVHARRISMTWMILCLAGAVAVGFFGIAYFNDHPALAGAVNQNAERVFIELAQI
LFNPWIAGILLSAILAAVMSTLSCQLLVCSSAITEDLYKAFLRKHASQKELVWVGRVMVLVVALVAIALAANPENRVLGL
VSYAWAGFGAAFGPVVLFSVMWSRMTRNGALAGMIIGALTVIVWKQFGWLGLYEIIPGFIFGSIGIVVFSLLGKAPSAAM
QKRFAEADAHYHSAPPSRLQES
>P10502 ~~~putP~~~Sodium/proline symporter~~~
MAISTPMLVTFCVYIFGMILIGFIAWRSTKNFDDYILGGRSLGPFVTALSAGASDMSGWLLMGLPGAIFLSGISESWIAI
GLTLGAWINWKLVAGRLRVHTEFNNNALTLPDYFTGRFEDKSRVLRIISALVILLFFTIYCASGIVAGARLFESTFGMSY
ETALWAGAAATIIYTFIGGFLAVSWTDTVQASLMIFALILTPVMVIVGVGGFSESLEVIKQKSIENVDMLKGLNFVAIIS
LMGWGLGYFGQPHILARFMAADSHHSIVHARRISMTWMILCLAGAVAVGFFGIAYFNNNPALAGAVNQNSERVFIELAQI
LFNPWIAGVLLSAILAAVMSTLSCQLLVCSSAITEDLYKAFLRKSASQQELVWVGRVMVLVVALIAIALAANPDNRVLGL
VSYAWAGFGAAFGPVVLFSVMWSRMTRNGALAGMIIGAVTVIVWKQYGWLDLYEIIPGFIFGSLGIVIFSLLGKAPTAAM
QERFAKADAHYHSAPPSKLQAE
>Q2FWY7 ~~~putP~~~Sodium/proline symporter~~~COG0591
MLTMGTALSQQVDANWQTYIMIAVYFLILIVIGFYGYKQATGNLSEYMLGGRSIGPYITALSAGASDMSGWMIMGLPGSV
YSTGLSAMWITIGLTLGAYINYFVVAPRLRVYTELAGDAITLPDFFKNRLNDKNNVLKIISGLIIVVFFTLYTHSGFVSG
GKLFESAFGLDYHFGLILVAFIVIFYTFFGGYLAVSITDFFQGVIMLIAMVMVPIVAMMNLNGWGTFHDVAAMKPTNLNL
FKGLSFIGIISLFSWGLGYFGQPHIIVRFMSIKSHKMLPKARRLGISWMAVGLLGAVAVGLTGIAFVPAYHIKLEDPETL
FIVMSQVLFHPLVGGFLLAAILAAIMSTISSQLLVTSSSLTEDFYKLIRGEEKAKTHQKEFVMIGRLSVLVVAIVAIAIA
WNPNDTILNLVGNAWAGFGASFSPLVLFALYWKGLTRAGAVSGMVSGALVVIVWIAWIKPLAHINEIFGLYEIIPGFIVS
VIVTYVVSKLTKKPGAFVETDLNKVRDIVREK
>Q7A4Q7 ~~~putP~~~Sodium/proline symporter~~~
MLTMGTALSQQVDANWQTYIMIAVYFLILIVIGFYGYKQATGNLSEYMLGGRSIGPYITALSAGASDMSGWMIMGLPGSV
YSTGLSAMWITIGLTLGAYINYFVVAPRLRVYTELAGDAITLPDFFKNRLNDKNNVLKIISGLIIVVFFTLYTHSGFVSG
GKLFESAFGLDYHFGLILVAFIVIFYTFFGGYLAVSITDFFQGVIMLIAMVMVPIVAMMNLNGWGTFHDVAAMKPTNLNL
FKGLSFIGIISLFSWGLGYFGQPHIIVRFMSIKSHKMLPKARRLGISWMAVGLLGAVAVGLTGIAFVPAYHIKLEDPETL
FIVMSQVLFHPLVGGFLLAAILAAIMSTISSQLLVTSSSLTEDFYKLIRGEEKAKTHQKEFVMIGRLSVLVVAIVAIAIA
WNPNDTILNLVGNAWAGFGASFSPLVLFALYWKGLTRAGAVSGMVSGALVVIVWIAWIKPLAHINEIFGLYEIIPGFIVS
VIVTYVVSKLTKKPGAFVETDLNKVRDIVREK
>P94393 ~~~putR~~~Proline-responsive transcriptional activator PutR~~~COG2508
MEELLERVFSFSDVDKLIDFISYELQKPVILESADFFLLAYNSYYINHFDSANQQTIFSKKCPVQIFERFLKDGIIEKLK
TEPEPFRVNKIESIGLNQRVVVSAKHKGEVMGYIWIQELDQNLTDEELDFLYETSFHVGKIIYKTNKLKQEKEEKAEDLI
KRAIYQQFTSEKELRREAERINTVLPSMFSVVILHAANGDGEAVEDLKENIRSYLNLRDKVSHVLTIESNIVIVVASFSQ
KSSVSSAASEFINKLLTHFHFQKIPTPIYIGIGNEYNHLLKLGKSYTEALEVIKAAEITGNQENIPYEYAKLGIYRYLES
IEQKNEFLEYENKDLALLKAKDEESSTELLKTLEIYLLNNCKTKPAAEQLFIHQNTLNYRIKQITEMTSIDLSDFRTRCQ
LYLDLMLMKKK
>P00259 ~~~camB~~~Putidaredoxin~~~
MSKVVYVSHDGTRRELDVADGVSLMQAAVSNGIYDIVGDCGGSASCATCHVYVNEAFTDKVPAANEREIGMLECVTAELK
PNSRLCCQIIMTPELDGIVVDVPDRQW
>P78061 6.3.1.11~~~puuA~~~Gamma-glutamylputrescine synthetase PuuA~~~COG0174
METNIVEVENFVQQSEERRGSAFTQEVKRYLERYPNTQYVDVLLTDLNGCFRGKRIPVSSLKKLEKGCYFPASVFAMDIL
GNVVEEAGLGQEMGEPDRTCVPVLGSLTPSAADPEFIGQMLLTMVDEDGAPFDVEPRNVLNRLWQQLRQRGLFPVVAVEL
EFYLLDRQRDAEGYLQPPCAPGTDDRNTQSQVYSVDNLNHFADVLNDIDELAQLQLIPADGAVAEASPGQFEINLYHTDN
VLEACDDALALKRLVRLMAEKHKMHATFMAKPYEEHAGSGMHIHISMQNNRGENVLSDAEGEDSPLLKKMLAGMIDLMPS
SMALLAPNVNSYRRFQPGMYVPTQASWGHNNRTVALRIPCGDRHNHRVEYRVAGADANPYLVMAAIFAGILHGLDNELPL
QEEVEGNGLEQEGLPFPIRQSDALGEFIENDHLRRYLGERFCHVYHACKNDELLQFERLITETEIEWMLKNA
>P37906 1.4.3.-~~~puuB~~~Gamma-glutamylputrescine oxidoreductase~~~COG0665
MTEHTSSYYAASANKYAPFDTLNESITCDVCVVGGGYTGLSSALHLAEAGFDVVVLEASRIGFGASGRNGGQLVNSYSRD
IDVIEKSYGMDTARMLGSMMFEGGEIIRERIKRYQIDCDYRPGGLFVAMNDKQLATLEEQKENWERYGNKQLELLDANAI
RREVASDRYTGALLDHSGGHIHPLNLAIGEADAIRLNGGRVYELSAVTQIQHTTPAVVRTAKGQVTAKYVIVAGNAYLGD
KVEPELAKRSMPCGTQVITTERLSEDLARSLIPKNYCVEDCNYLLDYYRLTADNRLLYGGGVVYGARDPDDVERLVVPKL
LKTFPQLKGVKIDYRWTGNFLLTLSRMPQFGRLDTNIYYMQGYSGHGVTCTHLAGRLIAELLRGDAERFDAFANLPHYPF
PGGRTLRVPFTAMGAAYYSLRDRLGV
>P23883 1.2.1.5~~~puuC~~~NADP/NAD-dependent aldehyde dehydrogenase PuuC~~~COG1012
MNFHHLAYWQDKALSLAIENRLFINGEYTAAAENETFETVDPVTQAPLAKIARGKSVDIDRAMSAARGVFERGDWSLSSP
AKRKAVLNKLADLMEAHAEELALLETLDTGKPIRHSLRDDIPGAARAIRWYAEAIDKVYGEVATTSSHELAMIVREPVGV
IAAIVPWNFPLLLTCWKLGPALAAGNSVILKPSEKSPLSAIRLAGLAKEAGLPDGVLNVVTGFGHEAGQALSRHNDIDAI
AFTGSTRTGKQLLKDAGDSNMKRVWLEAGGKSANIVFADCPDLQQAASATAAGIFYNQGQVCIAGTRLLLEESIADEFLA
LLKQQAQNWQPGHPLDPATTMGTLIDCAHADSVHSFIREGESKGQLLLDGRNAGLAAAIGPTIFVDVDPNASLSREEIFG
PVLVVTRFTSEEQALQLANDSQYGLGAAVWTRDLSRAHRMSRRLKAGSVFVNNYNDGDMTVPFGGYKQSGNGRDKSLHAL
EKFTELKTIWISLEA
>P76038 3.5.1.94~~~puuD~~~Gamma-glutamyl-gamma-aminobutyrate hydrolase PuuD~~~COG2071
MENIMNNPVIGVVMCRNRLKGHATQTLQEKYLNAIIHAGGLPIALPHALAEPSLLEQLLPKLDGIYLPGSPSNVQPHLYG
ENGDEPDADPGRDLLSMAIINAALERRIPIFAICRGLQELVVATGGSLHRKLCEQPELLEHREDPELPVEQQYAPSHEVQ
VEEGGLLSALLPECSNFWVNSLHGQGAKVVSPRLRVEARSPDGLVEAVSVINHPFALGVQWHPEWNSSEYALSRILFEGF
ITACQHHIAEKQRL
>P50457 2.6.1.19~~~puuE~~~4-aminobutyrate aminotransferase PuuE~~~COG0160
MSNNEFHQRRLSATPRGVGVMCNFFAQSAENATLKDVEGNEYIDFAAGIAVLNTGHRHPDLVAAVEQQLQQFTHTAYQIV
PYESYVTLAEKINALAPVSGQAKTAFFTTGAEAVENAVKIARAHTGRPGVIAFSGGFHGRTYMTMALTGKVAPYKIGFGP
FPGSVYHVPYPSDLHGISTQDSLDAIERLFKSDIEAKQVAAIIFEPVQGEGGFNVAPKELVAAIRRLCDEHGIVMIADEV
QSGFARTGKLFAMDHYADKPDLMTMAKSLAGGMPLSGVVGNANIMDAPAPGGLGGTYAGNPLAVAAAHAVLNIIDKESLC
ERANQLGQRLKNTLIDAKESVPAIAAVRGLGSMIAVEFNDPQTGEPSAAIAQKIQQRALAQGLLLLTCGAYGNVIRFLYP
LTIPDAQFDAAMKILQDALSD
>P76037 ~~~puuP~~~Putrescine importer PuuP~~~COG0531
MAINSPLNIAAQPGKTRLRKSLKLWQVVMMGLAYLTPMTVFDTFGIVSGISDGHVPASYLLALAGVLFTAISYGKLVRQF
PEAGSAYTYAQKSINPHVGFMVGWSSLLDYLFLPMINVLLAKIYLSALFPEVPPWVWVVTFVAILTAANLKSVNLVANFN
TLFVLVQISIMVVFIFLVVQGLHKGEGVGTVWSLQPFISENAHLIPIITGATIVCFSFLGFDAVTTLSEETPDAARVIPK
AIFLTAVYGGVIFIAASFFMQLFFPDISRFKDPDAALPEIALYVGGKLFQSIFLCTTFVNTLASGLASHASVSRLLYVMG
RDNVFPERVFGYVHPKWRTPALNVIMVGIVALSALFFDLVTATALINFGALVAFTFVNLSVFNHFWRRKGMNKSWKDHFH
YLLMPLVGALTVGVLWVNLESTSLTLGLVWASLGGAYLWYLIRRYRKVPLYDGDRTPVSET
>P0A9U6 ~~~puuR~~~HTH-type transcriptional regulator PuuR~~~COG1396
MSDEGLAPGKRLSEIRQQQGLSQRRAAELSGLTHSAISTIEQDKVSPAISTLQKLLKVYGLSLSEFFSEPEKPDEPQVVI
NQDDLIEMGSQGVSMKLVHNGNPNRTLAMIFETYQPGTTTGERIKHQGEEIGTVLEGEIVLTINGQDYHLVAGQSYAINT
GIPHSFSNTSAGICRIISAHTPTTF
>P77931 1.1.2.6~~~pvaA~~~Polyvinylalcohol dehydrogenase~~~
MQQNIERNQVSMTTSRFVWGAVMALVALGSASAAELNLPDGAALYRARCGTCHDNPQDRTPARDVIARNSPAFIMAAMNG
VMAPMAAGLSEAEKQAIALHLGARPAGGSQEINPHAIWGPPSASMPLDGPKCKGKIPPIDLSTPDQWNGWGAGITNARFQ
PNPGLTAADVPRLKVKWAFNYPGSKNGQATVVGDRLFVTSMSGAVYALNAKTGCVYWRHDAAAATRSSVHVVQLPAGAPA
QYAIFFSDWTKAAVALDAQTGKQLWKTTIDDQPGVQMTGSPTYHEGKLFVPISSGNEAFATNDQWECCKFRGALVALDAL
SGKVLWKTYTTQKEPAPFRLNKLGKQMWGPAGGSIWSAPTIDPKRGLVYVATSNSYTEVHHEGSDAVMAMEIETGKVRWI
NQVTKDDNYIIGCPRAANCPEKVGPDFALGNSPILHTLQDGRQYIVVGQKSGAVYAMDPDNDGELIWMRRVSPGSELGGV
EFGMAADAENVYVGISDVITRKGGKPGVYALRIRDGADVWAFPAPRTPCRWNNIFCHPAVSQAVTAMPGVVFAGSMDGHF
RAFSTSDGKVLWEFNTAAAPYKTVAGKQADGGVMDGAGPTIAGGMVYVHSGYAGRSTQNAGDLRGREGNVLIAFSVDGK
>Q588Z1 1.1.2.6~~~pvadh~~~Polyvinylalcohol dehydrogenase~~~
MGSHAWGGAVFSAATLIAFGSVVHASGTVAETAPQSGHAVPADQLDGETLYKARCAACHDNAEGRTPSREVLSKNPASFI
LASMRTGAMVPMAEGLTLEEMTAIARAVGKADAKTDDGIDLRRIWGNSVEGTPLDAPQCSSAPTPVDLGAANQWNGWSTE
KDNGRFQRKPALDVADIPKLKLKWAFQYPGSKNGQATVIGDRLFTTSTSGAVYALNAKTGCVYWRHAAEGATRTSPVIAA
LPEGAPAKTALFFSDFTKAAVALDAETGKQLWKTVVDDQPALQMTGSITYWDGKIYVPISSGTEAFAQIPTWECCKFRGA
LVALDAATGKILWKRYTTEQEPRPFKLNKAGRQMWGPSGGAIWVTPTVDEARRLIYVGTSNSYTDVPYDNSDSVMAIDAD
TGAVRWTVQLLADDNYIDGCWQKGKEHANCPNPLGPDFSIGAAPIYRKMADGKEFLLVGQKSGMIYALDPANKGAKIWER
QLSLGSALGGIEFGTAADDGKVYAGVSDIASQAKDRGKPGLWALDIRTGEVAWNFLNAPDTKCRWNNWWCHGAFSQAISV
IPGAIFAGSYDGHFRAFDTATGKIIWDVDTGTKAVTTLSGAKAFGGVMDGAGPTIAGGMVYVHSGYAGRSSESGGRDLRG
TDGNILMAFSVDGK
>Q9I1L5 4.1.99.24~~~pvcA~~~L-tyrosine isonitrile synthase~~~
MYAIAEDTLPARVLKELLLYRRRYPEHRQSASEADEIRRIEQVQLPRIAAFIEAGEPIEFVLPAFPAKSPNPGKVLDSRP
DMAERLSLSFLNHLCQRIQLFYAPGAKITVCSDGRVFGDLVRIGDAHISAYQDALRLMIEEIGATHIGVFNLEDVRAFEA
QRDNHEQLRQLLIGGYAEPLESIRETLLASEEGLLLYRAITRFLYEDGLTPDYQGSKTALQRDAKERAYGVIQRSWAWGA
LLADQFPRAIRLSIHPQPADSLKFGIHMMPTRDDWLTPWHGVAVNTEDRFVLMKRSEVLELGGELVQINGQPSHYRLPAR
AARRAAVA
>D4I2N1 1.14.20.9~~~pvcB~~~Tyrosine isonitrile desaturase~~~COG2175
MNPADLPDTLDVAPLTGETGEPCSFGILIKPCRAGRHIGELSVTWLRALVYSHQLVVLRGFDHFASSDSLTRYCATFGEI
MMWPYGAVLELVEHANPDDHIFANSYVPLHWDGMYLDTVPEFQLFQCVHAAGDMQGGRTTFSSTNAALRIATPAVRELWA
RAHGRYQRSVELYSNTVEAPIIGIHPLREFPVIRFCEPPDENDATFLNPSSYSFGGINKDEEEMLLVSLMKTLRDPRVYY
AHQWQTGDFVLSDNLSLLHGREQYTHHSGRHLRRVHIHGRPQIANHHLVRSE
>Q9I1L4 1.14.20.9~~~pvcB~~~Tyrosine isonitrile desaturase~~~
MNAYLSDQPVRLSPLRDEQGNQPRFGLLLEPGRPGMHVGELPAQWLKGLARSHHLLLLRGFAAFADAESLTRYCHDFGEV
MLWPFGAVLELVEQEGAEDHIFANNYVPLHWDGMYLETVPEFQVFHCVDAPGDSDGGRTTFSSTPAALQLADSSELELWR
RASGRYQRSAAHYSSRSAAPIVERHPRREFPILRFCEPPVEGDASFINPSEFHYDGIAPEQRGELLASLRRCLYHPQAHY
AHRWRSDDLVIADNLTLLHGREAFAHRAPRHLRRVHIHAEPALRNPHLQRD
>D3V9Q5 1.14.20.10~~~pvcB~~~Tyrosine isonitrile desaturase/decarboxylase~~~COG2175
MNTYEIDCHVELVKPFGLLITPNYPEQDINSLPVDALRKLAQDHLLVILRGFQSGFTDKEKLTEYTRHWGELMTWPFGVV
LDVMEQSIPSDHVLDSSYIPLHWDGMYREAIPEFQIFHCVSAPEAAQGGRTTFVNTEQLILDASEDEFNTWKNTTITYRT
KKVTHYGGEVVSPLVCLHPKGNKWVIRYNEPMHQEDKYADHHSVTIQGLLSEEQKAFEETLYNRLYDPRYFYAHQWQSGD
MVISDNFSLLHGREAFISRSPRHLQRVHVHGMPVCENNSFRTISDSNSIDSTVKEV
>Q51548 1.14.13.195~~~pvdA~~~L-ornithine N(5)-monooxygenase~~~
MTQATATAVVHDLIGVGFGPSNIALAIALQERAQAQGALEVLFLDKQGDYRWHGNTLVSQSELQISFLKDLVSLRNPTSP
YSFVNYLHKHDRLVDFINLGTFYPCRMEFNDYLRWVASHFQEQSRYGEEVLRIEPMLSAGQVEALRVISRNADGEELVRT
TRALVVSPGGTPRIPQVFRALKGDGRVFHHSQYLEHMAKQPCSSGKPMKIAIIGGGQSAAEAFIDLNDSYPSVQADMILR
ASALKPADDSPFVNEVFAPKFTDLIYSREHAERERLLREYHNTNYSVVDTDLIERIYGVFYRQKVSGIPRHAFRCMTTVE
RATATAQGIELALRDAGSGELSVETYDAVILATGYERQLHRQLLEPLAEYLGDHEIGRDYRLQTDERCKVAIYAQGFSQA
SHGLSDTLLSVLPVRAEEISGSLYQHLKPGTAARALHEHALAS
>Q9I194 3.5.1.97~~~pvdQ~~~Acyl-homoserine lactone acylase PvdQ~~~
MGMRTVLTGLAGMLLGSMMPVQADMPRPTGLAADIRWTAYGVPHIRAKDERGLGYGIGYAYARDNACLLAEEIVTARGER
ARYFGSEGKSSAELDNLPSDIFYAWLNQPEALQAFWQAQTPAVRQLLEGYAAGFNRFLREADGKTTSCLGQPWLRAIATD
DLLRLTRRLLVEGGVGQFADALVAAAPPGAEKVALSGEQAFQVAEQRRQRFRLERGSNAIAVGSERSADGKGMLLANPHF
PWNGAMRFYQMHLTIPGRLDVMGASLPGLPVVNIGFSRHLAWTHTVDTSSHFTLYRLALDPKDPRRYLVDGRSLPLEEKS
VAIEVRGADGKLSRVEHKVYQSIYGPLVVWPGKLDWNRSEAYALRDANLENTRVLQQWYSINQASDVADLRRRVEALQGI
PWVNTLAADEQGNALYMNQSVVPYLKPELIPACAIPQLVAEGLPALQGQDSRCAWSRDPAAAQAGITPAAQLPVLLRRDF
VQNSNDSAWLTNPASPLQGFSPLVSQEKPIGPRARYALSRLQGKQPLEAKTLEEMVTANHVFSADQVLPDLLRLCRDNQG
EKSLARACAALAQWDRGANLDSGSGFVYFQRFMQRFAELDGAWKEPFDAQRPLDTPQGIALDRPQVATQVRQALADAAAE
VEKSGIPDGARWGDLQVSTRGQERIAIPGGDGHFGVYNAIQSVRKGDHLEVVGGTSYIQLVTFPEEGPKARGLLAFSQSS
DPRSPHYRDQTELFSRQQWQTLPFSDRQIDADPQLQRLSIRE
>Q9HVR0 3.5.2.9~~~pxpA3~~~5-oxoprolinase subunit A 3~~~
MNDTGRRILLNCDMGESFGAWRMGDDVHSMPLVDQANLACGFHAGDPLTMRRAVELAVRHGVSIGAHPAYPDLSGFGRRS
LACSAEEVHAMVLYQIGALDAFCRSLGTQVAYVKPHGALYNDLVGDDELLRAVLDACAAYRKGLPLMVLALADNGRELEL
ADEADVPLLFEAFADRAYLPDGRLAPRRLGGAVHHDPQRIIEQALAIARGEAFPDYDGNPLRLTADSLCVHGDNPQSLAV
LRRLRAALDSL
>P42963 3.5.2.9~~~pxpA~~~5-oxoprolinase subunit A~~~COG1540
MFQIDLNCDLGESFGAYKIGLDQDILEYVTSANIACGFHAGDPSVMRKTVALAAERGVKMGAHPGLPDLLGFGRRNMAIS
PEEAYDLVVYQIGALSGFLKAEGLHMQHVKPHGALYNMAAVDQKLSDAIAKAVYKVDPGLILFGLAESELVKAGERIGLQ
TANEVFADRTYQSDGTLTPRSQPDALIESDDAAVTQVIKMVKEGAVKSQQGHDVSLKADTVCIHGDGAHALTFAQKIRKQ
LKAAGIEVTAISEQRST
>P75746 3.5.2.9~~~pxpA~~~5-oxoprolinase subunit A~~~COG1540
MKIDLNADLGEGCASDAELLTLVSSANIACGFHAGDAQIMQACVREAIKNGVAIGAHPSFPDRENFGRSAMQLPPETVYA
QTLYQIGALATIARAQGGVMRHVKPHGMLYNQAAKEAQLADAIARAVYACDPALILVGLAGSELIRAGKQYGLTTREEVF
ADRGYQADGSLVPRSQSGALIENEEQALAQTLEMVQHGRVKSITGEWATVAAQTVCLHGDGEHALAFARRLRSAFAEKGI
VVAA
>P45347 3.5.2.9~~~pxpA~~~5-oxoprolinase subunit A~~~COG1540
MKKIDLNADIAEGFPFDESLLQLLSSANVACGLHAGGAKEMQSAVKFAKENKVRIGAHPSFPDRENFGRTAMALSSQELI
AHLRYQLGALKAICDGEGAVISYVKPHGALYNQAAKDEKIARLIAQTVYQFDPNLKLMGLAGSLMLRIAEEEKLQTISEV
FADRHYMPDGSLVPRSQPNAMVESDKEAIQQVLQMVTKGQVNAIDGSLVPVKAESICLHGDNQHSLQFAKRIVEELEKNH
IKITA
>Q53WG6 3.5.2.9~~~pxpA~~~5-oxoprolinase subunit A~~~
MKVDLNADAGESYGAFAYGHDREIFPLVSSANLACGFHGGSPGRILEAVRLAKAHGVAVGAHPGFPDLVGFGRREMALSP
EEVYADVLYQIGALSAFLKAEGLPLHHVKPHGALYLKACRDRETARAIALAVKAFDPGLPLVVLPGTVYEEEARKAGLRV
VLEAFPERAYLRSGQLAPRSMPGSWITDPEEAARRALRMVLEGKVEALDGGEVAVRADTLCIHGDNPNAPEVARAVREAL
EQAGVEVRAF
>P60495 3.5.2.9~~~pxpB~~~5-oxoprolinase subunit B~~~COG2049
MTVRYQIEQLGDSAMMIRFGEEINEQVNGIVHAAAAYIEEQPFPGFIECIPAFTSLTVFYDMYEVYKHLPQGISSPFESV
KRDVEERLAEIAEDYEVNRRIVEIPVCYGGEFGPDLEEVAKINQLSPEEVIDIHTNGEYVVYMLGFAPGFPFLGGMSKRI
AAPRKSSPRPSIPAGSVGIAGLQTGVYPISTPGGWQLIGKTPLALFRPQENPPTLLRAGDIVKFVRISEKDYHAYKEESN
>P0AAV4 3.5.2.9~~~pxpB~~~5-oxoprolinase subunit B~~~COG2049
MQRARCYLIGETAVVLELEPPVTLASQKRIWRLAQRLVDMPNVVEAIPGMNNITVILRNPESLALDAIERLQRWWEESEA
LEPESRFIEIPVVYGGAGGPDLAVVAAHCGLSEKQVVELHSSVEYVVWFLGFQPGFPYLGSLPEQLHTPRRAEPRLLVPA
GSVGIGGPQTGVYPLATPGGWQLIGHTSLSLFDPARDEPILLRPGDSVRFVPQKEGVC
>Q7WY77 3.5.2.9~~~pxpC~~~5-oxoprolinase subunit C~~~COG1984
MKVLKPGLLTTVQDIGRTGYQKYGVLASGAMDTVSLRIANLLIGNGENEAGLEITMMGPGPSFHFSKQTLIAVTGADFTL
RINDEEAPLWKPVLIKENSTVSFGPCKLGSRAYLAAAGGIEVPAVMESKSTYVRGSIGGLHGRALQKEDELNIGEMSALS
QTILSRLSSQLGKQGFAAPKWSVSRGRFLPLKKNPVIRVLEGKQFAFFTEESKTRFYEEAFRVTPQSDRMGYRLKGEPLE
LKAPLEMVSEAVSFGTVQVPPDGNPIILLADRQTTGGYPRIAHIISADLPIVSQIMPGEHVQFEPVSLQEAEALAVEREQ
HIKELKTRMKMEWLT
>P75745 3.5.2.9~~~pxpC~~~5-oxoprolinase subunit C~~~COG1984
MLKIIRAGMYTTVQDGGRHGFRQSGISHCGALDMPALRIANLLVGNDANAPALEITLGQLTVEFETDGWFALTGAGCEAR
LDDNAVWTGWRLPMKAGQRLTLKRPQHGMRSYLAVAGGIDVPPVMGSCSTDLKVGIGGLEGRLLKDGDRLPIGKSKRDSM
EAQGVKQLLWGNRIRALPGPEYHEFDRASQDAFWRSPWQLSSQSNRMGYRLQGQILKRTTDRELLSHGLLPGVVQVPHNG
QPIVLMNDAQTTGGYPRIACIIEADMYHLAQIPLGQPIHFVQCSLEEALKARQDQQRYFEQLAWRLHNEN
>Q7CVK1 1.5.1.49~~~~~~Delta(1)-pyrroline-2-carboxylate reductase~~~COG2055
MSDTTTLTIEALFQRVESIFLRAGLNAVQSGALARVITAGERDACKSHGIYRIEGALRTVKAAKVKPDAIPEVAEDDGTA
IVKVNAMGGFANPAFELGLPALAERAKRLGLAALVINDCTHFSALWPEVEGLTSNGLAGLVMCPSYSTVAPTGGTKPLLG
TNPFAFGWPRKDTSPYVFDFATSVAARGEIELHRRARKSLPEGWAVDADGNPTTDPEAALAGAMLPFGGHKGSAIGTMIE
LLAGIMIGDLTSPEVLDYLGTTTLAPFHGELIVAFSPEAFAKGRPGDPFQRAEVLFEAIIGQGARLPSGRRFAARAKSES
EGITLTAAEMAGLDRLLEKGLDAVS
>V5YW53 1.5.1.1~~~lhpI~~~Delta(1)-pyrroline-2-carboxylate/Delta(1)-piperideine-2-carboxylate reductase~~~
MTALSPIPVFDAADTAALLAYPALLATLGQAVADYAAGEIVSPERLVVPLQAGGVMLSMPSSARDLATHKLVNVCPGNGA
RGLPTILGQVTAYDASTGEMRFALDGPTVTGRRTAAVTALGIQALHGAAPRDILLIGTGKQAANHAEALAAIFPEARLHV
RGTSADSAAAFCAAHRAQAPRLVPLDGDAIPDAIDVVVTLTTSRTPVYREAAREGRLVVGVGAFTADAAEIDANTVRASR
LVVDDPAGARHEAGDLIVAQVDWQHVASLADVLGGTFDRSGPLLFKSVGCAAWDLAACRTARDALAARRAG
>Q73CR9 1.5.1.49~~~~~~Delta(1)-pyrroline-2-carboxylate reductase~~~
MLVISANEQRNLVNMNEVIEYAALALKEFSAERTITPIRDSLPFANEQNTALIMPSVAEGLEALGLKVVTVVPENKKIGK
KTINGIVMLSDFQTGEPLALLEGSYLTMIRTGALSGVATKHLARHNAKTLCIIGTGEQAKGIAEAVFAVRDIEKVILYNR
TEEKAYAFSQYIQEKFNKPAYVYTSANEAISEADIIATTTNASTPVFSKKLQKGVHVNAVGSFRPSMQELPSHAIANATK
VVVESKEAALEETGDLQVPIQEGLFKSSDIHAELGQIISGEKAGRESDEEVTVFKSVGLAVVDIIVAKYLYERAVERGVG
ERIEF
>Q81HB0 1.5.1.49~~~~~~Delta(1)-pyrroline-2-carboxylate reductase~~~
MLVISANEQRNLVNMNEVIEYAALALKEFSAERTITPIRGSLPFANEQNTALIMPSVAEGLEALGVKIVTVVPQNKQIGK
KTINGIVMLSDFQAGEPLALLEGSYLTMIRTGALSGVATKYLARHNAKTLCIIGTGEQAKGIAEAIFAVRDIEKVILYNR
TEEKAYAFAQYIQEKFGKPAYVYKDPNEAVREADIIVTTTNATTPVFSEILQKGVHVNAVGSFRPSMQELPSHAIAKANK
VVVESKEAALDETGDLQVPIKEGLFKANAIHAELGQIISGEKAGRENDEEITIFKSVGLAVVDIIVAKYLYERALEQGVG
NKIEF
>Q63FA5 1.5.1.49~~~arcB~~~Delta(1)-pyrroline-2-carboxylate reductase~~~
MLVISANEQRKLVNMNEVIAYAALALQEFSAERTITPIRTSLPFANEQNTALIMPSVAEGLEALGLKVVTVVPENKKIGK
KTINGIVMLSDFQTGEPLALLEGSYLTMIRTGALSGVATKHLARHNAKTLCIIGTGEQAKGIAEAVFAVRDIEKVMLYNR
TEEKAYAFAQYIQEKFGKPAYVYANANEAISEADIIVTTTNASTPVFSEKLQKGVHINAVGSFRPNMQELPSHAIANANK
VVVESKEAALEETGDLQVPVREGLFEASDIHAELGQIISGEKAGRESDEEITVFKSVGLAVVDIIVAKYLYEKAVERGVG
ERIEF
>Q6HMS8 1.5.1.49~~~arcB~~~Delta(1)-pyrroline-2-carboxylate reductase~~~
MLVISANEQRNLVNMNEVIAYAALALKEFSAERTITPIRGSLPFANEKNTALIMPSVAEGLEALGLKVVTVVPENKKIGK
KTINGIVMLSDFQTGEPLALLEGSYLTMIRTGALSGVATKHLARHNAKTLCIIGTGEQAKGIAEAVFAVRDIEKVILYNR
TEEKAYAFSQYIQEKFGKPAYVHTNANEAISEADIIVTTTNASTPVFSEKLQKGVHVNAVGSFKPSMQELPSHAIVGANK
VVVESKEAALDETGDLQVPIKEGLFKANAIHAELGQIISGEKAGRENDEEITVFKSVGLAVVDIIVAKYLYEKAVESGVG
NKIEF
>Q485R8 1.5.1.49~~~lhpI~~~Delta(1)-pyrroline-2-carboxylate reductase~~~COG2423
MKIISAEQVHQNLNFEELIPLLKQSFSRPFSMPQRQVYSLAPEQSENHDAFALLPSWNEEVIGNKAFTYFPDNAKKHDLP
GLFSKIMLFKRQTGEPLALVDGTSVTYWRTAAISALASQLLSRKNSQHLMLFGTGNLASYLVKAHLTVRDIKQVTLWGRN
AKKVSKLIADFSILYPAVTFKTSVDVNAEVASADIICCATGAKTPLFDGNSVSAGCHIDCLGNHMTDARECDTTTILRAR
VFVDSLTNTLNEAGELLIPMAEDAFNKDEIVGELADMCKTPSMLRQSSDEITLFKSVGTAISDLVAAHSVVEKLAD
>A1B196 1.5.1.49~~~~~~Delta(1)-pyrroline-2-carboxylate reductase~~~COG2423
MARKSSAPQFLSYGDATGRLSWRDAVEALRQGHTLPQAQIRDVFLGPPTGTMMSRSAWIEGLGYGAKTFTVFDGNAARGL
PTVQGAMLVFDKDDGRLQAIVDSPLVTEFKTAADSVLGASLLARPDSRHLLIVGAGTVAASLVRAYTAVLPGIERVSVWA
RRPQQAQDLIEGLDGIEADLAAVSDLPAAVGQADIVSSATMARQPVILGAWVRPGTHVDLIGAFKADMREADDALMARAA
LFVDSRETTLGHIGELMLPIASGAITAESVLGDLYDLVRPGARRRQSEDEITVFKNGGGAHLDLMIASYIARVMAG
>Q9I492 1.5.1.21~~~lhpD~~~Delta(1)-pyrroline-2-carboxylate/Delta(1)-piperideine-2-carboxylate reductase~~~
MIRMTLDEVRELAVRILRRHAFSEAHVQAVADTLVAGERDECASHGIWRLLGCIATLKAGKVSADAEPELHDIAPGLLRV
DAHGGFSQCAFRLGLPHLLEKARSQGIAAMAVNRCVHFSALWVEVEALTEAGLVALATTPSHAWVAPAGGRKPIFGTNPI
AFGWPRPDGPPFVFDFATSAVARGEIQLHERAGKPIPLGWGVDEQGEPTTDASAALRGAMLTFGGHKGSALAAMVELLAG
PLIGDLTSAESLAYDEGSRSSPYGGELLIAIDPRRMLGASAEEHLARAETLFEGIVEQGARLPSQRRFEARERSARDGVT
IPEALHRELLALLE
>Q4KGT8 1.5.1.49~~~~~~Delta(1)-pyrroline-2-carboxylate reductase~~~COG2055
MSEYITLSLDEVCALSYQVLTRHGLSDAHARAIAEVITQGQRDECHSHGVYRLLGCVRSVREGRIDPRAEPSLRHVSPGV
LEVDAHYGYSLLGFHTGLPILAEKARSQGIAAMVIKRCFHFSALWPEVEAIADYGLVGMAMNPSHSWVAPAGGRQPVFGT
NPLAFAWPRPGGQPFVFDFATSAIARGDIELHARQGKPIPEHWAIDADGQPTTDAKAALQGAMQTFGGHKGSALAAMIEL
LAGALIGDLTSAESMAFDGGVGATPCHGELVLAFDPRVFLGEGYEQGLERAEGLFAAIARQGARLPSQRRFAARARSLEH
GVQIPRGLLEDIRGLL
>Q5FB93 1.5.1.21~~~dpkA~~~Delta(1)-pyrroline-2-carboxylate/Delta(1)-piperideine-2-carboxylate reductase~~~
MSAPSTSTVVRVPFTELQSLLQAIFQRHGCSEAVARVLAHNCASAQRDGAHSHGVFRMPGYVSTLASGWVDGQATPQVSD
VAAGYVRVDAAGGFAQPALAAARELLVAKARSAGIAVLAIHNSHHFAALWPDVEPFAEEGLVALSVVNSMTCVVPHGARK
PLFGTNPIAFAAPCAEHDPIVFDMATSAMAHGDVQIAARAGQQLPEGMGVDADGQPTTDPKAILEGGALLPFGGHKGSAL
SMMVELLAAALTGGHFSWEFDWSGHPGAKTPWTGQLIIVINPGKAEGERFAQRSRELVEHMQAVGLTRMPGERRYREREV
AEEEGVAVTEQELQGLKELLG
>Q4U331 1.5.1.21~~~dpkA~~~Delta(1)-pyrroline-2-carboxylate/Delta(1)-piperideine-2-carboxylate reductase~~~
MSASHADQPTQTVSYPQLIDLLRRIFVVHGTSPEVADVLAENCASAQRDGSHSHGIFRIPGYLSSLASGWVDGKAVPVVE
DVGAAFVRVDACNGFAQPALAAARSLLIDKARSAGVAILAIRGSHHFAALWPDVEPFAEQGLVALSMVNSMTCVVPHGAR
QPLFGTNPIAFGAPRAGGEPIVFDLATSAIAHGDVQIAAREGRLLPAGMGVDRDGLPTQEPRAILDGGALLPFGGHKGSA
LSMMVELLAAGLTGGNFSFEFDWSKHPGAQTPWTGQLLIVIDPDKGAGQHFAQRSEELVRQLHGVGQERLPGDRRYLERA
RSMAHGIVIAQADLERLQELAGH
>D7A0Y0 1.5.1.49~~~~~~Delta(1)-pyrroline-2-carboxylate reductase~~~COG2055
MDEPVRLSLAEVHVLCRDTLVAAGLGEEHAQAIARSITRAEADECHSHGLYRLIGYVASVRSGKAERHALPALARATPAV
LRVDAKHGFAPLAVETGVPALIAAAKEIGIAALAIHDCYHFSALWADIEPAVEAGLAAWCFTVGQCCVAPAGGTTPLLGT
NPFAFGWPGPSGRPFIFDFATSAAARGEIELKRRGGEKIPPGWAVGPDGAPTTDPAAALAGALLPFGGHKGSALSMMVEL
IAGPLIGDLTSRQSKAVENGDGGPPLGGELFIAIDPAVFGTGNLSSRLADADELFALAKAQPGVRLPSERRYQARERSRT
NGIAVPAALFAELQALGPRGS
>P20116 ~~~apcC~~~Phycobilisome 7.8 kDa linker polypeptide, allophycocyanin-associated, core~~~
GRLFKITACVPSQTRIRTQRELQNTYFTKLVPYENWFREQQRIQKMGGKIVKVELATGKQGINTGLA
>P80558 ~~~apcC~~~Phycobilisome 7.8 kDa linker polypeptide, allophycocyanin-associated, core~~~
MSRLFKITALVPSLSRTRTQRELQNTYFTKLVPYENWFREQQRIQKAGGKIIKVELATGKQGTNAGLQ
>Q01950 ~~~apcC~~~Phycobilisome 7.8 kDa linker polypeptide, allophycocyanin-associated, core~~~
MRMFRITACVPSQTRIRTQRELQNTYFTKLVPYDNSFREQQRIMKMGGKIVKVELATGRPGTNAGLA
>P50036 ~~~apcC~~~Phycobilisome 7.8 kDa linker polypeptide, allophycocyanin-associated, core~~~
MRMFKITACVPSQTRIRTQRELQNTYFTKLVPYENWFREQQRIQKMGGKIVKVELFTGKPGVNTGLA
>P0DV24 4.6.1.6~~~pycC~~~Cytidylate cyclase~~~
MSFKDVTAKNFKGLKNVSLKKSMAMEGHTLVGTEARLGDAFELCESFSTSPSNIIEYEYQEEIRPFFQKAGLNKHSIGTH
PELTGLGVGMIYNQYTVTMFVDIRKSSRLSLLLPLEQVYVVKNRILQACIDIVRALDGYPHRLMGDALMAFFGRSDVSKE
DAIADAINAASTLRLILMDYIFPSLNEDIGEQIDLGVRIGLDYGAEDEVVWGNFGLGSFCEVTALGLPVDMTAKLQQLAD
KNTAMLGQGILDYIDFPEEYTKPKVKSGEELKYIIPNITNKEGQPINRRIRLLNMARYQELLPFKLNDKKMASAILYPNQ
FNFECFVIEDNKEVLYNSVSRFLPKKRRLTFKLSIYPGPGIGDLKIIFCKRNHGQEAKDDLSEDYSISIEDNKLIRVKNA
DNLSLLRKDGCYVLTVPEETLFRGLHTMEVIVRGNHETLFYRNIIGVYIK
>P0DV26 4.6.1.6~~~pycC~~~Cytidylate cyclase~~~
MSIKNLYKNLNDEMGSISKRRKNNNISFNKSTFDSHAMDALNTSFENYEDISLEGFESYINESASRTVIERNSLVPLKDY
DAVHSLRNAFGKPPRDNEPRIGTHPEFKHLELDNRTDTGYVVTMFMDIIGSTKLGLSYSPSDVFLFKNNIITGAIETINA
FDGHVHRIMGDAVMAFFRSRQNEQHDTLENSVIDAINCAAYFIEVMNEIVKPQIKEVADENIGIRIGIDLGETNYVLWGN
YGIPGVNEVTATSFFVDIASKLQHKAPKNSIMLGQNLVEKLGLTVNDYLTYKLKDGQPDRYIIDFTSKNQSRLRYKQYLL
NQSKYFSILPHGLKPSRIKVVISYSNDELGLVNRKDYFNCSSVIPKGKWVKFHATFCEEYGEHYESLKFKFRVVNNGLDA
SKKDNYDNHETEIIKKAYEKENGVFTAIHKEQTSYKGLQHMYISVISNDTVIEREIPCSIFIK
>A0A0J5ZXG5 4.6.1.26~~~pycC~~~Uridylate cyclase~~~
MALADDLKKWVGETFTGKWEVQETTSVPNPEDLRLNSNHAKDLKAATVLYADLDGSTDMVNTKKWQFSAQIYKTFLKCAS
DIIRDEGGNITAYDGDRVMAVFTGNSKNTSAARCALKINSAVLDIIQPAIAKKWQTDFVLRHVVGIDTSQLRTARIGIRG
DNDLVWIGRAANYAAKLTNLAGKPTRITADVYNKLADKLKYANGVDMWAPEHWDDMGIWTYTSTWKWTV
>P0DV42 4.6.1.26~~~pycC~~~Uridylate cyclase~~~
MRSSHKSFDFENSLKRIDSILNDSTSYEESDDIPDIDDLTFNNGKYVNCAAIFIDLRGSTDLIKTLGKLSKSLARLYRAY
ISEMVAIVNSFKTCKEINIVGDCVSAMFAGDIEGAESPVIEALQASSMANAMMNVLNVKYKKKWKDFVELKAGIGVAYGR
ALVIKAGFSGSGIKDLVYMGDVVNKASKMCGLAYKEYTSHAICVTKEVYENAGKYIANEEKKLTYQDFLTEKNHNTFGSV
YVGNFHRVYINNWAEENK
>A0A1C5G2V9 4.6.1.2~~~pycC~~~Uridylate cyclase~~~
MTEVDLKALLADVDGDVATELASKPEVIDKGHELDISTLPIQARKWHKLRDAVAVVADLKSSTQLGLNKHAASTASIYEA
ATGGVVQIFDEFDANFVAIQGDGAFALFWGDKRRQRAVCAGITIKTFSFKHLVPRLEKKWDGLPETGLKVGLGSSPLLVK
RVGVPRTEHQEPVWAGRAVNYAAKAAQQADRHEMVVTGTIWDWVSDNDFLAVTCSCSNPNPDLWSNITIEKIPDGDGDRE
GKRLTSSWCDVHGPEYCAAVLEGKKRRADVTTQRTSALAAEMKSWVRNKAAQDRKNRLARYQGLH
>A0A4V2JTK3 4.6.1.2~~~pycC~~~Uridylate cyclase~~~
MSDEDLFLTALLDSLGKEVNTVLNNSPIKVEEKDGDFKAEDIPSPSSDTWVKLPEVVAVVCDLKGSTHLGTGKHDTSTAR
IYKSGVEGAVRVFHEFGANFIDIQGDGGFGLFWGERAHERALCAGVTIRTFSEEFVERLEKRWPEGLPETGYKVGIHAAR
TLVKRIGTKREISEQEAVWAGRPVNYAAKCAQSADRHQVIITQAVWDKMKDNDFIAFSCDCGDGPTANLWTDVTVDRLPE
EDRDAVVLNSPWCKTCGPAFCEAIMAGEKKRDIPSAVRTGINRMKMQKALEAKRLRDSSRNSALRGVR
>P0DV40 4.6.1.26~~~pycC~~~Uridylate cyclase~~~
MSKSWNHDRAAKHIDQKIADVEEITIKDYVRDMSLESIPTSTAYRVDGVHMYADIMNLEDMLNITAVEGTECHKRTLRFL
DQHYRAVKRILNKVDARRVDFHSQRLHSLFTKPYNTESGAETKRVQRAVATAQLIIDVLAETGDDDEQIPAAKVRIGIDT
GLALAVNNGRSGYREPLFLGDPANHAAKLASNNKARGIYLTNNARKAIGLAESDEPEKSALTAIEIKACQDAAKLDVTSD
EIVEEWREDLKKNPIGGYQFSRQTPPLRDMDIYSLTPANSKRQEMVSLYADIDGFTAYVADHINEKTDDVVRTLHVIRSE
LERVVTSDFEGRRVRFIGDCVQALSCDGTAHTTDEEKSVSEATRLAGALRSSFNLAIERLNAEGIETGDLGLAIGFDLGP
IAVTRLGAKGNRVRCAIGRSVIESEKRQCACSGVETAIGQVAYDAASKAVQNLFGKSRKTSHLDYNEATEALADDGDASA
KQARSEAYAGSAAIIRADERQVQPHSRQKVDGSR
>A0A4R2TZQ0 4.6.1.26~~~pycC~~~Uridylate cyclase~~~
MSWSRHKSLQRIRSFRASAPAAAINLSNFDLSYMTARATTIQARRKAGGKSEPLIFDVPPDSAVLVEGVHVYIQLLDFAS
AMTERERETEASHRRVLSMLHLNYAACDQVAEEFEAQRVDFHGARMHAVIVSPPGPGNERDRAERALAFADAAKRAIEEV
GRTTENGRYSTRVRVGIDSGSAVAVNSGTQDEREPLFLGAPANYAAKLAEGDEEGVFMSNRIRKDLGLPQLSSFDTLAAE
RASRTSSVGETGLSANTSFQSKRLSDAAIMTAASRARNSFILNVGTDANFSFHRHTPPLSTIDFALLTPSNSVRMGLMSI
FGDIDGFTKYVDECIAAQRIGEMVSNLHVIRSELAATLSQDFLGRKVRFIGDCIHGLLATGTSYETDASGSVVASVKAAG
GMRSSFELCQEELGGIENLGIAIGLEYGETPITRIGIRGDRSVRCSVSRAVSRSEELQGGCTGDQTALGPTALGHAPTSI
RRLFAGGVAMGLDAGSVDEHLGSPPIVRSGEVSAAAAPYDSGE
>A0A1V0HUX5 4.6.1.26~~~pycC~~~Uridylate cyclase~~~
MSINDDISSDVMRIRDADWNKRSGSKVPSPGDVTLSNGAVEIDATYLYADMANSSRMAKELDRRVTAKILKSFLASSSRL
ISHFGGTIMSFDGDRVMGAFMGDAKNSSAIKCSFSIAYSVTQLIRPKFESKYDTVKNAGFKIRHATGVDTGTVFVVRGGI
YGSNELISIGRAPNLAAKLSDLREGEYTTFATKSVYDRTNKLQKQRLDGSSDIWEKRDWDFCDENITIYRSSYWRKPGSN
>P0DV38 4.6.1.6~~~pycC~~~Cytidylate cyclase~~~
MVNHIRIFDNLFQSNISKFQNLTSKSYIIRNDNEKNSYLPMVQEIRELFGKEGEIFSKSIGTHPDFFGIENTNEYQYICS
LFVDISGSTKLALKYSLDKVKLYKNAIISSAIEIFRAFDGHIHRIQGDAVLVYFGHKELEKSDAIINAINAASLMQYFNA
TTLKKFFESENLEPLKIRIGIDFGDDSSVLWSKYGIDGINEITSTSIHTDLASKFQNKAPSNKIMIGENINKYLDIPKKF
RSIKIEKNNGVDVEKRYILNTNNLGRYSMEVFEWEKYLNSFSMLPPFSTENEQFYSPRDLKIRCWIIDEKNQDKYEYIER
GSALKKEMNLLFKLEIYNQCLEFKNIKWRVVNYGEEAKKDKELEFEMNQYEGYQYCNQKTAYTGLHFMECYLYDINDKII
CHDSFGLFINDNNREVRKLGIED
>A0A1T4LJ54 4.6.1.26~~~pycC~~~Uridylate cyclase~~~
MKNDNGQQLSLTQALDLLKSRQGKNFKTADKIIENFCFYDYSNIIMKYDYKSGKKRILNILNSELDVENLNELPCDEKLT
FSNAYYSWVTAIFVDIRKSTELFTNENKKDVSRLIRSFTSEIIEIINQGDNLREIGIRGDCVYGIFTTPKKSQINEVFDM
ACYVNTMIKMLNKLLIKEDIPQIMVGIGVSSAQELVVKAGRKGSGVNNKVWIGDAVTKAANMSGKGNKNHNLPIIISELT
YKNLNDHNKGLMSSRKYNDDLDYYYDCDLIISAFNDWINKGMLDNE
>P0DV28 4.6.1.26~~~pycC~~~Uridylate cyclase~~~
MGLKDELTTFCHDVFNGNWETTEGKNVPDEDSRLTLKNTAITIDGTVLYADLDGSTAMVDGYKNWFAAEIYKTYLYCCAR
IIAAEGGVVTAYDGDRVMALFIGERKNTRAARAAMKIKWAVDEIIMPKKDARYTSNKFALKHVTGIDTCSLFVAKTGARG
ANDLVWVGRAANYAAKLTSLPSTYTYITESVYKMLADEAKTSNGKSMWEKVTWNTFNNSTIYRSNWRWRID
>Q0B9S2 1.5.1.49~~~~~~Delta(1)-pyrroline-2-carboxylate reductase 1~~~COG2055
MSESLAEVVLSLDEVHALALRVLTHNGMSAAHAQAIANVITQGQRDECHSHGVYRLLVCVRSLKKGKVDPQAVPTLRRLS
SSIVAVDAHRGFSLLSFETGLPVLVEMAKQHGIAAMAINHCYHFSALWPEVEAIAAEGLVGIAMNPSHSWVAPEGGREPV
FGTNPIAFAWPRPDGVPFVFDFATSAIARGDIELHAKQGKAIPPHWAIDADGQPTTDPKAALQGAMRTFGGHKGSALAAM
VELLGGALIGDMTSRESMAFDEGVGATPCHGELVIAFDPKVFLGDELDAGLARGERMFASITGQGARLPSQRRFDARARS
IAHGVRIPKALYDEILTLLD
>A9AKH1 1.5.1.49~~~ocd~~~Delta(1)-pyrroline-2-carboxylate reductase 1~~~COG2423
MTALSRTPVFDAADTAALLDYPALLATLARTVADYAAGEIVSPERLVVPLQAGGVMLSMPSSAHDLAIHKLVNVCPGNAA
RGLPTILGQVIACDATTGEMRFVLDGPTVTGRRTAAVTALGVQALHGTPREILLIGTGKQAANHAEAFTALFPDARLHVR
GSRAASAAEFCAAHRAHAPQLMPLDGDAIPDAIDVVVTLTTSRTPVYRDAAREGRLVVGVGAFTADAAEIAADTVRRSRL
VVDDPAGARHEAGDLIVANVDWQQVASLADVLNGTFARGGPMLFKTVGCAAWDLAACRTARDALAAREQR
>Q0B953 1.5.1.49~~~~~~Delta(1)-pyrroline-2-carboxylate reductase 2~~~COG2423
MTALSRIPAFDAAETAALLDYPALLATLTHTVAEYAAGEIVSPERLVVPLQGGGVMLSMPSSARDLASHKLVNVCPGNAA
RGLPTILGQVSAYDATTGEMRFVLDGPTVTGRRTAAITALGIQALHGAAPRDILLIGTGKQAANHAEALAAIFPDARLHV
RGSRAGSAAAFCAAHRAQAPQLAPLDGDAIPDAIDVVVTLTTSRTPVYREAAREGRLVVGVGAFTADAAEIDADTVRHSR
LVVDDPAGARHEAGDLIVAQVDWQRVASLADVLRGAFERSGPLLFKTVGCAAWDLAACRTARDALDARRGG
>A9ALD3 1.5.1.49~~~ybiC~~~Delta(1)-pyrroline-2-carboxylate reductase 2~~~COG2055
MAEPIDAVVLSLDEVHALALRVLTHHGLSDAHARAIANVITQGQRDECHSHGVYRLLVCVRSLRKGKVDPQAVPTLRRLS
SSIVAVDAHRGFSLLSFETGLPVLVEMTKQHGIAAMVINRCYHFSALWPEVEAIAAEGLVGIAMNPSHSWVAPEGGKEPV
FGTNPIAFAWPRPGGMPFVFDFATSAIARGDIELHAKQGKPIPPEWAIDAQGRPTTDPQAALQGAMRTFGGHKGSALAAM
VELLGGALIGDLTSRESMDFDEGVGATPCHGELAIAFDPKVFLGDDLDAGLARGERMFDSIVAQGARLPSQRRFDARARS
IANGVRIPRALYDEIVALLD
>Q9KWU4 6.4.1.1~~~pyc~~~Pyruvate carboxylase~~~COG1038
MSQQSIQKVLVANRGEIAIRIFRACTELNIRTVAVYSKEDSGSYHRYKADEAYLVGEGKKPIDAYLDIEGIIDIAKRNKV
DAIHPGYGFLSENIHFARRCEEEGIVFIGPKSEHLDMFGDKVKAREQAEKAGIPVIPGSDGPAETLEAVEQFGQANGYPI
IIKASLGGGGRGMRIVRSESEVKEAYERAKSEAKAAFGNDEVYVEKLIENPKHIEVQVIGDKQGNVVHLFERDCSVQRRH
QKVIEVAPSVSLSPELRDQICEAAVALAKNVNYINAGTVEFLVANNEFYFIEVNPRVQVEHTITEMITGVDIVQTQILVA
QGHSLHSKKVNIPEQKDIFTIGYAIQSRVTTEDPQNDFMPDTGKIMAYRSGGGFGVRLDTGNSFQGAVITPYYDSLLVKL
STWALTFEQAAAKMVRNLQEFRIRGIKTNIPFLENVAKHEKFLTGQYDTSFIDTTPELFNFPKQKDRGTKMLTYIGNVTV
NGFPGIGKKEKPAFDKPLGVKVDVDQQPARGTKQILDEKGAEGLANWVKEQKSVLLTDTTFRDAHQSLLATRIRSHDLKK
IANPTAALWPELFSMEMWGGATFDVAYRFLKEDPWKRLEDLRKEVPNTLFQMLLRSSNAVGYTNYPDNVIKEFVKQSAQS
GIDVFRIFDSLNWVKGMTLAIDAVRDTGKVAEAAICYTGDILDKNRTKYDLAYYTSMAKELEAAGAHILGIKDMAGLLKP
QAAYELVSALKETIDIPVHLHTHDTSGNGIYMYAKAVEAGVDIIDVAVSSMAGLTSQPSASGFYHAMEGNDRRPEMNVQG
VELLSQYWESVRKYYSEFESGMKSPHTEIYEHEMPGGQYSNLQQQAKGVGLGDRWNEVKEMYRRVNDMFGDIVKVTPSSK
VVGDMALYMVQNNLTEKDVYEKGESLDFPDSVVELFKGNIGQPHGGFPEKLQKLILKGQEPITVRPGELLEPVSFEAIKQ
EFKEQHNLEISDQDAVAYALYPKVFTDYVKTTESYGDISVLDTPTFFYGMTLGEEIEVEIERGKTLIVKLISIGEPQPDA
TRVVYFELNGQPREVVIKDESIKSSVQERLKADRTNPSHIAASMPGTVIKVLAEAGTKVNKGDHLMINEAMKMETTVQAP
FSGTIKQVHVKNGEPIQTGDLLLEIEKA
>A0A0H3JRU9 6.4.1.1~~~pycA~~~Pyruvate carboxylase~~~
MKQIKKLLVANRGEIAIRIFRAAAELDISTVAIYSNEDKSSLHRYKADESYLVGSDLGPAESYLNIERIIDVAKQANVDA
IHPGYGFLSENEQFARRCAEEGIKFIGPHLEHLDMFGDKVKARTTAIKADLPVIPGTDGPIKSYELAKEFAEEAGFPLMI
KATSGGGGKGMRIVREESELEDAFHRAKSEAEKSFGNSEVYIERYIDNPKHIEVQVIGDEHGNIVHLFERDCSVQRRHQK
VVEVAPSVGLSPTLRQRICDAAIQLMENIKYVNAGTVEFLVSGDEFFFIEVNPRVQVEHTITEMVTGIDIVKTQILVAAG
ADLFGEEINMPQQKDITTLGYAIQCRITTEDPLNDFMPDTGTIIAYRSSGGFGVRLDAGDGFQGAEISPYYDSLLVKLST
HAISFKQAEEKMVRSLREMRIRGVKTNIPFLINVMKNKKFTSGDYTTKFIEETPELFDIQPSLDRGTKTLEYIGNVTING
FPNVEKRPKPDYELASIPTVSSSKIASFSGTKQLLDEVGPKGVAEWVKKQDDVLLTDTTFRDAHQSLLATRVRTKDMINI
ASKTADVFKDGFSLEMWGGATFDVAYNFLKENPWERLERLRKAIPNVLFQMLLRASNAVGYKNYPDNVIHKFVQESAKAG
IDVFRIFDSLNWVDQMKVANEAVQEAGKISEGTICYTGDILNPERSNIYTLEYYVKLAKELEREGFHILAIKDMAGLLKP
KAAYELIGELKSAVDLPIHLHTHDTSGNGLLTYKQAIDAGVDIIDTAVASMSGLTSQPSANSLYYALNGFPRHLRTDIEG
MESLSHYWSTVRTYYSDFESDIKSPNTEIYQHEMPGGQYSNLSQQAKSLGLGERFDEVKDMYRRVNFLFGDIVKVTPSSK
VVGDMALYMVQNDLDEQSVITDGYKLDFPESVVSFFKGEIGQPVNGFNKDLQAVILKGQEALTARPGEYLEPVDFEKVRE
LLEEEQQGPVTEQDIISYVLYPKVYEQYIQTRNQYGNLSLLDTPTFFFGMRNGETVEIEIDKGKRLIIKLETISEPDENG
NRTIYYAMNGQARRIYIKDENVHTNANVKPKADKSNPSHIGAQMPGSVTEVKVSVGETVKANQPLLITEAMKMETTIQAP
FDGVIKQVTVNNGDTIATGDLLIEIEKATD
>P29986 ~~~cpcG1~~~Phycobilisome rod-core linker polypeptide CpcG1~~~COG0448
MSIPLLEYAPSSQNQRVEGYEVPNEDTPTIYRLAAAIDDADVDAIIWAGYRQIFSEHLIIKSNRQSFLESQLRNRAINVR
DFIRGLGKSEVYRTQVADLNSNYRLVDITLKRFLGRAAYNQDEEIAWSIVIGSQGLHGFIDALLDSDEYRENFGDDIVPY
QRRRYKDRPFNLVNPRYNAYWRDRQTLNALGGRSFYSARTSGTLTKDDIRRAIPANFMALAGKILTPERNYQRTIASVTS
QIKDIKIPDTSREVTTPEVTVKPVAVALPYRYIPGNKTT
>Q05238 ~~~cpcG~~~Phycobilisome rod-core linker polypeptide CpcG~~~COG0448
MTIPLLQYAPSSQNTRVAGYTVGGDEQPFVFTTDNVISDSDFDVLINAAYRQIFFHAFKCDRQQLLESQLRNGQITVRDF
IRGLLLSETFIDSFYNKNSNYRFVEQCIQRVLGRDPFSEQEKIAWSIVICTKGLAAFVDQLLNTDEYMENFGYDTVPYQR
RRSLASREQGEIPFNIKSPRYDAYYRSQLGFPQVVWQNAVRRFRTPDRVPQAGDPALFLNMARSAQIPKVNVRVSAADIS
LAAVPYRN
>P29987 ~~~cpcG2~~~Phycobilisome rod-core linker polypeptide CpcG2~~~COG0448
MSIPLLEYKPSSQNQRVPGYEVPNEDTPRIYRIEDAAYDSELKELIWATYRQVFSEHVILKFFRQGNLESQLKNRAISVR
DFVRGLAKSEAFKTLVIKSNSNYRLVELALKRLLGRAPYNKDEEIAWSIKIATNGWDGFVDALLDSEEYQSNFGENIVPY
QRRRYKDRPFNLVTPRYGNYWRDKLESERYIEGDIKNFLELAKSIEIKTVTFTPVSTANIKIPDTTRNTTPTGIPISVNP
SANFPVR
>P50040 ~~~cpcG2~~~Phycobilisome rod-core linker polypeptide CpcG2~~~COG0448
MTIPLLSYAPSSQNQRVAGYEVPNEETPWRYSLEDAVDQSDIDELIWAAYRQVFSEHVVLKSTRQPHLESQLANRAISVR
DFIRGLAKSETFRRLVVETNSNYRLVEIALKRLLGRAPYNKQEELAWSIRIATDGWQKFVDTLVDSDEYTQNFGDNTVPY
QRRRYKDRPFNLVTPRYSDYWRDKLENSRYKWGDIRNFLEMARSVKVTPVQFKPVSTANVQIPDTTRRDRPTVPASINPT
ASFPLR
>P29989 ~~~cpcG4~~~Phycobilisome rod-core linker polypeptide CpcG4~~~COG0237
MALPLLQYKPSSQNHRVTSFGAADQNEDTPYIYRIEDVSSYTDIQNIIWASYRQVFSEHEILKFNRQKTLESQVKNGSIS
VRDFIRGLAKSEAFYRLVVSVNNNYRLVDITLKRLLGRSSYNKDEQIAWSIVIGTKGFSGFVDALIDSEEYTKNFGENIV
PYQRKRMEGRPHNLVTPRYGEDFQEKAGTVQTDWRFTLDKFYSRKSQEKQLREGDPRKFADLAASVGNQGNYAQRISAFD
IDYLNAVPNRSRR
>P73093 ~~~cpcG~~~Phycobilisome rod-core linker polypeptide CpcG~~~COG0448
MALPLLNYAPKSQNVRVEGYEIGSEEKPVVFTTENILSSSDMDNLIEAAYRQIFFHAFKWDREKVLESQLRNGQITVRDF
VRGLLLSNTFRNSFYEKNSNYRFVEHCVQKILGRDVYSEREKIAWSIVVATKGYQGLIDDLLNSDEYLNNFGYDTVPYQR
RRNLPGREAGELPFNIKSPRYDAYHRRQLGFPQIVWQNEVRRFIPQEKKLTAGNPMNFLGMARSINPAANTIPKVSAQNI
NIEASVPRR
>Q8DPQ3 3.1.3.5~~~pynA~~~Pyrimidine 5'-nucleotidase PynA~~~COG1011
MFYKFLLFDLDHTLLDFDAAEDVALTQLLKEEGVADIQAYKDYYVPMNKALWKDLELKKISKQELVNTRFSRLFSHFGQE
KDGSFLAQRYQFYLAQQGQTLSGAHDLLDSLIERDYDLYAATNGITAIQTGRLAQSGLVPYFNQVFISEQLQTQKPDALF
YEKIGQQIAGFSKEKTLMIGDSLTADIQGGNNAGIDTIWYNPHHLENHTQAQPTYEVYSYQDLLDCLDKNILEKITF
>Q53226 ~~~pyp~~~Photoactive yellow protein~~~
MEIIPFGSADLDNILAREPQRAEYLPFGAVLLDRTGTILKYNRAEGGIANRNPADVIGKNFFNEIAPCAKGKRFHGEFLR
FHQTGQVNVMFDYKFAYKGANVGVKIHMKSQPDGQSCWLFVKRV
>P16113 ~~~pyp~~~Photoactive yellow protein~~~
MEHVAFGSEDIENTLAKMDDGQLDGLAFGAIQLDGDGNILQYNAAEGDITGRDPKQVIGKNFFKDVAPCTDSPEFYGKFK
EGVASGNLNTMFEYTFDYQMTPTKVKVHMKKALSGDSYWVFVKRV
>P81046 ~~~pyp~~~Photoactive yellow protein~~~
MNIVHFGSDDIENSLANMSDQDLNQLAFGAIQLDASGKVLQYNAAEEGITGRDPKSVIGKNFFEDVAPCTKSQEFQGRFK
EGVANGNLATMFEYVFDYQMKPTKVKVHMKKALVDDSYWIFVKRL
>Q53120 ~~~pyp~~~Photoactive yellow protein~~~
MEMIKFGQDDIENAMADMGDAQIDDLAFGAIQLDETGTILAYNAAEGELTGRSPQDVIGKNFFKDIAPCTDTEEFGGRFR
EGVANGDLNAMFEYVFDYQMQPTKVKVHMKRAITGDSYWIFVKRV
>P11398 ~~~cpcC~~~Phycobilisome 32.1 kDa linker polypeptide, phycocyanin-associated, rod~~~
MAITAAASRLGTEPFSNAAKIELRSDASREEVEAVINAVYRHVLGNDYIMASERLVSAESLLRDGNLTVREFVRSVAKSE
LYKKKFFYNSFQTRFIELNYKHLLGRAPYDESEIVFHLDLYQNKGYDAEIDSYIDSVEYQNNFGDNIVPYYRGFETQPGQ
KTVGFNRMFRLYRGYANSDRAQIEGTKPRLARELATNKASSIVGPSGSNPAWGYRPSVDITPRKTLGNAVGENDRVYRIE
VTGVRSPGYPSVRRSSYAIIVPYERLSEKIQQIHKLGGKIVSITSA
>P18542 ~~~cpeC~~~Phycobilisome 31.8 kDa linker polypeptide, phycoerythrin-associated, rod~~~
MPFGPASRLGVSLFDETPPVEWVPGRSQEEAETIIRAIYRQVLGNAYVMESERLAVPESQFKRGELSVREFVRAVAKSEL
YRSRFFTSCARYRAIELNFRHLLGRPPLDLEEMRSHSTILDTQGFEAEIDSYIDGDEYQSTFGENIVPYIRGYKTEALQS
MVQFTHTFQLVRGASSSSLKGDLSGKAPKLNALVIQSTPTAVISPASAGATFSTPPTGARTRLGVDASAGGKVYRIEVTG
YRAKTFNNISKFRRSNQVFLVPYEKLSQEYQRIHQQGGVIASITPV
>P07123 ~~~cpcC~~~Phycobilisome 32.1 kDa linker polypeptide, phycocyanin-associated, rod~~~COG0237
MAITTAASRLGTEPFSDAPKVELRPKASREEVESVIRAVYRHVLGNDYILASERLVSAESLLRDGNLTVREFVRSVAKSE
LYKKKFFYNSFQTRLIELNYKHLLGRAPYDESEVVYHLDLYQNKGYDAEIDSYIDSWEYQSNFGDNVVPYYRGFETQVGQ
KTAGFNRIFRLYRGYANSDRAQVEGTKSRLARELASNKASTIVGPSGTNDSWGFRASADVAPKKNLGNAVGEGDRVYRLE
VTGIRSPGYPSVRRSSTVFIVPYERLSDKIQQVHKQGGKIVSVTSA
>Q05237 ~~~cpcC~~~Phycobilisome 32.3 kDa linker polypeptide, phycocyanin-associated, rod~~~COG0237
MPVTVAASRLGTAAFDQSPVELRANYSRDDAQTVIRAVYRQVLGNDYVMSSERLTAAESLFTNGFISVRDFVRAVAQSEL
YKEKFLYNNFQTRVIELNFKHLLGRAPYDEAEVIEHLDRYQNEGFEADINSYIDSAEYTENFGDNIVPYIRSYVVQTGHR
TVGFTRMFSLQRGYANSDRAQIAGNASRLAQELARNTTSAVVGPSGVNEGWAFRSAADDYHPGQSLGGSTGLSADDQVVR
VEVAALSTPRYPRIRRSSRVFFVPVSRLSQKLQEIQRMGGRVASISPAGQ
>P73203 ~~~cpcC1~~~Phycobilisome 32.1 kDa linker polypeptide, phycocyanin-associated, rod 1~~~COG0237
MAITTAASRLGVAPYNESRPVELRPDFSLDDAKMVIRAVYRQVLGNDYIMDSERLKGAESLLTNGSISVREFVRTVAKSE
LYKKKFLYNNFQTRVIELNYKHLLGRAPFSEDEVIFHLDLYENQGFDADIDSYIDSVEYQENFGENIVPYYRFNNQVGDR
TVGFTRMFRLYRGYANSDRSQLERSSSRLATELGQNTVSAIVGPSGSNAGWAYRPSRAGNTPAKALGGTVPFGQASKLFR
VEITAISAPGYPKVRRSNKAVIVPFEQLNQTLQQINRLGGKVASITPASLS
>P50034 ~~~cpcC~~~Phycobilisome 32.1 kDa linker polypeptide, phycocyanin-associated, rod~~~COG0237
MAITAAASRLGTSAFSDAPPVELRANWSEEDLETVIRAVYRQVLGNDYVMASERLVSAESLLRNGKITVREFVRAVAKSE
LYKEKFLYGNFQTRVIELNYKHLLGRAPYDESEVIFHLDLYENEGFDADIDSYIDSPEYTNSFGDWVVPYYRGFNTQPGQ
KTVGFNRIFRLYRGYANSDRAQAEGSMSRLARDLATNRANTVVPPSNSDTAFAYYTPSADVPPRACLGGSFGESGRVYRI
EVAGIRQPGYPGVRRSSTAFLVPYEQLSAKMQQLQRTGARIISVNPA
>P11399 ~~~pecC~~~Phycobilisome 34.5 kDa linker polypeptide, phycoerythrocyanin-associated, rod~~~
MSTSVAERLAIKDEVDKKIELRPNWSEDELQIVFKTAYEQVFGRQGLYASQRFATAEALLRNGKISVKQFIELLAKSEFY
KECFFYNNSQVRFIELNYKHLLGRAPYDQSEIAFHVDLYAAAGYDAEIESYIYSPEYDNAFGNFVVPYYRGFQSIPGMKT
VGFNRIFELYRGRANSDNAQFGGKSARLRSKISMNLANTIVPPTSPIAASTSSARTLVTSPVMGDARMFIVEAIAGTLNT
NVAVRRSRQVYTVPYDRLSATYQEIHKRGGKIVKITPAS
>P18543 ~~~cpeD~~~Phycobilisome 27.9 kDa linker polypeptide, phycoerythrin-associated, rod~~~
MASQTILELWPSSSLEEVQTIIRAVYKQVLGNPHVMESERLVTAESQLCDRSITVREFVRSVAKSDFYRNRYFQSCAPYR
FVELNFLHLLGRAPQDQREVSEHIVRTVAEGYDAEIDSYIDSSEYEAAFGENVVPYYRGRSSEANSKQVGFNRIFALDRG
PAQIDSAVKSAQLVYAVATNSANAIKASSSTVIGSGTEKRFKILVQGSKFDSPRRISTTEYIVPASKMTPQIQRINRTSG
KIVSITEIV
>P31329 ~~~pecC~~~Phycobilisome 34.5 kDa linker polypeptide, phycoerythrocyanin-associated, rod~~~COG0237
MSSSVAERLAIRDAIGNKVELRQNWSEDDLQKVFRAAYEQIFGRQGIYASQKFTSAEALLRNGKISVRQFVEILAKSEFY
KECFFYKNSQVRLIELNYKHLLGRAPYDQSEIADHVDIYAARGYDADIDAYIYSSEYENAFGNSIVPYYRGFQSIPGMKT
VGFNRICELYRGRGNSDNAQMGRTNSRLRTKVSLNLPNGILPPTSAGTNFVSAAPTLISSATKGDNRMFVIEAIAGGLNT
NVAVRRSRQVYTVSYERLSATYQEIHKRGGKIVKISQV
>P73204 ~~~cpcC2~~~Phycobilisome 32.1 kDa linker polypeptide, phycocyanin-associated, rod 2~~~COG0237
MTSLVSAQRLGIVAVDEAIPLELRSRSTEEEVDAVILAVYRQVLGNDHLMSQERLTSAESLLRGREISVRDFVRAVALSE
VYRQKFFHSNPQNRFIELNYKHLLGRAPYDQSEIAFHTDLYHQGGYEAEINSYIDSVEYTENFGDWVVPYFRGFATQRNQ
KTVGFSRSFQVYRGYATSDRSQGNGSRSRLTRELARNTASPVYAGSTAESLRGTSAGSRNQMYRLQVIQGAAPGRGTRVR
RGKAEYLVSYDNLSAKLQQINRQGDTVTMISLA
>P11400 ~~~cpcI2~~~Phycobilisome 39 kDa linker polypeptide, phycocyanin-associated, rod~~~
MPITSAASRLGTTAYQTNPIELRPNWTAEDAKIVIQAVYRQVLGNDYLMQSERLTSLESLLTNGKLSVRDFVRAVAKSEL
YKTKFLYPHFQTRVIELNFKHLLGRAPYDESEVIEHLDRDQNQGFDADIDSYIDSAEYDTYFGDSIVPYYRDLVTTGVGQ
RTVGFTRSFRLYRGYANSDRSQLAGSSSRLASDLATNSATAIIAPSGGTQGWSYLPSKQGTAPSRTFGRSSQGSTPRLYR
IEVTGISLPRYPKVRRSNKEFIVPYEQLSSTLQQINKLGGKVASITFAQ
>P11401 ~~~cpcH2~~~Phycobilisome 37.5 kDa linker polypeptide, phycocyanin-associated, rod~~~
MTSSTAARQLGFEPFASTAPTELRASSDVIHAAYRQVFQVFGNDHVMQSERLTSAESLLQQGNISVRDFVRLLAQSELYR
QKFFYSTPQVRFIELNYKHLLGRAPYDESEISYHVNLYTEKGYEAEINSYIDSAEYQESFGERIVPHYRGFETQPGQKTV
GFNRMFQIYRGYANSDRSQGKNKSAWLTQDLALNLASNIQTPNFGKGLTGVVAGDRGQLYRVRVIQADRGRTTQIRRSIQ
EYLVSYDQLSPTLQRLNQRGSRVVNISPA
>O66726 2.1.3.2~~~pyrB~~~Aspartate carbamoyltransferase~~~COG0540
MRSLISSLDLTREEVEEILKYAKEFKEGKEETIKASAVLFFSEPSTRTRLSFEKAARELGIETYLVSGSESSTVKGESFF
DTLKTFEGLGFDYVVFRVPFVFFPYKEIVKSLNLRLVNAGDGTHQHPSQGLIDFFTIKEHFGEVKDLRVLYVGDIKHSRV
FRSGAPLLNMFGAKIGVCGPKTLIPRDVEVFKVDVFDDVDKGIDWADVVIWLRLQKERQKENYIPSESSYFKQFGLTKER
FEKVKLYMHPGPVNRNVDIDHELVYTEKSLIQEQVKNGIPVRKAIYKFLWT
>P05654 2.1.3.2~~~pyrB~~~Aspartate carbamoyltransferase~~~COG0540
MKHLTTMSELSTEEIKDLLQTAQELKSGKTDNQLTGKFAANLFFEPSTRTRFSFEVAEKKLGMNVLNLDGTSTSVQKGET
LYDTIRTLESIGVDVCVIRHSEDEYYEELVSQVNIPILNAGDGCGQHPTQSLLDLMTIYEEFNTFKGLTVSIHGDIKHSR
VARSNAEVLTRLGARVLFSGPSEWQDEENTFGTYVSMDEAVESSDVVMLLRIQNERHQSAVSQEGYLNKYGLTVERAERM
KRHAIIMHPAPVNRGVEIDDSLVESEKSRIFKQMKNGVFIRMAVIQRALQTNVKRGEAAYVISH
>Q727F4 2.1.3.2~~~pyrB~~~Aspartate carbamoyltransferase~~~COG0540
MQNETRSLWPHKDLLDVDQLSKDELLHLLDTAAQFHEINRRPVKKVPTLKGKSVILFFAEPSTRTKTSFDVAGKRLSADT
FSLAKSGSSLQKGESLKDTALTLEAMNPDVLVIRHSSSGAARFLAERLACGVVNAGDGWHAHPTQALLDCYSLRQVWGDT
FEGRTLCILGDIAHSRVARSNVKLLTSLGVRVRLCAPRTLLPAGVGNWPVEVFTDLDAAVRDADAVMCLRLQLERQQAGL
LPDLREYSNRYCLTPRRLELAKPEAKVLHPGPMNRGLEIASSIADAPASLVLDQVAAGVATRMAILFLLATRTDGGR
>P0A786 2.1.3.2~~~pyrB~~~Aspartate carbamoyltransferase catalytic subunit~~~COG0540
MANPLYQKHIISINDLSRDDLNLVLATAAKLKANPQPELLKHKVIASCFFEASTRTRLSFETSMHRLGASVVGFSDSANT
SLGKKGETLADTISVISTYVDAIVMRHPQEGAARLATEFSGNVPVLNAGDGSNQHPTQTLLDLFTIQETQGRLDNLHVAM
VGDLKYGRTVHSLTQALAKFDGNRFYFIAPDALAMPQYILDMLDEKGIAWSLHSSIEEVMAEVDILYMTRVQKERLDPSE
YANVKAQFVLRASDLHNAKANMKVLHPLPRVDEIATDVDKTPHAWYFQQAGNGIFARQALLALVLNRDLVL
>P9WIT7 2.1.3.2~~~pyrB~~~Aspartate carbamoyltransferase~~~COG0540
MTPRHLLTAADLSRDDATAILDDADRFAQALVGRDIKKLPTLRGRTVVTMFYENSTRTRVSFEVAGKWMSADVINVSAAG
SSVGKGESLRDTALTLRAAGADALIIRHPASGAAHLLAQWTGAHNDGPAVINAGDGTHEHPTQALLDALTIRQRLGGIEG
RRIVIVGDILHSRVARSNVMLLDTLGAEVVLVAPPTLLPVGVTGWPATVSHDFDAELPAADAVLMLRVQAERMNGGFFPS
VREYSVRYGLTERRQAMLPGHAVVLHPGPMVRGMEITSSVADSSQSAVLQQVSNGVQVRMAVLFHVLVGAQDAGKEGAA
>P56585 2.1.3.2~~~pyrB~~~Aspartate carbamoyltransferase~~~
MTPLETKRPLQLNDQGQLQHFLSLDGLRRELLTEILDTADSFLEVGARAVKKVPLLRGKTVCNVFFENSTRTRTTFELAA
QRLSADVITLNVSTSSASKGETLLDTLRNLEAMAADMFVVRHGDSGAAHFIAEHVCPQVAIINGGDGRHAHPTQGMLDML
TIRRHKGSFENLSVAIVGDILHSRVARSNMLALKTLGCPDIRVIAPKTLLPIGVEQYGVKVYTDMTEGLKDVDVVIMLRL
QRERMTGGLLPSEGEFYRLFGLTTARLAGAKPDAIVMHPGPINRGVEIESAVADGPHSVILNQVTYGIAIRMAVLSMAMS
GQTAQRQFDQENAQ
>Q934T0 2.1.3.2~~~pyrB~~~Aspartate carbamoyltransferase~~~
MPNTHDTKNNVSPSEYAKFDPSTIHQRLNTSLSRPQLNSDGSIRHFLGVEGLNKAQLQAIIAKALFFEPSTRTRTTFEVA
EKRLGANVLNLDIASSSAKKGESLRDTLWNLQAMTADIFVVRHSASGAAHFMATEVTPDIAIINGGDGWHAHPTQGMLDM
LTIHREAPRPFEELSVAIIGDVKHSRVARSDISALQTLGVKDIRVIAPRTLLPKGIERFGVQVYEDMNSCVRDCDVIMGL
RIQNERIGSPLLASSSEYYKQYGITPERVALAKPDALIMHPGPMNRGVEIASSVADGPQSVILKQVSNGVAIRMAVLALT
MEGQRAHQANRG
>Q2FZ75 2.1.3.2~~~pyrB~~~Aspartate carbamoyltransferase~~~COG0540
MNHLLSMEHLSTDQIYKLIQKASQFKSGERQLPNFEGKYVANLFFENSTRTKCSFEMAELKLGLKTISFETSTSSVSKGE
SLYDTCKTLESIGCDLLVIRHPFNNYYEKLANINIPIANAGDGSGQHPTQSLLDLMTIYEEYGYFEGLNVLICGDIKNSR
VARSNYHSLKALGANVMFNSPNAWIDDSLEAPYVNIDDVIETVDIVMLLRIQHERHGLAEETRFAADDYHQKHGLNEVRY
NKLQEHAIVMHPAPVNRGVEIQSDLVEASKSRIFKQMENGVYLRMAVIDELLK
>P65618 2.1.3.2~~~pyrB~~~Aspartate carbamoyltransferase~~~
MNHLLSMEHLSTDQIYKLIQKASQFKSGERQLPNFEGKYVANLFFENSTRTKCSFEMAELKLGLKTISFETSTSSVSKGE
SLYDTCKTLESIGCDLLVIRHPFNNYYEKLANINIPIANAGDGSGQHPTQSLLDLMTIYEEYGYFEGLNVLICGDIKNSR
VARSNYHSLKALGANVMFNSPNAWIDDSLEAPYVNIDDVIETVDIVMLLRIQHERHGLAEETRFAADDYHQKHGLNEVRY
NKLQEHAIVMHPAPVNRGVEIQSDLVEASKSRIFKQMENGVYLRMAVIDELLK
>Q8ZB39 2.1.3.2~~~pyrB~~~Aspartate carbamoyltransferase~~~COG0540
MANPLYHKHIISINDLSRDELELVLRTAASLKKTPQPELLKHKVIASCFFEASTRTRLSFETSIHRLGASVVGFSDSSNT
SLGKKGETLADTMSVISTYVDAIVMRHPQEGASRLAAQFSGNVPIVNAGDGANQHPTQTLLDLFTIQETQGRLDNINIAM
VGDLKYGRTVHSLTQALAKFNGNHFFFIAPDALAMPAYILQMLEEKEIEYSLHESLEEVVPELDILYMTRVQKERLDPSE
YANVKAQFILRSSDLTGARDNLKVLHPLPRIDEITTDVDKTPYAYYFQQAGNGIFARQALLALVLNAELAL
>O66990 3.5.2.3~~~pyrC~~~Dihydroorotase~~~COG0044
MLKLIVKNGYVIDPSQNLEGEFDILVENGKIKKIDKNILVPEAEIIDAKGLIVCPGFIDIHVHLRDPGQTYKEDIESGSR
CAVAGGFTTIVCMPNTNPPIDNTTVVNYILQKSKSVGLCRVLPTGTITKGRKGKEIADFYSLKEAGCVAFTDDGSPVMDS
SVMRKALELASQLGVPIMDHCEDDKLAYGVINEGEVSALLGLSSRAPEAEEIQIARDGILAQRTGGHVHIQHVSTKLSLE
IIEFFKEKGVKITCEVNPNHLLFTEREVLNSGANARVNPPLRKKEDRLALIEGVKRGIIDCFATDHAPHQTFEKELVEFA
MPGIIGLQTALPSALELYRKGIISLKKLIEMFTINPARIIGVDLGTLKLGSPADITIFDPNKEWILNEETNLSKSRNTPL
WGKVLKGKVIYTIKDGKMVYKD
>Q81WF0 3.5.2.3~~~pyrC~~~Dihydroorotase~~~COG0044
MNYLFKNGRYMNEEGKIVATDLLVQDGKIAKVAENITADNAEVIDVNGKLIAPGLVDVHVHLREPGGEHKETIETGTLAA
AKGGFTTICAMPNTRPVPDCREHMEDLQNRIKEKAHVNVLPYGAITVRQAGSEMTDFETLKELGAFAFTDDGVGVQDASM
MLAAMKRAAKLNMAVVAHCEENTLINKGCVHEGKFSEKHGLNGIPSVCESVHIARDILLAEAADCHYHVCHVSTKGSVRV
IRDAKRAGIKVTAEVTPHHLVLCEDDIPSADPNFKMNPPLRGKEDHEALIEGLLDGTIDMIATDHAPHTAEEKAQGIERA
PFGITGFETAFPLLYTNLVKKGIITLEQLIQFLTEKPADTFGLEAGRLKEGRTADITIIDLEQEEEIDPTTFLSKGKNTP
FAGWKCQGWPVMTIVGGKIAWQKESALV
>B1IV40 3.5.2.3~~~pyrC~~~Dihydroorotase~~~
MTAPSQVLKIRRPDDWHLHLRDGDMLKTVVPYTSEIYGRAIVMPNLAPPVTTVEAAVAYRQRILDAVPAGHDFTPLMTCY
LTDSLDPNELERGFNEGVFTAAKLYPANATTNSSHGVTSVDAIMPVLERMEKIGMPLLVHGEVTHADIDIFDREARFIES
VMEPLRQRLTALKVVFEHITTKDAADYVRDGNERLAATITPQHLMFNRNHMLVGGVRPHLYCLPILKRNIHQQALRELVA
SGFNRVFLGTDSAPHARHRKESSCGCAGCFNAPTALGSYATVFEEMNALQHFEAFCSVNGPQFYGLPVNDTFIELVREEQ
QVAESIALTDDTLVPFLAGETVRWSVKQ
>P05020 3.5.2.3~~~pyrC~~~Dihydroorotase~~~COG0418
MTAPSQVLKIRRPDDWHLHLRDGDMLKTVVPYTSEIYGRAIVMPNLAPPVTTVEAAVAYRQRILDAVPAGHDFTPLMTCY
LTDSLDPNELERGFNEGVFTAAKLYPANATTNSSHGVTSIDAIMPVLERMEKIGMPLLVHGEVTHADIDIFDREARFIES
VMEPLRQRLTALKVVFEHITTKDAADYVRDGNERLAATITPQHLMFNRNHMLVGGVRPHLYCLPILKRNIHQQALRELVA
SGFNRVFLGTDSAPHARHRKESSCGCAGCFNAPTALGSYATVFEEMNALQHFEAFCSVNGPQFYGLPVNDTFIELVREEQ
QVAESIALTDDTLVPFLAGETVRWSVKQ
>A6T7D6 3.5.2.3~~~pyrC~~~Dihydroorotase~~~
MTAQSQVLKIRRPDDWHIHLRDDDMLKTVVPYTSEFYGRAIVMPNLVPPVTTVAAAIAYRQRIMDAVPAGHDFTPLMTCY
LTDSLDPAELERGFNEGVFTAAKLYPANATTNSSHGVTSTDAIMPVLERMEKLGMPLLVHGEVTHAEIDIFDREARFIET
VMEPLRQRLPGLKVVFEHITTKDAAEYVRDGNELLAATITPQHLMFNRNHMLVGGIRPHLYCLPVLKRNIHQQALRELVA
SGFSRAFLGTDSAPHARHRKEASCGCAGCFNAPTALGSYATVFEEMNALQHFEAFCSLNGPRFYGLPVNESYVELVREET
TVVDSIALPNDTLVPFLAGETVRWTVKK
>P9WHL3 3.5.2.3~~~pyrC~~~Dihydroorotase~~~COG0044
MSVLIRGVRPYGEGERVDVLVDDGQIAQIGPDLAIPDTADVIDATGHVLLPGFVDLHTHLREPGREYAEDIETGSAAAAL
GGYTAVFAMANTNPVADSPVVTDHVWHRGQQVGLVDVHPVGAVTVGLAGAELTEMGMMNAGAAQVRMFSDDGVCVHDPLI
MRRALEYATGLGVLIAQHAEEPRLTVGAVAHEGPMAARLGLAGWPRAAEESIVARDALLARDAGARVHICHASAAGTVEI
LKWAKDQGISITAEVTPHHLLLDDARLASYDGVNRVNPPLREASDAVALRQALADGIIDCVATDHAPHAEHEKCVEFAAA
RPGMLGLQTALSVVVQTMVAPGLLSWRDIARVMSENPACIARLPDQGRPLEVGEPANLTVVDPDATWTVTGADLASRSAN
TPFESMSLPATVTATLLRGKVTARDGKIRA
>P06204 3.5.2.3~~~pyrC~~~Dihydroorotase~~~
MTAPSQVLKIRRPDDWHVHLRDGDMLKTVVPYTSEIYGRAIVMPNLASPITTVDAAIAYRQRILDAVPAGHDFTPLMTCY
LTDSLDADELERGFHEGVFTAAKLYPANATTNSSHGVTSVDAIMPVLERMEKLGIPLLVHGEVTHADVDIFDREARFIDT
VMEPLRQRLTALKVVFEHITTKDAAQYVRDGNDYLAATITPQHLMFNRNDMLVGGIRPHLYCLPILKRNIHQQALRELVA
SGFTRAFLGTDSAPHSRHRKETSCGCAGCFNAPSALGSYAAVFEEMNALAHFEAFCSLNGPQFYGLPMNTGWVELVRDEQ
QIPGNIALADDSLVPFLAGETVRWSVKK
>Q5HGN1 3.5.2.3~~~pyrC~~~Dihydroorotase~~~
MKLIKNGKVLQNGELQQADILIDGKVIKQIAPAIEPSNGVDIIDAKGHFVSPGFVDVHVHLREPGGEYKETIETGTKAAA
RGGFTTVCPMPNTRPVPDSVEHFEALQKLIDDNAQVRVLPYASITTRQLGKELVDFPALVKEGAFAFTDDGVGVQTASMM
YEGMIEAAKVNKAIVAHCEDNSLIYGGAMHEGKRSKELGIPGIPNICESVQIARDVLLAEAAGCHYHVCHVSTKESVRVI
RDAKRAGIHVTAEVTPHHLLLTEDDIPGNNAIYKMNPPLRSTEDREALLEGLLDGTIDCIATDHAPHARDEKAQPMEKAP
FGIVGSETAFPLLYTHFVKNGDWTLQQLVDYLTIKPCETFNLEYGTLKENGYADLTIIDLDSEQEIKGEDFLSKADNTPF
IGYKVYGNPILTMVEGEVKFEGDK
>P65906 3.5.2.3~~~pyrC~~~Dihydroorotase~~~
MKLIKNGKVLQNGELQQADILIDGKVIKQIAPAIEPSNGVDIIDAKGHFVSPGFVDVHVHLREPGGEYKETIETGTKAAA
RGGFTTVCPMPNTRPVPDSVEHFEALQKLIDDNAQVRVLPYASITTRQLGKELVDFPALVKEGAFAFTDDGVGVQTASMM
YEGMIEAAKVNKAIVAHCEDNSLIYGGAMHEGKRSKELGIPGIPNICESVQIARDVLLAEAAGCHYHVCHVSTKESVRVI
RDAKRAGIHVTAEVTPHHLLLTEDDIPGNNAIYKMNPPLRSTEDREALLEGLLDGTIDCIATDHAPHARDEKAQPMEKAP
FGIVGSETAFPLLYTHFVKNGDWTLQQLVDYLTIKPCETFNLEYGTLKENGYADLTIIDLDSEQEIKGEDFLSKADNTPF
IGYKVYGNPILTMVEGEVKFEGDK
>Q5SK67 3.5.2.3~~~pyrC~~~Dihydroorotase~~~COG0044
MILIRNVRLVDARGERGPADVLIGEGRILSLEGGEAKQVVDGTGCFLAPGFLDLHAHLREPGEEVKEDLFSGLLAAVRGG
YTDLVSMPNTKPPVDTPEAVRALKEKAKALGLARLHPAAALTEKQEGKTLTPAGLLREAGAVLLTDDGRTNEDAGVLAAG
LLMAAPLGLPVAVHAEDAGLRRNGVMNDGPLADLLGLPGNPPEAEAARIARDLEVLRYALRRSPATPRLHVQHLSTKRGL
ELVREAKRAGLPVTAEATPHHLTLTEEALRTFDPLFKVAPPLRGEEDREALLEGLLDGTLDAIATDHAPHTLAEKEKDLL
RAPFGIPSLEVAFPLLYTELHLKRGFPLQRLVELFTDGPRRVLGLPPLHLEEGAEASLVLLSPKERPVDPSAFASKARYS
PWAGWVLGGWPVLTLVAGRIVHEALK
>Q9KL24 3.5.2.3~~~pyrC~~~Dihydroorotase~~~COG0418
MTTLTITRPDDWHVHLRDGDVLADTVRDISRYNGRALIMPNTVPPVTTTEMALAYRERIMAAQPQAHFEPLMALYLTDNT
SPEEIRKAKASGKVVAAKLYPAGATTNSDSGVTSAKNIYPVLQAMQEVGMLLLVHGEVTTHEVDIFDREKTFLDTVLAPI
VNDFPQLKIVLEHITTADAVTFVQQAGDNVAATITAHHLLFNRNHMLVGGIRPHFYCLPILKRATHQHALVAAATSGSKK
FFLGTDSAPHAKGRKEAACGCAGSYTAHAALELYAEVFEKEGKLENLEAFASFNGPDFYGLPRNQETVTLTKQAWPVAES
MPFGSDIVVPIRAGENIEWTVK
>Q8ZFU4 3.5.2.3~~~pyrC~~~Dihydroorotase~~~COG0418
MTAQPQTLKIRRPDDWHIHLRDDEMLSTVLPYTSEVFARAIVMPNLAQPITTVASAIAYRERILAAVPAGHKFTPLMTCY
LTNSLDAKELTTGFEQGVFTAAKLYPANATTNSTHGVSDIPAIYPLFEQMQKIGMPLLIHGEVTDAAVDIFDREARFIDQ
ILEPIRQKFPELKIVFEHITTKDAADYVLAGNRFLGATVTPQHLMFNRNHMLVGGIRPHLFCLPILKRSTHQQALRAAVA
SGSDRFFLGTDSAPHAKHRKESSCGCAGVFNAPAALPAYASVFEELNALQHLEAFCALNGPRFYGLPVNDDVVELVRTPF
LQPEEIPLGNESVIPFLAGQTLNWSVKR
>A2RJT9 1.3.98.1~~~pyrDA~~~Dihydroorotate dehydrogenase A (fumarate)~~~COG0167
MLNTTFANAKFANPFMNASGVHCMTIEDLEELKASQAGAYITKSSTLEKREGNPLPRYVDLELGSINSMGLPNLGFDYYL
DYVLKNQKENAQEGPIFFSIAGMSAAENIAMLKKIQESDFSGITELNLSCPNVPGKPQLAYDFEATEKLLKEVFTFFTKP
LGVKLPPYFDLVHFDIMAEILNQFPLTYVNSVNSIGNGLFIDPEAESVVIKPKDGFGGIGGAYIKPTALANVRAFYTRLK
PEIQIIGTGGIETGQDAFEHLLCGATMLQIGTALHKEGPAIFDRIIKELEEIMNQKGYQSIADFHGKLKSL
>Q9X9S0 1.3.98.1~~~pyrDA~~~Probable dihydroorotate dehydrogenase A (fumarate)~~~COG0167
MVSTKTQIAGFEFDNCLMNAAGVACMTIEELEEVKNSAAGTFVTKTATLDFRQGNPEPRYQDVPLGSINSMGLPNNGLDY
YLDYLLDLQEKESNRTFFLSLVGMSPEETHTILKKVQESDFRGLTELNLSCPNVPGKPQIAYDFETTDRILAEVFAYFTK
PLGIKLPPYFDIVHFDQAAAIFNKYPLKFVNCVNSIGNGLYIEDESVVIRPKNGFGGIGGEYIKPTALANVHAFYQRLNP
QIQIIGTGGVLTGRDAFEHILCGASMVQVGTTLHKEGVSAFDRITNELKAIMVEKGYESLEDFRGKLRYID
>P25996 1.3.1.14~~~pyrD~~~Dihydroorotate dehydrogenase B (NAD(+)), catalytic subunit~~~COG0167
MLEVKLPGLDLKNPIIPASGCFGFGKEFSRFYDLSCLGAIMIKATTKEPRFGNPTPRVAETGAGMLNAIGLQNPGLDSVL
HHELPWLEQFDTPIIANVAGSQVDDYVEVAEHISKAPNVHALELNISCPNVKTGGIAFGTNPEMAADLTKAVKEVSDVPV
YVKLSPNVANITEIALAIEEAGADGLTMINTLIGMRLDLKTGKPILANKTGGLSGPAVKPVAIRMVYEVSQMVNIPIIGM
GGVQTAEDALEFLLAGASAVAVGTANFVNPFACPEIIEQLPSVLLQYGYQSIEECIGRSWNHEKQPAHHRA
>A4J560 1.3.1.14~~~pyrD~~~Dihydroorotate dehydrogenase B (NAD(+)), catalytic subunit~~~COG0167
MKLNLAVKIGQLDMINPVTTASGTFGYGQEYSPYVDLNQLGAIVVKGTTLEPREGNPTPRLVETPSGILNSIGLQNSGVD
YLLEHYVPFFKKLQTNVIVNISGNTAEEYGQLAARLDEADGIAALEVNISCPNVKKGGMAFGGDFRTAAEVTKVVKNSTA
LPVIVKLSPNVTDIAEIARAVEGAGADGLSVINTLLGMAIDVRKRKPVLGNTMGGLSGPAVKPVALRAVWQVYKAVHIPI
IGMGGIMNATDALEFILAGAQAVSVGTANFVNPYATKEIIQGMEKYLMENGIGDINELVGAAHL
>P0DH74 1.3.1.14~~~pyrDB~~~Dihydroorotate dehydrogenase B (NAD(+)), catalytic subunit~~~COG0167
MMKNPLAVSIPGLTLKNPIIPASGCFGFGEEYANYYDLDQLGSIMIKATTPQARYGNPTPRVAETPSGMLNAIGLQNPGL
EVVMQEKLPKLEKYPNLPIIANVAGACEEDYVAVCAKIGQAPNVKAIELNISCPNVKHGGIAFGTDPEVAFQLTQAVKKV
ASVPIYVKLSPNVTDIVPIAQAIEAGGADGFSMINTLLGMRIDLKTRKPILANQTGGLSGPAIKPVAIRLIRQVASVSQL
PIIGMGGVQTVDDVLEMFMAGASAVGVGTANFTDPYICPKLIDGLPKRMEELGIESLEQLIKEVREGQQNAR
>Q9CFW8 1.3.1.14~~~pyrDB~~~Dihydroorotate dehydrogenase B (NAD(+)), catalytic subunit~~~COG0167
MTENNRLSVKLPGLDLKNPIIPASGCFGFGEEYAKYYDLNKLGSIMVKATTLHPRFGNPTPRVAETASGMLNAIGLQNPG
LEVIMAEKLPWLNENFPDLPIIANVAGSEEDDYVAVCAKIGDAPNVKVIELNISCPNVKHGGQAFGTDPDVAAALVKACK
AVSKVPLYVKLSPNVTDIVPIAKAVEAAGADGLTMINTLMGVRFDLKTRKPVLANITGGLSGPAIKPVALKLIHQVAQVV
DIPIIGMGGVESAQDVLEMYMAGASAVAVGTANFADPFVCPKIIEKLPEVMDQYGIDSLENLIQEVKNSKK
>P54322 1.3.1.14~~~pyrDB~~~Dihydroorotate dehydrogenase B (NAD(+)), catalytic subunit~~~COG0167
MTENNRLSVKLPGLDLKNPIIPASGCFGFGEEYAKYYDLNKLGSIMVKATTLHPRFGNPTPRVAETASGMLNAIGLQNPG
LEVIMTEKLPWLNENFPELPIIANVAGSEEADYVAVCAKIGDAANVKAIELNISCPNVKHGGQAFGTDPEVAAALVKACK
AVSKVPLYVKLSPNVTDIVPIAKAVEAAGADGLTMINTLMGVRFDLKTRQPILANITGGLSGPAIKPVALKLIHQVAQVV
DIPIIGMGGVANAQDVLEMYMAGASAVAVGTANFADPFVCPKIIDKLPELMDQYRIESLESLIQEVKEGKK
>B7GZW7 1.3.5.2~~~pyrD~~~Dihydroorotate dehydrogenase (quinone)~~~
MLYSLARPMLFSLAPERAHELTLSMLDKAHKLGMMRQTVEAKPTTCMGIEFPNPVGLAAGLDKNGAHIDALAGLGFGFIE
IGTITPRPQSGNPKPRLFRIPEAKAIINRMGFNNDGVDKLIENVKASKFRGILGINIGKNADTPVEKAVDDYLICLEKVY
NYASYITVNISSPNTKNLRSLQSGDALTELLQTLKARQLELAEQYNHYVPLVLKVAPDLTAEDVEFISAQLLDFKIDGLI
VTNTTLSREGVENLPYGNESGGLSGAPVFEKSTECLRLFAQTLKGQIPLIGVGGILSGEQAAAKQQAGATLVQIYSGLIY
TGPTLVKQCVEAMT
>P0A7E1 1.3.5.2~~~pyrD~~~Dihydroorotate dehydrogenase (quinone)~~~COG0167
MYYPFVRKALFQLDPERAHEFTFQQLRRITGTPFEALVRQKVPAKPVNCMGLTFKNPLGLAAGLDKDGECIDALGAMGFG
SIEIGTVTPRPQPGNDKPRLFRLVDAEGLINRMGFNNLGVDNLVENVKKAHYDGVLGINIGKNKDTPVEQGKDDYLICME
KIYAYAGYIAINISSPNTPGLRTLQYGEALDDLLTAIKNKQNDLQAMHHKYVPIAVKIAPDLSEEELIQVADSLVRHNID
GVIATNTTLDRSLVQGMKNCDQTGGLSGRPLQLKSTEIIRRLSLELNGRLPIIGVGGIDSVIAAREKIAAGASLVQIYSG
FIFKGPPLIKEIVTHI
>B5Z6I2 1.3.5.2~~~pyrD~~~Dihydroorotate dehydrogenase (quinone)~~~
MLYPLVKKYLFSLDAEDAHEKVCKILRTLSKSSFLCSLIHSQWGYKNPKLENEILGLNFPNPLGLAAGFDKNASMLRALI
AFGFGYLEAGTLTNEAQVGNERPRLFRHIEEESLQNAMGFNNYGAVLGARSFNRFAPYKTPIGINLGKNKHIEQAHALED
YKAVLNQCLNIGDYYTFNLSSPNTPNLRDLQNKAFVNELFCMAKEMTHKPLFLKIAPDLEIDDMLEIVNSAIEAGAHGII
ATNTTIDKSLVFAPKEMGGLSGKCLTKKSREVFKELAKAFFNKSVLVSVGGISDAKEAYERIKMGASLLQIYSAFIYNGP
NLCQNILKDLVKLLQKDGFLSVKEAIGADLR
>P9WHL1 1.3.5.2~~~pyrD~~~Dihydroorotate dehydrogenase (quinone)~~~COG0167
MYPLVRRLLFLIPPEHAHKLVFAVLRGVAAVAPVRRLLRRLLGPTDPVLASTVFGVRFPAPLGLAAGFDKDGTALSSWGA
MGFGYAEIGTVTAHPQPGNPAPRLFRLADDRALLNRMGFNNHGARALAIRLARHRPEIPIGVNIGKTKKTPAGDAVNDYR
ASARMVGPLASYLVVNVSSPNTPGLRDLQAVESLRPILSAVRAETSTPVLVKIAPDLSDSDLDDIADLAVELDLAGIVAT
NTTVSRDGLTTPGVDRLGPGGISGPPLAQRAVQVLRRLYDRVGDRLALISVGGIETADDAWERITAGASLLQGYTGFIYG
GERWAKDIHEGIARRLHDGGFGSLHEAVGSARRRQPS
>K7QRJ5 5.5.1.-~~~pyrE3~~~Dialkyldecalin synthase~~~
MSDTVIIAGGGPVGLMLACELGLAGVDTVVLERHDAPREPSRGGAINATVVELFTQRGIMESLRDDGFEFRMAHFAHIPL
APERVPGDRAFSFAVPHAQVERRLEERARSLGVRVRRSTEITSVRQTPDGVQVTTGDGEVVEGAYLVGCDGSASLVREQA
GIPFPGVDPDFHGLWGDIKVEPGAPVLERIGARQYELGLCMVAPIGPDTVRVITGEFDVPSPPADQEVGFDELRAAVARI
AGVELDGVPGWLSRWTATSRQAERYREGRILLAGDAAHTLFPLGGQALGTGIEDAVNLGWKLAATVQGWAPPSLLDSYHE
ERHAAGARACASTRAQTTIMRSLARVGELRALLTELAGLEEVNAYLVRMVGGIDGSRLPDVPLVTAEGETSVYRLLEAGR
GVLLDLGAGLPAVRHPQVTYVRAEPTNRLDATAVLLRPDGVVAWRAPQDGLEAALETWFGPAA
>Q81WF6 2.4.2.10~~~pyrE~~~Orotate phosphoribosyltransferase~~~COG0461
MKKEIASHLLEIGAVFLQPNDPFTWSSGMKSPIYCDNRLTLSYPKVRQTIAAGLEELIKEHFPTVEVIAGTATAGIAHAA
WVSDRMDLPMCYVRSKAKGHGKGNQIEGKAEKGQKVVVVEDLISTGGSAITCVEALREAGCEVLGIVSIFTYELEAGKEK
LEAANVASYSLSDYSALTEVAAEKGIIGQAETKKLQEWRKNPADEAWITA
>P46534 2.4.2.10~~~pyrE~~~Orotate phosphoribosyltransferase~~~
MKHDIAAKLLQIGAVALQPNEPFTWSSGLKSPIYCDNRLTLAYPGVRRLIADALAELIRTHFPKADLIAGTAAGIPHAAW
VSERLELPMCYVRSQAKRHGKGKQIEGQARPGQRVVVIEDLISTGGTSLAAVRALKEAGCEVLGVAAIFTYGLEKAKQAF
AAENLPAYTLTDYNTLIETAVRLGAVSEHDLATLRQWRENPEEWGS
>P25972 2.4.2.10~~~pyrE~~~Orotate phosphoribosyltransferase~~~COG0461
MGGNQILKQIIAKHLLDIQAVFLRPNEPFTWASGILSPIYCDNRLTLSFPEVRNDVASGISKLVKEHFPEAEMIAGTATA
GIPHAALAADHLNLPMCYVRSKPKAHGKGNQIEGAVQEGQKTVVIEDLISTGGSVLEACAALQAAGCEVLGVVSIFTYGL
PKAEEAFAKAELPYYSLTDYDTLTEVALENGNIHSDDLKKLQTWKRNPESKDWFKK
>Q8NM11 2.4.2.10~~~pyrE~~~Orotate phosphoribosyltransferase~~~COG0461
MSSNSINAEARAELAELIKELAVVHGEVTLSSGKKADYYIDVRRATLHARASRLIGQLLREATADWDYDAVGGLTLGADP
VATAIMHADGRDINAFVVRKEAKKHGMQRRIEGPDLTGKKVLVVEDTTTTGNSPLTAVAALREAGIEVVGVATVVDRATG
ADEVIAAEGLPYRSLLGLSDLGLN
>P0A7E3 2.4.2.10~~~pyrE~~~Orotate phosphoribosyltransferase~~~COG0461
MKPYQRQFIEFALSKQVLKFGEFTLKSGRKSPYFFNAGLFNTGRDLALLGRFYAEALVDSGIEFDLLFGPAYKGIPIATT
TAVALAEHHDLDLPYCFNRKEAKDHGEGGNLVGSALQGRVMLVDDVITAGTAIRESMEIIQANGATLAGVLISLDRQERG
RGEISAIQEVERDYNCKVISIITLKDLIAYLEEKPEMAEHLAAVKAYREEFGV
>P43855 2.4.2.10~~~pyrE~~~Orotate phosphoribosyltransferase~~~COG0461
MEQYKRDFIEFALSRNVLKFGEFTLKSGRKSPYFFNAGLFNTGADLARLGEFYAAAIQASAVDFDVVFGPAYKGIPIGTS
VSVALFNRYGIDKPVCFNRKEVKDHGEGGNLIGSPLQGKILLVDDVITAGTAIRESMELISANQAELAAVLIALNRKERG
KGELSAIQEVERDYQCQVLSIIDLDDLMQFIEQDPRYSSHLPEMRAYRAEFGV
>P56162 2.4.2.10~~~pyrE~~~Orotate phosphoribosyltransferase~~~COG0461
MDIKACYQNAKALLEGHFLLSSGFHSNYYLQSAKVLEDPKLAEQLALELAKQIQEAHLNIECVCSPAIGGILAGYELARA
LGVRFIFTERVDNTMALRRGFEVKKNEKILVCEDIITTGKSAMECAKVLEEKGAQIVAFGALANRGICKRAHSHLKAQEG
ACLPSHLPLFALEDFVFDMHKPSSCPLCATSVAIKPGSRGN
>P9WHK9 2.4.2.10~~~pyrE~~~Orotate phosphoribosyltransferase~~~COG0461
MAGPDRAELAELVRRLSVVHGRVTLSSGREADYYVDLRRATLHHRASALIGRLMRELTADWDYSVVGGLTLGADPVATAI
MHAPGRPIDAFVVRKSAKAHGMQRLIEGSEVTGQRVLVVEDTSTTGNSALTAVHAVQDVGGEVVGVATVVDRATGAAEAI
EAEGLRYRSVLGLADLGLD
>P42719 2.4.2.10~~~pyrE~~~Orotate phosphoribosyltransferase~~~
MIQTTFPDRAVMAELLAKMLWEIKAVHFNAAQPYKLASGMASPVYIDCRKLLSFPRIRSTVMDFAASTLLRDAGFEQFDC
IAGGETAGIPFAALLADRLGLPMIYVRKQPKGHGRNAQIEGNMPEGSRVLVIEDLTTAGGSMFKFIDAVRAAGGIVDHGI
ALFFYGIFGEQRFADGKVRLHHIATWRNVLPSPGSRSSSTTRRCRKSSPSSMRRWLGRERMVA
>P08870 2.4.2.10~~~pyrE~~~Orotate phosphoribosyltransferase~~~
MKPYQRQFIEFALNKQVLKFGEFTLKSGRKSPYFFNAGLFNTGRDLALLGRFYAEALVDSGIEFDLLFGPAYKGIPIATT
TAVALAEHHDKDLPYCFNRKEAKDHGEGGSLVGSALQGRVMLVDDVITAGTAIRESMEIIQAHGATLAGVLISLDRQERG
RGEISAIQEVERDYGCKVISIITLKDLIAYLEEKPDMAEHLAAVRAYREEFGV
>P99144 2.4.2.10~~~pyrE~~~Orotate phosphoribosyltransferase~~~
MAKEIAKSLLDIEAVTLSPNDLYTWSSGIKSPIYCDNRVTLGYPLVRGAIRDGLINLIKEHFPEVEVISGTATAGIPHAA
FIAEKLKLPMNYVRSSNKSHGKQNQIEGAKSEGKKVVVIEDLISTGGSSVTAVEALKQAGAEVLGVVAIFTYGLKKADDT
FSNIQLPFYTLSDYNELIEVAENEGKISSEDIQTLVEWRDNLA
>Q8DTV2 2.4.2.10~~~pyrE~~~Orotate phosphoribosyltransferase~~~COG0461
MTLAKDIARDLLDIKAVYLKPEEPFTWASGIKSPIYTDNRITLSYPETRTLIENGFVETIKEAFPEVEVIAGTATAGIPH
GAIIADKMNLPFAYIRSKPKDHGAGNQIEGRVTKGQKMVIIEDLISTGGSVLDAVAAAQREGADVLGVVAIFTYELPKAT
ANFEKASVKLVTLSNYSELIKVAKVQGYIDADGLTLLKKFKENQETWQD
>Q9A076 2.4.2.10~~~pyrE~~~Orotate phosphoribosyltransferase~~~
MTLASQIATQLLDIKAVYLKPEDPFTWASGIKSPIYTDNRVTLSYPKTRDLIENGFVETIKAHFPEVEVIAGTATAGIPH
GAIIADKMTLPFAYIRSKPKDHGAGNQIEGRVLKGQKMVIIEDLISTGGSVLDAAAAASREGADVLGVVAIFTYELPKAS
QNFKEAGIKLITLSNYTELIAVAKLQGYITNDGLHLLKKFKEDQVNWQQ
>P0CB78 2.4.2.10~~~pyrE~~~Orotate phosphoribosyltransferase~~~COG0461
MTLAKDIASHLLKIQAVYLKPEEPFTWASGIKSPIYTDNRVTLAYPETRTLIENGFVEAIKEAFPEVEVIAGTATAGIPH
GAIIADKMDLPFAYIRSKPKDHGAGNQIEGRVAQGQKMVVVEDLISTGGSVLEAVAAAKREGADVLGVVAIFSYQLPKAD
KNFADAGVKLVTLSNYSELIHLAQEEGYITPEGLDLLKRFKEDQENWQEG
>P61498 2.4.2.10~~~pyrE~~~Orotate phosphoribosyltransferase~~~COG0461
MDVLELYRRTGALLEGHFLLRSGMHSPFFLQSAALLQHPLYAEAVGEALGKLFEDEKVDFVIAPAIGGVVLSFVVAKALG
ARALFAEKDGRGGMLIRKGLTVNPGDRFLAVEDVVTTGESVRKAIRAAEARGGVLVGVGAIVDRSGGRAAFGVPFRALLA
LEVPQYPEEACPLCREGVPLEEV
>Q9KVD5 2.4.2.10~~~pyrE~~~Orotate phosphoribosyltransferase~~~COG0461
MKAYQREFIEFALEKQVLKFGEFTLKSGRKSPYFFNAGLFNTGRDLARLGRFYAAALVDSGIEFDVLFGPAYKGIPIATT
TAVALADHHDVDTPYCFNRKEAKNHGEGGNLVGSKLEGRVMLVDDVITAGTAIRESMELIQANKADLAGVLVAIDRQEKG
KGELSAIQEVERDFGCAVISIVSLTDLITYLEQQGNNTEHLEAVKAYRAQYGI
>P25971 4.1.1.23~~~pyrF~~~Orotidine 5'-phosphate decarboxylase~~~COG0284
MKNNLPIIALDFASAEETLAFLAPFQQEPLFVKVGMELFYQEGPSIVKQLKERNCELFLDLKLHDIPTTVNKAMKRLASL
GVDLVNVHAAGGKKMMQAALEGLEEGTPAGKKRPSLIAVTQLTSTSEQIMKDELLIEKSLIDTVVHYSKQAEESGLDGVV
CSVHEAKAIYQAVSPSFLTVTPGIRMSEDAANDQVRVATPAIAREKGSSAIVVGRSITKAEDPVKAYKAVRLEWEGIKS
>Q9PIC1 4.1.1.23~~~pyrF~~~Orotidine 5'-phosphate decarboxylase~~~COG0284
MKLCVALDLSTKEECLQLAKELKNLDIWLKVGLRAYLRDGFKFIEELKKVDDFKIFLDLKFHDIPNTMADACEEVSKLGV
DMINIHASAGKIAIQEVMTRLSKFSKRPLVLAVSALTSFDEENFFSIYRQKIEEAVINFSKISYENGLDGMVCSVFESKK
IKEHTSSNFLTLTPGIRPFGETNDDQKRVANLAMARENLSDYIVVGRPIYKNENPRAVCEKILNKIHRKNISENDIEQNY
EVIQQKEWDMCNHFEEWIKTRPDKEHALKEFYAKCGIKY
>Q83E06 4.1.1.23~~~pyrF~~~Orotidine 5'-phosphate decarboxylase~~~COG0284
MEKPDPKVIVAIDAGTVEQARAQINPLTPELCHLKIGSILFTRYGPAFVEELMQKGYRIFLDLKFYDIPQTVAGACRAVA
ELGVWMMNIHISGGRTMMETVVNALQSITLKEKPLLIGVTILTSLDGSDLKTLGIQEKVPDIVCRMATLAKSAGLDGVVC
SAQEAALLRKQFDRNFLLVTPGIRLETDEKGDQKRVMTPRAAIQAGSDYLVIGRPITQSTDPLKALEAIDKDIKTR
>P08244 4.1.1.23~~~pyrF~~~Orotidine 5'-phosphate decarboxylase~~~COG0284
MTLTASSSSRAVTNSPVVVALDYHNRDDALAFVDKIDPRDCRLKVGKEMFTLFGPQFVRELQQRGFDIFLDLKFHDIPNT
AAHAVAAAADLGVWMVNVHASGGARMMTAAREALVPFGKDAPLLIAVTVLTSMEASDLVDLGMTLSPADYAERLAALTQK
CGLDGVVCSAQEAVRFKQVFGQEFKLVTPGIRPQGSEAGDQRRIMTPEQALSAGVDYMVIGRPVTQSVDPAQTLKAINAS
LQRSA
>Q5L0U0 4.1.1.23~~~pyrF~~~Orotidine 5'-phosphate decarboxylase~~~COG0284
MHTPFIVALDFPSKQEVERFLRPFAGTPLFVKVGMELYYQEGPAIVAFLKEQGHAVFLDLKLHDIPNTVKQAMKGLARVG
ADLVNVHAAGGRRMMEAAIEGLDAGTPSGRMRPRCIAVTQLTSTDERMLHEELWISRPLVETVAHYAALAKESGLDGVVC
SANEAAFIKERCGASFLAVTPGIRFADDAAHDQVRVVTPRKARALGSDYIVIGRSLTRAADPLRTYARLQHEWNGGERES
TTPT
>Q5FJB3 4.1.1.23~~~pyrF~~~Orotidine 5'-phosphate decarboxylase~~~COG0284
MDRPVIVALDLDNEEQLNKILSKLGDPHDVFVKVGMELFYNAGIDVIKKLTQQGYKIFLDLKMHDIPNTVYNGAKALAKL
GITFTTVHALGGSQMIKSAKDGLIAGTPAGHSVPKLLAVTELTSISDDVLRNEQNCRLPMAEQVLSLAKMAKHSGADGVI
CSPLEVKKLHENIGDDFLYVTPGIRPAGNAKDDQSRVATPKMAKEWGSSAIVVGRPITLASDPKAAYEAIKKEFN
>P9WIU3 4.1.1.23~~~pyrF~~~Orotidine 5'-phosphate decarboxylase~~~COG0284
MTGFGLRLAEAKARRGPLCLGIDPHPELLRGWDLATTADGLAAFCDICVRAFADFAVVKPQVAFFESYGAAGFAVLERTI
AELRAADVLVLADAKRGDIGATMSAYATAWVGDSPLAADAVTASPYLGFGSLRPLLEVAAAHGRGVFVLAATSNPEGAAV
QNAAADGRSVAQLVVDQVGAANEAAGPGPGSIGVVVGATAPQAPDLSAFTGPVLVPGVGVQGGRPEALGGLGGAASSQLL
PAVAREVLRAGPGVPELRAAGERMRDAVAYLAAV
>Q59654 4.1.1.23~~~pyrF~~~Orotidine 5'-phosphate decarboxylase~~~
MSACQSPIIVALDFPTREAALALADQLDPKLCRVKVGKELFTSCAAGIVETLRGKGFEVFLDLKFHDIPNTTAMAVKAAA
EMGVWMVNVHCSGGLRMMAACRETLEAFSGPRPLLIGVTVLTSMEREDLAGIGLDIEPQEQVLRLAALAQKAGMDGLVCS
AQEAPALKAAHPGLQLVTPGIRPAGSAQDDQRRILTPRQALDAGSDYLVIGRPISQAADPAKALAAIVAELG
>P99145 4.1.1.23~~~pyrF~~~Orotidine 5'-phosphate decarboxylase~~~
MKDLPIIALDFESKEKVNQFLDLFDESLFVKVGMELFYQEGPQLINEIKERGHDVFLDLKLHDIPNTVGKAMEGLAKLNV
DLVNVHAAGGVKMMSEAIKGLRKHNQHTKIIAVTQLTSTTEDMLRHEQNIQTSIEEAVLNYAKLANAAGLDGVVCSPLES
RMLTEKLGTSFLKVTPGIRPKGASQDDQHRITTPEEARQLGSTHIVVGRPITQSDNPVESYHKIKESWLV
>P0CB75 4.1.1.23~~~pyrF~~~Orotidine 5'-phosphate decarboxylase~~~COG0284
MREHRPIIALDFPSFEAVKEFLALFPAEESLYLKVGMELYYAAGPEIVSYLKGLGHSVFLDLKLHDIPNTVKSAMKILSQ
LGVDMTNVHAAGGVEMMKAAREGLGSQAKLIAVTQLTSTSEAQMQEFQNIQTSLQESVIHYAKKTAEAGLDGVVCSAQEV
QVIKQATNPDFICLTPGIRPAGVAVGDQKRVMTPADAYQIGSDYIVVGRPITQAEDPVAAYHAIKDEWTQDWN
>Q9WYG7 4.1.1.23~~~pyrF~~~Orotidine 5'-phosphate decarboxylase~~~COG0284
MTPVLSLDMEDPIRFIDENGSFEVVKVGHNLAIHGKKIFDELAKRNLKIILDLKFCDIPSTVERSIKSWDHPAIIGFTVH
SCAGYESVERALSATDKHVFVVVKLTSMEGSLEDYMDRIEKLNKLGCDFVLPGPWAKALREKIKGKILVPGIRMEVKADD
QKDVVTLEEMKGIANFAVLGREIYLSENPREKIKRIKEMRL
>Q9KQT7 4.1.1.23~~~pyrF~~~Orotidine 5'-phosphate decarboxylase~~~COG0284
MNDPKVIVALDYDNLADALAFVDKIDPSTCRLKVGKEMFTLFGPDFVRELHKRGFSVFLDLKFHDIPNTCSKAVKAAAEL
GVWMVNVHASGGERMMAASREILEPYGKERPLLIGVTVLTSMESADLQGIGILSAPQDHVLRLATLTKNAGLDGVVCSAQ
EASLLKQHLGREFKLVTPGIRPAGSEQGDQRRIMTPAQAIASGSDYLVIGRPITQAAHPEVVLEEINSSLV
>P13242 6.3.4.2~~~pyrG~~~CTP synthase~~~COG0504
MTKYIFVTGGVVSSLGKGIVAASLGRLLKNRGLNVTIQKFDPYINVDPGTMSPYQHGEVFVTDDGAETDLDLGHYERFID
INLNKFSNVTTGKIYSTVLKKERRGDYLGGTVQVIPHITNELKDRVYRAGKETNADVVITEIGGTVGDIESLPFLEAIRQ
MKSDIGRENVMYIHCTLVPYIKAAGELKTKPTQHSVKELRSLGIQPNIIVVRTEMPISQDMKDKIALFCDIDTKAVIECE
DADNLYSIPLELQKQGLDKLVCEHMKLACKEAEMSEWKELVNKVSNLSQTITIGLVGKYVELPDAYISVVESLRHAGYAF
DTDVKVKWINAEEVTENNIAELTSGTDGIIVPGGFGDRGVEGKIVATKYARENNIPFLGICLGMQVASIEYARNVLGLKG
AHSAEIDPSTQYPIIDLLPEQKDVEDLGGTLRLGLYPCKLEEGTKAFEVYQDEVVYERHRHRYEFNNEFRQQMEEQGFVF
SGTSPDGRLVEIIELKDHPWFVASQFHPEFKSRPTRPQPLFKGFIGASVEAANQK
>Q59321 6.3.4.2~~~pyrG~~~CTP synthase~~~
MSFKSIFLTGGVVSSLGKGLTAASLALLLERQDLKVAMLKLDPYLNVDPGTMNPYEHGEVYVTDDGVETDLDLGHYHRFS
SVQLSKYSTATSGQIYTKVLTKERNGEFLGSTVQVIPHVTNEIINVIQSCADHHKPDILIVEIGGTIGDIESLPFLEAVR
QFRCEHPQDCLSIHMTYVPYLRAAKEIKTKPTQHSVQNLRSIGISPDVILCRSEAPLSTEVKRKISLFCNVPEHAVFNAI
DLERSIYEMPLLLAKENISDFLLNKLGFSPKPLDLSDWQDLVEALCDKERQHVRIGLVGKYLEHKDAYKSVFESLFHASV
PANCSLELVPIAPESEDLLEQLSQCDGCLIPGGFGTRSWEGKISAARYCRERNIPCFGICLGMQALVVEYARNVLDKPLA
NSMEINPETPDPVVCMMEGQDSVVKGGTMRLGAYPCRIAPGSLASAAYKTDLVQERHRHRYEVNPSYIERLEEHGLKIAG
VCPLGELCEIVEIPNHRWMLGVQFHPEFLSKLAKPHPLFIEFIRAAKAYSLEKANHEHR
>P0A7E5 6.3.4.2~~~pyrG~~~CTP synthase~~~COG0504
MTTNYIFVTGGVVSSLGKGIAAASLAAILEARGLNVTIMKLDPYINVDPGTMSPIQHGEVFVTEDGAETDLDLGHYERFI
RTKMSRRNNFTTGRIYSDVLRKERRGDYLGATVQVIPHITNAIKERVLEGGEGHDVVLVEIGGTVGDIESLPFLEAIRQM
AVEIGREHTLFMHLTLVPYMAASGEVKTKPTQHSVKELLSIGIQPDILICRSDRAVPANERAKIALFCNVPEKAVISLKD
VDSIYKIPGLLKSQGLDDYICKRFSLNCPEANLSEWEQVIFEEANPVSEVTIGMVGKYIELPDAYKSVIEALKHGGLKNR
VSVNIKLIDSQDVETRGVEILKGLDAILVPGGFGYRGVEGMITTARFARENNIPYLGICLGMQVALIDYARHVANMENAN
STEFVPDCKYPVVALITEWRDENGNVEVRSEKSDLGGTMRLGAQQCQLVDDSLVRQLYNAPTIVERHRHRYEVNNMLLKQ
IEDAGLRVAGRSGDDQLVEIIEVPNHPWFVACQFHPEFTSTPRDGHPLFAGFVKAASEFQKRQAK
>O87761 6.3.4.2~~~pyrG~~~CTP synthase~~~COG0504
MSTKYIFVTGGGTSSMGKGIVAASLGRLLKNRGLKVTVQKFDPYLNIDPGTMSPYQHGEVFVTDDGAETDLDLGHYERFI
DINLNKYSNVTSGKVYSEILRKERKGEYLGATVQMVPHVTNMLKEKIKRAATTTDADIIITEVGGTVGDMESLPFIEALR
QMKAEVGADNVMYIHTVPILHLRAAGELKTKIAQNATKTLREYGIQANMLVLRSEVPITTEMRDKIAMFCDVAPEAVIQS
LDVEHLYQIPLNLQAQNMDQIVCDHLKLDAPKADMAEWSAMVDHVMNLKKKVKIALVGKYVELPDAYISVTEALKHAGYA
SDAEVDINWVNANDVTDENVAELVGDAAGIIVPGGFGQRGTEGKIAAIKYARENDVPMLGICLGMQLTAVEFARNVLGLE
GAHSFELDPETKYPVIDIMRDQVDVEDMGGTLRLGLYPAKLKNGSRAKAAYNDAEVVQRRHRHRYEFNNKYREDFEKAGF
VFSGVSPDNRLVEIVELSGKKFFVACQYHPELQSRPNRPEELYTEFIRVAVENSK
>A0QYQ7 6.3.4.2~~~pyrG~~~CTP synthase~~~COG0504
MPALRKHPQTATKHLFVTGGVVSSLGKGLTASSLGQLLTARGLQVTMQKLDPYLNVDPGTMNPFQHGEVFVTEDGAETDL
DVGHYERFLDRNLSGSANVTTGQVYSSVIAKERRGEYLGDTVQVIPHITDEIKSRILAMAEPDAAGVRPDVVITEVGGTV
GDIESLPFLEAARQVRHEVGRENCFFLHVSLVPYLAPSGELKTKPTQHSVAALRSIGITPDALILRCDRDVPEPLKNKIA
LMCDVDVDGVISTPDAPSIYDIPKVLHREELDAYVVRRLNLPFRDVDWTEWDDLLRRVHEPQETVRIALVGKYIDLSDAY
LSVAEALRAGGFKHRAKVEMRWVASDDCETEHGAAAALSDVHGVLIPGGFGIRGIEGKIGAISYARKRGLPVLGLCLGLQ
CIVIEAARSVGITEANSAEFDPKTPDPVISTMADQRDAVAGEADLGGTMRLGAYPAVLTPNSVVAQAYQSTEVSERHRHR
FEVNNAYRDRIAKSGLRFSGTSPDGHLVEFVEYDPQIHPFLVGTQAHPELKSRPTRPHPLFAAFIGAAIDYKAAERLPGM
DLPEQFVPVEHSDADAPALEEPLEKSDVRG
>P9WHK7 6.3.4.2~~~pyrG~~~CTP synthase~~~COG0504
MRKHPQTATKHLFVSGGVASSLGKGLTASSLGQLLTARGLHVTMQKLDPYLNVDPGTMNPFQHGEVFVTEDGAETDLDVG
HYERFLDRNLPGSANVTTGQVYSTVIAKERRGEYLGDTVQVIPHITDEIKRRILAMAQPDADGNRPDVVITEIGGTVGDI
ESQPFLEAARQVRHYLGREDVFFLHVSLVPYLAPSGELKTKPTQHSVAALRSIGITPDALILRCDRDVPEALKNKIALMC
DVDIDGVISTPDAPSIYDIPKVLHREELDAFVVRRLNLPFRDVDWTEWDDLLRRVHEPHETVRIALVGKYVELSDAYLSV
AEALRAGGFKHRAKVEICWVASDGCETTSGAAAALGDVHGVLIPGGFGIRGIEGKIGAIAYARARGLPVLGLCLGLQCIV
IEAARSVGLTNANSAEFDPDTPDPVIATMPDQEEIVAGEADLGGTMRLGSYPAVLEPDSVVAQAYQTTQVSERHRHRYEV
NNAYRDKIAESGLRFSGTSPDGHLVEFVEYPPDRHPFVVGTQAHPELKSRPTRPHPLFVAFVGAAIDYKAGELLPVEIPE
IPEHTPNGSSHRDGVGQPLPEPASRG
>P99072 6.3.4.2~~~pyrG~~~CTP synthase~~~
MTKFIFVTGGVVSSLGKGITASSLGRLLKDRGLNVTIQKFDPYLNVDPGTMSPYQHGEVFVTDDGAETDLDLGHYERFID
INLNKFSNVTAGKVYSHVLKKERRGDYLGGTVQVIPHITNEIKERLLLAGESTNADVVITEIGGTTGDIESLPFIEAIRQ
IRSDLGRENVMYVHCTLLPYIKAAGEMKTKPTQHSVKELRGLGIQPDLIVVRTEYEMTQDLKDKIALFCDINKESVIECR
DADSLYEIPLQLSQQNMDDIVIKRLQLNAKYETQLDEWKQLLDIVNNLDGKITIGLVGKYVSLQDAYLSVVESLKHAGYP
FAKDIDIRWIDSSEVTDENAAEYLADVDGILVPGGFGFRASEGKISAIKYARENNVPFFGICLGMQLATVEFSRNVLGLE
GAHSAELDPATPYPIIDLLPEQKDIEDLGGTLRLGLYPCSIKEGTLAQDVYGKAEIEERHRHRYEFNNDYREQLEANGMV
ISGTSPDGRLVEMVEIPTNDFFIACQFHPEFLSRPNRPHPIFKSFIEASLKYQQNK
>Q5SIA8 6.3.4.2~~~pyrG~~~CTP synthase~~~COG0504
MNGSADAGPRPRKYVFITGGVVSSLGKGILTSSLGALLRARGYRVTAIKIDPYVNVDAGTMRPYEHGEVFVTADGAETDL
DIGHYERFLDMDLSRGNNLTTGQVYLSVIQKERRGEYLSQTVQVIPHITDEIKERIRKVAEEQKAEIVVVEVGGTVGDIE
SLPFLEAIRQFRFDEGEGNTLYLHLTLVPYLETSEEFKTKPTQHSVATLRGVGIQPDILVLRSARPVPEEVRRKVALFTN
VRPGHVFSSPTVEHLYEVPLLLEEQGLGRAVERALGLEAVIPNLSFWQEAVRVLKHPERTVKIAIAGKYVKMPDAYLSLL
EALRHAGIKNRARVEVKWVDAESLEAADLDEAFRDVSGILVPGGFGVRGIEGKVRAAQYARERKIPYLGICLGLQIAVIE
FARNVAGLKGANSTEFDPHTPHPVIDLMPEQLEVEGLGGTMRLGDWPMRIKPGTLLHRLYGKEEVLERHRHRYEVNPLYV
DGLERAGLVVSATTPGMRGRGAGLVEAIELKDHPFFLGLQSHPEFKSRPMRPSPPFVGFVEAALAYQERA
>Q52NW7 3.1.1.88~~~estP~~~Pyrethroid hydrolase~~~
MEICTKGSRKHLTSRASEPSYNVPENQYVLYVVSSTLSIVKQLVKVAESKKSRFSGAIEKLNERLDSLKDYRIINRDLVV
KDLERLKKRFDTEVINAELSEQLAKINVNLSRSYSEKGYLRLEKATGSENEWWAKIKPSQNDDWQQFEPDGYTIFSSRDH
YASLFKSYSDYEIEAKIPLPLRRGKAVVLYPEYISRICVLPESRSIQREQENFTKLRDKGIALSKKDWQAKLTTDELAEQ
EKERATINKRLGYFATEHEKVGIVHKALEPKLKPFQQIEKEWRQCKVKSKSTFPNSMTFVQNPAYQAVHSGFKKLKEQIG
LADEDILLSLEKIEAIGLVNMPLLYERWCLLQIIKVLTQAFRYQPEDNWKRKLIANIQGNEEQISIQFFNPSVSRAITLQ
YEPFLANGKRPDFVLDVEAITKSGNQISKRLVVDAKYYSAAYLKQRGGIGGVIHELYNGKDYSECQENSVFVLHPVLDAV
EKVVSPQEWAKDSYLGELSMFDWEPAHHQRQATNYGAVCANPMKSQRYLDEIQRMLGMFLQYGIEDNTSFRGASDDTHAV
NFCVSCGSEKVVDVTKSMSSNNQKRWYRCNECTHFTVYTHCGTCNTRLIKNGEYWTYLSLMPMSSINIKCPNCESPV
>C0LA90 3.1.1.88~~~pytH~~~Pyrethroid hydrolase~~~
MTVTDIILIHGALNRGACYDAVVPLLEARGYRVHAPDLTGHTPGDGGHLSVVDMEHYTRPVADILARAEGQSILLGHSLG
GASISWLAQHHPDKVAGLIYLTAVLTAPGITPETFVLPGEPNRGTPHALDLIQPVDEGRGLQADFSRLERLREVFMGDYP
GEGMPPAEQFIQTQSTVPFGTPNPMEGRALEIPRLYIEALDDVVIPIAVQRQMQKEFPGPVAVVSLPASHAPYYSMPERL
AEAIADFADAPAEYRQTATKAGPDRPAGADGGRADRADLP
>O31749 2.7.4.22~~~pyrH~~~Uridylate kinase~~~COG0528
MEKPKYKRIVLKLSGEALAGEQGNGINPTVIQSIAKQVKEIAELEVEVAVVVGGGNLWRGKTGSDLGMDRATADYMGMLA
TVMNSLALQDSLETLGIQSRVQTSIEMRQVAEPYIRRKAIRHLEKKRVVIFAAGTGNPYFSTDTTAALRAAEIEADVILM
AKNNVDGVYNADPRKDESAVKYESLSYLDVLKDGLEVMDSTASSLCMDNDIPLIVFSIMEEGNIKRAVIGESIGTIVRGK
>P0A7E9 2.7.4.22~~~pyrH~~~Uridylate kinase~~~COG0528
MATNAKPVYKRILLKLSGEALQGTEGFGIDASILDRMAQEIKELVELGIQVGVVIGGGNLFRGAGLAKAGMNRVVGDHMG
MLATVMNGLAMRDALHRAYVNARLMSAIPLNGVCDSYSWAEAISLLRNNRVVILSAGTGNPFFTTDSAACLRGIEIEADV
VLKATKVDGVFTADPAKDPTATMYEQLTYSEVLEKELKVMDLAAFTLARDHKLPIRVFNMNKPGALRRVVMGEKEGTLIT
E
>Q831V1 2.7.4.22~~~pyrH~~~Uridylate kinase~~~COG0528
MVKPKYQRVVLKLSGEALAGEDGFGIKPPVIKEIVQEIKEVHELGIEMAIVVGGGNIWRGQIGAQMGMERAQADYMGMLA
TVMNALALQDTLENLGVPTRVQTSIEMRQIAEPYIRRRAERHLEKGRVVIFAGGTGNPYFSTDTTAALRAAEVDADVILM
AKNNVDGVYSADPRVDETATKFEELTHLDVISKGLQVMDSTASSLSMDNDIPLVVFNLNEAGNIRRAILGENIGTTVRGK
>P43890 2.7.4.22~~~pyrH~~~Uridylate kinase~~~COG0528
MSQPIYKRILLKLSGEALQGEDGLGIDPAILDRMAVEIKELVEMGVEVSVVLGGGNLFRGAKLAKAGMNRVVGDHMGMLA
TVMNGLAMRDSLFRADVNAKLMSAFQLNGICDTYNWSEAIKMLREKRVVIFSAGTGNPFFTTDSTACLRGIEIEADVVLK
ATKVDGVYDCDPAKNPDAKLYKNLSYAEVIDKELKVMDLSAFTLARDHGMPIRVFNMGKPGALRQVVTGTEEGTTIC
>P56106 2.7.4.22~~~pyrH~~~Uridylate kinase~~~COG0528
MQAKIKNKRVLVKFSGEALAGDNQFGIDIHVLDHIAKEIKSLVENDIEVGIVIGGGNIIRGVSAAQGGIIRRTSGDYMGM
LATVINAVAMQEALEHIGLDTRVQSAIEIKEICESYIYRKAIRHLEKGRVVIFGAGTGNPFFTTDTAATLRAIEIGSDLI
IKATKVDGIYDKDPNKFKDAKKLDTLSYNDALIGDIEVMDDTAISLAKDNKLPIVVCNMFKKGNLLQVIKHQQGVFSMVK
>A0QVD9 2.7.4.22~~~pyrH~~~Uridylate kinase~~~COG0528
MADSNVAGRAAPIRPLYTRVLLKLGGEMFGGGQVGLDPDVVAQVARQIAEVVRSGAQVAVVIGGGNFFRGAQLQQRGMER
TRSDYMGMLGTVMNSLALQDFLQKEGIDTRVQTAITMGQVAEPYIPLRAVRHLEKGRVVIFGAGMGLPYFSTDTTAAQRA
LEIGAEVVLMAKAVDGVFTDDPRTNPDAELITAISHREVIDRGLKVADATAFSLCMDNGMPILVFNLLTSGNIARAVAGE
KIGTLVTT
>P9WHK5 2.7.4.22~~~pyrH~~~Uridylate kinase~~~COG0528
MTEPDVAGAPASKPEPASTGAASAAQLSGYSRVLLKLGGEMFGGGQVGLDPDVVAQVARQIADVVRGGVQIAVVIGGGNF
FRGAQLQQLGMERTRSDYMGMLGTVMNSLALQDFLEKEGIVTRVQTAITMGQVAEPYLPLRAVRHLEKGRVVIFGAGMGL
PYFSTDTTAAQRALEIGADVVLMAKAVDGVFAEDPRVNPEAELLTAVSHREVLDRGLRVADATAFSLCMDNGMPILVFNL
LTDGNIARAVRGEKIGTLVTT
>P65932 2.7.4.22~~~pyrH~~~Uridylate kinase~~~
MTQQIKYKRVLLKLSGESLMGSDPFGINHDTIVQTVGEIAEVVKMGVQVGIVVGGGNIFRGVSAQAGSMDRATADYMGMM
ATVMNALALKDAFETLGIKARVQSALSMQQIAETYARPKAIQYLEEGKVVIFAAGTGNPFFTTDTAAALRGAEMNCDVML
KATNVDGVYTADPKKDPSATRYETITFDEALLKNLKVMDATAFALCRERKLNIVVFGIAKEGSLKRVITGEDEGTLVHC
>P65933 2.7.4.22~~~pyrH~~~Uridylate kinase~~~
MATNAKPVYKRILLKLSGEALQGTEGFGIDASILDRMAQEIKELVELGIQVGVVIGGGNLFRGAGLAKAGMNRVVGDHMG
MLATVMNGLAMRDALHRAYVNARLMSAIPLNGVCDNYSWAEAISLLRNNRVVILSAGTGNPFFTTDSAACLRGIEIEADV
VLKATKVDGVFTADPAKDPSATMYDQLTYSEVLDKELKVMDLAAFTLARDHKLPIRVFNMNKPGALRRVVMGEKEGTLIT
E
>Q2FZ22 2.7.4.22~~~pyrH~~~Uridylate kinase~~~COG0528
MAQISKYKRVVLKLSGEALAGEKGFGINPVIIKSVAEQVAEVAKMDCEIAVIVGGGNIWRGKTGSDLGMDRGTADYMGML
ATVMNALALQDSLEQLDCDTRVLTSIEMKQVAEPYIRRRAIRHLEKKRVVIFAAGIGNPYFSTDTTAALRAAEVEADVIL
MGKNNVDGVYSADPKVNKDAVKYEHLTHIQMLQEGLQVMDSTASSFCMDNNIPLTVFSIMEEGNIKRAVMGEKIGTLITK
>P65936 2.7.4.22~~~pyrH~~~Uridylate kinase~~~
MAQISKYKRVVLKLSGEALAGEKGFGINPVIIKSVAEQVAEVAKMDCEIAVIVGGGNIWRGKTGSDLGMDRGTADYMGML
ATVMNALALQDSLEQLDCDTRVLTSIEMKQVAEPYIRRRAIRHLEKKRVVIFAAGIGNPYFSTDTTAALRAAEVEADVIL
MGKNNVDGVYSADPKVNKDAVKYEHLTHIQMLQEGLQVMDSTASSFCMDNNIPLTVFSIMEEGNIKRAVMGEKIGTLITK
>P65938 2.7.4.22~~~pyrH~~~Uridylate kinase~~~
MEPKYQRILIKLSGEALAGEKGVGIDIPTVQAIAKEIAEVHVSGVQIALVIGGGNLWRGEPAADAGMDRVQADYTGMLGT
VMNALVMADSLQHYGVDTRVQTAIPMQNVAEPYIRGRALRHLEKNRIVVFGAGIGSPYFSTDTTAALRAAEIEADAILMA
KNGVDGVYNADPKKDANAVKFDELTHGEVIKRGLKIMDATASTLSMDNDIDLVVFNMNEAGNIQRVVFGEHIGTTVSNKV
CD
>Q97R83 2.7.4.22~~~pyrH~~~Uridylate kinase~~~COG0528
MKMANPKYKRILIKLSGEALAGERGVGIDIQTVQTIAKEIQEVHSLGIEIALVIGGGNLWRGEPAAEAGMDRVQADYTGM
LGTVMNALVMADSLQQVGVDTRVQTAIAMQQVAEPYVRGRALRHLEKGRIVIFGAGIGSPYFSTDTTAALRAAEIEADAI
LMAKNGVDGVYNADPKKDKTAVKFEELTHRDVINKGLRIMDSTASTLSMDNDIDLVVFNMNQPGNIKRVVFGENIGTTVS
NNIEEKE
>Q9PPX6 2.7.4.22~~~pyrH~~~Uridylate kinase~~~COG0528
MRKQRIVIKISGACLKQNDSSIIDFIKINDLAEQIEKISKKYIVSIVLGGGNIWRGSIAKELDMDRNLADNMGMMATIIN
GLALENALNHLNVNTIVLSAIKCDKLVHESSANNIKKAIEKEQVMIFVAGTGFPYFTTDSCAAIRAAETESSIILMGKNG
VDGVYDSDPKINPNAQFYEHITFNMALTQNLKVMDATALALCQENNINLLVFNIDKPNAIVDVLEKKNKYTIVSK
>P59009 2.7.4.22~~~pyrH~~~Uridylate kinase~~~COG0528
MSELSYRRILLKLSGEALMGDGDYGIDPKVINRLAHEVIEAQQAGAQVALVIGGGNIFRGAGLAASGMDRVTGDHMGMLA
TVINALAMQDALEKLGAKVRVMSAIKINDVCEDFIRRRAIRHLEKGRIAIFAAGTGNPFFTTDSGAALRAIEIGADLLLK
ATKVDGVYDKDPKKHSDAVRYDSLTYDEVIMQGLEVMDTAAFALARDSDLPLRIFGMSEPGVLLRILHGAQIGTLVQGRS
>K7QVW7 5.-.-.-~~~pyrI4~~~Spiro-conjugate synthase~~~
MTTPQIDERAMEAGAAALQETIVDPGPLDVTALAVAAALAAGLHSAADDPAAALDKCIVLDELTEFAEKLVVHDRPGGIG
TTVEYVEVYEDASGVRLGTATGNAVVLKMEPHMWQFHQSVSELADGSFEAVGVIDCTAMLRRMTQVLRVTGRSGRYAGKS
GFMTLAISDPNQRPPHYSVQVVLC
>P0A7F3 ~~~pyrI~~~Aspartate carbamoyltransferase regulatory chain~~~COG1781
MTHDNKLQVEAIKRGTVIDHIPAQIGFKLLSLFKLTETDQRITIGLNLPSGEMGRKDLIKIENTFLSEDQVDQLALYAPQ
ATVNRIDNYEVVGKSRPSLPERIDNVLVCPNSNCISHAEPVSSSFAVRKRANDIALKCKYCEKEFSHNVVLAN
>Q9X1X4 ~~~~~~Dihydroorotate dehydrogenase B (NAD(+)), electron transfer subunit homolog~~~COG0543
MGGTALNEIVKKVKIAEDVFDFWIHSPSVSKEARPGQFVVIRLHEKGERIPLTVADTKPEEGLFRMVVKVVGKTTHELSL
KKEGDTILDVVGPLGNPSEIENYGNVLLVGGGVGIATLYPIAKALKEAGNNITTVLGARTKDYLIMVDEFKEISDVLLVT
DDGSAGMKGVVTDAMDKLFRERKFDICWAVGPTIMMKFCTLKAREFGVPIWVSLNPIMVDGTGMCGACRVTVSGQIKFAC
VDGPEFRGEEVDWDELLKRLAQYREQEKISYERFLKTAGESE
>P25983 ~~~pyrK~~~Dihydroorotate dehydrogenase B (NAD(+)), electron transfer subunit~~~COG0543
MKKAYLTVCSNQQIADRVFQMVLKGELVQGFTTPGQFLHLKVSEAVTPLLRRPISIADVNFEKNEVTIIYRVDGEGTRLL
SLKQQGELVDVLGPLGNGFPVNEVQPGKTALLVGGGVGVPPLQELSKRLIEKGVNVIHVLGFQSAKDVFYEEECRQYGDT
YVATADGSYGETGFVTDVIKRKKLEFDILLSCGPTPMLKALKQEYAHKEVYLSMEERMGCGIGACFACVCHTNESETSYV
KVCLDGPVFKAQEVAL
>A4J559 ~~~pyrK~~~Dihydroorotate dehydrogenase B (NAD(+)), electron transfer subunit~~~COG0543
MSKVFDAKVLAVYMVAPNTYYMEFDAPDIARLAVPGQFVHVRCGETNDPLLRRPISIHMVSRPKGVLALLFRVVGKGTEI
LSQQKPGDRVNMMGPLGRGFTLPLPGSKVAVAAGGIGAAPLVFLVQELANIKCQVTVYLGARDKRSILCDGQFIQMEAEV
VIATDDGSLGFKGTVPELMKRHMDWRKTAMTYVCGPGIMMKEISTMLAEADVPGEVSLEERMGCGVGACLSCAVKISHHG
QISNKRACFEGPVFPSWQVVWE
>P0DH76 ~~~pyrK~~~Dihydroorotate dehydrogenase B (NAD(+)), electron transfer subunit~~~COG0543
MQRKQEMMTIVAQKQLAPRIYQLDLQGELVKEMTRPGQFVHIKVPRADLLLRRPISINQIDHSNETCRLIYRVEGAGTEV
FATMKAGEQLDILGPLGNGFDITTVAAGQTAFIVGGGIGIPPLYELSKQLNEKGVKVIHFLGYASKEVAYYQQEFMALGE
THFATDDGSFGAHGNVGRLLSEALAKGRIPDAVYACGANGMLKAIDSLFPTHPHVYLSLEERMACGIGACYACVCHKKGD
TTGAKSVKVCDEGPIFKASEVIL
>Q9CFW7 ~~~pyrK~~~Dihydroorotate dehydrogenase B (NAD(+)), electron transfer subunit~~~COG0543
MPKLQEMMTIVSQREVASNIFEMVLKGELVEEMDLPGQFLHLAVPNASMLLRRPISISSWDKVAKTCTILYRIGDETSGT
YEISKLQSGAKIDVMGPLGNGFPVDEVVSTDKILIVGGGIGVPPLYELAKQLEEKNCQMTILLGFASEKVKILEKEFAEL
KNVSLKIATDDGSYGTKGHVGMLMEEIDFEVDALYTCGAPAMLKAVAKKYEQLERLYISMESRMACGIGACYACVEHDKE
DENHALKVCEDGPVFLGKQLLL
>P56968 ~~~pyrK~~~Dihydroorotate dehydrogenase B (NAD(+)), electron transfer subunit~~~COG0543
MSQLQEMMTVVSQREVAYNIFEMVLKGTLVDEMDLPGQFLHLAVPNGAMLLRRPISISSWDKRAKTCTILYRIGDETTGT
YKLSKLESGAKVDVMGPLGNGFPVAEVTSTDKILIIGGGIGVPPLYELAKQLEKTGCQMTILLGFASENVKILENEFSNL
KNVTLKIATDDGSYGTKGHVGMLMNEIDFEVDALYTCGAPAMLKAVAKKYDQLERLYISMESRMACGIGACYACVEHDKE
DESHALKVCEDGPVFLGKQLSL
>P39766 ~~~pyrP~~~Uracil permease~~~COG2233
MSKKKVNLGVRDVPTPFSWVSFSLQHLFAMFGSTILVPKLVGMSPAVALVTSGIGTLAYLLITKGQIPAYLGSSFAFISP
IILVKATGGPGAAMVGAFLAGLVYGLIALLIRQLGTGWLMKILPPVVVGPVIIVIGLGLASTAVNMAMYADPNASELVYS
LKHFSVAGVTLAITIICAIFLRGFLSLIPVLIGIIGGYLFALTQGIVNFQPVLDAKWFAVPEFIIPFKDYSPSVTLGIAA
AMVPVAFVTMSEHIGHQMVLSKVVGQDFIKKPGLHRSIMGDSVATILASLIGGPPTTTYGENIGVLAITRVFSVFVIGGA
AVIALCFGFIGKISALISSVPSAVMGGVSFLLFGIIASSGLRMLIDNKIDYENNRNLIITSVILVIGVGGAFIQVSQGGF
QVSGMALAAIVGVILNLILPQAKEEQADTSEQHHI
>P41007 ~~~pyrR~~~Bifunctional protein PyrR~~~
MQKAVVMDEQAIRRALTRIAHEIIERNKGIDGCVLVGIKTRGIYLARRLAERIEQIEGASVPVGELDITLYRDDLTVKTD
DHEPLVKGTNVPFPVTERNVILVDDVLFTGRTVRAAMDAVMDLGRPARIQLAVLVDRGHRELPIRADFVGKNVPTSRSEL
IVVELSEVDGIDQVSIHEK
>P39765 ~~~pyrR~~~Bifunctional protein PyrR~~~COG2065
MNQKAVILDEQAIRRALTRIAHEMIERNKGMNNCILVGIKTRGIYLAKRLAERIEQIEGNPVTVGEIDITLYRDDLSKKT
SNDEPLVKGADIPVDITDQKVILVDDVLYTGRTVRAGMDALVDVGRPSSIQLAVLVDRGHRELPIRADYIGKNIPTSKSE
KVMVQLDEVDQNDLVAIYENE
>F2MMP6 ~~~pyrR~~~Bifunctional protein pyrR~~~
MPKKEVVDAVTMKRALTRISYEIIERNKGIQDIVLVGIKTRGIYIAQRLAERLKQLEDIDVPVGELDITLYRDDVKDMEE
PELHSSDVPVSIEGKEVILVDDVLYTGRTIRAAMDAVMDLGRPRKISLAVLVDRGHRELPIRADYVGKNIPTSKTEEIIV
EMEERDGADRIMISKGNE
>P9WHK3 ~~~pyrR~~~Bifunctional protein PyrR~~~COG2065
MGAAGDAAIGRESRELMSAADVGRTISRIAHQIIEKTALDDPVGPDAPRVVLLGIPTRGVTLANRLAGNITEYSGIHVGH
GALDITLYRDDLMIKPPRPLASTSIPAGGIDDALVILVDDVLYSGRSVRSALDALRDVGRPRAVQLAVLVDRGHRELPLR
ADYVGKNVPTSRSESVHVRLREHDGRDGVVISR
>P65944 ~~~pyrR~~~Bifunctional protein PyrR~~~
MSERIIMDDAAIQRTVTRIAHEILEYNKGTDNLILLGIKTRGEYLANRIQDKIHQIEQQRIPTGTIDITYFRDDIEHMSS
LTTKDAIDIDTDITDKVVIIIDDVLYTGRTVRASLDAILLNARPIKIGLAALVDRGHRELPIRADFVGKNIPTSKEETVS
VYLEEMDQRNAVIIK
>P65946 ~~~pyrR~~~Bifunctional protein PyrR~~~COG2065
MKTKEVVDELTVKRAITRITYEIIERNKDLNKIVLAGIKTRGVFIAHRIQERLKQLENLSVPVVELDTKPFRDDVKSGED
TSLVSVDVTDREVILVDDVLYTGRTIRAAIDNIVGHGRPARVSLAVLVDRGHRELPIRPDYVGKNIPTSRSEEIIVEMTE
LDDQDRVLITEEA
>Q5SK65 ~~~pyrR~~~Bifunctional protein PyrR~~~COG2065
MRFKAELMNAPEMRRALYRIAHEIVEANKGTEGLALVGIHTRGIPLAHRIARFIAEFEGKEVPVGVLDITLYRDDLTEIG
YRPQVRETRIPFDLTGKAIVLVDDVLYTGRTARAALDALIDLGRPRRIYLAVLVDRGHRELPIRADFVGKNVPTSRSEVV
KVKVEEVDGEDRVELWEREGA
>Q59712 ~~~pyrC'~~~Dihydroorotase-like protein~~~COG0044
MTISILGARVIDPKTGLDQVTDLHLDGGRIAAIGAAPAGFSASRTIQADGVVAAPGLVDLGVSLREPGYSRKGNIRSETR
AAVAGGVTSLCCPPQTRPVLDTLAVAELILDRAREAANSKVYPIGALTKGLEGEQLAELVALRDTGCVAFGNGLKQIPNN
RTLARALEYAATFDLTVVFHSQDRDLAEGGLAHEGAMASFLGLPGIPESAETVALARNLLLVEQSGVRAHFSQITSARGA
QLIAQAQELGLPVTADVALYQLILTDESVRQFSSLYHVQPPLRTAKDRDGLRAAVKSGVIQAISSHHQPHERDAKLAPFG
ATEPGISSVELLLPLAMTLVQDGLLDLPTLLARLSSGPAAALRVPAGELKVGGAADLVLFDPQASTVAGEQWSSRGENCP
FIGHCLPGAVRYTLVDGHVCHGPE
>P11396 ~~~cpcD~~~Phycobilisome 8.9 kDa linker polypeptide, phycocyanin-associated, rod~~~
MFGQTTLGIDSVSSSASRVFRFEVVGMRQNEENDKNKYNIRRSGSVYITVPYNRMSEEMQRIHRLGGKIVKIEPLTRAAG
>P07124 ~~~cpcD~~~Phycobilisome 8.9 kDa linker polypeptide, phycocyanin-associated, rod~~~COG0369
MFGQTTLGAGSVSSSASRVFRYEVVGLRQSSETDKNKYNIRNSGSVFITVPYSRMNEEYQRITRLGGKIVKIEQLVSAEA
>Q06583 3.1.-.-~~~pys1~~~Pyocin-S1~~~COG4104
MARPIADLIHFNSTTVTASGDVYYGPGGGTGIGPIARPIEHGLDSSTENGWQEFESYADVGVDPRRYVPLQVKEKRREIE
LQFRDAEKKLEASVQAELDKADAALGPAKNLAPLDVINRSLTIVGNALQQKNQKLLLNQKKITSLGAKNFLTRTAEEIGE
QAVREGNINGPEAYMRFLDREMEGLTAAYNVKLFTEAISSLQIRMNTLTAAKASIEAAAANKAREQAAAEAKRKAEEQAR
QQAAIRAANTYAMPANGSVVATAAGRGLIQVAQGAASLAQAISDAIAVLGRVLASAPSVMAVGFASLTYSSRTAEQWQDQ
TPDSVRYALGMDAAKLGLPPSVNLNAVAKASGTVDLPMRLTNEARGNTTTLSVVSTDGVSVPKAVPVRMAAYNATTGLYE
VTVPSTTAEAPPLILTWTPASPPGNQNPSSTTPVVPKPVPVYEGATLTPVKATPETYPGVITLPEDLIIGFPADSGIKPI
YVMFRDPRDVPGAATGKGQPVSGNWLGAASQGEGAPIPSQIADKLRGKTFKNWRDFREQFWIAVANDPELSKQFNPGSLA
VMRDGGAPYVRESEQAGGRIKIEIHHKVRVADGGGVYNMGNLVAVTPKRHIEIHKGGK
>P73202 ~~~cpcD~~~Phycobilisome 8.9 kDa linker polypeptide, phycocyanin-associated, rod~~~COG0369
MLGQSSLVGYSNTQAANRVFVYEVSGLRQTDANENSAHDIRRSGSVFIKVPYARMNDEMRRISRLGGTIVNIRPYQADSN
EQN
>P50035 ~~~cpcD~~~Phycobilisome 8.9 kDa linker polypeptide, phycocyanin-associated, rod~~~COG0369
MFGQTASGSAALSPSGARVFRYEVVGLRQNEETDRMEFPIRRSGSTFITVPYNRMNEEMQRITRMGGKIVSITPVVAS
>Q06584 3.1.-.-~~~pys2~~~Pyocin-S2~~~
MAVNDYEPGSMVITHVQGGGRDIIQYIPARSSYGTPPFVPPGPSPYVGTGMQEYRKLRSTLDKSHSELKKNLKNETLKEV
DELKSEAGLPGKAVSANDIRDEKSIVDALMDAKAKSLKAIEDRPANLYTASDFPQKSESMYQSQLLASRKFYGEFLDRHM
SELAKAYSADIYKAQIAILKQTSQELENKARSLEAEAQRAAAEVEADYKARKANVEKKVQSELDQAGNALPQLTNPTPEQ
WLERATQLVTQAIANKKKLQTANNALIAKAPNALEKQKATYNADLLVDEIASLQARLDKLNAETARRKEIARQAAIRAAN
TYAMPANGSVVATAAGRGLIQVAQGAASLAQAISDAIAVLGRVLASAPSVMAVGFASLTYSSRTAEQWQDQTPDSVRYAL
GMDAAKLGLPPSVNLNAVAKASGTVDLPMRLTNEARGNTTTLSVVSTDGVSVPKAVPVRMAAYNATTGLYEVTVPSTTAE
APPLILTWTPASPPGNQNPSSTTPVVPKPVPVYEGATLTPVKATPETYPGVITLPEDLIIGFPADSGIKPIYVMFRDPRD
VPGAATGKGQPVSGNWLGAASQGEGAPIPSQIADKLRGKTFKNWRDFREQFWIAVANDPELSKQFNPGSLAVMRDGGAPY
VRESEQAGGRIKIEIHHKVRIADGGGVYNMGNLVAVTPKRHIEIHKGGK
>B3A0N7 ~~~pznA~~~Plantazolicin~~~
MTKITIPTALSAKVHGEGQHLFEPMAARCTCTTIISSSSTF
>D3VML5 ~~~pznA~~~Plantazolicin~~~
MTQIKVPTALIASVHGEGQHLFEPMAARCTCTTIISSSSTF
>A0A172J1R7 1.14.-.-~~~phzNO1~~~Phenazine N-monooxygenase PhzNO1~~~
MIDNDKTHFDAIVIGTGFAGIYMLHKLRNELGLKVRAFDKASGVGGTWYWNKYPGARADTESFVYRYSFDKETSEGWDWR
NRYVDQPEMLGYLQAVVDRHGLAKDIQLKTGIDSAVFDEIHHIWTLTTSTGELFTARYLVNAVGVLSKIVIPQIPGRDKF
QGQIVHTGAWPADLSLEGKRVGVIGTGSTGVQFICAASKLARHLTVFQRSAQFCVPAGSRQVSEAYVAEYKNNFEQIWDG
IRNSRIACGFEESGVSAMSVSEEERQKVFEHHWEIGNGFRFMFGTFSDIAVDPAANQAASDFIRAKIRQIVKDPETARKL
CPTDLYAKRPVCTNDYYETYNLPNVSLVSLPENPIKELTANGVLTEDGVEHKLDLLVFATGFETVEGSYNQMEIRGRGGE
TLQHHWKDAPSSYLGVATAGFPNMFMVLGPNSAFSNLPPSIESQVEWIGELIGWAEGESTPVIETTREAEEAWTATCKEI
AAYTLFPKVKSWIFGENIDGRSSRVLFYFGGLAGYRAKLREVADADYEGFILHSDPSSMVA
>P0A0J9 ~~~qacA~~~Antiseptic resistance protein~~~
MISFFTKTTDMMTSKKRWTALVVLAVSLFVVTMDMTILIMALPELVRELEPSGTQQLWIVDIYSLVLAGFIIPLSAFADK
WGRKKALLTGFALFGLVSLAIFFAESAEFVIAIRFLLGIAGALIMPTTLSMIRVIFENPKERATALAVWSIASSIGAVFG
PIIGGALLEQFSWHSAFLINVPFAIIAVVAGLFLLPESKLSKEKSHSWDIPSTILSIAGMIGLVWSIKEFSKEGLADIIP
WVVIVLAITMIVIFVKRNLSSSDPMLDVRLFKKRSFSAGTIAAFMTMFAMASVLLLASQWLQVVEELSPFKAGLYLLPMA
IGDMVFAPIAPGLAARFGPKIVLPSGIGIAAIGMFIMYFFGHPLSYSTMALALILVGAGMASLAVASALIMLETPTSKAG
NAAAVEESMYDLGNVFGVAVLGSLSSMLYRVFLDISSFSSKGIVGDLAHVAEESVVGAVEVAKATGIKQLANEAVTSFND
AFVATALVGGIIMIIISIVVYLLIPKSLDITKQK
>P14319 ~~~qacC~~~Quaternary ammonium compound-resistance protein QacC~~~
MPYIYLIIAISTEVIGSAFLKSSEGFSKFIPSLGTIISFGICFYFLSKTMQHLPLNITYATWAGLGLVLTTVVSIIIFKE
QINLITIVSIVLIIVGVVSLNIFGTSH
>P0A0N3 ~~~qacR~~~HTH-type transcriptional regulator QacR~~~
MNLKDKILGVAKELFIKNGYNATTTGEIVKLSESSKGNLYYHFKTKENLFLEILNIEESKWQEQWKKEQIKCKTNREKFY
LYNELSLTTEYYYPLQNAIIEFYTEYYKTNSINEKMNKLENKYIDAYHVIFKEGNLNGEWCINDVNAVSKIAANAVNGIV
TFTHEQNINERIKLMNKFSQIFLNGLSK
>P0A0N4 ~~~qacR~~~HTH-type transcriptional regulator QacR~~~
MNLKDKILGVAKELFIKNGYNATTTGEIVKLSESSKGNLYYHFKTKENLFLEILNIEESKWQEQWKKEQIKCKTNREKFY
LYNELSLTTEYYYPLQNAIIEFYTEYYKTNSINEKMNKLENKYIDAYHVIFKEGNLNGEWCINDVNAVSKIAANAVNGIV
TFTHEQNINERIKLMNKFSQIFLNGLSK
>P0A0N5 ~~~qacR~~~HTH-type transcriptional regulator QacR~~~
MNLKDKILGVAKELFIKNGYNATTTGEIVKLSESSKGNLYYHFKTKENLFLEILNIEESKWQEQWKKEQIKCKTNREKFY
LYNELSLTTEYYYPLQNAIIEFYTEYYKTNSINEKMNKLENKYIDAYHVIFKEGNLNGEWCINDVNAVSKIAANAVNGIV
TFTHEQNINERIKLMNKFSQIFLNGLSK
>Q8VUS8 1.4.2.-~~~qhnDH~~~Quinohemoprotein amine dehydrogenase subunit gamma~~~
MNALVGCTTSFDPGWEVDAFGAVSNLCQPMEADLYGCADPCWWPAQVADTLNTYPNWSAGADDVMQDWRKLQSVFPETKG
SS
>P0A182 1.4.9.-~~~qhnDH~~~Quinohemoprotein amine dehydrogenase subunit gamma~~~
MSAVAGCTATTDPGWEVDAFGGVSSLCQPMEADLYGCSDPCWWPAQVPDMMSTYQDWNAQASNSAEDWRNLGTVFPKDK
>P46911 ~~~qcrA~~~Menaquinol:cytochrome c reductase iron-sulfur subunit~~~COG0723
MGGKHDISRRQFLNYTLTGVGGFMAASMLMPMVRFALDPVLKSTGKQDMVQVVSVDELTKEPQRFDFKINQVDAWYESEE
SRSAWVFKNGDEIVALSPICKHLGCTVNWNSDPKNPNKFFCPCHYGLYEKDGTNVPGTPPLAPLDHYEQEVKDGFLYLGK
AKPKGEG
>Q79VE8 ~~~qcrA~~~Cytochrome bc1 complex Rieske iron-sulfur subunit~~~COG0723
MSNNNDKQYTTQELNAMSNEDLARLGTELDDVTIAYRKERFPIANDPAEKRAARAVTFWLVLGIIGGLGFLATYIFWPWE
YKAHGDEGLLAYTLYTPMLGITSGLCILSLGFAVVLYVKKFIPEEIAVQRRHDGPSEEVDRRTIVALLNDSWQTSTLGRR
KLIMGLAGGGAVLAGLTIIAPMGGMIKNPWNPKEGPMDVQGDGTLWTSGWTLVENDVKVYLGRDTAAIAESHTDATGEHW
STTGVSRLVRMRPEDLAAASMETVFPLPAEMVNDGAEYDPAKDVYEHQMHSVHGPRNAVMLIRLRTADAEKVIEREGQES
FHYGDYYAYSKICTHIGCPTSLYEAQTNRILCPCHQSQFDALHYGKPVFGPAARALPQLPITVDEEGYLIAAGNFIEPLG
PAFWERKS
>Q45657 ~~~qcrA~~~Menaquinol:cytochrome c reductase iron-sulfur subunit~~~
MSDNKHRVTRRQFLNYTLTGVGGFMAAGMLMPMLRFAFDPILRETAGTDMVAVADVKEITTEPKRFDFKVKVKDAWYESE
EPRSAWVYKDEKGDIIALSPVCKHLGCTVDWNTDKNNPNHFFCPCHYGLYTKDGTNVPGTPPTAPLDRYEFEVKDGKLYL
GKAKPRGEA
>P9WH23 ~~~qcrA~~~Cytochrome bc1 complex Rieske iron-sulfur subunit~~~COG0723
MSRADDDAVGVPPTCGGRSDEEERRIVPGPNPQDGAKDGAKATAVPREPDEAALAAMSNQELLALGGKLDGVRIAYKEPR
WPVEGTKAEKRAERSVAVWLLLGGVFGLALLLIFLFWPWEFKAADGESDFIYSLTTPLYGLTFGLSILSIAIGAVLYQKR
FIPEEISIQERHDGASREIDRKTVVANLTDAFEGSTIRRRKLIGLSFGVGMGAFGLGTLVAFAGGLIKNPWKPVVPTAEG
KKAVLWTSGWTPRYQGETIYLARATGTEDGPPFIKMRPEDMDAGGMETVFPWRESDGDGTTVESHHKLQEIAMGIRNPVM
LIRIKPSDLGRVVKRKGQESFNFGEFFAFTKVCSHLGCPSSLYEQQSYRILCPCHQSQFDALHFAKPIFGPAARALAQLP
ITIDTDGYLVANGDFVEPVGPAFWERTTT
>P46912 ~~~qcrB~~~Menaquinol:cytochrome c reductase cytochrome b subunit~~~COG1290
MLNKIYDWVDERLDITPMWRDIADHEVPEHVNPAHHFSAFVYCFGGLTFFVTVIQVLSGMFLTMYYVPDIKNAWESVYYL
QNEVAFGQIVRGMHHWGASLVIVMMFLHTLRVFFQGAYKKPRELNWIVGVLIFFVMLGLGFTGYLLPWDMKALFATKVGL
QIAEATPLIGTQVKTLLAGHPDIVGAQTLTRFFAIHVFFLPAALFGLMAAHFIMIRKQGISGPL
>Q79VE9 7.1.1.8~~~qcrB~~~Cytochrome bc1 complex cytochrome b subunit~~~COG1290
MSLATVGNNLDSRYTMASGIRRQINKVFPTHWSFMLGEIALYSFIVLLLTGVYLTLFFDPSITKVIYDGGYLPLNGVEMS
RAYATALDISFEVRGGLFIRQMHHWAALLFVVSMLVHMLRIFFTGAFRRPREANWIIGVVLIILGMAEGFMGYSLPDDLL
SGVGLRIMSAIIVGLPIIGTWMHWLIFGGDFPSDLMLDRFYIAHVLIIPAILLGLIAAHLALVWYQKHTQFPGAGRTENN
VIGIRIMPLFAVKAVAFGLIVFGFLALLAGVTTINAIWNLGPYNPSQVSAGSQPDVYMLWTDGAARVMPAWELYLGNYTI
PAVFWVAVMLGILVVLLVTYPFIERKFTGDDAHHNLLQRPRDVPVRTSLGVMALVFYILLTVSGGNDVYAMQFHVSLNAM
TWIGRIGLIVGPAIAYFITYRLCIGLQRSDREVLEHGIETGIIKQMPNGAFIEVHQPLGPVDDHGHPIPLPYAGAAVPKQ
MNQLGYAEVETRGGFFGPDPEDIRAKAKEIEHANHIEEANTLRALNEANIERDKNEGKN
>Q45658 ~~~qcrB~~~Menaquinol:cytochrome c reductase cytochrome b subunit~~~
MLNKLYDWVDERLDITPLWRDIADHEVPEHVNPAHHFSAFVYCFGGLTFFVTVIQILSGMFLTMYYVPDIKNAWESVYYL
QNEVAFGQIVRGMHHWGASLVIVMMFLHTLRVFFQGAYKKPREMNWIVGVLIFMVMMGLGFTGYLLPWDMKALFATKVGL
QIAEAVPLIGPAIKTLLAGDPEIVGAQTLARFFAIHVFFLPAALLGLMAAHFLMIRRQGISGPL
>P9WP37 7.1.1.8~~~qcrB~~~Cytochrome bc1 complex cytochrome b subunit~~~COG1290
MSPKLSPPNIGEVLARQAEDIDTRYHPSAALRRQLNKVFPTHWSFLLGEIALYSFVVLLITGVYLTLFFDPSMVDVTYNG
VYQPLRGVEMSRAYQSALDISFEVRGGLFVRQIHHWAALMFAAAIMVHLARIFFTGAFRRPRETNWVIGSLLLILAMFEG
YFGYSLPDDLLSGLGLRAALSSITLGMPVIGTWLHWALFGGDFPGTILIPRLYALHILLLPGIILALIGLHLALVWFQKH
TQFPGPGRTEHNVVGVRVMPVFAFKSGAFFAAIVGVLGLMGGLLQINPIWNLGPYKPSQVSAGSQPDFYMMWTEGLARIW
PPWEFYFWHHTIPAPVWVAVIMGLVFVLLPAYPFLEKRFTGDYAHHNLLQRPRDVPVRTAIGAMAIAFYMVLTLAAMNDI
IALKFHISLNATTWIGRIGMVILPPFVYFITYRWCIGLQRSDRSVLEHGVETGIIKRLPHGAYIELHQPLGPVDEHGHPI
PLQYQGAPLPKRMNKLGSAGSPGSGSFLFADSAAEDAALREAGHAAEQRALAALREHQDSIMGSPDGEH
>P46913 ~~~qcrC~~~Menaquinol:cytochrome c reductase cytochrome c subunit~~~COG1290
MHRGKGMKFVGDSRIPAEKKPNIPKDYSEYPGKTEAFWPNFLLKEWMVGAVFLIGFLVLTIVHQPPLERMADPTDTGYIP
LPDWYFLFLYQLLKYEYAAGSFTVVGAMIMPGLAFGALLLAPFLDRGTERRPWKRPVAVGMMLLAISAAVFLTWQSVATH
DWAKAEEQGKITKEADIDTNAEGYKVFKEQGCISCHGDNLQGGAAGPSLVDSGLKPDEIKKIAVEGKGKMPAGVFKGNDK
QLEELAKFISETTAK
>Q8NNK5 7.1.1.8~~~qcrC~~~Cytochrome bc1 complex cytochrome c subunit~~~COG2010
MAKPSAKKVKNRRKVRRTVAGALALTIGLSGAGILATAITPDAQVATAQRDDQALISEGKDLYDVACITCHGVNLQGVED
RGPSLVGVGEGAVYFQVHSGRMPILRNEAQAERKAPRYTEAQTLAIAAYVAANGGGPGLVYNEDGTLAMEELRGENYDGQ
ITSADVARGGDLFRLNCASCHNFTGRGGALSSGKYAPNLDAANEQEIYQAMLTGPQNMPKFSDRQLSADEKKDIIAFIKS
TKETPSPGGYSLGSLGPVAEGLFMWVFGILVLVAAAMWIGSRS
>Q45659 ~~~qcrC~~~Menaquinol:cytochrome c reductase cytochrome c subunit~~~
MHRGKGMKFVGDSRIPAVRKPNIPKDYSEYPGKTEVFWPNFLLKEWLVGSVFLVGFLCLTVAHPSPLERIADPTDTTYIP
LPDWYFLFLYQLLKYSYASGPYTVIGAIVMPGLAFGALLLAPFLDRGPERRPWKRPVATGMMLLTLAAIVYLTWESVVTH
DWEKAAEQGKIRAEVEIDTNAEGYKIAQANTCTSCHGENLSGGAGPSLVGTGLTAEEIAKIAKEGQGSMPGGIFKGTDEE
LQKMANSSPA
>P9WP35 7.1.1.8~~~qcrC~~~Cytochrome bc1 complex cytochrome c subunit~~~COG2010
MTKLGFTRSGGSKSGRTRRRLRRRLSGGVLLLIALTIAGGLAAVLTPTPQVAVADESSSALLRTGKQLFDTSCVSCHGAN
LQGVPDHGPSLIGVGEAAVYFQVSTGRMPAMRGEAQAPRKDPIFDEAQIDAIGAYVQANGGGPTVVRNPDGSIATQSLRG
NDLGRGGDLFRLNCASCHNFTGKGGALSSGKYAPDLAPANEQQILTAMLTGPQNMPKFSNRQLSFEAKKDIIAYVKVATE
ARQPGGYLLGGFGPAPEGMAMWIIGMVAAIGLALWIGARS
>P42106 1.13.11.24~~~qdoI~~~Quercetin 2,3-dioxygenase~~~COG1917
MKTLCTHSLPKEKMPYLLRSGEGERYLFGRQVATVMANGRSTGDLFEIVLLSGGKGDAFPLHVHKDTHEGILVLDGKLEL
TLDGERYLLISGDYANIPAGTPHSYRMQSHRTRLVSYTMKGNVAHLYSVIGNPYDHAEHPPYASEEVSNERFAEAAAVAD
IVFLDEAKPACSAKLAELTELPDGAVPYVLESGEGDRLLTGDQLHRIVAAQKNTDGQFIVVSSEGPKGDRIVDHYHEYHT
ETFYCLEGQMTMWTDGQEIQLNPGDFLHVPANTVHSYRLDSHYTKMVGVLVPGLFEPFFRTLGDPYEGHIFPCEPQALRF
DRILQNIEALDLKVMKP
>O33472 1.13.11.47~~~qdo~~~1H-3-hydroxy-4-oxoquinoline 2,4-dioxygenase~~~
MQSLNVNGTLMTYSESGDPHAPTLFLLSGWCQDHRLFKNLAPLLARDFHVICPDWRGHDAKQTDSGDFDSQTLAQDLLAF
IDAKGIRDFQMVSTSHGCWVNIDVCEQLGAARLPKTIVIDWLLQPHPGFWQQLAEGQHPTEYVAGRQSFFDEWAETTDNA
DVLNHLRNEMPWFHGEMWQRACREIEANYRTWGSPLDRMESLPQKPEICHIYSQPLSQDYRQLQLDFAAGHSWFHPRHIP
GRTHFPSLENPVAVAQAIREFLQA
>Q9Z4J7 1.1.2.8~~~exaA~~~Quinoprotein ethanol dehydrogenase~~~
MTTRTSPAPAGLLRPSLHCLAFAVALGSAGAALAKDVTWEDIANDDKTTGDVLQYGMGTHAQRWSPLKQVNADNVFKLTP
AWSYSFGDEKQRGQESQAIVSDGVIYVTASYSRLFALDAKTGKRLWTYNHRLPDDIRPCCDVVNRGAAIYGDKVFFGTLD
ASVVALNKNTGKVVWKKKFADHGAGYTMTGAPTIVKDGKTGKVLLIHGSSGDEFGVVGRLFARDPDTGEEIWMRPFVEGH
MGRLNGKDSTVTGDVKAPSWPDDRNSPTGKVESWSHGGGAPWQSASFDAETNTIIVGAGNPGPWNTWARTAKGGNPHDYD
SLYTSGQVGVDPSSGEVKWFYQHTPNDAWDFSGNNELVLFDYKAKDGKIVKATAHADRNGFFYVVDRSNGKLQNAFPFVD
NITWASHIDLKTGRPVEREGQRPPLPEPGQKHGKAVEVSPPFLGGKNWNPMAYSQDTGLFYVPANHWKEDYWTEEVSYTK
GSAYLGMGFRIKRMYDDHVGSLRAMDPVSGKVVWEHKEHLPLWAGVLATAGNLVFTGTGDGYFKAFDAKSGKELWKFQTG
SGIVSPPITWEQDGEQYLGVTVGYGGAVPLWGGDMADLTRPVAQGGSFWVFKLPSWDNRTASR
>A8R3S4 1.1.2.8~~~qedA~~~Quinoprotein ethanol dehydrogenase~~~
MTIRSLPAALSPLSMAVQAVLLVSSLALAPAANAKPVTWEDIANDHLNTQNVLQYGMGTNAQRWSPLAMVNDKNVFKLTP
AWSYSFGDERQRGQESQAIINDGVIYVTGSYSRVFALDAKTGRRLWTYNHRLPDNIRPCCDVVNRGAAIFGDKIYFGTLD
ARVIALNKDTGKVVWNKKFGDHSAGYTMTGAPTLIKDQKSGKVLLIHGSSGDEFGVVGQLYARDPETGEEVWMRPFVEGH
MGRLNGKDSTPTGDVKAPSWPDDPTTETGKVESWSHGGGAPWQSASFDPETNTIIVGAGNPGPWNTWARTSKDGNPHDFD
SLYTSGQVGVDPTTGEVKWFYQHTPNDAWDFSGNNELVLFDYKDKDGKQYKATAHADRNGFFYVVDRTNGKLKNAFPFVD
NITWASHIDLKTGRPVENEGQRPAKPLPGETKGKPVEVSPPFLGGKNWNPMAYSQDTGLFYVPANHWKEEYWTEEVNYKK
GSAYLGIGFRIKRMYEDHVGSLRAMDPTTGKVVWEHNERLPLWAGVLATKGNLVFTGTGDGYFKAFNAKTGEELWKFQTG
SGIVSPPITWEQDGEQYIGVTVGYGGAVPLWGGDMAELTKPVAQGGSFWVFKIPAWDTKTAKR
>Q4W6G0 1.1.9.1~~~qgdA~~~Quinohemoprotein alcohol dehydrogenase ADH-IIG~~~
MRQTGLASLPLKSLAVAVLLSLAGTPALAADIPANVDGARIIAADKEPGNWMSTGRTYDEQRYSPLKQISDQNVGQLGLA
WSYKLDLDRGVEATPIVVDGAMYTTGPFSVVYALDARDGRLIWKYDPQSDRHRAGEACCDAVNRGVAVWKGKVYVGVLDG
RLEAIDAKTGQRAWSVDTRADHKRSYTITGAPRVVNGKVVIGNGGAEFGVRGYVTAYDAETGKEAWRFYTVPGDPKLPPE
GKGMEIAAKTWFGDAYVEQGGGGTAWDSFAYDPELNLLYIGVGNGSLWDPKWRSQAKGDNLFLSSIVAVNADTGEYVWHY
QTTPGDAWDYTATQHMILAELPIDGKPRKVLMQAPKNGFFYVIDRATGELLSAKGIVPQSWTKGMDMKTGRPILDEENAA
YWKNGKRNLVTPAFWGAHDWQPMSYNPDTGLVYIPAHIMSAYYEHIPEAPKRNPFKSMYQLGLRTGMMPEGAEGLLEMAK
SWSGKLIAWDPVKQQAAWEVPYVTIFNGGTLSTAGNLVFEGSADGRVIAYAADTGEKLWEQPAASGVMAAPVTYSVDGEQ
YVTFMAGWGGAFSTFAGALSLRAGVQPYAQVLTYKLGGTAKLQEPAPRPDTPKPPALSNDTASIEAGAKLYDGYCSQCHG
IHAVSGGVLPDLRKLTPEKHQMFLGILFGGRVPDGMPSFADAFTPEQVDQIHQYLIKRAHDLHQEGDTWKQFSAKSSH
>Q46444 1.1.9.1~~~qheDH~~~Quinohemoprotein alcohol dehydrogenase~~~
MERLIDNSHGWPGRMVWLLAACLGSAAAFAQTGPAAQAAAAVQRVDGDFIRANAARTPDWPTIGVDYAETRYSRLDQINA
ANVKDLGLAWSYNLESTRGVEATPVVVDGIMYVSASWSVVHAIDTRTGNRIWTYDPQIDRSTGFKGCCDVVNRGVALWKG
KVYVGAWDGRLIALDAATGKEVWHQNTFEGQKGSLTITGAPRVFKGKVIIGKRGAEYGVRGYITAYDAETGERKWRWFSV
PGDPSKPFEDESMKRAARTWDPSGKWWEAGGGGTMWDSMTFDAELNTMYVGTGNGSPWSHKVRSPKGGDNLYLASIVALD
PDTGKYKWHYQETPGDNWDYTSTQPMILADIKIAGKPRKVILHAPKNGFFFVLDRTNGKFISAKNFVPVNWASGYDKHGK
PIGIAAARDGSKPQDAVPGPYGAHNWHPMSFNPQTGLVYLPAQNVPVNLMDDKKWEFNQAGPGKPQSGTGWNTAKFFNAE
PPKSKPFGRLLAWDPVAQKAAWSVEHVSPWNGGTLTTAGNVVFQGTADGRLVAYHAATGEKLWEAPTGTGVVAAPSTYMV
DGRQYVSVAVGWGGVYGLAARATERQGPGTVYTFVVAGKARMPEFVAQRTGQLLQGVKYDPAKVEAGTMLYVANCVFCHG
VPGVDRGGNIPNLGYMDASYIENLPNFVFKGPAMVRGMPDFTGKLSGDDVESLKAFIQGTADAIRPKP
>Q8GR64 1.1.9.1~~~qbdA~~~Quinohemoprotein alcohol dehydrogenase ADH IIB~~~
MKKPLRTSLLMLCLATPLAALAAGVDEAAIRATEQAGGEWLSHGRTYAEQRFSPLKQIDASNVRSLGLAWYMDLDNTRGL
EATPLFHDGVIYTSMSWSRVIAVDAASGKELWRYDPEVAKVKARTSCCDAVNRGVALWGDKVYVGTLDGRLIALDAKTGK
AIWSQQTTDPAKPYSITGAPRVVKGKVIIGNGGAEYGVRGFVSAYDADTGKLAWRFYTVPGDPALPYEHPELREAAKTWQ
GDQYWKLGGGGTVWDSMAYDPELDLLYVGTGNGSPWNREVRSPGGGDNLYLSSILAIRPDTGKLAWHYQVTPGDSWDFTA
TQQITLAELNIDGKPRKVLMQAPKNGFFYVLDRTNGKLISAEKFGKVTWAEKVDLATGRPVEAPGVRYEKEPIVMWPSPF
GAHNWHSMSFNPGTGLVYIPYQEVPGVYRNEGKDFVTRKAFNTAAGFADATDVPAAVVSGALLAWDPVKQKAAWKVPYPT
HWNGGTLSTAGNLVFQGTAAGQMHAYSADKGEALWQFEAQSGIVAAPMTFELAGRQYVAIMAGWGGVATLTGGESMNLPG
MKNRSRLLVFALDGKAQLPPPAPAPAKVERVPQPVTAAPEQVQAGKQLYGQFCSVCHGMGTISGGLIPDLRQSSDATREH
FQQIVLQGALKPLGMPSFDDSLKPEEVEQIKLYVMSREYEDYMARHKAAP
>P0AA53 ~~~qmcA~~~Protein QmcA~~~COG0330
MLIFIPILIFVALVIVGAGVKIVPQGYQWTVERFGRYTKTLQPGLSLVVPFMDRIGRKINMMEQVLDIPSQEVISKDNAN
VTIDAVCFIQVIDAPRAAYEVSNLELAIINLTMTNIRTVLGSMELDEMLSQRDSINSRLLRIVDEATNPWGIKVTRIEIR
DVRPPAELISSMNAQMKAERTKRAYILEAEGIRQAEILKAEGEKQSQILKAEGERQSAFLQAEARERSAEAEARATKMVS
EAIASGDIQAVNYFVAQKYTEALQQIGSSSNSKVVMMPLEASSLMGSIAGIAELVKDSANKRTQP
>D1C7A6 3.2.2.-~~~~~~Queuosine 5'-phosphate N-glycosylase/hydrolase~~~
MADPGDRLGVLTTTRRVVEQAQAVWIDHDAVAQIAEAFAARQVTPPTWNRELHWSDGREALANYILVLDAVNFCFWGEPR
WRIEYAGAVYDGYWALAASLKRALEQGVPLTDASYLAEITRDDVATIFAGEGEIPLLDERARILRETGSVLAERFAGRFS
DAIAAAGRSAVALVDIVTNAFPSFRDVATYRGEQVRFYKRAQILVSDLYGAFDGSDLGAFDDLGELTAFADYKVPQVLHH
LGILRYAPALHDRLARREEIPAGSPEEVEIRAATIWGVEELRRALASRGHALDAYQVDWLLWDEGQRLPAGTLPYHRTRT
IFY
>Q87K03 ~~~~~~Pentapeptide repeat protein VPA0095~~~COG1357
MLKTDLIFERENFSHHDFQNATFKNCHFYMCSFDHADLRDAKFIDCRFIESKALEGCSFRFANLKDASFTNCMLAMSLFN
GANCMGLELRKCDLKGANFQGANFANRVSNTMFFCSAFITGCNLTYCNFERVLLEKCDLFENRWNGANLAGATLKGSDLS
RCEFSPEQWGTFNVEQCDLTHVELDGLDIRRVSLFGVKICDWQQEQLLAPFGLIIL
>P28304 1.6.5.5~~~qorA~~~Quinone oxidoreductase 1~~~COG0604
MATRIEFHKHGGPEVLQAVEFTPADPAENEIQVENKAIGINFIDTYIRSGLYPPPSLPSGLGTEAAGIVSKVGSGVKHIK
AGDRVVYAQSALGAYSSVHNIIADKAAILPAAISFEQAAASFLKGLTVYYLLRKTYEIKPDEQFLFHAAAGGVGLIACQW
AKALGAKLIGTVGTAQKAQSALKAGAWQVINYREEDLVERLKEITGGKKVRVVYDSVGRDTWERSLDCLQRRGLMVSFGN
SSGAVTGVNLGILNQKGSLYVTRPSLQGYITTREELTEASNELFSLIASGVIKVDVAEQQKYPLKDAQRAHEILESRATQ
GSSLLIP
>P39315 1.6.5.2~~~qorB~~~Quinone oxidoreductase 2~~~COG0702
MIAITGATGQLGHYVIESLMKTVPASQIVAIVRNPAKAQALAAQGITVRQADYGDEAALTSALQGVEKLLLISSSEVGQR
APQHRNVINAAKAAGVKFIAYTSLLHADTSPLGLADEHIETEKMLADSGIVYTLLRNGWYSENYLASAPAALEHGVFIGA
AGDGKIASATRADYAAAAARVISEAGHEGKVYELAGDSAWTLTQLAAELTKQSGKQVTYQNLSEADFAAALKSVGLPDGL
ADMLADSDVGASKGGLFDDSKTLSKLIGHPTTTLAESVSHLFNVNN
>P98009 1.10.3.-~~~cyaA~~~Ubiquinol oxidase subunit 1~~~
MLGRLSLSAIPLDVPILVGTFIGVVIVGVAVLGLITYYGKWGYLWKEWFTSVDHKRLAAMYIILALVALFRGFADAIMMR
TQLALAYAGNPGYLPPHHYDQIFSAHGTIMIFFLAMAFMTGLFNFIVPLQIGARDVAFPFLNNLSFWMTAVAFILVNVSL
FIGEFSQCGWLAYPPLSENQFSPGVGVDYYIWAVQISGVGTLLTGVNFFVTIVKMRAPGMTWRKMPVFTWTALCASILIM
VAFPVLTVAVGLLGMDRYFGMHFFTNDGGGNQMMYLNLIWAWGHPEVYILVIPAFGVFSEVVPAFSGKPLFGYSTMVYAT
CSIMVLSFLVWVHHFFTMGAGPDVNAFFGIATMIISIPTGIKLFNWLFTMYKGRIQFHACMYWAVGFMITFTIGGMTGVM
LAIPGADFVLHNSLFLIAHFHNTIIGGVYFGYICGMNFWFPKVMGFKLDETWGKRAFWFWFVGFYCAFVPLYIVGFEGMT
RRLNHYDNPAWHPWLLVAEVGAVLVMLGIACQLTQLYVSIRDRNLPQNRDVTGDPWNGRTLEWSTSSPPPVYNFAIVPHV
HELDTFMLDKENGIDTRQAGAQYEAIHMPKNTSFGSGLCKCSALIFGFAAVWYIWWLAAVGLVGVIGTVIARSADKDIDY
YIPAEEVARIENEHTRKLMAQAAE
>E0TW66 1.10.3.-~~~qoxB~~~Quinol oxidase subunit 1~~~
MKFKWDEFFVTGDPLILGAQVSIALSTIAIIFVLTYFKKWKWLWSEWITTVDHKKLGIMYIISAVIMLFRGGVDGLMMRA
QLALPNNSFLDSNHYNEIFTTHGTIMIIFMAMPFLIGLINVVVPLQIGARDVAFPYLNNLSFWTFFVGAMLFNISFVIGG
SPNAGWTSYMPLASNDMSPGPGENYYLLGLQIAGIGTLMTGINFMVTILKMRTKGMTLMRMPMFTWTTLITMVIIVFAFP
VLTVALALLSFDRLFGAHFFTLEAGGMPMLWANLFWIWGHPEVYIVILPAFGIFSEIISSFARKQLFGYTAMVGSIIAIS
VLSFLVWTHHFFTMGNSASVNSFFSITTMAISIPTGVKIFNWLFTMYKGRISFTTPMLWALAFIPNFVIGGVTGVMLAMA
AADYQYHNTYFLVSHFHYVLIAGTVFACFAGFIFWYPKMFGHKLNERIGKWFFWIFMIGFNICFFPQYFLGLQGMPRRIY
TYGPNDGWTTLNFISTVGAFMMGVGFLILCYNIYYSFRYSTREISGDSWGVGRSLDWATSSAIPPHYNFAVLPEVKSKDA
FHHMKEEKTELYPESKFKKIHMPSNSGRPFFMSVAFGIAGFGLVFEWYWMGVVGLIGVLLCMVLRSFEYDNGYYISVDEI
KETERKISE
>P34956 1.10.3.-~~~qoxB~~~Quinol oxidase subunit 1~~~COG0843
MKFKWDEFFVTGDPLILGAQVSIALSTIAIIFVLTYFKKWKWLWSEWITTVDHKKLGIMYIISAVIMLFRGGVDGLMMRA
QLALPNNSFLDSNHYNEIFTTHGTIMIIFMAMPFLIGLINVVVPLQIGARDVAFPYLNNLSFWTFFVGAMLFNISFVIGG
SPNAGWTSYMPLASNDMSPGPGENYYLLGLQIAGIGTLMTGINFMVTILKMRTKGMTLMRMPMFTWTTLITMVIIVFAFP
VLTVALALLSFDRLFGAHFFTLEAGGMPMLWANLFWIWGHPEVYIVILPAFGIFSEIISSFARKQLFGYKAMVGSIIAIS
VLSFLVWTHHFFTMGNSASVNSFFSITTMAISIPTGVKIFNWLFTMYKGRISFTTPMLWALAFIPNFVIGGVTGVMLAMA
AADYQYHNTYFLVSHFHYVLIAGTVFACFAGFIFWYPKMFGHKLNERIGKWFFWIFMIGFNICFFPQYFLGLQGMPRRIY
TYGPNDGWTTLNFISTVGAFMMGVGFLILCYNIYYSFRYSTREISGDSWGVGRTLDWATSSAIPPHYNFAVLPEVKSQDA
FLHMKEEKTELYPESKFKKIHMPSNSGRPFFMSVAFGLAGFGLVFEWYWMGVVGLIGVLLCMVLRSFEYDNGYYISVDEI
KETERKISE
>Q7A699 1.10.3.-~~~qoxB~~~Probable quinol oxidase subunit 1~~~
MNFPWDQLLVKGNWMITMAQIGAPFLVIGLIAVITYFKLWKYLYKEWFTSVDHKKIGIMYLICAVLMFVRGGIDALLIRA
QLTVPDNKFLESNHYNEIFSTHGVIMIIFMAMPFIFGLWNIVVPLQIGARDVAFPVLNNVSFWLFFAGMILFNLSFIIGG
SPAAGWTNYAPLAGEFSPGPGVNYYLIAIQISGLGTLATGINFFVTILRCKTPTMKFMQMPMFTVTTFITTLIVILAFPP
LTVALALMTTDRIFDTAFFTVAHGGMPMLWANFFWVWGHPEVYIVILPAFGIYSEIIPTFARKRLFGHQSMVWATAGIAF
LSFLVWVHHFFTMGNGALINSFFSISTMLIGIPTGVKLFNWLLTLYKGRITFESPMLFSLAFIPNFLLGGVTGVMLAMAS
ADYQYHNTYFLVAHFHYTLVTGVVFACLAGLIFWYPKMMGYKLNETLNKWCFWFFMIGFNVCFLPQFILGLDGMPRRLYT
YMPSDGWFLLNLISTIGALLMAIGFLFLVVSIVYSHFKSPREATGDNWDGLGRTLEWTTASAIPPKYNFAITPDWNDYDT
FVDMKEHGRHYLDNHNYKDIHMPNNTPVGFWIGIFMTIGGFFLIFETVIPALICLFGIFGTMIYRSFQIDHGYHIPAAEV
AETEARLREARIKEREAVSHES
>Q81HT3 1.10.3.-~~~qoxA~~~Quinol oxidase subunit 2~~~
MQLKKAFWKLASLLPLSLLLFLGGCDKKLAVLNPQGPVAKAQYDLIVWSFLLMSLIIAIVFILFTVILIRYREKPENMDY
EPPEQHGNTLLEIIWTLVPVIIVIALSIPTVKATYASEEVPKESKHIKPVEIYVTSANWKWLFSYPEEKIETVNYLNIPA
GVPIQFKLTSVGPMNAFWVPELGGMKYTMDGMIMDLYLQADKPGSYLGRSANFSGEGFTHMEFEVEAKTKEKYDKWVKEV
QETAPKLTEAKYNEIVKPGVVGRMTFSSHHLSYVDPKSLEYCDYNYYKNKK
>E0TW67 1.10.3.-~~~qoxA~~~Quinol oxidase subunit 2~~~
MIFLFRALKPLLVLALLTVVFVLGGCSNASVLDPKGPVAEQQSDLILLSIGFMLFIVGVVFVLFTIILVKYRDRKGKDNG
SYNPKIHGNTFLEVVWTVIPILIVIALSVPTVQTIYSLEKAPEATKDKEPLVVHATSVDWKWVFSYPEQDIETVNYLNIP
VDRPILFKISSADSMASLWIPQLGGQKYAMAGMLMDQYLQADEVGTYQGRNANFTGEHFADQEFDVNAVTEKDFNSWVKK
TQNEAPKLTKEKYDQLMLPENVDELTFSSTHLKYVDHGQDAEYAMEARKRLGYQAVSPHSKTDPFENVKENEFKKSDDTE
E
>P34957 1.10.3.-~~~qoxA~~~Quinol oxidase subunit 2~~~COG1622
MIFLFRALKPLLVLALLTVVFVLGGCSNASVLDPKGPVAEQQSDLILLSIGFMLFIVGVVFVLFTIILVKYRDRKGKDNG
SYNPEIHGNTFLEVVWTVIPILIVIALSVPTVQTIYSLEKAPEATKDKEPLVVYATSVDWKWVFSYPEQDIETVNYLNIP
VDRPILFKISSADSMASLWIPQLGGQKYAMAGMLMDQYLQADKVGTYEGRNANFTGEHFADQEFDVNAVTEKDFNSWVKK
TQNEAPKLTKEKYDELMLPENVDELTFSSTHLKYVDHGQDAEYAMEARKRLGYQAVSPHSKTDPFENVKKNEFKKSDDTE
E
>Q7A698 1.10.3.-~~~qoxA~~~Probable quinol oxidase subunit 2~~~
MSKFKSLLLLFGTLILLSGCSNIEIFNAKGPVASSQKFLILYSIVFMLVICFVVLGMFAIFIYKYSYNKNAESGKMHHNA
IIETIWFVIPIIIVAALAIPTVKTLYDYEKPPKSEKDPMVVYAVSAGYKWFFAYPDEHIETVNTLTIPKDRPVVFKLQAM
DTMTSFWIPQLGGQKYAMTGMTMNWTLEASQTGTFRGRNSNFNGEGFSRQTFKVNAVSQKDYDKWVKEVKGKKTLDQDTF
DKQLLPSTPNKALEFNGTHMAFVDPAADPEYIFYAYKRFNFELKDPNFTSEENMFKDVSDKPLIPARKAQITNANYKRHG
MKLMILGNDEPYNNEFKKDESKNAKEMKKISKDAQDQDNDDHGGGH
>E0TW65 1.10.3.-~~~qoxC~~~Quinol oxidase subunit 3~~~
MEHAEHGNSNAPMEYQSETGRLNILGFWIFLGAEIVLFSTLFATFFVLQNRTAGGVLPDELFEVNLVMIMTFLLLISSFT
CGIAVHEMRRGSLKGVVIWTIITLLLGAGFVGCEINEFVHYVHEGASLGTSAFWSGFFVLLGTHGTHVTIGIFWIIGILI
QLKKRGLTPQTSSKIFISSLYWHFLDVVWIFIFTGVYLMGLGGL
>P34958 1.10.3.-~~~qoxC~~~Quinol oxidase subunit 3~~~COG1845
MEHAEHGNSNAPMEYQSETGRLNILGFWIFLGAEIVLFSTLFATFFVLKNRTAGGVLPDELFEVNLVMIMTFLLLISSFT
CGIAVHEMRRGSLKGVVIWTIITLLLGAGFVGCEINEFVHYVHEGAALSTSAFWSGFFVLLGTHGTHVTIGIFWITGILI
QLKKRGLTPQTSSKIFISSLYWHFLDVVWIFIFTGVYLMGLGGL
>E0TW64 1.10.3.-~~~qoxD~~~Quinol oxidase subunit 4~~~
MANKSAEHSHFPWKHIVGFALSIVLTLLALWVAVYTDLSSSAKLWIIFGFAFIQAALQLLMFMHMTESENGGIQVGNTLF
GFFGAIVIVLGSIWIFAAHYHHGDHMDGNPPGGAEHSEHSGHNE
>P34959 1.10.3.-~~~qoxD~~~Quinol oxidase subunit 4~~~COG3125
MANKSAEHSHFPWKHIVGFILSIVLTLLALWVAVYTDLSSSAKLWIIFGFAFIQAALQLLMFMHMTESENGTIQVGNTLF
GFFGAIVIVLGSIWIFAAHYHHGDHMDGNPPGGAEHSEHSGHNE
>P37619 ~~~yhhQ~~~Queuosine precursor transporter~~~COG1738
MNVFSQTQRYKALFWLSLFHLLVITSSNYLVQLPVSILGFHTTWGAFSFPFIFLATDLTVRIFGAPLARRIIFAVMIPAL
LISYVISSLFYMGSWQGFGALAHFNLFVARIATASFMAYALGQILDVHVFNRLRQSRRWWLAPTASTLFGNVSDTLAFFF
IAFWRSPDAFMAEHWMEIALVDYCFKVLISIVFFLPMYGVLLNMLLKRLADKSEINALQAS
>P0DOV3 1.97.-.-~~~qrcA~~~Menaquinone reductase, multiheme cytochrome c subunit~~~
MEDRQLTNADNENGRKCACGGAAPFFVGLVVALVFGWWAFPEMLYSQKEQPIRFSHKVHVNDAGMECKQCHSLREDGSFA
GLPSTASCAECHSDVLGSDPEEARFVAEYVKSGKEVKWLVYQYQPDNVFFSHAAHSLDGCNQCHQFSERELCNLCHLDVA
DSDKAPTHYENKLTGYSKQTMKMWQCERCHANENHLGVTNSSNACFVCHK
>Q72E84 ~~~qrcB~~~Menaquinone reductase, molybdopterin-binding-like subunit~~~COG0243
MALDRRGFLKFIGGATAGILATPVVWKGLDDVSIWSQNWSWIPRNIKGANSYVPTVSKLCPTGIGVRVRLVDGRPVRVIG
NPEHPLSKGGVSSIAAAEVQMLYSPARMKRPLKRSPDGAYVMISWEEAEAMLLDGLKAAKGGDALACISGDDNGTINELL
SAFVQQSGSKSFFLMPGEAQPAAKAWDLMGGEGQIGYDIEKSDFVLAIGANVLEAWGTAIRNRHAFGASHPHGAEPTAQF
VYAGPVLNNTATGADDWLPIRPGTESAFALGLAHLLIKAGASSSAPDFDAFRSLAASFSPEKVAAQTGVDAKALTALAQA
LAKAKHPLVIVGSEFSQGAGAAPVMAGIALNMLLGSVNRDGGLRALPVARKVVPAGMDRKAMLQQDLTLWASAIASGKAK
APKAMLVYEANPVYALPQGSAFKDTLAKVPFKVAFTSFLDETAMQCDLVIPVSMGLERLDDVCTPYGCGEVVYSLATPVT
APLFDTKPAGDALIALGGKLGLDLGVASFEDMLKAKAAAHGADFDKLAEGTAFTSRATVGANLSFRPDVLSKALDVKAPA
LPLALAPVMKLNMGTSKTAIPPFNTKTIRRWEVQGKEGYVMLNGATARKLGLAQHDRVVLSNPTGKVTVRVNIFEGVMND
TVAMPLGFGHTAFDEFSKGKGENVMHLLAPSTEPVTGLAVWTGAGVNIAKA
>Q72E85 1.97.-.-~~~qrcC~~~Menaquinone reductase, iron-sulfur cluster-binding subunit~~~COG0437
MSSFKEFKIKWGMVIDLDKCTGCGACMVACQAENNIAPQPDASNKLKSLNWLVVYELNNGKPFPEHDVAYLPRPCMQCGK
PSCVSVCPVVATDKNEEGGIVSQVYPRCIGCRYCMASCPYHARYFNWFDPTWPEGMDKTLTPDVSVRPRGVVEKCTFCHH
RFMQAKDKARVEGRDPSALRDGDYVTSCTEACPNGAIIFGDFNNPEHRVHELHKSKYAFRLLERLGTDPQVYYLSRREWV
RRLGDNYLEHEKVKG
>Q72E86 ~~~qrcD~~~Menaquinone reductase, integral membrane subunit~~~COG5557
MDKNYNLPVDAELFPEGCERCSLSKFMMWMAFVFVFFGWGLYAAYRVLAEGLGVTGLDDYFGFGLWITFDLAVIALGAGA
FFSGLLRYILNIDPLKNIINLAVIIGFLCYSGAMLVLVLDIGQPLRAWFGYWHANVHSMLTEVIFCITCYCLVLIIEYVP
LILENRQLNKNKLVHAVAHNFHVMMPLFAGIGAFLSTFHQGSLGGMYGVLFGRPYIYREGFFIWPWTFFLYVLSAVGSGP
VFTVLVCTLMEKMTGRKLVSWEVKSLMGKIAGTMLMVYLIFKFADTYAWAYDLLPRQGLTFDQMFTSGWIYGKWMLWAEL
FYCGLVPAIILIVPALRNNPVLFYSAAILDCIGITINRYVMTVQALAIPVMPFDSWESYLPNWAEWGASVMIVAYAALVL
SLSYRYLPIFPQEAELNRK
>Q8XBS3 ~~~qseB~~~Transcriptional regulatory protein QseB~~~COG0745
MRILLIEDDMLIGDGIKTGLSKMGFSVDWFTQGRQGKEALYSAPYDAVILDLTLPGMDGRDILREWREKGQREPVLILTA
RDALEERVEGLRLGADDYLCKPFALIEVAARLEALMRRTNGQASNELRHGNVMLDPGKRIATLAGEPLTLKPKEFALLEL
LMRNAGRVLPRKLIEEKLYTWDEEVTSNAVEVHVHHLRRKLGSDFIRTVHGIGYTLGEK
>P52076 ~~~qseB~~~Transcriptional regulatory protein QseB~~~COG0745
MRILLIEDDMLIGDGIKTGLSKMGFSVDWFTQGRQGKEALYSAPYDAVILDLTLPGMDGRDILREWREKGQREPVLILTA
RDALAERVEGLRLGADDYLCKPFALIEVAARLEALMRRTNGQASNELRHGNVMLDPGKRIATLAGEPLTLKPKEFALLEL
LMRNAGRVLSRKLIEEKLYTWDEEVTSNAVEVHVHHLRRKLGSDFIRTVHGIGYTLGEK
>Q8X524 2.7.13.3~~~qseC~~~Sensor protein QseC~~~COG0642
MKFTQRLSLRVRLTLIFLILASVTWLLSSFVAWKQTTDNVDELFDTQLMLFAKRLSTLDLNEINAADRMAQTPNKLKHGH
VDDDALTFAIFTHDGRMVLNDGDNGEDIPYSYQREGFADGQLVGDKDQWRFVWMTSPDGKYRIVVGQEWEYREDMALAIV
AGQLIPWLVALPVMLIIMMVLLGRELAPLNKLALALRMRDPDSEKPLNATGVPSEVRPLVESLNQLFARTHAMMVRERRF
TSDAAHELRSPLTALKVQTEVAQLSDDDPQARKKALLQLHSGIDRATRLVDQLLTLSRLDSLDNLQDVAEIPLEDLLQSS
VMDIYHTAQQAKIDVRLTLNVQGIKRTGQPLLLSLLVRNLLDNAVRYSPQGSVVDVTLNADNFIVRDNGPGVTPEALARI
GERFYRPPGQTATGSGLGLSIVQRIAKLHGMNVEFGNAEQGGFEAKVSW
>P40719 2.7.13.3~~~qseC~~~Sensor protein QseC~~~COG0642
MKFTQRLSLRVRLTLIFLILASVTWLLSSFVAWKQTTDNVDELFDTQLMLFAKRLSTLDLNEINAADRMAQTPNRLKHGH
VDDDALTFAIFTHDGRMVLNDGDNGEDIPYSYQREGFADGQLVGEDDPWRFVWMTSPDGKYRIVVGQEWEYREDMALAIV
AGQLIPWLVALPIMLIIMMVLLGRELAPLNKLALALRMRDPDSEKPLNATGVPSEVRPLVESLNQLFARTHAMMVRERRF
TSDAAHELRSPLTALKVQTEVAQLSDDDPQARKKALLQLHSGIDRATRLVDQLLTLSRLDSLDNLQDVAEIPLEDLLQSS
VMDIYHTAQQAKIDVRLTLNAHSIKRTGQPLLLSLLVRNLLDNAVRYSPQGSVVDVTLNADNFIVRDNGPGVTPEALARI
GERFYRPPGQTATGSGLGLSIVQRIAKLHGMNVEFGNAEQGGFEAKVSW
>P45336 2.7.13.3~~~qseC~~~Sensor protein QseC~~~COG0642
MKNRSLTLRLISVLCLTALFVWLGSTLVAWWQVRHDVNKVFDAQQVLFAERLANSDLSTILLESSTTLNKNSQSVLKKSY
DDDALAFAIFSKTGKLLFSDGRNGKDFIFNYKTGFYNANIYDDDDKWRIFWRMAANGELVIAVGQELDYREDLIEEMILG
QMWIWFASLPILIIVLGWLIHKELRPIKRLSQEVQTRKSGDVSLLNTEGLPVEILPLVKNLNQFFDRTSAMLQRERRFTS
DAAHELRSPLAALRIQIEVAQLAGDDVALREQALLHLTQGIDRASQLIEQLLTLSRLDNLQALETLQLLDWEAIVQSLIS
ERYFVAEKRKITLVFEKESEPKQKQGQSILVSLMLRNLLDNAIKYCPEDTIVSVKISSSQIIIEDNGGGVEPEDLKKLGQ
RFYRPAGQNEKGSGLGLSIVMRIAELHGFKVRLENVVKEGRRIGLKAIISL
>Q8X487 ~~~qseD~~~HTH domain-truncated transcriptional regulator QseD~~~COG0583
MTPLQLSEQGKIFHSQIRHLLQQLESNLAELRGGSDYAQRKIKIAAAHSLSLGLLPSIISQMPPLFTWAIEAIDVDEAVD
KLREGQSDCIFSFHDEDLLEAPFDHIRLFESQLFPVCASDEHGEALFDLVQPHFPLLNYSRNSYMGRLINRTLTRHSELS
FSTFFVSSMSELLKQVALDGCGIAWLPEYAIQQEIRSGQLVVLNRDELVIPIQAYAYRMNTRMNPVAERFWRELRELEIV
LS
>Q8XA47 2.7.13.3~~~qseE~~~Sensor histidine kinase QseE~~~COG2205
MKRWPVFPRSLRQLVMLAFLLILLPLLVLAWQAWQSLNALSDQAALVNRTTLIDARRSEAMTNAALEMERSYRQYCVLDD
PTLAKVYQSQRKRYSEMLDAHAGVLPDDKLYQALRQDLNNLAQLQCNNSGPDAAAAARLEAFASANTEMVQATRTVVFSR
GQQLQREIAERGQYFGWQSLVLFLVSLVMVLLFTRMIIGPVKNIERMINRLGEGRSLGNSVSFSGPSELRSVGQRILWLS
ERLSWLESQRHQFLRHLSHELKTPLASMREGTELLADQVVGPLTPEQKEVVSILDSSSRNLQKLIEQLLDYNRKQADSAV
ELDNVELAPLVETVVSAHSLPARAKMMHTDVDLKATACLAEPMLLMSVLDNLYSNAVHYGAESGNICLRSSLHGARVYID
VINTGTPIPQEERAMIFEPFFQGSHQRKGAVKGSGLGLSIARDCIRRMQGELYLVDESGQDVCFRIELPSSKNTK
>P0AFU5 ~~~qseF~~~Transcriptional regulatory protein QseF~~~COG2204
MSHKPAHLLLVDDDPGLLKLLGLRLTSEGYSVVTAESGAEGLRVLNREKVDLVISDLRMDEMDGMQLFAEIQKVQPGMPV
IILTAHGSIPDAVAATQQGVFSFLTKPVDKDALYQAIDDALEQSAPATDERWREAIVTRSPLMLRLLEQARLVAQSDVSV
LINGQSGTGKEIFAQAIHNASPRNSKPFIAINCGALPEQLLESELFGHARGAFTGAVSNREGLFQAAEGGTLFLDEIGDM
PAPLQVKLLRVLQERKVRPLGSNRDIDINVRIISATHRDLPKAMARGEFREDLYYRLNVVSLKIPALAERTEDIPLLANH
LLRQAAERHKPFVRAFSTDAMKRLMTASWPGNVRQLVNVIEQCVALTSSPVISDALVEQALEGENTALPTFVEARNQFEL
NYLRKLLQITKGNVTHAARMAGRNRTEFYKLLSRHELDANDFKE
>P0AD45 ~~~qseG~~~Quorum-sensing regulator protein G~~~COG3170
MRHIFQRLLPRRLWLAGLPCLALLGCVQNHNKPAIDTPAEEKIPVYQLADYLSTECSDIWALQGKSTETNPLYWLRAMDC
ADRLMPAQSRQQARQYDDGSWQNTFKQGILLADAKITPYERRQLVARIEALSTEIPAQVRPLYQLWRDGQALQLQLAEER
QRYSKLQQSSDSELDTLRQQHHVLQQQLELTTRKLENLTDIERQLSTRKPAGNFSPDTPHESEKPAPSTHEVTPDEP
>O32054 2.4.99.17~~~queA~~~S-adenosylmethionine:tRNA ribosyltransferase-isomerase~~~COG0809
MKVDLFDFELPERLIAQVPLEQRDASRLMVLDKHTGELTDSSFKHIISFFNEGDCLVLNNTRVLPARLFGTKEDTGAKVE
LLLLKQETGDKWETLAKPAKRVKKGTVVTFGDGRLKAICTEELEHGGRKMEFQYDGIFYEVLESLGEMPLPPYIKEQLDD
KERYQTVYSKEIGSAAAPTAGLHFTEEILQQLKDKGVQIEFITLHVGLGTFRPVSADEVEEHNMHAEFYQMSEETAAALN
KVRENGGRIISVGTTSTRTLETIAGEHDGQFKASSGWTSIFIYPGYEFKAIDGMITNFHLPKSSLIMLVSALAGRENILR
AYNHAVEEEYRFFSFGDAMLII
>P0A7F9 2.4.99.17~~~queA~~~S-adenosylmethionine:tRNA ribosyltransferase-isomerase~~~COG0809
MRVTDFSFELPESLIAHYPMPERSSCRLLSLDGPTGALTHGTFTDLLDKLNPGDLLVFNNTRVIPARLFGRKASGGKIEV
LVERMLDDKRILAHIRASKAPKPGAELLLGDDESINATMTARHGALFEVEFNDERSVLDILNSIGHMPLPPYIDRPDEDA
DRELYQTVYSEKPGAVAAPTAGLHFDEPLLEKLRAKGVEMAFVTLHVGAGTFQPVRVDTIEDHIMHSEYAEVPQDVVDAV
LAAKARGNRVIAVGTTSVRSLESAAQAAKNDLIEPFFDDTQIFIYPGFQYKVVDALVTNFHLPESTLIMLVSAFAGYQHT
MNAYKAAVEEKYRFFSYGDAMFITYNPQAINERVGE
>P65951 2.4.99.17~~~queA~~~S-adenosylmethionine:tRNA ribosyltransferase-isomerase~~~
MNIEEFDYDLPESLIAQTPLKDRDHSRLLVMDRETGEMKHLHFKDIIEYFRPGDTLVLNDTRVMPARLFGLKEETGAKVE
MLMLTQIEGNDWEVLLKPAKRIKVGNKLNFGNGKIIAECIKEMDQGGRIMRLHYEGILQERLDELGEMPLPPYIKERLDD
PDRYQTVYAKESGSAAAPTAGLHFTDELLTEIKNKGVNIAFVTLHVGLGTFRPVSVDDVNDHEMHSEYYQMTQETADLLN
DTKSKGHRIISVGTTSTRTLETIRRDHDKFVETSGWTNIFIYPGFDFKAIDGQITNFHLPKSTLVMLVSAFSTRENVLNA
YKTAVNLEYRFFSFGDAMLII
>A8AYK5 2.4.99.17~~~queA~~~S-adenosylmethionine:tRNA ribosyltransferase-isomerase~~~COG0809
MNTADFDFHLPEELIAQTPLEKRDASRLLVVDRSSGEFSDQHFDSIIDQLQPGDALVMNNTRVLPARLYGEKPGTGGHVE
LLLLKNTEGDQWEVLAKPAKRLKVGAQVSFGDGRLTATVVDELEHGGRIVRFDYQGIFLEVLESLGEMPLPPYIHEKLAD
RERYQTVYAKENGSAAAPTAGLHFTKELLAQIEAKGVKLVYLTLHVGLGTFRPVSVDNLDDHEMHSEFYTLSEEAAATLR
EVKANGHRVIAVGTTSIRTLETIGNKFKGDIQADSGWTNIFIKPGYQWQIVDAFSTNFHLPKSTLVMLVSAFAGRDLTLK
AYEHAIAERYRFFSFGDAMFIK
>Q9WZ44 2.4.99.17~~~queA~~~S-adenosylmethionine:tRNA ribosyltransferase-isomerase~~~COG0809
MKVSEFDYELPPELIAQEPVEPRDASRLMVLHRKTQRIEHRIFREIIEYLEPGDLLVLNVSKVIPARLYARKKTGASIEI
LLIERLEEGIWKCLVRPGQKVKKGTELVIDEDLSAVCLGRGEDGTRILKFQPQDDRLIFEKGRTPLPPYIKNEVPLERYQ
TVYAKEEGSVAAPTAGLHFTPELIEKLKKKGVQFAEVVLHVGIGTFRPVKVEEVEKHKMHEEFYQVPKETVRKLRETRER
GNRIVAVGTTTVRTLETIARLPEQEEYVGKTDLFIYPPFEFKLVDALVTNFHLPRSTLLMLVAAFAGKDFVMEAYREAVK
RRYRFFSFGDAMLIL
>Q72JS5 2.4.99.17~~~queA~~~S-adenosylmethionine:tRNA ribosyltransferase-isomerase~~~COG0809
MEGLEAYDYHLPPEQIAQEGVEPRDMARLMVVYREGPFRVAHKRVRDLPEFLRPGDVLVFNESKVIPARLLARKPTGGKV
EILLVRERSPGLWEALLGPARKAPPGTRLLLLSPKDLAPVPGLQAEVVAVEEDGVRLLRFQGDLVAHLEEVGEVPLPPYI
KAKIPMERYQTVYARRPGSVAAPTAGLHFTPELLERLREMGVELRFLTLHVGPGTFRPVKGDPEKHEMHAEPYAIPEEVA
EAVNRAKAEGRRVVAVGTTVVRALESAYREGVGVVAGEGETRLFIRPPYTFKVVDALFTNFHLPRSTLLMLVAAFLGRER
TLEAYRLAVAEGYRFYSLGDAMLIL
>O31675 6.3.4.20~~~queC~~~7-cyano-7-deazaguanine synthase~~~COG0603
MKKEKAIVVFSGGQDSTTCLLWALKEFEEVETVTFHYNQRHSQEVEVAKSIAEKLGVKNHLLDMSLLNQLAPNALTRNDI
EIEVKDGELPSTFVPGRNLVFLSFASILAYQIGARHIITGVCETDFSGYPDCRDEFVKSCNVTVNLAMEKPFVIHTPLMW
LNKAETWKLADELGALDFVKNNTLTCYNGIIADGCGECPACHLRSKGYEEYMVMKGERA
>P77756 6.3.4.20~~~queC~~~7-cyano-7-deazaguanine synthase~~~COG0603
MKRAVVVFSGGQDSTTCLVQALQQYDEVHCVTFDYGQRHRAEIDVARELALKLGARAHKVLDVTLLNELAVSSLTRDSIP
VPDYEPEADGIPNTFVPGRNILFLTLAAIYAYQVKAEAVITGVCETDFSGYPDCRDEFVKALNHAVSLGMAKDIRFETPL
MWIDKAETWALADYYGKLDLVRNETLTCYNGFKGDGCGHCAACNLRANGLNHYLADKPTVMAAMKQKTGLR
>Q6D820 6.3.4.20~~~queC~~~7-cyano-7-deazaguanine synthase~~~COG0603
MKRAVVVFSGGQDSTTCLIQALQDYDDVHCITFDYGQRHRAEIEVAQELSQKLGAAAHKVLDVGLLNELATSSLTRDSIP
VPDYDANAQGIPNTFVPGRNILFLTLASIYAYQVGAEAVITGVCETDFSGYPDCRDEFVKALNQAIVLGIARDIRFETPL
MWLNKAETWALADYYQQLDTVRYHTLTCYNGIKGDGCGQCAACHLRANGLAQYQKDAATVMASLKQKVGLR
>O31676 4.1.2.50~~~queD~~~6-carboxy-5,6,7,8-tetrahydropterin synthase~~~COG0720
MLSQIYPQAQHPYSFELNKDMHISAAHFIPRESAGACSRVHGHTYTVNITVAGDELDDSGFLVNFSVLKKLVHGNYDHTL
LNDHEDFSQDDRYSLPTTEVVAKTIYDNVQAYLDTLENKPTCVQVFVRETPTSYCVYRPKKGGLNG
>P65870 4.1.2.50~~~queD~~~6-carboxy-5,6,7,8-tetrahydropterin synthase~~~COG0720
MMSTTLFKDFTFEAAHRLPHVPEGHKCGRLHGHSFMVRLEITGEVDPHTGWIIDFAELKAAFKPTYERLDHHYLNDIPGL
ENPTSEVLAKWIWDQVKPVVPLLSAVMVKETCTAGCIYRGE
>O31677 4.3.99.3~~~queE~~~7-carboxy-7-deazaguanine synthase~~~COG0602
MAKGIPVLEIFGPTIQGEGMVIGQKTMFVRTAGCDYSCSWCDSAFTWDGSAKKDIRWMTAEEIFAELKDIGGDAFSHVTI
SGGNPALLKQLDAFIELLKENNIRAALETQGTVYQDWFTLIDDLTISPKPPSSKMVTNFQKLDHILTSLQENDRQHAVSL
KVVIFNDEDLEFAKTVHKRYPGIPFYLQVGNDDVHTTDDQSLIAHLLGKYEALVDKVAVDAELNLVRVLPQLHTLLWGNK
RGV
>A0A0H3KB22 4.3.99.3~~~queE~~~7-carboxy-7-deazaguanine synthase~~~COG0602
MTYAVKEIFYTLQGEGANAGRPAVFCRFAGCNLWSGREEDRAQAVCRFCDTDFVGTDGENGGKFKDADALVATIAGLWPA
GEAHRFVVCTGGEPMLQLDQPLVDALHAAGFGIAIETNGSLPVLESIDWICVSPKADAPLVVTKGNELKVVIPQDNQRLA
DYAKLDFEYFLVQPMDGPSRDLNTKLAIDWCKRHPQWRLSMQTHKYLNIP
>O31678 1.7.1.13~~~queF~~~NADPH-dependent 7-cyano-7-deazaguanine reductase~~~COG0780
MTTRKESELEGVTLLGNQGTNYLFEYAPDVLESFPNKHVNRDYFVKFNCPEFTSLCPKTGQPDFATIYISYIPDEKMVES
KSLKLYLFSFRNHGDFHEDCMNIIMNDLIELMDPRYIEVWGKFTPRGGISIDPYTNYGKPGTKYEKMAEYRMMNHDLYPE
TIDNR
>Q46920 1.7.1.13~~~queF~~~NADPH-dependent 7-cyano-7-deazaguanine reductase~~~COG0780
MSSYANHQALAGLTLGKSTDYRDTYDASLLQGVPRSLNRDPLGLKADNLPFHGTDIWTLYELSWLNAKGLPQVAVGHVEL
DYTSVNLIESKSFKLYLNSFNQTRFNNWDEVRQTLERDLSTCAQGKISVALYRLDELEGQPIGHFNGTCIDDQDITIDNY
EFTTDYLENATCGEKVVEETLVSHLLKSNCLITHQPDWGSLQIQYRGRQIDREKLLRYLVSFRHHNEFHEQCVERIFNDL
LRFCQPEKLSVYARYTRRGGLDINPWRSNSDFVPSTTRLVRQ
>Q5L1B7 1.7.1.13~~~queF~~~NADPH-dependent 7-cyano-7-deazaguanine reductase~~~COG0780
MAGRKEEELKDLTLLGNQGTTYSFTYNPNLLEVFDNKHPDRDYFVKFNCPEFTTLCPKTGQPDFATIYISYIPDKKCVES
KSLKLYLFSFRNHGDFHEDCVNIIMNDLIKVMEPRYIEVWGKFTPRGGISIDPYCNWGRPGTKYEKMAEYRLLNHDLYPE
KVDNR
>Q9KTK0 1.7.1.13~~~queF~~~NADPH-dependent 7-cyano-7-deazaguanine reductase~~~COG0780
MSKYSDAKELASLTLGKKTEYANQYDPSLLQPVPRSLNRNDLHLSATLPFQGCDIWTLYELSWLNQKGLPQVAIGEVSIP
ATSANLIESKSFKLYLNSYNQTRFASWDEVQTRLVHDLSACAGETVTVNVKSLNEYTAEPIVTMQGECIDDQDIEIANYE
FDDALLQGAAQGEEVSEVLHSHLLKSNCLITNQPDWGSVEIAYHGAKMNREALLRYLVSFREHNEFHEQCVERIFTDIMR
YCQPQSLTVYARYTRRGGLDINPFRSSHQSAPNHNQRMARQ
>P97030 1.17.99.6~~~queG~~~Epoxyqueuosine reductase~~~COG1600
MNVYQLKEELIEYAKSIGVDKIGFTTADTFDSLKDRLILQESLGYLSGFEEPDIEKRVTPKLLLPKAKSIVAIALAYPSR
MKDAPRSTRTERRGIFCRASWGKDYHDVLREKLDLLEDFLKSKHEDIRTKSMVDTGELSDRAVAERAGIGFSAKNCMITT
PEYGSYVYLAEMITNIPFEPDVPIEDMCGSCTKCLDACPTGALVNPGQLNAQRCISFLTQTKGFLPDEFRTKIGNRLYGC
DTCQTVCPLNKGKDFHLHPEMEPDPEIAKPLLKPLLAISNREFKEKFGHVSGSWRGKKPIQRNAILALAHFKDASALPEL
TELMHKDPRPVIRGTAAWAIGKIGDPAYAEELEKALEKEKDEEAKLEIEKGIELLKASGMTKQGLS
>Q7VYQ1 1.17.99.6~~~queH~~~Epoxyqueuosine reductase QueH~~~COG1636
MSQLVRPTLELPAGRRKVLLHSCCAPCSGEVMEAMTASGIDYAIYFYNPNIHPVKEYEIRKNENIRFAEEHGIEFIDADY
DMDNWFERVKGMENEPERGIRCTACFDMRFERTALYAHEHGFDTITSSLGISRWKDMNQINGCGERAAARYDDLVYWTYN
WRKGGGSQRMIEISKRENFYQQEYCGCVYSLRDTNRHRRAQGRERIHLGVKFYGVEEKL
>Q3Z8V0 1.17.99.6~~~queH~~~Epoxyqueuosine reductase QueH~~~COG1636
MAPKLLLHGCCAHCTAYSFKYWQEQGFAVSVYWYNPNIHPFMEHQSRLEAMRKLSAEMGFELITEPSYHMAEYFKNVSAN
VDGRCRICFDMRLGQTAAYAAGHGYEYFSSSLFISPHQKHQDAVCSAEALAKETGVRFAYADLRKRYSDSRHITKPLDLY
RQQYCGCVYSEYERFGKPNSPA
>P44068 1.17.99.6~~~queH~~~Epoxyqueuosine reductase QueH~~~COG1636
MNTELQPKLEKSAVNFQAKPRKQKIRKDPNAPFIREKLELPDGHNKLLLHSCCAPCSGEVMEAILASGIEFTIYFYNPNI
HPLKEYLIRKEENIRFAKKFGIPFIDADYDRQNWFDRAKGMEWEPERGIRCTMCFDMRFEKAAEYAHKHGFPVFTSCLGI
SRWKDMNQINGCGHRAAEKYDDVIYWDYNWRKEGGSQRMIEISKRERFYQQEYCGCVYSLRDSNKWREETGRQKIEIGKL
YYSAD
>O24926 1.17.99.6~~~queH~~~Epoxyqueuosine reductase QueH~~~COG1636
MLIHICCSVDNLYFLKKAKEAFAGEKIVGFFYNPNIHPYSEYLLRLEDVKRTCEMLGIELLEGDYELEKFLDKAKGKELL
GEKSERCFECFDLRLEASALKAFELGEEKFTTTLLTSPKKDPNQLIAKGQSIAQRHNLEFVVFRNDNFEHFKSELDLNLQ
ALARENELYRQNYCGCQFALKIQKESQNRSPFELYSPLKRQILPASIEERTQVFRTLDMAKKDANKPFLAQKTIATYRLL
NGGVWLSKNSNPLNCCILARSKSKAKVRINDLRWVFSQRLSVLVGYSQRDETLFLTLEGLNTLMAKNYDNLKELNLNPLN
YEEELSLRALVSGSESINPIIVLEERTEKTLFVEIKSVFQEEKVFYLL
>Q0I1Q0 1.17.99.6~~~queH~~~Epoxyqueuosine reductase QueH~~~COG1636
MHRTKLEQKQPHFDAQKRRKKECKNSNTPFVRPKLELPHGHNKLLLHSCCAPCSGEVMEAIHASGIDFTIYFYNPNIHPL
KEYLIRKEENIRFAEKWGIPFIDADYDRQNWFDRAKGMEDEPERGIRCTMCFDMRFEKAAQYAHENGFPVFTSCLGISRW
KDMNQINGCGHRAAEKYDDVVYWDYNWRKGGGSQRMIEISKRERFYQQEYCGCVYSLRDTNKWREANGRQKIEIGKLYYS
ADK
>A0A0H3JUG6 1.17.99.6~~~queH~~~Epoxyqueuosine reductase QueH~~~
MINAEPIISKMKNQKINYDKVLKKLIGQWEREAIRPKILLHSCCAPCSTYTLEFLTQYADIAIYFANSNIHPKNEYLRRA
KVQEQFVEDFNRKTGANVKYIEAPYEPHKFVKMVKDKELADEKEGGLRCTACFEMRLDIVAKAAVEHGYDYFGSAITLSP
KKNAQLINELGMDVQKIYDVNYLPSDFKKSKGYERSIEMCNDYNIFRQCYCGCVFAAMQQGIDFKTVNKEAKAFLEQYPD
>A0A0H2VKG8 1.17.99.6~~~queH~~~Epoxyqueuosine reductase QueH~~~COG1636
MIEANQILAKMKNQKINYDKVLRKIISQWERDGERPKILLHSCCAPCSTYTLEFLTQYADIAIYFANPNIHPKSEYLRRA
KVQEQFVNDFNNKTGASVKYIEAEYEPHKFMKMAKDKGLTEEPEGGLRCTACFEMRLEIVAKAALEHGYDYFGSAITLSP
KKNAQLINELGMDVQNIYNVKYLPSDFKKNKGYERSIEMCNDYNIFRQCYCGCVFAAMKQGIDFKQINKDAQAFLQQF
>Q9A1K3 1.17.99.6~~~queH~~~Epoxyqueuosine reductase QueH~~~
MIDLQEILANMNPNQKINYDRVMQQMAKVWEKESVRPSILMHVCCAPCSTYTLEYLTQFADITVYFANSNIHPKDEYHRR
AYVTQQFVSEFNAKTGNTVQFLEADYVPNEYVRQVRGLEEEPEGGDRCRVCFDYRLDKTAQKAVELGFDYFASALTISPH
KNSQTINDVGIDVQKVYTTKYLPSDFKKNNGYRRSVEMCEEYDIYRQCYCGCVYAAKMQGIDLVQVKKDAKAFMADKDLD
NDFTHIRFSYRGDEM
>Q9WZJ0 1.17.99.6~~~queH~~~Epoxyqueuosine reductase QueH~~~COG1636
MGTVLIHVCCAPDLLTTIFHVRDAEFFFYNPNIQPLSEYEKRREAVDKVANHFSLNVRYGEYSTEEIRKWYTAVKDYKDL
GEGSKRCERCISFLLERTAQEARKRGHESFSTTLLASPRKNLPMIENIGKTIEEKYGVKFFFKNFRKGGAYQEGVRLSKE
LGIYRQNYCGCVFSLLERREKHAEISRKRGHM
>A2RM05 ~~~queT~~~Queuosine precursor transporter QueT~~~COG4708
MKKSKTYDIVTIAIVAALYVILTMTPGLSAISYGPIQFRVSEMLNFTAFFNKKYIIAVTIGCMISNFLSFTWVDVIVGGL
STLVFLSLGVLLFDRFKEDYFWNGQLNKAFFFFAIFFSISMFTIALELKFVAETPFLLTWGTLALGEFASLFIGAFIMDK
LGKRVDLSR
>Q59086 1.1.5.8~~~quiA~~~Quinate/shikimate dehydrogenase (quinone)~~~COG4993
MSDPQEKSHIILKVWCFILGLALLITGAFYVIGGGKLISLGGSWYFLIAGLMITTSAFFMFKKKATGVWLYALAFIGTVI
WALIDAGFEFWPLHSRLMFPAGLFAAVMLTLPSIRKYQYQTPMSAPAYVIGGLTVLGMLGGLYGMFIPHETVKASGEELP
LVPVDPAKKQVNWDHYGNDAGGSRFVALDQINRNNVSKLKEAWRFRTGDFTTGTGNGAEDQMTPLQVGNKVFLCTPHNNI
FAIDADSGKQLWKAEVNSTADAWERCRGVAYFDSTQPLVQPTLAGATPVAALAANTECPRRVYTNTVDGRLIAVNADTGA
RCKDFGVNGTVNLHEGLGENTKAPRFEVTSAPTIAGTTIVVGSRIADNVAADMPGGVIRAYDVITGKLRWAFDPRNPDPN
YVLKPGEIYKRSSTNSWAAMSYDPQMNTVFLPMGSSSVDVWGGNRTAADHKYNTSVLALDATTGKEKWVYNTVHNDLWDF
DLPMQPSLVDFPMKDGTTKPAVVIGTKSGQFYVLDRVTGKPLTKVIEQPIKVADIPGEQYSKTQPRSVEMPQIGNQTLKE
SDMWGATPFDQLMCRINFKSMRYDGLYTAPGTDVSLSFPGSLGGMNWGSIAFDPTHRYMFVNDMRLGLWIQLIKQTPEDI
KIQANGGEKVNTGMGAVPMKGTPYKVNKNRFMSALGIPCQKPPFGTMTAIDMKTRQVAWQVPLGTIQDTGPMGIKMGLKA
PIGMPTIGGPMATQGGLVFFAATQDYYLRAFNSSNGKELWKARLPVGSQGTPMSYMSPKTGKQYVVVSAGGARQSPDHGD
YVIAYALEK
>Q43922 4.2.1.118~~~quiC~~~3-dehydroshikimate dehydratase~~~COG3420
MKLTSLRVSLLALGLVTSGFAAAETYTVDRYQDDSEKGSLRWAIEQSNANSAQENQILIQAVGKAPYVIKVDKPLPPIKS
SVKIIGTEWDKTGEFIAIDGSNYIKGEGEKACPGANPGQYGTNVRTMTLPGLVLQDVNGVTLKGLDVHRFCIGVLVNRSS
NNLIQHNRISNNYGGAGVMITGDDGKGNPTSTTTNNNKVLDNVFIDNGDGLELTRGAAFNLIANNLFTSTKANPEPSQGI
EILWGNDNAVVGNKFENYSDGLQINWGKRNYIAYNELTNNSLGFNLTGDGNIFDSNKVHGNRIGIAIRSEKDANARITLT
KNQIWDNGKDIKRCEAGGSCVPNQRLGAIVFGVPALEHEGFVGSRGGGVVIEPAKLQKTCTQPNQQNCNAIPNQGIQAPK
LTVSKKQLTVEVKGTPNQRYNVEFFGNRNASSSEAEQYLGSIVVVTDHQGLAKANWAPKVSMPSVTANVTDHLGATSELS
SAVKMR
>Q9I4U2 3.5.1.97~~~quiP~~~Acyl-homoserine lactone acylase QuiP~~~
MASPAFMRFLPRCGAAAAFGTLLGLAGCQSWLDDRYADSLPPTSGVQPIKGLAQNVSIRRNALGMPLIETGTFHDALFAL
GYVHASDRLSQMVSLRLLAQGRLAEMVGPGALEIDRFMRTVNLRQAAEIQYRNASPRLQRFFEVYARGVNAYLYRYRDKL
PMDLAQSGYRPEYWKPEDSALVFALLNFGLAVNLQEEIASLTLAQKVGSDKLAWLTPTYPDENLPFDEAEKLKGLRLDGQ
VPGLAGVEGAARQVAALSMLGVAASNNWAIAPQRSRSGKSLMANDTHLPLSMPSVWNYVQIRSPKYQAAGVSIAGLPGVV
AGFNGKLAWGMTMVLGDNQDLYLEQLRRQGNRLYYLADGKWQPTRERQETFFIKGQRPIREVIHETRHGPLLNSALGERK
NILQPLPLKSGYGLAYRSIQQEADKTLDGFFDLSRAKTIEQAFDATREIRAMPLNIVFADEKHIGWQVTGRYPNRKEGRG
LLPSPGWDGRYDWDGYADPILHPSDQDPQQGWLGTANHRTVQPGYGAQLSNSWYYPERAERIAQLAGASKSHDTQSMIRM
QYDQTSLFVAKLQAMFDNPGMALPLRQAIDALPEAQRSRAREAYDRLMAFDGKLTASSSDAALYGAFLHESARQIFLDEL
GPEDGPAWKAFVETANLSYSAQADHLLGRDDSPFWDDTRTPQKEDKPAILARSLAAAVEFCEQRLGSERKAWQWGKLHTY
EWQSDSSKMAPYLGAGERAGLGAIKGYLDRGPYPAGGDHTTLDVSAYGWGQDFDTWLIPAMRLIVDFGQSEPMIGVNSSG
QSGNPASPHYADGIDAWLKGRYVSFPFQPQNLDRVYGNKRLTLTPAR
>O86312 ~~~raaS~~~HTH-type transcriptional regulatory protein RaaS~~~COG1309
MRSADLTAHARIREAAIEQFGRHGFGVGLRAIAEAAGVSAALVIHHFGSKEGLRKACDDFVAEEIRSSKAAALKSNDPTT
WLAQMAEIESYAPLMAYLVRSMQSGGELAKMLWQKMIDNAEEYLDEGVRAGTVKPSRDPRARARFLAITGGGGFLLYLQM
HENPTDLRAALRDYAHDMVLPSLEVYTEGLLADRAMYEAFLAEAQQGEAHVG
>P45870 ~~~racA~~~Chromosome-anchoring protein RacA~~~COG0789
MNTNMVASELGVSAKTVQRWVKQLNLPAERNELGHYSFTAEDVKVLKSVQKQISEGTAIQDIHLPKSAKKRTGFLVQKTS
SDTERRIEQLEQKLDTLLQQRQDESELMARMSELERQLKQKADEGVSYQLLQHRREIDDILADLQSLTSQMKEFTAHSIP
ETAAASEKTKTRKKPLLSLFKFQT
>P29079 5.1.1.13~~~~~~Aspartate racemase~~~COG1794
MENFFSILGGMGTMATESFVRLINHRTKATKDQEYLNYVLFNHATVPDRTAYILDRSEENPMPFLLDDIEKQNLLRPNFI
VLTCNTAHYFFEELQAATDIPILHMPREAANELVRQHTTGRVAILGTEGSMKAGIYEREVKNLGFETVIPDTALQEKINY
LIYHEIKESDHLNQELYYEILEEAVERLNCEKVILGCTELSLMNEFAEDNHYPVIDAQSILADRTIERALAERNEALDTV
SEK
>H8L901 5.1.1.13~~~~~~Aspartate racemase~~~
MENFFSILGGMGTMATESFVRLINHRTKATKDQEYLNYVLFNHATVPDRTAYILDRSEENPMPFLLDDIEKQNLLRPNFI
VLTCNTAHYFFEELQAATDIPILHMPREAANELVRQHTTGRVAILGTEGSMKAGIYEREVKNLGFETVIPDTALQEKINY
LIYHEIKESDYLNQELYYEILEEAVERLNCEKVILGCTELSLMHEFAEDNHYPVIDAQSILADRTIERALAERSEALDTA
SEK
>P32960 5.1.1.10~~~racX~~~Broad specificity amino-acid racemase RacX~~~COG1794
MIGILAGMGPKSTSPFIDKVIDYCQKLYGASNDIDYPHMMIYSCPTPFYADRPIDHDEMKKAIIDGAVKLEKTGVDFIAL
PCNTAHVYYEEIQQALSVPMLHIVEETIKEIPHPAKKAVVLGTEPTIQSAIYQKVLKGNGQEVIHKDHWQQAVNQLIAAI
KQPNHMQHTQALWQTLYEEISQHADIIISACTDLNAVLDHIQSEIPIIDSSACLAKSTVSTYLAYQS
>Q9X1X1 ~~~~~~Probable DNA double-strand break repair Rad50 ATPase~~~COG0419
MRPERLTVRNFLGLKNVDIEFQSGITVVEGPNGAGKSSLFEAISFALFGNGIRYPNSYDYVNRNAVDGTARLVFQFERGG
KRYEIIREINALQRKHNAKLSEILENGKKAAIAAKPTSVKQEVEKILGIEHRTFIRTVFLPQGEIDKLLISPPSEITEII
SDVFQSKETLEKLEKLLKEKMKKLENEISSLQALYTAIWKYLEENDLEVLKSELKTVSEKKKELLKKREELQKEEEQLKR
LLEKYRELVKKKERLRVLSLRRNELQKEVIYEQKVKKAKELEPLFREIYLRQREFERFSQELNSREKRYKELESEKEAIS
KEIPVHRERLSKLEEIGEKIKEELDLLEKVLKASRPLLEQRIRLKENLTRLEEEFRRLVGEKEKREKELLSIEKTENETK
NELEKLLDELSILKKDHMKWLAYQIASSLNEGDTCPVCGGVFHGKVEAVEFNIDEFEKLDQKRSELENTLNVLKERKKSL
SSLIEDLLMKIEEGKKNLKSIRNQIEKIEEELHRLGYSEDLEEKLDEKRKKLRKIEEERHSISQKITAADVQISQIENQL
KEIKGEIEAKRETLKEQREEMDQLKSDFFDRLRKIGIGFEEFRILVKEEVKDAEKELGVVETEIRLLEESLKELESENVR
DVSEDYEKVRNQLEALSQEISDLERKEGRLNHLIEETLRRERELKSLEKKLKEMSDEYNNLDLLRKYLFDKSNFSRYFTG
RVLEAVLKRTKAYLDILTNGRFDIDFDDEKGGFIIKDWGIERPARGLSGGERALISISLAMSLAEVASGRLDAFFIDEGF
SSLDTENKEKIASVLKELERLNKVIVFITHDREFSEAFDRKLRITGGVVVNE
>P37572 3.6.4.-~~~radA~~~DNA repair protein RadA~~~COG1066
MAKTKSKFICQSCGYESPKWMGKCPGCGAWNTMVEEMIKKAPANRRAAFSHSVQTVQKPSPITSIETSEEPRVKTQLGEF
NRVLGGGVVKGSLVLIGGDPGIGKSTLLLQVSAQLSGSSNSVLYISGEESVKQTKLRADRLGINNPSLHVLSETDMEYIS
SAIQEMNPSFVVVDSIQTVYQSDITSAPGSVSQVRECTAELMKIAKTKGIPIFIVGHVTKEGSIAGPRLLEHMVDTVLYF
EGERHHTFRILRAVKNRFGSTNEMGIFEMREEGLTEVLNPSEIFLEERSAGSAGSSITASMEGTRPILVEIQALISPTSF
GNPRRMATGIDHNRVSLLMAVLEKRVGLLLQNQDAYLKVAGGVKLDEPAIDLAIVISIASSFRDTPPNPADCFIGEVGLT
GEVRRVSRIEQRVKEAAKLGFKRMIIPAANLDGWTKPKGIEVIGVANVAEALRTSLGG
>P24554 3.6.4.-~~~radA~~~DNA repair protein RadA~~~COG1066
MAKAPKRAFVCNECGADYPRWQGQCSACHAWNTITEVRLAASPMVARNERLSGYAGSAGVAKVQKLSDISLEELPRFSTG
FKEFDRVLGGGVVPGSAILIGGNPGAGKSTLLLQTLCKLAQQMKTLYVTGEESLQQVAMRAHRLGLPTDNLNMLSETSIE
QICLIAEEEQPKLMVIDSIQVMHMADVQSSPGSVAQVRETAAYLTRFAKTRGVAIVMVGHVTKDGSLAGPKVLEHCIDCS
VLLDGDADSRFRTLRSHKNRFGAVNELGVFAMTEQGLREVSNPSAIFLSRGDEVTSGSSVMVVWEGTRPLLVEIQALVDH
SMMANPRRVAVGLEQNRLAILLAVLHRHGGLQMADQDVFVNVVGGVKVTETSADLALLLAMVSSLRDRPLPQDLVVFGEV
GLAGEIRPVPSGQERISEAAKHGFRRAIVPAANVPKKAPEGMQIFGVKKLSDALSVFDDL
>A0R563 3.6.4.-~~~radA~~~DNA repair protein RadA~~~COG1066
MAGSKIRSQYRCSECQHVAPKWVGRCANCGTWGTVDEVAVLAGNNKLNGAARRSVAPTSPAVPITSIDPGVTRHYPTGVS
ELDRVLGGGLVAGSVTLLAGDPGVGKSTLLLEVANRWAHSGKRALYLSGEESAGQIRLRAERTGCTHDQVYLAAESDLQI
ALGHIDEVKPSLVVVDSVQTMSTTEADGVTGGVTQVRAVTTSLTAYAKAAVGDPAVAMILVGHVTKDGAIAGPRSLEHLV
DVVLHFEGDRASSLRMVRGVKNRFGAADEVGCFQLHDNGIECVSDPSGLFLDQRPLAVPGTAVTVTLDGKRPMIGEVQAL
VSPPAGPPRRAVSGIDSARAAMIGAVLQTRCRMPINSNDLYLSTVGGMRLTDPSADLAVALAIASAYFDIAMPMKAIAIG
EVGLAGDLRRVTGMDRRLSEAARLGFTTAVVPPGVTSAPAGLKVVAADNIRAAVQTMREIAIAGAQ
>P9WHJ9 3.6.4.-~~~radA~~~DNA repair protein RadA~~~COG1066
MANARSQYRCSECRHVSAKWVGRCLECGRWGTVDEVAVLSAVGGTRRRSVAPASGAVPISAVDAHRTRPCPTGIDELDRV
LGGGIVPGSVTLLAGDPGVGKSTLLLEVAHRWAQSGRRALYVSGEESAGQIRLRADRIGCGTEVEEIYLAAQSDVHTVLD
QIETVQPALVIVDSVQTMSTSEADGVTGGVTQVRAVTAALTAAAKANEVALILVGHVTKDGAIAGPRSLEHLVDVVLHFE
GDRNGALRMVRGVKNRFGAADEVGCFLLHDNGIDGIVDPSNLFLDQRPTPVAGTAITVTLDGKRPLVGEVQALLATPCGG
SPRRAVSGIHQARAAMIAAVLEKHARLAIAVNDIYLSTVGGMRLTEPSADLAVAIALASAYANLPLPTTAVMIGEVGLAG
DIRRVNGMARRLSEAARQGFTIALVPPSDDPVPPGMHALRASTIVAALQYMVDIADHRGTTLATPPSHSGTGHVPLGRGT
>Q8DRP0 3.6.4.-~~~radA~~~DNA repair protein RadA~~~COG1066
MEEVEVAEVKNARVSLTGEKTKPMKLAEVTSINVNRTKTEMEEFNRVLGGGVVPGSLVLIGGDPGIGKSTLLLQVSTQLS
QVGTVLYVSGEESAQQIKLRAERLGDIDSEFYLYAETNMQSVRAEVERIQPDFLIIDSIQTIMSPEISGVQGSVSQVREV
TAELMQLAKTNNIAIFIVGHVTKEGTLAGPRMLEHMVDTVLYFEGERHHTFRILRAVKNRFGSTNEIGIFEMQSGGLVEV
LNPSQVFLEERLDGATGSSIVVTMEGTRPILAEVQALVTPTMFGNAKRTTTGLDFNRASLIMAVLEKRAGLLLQNQDAYL
KSAGGVKLDEPAIDLAVAVAIASSYKDKPTNPQECFVGELGLTGEIRRVNRIEQRINEAAKLGFTKIYVPKNSLTGITLP
KEIQVIGVTTIQEVLKKVFA
>P33919 3.6.4.12~~~radD~~~Putative DNA repair helicase RadD~~~COG1061
MIFTLRPYQQEAVDATLNHFRRHKTPAVIVLPTGAGKSLVIAELARLARGRVLVLAHVKELVAQNHAKYQALGLEADIFA
AGLKRKESHGKVVFGSVQSVARNLDAFQGEFSLLIVDECHRIGDDEESQYQQILTHLTKVNPHLRLLGLTATPFRLGKGW
IYQFHYHGMVRGDEKALFRDCIYELPLRYMIKHGYLTPPERLDMPVVQYDFSRLQAQSNGLFSEADLNRELKKQQRITPH
IISQIMEFAATRKGVMIFAATVEHAKEIVGLLPAEDAALITGDTPGAERDVLIENFKAQRFRYLVNVAVLTTGFDAPHVD
LIAILRPTESVSLYQQIVGRGLRLAPGKTDCLILDYAGNPHDLYAPEVGTPKGKSDNVPVQVFCPACGFANTFWGKTTAD
GTLIEHFGRRCQGWFEDDDGHREQCDFRFRFKNCPQCNAENDIAARRCRECDTVLVDPDDMLKAALRLKDALVLRCSGMS
LQHGHDEKGEWLKITYYDEDGADVSERFRLQTPAQRTAFEQLFIRPHTRTPGIPLRWITAADILAQQALLRHPDFVVARM
KGQYWQVREKVFDYEGRFRLAHELRG
>Q6WVP7 1.1.1.-~~~adh~~~NADP-dependent (R)-specific alcohol dehydrogenase~~~
MTDRLKGKVAIVTGGTLGIGLAIADKFVEEGAKVVITGRHADVGEKAAKSIGGTDVIRFVQHDASDEAGWTKLFDTTEEA
FGPVTTVVNNAGIAVSKSVEDTTTEEWRKLLSVNLDGVFFGTRLGIQRMKNKGLGASIINMSSIEGFVGDPTLGAYNASK
GAVRIMSKSAALDCALKDYDVRVNTVHPGYIKTPLVDDLEGAEEMMSQRTKTPMGHIGEPNDIAWICVYLASDESKFATG
AEFVVDGGYTAQ
>Q8YLP6 ~~~raf1~~~RuBisCO accumulation factor 1~~~
MTELPPNAPNPENATNELAQELLRKLRQKQGNWVEWGQAIASLQKSGYNPQDIFEATGFEPVQQNQVIVGSQVYNSLEKS
GASAATLAHYATRGSDVLYELRLLTHEERAAAGDLTFTHKVDADEAREIAKAIKDFSRFRILPEGFSNHPGDAVAYQAWK
LARQYSDLQERSRLIARGLRFAHSETARKQIEQLLVDFTVVSQRPAPIPPFFRFDTEDELPRIVPVVGQLPLKAEELKAV
PLVEEIEPFRLVKFSGEQAWVALPGWQVLLAAEDPVTILATSDRFPKQNQTEPGPVLVVVDRSQREWNDFSYFVVDHDGE
LDFQWFETKPEFPILGKVIILVRPRRILDENVTKDSWQIDE
>Q31Q05 ~~~raf1~~~RuBisCO accumulation factor 1~~~
MREFTPTTLSEEERQELLGQLRRKEGRWLAWARACQTLLKNGLNPQTLFEATGFEPIQQNQITVAMQVYDSILRQDPPAH
VRETYQEWGSDLLYELRELDQEQRSLCAQLALERKLDADQIREVAKATKDFCRLPKQPENFDRHPGDAVAHQCWRLAQER
TDLTERSRLIARGLQFAQSAGARALIEALLLDLSGVPSRKPPMLPIYRLETEEDLPRLLPFAGTLPLSSSQIEAIAAVEA
EGPFGLVSSPQGQQWLALPGWQAILTAEDPIACLEQIDRLPNAPEGPTEAVVLVVDRADRDWDADHFFLVEQAEGARIQW
SPSAIAAPILGRLVLILRPKRVLDEAAIATPWQFEE
>B1XK11 ~~~raf1~~~RuBisCO accumulation factor 1~~~
MIGQPQSPEYKLSPEETDALFRSLLHKEGTWVEWGVGCQQLQQSGHSAQEIFEQTGFQTAQQNMIIVAAQVYQSIASSGV
PEDLLAYCRGPRSDVLYELRILSHSQRAIAAQVCQAKSLEFDGAKELARAMQEFARLPQIPDSFTEHPGDAVAYQAWRSA
KQKKDLQDRTRLIAKGLKFAHSATARQKIEQLLSDLTTSPTKAAPLLPLYRYDEDTNVPLLIPVAGSLPLESDRLLSVPP
LKQASPFNLVTVATATSLVPLPSWQNVLTAGDPVVIFHQTDQLPQPIPGKPEPVLILLDRQQTTWNDNSYFAVDADGKVE
LGWFAEAPIAKILGQVLLVMRPKKILDENNLREPWQMDD
>Q55875 ~~~raf1~~~RuBisCO accumulation factor 1~~~
MTHSPESNPTVSAAEAAELIRSLLHKEGTWVDWGKKCQQLQKAGYGAEEIFEQSGFQKVQQNLVIVASQVYESLVKQGID
ETVLSYYRGPKSDVLYELRILNHQQRAIAAVEAQQKNLAADEAKELAKAFQEFGYLSQLPEGFTDHPGDALAYQCWKLAR
QKKNLPERTRLIVKGLKFAHSPNARQAIEKLLTDLTAQPSRKAPLVPVFRLEEDQEAARLIPVAGTFPLQPQAVQAVQSL
EQVEPFGLVSYQGEGAVVPVPQWQAILTAEDPVAIFCPAGQVSESLARKDEQVLVVVDRSKKIWNDGSYFLLNQGETVAI
QWCETEPEREILAQVVLVLRPKKIFDANNLREPWQMDD
>Q8DI26 ~~~raf1~~~RuBisCO accumulation factor 1~~~
MSQDQDALTTEVLQRLRRKEGTWQDWGAGCRQLQKQGLSPQAIFEATGIEPIHQNQLITALQVSQSLGEAPESVRAYFQT
RGSDLLYELRVLSAGDRLAAATLIVEKQLDVTAVHEVCRALKAVSYRKDNSEGFGESVGDRIGRYYWQLARQQRDLAQRS
RLIAQGLRFVESASGRQALEKLLTDFTVVPAVNQPRLPLYRLDTAEEVPYLVPVAGTAPLTATVLQQVPRLSCTEVFRVV
AVPQGMSLVALPAWQVLLNAVDPVAILWPAADLPAELPPSPEGLPIAQVLLVVDRGLAEWDRDRYLLIAPSPSEAVQLAW
LPEPPTATVVGQLLLVLRPPQVLDESLNRELWFFEE
>P16551 3.2.1.22~~~rafA~~~Alpha-galactosidase~~~
MISKYCRLSSPRSDLIIKTHPHAEIIWWGSALKHFSPDDCASLERPVANGRLDIDTPLTLIAENALGLFSSPGLEGHRNG
LDASPVFYTVDVEHTENTLRLTSEDSVAGLRLVSELVMTPSGILKVRHALTNLREGDWQINRFAITLPVAERAEEVMAFH
GRWTREFQPHRVRLTHDAFVLENRRGRTSHEHFPALIVGTPGFSEQQGEVWAVHLGWSGNHRMRCEAKTDGRRYVQAEAL
WMPGEKALRKNETLYTPWLYACHSADGLNGMSQQYHRFLRDEIIRFPEQKLRPVHLNTWEGIYFNHNPDYIMQMAERAAA
LGVERFIIDDGWFKGRNDDRAALGDWYTDEQKYPNGLMPVINHVKSLGMEFGIWVEPEMINPDSDLFRLHPDWILSMPGY
SQPTGRYQYVLNLNIPEAFDYIYKRFLWLLGEHPVDYVKWDMNRELVQAGHEGRAAADAQTRQFYRLLDLLRERFPHVEF
ESCASGGGRIDFEVLKRTHRFWASDNNDALERCTIQRGMSYFFPPEVMGAHIGHRRCHATFRQHSIAFRGLTALFGHMGL
ELDPVAADAKESDGYRRYALLYKEWRQLIHTGVLWRVDMPDSSIQVQGVVSPDQSQALFMISQLAMPDYTLPGILRFPGL
AAEVRYRLRVIDHPEIQLVGEGGHTMRRLPAWMNQPLEASGEWLAKGGIQLPVLDPESAILIALERAV
>P16552 ~~~rafB~~~Raffinose permease~~~
MNSASTHKNTDFWIFGLFFFLYFFIMATCFPFLPVWLSDVVGLSKTDTGIVFSCLSLFAISFQPLLGVISDRLGLKKNLI
WSISLLLVFFAPFFLYVFAPLLHLNIWAGALTGGVFIGFVFSAGAGAIEAYIERVSRSSGFEYGKARMFGCLGWALCATM
AGILFNVDPSLVFWMGSGGALLLLLLLYLARPSTSQTAMVMNALGANSSLISTRMVFSLFRMRQMWMFVLYTIGVACVYD
VFDQQFAIFFRSFFDTPQAGIKAFGFATTAGEICNAIIMFCTPWIINRIGAKNTLLVAGGIMTIRITGSAFATTMTEVVI
LKMLHALEVPFLLVGAFKYITGVFDTRLSATVYLIGFQFSKQLAAILLSTFAGHLYDRMGFQNTYFVLGMIVLTVTVISA
FTLSSSPGIVHPSVEKAPVAHSEIN
>P21867 ~~~rafR~~~HTH-type transcriptional regulator RafR~~~
MSLKAIATTLGISVTTVSRALGGFSDVAASTRERVEAEARRRGYRPNTQARRLKTGKTDAIGLVYPENDVPFNSGVFMDM
VSCISRELAYHDIDLLLIADDEHADCHSYMRLVESRRIDALIIAHTLDDDPRITHLHKAGIPFLALGRVPQGLPCAWFDF
DNHAGTWQATQKLIALGHKSIALLSENTSHSYVIARRQGWLDALHEHGLKDPLLRLVSPTRRAGYLAVMELMSLPAPPTA
IITDNDLSGDGAAMALQLRGRLSGKEAVSLVVYDGLPQDSIIELDVAAVIQSTRSLVGRQISDMVYQIINGASPESLQIT
WTPIFYPGSTVHSPSF
>Q9I2F8 ~~~rbsB~~~D-ribose/D-allose-binding protein~~~
MKRVASRRLLAAVVLTACSSFLPLSAVHAETPEKPRIALVMKSLANEFFLTMEDGAKAYQKEHADRFELVSNGIKDETDT
SSQIRIVEQMIVSGVDALVIAPADSKALVPVVKKALDAGIVVVNIDNRFDPQVLQAKKIGVPFVGPDNRKGARLVGEYLA
KRLKVGDEVGIIEGVSTTTNAQQRTAGFKDAMDAAGMKIVSLQSGNWEIEKGNAVASAMLNEHPDLKALLAGNDSMALGA
VSAVRAAGRAGQVKVVGYDNIQAIKPMLKDGRVLATADQFAAKQAVFGIQTALKLLAGQTPEHEKDGVVETPVELVTAP
>P33229 3.1.-.-~~~ralR~~~Endodeoxyribonuclease toxin RalR~~~
MRYDNVKPCPFCGCPSVTVKAISGYYRAKCNGCESRTGYGGSEKEALERWNKRTTGNNNGGVHV
>Q8NML3 ~~~ramA~~~HTH-type transcriptional activator RamA~~~COG2197
MDTQRIKDDEDAIRSALTSLKTATGIPVTMFATVLQDNRLQITQWVGLRTPALQNLVIEPGVGVGGRVVATRRPVGVSDY
TRANVISHEKDSAIQDEGLHSIVAVPVIVHREIRGVLYVGVHSAVRLGDTVIEEVTMTARTLEQNLAINSALRRNGVPDG
RGSLKANRVMNGAEWEQVRSTHSKLRMLANRVTDEDLRRDLEELCDQMVTPVRIKQTTKLSARELDVLACVALGHTNVEA
AEEMGIGAETVKSYLRSVMRKLGAHTRYEAVNAARRIGALP
>Q48413 ~~~ramA~~~Transcriptional activator RamA~~~
MTISAQVIDTIVEWIDDNLHQPLRIDDIARHAGYSKWHLQRLFLQYKGESLGRYIRERKLLLAARDLRDTDQRVYDICLK
YGFDSQQTFTRVFTRTFNQPPGAYRKENHSRAH
>O88039 ~~~ramA~~~ABC transporter ATP-binding protein RamA~~~COG1132
MSPAAGAPGSGPGRRRLPALDVRRFSPVRKSRPVPPRAPGRALEVLVLLCSVAAAVAAVAQPLALGRTLDLLLRDGDAGW
WLPLSAALLLGELLLDSATSLFTGRCNATWTASVRTRALRGLLRTAPEHARPYPPGDIGTRLTLNAADAGGAPAARAALA
ASLITPLGALVALALVDVWVALCVLTGLPALALLLRSFARDTGATVAAYQRTQSLIASRLLEALEGADTIGAAGTGERER
ARVLAPLAELAAQGRHMWALHGRALGRSGVLVPLLTLAATAVGGLRLAAGELSVGDLLAVGRYAQLTAGVGAAASLLGAI
VRAREARRRTRELERMTATVYGTRRLPPNGPGELRLCGVRVLRGGREVLRADGVRVPGGSTVAVVGRSGAGKSVLAAVAG
RLIDPDEGYVLLDGVRLDRLTHEALRTEVAYAFERPVLGEGTIAEAVADGARRSSRERVRQAARAAGADGFVRRLPHGYD
TPLPRAPLSGGEHQRLGLARAFAHAGRLLVLDDATSSLDTATEHEVDLALRRSVRPGTRLVVAHRPSVADRADLVLWLED
GQVRAVGTHRELWHTAGYREVFGAGAGAGAGAGAGAGADAGAGADAGPGPDSGAATAVGGSGPGPVRRPEPEEARP
>Q8NTD8 ~~~ramB~~~HTH-type transcriptional regulator RamB~~~COG1396
MGKTYVGSRLRQLRRERDLSQASLAATLGLSASYVNQIEHDVRPLTVPVLLRITEAFGVDATFFSRDDDSRLLAEVQDVM
LDREINPANVELQELSEMVYNHPQLARAMVEMHQRYRNVRDKFSIAVDNRTNTPEERRPIAEAVSMPHEEVRDFIYARQN
YFDALDRRAEAIAAQLGWQPYDSRAMEDSIARRLQMDHDVTITSSKEESGTLHHFDPETRLLTIHARLNPGQRAFRMATE
LGYLEANDLIEGIVDDGIWSTPEARTLAIRGVASYFAAAVMLPYKIFHSEAEKSGYDIEYLGQLFGVGYETTAHRLSTLQ
RPNLRGIPFTFVRVDRAGNMSKRQSATGFHFTHYGGTCPLWNVFETFTNPGQVLRQFAQMPDGRNYLWISRTVRHHEARF
GEVDKMFAIGLGCEARHADRTVYSRGFNLQDLSTATPIGSGCRVCTRENCAQRAFPSVHGRINIDAHESTIAPY
>P9WMI1 ~~~ramB~~~HTH-type transcriptional regulator RamB~~~COG1396
MSKTYVGSRVRQLRNERGFSQAALAQMLEISPSYLNQIEHDVRPLTVAVLLRITEVFGVDATFFASQDDTRLVAELREVT
LDRDLDIAIDPHEVAEMVSAHPGLACAVVNLHRRYRITTAQLAAATEERFSDGSGRGSITMPHEEVRDYFYQRQNYLHAL
DTAAEDLTAQMRMHHGDLARELTRRLTEVHGVRINKRIDLGDTVLHRYDPATNTLEISSHLSPGQQVFKMAAELAYLEFG
DLIDAMVTDGKFTSAESRTLARLGLANYFAAATVLPYRQFHDVAENFRYDVERLSAFYSVSYETIAHRLSTLQRPSMRGV
PFTFVRVDRAGNMSKRQSATGFHFSSSGGTCPLWNVYETFANPGKILVQIAQMPDGRNYLWVARTVELRAARYGQPGKTF
AIGLGCELRHAHRLVYSEGLDLSGDPNTAATPIGAGCRVCERDNCPQRAFPALGRALDLDEHRSTVSPYLVKQL
>Q7AKE5 ~~~ramB~~~ABC transporter ATP-binding protein RamB~~~COG1132
MTSTEAGALRRLAPPARRFLAHRKGVLVRLALWSLAESGQAFLVGHAVARSVDEGFLAGDPRRGLLWLGVALVAVLSGAR
VVRGVFAQLAGVTEPLRDGLVRHAVDRSMARAAPGGPGGTDRAAVSRLTNQVEIARDSFAGLVLTLRSFVFTAAGALLGL
LSLHPALLVVVLPPLAAGLALFLVTLRPMAAAQRRALAADEALGEHAASARAALRDLTACGTGPGAERHGADLVADAAAA
ARTLAGWAAVRTAALGVAGHLPVLALLVAVEWLRGHGVSVGALLGAFTYLVQSLLPALHTLMTALGAAGSRLLVVLDRIL
GPEPEPEPEPEPEPEPELGSGLEPEPEPASEPESGPSTASASAAAFAVHTAAAPAVELRSVTLSYGVRAEPVLDALDLRV
APGEHLAVVGPSGIGKSTLTRLVAGTLAPSRGEVRVAGRVVTGRPAAELAALRVLVPQDAYVFSGTVGDNLAYLRTDPSP
AELDAAVEAFGLAPLVERLGGLDATVRPAELSPGERQLVALVRAYLSPAPLLLLDEATCHLDPASEARAEKALAGRSGTL
VVVAHRLSSAVRADRTLVLDGIRAQSGTHAELLGRSPLYRDLTGHWNS
>O88037 2.7.-.-~~~ramC~~~Probable SapB synthase~~~COG0515
MTAATVRGGTPRDSVVSVSNRWEGAGVNKGYAVYCDADPYFYDAPHRTADRTGAARSRYAAASSPVPEGWQRHESGDWLA
LRPADADLPAQGWKIHVSACLDNAESVLDRVWRHCVDGGTAFKFVPSRYLLHQRNAKYADRAGSGKFVTVYPADEAEFER
LVGELSELLAGEPGPHILSDLRIGDGPVHVRYGGFTRRDCYDADGELRPAVSGPDGVLVPDLRGPVFRIPEWVDPPAFLR
PHLDARSAVTVTGMPYTVESALHFSNGGGVYLARDTRTGARVVLKEARPHAGLAADGADAVTRLHRERRALERLSGLACT
PEVLDHRTVGEHHFLVLEHIDGKPLNTFFARRHPLIEADPGERRLAEYTDWALDVHARVERAVAEVHARGVVFNDLHLFN
IMVRDDDSVALLDFEAAHHVDEAGRQIVANPGFVAPPDRRGVAVDRYALACLRIVLFLPLTSLLAVDRHKAAHLAEVVAE
QFPVDRAFLDAAVEEITRVDGSTRVDGSTRADETTRADETTRLDVTTRVHGAPDAARRPAGPVAPVRPDDWPRSRDSMAA
AIRASATPSRTDRLFPGDIAQFATAGGGLAFAHGAAGVLYALAESGAGRDEDGEQWLLERTKRPPSGMPLGFHDGLAGLA
WTLERLGHRDRALDLAELLLDQPLDHLGPDLHGGTAGLGLALESLAATTGQAALHSAALHCAELAADGLPGGSVPADRVS
RGRARAGLLYGGAGRALLFLRLFERTRDSALLDLARDALRQDLARCVRGAGGALQVDEGWRTMPYLGAGSVGIGMVLDDY
LAHRADEEFARAANEIVAAAQAMFYAQPGLYRGVAGMVLHLGRTTATAPGTGPRAVRRQLDALSWHAMSYRDRLAFPGEQ
MMRLSMDLSTGTAGCLLAVASVLGDAPAGLPFLPPPRRSGGPLTRPHQEP
>P0DX10 ~~~ramQ~~~Corrinoid activation enzyme RamQ~~~
MRVLFPLLEEEIDVSQESLVSDVCASIGLPLNLVCGGKGRCKKCLVNVKENGEMKEVLACQYSVSDGMEIYASREQAASQ
ILETSSNSDLPFDPIVKCYSLNYTDLVTPMCTYDLDVLRKLIPQTIDTPDYSVLKHFSEVYFLEGYERLHVIVSGNRIID
FIPSNEKEKPIYGIAFDIGTTSVVGYLYDITSGCLLNQHSMLNKQIAFGGDVISRIDYAGEGPESLHKIYEAIMETLAEI
ITQVCKKASIDTKDIYQTVYCGNSTMAHLFLELNPKHLGLAPFLGFCKDAISLKGMDTPLPINPKGTVTFMPLLGGFVGA
DTTAVLLGLPRDKKMRLMIDLGTNGEIAVGNCDKYYVASTACGPALEGAGLTMGMRGTTGAIEKVGCENGKITYSVIGKT
APQGFCGSGIVDAIAMLFREGLIAKRGNFIKGDALDAHPMKNRFGVDENNQRYFKIVTAGENPEGKDIIITQKDVRAVQL
AKAAIYTGCCLLSENYGIKGSDLEEIVIAGAFGNYIDVHNAQFIGLLPKIEGVPVRSIGNGAGTGAQLYLLSKEEAGICN
AIPRITTHIELATDPKFVETYMMNTMFGDNVMI
>Q7AKE4 ~~~ramR~~~Response regulator RamR~~~COG2197
MGEMVRIAVVHDEKLLRSALVQLLRSDDTLDVSSHCLDADGPELSAALPADVCVVDGECLTGPEDAGAGRLRARYGDRLV
VLATAKRPGVLRRAFDGGALGLVDKNAPAHRLITAVHTVARGERFLDETLTVALLKGAEMPLTTRELGVLTLASQGAPIA
EIAARLHLSRGTVRNYMATAVRKVGARNRVDAIRIVQSAGWT
>Q1M755 ~~~rapA2~~~Calcium-binding lectin RapA2~~~
MASPIHATDDSATFQETDVISGNLLSNDSSDNGHLFLRAFDGASVGAKSGNSQVTEIQGDYGTFFVKPDGSYTYVLSDAA
KIGFANGESFQEKVSYKISDGSGHTDVGLFTLNIQGVTQVKPIAVDDHYSFNEGDAIGGNVLDNDIAGDNGHLFLRQFDG
TNVSAKSGPDAVTDIVGDYGVFHVKPNGEFTYELTDDLAAGQTVTETVQYYKISDGEGHTDAGVLTLNITGTDALV
>Q00828 3.1.3.-~~~rapA~~~Response regulator aspartate phosphatase A~~~COG0457
MRMKQTIPSSYVGLKINEWYTHIRQFHVAEAERVKLEVEREIEDMEEDQDLLLYYSLMEFRHRVMLDYIKPFGEDTSQLE
FSELLEDIEGNQYKLTGLLEYYFNFFRGMYEFKQKMFVSAMMYYKRAEKNLALVSDDIEKAEFAFKMAEIFYNLKQTYVS
MSYAVQALETYQMYETYTVRRIQCEFVIAGNYDDMQYPERALPHLELALDLAKKEGNPRLISSALYNLGNCYEKMGELQK
AAEYFGKSVSICKSEKFDNLPHSIYSLTQVLYKQKNDAEAQKKYREGLEIARQYSDELFVELFQFLHALYGKNIDTESVS
HTFQFLEEHMLYPYIEELAHDAAQFYIENGQPEKALSFYEKMVHAQKQIQRGDCLYEI
>P60240 3.6.4.-~~~rapA~~~RNA polymerase-associated protein RapA~~~COG0553
MPFTLGQRWISDTESELGLGTVVAVDARTVTLLFPSTGENRLYARSDSPVTRVMFNPGDTITSHDGWQMQVEEVKEENGL
LTYIGTRLDTEESGVALREVFLDSKLVFSKPQDRLFAGQIDRMDRFALRYRARKYSSEQFRMPYSGLRGQRTSLIPHQLN
IAHDVGRRHAPRVLLADEVGLGKTIEAGMILHQQLLSGAAERVLIIVPETLQHQWLVEMLRRFNLRFALFDDERYAEAQH
DAYNPFDTEQLVICSLDFARRSKQRLEHLCEAEWDLLVVDEAHHLVWSEDAPSREYQAIEQLAEHVPGVLLLTATPEQLG
MESHFARLRLLDPNRFHDFAQFVEEQKNYRPVADAVAMLLAGNKLSNDELNMLGEMIGEQDIEPLLQAANSDSEDAQSAR
QELVSMLMDRHGTSRVLFRNTRNGVKGFPKRELHTIKLPLPTQYQTAIKVSGIMGARKSAEDRARDMLYPERIYQEFEGD
NATWWNFDPRVEWLMGYLTSHRSQKVLVICAKAATALQLEQVLREREGIRAAVFHEGMSIIERDRAAAWFAEEDTGAQVL
LCSEIGSEGRNFQFASHMVMFDLPFNPDLLEQRIGRLDRIGQAHDIQIHVPYLEKTAQSVLVRWYHEGLDAFEHTCPTGR
TIYDSVYNDLINYLASPDQTEGFDDLIKNCREQHEALKAQLEQGRDRLLEIHSNGGEKAQALAESIEEQDDDTNLIAFAM
NLFDIIGINQDDRGDNMIVLTPSDHMLVPDFPGLSEDGITITFDREVALAREDAQFITWEHPLIRNGLDLILSGDTGSST
ISLLKNKALPVGTLLVELIYVVEAQAPKQLQLNRFLPPTPVRMLLDKNGNNLAAQVEFETFNRQLNAVNRHTGSKLVNAV
QQDVHAILQLGEAQIEKSARALIDAARNEADEKLSAELSRLEALRAVNPNIRDDELTAIESNRQQVMESLDQAGWRLDAL
RLIVVTHQ
>P70962 3.1.3.-~~~rapB~~~Response regulator aspartate phosphatase B~~~COG0457
MAAYEIPSSQVGVKINKWYKHILAFQVADAVKLKEEIDLDIEQMEEDQLLLLYYQLISYRHQIMLDYVKPDLHEESQLQY
RELIKTLESNQDSISGLSEYYFHLFRGMYEFEQNNYISAISFYRKAEKMLAFVEDEIERAEFHFKVAEVFYIMKQTHFSM
NHAVQALETYKAHDFYRVRRIQCHFVISGNYIDYRNYEKALEHLDDAYRLALLEGQPRLIGSALYNIGNCYDDKGELDQA
AEYFEKALPVFEDYQLEQLPKALFSLTRVLFKKQDSEAAIRYYEKGIAIAQKRNDFFSLAKYKFLQALYVESVNLNMIQE
VFDYMEEKGLYVYIEEFALDAASYFSHREQYKEAVYFYEKAVSMREMIQRNDCLYEV
>P94415 ~~~rapC~~~Regulatory protein RapC~~~COG0457
MKSGVIPSSAVGQKINEWYRYIRTFSVPDAEVLKAEIQQELKHMQHDSNLLLYYSLMEFRHQLMLDYLEPLEKLNIEDQP
SLSELSRNIDSNQADLKGLLDYYVNFFRGMYEFDKREFISAITYYKQAEKKLSFVADHIERAEFYFKIAEAYYYMKQTYF
SLINIKNAYEIYVEQETYNVRIIQCHFVFGVNLMDERNFEQAARHFKLALNMAQAEQKAQLVGRAYYNLGLCYYNQDLLD
PAIDYFEKAVSTFESSRIVNSLPQAYFLITLIYYKQGKHDKASEYHKRGYEYAKETDDADYAVKFEFLQSLYLDQPNEEG
IERCFQYLKNKNMYADIEDLALEVAKYYYEQKWFKLSASYFLQVEEARKQIQRSEGLYEIEI
>P45943 3.1.3.-~~~rapE~~~Response regulator aspartate phosphatase E~~~COG0457
MISITSAEVGMKINEWHRHIQKFNVTDAEMLKAEIERDIEVMEEDQDLLIYYQLMAFRHKIMLEYTLPSDENRMELSEYL
NKIEGHKKKLDNMRAYYYNFFRGMYEFRNGEYTRAITYYKKAERKIPTISDKIEKAEFYFKLSEVYYHMKMTHISMHYAE
LSYNIYKKHELYSVRRIQCHFVIAGNYDDLENHEKALPHLQEALKGAELLKSKNTHIYATAFFNLGNCYHKMDNLNKAAR
YIEQALVQYRKINSDVLPQAYHDLALIYFKQGKKEQAMDCFRKGIRSAVDFKDELFMNLFEALDVLYIRNGDTPKLLNIF
SRLENGKGYPYLEELALLGGNLFDYNGKIEDSIICFKKMVYAQKQISKGECMYEI
>P71002 ~~~rapF~~~Regulatory protein RapF~~~COG0457
MTGVISSSSIGEKINEWYMYIRRFSIPDAEYLRREIKQELDQMEEDQDLHLYYSLMEFRHNLMLEYLEPLEKMRIEEQPR
LSDLLLEIDKKQARLTGLLEYYFNFFRGMYELDQREYLSAIKFFKKAESKLIFVKDRIEKAEFFFKMSESYYYMKQTYFS
MDYARQAYEIYKEHEAYNIRLLQCHSLFATNFLDLKQYEDAISHFQKAYSMAEAEKQPQLMGRTLYNIGLCKNSQSQYED
AIPYFKRAIAVFEESNILPSLPQAYFLITQIHYKLGKIDKAHEYHSKGMAYSQKAGDVIYLSEFEFLKSLYLSGPDEEAI
QGFFDFLESKMLYADLEDFAIDVAKYYHERKNFQKASAYFLKVEQVRQLIQGGVSLYEIEV
>O32294 ~~~rapG~~~Regulatory protein RapG~~~COG0457
MNKIAPAEIASMLNDWYLAIKKHEVEESSRLFEEVKPLLDDMEEDQEVLAYFSLLELRHKVLLHEARGQGFQHEEPSHMN
ATSDMLKYYFFLFEGMYEAYKNNYDIAIGLYKDAEQYLDNIPDPIEKAEFHLKVGKLYYKLGQNIVSLNHTRQAVKTFRE
ETDYKKKLASALITMSGNFTEMSQFEEAEAYLDEAIRITSELEDHFFEAQLLHNFGLLHAQSGKSEEAVSKLEEALQNDE
YARSAYYYHSAYLLIRELFKIKKKEQALSYYQDVKEKLTAEPNRICEAKIDILYAIYAEGGHAETFHLCKQHMDDLLSEK
EYDSVRELSILAGERYRELELYKEAAHFFYEALQIEELIKRTEVI
>Q59HN8 3.1.3.-~~~rapH~~~Response regulator aspartate phosphatase H~~~COG0457
MSQAIPSSRVGVKINEWYKMIRQFSVPDAEILKAEVEQDIQQMEEDQDLLIYYSLMCFRHQLMLDYLEPGKTYGNRPTVT
ELLETIETPQKKLTGLLKYYSLFFRGMYEFDQKEYVEAIGYYREAEKELPFVSDDIEKAEFHFKVAEAYYHMKQTHVSMY
HILQALDIYQNHPLYSIRTIQSLFVIAGNYDDFKHYDKALPHLEAALELAMDIQNDRFIAISLLNIANSYDRSGDDQMAV
EHFQKAAKVSREKVPDLLPKVLFGLSWTLCKAGQTQKAFQFIEEGLDHITARSHKFYKELFLFLQAVYKETVDERKIHDL
LSYFEKKNLHAYIEACARSAAAVFESSCHFEQAAAFYRKVLKAQEDILKGECLYAY
>P96649 3.1.3.-~~~rapI~~~Response regulator aspartate phosphatase I~~~COG0457
MRGVFLDKDKIPYDLVTKKLNEWYTSIKNDQVEQAEIIKTEVEKELLNMEENQDALLYYQLLEFRHEIMLSYMKSKEIED
LNNAYETIKEIEKQGQLTGMLEYYFYFFKGMYEFRRKELISAISAYRIAESKLSEVEDEIEKAEFFFKVSYVYYYMKQTY
FSMNYANRALKIFREYEEYAVQTVRCQFIVAGNLIDSLEYERALEQFLKSLEISKESNIEHLIAMSHMNIGICYDELKEY
KKASQHLILALEIFEKSKHSFLTKTLFTLTYVEAKQQNYNVALIYFRKGRFIADKSDDKEYSAKFKILEGLFFSDGETQL
IKNAFSYLASRKMFADVENFSIEVADYFHEQGNLMLSNEYYRMSIEARRKIKKGEIIDENQPDSIGSSDFK
>O34327 3.1.3.-~~~rapJ~~~Response regulator aspartate phosphatase J~~~COG0457
MRAKIPSEEVAVKLNEWYKLIRAFEADQAEALKQEIEYDLEDMEENQDLLLYFSLMEFRHRIMLDKLMPVKDSDTKPPFS
DMLNEIESNQQKLTGLLEYYFYYFRGMYEFKQKNFILAIDHYKHAEEKLEYVEDEIEKAEFLFKVAEVYYHIKQTYFSMN
YASQALDIYTKYELYGRRRVQCEFIIAGNLTDVYHHEKALTHLCSALEHARQLEEAYMIAAAYYNVGHCKYSLGDYKEAE
GYFKTAAAIFEEHNFQQAVQAVFSLTHIYCKEGKYDKAVEAYDRGIKSAAEWEDDMYLTKFRLIHELYLGSGDLNVLTEC
FDLLESRQLLADAEDLLHDTAERFNQLEHYESAAFFYRRLMNIKKKLAEQRFQ
>Q54305 3.3.2.13~~~rapK~~~Chorismatase~~~COG0251
MTPPVTAPYCRFEKLGASDLDGDETLLGVIEHRTGHTGVSLAEGCPRTAVHTTTREDESFAEAWHAEGPKESGSHDGVAW
ARTPDYLFGVARVPEGGRYAAGTAAVYTGIFDLIGTLGYPSLARTWNYVSGINTPNADGLEVYRDFCVGRAEALDARGID
PATMPAATGIGAHGGGITCYFIAARAGDRVNMENPAVLTAHRYPQRYGPRPPVFSRATWLSPPGADDGRLFVSATAGIVG
HETVHHGDVAAQCEVSLENIARVIGAENLGRHGLRRGYALADVDHLKVYVRHREDISTVRRICAERLSREATVAVLHTDI
ARTDLLVEIEGVVA
>Q54304 4.3.1.28~~~rapL~~~L-lysine cyclodeaminase~~~COG2423
MQTKVLCQRDIKRILSVVGRDVMMDRLISEVHAGFARLGRGETDEPPPRTGFARGGDVPGVIEFMPHRASGIGVTMKTVS
YSPQNFERFNLPTIVGTVSRLDDDSGSMVALADAATITAMRTGAVAAVATRLLARPGSTTLALIGAGAQAVTQAHALSRV
LPLERILISDIKAEHAESFAGRVAFLELPVEVTDAATAMATADVLCTVTSVPVGGGPVVPAEPRQAHLHVNGIGADEQGK
TELPKALLDDAFICVDHPGQARAEGEFQQLPDRELGPSLADLCAAPEIAAPHPERLSVFDSTGSAFADHIALDVLLGFAD
ELGLGHKMSIESTPEDVLDPYSL
>P0A894 ~~~rapZ~~~RNase adapter protein RapZ~~~COG1660
MVLMIVSGRSGSGKSVALRALEDMGFYCVDNLPVVLLPDLARTLADREISAAVSIDVRNMPESPEIFEQAMSNLPDAFSP
QLLFLDADRNTLIRRYSDTRRLHPLSSKNLSLESAIDKESDLLEPLRSRADLIVDTSEMSVHELAEMLRTRLLGKREREL
TMVFESFGFKHGIPIDADYVFDVRFLPNPHWDPKLRPMTGLDKPVAAFLDRHTEVHNFIYQTRSYLELWLPMLETNNRSY
LTVAIGCTGGKHRSVYIAEQLADYFRSRGKNVQSRHRTLEKRKP
>P0AAZ4 ~~~rarA~~~Replication-associated recombination protein A~~~COG2256
MSNLSLDFSDNTFQPLAARMRPENLAQYIGQQHLLAAGKPLPRAIEAGHLHSMILWGPPGTGKTTLAEVIARYANADVER
ISAVTSGVKEIREAIERARQNRNAGRRTILFVDEVHRFNKSQQDAFLPHIEDGTITFIGATTENPSFELNSALLSRARVY
LLKSLSTEDIEQVLTQAMEDKTRGYGGQDIVLPDETRRAIAELVNGDARRALNTLEMMADMAEVDDSGKRVLKPELLTEI
AGERSARFDNKGDRFYDLISALHKSVRGSAPDAALYWYARIITAGGDPLYVARRCLAIASEDVGNADPRAMQVAIAAWDC
FTRVGPAEGERAIAQAIVYLACAPKSNAVYTAFKAALADARERPDYDVPVHLRNAPTKLMKEMGYGQEYRYAHDEANAYA
AGEVYFPPEIAQTRYYFPTNRGLEGKIGEKLAWLAEQDQNSPIKRYR
>P27844 ~~~rarD~~~Protein RarD~~~COG2962
MDAKQTRQGVLLALAAYFIWGIAPAYFKLIYYVPADEILTHRVIWSFFFMVVLMSICRQWSYLKTLIQTPQKIFMLAVSA
VLIGGNWLLFIWAVNNHHMLEASLGYFINPLVNIVLGMIFLGERFRRMQWLAVILAICGVLVQLWTFGSLPIIALGLAFS
FAFYGLVRKKIAVEAQTGMLIETMWLLPVAAIYLFAIADSSTSHMGQNPMSLNLLLIAAGIVTTVPLLCFTAAATRLRLS
TLGFFQYIGPTLMFLLAVTFYGEKPGADKMVTFAFIWVALAIFVMDAIYTQRRTSK
>O31754 3.4.24.-~~~rasP~~~Regulator of sigma-W protease RasP~~~COG0750
MFVNTVIAFIIIFGTLVFFHELGHLLLAQRAGILCREFAIGFGPKIFSFKKNETVYTIRLLPVGGFVRMAGEDPEMIEVK
PGYTVGLLFNKEDQVEKVIINQKEKYPDALVIEVETADLEHDMKITGYEQGKEDELSSFTVSETSFFIVDGEEVQIAPYN
RQFGSKPVWQRIKAIAAGPIMNFILAYVILVMLGLIQGVPSNEPMLGQLTDNGRAAEAGLKEGDYIQSINGEKMRSWTDI
VSAVKENPEKEMDVAVKRDNKTLHISVTPEAVKDENKKTIGRFGSYAPTEKGVLSAVAYGATSTVDVTKAILTNLSKLVT
GQFKLDMLSGPVGIYDMTDQVAKTGIVNLFQFAAFLSINLGIVNLLPIPALDGGRLLFLFIEAIRGKPINREKEAFVVFI
GVAFLMLLMLVVTWNDIQRLFL
>P0AGL5 ~~~ratA~~~Ribosome association toxin RatA~~~COG2867
MILFVGFLLMEIVMPQISRTALVPYSAEQMYQLVNDVQSYPQFLPGCTGSRILESTPGQMTAAVDVSKAGISKTFTTRNQ
LTSNQSILMNLVDGPFKKLIGGWKFTPLSQEACRIEFHLDFEFTNKLIELAFGRVFKELAANMVQAFTVRAKEVYSAR
>P52119 ~~~ratB~~~UPF0125 protein RatB~~~COG2914
MPGKIAVEVAYALPEKQYLQRVTLQEGATVEEAIRASGLLELRTDIDLTKNKVGIYSRPAKLSDSVHDGDRVEIYRPLIA
DPKELRRQRAEKSANK
>P31473 3.6.3.-~~~ravA~~~ATPase RavA~~~COG0714
MAHPHLLAERISRLSSSLEKGLYERSHAIRLCLLAALSGESVFLLGPPGIAKSLIARRLKFAFQNARAFEYLMTRFSTPE
EVFGPLSIQALKDEGRYERLTSGYLPEAEIVFLDEIWKAGPAILNTLLTAINERQFRNGAHVEKIPMRLLVAASNELPEA
DSSLEALYDRMLIRLWLDKVQDKANFRSMLTSQQDENDNPVPDALQVTDEEYERWQKEIGEITLPDHVFELIFMLRQQLD
KLPDAPYVSDRRWKKAIRLLQASAFFSGRSAVAPVDLILLKDCLWYDAQSLNLIQQQIDVLMTGHAWQQQGMLTRLGAIV
QRHLQLQQQQSDKTALTVIRLGGIFSRRQQYQLPVNVTASTLTLLLQKPLKLHDMEVVHISFERSALEQWLSKGGEIRGK
LNGIGFAQKLNLEVDSAQHLVVRDVSLQGSTLALPGSSAEGLPGEIKQQLEELESDWRKQHALFSEQQKCLFIPGDWLGR
IEASLQDVGAQIRQAQQC
>Q5ZUV9 3.4.22.-~~~ravZ~~~Cysteine protease RavZ~~~
MKGKLTGKDKLIVDEFEELGEQESDIDEFDLLEGDEKLPGDSELDKTTSIYPPETSWEVNKGMNSSRLHKLYSLFFDKSS
AFYLGDDVSVLEDKPLTGAYGFQSKKNDQQIFLFRPDSDYVAGYHVDAKSDAGWVNDKLDRRLSEISEFCSKATQPATFI
LPFVEMPTDITKGVQHQVLLTISYDPKSKQLTPTVYDSIGRDTYSESLSSYFKGKYRTTCDEILTQSIEKAIKSTDFTLG
KFTRAAYNHQNRLTEGNCGSYTFRTIKEVISSSAQGTEVKIPGSGYITSNSYLTSQHVQDIESCIKYRNLGVVDIESALT
EGKTLPVQLSEFIVALEDYGKLRSQQSEKSMLNFIGYSKTAKLTAVELLIGILNDIKGKNEISESQYDKLVKEVDCLMDS
SLGKLVQFHLKNLGAESLQKLVLPCVKFDDTIDDFVTIEKDELFDVPDITGEELASKKGIEQGALDKEALLKQKQIKTDL
LDLREEDKTGLKKPLHGGIKVK
>Q47152 ~~~rayT~~~REP-associated tyrosine transposase~~~COG1943
MSEYRRYYIKGGTWFFTVNLRNRRSQLLTTQYQMLRHAIIKVKRDRPFEINAWVVLPEHMHCIWTLPEGDDDFSSRWREI
KKQFTHACGLKNIWQPRFWEHAIRNTKDYRHHVDYIYINPVKHGWVKQVSDWPFSTFHRDVARGLYPIDWAGDVTDFSAG
ERIIS
>P37624 ~~~rbbA~~~Ribosome-associated ATPase~~~COG0842
MTHLELVPVPPVAQLAGVSQHYGKTVALNNITLDIPARCMVGLIGPDGVGKSSLLSLISGARVIEQGNVMVLGGDMRDPK
HRRDVCPRIAWMPQGLGKNLYHTLSVYENVDFFARLFGHDKAEREVRINELLTSTGLAPFRDRPAGKLSGGMKQKLGLCC
ALIHDPELLILDEPTTGVDPLSRSQFWDLIDSIRQRQSNMSVLVATAYMEEAERFDWLVAMNAGEVLATGSAEELRQQTQ
SATLEEAFINLLPQAQRQAHQAVVIPPYQPENAEIAIEARDLTMRFGSFVAVDHVNFRIPRGEIFGFLGSNGCGKSTTMK
MLTGLLPASEGEAWLFGQPVDPKDIDTRRRVGYMSQAFSLYNELTVRQNLELHARLFHIPEAEIPARVAEMSERFKLNDV
EDILPESLPLGIRQRLSLAVAVIHRPEMLILDEPTSGVDPVARDMFWQLMVDLSRQDKVTIFISTHFMNEAERCDRISLM
HAGKVLASGTPQELVEKRGAASLEEAFIAYLQEAAGQSNEAEAPPVVHDTTHAPRQGFSLRRLFSYSRREALELRRDPVR
STLALMGTVILMLIMGYGISMDVENLRFAVLDRDQTVSSQAWTLNLSGSRYFIEQPPLTSYDELDRRMRAGDITVAIEIP
PNFGRDIARGTPVELGVWIDGAMPSRAETVKGYVQAMHQSWLQDVASRQSTPASQSGLMNIETRYRYNPDVKSLPAIVPA
VIPLLLMMIPSMLSALSVVREKELGSIINLYVTPTTRSEFLLGKQLPYIALGMLNFFLLCGLSVFVFGVPHKGSFLTLTL
AALLYIIIATGMGLLISTFMKSQIAAIFGTAIITLIPATQFSGMIDPVASLEGPGRWIGEVYPTSHFLTIARGTFSKALD
LTDLWQLFIPLLIAIPLVMGLSILLLKKQEG
>Q44212 ~~~rbcX~~~RuBisCO chaperone RbcX~~~
MNLKQIAKDTAKTLQSYLTYQALRTVLAQLGETNPPLALWLHNFSAGKVQDGEKYIEELFLEKPDLALRIMTVREHIAEE
IAEFLPEMVVTGIQQANMEKRRQHLERMTQVSLSHPSPESEQQQFSDPDWDNLAS
>O86418 ~~~rbcX~~~RuBisCO chaperone RbcX~~~
MNLKQIAKDTAKTLQSYLTYQALMTVLAQLGETNPPLALWLHTFSVGKVQDGEAYVKELFREQPDLALRIMTVREHIAEE
VAEFLPEMVRSGIQQANMEQRRQHLERMTHLSLSNPSPESEQQTISDTDWDH
>Q31N04 ~~~rbcX~~~RuBisCO chaperone RbcX~~~
MASTQRAKPMEMPRISRDTARMLVNYLTYQAVCVIRDQLAETNPAGAYRLQVFSAEFSFQDGEAYLAALLNHDRELGLRV
MTVREHLAEHILDYLPEMTIAQIQEANINHRRALLERLTGLGAEPSLPETEVSDRPSDSATPDDASNASHAD
>Q44177 ~~~rbcX~~~RuBisCO chaperone RbcX~~~
MEFKKVAKETAITLQSYLTYQAVRLISQQLSETNPGQAIWLGEFSKRHPIQESDLYLEAMMLENKELVLRILTVRENLAE
GVLEFLPEMVLSQIKQSNGNHRRSLLERLTQVDSSSTDQTEPNPGESDTSEDSE
>A0A0H3K9R3 ~~~rbcX~~~RuBisCO chaperone RbcX~~~
MQFMGTASRMASTQRAKPMEMPRISRDTARMLVNYLTYQAVCVIRDQLAETNPAGAYRLQVFSAEFSFQDGEAYLAALLN
HDRELGLRVMTVREHLAEHILDYLPEMTIAQIQEANINHRRALLERLTGLGAEPSLPETEVSDRPSDSATPDDASNASHA
D
>Q55670 ~~~rbcX~~~RuBisCO chaperone RbcX~~~
MFMQTKHIAQATVKVLQSYLTYQAVLRIQSELGETNPPQAIWLNQYLASHSIQNGETFLTELLDENKELVLRILAVREDI
AESVLDFLPGMTRNSLAESNIAHRRHLLERLTRTVAEVDNFPSETSNGESNNNDSPPS
>Q8DIS6 ~~~rbcX~~~RuBisCO chaperone RbcX~~~
MDVKHIAKQTTKTLISYLTYQAVRTVIGQLAETDPPRSLWLHQFTSQESIQDGERYLEALFREQPDLGFRILTVREHLAE
MVADYLPEMLRAGIQQANLQQRCQQLERMTQVSEANVENSNLETPE
>P0A7G2 ~~~rbfA~~~30S ribosome-binding factor~~~COG0858
MAKEFGRPQRVAQEMQKEIALILQREIKDPRLGMMTTVSGVEMSRDLAYAKVYVTFLNDKDEDAVKAGIKALQEASGFIR
SLLGKAMRLRIVPELTFFYDNSLVEGMRMSNLVTSVVKHDEERRVNPDDSKED
>P45141 ~~~rbfA~~~Ribosome-binding factor A~~~COG0858
MAREFKRSDRVAQEIQKEIAVILQREVKDPRIGMVTVSDVEVSSDLSYAKIFVTFLFDHDEMAIEQGMKGLEKASPYIRS
LLGKAMRLRIVPEIRFIYDQSLVEGMRMSNLVTNVVREDEKKHVEESN
>P75589 ~~~rbfA~~~Ribosome-binding factor A~~~
MASYKKERLENDIIRLINRTVIHEIYNETVKTGHVTHVKLSDDLLHVTVYLDCYNREQIDRVVGAFNQAKGVFSRVLAHN
LYLAKAVQIHFVKDKAIDNAMRIESIINSLKKSKPN
>P9WHJ7 ~~~rbfA~~~Ribosome-binding factor A~~~COG0858
MADAARARRLAKRIAAIVASAIEYEIKDPGLAGVTITDAKVTADLHDATVYYTVMGRTLHDEPNCAGAAAALERAKGVLR
TKVGAGTGVRFTPTLTFTLDTISDSVHRMDELLARARAADADLARVRVGAKPAGEADPYRDNGSVAQSPAPGGLGIRTSD
GPEAVEAPLTCGGDTGDDDRPKE
>Q2G2Q4 ~~~rbfA~~~Ribosome-binding factor A~~~COG0858
MSSMRAERVGEQMKKELMDIINNKVKDPRVGFITITDVVLTNDLSQAKVFLTVLGNDKEVENTFKALDKAKGFIKSELGS
RMRLRIMPELMYEYDQSIEYGNKIERMIQDLHKQDR
>P65967 ~~~rbfA~~~Ribosome-binding factor A~~~
MSSMRAERVGEQMKKELMDIINNKVKDPRVGFITITDVVLTNDLSQAKVFLTVLGNDKEVENTFKALDKAKGFIKSELGS
RMRLRIMPELMYEYDQSIEYGNKIERMIQDLHKQDR
>Q55625 ~~~rbfA~~~Ribosome-binding factor A~~~COG0858
MATSRRVSRVSSLIKREVSQMLLHEIKDDRVGTGMVSVTEVEVSGDLQHAKIFVSIYGSPEAKASTMAGLHSAAPFVRRE
LGQRMRLRRTPEVSFLEDRSLERGDKILNLLNNLPQAIATEDLEDDDSGLALD
>Q9WZV9 ~~~rbfA~~~Ribosome-binding factor A~~~COG0858
MNPAYRKAMLESEIQKLLMEALQQLRDPRLKKDFVTFSRVELSKDKRYADVYVSFLGTPEERKETVEILNRAKGFFRTFI
AKNLRLYVAPEIRFYEDKGIEASVKVHQLLVQLGYDPLKDKEKKEEDKEEE
>Q5SJV1 ~~~rbfA~~~Ribosome-binding factor A~~~COG0858
MAYGKAHLEAQLKRALAEEIQALEDPRLFLLTVEAVRLSKDGSVLSVYVEAFREEEGALRALSRAERRLVAALARRVRMR
RLPRLEFLPWRASPA
>O31743 ~~~rbgA~~~Ribosome biogenesis GTPase A~~~COG1161
MTIQWFPGHMAKARREVTEKLKLIDIVYELVDARIPMSSRNPMIEDILKNKPRIMLLNKADKADAAVTQQWKEHFENQGI
RSLSINSVNGQGLNQIVPASKEILQEKFDRMRAKGVKPRAIRALIIGIPNVGKSTLINRLAKKNIAKTGDRPGITTSQQW
VKVGKELELLDTPGILWPKFEDELVGLRLAVTGAIKDSIINLQDVAVFGLRFLEEHYPERLKERYGLDEIPEDIAELFDA
IGEKRGCLMSGGLINYDKTTEVIIRDIRTEKFGRLSFEQPTM
>Q9WZM6 ~~~rbgA~~~Ribosome biogenesis GTPase A~~~COG1161
MSWYPGHIEKAKRQIKDLLRLVNTVVEVRDARAPFATSAYGVDFSRKETIILLNKVDIADEKTTKKWVEFFKKQGKRVIT
THKGEPRKVLLKKLSFDRLARVLIVGVPNTGKSTIINKLKGKRASSVGAQPGITKGIQWFSLENGVKILDTPGILYKNIF
SEDLAAKLLLVGSLPVERIEDQRIFERAFEIFARSIGIESSFSEFFEDFARKRGLLKKGGVPDIERALMLFFTEVAQGKA
GRVSFERPEDITPVQQEQTRGV
>P22849 4.1.1.39~~~cbbL1~~~Ribulose bisphosphate carboxylase large chain 1~~~COG1850
MAKTYSAGVKEYRETYWMPNYTPKDTDILACFKITPQAGVPREEAAAAVAAESSTGTWTTVWTDLLTDLDYYKGRAYAIE
DVPGDDTCFYAFIAYPIDLFEEGSVVNVFTSLVGNVFGFKAVRALRLEDVRFPIAYVMTCNGPPHGIQVERDIMNKYGRP
MLGCTIKPKLGLSAKNYGRAVYECLRGGLDFTKDDENVNSQPFMRWRQRFDFVMDAIDKAERETGERKGHYLNVTAPTPE
EMYKRAEYAKEIGAPIIMHDYITGGFCANTGLAQWCRDNGVLLHIHRAMHAVLDRNPHHGIHFRVLTKILRLSGGDHLHT
GTVVGKLEGDRASTLGWIDLLRESYIKEDRSRGLFFDQDWGSMPGAFAVASGGIHVWHMPALVTIFGDDSVLQFGGGTLG
HPWGNAAGACANRVALEACVEARNQGVAIEKEGKDVLTKAAASSPELKIAMETWKEIKFEFDTVDKLDIAHK
>Q31IK0 4.1.1.39~~~cbbL1~~~Ribulose bisphosphate carboxylase large chain 1~~~COG1850
MAKTYNAGVKEYRETYWMPEYEPKDSDFLACFKVIPQDGVPREEIAAAVAAESSTGTWTTVWTDLLTDLDYYKGRAYKIE
DVPGDDAAFYAFIAYPIDLFEEGSVVSVMTSLVGNVFGFKALRACRLEDIRFPLAYVMTCGGPPHGIQVERDKMDKYGRP
MLGCTIKPKLGLSAKNYGRAVYECLRGGLDFTKDDENVTSQPFMRWRDRFLFCQDAIEKAQAETGERKGHYLNCTAGTPE
EMYERAEFAKEIGTPIIMHDYLTGGFTANTGLANYCRKNGLLLHIHRAMHGVIDRNPHHGIHFRVLTKALRLSGGDHLHS
GTVVGKLEGDREATLGWIDLMRDSFIPEDRSRGIMFDQDFGAMPGVMPVASGGIHVWHMPALVSIFGDDSVLQFGGGTLG
HPWGNAAGAAANRVALEACVQARNEGKEVEKEGKEILTNAAKHSPELKIAMETWKEIKFEFDTVDKLDVKHK
>Q59458 4.1.1.39~~~cbbL1~~~Ribulose bisphosphate carboxylase large chain 1~~~
MAKTYNAGVKEYRETYWMPEYEPKDSDFLACFKVVPQPGVPREEIAAAVAAESSTGTWTTVWTDLLTDLDYYKGRAYRIE
DVPGDDSAFYAFIAYPIDLFEEGSIVSVMTSLVGNVFGFKALRSIRLEDIRFPLAYVMTCGGPPHGIQVERDKMDKYGRP
MLGCTIKPKLGLSAKNYGRAVYECLRGGLDFTKDDENVTSQPFMRWRDRFLFCQDAIEKAQDETGERTGHYLNATAGTPE
EMYERAEFAKEIGSPIVMHDFLTGGLTANTGLANYCRKNGLLLHIHRAMHGVIDRNPLHGIHFRVLSKVLRLSGGDHLHS
GTVVGKLEGDRGSDLGWIDIMRDSFIAEDRSRGIMFDQDFGEMPGVIPVASGGIHVWHMPALVAIFGDDSVLQFGGGTIG
HPWGNAVGAAVNLVALEACVQARNEGQEIEKNGKEILTNDGKHSPELKIAMETWKEIKFEFDTVDKLDLSHK
>P22859 4.1.1.39~~~cbbL2~~~Ribulose bisphosphate carboxylase large chain 2~~~COG1850
MSTKTYDAGVKDYALTYWTPDYVPLDSDLLACFKVTPQAKVSREEAAAAVAAESSTGTWTTVWSDLLTDLDYYKGRAYRI
EDVPGDKESFYAFIAYPLDLFEEGSIVNVLTSLVGNVFGFKAVRALRLEDIRFPLHYVKTCGGPPNGIQVERDRMDKYGR
PFLGATVKPKLGLSAKNYGRAVYEMLRGGLDFTKDDENVNSQPFMRWQNRFEFVSEAVRKAQEETGERKGHYLNVTAPTC
EEMFKRAEFAKECGAPIIMHDFLTGGFTANTSLANWCRDNGMLLHIHRAMHAVIDRNPKHGIHFRVLAKCLRLSGGDHLH
TGTVVGKLEGDRQSTLGFVDQLRESFIPEDRSRGLFFDQDWGGMPGVMAVASGGIHVWHIPALVTIFGDDSVLQFGGGTQ
GHPWGNAAGAAANRVATEACVKARNEGVEIEKHAREVLSDAARHSPELAVAMETWKEIKFEFDVVDKLDAA
>Q31HD9 4.1.1.39~~~cbbL2~~~Ribulose bisphosphate carboxylase large chain 2~~~COG1850
MASKTFDAGVQDYQLTYWTPDYTPLDTDLLACFKVVPQDGVPREEAAAAVAAESSTGTWTTVWTDLLTDMEFYKGRCYRI
EDVPGDKNAFYAFIAYPLDLFEEGSVVNVLTSLVGNVFGFKAVRSLRLEDLRFPIAFIKTCGGPPAGIQVERDKLNKYGR
PMLGCTIKPKLGLSAKNYGRAVYECLRGGLDLTKDDENINSQPFQRWQNRFEFVADAVDKATAETGERKGHYLNVTAGTV
EEMMKRAEFAKELGQPIIMHDFLTAGFTANTTLANWCRDNGMLLHIHRAMHAVIDRNPNHGIHFRVLAKCLRLSGGDHLH
TGTVVGKLEGDRASTLGFVDQLREAFVPEDRSRGVFFDQDWGSMPGVMAVASGGIHVWHMPALVNIFGDDSVLQFGGGTQ
GHPGGNAAGAAANRVALEACVKARNEGRDLEREGGDILRDAARNSKELAVALDTWKEIKFEFDTVDKLDVG
>Q59460 4.1.1.39~~~cbbL2~~~Ribulose bisphosphate carboxylase large chain 2~~~
MASKTFDAGVQDYQLTYWTPDYTPLDTDLLACFKVVPQEGVPREEAAAAVAAESSTGTWTTVWTDLLTDMEFFKGRAYRI
EDVPGDKNAFYAFIAYPLDLFEEGSVVNVLTSLVGNVFGFKAVRSLRLEDLRFPIAFIKTCGGPPSGIQVERDKLNKYGR
PMLGCTIKPKLGLSAKNYGRAVYECLRGGLDLTKDDENINSQPFQRWRDRFEFVAEAVDKATAETGERKGHYLNVTAGTV
EEMMKRAEFAKELGQPIIMHDFLTAGFTANTTLANWCRENGMLLHIHRAMHAVIDRNPLHGIHFRVLAKCLRLSGGDHLH
TGTVVGKLEGDRASTLGFVDQLRESFVPEDRSRGVFFDQDWGSMPGVMAVASGGIHVWHMPALVNIFGDDSVLQFGGGTQ
GHPGGNAAGAAANRVALEACVKARNEGRDLEREGGDILREAARTSKELAVALETWKEIKFEFDTVDKLDVQ
>P0C2C2 4.1.1.39~~~cbbL1~~~Ribulose bisphosphate carboxylase large chain, chromosomal~~~
MNAPETIQAKPRKRYDAGVMKYKEMGYWDGDYVPKDTDVLALFRITPQDGVDPVEAAAAVAGESSTATWTVVWTDRLTAC
DMYRAKAYRVDPVPNNPEQFFCYVAYDLSLFEEGSIANLTASIIGNVFSFKPIKAARLEDMRFPVAYVKTFAGPSTGIIV
ERERLDKFGRPLLGATTKPKLGLSGRNYGRVVYEGLKGGLDFMKDDENINSQPFMHWRDRFLFVMDAVNKASAATGEVKG
SYLNVTAGTMEEMYRRAEFAKSLGSVIIMVDLIVGWTCIQSMSNWCRQNDMILHLHRAGHGTYTRQKNHGVSFRVIAKWL
RLAGVDHMHTGTAVGKLEGDPLTVQGYYNVCRDAYTQTDLTRGLFFDQDWASLRKVMPVASGGIHAGQMHQLIHLFGDDV
VLQFGGGTIGHPQGIQAGATANRVALEAMVLARNEGRDILNEGPEILRDAARWCAPLRAALDTWGDITFNYTPTDTSDFV
PTASVA
>Q0K1E0 4.1.1.39~~~cbbL1~~~Ribulose bisphosphate carboxylase large chain, chromosomal~~~COG1850
MNAPESVQAKPRKRYDAGVMKYKEMGYWDGDYEPKDTDLLALFRITPQDGVDPVEAAAAVAGESSTATWTVVWTDRLTAC
DMYRAKAYRVDPVPNNPEQFFCYVAYDLSLFEEGSIANLTASIIGNVFSFKPIKAARLEDMRFPVAYVKTFAGPSTGIIV
ERERLDKFGRPLLGATTKPKLGLSGRNYGRVVYEGLKGGLDFMKDDENINSQPFMHWRDRFLFVMDAVNKASAATGEVKG
SYLNVTAGTMEEMYRRAEFAKSLGSVIIMIDLIVGWTCIQSMSNWCRQNDMILHLHRAGHGTYTRQKNHGVSFRVIAKWL
RLAGVDHMHTGTAVGKLEGDPLTVQGYYNVCRDAYTHADLSRGLFFDQDWASLRKVMPVASGGIHAGQMHQLISLFGDDV
VLQFGGGTIGHPQGIQAGATANRVALEAMVLARNEGRDILNEGPEILRDAARWCGPLRAALDTWGDISFNYTPTDTSDFA
PTASVA
>P42721 4.1.1.39~~~cbbL2~~~Ribulose bisphosphate carboxylase large chain, plasmid~~~COG1850
MNAPESVQAKPRKRYDAGVMKYKEMGYWDGDYEPKDTDLLALFRITPQDGVDPVEAAAAVAGESSTATWTVVWTDRLTAC
DMYRAKAYRVDPVPNNPEQFFCYVAYDLSLFEEGSIANLTASIIGNVFSFKPIKAARLEDMRFPVAYVKTFAGPSTGIIV
ERERLDKFGRPLLGATTKPKLGLSGRNYGRVVYEGLKGGLDFMKDDENINSQPFMHWRDRFLFVMDAVNKASAATGEVKG
SYLNVTAGTMEEMYRRAEFAKSLGSVVIMIDLIVGWTCIQSMSNWCRQNDMILHLHRAGHGTYTRQKNHGVSFRVIAKWL
RLAGVDHMHTGTAVGKLEGDPLTVQGYYNVCRDAYTHTDLTRGLFFDQDWASLRKVMPVASGGIHAGQMHQLIHLFGDDV
VLQFGGGTIGHPQGIQAGATANRVALEAMVLARNEGRDILNEGPEILRDAARWCGPLRAALDTWGDISFNYTPTDTSDFA
PTASVA
>P27997 4.1.1.39~~~cbbL~~~Ribulose bisphosphate carboxylase large chain~~~
MDTKTTEIKGKERYKAGVLKYAQMGYWDGDYVPKDTDVLALFRITPQEGVDPVEAAAAVAGESSTATWTVVWTDRLTACD
SYRAKAYRVEPVPGTPGQYFCYVAYDLILFEEGSIANLTASIIGNVFSFKPLKAARLEDMRFPVAYVKTYKGPPTGIVGE
RERLDKFGKPLLGATTKPKLGLSGKNYGRVVYEGLKGGLDFMKDDENINSQPFMHWRDRFLYVMEAVNLASAQTGEVKGH
YLNITAGTMEEMYRRAEFAKSLGSVIVMVDLIIGYTAIQSISEWCRQNDMILHMHRAGHGTYTRQKNHGISFRVIAKWLR
LAGVDHLHCGTAVGKLEGDPLTVQGYYNVCREPFNTVDLPRGIFFEQDWADLRKVMPVASGGIHAGQMHQLLSLFGDDVV
LQFGGGTIGHPMGIQAGATANRVALEAMVLARNEGRNIDVEGPEILRAAAKWCKPLEAALDTWGNITFNYTSTDTSDFVP
TASVAM
>O85040 4.1.1.39~~~cbbL~~~Ribulose bisphosphate carboxylase large chain~~~COG1850
MAVKKYSAGVKEYRQTYWMPEYTPLDSDILACFKITPQPGVDREEAAAAVAAESSTGTWTTVWTDLLTDMDYYKGRAYRI
EDVPGDDAAFYAFIAYPIDLFEEGSVVNVFTSLVGNVFGFKAVRGLRLEDVRFPLAYVKTCGGPPHGIQVERDKMNKYGR
PLLGCTIKPKLGLSAKNYGRAVYECLRGGLDFTKDDENINSQPFMRWRDRFLFVQDATETAEAQTGERKGHYLNVTAPTP
EEMYKRAEFAKEIGAPIIMHDYITGGFTANTGLAKWCQDNGVLLHIHRAMHAVIDRNPNHGIHFRVLTKILRLSGGDHLH
TGTVVGKLEGDRASTLGWIDLLRESFIPEDRSRGIFFDQDWGSMPGVFAVASGGIHVWHMPALVNIFGDDSVLQFGGGTL
GHPWGNAAGAAANRVALEACVEARNQGRDIEKEGKEILTAAAQHSPELKIAMETWKEIKFEFDTVDKLDTQNR
>Q56259 4.1.1.39~~~cbbL~~~Ribulose bisphosphate carboxylase large chain~~~COG1850
MAVKTYSAGVKEYRQTYWMPEYTPLDTDILACFKITPQAGVDREEAAAAVAAESSTGTWTTVWTDLLTDLDYYKGRAYAI
EDVPGDDTCFYAFIAYPIDLFEEGSVVNVFTSLVGNVFGFKAVRALRLEDVRFPIAYVKTCGGPPHGIQVERDVMNKYGR
PLLGCTIKPKLGLSAKNYGRAVYECLRGGLDFTKDDENVNSQPFMRWRQRFDFVMEAIQKSERETGERKGHYLNVTAPTP
EEMYKRAEYAKEIGAPIIMHDYITGGFCANTGLANWCRDNGMLLHIHRAMHAVLDRNPHHGIHFRVLTKILRLSGGDHLH
SGTVVGKLEGDREATLGWIDMMRDSFVKEDRSRGIFFDQDWGSMPGVFPVASGGIHVWHMPALVTIFGDDSVLQFGGGTL
GHPWGNAAGAAANRVALEACVEARNKGVAIEKEGKTVLTEAAKNSPELKIAMETWKEIKFEFDTVDKLDVAHK
>Q9ZHZ4 4.1.1.39~~~cbbM~~~Ribulose bisphosphate carboxylase~~~COG1850
MDQSARYADLSLKEEDLIAGGKHILVAYKMKPKAGHGYLEASAHFAAESSTGTNVEVSTTDDFTKGVDALVYYIDEATED
MRIAYPMDLFDRNVTDGRMMLVSVLTLIIGNNQGMGDIEHAKIHDIYFPERAIQLFDGPSKDISDMWRILGRPIENGGYI
AGTIIKPKLGLRPEPFAAAAYQFWLGGDFIKNDEPQGNQVFCPLKKVLPLVYDSMKRAQDETGQAKLFSMNITADDHYEM
MARADFGLETFGPDADKLAFLVDGFVGGPGMITTARRQYPNQYLHYHRAGHGMITSPSAKRGYTAFVLAKISRLQGASGI
HVGTMGYGKMEGEGDDRNIAYMIERDEAQGPVYFQKWYGMKPTTPIISGGMNALRLPGFFENLGHGNVINTAGGGSYGHI
DSPAAGAISLKQAYECWKAGADPIEFAKEHKEFARAFESFPKDADAIFPGWREKLGVHK
>Q31IK3 4.1.1.39~~~cbbM~~~Ribulose bisphosphate carboxylase~~~COG1850
MDQSNRYADLSLKEEDLIAGQNHILVAYTMEPAAGYGYLEVAAHIAAESSTGTNVEVCTTDDFTKGVDAIVYDIDEANGI
MKVAYPFDLFDRNGLDGKTMIVSFLTLAIGNNQGMGDVKNLQMFDFWVPETKLHLFDGPAVDITNMWKMLGRDKDNGGYI
AGTIIKPKLGLRPEPFADAAYQFWLGGDFIKNDEPQGNQTFCPMKKVIPLVADAMKRAQDETGETKLFSANITADDHHEM
CYRADYILETFGADAPQVAFLVDGYVGGPGMITTARRNYPSQYLHYHRAGHGAITSPSAKRGYTAFVLAKISRLQGASGI
HVGTMGYGKMEGDASDKNIAYMIERDECQGPAFYQKWNGMKPTTPIISGGMNALRLPGFFENLGHGNVINTSGGGSYGHI
DSPAAGATSLRQSYECWKSGADPIEFAKDHKEFARAFESFPADADKIYPGWREKLGVHK
>Q59462 4.1.1.39~~~cbbM~~~Ribulose bisphosphate carboxylase~~~
MDQSNRYADLTLTEEKLVADGNHLLVAYRLKPAAGYGFLEVAAHVAAESSTGTNVEVSTTDDFTRGVDALVYEIDEAAFG
DKGGLMKIAYPVDLFDPNLIDGHYNVSHMWSLILGNNQGMGDHEGLRMLDFLVPEKMVKRFDGPATDISDLWKVLGRPEV
DGGYIAGTIIKPKLGLRPEPFAKACYDFWLGGDFIKNDEPQANQNFCPMEVVIPKVAEAMDRAQQATGQAKLFSANVTAD
FHEEMIKRGEYVLGEFAKYGNEKHVAFLVDGFVTGPAGVTTSRRAFPDTYLHFHRAGHGAVTSYKSPMGMDPLCYMKLAR
LMGASGIHTGTMGYGKMEGHNDERVLAYMLERDECQGPYFYQKWYGMKPTTPIISGGMDALRLPGFFENLGHGNVINTCG
GGSFGHIDSPAAGGISLGQAYACWKTGAEPIEAPREFARAFESFPGDADKIFPGWREKLGVHK
>Q6N0W9 4.1.1.39~~~cbbM~~~Ribulose bisphosphate carboxylase~~~COG1850
MDQSNRYANLNLKESELIAGGRHVLCAYIMKPKAGFGNFIQTAAHFAAESSTGTNVEVSTTDDFTRGVDALVYEVDEANS
LMKIAYPIELFDRNVIDGRAMIASFLTLTIGNNQGMGDVEYAKMYDFYVPPAYLKLFDGPSTTIKDLWRVLGRPVINGGF
IVGTIIKPKLGLRPQPFANACYDFWLGGDFIKNDEPQGNQVFAPFKDTVRAVADAMRRAQDKTGEAKLFSFNITADDHYE
MLARGEFILETFADNADHIAFLVDGYVAGPAAVTTARRAFPKQYLHYHRAGHGAVTSPQSKRGYTAFVLSKMARLQGASG
IHTGTMGFGKMEGEAADRAIAYMITEDAADGPYFHQEWLGMNPTTPIISGGMNALRMPGFFDNLGHSNLIMTAGGGAFGH
VDGGAAGAKSLRQAEQCWKQGADPVEFAKDHREFARAFESFPQDADKLYPNWRAKLKPQAA
>Q2RRP5 4.1.1.39~~~cbbM~~~Ribulose bisphosphate carboxylase~~~COG1850
MDQSSRYVNLALKEEDLIAGGEHVLCAYIMKPKAGYGYVATAAHFAAESSTGTNVEVCTTDDFTRGVDALVYEVDEAREL
TKIAYPVALFDRNITDGKAMIASFLTLTMGNNQGMGDVEYAKMHDFYVPEAYRALFDGPSVNISALWKVLGRPEVDGGLV
VGTIIKPKLGLRPKPFAEACHAFWLGGDFIKNDEPQGNQPFAPLRDTIALVADAMRRAQDETGEAKLFSANITADDPFEI
IARGEYVLETFGENASHVALLVDGYVAGAAAITTARRRFPDNFLHYHRAGHGAVTSPQSKRGYTAFVHCKMARLQGASGI
HTGTMGFGKMEGESSDRAIAYMLTQDEAQGPFYRQSWGGMKACTPIISGGMNALRMPGFFENLGNANVILTAGGGAFGHI
DGPVAGARSLRQAWQAWRDGVPVLDYAREHKELARAFESFPGDADQIYPGWRKALGVEDTRSALPA
>P04718 4.1.1.39~~~cbbM~~~Ribulose bisphosphate carboxylase~~~
MDQSSRYVNLALKEEDLIAGGEHVLCAYIMKPKAGYGYVATAAHFAAESSTGTNVEVCTTDDFTRGVDALVYEVDEAREL
TKIAYPVALFHRNITDGKAMIASFLTLTMGNNQGMGDVEYAKMHDFYVPEAYRALFDGPSVNISALWKVLGRPEVDGGLV
VGTIIKPKLGLRPKPFAEACHAFWLGGDFIKNDEPQGNQPFAPLRDTIALVADAMRRAQDETGEAKLFSANITADDPFEI
IARGEYVLETFGENASHVALLVDGYVAGAAAITTARRRFPDNFLHYHRAGHGAVTSPQSKRGYTAFVHCKMARLQGASGI
HTGTMGFGKMEGESSDRAIAYMLTQDEAQGPFYRQSWGGMKACTPIISGGMNALRMPGFFENLGNANVILTAGGGAFGHI
DGPVAGARSLRQAWQAWRDGVPVLDYAREHKELARAFESFPGDADQIYPGWRKALGVEDTRSALPA
>Q60028 4.1.1.39~~~cbbM~~~Ribulose bisphosphate carboxylase~~~COG1850
MDQSARYADLSLKEEDLIKGGRHILVAYKMKPKSGYGYLEAAAHFAAESSTGTNVEVSTTDDFTKGVDALVYYIDEASED
MRIAYPLELFDRNVTDGRFMLVSFLTLAIGNNQGMGDIEHAKMIDFYVPERCIQMFDGPATDISNLWRILGRPVVNGGYI
AGTIIKPKLGLRPEPFAKAAYQFWLGGDFIKNDEPQGNQVFCPLKKVLPLVYDAMKRAQDDTGQAKLFSMNITADDHYEM
CARADYALEVFGPDADKLAFLVDGYVGGPGMVTTARRQYPGQYLHYHRAGHGAVTSPSAKRGYTAFVLAKMSRLQGASGI
HVGTMGYGKMEGEGDDKIIAYMIERDECQGPVYFQKWYGMKPTTPIISGGMNALRLPGFFENLGHGNVINTAGGGSYGHI
DSPAAGAISLRQSYECWKQGADPIEFAKEHKEFARAFESFPKDADKLFPGWREKLGVHK
>Q6ND47 ~~~rlp2~~~Ribulose bisphosphate carboxylase-like protein 2~~~COG1850
MTPDDIAGFYAKRADLDLDNYIELDFDFECAGDPHEAAAHLCSEQSTAQWRRVGFDEDFRPRFAAKVLELSAEPRPSGFS
VPVECAARGPVHACRVTIAHPHGNFGAKIPNLLSAVCGEGVFFSPGIPLIRLQDIRFPEPYLAAFDGPRFGIAGVRERLQ
AFDRPIFFGVIKPNIGLPPQPFAELGYQSWTGGLDIAKDDEMLADVDWCPLAERAALLGDACRRASAETGVPKIYLANIT
DEVDRLTELHDVAVANGAGALLINAMPVGLSAVRMLRKHATVPLIAHFPFIAAFSRLANYGIHSRVMTRLQRLAGFDVVI
MPGFGPRMMTPEHEVLDCIRACLEPMGPIKPCLPVPGGSDSAATLENVYRKVGSADFGFVPGRGVFGHPMGPAAGATSIR
QAWDAIAAGIPVPDHAASHPELAAALRAFGGR
>Q8KBL4 ~~~~~~Ribulose bisphosphate carboxylase-like protein~~~COG1850
MNAEDVKGFFASRESLDMEQYLVLDYYLESVGDIETALAHFCSEQSTAQWKRVGVDEDFRLVHAAKVIDYEVIEELEQLS
YPVKHSETGKIHACRVTIAHPHCNFGPKIPNLLTAVCGEGTYFTPGVPVVKLMDIHFPDTYLADFEGPKFGIEGLRDILN
AHGRPIFFGVVKPNIGLSPGEFAEIAYQSWLGGLDIAKDDEMLADVTWSSIEERAAHLGKARRKAEAETGEPKIYLANIT
DEVDSLMEKHDVAVRNGANALLINALPVGLSAVRMLSNYTQVPLIGHFPFIASFSRMEKYGIHSKVMTKLQRLAGLDAVI
MPGFGDRMMTPEEEVLENVIECTKPMGRIKPCLPVPGGSDSALTLQTVYEKVGNVDFGFVPGRGVFGHPMGPKAGAKSIR
QAWEAIEQGISIETWAETHPELQAMVDQSLLKKQD
>Q6LBA6 4.1.1.39~~~cbbL~~~Ribulose bisphosphate carboxylase large chain~~~
MNDQSMTIRGKDRYKSGVMAYKKMGYWEPDYVPKDTDVIALFRVTPQDGVDPIEAAAAVAGESSTATWTVVWTDRLTAAE
KYRAKCYRVDPVPNSPGQYFAYIAYDLDLFEPGSISNLTASIIGNVFGFKPLKGLRLEDMRLPVAYVKTFQGPATGIVVE
RERLDKFGRPLLGATVKPKLGLSGRNYGRVVYEALKGGLDFTKDDENINSQPFMHWRERFLYCMEAVNRAQAASGEVKGT
YLNVTAATMEDMYERAEFAKELGSCIVMIDLVIGYTAIQSMAKWARKNDMILHLHRAGHSTYTRQKNHGVSFRVIAKWMR
LAGVDHIHAGTVVGKLEGDPNTTRGYYDICREEFNPTKLEHGIFFDQNWASLNKMMPVASGGIHAGQMHQLLDLLGEDVV
LQFGGGTIGHPMGIQAGAIANRVALEAMILARNEGRDYVAEGPEILAKAAATCTPLKSALEVWKDVTFNYESTDAPDFVP
TAIAAV
>Q9ZI34 4.1.1.39~~~cbbL~~~Ribulose bisphosphate carboxylase large chain~~~COG1850
MNAHTGTVRGKERYRSGVMEYKRMGYWEPDYTPKDTDVIALFRVTPQEGVDPIEASAAVAGESSTATWTVVWTDRLTAAE
KYRAKCHRVDPVPGTPGSYFAYIAYDLDLFEPGSIANLSASIIGNVFGFKPLKALRLEDMRFPVAYVKTFQGPATGIVVE
RERLDKFGRPLLGATVKPKLGLSGRNYGRVVYEALKGGLDFTKDDENINSQPFMHWRDRFLYCIEAVNRAQAASGEVKGT
YLNITAGTMEDMYERAEFAKELGSCIVMIDLVIGYTAIQSMAKWARRNDMILHLHRAGHSTYTRQKSHGVSFRVIAKWMR
LAGVDHIHAGTVVGKLEGDPNTTRGYYDVCREDFNPTKLEHGLFFDQSWASLNKMMPVASGGIHAGQMHQLLDLLGEDVV
LQFGGGTIGHPMGIAAGAIANRVALEAMILARNEGRDYVHEGPEILAKAAQTCTPLKSALEVWKDVTFNYQSTDTPDFVP
TALETV
>Q9ZB35 4.1.1.39~~~cbbL~~~Ribulose bisphosphate carboxylase large chain~~~
MATKTYNAGVKEYRSTYWEPHYTPKDTDILACFKITPQPGVDREEVAAAVAAESSTGTWTTVWTDLLTDLDYYKGRAYRI
EDVPGDDTCFYAFVAYPIDLFEEGSVVNVLTSLVGNVFGFKALRALRSEDVRFPIAYVKTCGGPPHGIQVERDIMNKYGR
PLLGCTIKPKLGLSGKNYGRAVYECLRGGLDFTKDDENVNSQPFMRWPQRFDFEQEAIEKAHGETGERKVHYLNVTAPTP
GEMYKRAEYAKELGAPIIMHDYLTGGLCANTGLANWCRDNGMLLHIHRAMHAELDRNPHHGIHFRVLTKVLRLSGRDHLH
SGTVVGKLEGDRASTLGWIDIMRDTFIKEDRSRGIFFDQDFGSMPGVMPVASGGIHVWHMPALVNIFGDDSVLQFGGGTV
GHPWGNAPGATANRVELEACVKARNEGIAVEKEGKAVLTEAANDSPELKIAMETWKEIKFEFDTVDKLDIAHK
>Q603Q7 4.1.1.39~~~cbbL~~~Ribulose bisphosphate carboxylase large chain~~~COG1850
MAVKTYNAGVKEYRETYWDPNYTPADTDLLAVFKITPQPGVPREEAAAAVAAESSTGTWTTVWTDLLTDLDYYKGRAYRI
EDVPGQDEQFYAFIAYPIDLFEEGSVVNVFTSLVGNVFGFKAVRGLRLEDVRFPLAYVMTCGGPPHGIQVERDIMNKYGR
PLLGCTIKPKLGLSAKNYGRAVYECLRGGLDFTKDDENVNSQPFMRWRQRFDFVMEAIEKAEAETGERKGHYLNVTAPTP
EEMYKRAEYAKEIGAPIIMHDFITGGFCANTGLANWCRNNGMLLHIHRAMHAVMDRNPNHGIHFRVFTKMLRLSGGDHLH
TGTVVGKLEGDRQATLGWIDLLRERSIKEDRSRGIFFDQEWGAMPGVFAVASGGIHVWHMPALLSIFGDDAVFQFGGGTL
GHPWGNAAGAAANRVALEACVEARNEGRQLEKEGKEILTEAAKSSPELKAAMETWKEIKFEFDTVDKLDVAHR
>P00879 4.1.1.39~~~cbbL~~~Ribulose bisphosphate carboxylase large chain~~~COG1850
MSYAQTKTQTKSGYKAGVQDYRLTYYTPDYTPKDTDILAAFRVTPQPGVPFEEAAAAVAAESSTGTWTTVWTDLLTDLDR
YKGRCYDIEPVPGEDNQFIAYIAYPLDLFEEGSITNVLTSIVGNVFGFKALRALRLEDIRFPVAYIKTFQGPPHGIQVER
DKLNKYGRPLLGCTIKPKLGLSAKNYGRAVYECLRGGLDFTKDDENINSAPFQRWRDRFLFVADAITKAQAETGEIKGHY
LNVTAPTCEEMLKRAEYAKELKQPIIMHDYLTAGFTANTTLARWCRDNGVLLHIHRAMHAVIDRQKNHGIHFRVLAKALR
LSGGDHIHTGTVVGKLEGERGITMGFVDLLRENYVEQDKSRGIYFTQDWASLPGVMAVASGGIHVWHMPALVEIFGDDSV
LQFGGGTLGHPWGNAPGATANRVALEACVQARNEGRNLAREGNDVIREAAKWSPELAVACELWKEIKFEFEAMDTV
>Q7U5I8 4.1.1.39~~~cbbL~~~Ribulose bisphosphate carboxylase large chain~~~COG1850
MSKKYDAGVKEYRDTYWTPDYVPLDTDLLACFKCTGQEGVPKEEVAAAVAAESSTGTWSTVWSELLTDLDFYKGRCYRIE
DVPGDKESFYAFIAYPLDLFEEGSITNVLTSLVGNVFGFKALRHLRLEDIRFPMAFIKSCYGPPNGIQVERDRMNKYGRP
LLGCTIKPKLGLSGKNYGRVVYECLRGGLDFTKDDENINSQPFQRWQNRFEFVAEAIKLSEQETGERKGHYLNVTANTPE
EMYERAEFAKELGMPIIMHDFITGGFTANTGLSKWCRKNGMLLHIHRAMHAVIDRHPKHGIHFRVLAKCLRLSGGDQLHT
GTVVGKLEGDRQTTLGYIDQLRESFVPEDRSRGNFFDQDWGSMPGVFAVASGGIHVWHMPALVAIFGDDSVLQFGGGTHG
HPWGSAAGAAANRVALEACVKARNAGREIEKESRDILMEAGKHSPELAIALETWKEIKFEFDTVDKLDVQN
>Q7V6F8 4.1.1.39~~~cbbL~~~Ribulose bisphosphate carboxylase large chain~~~COG1850
MSKKYDAGVKEYRDTYWTPDYVPLDTDLLACFKCTGQEGVPREEVAAAVAAESSTGTWSTVWSELLTDLEFYKGRCYRIE
DVPGDKESFYAFIAYPLDLFEEGSITNVLTSLVGNVFGFKALRHLRLEDIRFPMAFIKTCGGPPNGIVVERDRLNKYGRP
LLGCTIKPKLGLSGKNYGRVVYECLRGGLDLTKDDENINSQPFQRWRERFEFVAEAVKLAQQETGEVKGHYLNCTATTPE
EMYERAEFAKELDMPIIMHDYITGGFTANTGLANWCRKNGMLLHIHRAMHAVIDRHPKHGIHFRVLAKCLRLSGGDQLHT
GTVVGKLEGDRQTTLGYIDNLRESFVPEDRSRGNFFDQDWGSMPGVFAVASGGIHVWHMPALLAIFGDDSCLQFGGGTHG
HPWGSAAGAAANRVALEACVKARNAGREIEKESRDILMEAAKHSPELAIALETWKEIKFEFDTVDKLDVQ
>Q7V2D0 4.1.1.39~~~cbbL~~~Ribulose bisphosphate carboxylase large chain~~~COG1850
MSKKYDAGVKEYRDTYWTPEYVPLDTDLLACFKCTGQEGVPREEVAAAVAAESSTGTWSTVWSELLTDLEFYKGRCYRIE
DVPGDPEAFYAFIAYPLDLFEEGSITNVLTSLVGNVFGFKALRHLRLEDIRFPIAFIKTCGGPPNGIVVERDRLNKYGRP
LLGCTIKPKLGLSGKNYGRVVYECLRGGLDLTKDDENINSQPFQRWRERFEFVAEAVKLAQQETGEVKGHYLNCTANTPE
ELYERAEFAKELDMPIIMHDYITGGFTANTGLANWCRKNGMLLHIHRAMHAVIDRHPKHGIHFRVLAKCLRLSGGDQLHT
GTVVGKLEGDRQTTLGYIDNLRESFVPEDRSRGNFFDQDWGSMPGVFAVASGGIHVWHMPALLAIFGDDSCLQFGGGTHG
HPWGSAAGAAANRVALEACVKARNAGREIEKESRDILMEAAKHSPELAIALETWKEIKFEFDTVDKLDVQG
>F4CQ77 4.1.1.39~~~cbbL~~~Ribulose bisphosphate carboxylase large chain~~~COG1850
MADRWNAGVIPYAEMGYWQPDYEPKDTDILCAFRITPQDGVPPEEAGAAVAGESSTATWTVVWTDRLTTFEHYQAKCYKV
DPVPNTPGQWIAYIAYDIDLFEEASIANLTSSIIGNVFGFKPLKALRLEDMRIPTHYVKTFQGPAHGIVMEREHLGKFGR
PILGATTKPKLGLSARNYGRVVYEALRGGLDFTKDDENINSQPFMRWRDRFLFCMEAVNRAQAATGEIKGHYLNVTAGTM
EEMYERANFAAELGSVIVMIDLTIGYTAIQSMAKWARDNNVILHLHRAGHGTYTRQKNHGVSFRVISKWMRLAGVDHIHA
GTVVGKLEGDPMTTAGFYDTLRKDSIKADLSKGLYFDQEWASMPGVMPVASGGIHAGQMHQLIHYLGEDVILQFGGGTIG
HPMGIAAGAEANRVALEAMIKARNEGVDYYKEGPEILKKAASRNRALDTALATWGDITFNYESTDTPDVVATPTNA
>Q31NB3 4.1.1.39~~~cbbL~~~Ribulose bisphosphate carboxylase large chain~~~COG1850
MPKTQSAAGYKAGVKDYKLTYYTPDYTPKDTDLLAAFRFSPQPGVPADEAGAAIAAESSTGTWTTVWTDLLTDMDRYKGK
CYHIEPVQGEENSYFAFIAYPLDLFEEGSVTNILTSIVGNVFGFKAIRSLRLEDIRFPVALVKTFQGPPHGIQVERDLLN
KYGRPMLGCTIKPKLGLSAKNYGRAVYECLRGGLDFTKDDENINSQPFQRWRDRFLFVADAIHKSQAETGEIKGHYLNVT
APTCEEMMKRAEFAKELGMPIIMHDFLTAGFTANTTLAKWCRDNGVLLHIHRAMHAVIDRQRNHGIHFRVLAKCLRLSGG
DHLHSGTVVGKLEGDKASTLGFVDLMREDHIEADRSRGVFFTQDWASMPGVLPVASGGIHVWHMPALVEIFGDDSVLQFG
GGTLGHPWGNAPGATANRVALEACVQARNEGRDLYREGGDILREAGKWSPELAAALDLWKEIKFEFETMDKL
>Q44176 4.1.1.39~~~cbbL~~~Ribulose bisphosphate carboxylase large chain~~~COG1850
MQTKSAGFNAGVQDYRLTYYTPDYTPKDTDLLACFRMTPQPGVPPEECAAAVAAESSTGTWTTVWTDGLTDLDRYKGRCY
NVEPVPGEDNQYFCFVAYPLDLFEEGSVTNVLTSLVGNVFGFKALRALRLEDIRFPVALIKTYQGPPHGITVERDLLNKY
GRPLLGCTIKPKLGLSAKNYGRAVYECLRGGLDFTKDDENINSQPFMRWRDRFLFVQEAIEKSQAETNEVKGHYLNVTAG
TCEEMLKRAEFAKEIGTPIIMHDFLTGGFTANTTLAKWCRDNGVLLHIHRAMHAVIDRQKNHGIHFRVLAKCLRLSGGDH
LHSGTVVGKLEGDRAATLGFVDLMREDYVEEDRSRGVFFTQDYASLPGTMPVASGGIHVWHMPALVEIFGDDSCLQFGGG
TLGHPWGNAPGATANRVALEACVQARNEGRSLAREGNDVLREAGKWSPELAAALDLWKEIKFEFDTVDTL
>P00880 4.1.1.39~~~cbbL~~~Ribulose bisphosphate carboxylase large chain~~~COG1850
MPKTQSAAGYKAGVKDYKLTYYTPDYTPKDTDLLAAFRFSPQPGVPADEAGAAIAAESSTGTWTTVWTDLLTDMDRYKGK
CYHIEPVQGEENSYFAFIAYPLDLFEEGSVTNILTSIVGNVFGFKAIRSLRLEDIRFPVALVKTFQGPPHGIQVERDLLN
KYGRPMLGCTIKPKLGLSAKNYGRAVYECLRGGLDFTKDDENINSQPFQRWRDRFLFVADAIHKSQAETGEIKGHYLNVT
APTCEEMMKRAEFAKELGMPIIMHDFLTAGFTANTTLAKWCRDNGVLLHIHRAMHAVIDRQRNHGIHFRVLAKCLRLSGG
DHLHSGTVVGKLEGDKASTLGFVDLMREDHIEADRSRGVFFTQDWASMPGVLPVASGGIHVWHMPALVEIFGDDSVLQFG
GGTLGHPWGNAPGATANRVALEACVQARNEGRDLYREGGDILREAGKWSPELAAALDLWKEIKFEFETMDKL
>P96486 4.1.1.39~~~cbbL~~~Ribulose bisphosphate carboxylase large chain~~~COG1850
MSKKYDAGVKEYRDTYWTPDYVPLDTDLLACFKCTGQEGVPKEEVAAAVAAESSTGTWSTVWSELLTDLDFYKGRCYRIE
DVPGDKESFYAFIAYPLDLFEEGSITNVLTSLVGNVFGFKALRHLRLEDIRFPMAFIKSCYGPPNGIQVERDRMNKYGRP
LLGCTIKPKLGLSGKNYGRVVYECLRGGLDFTKDDENINSQPFQRWQNRFEFVAEAIKLSEQETGERKGHYLNVTANTPE
EMYERAEFAKELGMPIIMHDFITGGFTANTGLSKWCRKNGMLLHIHRAMHAVIDRHPKHGIHFRVLAKCLRLSGGDQLHT
GTVVGKLEGDRQTTLGYIDQLRESFVPEDRSRGNFFDQDWGSMPGVFAVASGGIHVWHMPALVTIFGDDSVLQFGGGTHG
HPWGSAAGAAANRVALEACVKARNAGRHLEKESRDILMEAGKHSPELAIALETWKEIKFEFDTVDKLDVQN
>P54205 4.1.1.39~~~cbbL~~~Ribulose bisphosphate carboxylase large chain~~~COG1850
MVQAKAGFKAGVQDYRLTYYTPDYTPKDTDLLACFRMTPQPGVPAEEAAAAVAAESSTGTWTTVWTDNLTDLDRYKGRCY
DLEAVPNEDNQYFAFIAYPLDLFEEGSVTNVLTSLVGNVFGFKALRALRLEDIRFPVALIKTFQGPPHGITVERDKLNKY
GRPLLGCTIKPKLGLSAKNYGRAVYECLRGGLDFTKDDENINSQPFMRWRDRFLFVQEAIEKAQAETNEMKGHYLNVTAG
TCEEMMKRAEFAKEIGTPIIMHDFFTGGFTANTTLARWCRDNGILLHIHRAMHAVVDRQKNHGIHFRVLAKCLRLSGGDH
LHSGTVVGKLEGERGITMGFVDLMREDYVEEDRSRGIFFTQDYASMPGTMPVASGGIHVWHMPALVEIFGDDSCLQFGGG
TLGHPWGNAPGATANRVALEACVQARNEGRNLAREGNDVIREACRWSPELAAACELWKEIKFEFEAMDTL
>Q8DIS5 4.1.1.39~~~cbbL~~~Ribulose bisphosphate carboxylase large chain~~~COG1850
MAYTQSKSQKVGYQAGVKDYRLTYYTPDYTPKDTDILAAFRVTPQPGVPFEEAAAAVAAESSTGTWTTVWTDLLTDLDRY
KGCCYDIEPLPGEDNQFIAYIAYPLDLFEEGSVTNMLTSIVGNVFGFKALKALRLEDLRIPVAYLKTFQGPPHGIQVERD
KLNKYGRPLLGCTIKPKLGLSAKNYGRAVYECLRGGLDFTKDDENINSQPFQRWRDRFLFVADAIHKAQAETGEIKGHYL
NVTAPTCEEMLKRAEFAKELEMPIIMHDFLTAGFTANTTLSKWCRDNGMLLHIHRAMHAVMDRQKNHGIHFRVLAKCLRM
SGGDHIHTGTVVGKLEGDKAVTLGFVDLLRENYIEQDRSRGIYFTQDWASMPGVMAVASGGIHVWHMPALVDIFGDDAVL
QFGGGTLGHPWGNAPGATANRVALEACIQARNEGRDLMREGGDIIREAARWSPELAAACELWKEIKFEFEAQDTI
>P23011 4.1.1.39~~~cbbL~~~Ribulose bisphosphate carboxylase large chain~~~
MGADAAIGQIKDAKKRYAAGVLKYAQMGYWDGDYQPKDTDILALFRVTPQDGVDPVEAAAAVAGESSTATWTVVWTDRLT
AADMYRAKAYKVEPVPGQPGQYFCWVAYELDLFEEGSIANLTASIIGNVFSFKPLKACRLEDMRLPVAYVKTFRGPPTGI
VVERERLDKFGRPLLGATTKPKLGLSGKNYGRVVYEGLKGGLDFVKDDENINSQPFMHWRDRFLYCMEAVNKAQAETGEV
KGHYLNITAGTMEEMYRRADFAKELGSVVVMVDLIVGWTAIQSISNWCRENDMLLHMHRAGHGTYTRQKGHGISFRVIAK
WLRLAGVDHLHTGTAVGKLEGDPMTVQGYYNVCREDVTRTDYTRGIFFDQDWAGLRKVMPVASGGIHAGQMHQLIDLFGE
DVVLQFGGGTIGHPDGIQAGAIANRVALETMILARNEGRDIKNEGPEILVEAAKWCQPLRAALDTWGEVTFNYASTDTSD
FVPTASVA
>P0A8V0 3.1.-.-~~~rbn~~~Ribonuclease BN~~~COG1234
MELIFLGTSAGVPTRTRNVTAILLNLQHPTQSGLWLFDCGEGTQHQLLHTAFNPGKLDKIFISHLHGDHLFGLPGLLCSR
SMSGIIQPLTIYGPQGIREFVETALRISGSWTDYPLEIVEIGAGEILDDGLRKVTAYPLEHPLECYGYRIEEHDKPGALN
AQALKAAGVPPGPLFQELKAGKTITLEDGRQINGADYLAAPVPGKALAIFGDTGPCDAALDLAKGVDVMVHEATLDITME
AKANSRGHSSTRQAATLAREAGVGKLIITHVSSRYDDKGCQHLLRECRSIFPATELANDFTVFNV
>A0QZ11 ~~~rbpA~~~RNA polymerase-binding protein RbpA~~~
MADRVLRGSRLGAVSYETDRNHDLAPRQVARYRTDNGEEFDVPFADDAEIPGTWLCRNGLEGTLIEGDVPEPKKVKPPRT
HWDMLLERRSVEELEELLKERLDLIKAKRRGTGS
>P9WHJ5 ~~~rbpA~~~RNA polymerase-binding protein RbpA~~~
MADRVLRGSRLGAVSYETDRNHDLAPRQIARYRTDNGEEFEVPFADDAEIPGTWLCRNGMEGTLIEGDLPEPKKVKPPRT
HWDMLLERRSIEELEELLKERLELIRSRRRG
>Q9RKY0 ~~~rbpA~~~RNA polymerase-binding protein RbpA~~~
MSERALRGTRLVVTSYETDRGIDLAPRQAVEYACEKGHRFEMPFSVEAEIPPEWECKVCGAQALLVDGDGPEEKKAKPAR
THWDMLMERRTREELEEVLEERLAVLRSGAMNIAVHPRDSRKSA
>P22850 ~~~cbbS1~~~Ribulose bisphosphate carboxylase small subunit 1~~~COG4451
MSEMQDYSSSLEDVNSRKFETFSYLPAMDADRIRKQVEYIVSKGWNPAIEHTEPENAFDHYWYMWKLPMFGETDIDTILK
EAEACHKAHPNNHVRLIGFDNYAQSKGAEMVVYRGKPV
>Q31IJ9 ~~~cbbS1~~~Ribulose bisphosphate carboxylase small subunit 1~~~COG4451
MSIQDYPSRLSDPQSRKAETFSYLPKMTAEQIKAQVQYIIDRGWNPAIEHSEPENAFSYYWYMWKLPMFGETDADAVLAE
VDACIKANPNNHVRLIGYDNYAQSQGANMLVKRGDM
>Q59459 ~~~cbbS1~~~Ribulose bisphosphate carboxylase small subunit 1~~~
MSQELDYPSSLSAPTSRKAETFSYLPKMTTDQIKAQVEYIISKGWNPAIEHSEPENAFSYYWYMWKLPMFGDTDANVVLA
EVDACIKANPNNHVRLIGYDNYAQSQGANMLVKRGDM
>P22860 ~~~cbbS2~~~Ribulose bisphosphate carboxylase small subunit 2~~~COG4451
MNTASSMGDHATIGRYETFSYLPPLNREEILEQILYILDNGWNASLEHEHPDRAFEYYWPMWKMPFFGEQDPNVILTEIE
SCRRSYPDHHVRLVGYDTYAQSKGHSFLVHRAR
>Q31HD8 ~~~cbbS2~~~Ribulose bisphosphate carboxylase small subunit 2~~~COG4451
MSISQIDDYRTQYTLETFSFLPELTADEIYDQIVYIINQGWSPALEHEEPAKASDHYWGMWKLPMFGTRDPNEVLAEIDA
CRQAYPNHLIRLVGYDNYTQCQGHNFVVYRPRGM
>Q59461 ~~~cbbS2~~~Ribulose bisphosphate carboxylase small subunit 2~~~
MSIQTDDYRTKYTLETFSFLPEFTADEIYDQIVYIINQGWTPALEHEAPEAASSHYWGMWKLPMFGTRDPNAVLAEIDQC
RAAYPNHMIRLIGYDNYTQCQGHNFCRFTVLGECNSHGTNNE
>P04983 7.5.2.7~~~rbsA~~~Ribose import ATP-binding protein RbsA~~~COG1129
MEALLQLKGIDKAFPGVKALSGAALNVYPGRVMALVGENGAGKSTMMKVLTGIYTRDAGTLLWLGKETTFTGPKSSQEAG
IGIIHQELNLIPQLTIAENIFLGREFVNRFGKIDWKTMYAEADKLLAKLNLRFKSDKLVGDLSIGDQQMVEIAKVLSFES
KVIIMDEPTDALTDTETESLFRVIRELKSQGRGIVYISHRMKEIFEICDDVTVFRDGQFIAEREVASLTEDSLIEMMVGR
KLEDQYPHLDKAPGDIRLKVDNLCGPGVNDVSFTLRKGEILGVSGLMGAGRTELMKVLYGALPRTSGYVTLDGHEVVTRS
PQDGLANGIVYISEDRKRDGLVLGMSVKENMSLTALRYFSRAGGSLKHADEQQAVSDFIRLFNVKTPSMEQAIGLLSGGN
QQKVAIARGLMTRPKVLILDEPTRGVDVGAKKEIYQLINQFKADGLSIILVSSEMPEVLGMSDRIIVMHEGHLSGEFTRE
QATQEVLMAAAVGKLNRVNQE
>P36949 ~~~rbsB~~~Ribose import binding protein RbsB~~~COG1879
MKKAVSVILTLSLFLLTACSLEPPQWAKPSNSGNKKEFTIGLSVSTLNNPFFVSLKKGIEKEAKKRGMKVIIVDAQNDSS
KQTSDVEDLIQQGVDALLINPTDSSAISTAVESANAVGVPVVTIDRSAEQGKVETLVASDNVKGGEMAAAFIADKLGKGA
KVAELEGVPGASATRERGSGFHNIADQKLQVVTKQSADFDRTKGLTVMENLLQGHPDIQAVFAHNDEMALGALEAINSSG
KDILVIGFDGNKDALASIKDRKLSATVAQQPELIGKLATEAADDILHGKKVQKTISAPLKLETQK
>P02925 ~~~rbsB~~~Ribose import binding protein RbsB~~~COG1879
MNMKKLATLVSAVALSATVSANAMAKDTIALVVSTLNNPFFVSLKDGAQKEADKLGYNLVVLDSQNNPAKELANVQDLTV
RGTKILLINPTDSDAVGNAVKMANQANIPVITLDRQATKGEVVSHIASDNVLGGKIAGDYIAKKAGEGAKVIELQGIAGT
SAARERGEGFQQAVAAHKFNVLASQPADFDRIKGLNVMQNLLTAHPDVQAVFAQNDEMALGALRALQTAGKSDVMVVGFD
GTPDGEKAVNDGKLAATIAQLPDQIGAKGVETADKVLKGEKVQAKYPVDLKLVVKQ
>P44737 ~~~rbsB~~~Ribose import binding protein RbsB~~~COG1879
MKKLTALTSAVLLGLAVSSSASAQDTIALAVSTLDNPFFVTLKDGAQKKADELGYKLVVLDSQNDPAKELANIEDLTVRG
AKILLINPTASEAVGNAVAIANRKHIPVITLDRGAAKGNVVSHIASDNIAGGKMAGDFIAQKLGDNAKVIQLEGIAGTSA
ARERGEGFKQAIDAHKFNVLASQPADFDRTKGLNVTENLLASKGDVQAIFAQNDEMALGALRAVKAANKKVLIVGFDGTD
DGVKAVKSGKMAATIAQQPELIGSLGVVTADKILKGEKVEAKIPVDLKVISE
>P0A2C5 ~~~rbsB~~~Ribose import binding protein RbsB~~~
MNMKKLATLVSAVALSATVSANAMAKDTIALVISTLNNPFFVSLKDGAQKEADKLGYNLVVLDSQNNPAKELANVQDLTV
RGTKILLINPTDSDAVGNAVKMANQAKIPVITLDRQATKGDVVSHIASDNVLGGKIAGDYIAKKAGEGAKVIELQGIAGT
SAARERGEGFQQAVAAHKFNVLASQPADFDRTKGLNVMQNLLTAHPDVQAVFAQNDEMALGALRALQTAGKADVMVVGFD
GTPDGEKAVKDGKLAATIAQLPDQIGAKGVEVADKVLKGEKVQAKYPVDLKLVIKQ
>P09658 ~~~cbbS~~~Ribulose bisphosphate carboxylase small subunit, chromosomal~~~
MRITQGTFSFLPELTDEQITKQLEYCLNQGWAVGLEYTDDPHPRNTYWEMFGLPMFDLRDAAGILMEINNARNTFPNHYI
RVTAFDSTHTVESVVMSFIVNRPADEPGFRLVRQEEPGRTLRYSIESYAVQARPEGSRY
>P0AGI1 ~~~rbsC~~~Ribose import permease protein RbsC~~~COG1172
MTTQTVSGRRYFTKAWLMEQKSLIALLVLIAIVSTLSPNFFTINNLFNILQQTSVNAIMAVGMTLVILTSGIDLSVGSLL
ALTGAVAASIVGIEVNALVAVAAALALGAAIGAVTGVIVAKGRVQAFIATLVMMLLLRGVTMVYTNGSPVNTGFTENADL
FGWFGIGRPLGVPTPVWIMGIVFLAAWYMLHHTRLGRYIYALGGNEAATRLSGINVNKIKIIVYSLCGLLASLAGIIEVA
RLSSAQPTAGTGYELDAIAAVVLGGTSLAGGKGRIVGTLIGALILGFLNNGLNLLGVSSYYQMIVKAVVILLAVLVDNKK
Q
>P36946 5.4.99.62~~~rbsD~~~D-ribose pyranase~~~COG1869
MKKHGILNSHLAKILADLGHTDKIVIADAGLPVPDGVLKIDLSLKPGLPAFQDTAAVLAEEMAVEKVIAAAEIKASNQEN
AKFLENLFSEQEIEYLSHEEFKLLTKDAKAVIRTGEFTPYANCILQAGVLF
>P04982 5.4.99.62~~~rbsD~~~D-ribose pyranase~~~COG1869
MKKGTVLNSDISSVISRLGHTDTLVVCDAGLPIPKSTTRIDMALTQGVPSFMQVLGVVTNEMQVEAAIIAEEIKHHNPQL
HETLLTHLEQLQKHQGNTIEIRYTTHEQFKQQTAESQAVIRSGECSPYANIILCAGVTF
>Q8ZKW0 5.4.99.62~~~rbsD~~~D-ribose pyranase~~~
MKKGTVLNSEISSVISRLGHTDTLVVCDAGLPIPNSTARIDMALTQGVPSFMQVVDVVTREMQVEAAILATEIKQQNPQL
HETLLTHLEQLQQHQGNTIKISYTTHEQFKKLTADSQAVIRSGECSPYANVILCAGVTF
>Q2G1A5 5.4.99.62~~~rbsD~~~D-ribose pyranase~~~COG1869
MKKSAVLNEHISKAIATIGHFDLLTINDAGMPIPNDHRRIDLAVTKNLPRFIDVLATVLEEMEIQKIYLAEEIKEHNPTQ
LQQIKQLISSEIEIIFIPHEEMKSNLAHPLNKGNIRTGETTPYSNIALESNVTF
>P83534 ~~~rbsK/rbiA~~~Bifunctional ribokinase/ribose-5-phosphate isomerase A~~~
MKKIIVVGSTNVDKVLNVEKYALPGETLAINTYQQSHGGGKGANQAIAAARSGADTTFITKLGNDEDAKMMVKGFKADGM
NIDDVITTTDQETGKAYITVDKSGQNSIYVYGGANMAMTPTDVDAHKSAIINADRVIAQLEIPVPAVIEAFKIAKEHGVQ
TILNPAPAKELPEELLKLTDIITPNESEAATLTGIEVKDETSMLANAKFFFERGIKMVIITVGGRGSFFATPDDHALIPP
FPAKVVDTTAAGDTFIGSLASQLEIDLSNIRKAMLYASHASSLTIQVAGAQNSIPTREAILNVINQDQMTKTEIEKQKAQ
AAAYAAKLVPDHIVLGLGSGTTAAYFVKAINQRINDEHLDIQCVATSVGTEKLAEKLGMRMLDVNTIDQVDLTVDGADVV
DHQLNGIKGGGAALLFEKLVADMSKQNIWIVDQSKYTDSLAGHILTIEVIPFGGMGVFRYLKENGYQPEFRFKDNGDILE
TDSGNYLINIIIPKDADLEKLSIDLKKQTGVVEHGLFLNVCDELIIGGDQIKTIKRSDLS
>P0A9J6 2.7.1.15~~~rbsK~~~Ribokinase~~~COG0524
MQNAGSLVVLGSINADHILNLQSFPTPGETVTGNHYQVAFGGKGANQAVAAGRSGANIAFIACTGDDSIGESVRQQLATD
NIDITPVSVIKGESTGVALIFVNGEGENVIGIHAGANAALSPALVEAQRERIANASALLMQLESPLESVMAAAKIAHQNK
TIVALNPAPARELPDELLALVDIITPNETEAEKLTGIRVENDEDAAKAAQVLHEKGIRTVLITLGSRGVWASVNGEGQRV
PGFRVQAVDTIAAGDTFNGALITALLEEKPLPEAIRFAHAAAAIAVTRKGAQPSVPWREEIDAFLDRQR
>A0A0H2WZY4 2.7.1.15~~~rbsK~~~Ribokinase~~~
MTNKVVILGSTNVDQFLTVERYAQPGETLHVEEAQKAFGGGKGANQAIATARMQADTTFITKIGTDGVADFILEDFKVAH
IDTSYIIKTAEAKTGQAFITVNAEGQNTIYVYGGANMTMTPEDVINAKDAIINADFVVAQLEVPIPAIISAFEIAKAHGV
TTVLNPAPAKALPNELLSLIDIIVPNETEAELLSGIKVTNEQSMKDNANYFLSIGIKTVLITLGKQGTYFATKNQSQHIE
AYKVNAIDTTAAGDTFIGAFVSRLNKSQDNLADAIDFGNKASSLTVQKHGAQASIPLLEEVNQV
>P0ACQ0 ~~~rbsR~~~Ribose operon repressor~~~COG1609
MATMKDVARLAGVSTSTVSHVINKDRFVSEAITAKVEAAIKELNYAPSALARSLKLNQTHTIGMLITASTNPFYSELVRG
VERSCFERGYSLVLCNTEGDEQRMNRNLETLMQKRVDGLLLLCTETHQPSREIMQRYPTVPTVMMDWAPFDGDSDLIQDN
SLLGGDLATQYLIDKGHTRIACITGPLDKTPARLRLEGYRAAMKRAGLNIPDGYEVTGDFEFNGGFDAMRQLLSHPLRPQ
AVFTGNDAMAVGVYQALYQAELQVPQDIAVIGYDDIELASFMTPPLTTIHQPKDELGELAIDVLIHRITQPTLQQQRLQL
TPILMERGSA
>Q9ZI33 ~~~cbbS~~~Ribulose bisphosphate carboxylase small subunit~~~COG4451
MKLTQGCFSFLPDLTDDQIYKQVQYCLAKGWAVNIEFTDDPHPRNTYWEMWGLPMFDLQDAAGVMMELAECRRVYGDRYI
RISGFDSSPGWESVRISFLVNRPPQEAEFELVRQEVGGRAIRYTTVRKAPAHVS
>P27998 ~~~cbbS~~~Ribulose bisphosphate carboxylase small subunit~~~
MRITQGCFSFLPDLTDEQISAQVDYCLGRGWAVSLEHTDDPHPRNTYWEMWGMPMFDLRDPKGVMIELDECRKAWPGRYI
RINAFDSTRGFETVTMSFIVNRPEVEPSLRMERTEVDGRSIRYTHSIVR
>P45686 ~~~cbbS~~~Ribulose bisphosphate carboxylase small subunit~~~COG4451
MAEMQDYKQSLKYETFSYLPPMNAERIRAQIKYAIAQGWSPGIEHVEVKNSMNQYWYMWKLPFFGEQNVDNVLAEIEACR
SAYPTHQVKLVAYDNYAQSLGLAFVVYRGN
>P06514 ~~~cbbS~~~Ribulose bisphosphate carboxylase small subunit~~~COG4451
MQTLPKERRYETLSYLPPLTDVQIEKQVQYILSQGYIPAVEFNEVSEPTELYWTLWKLPLFGAKTSREVLAEVQSCRSQY
PGHYIRVVGFDNIKQCQILSFIVHKPSRY
>P0A4S5 ~~~cbbS~~~Ribulose bisphosphate carboxylase small subunit~~~COG4451
MPFQSTVGDYQTVATLETFGFLPPMTQDEIYDQIAYIIAQGWSPLVEHVHPSNSMATYWSYWKLPFFGEKDLNVVVSELE
ACHRAYPDHHVRIVGYDAYTQSQGACFVVFEGR
>Q7V6F9 ~~~cbbS~~~Ribulose bisphosphate carboxylase small subunit~~~COG4451
MPFQSTVGDYQTVATLETFGFLPPMTQDEIYDQIAYIIAQGWSPVIEHVHPSGSMQTYWSYWKLPFFGEKDLNMVVSELE
ACHRAYPDHHVRMVGYDAYTQSQGTAFVVFEGR
>Q7V2C9 ~~~cbbS~~~Ribulose bisphosphate carboxylase small subunit~~~COG4451
MPFQSSVGDYQTVATLETFGFLPPMTQEEIYDQIAYIIAQGWSPVIEHVHPSGSMQTYWSYWKLPFFGEKDLNLVVSELE
ACHRAYPDHHVRIIGYDAYTQSQGTAFAVFQGR
>F4CQ78 ~~~cbbS~~~Ribulose bisphosphate carboxylase small subunit~~~COG4451
MRITQGTFSYLPDFTDEEITAQINYALTNDWPLSVEFTDDPHPRNTYWEMWGLPMFDLRDPAGVLMEVNACRAANPNAYV
RLNAYDASLGRQTTALSFIVQRPAEEPGFRLDRAEGRGRAIGYTTHSYATERPVGDRYTQG
>Q31NB2 ~~~cbbS~~~Ribulose bisphosphate carboxylase small subunit~~~COG4451
MSMKTLPKERRFETFSYLPPLSDRQIAAQIEYMIEQGFHPLIEFNEHSNPEEFYWTMWKLPLFDCKSPQQVLDEVRECRS
EYGDCYIRVAGFDNIKQCQTVSFIVHRPGRY
>Q44178 ~~~cbbS~~~Ribulose bisphosphate carboxylase small subunit~~~COG4451
MKTLPKEKRYETLSYLPPLSDQQIARQVQYMMDQGYIPGIEFEKDPTPELHHWTLWKLPLFNASSAQEVLNEVRECRSEY
SDCYIRVVGFDNIKQCQTVSFIVYKPNQTRY
>P04716 ~~~cbbS~~~Ribulose bisphosphate carboxylase small subunit~~~COG4451
MSMKTLPKERRFETFSYLPPLSDRQIAAQIEYMIEQGFHPLIEFNEHSNPEEFYWTMWKLPLFDCKSPQQVLDEVRECRS
EYGDCYIRVAGFDNIKQCQTVSFIVHRPGRY
>P0A4S6 ~~~cbbS~~~Ribulose bisphosphate carboxylase small subunit~~~COG4451
MPFQSTVGDYQTVATLETFGFLPPMTQDEIYDQIAYIIAQGWSPLVEHVHPSNSMATYWSYWKLPFFGEKDLNVVVSELE
ACHRAYPDHHVRIVGYDAYTQSQGACFVVFEGR
>P54206 ~~~cbbS~~~Ribulose bisphosphate carboxylase small subunit~~~COG4451
MKTLPKERRYETLSYLPPLTDQQIAKQVEFLLDQGFIPGVEFEEDPQPETHFWTMWKLPFFGGATANEVLAEVRECRSEN
PNCYIRVIGFDNIKQCQTVSFIVHKPNQNQGRY
>Q8DIS7 ~~~cbbS~~~Ribulose bisphosphate carboxylase small subunit~~~COG4451
MKTLPKERRYETFSYLPPLSDAQIARQIQYAIDQGYHPCVEFNETSNAEIRYWTMWKLPLFNCTNAQDVLNEVQQCRSEY
PNCFIRVVAFDNIKQCQVMSFIVYKPNQANSGYSGYRY
>P23012 ~~~cbbS~~~Ribulose bisphosphate carboxylase small subunit~~~
MRITQGTFSFLPDLTAAQVKAQIQYALDQNWAVSVEYTDDPHPRNTYWEMWGLPMFDLRDAAGVYGEVEACRTAHPGKYV
RVNAFDSNRGWETVRLSFIVQRPEKEDGFRLDRTEGPGRTQRYALQHRSYAAG
>Q06721 ~~~rca~~~Ribulose bisphosphate carboxylase/oxygenase activase~~~
MSYYIAPRFLDKLAVHITKNFLNLPGVRVPLILGIHGRKGEGKTFQCELAFEKMGVEVTLISGGELESPDAGDPARLIRL
RYRETAELIKVRGKMCVLMINDLDAGAGRFDEGTQYTVNTQLVNATLMNIADNPTDVQLPGSYDSTPLRRVPIIVTGNDF
STLYAPLIRDGRMEKFYWEPHRDEKVGIVGGIFAEDGLSQRDVEKLVDSFPNQSIDFFSALRSRIYDEQIRDFIHQVGYE
NVSLRVVNSLEGPPAFKKPDFTLSHLIESANFMVAEQKRIETSQLVDEYNRLNRGRSYQPASPVAEIATSQPSPNGVNQP
QSASPHISLETQEQIRQILAQGHKITFEHVDNRRFRTGSWQSCGTIHVDAESDAISTLESCLAEYRGEYVRLVGIDPKAK
RRVVETIIQRPNGTN
>P58555 ~~~rca~~~Ribulose bisphosphate carboxylase/oxygenase activase~~~COG0464
MSYYIAPRFLDKLAVHITKNFLNIPGVRVPLILGIHGRKGEGKTFQCELAFEKMGIEVTLISGGELESPDAGDPARLIRL
RYRETAELIKVRGKMCVLMINDLDAGAGRFDEGTQYTVNTQLVNATLMNIADNPTDVQLPGSYDSNPIRRVPIIVTGNDF
STLYAPLIRDGRMEKFYWEPNRDDKVGIVGGIFAEDGLSQREIEQLVDTFPKQSIDFFSALRSRIYDIQIRDFIHKVGFE
RISLRVVNSLEAPPEFKKPDFSLAHLIESGNLVLGEQQRVDNSQLVDEYNRLNRGRGYQTAPPPEAPVIQPVNNSSHKQK
TSNTHLSLETQEQIRQILSQGHKITFEHVDARRFRTGSWQSCGTLHIDAESDAISTLEACLVDYDGEYVRMVGIDPKGKR
RVVETIIQRPNGKN
>P33230 ~~~rcbA~~~Double-strand break reduction protein~~~
MYKITATIEKEGGTPTNWTRYSKSKLTKSECEKMLSGKKEAGVSREQKVKLINFNCEKLQSSRIALYSN
>P75811 ~~~rcdA~~~HTH-type transcriptional regulator RcdA~~~COG3226
MRRANDPQRREKIIQATLEAVKLYGIHAVTHRKIATLAGVPLGSMTYYFSGIDELLLEAFSSFTEIMSRQYQAFFSDVSD
APGACQAITDMIYSSQVATPDNMELMYQLYALASRKPLLKTVMQNWMQRSQQTLEQWFEPGTARALDAFIEGMTLHFVTD
RKPLSREEILRMVERVAG
>P06008 ~~~puhA~~~Reaction center protein H chain~~~
MYHGALAQHLDIAQLVWYAQWLVIWTVVLLYLRREDRREGYPLVEPLGLVKLAPEDGQVYELPYPKTFVLPHGGTVTVPR
RRPETRELKLAQTDGFEGAPLQPTGNPLVDAVGPASYAERAEVVDATVDGKAKIVPLRVATDFSIAEGDVDPRGLPVVAA
DGVEAGTVTDLWVDRSEHYFRYLELSVAGSARTALIPLGFCDVKKDKIVVTSILSEQFANVPRLQSRDQITLREEDKVSA
YYAGGLLYATPERAESLL
>Q3J170 ~~~puhA~~~Reaction center protein H chain~~~COG3861
MVGVTAFGNFDLASLAIYSFWIFLAGLIYYLQTENMREGYPLENEDGTPAANQGPFPLPKPKTFILPHGRGTLTVPGPES
EDRPIALARTAVSEGFPHAPTGDPMKDGVGPASWVARRDLPELDGHGHNKIKPMKAAAGFHVSAGKNPIGLPVRGCDLEI
AGKVVDIWVDIPEQMARFLEVELKDGSTRLLPMQMVKVQSNRVHVNALSSDLFAGIPTIKSPTEVTLLEEDKICGYVAGG
LMYAAPKRKSVVAAMLAEYA
>P0C0Y7 ~~~puhA~~~Reaction center protein H chain~~~
MVGVTAFGNFDLASLAIYSFWIFLAGLIYYLQTENMREGYPLENEDGTPAANQGPFPLPKPKTFILPHGRGTLTVPGPES
EDRPIALARTAVSEGFPHAPTGDPMKDGVGPASWVARRDLPELDGHGHNKIKPMKAAAGFHVSAGKNPIGLPVRGCDLEI
AGKVVDIWVDIPEQMARFLEVELKDGSTRLLPMQMVKVQSNRVHVNALSSDLFAGIPTIKSPTEVTLLEEDKICGYVAGG
LMYAAPKRKSVVAAMLAEYA
>P19056 ~~~puhA~~~Reaction center protein H chain~~~
MVGVNFFGDFDLASLAIWSFWAFLAYLIYYLQTENMREGYPLENDDGKLSPNQGPFPVPSPKTFDLADGRKIVVPSVENE
EAHRRTDLALERTSVNEGYPFRPTGNPMLDGVGPASWVPRRDEPEVDAHGHNKIQPMRKTEMKVSAGRDPRGMPVQAGDT
EVVGKIVDMWVDIPEQLVRYLEVELNSGKKKLLPMTMLKIWSDRVRVNAITSDLFDTIPDIKSPDVVTKLEEDKISAYVA
GGYMYAKGVKPYAL
>P06009 ~~~pufL~~~Reaction center protein L chain~~~
MALLSFERKYRVRGGTLIGGDLFDFWVGPYFVGFFGVSAIFFIFLGVSLIGYAASQGPTWDPFAISINPPDLKYGLGAAP
LLEGGFWQAITVCALGAFISWMLREVEISRKLGIGWHVPLAFCVPIFMFCVLQVFRPLLLGSWGHAFPYGILSHLDWVNN
FGYQYLNWHYNPGHMSSVSFLFVNAMALGLHGGLILSVANPGDGDKVKTAEHENQYFRDVVGYSIGALSIHRLGLFLASN
IFLTGAFGTIASGPFWTRGWPEWWGWWLDIPFWS
>Q3J1A5 ~~~pufL~~~Reaction center protein L chain~~~
MALLSFERKYRVPGGTLVGGNLFDFWVGPFYVGFFGVATFFFAALGIILIAWSAVLQGTWNPQLISVYPPALEYGLGGAP
LAKGGLWQIITICATGAFVSWALREVEICRKLGIGYHIPFAFAFAILAYLTLVLFRPVMMGAWGYAFPYGIWTHLDWVSN
TGYTYGNFHYNPAHMIAISFFFTNALALALHGALVLSAANPEKGKEMRTPDHEDTFFRDLVGYSIGTLGIHRLGLLLSLS
AVFFSALCMIITGTIWFDQWVDWWQWWVKLPWWANIPGGING
>P0C0Y8 ~~~pufL~~~Reaction center protein L chain~~~
MALLSFERKYRVPGGTLVGGNLFDFWVGPFYVGFFGVATFFFAALGIILIAWSAVLQGTWNPQLISVYPPALEYGLGGAP
LAKGGLWQIITICATGAFVSWALREVEICRKLGIGYHIPFAFAFAILAYLTLVLFRPVMMGAWGYAFPYGIWTHLDWVSN
TGYTYGNFHYNPAHMIAISFFFTNALALALHGALVLSAANPEKGKEMRTPDHEDTFFRDLVGYSIGTLGIHRLGLLLSLS
AVFFSALCMIITGTIWFDQWVDWWQWWVKLPWWANIPGGING
>P11695 ~~~pufL~~~Reaction center protein L chain~~~
MSRAKAKDPRFPDFSFTVVEGARATRVPGGRTIEEIEPEYKIKGRTTFSAIFRYDPFDFWVGPFYVGFWGFVSVIGIIFG
SYFYINETILKGPYSIPQNFFAGRIDPPPPELGLGFAAPGEPGFAWQMTVLFATIAFFGWMMRQVDISMKLDMGYHVPIA
FGVAFSAWLVLQVIRPIALGMWHEGFVLGIMPHLDWVSNFGYRYNNFFYNPFHAIGITGLFASTWLLACHGSLILSAAQY
RGPEGGDIENVFFRDVQYYSVGESGVHRLGYIFAIGGILSADLCILLSGWPVQDWVSFWNFWNNLPFWSGV
>P19057 ~~~pufL~~~Reaction center protein L chain~~~
MALLSFERKYRVPGGTLIGGSLFDFWVGPFYVGFFGVTTIFFATLGFLLILWGAAMQGTWNPQLISIFPPPVENGLNVAA
LDKGGLWQVITVCATGAFCSWALREVEICRKLGIGFHIPVAFSMAIFAYLTLVVIRPMMMGSWGYAFPYGIWTHLDWVSN
TGYTYGNFHYNPFHMLGISLFFTTAWALAMHGALVLSAANPVKGKTMRTPDHEDTYFRDLMGYSVGTLGIHRLGLLLALN
AVFWSACCMLVSGTIYFDLWSDWWYWWVNMPFWADMAGGING
>O83005 ~~~pufL~~~Reaction center protein L chain~~~
MAMLSFEKKYRVRGGTLIGGDLFDFWVGPFYVGIFGVMTVFFALIGIALIAWNTALGPTWNLWQISVNPPDAKYGLGFAP
LAEGGIWQWVSICATGAFVTWALREVEICRKLGIGFHVPFAFSFAIFAYVTLVVIRPVLMGSWSYGFPYGIFTHLDWVSN
TGYSYGQFHYNPAHMIAITFFFTTCLALALHGGLVLSALNPDRGEPVKSPEHENTVFRDLVGYSIGTIGIHRLGLFLALS
AVFFSAVCMIISGPVLAEGGSWPDWWNWWRNLPIWNP
>P10717 ~~~pufL~~~Reaction center protein L chain~~~
MALLSFERKYRVRGGTLIGGDLFDFWVGPFYVGFFGVTTLLFTVLGTALIVWGAALGPSWTFWQISINPPDVSYGLAMAP
MAKGGLWQIITFSAIGAFVSWALREVEICRKLGIGYHIPFAFGFAILAYVSLVVIRPVMMGAWGYGFPYGFMTHLDWVSN
TGYQYANFHYNPAHMLGITLFFTTCLALALHGSLILSAANPGKGEVVKGPEHENTYFQDTIGYSVGTLGIHRVGLILALS
AVVWSIICMILSGPIYTGSWPDWWLWWQKLPFWNHG
>P06010 ~~~pufM~~~Reaction center protein M chain~~~
MADYQTIYTQIQARGPHITVSGEWGDNDRVGKPFYSYWLGKIGDAQIGPIYLGASGIAAFAFGSTAILIILFNMAAEVHF
DPLQFFRQFFWLGLYPPKAQYGMGIPPLHDGGWWLMAGLFMTLSLGSWWIRVYSRARALGLGTHIAWNFAAAIFFVLCIG
CIHPTLVGSWSEGVPFGIWPHIDWLTAFSIRYGNFYYCPWHGFSIGFAYGCGLLFAAHGATILAVARFGGDREIEQITDR
GTAVERAALFWRWTIGFNATIESVHRWGWFFSLMVMVSASVGILLTGTFVDNWYLWCVKHGAAPDYPAYLPATPDPASLP
GAPK
>Q3J1A6 ~~~pufM~~~Reaction center protein M chain~~~
MAEYQNIFSQVQVRGPADLGMTEDVNLANRSGVGPFSTLLGWFGNAQLGPIYLGSLGVLSLFSGLMWFFTIGIWFWYQAG
WNPAVFLRDLFFFSLEPPAPEYGLSFAAPLKEGGLWLIASFFMFVAVWSWWGRTYLRAQALGMGKHTAWAFLSAIWLWMV
LGFIRPILMGSWSEAVPYGIFSHLDWTNNFSLVHGNLFYNPFHGLSIAFLYGSALLFAMHGATILAVSRFGGERELEQIA
DRGTAAERAALFWRWTMGFNATMEGIHRWAIWMAVLVTLTGGIGILLSGTVVDNWYVWGQNHGMAPLN
>P0C0Y9 ~~~pufM~~~Reaction center protein M chain~~~
MAEYQNIFSQVQVRGPADLGMTEDVNLANRSGVGPFSTLLGWFGNAQLGPIYLGSLGVLSLFSGLMWFFTIGIWFWYQAG
WNPAVFLRDLFFFSLEPPAPEYGLSFAAPLKEGGLWLIASFFMFVAVWSWWGRTYLRAQALGMGKHTAWAFLSAIWLWMV
LGFIRPILMGSWSEAVPYGIFSHLDWTNNFSLVHGNLFYNPFHGLSIAFLYGSALLFAMHGATILAVSRFGGERELEQIA
DRGTAAERAALFWRWTMGFNATMEGIHRWAIWMAVLVTLTGGIGILLSGTVVDNWYVWGQNHGMAPLN
>P09438 ~~~pufM~~~Reaction center protein M chain~~~
MATINMTPGDLELGRDRGRIGKPIEIPLLENFGFDSQLGPFYLGFWNAVAYITGGIFTFIWLMVMFAQVNYNPVAFAKYF
VVLQIDPPSSRYGLSFPPLNEGGWWLIATFFLTVSIFAWYMHIYTRAKALGIKPYLAYGFTGAIALYLVIYIIRPVWMGD
WSEAPAHGIKALLDWTNNVSVRYGNFYYNPFHMLSIFFLLGSTLLLAMHAGTIWALEKYAAHEEWNEIQAPGTGTERAQL
FWRWCMGFNANAYSIHLWAFWFAWLCGITGALGVFFSMPDFVNNWFQWGIEAGINYPQGPTPPVSLP
>P11847 ~~~pufM~~~Reaction center protein M chain~~~
MAEYQNFFNQVQVAGAPEMGLKEDVDTFERTPAGMFNILGWMGNAQIGPIYLGIAGTVSLAFGAAWFFTIGVWYWYQAGF
DPFIFMRDLFFFSLEPPPAEYGLAIAPLKQGGVWQIASLFMAISVIAWWVRVYTRADQLGMGKHMAWAFLSAIWLWSVLG
FWRPILMGSWSVAPPYGIFSHLDWTNQFSLDHGNLFYNPFHGLSIAALYGSALLFAMHGATILAVTRFGGERELEQIVDR
GTASERAALFWRWTMGFNATMEGIHRWAIWMAVMVTLTGGIGILLSGTVVDNWYVWAQVHGYAPVTP
>P10718 ~~~pufM~~~Reaction center protein M chain~~~
MSEYQNILTGVQVRTAPHSAPIAKGIFPRLGKPGFSYWLGKIGDAQIGPIYLGTTGVLSLVFGFFAIEIIGFNLLASVNW
SPMEFGRQFFWLGLEPPAAEYGLGFAPLAEGGWWQIAGFFLTTSILLWWVRMYRRARALKMGTHTAWAFASAIFLFLSLG
FIRPLLMGNFSESVPFGIFPHLEWTNSFSLNYGNFFYNPFHMLSIAFLYGSALLSAMHGATILAVSRLGGDREVEQITDR
GTAAERAALFWRWTMGFNATMESIHRWAWWFAVLCTFTGAIGILLTGTVVDNWFEWGVKHGLAPAP
>P77212 ~~~rclA~~~Probable pyridine nucleotide-disulfide oxidoreductase RclA~~~COG1249
MNKYQAVIIGFGKAGKTLAVTLAKAGWRVALIEQSNAMYGGTCINIGCIPTKTLVHDAQQHTDFVRAIQRKNEVVNFLRN
KNFHNLADMPNIDVIDGQAEFINNHSLRVHRPEGNLEIHGEKIFINTGAQTVVPPIPGITTTPGVYDSTGLLNLKELPGH
LGILGGGYIGVEFASMFANFGSKVTILEAASLFLPREDRDIADNIATILRDQGVDIILNAHVERISHHENQVQVHSEHAQ
LAVDALLIASGRQPATASLHPENAGIAVNERGAIVVDKRLHTTADNIWAMGDVTGGLQFTYISLDDYRIVRDELLGEGKR
STDDRKNVPYSVFMTPPLSRVGMTEEQARESGADIQVVTLPVAAIPRARVMNDTRGVLKAIVDNKTQRMLGASLLCVDSH
EMINIVKMVMDAGLPYSILRDQIFTHPSMSESLNDLFSLVK
>P75687 ~~~rclB~~~Uncharacterized protein RclB~~~
MFKKSVLFATLLSGVMAFSTNADDKIILKHISVSSVSASPTVLEDTIADIARKYNASSWKVTSMRIDNNSTATAVLYK
>P75685 ~~~rclC~~~Inner membrane protein RclC~~~COG3059
MEKYLHLLSRGDKIGLTLIRLSIAIVFMWIGLLKFVPYEADSITPFVANSPLMSFFYEHPEDYKQYLTHEGEYKPEARAW
QTANNTYGFSNGLGVVEVIIALLVLANPVNRWLGLLGGLMAFTTPLVTLSFLITTPEAWVPALGDAHHGFPYLSGAGRLV
LKDTLMLAGAVMIMADSAREILKQRSNESSSTLKTEY
>P77379 ~~~rclR~~~RCS-specific HTH-type transcriptional activator RclR~~~COG2207
MDALSRLLMLNAPQGTIDKNCVLGSDWQLPHGAGELSVIRWHALTQGAAKLEMPTGEIFTLRPGNVVLLPQNSAHRLSHV
DNESTCIVCGTLRLQHSARYFLTSLPETLFLAPVNHSVEYNWLREAIPFLQQESRSAMPGVDALCSQICATFFTLAVREW
IAQVNTEKNILSLLLHPRLGAVIQQMLEMPGHAWTVESLASIAHMSRASFAQLFRDVSGTTPLAVLTKLRLQIAAQMFSR
ETLPVVVIAESVGYASESSFHKAFVREFGCTPGEYRERVRQLAP
>P76425 ~~~rcnA~~~Nickel/cobalt efflux system RcnA~~~COG2215
MTEFTTLLQQGNAWFFIPSAILLGALHGLEPGHSKTMMAAFIIAIKGTIKQAVMLGLAATISHTAVVWLIAFGGMVISKR
FTAQSAEPWLQLISAVIIISTAFWMFWRTWRGERNWLENMHGHDYEHHHHDHEHHHDHGHHHHHEHGEYQDAHARAHAND
IKRRFDGREVTNWQILLFGLTGGLIPCPAAITVLLICIQLKALTLGATLVVSFSIGLALTLVTVGVGAAISVQQVAKRWS
GFNTLAKRAPYFSSLLIGLVGVYMGVHGFMGIMR
>P64534 ~~~rcnB~~~Nickel/cobalt homeostasis protein RcnB~~~COG5455
MTIKNKMLLGALLLVTSAAWAAPATAGSTNTSGISKYELSSFIADFKHFKPGDTVPEMYRTDEYNIKQWQLRNLPAPDAG
THWTYMGGAYVLISDTDGKIIKAYDGEIFYHR
>P64530 ~~~rcnR~~~Transcriptional repressor RcnR~~~COG1937
MSHTIRDKQKLKARASKIQGQVVALKKMLDEPHECAAVLQQIAAIRGAVNGLMREVIKGHLTEHIVHQGDELKREEDLDV
VLKVLDSYIK
>Q13YL3 ~~~rcoM1~~~Heme-containing CO-sensing transcriptional regulator RcoM 1~~~COG3279
MKSSEPASVSAAERRAETFQHKLEQFNPGIVWLDQHGRVTAFNDVALQILGPAGEQSLGVAQDSLFGIDVVQLHPEKSRD
KLRFLLQSKDVGGCPVKSPPPVAMMINIPDRILMIKVSSMIAAGGACGTCMIFYDVTDLTTEPSGLPAGGSAPSPRRLFK
IPVYRKNRVILLDLKDIVRFQGDGHYTTIVTRDDRYLSNLSLADLELRLDSSIYLRVHRSHIVSLQYAVELVKLDESVNL
VMDDAEQTQVPVSRSRTAQLKELLGVV
>Q13IY4 ~~~rcoM2~~~Heme-containing CO-sensing transcriptional regulator RcoM 2~~~COG3279
MKSSESAAATASERRAETFQHKLEQFNPGIVWLDPQGHVSAFNDVALHILGPAGEQSLGVAQDHLFGIDVVQLHPEKSRD
KLRFLLQSRDAGGCPVRSPPPVAMMINIPDRILMIKVSKMTGAAGTCGSCMIFYDVTDLTTEPSSQPAGASVPAPRRLFK
IPVYRKSRVILIDLKDIVRFQGDGHYTTIVTKDERYLSNLSLADLELRLDSSVYLRVHRSHIVSLPYAVELVKLDESVNL
VMDDAEQTQVPVSRSRTAQLKELLGVV
>Q2RNI6 ~~~rcoM~~~CO-responsive transcriptional regulator RcoM~~~COG3279
MDDFAYNLRRAETGVLLLADDLTVTAVSPGALSLLGLDKPGALLGRPILDLHPPPLRPKVAVLLKTARGPSGPAATAVLS
LRGGPVLIRASALTGQGETAFALVLTAVGESRETKEGPSARPAPPGYLRKVPLGLGETTEFVDTAGVIYLEADGHYSRVH
TAFGHSFCPLALAELERRLDPDQFLRVHRSYIVALAHVRAFRKRESGGLLVMDTGAGDLVPIGRAQVTRLRGLLAI
>Q55169 ~~~rcp1~~~Response regulator Rcp1~~~COG0784
MSDESNPPKVILLVEDSKADSRLVQEVLKTSTIDHELIILRDGLAAMAFLQQQGEYENSPRPNLILLDLNLPKKDGREVL
AEIKQNPDLKRIPVVVLTTSHNEDDVIASYELHVNCYLTKSRNLKDLFKMVQGIESFWLETVTLPAA
>P0DMC9 ~~~rcsA~~~Transcriptional regulatory protein RcsA~~~COG2197
MSTIIMDLCSYTRLGLTGYLLSRGVKKREINDIETVDDLAIACDSQRPSVVFINEDCFIHDASNSQRIKLIINQHPNTLF
IVFMAIANVHFDEYLLVRKNLLISSKSIKPESLDDILGDILKKETTITSFLNMPTLSLSRTESSMLRMWMAGQGTIQISD
QMNIKAKTVSSHKGNIKRKIKTHNKQVIYHVVRLTDNVTNGIFVNMR
>P0DMD0 ~~~rcsA~~~Transcriptional regulatory protein RcsA~~~COG2197
MSTIIMDLCSYTRLGLTGYLLSRGVKKREINDIETVDDLAIACDSQRPSVVFINEDCFIHDASNSQHIKHIINQHPNTLF
IVFMAIANVHFDEYLLVRKNLLISSKSIKPESLDDILGDILKKETTITSFLNMPTLSLSRTESSMLRMWMAGQGTIQISD
QMNIKAKTVSSHKGNIKRKIKTHNKQVIYHVVRLTDNVTNGIFVNMR
>P0DMC7 ~~~rcsB~~~Transcriptional regulatory protein RcsB~~~COG2197
MNNMNVIIADDHPIVLFGIRKSLEQIEWVNVVGEFEDSTALINNLPKLDAHVLITDLSMPGDKYGDGITLIKYIKRHFPS
LSIIVLTMNNNPAILSAVLDLDIEGIVLKQGAPTDLPKALAALQKGKKFTPESVSRLLEKISAGGYGDKRLSPKESEVLR
LFAEGFLVTEIAKKLNRSIKTISSQKKSAMMKLGVENDIALLNYLSSVTLSPADKD
>P0DMC8 ~~~rcsB~~~Transcriptional regulatory protein RcsB~~~COG2197
MNNMNVIIADDHPIVLFGIRKSLEQIEWVNVVGEFEDSTALINNLPKLDAHVLITDLSMPGDKYGDGITLIKYIKRHFPS
LSIIVLTMNNNPAILSAVLDLDIEGIVLKQGAPTDLPKALAALQKGKKFTPESVSRLLEKISAGGYGDKRLSPKESEVLR
LFAEGFLVTEIAKKLNRSIKTISSQKKSAMMKLGVENDIALLNYLSSVTLSPADKD
>P58663 ~~~rcsB~~~Transcriptional regulatory protein RcsB~~~
MNNMNVIIADDHPIVLFGIRKSLEQIEWVNVVGEFEDSTALINNLPKLDAHVLITDLSMPGDKYGDGITLIKYIKRHFPS
LSIIVLTMNNNPAILSAVLDLDIEGIVLKQGAPTDLPKALAALQKGKKFTPESVSRLLEKISAGGYGDKRLSPKESEVLR
LFAEGFLVTEIAKKLNRSIKTISSQKKSAMMKLGVENDIALLNYLSSVTLSPTDKE
>P0DMC5 2.7.13.3~~~rcsC~~~Sensor histidine kinase RcsC~~~COG0784
MKYLASFRTTLKASRYMFRALALVLWLLIAFSSVFYIVNALHQRESEIRQEFNLSSDQAQRFIQRTSDVMKELKYIAENR
LSAENGVLSPRGRETQADVPAFEPLFADSDCSAMSNTWRGSLESLAWFMRYWRDNFSAAYDLNRVFLIGSDNLCMANFGL
RDMPVERDTALKALHERINKYRNAPQDDSGSNLYWISEGPRPGVGYFYALTPVYLANRLQALLGVEQTIRMENFFLPGTL
PMGVTILDENGHTLISLTGPESKIKGDPRWMQERSWFGYTEGFRELVLKKNLPPSSLSIVYSVPVDKVLERIRMLILNAI
LLNVLAGAALFTLARMYERRIFIPAESDALRLEEHEQFNRKIVASAPVGICILRTADGVNILSNELAHTYLNMLTHEDRQ
RLTQIICGQQVNFVDVLTSNNTNLQISFVHSRYRNENVAICVLVDVSSRVKMEESLQEMAQAAEQASQSKSMFLATVSHE
LRTPLYGIIGNLDLLQTKELPKGVDRLVTAMNNSSSLLLKIISDILDFSKIESEQLKIEPREFSPREVMNHITANYLPLV
VRKQLGLYCFIEPDVPVALNGDPMRLQQVISNLLSNAIKFTDTGCIVLHVRADGDYLSIRVRDTGVGIPAKEVVRLFDPF
FQVGTGVQRNFQGTGLGLAICEKLISMMDGDISVDSEPGMGSQFTVRIPLYGAQYPQKKGVEGLSGKRCWLAVRNASLCQ
FLETSLQRSGIVVTTYEGQEPTPEDVLITDEVVSKKWQGRAVVTFCRRHIGIPLEKAPGEWVHSVAAPHELPALLARIYL
IEMESDDPANALPSTDKAVSDNDDMMILVVDDHPINRRLLADQLGSLGYQCKTANDGVDALNVLSKNHIDIVLSDVNMPN
MDGYRLTQRIRQLGLTLPVIGVTANALAEEKQRCLESGMDSCLSKPVTLDVIKQTLTLYAERVRKSRDS
>P0DMC6 2.7.13.3~~~rcsC~~~Sensor histidine kinase RcsC~~~COG0784
MKYLASFRTTLKASRYMFRALALVLWLLIAFSSVFYIVNALHQRESEIRQEFNLSSDQAQRFIQRTSDVMKELKYIAENR
LSAENGVLSPRGRETQADVPAFEPLFADSDCSAMSNTWRGSLESLAWFIGYWRDNFSAAYDLNRVFLIGSDNLCMANFGL
RDMPVERDTALKALHERINKYRNAPQDDSGSNLYWISEGPRPGVGYFYALTPVYLANRLQALLGVEQTIRMENFFLPGTL
PMGVTILDENGHTLISLTGPESKIKGDPRWMQERSWFGYTEGFRELVLKKNLPPSSLSIVYSVPVDKVLERIRMVILNAI
LLNVLAGAALFTLARMYERRIFIPAESDALRLEEHEQFNRKIVASAPVGICILRTADGVNILSNELAHTYLNMLTHEDRQ
RLTQIICGQQVNFVDVLTSNNTNLQISFVHSRYRNENVAICVLVDVSSRVKMEESLQEMAQAAEQASQSKSMFLATVSHE
LRTPLYGIIGNLDLLQTKELPKGVDRLVTAMNNSSSLLLKIISDILDFSKIESEQLKIEPREFSPREVMNHITANYLPLV
VRKQLGLYCFIEPDVPVALNGDPMRLQQVISNLLSNAIKFTDTGCIVLHVRADGDYLSIRVRDTGVGIPAKEVVRLFDPF
FQVGTGVQRNFQGTGLGLAICEKLISMMDGDISVDSEPGMGSQFTVRIPLYGAQYPQKKGVEGLSGKRCWLAVRNASLCQ
FLETSLQRSGIVVTTYEGQEPTPEDVLITDEVVSKKWQGRAVVTFCRRHIGIPLEEAPGEWVHSVAAPHELPALLARIYL
IEMESDDPANALPSTDKAVSDNDDMMILVVDDHPINRRLLADQLGSLGYQCKTANDGVDALNVLSKNHIDIVLSDVNMPN
MDGYRLTQRTRQLGLTLPVIGVTANALAEEKQRCLESGMDSCLSKPVTLDVIKQTLTVYAERVRKSRES
>P39838 2.7.2.-~~~rcsD~~~Phosphotransferase RcsD~~~COG0642
MRQKETTATTRFSLLPGSITRFFLLLIIVLLVTMGVMVQSAVNAWLKDKSYQIVDITHAIQKRVDNWRYVTWQIYDNIAA
TTSPSSGEGLQETRLKQDVYYLEKPRRKTEALIFGSHDNSTLEMTQRMSTYLDTLWGAENVPWSMYYLNGQDNSLVLIST
LPLKDLTSGFKESTVSDIVDSRRAEMLQQANALDERESFSNMRRLAWQNGHYFTLRTTFNQPGHLATVVAFDLPINDLIP
PGMPLDSFRLEPDATATGNNDNEKEGTDSVSIHFNSTKIEISSALNSTDMRLVWQVPYGTLLLDTLQNILLPLLLNIGLL
ALALFGYTTFRHFSSRSTENVPSTAVNNELRILRAINEEIVSLLPLGLLVHDQESNRTVISNKIADHLLPHLNLQNITTM
AEQHQGIIQATINNELYEIRMFRSQVAPRTQIFIIRDQDREVLVNKKLKQAQRLYEKNQQGRMIFMKNIGDALKEPAQSL
AESAAKLNAPESKQLANQADVLVRLVDEIQLANMLADDSWKSETVLFSVQDLIDEVVPSVLPAIKRKGLQLLINNHLKAH
DMRRGDRDALRRILLLLMQYAVTSTQLGKITLEVDQDESSEDRLTFRILDTGEGVSIHEMDNLHFPFINQTQNDRYGKAD
PLAFWLSDQLARKLGGHLNIKTRDGLGTRYSVHIKMLAADPEVEEEEERLLDDVCVMVDVTSAEIRNIVTRQLENWGATC
ITPDERLISQDYDIFLTDNPSNLTASGLLLSDDESGVREIGPGQLCVNFNMSNAMQEAVLQLIEVQLAQEEVTESPLGGD
ENAQLHASGYYALFVDTVPDDVKRLYTEAATSDFAALAQTAHRLKGVFAMLNLVPGKQLCETLEHLIREKDVPGIEKYIS
DIDSYVKSLL
>Q9RPT0 2.1.1.-~~~rcsF~~~Putative S-adenosylmethionine-dependent methyltransferase RcsF~~~
MTHSVSPIGYIRSCFMEKFAIPRQPLLAPAARGTLELLPPFDQVEALEGLEQVSHVWLLFLFHQALEDKPRLKVRPPRLG
GNRSLGVFATRATHRPNGIGQSVVRLEGFEAGRLWLSGIDLLDGTPVLDIKPYVPYADAVADARNGIADAPPPGIAVEWS
EQARRQAHEHGQRLRQPVAELIEQCLAQDPRPAYQKPEPGRRYGVRLWDLDVHWHYPRPDLIRVLDVAGGD
>P69411 ~~~rcsF~~~Outer membrane lipoprotein RcsF~~~
MRALPICLVALMLSGCSMLSRSPVEPVQSTAPQPKAEPAKPKAPRATPVRIYTNAEELVGKPFRDLGEVSGDSCQASNQD
SPPSIPTARKRMQINASKMKANAVLLHSCEVTSGTPGCYRQAVCIGSALNITAK
>P36767 ~~~rdgC~~~Recombination-associated protein RdgC~~~COG2974
MLWFKNLMVYRLSREISLRAEEMEKQLASMAFTPCGSQDMAKMGWVPPMGSHSDALTHVANGQIVICARKEEKILPSPVI
KQALEAKIAKLEAEQARKLKKTEKDSLKDEVLHSLLPRAFSRFSQTMMWIDTVNGLIMVDCASAKKAEDTLALLRKSLGS
LPVVPLSMENPIELTLTEWVRSGSAAQGFQLLDEAELKSLLEDGGVIRAKKQDLTSEEITNHIEAGKVVTKLALDWQQRI
QFVMCDDGSLKRLKFCDELRDQNEDIDREDFAQRFDADFILMTGELAALIQNLIEGLGGEAQR
>P44628 ~~~rdgC~~~Recombination-associated protein RdgC~~~COG2974
MWFKNLMTYRLTKPLDWDLAQLQTQLEDCQFHPCGTQDQSKFGWSAPLRGSDLLYFSVGKQILLIAKKEEKILPANVVKR
ELDDRIESLEQKENRKLKKVEKQTLKDDVVMNLLPRAFSKNQHTALWIDTENNLIHIDAASSKRAEDALALLRKSLGSLP
VVPLAFANEPSTILTNWILQDNLPHWLLALEEAELRGSQEDSVIRCKKQPLENEEILALLQDGKKVVSKLALEWEDTLTF
VFNEDCTIKRLKFADTVREKNDDILKEDFAQRFDADFVLMTGILAKLTENLLDEFGGEKARL
>Q9HYX7 ~~~rdgC~~~Recombination-associated protein RdgC~~~
MWFRNLLVYRLTQDLQLDADSLEKALGEKSARPCASQELTTYGFTAPFGKGPDAPLVHVSQDFFLISARKEERILPGSVV
RDALKEKVDEIEAQQMRKVYKKERDQLKDEIVQTLLPRAFIRRSSTFAAIAPSLGLILVDSASAKKAEDLLSTLREALGS
LPVRPLSVKVAPTATLTDWVKTQEAAGDFHVLDECELRDTHEDGGVVRCKRQDLTSEEIQLHLTAGKLVTQLSLAWSDKL
SFVLDDKLAVKRLRFEDLLQEQAEKDGGEDALGQLDASFTLMMLTFAEFLPALFEALGGEEIPQGV
>Q7AY50 ~~~rdlA~~~Rodlin protein RdlA~~~
MLKKAMVAAAAAASVIGMSAAAAPQALAIGDDNGPAVANGNGAESAFGNSATKGDMSPQLSLVEGTLNKPCLGVEDVNVA
VINLVPIQDINVLADDLNQQCADNSTQAKRDGALSHVLEDLSVLSANGEGR
>Q934F8 ~~~rdlB~~~Rodlin protein RdlB~~~
MIKKVVAYAAIAASVMGASAAAAPQAMAIGDDSGPVSANGNGASQYFGNSMTTGNMSPQMALIQGSFNKPCIAVSDIPVS
VIGLVPIQDLNVLGDDMNQQCAENSTQAKRDGALAHLLEDVSILSSNGEGGKG
>Q54527 4.1.1.-~~~rdmB~~~Aclacinomycin 10-hydroxylase RdmB~~~
MSSSSPGEPLEPTDQDLDVLLKNLGNLVTPMALRVAATLRLVDHLLAGADTLAGLADRTDTHPQALSRLVRHLTVVGVLE
GGEKQGRPLRPTRLGMLLADGHPAQQRAWLDLNGAVSHADLAFTGLLDVVRTGRPAYAGRYGRPFWEDLSADVALADSFD
ALMSCDEDLAYEAPADAYDWSAVRHVLDVGGGNGGMLAAIALRAPHLRGTLVELAGPAERARRRFADAGLADRVTVAEGD
FFKPLPVTADVVLLSFVLLNWSDEDALTILRGCVRALEPGGRLLVLDRADVEGDGADRFFSTLLDLRMLTFMGGRVRTRD
EVVDLAGSAGLALASERTSGSTTLPFDFSILEFTAVSEEAAPAAQASEALPAQE
>Q54528 3.1.1.95~~~rdmC~~~Aclacinomycin methylesterase RdmC~~~
MSERIVPSGDVELWSDDFGDPADPALLLVMGGNLSALGWPDEFARRLADGGLHVIRYDHRDTGRSTTRDFAAHPYGFGEL
AADAVAVLDGWGVDRAHVVGLSMGATITQVIALDHHDRLSSLTMLLGGGLDIDFDANIERVMRGEPTLDGLPGPQQPFLD
ALALMNQPAEGRAAEVAKRVSKWRILSGTGVPFDDAEYARWEERAIDHAGGVLAEPYAHYSLTLPPPSRAAELREVTVPT
LVIQAEHDPIAPAPHGKHLAGLIPTARLAEIPGMGHALPSSVHGPLAEVILAHTRSAA
>Q8ZKU8 2.7.11.1~~~rdoA~~~Serine/threonine protein kinase RdoA~~~
MNDNAFTFQTLHPETIMDALFEQGIMVDSGLTPLNSYENRVYQFQDEDRRRFVVKFYRPERWSVDQIREEHQFALELVKD
EVPVAAPLAFNGQTLLAHQGYHYAIFPSVGGRQFEADNIDQMEAVGRYLGRLHQTGRKRPFTFRPDIGLAEYLFEPRQVF
EDAALIPSGQKAAFLKATDTLLSAVTECWRTDFATLRLHGDCHAGNILWRDGPLFVDLDDARNGPAIQDLWMLLNGDKAE
QRMQLETIIEAYEEVSEFDTAEIGLIEPLRAMRLVYYLAWLIRRWGDPAFPKNFPWLTGEDYWQRQTTTFIEQTKILHEP
PLQLTPMY
>P83310 1.14.11.44~~~rdpA~~~(R)-phenoxypropionate/alpha-ketoglutarate-dioxygenase~~~
MHAALSPLSQRFERIAVQPLTGVLGAEITGVDLREPLDDSTWNEILDAFHTYQVIYFPGQAITNEQHIAFSRRFGPVDPV
PLLKSIEGYPEVQMIRREANESGRVIGDDWHTDSTFLDAPPAAVVMRAIDVPEHGGDTGFLSMYTAWETLSPTMQATIEG
LNVVHSATRVFGSLYQAQNRRFSNTSVKVMDVDAGDRETVHPLVVTHPGSGRKGLYVNQVYCQRIEGMTDAESKPLLQFL
YEHATRFDFTCRVRWKKDQVLVWDNLCTMHRAVPDYAGKFRYLTRTTVGGVRPAR
>Q8KSC8 1.14.11.44~~~rdpA~~~(R)-phenoxypropionate/alpha-ketoglutarate-dioxygenase~~~
MHAALSPLSQRFERIAVQPLTGVLGAEITGVDLREPLDDSTWNEILDAFHTYQVIYFPGQAITNEQHIAFSRRFGPVDPV
PLLKSIEGYPEVQMIRREANESGRVIGDDWHTDSTFLDAPPAAVVMRAIDVPEHGGDTGFLSMYTAWETLSPTMQATIEG
LNVVHSATRVFGSLYQAQNRRFSNTSVKVMDVDAGDRETVHPLVVTHPGSGRKGLYVNQVYCQRIEGMTDAESKPLLQFL
YEHATRFDFTCRVRWKKDQVLVWDNLCTMHRAVPDYAGKFRYLTRTTVGGVRPAR
>O25608 1.-.-.-~~~rdxA~~~Oxygen-insensitive NADPH nitroreductase~~~COG0778
MKFLDQEKRRQLLNERHSCKMFDSHYEFSSTELEEIAEIARLSPSSYNTQPWHFVMVTDKDLKKQIAAHSYFNEEMIKSA
SALMVVCSLRPSELLPHGHYMQNLYPESYKVRVIPSFAQMLGVRFNHSMQRLESYILEQCYIAVGQICMGVSLMGLDSCI
IGGFDPLKVGEVLEERINKPKIACLIALGKRVAEASQKSRKSKVDAITWL
>Q8KHV6 1.21.98.2~~~rebD~~~Dichlorochromopyrrolate synthase~~~
MSVFDLPRLHFAGTATTRLPTGPRNGLVDLSTHSVVMDGERFPASRPAAEYHAYLDRVGGKGTAFAGNGYFAIDAGITAV
ERAAGEVDTGDLLVGRAVDVWGHYNEYLATTFNRARIFDVDPSSSWTSTVMIGQFGFGRLGRSHDVGYVFTGGVHGMQPP
RWHEDGRVLHQFTVPAGEDMTWFGSAADSPAAARLRELVESGEADGLVVQLALSDAGPAPMPHAQQWRLRGTIAPWHAGE
PRTCPAGRLLTPHNLTADLRGDHVSLNLISFRPPTGISGLELRTADTDRFIARVPADDPHGVVTVPAAEGGDEALCVVGT
TAAGERIVVSREREVTVHVDDASVFLEHPRGPGDSDQDAEIAVRTYVRGEPAAATIHIGQYFNPRAFPLDEHATAASATP
EDLDVVALCVDGTRWSRHCVISTDENGDGRFLLRGARPGATRLLLSAEGATPFDGLTAAAAYDNDDSLGLWSGLASVAVR
VLPDHWWMDDIPRDKVTFDLLYREVFAFYELLYSFMGEEVFSLADRFRVETHPRLIWQMCDPRNRAKTYYMPPTRDLTGP
QARLLLAYLRAQNSDVVVPVIEPSHTRSGTPISTRTDLVRALRHGVAIELAVMLQYLYAAFSIPTHGAGQELVSRGDWTP
EQLRLMCGDGGETTDGGVRGSLLGVAREEMIHFLVVNNVLMAVGEPFHVPDLDFGTINDTLMVPLDFSLEALGLGSVQRF
IQIEQPEGLTGAVRLGDLPVPVREAEDFHYASLSELYGDIREGLQRVPGLFLVERGRGGGEHHLFLRESVNAVHPDYQLE
VDDLSSALFAIDFVTEQGEGHVLTDEDTGEESHYDTFVRVADLLMKERLTAADTRRAQWSPAYPVARNPTVHGGGQSKEL
VTSPVARELMVLFNKSYFMMLQLMVQHFGGSPDASLRRSKLMNAAIDVMTGVMRPLAELLVTVPSGRHGRTAGPSFELDE
KPAFIPRADVARRAISLRFRHLAESARTCALVPDKVVRNLDFLADQFATEGPR
>Q8KI76 1.5.1.30~~~rbmH~~~Flavin reductase (NADPH)~~~
MTIEFDRPGAHVTAADHRALMSLFPTGVAVITAIDEAGTPHGMTCTSLTSVTLDPPTLLVCLNRASGTLHAVRGGRFGVN
LLHARGRRAAEVFSTAVQDRFGEVRWEHSDVTGMPWLAEDAHAFAGCVVRKSTVVGDHEIVLGEVHEVVREHDLPLLYGM
REFAVWTPEG
>Q8KHE4 4.3.3.5~~~rebG~~~4'-demethylrebeccamycin synthase~~~
MGARVLVATTPGDGHVNPMVPVAQEMVSRGHEVRWYTGKAFRSTVERTGARHEPMRDAHDFGGMPREEAFPQHAGLTGIT
GMIAGFRDIFIEPAADQMTDLLALLEDFPADVLVTDETFFGAGFVSERTGIPVAWIATSIYVFSSRDTAPLGLGLPPSSS
RLGRLRNTVLKQLTDRVVMRDLRRHADVVRDRVGLPRIRKGAFENIMRTPDLYLLGTVPSFEYPRGDMPPEVRFVGPFVS
PAPPDFTPPAWWGELDSGRPVVHVTQGTVANDAERLLLPAIRALAAEDVLVVATTGAPLELEPMPANVRVERFIPHHALL
PHVDAMVTNGGYGGVNTALAHGVPLVVAAATEEKHEVAARVSWSGAGVHLKKRRLSERDIRRAVRAVLDEPRFRVHAARL
RDEYAARDAVVDAVDLIEGLV
>Q8KHZ8 1.14.19.9~~~rebH~~~Flavin-dependent tryptophan halogenase RebH~~~
MSGKIDKILIVGGGTAGWMAASYLGKALQGTADITLLQAPDIPTLGVGEATIPNLQTAFFDFLGIPEDEWMRECNASYKV
AIKFINWRTAGEGTSEARELDGGPDHFYHSFGLLKYHEQIPLSHYWFDRSYRGKTVEPFDYACYKEPVILDANRSPRRLD
GSKVTNYAWHFDAHLVADFLRRFATEKLGVRHVEDRVEHVQRDANGNIESVRTATGRVFDADLFVDCSGFRGLLINKAME
EPFLDMSDHLLNDSAVATQVPHDDDANGVEPFTSAIAMKSGWTWKIPMLGRFGTGYVYSSRFATEDEAVREFCEMWHLDP
ETQPLNRIRFRVGRNRRAWVGNCVSIGTSSCFVEPLESTGIYFVYAALYQLVKHFPDKSLNPVLTARFNREIETMFDDTR
DFIQAHFYFSPRTDTPFWRANKELRLADGMQEKIDMYRAGMAINAPASDDAQLYYGNFEEEFRNFWNNSNYYCVLAGLGL
VPDAPSPRLAHMPQATESVDEVFGAVKDRQRNLLETLPSLHEFLRQQHGR
>Q8KZ94 2.1.1.164~~~rebM~~~Demethylrebeccamycin-D-glucose O-methyltransferase~~~
MTESKSEGTAVAAPTPEEVRQMYDDFTDPFARIWGENLHFGYWEDAGADVSVDDATDRLTDEMIALLDVRSGDRVLDVGC
GIGKPAVRLATARDVRVTGISISRPQVNQANARATAAGLANRVTFSYADAMDLPFEDASFDAVWALESLHHMPDRGRALR
EMARVLRPGGTVAIADFVLLAPVEGAKKEAVDAFRAGGGVLSLGGIDEYESDVRQAELVVTSTVDISAQARPSLVKTAEA
FENARSQVEPFMGAEGLDRMIATFRGLAEVPEAGYVLIGARKP
>Q8KHS0 1.4.3.23~~~rebO~~~Flavin-dependent L-tryptophan oxidase RebO~~~
MSRGHKKITVLGAGVAGLVAAHELEELGHEVEVLEGSDRLGGRVHTHRFGEGGSVPFVELGAMRIPTKHRHTIDYIGKLG
LTPKLKEFKTLFSDDGAYHTTSAGFVRVRDAAKVLVDEFRLLMSGRDLREETILFGAWLTAVGDAIAPADFRAALRTDFT
ADLLEVVDRIDLDPFLVGAARDQFDLHAFFAAHPEVRTSCTGKLNRFVDDILDETSPRLLRLEGGMDQLVDALVERIRGD
IRTGHEVSAIDVREDHVAVTVHNGHGVNTLRSDHVLCTIPFSVLRNLRLTGLSTDKLEIIHDVKYWSATKVAFRCREPFW
ERDGINGGASFGGGRIRQTYYPPVEGDPTRGAVLLASYTMGDDADVLGGMPEAQRHEVVLDEVGRMHPELHEPGMVVEAV
SRAWGEDRWSNGAGVTRWGKDVAACEEERDRAARPEGRLYFAGEHCSSTTAWIDGAVESALAAVRAIEAGDGR
>P62209 ~~~recA~~~Protein RecA~~~
MSSAQLRLVEKDSMDKQKALDAALSQIERAFGKGSIMKLGARENLVETEVISTGSLGLDIALGIGGLPKGRIVEIYGPES
SGKTTLALHAIAQAQKAGGTCAFVDAEHALDPSYARKLGVNIDELLISQPDAGEQALEIADTLVRSGAIDVLVVDSVAAL
VPRAELEGEMGDSHVGLHARLMSQALRKLTGSISKSNCLVIFINQIRLKIGVMFGNPETTTGGNALKFYASVRLDIRRIG
SIKDRDTVVGNQTRVKVVKNKMAPPFRVVEFDIMYGEGVSKVGELLDLGIQAGVVDKSGAWFSYDGTRIGQGRENAKTYL
RNNPEMADAIEAKIRGNAGLVADAMMGTPEADGEASTPE
>P16971 ~~~recA~~~Protein RecA~~~COG0468
MSDRQAALDMALKQIEKQFGKGSIMKLGEKTDTRISTVPSGSLALDTALGIGGYPRGRIIEVYGPESSGKTTVALHAIAE
VQQQGGQAAFIDAEHALDPVYAQKLGVNIEELLLSQPDTGEQALEIAEALVRSGAVDIVVVDSVAALVPKAEIEGDMGDS
HVGLQARLMSQALRKLSGAINKSKTIAIFINQIREKVGVMFGNPETTPGGRALKFYSSVRLEVRRAEQLKQGNDVMGNKT
KIKVVKNKVAPPFRTAEVDIMYGEGISKEGEIIDLGTELDIVQKSGSWYSYEEERLGQGRENAKQFLKENKDIMLMIQEQ
IREHYGLDNNGVVQQQAEETQEELEFEE
>Q2YRU7 ~~~recA~~~Protein RecA~~~
MSQNSLRLVEDNSVDKTKALDAALSQIERAFGKGSIMRLGQNDQVVEIETVSTGSLSLDIALGVGGLPKGRIVEIYGPES
SGKTTLALHTIAEAQKKGGICAFVDAEHALDPVYARKLGVDLENLLISQPDTGEQALEITDTLVRSGAIDVLVVDSVAAL
TPRAEIEGEMGDSLPGLQARLMSQALRKLTGSISRSNCMVIFINQIRMKIGVMFGSPETTTGGNALKFYASVRLDIRRIG
SIKERDEVVGNQTRVKVVKNKLAPPFKQVEFDIMYGAGVSKVGELVDLGVKAGVVEKSGAWFSYNSQRLGQGRENAKQYL
KDNPEVAREIETTLRQNAGLIAEQFLDDGGPEEDAAGAAEM
>P42443 ~~~recA~~~Protein RecA~~~COG0468
MSKDATKEISAPTDAKERSKAIETAMSQIEKAFGKGSIMKLGAESKLDVQVVSTGSLSLDLALGVGGIPRGRITEIYGPE
SGGKTTLALAIVAQAQKAGGTCAFIDAEHALDPVYARALGVNTDELLVSQPDNGEQALEIMELLVRSGAIDVVVVDSVAA
LTPRAEIEGDMGDSLPGLQARLMSQALRKLTAILSKTGTAAIFINQVREKIGVMYGNPETTTGGRALKFYASVRLDVRKI
GQPTKVGNDAVANTVKIKTVKNKVAAPFKEVELALVYGKGFDQLSDLVGLAADMDIIKKAGSFYSYGDERIGQGKEKTIA
YIAERPEMEQEIRDRVMAAIRAGNAGEAPALAPAPAAPEAAEA
>B7UHB7 ~~~recA~~~Protein RecA~~~
MAIDENKQKALAAALGQIEKQFGKGSIMRLGEDRSMDVETISTGSLSLDIALGAGGLPMGRIVEIYGPESSGKTTLTLQV
IAAAQREGKTCAFIDAEHALDPIYARKLGVDIDNLLCSQPDTGEQALEICDALARSGAVDVIVVDSVAALTPKAEIEGEI
GDSHMGLAARMMSQAMRKLAGNLKQSNTLLIFINQIRMKIGVMFGNPETTTGGNALKFYASVRLDIRRIGAVKEGENVVG
SETRVKVVKNKIAAPFKQAEFQILYGEGINFYGELVDLGVKEKLIEKAGAWYSYKGEKIGQGKANATAWLKDNPETAKEI
EKKVRELLLSNPNSTPDFSVDDSEGVAETNEDF
>P0A7G6 ~~~recA~~~Protein RecA~~~COG0468
MAIDENKQKALAAALGQIEKQFGKGSIMRLGEDRSMDVETISTGSLSLDIALGAGGLPMGRIVEIYGPESSGKTTLTLQV
IAAAQREGKTCAFIDAEHALDPIYARKLGVDIDNLLCSQPDTGEQALEICDALARSGAVDVIVVDSVAALTPKAEIEGEI
GDSHMGLAARMMSQAMRKLAGNLKQSNTLLIFINQIRMKIGVMFGNPETTTGGNALKFYASVRLDIRRIGAVKEGENVVG
SETRVKVVKNKIAAPFKQAEFQILYGEGINFYGELVDLGVKEKLIEKAGAWYSYKGEKIGQGKANATAWLKDNPETAKEI
EKKVRELLLSNPNSTPDFSVDDSEGVAETNEDF
>P42445 ~~~recA~~~Protein RecA~~~COG0468
MAIDEDKQKAISLAIKQIDKVFGKGALVRLGDKQVEKIDSISTGSLGLDLALGIGGVPKGRIIEIYGPESSGKTTLSLHI
IAECQKNGGVCAFIDAEHALDVHYAKRLGVDTENLLVSQPDTGEQALEILETITRSGGIDLVVVDSVAALTPKAEIDGDM
GDQHVGLQARLMSHALRKITGVLHKMNTTLIFINQIRMKIGMMGYGSPETTTGGNALKFYASVRIDIRRIASLKQNEQHI
GNRAKAKVVKNKVAPPFREAEFDIMFGEGISKEGEIIDYGVKLDIVDKSGAWLSYQDKKLGQGRENAKALLKEDKALADE
ITLKIKESIGSNEEIMPLPDEPLEEME
>Q9F672 ~~~recA~~~Protein RecA~~~
MDDKKAANNSEKSKALAAALAQIEKQFGKGSVMRMEDGVIAEEIQAVSTGSLGLDIALGIGGLPRGRVIEIYGPESSGKT
TLTLQSIAEMQKLGGTCAFIDAEHALDVTYAQKLGVNLNDLLISQPDTGEQALEICDALVRSGAVDLIVVDSVAALTPKA
EIEGDMGDSLPGLQARLMSQALRKLTGSINRTNTTVIFINQIRMKIGVMFGNPETTTGGNALKFYASVRLDIRRTGSIKS
GDEVIGSETKVKVVKNKVAPPFREAHFDILYGEGTSREGEILDLGSEHKVVEKSGAWYSYNGERIGQGKDNARNYLKEHP
ELAREIENKVRVALGVPELAGGEAEAEAKAS
>P78014 ~~~recA~~~Protein RecA~~~
MVQKEMINKKISQDNSFIQNNNLADFDFLDAKKNSEIKTVSTGSLHLDEALGTGGLPLGRIVELYGNESSGKTTVALHAV
ASFQKAGKVACYIDAEGALDLSYAKAIGIDLGKLLVAHPKHGENAFALMESLIKTNKVALIVVDSVAALIPKQELEGNMD
DQTIGLHARMMSKGLRRVQSLLPESDTCLLFINQLREKPGVMFGNGEVTTGGRALKFYASMRMEAKRSELLKDRFGNYVG
IKSKLTVSKNKVARPFGVAFLEIMFNRGIVYEHEVIELALKHNVVVRSDNAYSFKSQNIAIGKEKLFSVLAEKPELFEQI
KQLTIKQIHSPPPPAS
>Q59560 ~~~recA~~~Protein RecA~~~COG0468
MAQQAPDREKALELAMAQIDKNFGKGSVMRLGEEVRQPISVIPTGSISLDVALGIGGLPRGRVIEIYGPESSGKTTVALH
AVANAQAAGGIAAFIDAEHALDPEYAKKLGVDTDSLLVSQPDTGEQALEIADMLVRSGALDIIVIDSVAALVPRAEIEGE
MGDSHVGLQARLMSQALRKMTGALNNSGTTAIFINQLREKIGVMFGSPETTTGGKALKFYASVRLDVRRIETLKDGTDAV
GNRTRVKVVKNKVSPPFKQAEFDILYGQGISREGSLIDMGVEHGFIRKSGSWFTYEGEQLGQGKENARKFLLENTDVANE
IEKKIKEKLGIGAVVTAEADDVLPAPVDF
>P9WHJ3 ~~~recA~~~Protein RecA~~~COG0468
MTQTPDREKALELAVAQIEKSYGKGSVMRLGDEARQPISVIPTGSIALDVALGIGGLPRGRVIEIYGPESSGKTTVALHA
VANAQAAGGVAAFIDAEHALDPDYAKKLGVDTDSLLVSQPDTGEQALEIADMLIRSGALDIVVIDSVAALVPRAELEGEM
GDSHVGLQARLMSQALRKMTGALNNSGTTAIFINQLRDKIGVMFGSPETTTGGKALKFYASVRMDVRRVETLKDGTNAVG
NRTRVKVVKNKCLAEGTRIFDPVTGTTHRIEDVVDGRKPIHVVAAAKDGTLHARPVVSWFDQGTRDVIGLRIAGGAIVWA
TPDHKVLTEYGWRAAGELRKGDRVAQPRRFDGFGDSAPIPADHARLLGYLIGDGRDGWVGGKTPINFINVQRALIDDVTR
IAATLGCAAHPQGRISLAIAHRPGERNGVADLCQQAGIYGKLAWEKTIPNWFFEPDIAADIVGNLLFGLFESDGWVSREQ
TGALRVGYTTTSEQLAHQIHWLLLRFGVGSTVRDYDPTQKRPSIVNGRRIQSKRQVFEVRISGMDNVTAFAESVPMWGPR
GAALIQAIPEATQGRRRGSQATYLAAEMTDAVLNYLDERGVTAQEAAAMIGVASGDPRGGMKQVLGASRLRRDRVQALAD
ALDDKFLHDMLAEELRYSVIREVLPTRRARTFDLEVEELHTLVAEGVVVHNCSPPFKQAEFDILYGKGISREGSLIDMGV
DQGLIRKSGAWFTYEGEQLGQGKENARNFLVENADVADEIEKKIKEKLGIGAVVTDDPSNDGVLPAPVDF
>P65977 ~~~recA~~~Protein RecA~~~
MAIDENKQKALAAALGQIEKQFGKGSIMRLGEDRSMDVETISTGSLSLDIALGAGGLPMGRIVEIYGPESSGKTTLTLQV
IAAAQREGKTCAFIDAEHALDPVYARKLGVDIDNLLCSQPDTGEQALEICDALARSGAVDVIVVDSVAALTPKAEIEGEI
GDSHMGLAARMMSQAMRKLAGNLKQSNTLLIFINQIRMKIGVMFGNPETTTGGNALKFYASVRLDIRRIGAVKEGDNVVG
SETRVKVVKNKIAAPFKQAEFQILYGEGINFYGELVDLGVKEKLIEKAGAWYSYNGEKIGQGKANATTWLKENPATAKEI
EKRVRELLLSNQNATPDFAVDDSEGVAETNEDF
>P68844 ~~~recA~~~Protein RecA~~~
MDNDRQKALDTVIKNMEKSFGKGAVMKLGDNIGRRVSTTSTGSVTLDNALGVGGYPKGRIIEIYGPESSGKTTVALHAIA
EVQSNGGVAAFIDAEHALDPEYAQALGVDIDNLYLSQPDHGEQGLEIAEAFVRSGAVDIVVVDSVAALTPKAEIEGEMGD
THVGLQARLMSQALRKLSGAISKSNTTAIFINQIREKVGVMFGNPETTPGGRALKFYSSVRLEVRRAEQLKQGQEIVGNR
TKIKVVKNKVAPPFRVAEVDIMYGQGISKEGELIDLGVENDIVDKSGAWYSYNGERMGQGKENVKMYLKENPQIKEEIDR
KLREKLGISDGDVEETEDAPKSLFDEE
>P0A451 ~~~recA~~~Protein RecA~~~COG0468
MAKKPKKLEEISKKFGAEREKALNDALKLIEKDFGKGSIMRLGERAEQKVQVMSSGSLALDIALGSGGYPKGRIIEIYGP
ESSGKTTVALHAVAQAQKEGGIAAFIDAEHALDPAYAAALGVNIDELLLSQPDSGEQGLEIAGKLIDSGAVDLVVVDSVA
ALVPRAEIDGDIGDSHVGLQARMMSQAMRKLGASINKTKTIAIFINQLREKVGVMFGNPETTPGGRALKFYASVRLDVRG
NTQIKGTGDQKETNVGKETKIKVVKNKVAPPFKEAVVEIMYGEGISKTGELLKIASDLDIIKKAGAWYSYKDEKIGQGSE
NAKKYLAEHPEIFDEIDKQVRSKFGLIDGEEVSEQDTENKKDEPKKEEAVNEEVPLDLGDELEIEIEE
>P0A452 ~~~recA~~~Protein RecA~~~COG0468
MAKKPKKLEEISKKFGAEREKALNDALKLIEKDFGKGSIMRLGERAEQKVQVMSSGSLALDIALGSGGYPKGRIIEIYGP
ESSGKTTVALHAVAQAQKEGGIAAFIDAEHALDPAYAAALGVNIDELLLSQPDSGEQGLEIAGKLIDSGAVDLVVVDSVA
ALVPRAEIDGDIGDSHVGLQARMMSQAMRKLGASINKTKTIAIFINQLREKVGVMFGNPETTPGGRALKFYASVRLDVRG
NTQIKGTGDQKETNVGKETKIKVVKNKVAPPFKEAVVEIMYGEGISKTGELLKIASDLDIIKKAGAWYSYKDEKIGQGSE
NAKKYLAEHPEIFDEIDKQVRSKFGLIDGEEVSEQDTENKKDEPKKEEAVNEEVPLDLGDELEIEIEE
>P74737 ~~~recA~~~Protein RecA~~~COG0468
MASTNISDREKALNAALAQIERSFGKGAIMRLGDATQMRVETISTGALTLDLALGGGLPKGRIVEIYGPESSGKTTLALH
AVAATQQAGGVAAFVDAEHALDPVYSKALGVDIDNLLVAQPDNGESALEIVDQLVRSTAVDIIVVDSVAALVPRAEIEGE
MGDTSVGSQARLMSKAMRKIAGNIGRSGCLVIFLNQLRQKIGVTYGSPEVTTGGNALKFYASVRLDIRRIQTLKKGTEGE
YGIRAKVKVAKNKVAPPFRIAEFDIIFGQGISRMGCTIDLAEKCEVITRKGAWYSYNGENIAQGRDNAMKYLEENPEIAA
TIDQQVREKLSLVNAVFPVETEDGAEEQGEDGDF
>P36203 ~~~recA~~~Protein RecA~~~COG0468
MPEEKQKKSVLEKALKRIEENFGKGSIMILGDETQVQPVEVIPTGSLAIDIATGVGGYPRGRIVEIFGQESSGKTTLALH
AIAEAQKMGGVAAFIDAEHALDPVYAKNLGVDLKSLLISQPDHGEQALEIVDELVRSGVVDLIVVDSVAALVPRAEIEGA
MGDMQVGLQARLMSQALRKIAGSVNKSKAVVIFTNQIRMKIGVMFGSPETTTGGLALKFYATMRMEVRRGEPIKEGKDVI
GNVISVKIVKNKVAPPFKTAQTYIIYGKGIDREYELFNIAVNEGIVDRKGSWYYYTTLKGEEVSLGQGSSNAVQFLKDNP
EIAGEIERRIREKYGLLSVEKEEQRKEKKSSGEEAS
>P08394 3.1.11.5~~~recB~~~RecBCD enzyme subunit RecB~~~COG1074
MSDVAETLDPLRLPLQGERLIEASAGTGKTFTIAALYLRLLLGLGGSAAFPRPLTVEELLVVTFTEAATAELRGRIRSNI
HELRIACLRETTDNPLYERLLEEIDDKAQAAQWLLLAERQMDEAAVFTIHGFCQRMLNLNAFESGMLFEQQLIEDESLLR
YQACADFWRRHCYPLPREIAQVVFETWKGPQALLRDINRYLQGEAPVIKAPPPDDETLASRHAQIVARIDTVKQQWRDAV
GELDALIESSGIDRRKFNRSNQAKWIDKISAWAEEETNSYQLPESLEKFSQRFLEDRTKAGGETPRHPLFEAIDQLLAEP
LSIRDLVITRALAEIRETVAREKRRRGELGFDDMLSRLDSALRSESGEVLAAAIRTRFPVAMIDEFQDTDPQQYRIFRRI
WHHQPETALLLIGDPKQAIYAFRGADIFTYMKARSEVHAHYTLDTNWRSAPGMVNSVNKLFSQTDDAFMFREIPFIPVKS
AGKNQALRFVFKGETQPAMKMWLMEGESCGVGDYQSTMAQVCAAQIRDWLQAGQRGEALLMNGDDARPVRASDISVLVRS
RQEAAQVRDALTLLEIPSVYLSNRDSVFETLEAQEMLWLLQAVMTPERENTLRSALATSMMGLNALDIETLNNDEHAWDV
VVEEFDGYRQIWRKRGVMPMLRALMSARNIAENLLATAGGERRLTDILHISELLQEAGTQLESEHALVRWLSQHILEPDS
NASSQQMRLESDKHLVQIVTIHKSKGLEYPLVWLPFITNFRVQEQAFYHDRHSFEAVLDLNAAPESVDLAEAERLAEDLR
LLYVALTRSVWHCSLGVAPLVRRRGDKKGDTDVHQSALGRLLQKGEPQDAAGLRTCIEALCDDDIAWQTAQTGDNQPWQV
NDVSTAELNAKTLQRLPGDNWRVTSYSGLQQRGHGIAQDLMPRLDVDAAGVASVVEEPTLTPHQFPRGASPGTFLHSLFE
DLDFTQPVDPNWVREKLELGGFESQWEPVLTEWITAVLQAPLNETGVSLSQLSARNKQVEMEFYLPISEPLIASQLDTLI
RQFDPLSAGCPPLEFMQVRGMLKGFIDLVFRHEGRYYLLDYKSNWLGEDSSAYTQQAMAAAMQAHRYDLQYQLYTLALHR
YLRHRIADYDYEHHFGGVIYLFLRGVDKEHPQQGIYTTRPNAGLIALMDEMFAGMTLEEA
>P9WMQ3 3.1.11.5~~~recB~~~RecBCD enzyme subunit RecB~~~COG1074
MDRFELLGPLPREGTTTVLEASAGTGKTFALAGLVTRYLAETAATLDEMLLITFNRAASRELRERVRGQIVEAVGALQGD
APPSGELVEHLLRGSDAERAQKRSRLRDALANFDAATIATTHEFCGSVLKSLGVAGDNAADVELKESLTDLVTEIVDDRY
LANFGRQETDPELTYAEALALALAVVDDPCAQLRPPDPEPGSKAAVRLRFAAEVLEELERRKGRLRAQGFNDLLIRLATA
LEAADSPARDRMRERWRIVLVDEFQDTDPMQWRVLERAFSRHSALILIGDPKQAIYGFRGGDIHTYLKAAGTADARYTLG
VNWRSDRALVESLQTVLRDATLGHADIVVRGTDAHHAGHRLASAPRPAPFRLRVVKRHTLGYDGTAHVPIEALRRHIPDD
LAADVAALLASGATFAGRPVVAADIAVIVEHHKDARACRNALAEAGIPAIYTGDTDVFASQAAKDWLCLLEAFDAPQRSG
LVRAAACTMFFGETAESLAAEGDALTDRVAGTLREWADHARHRGVAAVFQAAQLAGMGRRVLSQRGGERDLTDLAHIAQL
LHEAAHRERLGLPGLRDWLRRQAKAGAGPPEHNRRLDSDAAAVQIMTVFVAKGLQFPIVYLPFAFNRNVRSDDILLYHDD
GTRCLYIGGKDGGAQRRTVEGLNRVEAAHDNLRLTYVALTRAQSQVVAWWAPTFDEVNGGLSRLLRGRRPGQSQVPDRCT
PRVTDEQAWAVFAQWEAAGGPSVEESVIGARSSLEKPVPVPGFEVRHFHRRIDTTWRRTSYSDLVRGSEAVTVTSEPAAG
GRADEVEIAVVAAPGSGADLTSPLAALPSGASFGSLVHAVLETADPAAPDLAAELEAQVRRHAPWWTVDVDHAQLAPELA
RALLPMHDTPLGPAAAALTLRQIGVRDRLRELDFEMPLAGGDLRGRSPDVSLADVGELLASHLPGDDPLSPYADRLGSAG
LGDQPLRGYLAGSIDVVLRLPGQRYLVVDYKTNHLGDTAADYGFERLTEAMLHSDYPLQALLYVVVLHRFLRWRQRDYAP
ARHLGGVLYLFVRGMCGAATPVTAGHPAGVFTWNPPTALVVALSDLLDRGRLQS
>P07648 3.1.11.5~~~recC~~~RecBCD enzyme subunit RecC~~~COG1330
MLRVYHSNRLDVLEALMEFIVERERLDDPFEPEMILVQSTGMAQWLQMTLSQKFGIAANIDFPLPASFIWDMFVRVLPEI
PKESAFNKQSMSWKLMTLLPQLLEREDFTLLRHYLTDDSDKRKLFQLSSKAADLFDQYLVYRPDWLAQWETGHLVEGLGE
AQAWQAPLWKALVEYTHQLGQPRWHRANLYQRFIETLESATTCPPGLPSRVFICGISALPPVYLQALQALGKHIEIHLLF
TNPCRYYWGDIKDPAYLAKLLTRQRRHSFEDRELPLFRDSENAGQLFNSDGEQDVGNPLLASWGKLGRDYIYLLSDLESS
QELDAFVDVTPDNLLHNIQSDILELENRAVAGVNIEEFSRSDNKRPLDPLDSSITFHVCHSPQREVEVLHDRLLAMLEED
PTLTPRDIIVMVADIDSYSPFIQAVFGSAPADRYLPYAISDRRARQSHPVLEAFISLLSLPDSRFVSEDVLALLDVPVLA
ARFDITEEGLRYLRQWVNESGIRWGIDDDNVRELELPATGQHTWRFGLTRMLLGYAMESAQGEWQSVLPYDESSGLIAEL
VGHLASLLMQLNIWRRGLAQERPLEEWLPVCRDMLNAFFLPDAETEAAMTLIEQQWQAIIAEGLGAQYGDAVPLSLLRDE
LAQRLDQERISQRFLAGPVNICTLMPMRSIPFKVVCLLGMNDGVYPRQLAPLGFDLMSQKPKRGDRSRRDDDRYLFLEAL
ISAQQKLYISYIGRSIQDNSERFPSVLVQELIDYIGQSHYLPGDEALNCDESEARVKAHLTCLHTRMPFDPQNYQPGERQ
SYAREWLPAASQAGKAHSEFVQPLPFTLPETVPLETLQRFWAHPVRAFFQMRLQVNFRTEDSEIPDTEPFILEGLSRYQI
NQQLLNALVEQDDAERLFRRFRAAGDLPYGAFGEIFWETQCQEMQQLADRVIACRQPGQSMEIDLACNGVQITGWLPQVQ
PDGLLRWRPSLLSVAQGMQLWLEHLVYCASGGNGESRLFLRKDGEWRFPPLAAEQALHYLSQLIEGYREGMSAPLLVLPE
SGGAWLKTCYDAQNDAMLDDDSTLQKARTKFLQAYEGNMMVRGEGDDIWYQRLWRQLTPETMEAIVEQSQRFLLPLFRFN
QS
>P9WIQ5 3.1.11.5~~~recC~~~RecBCD enzyme subunit RecC~~~COG1330
MALHLHRAERTDLLADGLGALLADPQPDPFAQELVLVAARGVERWLSQRLSLVLGCGPGRADGVCAGIAFRNPQSLIAEI
TGTLDDDPWSPEALAWPLLAVIDASLDEPWCRTLASHLGHFATTDAEAELRRGRRYSVARRLAGLFASYARQRPGLLAAW
LDGDLGELPGDLAWQPPLWRALVTTVGADPPHVRHDKTIARLRDGPADLPARLSLFGHTRLACTDVQLLDALAVHHDLHL
WLPHPSDELWRALAGFQGADGLLPRRQDTSRRAAQHPLLETLGRDVRELQRALPAARATDEFLGATTKPDTLLGWLQADI
AGNAPRPAGRSLSDADRSVQVHACHGPARQIDVLREVLLGLLEDDPTLQPRDIVVMCPDIDTYAPLIVAGFGLGEVAGDC
HPAHRLRVRLADRALTQTNPLLSVAAELLTIAETRATASQLLNLAQAAPVRAKFGFADDDLDTITTWVRESNIRWGFDPT
HRRRYGLDTVVHNTWRFGLDRILTGVAMSEDSQAWLDTALPLDDVGSNRVELAGRLAEFVERLHHVVGGLSGARPLVAWL
DALATGIDLLTACNDGWQRAQVQREFADVLARAGSRAAPLLRLPDVRALLDAQLAGRPTRANFRTGTLTVCTMVPMRSVP
HRVVCLVGLDDGVFPRLSHPDGDDVLAREPMTGERDIRSEDRQLLLDAIGAATQTLVITYTGADERTGQPRPPAVPLAEL
LDALDQTTSAPVRERILVTHPLQPFDRKNVTPGALLGAKPFTFDPAALAAAQAAAGKRCPPTAFISGRLPAPPAADVTLA
DLLDFFKDPVKGFFRALDYTLPWDVDTVEDSIPVQVDALAEWTVGERMLRDMLRGLHPDDAAHSEWRRGTLPPGRLGVRR
AKEIRNRARDLAAAALAHRDGHGQAHDVDVDLGDGRRLSGTVTPVFGGRTVSVTYSKLAPKHVLPAWIGLVTLAAQEPGR
EWSALCIGRSKTRNHIARRLFVPPPDPVAVLRELVLLYDAGRREPLPLPLKTSCAWAQARRDGQDPYPPARECWQTNRFR
PGDDDAPAHVRAWGPRAPFEVLLGKPRAGEEVAGEETRLGALAARLWLPLLAAEGSV
>Q9RT63 3.6.4.12~~~recD2~~~ATP-dependent RecD-like DNA helicase~~~COG0507
MSAALPAEPFRVSGGVNKVRFRSDTGFTVMSATLRNEQGEDPDATVIGVMPPLDVGDTFSAEVLMEEHREYGYQYRVVNM
VLEAMPADLSEEGVAAYFEARVGGVGKVLAGRIAKTFGAAAFDLLEDDPQKFLQVPGITESTLHKMVSSWSQQGLERRLL
AGLQGLGLTINQAQRAVKHFGADALDRLEKDLFTLTEVEGIGFLTADKLWQARGGALDDPRRLTAAAVYALQLAGTQAGH
SFLPRSRAEKGVVHYTRVTPGQARLAVETAVELGRLSEDDSPLFAAEAAATGEGRIYLPHVLRAEKKLASLIRTLLATPP
ADGAGNDDWAVPKKARKGLSEEQASVLDQLAGHRLVVLTGGPGTGKSTTTKAVADLAESLGLEVGLCAPTGKAARRLGEV
TGRTASTVHRLLGYGPQGFRHNHLEPAPYDLLIVDEVSMMGDALMLSLLAAVPPGARVLLVGDTDQLPPVDAGLPLLALA
QAAPTIKLTQVYRQAAKNPIIQAAHGLLHGEAPAWGDKRLNLTEIEPDGGARRVALMVRELGGPGAVQVLTPMRKGPLGM
DHLNYHLQALFNPGEGGVRIAEGEARPGDTVVQTKNDYNNEIFNGTLGMVLKAEGARLTVDFDGNVVELTGAELFNLQLG
YALTVHRAQGSEWGTVLGVLHEAHMPMLSRNLVYTALTRARDRFFSAGSASAWQIAAARQREARNTALLERIRAH
>P04993 3.1.11.5~~~recD~~~RecBCD enzyme subunit RecD~~~COG0507
MKLQKQLLEAVEHKQLRPLDVQFALTVAGDEHPAVTLAAALLSHDAGEGHVCLPLSRLENNEASHPLLATCVSEIGELQN
WEECLLASQAVSRGDEPTPMILCGDRLYLNRMWCNERTVARFFNEVNHAIEVDEALLAQTLDKLFPVSDEINWQKVAAAV
ALTRRISVISGGPGTGKTTTVAKLLAALIQMADGERCRIRLAAPTGKAAARLTESLGKALRQLPLTDEQKKRIPEDASTL
HRLLGAQPGSQRLRHHAGNPLHLDVLVVDEASMIDLPMMSRLIDALPDHARVIFLGDRDQLASVEAGAVLGDICAYANAG
FTAERARQLSRLTGTHVPAGTGTEAASLRDSLCLLQKSYRFGSDSGIGQLAAAINRGDKTAVKTVFQQDFTDIEKRLLQS
GEDYIAMLEEALAGYGRYLDLLQARAEPDLIIQAFNEYQLLCALREGPFGVAGLNERIEQFMQQKRKIHRHPHSRWYEGR
PVMIARNDSALGLFNGDIGIALDRGQGTRVWFAMPDGNIKSVQPSRLPEHETTWAMTVHKSQGSEFDHAALILPSQRTPV
VTRELVYTAVTRARRRLSLYADERILSAAIATRTERRSGLAALFSSRE
>P9WHJ1 3.1.11.5~~~recD~~~RecBCD enzyme subunit RecD~~~COG0507
MKLTDVDFAVEASGMVRAFNQAGVLDVSDVHVAQRLCALAGESDERVALAVAVAVRALRAGSVCVDLLSIARVAGHDDLP
WPDPADWLAAVRASPLLADPPVLHLYDDRLLYLDRYWREEEQVCADLLALLTSRRPAGVPDLRRLFPTGFDEQRRAAEIA
LSQGVTVLTGGPGTGKTTTVARLLALVAEQAELAGEPRPRIALAAPTGKAAARLAEAVRREMAKLDATDRARLGDLHAVT
LHRLLGAKPGARFRQDRQNRLPHNVIVVDETSMVSLTLMARLAEAVRPGARLILVGDADQLASVEAGAVLADLVDGFSVR
DDALVAQLRTSHRFGKVIGTLAEAIRAGDGDAVLGLLRSGEERIEFVDDEDPAPRLRAVLVPHALRLREAALLGASDVAL
ATLDEHRLLCAHRDGPTGVLHWNRRVQAWLAEETGQPPWTPWYAGRPLLVTANDYGLRVYNGDTGVVLAGPTGLRAVISG
ASGPLDVATGRLGDVETMHAMTIHKSQGSQVDEVTVLMPQEDSRLLTRELLYTAVTRAKRKVRVVGSEASVRAAIARRAV
RASGLRMRLQSTGCG
>P15032 3.1.11.-~~~recE~~~Exodeoxyribonuclease 8~~~COG0847
MSTKPLFLLRKAKKSSGEPDVVLWASNDFESTCATLDYLIVKSGKKLSSYFKAVATNFPVVNDLPAEGEIDFTWSERYQL
SKDSMTWELKPGAAPDNAHYQGNTNVNGEDMTEIEENMLLPISGQELPIRWLAQHGSEKPVTHVSRDGLQALHIARAEEL
PAVTALAVSHKTSLLDPLEIRELHKLVRDTDKVFPNPGNSNLGLITAFFEAYLNADYTDRGLLTKEWMKGNRVSHITRTA
SGANAGGGNLTDRGEGFVHDLTSLARDVATGVLARSMDLDIYNLHPAHAKRIEEIIAENKPPFSVFRDKFITMPGGLDYS
RAIVVASVKEAPIGIEVIPAHVTEYLNKVLTETDHANPDPEIVDIACGRSSAPMPQRVTEEGKQDDEEKPQPSGTTAVEQ
GEAETMEPDATEHHQDTQPLDAQSQVNSVDAKYQELRAELHEARKNIPSKNPVDDDKLLAASRGEFVDGISDPNDPKWVK
GIQTRDCVYQNQPETEKTSPDMNQPEPVVQQEPEIACNACGQTGGDNCPDCGAVMGDATYQETFDEESQVEAKENDPEEM
EGAEHPHNENAGSDPHRDCSDETGEVADPVIVEDIEPGIYYGISNENYHAGPGISKSQLDDIADTPALYLWRKNAPVDTT
KTKTLDLGTAFHCRVLEPEEFSNRFIVAPEFNRRTNAGKEEEKAFLMECASTGKTVITAEEGRKIELMYQSVMALPLGQW
LVESAGHAESSIYWEDPETGILCRCRPDKIIPEFHWIMDVKTTADIQRFKTAYYDYRYHVQDAFYSDGYEAQFGVQPTFV
FLVASTTIECGRYPVEIFMMGEEAKLAGQQEYHRNLRTLSDCLNTDEWPAIKTLSLPRWAKEYAND
>Q8RDL3 ~~~recF~~~DNA replication and repair protein RecF~~~COG1195
MYLKEIFVDNFRNLKKQKLEFCEGVNLIYGLNAQGKSNLLEAIRLLSMGRSFRGSKMSELVKFDEEYFYVRGLVRSADFY
EKKIEFGYKVNGNKVIKVNGNKLKSTGEILGHFLTVIFSPEDIEIIKEGPSRRRKYLDACISVIDKNYFFDLLQYNKTLS
NRNSLLKKIKEEGKGEDLLEIFDEKLAEYGARIIKVRNNYLEKLKNSMSKFLMEISNEKLEIIYLNSAGVKEVHEENLIR
EKLKNRLTKSLTLDLKYLSTQVGPHREDFKILINGYDSRVYSSQGQKRTAALCLKLSELEILEEETGEKPVLLLDDVMSE
LDDNRKKYILKKLEGFQSFITHTSKSDVEGDCCFKIYDGIVMRE
>Q9RVE0 ~~~recF~~~DNA replication and repair protein RecF~~~COG1195
MGDVRLSALSTLNYRNLAPGTLNFPEGVTGIYGENGAGKTNLLEAAYLALTGQTDAPRIEQLIQAGETEAYVRADLQQGG
SLSIQEVGLGRGRRQLKVDGVRARTGDLPRGGAVWIRPEDSELVFGPPSGRRAYLDSLLSRLSARYGEQLSRYERTVSQR
NAALRGGEEWAMHVWDDVLLKLGTEIMLFRRRALTRLDELAREANAQLGSRKTLALTLTESTSPETYAADLRGRRAEELA
RGSTVTGPHRDDLLLTLGDFPASDYASRGEGRTVALALRRAELELLREKFGEDPVLLLDDFTAELDPHRRQYLLDLAASV
PQAIVTGTELAPGAALTLRAQAGRFTPVADEEMQAEGTA
>P0A7H0 ~~~recF~~~DNA replication and repair protein RecF~~~COG1195
MSLTRLLIRDFRNIETADLALSPGFNFLVGANGSGKTSVLEAIYTLGHGRAFRSLQIGRVIRHEQEAFVLHGRLQGEERE
TAIGLTKDKQGDSKVRIDGTDGHKVAELAHLMPMQLITPEGFTLLNGGPKYRRAFLDWGCFHNEPGFFTAWSNLKRLLKQ
RNAALRQVTRYEQLRPWDKELIPLAEQISTWRAEYSAGIAADMADTCKQFLPEFSLTFSFQRGWEKETEYAEVLERNFER
DRQLTYTAHGPHKADLRIRADGAPVEDTLSRGQLKLLMCALRLAQGEFLTRESGRRCLYLIDDFASELDDERRGLLASRL
KATQSQVFVSAISAEHVIDMSDENSKMFTVEKGKITD
>P9WHI9 ~~~recF~~~DNA replication and repair protein RecF~~~COG1195
MYVRHLGLRDFRSWACVDLELHPGRTVFVGPNGYGKTNLIEALWYSTTLGSHRVSADLPLIRVGTDRAVISTIVVNDGRE
CAVDLEIATGRVNKARLNRSSVRSTRDVVGVLRAVLFAPEDLGLVRGDPADRRRYLDDLAIVRRPAIAAVRAEYERVLRQ
RTALLKSVPGARYRGDRGVFDTLEVWDSRLAEHGAELVAARIDLVNQLAPEVKKAYQLLAPESRSASIGYRASMDVTGPS
EQSDIDRQLLAARLLAALAARRDAELERGVCLVGPHRDDLILRLGDQPAKGFASHGEAWSLAVALRLAAYQLLRVDGGEP
VLLLDDVFAELDVMRRRALATAAESAEQVLVTAAVLEDIPAGWDARRVHIDVRADDTGSMSVVLP
>P24230 3.6.4.12~~~recG~~~ATP-dependent DNA helicase RecG~~~COG1200
MKGRLLDAVPLSSLTGVGAALSNKLAKINLHTVQDLLLHLPLRYEDRTHLYPIGELLPGVYATVEGEVLNCNISFGGRRM
MTCQISDGSGILTMRFFNFSAAMKNSLAAGRRVLAYGEAKRGKYGAEMIHPEYRVQGDLSTPELQETLTPVYPTTEGVKQ
ATLRKLTDQALDLLDTCAIEELLPPELSQGMMTLPEALRTLHRPPPTLQLSDLETGQHPAQRRLILEELLAHNLSMLALR
AGAQRFHAQPLSANDTLKNKLLAALPFKPTGAQARVVAEIERDMALDVPMMRLVQGDVGSGKTLVAALAALRAIAHGKQV
ALMAPTELLAEQHANNFRNWFAPLGIEVGWLAGKQKGKARLAQQEAIASGQVQMIVGTHAIFQEQVQFNGLALVIIDEQH
RFGVHQRLALWEKGQQQGFHPHQLIMTATPIPRTLAMTAYADLDTSVIDELPPGRTPVTTVAIPDTRRTDIIDRVHHACI
TEGRQAYWVCTLIEESELLEAQAAEATWEELKLALPELNVGLVHGRMKPAEKQAVMASFKQGELHLLVATTVIEVGVDVP
NASLMIIENPERLGLAQLHQLRGRVGRGAVASHCVLLYKTPLSKTAQIRLQVLRDSNDGFVIAQKDLEIRGPGELLGTRQ
TGNAEFKVADLLRDQAMIPEVQRLARHIHERYPQQAKALIERWMPETERYSNA
>P64325 3.6.4.12~~~recG~~~ATP-dependent DNA helicase RecG~~~
MAKVNLIESPYSLLQLKGIGPKKIEVLQQLNIHTVEDLVLYLPTRYEDNTVIDLNQAEDQSNVTIEGQVYTAPVVAFFGR
NKSKLTVHLMVNNIAVKCIFFNQPYLKKKIELNQTITVKGKWNRVKQEITGNRVFFNSQGTQTQENADVQLEPVYRIKEG
IKQKQIRDQIRQALNDVTIHEWLTDELREKYKLETLDFTLNTLHHPKSKEDLLRARRTYAFTELFLFELRMQWLNRLEKS
SDEAIEIDYDLDQVKSFIDRLPFELTEAQKSSVNEIFRDLKAPIRMHRLLQGDVGSGKTVVAAICMYALKTAGYQSALMV
PTEILAEQHAESLMALFGDSMNVALLTGSVKGKKRKILLEQLENGTIDCLIGTHALIQDDVIFHNVGLVITDEQHRFGVN
QRQLLREKGAMTNVLFMTATPIPRTLAISVFGEMDVSSIKQLPKGRKPIITTWAKHEQYDKVLMQMTSELKKGRQAYVIC
PLIESSEHLEDVQNVVALYESLQQYYGVSRVGLLHGKLSADEKDEVMQKFSNHEIDVLVSTTVVEVGVNVPNATFMMIYD
ADRFGLSTLHQLRGRVGRSDQQSYCVLIASPKTETGIERMTIMTQTTDGFELSERDLEMRGPGDFFGVKQSGLPDFLVAN
LVEDYRMLEVARDEAAELIQSGVFFENTYQHLRHFVEENLLHRSFD
>P21893 3.1.-.-~~~recJ~~~Single-stranded-DNA-specific exonuclease RecJ~~~COG0608
MKQQIQLRRREVDETADLPAELPPLLRRLYASRGVRSAQELERSVKGMLPWQQLSGVEKAVEILYNAFREGTRIIVVGDF
DADGATSTALSVLAMRSLGCSNIDYLVPNRFEDGYGLSPEVVDQAHARGAQLIVTVDNGISSHAGVEHARSLGIPVIVTD
HHLPGDTLPAAEAIINPNLRDCNFPSKSLAGVGVAFYLMLALRTFLRDQGWFDERNIAIPNLAELLDLVALGTVADVVPL
DANNRILTWQGMSRIRAGKCRPGIKALLEVANRDAQKLAASDLGFALGPRLNAAGRLDDMSVGVALLLCDNIGEARVLAN
ELDALNQTRKEIEQGMQIEALTLCEKLERSRDTLPGGLAMYHPEWHQGVVGILASRIKERFHRPVIAFAPAGDGTLKGSG
RSIQGLHMRDALERLDTLYPGMMLKFGGHAMAAGLSLEEDKFKLFQQRFGELVTEWLDPSLLQGEVVSDGPLSPAEMTME
VAQLLRDAGPWGQMFPEPLFDGHFRLLQQRLVGERHLKVMVEPVGGGPLLDGIAFNVDTALWPDNGVREVQLAYKLDINE
FRGNRSLQIIIDNIWPI
>Q5SJ47 3.1.-.-~~~recJ~~~Single-stranded-DNA-specific exonuclease RecJ~~~COG0608
MRDRVRWRVLSLPPLAQWREVMAALEVGPEAALAYWHRGFRRKEDLDPPLALLPLKGLREAAALLEEALRQGKRIRVHGD
YDADGLTGTAILVRGLAALGADVHPFIPHRLEEGYGVLMERVPEHLEASDLFLTVDCGITNHAELRELLENGVEVIVTDH
HTPGKTPPPGLVVHPALTPDLKEKPTGAGVAFLLLWALHERLGLPPPLEYADLAAVGTIADVAPLWGWNRALVKEGLARI
PASSWVGLRLLAEAVGYTGKAVEVAFRIAPRINAASRLGEAEKALRLLLTDDAAEAQALVGELHRLNARRQTLEEAMLRK
LLPQADPEAKAIVLLDPEGHPGVMGIVASRILEATLRPVFLVAQGKGTVRSLAPISAVEALRSAEDLLLRYGGHKEAAGF
AMDEALFPAFKARVEAYAARFPDPVREVALLDLLPEPGLLPQVFRELALLEPYGEGNPEPLFLLFGAPEEARRLGEGRHL
AFRLKGVRVLAWKQGDLALPPEVEVAGLLSENAWNGHLAYEVQAVDLRKPEALEGGIAPFAYPLPLLEALARARLGEGVY
VPEDNPEGLDYAWKAGFRLLPPEEAGLWLGLPPRPVLGRRVEVALGREARARLSAPPVLHTPEARLKALVHRRLLFAYER
RHPGLFSEALLAYWEVNRVQEPAGSP
>Q9WXF2 ~~~recN~~~DNA repair protein RecN~~~COG0497
MTRKARTPKAAPVPEAVAVVEPPPPDAAPTGPRLSRLEIRNLATITQLELELGGGFCAFTGETGAGKSIIVDALGLLLGG
RANHDLIRSGEKELLVTGFWGDGDESEADSASRRLSSAGRGAARLSGEVVSVRELQEWAQGRLTIHWQHSAVSLLSPANQ
RGLLDRRVTKEAQAYAAAHAAWREAVSRLERLQASQRERARQIDLLAFQVQEISEVSPDPGEEEGLNTELSRLSNLHTIA
QAAAGGVELLSDGDLNAAGLIGEAVRALNAGAKYDETVMQLQNELRAALESVQAIAGELRDVAEGSAADPEALDRVEARL
SALSKLKNKYGPTLEDVVEFGAQAAEELAGLEEDERDAGSLQADVDALHAELLKVGQALDAAREREAEPLVDSLLAVIRE
LGMPHARMEFALSALAEPAAYGLSDVLLRFSANPGEELGPLSDVASGGELSRVMLAVSTVLGADTPSVVFDEVDAGIGGA
AAIAVAEQLSRLADTRQVLVVTHLAQIAARAHHHYKVEKQVEDGRTVSHVRLLTGDERLEEIARMLSGNTSEAALEHARE
LLAG
>P05824 ~~~recN~~~DNA repair protein RecN~~~COG0497
MLAQLTISNFAIVRELEIDFHSGMTVITGETGAGKSIAIDALGLCLGGRAEADMVRTGAARADLCARFSLKDTPAALRWL
EENQLEDGHECLLRRVISSDGRSRGFINGTAVPLSQLRELGQLLIQIHGQHAHQLLTKPEHQKFLLDGYANETSLLQEMT
ARYQLWHQSCRDLAHHQQLSQERAARAELLQYQLKELNEFNPQPGEFEQIDEEYKRLANSGQLLTTSQNALALMADGEDA
NLQSQLYTAKQLVSELIGMDSKLSGVLDMLEEATIQIAEASDELRHYCDRLDLDPNRLFELEQRISKQISLARKHHVSPE
ALPQYYQSLLEEQQQLDDQADSQETLALAVTKHHQQALEIARALHQQRQQYAEELAQLITDSMHALSMPHGQFTIDVKFD
EHHLGADGADRIEFRVTTNPGQPMQPIAKVASGGELSRIALAIQVITARKMETPALIFDEVDVGISGPTAAVVGKLLRQL
GESTQVMCVTHLPQVAGCGHQHYFVSKETDGAMTETHMQSLNKKARLQELARLLGGSEVTRNTLANAKELLAA
>P9WHI7 ~~~recN~~~DNA repair protein RecN~~~COG0497
MLTELRIESLGAISVATAEFDRGFTVLTGETGTGKTMVVTGLHLLGGARADATRVRSGADRAVVEGRFTTTDLDDATVAG
LQAVLDSSGAERDEDGSVIALRSISRDGPSRAYLGGRGVPAKSLSGFTNELLTLHGQNDQLRLMRPDEQRGALDRFAAAG
EAVQRYRKLRDAWLTARRDLVDRRNRARELAQEADRLKFALNEIDTVDPQPGEDVALVADIARLSELDTLREAATTARAT
LCGTPDADAFDRGAVDSLGRARAALQSSDDAALRGLAEQVGEALTVVVDAVAELGAYLDELPADASALDAKLARQAQLRT
LTRKYAADIDGVLRWADEARARLAQLDVSEEGLAALERRTGELAHELGQAAVDLSTIRRKAAKRLAKEVSAELSALAMAD
AEFTIGVTTELADHGDPVALALASGELARAGADGVDAVEFGFVAHRGMTVLPLAKSASGGELSRVMLSLEVVLATSRKQA
AGTTMVFDEIDAGVGGWAAVQIGRRLARLARTHQVIVVTHLPQVAAYADVHLMVQRTGRDGASGVRRLTSEDRVAELARM
LAGLGDSDSGRAHARELLETAQNDELT
>P42095 ~~~recO~~~DNA repair protein RecO~~~COG1381
MLTKCEGIVLRTNDYGETNKIVTLLTREHGKIGVMARGAKKPNSRLSAVSQPFLYGSFLMQKTSGLGTLQQGEMILSMRG
IREDLFLTAYAAYVAELVDRGTEEKKPNPYLFEFILESLKQLNEGTDPDVITFIVQMKMLGVMGLYPELNHCVHCKSQDG
TFHFSVRDNGFICHRCFEKDPYRIPIKPQTARLLRLFYYFDLSRLGNVSLKEETKAELKQVIDLYYEEYSGLYLKSKRFL
DQMESMKHLMGENKS
>P0A7H3 ~~~recO~~~DNA repair protein RecO~~~COG1381
MEGWQRAFVLHSRPWSETSLMLDVFTEESGRVRLVAKGARSKRSTLKGALQPFTPLLLRFGGRGEVKTLRSAEAVSLALP
LSGITLYSGLYINELLSRVLEYETRFSELFFDYLHCIQSLAGVTGTPEPALRRFELALLGHLGYGVNFTHCAGSGEPVDD
TMTYRYREEKGFIASVVIDNKTFTGRQLKALNAREFPDADTLRAAKRFTRMALKPYLGGKPLKSRELFRQFMPKRTVKTH
YE
>P15043 3.6.4.12~~~recQ~~~ATP-dependent DNA helicase RecQ~~~COG0514
MAQAEVLNLESGAKQVLQETFGYQQFRPGQEEIIDTVLSGRDCLVVMPTGGGKSLCYQIPALLLNGLTVVVSPLISLMKD
QVDQLQANGVAAACLNSTQTREQQLEVMTGCRTGQIRLLYIAPERLMLDNFLEHLAHWNPVLLAVDEAHCISQWGHDFRP
EYAALGQLRQRFPTLPFMALTATADDTTRQDIVRLLGLNDPLIQISSFDRPNIRYMLMEKFKPLDQLMRYVQEQRGKSGI
IYCNSRAKVEDTAARLQSKGISAAAYHAGLENNVRADVQEKFQRDDLQIVVATVAFGMGINKPNVRFVVHFDIPRNIESY
YQETGRAGRDGLPAEAMLFYDPADMAWLRRCLEEKPQGQLQDIERHKLNAMGAFAEAQTCRRLVLLNYFGEGRQEPCGNC
DICLDPPKQYDGSTDAQIALSTIGRVNQRFGMGYVVEVIRGANNQRIRDYGHDKLKVYGMGRDKSHEHWVSVIRQLIHLG
LVTQNIAQHSALQLTEAARPVLRGESSLQLAVPRIVALKPKAMQKSFGGNYDRKLFAKLRKLRKSIADESNVPPYVVFND
ATLIEMAEQMPITASEMLSVNGVGMRKLERFGKPFMALIRAHVDGDDEE
>Q8RDI4 ~~~recR~~~Recombination protein RecR~~~COG0353
MSYYSTSVAKLIEELSKLPGIGPKTAQRLAFFIINMPLDEVRSLSQAIIEAKEKLRYCKICFNITDKEVCDICSDENRDH
STICVVSHPMDVVAMEKVKEYKGVYHVLHGVISPIEGVGPEDIRIKELLERVRDGSVKEVILATNPDIEGEATAMYIAKL
LKPFGVKVTRIAHGIPVGGDLEYTDVVTLSKALEGRREV
>Q9ZNA2 ~~~recR~~~Recombination protein RecR~~~COG0353
MKYPPSLVSLIRELSRLPGIGPKSAQRLAFHLFEQPREDIERLASALLEAKRDLHVCPICFNITDAEKCDVCADPSRDQR
TICVVEEPGDVIALERSGEYRGLYHVLHGVLSPMNGVGPDKLHIKPLLPRVGQGMEVILATGTTVEGDATALYLQRLLEP
LGAAISRIAYGVPVGGSLEYTDEVTLGRALTGRQTVSKPQPPQRPGDEDGADGAAVPASR
>P0A7H6 ~~~recR~~~Recombination protein RecR~~~COG0353
MQTSPLLTQLMEALRCLPGVGPKSAQRMAFTLLQRDRSGGMRLAQALTRAMSEIGHCADCRTFTEQEVCNICSNPRRQEN
GQICVVESPADIYAIEQTGQFSGRYFVLMGHLSPLDGIGPDDIGLDRLEQRLAEEKITEVILATNPTVEGEATANYIAEL
CAQYDVEASRIAHGVPVGGELEMVDGTTLSHSLAGRHKIRF
>P9WHI3 ~~~recR~~~Recombination protein RecR~~~COG0353
MFEGPVQDLIDELGKLPGIGPKSAQRIAFHLLSVEPSDIDRLTGVLAKVRDGVRFCAVCGNVSDNERCRICSDIRRDASV
VCIVEEPKDIQAVERTREFRGRYHVLGGALDPLSGIGPDQLRIRELLSRIGERVDDVDVTEVIIATDPNTEGEATATYLV
RMLRDIPGLTVTRIASGLPMGGDLEFADELTLGRALAGRRVLA
>Q9I3H9 ~~~recR~~~Recombination protein RecR~~~
MSFSPLIRQLIESLRILPGVGQKSAQRMALMLLERDRSGGLKLAQALTAAMEGVGHCRQCRTLSEEELCPQCADPRRDDS
LLCVVEGPLDVFAVEQTGYRGRYFVLKGHLSPLDGLGPEAIGIPELEARIRDGAFSEVILATNPTVEGEATAHYIAQLLA
GRGLTLSRIAHGVPLGGELELVDGGTLAHALAGRRPIS
>P0CB76 ~~~recR~~~Recombination protein RecR~~~COG0353
MLYPTPIAKLIDSYSKLPGIGIKTATRLAFYTIGMSADDVNEFAKNLLSAKRELTYCSICGRLTDDDPCSICTDPTRDQT
TILVLEDSRDVAAMENIQEYHGLYHVLHGLISPMNGISPDDINLKSLMTRLMDSEVSEVIVATNATADGEATSMYLSRLL
KPAGIKVTRLARGLAVGADIEYADEVTLLRAIENRTEL
>Q5SHY0 ~~~recR~~~Recombination protein RecR~~~COG0353
MRYPESLLKLTRALSRLPGIGPKTAQRLALHLAFHKEEAEALAEALEGIKRVRACRECGNLAEGELCPICQDEDRDRSLL
AVVESVADLYALERSGEFRGLYHVLGGALNPLEGIGPKELNLEGLFRRLEGVEEVVLATSMTVEGEATALYLAEELKKRG
VRVTRPAYGLPVGGSLEYADEVTLGRALEGRRPV
>P33228 ~~~recT~~~Protein RecT~~~COG3723
MTKQPPIAKADLQKTQGNRAPAAVKNSDVISFINQPSMKEQLAAALPRHMTAERMIRIATTEIRKVPALGNCDTMSFVSA
IVQCSQLGLEPGSALGHAYLLPFGNKNEKSGKKNVQLIIGYRGMIDLARRSGQIASLSARVVREGDEFSFEFGLDEKLIH
RPGENEDAPVTHVYAVARLKDGGTQFEVMTRKQIELVRSLSKAGNNGPWVTHWEEMAKKTAIRRLFKYLPVSIEIQRAVS
MDEKEPLTIDPADSSVLTGEYSVIDNSEE
>P39792 3.1.21.10~~~recU~~~Holliday junction resolvase RecU~~~COG3331
MIRYPNGKTFQPKHSVSSQNSQKRAPSYSNRGMTLEDDLNETNKYYLTNQIAVIHKKPTPVQIVNVHYPKRSAAVIKEAY
FKQSSTTDYNGIYKGRYIDFEAKETKNKTSFPLQNFHDHQIEHMKQVKAQDGICFVIISAFDQVYFLEADKLFYFWDRKE
KNGRKSIRKDELEETAYPISLGYAPRIDYISIIEQLYFSPSSGAKG
>Q5KXY4 3.1.21.10~~~recU~~~Holliday junction resolvase RecU~~~COG3331
MALKYPSGKEYRGNKPNAARRPAADYANRGMTLEDDLNATNEYYRERGIAVIHKKPTPVQIVRVDYPKRSAAVITEAYFR
QASTTDYNGVYRGKYIDFEAKETKNKTAFPLKNFHAHQIRHMEQVVAHGGICFAILRFSLLNETYLLDASHLIAWWNKQE
AGGRKSIPKQEIERHGHSIPLGYQPRIDYISVVDNVYFTR
>P68817 3.1.21.10~~~recU~~~Holliday junction resolvase RecU~~~
MNYPNGKPYRKNSAIDGGKKTAAFSNIEYGGRGMSLEKDIEHSNTFYLKSDIAVIHKKPTPVQIVNVNYPKRSKAVINEA
YFRTPSTTDYNGVYQGYYIDFEAKETKNKTSFPLNNIHDHQVEHMKNAYQQKGIVFLMIRFKTLDEVYLLPYSKFEVFWK
RYKDNIKKSITVDEIRKNGYHIPYQYQPRLDYLKAVDKLILDESEDRV
>P66000 ~~~recX~~~Regulatory protein RecX~~~COG2137
MTESTSRRPAYARLLDRAVRILAVRDHSEQELRRKLAAPIMGKNGPEEIDATAEDYERVIAWCHEHGYLDDSRFVARFIA
SRSRKGYGPARIRQELNQKGISREATEKAMRECDIDWCALARDQATRKYGEPLPTVFSEKVKIQRFLLYRGYLMEDIQDI
WRNFAD
>P33596 ~~~recX~~~Regulatory protein RecX~~~COG2137
MTESTSRRPAYARLLDRAVRILAVRDHSEQELRRKLAAPIMGKNGPEEIDATAEDYERVIAWCHEHGYLDDSRFVARFIA
SRSRKGYGPARIRQELNQKGISREATEKAMRECDIDWCALARDQATRKYGEPLPTVFSEKVKIQRFLLYRGYLMEDIQEI
WRNFAD
>P9WHI1 ~~~recX~~~Regulatory protein RecX~~~COG2137
MTVSCPPPSTSEREEQARALCLRLLTARSRTRAELAGQLAKRGYPEDIGNRVLDRLAAVGLVDDTDFAEQWVQSRRANAA
KSKRALAAELHAKGVDDDVITTVLGGIDAGAERGRAEKLVRARLRREVLIDDGTDEARVSRRLVAMLARRGYGQTLACEV
VIAELAAERERRRV
>P66003 ~~~recX~~~Regulatory protein RecX~~~
MPKITKIEVQKKNKERFNLFLDEQFEMGIDIDTLVKFNLKKGQQLEAADMAEIQKYDHYRIGLNKAIQYLSYKKRTEKEV
IQYLQKEEISEQAISEVIEYCYREKLIDHQDYAESLKNTMIRTTDKGPKIYQQKLYQLGIEPNIIEMFTELYREQQELDD
IIQIAEKISKTKKGPQNKVKEKVMQSLIQKGFEMETIHAVLNEMDFTQDEAVLDDLLQRDLEKIYNKNRKKYTQQKLISK
TIEGLMRKGYKYDKIKAKLEESGIADGTEEIE
>Q8P9X1 ~~~recX~~~Regulatory protein RecX~~~COG2137
MSEQAPAPKRGRRFKEQTPVQRALGLLVRREHSKKELNRKLQARGIEPEAAQAAVERLAGEGWQDDVRFAASVVRNRASS
GYGPLHIRAELGTHGLDSDAVSAAMATFEGDWTENALDLIRRRFGEDGPVDLAQRRKAADLLARRGFDGNSIRLATRFDL
ED
>O54154 6.2.1.53~~~redM~~~L-proline--[L-prolyl-carrier protein] ligase~~~COG1020
MSAATPSVIRLPRDTSAQHAARPAFVGSDPLTYGEFTARVEAVAARLLSLGTRTGDRIAVWMDKQPRYAEAIVAALEAGC
AYVPLDGGQPVSRVRTILADAEPVVLFTDAHHAALLGDDDLPASVTTVVAVGDALPDTVGGIPVAPWESWEQGRAGRVTL
LPSLTPGDLAALLYTSGSTGTPKGVQISHGALANFVAWARDELDVGPDDVFAGHASFNFDLSTFDLFTALSCGAAVWIVP
DAATKDVTALAEGIRRHRITVWYSVPSVLHLLTTSAALTPEHAASLRYVLFAGEVFPVPQLRALRELLPPGTPLYNLYGP
TETNVCTYHRVRPEDLHRATPVPIGLPITGAGTTVVDDAGRTVREPGAIGELHVSGVCVTPGYWRRAEEPVSTAHCRGVH
PTGDLVSYEEDGRLVYRGRKDRMVKLSGYRVELGEIEAAALRHPGIAEAAVLVDGSGPKARLRLYYTLCEGAERIGLVEL
KQHCARHLPTYMVPHGAVRLDRMPLNPNGKTDYRRLGLDAPPRPAAPLGTAR
>O54143 1.3.8.14~~~redW~~~L-prolyl-[peptidyl-carrier protein] dehydrogenase~~~COG1960
MNFDFDAGFDTETRELRDMVVRFARRELDSSGRFDDAEDFRRRWLLAGKQGLTGTTVPGEYGGSGLDAVSAAATMEALGY
GCADTGFAFSVAAHLFAAVMPIVEFGTGEQRAAWLPALCSGERIAAHAITEPEAGSDALHLRTRARPVDDGHVLSGSKCF
ITNAPVADVFVVQAATDPRGGFFGLTTFLVEASTPGLTVGRPYDKVGLRGSPTADVHFDDCYVPAGAVLGAEGSGASIFS
SSMKWERTCLFAAYLGAMRRVLESTVDHVRDREQFGSPIGGFQAVSHRIVDMLGRYEGARLLLYRAARSLSDGTADEVGP
ALAKIAVSEAAVQLGLDAVQLRGGLGIMDGEAETLLRDALPARIFSGTNEIQKNNVARALGLGRRRPAARR
>Q53228 ~~~regA~~~Photosynthetic apparatus regulatory protein RegA~~~COG4567
MAEDLVFELGADRSLLLVDDDEPFLKRLAKAMEKRGFVLETAQSVAEGKAIAQARPPAYAVVDLRLEDGNGLDVVEVLRE
RRPDCRIVVLTGYGAIATAVAAVKIGATDYLSKPADANEVTHALLAKGESLPPPPENPMSADRVRWEHIQRIYEMCDRNV
SETARRLNMHRRTLQRILAKRSPR
>Q3J6C1 2.7.13.3~~~regB~~~Sensor histidine kinase RegB~~~COG2205
MILGPDGILNRDTRGDWVRLRTLILLRWMAVAGQLAAIVVTDWYLGVRLPMGLCFMAVGASVIANVIATFVFPQNRRLTE
FQALMILLFDLTQLSFLLFLTGGLTNPFALLILAPVTISALALELRTTVILGAIAIGLLTFTAYFHLPLILADGSSLSVP
RMFEFGFWLAIVIGILFLGLYSRRVAIEIRSMSDALLATQMALDREQKLTDLGGVVAAAAHELGTPLATIKLVSSELAEE
LSEQPALRDDAELIREQADRCRDILRSMGRAGKDDLQMRQAPLGEVLREAAEPHVGRGKRVEFDLYPSRGGDERQPVILR
RPEVIHGLRNLIQNAVDFARSTVWIDGEWTGDRIAIRIVDDGEGYPPAIIGRIGDPFVRQRRAEESQSRRPGYEGMGLGL
FIAKTLLERSGAELSFANAADPFLRSHERPERCGAIVEVIWPVDRLVVVRNAPLGENVLIQT
>O07130 ~~~regX3~~~Sensory transduction protein RegX3~~~
MTSVLIVEDEESLADPLAFLLRKEGFEATVVTDGPAALAEFDRAGADIVLLDLMLPGMSGTDVCKQLRARSSVPVIMVTA
RDSEIDKVVGLELGADDYVTKPYSARELIARIRAVLRRGGDDDSEMSDGVLESGPVRMDVERHVVSVNGDTITLPLKEFD
LLEYLMRNSGRVLTRGQLIDRVWGADYVGDTKTLDVHVKRLRSKIEADPANPVHLVTVRGLGYKLEG
>Q9F868 ~~~regX3~~~Sensory transduction protein RegX3~~~COG0745
MTSVLIVEDEESLADPLAFLLRKEGFEATVVGDGPSALAEFERSGADIVLLDLMLPGMSGTDVCKQLRARSSVPVIMVTA
RDSEIDKVVGLELGADDYVTKPYSARELIARIRAVLRRGADNDDAGADDGVLEAGPVRMDVERHVVSVNGEPITLPLKEF
DLLEYLMRNSGRVLTRGQLIDRVWGADYVGDTKTLDVHVKRLRSKIEEDPANPVHLVTVRGLGYKLEG
>P9WGL8 ~~~regX3~~~Sensory transduction protein RegX3~~~
MTSVLIVEDEESLADPLAFLLRKEGFEATVVTDGPAALAEFDRAGADIVLLDLMLPGMSGTDVCKQLRARSSVPVIMVTA
RDSEIDKVVGLELGADDYVTKPYSARELIARIRAVLRRGGDDDSEMSDGVLESGPVRMDVERHVVSVNGDTITLPLKEFD
LLEYLMRNSGRVLTRGQLIDRVWGADYVGDTKTLDVHVKRLRSKIEADPANPVHLVTVRGLGYKLEG
>P9WGL9 ~~~regX3~~~Sensory transduction protein RegX3~~~COG0745
MTSVLIVEDEESLADPLAFLLRKEGFEATVVTDGPAALAEFDRAGADIVLLDLMLPGMSGTDVCKQLRARSSVPVIMVTA
RDSEIDKVVGLELGADDYVTKPYSARELIARIRAVLRRGGDDDSEMSDGVLESGPVRMDVERHVVSVNGDTITLPLKEFD
LLEYLMRNSGRVLTRGQLIDRVWGADYVGDTKTLDVHVKRLRSKIEADPANPVHLVTVRGLGYKLEG
>O54408 2.7.6.5~~~relA~~~GTP pyrophosphokinase~~~COG0317
MANEQVLTAEQVIDKARSYLSDEHIAFVEKAYLYAEDAHREQYRKSGEPYIIHPIQVAGILVDLEMDPSTIAGGFLHDVV
EDTDVTLDDLKEAFSEEVAMLVDGVTKLGKIKYKSQEEQQAENHRKMFVAMAQDIRVILIKLADRLHNMRTLKHLPQEKQ
RRISNETLEIFAPLAHRLGISKIKWELEDTALRYLNPQQYYRIVNLMKKKRAERELYVDEVVNEVKKRVEEVNIKADFSG
RPKHIYSIYRKMVLQNKQFNEIYDLLAVRILVNSIKDCYAVLGIIHTCWKPMPGRFKDYIAMPKPNMYQSLHTTVIGPKG
DPLEVQIRTFEMHEIAEYGVAAHWAYKEGKAANEGATFEKKLSWFREILEFQNESTDAEEFMESLKIDLFSDMVYVFTPK
GDVIELPSGSVPIDFSYRIHSEIGNKTIGAKVNGKMVTLDHKLRTGDIVEILTSKHSYGPSQDWVKLAQTSQAKHKIRQF
FKKQRREENVEKGRELVEKEIKNLDFELKDVLTPENIQKVADKFNFSNEEDMYAAVGYNGITALQVANRLTEKERKQRDQ
EEQEKIVQEVTGEPKPYPQGRKREAGVRVKGIDNLLVRLSKCCNPVPGDDIVGFITKGRGVSVHREDCPNVKTNEAQERL
IPVEWEHESQVQKRKEYNVEIEILGYDRRGLLNEVLQAVNETKTNISSVSGKSDRNKVATIHMAIFIQNINHLHKVVERI
KQIRDIYSVRRVMN
>P0AG20 2.7.6.5~~~relA~~~GTP pyrophosphokinase~~~COG0317
MVAVRSAHINKAGEFDPEKWIASLGITSQKSCECLAETWAYCLQQTQGHPDASLLLWRGVEMVEILSTLSMDIDTLRAAL
LFPLADANVVSEDVLRESVGKSVVNLIHGVRDMAAIRQLKATHTDSVSSEQVDNVRRMLLAMVDDFRCVVIKLAERIAHL
REVKDAPEDERVLAAKECTNIYAPLANRLGIGQLKWELEDYCFRYLHPTEYKRIAKLLHERRLDREHYIEEFVGHLRAEM
KAEGVKAEVYGRPKHIYSIWRKMQKKNLAFDELFDVRAVRIVAERLQDCYAALGIVHTHYRHLPDEFDDYVANPKPNGYQ
SIHTVVLGPGGKTVEIQIRTKQMHEDAELGVAAHWKYKEGAAAGGARSGHEDRIAWLRKLIAWQEEMADSGEMLDEVRSQ
VFDDRVYVFTPKGDVVDLPAGSTPLDFAYHIHSDVGHRCIGAKIGGRIVPFTYQLQMGDQIEIITQKQPNPSRDWLNPNL
GYVTTSRGRSKIHAWFRKQDRDKNILAGRQILDDELEHLGISLKEAEKHLLPRYNFNDVDELLAAIGGGDIRLNQMVNFL
QSQFNKPSAEEQDAAALKQLQQKSYTPQNRSKDNGRVVVEGVGNLMHHIARCCQPIPGDEIVGFITQGRGISVHRADCEQ
LAELRSHAPERIVDAVWGESYSAGYSLVVRVVANDRSGLLRDITTILANEKVNVLGVASRSDTKQQLATIDMTIEIYNLQ
VLGRVLGKLNQVPDVIDARRLHGS
>P9WHG9 ~~~relA~~~Bifunctional (p)ppGpp synthase/hydrolase RelA~~~COG0317
MTAQRSTTNPVLEPLVAVHREIYPKADLSILQRAYEVADQRHASQLRQSGDPYITHPLAVANILAELGMDTTTLVAALLH
DTVEDTGYTLEALTEEFGEEVGHLVDGVTKLDRVVLGSAAEGETIRKMITAMARDPRVLVIKVADRLHNMRTMRFLPPEK
QARKARETLEVIAPLAHRLGMASVKWELEDLSFAILHPKKYEEIVRLVAGRAPSRDTYLAKVRAEIVNTLTASKIKATVE
GRPKHYWSIYQKMIVKGRDFDDIHDLVGVRILCDEIRDCYAAVGVVHSLWQPMAGRFKDYIAQPRYGVYQSLHTTVVGPE
GKPLEVQIRTRDMHRTAEYGIAAHWRYKEAKGRNGVLHPHAAAEIDDMAWMRQLLDWQREAADPGEFLESLRYDLAVQEI
FVFTPKGDVITLPTGSTPVDFAYAVHTEVGHRCIGARVNGRLVALERKLENGEVVEVFTSKAPNAGPSRDWQQFVVSPRA
KTKIRQWFAKERREEALETGKDAMAREVRRGGLPLQRLVNGESMAAVARELHYADVSALYTAIGEGHVSAKHVVQRLLAE
LGGIDQAEEELAERSTPATMPRRPRSTDDVGVSVPGAPGVLTKLAKCCTPVPGDVIMGFVTRGGGVSVHRTDCTNAASLQ
QQAERIIEVLWAPSPSSVFLVAIQVEALDRHRLLSDVTRALADEKVNILSASVTTSGDRVAISRFTFEMGDPKHLGHLLN
AVRNVEGVYDVYRVTSAA
>Q931Q4 2.7.6.5~~~relA~~~GTP pyrophosphokinase~~~
MNGVYHIMNNEYPYSADEVLHKAKSYLSADEYEYVLKSYHIAYEAHKGQFRKNGLPYIMHPIQVAGILTEMRLDGPTIVA
GFLHDVIEDTPYTFEDVKEMFNEEVARIVDGVTKLKKVKYRSKEEQQAENHRKLFIAIAKDVRVILVKLADRLHNMRTLK
AMPREKQIRISRETLEIYAPLAHHLGINTIKWELEDTALRYIDNVQYFRIVNLMKKKRSEREAYIETAIDRIRTEMDRMN
IEGDINGRPKHIYSIYRKMMKQKKQFDQIFDLLAIRVIVNSINDCYAILGLVHTLWKPMPGRFKDYIAMPKQNLYQSLHT
TVVGPNGDPLEIQIRTFDMHEIAEHGVAAHWAYKEGKKVSEKDQTYQNKLNWLKELAEADHTSSDAQEFMETLKYDLQSD
KVYAFTPASDVIELPYGAVPIDFAYAIHSEVGNKMIGAKVNGKIVPIDYILQTGDIVEIRTSKHSYGPSRDWLKIVKSSS
AKGKIKSFFKKQDRSSNIEKGRMMVEVEIKEQGFRVEDILTEKNIQVVNEKYNFANEDDLFAAVGFGGVTSLQIVNKLTE
RQRILDKQRALNEEQEVTKSLPIKDNIITDSGVYVEGLENVLIKLSKCCNPIPGDDIVGYITKGHGIKVHRTDCPNIKNE
TERLINVEWVKSKDATQKYQVDLEVTAYDRNGLLNEVLQAVSSTAGNLIKVSGRSDIDKNAIINISVMVKNVNDVYRVVE
KIKQLGDVYTVTRVWN
>Q99TL8 2.7.6.5~~~relA~~~GTP pyrophosphokinase~~~
MNGVYHIMNNEYPYSADEVLHKAKSYLSADEYEYVLKSYHIAYEAHKGQFRKNGLPYIMHPIQVAGILTEMRLDGPTIVA
GFLHDVIEDTPYTFEDVKEMFNEEVARIVDGVTKLKKVKYRSKEEQQAENHRKLFIAIAKDVRVILVKLADRLHNMRTLK
AMPREKQIRISRETLEIYAPLAHRLGINTIKWELEDTALRYIDNVQYFRIVNLMKKKRSEREAYIETAIDRIRTEMDRMN
IEGDINGRPKHIYSIYRKMMKQKKQFDQIFDLLAIRVIVNSINDCYAILGLVHTLWKPMPGRFKDYIAMPKQNLYQSLHT
TVVGPNGDPLEIQIRTFDMHEIAEHGVAAHWAYKEGKKVSEKDQTYQNKLNWLKELAEADHTSSDAQEFMETLKYDLQSD
KVYAFTPASDVIELPYGAVPIDFAYAIHSEVGNKMIGAKVNGKIVPIDYILQTGDIVEIRTSKHSYGPSRDWLKIVKSSS
AKGKIKSFFKKQDRSSNIEKGRMMVEVEIKEQGFRVEDILTEKNIQVVNEKYNFANEDDLFAAVGFGGVTSLQIVNKLTE
RQRILDKQRALNEAQEVTKSLPIKDNIITDSGVYVEGLENVLIKLSKCCNPIPGDDIVGYITKGHGIKVHRTDCPNIKNE
TERLINVEWVKSKDATQKYQVDLEVTAYDRNGLLNEVLQAVSSTAGNLIKVSGRSDIDKNAIINISVMVKNVNDVYRVVE
KIKQLGDVYTVTRVWN
>Q54089 ~~~relA~~~Bifunctional (p)ppGpp synthase/hydrolase RelA~~~
MAKEINLTGEEVVALAAKYMNETDAAFVKKALDYATAAHFYQVRKSGEPYIVHPIQVAGILADLHLDAVTVACGFLHDVV
EDTDITLDNIEFDFGKDVRDIVDGVTKLGKVEYKSHEEQLAENHRKMLMAMSKDIRVILVKLADRLHNMRTLKHLRKDKQ
ERISRETMEIYAPLAHRLGISRIKWELEDLAFRYLNETEFYKISHMMNEKRREREALVDDIVTKIKSYTTEQGLFGDVYG
RPKHIYSIYRKMRDKKKRFDQIFDLIAIRCVMETQSDVYAMVGYIHELWRPMPGRFKDYIAAPKANGYQSIHTTVYGPKG
PIEIQIRTKEMHQVAEYGVAAHWAYKKGVRGKVNQAEQKVGMNWIKELVELQDASNGDAVDFVDSVKEDIFSERIYVFTP
TGAVQELPKDSGPIDFAYAIHTQVGEKAIGAKVNGRMVPLTAKLKTGDVVEIVTNPNSFGPSRDWIKLVKTNKARNKIRQ
FFKNQDKELSVNKGRDMLVSYFQEQGYVANKYLDKKRIEAILPKVSVKSEESLYAAVGFGDISPVSVFNKLTEKERREEE
RAKAKAEAEELVNGGEIKHENKDVLKVRSENGVIIQGASGLLMRIAKCCNPVPGDPIEGYITKGRGIAIHRADCNNIKSQ
DGYQERLIEVEWDLDNSSKDYQAEIDIYGLNRRGLLNDVLQILSNSTKSISTVNAQPTKDMKFANIHVSFGIPNLTHLTT
VVEKIKAVPDVYSVKRTNG
>Q9AA08 ~~~relB1~~~Antitoxin RelB1~~~
MADGFDIHIDQEQAARLKVVADRLGMSVSEYAVALIDAGLTGAAPKAIDPDPAIDEAIADAIERGDEPAISRDEFRAHIR
RVTAGLG
>Q9A5D6 ~~~relB2~~~Antitoxin RelB2~~~COG3905
MAICYARFMVPEPSIFEIDAEAEEAADAEGMADIAAGRVVPHEEVSAWLDTWGTPEEKPAPETWRK
>Q9A4F5 ~~~relB3~~~Antitoxin RelB3~~~
MSGVIAPDRVDDKRRMEHSQNMALTITIPAELASRLRASAEAEGKDVDAYAIDALHVMSDEDWGYTDDDAYWRELRAHSD
EVRRDGGIPLEDVKRWVASWDTENELPPPEPRIKARG
>P0CW75 ~~~relB4~~~Antitoxin RelB4~~~
MAEPDPDIFDEDDEAILAADAEADADFEAGRTVPHERVGEWLKTLGTPHQTPPPYSWRK
>P0C079 ~~~relB~~~Antitoxin RelB~~~COG3077
MGSINLRIDDELKARSYAALEKMGVTPSEALRLMLEYIADNERLPFKQTLLSDEDAELVEIVKERLRNPKPVRVTLDEL
>O50462 ~~~relB~~~Antitoxin RelB~~~COG2161
MAVVPLGEVRNRLSEYVAEVELTHERITITRHGHPAAVLISADDLASIEETLEVLRTPGASEAIREGLADVAAGRFVSND
EIRNRYTAR
>Q9AA09 ~~~relE1~~~Toxin RelE1~~~COG3668
MTFTVLVSVRAKRDFNRLIVWLVERDPRAAARLGPLLEAALDSLTEAPSRGRSVGPTTREISIPFGQSAYVIRYRLLGSS
VHVTRIWHGLEQR
>Q9A5D7 ~~~relE2~~~Toxin RelE2~~~COG3668
MAQVVWTWRALADLTAIRDYIGQFSPLAAQRMALRLKTAADSLAEYPERGRLATATLRELVVVPPYVIRYYVADGLVHIV
RIRHAARL
>Q9A4F4 ~~~relE3~~~Toxin RelE3~~~COG3668
MKSVELGPRARRDLTKLRRWLLNRAPSAADRAIDLILSRAEQLAQHSDLGRRKSQNMRELYVSFGAHGYVLQYRVYPDAV
VIARIRHSLERR
>Q9A3S1 ~~~relE4~~~Toxin RelE4~~~COG3668
MAQDVGHAAPDTATIFVAQVVWTQRAMADVYAIVGHISEQSRPLAAQRLAKRLFDTGASLATYPERGRVSTQGRREIVAI
SPYVLRYRIVGDRVVIGSVRHGARRPI
>P0C077 3.1.-.-~~~relE~~~mRNA interferase toxin RelE~~~COG2026
MAYFLDFDERALKEWRKLGSTVREQLKKKLVEVLESPRIEANKLRGMPDCYKIKLRSSGYRLVYQVIDEKVVVFVISVGK
RERSEVYSEAVKRIL
>O50461 3.1.-.-~~~relE~~~Toxin RelE~~~COG2026
MSDDHPYHVAITATAARDLQRLPEKIAAACVEFVFGPLLNNPHRLGKPLRNDLEGLHSARRGDYRVVYAIDDGHHRVEII
HIARRSASYRMNPCRPR
>O33347 ~~~relF~~~Antitoxin RelF~~~COG2161
MRILPISTIKGKLNEFVDAVSSTQDQITITKNGAPAAVLVGADEWESLQETLYWLAQPGIRESIAEADADIASGRTYGED
EIRAEFGVPRRPH
>O33348 3.1.-.-~~~relG~~~Toxin RelG~~~COG2026
MPYTVRFTTTARRDLHKLPPRILAAVVEFAFGDLSREPLRVGKPLRRELAGTFSARRGTYRLLYRIDDEHTTVVILRVDH
RADIYRR
>P9WF25 ~~~relJ~~~Antitoxin RelJ~~~COG2161
MSISASEARQRLFPLIEQVNTDHQPVRITSRAGDAVLMSADDYDAWQETVYLLRSPENARRLMEAVARDKAGHSAFTKSV
DELREMAGGEE
>P9WF09 3.1.-.-~~~relK~~~Toxin RelK~~~COG4115
MRSVNFDPDAWEDFLFWLAADRKTARRITRLIGEIQRDPFSGIGKPEPLQGELSGYWSRRIDDEHRLVYRAGDDEVTMLK
ARYHY
>Q7WY72 ~~~remA~~~Extracellular matrix regulatory protein A~~~COG2052
MTIKLINIGFGNIISANRMISIVSPESAPIKRMIQDARDRGMLIDATYGRRTRAVVVMDSDHIILSAVQPETVAHRLSVK
EEIMDEGQG
>A1A048 3.2.1.156~~~xylA~~~Reducing end xylose-releasing exo-oligoxylanase~~~
MTNATDTNKTLGESMFAQCGYAQDAIDKRVSQVWHEIFEGPNKFYWENDEGLAYVMDTGNNDVRTEGMSYAMMIALQYDR
KDVFDKLWGWVMRHMYMKDGHHAHYFAWSVAPDGTPNSNGPAPDGEEYFAMDLFLASRRWGDGEDIYEYSAWGREILRYC
VHKGERYDGEPMWNPDNKLIKFIPETEWSDPSYHLPHFYEVFAEEADEEDRPFWHEAAAASRRYLQAACDERTGMNAEYA
DYDGKPHVDESNHWHFYSDAYRTAANIGLDAAWNGPQEVLCDRVAALQRFFLTHDRTSVYAIDGTAVDEVVLHPVGFLAA
TAQGALAAVHSAQPDAEHNAREWVRMLWNTPMRTGTRRYYDNFLYAFAMLALSGKYRYE
>Q9KB30 3.2.1.156~~~~~~Reducing end xylose-releasing exo-oligoxylanase~~~COG3405
MKKTTEGAFYTREYRNLFKEFGYSEAEIQERVKDTWEQLFGDNPETKIYYEVGDDLGYLLDTGNLDVRTEGMSYGMMMAV
QMDRKDIFDRIWNWTMKNMYMTEGVHAGYFAWSCQPDGTKNSWGPAPDGEEYFALALFFASHRWGDGDEQPFNYSEQARK
LLHTCVHNGEGGPGHPMWNRDNKLIKFIPEVEFSDPSYHLPHFYELFSLWANEEDRVFWKEAAEASREYLKIACHPETGL
APEYAYYDGTPNDEKGYGHFFSDSYRVAANIGLDAEWFGGSEWSAEEINKIQAFFADKEPEDYRRYKIDGEPFEEKSLHP
VGLIATNAMGSLASVDGPYAKANVDLFWNTPVRTGNRRYYDNCLYLFAMLALSGNFKIWFPEGQEEEH
>P24042 ~~~repA~~~Replication protein RepA~~~
MNQSFISDILYADIESKAKELTVNSNNTVQPVALMRLGVFVPKPSKSKGESKEIDATKAFSQLEIAKAEGYDDIKITGPR
LDMDTDFKTWIGVIYAFSKYGLSSNTIQLSFQEFAKACGFPSKRLDAKLRLTIHESLGRLRNKGIAFKRGKDAKGGYQTG
LLKVGRFDADLDLIELEADSKLWELFQLDYRVLLQHHALRALPKKEAAQAIYTFIESLPQNPLPLSFARIRERLALQSAV
GEQNRIIKKAIEQLKTIGYLDCSIEKKGRESFVIVHSRNPKLKLPE
>Q52221 ~~~repA~~~Replication initiation protein~~~
MAGLKNTSYNAVHWSQLAPEEQIRFWEDYEAGRATTFLVEPERKRTKRRRGEHSTKPKCENPSWYRPERYKALKGQLGHA
YNRLVKKDPVTGEQSLRMRMSRHPFYVQKRTFVGRKYAFRPEKQRLLDAIWPVLVSFSDAGTHTVGMSVTRLAEEISPKD
SEGHVIPELEVTVSRLSRLLAEQVRFGVLGVSEETMWDREHRQRLPRYVWITPAGWQMLGVDMVKLHEQQQKRLRESEIR
QQLIREGVLREDEDISVHAARKRWYLQRSQDALKKRREKAAASKRANRLKKLPVDQQIYEMAEYLRKRLPPDEAYFCSDD
HLKRLAIRELRQLELTLAAPPPH
>Q57154 ~~~repB~~~RepFIB replication protein A~~~
MDKSSGELVTLTPNNNNTVQPVALMRLGVFVPTLKSLKNSKKNTLSRTDATEELTRLSLARAEGFDKVEITGPRLDMDND
FKTWVGIIHSFARHNVIGDKVELPFVEFAKLCGIPSSQSSRRLRERISPSLKRIAGTVISFSRTDEKHTREYITHLVQSA
YYDTERDIVQLQADPRLFELYQFDRKVLLQLKAINALKRRESAQALYTFIESLPRDPAPISLARLRARLNLKSPVFSQNQ
TVRRAMEQLREIGYLDYTEIQRGRTKFFCIHYRRPRLKAPNDESKENPLPPSPAEKVSPEMAEKLALLEKLGITLDDLEK
LFKSR
>P13921 ~~~repB~~~Replication protein RepB~~~
MAKEKARYFTFLLYPESIPSDWELKLETLGVPMAISPLHDKDKSSIKGQKYKKAHYHVLYIAKNPVTADSVRKKIKLLLG
EKSLAMVQVVLNVENMYLYLTHESKDAIAKKKHVYDKADIKLINNFDIDRYVTLDVEEKTELFNVVVSLIRAYTLQNIFD
LYDFIDENGETYGLTINLVNEVIAGKTGFMKLLFDGAYQRSKRGTKNEER
>P03065 ~~~repD~~~Replication initiation protein~~~
MSTENHSNYLQNKDLDNFSKTGYSNSRLSGNFFTTPQPELSFDAMTIVGNLNKTNAKKLSDFMSTEPQIRLWDILQTKFK
AKALQEKVYIEYDKVKADSWDRRNMRVEFNPNKLTHEEMLWLKQNIIDYMEDDGFTRLDLAFDFEDDLSDYYAMTDKAVK
KTIFYGRNGKPETKYFGVRDSDRFIRIYNKKQERKDNADVEVMSEHLWRVEIELKRDMVDYWNDCFDDLHILKPDWTTPE
KVKEQAMVYLLLNEEGTWGKLERHAKYKYKQLIKEISPIDLTELMKSTLKENEKQLQKQIDFWQREFRFWK
>P03856 ~~~repE~~~Replication initiation protein~~~
MAETAVINHKKRKNSPRIVQSNDLTEAAYSLSRDQKRMLYLFVDQIRKSDGTLQEHDGICEIHVAKYAEIFGLTSAEASK
DIRQALKSFAGKEVVFYRPEEDAGDEKGYESFPWFIKRAHSPSRGLYSVHINPYLIPFFIGLQNRFTQFRLSETKEITNP
YAMRLYESLCQYRKPDGSGIVSLKIDWIIERYQLPQSYQRMPDFRRRFLQVCVNEINSRTPMRLSYIEKKKGRQTTHIVF
SFRDITSMTTG
>P12053 ~~~repE~~~Replication initiation protein~~~
MSKKAEEIQAKQSLEKENSNFSKTGYSNSRLNRHIMYTPEPKLHFDAMTIVGNLNKNNAHKLSEFMSIAPQIRLWDILQT
KFKAKALQEKVYIEYDKVKADAWDRRNMRVEFNPNKLTHEEMLWLKQNIIDYMEDDGFTRLDLAFDFEDDLSDYYAMTDK
SVKKTIFYGRNGKPETKYFGVRDSDRFIRIYNKKQERKDNADIEVMSEHLWRVEIELKRDMVDYWNDCFNDLHILKPDWS
SLEKVKDQAMIYMLIHEESTWGKLERRTKNKYREMLKSISEIDLTDLMKLTLKENEKQLQKQIEFWQREFRFWE
>P20356 ~~~repA~~~Regulatory protein RepA~~~
MATHKPINILEAFAAAPPPLDYVLPNMVAGTVGALVSPGGAGKSMLALQLAAQIAGGPDLLEVGELPTGPVIYLPAEDPP
TAIHHRLHALGAHLSAEERQAVADGLLIQPLIGSLPNIMAPEWFDGLKRAAEGRRLMVLDTLRRFHIEEENASGPMAQVI
GRMEAIAADTGCSIVFLHHASKGAAMMGAGDQQQASRGSSVLVDNIRWQSYLSSMTSAEAEEWGVDDDQRRFFVRFGVSK
ANYGAPFADRWFRRHDGGVLKPAVLERQRKSKGVPRGEA
>P19529 ~~~repN~~~Replication initiation protein~~~
MSKNNHANHSNHLENHDLDNFSKTGYSNSRLNRHTMYTPEPKLSFDAMTIVGNLNKNNAHKLSEFMSVEPQIRLWDILQT
KFKAKALQEKVYIEYDKVKADTWDRRNMRVEFNPNKLTHEEMLWLKQNIIDYMEDDGFTRLDLAFDFEYDLSDYYAMTDK
SVKKTIFYGRNGKPETKYFGVRDSDRFIRIYNKKQERKDNADIKIMSEHLWRVEIELKRDMVDYWNDCFNDLHILQPDWK
TIERTSDRAMVFMLLNDEEEWGKLERRTKNKYKKLIKEISLIDLTDLMKSTLKANEKQLQKQIDFWQREFRFWK
>P09980 5.6.2.4~~~rep~~~ATP-dependent DNA helicase Rep~~~COG0210
MRLNPGQQQAVEFVTGPCLVLAGAGSGKTRVITNKIAHLIRGCGYQARHIAAVTFTNKAAREMKERVGQTLGRKEARGLM
ISTFHTLGLDIIKREYAALGMKANFSLFDDTDQLALLKELTEGLIEDDKVLLQQLISTISNWKNDLKTPSQAAASAIGER
DRIFAHCYGLYDAHLKACNVLDFDDLILLPTLLLQRNEEVRKRWQNKIRYLLVDEYQDTNTSQYELVKLLVGSRARFTVV
GDDDQSIYSWRGARPQNLVLLSQDFPALKVIKLEQNYRSSGRILKAANILIANNPHVFEKRLFSELGYGAELKVLSANNE
EHEAERVTGELIAHHFVNKTQYKDYAILYRGNHQSRVFEKFLMQNRIPYKISGGTSFFSRPEIKDLLAYLRVLTNPDDDS
AFLRIVNTPKREIGPATLKKLGEWAMTRNKSMFTASFDMGLSQTLSGRGYEALTRFTHWLAEIQRLAEREPIAAVRDLIH
GMDYESWLYETSPSPKAAEMRMKNVNQLFSWMTEMLEGSELDEPMTLTQVVTRFTLRDMMERGESEEELDQVQLMTLHAS
KGLEFPYVYMVGMEEGFLPHQSSIDEDNIDEERRLAYVGITRAQKELTFTLCKERRQYGELVRPEPSRFLLELPQDDLIW
EQERKVVSAEERMQKGQSHLANLKAMMAAKRGK
>Q8NR94 1.14.13.219~~~~~~NADPH-dependent resorcinol 4-hydroxylase~~~COG0654
MSPNNFDTDVCIVGGGPTGTLLAVLLGQKGHRVTILEKWPTFYERPRAVTFDHEIARILGYIGIDSENDEAIDYHSDSYD
WKNAAGETLLEVDWTSMTDSGWRTRYWFYQPELEKRLRDLALTMDFVDIRCGFTAVGLSQDENSAIIHGIVTDTPENIPA
DAQREDIRAKYVIGADGANSFVRNSLGLEMNDLGYFFDWLILDLKPTQDIDYGTDHWQLCDPKRPTTIVPGGPGRRRWEF
MALPGEDLKELASEESAWNLLEPWDVTPGKAILERSAVYRFQARWAQEWRSGRALIAGDAAHLMPPFAGEGMCAGLRDSL
ALAWRLDLVLSGKSDDALLDTYGEERREHVHYYIDFSMDLGNVICITDEDEARLRDERMIKELEAQDGVPVNTDVAHLGP
GIWDKDSSHGGELAKQGIVEYQGRKARFDDAVGRGWAVLGLNTDPREVLDEDSLVALDAIGAIVESVGDATSAVLDVEGL
YTRWLKEAGATFIITRPDFYVYSTAVDAEQLQTQIKQLSDLLHLNSVVGA
>Q81SZ9 ~~~resA~~~Thiol-disulfide oxidoreductase ResA~~~COG0526
MKKNRLLFRVIILLILSGAVGFTLYQGFFADKEKMQIGKEAPNFVVTDLEGKKIELKDLKGKGVFLNFWGTWCKPCEKEM
PYMNELYPKYKEKGVEIIALDADETDIAVKNFVNQYGLKFPVAIDKGQKIIGTYGVGPLPTSFLIDKDGKVVEQIIGEQT
KEQLEGYLKKITP
>P35160 ~~~resA~~~Thiol-disulfide oxidoreductase ResA~~~COG0526
MKKKRRLFIRTGILLVLICALGYTIYNAVFAGKESISEGSDAPNFVLEDTNGKRIELSDLKGKGVFLNFWGTWCEPCKKE
FPYMANQYKHFKSQGVEIVAVNVGESKIAVHNFMKSYGVNFPVVLDTDRQVLDAYDVSPLPTTFLINPEGKVVKVVTGTM
TESMIHDYMNLIKPGETSG
>P35162 ~~~resC~~~Cytochrome c biogenesis protein ResC~~~COG0755
MAELSGNFLYAAFLVYLIAVPIFGGAIRGNKDKKGRPNRWANIGITLSIVGFFCHLGYFITRWAASGHAPVSNMFEFTTA
FGMMLVLAFIILYFLYRLPSLGLFTLSIALLLIAYASMFPTDISPLIPSLQSNWLYIHVTTAALGQAILAISFVAGVIFL
LKHVDQTKPSKKTFWLEAIMFILVTTVAFIAITSAFRLAGYEAEFNWVDKSKEKSVMVYEMPALVGPHQGELLTEGRMEP
LVDLPALFSGRKVNTVIWSFGAGLVLYGALRLIIRKRISALLHPLVKNVNLDLVDEVGYRAVSIGFPIFTLGALIFAMIW
AQLAWTRFWGWDPKEVWALITFLFYAAYLHLRLSRGWHGEKSAWLAVIGFAIIMFNLIFVNLVLAGLHSYA
>P35163 ~~~resD~~~Transcriptional regulatory protein ResD~~~COG0745
MDQTNETKILVVDDEARIRRLLRMYLERENYAIDEAENGDEAIAKGLEANYDLILLDLMMPGTDGIEVCRQIREKKATPI
IMLTAKGEEANRVQGFEAGTDDYIVKPFSPREVVLRVKALLRRASQTSYFNANTPTKNVLVFSHLSIDHDAHRVTADGTE
VSLTPKEYELLYFLAKTPDKVYDREKLLKEVWQYEFFGDLRTVDTHVKRLREKLNKVSPEAAKKIVTVWGVGYKFEVGAE
>P35164 2.7.13.3~~~resE~~~Sensor histidine kinase ResE~~~COG5002
MKFWKSVVGKLWFTILSLVLIVLFILTVLLLEFIENYHVEEAENDLTQLANKVAVILENHEDQALARSITWELADNLTSI
AIIQDEKNHWYSPNDKNRLSSITVEQIQHDKDLNKALKDHKKVSKRTGLSDTDTDNERLIVGVPYEKDGKKGMVFLSQSL
LAVKDTTKHTTRYIFLAAGIAIVLTTFFAFFLSSRVTYPLRKMREGAQDLAKGKFDTKIPILTQDEIGELATAFNQMGRQ
LNFHINALNQEKEQLSNILSSMADGVITINIDGTILVTNPPAERFLQAWYYEQNMNIKEGDNLPPEAKELFQNAVSTEKE
QMIEMTLQGRSWVLLMSPLYAESHVRGAVAVLRDMTEERRLDKLREDFIANVSHELRTPISMLQGYSEAIVDDIASSEED
RKEIAQIIYDESLRMGRLVNDLLDLARMESGHTGLHYEKINVNEFLEKIIRKFSGVAKEKNIALDHDISLTEEEFMFDED
KMEQVFTNLIDNALRHTSAGGSVSISVHSVKDGLKIDIKDSGSGIPEEDLPFIFERFYKADKARTRGRAGTGLGLAIVKN
IVEAHNGSITVHSRIDKGTTFSFYIPTKR
>O50979 3.1.22.-~~~resT~~~Telomere resolvase ResT~~~
MPPKVKIKNDFEIFRKELEILYKKYLNNELSYLKLKEKLKILAENHKAILFRKDKFTNRSIILNLSKTRKIIKEYINLSV
IERIRRDNTFLFFWKSRRIKELKNIGIKDRKKIEELIFSNQMNDEKSYFQYFIDLFVTPKWLNDYAHKYKIEKINSYRKE
QIFVKINLNTYIEIIKLLLNQSRDIRLKFYGVLMAIGRRPVEVMKLSQFYIADKNHIRMEFIAKKRENNIVNEVVFPVFA
DPELIINSIKEIRYMEQTENLTKEIISSNLAYSYNRLFRQIFNNIFAPEESVYFCRAIYCKFSYLAFAPKNMEMNYWITK
VLGHEPNDITTAFHYNRYVLDNLDDKADNSLLTLLNQRIYTYVRRKATYSTLTMDRLESLIKEHHIFDDNYIKTLIVIKN
LMLKDNLETLAMVRGLNVKIRKAFKATYGYNYNYIKLTEYLSIIFNYKL
>Q7N4H9 2.4.2.-~~~res~~~Toxin Res~~~COG5654
MILYRLTRSKYVESAWSGTGAKLYGGRWHNIGRPAVYVATSVSLAVLEVLVHVGDDELLTDFALLSIDIPENQIDILDID
TLPSDWNAPVPSTCTMEIGSEWFEVSHSIGLVVPSAIVPYENNVILNPMAKDFHKYINTVKRLDFGIDSRLVKAKK
>Q88K57 2.4.2.-~~~res~~~Toxin Res~~~COG5654
MILWRISAYADLSGTGGLRVSGRWHQAGRPVVYAATSPPGAMLEVLVHLEIDPEDFPTTMRLLRIELPDTVSQAQLPALQ
PGWSAQPELTRTLGNRFLDDCSALLLPVPSAIMPSTTNYLFNPRHPQAQSAKIQVEDFTPDSRLF
>A1JNH0 2.4.2.-~~~res~~~Toxin Res~~~COG5654
MMLYRIVMRRYLASTWTGYGAETYGGRWNHKGHAAIYLASSVSLAMLETLVHIQDSSTLSEFELFQIEIEDSNIMLLQPQ
DWPTNWRSDPAPATTMDIGTEWLESESSLGLLVPSTLVPTENNLLLNPRHKSFQTCLSSVQPLSFAFDPRLK
>Q9WY16 ~~~rex1~~~Redox-sensing transcriptional repressor Rex 1~~~COG2344
MAEKIPKPVSKRLVSYYMCLERLLDEGVEVVSSEELARRLDLKASQIRKDLSYFGEFGKRGVGYNVEHLYDAIGEILGVK
KEWKLVVVGAGNIGRAVANYTVMKEKGFRIIGIFDSDPSKIGKEAAPGLTVSDVSELEKFVEEHGVEIGVIAVPAEHAQE
IAERLEKAGIKGILNFAPVKIKVSVPVENIDITASLRVLTFEIVRRNS
>A0A0S2UQQ5 3.2.1.156~~~rex8A~~~Reducing-end xylose-releasing exo-oligoxylanase Rex8A~~~
MNITGKGAYDTGTYANLFQRSGYREDEIKARLEQTWNDLFYGDEHTRIYYPVGDDKGYMLDTGNDDVRSEGMSYGMMMAV
QMDKKHEFDRLWNYAYTYMQHTEGRYKDYFAWHCKPDGTRLSPGPAPDGEEFFAMALFFASNRWGDGPAPYDYQAQARKI
LHACLHQGEQGEGDPMWEPSNRLIKFIPELPFSDPSYHLPHFYELFAQYANEQDRTFWKEAAEASRAYLRTACHPVTGLS
PEYANYDGTPAPVQLHGDFRHFYSDAYRVAANVALDWEWFRKDPWQVQQSNRIQAFFSDIDVSDYRRYTIEGEPFNEPAL
HPVGLLATNAMASLAADGPDADSFVKRFWNTPLRQGKRRYYDNCLYFFTMLALSGNYRVY
>O05521 ~~~rex~~~Redox-sensing transcriptional repressor Rex~~~COG2344
MNKDQSKIPQATAKRLPLYYRFLKNLHASGKQRVSSAELSDAVKVDSATIRRDFSYFGALGKKGYGYNVDYLLSFFRKTL
DQDEMTDVILIGVGNLGTAFLHYNFTKNNNTKISMAFDINESKIGTEVGGVPVYNLDDLEQHVKDESVAILTVPAVAAQS
ITDRLVALGIKGILNFTPARLNVPEHIRIHHIDLAVELQSLVYFLKHYSVLEEIE
>Q8E565 ~~~rex~~~Redox-sensing transcriptional repressor Rex~~~COG2344
MIMDKSIPKATAKRLSLYYRIFKRFNTDGIEKASSKQIADALGIDSATVRRDFSYFGELGRRGFGYDVKKLMNFFAEILN
DHSTTNVMLVGCGNIGRALLHYRFHDRNKMQISMAFDLDSNDLVGKTTEDGIPVYGISTINDHLIDSDIETAILTVPSTE
AQEVADILVKAGIKGILSFSPVHLTLPKDIIVQYVDLTSELQTLLYFMNQQR
>Q9WX14 ~~~rex~~~Redox-sensing transcriptional repressor Rex~~~COG2344
MATGRAHRPATRSRGIPEATVARLPLYLRALTALSERSVPTVSSEELAAAAGVNSAKLRKDFSYLGSYGTRGVGYDVEYL
VYQISRELGLTQDWPVVIVGIGNLGAALANYGGFASRGFRVAALIDADPGMAGKPVAGIPVQHTDELEKIIQDDGVSIGV
IATPAGAAQQVCDRLVAAGVTSILNFAPTVLNVPEGVDVRKVDLSIELQILAFHEQRKAGEEAAADGAAPPVAARKQQRS
TGSADQGPDGDVPAVMPA
>Q97QV8 ~~~rex~~~Redox-sensing transcriptional repressor Rex~~~COG2344
MKDKQFAIPKATAKRLSLYYRIFKRFHAEKIERANSKQIAEAIGIDSATVRRDFSYFGELGRRGFGYDVKKLMTFFADLL
NDNSITNVMLVGIGNMGHALLHYRFHERNKMKIIMAFDLDDHPEVGTQTPDGIPIYGISQIKDKIKDADVKTAILTVPSV
KSQEVANLLVDAGVKGILSFSPVHLHLPKDVVVQYVDLTSELQTLLYFMRKED
>Q9X2V5 ~~~rex~~~Redox-sensing transcriptional repressor Rex~~~
MKVPEAAISRLITYLRILEELEAQGVHRTSSEQLGELAQVTAFQVRKDLSYFGSYGTRGVGYTVPVLKRELRHILGLNRK
WGLCIVGMGRLGSALADYPGFGESFELRGFFDVDPEKVGRPVRGGVIEHVDLLPQRVPGRIEIALLTVPREAAQKAADLL
VAAGIKGILNFAPVVLEVPKEVAVENVDFLAGLTRLSFAILNPKWREEMMG
>Q72I39 ~~~rex~~~Redox-sensing transcriptional repressor Rex~~~COG2344
MKVPEAAISRLITYLRILEELEAQGVHRTSSEQLGELAQVTAFQVRKDLSYFGSYGTRGVGYTVPVLKRELRHILGLNRK
WGLCIVGMGRLGSALADYPGFGESFELRGFFDVDPEKVGRPVRGGVIEHVDLLPQRVPGRIEIALLTVPREAAQKAADLL
VAAGIKGILNFAPVVLEVPKEVAVENVDFLAGLTRLSFAILNPKWREEMMG
>Q5SHS3 ~~~rex~~~Redox-sensing transcriptional repressor Rex~~~COG2344
MKVPEAAISRLITYLRILEELEAQGVHRTSSEQLGELAQVTAFQVRKDLSYFGSYGTRGVGYTVPVLKRELRHILGLNRK
WGLCIVGMGRLGSALADYPGFGESFELRGFFDVDPEKVGRPVRGGVIEHVDLLPQRVPGRIEIALLTVPREAAQKAADLL
VAAGIKGILNFAPVVLEVPKEVAVENVDFLAGLTRLSFAILNPKWREEMMG
>P0A7I0 ~~~prfA~~~Peptide chain release factor RF1~~~COG0216
MKPSIVAKLEALHERHEEVQALLGDAQTIADQERFRALSREYAQLSDVSRCFTDWQQVQEDIETAQMMLDDPEMREMAQD
ELREAKEKSEQLEQQLQVLLLPKDPDDERNAFLEVRAGTGGDEAALFAGDLFRMYSRYAEARRWRVEIMSASEGEHGGYK
EIIAKISGDGVYGRLKFESGGHRVQRVPATESQGRIHTSACTVAVMPELPDAELPDINPADLRIDTFRSSGAGGQHVNTT
DSAIRITHLPTGIVVECQDERSQHKNKAKALSVLGARIHAAEMAKRQQAEASTRRNLLGSGDRSDRNRTYNFPQGRVTDH
RINLTLYRLDEVMEGKLDMLIEPIIQEHQADQLAALSEQE
>P9WHG3 ~~~prfA~~~Peptide chain release factor 1~~~COG0216
MTQPVQTIDVLLAEHAELELALADPALHSNPAEARRVGRRFARLAPIVATHRKLTSARDDLETARELVASDESFAAEVAA
LEARVGELDAQLTDMLAPRDPHDADDIVLEVKSGEGGEESALFAADLARMYIRYAERHGWAVTVLDETTSDLGGYKDATL
AIASKADTPDGVWSRMKFEGGVHRVQRVPVTESQGRVHTSAAGVLVYPEPEEVGQVQIDESDLRIDVFRSSGKGGQGVNT
TDSAVRITHLPTGIVVTCQNERSQLQNKTRALQVLAARLQAMAEEQALADASADRASQIRTVDRSERIRTYNFPENRITD
HRIGYKSHNLDQVLDGDLDALFDALSAADKQSRLRQS
>Q8DU64 ~~~prfA~~~Peptide chain release factor 1~~~COG0216
MNIYDQLQAVEDRYEELGELLSDPDVVSDTKRFMELSREEANSRETVAVYREYKQVVQNIADAQEMIKDASGDPELEEMA
KEELKNSKVAKEEYEEKLRFLLLPKDPNDDKNIILEIRGAAGGDEAALFAGDLLNMYQKYAENQGWKFEVMEASANGVGG
LKEVVAMVSGQSVYSKLKYESGAHRVQRVPVTESQGRVHTSTATVLVMPEVEEVEYEIDPKDLRVDIYHASGAGGQNVNK
VATAVRIIHLPTNIKVEMQEERTQQKNRDKAMKIIRARVADHFAQIAQDEQDAERKSTVGTGDRSERIRTYNFPQNRVTD
HRIGLTLQKLDSILSGKLDEVIDALILYDQTQKLEELNK
>Q9X183 ~~~prfA~~~Peptide chain release factor 1~~~COG0216
MKEKKKEIEKLLARPDLTPEQMKNYGMEYAKIEEIENITNRIKETQEFIELLREEGENELEIEKYEKELDQLYQELLFLL
SPEASDKAIVEIRPGTGGEEAALFARDLFRMYTRYAERKGWNLEVAEIHETDLGGIREVVFFVKGKNAYGILKYESGVHR
VQRVPVTESGGRIHTSTATVAVLPEIEEKDIEIRPEDLKIETFRASGHGGQYVNKTESAVRITHLPTGIVVSCQNERSQY
QNKQTALRILRARLYQLQKEQKEREISQKRKSQIGTGERSEKIRTYNFPQNRVTDHRINYTSYRLQEILDGDLDEIISKL
IEHDIENNLEEVLGIGASVEEK
>Q72HB8 ~~~prfA~~~Peptide chain release factor 1~~~COG0216
MLDKLDRLEEEYRELEALLSDPEVLKDKGRYQSLSRRYAEMGEVIGLIREYRKVLEDLEQAESLLDDPELKEMAKAEREA
LLARKEALEKELERHLLPKDPMDERDAIVEIRAGTGGEEAALFARDLFNMYLRFAEEMGFETEVLDSHPTDLGGFSKVVF
EVRGPGAYGTFKYESGVHRVQRVPVTETQGRIHTSTATVAVLPKAEEEDFALNMDEIRIDVMRASGPGGQGVNTTDSAVR
VVHLPTGIMVTCQDSRSQIKNREKALMILRSRLLEMKRAEEAERLRKTRLAQIGTGERSEKIRTYNFPQSRVTDHRIGFT
THDLEGVLSGHLTPILEALKRADQERQLAALAEG
>P96077 ~~~prfA~~~Peptide chain release factor 1~~~COG0216
MLDKLDRLEEEYRELEALLSDPEVLKDKGRYQSLSRRYAEMGEVIGLIREYRKVLEDLEQAESLLDDPELKEMAKAEREA
LLARKEALEKELERHLLPKDPMDERDAIVEIRAGTGGEEAALFARDLFNMYLRFAEEMGFETEVLDSHPTDLGGFSKVVF
EVRGPGAYGTFKYESGVHRVQRVPVTETQGRIHTSTATVAVLPKAEEEDFALNMDEIRIDVMRASGPGGQGVNTTDSAVR
VVHLPTGIMVTCQDSRSQIKNREKALMILRSRLLEMKRAEEAERLRKTRLAQIGTGERSEKIRTYNFPQSRVTDHRIGFT
THDLEGVLSGHLTPILEALKRADQERQLAALAEG
>P28367 ~~~prfB~~~Peptide chain release factor 2~~~COG1186
MELSEIRAELENMASRLADFRGSLDLESKEARIAELDEQMADPEFWNDQQKAQTVINEANGLKDYVNSYKKLNESHEELQ
MTHDLLKEEPDTDLQLELEKELKSLTKEFNEFELQLLLSEPYDKNNAILELHPGAGGTESQDWGSMLLRMYTRWGERRGF
KVETLDYLPGDEAGIKSVTLLIKGHNAYGYLKAEKGVHRLVRISPFDSSGRRHTSFVSCEVMPEFNDEIDIDIRTEDIKV
DTYRASGAGGQHVNTTDSAVRITHLPTNVVVTCQTERSQIKNRERAMKMLKAKLYQRRIEEQQAELDEIRGEQKEIGWGS
QIRSYVFHPYSMVKDHRTNTEMGNVQAVMDGDIDTFIDAYLRSKLS
>P07012 ~~~prfB~~~Peptide chain release factor RF2~~~COG1186
MFEINPVNNRIQDLTERSDVLRGYLDYDAKKERLEEVNAELEQPDVWNEPERAQALGKERSSLEAVVDTLDQMKQGLEDV
SGLLELAVEADDEETFNEAVAELDALEEKLAQLEFRRMFSGEYDSADCYLDIQAGSGGTEAQDWASMLERMYLRWAESRG
FKTEIIEESEGEVAGIKSVTIKISGDYAYGWLRTETGVHRLVRKSPFDSGGRRHTSFSSAFVYPEVDDDIDIEINPADLR
IDVYRTSGAGGQHVNRTESAVRITHIPTGIVTQCQNDRSQHKNKDQAMKQMKAKLYELEMQKKNAEKQAMEDNKSDIGWG
SQIRSYVLDDSRIKDLRTGVETRNTQAVLDGSLDQFIEASLKAGL
>A0QU58 ~~~prfB~~~Peptide chain release factor 2~~~COG0216
MDPDRQADIAALDTTLTTVERVLDVDGLRNRIEQLEKDASDPNLWDDQTRAQKVTSDLSHAQNELRRVEGLRQRLDDLPV
LYELAAEAGGPDEVAEADAELAKLREDIEAMEVRTLLSGEYDEREAVVTIRSGAGGVDAADWAEMLMRMYIRWAEKHDYP
VEIFDTSYAEEAGIKSATFAVHAPFAYGTLSVEQGTHRLVRISPFDNQSRRQTSFADVEVLPVVETTDHIEIPENDIRVD
VYRSSGPGGQSVNTTDSAVRLTHIPTGIVVTCQNEKSQLQNKVSAMRVLQAKLLERKRLEERAELDALKGDGGSSWGNQM
RSYVLHPYQMVKDLRTEYEVGNPASVLDGDIDGFLEAGIRWRNRKDDD
>P9WHG1 ~~~prfB~~~Peptide chain release factor 2~~~COG0216
MDPDRQADIAALDCTLTTVERVLDVEGLRSRIEKLEHEASDPHLWDDQTRAQRVTSELSHTQGELRRVEELRRRLDDLPV
LYELAAEEAGAAAADAVAEADAELKSLRADIEATEVRTLLSGEYDEREALVTIRSGAGGVDAADWAEMLMRMYIRWAEQH
KYPVEVFDTSYAEEAGIKSATFAVHAPFAYGTLSVEQGTHRLVRISPFDNQSRRQTSFAEVEVLPVVETTDHIDIPEGDV
RVDVYRSSGPGGQSVNTTDSAVRLTHIPSGIVVTCQNEKSQLQNKIAAMRVLQAKLLERKRLEERAELDALKADGGSSWG
NQMRSYVLHPYQMVKDLRTEYEVGNPAAVLDGDLDGFLEAGIRWRNRRNDD
>Q7A6R4 ~~~prfB~~~Peptide chain release factor 2~~~
MELSEIKRNIDKYNQDLTQIRGSLDLENKETNIQEYEEMMAEPNFWDNQTKAQDIIDKNNALKAIVNGYKTLQAEVDDMD
ATWDLLQEEFDEEMKEDLEQEVINFKAKVDEYELQLLLDGPHDANNAILELHPGAGGTESQDWANMLFRMYQRYCEKKGF
KVETVDYLPGDEAGIKSVTLLIKGHNAYGYLKAEKGVHRLVRISPFDSSGRRHTSFASCDVIPDFNNDEIEIEINPDDIT
VDTFRASGAGGQHINKTESAIRITHHPSGIVVNNQNERSQIKNREAAMKMLKSKLYQLKLEEQAREMAEIRGEQKEIGWG
SQIRSYVFHPYSMVKDHRTNEETGKVDAVMDGDIGPFIESYLRQTMSHD
>Q5SM01 ~~~prfB~~~Peptide chain release factor RF2~~~COG1186
MRLASQSAILVKVWTWNASRNAWKASGGIFDIPQKETRLKELERRLEDPSLWNDPEAARKVSQEAARLRRTVDTFRSLES
DLQGLLELMEELPAEEREALKPELEEAAKKLDELYHQTLLNFPHAEKNAILTIQPGAGGTEACDWAEMLLRMYTRFAERQ
GFQVEVVDLTPGPEAGIDYAQILVKGENAYGLLSPEAGVHRLVRPSPFDASGRRHTSFAGVEVIPEVDEEVEVVLKPEEL
RIDVMRASGPGGQGVNTTDSAVRVVHLPTGITVTCQTTRSQIKNKELALKILKARLYELERKKREEELKALRGEVRPIEW
GSQIRSYVLDKNYVKDHRTGLMRHDPENVLDGDLMDLIWAGLEWKAGRRQGTEEVEAE
>Q83DC7 ~~~prfC~~~Peptide chain release factor 3~~~COG4108
MSVEKQTAMRRTFAIISHPDAGKTTLTEKLLLFGGAIQLAGTIKSRKAARHATSDWMELEKQRGISVTTSVMQFPYKDYL
INLLDTPGHADFTEDTYRTLTAVDSALMVIDAAKGVEPRTIKLMEVCRLRHTPIMTFINKMDRDTRPSIELLDEIESILR
IHCAPVTWPIGMGKYFKGIYHLIEDAIYLYQPGKHERVGESERIEGINNPELDKKLGDLASELRNEIELVKGASHPFERE
GYLKGELTPIFFGSAINNFGVGELLDAFVKEAPPPQGRETNSRLVKPEEEKFSGFVFKIQANMDPGHRDRIAFLRIASGQ
YQKGMKAYHVRLKKEIQINNALTFMAGKRENAEEAWPGDIIGLHNHGTIQIGDTFTQGERFKFTGIPNFASELFRLVRLK
DPLKQKALLKGLTQLSEEGATQLFRPLDSNELILGAVGLLQFDVVAYRLENEYNVKCVYESVNVVTARWVICDDKAVLER
FNQEQSRNLAYDGGGHLTYLAPSRVNLEITMEKWPEIQFSETREH
>B8DIL5 ~~~prfC~~~Peptide chain release factor 3~~~COG4108
MSSRLEREAARRRTFAIISHPDAGKTTLTEKLLLFGGAIQMAGSVKARKAARHATSDWMAMERERGISVTTSVMQFPYRD
RVVNLLDTPGHQDFSEDTYRVLTAVDSALVVIDAAKGVEAQTRKLMDVCRMRATPVMTFVNKMDREALHPLDVMADIEQH
LQIECAPMTWPIGMGSSFKGTYDLLHKQLHLFSATHGGRIQSGIVIHGADDPQLDEYLGDQAEQLRMDLALLEEAGTPFD
EERYLKGELTPVFFGSAINNFGVREMLDMFVEFAPGPQPRPAATRVVEPGEEAFTGVVFKIQANMDKAHRDRMAFLRICS
GTFTRGMRLKHHRTGKDVTVANATIFMAQDRTGVEEAFPGDIIGIPNHGTIKIGDTFTESKEVLKFVGIPNFAPEHFRRV
RLKNPLKAKQLQKGLEQLAEEGAVQLFRPLVNNDYILGAVGVLQFDVIVARLADEYGVDAVYEGVSTHTARWVYCEDKKI
FADFQDYHRGELAVDAEGALAYLAPNPWRLESAMERYPKVEFRTTREIS
>P0A7I4 ~~~prfC~~~Peptide chain release factor RF3~~~COG4108
MTLSPYLQEVAKRRTFAIISHPDAGKTTITEKVLLFGQAIQTAGTVKGRGSNQHAKSDWMEMEKQRGISITTSVMQFPYH
DCLVNLLDTPGHEDFSEDTYRTLTAVDCCLMVIDAAKGVEDRTRKLMEVTRLRDTPILTFMNKLDRDIRDPMELLDEVEN
ELKIGCAPITWPIGCGKLFKGVYHLYKDETYLYQSGKGHTIQEVRIVKGLNNPDLDAAVGEDLAQQLRDELELVKGASNE
FDKELFLAGEITPVFFGTALGNFGVDHMLDGLVEWAPAPMPRQTDTRTVEASEDKFTGFVFKIQANMDPKHRDRVAFMRV
VSGKYEKGMKLRQVRTAKDVVISDALTFMAGDRSHVEEAYPGDILGLHNHGTIQIGDTFTQGEMMKFTGIPNFAPELFRR
IRLKDPLKQKQLLKGLVQLSEEGAVQVFRPISNNDLIVGAVGVLQFDVVVARLKSEYNVEAVYESVNVATARWVECADAK
KFEEFKRKNESQLALDGGDNLAYIATSMVNLRLAQERYPDVQFHQTREH
>Q99V72 ~~~prfC~~~Peptide chain release factor 3~~~
MNLKQEVESRKTFAIISHPDAGKTTLTEKLLYFSGAIREAGTVKGKKTGKFATSDWMKVEQERGISVTSSVMQFDYDDYK
INILDTPGHEDFSEDTYRTLMAVDSAVMVIDCAKGIEPQTLKLFKVCKMRGIPIFTFINKLDRVGKEPFELLDEIEETLN
IETYPMNWPIGMGQSFFGIIDRKSKTIEPFRDEENILHLNDDFELEEDHAITNDSAFEQAIEELMLVEEAGEAFDNDALL
SGDLTPVFFGSALANFGVQNFLNAYVDFAPMPNARQTKEDVEVSPFDDSFSGFIFKIQANMDPKHRDRIAFMRVVSGAFE
RGMDVTLQRTNKKQKITRSTSFMADDKETVNHAVAGDIIGLYDTGNYQIGDTLVGGKQTYSFQDLPQFTPEIFMKVSAKN
VMKQKHFHKGIEQLVQEGAIQYYKTLHTNQIILGAVGQLQFEVFEHRMKNEYNVDVVMEPVGRKIARWIENEDQITDKMN
TSRSILVKDRYDDLVFLFENEFATRWFEEKFLEIKLYSLL
>B2FR00 ~~~prfC~~~Peptide chain release factor 3~~~COG4108
MSEVANEASRRRTFAIISHPDAGKTTLTEKLLLFGGAIQMAGSVKGRKAARHATSDWMALEKERGISVTSSVMQFPYEDK
IVNLLDTPGHADFGEDTYRVLTAVDSALMVIDVAKGVEERTIKLMEVCRLRDTPIMTFINKLDREGKDPIELLDEVETVL
GIQCAPVTWPIGMGQRLKGVVHLLTGEVHLYEPGRNFTRQDSTIFPSIDAPGLAEKIGAQMLADLRDELELVQGASHPFD
LEAYRAGKQTPVFFGSGVNNFGVQPLLDFFVEHAPSPQARSTTGREIAPEENKLTGFVFKIQANMDPQHRDRVAFMRVCS
GRFSAGMKTFHVRTGKEMKLANALTFMASDREIAAEAWPGDVIGIHNHGTISIGDTFTEGEAVTFTGIPNFAPELFRRAR
LRDPLKLKQLQKGLAQLSEEGATQFFRPLTSNDLILGAVGVLQFDVAAYRLKDEYGVEATFEPVSVTTARWVHCSNEKKL
EEFREKNALNLALDAAGHLVYLAPTRVNLQLAQERSPDVRFSATREAAHTVSVG
>Q06994 2.4.1.-~~~rfaB~~~Lipopolysaccharide 1,6-galactosyltransferase~~~
MKIAFIGEAVSGFGGMETVISNVIHTFENSSPKINCEMFFFCRNDKMDKAWLKEIKYAQSFSNIKLSFLRRAKHVYNFSQ
WLKETSPDIVICIDVISCLYANKARKKSGKHFTIFSWPHFSLDHKKHAECITYADYHLAISSGIKEQIMARGISAQDISV
VYNPVSIKTVIVPPPERDKPAVFLYVGRLKFEGQKRVKDLFDGLARTTGEWQLHIIGDGSDFEKCQAYSRELGIEQRVIW
YGWQSAPWQVVQQKIKNVTALLLTSAFEGFPMTLLEAMSYGIPCISSDCMSGPRDMIKPGLNGELYTPGAIDDFVGHLNR
VISGEVKYQHDIIPGTIERFYDVLYFKNFNNAIFSKLQK
>P24173 2.-.-.-~~~rfaC~~~Lipopolysaccharide heptosyltransferase 1~~~COG0859
MRVLIVKTSSMGDVLHTLPALTDAQQAIPGIKFDWVVEEGFAQIPSWHAAVERVIPVAIRRWRKAWFSAPIKAERKAFRE
ALQAENYDAVIDAQGLVKSAALVTRLAHGVKHGMDWQTAREPLASLFYNRKHHIAKQQHAVERTRELFAKSLGYSKPQTQ
GDYAIAQHFLTNLPTDAGEYAVFLHATTRDDKHWPEEHWRELIGLLADSGIRIKLPWGAPHEEERAKRLAEGFAYVEVLP
KMSLEGVARVLAGAKFVVSVDTGLSHLTAALDRPNITVYGPTDPGLIGGYGKNQMVCRAPRENLINLNSQAVLEKLSSL
>P37692 2.-.-.-~~~rfaF~~~ADP-heptose--LPS heptosyltransferase 2~~~COG0859
MKILVIGPSWVGDMMMSQSLYRTLQARYPQAIIDVMAPAWCRPLLSRMPEVNEAIPMPLGHGALEIGERRKLGHSLREKR
YDRAYVLPNSFKSALVPFFAGIPHRTGWRGEMRYGLLNDVRVLDKEAWPLMVERYIALAYDKGIMRTAQDLPQPLLWPQL
QVSEGEKSYTCNQFSLSSERPMIGFCPGAEFGPAKRWPHYHYAELAKQLIDEGYQVVLFGSAKDHEAGNEILAALNTEQQ
AWCRNLAGETQLDQAVILIAACKAIVTNDSGLMHVAAALNRPLVALYGPSSPDFTPPLSHKARVIRLITGYHKVRKGDAA
EGYHQSLIDITPQRVLEELNALLLQEEA
>P25740 2.4.-.-~~~rfaG~~~Lipopolysaccharide core biosynthesis protein RfaG~~~COG0438
MIVAFCLYKYFPFGGLQRDFMRIASTVAARGHHVRVYTQSWEGDCPKAFELIQVPVKSHTNHGRNAEYYAWVQNHLKEHP
ADRVVGFNKMPGLDVYFAADVCYAEKVAQEKGFLYRLTSRYRHYAAFERATFEQGKSTKLMMLTDKQIADFQKHYQTEPE
RFQILPPGIYPDRKYSEQIPNSREIYRQKNGIKEQQNLLLQVGSDFGRKGVDRSIEALASLPESLRHNTLLFVVGQDKPR
KFEALAEKLGVRSNVHFFSGRNDVSELMAAADLLLHPAYQEAAGIVLLEAITAGLPVLTTAVCGYAHYIADANCGTVIAE
PFSQEQLNEVLRKALTQSPLRMAWAENARHYADTQDLYSLPEKAADIITGGLDG
>Q0TAL4 ~~~rfaH~~~Transcription antitermination protein RfaH~~~
MQSWYLLYCKRGQLQRAQEHLERQAVNCLAPMITLEKIVRGKRTAVSEPLFPNYLFVEFDPEVIHTTTINATRGVSHFVR
FGASPAIVPSAVIHQLSVYKPKDIVDPSTPYPGDKVIITEGAFEGFQAIFTEPDGEARSMLLLNLINKEIKHSVKNTEFR
KL
>Q8FBI4 ~~~rfaH~~~Transcription antitermination protein RfaH~~~COG0250
MQSWYLLYCKRGQLQRAQEHLERQAVNCLAPMITLEKIVRGKRTAVSEPLFPNYLFVEFDPEVIHTTTINATRGVSHFVR
FGASPAIVPSAVIHQLSVYKPKDIVDPSTPYPGDKVIITEGAFEGFQAIFTEPDGEARSMLLLNLINKEIKHSVKNTEFR
KL
>P0AFW0 ~~~rfaH~~~Transcription antitermination protein RfaH~~~COG0250
MQSWYLLYCKRGQLQRAQEHLERQAVNCLAPMITLEKIVRGKRTAVSEPLFPNYLFVEFDPEVIHTTTINATRGVSHFVR
FGASPAIVPSAVIHQLSVYKPKDIVDPATPYPGDKVIITEGAFEGFQAIFTEPDGEARSMLLLNLINKEIKHSVKNTEFR
KL
>P27243 ~~~rfaL~~~O-antigen ligase~~~COG3307
MLTSFKLHSLKPYTLKSSMILEIITYILCFFSMIIAFVDNTFSIKIYNITAIVCLLSLILRGRQENYNIKNLILPLSIFL
IGLLDLIWYSAFKVDNSPFRATYHSYLNTAKIFIFGSFIVFLTLTSQLKSKKESVLYTLYSLSFLIAGYAMYINSIHEND
RISFGVGTATGAAYSTMLIGIVSGVAILYTKKNHPFLFLLNSCAVLYVLALTQTRATLLLFPIICVAALIAYYNKSPKKF
TSSIVLLIAILASIVIIFNKPIQNRYNEALNDLNSYTNANSVTSLGARLAMYEIGLNIFIKSPFSFRSAESRAESMNLLV
AEHNRLRGALEFSNVHLHNEIIEAGSLKGLMGIFSTLFLYFSLFYIAYKKRALGLLILTLGIVGIGLSDVIIWARSIPII
IISAIVLLLVINNRNNTIN
>Q9R9D6 2.7.1.-~~~rfaP~~~Lipopolysaccharide core heptose(I) kinase RfaP~~~COG0515
MVELKEPFATLWRGKDPFEEVKTLQGEVFRELETRRTLRFEMAGKSYFLKWHRGTTLKEIIKNLLSLRMPVLGADREWNA
IHRLRDVGVDTMYGVAFGEKGMNPLTRTSFIITEDLTPTISLEDYCADWATNPPDVRVKRMLIKRVATMVRDMHAAGINH
RDCYICHFLLHLPFSGKEEELKISVIDLHRAQLRTRVPRRWRDKDLIGLYFSSMNIGLTQRDIWRFMKVYFAAPLKDILK
QEQGLLSQAEAKATKIRERTIRKSL
>Q9HUF7 2.7.1.-~~~rfaP~~~Lipopolysaccharide core heptose(I) kinase RfaP~~~
MRLVLEEPFKRLWNGRDPFEAVEALQGKVYRELEGRRTLRTEVDGRGYFVKIHRGIGWGEIAKNLLTAKLPVLGARQEWQ
AIRRLHEAGVATMTAVAYGERGSDPARQHSFIVTEELAPTVDLEVFSQDWRERPPPPRLKRALVEAVARMVGDMHRAGVN
HRDCYICHFLLHTDKPVSADDFRLSVIDLHRAQTRDATPKRWRNKDLAALYFSALDIGLTRRDKLRFLRTYFRRPLREIL
RDEAGLLAWMERKAEKLYERKQRYGDLL
>Q9R9D5 2.-.-.-~~~rfaQ~~~Lipopolysaccharide core heptosyltransferase RfaQ~~~COG0859
MRFHGDMLLTTPVISSLKKNYPDAKIDVLLYQDTIPILSENPEINALYGIKNKKAKASEKIANFFHLIKVLRANKYDLIV
NLTDQWMVAILVRLLNARVKISQDYHHRQSAFWRKSFTHLVPLQGGNVVESNLSVLTPLGVDSLVKQTTMSYPPASWKRM
RRELDHAGVGQNYVVIQPTARQIFKCWDNAKFSAVIDALHARGYEVVLTSGPDKDDLACVNEIAQGCQTPPVTALAGKVT
FPELGALIDHAQLFIGVDSAPAHIAAAVNTPLISLFGATDHIFWRPWSNNMIQFWAGDYREMPTRDQRDRNEMYLSVIPA
ADVIAAVDKLLPSSTTGTSL
>Q9ZIS7 2.7.1.-~~~rfaY~~~Lipopolysaccharide core heptose(II) kinase RfaY~~~COG0661
MITSIRYRGFSFYYKDNDNKYKEIFDEILAYNFKTVKVLRNIDDTKVSLIDTKYGRYVFKVFAPKTKRNERFLKSFVKGD
YYQNLIVETDRVRSAGLTFPNDFYFLAERKIFNYASVFIMLIEYVEGVELNDMPIIPENVKAEIKASMEKLHALNMLSGD
PHRGNFIVSKDGVRIIDLSGKSCTAERKARDRLAMERHLGIANEIKDYGYYSVIYRTKLRKFIKKLKGKA
>P14169 5.1.3.10~~~rfbE~~~CDP-paratose 2-epimerase~~~COG0451
MKLLITGGCGFLGSNLASFALSQGIDLIVFDNLSRKGATDNLHWLSSLGNFEFVHGDIRNKNDVTRLITKYMPDSCFHLA
GQVAMTTSIDNPCMDFEINVGGTLNLLEAVRQYNSNCNIIYSSTNKVYGDLEQYKYNETETRYTCVDKPNGYDESTQLDF
HSPYGCSKGAADQYMLDYARIFGLNTVVFRHSSMYGGRQFATYDQGWVGWFCQKAVEIKNGINKPFTISGNGKQVRDVLH
AEDMISLYFTALANVSKIRGNAFNIGGTIVNSLSLLELFKLLEDYCNIDMRFTNLPVRESDQRVFVADIKKITNAIDWSP
KVSAKDGVQKMYDWTSSI
>Q8Z5I4 2.7.7.33~~~rfbF~~~Glucose-1-phosphate cytidylyltransferase~~~COG1208
MKAVILAGGLGTRLSEETIVKPKPMVEIGGKPILWHIMKMYSVHGIKDFIICCGYKGYVIKEYFANYFLHMSDVTFHMAE
NRMEVHHKRVEPWNVTLVDTGDSSMTGGRLKRVAEYVKDDEAFLFTYGDGVADLDIKATIDFHKAHGKKATLTATFPPGR
FGALDIQAGQVRSFQEKPKGDGAMINGGFFVLNPSVIDLIDNDATTWEQEPLMTLAQQGELMAFEHPGFWQPMDTLRDKV
YLEGLWEKGKAPWKTWE
>P26397 4.2.1.45~~~rfbG~~~CDP-glucose 4,6-dehydratase~~~
MIDKNFWQGKRVFVTGHTGFKGSWLSLWLTEMGAIVKGYALDAPTVPSLFEIVRLNDLMESHIGDIRDFEKLRNSIAEFK
PEIVFHMAAQPLVRLSYEQPIETYSTNVMGTVHLLETVKQVGNIKAVVNITSDKCYDNREWVWGYRENEPMGGYDPYSNS
KGCAELVASAFRNSFFNPANYEQHGVGLASVRAGNVIGGGDWAKDRLIPDILRSFENNQQVIIRNPYSIRPWQHVLEPLS
GYIVVAQRLYTEGAKFSEGWNFGPRDEDAKTVEFIVDKMVTLWGDDASWLLDGENHPHEAHYLKLDCSKANMQLGWHPRW
GLTETLGRIVKWHKAWIRGEDMLICSKREISDYMSATTR
>P0A1P4 1.1.1.341~~~rfbJ~~~CDP-abequose synthase~~~
MTFLKEYVIVSGASGFIGKHLLEALKKSGISVVAITRDVIKNNSNALANVRWCSWDNIELLVEELSIDSALIGIIHLATE
YGHKTSSLINIEDANVIKPLKLLDLAIKYRADIFLNTDSFFAKKDFNYQHMRPYIITKRHFDEIGHYYANMHDISFVNMR
LEHVYGPGDGENKFIPYIIDCLNKKQSCVKCTTGEQIRDFIFVDDVVNAYLTILENRKEVPSYTEYQVGTGAGVSLKDFL
VYLQNTMMPGSSSIFEFGAIEQRDNEIMFSVANNKNLKAMGWKPNFDYKKGIEELLKRL
>Q05342 1.1.1.341~~~rfbJ~~~CDP-abequose synthase~~~COG0451
MRIVLTGGSGYIGSSLTPVLIKKYGRVYNIGRNTISEVSINGSKEYCEFTYESLFDSLVELSPDLVINLAAGYYNDSGAP
DLNVIDGNLKIPFIILEYFKSCNYGRFINIGSYWEFSCSGRGVKGVNPYGIIKSTVRRLLDYYSKYNVIYTNLILYGSYG
DNDHRGKIVDCIIDAVNSNETLKLSPGEQKLNLVYIDDIIEAILYIVSSDNGQYDNETLSIYTPTEHTVKEIVCFINEIK
DNNLSLGGGRYRNDEVMAPDYKYRNIFHAKDKLKEYITSKIKK
>P26404 2.7.7.13~~~rfbM~~~Mannose-1-phosphate guanylyltransferase RfbM~~~
MSFLPVIMAGGTGSRLWPLSREYHPKQFLSVEGKLSMLQNTIKRLASLSTEEPVVICNDRHRFLVAEQLREIDKLANNII
LEPVGRNTAPAIALAAFCALQNADNADPLLLVLAADHVIQDEIAFTKAVRHAEEYAANGKLVTFGIVPTHAETGYGYIRR
GELIGNDAYAVAEFVEKPDIDTAGDYFKSGKYYWNSGMFLFRASSYLNELKYLSPEIYKACEKAVGHINPDLDFIRIDKE
EFMSCPSDSIDYAVMEHTQHAVVIPMSAGWSDVGSWSSLWDISNKDHQRNVLKGDIFAHACNDNYIYSEDMFISAIGVSN
LVIVQTTDALLVANKDTVQDVKKIVDYLKRNDRNEYKQHQEVFRPWGKYNVIDSGKNYLVRCITVKPGEKFVAQMHHHRA
EHWIVLSGTARVTKGEQTYMVSENESTFIPPNTIHALENPGMTPLKLIEIQSGTYLGEDDIIRLEQRSGFSKEWTNERS
>P26401 2.4.1.60~~~rfbV~~~Abequosyltransferase RfbV~~~
MLISFCIPTYNRKEYLEELLNSINNQEKFNLDIEICISDNASTDGTEEMIDVWRNNYNFPIIYRRNSVNLGPDRNFLASV
SLANGDYCWIFGSDDALAKDSLAILQTYLDSQADIYLCDRKETGCDLVEIRNPHRSWLRTDDELYVFNNNLDREIYLSRC
LSIGGVFSYLSSLIVKKERWDAIDFDASYIGTSYPHVFIMMSVFNTPGCLLHYISKPLVICRGDNDSFEKKGKARRILID
FIAYLKLANDFYSKNISLKRAFENVLLKERPWLYTTLAMACYGNSDEKRDLSEFYAKLGCNKNMINTVLRFGKLAYAVKN
ITVLKNFTKRIIK
>P37746 ~~~rfbX~~~Putative O-antigen transporter~~~COG2244
MNTNKLSLRRNVIYLAVVQGSNYLLPLLTFPYLVRTLGPENFGIFGFCQATMLYMIMFVEYGFNLTATQSIAKAADSKDK
VTSIFWAVIFSKIVLIVITLIFLTSMTLLVPEYNKHAVIIWSFVPALVGNLIYPIWLFQGKEKMKWLTLSSILSRLAIIP
LTFIFVNTKSDIAIAGFIQSSANLVAGIIALAIVVHEGWIGKVTLSLHNVRRSLADGFHVFISTSAISLYSTGIVIILGF
ISGPTSVGNFNAANTIRNALQGLLNPITQAIYPRISSTLVLNRVKGVILIKKSLTCLSLIGGAFSLILLLGASILVKISI
GPGYDNAVIVLMIISPLPFLISLSNVYGIQVMLTHNYKKEFSKILIAAGLLSLLLIFPLTTLFKEIGAAITLLATECLVT
SLMLMFVRNNKLLVC
>A6X7E7 ~~~rfnT~~~Riboflavin transporter RfnT~~~COG2814
MTDATAARRNIVILTIAQALGASSPPIVISLGGLVGQKLSSDPALVTLPVSLFNLGLALGTLPAAFFMRQFGRRNAYMLG
ALVGAAAGVIAAAGIFAASFLIFCLGTLTAGFYASYVQSYRFAATDAATGDMKARAISWVMVGGLVAAIVGPQLVIWTRD
TIPDAMFAGSFLSQAVLGLLALPVLFMLRAPKVRKDPNAIHDTGRPLGEILRSPRFILSVAAGVCSYALMTFVMTAAPIA
MVGHGHSVDHAALGIQWHVLAMFAPSFFTGKLITRFGKEKITALGLVLIAFSAIIALGGFDVGHFWGALIFLGIGWNFGF
IGATAMVTDCHTPAERGKAQGANDFIMFGTVACASFFAGSLLHSSGWETINWLVFPIVALVLVPLILRLKPKGAAAEA
>B1WVN5 ~~~~~~Pentapeptide repeat protein Rfr32~~~COG1357
MVISRHDLGFIWMKYLLRLTIVVFSLLWLITPTAYAASSSAVTGSSASYEDVKLIGEDFSGKSLTYAQFTNADLTDSNFS
EADLRGAVFNGSALIGADLHGADLTNGLAYLTSFKGADLTNAVLTEAIMMRTKFDDAKITGADFSLAVLDVYEVDKLCDR
ADGVNPKTGVSTRESLGCQ
>O67035 3.1.26.5~~~~~~RNA-free ribonuclease P~~~COG1458
MDVFVLDTSVFTNPEIYRTFEEDQRGAMETFIHLALNSRAEFYMPTSVYTEMRKIMDVGELWAEFEMVVKIRSPRRFQLT
VPADFLYEFIEELRYRINKGLRIAEEHTREASGCEDVGKLIARLREKYREALRQGILDSKEDVDVLLLAYELDGVLVSAD
EGLRTWADKIGIKLIDPKNFKNILESLVRHRF
>A1WZ95 3.1.26.5~~~~~~RNA-free ribonuclease P~~~COG1458
MRRFVLDTSVFTNPDVYLRFDEEPMQAISVFLGLARRADAEFYMPGPVYQELCNLRSMDLIGAEFETEVYIRSPRRFSMT
IPSEVLYEFIEEVRTRIQRGLRIAEEHARQAGQAESLPPELITQLRERYREAMRRGILDSREDIDVVLLAYELDATLVSA
DEGMRKFAERIGIKLVNPRYLRGVMQNLAGDDPGHAPPCGPDQPAG
>Q56328 ~~~rfuA~~~ABC transporter riboflavin-binding protein RfuA~~~COG1744
MNGAVCVLSALIAVFTCFSCRPAVQDERAVRIAVFVPGFRHDSPVYAMLCDGVERAVTQERATGRSIGLDIIEAGPNQAL
WREKLAHLAAEQRYRLIVSSNPALPHVLEPILRQFPLQRFLVLDAYAPQEHSLITFRYNQWEQAYLAGHLSALVSASAMR
FANADKKIGLIAGQSYPVMTQTIIPAFLAGARAVDPAFEVDVRVVGNWYDAAKSADLARILFHEGVDVMMPICGGANQGV
LAAARELGFYVSWFDDNGYARAPGYVVGSSVMEQERLAYEQTLRCIRGELPSAGAWTLGVKDGYVRFIEEDPLYLQTVPE
PIRVRQSALLRRIQSGELTLPVR
>O83321 7.6.2.-~~~rfuB~~~Probable riboflavin import ATP-binding protein RfuB~~~COG3845
MMIAERGVRASARGVLSLHHIGKTYPRVMPRSKRGVWGMFGHPGRRAVDDAHTAHGPCSGARETDAAEHSVLSDVNLSFF
TGEIHALLGKNGAGKSTLAHILSGFCVPTHGQLRLDGKEQRFSVPFDALRAGIGIVHQQPVFAERATVFENVVMGSAALT
GVRWVRRAQVRERIDRIIAQWRMPLKKEEYVACLSADKRFFVSLLCVLFRNPRFIILDEPRCAPAQSRAVFFSHLEEFFV
RSSHAPRCGGGVIVVTHRFADALRWAQRISLIEGGKACSFLRTDLLDEYCSAHQVNECIQKVSCALMSASTVTSSAVSSF
SSLSDTQSCATVPRTSSARPWVLRVESLQVSKHADVPLTDISFSVAASAIIGIVGTPEDGVHVLEDILCDMHAGASRTHC
TGNILLQEHDQVWCLPLQRNTPSLLRAHGVACVPSNCIQRGASMQLTLFDLLVPYTLRTWRTRVRAQMRFVARLLAEEEI
YCDPLQPACTLSGGQLQRVILARELATRPRLLILAEPAEGLDSASEQRLLARLRQVAQAGTALVLLAREQHQAQWRALCT
ERFLLRAGTLCAEVSGTPSPSQDSHT
>O83323 ~~~rfuC~~~Probable riboflavin import permease protein RfuC~~~COG4603
MKRVINSCIAVLLGVAVMSAVIVLCSENPSVSLAAFFLKPFSTRGYIRALFHKAGLFVCMALGASCALKTGMINLGGDGQ
IYAAGFVTALLLREYWGVGFLLQWSVALLCALSVAGILACVSGILKAWLATSEMITSFLLSTACVPLIDALIITVTRDPA
GNLLATAPVHSHFILQQQTSLFGVPAVLTYASLVALAVGCFFSYTRVGYQFRICGKAPEFGRFVGFPVWATYVWGMVLSG
ALFGLTGFFSVVGLFGTCYVGFSVGMGYAALAHALIAHAHITVLVPLAFFFAWMETASEAAVLGAHLTVNVVLFLQAAIF
LLISAQWSAPWNAVRRGARRVYRFLVTVFCFRGEKHRTRRRHALSVHDTHHRRSRWE
>O83324 ~~~rfuD~~~Probable riboflavin import permease protein RfuD~~~COG1079
MGVIGTTVIAILHRAAPLACAAAGALATEYAGVLGIFMEGVITFSSFCIAFFALVWGSYWGGLGITVCVVPLCLFFVAVG
TERMRANPFLTGIAVHFSAMGMSAFGASSMFARAAASAMQMDTAAHGVSFTHVSLAHTRVLPHPLWGTAVAFALVWVFHL
YLYSTNVGINFMHSGEGALALQVRGTDAARYRMVSWAVAGVCAVCAGGLLVLRVGTYTPQMAAGRGWTALAIVFLARKRM
MWCVPAAIFFSGIEHMCDVLQGTHVVPTGVLFALPYILSLVVFVCTRRTSPCRRGERRRSRLLFAYLQRVTCA
>O32236 ~~~rghR~~~HTH-type transcriptional repressor RghR~~~COG1396
MESFGEQLRALREERKLTVNQLATYSGVSAAGISRIENGKRGVPKPATIKKLAEALKIPYEGLMYKAGYIEEVHEARAPY
ETKCKLLEKAEAYDLKNLALLENEKWQYLNKEDLLMLDHYFSFISDEAKKRSADD
>Q1MJ96 2.4.1.-~~~rgtA~~~Lipopolysaccharide core galacturonosyltransferase RgtA~~~COG1807
MLERATRTIKTAGLLLAAYFVLNIVLRIVLPHSLELDEAEQSFFSQYLLAGYGPQPPFYNWMQYAVVSVTGISIGALIVP
KNILLFLSYLFYGLAGRRVLKDEALAAVGMLALITLPQVSYMAQQDLTHTTALLFASSLFLYGFFRTLDRPDMASYLLLG
LATGIGLISKYNFALMPVVALIAILPDAEWRRRALDWRMLAAITVALVIVLPHAVWLQGNLAFASSDTLVKMAAGSEPAG
AVRIGKGLLAFLVAIIAFAALPVVIFAATFRRDFVRALSAGNRWTGMMERMMLASLAGIALIVLFTGSTTVRERWLDPFL
LVLPIYFLAKMQAAGLDLSAGLRRFRPVLPVLMACVLIALGFRVVGAGLIGTYSRPNVPMAGFAREMTRQAEPALVIASD
TYIGGNMRLQFPDVPVVIPDFPAPGIPAYAEAKGPVLIVWRGKKTATAADAVMPERFSSALTAAGIALQEIGSLSLPYYF
GRQGDNFALGYAWVRPETR
>Q1MJ97 2.4.1.-~~~rgtB~~~Lipopolysaccharide core galacturonosyltransferase RgtB~~~COG1807
MTESNRRDISWIFALLAAYFVLQVGVRLATSHSLDLDEAEQAFRSQWLAAGYGPQPPFYNWLQYTVFQFAGVSLTALSIV
KNLLLFISYLLYGLTARLVLRDKALVAIATLGLLTIPQMAFEMQRDLTHTVAVFFSASIFFYGFIRSLKQPSLASYLIAG
IGIGFGLLAKYNFAILPAAALIAALSDARLRPRIFDWRLGLTAAVALVITLPHLFWLKDNLDFATARTLEKMTASGDASY
LTQVAMGVSSLALAIISFAALTVAVFAIVFGKSLRPALGSGSEWTRLLERMMLVFLAGILLLIVFGGAAGIKDRWLVPML
FILPLYFCLKIEAAGVETGKALRRFIPVVAVIMIGVPAALYGSVAAARFTGHYERLNRPYAGMLEILRKQAEPAAILAGD
SLLAGNLRQDIPGVPILSADYPGFNPDLTSRRPLLLVWLLPKGGSEALPPDMAEWLQANLGTSAPEASVIDVPYFYGRGD
DRYRFGYAWVNQPG
>Q1MJ94 2.4.1.-~~~rgtC~~~Lipopolysaccharide core galacturonosyltransferase RgtC~~~COG1807
MLERITRSITSASIFLAAYFLLNIALRIALPHTLDLDEAEQSFYSQYLLAGYGPQPPFYNWIQYAIVSVTGISMWVLSVP
KNIILFGCYLFYGLAAREVLKSRSLAALAMLSLITLPQVGLMAQRELTHTVALLFATSLFLFGFFRTLRQPTIGSYLLIG
IATGIGLISKYNFAILPFAALIAVLPEREWRSRLIDWRLLPAAVLAILIVLPHALWLPDNLASASAPTLERMTAEHLAPA
GLPRIGQGLLSLVIAVLGFVALPIVLIAAAFRRDFFRALSSSSPMIRVIERMMVISLLAFVGVVLFAGASDIHERWLDPC
LLVLLIYLFLKLETADIDLSAGLARFRPVVPVFMVVILSILLFRIVGIQYIGTYTRTNVPFSGYVAELTATRKPVLIVAG
TKFIAGNMRLQFPDVPVVIPFFPGPGVPEYADAKGPVLVIWRGETADDPTISPGFANDLVKSGIHLPELKTLTLPYLFGD
GKRSFSIGYSWVEGGAK
>Q1MJ95 2.4.1.-~~~rgtE~~~Dodecaprenyl-phosphate galacturonate synthase~~~COG0463
MQTTVEPIRGTNDPVQSLELSLVVPIFNEEQSVGPLVERVAAAMVSYPHRWELILVDDGSTDATLVNARKYVGREGLALR
IVELQRNFGQTAAMQAGIDTARGRLIATMDGDLQNDPKDIPSMVSELERRELDLLVGWRKNRKDGLFLRKIPSWCANYLI
GRITGVKLHDYGCSLKIYRASIIKQVKLMGEMHRFIPAWVAGVVPSSRIGEMAVTHHAREHGVSKYGISRTFRVILDLLS
VMFFMRYKARPGHFFGSLGLGLGALAMLILLYLGFDKFILGNDIGTRPMLMVGVVLLLSSVQMITTGILAEMIARTYYRD
DASPNYIVRQIFDDQSQA
>Q9I1W9 ~~~~~~Putative virulence-regulating protein PA2146~~~
MAQHQGGKGNFAEDPKRASEAGKKGGQASGGNFKNDPQRASEAGKKGGQRSHGGN
>O51934 ~~~rgy~~~Reverse gyrase~~~COG1110
MAVNSKYHHSCINCGGLNTDERNERGLPCEVCLPEDSPSDIYRALLERKTLKEYRFYHEFWNEYEDFRSFFKKKFGKDLT
GYQRLWAKRIVQGKSFTMVAPTGVGKTTFGMMTALWLARKGKKSALVFPTVTLVKQTLERLQKLADEKVKIFGFYSSMKK
EEKEKFEKSFEEDDYHILVFSTQFVSKNREKLSQKRFDFVFVDDVDAVLKASRNIDTLLMMVGIPEEIIRKAFSTIKQGK
IYERPKNLKPGILVVSSATAKPRGIRPLLFRDLLNFTVGRLVSVARNITHVRISSRSKEKLVELLEIFRDGILIFAQTEE
EGKELYEYLKRFKFNVGETWSEFEKNFEDFKVGKINILIGVQAYYGKLTRGVDLPERIKYVIFWGTPSMRFSLELDKAPR
FVLARVLKEMGLIKAQENPDVEELRKIAKEHLTQKEFVEKVKEMFRGVVVKDEDLELIIPDVYTYIQASGRSSRILNGVL
VKGVSVIFEEDEEIFESLKTRLLLIAEEEIIEEAEANWKELVHEVEESRRRSERELTDTSRSLLIIVESPTKAETLSRFL
GRASSRKERNIIVHEAVTGEGVILFTATRGHVYDLVTKGGIHGVEEENGKFVPVYNSLKRCRDCGYQFTEDRDECPVCSS
KNIDDKTETLRALREISLEADEILVATDPDVEGEKISWDVTQYLLPSTRSLRRIEMHEITRYGFKKARESVRFVDFNLVK
AQIVRRVQDRWIGFELSGKLQKRFGRSNLSAGRVQSTVLGWIVEREEEYKKSEKDFTLLVLENGVNLEVEGKIADDVVTV
VELQEAEEEKNPLPPYTTSSALSEISQKLRLGVQEVMDILQDLFEKGFITYHRTDSTRISLEGQNVARTYLRKIGKEDIF
MGRSWSTEGAHEAIRPVKPIDARELEEMIEEGLIADLTKKHLRVYELIFNRFLASQSAAVKVKKQIVTVDVDGKRMGIEQ
IVEILRDGWNLFVPLTVSPRFEHRTYKIKEKKFYKKHTVPLFTQASIVEEMKKRGIGRPSTYAKIVEVLFRRGYVYEDKY
KRVRPTRFGVMVYSYLKERYEKYVTEETTRRLEEIMDKVERGEEDYQATLRLLYEEIKSLMEEG
>P9WF03 3.2.1.40~~~~~~Alpha-L-rhamnosidase~~~
MCVVRTFWFAVLTVIFAVSCSSHSVIENDKPTSLKVGEGFVNPLGYYEASPRFSWKPSISNSKSTQQSAYQIQVSSTPEG
LLTNPDKWDSEKIKSSAMSWVQYKGKKITSREKVFWRVRFWDENNIASQWSDVAHIEMGLLENLDWKASWIGAKDTESSL
SPSQSTLATPQYLRTQFSVEKEVLEARLYVTAKGVFKVYLNGKDITANDALPPGWTPYEKRIETLTYDVTTLITKRDNAL
GAIIAGGWYSGRIADLKETDHSKPPRFLAQLEITYSDNTTRLVTTNDSWKATQSGPIRFASNYDGERYDETYEMQGWSMP
DYDDSEWGTVITDASTPGTLLRPKRHLPVRNVDKLTPLSFKQVSKDTVIFDFGQNMVGVPSIKLPVKQGKQVTLRFAEAL
HKGDFYTDNYRSAHSTDYFLPAKDGIAEYTPTFTFHGFRFVEISGFDETKAPVKNWIVANVQHSDIDLYNNFLSANPKLN
KLFENINWGLKGNFFDIPLDCPQRDERLGWTGDANAFIAPSMYMADVYGFWSAYLNSLREEQTEDGFVPLYVPFVKWINW
TSSGWGDAATILPWELYMMTGDQKILEDSYPSMKSWINYHDSQAKNNISSMMTFGDWLQPYPEAEGKGANRGDTDFSLIS
TAFFARSVALTRKTALELGFNEDAKRYEVKQKTLAKAFRAEFFDEDLNVIKGKETQTAYLLALAFDLLPQSEVNIAQTKL
ISLLQSADTHLRTGFLGTPLLADVLQEAGRTDLVYELLFKETYPSWFYSINNGATTTWERWNSYSLEEGFNPQGMNSLNH
YAYGTISRWFYEGILGVKPQLPGFKKAIISPQLTSKLGFAEGSIPTPSGDIDVSWTMTSDGFDVSVTVPFNISAEFVPPA
HYSVVAATNAKNEPIKKWKGLKAGQYQFQLIIDEKHGGAQ
>Q82PP4 3.2.1.40~~~~~~Alpha-L-rhamnosidase~~~COG3250
MSALRVTSPSVEYVQRPLGLDAAHPRLSWPMASAAPGRRQSAYQVRVASSAAGLSHPDVWDSGKVVSDDSVLVPYAGPPL
KPRTRYFWSVRVWDADGGASEWSAPSWWETGLMGASQWSAKWISAPAPLTEAPSLEGSSWIWFPEGEPANSAPAATRWFR
RTVDLPDDITGATLAISADNVYAVSVDGAEVARTDLEADNEGWRRPAVIDVLDHVHSGNNTLAVSASNASVGPAGWICVL
VLTTASGEKKIFSDASWKSTDHEPADGWREPDFDDSGWPAAKVAAAWGAGPWGRVAPVASAANQLRHEFRLPHKKVSRAR
LYATALGLYEAHLNGRRVGRDQLAPGWTDYRKRVQYQTYDVTSSVRPGANALAAYVAPGWYAGNVGMFGPHQYGERPALL
AQLEVEYADGTSERITSGPDWRAASGPIVSADLLSGETYDARKETAGWTSPGFDDRAWLAVRGADNDVPEQIVAQVDGPV
RIAKELPARKVTEPKPGVFVLDLGQNMVGSVRLRVSGDAGTTVRLRHAEVLNPDGTIYTANLRSAAATDTYTLKGQGEET
YEPRFTFHGFRYVEVTGFPGKPSTTSVTGRVMHTSAPFTFEFETNVPMLNKLHSNITWGQRGNFLSVPTDTPARDERLGW
TGDINVFAPTAAYTMESARFLTKWLVDLRDAQTSDGAFTDVAPAVGNLGNGVAGWGDAGVTVPWALYQAYGDRQVLADAL
PSVHAWLRYLEKHSDGLLRPADGYGDWLNVSDETPKDVIATAYFAHSADLAARMATELGKDAAPYTDLFTRIRKAFQTAY
VASDGKVKGDTQSAYVLTLSMNLVPDALRKAAADRLVALIEAKDWHLSTGFLGTPRLLPVLTDTGHTDVAYRLLHQRTFP
SWGYPIDKGSTTMWERWDSIQPDGGFQTPEMNSFNHYAYGSVGEWMYANIAGIAPGRAGYRQVVIRPRPGGEVTSARATF
ASLHGPVSTRWQQRSGGFVLTCSVPPNTTAEVWIPADHPDRVQHTHGTFVRAEDGCAVFEVGSGSHRFTV
>P32170 5.3.1.14~~~rhaA~~~L-rhamnose isomerase~~~COG4806
MTTQLEQAWELAKQRFAAVGIDVEEALRQLDRLPVSMHCWQGDDVSGFENPEGSLTGGIQATGNYPGKARNASELRADLE
QAMRLIPGPKRLNLHAIYLESDTPVSRDQIKPEHFKNWVEWAKANQLGLDFNPSCFSHPLSADGFTLSHADDSIRQFWID
HCKASRRVSAYFGEQLGTPSVMNIWIPDGMKDITVDRLAPRQRLLAALDEVISEKLNPAHHIDAVESKLFGIGAESYTVG
SNEFYMGYATSRQTALCLDAGHFHPTEVISDKISAAMLYVPQLLLHVSRPVRWDSDHVVLLDDETQAIASEIVRHDLFDR
VHIGLDFFDASINRIAAWVIGTRNMKKALLRALLEPTAELRKLEAAGDYTARLALLEEQKSLPWQAVWEMYCQRHDTPAG
SEWLESVRAYEKEILSRRG
>Q9KCL9 5.3.1.14~~~rhaA~~~L-rhamnose isomerase~~~COG4806
MSMKSQFERAKIEYGQWGIDVEEALERLKQVPISIHCWQGDDVGGFELSKGELSGGIDVTGDYPGKATTPEELRMDLEKA
LSLIPGKHRVNLHAIYAETDGKVVERDQLEPRHFEKWVRWAKRHGLGLDFNPTLFSHEKAKDGLTLAHPDQAIRQFWIDH
CIASRKIGEYFGKELETPCLTNIWIPDGYKDTPSDRLTPRKRLKESLDQIFAAEINEAYNLDAVESKLFGIGSESYVVGS
HEFYLSYALKNDKLCLLDTGHYHPTETVSNKISAMLLFHDKLALHVSRPVRWDSDHVVTFDDELREIALEIVRNDALDRV
LIGLDFFDASINRIAAWTIGTRNVIKALLFAMLIPHKQLKEWQETGDYTRRLAVLEEFKTYPLGAIWNEYCERMNVPIKE
EWLKEIAIYEKEVLLQRH
>P32171 2.7.1.5~~~rhaB~~~L-Rhamnulokinase~~~COG1070
MTFRNCVAVDLGASSGRVMLARYERECRSLTLREIHRFNNGLHSQNGYVTWDVDSLESAIRLGLNKVCEEGIRIDSIGID
TWGVDFVLLDQQGQRVGLPVAYRDSRTNGLMAQAQQQLGKRDIYQRSGIQFLPFNTLYQLRALTEQQPELIPHIAHALLM
PDYFSYRLTGKMNWEYTNATTTQLVNINSDDWDESLLAWSGANKAWFGRPTHPGNVIGHWICPQGNEIPVVAVASHDTAS
AVIASPLNGSRAAYLSSGTWSLMGFESQTPFTNDTALAANITNEGGAEGRYRVLKNIMGLWLLQRVLQEQQINDLPALIS
ATQALPACRFIINPNDDRFINPETMCSEIQAACRETAQPIPESDAELARCIFDSLALLYADVLHELAQLRGEDFSQLHIV
GGGCQNTLLNQLCADACGIRVIAGPVEASTLGNIGIQLMTLDELNNVDDFRQVVSTTANLTTFTPNPDSEIAHYVAQIHS
TRQTKELCA
>Q1R415 2.7.1.5~~~rhaB~~~Rhamnulokinase~~~
MTFRNCVAVDLGASSGRVMLARYERECRSLTLREIHRFNNGLHSQNGYVTWNVDSLESAIRLGLNKVCEEGIRIDSIGID
TWGVDFVLLDQQGQRVGLPVAYRDSRTNGLMAQAQQQLGKRDIYQRSGIQFLPFNTIYQLRALTEQQPELIPHIAHALLI
PDYFSYRLTGKMNWEYTNATTTQLVNINSDDWDESLLAWSGANKAWFGRPTHPGNVIGHWICPQGNEIPVVAVASHDTAS
AVIASPLNGSRAAYLSSGTWSLMGFESQTPFTNDTALAANITNEGGAEGRYRVLKNIMGLWLLQRVLQERQINDLPALIA
ATQALPACRFIINPNDDRFINPDEMCSEIQAACRETAQPIPESDAELARCIFDSLALLYADVLHELAQLRGEDFSQLHIV
GGGCQNTLLNQLCADACGIRVIAGPVEASTLGNIGIQLMTLDELNNVDDFRQVVSTTANLTTFTPNPDSEIAHYVAQIHS
TRQTKELCA
>P32169 4.1.2.19~~~rhaD~~~Rhamnulose-1-phosphate aldolase~~~COG0235
MQNITQSWFVQGMIKATTDAWLKGWDERNGGNLTLRLDDADIAPYHDNFHQQPRYIPLSQPMPLLANTPFIVTGSGKFFR
NVQLDPAANLGIVKVDSDGAGYHILWGLTNEAVPTSELPAHFLSHCERIKATNGKDRVIMHCHATNLIALTYVLENDTAV
FTRQLWEGSTECLVVFPDGVGILPWMVPGTDEIGQATAQEMQKHSLVLWPFHGVFGSGPTLDETFGLIDTAEKSAQVLVK
VYSMGGMKQTISREELIALGKRFGVTPLASALAL
>P32156 5.1.3.32~~~rhaM~~~L-rhamnose mutarotase~~~COG3254
MIRKAFVMQVNPDAHEEYQRRHNPIWPELEAVLKSHGAHNYAIYLDKARNLLFAMVEIESEERWNAVASTDVCQRWWKYM
TDVMPANPDNSPVSSELQEVFYLP
>Q7BSH1 5.1.3.32~~~rhaM~~~L-rhamnose mutarotase~~~
MTLEKHAFKMQLNPGMEAEYRKRHDEIWPELVDLLHQSGASDYSIHLDRETNTLFGVLTRPKDHTMASLPDHPVMKKWWA
HMADIMATNPDNSPVQSDLVTLFHMP
>P09378 ~~~rhaR~~~HTH-type transcriptional activator RhaR~~~COG1917
MAHQLKLLKDDFFASDQQAVAVADRYPQDVFAEHTHDFCELVIVWRGNGLHVLNDRPYRITRGDLFYIHADDKHSYASVN
DLVLQNIIYCPERLKLNLDWQGAIPGFNASAGQPHWRLGSMGMAQARQVIGQLEHESSQHVPFANEMAELLFGQLVMLLN
RHRYTSDSLPPTSSETLLDKLITRLAASLKSPFALDKFCDEASCSERVLRQQFRQQTGMTINQYLRQVRVCHAQYLLQHS
RLLISDISTECGFEDSNYFSVVFTRETGMTPSQWRHLNSQKD
>P09377 ~~~rhaS~~~HTH-type transcriptional activator RhaS~~~COG4977
MTVLHSVDFFPSGNASVAIEPRLPQADFPEHHHDFHEIVIVEHGTGIHVFNGQPYTITGGTVCFVRDHDRHLYEHTDNLC
LTNVLYRSPDRFQFLAGLNQLLPQELDGQYPSHWRVNHSVLQQVRQLVAQMEQQEGENDLPSTASREILFMQLLLLLRKS
SLQENLENSASRLNLLLAWLEDHFADEVNWDAVADQFSLSLRTLHRQLKQQTGLTPQRYLNRLRLMKARHLLRHSEASVT
DIAYRCGFSDSNHFSTLFRREFNWSPRDIRQGRDGFLQ
>P27125 ~~~rhaT~~~L-rhamnose-proton symporter~~~
MSNAITMGIFWHLIGAASAACFYAPFKKVKKWSWETMWSVGGIVSWIILPWAISALLLPNFWAYYSSFSLSTRLPVFLFG
AMWGIGNINYGLTMRYLGMSMGIGIAIGITLIVGTLMTPIINGNFDVLISTEGGRMTLLGVLVALIGVGIVTRAGQLKER
KMGIKAEEFNLKKGLVLAVMCGIFSAGMSFAMNAAKPMHEAAAALGVDPLYVALPSYVVIMGGGAIINLGFCFIRLAKVK
DLSLKADFSLAKSLIIHNVLLSTLGGLMWYLQFFFYAWGHARIPAQYDYISWMLHMSFYVLCGGIVGLVLKEWNNAGRRP
VTVLSLGCVVIIVAANIVGIGMAN
>O31523 3.1.1.-~~~rhgT~~~Rhamnogalacturonan acetylesterase RhgT~~~COG2755
MMKKPIQVFLAGDSTVSDCPPHEAPMAGWGQVFGQLFSEGVLVRNHAKGGASTNSFVEEGRLQAIAEHITQGDYLLIQFG
HNDQKPRGTKPYSTFQQFLTLFADTAREKGAHPVFVTSVQRRRFDENGRIEHTLGEYPDAMKALAKELDVPVIDLLAKTK
VLYEAYGPEESKRLFVWFQPNEHPNYPDGIEDNTHFSEKGAMEVAKLVAEGIEELGLPLKDHLVSREGKEHV
>O31528 3.1.1.-~~~yesY~~~Probable rhamnogalacturonan acetylesterase YesY~~~COG2755
MANHIYLAGDSTVQTYGDSTNQGGWGQFLGSHLPEHIQVINRAIGGRSSKTFVEEGRLQAILDVIEPDDWLFVQMGHNDA
SKNKPERYTEPYTTYKQYLKQYIAGAREKGAHPLLITPVARFHYENGVFLNDFPDYCIAMKQTAAEENVQLIDLMEKSLA
FFTEKGEEKVYTYFMISEGINDYTHFTKKGANEMAKLVAKGIKELGLPLTESIIKER
>Q03313 ~~~rhiA~~~Protein RhiA~~~
MSLHVSYVDKEMTDHARASQPGSAALAQGTQYSLLLKNQSAQPWTFYVYQKMPQPVANVFSLAWFCSPYQIRVGNQIKFT
WELAYNFVWSDTGQLIPGVDFFASGVEDCSPSGRNTTTFSLSDGPGLTAPIKGDPAGSLVINDAGNVPNNRFSVGIGMSG
TGTYVAQAGTNLLHTFTPTPSYWIAAGTNVTIGSVLSIDTITQTREAKFPSAVFNLVGVLQEDNTWDINPA
>Q8RJP2 4.2.2.23~~~rhiE~~~Rhamnogalacturonate lyase~~~
MHMNKPLQAWRTPLLTLIFVLPLTATGAVKLTLDGMNSTLDNGLLKVRFGADGSAKEVWKGGTNLISRLSGAARDPDKNR
SFYLDYYSGGVNEFVPERLEVIKQTPDQVHLAYIDDQNGKLRLEYHLIMTRDVSGLYSYVVAANTGSAPVTVSELRNVYR
FDATRLDTLFNSIRRGTPLLYDELEQLPKVQDETWRLPDGSVYSKYDFAGYQRESRYWGVMGNGYGAWMVPASGEYYSGD
ALKQELLVHQDAIILNYLTGSHFGTPDMVAQPGFEKLYGPWLLYINQGNDRELVADVSRRAEHERASWPYRWLDDARYPR
QRATVSGRLRTEAPHATVVLNSSAENFDIQTTGYLFSARTNRDGRFSLSNVPPGEYRLSAYADGGTQIGLLAQQTVRVEG
KKTRLGQIDARQPAPLAWAIGQADRRADEFRFGDKPRQYRWQTEVPADLTFEIGKSRERKDWYYAQTQPGSWHILFNTRT
PEQPYTLNIAIAAASNNGMTTPASSPQLAVKLNGQLLTTLKYDNDKSIYRGAMQSGRYHEAHIPLPAGALQQGGNRITLE
LLGGMVMYDAITLTETPQ
>Q51559 2.3.1.-~~~rhlA~~~3-(3-hydroxydecanoyloxy)decanoate synthase~~~
MRRESLLVSVCKGLRVHVERVGQDPGRSTVMLVNGAMATTASFARTCKCLAEHFNVVLFDLPFAGQSRQHNPQRGLITKD
DEVEILLALIERFEVNHLVSASWGGISTLLALSRNPRGIRSSVVMAFAPGLNQAMLDYVGRAQALIELDDKSAIGHLLNE
TVGKYLPQRLKASNHQHMASLATGEYEQARFHIDQVLALNDRGYLACLERIQSHVHFINGSWDEYTTAEDARQFRDYLPH
CSFSRVEGTGHFLDLESKLAAVRVHRALLEHLLKQPEPQRAERAAGFHEMAIGYA
>P0A8J8 3.6.4.13~~~rhlB~~~ATP-dependent RNA helicase RhlB~~~COG0513
MSKTHLTEQKFSDFALHPKVVEALEKKGFHNCTPIQALALPLTLAGRDVAGQAQTGTGKTMAFLTSTFHYLLSHPAIADR
KVNQPRALIMAPTRELAVQIHADAEPLAEATGLKLGLAYGGDGYDKQLKVLESGVDILIGTTGRLIDYAKQNHINLGAIQ
VVVLDEADRMYDLGFIKDIRWLFRRMPPANQRLNMLFSATLSYRVRELAFEQMNNAEYIEVEPEQKTGHRIKEELFYPSN
EEKMRLLQTLIEEEWPDRAIIFANTKHRCEEIWGHLAADGHRVGLLTGDVAQKKRLRILDEFTRGDLDILVATDVAARGL
HIPAVTHVFNYDLPDDCEDYVHRIGRTGRAGASGHSISLACEEYALNLPAIETYIGHSIPVSKYNPDALMTDLPKPLRLT
RPRTGNGPRRTGAPRNRRRSG
>P25888 3.6.4.13~~~rhlE~~~ATP-dependent RNA helicase RhlE~~~COG0513
MSFDSLGLSPDILRAVAEQGYREPTPIQQQAIPAVLEGRDLMASAQTGTGKTAGFTLPLLQHLITRQPHAKGRRPVRALI
LTPTRELAAQIGENVRDYSKYLNIRSLVVFGGVSINPQMMKLRGGVDVLVATPGRLLDLEHQNAVKLDQVEILVLDEADR
MLDMGFIHDIRRVLTKLPAKRQNLLFSATFSDDIKALAEKLLHNPLEIEVARRNTASDQVTQHVHFVDKKRKRELLSHMI
GKGNWQQVLVFTRTKHGANHLAEQLNKDGIRSAAIHGNKSQGARTRALADFKSGDIRVLVATDIAARGLDIEELPHVVNY
ELPNVPEDYVHRIGRTGRAAATGEALSLVCVDEHKLLRDIEKLLKKEIPRIAIPGYEPDPSIKAEPIQNGRQQRGGGGRG
QGGGRGQQQPRRGEGGAKSASAKPAEKPSRRLGDAKPAGEQQRRRRPRKPAAAQ
>Q9RPT1 1.1.1.100~~~rhlG~~~Rhamnolipids biosynthesis 3-oxoacyl-[acyl-carrier-protein] reductase~~~
MHPYFSLAGRIALVTGGSRGIGQMIAQGLLEAGARVFICARDAEACADTATRLSAYGDCQAIPADLSSEAGARRLAQALG
ELSARLDILVNNAGTSWGAALESYPVSGWEKVMQLNVTSVFSCIQQLLPLLRRSASAENPARVINIGSVAGISAMGEQAY
AYGPSKAALHQLSRMLAKELVGEHINVNVIAPGRFPSRMTRHIANDPQALEADSASIPMGRWGRPEEMAALAISLAGTAG
AYMTGNVIPIDGGFHL
>P54292 ~~~rhlR~~~HTH-type quorum-sensing regulator RhlR~~~
MRNDGGFLLWWDGLRSEMQPIHDSQGVFAVLEKEVRRLGFDYYAYGVRHTIPFTRPKTEVHGTYPKAWLERYQMQNYGAV
DPAILNGLRSSEMVVWSDSLFDQSRMLWNEARDWGLCVGATLPIRAPNNLLSVLSVARDQQNISSFEREEIRLRLRCMIE
LLTQKLTDLEHPMLMSNPVCLSHREREILQWTADGKSSGEIAIILSISESTVNFHHKNIQKKFDAPNKTLAAAYAAALGL
I
>P76469 4.1.2.53~~~rhmA~~~2-keto-3-deoxy-L-rhamnonate aldolase~~~COG3836
MNALLSNPFKERLRKGEVQIGLWLSSTTAYMAEIAATSGYDWLLIDGEHAPNTIQDLYHQLQAVAPYASQPVIRPVEGSK
PLIKQVLDIGAQTLLIPMVDTAEQARQVVSATRYPPYGERGVGASVARAARWGRIENYMAQVNDSLCLLVQVESKTALDN
LDEILDVEGIDGVFIGPADLSASLGYPDNAGHPEVQRIIETSIRRIRAAGKAAGFLAVAPDMAQQCLAWGANFVAVGVDT
MLYSDALDQRLAMFKSGKNGPRIKGSY
>P77215 4.2.1.90~~~rhmD~~~L-rhamnonate dehydratase~~~COG4948
MTLPKIKQVRAWFTGGATAEKGAGGGDYHDQGANHWIDDHIATPMSKYRDYEQSRQSFGINVLGTLVVEVEAENGQTGFA
VSTAGEMGCFIVEKHLNRFIEGKCVSDIKLIHDQMLSATLYYSGSGGLVMNTISCVDLALWDLFGKVVGLPVYKLLGGAV
RDEIQFYATGARPDLAKEMGFIGGKMPTHWGPHDGDAGIRKDAAMVADMREKCGEDFWLMLDCWMSQDVNYATKLAHACA
PYNLKWIEECLPPQQYESYRELKRNAPVGMMVTSGEHHGTLQSFRTLSETGIDIMQPDVGWCGGLTTLVEIAAIAKSRGQ
LVVPHGSSVYSHHAVITFTNTPFSEFLMTSPDCSTMRPQFDPILLNEPVPVNGRIHKSVLDKPGFGVELNRDCNLKRPYS
H
>Q12DF1 4.2.1.90~~~rhmD~~~L-rhamnonate dehydratase~~~COG4948
MNNMPTIKHVRAFTVRGGGADYHDQGSGHWIDDHISTPMGRYPEYRQSRQSFGINVLGTLVVEIEASDGTVGFSVTTGGE
LGCWIVEKHLARFIEGAKVTDIEKIWDQMFNATLYYGRKGIVLNTISGVDLALWDLLAKVRKEPVHALLGGPVRDELTFY
ATGARPDLAKKMGFIGGKLPLHHGPAEREEGLKKNLELLGEMRQRVGDDFWLMYDCWMSLDVEYATRLANAASEYKLKWI
EEALPPDDYWGYAELRRNVPRGMLVTTGEHEATRWGFRMLLEMECCDILQPDVGWCGGITELLKISALADAHGKLVVPHG
SSVYSYHFVITRHNSPFSEFLMMAPKADEVVPMFNPMLLDEPVPVNGRMKASALDAPGFGVRLNPECALQRPFPR
>Q8ZNF9 4.2.1.90~~~rhmD~~~L-rhamnonate dehydratase~~~
MENIMTLPKIKHVRAWFIGGATAEKGAGGGDYHDQGGNHWIDDHIATPMSKYRDYEQSRQSFGINVLGTLIVEVEAENRQ
TGFAVSTAGEMGCFIVEKHLNRFIEGKCVSDIKLIHDQMLGATMYYSGSGGLVMNTISCVDLALWDLFGKVVGLPVYKLL
GGAVRDEIQFYATGARPDLAKEMGFIGGKMPTHWGPHDGDAGIRKDAAMVADMREKCGPDFWLMLDCWMSQDVNYATKLA
HACAPFNLKWIEECLPPQQYEGYRELKRNAPAGMMVTSGEHHGTLQSFRTLAETGIDIMQPDVGWCGGLTTLVEIAALAK
SRGQLVVPHGSSVYSHHAVITFTNTPFSEFLMTSPDCSTLRPQFDPILLDEPVPVNGRIHKSVLDKPGFGVELNRDCHLK
RPYSH
>P77732 ~~~rhmR~~~Uncharacterized HTH-type transcriptional regulator RhmR~~~COG1414
MLESSKVPALTRAIDILNLIARIGPCSAATIIDTLGIPKSTAYLLLNELRRQRFLSLDHQENFCLWTRLVELSGHALSKM
DLRELARPRLTQLMDTTGLLCHLGIIDNGSAYYILKVESSATISVRSHEGKSLSLYRSGIGKCLLAWQPAAVQQSIIEGL
VWEQATPTTITHPQQLHEELARIRRQGWSYDNGEDYADVRCVAAPVFNANNELTAAISVVGTRLQINEEYRDYLAGKAIA
CARDISRLLGWKSPFDLQAS
>P76470 ~~~rhmT~~~Inner membrane transport protein RhmT~~~COG2271
MSTALLDAVVKKNRVRLIPFMLALYVLAFLDRSNIGFAKQTYQIDTGLSNEAYALGAGIFFVVYAFLGVPANLLMRKLGA
RTWIGTTTLLWGFLSAAMAWADTEAKFLIVRTLLRAAEAGFFPGMIYLTSQWFPQRNRASIMGLFYMGAPLALTLGSPLS
GALLEMHGFMGHPGWFWMFVIEGLLAVGAGVFTFFWLDDTPEQARFLSKQEKTLLINQLASEEQQKVTSRLSDALRNGRV
WQLAIIYLTIQVAVYGLIFFLPTQVAALLGTKVGFTASVVTAIPWVAALFGTWLIPRYSDKTGERRNVAALTLLAAGIGI
GLSGLLSPVMAIVALCVAAIGFIAVQPVFWTMPTQLLSGTALAAGIGFVNLFGAVGGFIAPILRVKAETLFASDAAGLLT
LAAVAVIGSLIIFTLRVNRTVAQTDVAHH
>D3RPB9 2.8.1.-~~~~~~Sulfurtransferase Alvin_2599~~~COG0607
MVNEIDSESLSQRLADTEDVLLVDIRTPAEIAQGMIPDALQLPMHLIPIRMSEIPKDRDVVIYCRSGARSYQACAYLMQQ
GYGRVLNLRGGIIAWARHGLPIVAPEG
>P0AG30 3.6.4.-~~~rho~~~Transcription termination factor Rho~~~COG1158
MNLTELKNTPVSELITLGENMGLENLARMRKQDIIFAILKQHAKSGEDIFGDGVLEILQDGFGFLRSADSSYLAGPDDIY
VSPSQIRRFNLRTGDTISGKIRPPKEGERYFALLKVNEVNFDKPENARNKILFENLTPLHANSRLRMERGNGSTEDLTAR
VLDLASPIGRGQRGLIVAPPKAGKTMLLQNIAQSIAYNHPDCVLMVLLIDERPEEVTEMQRLVKGEVVASTFDEPASRHV
QVAEMVIEKAKRLVEHKKDVIILLDSITRLARAYNTVVPASGKVLTGGVDANALHRPKRFFGAARNVEEGGSLTIIATAL
IDTGSKMDEVIYEEFKGTGNMELHLSRKIAEKRVFPAIDYNRSGTRKEELLTTQEELQKMWILRKIIHPMGEIDAMEFLI
NKLAMTKTNDDFFEMMKRS
>P52154 3.6.4.-~~~rho~~~Transcription termination factor Rho~~~
MTESTEQTTPTNGGGLASLKLAQLQALASQLGIAGGSRMRKADLVTAISDHQRGGSVADRDAAERAAQAPAAPAAETAPA
AASSEDAAPAAERPARRRSRRADADTSAPAAAQDGQPQAEAREAQTEQAPRETASDQDRSGGSEARDEGEDRPQSERRSR
GRRRAGDDDAQQGQDRRSDGAQGEDGADADRRGDREDRDDNGRENGRGRNGRNGRDRDNGRDRENGRENSRDRENGRDGS
REQRGDKSEDGGRGDGGRGDRSRRDDRDDEGGRNRRNRRNRNERGRNRRGRGGPEVDETELTEDDVLQPVAGILDVLDNY
AFVRTSGYLPGPNDVYVSLAMVKKYGLRKGDAVVGPIAPRDGEKQQHHGGGSNRQKFNALVKISSVNGQPAVEHPQRVEF
GKLVPLYPQERLRLETDPKLIGPRVIDLVSPIGKGQRGLIVSPPKAGKTMILQSIANAIKTNNPEVHLMMVLVDERPEEV
TDMQRSVDGEVIASTFDRPADDHTTLAELAIERAKRLVEMGRDVVVLLDSMTRLGRAYNLAAPASGRILSGGVDSSALYP
PKKFFGAARNIENGGSLTILATALVETGSRMDEVIFEEFKGTGNMELRLSRHLAERRIFPAVDVNASGTRREEALLSQEE
VKIMWKLRRVLSGLEQQQALDLLTNKIKDTASNAEFLMLVSKTTLGSKGDD
>P9WHF3 3.6.4.-~~~rho~~~Transcription termination factor Rho~~~COG1158
MTDTDLITAGESTDGKPSDAAATDPPDLNADEPAGSLATMVLPELRALANRAGVKGTSGMRKNELIAAIEEIRRQANGAP
AVDRSAQEHDKGDRPPSSEAPATQGEQTPTEQIDSQSQQVRPERRSATREAGPSGSGERAGTAADDTDNRQGGQQDAKTE
ERGTDAGGDQGGDQQASGGQQARGDEDGEARQGRRGRRFRDRRRRGERSGDGAEAELREDDVVQPVAGILDVLDNYAFVR
TSGYLPGPHDVYVSMNMVRKNGMRRGDAVTGAVRVPKEGEQPNQRQKFNPLVRLDSINGGSVEDAKKRPEFGKLTPLYPN
QRLRLETSTERLTTRVIDLIMPIGKGQRALIVSPPKAGKTTILQDIANAITRNNPECHLMVVLVDERPEEVTDMQRSVKG
EVIASTFDRPPSDHTSVAELAIERAKRLVEQGKDVVVLLDSITRLGRAYNNASPASGRILSGGVDSTALYPPKRFLGAAR
NIEEGGSLTIIATAMVETGSTGDTVIFEEFKGTGNAELKLDRKIAERRVFPAVDVNPSGTRKDELLLSPDEFAIVHKLRR
VLSGLDSHQAIDLLMSQLRKTKNNYEFLVQVSKTTPGSMDSD
>P52157 3.6.4.-~~~rho~~~Transcription termination factor Rho~~~
MSDTTDLMGARVEETAAAPATDASAPATGAGSRRRRGTGLEGMVLAELQQVASGLGIRGTARMRKSQLIEVIKEAQAAGG
APAKAAPAAADTAGETKPKRRSTSRTRTGDEAPAEKAEKAGKADKKADKAAADKAAAQQQIEIPGQPTPKVNASAEQAAP
ADDAPSERRRRRATSDAGSPSATDTTVAVETRAEPKADTSAPQQSQGHQQGQGDARSDAEGGDGRRRDRRDRGDRDRGDR
GDRGDRGDRGDRGERGRDRRNKGDDQQNQGGGRQDRQQQGGGGRQDRQQHDDGYDDDGSGRRGRRGRYRDRRGRRGRDEI
QEPQINEDDVLIPVAGILDILDNYAFIRTSGYLPGPNDVYVSLAQVRKNGLRKGDHLTGAVRQPKEGERREKFNALVRLD
SVNGMAPEHGRGRPEFNKLTPLYPQDRLRLETDPGVLTTRIIDLVAPIGKGQRGLIVAPPKTGKTMIMQAIANAITHNNP
ECHLMVVLVDERPEEVTDMQRSVKGEVISSTFDRPAEDHTTVAELAIERAKRLVELGHDVVVLLDSITRLGRAYNLAAPA
SGRILSGGVDSTALYPPKRFFGAARNIEDGGSLTILATALVDTGSRMDEVIFEEFKGTGNAELKLDRKLADKRIFPAVDV
DASGTRKEEILLGSDELAITWKLRRVLHALDQQQAIELLLDKMKQTKSNAEFLIQIQKTTPTPGNGD
>P38527 3.6.4.-~~~rho~~~Transcription termination factor Rho~~~COG1158
MSEEQKTISISELESMNIKQLYEIAKSLGIPRYTSMRKRDLIFAILKAQTESTGYFFGEGVLEIHPEGFGFLRRIEDNLL
PSNDDIYISPSQIRKFNLNTGDIISGVIRKPKEGEKYFAMIKIEAINYRPVEAVNDRVNFDNLTPDYPRERFILETDPKI
YSTRLIDLFAPIGKGQRGMIVAPPKAGKTTILKEIANGIAENHPDTIRIILLIDERPEEVTDIRESTNAIVIAAPFDMPP
DKQVKVAELTLEMAKRLVEFNYDVVILLDSLTRLARVYNIVVPPSGKLLTGGVDPAALYKPKRFFGAARNTREGGSLTII
ATALVETGSKMDEVIFEEFKGTGNMELVLSRQLANKRIFPAINLLLSGTRREELLLDEETLKKVWLLRRMLSAMTEEEGL
TLILNKLSETSSNEEFLKLIDKEKARY
>B9XXL6 3.6.4.13~~~rhpA~~~DEAD-box ATP-dependent RNA helicase RhpA~~~
MELNQPPLPTEIDDDAYHKPSFNDLGLKESVLKSVYEAGFTSPSPIQEKAIPAVLQGRDVIAQAQTGTGKTAAFALPIIN
NLKNNHTIEALVITPTRELAMQISDEIFKLGKHTRTKTVCVYGGQSVKKQCEFIKKNPQVMIATPGRLLDHLKNERIHKF
VPKVVVLDESDEMLDMGFLDDIEEIFDYLPSEAQILLFSATMPEPIKRLADKILENPIKIHIAPSNITNTDITQRFYVIN
EHERAEAIMRLLDTQAPKKSIVFTRTKKEADELHQFLASKNYKSTALHGDMDQRDRRASIMAFKKNDADVLVATDVASRG
LDISGVSHVFNYHLPLNTESYIHRIGRTGRAGKKGMAITLVTPLEYKELLRMQKEIDSEIELFEIPTINENQIIKTLHDA
KVSEGIISLYEQLTEIFEPSQLVLKLLSLQFETSKIGLNQQEIDAIQNPKEKTPKPSHKKTPQHERARSFKKGQHRDRHP
KTNHHSKKPKRR
>O25029 3.6.4.13~~~rhpA~~~DEAD-box ATP-dependent RNA helicase RhpA~~~COG0513
MELNQPPLPTEIDGDAYHKPSFNDLGLKESVLKSVYEAGFTSPSPIQEKAIPAVLQGRDVIAQAQTGTGKTAAFALPIIN
NLKNNHTIEALVITPTRELAMQISDEIFKLGKHTRTKTVCVYGGQSVKKQCEFIKKNPQVMIATPGRLLDHLKNERIHKF
VPKVVVLDESDEMLDMGFLDDIEEIFDYLPSEAQILLFSATMPEPIKRLADKILENPIKIHIAPSNITNTDITQRFYVIN
EHERAEAIMRLLDTQAPKKSIVFTRTKKEADELHQFLASKNYKSTALHGDMDQRDRRSSIMAFKKNDADVLVATDVASRG
LDISGVSHVFNYHLPLNTESYIHRIGRTGRAGKKGMAITLVTPLEYKELLRMQKEIDSEIELFEIPTINENQIIKTLHDA
KVSEGIISLYEQLTEIFEPSQLVLKLLSLQFETSKIGLNQQEIDAIQNPKEKTPKPSNKKTPQHERARSFKKGQHRDRHP
KTNHYSKKPKRR
>E0SAK8 3.1.-.-~~~rhsA~~~Probable deoxyribonuclease RhsA~~~COG3209
MLNDILSRVARVGAMHAGNRPNPPDDRPQPCRGKPPTSPGKTIKHKSFLGALAGAVAGALVAAAVAAAAVFLVGVTGGLA
VAAVGALAVFAAGDLISAVTNKVSAMVDSASPAFGPVASGSGNVFVEKQPVARATKDTVACTKHNSPQLIAQGSESVFVN
DAPAARIDDKTVCGATVKEGASTVFFGSGQGTYLDIADEFSWWEKALLIAVEFLVPPSRGMLKGLGKLFIRGPKAVLRGS
RAGAKWIAGRLADKSSCASKAFKASSGLTRAKAAVKAFLKDPVYIASGEVIESRTDIELGQTLPLAFERTYRSASVHIGL
LGRGWHDSWSEVATVTRDGLNTHVVITLAQGYDIDFTFHQDVQAVYCPHYPEFTLHRRGDGFSLWHRDQQTWRDFSVVQG
ERRLLSAIHDSHDNRIELVRDPKGYLRQVRHSDGVTLLLVWQGEFLHQIQRIDGGQKTLLAEYRQDEQGRLVEANATQAY
HLYYDYDAAHRLTRWHDNDQTWARYEYDAQGRCVYTTCADGFLTARFDYLPDRVVMTDGLGQCSEFGFNDLFLMSWEKSP
LGHVTRYEYDDYGNLLREISPAGRVVEFTYLDDTGRVSTFTDASGHQWQYDYDAAQRLCGVTDPLGREWGWMYDAEGNPE
RLTGPDASEVRFTWNRYGLLTQVSDAAGEVQARLQYDHRQRLLSATDAESRTRQLRYDGQDRVVQWQRADGARFRLGYRR
ASWTLPEQLIRPDDKEEQRQYDRHNNLLSYVDGNGALWRQTFGPFDLLTARTDAEGRTWHYAYDKESQQLTTVIAPDGSH
WQWWLDADGRVIRERDMTGTETHYDYDEDGLCIRVRNGEGDTRHFLYDARGLLLRETAPDDTLHYRYDAVGRLTEVSSST
AHVQLEYDLRDRMVREWHNGTLLTRQVDDAARTVTRTLTWDGDADDTINALAPLTSLFHYTRTGELRQVQLPDGADLTLT
HDAAGRESHRTGGSGFVQQREYDVMGWLTREMSGAQHDGHLLATQTREYRYDGAGNLTGVRHNRDAEGYRLDATGRVQEI
LSGGAGKPVDTTARFLYTRTGLPQEAGRLTEWQAGRLVQHDDTHYQYDRAGRLIRKQVVQPGYRPQVWQYRWDSRNQLRV
VDTPNGERWLYRYDPFGRRVGKRCDQKAEEIRYLWDGDQIAEIRHYRHGQLIQRRHWVYNGWELVVQQRQHTGGDWETDF
VTSSQNGTPQALFTPDGTLRWQAPKATLWGQRQAEKSESPDPGLAFAGQLRDSESGLCYNRFRYYDPAGGCYVSPDPIGI
AGGESNYGYVSNPMCWVDPFGLAKCPTLAHGANGEILSAKATVSKAELRTGSGTNQSSRDYARSLGNQTDDAGHILGNVL
GGQGGKGNVFPQLPAINRGQYRDFEKVVKDYIGQHGSVDIEWAFKYGNGGTRPTEIYYDVYQNGQKVFGRIFNN
>E0SIS2 3.1.-.-~~~rhsB~~~Probable deoxyribonuclease RhsB~~~COG3209
MLNDILSRVARVGAMHAGNRPNPPADRPQPCQGKPPTSPGKTIKHKSFLGALAGAVAGALVAAAVAAAAVFLVGVTGGLA
VAAVGALAVFAAGDLISAVTNKVSAVVDSASPAFGPVASGSGNVFVEKQPVARATKDTVACTKHNSPQLIAQGSESVFVN
DAPAARIDDKTVCGATLKEGASTVFFGSGQGTYLEIADEFSWWEKALLIAVEFLVPPSRGMLKGLGKLFTRNGLKSVLKG
AKAGALFITKVPGKMGCAARAFKANKGMARFKEAAKAFKKDPVYLASGEVIESRTDIELGQTLPLVFERTYRSASAHTGL
LGRGWHDSWSEVATVTHDGLNTHVVITLAQGYDIDFTFHQDVQAVYCPHYPEFTLHRRGDGFSLWHRDQQTWRDFSVVQG
ERRLLSAIHDSHDNRIELVRDPKGYLRQLRHSDGVTLLLVWQGEYLHQIQRIDGGQKTLLAEYRQDEQGRLVEANATHAY
HLYYEYNTAHRLTRWHDNDQTWARYEYDAQGRCVYTTCADGFLTARFDYLPDRVVMTDGLGQRSEFGFNDLHLMSWEQSP
LGHITRYEYDEVGNLLREISPAGRVVEFTYLDDTGRVSTFTDGSGHQWQYDYDDAQRLCGVTDPLGREWGWVYDAEGNPE
RLTGPDASEVRFTWNRYGLLTQVSDAAGEVQARLQYDHRQRLLSATDAESRTRQLRYDRQDRVVQWQRADGARFRLGYRR
ASWTLPEQLIRPDDKEEQRQYDRHNNLLSYVDGNGALWRQTFGPFDLLTARTDAEGRTWRYEYDRESQQLIAVTAPDGSR
WQWWLDADGRVIRERDMTGTETHYGYDEDGLCIRVRNGEGDTRHFLYDARGLLLRETAPDDTLHYRYDAAGRLTEVSSAT
AHVQLDYDLRDRVVREWHNGTLLTRQYDDAARTVTRTLTWDGDADDTTGTLAPLTSLFHYTRTGELRQVQLPDGADLTLT
HDAAGRESLRTGGSGFVQQREYDVMGWLTREQSGAQHDGRLQPAQTREYRYDGAGNLTGVRHNRDAEGYRLDATGRVQEM
LSGGAGKPVDTTARFHYTRTGLPQEAGRLTEWQAGRLVQHDDTHYQYDRAGRLIRKQVVQPGYRPQVWQYRWDSRNQLRV
VDTPNGERWLYRYDPFGRRVGKRCDQKAEETRYLWDGDQIAEIRHYRHGQLIQRRHWVYNGWELVVQQRQHTGGDWETDF
VTSSQNGTPQALFTPDGTLRWQVPKATLWGQRQTEKSESPDPGLAFAGQLRDSESGLCYNRFRYYDPAGGCYVSPDPIGI
AGGESNYGYVQNPNTRVDPLGLAGCAMGEILADADKWSLAKIGDRQKGMIKDKLSTVKERSKALNTKMREHFNANEQKII
SEWEKQTGMNWPTLSSGSRATPHHVIPIKNGGSNEWWNIIPVQHPHTGTIHGTGSALRTHLPYQKDGGKLWNLLGY
>E0SGL7 3.1.-.-~~~rhsC~~~Putative deoxyribonuclease RhsC~~~COG3209
MSQDKKATLLSSEDAANQNFATDNQISGGCAKCGCEVLIHYHYDSGKPVPNAPFILIDSNKTEIHGKTDANGLCKIYDMG
CGTFELMLDEGSDDFKPRETVENNPVLQSNPAYATLAGEYFTLFLLLRKQGLVTYDADDSSDRHVDVDGAGIFTSIPKEY
RKSYDRFWELDKRINRGSRQLKQAINKIHHSLAAEVADKGGDDNAALMLFCEIALGFIPVVGQAMDVYSIGEWSWQSYQE
PARLEDPLHIAEGALCAIGVIPGLGDALKVSGRAIIRALKVGTPKELQFAIRTIRSLSDGNLVKGLTKLRAELRNYGAQA
KALLLKIHAALKQVLAESKLKNNWIVSLMKDSFSAMITSLEKLIAKYDSALAYIESKFNEFIGKVITRVSGSARPKGSIA
KAAEAPKAPAHAESQAAKPATPKSDNSITKADDGKPTNAVSEKKRRTSSKETEDISAGSNKGEQPDHAPQQKKAGEEDNA
CKAGSDKCQGEGEPVDMATGYVVDWRTDVELTGLLPLSMKRYYRSGGERKPGLLGALWRTNWDMSLELENGIATLTDGEL
NQAVFVLPDEGAFSRAPSSPQWRLARQQGQLVLQHVDGLRYRFEHALGLQLCLSAIEDRAGNRIALLWDRAELCWIVLPD
GRLVHVETQQRRIVKLTLCDEHRQPLKTLSSYQYDAQGHLLSVRAGEGRNFDYQYSPQGWLLRWSDLAHTWVEHEYDAAG
RALRDRTAEGFWPGAFRYDDDAFTSHYHSGFGGVTTYVRDARNNILLRREPDGGEHRFEWVDNQLAAEIDPLGGRTEYQR
NDWGQVTAVTLPDGSVHRYDYDDDGQLLAYTDPLGNAWHYSRNAQGLVETASDPEGRSWHHAYTTQGLLSAVNGPDGCEQ
RYHYNRRGLLERLEPGEAPAVTFFYDGHDRLTARHIAHEQGVQVRRWDYEGGRNTPSKVVYEDGSETRFGYDGEGNLTSV
TDALGQRYLFRYGAFDNLLETTDPLGATVRYHYNAESEFAGVTNSQGQMWQYRFDTAGRLSEERHYDGRVYRYDYDVAGQ
LTRRTAPDGSRLEYGYDVGGRLSEIQACRADGASEGATTFTYDLSGRLLKAASPDAVVEYAYNRAGQVVSERVNGEEVRT
GYDDGGQRSVVEGLLSSLSLGWQGGRLTTLSIGSHQPLTFSHTASGYEQRRSNGEGFALRHEWSATGLLSGQALDGVNGV
LERRYQYDVLDCLTGITDSHWGEQAFRLNGAGQVTAERREQGRQRQARLFGYDSEQNLCEVSAIAPDGAGRLSAKNAAVQ
LSSGYDEAGRVTQRGGRQYQYDACGRLVSRRESRPGFRPQETRFEWDAQDRLVRVSLPDGARWRYGYDAFGRRVSKVREG
QVPSAQAVARVAYRWDGDQLSGQTQYRVDGSVTRAVQWVYEPGSFRPLAQVEEQAGQTRLHYIVADLTGTARELCSETGE
VHWRGEQGLWGPHREEKIPIPLRRYLGDAANEEVYCELRYQGQVYDAETGLYYNRHRYYDPELGQYISADPIGLAGGLRP
QGYVHNPMEWVDPFGLVGCPLKDSPLGKNGVELERTVSKKGNVKVDTLFENSNDAKNWAAEKLGPGKTRMYDSNGKWIGW
QNKTGDSVYWGHNDWGKGVGKSTYPHLNININGEKGHLFLRDKIINRGQWDDFSNAFK
>P16918 ~~~rhsC~~~Protein RhsC~~~COG3209
MSGKPAARQGDMTQYGGSIVQGSAGVRIGAPTGVACSVCPGGVTSGHPVNPLLGAKVLPGETDIALPGPLPFILSRTYSS
YRTKTPAPVGSLGPGWKMPADIRLQLRDNTLILSDNGGRSLYFEHLFPGEDGYSRSESLWLVRGGVAKLDEGHRLAALWQ
ALPEELRLSPHRYLATNSPQGPWWLLGWCERVPEADEVLPAPLPPYRVLTGLVDRFGRTQTFHREAAGEFSGEITGVTDG
AGRHFRLVLTTQAQRAEEARQQAISGGTEPSAFPDTLPGYTEYGRDNGIRLSAVWLTHDPEYPENLPAAPLVRYGWTPRG
ELAAVYDRSNTQVRSFTYDDKYRGRMVAHRHTGRPEICYRYDSDGRVTEQLNPAGLSYTYQYEKDRITITDSLNRREVLH
TQGEGGLKRVVKKEHADGSVTQSQFDAVGRLRAQTDAAGRTTEYSPDVVTGLITRITTPDGRASAFYYNHHSQLTSATGP
DGLEIRREYDEWGRLIQETAPDGDITRYRYDNPHSDLPCATEDATGSRKTMTWSRYGQLLSFTDCSGYVTRYDHDRFGQV
TAVHREEGLSQYRAYDSRGQLIAVKDTQGHETRYEYNAAGDLTTVIAPDGSRNGTQYDAWGKAICTTQGGLTRSMEYDAA
GRVIRLTSENGSHTTFRYDVLDRLIQETGFDGRTQRYHHDLTGKLIRSEDEGLVTHWHYDEADRLTHRTVNGETAERWQY
DERGWLTDISHISEGHRVTVHYGYDSKGRLASEHLTVHHPQTNELLWQHETRHAYNAQGLANRCIPDSLPAVEWLTYGSG
WLSGMKLGDTPLVEYTRDRLHRETLRSFGRYELTTAYTPAGQLQSQHLNSLLSDRDYTWNDNGELIRISSPRQTRSYSYS
TTGRLTGVHTTAANLDIRIPYTTDPAGNRLPDPELHPDSALSMWPDNRIARDAHYLYRYDRHGRLTEKTDLIPEGVIRTD
DERTHRYHYDSQHRLVHYTRTQYAEPLVESRYLYDPLGRRVAKRVWRRERDLTGWMSLSRKPQVTWYGWDGDRLTTIQND
RTRIQTIYQPGSFTPLIRVETATGELAKTQRRSLADTLQQSGGEDGGSVVFPPVLVQMLDRLESEILADRVSEESRRWLA
SCGLTVAQMQSQMDPVYTPARKIHLYHCDHRGLPLALISTEGTTAWYAEYDEWGNLLNEENPHQLQQLIRLPGQQYDEES
GLYYNRHRYYDPLQGRYITQDPIGLKGGWNFYQYPLNPISNIDPLGLETLKCIKPLHSMGGTGERSGPDIWGNPFYHQYL
CVPDGKGDYTCGGQDQRGESKGDGLWGPGKASNDTKEAAGRCDLVETDNSCVENCLKGKFKEVRPRYSVLPDIFTPINLG
LFKNCQDWSNDSLETCKMKCSGNNIGRFIRFVFTGVM
>P16919 ~~~rhsD~~~Protein RhsD~~~COG3209
MSGKPAARQGDMTQYGGPIVQGSAGVRIGAPTGVACSVCPGGMTSGNPVNPLLGAKVLPGETDLALPGPLPFILSRTYSS
YRTKTPAPVGVFGPGWKAPSDIRLQLRDDGLILNDNGGRSIHFEPLLPGEAVYSRSESMWLVRGGKAAQPDGHTLARLWG
ALPPDIRLSPHLYLATNSAQGPWWILGWSERVPGAEDVLPAPLPPYRVLTGMADRFGRTLTYRREAAGDLAGEITGVTDG
AGREFRLVLTTQAQRAEEARTSSLSSSDSSRPLSASAFPDTLPGTEYGPDRGIRLSAVWLMHDPAYPESLPAAPLVRYTY
TEAGELLAVYDRSNTQVRAFTYDAQHPGRMVAHRYAGRPEMRYRYDDTGRVVEQLNPAGLSYRYLYEQDRITVTDSLNRR
EVLHTEGGAGLKRVVKKELADGSVTRSGYDAAGRLTAQTDAAGRRTEYGLNVVSGDITDITTPDGRETKFYYNDGNQLTA
VVSPDGLESRREYDEPGRLVSETSRSGETVRYRYDDAHSELPATTTDATGSTRQMTWSRYGQLLAFTDCSGYQTRYEYDR
FGQMTAVHREEGISLYRRYDNRGRLTSVKDAQGRETRYEYNAAGDLTAVITPDGNRSETQYDAWGKAVSTTQGGLTRSME
YDAAGRVISLTNENGSHSVFSYDALDRLVQQGGFDGRTQRYHYDLTGKLTQSEDEGLVILWYYDESDRITHRTVNGEPAE
QWQYDGHGWLTDISHLSEGHRVAVHYGYDDKGRLTGECQTVENPETGELLWQHETKHAYNEQGLANRVTPDSLPPVEWLT
YGSGYLAGMKLGGTPLVEYTRDRLHRETVRSFGSMAGSNAAYELTSTYTPAGQLQSQHLNSLVYDRDYGWSDNGDLVRIS
GPRQTREYGYSATGRLESVRTLAPDLDIRIPYATDPAGNRLPDPELHPDSTLTVWPDNRIAEDAHYVYRHDEYGRLTEKT
DRIPAGVIRTDDERTHHYHYDSQHRLVFYTRIQHGEPLVESRYLYDPLGRRMAKRVWRRERDLTGWMSLSRKPEVTWYGW
DGDRLTTVQTDTTRIQTVYEPGSFTPLIRVETENGEREKAQRRSLAETLQQEGSENGHGVVFPAELVRLLDRLEEEIRAD
RVSSESRAWLAQCGLTVEQLARQVEPEYTPARKAHLYHCDHRGLPLALISEDGNTAWSAEYDEWGNQLNEENPHHVYQPY
RLPGQQHDEESGLYYNRHRYYDPLQGRYITQDPMGLKGGWNLYQYPLNPLQQIDPMGLLQTWDDARSGACTGGVCGVLSR
IIGPSKFDSTADAALDALKETQNRSLCNDMEYSGIVCKDTNGKYFASKAETDNLRKESYPLKRKCPTGTDRVAAYHTHGA
DSHGDYVDEFFSSSDKNLVRSKDNNLEAFYLATPDGRFEALNNKGEYIFIRNSVPGLSSVCIPYHD
>E0SAK7 ~~~rhsIA~~~Immunity protein RhsIA~~~
MTELSKAEKVLKEFIIQMNQWELKYYPLFRNEGMTAYKDAAKKELDDIYDLFCTKKERKQGRQISLSCGEPPEYSPDEEV
LSSELNKNKCVFITQQYTEAKNKFRYTLQFKEDEWRIDKKERFSFYDDKWIKYNL
>E0SIS1 ~~~rhsIB~~~Immunity protein RhsIB~~~
MNIENAYDQLNAWINTSNGSYIDIGGERYSFSRLKTITKDELSNFESDNNLKLPNDYKSFLINVGCVNIFVGEKTAGIEI
IPPTDIRNFSKSVFYNFGDDLYPRLLLTTSIPKLGYFGGFWMESESKENYGIFYPDIPPELWIEECDFIKFDDWLIKLVK
YKSRKI
>E0SGL8 ~~~rhsIC~~~Immunity protein RhsIC~~~
MTFQTLSSENNELLTIASESFLKRWAFSEQRLTVDLMTSDDDELTVYIETDLVQSSPIYLNEDLNICRLSIQDMHEILAV
QHGYYVPPSRFGDLMKYSSECYSFFYGRKSDFKYLASFIGYEKYIACPIRFLEDISWRIR
>P0AA67 ~~~rhtA~~~Threonine/homoserine exporter RhtA~~~COG5006
MPGSLRKMPVWLPIVILLVAMASIQGGASLAKSLFPLVGAPGVTALRLALGTLILIAFFKPWRLRFAKEQRLPLLFYGVS
LGGMNYLFYLSIQTVPLGIAVALEFTGPLAVALFSSRRPVDFVWVVLAVLGLWFLLPLGQDVSHVDLTGCALALGAGACW
AIYILSGQRAGAEHGPATVAIGSLIAALIFVPIGALQAGEALWHWSVIPLGLAVAILSTALPYSLEMIALTRLPTRTFGT
LMSMEPALAAVSGMIFLGETLTPIQLLALGAIIAASMGSTLTVRKESKIKELDIN
>P0AG34 ~~~rhtB~~~Homoserine/homoserine lactone efflux protein~~~COG1280
MTLEWWFAYLLTSIILSLSPGSGAINTMTTSLNHGYRGAVASIAGLQTGLAIHIVLVGVGLGTLFSRSVIAFEVLKWAGA
AYLIWLGIQQWRAAGAIDLKSLASTQSRRHLFQRAVFVNLTNPKSIVFLAALFPQFIMPQQPQLMQYIVLGVTTIVVDII
VMIGYATLAQRIALWIKGPKQMKALNKIFGSLFMLVGALLASARHA
>P0AG38 ~~~rhtC~~~Threonine efflux protein~~~COG1280
MLMLFLTVAMVHIVALMSPGPDFFFVSQTAVSRSRKEAMMGVLGITCGVMVWAGIALLGLHLIIEKMAWLHTLIMVGGGL
YLCWMGYQMLRGALKKEAVSAPAPQVELAKSGRSFLKGLLTNLANPKAIIYFGSVFSLFVGDNVGTTARWGIFALIIVET
LAWFTVVASLFALPQMRRGYQRLAKWIDGFAGALFAGFGIHLIISR
>P0DV87 ~~~~~~Retron Ec73 putative ribosyltransferase/DNA-binding protein~~~
MLTQLKKNGTEVSRATALFSSFVEKNKVKCPGNVKKFVFLCGANKNNGEPSARRLELINFSERYLNNCHFFLAELVFKEL
STDEESLSDNLLDIEADLSKLADHIIIVLESYSSFTELGAFAYSKQLRKKLIIVNNTKFINEKSFINMGPIKAITQQSQQ
SGHFLHYKMTEGIESIERSDGIGEIFDPLYDILSKNDRAISRTLKKEELDPSSNFNKDSVRFIHDVIFVCGPLQLNELIE
IITKIFGTESHYKKNLLKHLGILIAIRIISCTNGIYYSLYKEYYFKYDFDIDNISSMFKVFFLKNKPERMRVYENI
>O66747 1.1.1.302~~~ribD2~~~2,5-diamino-6-ribosylamino-4(3H)-pyrimidinone 5'-phosphate reductase~~~COG1985
MERPYVIIVSEVSVDGKLTLYRGASSKELMSLMDEEAYKYLHEIRAKVDGIMVGCETVRTDNPSLTVRYAKGKNPVRIIP
CSTANVPLDANVLNTKEAPTIIATTERAPKERLEKIKELGAEVIVVGDELVDFDKLLPELYRRGIKSLMVEGGASINWEF
VRRRVVDEIRLIHLPVIVGGENVPTLVGGEGFKKLKNLLHLRLRSHFVRGKQLITEWEVVNKIR
>P0DV88 ~~~~~~Retron Ec86 putative ribosyltransferase/DNA-binding protein~~~
MNKKFTDEQQQQLIGHLTKKGFYRGANIKITIFLCGGDVANHQSWRHQLSQFLAKFSDVDIFYPEDLFDDLLAGQGQHSL
LSLENILAEAVDVIILFPESPGSFTELGAFSNNENLRRKLICIQDAKFKSKRSFINYGPVRLLRKFNSKSVLRCSSNELK
EMCDSSIDVARKLRLYKKLMASIKKVRKENKVSKDIGNILYAERFLLPCIYLLDSVNYRTLCELAFKAIKQDDVLSKIIV
RSVVSRLINERKILQMTDGYQVTALGASYVRSVFDRKTLDRLRLEIMNFENRRKSTFNYDKIPYAHP
>P0A7I7 3.5.4.25~~~ribA~~~GTP cyclohydrolase-2~~~COG0807
MQLKRVAEAKLPTPWGDFLMVGFEELATGHDHVALVYGDISGHTPVLARVHSECLTGDALFSLRCDCGFQLEAALTQIAE
EGRGILLYHRQEGRNIGLLNKIRAYALQDQGYDTVEANHQLGFAADERDFTLCADMFKLLGVNEVRLLTNNPKKVEILTE
AGINIVERVPLIVGRNPNNEHYLDTKAEKMGHLLNK
>O08315 3.5.4.25~~~ribA~~~GTP cyclohydrolase-2~~~COG0807
MKRLEVSNQAKLPTQFGEFYIQCFREKGSNGSKDHLVVFTPNFSQNPLVRLHSECLTGDALGSQKCDCGGALQMALERIS
KEGGLVIYLRQEGRGIGLFNKVNAYALQDKGYDTIQANEMIGFKDDERDYSVAGEILEYYRIKKMRLLTNNPKKIAALEK
YAEVTRESLIVCANEHNQGYLEVKKLKMGHLL
>A5U2B7 ~~~ribBA~~~Riboflavin biosynthesis protein RibBA~~~COG0108
MTRLDSVERAVADIAAGKAVIVIDDEDRENEGDLIFAAEKATPEMVAFMVRYTSGYLCVPLDGAICDRLGLLPMYAVNQD
KHGTAYTVTVDARNGIGTGISASDRATTMRLLADPTSVADDFTRPGHVVPLRAKDGGVLRRPGHTEAAVDLARMAGLQPA
GAICEIVSQKDEGSMAHTDELRVFADEHGLALITIADLIEWRRKHEKHIERVAEARIPTRHGEFRAIGYTSIYEDVEHVA
LVRGEIAGPNADGDDVLVRVHSECLTGDVFGSRRCDCGPQLDAALAMVAREGRGVVLYMRGHEGRGIGLMHKLQAYQLQD
AGADTVDANLKLGLPADARDYGIGAQILVDLGVRSMRLLTNNPAKRVGLDGYGLHIIERVPLPVRANAENIRYLMTKRDK
LGHDLAGLDDFHESVHLPGEFGGAL
>P9WHF1 ~~~ribBA~~~Riboflavin biosynthesis protein RibBA~~~COG0108
MTRLDSVERAVADIAAGKAVIVIDDEDRENEGDLIFAAEKATPEMVAFMVRYTSGYLCVPLDGAICDRLGLLPMYAVNQD
KHGTAYTVTVDARNGIGTGISASDRATTMRLLADPTSVADDFTRPGHVVPLRAKDGGVLRRPGHTEAAVDLARMAGLQPA
GAICEIVSQKDEGSMAHTDELRVFADEHGLALITIADLIEWRRKHEKHIERVAEARIPTRHGEFRAIGYTSIYEDVEHVA
LVRGEIAGPNADGDDVLVRVHSECLTGDVFGSRRCDCGPQLDAALAMVAREGRGVVLYMRGHEGRGIGLMHKLQAYQLQD
AGADTVDANLKLGLPADARDYGIGAQILVDLGVRSMRLLTNNPAKRVGLDGYGLHIIERVPLPVRANAENIRYLMTKRDK
LGHDLAGLDDFHESVHLPGEFGGAL
>P0A7J0 4.1.99.12~~~ribB~~~3,4-dihydroxy-2-butanone 4-phosphate synthase~~~COG0108
MNQTLLSSFGTPFERVENALAALREGRGVMVLDDEDRENEGDMIFPAETMTVEQMALTIRHGSGIVCLCITEDRRKQLDL
PMMVENNTSAYGTGFTVTIEAAEGVTTGVSAADRITTVRAAIADGAKPSDLNRPGHVFPLRAQAGGVLTRGGHTEATIDL
MTLAGFKPAGVLCELTNDDGTMARAPECIEFANKHNMALVTIEDLVAYRQAHERKAS
>P66032 4.1.99.12~~~ribB~~~3,4-dihydroxy-2-butanone 4-phosphate synthase~~~
MNQTLLSSFGTPFERVELALDALREGRGVMVLDDEDRENEGDMIFPAETMTVEQMALTIRHGSGIVCLCITEDRRKQLDL
PMMVENNTSAYGTGFTVTIEAAEGVTTGVSAADRVTTVRAAIKDGAKPSDLNRPGHVFPLRAQAGGVLTRGGHTEATIDL
MTLAGFKPAGVLCELTNDDGTMARAPECIAFAGQHNMAVVTIEDLVAYRQAHERKAS
>Q9KKP2 4.1.99.12~~~ribB~~~3,4-dihydroxy-2-butanone 4-phosphate synthase~~~COG0108
MNQSSLLAEFGDPITRVENALQALREGRGVLLLDDEDRENEGDIIYAVESLTTAQMALMIRECSGIVCLCLTEAQADRLA
LPPMVVNNNSANQTAFTVSIEAKHGVTTGVSAQDRVTTIKTAANPQAKPEDLARPGHVFPLRARAGGVLARRGHTEGTVD
LMQMAGLQPAGVLCELTNPDGSMAKTPEIIEFGKLHNMPVLTIEDMVQYRIQFDLKLA
>Q8ZI56 4.1.99.12~~~ribB~~~3,4-dihydroxy-2-butanone 4-phosphate synthase~~~COG0108
MNQTLLSDFGTPVERVERAIDALRNGRGVMVLDDESRENEGDMVFAAEAMTLEQMALTIRHGSGIVCLCITDERRQQLDL
PMMVTHNSSQFQTAFTVTIEAAEGVTTGVSAADRLTTIRKAIADNAKPADLNRPGHVFPLRGQPGGVLSRRGHTEASIDL
ATLAGYKPAGVLCELTNDDGSMAHAPEVIAFAKLHDMPVVTIDDLAAYLQSRAKKAS
>Q8Y7F2 ~~~ribCF~~~Bifunctional riboflavin kinase/FMN adenylyltransferase~~~COG0196
MKTIYLHHPITTDEWTDINKVMALGFFDGVHLGHQAVIKQAKQIAGQKGLQTAVLTFDPHPSVVLSNIRKQVKYLTPLED
KAEKMAKLGVDIMYVVRFTTQFSELSPQAFVDNYLVALHVEHVVAGFDYSYGKKGEGKMTDLAKYADGRFEVTIVDKQTA
ASDKISSTNIRRAITEGELEEANQLLGYPYTTKGTVIHGDKRGRTIGFPTANIRVNEDYLIPKLGVYAVKFRVNGETHLG
MASIGYNITFKDDQALSIEVYILDFHREIYGEEAEIKWYQFFRPELKFNGVEGLIAQLEKDEQDTRAFFAKLED
>Q8Y914 2.7.7.2~~~ribC~~~FAD synthetase~~~COG0196
MEVSHVTLAPNKDSRAAVLTIGKFDGVHIGHQTILNTALSIKKENEILTAISFSPHPLWALKQIEIYREMLTPRMEKERW
LAYYGVNHLIETEFTSRYAETTPEEFVTDHLTNLNLSHIVVGSEFNFGKGRDSDVDLLRDLCKPYDIGVTSVPVIETNQT
KISSTNIRAFIRRGHFQEAEELLGHPWYITGIVENGEMTGLDDYVLPATGTYQTDSGIVNVTNNRTIEVGLSDGLQQLHM
KNELSE
>P17618 ~~~ribD~~~Riboflavin biosynthesis protein RibD~~~COG0117
MEEYYMKLALDLAKQGEGQTESNPLVGAVVVKDGQIVGMGAHLKYGEAHAEVHAIHMAGAHAEGADIYVTLEPCSHYGKT
PPCAELIINSGIKRVFVAMRDPNPLVAGRGISMMKEAGIEVREGILADQAERLNEKFLHFMRTGLPYVTLKAAASLDGKI
ATSTGDSKWITSEAARQDAQQYRKTHQSILVGVGTVKADNPSLTCRLPNVTKQPVRVILDTVLSIPEDAKVICDQIAPTW
IFTTARADEEKKKRLSAFGVNIFTLETERIQIPDVLKILAEEGIMSVYVEGGSAVHGSFVKEGCFQEIIFYFAPKLIGGT
HAPSLISGEGFQSMKDVPLLQFTDITQIGRDIKLTAKPTKE
>P25539 ~~~ribD~~~Riboflavin biosynthesis protein RibD~~~COG0117
MQDEYYMARALKLAQRGRFTTHPNPNVGCVIVKDGEIVGEGYHQRAGEPHAEVHALRMAGEKAKGATAYVTLEPCSHHGR
TPPCCDALIAAGVARVVASMQDPNPQVAGRGLYRLQQAGIDVSHGLMMSEAEQLNKGFLKRMRTGFPYIQLKLGASLDGR
TAMASGESQWITSPQARRDVQLLRAQSHAILTSSATVLADDPALTVRWSELDEQTQALYPQQNLRQPIRIVIDSQNRVTP
VHRIVQQPGETWFARTQEDSREWPETVRTLLIPEHKGHLDLVVLMMQLGKQQINSIWVEAGPTLAGALLQAGLVDELIVY
IAPKLLGSDARGLCTLPGLEKLADAPQFKFKEIRHVGPDVCLHLVGA
>P9WPH1 ~~~ribD~~~Riboflavin biosynthesis protein RibD~~~COG0117
MNVEQVKSIDEAMGLAIEHSYQVKGTTYPKPPVGAVIVDPNGRIVGAGGTEPAGGDHAEVVALRRAGGLAAGAIVVVTME
PCNHYGKTPPCVNALIEARVGTVVYAVADPNGIAGGGAGRLSAAGLQVRSGVLAEQVAAGPLREWLHKQRTGLPHVTWKY
ATSIDGRSAAADGSSQWISSEAARLDLHRRRAIADAILVGTGTVLADDPALTARLADGSLAPQQPLRVVVGKRDIPPEAR
VLNDEARTMMIRTHEPMEVLRALSDRTDVLLEGGPTLAGAFLRAGAINRILAYVAPILLGGPVTAVDDVGVSNITNALRW
QFDSVEKVGPDLLLSLVAR
>Q59263 ~~~ribF~~~Bifunctional riboflavin kinase/FMN adenylyltransferase~~~
MDIWYGTAAVPKDLDNSAVTIGVFDGVHRGHQKLINATVEKAREVGAKAIMVTFDPHPVSVFLPRRAPLGITTLAERFAL
AESFGIDGVLVIDFTRELSGTSPEKYVEFLLEDTLHASHVVVGANFTFGENAAGTADSLRQICQSRLTVDVIDLLDDEGV
RISSTTVREFLSEGDVARANWALGRHFYVTGPVVRGAGRGGKELGFPTANQYFHDTVALPADGVYAGWLTILPTEAPVSG
NMEPEVAYAAAISVGTNPTFGDEQRSVESFVLDRDADLYGHDVKVEFVDHVRAMEKFDSVEQLLEVMAKDVQKTRTLLAQ
DVQAHKMAPETYFLQAES
>P0AG40 ~~~ribF~~~Bifunctional riboflavin kinase/FMN adenylyltransferase~~~COG0196
MKLIRGIHNLSQAPQEGCVLTIGNFDGVHRGHRALLQGLQEEGRKRNLPVMVMLFEPQPLELFATDKAPARLTRLREKLR
YLAECGVDYVLCVRFDRRFAALTAQNFISDLLVKHLRVKFLAVGDDFRFGAGREGDFLLLQKAGMEYGFDITSTQTFCEG
GVRISSTAVRQALADDNLALAESLLGHPFAISGRVVHGDELGRTIGFPTANVPLRRQVSPVKGVYAVEVLGLGEKPLPGV
ANIGTRPTVAGIRQQLEVHLLDVAMDLYGRHIQVVLRKKIRNEQRFASLDELKAQIARDELTAREFFGLTKPA
>A4IT50 ~~~ribF~~~Putative bifunctional riboflavin kinase/FMN adenylyltransferase~~~COG0196
MKVHEANQGLTLPGSVVAIGAFDGVHQGHQAVLRQAVERSRQLGVESVAYTIDPPPRCRFQGSRMLTTLQEKLDRFAVLG
LNHAVVAHFDERYAARRVDAFIRELTALNPREVIVGQDFRFGRNREGDVALLRRHFPVRIVQTVCCADGQRISSTRIREL
IERGEWEQSTVLLGWPLSS
>K4REQ6 ~~~ribM~~~Riboflavin/roseoflavin transporter RibM~~~COG3201
MNWLNSEAFVLFDQHIIWSDMVGNILGLITLALGFRRSLWTWPVQFLSGLVLFGAFYGHLTGSAGKQAVVMAVALYGWYQ
WNRGTDKAADGKVSVRFATWAERGAMIAAAAVGTVAVALLFKAYPSLSWDPWPDAYIFVGTIVAMYAQARGMVEFWFAWL
LVDLVGVPLNFANGYAFSGFVYVIYGALVLWGMRDWWLRSRRDSRPVLEGAPA
>A6WUW2 ~~~ribN~~~Riboflavin transporter~~~COG0697
MVAACSAYAAVNVATQWAGTRTGIPSVIIAFWQYVIALVLTLPLLVRDGTRALRTSHFGLHVMRVALAAAGVQVWIYALT
HVPIWQVVALSMTSPFFVILCARLFLQEKVTPARLLTTFTGFIGALIIIAPWSDSYTVYSLLPILAAALWAGYSVMTKYL
TRFEKPASISAYMLVLLTPINAALWLASGLSMSAITAPDVEIWSILIVIGAFTALAQYFQTAAYSIADAVYLQPFDDLRL
PINVIFGWIVFAAAPSINFWPGAALIIGASLYLMRQDSGTSRTA
>Q1MIM3 ~~~ribN~~~Riboflavin transporter~~~COG0697
MKDMNQTSLAVEPSRAVVGALWMVLAGIAFSLLNVVTQWLTMKLAFPSASAAFWQYGFAFLFSLPFLKRLGLAAMRTHYP
WRHLTRVVLAALGVEAWVAGLAAVPIWQAIALVMTSPFFIILGARLFLGERVGPARWAATAAGFTGAMIILQPWSDGIGW
AALLPVLSALLWGASSLITKSLTGIERPETITVWLLVLLTPINGGLALAAGFAVPTGATLALFLLAGLLTAVAQYFLTLA
YAAADAAYVQPFDDLKLPLNVLAGWLFFGYAPAGYLWLGAALILSASLFIMRNEMRRERKPA
>Q9KKU0 ~~~ribN~~~Riboflavin transporter~~~COG0697
MSIKHHPLQGALWMLTAGLAFAIVNSVAQYASIQFGLPSTTVALVQYAIAIVVILPYLKTLGIRQSLRTQQFGWHLLRVF
LAVIGIQLWLWALAYPVPIWQGIALLMTSPLFATIGSGLWLREKVGMARWVATLTGFIGAMIILEPWADDFNLASLLPVG
AAFFWASYSLMVKKLSSHDSPSTMVVYLLLLITPFNLLLALPDWQMPNGQTVWLLLIGAGVMTALAQWAIAKAYAVADAS
FVQPFDHAKLPLNVLAGWLVFGWVPPGRLWLGAAIIVLSVAFITQWETKKSRRERNIA
>P94465 2.7.1.26~~~ribR~~~RNA-binding riboflavin kinase RibR~~~COG0196
MTIIAGTVVKGKQLGRKLGFPTANVDAKIHGLRNGVYGVLATVNHQFHLGVMNIGVKPTVGSNLEKTLEIFLFDFHRDIY
GEKIECSILFKIREERRFDSLESLTKQIKKDISCVAKRFELIGIMAPNKKESLLSHQELNLPDLCFYKKCNNLYGVNRGV
YNVIDNWFFEYGITQVAYRRIYILSFLSFLKEDNPKVSSKYIRFGAGGLADKLNRFISSYVEESEENILG
>P17622 2.3.1.-~~~ribT~~~Protein RibT~~~COG0456
MLIRYKKSFEKIAMGLLSFMPNEKDLKQLQQTIKDYETDTDRQLFLWKEDEDIVGAIGVEKKDSEVEIRHISVNPSHRHQ
GIGKQMMDALKHLFKTQVLVPNELTQSFFERCQGQQDQDISYNN
>P0CI36 ~~~ribU~~~Riboflavin transporter RibU~~~COG3601
MSKTRRMVLIAMLAALSTILLLPILQFPLLPGIDFMKVELSIIPVLIGVFTLGLGDGFIILFIRSVLWYLLFNQGPSTWI
GVPMNFVALGIFMAIVWFFTKKKFSIKNYTVGIVLATIASVLVMMVLNVFYALPLYRLAAGFDVDKIFAGATHLFNMGSL
SVTLNPTYLLTVVLPFNALQYIIFALVFGLIVTVFKKNKVVKFYNA
>D8KIE9 ~~~ribU~~~Riboflavin transporter RibU~~~
MSKTRRMVLIAMLAALSTILLLPILQFPLLPGIDFMKVELSIIPVLIGVFTLGLGDGFIILFIRSVLWYLLFNQGPSTWI
GVPMNFVALGIFMAIVWFFTKKKFSIKNYTVGIVLATIASVLVMMVLNVFYALPLYRLAAGFDVDKIFAGATHLFNMGSL
SVTLNPTYLLTVVLPFNALQYIIFALVFGLIVTVFKKNKVVKFYNA
>Q03WN0 ~~~ribU~~~Riboflavin transporter RibU~~~COG3601
MSIIPVTRVQRTTLIAILSAISFGLMLFPQVPIIPSADFLKLDFSIVPVVIGLYWLNYSASLWVILIRTLLKLILANEGV
NTYLGLPVNLLVVLAFITVLKITMPNLEQYSNWQKKILPLISSTFVMTIVAIVINWFVAIPLYARFANFDIAKFIGLKNY
FIGMVLPFNLIEGIIWFVVSMIILRAIQPLQRRFHS
>Q8Y5W0 ~~~ribU~~~Riboflavin transporter RibU~~~COG3601
MKNYSMKVFVSVAVLGTLAFILMMLQFPLLPSAPFLKLDFSDIPALIGGLLFGPLAVILVELIKNVLLYIVSGSPVGVPV
GELANFISGLFYVLPIYYLFHWLRSTKGMVLSTAVGTVLMTGAMAVFNYFVLLPFYIKLGGLPANTDVAWLITYAIVPFN
LLKGVIVSAVFLLLYSRLKKWIAKNQTMKERRKFEKRQQEISH
>E5QVT2 ~~~ribU~~~Riboflavin transporter RibU~~~
MNGRRKLNMQQNKRLITISMLSAIAFVLTFIKFPIPFLPPYLTLDFSDVPSLLATFTFGPVAGIIVALVKNLLNYLFSMG
DPVGPFANFLAGASFLLTAYAIYKNKRSTKSLITGLIIATIVMTIVLSILNYFVLLPLYGMIFNLADIANNLKVIIVSGI
IPFNIIKGIVISIVFILLYRRLANFLKRI
>Q5M614 ~~~ribU~~~Riboflavin transporter RibU~~~COG3601
MRSLFFGIGKSIGNFFQIVKIWRNFFMTNTRKLAYIAILSAVSFLLLYFSFPLIPAADFLKVDFSILPVLIALVIFDFKS
AIGVLLLRSLLKLLLNNGGPGSMIGLPMNFVALGVFVWGLSYFWKKNQTSKNYILGSVLGTILLTVAMVVLNYIYAVPLY
AKFANFDIAQFIGLYKYLFAMVVPFNLLEGLIFSVAFALIYAPLKSILVKL
>Q9X1G6 ~~~ribU~~~Riboflavin transporter RibU~~~COG3601
MSSIKKISFVGIFSALATLVMFLEFPIFPQASFLKYDPSEIPALIVSFLLGPGVGMFVVLVKDILFFLMKSGDPVGIAMN
AVLGMSFVGIAGLIYHRNKSRATAIKGMIVATLFATAFALGLNALIVPLYFEAPFELYLKFFPFILAFNLVKFGIDSVVT
FFVYKKVSSILKLETVEGRSNNG
>Q6F0N9 ~~~ribV~~~Putative riboflavin transporter RibV~~~
MDKKYKLWNYKYDFSEINLKNWKEVLKDTFKLNTRKIALLSMLFAIEILMTIISKVIMGLAIPMIVGVYTIEISFFVILI
IYLCSNYIYASILSITAIWFRLLLGSEPVGLLSMMISDTAFLTIFAVLFFILKKFIFLKFKFKNQIKILIALICFAGLIS
MIGSGFISMLCNDKFIFEMYYLSDDGSGYWKMLLWVGFGVTLAKYSINILLFASTLKVLLILIKQSRV
>Q7MGG3 ~~~~~~Riboflavin biosynthesis protein VVA0006~~~COG0807
MHFVSYSLCYDVRSNIMEQPIYFYEPDENHGFLANFYPCSITVSGTCWPSSEHYYQAQKFDDVRLQEKVLRAEDAAQAFR
LSREYQQWQRHDWYDIRVEVMRFIVREKFLQNTPLAHQLLATGDTELKEHSHKDAFWGDGGDGHGRNELGRILMMVREEL
QEHAPYNLVQFIDSAKLPTQWGTFQMYGFIEKATGKEHLALVYGDIEQQAAPLIRLHSECLTGDALFSARCDCGFQLAKA
LQNIVAEGAGVLLYLRQEGRGIGLINKIRAYHLQDDGADTVEANEQLGFGADMRDYAFCRGILSFLGIERVRLMTNNPRK
VKALQLANIEVTERVPLQEGNNPHNHQYLRTKADKLGHMFDRNFVKP
>Q8A947 3.1.3.-~~~~~~D-ribitol-5-phosphate phosphatase~~~COG1011
MIKNIVFDFGGVIVDIDRDKAVQAFIKLGLADADTRLDKYHQTGIFQELEEGKLSADEFRKQLGDLCGRELTMEETKQAW
LGFFNEVDLRKLDYILGLRKSYHVYLLSNTNPFVMSWACSPEFSSEGKPLNDYCDKLYLSYQLGHTKPAPEIFDFMIKDS
HVIPSETLFVDDGSSNIHIGKELGFETFQPENGADWRQELTVILNS
>A9WGD1 ~~~ribX~~~Riboflavin transport system permease protein RibX~~~COG0600
MQQTLPVTRVSPRVRSLQRFEWPALGLPVTLMLLLVFWQAGVTLSGYPAFILPSPALVAGRFWQALSSGLLWQHTLATLS
AALGGFTLALIIALILGYTLAHIRWLEQALAPVLAASQAIPVVAVAPLIILWFGAGLTSKVLVAALITFLPILINTVVAI
RSIPRELIEMAYISGANRWQLLRYVEAPLALPVLFGGVRTGLALATTGAVVGEFVAGRVGLGALINIARGLFDTPLIFVA
LATLALITLTLYVLAGLLERLLVRWEAS
>P30176 3.2.2.-~~~ybiA~~~N-glycosidase YbiA~~~COG3236
MPVRAQRIQHVMQDTIINFYSTSDDYGDFSNFAAWPIKVDGKTWPTSEHYFQAQKFLDEKYREEIRRVSSPMVAARMGRD
RSKPLRKNWESVKEQVMRKALRAKFEQHAELRALLLATAPAKLVEHTENDAYWGDGGHGKGKNRLGYLLMELREQLAIEK
>B2J4E5 3.2.2.-~~~~~~N-glycosidase Npun_R5314~~~COG3236
MTIYFYKVWQPYGCFSNFSPHGIHIQDTYWATVEHYYQAQKFVGSKDAAIIPLIHAAATPEEAAALGRCSTRQLRRDWDL
VKTQIMREAVLKKFLTHADIREVLLKTGDELLVENSPTDSFWGCGANKAGLNHLGKTLMSVREEIRNLLSLTGIYE
>A9WGD2 ~~~ribY~~~Riboflavin-binding protein RibY~~~COG0715
MMKLRVLTLGILIILLITACSAPTPTTPAAAPTAAPAPNAQPTLQQVTLAMSYIPNIQFAPYYVAAAKGYYAAEGIEVVF
DYNFENDVLQRAATWPTSGVAFATTSGTSVLLARQQGLPVKTVMTLYQRFPIAFFAKSNVPLASVNDLRGQTIGIPGRFG
ESFYALLAALYAGGMSEADVTVQEIGFTQTAAVMEDKVPVAIGYAMNEPVQLRGQGVEVNVLLAADVFNLAANGIAVSEA
LIAQNPELVRKFVRASLRGLADTLANPDEAFDLSLQFIPEAQLGDLSLQRQVLQESLPFWQNELTAQYGLGYTDGQLWTR
TEEFMRAAGLLSAPVDVQQAFTNEFVPGGSY
>Q180E3 ~~~ribZ~~~Riboflavin transporter RibZ~~~COG2814
MKQKWIVLIIICIGVFMSTLDGSILNIANPTIAADFKINMSQIQWVVTAYMLVVTATMLFFGKLGDKVGSNRLYTLGFFI
FTIGSFLCSMSNNLSTLISSRIFQAVGASILMATGLGIVSNAFPANEKGKAIGITGAVVGIGNMSGPVIGGIILEHFGWP
SIFIINIPIGIIAVFLGIKFLPKPVLDEQNKSFDIPGLLLFASCTTLILLAMNEKGNTRLYLGITALIIFLLLALREVKF
EQSFIDLPLFKNRNFTVGNIIGVACYFPQMAVSFLLPFYLEQLKNLSPMMAGYVMTVHPLIMVLIAPIAGSLSDKHGAKN
ILTASFSFMTISLVGMALLKADSPLYLLIVCLVIFGLGLGAFSSPNNSSILADVPPQKQGYGGSFLATIRNLSFALGTAF
FSSFFAQSLTYNQKFKSHTSAYVIASNQSYWIAASVCFIGLILTVFFMRKTDKSIS
>Q92H62 ~~~rickA~~~Arp2/3 complex-activating protein rickA~~~
MVKEIDINKLLAQENNALNTILSQVNELCKQNKQLQGLIEIQNETKELEKEHNRSLPWFKRFVKTVSNVKYILIKSEEQL
TNEAIKYNNKILKDIDNKIYNIAEKSAPLKQALQEEIEKNFKDLTKKDLSKDQRARLSEVFFSYKSKPERFSALHMTNPL
QFINAEALEKQYNSLNATKQNIQNLISANSNIKELKEIQKQVAEIRAEVPHTFFEKLNNIWQNVKNVFVNNSEQVLAKNK
ESNTRTIRKIDEQLYKTKHKFEELIENKERNIKDIIAKLPDNEKLQKIVSNLTNHMASQKEPILANASLAKPLENNITPP
SPLPENNIPSPPPPPPPSPLPENNIPSSPPPPPPPPLPENNIPSPPPPPPPPPPPPMAPAQAETLSKPIESTTVKKLANQ
PRPSIDTSDLMREIAGPKKLKKVEFDPNTGKPVAHSHSKPAQNVNALSGLESIFARRAVIKVSDSSSSESDSGNWSDVSV
NRNKSKMLKTKGERDAKMTTHAQKINNRNSQNPSFVR
>O07434 ~~~ricR~~~Copper-sensing transcriptional repressor RicR~~~COG1937
MTAAHGYTQQKDNYAKRLRRVEGQVRGIARMIEEDKYCIDVLTQISAVTSALRSVALNLLDEHLSHCVTRAVAEGGPGAD
GKLAEASAAIARLVRS
>P37552 3.5.99.10~~~ridA~~~2-iminobutanoate/2-iminopropanoate deaminase~~~COG0251
MTKAVHTKHAPAAIGPYSQGIIVNNMFYSSGQIPLTPSGEMVNGDIKEQTHQVFSNLKAVLEEAGASFETVVKATVFIAD
MEQFAEVNEVYGQYFDTHKPARSCVEVARLPKDALVEIEVIALVK
>P0AF93 3.5.99.10~~~ridA~~~2-iminobutanoate/2-iminopropanoate deaminase~~~COG0251
MSKTIATENAPAAIGPYVQGVDLGNMIITSGQIPVNPKTGEVPADVAAQARQSLDNVKAIVEAAGLKVGDIVKTTVFVKD
LNDFATVNATYEAFFTEHNATFPARSCVEVARLPKDVKIEIEAIAVRR
>Q7CP78 3.5.99.10~~~ridA~~~2-iminobutanoate/2-iminopropanoate deaminase~~~
MSKTIATENAPAAIGPYVQGVDLGSMVITSGQIPVDPKTGAVAEDVSAQARQSLENVKAIVEAAGLKVGDIVKTTVFVKD
LNDFATVNATYEAFFTEHNATFPARSCVEVARLPKDVKIEIEAIAVRR
>P00335 1.1.1.56~~~rbtD~~~Ribitol 2-dehydrogenase~~~
MKHSVSSMNTSLSGKVAAITGAASGIGLECARTLLGAGAKVVLIDREGEKLNKLVAELGENAFALQVDLMQADQVDNLLQ
GILQLTGRLDIFHANAGAYIGGPVAEGDPDVWDRVLHLNINAAFRCVRSVLPHLIAQKSGDIIFTAVIAGVVPVIWEPVY
TASKFAVQAFVHTTRRQVAQYGVRVGAVLPGPVVTALLDDWPKAKMDEALANGSLMQPIEVAESVLFMVTRSKNVTVRDI
VILPNSVDL
>O52547 4.4.1.-~~~rifF~~~Proansamycin X synthase~~~
MNVFDVETYLQRIGCGGETGVDLETLAKLQKSHLMAIPYSSLAYELRDAVNVVDLDEDDVFVTSIAEGQGGACYHLNRLF
HRLLTELGYDVTPLAGSTAEGRETFGTDVEHMFNLVTLDGADWLVDVGYPGPTYVEPLAVSPAVQTQYGSQFRLVEQETG
YALQRRGAVTRWSVVYTFTTQPRQWSDWKELEDNFRALVGDTTRTDTQETLCGRAFANGQVFLRQRRYLTVENGREQVRT
ITDDDEFRALVSRVLSGDHG
>O52552 4.2.1.144~~~rifK~~~3-amino-5-hydroxybenzoate synthase~~~
MNARKAPEFPAWPQYDDAERNGLVRALEQGQWWRMGGDEVNSFEREFAAHHGAAHALAVTNGTHALELALQVMGVGPGTE
VIVPAFTFISSSQAAQRLGAVTVPVDVDAATYNLDPEAVAAAVTPRTKVIMPVHMAGLMADMDALAKISADTGVPLLQDA
AHAHGARWQGKRVGELDSIATFSFQNGKLMTAGEGGAVVFPDGETEKYETAFLRHSCGRPRDDRRYFHKIAGSNMRLNEF
SASVLRAQLARLDEQIAVRDERWTLLSRLLGAIDGVVPQGGDVRADRNSHYMAMFRIPGLTEERRNALVDRLVEAGLPAF
AAFRAIYRTDAFWELGAPDESVDAIARRCPNTDAISSDCVWLHHRVLLAGEPELHATAEIIADAVARA
>Q7BUE1 1.1.1.-~~~rifL~~~Putative UDP-kanosamine synthase oxidoreductase subunit~~~
MSVRAAVVGLGWAGRELWLPLLREHADFEVVAAVDADPASRQAFTKATGIPTHAAVSALTAREVDLAVVAVPNYLHTEVA
GALLATGISVFLEKPVCLNSAEIDVLAAAERSGGMLLAGSAARYRGDVGALRRLLPELGEIRHVALGWIRARGVPRAGGW
FTQREKAGGGALYDLGWHLLDTLAFLLGPAAFTQVIGVTSDDFVNAGAWRAAWRQDQLGADAADVEDTARGFLVRDDGVS
VSLRASWASHQARDVSVIHVEGSAGTADLRCTFGFSPNREPEPVLSVTREGTTTRLPVPLERIGVEYTRQVSDLAAMLAD
PGHRGRAVAEARPIVSMIENFYASAGSARGRGAVPAYQ
>G0FS68 2.7.1.179~~~rifN~~~Kanosamine kinase~~~
MGTPYHLGIDVGGTKVAFRVESGSACIEETSFSWGARHSAEDDLAQLAGHVARLRERIGTPLEAVGVAMPGTVGADGRVA
TWPSRPEWTGVDLKTALHSLFPEAAVAWADDGDLGALAESRASGCENLLYIGIGTGIGGGLVLGGVPCPGLGRGSFEIGH
VIVEMGGVRCVCGRRGCLQALASGPATLRRASLLRGADVTYYRLQRALRNGEPWAADALEGSTRALAAAVTGVQELVHPD
RVLIGGGFAAGIPEIVPSVSGFLADLVRQGQAPLPVEPAALGGLSSLRGAVALAGLVAAGEVP
>P41409 3.2.-.-~~~rihA~~~Pyrimidine-specific ribonucleoside hydrolase RihA~~~COG1957
MALPILLDCDPGHDDAIAIVLALASPELDVKAITSSAGNQTPEKTLRNVLRMLTLLNRTDIPVAGGAVKPLMRELIIADN
VHGESGLDGPALPEPTFAPQNCTAVELMAKTLRESAEPVTIVSTGPQTNVALLLNSHPELHSKIARIVIMGGAMGLGNWT
PAAEFNIYVDPEAAEIVFQSGIPVVMAGLDVTHKAQIHVEDTERFRAIGNPVSTIVAELLDFFLEYHKDEKWGFVGAPLH
DPCTIAWLLKPELFTSVERWVGVETQGKYTQGMTVVDYYYLTGNKPNATVMVDVDRQGFVDLLADRLKFYA
>P33022 3.2.2.8~~~rihB~~~Pyrimidine-specific ribonucleoside hydrolase RihB~~~COG1957
MEKRKIILDCDPGHDDAIAIMMAAKHPAIDLLGITIVAGNQTLDKTLINGLNVCQKLEINVPVYAGMPQPIMRQQIVADN
IHGETGLDGPVFEPLTRQAESTHAVKYIIDTLMASDGDITLVPVGPLSNIAVAMRMQPAILPKIREIVLMGGAYGTGNFT
PSAEFNIFADPEAARVVFTSGVPLVMMGLDLTNQTVCTPDVIARMERAGGPAGELFSDIMNFTLKTQFENYGLAGGPVHD
ATCIGYLINPDGIKTQEMYVEVDVNSGPCYGRTVCDELGVLGKPANTKVGITIDTDWFWGLVEECVRGYIKTH
>P22564 3.2.-.-~~~rihC~~~Non-specific ribonucleoside hydrolase RihC~~~COG1957
MRLPIFLDTDPGIDDAVAIAAAIFAPELDLQLMTTVAGNVSVEKTTRNALQLLHFWNAEIPLAQGAAVPLVRAPRDAASV
HGESGMAGYDFVEHNRKPLGIPAFLAIRDALMRAPEPVTLVAIGPLTNIALLLSQCPECKPYIRRLVIMGGSAGRGNCTP
NAEFNIAADPEAAACVFRSGIEIVMCGLDVTNQAILTPDYLSTLPQLNRTGKMLHALFSHYRSGSMQSGLRMHDLCAIAW
LVRPDLFTLKPCFVAVETQGEFTSGTTVVDIDGCLGKPANVQVALDLDVKGFQQWVAEVLALAS
>P0A946 2.3.1.266~~~rimI~~~[Ribosomal protein bS18]-alanine N-acetyltransferase~~~COG0456
MNTISSLETTDLPAAYHIEQRAHAFPWSEKTFASNQGERYLNFQLTQNGKMAAFAITQVVLDEATLFNIAVDPDYQRQGL
GRALLEHLIDELEKRGVATLWLEVRASNAAAIALYESLGFNEATIRRNYYPTTDGREDAIIMALPISM
>P0A944 2.3.1.266~~~rimI~~~[Ribosomal protein bS18]-alanine N-acetyltransferase~~~COG0456
MNTISSLETTDLPAAYHIEQRAHAFPWSEKTFASNQGERYLNFQLTQNGKMAAFAITQVVLDEATLFNIAVDPDYQRQGL
GRALLEHLIDELEKRGVATLWLEVRASNAAAIALYESLGFNEATIRRNYYPTTDGREDAIIMALPISM
>I6YG32 2.3.1.255~~~rimI~~~N-alpha-acetyltransferase RimI~~~COG0456
MTADTEPVTIGALTRADAQRCAELEAQLFVGDDPWPPAAFNRELASPHNHYVGARSGGTLVGYAGISRLGRTPPFEYEVH
TIGVDPAYQGRGIGRRLLRELLDFARGGVVYLEVRTDNDAALALYRSVGFQRVGLRRRYYRVSGADAYTMRRDSGDPS
>Q8ZJW4 2.3.1.266~~~rimI~~~[Ribosomal protein bS18]-alanine N-acetyltransferase~~~
MNTISILSTTDLPAAWQIEQRAHAFPWSEKTFFGNQGERYLNLKLTADDRMAAFAITQVVLDEATLFNIAVDPDFQRRGL
GRMLLEHLIDELETRGVVTLWLEVRASNAAAIALYESLGFNEATIRRNYYPTAQGHEDAIIMALPISM
>P0A948 2.3.1.267~~~rimJ~~~[Ribosomal protein uS5]-alanine N-acetyltransferase~~~COG1670
MFGYRSNVPKVRLTTDRLVVRLVHDRDAWRLADYYAENRHFLKPWEPVRDESHCYPSGWQARLGMINEFHKQGSAFYFGL
FDPDEKEIIGVANFSNVVRGSFHACYLGYSIGQKWQGKGLMFEALTAAIRYMQRTQHIHRIMANYMPHNKRSGDLLARLG
FEKEGYAKDYLLIDGQWRDHVLTALTTPDWTPGR
>P0C0U4 6.3.2.-~~~rimK~~~Ribosomal protein bS6--L-glutamate ligase~~~COG0189
MKIAILSRDGTLYSCKRLREAAIQRGHLVEILDPLSCYMNINPAASSIHYKGRKLPHFDAVIPRIGTAITFYGTAALRQF
EMLGSYPLNESVAIARARDKLRSMQLLARQGIDLPVTGIAHSPDDTSDLIDMVGGAPLVVKLVEGTQGIGVVLAETRQAA
ESVIDAFRGLNAHILVQEYIKEAQGCDIRCLVVGDEVVAAIERRAKEGDFRSNLHRGGAASVASITPQEREIAIKAARTM
ALDVAGVDILRANRGPLVMEVNASPGLEGIEKTTGIDIAGKMIRWIERHATTEYCLKTGG
>Q9HTZ2 6.3.2.-~~~rimK~~~Probable alpha-L-glutamate ligase~~~
MKIAVLSRNPRLYSTRRLVEAGRERGHEMVVIDTLRAYMNIASHKPQIHYRGQPLEGFDAVIPRIGASVTFYGCAVLRQF
EMMGVFPLNESVAIARSRDKLRSLQLLSRKGIGLPVTGFAHSPDDVPDLIEMVGGAPLVIKLLEGTQGIGVVLCETEKAA
ESVLEAFMGLKHNIMVQEYIKEAGGADIRCFVVGDKVIASMKRQAAPGEFRSNLHRGGSASLIKITPEERMTAIRAARVM
GLNVAGVDILRSNHGPLVMEVNSSPGLEGIESTTGKDIAGIIIQYLEKNGGPHLARTKGKG
>Q88AZ9 6.3.2.-~~~rimK~~~Probable alpha-L-glutamate ligase~~~COG0189
MKIAVLSRNPRLYSTRRLVEAGIERGHEMVVIDTLRAYMNIASHKPQIHYRGKPLEGFDAVIPRIGASVTFYGCAVLRQF
EMMGVFPLNESVAIARSRDKLRSLQLLSRRGIGLPVTGFAHSPDDIPDLIQMVNGAPLVIKVLEGTQGIGVVLCETATAA
ESVIEAFMGLKQDIMVQEYIKEAGGADIRCFVVGDKVIASMKRQAKPGEFRSNLHRGGSASLIKITPEERMTALRAAKVM
GLSVAGVDILRSNHGPLVMEVNSSPGLEGIEVTTSKDVAGMIIEYLEKNSGPHMTRTKGKG
>P13857 2.3.1.-~~~rimL~~~Ribosomal-protein-serine acetyltransferase~~~COG1670
MTETIKVSESLELHAVAENHVKPLYQLICKNKTWLQQSLNWPQFVQSEEDTRKTVQGNVMLHQRGYAKMFMIFKEDELIG
VISFNRIEPLNKTAEIGYWLDESHQGQGIISQALQALIHHYAQSGELRRFVIKCRVDNPQSNQVALRNGFILEGCLKQAE
FLNDAYDDVNLYARIIDSQ
>Q6F7I0 ~~~rimM~~~Ribosome maturation factor RimM~~~COG0806
MTPTQNVPEDRIQIGQLRSAYGLNGWLWVYSNTEPMSNMFDYLPWFIETKAGWQTVDVKRWKPHGKGLVVSLKNVSDRNA
AESLIGSTIWVAKSQLPKTDVDEYYWSDLKGLTVLGLDEEEQEVNLGQIHELFETGANDVMVVRATADSVDAEERMIPWH
KDVVQRVDLEAGRIYVNWGVDY
>P0A7X6 ~~~rimM~~~Ribosome maturation factor RimM~~~COG0806
MSKQLTAQAPVDPIVLGKMGSSYGIRGWLRVFSSTEDAESIFDYQPWFIQKAGQWQQVQLESWKHHNQDMIIKLKGVDDR
DAANLLTNCEIVVDSSQLPQLEEGDYYWKDLMGCQVVTTEGYDLGKVVDMMETGSNDVLVIKANLKDAFGIKERLVPFLD
GQVIKKVDLTTRSIEVDWDPGF
>P44568 ~~~rimM~~~Ribosome maturation factor RimM~~~COG0806
MKNMEQQHIEVVGKLGSTYGIRGWLRIYSSTEQAESIFDYQPWFLKIKGEWQSIELENWRYHNHEIIVKLKGVDDREAAQ
ILANVEIGVDLSVFPELEEGDYYWHDLIGCTVVNLEGYTMGTVTEMMETGSNDVLVVKANTKDAFGKQERLIPFLYEQVV
KRVDLTTKTIEVDWDAGF
>A0QV39 ~~~rimM~~~Ribosome maturation factor RimM~~~COG0806
MDLVVGRVVKAHGISGEVVVEIRTDDPEARFAPGAVLRGRPRSGAEREYTIESVRAHGGRLLVRLAGVADRNGADELRGT
VFLVDTAELPAIDDPDEFYDHELEGMRVVTVDDAPVGKVAEVLHTAGGEILAVKADEGGREILVPFVGAIVTSVSRQNAT
IVIDPPEGLLDLA
>P9WH19 ~~~rimM~~~Ribosome maturation factor RimM~~~COG0806
MELVVGRVVKSHGVTGEVVVEIRTDDPADRFAPGTRLRAKGPFDGGAEGSAVSYVIESVRQHGGRLLVRLAGVADRDAAD
ALRGSLFVIDADDLPPIDEPDTYYDHQLVGLMVQTATGEGVGVVTEVVHTAAGELLAVKRDSDEVLVPFVRAIVTSVSLD
DGIVEIDPPHGLLNLE
>Q9HXQ0 ~~~rimM~~~Ribosome maturation factor RimM~~~
MPTPADDLVVIGKIVSVYGIRGEVKVYSFTDPLDNLLDYRRWTLRRDGEIRQAELVRGRLHGKVLAAKLKGLDDREEART
FTGYEICIPRSELPSLEEGEYYWHQLEGLKVIDQGRQLLGVIDHLLETGANDVMVVKPCAGSLDDRERLLPYTGQCVLSI
DLAAGEMRVDWDADF
>P66656 ~~~rimM~~~Ribosome maturation factor RimM~~~
MRVEVGQIVNTHGIKGEIKVKSNSDFTDVRFQPGQVLTVVHNNNDLEYTVKSHRVHKGLHMLTFEGINNINDIEHLKGSS
IYQERDHEDIVLEENEFYYSDIIGCTVFDDQETPIGRVINIFETGANDVWVIKGSKEYLIPYIADVVKEVDVENKKIIIT
PMEGLLD
>Q5SJH5 ~~~rimM~~~Ribosome maturation factor RimM~~~COG0806
MRLVEIGRFGAPYALKGGLRFRGEPVVLHLERVYVEGHGWRAIEDLYRVGEELVVHLAGVTDRTLAEALVGLRVYAEVAD
LPPLEEGRYYYFALIGLPVYVEGRQVGEVVDILDAGAQDVLIIRGVGERLRDRAERLVPLQAPYVRVEEGSIHVDPIPGL
FD
>P0AEI4 2.8.4.4~~~rimO~~~Ribosomal protein uS12 methylthiotransferase RimO~~~COG0621
MSKVTPQPKIGFVSLGCPKNLVDSERILTELRTEGYDVVPSYDDADMVIVNTCGFIDSAVQESLEAIGEALNENGKVIVT
GCLGAKEDQIREVHPKVLEITGPHSYEQVLEHVHHYVPKPKHNPFLSLVPEQGVKLTPRHYAYLKISEGCNHRCTFCIIP
SMRGDLVSRPIGEVLSEAKRLVDAGVKEILVISQDTSAYGVDVKHRTGFHNGEPVKTSMVSLCEQLSKLGIWTRLHYVYP
YPHVDDVIPLMAEGKILPYLDIPLQHASPRILKLMKRPGSVDRQLARIKQWREICPELTLRSTFIVGFPGETEEDFQMLL
DFLKEARLDRVGCFKYSPVEGADANALPDQVPEEVKEERWNRFMQLQQQISAERLQEKVGREILVIIDEVDEEGAIGRSM
ADAPEIDGAVYLNGETNVKPGDILRVKVEHADEYDLWGSRV
>Q55803 2.8.4.4~~~rimO~~~Ribosomal protein uS12 methylthiotransferase RimO~~~COG0621
MGQTPTIAINHLGCEKNRIDSEHMLGLLVEAGYQVDANEELADYVIVNTCSFIQDARQESVRTLVELAEAKKKIVISGCL
AQHFQEQLLEEIPEAVAVVGTGDYQNIVDIIRRTEQGQRVKAISPNPSFIADENLPRYRTTNEAIAYLRVAEGCDYRCAF
CIIPQLRGKQRSRPIESIVAEAEQLASQGVKELILISQITTNYGLDLYGEPKLAELLQALGKVDIPWIRIHYAYPTGLTP
KVIEAIRDTPNVLPYLDLPLQHSHPDILRAMNRPWQGQVNDDIITRLKTALPDAVLRTTFIVGFPGETEEHFGHLLDFVQ
RHQFDHVGVFTFSPEEGTAAFDLPNAVPEEVMGDRRDRLMALQQPISAQKNAACLGQTLDVLIEQENPSTGEFIGRSTRF
APEVDGLVYVKGNANLNEIVPVVITATDDYDLYGMTAEEAKVF
>Q9X2H6 2.8.4.4~~~rimO~~~Ribosomal protein uS12 methylthiotransferase RimO~~~COG0621
MRVGIKVLGCPKNEADCEVLAGVLREGGHEIVFDVKDADVVVLDTCAFIEDAKRESIDEIFSFVDAKDQYGYKLVVKGCL
VQRYYEELKKEIPEVDQWIGVADPEEIANAIENGTDLVPDQPETVYRYRKRIDLEERPYAYVKISDGCDRGCTFCSIPSF
KGSLRSRSIEDITREVEDLLKEGKKEIILVAQDTTSYGIDLYRKQALPDLLRRLNSLNGEFWIRVMYLHPDHLTEEIISA
MLELDKVVKYFDVPVQHGSDKILKLMGRTKSSEELKKMLSSIRERFPDAVLRTSIIVGFPGETEEDFEELKQFVEEIQFD
KLGAFVYSDEEGTVAFNLKEKVDPEMAKRRQEELLLLQAEISNSRLDRFVGKKLKFLVEGKEGKFLVGRTWTEAPEVDGV
VFVRGKGKIGDFLEVVIKEHDEYDMWGSVI
>P0A8A8 ~~~rimP~~~Ribosome maturation factor RimP~~~COG0779
MSTLEQKLTEMITAPVEALGFELVGIEFIRGRTSTLRIYIDSEDGINVDDCADVSHQVSAVLDVEDPITVAYNLEVSSPG
LDRPLFTAEHYARFVGEEVTLVLRMAVQNRRKWQGVIKAVDGEMITVTVEGKDEVFALSNIQKANLVPHF
>O25687 ~~~rimP~~~Ribosome maturation factor RimP~~~COG0779
MTKKIEEKIGGVIESLGYLLYDVSLVKENEQHVLRVSLKNPNGAVSLDICQQVSEIISPLLDVCDFIQDAYILEVSSMGL
ERTLKTPKHFKLSLGEKVEVKLTNKESFQAVLKDANDLSADFELEDHAIKSVEYKDLKKVKTLFEW
>A0QVM3 ~~~rimP~~~Ribosome maturation factor RimP~~~COG0779
MAPDPKLPSADLPSQKQVIELLDGEFARAGYEIDDVVVNAATRPARITIVADGDKGLDLDAVAMLSRLASGLLDTVDTGD
TPYVLEVTSPGVDRPLTTEKHFRRARGRKAELSLADGSSLTARLGGTDGDQVNVVVAQGKDFAVRQIPLREITKAVVQVE
FSPPNRRELELAEQTGKGARA
>P9WH17 ~~~rimP~~~Ribosome maturation factor RimP~~~COG0779
MTTGLPSQRQVIELLGADFACAGYEIEDVVIDARARPPRIAVIADGDAPLDLDTIAALSRRASALLDGLDGANKIRGRYL
LEVSSPGVERPLTSEKHFRRARGRKVELVLSDGSRLTGRVGEMRAGTVALVIREDRGWAVREIPLAEIVKAVVQVEFSPP
APAELELAQSSEMGLARGTEAGA
>Q97S61 ~~~rimP~~~Ribosome maturation factor RimP~~~COG0779
MDAIATIVELVREVVEPVIEAPFELVDIEYGKIGSDMILSIFVDKPEGITLNDTADLTEIISPVLDTIKPDPFPEQYFLE
ITSPGLERPLKTKDAVAGAVGKYIHVGLYQAIDKQKVFEGTLLAFEEDELTMEYMDKTRKKTVQIPYSLVSKARLAVKL
>Q68A49 ~~~~~~TAL effector protein Rip19~~~
MRIGKSSGWLNESVSLEYEHVSPPTRPRDTRRRPRAASDGGLAHLHRRLAVGYAEDTPRTGARSPAPRRPLPVAPASAPP
APSLVPEPPMPVSLPVVSSPRFSAGSSAAITDPFSSLPPTPVLYAMARELKALSDATWQPAVPLPAEPPTDARRGNTVFD
EASASSPVIASACPQAFASPPRAPRSARARRARTGGDAWPAPTFLSRPSSSRIGRDVFGKLVALGYSREQIRKLKQESLS
EIAKYHTTLTGQGFTHADICRISRRRQSLRVVARNYPELAAALPELTRAHIVDIARQRSGDLALQALLPVATALTAAPLR
LSASQIATVAQYGERPAIQALYRLRRKLTRAPLHLTPQQVVAIASHDGGKPALEAVWAKLPVLRGVPYALSTAQVVAIAC
ISGQQALEAIEAHMPTLRQAPHSLSPERVAAIACIGGRSAVEAVRQGLPVKAIRRIRREKAPVAGPPPASLGPTPQELVA
VLHFFRAHQQPRQAFVDALAAFQTTRPALLRLLSSVGVTEIEALGGTIPDATERWQRLLGRLGFRPATGAAAPSPDSLQG
FAQSLERTLGSPGMAGQSACSPHRKRPAETAIAPRSIRRRPNNAGQPSEPWPDQLAWLQRRKRTARSHIRADSAASVPAN
LHLGTRAQFTPDRLRAEPGPIMQAHTSPASVSFGSHVAFEPGLPDPGTPTSADLASFEAEPFGVGPLDFHLDWLLQILEA
>A1KML4 3.4.24.-~~~rip1~~~Zinc metalloprotease Rip1~~~
MMFVTGIVLFALAILISVALHECGHMWVARRTGMKVRRYFVGFGPTLWSTRRGETEYGVKAVPLGGFCDIAGMTPVEELD
PDERDRAMYKQATWKRVAVLFAGPGMNLAICLVLIYAIALVWGLPNLHPPTRAVIGETGCVAQEVSQGKLEQCTGPGPAA
LAGIRSGDVVVKVGDTPVSSFDEMAAAVRKSHGSVPIVVERDGTAIVTYVDIESTQRWIPNGQGGELQPATVGAIGVGAA
RVGPVRYGVFSAMPATFAFTGDLTVEVGKALAALPTKVGALVRAIGGGQRDPQTPISVVGASIIGGDTVDHGLWVAFWFF
LAQLNLILATINLLPLLPFDGGHIAVAVFERIRNMVRSARGKVAAAPVNYLKLLPATYVVLVLVVGYMLLTVTADLVNPI
RLFQ
>H8EW46 3.4.24.-~~~rip1~~~Zinc metalloprotease Rip1~~~
MMFVTGIVLFALAILISVALHECGHMWVARRTGMKVRRYFVGFGPTLWSTRRGETEYGVKAVPLGGFCDIAGMTPVEELD
PDERDRAMYKQATWKRVAVLFAGPGMNLAICLVLIYAIALVWGLPNLHPPTRAVIGETGCVAQEVSQGKLEQCTGPGPAA
LAGIRSGDVVVKVGDTPVSSFDEMAAAVRKSHGSVPIVVERDGTAIVTYVDIESTQRWIPNGQGGELQPATVGAIGVGAA
RVGPVRYGVFSAMPATFAVTGDLTVEVGKALAALPTKVGALVRAIGGGQRDPQTPISVVGASIIGGDTVDHGLWVAFWFF
LAQLNLILAAINLLPLLPFDGGHIAVAVFERIRNMVRSARGKVAAAPVNYLKLLPATYVVLVLVVGYMLLTVTADLVNPI
RLFQ
>P9WHS3 3.4.24.-~~~rip1~~~Zinc metalloprotease Rip1~~~COG0750
MMFVTGIVLFALAILISVALHECGHMWVARRTGMKVRRYFVGFGPTLWSTRRGETEYGVKAVPLGGFCDIAGMTPVEELD
PDERDRAMYKQATWKRVAVLFAGPGMNLAICLVLIYAIALVWGLPNLHPPTRAVIGETGCVAQEVSQGKLEQCTGPGPAA
LAGIRSGDVVVKVGDTPVSSFDEMAAAVRKSHGSVPIVVERDGTAIVTYVDIESTQRWIPNGQGGELQPATVGAIGVGAA
RVGPVRYGVFSAMPATFAVTGDLTVEVGKALAALPTKVGALVRAIGGGQRDPQTPISVVGASIIGGDTVDHGLWVAFWFF
LAQLNLILAAINLLPLLPFDGGHIAVAVFERIRNMVRSARGKVAAAPVNYLKLLPATYVVLVLVVGYMLLTVTADLVNPI
RLFQ
>L0T550 ~~~rip2~~~Putative zinc metalloprotease Rip2~~~COG1994
MSETGQRESVRPSPIFLGLLGLTAVGGALAWLAGETVQPLAYAGVFVMVIAGWLVSLCLHEFGHAFTAWRFGDHDVAVRG
YLTLDPRRYSHPMLSLGLPMLFIALGGIGLPGAAVYVHTWFMTTARRTLVSLAGPTVNLALAMLLLAATRLLFDPIHAVL
WAGVAFLAFLQLTALVLNLLPIPGLDGYAALEPHLRPETQRALAPAKQFALVFLLVLFLAPTLNGWFFGVVYWLFDLSGV
SHRLAAAGSVLARFWSIWF
>H8EUF2 ~~~rip3~~~Putative zinc metalloprotease Rip3~~~
MRDAIPLGRIAGFVVNVHWSVLVILWLFTWSLATMLPGTVGGYPAVVYWLLGAGGAVMLLASLLAHELAHAVVARRAGVS
VESVTLWLFGGVTALGGEAKTPKAAFRIAFAGPATSLALSATFGALAITLAGVRTPAIVISVAWWLATVNLLLGLFNLLP
GAPLDGGRLVRAYLWRRHGDSVRAGIGAARAGRVVALVLIALGLAEFVAGGLVGGVWLAFIGWFIFAAAREEETRISTQQ
LFAGVRVADAMTAQPHTAPGWINVEDFIQRYVLGERHSAYPVADRDGSITGLVALRQLRDVAPSRRSTTSVGDIALPLHS
VPTARPQEPLTALLERMAPLGPRSRALVTEGSAVVGIVTPSDVARLIDVYRLAQPEPTFTTSPQDADRFSDAG
>P9WHR0 ~~~rip3~~~Putative zinc metalloprotease Rip3~~~
MRDAIPLGRIAGFVVNVHWSVLVILWLFTWSLATMLPGTVGGYPAVVYWLLGAGGAVMLLASLLAHELAHAVVARRAGVS
VESVTLWLFGGVTALGGEAKTPKAAFRIAFAGPATSLALSATFGALAITLAGVRTPAIVISVAWWLATVNLLLGLFNLLP
GAPLDGGRLVRAYLWRRHGDSVRAGIGAARAGRVVALVLIALGLAEFVAGGLVGGVWLAFIGWFIFAAAREEETRISTQQ
LFAGVRVADAMTAQPHTAPGWINVEDFIQRYVLGERHSAYPVADRDGSITGLVALRQLRDVAPSRRSTTSVGDIALPLHS
VPTARPQEPLTALLERMAPLGPRSRALVTEGSAVVGIVTPSDVARLIDVYRLAQPEPTFTTSPQDADRFSDAG
>P9WHR1 ~~~rip3~~~Putative zinc metalloprotease Rip3~~~COG0517
MRDAIPLGRIAGFVVNVHWSVLVILWLFTWSLATMLPGTVGGYPAVVYWLLGAGGAVMLLASLLAHELAHAVVARRAGVS
VESVTLWLFGGVTALGGEAKTPKAAFRIAFAGPATSLALSATFGALAITLAGVRTPAIVISVAWWLATVNLLLGLFNLLP
GAPLDGGRLVRAYLWRRHGDSVRAGIGAARAGRVVALVLIALGLAEFVAGGLVGGVWLAFIGWFIFAAAREEETRISTQQ
LFAGVRVADAMTAQPHTAPGWINVEDFIQRYVLGERHSAYPVADRDGSITGLVALRQLRDVAPSRRSTTSVGDIALPLHS
VPTARPQEPLTALLERMAPLGPRSRALVTEGSAVVGIVTPSDVARLIDVYRLAQPEPTFTTSPQDADRFSDAG
>Q8NRR3 ~~~ripA~~~HTH-type transcriptional regulator RipA~~~COG2207
MSSASLLWCHSGVSTVRFGERIFTLVAGDLLFAPEEAQVADDSQGLVLNIRFETLNIMGPARRIHLGHVWNDRLTFEYSR
SLFGKETLSPDIARLFTDRVPTPPLPAPRKARAVAQVLVSNPADQTSLEEFAEIQGVSARTLQRQFLKSTGYSFSEWRAA
QRVCVAASLLAHDFSISVVANLVGFAATSSLTRAFRRHTGATPSTFTTGQIGMGSAGHPPRIPATTTFAEAHQDQQLWIY
SGTATVTTPGYCRFMGQGDMVTIPAGTQTRIDVAAGSIAFPVPVGLDEWGMDLTRVVAVNNQQPKPLTILEQSEWSKLSE
ELLNTPVPVQM
>A0QX22 3.4.-.-~~~ripA~~~Peptidoglycan endopeptidase RipA~~~COG0791
MRRTVRALATRVHGRVCAVPLVVGMLLATALYGGGPAAADPAAPDNLATLVAKVASADQKLQELGAAIQTQQETVNKAIV
DVQAARDAAAAAQRELEAGQRGVADANAAIEAAQKRFDSFAAATYMNGPSRSYLTATDPADIVNTTATGQALIASSQQVM
AKLQRARTEQVNRESAARLAKEKADQAARDAESSQDNAVAALKQAQQTFNAQQGELERLAAERAAAQAELDSVRKVSATG
NAAPAAAPAAAPAPAAAPAPVPNSAPAPVPGAQPNPQAAAGNWDRAPSGPASSGQNWAVWDPTLPAIPSAFVSGDPIAII
NAVLGIASTSAQVTADMGRSFLQKLGILPTPTGFTNGAIPRVYGREAVEYVIRRGMSQIGVPYSWGGGNAAGPSRGIDSG
AGTVGFDCSGLMLYMFAGVGIKLDHYSGSQYNAGRKIPSSQMRRGDMIFYGPNASQHVAMYLGNGQMLEAPYTGSHVKVS
PVRTSGMTPYVTRLIEY
>O53168 3.4.-.-~~~ripA~~~Peptidoglycan endopeptidase RipA~~~COG0791
MRRNRRGSPARPAARFVRPAIPSALSVALLVCTPGLATADPQTDTIAALIADVAKANQRLQDLSDEVQAEQESVNKAMVD
VETARDNAAAAEDDLEVSQRAVKDANAAIAAAQHRFDTFAAATYMNGPSVSYLSASSPDEIIATVTAAKTLSASSQAVMA
NLQRARTERVNTESAARLAKQKADKAAADAKASQDAAVAALTETRRKFDEQREEVQRLAAERDAAQARLQAARLVAWSSE
GGQGAPPFRMWDPGSGPAGGRAWDGLWDPTLPMIPSANIPGDPIAVVNQVLGISATSAQVTANMGRKFLEQLGILQPTDT
GITNAPAGSAQGRIPRVYGRQASEYVIRRGMSQIGVPYSWGGGNAAGPSKGIDSGAGTVGFDCSGLVLYSFAGVGIKLPH
YSGSQYNLGRKIPSSQMRRGDVIFYGPNGSQHVTIYLGNGQMLEAPDVGLKVRVAPVRTAGMTPYVVRYIEY
>P9WHU5 3.4.-.-~~~ripB~~~Peptidoglycan endopeptidase RipB~~~COG0791
MRHTRFHPIKLAWITAVVAGLMVGVATPADAEPGQWDPTLPALVSAGAPGDPLAVANASLQATAQATQTTLDLGRQFLGG
LGINLGGPAASAPSAATTGASRIPRANARQAVEYVIRRAGSQMGVPYSWGGGSLQGPSKGVDSGANTVGFDCSGLVRYAF
AGVGVLIPRFSGDQYNAGRHVPPAEAKRGDLIFYGPGGGQHVTLYLGNGQMLEASGSAGKVTVSPVRKAGMTPFVTRIIE
Y
>P0CG99 1.17.4.1~~~nrdE1~~~Ribonucleoside-diphosphate reductase subunit alpha 1~~~COG0209
MPPTVTAAEPVTTTGHVLPGEADYHALNAMLNLYDADGKIQFEKDREAAKQYFLQHVNQNTVFFHSQDEKLDYLIENEYY
EREVLDQYSRDFIKSLLDRAYAKKFRFPTFLGAFKYYTSYTLKTFDGKRYLERFEDRVVMVALTLAAGDTELAEKLVDEI
IDGRFQPATPTFLNSGKKQRGEPVSCFLLRIEDNMESIGRSINSALQLSKRGGGVALLLSNIREHGAPIKNIENQSSGVI
PIMKLLEDSFSYANQLGARQGAGAVYLHAHHPDIYRFLDTKRENADEKIRIKTLSLGVVIPDITFELAKKNEDMYLFSPY
DVERVYGVPFADVSVTEKYYEMVDDARIRKTKINAREFFQTLAELQFESGYPYIMFEDTVNRSNPIAGKITHSNLCSEIL
QVSTPSEFNDDLSYAKVGKDISCNLGSLNIAKAMDSPDFAQTIEVAIRALTAVSDQTHITSVPSIEQGNNDSHAIGLGQM
NLHGYLARERIYYGSEEGIDFTNIYFYTVLFHALRASNKIAIERGTHFKGFEKSKYASGEFFDKYTDQVWEPKTDKVRRL
FADADIHIPTQDDWKQLKESVQKHGIYNQNLQAVPPTGSISYINHSTSSIHPVASKIEIRKEGKIGRVYYPAPYMTNDNL
DYYQDAYEIGYEKIIDTYAAATQHVDQGLSLTLFFKDTATTRDVNKAQIYAWRKGIKTLYYIRLRQMALEGTEVEGCVSC
ML
>P0CH00 1.17.4.1~~~nrdE2~~~Ribonucleoside-diphosphate reductase subunit alpha 2~~~
MPPTVTAAEPVTTTGHVLPGEADYHALNAMLNLYDADGKIQFEKDREAAKQYFLQHVNQNTVFFHSQDEKLDYLIENEYY
EREVLDQYSRDFIKSLLDRAYAKKFRFPTFLGAFKYYTSYTLKTFDGKRYLERFEDRVVMVALTLAAGDTELAEKLVDEI
IDGRFQPATPTFLNSGKKQRGEPVSCFLLRIEDNMESIGRSINSALQLSKRGGGVALLLSNIREHGAPIKNIENQSSGVI
PIMKLLEDSFSYANQLGARQGAGAVYLHAHHPDIYRFLDTKRENADEKIRIKTLSLGVVIPDITFELAKKNEDMYLFSPY
DVERVYGVPFADVSVTEKYYEMVDDARIRKTKINAREFFQTLAELQFESGYPYIMFEDTVNRSNPIAGKITHSNLCSEIL
QVSTPSEFNDDLSYAKVGKDISCNLGSLNIAKAMDSPDFAQTIEVAIRALTAVSDQTHITSVPSIEQGNNDSHAIGLGQM
NLHGYLARERIYYGSEEGIDFTNIYFYTVLFHALRASNKIAIERGTHFKGFEKSKYASGEFFDKYTDQVWEPKTDKVRRL
FADADIHIPTQDDWKQLKESVQKHGIYNQNLQAVPPTGSISYINHSTSSIHPVASKIEIRKEGKIGRVYYPAPYMTNDNL
DYYQDAYEIGYEKIIDTYAAATQHVDQGLSLTLFFKDTATTRDVNKAQIYAWRKGIKTLYYIRLRQMALEGTEVEGCVSC
ML
>O66503 1.17.4.1~~~nrdA~~~Ribonucleoside-diphosphate reductase subunit alpha~~~COG0209
MTMYVIKRSGRKEKLDINKIRIAIKFACEGLNVDPLELEADAQIQFRDGITTKEIQQLLIKTAAEKVSAERPDWTYTAAR
LLLYDLYKDVAHLRGYSLRDDLGKYKPYNRKNFYSFVKEYVEKGIYGEYLLENYSEEDFNKLANYIKPERDLYFTYTGIK
ILYDRYLVRDEEGRVIELPQEMYMLIAMTLAVPEKPEERLKWAKKFYDVLSEHKVTVATPTLMNARRPFTQLSSCFVLTV
DDDLFDIFDNVKKAGMISKFAGGLGVYLGKIRATGAPIRKFKGASSGVIPVVKLINDTMTYVDQLGMRKGSASITLDIWH
KDILDFLEVKTNVGDERKKAHDIHPAVSIPDLFMKRLKNREDWTLIDPYWARQYITRKIYDGKYKEVKPLPGSHYYVGIK
EDGTQDILEPKGLEDFYGEEFEKWYLELEENLPSYAKKKVNSFELWKRLLTVAFETGEPYIFFRDEANRKNPNKHTGMVY
SSNLCHEIVQTMSPSKHEKPVLDPETGEITYKKEAGDLPVCNLGSVNLGKVHTEEEIKEVLPLLVRMLDNVIEMNFYAIP
EAEYTNKRYRAIGIGVSNYHYCLVKNGIKWESEEHLKFADKLFELIAFYALKGSLELAKERGRYKLFDGSNWSKGILFGR
SVEEIEENSRQNGNNLPWRELAEEIKKYGIRNAYLLALMPTGSTSLILGATPSIDPIFARFYKEENMSGILPQVPPEVDR
FYWHYKTAYTIDHEWTIRAAAVRQKWIDQAQSLNLFVDPQNIDGPRLSRLYELAWELGLKTIYYLRSRSAMDIEECEACS
V
>P50620 1.17.4.1~~~nrdE~~~Ribonucleoside-diphosphate reductase subunit alpha~~~COG0209
MSQNQVPKWIQLNNEIMIQKDGKFQFDKDKEAVHSYFVDYINQNTVFFHNLKEKLDYLVENQYYEEEFLSLYSFEDIKEV
FKTAYAKKFRFPSFMSAFKFYNDYALKTNDKKKILERYEDRISIVALFFANGDTEKAKEYVNLMINQEYQPSTPTFLNAG
RKRRGELVSCFLLEVNDSLNDISRAIDISMQLSKLGGGVSLNLSKLRAKGEAIKDVENATKGVVGVMKLLDNAFRYADQM
GQRQGSGAAYLNIFHRDINDFLDTKKISADEDVRVKTLSIGVVIPDKFVELAREDKAAYVFYPHTIYKEYGQHMDEMDMN
EMYDKFVDNPRVKKEKINPRKLLEKLAMLRSESGYPYIMFQDNVNKVHANNHISKVKFSNLCSEVLQASQVSSYTDYDEE
DEIGLDISCNLGSLNILNVMEHKSIEKTVKLATDSLTHVSETTDIRNAPAVRRANKAMKSIGLGAMNLHGYLAQNGIAYE
SPEARDFANTFFMMVNFYSIQRSAEIAKEKGETFDQYEGSTYATGEYFDKYVSTDFSPKYEKIANLFEGMHIPTTEDWKK
LKAFVAEHGMYHSYRLCIAPTGSISYVQSSTASVMPIMERIEERTYGNSKTYYPMPGLASNNWFFYKEAYDMDMFKVVDM
IATIQQHIDQGISFTLFLKDTMTTRDLNRIDLYAHHRGIKTIYYARTKDTGQDSCLSCVV
>P00452 1.17.4.1~~~nrdA~~~Ribonucleoside-diphosphate reductase 1 subunit alpha~~~COG0209
MNQNLLVTKRDGSTERINLDKIHRVLDWAAEGLHNVSISQVELRSHIQFYDGIKTSDIHETIIKAAADLISRDAPDYQYL
AARLAIFHLRKKAYGQFEPPALYDHVVKMVEMGKYDNHLLEDYTEEEFKQMDTFIDHDRDMTFSYAAVKQLEGKYLVQNR
VTGEIYESAQFLYILVAACLFSNYPRETRLQYVKRFYDAVSTFKISLPTPIMSGVRTPTRQFSSCVLIECGDSLDSINAT
SSAIVKYVSQRAGIGINAGRIRALGSPIRGGEAFHTGCIPFYKHFQTAVKSCSQGGVRGGAATLFYPMWHLEVESLLVLK
NNRGVEGNRVRHMDYGVQINKLMYTRLLKGEDITLFSPSDVPGLYDAFFADQEEFERLYTKYEKDDSIRKQRVKAVELFS
LMMQERASTGRIYIQNVDHCNTHSPFDPAIAPVRQSNLCLEIALPTKPLNDVNDENGEIALCTLSAFNLGAINNLDELEE
LAILAVRALDALLDYQDYPIPAAKRGAMGRRTLGIGVINFAYYLAKHGKRYSDGSANNLTHKTFEAIQYYLLKASNELAK
EQGACPWFNETTYAKGILPIDTYKKDLDTIANEPLHYDWEALRESIKTHGLRNSTLSALMPSETSSQISNATNGIEPPRG
YVSIKASKDGILRQVVPDYEHLHDAYELLWEMPGNDGYLQLVGIMQKFIDQSISANTNYDPSRFPSGKVPMQQLLKDLLT
AYKFGVKTLYYQNTRDGAEDAQDDLVPSIQDDGCESGACKI
>P0A5W9 1.17.4.1~~~nrdE~~~Ribonucleoside-diphosphate reductase subunit alpha~~~
MPPTVIAEPVASGAHASYSGGPGETDYHALNAMLNLYDADGKIQFDKDREAAHQYFLQHVNQNTVFFHNQDEKLDYLIRE
NYYEREVLDQYSRNFVKTLLDRAYAKKFRFPTFLGAFKYYTSYTLKTFDGKRYLERFEDRVVMVALTLAAGDTALAELLV
DEIIDGRFQPATPTFLNSGKKQRGEPVSCFLLRVEDNMESIGRSINSALQLSKRGGGVALLLTNIREHGAPIKNIENQSS
GVIPIMKLLEDAFSYANQLGARQGAGAVYLHAHHPDIYRFLDTKRENADEKIRIKTLSLGVVIPDITFELAKRNDDMYLF
SPYDVERVYGVPFADISVTEKYYEMVDDARIRKTKIKAREFFQTLAELQFESGYPYIMFEDTVNRANPIDGKITHSNLCS
EILQVSTPSLFNEDLSYAKVGKDISCNLGSLNIAKTMDSPDFAQTIEVAIRALTAVSDQTHIKSVPSIEQGNNDSHAIGL
GQMNLHGYLARERIFYGSDEGIDFTNIYFYTVLYHALRASNRIAIERGTHFKGFERSKYASGEFFDKYTDQIWEPKTQKV
RQLFADAGIRIPTQDDWRRLKESVQAHGIYNQNLQAVPPTGSISYINHSTSSIHPIVSKVEIRKEGKIGRVYYPAPYMTN
DNLEYYEDAYEIGYEKIIDTYAAATQHVDQGLSLTLFFKDTATTRDVNKAQIYAWRKGIKTLYYIRLRQMALEGTEVEGC
VSCML
>P9WH75 1.17.4.1~~~nrdE~~~Ribonucleoside-diphosphate reductase subunit alpha~~~COG0209
MPPTVIAEPVASGAHASYSGGPGETDYHALNAMLNLYDADGKIQFDKDREAAHQYFLQHVNQNTVFFHNQDEKLDYLIRE
NYYEREVLDQYSRNFVKTLLDRAYAKKFRFPTFLGAFKYYTSYTLKTFDGKRYLERFEDRVVMVALTLAAGDTALAELLV
DEIIDGRFQPATPTFLNSGKKQRGEPVSCFLLRVEDNMESIGRSINSALQLSKRGGGVALLLTNIREHGAPIKNIENQSS
GVIPIMKLLEDAFSYANQLGARQGAGAVYLHAHHPDIYRFLDTKRENADEKIRIKTLSLGVVIPDITFELAKRNDDMYLF
SPYDVERVYGVPFADISVTEKYYEMVDDARIRKTKIKAREFFQTLAELQFESGYPYIMFEDTVNRANPIDGKITHSNLCS
EILQVSTPSLFNEDLSYAKVGKDISCNLGSLNIAKTMDSPDFAQTIEVAIRALTAVSDQTHIKSVPSIEQGNNDSHAIGL
GQMNLHGYLARERIFYGSDEGIDFTNIYFYTVLYHALRASNRIAIERGTHFKGFERSKYASGEFFDKYTDQIWEPKTQKV
RQLFADAGIRIPTQDDWRRLKESVQAHGIYNQNLQAVPPTGSISYINHSTSSIHPIVSKVEIRKEGKIGRVYYPAPYMTN
DNLEYYEDAYEIGYEKIIDTYAAATQHVDQGLSLTLFFKDTATTRDVNKAQIYAWRKGIKTLYYIRLRQMALEGTEVEGC
VSCML
>P9WH73 1.17.4.1~~~nrdF1~~~Ribonucleoside-diphosphate reductase subunit beta nrdF1~~~COG0208
MTGKLVERVHAINWNRLLDAKDLQVWERLTGNFWLPEKIPLSNDLASWQTLSSTEQQTTIRVFTGLTLLDTAQATVGAVA
MIDDAVTPHEEAVLTNMAFMESVHAKSYSSIFSTLCSTKQIDDAFDWSEQNPYLQRKAQIIVDYYRGDDALKRKASSVML
ESFLFYSGFYLPMYWSSRGKLTNTADLIRLIIRDEAVHGYYIGYKCQRGLADLTDAERADHREYTCELLHTLYANEIDYA
HDLYDELGWTDDVLPYMRYNANKALANLGYQPAFDRDTCQVNPAVRAALDPGAGENHDFFSGSGSSYVMGTHQPTTDTDW
DF
>P9WH71 1.17.4.1~~~nrdF2~~~Ribonucleoside-diphosphate reductase subunit beta nrdF2~~~COG0208
MTGNAKLIDRVSAINWNRLQDEKDAEVWDRLTGNFWLPEKVPVSNDIPSWGTLTAGEKQLTMRVFTGLTMLDTIQGTVGA
VSLIPDALTPHEEAVLTNIAFMESVHAKSYSQIFSTLCSTAEIDDAFRWSEENRNLQRKAEIVLQYYRGDEPLKRKVAST
LLESFLFYSGFYLPMYWSSRAKLTNTADMIRLIIRDEAVHGYYIGYKFQRGLALVDDVTRAELKDYTYELLFELYDNEVE
YTQDLYDEVGLTEDVKKFLRYNANKALMNLGYEALFPRDETDVNPAILSALSPNADENHDFFSGSGSSYVIGKAVVTEDD
DWDF
>P9WH69 1.-.-.-~~~nrdB~~~R2-like ligand binding oxidase~~~COG0208
MTRTRSGSLAAGGLNWASLPLKLFAGGNAKFWHPADIDFTRDRADWEKLSDDERDYATRLCTQFIAGEEAVTEDIQPFMS
AMRAEGRLADEMYLTQFAFEEAKHTQVFRMWLDAVGISEDLHRYLDDLPAYRQIFYAELPECLNALSADPSPAAQVRASV
TYNHIVEGMLALTGYYAWHKICVERAILPGMQELVRRIGDDERRHMAWGTFTCRRHVAADDANWTVFETRMNELIPLALR
LIEEGFALYGDQPPFDLSKDDFLQYSTDKGMRRFGTISNARGRPVAEIDVDYSPAQLEDTFADEDRRTLAAASA
>A4F7B2 1.-.-.-~~~~~~R2-like ligand binding oxidase~~~COG0208
MTSTATFREDFHSLRAGGLNWDSLPLRLFGKGNAKFWDPADIDFTRDAEDWQGLTEEERRSVAMLCSQFIAGEEAVTQDL
QPFMAAMAAEGRFGDEMYLTQFCFEEAKHTQVFRLWMDAVGLTGDLHSHVAENPGYRAIFYEELPRSLNALHDDPSPANQ
VRASVTYNHVVEGTLALTGYFAWQKICRSRGILPGMQEVVRRIGDDERRHMAWGTFTCRRHVAADESNWDVVQEQMQHLL
PLAVTQIQWRPEDAPEETPFRLDIDELAAYASDRAGRRLGAISAARGVPVEQIDVDASPEQLEDQFGVEDAAALEKA
>O67475 1.17.4.1~~~nrdB~~~Ribonucleoside-diphosphate reductase subunit beta~~~COG0208
MEKTEKNELVRKLIFNPQGDREASKRKIIKGNPTNIFELNEIKYSWAFDLYKLMGFTNFWIPEEIQMLEDRKQYETVLSD
YEKRAYELVLSFLIALDSFQVDMLKEFGRMITAPEVEMAITAQEFQESVHAYSYQFILESVVDPVKADEIYNYWREDERL
LERNKVIAELYNEFIRKPNEENFIKATIGNYILESLYFYSGFAFFYTLGRQGKMRNTVQQIKYINRDELCFIEGTEVLTK
RGFVDFRELREDDLVAQYDIETGEISWTKPYAYVERDYEGSMYRLKHPKSNWEVVATEGHEFIVRNLKTGKERKEPIEKV
KLHPYSAIPVAGRYTGEVEEYDLWELVSGKGITLKTRSAVKNKLTPIEKLLIVLQADGTIDSKRNGKFTGFQQLKFFFSK
YRKINEFEKILNECAPYGIKWKKYERQDGIAYTVYYPNDLPIKPTKFFDEWVRLDEITEEWIREFVEELVKWDGHIPKDR
NKKKVYYYSTKEKRNKDFVQALCALGGMRTVVSRERNPKAKNPVYRIWIYLEDDYINTQTMVKEEFYYKGKVYCVSVPKG
NIVVRYKDSVCIAGNCHVTLFRNIINTLRKENPELFTPEIEKWIVEYFKYAVNEEIKWGQYVTQNQILGINDVLIERYIK
YLGNLRITQIGFDPIYPEVTENPLKWIDEFRKINNTKTDFFQAKPQTYSKANELKW
>P50621 1.17.4.1~~~nrdF~~~Ribonucleoside-diphosphate reductase subunit beta~~~COG0208
MTKIYDAANWSKHEDDFTQMFYNQNVKQFWLPEEIALNGDLLTWKYLGKNEQDTYMKVLAGLTLLDTEQGNTGMPIVAEH
VDGHQRKAVLNFMAMMENAVHAKSYSNIFMTLAPTETINEVFEWVKQNKYLQKKAQMIVGLYKAIQKDDEISLFKAMVAS
VYLESFLFYSGFYYPLYFYGQGKLMQSGEIINLILRDEAIHGVYVGLLAQEIYNKQTEEKKAELREFAIDLLNQLYENEL
EYTEDLYDQVGLSHDVKKFIRYNANKALMNLGFDPYFEEEDINPIVLNGLNTKTKSHDFFSMKGNGYKKATVEPLKDDDF
YFEDEKEQI
>O84835 1.17.4.1~~~nrdB~~~Ribonucleoside-diphosphate reductase subunit beta~~~
MQADILDGKQKRVNLNSKRLVNCNQVDVNQLVPIKYKWAWEHYLNGCANNWLPTEIPMGKDIELWKSDRLSEDERRVILL
NLGFFSTAESLVGNNIVLAIFKHVTNPEARQYLLRQAFEEAVHTHTFLYICESLGLDEKEIFNAYNERAAIKAKDDFQME
ITGKVLDPNFRTDSVEGLQEFVKNLVGYYIIMEGIFFYSGFVMILSFHRQNKMIGIGEQYQYILRDETIHLNFGIDLING
IKEENPEIWTPELQQEIVELIKRAVDLEIEYAQDCLPRGILGLRASMFIDYVQHIADRRLERIGLKPIYHTKNPFPWMSE
TIDLNKEKNFFETRVIEYQHAASLTW
>P69924 1.17.4.1~~~nrdB~~~Ribonucleoside-diphosphate reductase 1 subunit beta~~~COG0208
MAYTTFSQTKNDQLKEPMFFGQPVNVARYDQQKYDIFEKLIEKQLSFFWRPEEVDVSRDRIDYQALPEHEKHIFISNLKY
QTLLDSIQGRSPNVALLPLISIPELETWVETWAFSETIHSRSYTHIIRNIVNDPSVVFDDIVTNEQIQKRAEGISSYYDE
LIEMTSYWHLLGEGTHTVNGKTVTVSLRELKKKLYLCLMSVNALEAIRFYVSFACSFAFAERELMEGNAKIIRLIARDEA
LHLTGTQHMLNLLRSGADDPEMAEIAEECKQECYDLFVQAAQQEKDWADYLFRDGSMIGLNKDILCQYVEYITNIRMQAV
GLDLPFQTRSNPIPWINTWLVSDNVQVAPQEVEVSSYLVGQIDSEVDTDDLSNFQL
>Q9KFH7 1.17.4.1~~~nrdB~~~Ribonucleoside-diphosphate reductase subunit beta~~~COG0208
MEQLQKRKIYDTTASNASTGILNGKSSNVLNWDDVRFSWAYPLYKNMLANFWTPFEINMSHDAKQFPTLTETEQEAFKKI
IGLLAFLDSVQTDYSMRAAEYLTDSSLAALMSVLSFQEVVHNQSYSYVLSSLVPKATQDEIFEYWKHDDVLKERNEFIID
GYEKFVDNPTPKTFLESIVYDVILEGLNFYSGFAFFYNLARNQKMVSTSTMINYINRDEQLHVYLFTNIFKELLVEFPEL
NTEETKTFVKTTLMKAADLEKDWFRYIIGDKIPGINPEDMETYISFIANKRAVQLGMEKPYPEIKHNPMKWIRAYEDVNS
GKSDFFEQKSRQYAKVSADNGFDEL
>P39452 1.17.4.1~~~nrdE~~~Ribonucleoside-diphosphate reductase 2 subunit alpha~~~COG0209
MATTTAECLTQETMDYHALNAMLNLYDSAGRIQFDKDRQAVDAFIATHVRPNSVTFSSQQQRLNWLVNEGYYDESVLNRY
SRDFVITLFTHAHTSGFRFQTFLGAWKFYTSYTLKTFDGKRYLEDFADRVTMVALTLAQGDETLALQLTDEMLSGRFQPA
TPTFLNCGKQQRGELVSCFLLRIEDNMESIGRAVNSALQLSKRGGGVAFLLSNLREAGAPIKRIENQSSGVIPVMKMLED
AFSYANQLGARQGAGAVYLHAHHPDILRFLDTKRENADEKIRIKTLSLGVVIPDITFHLAKENAQMALFSPYDVERVYGK
PFADVAISQHYDELVADERIRKKYLNARDFFQRLAEIQFESGYPYIMYEDTVNRANPIAGRINMSNLCSEILQVNSASEY
DENLDYTRTGHDISCNLGSLNIAHTMDSPDFARTVETAVRGLTAVSDMSHIRSVPSIEAGNAASHAIGLGQMNLHGYLAR
EGIAYGSPEALDFTNLYFYAITWHALRTSMLLARERGETFAGFKQSRYASGEYFSQYLQGNWQPKTAKVGELFTRSGITL
PTREMWAQLRDDVMRYGIYNQNLQAVPPTGSISYINHATSSIHPIVAKVEIRKEGKTGRVYYPAPFMTNENLALYQDAYE
IGAEKIIDTYAEATRHVDQGLSLTLFFPDTATTRDINKAQIYAWRKGIKTLYYIRLRQMALEGTEIEGCVSCAL
>Q08698 1.17.4.1~~~nrdE~~~Ribonucleoside-diphosphate reductase 2 subunit alpha~~~
MATTTPERVMQETMDYHALNAMLNLYDKAGHIQFDKDQQAIDAFFATHVRPHSVTFASQHERLGTLVREGYYDDAVLARY
DRAFVLRLFEHAHASGFRFQTFLGAWKFYTSYTLKTFDGKRYLEHFEDRVTMVALTLAQGDETLATQLTDEMLSGRFQPA
TPTFLNCGKQQRGELVSCFLLRIEDNMESIGRAVNSALQLSKRGGGVAFLLSNLREAGAPIKRIENQSSGVIPVMKMLED
AFSYANQLGARQGAGAVYLHAHHPDILRFLDTKRENADEKIRIKTLSLGVVIPDITFRLAKENAQMALFSPYDIQRRYGK
PFGDIAISERYDELIADPHVRKTYINARDFFQTLAEIQFESGYPYIMFEDTVNRANPIAGRINMSNLCSEILQVNSASRY
DDNLDYTHIGHDISCNLGSLNIAHVMDSPDIGRTVETAIRGLTAVSDMSHIRSVPSIAAGNAASHAIGLGQMNLHGYLAR
EGIAYGSPEALDFTNLYFYTITWHAVHTSMRLARERGKTFAGFAQSRYASGDYFTQYLQDDWQPKTAKVRALFARSGITL
PTREMWLKLRDDVMRYGIYNQNLQAVPPTGSISYINHATSSIHPIVAKIEIRKEGKTGRVYYPAPFMTNENLDMYQDAYD
IGPEKIIDTYAEATRHVDQGLSLTLFFPDTATTRDINKAQIYAWRKGIKSLYYIRLRQLALEGTEIEGCVSCAL
>P37146 1.17.4.1~~~nrdF~~~Ribonucleoside-diphosphate reductase 2 subunit beta~~~COG0208
MKLSRISAINWNKISDDKDLEVWNRLTSNFWLPEKVPLSNDIPAWQTLTVVEQQLTMRVFTGLTLLDTLQNVIGAPSLMP
DALTPHEEAVLSNISFMEAVHARSYSSIFSTLCQTKDVDAAYAWSEENAPLQRKAQIIQQHYRGDDPLKKKIASVFLESF
LFYSGFWLPMYFSSRGKLTNTADLIRLIIRDEAVHGYYIGYKYQKNMEKISLGQREELKSFAFDLLLELYDNELQYTDEL
YAETPWADDVKAFLCYNANKALMNLGYEPLFPAEMAEVNPAILAALSPNADENHDFFSGSGSSYVMGKAVETEDEDWNF
>P17424 1.17.4.1~~~nrdF~~~Ribonucleoside-diphosphate reductase 2 subunit beta~~~
MKLSRISAINWNKIQDDKDLEVWNRLTSNFWLPEKVPLSNDIPAWQTLSAAEQQLTIRVFTGLTLLDTIQNIAGAPSLMA
DAITPHEEAVLSNISFMEAVHARSYSSIFSTLCQTKEVDAAYAWSEENPPLQRKAQIILAHYVSDEPLKKKIASVFLESF
LFYSGFWLPMYFSSRGKLTNTADLIRLIIRDEAVHGYYIGYKYQIALQKLSAIEREELKLFALDLLMELYDNEIRYTEAL
YAETGWVNDVKAFLCYNANKALMNLGYEALFPPEMADVNPAILAALSPNADENHDFFSGSGSSYVMGKTVETEDEDWNF
>P16440 2.5.1.9~~~ribE~~~Riboflavin synthase~~~COG0307
MFTGIIEETGTIESMKKAGHAMALTIKCSKILEDVHLGDSIAVNGICLTVTDFTKNQFTVDVMPETVKATSLNDLTKGSK
VNLERAMAANGRFGGHFVSGHVDGTAEITRIEEKSNAVYYDLKMDPSLTKTLVLKGSITVDGVSLTIFGLTEDTVTISLI
PHTISETIFSEKTIGSKVNIECDMIGKYMYRFLHKANENKTQQTITKAFLSENGF
>Q2YN92 2.5.1.9~~~ribE~~~Riboflavin synthase~~~
MFTGIITDIGKVDRVKPLNEGVLLRIETAYDPETIELGASIACSGVCLTVVALPEKGSNARWFEVEAWEEALRLTTISSW
QSGRKINLERSLKLGDEMGGHLVFGHVDGQAEIVERKDEGDAVRFTLRAPEELAPFIAQKGSVALDGTSLTVNGVNANEF
DVLLIRHSLEVTTWGERKAGDKVNIEIDQLARYAARLAQYQK
>P0AFU8 2.5.1.9~~~ribC~~~Riboflavin synthase~~~COG0307
MFTGIVQGTAKLVSIDEKPNFRTHVVELPDHMLDGLETGASVAHNGCCLTVTEINGNHVSFDLMKETLRITNLGDLKVGD
WVNVERAAKFSDEIGGHLMSGHIMTTAEVAKILTSENNRQIWFKVQDSQLMKYILYKGFIGIDGISLTVGEVTPTRFCVH
LIPETLERTTLGKKKLGARVNIEIDPQTQAVVDTVERVLAARENAMNQPGTEA
>P9WK35 2.5.1.9~~~ribE~~~Riboflavin synthase~~~COG0307
MFTGIVEERGEVTGREALVDAARLTIRGPMVTADAGHGDSIAVNGVCLTVVDVLPDGQFTADVMAETLNRSNLGELRPGS
RVNLERAAALGSRLGGHIVQGHVDATGEIVARCPSEHWEVVRIEMPASVARYVVEKGSITVDGISLTVSGLGAEQRDWFE
VSLIPTTRELTTLGSAAVGTRVNLEVDVVAKYVERLMRSAG
>P51961 2.5.1.9~~~ribE~~~Riboflavin synthase~~~
MFTGIIEAVGNISAITSKGSDFEVSVNCDTLDLADVKIGDSIATNGICLTVVKLTANSYVADLSIETLSRTAFNYYKVGQ
AVNLEKAMLPTTRFGGHIVSGHVDAVAEVIECRTSGRAIDIWIRVPSQIEKYLSEKGSVTVDGVSLTVNAVTGNEFKLTI
VPHTVVETTIADFKVGNKVNIEVDVLARYIERLLLVDKPEDKQSKISMDLLERNGFLL
>Q2YNC6 2.5.1.78~~~ribH1~~~6,7-dimethyl-8-ribityllumazine synthase 1~~~
MEFLMSKHEADAPHLLIVEARFYDDLADALLDGAKAALDEAGATYDVVTVPGALEIPATISFALDGADNGGTEYDGFVAL
GTVIRGETYHFDIVSNESCRALTDLSVEESIAIGNGILTVENEEQAWVHARREDKDKGGFAARAALTMIGLRKKFGA
>Q8YGH2 2.5.1.78~~~ribH1~~~6,7-dimethyl-8-ribityllumazine synthase 1~~~COG0054
MEFLMSKHEADAPHLLIVEARFYDDLADALLDGAKAALDEAGATYDVVTVPGALEIPATISFALDGADNGGTEYDGFVAL
GTVIRGETYHFDIVSNESCRALTDLSVEESIAIGNGILTVENEEQAWVRARREDKDKGGFAARAALTMIGLRKKFGA
>Q983B0 2.5.1.78~~~ribH1~~~6,7-dimethyl-8-ribityllumazine synthase 1~~~COG0054
MAGISQHGKAFIRPKAKAHLLIVEARFHDDLADALLDGATSALEEAGATYDVVTVPGSLEIPAVITFALDGAAEGGTNYD
GFVALGTIIRGDTYHFDIVANESSRALMDMSVQDSVCIGNGILTTENDAQAWTRAKRSEGDKGGFAARAALTMIALKEQL
GARS
>Q2YKV1 2.5.1.78~~~ribH2~~~6,7-dimethyl-8-ribityllumazine synthase 2~~~
MNQSCPNKTSFKIAFIQARWHADIVDEARKSFVAELAAKTGGSVEVEIFDVPGAYEIPLHAKTLARTGRYAAIVGAAFVI
DGGIYRHDFVATAVINGMMQVQLETEVPVLSVVLTPHHFHESKEHHDFFHAHFKVKGVEAAHAALQIVSERSRIAALV
>Q986N2 2.5.1.78~~~ribH2~~~6,7-dimethyl-8-ribityllumazine synthase 2~~~COG0054
MNQHSHKDYETVRIAVVRARWHADIVDQCVSAFEAEMADIGGDRFAVDVFDVPGAYEIPLHARTLAETGRYGAVLGTAFV
VNGGIYRHEFVASAVIDGMMNVQLSTGVPVLSAVLTPHNYHDSAEHHRFFFEHFTVKGKEAARACVEILAAREKIAA
>O66529 2.5.1.78~~~ribH~~~6,7-dimethyl-8-ribityllumazine synthase~~~COG0054
MQIYEGKLTAEGLRFGIVASRFNHALVDRLVEGAIDCIVRHGGREEDITLVRVPGSWEIPVAAGELARKEDIDAVIAIGV
LIRGATPHFDYIASEVSKGLANLSLELRKPITFGVITADTLEQAIERAGTKHGNKGWEAALSAIEMANLFKSLR
>Q81MB5 2.5.1.78~~~ribH~~~6,7-dimethyl-8-ribityllumazine synthase~~~COG0054
MVFEGHLVGTGLKVGVVVGRFNEFITSKLLGGALDGLKRHGVEENDIDVAWVPGAFEIPLIAKKMANSGKYDAVITLGTV
IRGATTHYDYVCNEVAKGVASLSLQTDIPVIFGVLTTETIEQAIERAGTKAGNKGYESAVAAIEMAHLSKHWA
>P11998 2.5.1.78~~~ribH~~~6,7-dimethyl-8-ribityllumazine synthase~~~COG0054
MNIIQGNLVGTGLKIGIVVGRFNDFITSKLLSGAEDALLRHGVDTNDIDVAWVPGAFEIPFAAKKMAETKKYDAIITLGT
VIRGATTHYDYVCNEAAKGIAQAANTTGVPVIFGIVTTENIEQAIERAGTKAGNKGVDCAVSAIEMANLNRSFE
>Q9ZNM0 2.5.1.78~~~ribH~~~6,7-dimethyl-8-ribityllumazine synthase~~~COG0054
MNIIQSGITAENSSIAIIIARFNEFINKNLLLGALDTLKRIGQVHEENILKIYVPGTYEIPTIASYIAKSGKYDAIIAIG
TIIKGQTDHFKYIANDTSSSLSRISAQYFLPITLGILTTKNIEQSIERSGTKMGNKGSDAALAALEMINVMKKLKKVIYY
>Q83DP8 2.5.1.78~~~ribH~~~6,7-dimethyl-8-ribityllumazine synthase~~~COG0054
MTESSFKLAIVVSQFNRAVTEKLLNGVLQRLTELGVQANQIKTVWVPGAVEIPLLAKRLAKSKHYQAVVCLGAVIRGETD
HYNYVCQQVSFGCQQVALEYEVPIIFGVLTTTTKEQAFARAGGERGNKGADWADAAVSMIKLMKEIEITDE
>P61940 2.5.1.78~~~ribH~~~6,7-dimethyl-8-ribityllumazine synthase~~~COG0054
MLHIKTIEGQLDAKGLKFAIVATRFNDFIVDRLIGGACDYLQRHGCDRENLTIVRIPGAFEMPLVAKKLAHSGKYDGIIA
LGAVIRGATPHFDFVSNEASKGLAQACLESGVPLGFGLLTTDNIEQAIERAGSKAGNKGAEAAAAVLETVRVMEQL
>P61714 2.5.1.78~~~ribE~~~6,7-dimethyl-8-ribityllumazine synthase~~~COG0054
MNIIEANVATPDARVAITIARFNNFINDSLLEGAIDALKRIGQVKDENITVVWVPGAYELPLAAGALAKTGKYDAVIALG
TVIRGGTAHFEYVAGGASNGLAHVAQDSEIPVAFGVLTTESIEQAIERAGTKAGNKGAEAALTALEMINVLKAIKA
>B8ZUN3 2.5.1.78~~~ribH~~~6,7-dimethyl-8-ribityllumazine synthase~~~
MSGGAGIPEVPGIDASGLRLGIVASTWHSRICDALLAGARKVAADSGIDGPTVVRVLGAIEIPVVVQELARHHDAVVALG
VVIRGDTPHFDYVCNSVTQGLTRIALDTSTPVGNGVLTTNTEKQALDRAGLPTSAEDKGAQAAAAALTTALTLLNLRSRI
>P9WHE9 2.5.1.78~~~ribH~~~6,7-dimethyl-8-ribityllumazine synthase~~~COG0054
MKGGAGVPDLPSLDASGVRLAIVASSWHGKICDALLDGARKVAAGCGLDDPTVVRVLGAIEIPVVAQELARNHDAVVALG
VVIRGQTPHFDYVCDAVTQGLTRVSLDSSTPIANGVLTTNTEEQALDRAGLPTSAEDKGAQATVAALATALTLRELRAHS
>Q02SM0 2.5.1.78~~~ribH~~~6,7-dimethyl-8-ribityllumazine synthase~~~
MTLKTIEGTFIAPKGRYALVVGRFNSFVVESLVSGAVDALVRHGVAESEITIIRAPGAFEIPLVTQKVAQQGGFDAIIAL
GAVIRGGTPHFEYVAGECTKGLAQVSLQFGIPVAFGVLTVDSIEQAIERSGTKAGNKGAEAALSALEMVSLLAQLEAK
>P66038 2.5.1.78~~~ribH~~~6,7-dimethyl-8-ribityllumazine synthase~~~
MNIIKANVAAPDARVAITIARFNQFINDSLLDGAVDALTRIGQVKDDNITVVWVPGAYELPLATEALAKSGKYDAVVALG
TVIRGGTAHFEYVAGGASNGLASVAQDSGVPVAFGVLTTESIEQAIERAGTKAGNKGAEAALTALEMINVLKAIKA
>P99141 2.5.1.78~~~ribH~~~6,7-dimethyl-8-ribityllumazine synthase~~~
MNFEGKLIGKDLKVAIVVSRFNDFITGRLLEGAKDTLIRHDVNEDNIDVAFVPGAFEIPLVAKKLASSGNYDAIITLGCV
IRGATSHYDYVCNEVAKGVSKVNDQTNVPVIFGILTTESIEQAVERAGTKAGNKGAEAAVSAIEMANLLKSIKA
>B5UAT8 6.3.2.48~~~rizA~~~L-arginine-specific L-amino acid ligase~~~
MLRILLINSDKPEPIQFFQKDKETNDSINISVITRSCYAPLYSHWADHVYIVDDVTDLTVMKSLMLEILKVGPFDHIVST
TEKSILTGGFLRSYFGIAGPGFETALYMTNKLAMKTKLKMEGIPVADFLCVSQVEDIPAAGEKLGWPIIVKPALGSGALN
TFIIHSLDHYEDLYSTSGGLGELKKNNSLMIAEKCIEMEEFHCDTLYADGEILFVSISKYTVPLLKGMAKIQGSFILSQN
DPVYAEILELQKSVAQAFRITDGPGHLEIYRTHSGELIVGEIAMRIGGGGISRMIEKKFNISLWESSLNISVYRDPNLTV
NPIEGTVGYFSLPCRNGTIKEFTPIEEWEKLAGILEVELLYQEGDVVDEKQSSSFDLARLYFCLENENEVQHLLALVKQT
YYLHLTEDHMMNQ
>Q8UE06 ~~~rplJ~~~Large ribosomal subunit protein uL10~~~COG0244
MERAEKREFVTELNEVFKASGSVVVAHYAGVTVAQMNDFRSKMRAAGGTVKVAKNRLAKIALQGTESEGMTNLFKGQTLI
AYSVDPMIAPKVVMDFAKTNDKVVVLGGSMGATTLNAEAVKSLATLPSLDELRAKLLGLLNAPATRVATVVAAPASQLAR
VFSAYAKKDEAA
>P42923 ~~~rplJ~~~Large ribosomal subunit protein uL10~~~COG0244
MSSAIETKKVVVEEIASKLKESKSTIIVDYRGLNVSEVTELRKQLREANVEFKVYKNTMTRRAVEQAELNGLNDFLTGPN
AIAFSTEDVVAPAKVLNDFAKNHEALEIKAGVIEGKVSTVEEVKALAELPSREGLLSMLLSVLQAPVRNLALAAKAVAEQ
KEEQGA
>Q9RSS9 ~~~rplJ~~~Large ribosomal subunit protein uL10~~~COG0244
MANEKNQQTLGSLKDSLQGIETFYVVDYQGLTAGQLTQLRKDIREKGGQLIVAKNTLLNLALQEGGRDFDDALKGPSALV
LAQEDPAGVAKALSDAAGRNDRGIPTVKGGFVEGSKVDVAVVQRLASLGSKTTLQAELVGVLSAHLSNFVGILEAYREKL
EGEGGSESA
>P0A7J3 ~~~rplJ~~~Large ribosomal subunit protein uL10~~~COG0244
MALNLQDKQAIVAEVSEVAKGALSAVVADSRGVTVDKMTELRKAGREAGVYMRVVRNTLLRRAVEGTPFECLKDAFVGPT
LIAYSMEHPGAAARLFKEFAKANAKFEVKAAAFEGELIPASQIDRLATLPTYEEAIARLMATMKEASAGKLVRTLAAVRD
AKEAA
>P44350 ~~~rplJ~~~Large ribosomal subunit protein uL10~~~COG0244
MALNLQDKQAIVAEVNEAAKGALSAVIADSRGVTVEKMTELRKSAREAGVTMRVVRNTLLRRAVEGTDYECLKDTFVGPT
LIAFSNEHPGARARLFKEFAKANDKFEIKGAAFEGKIQDVEFLATLPTYEEAIARLMGTMKEAAAGKLARTFAALRDKLQ
EAA
>P56036 ~~~rplJ~~~Large ribosomal subunit protein uL10~~~COG0244
MQKQHQRQHKVELVANLKSQFADAKALLICDYKGLSVRKLEALRNKARNQGIKVQVIKNTLAHIAMKETGYSDLDLKETN
VFLWGGDQIALSKLVFDFQKEHKDHFVLKAGLFDKESVSVAHVEAVSKLPSKEELMGMLLSVWTAPARYFVTGLDNLRKA
KEEN
>P75240 ~~~rplJ~~~Large ribosomal subunit protein uL10~~~
MEAKKDKAQQVADVSHLLSTSAGFVIFDYTSMSAIEATSIRKKLFKNGSKIKVIKNNILRRALKAGKFEGIDETAIKGKL
AVAVGVNEIVETLKAVDGVVKAKEAMNFVCGYFDNRAFNSADLEKIAKLPGRNELYGMFLSVLQAPLRKFLYALEAVKAA
K
>A0QS62 ~~~rplJ~~~Large ribosomal subunit protein uL10~~~COG0244
MAKADKATAVADIAEQFKASTATVVTEYRGLTVANLAELRRALGDSATYTVAKNTLVKRAASEAGIEGLDELFAGPTAIA
FVKGEAVDAAKAIKKFAKDNKALVIKGGYMDGKALSVADVEKIADLESREVLLAKLAGAMKGNLSKAAGLFNAPASQVAR
LAAALQEKKAGEEAA
>P9WHE7 ~~~rplJ~~~Large ribosomal subunit protein uL10~~~COG0244
MARADKATAVADIAAQFKESTATLITEYRGLTVANLAELRRSLTGSATYAVAKNTLIKRAASEAGIEGLDELFVGPTAIA
FVTGEPVDAAKAIKTFAKEHKALVIKGGYMDGHPLTVAEVERIADLESREVLLAKLAGAMKGNLAKAAGLFNAPASQLAR
LAAALQEKKACPGPDSAE
>Q9HWC7 ~~~rplJ~~~Large ribosomal subunit protein uL10~~~
MAIKLEDKKAIVAEVNEAAKAALSAVVADARGVTVGAMTGLRKEAREAGVYVKVVRNTLLKRAVEGTQFDVLNDVFKGPT
LIAFSNEHPGAAARIFREFAKGQDKFEIKAAAFEGQFLAANQIDVLASLPTYDEAVSQLMSVIQGATSKLARTLAAIRDQ
KEAAAA
>Q6N4R7 ~~~rplJ~~~Large ribosomal subunit protein uL10~~~COG0244
MVLLAGTANRRELAVERAAKKEAVESLNGLFQTTSVAIVAHYSGLTVAQMQKLRQQMKQAGASVKVSKNRLAKIALEGTD
VAAIGPLLKGPTVIATSSDPVAAPKVAVEFAKANEKFVILGGSMGTTVLNVDGVKALASLPSLDELRAKLVGLVQAPATK
IAQVTTAPAAKLARVVQAYASKSEAA
>P99155 ~~~rplJ~~~Large ribosomal subunit protein uL10~~~
MSAIIEAKKQLVDEIAEVLSNSVSTVIVDYRGLTVAEVTDLRSQLREAGVEYKVYKNTMVRRAAEKAGIEGLDEFLTGPT
AIATSSEDAVAAAKVISGFAKDHEALEIKSGVMEGNVITAEEVKTVGSLPSHDGLVSMLLSVLQAPVRNFAYAVKAIGEQ
KEENAE
>Q5XCB5 ~~~rplJ~~~Large ribosomal subunit protein uL10~~~
MSEAIIAKKAEQVELIAEKMKAAASIVVVDSRGLTVDQDTVLRRSLRESGVEFKVIKNSILTRAAEKAGLDELKDVFVGP
SAVAFSNEDVIAPAKVINDFTKTADALEIKGGAIEGAVSSKEEIQALATLPNREGMLSMLLSVLQAPVRNVAYAVKAVAE
NKEGAA
>P29394 ~~~rplJ~~~Large ribosomal subunit protein uL10~~~COG0244
MLTRQQKELIVKEMSEIFKKTSLILFADFLGFTVADLTELRSRLREKYGDGARFRVVKNTLLNLALKNAEYEGYEEFLKG
PTAVLYVTEGDPVEAVKIIYNFYKDKKADLSRLKGGFLEGKKFTAEEVENIAKLPSKEELYAMLVGRVKAPITGLVFALS
GILRNLVYVLNAIKEKKSE
>Q72GS1 ~~~rplJ~~~Large ribosomal subunit protein uL10~~~COG0244
MPNKRNVELLATLKENLERAQGSFFLVNYQGLPAKETHALRQALKQNGARLFVAKNTLIRLALKELGLPELDGLQGPSAV
VFYEDPVAAAKTLVQFAKSNPKGIPQVKSGLLQGQILTAKDVEALAELPTMDELRAELVGVLQAPMAELVGVLGGVAREL
VGILEAYAEKKAA
>Q8VVE3 ~~~rplJ~~~Large ribosomal subunit protein uL10~~~COG0244
MPNKRNVELLATLKENLERAQGSFFLVNYQGLPAKETHALRQALKQNGARLFVAKNTLIRLALKELGLPELDGLQGPSAV
VFYEDPVAAAKTLVQFAKSNPKGIPQVKSGLLQGQILTAKDVEALAELPTMDELRAELVGVLQAPMAELVGVLGGVAREL
VGILEAYAEKKAA
>Q06796 ~~~rplK~~~Large ribosomal subunit protein uL11~~~COG0080
MAKKVVKVVKLQIPAGKANPAPPVGPALGQAGVNIMGFCKEFNARTADQAGLIIPVEISVYEDRSFTFITKTPPAAVLLK
KAAGIESGSGEPNRNKVATVKRDKVREIAETKMPDLNAADVEAAMRMVEGTARSMGIVIED
>Q9RSS7 ~~~rplK~~~Large ribosomal subunit protein uL11~~~COG0080
MKKVAGIVKLQLPAGKATPAPPVGPALGQYGANIMEFTKAFNAQTADKGDAIIPVEITIYADRSFTFITKTPPMSYLIRK
AAGIGKGSSTPNKAKVGKLNWDQVLEIAKTKMPDLNAGSVEAAANTVAGTARSMGVTVEGGPNA
>P0A7J7 ~~~rplK~~~Large ribosomal subunit protein uL11~~~COG0080
MAKKVQAYVKLQVAAGMANPSPPVGPALGQQGVNIMEFCKAFNAKTDSIEKGLPIPVVITVYADRSFTFVTKTPPAAVLL
KKAAGIKSGSGKPNKDKVGKISRAQLQEIAQTKAADMTGADIEAMTRSIEGTARSMGLVVED
>P66054 ~~~rplK~~~Large ribosomal subunit protein uL11~~~COG0080
MAKKVIKEVKLQIPAGKANPAPPVGPALGQAGVNIMGFCKEFNARTADQAGLIIPVVITVFEDRSFTFITKTPPAAVLLK
KAAKVEKGSGEPNKTKVASVTRAQVQEIAETKMPDLNAANVESAMLMVEGTARSMGITIQD
>P75550 ~~~rplK~~~Large ribosomal subunit protein uL11~~~
MAKKTITRIAKINLLGGQAKPGPALASVGINMGEFTKQFNEKTKDKQGEMIPCVITAYNDKSFDFILKTTPVSILLKQAA
KLEKGAKNAKTIVGKITMAKAKEIAQYKLVDLNANTVEAALKMVLGTAKQMGIEVIE
>A0QS45 ~~~rplK~~~Large ribosomal subunit protein uL11~~~COG0080
MAPKKKVAGLIKLQIQAGQANPAPPVGPALGQHGVNIMEFCKAYNAATESQRGNVIPVEITVYEDRSFTFALKTPPAAKL
LLKAAGVQKGSGEPHKTKVAKVTWDQVREIAETKKADLNANDIDAAAKIIAGTARSMGITVE
>P9WHE5 ~~~rplK~~~Large ribosomal subunit protein uL11~~~COG0080
MAPKKKVAGLIKLQIVAGQANPAPPVGPALGQHGVNIMEFCKAYNAATENQRGNVIPVEITVYEDRSFTFTLKTPPAAKL
LLKAAGVAKGSAEPHKTKVAKVTWDQVREIAETKKTDLNANDVDAAAKIIAGTARSMGITVE
>Q9HWC5 ~~~rplK~~~Large ribosomal subunit protein uL11~~~
MAKKIQAYIKLQVKAGQANPSPPVGPALGQHGVNIMEFCKAFNAKTQGQEPGLPTPVIITVYSDRSFTFETKSTPAAVLL
KKAAGITSGSARPNSQKVGTVTRAQLEEIAKTKQADLTAADLDAAVRTIAGSARSMGLNVEGV
>P62441 ~~~rplK~~~Large ribosomal subunit protein uL11~~~COG0080
MAKKVTGYLKLQVPAGAANPSPPIGPALGQRGLNIMEFCKAFNAQTQKEEKNTPIPVVITIYADRSFTFEMKTPPMSYFL
KQAAKIQSGSKLPGRDFAGKVTSAQVREIAEKKMKDLNCDTVESAMRMVEGSARSMGLRVEG
>P0A0F2 ~~~rplK~~~Large ribosomal subunit protein uL11~~~
MAKKVDKVVKLQIPAGKANPAPPVGPALGQAGVNIMGFCKEFNARTQDQAGLIIPVEISVYEDRSFTFITKTPPAPVLLK
KAAGIEKGSGEPNKTKVATVTKDQVREIANSKMQDLNAADEEAAMRIIEGTARSMGIVVE
>P29395 ~~~rplK~~~Large ribosomal subunit protein uL11~~~COG0080
MAKKVAAQIKLQLPAGKATPAPPVGPALGQHGVNIMEFCKRFNAETADKAGMILPVVITVYEDKSFTFIIKTPPASFLLK
KAAGIEKGSSEPKRKIVGKVTRKQIEEIAKTKMPDLNANSLEAAMKIIEGTAKSMGIEVVD
>P62442 ~~~rplK~~~Large ribosomal subunit protein uL11~~~COG0080
MKKVVAVVKLQLPAGKATPAPPVGPALGQHGANIMEFVKAFNAATANMGDAIVPVEITIYADRSFTFVTKTPPASYLIRK
AAGLEKGAHKPGREKVGRITWEQVLEIAKQKMPDLNTTDLEAAARMIAGSARSMGVEVVGAPEVKDA
>Q5SLP6 ~~~rplK~~~Large ribosomal subunit protein uL11~~~COG0080
MKKVVAVVKLQLPAGKATPAPPVGPALGQHGANIMEFVKAFNAATANMGDAIVPVEITIYADRSFTFVTKTPPASYLIRK
AAGLEKGAHKPGREKVGRITWEQVLEIAKQKMPDLNTTDLEAAARMIAGSARSMGVEVVGAPEVKDA
>P36238 ~~~rplK~~~Large ribosomal subunit protein uL11~~~
MKKVVAVVKLQLPAGKATPAPPVGPALGQHGANIMEFVKAFNAATANMGDAIVPVEITIYADRSFTFVTKTPPASYLIRK
AAGLEKGAHKPGREKVGRITWEQVLEIAKQKMPDLNTTDLEAAARMIAGSARSMGVEVVGAPEVKDA
>B7I9B0 ~~~rplM~~~Large ribosomal subunit protein uL13~~~
MKTLSAKPAEVQHDWFVVDATGKTLGRLATEIARRLRGKHKTSYTPHVDTGDYIIVINAEQVQVTGNKALDKKYYRHTEF
PGGLKETNFEKLVAHKPEEIFERAVKGMLPKGPLGYAMIKKMKVYAGSEHPHAAQQPQVLDI
>P70974 ~~~rplM~~~Large ribosomal subunit protein uL13~~~COG0102
MRTTPMANASTIERKWLVVDAAGKTLGRLSSEVAAILRGKHKPTYTPHVDTGDHVIIINAEKIELTGKKLTDKIYYRHTQ
HPGGLKSRTALEMRTNYPEKMLELAIKGMLPKGSLGRQMFKKLNVYRGSEHPHEAQKPEVYELRG
>Q9RXY1 ~~~rplM~~~Large ribosomal subunit protein uL13~~~COG0102
MAFPDTDVSPPRGGPSSPAKSPLLRSFKVKTYIPKNDEQNWVVVDASGVPLGRLATLIASRIRGKHRPDFTPNMIQGDFV
VVINAAQVALTGKKLDDKVYTRYTGYQGGLKTETAREALSKHPERVIEHAVFGMLPKGRQGRAMHTRLKVYAGETHPHSA
QKPQVLKTQPLEVK
>P0AA10 ~~~rplM~~~Large ribosomal subunit protein uL13~~~COG0102
MKTFTAKPETVKRDWYVVDATGKTLGRLATELARRLRGKHKAEYTPHVDTGDYIIVLNADKVAVTGNKRTDKVYYHHTGH
IGGIKQATFEEMIARRPERVIEIAVKGMLPKGPLGRAMFRKLKVYAGNEHNHAAQQPQVLDI
>Q8Y458 ~~~rplM~~~Large ribosomal subunit protein uL13~~~COG0102
MRTTYMAKPGEVERKWYVIDATGVSLGRLSSEVASILRGKNKPQFTPHIDTGDFVIIINAGKIGLTGKKATDKIYYRHSQ
YPGGLKSRTAGEMRTNNPEKLLELSIKGMLPKNSLGRQLFKKLHVYGGSEHEHAAQQPEVYELRG
>P75178 ~~~rplM~~~Large ribosomal subunit protein uL13~~~
MQKTSMLTKEQANKRRQWYIVDAAGLVLGKLAVKAADLIRGKNKVDFTPNQDCGDYLIIINSDQVVLTGNKKENEFWYHH
SQYIGGIKKVSGRDMLKKQSDKLVYNAVKGMLPDNRLSRRWITKVHVFKGDKHNMEAQKPTTLNWS
>A0QSP8 ~~~rplM~~~Large ribosomal subunit protein uL13~~~COG0102
MPTYTPKAGDTTRSWYVIDASDVVLGRLASAAATLLRGKHKPTFTPNVDGGDFVIVINADKIAVSGDKLTKKFAYRHSGY
PGGLRKRTIGELLEKHPTRVVENAIIGMLPHNKLGRQIQKKLKVYAGPDHPHAAQQPIPFEIKQVAQ
>A5U8B9 ~~~rplM~~~Large ribosomal subunit protein uL13~~~COG0102
MPTYAPKAGDTTRSWYVIDATDVVLGRLAVAAANLLRGKHKPTFAPNVDGGDFVIVINADKVAISGDKLQHKMVYRHSGY
PGGLHKRTIGELMQRHPDRVVEKAILGMLPKNRLSRQIQRKLRVYAGPEHPHSAQQPVPYELKQVAQ
>P9WHE1 ~~~rplM~~~Large ribosomal subunit protein uL13~~~COG0102
MPTYAPKAGDTTRSWYVIDATDVVLGRLAVAAANLLRGKHKPTFAPNVDGGDFVIVINADKVAISGDKLQHKMVYRHSGY
PGGLHKRTIGELMQRHPDRVVEKAILGMLPKNRLSRQIQRKLRVYAGPEHPHSAQQPVPYELKQVAQ
>Q9HVY2 ~~~rplM~~~Large ribosomal subunit protein uL13~~~
MKTYTAKPETVQRDWFVVDAAGQTLGRLATEIARRLRGKHKPEYTPHVDTGDYIVVINAEQVRVTGAKTTDKMYYHHSGF
PGGIKSINFEKLIAKAPERVIETAVKGMLPKNPLGRDMYRKLKVYKGASHPHTAQQPQELKI
>Q6N651 ~~~rplM~~~Large ribosomal subunit protein uL13~~~COG0102
MKTFSAKPAEVTKKWVIIDATGLVVGRLATLVAMRLRGKHLPTYTPHVDCGDNVIIINASKVVLTGRKRDNKVYYHHTGF
IGGIKERSAKAILEGRFPERVVEKAIERMIPRGPLGRVQMGNLRVYPGAEHPHEAQQPEKLDIGAMNRKNMRAA
>Q2FW38 ~~~rplM~~~Large ribosomal subunit protein uL13~~~COG0102
MRQTFMANESNIERKWYVIDAEGQTLGRLSSEVASILRGKNKVTYTPHVDTGDYVIVINASKIEFTGNKETDKVYYRHSN
HPGGIKSITAGELRRTNPERLIENSIKGMLPSTRLGEKQGKKLFVYGGAEHPHAAQQPENYELRG
>Q2YYM8 ~~~rplM~~~Large ribosomal subunit protein uL13~~~
MRQTFMANESNIERKWYVIDAEGQTLGRLSSEVASILRGKNKVTYTPHVDTGDYVIVINASKIEFTGNKETDKVYYRHSN
HPGGIKSITAGELRRTNPERLIENSIKGMLPSTRLGEKQGKKLFVYGGAEHPHAAQQPENYELRG
>Q7A473 ~~~rplM~~~Large ribosomal subunit protein uL13~~~
MRQTFMANESNIERKWYVIDAEGQTLGRLSSEVASILRGKNKVTYTPHVDTGDYVIVINASKIEFTGNKETDKVYYRHSN
HPGGIKSITAGELRRTNPERLIENSIKGMLPSTRLGEKQGKKLFVYGGAEHPHAAQQPENYELRG
>Q72IN1 ~~~rplM~~~Large ribosomal subunit protein uL13~~~COG0102
MKTYVPKQVEPRWVLIDAEGKTLGRLATKIATLLRGKHRPDWTPNVAMGDFVVVVNADKIRVTGKKLEQKIYTRYSGYPG
GLKKIPLEKMLATHPERVLEHAVKGMLPKGPLGRRLFKRLKVYAGPDHPHQAQRPEKLEV
>P60488 ~~~rplM~~~Large ribosomal subunit protein uL13~~~COG0102
MKTYVPKQVEPRWVLIDAEGKTLGRLATKIATLLRGKHRPDWTPNVAMGDFVVVVNADKIRVTGKKLEQKIYTRYSGYPG
GLKKIPLEKMLATHPERVLEHAVKGMLPKGPLGRRLFKRLKVYAGPDHPHQAQRPEKLEV
>B7IA29 ~~~rplN~~~Large ribosomal subunit protein uL14~~~
MIQTETMLDVADNSGARRVQCIKVLGGSHRRYASVGDIIKVTVKEAIPRARVKKGDVMNAVVVRTKFGIRRPDGSVIRFD
DNAAVILNNNKAPIATRIFGPVTRELRTEQFMKIISLAPEVL
>P12875 ~~~rplN~~~Large ribosomal subunit protein uL14~~~COG0093
MIQQETRLKVADNSGAREVLTIKVLGGSGRKTANIGDVIVCTVKQATPGGVVKKGEVVKAVIVRTKSGARRSDGSYISFD
ENACVIIRDDKSPRGTRIFGPVARELRENNFMKIVSLAPEVI
>Q9RXJ2 ~~~rplN~~~Large ribosomal subunit protein uL14~~~COG0093
MIMPQSRLDVADNSGAREIMCIRVLNSGIGGKGLTTGGGGNKRYAHVGDIIVASVKDAAPRGAVKAGDVVKAVVVRTSHA
IKRADGSTIRFDRNAAVIINNQGEPRGTRVFGPVARELRDRRFMKIVSLAPEVL
>P0ADY3 ~~~rplN~~~Large ribosomal subunit protein uL14~~~COG0093
MIQEQTMLNVADNSGARRVMCIKVLGGSHRRYAGVGDIIKITIKEAIPRGKVKKGDVLKAVVVRTKKGVRRPDGSVIRFD
GNACVLLNNNSEQPIGTRIFGPVTRELRSEKFMKIISLAPEVL
>Q839F4 ~~~rplN~~~Large ribosomal subunit protein uL14~~~COG0093
MIQQESRLRVADNSGAREILTIKVLGGSGRKTANIGDVIVATVKQATPGGVVKKGEVVKAVIVRTKSGARRADGSYIKFD
ENAAVIIRDDKSPRGTRIFGPVARELRENNFMKIVSLAPEVL
>A5FMZ0 ~~~rplN~~~Large ribosomal subunit protein uL14~~~COG0093
MVQQESRLKVADNTGAKEVLTIRVLGGTKRRYASVGDKIVVSIKDATPNGNVKKGAVSTAVVVRTKKEVRRADGSYIRFD
DNACVLLNAAGEMRGTRVFGPVARELREKQFMKIVSLAPEVL
>Q5L413 ~~~rplN~~~Large ribosomal subunit protein uL14~~~COG0093
MIQQESRLKVADNSGAREVLVIKVLGGSGRRYANIGDVVVATVKDATPGGVVKKGQVVKAVVVRTKRGVRRPDGSYIRFD
ENACVIIRDDKSPRGTRIFGPVARELRDKDFMKIISLAPEVI
>P04450 ~~~rplN~~~Large ribosomal subunit protein uL14~~~
MIQQESRLKVADNSGAREVLVIKVLGGSGRRYANIGDVVVATVKDATPGGVVKKGQVVKAVVVRTKRGVRRPDGSYIRFD
ENACVIIRDDKSPRGTRIFGPVARELRDKDFMKIISLAPEVI
>P56039 ~~~rplN~~~Large ribosomal subunit protein uL14~~~COG0093
MIQSFTRLNVADNSGAKEIMCIKVLGGSHKRYASVGSVIVASVKKAIPNGKVKRGQVVKAVVVRTKKEIQRKNGSLVRFD
DNAAVILDAKKDPVGTRIFGPVSREVRYANFMKIISLAPEVV
>A2RNP5 ~~~rplN~~~Large ribosomal subunit protein uL14~~~COG0093
MIQTESRLKVADNSGAKELLTIRVLGGSSRKFAGIGDIVVATVKSAAPGGAVKKGEVVKAVIVRTKSGAKRPDGSYIKFD
ENAAVLIRDDKTPRGTRIFGPVARELREGGYMKIVSLAPEVL
>Q927L7 ~~~rplN~~~Large ribosomal subunit protein uL14~~~COG0093
MIQQESRMKVADNSGAREVLTIKVLGGSGRKTANIGDVVVCTVKQATPGGVVKKGEVVKAVIVRTKSGARRQDGSYIKFD
ENACVIIRDDKSPRGTRIFGPVARELRENNFMKIVSLAPEVL
>Q50308 ~~~rplN~~~Large ribosomal subunit protein uL14~~~
MVSFMTRLNVADNTGAKQVGIIKVLGSTRKRYAFLGDVVVVSVKDAIPSGMVKKGQVLRAVIVRTKKGQQRKDGTHLKFD
DNACVLIKEDKSPRGTRIFGPVARELRERGYNKILSLAVEVV
>A0QSF9 ~~~rplN~~~Large ribosomal subunit protein uL14~~~COG0093
MIQQESRLKVADNTGAKEILCIRVLGGSSRRYAGIGDVIVATVKDAIPGGNVKRGDVVKAVVVRTVKERRRADGSYIKFD
ENAAVIIKNDNDPRGTRIFGPVGRELREKKFMKIVSLAPEVL
>A5U0A0 ~~~rplN~~~Large ribosomal subunit protein uL14~~~COG0093
MIQQESRLKVADNTGAKEILCIRVLGGSSRRYAGIGDVIVATVKDAIPGGNVKRGDVVKAVVVRTVKERRRPDGSYIKFD
ENAAVIIKPDNDPRGTRIFGPVGRELREKRFMKIISLAPEVL
>P9WHD9 ~~~rplN~~~Large ribosomal subunit protein uL14~~~COG0093
MIQQESRLKVADNTGAKEILCIRVLGGSSRRYAGIGDVIVATVKDAIPGGNVKRGDVVKAVVVRTVKERRRPDGSYIKFD
ENAAVIIKPDNDPRGTRIFGPVGRELREKRFMKIISLAPEVL
>Q7DDT2 ~~~rplN~~~Large ribosomal subunit protein uL14~~~
MIQMQTILDVADNSGARRVMCIKVLGGSKRRYASVGDIIKVAVKDAAPRGRVKKGDVYNAVVVRTAKGVRRPDGALIKFD
NNAAVLLNNKLEPLGTRIFGPVTRELRTERFMKIVSLAPEVL
>Q9HWE5 ~~~rplN~~~Large ribosomal subunit protein uL14~~~
MIQTQSMLDVADNSGARRVMCIKVLGGSHRRYAGIGDIIKVTVKEAIPRGKVKKGQVMTAVVVRTKHGVRRTDGSIIRFD
GNAAVLLNNKQEPIGTRIFGPVTRELRTEKFMKIVSLAPEVL
>Q6N4U4 ~~~rplN~~~Large ribosomal subunit protein uL14~~~COG0093
MIQMQTNLDVADNSGARRVMCIKVIGGSKRRYATVGDVIVVSIKEAIPRGKVKKGDVMKAVVVRVRKDIRRADGSVIRFD
RNAAVLINNQSEPIGTRIFGPVPRELRAKNHMKIISLAPEVL
>Q2FW16 ~~~rplN~~~Large ribosomal subunit protein uL14~~~COG0093
MIQQETRLKVADNSGAREVLTIKVLGGSGRKTANIGDVIVCTVKNATPGGVVKKGDVVKAVIVRTKSGVRRNDGSYIKFD
ENACVIIRDDKGPRGTRIFGPVARELREGNFMKIVSLAPEVL
>Q2YYK7 ~~~rplN~~~Large ribosomal subunit protein uL14~~~
MIQQETRLKVADNSGAREVLTIKVLGGSGRKTANIGDVIVCTVKNATPGGVVKKGDVVKAVIVRTKSGVRRNDGSYIKFD
ENACVIIRDDKGPRGTRIFGPVARELREGNFMKIVSLAPEVL
>Q7A463 ~~~rplN~~~Large ribosomal subunit protein uL14~~~
MIQQETRLKVADNSGAREVLTIKVLGGSGRKTANIGDVIVCTVKNATPGGVVKKGDVVKAVIVRTKSGVRRNDGSYIKFD
ENACVIIRDDKGPRGTRIFGPVARELREGNFMKIVSLAPEVL
>P0A473 ~~~rplN~~~Large ribosomal subunit protein uL14~~~COG0093
MIQTETRLKVADNSGAREILTIKVLGGSGRKFANIGDVIVASVKQATPGGAVKKGDVVKAVIVRTKSGARRADGSYIKFD
ENAAVIIREDKTPRGTRIFGPVARELREGGFMKIVSLAPEVL
>P73310 ~~~rplN~~~Large ribosomal subunit protein uL14~~~COG0093
MIQQQTYLNVADNSGARKLMCLRVLGTGNCTYGGIGDQIIAVVKDALPNMPIKKSDVVRAVIVRTKQPLRRASGMSIRFD
DNAAVIINAEGNPRGTRVFGPVARELRDKNFTKIVSLAPEVL
>Q72I14 ~~~rplN~~~Large ribosomal subunit protein uL14~~~COG0093
MIQPQTYLEVADNTGARKIMCIRVLKGSNAKYATVGDVIVASVKEAIPRGAVKEGDVVKAVVVRTKKEVKRPDGSAIRFD
DNAAVIINNQLEPRGTRVFGPVARELREKGFMKIVSLAPEVL
>Q5SHP8 ~~~rplN~~~Large ribosomal subunit protein uL14~~~COG0093
MIQPQTYLEVADNTGARKIMCIRVLKGSNAKYATVGDVIVASVKEAIPRGAVKEGDVVKAVVVRTKKEIKRPDGSAIRFD
DNAAVIINNQLEPRGTRVFGPVARELREKGFMKIVSLAPEVL
>P60558 ~~~rplN~~~Large ribosomal subunit protein uL14~~~
MIQPQTYLEVADNTGARKIMCIRVLKGSNAKYATVGDVIVASVKEAIPRGAVKEGDVVKAVVVRTKKEVKRPDGSAIRFD
DNAAVIINNQLEPRGTRVFGPVARELREKGFMKIVSLAPEVL
>O83229 ~~~rplN~~~Large ribosomal subunit protein uL14~~~COG0093
MIQVQSRLNVADNSGARLVQCIKVVGGSRRRYASVGDIIVVAVKDALPTSVIKKGSVEKAVIVRVSKEYRRVDGTYIRFD
DNACVVIDANGNPKGKRIFGPVARELRDMDFTKIVSLAPEVL
>B7IA20 ~~~rplO~~~Large ribosomal subunit protein uL15~~~
MTLRLNELAPAEGAKREHRRLGRGIGSGVGKTGGRGIKGQKSRKSGGVRPGFEGGQTAIYRRLPKFGFTSQIALKTAEVR
LSELSKVEGDIVSLETLKAANVVRRDQIRARIVLSGEITRAFTVQGVALTKGAKAAIEAAGGKVEE
>P19946 ~~~rplO~~~Large ribosomal subunit protein uL15~~~COG0200
MKLHELKPSEGSRKTRNRVGRGIGSGNGKTAGKGHKGQNARSGGGVRPGFEGGQMPLFQRLPKRGFTNINRKEYAVVNLD
KLNGFAEGTEVTPELLLETGVISKLNAGVKILGNGKLEKKLTVKANKFSASAKEAVEAAGGTAEVI
>Q9RSK9 ~~~rplO~~~Large ribosomal subunit protein uL15~~~COG0200
MKLHDLKPTPGSRKDRKRVGRGPGGTDKTAGRGHKGQKSRSGAGKGAFFEGGRSRLIARLPKRGFNNVGTTYEVVKLSQL
QDLEDTTFDRDTLEAYRLVRRKNRPVKLLASGEISRAVTVHVDAASAAAIKAVEAAGGRVVLPEVQTQQDDAQKAE
>P02413 ~~~rplO~~~Large ribosomal subunit protein uL15~~~COG0200
MRLNTLSPAEGSKKAGKRLGRGIGSGLGKTGGRGHKGQKSRSGGGVRRGFEGGQMPLYRRLPKFGFTSRKAAITAEIRLS
DLAKVEGGVVDLNTLKAANIIGIQIEFAKVILAGEVTTPVTVRGLRVTKGARAAIEAAGGKIEE
>Q839E5 ~~~rplO~~~Large ribosomal subunit protein uL15~~~COG0200
MKLHELKPAEGSRQVRNRVGRGTSSGNGKTAGRGQKGQKARSGGGVRLGFEGGQTPLFRRLPKRGFTNINRKDYAVVNLD
TLNRFEDGTEVTPVVLKEAGIVKNEKAGIKVLADGELTKKLTVKAAKFSKSAQEAIEAAGGSIEVI
>P04452 ~~~rplO~~~Large ribosomal subunit protein uL15~~~
MKLHELQPAPGSRKKAVRVGRGIGSGNGKTSGRGQKGQNARSGGGVRLGFEGGQTPLFRRLPKRGFTNINRKEYAVVNLE
KLNRFEDGTEVTPELLLETGVISKLKSGVKILGKGQIEKKLTVKAHKFSASAKEAIEAAGGKTEVI
>A2RNN4 ~~~rplO~~~Large ribosomal subunit protein uL15~~~COG0200
MELHSLKAAEGSRKVRNRVGRGTSSGNGKTSGRGQKGQKSRSGGGVRPGFEGGQTELFRRMPKRGFLNVNRKEYAIVNLE
TLNRLEDGATVSAETLVAAKIVKDVKSGVKVLANGELTAKNLTVKVAKVSAAAKAAIEAAGGSVEEA
>Q8Y447 ~~~rplO~~~Large ribosomal subunit protein uL15~~~COG0200
MKLHELKPSEGSRKERNRVGRGTGSGNGKTSGRGHKGQKARSGGGVRLGFEGGQLPLFRRIPKRGFTNINRKEFAIVNLD
VLNRFEDGTEVTPELLVETGIIRNEKSGIKILSNGNIEKKLTVKANKFSAAAKEAIEAAGGKTEVI
>Q50300 ~~~rplO~~~Large ribosomal subunit protein uL15~~~
MELNQLKSVPKARNHKTKTLGRGHGSGLGKTSGRGQKGQKARKSGLTRPGFEGGQTPLYRRLPKFGNARKGFLKQEWVVL
NLNKIAKLKLDKINRASLIEKQVISAKSQLPIKLIGHTKLEKPLHFEVHKVSKQALKAVENANGSVKLLEK
>A5U0A9 ~~~rplO~~~Large ribosomal subunit protein uL15~~~COG0200
MTLKLHDLRPARGSKIARTRVGRGDGSKGKTAGRGTKGTRARKQVPVTFEGGQMPIHMRLPKLKGFRNRFRTEYEIVNVG
DINRLFPQGGAVGVDDLVAKGAVRKNALVKVLGDGKLTAKVDVSAHKFSGSARAKITAAGGSATEL
>P9WHD7 ~~~rplO~~~Large ribosomal subunit protein uL15~~~COG0200
MTLKLHDLRPARGSKIARTRVGRGDGSKGKTAGRGTKGTRARKQVPVTFEGGQMPIHMRLPKLKGFRNRFRTEYEIVNVG
DINRLFPQGGAVGVDDLVAKGAVRKNALVKVLGDGKLTAKVDVSAHKFSGSARAKITAAGGSATEL
>Q9HWF4 ~~~rplO~~~Large ribosomal subunit protein uL15~~~
MQLNDLRSAPGARREKHRPGRGIGSGLGKTGGRGHKGLTSRSGGKVAPGFEGGQQPLHRRLPKFGFVSLKAMDRAEVRTS
ELAKVEGDVVSLQTLKDANLINQHVQRVKVMLSGEVGRAVTLKGIAATKGARAAIEAAGGKFED
>Q6N4V2 ~~~rplO~~~Large ribosomal subunit protein uL15~~~COG0200
MKLSEIADNVGSRKKRMRIGRGIGSGKGKTGGRGGKGQTARSGVRIKGFEGGQMPLHRRLPKRGFNNIFALEFAEVNLDR
LQEAVDSKAIDAGKVVDAAALVEAGVLRRAKDGVRLLGRGELTAKLNIEVHGATKSAIAAVEKAGGSVKILAPKAEEGEA
A
>P0A0F8 ~~~rplO~~~Large ribosomal subunit protein uL15~~~COG0200
MKLHELKPAEGSRKERNRVGRGVATGNGKTSGRGHKGQKARSGGGVRPGFEGGQLPLFRRLPKRGFTNINRKEYAIVNLD
QLNKFEDGTEVTPALLVESGVVKNEKSGIKILGNGSLDKKLTVKAHKFSASAAEAIDAKGGAHEVI
>Q2YYL6 ~~~rplO~~~Large ribosomal subunit protein uL15~~~
MKLHELKPAEGSRKERNRVGRGVATGNGKTSGRGHKGQKARSGGGVRPGFEGGQLPLFRRLPKRGFTNINRKEYAIVNLD
QLNKFEDGTEVTPALLVESGVVKNEKSGIKILGNGSLDKKLTVKAHKFSASAAEAIDAKGGAHEVI
>P0A0F6 ~~~rplO~~~Large ribosomal subunit protein uL15~~~
MKLHELKPAEGSRKERNRVGRGVATGNGKTSGRGHKGQKARSGGGVRPGFEGGQLPLFRRLPKRGFTNINRKEYAIVNLD
QLNKFEDGTEVTPALLVESGVVKNEKSGIKILGNGSLDKKLTVKAHKFSASAAEAIDAKGGAHEVI
>Q72I23 ~~~rplO~~~Large ribosomal subunit protein uL15~~~COG0200
MKLSDLRPNPGANKRRKRVGRGPGSGHGKTATRGHKGQKSRSGGLKDPRRFEGGRSTTLMRLPKRGMQGQVPGEIKRPRY
QGVNLKDLARFEGEVTPELLVRAGLLKKGYRLKILGEGEAKPLKVVAHAFSKSALEKLKAAGGEPVLLEA
>Q5SHQ7 ~~~rplO~~~Large ribosomal subunit protein uL15~~~COG0200
MKLSDLRPNPGANKRRKRVGRGPGSGHGKTATRGHKGQKSRSGGLKDPRRFEGGRSTTLMRLPKRGMQGQVPGEIKRPRY
QGVNLKDLARFEGEVTPELLVRAGLLKKGYRLKILGEGEAKPLKVVAHAFSKSALEKLKAAGGEPVLLEA
>P74910 ~~~rplO~~~Large ribosomal subunit protein uL15~~~
MKLSDLRPNPGANKRRKRVGRGPGSGHGKTATRGHKGQKSRSGGLKDPRRFEGGRSTTLMRLPKRGMQGQVPGEIKRPRY
QGVNLKDLARFEGEVTPELLVRAGLLKKGYRLKILGEGEAKPLKVVAHAFSKSALEKLKAAGGEPVLLEA
>B7IA32 ~~~rplP~~~Large ribosomal subunit protein uL16~~~
MLQPKRTKFRKVHKGRNTGLAHRGSTVSFGSIAIKATERGRMTARQIEAARRTISRRIKRGGKIFIRVFPDKPITEKPLE
VRMGNGKGNVEYWVCEIKPGKILYEIEGVNEDLAREAFALAAAKLPFKTTIVTRTVM
>P14577 ~~~rplP~~~Large ribosomal subunit protein uL16~~~COG0197
MLLPKRVKYRREHRGKMRGRAKGGTEVHFGEFGIQALEASWITNRQIEAARIAMTRYMKRGGKVWIKIFPSKPYTAKPLE
VRMGSGKGAPEGWVAVVKPGKVLFEISGVSEEVAREALRLASHKLPIKTKFVKREEIGGESNES
>Q9RXJ5 ~~~rplP~~~Large ribosomal subunit protein uL16~~~COG0197
MLLPKRTKFRKQFRGRMTGDAKGGDYVAFGDYGLIAMEPAWIKSNQIEACRIVMSRHFRRGGKIYIRIFPDKPVTKKPAE
TRMGKGKGAVEYWVSVVKPGRVMFEVAGVTEEQAKEAFRLAGHKLPIQTKMVKREVYDEAQ
>P0ADY7 ~~~rplP~~~Large ribosomal subunit protein uL16~~~COG0197
MLQPKRTKFRKMHKGRNRGLAQGTDVSFGSFGLKAVGRGRLTARQIEAARRAMTRAVKRQGKIWIRVFPDKPITEKPLAV
RMGKGKGNVEYWVALIQPGKVLYEMDGVPEELAREAFKLAAAKLPIKTTFVTKTVM
>Q839F7 ~~~rplP~~~Large ribosomal subunit protein uL16~~~COG0197
MLVPKRVKHRREFRGKMRGEAKGGKEVAFGEWGLQATESHWITNRQIEAARIAMTRYMKRGGKVWIKIFPHKSYTSKAIG
VRMGKGKGAPEGWVSPVKRGKIMFEIAGVPEEVAREALRLASHKLPVKTKIVKREEMGGESNEG
>A2RNP8 ~~~rplP~~~Large ribosomal subunit protein uL16~~~COG0197
MLVPKRVKHRREFRGKMRGYAKGGDTVSFGEYGLQATTSHWITNRQIEAARIAMTRYMKRNGQVWIKIFPHKSYTAKAIG
VRMGSGKGAPEGWVAPVKRGVVMFELGGVDEATAREALRLASHKLPVKTKFVKRGEA
>Q927L4 ~~~rplP~~~Large ribosomal subunit protein uL16~~~COG0197
MLVPKRVKYRREFRGNMRGRAKGGTEVAFGEYGLQAVEASWITNRQIEAARIAMTRYMKRGGKVWIKIFPHKSYTSKPIG
VRMGKGKGAPEGWVSPVKRGKIMFEIAGVPEDVAREALRLAAHKLPVKTKIVKREEIGGEANES
>P41204 ~~~rplP~~~Large ribosomal subunit protein uL16~~~
MLQPKRTKYRKPHNVSYEGKAKGNSYVAFGEYGLVATKGNWIDARAIESARIAISKCLGKTGKMWIRIFPHMSKTKKPLE
VRMGSGKGNPEFWVAVVKQGTVMFEVANIPESQMIKALTRAGHKLPVTWKILKREEVSA
>A0QSD8 ~~~rplP~~~Large ribosomal subunit protein uL16~~~COG0197
MLIPRKVKHRKQHHPEQRGIASGGTSVSFGDYGIQALEHAYITNRQIESARIAINRHIKRGGKVWINIFPDRPLTKKPAE
TRMGSGKGSPEWWVANVKPGRVLFELSYPDEKTARDALTRAIHKLPIKARIVTREEQF
>A5U094 ~~~rplP~~~Large ribosomal subunit protein uL16~~~COG0197
MLIPRKVKHRKQHHPRQRGIASGGTTVNFGDYGIQALEHAYVTNRQIESARIAINRHIKRGGKVWINIFPDRPLTKKPAE
TRMGSGKGSPEWWVANVKPGRVLFELSYPNEGVARAALTRAIHKLPIKARIITREEQF
>P9WHD5 ~~~rplP~~~Large ribosomal subunit protein uL16~~~COG0197
MLIPRKVKHRKQHHPRQRGIASGGTTVNFGDYGIQALEHAYVTNRQIESARIAINRHIKRGGKVWINIFPDRPLTKKPAE
TRMGSGKGSPEWWVANVKPGRVLFELSYPNEGVARAALTRAIHKLPIKARIITREEQF
>Q9HWE2 ~~~rplP~~~Large ribosomal subunit protein uL16~~~
MLQPKRTKFRKQMTGHNRGLAHRGSKVSFGEYALKATSRGRLTARQIESARRALTRHVKRGGKIWIRVFPDKPVTKKPLE
VRMGKGKGGVEYWVAQIQPGKVLYEIEGVSEELAREAFALAAAKLPLATSFVKRTVM
>Q6N4U1 ~~~rplP~~~Large ribosomal subunit protein uL16~~~COG0197
MMQPKRTKFRKAHKGRIHGVASSGATLAFGQFGLKAMEPERITARQIEAARRALTRHMKRAGRVWIRVFPDLPVSKKPAE
VRMGSGKGSPELWVARVKPGRVMFEIDGVNQQIAREALTLAAAKLPIKTRFVARIAE
>Q2FW13 ~~~rplP~~~Large ribosomal subunit protein uL16~~~COG0197
MLLPKRVKYRRQHRPKTTGRSKGGNYVTFGEFGLQATTTSWITSRQIESARIAMTRYMKRGGKVWIKIFPHTPYTKKPLE
VRMGAGKGAVEGWIAVVKPGRILFEVAGVSEEVAREALRLASHKLPVKTKFVKREELGGETNES
>Q2YYQ3 ~~~rplP~~~Large ribosomal subunit protein uL16~~~
MLLPKRVKYRRQHRPKTTGRSKGGNYVTFGEFGLQATTTSWITSRQIESARIAMTRYMKRGGKVWIKIFPHTPYTKKPLE
VRMGAGKGAVEGWIAVVKPGRILFEVAGVSEEVAREALRLASHKLPVKTKFVKREELGGETNES
>Q7A461 ~~~rplP~~~Large ribosomal subunit protein uL16~~~
MLLPKRVKYRRQHRPKTTGRSKGGNYVTFGEFGLQATTTSWITSRQIESARIAMTRYMKRGGKVWIKIFPHTPYTKKPLE
VRMGAGKGAVEGWIAVVKPGRILFEVAGVSEEVAREALRLASHKLPVKTKFVKREELGGETNES
>P0A475 ~~~rplP~~~Large ribosomal subunit protein uL16~~~COG0197
MLVPKRVKHRREFRGKMRGEAKGGKEVAFGEYGLQATTSHWITNRQIEAARIAMTRYMKRGGKVWIKIFPHKSYTAKAIG
VRMGSGKGAPEGWVAPVKRGKVMFEIAGVSEEIAREALRLASHKLPVKCKFVKREAE
>Q72I11 ~~~rplP~~~Large ribosomal subunit protein uL16~~~COG0197
MLMPRRMKYRKQQRGRLKGATKGGDYVAFGDFGLVALEPAWITAQQIEAARVAMVRHFRRGGKIFIRIFPDKPYTKKPLE
VRMGKGKGNVEGYVAVVKPGRVMFEVAGVTEEQAMEALRIAGHKLPIKTKIVRRDAYDEAQ
>P60489 ~~~rplP~~~Large ribosomal subunit protein uL16~~~COG0197
MLMPRRMKYRKQQRGRLKGATKGGDYVAFGDYGLVALEPAWITAQQIEAARVAMVRHFRRGGKIFIRIFPDKPYTKKPLE
VRMGKGKGNVEGYVAVVKPGRVMFEVAGVTEEQAMEALRIAGHKLPIKTKIVRRDAYDEAQ
>B7IA13 ~~~rplQ~~~Large ribosomal subunit protein bL17~~~
MRHRNSGVKLGRTSSHRKAMFENLANSLFEHELIKTTLPKAKELRRVAEPLITLAKNDTVANRRLAFARTRNAATVGKLF
TVLGPRYKERNGGYLRVLKAGFRAGDAAPMAYVELVDREVNTSAE
>P20277 ~~~rplQ~~~Large ribosomal subunit protein bL17~~~COG0203
MSYRKLGRTSAQRKAMLRDLTTDLIINERIETTETRAKELRSVVEKMITLGKRGDLHARRQAAAYIRNEVANEENNQDAL
QKLFSDIATRYEERQGGYTRIMKLGPRRGDGAPMAIIELV
>Q9RSJ5 ~~~rplQ~~~Large ribosomal subunit protein bL17~~~COG0203
MRHGKAGRKLNRNSSARVALARAQATALLREGRIQTTLTKAKELRPFVEQLITTAKGGDLHSRRLVAQDIHDKDVVRKVM
DEVAPKYAERPGGYTRILRVGTRRGDGVTMALIELV
>P0AG44 ~~~rplQ~~~Large ribosomal subunit protein bL17~~~COG0203
MRHRKSGRQLNRNSSHRQAMFRNMAGSLVRHEIIKTTLPKAKELRRVVEPLITLAKTDSVANRRLAFARTRDNEIVAKLF
NELGPRFASRAGGYTRILKCGFRAGDNAPMAYIELVDRSEKAEAAAE
>Q839D8 ~~~rplQ~~~Large ribosomal subunit protein bL17~~~COG0203
MSYRKLGRTSSQRKAMLRDITTDLIINERIVTTEARAKEVRSTVEKMITLGKRGDLHARRQAATFVRNEVASVREEDESI
VVESALQKLFNDLGPRFAERQGGYTRILKTEPRRGDAAPMVVIEFVK
>P07843 ~~~rplQ~~~Large ribosomal subunit protein bL17~~~
MSYRKLGRTTSQRKALLRDLATDLIINERIETTEARAKELRAVIEKMITLGKRGDLHARRQAAAFIRKEVANSETGQDAL
QKLFSDIAPRYQDRQGGYTRIMKLGPRRGDGAPMVIIELV
>A2RNM6 ~~~rplQ~~~Large ribosomal subunit protein bL17~~~COG0203
MSNRKLGRTSSQRKAMLRDLTTDLIINETIVTTEARAKEVRRTVEKMITLGKKGDLSARRAAAAYVRNEIAIKDFNEETE
TFPTALQKLFNDLAKRYEGRNGGYTRILKVEPRRGDAAPMAIIELV
>Q8Y450 ~~~rplQ~~~Large ribosomal subunit protein bL17~~~COG0203
MGYRKLGRTSSQRKALLRDLATDLIVFERIETTEARAKEIRKVVEKLITSGKKGDLHARRQAAAFIRHEVVEVVQVDAKG
KDGSTVKKNRPVYALQKLFDDVAPRYAERQGGYTRILKKGPRRGDGAPMVIIELV
>Q59547 ~~~rplQ~~~Large ribosomal subunit protein bL17~~~
MSYINKPGKTSAWRVMTVRQQVSAVLAYGKIETTLKKAKNTQKRLDKLITLAKVDNFNNRRQVKKWLLNTNLFDVDQLMD
HLFSKVAPKYEKTPGGYSRVLKLGPRRGDATEMAILQLTDAKYK
>A0QSL9 ~~~rplQ~~~Large ribosomal subunit protein bL17~~~COG0203
MPKPTKGPRLGGSSSHQSALLANLATSLFEHGRIKTTEPKARALRPYAEKLITHAKKGALHNRREVMKKIRDKDVVHTLF
AEIGPFYADRNGGYTRIIKVENRKGDNAPMAVIELVREKTVTDEANRARRAAASQAKADERADEKADEKAEETVEETTEA
PAEESTEAAAEETVEETTEAPAEESTEAAEESEAKDDTK
>A5U8D2 ~~~rplQ~~~Large ribosomal subunit protein bL17~~~COG0203
MPKPTKGPRLGGSSSHQKAILANLATSLFEHGRITTTEPKARALRPYAEKLITHAKKGALHNRREVLKKLRDKDVVHTLF
AEIGPFFADRDGGYTRIIKIEARKGDNAPMAVIELVREKTVTSEANRARRVAAAQAKAKKAAAMPTEESEAKPAEEGDVV
GASEPDAKAPEEPPAEAPEN
>P9WHD3 ~~~rplQ~~~Large ribosomal subunit protein bL17~~~COG0203
MPKPTKGPRLGGSSSHQKAILANLATSLFEHGRITTTEPKARALRPYAEKLITHAKKGALHNRREVLKKLRDKDVVHTLF
AEIGPFFADRDGGYTRIIKIEARKGDNAPMAVIELVREKTVTSEANRARRVAAAQAKAKKAAAMPTEESEAKPAEEGDVV
GASEPDAKAPEEPPAEAPEN
>O52761 ~~~rplQ~~~Large ribosomal subunit protein bL17~~~
MRHRKSGRHLSRTSAHRKAMFQNMAVSLFEHELIKTTLPKAKELRRVAEPLITLAKEDSVANRRLAFDRTRSKAAVGKLF
NDLGKRYANRPGGYLRILKCGFRAGDNAPMAYVELVDRPVGGEVVEAAE
>Q6N4V8 ~~~rplQ~~~Large ribosomal subunit protein bL17~~~COG0203
MKHGKVHRKLNRTAEHRKAMFANMCAALIKHEQIVTTLPKAKELRPIVEKLVTLGKKGGLDKRRQAIAEMRDIEQVKKLF
DVLAPRYKDRNGGYTRIIKAGFRYGDNAAMAVIEFVDRDEDAKGRDSGPTQDNSEAEAA
>Q2FW33 ~~~rplQ~~~Large ribosomal subunit protein bL17~~~COG0203
MGYRKLGRTSDQRKAMLRDLATSLIISERIETTEARAKEVRSVVEKLITLGKKGDLASRRNAAKTLRNVEILNEDETTQT
ALQKLFGEIAERYTERQGGYTRILKQGPRRGDGAESVIIELV
>Q2YYM3 ~~~rplQ~~~Large ribosomal subunit protein bL17~~~
MGYRKLGRTSDQRKAMLRDLATSLIISERIETTEARAKEVRSVVEKLITLGKKGDLASRRNAAKTLRNVEILNEDETTQT
ALQKLFGEIAERYTERQGGYTRILKQGPRRGDGAESVIIELV
>Q7A469 ~~~rplQ~~~Large ribosomal subunit protein bL17~~~
MGYRKLGRTSDQRKAMLRDLATSLIISERIETTEARAKEVRSVVEKLITLGKKGDLASRRNAAKTLRNVEILNEDETTQT
ALQKLFGEIAERYTERQGGYTRILKQGPRRGDGAESVIIELV
>Q72I33 ~~~rplQ~~~Large ribosomal subunit protein bL17~~~COG0203
MRHLKSGRKLNRHSSHRLALYRNQAKSLLTHGRITTTVPKAKELRGFVDHLIHLAKRGDLHARRLVLRDLQDVKLVRKLF
DEIAPRYRDRQGGYTRVLKLAERRRGDGAPLALVELVE
>Q9Z9H5 ~~~rplQ~~~Large ribosomal subunit protein bL17~~~COG0203
MRHLKSGRKLNRHSSHRLALYRNQAKSLLTHGRITTTVPKAKELRGFVDHLIHLAKRGDLHARRLVLRDLQDVKLVRKLF
DEIAPRYRDRQGGYTRVLKLAERRRGDGAPLALVELVE
>B7IA23 ~~~rplR~~~Large ribosomal subunit protein uL18~~~
MNEKKQSRLRRAKSTRLHIRALGATRLCVNRTPRHIYAQVISADGGKVLAQASTLDASLRSGTTGNIEAATKVGALIAER
AKAAGVTKVAFDRSGFKYHGRIKALADAAREGGLEF
>P46899 ~~~rplR~~~Large ribosomal subunit protein uL18~~~COG0256
MITKTSKNAARLKRHARVRAKLSGTAERPRLNVFRSNKHIYAQIIDDVNGVTLASASTLDKDLNVESTGDTSAATKVGEL
VAKRAAEKGISDVVFDRGGYLYHGRVKALADAAREAGLKF
>Q9RSL2 ~~~rplR~~~Large ribosomal subunit protein uL18~~~COG0256
MATATTIRRKLRTRRKVRTTTAASGRLRLSVYRSSKHIYAQIIDDSRGQTLAAASSAALKSGNKTDTAAAVGKALAAAAA
EKGIKQVVFDRGSYKYHGRVKALADAAREGGLDF
>P0C018 ~~~rplR~~~Large ribosomal subunit protein uL18~~~COG0256
MDKKSARIRRATRARRKLQELGATRLVVHRTPRHIYAQVIAPNGSEVLVAASTVEKAIAEQLKYTGNKDAAAAVGKAVAE
RALEKGIKDVSFDRSGFQYHGRVQALADAAREAGLQF
>Q839E8 ~~~rplR~~~Large ribosomal subunit protein uL18~~~COG0256
MITKPDKNKTRQKRHRRVRNKISGTAECPRLNIFRSNKNIYAQVIDDVAGVTLASASALDKEISGGTKTETAAAVGKLVA
ERAAEKGIKKVVFDRGGYLYHGRVQALAEAARENGLEF
>P09415 ~~~rplR~~~Large ribosomal subunit protein uL18~~~
MITKVDRNAVRKKRHARIRKKIFGTTERPRLSVFRSNKHIYAQIIDDTKSATIVSASTLDKEFGLDSTNNIEAAKKVGEL
VAKRALEKGIKQVVFDRGGYLYHGRVKALADAAREAGLEF
>A2RNN7 ~~~rplR~~~Large ribosomal subunit protein uL18~~~COG0256
MISKPDKNKLRQKRHTRVRGKISGTTETPRLNVFRSNTNIYAQVIDDVTGTTLASASSLKLTGTKTEQAAEVGKLVAEAA
KAKGVEEVVFDRGGYLYHGRVAALATAAREAGLKF
>Q8Y445 ~~~rplR~~~Large ribosomal subunit protein uL18~~~COG0256
MITKIDKNKVRKKRHARVRSKISGTESRPRLNVFRSNKNIYAQIIDDVNGVTLASASNLDKDFGSAESKVDAASKVGELV
AKRASEKGITSVTFDRGGYLYHGRVKALAEAARENGLEF
>Q50302 ~~~rplR~~~Large ribosomal subunit protein uL18~~~
MKTRTEQRRLRHKRIVKKIRATNHDNRVVLMVIKSLNHISVQAWDFSQNIVLASSSSLALKLKNGNKDNAKLVGQDIADK
LVKLKLTNVVFDTGGSKYHGRIAALAEAARERGLNF
>A5U0A6 ~~~rplR~~~Large ribosomal subunit protein uL18~~~COG0256
MAQSVSATRRISRLRRHTRLRKKLSGTAERPRLVVHRSARHIHVQLVNDLNGTTVAAASSIEADVRGVPGDKKARSVRVG
QLIAERAKAAGIDTVVFDRGGYTYGGRIAALADAARENGLSF
>P9WHD1 ~~~rplR~~~Large ribosomal subunit protein uL18~~~COG0256
MAQSVSATRRISRLRRHTRLRKKLSGTAERPRLVVHRSARHIHVQLVNDLNGTTVAAASSIEADVRGVPGDKKARSVRVG
QLIAERAKAAGIDTVVFDRGGYTYGGRIAALADAARENGLSF
>Q9HWF1 ~~~rplR~~~Large ribosomal subunit protein uL18~~~
MSVKKETRLRRARKARLKMRELETVRLCVYRSSQHIYAQVIAADGGKVLASASTLDKDLREGATGNIDAAKKVGQLVAER
AKAAGVTQVAFDRSGFKYHGRVKALADAAREGGLEF
>Q6N4U9 ~~~rplR~~~Large ribosomal subunit protein uL18~~~COG0256
MSKMKITNARRTNRVRTALRRTANGRPRLSVFRSSKHIYAQVIDDAKGETLASASSLEKTMRDAGNTGANIDAAKAVGKL
VAERAVEKGVKEVVFDRGGYLYHGRVKALADAARESGLSF
>Q2FW22 ~~~rplR~~~Large ribosomal subunit protein uL18~~~COG0256
MISKIDKNKVRLKRHARVRTNLSGTAEKPRLNVYRSNKHIYAQIIDDNKGVTLAQASSKDSDIATTATKVELATKVGEAI
AKKAADKGIKEIVFDRGGYLYHGRVKALAEAARESGLEF
>Q7A467 ~~~rplR~~~Large ribosomal subunit protein uL18~~~
MISKIDKNKVRLKRHARVRTNLSGTAEKPRLNVYRSNKHIYAQIIDDNKGVTLAQASSKDSDIATTATKVELATKVGEAI
AKKAADKGIKEIVFDRGGYLYHGRVKALAEAARESGLEF
>Q72I20 ~~~rplR~~~Large ribosomal subunit protein uL18~~~COG0256
MARLTAYERRKFRVRNRIKRTGRLRLSVFRSLKHIYAQIIDDEKGVTLVSASSLALKLKGNKTEVARQVGRALAEKALAL
GIKQVAFDRGPYKYHGRVKALAEGAREGGLEF
>Q5SHQ4 ~~~rplR~~~Large ribosomal subunit protein uL18~~~COG0256
MARLTAYERRKFRVRNRIKRTGRLRLSVFRSLKHIYAQIIDDEKGVTLVSASSLALKLKGNKTEVARQVGRALAEKALAL
GIKQVAFDRGPYKYHGRVKALAEGAREGGLEF
>P80320 ~~~rplR~~~Large ribosomal subunit protein uL18~~~
MARLTAYERRKFRVRNRIKRTGRLRLSVFRSLKHIYAQIIDDEKGVTLVSASSLALKLKGNKTEVARQVGRALAEKALAL
GIKQVAFDRGPYKYHGRVKALAEGAREGGLEF
>B7IAS9 ~~~rplS~~~Large ribosomal subunit protein bL19~~~
MSGKHPLVQAIENSQLKTDLPEFAPGDTVVVQVKVKEGDRERLQAFEGVVIAKKNRGLNSAFTVRKISSGVGVERVFQTH
SPVVAKIEVKRRGDVRRAKLYYLRDLSGKAARIREKLPARKA
>O31742 ~~~rplS~~~Large ribosomal subunit protein bL19~~~COG0335
MQKLIEDITKEQLRTDLPAFRPGDTLRVHVKVVEGNRERIQIFEGVVIKRRGGGISETFTVRKISYGVGVERTFPVHTPK
IAKIEVVRYGKVRRAKLYYLRELRGKAARIKEIRR
>Q9RWB4 ~~~rplS~~~Large ribosomal subunit protein bL19~~~COG0335
MQTHIKINRGELLRGIEQDHTRQLPDFRPGDTVRVDTKVREGNRTRSQAFEGVVIAINGSGSRKSFTVRKISFGEGVERV
FPFASPLVNQVTIVERGKVRRAKLYYLRELRGKAARIKSDRSRVMKDAARAQQDKANASASQAAAAQADVTVISAAPEVA
PETQGE
>P0A7K6 ~~~rplS~~~Large ribosomal subunit protein bL19~~~COG0335
MSNIIKQLEQEQMKQDVPSFRPGDTVEVKVWVVEGSKKRLQAFEGVVIAIRNRGLHSAFTVRKISNGEGVERVFQTHSPV
VDSISVKRRGAVRKAKLYYLRERTGKAARIKERLN
>Q833P5 ~~~rplS~~~Large ribosomal subunit protein bL19~~~COG0335
MNPLIQELTQEQLRTDIPAFRPGDTVRVHAKVVEGTRERIQLFEGVVIKRRGAGISETYTVRKVSNGVGVERTFPLHTPR
VAQIEVVRYGKVRRAKLYYLRALHGKAARIKEIRR
>A2RLS5 ~~~rplS~~~Large ribosomal subunit protein bL19~~~COG0335
MNLIESINAAQLRTDIPDFRPGDTVRVHAKVVEGTRERIQIFEGVVIARKNSGINETYTVRKISNGVGVERIFPVHTPRV
EKIEVIRHGKVRRAKLYYLRALTGKKARIAERRR
>O53083 ~~~rplS~~~Large ribosomal subunit protein bL19~~~COG0335
MNKLIDEITKSQLNPDVPNFRPGDTVRVHAKVVEGTRERIQLFEGVVIKRRGAGISETFTVRKISNSVGVERTFPVHTPR
IAKLEVIRRGKVRRAKLYYLRNLRGKAARIKEIR
>P75133 ~~~rplS~~~Large ribosomal subunit protein bL19~~~
MKKINKQALIDLVEQKQLKAYVPEFSAGDEVNVAIKLKEKEKVRIQNFTGTVLRRRGKGISETFIVRKTTDGIPIEKNFQ
IHNPNISIELKRRGKVRRAYISYMRERSGKAAKIKERKQ
>A0QV42 ~~~rplS~~~Large ribosomal subunit protein bL19~~~COG0335
MNTLDFVDQASLRDDIPTFSPGDTVNVHVKVIEGSKERIQVFKGVVIRRQGGGISETFTVRKESYGVGVERTFPVHSPNI
DHIDVLTRGDVRRAKLYYLRELRGKKAKIKEKR
>A5U6Q9 ~~~rplS~~~Large ribosomal subunit protein bL19~~~COG0335
MNRLDFVDKPSLRDDIPAFNPGDTINVHVKVIEGAKERLQVFKGVVIRRQGGGIRETFTVRKESYGVGVERTFPVHSPNI
DHIEVVTRGDVRRAKLYYLRELRGKKAKIKEKR
>P9WHC9 ~~~rplS~~~Large ribosomal subunit protein bL19~~~COG0335
MNRLDFVDKPSLRDDIPAFNPGDTINVHVKVIEGAKERLQVFKGVVIRRQGGGIRETFTVRKESYGVGVERTFPVHSPNI
DHIEVVTRGDVRRAKLYYLRELRGKKAKIKEKR
>Q9K0K5 ~~~rplS~~~Large ribosomal subunit protein bL19~~~
MNLIQQLEQEEIARLNKEIPEFAPGDTVVVSVRVVEGTRSRLQAYEGVVIARRNRGLNSNFIVRKISSGEGVERTFQLYS
PTVEKIEVKRRGDVRRAKLYYLRGLTGKAARIKEKLPARKG
>Q9HXQ2 ~~~rplS~~~Large ribosomal subunit protein bL19~~~
MTNKIIQQIEAEQMNKEIPAFAPGDTVIVQVKVKEGDRQRLQAFEGVVIAKRNRGLNSAFTVRKISNGVGVERTFQTYSP
IVDSLSVKRRGDVRKAKLYYLRALSGKAARIKEKLV
>Q6ND68 ~~~rplS~~~Large ribosomal subunit protein bL19~~~COG0335
MNLIQTLEKEQFDKLSAGKTIPEFGPGDTVIVNVKVVEGERSRVQAYEGVCIGRSGGGINESFTVRKISYGEGVERVFPL
LSPMIDSIKVVRRGKVRRAKLYYLRNLRGKSARIVEKKQDRTAAVAAAE
>Q2FZ42 ~~~rplS~~~Large ribosomal subunit protein bL19~~~COG0335
MTNHKLIEAVTKSQLRTDLPSFRPGDTLRVHVRIIEGTRERIQVFEGVVIKRRGGGVSETFTVRKISSGVGVERTFPLHT
PKIEKIEVKRRGKVRRAKLYYLRSLRGKAARIQEIR
>Q2YXM4 ~~~rplS~~~Large ribosomal subunit protein bL19~~~
MTNHKLIEAVTKSQLRTDLPSFRPGDTLRVHVRIIEGTRERIQVFEGVVIKRRGGGVSETFTVRKISSGVGVERTFPLHT
PKIEKIEVKRRGKVRRAKLYYLRSLRGKAARIQEIR
>P66083 ~~~rplS~~~Large ribosomal subunit protein bL19~~~
MTNHKLIEAVTKSQLRTDLPSFRPGDTLRVHVRIIEGTRERIQVFEGIVIKRRGGGVSETFTVRKISSGVGVERTFPLHT
PKIEKIEVKRRGKVRRAKLYYLRSLRGKAARIQEIR
>Q72JU9 ~~~rplS~~~Large ribosomal subunit protein bL19~~~COG0335
MNRGALIKLVESRYVRTDLPEFRPGDTVRVSYKVKEGNRTRIQDFEGIVIRIRRNGFNTTFTVRKVSYGVGVERIFPLHS
PLIQKIDIVQRGRARRAKLYFIRNLSDREIRRKLRADRKRIDKDRAAERAAKEEVQKAQEPEASQE
>P60490 ~~~rplS~~~Large ribosomal subunit protein bL19~~~COG0335
MNRGALIKLVESRYVRTDLPEFRPGDTVRVSYKVKEGNRTRIQDFEGIVIRIRRNGFNTTFTVRKVSYGVGVERIFPLHS
PLIQKIDIVQRGRARRAKLYFIRNLSDREIRRKLRADRKRIDQDRAAERAAKEEAQKAQEPKASQE
>O67759 ~~~rplA~~~Large ribosomal subunit protein uL1~~~COG0081
MARRGKKYIEASKLVDRNKRYTLEEAVDLLKKMEEVLQRRFDETVELAMRLNVDPRYADQMVRGSVVLPHGLGKPIKVVV
FAEGEYAKKAEEAGADYVGGDELINKILKEEWTDFDVAIATPEMMPKVAKLGRILGPRGLMPSPKTGTVTTNVEQAIKDA
KRGRVEFKVDKAGNVHMPVGKISFEKEKLIDNLYAAIDAVVRAKPPGAKGQYIKNMAVSLTMSPSVKLDINEVLKKLQEK
AA
>Q06797 ~~~rplA~~~Large ribosomal subunit protein uL1~~~COG0081
MAKKGKKYVEAAKLVDRSKAYDVSEAVALVKKTNTAKFDATVEVAFRLGVDPRKNDQQIRGAVVLPNGTGKTQRVLVFAK
GEKAKEAEAAGADFVGDTDYINKIQQGWFDFDVIVATPDMMGEVGKIGRVLGPKGLMPNPKTGTVTFEVEKAIGEIKAGK
VEYRVDKAGNIHVPIGKVSFEDEKLVENFTTMYDTILKAKPAAAKGVYVKNVAVTSTMGPGVKVDSSTFNVK
>Q9RSS8 ~~~rplA~~~Large ribosomal subunit protein uL1~~~COG0081
MPKHGKRYRALEGKVDRNKQYSIDEAAALVKELATAKFDETVEVHFRLGIDPRKSDQNVRGTVALPHGTGRSVRVAVITK
GENVQAAEAAGADVVGSDELIERIAGGFMDFDAVVATPDMMAQIGQKLARLLGPRGLLPNPKSGTVGADVAGMVRGLKAG
RIEFRNDKTGVVHAPIGKASFESGNLSANYQALISALEGAKPGTAKGVFLRSAYLTTTMGPSIPLALGGAALA
>P0A7L0 ~~~rplA~~~Large ribosomal subunit protein uL1~~~COG0081
MAKLTKRMRVIREKVDATKQYDINEAIALLKELATAKFVESVDVAVNLGIDARKSDQNVRGATVLPHGTGRSVRVAVFTQ
GANAEAAKAAGAELVGMEDLADQIKKGEMNFDVVIASPDAMRVVGQLGQVLGPRGLMPNPKVGTVTPNVAEAVKNAKAGQ
VRYRNDKNGIIHTTIGKVDFDADKLKENLEALLVALKKAKPTQAKGVYIKKVSISTTMGAGVAVDQAGLSASVN
>Q830Q6 ~~~rplA~~~Large ribosomal subunit protein uL1~~~COG0081
MAKKSKKMQEALKKVDATKAYSVEEAVALAKDTNIAKFDATVEVAYKLNVDPKKADQQIRGAVVLPNGTGKTQTVLVFAK
GEKAKEAEAAGADFVGDDDMVAKIQGGWFDFDVVVATPDMMATVGKLGRVLGPKGLMPNPKTGTVTMDVTKAVEEVKAGK
VTYRVDKAGNIHVPIGKVSFDNEKLVENFNTINDVLLKAKPSTAKGQYIKNISVTTTFGPGIHVDQASF
>P04447 ~~~rplA~~~Large ribosomal subunit protein uL1~~~
MPKRGKKYLEALKLVDRSKAYPIAEAIELVKKTNVAKFDATVEVAFRLGVDPKKADQQIRGAVVLPHGTGKVARVLVFAK
GEKAKEAEAAGADYVGDTEYINKIQQGWFDFDVVVATPDMMGEVGKLGRILGPKGLMPDPKTGTVTFDVAKAVQEIKAGK
VEYRVDKAGNIHVPIGKVSFDNEKLAENFAAVYEALIKAKPAAAKGTYVKNVTITSTMGPGIKVDPTTVAVAQ
>P56029 ~~~rplA~~~Large ribosomal subunit protein uL1~~~COG0081
MAKKVFKRLEKLFSKIQNDKAYGVEQGVEVVKSLASAKFDETVEVALRLGVDPRHADQMVRGAVVLPHGTGKKVRVAVFA
KDIKQDEAKNAGADVVGGDDLAEEIKNGRIDFDMVIATPDMMAVVGKVGRILGPKGLMPNPKTGTVTMDIAKAVSNAKSG
QVNFRVDKKGNVHAPIGKASFPEEKIKENMLELVKTINRLKPSSAKGKYIRNAALSLTMSPSVSLDAQELMDIK
>A0QS46 ~~~rplA~~~Large ribosomal subunit protein uL1~~~COG0081
MSKNSKAYREAAEKVDRTKLYTPLEAAKLAKETSSKKQDATVEVAIRLGVDPRKADQMVRGTVNLPHGTGKTARVAVFAV
GEKAEQAQAAGADIVGSDDLIEKIQGGFLDFDAAIATPDQMAKVGRIARVLGPRGLMPNPKTGTVTPDVAKAVQDIKGGK
INFRVDKQANLHFIIGKASFDETKLAENYGAALDEVLRAKPSSSKGRYLKKVTVSTTTGPGIPVDPSVTRNFTEA
>P9WHC7 ~~~rplA~~~Large ribosomal subunit protein uL1~~~COG0081
MSKTSKAYRAAAAKVDRTNLYTPLQAAKLAKETSSTKQDATVEVAIRLGVDPRKADQMVRGTVNLPHGTGKTARVAVFAV
GEKADAAVAAGADVVGSDDLIERIQGGWLEFDAAIATPDQMAKVGRIARVLGPRGLMPNPKTGTVTADVAKAVADIKGGK
INFRVDKQANLHFVIGKASFDEKLLAENYGAAIDEVLRLKPSSSKGRYLKKITVSTTTGPGIPVDPSITRNFAGE
>Q6N4R5 ~~~rplA~~~Large ribosomal subunit protein uL1~~~COG0081
MAIGKRLKKIREGIDRTKLYPLDEAVKLVKERAISKFDETIEVAINLGVDPRHADQMVRGVVMLPNGTGRTVRVGVFARG
AKADEAKAAGADVVGAEDLVEQVQAGNINFDRCIATPDMMPLVGRLGKVLGPRGMMPNPKIGTVTMDVAGAVKGAKGGSV
EFRVEKAGIIQAGVGKASFDADKLVENIKALADAVNKAKPSGAKGTYIQRVAVSSTMGPGVKVEPGTVH
>Q99W68 ~~~rplA~~~Large ribosomal subunit protein uL1~~~
MAKKGKKYQEAASKVDRTQHYSVEEAIKLAKETSIANFDASVEVAFRLGIDTRKNDQQIRGAVVLPNGTGKSQSVLVFAK
GDKIAEAEAAGTDYVGEAEYVQKIQQGWFDFDVVVATPDMMGEVGKLGRVLGPKGLMPNPKTGTVTMDVKKAVEEIKAGK
VEYRAEKAGIVHASIGKVSFTDEQLIENFNTLQDVLAKAKPSSAKGTYFKSVAVTTTMGPGVKIDTASFK
>Q72GV9 ~~~rplA~~~Large ribosomal subunit protein uL1~~~COG0081
MPKHGKRYRALLEKVDPNKIYTIDEAAHLVKELATAKFDETVEVHAKLGIDPRRSDQNVRGTVSLPHGLGKQVRVLAIAK
GEKIKEAEEAGADYVGGEEIIQKILDGWMDFDAVVATPDVMGAVGSKLGRILGPRGLLPNPKAGTVGFNIGEIIREIKAG
RIEFRNDKTGAIHAPVGKASFPPEKLADNIRAFIRALEAHKPEGAKGTFLRSVYVTTTMGPSVRINPHS
>Q5SLP7 ~~~rplA~~~Large ribosomal subunit protein uL1~~~COG0081
MPKHGKRYRALLEKVDPNKVYTIDEAARLVKELATAKFDETVEVHAKLGIDPRRSDQNVRGTVSLPHGLGKQVRVLAIAK
GEKIKEAEEAGADYVGGEEIIQKILDGWMDFDAVVATPDVMGAVGSKLGRILGPRGLLPNPKAGTVGFNIGEIIREIKAG
RIEFRNDKTGAIHAPVGKASFPPEKLADNIRAFIRALEAHKPEGAKGTFLRSVYVTTTMGPSVRINPHS
>P27150 ~~~rplA~~~Large ribosomal subunit protein uL1~~~
MPKHGKRYRALLEKVDPNKIYTIDEAAHLVKELATAKFDETVEVHAKLGIDPRRSDQNVRGTVSLPHGLGKQVRVLAIAK
GEKIKEAEEAGADYVGGEEIIQKILDGWMDFDAVVATPDVMGAVGSKLGRILGPRGLLPNPKAGTVGFNIGEIIREIKAG
RIEFRNDKTGAIHAPVGKASFPPEKLADNIRAFIRALEAHKPEGAKGTFLRSVYVTTTMGPSVRINPHS
>B7I694 ~~~rplT~~~Large ribosomal subunit protein bL20~~~
MARVKRGVVAHRRHKKILARAKGYYGARSRVYRVAFQAVIKAGQYAYRDRRQKKRQFRALWIARINAGARQNGLSYSRMI
DGLKKAQVIIDRRVLADIAMHDAVAFAALAEKAKGALAA
>O67086 ~~~rplT~~~Large ribosomal subunit protein bL20~~~COG0292
MRVKGPSSRRKKKKILKLAKGYRGQRSRSYRRAKEAVMRALYYQYRDRKLRKREFRRLWIARINAAVRAYGLNYSTFING
LKKAGIELDRKILADMAVRDPQAFEQVVNKVKEALQVQ
>P55873 ~~~rplT~~~Large ribosomal subunit protein bL20~~~COG0292
MPRVKGGTVTRKRRKKVLKLAKGYFGSKHTLYKVANQQVMKSGNYAFRDRRQKKRDFRKLWITRINAAARMNGLSYSRLM
HGLKLSGIEVNRKMLADLAVNDLTAFNQLADAAKAQLNK
>Q9RSW7 ~~~rplT~~~Large ribosomal subunit protein bL20~~~COG0292
MPRAKTGIVRRRRHKKVLKRAKGFWGSRSKQYRNAFQTLLNAATYEYRDRRNKKRDFRRLWIQRINAGARLHGMNYSTFI
NGLKRANIDLNRKVLADIAAREPEAFKALVDASRNARQ
>P0A7L3 ~~~rplT~~~Large ribosomal subunit protein bL20~~~COG0292
MARVKRGVIARARHKKILKQAKGYYGARSRVYRVAFQAVIKAGQYAYRDRRQRKRQFRQLWIARINAAARQNGISYSKFI
NGLKKASVEIDRKILADIAVFDKVAFTALVEKAKAALA
>Q837C7 ~~~rplT~~~Large ribosomal subunit protein bL20~~~COG0292
MARVKGGTVTRKRRKKVLKLAKGYYGSKHTLFKSAKEQVMNSYYYAFRDRRQKKRDFRKLWIARINAAARMNGLSYSKLM
HGLKLAEIDINRKMLADLAVNDAAAFTALAEQAKDALSK
>A2RMR1 ~~~rplT~~~Large ribosomal subunit protein bL20~~~COG0292
MARVKGSVATRKRRKRILKLAKGYYGAKHRLFKTAKEQVMNSYYYAFRDRRQKKRDFRKLWIARINAAARMNGLSYSKLM
HGLKLADIEVNRKMLADIAIADAAAFTALAEEAKKALAK
>P66103 ~~~rplT~~~Large ribosomal subunit protein bL20~~~COG0292
MPRVKGGTVTRKRRKKIVKLAKGYYGSKHLLFKVANQAVMKSYQYAYRDRRQKKRDFRRLWIARINAAARMQDLSYSKLM
HGLKLAGIDINRKMLADLAVNDIASFNTLADSAKKALAK
>P78023 ~~~rplT~~~Large ribosomal subunit protein bL20~~~
MRIKGGKQTRVRRKKWLKQASGSFGTRHASYKVAKQTVIQAAKYAYRDRRNKKRDFRSLWILRLNAALREQGMTYSVFIN
LLKKHNIEINRKVLSELAIKEPSKFNLIVQKVKSEQPKAAKPAALGN
>A0QYU6 ~~~rplT~~~Large ribosomal subunit protein bL20~~~COG0292
MARVKRALNAQKKRRTVLKASKGYRGQRSRLYRKAKEQQLHSLTYAYRDRRARKGEFRKLWISRINAAARANDITYNRLI
QGLKAAGVEVDRKNLAELAVSDPAAFTALVDVARAALPEDVNAPSGEAA
>A5U300 ~~~rplT~~~Large ribosomal subunit protein bL20~~~COG0292
MARVKRAVNAHKKRRSILKASRGYRGQRSRLYRKAKEQQLHSLNYAYRDRRARKGEFRKLWIARINAAARLNDITYNRLI
QGLKAAGVEVDRKNLADIAISDPAAFTALVDVARAALPEDVNAPSGEAA
>P9WHC5 ~~~rplT~~~Large ribosomal subunit protein bL20~~~COG0292
MARVKRAVNAHKKRRSILKASRGYRGQRSRLYRKAKEQQLHSLNYAYRDRRARKGEFRKLWIARINAAARLNDITYNRLI
QGLKAAGVEVDRKNLADIAISDPAAFTALVDVARAALPEDVNAPSGEAA
>Q9I0A2 ~~~rplT~~~Large ribosomal subunit protein bL20~~~
MARVKRGVIARRRHKKILKLAKGYYGARSRVFRVAKQAVIKAGQYAYRDRRQRKRQFRALWIARINAGARQNGLSYSRLI
AGLKKAAIEIDRKVLADLAVNEKAAFTAIVEKAKASLA
>Q6NDR6 ~~~rplT~~~Large ribosomal subunit protein bL20~~~COG0292
MARVKRGVTAHAKHKKVYKLAKGYRGRRKNTIRTAKAAVDKAGQYAFRDRKRKKRTFRALWIQRLNAAVRPFGMTYSVFI
NGLSKSGIVVDRKVLSDLAINEPAAFQAIAEKAKAALAA
>Q2FXQ1 ~~~rplT~~~Large ribosomal subunit protein bL20~~~COG0292
MPRVKGGTVTRARRKKTIKLAKGYFGSKHTLYKVAKQQVMKSGQYAFRDRRQRKRDFRKLWITRINAAARQHEMSYSRLM
NGLKKAGIDINRKMLSEIAISDEKAFAQLVTKAKDALK
>Q2YTB1 ~~~rplT~~~Large ribosomal subunit protein bL20~~~
MPRVKGGTVTRARRKKTIKLAKGYFGSKHTLYKVAKQQVMKSGQYAFRDRRQRKRDFRKLWITRINAAARQHEMSYSRLM
NGLKKAGIDINRKMLSEIAISDEKAFAQLVTKAKDALK
>P66108 ~~~rplT~~~Large ribosomal subunit protein bL20~~~
MPRVKGGTVTRARRKKTIKLAKGYFGSKHTLYKVAKQQVMKSGQYAFRDRRQRKRDFRKLWITRINAAARQHEMSYSRLM
NGLKKAGIDINRKMLSEIAISDEKAFAQLVTKAKDALK
>Q72L76 ~~~rplT~~~Large ribosomal subunit protein bL20~~~COG0292
MPRAKTGVVRRRKHKKILKLAKGYWGLRSKSFRKARETLFAAGNYAYAHRKRRKRDFRRLWIVRINAACRQHGLNYSTFI
HGLKKAGIEVDRKNLADLAVREPQVFAELVERAKAAQG
>P60491 ~~~rplT~~~Large ribosomal subunit protein bL20~~~COG0292
MPRAKTGVVRRRKHKKILKLAKGYWGLRSKSFRKARETLFAAGNYAYAHRKRRKRDFRRLWIVRINAACRQHGLNYSTFI
HGLKKAGIEVDRKNLADLAVREPQVFAELVERAKAAQG
>B7I6V9 ~~~rplU~~~Large ribosomal subunit protein bL21~~~
MYAVIQSGGKQHRVVEGETLKVELLKAESGATITFDDVLMVVNGDNIQIGAPVVAGAKVTAEVIGHGRHDKIRIIKMRRR
KHYRKQQGHRQWFTELKITGISG
>P26908 ~~~rplU~~~Large ribosomal subunit protein bL21~~~COG0261
MYAIIKTGGKQIKVEEGQTVYIEKLAAEAGETVTFEDVLFVGGDNVKVGNPTVEGATVTAKVEKQGRAKKITVFRYKPKK
NVHKKQGHRQPYTKVTIEKINA
>Q9RY64 ~~~rplU~~~Large ribosomal subunit protein bL21~~~COG0261
MFAIIQTGGKQYRVSEGDVIRVESLQGEAGDKVELKALFVGGEQTVFGEDAGKYTVQAEVVEHGRGKKIYIRKYKSGVQY
RRRTGHRQNFTAIKILGIQG
>P0AG48 ~~~rplU~~~Large ribosomal subunit protein bL21~~~COG0261
MYAVFQSGGKQHRVSEGQTVRLEKLDIATGETVEFAEVLMIANGEEVKIGVPFVDGGVIKAEVVAHGRGEKVKIVKFRRR
KHYRKQQGHRQWFTDVKITGISA
>Q836X6 ~~~rplU~~~Large ribosomal subunit protein bL21~~~COG0261
MYAIIKTGGKQVKVEVGQAIYVEKLNVEAGEKVVFDEVILVGGESTKVGAPTVAGATVEGTVEKHGKQKKVVTFQYKPKK
HSHRKQGHRQPYTKVMIEAINA
>A2RLA4 ~~~rplU~~~Large ribosomal subunit protein bL21~~~COG0261
MSNYAIIKTGGKQVKVEEGSVIYVEKLNVEAGQTVTFDEVIFVGGETTKVGAPLVEGATVVGEVEKHGKQKKVVTFQYKP
KKHSHRKQGHRQPYTKVVIKSVNA
>Q8Y6Y9 ~~~rplU~~~Large ribosomal subunit protein bL21~~~COG0261
MYAIIETGGKQIKVEAGQEIYVEKLAGEVGDVVTFDKVLFVGGDSAKVGVPFVEGATVTAKVEKQGRAKKLTVYKYKPKK
NYHKKQGHRQPYTKLTIDAINA
>P78026 ~~~rplU~~~Large ribosomal subunit protein bL21~~~
MHAIVVCGSKQYLVHENDTFFVEKLEAPVGKEIQLDKVLMLDEKIGAPYLEKARVVCVVEKHGLQRKVNVIKHISQKHHL
KKYGHRQPYTKLKVVRFVHD
>A0R151 ~~~rplU~~~Large ribosomal subunit protein bL21~~~COG0261
MATYAIVKTGGKQYKVAAGDVVKVEKLDSEPGASVSLPVALVVDGANVTSKADDLAKVAVTAEVLEHTKGPKIRIHKFKN
KTGYHKRQGHRQQLTVLKVTGIK
>P9WHC3 ~~~rplU~~~Large ribosomal subunit protein bL21~~~COG0261
MATYAIVKTGGKQYKVAVGDVVKVEKLESEQGEKVSLPVALVVDGATVTTDAKALAKVAVTGEVLGHTKGPKIRIHKFKN
KTGYHKRQGHRQQLTVLKVTGIA
>Q9HVL6 ~~~rplU~~~Large ribosomal subunit protein bL21~~~
MYAVIVTGGKQHKVTEGEFLKVEKLDVATGEAIDFDRVLLVANGEDVKIGLPVVEGAKVTAEVVSHGRHDKVRIIKFRRR
KHHMKRQGHRQWFTEIKITGIQA
>Q6NDF0 ~~~rplU~~~Large ribosomal subunit protein bL21~~~COG0261
MFAVIKTGGRQYRVVPEDVLEVGKIDGDVGSIIQLGEVLVLGGDTPVLGAPTVAGATVAAEVLDHKRGPKVIAFKKRRRK
HSKRKRGYRDEITVLRITEILADGKKPSVGPRAKRTKAAPAAEAAE
>Q2FXS8 ~~~rplU~~~Large ribosomal subunit protein bL21~~~COG0261
MFAIIETGGKQIKVEEGQEIFVEKLDVNEGDTFTFDKVLFVGGDSVKVGAPTVEGATVTATVNKQGRGKKITVFTYKRRK
NSKRKKGHRQPYTKLTIDKINA
>Q2YT83 ~~~rplU~~~Large ribosomal subunit protein bL21~~~
MFAIIETGGKQIKVEEGQEIFVEKLDVNEGDTFTFDKVLFVGGDSVKVGAPTVEGATVTATVNKQGRGKKITVFTYKRRK
NSKRKKGHRQPYTKLTIDKINA
>Q7A583 ~~~rplU~~~Large ribosomal subunit protein bL21~~~
MFAIIETGGKQIKVEEGQEIFVEKLDVNEGDTFTFDKVLFVGGDSVKVGAPTVEGATVTATVNKQGRGKKITVFTYKRRK
NSKRKKGHRQPYTKLTIDKINA
>Q72HR2 ~~~rplU~~~Large ribosomal subunit protein bL21~~~COG0261
MFAIVKTGGKQYRVEPGLKLRVEKLDAEPGATVELPVLLLGGEKTVVGTPVVEGASVVAEVLGHGRGKKILVSKFKAKVQ
YRRKKGHRQPYTELLIKEIRG
>P60492 ~~~rplU~~~Large ribosomal subunit protein bL21~~~COG0261
MFAIVKTGGKQYRVEPGLKLRVEKLDAEPGATVELPVLLLGGEKTVVGTPVVEGASVVAEVLGHGRGKKILVSKFKAKVQ
YRRKKGHRQPYTELLIKEIRG
>P42060 ~~~rplV~~~Large ribosomal subunit protein uL22~~~COG0091
MQAKAVARTVRIAPRKARLVMDLIRGKQVGEAVSILNLTPRAASPIIEKVLKSAIANAEHNYEMDANNLVISQAFVDEGP
TLKRFRPRAMGRASQINKRTSHITIVVSEKKEG
>Q9RXJ7 ~~~rplV~~~Large ribosomal subunit protein uL22~~~COG0091
MTAPEQTFRNKKQRKQQVKLRKPGFAVAKYVRMSPRKVRLVVDVIRGKSVQDAEDLLRFIPRSASEPVAKVLNSAKANAL
HNDEMLEDRLFVKEAYVDAGPTLKRLIPRARGSANIIKKRTSHITIIVAEKGNK
>P61175 ~~~rplV~~~Large ribosomal subunit protein uL22~~~COG0091
METIAKHRHARSSAQKVRLVADLIRGKKVSQALDILTYTNKKAAVLVKKVLESAIANAEHNDGADIDDLKVTKIFVDEGP
SMKRIMPRAKGRADRILKRTSHITVVVSDR
>Q839F9 ~~~rplV~~~Large ribosomal subunit protein uL22~~~COG0091
MSEQITSAKATAKTVRTSPRKARLVIDLIRGKSVADAISILKFTPNKSAGIIEKVLMSAVANAENNFDLDVESLVVSEAF
VNEGPTMKRFRPRAKGSASPINKRTSHITVVVTEK
>P56047 ~~~rplV~~~Large ribosomal subunit protein uL22~~~COG0091
MSKALLRFVRLSPTKARLIARQIQGMNAELAIASLEFTPNKAARVLSKVVASAVANGSLDAKSALIVSCRVDAGPVLRRS
IPRAKGRATAIRKPTSHVFVEVAEGKEMKSSKSHKKNQAEGK
>A2RNQ0 ~~~rplV~~~Large ribosomal subunit protein uL22~~~COG0091
MAEITSAKATAKTVRVSPRKTRLVIDLIRGKRVADAIAILKFTPTKAAVEVENVLNSAIANAENNFGLEKANLVVSETFI
NEGPTMKRFRPRAKGSASPINKRTAHITVVVAEKE
>Q927L2 ~~~rplV~~~Large ribosomal subunit protein uL22~~~COG0091
MASEVTSAKAVAKTVRIAPRKARIVIDLIRGKQVGEAIAILKYTPRSASPIIEKVLKSAIANAEHNYDLDINNLVVEEAF
VDEGPTLKRFRPRAQGRASAINKRTSHITVVVSEVKEG
>P75575 ~~~rplV~~~Large ribosomal subunit protein uL22~~~
MIAFAKQFRVRISPQKARLVCQLIVGKKTADAQNILSNTPKKAATLIAKLLNSAIANATNNHGMNGDALYVFECVANQGP
SMKRTIPRAKGSSNMITKRSSNLVVKLSDNPNERQELIKQQKALVKKRVEGQQKAKMARQKAVTSVVKAPSKTQGGVQK
>A0QSD6 ~~~rplV~~~Large ribosomal subunit protein uL22~~~COG0091
MSTVTEFPSATAKARYVRVSATKARRVIDLVRGKSVEEALDILRWAPQAASEPVAKVIASAAANAQNNEGLDPSTLVVAT
VYADEGPTAKRIRPRAQGRAFRIRKRTSHITVIVESRPPKQKGASAASARSRRAQGSKAAATKKSAETKEGSE
>O06115 ~~~rplV~~~Large ribosomal subunit protein uL22~~~
MSTVTEFPSATAKARYVRVSATKARRVIDLVRGKSVEEALDILRWAPQAASEPVAKVIASAAANAQNNEGLDPSTLVVAT
VYADEGPTAKRIRPRAQGRAFRIRKRTSHITVIVESRPPKQKGASAASARSRRAQGSKAAATKKSAETKEGSE
>A5U092 ~~~rplV~~~Large ribosomal subunit protein uL22~~~COG0091
MTAATKATEYPSAVAKARFVRVSPRKARRVIDLVRGRSVSDALDILRWAPQAASGPVAKVIASAAANAQNNGGLDPATLV
VATVYADQGPTAKRIRPRAQGRAFRIRRRTSHITVVVESRPAKDQRSAKSSRARRTEASKAASKVGATAPAKKAAAKAPA
KKAPASSGVKKTPAKKAPAKKAPAKASETSAAKGGSD
>P9WHC1 ~~~rplV~~~Large ribosomal subunit protein uL22~~~COG0091
MTAATKATEYPSAVAKARFVRVSPRKARRVIDLVRGRSVSDALDILRWAPQAASGPVAKVIASAAANAQNNGGLDPATLV
VATVYADQGPTAKRIRPRAQGRAFRIRRRTSHITVVVESRPAKDQRSAKSSRARRTEASKAASKVGATAPAKKAAAKAPA
KKAPASSGVKKTPAKKAPAKKAPAKASETSAAKGGSD
>Q7DDT5 ~~~rplV~~~Large ribosomal subunit protein uL22~~~
MRVNAQHKNARISAQKARLVADLIRGKDVAQALNILAFSPKKGAELIKKVLESAIANAEHNNGADIDELKVVTIFVDKGP
SLKRFQARAKGRGNRIEKQTCHINVTVGN
>Q6N4T9 ~~~rplV~~~Large ribosomal subunit protein uL22~~~COG0091
MSKPKRERSLPDNEAKAVARMLRVSPQKLNLVAQLIRGRKASAALADLAFSRKRIAVDVKKCLESAIANAENNHDLDVDA
LVVSEAHVGKGIVMKRFTPRGRGRSGRIFKPFAQLTIVVRQVEEASA
>Q2FW11 ~~~rplV~~~Large ribosomal subunit protein uL22~~~COG0091
MEAKAVARTIRIAPRKVRLVLDLIRGKNAAEAIAILKLTNKASSPVIEKVLMSALANAEHNYDMNTDELVVKEAYANEGP
TLKRFRPRAQGRASAINKRTSHITIVVSDGKEEAKEA
>Q2YYQ1 ~~~rplV~~~Large ribosomal subunit protein uL22~~~
MEAKAVARTIRIAPRKVRLVLDLIRGKNAAEAIAILKLTNKASSPVIEKVLMSALANAEHNYDMNTDELVVKEAYANEGP
TLKRFRPRAQGRASAINKRTSHITIVVSDGKEEAKEA
>Q7A460 ~~~rplV~~~Large ribosomal subunit protein uL22~~~
MEAKAVARTIRIAPRKVRLVLDLIRGKNAAEAIAILKLTNKASSPVIEKVLMSALANAEHNYDMNTDELVVKEAYANEGP
TLKRFRPRAQGRASAINKRTSHITIVVSDGKEEAKEA
>P61182 ~~~rplV~~~Large ribosomal subunit protein uL22~~~COG0091
MAEITSAKAMARTVRVSPRKSRLVLDNIRGKSVADAIAILTFTPNKAAEIILKVLNSAVANAENNFGLDKANLVVSEAFA
NEGPTMKRFRPRAKGSASPINKRTAHITVAVAEK
>Q72I09 ~~~rplV~~~Large ribosomal subunit protein uL22~~~COG0091
MEAKAIARYVRISPRKVRLVVDLIRGKSLEEARNILRYTNKRGAYFVAKVLESAAANAVNNHDMLEDRLYVKAAYVDEGP
ALKRVLPRARGRADIIKKRTSHITVILGEKHGK
>Q5SHP3 ~~~rplV~~~Large ribosomal subunit protein uL22~~~COG0091
MEAKAIARYVRISPRKVRLVVDLIRGKSLEEARNILRYTNKRGAYFVAKVLESAAANAVNNHDMLEDRLYVKAAYVDEGP
ALKRVLPRARGRADIIKKRTSHITVILGEKHGK
>P48286 ~~~rplV~~~Large ribosomal subunit protein uL22~~~
MEAKAIARYVRISPRKVRLVVDLIRGKSLEEARNILRYTNKRGAYFVAKVLESAAANAVNNHDMLEDRLYVKAAYVDEGP
ALKRVLPRARGRADIIKKRTSHITVILGEKHGK
>B7IA37 ~~~rplW~~~Large ribosomal subunit protein uL23~~~
MNNERIYQVLKGPVFSEKAQVLGDTAGVQVFKVDINATKLEIKKAVEKLFGVEVVKVNTTITKGKTKRFGRTLGRRSDVK
KAYVTLKAGQDVEMADLGDTAESAAE
>P42924 ~~~rplW~~~Large ribosomal subunit protein uL23~~~COG0089
MKDPRDVLKRPVITERSADLMTEKKYTFEVDVRANKTEVKDAVESIFGVKVDKVNIMNYKGKSKRVGRYTGMTSRRRKAI
VKLTADSKEIEIFEA
>Q9RXK0 ~~~rplW~~~Large ribosomal subunit protein uL23~~~COG0089
MSHYDILQAPVISEKAYSAMERGVYSFWVSPKATKTEIKDAIQQAFGVRVIGISTMNVPGKRKRVGRFIGQRNDRKKAIV
RLAEGQSIEALAGQA
>P0ADZ0 ~~~rplW~~~Large ribosomal subunit protein uL23~~~COG0089
MIREERLLKVLRAPHVSEKASTAMEKSNTIVLKVAKDATKAEIKAAVQKLFEVEVEVVNTLVVKGKVKRHGQRIGRRSDW
KKAYVTLKEGQNLDFVGGAE
>Q839G2 ~~~rplW~~~Large ribosomal subunit protein uL23~~~COG0089
MELLDVIKRPVITEKSMLAMDEKKYTFEVDTRANKTLVKQAVESAFDVKVANVNILNVRPKFKRMGKYAGYTKKRRKAIV
TLTEDSKEIQLFEAAE
>P04454 ~~~rplW~~~Large ribosomal subunit protein uL23~~~
MKDPRDIIKRPIITENTMNLIGQKKYTFEVDVKANKTEVKDAVEKIFGVKVEKVNIMNYKGKFKRVGRYSGYTNRRKKAI
VTLTPDSKEIELFEV
>Q8Y441 ~~~rplW~~~Large ribosomal subunit protein uL23~~~COG0089
MDARDIIKRPVVTEESTSILDDKKYTFEVDTRATKTQVKYAVEEIFDVKVAKVNVMNYKGKLKRMGRYAGYTNKRRKAIV
TVTADSKEIQFFEV
>P75578 ~~~rplW~~~Large ribosomal subunit protein uL23~~~
MDVTNVLLKPVLTEKVYFNQMGETKKYVFVVNPKASKTRVKLAFELVYGIKPLKVNTLIRKPTTIRGGSRFPGLSKLEKL
AVITLPKGIAISVTGEAPEKTDKPADKTTLKESTVKEIKDTKNSPEAVVKTAVEALQIKPTAAPVTTAPLQTVAVKVAKE
VKEVKVEKPVKVEKPTKPAKVAKEAKTTKVAKETKAEKSVQTTKVAKETKTEKSAKTTKTTATKTTKTKTTKKEVKK
>A0QSD3 ~~~rplW~~~Large ribosomal subunit protein uL23~~~COG0089
MATITDPRDIILAPVISEKSYGLIEDNVYTFVVHPDSNKTQIKIAIEKIFDVKVDSVNTANRQGKRKRTRTGFGKRKSTK
RAIVKLAAGSKPIDLFGAPA
>A5U089 ~~~rplW~~~Large ribosomal subunit protein uL23~~~COG0089
MATLADPRDIILAPVISEKSYGLLDDNVYTFLVRPDSNKTQIKIAVEKIFAVKVASVNTANRQGKRKRTRTGYGKRKSTK
RAIVTLAPGSRPIDLFGAPA
>P9WHB9 ~~~rplW~~~Large ribosomal subunit protein uL23~~~COG0089
MATLADPRDIILAPVISEKSYGLLDDNVYTFLVRPDSNKTQIKIAVEKIFAVKVASVNTANRQGKRKRTRTGYGKRKSTK
RAIVTLAPGSRPIDLFGAPA
>Q9K1I6 ~~~rplW~~~Large ribosomal subunit protein uL23~~~
MNQQRLTQVILAPIVSEKSNVLAEKRNQMTFKVLANATKPEIKAAVELLFGVQVADVTTVTIKGKVKRFGRTLGRRSDVK
KAYVSLAAGQELDLEAAAAAADKE
>Q9HWD7 ~~~rplW~~~Large ribosomal subunit protein uL23~~~
MNQERVFKVLLGPHISEKATGLADGKSQFVFKVATDATKLEIKKAVESLFSVKVQRVTTLNVKGKTKRTARGLGKRNDWK
KAYIALQPGQDLDFATSAE
>Q6N4T7 ~~~rplW~~~Large ribosomal subunit protein uL23~~~COG0089
MKSIDPRHYDVIVAPVVTEKSTMASEHNKVVFKVQGGATKPQIKEAVEKLFDVKVKSVNTLVRKGKTKAFRGTFGTQSDV
KRAVVTLEEGHRIDVTTGL
>Q2FW08 ~~~rplW~~~Large ribosomal subunit protein uL23~~~COG0089
MEARDILKRPVITEKSSEAMAEDKYTFDVDTRVNKTQVKMAVEEIFNVKVASVNIMNYKPKKKRMGRYQGYTNKRRKAIV
TLKEGSIDLFN
>Q2YYP8 ~~~rplW~~~Large ribosomal subunit protein uL23~~~
MEARDILKRPVITEKSSEAMAEDKYTFDVDTRVNKTQVKMAVEEIFNVKVASVNIMNYKPKKKRMGRYQGYTNKRRKAIV
TLKEGSIDLFN
>Q7A459 ~~~rplW~~~Large ribosomal subunit protein uL23~~~
MEARDILKRPVITEKSSEAMAEDKYTFDVDTRVNKTQVKMAVEEIFNVKVASVNIMNYKPKKKRMGRYQGYTNKRRKAIV
TLKEGSIDLFN
>Q72I06 ~~~rplW~~~Large ribosomal subunit protein uL23~~~COG0089
MKTAYDVILAPVLSEKAYAGFAEGKYTFWVHPKATKTEIKNAVETAFKVKVVKVNTLHVRGKKKRLGRYLGKRPDRKKAI
VQVAPGQKIEALEGLI
>Q5SHP0 ~~~rplW~~~Large ribosomal subunit protein uL23~~~COG0089
MKTAYDVILAPVLSEKAYAGFAEGKYTFWVHPKATKTEIKNAVETAFKVKVVKVNTLHVRGKKKRLGRYLGKRPDRKKAI
VQVAPGQKIEALEGLI
>Q9RA57 ~~~rplW~~~Large ribosomal subunit protein uL23~~~
MKTAYDVILAPVLSEKAYAGFAEGKYTFWVHPKATKTEIKNAVETAFKVKVVKVNTLHVRGKKKRLGRYLGKRPDRKKAI
VQVAPGQKIEALEGLI
>P0C2N0 ~~~rplW~~~Large ribosomal subunit protein uL23~~~COG0089
MIREERLLKVLRAPHVSEKASAAMEKNNTIVLKVAKDATKAEIKAAVQKLFEVEVEDVNTLLVKGKSKRHGQRVGRRSDW
KKAYVTLKEGQNLDFIGGAE
>B7IA28 ~~~rplX~~~Large ribosomal subunit protein uL24~~~
MAKIKKGDQVIVIAGKEKGKQGTVLSVSEDRVKVEGLNLVKKHQKPNRVTGAEGGIVTQEASLHISNVAILNATTQKADR
VGYQVIDGVKTRVYKSTGESVAVAK
>P0CI78 ~~~rplX~~~Large ribosomal subunit protein uL24~~~COG0198
MHVKKGDKVMVISGKDKGKQGTILAAFPKKDRVLVEGVNMVKKHSKPTQANPQGGISNQEAPIHVSNVMPLDPKTGEVTR
VGYKVEDGKKVRVAKKSGQVLDK
>Q9RXJ1 ~~~rplX~~~Large ribosomal subunit protein uL24~~~COG0198
MPRPSAGSHHNDKLHFKKGDTVIVLSGKHKGQTGKVLLALPRDQKVVVEGVNVITKNVKPSMTNPQGGQEQRELALHASK
VALVDPETGKATRVRKQIVDGKKVRVAVASGKTID
>P60624 ~~~rplX~~~Large ribosomal subunit protein uL24~~~COG0198
MAAKIRRDDEVIVLTGKDKGKRGKVKNVLSSGKVIVEGINLVKKHQKPVPALNQPGGIVEKEAAIQVSNVAIFNAATGKA
DRVGFRFEDGKKVRFFKSNSETIK
>Q839F3 ~~~rplX~~~Large ribosomal subunit protein uL24~~~COG0198
MFVKKGDKVKVITGKDKNKEGVVLAAFPKQDKVIVEGVNVVKKHQKPNQAAPQGGILEVEAPIHVSNVMVIDPSNGEATK
VAFKEVDGKKVRVSKKTGEVLDK
>P04455 ~~~rplX~~~Large ribosomal subunit protein uL24~~~
MHVKKGDKVQVISGKDKGKQGVILAAFPKKNRVIVEGVNIVKKHAKPSQANPQGGIIEKEAPIHVSKVMPLDPKTGEPTR
IGYKIVDGKKVRYAKKSGEILDK
>A2RNP4 ~~~rplX~~~Large ribosomal subunit protein uL24~~~COG0198
MFVKTGDTVKVIAGKDRGTTGKVIKALPKVNKVVVEGVAIMKKHQKPNSENPSGAILEIEAPIHVSNVQVLDKNGVAGRV
GYKVVDDKKVRFNKKSGEILD
>Q8Y443 ~~~rplX~~~Large ribosomal subunit protein uL24~~~COG0198
MHVKKGDKVKVITGKDKGKSGKVLAAFPKKDRVLIEGINMVKKHTKPSNVNPQGGILNVEAPIHVSNVMLIDPKTGEPTR
VGYEVKGDKKVRVAKKSGEVIDK
>Q50307 ~~~rplX~~~Large ribosomal subunit protein uL24~~~
MQRIKKGDKVVVITGKNKGGSGIVLKIMPARQQAIVEGLNKVTRHKKKDQTTKRAAKQSTGKVQQEAPIFLSKLALFDQK
AKQQTIGKIKYVMDPKTNKKTRVFKKSNNTL
>A0QSG0 ~~~rplX~~~Large ribosomal subunit protein uL24~~~COG0198
MKVHKGDTVLVISGKDKGAKGKVLVAYPDRNKVLVEGVNRIKKHTAVSANERGASSGGIVTQEAPIHVSNVMVVDSDGKP
TRVGYRIDDETGKKVRIAKTNGKDI
>A5U0A1 ~~~rplX~~~Large ribosomal subunit protein uL24~~~COG0198
MKVHKGDTVLVISGKDKGAKGKVLQAYPDRNRVLVEGVNRIKKHTAISTTQRGARSGGIVTQEAPIHVSNVMVVDSDGKP
TRIGYRVDEETGKRVRISKRNGKDI
>P9WHB7 ~~~rplX~~~Large ribosomal subunit protein uL24~~~COG0198
MKVHKGDTVLVISGKDKGAKGKVLQAYPDRNRVLVEGVNRIKKHTAISTTQRGARSGGIVTQEAPIHVSNVMVVDSDGKP
TRIGYRVDEETGKRVRISKRNGKDI
>Q9HWE6 ~~~rplX~~~Large ribosomal subunit protein uL24~~~
MQKIRRDDEVIVIAGKDKGKRGKVLKVLADDRLVVGGVNLIKRHTKPNPMLGQQGGIVEKEAPLHVSNVAIFNTETSKAD
RVGFKVEDGKKIRVFKSTQKPVQA
>P60744 ~~~rplX~~~Large ribosomal subunit protein uL24~~~COG0198
MAAKIRKGDKVIVLSGRDKGRTGEVFEVRPDAGKALVRGINVVKRHQKQTQTQEGGIISKEAPIDLSNIAIVGKDGKPTR
VGFKILADGKKVRVAKRSGAEIDG
>Q2FW17 ~~~rplX~~~Large ribosomal subunit protein uL24~~~COG0198
MHIKKGDNVKVIAGKDKGKEGKVIATLPKKDRVVVEGVNIMKKHQKPTQLNPEGGILETEAAIHVSNVQLLDPKTNEPTR
VGYKFVDGKKVRIAKKSGEEIKSNN
>Q2YYK8 ~~~rplX~~~Large ribosomal subunit protein uL24~~~
MHIKKGDNVKVIAGKDKGKEGKVIATLPKKDRVVVEGVNIMKKHQKPTQLNPEGGILETEAAIHVSNVQLLDPKTNEPTR
VGYKFVDGKKVRIAKKSGEEIKSNN
>P60735 ~~~rplX~~~Large ribosomal subunit protein uL24~~~
MHIKKGDNVKVIAGKDKGKEGKVIATLPKKDRVVVEGVNIMKKHQKPTQLNPEGGILETEAAIHVSNVQLLDPKTNEPTR
VGYKFVDGKKVRIAKKSGEEIKSNN
>Q72I15 ~~~rplX~~~Large ribosomal subunit protein uL24~~~COG0198
MRVKMHVKKGDTVLVASGKYKGRVGKVKEVLPKKYAVIVEGVNIVKKAVRVSPKYPQGGFIEKEAPLHASKVRPICPACG
KPTRVRKKFLENGKKIRVCAKCGGALDTEE
>Q5SHP9 ~~~rplX~~~Large ribosomal subunit protein uL24~~~COG0198
MRVKMHVKKGDTVLVASGKYKGRVGKVKEVLPKKYAVIVEGVNIVKKAVRVSPKYPQGGFIEKEAPLHASKVRPICPACG
KPTRVRKKFLENGKKIRVCAKCGGALDTEE
>B7I7B6 ~~~rplY~~~Large ribosomal subunit protein bL25~~~
MANFVLNAQARAEDKQGKGASRRLRRESLVPAIIYGGNAEPVAVTLELRELVKALESNVFFEEVVEIKVGDKVENVKIQA
LQRHPAKNTPMHADFKRA
>Q9RX88 ~~~rplY~~~Large ribosomal subunit protein bL25~~~COG1825
MELTAKPRTPKQKLDESMIAAVAYNKENNVSFALDRKAFDRAFRQQSTTGLFDITVEGGETFPALVKAVQMDKRKRAPIH
VDFYMVTYGEPVEVSVPVHTTGRSQGEVQGGLVDIVVHNLQIVAPGPRRIPQELVVDVTKMNIGDHITAGDIKLPEGCTL
AADPELTVVSVLPPRLTAEELEAEVQAAQVAGLVAAGELSEEAAEAVLEGDASLEEVKAEASEDNAGTDSEDNSDAQ
>P68919 ~~~rplY~~~Large ribosomal subunit protein bL25~~~COG1825
MFTINAEVRKEQGKGASRRLRAANKFPAIIYGGKEAPLAIELDHDKVMNMQAKAEFYSEVLTIVVDGKEIKVKAQDVQRH
PYKPKLQHIDFVRA
>P9WHB5 ~~~rplY~~~Large ribosomal subunit protein bL25~~~COG1825
MAKSASNQLRVTVRTETGKGASRRARRAGKIPAVLYGHGAEPQHLELPGHDYAAVLRHSGTNAVLTLDIAGKEQLALTKA
LHIHPIRRTIQHADLLVVRRGEKVVVEVSVVVEGQAGPDTLVTQETNSIEIEAEALSIPEQLTVSIEGAEPGTQLTAGQI
ALPAGVSLISDPDLLVVNVVKAPTAEELEGEVAGAEEAEEAAVEAGEAEAAGESE
>Q02G03 ~~~rplY~~~Large ribosomal subunit protein bL25~~~
MVDFILNAQVRSDLGKGASRRLRRNAGLVPAVVYGGDKEPQSVTLELREIAKLLENEAAFSHVIALNVGGAKETVLIKAL
QRHPAKGFVMHADFLRVVADHKLTAHVPLHFINEEVAVGVKQSGGEISHTISEVEVSCLPKDLPEFIEVDMAKVELGQIV
HLSDLKAPKGVELVQLAHGNDLAVANIHASRVVKEEGSEEGAAE
>Q9HVC4 ~~~rplY~~~Large ribosomal subunit protein bL25~~~
MVDFILNAQVRSDLGKGASRRLRRNAGLVPAVVYGGDKEPQSVTLELREIAKLLENEAAFSHVIALNVGGAKETVLIKAL
QRHPAKGFVMHADFLRVVADHKLTAHVPLHFINEEVAVGVKQAGGEISHTISEVEVSCLPKDLPEFIEVDMAKVELGQIV
HLSDLKAPKGVELVQLAHGNDLAVANIHASRVVKEEGSEEGAAE
>Q6N1P8 ~~~rplY~~~Large ribosomal subunit protein bL25~~~COG1825
MTSVLELKATARPKSGKGAARAERRAGRVPGVIYGDNQSPLPISVEEKELRLRILAGRFLTTVFDVVLDGKKHRVIPRDY
HLDPVRDFPIHVDFLRLGAGATIRVSVPLHLKGLEVAPGVKRGGTFNIVTHTVELEAPAENIPQFIEADVSTLDIGVSLH
LSDIALPTGVKSVSRDDVTLVTIVPPSGYNEDKAAAGAAPAAAAAPAAAAKAPAAAAKAPAAAAPAAKKK
>P68918 ~~~rplY~~~Large ribosomal subunit protein bL25~~~
MFTINAEVRKEQGKGASRRLRAANKFPAIIYGGKEAPLAIELDHDKVMNMQAKAEFYSEVLTIVVDGKEIKVKAQDVQRH
PYKPKLQHIDFVRA
>Q2FJE0 ~~~rplY~~~Large ribosomal subunit protein bL25~~~
MASLKSIIRQGKQTRSDLKQLRKSGKVPAVVYGYGTKNVSVKVDEVEFIKVIREVGRNGVIELGVGSKTIKVMVADYQFD
PLKNQITHIDFLAINMSEERTVEVPVQLVGEAVGAKEGGVVEQPLFNLEVTATPDNIPEAIEVDITELNINDSLTVADVK
VTGDFKIENDSAESVVTVVAPTEEPTEEEIEAMEGEQQTEEPEVVGESKEDEEKTEE
>Q7A7B3 ~~~rplY~~~Large ribosomal subunit protein bL25~~~
MASLKSIIRQGKQTRSDLKQLRKSGKVPAVVYGYGTKNVSVKVDEVEFIKVIREVGRNGVIELGVGSKTIKVMVADYQFD
PLKNQITHIDFLAINMSEERTVEVPVQLVGEAVGAKEGGVVEQPLFNLEVTATPDNIPEAIEVDITELNINDSLTVADVK
VTGDFKIENDSAESVVTVVAPTEEPTEEEIEAMEGEQQTEEPEVVGESKEDEEKTEE
>Q72IA7 ~~~rplY~~~Large ribosomal subunit protein bL25~~~COG1825
MEYRLKAYYREGEKPSALRRAGKLPGVMYNRHLNRKVYVDLVEFDKVFRQASIHHVIVLELPDGQSLPTLVRQVNLDKRR
RRPEHVDFFVLSDEPVEMYVPLRFVGTPAGVRAGGVLQEIHRDILVKVSPRNIPEFIEVDVSGLEIGDSLHASDLKLPPG
VELAVSPEETIAAVVPPEDVEKLAEEAAAEVAEPEVIKKGKEEEEE
>Q5SHZ1 ~~~rplY~~~Large ribosomal subunit protein bL25~~~COG1825
MEYRLKAYYREGEKPSALRRAGKLPGVMYNRHLNRKVYVDLVEFDKVFRQASIHHVIVLELPDGQSLPTLVRQVNLDKRR
RRPEHVDFFVLSDEPVEMYVPLRFVGTPAGVRAGGVLQEIHRDILVKVSPRNIPEFIEVDVSGLEIGDSLHASDLKLPPG
VELAVSPEETIAAVVPPEDVEKLAEEAAAEVAEPEVIKKGKEEEEE
>P56930 ~~~rplY~~~Large ribosomal subunit protein bL25~~~
MEYRLKAYYREGEKPSALRRAGKLPGLMYNRHLNRKVYVDLVEFDKVFRQASIHHVIVLELPDGQSLPTLVRQVNLDKRR
RRPEHVDFFVLSDEPVEMYVPLRFVGTPAGVRAGGVLQEIHRDILVKVSPRNIPEFIEVDVSGLEIGDSLHASDLKLPPG
VELAVSPEETIAAVVPPEDVEKLAEEAAAEVAEPEVIKKGKEEEEE
>Q8D8W6 ~~~rplY~~~Large ribosomal subunit protein bL25~~~
MKFEAVVRTELGKGASRRLRLAGQFPAVVYGGEAAPVAVALNHDDIVNQMDKPEFYEAITLVIGGEEVKVKPQDVQRHAF
KPKVEHMDFIRI
>B7I6V8 ~~~rpmA~~~Large ribosomal subunit protein bL27~~~
MATKKAGGSTKNGRDSNPKMLGVKVYGGQTVTAGNIIVRQRGTEFHAGANVGMGRDHTLFATADGVVKFEVKGQFGRRYV
KVETV
>P05657 ~~~rpmA~~~Large ribosomal subunit protein bL27~~~COG0211
MLRLDLQFFASKKGVGSTKNGRDSEAKRLGAKRADGQFVTGGSILYRQRGTKIYPGENVGRGGDDTLFAKIDGTVKFERF
GRDRKKVSVYPVAQ
>Q9RY65 ~~~rpmA~~~Large ribosomal subunit protein bL27~~~COG0211
MAHKKGVGSSKNGRDSNPKYLGVKKFGGEVVKAGNILVRQRGTKFKAGQGVGMGRDHTLFALSDGKVVFINKGKGARFIS
IEAAQTEVAAD
>P0A7L8 ~~~rpmA~~~Large ribosomal subunit protein bL27~~~COG0211
MAHKKAGGSTRNGRDSEAKRLGVKRFGGESVLAGSIIVRQRGTKFHAGANVGCGRDHTLFAKADGKVKFEVKGPKNRKFI
SIEAE
>Q836X4 ~~~rpmA~~~Large ribosomal subunit protein bL27~~~COG0211
MLLTMNLQLFAHKKGGGSTSNGRDSESKRLGAKSADGQTVTGGSILYRQRGTKIYPGVNVGIGGDDTLFAKVDGVVRFER
KGRDKKQVSVYPVAN
>P07844 ~~~rpmA~~~Large ribosomal subunit protein bL27~~~
MASKKGVGSTKDGRDSIAKRLGAKRADGQFVTGGSILYRQRGTKVHPGLNVGRGGDDTLYAKIDGIVRFERLGRDRKRVS
VYPVSQEA
>A2RLA2 ~~~rpmA~~~Large ribosomal subunit protein bL27~~~COG0211
MLELNLQLFAHKKGGGSTSNGRDSQAKRLGAKASDGELVSGGSILFRQRGTHIHPGTNVGRGGDDTLFAKIEGTVKFEMK
RGKKHVSVYPVVAK
>P66125 ~~~rpmA~~~Large ribosomal subunit protein bL27~~~COG0211
MLKFDIQHFAHKKGGGSTSNGRDSESKRLGAKRADGQFVTGGSILYRQRGTKIYPGTNVGRGGDDTLFAKTDGVVRFERM
GRDKKKVSVYPEVQEA
>P75458 ~~~rpmA~~~Large ribosomal subunit protein bL27~~~
MNNKYFLTKIDLQFFASKKGVGSTKNGRDSHAKRLGAKKADGQMIRTGQIIYRQRGTRVYPGVNVGLGSDDTLFALSDGL
VKYQKFGPKQGKTRVSVVKHKLDA
>A0R150 ~~~rpmA~~~Large ribosomal subunit protein bL27~~~COG0211
MAHKKGASSSRNGRDSAAQRLGVKRFGGQVVKAGEILVRQRGTHFHPGVNVGRGGDDTLFALAPGAVEFGAKRGRKTVNI
VPVARPEA
>A5U5D7 ~~~rpmA~~~Large ribosomal subunit protein bL27~~~COG0211
MAHKKGASSSRNGRDSAAQRLGVKRYGGQVVKAGEILVRQRGTKFHPGVNVGRGGDDTLFAKTAGAVEFGIKRGRKTVSI
VGSTTA
>P9WHB3 ~~~rpmA~~~Large ribosomal subunit protein bL27~~~COG0211
MAHKKGASSSRNGRDSAAQRLGVKRYGGQVVKAGEILVRQRGTKFHPGVNVGRGGDDTLFAKTAGAVEFGIKRGRKTVSI
VGSTTA
>Q9HVL7 ~~~rpmA~~~Large ribosomal subunit protein bL27~~~
MAHKKAGGSTRNGRDSESKRLGVKLFGGQAVKAGNILVRQRGTKFHAGYGVGLGKDHTLFAKVDGVVKFETKGAFGRKYV
SIVAA
>Q6NDE9 ~~~rpmA~~~Large ribosomal subunit protein bL27~~~COG0211
MAHKKAGGSSRNGRDSAGKRLGVKAFGGEHVIPGNIIARQRGTQWHPGLNVGMGTDHTLFAKVEGRVEFRAKANGRTYVS
VLPIAMQAAE
>Q2FXT0 ~~~rpmA~~~Large ribosomal subunit protein bL27~~~COG0211
MLKLNLQFFASKKGVSSTKNGRDSESKRLGAKRADGQFVTGGSILYRQRGTKIYPGENVGRGGDDTLFAKIDGVVKFERK
GRDKKQVSVYAVAE
>P66133 ~~~rpmA~~~Large ribosomal subunit protein bL27~~~
MLKLNLQFFASKKGVSSTKNGRDSESKRLGAKRADGQFVTGGSILYRQRGTKIYPGENVGRGGDDTLFAKIDGVVKFERK
GRDKKQVSVYAVAE
>Q72HR3 ~~~rpmA~~~Large ribosomal subunit protein bL27~~~COG0211
MAHKKGLGSTKNGRDSQAKRLGVKRYEGQVVRAGNILVRQRGTRFKPGKNVGMGRDFTLFALVDGVVEFQDRGRLGRYVH
VRPLA
>P60493 ~~~rpmA~~~Large ribosomal subunit protein bL27~~~COG0211
MAHKKGLGSTRNGRDSQAKRLGVKRYEGQVVRAGNILVRQRGTRFKPGKNVGMGRDFTLFALVDGVVEFQDRGRLGRYVH
VRPLA
>P0DV54 ~~~rpmB3~~~Large ribosomal subunit protein bL28C~~~
MAAVCDICGKGPGFGKSVSHSHRRTSRRWDPNIQTVHAVTRPGGNKKRLNVCTSCIKAGKITRG
>B7I4T0 ~~~rpmB~~~Large ribosomal subunit protein bL28~~~
MSKVCQVTGKRPVVGNNVSHANNKTKRRFEPNLHHHRFWLESEKRFVRLRLTTKGMRIIDKLGIEKVVADLRAQGQKI
>P37807 ~~~rpmB~~~Large ribosomal subunit protein bL28~~~COG0227
MARKCVITGKKTTAGNNRSHAMNASKRTWGANLQKVRILVNGKPKKVYVSARALKSGKVERV
>Q9RRG8 ~~~rpmB~~~Large ribosomal subunit protein bL28~~~COG0227
MSRECYLTGKKNLVVNSVIRRGKARADGGVGRKTTGITKRVQRANLHKKAIRENGQVKTVWLSANALRTLSKGPYKGIEL
I
>P0A7M2 ~~~rpmB~~~Large ribosomal subunit protein bL28~~~COG0227
MSRVCQVTGKRPVTGNNRSHALNATKRRFLPNLHSHRFWVESEKRFVTLRVSAKGMRVIDKKGIDTVLAELRARGEKY
>Q82ZE4 ~~~rpmB~~~Large ribosomal subunit protein bL28~~~COG0227
MAKVCYFTGRKTSSGNNRSHAMNSTKRTVKPNLQKVRVLIDGKPKKVWVSTRALKSGKIERV
>P23374 ~~~rpmB~~~Large ribosomal subunit protein bL28~~~
MAKCFITGKKKSFGNTRSHAMNASRRDWKANLQKVRILVDGKPKRVWVSARALKSGKVKRV
>A2RHS3 ~~~rpmB~~~Large ribosomal subunit protein bL28~~~COG0227
MSKECYFTGRKTVSSNNRSHAMNQTKRVVKPNLQKVTILENGELKTVWASAKALKKLPAGVERV
>P66144 ~~~rpmB~~~Large ribosomal subunit protein bL28~~~COG0227
MAKECVITGRKSRSGNKRSHAMNSSKRTWKANLQKVRILVNGKPKKVWVSARALKSGKVERV
>P75171 ~~~rpmB~~~Large ribosomal subunit protein bL28~~~
MAKKDQLTLRGPLYGNNRSHSKTITRRKWNVNLQPCKVKTADGKTTRILVSTRTLRTLKKHNRLS
>Q9HTN8 ~~~rpmB~~~Large ribosomal subunit protein bL28~~~
MSRVCQVTGKGPVTGNNISHAHNKTRRRFLPNLQHHRFWVESEKRFVRLRVSAKGMRIIDKRGIEAVLADLRARGEKF
>Q6NCH9 ~~~rpmB~~~Large ribosomal subunit protein bL28~~~COG0227
MSRRCELTAKGAQVGHKVSHSNIKTKRRFLPNLVNVTFLSDTLGRAVRLRVSTNALKSVDHRGGLDAYLLKAREAELSPK
AVELKRAIAKKMAGEPVAAAS
>Q2FZ60 ~~~rpmB~~~Large ribosomal subunit protein bL28~~~COG0227
MGKQCFVTGRKASTGNRRSHALNSTKRRWNANLQKVRILVDGKPKKVWVSARALKSGKVTRV
>Q2YXJ2 ~~~rpmB~~~Large ribosomal subunit protein bL28~~~
MGKQCFVTGRKASTGNRRSHALNSTKRRWNANLQKVRILVDGKPKKVWVSARALKSGKVTRV
>Q9WY96 ~~~rpmB~~~Large ribosomal subunit protein bL28~~~COG0227
MAKRCEVCGKAPRSGNTVSHSDKKSGRWFRPNLQKVRVVLPDGTIKRMRVCTSCLKSGKVKKYVGQVSEV
>Q72G84 ~~~rpmB~~~Large ribosomal subunit protein bL28~~~COG0227
MSKVCEISGKRPIVANSIQRRGKAKREGGVGKKTTGISKRRQYPNLQKVRVRVAGQEITFRVAASHIPKVYELVERAKGL
RLEGLSPKEIKKELLKLL
>P60494 ~~~rpmB~~~Large ribosomal subunit protein bL28~~~COG0227
MSKVCEISGKRPIVANSIQRRGKAKREGGVGKKTTGISKRRQYPNLQKVRVRVAGQEITFRVAASHIPKVYELVERAKGL
KLEGLSPKEIKKELLKLL
>B7IA31 ~~~rpmC~~~Large ribosomal subunit protein uL29~~~
MKTKDLREKSVEELKALLDEQQLNQFRLRMAKATGQLGKSHEVQVARKTIARIKTLLTEKQGNGQ
>P12873 ~~~rpmC~~~Large ribosomal subunit protein uL29~~~COG0255
MKANEIRDLTTAEIEQKVKSLKEELFNLRFQLATGQLENTARIREVRKAIARMKTVIREREIAANK
>Q9RXJ4 ~~~rpmC~~~Large ribosomal subunit protein uL29~~~COG0255
MKPSEMRNLQATDFAKEIDARKKELMELRFQAAAGQLAQPHRVRQLRREVAQLNTVKAELARKGEQQ
>P0A7M6 ~~~rpmC~~~Large ribosomal subunit protein uL29~~~COG0255
MKAKELREKSVEELNTELLNLLREQFNLRMQAASGQLQQSHLLKQVRRDVARVKTLLNEKAGA
>Q839F6 ~~~rpmC~~~Large ribosomal subunit protein uL29~~~COG0255
MKVKEIRELTTAEMLDKEKQLKEELFNLRFQLATGQLENTARIKEVRQSIARIKTVLREQAN
>A5FMZ2 ~~~rpmC~~~Large ribosomal subunit protein uL29~~~COG0255
MKQSEIKDLSAAELQEKLSQTKKVYADLKMAHAISPIANPLRIRSVRRTVARLATELTKRELQ
>P04457 ~~~rpmC~~~Large ribosomal subunit protein uL29~~~
MKAKEIRELTTAEIEQKIKALKEELFNLRFQLATGQLENTARIRQVRKDIARMKTIIRERELAANK
>A2RNP7 ~~~rpmC~~~Large ribosomal subunit protein uL29~~~COG0255
MKLSETKSLLKDLRALSVEELTTREAELKKELFDLRFQAAAGRLENTAKLDEVKKTIARVKTVQAELNK
>P66166 ~~~rpmC~~~Large ribosomal subunit protein uL29~~~COG0255
MKANDIRDLSTTEIQDQEKALKEELFNLRFQLATGQLENTARIREVRKAIARMKTIVRERELA
>Q50310 ~~~rpmC~~~Large ribosomal subunit protein uL29~~~
MTVAKELRQKSSEELVKLVIKLKGELLEYRFKLAHGELDKPHLINQTRRLLATILTILTERKLNWQEEQAKYKLLTKKTN
EAAVNAWKQHLEANKAKLLKSRAKREDASKK
>A0QSD9 ~~~rpmC~~~Large ribosomal subunit protein uL29~~~COG0255
MAVGTTPGELRELTDDELKDKLRESKEELFNLRFQMATGQLSNNRRLRTVRQEIARVYTVLRERELGLASGPAGEES
>A5U095 ~~~rpmC~~~Large ribosomal subunit protein uL29~~~COG0255
MAVGVSPGELRELTDEELAERLRESKEELFNLRFQMATGQLNNNRRLRTVRQEIARIYTVLRERELGLATGPDGKES
>P9WHA7 ~~~rpmC~~~Large ribosomal subunit protein uL29~~~COG0255
MAVGVSPGELRELTDEELAERLRESKEELFNLRFQMATGQLNNNRRLRTVRQEIARIYTVLRERELGLATGPDGKES
>Q9HWE3 ~~~rpmC~~~Large ribosomal subunit protein uL29~~~
MKANELREKSVEQLNEQLLGLLRDQFNLRMQKATGQLGQSHLLSQVKRDIARVKTVLNQQAGK
>Q6N4U2 ~~~rpmC~~~Large ribosomal subunit protein uL29~~~COG0255
MAEMKTADIRAMSEDQMDDAILSLKKERFNLRFQRATGQLENTSRLREARRDIARIKTIAAQKRAGKTK
>O31163 ~~~rpmC~~~Large ribosomal subunit protein uL29~~~
MNDLTKKSVEELKKLEEESRAELFALRFQSAMGNLEKPHRIGELKNQIARILTILSARKNSGENTAINVKVNLNETYAKI
EKESQAFAKQRKAKIEQMMAEQQAAEGKMANLMDLPVNDAMDLTEEQAVVSTPTGETNGLDEQKAPVAAKKPAAAKDFPK
QKDVVEEKTATGKPAAPSAKKAPVAKKDVAQETKTDKDAALKALIKEKAAAKKPAAKSKTSTPSGKTTVTVKSVTSAKAD
IEVPKETSKPVPTKTVKKAAELNAKEKLVAIKSSVAMGGTAKEPGSGVKIDLELKAKDPNAKEYTYGTNWKENRDKILTA
SKTTKKADDKTTKKGTGKK
>Q2FW14 ~~~rpmC~~~Large ribosomal subunit protein uL29~~~COG0255
MKAKEIRDLTTSEIEEQIKSSKEELFNLRFQLATGQLEETARIRTVRKTIARLKTVAREREIEQSKANQ
>Q2YYN0 ~~~rpmC~~~Large ribosomal subunit protein uL29~~~
MKAKEIRDLTTSEIEEQIKSSKEELFNLRFQLATGQLEETARIRTVRKTIARLKTVAREREIEQSKANQ
>P66173 ~~~rpmC~~~Large ribosomal subunit protein uL29~~~
MKAKEIRDLTTSEIEEQIKSSKEELFNLRFQLATGQLEETARIRTVRKTIARLKTVAREREIEQSKANQ
>P38514 ~~~rpmC~~~Large ribosomal subunit protein uL29~~~COG0255
MKASELRNYTDEELKNLLEEKKRQLMELRFQLAMGQLKNTSLIKLTKRDIARIKTILRERELGIRR
>Q72I12 ~~~rpmC~~~Large ribosomal subunit protein uL29~~~COG0255
MKLSEVRKQLEEARKLSPVELEKLVREKKRELMELRFQASIGQLSQNHKIRDLKRQIARLLTVLNEKRRQNA
>Q5SHP6 ~~~rpmC~~~Large ribosomal subunit protein uL29~~~COG0255
MKLSEVRKQLEEARKLSPVELEKLVREKKRELMELRFQASIGQLSQNHKIRDLKRQIARLLTVLNEKRRQNA
>Q9LCY4 ~~~rpmC~~~Large ribosomal subunit protein uL29~~~
MRKQLEEARKLSPVELEKLVREKKRELMELRFQASIGQLSQNHKIRDLKRQIARLLTVLNEKRRQNA
>B7IA36 ~~~rplB~~~Large ribosomal subunit protein uL2~~~
MPIQKCKPTSPGRRFVEKVVHDHLHKGAPYAPLVEAKKRTGGRNNNGHITTRHVGGGHKQHYRIVDFKRNKDGVPAVVER
IEYDPNRTAHIALLKYADGERRYIIAPKGLRAGDKVQSGNDAPIRPGNCLPLRNMPIGSTLHNVELKIGKGAQLARSAGA
SVQLLGRDGSYAIIRLRSGEMRKVHVECRAVIGEVSNQENNLRSLGKAGAARWRGVRPTVRGMAMNPIDHPHGGGEGRNK
GIQPVSPWGQKAKGYKTRTNKRTTKMIIRDRRVK
>P42919 ~~~rplB~~~Large ribosomal subunit protein uL2~~~COG0090
MAIKKYKPTSNGRRGMTTSDFAEITTDKPEKSLLAPLHKKGGRNNQGKLTVRHQGGGHKRQYRVIDFKRDKDGIPGRVAT
VEYDPNRSANIALINYADGEKRYILAPKGIQVGTEIMSGPEADIKVGNALPLINIPVGTVVHNIELKPGKGGQLVRSAGT
SAQVLGKEGKYVLVRLNSGEVRMILSACRASIGQVGNEQHELINIGKAGRSRWKGIRPTVRGSVMNPNDHPHGGGEGRAP
IGRKSPMSPWGKPTLGFKTRKKKNKSDKFIVRRRKNK
>Q9RXJ9 ~~~rplB~~~Large ribosomal subunit protein uL2~~~COG0090
MAVKKYRPYTPSRRQMTTADFSGLTKKRPEKALTEALPKTGGRNNRGRITSRFIGGGHKRLYRIIDFKRRDKSGVNAKVA
AIEYDPNRSARIALLHYADGEKRYILAPEGLTVGATVNAGPEAEPKLGNALPLRFVPVGAVVHALELVPGKGAQLARSAG
TSVQVQGKESDYVIVRLPSGELRRVHSECYATIGAVGNAEHKNIVLGKAGRSRWLGRKPHQRGSAMNPVDHPHGGGEGRT
GAGRVPVTPWGKPTKGLKTRRKRKTSDRFIVTRRK
>P60422 ~~~rplB~~~Large ribosomal subunit protein uL2~~~COG0090
MAVVKCKPTSPGRRHVVKVVNPELHKGKPFAPLLEKNSKSGGRNNNGRITTRHIGGGHKQAYRIVDFKRNKDGIPAVVER
LEYDPNRSANIALVLYKDGERRYILAPKGLKAGDQIQSGVDAAIKPGNTLPMRNIPVGSTVHNVEMKPGKGGQLARSAGT
YVQIVARDGAYVTLRLRSGEMRKVEADCRATLGEVGNAEHMLRVLGKAGAARWRGVRPTVRGTAMNPVDHPHGGGEGRNF
GKHPVTPWGVQTKGKKTRSNKRTDKFIVRRRSK
>Q839G1 ~~~rplB~~~Large ribosomal subunit protein uL2~~~COG0090
MAIKKYKPTTNGRRNMTSSDFAEITTSTPEKSLLQPLKNNAGRNNNGRITVRHQGGGHKRQYRVIDFKRNKDNVAAVVKT
IEYDPNRSANIALVHYEDGVKAYILAPKGLEVGMRLVSGPEADIKVGNALPLENIPVGTVIHNIEMKPGKGGQLIRSAGT
SAQVLGKEGKYVLIRLNSGEVRMILATCRATIGSVGNEQHELINIGKAGRSRWMRKRPTVRGSVMNPNDHPHGGGEGKTP
IGRKAPVSPWGQPAIGYKTRNKKAKSDKLIVRRRTK
>Q5L3Z4 ~~~rplB~~~Large ribosomal subunit protein uL2~~~COG0090
MAIKKYKPTSNGRRGMTVLDFSEITTDQPEKSLLAPLKKKAGRNNQGKITVRHQGGGHKRQYRIIDFKRDKDGIPGRVAT
IEYDPNRSANIALINYADGEKRYIIAPKNLKVGMEIMSGPDADIKIGNALPLENIPVGTLVHNIELKPGRGGQLVRAAGT
SAQVLGKEGKYVIVRLASGEVRMILGKCRATVGEVGNEQHELVNIGKAGRARWLGIRPTVRGSVMNPVDHPHGGGEGKAP
IGRKSPMTPWGKPTLGYKTRKKKNKSDKFIIRRRKK
>P04257 ~~~rplB~~~Large ribosomal subunit protein uL2~~~
MAIKKYKPTSNGRRGMTVLDFSEITTDQPEKSLLAPLKKRAGRNNQGKITVRHQGGGHKRQYRIIDFKRDKDGIPGRVAT
IEYDPNRSANIALINYADGEKRYIIAPKNLKVGMEIMSGPDADIKIGNALPLENIPVGTLVHNIELKPGRGGQLVRAAGT
SAQVLGKEGKYVIVRLASGEVRMILGKCRATVGEVGNEQHELVNIGKAGRARWLGIRPTVRGSVMNPVDHPHGGGEGKAP
IGRKSPMTPWGKPTLGYKTRKKKNKSDKFIIRRRKK
>A2RNQ2 ~~~rplB~~~Large ribosomal subunit protein uL2~~~COG0090
MGIKVYKPTTNGRRNMTGSDFAEITTSTPEKSLLVSMSKTAGRNNTGRITVRHHGGGHKRKYRMIDFKRTTDNVVAKVAT
IEYDPNRTANIALIVYANGVKSYILAAKGLEVGMTVVSGPDADIKVGNALPLANIPVGTLIHNIELKPGKGGQLVRSAGA
SAQVLGSEGKYTLVRLQSGEVRMILSTCRATIGVVGNEQQSLINLGKAGRTRHMGIRPTVRGSVMNPNDHPHGGGEGRQP
VGRKSPMTPWGKPALGLKTRNKKAKSSKLIVRRIND
>P60426 ~~~rplB~~~Large ribosomal subunit protein uL2~~~COG0090
MAIKKYKPTTNGRRHMTSSDFAEITTSTPEKSLLRPLKKKAGRNNQGKLTVRHHGGGHKRQYRVIDFKRNKDGIPGRVAT
IEYDPNRSANIALINYADGEKRYIIAAKGLEVGQTIYSGAEADIKVGNALELKDIPVGTVIHNIEMKPGKGGQLVRSAGT
SAQVLGKEGKYVLIRLNSGEVRMILATCRATIGQVGNEQHELINIGKAGRSRWMGKRPTVRGSVMNPNDHPHGGGEGKAP
IGRKSPMSPWGKPTLGYKTRKKNNNSDKFIVRRRKKK
>P75577 ~~~rplB~~~Large ribosomal subunit protein uL2~~~
MPIKKIISRSNSGIHHSTVIDYKKLLTTNKNKPEKSLLVTLKKHGGRNNQGKITVRHQGGRNKRKYRIIDFKRTHYDNIE
ATVKSIEYDPNRSCFVSLITYANGAKSYIISPDGIKVGDKILASEHPIDIKPGFSMPLAFIPEGTQVHNIELHPKGGGQI
ARSAGSYARILGQDETGKYVILQLLSGETRKFLKECRATVGVVSNLDHNLVVIGKAGRNRHRGIRPTVRGSAMNPNDHPH
GGGEGRSPVGRDAPRTPWGKRHMGVKTRNMKKASTNLIIRNRKGEQY
>A0QSD4 ~~~rplB~~~Large ribosomal subunit protein uL2~~~COG0090
MGIRKYKPTTPGRRGASVSDFAEITRSTPEKSLVRPLHGKGGRNAHGRITTRHKGGGHKRAYRVIDFRRHDKDGVNAKVA
HIEYDPNRTANIALLHYLDGEKRYIIAPQGLKQGDVIESGANADIKPGNNLPLRNIPAGTVIHAVELRPGGGAKLARSAG
VSIQLLGKEGTYAALRMPSGEIRRVDVRCRATVGEVGNAEQSNINWGKAGRMRWKGKRPTVRGVVMNPVDHPHGGGEGKT
SGGRHPVSPWGKPEGRTRKPNKPSDKLIVRRRRTGKKR
>A5U090 ~~~rplB~~~Large ribosomal subunit protein uL2~~~COG0090
MAIRKYKPTTPGRRGASVSDFAEITRSTPEKSLVRPLHGRGGRNAHGRITTRHKGGGHKRAYRMIDFRRNDKDGVNAKVA
HIEYDPNRTARIALLHYLDGEKRYIIAPNGLSQGDVVESGANADIKPGNNLPLRNIPAGTLIHAVELRPGGGAKLARSAG
SSIQLLGKEASYASLRMPSGEIRRVDVRCRATVGEVGNAEQANINWGKAGRMRWKGKRPSVRGVVMNPVDHPHGGGEGKT
SGGRHPVSPWGKPEGRTRNANKSSNKFIVRRRRTGKKHSR
>P9WHA5 ~~~rplB~~~Large ribosomal subunit protein uL2~~~COG0090
MAIRKYKPTTPGRRGASVSDFAEITRSTPEKSLVRPLHGRGGRNAHGRITTRHKGGGHKRAYRMIDFRRNDKDGVNAKVA
HIEYDPNRTARIALLHYLDGEKRYIIAPNGLSQGDVVESGANADIKPGNNLPLRNIPAGTLIHAVELRPGGGAKLARSAG
SSIQLLGKEASYASLRMPSGEIRRVDVRCRATVGEVGNAEQANINWGKAGRMRWKGKRPSVRGVVMNPVDHPHGGGEGKT
SGGRHPVSPWGKPEGRTRNANKSSNKFIVRRRRTGKKHSR
>Q9HWD8 ~~~rplB~~~Large ribosomal subunit protein uL2~~~
MAIVKCKPTSAGRRFVVKVVNQELHKGAPYAPLLEKKSKSGGRNNNGRITTRHIGGGHKQHYRLVDFRRNKDGIPAIVER
VEYDPNRTAHIALLKYADGERRYIIAPKGVAAGDQLISGIGAPIKAGNSMPLRNIPVGSTVHGIELKPGKGAQIARSAGA
SAQLVAREGAYVTLRLRSGEMRKVLAECRATLGEVSNSEHSLRSLGKAGATRWRGVRPTVRGVAMNPVDHPHGGGEGRTS
AGRHPVSPWGLQTKGKKTRSNKRTDNMIVRRRK
>P60403 ~~~rplB~~~Large ribosomal subunit protein uL2~~~COG0090
MALKTFNPTTPGQRQLVMVDRSALYKGKPVKRLTEGKNSNGGRNNTGRITVRFRGGGHKQAYRLVDFKRTKVDVPAKVER
LEYDPNRTAFIALIKYEDGEQAYILAPQRLAVGDTVIAGAYVDVKPGNVMPLGNMPIGTIVHNVELKIGKGGQLARSAGT
YAQIVGRDHDYVILRMNSGEQRLIHGRCIAAIGAVSNPDHMNISIGKAGRKRWLGRRPHNRGVVMNPIDHPHGGGEGRTS
GGRHPVTPWGKPTKGKKTRSNKSTDKFILISRHKRKKK
>P60430 ~~~rplB~~~Large ribosomal subunit protein uL2~~~COG0090
MAIKKYKPITNGRRNMTSLDFAEITKTTPEKSLLKPLPKKAGRNNQGKLTVRHHGGGHKRQYRVIDFKRNKDGINAKVDS
IQYDPNRSANIALVVYADGEKRYIIAPKGLEVGQIVESGAEADIKVGNALPLQNIPVGTVVHNIELKPGKGGQIARSAGA
SAQVLGKEGKYVLIRLRSGEVRMILSTCRATIGQVGNLQHELVNVGKAGRSRWKGIRPTVRGSVMNPNDHPHGGGEGRAP
IGRPSPMSPWGKPTLGKKTRRGKKSSDKLIVRGRKKK
>Q2YYP9 ~~~rplB~~~Large ribosomal subunit protein uL2~~~
MAIKKYKPITNGRRNMTSLDFAEITKTTPEKSLLKPLPKKAGRNNQGKLTVRHHGGGHKRQYRVIDFKRNKDGINAKVDS
IQYDPNRSANIALVVYADGEKRYIIAPKGLEVGQIVESGAEADIKVGNALPLQNIPVGTVVHNIELKPGKGGQIARSAGA
SAQVLGKEGKYVLIRLRSGEVRMILSTCRATIGQVGNLQHELVNVGKAGRSRWKGIRPTVRGSVMNPNDHPHGGGEGRAP
IGRPSPMSPWGKPTLGKKTRRGKKSSDKLIVRGRKKK
>P60432 ~~~rplB~~~Large ribosomal subunit protein uL2~~~
MAIKKYKPITNGRRNMTSLDFAEITKTTPEKSLLKPLPKKAGRNNQGKLTVRHHGGGHKRQYRVIDFKRNKDGINAKVDS
IQYDPNRSANIALVVYADGEKRYIIAPKGLEVGQIVESGAEADIKVGNALPLQNIPVGTVVHNIELKPGKGGQIARSAGA
SAQVLGKEGKYVLIRLRSGEVRMILSTCRATIGQVGNLQHELVNVGKAGRSRWKGIRPTVRGSVMNPNDHPHGGGEGRAP
IGRPSPMSPWGKPTLGKKTRRGKKSSDKLIVRGRKKK
>Q9AMK8 ~~~rplB~~~Large ribosomal subunit protein uL2~~~
MAIRKYKPTTPGRRGASVADFVEVTRSTPEKSLVRPLHSKGGRNNAGRITVRHQGGGHKRAYRIVDFRRHDKDGVPAKVA
HIEYDPNRSARIALLHYADGEKRYILAPRNLQQGDRVENGPGADIKPGNNLALRNIPVGTTIHAIELRPGGGAKFARSAG
ASVQLLAKEGAYAHLRMPSGEIRLVNVRCRATIGEVGNAEQSNINWGKAGRKRWLGVRPTVRGVVMNPVDHPHGGGEGRT
SGGRHPVSPWGKPTGRTRSNKKASNKYIVRRRTKNKKR
>Q72I07 ~~~rplB~~~Large ribosomal subunit protein uL2~~~COG0090
MAVKKFKPYTPSRRFMTVADFSEITKTEPEKSLVKPLKKTGGRNNQGRITVRFRGGGHKRLYRIIDFKRWDKVGIPAKVA
AIEYDPNRSARIALLHYVDGEKRYIIAPDGLQVGQQVVAGPDAPIQVGNALPLRFIPVGTVVHAVELEPKKGAKLARAAG
TSAQIQGREGDYVILRLPSGELRKVHGECYATVGAVGNADHKNIVLGKAGRSRWLGRRPHVRGAAMNPVDHPHGGGEGRA
PRGRPPASPWGWQTKGLKTRKRRKPSSRFIIARRKK
>P60405 ~~~rplB~~~Large ribosomal subunit protein uL2~~~COG0090
MAVKKFKPYTPSRRFMTVADFSEITKTEPEKSLVKPLKKTGGRNNQGRITVRFRGGGHKRLYRIIDFKRWDKVGIPAKVA
AIEYDPNRSARIALLHYVDGEKRYIIAPDGLQVGQQVVAGPDAPIQVGNALPLRFIPVGTVVHAVELEPKKGAKLARAAG
TSAQIQGREGDYVILRLPSGELRKVHGECYATVGAVGNADHKNIVLGKAGRSRWLGRRPHVRGAAMNPVDHPHGGGEGRA
PRGRPPASPWGWQTKGLKTRKRRKPSSRFIIARRKK
>B7IA21 ~~~rpmD~~~Large ribosomal subunit protein uL30~~~
MKTIKVTQTKSSSHRLKNHKLCLQGLGLRRIGHTVEVQDTPSNRGMINKVYYMVSVEE
>P19947 ~~~rpmD~~~Large ribosomal subunit protein uL30~~~COG1841
MAKLEITLKRSVIGRPEDQRVTVRTLGLKKTNQTVVHEDNAAIRGMINKVSHLVSVKEQ
>Q9RSL0 ~~~rpmD~~~Large ribosomal subunit protein uL30~~~COG1841
MKIKLVRSVIGRPGNQVKTVQALGLRKIGDSREVSDTPAVRGMVKTVKHLLEVQE
>P0AG51 ~~~rpmD~~~Large ribosomal subunit protein uL30~~~COG1841
MAKTIKITQTRSAIGRLPKHKATLLGLGLRRIGHTVEREDTPAIRGMINAVSFMVKVEE
>Q839E6 ~~~rpmD~~~Large ribosomal subunit protein uL30~~~COG1841
MAELKITLKRSVIGRPQNQRATVKALGLGKVNSTVTKPANEAIKGMVNTISHLVDVEEV
>P02431 ~~~rpmD~~~Large ribosomal subunit protein uL30~~~
MAKKLAITLTRSVIGRPEDQRITVRTLGLRKMHQTVVHNDNPAIRGMINKVAHLVKVKEIEEE
>A2RNN5 ~~~rpmD~~~Large ribosomal subunit protein uL30~~~COG1841
MAQIKITLVNSPIGRIPAQRKTVKALGLGKLNSSVVKEGSPAILGMVNSISHLVKVEEA
>Q927M5 ~~~rpmD~~~Large ribosomal subunit protein uL30~~~COG1841
MAKLEITLKRSLIGRPQPQRKTVQALGLGKTNSVVVKEDNPAIRGMITKVSHLVDVKEV
>A0QSG7 ~~~rpmD~~~Large ribosomal subunit protein uL30~~~COG1841
MAELKITQVRSTIGARWKQRESLRTLGLKKIRQSVVREDNAQTRGLINTVHHLVEVEEVGK
>A5U0A8 ~~~rpmD~~~Large ribosomal subunit protein uL30~~~COG1841
MSQLKITQVRSTIGARWKQRESLRTLGLRRIRHSVIREDNAATRGLIAVVRHLVEVEPAQTGGKT
>P9WHA3 ~~~rpmD~~~Large ribosomal subunit protein uL30~~~COG1841
MSQLKITQVRSTIGARWKQRESLRTLGLRRIRHSVIREDNAATRGLIAVVRHLVEVEPAQTGGKT
>Q9HWF3 ~~~rpmD~~~Large ribosomal subunit protein uL30~~~
MATVKVTLVKSLNGRLANHKACVKGLGLRRINHTVEVQDTPENRGMINKAYYLLRVEG
>Q6N4V1 ~~~rpmD~~~Large ribosomal subunit protein uL30~~~COG1841
MAKAANMIKVEQIGSPIRRHHSQRETLIGLKLNKIGRVAELQDTPEVRGMIGKVQHLVRVVDEK
>P0A0G2 ~~~rpmD~~~Large ribosomal subunit protein uL30~~~COG1841
MAKLQITLTRSVIGRPETQRKTVEALGLKKTNSSVVVEDNPAIRGQINKVKHLVTVEEK
>Q2YYL5 ~~~rpmD~~~Large ribosomal subunit protein uL30~~~
MAKLQITLTRSVIGRPETQRKTVEALGLKKTNSSVVVEDNPAIRGQINKVKHLVTVEEK
>P0A0G0 ~~~rpmD~~~Large ribosomal subunit protein uL30~~~
MAKLQITLTRSVIGRPETQRKTVEALGLKKTNSSVVVEDNPAIRGQINKVKHLVTVEEK
>Q72I22 ~~~rpmD~~~Large ribosomal subunit protein uL30~~~COG1841
MPRLKVKLVKSPIGYPKDQKAALKALGLRRLQQERVLEDTPAIRGNVEKVAHLVRVEVVE
>Q5SHQ6 ~~~rpmD~~~Large ribosomal subunit protein uL30~~~COG1841
MPRLKVKLVKSPIGYPKDQKAALKALGLRRLQQERVLEDTPAIRGNVEKVAHLVRVEVVE
>P74909 ~~~rpmD~~~Large ribosomal subunit protein uL30~~~
MPRLKVKLVKSPIGYPKDQKAALKALGLRRLQQERVLEDTPAIRGNVEKVAHLVRVEVVE
>O34967 ~~~rpmE2~~~Large ribosomal subunit protein bL31B~~~COG0254
MKEGIHPKNHKVIFQDVNSGYRFLSTSTKTSNETAEWEDGNTYPVIKVEVSSDTHPFYTGRQKFNEKGGRVEQFKKRYNM
GK
>P0A7N1 ~~~ykgM~~~Large ribosomal subunit protein bL31B~~~COG0254
MKPNIHPEYRTVVFHDTSVDEYFKIGSTIKTDREIELDGVTYPYVTIDVSSKSHPFYTGKLRTVASEGNVARFTQRFGRF
VSTKKGA
>Q836E3 ~~~rpmE2~~~Large ribosomal subunit protein bL31B~~~COG0254
MKQDIHPNYQPVVFMDSTTGFKFLSGSTKGSSETVEWEDGNTYPLLRVEVTSDSHPFYTGRQKFTQADGRVDRFNKKYGL
KDENANPDA
>A2RJP7 ~~~rpmE2~~~Large ribosomal subunit protein bL31B~~~COG0254
MKQNIHPNYQPVVFMDTTTGYKFLTGSTKGSKETVEWEDGNTYPLIRVEISSDSHPFYTGRQKFQAADGRIARFEKKYGK
Q
>Q88Z52 ~~~rpmE2~~~Large ribosomal subunit protein bL31B~~~COG0254
MQKGIHPDYHLVVFQDSTTGFKFISGSTATSAETVEWEDGNTYPLIRVEITSDSHPFYTGKQKFTKADGAVDRFNKKYGL
K
>P0A485 ~~~rpmE2~~~Large ribosomal subunit protein bL31B~~~COG0254
MKTGIHPEYRPVVFVDTSTDFKFLSGSTKSSSETIKWEDGNEYPLLRVEISSDSHPFYTGKQKHATADGRVDRFNKKYGL
K
>P66194 ~~~rpmE2~~~Large ribosomal subunit protein bL31B~~~COG0254
MKPDIHPVYRTVVFHDTSANEYVKVGSTIKTEREIELDGVTYPYVTIDVSSKSHPFYTGRQKTFDSESSAARFQKRFGHF
IGAKRG
>Q2FWD8 ~~~rpmE2~~~Large ribosomal subunit protein bL31B~~~COG0254
MKQGIHPEYHQVIFLDTTTNFKFLSGSTKTSSEMMEWEDGKEYPVIRLDISSDSHPFYTGRQKFAAADGRVERFNKKFGL
KSNN
>Q2YUN3 ~~~rpmE2~~~Large ribosomal subunit protein bL31B~~~
MKQGIHPEYHQVIFLDTTTNFKFLSGSTKTSSEMMEWEDGKEYPVIRLDISSDSHPFYTGRQKFAAADGRVERFNKKFGL
KSNN
>P66196 ~~~rpmE2~~~Large ribosomal subunit protein bL31B~~~
MKQGIHPEYHQVIFLDTTTNFKFLSGSTKTSSEMMEWEDGKEYPVIRLDISSDSHPFYTGRQKFAAADGRVERFNKKFGL
KSNN
>Q8DTN5 ~~~rpmE2~~~Large ribosomal subunit protein bL31B~~~COG0254
MKKDIHPEYRPVVFMDTSTGYQFLSGSTKTSKETVEFEGETYPLIRVEISSDSHPFYTGRQKFTQADGRVDRFNKKYGLK
>Q9KTM4 ~~~rpmE2~~~Large ribosomal subunit protein bL31B~~~COG0254
MKAGIHPDYRKVVFHDTTVDHYFVVGSTLQTDRTIEWEGKTYPYITIEVSSESHPFYTGKQRVVQKEGRVANFTRRFGQF
AKESK
>P58472 ~~~rpmE2~~~Large ribosomal subunit protein bL31B~~~COG0254
MKPNIHPPYRTVVFHDTSADAYFTVGSTIATERTIERDGQTYPYVTLDISSASHPYYTGKQKEFAKEGSTARFHQRFGSF
LTKKTN
>Q03223 ~~~rpmE~~~Large ribosomal subunit protein bL31~~~COG0254
MKAGIHPNFKKATVKCACGNEFETGSVKEEVRVEICSECHPFYTGRQKFASADGRVDRFNKKYGLK
>Q9RW44 ~~~rpmE~~~Large ribosomal subunit protein bL31~~~COG0254
MQKDLHPKAVPCKIIYQGQVVMETMSTRPEIHVDVWSGVHPFWTGEERFLDTEGRVDKFNKRFGDSYRRGSKK
>P0A7M9 ~~~rpmE~~~Large ribosomal subunit protein bL31~~~COG0254
MKKDIHPKYEEITASCSCGNVMKIRSTVGHDLNLDVCSKCHPFFTGKQRDVATGGRVDRFNKRFNIPGSK
>P78020 ~~~rpmE~~~Large ribosomal subunit protein bL31~~~
MKKDFHFPSQSVSFKCASCSNSFTIESTLKQKEITIDICGKCHPFYIGELTKQTVHGRAEKLSGKFNAGKAFLENKTPKK
AKGKTEEYTKHRSLNEL
>A0R215 ~~~rpmE~~~Large ribosomal subunit protein bL31~~~COG0254
MKTGIHPEYVDTTVQCGCGHSFTTRSTKQSGTIVVEVCSQCHPFYTGKQKILDSGGRVARFEKRYGKRNKAAADK
>A5U1Z7 ~~~rpmE~~~Large ribosomal subunit protein bL31~~~COG0254
MKSDIHPAYEETTVVCGCGNTFQTRSTKPGGRIVVEVCSQCHPFYTGKQKILDSGGRVARFEKRYGKRKVGADKAVSTGK
>P9WHA1 ~~~rpmE~~~Large ribosomal subunit protein bL31~~~COG0254
MKSDIHPAYEETTVVCGCGNTFQTRSTKPGGRIVVEVCSQCHPFYTGKQKILDSGGRVARFEKRYGKRKVGADKAVSTGK
>Q02EW8 ~~~rpmE~~~Large ribosomal subunit protein bL31~~~
MKADIHPTYEAIEATCSCGNVIKTRSTLCKPIHLDVCSECHPFYTGKQKVLDTGGRIDRFKQRFGVFGATK
>Q9HUD0 ~~~rpmE~~~Large ribosomal subunit protein bL31~~~
MKADIHPTYEAIEATCSCGNVIKTRSTLCKPIHLDVCSECHPFYTGKQKVLDTGGRIDRFKQRFGVFGATK
>Q6NBB0 ~~~rpmE~~~Large ribosomal subunit protein bL31~~~COG0254
MKTEIHPDYHTITVVMTDGTEYQTRSTWGKEGDKLNLDIDPKSHPAWTGGTQQVLDRGGRVSRFQKKFSGFLKKD
>Q9K4E5 ~~~rpmE~~~Large ribosomal subunit protein bL31~~~COG0254
MKRDIHPEYVETQVSCTCGASFTTRSTIESGTIRAEVCSECHPFYTGKQKILDTGGRVARFEARFGKASAGSKK
>Q72JR0 ~~~rpmE~~~Large ribosomal subunit protein bL31~~~COG0254
MKEGIHPKLVPARIICGCGNVIETYSTKPEIYVEVCSKCHPFYTGQQRFVDTEGRVERFQRRYGDSYRKGR
>Q5SJE1 ~~~rpmE~~~Large ribosomal subunit protein bL31~~~COG0254
MKEGIHPKLVPARIICGCGNVIETYSTKPEIYVEVCSKCHPFYTGQQRFVDTEGRVERFQRRYGDSYRKGR
>P66207 ~~~rpmF2~~~Large ribosomal subunit protein bL32B~~~COG0333
MAVPFRRTSKAKKRKRRTHVKLQLPGMNECSNCGEYRLSHHVCPECGQYDGKDVANS
>Q836R0 ~~~rpmF3~~~Large ribosomal subunit protein bL32C~~~COG0333
MAVPARRTSKAKKAKRRTHYKLTIKGLNACSNCGEMKKSHHVCPACGHYDGKDVMSKEA
>B7I7A4 ~~~rpmF~~~Large ribosomal subunit protein bL32~~~
MAVQQNRKSRSRRDMRRSHDALTENALTVDQATGETHRRHHVTKDGFYRGRQLFAKAADAE
>O34687 ~~~rpmF~~~Large ribosomal subunit protein bL32~~~COG0333
MAVPFRRTSKMKKRLRRTHFKLNVPGMTECPSCGEMKLSHRVCKACGSYNGKDINVKSN
>P49228 ~~~rpmF~~~Large ribosomal subunit protein bL32~~~COG0333
MAKHPVPKKKTSKSKRDMRRSHHALTAPNLTECPQCHGKKLSHHICPNCGYYDGRQVLAV
>P0A7N4 ~~~rpmF~~~Large ribosomal subunit protein bL32~~~COG0333
MAVQQNKPTRSKRGMRRSHDALTAVTSLSVDKTSGEKHLRHHITADGYYRGRKVIAK
>P07840 ~~~rpmF~~~Large ribosomal subunit protein bL32~~~
MAVPFRRTSKTRKRLRRTHFKLQVPGMVQCPNCGEWKLAHRVCKACGTYKGRDVVNK
>A2RHH0 ~~~rpmF~~~Large ribosomal subunit protein bL32~~~COG0333
MAVPARHTSSAKKNRRRTHYKLTAPTVTFDETTGDYRHSHRVSLKGYYKGRKVRDTK
>P75238 ~~~rpmF~~~Large ribosomal subunit protein bL32~~~
MAVQQRRSSKHRRDKRRSHDALTAQALSVCQKCGKKKLFHRVCSCGMYGDLRVKKAY
>A0R3I9 ~~~rpmF~~~Large ribosomal subunit protein bL32~~~
MAVPKRRMSRANTRSRRAQWKAEAPGLVTVSVAGQQRKVPRRLLKAARLGLVDLDKR
>A5U121 ~~~rpmF~~~Large ribosomal subunit protein bL32~~~
MAVPKRRKSRSNTRSRRSQWKAAKTELVGVTVAGHAHKVPRRLLKAARLGLIDFDKR
>P9WH99 ~~~rpmF~~~Large ribosomal subunit protein bL32~~~
MAVPKRRKSRSNTRSRRSQWKAAKTELVGVTVAGHAHKVPRRLLKAARLGLIDFDKR
>Q9HZN4 ~~~rpmF~~~Large ribosomal subunit protein bL32~~~
MAVQQNKKSRSARDMRRSHDALESNALSVEKSTGEVHLRHHVSPDGFYRGRKVVDKGSDE
>Q6NCE6 ~~~rpmF~~~Large ribosomal subunit protein bL32~~~COG0333
MAVPRRKTSPSRRGMRRSADAIKRPTYVEDKDSGELRRPHHLDLKTGMYKGRQVLKKKDS
>Q2FZF1 ~~~rpmF~~~Large ribosomal subunit protein bL32~~~COG0333
MAVPKRRTSKTRKNKRRTHFKISVPGMTECPNCGREYKLSHRVCKNCGSYNGEEVAAK
>P66210 ~~~rpmF~~~Large ribosomal subunit protein bL32~~~
MAVPKRRTSKTRKNKRRTHFKISVPGMTECPNCGEYKLSHRVCKNCGSYNGEEVAAK
>P62652 ~~~rpmF~~~Large ribosomal subunit protein bL32~~~COG0333
MAKHPVPKKKTSKARRDARRSHHALTPPILVPCPECKAMKPPHTVCPECGYYAGRKVLEV
>P80339 ~~~rpmF~~~Large ribosomal subunit protein bL32~~~COG0333
MAKHPVPKKKTSKARRDARRSHHALTPPTLVPCPECKAMKPPHTVCPECGYYAGRKVLEV
>P56849 ~~~rpmGA~~~Large ribosomal subunit protein bL33A~~~COG0267
MRVNITLACTECGERNYISKKNKRNNPDRVEFKKYCPRDKKSTLHRETK
>P23375 ~~~rpmGA~~~Large ribosomal subunit protein bL33A~~~
MRVNITLACTECGERNYITSKNKRNNPERLELKKYCPRDRKVTLHRETK
>P66219 ~~~rpmG1~~~Large ribosomal subunit protein bL33A~~~COG0267
MRVNITLECTECGDRNYITTKNKRENPERIELKKYCPRLRRVTLHRETK
>P78015 ~~~rpmG1~~~Large ribosomal subunit protein bL33A~~~
MAVKRSTRLGCNDCREINYLTFKNVKKNPEKLALNKFCSRCRKVVVHKEVKRK
>A0QS39 ~~~rpmG1~~~Large ribosomal subunit protein bL33A~~~COG0267
MASSTDVRPKITLACEVCKHRNYITKKNRRNDPDRLEIKKFCPNCGTHQPHKESR
>A5U019 ~~~rpmG1~~~Large ribosomal subunit protein bL33A~~~COG0267
MASSTDVRPKITLACEVCKHRNYITKKNRRNDPDRLELKKFCPNCGKHQAHRETR
>Q2FYU6 ~~~rpmG1~~~Large ribosomal subunit protein bL33A~~~COG0267
MRVNVTLACTECGDRNYITTKNKRNNPERIEMKKYCPRLNKYTLHRETK
>A0R551 ~~~rpmG2~~~Large ribosomal subunit protein bL33B~~~COG0267
MARNEIRPIVKLRSTAGTGYTYVTRKNRRNDPDRIVLRKYDPVLRRHVEFREER
>P9WH95 ~~~rpmG2~~~Large ribosomal subunit protein bL33B~~~COG0267
MASSTDVRPKITLACEVCKHRNYITKKNRRNDPDRLELKKFCPNCGKHQAHRETR
>Q2FY22 ~~~rpmG2~~~Large ribosomal subunit protein bL33B~~~COG0267
MRVNVTLACTECGDRNYITTKNKRNNPERVEMKKFCSRENKQTLHRETK
>P66231 ~~~rpmG2~~~Large ribosomal subunit protein bL33B~~~
MRVNVTLACTECGDRNYITTKNKRNNPERIEMKKYCPRLNKYTLHRETK
>P59628 ~~~rpmG3~~~Large ribosomal subunit protein bL33C~~~COG0267
MRVNITLECTSCKERNYLTNKNKRNNPDRLEKQKYCPRERKVTLHRETK
>A2RNR2 ~~~rpmG3~~~Large ribosomal subunit protein bL33C~~~COG0267
MLRKAGLACTVCGSRNYTLNLSSVAKEKRVEVKKFCRTCGKHTLHKETR
>Q9RSS4 ~~~rpmG~~~Large ribosomal subunit protein bL33~~~COG0267
MAKDGPRIIVKMESSAGTGFYYTTTKNRRNTQAKLELKKYDPVAKKHVVFREKKV
>P0A7N9 ~~~rpmG~~~Large ribosomal subunit protein bL33~~~COG0267
MAKGIREKIKLVSSAGTGHFYTTTKNKRTKPEKLELKKFDPVVRQHVIYKEAKIK
>Q9HTN9 ~~~rpmG~~~Large ribosomal subunit protein bL33~~~
MRELIRLVSSAGTGHFYTTDKNKRTKPEKIEIKKYDPVVRQHVIYKEAKIK
>Q6N554 ~~~rpmG~~~Large ribosomal subunit protein bL33~~~COG0267
MAKAVTIKIKLVSTADTGFYYVTKKNSRTMTDKMVKKKYDPVARKHVEFKEAKIK
>Q72GW3 ~~~rpmG~~~Large ribosomal subunit protein bL33~~~COG0267
MASEVRIKLLLECTECKRRNYATEKNKRNTPNKLELRKYCPWCRKHTVHREVKI
>P35871 ~~~rpmG~~~Large ribosomal subunit protein bL33~~~COG0267
MASEVRIKLLLECTECKRRNYATEKNKRNTPNKLELRKYCPWCRKHTVHREVKI
>B7IBH8 ~~~rpmH~~~Large ribosomal subunit protein bL34~~~
MKRTFQPSELKRKRVHGFRARMATKAGRQVLARRRAKGRHSLTV
>P05647 ~~~rpmH~~~Large ribosomal subunit protein bL34~~~COG0230
MKRTFQPNNRKRSKVHGFRSRMSSKNGRLVLARRRRKGRKVLSA
>Q9RSH2 ~~~rpmH~~~Large ribosomal subunit protein bL34~~~COG0230
MKRTYQPNNRKRAKTHGFRARMKTKSGRNILARRRAKGRHQLTVSDE
>P0A7P5 ~~~rpmH~~~Large ribosomal subunit protein bL34~~~COG0230
MKRTFQPSVLKRNRSHGFRARMATKNGRQVLARRRAKGRARLTVSK
>Q82YU9 ~~~rpmH~~~Large ribosomal subunit protein bL34~~~COG0230
MKRTYQPNKRKRQKVHGFRKRMSTKNGRRVLASRRRKGRKVISA
>P23376 ~~~rpmH~~~Large ribosomal subunit protein bL34~~~
MKRTYQPNRRKRSKVHGFRARMSTKNGRKVLARRRRKGRKVLSA
>A2RHL6 ~~~rpmH~~~Large ribosomal subunit protein bL34~~~COG0230
MKRTYQPHKKSRKTTHGFRSRMATKNGRRVLAARRRKGRASLTV
>P66248 ~~~rpmH~~~Large ribosomal subunit protein bL34~~~COG0230
MKRTYQPSKRKRKKVHGFRTRMSTKNGRRVLASRRRKGRKVLSA
>P78006 ~~~rpmH~~~Large ribosomal subunit protein bL34~~~
MKRTYQPSKLKRAKTHGFLARMATASGRKVLKLRRKKQRAQLTVSSER
>A0R7K0 ~~~rpmH~~~Large ribosomal subunit protein bL34~~~COG0230
MAKGKRTFQPNNRRRARVHGFRLRMRTRAGRAIVANRRSKGRRALTA
>P0C562 ~~~rpmH~~~Large ribosomal subunit protein bL34~~~COG0230
MAKGKRTFQPNNRRRARVHGFRLRMRTRAGRAIVANRRSKGRRALTA
>A5U9Q2 ~~~rpmH~~~Large ribosomal subunit protein bL34~~~COG0230
MTKGKRTFQPNNRRRARVHGFRLRMRTRAGRSIVSSRRRKGRRTLSA
>P9WH93 ~~~rpmH~~~Large ribosomal subunit protein bL34~~~COG0230
MTKGKRTFQPNNRRRARVHGFRLRMRTRAGRSIVSSRRRKGRRTLSA
>P29436 ~~~rpmH~~~Large ribosomal subunit protein bL34~~~
MKRTFQPSTLKRARVHGFRARMATKNGRQVLSRRRAKGRKRLTV
>Q2FUQ0 ~~~rpmH~~~Large ribosomal subunit protein bL34~~~COG0230
MVKRTYQPNKRKHSKVHGFRKRMSTKNGRKVLARRRRKGRKVLSA
>Q2YZB6 ~~~rpmH~~~Large ribosomal subunit protein bL34~~~
MVKRTYQPNKRKHSKVHGFRKRMSTKNGRKVLARRRRKGRKVLSA
>P80340 ~~~rpmH~~~Large ribosomal subunit protein bL34~~~COG0230
MKRTWQPNRRKRAKTHGFRARMRTPGGRKVLKRRRQKGRWRLTPAVRKR
>B7I693 ~~~rpmI~~~Large ribosomal subunit protein bL35~~~
MAKLKTRRGAAKRFKATANGFKRKQAFKRHILTKKSAKRIRQLRGCVMVHVSDVASVRRMCPYI
>P55874 ~~~rpmI~~~Large ribosomal subunit protein bL35~~~COG0291
MPKMKTHRGSAKRFKKTGSGKLKRSHAYTSHLFANKSQKQKRKLRKSAVVSAGDFKRIKQMLANIK
>Q9RSW6 ~~~rpmI~~~Large ribosomal subunit protein bL35~~~COG0291
MPKMKTHKMAKRRIKITGTGKVMAFKSGKRHQNTGKSGDEIRGKGKGFVLAKAEWARMKLMLPRGK
>P0A7Q1 ~~~rpmI~~~Large ribosomal subunit protein bL35~~~COG0291
MPKIKTVRGAAKRFKKTGKGGFKHKHANLRHILTKKATKRKRHLRPKAMVSKGDLGLVIACLPYA
>Q837C8 ~~~rpmI~~~Large ribosomal subunit protein bL35~~~COG0291
MPKQKTHRGLAKRVKRTGGGGLKRGRAFTSHRFHGKTKKQRRQLRKASMVAKGDYKRIRQQLARMK
>A2RMR2 ~~~rpmI~~~Large ribosomal subunit protein bL35~~~COG0291
MPKQKTHRASAKRFKRTGNGGLKRFRAYTSHRFHGKSVKQRRQLRKASMVSKGDFKRIRRMVATMK
>P0A491 ~~~rpmI~~~Large ribosomal subunit protein bL35~~~COG0291
MPKMKTHRGSAKRFKRTGSGKLKRRHGFTSHMFANKSQKQKRKLRKSAMVSAGDFKRIRQMVAKMK
>P75447 ~~~rpmI~~~Large ribosomal subunit protein bL35~~~
MKVKSAAKKRFKLTKSGQIKRKHAYTSHLAPHKTTKQKRHLRKQGTVSASDFKRIGNLI
>A0QYU7 ~~~rpmI~~~Large ribosomal subunit protein bL35~~~COG0291
MPKAKTHSGASKRFRRTGTGKIVRQKANRRHLLEHKPTKRTRRLDGRTTVSAADNSRINKLLNG
>A5U2Z9 ~~~rpmI~~~Large ribosomal subunit protein bL35~~~COG0291
MPKAKTHSGASKRFRRTGTGKIVRQKANRRHLLEHKPSTRTRRLDGRTVVAANDTKRVTSLLNG
>P9WH91 ~~~rpmI~~~Large ribosomal subunit protein bL35~~~COG0291
MPKAKTHSGASKRFRRTGTGKIVRQKANRRHLLEHKPSTRTRRLDGRTVVAANDTKRVTSLLNG
>Q9I0A1 ~~~rpmI~~~Large ribosomal subunit protein bL35~~~
MPKMKTKSGAAKRFKKTAGGLKHKHAFKSHILTKMTTKRKRQLRGTSMLNKSDVARVERSLRLR
>Q6NDR5 ~~~rpmI~~~Large ribosomal subunit protein bL35~~~COG0291
MPKLKTKSGAKKRFKVTATGKVMSAQRGKRHGMIKRTKKQIRQLRGTRAIFKTDGDNIKKYFLPNA
>Q2FXQ0 ~~~rpmI~~~Large ribosomal subunit protein bL35~~~COG0291
MPKMKTHRGAAKRVKRTASGQLKRSRAFTSHLFANKSTKQKRQLRKARLVSKSDMKRVKQLLAYKK
>Q2YTB0 ~~~rpmI~~~Large ribosomal subunit protein bL35~~~
MPKMKTHRGAAKRVKRTASGQLKRSRAFTSHLFANKSTKQKRQLRKARLVSKSDMKRVKQLLAYKK
>Q72L77 ~~~rpmI~~~Large ribosomal subunit protein bL35~~~COG0291
MPKMKTHKGAKKRVKITASGKVVAMKTGKRHLNWQKSGKEIRQKGRKFVLAKPEAERIKLLLPYE
>Q5SKU1 ~~~rpmI~~~Large ribosomal subunit protein bL35~~~COG0291
MPKMKTHKGAKKRVKITASGKVVAMKTGKRHLNWQKSGKEIRQKGRKFVLAKPEAERIKLLLPYE
>P80341 ~~~rpmI~~~Large ribosomal subunit protein bL35~~~
MPKMKTHKGAKKRVKITASGKVVAMKTGKRHLNWQKSGKEIRQKGRKFVLAKPEAERIKLLLPYE
>Q9HWF6 ~~~rpmJ~~~Large ribosomal subunit protein bL36A~~~
MKVRASVKKLCRNCKIIRRDGIVRVICSAEPRHKQRQG
>Q2EEQ2 ~~~ykgO~~~Large ribosomal subunit protein bL36B~~~COG0257
MKVLNSLRTAKERHPDCQIVKRKGRLYVICKSNPRFKAVQGRKKKR
>B7IA18 ~~~rpmJ~~~Large ribosomal subunit protein bL36~~~
MKVQASVKKICGSCKVIRRNGVIRVICSAEPRHKQRQG
>P20278 ~~~rpmJ~~~Large ribosomal subunit protein bL36~~~COG0257
MKVRPSVKPICEKCKVIRRKGKVMVICENPKHKQKQG
>Q9RSK0 ~~~rpmJ~~~Large ribosomal subunit protein bL36~~~COG0257
MKVRSSVKKMCDNCKVVRRHGRVLVICSNVKHKQRQG
>P0A7Q6 ~~~rpmJ~~~Large ribosomal subunit protein bL36A~~~COG0257
MKVRASVKKLCRNCKIVKRDGVIRVICSAEPKHKQRQG
>Q839E1 ~~~rpmJ~~~Large ribosomal subunit protein bL36~~~COG0257
MKVRPSVKPMCEHCKVIRRKGRVMVICPANPKHKQRQG
>P07841 ~~~rpmJ~~~Large ribosomal subunit protein bL36~~~
MKVRPSVKPICEKCKVIRRRGKVMVICENPKHKQRQG
>A2RNN0 ~~~rpmJ~~~Large ribosomal subunit protein bL36~~~COG0257
MKVRPSVKPICEYCKVIRRNGRVMVICPANPKHKQRQG
>P66290 ~~~rpmJ~~~Large ribosomal subunit protein bL36~~~COG0257
MKVRPSVKPMCEKCKVIRRKGKVMVICENPKHKQKQG
>P52864 ~~~rpmJ~~~Large ribosomal subunit protein bL36~~~
MKVRASVKPICKDCKIIKRHQIVRVICKTQKHKQRQG
>A0QSL4 ~~~rpmJ~~~Large ribosomal subunit protein bL36~~~COG0257
MKVNPSVKPICDKCRVIRRHGRVMVICSDPRHKQRQG
>A5U8D7 ~~~rpmJ~~~Large ribosomal subunit protein bL36~~~COG0257
MKVNPSVKPICDKCRLIRRHGRVMVICSDPRHKQRQG
>P9WH89 ~~~rpmJ~~~Large ribosomal subunit protein bL36~~~COG0257
MKVNPSVKPICDKCRLIRRHGRVMVICSDPRHKQRQG
>Q6N253 ~~~rpmJ~~~Large ribosomal subunit protein bL36~~~COG0257
MKVRNSLKSLLTRHRENRLVRRKGRLYVINKTQRRFKARQG
>Q2FW29 ~~~rpmJ~~~Large ribosomal subunit protein bL36~~~COG0257
MKVRPSVKPICEKCKVIKRKGKVMVICENPKHKQRQG
>Q72I28 ~~~rpmJ~~~Large ribosomal subunit protein bL36~~~COG0257
MKVRASVKRICDKCKVIRRHGRVYVICENPKHKQRQG
>Q5SHR2 ~~~rpmJ~~~Large ribosomal subunit protein bL36~~~COG0257
MKVRASVKRICDKCKVIRRHGRVYVICENPKHKQRQG
>P80256 ~~~rpmJ~~~Large ribosomal subunit protein bL36~~~
MKVRASVKRICDKCKVIRRHGRVYVICENPKHKQRQG
>B7IA39 ~~~rplC~~~Large ribosomal subunit protein uL3~~~
MAIGLVGRKCGMTRIFTDAGVSVPVTVIEVDPNRITQIKTLETDGYQAVQVTTGERRESRVTNAQKGHFAKAGVAAGRLV
KEFRVTEAELEGREVGGTIGVDLFTVGQIVDVTGQSKGKGFQGGVKRWNFRTQDATHGNSVSHRVLGSTGQNQTPGRVFK
GKKMAGHLGDERVTVQGLEIVSVDTERSVLVVKGAIPGATGGDVIVRPTIKA
>P42920 ~~~rplC~~~Large ribosomal subunit protein uL3~~~COG0087
MTKGILGRKIGMTQVFAENGDLIPVTVIEAAPNVVLQKKTAENDGYEAIQLGFDDKREKLSNKPEKGHVAKAETAPKRFV
KELRGVEMDAYEVGQEVKVEIFSAGEIVDVTGVSKGKGFQGAIKRHGQSRGPMSHGSRYHRRPGSMGPVDPNRVFKGKLL
PGRMGGEQITVQNLEIVKVDAERNLLLIKGNVPGAKKSLITVKSAVKSK
>Q9RXK2 ~~~rplC~~~Large ribosomal subunit protein uL3~~~COG0087
MKGILGTKIGMTQIWKNDRAIPVTVVLAGPCPIVQRKTAQTDGYEAVQIGYAPKAERKVNKPMQGHFAKAGVAPTRILRE
FRGFAPDGDSVNVDIFAEGEKIDATGTSKGKGTQGVMKRWNFAGGPASHGSKKWHRRPGSIGQRKTPGRVYKGKRMAGHM
GMERVTVQNLEVVEIRAGENLILVKGAIPGANGGLVVLRSAAKASAAKGGK
>P60438 ~~~rplC~~~Large ribosomal subunit protein uL3~~~COG0087
MIGLVGKKVGMTRIFTEDGVSIPVTVIEVEANRVTQVKDLANDGYRAIQVTTGAKKANRVTKPEAGHFAKAGVEAGRGLW
EFRLAEGEEFTVGQSISVELFADVKKVDVTGTSKGKGFAGTVKRWNFRTQDATHGNSLSHRVPGSIGQNQTPGKVFKGKK
MAGQMGNERVTVQSLDVVRVDAERNLLLVKGAVPGATGSDLIVKPAVKA
>Q839G4 ~~~rplC~~~Large ribosomal subunit protein uL3~~~COG0087
MTKGILGKKVGMTQIFTESGELIPVTVVEATPNVVLQVKTVETDGYEAIQVGYQDKREVLSNKPAKGHVAKANTAPKRFI
KEFKNVELGEYEVGKEIKVDVFQAGDVVDVTGTTKGKGFQGAIKRHGQSRGPMSHGSRYHRRPGSMGPVAPNRVFKNKRL
AGRMGGDRVTIQNLEVVKVDVERNVILIKGNIPGAKKSLITIKSAVKAK
>P28600 ~~~rplC~~~Large ribosomal subunit protein uL3~~~
MTKGILGRKIGMTQIFAENGDLIPVTVIHATPNVVLQKKTIENDGYEAIQLGFEDISEKRANKPQIGHAAKANTAPKRFI
REIRGANINEYEVGQEVKVDIFSEGDIVDVTGISKGKGFQGAIKRHGQSRGPMAHGSRYHRRPGSMGAIAPNRVFKTKNL
PGRMGGERVTIQNLKIVKVDPERNLLLIKGNVPGPRKGLVIVKSAVKAKAKAK
>A2RNQ5 ~~~rplC~~~Large ribosomal subunit protein uL3~~~COG0087
MSKGILGKKVGMTQIFTDNGELIPVTVIEATPNTVLQVKSVETDGYEATQVGFDTLREVLTNKPAKGHAAKANTTPKRFV
REFKGLEGAEVGAEITVDTFAAGDVVDVTGTSKGKGFQGPIKRHGQSRGPMAHGSRYHRRPGSMGPVAANKVPKGKKLAG
RMGNKRVTVQNLVIAQVLPEKNVILVKGNVPGAKKSLIVVKSAIKAK
>Q8Y440 ~~~rplC~~~Large ribosomal subunit protein uL3~~~COG0087
MTKGILGRKVGMTQVFTENGELIPVTVIEAAQNVVLQKKTVETDGYEAVQIGFEDKRAILSNKPEQGHVAKANTTPKRFI
REFRDVNLDEYEIGAEVKVDVFAEGDIIDATGVSKGKGFQGVIKRHGQSRGPMAHGSRYHRRPGSMGPVAPNRVFKNKLL
PGRMGGEQITIQNLEIVKVDVEKNVLLVKGNVPGAKKALVQIKTATKAK
>P75580 ~~~rplC~~~Large ribosomal subunit protein uL3~~~
MEIRGIFGVKVGMSQVFTTNNERLPITVIYCEPNQVAGVKTEAKDKYSATLLSFDTVENKKLNKPQQGFFEKNNLKPTKH
LQEIRNMTGFEMGQQITPQNLFQVGEYVDVSAISKGRGFTGAIKRWNFKIGPLGHGAGYPHRFQGSVQAGRGGASAQRVF
KGKKMSGHYGHEKVTVQNLRIVGFDEANMLVLVSGAIAGPEGGVVLIRTAKKKPGVVKPIELAVQTEKAPEAKPAKLSKK
KQAKELAKAQAANQQTVEAKVDTPVVEPKPTEVKKAAPVVEKKGEDK
>A0QSD1 ~~~rplC~~~Large ribosomal subunit protein uL3~~~COG0087
MARKGILGTKLGMTQVFDENNKVVPVTVVKAGPNVVTRIRTTERDGYSAVQLAYGEISPRKVIKPVAGQFAAAGVNPRRH
VAELRLDDEAAVAEYEVGQELTAEIFSDGAYVDVTGTSKGKGFAGTMKRHGFRGQGAAHGAQAVHRRPGSIGGCATPGRV
FKGTRMSGRMGNDRVTTQNLKVHKVDAENGVLLIKGAIPGRNGGLVVVRSAIKRGEK
>A5U087 ~~~rplC~~~Large ribosomal subunit protein uL3~~~COG0087
MARKGILGTKLGMTQVFDESNRVVPVTVVKAGPNVVTRIRTPERDGYSAVQLAYGEISPRKVNKPLTGQYTAAGVNPRRY
LAELRLDDSDAATEYQVGQELTAEIFADGSYVDVTGTSKGKGFAGTMKRHGFRGQGASHGAQAVHRRPGSIGGCATPARV
FKGTRMAGRMGNDRVTVLNLLVHKVDAENGVLLIKGAVPGRTGGLVMVRSAIKRGEK
>P9WH87 ~~~rplC~~~Large ribosomal subunit protein uL3~~~COG0087
MARKGILGTKLGMTQVFDESNRVVPVTVVKAGPNVVTRIRTPERDGYSAVQLAYGEISPRKVNKPLTGQYTAAGVNPRRY
LAELRLDDSDAATEYQVGQELTAEIFADGSYVDVTGTSKGKGFAGTMKRHGFRGQGASHGAQAVHRRPGSIGGCATPARV
FKGTRMAGRMGNDRVTVLNLLVHKVDAENGVLLIKGAVPGRTGGLVMVRSAIKRGEK
>Q9HWD5 ~~~rplC~~~Large ribosomal subunit protein uL3~~~
MTIGVVGRKCGMTRIFTEEGVSIPVTVIEVEPNRVTQFKTEETDGYRAVQVTAGERRASRVTKAQAGHFAKANVAAGRGV
WEFRLGEEQYAAGDQITVDLFQAGQMVDVTGESKGKGFAGTIKRWNFRGQDNTHGNSVSHRVPGSIGQCQTPGRVFKGKK
MSGHLGAERVTVQSLEIVRVDAERNLLLVKGAVPGATGGDVIVRPAAKARG
>P60456 ~~~rplC~~~Large ribosomal subunit protein uL3~~~COG0087
MRSGVIAQKVGMTRVFTEAGEHIPVTVLKLGNCQVLGHRTKEKNGYVALQVGSGSRKTVYMPKAERGQFAAAKVEPKRKV
EEFRVSEDALLPVGAEIQADHFVVGQFVDVTGTSTGKGFAGGMKRWNFGGLRATHGVSVSHRSIGSTGGRQDPGKTFKNK
KMPGHMGVDRVTTLNLRVVQTDVERGLILVEGAVPGTKGGWIRVRDAVKKALPADAPKPGKFRLANGDAAAEAPAAEQEG
A
>Q2FW06 ~~~rplC~~~Large ribosomal subunit protein uL3~~~COG0087
MTKGILGRKIGMTQVFGENGELIPVTVVEAKENVVLQKKTVEVDGYNAIQVGFEDKKAYKKDAKSNKYANKPAEGHAKKA
DAAPKRFIREFRNVDVDAYEVGQEVSVDTFVAGDVIDVTGVSKGKGFQGAIKRHGQSRGPMSHGSHFHRAPGSVGMASDA
SRVFKGQKMPGRMGGNTVTVQNLEVVQVDTENKVILVKGNVPGPKKGLVEIRTSIKKGNK
>Q2YYP6 ~~~rplC~~~Large ribosomal subunit protein uL3~~~
MTKGILGRKIGMTQVFGENGELIPVTVVEAKENVVLQKKTVEVDGYNAIQVGFEDKKAYKKDAKSNKYANKPAEGHAKKA
DAAPKRFIREFRNVDVDAYEVGQEVSVDTFVAGDVIDVTGVSKGKGFQGAIKRHGQSRGPMSHGSHFHRAPGSVGMASDA
SRVFKGQKMPGRMGGNTVTVQNLEVVQVDTENKVILVKGNVPGPKKGLVEIRTSIKKGNK
>P60449 ~~~rplC~~~Large ribosomal subunit protein uL3~~~
MTKGILGRKIGMTQVFGENGELIPVTVVEAKENVVLQKKTVEVDGYNAIQVGFEDKKAYKKDAKSNKYANKPAEGHAKKA
DAAPKRFIREFRNVDVDAYEVGQEVSVDTFVAGDVIDVTGVSKGKGFQGAIKRHGQSRGPMSHGSHFHRAPGSVGMASDA
SRVFKGQKMPGRMGGNTVTVQNLEVVQVDTENKVILVKGNVPGPKKGLVEIRTSIKKGNK
>Q72I04 ~~~rplC~~~Large ribosomal subunit protein uL3~~~COG0087
MKGILGVKVGMTRIFRDDRAVPVTVILAGPCPVVQRRTPEKDGYTAVQLGFLPQNPKRVNRPLKGHFAKAGVEPVRILRE
IRDFNPEGDTVTVEIFKPGERVDVTGTSKGRGFAGVMKRWNFAGGPDSHGAHKIHRHPGSIGNRKTPGRVYKGKKMAGHY
GAERVTVMNLEVVDVIPEENLLLVKGAVPGPNGGLVIVRETKKAAK
>Q5SHN8 ~~~rplC~~~Large ribosomal subunit protein uL3~~~COG0087
MKGILGVKVGMTRIFRDDRAVPVTVILAGPCPVVQRRTPEKDGYTAVQLGFLPQNPKRVNRPLKGHFAKAGVEPVRILRE
IRDFNPEGDTVTVEIFKPGERVDVTGTSKGRGFAGVMKRWNFAGGPDSHGAHKIHRHPGSIGNRKTPGRVYKGKKMAGHY
GAERVTVMNLEVVDVIPEENLLLVKGAVPGPNGGLVIVRETKKAAK
>P52860 ~~~rplC~~~Large ribosomal subunit protein uL3~~~
MKGILGVKVGMTRIFRDDRAVPVTVILAGPCPVVQRRTPEKDGYTAVQLGFLPQNPKRVNRPLKGHFAKAGVEPVRILRE
IRDFNPEGDTVTVEIFKPGERVDVTGTSKGRGFAGVMKRWNFAGGPDSHGAHKIHRHPGSIGNRKTPGRVYKGKKMAGHY
GAERVTVMNLEVVDVIPEENLLLVKGAVPGPNGGLVIVRETKKAAK
>B7IA38 ~~~rplD~~~Large ribosomal subunit protein uL4~~~
MNLKTVSGSAVELSEVAFGREFNEALVHQVVTAYLAGGRQGTRAHKSRADVSGGGKKPFRQKGTGRARAGSIRSPIWVGG
GKTFAARPQDWSQKVNRKMYRGAMQCILAELVRQDRLVLVEEFAVAAPKTKELLAKLNDLNAARALIVTDAVDENLYLAA
RNLPHVDVVDATAIDPVSLIAFDKVVMSVAAAKKIEVELG
>P42921 ~~~rplD~~~Large ribosomal subunit protein uL4~~~COG0088
MPKVALYNQNGSTAGDIELNASVFGIEPNESVVFDAILMQRASLRQGTHKVKNRSEVRGGGRKPWRQKGTGRARQGSIRS
PQWRGGGVVFGPTPRSYSYKLPKKVRRLAIKSVLSSKVIDNNIIVLEDLTLDTAKTKEMAAILKGLSVEKKALIVTADAN
EAVALSARNIPGVTVVEANGINVLDVVNHEKLLITKAAVEKVEEVLA
>Q9RXK1 ~~~rplD~~~Large ribosomal subunit protein uL4~~~COG0088
MAQINVIGQNGGRTIELPLPEVNSGVLHEVVTWQLASRRRGTASTRTRAQVSKTGRKMYGQKGTGNARHGDRSVPTFVGG
GVAFGPKPRSYDYTLPRQVRQLGLAMAIASRQEGGKLVAVDGFDIADAKTKNFISWAKQNGLDGTEKVLLVTDDENTRRA
ARNVSWVSVLPVAGVNVYDILRHDRLVIDAAALEIVEEEAGEEQQ
>P60723 ~~~rplD~~~Large ribosomal subunit protein uL4~~~COG0088
MELVLKDAQSALTVSETTFGRDFNEALVHQVVVAYAAGARQGTRAQKTRAEVTGSGKKPWRQKGTGRARSGSIKSPIWRS
GGVTFAARPQDHSQKVNKKMYRGALKSILSELVRQDRLIVVEKFSVEAPKTKLLAQKLKDMALEDVLIITGELDENLFLA
ARNLHKVDVRDATGIDPVSLIAFDKVVMTADAVKQVEEMLA
>Q839G3 ~~~rplD~~~Large ribosomal subunit protein uL4~~~COG0088
MPNVALFKQDGTQNGEITLNEEIFGIEPNESVVYDAIIMQRASLRQGTHAVKNRSAVRGGGRKPWRQKGTGRARQGSIRS
PQWRGGGVVFGPTPRSYSYKFPKKVRRLAMKSVLSDKVAENNLVAVEGLSFDAPKTKEFKQVLANLSIDTKVLVVLENGN
DFAALSARNLPNVSVVTSDNVSVLDVVSANKVLATQTALTQIEEVLA
>P28601 ~~~rplD~~~Large ribosomal subunit protein uL4~~~
MPKVALYNQNGQTVGEIELNDAVFGIEPNKHVLFEAVIMQRASMRQGTHKTKNRAEVSGGGRKPWRQKGTGRARQGSIRA
PQWRGGGTVFGPVPRSYSYKLPKKVRRLAIKSALSSKVLENDIVVLDQLSLEAPKTKEMVKILNNLSVDRKALIVTDELN
ENVYLSARNIPGVKVVPANGINVLDVLNHDKLVITKAAVEKVEEVLA
>A2RNQ4 ~~~rplD~~~Large ribosomal subunit protein uL4~~~COG0088
MAKVSLFKQDGSQAGEVTLNDSVFGIEPNESVVFDVVISQRASLRQGTHAHKNRSAVSGGGKKPWRQKGTGRARQGSTRS
PQWRGGGTVFGPNPRSYAYKLPQKVRQLALKSVYSTKVTDGKLIAVDTLDFAAPKTAEFAKVISALSIERKVLVVLPNEG
NEFAELSARNLENVKVTTANSASVLDIVSADKLLVVQSALTQIEEVLA
>P61055 ~~~rplD~~~Large ribosomal subunit protein uL4~~~COG0088
MPKLSLLKQDGTNAGEITLNDTVFGIEPNEKVVVDVILSQRASLRQGTHKVKNRSEVRGGGRKPWRQKGTGRARQGSIRS
PQWRGGGVVFGPTPRSYAYKLPKKVRRLAIKSILSSKVNEEKLVVLEGLTFDAPKTKEFAAFLKNISVDTKALIVVAGES
ENVELSARNLQGITVIPAESISVLEVAKHDKLIITKAAVEKVEEVLA
>P75579 ~~~rplD~~~Large ribosomal subunit protein uL4~~~
MAKLKLIKIDGSFETEPVKLSPGLIAKELKQQPVFDAVLVEQASWRQGTHSILTKGEVRGGGKKPYKQKHTGKARQGSTR
NPHFVGGGIVFGPKPNRNYSLKLNKKAHTAALHTVWSEKLASDNTHLVDQNLFNKTEGKTKVMMQFLKSAKLLDKNVLFV
VNTLNTNLEQSTSNIKNVQVKHLDKVSVRDLMLANALLVEKEVLKALEGKFK
>A0QSD2 ~~~rplD~~~Large ribosomal subunit protein uL4~~~COG0088
MTLKVDVKTPAGKTDGSVELPAELFDVEPNIALMHQVVTAQLAAKRQGTHSTKTRGEVSGGGKKPYRQKGTGRARQGSTR
APQFTGGGTVHGPKPRDYSQRTPKKMIAAALRGALSDRARNDRIHAVTELVEGQTPSTKSAKTFLGTLTENKKVLVVIGR
TDEVGAKSVRNLPGVHVISPDQLNTYDVLNADDVVFSVEALNAYISANSKEGASV
>A5U088 ~~~rplD~~~Large ribosomal subunit protein uL4~~~COG0088
MAAQEQKTLKIDVKTPAGKVDGAIELPAELFDVPANIALMHQVVTAQRAAARQGTHSTKTRGEVSGGGRKPYRQKGTGRA
RQGSTRAPQFTGGGVVHGPKPRDYSQRTPKKMIAAALRGALSDRARNGRIHAITELVEGQNPSTKSARAFLASLTERKQV
LVVIGRSDEAGAKSVRNLPGVHILAPDQLNTYDVLRADDVVFSVEALNAYIAANTTTSEEVSA
>P9WH85 ~~~rplD~~~Large ribosomal subunit protein uL4~~~COG0088
MAAQEQKTLKIDVKTPAGKVDGAIELPAELFDVPANIALMHQVVTAQRAAARQGTHSTKTRGEVSGGGRKPYRQKGTGRA
RQGSTRAPQFTGGGVVHGPKPRDYSQRTPKKMIAAALRGALSDRARNGRIHAITELVEGQNPSTKSARAFLASLTERKQV
LVVIGRSDEAGAKSVRNLPGVHILAPDQLNTYDVLRADDVVFSVEALNAYIAANTTTSEEVSA
>Q9HWD6 ~~~rplD~~~Large ribosomal subunit protein uL4~~~
MQLNVNGAQAIEVSERTFGGEFNETLVHQAVVAYMAGGRQGSKAQKTRSEVSGGGKKPWRQKGTGRARAGTIRSPIWRGG
GTTFAAKPRSHEQKLNKKMYRAALRSILAELVRLDRLVVVADFAVDAPKTKGLVAKLDTLGLKDVLIVTDGVDENLYLAA
RNLAHVDVRDVQGSDPVSLIAYDKVLVTVSAVKKFEELLG
>P61068 ~~~rplD~~~Large ribosomal subunit protein uL4~~~COG0088
MELKVTTLEGKEAGSVQLSDEIFGLEPRSDIIQRCVIWQLAKRQAGTHKAKGRAEVWRTGKKMYKQKGTGGARHGSQRVP
QFRGGGRAFGPVVRSHAIDLPKKVRVLALRHALSAKAKGGGLIVLDKAELEAAKTKTLVGHFSGLGLESALIIDGAEVNN
GFAAAARNIPNIDVLPVQGINVYDILRRKKLVLTKAAVDALEARFK
>P60726 ~~~rplD~~~Large ribosomal subunit protein uL4~~~
MELVLKDAQSALTVSETTFGRDFNEALVHQVVVAYAAGARQGTRAQKTRAEVTGSGKKPWRQKGTGRARSGSIKSPIWRS
GGVTFAARPQDHSQKVNKKMYRGALKSILSELVRQDRLIVVEKFSVEAPKTKLLAQKLKDMALEDVLIITGELDENLFLA
ARNLHKVDVRDATGIDPVSLIAFDKVVMTADAVKQVEEMLA
>Q2FW07 ~~~rplD~~~Large ribosomal subunit protein uL4~~~COG0088
MANYDVLKLDGTKSGSIELSDAVFGIEPNNSVLFEAINLQRASLRQGTHAVKNRSAVSGGGRKPWKQKGTGRARQGTIRA
PQWRGGGIVFGPTPRSYAYKMPKKMRRLALRSALSFKAQENGLTVVDAFNFEAPKTKEFKNVLSTLEQPKKVLVVTENED
VNVELSARNIPGVQVTTAQGLNVLDITNADSLVITEAAAKKVEEVLG
>Q2YYP7 ~~~rplD~~~Large ribosomal subunit protein uL4~~~
MANYDVLKLDGTKSGSIELSDAVFGIEPNNSVLFEAINLQRASLRQGTHAVKNRSAVSGGGRKPWKQKGTGRARQGTIRA
PQWRGGGIVFGPTPRSYAYKMPKKMRRLALRSALSFKAQENGLTVVDAFNFEAPKTKEFKNVLSTLEQPKKVLVVTENED
VNVELSARNIPGVQVTTAQGLNVLDITNADSLVITEAAAKKVEEVLG
>P61059 ~~~rplD~~~Large ribosomal subunit protein uL4~~~
MANYDVLKLDGTKSGSIELSDAVFGIEPNNSVLFEAINLQRASLRQGTHAVKNRSAVSGGGRKPWKQKGTGRARQGTIRA
PQWRGGGIVFGPTPRSYAYKMPKKMRRLALRSALSFKAQENGLTVVDAFNFEAPKTKEFKNVLSTLEQPKKVLVVTENED
VNVELSARNIPGVQVTTAQGLNVLDITNADSLVITEAAAKKVEEVLG
>P38516 ~~~rplD~~~Large ribosomal subunit protein uL4~~~COG0088
MAQVDLLNVKGEKVGTLEISDFVFNIDPNYDVMWRYVDMQLSNRRAGTASTKTRGEVSGGGRKPWPQKHTGRARHGSIRS
PIWRHGGVVHGPKPRDWSKKLNKKMKKLALRSALSVKYRENKLLVLDDLKLERPKTKSLKEILQNLQLSDKKTLIVLPWK
EEGYMNVKLSGRNLPDVKVIIADNPNNSKNGEKAVRIDGLNVFDMLKYDYLVLTRDMVSKIEEVLGNEAGKALTA
>Q72I05 ~~~rplD~~~Large ribosomal subunit protein uL4~~~COG0088
MYQIPVLSPSGRRELAADLPAEINPHLLWEVVRWQLAKRRRGTASTKTRGEVAYSGRKIWPQKHTGRARHGDIGAPIFVG
GGVVFGPKPRDYSYTLPKKVRKKGLAMAVADRAREGKLLLVEAFAGVNGKTKEFLAWAKEAGLDGSESVLLVTGNELVRR
AARNLPWVVTLAPEGLNVYDIVRTERLVMDLDAWEVFQNRIGGEA
>Q5SHN9 ~~~rplD~~~Large ribosomal subunit protein uL4~~~COG0088
MKEVAVYQIPVLSPSGRRELAADLPAEINPHLLWEVVRWQLAKRRRGTASTKTRGEVAYSGRKIWPQKHTGRARHGDIGA
PIFVGGGVVFGPKPRDYSYTLPKKVRKKGLAMAVADRAREGKLLLVEAFAGVNGKTKEFLAWAKEAGLDGSESVLLVTGN
ELVRRAARNLPWVVTLAPEGLNVYDIVRTERLVMDLDAWEVFQNRIGGEA
>B7IA27 ~~~rplE~~~Large ribosomal subunit protein uL5~~~
MARLKARYNDELKAKLQEELSIKNVMEIPRITKITLNMGVGAAATDKKLLDGAVADMQLIAGQKPVVTLARKSIAGFKIR
DGWPIGCKVTLRGDQMYEFLDRLISIAIPRIRDFRGFSAKSFDGRGNYSMGLKEQIVFPEIDFDKIDRIRGMDITITTTA
RTDDEGRALMRAFGFPFK
>P12877 ~~~rplE~~~Large ribosomal subunit protein uL5~~~COG0094
MNRLKEKYNKEIAPALMTKFNYDSVMQVPKIEKIVINMGVGDAVQNAKAIDSAVEELTFIAGQKPVVTRAKKSIAGFRLR
EGMPIGAKVTLRGERMYDFLDKLISVSLPRVRDFRGVSKKSFDGRGNYTLGIKEQLIFPEIDYDKVTKVRGMDIVIVTTA
NTDEEARELLTQVGMPFQK
>Q9RXJ0 ~~~rplE~~~Large ribosomal subunit protein uL5~~~COG0094
MQQLKTKYNDQVRPALMQQFGYSSVMAVPRIEKIVVNEGLGSSKEDSKAIDKAAKELALITLQKPIITKAKKSISNFKLR
QGMPVGIKVTLRGERMYVFLEKLINIGLPRIRDFRGINPNAFDGRGNYNLGIKEQLIFPEITYDMVDKTRGMDITIVTTA
KTDEEARALLQSMGLPFRKQ
>P62399 ~~~rplE~~~Large ribosomal subunit protein uL5~~~COG0094
MAKLHDYYKDEVVKKLMTEFNYNSVMQVPRVEKITLNMGVGEAIADKKLLDNAAADLAAISGQKPLITKARKSVAGFKIR
QGYPIGCKVTLRGERMWEFFERLITIAVPRIRDFRGLSAKSFDGRGNYSMGVREQIIFPEIDYDKVDRVRGLDITITTTA
KSDEEGRALLAAFDFPFRK
>Q839F2 ~~~rplE~~~Large ribosomal subunit protein uL5~~~COG0094
MNRLKEKYIKEVTPSLVEKFNYSSVMQTPKVDKIVINMGVGDAVSNAKNLDKAVEELALITGQKPLITKAKKSIAGFRLR
EGMPIGAKVTLRGERMYEFLDKLVTVSLPRVRDFHGVSKKAFDGRGNYTLGIKEQLIFPEVDYDLVDKVRGMDIVIVTTA
NTDEESRELLAQLGMPFQK
>P08895 ~~~rplE~~~Large ribosomal subunit protein uL5~~~
MNRLKEKYVKEVVPALMSKFNYKSIMQVPKIEKIVINMGVGDAVQNPKALDSAVEELTLIAGQRPVVTRAKKSIAGFRLR
QGMPIGAKVTLRGERMYEFLDKLISVSLPRVRDFRGVSKKAFDGRGNYTLGIKEQLIFPEIDYDKVNKVRGMDIVIVTTA
NTDEEARELLALLGMPFQK
>A2RNP3 ~~~rplE~~~Large ribosomal subunit protein uL5~~~COG0094
MTNRLKEKYTNEVVPALTEQFNYTSIMAVPKVDKIVINMGVGDAVNNSKNLDKAVAELALISGQKPLITKAKKSVAAFRL
REGMPIGAKVTLRGERMFEFLDKLVTVSLPRVRDFHGVSNKAFDGRGNYTLGVKEQLIFPEINYDDVDKVRGMDIVIVTT
ANTDEESRELLAKLGMPFAK
>Q927L9 ~~~rplE~~~Large ribosomal subunit protein uL5~~~COG0094
MNRLKDQYLKEIVPALMSKFNYDSVMEVPKIDKIVINTGVGDATANAKVLDSAVEELALITGQKPVITKAKNSIAGFRLR
EGMPIGAKVTLRGERMYDFLDKLVTVSLPRVRDFRGVSKKAFDGRGNYTLGVREQLIFPEIDYDQVSKVRGMDVVIVTTA
KSDEESHELLTQLGMPFQK
>Q50306 ~~~rplE~~~Large ribosomal subunit protein uL5~~~
MNNLKAHYQKTIAKELQKSFAFSSIMQVPRLEKIVINMGVGDAIRDSKFLESALNELHLISGQKPVATKAKNAISTYKLR
AGQLIGCKVTLRGERMWAFLEKLIYVALPRVRDFRGLSLKSFDGRGNYTIGIKEQIIFPEIVYDDIKRIRGFDVTLVTST
NKDSEALALLRALNLPLVKG
>A0QSG1 ~~~rplE~~~Large ribosomal subunit protein uL5~~~COG0094
MTTTEKALPRLKQRYREEIREALQQEFNYANVMQIPGVVKVVVNMGVGDAARDAKLINGAINDLALITGQKPEVRRARKS
IAQFKLREGMPIGARVTLRGDRMWEFLDRLISIALPRIRDFRGLSPKQFDGTGNYTFGLNEQSMFHEIDVDSIDRPRGMD
ITVVTTATNDAEGRALLRALGFPFKEN
>A5U0A2 ~~~rplE~~~Large ribosomal subunit protein uL5~~~COG0094
MTTAQKVQPRLKERYRSEIRDALRKQFGYGNVMQIPTVTKVVVNMGVGEAARDAKLINGAVNDLALITGQKPEVRRARKS
IAQFKLREGMPVGVRVTLRGDRMWEFLDRLTSIALPRIRDFRGLSPKQFDGVGNYTFGLAEQAVFHEVDVDKIDRVRGMD
INVVTSAATDDEGRALLRALGFPFKEN
>P9WH83 ~~~rplE~~~Large ribosomal subunit protein uL5~~~COG0094
MTTAQKVQPRLKERYRSEIRDALRKQFGYGNVMQIPTVTKVVVNMGVGEAARDAKLINGAVNDLALITGQKPEVRRARKS
IAQFKLREGMPVGVRVTLRGDRMWEFLDRLTSIALPRIRDFRGLSPKQFDGVGNYTFGLAEQAVFHEVDVDKIDRVRGMD
INVVTSAATDDEGRALLRALGFPFKEN
>Q9HWE7 ~~~rplE~~~Large ribosomal subunit protein uL5~~~
MARLKEIYRKEIAPKLKEELQLANVMEVPRVTKITLNMGLGEAVGDKKIIENAVADLEKITGQKPVVTYARKSIAGFKIR
EGWPIGVKVTLRSDRMYEFLDRLLSISLPRVRDFRGLNAKSFDGRGNYSMGVKEQIIFPEIDYDKIDALRGLDITLTTTA
RTDDEGRALLRAFKFPFRN
>Q6N4U5 ~~~rplE~~~Large ribosomal subunit protein uL5~~~COG0094
MAETAYVPRLRTEYDRHIRTQLTEKFGYANVMQVPKLDKVVLNMGVGEAVNDRKKAEQAAADLSLIAGQKAVITYSRVAI
STFKLRENQPIGCKVTLRQARMYEFIDRLITVALPRVRDFRGLNPKSFDGRGNYSLGIKEHIIFPEIDFDKTGESWGMDI
TVCTTARTDDEARALLTAFNFPFRQ
>Q2FW18 ~~~rplE~~~Large ribosomal subunit protein uL5~~~COG0094
MNRLKEKFNTEVTENLMKKFNYSSVMEVPKIDKIVVNMGVGDAVQNSKVLDNAVEELELITGQKPLVTKAKKSIATFRLR
EGMPIGAKVTLRGERMYEFLDKLISVSLPRVRDFQGVSKKAFDGRGNYTLGVKEQLIFPEIDYDKVSKVRGMDIVIVTTA
NTDEEARELLANFGMPFRK
>Q2YYK9 ~~~rplE~~~Large ribosomal subunit protein uL5~~~
MNRLKEKFNTEVTENLMKKFNYSSVMEVPKIDKIVVNMGVGDAVQNSKVLDNAVEELELITGQKPLVTKAKKSIATFRLR
EGMPIGAKVTLRGERMYEFLDKLISVSLPRVRDFQGVSKKAFDGRGNYTLGVKEQLIFPEIDYDKVSKVRGMDIVIVTTA
NTDEEARELLANFGMPFRK
>Q7A465 ~~~rplE~~~Large ribosomal subunit protein uL5~~~
MNRLKEKFNTEVTENLMKKFNYSSVMEVPKIDKIVVNMGVGDAVQNSKVLDNAVEELELITGQKPLVTKAKKSIATFRLR
EGMPIGAKVTLRGERMYEFLDKLISVSLPRVRDFQGVSKKAFDGRGNYTLGVKEQLIFPEIDYDKVSKVRGMDIVIVTTA
NTDEEARELLANFGMPFRK
>Q72I16 ~~~rplE~~~Large ribosomal subunit protein uL5~~~COG0094
MPLDLALKRKYYEEVRPELIRRFGYQNVWEVPRLEKVVINQGLGEAKEDARILEKAAQELALITGQKPAVTRAKKSISNF
KLRKGMPIGLRVTLRRDRMWIFLEKLLNVALPRIRDFRGLNPNSFDGRGNYNLGLREQLIFPEITYDMVDALRGMDIAVV
TTAETDEEARALLELLGFPFRK
>Q5SHQ0 ~~~rplE~~~Large ribosomal subunit protein uL5~~~COG0094
MPLDVALKRKYYEEVRPELIRRFGYQNVWEVPRLEKVVINQGLGEAKEDARILEKAAQELALITGQKPAVTRAKKSISNF
KLRKGMPIGLRVTLRRDRMWIFLEKLLNVALPRIRDFRGLNPNSFDGRGNYNLGLREQLIFPEITYDMVDALRGMDIAVV
TTAETDEEARALLELLGFPFRK
>P41201 ~~~rplE~~~Large ribosomal subunit protein uL5~~~
MPLDVALKRKYYEEVRPELIRRFGYQNVWEVPRLEKVVINQGLGEAKEDARILEKAAQELALITGQKPAVTRAKKSISNF
KLRKGMPIGLRVTLRRDRMWIFLEKLLNVALPRIRDFRGLNPNSFDGRGNYNLGLREQLIFPEITYDMVDALRGMDIAVV
TTAETDEEARALLELLGFPFRK
>B7IA24 ~~~rplF~~~Large ribosomal subunit protein uL6~~~
MSRVAKAPVTVPNGVTVTQNGRQVEVKGSKGTLSFNLHALVELKQEEGKLQLAPAKESKDAWMQAGTARAVLNNLVKGVS
EGFERKLQLVGVGYKAAVKGTVVNLNLGYSHPIDYALPEGVTAETPTATEIILKSANKQLLGQVAAEIRAYRSPEPYKGK
GVRYSDEVILRKEAKKK
>P46898 ~~~rplF~~~Large ribosomal subunit protein uL6~~~COG0097
MSRVGKKLLEIPSDVTVTLNDNNTVAVKGPKGELTRTFHPDMEIKVEDNVLTVARPSDQKEHRALHGTTRSLLGNMVEGV
SKGFERGLELVGVGYRASKSGNKLVLNVGYSHPVEIVPEEGIEIEVPSQTKVVVKGTDKERVGAIAANIRAVRSPEPYKG
KGIRYEGEVVRRKEGKSAK
>Q9RSL3 ~~~rplF~~~Large ribosomal subunit protein uL6~~~COG0097
MSRIGKQPIAVPSGVTVNAQDGVFKVKGPKGELTVPYNTELTVRQDGDQLLVERPSDAQKHRALHGLTRTLVANAVKGVS
DGYTINLELRGVGFRAKLTGKALEMNIGYSHPVIIEPPAGVTFAVPEPTRIDVSGIDKQLVGQVAANVRKVRKPDAYHGK
GVRFVGEQIALKAGKAGATGGKGKK
>P0AG55 ~~~rplF~~~Large ribosomal subunit protein uL6~~~COG0097
MSRVAKAPVVVPAGVDVKINGQVITIKGKNGELTRTLNDAVEVKHADNTLTFGPRDGYADGWAQAGTARALLNSMVIGVT
EGFTKKLQLVGVGYRAAVKGNVINLSLGFSHPVDHQLPAGITAECPTQTEIVLKGADKQVIGQVAADLRAYRRPEPYKGK
GVRYADEVVRTKEAKKK
>Q839E9 ~~~rplF~~~Large ribosomal subunit protein uL6~~~COG0097
MSRIGNKVVVLPAGVEIKQDGNNITVKGPKGELTREFSSDIKMNIEGNEVTFIRPNDSKEMKTIHGTTRANFNNMVVGVS
EGFQKALELIGVGYRAQVQGNKLTLNVGYSHPVEMTAPEGVTFEVPANTQVIVKGINKEVVGELAANIRGVRPPEPYKGK
GIRYVGEFVRRKEGKTGK
>Q2YEI7 ~~~rplF~~~Large ribosomal subunit protein uL6~~~
MSRIGYKTIVIPEGVTVTRDEDVITVKGPKGELSETFSPDVAMNVNGNEINFEPTGKYDNKMCALHGTQRANLANMIEGV
EKGFNKTLKLVGVGYRTQLKGSDLILNVGYSNPVDVKVPEDIDVKVPDNTTIEIAGINKQHVGDFAAKVREIRSPEPYKG
KGIRYENEHITLREGKTGK
>Q5L408 ~~~rplF~~~Large ribosomal subunit protein uL6~~~COG0097
MSRVGKKPIEIPAGVTVTVNGNTVTVKGPKGELTRTFHPDMTITVEGNVITVTRPSDEKHHRALHGTTRSLLANMVEGVS
KGYEKALELVGVGYRASKQGKKLVLSVGYSHPVEIEPEEGLEIEVPSQTKIIVKGADKQRVGELAANIRAVRPPEPYKGK
GIRYEGELVRLKEGKTGK
>P02391 ~~~rplF~~~Large ribosomal subunit protein uL6~~~
MSRVGKKPIEIPAGVTVTVNGNTVTVKGPKGELTRTFHPDMTITVEGNVITVTRPSDEKHHRALHGTTRSLLANMVEGVS
KGYEKALELVGVGYRASKQGKKLVLSVGYSHPVEIEPEEGLEIEVPSQTKIIVKGADKQRVGELAANIRAVRPPEPYKGK
GIRYEGELVRLKEGKTGK
>A2RNN8 ~~~rplF~~~Large ribosomal subunit protein uL6~~~COG0097
MSRIGNKVIVIPAGVSVEVNGATVTVKGPKGELVRSFNENITLEIAENEITVKRPNDTKEMKMLHGTTRALLANMVEGVS
NGFSKALEMIGVGYRAQLQGTKLVLSVGKSHQDEVEAPENIKFVVATPTSIVVEGISKEAVGQTAAYIRSRRSPEPYKGK
GIRYVGEYVRRKEGKTGK
>Q8Y444 ~~~rplF~~~Large ribosomal subunit protein uL6~~~COG0097
MSRIGKKTIVIPAGVTVTLNGSTATVKGPKGELVKEFNPEITIKIEGNEINVSRPTDNKNHRALHGTTRAILNNMVVGVS
EGYEKKLELIGVGYRAQKQGDKLVLNVGYSHPVEFVAPKGVDIEVPANTQVIVKGYNKEHVGELAANIRAVRPPEPYKGK
GIRYEGEHVRRKEGKTGK
>C5CC48 ~~~rplF~~~Large ribosomal subunit protein uL6~~~COG0097
MSRIGRLPITIPAGVDVTIDGDRVSVKGPKGQLEHSLPTPITATLEEGQVTVARPDDERESRSLHGLTRTLISNMVEGVT
NGFSKQLEVVGTGYRVQAKGQDLEFALGYSHPVPVKAPQGITFTVEGNRVTVAGIDKQQVGETAANIRKLRRPDPYKGKG
VRYAGEQIRRKAGKAGK
>Q50303 ~~~rplF~~~Large ribosomal subunit protein uL6~~~
MSKIGNRTITLDPAKVNLNFQKDHIAVKGPLGQIELKLPPNLPLKFELKDNNLQITRNNELKQSKIFHGTYNALITNAII
GVTQGFEKKLRLVGVGYRANVEGETLNLQLGYSHPIKEKIPKGLTVKVEKNTEITISGISKELVGQFATEVRKWRKPEPY
KGKGVLYFDEVIVRKAGKTAEGKK
>A0QSG4 ~~~rplF~~~Large ribosomal subunit protein uL6~~~COG0097
MSRIGKQPVPVPSGVDVTINGQNLSVKGPKGTLTLDVAEPISVSRAEDGAIVVTRPDDERRSRSLHGLSRTLIANLVTGV
TEGYTQKMEIFGVGYRVQLKGQNLEFALGYSHPVLIEAPEGITFAVESPTKFSVSGIDKQKVGQISAVIRRLRRPDPYKG
KGVRYEGEQIRRKVGKTGK
>A5U0A5 ~~~rplF~~~Large ribosomal subunit protein uL6~~~COG0097
MSRIGKQPIPVPAGVDVTIEGQSISVKGPKGTLGLTVAEPIKVARNDDGAIVVTRPDDERRNRSLHGLSRTLVSNLVTGV
TQGYTTKMEIFGVGYRVQLKGSNLEFALGYSHPVVIEAPEGITFAVQAPTKFTVSGIDKQKVGQIAANIRRLRRPDPYKG
KGVRYEGEQIRRKVGKTGK
>P9WH81 ~~~rplF~~~Large ribosomal subunit protein uL6~~~COG0097
MSRIGKQPIPVPAGVDVTIEGQSISVKGPKGTLGLTVAEPIKVARNDDGAIVVTRPDDERRNRSLHGLSRTLVSNLVTGV
TQGYTTKMEIFGVGYRVQLKGSNLEFALGYSHPVVIEAPEGITFAVQAPTKFTVSGIDKQKVGQIAANIRRLRRPDPYKG
KGVRYEGEQIRRKVGKTGK
>Q9K1I3 ~~~rplF~~~Large ribosomal subunit protein uL6~~~
MSRVAKNPVTVPAGVEVKFGAEALVIKGKNGELSFPLHSDVAIEFNDGKLTFVANNSSKQANAMSGTARALVSNMVKGVS
EGFEKRLQLIGVGYRAQAQGKILNLSLGFSHPIVYEMPEGVSVQTPSQTEIVLTGSDKQVVGQVAAEIRAFRAPEPYKGK
GVRYVGEVVVMKEAKKK
>Q9HWF0 ~~~rplF~~~Large ribosomal subunit protein uL6~~~
MSRVAKNPVKLPAGVEIKLAGQQLSIKGAKGALELKVHPSVEVIQDSGELRFAARNGDQQTRAMAGTTRALVNNMVVGVS
QGFERKLQLVGVGYKAQAKGQVLSLSLGFSHPVDYELPAGIVAETPSQTDILIKGIDKQLVGQVAAEIRDFRPPEPYKGK
GVRYADEVVRRKEAKKK
>Q6N4U8 ~~~rplF~~~Large ribosomal subunit protein uL6~~~COG0097
MSRVGKKPVTVPSGVTATVEGQTVKMKGPKGQLQFVVHDDVDVKFEDGAVKVAPRHETNRARALYGTARAQIANLVEGVT
KGFEKKLEITGVGYRAAMQGKKLQLALGYSHDVLYDIPEGITITVPKPTEINVVGIDPQKVGQVAAEIRDYRPPEPYKGK
GVRYADEFIFRKEGKKK
>Q2FW21 ~~~rplF~~~Large ribosomal subunit protein uL6~~~COG0097
MSRVGKKIIDIPSDVTVTFDGNHVTVKGPKGELSRTLNERMTFKQEENTIEVVRPSDSKEDRTNHGTTRALLNNMVQGVS
QGYVKVLELVGVGYRAQMQGKDLILNVGYSHPVEIKAEENITFSVEKNTVVKVEGISKEQVGALASNIRSVRPPEPYKGK
GIRYQGEYVRRKEGKTGK
>Q2YYL2 ~~~rplF~~~Large ribosomal subunit protein uL6~~~
MSRVGKKIIDIPSDVTVTFDGNHVTVKGPKGELSRTLNERMTFKQEENTIEVVRPSDSKEDRTNHGTTRALLNNMVQGVS
QGYVKVLELVGVGYRAQMQGKDLILNVGYSHPVEIKAEENITFSVEKNTVVKVEGISKEQVGALASNIRSVRPPEPYKGK
GIRYQGEYVRRKEGKTGK
>Q7A466 ~~~rplF~~~Large ribosomal subunit protein uL6~~~
MSRVGKKIIDIPSDVTVTFDGNHVTVKGPKGELSRTLNERMTFKQEENTIEVVRPSDSKEDRTNHGTTRALLNNMVQGVS
QGYVKVLELVGVGYRAQMQGKDLILNVGYSHPVEIKAEENITFSVEKNTVVKVEGISKEQVGALASNIRSVRPPEPYKGK
GIRYQGEYVRRKEGKTGK
>Q72I19 ~~~rplF~~~Large ribosomal subunit protein uL6~~~COG0097
MSRIGRLPIPVPKGVSVEVAPGRVKVKGPKGELEVPVSPEMRVVVEEGVVRVERPSDERRHKSLHGLTRTLIANAVKGVS
EGYSKELLIKGIGYRARLVGRALELTVGFSHPVVVEPPEGITFEVPEPTRVRVSGIDKQKVGQVAANIRAIRKPSAYHEK
GIYYAGEPVRLKPGKAGAKK
>P0DOY8 ~~~rplF~~~Large ribosomal subunit protein uL6~~~COG0097
MSRIGRLPIPVPKGVSVEVAPGRVKVKGPKGELEVPVSPEMRVVVEEGVVRVERPSDERRHKSLHGLTRTLIANAVKGVS
EGYSKELLIKGIGYRARLVGRALELTVGFSHPVVVEPPEGITFEVPEPTRVRVSGIDKQKVGQVAANIRAIRKPSAYHEK
GIYYAGEPVRLKPGKAGAKK
>P24316 ~~~rplF~~~Large ribosomal subunit protein uL6~~~
MSRIGRLPIPVPKGVSVEVAPGRVKVKGPKGELEVPVSPEMRVVVEEGVVRVERPSDERRHKSLHGLTRTLIANAVKGVS
EGYSKELLIKGIGYRARLVGRALELTVGFSHPVVVEPPEGITFEVPEPTRVRVSGIDKQKVGQVAANIRAIRKPSAYHEK
GIYYAGEPVRLKPGKAGAKK
>Q8UE07 ~~~rplL~~~Large ribosomal subunit protein bL12~~~COG0222
MADLAKIVEDLSTLTVLEAAELSKLLEEKWGVSAAAPVAVAAAGGAGAAAAVEEEKTEFDVVLVDAGANKINVIKEVRAI
TGLGLKEAKDLVEGAPKAVKEAVSKAEAADLKKKLEDAGAKVDVK
>P02394 ~~~rplL~~~Large ribosomal subunit protein bL12~~~COG0222
MALNIEEIIASVKEATVLELNDLVKAIEEEFGVTAAAPVAVAGGAAAGGAAEEQSEFDLILAGAGSQKIKVIKVVREITG
LGLKEAKELVDNTPKPLKEGIAKEEAEELKAKLEEVGASVEVK
>P02397 ~~~rplL~~~Large ribosomal subunit protein bL12~~~
MADLNKLAEDIVGLTLLEAQELKTILKDKYGIEPAAGGAVMMAGPAAGAAAPAEEEKTEFDVGLTDAGANKINVIKEVRA
ITGLGLKEAKDLVEAGGKVKEAVAKADAEAMKKKLEEAGAKVELK
>B0B7N2 ~~~rplL~~~Large ribosomal subunit protein bL12~~~
MTTESLETLVEQLSGLTVLELSQLKKLLEEKWDVTAAAPVVAVAGAAAAGDAPASAEPTEFAVILEDVPSDKKIGVLKVV
REVTGLALKEAKEMTEGLPKTVKEKTSKSDAEDTVKKLQEAGAKAVAKGL
>P0C8S3 ~~~rplL~~~Large ribosomal subunit protein bL12~~~COG0222
MAQLSKDDILEAVANMSVMDVVDLVKAMEEKFGVSAQAAIAVAGPVAGGEAAAAEEKTEFNVKMVSFGDNKIGVIKAIRT
ITGLGLKEAKDLVESVPSVVKESVSKEEAEKIKKELEEAGAKVELE
>Q9RST0 ~~~rplL~~~Large ribosomal subunit protein bL12~~~COG0222
MAYDKQALIDQLGQLTIMELADLIDGLKETWGVTAAVAVSGGGAGAASPAAEEKTEFDVVLIDAGASKINVIKEIRGITG
LGLKEAKDMSEKGGVLKEGVAKDEAEKMKAQLEAAGARVELK
>P02393 ~~~rplL~~~Large ribosomal subunit protein bL12~~~COG0222
MSSITKEQVVEFIANMTVLELSEFIKELEEKFGVSAAAPAMAMVAAGPAEAAPAEEEKTEFDVILKAAGANKIGVIKVVR
ALTGLGLKEAKDKVDGAPSTLKEAVSKEEAEEAKKQLVEAGAEVEVK
>P0A7K2 ~~~rplL~~~Large ribosomal subunit protein bL12~~~COG0222
MSITKDQIIEAVAAMSVMDVVELISAMEEKFGVSAAAAVAVAAGPVEAAEEKTEFDVILKAAGANKVAVIKAVRGATGLG
LKEAKDLVESAPAALKEGVSKDDAEALKKALEEAGAEVEVK
>P05392 ~~~rplL~~~Large ribosomal subunit protein bL12~~~
MTKEQIIEAVKNMTVLELNELVKAIEEEFGVTAAAPVVVAGGAAAGAEAAAEKTEFDVILADAGAQKIKVIKVVREITGL
GLKEAKDLVDNTPKPIKEGIAKEEAEEIKAALEEAGAKVEIK
>P07472 ~~~rplL~~~Large ribosomal subunit protein bL12~~~
MALTQEDIINAVAEMSVMEVAELVSAMEEKFGVSAAAAVVAGPGGGEAEEAEEQTEFDLVLTSAGEKKVNVIKVVREITG
LGLKEAKAAVDGAPATLKEGMSKEDGDEAKTKLEEAGASVELK
>P14134 ~~~rplL~~~Large ribosomal subunit protein bL12~~~
MNKEEIMSAIEEMSVLELSELVEDLEEKFGVSAAAPVAVAGGAAGAGAAAEEKSEFDVFLADIGGKKIKVIKAVRELTGL
GLKEAKGVVDDAPGNVKEGLSKEDAEEMKEKLEEAGATVELK
>P55834 ~~~rplL~~~Large ribosomal subunit protein bL12~~~COG0222
MAISKEEVLEYIGSLSVLELSELVKMFEKKFGVSATPTVVAGAAVAGGAAAESEEKTEFNVILADSGAEKIKVIKVVREI
TGLGLKEAKDATEKTPHVLKEGVNKEEAETIKKKLEEVGAKVEVK
>P02395 ~~~rplL~~~Large ribosomal subunit protein bL12~~~
MNKEQILEAIKAMTVLELNDLVKAIEEEFGVTAAAPVVAGGAAAAAEEKTEFDVVLASAGAEKIKVIKVVREITGLGLKE
AKEVVDNAPKALKEGVSKDEAEEIKAKLEEVGASVEVK
>P75239 ~~~rplL~~~Large ribosomal subunit protein bL12~~~
MAKLDKNQLIESLKEMTIMEIDEIIKAVEEAFGVSATPVVAAGAVGGTQEAASEVTVKVTGYTDNAKLAVLKLYREIAGV
GLMEAKTAVEKLPCVVKQDIKPEEAEELKKRFVEVGATVEIK
>P9WHE3 ~~~rplL~~~Large ribosomal subunit protein bL12~~~COG0222
MAKLSTDELLDAFKEMTLLELSDFVKKFEETFEVTAAAPVAVAAAGAAPAGAAVEAAEEQSEFDVILEAAGDKKIGVIKV
VREIVSGLGLKEAKDLVDGAPKPLLEKVAKEAADEAKAKLEAAGATVTVK
>E6MUA0 ~~~rplL~~~Large ribosomal subunit protein bL12~~~
MAITKEDILEAVGSLTVMELNDLVKAFEEKFGVSAAAVAVAGPAGAGAADAEEKTEFDVVLASAGDQKVGVIKVVRAITG
LGLKEAKDIVDGAPKTIKEGVSKAEAEDIQKQLEEAGAKVEIK
>Q6N4R8 ~~~rplL~~~Large ribosomal subunit protein bL12~~~COG0222
MADLQKIVDDLSSLTVLEAAELAKLLEEKWGVSAAAAVAVAAAPGAGGAAAPAEEKTEFTVVLASAGDKKIEVIKEVRAI
TGLGLKEAKDLVEGAPKPLKEGVNKEEAEKVKAQLEKAGAKVELK
>P99154 ~~~rplL~~~Large ribosomal subunit protein bL12~~~
MANHEQIIEAIKEMSVLELNDLVKAIEEEFGVTAAAPVAVAGAAGGADAAAEKTEFDVELTSAGSSKIKVVKAVKEATGL
GLKDAKELVDGAPKVIKEALPKEEAEKLKEQLEEVGATVELK
>P02396 ~~~rplL~~~Large ribosomal subunit protein bL12~~~
MAKLSQDDLLAQFEEMTLIELSEFVKAFEEKFDVTAAAAVAVAGPAAGGAPAEAEAEQDEFDVILTGAGEKKIQVIKVVR
ELTSLGLKEAKDLVDGTPKPVLEKVAKEAAEKAAESLKAAGASVEVK
>P0A471 ~~~rplL~~~Large ribosomal subunit protein bL12~~~COG0222
MALNIENIIAEIKEASILELNDLVKAIEEEFGVTAAAPVAVAAADAADAGAAKDSFDVELTSAGDKKVGVIKVVREITGL
GLKEAKELVDGAPALVKEGVATAEAEEIKAKLEEAGASVTLK
>P23349 ~~~rplL~~~Large ribosomal subunit protein bL12~~~COG0222
MSAATDQILEQLKSLSLLEASELVKQIEEAFGVSAAAPVGGMVMAAAAAAPAEAAEEKTEFDVILEEVPADKKIAVLKVV
RTITGLGLKEAKELVESTPKAIKEATGKDDAEAIKKQIEEAGGKAAVK
>P29396 ~~~rplL~~~Large ribosomal subunit protein bL12~~~COG0222
MTIDEIIEAIEKLTVSELAELVKKLEDKFGVTAAAPVAVAAAPVAGAAAGAAQEEKTEFDVVLKSFGQNKIQVIKVVREI
TGLGLKEAKDLVEKAGSPDAVIKSGVSKEEAEEIKKKLEEAGAEVELK
>Q72GS2 ~~~rplL~~~Large ribosomal subunit protein bL12~~~COG0222
MALDIERIKEELSQATVLELKQLIDALKEAWGVTAAAPVAVAAAPAAGAAAAPAEEKTEFDVILKEAGAKKLEVIKELRA
ITGLGLKEAKDLAEKGGPVKEGVSKQEAEEIKKKLEAVGAVVELK
>Q8VVE2 ~~~rplL~~~Large ribosomal subunit protein bL12~~~COG0222
MALDIERIKEELSQATVLELKQLIDALKEAWGVTAAAPVAVAAAPAAGAAAAPAEEKTEFDVILKEAGAKKLEVIKELRA
ITGLGLKEAKDLAEKGGPVKEGVSKQEAEEIKKKLEAVGAVVELK
>B7IBC3 ~~~rplI~~~Large ribosomal subunit protein bL9~~~
MDVILLQRIKNLGKLGDKVSVKAGYGRNFLIPQGKAVAATEANTAAFEARRAELEKQEAEVLAAAQARAEQLNEVNIVIT
AKAGDEGKLFGSIGTRDIADALTNAGLTVDRAEVRLPNGALRHTGEFNIAIQLHHDVVAEVLVTIVSE
>P37437 ~~~rplI~~~Large ribosomal subunit protein bL9~~~COG0359
MKVIFLQDVKGKGKKGEVKNVADGYAHNFLIKKGLAVEANASNISALNGQKQKEKKEAIAELEQAKSLKETLEKLTVELS
AKSGEGGRLFGSVTSKQITEQLQKDHNIKVDKRKLELPDGIRALGYTNVPVKLHPEVQAVLKVHVKEEA
>Q83D73 ~~~rplI~~~Large ribosomal subunit protein bL9~~~COG0359
MKLILQEKVANLGNIGDQVVVKPGYARNFLLPLGKAVPATPEHIAEFEKQRAELEKAAAELLAKAKARAKKLEDKSFKIT
ANASDEGRLFGSIGPREIAQAITEAGIEIEKREVDLSQGPIRQVGEYEVPLRLHTDVSVNVKIEVAPENSNS
>Q9RY49 ~~~rplI~~~Large ribosomal subunit protein bL9~~~COG0359
MQVILLEPSRLGKTGEVVSVKDGYARNWLIPQGLAVSATRTNMKTLEAQLRSIEKRQAQEKAVAEDLASRLNGVAVELSV
RAGEGKIYGAVTHQDVANSLDQLGFDVDRRKIDMPKTVKEVGEYDIAYRAHPEVTIPMKLVVHAAK
>P0A7R1 ~~~rplI~~~Large ribosomal subunit protein bL9~~~COG0359
MQVILLDKVANLGSLGDQVNVKAGYARNFLVPQGKAVPATKKNIEFFEARRAELEAKLAEVLAAANARAEKINALETVTI
ASKAGDEGKLFGSIGTRDIADAVTAAGVEVAKSEVRLPNGVLRTTGEHEVSFQVHSEVFAKVIVNVVAE
>P02417 ~~~rplI~~~Large ribosomal subunit protein bL9~~~
MKVIFLKDVKGKGKKGEIKNVADGYANNFLFKQGLAIEATPANLKALEAQKQKEQRQAAEELANAKKLKEQLEKLTVTIP
AKAGEGGRLFGSITSKQIAESLQAQHGLKLDKRKIELADAIRALGYTNVPVKLHPEVTATLKVHVTEQK
>P75540 ~~~rplI~~~Large ribosomal subunit protein bL9~~~
MKVILKQDVSNLGKRFDVVDVKDGYAIHFLFPKKLAAPLTKKSLQDRDLFLKKQQEHYEINKALSHKLKEVIEQTELHFS
LKEHNGRPYGSIITKQIINQAHTKGMALQKFMFKDNVRLGFGDHEITLHIFEDTTAVLKVKVTPDNGVK
>A0R7F6 ~~~rplI~~~Large ribosomal subunit protein bL9~~~COG0359
MKLILTAEVEHLGAAGDTVEVKDGYGRNYLLPRGLAIVASRGAERQAEEIRRARESKVIRDIEHANELKTALEGLGDVTL
SVNAAGDTGKLFGSVTAADVVNAIKKAGGPNLDKRTVQLAKAHIKSVGTHPVTVKLHTGVEAKVSLNVVAQ
>A5TYC7 ~~~rplI~~~Large ribosomal subunit protein bL9~~~COG0359
MKLILTADVDHLGSIGDTVEVKDGYGRNFLLPRGLAIVASRGAQKQADEIRRARETKSVRDLEHANEIKAAIEALGPIAL
PVKTSADSGKLFGSVTAADVVAAIKKAGGPNLDKRIVRLPKTHIKAVGTHFVSVHLHPEIDVEVSLDVVAQS
>P9WH79 ~~~rplI~~~Large ribosomal subunit protein bL9~~~COG0359
MKLILTADVDHLGSIGDTVEVKDGYGRNFLLPRGLAIVASRGAQKQADEIRRARETKSVRDLEHANEIKAAIEALGPIAL
PVKTSADSGKLFGSVTAADVVAAIKKAGGPNLDKRIVRLPKTHIKAVGTHFVSVHLHPEIDVEVSLDVVAQS
>Q9HUN2 ~~~rplI~~~Large ribosomal subunit protein bL9~~~
MEVILLEKVANLGNLGDKVNIKGGYARNFLLPQGKATVATAENVAAFEARRAELEKAAAEKKAAAEARAAQLSELVVTLG
AHAGDEGKLFGSIGTRDIAEAVSAAGYPLEKAEVRLPNGALRNTGEFDVAVHLHTDVETTLKLIIVAE
>Q6N5A1 ~~~rplI~~~Large ribosomal subunit protein bL9~~~COG0359
MEVILLERVAKLGQMGELVRVKDGFARNFLLPRGKALRATAANREKYEHMKADLEARNIAAKAEATKVAEKIDGQNVVVI
RQASEGGQLFGSVSVRDIIASFDGQGVKIDRSQVLLDAPIKTIGKHSIQVAVHPEVEVAVSVTVARSAEEAERINRGEDI
STRREDEDAAAEALAAAGEFFDPDAQFGEEQPTEE
>P66318 ~~~rplI~~~Large ribosomal subunit protein bL9~~~
MKVIFTQDVKGKGKKGEVKEVPVGYANNFLLKKNYAVEATPGNLKQLELQKKRAKQERQQEIEDAKALKETLSNIEVEVS
AKTGEGGKLFGSVSTKQIAEALKAQHDIKIDKRKMDLPNGIHSLGYTNVPVKLDKEVEGTIRVHTVEQ
>Q72GV5 ~~~rplI~~~Large ribosomal subunit protein bL9~~~COG0359
MKVILLEPLENLGDVGQVVDVKPGYARNYLLPRGLAVLATESNLKALEARIRAQAKRLAERKAEAERLKEILENLTLTIP
VRAGETKIYGSVTAKDIAEALSRQHGITIDPKRLALEKPIKELGEYVLTYKPHPEVPIQLKVSVVAQE
>Q5SLQ1 ~~~rplI~~~Large ribosomal subunit protein bL9~~~COG0359
MKVILLEPLENLGDVGQVVDVKPGYARNYLLPRGLAVLATESNLKALEARIRAQAKRLAERKAEAERLKEILENLTLTIP
VRAGETKIYGSVTAKDIAEALSRQHGVTIDPKRLALEKPIKELGEYVLTYKPHPEVPIQLKVSVVAQE
>P27151 ~~~rplI~~~Large ribosomal subunit protein bL9~~~
MKVILLEPLENLGDVGQVVDVKPGYARNYLLPRGLAVLATESNLKALEARIRAQAKRLAERKAEAERLKKILENLTLTIP
VRAGETKIYGSVTAKDIAEALSRQHGVTIDPKRLALEKPIKELGEYVLTYKPHPEVPIQLKVSVVAQE
>P76104 ~~~rlhA~~~23S rRNA 5-hydroxycytidine C2501 synthase~~~COG0826
MTVSSHRLELLSPARDAAIAREAILHGADAVYIGGPGFGARHNASNSLKDIAELVPFAHRYGAKIFVTLNTILHDDELEP
AQRLITDLYQTGVDALIVQDMGILELDIPPIELHASTQCDIRTVEKAKFLSDVGFTQIVLARELNLDQIRAIHQATDATI
EFFIHGALCVAYSGQCYISHAQTGRSANRGDCSQACRLPYTLKDDQGRVVSYEKHLLSMKDNDQTANLGALIDAGVRSFK
IEGRYKDMSYVKNITAHYRQMLDAIIEERGDLARASSGRTEHFFVPSTEKTFHRGSTDYFVNARKGDIGAFDSPKFIGLP
VGEVVKVAKDHLDVAVTEPLANGDGLNVLIKREVVGFRANTVEKTGENQYRVWPNEMPADLHKIRPHHPLNRNLDHNWQQ
ALTKTSSERRVAVDIELGGWQEQLILTLTSEEGVSITHTLDGQFDEANNAEKAMNNLKDGLAKLGQTLYYARDVQINLPG
ALFVPNSLLNQFRREAADMLDAARLASYQRGSRKPVADPAPVYPQTHLSFLANVYNQKAREFYHRYGVQLIDAAYEAHEE
KGEVPVMITKHCLRFAFNLCPKQAKGNIKSWKATPMQLVNGDEVLTLKFDCRPCEMHVIGKIKNHILKMPLPGSVVASVS
PDELLKTLPKRKG
>E8T3K9 6.5.1.-~~~~~~Putative RNA ligase~~~COG0639
MEITLEKALKEIEGNKFFKVLKENDLVKVSYRFNAPQTFDTPLKRELRGITFSSKTGRVVSRPFHKFFNLGEHPETEKER
LKGKLFILREKLDGTMLHPAVVEGRVRLFTQKDFANPQIEKGEELLRRNDKLLKATRRLLEKGLTPIFELISPEFQLVIP
YETEELILTEVRDNRTGHYLLEEAENELIQMGFKLPRKRVGTVEEAERLIEEAENVEGFVAKNFDESEPFPLFVKIKSPW
YHRAHYAFTYLHNIPDHKLFNLFLNNRADDIFATVTNPAVKEKKSRRLKILTDIYHSLLSSAEKLSQLYGKVKEETLKAE
AQKELKKIEREFKEELKLFNFPVEHLTEAARLAKQKKKFDKFLGTKLYTALKHQTVKLKT
>Q5SI81 2.1.1.-~~~RlmO~~~Ribosomal RNA large subunit methyltransferase I~~~COG1092
MLGPVLRLVVKAGKERKLRNFYPNLYRDEIAAPPEGVGVAEAVDAEGHFLAVGYYDPRSRVPFRAFRFDPGPLNRAFFQG
RFARALRRRQGLGESHRLVHGEADGLPGLVVDRFGEVLVLQVRSRGMEALREVWLPALLEVVAPKGVYERSDVEARRQEG
LPERVGVVYGEVPEVLEVEEDGLRFPIPLALAQKTGYYLDQRENRRLFEAMVRPGERVLDVYSYVGGFALRAARKGAYAL
AVDKDLEALGVLDQAALRLGLRVDIRHGEALPTLRGLEGPFHHVLLDPPTLVKRPEELPAMKRHLVDLVREALRLLAEEG
FLWLSSCSYHLRLEDLLEVARRAAADLGRRLRVHRVTYQPEDHPWSLHIPESLYLKTLVLQDDPL
>Q9S1M6 2.1.1.188~~~rlmAII~~~23S rRNA (guanine(748)-N(1))-methyltransferase~~~COG2226
MRKNVVRYLRCPHCAAPLRSSDRTLRCENGHTFDVARQGYVNLLRRPTKLAADTTDMVAARAALLDSGHYAPLTERLAGT
ARRAAGAGAPDCVVDIGGGTGHHLARVLEEFEDAEGLLLDMSKPAVRRAARAHPRASSAVADVWDTLPLRDGAAAMALNV
FAPRNPPEIRRILRPGGTLLVVTPQQDHLAELVDALGLLRVRDHKEGRLAEQLAPHFEAVGQERLRTTLRLDHDALGRVV
AMGPSSWHQDPDELARRIAELPGIHEVTLSVTFTVCRPLP
>P36999 2.1.1.187~~~rlmA~~~23S rRNA (guanine(745)-N(1))-methyltransferase~~~COG2226
MSFSCPLCHQPLSREKNSYICPQRHQFDMAKEGYVNLLPVQHKRSRDPGDSAEMMQARRAFLDAGHYQPLRDAIVAQLRE
RLDDKATAVLDIGCGEGYYTHAFADALPEITTFGLDVSKVAIKAAAKRYPQVTFCVASSHRLPFSDTSMDAIIRIYAPCK
AEELARVVKPGGWVITATPGPRHLMELKGLIYNEVHLHAPHAEQLEGFTLQQSAELCYPMRLRGDEAVALLQMTPFAWRA
KPEVWQTLAAKEVFDCQTDFNIHLWQRSY
>P63177 2.1.1.185~~~rlmB~~~23S rRNA (guanosine-2'-O-)-methyltransferase RlmB~~~COG0566
MSEMIYGIHAVQALLERAPERFQEVFILKGREDKRLLPLIHALESQGVVIQLANRQYLDEKSDGAVHQGIIARVKPGRQY
QENDLPDLIASLDQPFLLILDGVTDPHNLGACLRSADAAGVHAVIVPKDRSAQLNATAKKVACGAAESVPLIRVTNLART
MRMLQEENIWIVGTAGEADHTLYQSKMTGRLALVMGAEGEGMRRLTREHCDELISIPMAGSVSSLNVSVATGICLFEAVR
QRS
>P44906 2.1.1.185~~~rlmB~~~23S rRNA (guanosine-2'-O-)-methyltransferase RlmB~~~COG0566
MSEQIYGIHAVNSILTHSPERLIEVFVLKGREDKRLQPLLNELYSLGIGVQFVNRQTLDKKADGEVHQGVIARVQAAKEL
NENDLDEILANKQNPLLLVLDGVTDPHNLGACLRTADAAGAVAVIVPKDKSAQLTSIARKVACGAAETVPLIRVTNLSRT
LRDLQQNHNIWVVGTAGEATETIYQSKLTGPLALVMGAEGEGMRRLTREHCDQLISIPMAGSVSSLNVSVATGVCLFEIV
RQRLGS
>O31503 2.1.1.189~~~rlmCD~~~23S rRNA (uracil-C(5))-methyltransferase RlmCD~~~COG2265
MKMKPPVEKNEYYDVTFEDLTHEGAGVAKVQGFPIFVPNALPEEKAQIKVTRVKKGFAFGRLIELKEESPHRTDAPCPIY
KQCGGCQLQHMTYEGQLLFKQKQVKDVLERIGKLDLSKVTVHPTLGMEDPWNYRNKAQVPVGEREGGLVAGFYQQRSHDI
IDMSACLIQQSKNDEAVQAVKDICANYGVKAYNEERHKGWLRHIMVRYGVVTGEMMIVFITRTSDFPHKAKIIEDITAQF
PHVKSIVQNINPNKTNVIFGNETNVIWGEEYIYDLIGDVKFAISARSFYQVNPEQTKVLYDKALEYAELQGEETVIDAYC
GIGTISLFLAKQAKKVYGVEIVPEAIEDAKRNAELNGNTNAEFAVGEAETVIPKWYEEGITADTLVVDPPRKGCDEALLR
TIVEMKPKRVVYVSCNPGTLARDLRVLEDGGYVTREVQPVDMFPHTNHVECCVLIKLKE
>P75817 2.1.1.189~~~rlmC~~~23S rRNA (uracil(747)-C(5))-methyltransferase RlmC~~~COG2265
MQCALYDAGRCRSCQWIMQPIPEQLSAKTADLKNLLADFPVEEWCAPVSGPEQGFRNKAKMVVSGSVEKPLLGMLHRDGT
PEDLCDCPLYPASFAPVFAALKPFIARAGLTPYNVARKRGELKYILLTESQSDGGMMLRFVLRSDTKLAQLRKALPWLHE
QLPQLKVITVNIQPVHMAIMEGETEIYLTEQQALAERFNDVPLWIRPQSFFQTNPAVASQLYATARDWVRQLPVKHMWDL
FCGVGGFGLHCATPDMQLTGIEIASEAIACAKQSAAELGLTRLQFQALDSTQFATAQGDVPELVLVNPPRRGIGKPLCDY
LSTMAPRFIIYSSCNAQTMAKDIRELPGFRIERVQLFDMFPHTAHYEVLTLLVKQ
>P55135 2.1.1.190~~~rlmD~~~23S rRNA (uracil(1939)-C(5))-methyltransferase RlmD~~~COG2265
MAQFYSAKRRTTTRQIITVSVNDLDSFGQGVARHNGKTLFIPGLLPQENAEVTVTEDKKQYARAKVVRRLSDSPERETPR
CPHFGVCGGCQQQHASVDLQQRSKSAALARLMKHDVSEVIADVPWGYRRRARLSLNYLPKTQQLQMGFRKAGSSDIVDVK
QCPILAPQLEALLPKVRACLGSLQAMRHLGHVELVQATSGTLMILRHTAPLSSADREKLERFSHSEGLDLYLAPDSEILE
TVSGEMPWYDSNGLRLTFSPRDFIQVNAGVNQKMVARALEWLDVQPEDRVLDLFCGMGNFTLPLATQAASVVGVEGVPAL
VEKGQQNARLNGLQNVTFYHENLEEDVTKQPWAKNGFDKVLLDPARAGAAGVMQQIIKLEPIRIVYVSCNPATLARDSEA
LLKAGYTIARLAMLDMFPHTGHLESMVLFSRVK
>P0C0R7 2.1.1.166~~~rlmE~~~Ribosomal RNA large subunit methyltransferase E~~~COG0293
MTGKKRSASSSRWLQEHFSDKYVQQAQKKGLRSRAWFKLDEIQQSDKLFKPGMTVVDLGAAPGGWSQYVVTQIGGKGRII
ACDLLPMDPIVGVDFLQGDFRDELVMKALLERVGDSKVQVVMSDMAPNMSGTPAVDIPRAMYLVELALEMCRDVLAPGGS
FVVKVFQGEGFDEYLREIRSLFTKVKVRKPDSSRARSREVYIVATGRKP
>P75782 2.1.1.181~~~rlmF~~~Ribosomal RNA large subunit methyltransferase F~~~COG3129
MSAQKPGLHPRNRHHSRYDLATLCQVNPELRQFLTLTPAGEQSVDFANPLAVKALNKALLAHFYAVANWDIPDGFLCPPV
PGRADYIHHLADLLAEASGTIPANASILDIGVGANCIYPLIGVHEYGWRFTGSETSSQALSSAQAIISSNPGLNRAIRLR
RQKESGAIFNGIIHKNEQYDATLCNPPFHDSAAAARAGSERKRRNLGLNKDDALNFGGQQQELWCEGGEVTFIKKMIEES
KGFAKQVMWFTSLVSRGENLPPLYRALTDVGAVKVVKKEMAQGQKQSRFIAWTFMNDEQRRRFVNRQR
>P42596 2.1.1.174~~~rlmG~~~Ribosomal RNA large subunit methyltransferase G~~~COG2813
MSHLDNGFRSLTLQRFPATDDVNPLQAWEAADEYLLQQLDDTEIRGPVLILNDAFGALSCALAEHKPYSIGDSYISELAT
RENLRLNGIDESSVKFLDSTADYPQQPGVVLIKVPKTLALLEQQLRALRKVVTSDTRIIAGAKARDIHTSTLELFEKVLG
PTTTTLAWKKARLINCTFNEPQLADAPQTVSWKLEGTDWTIHNHANVFSRTGLDIGARFFMQHLPENLEGEIVDLGCGNG
VIGLTLLDKNPQAKVVFVDESPMAVASSRLNVETNMPEALDRCEFMINNALSGVEPFRFNAVLCNPPFHQQHALTDNVAW
EMFHHARRCLKINGELYIVANRHLDYFHKLKKIFGNCTTIATNNKFVVLKAVKLGRRR
>Q45601 2.1.1.177~~~rlmH~~~Ribosomal RNA large subunit methyltransferase H~~~COG1576
MNINIVTIGKLKEKYLKQGIEEYTKRLSAYAKIDIIELPDEKAPENLSDQDMKIIKDKEGDRILSKISPDAHVIALAIEG
KMKTSEELADTIDKLATYGKSKVTFVIGGSLGLSDTVMKRADEKLSFSKMTFPHQLMRLILVEQIYRAFRINRGEPYHK
>P0A8I8 2.1.1.177~~~rlmH~~~Ribosomal RNA large subunit methyltransferase H~~~COG1576
MKLQLVAVGTKMPDWVQTGFTEYLRRFPKDMPFELIEIPAGKRGKNADIKRILDKEGEQMLAAAGKNRIVTLDIPGKPWD
TPQLAAELERWKLDGRDVSLLIGGPEGLSPACKAAAEQSWSLSALTLPHPLVRVLVAESLYRAWSITTNHPYHRE
>P0C1V0 2.1.1.177~~~rlmH~~~Ribosomal RNA large subunit methyltransferase H~~~
MKITILAVGKLKEKYWKQAIAEYEKRLGPYTKIDIIEVPDEKAPENMSDKEIEQVKEKEGQRILAKIKPQSTVITLEIQG
KMLSSEGLAQELNQRMTQGQSDFVFVIGGSNGLHKDVLQRSNYALSFSKMTFPHQMMRVVLIEQVYRAFKIMRGEAYHK
>Q9WZU8 2.1.1.177~~~rlmH~~~Ribosomal RNA large subunit methyltransferase H~~~COG1576
MRVRIAVIGKLDGFIKEGIKHYEKFLRRFCKPEVLEIKRVHRGSIEEIVRKETEDLTNRILPGSFVMVMDKRGEEVSSEE
FADFLKDLEMKGKDITILIGGPYGLNEEIFAKAHRVFSLSKMTFTHGMTVLIVLEQIFRAFKIIHGENYHY
>P75876 2.1.1.191~~~rlmI~~~Ribosomal RNA large subunit methyltransferase I~~~COG1092
MSVRLVLAKGREKSLLRRHPWVFSGAVARMEGKASLGETIDIVDHQGKWLARGAYSPASQIRARVWTFDPSESIDIAFFS
RRLQQAQKWRDWLAQKDGLDSYRLIAGESDGLPGITIDRFGNFLVLQLLSAGAEYQRAALISALQTLYPECSIYDRSDVA
VRKKEGMELTQGPVTGELPPALLPIEEHGMKLLVDIQHGHKTGYYLDQRDSRLATRRYVENKRVLNCFSYTGGFAVSALM
GGCSQVVSVDTSQEALDIARQNVELNKLDLSKAEFVRDDVFKLLRTYRDRGEKFDVIVMDPPKFVENKSQLMGACRGYKD
INMLAIQLLNEGGILLTFSCSGLMTSDLFQKIIADAAIDAGRDVQFIEQFRQAADHPVIATYPEGLYLKGFACRVM
>P37634 2.1.1.266~~~rlmJ~~~Ribosomal RNA large subunit methyltransferase J~~~COG2961
MLSYRHSFHAGNHADVLKHTVQSLIIESLKEKDKPFLYLDTHAGAGRYQLGSEHAERTGEYLEGIARIWQQDDLPAELEA
YINVVKHFNRSGQLRYYPGSPLIARLLLREQDSLQLTELHPSDYPLLRSEFQKDSRARVEKADGFQQLKAKLPPVSRRGL
ILIDPPYEMKTDYQAVVSGIAEGYKRFATGIYALWYPVVLRQQIKRMIHDLEATGIRKILQIELAVLPDSDRRGMTASGM
IVINPPWKLEQQMNNVLPWLHSKLVPAGTGHATVSWIVPE
>P31777 2.1.1.266~~~rlmJ~~~Ribosomal RNA large subunit methyltransferase J~~~COG2961
MLSYHHSFHAGNHADVLKHIVLMLILENLKLKEKGFFYLDTHSGVGRYRLSSNESEKTGEYKEGIGRLWDQTDLPEDIAR
YVKMIKKLNYGGKELRYYAGSPLIAAELLRSQDRALLTELHPSDYPILRNNFSDDKNVTVKCDNGFQQVKATLPPKERRG
LVLIDPPYELKDDYDLVVKAIEEGYKRFATGTYAIWYPVVLRQQTKRIFKGLEATGIRKILKIELAVRPDSDQRGMTASG
MVVINPPWTLETQMKEILPYLTKTLVPEGTGSWTVEWITPE
>P75864 ~~~rlmL~~~Ribosomal RNA large subunit methyltransferase K/L~~~COG0116
MNSLFASTARGLEELLKTELENLGAVECQVVQGGVHFKGDTRLVYQSLMWSRLASRIMLPLGECKVYSDLDLYLGVQAIN
WTEMFNPGATFAVHFSGLNDTIRNSQYGAMKVKDAIVDAFTRKNLPRPNVDRDAPDIRVNVWLHKETASIALDLSGDGLH
LRGYRDRAGIAPIKETLAAAIVMRSGWQPGTPLLDPMCGSGTLLIEAAMLATDRAPGLHRGRWGFSGWAQHDEAIWQEVK
AEAQTRARKGLAEYSSHFYGSDSDARVIQRARTNARLAGIGELITFEVKDVAQLTNPLPKGPYGTVLSNPPYGERLDSEP
ALIALHSLLGRIMKNQFGGWNLSLFSASPDLLSCLQLRADKQYKAKNGPLDCVQKNYHVAESTPDSKPAMVAEDYTNRLR
KNLKKFEKWARQEGIECYRLYDADLPEYNVAVDRYADWVVVQEYAPPKTIDAHKARQRLFDIIAATISVLGIAPNKLVLK
TRERQKGKNQYQKLGEKGEFLEVTEYNAHLWVNLTDYLDTGLFLDHRIARRMLGQMSKGKDFLNLFSYTGSATVHAGLGG
ARSTTTVDMSRTYLEWAERNLRLNGLTGRAHRLIQADCLAWLREANEQFDLIFIDPPTFSNSKRMEDAFDVQRDHLALMK
DLKRLLRAGGTIMFSNNKRGFRMDLDGLAKLGLKAQEITQKTLSQDFARNRQIHNCWLITAA
>Q9JYY8 2.1.1.264~~~rlmK~~~Ribosomal RNA large subunit methyltransferase K~~~
MASYDKISDGWYRVCPKRSSNRALITVKLPFSTLFRLKPMTDITPFANRLGKNIKHLMKWAKRNGIEAWRIYDRDIPQFP
FAADVYGDRIHLQEYDTGWLMRPEEYEAWLAEVLEAVAFVTGFAPEQIRLKRRERQKGLQQYEKTGKAGDDFVITENGRK
FWVNLDKYLDTGLFLDHRNTRKKVGETAAGKRFLNLFSYTGSFTVYAATGGAASSETVDLSNTYLDWAKRNFELNGIDTE
RHKIVRADVFQYLQTAYGEGRRFDLIVMDPPSFSNSKKMSDILDIQRDHKKLIDGAVKLLASDGILYFSNNLRSFVLDDL
VSEQYAVKDISKQSVPEDFRNKKIHRCWEIRHKS
>Q9K0V4 2.1.1.173~~~rlmL~~~Ribosomal RNA large subunit methyltransferase L~~~
MNTLYTLFATCPRGLETVLSQELESLGCTDVQVFDGGVSCRGGLEQVYAANLHSRTASRILLRLTKGTYRNERDIYKLAK
NINWFNWFTLQQTFKVKVEAKRANVKSIQFVGLTVKDAVCDAFRDIYDARPSVDKAAPDVRIHAFLNERNVEIFIDTSGE
ALFKRGYRLDTGEAPLRENLAAGLLLSAGYDGTQPFQDPFCGSGTIAIEAAWIAARRAPGMMRRFGFEKLQNFDKTLWSD
LRRRAEAQTRPVRAPIAGSDNDRRIVQTALDNARRAGVDDIVSFSVADAQSVRPNGENGIMVSNPPYGVRLEEVRALQAL
YPQLGTWLKKHYAGWLAAMFTGDREMPKFMCLSPKRKIPLYNGNIDCRLFLIDMVEGSNR
>P0ADR6 2.1.1.186~~~rlmM~~~Ribosomal RNA large subunit methyltransferase M~~~COG2933
MNKVVLLCRPGFEKECAAEITDKAGQREIFGFARVKENAGYVIYECYQPDDGDKLIRELPFSSLIFARQWFVVGELLQHL
PPEDRITPIVGMLQGVVEKGGELRVEVADTNESKELLKFCRKFTVPLRAALRDAGVLANYETPKRPVVHVFFIAPGCCYT
GYSYSNNNSPFYMGIPRLKFPADAPSRSTLKLEEAFHVFIPADEWDERLANGMWAVDLGACPGGWTYQLVKRNMWVYSVD
NGPMAQSLMDTGQVTWLREDGFKFRPTRSNISWMVCDMVEKPAKVAALMAQWLVNGWCRETIFNLKLPMKKRYEEVSHNL
AYIQAQLDEHGINAQIQARQLYHDREEVTVHVRRIWAAVGGRRDER
>P36979 2.1.1.192~~~rlmN~~~Dual-specificity RNA methyltransferase RlmN~~~COG0820
MSEQLVTPENVTTKDGKINLLDLNRQQMREFFKDLGEKPFRADQVMKWMYHYCCDNFDEMTDINKVLRGKLKEVAEIRAP
EVVEEQRSSDGTIKWAIAVGDQRVETVYIPEDDRATLCVSSQVGCALECKFCSTAQQGFNRNLRVSEIIGQVWRAAKIVG
AAKVTGQRPITNVVMMGMGEPLLNLNNVVPAMEIMLDDFGFGLSKRRVTLSTSGVVPALDKLGDMIDVALAISLHAPNDE
IRDEIVPINKKYNIETFLAAVRRYLEKSNANQGRVTIEYVMLDHVNDGTEHAHQLAELLKDTPCKINLIPWNPFPGAPYG
RSSNSRIDRFSKVLMSYGFTTIVRKTRGDDIDAACGQLAGDVIDRTKRTLRKRMQGEAIDIKAV
>A6QGB8 2.1.1.192~~~rlmN~~~Probable dual-specificity RNA methyltransferase RlmN~~~
MITAEKKKKNKFLPNFDKQSIYSLRFDEMQNWLVEQGQQKFRAKQIFEWLYQKRVDSIDEMTNLSKDLRQLLKDNFTVTT
LTTVVKQESKDGTIKFLFELQDGYTIETVLMRHDYGNSVCVTTQVGCRIGCTFCASTLGGLKRNLEAGEIVSQVLTVQKA
LDATEERVSQIVIMGIGEPFENYDEMMDFLRIVNDDNSLNIGARHITVSTSGIIPRIYDFADEDIQINFAVSLHAAKDEV
RSRLMPINRAYNVEKLIEAIQYYQEKTNRRVTFEYGLFGGVNDQLEHARELAHLIKGLNCHVNLIPVNHVPERNYVKTAK
NDIFKFEKELKRLGINATIRREQGSDIDAACGQLRAKERQVETR
>Q7A600 2.1.1.192~~~rlmN~~~Probable dual-specificity RNA methyltransferase RlmN~~~
MITAEKKKKNKFLPNFDKQSIYSLRFDEMQNWLVEQGQQKFRAKQIFEWLYQKRVDSIDEMTNLSKDLRQLLKDNFTVTT
LTTVVKQESKDGTIKFLFELQDGYTIETVLMRHDYGNSVCVTTQVGCRIGCTFCASTLGGLKRNLEAGEIVSQVLTVQKA
LDATEERVSQIVIMGIGEPFENYDEMMDFLRIVNDDNSLNIGARHITVSTSGIIPRIYDFADEDIQINFAVSLHAAKDEV
RSRLMPINRAYNVEKLIEAIQYYQEKTNRRVTFEYGLFGGVNDQLEHARELAHLIKGLNCHVNLIPVNHVPERNYVKTAK
NDIFKFEKELKRLGINATIRREQGSDIDAACGQLRAKERQVETR
>O83107 2.1.1.192~~~rlmN~~~Probable dual-specificity RNA methyltransferase RlmN~~~COG0820
MEWCCALSGLLPEEIQKVCAFAERFRGVQVFRWIAAGCTDFHAMSDLSSETRARLARACVISDTRVYTTLRDVDGTLKLG
IELKDKRRVEAVLLVDQVSRKTACLSCQVGCPMACAFCQTGQLGFARNLSASEIVEQFLHLERCVGTLDNVVFMGMGEPM
LNLDAVCRAIEILSHPQGRDLSEKRITISTSGHCRGIYSLADRALQVRLAVSLTTANAPLRARLMPRAAHDSLAKLKSAI
RYFNEKSGKRVTLELALMRGVNTSERHAQEVIDFAHGLNVHVNLIPWNPVASIHFETPREVEVAHFEALLMRARIPVTRR
YQRGNGIGGACGQLGKTAGV
>P94538 2.1.1.-~~~rlmP~~~23S rRNA (guanosine(2553)-2'-O)-methyltransferase RlmP~~~COG0566
MKQIESAKNQKVKDWKKLHTKKERTKTNTFLIEGEHLVEEALKSPGIVKEILVKDETRIPSDLETGIQCYMLSEDAFSAV
TETETPQQIAAVCHMPEEKLATARKVLLIDAVQDPGNLGTMIRTADAAGLDAVVLGDGTADAFNGKTLRSAQGSHFHIPV
VRRNLPSYVDELKAEGVKVYGTALQNGAPYQEIPQSESFALIVGNEGAGVDAALLEKTDLNLYVPLYGQAESLNVAVAAA
ILVYHLRG
>Q5SIT4 2.1.1.191~~~~~~Ribosomal RNA large subunit methyltransferase I~~~COG1092
MRIQVNAKGAARLLSRHLWVFRRDVVSGPETPGLYPVYWGRRFLALALYNPHTDLAVRAYRFAPAEDPVAALLENLAQAL
ARREAVLRQDPEGGYRLVHAEGDLLPGLVVDYYAGHAVVQATAHAWEGLLPQVAEALRPHVQSVLAKNDARTRELEGLPL
YVRPLLGEVPERVQVQEGRVRYLVDLRAGQKTGAYLDQRENRLYMERFRGERALDVFSYAGGFALHLALGFREVVAVDSS
AEALRRAEENARLNGLGNVRVLEANAFDLLRRLEKEGERFDLVVLDPPAFAKGKKDVERAYRAYKEVNLRAIKLLKEGGI
LATASCSHHMTEPLFYAMVAEAAQDAHRLLRVVEKRGQPFDHPVLLNHPETHYLKFAVFQVL
>P10100 4.2.2.-~~~rlpA~~~Endolytic peptidoglycan transglycosylase RlpA~~~COG0797
MRKQWLGICIAAGMLAACTSDDGQQQTVSVPQPAVCNGPIVEISGADPRFEPLNATANQDYQRDGKSYKIVQDPSRFSQA
GLAAIYDAEPGSNLTASGEAFDPTQLTAAHPTLPIPSYARITNLANGRMIVVRINDRGPYGNDRVISLSRAAADRLNTSN
NTKVRIDPIIVAQDGSLSGPGMACTTVAKQTYALPAPPDLSGGAGTSSVSGPQGDILPVSNSTLKSEDPTGAPVTSSGFL
GAPTTLAPGVLEGSEPTPAPQPVVTAPSTTPATSPAMVTPQAVSQSASGNFMVQVGAVSDQARAQQYQQQLGQKFGVPGR
VTQNGAVWRIQLGPFASKAEASTLQQRLQTEAQLQSFITTAQ
>Q57092 4.2.2.-~~~rlpA~~~Endolytic peptidoglycan transglycosylase RlpA~~~COG0797
MKLKTGLNLTALLLFMISVAFPAQADTQKMYGIRGDNLSIATQMPAPRTYSVKGQTYTTKSGNEAKSYIKEGLASYYHLK
FDGRKTASGDVYNSKQFTAAHKTLPINSYALVTNLHNNRKVIVRINDRGPFSDKRLIDLSHAAAKEIGLISRGIGQVRIE
ALHVAKNGNLSGAATKTLAKQAKTQEAADRLVLKSNTLFDNTSKSINALKGTEFYCLKMLELTSRSQANKLITQLALANI
QTEVNRSGNKYEIYIGPFDDKTKMAQVRTKLQKMANNKPLIVYTYKN
>A0A0H2ZFV1 4.2.2.-~~~rlpA~~~Endolytic peptidoglycan transglycosylase RlpA~~~
MSKRVRSSLILPAVCGLGLAAVLLSSCSSKAPQQPARQAGISGPGDYSRPHRDGAPWWDVDVSRIPDAVPMPHNGSVKAN
PYTVLGKTYYPMNDARAYRMVGTASWYGTKFHGQATANGETYDLYGMTAAHKTLPLPSYVRVTNLDNGKSVIVRVNDRGP
FYSDRVIDLSFAAAKKLGYAETGTARVKVEGIDPVQWWAQRGRPAPMVLAQPKQAVAQAPAATQTQAVAMAQPIETYTPP
PAQHAAAVLPVQIDSKKNASLPADGLYLQVGAFANPDAAELLKAKLSGVTAAPVFISSVVRNQQILHRVRLGPIGSADEV
SRTQDSIRVANLGQPTLVRPD
>Q9X6V6 4.2.2.-~~~rlpA~~~Endolytic peptidoglycan transglycosylase RlpA~~~
MSKRVRSSLILPAVCGLGLAAVLLSSCSSKAPQQPARQAGISGPGDYSRPHRDGAPWWDVDVSRIPDAVPMPHNGSVKAN
PYTVLGKTYYPMNDARAYRMVGTASWYGTKFHGQATANGETYDLYGMTAAHKTLPLPSYVRVTNLDNGKSVIVRVNDRGP
FYSDRVIDLSFAAAKKLGYAETGTARVKVEGIDPVQWWAQRGRPAPMVLAQPKQAVAQAAPAAAQTQAVAMAQPIETYTP
PPAQHAAAVLPVQIDSKKNASLPADGLYLQVGAFANPDAAELLKAKLSGVTAAPVFISSVVRNQQILHRVRLGPIGSADE
VSRTQDSIRVANLGQPTLVRPD
>Q1M964 3.1.4.-~~~~~~Multifunctional alkaline phosphatase superfamily protein pRL90232~~~
MRKKNVLLIVVDQWRADFVPHVLRADGKIDFLKTPNLDRLCREGVTFRNHVTTCVPCGPARASLLTGLYLMNHRAVQNTV
PLDQRHLNLGKALRGVGYDPALIGYTTTVPDPRTTSPNDPRFRVLGDLMDGFHPVGAFEPNMEGYFGWVAQNGFDLPEHR
PDIWLPEGEDAVAGATDRPSRIPKEFSDSTFFTERALTYLKGRDGKPFFLHLGYYRPHPPFVASAPYHAMYRPEDMPAPI
RAANPDIEAAQHPLMKFYVDSIRRGSFFQGAEGSGATLDEAELRQMRATYCGLITEVDDCLGRVFSYLDETGQWDDTLII
FTSDHGEQLGDHHLLGKIGYNDPSFRIPLVIKDAGENARAGAIESGFTESIDVMPTILDWLGGKIPHACDGLSLLPFLSE
GRPQDWRTELHYEYDFRDVYYSEPQSFLGLGMNDCSLCVIQDERYKYVHFAALPPLFFDLRHDPNEFTNLADDPAYAALV
RDYAQKALSWRLKHADRTLTHYRSGPEGLSERSH
>Q2RSU7 5.3.3.23~~~rlp~~~5-methylthioribulose-1-phosphate isomerase~~~COG1850
MTDRLRATYRVKATAASIEARAKGIAVEQSVEMPLSAIDDPAVLDGIVGVVEEITERGEDCFEVRLALSTATIGGDAGQL
FNMLFGNTSLQDDTVLLDIDLPDDLLASFGGPNIGAAGLRARVGASADRALTCSALKPQGLPPDRLADLARRMALGGLDF
IKDDHGMADQAYAPFASRVGAVAAAVDEVNRQTGGQTRYLPSLSGHLDQLRSQVRTGLDHGIDTFLIAPMIVGPSTFHAV
VREFPGAAFFAHPTLAGPSRIAPPAHFGKLFRLLGADAVIFPNSGGRFGYSRDTCQAVAEAALGPWGGLHASLPVPAGGM
SLARVPEMIATYGPDVIVLIGGNLLEARDRLTEETAAFVASVAGAASRGCGLAP
>P0AA37 5.4.99.28~~~rluA~~~Dual-specificity RNA pseudouridine synthase RluA~~~COG0564
MGMENYNPPQEPWLVILYQDDHIMVVNKPSGLLSVPGRLEEHKDSVMTRIQRDYPQAESVHRLDMATSGVIVVALTKAAE
RELKRQFREREPKKQYVARVWGHPSPAEGLVDLPLICDWPNRPKQKVCYETGKPAQTEYEVVEYAADNTARVVLKPITGR
SHQLRVHMLALGHPILGDRFYASPEARAMAPRLLLHAEMLTITHPAYGNSMTFKAPADF
>P35159 5.4.99.22~~~rluB~~~Ribosomal large subunit pseudouridine synthase B~~~COG1187
MERLQKVIAHAGVASRRKAEELIKEGKVKVNGKVVTELGVKVTGSDQIEVNGLKVEREEPVYFLLYKPRGVISAAQDDKG
RKVVTDFFKNIPQRIYPIGRLDYDTSGLLLLTNDGEFANKLMHPKYEIDKTYVAKVKGIPPKELLRKLERGIRLEEGKTA
PAKAKLLSLDKKKQTSIIQLTIHEGRNRQVRRMFEAIGHEVIKLKREEYAFLNLRGLHTGDARELTPHEVKRLRALADHG
KNAF
>P37765 5.4.99.22~~~rluB~~~Ribosomal large subunit pseudouridine synthase B~~~COG1187
MSEKLQKVLARAGHGSRREIESIIEAGRVSVDGKIAKLGDRVEVTPGLKIRIDGHLISVRESAEQICRVLAYYKPEGELC
TRNDPEGRPTVFDRLPKLRGARWIAVGRLDVNTCGLLLFTTDGELANRLMHPSREVEREYAVRVFGQVDDAKLRDLSRGV
QLEDGPAAFKTIKFSGGEGINQWYNVTLTEGRNREVRRLWEAVGVQVSRLIRVRYGDIPLPKGLPRGGWTELDLAQTNYL
RELVELPPETSSKVAVEKDRRRMKANQIRRAVKRHSQVSGGRRSGGRNNNG
>P45104 5.4.99.22~~~rluB~~~Ribosomal large subunit pseudouridine synthase B~~~COG1187
MKPSQKQTQRQPHFSKDSAKKRDFSAKNDRRSVSRTARIETANTKKSAVNSDNKFLSKPKAKPVVRASNQPKAEGEKLQK
VLARAGQGSRREIETMIAAGRVSVEGKIATLGDRIDVHSGVKVRIDGQIINLSHTQKEICRVLMYYKPEGELCTRSDPEG
RATVFDRLPRLTGSRWIAVGRLDINTSGLLLFTTDGELANRLMHPSREVEREYSVRVFGQVDDAMLARLRKGVQLEDGLA
NFKEIKFTGGVGINQWYDVTLMEGRNREVRRLWESQGIQVSRLIRIRYGNIKLMKGLPRGGWEEMDLENVNYLRELVGLP
AETETKLDVKQASRRPKSGQIRKAVKRYSEMNKRYKK
>P0AA39 5.4.99.24~~~rluC~~~Ribosomal large subunit pseudouridine synthase C~~~COG0564
MKTETPSVKIVAITADEAGQRIDNFLRTQLKGVPKSMIYRILRKGEVRVNKKRIKPEYKLEAGDEVRIPPVRVAEREEEA
VSPHLQKVAALADVILYEDDHILVLNKPSGTAVHGGSGLSFGVIEGLRALRPEARFLELVHRLDRDTSGVLLVAKKRSAL
RSLHEQLREKGMQKDYLALVRGQWQSHVKSVQAPLLKNILQSGERIVRVSQEGKPSETRFKVEERYAFATLVRCSPVTGR
THQIRVHTQYAGHPIAFDDRYGDREFDRQLTEAGTGLNRLFLHAAALKFTHPGTGEVMRIEAPMDEGLKRCLQKLRNAR
>P33643 5.4.99.23~~~rluD~~~Ribosomal large subunit pseudouridine synthase D~~~COG0564
MAQRVQLTATVSENQLGQRLDQALAEMFPDYSRSRIKEWILDQRVLVNGKVCDKPKEKVLGGEQVAINAEIEEEARFEPQ
DIPLDIVYEDEDIIIINKPRDLVVHPGAGNPDGTVLNALLHYYPPIADVPRAGIVHRLDKDTTGLMVVAKTVPAQTRLVE
SLQRREITREYEAVAIGHMTAGGTVDEPISRHPTKRTHMAVHPMGKPAVTHYRIMEHFRVHTRLRLRLETGRTHQIRVHM
AHITHPLVGDPVYGGRPRPPKGASEAFISTLRKFDRQALHATMLRLYHPISGIEMEWHAPIPQDMVELIEVMRADFEEHK
DEVDWL
>P65836 5.4.99.23~~~rluD~~~Ribosomal large subunit pseudouridine synthase D~~~
MAQRVQLTATVSENQLGQRLDQALAEMFPDYSRSRIKEWILNQRVLVNGQLCDKPKEKVLGGERVAIDAEIDEEIRFEAQ
DIPLDIVYEDDDILVINKPRDLVVHPGAGNPDGTVLNALLHYYPPIADVPRAGIVHRLDKDTTGLMVVAKTVPAQTRLVE
SLQLREITREYEAVAIGHMTAGGTVNEPISRHPTKRTHMSVHPMGKPAVTHYRIMEHFRVHTRLRLRLETGRTHQIRVHM
AHITHPLVGDQVYGGRPRPPKGASEEFISTLRKFDRQALHATMLRLYHPVSGIEMEWHAPIPQDMVDLIDAMRADFEDHK
DDVDWL
>P75966 5.4.99.20~~~rluE~~~Ribosomal large subunit pseudouridine synthase E~~~COG1187
MRQFIISENTMQKTSFRNHQVKRFSSQRSTRRKPENQPTRVILFNKPYDVLPQFTDEAGRKTLKEFIPVQGVYAAGRLDR
DSEGLLVLTNNGALQARLTQPGKRTGKIYYVQVEGIPTQDALEALRNGVTLNDGPTLPAGAELVDEPAWLWPRNPPIRER
KSIPTSWLKITLYEGRNRQVRRMTAHVGFPTLRLIRYAMGDYSLDNLANGEWREVTD
>P32684 5.4.99.-~~~rluF~~~Dual-specificity RNA pseudouridine synthase RluF~~~COG1187
MLPDSSVRLNKYISESGICSRREADRYIEQGNVFLNGKRATIGDQVKPGDVVKVNGQLIEPREAEDLVLIALNKPVGIVS
TTEDGERDNIVDFVNHSKRVFPIGRLDKDSQGLIFLTNHGDLVNKILRAGNDHEKEYLVTVDKPITEEFIRGMSAGVPIL
GTVTKKCKVKKEAPFVFRITLVQGLNRQIRRMCEHFGYEVKKLERTRIMNVSLSGIPLGEWRDLTDDELIDLFKLIENSS
SEVKPKAKAKPKTAGIKRPVVKMEKTAEKGGRPASNGKRFTSPGRKKKGR
>Q6T1X6 1.1.1.281~~~rmd~~~GDP-6-deoxy-D-mannose reductase~~~
MRALITGVAGFVGKYLANHLTEQNVEVFGTSRNNEAKLPNVEMISLDIMDSQRVKKVISDIKPDYIFHLAAKSSVKDSWL
NKKGTFSTNVFGTLHVLDAVRDSNLDCRILTIGSSEEYGMILPEESPVSEENQLRPMSPYGVSKASVGMLARQYVKAYGM
DIIHTRTFNHIGPGQSLGFVTQDFAKQIVDIEMEKQEPIIKVGNLEAVRDFTDVRDIVQAYWLLSQYGKTGDVYNVCSGI
GTRIQDVLDLLLAMANVKIDTELNPLQLRPSEVPTLIGSNKRLKDSTGWKPRIPLEKSLFEILQSYRQA
>Q9HTB6 1.1.1.281~~~rmd~~~GDP-6-deoxy-D-mannose reductase~~~
MTQRLFVTGLSGFVGKHLQAYLAAAHTPWALLPVPHRYDLLEPDSLGDLWPELPDAVIHLAGQTYVPEAFRDPARTLQIN
LLGTLNLLQALKARGFSGTFLYISSGDVYGQVAEAALPIHEELIPHPRNPYAVSKLAAESLCLQWGITEGWRVLVARPFN
HIGPGQKDSFVIASAARQIARMKQGLQANRLEVGDIDVSRDFLDVQDVLSAYLRLLSHGEAGAVYNVCSGQEQKIRELIE
LLADIAQVELEIVQDPARMRRAEQRRVRGSHARLHDTTGWKPEITIKQSLRAILSDWESRVREE
>P0AFW2 ~~~rmf~~~Ribosome modulation factor~~~COG3130
MKRQKRDRLERAHQRGYQAGIAGRSKEMCPYQTLNQRSQWLGGWREAMADRVVMA
>P37744 2.7.7.24~~~rfbA~~~Glucose-1-phosphate thymidylyltransferase 1~~~COG1209
MKMRKGIILAGGSGTRLYPVTMAVSKQLLPIYDKPMIYYPLSTLMLAGIRDILIISTPQDTPRFQQLLGDGSQWGLNLQY
KVQPSPDGLAQAFIIGEEFIGGDDCALVLGDNIFYGHDLPKLMEAAVNKESGATVFAYHVNDPERYGVVEFDKNGTAISL
EEKPLEPKSNYAVTGLYFYDNDVVQMAKNLKPSARGELEITDINRIYLEQGRLSVAMMGRGYAWLDTGTHQSLIEASNFI
ATIEERQGLKVSCPEEIAFRKGFIDVEQVRKLAVPLIKNNYGQYLYKMTKDSN
>P61887 2.7.7.24~~~rffH~~~Glucose-1-phosphate thymidylyltransferase 2~~~COG1209
MKGIILAGGSGTRLHPITRGVSKQLLPIYDKPMIYYPLSVLMLAGIREILIITTPEDKGYFQRLLGDGSEFGIQLEYAEQ
PSPDGLAQAFIIGETFLNGEPSCLVLGDNIFFGQGFSPKLRHVAARTEGATVFGYQVMDPERFGVVEFDDNFRAISLEEK
PKQPKSNWAVTGLYFYDSKVVEYAKQVKPSERGELEITSINQMYLEAGNLTVELLGRGFAWLDTGTHDSLIEASTFVQTV
EKRQGFKIACLEEIAWRNGWLDDEGVKRAASSLAKTGYGQYLLELLRARPRQY
>P55253 2.7.7.24~~~rmlA~~~Glucose-1-phosphate thymidylyltransferase~~~COG1209
MKTRKGIILAGGSGTRLYPVTMAVSKQLLPIYDKPMIYYPLSTLMLAGIRDILIISTPQDTPRFQQLLGDGSQWGLNLQY
KVQPSPDGLAQAFIIGEDFIGGDDCALVLGDNIFYGHDLPKLMEAAVNKESGATVFAYHVNDPERYGVVEFDNNGTAISL
EEKPLEPKSNYAVTGLYFYDNDVVEMARKNLKPSARGELEITDINRIYMEQGRLSVAMMGRGYAWLDTGTHQSLIEASNF
IATIEERQGLKVSCPEEIAYRKGFIDAEQVKVLAEPLKKNAYGQYLLKMIKGY
>P39629 2.7.7.24~~~rmlA~~~Glucose-1-phosphate thymidylyltransferase~~~COG1209
MKGVILAGGNGSRLMPLTKAVNKHLLPVGPYPMIYWSIMKLQEAGIKDILLISQKEHMPQFYKLLGNGEELGVTITYQVQ
PAASGISDGLSYAKRFTKKESFILLLGDNIFEDSLKPYTERFEQQGKGAKVLLKEVDDPERFGIAEIDEKNKRIRSIIEK
PEQPPTNLCVTGIYMYDAEVFSYIEQISPSKRGELEITDVNNLYIENSQLTYDVLSGWWVDAGTHESLYLASQLVHQALR
KGQDEK
>A0QPF9 2.7.7.24~~~rmlA~~~Glucose-1-phosphate thymidylyltransferase~~~COG1209
MRGIILAGGSGTRLHPLTIGVSKQLLPVYDKPLVYYPLSTLIMAGIRDILVITTPADAPAFRRLLGDGSDFGVNLSYAAQ
NEPEGLAQAFLIGADHIGNDTVALALGDNIFYGPGLGTSLRRFEHVSGGAIFAYWVANPSAYGVVEFDADGKAVSLEEKP
KTPKSHYAVPGLYFYDNTVIDIARSLKKSARGEYEITEVNQIYLNRGQLSVEVLARGTAWLDTGTFDSLLDASDFVRTIE
LRQGLKVGAPEEIAWRAGFIDDDQLATRAKELLKSGYGHYLLQLLDRE
>P9WH13 2.7.7.24~~~rmlA~~~Glucose-1-phosphate thymidylyltransferase~~~COG1209
MRGIILAGGSGTRLYPITMGISKQLLPVYDKPMIYYPLTTLMMAGIRDIQLITTPHDAPGFHRLLGDGAHLGVNISYATQ
DQPDGLAQAFVIGANHIGADSVALVLGDNIFYGPGLGTSLKRFQSISGGAIFAYWVANPSAYGVVEFGAEGMALSLEEKP
VTPKSNYAVPGLYFYDNDVIEIARGLKKSARGEYEITEVNQVYLNQGRLAVEVLARGTAWLDTGTFDSLLDAADFVRTLE
RRQGLKVSIPEEVAWRMGWIDDEQLVQRARALVKSGYGNYLLELLERN
>P26393 2.7.7.24~~~rmlA~~~Glucose-1-phosphate thymidylyltransferase~~~
MKTRKGIILAGGSGTRLYPVTMAVSKQLLPIYDKPMIYYPLSTLMLAGIRDILIISTPQDTPRFQQLLGDGSQWGLNLQY
KVQPSPDGLAQAFIIGEEFIGHDDCALVLGDNIFYGHDLPKLMEAAVNKESGATVFAYHVNDPERYGVVEFDQKGTAVSL
EEKPLQPKSNYAVTGLYFYDNSVVEMAKNLKPSARGELEITDINRIYMEQGRLSVAMMGRGYAWLDTGTHQSLIEASNFI
ATIEERQGLKVSCPEEIAFRKNFINAQQVIELAGPLSKNDYGKYLLKMVKGL
>P27830 4.2.1.46~~~rffG~~~dTDP-glucose 4,6-dehydratase 2~~~COG1088
MRKILITGGAGFIGSALVRYIINETSDAVVVVDKLTYAGNLMSLAPVAQSERFAFEKVDICDRAELARVFTEHQPDCVMH
LAAESHVDRSIDGPAAFIETNIVGTYTLLEAARAYWNALTEDKKSAFRFHHISTDEVYGDLHSTDDFFTETTPYAPSSPY
SASKASSDHLVRAWLRTYGLPTLITNCSNNYGPYHFPEKLIPLMILNALAGKSLPVYGNGQQIRDWLYVEDHARALYCVA
TTGKVGETYNIGGHNERKNLDVVETICELLEELAPNKPHGVAHYRDLITFVADRPGHDLRYAIDASKIARELGWLPQETF
ESGMRKTVQWYLANESWWKQVQDGSYQGERLGLKG
>Q6E7F4 4.2.1.46~~~rmlB~~~dTDP-glucose 4,6-dehydratase~~~
MKILVTGGAGFIGSAVVRHIINNTQDSVINVDKLTYAGNLESLTEIENNERYKFEHADICDSVAIANIFAHHQPDAIMHL
AAESHVDRSITGPADFIETNIVGTYILLEEARKYWLALSEDRKGAFRFHHISTDEVYGDLPHPDEVSSDTILPLFTEQTS
YSPSSPYSASKASSDHLVRAWRRTYGLPTIVTNCSNNYGPYHFPEKLIPLIILNAIAGKLLPVYGNGEQIRDWLYVEDHA
RALYEVVTKGVPGETYNIGGHNERKNIDVVKTICRILDELIADKPDGIENFEQLIRYVSDRPGHDLRYAIDASKIKQDLG
WVPQETFETGITKTIHWYLNNKEWWQRVMDGSYAGERLGLVE
>P39630 4.2.1.46~~~rfbB~~~dTDP-glucose 4,6-dehydratase~~~COG1088
MAKSYLITGGAGFIGLTFTKLMLRETDARITVLDKLTYASHPEEMEKLKQNSRFRFVKGDISVQEDIDRAFDETYDGVIH
FAAESHVDRSISQAEPFITTNVMGTYRLAEAVLKGKAKKLIHISTDEVYGDLKADDPAFTETTPLSPNNPYSASKASSDL
LVLSYVKTHKLPAIITRCSNNYGPYQHSEKMIPTIIRHAKQGLPVPLYGDGLQIRDWLFAEDHCRAIKLILEKGTDGEVY
NIGGGNERTNKELASVILKHLGCEELFAHVEDRKGHDRRYAINASKLKNELGWRQEVTFEEGIARTIQWYTDNDR
>A0QSK6 4.2.1.46~~~rmlB~~~dTDP-glucose 4,6-dehydratase~~~COG1088
MRLLVTGGAGFIGANFVHLALREARTSSITVLDALTYAGSRESLAPVADRIRLVQGDITDAALVGDLVAESDAVVHFAAE
THVDNALADPEPFLHSNVVGTYTILEAVRRHNVRLHHVSTDEVYGDLELDNPARFNETTPYNPSSPYSSTKAAADLLVRA
WVRSYGVRATISNCSNNYGPYQHVEKFIPRQITNVLTGRRPKLYGAGANVRDWIHVDDHNSAVWRILTDGTIGRTYLIGA
ECERNNLTVMRTILKLMGRDPDDFDHVTDRAGHDLRYAIDPSTLQDELGWAPKHTDFEAGLTDTIDWYRANESWWRPLKD
TVEAKYQERGQ
>P9WN65 4.2.1.46~~~rmlB~~~dTDP-glucose 4,6-dehydratase~~~COG1088
MRLLVTGGAGFIGTNFVHSAVREHPDDAVTVLDALTYAGRRESLADVEDAIRLVQGDITDAELVSQLVAESDAVVHFAAE
SHVDNALDNPEPFLHTNVIGTFTILEAVRRHGVRLHHISTDEVYGDLELDDRARFTESTPYNPSSPYSATKAGADMLVRA
WVRSYGVRATISNCSNNYGPYQHVEKFIPRQITNVLTGRRPKLYGAGANVRDWIHVDDHNSAVRRILDRGRIGRTYLISS
EGERDNLTVLRTLLRLMDRDPDDFDHVTDRVGHDLRYAIDPSTLYDELCWAPKHTDFEEGLRTTIDWYRDNESWWRPLKD
ATEARYQERGQ
>P26391 4.2.1.46~~~rfbB~~~dTDP-glucose 4,6-dehydratase~~~
MKILITGGAGFIGSAVVRHIIKNTQDTVVNIDKLTYAGNLESLSDISESNRYNFEHADICDSAEITRIFEQYQPDAVMHL
AAESHVDRSITGPAAFIETNIVGTYALLEVARKYWSALGEDKKNNFRFHHISTDEVYGDLPHPDEVENSVTLPLFTETTA
YAPSSPYSASKASSDHLVRAWRRTYGLPTIVTNCSNNYGPYHFPEKLIPLVILNALEGKPLPIYGKGDQIRDWLYVEDHA
RALHMVVTEGKAGETYNIGGHNEKKNLDVVFTICDLLDEIVPKATSYREQITYVADRPGHDRRYAIDAGKISRELGWKPL
ETFESGIRKTVEWYLANTQWVNNVKSGAYQSWIEQNYEGRQ
>P29782 4.2.1.46~~~strE~~~dTDP-glucose 4,6-dehydratase~~~
MTTHLLVTGAAGFIGSQYVRTLLGPGGPPDVVVTALDALTYAGNPDNLAAVRGHPRYRFERGDICDAPGRRVMAGQDQVV
HLAAESHVDRSLLDASVFVRTNVHGTQTLLDAATRHGVASFVQVSTDEVYGSLEHGSWTEDEPLRPNSPYSASKASGDLL
ALAHHVSHGLDVRVTRCSNNYGPRQFPEKLIPRFITLLMDGHRVPLYGDGLNVREWLHVDDHVRGIEAVRTRGRAGRVYN
IGGGATLSNKELVGLLLEAAGADWGSVEYVEDRKGHDRRYAVDSTRIQRELGFAPAVDLADGLAATVAWYHKHRSWWEPL
VPAGSLPA
>P95780 4.2.1.46~~~rmlB~~~dTDP-glucose 4,6-dehydratase~~~COG1088
MTEYKNIIVTGGAGFIGSNFVHYVYNNHPDVHVTVLDKLTYAGNRANLEEILGDRVELVVGDIADSELVDKLAAKADAIV
HYAAESHNDNSLKDPSPFIYTNFVGTYILLEAARKYDIRFHHVSTDEVYGDLPLREDLPGHGEGPGEKFTAETKYNPSSP
YSSTKAASDLIVKAWVRSFGVKATISNCSNNYGPYQHIEKFIPRQITNILSGIKPKLYGEGKNVRDWIHTNDHSTGVWAI
LTKGRIGETYLIGADGEKNNKEVLELILEKMSQPKNAYDHVTDRAGHDLRYAIDSTKLREELGWKPQFTNFEEGLEDTIK
WYTEHEDWWKAEKEAVEANYAKTQKILN
>Q5XCG7 5.1.3.-~~~rfbC~~~Protein RmlC homolog~~~
MTETFFDKPLACREIKEIPGLLEFDIPVRGDNRGWFKENFQKEKMLPIGFPERFFEEGKLQNNVSFSRQHVLRGLHAEPW
DKYISVADDGKVLGAWVDLREGETFGNVYQTVIDASKGMFVPRGVANGFQVLSETVSYSYLVNDYWALDLKPKYAFVNYA
DPSLGITWENLAAAEVSEADKNHPLLSDVKPLKPKDL
>P37745 5.1.3.13~~~rfbC~~~dTDP-4-dehydrorhamnose 3,5-epimerase~~~COG1898
MNVIRTEIEDVLILEPRVFGDDRGFFYESFNQSAFEHILGYPVSFVQDNHSRSSKNVLRGLHFQRGEYAQDKLVRCTHGA
VFDVAVDIRPNSVSFGKWVGVLLSADNKQQLWIPKGFAHGFLVLSDIAEFQYKTTNYYHPESDCGICWNDERIAIDWPQT
SGLILSPKDERLFTLDELIRLKLIA
>A0QSK5 5.1.3.13~~~rmlC~~~dTDP-4-dehydrorhamnose 3,5-epimerase~~~COG1898
MTARELSIAGAWEITPVLRTDSRGLFFEWFTDAGFTEFAGHQFDMRQANCSVSARGVLRGVHFAQVPPSQAKYVTCVRGA
VFDVVVDIRVGSPTFGQWDAVLLDDKDRRSIYISEGLGHAFLALDDDSTVMYLCSAPYAPQREHTVRPTDFGIEWPEVPE
LILSDRDAQAPSLAEAQAAGVLPTWADCQAFVETLRRNLVS
>P9WH11 5.1.3.13~~~rmlC~~~dTDP-4-dehydrorhamnose 3,5-epimerase~~~COG1898
MKARELDVPGAWEITPTIHVDSRGLFFEWLTDHGFRAFAGHSLDVRQVNCSVSSAGVLRGLHFAQLPPSQAKYVTCVSGS
VFDVVVDIREGSPTFGRWDSVLLDDQDRRTIYVSEGLAHGFLALQDNSTVMYLCSAEYNPQREHTICATDPTLAVDWPLV
DGAAPSLSDRDAAAPSFEDVRASGLLPRWEQTQRFIGEMRGT
>Q9HU21 5.1.3.13~~~rmlC~~~dTDP-4-dehydrorhamnose 3,5-epimerase~~~
MKATRLAIPDVILFEPRVFGDDRGFFFESYNQRAFEEACGHPVSFVQDNHSRSARGVLRGLHYQIRQAQGKLVRATLGEV
FDVAVDLRRGSPTFGQWVGERLSAENKRQMWIPAGFAHGFVVLSEYAEFLYKTTDFWAPEHERCIVWNDPELKIDWPLQD
APLLSEKDRQGKAFADADCFP
>P26394 5.1.3.13~~~rfbC~~~dTDP-4-dehydrorhamnose 3,5-epimerase~~~
MMIVIKTAIPDVLILEPKVFGDERGFFFESYNQQTFEELIGRKVTFVQDNHSKSKKNVLRGLHFQRGENAQGKLVRCAVG
EVFDVAVDIRKESPTFGQWVGVNLSAENKRQLWIPEGFAHGFVTLSEYAEFLYKATNYYSPSSEGSILWNDEAIGIEWPF
SQLPELSAKDAAAPLLDQALLTE
>P29783 5.1.3.13~~~strM~~~dTDP-4-dehydrorhamnose 3,5-epimerase~~~
MRPLSVQGAWLSETRAFADDRGEFQELYSARSLRGALGYDPGVAQVNRSVSRRGVLRGVHFAQLPPSQAKYVTCLSGAVL
DVVVDIRTGSPTYRAWEAVRLDDPHRSLYVEAGLGHSFMALTDDAVVVYLTSQGYAAGREHGVHPLDPDLGIAWPDGIEP
VLSEKDRQAPGIAEMERRGLLPDYEECLAFRRSLCERGTG
>O66251 1.1.1.133~~~rmlD~~~dTDP-4-dehydrorhamnose reductase~~~
MARLLITGAGGQLGRSLAKLLVDNGRYEVLALDFSELDITNKDMVFSIIDSFKPNVIINAAAYTSVDQAELEVSSAYSVN
VRGVQYLAEAAIRHNSAILHVSTDYVFDGYKSGKYKETDIIHPLCVYGKSKAEGERLLLTLSPKSIILRTSWTFGEYGNN
FVKTMLRLAKNRDILGVVADQIGGPTYSGDIASVLIQIAEKIIVGETVKYGIYHFTGEPCVSWYDFAIAIFDEAVAQKVL
ENVPLVNAITTADYPTLAKRPANSCLDLTKIQQAFGIQPSDWQRALKNIRAYAE
>Q2SYI1 1.1.1.133~~~rmlD~~~dTDP-4-dehydrorhamnose reductase~~~
MKILVTGANGQVGWELARSLAVLGQVVPLARDEADLGRPETLARIVEDAKPDVVVNAAAYTAVDAAESDGAAAKVVNGEA
VGVLAAATKRVGGLFVHYSTDYVFDGTKSSPYIETDPTCPVNAYGASKLLGELAVAETGGDWLTFRTTWVFAARGKNFLR
TMLRLAKEREEMKIVADQFGAPTWARSIADGTAHALATAMRERAAGAFTSGVYHMTSAGQTSWHGFADAIVASWRAVPGA
APLAVSRIVPIPTSAYPVPARRPANSVLSNEALKERFGIELPDWRYAVGLCVRDLLSQ
>Q46769 1.1.1.133~~~rfbD~~~dTDP-4-dehydrorhamnose reductase~~~COG1091
MNILLFGKTGQVGWELQRALAPLGNLIALDVHSTDYCGDFSNPEGVAETVKKIRPDVIVNAAAHTDVDKAESEPEFAQLL
NATSVEAIAKAANEVGAWVIHYSTDYVFPGTGEIPWQGGTDATAPLNVYGETKLSSEKKALQKHCGKHIIFRTSWVYAGK
GNNFAKTMLRLAKEREELAVINDQFGRPTGAELLADCTAHAIRVAVDKPEVAGLYHLVAGGTTTWHDYAALVFEEARKAG
INLALNKLNAVPTTAYPTPARRPHNSRLNTEKFQQNFALVLPDWQVGVKRMLNELFTTTAI
>A0QTF8 1.1.1.133~~~rmlD~~~dTDP-4-dehydrorhamnose reductase~~~COG1091
MDLINGMGTSPGYWRTPREPGNDHRRARLDVMAQRIVITGAGGMVGRVLADQAAAKGHTVLALTSSQCDITDEDAVRRFV
ANGDVVINCAAYTQVDKAEDEPERAHAVNAVGPGNLAKACAAVDAGLIHISTDYVFGAVDRDTPYEVDDETGPVNIYGRT
KLAGEQAVLAAKPDAYVVRTAWVYRGGDGSDFVATMRRLAAGDGAIDVVADQVGSPTYTGDLVGALLQIVDGGVEPGILH
AANAGVASRFDQARATFEAVGADPERVRPCGSDRHPRPAPRPSYTVLSSQRSAQAGLTPLRDWREALQDAVAAVVGATTD
GPLPSTP
>P9WH09 1.1.1.133~~~rmlD~~~dTDP-4-dehydrorhamnose reductase~~~COG1091
MAGRSERLVITGAGGQLGSHLTAQAAREGRDMLALTSSQWDITDPAAAERIIRHGDVVINCAAYTDVDGAESNEAVAYAV
NATGPQHLARACARVGARLIHVSTDYVFDGDFGGAEPRPYEPTDETAPQGVYARSKLAGEQAVLAAFPEAAVVRTAWVYT
GGTGKDFVAVMRRLAAGHGRVDVVDDQTGSPTYVADLAEALLALADAGVRGRVLHAANEGVVSRFGQARAVFEECGADPQ
RVRPVSSAQFPRPAPRSSYSALSSRQWALAGLTPLRHWRSALATALAAPANSTSIDRRLPSTRD
>P26392 1.1.1.133~~~rfbD~~~dTDP-4-dehydrorhamnose reductase~~~
MNILLFGKTGQVGWELQRSLAPVGNLIALDVHSKEFCGDFSNPKGVAETVRKLRPDVIVNAAAHTAVDKAESEPELAQLL
NATSVEAIAKAANETGAWVVHYSTDYVFPGTGDIPWQETDATSPLNVYGKTKLAGEKALQDNCPKHLIFRTSWVYAGKGN
NFAKTMLRLAKERQTLSVINDQYGAPTGAELLADCTAHAIRVALNKPEVAGLYHLVAGGTTTWHDYAALVFDEARKAGIT
LALTELNAVPTSAYPTPASRPGNSRLNTEKFQRNFDLILPQWELGVKRMLTEMFTTTTI
>P37778 1.1.1.133~~~rfbD~~~dTDP-4-dehydrorhamnose reductase~~~
MNILLFGKTGQVGWELQRALAPLGNLIALDVHSTDYCGDFSNPEGVAETVKKIRPDVIVNAAAHTAVDKAESEPNFAQLL
NATCVEAIAKAANEVGAWVIHYSTDYVFPGNGDTPWLETDATAPLNVYGETKLAGEKALQEHCAKHLIFRTSWVYAGKGN
NFAKTMLRLAKEREELAVINDQFGAPTGAELLADCTAHAIRVAANKPEVAGLYHLVAGGTTTWHDYAALVFEEARRAGIN
LALNKLNAVPTTAYPTPARRPHNSRLNTEKFQQNFALVLPDWQVGVKRMLNELFTTTAI
>P29781 1.1.1.133~~~strL~~~dTDP-4-dehydrorhamnose reductase~~~
MSPYPRPRWLVTGASGMLGRELTPLLDRRGAAVTALGRGHLDITDGAAVRSAVAEHRPAVVVNCAAWTAVDEAESEPALA
MAVNGEGPRHLAQACRAVGAVLLQLSTDYVFPGSGGRPYREDHPTGPRTVYGCTKRAGERAVLEVLPDTGYIVRTAWLYG
AGGPNFVAKMIRLEADEDTVLVVDDQHGQPTWTADLADRLAALGAAALAGTAPAGIYHATNTGGTTWNALAPETFRLLGA
DPARVRPTTSLALARPAVRPRYSVLDQSRWKAAGLEPLRHWRAALTESFPALCGRAGRPVPGPR
>Q76G15 2.1.1.179~~~rmtB~~~16S rRNA (guanine(1405)-N(7))-methyltransferase~~~
MNINDALTSILASKKYRALCPDTVRRILTEEWGRHKSPKQTVEAARTRLHGICGAYVTPESLKAAAAALSAGDVKKALSL
HASTKERLAELDTLYDFIFSAETPRRVLDIACGLNPLALYERGIASVWGCDIHQGLGDVITPFAREKDWDFTFALQDVLC
APPAEAGDLALIFKLLPLLEREQAGSAMALLQSLNTPRMAVSFPTRSLGGRGKGMEANYAAWFEGGLPAEFEIEDKKTIG
TELIYLIKKNG
>Q33DX5 2.1.1.179~~~rmtC~~~16S rRNA (guanine(1405)-N(7))-methyltransferase~~~
MKTNDNYIEEVTAKVLTSGKYSTLYPPTVRRVTERLFDRYPPKQLEKEVRKKLHQAYGAYIGGIDGKRLEKKIEKIIHEI
PNPTTDEATRTEWEKEICLKILNLHTSTNERTVAYDELYQKIFEVTGVPTSITDAGCALNPFSFPFFTEAGMLGQYIGFD
LDKGMIEAIEHSLRTLNAPEGIVVKQGDILSDPSGESDLLLMFKLYTLLDRQEEASGLKILQEWKYKNAVISFPIKTISG
RDVGMEENYTVKFENDLVGSDLRIMQKLKLGNEMYFIVSRL
>P00648 3.1.27.-~~~~~~Ribonuclease~~~COG4290
MMKMEGIALKKRLSWISVCLLVLVSAAGMLFSTAAKTETSSHKAHTEAQVINTFDGVADYLQTYHKLPDNYITKSEAQAL
GWVASKGNLADVAPGKSIGGDIFSNREGKLPGKSGRTWREADINYTSGFRNSDRILYSSDWLIYKTTDHYQTFTKIR
>P30850 3.1.13.1~~~rnb~~~Exoribonuclease 2~~~COG4776
MFQDNPLLAQLKQQLHSQTPRAEGVVKATEKGFGFLEVDAQKSYFIPPPQMKKVMHGDRIIAVIHSEKERESAEPEELVE
PFLTRFVGKVQGKNDRLAIVPDHPLLKDAIPCRAARGLNHEFKEGDWAVAEMRRHPLKGDRSFYAELTQYITFGDDHFVP
WWVTLARHNLEKEAPDGVATEMLDEGLVREDLTALDFVTIDSASTEDMDDALFAKALPDDKLQLIVAIADPTAWIAEGSK
LDKAAKIRAFTNYLPGFNIPMLPRELSDDLCSLRANEVRPVLACRMTLSADGTIEDNIEFFAATIESKAKLVYDQVSDWL
ENTGDWQPESEAIAEQVRLLAQICQRRGEWRHNHALVFKDRPDYRFILGEKGEVLDIVAEPRRIANRIVEEAMIAANICA
ARVLRDKLGFGIYNVHMGFDPANADALAALLKTHGLHVDAEEVLTLDGFCKLRRELDAQPTGFLDSRIRRFQSFAEISTE
PGPHFGLGLEAYATWTSPIRKYGDMINHRLLKAVIKGETATRPQDEITVQMAERRRLNRMAERDVGDWLYARFLKDKAGT
DTRFAAEIVDISRGGMRVRLVDNGAIAFIPAPFLHAVRDELVCSQENGTVQIKGETVYKVTDVIDVTIAEVRMETRSIIA
RPVA
>O67082 3.1.26.3~~~rnc~~~Ribonuclease 3~~~COG0571
MKMLEQLEKKLGYTFKDKSLLEKALTHVSYSKKEHYETLEFLGDALVNFFIVDLLVQYSPNKREGFLSPLKAYLISEEFF
NLLAQKLELHKFIRIKRGKINETIIGDVFEALWAAVYIDSGRDANFTRELFYKLFKEDILSAIKEGRVKKDYKTILQEIT
QKRWKERPEYRLISVEGPHHKKKFIVEAKIKEYRTLGEGKSKKEAEQRAAEELIKLLEESE
>P51833 3.1.26.3~~~rnc~~~Ribonuclease 3~~~COG0571
MSKHSHYKDKKKFYKKVEQFKEFQERISVHFQNEKLLYQAFTHSSYVNEHRKKPYEDNERLEFLGDAVLELTISRFLFAK
YPAMSEGDLTKLRAAIVCEPSLVSLAHELSFGDLVLLGKGEEMTGGRKRPALLADVFEAFIGALYLDQGLEPVESFLKVY
VFPKINDGAFSHVMDFKSQLQEYVQRDGKGSLEYKISNEKGPAHNREFEAIVSLKGEPLGVGNGRSKKEAEQHAAQEALA
KLQKHHTKQ
>Q9PM40 3.1.26.3~~~rnc~~~Ribonuclease 3~~~COG0571
MKNIEKLEQSLTYEFKDKNLLIHALTHKSFKKSYNNERLEFLGDAVLDLVVGEYLFHKFAKDAEGDLSKLRAALVNEKSF
AKIANSLNLGDFILMSVAEENNGGKEKPSILSDALEAIIGAIHLEAGFEFAKTIALRLIEKNFPQIDAKILIKDYKTKLQ
EITQGKIGQTPQYETVRAFGPDHLKQFEIALMLDGKELARAIAGSKKEAQQMAAKIALEKLGAL
>P0A7Y2 3.1.26.3~~~rnc~~~Ribonuclease 3~~~COG0571
MNPIVINRLQRKLGYTFNHQELLQQALTHRSASSKHNERLEFLGDSILSYVIANALYHRFPRVDEGDMSRMRATLVRGNT
LAELAREFELGECLRLGPGELKSGGFRRESILADTVEALIGGVFLDSDIQTVEKLILNWYQTRLDEISPGDKQKDPKTRL
QEYLQGRHLPLPTYLVVQVRGEAHDQEFTIHCQVSGLSEPVVGTGSSRRKAEQAAAEQALKKLELE
>P0A7Y0 3.1.26.3~~~rnc~~~Ribonuclease 3~~~COG0571
MNPIVINRLQRKLGYTFNHQELLQQALTHRSASSKHNERLEFLGDSILSYVIANALYHRFPRVDEGDMSRMRATLVRGNT
LAELAREFELGECLRLGPGELKSGGFRRESILADTVEALIGGVFLDSDIQTVEKLILNWYQTRLDEISPGDKQKDPKTRL
QEYLQGRHLPLPTYLVVQVRGEAHDQEFTIHCQVSGLSEPVVGTGSSRRKAEQAAAEQALKKLELE
>P9WH03 3.1.26.3~~~rnc~~~Ribonuclease 3~~~COG0571
MIRSRQPLLDALGVDLPDELLSLALTHRSYAYENGGLPTNERLEFLGDAVLGLTITDALFHRHPDRSEGDLAKLRASVVN
TQALADVARRLCAEGLGVHVLLGRGEANTGGADKSSILADGMESLLGAIYLQHGMEKAREVILRLFGPLLDAAPTLGAGL
DWKTSLQELTAARGLGAPSYLVTSTGPDHDKEFTAVVVVMDSEYGSGVGRSKKEAEQKAAAAAWKALEVLDNAMPGKTSA
>Q52698 3.1.26.3~~~rnc~~~Ribonuclease 3~~~
MKVAADLSAFMDRLGHRFTTPEHLVRALTHSSLGSATRPDNQRLEFLGDRVLGLSMAEALFHADGRASEGQLAPRFNALV
RKETCAAVARDIDLGAVLKLGRSEMMSGGRRKDALLGDAMEAVIAAVYLDAGFEVARALVLRLWAARIQSVDNDARDPKT
ALQEWAQARGLPPPRYETLGRDGPDHAPQFRIAVVLASGETEEAQAGSKRNAEQAAAKALLERLERGA
>A8GQT8 3.1.26.3~~~rnc~~~Ribonuclease 3~~~
MESFEKLEKLLSYSFKNKELLIEALSHPSLRQHHEYKDDKDYERLEFLGDAVLNLVITEILFRNFANYNEGNLAKIRSYL
VCKETICMVGAKLTLKNYIIMTHGEEVAGGRDNLNNIENATEALIAAIYLDSNIETTHDIIEKLWAEFIKVQNLTDYDPK
TALQEWAQASDHHLPIYRLIKREGASHSSTFTVLVKVKDYEQTGTGHAIKEAEKNAARSLLHRLKND
>P66668 3.1.26.3~~~rnc~~~Ribonuclease 3~~~
MSKQKKSEIVNRFRKRFDTKMTELGFTYQNIDLYQQAFSHSSFINDFNMNRLDHNERLEFLGDAVLELTVSRYLFDKHPN
LPEGNLTKMRATIVCEPSLVIFANKIGLNEMILLGKGEEKTGGRTRPSLISDAFEAFIGALYLDQGLDIVWKFAEKVIFP
HVEQNELLGVVDFKTQFQEYVHQQNKGDVTYNLIKEEGPAHHRLFTSEVILQGEAIAEGKGKTKKESEQRAAESAYKQLK
QIK
>Q9ZBQ7 3.1.26.3~~~rnc~~~Ribonuclease 3~~~COG0571
MRGTVSVPKKAEDAKADPPAKKKADTQASSHTLLEGRLGYQLESALLVRALTHRSYAYENGGLPTNERLEFLGDSVLGLV
VTDTLYRTHPDLPEGQLAKLRAAVVNSRALAEVGRGLELGSFIRLGRGEEGTGGRDKASILADTLEAVIGAVYLDQGLDA
ASELVHRLFDPLIEKSSNLGAGLDWKTSLQELTATEGLGVPEYLVTETGPDHEKTFTAAARVGGVSYGTGTGRSKKEAEQ
QAAESAWRSIRAAADERAKATADAVDADPDEASASA
>P66670 3.1.26.3~~~rnc~~~Ribonuclease 3~~~
MKQLEELLSTSFDIQFNDLTLLETAFTHTSYANEHRLLNVSHNERLEFLGDAVLQLIISEYLFAKYPKKTEGDMSKLRSM
IVREESLAGFSRFCSFDAYIKLGKGEEKSGGRRRDTILGDLFEAFLGALLLDKGIDAVRRFLKQVMIPQVEKGNFERVKD
YKTCLQEFLQTKGDVAIDYQVISEKGPAHAKQFEVSIVVNGAVLSKGLGKSKKLAEQDAAKNALAQLSEV
>Q9X0I6 3.1.26.3~~~rnc~~~Ribonuclease 3~~~COG0571
MNESERKIVEEFQKETGINFKNEELLFRALCHSSYANEQNQAGRKDVESNEKLEFLGDAVLELFVCEILYKKYPEAEVGD
LARVKSAAASEEVLAMVSRKMNLGKFLFLGKGEEKTGGRDRDSILADAFEALLAAIYLDQGYEKIKELFEQEFEFYIEKI
MKGEMLFDYKTALQEIVQSEHKVPPEYILVRTEKNDGDRIFVVEVRVNGKTIATGKGRTKKEAEKEAARIAYEKLLKERS
>P09155 3.1.13.5~~~rnd~~~Ribonuclease D~~~COG0349
MNYQMITTDDALASLCEAVRAFPAIALDTEFVRTRTYYPQLGLIQLFDGEHLALIDPLGITDWSPLKAILRDPSITKFLH
AGSEDLEVFLNVFGELPQPLIDTQILAAFCGRPMSWGFASMVEEYSGVTLDKSESRTDWLARPLTERQCEYAAADVWYLL
PITAKLMVETEASGWLPAALDECRLMQMRRQEVVAPEDAWRDITNAWQLRTRQLACLQLLADWRLRKARERDLAVNFVVR
EEHLWSVARYMPGSLGELDSLGLSGSEIRFHGKTLLALVEKAQTLPEDALPQPMLNLMDMPGYRKAFKAIKSLITDVSET
HKISAELLASRRQINQLLNWHWKLKPQNNLPELISGWRGELMAEALHNLLQEYPQ
>P21513 3.1.26.12~~~rne~~~Ribonuclease E~~~COG1530
MKRMLINATQQEELRVALVDGQRLYDLDIESPGHEQKKANIYKGKITRIEPSLEAAFVDYGAERHGFLPLKEIAREYFPA
NYSAHGRPNIKDVLREGQEVIVQIDKEERGNKGAALTTFISLAGSYLVLMPNNPRAGGISRRIEGDDRTELKEALASLEL
PEGMGLIVRTAGVGKSAEALQWDLSFRLKHWEAIKKAAESRPAPFLIHQESNVIVRAFRDYLRQDIGEILIDNPKVLELA
RQHIAALGRPDFSSKIKLYTGEIPLFSHYQIESQIESAFQREVRLPSGGSIVIDSTEALTAIDINSARATRGGDIEETAF
NTNLEAADEIARQLRLRDLGGLIVIDFIDMTPVRHQRAVENRLREAVRQDRARIQISHISRFGLLEMSRQRLSPSLGESS
HHVCPRCSGTGTVRDNESLSLSILRLIEEEALKENTQEVHAIVPVPIASYLLNEKRSAVNAIETRQDGVRCVIVPNDQME
TPHYHVLRVRKGEETPTLSYMLPKLHEEAMALPSEEEFAERKRPEQPALATFAMPDVPPAPTPAEPAAPVVAPAPKAAPA
TPAAPAQPGLLSRFFGALKALFSGGEETKPTEQPAPKAEAKPERQQDRRKPRQNNRRDRNERRDTRSERTEGSDNREENR
RNRRQAQQQTAETRESRQQAEVTEKARTADEQQAPRRERSRRRNDDKRQAQQEAKALNVEEQSVQETEQEERVRPVQPRR
KQRQLNQKVRYEQSVAEEAVVAPVVEETVAAEPIVQEAPAPRTELVKVPLPVVAQTAPEQQEENNADNRDNGGMPRRSRR
SPRHLRVSGQRRRRYRDERYPTQSPMPLTVACASPELASGKVWIRYPIVRPQDVQVEEQREQEEVHVQPMVTEVPVAAAI
EPVVSAPVVEEVAGVVEAPVQVAEPQPEVVETTHPEVIAAAVTEQPQVITESDVAVAQEVAEQAEPVVEPQEETADIEEV
VETAEVVVAEPEVVAQPAAPVVAEVAAEVETVAAVEPEVTVEHNHATAPMTRAPAPEYVPEAPRHSDWQRPTFAFEGKGA
AGGHTATHHASAAPARPQPVE
>A0R152 3.1.26.12~~~rne~~~Ribonuclease E~~~COG1530
MAEDAHTEDLSTQTPQQEGLPERLRVHSLARVLGTTSRRVLDALAEFDGRQRSAHSTVDKADAERVRAALTESPAAETPP
EEAPAAETPVADLVVVQAEQVEVVTVSEAGPAEPAEPAEPEAPAAEAEAEAETEVADEAETPEPTFRGAVLVGDEPESRL
ILEHANIPPARETQTERPDYLPLFVAPQPVSFEPAVVDDEDEDDDTETGAESDFDSGADSDSDDDQADRPRRRRRGRRGR
GRGRGEQNDDATSDADTDSTEDQTDGDEQESGEDSDDSGDEDSTTTEGGTRRRRRRRRRKSGSGDSDDAVSPDDPPNTVV
HERAPRTERSDKSDDSEIQGISGSTRLEAKRQRRRDGRDAGRRRPPILSEAEFLARREAVERTMIVRDKVRTEPPHEGAR
YTQIAVLEDGVVVEHFVTSAASASLVGNIYLGIVQNVLPSMEAAFVDIGRGRNGVLYAGEVNWEAAGLGGQNRKIEQALK
PGDYVVVQVSKDPVGHKGARLTTQVSLAGRYLVYVPGASSTGISRKLPDTERQRLKEILREVVPSDAGVIIRTASEGVKE
EDIRSDVERLQKRWSEIEAKAAEVTEKKAGAAVALYEEPDVLVKVIRDLFNEDFSSLIVSGDEAWNTINSYVEAVAPDLM
PRLTKYEPAGPDAPDVFAVHRIDEQLAKAMDRKVWLPSGGTLVIDRTEAMTVVDVNTGKFTGSGGNLEQTVTRNNLEAAE
EIVRQLRLRDIGGIVVIDFIDMVLESNRDLVLRRLTEALARDRTRHQVSEVTSLGLVQLTRKRLGTGLVEAFSTACTHCG
GRGIVLHGDPIDSASSNGGRKSDSSGGGGSGGGRRGKRGKKGAARTEEVHVAKVPDHTPGEHPMFKAMAAANGKHEGDED
HEDHEDHETAEDTTAAEVRDDTRDEHDADERAHVVTAAVGAAGDEDLDDSDEDSDLDSDEESDDESDEDEIELDDDEDEL
DEDIEVIGDSDDSDDSDDSDEDDDSDDSDDDSDEDEDSDSDEDEEPVREVYEPPVTAPRARVRRRAAARPAGPPSHD
>P71905 3.1.26.12~~~rne~~~Ribonuclease E~~~COG1530
MIDGAPPSDPPEPSQHEELPDRLRVHSLARTLGTTSRRVLDALTALDGRVRSAHSTVDRVDAVRVRDLLATHLETAGVLA
ASVHAPEASEEPESRLMLETQETRNADVERPHYMPLFVAPQPIPEPLADDEDVDDGPDYVADDSDADDEGQLDRPANRRR
RRGRRGRGRGRGEQGGSDGDPVDQQSEPRAQQFTSADAAETDDGDDRDSEDTEAGDNGEDENGSLEAGNRRRRRRRRRKS
ASGDDNDAALEGPLPDDPPNTVVHERVPRAGDKAGNSQDGGSGSTEIKGIDGSTRLEAKRQRRRDGRDAGRRRPPVLSEA
EFLARREAVERVMVVRDRVRTEPPLPGTRYTQIAVLEDGIVVEHFVTSAASASLVGNIYLGIVQNVLPSMEAAFVDIGRG
RNGVLYAGEVNWDAAGLGGADRKIEQALKPGDYVVVQVSKDPVGHKGARLTTQVSLAGRFLVYVPGASSTGISRKLPDTE
RQRLKEILREVVPSDAGVIIRTASEGVKEDDIRADVARLRERWEQIEAKAQETKEKAAGAAVALYEEPDVLVKVIRDLFN
EDFVGLIVSGDEAWNTINEYVNSVAPELVSKLTKYESADGPDGQSAPDVFTVHRIDEQLAKAMDRKVWLPSGGTLVIDRT
EAMTVIDVNTGKFTGAGGNLEQTVTKNNLEAAEEIVRQLRLRDIGGIVVIDFIDMVLESNRDLVLRRLTESLARDRTRHQ
VSEVTSLGLVQLTRKRLGTGLIEAFSTSCPNCSGRGILLHADPVDSAAATGRKSEPGARRGKRSKKSRSEESSDRSMVAK
VPVHAPGEHPMFKAMAAGLSSLAGRGDEESGEPAAELAEQAGDQPPTDLDDTAQADFEDTEDTDEDEDELDADEDLEDLD
DEDLDEDLDVEDSDSDDEDSDEDAADADVDEEDAAGLDGSPGEVDVPGVTELAPTRPRRRVAGRPAGPPIRLD
>Q8YP69 3.1.26.12~~~rne~~~Ribonuclease E~~~COG1530
MPKQIIIAEQHQIAAVFSEDQIQELVVATGHHQIGDIYLGVVENVLPGIDAAFVNIGDPERNGFIHVTDLGPLRLKRTAA
AITELLAPQQKVLVQVMKEPTGTKGPRLTGNITLPGRYVVLMPYGRGVNLSRRIKSESERNRLRALAILIKPAGMGLLVR
TEAEGKPEEAIIEDLEVLQKQWEAIQQEAQSTRAPALLNRDDDFIQRVLRDMYGADVNRIVVDSSTGLKRVKQYLQNWSG
GQTPQGLLIDHHRDRSPILEYFRINAAIREALKPRVDLPSGGYIIIEPTEALTVIDVNSGSFTRSATARETVLWTNCEAA
TEIARQLRLRNIAGVIVVDFIDMESRRDQLQVLEHFNKALRADKARPQIAQLTELGLVELTRKRQGQNIYELFGDTCPAC
GGLGHTVRLPGETENRLPTPAAEVPERFVSLPTREPRLPTARTTEPRETYDGFGEAFENDSDLGALNLINHPSYQELNDN
NKRRARTRRSRIGINGTNGKDEQRITANPLAFISESDLDLDGDVELSAPPELPTPNLGKSGWIERAERTKVIKTEPVKPV
VEPPEIRTVEMTPEEQDIFALMGISPLIKLEQEVKNPKSVIINIVQPGQTPTIPTEITPEPVAKVTPSVEVNTPKVKLES
KSVSVAATEPIKLTETMEESEVNAASTANRRRRRRSSASDSDTGEDS
>P72656 3.1.26.12~~~rne~~~Ribonuclease E~~~COG1530
MPKQIVIAEKHQVAAVFWKDQIQELVVSTGSQQVGDIYLGLVDNILPSIDAAFINIGDTEKNGFIHVSDLGPVRLRRTAG
SISELLSPQQRVLVQVMKEPTGNKGPRLTGNISMPGRYMVLMPYGRGVNLSRRINREEERSRLRALAVLIKPPGMGLLVR
TEAEDVPEDAIIEDLENLQKQWELVQQQAMTRSAPMLLDRDDDFIKRVLRDMYSSEVNRIVVDTPAGMKRIKQQLMNWDQ
GRLPEGVLIDCHRESLSILEYFRVNATIREALKPRVDLPSGGYIIIEPTEALTVIDVNSGSFTHSANSRETVLWTNYEAA
TEIARQLKLRNIGGVIIIDFIDMDSHKDQLQLLEHFNRCLETDKARPQIAQLTELGLVELTRKRQGQNLYELFGQPCPEC
GGLGHLVELPGEKGFVSLSPTAVNSSIPPRLVEKPILSPPVAKVNDLPKKEEAKISSPLDLLFHPNYQEQGDRDSNRRRR
RRRGSEFSEKENIKSVGISRSKGPSPSPTKEKVTGTAPPRRERPSRRVEKTLVPVDVAMTTLEQDIYARMGISPLIKTEY
ADQDPRSFMVSVVTAGAALEGNTNGSGSLVNAVITTVDNGDNGDNVPSDGLTIVSEVTAPTPVIEQPREETVEPEQVVLP
QLDDETPAAPVAEESAPIETKKRPGRRRRRSSAE
>H6LC28 7.2.1.2~~~rnfA~~~Na(+)-translocating ferredoxin:NAD(+) oxidoreductase complex subunit A~~~COG4657
MTLIFIMISAIFVNNFVLSRFLGICPFLGVSKQVETAVGMGVAVTFVMALASAITYVVQYAILDPLSLGYLQTIAFILII
AALVQLVEMIIKKSSPSLYQALGVYLPLITTNCAVLGVALINIQNEYNFIETIFNGVGAALGFTLAIVLFAGIRERLETS
AVPKALEGFPIALLTAGLMAIAFLGFSGMKLG
>P0CZ13 7.-.-.-~~~rnfA~~~Ion-translocating oxidoreductase complex subunit A~~~
MQDFLLVLLSTALVNNVVLVKFLGLCPFMGVSRKTDAAIGMGLATTFVITVASAACWLVEALILEPLDLKFLRILSMILV
IAAIVQFIETVMRKVTPDLHKALGIYLPLITTNCAVLGLPLMYIQGHLSLAMSTLSGFGASVGFTLVLVIFAGMRERLAQ
LSVPAAFAGTPIAFVSAGLLGLAFMGFAGLVHV
>D5ARY9 7.-.-.-~~~rnfA~~~Ion-translocating oxidoreductase complex subunit A~~~COG4657
MQDFLLVLLSTALVNNVVLVKFLGLCPFMGVSRKTDAAIGMGLATTFVITVASAACWLVEALILEPLDLKFLRILSMILV
IAAIVQFIETVMRKVTPDLHKALGIYLPLITTNCAVLGLPLMYIQGHLSLAMSTLSGFGASVGFTLVLVIFAGMRERLAQ
LSVPAAFAGTPIAFVSAGLLGLAFMGFAGLVHV
>A0A0H3AKU6 7.-.-.-~~~rnfA~~~Ion-translocating oxidoreductase complex subunit A~~~COG4657
MLLLWQSRIMPGSEANIYITMTEYLLLLIGTVLVNNFVLVKFLGLCPFMGVSKKLETAIGMGLATTFVLTLASVCAYLVE
SYVLRPLGIEYLRTMSFILVIAVVVQFTEMVVHKTSPTLYRLLGIFLPLITTNCAVLGVALLNINENHNFIQSIIYGFGA
AVGFSLVLILFASMRERIHVADVPAPFKGASIAMITAGLMSLAFMGFTGLVKL
>H6LC27 7.2.1.2~~~rnfB~~~Na(+)-translocating ferredoxin:NAD(+) oxidoreductase complex subunit B~~~COG1245
MLNAILVPVGILGVFGLIFGIGLAIAAKVFEVYEDPRVPLVRAALPGANCGGCGLPGCDALAANIVGGSAAIDACPVGGA
SCAAAVAEIMGMEAGSAVKKVATVICQGTCETAPNRAEYYGEMDCREAMIASGGSKGCRYGCLGYGTCKAVCPFDAIVIG
EDGLPKVDPEKCTSCGKCVEACPKSIMTLVPEAQEVIVKCHNFDKGKIARLSCTTACIACGACVKACRFDAITVENNCAK
IDYDKCRQCYECVDKCPMNCISGDVEYGKSTAYIIEENCIACGLCAKNCPVNAITGEIKKPPYVIDHDMCIGCGICFDKC
RKSAIEMRPNKTK
>P0CZ14 7.-.-.-~~~rnfB~~~Ion-translocating oxidoreductase complex subunit B~~~
MIAAAASMSALGLGLGYLLGAAARKFHVETPPIVEEIAKILPGTNCGACGFPGCNGLAEAMAEGNAPVTACTPGGRDVAL
ALAEIVTVEAGADAGPIAEIEPMVAFVFEDHCTGCQKCFKRCPTDAIVGGAKQIHTVVMDACIGCDACIEVCPTEAIVSR
VKPKTLKTWYWDKPQPGLVAASAETAA
>A5F2R3 7.-.-.-~~~rnfB~~~Ion-translocating oxidoreductase complex subunit B~~~COG2878
MSTIVIAVIALAALAAVFGAILGFASIRFKVEADPIVDQIDAILPQTQCGQCGYPGCRPYAEAIANGDAINKCPPGGQAT
IEKLADLMGVEVQDSAHDLDNKVKMVAFIHEDMCIGCTKCIQACPVDAIVGGNKAVHTVIKNECTGCDLCVAPCPTDCIE
MIPVQTTPESWKWQLNAIPVVNVTDSAPAAQKSAN
>H6LC32 7.2.1.2~~~rnfC~~~Na(+)-translocating ferredoxin:NAD(+) oxidoreductase complex subunit C~~~COG4656
MNVKHGTFKGGIHPPYRKESTAEVPLGFGKKPEMVIIPMSLHIGAPCTPIVKKGDTVFLGQRVGEPNGFVSVPVHASVSG
KVIAVEERPHASGDRVMSVVIESDGLDTIDPSIKPYGTLEDMDADAIKKMVLNAGIVGLGGATFPTHVKLAIPPDKKVDC
VVLNGAECEPYLTADHHLMTSQAEKVVMGLKLAMKSVGVEKGFIGVEDNKTDAIEALVKAIGNDSRLEVYSLHTKYPQGA
EKQLIAAITGREVPSGALPADAGVVVMNVGTAAQIAESMITGLPLYKRYLTCTGDAIKNPQTIEIRIGVPFQSVIDQCGG
FSSEPGKVISGGPMMGVTQFVTDIPVMKGTSGILCLTKESAKIATPSNCIHCGKCVGVCPIHLQPLNIAEYSQRNMWDKC
ESNNAMDCIECGSCSYICPAKRTLVSSIRVAKREIIAQRRKGN
>D8GR66 7.1.1.-~~~rnfC~~~Proton-translocating ferredoxin:NAD(+) oxidoreductase complex subunit C~~~COG4656
MLKSFRGGVHPDDSKKYTANKPIEIAPIPDKVFIPVRQHIGAPTSPVVQKGDEVKKGQLIAKSDAFVSANIYASTSGKVV
DIGDYPHPGFGKCQAIVIEKDGKDEWVEGIPTSRNWKELSVKEMLGIIREAGIVGMGGATFPVHVKLAPPPDKKVDVFIL
NGAECEPYLTADYRSMLENSDKVVAGVQIIMKILNVEKAFVGIEDNKPKAIEAMKKAFEGTKVQVVGLPTKYPQGAEKML
INVLTGREVPSGGLPADVGAVVQNVGTCIAISDAVERGIPLIQRVTTISGGAIKEPKNILVRIGTTFKDAIDFCGGFKEE
PVKIISGGPMMGFAQSNLDIPIMKGSSGILGLTKNDVNDGKESSCIRCGRCLKACPMHLNPSMLSILGQKDLYQEAKEEY
NLLDCVECGSCVYTCPAKRKIVQYIRYLKSENRAAGEREKAKAAKAKEKKEKEEVLK
>Q52716 7.-.-.-~~~rnfC~~~Ion-translocating oxidoreductase complex subunit C~~~
MRLPSIATLFHPLQSFSIRGGIHPETHKHLTSECEIETMPMPALIRLPLQQHIGAEAEPIVKRDDLVLKGQLIAKARGPL
SANIHAPTSGRVIAVGHFVAPHASGLPVPTITIRPDGEDKWGPHLPRLRPENAAPEEIAAQVAAAGIVGMGGATFPSAVK
LNLRAKYDLTTLIINGAECEPYLTCDDRLMRERAEEIADGIGIMARALGVKQVFVAIESNKPQAIEAMTRYNRALGYTFK
IHVVPTQYPMGSEKHLVKMITGQETPARALTADLGVVVHNIATAHAVHLAVRYGEPLIARTVTVSGHGIRRPANLRVLIG
TPVSEIIAHCGGFTEEPDRLLLGGPMMGMPIQNPRVPVVKGTNGILALTAAETPEAKTMPCIRCGRCVQGCPVGLTPFEL
NARIHAGDLEGAAKVGLMDCLACGCCSYNCPANLPLVQSFQFAKGKLSERQSRKHQQEETKRLAAARKAREEAIAEAKKQ
MMLKRKAEMAAKKKAEEAAAAAAMPPPATATAIQGEATP
>H6LC31 7.2.1.2~~~rnfD~~~Na(+)-translocating ferredoxin:NAD(+) oxidoreductase complex subunit D~~~COG4658
MNELNLTVSSSPHIRAKHSTASIMQNVIIALLPALAVAGYVFGLWALALVAICVISSVATEAVIQKLLKKPITVNDWSAV
VTGVLLAFNLPINAPWWIGVVGSVFAIAIVKQCFGGLGQNFINPALAARAFLLASWPGHMTSTAYIPLTDTVTTATPLAL
LKAGETGSMPSTLDLFTGLNGVYGCIGEISALALLIGGLYLIYKGIISWRIPTIYLLTIAIFALLVGQDPIVHMVSGGVM
LGAFFMATDYASSPVTAKGQIIYAIGCGLITMIIRLYGGYPEGCSYSILLMNVATPLIERFTKERIYGVTKIKKEAKA
>D8GR67 7.1.1.-~~~rnfD~~~Proton-translocating ferredoxin:NAD(+) oxidoreductase complex subunit D~~~COG4658
MAEAQIKKNIFTISSSPHVRCDESVSKIMWSVCLALTPAAVFGVFNFGIHALEVIITGIIAAVVTEYFVEKVRNKPITIT
DGSAFLTGLLLSMCLPPDIPPYMVAIGSFIAIAIAKHSMGGLGQNIFNPAHIGRAALMVSWPVAMTTWSKLSASGVDAVT
TATPLGILKLQGYSKLLETFGGQGALYKAMFLGTRNGSIGETSTILLVLGGLYLIYKKYINWQIPVVMIGTVGILTWAFG
GTTGIFTGDPVFHMMAGGLVIGAFFMATDMVTIPMTIKGQIIFALGAGALTSLIRLKGGYPEGVCYSILLMNAVTPLIDK
FTQPVKFGTRR
>Q52715 7.-.-.-~~~rnfD~~~Ion-translocating oxidoreductase complex subunit D~~~
MHMPVAGPHTHTLFTVSRTMLTVVAALTPATLFGLWEFGWPAIFLFLTTVVSAWVFEVACLKIADKPIRPFATDGSAILS
GWLVAMTLPPYAPWWVGVIGSFIAIVIAKHLFGGLGQNLFNPAMVARAMLVVALPVQMTTWIAPVGLLEGPSFLHGLSIT
FLGSQEFDAVSSASTLSHIKSQISAGATMGEILPTLADLESRLLGFVPGSLGETSTVLLALGGLLLLVTRIITPTIPLAV
LGTLVTLSAICSFLAPDRFAPPILHLTSGSTMLCAFFIATDYVTSPVTTAGKWVYGIGIGTLVFVIPLRRLPRGRGLCGA
LDELGHAPDRNLHPAADLRTVPHRQAACAAKPAKGAQK
>A5F2R1 7.-.-.-~~~rnfD~~~Ion-translocating oxidoreductase complex subunit D~~~COG4658
MAFFIASSPHLRSKRSTADVMRWVLVCALPGLIAQTYFFGYGTLIQLLLAISVAVALEAGIMLLRKRSPISALRDYSAVV
TAWLLAVAIPPLSPWWVVVIGLIFAIVIAKHLYGGLGQNPFNPAMIAYVVLLISFPVQMTSWMAPIKLTAEPSSLVDSFS
LIFGGFDSDGLSLQQIRTGIDGITMATPLDAIKTSLKAGHTMSETLTQPQFSGFAGIGWEWVNIAYLLGGLILLKLRIIR
WHIPVAMLAGLVFTALLAQLFAPGTTASPMIHLLSGATMLGAFFIATDPVSASTTDKGRLIYGFFIGAMVFLIRSWGGFP
DGVAFAVLLANMCVPLIDYYTKPRTYGH
>H6LC29 7.2.1.2~~~rnfE~~~Na(+)-translocating ferredoxin:NAD(+) oxidoreductase complex subunit E~~~COG4660
MNFMKNLTRGIIRENPTFVLVLGMCPTLAVTTSAINGMGMGLATMLVLIGSNVAISALRKVIPDNIRIPAFVVVIASFVT
IVGMLMKAYVPALDAALGIFIPLIVVNCIILARAEAFAFSNGIADSFADAVGMGLGFTLALTILGSIREILGAGSIFGFS
LFGAAYEPVLLMILPPGAFLTLGLLIGLINWKTKKA
>P97055 7.-.-.-~~~rnfE~~~Ion-translocating oxidoreductase complex subunit E~~~
MSESYAKIARDGLWDKNIVTGQMLALCPTLAITGTATNGLGMGLATTVVLILSNVVISALRKTIAPEIRIPAFILIIAAI
VTVVDLALNAWLHDLHKVLGLFIALIVTNCAILGRAEAFASRFGVLASALDGLMMGIGFTLALVVVGAIREILGSGTLFA
QASLLLGPHFAFMELQIFPDYPGFLIMILPPGGFLVVGGLFALKRIIDARKPTIEQEIKQMRTERVFTAAGVLKPKLETG
EEA
>A5F2S7 7.-.-.-~~~rnfE~~~Ion-translocating oxidoreductase complex subunit E~~~COG4660
MSENRTLMLNGMWNNNPALVQLLGLCPLLAVSSTVTNALGLGIATLLVLVGSNVTVSLVRDYVPKEVRIPVFVMIIASLV
TCVQLLMNAYAYGLYLSLGIFIPLIVTNCIIIGRAEAFASKNDVLPAALDGFWMGLGMTSVLVVLGSLREIIGNGTLFDG
ADLLLGEWAKVLRIEVFHFDSAFLLALLPPGAFIGVGFLIAAKSVIDKQIAARQPKQQKQAIERARVTNV
>H6LC30 7.2.1.2~~~rnfG~~~Na(+)-translocating ferredoxin:NAD(+) oxidoreductase complex subunit G~~~COG4659
METKEKVQIDWKVVFKLGLILFVISAVAACALALTNYVTAGTIEEMNVQTNTVARQEVLPKAADFEAVPAKDVEKIASEI
GMEKPEELLEVYIGKSNGEVVGYTVKTGPTSGYAGEVQVLTGISADGVITGITIIKSNETPGLGAKASGVWNDQFTGKSA
KEELVVVKGTTKEGSNEIQAITGSTITSKAVTSGVNMSIQVYQNLSK
>P97054 7.-.-.-~~~rnfG~~~Ion-translocating oxidoreductase complex subunit G~~~
MTDTPPPEKPKLPWFKASPLAHGIMLAMFALVTAVLLAVANDSTSAPIAARGAEDLAASLEQVIPHDLHDNDLAAAMRPV
SDAEEGTIKVYVATKAGAVTGLAYELSGPGYSGQIRVLLGIAPDGTLLGVRVLSHTETPGLGDKIEVAKDDWILGFAGKS
LADPEPGHWKVKRDGGVFDQFSGATITPRAVVKTIYRGLMFFDRNKAALTAPLPPKS
>A5F2S8 7.-.-.-~~~rnfG~~~Ion-translocating oxidoreductase complex subunit G~~~COG4659
MLTAIRKNGLILAVFACVSTGLVALTYALTAEQIQQQEQKQLLQVLNQVIPHKYHDNPLAQACTLVNDDKLGTAKPMHAY
LAQRDGQPTAIAIETIAPDGYNGEIKLIVGIANNGTVLGVRVLAHQETPGLGDKIDLRISNWVLGFNGQQVTADNQDDWK
VRKDGGQFDQFTGATITPRAVVLAVKKAVEYVNQHQQQLHNQPNPCEGQ
>Q4QNE7 ~~~rnfH~~~Protein RnfH~~~
MNQINIEIAYAFPERYYLKSFQVDEGITVQTAITQSGILSQFPEIDLSTNKIGIFSRPIKLTDVLKEGDRIEIYRPLLAD
PKEIRRKRAAEQAAAKDKEKGA
>P0A9J0 3.1.26.-~~~rng~~~Ribonuclease G~~~COG1530
MTAELLVNVTPSETRVAYIDGGILQEIHIEREARRGIVGNIYKGRVSRVLPGMQAAFVDIGLDKAAFLHASDIMPHTECV
AGEEQKQFTVRDISELVRQGQDLMVQVVKDPLGTKGARLTTDITLPSRYLVFMPGASHVGVSQRIESESERERLKKVVAE
YCDEQGGFIIRTAAEGVGEAELASDAAYLKRVWTKVMERKKRPQTRYQLYGELALAQRVLRDFADAELDRIRVDSRLTYE
ALLEFTSEYIPEMTSKLEHYTGRQPIFDLFDVENEIQRALERKVELKSGGYLIIDQTEAMTTVDINTGAFVGHRNLDDTI
FNTNIEATQAIARQLRLRNLGGIIIIDFIDMNNEDHRRRVLHSLEQALSKDRVKTSVNGFSALGLVEMTRKRTRESIEHV
LCNECPTCHGRGTVKTVETVCYEIMREIVRVHHAYDSDRFLVYASPAVAEALKGEESHSLAEVEIFVGKQVKVQIEPLYN
QEQFDVVMM
>A0A0H3NGK0 3.1.26.-~~~rng~~~Ribonuclease G~~~
MTAELLVNVTPSETRVAYIDGGILQEIHIEREARRGIVGNIYKGRVSRVLPGMQAAFVDIGLDKAAFLHASDIMPHTECV
AGDEQKQFTVRDISELVRQGQDLMVQVVKDPLGTKGARLTTDITLPSRYLVFMPGASHVGVSQRIESESERERLKKVVAE
YCDEQGGFIIRTAAEGVCEEDLASDAAYLKRVWTKVMERKKRPQTRYQMYGELALAQRVLRDFADAQLDRIRVDSRLTYE
SLLEFTAEYIPEMTSKLEHYSGHQPIFDLYDVENEIQRALERKVELKSGGYLIIDQTEAMTTVDINTGAFVGHRNLDDTI
FNTNIEATQAIARQLRLRNLGGIIIIDFIDMNNEDHRRRVLHSLEQALSKDRVKTSINGFSPLGLVEMTRKRTRESVEHV
LCNECPTCHGRGTVKTVETVCYEIMREIVRVHHAYDSDRFLVYASPAVAEALKGEESHALAEVEIFVGKQVKVQVEPLYN
QEQFDVVMM
>Q9KEI9 3.1.26.4~~~rnhA~~~Ribonuclease H~~~COG0328
MAKSKYYVVWNGRKPGIYTSWSACEAQVKGYTGAKFKSYPSKEEAEAAFRGEEATPKLAKEEIIWESLSVDVGSQGNPGI
VEYKGVDTKTGEVLFEREPIPIGTNNMGEFLAIVHGLRYLKERNSRKPIYSDSQTAIKWVKDKKAKSTLVRNEETALIWK
LVDEAEEWLNTHTYETPILKWQTDKWGEIKADYGRK
>O31744 3.1.26.4~~~rnhB~~~Ribonuclease HII~~~COG0164
MNTLTVKDIKDRLQEVKDAQDPFIAQCENDPRKSVQTLVEQWLKKQAKEKALKEQWVNMTSYERLARNKGFRLIAGVDEV
GRGPLAGPVVASAVILPEECEILGLTDSKKLSEKKREEYYELIMKEALAVGIGIVEATVIDEINIYEASKMAMVKAIQDL
SDTPDYLLVDAMTLPLDTAQASIIKGDAKSVSIAAGACIAKVTRDRMMSAYAETYPMYGFEKNKGYGTKEHLEALAAYGP
TELHRKTFAPVQSFR
>P10442 3.1.26.4~~~rnhB~~~Ribonuclease HII~~~COG0164
MIEFVYPHTQLVAGVDEVGRGPLVGAVVTAAVILDPARPIAGLNDSKKLSEKRRLALYEEIKEKALSWSLGRAEPHEIDE
LNILHATMLAMQRAVAGLHIAPEYVLIDGNRCPKLPMPAMAVVKGDSRVPEISAASILAKVTRDAEMAALDIVFPQYGFA
QHKGYPTAFHLEKLAEHGATEHHRRSFGPVKRALGLAS
>P9WH01 3.1.26.4~~~rnhB~~~Ribonuclease HII~~~COG0164
MTKTWPPRTVIRKSGGLRGMRTLESALHRGGLGPVAGVDEVGRGACAGPLVVAACVLGPGRIASLAALDDSKKLSEQARE
KLFPLICRYAVAYHVVFIPSAEVDRRGVHVANIEGMRRAVAGLAVRPGYVLSDGFRVPGLPMPSLPVIGGDAAAACIAAA
SVLAKVSRDRVMVALDADHPGYGFAEHKGYSTPAHSRALARLGPCPQHRYSFINVRRVASGSNTAEVADGQPDPRDGTAQ
TGEGRWSKSSHPATMRATGRAQGT
>Q2FZ38 3.1.26.4~~~rnhB~~~Ribonuclease HII~~~COG0164
MTLTIKEVTQLINAVNTIEELENHECFLDERKGVQNAIARRRKALEKEQALKEKYVEMTYFENEILKEHPNAIICGIDEV
GRGPLAGPVVACATILNSNHNYLGLDDSKKVPVTKRLELNEALKNEVTAFAYGIATAEEIDEFNIYKATQIAMQRAIDGL
SVQPTHLLIDAMTLDNALPQVSLIKGDARSVSIAAASIMAKVFRDDYMTQLSKDYPEYGFEKNAGYGTKQHLLAIDDIGI
MKEHRKSFEPIKSLL
>Q9X017 3.1.26.4~~~rnhB~~~Ribonuclease HII~~~COG0164
MGIDELYKKEFGIVAGVDEAGRGCLAGPVVAAAVVLEKEIEGINDSKQLSPAKRERLLDEIMEKAAVGIGIASPEEIDLY
NIFNATKLAMNRALENLSVKPSFVLVDGKGIELSVPGTCLVKGDQKSKLIGAASIVAKVFRDRLMSEFHRMYPQFSFHKH
KGYATKEHLNEIRKNGVLPIHRLSFEPVLELLTDDLLREFFEKGLISENRFERILNLLGARKSVVFRKERTNHNLPLF
>O67644 3.1.26.4~~~rnhC~~~Ribonuclease HIII~~~COG1039
MPSLKISPSEAEKIQNYLVSSGFRKINAPYTLWALEGNGVKVYYYKTGSLLIQGKNSEKVLKEVLNLLEKKKLPGCDESG
KGDIFGSLVLCCVCIPEENYLKVSSLNPRDTKRLSDKRVERLYLALKPLVKAYCYEIKPEEYNKLYRKFRNLNKMMTHFY
KLLIERVKEECGVSEVVVDKYQPSNPFGEDVIFETEAERNLAVAVASIFARYKFLQSLKEVERELGIKIPKGTSKEVKEL
AKSLKNPERFIKLNFNV
>P94541 3.1.26.4~~~rnhC~~~Ribonuclease HIII~~~COG1039
MSHSVIKVSLSAIDQMKMTYSGSLTASVPQGAVFQAKPPGCTITAYQSGKVLFQGKNAAAESARWGTAEPQEKKKTAKKP
ADPRYAPPADIAGMSVIGSDEVGTGDYFGPMTVVCAYVDKTMLPLMKELGVKDSKDLKDPQIIEIARNLIKTIPYSLLVL
KNEKYNSMQEKGMSQGKMKALLHNQAITHLLRKLDGVKPEAILIDQFAEPGVYFNHLKGRDIVKERTYFSTKAEGIHLAV
AAASIIARYSFLMEMDKLSRAAGMTLPKGAGPHVDEAAAKLILKKGASALRTFTKLHFANTQKAQRLADKKRS
>O07874 3.1.26.4~~~rnhC~~~Ribonuclease HIII~~~COG1039
MASITLTPSEKDIQAFLEHYQTSLAPSKNPYIRYFLKLPQATVSIYTSGKILLQGEGAEKYASFFGYQAVEQTSGQNLPL
IGTDEVGNGSYFGGLAVVAAFVTPDQHDFLRKLGVGDSKTLTDQKIRQITPILKEKIQHQALLLSPSKYNEVIGDRYNAV
SVKVALHNQAIYLLLQKGVQPEKIVIDAFTSAKNYDKYLAQEANRFSNPISLEEKAEGKYLSVAVSSVIARDLFLENLEN
LGRELGYQLPSGAGTASDKVASQILQAYGMQGLSFCAKLHFKNTEKAKKRLER
>P54162 ~~~rnhA~~~14.7 kDa ribonuclease H-like protein~~~COG0328
MPTEIYVDGASAGNPGPSGIGIFIKHEGKAESFSIPIGVHTNQEAEFLALIEGMKLCATRGYQSVSFRTDSDIVERATEL
EMVKNITFQPFVEEIIRLKAAFPLFFIKWIPGKQNQKADLLAKEAIRLNEKN
>P9WLH5 ~~~~~~Bifunctional protein Rv2228c~~~COG0328
MKVVIEADGGSRGNPGPAGYGAVVWTADHSTVLAESKQAIGRATNNVAEYRGLIAGLDDAVKLGATEAAVLMDSKLVVEQ
MSGRWKVKHPDLLKLYVQAQALASQFRRINYEWVPRARNTYADRLANDAMDAAAQSAAADADPAKIVATESPTSPGWTGA
RGTPTRLLLLRHGQTELSEQRRYSGRGNPGLNEVGWRQVGAAAGYLARRGGIAAVVSSPLQRAYDTAVTAARALALDVVV
DDDLVETDFGAWEGLTFAEAAERDPELHRRWLQDTSITPPGGESFDDVLRRVRRGRDRIIVGYEGATVLVVSHVTPIKML
LRLALDAGSGVLYRLHLDLASLSIAEFYADGASSVRLVNQTGYL
>P0A7Y4 3.1.26.4~~~rnhA~~~Ribonuclease HI~~~COG0328
MLKQVEIFTDGSCLGNPGPGGYGAILRYRGREKTFSAGYTRTTNNRMELMAAIVALEALKEHCEVILSTDSQYVRQGITQ
WIHNWKKRGWKTADKKPVKNVDLWQRLDAALGQHQIKWEWVKGHAGHPENERCDELARAAAMNPTLEDTGYQVEV
>Q8EE30 3.1.26.4~~~rnhA~~~Ribonuclease HI~~~COG0328
MTELKLIHIFTDGSCLGNPGPGGYGIVMNYKGHTKEMSDGFSLTTNNRMELLAPIVALEALKEPCKIILTSDSQYMRQGI
MTWIHGWKKKGWMTSNRTPVKNVDLWKRLDKAAQLHQIDWRWVKGHAGHAENERCDQLARAAAEANPTQIDTGYQAES
>P29253 3.1.26.4~~~rnhA~~~Ribonuclease H~~~COG0328
MNPSPRKRVALFTDGACLGNPGPGGWAALLRFHAHEKLLSGGEACTTNNRMELKAAIEGLKALKEPCEVDLYTDSHYLKK
AFTEGWLEGWRKRGWRTAEGKPVKNRDLWEALLLAMAPHRVRFHFVKGHTGHPENERVDREARRQAQSQAKTPCPPRAPT
LFHEEA
>Q07465 3.1.27.-~~~~~~Ribonuclease~~~
MKKIVVLLGMLLAPWFSSAVQAKGEAGEFDYYAMALSWSPEHCAIKPADRDQCSRQLGFVLHGLWPQYQRGYPSSCTRER
LDPAMEQEFAGLYPSRFLYRHEWEKHGTCSGLSQHDFHQLASDLRQKREDPGRLSVSCRAAAQKPLPAQGGSGQCQRLAG
PGQHHGGLRRRWRFLREVYICLNKEGTDAVTCSDEMQKRELPSCGQPDFLLRTVR
>P21338 4.6.1.21~~~rna~~~Ribonuclease I~~~COG3719
MKAFWRNAALLAVSLLPFSSANALALQAKQYGDFDRYVLALSWQTGFCQSQHDRNRNERDECRLQTETTNKADFLTVHGL
WPGLPKSVAARGVDERRWMRFGCATRPIPNLPEARASRMCSSPETGLSLETAAKLSEVMPGAGGRSCLERYEYAKHGACF
GFDPDAYFGTMVRLNQEIKESEAGKFLADNYGKTVSRRDFDAAFAKSWGKENVKAVKLTCQGNPAYLTEIQISIKADAIN
APLSANSFLPQPHPGNCGKTFVIDKAGY
>Q45493 3.1.-.-~~~rnjA~~~Ribonuclease J1~~~COG0595
MKFVKNDQTAVFALGGLGEIGKNTYAVQFQDEIVLIDAGIKFPEDELLGIDYVIPDYTYLVKNEDKIKGLFITHGHEDHI
GGIPYLLRQVNIPVYGGKLAIGLLRNKLEEHGLLRQTKLNIIGEDDIVKFRKTAVSFFRTTHSIPDSYGIVVKTPPGNIV
HTGDFKFDFTPVGEPANLTKMAEIGKEGVLCLLSDSTNSENPEFTMSERRVGESIHDIFRKVDGRIIFATFASNIHRLQQ
VIEAAVQNGRKVAVFGRSMESAIEIGQTLGYINCPKNTFIEHNEINRMPANKVTILCTGSQGEPMAALSRIANGTHRQIS
INPGDTVVFSSSPIPGNTISVSRTINQLYRAGAEVIHGPLNDIHTSGHGGQEEQKLMLRLIKPKFFMPIHGEYRMQKMHV
KLATDCGIPEENCFIMDNGEVLALKGDEASVAGKIPSGSVYIDGSGIGDIGNIVLRDRRILSEEGLVIVVVSIDMDDFKI
SAGPDLISRGFVYMRESGDLINDAQELISNHLQKVMERKTTQWSEIKNEITDTLAPFLYEKTKRRPMILPIIMEV
>Q2FZG9 3.1.-.-~~~rnj1~~~Ribonuclease J 1~~~COG0595
MKQLHPNEVGVYALGGLGEIGKNTYAVEYKDEIVIIDAGIKFPDDNLLGIDYVIPDYTYLVQNQDKIVGLFITHGHEDHI
GGVPFLLKQLNIPIYGGPLALGLIRNKLEEHHLLRTAKLNEINEDSVIKSKHFTISFYLTTHSIPETYGVIVDTPEGKVV
HTGDFKFDFTPVGKPANIAKMAQLGEEGVLCLLSDSTNSLVPDFTLSEREVGQNVDKIFRNCKGRIIFATFASNIYRVQQ
AVEAAIKNNRKIVTFGRSMENNIKIGMELGYIKAPPETFIEPNKINTVPKHELLILCTGSQGEPMAALSRIANGTHKQIK
IIPEDTVVFSSSPIPGNTKSINRTINSLYKAGADVIHSKISNIHTSGHGSQGDQQLMLRLIKPKYFLPIHGEYRMLKAHG
ETGVECGVEEDNVFIFDIGDVLALTHDSARKAGRIPSGNVLVDGSGIGDIGNVVIRDRKLLSEEGLVIVVVSIDFNTNKL
LSGPDIISRGFVYMRESGQLIYDAQRKIKTDVISKLNQNKDIQWHQIKSSIIETLQPYLFEKTARKPMILPVIMKVNEQK
ESNNK
>Q7A682 3.1.-.-~~~rnj1~~~Ribonuclease J 1~~~
MKQLHPNEVGVYALGGLGEIGKNTYAVEYKDEIVIIDAGIKFPDDNLLGIDYVIPDYTYLVQNQDKIVGLFITHGHEDHI
GGVPFLLKQLNIPIYGGPLALGLIRNKLEEHHLLRTAKLNEINEDSVIKSKHFTISFYLTTHSIPETYGVIVDTPEGKVV
HTGDFKFDFTPVGKPANIAKMAQLGEEGVLCLLSDSTNSLVPDFTLSEREVGQNVDKIFRNCKGRIIFATFASNIYRVQQ
AVEAAIKNNRKIVTFGRSMENNIKIGMELGYIKAPPETFIEPNKINTVPKHELLILCTGSQGEPMAALSRIANGTHKQIK
IIPEDTVVFSSSPIPGNTKSINRTINSLYKAGADVIHSKISNIHTSGHGSQGDQQLMLRLIKPKYFLPIHGEYRMLKAHG
ETGVECGVEEDNVFIFDIGDVLALTHDSARKAGRIPSGNVLVDGSGIGDIGNVVIRDRKLLSEEGLVIVVVSIDFNTNKL
LSGPDIISRGFVYMRESGQLIYDAQRKIKTDVISKLNQNKDIQWHQIKSSIIETLQPYLFEKTARKPMILPVIMKVNEQK
ESNNK
>Q6GHZ6 3.1.-.-~~~rnj1~~~Ribonuclease J 1~~~
MKQLHPNEVGVYALGGLGEIGKNTYAVEYKDEIVIIDAGIKFPDDNLLGIDYVIPDYTYLVQNQDKIVGLFITHGHEDHI
GGVPFLLKQLNIPIYGGPLALGLIRNKLEEHHLLRTAKLNEINEDSVIKSKHFTISFYLTTHSIPETYGVIVDTPEGKVV
HTGDFKFDFTPVGKPANIAKMAQLGEEGVLCLLSDSTNSLVPDFTLSEREVGQNVDKIFRNCKGRIIFATFASNIYRVQQ
AVEAAIKNNRKIVTFGRSMENNIKIGMELGYIKAPPETFIEPNKINTVPKHELLILCTGSQGEPMAALSRIANGTHKQIK
IIPEDTVVFSSSPIPGNTKSINRTINSLYKAGADVIHSKISNIHTSGHGSQGDQQLMLRLIKPKYFLPIHGEYRMLKAHG
ETGVECGVEEDNVFIFDIGDVLALTHDSARKAGRIPSGNVLVDGSGIGDIGNVVIRDRKLLSEEGLVIVVVSIDFNTNKL
LSGPDIISRGFVYMRESGQLIYDAQRKIKTDVISKLNQNKDIQWHQIKSSIIETLQPYLFEKTARKPMILPVIMKVNEQK
ESNNK
>Q8CT16 3.1.-.-~~~rnj1~~~Ribonuclease J 1~~~COG0595
MKQLHSNEVGVYALGGLGEVGKNTYAVEYKDEIVIIDAGIKFPDDNLLGIDYVIPDYTYLEQNQDKIVGLFITHGHEDHI
GGVPFLLKQINVPIYGGPLALGLIRNKLEEHHLLRTTELHEIDESSVIKSKHFEISFYLTTHSIPEAYGVIVDTPEGKIV
HTGDFKFDFTPVGEPANIAKMAQLGHEGVLCLLSDSTNALVPDFTLSEREVGQNVDKIFRNCKGRIIFATFASNIYRVQQ
AVEAAIKYNRKIVTFGRSMENNIKIGMELGYIKAPPETFIEPNKINSVPKHELLILCTGSQGEPMAALSRIANGTHKQIK
IIPEDTVVFSSSPIPGNTKSINRTINALYKAGADVIHSKISNIHTSGHGSQGDQQLMLRLIQPKYFLPIHGEYRMLKAHG
ETGVQCGVDEDNVFIFDIGDVLALTHDSARKAGRIPSGNVLVDGSGIGDIGNVVIRDRKLLSEEGLVIVVVSIDFNTNKL
LSGPDIISRGFVYMRESGQLIYDAQRKIKGDVISKLNSNKDIQWHQIKSSIIETLHPYLYEKTARKPMILPVIMKVNEDK
>Q8K5W8 3.1.-.-~~~rnj1~~~Ribonuclease J 1~~~
MTNISLKPNEVGVFAIGGLGEIGKNTYGIEYQDEIIIVDAGIKFPEDDLLGIDYVIPDYSYIVDNLDRVKALVITHGHED
HIGGIPFLLKQANIPIYAGPLALALIRGKLEEHGLWREATVYEINHNTELTFKNMSVTFFKTTHSIPEPVGIVIHTPQGK
IICTGDFKFDFTPVGDPADLQRMAALGEEGVLCLLSDSTNAEIPTFTNSEKVVGQSILKIIEGIHGRIIFASFASNIYRL
QQAAEAAVKTGRKIAVFGRSMEKAIVNGIELGYIKVPKGTFIEPSELKNLHASEVLIMCTGSQGESMAALARIANGTHRQ
VTLQPGDTVIFSSSPIPGNTTSVNKLINTIQEAGVDVIHGKVNNIHTSGHGGQQEQKLMLSLIKPKYFMPVHGEYRMQKI
HAGLAMDIGIPKENIFIMENGDVLALTSDSARIAGHFNAQDIYVDGNGIGDIGAAVLRDRRDLSEDGVVLAVATVDFNTQ
MILAGPDILSRGFIYMRESGDLIRESQRVLFNAIRIALKNKDASIQSVNGAIVNALRPFLYEKTEREPIIIPMVLTPDKH
>O31760 3.1.-.-~~~rnjB~~~Ribonuclease J2~~~COG0595
MKKKNTENVRIIALGGVGEIGKNLYVIEIDSDIFVVDAGLMHPENEMLGIDVVIPDISYLIERADRVKAIFLTHGHDENI
GGVFYLLNKLSVPVYGTKLTLALLREKLKQYGHNRKTDLREIHSKSVITFESTKVSFFRTIHSIPDSVGVSFKTSLGSIV
CTGDFKFDQTPALNQTCDIGEIAKIGNSGVLALLSDSANAERPGYTPSEAAVSGEISDALYNSQNRVIIAVFASNINRIQ
QVIHAAAQNGRKIAVAGKNLQSVLQLARKLGYIEADDELFISVQDVKKYPKREVAIITAGSQGEPLAALTRMANKAHKQL
NIEEGDTVVIASTPIPGQELIYSKTVDLLARAGAQVIFAQKRVHVSGHGSQEELKLMINLLKPKYLIPVNGEYRMQKAHS
KIAEETGMKRSDIFLIEKGDVVEFRGQNVKIGDKVPYGNILIDGLGVGDIGNIVLRDRRLLSQDGILIVVITLDKQKKHL
VSGPEIITRGFVYVRESEGLIVQATELVRSIVTEATETSNVEWSTLKQAMRDALNQFLYEKTKRKPMIIPIIMEV
>Q2FZ19 3.1.-.-~~~rnj2~~~Ribonuclease J 2~~~COG0595
MSLIKKKNKDIRIIPLGGVGEIAKNMYIVEVDDEMFMLDAGLMFPEDEMLGIDIVIPDISYVLENKDKLKGIFLTHGHEH
AIGAVSYVLEQLDAPVYGSKLTIALIKENMKARNIDKKVRYYTVNNDSIMRFKNVNISFFNTTHSIPDSLGVCIHTSYGA
IVYTGEFKFDQSLHGHYAPDIKRMAEIGEEGVFVLISDSTEAEKPGYNTPENVIEHHMYDAFAKVRGRLIVSCYASNFIR
IQQVLNIASKLNRKVSFLGRSLESSFNIARKMGYFDIPKDLLIPITEVDNYPKNEVIIIATGMQGEPVEALSQMAQHKHK
IMNIEEGDSVFLAITASANMEVIIANTLNELVRAGAHIIPNNKKIHASSHGCMEELKMMINIMKPEYFIPVQGEFKMQIA
HAKLAAEAGVAPEKIFLVEKGDVINYNGKDMILNEKVNSGNILIDGIGIGDVGNIVLRDRHLLAEDGIFIAVVTLDPKNR
RIAAGPEIQSRGFVYVRESEDLLREAEEKVREIVEAGLQEKRIEWSEIKQNMRDQISKLLFESTKRRPMIIPVISEI
>Q7A5X6 3.1.-.-~~~rnj2~~~Ribonuclease J 2~~~
MSLIKKKNKDIRIIPLGGVGEIAKNMYIVEVDDEMFMLDAGLMFPEDEMLGIDIVIPDISYVLENKDKLKGIFLTHGHEH
AIGAVSYVLEQLDAPVYGSKLTIALIKENMKARNIDKKVRYYTVNNDSIMRFKNVNISFFNTTHSIPDSLGVCIHTSYGA
IVYTGEFKFDQSLHGHYAPDIKRMAEIGEEGVFVLISDSTEAEKPGYNTPENVIEHHMYDAFAKVRGRLIVSCYASNFIR
IQQVLNIASKLNRKVSFLGRSLESSFNIARKMGYFDIPKDLLIPITEVDNYPKNEVIIIATGMQGEPVEALSQMAQHKHK
IMNIEEGDSVFLAITASANMEVIIANTLNELVRAGAHIIPNNKKIHASSHGCMEELKMMINIMKPEYFIPVQGEFKMQIA
HAKLAAEAGVAPEKIFLVEKGDVINYNGKDMILNEKVNSGNILIDGIGIGDVGNIVLRDRHLLAEDGIFIAVVTLDPKNR
RIAAGPEIQSRGFVYVRESEDLLREAEEKVREIVEAGLQEKRIEWSEIKQNMRDQISKLLFESTKRRPMIIPVISEI
>Q6GHG0 3.1.-.-~~~rnj2~~~Ribonuclease J 2~~~
MSLIKKKNKDIRIIPLGGVGEIAKNMYIVEVDDEMFMLDAGLMFPEDEMLGIDIVIPDISYVLENKEKLKGIFLTHGHEH
AIGAVSYVLEQLDAPVYGSKLTIALIKENMKARNIDKKVRYYTVNNDSIMRFKNVNISFFNTTHSIPDSLGVCIHTSYGA
IVYTGEFKFDQSLHGHYAPDIKRMAEIGEEGVFVLISDSTEAEKPGYNTPENVIEHHMYDAFAKVRGRLIVSCYASNFIR
IQQVLNIASKLNRKVSFLGRSLESSFNIARKMGYFDIPKDLLIPITEVDNYPKNEVIIIATGMQGEPVEALSQMAQHKHK
IMNIEEGDSVFLAITASANMEVIIANTLNELVRAGAHIIPNNKKIHASSHGCMEELKMMINIMKPEYFIPVQGEFKMQIA
HAKLAAEAGVAPEKIFLVEKGDVINYNGKDMILNEKVNSGNILIDGIGIGDVGNIVLRDRHLLAEDGIFIAVVTLDPKNR
RIAAGPEIQSRGFVYVRESEDLLREAEEKVREIVEAGLQEKRIEWSEIKQNMRDQISKLLFESTKRRPMIIPVISEI
>Q8CST0 3.1.-.-~~~rnj2~~~Ribonuclease J 2~~~COG0595
MSLIKKKNKDIRIIPLGGVGEIAKNMYIVEVDDEMFMLDAGLMFPEDEMLGVDIVIPDIQYVIENKERLKGIFLTHGHEH
AIGAVSYVLEQIDAPVYGSKLTIALVKEAMKARNIKKKVRYYTVNHDSIMRFKNVNVSFFNTTHSIPDSLGVCIHTSYGS
IVYTGEFKFDQSLHGHYAPDLKRMAEIGDEGVFALISDSTEAEKPGYNTPENIIEHHMYDAFAKVKGRLIVSCYASNFVR
IQQVLNIASQLNRKVSFLGRSLESSFNIARKMGYFDIPKDLLIPINEVENYPKNEVIIIATGMQGEPVEALSQMARKKHK
IMNIEEGDSIFLAITASANMEVIIADTLNELVRAGAHIIPNNKKIHASSHGCMEELKMMLNIMKPEYFVPVQGEFKMQIA
HAKLAAETGVAPEKIFLVEKGDVISYNGKDMILNEKVQSGNILIDGIGVGDVGNIVLRDRHLLAEDGIFIAVVTLDPKNR
RIAAGPEIQSRGFVYVRESEELLKEAEEKVRKIVEEGLQEKRIEWSEIKQNMRDQISKLLFESTKRRPMIIPVISEI
>Q8K7S6 3.1.-.-~~~rnj2~~~Ribonuclease J 2~~~
MTDIKMIALGGVREYGKNFYLVEINDSMFILDAGLKYPENEQLGVDLVIPNLDYVIENKGKVQGIFLSHGHADAIGALPY
LLAEVSAPVFGSELTIELAKLFVKSNNSTKKFNNFHVVDSDTEIEFKDGLVSFFRTTHSIPESMGIVIGTDKGNIVYTGD
FKFDQAAREGYQTDLLRLAEIGKEGVLALLSDSVNATSNDQIASESEVGEEMDSVISDADGRVIVAAVASNLVRIQQVFD
SATAHGRRVVLTGTDAENIVRTALRLEKLMITDERLLIKPKDMSKFEDHELIILEAGRMGEPINSLQKMAAGRHRYVQIK
EGDLVYIVTTPSTAKEAMVARVENLIYKAGGSVKLITQNLRVSGHANGRDLQLLMNLLKPQYLFPVQGEYRDLAAHAKLA
EEVGIFPENIHILKRGDIMVLNDEGFLHEGGVPASDVMIDGNAIGDVGNIVLRDRKVLSEDGIFIVAITVSKKEKRIISK
AKVNTRGFVYVKKSHDILRESAELVNTTVGNYLKKDTFDWGELKGNVRDDLSKFLFEQTKRRPAILPVVMEVR
>B9XZG7 3.1.-.-~~~rnaJ~~~Ribonuclease J~~~
MTDNNHYENNESNENSSENSKVDEARAGAFERFTNRKKRFRENAQKNGESSHHEAPSHHKKEHRPNKKPNNHHKQKHAKT
RNYAKEELDSNKVEGVTEILHVNERGTLGFHKELKKGVETNNKIQVEHLNPHYKMNLNSKASVKITPLGGLGEIGGNMMV
IETPKSAIVIDAGMSFPKEGLFGVDILIPDFSYLHQIKDKIAGIIITHAHEDHIGATPYLFKELQFPLYGTPLSLGLIGS
KFDEHGLKKYRSYFKIVEKRCPISVGEFIIEWIHITHSIIDSSALAIQTKAGTIIHTGDFKIDHTPVDNLPTDLYRLAHY
GEKGVMLLLSDSTNSHKSGTTPSESTIAPAFDTLFKEAQGRVIMSTFSSNIHRVYQAIQYGIKYNRKIAVIGRSMEKNLD
IARELGYIHLPYQSFIEANEVAKYPDNEVLIVTTGSQGETMSALYRMATDEHRHISIKPNDLVIISAKAIPGNEASVSAV
LNFLIKKEAKVAYQEFDNIHVSGHAAQEEQKLMLRLIKPKFFLPVHGEYNHVARHKQTAISCGVPEKNIYLMEDGDQVEV
GPAFIKKVGTIKSGKSYVDNQSNLSIDTSIVQQREEVASAGVFAATIFVNKNKQALLESSQFSSLGLVGFKDEKHLIKEI
QGGLEMLLKSSNAEILNNPKKLEDHTRNFIRKALFKKFRKYPAIICHAHSF
>P56185 3.1.-.-~~~rnj~~~Ribonuclease J~~~COG0595
MTDNNQNNENHENSSENSKADEMRAGAFERFTNRKKRFRENAQKNAEYSNHEASSHHKKEHRPNKKPNNHHKQKHAKTRN
YAQEELDSNKVEGVTEILHVNERGTLGFHKELKKGVEANNKIQVEHLNPHYKMNLNSKASVKITPLGGLGEIGGNMMVIE
TPKSAIVIDAGMSFPKEGLFGVDILIPDFSYLHQIKDKIAGIIITHAHEDHIGATPYLFKELQFPLYGTPLSLGLIGSKF
DEHGLKKYRSYFKIVEKRCPISVGEFIIEWIHITHSIIDSSALAIQTKAGTIIHTGDFKIDHTPVDNLPTDLYRLAHYGE
KGVMLLLSDSTNSHKSGTTPSESTIAPAFDTLFKEAQGRVIMSTFSSNIHRVYQAIQYGIKYNRKIAVIGRSMEKNLDIA
RELGYIHLPYQSFIEANEVAKYPDNEILIVTTGSQGETMSALYRMATDEHRHISIKPNDLVIISAKAIPGNEASVSAVLN
FLIKKEAKVAYQEFDNIHVSGHAAQEEQKLMLRLIKPKFFLPVHGEYNHVARHKQTAISCGVPEKNIYLMEDGDQVEVGP
AFIKKVGTIKSGKSYVDNQSNLSIDTSIVQQREEVASAGVFVATIFVNKNKQALLESSQFSSLGLVGFKDEKPLIKEIQG
GLEVLLKSSNAEILNNPKKLEDHTRNFIRKALFKKFRKYPAIICHAHSF
>A0QVT2 3.1.-.-~~~rnj~~~Ribonuclease J~~~COG0595
MSAELAPPPPLAPGGLRVTALGGISEIGRNMTVFEHLGRLLIVDCGVLFPGHDEPGVDLILPDLRHIEDRLDEIEALVVT
HAHEDHIGAIPFLLKLRPDIPVVGSKFTIALVREKCREHRLKPKFVEVAERQSSQHGVFECEYFAVNHSIPGCLAVAIHT
GAGTVLHTGDIKLDQLPLDGRPTDLPGMSRLGDAGVDLFLCDSTNSEHPGVSPSESEVGPTLHRLIRGAEGRVIVACFAS
NVDRVQQIIDAAVALGRRVSFVGRSMVRNMGIARELGYLKVDDSDILDIAAAEMMPPDRVVLITTGTQGEPMAALSRMSR
GEHRSITLTSGDLIILSSSLIPGNEEAVYGVIDSLSKIGARVVTNAQARVHVSGHAYAGELLFLYNGVRPRNVMPVHGTW
RHLRANAALAASTGVPPENIVLAENGVSVDLVAGRASISGAVTVGKMFVDGLITGDVGDATLGERLILSSGFVSITVVVH
RGTGRPAGPAHLISRGFSEDPKALEPVAQKVERELEALAADNVTDPTRIAQAVRRTVGKWVGETYRRQPMIVPTVIEI
>P9WGZ9 3.1.-.-~~~rnj~~~Ribonuclease J~~~COG0595
MDVDLPPPGPLTSGGLRVTALGGINEIGRNMTVFEHLGRLLIIDCGVLFPGHDEPGVDLILPDMRHVEDRLDDIEALVLT
HGHEDHIGAIPFLLKLRPDIPVVGSKFTLALVAEKCREYRITPVFVEVREGQSTRHGVFECEYFAVNHSTPDALAIAVYT
GAGTILHTGDIKFDQLPPDGRPTDLPGMSRLGDTGVDLLLCDSTNAEIPGVGPSESEVGPTLHRLIRGADGRVIVACFAS
NVDRVQQIIDAAVALGRRVSFVGRSMVRNMRVARQLGFLRVADSDLIDIAAAETMAPDQVVLITTGTQGEPMSALSRMSR
GEHRSITLTAGDLIVLSSSLIPGNEEAVFGVIDALSKIGARVVTNAQARVHVSGHAYAGELLFLYNGVRPRNVMPVHGTW
RMLRANAKLAASTGVPQESILLAENGVSVDLVAGKASISGAVPVGKMFVDGLIAGDVGDITLGERLILSSGFVAVTVVVR
RGTGQPLAAPHLHSRGFSEDPKALEPAVRKVEAELESLVAANVTDPIRIAQGVRRTVGKWVGETYRRQPMIVPTVIEV
>Q72JJ7 3.1.-.-~~~rnj~~~Ribonuclease J~~~COG0595
MENQERKPRRRRRRRPQEGSQGGPQDHVEIIPLGGMGEIGKNITVFRFRDEIFVLDGGLAFPEEGMPGVDLLIPRVDYLI
EHRHKIKAWVLTHGHEDHIGGLPFLLPMIFGKESPVPIYGARLTLGLLRGKLEEFGLRPGAFNLKEISPDDRIQVGRYFT
LDLFRMTHSIPDNSGVVIRTPIGTIVHTGDFKLDPTPIDGKVSHLAKVAQAGAEGVLLLIADATNAERPGYTPSEMEIAK
ELDRVIGRAPGRVFVTTFASHIHRIQSVIWAAEKYGRKVAMEGRSMLKFSRIALELGYLKVKDRLYTLEEVKDLPDHQVL
ILATGSQGQPMSVLHRLAFEGHAKMAIKPGDTVILSSSPIPGNEEAVNRVINRLYALGAYVLYPPTYKVHASGHASQEEL
KLILNLTTPRFFLPWHGEVRHQMNFKWLAESMSRPPEKTLIGENGAVYRLTRETFEKVGEVPHGVLYVDGLGVGDITEEI
LADRRHMAEEGLVVITALAGEDPVVEVVSRGFVKAGERLLGEVRRMALEALKNGVREKKPLERIRDDIYYPVKKFLKKAT
GRDPMILPVVIEG
>P0AFW4 ~~~rnk~~~Regulator of nucleoside diphosphate kinase~~~COG0782
MSRPTIIINDLDAERIDILLEQPAYAGLPIADALNAELDRAQMCSPEEMPHDVVTMNSRVKFRNLSDGEVRVRTLVYPAK
MTDSNTQLSVMAPVGAALLGLRVGDSIHWELPGGVATHLEVLELEYQPEAAGDYLL
>P52129 3.1.-.-~~~rnlA~~~mRNA endoribonuclease toxin LS~~~
MTIRSYKNLNLVRANIETESRQFIENKNYSIQSIGPMPGSRAGLRVVFTRPGVNLATVDIFYNGDGSTTIQYLTGANRSL
GQELADHLFETINPAEFEQVNMVLQGFVETSVLPVLELSADESHIEFREHSRNAHTVVWKIISTSYQDELTVSLHITTGK
LQIQGRPLSCYRVFTFNLAALLDLQGLEKVLIRQEDGKANIVQQEVARTYLQTVMADAYPHLHVTAEKLLVSGLCVKLAA
PDLPDYCMLLYPELRTIEGVLKSKMSGLGMPVQQPAGFGTYFDKPAAHYILKPQFAATLRPEQINIISTAYTFFNVERHS
LFHMETVVDASRMISDMARLMGKATRAWGIIKDLYIV
>P52130 ~~~rnlB~~~Antitoxin RnlB~~~
MFEITGINVSGALKAVVMATGFENPLSSVNEIETKLSALLGSETTGEILFDLLCANGPEWNRFVTLEMKYGRIMLDTAKI
IDEQDVPTHILSKLTFTLRNHPEYLEASVLSPDDVRQVLSMDF
>Q48MT7 1.6.3.5~~~~~~Renalase~~~COG3380
MTVPIAIIGTGIAGLSAAQALTSAGHQVHLFDKSRGSGGRMSSKRSDAGSLDMGAQYFTARDRRFATAVKQWQAQGHVSE
WTPLLYNFHGGRLSPSPDEQVRWVGEPGMSAITRAMRGDLPVSFSCRITDVFRGEQHWNLLDAEGENHGPFSHVIIATPA
PQATALLAAAPKLASVVAGVKMDPTWAVALAFETPLQTPMQGCFVQDSPLDWLARNRSKPGRDDTLDSWVLHATSQWSRQ
NLDASREQVIEHLHGAFAELIDCAMPAPVFSLAHRWLYARPAGSHEWGALSDADLGIYVCGDWCLSGRVEGAWLSGQEAA
RRLLEHLQ
>Q888A4 1.6.3.5~~~~~~Renalase~~~COG3380
MTVPIAIIGTGIAGLSAAQALTAAGHQVHLFDKSRGSGGRMSSKRSDAGALDMGAQYFTARDRRFATAVKQWQAQGHVAE
WTPLLYNFHAGRLSPSPDEQVRWVGKPGMSAITRAMRGDMPVSFSCRITEVFRGEEHWNLLDAEGQNHGPFSHVIIATPA
PQASTLLAAAPKLASVVAGVKMDPTWAVALAFETPLQTPMQGCFVQDSPLDWLARNRSKPERDDTLDTWILHATSQWSRQ
NLDASREQVIEHLHGAFAELIDCTMPAPVFSLAHRWLYARPAGAHEWGALSDADLGIYVCGDWCLSGRVEGAWLSGQEAA
RRLLEHLQ
>P37547 3.1.26.8~~~rnmV~~~Ribonuclease M5~~~COG1658
MKIKEIIVVEGRDDTARIKLAVDADTIETNGSAIDDHVIDQIRLAQKTRGVIILTDPDFPGEKIRKTISEAVPGCKHAFL
PKHLAKPKNKRGIGVEHASVESIRACLENVHEEMEAQPSDISAEDLIHAGLIGGPAAKCRRERLGDLLKIGYTNGKQLQK
RLQMFQIKKSDFMSALDTVMREEQNE
>P9WIM5 2.1.1.-~~~~~~Rhamnosyl O-methyltransferase~~~COG3510
MGLVWRSRTSLVGQLIGLVRLVASFAAQLFYRPSDAVAEEYHKWYYGNLVWTKTTYMGINCWKSVSDMWNYQEILSELQP
SLVIEFGTRYGGSAVYFANIMRQIGQPFKVLTVDNSHKALDPRARREPDVLFVESSSTDPAIAEQIQRLKNEYPGKIFAI
LDSDHSMNHVLAEMKLLRPLLSAGDYLVVEDSNINGHPVLPGFGPGPYEAIEAYEDEFPNDYKHDAERENKFGWTSAPNG
FLIRN
>P77766 3.1.13.-~~~rnm~~~5'-3' exoribonuclease Rnm~~~COG0613
MSDTNYAVIYDLHSHTTASDGCLTPEALVHRAVEMRVGTLAITDHDTTAAIAPAREEISRSGLALNLIPGVEISTVWENH
EIHIVGLNIDITHPLMCEFLAQQTERRNQRAQLIAERLEKAQIPGALEGAQRLAQGGAVTRGHFARFLVECGKASSMADV
FKKYLARGKTGYVPPQWCTIEQAIDVIHHSGGKAVLAHPGRYNLSAKWLKRLVAHFAEHHGDAMEVAQCQQSPNERTQLA
ALARQHHLWASQGSDFHQPCPWIELGRKLWLPAGVEGVWQLWEQPQNTTEREL
>P25814 3.1.26.5~~~rnpA~~~Ribonuclease P protein component~~~COG0594
MKKRNRLKKNEDFQKVFKHGTSVANRQFVLYTLDQPENDELRVGLSVSKKIGNAVMRNRIKRLIRQAFLEEKERLKEKDY
IIIARKPASQLTYEETKKSLQHLFRKSSLYKKSSSK
>P0A7Y8 3.1.26.5~~~rnpA~~~Ribonuclease P protein component~~~COG0594
MVKLAFPRELRLLTPSQFTFVFQQPQRAGTPQITILGRLNSLGHPRIGLTVAKKNVRRAHERNRIKRLTRESFRLRQHEL
PAMDFVVVAKKGVADLDNRALSEALEKLWRRHCRLARGS
>P9WGZ3 3.1.26.5~~~rnpA~~~Ribonuclease P protein component~~~COG0594
MLRARNRMRRSADFETTVKHGMRTVRSDMVVYWWRGSGGGPRVGLIIAKSVGSAVERHRVARRLRHVAGSIVKELHPSDH
VVIRALPSSRHVSSARLEQQLRCGLRRAVELAGSDR
>Q2FUQ1 3.1.26.5~~~rnpA~~~Ribonuclease P protein component~~~COG0594
MLLEKAYRIKKNADFQRIYKKGHSVANRQFVVYTCNNKEIDHFRLGISVSKKLGNAVLRNKIKRAIRENFKVHKSHILAK
DIIVIARQPAKDMTTLQIQNSLEHVLKIAKVFNKKIK
>P0A0H5 3.1.26.5~~~rnpA~~~Ribonuclease P protein component~~~
MLLEKAYRIKKNADFQRIYKKGHSVANRQFVVYTCNNKEIDHFRLGISVSKKLGNAVLRNKIKRAIRENFKVHKSHILAK
DIIVIARQPAKDMTTLQIQNSLEHVLKIAKVFNKKIK
>Q9X1H4 3.1.26.5~~~rnpA~~~Ribonuclease P protein component~~~COG0594
MTESFTRRERLRLRRDFLLIFKEGKSLQNEYFVVLFRKNGLDYSRLGIVVKRKFGKATRRNKLKRWVREIFRRNKGVIPK
GFDIVVIPRKKLSEEFERVDFWTVREKLLNLLKRIEG
>Q7X5K6 3.1.26.5~~~rnpA~~~Ribonuclease P protein component~~~COG0594
MDEKDVATQPQETGQNPRLSGQDEDPGRPEGAEAPPSEGALAPHARRSEAVGPKPPAPGGKLLSLKGDRAFQRLRKGRAG
RGRYVSVKWLPAAELRVGIVVSKKVGKAVVRNKVKRRLREILRRLHLPQAHLLVVASPEAREADFAELFRDVVRALRKSG
LVQ
>O67069 2.7.7.56~~~rph~~~Ribonuclease PH~~~COG0689
MRSDGRKEDQLRPVSIQRDFLEYPEGSCLISFGKTKVICTASVIENVPNWLKGKGQGWITAEYSMLPRATQQRTIRESVQ
GRIGGRTHEIQRMIGRAMRTAVELTKIGERTIWVDCDVIQADGGTRTAAITGAFVAVADAIIKLHKEGIIEETPIKDFVA
AVSVGIVNDRILLDLNFEEDSAAQVDMNVVGTGSGRLSEVHTMGEEYSFTKDELIKMLDLAQKGINELIELQKKLYVIQD
GKWERSELKEVSSTT
>Q81LA9 2.7.7.56~~~rph~~~Ribonuclease PH~~~COG0689
MRVDGREKTELRHIHIHTNYLKHPEGSVLIEVGDTKVICSATIEERVPPFMRGEGKGWVTAEYAMIPRATEQRTIRESSK
GKVTGRTMEIQRLIGRALRAVVDLEALGERTVWIDCDVIQADGGTRTASITGAYVAMVLAFEKLLQAEKVSKIPVKDYLA
ATSVGIVEEQGVVLDLNYAEDSKADVDMNVIMTGKGQFVEVQGTGEEATFSRAQLNELLDAAEQGIFQLIDIQKEALGDI
VSHIE
>P28619 2.7.7.56~~~rph~~~Ribonuclease PH~~~COG0689
MRHDGRQHDELRPITFDLDFISHPEGSVLITAGNTKVICNASVEDRVPPFLRGGGKGWITAEYSMLPRATNQRTIRESSK
GKISGRTMEIQRLIGRALRAVVDLEKLGERTIWIDCDVIQADGGTRTASITGAFLAMAIAIGKLIKAGTIKTNPITDFLA
AISVGIDKEQGILLDLNYEEDSSAEVDMNVIMTGSGRFVELQGTGEEATFSREDLNGLLGLAEKGIQELIDKQKEVLGDS
LPELK
>P0CG18 2.7.7.56~~~rph~~~Ribonuclease PH~~~
MRPAGRSNNQVRPVTLTRNYTKHAEGSVLVEFGDTKVLCTASIEEGVPRFLKGQGQGWITAEYGMLPRSTHTRNAREAAK
GKQGGRTMEIQRLIARALRAAVDLKALGEFTITLDCDVLQADGGTRTASITGACVALVDALQKLVENGKLKTNPMKGMVA
AVSVGIVNGEAVCDLEYVEDSAAETDMNVVMTEDGRIIEVQGTAEGEPFTHEELLILLALARGGIESIVATQKAALAN
>P0CG19 ~~~rph~~~Truncated inactive ribonuclease PH~~~
MRPAGRSNNQVRPVTLTRNYTKHAEGSVLVEFGDTKVLCTASIEEGVPRFLKGQGQGWITAEYGMLPRSTHTRNAREAAK
GKQGGRTMEIQRLIARALRAAVDLKALGEFTITLDCDVLQADGGTRTASITGACVALVDALQKLVENGKLKTNPMKGMVA
AVSVGIVNGEAVCDLEYVEDSAAETDMNVVMTEDGRIIEVQGTAEGEPFTHEELLILLALARGESNPL
>A0R1W8 2.7.7.56~~~rph~~~Ribonuclease PH~~~COG2123
MSRREDGRLDDELRPVVITRGFTSHPAGSVLVEFGQTRVMCTASVTEGVPRWRKGSGQGWLTAEYAMLPAATHDRSDRES
VKGRIGGRTQEISRLIGRSLRACIDLAALGENTIAIDCDVLQADGGTRTAAITGAYVALSDAVTWLAAAGRLSDPRPLSC
AIAAVSVGVVDGRVRVDLPYSEDSRAEVDMNVVATDTGTLVEIQGTGEGATFPRSTLDKMLDAALGATEQLFVLQREALD
APYPGVLPEGPAPKKAFGS
>P9WGZ7 2.7.7.56~~~rph~~~Ribonuclease PH~~~COG2123
MSKREDGRLDHELRPVIITRGFTENPAGSVLIEFGHTKVLCTASVTEGVPRWRKATGLGWLTAEYAMLPSATHSRSDRES
VRGRLSGRTQEISRLIGRSLRACIDLAALGENTIAIDCDVLQADGGTRTAAITGAYVALADAVTYLSAAGKLSDPRPLSC
AIAAVSVGVVDGRIRVDLPYEEDSRAEVDMNVVATDTGTLVEIQGTGEGATFARSTLDKLLDMALGACDTLFAAQRDALA
LPYPGVLPQGPPPPKAFGT
>P50597 2.7.7.56~~~rph~~~Ribonuclease PH~~~
MNRPSGRAADQLRPIRITRHYTKHAEGSVLVEFGDTKVICTVSAESGVPRFLKGQGQGWLTAEYGMLPRSTGERNQREAS
RGKQGGRTLEIQRLIGRSLRAALDLSKLGENTLYIDCDVIQADGGTRTASITGATVALIDALAVLKKRGALKGNPLKQMV
AAVSVGIYQGVPVLDLDYLEDSAAETDLNVVMTDAGGFIEVQGTAEGAPFRPAELNAMLELAQQGMQELFELQRAALAE
>O32231 3.1.13.1~~~rnr~~~Ribonuclease R~~~COG0557
MEKEAFMEKLLSFMKEEAYKPLTVQELEEMLNITEAEEFKELVKALVALEEKGLIVRTRSDRYGIPEKMNLIKGKISAHA
KGFAFLLPEDTSLSDVFIPPNELNTAMNGDIVMVRLNSQSSGSRQEGTVIRILERAIQRVVGTYTETRNFGFVIPDDKKI
TSDIFIPKNGKNGAAEGHKVVVKLTSYPEGRMNAEGEVETILGHKNDPGIDILSVIHKHGLPGEFPADAMEQASSTPDTI
DEKDLKDRRDLRDQVIVTIDGADAKDLDDAVTVTKLDDGSYKLGVHIADVSHYVTENSPIDKEALERGTSVYLVDRVIPM
IPHRLSNGICSLNPKVDRLTLSCEMTINSQGQVTEHEIFQSVIKTTERMTYSDVNKILVDDDEELKQKYEPLVPMFKDME
RLAQILRDKRMDRGAVDFDFKEAKVLVDDEGAVKDVVIRERSVAEKLIEEFMLVANETVAEHFHWMNVPFIYRIHEEPNA
EKLQKFLEFVTTFGYVVKGTAGNIHPRALQSILDAVRDRPEETVISTVMLRSMKQAKYDPQSLGHFGLSTEFYTHFTSPI
RRYPDLIVHRLIRTYLINGKVDEATQEKWAERLPDIAEHTSSMERRAVDAERETDDLKKAEYMLDKIGEEFDGMISSVTN
FGMFVELPNTIEGLVHVSFMTDDYYRFDEQHFAMIGERTGNVFRIGDEITVKVVDVNKDERNIDFEIVGMKGTPRRPREL
DSSRSRKRGKPARKRVQSTNTPVSPAPSEEKGEWFTKPKKKKKKRGFQNAPKQKRKKKK
>P21499 3.1.13.1~~~rnr~~~Ribonuclease R~~~COG0557
MSQDPFQEREAEKYANPIPSREFILEHLTKREKPASRDELAVELHIEGEEQLEGLRRRLRAMERDGQLVFTRRQCYALPE
RLDLVKGTVIGHRDGYGFLRVEGRKDDLYLSSEQMKTCIHGDQVLAQPLGADRKGRREARIVRVLVPKTSQIVGRYFTEA
GVGFVVPDDSRLSFDILIPPDQIMGARMGFVVVVELTQRPTRRTKAVGKIVEVLGDNMGTGMAVDIALRTHEIPYIWPQA
VEQQVAGLKEEVPEEAKAGRVDLRDLPLVTIDGEDARDFDDAVYCEKKRGGGWRLWVAIADVSYYVRPSTPLDREARNRG
TSVYFPSQVIPMLPEVLSNGLCSLNPQVDRLCMVCEMTVSSKGRLTGYKFYEAVMSSHARLTYTKVWHILQGDQDLREQY
APLVKHLEELHNLYKVLDKAREERGGISFESEEAKFIFNAERRIERIEQTQRNDAHKLIEECMILANISAARFVEKAKEP
ALFRIHDKPSTEAITSFRSVLAELGLELPGGNKPEPRDYAELLESVADRPDAEMLQTMLLRSMKQAIYDPENRGHFGLAL
QSYAHFTSPIRRYPDLTLHRAIKYLLAKEQGHQGNTTETGGYHYSMEEMLQLGQHCSMAERRADEATRDVADWLKCDFML
DQVGNVFKGVISSVTGFGFFVRLDDLFIDGLVHVSSLDNDYYRFDQVGQRLMGESSGQTYRLGDRVEVRVEAVNMDERKI
DFSLISSERAPRNVGKTAREKAKKGDAGKKGGKRRQVGKKVNFEPDSAFRGEKKTKPKAAKKDARKAKKPSAKTQKIAAA
TKAKRAAKKKVAE
>P47350 3.1.13.1~~~rnr~~~Ribonuclease R~~~COG0557
MKVLTELQKQIFTIVKKENGKPIPPGIVVRMMENSPNFPGKHLIYRAIDDLLDWAILRKAGGVTNQLLVNYEPAEPLLDK
KLQGILTLGNKNSGFIRSLDDDKTVYYVHYSNLTGALDGDLVEFCKLDKPQFGDKFDAAVITILKRARILYAGNFLVDQN
EFALEYKIVADNPRFYLTMIVNPDSIPNNLASNTKIAFQIDEYDPDNNLCKVSVQQVLGNNDDPLINIKAIMLDNSIVFE
TNDVVEQHANKLSFDTEEQHKAYRQDLTDLAFVTVDPTTSKDLDDAIYVKTIPTGFVLYVAIADVAHYVNRNSEIDIEAK
HKTSSIYLPGHYVVPMLPEQLSNQLCSLNPAQKRYVVVCEISFDNQGRIKTNKLYPATIISKNRFSYDQVNKWLNNKSEL
NCDETVINSLKAAFTLSDLIQAQRQKRGTIDLSHKETEIVVDEHYFPIKINFLVHDKAETMIENLMVVANETVAWVLTNN
KIALPYRVHPRPSKKKLQSLIETVGELNITKPQFNLDTVTSSQIASWLNENKDNPSYEIFVILLLRTLGKAFYSVNPLMH
FSIGSNHYTHFTSPIRRYIDLTIHRLLWMHLFTPDQFTDNERDQLKQELEKIADTVNDTEIKIINCERNANDYLTTLLLS
KQIGKTFSGFISAITSFGIFMRMDENNFDGLIKITTIPDDFFIFEKEKMVLKGRKTNKVYKIGDRLEAKLSEIDFIQKRA
ILTLI
>P30289 4.6.1.24~~~rnaSA3~~~Guanyl-specific ribonuclease Sa3~~~
MRIPPRLVALAGAAAVAATLIAGPVAAAAPASHAVAASSAASASVKAVGRVCYSALPSQAHDTLDLIDEGGPFPYSQDGV
VFQNREGLLPAHSTGYYHEYTVITPGSPTRGARRIITGQQWQEDYYTADHYASFRRVDFAC
>P05798 4.6.1.24~~~rnaSA~~~Guanyl-specific ribonuclease Sa~~~COG4290
DVSGTVCLSALPPEATDTLNLIASDGPFPYSQDGVVFQNRESVLPTQSYGYYHEYTVITPGARTRGTRRIITGEATQEDY
YTGDHYATFSLIDQTC
>Q5SLP1 3.1.-.-~~~~~~Ribonuclease TTHA0252~~~COG1236
MRIVPFGAAREVTGSAHLLLAGGRRVLLDCGMFQGKEEARNHAPFGFDPKEVDAVLLTHAHLDHVGRLPKLFREGYRGPV
YATRATVLLMEIVLEDALKVMDEPFFGPEDVEEALGHLRPLEYGEWLRLGALSLAFGQAGHLPGSAFVVAQGEGRTLVYS
GDLGNREKDVLPDPSLPPLADLVLAEGTYGDRPHRPYRETVREFLEILEKTLSQGGKVLIPTFAVERAQEILYVLYTHGH
RLPRAPIYLDSPMAGRVLSLYPRLVRYFSEEVQAHFLQGKNPFRPAGLEVVEHTEASKALNRAPGPMVVLAGSGMLAGGR
ILHHLKHGLSDPRNALVFVGYQPQGGLGAEIIARPPAVRILGEEVPLRASVHTLGGFSGHAGQDELLDWLQGEPRVVLVH
GEEEKLLALGKLLALRGQEVSLARFGEGVPV
>P00650 4.6.1.24~~~~~~Guanyl-specific ribonuclease St~~~
EAPCGDTSGFEQVRLADLPPEATDTYELIEKGGPYPYPEDGTVFENREGILPDCAEGYYHEYTVKTPSGDDRGARRFVVG
DGGEYFYTEDHYESFRLTIVN
>P16114 ~~~rns~~~Regulatory protein Rns~~~
MDFKYTEEKETIKINNIMIHKYTVLYTSNCIMDIYSEEEKITCFSNRLVFLERGVNISVRMQKQILSEKPYVAFRLNGDM
LRHLKDALMIIYGMSKIDTNACRSMSRKIMTTEVNKTLLDELKNINSHDNSAFISSLIYLISKLENNEKIIESIYISSVS
FFSDKVRNLIEKDLSRKWTLGIIADAFNASEITIRKRLESENTNFNQILMQLRMSKAALLLLENSYQISQISNMIGISSA
SYFIRIFNKHYGVTPKQFFTYFKGG
>P9WN09 2.4.1.-~~~~~~PGL/p-HBAD biosynthesis rhamnosyltransferase~~~COG1819
MRVSCVYATASRWGGPPVASEVRGDAAISTTPDAAPGLAARRRRILFVAEAVTLAHVVRPFALAQSLDPSRYEVHFACDP
RYNQLLGPLPFRHHAIHTIPSERFFGNLTQGRFYAMRTLRKYVEADLRVLDEIAPDLVVGDLRISLSVSARLAGIPYIAI
ANAYWSPYAQRRFPLPDVIWTRLFGVRLVKLLYRLERPLLFALQCMPLNWVRRRHGLSSLGWNLCRIFTDGDHTLYADVP
ELMPTYDLPANHEYLGPVLWSPAGKPPTWWDSLPTDRPIVYATLGTSGGRNLLQLVLNALAELPVTVIAATAGRSDLKTV
PANAFVADYLPGEAAAARSAVVVCNGGSLTTQQALVAGVPVIGVAGNLDQHLNMEAVERAGAGVLLRTERLKSQRVAGAV
MQVISRSEYRQAAARLADAFGRDRVGFPQHVENALRLMPENRPRTWLAS
>P30014 3.1.13.-~~~rnt~~~Ribonuclease T~~~COG0847
MSDNAQLTGLCDRFRGFYPVVIDVETAGFNAKTDALLEIAAITLKMDEQGWLMPDTTLHFHVEPFVGANLQPEALAFNGI
DPNDPDRGAVSEYEALHEIFKVVRKGIKASGCNRAIMVAHNANFDHSFMMAAAERASLKRNPFHPFATFDTAALAGLALG
QTVLSKACQTAGMDFDSTQAHSALYDTERTAVLFCEIVNRWKRLGGWPLSAAEEV
>Q9HY82 3.1.13.-~~~rnt~~~Ribonuclease T~~~
MSEDNFDDEFDGSLPSGPRHPMARRFRGYLPVVVDVETGGFNSATDALLEIAATTVGMDEKGFLFPEHTYFFRIEPFEGA
NIEPAALEFTGIKLDHPLRMAVQEEAALTEIFRGIRKALKANGCKRAILVGHNSSFDLGFLNAAVARTGIKRNPFHPFSS
FDTATLAGLAYGQTVLAKACQAAGMEFDNREAHSARYDTEKTAELFCGIVNRWKEMGGWMDDDD
>O31774 3.1.-.-~~~rny~~~Ribonuclease Y~~~COG1418
MTPIMMVLISILLILLGLVVGYFVRKTIAEAKIAGARGAAEQILEDAKRDAEALKKEALLEAKDEIHKLRIDAEQEVRER
RNELQKQENRLLQKEENLDRKHEGIDKREAMLEKKDHSLNERQQHIEEMESKVDEMIRMQQSELERISSLTRDEAKQIIL
ERVENELSHDIAIMTKETENRAKEEADKKAKNILSLALQRCAADHVAETTVSVVNLPNDEMKGRIIGREGRNIRTLETLT
GIDLIIDDTPEAVILSGFDPIRRETARIALDKLVQDGRIHPARIEEMVEKSRREVDDYIREMGEQTTFEVGVHGLHPDLI
KILGRLKFRTSYGQNVLKHSMEVAFLAGLMASELGEDAKLAKRAGLLHDIGKAIDHEVEGSHVEIGVELATKYKEHPVVI
NSIASHHGDEEPTSIIAVLVAAADALSAARPGARSETLENYIRRLEKLEEISESYEGVEKSFAIQAGREVRIMVKPDSIN
DLEAHRLARDIRKRIEDELDYPGHIKVTVIRETRAVEYAK
>Q2FZ08 3.1.-.-~~~rny~~~Ribonuclease Y~~~COG1418
MNLLSLLLILLGIILGVVGGYVVARNLLLQKQSQARQTAEDIVNQAHKEADNIKKEKLLEAKEENQILREQTEAELRERR
SELQRQETRLLQKEENLERKSDLLDKKDEILEQKESKIEEKQQQVDAKESSVQTLIMKHEQELERISGLTQEEAINEQLQ
RVEEELSQDIAVLVKEKEKEAKEKVDKTAKELLATAVQRLAADHTSESTVSVVNLPNDEMKGRIIGREGRNIRTLETLTG
IDLIIDDTPEAVILSGFDPIRREIARTALVNLVSDGRIHPGRIEDMVEKARKEVDDIIREAGEQATFEVNAHNMHPDLVK
IVGRLNYRTSYGQNVLKHSIEVAHLASMLAAELGEDETLAKRAGLLHDVGKAIDHEVEGSHVEIGVELAKKYGENETVIN
AIHSHHGDVEPTSIISILVAAADALSAARPGARKETLENYIRRLERLETLSESYDGVEKAFAIQAGREIRVIVSPEEIDD
LKSYRLARDIKNQIEDELQYPGHIKVTVVRETRAVEYAK
>P67278 3.1.-.-~~~rny~~~Ribonuclease Y~~~
MNLLSLLLILLGIILGVVGGYVVARNLLLQKQSQARQTAEDIVNQAHKEADNIKKEKLLEAKEENQILREQTEAELRERR
SELQRQETRLLQKEENLERKSDLLDKKDEILEQKESKIEEKQQQVDAKESSVQTLIMKHEQELERISGLTQEEAINEQLQ
RVEEELSQDIAVLVKEKEKEAKEKVDKTAKELLATAVQRLAADHTSESTVSVVNLPNDEMKGRIIGREGRNIRTLETLTG
IDLIIDDTPEAVILSGFDPIRREIARTALVNLVSDGRIHPGRIEDMVEKARKEVDDIIREAGEQATFEVNAHNMHPDLVK
IVGRLNYRTSYGQNVLKHSIEVAHLASMLAAELGEDETLAKRAGLLHDVGKAIDHEVEGSHVEIGVELAKKYGENETVIN
AIHSHHGDVEPTSIISILVAAADALSAARPGARKETLENYIRRLERLETLSESYDGVEKAFAIQAGREIRVIVSPEEIDD
LKSYRLARDIKNQIEDELQYPGHIKVTVVRETRAVEYAK
>P54548 3.1.26.11~~~rnz~~~Ribonuclease Z~~~COG1234
MELLFLGTGAGIPAKARNVTSVALKLLEERRSVWLFDCGEATQHQILHTTIKPRKIEKIFITHMHGDHVYGLPGLLGSRS
FQGGEDELTVYGPKGIKAFIETSLAVTKTHLTYPLAIQEIEEGIVFEDDQFIVTAVSVIHGVEAFGYRVQEKDVPGSLKA
DVLKEMNIPPGPVYQKIKKGETVTLEDGRIINGNDFLEPPKKGRSVVFSGDTRVSDKLKELARDCDVLVHEATFAKEDRK
LAYDYYHSTTEQAAVTAKEARAKQLILTHISARYQGDASLELQKEAVDVFPNSVAAYDFLEVNVPRG
>P00649 3.1.27.-~~~~~~Ribonuclease~~~
MKKISSVFTMFALIAAILFSGFIPQQAYAETPLTQTATNETATIQLTSDVHTLAVINTFDGVADYLIRYKRLPDNYITKS
QASALGWVASKGNLAEVAPGKSIGGDVFSNREGRLPSASGRTWREADINYVSGFRNADRLVYSSDWLIYKTTDHYATFTR
IR
>P35078 3.1.27.-~~~~~~Ribonuclease~~~
AQVINTFDGVADYLLTYHKLPDNYITKSEAQALGWVASKGNLADVAPGKSIGGDIFSNREGKLPAKSGRTWREADINYTS
GFRNSDRILYSSDWLIYKTTDHYKTFTKIR
>P37203 3.1.27.-~~~~~~Ribonuclease~~~
AVINTFDGVADYLIRYKRLPDNYITKSQASALGWVASKGNLAEVAPGKSIGGDVFSNREGRLPSASGRTWREADINYVSG
FRNADRLVYSSDWLIYKTTDHYATFARIR
>Q9RUW8 ~~~rsr~~~RNA-binding protein RO60~~~COG2304
MKNLLRAINPLNRPQTERLDERQVRNNAGGFVYTVSDESRLTRFLVLGVDGGTFYASAQKHTVQATDFVRELVQRDAALA
LRVTLDVVRGQRAPKADPALLVLALIAKTAPNAADRKAAWDALPEVARTGTMLLHFLAFADALGGWGRLTRRGVANVYET
ADVDKLALWAVKYKARDGWSQADALRKAHPKTDDAARNAVLKFMVDGVLPKVDSPALRVIEGHLKATEAQTDAAAAALMQ
EYRLPLEAVPTHVRGAEVYRAAMQTNGLTWLLRNLGNLGRVGVLTPNDSATVQAVIERLTDPAALKRGRIHPLDALKARL
VYAQGQGVRGKGTWLPVPRVVDALEEAFTLAFGNVQPANTRHLLALDVSGSMTCGDVAGVPGLTPNMAAAAMSLIALRTE
PDALTMGFAEQFRPLGITPRDTLESAMQKAQSMSFGGTDCAQPILWAAQERLDVDTFVVYTDNETWAGQVHPTVALDQYA
QKMGRAPKLIVVGLTATEFSIADPQRRDMLDVVGFDAAAPNVMTAFARGEV
>P0ACI0 ~~~rob~~~Right origin-binding protein~~~COG2207
MDQAGIIRDLLIWLEGHLDQPLSLDNVAAKAGYSKWHLQRMFKDVTGHAIGAYIRARRLSKSAVALRLTARPILDIALQY
RFDSQQTFTRAFKKQFAQTPALYRRSPEWSAFGIRPPLRLGEFTMPEHKFVTLEDTPLIGVTQSYSCSLEQISDFRHEMR
YQFWHDFLGNAPTIPPVLYGLNETRPSQDKDDEQEVFYTTALAQDQADGYVLTGHPVMLQGGEYVMFTYEGLGTGVQEFI
LTVYGTCMPMLNLTRRKGQDIERYYPAEDAKAGDRPINLRCELLIPIRR
>Q9K9B2 1.2.1.88~~~rocA1~~~1-pyrroline-5-carboxylate dehydrogenase 1~~~COG1012
MLQPYKHEPFTDFTVEANRKAFEEALGLVEKELGKEYPLIINGERVTTEDKIQSWNPARKDQLVGSVSKANQDLAEKAIQ
SADEAFQTWRNVNPEERANILVKAAAIIRRRKHEFSAWLVHEAGKPWKEADADTAEAIDFLEYYARQMIELNRGKEILSR
PGEQNRYFYTPMGVTVTISPWNFALAIMVGTAVAPIVTGNTVVLKPASTTPVVAAKFVEVLEDAGLPKGVINYVPGSGAE
VGDYLVDHPKTSLITFTGSKDVGVRLYERAAVVRPGQNHLKRVIVEMGGKDTVVVDRDADLDLAAESILVSAFGFSGQKC
SAGSRAVIHKDVYDEVLEKTVALAKNLTVGDPTNRDNYMGPVIDEKAFEKIMSYIEIGKKEGRLMTGGEGDSSTGFFIQP
TIIADLDPEAVIMQEEIFGPVVAFSKANDFDHALEIANNTEYGLTGAVITRNRAHIEQAKREFHVGNLYFNRNCTGAIVG
YHPFGGFKMSGTDSKAGGPDYLALHMQAKTVSEMY
>P94391 1.2.1.88~~~putC~~~1-pyrroline-5-carboxylate dehydrogenase 2~~~COG1012
MTTPYKHEPFTNFQDQNNVEAFKKALATVSEYLGKDYPLVINGERVETEAKIVSINPADKEEVVGRVSKASQEHAEQAIQ
AAAKAFEEWRYTSPEERAAVLFRAAAKVRRRKHEFSALLVKEAGKPWNEADADTAEAIDFMEYYARQMIELAKGKPVNSR
EGEKNQYVYTPTGVTVVIPPWNFLFAIMAGTTVAPIVTGNTVVLKPASATPVIAAKFVEVLEESGLPKGVVNFVPGSGAE
VGDYLVDHPKTSLITFTGSREVGTRIFERAAKVQPGQQHLKRVIAEMGGKDTVVVDEDADIELAAQSIFTSAFGFAGQKC
SAGSRAVVHEKVYDQVLERVIEITESKVTAKPDSADVYMGPVIDQGSYDKIMSYIEIGKQEGRLVSGGTGDDSKGYFIKP
TIFADLDPKARLMQEEIFGPVVAFCKVSDFDEALEVANNTEYGLTGAVITNNRKHIERAKQEFHVGNLYFNRNCTGAIVG
YHPFGGFKMSGTDSKAGGPDYLALHMQAKTISEMF
>Q65NN2 1.2.1.88~~~rocA~~~1-pyrroline-5-carboxylate dehydrogenase~~~COG1012
MTTPYKHEPFTNFGIEENRKAFEKALETVNNEWLGQSYPLVIDGERYETENKIVSINPANKEEVVGTVSKATQDHAEKAI
QAAAKAFETWRYTDPEERAAVLFRAVAKVRRKKHEFSALLVKEAGKPWNEADADTAEAIDFMEYYARQMIELAKGKPVNS
REGERNQYVYTPTGVTVVIPPWNFLFAIMAGTTVAPIVTGNTVVLKPASAAPVIAAKFVEVLEESGLPKGVVNFVPGSGA
EVGDYLVDHPKTSIITFTGSREVGTRIFERAAKVQPGQTHLKQVIAEMGGKDTVVVDEDCDIELAAQSIFTSAFGFAGQK
CSAGSRAVVHEKVYDEVLKRVIEITESKKVGEPDSADVYMGPVIDQASFNKIMDYIEIGKEEGRLVSGGKGDDSKGYFIE
PTIFADLDPKARLMQEEIFGPVVAFSKVSSFDEALEVANNTEYGLTGAVITKNRDHINRAKQEFHVGNLYFNRNCTGAIV
GYHPFGGFKMSGTDSKAGGPDYLALHMQAKTISEMF
>P39634 1.2.1.88~~~rocA~~~1-pyrroline-5-carboxylate dehydrogenase~~~COG1012
MTVTYAHEPFTDFTEAKNKTAFGESLAFVNTQLGKHYPLVINGEKIETDRKIISINPANKEEIIGYASTADQELAEKAMQ
AALQAFDSWKKQRPEHRANILFKAAAILRRRKHEFSSYLVKEAGKPWKEADADTAEAIDFLEFYARQMLKLKEGAPVKSR
AGEVNQYHYEALGVGIVISPFNFPLAIMAGTAVAAIVTGNTILLKPADAAPVVAAKFVEVMEEAGLPNGVLNYIPGDGAE
IGDFLVEHPKTRFVSFTGSRAVGCRIYERAAKVQPGQKWLKRVIAEMGGKDTVLVDKDADLDLAASSIVYSAFGYSGQKC
SAGSRAVIHQDVYDEVVEKAVALTKTLTVGNPEDPDTYMGPVIHEASYNKVMKYIEIGKSEGKLLAGGEGDDSKGYFIQP
TIFADVDENARLMQEEIFGPVVAICKARDFDHMLEIANNTEYGLTGALLTKNRAHIERAREDFHVGNLYFNRGCTGAIVG
YQPFGGFNMSGTDSKAGGPDYLILHMQAKTTSEAF
>P99076 1.2.1.88~~~rocA~~~1-pyrroline-5-carboxylate dehydrogenase~~~
MVVEFKNEPGYDFSVQENVDMFKKALKDVEKELGQDIPLVINGEKIFKDDKIKSINPADTSQVIANASKATKQDVEDAFK
AANEAYKSWKTWSANDRAELMLRVSAIIRRRKAEIAAIMVYEAGKPWDEAVGDAAEGIDFIEYYARSMMDLAQGKPVLDR
EGEHNKYFYKSIGTGVTIPPWNFPFAIMAGTTLAPVVAGNTVLLKPAEDTPYIAYKLMEILEEAGLPKGVVNFVPGDPKE
IGDYLVDHKDTHFVTFTGSRATGTRIYERSAVVQEGQNFLKRVIAEMGGKDAIVVDENIDTDMAAEAIVTSAFGFSGQKC
SACSRAIVHKDVYDEVLEKSIKLTKELTLGNTVDNTYMGPVINKKQFDKIKNYIEIGKEEGKLEQGGGTDDSKGYFVEPT
IISGLKSKDRIMQEEIFGPVVGFVKVNDFDEAIEVANDTDYGLTGAVITNNREHWIKAVNEFDVGNLYLNRGCTSAVVGY
HPFGGFKMSGTDAKTGSPDYLLHFLEQKVVSEMF
>P39635 ~~~rocB~~~Protein RocB~~~COG4187
MQYTKISHMNPAERVEALTASLVSLSSVNGTVGEGTKADFIKEVITSYPYFQENPSHVWEQAIPNDPYKRKNIFAFIKGH
GESRNTVIYHAHLDTVGIEDFGPLKDIAFDCEKLAEYFSRYEFDQDVQRDAKSGEWMFGRGSVDMQSGIAVHLANLLHFS
ENLETLPGNVLFMANPDEESQHSGILASISELNRLKKEKQLHYLAAINTDFITPLYDGDQTRYIYTGAAGKLLPCFYIYG
REVHVGDTLAGIDPNFISSEITSRLHNNIHLAEKVEGELVLPPSCLYQRDNKESYNVQTAVSTSLYFNCFIYERTAKEMM
DLLIEVTEEACRETEQKLSDYYEEYVKRANLPKKHLSWGIQVYSLEQYLEKLRNRGIDPEQCIEQTFKANEHLELRMRCF
QAIEELQKLDPDQGAKVILFYAPPYLPHNYLKEDSARDQLLQHVIKEAADKTAESTGETFVFKKFFPYLADGSFLSLHET
DGEIDSFIRNFPGWNMIGTIPFKDIRKLNIPSINMGVYGKDGHKWTERVYKPYTFHVLPLLIQQTTVHLLQSYRMTITAK
EPKGEG
>P39636 ~~~rocC~~~Amino-acid permease RocC~~~COG0833
MQNHKNELQRSMKSRHLFMIALGGVIGTGLFLGSGFTISQAGPLGAIAAYIIGGFLMYLVMLCLGELAVAMPVAGSFQAY
ATKFLGQSTGFMIGWLYWFSWANTVGLELTSAGILMQRWLPSVPIWIWCLVFGIVIFLINALSVRSFAEMEFWFSSIKVA
AIILFIVIGGAAVFGLIDFKGGQETPFLSNFMTDRGLFPNGVLAVMFTLVMVNFSFQGTELVGIAAGESESPEKTLPKSI
RNVIWRTLFFFVLAMFVLVAILPYKTAGVIESPFVAVLDQIGIPFSADIMNFVILTAILSVANSGLYAASRMMWSLSSNQ
MGPSFLTRLTKKGVPMNALLITLGISGCSLLTSVMAAETVYLWCISISGMVTVVAWMSICASQFFFRRRFLAEGGNVNDL
EFRTPLYPLVPILGFCLYGCVLISLIFIPDQRIGLYCGVPIIIFCYAYYHLSIKKRINHETIEKKQTEAQ
>P39137 ~~~rocE~~~Amino-acid permease RocE~~~COG0833
MNTNQDNGNQLQRTMKSRHLFMISLGGVIGTGFFLGTGFTINQAGPLGAVLSYLVGGFIMFLTMLCLGELAVAFPVSGSF
QTYATKFISPAFGFAFGWLYWLGWAVTCAIEFLSAGQLMQRWFPHIDVWIWCLVFAALMFILNAITTKAFAESEFWFSGI
KILIILLFIILGGAAMFGLIDLKGGEQAPFLTHFYEDGLFPNGIKAMLITMITVNFAFQGTELIGVAAGESEDPEKTIPR
SIKQTVWRTLVFFVLSIIVIAGMIPWKQAGVVESPFVAVFEQIGIPYAADIMNFVILIALLSVANSGLYASTRILYAMAN
EGQAFKALGKTNQRGVPMYSLIVTMAVACLSLLTKFAQAETVYMVLLSLAGMSAQVGWITISLSQIMFRRKYIREGGKIE
DLKFKTPLYPVLPLIGLTLNTVVLISLAFDPEQRIALYCGVPFMIICYIIYHVVIKKRQQANRQLEL
>P38022 ~~~rocR~~~Transcriptional activator RocR~~~COG3829
MVKDSEFLTLVFQSILDEIDVGLHVVDEHGNTIVYNNKMMQIEDMEKHDVLNKNLMDVFMFSKQQDSTLVQALQEGKTIK
NVKQSYFNNKGQEITTINHTYPIVQDGKIRGAVEIAKDVTKLERLIRENMNKKGSTTYTFDSILGTSPAIQDVIENAKRA
TRTSSSVLLAGETGTGKELFAQSIHNGSDRSGGPFISQNCAALPDSLVESILFGTKKGAFTGAVDQPGLFEQAHGGTLLL
DEINSLNLSLQAKLLRALQERKIRRIGSTKDTPIDVRIIATMNEDPIDAIAGERMRKDLYYRLSVVTLIIPPLRERKEDI
LLLASEFIQKNNHLFQMNVEHISDDVKQFFLSYDWPGNIRELEHMIEGAMNFMTDEQTITASHLPYQYRMKIKPADIPEP
ETPRHQPAADLKEKMESFEKYVIENVLRKHGHNISKAAQELGISRQSLQYRLKKFSHSSNE
>Q9HX69 3.1.4.52~~~rocR~~~Cyclic di-GMP phosphodiesterase RocR~~~
MNDLNVLVLEDEPFQRLVAVTALKKVVPGSILEAADGKEAVAILESCGHVDIAICDLQMSGMDGLAFLRHASLSGKVHSV
ILSSEVDPILRQATISMIECLGLNFLGDLGKPFSLERITALLTRYNARRQDLPRQIEVAELPSVADVVRGLDNGEFEAYY
QPKVALDGGGLIGAEVLARWNHPHLGVLPPSHFLYVMETYNLVDKLFWQLFSQGLATRRKLAQLGQPINLAFNVHPSQLG
SRALAENISALLTEFHLPPSSVMFEITETGLISAPASSLENLVRLRIMGCGLAMDDFGAGYSSLDRLCEFPFSQIKLDRT
FVQKMKTQPRSCAVISSVVALAQALGISLVVEGVESDEQRVRLIELGCSIAQGYLFARPMPEQHFLDYCSGS
>A0A0H2ZKY7 ~~~rocS~~~Regulator of chromosome segregation~~~
MSIEMTVSEIAEVLGLSRQAINNRVKELPEEDTDKNDKGVTVVTRSGLIKLEEIYKKTIFEDEPVSEDVKQRELMEILVD
EKNAEILRLYEQLKAKDRQLSEKDEQMRIKDRQIAEKDKQLDQQQQLTLQAMKDQENLKLELDQAKEEVQSTKKGFFARL
FGG
>Q8DQ15 ~~~rocS~~~Regulator of chromosome segregation~~~
MSIEMTVSEIAEVLGLSRQAINNRVKELPEEDTDKNDKGVTVVTRSGLIKLEEIYKKTIFEDEPVSEDVKQRELMEILVD
EKNAEILRLYEQLKAKDRQLSEKDEQMRIKDRQIAEKDKQLDQQQQLTLQAMKDQENLKLELDQAKEEVQSTKKGFFARL
FGG
>P39604 2.4.1.129~~~rodA~~~Peptidoglycan glycosyltransferase RodA~~~COG0772
MSRYKKQQSPFYQGDLIFIFGVFFIISVVSIYAAGQFGQYGNTDWIQQIVFYLLGAVAITVLLYFDLEQLEKLSLYIFII
GILSLIILKISPESIAPVIKGAKSWFRIGRITIQPSEFMKVGLIMMLASVIGKANPKGVRTLRDDIHLLLKIAGVAVIPV
GLILMQDAGTAGICMFIVLVMVFMSGINWKLIAIIAGSGILLISLILLVMINFPDVAKSVGIQDYQIKRVTSWVSASNET
QEDSNDSWQVDQAIMAIGSGGILGNGISNLKVYVPESTTDFIFSIIGESFGFIGCAIVVIMFFFLIYRLVVLIDKIHPFN
RFASFFCVGYTALIVIHTFQNIGMNIGIMPVTGIPLLFVSYGGSSTLSTLIGFGIVYNASVQLTKYRSYLFNS
>P0ABG7 2.4.1.129~~~mrdB~~~Peptidoglycan glycosyltransferase MrdB~~~COG0772
MTDNPNKKTFWDKVHLDPTMLLILLALLVYSALVIWSASGQDIGMMERKIGQIAMGLVIMVVMAQIPPRVYEGWAPYLYI
ICIILLVAVDAFGAISKGAQRWLDLGIVRFQPSEIAKIAVPLMVARFINRDVCPPSLKNTGIALVLIFMPTLLVAAQPDL
GTSILVALSGLFVLFLSGLSWRLIGVAVVLVAAFIPILWFFLMHDYQRQRVMMLLDPESDPLGAGYHIIQSKIAIGSGGL
RGKGWLHGTQSQLEFLPERHTDFIFAVLAEELGLVGILILLALYILLIMRGLWIAARAQTTFGRVMAGGLMLILFVYVFV
NIGMVSGILPVVGVPLPLVSYGGSALIVLMAGFGIVMSIHTHRKMLSKSV
>P9WN99 2.4.1.129~~~rodA~~~Peptidoglycan glycosyltransferase RodA~~~COG0772
MTTRLQAPVAVTPPLPTRRNAELLLLCFAAVITFAALLVVQANQDQGVPWDLTSYGLAFLTLFGSAHLAIRRFAPYTDPL
LLPVVALLNGLGLVMIHRLDLVDNEIGEHRHPSANQQMLWTLVGVAAFALVVTFLKDHRQLARYGYICGLAGLVFLAVPA
LLPAALSEQNGAKIWIRLPGFSIQPAEFSKILLLIFFSAVLVAKRGLFTSAGKHLLGMTLPRPRDLAPLLAAWVISVGVM
VFEKDLGASLLLYTSFLVVVYLATQRFSWVVIGLTLFAAGTLVAYFIFEHVRLRVQTWLDPFADPDGTGYQIVQSLFSFA
TGGIFGTGLGNGQPDTVPAASTDFIIAAFGEELGLVGLTAILMLYTIVIIRGLRTAIATRDSFGKLLAAGLSSTLAIQLF
IVVGGVTRLIPLTGLTTPWMSYGGSSLLANYILLAILARISHGARRPLRTRPRNKSPITAAGTEVIERV
>P27434 ~~~rodZ~~~Cytoskeleton protein RodZ~~~COG1426
MNTEATHDQNEALTTGARLRNAREQLGLSQQAVAERLCLKVSTVRDIEEDKAPADLASTFLRGYIRSYARLVHIPEEELL
PGLEKQAPLRAAKVAPMQSFSLGKRRKKRDGWLMTFTWLVLFVVIGLSGAWWWQDRKAQQEEITTMADQSSAELSSNSEQ
GQSVPLNTSTTTDPATTSTPPASVDTTATNTQTPAVTAPAPAVDPQQNAVVSPSQANVDTAATPAPTAATTPDGAAPLPT
DQAGVTTPVADPNALVMNFTADCWLEVTDATGKKLFSGMQRKDGNLNLTGQAPYKLKIGAPAAVQIQYQGKPVDLSRFIR
TNQVARLTLNAEQSPAQ
>Q8DMX7 ~~~rodZ~~~Cytoskeleton protein RodZ~~~COG1426
MTSMRKKTIGEVLRLARINQGLSLDELQKKTEIQLDMLEAMEADDFDQLPSPFYTRSFLKKYAWAVELDDQIVLDAYDSG
SMITYEEVDVDEDELTGRRRSSKKKKKKTSFLPLFYFILFALSILIFVTYYVWNYIQTQPEEPSLSNYSVVQSTSSTSSV
PHSSSSSSSSIESAISVSGEGNHVEIAYKTSKETVKLQLAVSDVTSWVSVSESELEGGVTLSPKKKSAEATVATKSPVTI
TLGVVKGVDLTVDNQTVDLSKLTAQTGQITVTFTKN
>P0AFW8 ~~~rof~~~Protein rof~~~COG4568
MNDTYQPINCDDYDNLELACQHHLMLTLELKDGEKLQAKASDLVSRKNVEYLVVEAAGETRELRLDKITSFSHPEIGTVV
VSES
>O34857 ~~~rok~~~Repressor Rok~~~
MFNEREALRLRLEQLNEAEVKVIREYQIERDKIYAKLRELDRNGSPSEIKKDFRSEKKPDSLPVLAELAAQEIRSYQPQS
QQQSVQPQLQSISSLPAGIPDGTTRRRRGTARPGSKAAKLREAAIKTLKRHNAAIKSSELQKEIEKESGLEIPNMTTFMQ
SLIKMYPEVKKPYRGQYILEGEIESAESANE
>Q9F0J6 1.-.-.-~~~roo~~~Rubredoxin-oxygen oxidoreductase~~~
MQATKIIDGFHLVGAIDWNSRDFHGYTLSPMGTTYNAYLVEDEKTTLFDTVKAEYKGELLCGIASVIDPKKIDYLVIQHL
ELDHAGALPALIEACQPEKIFTSSLGQKAMESHFHYKDWPVQVVKHGETLSLGKRTVTFYETRMLHWPDSMVSWFADEKV
LISNDIFGQNIAASERFSDQIPVHTLERAMREYYANIVNPYAPQTLKAIETLVGAGVAPEFICPDHGVIFRGADQCTFAV
QKYVEYAEQKPTNKVVIFYDSMWHSTEKMARVLAESFRDEGCTVKLMWCKACHHSQIMSEISDAGAVIVGSPTHNNGILP
YVAGTLQYIKGLRPQNKIGGAFGSFGWSGESTKVLAEWLTGMGFDMPATPVKVKNVPTHADYEQLKTMAQTIARALKAKL
AA
>P03051 ~~~rop~~~Regulatory protein rop~~~
MTKQEKTALNMARFIRSQTLTLLEKLNELDADEQADICESLHDHADELYRSCLARFGDDGENL
>K4RFM2 2.1.1.343~~~rosA~~~8-amino-8-demethylriboflavin N,N-dimethyltransferase~~~COG2226
MRPEPTEHPERTAAQRLYQYNVDLKVAFVLYAVAKLHLPDLLADGPRTTADLAAATGSDPSRLRRLLRAAAGADALREVP
EDSFELAPMGDLLRSGHPRSMRGMTTFFAEPDVLAAYGDLVESVRTGVPAFQLRHREPLYDFLARPQHKEVRDEFDAAMV
EFGQYFADDFLTSFDFGRFTRFADIGGGRGQFLAGVLTAVPSSTGVLVDGPAVAASAHKFLASQNLTERVEVRIGDFFDV
LPTGCDAYVLRGVLEDWADADAVRLLVRIRQAMGDAPEARLLILDSVIGETGELGKVLDLDMLVLVEGEHRTRAQWDDLL
ARAGFDIVGIHPAGDVWAVIECRGTAG
>K4REZ6 2.6.1.114~~~rosB~~~8-demethyl-8-aminoriboflavin-5'-phosphate synthase~~~COG0655
MALKALILNTTLRRSPSRSQTQGLIDKAVPLYEKEGIETEVVRVIDHDIEQEYWDDYDDWNAGEKARREDEWPWLLEKIR
EADILVIATPITLNMCTSAAHVILEKLNLMDELNGDTKQFPLYNKVAGLLMCGNEDGAHHVAGTVLNNLGRLGYSVPPNA
AAYWLGPAGTGPGYIEGKGDRHFHTNKLIRFMVANTSHLARMLQETPYTTDLEACAQAAREESDDVFAIRVNVNTPAIRY
KRFQKLGEVKVEESQLG
>Q04152 ~~~ros~~~Transcriptional regulatory protein ros~~~COG4957
MTETAYGNAQDLLVELTADIVAAYVSNHVVPVTELPGLISDVHTALSGTSAPASVAVNVEKQKPAVSVRKSVQDDHIVCL
ECGGSFKSLKRHLTTHHSMTPEEYREKWDLPVDYPMVAPAYAEARSRLAKEMGLGQRRKANR
>Q5HF12 ~~~rot~~~HTH-type transcriptional regulator rot~~~
MHKLAHTSFGIVGMFVNTCIVAKYVIINWEMFSMKKVNNDTVFGILQLETLLGDINSIFSEIESEYKMSREEILILLTLW
QKGFMTLKEMDRFVEVKPYKRTRTYNNLVELEWIYKERPVDDERTVIIHFNEKLQQEKVELLNFISDAIASRATAMQNSL
NAIIAV
>Q7A514 ~~~rot~~~HTH-type transcriptional regulator rot~~~
MHKLAHTSFGIVGMFVNTCMVAKYVIINWEMFSMKKVNNDTVFGILQLETLLGDINSIFSEIESEYKMSREEILILLTLW
QKGSMTLKEMDRFVEVKPYKRTRTYNNLVELEWIYKERPVDDERTVIIHFNEKLQQEKVELLNFISDAIASRATAMQNSL
NAIIAV
>Q6GFT9 ~~~rot~~~HTH-type transcriptional regulator rot~~~
MHKLAHISFGIVGMFVNTCMVAKYVIINWEMYSMKKVNNDTVFGILQLETLLGDINSIFSEIESEYKMSREEILILLTLW
QKGSMTLKEMDRFVEVKPYKRTRTYNNLVELEWIYKERPVDDERTVIIHFNEKLQQEKVELLNFISDAIASRATAMQNSL
NAIIAV
>P27431 1.14.11.47~~~roxA~~~Ribosomal protein uL16 3-hydroxylase~~~COG2850
MEYQLTLNWPDFLERHWQKRPVVLKRGFNNFIDPISPDELAGLAMESEVDSRLVSHQDGKWQVSHGPFESYDHLGETNWS
LLVQAVNHWHEPTAALMRPFRELPDWRIDDLMISFSVPGGGVGPHLDQYDVFIIQGTGRRRWRVGEKLQMKQHCPHPDLL
QVDPFEAIIDEELEPGDILYIPPGFPHEGYALENAMNYSVGFRAPNTRELISGFADYVLQRELGGNYYSDPDVPPRAHPA
DVLPQEMDKLREMMLELINQPEHFKQWFGEFISQSRHELDIAPPEPPYQPDEIYDALKQGEVLVRLGGLRVLRIGDDVYA
NGEKIDSPHRPALDALASNIALTAENFGDALEDPSFLAMLAALVNSGYWFFEG
>Q5YTV5 1.14.13.211~~~rox~~~Rifampicin monooxygenase~~~COG0654
MIDVIIAGGGPTGLMLAGELRLHGVRTVVLEKEPTPNQHSRSRGLHARSIEVMDQRGLLERFLAHGEQFRVGGFFAGLAA
EWPADLDTAHSYVLAIPQVVTERLLTEHATELGAEIRRGCEVAGLDQDADGVTAELADGTRLRARYLVGCDGGRSTVRRL
LGVDFPGEPTRVETLLADVRIDVPVETLTAVVAEVRKTQLRFGAVPAGDGFFRLIVPAQGLSADRAAPTLDELKRCLHAT
AGTDFGVHSPRWLSRFGDATRLAERYRTGRVLLAGDAAHIHPPTGGQGLNLGIQDAFNLGWKLAAAIGGWAPPDLLDSYH
DERHPVAAEVLDNTRAQMTLLSLDPGPRAVRRLMAELVEFPDVNRHLIEKITAIAVRYDLGDGHDLVGRRLRDIPLTEGR
LYERMRGGRGLLLDRTGRLSVSGWSDRVDHLADPGAALDVPAALLRPDGHVAWVGEDQDDLLAHLPRWFGAAT
>P95598 1.14.13.211~~~iri~~~Rifampicin monooxygenase~~~
MSDVIIVGAGPTGLMLAGELRLQGVDVVVVDKDEEPTQFVRALGIHVRSIEIMEQRGLLDKFLAHGRKYPLGGFFAGISK
PAPAHLDTAHGYVLGIPQPEIDRILAEHATEVGADIQRGKRVVAIRQDTDNVAAELSDGTTLHARYLVGCDGGRSTVRKL
RSTSVFPASRTSADTLIGEMDVTMPADELAAVVAEIRETHKRFGVGPAGNGAFRVVVPAAEVADGRATPTTLDDIKQQLL
AIAGTDFGVHSPRWLSRFGDATRLADDYRRDRVFLAGDAAHIHPPMGGQGLNLGVQDAFNLGWKLAAEINGWAPVGLLDT
YESERRPVAADVLDNTRAQAELISTAAGPQAVRRLISELMEFEDVKRYLTEKITAISIRYDFGEGDDLLGRRLRNIALTR
GNLYDLMRSGRGLLLDQGGQLSVDGWSDRADHIVDTSTELEAPAVLLRPDGHVAWIGDAQAELDTQLSTWFGRSARDRA
>F2R776 1.14.13.211~~~rox~~~Rifampicin monooxygenase~~~COG0654
MFDVIVVGGGPTGLMLAGELRLHGVRVLVLEKETEPTRQSRAQGLHVRSIEVMAQRGLLERFLERGHTVAVGGFFAGLAT
SWPERLDTAHSYVLAVPQVITEQLLAEHATALGAEIRRGRALVGLRQDEDGVTVDLADGEQLRARYVVGCDGGRSTVRKL
LGVAFPGEPSRVETLLGEMEMTASQEELTSVMTEVRKTQQRFGAMPLGDGVFRVVVPAEGVAEDRTASPTLDEFKQQLRA
HAGTDFGVHSPRWLSRFGDATRQAERYRVDRVFLAGDAAHIHPPTGGQGLNLGIQDAFNLGWKLAAEVDGWAPEGLLDTY
HAERHPVATEVLDNTRAQIQLMSTEPGPQAVRRLMAELVEFENVNRYLIEKITAISVRYDVGEGHELLGRRMRDLALKHG
RLYERMHEGRGLLLDQTGRLSVAGWEDRVDHVVEVSEELDVPAVLLRPDGHVVWAGEDQQELLTRMPAWFGAATAG
>P62181 ~~~sigK~~~RNA polymerase sigma-28 factor~~~
MSLFAAIGYMVREVFVFVSYVKNNAFPQPLSSDDERKYLELMEQGDAQARNLLIEHNLRLVAHIVKKFENTGEDAEDLIS
IGTIGLIKAIESYSAGKGTKLATYAARCIENEILMHLRVLKKTKKDVSLHDPIGQDKEGNEISLIDILKSESEDVIDMIQ
LSMELEKIKEYIDILDEREKEVIVKRFGLGLDKEKTQREIAKALGISRSYVSRIEKRALMKMFHEFVRAEKEKKAKE
>P62179 ~~~sigE~~~RNA polymerase sigma-35 factor~~~
MMKLKFYLVYLWYKVLLKLGIKTDEIYYIGGSEALPPPLTKEEEEVLLNKLPKGDQAARSLLIERNLRLVVYIARKFENT
GINIEDLISIGTIGLIKAVNTFNPEKKIKLATYASRCIENEILMHLRRNNKNRSEVSFDEPLNIDWDGNELLLSDVLGTD
DDIITKDLEATVDRHLLMKALHQLNDREKQIMELRFGLAGGEEKTQKDVADMLGISQSYISRLEKRIIKRLRKEFNKMV
>P24219 ~~~sigL~~~RNA polymerase sigma-54 factor~~~COG1508
MDMKLQQVQVLKPQLTQELRQAITLLGYHSAELAEYIDELSLENPLIERKETDTPPLSYHKTNKNRMNAQEAGLQLSNPQ
KTLQDALKQQSLDMNLTNTEKKIFNYLIHSLDSNGYLEEDIEEAARRLSVSAKEAEAVLAKLQSLEPAGIGARSLQECIL
LQLQRLPNRNEQAEMLVSAHFDAFAQKKWKTLSVETGIPLHTIQDISDDIAALHPRPGLLFARPEQDVYIEPDIFITVKN
GHIAAELNTRSFPEIDLHPQYRTLLSSGSCQDTVSYLSAKYQEWRWLSRALRQRKQTITRIINELITRQKDFFLKGRSAM
KPLTLREVADCLSLHESTVSRAIKGKTIQTPYGLFEMKLFFSAKAEASGDGDASNYAVKTHLENLINQEDKTKPLSDQKL
VDLLYEQHGIQISRRTVAKYRDQMNIPSSAARKRYK
>P30332 ~~~rpoN1~~~RNA polymerase sigma-54 factor 1~~~COG1508
MALTQRLEFRQSQSLVMSPQLMQAIKLLQLSNLDLMTFVEEELECNPLLERASDDAAGAEAPTEVDQVSGDQLAEAQVRD
ARDGAMTTYTEWGGGGSGDEDYNLEAFVASETTLSDHLAEQLSVAFTAPAQRMIGQYLIDLVDEAGYLPPDLGQAAERLG
ATQEDVEHVLAVLQEFDPPGVCARNLRECLAIQLRELDRYDPAMQALVEHLDLLAKRDIASLRKLCGVDDEDIADMIDEL
RRLSPKPGMKFGSARLQTMVPDVYVRPAPDGGWHVELNSDTLPRVLVNQTYYSKLSKKIGKDVDKSYFNDALQNATWLVR
ALDQRARTILKVATEIVRQQDGFFTLGVAHLRPLNLKAVAEAIQMHESTVSRVTANKYMATNRGTFELKYFFTASIPSAD
GGEAHSAEAVRHRIKQLIESEEPSAVLSDDAIVERLRVSGIDIARRTVAKYREAMRIRSSVQRRRDNMWSTMNSRASGGT
GLDK
>P24255 ~~~rpoN~~~RNA polymerase sigma-54 factor~~~COG1508
MKQGLQLRLSQQLAMTPQLQQAIRLLQLSTLELQQELQQALESNPLLEQIDTHEEIDTRETQDSETLDTADALEQKEMPE
ELPLDASWDTIYTAGTPSGTSGDYIDDELPVYQGETTQTLQDYLMWQVELTPFSDTDRAIATSIVDAVDETGYLTVPLED
ILESIGDEEIDIDEVEAVLKRIQRFDPVGVAAKDLRDCLLIQLSQFDKTTPWLEEARLIISDHLDLLANHDFRTLMRVTR
LKEDVLKEAVNLIQSLDPRPGQSIQTGEPEYVIPDVLVRKHNGHWTVELNSDSIPRLQINQHYASMCNNARNDGDSQFIR
SNLQDAKWLIKSLESRNDTLLRVSRCIVEQQQAFFEQGEEYMKPMVLADIAQAVEMHESTISRVTTQKYLHSPRGIFELK
YFFSSHVNTEGGGEASSTAIRALVKKLIAAENPAKPLSDSKLTSLLSEQGIMVARRTVAKYRESLSIPPSNQRKQLV
>P06223 ~~~rpoN~~~RNA polymerase sigma-54 factor~~~COG1508
MKQGLQLRLSQQLAMTPQLQQAIRLLQLSTLELQQELQQALDSNPLLEQTDLHDEVETKEAEDRESLDTVDALEQKEMPE
ELPLDASWDEIYTAGTPSGNGVDYQDDELPVYQGETTQSLQDYLMWQVELTPFTDTDRAIATSIVDAVDDTGYLTISVED
IVESIGDDEIGLEEVEAVLKRIQRFDPVGVAAKDLRDCLLVQLSQFAKETPWIEEARLIISDHLDLLANHDFRSLMRVTR
LKEEVLKEAVNLIQSLDPRPGQSIQTGEPEYVIPDVLVRKVNDRWVVELNSDSLPRLKINQQYAAMGNSTRNDADGQFIR
SNLQEARWLIKSLESRNDTLLRVSRCIVEQQQAFFEQGEEFMKPMVLADIAQAVEMHESTISRVTTQKYLHSPRGIFELK
YFFSSHVNTEGGGEASSTAIRALVKKLIAAENPAKPLSDSKLTTMLSDQGIMVARRTVAKYRESLSIPPSNQRKQLV
>P49988 ~~~rpoN~~~RNA polymerase sigma-54 factor~~~
MKPSLVLKMGQQLTMTPQLQQAIRLLQLSTLDLQQEIQEALESNPMLERQEDGDDFDNSDPLADGAEQAASAPQESPLQE
SATPSVESLDDDQWSERIPSELPVDTAWEDIYQTSASSLPSNDDDEWDFTARTSSGESLHSHLLWQVNLAPMSDTDRMIA
VTIIDSINNDGYLEESLEEILAAIDPELDVELDEVEVVLRRIQQLEPAGIGARNLRECLLLQLRQLPSTTPWLNEALRLV
SDYLDLLGGRDYSQLMRRMKLKEDELRQVIELIQCLHPRPGSQIESSEAEYIVPDVIVRKDNERWLVELNQEAMPRLRVN
ATYAGMVRRADSSADNTFMRNQLQEARWFIKSLQSRNETLMKVATQIVEHQRGFLDYGEEAMKPLVLHDIAEAVGMHEST
ISRVTTQKYMHTPRGIFELKYFFSSHVSTAEGGECSSTAIRAIIKKLVAAENAKKPLSDSKIAGLLEAQGIQVARRTVAK
YRESLGIAPSSERKRLV
>P0A171 ~~~rpoN~~~RNA polymerase sigma-54 factor~~~COG1508
MKPSLVLKMGQQLTMTPQLQQAIRLLQLSTLDLQQEIQEALESNPMLERQEDGEDFDNSDPMADNAENKPAAEVQDNSFQ
ESTVSADNLEDGEWSERIPNELPVDTAWEDIYQTSASSLPSNDDDEWDFTTRTSAGESLQSHLLWQLNLAPMSDTDRLIA
VTLIDSINGQGYLEDTLEEICAGFDPELDIELDEVEAVLHRIQQFEPAGVGARNLGECLLLQLRQLPATTPWMTEAKRLV
TDFIDLLGSRDYSQLMRRMKIKEDELRQVIELVQSLNPRPGSQIESSEPEYVVPDVIVRKDSDRWLVELNQEAIPRLRVN
PQYAGFVRRADTSADNTFMRNQLQEARWFIKSLQSRNETLMKVATQIVEHQRGFLDHGDEAMKPLVLHDIAEAVGMHEST
ISRVTTQKYMHTPRGIYELKYFFSSHVSTSEGGECSSTAIRAIIKKLVAAENQKKPLSDSKIAGLLEAQGIQVARRTVAK
YRESLGIAPSSERKRLM
>P30333 ~~~rpoN2~~~RNA polymerase sigma-54 factor 2~~~COG1508
MALTQRLEFRQSQSLVMTPQLMQAIKLLQLSNLDLTTFVEEELERNPLLERANDEASGGEAPAEAGQFSDSDGGHNDEPG
GGPGEAFEPGQEEWMSKDLGTRAEIEQTLDTGLDNVFSEEPAEAAARNAQDAAPTTYTEWGGGASGDEDYNLEAFVAAEV
TLGDHLAEQLSVAFTAPAQRMIGQYLIDLVDEAGYLPPDLGQAAERLGASQQEVEDVLAVLQKFDPPGVCARNLSECLAI
QLRELDRYDPAMQALVEHLDLLAKRDIAGLRKVCGVDDEDIADMIGEIRRLNPKPGMKFGAARLQTMVPDVYVRPGPDGG
WHVELNSDTLPRVLVNQTYYSELSKKIGKDGDKSYFTDALQNATWLVRALDQRARTILKVATEIVRQQDGFFTHGVAHLR
PLNLKAVADAIQMHESTVSRVTANKYMATNRGTFELKYFFTASIASADGGEAHSAEAVRHHIKQLIDSEAPAAILSDDTI
VERLRASGIDIARRTVAKYREAMRIPSSVQRRRDKQSALGNVLSTAMSDRSRNPEPA
>Q31S42 ~~~rpaA~~~DNA-binding dual master transcriptional regulator RpaA~~~COG0745
MKPRILVIDDDSAILELVAVNLEMSGYDVRKAEDGIKGQALAVQLVPDLIMLDLMLPRVDGFTVCQRLRRDERTAEIPVL
MLTALGQTQDKVEGFNAGADDYLTKPFEVEEMLARVRALLRRTDRIPHAARHSEILSYGPLTLIPERFEAIWFNRTVKLT
HLEFELLHCLLQRHGQTVAPSEILKEVWGYDPDDDIETIRVHIRHLRTKLEPDPRHPRYIKTVYGAGYCLELPAETELHQ
HADQFPSAS
>Q55890 ~~~rpaA~~~DNA-binding dual master transcriptional regulator RpaA~~~COG0745
MPRILIIDDDPAISDLVSINLEMAGYDVQQAVDGIKGQALAVQLQPDLIMLDLMLPKVDGFTVCQRLRRDERTADIPVLM
LTALGQIQDKIQGFDSGADDYLTKPFDVEEMLARVRALLRRTDRIPQAAKHSEILNQGPLTLVPERFEAIWFGKSIKLTH
LEFELLHCLLQRHGQTVSPSDILREVWGYEPDDDIETIRVHIRHLRTKLEPNPRRPRFIKTVYGAGYCLELSTEEGGGSP
T
>Q6NCZ6 2.3.1.229~~~rpaI~~~4-coumaroyl-homoserine lactone synthase~~~COG3916
MQVHVIRRENRALYAGLLEKYFRIRHQIYVVERGWKELDRPDGREIDQFDTEDAVYLLGVDNDDIVAGMRMVPTTSPTLL
SDVFPQLALAGPVRRPDAYELSRIFVVPRKRGEHGGPRAEAVIQAAAMEYGLSIGLSAFTIVLETWWLPRLVDQGWKAKP
LGLPQDINGFSTTAVIVDVDDDAWVGICNRRSVPGPTLEWRGLEAIRRHSLPEFQVIS
>Q6NCZ5 ~~~rpaR~~~HTH-type quorum sensing-dependent transcriptional regulator RpaR~~~COG2197
MIVGEDQLWGRRALEFVDSVERLEAPALISRFESLIASCGFTAYIMAGLPSRNAGLPELTLANGWPRDWFDLYVSENFSA
VDPVPRHGATTVHPFVWSDAPYDRDRDPAAHRVMTRAAEFGLVEGYCIPLHYDDGSAAISMAGKDPDLSPAARGAMQLVS
IYAHSRLRALSRPKPIRRNRLTPRECEILQWAAQGKTAWEISVILCITERTVKFHLIEAARKLDAANRTAAVAKALTLGL
IRL
>P0AG07 5.1.3.1~~~rpe~~~Ribulose-phosphate 3-epimerase~~~COG0036
MKQYLIAPSILSADFARLGEDTAKALAAGADVVHFDVMDNHYVPNLTIGPMVLKSLRNYGITAPIDVHLMVKPVDRIVPD
FAAAGASIITFHPEASEHVDRTLQLIKENGCKAGLVFNPATPLSYLDYVMDKLDVILLMSVNPGFGGQSFIPQTLDKLRE
VRRRIDESGFDIRLEVDGGVKVNNIGEIAAAGADMFVAGSAIFDQPDYKKVIDEMRSELAKVSHE
>P9WI51 5.1.3.1~~~rpe~~~Ribulose-phosphate 3-epimerase~~~COG0036
MAGSTGGPLIAPSILAADFARLADEAAAVNGADWLHVDVMDGHFVPNLTIGLPVVESLLAVTDIPMDCHLMIDNPDRWAP
PYAEAGAYNVTFHAEATDNPVGVARDIRAAGAKAGISVKPGTPLEPYLDILPHFDTLLVMSVEPGFGGQRFIPEVLSKVR
AVRKMVDAGELTILVEIDGGINDDTIEQAAEAGVDCFVAGSAVYGADDPAAAVAALRRQAGAASLHLSL
>P74061 5.1.3.1~~~rpe~~~Ribulose-phosphate 3-epimerase~~~COG0036
MSKNIVVAPSILSADFSRLGEEIKAVDEAGADWIHVDVMDGRFVPNITIGPLIVDAIRPLTKKTLDVHLMIVEPEKYVED
FAKAGADIISVHVEHNASPHLHRTLCQIRELGKKAGAVLNPSTPLDFLEYVLPVCDLILIMSVNPGFGGQSFIPEVLPKI
RALRQMCDERGLDPWIEVDGGLKPNNTWQVLEAGANAIVAGSAVFNAPNYAEAIAGVRNSKRPEPQLATV
>Q6M6N7 3.-.-.-~~~rpf2~~~Resuscitation-promoting factor Rpf2~~~COG3583
MAPHQKSRINRINSTRSVPLRLATGGVLATLLIGGVTAAATKKDIIVDVNGEQMSLVTMSGTVEGVLAQAGVELGDQDIV
SPSLDSSISDEDTVTVRTAKQVALVVEGQIQNVTTTAVSVEDLLQEVGGITGADAVDADLSETIPESGLKVSVTKPKIIS
INDGGKVTYVSLAAQNVQEALELRDIELGAQDRINVPLDQQLKNNAAIQIDRVDNTEITETVSFDAEPTYVDDPEAPAGD
ETVVEEGAPGTKEVTRTVTTVNGQEESSTVINEVEITAAKPATISRGTKTVAANSVWDQLAQCESGGNWAINTGNGFSGG
LQFHPQTWLAYGGGAFSGDASGASREQQISIAEKVQAAQGWGAWPACTASLGIR
>P9WG31 3.-.-.-~~~rpfA~~~Resuscitation-promoting factor RpfA~~~COG1652
MSGRHRKPTTSNVSVAKIAFTGAVLGGGGIAMAAQATAATDGEWDQVARCESGGNWSINTGNGYLGGLQFTQSTWAAHGG
GEFAPSAQLASREQQIAVGERVLATQGRGAWPVCGRGLSNATPREVLPASAAMDAPLDAAAVNGEPAPLAPPPADPAPPV
ELAANDLPAPLGEPLPAAPADPAPPADLAPPAPADVAPPVELAVNDLPAPLGEPLPAAPADPAPPADLAPPAPADLAPPA
PADLAPPAPADLAPPVELAVNDLPAPLGEPLPAAPAELAPPADLAPASADLAPPAPADLAPPAPAELAPPAPADLAPPAA
VNEQTAPGDQPATAPGGPVGLATDLELPEPDPQPADAPPPGDVTEAPAETPQVSNIAYTKKLWQAIRAQDVCGNDALDSL
AQPYVIG
>H8EZH5 3.-.-.-~~~rpfB~~~Resuscitation-promoting factor RpfB~~~
MLRLVVGALLLVLAFAGGYAVAACKTVTLTVDGTAMRVTTMKSRVIDIVEENGFSVDDRDDLYPAAGVQVHDADTIVLRR
SRPLQISLDGHDAKQVWTTASTVDEALAQLAMTDTAPAAASRASRVPLSGMALPVVSAKTVQLNDGGLVRTVHLPAPNVA
GLLSAAGVPLLQSDHVVPAATAPIVEGMQIQVTRNRIKKVTERLPLPPNARRVEDPEMNMSREVVEDPGVPGTQDVTFAV
AEVNGVETGRLPVANVVVTPAHEAVVRVGTKPGTEVPPVIDGSIWDAIAGCEAGGNWAINTGNGYYGGVQFDQGTWEANG
GLRYAPRADLATREEQIAVAEVTRLRQGWGAWPVCAARAGAR
>P9WG29 3.-.-.-~~~rpfB~~~Resuscitation-promoting factor RpfB~~~COG3583
MLRLVVGALLLVLAFAGGYAVAACKTVTLTVDGTAMRVTTMKSRVIDIVEENGFSVDDRDDLYPAAGVQVHDADTIVLRR
SRPLQISLDGHDAKQVWTTASTVDEALAQLAMTDTAPAAASRASRVPLSGMALPVVSAKTVQLNDGGLVRTVHLPAPNVA
GLLSAAGVPLLQSDHVVPAATAPIVEGMQIQVTRNRIKKVTERLPLPPNARRVEDPEMNMSREVVEDPGVPGTQDVTFAV
AEVNGVETGRLPVANVVVTPAHEAVVRVGTKPGTEVPPVIDGSIWDAIAGCEAGGNWAINTGNGYYGGVQFDQGTWEANG
GLRYAPRADLATREEQIAVAEVTRLRQGWGAWPVCAARAGAR
>H8F3N4 3.-.-.-~~~rpfC~~~Resuscitation-promoting factor RpfC~~~
MTRIAKPLIKSAMAAGLVTASMSLSTAVAHAGPSPNWDAVAQCESGGNWAANTGNGKYGGLQFKPATWAAFGGVGNPAAA
SREQQIAVANRVLAEQGLDAWPTCGAASGLPIALWSKPAQGIKQIINEIIWAGIQASIPR
>O07747 3.-.-.-~~~rpfC~~~Resuscitation-promoting factor RpfC~~~COG1652
MHPLPADHGRSRCNRHPISPLSLIGNASATSGDMSSMTRIAKPLIKSAMAAGLVTASMSLSTAVAHAGPSPNWDAVAQCE
SGGNWAANTGNGKYGGLQFKPATWAAFGGVGNPAAASREQQIAVANRVLAEQGLDAWPTCGAASGLPIALWSKPAQGIKQ
IINEIIWAGIQASIPR
>P0C0F7 2.7.13.3~~~rpfC~~~Sensory/regulatory protein RpfC~~~
MKSPLPWLKRRLSGRADSEHAQNLIRIIITTLFISYLGWRYQHTHGDTLMATWLILVGELLVSLGLMVAILLRPQVSHTR
RLIGMLLDYTCTGAIMAIQGEPASPLYAVCMWVTIGNGLRYGSNYLRAATAMGSLCFLGAILISPYWKANPYLSWGLLLG
LIAVPLYFDSLLRAMTRAVREARHANQAKSRFLANMSHEFRTPLNGLSGMTEVLATTRLDAEQKECLNTIQASARSLLSL
VEEVLDISAIEAGKIRIDRRDFSLREMIGSVNLILQPQARGRRLEYGTQVADDVPDLLKGDTAHLRQVLLNLVGNAVKFT
EHGHVLLRVTRVSGSAEDAVRLRFDVEDTGIGVPMDMRPRLFEAFEQADVGLSRRYEGTGLGTTIAKGLVEAMGGSIGFK
ENQPSGSVFWFELPMAIGEPLKSSTVRVPTGALVDAPEELESSNIIAFSNPFLRHRARVRSMRMLVADDHEANRMVLQRL
LEKAGHKVLCVNGAEQVLDAMAEEDYDAVIVDLHMPGMNGLDMLKQLRVMQASGMRYTPVVVLSADVTPEAIRACEQAGA
RAFLAKPVLAAKLLDTLADLAVSTRQLATPATTVQVATSFEGVLDSSVLDELAALGMGEEFERQFVRQCLDDAQNCVGDI
ERDGTCSDWEQLRESAHALRGVASNLGLAQVASSGGELMRMADWQLQAEWRLRLSTLREQLKAGKDALDARVQGVKDGEC
SPRSNE
>P0C0F6 2.7.13.3~~~rpfC~~~Sensory/regulatory protein RpfC~~~COG0784
MKSPLPWLKRRLSGRADSEHAQNLIRIIITTLFISYLGWRYQHTHGDTLMATWLILVGELLVSLGLMVAILLRPQVSHTR
RLIGMLLDYTCTGAIMAIQGEPASPLYAVCMWVTIGNGLRYGSNYLRAATAMGSLCFLGAILISPYWKANPYLSWGLLLG
LIAVPLYFDSLLRAMTRAVREARHANQAKSRFLANMSHEFRTPLNGLSGMTEVLATTRLDAEQKECLNTIQASARSLLSL
VEEVLDISAIEAGKIRIDRRDFSLREMIGSVNLILQPQARGRRLEYGTQVADDVPDLLKGDTAHLRQVLLNLVGNAVKFT
EHGHVLLRVTRVSGSAEDAVRLRFDVEDTGIGVPMDMRPRLFEAFEQADVGLSRRYEGTGLGTTIAKGLVEAMGGSIGFK
ENQPSGSVFWFELPMAIGEPLKSSTVRVPTGALVDAPEELESSNIIAFSNPFLRHRARVRSMRMLVADDHEANRMVLQRL
LEKAGHKVLCVNGAEQVLDAMAEEDYDAVIVDLHMPGMNGLDMLKQLRVMQASGMRYTPVVVLSADVTPEAIRACEQAGA
RAFLAKPVVAAKLLDTLADLAVSTRQLATPATTVQVATSFEGVLDSSVLDELAALGMGEEFERQFVRQCLDDAQNCVGDI
ERDGTCSDWEQLRESAHALRGVASNLGLAQVASSGGELMRMADWQLQAEWRLRLSTLREQLKAGKDALDARVQGVKDGEC
SPRSNE
>P9WG27 3.-.-.-~~~rpfD~~~Resuscitation-promoting factor RpfD~~~COG1388
MTPGLLTTAGAGRPRDRCARIVCTVFIETAVVATMFVALLGLSTISSKADDIDWDAIAQCESGGNWAANTGNGLYGGLQI
SQATWDSNGGVGSPAAASPQQQIEVADNIMKTQGPGAWPKCSSCSQGDAPLGSLTHILTFLAAETGGCSGSRDD
>O53177 3.-.-.-~~~rpfE~~~Resuscitation-promoting factor RpfE~~~COG1388
MKNARTTLIAAAIAGTLVTTSPAGIANADDAGLDPNAAAGPDAVGFDPNLPPAPDAAPVDTPPAPEDAGFDPNLPPPLAP
DFLSPPAEEAPPVPVAYSVNWDAIAQCESGGNWSINTGNGYYGGLRFTAGTWRANGGSGSAANASREEQIRVAENVLRSQ
GIRAWPVCGRRG
>Q4UU85 3.1.4.-~~~rpfG~~~Cyclic di-GMP phosphodiesterase response regulator RpfG~~~
MQDVLGNPAGVSSAETWGSWSEKADLGLNIVIVDDQMSARTMLRHVIEDIAPELKVYDFGDPLDALSWCEAGRVDLLLLD
YRMPGMDGLEFARRLRRLPSHRDIPIILITIVGDEPIRQAALEAGVIDFLVKPIRPRELRARCSNLLQLRQQSESVKQRA
LSLEQRLLASMNEVEERERETLSRLARAIEYRDGGTSAFLERMSHVAGLVAEQLGLSEEEVRIIEMAAPLHDMGKIAIPD
SVLLKPGKLTEDEMNVMKRHPRIGYELLSGSQNRFIQVGALIALRHHERYDGSGYPDGLVGEAIPLEARIVAVADVFDAL
LSARPYKEAWTMDAALAYLYAQRGRLFDPRCVDALLRGRAQLEQICGQFSTASARPGV
>O86308 3.-.-.-~~~rpf~~~Resuscitation-promoting factor Rpf~~~
MDTMTLFTTSATRSRRATASIVAGMTLAGAAAVGFSAPAQAATVDTWDRLAECESNGTWDINTGNGFYGGVQFTLSSWQA
VGGEGYPHQASKAEQIKRAEILQDLQGWGAWPLCSQKLGLTQADADAGDVDATEAAPVAVERTATVQRQSAADEAAAEQA
AAAEQAVVAEAETIVVKSGDSLWTLANEYEVEGGWTALYEANKGAVSDAAVIYVGQELVLPQA
>Q6G3V6 5.3.1.6~~~rpiA~~~Ribose-5-phosphate isomerase A~~~COG0120
MNVQQLKKMAALKALEFVEDDMRLGIGSGSTVNEFIPLLGERVANGLRVTCVATSQYSEQLCHKFGVPISTLEKIPELDL
DIDGADEIGPEMTLIKGGGGALLHEKIVASASRAMFVIADETKMVKTLGAFALPIEVNPFGIHATRIAIEKAADNLGLSG
EITLRMNGDDPFKTDGGHFIFDAFWGRILQPKLLSEALLAIPGVVEHGLFLGLASRAIVAMADSQIKVLEPFDF
>P0A7Z0 5.3.1.6~~~rpiA~~~Ribose-5-phosphate isomerase A~~~COG0120
MTQDELKKAVGWAALQYVQPGTIVGVGTGSTAAHFIDALGTMKGQIEGAVSSSDASTEKLKSLGIHVFDLNEVDSLGIYV
DGADEINGHMQMIKGGGAALTREKIIASVAEKFICIADASKQVDILGKFPLPVEVIPMARSAVARQLVKLGGRPEYRQGV
VTDNGNVILDVHGMEILDPIAMENAINAIPGVVTVGLFANRGADVALIGTPDGVKTIVK
>Q5NFM5 5.3.1.6~~~rpiA~~~Ribose-5-phosphate isomerase A~~~COG0120
MFFNKKNNQDELKKLAATEAAKSITTEITLGVGTGSTVGFLIEELVNYRDKIKTVVSSSEDSTRKLKALGFDVVDLNYAG
EIDLYIDGADECNNHKELIKGGGAALTREKICVAAAKKFICIIDESKKVNTLGNFPLPIEVIPMARSYIARQIVKLGGQP
VYREQTITDNGNVILDVYNLKIDNPLKLETELNQITGVVTNGIFALKPADTVIMATKDSNIVVL
>A4IYN5 5.3.1.6~~~rpiA~~~Ribose-5-phosphate isomerase A~~~
MFFNKKNNQDELKKLAATEAAKSITTEITLGVGTGSTVGFLIEELVNYRDKIKTVVSSSEDSTRKLKALGFDVVDLNYAG
EIDLYIDGADECNNHKELIKGGGAALTREKICVAAAKKFICIIDESKKVNTLGNFPLPIEVIPMARSYIARQIVKLGGQP
VYREQTITDNGNVILDVYNLKIDNPLKLETELNQITGVVTNGIFALKPADTVIMATKDSNIVVL
>P44725 5.3.1.6~~~rpiA~~~Ribose-5-phosphate isomerase A~~~COG0120
MNQLEMKKLAAQAALQYVKADTIVGVGSGSTVNCFIEALGTIKDKIQGAVAASKESEELLRKQGIEVFNANDVSSLDIYV
DGADEINPQKMMIKGGGAALTREKIVAALAKKFICIVDSSKQVDVLGSTFPLPVEVIPMARSQVGRKLAALGGSPEYREG
VVTDNGNVILDVHNFSILNPVEIEKELNNVAGVVTNGIFALRGADVVIVGTPEGAKVID
>Q5ZZB7 5.3.1.6~~~rpiA~~~Ribose-5-phosphate isomerase A~~~COG0120
MSELKIKAAKAAIAYIEDDMVIGVGTGSTVNFFIKELAAIKHKIEACVASSKATEALLRAEGIPVIDLNSVQDLPIYVDG
ADEVNERGEMIKGGGGALTREKIVANVATQFICIVDESKVVKRLGEFPVAVEVIPMARSFVARQIVKLGGDPEYREGFVT
DNGNIILDVFNLSFSTPMALEDSLNVIPGVVENGVFAKRLADKVLVASASGVNNLK
>B4RL16 5.3.1.6~~~rpiA~~~Ribose-5-phosphate isomerase A~~~
MTTQDELKRIAAEKAVEFVPENEYIGIGTGSTINFFIEALGKSGKKIKGAVSTSKKSGELLARYDIPVVSLNEVSGLAVY
IDGADEVNHALQMIKGGGGAHLNEKIVASASEKFVCIADESKYVSRLGKFPLPVEAVESARSLVSRKLLAMGGQPELRIG
YTTFYGNQIVDVHGLNIDQPLTMEDEINKITGVLENGIFARDAADVLILGTEEGAKVIYPCQG
>Q9I6G1 5.3.1.6~~~rpiA~~~Ribose-5-phosphate isomerase A~~~
MNQDQLKQAVAQAAVDHILPHLDSKSIVGVGTGSTANFFIDALARHKAEFDGAVASSEATAKRLKEHGIPVYELNTVSEL
EFYVDGADESNERLELIKGGGAALTREKIVAAVAKTFICIADASKLVPILGQFPLPVEVIPMARSHVARQLVKLGGDPVY
REGVLTDNGNIILDVHNLRIDSPVELEEKINAIVGVVTNGLFAARPADLLLLGTADGVKTLKA
>B2FT30 5.3.1.6~~~rpiA~~~Ribose-5-phosphate isomerase A~~~COG0120
MSEAKRLAAEKAIEYVEDGMIVGVGTGSTVAYFIDALARIQHRIKGAVSSSEQSTARLKQHGIEVIELNHSGNLSLYVDG
ADECDANKCLIKGGGAALTREKIIAEASERFICIIDPSKQVPVLGRFPLPVEVIPMARSLVARQIRDMTGGQPTWREGVV
TDNGNQILDIHNLQITDPEKLERELNQLPGVVCVGLFARRRADVVIVGGEPPVVL
>Q8DTT9 5.3.1.6~~~rpiA~~~Ribose-5-phosphate isomerase A~~~COG0120
MEELKKIAGVRAAQYVEDGMIVGLGTGSTAYYFVEEVGRRVQEEGLQVIGVTTSSRTTAQAQALGIPLKSIDEVDSVDVT
VDGADEVDPNFNGIKGGGGALLMEKIVGTLTKDYIWVVDESKMVDTLGAFRLPVEVVQYGAERLFREFEKKGYKPSFREY
DGVRFVTDMKNFIIDLDLGSIPDPIAFGNMLDHQVGVVEHGLFNGMVNRVIVAGKDGVRILEANK
>Q72J47 5.3.1.6~~~rpiA~~~Ribose-5-phosphate isomerase A~~~COG0120
MERPLESYKKEAAHAAIAYVQDGMVVGLGTGSTARYAVLELARRLREGELKGVVGVPTSRATEELAKREGIPLVDLPPEG
VDLAIDGADEIAPGLALIKGMGGALLREKIVERAAKEFIVIADHTKKVPVLGRGPVPVEIVPFGYRATLKAIADLGGEPE
LRMDGDEFYFTDGGHLIADCRFGPIGDPLGLHRALLEIPGVVETGLFVGMATRALVAGPFGVEELLP
>Q7MHL9 5.3.1.6~~~rpiA~~~Ribose-5-phosphate isomerase A~~~COG0120
MTQDEMKKAAGWAALKYVEKGSIVGVGTGSTVNHFIDALGTMSEEIKGAVSSSVASTEKLEALGIKIFDCNEVASLDIYV
DGADEINADREMIKGGGAALTREKIVAAIADKFICIVDGTKAVDVLGTFPLPVEVIPMARSYVARQLVKLGGDPCYREGV
ITDNGNVILDVYGMKITNPKQLEDQINAIPGVVTVGLFAHRGADVVITGTPEGAKIEE
>P37351 5.3.1.6~~~rpiB~~~Ribose-5-phosphate isomerase B~~~COG0698
MKKIAFGCDHVGFILKHEIVAHLVERGVEVIDKGTWSSERTDYPHYASQVALAVAGGEVDGGILICGTGVGISIAANKFA
GIRAVVCSEPYSAQLSRQHNDTNVLAFGSRVVGLELAKMIVDAWLGAQYEGGRHQQRVEAITAIEQRRN
>Q92EU5 5.3.1.6~~~rpiB~~~Ribose-5-P isomerase B~~~COG0698
MKIAIGCDEMGYELKQTLITRLKEKNIEFTDFGSFENEKVLYPSIAEKVALEVKNNDYDRGILICGTGIGMAITANKIHG
IRAAQIHDSYSAERARKSNDAHIMTMGALVIGPSLAVSLLDTWLDSDFSGGRSQAKVDLMEEIDQKNR
>P47636 5.3.1.6~~~rpiB~~~Probable ribose-5-phosphate isomerase B~~~COG0698
MSFNIFIASDHTGLTLKKIISEHLKTKQFNVVDLGPNYFDANDDYPDFAFLVADKVKKNSDKDLGILICGTGVGVCMAAN
KVKGVLAALVVSEKTAALARQHDNANVLCLSSRFVTDSENIKIVDDFLKANFEGGRHQRRIDKIIRYEKETE
>P9WKD7 5.3.1.6~~~rpiB~~~Ribose-5-phosphate isomerase B~~~COG0698
MSGMRVYLGADHAGYELKQRIIEHLKQTGHEPIDCGALRYDADDDYPAFCIAAATRTVADPGSLGIVLGGSGNGEQIAAN
KVPGARCALAWSVQTAALAREHNNAQLIGIGGRMHTVAEALAIVDAFVTTPWSKAQRHQRRIDILAEYERTHEAPPVPGA
PA
>P0ACS7 ~~~rpiR~~~HTH-type transcriptional regulator RpiR~~~COG1737
MSQSEFDSALPNGIGLAPYLRMKQEGMTENESRIVEWLLKPGNLSCAPAIKDVAEALAVSEAMIVKVSKLLGFSGFRNLR
SALEDYFSQSEQVLPSELAFDEAPQDVVNKVFNITLRTIMEGQSIVNVDEIHRAARFFYQARQRDLYGAGGSNAICADVQ
HKFLRIGVRCQAYPDAHIMMMSASLLQEGDVVLVVTHSGRTSDVKAAVELAKKNGAKIICITHSYHSPIAKLADYIICSP
APETPLLGRNASARILQLTLLDAFFVSVAQLNIEQANINMQKTGAIVDFFSPGALK
>P31667 3.1.21.-~~~rpnA~~~Recombination-promoting nuclease RpnA~~~COG5464
MSKKQSSTPHDALFKLFLRQPDTARDFLAFHLPAPIHALCDMKTLKLESSSFIDDDLRESYSDVLWSVKTEQGPGYIYCL
IEHQSTSNKLIAFRMMRYAIAAMQNHLDAGYKTLPMVVPLLFYHGIESPYPYSLCWLDCFADPKLARQLYASAFPLIDVT
VMPDDEIMQHRRMALLELIQKHIRQRDLMGLVEQMACLLSSGYANDRQIKGLFNYILQTGDAVRFNDFIDGVAERSPKHK
ESLMTIAERLRQEGEQSKALHIAKIMLESGVPLADIMRFTGLSEEELAAASQ
>B7NGZ6 3.1.21.-~~~rpnD~~~Recombination-promoting nuclease RpnD~~~
MTNFTTSTPHDALFKTFLTHPDTARDFMEIHLPKDLRELCDLDSLKLESASFVDEKLRALHSDILWSVKTREGDGYIYVV
IEHQSREDIHMAFRLMRYSMAVMQRHIEHDKRQPLPLVIPMLFYHGSRSPYPWSLCWLDEFADPTTARKLYNAAFPLVDV
TVVPDDEIVQHRRVALLELIQKHIRQRDLMGLIDQLVVLLVTECANDSQITALLNYILLTGDEARFNEFISELTRRMPQH
RERIMTIAERIHNDGYIKGEQRILRLLLQNGADPEWIQKITGLSAEQMQALRQPLPERERYSWLKS
>Q2A5E5 2.7.7.6~~~rpoA1~~~DNA-directed RNA polymerase subunit alpha 1~~~
MSNNNSKLEFVPNIQLKEDLGAFSYKVQLSPVEKGMAHILGNSIRRVLLSSLSGASIIKVNIANVLHEYSTLEDVKEDVV
EIVSNLKKVAIKLDTGIDRLDLELSVNKSGVVSAGDFKTTQGVEIINKDQPIATLTNQRAFSLTATVSVGRNVGILSAIP
TELERVGDIAVDADFNPIKRVAFEVFDNGDSETLEVFVKTNGTIEPLAAVTKALEYFCEQISVFVSLRVPSNGKTGDVLI
DSNIDPILLKPIDDLELTVRSSNCLRAENIKYLGDLVQYSESQLMKIPNLGKKSLNEIKQILIDNNLSLGVQIDNFRELV
EGK
>Q2A4H7 2.7.7.6~~~rpoA2~~~DNA-directed RNA polymerase subunit alpha 2~~~
MALENLLHPTNIKIDEYAKNATKFSFEALERGVGYTLGFALKQTMLYSIAGACVTSIKINDGKVTSLEDVIPCDETVADI
ILNVKSLSVTLAEDVETGTITFELSGSEEEIFSEEAKLSEGLAITEEVFICSYNGGKKLKIEAKVEKGVGFRPAQDNFKD
GEFLLDATFSPVVFCDFEIKDARVGRRTDLDKLELNIKTNGNVNCEEALRLAATKIQNQLRNIVDIEEINKGIFVEDPKD
INPILLKHVEELNLTARSSNCLKAVNIRLIGELVQKTENELLKAPNFGKKSLTEIKDKLSELGLSLGTLIENWPQDL
>P20429 2.7.7.6~~~rpoA~~~DNA-directed RNA polymerase subunit alpha~~~COG0202
MIEIEKPKIETVEISDDAKFGKFVVEPLERGYGTTLGNSLRRILLSSLPGAAVTSIQIDGVLHEFSTIEGVVEDVTTIIL
HIKKLALKIYSDEEKTLEIDVQGEGTVTAADITHDSDVEILNPDLHIATLGENASFRVRLTAQRGRGYTPADANKRDDQP
IGVIPIDSIYTPVSRVSYQVENTRVGQVANYDKLTLDVWTDGSTGPKEAIALGSKILTEHLNIFVGLTDEAQHAEIMVEK
EEDQKEKVLEMTIEELDLSVRSYNCLKRAGINTVQELANKTEEDMMKVRNLGRKSLEEVKAKLEELGLGLRKDD
>Q9PM80 2.7.7.6~~~rpoA~~~DNA-directed RNA polymerase subunit alpha~~~COG0202
MRNITTSAYTPTEFTIENISDTVAKISAWPFEIGYGITLAHPLRRLLYTSTIGYAPTAIHIDGVAHEFDSMRGMLEDVAL
FIINLKKLRFKIKGDSNKEIVEFSFKGSKEIYGKDLNNDQVEVVNKDAYLATINEDAELKFTLIVEKGIGYVPSEEIKEL
INDPKFIALDAFFTPVREATYDIEKVLFEDNPDYEKVVLTVTTDGQITPNEAFQNALEAMYKQLSVFDKITNVRSVIKNQ
ATSNELENTKLLQNITDLNLSARSYNCLEKAGVVYIGELALMSVSELAGLKNLGKKSLDEIKNIMESIGFPVGTSKLSDN
KEILKNKIAELKAQNEG
>B8H4F8 2.7.7.6~~~rpoA~~~DNA-directed RNA polymerase subunit alpha~~~
MIERNWNELIRPEKPQIETGADATRKARIVAEPLERGFGVTLGNALRRVLLSSLQGAAVTAIQIDGVVHEFSSLEGVRED
VVDIVLNIKQLAVRMHAEGPKRMTLRATGPGPVTAGQIETPADIEILNPDHVLCTLDDGASVRMEFTVNNGKGYVPADRN
RPEDAPIGLIAVDALYSPVKRVAYRVEPTRQGQSLDYDKLILEVETNGAVTPVDAVAYAARILQDQLQIFITFEEPKAKS
ADESKPELPFNPALLKKVDELELSVRSANCLKNDNIVYIGDLIQKTEAEMLRTPNFGRKSLNEIKEVLAGMGLHLGMDVP
NWPPENIEDLAKKFEDQI
>Q18CI5 2.7.7.6~~~rpoA~~~DNA-directed RNA polymerase subunit alpha~~~COG0202
MIEIEKPKVDIVELSEDYRYGKFVIEPLERGYGITIGNALRRILLSSLPGVAVNAIKIDGVLHEFSTIPGVKEDVTEIIL
TLKELSATIDGEGSRTLKIEAQGPCSITGADIICPPDVEILSKDLAIATLDDNAKLNMEIFVDKGRGYVSAEENKTENVP
IGVLPVDSIYTPVEKVSYHVENTRVGQKTDYDKLVLEVWTNGSINPQEGISLAAKVLVEHLNLFIDLTEHVSSVEIMVEK
EEDQKEKVLEMTIEELDLSVRSYNCLKRAGINTVEELANKSEDDMMKVRNLGKKSLEEVIQKLEELGLGLKPSEE
>Q72CF4 2.7.7.6~~~rpoA~~~DNA-directed RNA polymerase subunit alpha~~~COG0202
MLIKQGDRLINTRNWSELVKPEQISRDGEVSDTMYGKFVCEPLERGYATTIGNAMRRVLLSSLQGAAFVAVKISGVQHEF
TTIPGVLEDVTDVVLNLKQVRLAMDTEEPQYLELKVDKRGAITAGDVRTNQHVMVLNPDQHIATLTEDIELTFELEVRMG
KGYVPADMHEGLSEEIGLIKLDASFSPVRKVAYTVEQARVGQMTNYDKLILEVWTDGSVSPEDAIAYSAKIIKDQISVFI
NFDERISGENSNGSADSGEFNEHLFKSIDELELSVRATNCLKSANIALVGELVQKSENEMLKTKNFGRKSLDEIRRVLGD
MGLDFGTKVDGFEKKYQEWKRKQQHEA
>A7ZSI4 2.7.7.6~~~rpoA~~~DNA-directed RNA polymerase subunit alpha~~~
MQGSVTEFLKPRLVDIEQVSSTHAKVTLEPLERGFGHTLGNALRRILLSSMPGCAVTEVEIDGVLHEYSTKEGVQEDILE
ILLNLKGLAVRVQGKDEVILTLNKSGIGPVTAADITHDGDVEIVKPQHVICHLTDENASISMRIKVQRGRGYVPASTRIH
SEEDERPIGRLLVDACYSPVERIAYNVEAARVEQRTDLDKLVIEMETNGTIDPEEAIRRAATILAEQLEAFVDLRDVRQP
EVKEEKPEFDPILLRPVDDLELTVRSANCLKAEAIHYIGDLVQRTEVELLKTPNLGKKSLTEIKDVLASRGLSLGMRLEN
WPPASIADE
>P0A7Z6 2.7.7.6~~~rpoA~~~DNA-directed RNA polymerase subunit alpha~~~COG0202
MQGSVTEFLKPRLVDIEQVSSTHAKVTLEPLERGFGHTLGNALRRILLSSMPGCAVTEVEIDGVLHEYSTKEGVQEDILE
ILLNLKGLAVRVQGKDEVILTLNKSGIGPVTAADITHDGDVEIVKPQHVICHLTDENASISMRIKVQRGRGYVPASTRIH
SEEDERPIGRLLVDACYSPVERIAYNVEAARVEQRTDLDKLVIEMETNGTIDPEEAIRRAATILAEQLEAFVDLRDVRQP
EVKEEKPEFDPILLRPVDDLELTVRSANCLKAEAIHYIGDLVQRTEVELLKTPNLGKKSLTEIKDVLASRGLSLGMRLEN
WPPASIADE
>P0A7Z4 2.7.7.6~~~rpoA~~~DNA-directed RNA polymerase subunit alpha~~~COG0202
MQGSVTEFLKPRLVDIEQVSSTHAKVTLEPLERGFGHTLGNALRRILLSSMPGCAVTEVEIDGVLHEYSTKEGVQEDILE
ILLNLKGLAVRVQGKDEVILTLNKSGIGPVTAADITHDGDVEIVKPQHVICHLTDENASISMRIKVQRGRGYVPASTRIH
SEEDERPIGRLLVDACYSPVERIAYNVEAARVEQRTDLDKLVIEMETNGTIDPEEAIRRAATILAEQLEAFVDLRDVRQP
EVKEEKPEFDPILLRPVDDLELTVRSANCLKAEAIHYIGDLVQRTEVELLKTPNLGKKSLTEIKDVLASRGLSLGMRLEN
WPPASIADE
>Q9ZJT5 2.7.7.6~~~rpoA~~~DNA-directed RNA polymerase subunit alpha~~~COG0202
MKVIKTAPLIPSEIKVLEKEGNRVKISLAPFEFGYAVTLAHPIRRLLLLSSVGYAPVGLKIEGVHHEFDSLRGVTEDVSL
FIMNLKNIRFIAKALVGQDSSLENQSVVVDYSFKGPMELRARDLNSDHIEIVNPEMPLATINEDAQLNFSLIIYKGMGYV
PSENTRELMPEGYMPLDGSFTPIKNVVYEIENVLVEGDPNYEKIIFDIETDGQIDPYKAFLSAVKVMSKQLGVFGERPIA
NTEYSGDYAQRDDAKDLSAKIESMNLSARCFNCLDKIGIKYVGELVLMSEEELKGVKNMGKKSYDEIAEKLNDLGYPVGT
ELSPEQRESLKKRLEKLEDKGGND
>A0QSL8 2.7.7.6~~~rpoA~~~DNA-directed RNA polymerase subunit alpha~~~COG0202
MLISQRPTLSEETVAENRSRFVIEPLEPGFGYTLGNSLRRTLLSSIPGAAVTSIRIDGVLHEFTTVPGVKEDVTDIILNL
KGLVVSSDDDEPVTMYLRKQGPGVVTAGDIVPPAGVTVHNPDMHIATLNDKGKLEVELVVERGRGYVPAVQNKASGAEIG
RIPVDSIYSPVLKVTYKVEATRVEQRTDFDKLIIDVETKNSISPRDALASAGGTLVELFGLARELNADSEHIEIGPSPAE
ADHIASFALPIDDLDLTVRSYNCLKREGVHTVGELVARTESDLLDIRNFGQKSIDEVKIKLHQLGLSLKDSPATFDPSEV
AGYDAATGTWTSDAGYDLDDNQDYAETEQL
>P9WGZ1 2.7.7.6~~~rpoA~~~DNA-directed RNA polymerase subunit alpha~~~COG0202
MLISQRPTLSEDVLTDNRSQFVIEPLEPGFGYTLGNSLRRTLLSSIPGAAVTSIRIDGVLHEFTTVPGVKEDVTEIILNL
KSLVVSSEEDEPVTMYLRKQGPGEVTAGDIVPPAGVTVHNPGMHIATLNDKGKLEVELVVERGRGYVPAVQNRASGAEIG
RIPVDSIYSPVLKVTYKVDATRVEQRTDFDKLILDVETKNSISPRDALASAGKTLVELFGLARELNVEAEGIEIGPSPAE
ADHIASFALPIDDLDLTVRSYNCLKREGVHTVGELVARTESDLLDIRNFGQKSIDEVKIKLHQLGLSLKDSPPSFDPSEV
AGYDVATGTWSTEGAYDEQDYAETEQL
>O52760 2.7.7.6~~~rpoA~~~DNA-directed RNA polymerase subunit alpha~~~
MQSSVNEFLTPRHIDVQVVSQTRAKITLEPLERGFGHTLGNALRRILLSSMPGCAVVEAEIDGVLHEYSAIEGVQEDVIE
ILLNLKGLAIKLHGRDEVTLTLAKKGSGVVTAADIQLDHDVEIINGDHVIANLADNGALNMKLKVARGRGYEPADARQSD
EDESRSIGRLQLDASFSPVRRVSYVVENARVEQRTNLDKLVLDLETNGTLDPEEAIRRAATILQQQLAAFVDLKGDSEPV
VEEQEDEIDPILLRPVDDLELTVRSANCLKAENIYYIGDLIQRTEVELLKTPNLGKKSLTEIKDVLASRGLSLGMRLDNW
PPASLKKDDKATA
>Q925Z2 2.7.7.6~~~rpoA~~~DNA-directed RNA polymerase subunit alpha~~~COG0202
MIQKNWQELIKPNKVEFASSGRTKATLVAEPLERGFGLTLGNALRRVLLSSLRGAAVTAVQIDGVLHEFSSIPGVREDVT
DIVLNIKEIAIKMDGDDAKRMVVRKQGPGVVTAGDIQTVGDIEILNPNHVICTLDEGAEIRMEFTVNNGKGYVPADRNRS
EDAPIGLIPVDSLYSPVKKVSYKVENTREGQVLDYDKLTMSIETDGSVTGEDAIAFAARILQDQLSVFVNFDEPQKETEE
EAVTELAFNPALLKKVDELELSVRSANCLKNDNIVYIGDLIQKTEAEMLRTPNFGRKSLNEIKEVLASMGLHLGMEVPSW
PPENIEDLAKRYEDQY
>P66706 2.7.7.6~~~rpoA~~~DNA-directed RNA polymerase subunit alpha~~~
MIEIEKPRIETIEISEDAKFGKFVVEPLERGYGTTLGNSLRRILLSSLPGAAVKYIEIEGVLHEFSAVDNVVEDVSTIIM
NIKQLALKIYSEEDKTLEIDVRDEGEVTASDITHDSDVEILNPELKIATVSKGGHLKIRLVANKGRGYALAEQNNTSDLP
IGVIPVDSLYSPVERVNYTVENTRVGQSSDFDKLTLDVWTNGSITPQESVSLAAKIMTEHLNIFVGLTDEAQNAEIMIEK
EEDQKEKVLEMSIEELDLSVRSYNCLKRAGINSVQELADKSEADMMKVRNLGRKSLEEVKYKLEDLGLGLRKED
>P60312 2.7.7.6~~~rpoA~~~DNA-directed RNA polymerase subunit alpha~~~COG0202
MLIAQRPSLTEEVVDEFRSRFVIEPLEPGFGYTLGNSLRRTLLSSIPGAAVTSIRIDGVLHEFTTVPGVKEDVTDLILNI
KQLVVSSEHDEPVVMYLRKQGPGLVTAADIAPPAGVEVHNPDLVLATLNGKGKLEMELTVERGRGYVSAVQNKQVGQEIG
RIPVDSIYSPVLKVTYKVEATRVEQRTDFDKLIVDVETKQAMRPRDAMASAGKTLVELFGLARELNIDAEGIDMGPSPTD
AALAADLALPIEELELTVRSYNCLKREGIHSVGELVARSEADLLDIRNFGAKSIDEVKAKLAGMGLALKDSPPGFDPTAA
ADAFGADDDADAGFVETEQY
>Q9X4V6 2.7.7.6~~~rpoA~~~DNA-directed RNA polymerase subunit alpha~~~
MLIAQRPSLTEEVVDEFRSRFVIEPLEPGFGYTLGNSLRRTLLSSIPGAAVTSIRIDGVLHEFTTVPGVKEDVTDLILNI
KQLVVSSEHDEPVVMYLRKQGPGLVTAADIAPPAGVEVHNPDLVLATLNAKGKLEMELTVERGRGYVSAVQNKQVGQEIG
RIPVDSIYSPVLKVTYKVEATRVEQRTDFDKLIVDVETKQAMRPRDAMASAGKTLVELFGLARELNIDAEGIDMGPSPTD
AALAADLALPIEELELTVRSYNCLKREGIHSVGELVARSEADLLDIRNFGAKSIDEVKAKLAGMGLALKDSPPGFDPTAA
ADAFGADDDADAGFVETEQY
>P66709 2.7.7.6~~~rpoA~~~DNA-directed RNA polymerase subunit alpha~~~COG0202
MIEFEKPNITKIDENKDYGKFVIEPLERGYGTTLGNSLRRVLLASLPGAAVTSINIDGVLHEFDTVPGVREDVMQIILNI
KGIAVKSYVEDEKIIELDVEGPAEVTAGDILTDSDIEIVNPDHYLFTIGEGSSLKATMTVNSGRGYVPADENKKDNAPVG
TLAVDSIYTPVTKVNYQVEPARVGSNDGFDKLTLEILTNGTIIPEDALGLSARILTEHLDLFTNLTEIAKSTEVMKEADT
ESDDRILDRTIEELDLSVRSYNCLKRAGINTVHDLTEKSEAEMMKVRNLGRKSLEEVKLKLIDLGLGLKDK
>P73297 2.7.7.6~~~rpoA~~~DNA-directed RNA polymerase subunit alpha~~~COG0202
MAQFQIECVESSTRKNQQQYSKFSLEPLDRGQGTTVGNALRRVLLSNLPGAAVTAIRIAGVNHEFATILGVREDVLEIML
NMKELVLKSYTDQPQIGRLTAIGPGTVTAAQFEVPSEVEVIDPNQYIATLAEGAKLEMEFRVERGVGYRVIERGKDENSS
LDFLQIDSVFMPVTKVNYTVEDIRADGMSPKDRLILDIWTNGSIQPREALSEASDIIANLFIPLKDLNELEAAHSDYQDE
VNPESQIPIEELQLSVRAYNCLKRAQINSVADLLEYSQEDLLEIKNFGLKSAEEVIEALQKRLGITLPHEKAKA
>Q9KWU8 2.7.7.6~~~rpoA~~~DNA-directed RNA polymerase subunit alpha~~~
MLESKLKAPVFTATTQGDHYGEFVLEPLERGFGVTLGNPLRRILLSSIPGTAVTSVYIEDVLHEFSTIPGVKEDVVEIIL
NLKELVVRFLDPKMASTTLILRAEGPKEVRAGDFTPSADVEIMNPDLHIATLEEGGKLYMEVRVDRGVGYVPAERHGIKD
RINAIPVDAIFSPVRRVAFQVEDTRLGQRTDLDKLTLRIWTDGSVTPLEALNQAVAILKEHLNYFANPEASLLPTPEVSK
GEKRESAEEDLDLPLEELGLSTRVLHSLKEEGIESVRALLALNLKDLRNIPGIGERSLEEIRQALAKKGFTLKE
>Q5SHR6 2.7.7.6~~~rpoA~~~DNA-directed RNA polymerase subunit alpha~~~COG0202
MLDSKLKAPVFTVRTQGREYGEFVLEPLERGFGVTLGNPLRRILLSSIPGTAVTSVYIEDVLHEFSTIPGVKEDVVEIIL
NLKELVVRFLNPSLQTVTLLLKAEGPKEVKARDFLPVADVEIMNPDLHIATLEEGGRLNMEVRVDRGVGYVPAEKHGIKD
RINAIPVDAVFSPVRRVAFQVEDTRLGQRTDLDKLTLRIWTDGSVTPLEALNQAVEILREHLTYFSNPQAAAVAAPEEAK
EPEAPPEQEEELDLPLEELGLSTRVLHSLKEEGIESVRALLALNLKDLKNIPGIGERSLEEIKEALEKKGFTLKE
>Q9Z9H6 2.7.7.6~~~rpoA~~~DNA-directed RNA polymerase subunit alpha~~~
MLDSKLKAPVFTVRTQGREYGEFVLEPLERGFGVTLGNPLRRILLSSIPGTAVTSVYIEDVLHEFSTIPGVKEDVVEIIL
NLKELVVRFLNPSLQTVTLLLKAEGPKEVKARDFLPVADVEIMNPDLHIATLEEGGRLNMEVRVDRGVGYVPAEKHGIKD
RINAIPVDAVFSPVRRVAFQVEDTRLGQRTDLDKLTLRIWTDGSVTPLEALNQAVEILREHLTYFSNPQAAAVAAPEEAK
EPEAPPEQEEELDLPLEELGLSTRVLHSLKEEGIESVRALLALNLKDLKNIPGIGERSLEEIKEALEKKGFTLKE
>P0A0Y1 2.7.7.6~~~rpoA~~~DNA-directed RNA polymerase subunit alpha~~~COG0202
MTVTANQVLRPRGPQIERLTDNRAKVVIEPLERGYGHTLGNALRRVLLSSIPGFAITEVEIDGVLHEYTTVEGLQEDVLD
VLLNLKDVAIRMHSGDSATLSLSKQGPGTVTAADIRTDHNVEIINGDHVICHLTKDTALNMRLKIERGFGYQPAAARRRP
DEETRTIGRLMLDASFSPVRRVAYAVEAARVEQRTDLDKLVIDIETNGTIDAEEAVRTAADILSDQLSVFGDFTHRDRGA
AKPAASGVDPVLLRPIDDLELTVRSANCLKAESIYYIGDLIQKTEVELLKTPNLGKKSLTEIKEVLAQRGLALGMKLENW
PPAGVAQHGMLG
>B2SQT4 2.7.7.6~~~rpoA~~~DNA-directed RNA polymerase subunit alpha~~~COG0202
MTVTANQVLRPRGPQIERLTDNRAKVVIEPLERGYGHTLGNALRRVLLSSIPGFAITEVEIDGVLHEYTTVEGLQEDVLD
VLLNLKDVAIRMHSGDSATLSLSKQGPGTVTAADIRTDHNVEIINGDHVICHLTKDTALNMRLKIERGFGYQPAAARRRP
DEETRTIGRLMLDASFSPVRRVAYAVEAARVEQRTDLDKLVIDIETNGTIDAEEAVRTAADILSDQLSVFGDFTHRDRGA
AKPAASGVDPVLLRPIDDLELTVRSANCLKAESIYYIGDLIQKTEVELLKTPNLGKKSLTEIKEVLAQRGLALGMKLENW
PPAGVAQHGMLG
>Q664U6 2.7.7.6~~~rpoA~~~DNA-directed RNA polymerase subunit alpha~~~
MQGSVTEFLKPRLVDIEQVSSTHAKVTLEPLERGFGHTLGNALRRILLSSMPGCAVTEVEIDGVLHEYSTKEGVQEDILE
ILLNLKGLAVRVQGKDEVILTLNKSGIGPVTAADITHDGDVEIVKPQHVICHLTDENASINMRIKVQRGRGYVPASARIH
SEEDERPIGRLLVDACYSPVERIAYNVEAARVEQRTDLDKLVIEMETNGTIDPEEAIRRAATILAEQLEAFVDLRDVRQP
EVKEEKPEFDPILLRPVDDLELTVRSANCLKAEAIHYIGDLVQRTEVELLKTPNLGKKSLTEIKDVLASRGLSLGMRLEN
WPPASIADE
>O25806 2.7.7.6~~~rpoBC~~~Bifunctional DNA-directed RNA polymerase subunit beta-beta'~~~COG0085
MSKKIPLKNRLRADFTKTPTDLEVPNLLLLQRDSYDSFLYSKEGKESGIEKVFKSIFPIQDEHNRITLEYAGCEFGKSKY
TVREAMERGITYSIPLKIKVRLILWEKDTKSGEKNGIKDIKEQSIFIREIPLMTERTSFIINGVERVVVNQLHRSPGVIF
KEEESSTSLNKLIYTGQIIPDRGSWLYFEYDSKDVLYARINKRRKVPVTILFRAMDYQKQDIIKMFYPLVKVRYENDKYL
IPFASLDANQRMEFDLKDPQGKVILLAGKKLTSRKIKELKENHLEWVEYPMDILLNRHLAEPVMVGKEVLLDMLTQLDKN
KLEKIHDLGVQEFVIINDLALGHDASIIQSFSADSESLKLLKQTEKIDDENALAAIRIHKVMKPGDPVTTEVAKQFVKKL
FFDPERYDLTMVGRMKMNHKLGLHVPDYITTLTHEDIITTVKYLMKIKNNQGKIDDRDHLGNRRIRAVGELLANELHSGL
VKMQKTIKDKLTTMSGAFDSLMPHDLVNSKMITSTIMEFFMGGQLSQFMDQTNPLSEVTHKRRLSALGEGGLVKDRVGFE
ARDVHPTHYGRICPIETPEGQNIGLINTLSTFTRVNDLGFIEAPYKKVVDGKVVGETIYLTAIQEDSHIIAPASTPIDEE
GNILGDLIETRVEGEIVLNEKSKVTLMDLSSSMLVGVAASLIPFLEHDDANRALMGTNMQRQAVPLLRSDAPIVGTGIEK
IIARDSWGAIKANRAGVVEKIDSKNIYILGESKEEAYIDAYSLQKNLRTNQNTSFNQVPIVKVGDKVGAGQIIADGPSMD
RGELALGKNVRVAFMPWNGYNFEDAIVVSECITKDDIFTSTHIYEKEVDARELKHGVEEFTADIPDVKEEALAHLDESGI
VKVGTYVSAGMILVGKTSPKGEIKSTPEERLLRAIFGDKAGHVVNKSLYCPPSLEGTVIDVKVFTKKGYEKDARVLSAYE
EEKAKLDMEHFDRLTMLNREELLRVSSLLSQAILEEPFSHNGKDYKEGDQIPKEEIASINRFTLASLVKKYSKEVQNHYE
ITKNNFLEQKKVLGEEHEEKLSILEKDDILPNGVIKKVKLYIATKRKLKVGDKMAGRHGNKGIVSNIVPVADMPYTADGE
PVDIVLNPLGVPSRMNIGQILEMHLGLVGKEFGKQIARMLEDKTKDFAKELRAKMLEIANAINEKDPLTIHALENCSDEE
LLEYAKDWSKGVKMAIPVFEGISQEKFYKLFELAKIAMDGKMDLYDGRTGEKMRERVNVGYMYMIKLHHLVDEKVHARST
GPYSLVTHQPVGGKALFGGQRFGEMEVWALEAYGAAHTLKEMLTIKSDDIRGRENAYRAIAKGEQVGESEIPETFYVLTK
ELQSLALDINIFGDDVDEDGAPKPIVIKEDDRPKDFSSFQLTLASPEKIHSWSYGEVKKPETINYRTLKPERDGLFCMKI
FGPTKDYECLCGKYKKPRFKDIGTCEKCGVAITHSKVRRFRMGHIELATPVAHIWYVNSLPSRIGTLLGVKMKDLERVLY
YEAYIVKEPGEAAYDNEGTKLVMKYDILNEEQYQNISRRYEDRGFVAQMGGEAIKDLLEEIDLITLLQSLKEEVKDTNSD
AKKKKLIKRLKVVESFLNSGNRPEWMMLTVLPVLPPDLRPLVALDGGKFAVSDVNELYRRVINRNQRLKRLMELGAPEII
VRNEKRMLQEAVDVLFDNGRSTNAVKGANKRPLKSLSEIIKGKQGRFRQNLLGKRVDFSGRSVIVVGPNLKMDECGLPKN
MALELFKPHLLSKLEERGYATTLKQAKRMIEQKSNEVWECLQEITEGYPVLLNRAPTLHKQSIQAFHPKLIDGKAIQLHP
LVCSAFNADFDGDQMAVHVPLSQEAIAECKVLMLSSMNILLPASGKAVAIPSQDMVLGLYYLSLEKSGVKGEHKLFSSVN
EIITAIDTKELDIHAKIRVLDQGNIIATSAGRMIIKSILPDFIPTDLWNRPMKKKDIGVLVDYVHKVGGIGITATFLDNL
KTLGFRYATKAGISISMEDIITPKDKQKMVEKAKVEVKKIQQQYDQGLLTDQERYNKIIDTWTEVNDKMSKEMMTAIAQD
KEGFNSIYMMADSGARGSAAQIRQLSAMRGLMTKPDGSIIETPIISNFKEGLNVLEYFNSTHGARKGLADTALKTANAGY
LTRKLIDVSQNVKVVSDDCGTHEGIEITDIAVGSELIEPLEERIFGRVLLEDVIDPITNEILLYADTLIDEEGAKKVVEA
GIKSITIRTPVTCKAPKGVCAKCYGLNLGEGKMSYPGEAVGVVAAQSIGEPGTQLTLRTFHVGGTASRSQDEREIVASKE
GFVRFYNLRTYTNKEGKNIIANRRNASILVVEPKIKAPFDGELRIETVYEEVVVSVKNGDQEAKFVLRRSDIVKPSELAG
VGGKIEGKVYLPYASGHKVHKGGSIADIIQEGWNVPNRIPYASELLVKDNDPIAQDVYAKEKGVIKYYVLEANHLERTHG
IKKGDMVSEKGLFAVIADDNGREAARHYIARGSEILIDDNSEVSTNSVISKPTTNTFKTIATWDPYNTPIIADFKGKVGF
VDVIAGVTVAEKEDENTGITSLVVNDYIPSGYKPSLFLEGANGEEMRYFLEPKTSIAISDGSSVEQAEVLAKIPKATVKS
RDITGGLPRVSELFEARKPKPKDVAILSEVDGIVSFGKPIRNKEHIIVTSKDGRSMDYFVDKGKQILVHADEFVHAGEAM
TDGVISSHDILRISGEKELYKYIVSEVQQVYRRQGVSIADKHIEIIVSQMLRQVRILDSGDSKFIEGDLVSKKLFKEENA
RVIALKGEPAIAEPVLLGITRAAIGSDSIISAASFQETTKVLTEASIAMKKDFLEDLKENVVLGRMIPVGTGMYKNKKIV
LRALEDNSKF
>P37870 2.7.7.6~~~rpoB~~~DNA-directed RNA polymerase subunit beta~~~COG0085
MTGQLVQYGRHRQRRSYARISEVLELPNLIEIQTSSYQWFLDEGLREMFQDISPIEDFTGNLSLEFIDYSLGEPKYPVEE
SKERDVTYSAPLRVKVRLINKETGEVKDQDVFMGDFPIMTDTGTFIINGAERVIVSQLVRSPSVYFSGKVDKNGKKGFTA
TVIPNRGAWLEYETDAKDVVYVRIDRTRKLPVTVLLRALGFGSDQEILDLIGENEYLRNTLDKDNTENSDKALLEIYERL
RPGEPPTVENAKSLLDSRFFDPKRYDLANVGRYKINKKLHIKNRLFNQRLAETLVDPETGEILAEKGQILDRRTLDKVLP
YLENGIGFRKLYPNGGVVEDEVTLQSIKIFAPTDQEGEQVINVIGNAYIEEEIKNITPADIISSISYFFNLLHGVGDTDD
IDHLGNRRLRSVGELLQNQFRIGLSRMERVVRERMSIQDTNTITPQQLINIRPVIASIKEFFGSSQLSQFMDQTNPLAEL
THKRRLSALGPGGLTRERAGMEVRDVHYSHYGRMCPIETPEGPNIGLINSLSSYAKVNRFGFIETPYRRVDPETGKVTGR
IDYLTADEEDNYVVAQANARLDDEGAFIDDSIVARFRGENTVVSRNRVDYMDVSPKQVVSAATACIPFLENDDSNRALMG
ANMQRQAVPLMQPEAPFVGTGMEYVSGKDSGAAVICKHPGIVERVEAKNVWVRRYEEVDGQKVKGNLDKYSLLKFVRSNQ
GTCYNQRPIVSVGDEVVKGEILADGPSMELGELALGRNVMVGFMTWDGYNYEDAIIMSERLVKDDVYTSIHIEEYESEAR
DTKLGPEEITRDIPNVGEDALRNLDDRGIIRIGAEVKDGDLLVGKVTPKGVTELTAEERLLHAIFGEKAREVRDTSLRVP
HGGGGIIHDVKVFNREDGDELPPGVNQLVRVYIVQKRKISEGDKMAGRHGNKGVISKILPEEDMPYLPDGTPIDIMLNPL
GVPSRMNIGQVLELHMGMAARYLGIHIASPVFDGAREEDVWETLEEAGMSRDAKTVLYDGRTGEPFDNRVSVGIMYMIKL
AHMVDDKLHARSTGPYSLVTQQPLGGKAQFGGQRFGEMEVWALEAYGAAYTLQEILTVKSDDVVGRVKTYEAIVKGDNVP
EPGVPESFKVLIKELQSLGMDVKILSGDEEEIEMRDLEDEEDAKQADGLALSGDEEPEETASADVERDVVTKE
>B8GZW7 2.7.7.6~~~rpoB~~~DNA-directed RNA polymerase subunit beta~~~
MAQSFTGKKRIRKSFGRIPEAVQMPNLIEVQRSSYEQFLQRETRPGLRRDEGVEAVFKSVFPIKDFNERAVLEYVSYEFE
EPKYDVEECIQRDMTFAAPLKVKLRLIVFETEEETGARSVKDIKEQDVYMGDIPLMTDKGTFIVNGTERVIVSQMHRSPG
VFFDHDKGKTHASGKLLFAARVIPYRGSWLDFEFDAKDIVYVRIDRRRKLPATTFLYALGMDGEEILTTFYDVVPFEKRS
GGWATPYKPERWRGVKPEFPLVDADTGEEVAPAGTKITARQAKKFADGGLKTLLLAPEALTGRYLARDAVNMATGEIYAE
AGDELDVTSIQALADQGFSTIDVLDIDHVTVGAYMRNTLRVDKNAIREDALFDIYRVMRPGEPPTVEAAEAMFKSLFFDA
ERYDLSSVGRVKMNMRLEQDVSDEVRILRKEDVLAVLKVLVGLRDGRGEIDDIDNLGNRRVRSVGELLENQYRVGLLRME
RAIKERMSSVDIDTVMPHDLINAKPAAAAVREFFGSSQLSQFMDQTNPLSEITHKRRLSALGPGGLTRERAGFEVRDVHP
THYGRICPIETPEGPNIGLINSLATHARVNKYGFIESPYRRVKDGKPQDEVVYMSAMEESKHVIAQSNIKVAEGEIVEDL
VPGRINGEPTLLQKETVDLMDVSPRQVVSVAAALIPFLENDDANRALMGSNMQRQAVPLVQSDAPLVGTGMEAVVARDSG
AVVIAKRTGVVEQIDGTRIVIRATEETDPARSGVDIYRMSKFQRSNQSTCINQRPLVKVGDRIVAGDIIADGPSTELGEL
ALGRNALVAFMPWNGYNFEDSILISERIVRDDVFTSIHIEEFEVMARDTKLGPEEITRDIPNVGEEALRNLDEAGIVAIG
AEVQPGDILVGKVTPKGESPMTPEEKLLRAIFGEKASDVRDTSLRLPPGVAGTIVDVRVFNRHGVDKDERALAIERAEID
RLGKDRDDEFAILNRNISGRLKELLIGKVALSGPKGLSRGEITAEGLAQVASGLWWQIALEDEKAMGELESLRRLFDENR
KRLDRRFEDKVDKLQRGDELPPGVMKMVKVFVAVKRKLQPGDKMAGRHGNKGVISRILPIEDMPFLADGTHVDVVLNPLG
VPSRMNVGQIFETHLGWACANLGKQITNLLEDWQQGGQKQALVERLTEIYGPDEELPDTEEGLVELARNLGKGVPIATPV
FDGARMDDIEGHLEMAGVNKSGQSILFDGLTGEQFKRPVTVGYIYMLKLHHLVDDKIHARSIGPYSLVTQQPLGGKAQFG
GQRFGEMEVWALEAYGAAYTLQEMLTVKSDDVAGRTKVYESIVRGDDTFEAGIPESFNVLVKEMRSLGLNVELENS
>Q18CF1 2.7.7.6~~~rpoB~~~DNA-directed RNA polymerase subunit beta~~~COG0085
MPHPVTIGKRTRMSFSKIKEIADVPNLIEIQVDSYEWFLKEGLKEVFDDISPIEDYTGNLILEFVDYSLDDKPKYDIEEC
KERDATYCAPLKVKVRLINKETGEIKEQEVFMGDFPLMTERGTFVINGAERVIVSQLVRSPGVYYAEERDKTGKRLISST
VIPNRGAWLEYETDSNDVISVRVDRTRKQPVTVLLRALGIGTDAEIIDLLGEDERLSATLEKDNTKTVEEGLVEIYKKLR
PGEPPTVESASSLLNALFFDPKRYDLAKVGRYKFNKKLALCYRIMNKISAEDIINPETGEVFVKAGEKISYDLAKAIQNA
GINVVNLLMDDDKKVRVIGNNFVDIKSHIDFDIDDLNIKEKVHYPTLKEILDGYSDEEEIKEAIKSRIKELIPKHILLDD
IIASISYEFNIFYNIGNIDDIDHLGNRRIRSVGELLQNQVRIGLSRMERVIKERMTVQDMEAITPQALVNIRPVSAAIKE
FFGSSQLSQFMDQTNPLSELTHKRRLSALGPGGLSRERAGFEVRDVHHSHYGRMCPIETPEGPNIGLINSLGTYAKINEF
GFIESPYRKFDKETSTVTDEIHYLTADEEDLFVRAQANEPLTEDGKFVNHRVVCRTVNGAVEMVPESRVDYMDISPKQVV
SVATAMIPFLENDDANRALMGANMQRQAVPLVRREAPIIGTGIEYRAAKDSGAVVVARNSGIAERVTADEIIIKREDGNR
DRYNLLKFKRSNSGTCINQTPIINKGDQIIKGDVIADGPATDLGEVALGRNCLIAFMTWEGYNYEDAILINERLVKEDRL
STIHIEEYECEARDTKLGPEEITRDIPNVGDSAIKNLDDRGIIRIGAEVDSGDILVGKVTPKGETELTAEERLLRAIFGE
KAREVRDTSLKVPHGESGIIVDVKVFTRENGDDLSPGVNELVRCYIAKKRKIKVGDKMAGRHGNKGVISRVLPEEDMPFM
ENGTPLDIILNPQGIPSRMNIGQVLEVHLGLAAKTLGWYVATSVFDGANEYDIMDALEEAGYPRDGKLTLYDGRTGESFD
NRITVGYMYYLKLHHLVDEKLHARSTGPYSLVTQQPLGGKAQFGGQRFGEMEVWALEAYGAAHILQEILTVKSDDVVGRV
RTYEAIVKGENIPEPGIPESFKVLIKELQSLCLDVKVLTDEDQEIEVRESVDEDDTIGEFELDVVNHMGEVEESNIIEEI
EDDFAENAEDEDIENLEEFTEDDLFEEEIDFDSDDFDM
>Q727C7 2.7.7.6~~~rpoB~~~DNA-directed RNA polymerase subunit beta~~~COG0085
MGQLTKKFGKIDVSLPIPHLLNLQVDSYVKFLQEGATERRHDEGLEGVFRSVFPIEDFNRTASLEFVSYEVGEPKYDQPE
CISKGLTYEAPIRIKVRLVVYDVDEDSGNRTIRDIKEQEIYFGTLPLMTEKGTFIINGTERVIVNQLQRSPGIIFEHDSG
KTHSSRKVLYSCRIIPMRGSWLDFDFDHKDILYVRIDRRRKMPATILFKAMGMSKTDILDYFYKKEFYRLDPMGRLMWEV
QKDMYRKDSAFVDIEDGKGGTIVKAGKPITKRAWRLISEAGLETIEVAPDTIEGMFLAEDIVNPATGEVLAEAADEITAS
LVENLREAGISRLPVLHTKGLETSSSLRDTLVLDKTPDMEAAQVEIYRRLRPSSPPTPEIAASFFDNLFRSADYYDLSPV
GRYKLNQRLGIDQSVDLRTLTDDDILRAIRVLLHLKDSHGPADDIDHLGNRRVRPVGELVENQYRIGLVRMERAIKERMS
LQEVSTLMPHDLINPKPVAAVLKEFFGTSQLSQFMDQTNALSEVTHKRRLSALGPGGLTRERAGFEVRDVHTSHYGRICP
IETPEGPNIGLIVSLTTYAKVNDFGFIETPYRIIREGALTDEIKFLDASREQGEVVAQANAAVDADGKLADEYVTARVRG
DVLMSHRDEVTLMDISPSQMVSISAALIPFLEHDDANRALMGSNMQRQAVPLLRSEKPIVGTGMEGDVARDSGACILAEG
PGIVRYADATRIIVSYENGLYPDRGGVRAYDLQKYHKSNQNSCFGQRPTCHPGQIVKKGDVLADGPGIEDGELALGKNLV
VAFMPWCGYNFEDSILISERVVKEDVFTSIHIEEFEVVARDTKLGPEEITRDIPNVGEDMLRNLDGSGIIRIGASVKPDD
ILVGKITPKGETQLTPEEKLLRAIFGDKARDVKNTSLKVPPGIEGTIIDVKVFNRRSGEKDERTRNIEDYETARIDKKEQ
DHVRALGDALRDRLADTLVGKQIAVTLPGKRKGEVLAEAGAPMTRELLDALPVKRLAGLFKSREVDEMVDTALEDYDRQV
AFLKGIYDSKREKVTEGDDLPPGVIKMVKVHIAVKRKLNVGDKMAGRHGNKGVVSCILPEEDMPFFADGRPVDIVLNPLG
VPSRMNIGQIMETHLGWGAKELGRQLAEMLDSGAAMATLRHEVKDVFRSATIAKLVDEMDDETFRKAVSKLRTGIVTKTP
VFDGASEEDIWSWIERAGMDGDGKTVLYDGRTGDKFYNRVTTGVMYILKLHHLVDEKIHARSTGPYSLVTQQPLGGKAQF
GGQRLGEMEVWALEAYGASYLLQEFLTVKSDDVTGRVKMYEKIVKGDNFLEAGLPESFNVLVKELMSLGLNVTLHQEEGK
KRPKRVGFMSAL
>A7ZUK1 2.7.7.6~~~rpoB~~~DNA-directed RNA polymerase subunit beta~~~
MVYSYTEKKRIRKDFGKRPQVLDVPYLLSIQLDSFQKFIEQDPEGQYGLEAAFRSVFPIQSYSGNSELQYVSYRLGEPVF
DVQECQIRGVTYSAPLRVKLRLVIYEREAPEGTVKDIKEQEVYMGEIPLMTDNGTFVINGTERVIVSQLHRSPGVFFDSD
KGKTHSSGKVLYNARIIPYRGSWLDFEFDPKDNLFVRIDRRRKLPATIILRALNYTTEQILDLFFEKVIFEIRDNKLQME
LVPERLRGETASFDIEANGKVYVEKGRRITARHIRQLEKDDVKLIEVPVEYIAGKVVAKDYIDESTGELICAANMELSLD
LLAKLSQSGHKRIETLFTNDLDHGPYISETLRVDPTNDRLSALVEIYRMMRPGEPPTREAAESLFENLFFSEDRYDLSAV
GRMKFNRSLLREEIEGSGILSKDDIIDVMKKLIDIRNGKGEVDDIDHLGNRRIRSVGEMAENQFRVGLVRVERAVKERLS
LGDLDTLMPQDMINAKPISAAVKEFFGSSQLSQFMDQNNPLSEITHKRRISALGPGGLTRERAGFEVRDVHPTHYGRVCP
IETPEGPNIGLINSLSVYAQTNEYGFLETPYRKVTDGVVTDEIHYLSAIEEGNYVIAQANSNLDEEGHFVEDLVTCRSKG
ESSLFSRDQVDYMDVSTQQVVSVGASLIPFLEHDDANRALMGANMQRQAVPTLRADKPLVGTGMERAVAVDSGVTAVAKR
GGVVQYVDASRIVIKVNEDEMYPGEAGIDIYNLTKYTRSNQNTCINQMPCVSLGEPVERGDVLADGPSTDLGELALGQNM
RVAFMPWNGYNFEDSILVSERVVQEDRFTTIHIQELACVSRDTKLGPEEITADIPNVGEAALSKLDESGIVYIGAEVTGG
DILVGKVTPKGETQLTPEEKLLRAIFGEKASDVKDSSLRVPNGVSGTVIDVQVFTRDGVEKDKRALEIEEMQLKQAKKDL
SEELQILEAGLFSRIRAVLVAGGVEAEKLDKLPRDRWLELGLTDEEKQNQLEQLAEQYDELKHEFEKKLEAKRRKITQGD
DLAPGVLKIVKVYLAVKRRIQPGDKMAGRHGNKGVISKINPIEDMPYDENGTPVDIVLNPLGVPSRMNIGQILETHLGMA
AKGIGDKINAMLKQQQEVAKLREFIQRAYDLGADVRQKVDLSTFSDEEVMRLAENLRKGMPIATPVFDGAKEAEIKELLK
LGDLPTSGQIRLYDGRTGEQFERPVTVGYMYMLKLNHLVDDKMHARSTGSYSLVTQQPLGGKAQFGGQRFGEMEVWALEA
YGAAYTLQEMLTVKSDDVNGRTKMYKNIVDGNHQMEPGMPESFNVLLKEIRSLGINIELEDE
>P0A8V2 2.7.7.6~~~rpoB~~~DNA-directed RNA polymerase subunit beta~~~COG0085
MVYSYTEKKRIRKDFGKRPQVLDVPYLLSIQLDSFQKFIEQDPEGQYGLEAAFRSVFPIQSYSGNSELQYVSYRLGEPVF
DVQECQIRGVTYSAPLRVKLRLVIYEREAPEGTVKDIKEQEVYMGEIPLMTDNGTFVINGTERVIVSQLHRSPGVFFDSD
KGKTHSSGKVLYNARIIPYRGSWLDFEFDPKDNLFVRIDRRRKLPATIILRALNYTTEQILDLFFEKVIFEIRDNKLQME
LVPERLRGETASFDIEANGKVYVEKGRRITARHIRQLEKDDVKLIEVPVEYIAGKVVAKDYIDESTGELICAANMELSLD
LLAKLSQSGHKRIETLFTNDLDHGPYISETLRVDPTNDRLSALVEIYRMMRPGEPPTREAAESLFENLFFSEDRYDLSAV
GRMKFNRSLLREEIEGSGILSKDDIIDVMKKLIDIRNGKGEVDDIDHLGNRRIRSVGEMAENQFRVGLVRVERAVKERLS
LGDLDTLMPQDMINAKPISAAVKEFFGSSQLSQFMDQNNPLSEITHKRRISALGPGGLTRERAGFEVRDVHPTHYGRVCP
IETPEGPNIGLINSLSVYAQTNEYGFLETPYRKVTDGVVTDEIHYLSAIEEGNYVIAQANSNLDEEGHFVEDLVTCRSKG
ESSLFSRDQVDYMDVSTQQVVSVGASLIPFLEHDDANRALMGANMQRQAVPTLRADKPLVGTGMERAVAVDSGVTAVAKR
GGVVQYVDASRIVIKVNEDEMYPGEAGIDIYNLTKYTRSNQNTCINQMPCVSLGEPVERGDVLADGPSTDLGELALGQNM
RVAFMPWNGYNFEDSILVSERVVQEDRFTTIHIQELACVSRDTKLGPEEITADIPNVGEAALSKLDESGIVYIGAEVTGG
DILVGKVTPKGETQLTPEEKLLRAIFGEKASDVKDSSLRVPNGVSGTVIDVQVFTRDGVEKDKRALEIEEMQLKQAKKDL
SEELQILEAGLFSRIRAVLVAGGVEAEKLDKLPRDRWLELGLTDEEKQNQLEQLAEQYDELKHEFEKKLEAKRRKITQGD
DLAPGVLKIVKVYLAVKRRIQPGDKMAGRHGNKGVISKINPIEDMPYDENGTPVDIVLNPLGVPSRMNIGQILETHLGMA
AKGIGDKINAMLKQQQEVAKLREFIQRAYDLGADVRQKVDLSTFSDEEVMRLAENLRKGMPIATPVFDGAKEAEIKELLK
LGDLPTSGQIRLYDGRTGEQFERPVTVGYMYMLKLNHLVDDKMHARSTGSYSLVTQQPLGGKAQFGGQRFGEMEVWALEA
YGAAYTLQEMLTVKSDDVNGRTKMYKNIVDGNHQMEPGMPESFNVLLKEIRSLGINIELEDE
>Q2A1M7 2.7.7.6~~~rpoB~~~DNA-directed RNA polymerase subunit beta~~~
MSYSYAEKKRIRKEFGVLPHILDVPYLLSIQTESYKKFLTVDAAKGRLHSGLEIVLKQSFPVESKNGQYELHYVDYQIGE
PTFDETECQVRGATYDAPLNVKLRLVVYNKDALPNEKIVEDIREEYVYMGDIPLMTTNGTFIINGTERVVVSQLHRSPGA
FFSKDDSEEGAFSARIIPYRGSWLDFEFDSKGIIWARIDRKRKFCATVILKALGYTQEQILENFSESKTITFNSKGFALR
LDNLSNMKGELLKFDIVDAQDNVIVKKNKKLTSRDVKKIKDAGVDSVAIDFDLVSTLRVAKDIVNEATGEVIAYANDDVT
ESLLKSCVEVGMLELEVIDFITTERGRYISDTLKYDLTRNTDEALVEIYKVLRPGDPPAAASVKALFEGLFFIESRYSLS
DIGRMKLNARLGSDKVSKDIYTLENSDIVGVIEELINIRDGKGKVDDIDHLGNRRVRSVGEMVENQFRIGLYRVEKGIRE
SMSLVHKDKLMPKDIVNSKPITAAIKEFFTSGALSQFMDQDNPLSEVTHKRRISALGPGGLSRDRAGFEVRDVHATHYGR
LCPIETPEGPNIGLINSLASYARVNDYGFLEAPYRKVVDGKVTDEIEYLSAIDEDNYVIAQASTKLDENNHFVEDIIQCR
SGGEAIFTESSRVQYMDVSAKQMVSAAAALIPFLEHDDANRVLMGANMQRQAVPTLKSEKPLVGTGMEKIVARDSGNCII
ARNVGEVAEVDSNRIVIKVDTEKSQTSNLVDIYSLTKFKRSNKNTCINQRPIVNVGDKVEAGDILADGFATDFGELSLGH
NLMVAFMPWNGYNFEDSILLSERIVKDDKYTSIHIEEFTCVARDTKLGPEEITADIPNVSESSLAKLDESGIVHIGANVE
AGDILVAKITPKAEQQLTPEERLLRAIFNEKASNVVDSSLRMPSGTSGTVINVQVFENDKGGKSKRALKIEKELIDKARK
DFDEEFAVIESVVKSSIEQEVVGAKIQKAKGLKKGAILTKEFLATLPLSKWLEISFEDEKLEEKVQNAREYYEEAKIAID
AKFEAKKKSITQSNELSPGVLKTVKVFVAIKKRIQPGDKMAGRHGNKGVVSRVLPVEDMPYMEDGTPVDVCLNPLGIPSR
MNIGQILEAHLGLASYGLGKKIEKTLEKTRKAAELRKTLEEVYNSVGDKKVNLEALNDEEILTLCDNLKGGVPIATPVFD
GAKEEDIKSLLKIGGFATNGQMKLFDGRTGKPFDRHVTVGYMYMLKLDHLVDDKMHARSTGSYSLVTQQPLGGKAQFGGQ
RFGEMEVWALQAYGAAYTLREMLTVKSDDIAGRSKMYKNIVDGKLTMNVDVPESFNVLRNEVRALGIDMDFDYSSEEE
>A6TGP0 2.7.7.6~~~rpoB~~~DNA-directed RNA polymerase subunit beta~~~
MVYSYTEKKRIRKDFGKRPQVLDVPYLLSIQLDSFQKFIEQDPEGQYGLEAAFRSVFPIQSYSGNSELQYVSYRLGEPVF
DVKECQIRGVTYSAPLRVKLRLVIYEREAPEGTVKDIKEQEVYMGEIPLMTDNGTFVINGTERVIVSQLHRSPGVFFDSD
KGKTHSSGKVLYNARIIPYRGSWLDFEFDPKDNLFVRIDRRRKLPATIILRALNYTTEQILDLFFEKVVFEIRDNKLQME
LIPERLRGETASFDIEANGKVYVEKGRRITARHIRQLEKDDIKHIEVPVEYIAGKVAAKDYIDEATGELICPANMELSLD
LLAKLSQSGHKRIETLFTNDLDHGPYISETVRVDPTNDRLSALVEIYRMMRPGEPPTREAAESLFENLFFSEDRYDLSAV
GRMKFNRSLLRDEIEGSGILSKDDIIEVMKKLIDIRNGKGEVDDIDHLGNRRIRSVGEMAENQFRVGLVRVERAVKERLS
LGDLDTLMPQDMINAKPISAAVKEFFGSSQLSQFMDQNNPLSEITHKRRISALGPGGLTRERAGFEVRDVHPTHYGRVCP
IETPEGPNIGLINSLSVYAQTNEYGFLETPYRKVTDGVVTDEIHYLSAIEEGNYVIAQANSNLDENGHFVEDLVTCRSKG
ESSLFSRDQVDYMDVSTQQVVSVGASLIPFLEHDDANRALMGANMQRQAVPTLRADKPLVGTGMERAVAVDSGVTAVAKR
GGTVQYVDASRIVIKVNEDEMYPGEAGIDIYNLTKYTRSNQNTCINQMPCVSLGEPIERGDVLADGPSTDLGELALGQNM
RVAFMPWNGYNFEDSILVSERVVQEDRFTTIHIQELACVSRDTKLGPEEITADIPNVGEAALSKLDESGIVYIGAEVTGG
DILVGKVTPKGETQLTPEEKLLRAIFGEKASDVKDSSLRVPNGVSGTVIDVQVFTRDGVEKDKRALEIEEMQLKQAKKDL
SEELQILEAGLFSRIYAVLVSGGVEAEKLDKLPRDRWLELGLTDEEKQNQLEQLAEQYDELKHEFEKKLEAKRRKITQGD
DLAPGVLKIVKVYLAVKRRIQPGDKMAGRHGNKGVISKINPIEDMPHDANGTPVDIVLNPLGVPSRMNIGQILETHLGMA
AKGIGDKINAMLKQQQEVAKLREFIQRAYDLGADVRQKVDLNTFSDEEVLRLAENLRKGMPIATPVFDGAKEAEIKELLQ
LGDLPTSGQITLFDGRTGEQFERPVTVGYMYMLKLNHLVDDKMHARSTGSYSLVTQQPLGGKAQFGGQRFGEMEVWALEA
YGAAYTLQEMLTVKSDDVNGRTKMYKNIVDGNHQMEPGMPESFNVLLKEIRSLGINIELEDE
>P60281 2.7.7.6~~~rpoB~~~DNA-directed RNA polymerase subunit beta~~~COG0085
MLEGCILAVSSQSKSNAITNNSVPGAPNRVSFAKLREPLEVPGLLDVQTDSFEWLVGSDRWRQAAIDRGEENPVGGLEEV
LAELSPIEDFSGSMSLSFSDPRFDEVKASVDECKDKDMTYAAPLFVTAEFINNNTGEIKSQTVFMGDFPMMTEKGTFIIN
GTERVVVSQLVRSPGVYFDETIDKSTEKTLHSVKVIPGRGAWLEFDVDKRDTVGVRIDRKRRQPVTVLLKALGWTNEQIV
ERFGFSEIMMGTLEKDTTSGTDEALLDIYRKLRPGEPPTKESAQTLLENLFFKEKRYDLARVGRYKVNKKLGLNAGKPIT
SSTLTEEDVVATIEYLVRLHEGQTSMTVPGGVEVPVEVDDIDHFGNRRLRTVGELIQNQIRVGLSRMERVVRERMTTQDV
EAITPQTLINIRPVVAAIKEFFGTSQLSQFMDQNNPLSGLTHKRRLSALGPGGLSRERAGLEVRDVHPSHYGRMCPIETP
EGPNIGLIGSLSVYARVNPFGFIETPYRKVENGVVTDQIDYLTADEEDRHVVAQANSPTDENGRFTEDRVMVRKKGGEVE
FVSADQVDYMDVSPRQMVSVATAMIPFLEHDDANRALMGANMQRQAVPLVRSEAPLVGTGMELRAAIDAGDVVVADKTGV
IEEVSADYITVMADDGTRQSYRLRKFARSNHGTCANQRPIVDAGQRVEAGQVIADGPCTQNGEMALGKNLLVAIMPWEGH
NYEDAIILSNRLVEEDVLTSIHIEEHEIDARDTKLGAEEITRDIPNVSDEVLADLDERGIVRIGAEVRDGDILVGKVTPK
GETELTPEERLLRAIFGEKAREVRDTSLKVPHGESGKVIGIRVFSREDDDELPAGVNELVRVYVAQKRKISDGDKLAGRH
GNKGVIGKILPVEDMPFLPDGTPVDIILNTHGVPRRMNIGQILETHLGWVAKAGWNIDVAAGVPDWASKLPEELYSAPAD
STVATPVFDGAQEGELAGLLGSTLPNRDGEVMVDADGKSTLFDGRSGEPFPYPVTVGYMYILKLHHLVDDKIHARSTGPY
SMITQQPLGGKAQFGGQRFGEMECWAMQAYGAAYTLQELLTIKSDDTVGRVKVYEAIVKGENIPEPGIPESFKVLLKELQ
SLCLNVEVLSSDGAAIEMRDGDDEDLERAAANLGINLSRNESASVEDLA
>P9WGY9 2.7.7.6~~~rpoB~~~DNA-directed RNA polymerase subunit beta~~~COG0085
MLEGCILADSRQSKTAASPSPSRPQSSSNNSVPGAPNRVSFAKLREPLEVPGLLDVQTDSFEWLIGSPRWRESAAERGDV
NPVGGLEEVLYELSPIEDFSGSMSLSFSDPRFDDVKAPVDECKDKDMTYAAPLFVTAEFINNNTGEIKSQTVFMGDFPMM
TEKGTFIINGTERVVVSQLVRSPGVYFDETIDKSTDKTLHSVKVIPSRGAWLEFDVDKRDTVGVRIDRKRRQPVTVLLKA
LGWTSEQIVERFGFSEIMRSTLEKDNTVGTDEALLDIYRKLRPGEPPTKESAQTLLENLFFKEKRYDLARVGRYKVNKKL
GLHVGEPITSSTLTEEDVVATIEYLVRLHEGQTTMTVPGGVEVPVETDDIDHFGNRRLRTVGELIQNQIRVGMSRMERVV
RERMTTQDVEAITPQTLINIRPVVAAIKEFFGTSQLSQFMDQNNPLSGLTHKRRLSALGPGGLSRERAGLEVRDVHPSHY
GRMCPIETPEGPNIGLIGSLSVYARVNPFGFIETPYRKVVDGVVSDEIVYLTADEEDRHVVAQANSPIDADGRFVEPRVL
VRRKAGEVEYVPSSEVDYMDVSPRQMVSVATAMIPFLEHDDANRALMGANMQRQAVPLVRSEAPLVGTGMELRAAIDAGD
VVVAEESGVIEEVSADYITVMHDNGTRRTYRMRKFARSNHGTCANQCPIVDAGDRVEAGQVIADGPCTDDGEMALGKNLL
VAIMPWEGHNYEDAIILSNRLVEEDVLTSIHIEEHEIDARDTKLGAEEITRDIPNISDEVLADLDERGIVRIGAEVRDGD
ILVGKVTPKGETELTPEERLLRAIFGEKAREVRDTSLKVPHGESGKVIGIRVFSREDEDELPAGVNELVRVYVAQKRKIS
DGDKLAGRHGNKGVIGKILPVEDMPFLADGTPVDIILNTHGVPRRMNIGQILETHLGWCAHSGWKVDAAKGVPDWAARLP
DELLEAQPNAIVSTPVFDGAQEAELQGLLSCTLPNRDGDVLVDADGKAMLFDGRSGEPFPYPVTVGYMYIMKLHHLVDDK
IHARSTGPYSMITQQPLGGKAQFGGQRFGEMECWAMQAYGAAYTLQELLTIKSDDTVGRVKVYEAIVKGENIPEPGIPES
FKVLLKELQSLCLNVEVLSSDGAAIELREGEDEDLERAAANLGINLSRNESASVEDLA
>Q51561 2.7.7.6~~~rpoB~~~DNA-directed RNA polymerase subunit beta~~~
MAYSYTEKKRIRKDFSKLPDVMDVPYLLAIQLDSYREFLQAGATKEQFRDVGLHAAFKSVFPIISYSGNAALEYVGYRLG
EPAFDVKECVLRGVTFAVPLRVKVRLIIFDRESSNKAIKDIKEQEVYMGEIPLMTENGTFIINGTERVIVSQLHRSPGVF
FDHDRGKTHSSGKLLYSARIIPYRGSWLDFEFDPKDCVFVRIDRRRKLPASVLLRALGYSTEEILNAFYATNVFHIKGET
LNLELVPQRLRGEVASIDIKDGSGKVIVEQGRRITARHINQLEKAGVSQLEVPFDYLIGRTIAKAIVHPATGEIIAECNT
ELTLDLLAKVAKAQVVRIETLYTNDIDCGPFISDTLKIDNTSNQLEALVEIYRMMRPGEPPTKEAAETLFGNLFFSAERY
DLSAVGRMKFNRRIGRTEIEGPGVLSKEDIIDVLKTLVDIRNGKGIVDDIDHLGNRRVRCVGEMAENQFRVGLVRVERAV
KERLSMAESEGLMPQDLINAKPVAAAIKEFFGSSQLSQFMDQNNPLSEITHKRRVSALGPGGLTRERAGFEVRDVHPTHY
GRVCPIETPEGPNIGLINSLATYARTNKYGFLESPYRVVKDSLVTDEIVFLSAIEEADHVIAQASATLNEKGQLVDELVA
VRHLNEFTVKAPEDVTLMDVSPKQVVSVAASLIPFLEHDDANRALMGSNMQRQAVPTLRADKPLVGTGMERNVARDSGVC
VVARRGGVIDSVDASRVVVRVADDEVETGEAGVDIYNLTKYTRSNQNTCINQRPLVSKGDVVARGDILADGPSTDMGELA
LGQNMRVAFMPWNGFNFEDSICLSERVVQEDRFTTIHIQELTCVARDTKLGPEEITADIPNVGEAALNKLDEAGIVYVGA
EVQAGDILVGKVTPKGETQLTPEEKLLRAIFGEKASDVKDTSLRVPTGTKGTVIDVQVFTRDGVERDSRALSIEKMQLDQ
IRKDLNEEFRIVEGATFERLRAALVGAKAEGGPALKKGTEITDDYLDGLERGQWFKLRMADDALNEQLEKAQAYISDRRQ
LLDDKFEDKKRKLQQGDDLAPGVLKIVKVYLAIKRRIQPGDKMAGRHGNKGVVSVIMPVEDMPHDANGTPVDIVLNPLGV
PSRMNVGQILETHLGLAAKGLGEKINRMLEEQRKVAELRKFLHEIYNEIGGREENLDELGDNEILALAKNLRGGVPMATP
VFDGAKEREIKAMLKLADLPESGQMRLFDGRTGNQFERPTTVGYMYMLKLNHLVDDKMHARSTGSYSLVTQQPLGGKAQF
GGQRFGEMEVWALEAYGAAYTLQEMLTVKSDDVNGRTKMYKNIVDGDHRMEAGMPESFNVLIKEIRSLGIDIELETE
>P60278 2.7.7.6~~~rpoB~~~DNA-directed RNA polymerase subunit beta~~~
MAGQVVQYGRHRKRRNYARISEVLELPNLIEIQTKSYEWFLREGLIEMFRDISPIEDFTGNLSLEFVDYRLGEPKYDLEE
SKNRDATYAAPLRVKVRLIIKETGEVKEQEVFMGDFPLMTDTGTFVINGAERVIVSQLVRSPSVYFNEKIDKNGRENYDA
TIIPNRGAWLEYETDAKDVVYVRIDRTRKLPLTVLLRALGFSSDQEIVDLLGDNEYLRNTLEKDGTENTEQALLEIYERL
RPGEPPTVENAKSLLYSRFFDPKRYDLASVGRYKTNKKLHLKHRLFNQKLAEPIVNTETGEIVVEEGTVLDRRKIDEIMD
VLESNANSEVFELHGSVIDEPVEIQSIKVYVPNDDEGRTTTVIGNAFPDSEVKCITPADIIASMSYFFNLLSGIGYTDDI
DHLGNRRLRSVGELLQNQFRIGLSRMERVVRERMSIQDTESITPQQLINIRPVIASIKEFFGSSQLSQFMDQANPLAELT
HKRRLSALGPGGLTRERAQMEVRDVHYSHYGRMCPIETPEGPNIGLINSLSSYARVNEFGFIETPYRKVDLDTHAITDQI
DYLTADEEDSYVVAQANSKLDENGRFMDDEVVCRFRGNNTVMAKEKMDYMDVSPKQVVSAATACIPFLENDDSNRALMGA
NMQRQAVPLMNPEAPFVGTGMEHVAARDSGAAITAKHRGRVEHVESNEILVRRLVEENGVEHEGELDRYPLAKFKRSNSG
TCYNQRPIVAVGDVVEYNEILADGPSMELGEMALGRNVVVGFMTWDGYNYEDAVIMSERLVKDDVYTSIHIEEYESEARD
TKLGPEEITRDIPNVSESALKNLDDRGIVYIGAEVKDGDILVGKVTPKGVTELTAEERLLHAIFGEKAREVRDTSLRVPH
GAGGIVLDVKVFNREEGDDTLSPGVNQLVRVYIVQKRKIHVGDKMCGRHGNKGVISKIVPEEDMPYLPDGRPIDIMLNPL
GVPSRMNIGQVLELHLGMAAKNLGIHVASPVFDGANDDDVWSTIEEAGMARDGKTVLYDGRTGEPFDNRISVGVMYMLKL
AHMVDDKLHARSTGPYSLVTQQPLGGKAQFGGQRFGEMEVWALEAYGAAYTLQEILTYKSDDTVGRVKTYEAIVKGENIS
RPSVPESFRVLMKELQSLGLDVKVMDEQDNEIEMTDVDDDDVVERKVDLQQNDAPETQKEVTD
>Q9L0L0 2.7.7.6~~~rpoB~~~DNA-directed RNA polymerase subunit beta~~~COG0085
MAASRNASTANTNNAASTAPLRISFAKIKEPLEVPNLLALQTESFDWLLGNDAWKARVESALESGQDVPTKSGLEEIFEE
ISPIEDFSGSMSLTFRDHRFEPPKNSIDECKDRDFTYAAPLFVTAEFTNNETGEIKSQTVFMGDFPLMTNKGTFVINGTE
RVVVSQLVRSPGVYFDSSIDKTSDKDIFSAKIIPSRGAWLEMEIDKRDMVGVRIDRKRKQSVTVLLKALGWTTEQILEEF
GEYESMRATLEKDHTQGQDDALLDIYRKLRPGEPPTREAAQTLLENLYFNPKRYDLAKVGRYKVNKKLGADEPLDAGVLT
TDDVIATIKYLVKLHAGETETVGESGREIVVETDDIDHFGNRRIRNVGELIQNQVRTGLARMERVVRERMTTQDVEAITP
QTLINIRPVVASIKEFFGTSQLSQFMDQNNPLSGLTHKRRLNALGPGGLSRERAGFEVRDVHPSHYGRMCPIETPEGPNI
GLIGSLASYGRINPFGFIETPYRKVVEGQVTDDVDYLTADEEDRFVIAQANAALGDDMRFAEARVLVRRRGGEVDYVPGD
DVDYMDVSPRQMVSVATAMIPFLEHDDANRALMGANMMRQAVPLIKSESPLVGTGMEYRSAADAGDVVKAEKAGVVQEVS
ADYITTTNDDGTYITYRLAKFSRSNQGTSVNQKVIVAEGDRIIEGQVLADGPATENGEMALGKNLLVAFMPWEGHNYEDA
IILSQRLVQDDVLSSIHIEEHEVDARDTKLGPEEITRDIPNVSEEVLADLDERGIIRIGAEVVAGDILVGKVTPKGETEL
TPEERLLRAIFGEKAREVRDTSLKVPHGEIGKVIGVRVFDREEGDELPPGVNQLVRVYVAQKRKITDGDKLAGRHGNKGV
ISKINPIEDMPFLEDGTPVDIILNPLAVPSRMNPGQVLEIHLGWLASRGWDVSGLAEEWAQRLQVIGADKVEPGTNVATP
VFDGAREDELAGLLQHTIPNRDGERMVLPSGKARLFDGRSGEPFPEPISVGYMYILKLHHLVDDKLHARSTGPYSMITQQ
PLGGKAQFGGQRFGEMEVWALEAYGAAYALQELLTIKSDDVTGRVKVYEAIVKGENIPEPGIPESFKVLIKEMQSLCLNV
EVLSSDGMSIEMRDTDEDVFRAAEELGIDLSRREPSSVEEV
>P77965 2.7.7.6~~~rpoB~~~DNA-directed RNA polymerase subunit beta~~~COG0085
MTNLATTMLPDLIEIQHASFHWFLEEGLIEELNSFSPISDYTGKLELHFLGKDYKLKQPKYDVDESKRRDASYSVQMYVP
TRLINKETGEIKEQEVFIGDLPLMTERGTFIINGAERVIVNQIVRSPGVYYKKELDKNGRRTYSASLIPNRGAWLKFETD
KNGLVYVRIDKTRKLSAQVLLKAIGLSDNEILDSLSHPEFYQKTLDKEGNPTEEEALVELYKKLRPGEPPTVSGGQQLLE
SRFFDPKRYDLGRVGRYKLNKKLRLNEADTTRVLTPQDILAAINYLINLEFDVGTTDDIDHLGNRRVRSVGELLQNQIRV
GLNRLERIIRERMTVSESDALTPASLVNPKPLVAAIKEFFGSSQLSQFMDQTNPLAELTHKRRISALGPGGLTRERAGFA
VRDIHPSHHGRICPVETPEGPNAGLIGSLATCARVNDYGFIETPYFRVESGRVRKDLDPVYLTADEEDDMRVAPGDIPTD
EEGNIIGESVPIRYRQEFSTTSPEQVDYVAVSPVQIISVATSMIPFLEHDDANRALMGSNMQRQAVPLLRPERPLVGTGL
EAQAARDSGMVIVSRTHGIVTYVDATEIRVQPHSPDNPAEKGEEIVYPIQKYQRSNQDTCLNQRPLVYAGEDVVPGQVLA
DGSATEGGELALGQNILVAYMPWEGYNYEDAILISERLVYDDVYTSIHIEKFEIEARQTKLGPEEITREIPNVGEDALRN
LDEHGIIRIGAWVESGDILVGKVTPKGEADQPPEEKLLRAIFGEKARDVRDNSLRVPNGEKGRVVDVRVFTREKGDELPP
GANMVVRIYVAQKRKIQVGDKMAGRHGNKGIISRILPIEDMPYLPDGRPIDIALNPLGVPSRMNVGQVFECLLGWAGENL
GVRFKITPFDEMYGEEASRDTVHGLLEEASQRPNKDWVFNENHPGKIQVFDGRTGEPFDRPITVGQAYMLKLVHLVDDKI
HARSTGPYSLVTQQPLGGKAQQGGQRFGEMEVWALEAYGAAYILQELLTVKSDDMQGRNEALNAIVKGKSIPRPGTPESF
KVLMRELQSLGLDIAAHKVQLSEDGESADAEVDLMIDSQRRAPNRPTYESLHTEEDLEEEEV
>Q9KWU7 2.7.7.6~~~rpoB~~~DNA-directed RNA polymerase subunit beta~~~
MEIKRFGRIREVIPLPPLTEIQVESYKKALQADVPPEKRENVGIQAAFKETFPIEEGDKGKGGLVLDFLEYRIGDPPFSQ
DECREKDLTYQAPLYARLQLIHKDTGLIKEDEVFLGHLPLMTEDGSFIINGADRVIVSQIHRSPGVYFTPDPARPGRYIA
SIIPLPKRGPWIDLEVEASGVVTMKVNKRKFPLVLLLRVLGYDQETLVRELSAYGDLVQGLLDEAVLAMRPEEAMVRLFT
LLRPGDPPKKDKALAYLFGLLADPKRYDLGEAGRYKAEEKLGVGLSGRTLVRFEDGEFKDEVFLPTLRYLFALTAGVPGH
EVDDIDHLGNRRIRTVGELMADQFRVGLARLARGVRERMVMGSPDTLTPAKLVNSRPLEAALREFFSRSQLSQFKDETNP
LSSLRHKRRISALGPGGLTRERAGFDVRDVHRTHYGRICPVETPEGANIGLITSLAAYARVDALGFIRTPYRRVKNGVVT
EEVVYMTASEEDRYTIAQANTPLEGDRIATDRVVARRRGEPVIVAPEEVEFMDVSPKQVFSLNTNLIPFLEHDDANRALM
GSNMQTQAVPLIRAQAPVVMTGLEERVVRDSLAALYAEEDGEVVKVDGTRIAVRYEDGRLVEHPLRRYARSNQGTAFDQR
PRVRVGQRVKKGDLLADGPASEEGFLALGQNVLVAIMPFDGYNFEDAIVISEELLKRDFYTSIHIERYEIEARDTKLGPE
RITRDIPHLSEAALRDLDEEGIVRIGAEVKPGDILVGRTSFKGEQEPSPEERLLRSIFGEKARDVKDTSLRVPPGEGGIV
VGRLRLRRGDPGVELKPGVREVVRVFVAQKRKLQVGDKLANRHGNKGVVAKILPVEDMPHLPDGTPVDVILNPLGVPSRM
NLGQILETHLGLAGYFLGQRYISPVFDGATEPEIKELLAEAFNLYFGKRQGEGFGVDKREKEVLARAEKLGLVSPGKSPE
EQLKELFDLGKVVLYDGRTGEPFEGPIVVGQMFIMKLYHMVEDKMHARSTGPYSLITQQPLGGKAQFGGQRFGEMEVWAL
EAYGAAHTLQEMLTIKSDDIEGRNAAYQAIIKGEDVPEPSVPESFRVLVKELQALALDVQTLDEKDNPVDIFEGLASKR
>Q8RQE9 2.7.7.6~~~rpoB~~~DNA-directed RNA polymerase subunit beta~~~COG0085
MEIKRFGRIREVIPLPPLTEIQVESYRRALQADVPPEKRENVGIQAAFRETFPIEEEDKGKGGLVLDFLEYRLGEPPFPQ
DECREKDLTYQAPLYARLQLIHKDTGLIKEDEVFLGHIPLMTEDGSFIINGADRVIVSQIHRSPGVYFTPDPARPGRYIA
SIIPLPKRGPWIDLEVEPNGVVSMKVNKRKFPLVLLLRVLGYDQETLARELGAYGELVQGLMDESVFAMRPEEALIRLFT
LLRPGDPPKRDKAVAYVYGLIADPRRYDLGEAGRYKAEEKLGIRLSGRTLARFEDGEFKDEVFLPTLRYLFALTAGVPGH
EVDDIDHLGNRRIRTVGELMTDQFRVGLARLARGVRERMLMGSEDSLTPAKLVNSRPLEAAIREFFSRSQLSQFKDETNP
LSSLRHKRRISALGPGGLTRERAGFDVRDVHRTHYGRICPVETPEGANIGLITSLAAYARVDELGFIRTPYRRVVGGVVT
DEVVYMTATEEDRYTIAQANTPLEGNRIAAERVVARRKGEPVIVSPEEVEFMDVSPKQVFSVNTNLIPFLEHDDANRALM
GSNMQTQAVPLIRAQAPVVMTGLEERVVRDSLAALYAEEDGEVAKVDGNRIVVRYEDGRLVEYPLRRFYRSNQGTALDQR
PRVVVGQRVRKGDLLADGPASENGFLALGQNVLVAIMPFDGYNFEDAIVISEELLKRDFYTSIHIERYEIEARDTKLGPE
RITRDIPHLSEAALRDLDEEGVVRIGAEVKPGDILVGRTSFKGESEPTPEERLLRSIFGEKARDVKDTSLRVPPGEGGIV
VRTVRLRRGDPGVELKPGVREVVRVYVAQKRKLQVGDKLANRHGNKGVVAKILPVEDMPHLPDGTPVDVILNPLGVPSRM
NLGQILETHLGLAGYFLGQRYISPIFDGAKEPEIKELLAQAFEVYFGKRKGEGFGVDKREVEVLRRAEKLGLVTPGKTPE
EQLKELFLQGKVVLYDGRTGEPIEGPIVVGQMFIMKLYHMVEDKMHARSTGPYSLITQQPLGGKAQFGGQRFGEMEVWAL
EAYGAAHTLQEMLTLKSDDIEGRNAAYEAIIKGEDVPEPSVPESFRVLVKELQALALDVQTLDEKDNPVDIFEGLASKR
>Q9KV30 2.7.7.6~~~rpoB~~~DNA-directed RNA polymerase subunit beta~~~COG0085
MVYSYTEKKRIRKDFGTRPQVLDIPYLLSIQLDSFEKFIEQDPEGQYGLEAAFRSVFPIQSYNGNSELQYVSYRLGEPVF
DVKECQIRGVTYSKPLRVKLRLVIFDKDAPAGTVKDIKEQEVYMGEIPLMTENGTFVINGTERVIVSQLHRSPGVFFDSD
KGKTHSSGKVLYNARIIPYRGSWLDFEFDPKDNLYVRIDRRRKLPASIILRALGKTSAEILDIFFEKVNFEVKDQTLMME
LVPERLRGETATFDIEADGKVYVEKGRRVTARHIRQLEKDGVNFIEVPVEYIVGKVSAKDYVNEATGELIITANQEISLE
ALANLSQAGYKKLEVLFTNDLDHGPFMSETLRVDSTTDRISALVEIYRMMRPGEPPTKEAAESLFESLFFSAERYDLSTV
GRMKFNSSIGREDAEEQGTLDEVDIIEVMKKLISIRNGKGEVDDIDHLGNRRIRSVGEMAENQFRVGLVRVERAVKERLS
LGDLDNVMPQDLINAKPISAAVKEFFGSSQLSQFMDQNNPLSEVTHKRRISALGPGGLTRERAGFEVRDVHVTHYGRLCP
IETPEGPNIGLINSLSAFARCNEYGFLETPYRRVVNGVVTDEVDYLSAIEEGQFVIAQANAKLTEEGGFADELVTARQKG
ESGLHPREHVDYMDVATNQVVSIAASLIPFLEHDDANRALMGANMQRQAVPTLRSEKPLVGTGIERNVAVDSGVTAVAKR
GGVIQSVDASRIVVKVNEEELIPGEAGIDIYNLTKYTRSNQNTCINQRPCVMPGEPVARGDVLADGPSTDLGELALGQNM
RIAFMPWNGYNFEDSILVSERVVQDDRFTTIHIQELSCVARDTKLGAEEITADIPNVGEAALSKLDESGIVYIGAEVKGG
DILVGKVTPKGETQLTPEEKLLRAIFGEKASDVKDTSLRVPNSVAGTVIDVQVFTRDGVEKDKRALEIEQMQLKEAKKDL
TEEFQILEGGLLARVRSVLLAGGYTEAKLGSIERKKWLEQTLENEELQNQLEQLAEQYDELKADFDKKFEAKRRKITQGD
DLAPGVLKIVKVYLAVKRRIQPGDKMAGRHGNKGVISKINPVEDMPYDENGQPVDIVLNPLGVPSRMNIGQILEVHLGLA
AKGIGDKINQMIKEQQELAKLREFLQKVYDLGDTRQRVDISELSDEDVRTLAHNLRAGLPVATPVFDGAPESSIKAMLEL
ADLPASGQLTLFDGRTGDAFERPVTVGYMYMLKLNHLVDDKMHARSTGSYSLVTQQPLGGKAQFGGQRFGEMEVWALEAY
GAAYTLQEMLTVKSDDVNGRTKMYKNIVDGNHAMEPGMPESFNVLLKEIRSLGINIELEDE
>Q2NZX8 2.7.7.6~~~rpoB~~~DNA-directed RNA polymerase subunit beta~~~
MTSYSFTEKKRIRKDFGKQRSILEVPFLLAIQVDSYREFLQEDVESTKRKDLGLHAALKSVFPISSYSGNAALEYVGYKL
GQPVFDERECRQRGMSYGAPLRVTVRLVIYDRESSTKAIKYVKEQEVYLGEIPLMTGNGTFIVNGTERVIVSQLHRSPGV
FFDHDRGKTHSSGKLLYSARIIPYRGSWLDFEFDPKDALFTRIDRRRKLPVSILLRALGYNNEEMLAEFFEINTFHINPD
EGVQLELVPERLRGETLNFDLADGDKVIVEAGKRITARHVKQLEAAGVAALAVPDDYLVGRILSHDVVDGSTGELLANAN
DEISEDQLTAFRKAGVDAVGTLWVNDLDRGPYLSNTLRIDPTKTQLEALVEIYRMMRPGEPPTKEAAQNLFHNLFFTFER
YDLSTVGRMKFNRRVGRKDVLGESVLYDKKYFAERNDEESKRLVAEHTDTSDILEVIKVLTEIRNGRGVVDDIDHLGNRR
VRSVGEMAENVFRVGLVRVERAVKERLSMAESEGLTPQELINAKPVAAAIKEFFGSSQLSQFMDQNNPLSEVTHKRRVSA
LGPGGLTRERAGFEVRDVHPTHYGRVCTIETPEGPNIGLINSLAVFARTNQYGFLETPYRKVLDGKVSDDVEYLSAIEEN
EYVIAQANALTDAKNMLTEQFVPCRFQGESLLKPPSEVHFMDVSPMQTVSVAAALVPFLEHDDANRALMGANMQRQAVPT
LRSQKPLVGTGIERAVARDSGVTVNALRGGVIEQIDAARIVVKVNEAEIGGGTDAGVDIYNLIKYTRSNQNTCINQRPLV
NVGDVIARGDVLADGPSTDIGELALGQNMLIAFMPWNGYNFEDSILLSERVVEEDRYTTIHIEELTCVARDTKLGPEEIS
ADIPNVSEQALNRLDESGVVYIGAEVRAGDIMVGKVTPKGESQLTPEEKLLRAIFGEKASDVKDSSLRVPPGMDGTVIDV
QVFTRDGIEKDKRARQIEENEIKRVKKDFDDQFRILEAAIYARLRSQIVGKVANGGANLKKGDSVTDAYLDGLKKSDWFQ
LRMKDEDAADAIERAQKQIQAHEKEFEARFADKRGKITQGDDLAPGVLKMVKVFLAVKRRIQPGDKMAGRHGNKGVVSNV
VPVEDMPYMATGESVDIVLNPLGVPSRMNIGQILEVHLGWAAKGLGRKIQRMLEAQAAVSELRKFLDDIYNHDNAINAQR
VDLSQFSDEELLNLGKNLIDGVPMATPVFDGASEAEIKRMLELADLPQSGQTQLYDGRTGEAFDRKTTVGYMHYLKLNHL
VDDKMHARSTGPYSLVTQQPLGGKAQFGGQRFGEMEVWALEAYGAAYTLQEMLTVKSDDVQGRNQMYKNIVDGEHEMVAG
MPESFNVLVKEIRSLAIHMELEE
>B2SQQ1 2.7.7.6~~~rpoB~~~DNA-directed RNA polymerase subunit beta~~~COG0085
MTSYSFTEKKRIRKDFGKQRSILEVPFLLAIQVDSYREFLQEDVESTKRKDLGLHAALKSVFPISSYSGNAALEYVGYKL
GQPVFDERECRQRGMSYGAPLRVTVRLVIYDRESSTKAIKYVKEQEVYLGEIPLMTGNGTFIVNGTERVIVSQLHRSPGV
FFDHDRGKTHSSGKLLYSARIIPYRGSWLDFEFDPKDALFTRIDRRRKLPVSILLRALGYNNEEMLAEFFEINTFHINPD
EGVQLELVPERLRGETLNFDLADGDKVIVEAGKRITARHVKQLEAAGVAALAVPDDYLVGRILSHDVVDGSTGELLANAN
DEISEDQLTAFRKAGVDAVGTLWVNDLDRGPYLSNTLRIDPTKTQLEALVEIYRMMRPGEPPTKEAAQNLFHNLFFTFER
YDLSTVGRMKFNRRVGRKDVLGESVLYDKKYFAERNDEESKRLVAEHTDTSDILEVIKVLTEIRNGRGVVDDIDHLGNRR
VRSVGEMAENVFRVGLVRVERAVKERLSMAESEGLTPQELINAKPVAAAIKEFFGSSQLSQFMDQNNPLSEVTHKRRVSA
LGPGGLTRERAGFEVRDVHPTHYGRVCTIETPEGPNIGLINSLAVFARTNQYGFLETPYRKVLDGKVSDDVEYLSAIEEN
EYVIAQANALTDAKNMLTEQFVPCRFQGESLLKPPSEVHFMDVSPMQTVSVAAALVPFLEHDDANRALMGANMQRQAVPT
LRSQKPLVGTGIERAVARDSGVTVNALRGGVIEQIDAARIVVKVNEAEIGGGTDAGVDIYNLIKYTRSNQNTCINQRPLV
NVGDVIARGDVLADGPSTDIGELALGQNMLIAFMPWNGYNFEDSILLSERVVEEDRYTTIHIEELTCVARDTKLGPEEIS
ADIPNVSEQALNRLDESGVVYIGAEVRAGDIMVGKVTPKGESQLTPEEKLLRAIFGEKASDVKDSSLRVPPGMDGTVIDV
QVFTRDGIEKDKRARQIEENEIKRVKKDFDDQFRILEAAIYARLRSQIVGKVANGGANLKKGDSVTDAYLDGLKKSDWFQ
LRMKDEDAADAIERAQKQIQAHEKEFEARFADKRGKITQGDDLAPGVLKMVKVFLAVKRRIQPGDKMAGRHGNKGVVSNV
VPVEDMPYMATGESVDIVLNPLGVPSRMNIGQILEVHLGWAAKGLGRKIQRMLEAQAAVSELRKFLDDIYNHDNAINAQR
VDLSQFSDEELLNLGKNLIDGVPMATPVFDGASEAEIKRMLELADLPQSGQTQLYDGRTGEAFDRKTTVGYMHYLKLNHL
VDDKMHARSTGPYSLVTQQPLGGKAQFGGQRFGEMEVWALEAYGAAYTLQEMLTVKSDDVQGRNQMYKNIVDGEHEMVAG
MPESFNVLVKEIRSLAIHMELEE
>P74177 2.7.7.6~~~rpoC1~~~DNA-directed RNA polymerase subunit gamma~~~COG0086
MKAQSEPRFDYVKIAIASPERIRQWGERTLPNGTVVGEVTKPETINYRTLKPEMDGLFCEKIFGPSKDWECWCGKYKRVR
HRGIVCERCGVEVTESRVRRHRMGYIKLAAPVTHVWYLKGIPSYLSILLDMALRDVEQIVYFNAYVVLNPGNASNLQYKQ
LLTEDQWVEIEDQIYAEDSELEGIEVGIGAEAVQRLLAELQLEEVAEKLREEILASKGQKRAKLIKRLRVIDNFIATHSQ
AEWMTLDVIPVIPPDLRPMVQLDGGRFATSDLNDLYRRVINRNNRLARLQEILAPEIIVRNEKRMLQEAVDALIDNGRRG
RTVVGANNRALKSLSDIIEGKQGRFRQNLLGKRVDYSGRSVIVVGPNLKIYQCGLPREMAIELFQPFVIHRLIKLGIVNN
IKAAKKLILKGDPQIWSVLEEVITGHPVMLNRAPTLHRLGIQAFEPILVEGRAIQLHPLVCPAFNADFDGDQMAVHVPLS
LEAQCEARLLMLACHNVLSPATGKPIVAPSQDMVLGCYYLTAENPNAQKGAGRYFAGIEDALRAYDHGQVDLHSQIWIRH
LDEDVVTEKPDTEVIKTEDLGDGTVMKYYRERKIREGVDGEIITQYIQTTPGRIIYNKTIAEALVF
>Q31N15 2.7.7.6~~~rpoC2~~~DNA-directed RNA polymerase subunit beta'~~~COG0086
MAEAKSAPIFRNRVIDKKQLKKLIGWTFAHYGTAKTAVVADDLKALGFRYATRAGVSISIDDLKVPGSKAELLESAEKRI
QETEDRYTRGEITEVERFQKVIDTWANTNDELTDRVVKNFRESDPLNSVYMMAFSGARGNISQVRQLVGMRGLMANPQGE
IIDLPIKTNFREGLTVTEYIISSYGARKGLVDTALRTADSGYLTRRLVDVSQDVIIHEVDCGTSRGLFVEAMTDGDRILI
PISQRLLGRVTAEAVLDPSTDEVLAEAGQDINEDLANRIEKAGIKKVKVRSPLTCEAARSVCQKCYGWSLAHAQMVDMGE
AVGIIAAQSIGEPGTQLTMRTFHTGGVFTGETARLLRAPVAGTIKLGKKARTRPYRTRHGEEALLAEANFDLVLEGKGRK
ETFAILQGSTIFVQDGDKVAAEAILAEVPVSGRTKRTVEKATKDVATDLAGEIRFQDIVPEEKTDRQGNTTRIAQRGGLL
WVLAGDVYNLLPGAEPTVKNGDRVEVGDVLAETKLTTERGGTVRMGEDNGSSTHREVEIITASVVLDTATVKAEASQGRE
HYVIETKGGQRFNLLAAPGTKVTTGHVVAELIDSRYRTQTGGLLKYSGVEISKKGRAKAKQGYEVTKGGTLLWIPEETHE
VNKDISLLNVEDGQLVEAGTEVVKDIFCQTTGIVSVTQNNDILREIVIKPGDVHVLDDPDTAAKYDEGRLVNAGEEVFPG
LTAEQLVWAEAVDGTDGPLLLLRPVQELVIPDEPPVPSQDSSQESSSRSIRLRAVQRLQFQDGERIKSVEGVDLLRTQLV
LESEEGSSQLSADIELLPDSKDPETLRLQLVIIEPVVIRRDVASDTTHGSTHTELRVKDGQKVKPGAVIACTQIQCKEAG
VVRGIQEGSEAVRRLLVERERDCVTLDLDVTAATQLQPGSLIVAGTQLVDGIIAPESGEVRAIAPGQLQLRIARPYRVSQ
GAVLHVEDKGLVQRGDNLVLLVFERAKTGDIIQGLPRIEELLEARKPKEACILARRPGVAHINYSDDDAIDIQVIEADGT
QADYPVGPGQPLIISDGETVDAGQALTDGPANPHDLLEIYYDYFREQLGEDYEAALESLRRVQALLVNEVQSVYQSQGID
ISDKHIEVIVRQMTSKVRIDDGGDTIMLPGELHELREVYNSNNTMALTGMAPAQFTPVLLGITKASLNTNSFISAASFQE
TTRVLTEAAIEGKSDWLRGLKENVIIGRLIPAGTGFKAYEESLLTDVDGGYEDRVYDDDLADVVIDDRAARSYTLNEGRD
FSRSMTFAEGESMILDDGEELIDDSSASLRNLVDVDED
>P73334 2.7.7.6~~~rpoC2~~~DNA-directed RNA polymerase subunit beta'~~~COG0086
MTFYNYTIDKGRLKKLIALAYRRYGSARCSQLADELKELGFRFATKAGVSISVDDLTIPPEKKQMLEAAEKEIRTTEERY
ARGEITEVERFQKVIDTWNGTSEELKDQVVVNFRKTDPLNSVYMMAFSGARGNMSQVRQLVGMRGLMADPQGEIIDLPIK
TNFREGLTVTEYVISSYGARKGLVDTALRTADSGYLTRRLVDVSQDVIVREQDCGTERSLRVTAMTDGDQVKISLADRLF
GRLLAKDVVGPDGEIIAKRNDEIDEALANRIAAVTDEVYVRSPLTCEAARSVCQNCYGWSLAHGHKVDLGEAVGIIAAQS
IGEPGTQLTMRTFHTGGVFTGEVARQEKAPEDGTVKWGKGLSTRKVRTRHGEDAEQVEIAGDLIWKGEGKKAATQTYSLT
PGSLLFVQDGQTVTAGQLMTEISLSKTQRSTERATKDVAGDLAGEVLFDRLVPEEKTDRQGNTTRIAQRGGLVWILSGEV
YNLPPGAEPVVKNDEQVEVGSIMAETKLVTNDGGVVRLVSNREIEIITASVLLDQAQVKLESSGGREQYVIYTADKQRFL
LKAAPGTKVQNHSIVAELIDDRYRTTTGGMIRYAGVEVAKGGRKQGYEVTKGGTLLWIPEETHEINKDISLLIVEDGQYV
EAGTEVVKDIFCQSSGIVEVVQKNDILREIIIKPGDFYQDVDPGSVKIESGQLLQPGQDVFPGVTVSTLSQAEWIESPEG
NGLLLRPVEEYKVFDEPAAPSQGSQNEEGGRQIELRSVQRLFYKDGDRVKSVEGAPLLSTQLVLEIYGSGNEGISHLSAD
IELQDDEEEDCQRLQLVILESLVLRRDQESDPLGGASKTRLLVQDGDQIPPGAVVARTEIQCKEAGTVRGIKEGQESIRR
VLLERAADRLVVDLPSAPEVKPGQLLVAGQELVPGVKLEESGKVLEINGKGDNYQLVLRRARPYRVSPGAVLHIEDGDLV
QRGDNLVLLVFERAKTGDIVQGLPRIEELLEARKPKEACVLARAPGVCQVEYLEDESVDIKVVEDDGTVSEYPLLPGQNA
MVTDGQRIDVGHALTDGYNNPHEILDVFFSYYVDKDGCYQAALRGLQAAQKFLVNEVQTVYQSQGVDISDKHIEVIVRQM
TAKVRIDDGGDTTMLPGELVELRQVEQVNEAMGITGSAPARYTPVLLGITKASLNTDSFISAASFQETTRVLTEAAIEGK
SDWLRGLKENVIIGRLIPAGTGFSSHEEVLGLIETQDDIQGYMIEPIELPTTKKKASATKVKTKKVEADDDLLDDTRARA
YAGTQLSQDDEEFEETYDTDEDDFDMDDDDDFGDDED
>Q8DL57 2.7.7.6~~~rpoC2~~~DNA-directed RNA polymerase subunit beta'~~~COG0086
MTEKKPIFFNRIIDKKGLRDLIAWSFSNFGTARTAEMADKIKDLGFHYATRAGVSISVDDLLVPPKKQELLEAAEKEIKT
AQERYSRGEITEVERFQKVIDTWNSTNEELKNEVVRHFRNTDVLNSVYMMAFSGARGNLSQVRQLVGMRGLMANPQGEII
DLPIKTNFREGLTVTEYIISSYGARKGLVDTALRTADSGYLTRRLVDVSQDVIIREEDCETERGITLRSMTVGDKVLALQ
DRLLGRVVLNDVRHPQTGEVLVAKNQAISADLAKKIVDAGIEEVVVRSPLTCEATGSVCRLCYGWSLAHARLVDMGEAVG
IIAAQSIGEPGTQLTMRTFHTGGVFTGEVARQERAPFAGTVEYGKKLRVRPYRTRHGDDAFIVETAGKLVVKGGQQRQEF
DLSQGSIVLVADGETVAAGQLLAEVAQAARSVRKATEKVTKDVASDLAGQVKFVNLDAEEKRDRQGTTTRIAPKGGLIWV
LSGEVYNLPPGAEPVVKNGDRIEAGAVLAETTVKTEHGGVVRLPEQQDSKGGREVEIITASVMLDKAKVLKETQQGREHY
IIETATGQRFSLKAAPGTKVANGQVVAELIDDRYHTTTGGILKYADIEVAKKGKAKQGYEVLKGGTLLWIPEETHEVNKD
ISLLMVEDNQYVEAGTEVVKDIFCQNSGVVEVIQKNDILREIIIKPGELHLVDDPEAARLKHGTLARPGEEVLPGLVVDT
LSQVDYLEDTPEGPAILLRPVQEFSVPDEPSVPSQDSSDGSGQSIRLRAVQRLPYKHDERVKSVDGVDLLRTQLVLEIGS
EAPQLAADIEIVTDEVDPEAQRLQLVILESLIIRRDIAADQTQGSTFTSLLVKDGDHIGPGAVIARTDIKAKQAGEVQGI
VRSGESVRRILVVTDSDRLRVETNGAKPTVKVGDLVRPGDELAKGVTAPETAAVMAVADDHVILRLARPYLVSPGAVLQI
EEGDLVQRGDNLALLVFERAKTGDIIQGLPRIEELLEARHPKEKCVLAVRPGTCQVTYNSDDSVEIKVIEEDGTIQEYPV
LPGQNPLVVDGQKVNLADPLTDGPVDPHDILSIYFEYYKPQGLLKAAQTSLEKVQSFLVNEVQSVYLSQGIEIADKHIEV
IVRQMTSKVRIDDAGDTILLSGELMTLRQAEQANEPMALTGGAPAQYTPVLLGITKASLNTDSFISAASFQETTRVLTEA
AIEGKSDWLRGLKENVIIGRLIPAGTGFNSYEESSNGDEEWEEGEDRLGQTHVISPEPESPKMTVNVTADLGEDVLIDDE
TAPHVIEKITGGARDFEFASSDVEEDELTEEDDDYGDEEEEDAF
>P37871 2.7.7.6~~~rpoC~~~DNA-directed RNA polymerase subunit beta'~~~COG0086
MLDVNNFEYMNIGLASPDKIRSWSFGEVKKPETINYRTLKPEKDGLFCERIFGPTKDWECHCGKYKRVRYKGVVCDRCGV
EVTRAKVRRERMGHIELAAPVSHIWYFKGIPSRMGLVLDMSPRALEEVIYFASYVVTDPANTPLEKKQLLSEKEYRAYLD
KYGNKFQASMGAEAIHKLLQDIDLVKEVDMLKEELKTSQGQRRTRAIKRLEVLEAFRNSGNKPSWMILDVLPVIPPELRP
MVQLDGGRFATSDLNDLYRRVINRNNRLKRLLDLGAPSIIVQNEKRMLQEAVDALIDNGRRGRPVTGPGNRPLKSLSHML
KGKQGRFRQNLLGKRVDYSGRSVIVVGPHLKMYQCGLPKEMALELFKPFVMKELVEKGLAHNIKSAKRKIERVQPEVWDV
LESVIKEHPVLLNRAPTLHRLGIQAFEPTLVEGRAIRLHPLVCTAYNADFDGDQMAVHVPLSAEAQAEARILMLAAQNIL
NPKDGKPVVTPSQDMVLGNYYLTLERAGAVGEGMVFKNTDEALLAYQNGYVHLHTRVAVAANSLKNVTFTEEQRSKLLIT
TVGKLVFNEILPESFPYMNEPTKSNIEEKTPDRFFLEKGADVKAVIAQQPINAPFKKGILGKIIAEIFKRFHITETSKML
DRMKNLGFKYSTKAGITVGVSDIVVLDDKQEILEEAQSKVDNVMKQFRRGLITEEERYERVISIWSAAKDVIQGKLMKSL
DELNPIYMMSDSGARGNASNFTQLAGMRGLMANPAGRIIELPIKSSFREGLTVLEYFISTHGARKGLADTALKTADSGYL
TRRLVDVAQDVIIRETDCGTDRGILAKPLKEGTETIERLEERLIGRFARKQVKHPETGEVLVNENELIDEDKALEIVEAG
IEEVWIRSAFTCNTPHGVCKRCYGRNLATGSDVEVGEAVGIIAAQSIGEPGTQLTMRTFHTGGVAGDDITQGLPRIQELF
EARNPKGQATITEIDGTVVEINEVRDKQQEIVVQGAVETRSYTAPYNSRLKVAEGDKITRGQVLTEGSIDPKELLKVTDL
TTVQEYLLHEVQKVYRMQGVEIGDKHVEVMVRQMLRKVRVIDAGDTDVLPGTLLDIHQFTEANKKVLLEGNRPATGRPVL
LGITKASLETDSFLSAASFQETTRVLTDAAIKGKRDELLGLKENVIIGKLVPAGTGMMKYRKVKPVSNVQPTDDMVPVE
>Q18CF3 2.7.7.6~~~rpoC~~~DNA-directed RNA polymerase subunit beta'~~~COG0086
MFELNNFESIKIALASPEKIRQWSRGEVKKPETINYRTLKPEKDGLFCERIFGPQKDWECHCGKYRRVRYKGVVCDRCGV
EVTKSKVRRERMGHIELAAPMSHIWYFKGIPSRMGLLLDMSPRSLEKILYFASYVVVDPGETGLNEKQLLTEKEYRTALE
KYGYTFTVGMGAEAVKTLLQNIDLEQQSKDLRAELKDSTGQKKVRTIRRLEVVEAFKKSGNKPEWMILDAIPVIPPDLRP
MVQLDGGRFATSDLNDLYRRVINRNNRLKRLLELGAPDIIVRNEKRMLQEAVDALIDNGRRGRPVTGPGNRPLKSLSDML
KGKQGRFRQNLLGKRVDYSGRSVIVVGPELKFYQCGLPKKMALELFKPFVMDKLVKEGYAHNIKSAKSIVEKVKPEVWDV
LEDVIKSHPVLLNRAPTLHRLGIQAFEPILVEGKAIKLHPLVCTAYNADFDGDQMAVHVPLSVEAQAEARFLMLSVNNIL
APKDGSPITTPSQDMVLGCYYLTIEAQDGAKGTGMVFKDFNELLLAYYNKSVHLHALVKLKVTLEDGRSSLVESTVGRFI
FNENIPQDLGFVDRKENPFALEVDFLADKKSLGKIIDKCFRKHGNTETAELLDYIKALGFKYSTLGGITVAVDDMSVPEE
KKVFIAEAEAKVDKYEKAYRRGLISDEERYEKVIETWTETTDKVTDALMGGLDRLNNIYIMAHSGARGSKNQIRQLAGMR
GLMANASGKTVEIPVKSNFREGLSVLEYFTSSHGARKGLADTAIRTAESGYLTRRLVDVSQDVIVREIDCGTEDTTEIYA
IKEGNEVIEEIYDRIVGRYTIDPILNPETGEVIVEADSMIQEDEAETIVALGIEKIRIRTVLNCKTNHGVCSKCYGRNLA
TGKEVNIGEAVGIIAAQSIGEPGTQLTMRTFHTGGVAGADITQGLPRVEELFEARKPKGLAVITEVSGRVEIDETGKRKE
VNVIPEEGETQTYVIPYGSRLKVKQGQMLEAGDPLTQGFINPHDIVRVNGVKGVQEYIVKEVQRVYRLQGVDVNDKHIEV
IVRQMLSKVKVEDPGDTDLLPGGYEDVLTFNECNKDAIDKGLRPAVAKRVLLGITKASLATDSFLSAASFQETTRVLTEA
AIKGKEDHLIGLKENVILGKLIPAGTGMKKYRNIAVEKIED
>Q727C6 2.7.7.6~~~rpoC~~~DNA-directed RNA polymerase subunit beta'~~~COG0086
MTLDDLFTVRGSAANIANIRNLKAIQITIASPENIREWSYGEVKKPETINYRTFKPERDGLFCAKIFGPVKDYECNCGKY
KRMKHRGIVCEKCGVEVIASKVRRERMGHIELAAPVAHIWFLKTLPSKIGTLLDMTMADLEKVLYFDSYIVLDPGSTNLT
KMQVISEDQYLQVIDHYGEDALTVGMGAEAVRSLLEELNLEELRVQLREESQATKSQTKKKKLTKRLKIVEAFLESNNKP
EWMVMEVIPVIPPELRPLVPLDGGRFATSDLNDLYRRVINRNNRLKRLMELGAPDIIIRNEKRMLQEAVDALFDNGRRGR
AITGTNGRPLKSLSDMIKGKQGRFRQNLLGKRVDYSGRSVIVVGPKLKLHQCGLPKKMALELFKPFIYSELEKRGLASTI
KSAKKMVEREELVVWDILEEVVREYPIMLNRAPTLHRLGIQSFEPLLVEGKAIQLHPLVCSAYNADFDGDQMAVHVPLSV
EAQIECRVLMMSTNNILSPANGSPVIVPSQDIVLGLYYMTVDRSFEKGENMSFCAPWEVVAAYDAGVVALHARINVRMED
GKVVRTTVGRILVWELLPHCVPFSMVNTTLTKKNIARLVSTAYRDAGTKATVILCDRLKDVGYEYATRAGVTIAVKDLTI
PSTKKGLIETAQNEVDDIERQYRDGIITRTEKYNKVVDVWTKATQDVSNEMIREISSDIVEDPRTGAKEANSSFNSIYMM
STSGARGNQDQMRQLAGMRGLMAKPSGEIIETPITSSFREGLSVLQYFTSTHGARKGLADTALKTANSGYLTRRLVDVVQ
DVIVSEHDCGTVDGIELTHIKEGGEIKIPLADRALGRVLLYPVYDPETRDLLFPENTLVDENVAKVLVEREVSSVMIRSA
LTCQSDRGICTLCYGRDLARGHIVNIGETVGIIAAQSIGEPGTQLTMRTFHIGGTASREIERSSFEAQHPGRVILSRVKA
VRNRDGQYMVMGKSGQLAIVDDQGREREKYTLPNGSRLLVTEGEEIRKGQILAEWDPFNEPFVSEVDGVIRFSDIVEGKT
FQEKMDEATRMTTQTIIEYRTTNFRPSISICDEHGEVKMRGNNIPATYSLPVGAIIMVKNGQDLQAGDIIARKPRETSKT
KDIVGGLPRVAELFEVRKPKDMAVVSEIAGIVTYAGETKGKRKLVVTPEIGEAKEYLVPKGKHITVTDGDFVEAGDLLTE
GHPELHDILRTRGEKYLARYLTDEIQEVYRFQGVAIDDKHIEVIVRQMLKKVTVLDPGGTTFLVGEQVDKGEFRVENTRA
MGEGRTPATAEPLVLGITQASLTTSSFISAASFQETTKVLTEASLRGKMDYLRGLKENVIVGRLIPAGTGYREYVNTDIL
VPEQRERPDKFLEDLEENPLLVDIY
>A7ZUK2 2.7.7.6~~~rpoC~~~DNA-directed RNA polymerase subunit beta'~~~
MKDLLKFLKAQTKTEEFDAIKIALASPDMIRSWSFGEVKKPETINYRTFKPERDGLFCARIFGPVKDYECLCGKYKRLKH
RGVICEKCGVEVTQTKVRRERMGHIELASPTAHIWFLKSLPSRIGLLLDMPLRDIERVLYFESYVVIEGGMTNLERQQIL
TEEQYLDALEEFGDEFDAKMGAEAIQALLKSMDLEQECEQLREELNETNSETKRKKLTKRIKLLEAFVQSGNKPEWMILT
VLPVLPPDLRPLVPLDGGRFATSDLNDLYRRVINRNNRLKRLLDLAAPDIIVRNEKRMLQEAVDALLDNGRRGRAITGSN
KRPLKSLADMIKGKQGRFRQNLLGKRVDYSGRSVITVGPYLRLHQCGLPKKMALELFKPFIYGKLELRGLATTIKAAKKM
VEREEAVVWDILDEVIREHPVLLNRAPTLHRLGIQAFEPVLIEGKAIQLHPLVCAAYNADFDGDQMAVHVPLTLEAQLEA
RALMMSTNNILSPANGEPIIVPSQDVVLGLYYMTRDCVNAKGEGMVLTGPKEAERLYRSGLASLHARVKVRITEYEKDAN
GELVAKTSLKDTTVGRAILWMIVPKGLPYSIVNQALGKKAISKMLNTCYRILGLKPTVIFADQIMYTGFAYAARSGASVG
IDDMVIPEKKHEIISEAEAEVAEIQEQFQSGLVTAGERYNKVIDIWAAANDRVSKAMMDNLQTETVINRDGQEEKQVSFN
SIYMMADSGARGSAAQIRQLAGMRGLMAKPDGSIIETPITANFREGLNVLQYFISTHGARKGLADTALKTANSGYLTRRL
VDVAQDLVVTEDDCGTHEGIMMTPVIEGGDVKEPLRDRVLGRVTAEDVLKPGTADILVPRNTLLHEQWCDLLEENSVDAV
KVRSVVSCDTDFGVCAHCYGRDLARGHIINKGEAIGVIAAQSIGEPGTQLTMRTFHIGGAASRAAAESSIQVKNKGSIKL
SNVKSVVNSSGKLVITSRNTELKLIDEFGRTKESYKVPYGAVLAKGDGEQVAGGETVANWDPHTMPVITEVSGFVRFTDM
IDGQTITRQTDELTGLSSLVVLDSAERTAGGKDLRPALKIVDAQGNDVLIPGTDMPAQYFLPGKAIVQLEDGVQISSGDT
LARIPQESGGTKDITGGLPRVADLFEARRPKEPAILAEISGIVSFGKETKGKRRLVITPVDGSDPYEEMIPKWRQLNVFE
GERVERGDVISDGPEAPHDILRLRGVHAVTRYIVNEVQDVYRLQGVKINDKHIEVIVRQMLRKATIVNAGSSDFLEGEQV
EYSRVKIANRELEANGKVGATYSRDLLGITKASLATESFISAASFQETTRVLTEAAVAGKRDELRGLKENVIVGRLIPAG
TGYAYHQDRMRRRAAGEAPAAPQVTAEDASASLAELLNAGLGGSDNE
>P0A8T8 2.7.7.6~~~rpoC~~~DNA-directed RNA polymerase subunit beta'~~~COG0086
MKDLLKFLKAQTKTEEFDAIKIALASPDMIRSWSFGEVKKPETINYRTFKPERDGLFCARIFGPVKDYECLCGKYKRLKH
RGVICEKCGVEVTQTKVRRERMGHIELASPTAHIWFLKSLPSRIGLLLDMPLRDIERVLYFESYVVIEGGMTNLERQQIL
TEEQYLDALEEFGDEFDAKMGAEAIQALLKSMDLEQECEQLREELNETNSETKRKKLTKRIKLLEAFVQSGNKPEWMILT
VLPVLPPDLRPLVPLDGGRFATSDLNDLYRRVINRNNRLKRLLDLAAPDIIVRNEKRMLQEAVDALLDNGRRGRAITGSN
KRPLKSLADMIKGKQGRFRQNLLGKRVDYSGRSVITVGPYLRLHQCGLPKKMALELFKPFIYGKLELRGLATTIKAAKKM
VEREEAVVWDILDEVIREHPVLLNRAPTLHRLGIQAFEPVLIEGKAIQLHPLVCAAYNADFDGDQMAVHVPLTLEAQLEA
RALMMSTNNILSPANGEPIIVPSQDVVLGLYYMTRDCVNAKGEGMVLTGPKEAERLYRSGLASLHARVKVRITEYEKDAN
GELVAKTSLKDTTVGRAILWMIVPKGLPYSIVNQALGKKAISKMLNTCYRILGLKPTVIFADQIMYTGFAYAARSGASVG
IDDMVIPEKKHEIISEAEAEVAEIQEQFQSGLVTAGERYNKVIDIWAAANDRVSKAMMDNLQTETVINRDGQEEKQVSFN
SIYMMADSGARGSAAQIRQLAGMRGLMAKPDGSIIETPITANFREGLNVLQYFISTHGARKGLADTALKTANSGYLTRRL
VDVAQDLVVTEDDCGTHEGIMMTPVIEGGDVKEPLRDRVLGRVTAEDVLKPGTADILVPRNTLLHEQWCDLLEENSVDAV
KVRSVVSCDTDFGVCAHCYGRDLARGHIINKGEAIGVIAAQSIGEPGTQLTMRTFHIGGAASRAAAESSIQVKNKGSIKL
SNVKSVVNSSGKLVITSRNTELKLIDEFGRTKESYKVPYGAVLAKGDGEQVAGGETVANWDPHTMPVITEVSGFVRFTDM
IDGQTITRQTDELTGLSSLVVLDSAERTAGGKDLRPALKIVDAQGNDVLIPGTDMPAQYFLPGKAIVQLEDGVQISSGDT
LARIPQESGGTKDITGGLPRVADLFEARRPKEPAILAEISGIVSFGKETKGKRRLVITPVDGSDPYEEMIPKWRQLNVFE
GERVERGDVISDGPEAPHDILRLRGVHAVTRYIVNEVQDVYRLQGVKINDKHIEVIVRQMLRKATIVNAGSSDFLEGEQV
EYSRVKIANRELEANGKVGATYSRDLLGITKASLATESFISAASFQETTRVLTEAAVAGKRDELRGLKENVIVGRLIPAG
TGYAYHQDRMRRRAAGEAPAAPQVTAEDASASLAELLNAGLGGSDNE
>P0A8T7 2.7.7.6~~~rpoC~~~DNA-directed RNA polymerase subunit beta'~~~COG0086
MKDLLKFLKAQTKTEEFDAIKIALASPDMIRSWSFGEVKKPETINYRTFKPERDGLFCARIFGPVKDYECLCGKYKRLKH
RGVICEKCGVEVTQTKVRRERMGHIELASPTAHIWFLKSLPSRIGLLLDMPLRDIERVLYFESYVVIEGGMTNLERQQIL
TEEQYLDALEEFGDEFDAKMGAEAIQALLKSMDLEQECEQLREELNETNSETKRKKLTKRIKLLEAFVQSGNKPEWMILT
VLPVLPPDLRPLVPLDGGRFATSDLNDLYRRVINRNNRLKRLLDLAAPDIIVRNEKRMLQEAVDALLDNGRRGRAITGSN
KRPLKSLADMIKGKQGRFRQNLLGKRVDYSGRSVITVGPYLRLHQCGLPKKMALELFKPFIYGKLELRGLATTIKAAKKM
VEREEAVVWDILDEVIREHPVLLNRAPTLHRLGIQAFEPVLIEGKAIQLHPLVCAAYNADFDGDQMAVHVPLTLEAQLEA
RALMMSTNNILSPANGEPIIVPSQDVVLGLYYMTRDCVNAKGEGMVLTGPKEAERLYRSGLASLHARVKVRITEYEKDAN
GELVAKTSLKDTTVGRAILWMIVPKGLPYSIVNQALGKKAISKMLNTCYRILGLKPTVIFADQIMYTGFAYAARSGASVG
IDDMVIPEKKHEIISEAEAEVAEIQEQFQSGLVTAGERYNKVIDIWAAANDRVSKAMMDNLQTETVINRDGQEEKQVSFN
SIYMMADSGARGSAAQIRQLAGMRGLMAKPDGSIIETPITANFREGLNVLQYFISTHGARKGLADTALKTANSGYLTRRL
VDVAQDLVVTEDDCGTHEGIMMTPVIEGGDVKEPLRDRVLGRVTAEDVLKPGTADILVPRNTLLHEQWCDLLEENSVDAV
KVRSVVSCDTDFGVCAHCYGRDLARGHIINKGEAIGVIAAQSIGEPGTQLTMRTFHIGGAASRAAAESSIQVKNKGSIKL
SNVKSVVNSSGKLVITSRNTELKLIDEFGRTKESYKVPYGAVLAKGDGEQVAGGETVANWDPHTMPVITEVSGFVRFTDM
IDGQTITRQTDELTGLSSLVVLDSAERTAGGKDLRPALKIVDAQGNDVLIPGTDMPAQYFLPGKAIVQLEDGVQISSGDT
LARIPQESGGTKDITGGLPRVADLFEARRPKEPAILAEISGIVSFGKETKGKRRLVITPVDGSDPYEEMIPKWRQLNVFE
GERVERGDVISDGPEAPHDILRLRGVHAVTRYIVNEVQDVYRLQGVKINDKHIEVIVRQMLRKATIVNAGSSDFLEGEQV
EYSRVKIANRELEANGKVGATYSRDLLGITKASLATESFISAASFQETTRVLTEAAVAGKRDELRGLKENVIVGRLIPAG
TGYAYHQDRMRRRAAGEAPAAPQVTAEDASASLAELLNAGLGGSDNE
>Q2A1M8 2.7.7.6~~~rpoC~~~DNA-directed RNA polymerase subunit beta'~~~
MNNGILHQNYNSKKFDIIKISLASPEVIRSWSHGEVKKPETINYRTFKPERDGLFCAKIFGPIKDYECLCGKYKRLKHRG
VVCERCGVEVEQAKVRRERMGHIDLVCPVVHIWYLKSLPSRIGLFLDMPLKNVEKVLYFESYIVTDPGMTPLEKKQLLTD
EEYAEALENYGYEFEASMGAEAIRDLLADTDIESEIELLQAECEESKSTAKKEKAIKRLRLLETFQASGNKPEWMVMTVL
PVLPPDLRPLVPIEGGRFATSDLNDLYRRVINRNNRLKKLLDLNAPDIIVRNEKRMLQEAVDALLDNGRRGRAVTGSNKR
PLKSLADMIKGKQGRFRQNLLGKRVDYSGRSVITVGPSLRLHECGLPKKMALELFKPFVYSKLRLGGHATTIKQAKRMVE
LEEAVVWDILETVINEHPVLLNRAPTLHRLGIQAFEPRLIEGKAIQLHPLVCAAFNADFDGDQMAVHVPLTVESQLEARV
LMMSTNNILSPASGQPIITPTQDIVLGLYYITREKEGARGEGKLFSSYEDVSRAYNSGTIDIHAKIKLRIDRQVFDTKGN
TYNEKGVVNTTVGRALLLNILPEGLSFSLLNKVLVKKEISKIINQAFRVLGGKATVVLADKLMYAGFKYSTLSGVSVGVD
DMTIPDNKEAKIEEAEKEIKQITEQYQSSLITENERYNNIINIWSKTSDEVGASMMDAISKDTVSINGEKKEIESFNSVY
MMAKSGARGSYNQMRQLAGMRGLMAKPDGTMIETAITANFREGLSVLQYFTSTHGARKGLADTALKTANAGYLTRRLVDV
AQDLVVIEEDCGTDDGLMFSAIVEDGEVKVPLVERALGRTLAADVVTEKGVVLLEAGTLLDENLVELLDDNGIDMIKVRS
PITCKTRRGLCAKCYGRDLARERQVNVGESVGVIAAQSIGEPGTQLTMRTFHTGGAASLGITVSDIKVKTAGKIKFKNIR
TVTNKEGQEIVISRAGEIIVSDTMGRVREQHKIPMGAVVPLASGKAVEIGDVIATWDPHAQPLITDVAGKVVLEDVIDGI
TSKHTYDDLTGQQTIEITSISQRTTSKNLKPVVKIVDEKGAELKSIPLAVGAVLNVADDSILEVGDIVAKIPLEGSKNKD
ITGGLPRVAELFEARRPKDAAILSPCDGMVRLGNRDTKEKQRIEIIDKNGHIVEEILLPKSRHLVVFDGEQVSRGDVLAD
GPTDPHDLLKYKGLEEFADYILIEAQSVYRMQGVVINDKHIETIVRQMLRKAVILDEGDSKFVKDESIELVRILEENDKL
RKQGKKEVEYELVLMGITRSSLSTESFLSAASFQETTRVLTEASINSQIDNLRGLKENVLIGRLIPAGTGLAVRKESAKI
EKMREELGVEDNMVFTDLSSFNPEEISFDSIQSQKEDKDINEDIEESLRNALESLDF
>A0QS66 2.7.7.6~~~rpoC~~~DNA-directed RNA polymerase subunit beta'~~~COG0086
MLDVNFFDELRIGLATADDIRNWSYGEVKKPETINYRTLKPEKDGLFCEKIFGPTRDWECYCGKYKRVRFKGIICERCGV
EVTRAKVRRERMGHIELAAPVTHIWYFKGVPSRLGYLLDLAPKDLEKIIYFAAYVITSVDDEMRHNELSTLEAEMAVEKK
AVEDQRDADLEARAQKLEADLAELEAEGAKSDVRRKVRDSGEREMRQLRDRAQRELDRLDEIWNTFTKLAPKQLIVDEVL
YRELQDRYGEYFTGAMGAESIKKLIENFDIDAEAESLREVIRSGKGQKKLRALKRLKVVAAFQQSGNSPMGMVLDAVPVI
PPELRPMVQLDGGRFATSDLNDLYRRVINRNNRLKRLIDLGAPEIIVNNEKRMLQESVDALFDNGRRGRPVTGPGNRPLK
SLSDLLKGKQGRFRQNLLGKRVDYSGRSVIVVGPQLKLHQCGLPKLMALELFKPFVMKRLVDLNHAQNIKSAKRMVERQR
PQVWDVLEEVIAEHPVLLNRAPTLHRLGIQAFEPQLVEGKAIQLHPLVCEAFNADFDGDQMAVHLPLSAEAQAEARILML
SSNNILSPASGKPLAMPRLDMVTGLYYLTTLVEGATGEYQAATKDAPEQGVYSSPAEAIMAMDRGALSVRAKIKVRLTEL
RPPTDLEAQLFENGWKPGDAWTAETTLGRVMFNELLPKSYPFVNEQMHKKVQARIINDLAERFPMIVVAQTVDKLKDAGF
YWATRSGVTVSMADVLVPPQKQEILERHEAEADAIERKYQRGALNHTERNESLVKIWQDATEEVGKALEEFYPADNPIIT
IVKSGATGNLTQTRTLAGMKGLVTNPKGEFIPRPIKSSFREGLTVLEYFINTHGARKGLADTALRTADSGYLTRRLVDVS
QDVIVREHDCETERGINVTLAERGPDGTLIRDAHVETSAFARTLATDAVDANGNVIIERGHDLGDPAIDALLAAGITTVK
VRSVLTCTSATGVCAMCYGRSMATGKLVDIGEAVGIVAAQSIGEPGTQLTMRTFHQGGVTGGADIVGGLPRVQELFEARV
PRNKAPIADVAGRVRLEESDKFFKITIVPDDGGEEVVYDKLSKRQRLRVITHEDGTEGVLSDGDHVEVGDQLMEGAADPH
EVLRVQGPREVQIHLVKEVQEVYRAQGVSIHDKHIEVIVRQMLRRVTIIDSGSTEFLPGSLTERAEFEAENRRVVAEGGE
PAAGRPVLMGITKASLATDSWLSAASFQETTRVLTDAAINCRSDKLNGLKENVIIGKLIPAGTGISRYRNIQVQPTEEAR
AAAYTIPSYEDQYYSPDFGQATGAAVPLDDYGYSDYR
>P9WGY7 2.7.7.6~~~rpoC~~~DNA-directed RNA polymerase subunit beta'~~~COG0086
MLDVNFFDELRIGLATAEDIRQWSYGEVKKPETINYRTLKPEKDGLFCEKIFGPTRDWECYCGKYKRVRFKGIICERCGV
EVTRAKVRRERMGHIELAAPVTHIWYFKGVPSRLGYLLDLAPKDLEKIIYFAAYVITSVDEEMRHNELSTLEAEMAVERK
AVEDQRDGELEARAQKLEADLAELEAEGAKADARRKVRDGGEREMRQIRDRAQRELDRLEDIWSTFTKLAPKQLIVDENL
YRELVDRYGEYFTGAMGAESIQKLIENFDIDAEAESLRDVIRNGKGQKKLRALKRLKVVAAFQQSGNSPMGMVLDAVPVI
PPELRPMVQLDGGRFATSDLNDLYRRVINRNNRLKRLIDLGAPEIIVNNEKRMLQESVDALFDNGRRGRPVTGPGNRPLK
SLSDLLKGKQGRFRQNLLGKRVDYSGRSVIVVGPQLKLHQCGLPKLMALELFKPFVMKRLVDLNHAQNIKSAKRMVERQR
PQVWDVLEEVIAEHPVLLNRAPTLHRLGIQAFEPMLVEGKAIQLHPLVCEAFNADFDGDQMAVHLPLSAEAQAEARILML
SSNNILSPASGRPLAMPRLDMVTGLYYLTTEVPGDTGEYQPASGDHPETGVYSSPAEAIMAADRGVLSVRAKIKVRLTQL
RPPVEIEAELFGHSGWQPGDAWMAETTLGRVMFNELLPLGYPFVNKQMHKKVQAAIINDLAERYPMIVVAQTVDKLKDAG
FYWATRSGVTVSMADVLVPPRKKEILDHYEERADKVEKQFQRGALNHDERNEALVEIWKEATDEVGQALREHYPDDNPII
TIVDSGATGNFTQTRTLAGMKGLVTNPKGEFIPRPVKSSFREGLTVLEYFINTHGARKGLADTALRTADSGYLTRRLVDV
SQDVIVREHDCQTERGIVVELAERAPDGTLIRDPYIETSAYARTLGTDAVDEAGNVIVERGQDLGDPEIDALLAAGITQV
KVRSVLTCATSTGVCATCYGRSMATGKLVDIGEAVGIVAAQSIGEPGTQLTMRTFHQGGVGEDITGGLPRVQELFEARVP
RGKAPIADVTGRVRLEDGERFYKITIVPDDGGEEVVYDKISKRQRLRVFKHEDGSERVLSDGDHVEVGQQLMEGSADPHE
VLRVQGPREVQIHLVREVQEVYRAQGVSIHDKHIEVIVRQMLRRVTIIDSGSTEFLPGSLIDRAEFEAENRRVVAEGGEP
AAGRPVLMGITKASLATDSWLSAASFQETTRVLTDAAINCRSDKLNGLKENVIIGKLIPAGTGINRYRNIAVQPTEEARA
AAYTIPSYEDQYYSPDFGAATGAAVPLDDYGYSDYR
>Q9HWC9 2.7.7.6~~~rpoC~~~DNA-directed RNA polymerase subunit beta'~~~
MKDLLNLLKNQGQIEEFDAIRIGLASPEMIRSWSFGEVKKPETINYRTFKPERDGLFCAKIFGPVKDYECLCGKYKRLKH
RGVICEKCGVEVALAKVRRERMGHIELASPVAHIWFLKSLPSRIGLLLDMTLRDIERVLYFESYVVIDPGMTTLEKGQLL
NDEQYFEALEEFGDDFDARMGAEAVHELLNAIDLEHEIGRLREEIPQTNSETKIKKLSKRLKLMEAFQGSGNKPEWMVLT
VLPVLPPDLRPLVPLDGGRFATSDLNDLYRRVINRNNRLKRLLDLAAPDIIVRNEKRMLQEAVDALLDNGRRGRAITGSN
KRPLKSLADMIKGKQGRFRQNLLGKRVDYSGRSVITVGPTLRLHQCGLPKKMALELFKPFIFGKLEGRGMATTIKAAKKM
VERELPEVWDVLAEVIREHPVLLNRAPTLHRLGIQAFEPVLIEGKAIQLHPLVCAAYNADFDGDQMAVHVPLTLEAQLEA
RALMMSTNNILSPANGEPIIVPSQDVVMGLYYMTREAINAKGEGMAFADLQEVDRAYRSGQASLHARVKVRINEKIKGED
GQLTANTRIVDTTVGRALLFQVVPAGLPFDVVNQSMKKKAISKLINHCYRVVGLKDTVIFADQLMYTGFAYSTISGVSIG
VNDFVIPDEKARIINAATDEVKEIESQYASGLVTQGEKYNKVIDLWSKANDEVSKAMMANLSKEKVVDREGKEVDQESFN
SMYMMADSGARGSAAQIRQLAGMRGLMAKPDGSIIETPITANFREGLNVLQYFISTHGARKGLADTALKTANSGYLTRRL
VDVAQDLVVTEIDCGTEHGLLMSPHIEGGDVVEPLGERVLGRVIARDVFKPGSDEVIVPAGTLIDEKWVDFLEVMSVDEV
VVRSPITCETRHGICAMCYGRDLARGHRVNIGEAVGVIAAQSIGEPGTQLTMRTFHIGGAASRTSAADNVQVKNGGTIRL
HNLKHVVRADGALVAVSRSGELAVADDFGRERERYKLPYGAVISVKEGDKVDPGAIVAKWDPHTHPIVTEVDGTVAFVGM
EEGITVKRQTDELTGLTNIEVMDPKDRPAAGKDIRPAVKLIDAAGKDLLLPGTDVPAQYFLPANALVNLTDGAKVSIGDV
VARIPQETSKTRDITGGLPRVADLFEARRPKEPSILAEISGTISFGKETKGKRRLVITPNDGSDPYEELIPKWRHLNVFE
GEQVNRGEVISDGPSNPHDILRLLGVSSLAKYIVNEIQDVYRLQGVKINDKHIETILRQMLRKVEVSESGDSSFIKGDQV
ELTQVLEENEQLGTEDKFPAKYERVLLGITKASLSTESFISAASFQETTRVLTEAAVTGKRDFLRGLKENVVVGRLIPAG
TGLAYHSERKRQRDLGKPQRVSASEAEAALTEALNSSGN
>P0A2R4 2.7.7.6~~~rpoC~~~DNA-directed RNA polymerase subunit beta'~~~
MKDLLKFLKAQTKTEEFDAIKIALASPDMIRSWSFGEVKKPETINYRTFKPERDGLFCARIFGPVKDYECLCGKYKRLKH
RGVICEKCGVEVTQTKVRRERMGHIELASPTAHIWFLKSLPSRIGLLLDMPLRDIERVLYFESYVVIEGGMTNLERQQIL
TEEQYLDALEEFGDEFDAKMGAEAIQALLKSMDLEQECETLREELNETNSETKRKKLTKRIKLLEAFVQSGNKPEWMILT
VLPVLPPDLRPLVPLDGGRFATSDLNDLYRRVINRNNRLKRLLDLAAPDIIVRNEKRMLQEAVDALLDNGRRGRAITGSN
KRPLKSLADMIKGKQGRFRQNLLGKRVDYSGRSVITVGPYLRLHQCGLPKKMALELFKPFIYGKLELRGLATTIKAAKKM
VEREEAVVWDILDEVIREHPVLLNRAPTLHRLGIQAFEPVLIEGKAIQLHPLVCAAYNADFDGDQMAVHVPLTLEAQLEA
RALMMSTNNILSPANGEPIIVPSQDVVLGLYYMTRDCVNAKGEGMVLTGPKEAERIYRAGLASLHARVKVRITEYEKDEN
GEFVAHTSLKDTTVGRAILWMIVPKGLPFSIVNQALGKKAISKMLNTCYRILGLKPTVIFADQTMYTGFAYAARSGASVG
IDDMVIPEKKHEIISEAEAEVAEIQEQFQSGLVTAGERYNKVIDIWAAANDRVSKAMMDNLQTETVINRDGQEEQQVSFN
SIYMMADSGARGSAAQIRQLAGMRGLMAKPDGSIIETPITANFREGLNVLQYFISTHGARKGLADTALKTANSGYLTRRL
VDVAQDLVVTEDDCGTHEGILMTPVIEGGDVKEPLRDRVLGRVTAEDVLKPGTADILVPRNTLLHEQWCDLLEANSVDAV
KVRSVVSCDTDFGVCAHCYGRDLARGHIINKGEAIGVIAAQSIGEPGTQLTMRTFHIGGAASRAAAESSIQVKNKGSIKL
SNVKSVVNSSGKLVITSRNTELKLIDEFGRTKESYKVPYGAVMAKGDGEQVAGGETVANWDPHTMPVITEVSGFIRFTDM
IDGQTITRQTDELTGLSSLVVLDSAERTTGGKDLRPALKIVDAQGNDVLIPGTDMPAQYFLPGKAIVQLEDGVQISSGDT
LARIPQESGGTKDITGGLPRVADLFEARRPKEPAILAEIAGIVSFGKETKGKRRLVITPVDGSDPYEEMIPKWRQLNVFE
GERVERGDVISDGPEAPHDILRLRGVHAVTRYIVNEVQDVYRLQGVKINDKHIEVIVRQMLRKATIESAGSSDFLEGEQV
EYSRVKIANRELEANGKVGATFSRDLLGITKASLATESFISAASFQETTRVLTEAAVAGKRDELRGLKENVIVGRLIPAG
TGYAYHQDRMRRRAAGEQPATPQVTAEDASASLAELLNAGLGGSDNE
>P60285 2.7.7.6~~~rpoC~~~DNA-directed RNA polymerase subunit beta'~~~
MIDVNNFHYMKIGLASPEKIRSWSFGEVKKPETINYRTLKPEKDGLFCERIFGPTKDWECSCGKYKRVRYKGMVCDRCGV
EVTKSKVRRERMGHIELAAPVSHIWYFKGIPSRMGLLLDMSPRALEEVIYFASYVVVDPGPTGLEKKTLLSEAEFRDYYD
KYPGQFVAKMGAEGIKDLLEEIDLDEELKLLRDELESATGQRLTRAIKRLEVVESFRNSGNKPSWMILDVLPIIPPEIRP
MVQLDGGRFATSDLNDLYRRVINRNNRLKRLLDLGAPGIIVQNEKRMLQEAVDALIDNGRRGRPVTGPGNRPLKSLSHML
KGKQGRFRQNLLGKRVDYSGRSVIAVGPSLKMYQCGLPKEMALELFKPFVMKELVQREIATNIKNAKSKIERMDDEVWDV
LEEVIREHPVLLNRAPTLHRLGIQAFEPTLVEGRAIRLHPLVTTAYNADFDGDQMAVHVPLSKEAQAEARMLMLAAQNIL
NPKDGKPVVTPSQDMVLGNYYLTLERKDAVNTGAIFNNTNEVLKAYANGFVHLHTRIGVHASSFNNPTFTEEQNKKILAT
SVGKIIFNEIIPDSFAYINEPTQENLERKTPNRYFIDPTTLGEGGLKEYFENEELIEPFNKKFLGNIIAEVFNRFSITDT
SMMLDRMKDLGFKFSSKAGITVGVADIVVLPDKQQILDEHEKLVDRITKQFNRGLITEEERYNAVVEIWTDAKDQIQGEL
MQSLDKTNPIFMMSDSGARGNASNFTQLAGMRGLMAAPSGKIIELPITSSFREGLTVLEYFISTHGARKGLADTALKTAD
SGYLTRRLVDVAQDVIVREEDCGTDRGLLVSDIKEGTEMIEPFIERIEGRYSKETIRHPETDEIIIRPDELITPEIAKKI
TDAGIEQMYIRSAFTCNARHGVCEKCYGKNLATGEKVEVGEAVGTIAAQSIGEPGTQLTMRTFHTGGVAGSDITQGLPRI
QEIFEARNPKGQAVITEIEGVVEDIKLAKDRQQEIVVKGANETRSYLASGTSRIIVEIGQPVQRGEVLTEGSIEPKNYLS
VAGLNATESYLLKEVQKVYRMQGVEIDDKHVEVMVRQMLRKVRIIEAGDTKLLPGSLVDIHNFTDANREAFKHRKRPATA
KPVLLGITKASLETESFLSAASFQETTRVLTDAAIKGKRDDLLGLKENVIIGKLIPAGTGMRRYSDVKYEKTAKPVAEVE
SQTEVTE
>Q8CJT1 2.7.7.6~~~rpoC~~~DNA-directed RNA polymerase subunit beta'~~~COG0086
MLDVNFFDELRIGLATADDIRQWSHGEVKKPETINYRTLKPEKDGLFCEKIFGPTRDWECYCGKYKRVRFKGIICERCGV
EVTRAKVRRERMGHIELAAPVTHIWYFKGVPSRLGYLLDLAPKDLEKVIYFAAYMITFVDEERRTRDLPSLEAHVSVERQ
QIEQRRDSDLEARAKKLETDLAELEAEGAKADVRRKVREGAEREMKQLRDRAQREIDRLDEVWNRFKNLKVQDLEGDELL
YRELRDRFGTYFDGSMGAAALQKRLESFDLDEEAERLREIIRTGKGQKKTRALKRLKVVSAFLQTSNSPKGMVLDCVPVI
PPDLRPMVQLDGGRFATSDLNDLYRRVINRNNRLKRLLDLGAPEIIVNNEKRMLQEAVDALFDNGRRGRPVTGPGNRPLK
SLSDMLKGKQGRFRQNLLGKRVDYSARSVIVVGPQLKLHQCGLPKAMALELFKPFVMKRLVDLNHAQNIKSAKRMVERGR
TVVYDVLEEVIAEHPVLLNRAPTLHRLGIQAFEPQLVEGKAIQIHPLVCTAFNADFDGDQMAVHLPLSAEAQAEARILML
SSNNILKPADGRPVTMPTQDMVLGLFFLTTDSEGRSPKGEGRAFGSSAEAIMAFDAGDLTLQAKIDIRFPVGTIPPRGFE
PPAREEGEPEWQQGDTFTLKTTLGRALFNELLPEDYPFVDYEVGKKQLSEIVNDLAERYPKVIVAATLDNLKAAGFFWAT
RSGVTVAISDIVVPDAKKEIVKGYEGQDEKVQKQYERGLITKEERTQELIAIWTKATNEVAEAMNDNFPKTNPVSMMVNS
GARGNMMQMRQIAGMRGLVSNAKNETIPRPIKASFREGLSVLEYFISTHGARKGLADTALRTADSGYLTRRLVDVSQDVI
IREEDCGTERGLKLPIATRDADGTLRKAEDVETSVYARMLAEDVVIDGKVIAPANVDLGDVLIDALVAHGVEEVKTRSIL
TCESQVGTCAMCYGRSLATGKLVDIGEAVGIIAAQSIGEPGTQLTMRTFHTGGVAGDDITQGLPRVVELFEARTPKGVAP
ISEASGRVRIEETEKTKKIVVTPDDGSDETAFPISKRARLLVGEGDHVEVGQKLTVGATNPHDVLRILGQRAVQVHLVGE
VQKVYNSQGVSIHDKHIEIIIRQMLRRVTIIESGDAELLPGELVERTKFETENRRVVQEGGHPASGRPQLMGITKASLAT
ESWLSAASFQETTRVLTDAAINAKSDSLIGLKENVIIGKLIPAGTGLSRYRNIRVEPTEEAKAAMYSAVGYDDIDYSPFG
TGSGQAVPLEDYDYGPYNQ
>Q9KWU6 2.7.7.6~~~rpoC~~~DNA-directed RNA polymerase subunit beta'~~~
MKKEVRKVRIALASPEKIRSWSYGEVEKPETINYRTLKPERDGLFDERIFGPIKDYECACGKYKRQRFEGKVCERCGVEV
TRSIVRRYRMGHIELATPAAHIWFVKDVPSKIGTLLDLSATELEQVLYFNKYIVLDPKGAVLDGVPVEKRQLLTDEEYRE
LRYGKQETYPLPAGVDALVKDGEEVVKGQELAPGVVSRMDGVALYRFPRRVRVDYLRKERAALRIPLSAWVEKEAYRPGE
VLAELSEPYLFRAEESGVVELKDLAEGHLIYLRQEEEVVARYFLPAGMTPLVVEGEIVEVGQPLAEGKGLLRLPRHMTAK
EVEAEEEGDSVHLTLFLEWTEPKDYKVAPHMNVIVPEGAKVQAGEKIVAAIDPEEEVIAEAEGVVHLHEPASILVVKARV
YPFEDDVEVTTGDRVAPGDVLADGGKVKSEIYGRVEVDLVRNVVRVVESYDIDARMGAEAIQELLKELDLEKLERELLEE
MKHPSRARRAKARKRLEVVRAFLDSGNRPEWMILEAVPVLPPDLRPMVQVDGGRFATSDLNDLYRRLINRNNRLKKLLAQ
GAPEIIIRNEKRMLQEAVDAVIDNGRRGSPVTNPGSERPLRSLTDILSGKQGRFRQNLLGKRVDYSGRSVIVVGPQLKLH
QCGLPKRMALELFKPFLLKKMEEKAFAPNVKAARRMLERQRDIKDEVWDALEEVIHGKVVLLNRAPTLHRLGIQAFQPVL
VEGQSIQLHPLVCEAFNADFDGDQMAVHVPLSSFAQAEARIQMLSAHNLLSPASGEPLAKPSRDIILGLYYITQVRKEKK
GAGMAFATPEEALAAYERGEVALNAPIVVAGRETSVGRLKFVFANPDEALLAVAHGLLDLQDVVTVRYLGRRLETSPGRI
LFARIVGEAVGDEKVAQELIQMDVPQEKNSLKDLVYQAFLRLGMEKTARLLDALKYYGFTLSTTSGITIGIDDAVIPEEK
QRYLEEADRKLRQIEQAYEMGFLTDRERYDQVIQLWTETTEKVTQAVFKNFEENYPFNPLYVMAQSGARGNPQQIRQLCG
MRGLMQKPSGETFEVPVRSSFREGLTVLEYFISSHGARKGGADTALRTADSGYLTRKLVDVAHEIVVREADCGTTNYISV
PLFQMDEVTRTLRLRKRSDIESGLYGRVLAREVEALGRRLEEGRYLSLEDVHFLIKAAEAGEVREVPVRSPLTCQTRYGV
CQKCYGYDLSMARPVSIGEAVGVVAAESIGEPGTQLTMRTFHTGGVAVGTDITQGLPRVIELFEARRPKAKAVISEIDGV
VRIEEGEDRLSVFVESEGFSKEYKLPKDARLLVKDGDYVEAGQPLTRGAIDPHQLLEAKGPEAVERYLVDEIQKVYRAQG
VKLHDKHIEIVVRQMLKYVEVTDPGDSRLLEGQVLEKWDVEALNERLIAEGKVPVAWKPLLMGVTKSALSTKSWLSAASF
QNTTHVLTEAAIAGKKDELIGLKENVILGRLIPAGTGSDFVRFTQVVDQRTLKAIEEARKEAVEAKEKEAPRRPVRREQP
GKGL
>Q8RQE8 2.7.7.6~~~rpoC~~~DNA-directed RNA polymerase subunit beta'~~~COG0086
MKKEVRKVRIALASPEKIRSWSYGEVEKPETINYRTLKPERDGLFDERIFGPIKDYECACGKYKRQRFEGKVCERCGVEV
TKSIVRRYRMGHIELATPAAHIWFVKDVPSKIGTLLDLSATELEQVLYFSKYIVLDPKGAILNGVPVEKRQLLTDEEYRE
LRYGKQETYPLPPGVDALVKDGEEVVKGQELAPGVVSRLDGVALYRFPRRVRVEYVKKERAGLRLPLAAWVEKEAYKPGE
ILAELPEPYLFRAEEEGVVELKELEEGAFLVLRREDEPVATYFLPVGMTPLVVHGEIVEKGQPLAEAKGLLRMPRQVRAA
QVEAEEEGETVYLTLFLEWTEPKDYRVQPHMNVVVPEGARVEAGDKIVAAIDPEEEVIAEAEGVVHLHEPASILVVKARV
YPFEDDVEVSTGDRVAPGDVLADGGKVKSDVYGRVEVDLVRNVVRVVESYDIDARMGAEAIQQLLKELDLEALEKELLEE
MKHPSRARRAKARKRLEVVRAFLDSGNRPEWMILEAVPVLPPDLRPMVQVDGGRFATSDLNDLYRRLINRNNRLKKLLAQ
GAPEIIIRNEKRMLQEAVDALLDNGRRGAPVTNPGSDRPLRSLTDILSGKQGRFRQNLLGKRVDYSGRSVIVVGPQLKLH
QCGLPKRMALELFKPFLLKKMEEKGIAPNVKAARRMLERQRDIKDEVWDALEEVIHGKVVLLNRAPTLHRLGIQAFQPVL
VEGQSIQLHPLVCEAFNADFDGDQMAVHVPLSSFAQAEARIQMLSAHNLLSPASGEPLAKPSRDIILGLYYITQVRKEKK
GAGLEFATPEEALAAHERGEVALNAPIKVAGRETSVGRLKYVFANPDEALLAVAHGIVDLQDVVTVRYMGKRLETSPGRI
LFARIVAEAVEDEKVAWELIQLDVPQEKNSLKDLVYQAFLRLGMEKTARLLDALKYYGFTFSTTSGITIGIDDAVIPEEK
KQYLEEADRKLLQIEQAYEMGFLTDRERYDQILQLWTETTEKVTQAVFKNFEENYPFNPLYVMAQSGARGNPQQIRQLCG
LRGLMQKPSGETFEVPVRSSFREGLTVLEYFISSHGARKGGADTALRTADSGYLTRKLVDVTHEIVVREADCGTTNYISV
PLFQPDEVTRSLRLRKRADIEAGLYGRVLAREVEVLGVRLEEGRYLSMDDVHLLIKAAEAGEIQEVPVRSPLTCQTRYGV
CQKCYGYDLSMARPVSIGEAVGIVAAQSIGEPGTQLTMRTFHTGGVAGAADITQGLPRVIELFEARRPKAKAVISEIDGV
VRIEETEEKLSVFVESEGFSKEYKLPKEARLLVKDGDYVEAGQPLTRGAIDPHQLLEAKGPEAVERYLVEEIQKVYRAQG
VKLHDKHIEIVVRQMMKYVEVTDPGDSRLLEGQVLEKWDVEALNERLIAEGKTPVAWKPLLMGVTKSALSTKSWLSAASF
QNTTHVLTEAAIAGKKDELIGLKENVILGRLIPAGTGSDFVRFTQVVDQKTLKAIEEARKEAVEAKERPAARRGVKREQP
GKQA
>B2SQQ2 2.7.7.6~~~rpoC~~~DNA-directed RNA polymerase subunit beta'~~~COG0086
MKDLLNLFNQQRQTLDFDAIKIALASPDLIRSWSYGEVKKPETINYRTFKPERDGLFCAAIFGPIKDYECLCGKYKRMKH
RGVVCEKCGTEVTLAKVRRERMGHIDLASPVAHIWFLKSLPSRIGLMLDMTLRDIERVLYFEAYVVTEPGLTPLERRQLL
TEEQYLTARQEYNDDFDAAMGAEAVYELLRTIDLQSEMTRLREEIASTGSETKLKRLTKRIKLIEAFLESGNRPEWMVMT
VLPVLPPDLRPLVPLDGGRFATSDLNDLYRRVINRNNRLRRLLELNAPDIIVRNEKRMLQESVDALLDNGRRGRAITGTN
KRPLKSLADMIKGKQGRFRQNLLGKRVDYSGRSVITVGPYLKLHQCGLPKKMALELFKPFVFAKLQRRGLATTIKAAKKL
VEREEAEVWDILEEVIREHPVLLNRAPTLHRLGIQAFEPVLIEGKAIQLHPLVCTAFNADFDGDQMAVHVPLSLEAQLEA
RALMMSTNNILSPANGEPIIVPSQDVVLGLYYMSRALENKKGEGMVFANTSEVKRAYDNRVVELHAKVKVRITQVDVDAV
DGKRTSGTSIVDTTVGRALLSEILPEGLPFQLANTEMTKKNISRLINSSYRLLGLKDTVVFADKLMYTGYAYATRAGVSI
GIDDMLIPDEKKGILTEAEAEVLEIQEQYQSGLVTAGERYNKVVDIWSRTSERIAKAMMDTIGTEKVENAKGETIDQKSM
NSLYIMADSGARGSQAQIRQLAGMRGLMARPDGSIIETPIKANFREGLNVQEYFNSTHGARKGLADTALKTANSGYLTRR
LVDVAQDVVITEIDCGTTEGLIMTPIVEGGDVVEPLKERVLGRVVAEDVYLPGNDEEPIVTRNTLLDEAWVAKLEDASVQ
SVKVRSTISCESSFGVCARCYGRDLARGHQVNIGEAVGVIAAQSIGEPGTQLTMRTFHIGGAASRAAAVDNITVKTTGSV
KFNNLKSVAHASGSLVAVSRSGELSVLDGHGRERERYKLPYGATITAKDGDAVKAGQSVANWDPHNHPIVSEVAGFIRFI
DFVDGVTVIEKTDELTGLASREITDPKRRGAHAKELRPIVRIVDGKGNDLTIPNTDLPAQYLLPPRSIVNLQDGAAVGVG
DVVAKIPQEASKTRDITGGLPRVADLFEARKPKDPAILAERSGIISFGKDTKGKQRLIIKDTDGSEHEELIPKYRQIIVF
EGEHVTKGETVVDGEPSPQDILRLLGVEPLAAYLVKEIQDVYRLQGVKINDKHIEVITRQMLRKVEIVDQGNSKFLNGEQ
VERQRVIEENARLVKRNELPAKYDPVLLGITKASLATESFISAASFQETTRVLTEAAVRGTRDNLRGLKENVIVGRLIPA
GTGLAYHAGRRKASGLTDSEMETLSGKPAGAEPVAALADAGADEE
>Q8KTH8 2.7.7.6~~~rpoC~~~DNA-directed RNA polymerase subunit beta'~~~
MKDLLNLFNQQRQTLDFDAIKIALASPDLIRSWSYGEVKKPETINYRTFKPERDGLFCAAIFGPIKDYECLCGKYKRMKH
RGVVCEKCGTEVTLAKVRRERMGHIDLASPVAHIWFLKSLPSRIGLMLDMTLRDIERVLYFEAYVVTEPGLTPLERRQLL
TEEQYLTARQEYNDDFDAAMGAEAVYELLRTIDLQSEMTRLREEIASTGSETKLKRLTKRIKLIEAFLESGNRPEWMVMT
VLPVLPPDLRPLVPLDGGRFATSDLNDLYRRVINRNNRLRRLLELNAPDIIVRNEKRMLQESVDALLDNGRRGRAITGTN
KRPLKSLADMIKGKQGRFRQNLLGKRVDYSGRSVITVGPYLKLHQCGLPKKMALELFKPFVFAKLQRRGLATTIKAAKKL
VEREEAEVWDILEEVIREHPVLLNRAPTLHRLGIQAFEPVLIEGKAIQLHPLVCTAFNADFDGDQMAVHVPLSLEAQLEA
RALMMSTNNILSPANGEPIIVPSQDVVLGLYYMSRALENKKGEGMVFANTSEVKRAYDNRVVELHAKVKVRITQVDVDAV
DGKRTSGTSIVDTTVGRALLSEILPEGLPFQLANTEMTKKNISRLINSSYRLLGLKDTVVFADKLMYTGYAYATRAGVSI
GIDDMLIPDEKKGILTEAEAEVLEIQEQYQSGLVTAGERYNKVVDIWSRTSERIAKAMMDTIGTEKVENAKGETIDQKSM
NSLYIMADSGARGSQAQIRQLAGMRGLMARPDGSIIETPIKANFREGLNVQEYFNSTHGARKGLADTALKTANSGYLTRR
LVDVAQDVVITEIDCGTTEGLIMTPIVEGGDVVEPLKERVLGRVVAEDVYLPGNDEEPIVTRNTLLDEAWVAKLEDASVQ
SVKVRSTISCESSFGVCARCYGRDLARGHQVNIGEAVGVIAAQSIGEPGTQLTMRTFHIGGAASRAAAVDNITVKTTGSV
KFNNLKSVAHASGSLVAVSRSGELSVLDGHGRERERYKLPYGATITAKDGDAVKAGQSVANWDPHNHPIVSEVAGFIRFI
DFVDGVTVIEKTDELTGLASREITDPKRRGAHAKELRPIVRIVDGKGNDLTIPNTDLPAQYLLPPRSIVNLQDGAAVGVG
DVVAKIPQEASKTRDITGGLPRVADLFEARKPKDPAILAERSGIISFGKDTKGKQRLIIKDTDGSEHEELIPKYRQIIVF
EGEHVTKGETVVDGEPSPQDILRLLGVEPLAAYLVKEIQDVYRLQGVKINDKHIEVITRQMLRKVEIVDQGNSKFLNGEQ
VERQRVIEENARLVKRNELPAKYDPVLLGITKASLATESFISAASFQETTRVLTEAAVRGTRDNLRGLKENVIVGRLIPA
GTGLAYHAGRRKASGLTDSEMETLSGKPAGAEPVAALADAGADEE
>Q83BB6 ~~~rpoD~~~RNA polymerase sigma factor RpoD~~~COG0568
MPSRKKKSSVTKTKKKTTGKKARVTAKAKPSVKSKLKTVKRAPAKPKSVTVAKTKSKKEKAKVVSQPLPKTAEKKPSAPP
SETQQPKIGENAAESQRLGFGALLEEAKNKGYLVHEDLINLLPNDYADPSQMEGIIGRLTEMGIKVFEIPPDADSLLLEE
DTQSDDDEINEDVAEVLATETRTTDPVRMYMREMGSVELLTREGEIVIAKRIEEGIRQVMGAVVQYPELIEKFIKEYDTI
VATEGRLSDLLIGFFDEEESTASPVAPEARDKEAETSDEDDGGEGGSDEFGESGPDPLITQQHMEELKKLYGRYQNAIKR
YGKKAPATLKHQKALAELFSTFKLSIKQFNRLTGQLRRMLRAARAQERLIMSYCVVKAKTPRKKFIASFPHNETNMKWLE
QYKNENKEQAKRLEKYSKEIYRAQRNLAQLEEQNNLNIQEIKEINRRVAIGEAKSRRAKKEMIEANLRLVISIAKKYTNR
GLQFLDLIQEGNIGLMKAVDKFEYRRGYKFSTYATWWIRQAITRSIADQARTIRIPVHMIETINKLNRISRQILQETGIE
ATPEELGRRMDMPEEKVRKVLKIAKEPISMETPIGEDEDSNLGDFIEDINMESPVDFATSAGLMEATREILSTLTPREAK
VLRMRFGIDMNTDHTLEEVGKQFDVTRERIRQIEAKALRKLRHPSRAEKLHSFIEVEE
>P00579 ~~~rpoD~~~RNA polymerase sigma factor RpoD~~~COG0568
MEQNPQSQLKLLVTRGKEQGYLTYAEVNDHLPEDIVDSDQIEDIIQMINDMGIQVMEEAPDADDLMLAENTADEDAAEAA
AQVLSSVESEIGRTTDPVRMYMREMGTVELLTREGEIDIAKRIEDGINQVQCSVAEYPEAITYLLEQYDRVEAEEARLSD
LITGFVDPNAEEDLAPTATHVGSELSQEDLDDDEDEDEEDGDDDSADDDNSIDPELAREKFAELRAQYVVTRDTIKAKGR
SHATAQEEILKLSEVFKQFRLVPKQFDYLVNSMRVMMDRVRTQERLIMKLCVEQCKMPKKNFITLFTGNETSDTWFNAAI
AMNKPWSEKLHDVSEEVHRALQKLQQIEEETGLTIEQVKDINRRMSIGEAKARRAKKEMVEANLRLVISIAKKYTNRGLQ
FLDLIQEGNIGLMKAVDKFEYRRGYKFSTYATWWIRQAITRSIADQARTIRIPVHMIETINKLNRISRQMLQEMGREPTP
EELAERMLMPEDKIRKVLKIAKEPISMETPIGDDEDSHLGDFIEDTTLELPLDSATTESLRAATHDVLAGLTAREAKVLR
MRFGIDMNTDYTLEEVGKQFDVTRERIRQIEAKALRKLRHPSRSEVLRSFLDD
>P55993 ~~~rpoD~~~RNA polymerase sigma factor RpoD~~~COG0568
MKKKANEEKAQKRAKTEAKAEATQENKTKENNKAKESKIKESKIKEAKAKEPIPVKKLSFNEALEELFANSLSDCVSYES
IIQISAKVPTLAQIKKIKELCQKYQKKLVSSSEYAKKLNAIDKIKKTEEKQKVLDEELEDGYDFLKEKDFLEWSRSDSPV
RMYLREMGDIKLLSKDEEIELSKQIRLGEDIILDAICSVPYLIDFIYAYKDALINRERRVKELFRSFDDDDENSVSDSKK
DEDNEEDEENEERKKVVSEKDKKRVEKVQESFKALDKAKKEWLKALEAPIDEREDELVRSLTLAYKRQTLKDRLYDLEPT
SKLINELVKTMETTLKSGDGFEKELKRLEYKLPLFNDTLIANHKKILANITNMTKEDIIAQVPEATMVSVYMDLKKLFLT
KEASEEGFDLAPNKLKEILEQIKRGKLISDRAKNKMAKSNLRLVVSIAKRFTSRGLPFLDLIQEGNIGLMKAVDKFEHEK
GFKFSTYATWWIKQAISRAIADQARTIRIPIHMIDTINRINKVMRKHIQENGKEPDLEVVAEEVGLSLDKVKNVIKVTKE
PISLETPVGNDDDGKFGDFVEDKNIVSSIDHIMREDLKAQIESVLDQLNEREKAVIRMRFGLLDDESDRTLEEIGKELNV
TRERVRQIESSAIKKLRSPQYGRILRNYLRI
>P26480 ~~~rpoD~~~RNA polymerase sigma factor RpoD~~~
MSGKAQQQSRLKELIARGREQGYLTYAEVNDHLPEDISDPEQVEDIIRMINDMGINVFETAPDADALLLAEADTDEAAAE
EAAAALAAVESDIGRTTDPVRMYMREMGTVELLTREGEIEIAKRIEEGIREVMSAIAQFPGTVDSILADYNRIVAEGGRL
SDVLSGYIDPDDGSLPAEEVEPVNLKDDSADSKEKDDEEEESDDSSDSDDEGDGGPDPEEARLRFTAVSEQLDKAKKALK
KHGRGSKQATAELTGLAELFMPIKLVPKQFDALVARVRSALEGVRAQERAIMQLCVRDARMPRADFLRLFPNHETDEKWV
DSVLKSKPKYAEAIERLRDDILRNQQKLAALESEVELTVAEIKEINRAMSIGEAKARRAKKEMVEANLRLVISIAKKYTN
RGLQFLDLIQEGNIGLMKAVDKFEYRRGYKFSTYATWWIRQAITRSIADQARTIRIPVHMIETINKLNRISRQMLQEMGR
EPTPEELGERMDMPEDKIRKVLKIAKEPISMETPIGDDEDSHLGDFIEDSTMQSPIEMATSESLKESTREVLAGLTAREA
KVLRMRFGIDMNTDHTLEEVGKQFDVTRERIRQIEAKALRKLRHPSRSEHLRSFLDE
>Q2K619 ~~~rpoD~~~RNA polymerase sigma factor RpoD~~~COG0568
MATKVKENEDAEVERDGASDGPLLDLSDDAVKKMIKAAKKRGYVTMDELNAVLPSEEVTSEQIEDTMSMLSDMGINVIED
EEAEEAGASGGGDDDDAGGDEDSEGGELAPSAGTALATAKKKEPTDRTDDPVRMYLREMGSVELLSREGEIAIAKRIEAG
RETMIAGLCESPLTFQALIIWRDELNEGTTLLREIIDLETTYSGPEAKAAPQFQSPEKIEADRKAAEEKEKTRRARSGDD
DITDVGGEGLPPEEEEEDEDESNLSLAAMEAELRPQVMETLDIIAETYKKLRKLQDQQVEQRLAATGTLSSAQERRYKEL
KDELIKAVKSLSLNQNRIDALVEQLYDINKRLVQNEGRLLRLAESYGVKRDSFLEQYQGAELDPNWMKSIGNLAARGWKE
FAKAENTTIRDIRQEIQNLATETGISISEFRRIVHMVQKGEREARIAKKEMVEANLRLVISIAKKYTNRGLQFLDLIQEG
NIGLMKAVDKFEYRRGYKFSTYATWWIRQAITRSIADQARTIRIPVHMIETINKIVRTSRQMLHEIGREPTPEELAEKLA
MPLEKVRKVLKIAKEPISLETPVGDEEDSHLGDFIEDKNALLPIDAAIQANLRETTTRVLASLTPREERVLRMRFGIGMN
TDHTLEEVGQQFSVTRERIRQIEAKALRKLKHPSRSRKLRSFLDS
>P0CZ15 ~~~rpoD~~~RNA polymerase sigma factor RpoD~~~
MAAKDIDDTKPDTAADEDASFDMSQAAVKRMIGEAKERGYITIDQLNAVMPPETVSGEQIEDVMSMLSEMGINVVEGEEV
EESEGGEVVETGSGSREIAVAGAAGETLDRTDDPVRMYLREMGSVELLSREGEIAIAKRIEAGRNTMIAGLCESPLTFQA
ITIWRDELLSEEILLRDVIDLEATFGRSLDGDEGLEGMEGIEGPVVEGLDLEAAEGAAPAARRPASDEPEYDADGNPISR
IDEEEDDDDSSNMSLAAMEAALKPKVLETLELIARDYAKLAEMQDLRMSATLNEDGTFTVAEEAAYQKLRSEIVLLVNEL
HLHNNRIEALIDQLYGINRKIMSIDSGMVKLADAARINRREFIDEYRGYELDPTWMDRMSAKPARAWVTLFEKSRHKVED
LRHEMAQVGQYVGVDISEFRRIVNQVQKGEKEARQAKKEMVEANLRLVISIAKKYTNRGLQFLDLIQEGNIGLMKAVDKF
EYRRGYKFSTYATWWIRQAITRSIADQARTIRIPVHMIETINKLVRTGRQMLHEIGREPTPEELAEKLQMPLEKVRKVMK
IAKEPISLETPIGHEEDSQLGDFIEDKNAILPLDSAIQENLKETTTRVLPSLTPREERVLRMRFGIGMNTDHTLEEVGQQ
FSVTRERIRQIEAKALRKLKHPSRSRKLPSFLDQ
>P12464 ~~~rpoE~~~DNA-directed RNA polymerase subunit delta~~~COG3343
MGIKQYSQEELKEMALVEIAHELFEEHKKPVPFQELLNEIASLLGVKKEELGDRIAQFYTDLNIDGRFLALSDQTWGLRS
WYPYDQLDEETQPTVKAKKKKAKKAVEEDLDLDEFEEIDEDDLDLDEVEEELDLEADDFDEEDLDEDDDDLEIEEDIIDE
DDEDYDDEEEEIK
>Q3IYV6 ~~~rpoE~~~ECF RNA polymerase sigma factor RpoE~~~COG1595
MTDKSDRTDWVALMRAIRDHRDEAAFAELFQHFAPKVKGFLMKSGSVASQAEECAQDVMATVWQKAHLFDPSRASVATWI
FTIARNRRIDGLRKDRQPEPEDLFWGPDSEPDQADVYEMQQENARLGRAIARLPEAQRALIERAFFGDLTHRELAAETGL
PLGTIKSRIRLALDRLRQHMS
>P0AGB6 ~~~rpoE~~~ECF RNA polymerase sigma-E factor~~~COG1595
MSEQLTDQVLVERVQKGDQKAFNLLVVRYQHKVASLVSRYVPSGDVPDVVQEAFIKAYRALDSFRGDSAFYTWLYRIAVN
TAKNYLVAQGRRPPSSDVDAIEAENFESGGALKEISNPENLMLSEELRQIVFRTIESLPEDLRMAITLRELDGLSYEEIA
AIMDCPVGTVRSRIFRAREAIDNKVQPLIRR
>D0ZSY9 ~~~rpoE~~~ECF RNA polymerase sigma-E factor~~~
MSEQLTDQVLVERVQKGDQKAFNLLVVRYQHKVASLVSRYVPSGDVPDVVQESFIKAYRALDSFRGDSAFYTWLYRIAVN
TAKNYLVAQGRRPPSSDVDAIEAENFESGGALKEISNPENLMLSEELRQIVFRTIESLPEDLRMAITLRELDGLSYEEIA
AIMDCPVGTVRSRIFRAREAIDNKVQPLIRR
>P66715 ~~~rpoE~~~Probable DNA-directed RNA polymerase subunit delta~~~
MKIQDYTKQMVDEKSFIDMAYTLLNDKGETMNLYDIIDEFRALGDYEYEEIENRVVQFYTDLNTDGRFLNVGENLWGLRD
WYSVDDIEEKIAPTIQKFDILDADDEEDQNLKLLGEDEMDDDDDIPAQTDDQEELNDPEDEQVEEEINHSDIVIEEDEDE
LDEDEEVFEDEEDFND
>P38133 ~~~sigE~~~RNA polymerase sigma-E factor~~~COG1595
MGEVLEFEEYVRTRQDALLRSARRLVPDPVDAQDLLQTALARTYGRWETIEDKRLADAYLRRVMINTRTEWWRARKLEEV
PTEQLPESPMDDATEQHADRALLMDVLKVLAPKQRSVVVLRHWEQMSTEETAAALGMSAGTVKSTLHRALARLREELVAR
DLDARALEREERERCAA
>Q5XA09 ~~~rpoE~~~Probable DNA-directed RNA polymerase subunit delta~~~
MKLDVFAGQEKSELSMIEVARAILEERGRDNEMYFSDLVNEIQNYLGKSDAGIRHALPFFYTDLNTDGSFIPLGENKWGL
RSWYAIDEIDEEIITLEEDEDGAQKRKKKRVNAFVDGDEDAIDYRDDDPEDEDFTEESAEVEYDEEDPDDEKSEVESYDS
ELNEIIPEDDFEEVDINEEDEEDEEDEEPVL
>P0CAW9 ~~~rpoH~~~RNA polymerase sigma factor RpoH~~~COG0568
MAVNSLSVMSPDGGLSRYLTEIRKFPMLSKDEEFMLAQRWKEHQDPQAAHKMVTSHLRLVAKIAMGYRGYGLPIGEVISE
GNVGLMQAVKKFEPEKGFRLATYAMWWIRASIQEYILRSWSLVKMGTTAAQKKLFFNLRKAKSQIAAFQEGDLHPDQVSQ
IATKLGVLDSEVISMNRRLSGPDASLNAPLRADGESEWQDWLADEEQVSQETRVAEDEEKSLRMSLLEEAMVELTDRERH
ILTERRLKDDPTTLEELAAQYGVSRERVRQIEVRAFEKLQKTMREAAIAKNMVDA
>P0AGB3 ~~~rpoH~~~RNA polymerase sigma factor RpoH~~~COG0568
MTDKMQSLALAPVGNLDSYIRAANAWPMLSADEERALAEKLHYHGDLEAAKTLILSHLRFVVHIARNYAGYGLPQADLIQ
EGNIGLMKAVRRFNPEVGVRLVSFAVHWIKAEIHEYVLRNWRIVKVATTKAQRKLFFNLRKTKQRLGWFNQDEVEMVARE
LGVTSKDVREMESRMAAQDMTFDLSSDDDSDSQPMAPVLYLQDKSSNFADGIEDDNWEEQAANRLTDAMQGLDERSQDII
RARWLDEDNKSTLQELADRYGVSAERVRQLEKNAMKKLRAAIEA
>Q9KI19 ~~~rpoS~~~RNA polymerase sigma factor RpoS~~~COG0568
MKTKTTKKTIKKAAKKIKKPSKRKIKKTAKKSRPKKPIKASDHGLIFAKTKKETTEKEDAELANAKAKTKKRRETRSSDP
TQIYLRELGFQPLLNAKEELKIARRVHKGDPKARKQMIEANLRLVVKIARHYVNRGLPFLDLIEEGNLGLLTAVEKFDPE
RGFRFSTYATWWIRQTIERAIMNQSRTVRLPIHVIKELNVYLRTAKKLTQEVDHEATPEDVAHLIDKPVQEIRRIMDLAP
SATSIDVPISEDGQKSLVDTLADDNNIDPARLIQNVDLQDHIERWLAQLDERHREVVILRFGLHENEKGTLEAVGKAVGL
TRERVRQIQIDALQQLRHILEMEGVTGEEVED
>P13445 ~~~rpoS~~~RNA polymerase sigma factor RpoS~~~COG0568
MSQNTLKVHDLNEDAEFDENGVEVFDEKALVEQEPSDNDLAEEELLSQGATQRVLDATQLYLGEIGYSPLLTAEEEVYFA
RRALRGDVASRRRMIESNLRLVVKIARRYGNRGLALLDLIEEGNLGLIRAVEKFDPERGFRFSTYATWWIRQTIERAIMN
QTRTIRLPIHIVKELNVYLRTARELSHKLDHEPSAEEIAEQLDKPVDDVSRMLRLNERITSVDTPLGGDSEKALLDILAD
EKENGPEDTTQDDDMKQSIVKWLFELNAKQREVLARRFGLLGYEAATLEDVGREIGLTRERVRQIQVEGLRRLREILQTQ
GLNIEALFRE
>P45684 ~~~rpoS~~~RNA polymerase sigma factor RpoS~~~
MALKKEGPEFDHDDEVLLLEPGIMLDESSADEQPSPRATPKATTSFSSKQHKHIDYTRALDATQLYLNEIGFSPLLTPEE
EVHFARLAQKGDPAGRKRMIESNLRLVVKIARRYVNRGLSLLDLIEEGNLGLIRAVEKFDPERGFRFSTYATWWIRQTIE
RAIMNQTRTIRLPIHVVKELNVYLRAARELTHKLDHEPSPEEIANLLEKPVAEVKRMLGLNERVTSVDVSLGPDSDKTLL
DTLTDDRPTDPCELLQDDDLSESIDQWLTELTDKQREVVIRRFGLRGHESSTLEEVGQEIGLTRERVRQIQVEALKRLRE
ILEKNGLSSDALFQ
>D0ZVL4 ~~~rpoS~~~RNA polymerase sigma factor RpoS~~~
MSQNTLKVHDLNEDAEFDENGVEAFDEKALSEEEPSDNDLAEEELLSQGATQRVLDATQLYLGEIGYSPLLTAEEEVYFA
RRALRGDVASRRRMIESNLRLVVKIARRYGNRGLALLDLIEEGNLGLIRAVEKFDPERGFRFSTYATWWIRQTIERAIMN
QTRTIRLPIHIVKELNVYLRTARELSHKLDHEPSAEEIAEQLDKPVDDVSRMLRLNERITSVDTPLGGDSEKALLDILAD
EKENGPEDTTQDDDMKQSIVKWLFELNAKQREVLARRFGLLGYEAATLEDVGREIGLTRERVRQIQVEGLRRLREILQTQ
GLNIEALFRE
>O31718 2.7.7.6~~~rpoY~~~DNA-directed RNA polymerase subunit epsilon~~~COG5503
MIYKVFYQEKADEVPVREKTDSLYIEGVSERDVRTKLKEKKFNIEFITPVDGAFLEYEQQSENFKVLEL
>A0A0K2H5X8 2.7.7.6~~~rpoY~~~DNA-directed RNA polymerase subunit epsilon~~~
MIFKVFYQENADEVPVREKTKTLYIEAESERDVRRKLEGRPINIEYIQPLEGAHLEYEKKSPNFQVLEISS
>Q5XA23 2.7.7.6~~~rpoY~~~DNA-directed RNA polymerase subunit epsilon~~~
MIYKVFYQETKDQSPRRESTKALYLNIDATDELDGRIKARRLVEDNTYYNVEFIELLSDKHLDYEKETGVFELTEF
>Q97T34 2.7.7.6~~~rpoY~~~DNA-directed RNA polymerase subunit epsilon~~~COG5503
MIYKVFYQETKERSPRRETTRTLYLDIDASSELEGRITARQLVEENRPEYNIEYIELLSDKLLDYEKETGAFEITEF
>O35011 2.7.7.6~~~rpoZ~~~DNA-directed RNA polymerase subunit omega~~~COG1758
MLDPSIDSLMNKLDSKYTLVTVSARRAREMQIKKDQMIEHTISHKYVGKALEEIDAGLLSFEKEDRE
>B8H618 2.7.7.6~~~rpoZ~~~DNA-directed RNA polymerase subunit omega~~~
MARVTVEDCVEKVPNRFALVLLSAHRARGISAGAALMVDRDNDKNPVVALREIADDVIDHEGLKEHLISTLQRVDEHTEA
EEEAETLALLADPSHMQMSELELVRALQSDRDGGQEERY
>Q182S6 2.7.7.6~~~rpoZ~~~DNA-directed RNA polymerase subunit omega~~~COG1758
MLKPSINEVLEKIDNRYYLVGTVSKRARKLIDGEEPYVSNKTKEKPVCVATKEVASGKITYRLLTEEEIEIEEARHHAEQ
HQQISEEE
>A7ZTK1 2.7.7.6~~~rpoZ~~~DNA-directed RNA polymerase subunit omega~~~
MARVTVQDAVEKIGNRFDLVLVAARRARQMQVGGKDPLVPEENDKTTVIALREIEEGLINNQILDVRERQEQQEQEAAEL
QAVTAIAEGRR
>P0A800 2.7.7.6~~~rpoZ~~~DNA-directed RNA polymerase subunit omega~~~COG1758
MARVTVQDAVEKIGNRFDLVLVAARRARQMQVGGKDPLVPEENDKTTVIALREIEEGLINNQILDVRERQEQQEQEAAEL
QAVTAIAEGRR
>Q2A273 2.7.7.6~~~rpoZ~~~DNA-directed RNA polymerase subunit omega~~~
MARVTVEDCLDKVETRFDLVVLASMRANKILKNGYSESMENEKKEKATVVALREIAESEITSEQILRNEIEG
>A0QWT1 2.7.7.6~~~rpoZ~~~DNA-directed RNA polymerase subunit omega~~~COG1758
MSTPHADAQLNAADDLGIDSSAASAYDTPLGITNPPIDELLSRASSKYALVIYAAKRARQINDYYNQLGDGILEYVGPLV
EPGLQEKPLSIALREIHGDLLEHTEGE
>P9WGY5 2.7.7.6~~~rpoZ~~~DNA-directed RNA polymerase subunit omega~~~COG1758
MSISQSDASLAAVPAVDQFDPSSGASGGYDTPLGITNPPIDELLDRVSSKYALVIYAAKRARQINDYYNQLGEGILEYVG
PLVEPGLQEKPLSIALREIHADLLEHTEGE
>Q02E25 2.7.7.6~~~rpoZ~~~DNA-directed RNA polymerase subunit omega~~~
MARVTVEDCLDNVDNRFELVMLATKRARQLATGGKEPKVAWENDKPTVVALREIASGLVDENVVQQEDIVEDEPLFAAFD
DEANTEAL
>Q9HTM1 2.7.7.6~~~rpoZ~~~DNA-directed RNA polymerase subunit omega~~~
MARVTVEDCLDNVDNRFELVMLATKRARQLATGGKEPKVAWENDKPTVVALREIASGLVDENVVQQEDIVEDEPLFAAFD
DEANTEAL
>P66726 2.7.7.6~~~rpoZ~~~DNA-directed RNA polymerase subunit omega~~~
MLNPPLNQLTSQIKSKYLIATTAAKRAREIDEQPETELLSEYHSFKPVGRALEEIADGKIRPVISSDYYGKE
>Q9KXS1 2.7.7.6~~~rpoZ~~~DNA-directed RNA polymerase subunit omega~~~COG1758
MSSSISAPEGIINPPIDELLEATDSKYSLVIYAAKRARQINAYYSQLGEGLLEYVGPLVDTHVHEKPLSIALREINAGLL
TSEAIEGPAQ
>Q5XAP2 2.7.7.6~~~rpoZ~~~DNA-directed RNA polymerase subunit omega~~~
MLKPSIDTLLDKVPSKYSLVILQAKRAHELEAGATPTQEFKSVKSTLQALEEIESGNVVIHPDPSAKREAVRAKIEAERL
AKEEEERKIKEQIAKEKEEEGEKI
>P74352 2.7.7.6~~~rpoZ~~~DNA-directed RNA polymerase subunit omega~~~
MTKRSNLDSNHIIYRSEELLGAASNRYNITVRVAKRAKENRSEDFDSIDDPNMKPAIRAIIEMSDELTRPEIISDN
>Q9EVV4 2.7.7.6~~~rpoZ~~~DNA-directed RNA polymerase subunit omega~~~
MAEPGIDKLFGMVDSKYRLTVVVAKRAQQLLRHRFKNTVLEPEERPKMRTLEGLYDDPNAVTWAMKELLTGRLFFGENLV
PEDRLQKEMERLYPTEEEA
>Q8RQE7 2.7.7.6~~~rpoZ~~~DNA-directed RNA polymerase subunit omega~~~COG1758
MAEPGIDKLFGMVDSKYRLTVVVAKRAQQLLRHGFKNTVLEPEERPKMQTLEGLFDDPNAVTWAMKELLTGRLVFGENLV
PEDRLQKEMERLYPVEREE
>Q5H3R9 2.7.7.6~~~rpoZ~~~DNA-directed RNA polymerase subunit omega~~~
MARITVEDCLEVVNNRFELVMMASKRARQLANGVQPLIENAAASDKPTVMALREIAARRIDNALIDEVEKAERERAEREA
LEWAAAEVVADEDMSKNDD
>B0FYK7 2.3.1.233~~~rppA~~~1,3,6,8-tetrahydroxynaphthalene synthase~~~
MRVPVAVDDLVAPSTMGERHTVIDRGTSVAAVHTALPPHRYAQSDLTELIADLCLEPGADRALLRRLHTSAGVRTRHLAL
PIEQYAGLGDFGQANAAWLTVGLALAEEALSGALDAAGLTAADIDLLVCTSITGVAAPSLDARLAVRMGMRADVKRVPVF
GLGCVGGAAGLGRLHDYLLGHPDDTAVLLSVELCSLTLQRDGSLANLVAGALFGDGAAAVVARGGDAGRRGAGWPMVAAT
RGHLYPDTEHLLGWRIGASGFRVVVDAGIPDVVRTHLGGDLRNFLATHGLVPDDIGTWICHPGGPKVLAAVGDALELPDG
ALDSSWRSLAGVGNLSSASVLRVLEDVATRCRPDPGTWGVLLAMGPGFCAEFVLLRW
>Q9FCA7 2.3.1.233~~~~~~1,3,6,8-tetrahydroxynaphthalene synthase~~~COG3424
MATLCRPSVSVPEHVITMEETLELARRRHTDHPQLPLALRLIENTGVRTRHIVQPIEDTLEHPGFEDRNKVYEREAKSRV
PAVIQRALDDAELLATDIDVIIYVSCTGFMMPSLTAWLINEMGFDSTTRQIPIAQLGCAAGGAAINRAHDFCTAYPEANA
LIVACEFCSLCYQPTDLGVGSLLCNGLFGDGIAAAVVRGRGGTGVRLERNGSYLIPKTEDWIMYDVKATGFHFLLDKRVP
ATMEPLAPALKELAGEHGWDASDLDFYIVHAGGPRILDDLSTFLEVDPHAFRFSRATLTEYGNIASAVVLDALRRLFDEG
GVEEGARGLLAGFGPGITAEMSLGCWQTADVRRGIRQDVTRTAARGVSRRVRQA
>Q54240 2.3.1.233~~~rppA~~~1,3,6,8-tetrahydroxynaphthalene synthase~~~
MATLCRPAIAVPEHVITMQQTLDLARETHAGHPQRDLVLRLIQNTGVQTRHLVQPIEKTLAHPGFEVRNQVYEAEAKTRV
PEVVRRALANAETEPSEIDLIVYVSCTGFMMPSLTAWIINSMGFRPETRQLPIAQLGCAAGGAAINRAHDFCVAYPDSNV
LIVSCEFCSLCYQPTDIGVGSLLSNGLFGDALSAAVVRGQGGTGMRLERNGSHLVPDTEDWISYAVRDTGFHFQLDKRVP
GTMEMLAPVLLDLVDLHGWSVPNMDFFIVHAGGPRILDDLCHFLDLPPEMFRYSRATLTERGNIASSVVFDALARLFDDG
GAAESAQGLIAGFGPGITAEVAVGSWAKEGLGADVGRDLDELELTAGVALSG
>Q55933 ~~~rppA~~~Response regulator RppA~~~COG0745
MRILLVEDETDLGMAIKKVLVSEKYVVDWVTDGSQAWDYLENQWTEYTLAIVDWLLPGLSGLELCQKLRTQGNSLPVLML
TALGEPENRVEGLDAGADDYLTKPFVMAELLARLRALQRRSPQFQPQILTLGNFSLDPSNNLLSVTISEPLNLERQEIAL
TVREFQIFQYLMQNPERIISGSKIRQQLWDLDEEPMSNVVAAQMRLIRRKLAQQNCPCPIKTVPGQGYRFTLSP
>Q55932 2.7.13.3~~~rppB~~~Sensor histidine kinase RppB~~~COG5002
MNTRRLFARSRLQLAFWYALVMGGILTLLGLGVYRAIVQANWMALEREVESIAGTLHDSLEPMLPSNASPTGVLQKMLPD
LCLVNQPCQVNPTLIERHTLGISDRSLYYIRLFDYQGNLLAFSPNQPASLSSIFNQETWQTIHPPTGDRYRQFTTILHSA
GNTDKSSWGYLQIGRSLAAFDAENKRILWILGLSFPIALGLVAFSSWGLAGLAMRPIYQSYQQQQQFTANAAHELRSPLA
SLLATVEAVLRIDSSHSPEINTMLHTVERQGRRLSQLITDLLLLSRLEQETTAEDWRLCCLNDLVSDLTEEFLELAIAAH
IDLSSDLSSGEVYAWGNESQLYRLVSNLIANAIQYTTAGGRVDITLTSHEQMAIITVQDTGIGIAPDQQEHIFERFYRVN
RDRSRKTGGTGLGLAIAQVITVKHRGSLTVESALGKGSLFTIQLPIFSVPIVHS
>P35640 3.6.1.-~~~rppH~~~RNA pyrophosphohydrolase~~~COG0494
MDTMVDFKTLPYRKGVGIVVFNREGQVWIGRRLITSSHTYAEVSKLWQFPQGGIDEGEEPLDAARRELYEETGMRSVNLI
KEVQDWFCYDFPQELIGHVLNNQYRGQMQKWFAFQFIGETSEIVINSPENSNKAEFDQWKWINLEVLPSIVVSFKRHVYM
KVVHEFRNII
>P0A776 3.6.1.-~~~rppH~~~RNA pyrophosphohydrolase~~~COG0494
MIDDDGYRPNVGIVICNRQGQVMWARRFGQHSWQFPQGGINPGESAEQAMYRELFEEVGLSRKDVRILASTRNWLRYKLP
KRLVRWDTKPVCIGQKQKWFLLQLVSGDAEINMQTSSTPEFDGWRWVSYWYPVRQVVSFKRDVYRRVMKEFASVVMSLQE
NTPKPQNASAYRRKRG
>Q9ZDT9 3.6.1.-~~~rppH~~~RNA pyrophosphohydrolase~~~COG0494
MRNSSNKYLDLPYRPGVGMMILNADNQIFVGKRIDTKISSWQMPQGGIVPGETPSIAAMREMLEEIGSNKGYIIAESKCW
YSYDVPSFLIPKLWNGNFRGQKQRWFLIRFTGNNKDINIHTSNPEFDQWRWTSLDELLSIIIPFKRKLYQAVVKEFESLI
Q
>P06574 ~~~sigB~~~RNA polymerase sigma-B factor~~~COG1191
MTQPSKTTKLTKDEVDRLISDYQTKQDEQAQETLVRVYTNLVDMLAKKYSKGKSFHEDLRQVGMIGLLGAIKRYDPVVGK
SFEAFAIPTIIGEIKRFLRDKTWSVHVPRRIKELGPRIKMAVDQLTTETQRSPKVEEIAEFLDVSEEEVLETMEMGKSYQ
ALSVDHSIEADSDGSTVTILDIVGSQEDGYERVNQQLMLQSVLHVLSDREKQIIDLTYIQNKSQKETGDILGISQMHVSR
LQRKAVKKLREALIEDPSMELM
>P9WGH0 ~~~sigC~~~ECF RNA polymerase sigma factor SigC~~~
MTATASDDEAVTALALSAAKGNGRALEAFIKATQQDVWRFVAYLSDVGSADDLTQETFLRAIGAIPRFSARSSARTWLLA
IARHVVADHIRHVRSRPRTTRGARPEHLIDGDRHARGFEDLVEVTTMIADLTTDQREALLLTQLLGLSYADAAAVCGCPV
GTIRSRVARARDALLADAEPDDLTG
>P9WGH1 ~~~sigC~~~ECF RNA polymerase sigma factor SigC~~~COG1595
MTATASDDEAVTALALSAAKGNGRALEAFIKATQQDVWRFVAYLSDVGSADDLTQETFLRAIGAIPRFSARSSARTWLLA
IARHVVADHIRHVRSRPRTTRGARPEHLIDGDRHARGFEDLVEVTTMIADLTTDQREALLLTQLLGLSYADAAAVCGCPV
GTIRSRVARARDALLADAEPDDLTG
>P10726 ~~~sigD~~~RNA polymerase sigma-D factor~~~COG1191
MQSLNYEDQVLWTRWKEWKDPKAGDDLMRRYMPLVTYHVGRISVGLPKSVHKDDLMSLGMLGLYDALEKFDPSRDLKFDT
YASFRIRGAIIDGLRKEDWLPRTSREKTKKVEAAIEKLEQRYLRNVSPAEIAEELGMTVQDVVSTMNEGFFANLLSIDEK
LHDQDDGENIQVMIRDDKNVPPEEKIMKDELIAQLAEKIHELSEKEQLVVSLFYKEELTLTEIGQVLNLSTSRISQIHSK
ALFKLKNLLEKVIQ
>P9WGG9 ~~~sigD~~~ECF RNA polymerase sigma factor SigD~~~COG1595
MVDPGVSPGCVRFVTLEISPSMTMQGERLDAVVAEAVAGDRNALREVLETIRPIVVRYCRARVGTVERSGLSADDVAQEV
CLATITALPRYRDRGRPFLAFLYGIAAHKVADAHRAAGRDRAYPAETLPERWSADAGPEQMAIEADSVTRMNELLEILPA
KQREILILRVVVGLSAEETAAAVGSTTGAVRVAQHRALQRLKDEIVAAGDYA
>P06222 ~~~sigE~~~RNA polymerase sigma-E factor~~~COG1191
MKKLKLRLTHLWYKLLMKLGLKSDEVYYIGGSEALPPPLSKDEEQVLLMKLPNGDQAARAILIERNLRLVVYIARKFENT
GINIEDLISIGTIGLIKAVNTFNPEKKIKLATYASRCIENEILMYLRRNNKIRSEVSFDEPLNIDWDGNELLLSDVLGTD
DDIITKDIEANVDKKLLKKALEQLNEREKQIMELRFGLVGEEEKTQKDVADMMGISQSYISRLEKRIIKRLRKEFNKMV
>P07860 ~~~sigF~~~RNA polymerase sigma-F factor~~~COG1191
MDVEVKKNGKNAQLKDHEVKELIKQSQNGDQQARDLLIEKNMRLVWSVVQRFLNRGYEPDDLFQIGCIGLLKSVDKFDLT
YDVRFSTYAVPMIIGEIQRFIRDDGTVKVSRSLKELGNKIRRAKDELSKTLGRVPTVQEIADHLEIEAEDVVLAQEAVRA
PSSIHETVYENDGDPITLLDQIADNSEEKWFDKIALKEAISDLEEREKLIVYLRYYKDQTQSEVAERLGISQVQVSRLEK
KILKQIKVQMDHTDG
>P19940 ~~~sigG~~~RNA polymerase sigma-G factor~~~COG1191
MSRNKVEICGVDTSKLPVLKNEEMRKLFRQLQDEGDDSAREKLVNGNLRLVLSVIQRFNNRGEYVDDLFQVGCIGLMKSI
DNFDLSHNVKFSTYAVPMIIGEIRRYLRDNNPIRVSRSLRDIAYKALQVRERLISETSKEPTAEDIAKVLEVPHEEIVFA
LDAIQDPVSLFEPIYNDGGDPIYVMDQISDERNTDSQWIEELALKEGMRRLNDREKMILRKRFFQGKTQMEVAEEIGISQ
AQVSRLEKAAIKQMNKNIHQ
>P17869 ~~~sigH~~~RNA polymerase sigma-H factor~~~COG1595
MNLQNNKGKFNKEQFCQLEDEQVIEKVHVGDSDALDYLITKYRNFVRAKARSYFLIGADREDIVQEGMIGLYKSIRDFKE
DKLTSFKAFAELCITRQIITAIKTATRQKHIPLNSYASLDKPIFDEESDRTLLDVISGAKTLNPEEMIINQEEFDDIEMK
MGELLSDLERKVLVLYLDGRSYQEISDELNRHVKSIDNALQRVKRKLEKYLEIREISL
>Q06198 ~~~algU~~~RNA polymerase sigma-H factor~~~
MLTQEQDQQLVERVQRGDKRAFDLLVLKYQHKILGLIVRFVHDAQEAQDVAQEAFIKAYRALGNFRGDSAFYTWLYRIAI
NTAKNHLVARGRRPPDSDVTAEDAEFFEGDHALKDIESPERAMLRDEIEATVHQTIQQLPEDLRTALTLREFEGLSYEDI
ATVMQCPVGTVRSRIFRAREAIDKALQPLLREA
>P12254 ~~~sigK~~~RNA polymerase sigma-K factor~~~
MVTGVFAALGFVVKELVFLVSYVKNNAFPQPLSSSEEKKYLELMAKGDEHARNMLIEHNLRLVAHIVKKFENTGEDAEDL
ISIGTIGLIKGIESYSAGKGTKLATYAARCIENEIVITKGGCIHPSLIRFNIYGVRIHNGNFFHDKVNNCFFIFKSMPPL
FVMNNEILMHLRALKKTKKDVSLHDPIGQDKEGNEISLIDVLKSENEDVIDTIQLNMELEKVKQYIDILDDREKEVIVGR
FGLDLKKEKTQREIAKELGISRSYVSRIEKRALMKMFHEFYRAEKEKRKKAKGK
>D4FWG0 ~~~rqcH~~~Rqc2 homolog RqcH~~~
MSFDGMFTYGMTHELNEKIMGGRITKVHQPYKHDVIFHIRAKGKNQKLLLSAHPSYSRVHITAQAYENPSEPPMFCMLLR
KHIEGGFIEKIEQAGLDRIMIFHIKSRNEIGDETVRKLYVEIMGRHSNIILTDAAENVIIDGLKHLSPSMNSYRTVLPGQ
DYKLPPAQDKISPLEASEDDILRHLSFQEGKLDKQIVDHFSGVSPLFAKEAVHRAGLANKVTLPKALLALFAEVKEHRFI
PNITTVNGKEYFYLLELTHLKGEARSFESLSELLDRFYFGKAERDRVKQQAQDLERFVVNERKKNANKIKKLEKTLEYSE
NAKEFQLYGELLTANLYMLKKGDKQAEVINYYDEESPTITIPLNPNKTPSENAQAYFTKYQKAKNSVAVVEEQIRLAQEE
IEYFDQLIQQLSSASPRDISEIREELVEGKYLRPKQQKGQKKQKPHNPVLETYESTSGLTILVGKNNRQNEYLTTRVAAR
DDIWLHTKDIPGSHVVIRSSEPDEQTIMEAATIAAYFSKAKDSSSVPVDYTKIRHVKKPNGAKPGFVTYDSQHTVFVTPD
ADTVIKLKKS
>O34693 ~~~rqcH~~~Rqc2 homolog RqcH~~~COG1293
MSFDGMFTYGMTHELNEKIMGGRITKIHQPYKHDVIFHIRAKGKNQKLLLSAHPSYSRVHITAQAYENPSEPPMFCMLLR
KHIEGGFIEKIEQAGLDRIMIFHIKSRNEIGDETVRKLYVEIMGRHSNIILTDAAENVIIDGLKHLSPSMNSYRTVLPGQ
DYKLPPAQDKISPLEASEDDILRHLSFQEGRLDKQIVDHFSGVSPLFAKEAVHRAGLANKVTLPKALLALFAEVKEHRFI
PNITTVNGKEYFYLLELTHLKGEARRFDSLSELLDRFYFGKAERDRVKQQAQDLERFVVNERKKNANKIKKLEKTLEYSE
NAKEFQLYGELLTANLYMLKKGDKQAEVINYYDEESPTITIPLNPNKTPSENAQAYFTKYQKAKNSVAVVEEQIRLAQEE
IEYFDQLIQQLSSASPRDISEIREELVEGKYLRPKQQKGQKKQKPHNPVLETYESTSGLTILVGKNNRQNEYLTTRVAAR
DDIWLHTKDIPGSHVVIRSSEPDEQTIMEAATIAAYFSKAKDSSSVPVDYTKIRHVKKPNGAKPGFVTYDSQHTVFVTPD
ADTVIKLKKS
>Q8DQ36 ~~~rqcH~~~Rqc2 homolog RqcH~~~COG1293
MSFDGFFLHHIVEELRSELVNGRIQKINQPFEQELVLQIRSNRQSHRLLLSAHPVFGRIQLTQTTFENPAQPSTFIMVLR
KYLQGALIESIEQVENDRIVEMTVSNKNEIGDHIQATLIIEIMGKHSNILLVDKSSHKILEVIKHVGFSQNSYRTLLPGS
TYIAPPSTESLNPFTIKDEKLFEILQTQELTAKNLQSLFQGLGRDTANELERILVSEKLSAFRNFFNQETKPCLTETSFS
PVPFANQAGEPFANLSDLLDTYYKNKAERDRVKQQASELIRRVENELQKNRHKLKKQERELLATDNAEEFRQKGELLTTF
LHQVPNDQDQVILDNYYTNQPIMIALDKALTPNQNAQRYFKRYQKLKEAVKYLTDLIEETKATILYLESVETVLNQAGLE
EIAEIREELIQTGFIRRRQREKIQKRKKLEQYLASDGKTIIYVGRNNLQNEELTFKMARKEELWFHAKDIPGSHVVISGN
LDPSDAVKTDAAELAAYFSQGRLSNLVQVDMIEVKKLNKPTGGKPGFVTYTGQKTLRVTPDSKKIASMKKS
>A4VWG9 ~~~rqcH~~~Rqc2 homolog RqcH~~~COG1293
MSFDGFFLHHMTAELRANLEGGRIQKINQPFEQEIVLNIRSNRQSHKLLLSAHSVFGRVQLTQSDFTNPKVPNTFTMILR
KYLQGAIIEEIRQLDNDRILEFSVSNKDEIGDHIQATLIVEIMGKHSNIILVDKSEQKIIEAIKHVGFSQNSYRTILPGS
TYIRPPETHSLNPYTVSDEKLFEILSTQELSPKNLQQVFQGLGRDTASELANHLQIDRLKNFRAFFDQATQPSLTDKSYA
ALPFANSPENQPHFESLSSLLDFYYQDKAERDRVAQQANELIKRVASELEKNRKKLIKQEQELADTETAELVRQKGELLT
TYLHQVPNDQSSVRLDNYYTGKELEIELDVALTPSQNAQRYFKKYQKLKEAVKHLTNLIEETKSTIVYLESVDTMLGQAS
LAEIDEIREELIETGYLKRRHREKIHKRQKPERYLATDGKTIILVGKNNLQNDELTFKMAKKGELWFHAKDIPGSHVVIT
DNLDPSDEVKTDAAELAAYFSKARHSNLVQVDMIEAKKLHKPTGGKPGFVTYRGQKTLRVTPTEDKIKSMKI
>Q9RRH3 2.7.11.1~~~rqkA~~~DNA damage-responsive serine/threonine-protein kinase RqkA~~~COG0515
MPLTPGTLLAGRYELLALLGEGGSAQVYRAQDGLLGREVALKVMHDYLPESDRSRFLREVRTLARLTHPGVVPVLDLGQE
PEAGRPFFTMPLLTGGPITRLGPLEDAPGPLARFLTAAAFASRALHHVHSHGIVHRDLTPGNVLLDDTGLPRIMDFGLVA
LSEQTRHLTRSGVTLGTPAYMAPEQAKGGGVDARSDLYALGAVLYRVACGSPPFVGDSDQSVLYQHVYEPVPDPRDLNPA
VPDAVARVLLWLLAKRADRRPQSGAALAHLWALARRDLWTTHARGQYRGGRARTGEHPDGPARVSDMQELWSVALPGEVT
WPAAVVGEGDLVAVGTRGGQLVLTHTSGRPFATYAARDEVTAPATLIGGHVLYGAWDGTLRRVELQSGSEVWRHQARAEF
TGAPTVWGGRLLAPSRDGHLHALSLRTGELAWAYRAGGSLAASPLVWAGAALQCDETGWLHALDARSGTPLWKVEVGTVH
ATPALLPGPPGTATLVIATWEGEVHAIGLEVQNGRAALAGEDAIRWTYDVEDEVWASPALTALDLPPDSGAAPDASAAPG
GVVVVAGWGGKVRGLRLADGEDLWERTLDGRVTASPVISAGLVFLATEGGELLALDVRNGEVRWTCRERSGVQATPLAAS
GTLYVAFMDGTLRAYRNAHPEWRSEQEG
>A0R664 4.1.3.17~~~rraA~~~Putative 4-hydroxy-4-methyl-2-oxoglutarate aldolase~~~COG0684
MSIEPRATADLVDEIYPDVRSCDLQLQNYGGKIMFAGPVTTVRCFQDNALLKSILSTPGEGAVLVVDGAGSLHTALVGDV
IAGLAADNGWSGVIVNGAVRDAAALRTIDVGIKALGTNPRKGTKTGEGERDVEVSFGGVTFTPGDIAYCDEDGIVVVTP
>P9WGY3 4.1.3.17~~~rraA~~~Putative 4-hydroxy-4-methyl-2-oxoglutarate aldolase~~~COG0684
MAISFRPTADLVDDIGPDVRSCDLQFRQFGGRSQFAGPISTVRCFQDNALLKSVLSQPSAGGVLVIDGAGSLHTALVGDV
IAELARSTGWTGLIVHGAVRDAAALRGIDIGIKALGTNPRKSTKTGAGERDVEITLGGVTFVPGDIAYSDDDGIIVV
>Q02KR3 4.1.3.17~~~~~~Putative 4-hydroxy-4-methyl-2-oxoglutarate aldolase~~~
MHYVTPDLCDAYPELVQVVEPMFSNFGGRDSFGGEIVTIKCFEDNSLVKEQVDKDGKGKVLVVDGGGSLRRALLGDMLAE
KAAKNGWEGIVVYGCIRDVDVIAQTDLGVQALASHPLKTDKRGIGDLNVAVTFGGVTFRPGEFVYADNNGIIVSPQALKM
PE
>Q9I2W7 4.1.3.17~~~~~~Putative 4-hydroxy-4-methyl-2-oxoglutarate aldolase~~~
MHYVTPDLCDAYPELVQVVEPMFSNFGGRDSFGGEIVTIKCFEDNSLVKEQVDKDGKGKVLVVDGGGSLRRALLGDMLAE
KAAKNGWEGIVVYGCIRDVDVIAQTDLGVQALASHPLKTDKRGIGDLNVAVTFGGVTFRPGEFVYADNNGIIVSPQALKM
PE
>Q5SIP7 4.1.3.17~~~~~~4-hydroxy-4-methyl-2-oxoglutarate aldolase~~~COG0684
MEARTTDLSDLYPEGEALPMVFKSFGGRARFAGRVRTLRVFEDNALVRKVLEEEGAGQVLFVDGGGSLRTALLGGNLARL
AWEKGWAGVVVHGAVRDTEELREVPIGLLALAATPKKSAKEGKGEVDVPLKVLGVEVLPGSFLLADEDGLLLLPEPPSGV
RSGG
>Q9KPK1 4.1.3.17~~~~~~Putative 4-hydroxy-4-methyl-2-oxoglutarate aldolase~~~COG0684
MRDITPDLCDKYESQVTLLNLPLQNFGQRSAFWGEIVTVRCYHDNSKVRDVLSQNGKGKVLVVDGHGSCHKALMGDQLAI
LAIKNDWEGVIIYGAVRDVVAMSEMDLGIKALGTSPFKTEKRGAGQVNVTLTMQNQIVEPGDYLYADWNGILMSETALDV
A
>P0A8R0 ~~~rraA~~~Regulator of ribonuclease activity A~~~COG0684
MKYDTSELCDIYQEDVNVVEPLFSNFGGRASFGGQIITVKCFEDNGLLYDLLEQNGRGRVLVVDGGGSVRRALVDAELAR
LAVQNEWEGLVIYGAVRQVDDLEELDIGIQAMAAIPVGAAGEGIGESDVRVNFGGVTFFSGDHLYADNTGIILSEDPLDI
E
>P0AF90 ~~~rraB~~~Regulator of ribonuclease activity B~~~COG3076
MANPEQLEEQREETRLIIEELLEDGSDPDALYTIEHHLSADDLETLEKAAVEAFKLGYEVTDPEELEVEDGDIVICCDIL
SECALNADLIDAQVEQLMTLAEKFDVEYDGWGTYFEDPNGEDGDDEDFVDEDDDGVRH
>P44831 ~~~rraB~~~Regulator of ribonuclease activity B~~~COG3076
MSKLAELQAETREIIEDLLNDGSEPNALYIIEHHIAHHDFDLLEKIAVDAFKAGYEVSEAEEFKDDDGKPIFCFDIISEV
ELKAEIIDAQQKEILPLLEKHNGIYDGWGTYFEDPNADDDEYGDDGEFLDDEDEYGDDGEFFDDEDEEEPRVH
>Q97D82 1.11.1.1~~~rbr3A~~~Reverse rubrerythrin-1~~~COG1592
MKKFKCVVCGYIYTGEDAPEKCPVCGAGKDKFVEVKDEGEGWADEHKIGVAKGVDKEVLEGLRANFTGECTEVGMYLAMA
RQADREGYPEVAEAYKRIAFEEAEHASKFAELLGEVVVADTKTNLQMRVDAEKGACEGKKELATLAKKLNYDAIHDTVHE
MCKDEARHGSAFRGLLNRYFK
>Q97D83 1.11.1.1~~~rbr3B~~~Reverse rubrerythrin-2~~~COG1592
MKKFKCVVCGYIYTGEDAPEKCPVCGAGKDKFVEVKDEGEGWADEHKIGIAKGVDKEVLEGLRANFTGECTEVGMYLAMA
RQADREGYPEVAEAYKRIAFEEAEHASKFAELLGEVVVADTKTNLQMRVDAEKGACEGKKELATLAKKLNYDAIHDTVHE
MCKDEARHGSAFRGLLNRYFK
>P72781 ~~~rre1~~~Response regulator Rre1~~~COG2197
MAEPISLLLVDDEPGVRESVQAFLEDSGDFKVDLAANATEAWDYLQHHLPALVISDIMMPQVDGYQFLQKLREDARFQSL
PVVFLTARGMTGDRIQGYQTGCDAFLSKPFDPDELEAIVRNLLARQQASSDAGSESAKLQEIYQEIRALKEQIGQPSGIH
TTPSPIKLDFTPREQSVLDLVSQGLMNKEIAAQLKTSVRNVEKYVSRLFTKTGTNSRTELVRFALQHGLTE
>O66928 ~~~frr~~~Ribosome-recycling factor~~~COG0233
MIKELEDIFKEAEKDMKKAVEYYKNEIAGLRTSRASTALVEEIKVEYYGSKVPIKQLGTISVPEHNQIVIQVWDQNAVPA
IEKAIREELNLNPTVQGNVIRVTLPPLTEERRRELVRLLHKITEEARVRVRNVRREAKEMIEELEGISEDEKKRALERLQ
KLTDKYIDEINKLMEAKEKEIMSV
>Q81WL1 ~~~frr~~~Ribosome-recycling factor~~~COG0233
MGQQVLKFSNEKMEKAVAAYSRELATVRAGRASASVLDKVQVDYYGAPTPVVQLANITVPEARLLVIQPYDKTSIGDIEK
AILKADLGLNPSNDGTVIRIAFPALTEERRRDLVKVVKKYAEEAKVAVRNVRRDGNDDLKKLEKAGEITEDDLRGYTEDI
QKETDKYIAKVDEIAKNKEKEIMEV
>P81101 ~~~frr~~~Ribosome-recycling factor~~~COG0233
MSKEVLTQTKEKMEKAIAAYQRELATVRAGRANPSLLDKVTVEYYGAQTPLNQLSSINVPEARMLVITPYDKTAIGDIEK
AILKADLGLTPTSDGNMIRIAIPALTEERRKELVKVVKKYAEEAKVAVRNVRRDANDDLKKLEKNGDITEDELRASTEDV
QKLTDEYVSKIDSVTKDKEKEIMEV
>P0A805 ~~~frr~~~Ribosome-recycling factor~~~COG0233
MISDIRKDAEVRMDKCVEAFKTQISKIRTGRASPSLLDGIVVEYYGTPTPLRQLASVTVEDSRTLKINVFDRSMSPAVEK
AIMASDLGLNPNSAGSDIRVPLPPLTEERRKDLTKIVRGEAEQARVAVRNVRRDANDKVKALLKDKEISEDDDRRSQDDV
QKLTDAAIKKIEAALADKEAELMQF
>Q2GHJ5 ~~~frr~~~Ribosome-recycling factor~~~COG0233
MISEVKQDAKSRMEKSLSVYLSDIDGIRTGRARTSVLNGIVVETYGGRVKLNTISSVSVSDNKTLMIKVWDSNNIGAIKT
AIMNSNLGFGISCEATTIRLTVPDMTQDMRKNLVKLLGKISEDCRVSIRNIRRDIMDRLKVMQDSKEISEDDLRVAGVEI
QKITDDIMKKVNDAFTSKEKELLHV
>P44307 ~~~frr~~~Ribosome-recycling factor~~~COG0233
MLNQIKKDAQDRMEKSLEALKGHISKIRTGRAQPSLLDAIQVEYYGAATPLRQLANVVAEDARTLAVTVFDRSLISAVEK
AILTSDLGLNPSSAGTTIRVPLPPLTEERRRDLIKIVKGEGEQGKVAVRNVRRDANDKIKALLKDKEISENEQHKAEEEI
QKITDIYIKKVDEVLADKEKELMDF
>P75161 ~~~frr~~~Ribosome-recycling factor~~~
MSPEKYLNFFKETADKKFQWLKEELSKIRTGRPNPKLFDNLLVESYGDRMPMVALAQIAVNPPREIVIKPFDVKNNINAI
YSEIQRANLGVQPVIDGDKIRINFPPMTQESRLESIKQAKKVVEQIHQELRSVRRDTLQMIKKDDHKDEDFEEFLKEEVE
KVNKQYIAQLETIQKQKEKELLVV
>A0QVE0 ~~~frr~~~Ribosome-recycling factor~~~COG0233
MIDETLFDAEEKMEKAVSVARDELGSIRTGRANPGMFNRINIDYYGSMTPITQLASINVPEARLVVIKPYEASQLRAIED
AIRNSDLGVNPSNDGNIIRVAIPQLTEERRRELVKQAKSKGEDAKVSVRNIRRKAMEELSRIKKDGEAGEDEVSRAEKDL
DKSTHTYTAQIDELVKHKESELLEV
>P9WGY1 ~~~frr~~~Ribosome-recycling factor~~~COG0233
MIDEALFDAEEKMEKAVAVARDDLSTIRTGRANPGMFSRITIDYYGAATPITQLASINVPEARLVVIKPYEANQLRAIET
AIRNSDLGVNPTNDGALIRVAVPQLTEERRRELVKQAKHKGEEAKVSVRNIRRKAMEELHRIRKEGEAGEDEVGRAEKDL
DKTTHQYVTQIDELVKHKEGELLEV
>P99130 ~~~frr~~~Ribosome-recycling factor~~~
MSDIINETKSRMQKSIESLSRELANISAGRANSNLLNGVTVDYYGAPTPVQQLASINVPEARLLVISPYDKTSVADIEKA
IIAANLGVNPTSDGEVIRIAVPALTEERRKERVKDVKKIGEEAKVSVRNIRRDMNDQLKKDEKNGDITEDELRSGTEDVQ
KATDNSIKEIDQMIADKEKDIMSV
>Q5XDH3 ~~~frr~~~Ribosome-recycling factor~~~
MANAIIETAKERFAQSHQSLSREYASIRAGRANASLLDRIQVDYYGAPTPLNQLASITVPEARVLLISPFDKSSIKDIER
ALNASDLGITPANDGSVIRLVIPALTEETRKELAKEVKKVGENAKIAIRNIRRDAMDDAKKQEKAKEITEDELKTLEKDI
QKATDDAIKEIDRMTAEKEKELLSV
>P74456 ~~~frr~~~Ribosome-recycling factor~~~COG0233
MKLAELKDHMQKSVEATQRSFNTIRTGRANASLLDRITVEYYGAETPLKSLATIGTPDASTIVIQPFDMGSIGTIEKAIS
LSDLGLTPNNDGKVIRLNIPPLTAERRKELVKVAGKLAEEGKVAIRNIRRDAVDEVRKQEKNSDISEDEARDLQEEIQKL
TDQSTKRIDELLAAKEKDITTV
>Q9X1B9 ~~~frr~~~Ribosome-recycling factor~~~COG0233
MVNPFIKEAKEKMKRTLEKIEDELRKMRTGKPSPAILEEIKVDYYGVPTPVNQLATISISEERTLVIKPWDKSVLSLIEK
AINASDLGLNPINDGNVIRLVFPSPTTEQREKWVKKAKEIVEEGKIAIRNIRREILKKIKEDQKEGLIPEDDAKRLENEI
QKLTDEFIEKLDEVFEIKKEEIMEF
>Q9WX76 ~~~frr~~~Ribosome-recycling factor~~~COG0233
MTLKELYAETRSHMQKSLEVLEHNLAGLRTGRANPALLLHLKVEYYGAHVPLNQIATVTAPDPRTLVVQSWDQNALKAIE
KAIRDSDLGLNPSNKGDALYINIPPLTEERRKDLVRAVRQYAEEGRVAIRNIRREALDKLKKLAKELHLSEDETKRAEAE
IQKITDEFIAKADQLAEKKEQEILG
>Q8GRF5 ~~~frr~~~Ribosome-recycling factor~~~COG0233
MINEIKKDAQERMDKSVEALKNNLSKVRTGRAHPSLLSGISVEYYGAATPLNQVANVVAEDARTLAITVFDKELTQKVEK
AIMMSDLGLNPMSAGTIIRVPLPPLTEERRKDLVKIVRGEAEGGRVAVRNIRRDANNDLKALLKDKEISEDEDRKAQEEI
QKLTDVAVKKIDEVLAAKEKELMEV
>Q55385 ~~~~~~Probable small ribosomal subunit protein cS23~~~
MTTAEAASTVHTSFILKVLWLDQNVAIAVDQIVGKGTSPLTSYFFWPRADAWQQLKDELEAKHWIAEADRINVLNQATEV
INFWQDLKNQNKQISMAEAQGKFPEVVFSGSN
>B7IA40 ~~~rpsJ~~~Small ribosomal subunit protein uS10~~~
MSNQRIRIRLKSFDHRLIDQSAQEIVETAKRTGAQVCGPIPMPTRIERFNVLTSPHVNKDARDQYEIRTYKRLIDIVQPT
DKTVDALMKLDLAAGVDVQIALG
>O66430 ~~~rpsJ~~~Small ribosomal subunit protein uS10~~~COG0051
MEQEKIRIKLRAYDHRLLDQSVKQIIETVKRTGGVVKGPIPLPTRKRKWCVLRSPHKFDQSREHFEIREFSRILDIIRFT
PQTIEALMEISLPAGVDVEVKMRG
>P21471 ~~~rpsJ~~~Small ribosomal subunit protein uS10~~~COG0051
MAKQKIRIRLKAYDHRILDQSAEKIVETAKRSGASVSGPIPLPTEKSVYTILRAVHKYKDSREQFEMRTHKRLIDIVNPT
PQTVDALMRLDLPSGVDIEIKL
>P0A7R5 ~~~rpsJ~~~Small ribosomal subunit protein uS10~~~COG0051
MQNQRIRIRLKAFDHRLIDQATAEIVETAKRTGAQVRGPIPLPTRKERFTVLISPHVNKDARDQYEIRTHLRLVDIVEPT
EKTVDALMRLDLAAGVDVQISLG
>Q839G5 ~~~rpsJ~~~Small ribosomal subunit protein uS10~~~COG0051
MAKQKIRIRLKAYEHRILDQSADKIVETAKRTGADVSGPIPLPTERSLYTVIRATHKYKDSREQFEMRTHKRLIDIVNPT
PKTVDALMKLDLPSGVNIEIKL
>A2RNQ6 ~~~rpsJ~~~Small ribosomal subunit protein uS10~~~COG0051
MATKKIRIRLKAYEHRILDAAAEKIVETAKRTNAEVSGPIPLPTDRSVYTVIRATHKYKDSREQFEMRTHKRLIDIIEPT
QKTVDSLMKLDLPSGVNIEIKL
>P66330 ~~~rpsJ~~~Small ribosomal subunit protein uS10~~~COG0051
MAKQKIRIRLKAYDHRILDQSAEKIVETAKRSGASVSGPIPLPTEKSIYTVLRAVHKYKDSREQFEMRTHKRLIDIVNPT
PQTVDSLMRLDLPSGVDIEIKL
>P75581 ~~~rpsJ~~~Small ribosomal subunit protein uS10~~~
MNAANAVKYPELKIKLESYDSTLLDLTTKKIVEVVKGVDVKIKGPLPLPTKKEVITIIRSPHVDKASREQFEKNRHKRLM
ILVDVNQGAIDSLKRIKIPVGVTLRFSK
>A0QSD0 ~~~rpsJ~~~Small ribosomal subunit protein uS10~~~COG0051
MAGQKIRIRLKAYDHEAIDASARKIVETVTRTGASVVGPVPLPTEKNVYCVIRSPHKYKDSREHFEMRTHKRLIDILDPT
PKTVDALMRIDLPASVDVNIQ
>P9WH67 ~~~rpsJ~~~Small ribosomal subunit protein uS10~~~COG0051
MAGQKIRIRLKAYDHEAIDASARKIVETVVRTGASVVGPVPLPTEKNVYCVIRSPHKYKDSREHFEMRTHKRLIDIIDPT
PKTVDALMRIDLPASVDVNIQ
>Q9HWD4 ~~~rpsJ~~~Small ribosomal subunit protein uS10~~~
MQNQQIRIRLKAFDHRLIDQSTQEIVETAKRTGAQVRGPIPLPTRKERFTVLISPHVNKDARDQYEIRTHKRVLDIVQPT
DKTVDALMKLDLAAGVEVQISLG
>Q6N4T6 ~~~rpsJ~~~Small ribosomal subunit protein uS10~~~COG0051
MNGQNIRIRLKAFDHRILDTSTREIVNTAKRTGAQVRGPIPLPTRIEKFTVNRSPHVDKKSREQFEMRTHKRLLDIVDPT
PQTVDALMKLDLAAGVDVEIKL
>Q2YYP5 ~~~rpsJ~~~Small ribosomal subunit protein uS10~~~
MAKQKIRIRLKAYDHRVIDQSAEKIVETAKRSGADVSGPIPLPTEKSVYTIIRAVHKYKDSREQFEQRTHKRLIDIVNPT
PKTVDALMGLNLPSGVDIEIKL
>Q931G5 ~~~rpsJ~~~Small ribosomal subunit protein uS10~~~
MAKQKIRIRLKAYDHRVIDQSAEKIVETAKRSGADVSGPIPLPTEKSVYTIIRAVHMYKDSREQFEQRTHKRLIDIVNPT
PKTVDALMGLNLPSGVDIEIKL
>P66334 ~~~rpsJ~~~Small ribosomal subunit protein uS10~~~
MAKQKIRIRLKAYDHRVIDQSAEKIVETAKRSGADVSGPIPLPTEKSVYTIIRAVHKYKDSREQFEQRTHKRLIDIVNPT
PKTVDALMGLNLPSGVDIEIKL
>B1LBP1 ~~~rpsJ~~~Small ribosomal subunit protein uS10~~~
MPGQKIRIKLKAYDHELLDESAKKIVEVAKSTNSKVSGPIPLPTERTLYCVLRSPMKHKDSREHFEKRVHKRLIDIIDPS
PKTIDALMRINLPAGVDVEIKL
>P62653 ~~~rpsJ~~~Small ribosomal subunit protein uS10~~~COG0051
MPKIRIKLRGFDHKTLDASAQKIVEAARRSGAQVSGPIPLPTRVRRFTVIRGPFKHKDSREHFELRTHNRLVDIINPNRK
TIEQLMTLDLPTGVEIEIKTVGGGR
>Q5SHN7 ~~~rpsJ~~~Small ribosomal subunit protein uS10~~~COG0051
MPKIRIKLRGFDHKTLDASAQKIVEAARRSGAQVSGPIPLPTRVRRFTVIRGPFKHKDSREHFELRTHNRLVDIINPNRK
TIEQLMTLDLPTGVEIEIKTVGGGR
>P80375 ~~~rpsJ~~~Small ribosomal subunit protein uS10~~~
MPKIRIKLRGFDHKTLDASAQKIVEAARRSGAQVSGPIPLPTRVRRFTVIRGPFKHKDSREHFELRTHNRLVDIINPNRK
TIEQLMTLDLPTGVEIEIKTVGGGR
>B7IA16 ~~~rpsK~~~Small ribosomal subunit protein uS11~~~
MAKDTRTRKKVTRTVSEGVAHIHASFNNTIVTITDRQGNALAWATSGGQGFRGSRKSTPFAAQVAAEVAGKAALDYGLKN
LDVLVKGPGPGRESAVRALGAVGYKINSITDVTPIPHNGCRPPKKRRV
>P04969 ~~~rpsK~~~Small ribosomal subunit protein uS11~~~COG0100
MAAARKSNTRKRRVKKNIESGIAHIRSTFNNTIVTITDTHGNAISWSSAGALGFRGSRKSTPFAAQMAAETAAKGSIEHG
LKTLEVTVKGPGSGREAAIRALQAAGLEVTAIRDVTPVPHNGCRPPKRRRV
>P0A7R9 ~~~rpsK~~~Small ribosomal subunit protein uS11~~~COG0100
MAKAPIRARKRVRKQVSDGVAHIHASFNNTIVTITDRQGNALGWATAGGSGFRGSRKSTPFAAQVAAERCADAVKEYGIK
NLEVMVKGPGPGRESTIRALNAAGFRITNITDVTPIPHNGCRPPKKRRV
>Q839E0 ~~~rpsK~~~Small ribosomal subunit protein uS11~~~COG0100
MAAKKVSRKRRVKKNIESGVAHIHSTFNNTIVMITDTHGNALAWSSAGSLGFKGSKKSTPFAAQMAAEAATKVAMEHGLK
TVDVTVKGPGSGREAAIRSLQATGLEVTAIRDVTPVPHNGCRPPKRRRV
>P10789 ~~~rpsK~~~Small ribosomal subunit protein uS11~~~
MARRTNTRKRRVRKNIDTGIAHIRSTFNNTIVTITDVHGNALAWASAGSLGFKGSRKSTPFAAQMAAEAAAKASMEHGMK
TVEVNVKGPGAGREAAIRALQAAGLEITAIKDVTPIPHDGCRPPKRRRV
>A2RNM8 ~~~rpsK~~~Small ribosomal subunit protein uS11~~~COG0100
MAKITRKRRVKKNIESGIVHIQSTFNNTIVMITDVHGNALAWSSAGALGFKGSKKSTPFAAQMASEAAAKAAQEQGLKTV
SVTVKGPGSGRESAIRALAAAGLNVTSISDVTPVPHNGARPPKRRRV
>P66352 ~~~rpsK~~~Small ribosomal subunit protein uS11~~~COG0100
MARKTNTRKRRVKKNIESGIAHIRSTFNNTIVMITDTHGNALAWSSAGSLGFKGSRKSTPFAAQMAAESAAKSAQEHGLK
TLEVTVKGPGSGREAAIRALQAAGLEVTAIKDVTPVPHNGCRPPKRRRV
>Q50296 ~~~rpsK~~~Small ribosomal subunit protein uS11~~~
MAKKKKINVSSGIIHVSCSPNNTIVSASDPGGNVLCWASSGTMGFKGSRKKTPYSAGIAADKVAKTVKEMGMATVKLFVK
GTGRGKDTAIRSFANAGLSITEINEKTPIPHNGCKPPKRPR
>A0QSL6 ~~~rpsK~~~Small ribosomal subunit protein uS11~~~COG0100
MAQAKKGGTAAKKGQKTRRREKKNVPHGAAHIKSTFNNTIVSITDPQGNVIAWASSGHVGFKGSRKSTPFAAQLAAENAA
RKAQEHGVKKVDVFVKGPGSGRETAIRSLQAAGLEVGTISDVTPQPHNGCRPPKRRRV
>P9WH65 ~~~rpsK~~~Small ribosomal subunit protein uS11~~~COG0100
MPPAKKGPATSARKGQKTRRREKKNVPHGAAHIKSTFNNTIVTITDPQGNVIAWASSGHVGFKGSRKSTPFAAQLAAENA
ARKAQDHGVRKVDVFVKGPGSGRETAIRSLQAAGLEVGAISDVTPQPHNGVRPPKRRRV
>Q9HWF8 ~~~rpsK~~~Small ribosomal subunit protein uS11~~~
MAKPAARPRKKVKKTVVDGIAHIHASFNNTIVTITDRQGNALSWATSGGSGFRGSRKSTPFAAQVAAERAGQAALEYGLK
NLDVNVKGPGPGRESAVRALNACGYKIASITDVTPIPHNGCRPPKKRRV
>Q6N4V6 ~~~rpsK~~~Small ribosomal subunit protein uS11~~~COG0100
MAKDATRIRRRERKNIASGIAHVNSSFNNTTITITDAQGNAIAWSSAGTMGFKGSRKSTPYAAQVAAEDVAKKAQEHGMR
TLEVEVAGPGSGRESALRALQAAGFTVTSIRDVTTIPHNGCRPRKRRRV
>Q2FW31 ~~~rpsK~~~Small ribosomal subunit protein uS11~~~COG0100
MARKQVSRKRRVKKNIENGVAHIRSTFNNTIVTITDEFGNALSWSSAGALGFKGSKKSTPFAAQMASETASKSAMEHGLK
TVEVTVKGPGPGRESAIRALQSAGLEVTAIRDVTPVPHNGCRPPKRRRV
>Q2YYM1 ~~~rpsK~~~Small ribosomal subunit protein uS11~~~
MARKQVSRKRRVKKNIENGVAHIRSTFNNTIVTITDEFGNALSWSSAGALGFKGSKKSTPFAAQMASETASKSAMEHGLK
TVEVTVKGPGPGRESAIRALQSAGLEVTAIRDVTPVPHNGCRPPKRRRV
>P66357 ~~~rpsK~~~Small ribosomal subunit protein uS11~~~
MARKQVSRKRRVKKNIENGVAHIRSTFNNTIVTITDEFGNALSWSSAGALGFKGSKKSTPFAAQMASETASKSAMEHGLK
TVEVTVKGPGPGRESAIRALQSAGLEVTAIRDVTPVPHNGCRPPKRRRV
>P62654 ~~~rpsK~~~Small ribosomal subunit protein uS11~~~COG0100
MAKKPSKKKVKRQVASGRAYIHASYNNTIVTITDPDGNPITWSSGGVIGYKGSRKGTPYAAQLAALDAAKKAMAYGMQSV
DVIVRGTGAGREQAIRALQASGLQVKSIVDDTPVPHNGCRPKKKFRKAS
>P80376 ~~~rpsK~~~Small ribosomal subunit protein uS11~~~COG0100
MAKKPSKKKVKRQVASGRAYIHASYNNTIVTITDPDGNPITWSSGGVIGYKGSRKGTPYAAQLAALDAAKKAMAYGMQSV
DVIVRGTGAGREQAIRALQASGLQVKSIVDDTPVPHNGCRPKKKFRKAS
>B7I7R9 ~~~rpsL~~~Small ribosomal subunit protein uS12~~~
MATTNQLIRKGRTTLVEKSKVPALKACPQRRGVCTRVYTTTPKKPNSAMRKVCRVRLTSGFEVSSYIGGEGHNLQEHSVV
LIRGGRVKDLPGVRYHTVRGSLDCAGVKDRNQSRSKYGAKRPKK
>P21472 ~~~rpsL~~~Small ribosomal subunit protein uS12~~~COG0048
MPTINQLIRKGRVSKVENSKSPALNKGYNSFKKEHTNVSSPQKRGVCTRVGTMTPKKPNSALRKYARVRLTNGIEVTAYI
PGIGHNLQEHSVVLIRGGRVKDLPGVRYHIVRGALDTAGVENRAQGRSKYGTKKPKAK
>P0A7S3 ~~~rpsL~~~Small ribosomal subunit protein uS12~~~COG0048
MATVNQLVRKPRARKVAKSNVPALEACPQKRGVCTRVYTTTPKKPNSALRKVCRVRLTNGFEVTSYIGGEGHNLQEHSVI
LIRGGRVKDLPGVRYHTVRGALDCSGVKDRKQARSKYGVKRPKA
>Q839H1 ~~~rpsL~~~Small ribosomal subunit protein uS12~~~COG0048
MPTINQLVRKPRKSKVEKSDSPALNKGYNSFKKTQTNVNSPQKRGVCTRVGTMTPKKPNSALRKYARVRLSNLIEVTAYI
PGIGHNLQEHSVVLLRGGRVKDLPGVRYHIVRGALDTAGVNDRKQSRSKYGTKRPKA
>P09901 ~~~rpsL~~~Small ribosomal subunit protein uS12~~~
MPTINQLVRKGREKKVFKSKSPALNKGYNSFKKEQTNVASPQKRGVCTRVGTMTPKKPNSALRKYARVRLTNGIEVTAYI
PGIGHNLQEHSVVLIRGGRVKDLRGVRYHIIRGGLDTAGVANRMQGRSKYGAKKPKAAKK
>P0A0X4 ~~~rpsL~~~Small ribosomal subunit protein uS12~~~COG0048
MPTINQLIRKERKKVVKKTKSPALVECPQRRGVCTRVYTTTPKKPNSALRKVAKVRLTSKFEVISYIPGEGHNLQEHSIV
LVRGGRVKDLPGVKYHIVRGALDTAGVNKRTVSRSKYGTKKAKATDKKATDNKKK
>A2RP74 ~~~rpsL~~~Small ribosomal subunit protein uS12~~~COG0048
MPTINQLVRKPRRAQVTKSKSPAMNVGYNSRKKVQTKLASPQKRGVATRVGTMTPKKPNSALRKFARVRLSNLMEVTAYI
PGIGHNLQEHSVVLLRGGRVKDLPGVRYHIVRGALDTAGVADRKQSRSKYGAKKPKA
>P66372 ~~~rpsL~~~Small ribosomal subunit protein uS12~~~COG0048
MPTINQLVRKPRQSKIKKSTSPALNKGLNSFKRELTDVNSPQKRGVCTRVGTMTPKKPNSALRKYARVRLSNGIEVTAYI
PGIGHNLQEHSVVLIRGGRVKDLPGVRYHIVRGALDTAGVENRGQSRSKYGTKKPKK
>Q53538 ~~~rpsL~~~Small ribosomal subunit protein uS12~~~
MPTIQQLVRKGRRDKISKVKTAALKGSPQRRGVCTRVYTTTPKKPNSALRKVARVKLTSQVEVTAYIPGEGHNLQEHSMV
LVRGGRVKDLPGVRYKIIRGSLDTQGVKNRKQARSRYGAKKEKG
>P75546 ~~~rpsL~~~Small ribosomal subunit protein uS12~~~
MATIAQLIRKPRKKKKVKSKSPALHYNLNLLNKKVTNVYSPLKRGVCTRVGTMTPKKPNSALRKYAKVRLTNGFEVLTYI
PGEGHNLQEHSVTLLRGGRVKDLPGVRYHIVRGTLDTVGVEKRRQQRSAYGAKKPKAKS
>A0QS96 ~~~rpsL~~~Small ribosomal subunit protein uS12~~~COG0048
MPTIQQLVRKGRRDKIAKVKTAALKGSPQRRGVCTRVYTTTPKKPNSALRKVARVKLTSQVEVTAYIPGEGHNLQEHSMV
LVRGGRVKDLPGVRYKIIRGSLDTQGVKNRKQARSRYGAKKEKS
>P9WH63 ~~~rpsL~~~Small ribosomal subunit protein uS12~~~COG0048
MPTIQQLVRKGRRDKISKVKTAALKGSPQRRGVCTRVYTTTPKKPNSALRKVARVKLTSQVEVTAYIPGEGHNLQEHSMV
LVRGGRVKDLPGVRYKIIRGSLDTQGVKNRKQARSRYGAKKEKG
>Q9HWD0 ~~~rpsL~~~Small ribosomal subunit protein uS12~~~
MATINQLVRKPRKRMVDKSDVPALQNCPQRRGVCTRVYTTTPKKPNSALRKVCRVRLTNGFEVSSYIGGEGHNLQEHSVV
LIRGGRVKDLPGVRYHTVRGSLDTSGVKDRKQGRSKYGAKRPK
>Q6N4T2 ~~~rpsL~~~Small ribosomal subunit protein uS12~~~COG0048
MPTINQLIANPRVVQKSRKKVPALQQSPQKRGVCTRVYTTTPKKPNSALRKVAKVRLTNGFEVIGYIPGEGHNLQEHSVV
MIRGGRVKDLPGVRYHILRGVLDTQGVKNRKQRRSKYGAKRPK
>P0A0H0 ~~~rpsL~~~Small ribosomal subunit protein uS12~~~COG0048
MPTINQLVRKPRQSKIKKSDSPALNKGFNSKKKKFTDLNSPQKRGVCTRVGTMTPKKPNSALRKYARVRLSNNIEINAYI
PGIGHNLQEHSVVLVRGGRVKDLPGVRYHIVRGALDTSGVDGRRQGRSLYGTKKPKN
>P0A0G8 ~~~rpsL~~~Small ribosomal subunit protein uS12~~~
MPTINQLVRKPRQSKIKKSDSPALNKGFNSKKKKFTDLNSPQKRGVCTRVGTMTPKKPNSALRKYARVRLSNNIEINAYI
PGIGHNLQEHSVVLVRGGRVKDLPGVRYHIVRGALDTSGVDGRRQGRSLYGTKKPKN
>P0A4A4 ~~~rpsL~~~Small ribosomal subunit protein uS12~~~COG0048
MPTIQQLVRKGRQDKVEKNKTPALEGSPQRRGVCTRVFTTTPKKPNSALRKVARVRLTSGIEVTAYIPGEGHNLQEHSIV
LVRGGRVKDLPGVRYKIIRGSLDTQGVKNRKQARSRYGAKKEK
>P0A4A3 ~~~rpsL~~~Small ribosomal subunit protein uS12~~~COG0048
MPTIQQLVRKGRQDKVEKNKTPALEGSPQRRGVCTRVFTTTPKKPNSALRKVARVRLTSGIEVTAYIPGEGHNLQEHSIV
LVRGGRVKDLPGVRYKIIRGSLDTQGVKNRKQARSRYGAKKEK
>P0A4A6 ~~~rpsL~~~Small ribosomal subunit protein uS12~~~
MPTIQQLVRKGRQDKVEKNKTPALEGSPQRRGVCTRVFTTTPKKPNSALRKVARVRLTSGIEVTAYIPGEGHNLQEHSIV
LVRGGRVKDLPGVRYKIIRGSLDTQGVKNRKQARSRYGAKKEK
>P0A4A5 ~~~rpsL~~~Small ribosomal subunit protein uS12~~~
MPTIQQLVRKGRQDKVEKNKTPALEGSPQRRGVCTRVFTTTPKKPNSALRKVARVRLTSGIEVTAYIPGEGHNLQEHSIV
LVRGGRVKDLPGVRYKIIRGSLDTQGVKNRKQARSRYGAKKEK
>P61941 ~~~rpsL~~~Small ribosomal subunit protein uS12~~~COG0048
MPTINQLVRKGREKVRKKSKVPALKGAPFRRGVCTVVRTVTPKKPNSALRKVAKVRLTSGYEVTAYIPGEGHNLQEHSVV
LIRGGRVKDLPGVRYHIVRGVYDAAGVKDRKKSRSKYGTKKPKEAAKTAAKK
>Q5SHN3 ~~~rpsL~~~Small ribosomal subunit protein uS12~~~COG0048
MPTINQLVRKGREKVRKKSKVPALKGAPFRRGVCTVVRTVTPKKPNSALRKVAKVRLTSGYEVTAYIPGEGHNLQEHSVV
LIRGGRVKDLPGVRYHIVRGVYDAAGVKDRKKSRSKYGTKKPKEAAKTAAKK
>P17293 ~~~rpsL~~~Small ribosomal subunit protein uS12~~~
MPTINQLVRKGREKVRKKSKVPALKGAPFRRGVCTVVRTVTPKKPNSALRKVAKVRLTSGYEVTAYIPGEGHNLQEHSVV
LIRGGRVKDLPGVRYHIVRGVYDAAGVKDRKKSRSKYGTKKPKEAAKTAAKK
>B7IA17 ~~~rpsM~~~Small ribosomal subunit protein uS13~~~
MARIAGVNIPDNKHAVISLTYIFGIGRHTAKNILAAVGITETTKIRELDDAQLDAIRAEVAKVPTEGDLRREISMNIKRL
MDLGCYRGLRHRRSLPVRGQRTKTNARTRKGPRKPIKK
>P20282 ~~~rpsM~~~Small ribosomal subunit protein uS13~~~COG0099
MARIAGVDIPRDKRVVISLTYIFGIGRTTAQQVLKEAGVSEDTRVRDLTEEELGKIRDIIDKLKVEGDLRREVSLNIKRL
IEIGSYRGIRHRRGLPVRGQNSKNNARTRKGPRRTVANKKK
>P0A7S9 ~~~rpsM~~~Small ribosomal subunit protein uS13~~~COG0099
MARIAGINIPDHKHAVIALTSIYGVGKTRSKAILAAAGIAEDVKISELSEGQIDTLRDEVAKFVVEGDLRREISMSIKRL
MDLGCYRGLRHRRGLPVRGQRTKTNARTRKGPRKPIKK
>P59754 ~~~rpsM~~~Small ribosomal subunit protein uS13~~~COG0099
MARIAGVDIPRDKRVVVSLTYIYGIGNTTAKKVLANVGVSEDVRVRDLTNEQTDAIRAEIDKLKVEGDLRREVNLNIKRL
MEIGSYRGIRHRRGLPTRGQNTKNNARTRKGPTKTVAGKKK
>P15757 ~~~rpsM~~~Small ribosomal subunit protein uS13~~~
MARIAGVDIPRDKRVVISLTYIYGIGKPTAQILKEAGVSEDTRVRDLTEEELGRIREIVGRLKVEGDLRREVSLNIKRLM
EIGCYRGLRHRRGLPVRGQNTKNNARTRKGPRRTVANKKK
>A2RNM9 ~~~rpsM~~~Small ribosomal subunit protein uS13~~~COG0099
MARFAGVDIPNEKRIVISLTYVFGVGLQTSKKVLAAAGVSEDIRTKDLTSDQEDAIRRELDGLKLEGDLRREVSLNIKRL
MEIGSYRGMRHRRGLPTRGQNTKNNARTRKGPAKSIAGKKK
>P66383 ~~~rpsM~~~Small ribosomal subunit protein uS13~~~COG0099
MARIAGVDVPREKRIVISLTYIYGIGKQTAKEVLAEAGVSEDTRTRDLTEEELGKIREILDRIKVEGDLRREVNLNIKRL
IEIGSYRGMRHRRGLPVRGQNTKNNARTRKGPSKTVAGKKK
>Q50297 ~~~rpsM~~~Small ribosomal subunit protein uS13~~~
MARILGIDIPNQKRIEIALTYIFGIGLSRSQAILKQANINPDKRVKDLTEEEFVAIRNVASAYKIEGDLRREIALNIKHL
SEIGAWRGLRHRKNLPVRGQRTRTNARTRKGPRKTVANKKIESK
>A0QSL5 ~~~rpsM~~~Small ribosomal subunit protein uS13~~~COG0099
MARLVGVDLPRDKRMEIALTYIYGIGRTRSNEILAATGIDKNMRTKDLTDDQVTVLRDYIEGNLKVEGDLRREVQADIRR
KIEIGCYQGLRHRRGLPVRGQRTKTNARTRKGPKRTIAGKKKAR
>P9WH61 ~~~rpsM~~~Small ribosomal subunit protein uS13~~~COG0099
MARLVGVDLPRDKRMEVALTYIFGIGRTRSNEILAATGIDRDLRTRDLTEEQLIHLRDYIEANLKVEGDLRREVQADIRR
KIEIGCYQGLRHRRGMPVRGQRTKTNARTRKGPKRTIAGKKKAR
>Q9HWF7 ~~~rpsM~~~Small ribosomal subunit protein uS13~~~
MARIAGVNIPDNKHTVISLTYIYGVGRTTAQSICAATGVNPAAKIKDLSDEQIDQLRNEVAKITTEGDLRREINMNIKRL
MDLGCYRGLRHRRGLPVRGQRTKTNARTRKGPRKPIRK
>Q6N4V5 ~~~rpsM~~~Small ribosomal subunit protein uS13~~~COG0099
MTGEKSVARIAGVNIPTNKRVLIALQYIHGIGQKNAADILEKVKIPLDRRVNQLSDAEVLQIREVIDRDYLVEGDLRRET
GMNIKRLMDLGCYRGLRHRRGLPVRGQRTHTNARTRKGPAKAIAGKKK
>Q2FW30 ~~~rpsM~~~Small ribosomal subunit protein uS13~~~COG0099
MARIAGVDIPREKRVVISLTYIYGIGTSTAQKILEEANVSADTRVKDLTDDELGRIREVVDGYKVEGDLRRETNLNIKRL
MEISSYRGIRHRRGLPVRGQKTKNNARTRKGPVKTVANKKK
>Q2YYM0 ~~~rpsM~~~Small ribosomal subunit protein uS13~~~
MARIAGVDIPREKRVVISLTYIYGIGTSTAQKILEEANVSADTRVKDLTDDELGRIREVVDGYKVEGDLRRETNLNIKRL
MEISSYRGIRHRRGLPVRGQKTKNNARTRKGPVKTVANKKK
>P66388 ~~~rpsM~~~Small ribosomal subunit protein uS13~~~
MARIAGVDIPREKRVVISLTYIYGIGTSTAQKILEEANVSADTRVKDLTDDELGRIREVVDGYKVEGDLRRETNLNIKRL
MEISSYRGIRHRRGLPVRGQKTKNNARTRKGPVKTVANKKK
>P62655 ~~~rpsM~~~Small ribosomal subunit protein uS13~~~COG0099
MARIAGVEIPRNKRVDVALTYIYGIGKARAKEALEKTGINPATRVKDLTEAEVVRLREYVENTWKLEGELRAEVAANIKR
LMDIGCYRGLRHRRGLPVRGQRTRTNARTRKGPRKTVAGKKKAPRK
>P80377 ~~~rpsM~~~Small ribosomal subunit protein uS13~~~COG0099
MARIAGVEIPRNKRVDVALTYIYGIGKARAKEALEKTGINPATRVKDLTEAEVVRLREYVENTWKLEGELRAEVAANIKR
LMDIGCYRGLRHRRGLPVRGQRTRTNARTRKGPRKTVAGKKKAPRK
>P12878 ~~~rpsN1~~~Small ribosomal subunit protein uS14B~~~COG0199
MAKKSMIAKQQRTPKFKVQEYTRCERCGRPHSVIRKFKLCRICFRELAYKGQIPGVKKASW
>Q839F1 ~~~rpsZ~~~Small ribosomal subunit protein uS14C~~~COG0199
MAKKSMIAKNKRPAKHSTQAYTRCERCGRPHSVYRKFHLCRICFRELAYKGQIPGVKKASW
>P54798 ~~~rpsZ~~~Small ribosomal subunit protein uS14~~~
MAKKSMIIKQKRTPKFKVRAYTRCERCGRPHSVYRKFKLCRICFRELAYKGQLPGIKKASW
>A2RNP2 ~~~rpsZ~~~Small ribosomal subunit protein uS14~~~COG0199
MAKKSMVVKNQRPAKFSTQAYTRCERCGRPHSVYRKFKLCRICLRELAYKGQLPGVKKASW
>P66401 ~~~rpsZ~~~Small ribosomal subunit protein uS14B~~~COG0199
MAKKSMIAKQKRTPKYAVQAYTRCERCGRPHSVIRKFKLCRICFRELAYKGQIPGVKKASW
>Q50305 ~~~rpsZ~~~Small ribosomal subunit protein uS14~~~
MAKKSLKVKQTRIPKFAVRAYTRCQRCGRARAVLSHFGVCRLCFRELAYAGAIPGVKKASW
>A0QSG2 ~~~rpsZ~~~Small ribosomal subunit protein uS14B~~~COG0199
MAKKALVHKANKKPKFAVRAYTRCNKCGRPHSVYRKFGLCRICLREMAHAGELPGVQKSSW
>P9WH57 ~~~rpsZ~~~Small ribosomal subunit protein uS14B~~~COG0199
MAKKALVNKAAGKPRFAVRAYTRCSKCGRPRAVYRKFGLCRICLREMAHAGELPGVQKSSW
>Q2FW19 ~~~rpsZ~~~Small ribosomal subunit protein uS14B~~~
MAKTSMVAKQQKKQKYAVREYTRCERCGRPHSVYRKFKLCRICFRELAYKGQIPGVRKASW
>Q2YYL0 ~~~rpsZ~~~Small ribosomal subunit protein uS14B~~~
MAKTSMVAKQQKKQKYAVREYTRCERCGRPHSVYRKFKLCRICFRELAYKGQIPGVRKASW
>P62656 ~~~rpsZ~~~Small ribosomal subunit protein uS14~~~COG0199
MARKALIEKAKRTPKFKVRAYTRCVRCGRARSVYRFFGLCRICLRELAHKGQLPGVRKASW
>P0DOY6 ~~~rpsZ~~~Small ribosomal subunit protein uS14~~~COG0199
MARKALIEKAKRTPKFKVRAYTRCVRCGRARSVYRFFGLCRICLRELAHKGQLPGVRKASW
>P24320 ~~~rpsZ~~~Small ribosomal subunit protein uS14~~~
MARKALIEKAKRTPKFKVRAYTRCVRCGRARSVYRFFGLCRICLRELAHKGQLPGVRKASW
>B7IA26 ~~~rpsN~~~Small ribosomal subunit protein uS14~~~
MAKKGMINRELKREKTVAKYAAKRAELKATIANVNASDEERFEAMLKLQALPRNASPVRLRNRCGLTGRPHGYFRKFGLS
RNKLRDTVMQGDVPGVVKASW
>O31587 ~~~rpsN2~~~Small ribosomal subunit protein uS14A~~~COG0199
MAKKSKVAKELKRQQLVEQYAGIRRELKEKGDYEALSKLPRDSAPGRLHNRCMVTGRPRAYMRKFKMSRIAFRELAHKGQ
IPGVKKASW
>P0AG59 ~~~rpsN~~~Small ribosomal subunit protein uS14~~~COG0199
MAKQSMKAREVKRVALADKYFAKRAELKAIISDVNASDEDRWNAVLKLQTLPRDSSPSRQRNRCRQTGRPHGFLRKFGLS
RIKVREAAMRGEIPGLKKASW
>A0R550 ~~~rpsN~~~Small ribosomal subunit protein uS14A~~~COG0199
MAKKSKIVKNEQRRELVQRYAERRAELKRTIRDPASSPERRAAAVSALQRLPRDSSPVRLRNRDVVDGRPRGHLRKFGLS
RVRVREMAHRGELPGVRKASW
>Q9HWE8 ~~~rpsN~~~Small ribosomal subunit protein uS14~~~
MAKESMKNRELKRQLTVAKYAKKRAELKAIIANPNSSAEERWNAQVALQKQPRDASASRLRNRCRLTGRPHGFYRKFGLS
RNKLREAAMRGDVPGLVKASW
>Q6N4U6 ~~~rpsN~~~Small ribosomal subunit protein uS14~~~COG0199
MAKKSAIEKNNRRKKMTKNAAPKRARLKAIIADKSKPMEERFAATLKLAEMPRNSSATRIRNRCDLTGRPRSVYRLNKLS
RIAIRDLGSRGLVLGLVKSSW
>Q2FYU5 ~~~rpsN~~~Small ribosomal subunit protein uS14A~~~COG0199
MAKKSKIAKERKREELVNKYYELRKELKAKGDYEALRKLPRDSSPTRLTRRCKVTGRPRGVLRKFEMSRIAFREHAHKGQ
IPGVKKSSW
>B7I3U0 ~~~rpsO~~~Small ribosomal subunit protein uS15~~~
MALTNADRAEIIAKFARAENDTGSPEVQVALLTAQINDLQGHFKAHKHDHHSRRGLIRMVNQRRKLLDYLNGKDHERYTA
LIGALGLRR
>P21473 ~~~rpsO~~~Small ribosomal subunit protein uS15~~~COG0184
MAITQERKNQLINEFKTHESDTGSPEVQIAILTDSINNLNEHLRTHKKDHHSRRGLLKMVGKRRNLLTYLRNKDVTRYRE
LINKLGLRR
>Q0PA13 ~~~rpsO~~~Small ribosomal subunit protein uS15~~~COG0184
MALDSAKKAEIVAKFAKKPGDTGSTEVQVALLTARIAELTEHLKIYKKDFSSRLGLLKLVGQRKRLLSYLKRKDYNSYSK
LITELNLRDK
>P0ADZ4 ~~~rpsO~~~Small ribosomal subunit protein uS15~~~COG0184
MSLSTEATAKIVSEFGRDANDTGSTEVQVALLTAQINHLQGHFAEHKKDHHSRRGLLRMVSQRRKLLDYLKRKDVARYTQ
LIERLGLRR
>Q82ZJ1 ~~~rpsO~~~Small ribosomal subunit protein uS15~~~COG0184
MAISQERKNEIIKEYARHEGDTGSPEVQIAVLTEDINQLNEHARTHKKDHHSYRGLMKKIGHRRNLLAYLRKTDIQRYRE
LIQRLGLRR
>P05766 ~~~rpsO~~~Small ribosomal subunit protein uS15~~~
MALTQERKREIIEQFKVHENDTGSPEVQIAILTEQINNLNEHLRVHKKDHHSRRGLLKMVGKRRRLLAYLRNKDVARYRE
IVEKLGLRR
>A2RMV9 ~~~rpsO~~~Small ribosomal subunit protein uS15~~~COG0184
MAISKEKKQEIIAQYARKEGDTGSPEVQIAVLTWEINHLNDHIKSHKKDHATQRGLMKKIGHRRNLLGYLRGKDVQRYRE
LIASLGLRR
>Q92C24 ~~~rpsO~~~Small ribosomal subunit protein uS15~~~COG0184
MALTQERKNEIIAEYRVHDTDTGSPEVQIAVLTAEINSLNEHVRVHKKDHHSYRGLMKMVGHRRNLLTYLRKKDVQRYRE
LIKRLGLRR
>P75173 ~~~rpsO~~~Small ribosomal subunit protein uS15~~~
MQIDKNGIIKSAQLHDKDVGSIQVQVSLLTSQIKQLTDHLLANKKDFISKRGLYAKVSKRKRLLKYLKHNDLEAYRNLVK
TLNLRG
>A0QVQ3 ~~~rpsO~~~Small ribosomal subunit protein uS15~~~COG0184
MALTAEQKKEILGQYGLHDTDTGSPEAQVALLTKRIQDLTEHLKVHKHDHHSRRGLLLLVGRRRRLLKYVAQVDVARYRS
LIERLGLRR
>P9WH55 ~~~rpsO~~~Small ribosomal subunit protein uS15~~~COG0184
MALTAEQKKEILRSYGLHETDTGSPEAQIALLTKRIADLTEHLKVHKHDHHSRRGLLLLVGRRRRLIKYISQIDVERYRS
LIERLGLRR
>Q9HV58 ~~~rpsO~~~Small ribosomal subunit protein uS15~~~
MALSVEEKAQIVNEYKQAEGDTGSPEVQVALLSANINKLQDHFKANGKDHHSRRGLIRMVNQRRKLLDYLKGKDVSRYTA
LIGRLGLRR
>Q6NCN7 ~~~rpsO~~~Small ribosomal subunit protein uS15~~~COG0184
MSITAERKAEVIKTSATKAGDTGSPEVQVAILSERITNLTAHFKTHTKDNHSRRGLLKLVSTRRSLLDYIKKKDEARYKA
LLEKHNIRR
>Q2G2Q1 ~~~rpsO~~~Small ribosomal subunit protein uS15~~~COG0184
MAISQERKNEIIKEYRVHETDTGSPEVQIAVLTAEINAVNEHLRTHKKDHHSRRGLLKMVGRRRHLLNYLRSKDIQRYRE
LIKSLGIRR
>Q2YXP3 ~~~rpsO~~~Small ribosomal subunit protein uS15~~~
MAISQERKNEIIKEYRVHETDTGSPEVQIAVLTAEINAVNEHLRTHKKDHHSRRGLLKMVGRRRHLLNYLRSKDIQRYRE
LIKSLGIRR
>Q7A5X8 ~~~rpsO~~~Small ribosomal subunit protein uS15~~~
MAISQERKNEIIKEYRVHETDTGSPEVQIAVLTAEINAVNEHLRTHKKDHHSRRGLLKMVGRRRHLLNYLRSKDIQRYRE
LIKSLGIRR
>P62657 ~~~rpsO~~~Small ribosomal subunit protein uS15~~~COG0184
MPITKEEKQKVIQEFARFPGDTGSTEVQVALLTLRINRLSEHLKVHKKDHHSHRGLLMMVGQRRRLLRYLQREDPERYRA
LIEKLGIRG
>Q5SJ76 ~~~rpsO~~~Small ribosomal subunit protein uS15~~~COG0184
MPITKEEKQKVIQEFARFPGDTGSTEVQVALLTLRINRLSEHLKVHKKDHHSHRGLLMMVGQRRRLLRYLQREDPERYRA
LIEKLGIRG
>P80378 ~~~rpsO~~~Small ribosomal subunit protein uS15~~~
MPITKEEKQKVIQEFARFPGDTGSTEVQVALLTLRINRLSEHLKVHKKDHHSHRGLLMMVGQRRRLLRYLQREDPERYRA
LIEKLGIRG
>O66523 ~~~rpsP~~~Small ribosomal subunit protein bS16~~~COG0228
MAVRIRLAKFGRKHHPIYRIVVMDAKSPREGKYIDILGTYDPKRKVLINVYPEKVKEWVLKGVELSHRAKAILWNHGILK
EVVPEGYEMKRVGDYYVFEKRESKKSKGGEAA
>P21474 ~~~rpsP~~~Small ribosomal subunit protein bS16~~~COG0228
MAVKIRLKRMGAKKSPFYRIVVADSRSPRDGRFIETVGTYNPVAKPAEVKIDEELALKWLQTGAKPSDTVRNLFSSQGIM
EKFHNAKQGK
>P0A7T3 ~~~rpsP~~~Small ribosomal subunit protein bS16~~~COG0228
MVTIRLARHGAKKRPFYQVVVADSRNARNGRFIERVGFFNPIASEKEEGTRLDLDRIAHWVGQGATISDRVAALIKEVNK
AA
>Q834F9 ~~~rpsP~~~Small ribosomal subunit protein bS16~~~COG0228
MAVKIRLKRMGSKKSPFYRIVVADSRSPRDGRFIETVGTYNPLKDPAEVVLKEDLVLDWLSKGAQPSDTVRNILSKEGVM
KKHHEAKNVKK
>P81290 ~~~rpsP~~~Small ribosomal subunit protein bS16~~~
MAVKIRLKRMGAKKKPFYRIVVADSRSPRDGRFIETIGTYNPVAEPAEIKIDEELALKWLQNGAKPSDTRSLLSKQGLLE
KFHNLKYGK
>A2RJS1 ~~~rpsP~~~Small ribosomal subunit protein bS16~~~COG0228
MSVKIRLTRMGSKKKPFYRINVADSRAPRDGKFIETVGTYNPLVTENQVTLKEERVLEWLSNGAQPSDTVRNLLSKAGVM
KKFHESKLSK
>Q8Y699 ~~~rpsP~~~Small ribosomal subunit protein bS16~~~COG0228
MAVKIRLKRIGSKKKPFYRIVVADSRFPRDGRSIETIGTYNPLLDPVEVKIDEEATLKWMHNGAKPSDTVRNLLSREGIM
EKFHNQKLGK
>P75131 ~~~rpsP~~~Small ribosomal subunit protein bS16~~~
MRMGRVHYPTYRIVAVDSRVKRDGKYIALIGHLNPALKENKCKIDEAVALEWLNKGAKPTDTVRSLFSQTGLWKKFVESK
KKPVAKSK
>A0QV37 ~~~rpsP~~~Small ribosomal subunit protein bS16~~~COG0228
MAVKIKLTRLGKIRNPQYRIIVADARTRRDGRAIEVIGRYHPKEEPSLIQIDSERAQYWLGVGAQPTEPVLALLKITGDW
QKFKGLPGAEGTLKVKEPKPSKLDLFNAALAEAESGTTAAATTPKKKKAPKKDEAAEAPAEAAEAPAEAADAASES
>P9WH53 ~~~rpsP~~~Small ribosomal subunit protein bS16~~~COG0228
MAVKIKLTRLGKIRNPQYRVAVADARTRRDGRAIEVIGRYHPKEEPSLIEINSERAQYWLSVGAQPTEPVLKLLKITGDW
QKFKGLPGAQGRLKVAAPKPSKLEVFNAALAAADGGPTTEATKPKKKSPAKKAAKAAEPAPQPEQPDTPALGGEQAELTA
ES
>Q9HXP9 ~~~rpsP~~~Small ribosomal subunit protein bS16~~~
MVTIRLARGGSKKRPFYHLTVTNSRNARDGRFVERIGFFNPVATGGEVRLSVDQERATYWLGQGAQPSERVAQLLKDAAK
ANA
>P62236 ~~~rpsP~~~Small ribosomal subunit protein bS16~~~COG0228
MSVVIRLARAGTKKRPFYHVVVADSRFPRDGRFIERLGYFNPLMAKDNEARLKLDLDKVKDWLAKGAQPSDRVARFLDTA
GVRKREARNNPEKAVPRKERKAADGK
>Q2FZ45 ~~~rpsP~~~Small ribosomal subunit protein bS16~~~COG0228
MAVKIRLTRLGSKRNPFYRIVVADARSPRDGRIIEQIGTYNPTSANAPEIKVDEALALKWLNDGAKPTDTVHNILSKEGI
MKKFDEQKKAK
>Q2YXH8 ~~~rpsP~~~Small ribosomal subunit protein bS16~~~
MAVKIRLTRLGSKRNPFYRIVVADARSPRDGRIIEQIGTYNPTSANAPEIKVDEALALKWLNDGAKPTDTVHNILSKEGI
MKKFDEQKKAK
>P66440 ~~~rpsP~~~Small ribosomal subunit protein bS16~~~
MAVKIRLTRLGSKRNPFYRIVVADARSPRDGRIIEQIGTYNPTSANAPEIKVDEALALKWLNDGAKPTDTVHNILSKEGI
MKKFDEQKKAK
>P66444 ~~~rpsP~~~Small ribosomal subunit protein bS16~~~COG0228
MAVKIRLTRMGSKKKPFYRINVADSRSPRDGRFIETVGTYNPLVAENQVTLKEDRVLAWLANGAQPSDTVRNILSKEGVL
KKFHDSKFSK
>P62238 ~~~rpsP~~~Small ribosomal subunit protein bS16~~~COG0228
MVKIRLARFGSKHNPHYRIVVTDARRKRDGKYIEKIGYYDPRKTTPDWLKVDVERARYWLSVGAQPTDTARRLLRQAGVF
RQEAREGA
>Q5SJH3 ~~~rpsP~~~Small ribosomal subunit protein bS16~~~COG0228
MVKIRLARFGSKHNPHYRIVVTDARRKRDGKYIEKIGYYDPRKTTPDWLKVDVERARYWLSVGAQPTDTARRLLRQAGVF
RQEAREGA
>P80379 ~~~rpsP~~~Small ribosomal subunit protein bS16~~~
MVKIRLARFGSKHNPHYPHYRIVVTDARRKRDGKYIEKIGYYDPRKTTPDWLKVDVERARYWLSVGAQPTDTARRLLRQA
GVFRQEAREGA
>B7IA30 ~~~rpsQ~~~Small ribosomal subunit protein uS17~~~
MSEKTVRTLTGKVVSDKMDKSIVVLIERRVQHPLYGKSIRRSTKLHAHDENNVAKIGDVVTIKESRPISKTKAWTLVEVV
EAAAE
>P12874 ~~~rpsQ~~~Small ribosomal subunit protein uS17~~~COG0186
MSERNQRKVYQGRVVSDKMDKTITVVVETYKKHTLYGKRVKYSKKFKAHDENNQAKIGDIVKIMETRPLSATKRFRLVEV
VEEAVII
>P0AG63 ~~~rpsQ~~~Small ribosomal subunit protein uS17~~~COG0186
MTDKIRTLQGRVVSDKMEKSIVVAIERFVKHPIYGKFIKRTTKLHVHDENNECGIGDVVEIRECRPLSKTKSWTLVRVVE
KAVL
>Q839F5 ~~~rpsQ~~~Small ribosomal subunit protein uS17~~~COG0186
MTEERNQRKVYQGRVVSDKMDKTITVVVETKKNHPIYGKRMKYSKKYKAHDENNTAKVGDIVKIMETRPLSATKRFRLLE
VVEEAVII
>P23828 ~~~rpsQ~~~Small ribosomal subunit protein uS17~~~
MSERNQRKVYVGRVVSDKMDKTITVLVETYKKHPLYGKRVKYSKKYKAHDEHNEAKVGDIVKIMETRPLSATKRFRLVEI
VEKAVVL
>A2RNP6 ~~~rpsQ~~~Small ribosomal subunit protein uS17~~~COG0186
MERKQRKVYQGRVVSDKMDKTITVVVETKRNHPVYGKRINYSKKYKAHDENNSAKTGDIVRIMETRPLSKDKHFRLVEIV
EEAVII
>Q927L6 ~~~rpsQ~~~Small ribosomal subunit protein uS17~~~COG0186
MADRNQRKVYTGRVVSDKMDKTITVVVETYKKHGLYGKRVKYSKKFKAHDENNIAKTGDVVRISETRPLSATKHFRLLEV
VEEAVII
>O06051 ~~~rpsQ~~~Small ribosomal subunit protein uS17~~~
MAEAKTGAKAAPRVAKAAKAAPKKAAPNDAEAIGAANAANVKGPKHTPRTPKPRGRRKTRIGYVVSDKMQKTIVVELEDR
MRHPLYGKIIRTTKKVKAHDEDSVAGIGDRVSLMETRPLSATKRWRLVEILEKAK
>Q50309 ~~~rpsQ~~~Small ribosomal subunit protein uS17~~~
MKRNQRKVLIGIVKSTKNAKTATVQVESRFKHPLYHKSVVRHKKYQAHNEGEVLAKDGDKVQIVETRPLSATKRFRIAKI
IERAK
>A0QSE0 ~~~rpsQ~~~Small ribosomal subunit protein uS17~~~COG0186
MADQKGPKYTPAAEKPRGRRKTAIGYVVSDKMQKTIVVELEDRKSHPLYGKIIRTTKKVKAHDENGEAGIGDRVSLMETR
PLSATKRWRLVEILEKAK
>P9WH51 ~~~rpsQ~~~Small ribosomal subunit protein uS17~~~COG0186
MAEAKTGAKAAPRVAKAAKAAPKKAAPNDAEAIGAANAANVKGPKHTPRTPKPRGRRKTRIGYVVSDKMQKTIVVELEDR
MRHPLYGKIIRTTKKVKAHDEDSVAGIGDRVSLMETRPLSATKRWRLVEILEKAK
>Q9HWE4 ~~~rpsQ~~~Small ribosomal subunit protein uS17~~~
MAEAQKTVRTLTGRVVSDKMDKTVTVLIERRVKHPIYGKYVKRSTKLHAHDESNQCRIGDLVTIRETRPLAKTKAWTLVD
IVERAVEV
>Q6N4U3 ~~~rpsQ~~~Small ribosomal subunit protein uS17~~~COG0186
MPKRTLQGVVVSDKQAKTIVVRVDRRFTHPIYKKTIRRSKNYHAHDENNQFKPGDMVWIEESKPISKLKRWTVVRGEPKK
TA
>Q2FW15 ~~~rpsQ~~~Small ribosomal subunit protein uS17~~~COG0186
MSERNDRKVYVGKVVSDKMDKTITVLVETYKTHKLYGKRVKYSKKYKTHDENNSAKLGDIVKIQETRPLSATKRFRLVEI
VEESVII
>Q2YYK6 ~~~rpsQ~~~Small ribosomal subunit protein uS17~~~
MSERNDRKVYVGKVVSDKMDKTITVLVETYKTHKLYGKRVKYSKKYKTHDENNSAKLGDIVKIQETRPLSATKRFRLVEI
VEESVII
>Q7A462 ~~~rpsQ~~~Small ribosomal subunit protein uS17~~~
MSERNDRKVYVGKVVSDKMDKTITVLVETYKTHKLYGKRVKYSKKYKTHDENNSAKLGDIVKIQETRPLSATKRFRIVEI
VEESVII
>P62658 ~~~rpsQ~~~Small ribosomal subunit protein uS17~~~COG0186
MPKKVLTGVVVSDKMQKTVTVLVERQFPHPLYGKVIKRSKKYLAHDPEEKYKLGDVVEIIESRPISKRKRFRVLRLVESG
RMDLVEKYLIRRQNYQSLSKRGGKA
>P0DOY7 ~~~rpsQ~~~Small ribosomal subunit protein uS17~~~COG0186
MPKKVLTGVVVSDKMQKTVTVLVERQFPHPLYGKVIKRSKKYLAHDPEEKYKLGDVVEIIESRPISKRKRFRVLRLVESG
RMDLVEKYLIRRQNYESLSKRGGKA
>P24321 ~~~rpsQ~~~Small ribosomal subunit protein uS17~~~
MPKKVLTGVVVSDKMQKTVTVLVERQFPHPLYGKVIKRSKKYLAHDPEEKYKLGDVVEIIESRPISKRKRFRVLRLVESG
RMDLVEKYLIRRQNYQSLSKRGGKA
>A0R549 ~~~rpsR1~~~Small ribosomal subunit protein bS18A~~~COG0238
MMAVKKSRKRTAATELKKPRRNQLEALGVTTIDYKDVAVLRTFLSERGKIRSRHVTGLTPQQQRQVATAIKNAREMALLP
MAGPR
>P9WH49 ~~~rpsR1~~~Small ribosomal subunit protein bS18A~~~COG0238
MAKSSKRRPAPEKPVKTRKCVFCAKKDQAIDYKDTALLRTYISERGKIRARRVTGNCVQHQRDIALAVKNAREVALLPFT
SSVR
>A0R7F7 ~~~rpsR2~~~Small ribosomal subunit protein bS18B~~~COG0238
MAKSNKRRPAPEKPVKTRKCVFCSKKGQTIDYKDTALLRTYISERGKIRARRVTGNCVQHQRDIAVAVKNAREVALLPFG
SSTR
>B7IBC2 ~~~rpsR~~~Small ribosomal subunit protein bS18~~~
MARFYRRRKFCRFTAENVAYIDYKDIDTLKQYITENGKIVPSRITGTKARYQRQLALAIKQARYLSLIPYTDNHK
>P21475 ~~~rpsR~~~Small ribosomal subunit protein bS18~~~COG0238
MAGGRRGGRAKRRKVCYFTSNGITHIDYKDVDLLKKFVSERGKILPRRVTGTNAKYQRKLTAAIKRARQMALLPYVSGE
>P0A7T7 ~~~rpsR~~~Small ribosomal subunit protein bS18~~~COG0238
MARYFRRRKFCRFTAEGVQEIDYKDIATLKNYITESGKIVPSRITGTRAKYQRQLARAIKRARYLSLLPYTDRHQ
>Q839Y8 ~~~rpsR~~~Small ribosomal subunit protein bS18~~~COG0238
MAQQRRGGRKRRKVDYIAANHIEYIDYKDTELLKRFISERGKILPRRVTGTGAKNQRKLTIAIKRARIMGLLPFVSDEQ
>P10806 ~~~rpsR~~~Small ribosomal subunit protein bS18~~~
MAGRKGGRGKRRKVCYFTANNITHIDYKDVDLLKKFISERGKILPRRVTGTSAKYQRKLTVAIKRARQMALLPYVADE
>P66459 ~~~rpsR~~~Small ribosomal subunit protein bS18~~~COG0238
MERKRYSKRYCKYTEAKISFIDYKDLDMLKHTLSERYKIMPRRLTGNSKKWQERVEVAIKRARHMALIPYIVDRKKVVDS
PFKQH
>A2RNZ2 ~~~rpsR~~~Small ribosomal subunit protein bS18~~~COG0238
MAQQRRGGFKRRKKVDFIAANKIEVVDYKDTELLKRFISERGKILPRRVTGTSAKNQRKVVNAIKRARVMALLPFVAEDQ
N
>P66461 ~~~rpsR~~~Small ribosomal subunit protein bS18~~~COG0238
MAGGRRGGRRRKKVCYFTSNGITHIDYKDVELLKKFVSERGKILPRRVTGTSAKYQRKLTVAIKRSRQMALLPFVAEEK
>P75541 ~~~rpsR~~~Small ribosomal subunit protein bS18~~~
MMNNEHDNFQKEVETTTETTFNREEGKRMVRPLFKRSKKYCRFCAIGQLRIDLIDDLEALKRFLSPYAKINPRRITGNCQ
MHQRHVAKALKRARYLALVPFVKD
>Q9HUN0 ~~~rpsR~~~Small ribosomal subunit protein bS18~~~
MARFFRRRKFCRFTAEGVKEIDYKDLNTLKAYVSETGKIVPSRITGTKAKYQRQLATAIKRARYLALLPYTDSHGR
>Q6N5A3 ~~~rpsR~~~Small ribosomal subunit protein bS18~~~COG0238
MAEAGARRPFFRRRKTCPFTGANAPKIDYKDSKLLMRYVSERGKIVPSRITAVSAKKQRELARAIKRARFLGLLPYVIR
>Q8ZK81 ~~~rpsR~~~Small ribosomal subunit protein bS18~~~
MARYFRRRKFCRFTAEGVQEIDYKDIATLKNYITESGKIVPSRITGTRAKYQRQLRRAIKRARYLSLLPYTDRHQ
>Q2G111 ~~~rpsR~~~Small ribosomal subunit protein bS18~~~COG0238
MAGGPRRGGRRRKKVCYFTANGITHIDYKDTELLKRFISERGKILPRRVTGTSAKYQRMLTTAIKRSRHMALLPYVKEEQ
>P66468 ~~~rpsR~~~Small ribosomal subunit protein bS18~~~
MAGGPRRGGRRRKKVCYFTANGITHIDYKDTELLKRFISERGKILPRRVTGTSAKYQRMLTTAIKRSRHMALLPYVKEEQ
>P62659 ~~~rpsR~~~Small ribosomal subunit protein bS18~~~COG0238
MSTKNAKPKKEAQRRPSRKAKVKATLGEFDLRDYRNVEVLKRFLSETGKILPRRRTGLSAKEQRILAKTIKRARILGLLP
FTEKLVRK
>Q5SLQ0 ~~~rpsR~~~Small ribosomal subunit protein bS18~~~COG0238
MSTKNAKPKKEAQRRPSRKAKVKATLGEFDLRDYRNVEVLKRFLSETGKILPRRRTGLSAKEQRILAKTIKRARILGLLP
FTEKLVRK
>P80382 ~~~rpsR~~~Small ribosomal subunit protein bS18~~~
MSTKNAKPKKEAQRRPSRKAKVKATLGEFDLRDYRNVEVLKRFLSETGKILPRRRTGLSGKEQRILAKTIKRARILGLLP
FTEKLVRK
>B7IA35 ~~~rpsS~~~Small ribosomal subunit protein uS19~~~
MPRSLKKGPFVDAHLFAKVEAAVASNSRKPIKTWSRRSMILPDFVGLTISVHNGRNHVPVIVTEHMVGHKLGEFAPTRTY
RGHGVDKKSKR
>P21476 ~~~rpsS~~~Small ribosomal subunit protein uS19~~~COG0185
MARSLKKGPFVDGHLMTKIEKLNETDKKQVVKTWSRRSTIFPQFIGHTIAVYDGRKHVPVFISEDMVGHKLGEFAPTRTY
KGHASDDKKTRR
>P0A7U3 ~~~rpsS~~~Small ribosomal subunit protein uS19~~~COG0185
MPRSLKKGPFIDLHLLKKVEKAVESGDKKPLRTWSRRSTIFPNMIGLTIAVHNGRQHVPVFVTDEMVGHKLGEFAPTRTY
RGHAADKKAKKK
>Q839G0 ~~~rpsS~~~Small ribosomal subunit protein uS19~~~COG0185
MGRSLKKGPFVDDHLMKKVEAQQGAEKKKVIKTWSRRSTIFPSFVGFTIAVYDGRKHVPVYIQEDMVGHKLGEFAPTRTY
RGHVADDKKTKR
>P12731 ~~~rpsS~~~Small ribosomal subunit protein uS19~~~
MGRSLKKGPFCDEHLMKKIEKLNETGQKQVIKTWSRRSTIFPQFVGHTIAVYDGRRHVPVYITEDMVGHKLGEFAPTATF
RGHAGDDKKTKR
>A2RNQ1 ~~~rpsS~~~Small ribosomal subunit protein uS19~~~COG0185
MGRSLKKGPFVDEHLMKKVEAQTNAERKSVIKTWSRRSTIFPNFVGLTIAVYDGRKHVPVYVQEDMVGHKLGEFAPTRTY
RGHAADDKKTRR
>P66484 ~~~rpsS~~~Small ribosomal subunit protein uS19~~~COG0185
MGRSLKKGPFVDDHLMKKVEAAAESEKKQVIKTWSRRSTIFPTFVGQTIAVYDGRKHVPVYVQEDMVGHKLGEFAPTRTY
RGHAGDDKKTKR
>P0A5X5 ~~~rpsS~~~Small ribosomal subunit protein uS19~~~
MPRSLKKGPFVDEHLLKKVDVQNEKNTKQVIKTWSRRSTIIPDFIGHTFAVHDGRKHVPVFVTESMVGHKLGEFAPTRTF
KGHIKDDRKSKRR
>P75576 ~~~rpsS~~~Small ribosomal subunit protein uS19~~~
MSRSAKKGAFVDAHLLKKVIDMNKQEKKRPIKTWSRRSTIFPEFVGNTFAVHNGKTFINVYVTDDMVGHKLGEFSPTRNF
KQHTANR
>A0QSD5 ~~~rpsS~~~Small ribosomal subunit protein uS19~~~COG0185
MPRSLKKGPFVDDHLLKKVDVQNEKNTKQVIKTWSRRSTIIPDFIGHTFAVHDGRKHVPVFVTEAMVGHKLGEFAPTRTF
KGHIKDDRKSKRR
>P9WH45 ~~~rpsS~~~Small ribosomal subunit protein uS19~~~COG0185
MPRSLKKGPFVDEHLLKKVDVQNEKNTKQVIKTWSRRSTIIPDFIGHTFAVHDGRKHVPVFVTESMVGHKLGEFAPTRTF
KGHIKDDRKSKRR
>Q9HWD9 ~~~rpsS~~~Small ribosomal subunit protein uS19~~~
MPRSLKKGPFIDLHLLKKVEVAVEKNDRKPIKTWSRRSMILPHMVGLTIAVHNGRQHVPVLVNEDMVGHKLGEFAATRTY
RGHAADKKAKR
>Q6N4T8 ~~~rpsS~~~Small ribosomal subunit protein uS19~~~COG0185
MVRSVWKGPFVEASLLKKADAARASGRHDVIKIWSRRSTILPQFVGLTFGVYNGQKHVPVSVNEEMVGHKFGEFSPTRTF
HGHAGDKKSKKG
>Q2FW10 ~~~rpsS~~~Small ribosomal subunit protein uS19~~~COG0185
MARSIKKGPFVDEHLMKKVEAQEGSEKKQVIKTWSRRSTIFPNFIGHTFAVYDGRKHVPVYVTEDMVGHKLGEFAPTRTF
KGHVADDKKTRR
>P66494 ~~~rpsS~~~Small ribosomal subunit protein uS19~~~
MARSIKKGPFVDEHLMKKVEAQEGSEKKQVIKTWSRRSTIFPNFIGHTFAVYDGRKHVPVYVTEDMVGHKLGEFAPTRTF
KGHVADDKKTRR
>P62660 ~~~rpsS~~~Small ribosomal subunit protein uS19~~~COG0185
MPRSLKKGVFVDDHLLEKVLELNAKGEKRLIKTWSRRSTIVPEMVGHTIAVYNGKQHVPVYITENMVGHKLGEFAPTRTY
RGHGKEAKATKKK
>Q5SHP2 ~~~rpsS~~~Small ribosomal subunit protein uS19~~~COG0185
MPRSLKKGVFVDDHLLEKVLELNAKGEKRLIKTWSRRSTIVPEMVGHTIAVYNGKQHVPVYITENMVGHKLGEFAPTRTY
RGHGKEAKATKKK
>P80381 ~~~rpsS~~~Small ribosomal subunit protein uS19~~~
MPRSLKKGVFVDDHLLEKVLELNAKGEKRLIKTWSRRSTIVPEMVGHTIAVYNGKQHVPVYITENMVGHKLGEFAPTRTY
RGHGKEAKATKKK
>P38494 ~~~ypfD~~~Small ribosomal subunit protein bS1 homolog~~~COG0539
MTEEMNQIDVQVPEVGDVVKGIVTKVEDKHVDVEIINVKQSGIIPISELSSLHVEKASDVVKVDDELDLKVTKVEDDALI
LSKRAVDADRAWEDLEKKFETKEVFEAEVKDVVKGGLVVDIGVRGFIPASLVEAHFVEDFTDYKGKTLSLLVVELDRDKN
RVILSHRAVVESEQANKKQELLQSLEVGSVLDGKVQRLTDFGAFVDIGGIDGLVHISQLSHSHVEKPSDVVEEGQEVKVK
VLSVDRDNERISLSIKDTLPGPWNQIGEKVKPGDVLEGTVQRLVSFGAFVEILPGVEGLVHISQISNKHIGTPHEVLEEG
QTVKVKVLDVNENEERISLSMRELEETPKADQEDYRQYQAKEETSTGFQLGDLIGDKLNKLK
>Q83E09 ~~~rpsA~~~Small ribosomal subunit protein bS1~~~COG0539
MSETFAELFEKSLTETDLRPGALVKATVVEVRPDRVIVNAGLKSEGIIPASEFRNEEPHVGDEFFVVIEASDNGFGETRL
SREKARRAKAWSELEKAYKAGEMVKGVIIERVKGGFTVDLNSVRAFLPGSLVDVKPVRDPGYLEDKEIDFKIIKMDQRRN
NVVVSRRAVMEAETSAERQARLEELQEGQEIKGVIKNITDYGAFVDLGGVDGLLHITDMAWGRVKHPSDLLNVGDEVHVK
VLKFDRDKKRVSLGMKQLADDPWAKIERRYPVNSRVFGKVTNITDYGCFVKLEEGVEGLVHTSELDWTNKNIHPSKVVQS
GEEVEVMVLEIDEERRRISLGIKQCKRNPWQEFAEKHEKDEKITGKVRSITDFGMFIGLEGDIDGLVHLSDISWTESGEE
AIRNYKKGDEVQAVILGIDPERERISLGIKQLEGDPFMEFVESYDKDAVIQAKVKEVESKQAVLELADQVLGQMRLADYT
YDRVKDLTQELNVGDEVAVKIVNVDRKNRLINVSHKAVEGRSEKGTRTVSDVPTKTTLGDLLKEKIQSKDE
>P0AG67 ~~~rpsA~~~Small ribosomal subunit protein bS1~~~COG0539
MTESFAQLFEESLKEIETRPGSIVRGVVVAIDKDVVLVDAGLKSESAIPAEQFKNAQGELEIQVGDEVDVALDAVEDGFG
ETLLSREKAKRHEAWITLEKAYEDAETVTGVINGKVKGGFTVELNGIRAFLPGSLVDVRPVRDTLHLEGKELEFKVIKLD
QKRNNVVVSRRAVIESENSAERDQLLENLQEGMEVKGIVKNLTDYGAFVDLGGVDGLLHITDMAWKRVKHPSEIVNVGDE
ITVKVLKFDRERTRVSLGLKQLGEDPWVAIAKRYPEGTKLTGRVTNLTDYGCFVEIEEGVEGLVHVSEMDWTNKNIHPSK
VVNVGDVVEVMVLDIDEERRRISLGLKQCKANPWQQFAETHNKGDRVEGKIKSITDFGIFIGLDGGIDGLVHLSDISWNV
AGEEAVREYKKGDEIAAVVLQVDAERERISLGVKQLAEDPFNNWVALNKKGAIVTGKVTAVDAKGATVELADGVEGYLRA
SEASRDRVEDATLVLSVGDEVEAKFTGVDRKNRAISLSVRAKDEADEKDAIATVNKQEDANFSNNAMAEAFKAAKGE
>P50889 ~~~rps1~~~Small ribosomal subunit protein bS1~~~COG0539
MAHALKRILYATWYPDILVNYTHSVNCRRTLDVMSETNNEFLAALESAADQIKVGDVVTGELLAIDNDNQAVVGLSTGEE
GVVPAREYSDDRNINLADELKIGDTIEAVVISNVTSDKEGVAYLLSKKRLDARKAWENLSFAEGDTVDAKVINAVRGGLI
VDVNGVRGFVPASMVAERFVSDLNQFKNKDIKAQVIEIDPANARLILSRKAVAAQELAAQLAEVFSKLSVGEVVEGTVAR
LTDFGAFVDLGGVDGLVHVSEISHDRVKNPADVLTKGDKVDVKILALDTEKGRISLSIKATQRGPWDEAADQIAAGSVLE
GTVKRVKDFGAFVEILPGIEGLVHVSQISNKRIENPSEVLKSGDKVQVKVLDIKPAEERISLSMKALEEKPEREDRRGND
GSASRADIAAYKQQDDSAATLGDIFGDKL
>P46836 ~~~rpsA~~~Small ribosomal subunit protein bS1~~~COG0539
MSIPAVPSPQIAVNDVGSSEDFLAAIDKTIKYFNDGDIVEGTIVKVDRDEVLLDIGYKTEGVIPARELSIKHDVDPNEVV
SVGDEVEALVLTKEDKEGRLILSKKRAQYERAWGTIEALKEKDEAVKGIVIEVVKGGLILDIGLRGFLPASLVEMRRVRD
LQPYIGKEIEAKIIELDKNRNNVVLSRRAWLEQTQSEVRSEFLNQLQKGAIRKGVVSSIVNFGAFVDLGGVDGLVHVSEL
SWKHIDHPSEVVQVGNEVTVEVLDVDMDRERVSLSLKATQEDPWRHFARTHAIGQIVPGKVTKLVPFGAFVRVEEGIEGL
VHISELAERHVEVPDQVVAVGDDAMVKVIDIDLERRRISLSLKQANEDYIEEFDPAKYGMADSYDEQGNYIFPEGFDPDS
NEWLEGFDTQRAEWEARYAEAERRYKMHTIQMEKFAATEEAGHGSSEQPPASSTPSAKATGGSLASDAQLAALREKLAGS
A
>A0QYY6 ~~~rpsA~~~Small ribosomal subunit protein bS1~~~COG0539
MPSPSVTSPQVAVNDIGSAEDFLAAIDKTIKYFNDGDIVEGTIVKVDRDEVLLDIGYKTEGVIPSRELSIKHDVDPNEVV
SVGDEVEALVLTKEDKEGRLILSKKRAQYERAWGTIEELKEKDEAVKGTVIEVVKGGLILDIGLRGFLPASLVEMRRVRD
LQPYIGKEIEAKIIELDKNRNNVVLSRRAWLEQTQSEVRSEFLNQLQKGAIRKGVVSSIVNFGAFVDLGGVDGLVHVSEL
SWKHIDHPSEVVQVGDEVTVEVLDVDMDRERVSLSLKATQEDPWRHFARTHAIGQIVPGKVTKLVPFGAFVRVEEGIEGL
VHISELSERHVEVPDQVVQVGDDAMVKVIDIDLERRRISLSLKQANEDYTEEFDPSKYGMADSYDEQGNYIFPEGFDPET
NEWLEGFDKQREEWEARYAEAERRHKMHTAQMEKFAAAEAEAANAPVSNGSSRSEESSGGTLASDAQLAALREKLAGNA
>P9WH43 ~~~rpsA~~~Small ribosomal subunit protein bS1~~~COG0539
MPSPTVTSPQVAVNDIGSSEDFLAAIDKTIKYFNDGDIVEGTIVKVDRDEVLLDIGYKTEGVIPARELSIKHDVDPNEVV
SVGDEVEALVLTKEDKEGRLILSKKRAQYERAWGTIEALKEKDEAVKGTVIEVVKGGLILDIGLRGFLPASLVEMRRVRD
LQPYIGKEIEAKIIELDKNRNNVVLSRRAWLEQTQSEVRSEFLNNLQKGTIRKGVVSSIVNFGAFVDLGGVDGLVHVSEL
SWKHIDHPSEVVQVGDEVTVEVLDVDMDRERVSLSLKATQEDPWRHFARTHAIGQIVPGKVTKLVPFGAFVRVEEGIEGL
VHISELAERHVEVPDQVVAVGDDAMVKVIDIDLERRRISLSLKQANEDYTEEFDPAKYGMADSYDEQGNYIFPEGFDAET
NEWLEGFEKQRAEWEARYAEAERRHKMHTAQMEKFAAAEAAGRGADDQSSASSAPSEKTAGGSLASDAQLAALREKLAGS
A
>Q9JZ44 ~~~rpsA~~~Small ribosomal subunit protein bS1~~~
MSMENFAQLLEESFTLQEMNPGEVITAEVVAIDQNFVTVNAGLKSESLIDVAEFKNAQGEIEVKVGDFVTVTIESVENGF
GETKLSREKAKRAADWIALEEAMENGDILSGIINGKVKGGLTVMISSIRAFLPGSLVDVRPVKDTSHFEGKEIEFKVIKL
DKKRNNVVVSRRAVLEATLGEERKALLENLQEGSVIKGIVKNITDYGAFVDLGGIDGLLHITDLAWRRVKHPSEVLEVGQ
EVEAKVLKFDQEKQRVSLGMKQLGEDPWSGLTRRYPQGTRLFGKVSNLTDYGAFVEIEQGIEGLVHVSEMDWTNKNVHPS
KVVQLGDEVEVMILEIDEGRRRISLGMKQCQANPWEEFAANHNKGDKISGAVKSITDFGVFVGLPGGIDGLVHLSDLSWT
ESGEEAVRKYKKGEEVEAVVLAIDVEKERISLGIKQLEGDPFGNFISVNDKGSLVKGSVKSVDAKGAVIALSDEVEGYLP
ASEFAADRVEDLTTKLKEGDEVEAVIVTVDRKNRSIKLSVKAKDAKESREALNSVNAAANANAGTTSLGDLLKAKLSGEQ
E
>Q6NDP1 ~~~rpsA~~~Small ribosomal subunit protein bS1~~~COG0539
MASTDTYNPTRDDFAAMLDESFAGGNLQESSVIKGKVVAIEKDMAVIDVGLKTEGRVPLREFAGPGRDNEIKVGDTVEVF
LDRIENALGEAVLSRDKARREESWGKLEKAFQNNEKVFGVIFNQVKGGFTVDLDGAVAFLPRSQVDIRPIRDVAPLMNNS
QPFQILKMDRRRGNIVVSRRTVLEETRAEQRQELVQNLEEGQVIDGVVKNITDYGAFVDLGGIDGLLHVTDIAWRRVNHP
TEVLTIGQTVKVKIIKINHETHRISLGMKQLLDDPWQGIEAKYPLNARFTGRVTNITDYGAFVELEPGIEGLIHVSEMSW
TKKNMHPGKIVSTSQEVEVQVLEVDSVKRRISLGLKQTMRNPWEVFVEKHPVGSTVEGEVKNKTEFGLFLGLDGDVDGMV
HLSDLDWKLPGEQVIDNFKKGDMVKAVVLDVDVEKERISLGVKQLEGDPFAEPGDVKKGAVVTCEVLDVKESGIDVQIVG
TDFNTFIKRSELARDRNDQRSDRFAVGEKVDARVIQFDKKARKVQVSIKALEVAEEKEAIAQYGSSDSGATLGDILGTAL
KQRDK
>Q7A5J0 ~~~rpsA~~~Small ribosomal subunit protein bS1~~~
MTEEFNESMINDIKEGDKVTGEVQQVEDKQVVVHINGGKFNGIIPISQLSTHHIDSPSEVVKEGDEVEAYVTKVEFDEEN
ETGAYILSRRQLETEKSYSYLQEKLDNNEIIEAKVTEVVKGGLVVDVGQRGFVPASLISTDFIEDFSVFDGQTIRIKVEE
LDPENNRVILSRKAVEQEENDAKKDQLLQSLNEGDVIHGKVARLTQFGAFIDIGGVDGLVHVSELSHEHVQTPEEVVSIG
QDVKVKIKSIDRDTERISLSIKDTLPTPFENIKGQFHENDVIEGVVVRLANFGAFVEIAPGVQGLVHISEIAHKHIGTPG
EVLEPGQQVNVKILGIDEENERVSLSIKATLPNEDVVESDPSTTKAYLENEEEDNPTIGDMIGDKLKNLKL
>Q8CWR9 ~~~rpsA~~~Small ribosomal subunit protein bS1~~~COG0539
MNEFEDLLNSVSQVETGDVVSAEVLTVDATQANVAISGTGVEGVLTLRELTNDRDADINDFVKVGEVLDVLVLRQVVGKD
TDTVTYLVSKKRLEARKAWDKLVGREEEVVTVKGTRAVKGGLSVEFEGVRGFIPASMLDTRFVRNAERFVGQEFDTKIKE
VNAKENRFILSRREVVEAATAAARAEVFGKLAVGDVVTGKVARITSFGAFIDLGGVDGLVHLTELSHERNVSPKSVVTVG
EEIEVKILDLNEEEGRVSLSLKATVPGPWDGVEQKLAKGDVVEGTVKRLTDFGAFVEVLPGIDGLVHVSQISHKRIENPK
EALKVGQEVQVKVLEVNADAERVSLSIKALEERPAQEEGQKEEKRAARPRRPRRQEKRDFELPETQTGFSMADLFGDIEL
>P46228 ~~~rpsA~~~Small ribosomal subunit protein bS1~~~COG0539
MVTQDIPAVDIGFTHEDFAALLDQYDYHFNPGDTVVGTVFNLEPRGALIDIGAKTAAFLPVQEMSINRVESPEEVLQPSE
MREFFILSDENEDGQLTLSIRRIEYMRAWERVRQLQTEDATVRSEVFATNRGGALVRIEGLRGFIPGSHISTRKAKEDLV
GEELPLKFLEVDEDRNRLVLSHRRALVERKMNRLEVGEVVVGAVRGIKPYGAFIDIGGVSGLLHISEISHDHIETPHSVF
NVNDEVKVMIIDLDAERGRISLSTKQLEPEPGDMVRNPEVVYEKAEEMAAQYREKLKQQAEGLVVTE
>B7I5N9 ~~~rpsT~~~Small ribosomal subunit protein bS20~~~
MANSAQAKKRARQNVKARKHNASLRSMVRTYIKRTLSAIAGGDYAVATEAYKKAVPVIDRMADKGIIHKNKAARHKSRLN
AQVKALAN
>P21477 ~~~rpsT~~~Small ribosomal subunit protein bS20~~~COG0268
MPNIKSAIKRTKTNNERRVHNATIKSAMRTAIKQVEASVANNEADKAKTALTEAAKRIDKAVKTGLVHKNTAARYKSRLA
KKVNGLSA
>P0A7U7 ~~~rpsT~~~Small ribosomal subunit protein bS20~~~COG0268
MANIKSAKKRAIQSEKARKHNASRRSMMRTFIKKVYAAIEAGDKAAAQKAFNEMQPIVDRQAAKGLIHKNKAARHKANLT
AQINKLA
>Q831Q7 ~~~rpsT~~~Small ribosomal subunit protein bS20~~~COG0268
MPNIESAIKRVRTSANANAKNSSQTNAMRTAIKKFEEAVAAGADNVDALYNEAVKAVDMAATKGLIHKNKANRDKIRLSK
LAK
>A2RMG0 ~~~rpsT~~~Small ribosomal subunit protein bS20~~~COG0268
MANIKSAIKRAELNKIANERNAQQKSAMRTLIKKFEAAPSEELYRAASSTIDKAASKGLIHANKASRDKARLAAKLG
>P66503 ~~~rpsT~~~Small ribosomal subunit protein bS20~~~COG0268
MPNIKSAIKRVKTAETRNSRNASQRSAMRTAIKKFDEAAANNADNAKDLYVEASKKLDSAVSKGLIHKNNAARNKSRLAA
KLAK
>P75237 ~~~rpsT~~~Small ribosomal subunit protein bS20~~~
MANIKSNEKRLRQNIKRNLNNKGQKTKLKTNVKNFHKEINLDNLGNVYSQADRLARKGIISTNRARRLKSRNVAVLNKTQ
VTAVEGK
>A0R102 ~~~rpsT~~~Small ribosomal subunit protein bS20~~~COG0268
MANIKSQIKRIRTNERRRLRNQSVKSSLRTAIRGFREAVDAGDKDKASELLHATSRKLDKAASKGVIHPNQAANKKSALA
LALNKL
>P9WH41 ~~~rpsT~~~Small ribosomal subunit protein bS20~~~COG0268
MANIKSQQKRNRTNERARLRNKAVKSSLRTAVRAFREAAHAGDKAKAAELLASTNRKLDKAASKGVIHKNQAANKKSALA
QALNKL
>Q9HVM1 ~~~rpsT~~~Small ribosomal subunit protein bS20~~~
MANTPSAKKRAKQAEKRRSHNASLRSMVRTYIKNVVKAIDAKDLEKAQAAFTAAVPVIDRMADKGIIHKNKAARHKSRLS
GHIKALSTAAA
>Q6N0C7 ~~~rpsT~~~Small ribosomal subunit protein bS20~~~COG0268
MANTSSAKKATRKIARRTAVNKSRRTQMRGSVRIVEEAIASGDRDAALKAMARAEPELMRAAQRNIIHRNAASRKVSRLT
HSIAKLAK
>Q2FXY6 ~~~rpsT~~~Small ribosomal subunit protein bS20~~~COG0268
MANIKSAIKRVKTTEKAEARNISQKSAMRTAVKNAKTAVSNNADNKNELVSLAVKLVDKAAQSNLIHSNKADRIKSQLMT
ANK
>Q2YT41 ~~~rpsT~~~Small ribosomal subunit protein bS20~~~
MANIKSAIKRVKTTEKAEARNISQKSAMRTAVKNAKTAVSNNADNKNELVSLAVKLVDKAAQSNLIHSNKADRIKSQLMT
ANK
>Q7A5C0 ~~~rpsT~~~Small ribosomal subunit protein bS20~~~
MANIKSAIKRVKTTEKAEARNISQKSAMRTAVKNAKTAVSNNADNKNELVSLAVKLVDKAAQSNLIHSNKADRIKSQLMT
ANK
>P62661 ~~~rpsT~~~Small ribosomal subunit protein bS20~~~COG0268
MAQKKPKRNLSALKRHRQSLKRRLRNKAKKSAIKTLSKKAVQLAQEGKAEEALKIMRKAESLIDKAAKGSTLHKNAAARR
KSRLMRKVRQLLEAAGAPLIGGGLSA
>P80380 ~~~rpsT~~~Small ribosomal subunit protein bS20~~~COG0268
MAQKKPKRNLSALKRHRQSLKRRLRNKAKKSAIKTLSKKAIQLAQEGKAEEALKIMRKAESLIDKAAKGSTLHKNAAARR
KSRLMRKVRQLLEAAGAPLIGGGLSA
>B7I2K7 ~~~rpsU~~~Small ribosomal subunit protein bS21~~~
MPQVKLKEGEPVDVAIRRFKRSCEKAGVLADVRKREFYEKPTQERKRKKAAAVKRYQKKLARESVRTTRLY
>P0A4B8 ~~~rpsU~~~Small ribosomal subunit protein bS21~~~
MTQIVVGENEHIESALRRFKREVSKAGIFQDMRKHRHFETPIEKSKRKKLALHKQSKRRFRT
>P21478 ~~~rpsU~~~Small ribosomal subunit protein bS21~~~COG0828
MSKTVVRKNESLEDALRRFKRSVSKTGTLQEARKREFYEKPSVKRKKKSEAARKRKF
>A8APV6 ~~~rpsU~~~Small ribosomal subunit protein bS21~~~
MPVIKVRENEPFDVALRRFKRSCEKAGVLAEVRRREFYEKPTTERKRAKASAVKRHAKKLARENARRTRLY
>P68679 ~~~rpsU~~~Small ribosomal subunit protein bS21~~~COG0828
MPVIKVRENEPFDVALRRFKRSCEKAGVLAEVRRREFYEKPTTERKRAKASAVKRHAKKLARENARRTRLY
>P23829 ~~~rpsU~~~Small ribosomal subunit protein bS21~~~
MSKTIVRKNESIDDALRRFKRAVSKTGTLQEVRKREFYEKPSVRRKKKSEAARKRK
>A2RHW9 ~~~rpsU~~~Small ribosomal subunit protein bS21~~~COG0828
MSKTLVRKNESLDDALRRFKRSVTKAGTLQELRKREHYEKPSVKRKRKSEAARKRKKY
>P57079 ~~~rpsU~~~Small ribosomal subunit protein bS21~~~
MPKIEVKNDDLELALKKFKRVSLEIRRLAQRHEYHLRKGMRLREKRKIAQKKRRKFRNMV
>P66519 ~~~rpsU~~~Small ribosomal subunit protein bS21~~~
MPAIRVKENEPFEVAMRRFKRAVEKTGLLTELRAREAYEKPTTERKRKKAAAVKRLQKRLRSQQLPPKMY
>Q9I5V8 ~~~rpsU~~~Small ribosomal subunit protein bS21~~~
MPAVKVKENEPFDVALRRFKRSCEKAGVLAEVRSREFYEKPTAERKRKAAAAVKRHAKKVQREQRRRERLY
>Q6N274 ~~~rpsU~~~Small ribosomal subunit protein bS21~~~COG0828
MQVLVRDNNVDQALKALKKKMQREGIFREMKLRGHYEKPSEKKAREKAEAVRRARKLARKKLQREGLLPSKPKPAFGADR
RPSAAAR
>Q2FXZ7 ~~~rpsU~~~Small ribosomal subunit protein bS21~~~COG0828
MSKTVVRKNESLEDALRRFKRSVSKSGTIQEVRKREFYEKPSVKRKKKSEAARKRKFK
>Q2YT04 ~~~rpsU~~~Small ribosomal subunit protein bS21~~~
MSKTVVRKNESLEDALRRFKRSVSKSGTIQEVRKREFYEKPSVKRKKKSEAARKRKFK
>P66521 ~~~rpsU~~~Small ribosomal subunit protein bS21~~~
MSKTVVRKNESLEDALRRFKRSVSKSGTIQEVRKREFYEKPSVKRKKKSEAARKRKFK
>P21464 ~~~rpsB~~~Small ribosomal subunit protein uS2~~~COG0052
MSVISMKQLLEAGVHFGHQTRRWNPKMKRYIFTERNGIYIIDLQKTVKKVEEAYNFTKNLAAEGGKILFVGTKKQAQDSV
KEEAQRSGMYYVNQRWLGGTLTNFETIQKRIKRLKDIEKMQENGTFDVLPKKEVVQLKKELERLEKFLGGIKDMKDLPDA
LFIIDPRKERIAVAEARKLNIPIIGIVDTNCDPDEIDVVIPANDDAIRAVKLLTSKMADAILEAKQGEEEAEVAEETAPE
TETTTA
>P0A7V0 ~~~rpsB~~~Small ribosomal subunit protein uS2~~~COG0052
MATVSMRDMLKAGVHFGHQTRYWNPKMKPFIFGARNKVHIINLEKTVPMFNEALAELNKIASRKGKILFVGTKRAASEAV
KDAALSCDQFFVNHRWLGGMLTNWKTVRQSIKRLKDLETQSQDGTFDKLTKKEALMRTRELEKLENSLGGIKDMGGLPDA
LFVIDADHEHIAIKEANNLGIPVFAIVDTNSDPDGVDFVIPGNDDAIRAVTLYLGAVAATVREGRSQDLASQAEESFVEA
E
>Q831U9 ~~~rpsB~~~Small ribosomal subunit protein uS2~~~COG0052
MAVISMKQLLEAGVHFGHQTRRWNPKMKKYIFTERNGIYIIDLQKTVKLVDAAYDYMKNVAEEGGVALFVGTKKQAQEAI
KDEAIRAGQYYVNHRWLGGTLTNWDTIQKRIARLKKINAMEEDGTFEVLPKKEVAGLNKERERLEKFLGGIADMPRIPDV
MYIVDPRKERIAVQEAHKLNIPIVAMVDTNCDPDEIDVVIPSNDDAIRAVKLITAKMADAFIEGNQGEDQATEELFVEET
PEATSIEEIVDVVEGNNESAE
>P81289 ~~~rpsB~~~Small ribosomal subunit protein uS2~~~
MSVISMKQLLEAGVHFGWQTRRWNPKMKKYIFTERNGIYLIDIQKTVKKVEEAYNFVRELAANGGKILFVGTKRQAQESV
KEEAERCGMFYVNQRRIGGTLTNFATLQKRIKRLREIEKMEEDGVFDVLPKKEVIGLKKEKERLEKFIGGIKDMKELPDA
LFVIDPRKERIAVAEARKLNIPIIGIVDTNNDPCEGGYVIPANDDAIRAVKLLTSKIADAVLEAKQGEEAAVAAE
>P44371 ~~~rpsB~~~Small ribosomal subunit protein uS2~~~COG0052
MAQVSMRDMINAGVHFGHQTRYWNPQMKPFIFGARNGVHIINLEKTLPLFNEALAELTRIASNNGKVLFVGTKRAASEAV
QAAALDCQQYYVNHRWLGGMLTNWKTVRQSIKRLKDLETQSQDGTFDKLTKKEALMRSREMEKLELSLGGIKDMGGLPDA
LFVIGADHEHIAVKEANNLGIPVFAIVDTNSTPAGVDFVIPGNDDATRAIQLYVSAAAAAVKEGRGNEAQVAEELAADAE
>A2RNV0 ~~~rpsB~~~Small ribosomal subunit protein uS2~~~COG0052
MSVISMKQLLEAGVHFGHQTRRWNPKMKPYIFTERNGIHVIDLQKTVKLVDDAYNYVKNASQEGAVVLFVGTKKQAAEAV
KEEALRAGQYYVNHRWLGGMLTNWNTIQTRVTRLKEINKMEEEGTFEVLPKKEVVLLNKERERLEKFIGGIADMPRIPDV
MYIVDPHAEQIAVKEAKTLGIPVVAMVDTNADPEPIDVVIPANDDAIRAVKLITAKMADAIIEGRQGEDAAEDFVAEEAA
SEESLEELAEIVEGK
>Q8Y6M6 ~~~rpsB~~~Small ribosomal subunit protein uS2~~~COG0052
MPVISMKQLLEAGVHFGHQTRRWNPKMKKYIFTERNGIYIIDLQKTVKKVDEAFNFMREVASDNGTILFVGTKKQAQESV
RDEAIRSGQYFVNHRWLGGTLTNFETIQKRIQHLKKIERMEADGTFEVLPKKEVVLLKKEQEKLERFLGGIKDMKGLPDA
LFIVDPRKERIAVAEARKLHIPIIGIVDTNCDPDEIDYVIPANDDAIRAVKLLTAKMADAIIEVNQGEELTEAEVAPVEE
KATEETTEA
>P75560 ~~~rpsB~~~Small ribosomal subunit protein uS2~~~
MSELITTPVETTAKAELVSLAKLGEMRTHVGMVKRYWNPKMGFFIEPERKHNNDHFVLELQRQSLQTAYNYVKEVAQNNG
QILFVGTKNDYVKKLVNNIAKRVDVAFITQRWLGGTLTNFKTLSISINKLNKLVEKQAENAADLTKKENLMLSREIERLE
KFFGGVKSLKRLPNLLIVDDPVYEKNAVAEANILRIPVVALCNTNTNPELVDFIIPANNHQPQSTCLLMNLLADAVAEAK
AMPTMFAYKPDEEIQIEIPQKQEAPRQVVNRANSKQITSQRLNITRNPEVLTRE
>A0QVB8 ~~~rpsB~~~Small ribosomal subunit protein uS2~~~COG0052
MAVVTMKQLLDSGAHFGHQTRRWNPKMKRFIFTDRNGIYIIDLQQTLTYIDKAYEFVKETVAHGGTVLFVGTKKQAQESI
AEEATRVGMPYVNQRWLGGMLTNFSTVHKRLQRLKELEAMEQTGGFEGRTKKEILMLTREKNKLERSLGGIRDMQKVPSA
VWVVDTNKEHIAVGEARKLGIPVIAILDTNCDPDVVDYPIPGNDDAIRSAALLTKVIASAVAEGLQARAGQGSGEKPAEG
AEPLAEWEQELLAGATAGAADASAEGAAAPESSTDAS
>P9WH39 ~~~rpsB~~~Small ribosomal subunit protein uS2~~~COG0052
MAVVTMKQLLDSGTHFGHQTRRWNPKMKRFIFTDRNGIYIIDLQQTLTFIDKAYEFVKETVAHGGSVLFVGTKKQAQESV
AAEATRVGMPYVNQRWLGGMLTNFSTVHKRLQRLKELEAMEQTGGFEGRTKKEILGLTREKNKLERSLGGIRDMAKVPSA
IWVVDTNKEHIAVGEARKLGIPVIAILDTNCDPDEVDYPIPGNDDAIRSAALLTRVIASAVAEGLQARAGLGRADGKPEA
EAAEPLAEWEQELLASATASATPSATASTTALTDAPAGATEPTTDAS
>P66540 ~~~rpsB~~~Small ribosomal subunit protein uS2~~~
MSQITMRQMIEAGVHFGHQTRFWNPKMAQYIFGARNKIHIVNLEKTLPMFQDAQEAVRRLVANKGTVLFVGTKRQARDII
REEATRAGMPFVDYRWLGGMLTNYKTVKQSIKRLEEKTAALENAAESGFSKKEILEMQRDVEKLERSLGGIKNMKGLPDA
IFVIDTGYQKGTLVEAEKLGIPVIAVVDTNNSPDGVKYVIPGNDDSAKAIRLYCRGIADAVLEGKNQALQETVAAAQEAA
AE
>O82850 ~~~rpsB~~~Small ribosomal subunit protein uS2~~~
MSQVNMRDMLKAGVHFGHQTRYWNPKMGKFIFGARNKIHIINLEKTLPMFNEALTFVERLAAGKNKILFVGTKRSAGKIV
REEAARCGMPYVDHRWLGGMLTNYKTIRQSIKRLRDLETQSQDGTFDKLTKKEALMRSRDLEKLERSLGGIKDMGGLPDA
LFVIDVDHERIAITEANKLGIPVIGVVDTNSSPEGVDYVIPGNDDAIRAVQLYLNSMAEAVIRGKQGAATSADEFVEEAP
AESAEG
>Q6N5Q2 ~~~rpsB~~~Small ribosomal subunit protein uS2~~~COG0052
MSLPEFSMRQLLEAGVHFGHQSHRWNPKMADYIFGVRNNIHIVDLTQTVPLLHRALQAISDTVAKGGRVLFVGTKRQAQD
AVADAAKRSAQYFVNSRWLGGTLTNWKTISGSIRRLRHLEDVLSSADANAYTKKERLELQRERDKLNRSLGGIKDMGGLP
DLIFVIDTNKEDIAIQEAQRLGIPVAAIVDTNCDPKGITYLVPGNDDAGRAIALYCDLVARAVIDGISRAQGDVGIDIGA
AAQPLREDLPAAQATTFQGLPGPRGTPDDLKKLPGVSGAIEKKFNDLGIFHFWQLAELDQATAHQIGEELGLPSRADAWV
AQAKSLTAEAE
>Q2FZ25 ~~~rpsB~~~Small ribosomal subunit protein uS2~~~COG0052
MAVISMKQLLEAGVHFGHQTRRWNPKMKKYIFTERNGIYIIDLQKTVKKVDEAYNFLKQVSEDGGQVLFVGTKKQAQESV
KSEAERAGQFYINQRWLGGLLTNYKTISKRIKRISEIEKMEEDGLFEVLPKKEVVELKKEYDRLIKFLGGIRDMKSMPQA
LFVVDPRKERNAIAEARKLNIPIVGIVDTNCDPDEIDYVIPANDDAIRAVKLLTAKMADAILEGQQGVSNEEVAAEQNID
LDEKEKSEETEATEE
>Q2YXL2 ~~~rpsB~~~Small ribosomal subunit protein uS2~~~
MAVISMKQLLEAGVHFGHQTRRWNPKMKKYIFTERNGIYIIDLQKTVKKVDEAYNFLKQVSEDGGQVLFVGTKKQAQESV
KSEAERAGQFYINQRWLGGLLTNYKTISKRIKRISEIEKMEEDGLFEVLPKKEVVELKKEYDRLIKFLGGIRDMKSMPQA
LFVVDPRKERNAIAEARKLNIPIVGIVDTNCDPDEIDYVIPANDDAIRAVKLLTAKMADAILEGQQGVSNEEVAAEQNID
LDEKEKSEETEATEE
>P66544 ~~~rpsB~~~Small ribosomal subunit protein uS2~~~
MAVISMKQLLEAGVHFGHQTRRWNPKMKKYIFTERNGIYIIDLQKTVKKVDEAYNFLKQVSEDGGQVLFVGTKKQAQESV
KSEAERAGQFYINQRWLGGLLTNYKTISKRIKRISEIEKMEEDGLFEVLPKKEVVELKKEYDRLIKFLGGIRDMKSMPQA
LFVVDPRKERNAIAEARKLNIPIVGIVDTNCDPDEIDYVIPANDDAIRAVKLLTAKMADAILEGQQGVSNEEVAAEQNID
LDEKEKSEETEATEE
>Q5X9J9 ~~~rpsB~~~Small ribosomal subunit protein uS2~~~
MAVISMKQLLEAGVHFGHQTRRWNPKMAKYIFTERNGIHVIDLQQTVKLADQAYEFVRDAAANDAVILFVGTKKQAAEAV
ADEATRAGQYFINHRWLGGTLTNWGTIQKRIARLKEIKRMEEEGTFDVLPKKEVALLNKQRARLEKFLGGIEDMPRIPDV
MYVVDPHKEQIAVKEAKKLGIPVVAMVDTNADPDDIDIIIPANDDAIRAVKLITAKLADAIIEGRQGEDADVAFEADTQA
DSIEDIVEVVEGDNA
>P62662 ~~~rpsB~~~Small ribosomal subunit protein uS2~~~COG0052
MPVEITVKELLEAGVHFGHERKRWNPKFARYIYAERNGIHIIDLQKTMEELERTFRFIEDLAMRGGTILFVGTKKQAQDI
VRMEAERAGMPYVNQRWLGGMLTNFKTISQRVHRLEELEALFASPEIEERPKKEQVRLKHELERLQKYLSGFRLLKRLPD
AIFVVDPTKEAIAVREARKLFIPVIALADTDSDPDLVDYIIPGNDDAIRSIQLILSRAVDLIIQARGGVVEPSPSYALVQ
EAEATETPEGESEVEA
>P80371 ~~~rpsB~~~Small ribosomal subunit protein uS2~~~COG0052
MPVEITVKELLEAGVHFGHERKRWNPKFARYIYAERNGIHIIDLQKTMEELERTFRFIEDLAMRGGTILFVGTKKQAQDI
VRMEAERAGMPYVNQRWLGGMLTNFKTISQRVHRLEELEALFASPEIEERPKKEQVRLKHELERLQKYLSGFRLLKRLPD
AIFVVDPTKEAIAVREARKLFIPVIALADTDSDPDLVDYIIPGNDDAIRSIQLILSRAVDLIIQARGGVVEPSPSYALVQ
EAEATETPEGESEVEA
>B7IA33 ~~~rpsC~~~Small ribosomal subunit protein uS3~~~
MGQKVHPIGIRLGVVKRHNANWYANPKQYAEYLLKDLQVREFLTKNLKNAMVSNILIERPSGAAKVTISTARPGIVIGKK
GEDIEKLQRELTNIMGVPAQVSINEIDRPDLDARLVAEAIASQLEKRVMFRRAMKRAVQNTMRAGAKGIKVEVSGRLGGA
EIARTEWYREGRVPLHTLRADIDYATMRAETTYGTIGVKVWIFRGEILGGMKQVMNPAPAEERPAKRGRGRGEGQERRGR
RGDRAADKGE
>P21465 ~~~rpsC~~~Small ribosomal subunit protein uS3~~~COG0092
MGQKVNPVGLRIGVIRDWESKWYAGKDYADFLHEDLKIREYISKRLSDASVSKVEIERAANRVNITIHTAKPGMVIGKGG
SEVEALRKALNSLTGKRVHINILEIKRADLDAQLVADNIARQLENRVSFRRAQKQQIQRTMRAGAQGVKTMVSGRLGGAD
IARSEYYSEGTVPLHTLRADIDYATSEADTTYGKLGVKVWIYRGEVLPTKKKNEEGGK
>P0A7V3 ~~~rpsC~~~Small ribosomal subunit protein uS3~~~COG0092
MGQKVHPNGIRLGIVKPWNSTWFANTKEFADNLDSDFKVRQYLTKELAKASVSRIVIERPAKSIRVTIHTARPGIVIGKK
GEDVEKLRKVVADIAGVPAQINIAEVRKPELDAKLVADSITSQLERRVMFRRAMKRAVQNAMRLGAKGIKVEVSGRLGGA
EIARTEWYREGRVPLHTLRADIDYNTSEAHTTYGVIGVKVWIFKGEILGGMAAVEQPEKPAAQPKKQQRKGRK
>Q839F8 ~~~rpsC~~~Small ribosomal subunit protein uS3~~~COG0092
MGQKVHPIGMRVGIIRDWDAKWYAEKEYAEFLHEDLRIRKFIATKLADAAVSTIEIERAANRVNISIHTAKPGMVIGKGG
SEVENLRKELNKLTGKRVHINIVEIKKPDLDAKLVGEGIARQLENRVAFRRAQKQAIQRAMRAGAKGIKTQVSGRLNGAD
IARSEGYSEGTVPLHTLRADIDYAWEEADTTYGKLGVKVWIYRGEILPTKKNTEKGGK
>P23309 ~~~rpsC~~~Small ribosomal subunit protein uS3~~~
MGQKVNPIGLRIGIIRDWESRWYAEKDYADLVHEDLKIREYINKRLQDAAVSRVEIERAANRVNVTIHTAKPGMVIGKGG
SEVEALRKALTQLTGKREHINIVEIKKPDLDAKLVAENIARQLENRVSFRRAQKQAIQRAMRPGRKGVKTMVVRRLGGAE
IARSEHYSEGTVPLHTLRADIDYATAEADTTYGKIGVKVWIYRGEVLPTKKKAEEGGK
>A2RNP9 ~~~rpsC~~~Small ribosomal subunit protein uS3~~~COG0092
MGQKVHPIGMRVGVIRDWDAKWYAEKEYADYLHEDLAIRQLIQTKLADASVSLIETERAINKVIVTLHTAKPGMVIGKSG
ANVDALRAELNKLTGKQVHINIVEIKKPDLDAHLVGEGIAKQLEARIAFRRAQKQAIQRAMRAGAKGIKTQVSGRLNGAD
IARAEGYSEGTVPLHTLRADIDYAWEEADTTYGKLGVKVWIYRGEVLPTKKSVKGEK
>P66548 ~~~rpsC~~~Small ribosomal subunit protein uS3~~~COG0092
MGQKVHPIGMRIGVIRDWDSKWYAEKDYADFLHEDLRIRDYVAKRLSDASVSRVEIERAANRVNITIHTAKPGMVIGKGG
SEVEALRKNLNELTQKRVHINIVEIKRADLDAKLVAENIARQLEGRVSFRRAQKQAIQRTMRAGAKGIKTQVSGRLGGAD
IARAEHYSEGTVPLHTLRADIDYAWEEADTTYGKLGVKVWIYRGEVLPTKKNNVEGGK
>P41205 ~~~rpsC~~~Small ribosomal subunit protein uS3~~~
MGQKVNSNGLRFGINKNWISRWTANSHAQTAKWLIEDEKIRNLFFVNYRNAQVSNVEIERTQATVDVFVYAAQPAFLIGS
ENKNIQKITKQIKQIIGRTTNLDLTINEIGSPMLSARIIARDLANAIEARVPLRTAMRQSLIKVLKAGANGIKVLVSGRL
NGAEIARDKMYIEGNMPLSTLRADIDYALEKAQTTYGVIGVKVWINRGMIYTKGLNRTPAHILHPQKKQPNRQNQQPRHF
NQGQVLSANKLTGSDVETSSIQALTKPNKEDKQ
>A0QSD7 ~~~rpsC~~~Small ribosomal subunit protein uS3~~~COG0092
MGQKINPHGFRLGITTEWKSRWYADKQYKDYVKEDVAIRKLLATGLERAGIADVEIERTRDRVRVDIHTARPGIVIGRRG
TEADRIRADLEKLTGKQVQLNILEVKNPESQAQLVAQGVAEQLSNRVAFRRAMRKAIQSAMRQPNVKGIRVQCSGRLGGA
EMSRSEFYREGRVPLHTLRADIDYGLYEAKTTFGRIGVKVWIYKGDIVGGKRELAAAAPASDRPRRERPSGTRPRRSGSA
GTTATSTEAGRAATSDAPAAGTAAAAEAPAESTES
>P9WH37 ~~~rpsC~~~Small ribosomal subunit protein uS3~~~COG0092
MGQKINPHGFRLGITTDWKSRWYADKQYAEYVKEDVAIRRLLSSGLERAGIADVEIERTRDRVRVDIHTARPGIVIGRRG
TEADRIRADLEKLTGKQVQLNILEVKNPESQAQLVAQGVAEQLSNRVAFRRAMRKAIQSAMRQPNVKGIRVQCSGRLGGA
EMSRSEFYREGRVPLHTLRADIDYGLYEAKTTFGRIGVKVWIYKGDIVGGKRELAAAAPAGADRPRRERPSGTRPRRSGA
SGTTATGTDAGRAAGGEEAAPDAAAPVEAQSTES
>Q9HWE1 ~~~rpsC~~~Small ribosomal subunit protein uS3~~~
MGQKVHPNGIRLGIVKEHTSVWYADRKNYADYLFADLKVREYLQDKLKSASVSRIDIHRPAQTARITIHTARPGIVIGKK
GEDVEKLRQDLTKQMGVPVHINIEEIRKPELDAMLVAQSVAQQLERRVMFRRAMKRAVQNAMRIGAKGIKIQVSGRLGGA
EIARTEWYREGRVPLHTLRADIDYATYEAHTTYGVIGVKVWIFKGEVIGGRQEELKPVAPAPRKKAAR
>Q6N4U0 ~~~rpsC~~~Small ribosomal subunit protein uS3~~~COG0092
MGQKINPIGLRLGINRTWDSRWFAGKNEYGKLLHEDVKIREILHKELKQAAVARIVIERPHKKCRVTIHSARPGVVIGKK
GADIDKLRKKVADITSSDVVINIVEIRKPELDATLVAESIAQQLERRVAFRRAMKRAVQSAMRLGAEGIRINCSGRLGGA
EIARMEWYREGRVPLHTLRADIDYGVATAFTTFGTCGVKVWIFKGEILEHDPMAQDKRMAEGDGGGSSRPRRDAA
>Q2FW12 ~~~rpsC~~~Small ribosomal subunit protein uS3~~~COG0092
MGQKINPIGLRVGIIRDWEAKWYAEKDFASLLHEDLKIRKFIDNELKEASVSHVEIERAANRINIAIHTGKPGMVIGKGG
SEIEKLRNKLNALTDKKVHINVIEIKKVDLDARLVAENIARQLENRASFRRVQKQAITRAMKLGAKGIKTQVSGRLGGAD
IARAEQYSEGTVPLHTLRADIDYAHAEADTTYGKLGVKVWIYRGEVLPTKNTSGGGK
>Q2YYQ2 ~~~rpsC~~~Small ribosomal subunit protein uS3~~~
MGQKINPIGLRVGIIRDWEAKWYAEKDFASLLHEDLKIRKFIDNELKEASVSHVEIERAANRINIAIHTGKPGMVIGKGG
SEIEKLRNKLNALTDKKVHINVIEIKKVDLDARLVAENIARQLENRASFRRVQKQAITRAMKLGAKGIKTQVSGRLGGAD
IARAEQYSEGTVPLHTLRADIDYAHAEADTTYGKLGVKVWIYRGEVLPTKNTSGGGK
>P66553 ~~~rpsC~~~Small ribosomal subunit protein uS3~~~
MGQKINPIGLRVGIIRDWEAKWYAEKDFASLLHEDLKIRKFIDNELKEASVSHVEIERAANRINIAIHTGKPGMVIGKGG
SEIEKLRNKLNALTDKKVHINVIEIKKVDLDARLVAENIARQLENRASFRRVQKQAITRAMKLGAKGIKTQVSGRLGGAD
IARAEQYSEGTVPLHTLRADIDYAHAEADTTYGKLGVKVWIYRGEVLPTKNTSGGGK
>P73314 ~~~rpsC~~~Small ribosomal subunit protein uS3~~~COG0092
MGQKIHPVGFRLGITKDHKSCWYADPKRYPELLQEDHKIRQYIEKTLNNAGISDIRIERKAEQIELGIHTARPGVVVGRG
GSGIEQLREGLQKLLGSARQIRVNVIEVPNADADAALMAEYIGQQLERRVSFRRVVRQALQRAERAEVKGIKIQVSGRLN
GAEIARTEWVREGRVPLHTLRADIDYAYRTALTTYGILGIKVWIFKGEVIPGQEAAIVAPPSQPRRKSRRQQFDDRSQDG
>P62663 ~~~rpsC~~~Small ribosomal subunit protein uS3~~~COG0092
MGNKIHPIGFRLGITRDWESRWYAGKKQYRHLLLEDQRIRGLLEKELYSAGLARVDIERAADNVAVTVHVAKPGVVIGRG
GERIRVLREELAKLTGKNVALNVQEVQNPNLSAPLVAQRVAEQIERRFAVRRAIKQAVQRVMESGAKGAKVIVSGRIGGA
EQARTEWAAQGRVPLHTLRANIDYGFALARTTYGVLGVKAYIFLGEVIGGQKPKARPELPKAEERPRRRRPAVRVKKEE
>P80372 ~~~rpsC~~~Small ribosomal subunit protein uS3~~~COG0092
MGNKIHPIGFRLGITRDWESRWYAGKKQYRHLLLEDQRIRGLLEKELYSAGLARVDIERAADNVAVTVHVAKPGVVIGRG
GERIRVLREELAKLTGKNVALNVQEVQNPNLSAPLVAQRVAEQIERRFAVRRAIKQAVQRVMESGAKGAKVIVSGRIGGA
EQARTEWAAQGRVPLHTLRANIDYGFALARTTYGVLGVKAYIFLGEVIGGQKPKARPELPKAEERPRRRRPAVRVKKEE
>B7IA15 ~~~rpsD~~~Small ribosomal subunit protein uS4~~~
MARYIGPKCKLSRREGTDLQLKSGVKPFDVKTKKANKAPGQHGQARGGKQSEYSLQLREKQKVRRIYGVLERQFSNYYKE
AARVKGATGENLLKLLESRLDNVVYRMGFGSTRAEARQLVSHRSITLNGRRVNIASIQVKAGDVIAVHEGAKQQLRIKNA
IELAAQRGIPAWIEVDHSKLEGTFKAAPDRSDLPAEINESLIVELYSK
>P21466 ~~~rpsD~~~Small ribosomal subunit protein uS4~~~COG0522
MARYTGPSWKLSRRLGISLSGTGKELEKRPYAPGPHGPGQRKKLSEYGLQLQEKQKLRHMYGVNERQFRTLFDKAGKLAG
KHGENFMILLDSRLDNVVYKLGLARTRRQARQLVNHGHILVDGSRVDIPSYLVKPGQTIGVREKSRNLSIIKESVEVNNF
VPEYLTFDAEKLEGTFTRLPERSELAPEINEALIVEFYSR
>P0A7V8 ~~~rpsD~~~Small ribosomal subunit protein uS4~~~COG0522
MARYLGPKLKLSRREGTDLFLKSGVRAIDTKCKIEQAPGQHGARKPRLSDYGVQLREKQKVRRIYGVLERQFRNYYKEAA
RLKGNTGENLLALLEGRLDNVVYRMGFGATRAEARQLVSHKAIMVNGRVVNIASYQVSPNDVVSIREKAKKQSRVKAALE
LAEQREKPTWLEVDAGKMEGTFKRKPERSDLSADINEHLIVELYSK
>Q82ZI6 ~~~rpsD~~~Small ribosomal subunit protein uS4~~~COG0522
MSRYTGPSWKVSRRLGISLSGTGKELARRPYKPGQHGPNSRGKVSEYGMQLTEKQKLRHMYGMNERQFRTLFIKASKIKE
GKHGVNFMVLLEQRLDNVVYRLGLATTRRQARQLVNHGHITVDGKRVDIPSYHVEVGQVIGVREKSQNISTIKEAVEATV
GRPAFVSFDTEKLEGSFTRLPERDELYPEIDEALVVEYYNQKL
>P81288 ~~~rpsD~~~Small ribosomal subunit protein uS4~~~
MARYTGPMWKISRRLGISLSGTGKELQKRPYPPGQHGPGQRRKLSEYGLQLQEKQKLRHMYGVNERQFRKTFEEAGKMPG
KHGENFMILLESRLDNLVYRLGLARTRRQARQLVTHGHIIVDGSRVNIPSYRVKPGQTIAVREKSRNLQVIKEALEANNY
IPDYLSFDPEKMEGTYTRLPERSELPAEINEALIVEFYSR
>A2RI10 ~~~rpsD~~~Small ribosomal subunit protein uS4~~~COG0522
MSRYTGPSWKQSRRYGISLTGSGKEIARRNYVPGQHGPNNRSKLSEYGLQLAEKQKLRFTYGLSERQFRNLYVAATKVKE
GTVGYNFMTLLEQRLDNVVYRLGLATTRRQARQFVNHGHILVDGKRVDIPSFRVQPGQVISVREKSMKVPAILEAVEATK
GRANFVSFDADKLEGTLVRLPERDEINPEINDALIVEFYNKMM
>Q8Y6T6 ~~~rpsD~~~Small ribosomal subunit protein uS4~~~COG0522
MARYTGPSWKVSRRLGISLSGTGKELERRPYAPGQHGPTQRKKISEYGLQQAEKQKLRHMYGLTERQFKNTFNKAGKLRG
KHGENFMILLEQRLDNIVYRLGLARTRRAARQLVNHGHITVDGKRVDIPSYQVSVGQVISVREKSAKNSAIAESLEVSSF
VPEYVTFDAEKLTGSLNRLPERSELAAEINEAFIVEFYSR
>P45811 ~~~rpsD~~~Small ribosomal subunit protein uS4~~~
MARYTGPVTRKSRRLRTDLVGGDQAFEKRPYPPGQHGRARIKESEYLLQLQEKQKARFTYGVMEKQFRRYYEEAVRQPGK
TGEELLKILESRLDNVIYRAGLARTRRMARQLVSHGHFNVNGVHVNVPSYRVSQYDIVDVRDKSLNTVPFQIARETAGER
PIPSWLQVVGERQRVLIHQLPERAQIDVPLTEQLIVEYYSK
>P46775 ~~~rpsD~~~Small ribosomal subunit protein uS4~~~
MKYTGSIFKRSRRLGFSLLENNKEFSKGKKRKTIPGQHGNRFRSSTMSGYAQQLQEKQRMQYMYGITDKQFRRLFRLVLK
QRGNLAVNLFRVLESRLDNIVYRMGFAPTRRSARQLVNHGHVLLNDRTVDTPSIILNPGDKVRLKAKTIKIPIVKAASES
GVVSPFVETNNKTFEGTYVRFPERSELPAGINESYVVEWYKRLVK
>A0QSL7 ~~~rpsD~~~Small ribosomal subunit protein uS4~~~COG0522
MARYTGPATRKSRRLGVDLVGGDQSFEKRPYPPGQHGRARIKESEYRQQLQEKQKARFSYGVMEKQFRRYYEEANRQPGK
TGDNLLRILESRLDNVVYRAGLARTRRMARQLVSHGHFLVNGVKVDIPSYRVSQYDIIDVKEKSLNTLPFQIARETAGER
PIPSWLQVVGERQRILVHQLPERAQIDVPLTEQLIVELYSK
>P9WH35 ~~~rpsD~~~Small ribosomal subunit protein uS4~~~COG0522
MARYTGPVTRKSRRLRTDLVGGDQAFEKRPYPPGQHGRARIKESEYLLQLQEKQKARFTYGVMEKQFRRYYEEAVRQPGK
TGEELLKILESRLDNVIYRAGLARTRRMARQLVSHGHFNVNGVHVNVPSYRVSQYDIVDVRDKSLNTVPFQIARETAGER
PIPSWLQVVGERQRVLIHQLPERAQIDVPLTEQLIVEYYSK
>O52759 ~~~rpsD~~~Small ribosomal subunit protein uS4~~~
MARYIGPKCKLSRREGTDLFLKSGARALDSKCKAENVPGQHGQRRGRLSDYGLQLREKQKVRRIYGVLERQFRGYYQEAS
RRKGSTGENLLQLLECRLDNVVYRMGFGSTRSESRQLVSHKAITVNGQTVNIPSYQVKAGDVVAVREKSKNQLRIAQALE
LCGQRGRVEWVEVDLDKKAGTFKSAPARSDLSADINENLIVELYSK
>Q6N9G0 ~~~rpsD~~~Small ribosomal subunit protein uS4~~~COG0522
MTKRAEAKYKIDRRMGQNIWGRPKSPVNRREYGPGQHGQRRKGKLSDFGVQLRAKQKLKGYYANISERQFHAIYVEATRL
KGDSGENLIGLLERRLDAVVYRAKFVSTMFAARQFINHGHIKVNGKRVNIPSYKVRVGDVIEVKEASKQLAFVLEASQLA
ERDVPDYIEVDHNKMTAKFARIPALSDVPFAVQMEPHLIVEFYSR
>O54297 ~~~rpsD~~~Small ribosomal subunit protein uS4~~~
MARYLGPKLKLSRREGTDLFLKSGVRAIDTKCKIEQAPGQHGARKPRLSDYGVQLREKQKVRRIYGVLERQFRNYYKEAA
RLKGNTGENLLALLEGRLDNVVYRMGFGATRAEARQLVSHKAIMVNGRVVNIASYQVSPNDVVSIREKAKKQSRVKAALE
LAEQREKPTWLEVDAGKMEGTYKRKPERSDLSADINEHLIVELYSK
>Q2FXK6 ~~~rpsD~~~Small ribosomal subunit protein uS4~~~COG0522
MARFRGSNWKKSRRLGISLSGTGKELEKRPYAPGQHGPNQRKKLSEYGLQLREKQKLRYLYGMTERQFRNTFDIAGKKFG
VHGENFMILLASRLDAVVYSLGLARTRRQARQLVNHGHILVDGKRVDIPSYSVKPGQTISVREKSQKLNIIVESVEINNF
VPEYLNFDADSLTGTFVRLPERSELPAEINEQLIVEYYSR
>Q2YTH0 ~~~rpsD~~~Small ribosomal subunit protein uS4~~~
MARFRGSNWKKSRRLGISLSGTGKELEKRPYAPGQHGPNQRKKLSEYGLQLREKQKLRYLYGMTERQFRNTFDIAGKKFG
VHGENFMILLASRLDAVVYSLGLARTRRQARQLVNHGHILVDGKRVDIPSYSVKPGQTISVREKSQKLNIIVESVEINNF
VPEYLNFDADSLTGTFVRLPERSELPAEINEQLIVEYYSR
>P66563 ~~~rpsD~~~Small ribosomal subunit protein uS4~~~
MARFRGSNWKKSRRLGISLSGTGKELEKRPYAPGQHGPNQRKKLSEYGLQLREKQKLRYLYGMTERQFRNTFDIAGKKFG
VHGENFMILLASRLDAVVYSLGLARTRRQARQLVNHGHILVDGKRVDIPSYSVKPGQTISVREKSQKLNIIVESVEINNF
VPEYLNFDADSLTGTFVRLPERSELPAEINEQLIVEYYSR
>P62664 ~~~rpsD~~~Small ribosomal subunit protein uS4~~~COG0522
MGRYIGPVCRLCRREGVKLYLKGERCYSPKCAMERRPYPPGQHGQKRARRPSDYAVRLREKQKLRRIYGISERQFRNLFE
EASKKKGVTGSVFLGLLESRLDNVVYRLGFAVSRRQARQLVRHGHITVNGRRVDLPSYRVRPGDEIAVAEKSRNLELIRQ
NLEAMKGRKVGPWLSLDVEGMKGKFLRLPDREDLALPVNEQLVIEFYSR
>P80373 ~~~rpsD~~~Small ribosomal subunit protein uS4~~~COG0522
MGRYIGPVCRLCRREGVKLYLKGERCYSPKCAMERRPYPPGQHGQKRARRPSDYAVRLREKQKLRRIYGISERQFRNLFE
EASKKKGVTGSVFLGLLESRLDNVVYRLGFAVSRRQARQLVRHGHITVNGRRVDLPSYRVRPGDEIAVAEKSRNLELIRQ
NLEAMKGRKVGPWLSLDVEGMKGKFLRLPDREDLALPVNEQLVIEFYSR
>B7IA22 ~~~rpsE~~~Small ribosomal subunit protein uS5~~~
MAKVEQNEGLVEKLVAVDRVAKVVKGGRIFSFTALTVVGDGNGRVGFGRGKAREVPAAISKALEAARRNMITVDLAGTTL
QHPVNARHGASRVYMQPASEGTGVIAGGAMRAVLEAAGVHNVLAKCYGSTNAANVVNATFKGLRDMTSPEKVAAKRGKSV
EEIQG
>P21467 ~~~rpsE~~~Small ribosomal subunit protein uS5~~~COG0098
MRRIDPSKLELEERLVTVNRVAKVVKGGRRFRFAALVVVGDKNGHVGFGTGKAQEVPEAIRKAVEDAKKNLIEVPMVGTT
IPHEIIGRFGAGNILLKPASEGTGVIAGGPVRAVLELAGVADILSKSLGSNTPINMIRATLQGLSELKRAEDVAKLRGKS
VEELLG
>P0A7W1 ~~~rpsE~~~Small ribosomal subunit protein uS5~~~COG0098
MAHIEKQAGELQEKLIAVNRVSKTVKGGRIFSFTALTVVGDGNGRVGFGYGKAREVPAAIQKAMEKARRNMINVALNNGT
LQHPVKGVHTGSRVFMQPASEGTGIIAGGAMRAVLEVAGVHNVLAKAYGSTNPINVVRATIDGLENMNSPEMVAAKRGKS
VEEILGK
>Q839E7 ~~~rpsE~~~Small ribosomal subunit protein uS5~~~COG0098
MVYIDPKHLELEDRVVAINRVTKVVKGGRRLRFAALVVVGDKNGHVGFGTGKAQEVPEAIRKAIEDAKKNLVEVPMVGST
IPHEVIGVFGGGRILMKPAVEGSGVAAGGPVRAVLELAGVADITSKSLGSNTPINVVRATVEGLKQLKRAEEVAALRGKS
VEEIIG
>P02357 ~~~rpsE~~~Small ribosomal subunit protein uS5~~~
MRRINPNKLELEERVVAVNRVAKVVKGGRRLRFSALVVVGDKNGHVGFGTGKAQEVPEAIRKAIEDAKKNLIEVPIVGTT
IPHEVIGHFGAGEIILKPASEGTGVIAGGPARAVLELAGISDILSKSIGSNTPINMVRATFDGLKQLKRAEDVAKLRGKT
VEELLG
>A2RNN6 ~~~rpsE~~~Small ribosomal subunit protein uS5~~~COG0098
MAENRRNDREQSEFEERVVSINRVTKVVKGGRRLRFAALVVVGDRNGRVGFGTGKAQEVPEAIRKAIEAAKKNLITVPMV
GTTLPHEALGVFGGGKILLKPAVEGAGVAAGGAVRAVLELAGVADVTSKSLGSNTPINVVRATVDGLTQLKRAEEVAALR
GKSVSDFA
>Q8Y446 ~~~rpsE~~~Small ribosomal subunit protein uS5~~~COG0098
MPEQIDGNKLDLEERVVTINRVAKVVKGGRRFRFTALVVVGDKNGHVGFGTGKAQEVPDAIRKAVEDAKKNMVLVPTVDT
TIPHTVVGHFGGGEILLKPASAGSGVTAGGPVRAVLELAGVADVSSKSLGSNTPINMVRATIDGIKQLKNAEDVAKLRGK
TVEELLG
>Q50301 ~~~rpsE~~~Small ribosomal subunit protein uS5~~~
MTDQNQKANQGNGLQTTNLQAHAQRKHNLRPSSEGIKKAVSKKEGGGHNRNNQNRRFQKPAFKSEFEERIVKLKRISKTT
KGGRNMRFSVLVVVGNRKGKIGYGIAKALEVPNAIKKAIKAAHNSLHTIEIHKGSIYHEVIGRSGASRVLLKPAPQGTGI
IAGGAIRAIIELAGYSDIYTKNLGRNTPINMIHATMDGILKQLSPRRVAILRNKNLNEL
>A0QSG6 ~~~rpsE~~~Small ribosomal subunit protein uS5~~~COG0098
MAEQAGAGSAQDNRGGRGRRDDRGGRGRDGGDKSNYIERVVSINRVSKVVKGGRRFSFTALVIVGDGKGMVGVGYGKAKE
VPAAIAKGVEEARKNFFRVPLIGSTITHPVQGEAAAGVVMLRPASPGTGVIAGGAARAVLECAGVHDILAKSLGSDNAIN
VVHATVAALKLLQRPEEVAARRGLPIEDVAPAGMLKARRESEALAAAAAREGSA
>P9WH33 ~~~rpsE~~~Small ribosomal subunit protein uS5~~~COG0098
MAEQPAGQAGTTDNRDARGDREGRRRDSGRGSRERDGEKSNYLERVVAINRVSKVVKGGRRFSFTALVIVGDGNGMVGVG
YGKAKEVPAAIAKGVEEARKSFFRVPLIGGTITHPVQGEAAAGVVLLRPASPGTGVIAGGAARAVLECAGVHDILAKSLG
SDNAINVVHATVAALKLLQRPEEVAARRGLPIEDVAPAGMLKARRKSEALAASVLPDRTI
>Q9HWF2 ~~~rpsE~~~Small ribosomal subunit protein uS5~~~
MANNEQKRDEGYIEKLVQVNRVAKTVKGGRIFAFTALTVVGDGKGRVGFGRGKAREVPAAIQKAMEAARRNMIQVDLNGT
TLQYPTKSAHGASKVYMQPASEGTGIIAGGAMRAVLEVAGVQNVLAKCYGSTNPVNVVYATFKGLKNMQAPEAVAAKRGK
SVEEIL
>Q6N4V0 ~~~rpsE~~~Small ribosomal subunit protein uS5~~~COG0098
MAAERERGGRERSREREERDSEFVDKLVHINRVAKVVKGGKRFGFAALVVVGDQKGRVGFGHGKAREVPEAIRKATESAK
RNLTRVALREGRTLHHDIAGRHGAGRVYLRAAPAGTGIIAGGPMRAVFETLGIADVVAKSVGSSNPYNMVRATFDALKHL
DSPRSVAARRNIKVSTLQARRVGGDAEVVAE
>P0A7W4 ~~~rpsE~~~Small ribosomal subunit protein uS5~~~
MAHIEKQAGELQEKLIAVNRVSKTVKGGRIFSFTALTVVGDGNGRVGFGYGKAREVPAAIQKAMEKARRNMINVALNNGT
LQHPVKGVHTGSRVFMQPASEGTGIIAGGAMRAVLEVAGVHNVLAKAYGSTNPINVVRATIDGLENMNSPEMVAAKRGKS
VEEILGK
>Q2FW23 ~~~rpsE~~~Small ribosomal subunit protein uS5~~~COG0098
MARREEETKEFEERVVTINRVAKVVKGGRRFRFTALVVVGDKNGRVGFGTGKAQEVPEAIKKAVEAAKKDLVVVPRVEGT
TPHTITGRYGSGSVFMKPAAPGTGVIAGGPVRAVLELAGITDILSKSLGSNTPINMVRATIDGLQNLKNAEDVAKLRGKT
VEELYN
>Q2YYL4 ~~~rpsE~~~Small ribosomal subunit protein uS5~~~
MARREEETKEFEERVVTINRVAKVVKGGRRFRFTALVVVGDKNGRVGFGTGKAQEVPEAIKKAVEAAKKDLVVVPRVEGT
TPHTITGRYGSGSVFMKPAAPGTGVIAGGPVRAVLELAGITDILSKSLGSNTPINMVRATIDGLQNLKNAEDVAKLRGKT
VEELYN
>P66579 ~~~rpsE~~~Small ribosomal subunit protein uS5~~~
MARREEETKEFEERVVTINRVAKVVKGGRRFRFTALVVVGDKNGRVGFGTGKAQEVPEAIKKAVEAAKKDLVVVPRVEGT
TPHTITGRYGSGSVFMKPAAPGTGVIAGGPVRAVLELAGITDILSKSLGSNTPINMVRATIDGLQNLKNAEDVAKLRGKT
VEELYN
>P62665 ~~~rpsE~~~Small ribosomal subunit protein uS5~~~COG0098
MPETDFEEKMILIRRTARMQAGGRRFRFGALVVVGDRQGRVGLGFGKAPEVPLAVQKAGYYARRNMVEVPLQNGTIPHEI
EVEFGASKIVLKPAAPGTGVIAGAVPRAILELAGVTDILTKELGSRNPINIAYATMEALRQLRTKADVERLRKGEAHAQA
QG
>Q5SHQ5 ~~~rpsE~~~Small ribosomal subunit protein uS5~~~COG0098
MPETDFEEKMILIRRTARMQAGGRRFRFGALVVVGDRQGRVGLGFGKAPEVPLAVQKAGYYARRNMVEVPLQNGTIPHEI
EVEFGASKIVLKPAAPGTGVIAGAVPRAILELAGVTDILTKELGSRNPINIAYATMEALRQLRTKADVERLRKGEAHAQA
QG
>P27152 ~~~rpsE~~~Small ribosomal subunit protein uS5~~~
MPETDFEEKMILIRRTARMQAGGRRFRFGALVVVGDRQGRVGLGFGKAPEVPLAVQKAGYYARRNMVEVPLQNGTIPHEI
EVEFGASKIVLKPAAPGTGVIAGAVPRAILELAGVTDILTKELGSRNPINIAYATMEALRQLRTKADVERLRKGEAHAQA
QG
>B7IBC1 ~~~rpsF~~~Small ribosomal subunit protein bS6~~~
MRHYEIVLLVHPDQSDQVVGMVERYISQIKEADGQIHRLEDWGRRQLAYPINKIHKAHYILMNVECGQSTLDELEELFRY
NDAIIRNLIIRREHAITEESLLAKSAEEKRARKAQREEAQQVAQEAE
>O66474 ~~~rpsF~~~Small ribosomal subunit protein bS6~~~COG0360
MRHYKTLRYYETVFAVKPTLSEEEMKKKFEQVKEFIKQKGGEILYEEDWGMRQLAYPIQKFNNARYFLVQFKTENPQLPN
ELDFQLKIDEDVIRWLNFQIKESEVKKNAQ
>Q81JI2 ~~~rpsF~~~Small ribosomal subunit protein bS6~~~COG0360
MRKYEIMYIIRPGVEEEAQKALVERFAGVLTNNGAEIINTKEWGKRRLAYEINDLREGFYMILNVNANAEAINEFDRLAK
INEDILRHIVVKEEEK
>P21468 ~~~rpsF~~~Small ribosomal subunit protein bS6~~~COG0360
MRKYEVMYIIRPNIDEESKKAVIERFNNVLTSNGAEITGTKDWGKRRLAYEINDFRDGFYQIVNVQSDAAAVQEFDRLAK
ISDDIIRHIVVKEEE
>P02358 ~~~rpsF~~~Small ribosomal subunit protein bS6~~~COG0360
MRHYEIVFMVHPDQSEQVPGMIERYTAAITGAEGKIHRLEDWGRRQLAYPINKLHKAHYVLMNVEAPQEVIDELETTFRF
NDAVIRSMVMRTKHAVTEASPMVKAKDERRERRDDFANETADDAEAGDSEEEEEE
>Q839Z0 ~~~rpsF~~~Small ribosomal subunit protein bS6~~~COG0360
MSQDTKYEIMYIIRPNIDEEAKTALVERFDTILKDNGAEVIESKDWEKRRLAYEMNGFREGIYHIVNVTSPSTAGAINEF
DRLAKINDDIIRHMIVKVEA
>P56013 ~~~rpsF~~~Small ribosomal subunit protein bS6~~~COG0360
MRHYETMFILKPTLVEEEIKSKIEFYKEVITKHHGVIETSLDMGMRNLAYEIKKHKRGYYYVAYFKAEPSMIVELERLYR
INEDVLRFIVIKYESKKEVEAWHALVDRANKKPSHAKEKHEKTEHTHSHHTEEAESVGSHSE
>A2RNZ4 ~~~rpsF~~~Small ribosomal subunit protein bS6~~~COG0360
MTKYEILYIIRPNIDEEAKTALVERFDAILTENGAANLESKDWEKRKLAYEINDFREGIYHIATFEAETTSEALSEFDRL
AKINLDILRHMIVKVEA
>Q8YAR9 ~~~rpsF~~~Small ribosomal subunit protein bS6~~~COG0360
MARKYEIMYIIRPNIEEDEKKAVVERFDGILTENGAEIIESKEWGKRRLAYEINDYRDGFYHIVKLNADKADSINEFDRL
AKISDDIIRHMVIKEEA
>P75543 ~~~rpsF~~~Small ribosomal subunit protein bS6~~~
MQYNIILLVDGSLSLEQANQVNEKQQQTLTNVEGLQTEYLGLKELAYPIKKQLSAHYYRWKFSGDNQSTKDFKRTANINK
QVLRELIINLEREYGYLASINPKKQQLALQKRAKYDEIIARENNPENPDVPVTSGLASTQPRLSRTEKAQKPKEELWDVV
QKMGNFDSVQANPYRPRFKRFNAEHVNQRQNQQNNNNNRFDRNRNRQHNRFKDKQ
>A0R7F9 ~~~rpsF~~~Small ribosomal subunit protein bS6~~~COG0360
MVILDPTLDERTVAPSLETFLNVIRKDGGTVDKVDIWGRRRLAYEIAKHAEGIYAVIDVKAEPATVSELDRQLNLNESVL
RTKVLRTDKH
>P9WH31 ~~~rpsF~~~Small ribosomal subunit protein bS6~~~COG0360
MRPYEIMVILDPTLDERTVAPSLETFLNVVRKDGGKVEKVDIWGKRRLAYEIAKHAEGIYVVIDVKAAPATVSELDRQLS
LNESVLRTKVMRTDKH
>Q9HUM9 ~~~rpsF~~~Small ribosomal subunit protein bS6~~~
MRHYEIVFLVHPDQSEQVGGMVERYTKAIEEDGGKIHRLEDWGRRQLAYAINNVHKAHYVLMNVECSAKALAELEDNFRY
NDAVIRNLVMRRDEAVTEQSEMLKAEESRNERRERRERPNDNAEGADGDDNSDSDNADE
>Q6N5A4 ~~~rpsF~~~Small ribosomal subunit protein bS6~~~COG0360
MPLYEHVFLARQDASAQQVEELTTQITGVIEGLGGKVTKTESWGLRSLTYRMNKNRKAHFVLLNIDGPAAVVSEIERQER
INEDVIRYLTVRVDEHEEGPSAMMRKADRDRERDDRGPREGGFRGDREGRGDRDGFRGDRGPRRPREDADAPAAAVEE
>Q2G113 ~~~rpsF~~~Small ribosomal subunit protein bS6~~~COG0360
MRTYEVMYIVRPNIEEDAKKALVERFNGILATEGAEVLEAKDWGKRRLAYEINDFKDGFYNIVRVKSDNNKATDEFQRLA
KISDDIIRYMVIREDEDK
>Q2YVJ2 ~~~rpsF~~~Small ribosomal subunit protein bS6~~~
MRTYEVMYIVRPNIEEDAKKALVERFNGILATEGAEVLEAKDWGKRRLAYEINDFKDGFYNIVRVKSDNNKATDEFQRLA
KISDDIIRYMVIREDEDK
>P99142 ~~~rpsF~~~Small ribosomal subunit protein bS6~~~
MRTYEVMYIVRPNIEEDAKKALVERFNGILATEGAEVLEAKDWGKRRLAYEINDFKDGFYNIVRVKSDNNKATDEFQRLA
KISDDIIRYMVIREDEDK
>Q9WZ72 ~~~rpsF~~~Small ribosomal subunit protein bS6~~~COG0360
MAYVKERIYESMFIIAPNVPEEERENLVERVKKIIEERVKGKIDKVERMGMRKFAYEIKKFNEGDYTVIYFRCDGQNLQE
LENFYRVTPEIIRWQTFRRFDLEKKERKAQREKAAAEATESSEGGSED
>P62666 ~~~rpsF~~~Small ribosomal subunit protein bS6~~~COG0360
MRRYEVNIVLNPNLDQSQLALEKEIIQRALENYGARVEKVEELGLRRLAYPIAKDPQGYFLWYQVEMPEDRVNDLARELR
IRDNVRRVMVVKSQEPFLANA
>Q5SLP8 ~~~rpsF~~~Small ribosomal subunit protein bS6~~~COG0360
MRRYEVNIVLNPNLDQSQLALEKEIIQRALENYGARVEKVEELGLRRLAYPIAKDPQGYFLWYQVEMPEDRVNDLARELR
IRDNVRRVMVVKSQEPFLANA
>P23370 ~~~rpsF~~~Small ribosomal subunit protein bS6~~~
MRRYEVNIVLNPNLDQSQLALEKEIIQRALENYGARVEKVEELGLRRLAYPIAKDPQGYFLWYQVEMPEDRVNDLARELR
IRDNVRRVMVVKSQEPFLANA
>B7I7S0 ~~~rpsG~~~Small ribosomal subunit protein uS7~~~
MPRRRVVAAREILPDPKFSSQTIAKFMNHVMQDGKKSIAESIVYGALERVQEKNKVDPVEFFETTLEKVRPMVEVKARRV
GGATYQVPMEVRPSRRTALAMRWLVDAAAKRSEKTMALRLAGELLDAAEGKGAAIKKREDVHRMAEANKAFSHYRF
>P21469 ~~~rpsG~~~Small ribosomal subunit protein uS7~~~COG0049
MPRKGPVAKRDVLPDPIYNSKLVSRLINKMMIDGKKGKSQTILYKSFDIIKERTGNDAMEVFEQALKNIMPVLEVKARRV
GGANYQVPVEVRPERRTTLGLRWLVNYARLRGEKTMEERLANEILDAANNTGAAVKKREDTHKMAEANKAFAHYRW
>P02359 ~~~rpsG~~~Small ribosomal subunit protein uS7~~~COG0049
MPRRRVIGQRKILPDPKFGSELLAKFVNILMVDGKKSTAESIVYSALETLAQRSGKSELEAFEVALENVRPTVEVKSRRV
GGSTYQVPVEVRPVRRNALAMRWIVEAARKRGDKSMALRLANELSDAAENKGTAVKKREDVHRMAEANKAFAHYRWLSLR
SFSHQAGASSKQPALGYLN
>Q839H0 ~~~rpsG~~~Small ribosomal subunit protein uS7~~~COG0049
MPRKGPVAKRDVLPDPIYNSKLVTRLINRVMVDGKRGIAANIIYNSFDIIKESTGNDPLEVFEQAMKNVMPVLEVKARRV
GGSNYQVPVEVRPERRTTLGLRWVVNYARLRGEHTMEERLAKEIMDAANNTGASVKKREDTHKMADANRAFAHYRW
>P22744 ~~~rpsG~~~Small ribosomal subunit protein uS7~~~
MPRRGPVAKRDVLPDPIYNSKLVTRLINKIMIDGKKSKAQKILYTAFDIIRERTGKDPMEVFEQALKNVMPVLEVRARRV
GGANYQVPVEVRPDRRVSLGLRWLVQYARLRNEKTMEERLANEIMDAANNTGAAVKKREDTHKMAEANKAFAHYRW
>A2RP73 ~~~rpsG~~~Small ribosomal subunit protein uS7~~~COG0049
MRKNRAPKREVLADPMYNSIVVTRLINRVMLDGKRGVAAQIVYGAFKQIEEATGNNPLEVFETAMENIMPVLEVRARRVG
GSNYQVPVEVRPERRTTLGLRWLVTIARNRGEHTMQDRLAKEILDAANNTGAAVKKREDTHKMAEANRAFAHFRW
>P66611 ~~~rpsG~~~Small ribosomal subunit protein uS7~~~COG0049
MPRKGPVAKRDVLPDPIYNSKLVTRLINKMMVDGKRGKSQAILYSAFDIIAQETGKDPMEVFEQAMKNIMPLLEVKARRV
GGANYQVPIEVRADRRSTLGLRWLVNYARLRGEKTMEVRVAREIMDAANNTGASVKKREDTHKMADANRAFAHYRW
>P75545 ~~~rpsG~~~Small ribosomal subunit protein uS7~~~
MRKNRAPKRTVLPDPVFNNTLVTRIINVIMEDGKKGLAQRILYGAFDLIEQRTKEKPLTVFERAVGNVMPRLELRVRRIA
GSNYQVPTEVPQDRKIALALRWIAMFARKRHEKTMLEKIANEIIDASNNTGAAIKKKDDTHKMAEANKAFAHMRW
>A0QS97 ~~~rpsG~~~Small ribosomal subunit protein uS7~~~COG0049
MPRKGPAPKRPLVNDPVYGSQLVTQLVNKVLLEGKKSLAERIVYGALEQAREKTGTDPVVTLKRALDNVKPALEVRSRRV
GGATYQVPVEVRPDRSTTLALRWLVNFSRQRREKTMVERLANEILDASNGLGASVKRREDTHKMAEANRAFAHYRW
>P9WH29 ~~~rpsG~~~Small ribosomal subunit protein uS7~~~COG0049
MPRKGPAPKRPLVNDPVYGSQLVTQLVNKVLLKGKKSLAERIVYGALEQARDKTGTDPVITLKRALDNVKPALEVRSRRV
GGATYQVPVEVRPDRSTTLALRWLVGYSRQRREKTMIERLANEILDASNGLGASVKRREDTHKMAEANRAFAHYRW
>Q9HWD1 ~~~rpsG~~~Small ribosomal subunit protein uS7~~~
MPRRRVAAKREVLADPKYGSQILAKFMNHVMESGKKAVAERIVYGALDKVKERGKADPLETFEKALDAIAPLVEVKSRRV
GGATYQVPVEVRPSRRNALAMRWLVDFARKRGEKSMALRLAGELLDAAEGKGAAVKKREDVHRMAEANKAFSHYRF
>Q6N4T3 ~~~rpsG~~~Small ribosomal subunit protein uS7~~~COG0049
MSRRHAAEKREVLPDPKFGNIIVTKFMNSVMYAGKKSVAESIVYGAFDLIEAKTKQPPLGVFEQALDNVMPTIEVRSRRV
GGATYQVPVEVRSTRRQALGIRWLITAARGRNEKTMTERLSAELLDASNNRGNAVKKREDVHKMAEANRAFSHYRW
>P0A2B3 ~~~rpsG~~~Small ribosomal subunit protein uS7~~~
MPRRRVIGQRKILPDPKFGSELLAKFVNILMVDGKKSTAESIVYSALETLAQRSGKSELEAFEVALENVRPTVEVKSRRV
GGSTYQVPVEVRPVRRNALAMRWIVEAARKRGDKSMALRLANELSDAADNKGTAVKKREDVHRMAEANKAFAHYRW
>P48940 ~~~rpsG~~~Small ribosomal subunit protein uS7~~~COG0049
MPRKGSVPKRDVLPDPIHNSKLVTKLINKIMLDGKRGTAQRILYSAFDLVEQRSGRDALEVFEEAINNIMPVLEVKARRV
GGSNYQVPVEVRPERRTTLGLRWLVNYARLRGEKTMEDRLANEILDAANNTGGAVKKREDTHKMAEANKAFAHYRW
>P66616 ~~~rpsG~~~Small ribosomal subunit protein uS7~~~
MPRKGSVPKRDVLPDPIHNSKLVTKLINKIMLDGKRGTAQRILYSAFDLVEQRSGRDALEVFEEAINNIMPVLEVKARRV
GGSNYQVPVEVRPERRTTLGLRWLVNYARLRGEKTMEDRLANEILDAANNTGGAVKKREDTHKMAEANKAFAHYRW
>P38526 ~~~rpsG~~~Small ribosomal subunit protein uS7~~~COG0049
MRRRRAEKRQIPPDPVFGDVLVAKLINRVMWDGKKTIAQKIVYGAFDIIREKTKKDPLEVFRQAVENVKPVLEVRPRRVG
GATYQVPIEVQEPRRTSLALRWIVEAARAKKGRPMKEKLAEEIIAAYNNTGTAIKKKEDTHRMAEANRAFAHYRW
>P62667 ~~~rpsG~~~Small ribosomal subunit protein uS7~~~COG0049
MARRRRAEVRQLQPDLVYGDVLVTAFINKIMRDGKKNLAARIFYDACKIIQEKTGQEPLKVFKQAVENVKPRMEVRSRRV
GGANYQVPMEVSPRRQQSLALRWLVQAANQRPERRAAVRIAHELMDAAEGKGGAVKKKEDVERMAEANRAYAHYRW
>P17291 ~~~rpsG~~~Small ribosomal subunit protein uS7~~~COG0049
MARRRRAEVRQLQPDLVYGDVLVTAFINKIMRDGKKNLAARIFYDACKIIQEKTGQEPLKVFKQAVENVKPRMEVRSRRV
GGANYQVPMEVSPRRQQSLALRWLVQAANQRPERRAAVRIAHELMDAAEGKGGAVKKKEDVERMAEANRAYAHYRW
>B7IA25 ~~~rpsH~~~Small ribosomal subunit protein uS8~~~
MSMQDTVADMLTRVRNAQMAKKQTVSMPSSKLKVAIANVLQQEGYISNVEVAQEETKSTLTITLKYFEGKPVIEMVKRVS
RPGLRQYRGKDKLPSVKQGLGIAIVSTSKGIMTDRAARAAGIGGEVIAFVS
>O67566 ~~~rpsH~~~Small ribosomal subunit protein uS8~~~COG0096
MSAVDPIADMFSAIKNAIMRRDDFLYVPSSKLKERILDVLKKEGFIQDWEALKGEKYEEEYKKMKELAEKSPNPKMKRYL
KQLEEYNKGTQYPIKIYLKYLDPKKRKSAITNIVKVSKGGRRVYAGVRTMPYVKRGLGIAIVSTDAGVMTDHEARRMRKG
GEVIAFVW
>Q81VR6 ~~~rpsH~~~Small ribosomal subunit protein uS8~~~COG0096
MVMTDPIADMLTRIRNANMVRHEKLEVPASKIKKEIAELLKREGFIRDVEYIEDNKQGILRIFLKYGANNERVITGLKRI
SKPGLRVYAKADEVPRVLNGLGIALVSTSKGVMTDKDARQLQTGGEVVAYVW
>P12879 ~~~rpsH~~~Small ribosomal subunit protein uS8~~~COG0096
MVMTDPIADMLTRIRNANMVRHEKLEIPASKLKREIAEILKREGFIRDVEFVEDSKQGIIRVFLKYGQNNERVITGLKRI
SKPGLRVYAKSNEVPRVLNGLGIAIISTSQGVLTDKEARAKQAGGEVLAYVW
>P0A7W7 ~~~rpsH~~~Small ribosomal subunit protein uS8~~~COG0096
MSMQDPIADMLTRIRNGQAANKAAVTMPSSKLKVAIANVLKEEGFIEDFKVEGDTKPELELTLKYFQGKAVVESIQRVSR
PGLRIYKRKDELPKVMAGLGIAVVSTSKGVMTDRAARQAGLGGEIICYVA
>Q839F0 ~~~rpsH~~~Small ribosomal subunit protein uS8~~~COG0096
MVMTDPIADFLTRIRNANMVKHETLEVPASKIKRDIAEILKREGFIRDVEYIEDDKQGVIRVFLKYGKNEERVITNLKRI
SKPGLRAYVKADEVPKVLNGLGIAIISTSEGVITDKEARAKNIGGEVIAYVW
>P56209 ~~~rpsH~~~Small ribosomal subunit protein uS8~~~
MVMTDPIADMLTAIRNANMVRHEKLEVPASKIKREIAEILKREGFIRDYEYIEDNKQGILRIFLKYGPNERVITGLKRIS
KPGLRVYVKAHEVPRVLNGLGIAILSTSQGVLTDKEARQKGTGGEIIAYVI
>A2RNN9 ~~~rpsH~~~Small ribosomal subunit protein uS8~~~COG0096
MVMTDPIADFLTRIRNGNMRKFDVVEAPASKIKRQIAEILKAEGYVKDVEYVEDNKQGVIRVFLKYGKNGEKVITNLKRI
SKPGLRVYVKSDDVPKVLNGLGTAIISTSTGVVTDKVARQTNVGGEVIAYIW
>P66623 ~~~rpsH~~~Small ribosomal subunit protein uS8~~~COG0096
MVMTDPIADFLTRIRNANMVKHDKLELPASKIKKEIAEILKREGFIRDVEYIEDDNAGTIRVFLKYGATGERVITGLKRI
SKPGLRVYAKSTEVPKVLNGLGIAIVSTSQGVLTDKEARAKQVGGEVLAYVW
>Q50304 ~~~rpsH~~~Small ribosomal subunit protein uS8~~~
MITTTKPIKAHFDPVADLLTKINNARKAKLMTVTTIASKLKIAILEILVKEGYLANFQVLENKSKTKRIVTFNLKYTQRR
IPSINGVKQISKPGLRIYRPFEKLPLVLNGLGIAIISTSDGVMTDKVARLKKIGGEILAYVW
>A0QSG3 ~~~rpsH~~~Small ribosomal subunit protein uS8~~~COG0096
MTMTDPIADFLTRLRNANSAYHDEVTLPHSKLKANIAEILKREGYISDYRTEDARVGKSLVVQLKYGPSRERSIAGLRRV
SKPGLRVYAKSTNLPRVLGGLGVAIISTSSGLLTDRQAARQGVGGEVLAYVW
>P9WH27 ~~~rpsH~~~Small ribosomal subunit protein uS8~~~COG0096
MTMTDPIADFLTRLRNANSAYHDEVSLPHSKLKANIAQILKNEGYISDFRTEDARVGKSLVIQLKYGPSRERSIAGLRRV
SKPGLRVYAKSTNLPRVLGGLGVAIISTSSGLLTDRQAARQGVGGEVLAYVW
>Q9HWE9 ~~~rpsH~~~Small ribosomal subunit protein uS8~~~
MSMQDPLADMLTRIRNAQMAEKTVVSMPSSKLKAAVAKVLKDEGYIADFQISSEVKPQLSIELKYFEGKPVIEEVKRISR
PGLRQYKSVEQLPKVRGGLGVSIVSTNKGVMTDRAARAAGVGGEVLCTVF
>Q6N4U7 ~~~rpsH~~~Small ribosomal subunit protein uS8~~~COG0096
MSTHDPISDLITRIRNAQMRSKSKVSTPGSKMRANVLDVLKAEGYIRGYATVEHPSGRSELEIELKYFDGEPVIREIERV
SRPGRRVYASVKNLPRVNNGLGISVLSTPKGIMADHDARDANVGGEVLFTVF
>Q2FW20 ~~~rpsH~~~Small ribosomal subunit protein uS8~~~COG0096
MTMTDPIADMLTRVRNANMVRHEKLELPASNIKKEIAEILKSEGFIKNVEYVEDDKQGVLRLFLKYGQNDERVITGLKRI
SKPGLRVYAKASEMPKVLNGLGIALVSTSEGVITDKEARKRNVGGEIIAYVW
>Q2YYL1 ~~~rpsH~~~Small ribosomal subunit protein uS8~~~
MTMTDPIADMLTRVRNANMVRHEKLELPASNIKKEIAEILKSEGFIKNVEYVEDDKQGVLRLFLKYGQNDERVITGLKRI
SKPGLRVYAKASEMPKVLNGLGIALVSTSEGVITDKEARKRNVGGEIIAYVW
>P66630 ~~~rpsH~~~Small ribosomal subunit protein uS8~~~
MTMTDPIADMLTRVRNANMVRHEKLELPASNIKKEIAEILKSEGFIKNVEYVEDDKQGVLRLFLKYGQNDERVITGLKRI
SKPGLRVYAKASEMPKVLNGLGIALVSTSEGVITDKEARKRNVGGEIIAYVW
>P62668 ~~~rpsH~~~Small ribosomal subunit protein uS8~~~COG0096
MLTDPIADMLTRIRNATRVYKESTDVPASRFKEEILRILAREGFIKGYERVDVDGKPYLRVYLKYGPRRQGPDPRPEQVI
HHIRRISKPGRRVYVGVKEIPRVRRGLGIAILSTSKGVLTDREARKLGVGGELICEVW
>P0DOY9 ~~~rpsH~~~Small ribosomal subunit protein uS8~~~COG0096
MLTDPIADMLTRIRNATRVYKESTDVPASRFKEEILRILAREGFIKGYERVDVDGKPYLRVYLKYGPRRQGPDPRPEQVI
HHIRRISKPGRRVYVGVKEIPRVRRGLGIAILSTSKGVLTDREARKLGVGGELICEVW
>P24319 ~~~rpsH~~~Small ribosomal subunit protein uS8~~~
MLTDPIADMLTRIRNATRVYKESTDVPASRFKEEILRILAREGFIKGYERVDVDGKPYLRVYLKYGPRRQGPDPRPEQVI
HHIRRISKPGRRVYVGVKEIPRVRRGLGIAILSTSKGVLTDREARKLGVGGELICEVW
>P21470 ~~~rpsI~~~Small ribosomal subunit protein uS9~~~COG0103
MAQVQYYGTGRRKSSVARVRLVPGEGRIVVNNREISEHIPSAALIEDIKQPLTLTETAGTYDVLVNVHGGGLSGQAGAIR
HGIARALLEADPEYRTTLKRAGLLTRDARMKERKKYGLKGARRAPQFSKR
>P0A7X3 ~~~rpsI~~~Small ribosomal subunit protein uS9~~~COG0103
MAENQYYGTGRRKSSAARVFIKPGNGKIVINQRSLEQYFGRETARMVVRQPLELVDMVEKLDLYITVKGGGISGQAGAIR
HGITRALMEYDESLRSELRKAGFVTRDARQVERKKVGLRKARRRPQFSKR
>Q82Z47 ~~~rpsI~~~Small ribosomal subunit protein uS9~~~COG0103
MAQVQYSGTGRRKNAVARVRLVPGTGKITVNKKDVEEYIPHADLREVINQPFGVTETKGAYDVIVNVNGGGYAGQSGAIR
HGIARALLQVDPDFRSALKRAGLLTRDARMVERKKPGLKKARKASQFSKR
>P07842 ~~~rpsI~~~Small ribosomal subunit protein uS9~~~
MAQVQYYGTGRRKSSVARVRLVPGDGRIIVNKQDIREYIPTEALIEMVKQPLVLTETLGSYDVLVNVHGGGFAGQAGAIR
HGIARALLQVDPEFRTVLKRAGLLTRDARVKERKKYGLKGARRAPQFSKR
>A2RP61 ~~~rpsI~~~Small ribosomal subunit protein uS9~~~COG0103
MAQVQYAGTGRRKNAVARVRLVPGTGKITVNGREVESYIPHADMRLVINQPFAATQTEGSYDTLVNVNGGGVSGQAGAIR
HGIARALLQVDPDFRSALKRAGLLTRDARMVERKKPGLKKARKASQFSKR
>Q8Y459 ~~~rpsI~~~Small ribosomal subunit protein uS9~~~COG0103
MAQVQYYGTGRRKSSVARVRLVPGDGKIVINNRDWEDYIPFAALREVIKQPLVATETLGNYDVLVNVHGGGYTGQAGAIR
HGVARALLQVAPEYRPALKSAGLLTRDSRMKERKKPGLKGARRAPQFSKR
>P75179 ~~~rpsI~~~Small ribosomal subunit protein uS9~~~
MEKQSYYGLGRRKSSSAKVYLTPTQDKGKITVNRRDPSEYFPNKLVIQDMEQPLDLTDLKKNFDINVVVKGGGFTGQAGA
IRLGIVRALLQFNPELKKILKSKKLTTRDKRVKERKKFGLYGARRAPQFTKR
>A0QSP9 ~~~rpsI~~~Small ribosomal subunit protein uS9~~~COG0103
MTDVTETEVVTESAEPREPVIIDRPIQTVGRRKEAVVRVRLVPGTGQFNLDGRTLENYFPNKVHQQLIKAPLVTVDRVDQ
FDIYAHLDGGGPSGQAGALRLAIARALILVQPEDRPALKKAGFLTRDPRAIERKKYGLKKARKAPQYSKR
>P9WH25 ~~~rpsI~~~Small ribosomal subunit protein uS9~~~COG0103
MTETTPAPQTPAAPAGPAQSFVLERPIQTVGRRKEAVVRVRLVPGTGKFDLNGRSLEDYFPNKVHQQLIKAPLVTVDRVE
SFDIFAHLGGGGPSGQAGALRLGIARALILVSPEDRPALKKAGFLTRDPRATERKKYGLKKARKAPQYSKR
>Q9HVY3 ~~~rpsI~~~Small ribosomal subunit protein uS9~~~
MSATQNYGTGRRKTATARVFLRPGTGKISINNRGLDQFFGRETARMVVRQPLELTETVEKFDIFVTVVGGGVSGQAGAIR
HGITRALIEYDETLRSSLRKAGYVTRDAREVERKKVGLRKARKRPQYSKR
>Q6N650 ~~~rpsI~~~Small ribosomal subunit protein uS9~~~COG0103
MSETMQSLDQLAALKTTVTGADAPTYTKKVDKFGRAYATGKRKDAVARVWIKPGAGKITVNSREVETYFARPVLRMMIQQ
PLVAAARAGQYDVICTVAGGGLSGQAGAVRHGISKALTNFEPELRSVLKKGGFLTRDSRVVERKKYGKAKARRSFQFSKR
>P66643 ~~~rpsI~~~Small ribosomal subunit protein uS9~~~
MAENQYYGTGRRKSSAARVFIKPGNGKIVINQRSLEQYFGRETARMVVRQPLELVDMVEKLDLYITVKGGGISGQAGAIR
HGITRALMEYDESLRGELRKAGFVTRDARQVERKKVGLRKARRRPQFSKR
>Q2FW39 ~~~rpsI~~~Small ribosomal subunit protein uS9~~~COG0103
MTLAQVEYRGTGRRKNSVARVRLVPGEGNITVNNRDVREYLPFESLILDLNQPFDVTETKGNYDVLVNVHGGGFTGQAQA
IRHGIARALLEADPEYRGSLKRAGLLTRDPRMKERKKPGLKAARRSPQFSKR
>Q2YYM9 ~~~rpsI~~~Small ribosomal subunit protein uS9~~~
MTLAQVEYRGTGRRKNSVARVRLVPGEGNITVNNRDVREYLPFESLILDLNQPFDVTETKGNYDVLVNVHGGGFTGQAQA
IRHGIARALLEADPEYRGSLKRAGLLTRDPRMKERKKPGLKAARRSPQFSKR
>P66646 ~~~rpsI~~~Small ribosomal subunit protein uS9~~~
MAQVEYRGTGRRKNSVARVRLVPGEGNITVNNRDVREYLPFESLILDLNQPFDVTETKGNYDVLVNVHGGGFTGQAQAIR
HGIARALLEADPEYRGSLKRAGLLTRDPRMKERKKPGLKAARRSPQFSKR
>P62669 ~~~rpsI~~~Small ribosomal subunit protein uS9~~~COG0103
MEQYYGTGRRKEAVARVFLRPGNGKVTVNGQDFNEYFQGLVRAVAALEPLRAVDALGRFDAYITVRGGGKSGQIDAIKLG
IARALVQYNPDYRAKLKPLGFLTRDARVVERKKYGKHKARRAPQYSKR
>P80374 ~~~rpsI~~~Small ribosomal subunit protein uS9~~~COG0103
MEQYYGTGRRKEAVARVFLRPGNGKVTVNGQDFNEYFQGLVRAVAALEPLRAVDALGHFDAYITVRGGGKSGQIDAIKLG
IARALVQYNPDYRAKLKPLGFLTRDARVVERKKYGKHKARRAPQYSKR
>F8KAY7 3.1.6.19~~~pisa1~~~(R)-specific secondary-alkylsulfatase~~~
MSRFIRASQRRTLLATLIAATLAQPLLAAESLDSKPASAITAAKNAEVLKNLPFADREEFEAAKRGLIAPFSGQIKNAEG
QVVWDMGAYQFLNDKDAADTVNPSLWHQAQLNNIAGLFEVMPKLYQVRGLDPANMTIIEGDSGLVLIDTLTTAETARAAL
DLYFQHRPKKPIVAVVYSHSHIDHFGGARGIIDEADVKAGKVKVFAPSGFMEHAVSENILAGTAMARRGQYQSGVMVPRG
AQAQVDSGLFKTTATNATNTLVAPNVLIEKPYERHTVDGVELEFQLTLGSEAPSDMNIYLPQFKVLNTADNAPPAMHNLL
TPRGAEVRDAKAWAGYIDASLEKYGDRTDVLIQQHNWPVWGGDKVRTYLADQRDMYAFLNNRALNLMNKGLTLHEIAAEV
SKLPGELDRKWYLRSYYGALSTNLRAVYQRYLGFYDGNPANLDPFPPVEAGKRYVEAMGGADAVLKQMRAAIDKGDYRWA
VQLGNHLVFADPANKDARALQADAMEQLGYQTENALWRNMYMTGAMELRHGVPTYDSRGKSEMGRALTPDMFFDLLAIRL
DTDKAVGHDMTLNWVFEDLKQDIALTLRNGVLTQRVGSLNPKADVTVKLTKPTLDQIAARKLDLPTAIKQGTVKLDGDGK
KLGEFFGLLDSFSPKFNIVEPLE
>A8M783 3.13.2.3~~~~~~(R)-S-adenosyl-L-methionine hydrolase~~~COG1912
MASTPWISFTTDYGLADGFVAACHGVLARLAPTARVIDVTHLVPPGDVRRGAAVLAQTVPYLPAAVHVAVVDPGVGTARR
AIALTAGNGLLVGPDNGLLLDAATALGGVDAAVELTNPDWLGARMSATFHGRDVFAPVAARLALGAPLADAGPAVEPGAL
VRLPTPLVQPETDGFTAEVLTVDHFGNVQLAATGALLESLPRSLRVAHRPAVHARTFDDAPPGGLLVHVDSAGLVAVAVN
GGRAADLLAVTPGDQLRVTAG
>A4X4S2 3.13.2.3~~~~~~(R)-S-adenosyl-L-methionine hydrolase~~~COG1912
MAPTPWISFTTDYGLADGFVAACHGVLARLTPTTRVIDVTHLVPPGDVRRGAAVLAQAVPYLPAAVHLAVVDPGVGTARR
AIALAAGDGLLVGPDNGLLLDAAAALGGVRAAVELTNRDWLGADVSATFHGRDIFAPVAARLALGAPLADAGPAVEPSTL
VRLPVPLVRPEADGFTAEVLTVDHFGNVQLAASGSLLEPLPRSLRVERQPAVRVHTFGDVAPGELLVHVDSTGQVAVAVN
GGRAADLLGVTPGDRLRVTAG
>Q5SLF5 3.13.2.3~~~~~~(R)-S-adenosyl-L-methionine hydrolase~~~COG1912
MRPVYFLSDFGLEDPYVAVVKAVLAERAPGPAVVDLAHALPPQDLRRAAYALFEALPYLPEGAVVLAVVDPGVGTARRAV
AALGRWTYVGPDNGLFTLAWLLDPPRRAFLLEPPRPRPKAALPGWAPGEATFHGRDVFAPAAAHLALGLPPEGLGPEVPV
ETLARLPLALTEGPEGEVLTFDRFGNAITTLLRAPVGGFVEVGGRRVPVRRTFGEVPEGAPVAYLGSAGLLEVAVNRGSA
REALGLKEGMPVRLL
>Q75SP7 3.5.1.100~~~ramA~~~(R)-stereoselective amidase~~~
MKIELVQLAGRDGDTAYNLSRTLNAIATCAGDTDLLVFPETYLSGFVGGAQLAQVAEPLHGTTLQTLLQAVRQRDVAVVL
GFAEVHQGRFYNSSVLVTPEGIALQYRKTHLWPSERSDFSPGDRFTTVLWRGVRVGLLICYDIELPETSRALAQLGAEVV
IVTNGNMDPYGPVHRTAIMARAQENQLFAVMVNRVGAGDDGLVFAGGSMAVDPFGRVLFEAGRDEVRHVVELDLDQLKAA
RRDYDYLKDRRLMLSGEQTEHPDGRRELLIGASQ
>O07014 3.1.3.3~~~rsbP~~~Phosphoserine phosphatase RsbP~~~COG2208
MDKQLNDAPCGFLALSEEGSIIAANRTLIKILDYEPEQVIGQHMNMMLTIPAQLFCQLYFFPLLKLEHHIEEIYISLKAR
DGEEIPVLINAIARHDSGASVFDCVLIPMRKRNEYENELLIARNEAQEALLAKQKANAELEIALETLKAKQEELLEINKQ
NQQFKLNTKRELELARKIQKNSLTEPIVNDQVQIDSYYNASSELSGDLYGYYQIDEHRYGIIILDVMGHGISSALITMSL
HPLFQRQITQGLSPVKVMKELDRHLHSLFQNDEEARHYCTAIYLEIDIARQRIDYVNAGHPPALWQDDSGTQHLLHATSP
PIGMFEDLEFQSSSLSYTEDGRLLLYTDGVMDPTASCYLFDLLKDHPDSPIADLKEKILTSLQHQKEAHHKSDDECFILV
DVK
>O07015 ~~~rsbQ~~~Sigma factor SigB regulation protein RsbQ~~~COG0596
MNEAILSRNHVKVKGSGKASIMFAPGFGCDQSVWNAVAPAFEEDHRVILFDYVGSGHSDLRAYDLNRYQTLDGYAQDVLD
VCEALDLKETVFVGHSVGALIGMLASIRRPELFSHLVMVGPSPCYLNDPPEYYGGFEEEQLLGLLEMMEKNYIGWATVFA
ATVLNQPDRPEIKEELESRFCSTDPVIARQFAKAAFFSDHREDLSKVTVPSLILQCADDIIAPATVGKYMHQHLPYSSLK
QMEARGHCPHMSHPDETIQLIGDYLKAHV
>P42409 ~~~rsbRA~~~RsbT co-antagonist protein RsbRA~~~COG1366
MMSNQTVYQFIAENQNELLQLWTDTLKELSEQESYQLTDQVYENISKEYIDILLLSVKDENAAESQISELALRAVQIGLS
MKFLATALAEFWKRLYTKMNDKRLPDQESTELIWQIDRFFSPINTEIFNQYSISWEKTVSLQKIALQELSAPLIPVFENI
TVMPLVGTIDTERAKRIMENLLNGVVKHRSQVVLIDITGVPVVDTMVAHHIIQASEAVRLVGAKCLLAGIRPEIAQTIVN
LGIDLSQVITKNTLQKGIQTALEMTDRKIVSLGE
>O34860 ~~~rsbRB~~~RsbT co-antagonist protein RsbRB~~~COG1366
MKLNEKLYAFFSEHVEQMAEEWIETMEESDPNSLYALHNATVTEELKEQDREFYRHLNYMYVLPEKQFLEEFQEWVIELT
NDQKHLDTPVQYVIREFMRNRRLYTKYFEKFAEENESAFEPGEKQKWADLIVKVFDFTIYTFVDHAEMNAKQQLNAQREM
ILELSSPVITLSKSTALLPLVGDIDTERAKFILENTLQACAKRRVEHLLIDLSGVVVVDTMVAHQIFKLIEALNLIGVRS
TLSGIRPEIAQTAVQLGIDFSNITIKTNLAQALNYHQ
>O31856 ~~~rsbRC~~~RsbT co-antagonist protein RsbRC~~~COG1366
MAKNKKLFEYLSQHAETISSTWYETIEETDPNSIYASTDPVVIHNLKSQNLAFNYKINRIFIDDEDVYLPILKEWAFEVT
QDQEHLKTPIHYIIREFVRVRDLYVSYVKEFVHLNQNTVKSEEAEDLYHALIKAFDLVIHIFIEEMYKNTSLQLQAQKDM
ITELSAPVIVLFHSVGLLPLIGDIDTVRAKLIMENTLHQCAKKKVTQLYIDLSGVAVIDTMVAHQLFSLIEALRLIGVSS
TLSGIRPEIAQTAVQLGLSFEGISLRSTLASAIASDLKLKKV
>P54504 ~~~rsbRD~~~RsbT co-antagonist protein RsbRD~~~COG1366
MIALDQHLTEHKKDITQQWLEVCTSNGSWLYSAKDQQKLEQKLKDQHELLVTIVAKSLRKEDVEDELNRWSLQCARDRAV
HEVTVTQSVGQFNTFRHIMFEWIHKFSEASSQDISIQEFYEWSRILNQNIDEIIEVFTEEYHQVTMIQLNAQKEMINELS
APIMPITDGIGILPLVGEIDTHRARTILESVLEQCSALKLSYLFLDISGVPIVDTMVAYQIFKVIDSTKLLGIETIISGI
RPEIAQTVVKLGLDFSNVKTEQSLAKALANKGFKIKEC
>P42410 ~~~rsbS~~~RsbT antagonist protein RsbS~~~COG1366
MRHPKIPILKLYNCLLVSIQWELDDQTALTFQEDLLNKIYETGANGVVIDLTSVDMIDSFIAKVLGDVITMSKLMGAKVV
LTGIQPAVAVTLIELGIALEEIETALDLEQGLETLKRELGE
>P42411 2.7.11.1~~~rsbT~~~Serine/threonine-protein kinase RsbT~~~COG2172
MNDQSCVRIMTEWDIVAARQLGRNVAKELGFGTVDQARITTAISELARNIYLYAGKGQIGIEQVADRGKKGLKIIAEDQG
PGIPDIRKVMEDGFSTSGGLGAGLPGVKRLMDEFSLNSVAGEGTEIQAIKWLR
>P40399 3.1.3.3~~~rsbU~~~Phosphoserine phosphatase RsbU~~~COG2208
MDFREVIEQRYHQLLSRYIAELTETSLYQAQKFSRKTIEHQIPPEEIISIHRKVLKELYPSLPEDVFHSLDFLIEVMIGY
GMAYQEHQTLRGIQQEIKSEIEIAANVQQTLLGTKVPQEEALDIGAISVPAKQMSGDYYHFVKDKESINIAIADVIGKGI
PAALCMSMIKYAMDSLPETGIHPSQVLKNLNRVVEQNVDASMFITMFYANYNMDKHQFTYASAGHEPGFYYSQKDNTFYD
LEAKGLVLGISQDYDYKQFDQHLEKGDMIVLFSDGVTECRTENGFLERPDLQKLIEEHMCSSAQEMVKNIYDSLLKLQDF
QLHDDFTLIVLRRKV
>P17903 ~~~rsbV~~~Anti-sigma-B factor antagonist~~~COG1366
MNINVDVKQNENDIQVNIAGEIDVYSAPVLREKLVPLAEQGADLRICLKDVSYMDSTGLGVFVGTFKMVKKQGGSLKLEN
LSERLIRLFDITGLKDIIDISAKSEGGVQ
>P66838 ~~~rsbV~~~Anti-sigma-B factor antagonist~~~
MNLNIETTTQDKFYEVKVGGELDVYTVPELEEVLTPMRQDGTRDIYVNLENVSYMDSTGLGLFVGTLKALNQNDKELYIL
GVSDRIGRLFEITGLKDLMHVNEGTEVE
>Q9WVX8 ~~~rsbV~~~Anti-sigma-B factor antagonist~~~COG1366
MDLSLSTRTVGDRTVVEVGGEIDVYTAPKLREQLVELVNDGSFHLVVDMEGVDFLDSTGLGVLVGGLKRVRAHEGSLRLV
CNQERILKIFRITGLTKVFPIHTSVEEAVAATD
>P17904 2.7.11.1~~~rsbW~~~Serine-protein kinase RsbW~~~COG2172
MKNNADYIEMKVPAQPEYVGIIRLTLSGVASRMGYTYDEIEDLKIAVSEACTNAVQHAYKEDKNGEVSIRFGVFEDRLEV
IVADEGDSFDFDQKQQDLGPYTPSHTVDQLSEGGLGLYLMETLMDEVRVQNHSGVTVAMTKYLNGERVDHDTTIKNYETN
>P9WGX7 ~~~rsbW~~~Anti-sigma-F factor RsbW~~~COG2172
MTDQLEDQTQGGSTVDRSLPGGCMADSDLPTKGRQRGVRAVELNVAARLENLALLRTLVGAIGTFEDLDFDAVADLRLAV
DEVCTRLIRSALPDATLRLVVDPRKDEVVVEASAACDTHDVVAPGSFSWHVLTALADDVQTFHDGRQPDVAGSVFGITLT
ARRAASSR
>P0A0H7 2.7.11.1~~~rsbW~~~Serine-protein kinase RsbW~~~
MQSKEDFIEMRVPASAEYVSLIRLTLSGVFSRAGATYDDIEDAKIAVSEAVTNAVKHAYKENNNVGIINIYFEILEDKIK
IVISDKGDSFDYETTKSKIGPYDKDENIDFLREGGLGLFLIESLMDEVTVYKESGVTISMTKYIKKEQVRNNGERVEIS
>P17906 3.1.3.3~~~rsbX~~~Phosphoserine phosphatase RsbX~~~COG2208
MIQVEENEHIQTLVYQLNKEGKSICGDSFFMKADDKELICAVADGLGSGSLANESSAAIKDLVENYASEDVESIIERCNQ
AMKNKRGATASILKINFEQRQFTYCSVGNVRFILHSPSGESFYPLPISGYLSGKPQKYKTHTATYEKGSKFIIHTDGLNV
PDIRSHLKKGQSVEEISNSLKMYTTSRKDDLTYILGQLS
>P9WJ71 ~~~rsdA~~~Anti-sigma-D factor RsdA~~~
MREFGNPLGDRPPLDELARTDLLLDALAEREEVDFADPRDDALAALLGQWRDDLRWPPASALVSQDEAVAALRAGVAQRR
RARRSLAAVGSVAAALLVLSGFGAVVADARPGDLLYGLHAMMFNRSRVSDDQIVLSAKANLAKVEQMIAQGQWAEAQDEL
AEVSSTVQAVTDGSRRQDLINEVNLLNTKVETRDPNATLRPGSPSNPAAPGSVGNSWTPLAPVVEPPTPPTPASAAEPSM
SAGVSESPMPNSTSTVAASPSTPSSKPEPGSIDPSLEPADEATNPAGQPAPETPVSPTH
>P0AFX4 ~~~rsd~~~Regulator of sigma D~~~COG3160
MLNQLDNLTERVRGSNKLVDRWLHVRKHLLVAYYNLVGIKPGKESYMRLNEKALDDFCQSLVDYLSAGHFSIYERILHKL
EGNGQLARAAKIWPQLEANTQQIMDYYDSSLETAIDHDNYLEFQQVLSDIGEALEARFVLEDKLILLVLDAARVKHPA
>P0AFX7 ~~~rseA~~~Anti-sigma-E factor RseA~~~COG3073
MQKEQLSALMDGETLDSELLNELAHNPEMQKTWESYHLIRDSMRGDTPEVLHFDISSRVMAAIEEEPVRQPATLIPEAQP
APHQWQKMPFWQKVRPWAAQLTQMGVAACVSLAVIVGVQHYNGQSETSQQPETPVFNTLPMMGKASPVSLGVPSEATANN
GQQQQVQEQRRRINAMLQDYELQRRLHSEQLQFEQAQTQQAAVQVPGIQTLGTQSQ
>A0R2D3 ~~~rseA~~~Anti-sigma-E factor RseA~~~COG5662
MADPGHVFRRAFSWLPSQFASQSDAPVGAPRQFGSTEHLSVEAIAAFVDGELRMSAHLRAAHHLSLCPECAAEVDAQSQA
RTALRESCPIAIPNSLLGMLSQIPHRTPEVTPDVSEQAKFADDPTRGRRKRR
>L0T905 ~~~rseA~~~Anti-sigma-E factor RseA~~~COG5662
MADPGSVGHVFRRAFSWLPAQFASQSDAPVGAPRQFRSTEHLSIEAIAAFVDGELRMNAHLRAAHHLSLCAQCAAEVDDQ
SRARAALRDSHPIRIPSTLLGLLSEIPRCPPEGPSKGSSGGSSQGPPDGAAAGFGDRFADGDGGNRGRQSRVRR
>D0ZSY8 ~~~rseA~~~Anti-sigma-E factor RseA~~~
MQKEKLSALMDGETLDSELLKALTHDPEMQKTWESYHLIRDSMRGDTPDVLHFDISARVMAAIENEPVRQVSPLIPEAQP
APQQWQKMPFWKKVRPWAAQLTQMGVAACVSLAVIVGVQHYNGQSETSQQPETPVFNTLPMMGKASPVSLGVPSEAAPVG
SQQQQVQEQRRRINAMLQDYELQRRLHSEQLQFEQAQTQQAAVQVPGIQTLGTQSQ
>P0AFX9 ~~~rseB~~~Sigma-E factor regulatory protein RseB~~~COG3026
MKQLWFAMSLVTGSLLFSANASATPASGALLQQMNLASQSLNYELSFISINKQGVESLRYRHARLDNRPLAQLLQMDGPR
REVVQRGNEISYFEPGLEPFTLNGDYIVDSLPSLIYTDFKRLSPYYDFISVGRTRIADRLCEVIRVVARDGTRYSYIVWM
DTESKLPMRVDLLDRDGETLEQFRVIAFNVNQDISSSMQTLAKANLPPLLSVPVGEKAKFSWTPTWLPQGFSEVSSSRRP
LPTMDNMPIESRLYSDGLFSFSVNVNRATPSSTDQMLRTGRRTVSTSVRDNAEITIVGELPPQTAKRIAENIKFGAAQ
>P46187 ~~~rseC~~~Protein RseC~~~COG3086
MIKEWATVVSWQNGQALVSCDVKASCSSCASRAGCGSRVLNKLGPQTTHTIVVPCDEPLVPGQKVELGIAEGSLLSSALL
VYMSPLVGLFLIASLFQLLFASDVAALCGAILGGIGGFLIARGYSRKFAARAEWQPIILSVALPPGLVRFETSSEDASQ
>P0AEH1 3.4.24.-~~~rseP~~~Regulator of sigma-E protease RseP~~~COG0750
MLSFLWDLASFIVALGVLITVHEFGHFWVARRCGVRVERFSIGFGKALWRRTDKLGTEYVIALIPLGGYVKMLDERAEPV
VPELRHHAFNNKSVGQRAAIIAAGPVANFIFAIFAYWLVFIIGVPGVRPVVGEIAANSIAAEAQIAPGTELKAVDGIETP
DWDAVRLQLVDKIGDESTTITVAPFGSDQRRDVKLDLRHWAFEPDKEDPVSSLGIRPRGPQIEPVLENVQPNSAASKAGL
QAGDRIVKVDGQPLTQWVTFVMLVRDNPGKSLALEIERQGSPLSLTLIPESKPGNGKAIGFVGIEPKVIPLPDEYKVVRQ
YGPFNAIVEATDKTWQLMKLTVSMLGKLITGDVKLNNLSGPISIAKGAGMTAELGVVYYLPFLALISVNLGIINLFPLPV
LDGGHLLFLAIEKIKGGPVSERVQDFCYRIGSILLVLLMGLALFNDFSRL
>P39650 ~~~rsfA~~~Prespore-specific transcriptional regulator RsfA~~~
MTKQRQDAWSEENDLLLAETVLRHVREGSTQLNAFEEVGDKLNRTSAACGFRWNAVVRHQYEKALQLAKKQRKQRMRALG
NGQPAKKRLLYQPPAVDPEIIQETAAEEPVKTETPSVENEQPLMSGEHMPFVDESFKEELASLSHLLSPQTPAAAPEMTM
NDVIRFLQNYEGNHEQSSALKMENERLKKENQELQNKTEQLEAEVQKLEKDQKTIQEDYETLVKIMNRARKLVLFEEDEH
ASPSFKMDRNGNLEKLAE
>P9WGE3 ~~~rsfA~~~Anti-sigma-F factor antagonist RsfA~~~COG1366
MNPTQAGSFTTPVSNALKATIQHHDSAVIIHARGEIDAANEHTWQDLVTKAAAATTAPEPLVVNLNGLDFMGCCAVAVLA
HEAERCRRRGVDVRLVSRDRAVARIIHACGYGDVLPVHPTTESALSAT
>P9WGE1 ~~~rsfB~~~Anti-sigma-F factor antagonist RsfB~~~COG1366
MSAPDSITVTVADHNGVAVLSIGGEIDLITAAALEEAIGEVVADNPTALVIDLSAVEFLGSVGLKILAATSEKIGQSVKF
GVVARGSVTRRPIHLMGLDKTFRLFSTLHDALTGVRGGRIDR
>O66555 3.6.1.-~~~rsgA~~~Small ribosomal subunit biogenesis GTPase RsgA~~~COG1162
MGKKELKRGLVVDREAQMIGVYLFEDGKTYRGIPRGKVLKKTKINAGDYVWGEVVDPNTFAIEEVEERKNLLIRPKVANV
DRVIIVETLKMPEFNNYLLDNMLVVYEYFKVEPVIVFNKIDLLNEEEKKELERMDSIYRDAGYDVLKVSAKTGEGIDELV
DYLEGFICILAGPSGVGKSSILSRLTGEELRTQEVSEKTERGRHTTTGVRLIPFGKGSFVGDTPGFSKVEATMFVKPREV
RNYFREFLRYQCKYPDCTHTNEPGCAVKEAVKNGEISCERYKSYLKIIKVYLEEIKELCRED
>O34530 3.6.1.-~~~rsgA~~~Small ribosomal subunit biogenesis GTPase RsgA~~~COG1162
MPEGKIIKALSGFYYVLDESEDSDKVIQCRGRGIFRKNKITPLVGDYVVYQAENDKEGYLMEIKERTNELIRPPICNVDQ
AVLVFSAVQPSFSTALLDRFLVLVEANDIQPIICITKMDLIEDQDTEDTIQAYAEDYRNIGYDVYLTSSKDQDSLADIIP
HFQDKTTVFAGQSGVGKSSLLNAISPELGLRTNEISEHLGRGKHTTRHVELIHTSGGLVADTPGFSSLEFTDIEEEELGY
TFPDIREKSSSCKFRGCLHLKEPKCAVKQAVEDGELKQYRYDHYVEFMTEIKDRKPRY
>P39286 3.6.1.-~~~rsgA~~~Small ribosomal subunit biogenesis GTPase RsgA~~~COG1162
MSKNKLSKGQQRRVNANHQRRLKTSKEKPDYDDNLFGEPDEGIVISRFGMHADVESADGDVHRCNIRRTIRSLVTGDRVV
WRPGKPAAEGVNVKGIVEAVHERTSVLTRPDFYDGVKPIAANIDQIVIVSAILPELSLNIIDRYLVACETLQIEPIIVLN
KIDLLDDEGMAFVNEQMDIYRNIGYRVLMVSSHTQDGLKPLEEALTGRISIFAGQSGVGKSSLLNALLGLQKEILTNDIS
DNSGLGQHTTTAARLYHFPHGGDVIDSPGVREFGLWHLEPEQITQGFVEFHDYLGLCKYRDCKHDTDPGCAIREAVEEGK
IAETRFENYHRILESMAQVKTRKNFSDTDD
>Q9HUL3 3.6.1.-~~~rsgA~~~Small ribosomal subunit biogenesis GTPase RsgA~~~
MAKRHLTRRQSWRIEKIQEERAARAARRESRAVEELEGGDLGPEQTGQVIAHFGVQVEVESADGQVSRCHLRANLPALVT
GDQVVWRAGNQGIGVIVAQLPRRSELCRPDMRGLLKPVAANVDRIVIVFAPRPEPHANLIDRYLIAAEHAGIQPLLLLNK
ADLVDESNAEGIDALLNVYRTLGYPLIEVSAFNGLAMDELRGALDGHVSVFVGQSGVGKSSLVNALLPGVDTRVGDLSTV
TGKGTHTTTTARLFHFPGGGDLIDSPGIREFGLGHVSRDDVEAGFIEFRDLLGHCRFRDCKHDREPGCALLQALEDGRIM
PQRMASYRHILASMPETDY
>Q8ZKB0 3.6.1.-~~~rsgA~~~Small ribosomal subunit biogenesis GTPase RsgA~~~
MSKNKLSKGQQRRVNANHQRRLKTSAEKADYDDNLFGEPAEGIVISRFGMHADVESADGEVHRCNIRRTIRSLVTGDRVV
WRPGKAAAEGVNVKGIVEAVHERTSVLTRPDFYDGVKPIAANIDQIVIVSAILPELSLNIIDRYLVGCETLQVEPLIVLN
KIDLLDDEGMDFVNEQMDIYRNIGYRVLMVSSHTQDGLKPLEEALTGRISIFAGQSGVGKSSLLNALLGLQNEILTNDVS
NVSGLGQHTTTAARLYHFPHGGDVIDSPGVREFGLWHLEPEQITQGFVEFHDYLGHCKYRDCKHDADPGCAIREAVENGA
IAETRFENYHRILESMAQVKTRKNFSDTDD
>Q9KX08 3.6.1.-~~~rsgA~~~Small ribosomal subunit biogenesis GTPase RsgA~~~
MKTGRIVKSISGVYQVDVNGERFNTKPRGLFRKKKFSPVVGDIVEFEVQNINEGYIHQVFERENELKRPPVSNIDTLVIV
MSAVEPNFSTQLLDRFLVIAHSYQLNARILVTKKDKTPIEKQFEINELLKIYENIGYETEFIGNDDDRKKIVEAWPAGLI
VLSGQSGVGKSTFLNHYRPELNLETNDISKSLNRGKHTTRHVELFERQNGYIADTPGFSALDFDHIDKDEIKDYFLELNR
YGETCKFRNCNHIKEPNCNVKHQLEIGNIAQFRYDHYLQLFNEISNRKVRY
>P67682 3.6.1.-~~~rsgA~~~Small ribosomal subunit biogenesis GTPase RsgA~~~
MKTGRIVKSISGVYQVDVNGERFNTKPRGLFRKKKFSPVVGDIVEFEVQNINEGYIHQVFERKNELKRPPVSNIDTLVIV
MSAVEPNFSTQLLDRFLVIAHSYQLNARVLVTKKDKTPIEKQFEINELLKIYENIGYETEFIGNDDDRKKIVEAWPAGLI
VLSGQSGVGKSTFLNHYRPELNLETNDISKSLNRGKHTTRHVELFERQNGYIADTPGFSALDFDHIDKDEIKDYFLELNR
YGETCKFRNCNHIKEPNCNVKHQLEIGNIAQFRYDHYLQLFNEISNRKVRY
>Q9X242 3.6.1.-~~~rsgA~~~Small ribosomal subunit biogenesis GTPase RsgA~~~COG1162
MNLRRRGIVVSFHSNMVTVEDEETGERILCKLRGKFRLQNLKIYVGDRVEYTPDETGSGVIENVLHRKNLLTKPHVANVD
QVILVVTVKMPETSTYIIDKFLVLAEKNELETVMVINKMDLYDEDDLRKVRELEEIYSGLYPIVKTSAKTGMGIEELKEY
LKGKISTMAGLSGVGKSSLLNAINPGLKLRVSEVSEKLQRGRHTTTTAQLLKFDFGGYVVDTPGFANLEINDIEPEELKH
YFKEFGDKQCFFSDCNHVDEPECGVKEAVENGEIAESRYENYVKMFYELLGRRKK
>A3DBH1 ~~~rsgI1~~~Anti-sigma-I factor RsgI1~~~COG4447
MNRLGIIYEIQGMKAVVLTSEGEFLIIRRRKDMKVGQQVSFENEDIYNVRGKRFLYVAAAVSSVAAVLVVMFLYFQSAFL
SNTDNIYGYICVDINPSVELVIDETCRVLEVRPQNKDGEQLISGLELLDKNVEDVVYELINRSISFGFVKADDNRKIVLI
SGALNDKRNELKTKKENDEAELTELLDNIKARVDRIDNIKVRTITATSRERKDALKYGLSMGKYCLYLEAQELNGSITID
EVHDMSISDMIEKLEQMKLALKDEASPKLQTTPTLGGETAQISPESMQHSTVPGLPETPSSSEKTIAPTLHGTPGVPDEK
TLQPSTPTESSEYVQDGTKGLKIQYYSRKPHDSAGIDFSFRMFNTGNEAIDLKDVKVRYYFKEDVSIDEMNWAVYFYSLG
SEKDVQCRFYELPGKKEANKYLEITFKSGTLSPNDVMYITGEFYKNDWTKFEQRDDYSYNPADSYSDWKRMTAYISNKLV
WGIEPN
>A3DC27 ~~~rsgI2~~~Anti-sigma-I factor RsgI2~~~COG4447
MSHYTGIILKLESDRAIVLTDGLDFMELKLKPGMQRGQHVIFDESDLYSAGLITRYKSIIMPFSAFAAAAAVFLVILFSL
RFVSISQEYAYIDVDINPSIGLVIDKKEKVIDAKPLNNDAKPILDEAAPKDMPLYDALSKILDISKKNGYINSADNIVLF
SASINSGRNNVSESDKGIQEIISTLKDVAKDAGVKFEIIPSTEEDRQKALDQNLSMGRYAIYVKAVEEGVNLNLEDARNL
SVSEILGKVNIGKFAISDTPEDSGIMPAISVPAEPVPSVTPAYTAVPEKTEAQPVDIPKSSPTPASFTAHVPTPPKTPSI
PHTSGPAIVHTPAADKTTPTFTGSSTPVPTNVVAIASTPVPVSTPKPVSTPAYSSTPTPESTPVPVSTPKPASTPTPAST
PKPVSTPTHVSTPKPISTPTSTPRPASTPKPTSTPTPESTPKPTSTPAPVSTPTSTPIPTYTSTPASTPIPAYTSTPTSI
PTLTPATSPAPTSSPTPIPSPAPTETDLLTKIELQAYNHIRTSETKELQPRIKLINTGNTPITLSEVKIRYYYTKDQVIN
EIYTCDWSNITSSKITGTVVQMSNPKPNADSYVEIGFTNSAGVLNPGEYVEIISRIGNSYALSLATPPYSEWNYMYDQNS
DYSFNNSSSDFVVWDKITVYISGTLYWGIEP
>A3DC75 ~~~rsgI3~~~Anti-sigma-I factor RsgI3~~~COG2133
MDNIGVIIKIEGNEAIVMTDDCSFKKVPIKDGMHPGQKILVPNNEVIQKENKSIKRISAVATGIAAVFLMVLSLIWINKP
GRPDGIYAYIDVDINPSLNFLIDREGKVKALNPLNDDAQEIIRGVEFEDMFFSEALTQIIKISKAKGIIDENKTNYVLIC
AALDDNYNLQSDDKSRAQTEFEEFLDGIRESIEKACGNTVIPQTVKVPFEYLKMAKQNDVSMGRYLVYQKLEDIGVNLSI
EELKSLDIDEILKKYGVGFDELFKSEYTELPYGTLQTGEDSVVSTEDVPVSPKNAFETMAVPTNTPSISTKPSATPAENP
TPKLTQKPTPVPAKTGERTSTTPTPTPAPTVRNGTGSGLRGEYYNNMDFSRFQFVRIDPCIDFDWGEGTPDQSIGKDTYS
VRWTGKVEPRYSETYTFYTVTDDGVRLWVDGVLLIDKWKSQSATEHSEQIYLEAGKKYDIKMEYYQHVRAASAKLMWSSK
SQQKEIIPSSQLYPSDGPLPQKDVNGLSAEYYGDAELKDKRFTRIDDAINFNWDKDFPVGELKDGKFSVRWVGKIDTRYT
EEYTFHTVANGGVRVWINNVLIIDNWQNQGKEAENSGKIELKAGRQYDIKVEYCNYGEPAFIKLLWSSQRQKKEVVPSKN
LFAD
>A3DCG3 ~~~rsgI4~~~Anti-sigma-I factor RsgI4~~~COG4447
MNLGVVIKIKRKKAIIVTETGEFKAVNARNGMFLGQKILFDQQDVIENNRNGIGLAYSAAIAGMVAVFVFMFTYFGLHNF
NGTFAYVDVDINPSVEFAVNRDGIVVNAEPLNDDGRKVLEELIYKDALLEDVILDLVDKSRKYGFIEDNDRKNIILISAA
LNSDEQEQRNDFEKKLVDNLMPELENLDVNIEMRFVIASKEQRKKAQENKVSMGKYMIYEMARRQGEKLTLESIMSETLE
NLLLGQDFGVIETEKTPVNTPVKSTATPTKALAAEITPTKTPEQVVMTPANTPAKPTAAPTKAPAAVAVTSAKTPERATT
VPVNTPVKPTDAPTKSPATATATATRAPVKATATPAKTLKPSDTPVKTPDGEQSVKVRFYNNNTLSETGVIYMRINVINT
GNAPLDLSDLKLRYYYTIDSESEQRFNCDWSSIGAHNVTGSFGKVNPSRNGADTYVEIGFTKEAGMLQPGESVELNARFS
KTDNTQYNKADDYSFNSHYYEYVDWDRITAYISGILKWGREP
>A3DH97 3.2.1.8~~~rsgI6~~~Anti-sigma-I factor RsgI6~~~COG3693
MIVGKVLDMDEKTAIIMTDDFAFLNVVRTSEMAVGKKVKVLDSDIIKPKNSLRRYLPVAAVAACFVIVLSFVLMFINGNT
ARKNIYAYVGIDINPSIELWINYNNKIAEAKALNGDAETVLEGLELKEKTVAEAVNEIVQKSMELGFISREKENIILIST
ACDLKAGEGSENKDVQNKIGQLFDDVNKAVSDLKNSGITTRILNLTLEERESSKEENISMGRYAVYLKAKEQNVNLTIDE
IKDADLLELIAKVGIDNENVPEDIVTEDKDNLDAINTGPAESAVPEVTETLPATSTPGRTEGNTATGSVDSTPALSKNET
PGKTETPGRTFNTPAKSSLGQSSTPKPVSPVQTATATKGIGTLTPRNSPTPVIPSTGIQWIDQANERINEIRKRNVQIKV
VDSSNKPIENAYVEAVLTNHAFGFGTAITRRAMYDSNYTKFIKDHFNWAVFENESKWYTNEPSMGIITYDDADYLYEFCR
SNGIKVRGHCIFWEAEEWQPAWVRSLDPFTLRFAVDNRLNSAVGHFKGKFEHWDVNNEMIHGNFFKSRLGESIWPYMFNR
AREIDPNAKYFVNNNITTLKEADDCVALVNWLRSQGVRVDGVGVHGHFGDSVDRNLLKGILDKLSVLNLPIWITEYDSVT
PDEYRRADNLENLYRTAFSHPSVEGIVMWGFWERVHWRGRDASIVNDNWTLNEAGRRFESLMNEWTTRAYGSTDGSGSFG
FRGFYGTYRITVTVPGKGKYNYTLNLNRGSGTLQTTYRIP
>A3DIE5 ~~~rsgI7~~~Anti-sigma-I factor RsgI7~~~
MRAMVVDMNDKYAVVVNKEGQYIKIKRKAEHRLGYQVELPDRVIGFERRTLLKVVSVAAALLIVSSISFAVYSYNLPYSY
VNVDINPSLEIILNMYNRIIDVKALNSEGEMLIEDSYKNSRLDEGVEKIIDSAVAQGFLKNDEENTIMLTVAGKNSRKVL
EIKEEVESTANKVLNDDNVVSEVIVENIVLERREEARELGIAPGKLLLIEKLKEVDPKATTEEYKDKPVNEIVKTIRDIK
KVPNENNRKDDDKKVNNEPNKPLPDRKADVETSAGVKENTAGPDAGIKPVNKTDNAKPNVGTDINNKENKTVSNAKIDSG
IDKGNKDSKPNSNTKINNDVKKDNKDNKTNSDAKTFNDVSKDNKNDKADGNAKINNNINRDNKITPINPDNKFSSGGSKD
DKDNKHVDSKDKMNNEDNKNINNGSCPQYNPYWNPYWNPYWNPYWGNPKEKEDMTKQNDEWFKKMQEEQKKQYDEWLKKM
QEEQKKQHDEWVKKMEEMKNTEKMKNPYQENKIEKPKEAEKENKPDRPPEPGKEILKKRC
>A3DC20 ~~~rsgI9~~~Anti-sigma-I factor RsgI9~~~COG0265
MKITGVIVRIHKDRAIIRTDDNRLLAVKRHNDMMVGQIVSFDANEVHKVESKKYKYAASGKRIEKVQKTPKIKNFSRINN
IKEFSRVDDIKNFSRVAATKETSQDSPQESKVENFSRVVDFSRVMNFSRVSNSKKNEIKNFSRISNIKNFSRIASIAAAF
VLIFLFGRNVMLNNSSDSEYAYVSVDVNPSVEFTINSKHKVIVTSAINQDASEVLDGLELKEKDLKSALVMVLEKAESLG
YISDDKNYVLVSMALNDKNKKTRDKREEKIDELKETIEQGIEALDNDTIVHRTVTVDLEERNKALENELSMGRYYLYLEA
KEKGMDITIDEVKSSKISDLIEKIEDNTELAPTPTPVPPETPEPTPTPTASEATPSNSPVESKSPEAVPELGSREIEILG
ESVVLVTAYDENRKVVSQGSGFAVGTGLFATNYHLVKDGVVVKITAGDGKVYDVDGIVKYDKAKDLALLKTRVETGVNPL
KLGTKKSLTKGSRIVAIGKANGAKNTVTKGSIKSLKVDGLTDAIELSASISKESTGGPVFDMKGNVVGITAYGISKQNVN
AVIPADYVADWVKELSKHSFGNIRIVRKTLVFDSDFEFNFVVYKIIRALENEDAATYFGCMTDELYKDETRKNLEVLFTT
YDLAYNIESINVVSKSEEQAKVSYVYTINKEAGPNFKNYRIIGECSLIKVDGTWKINDSEEKKEYIQ
>P0DO99 ~~~rsgI~~~Anti-sigma-I factor RsgI~~~
MMNKGIVMDIKKHSVVVLTPNGEFITCKRKGDSCMIGEEISFDEQEQKASRFSIPYFLKPASLLVACFLCALLFFYNQPE
EKVFAYVSVDINPSLEVSVTKDLRVIDLQACNDDGRRILKELKQWENKQLQEVIRTIIKQSQEDKYLTNDKQVMLTAIAK
DKLLEPKLEKVMKELKKEYELKHITVEYQSSTMQVRENAIKAGIGTGVYIKQENEKNKSVTPPATPSNPVENEEERQSQP
DSSPDVVPDLSSVKDKKYEKPEYKEQKKIEEQPTKQIKENNGRGSQQENRGNQQENNGRESQQGNNGNQQGNNGRESQQG
NNGNQQGNNGRGSQGNNGHQQENNGRGSQGNNGNQQGNNGRGSQGNNGHQQENNGRGSQGNNGNQQGDNGRGSQQGNNGN
QQGDNGRGSQKENVGNEQGNNGRGSQQENRGHQQGNEKKNQ
>O31655 ~~~rsgI~~~Anti-sigma-I factor RsgI~~~
MRRGIIVEKNKKFVTLLTPDGQFLKAKNDRHSYEIGEEIMLPSETRMGRRASFFDFFKLRPFKMGIFTMTAIMLFIFIVL
PVFSNNKAYAYMTIDINPSVEMALNSDYEVIELTPLNDEGQKVVNDIDDWEKTDFKKVIDDIITDCSEHGYVKKSKEILI
STVYENTEDNTYKKAVKKQLNDVTEKYKTTYRMESLESDMQTREKAKKEGVSTGSYIKSNEKNDNKDIKDDSSKPSGEED
QKSDENEDENTDQTDTQDSKQGDNEQLNDADSGDQKEEKADDQIDDSDKDKKIKESDENTNTEKDGDHEQTPIQDPQDKG
NENNGADKGQSQYHRDWNNGEQGKNRSSSRRDNASDRRNPNGYSSDNHSAKNEDSPSAPGE
>A0QTP3 ~~~rshA~~~Anti-sigma factor RshA~~~COG5662
MSETEREDERWTPPIGPIDPEHPECAAVIAEVWTLLDGECTPETRDKLKQHLEECPTCLRHYGIEERVKRLIAAKCSGEK
APDSLRERLRIQISRTTIIRG
>P9WJ68 ~~~rshA~~~Anti-sigma factor RshA~~~
MSENCGPTDAHADHDDSHGGMGCAEVIAEVWTLLDGECTPETRERLRRHLEACPGCLRHYGLEERIKALIGTKCRGDRAP
EGLRERLRLEIRRTTIIRGGP
>P9WJ69 ~~~rshA~~~Anti-sigma factor RshA~~~COG5662
MSENCGPTDAHADHDDSHGGMGCAEVIAEVWTLLDGECTPETRERLRRHLEACPGCLRHYGLEERIKALIGTKCRGDRAP
EGLRERLRLEIRRTTIIRGGP
>P62611 ~~~rpsU~~~Small ribosomal subunit protein bTHX~~~
MGKGDRRTRRGKIWRGTYGKYRPRKKK
>P62613 ~~~rpsU~~~Small ribosomal subunit protein bTHX~~~
MGKGDRRTRRGKIWRGTYGKYRPRKKK
>Q5SIH3 ~~~rpsU~~~Small ribosomal subunit protein bTHX~~~
MGKGDRRTRRGKIWRGTYGKYRPRKKK
>P62612 ~~~rpsU~~~Small ribosomal subunit protein bTHX~~~
MGKGDRRTRRGKIWRGTYGKYRPRKKK
>Q57E90 2.7.6.5~~~rsh~~~GTP pyrophosphokinase rsh~~~
MMRQYELVERVQRYKPDVNEALLNKAYVYAMQKHGSQKRASGDPYFSHPLEVAAILTDMHLDEATIAIALLHDTIEDTTA
TRQEIDQLFGPEIGKLVEGLTKLKKLDLVSKKAVQAENLRKLLLAISEDVRVLLVKLADRLHNMRTLGVMREDKRLRIAE
ETMDIYAPLAGRMGMQDMREELEELAFRYINPDAWRAVTDRLAELLEKNRGLLQKIETDLSEIFEKNGIKASVKSRQKKP
WSVFRKMETKGLSFEQLSDIFGFRVMVDTVQDCYRALGLIHTTWSMVPGRFKDYISTPKQNDYRSIHTTIIGPSRQRIEL
QIRTREMDEIAEFGVAAHSIYKDRGSANNPHKISTETNAYAWLRQTIEQLSEGDNPEEFLEHTKLELFQDQVFCFTPKGR
LIALPRGATPIDFAYAVHTDIGDSCVGAKVNGRIMPLMTELKNGDEVDIIRSKAQVPPAAWESLVATGKARAAIRRATRS
AVRKQYSGLGMRILERAFERAGKPFSKDILKPGLPRLARKDVEDVLAAVGRGELPSADVVKAVYPDYQDTRVTTQNNPAK
AGEKGWFNIQNAAGMIFKVPEGGEGAAAKVDPAATTPKPGKRALPIRGTNPDLPVRFAPEGAVPGDRIVGILQPGAGITI
YPIQSPALTAYDDQPERWIDVRWDIDDQMSERFPARISVSAINSPGSLAEIAQIAAANDANIHNLSMARTAPDFTEMIID
VEVWDLKHLNRIISQLKESASVSSAKRVNG
>Q8YG65 2.7.6.5~~~rsh~~~GTP pyrophosphokinase rsh~~~COG0317
MMRQYELVERVQRYKPDVNEALLNKAYVYAMQKHRSQKRASGDPYFSHPLEVAAILTDMHLDEATIAIALLHDTIEDTTA
TRQEIDQLFGPEIGKLVEGLTKLKKLDLVSKKAVQAENLRKLLLAISEDVRVLLVKLADRLHNMRTLGVMREDKRLRIAE
ETMDIYAPLAGRMGMQDMREELEELAFRYINPDAWRAVTDRLAELLEKNRGLLQKIETDLSEIFEKNGIKASVKSRQKKP
WSVFRKMESKGLSFEQLSDIFGFRVMVDTVQDCYRALGLIHTTWSMVPGRFKDYISTPKQNDYRSIHTTIIGPSRQRIEL
QIRTREMDEIAEFGVAAHSIYKDRGSANNPHKISTETNAYAWLRQTIEQLSEGDNPEEFLEHTKLELFQDQVFCFTPKGR
LIALPRGATPIDFAYAVHTDIGDSCVGAKVNGRIMPLMTELKNGDEVDIIRSKAQVPPAAWESLVATGKARAAIRRATRS
AVRKQYSGLGMRILERAFERAGKPFSKDILKPGLPRLARKDVEDVLAAVGRGELPSTDVVKAVYPDYQDTRVTTQNNPAK
AGEKGWFNIQNAAGMIFKVPEGGEGAAAKVDPAATTPKPGKRALPIRGTNPDLPVRFAPEGAVPGDRIVGILQPGAGITI
YPIQSPALTAYDDQPERWIDVRWDIDDQMSERFPARISVSAINSPGSLAEIAQIAAANDANIHNLSMVRTAPDFTEMIID
VEVWDLKHLNRIISQLKESASVSSAKRVNG
>Q8CY42 2.7.6.5~~~rsh~~~GTP pyrophosphokinase rsh~~~
MMRQYELVERVQRYKPDVNEALLNKAYVYAMQKHGSQKRASGDPYFSHPLEVAAILTDMHLDEATIAIALLHDTIEDTTA
TRQEIDQLFGPEIGKLVEGLTKLKKLDLVSKKAVQAENLRKLLLAISEDVRVLLVKLADRLHNMRTLGVMCEDKRLRIAE
ETMDIYAPLAGRMGMQDMREELEELAFRYINPDAWRAVTDRLAELLEKNRGLLQKIETDLSEIFEKNGIKASVKSRQKKP
WSVFRKMETKGLSFEQLSDIFGFRVMVDTVQDCYRALGLIHTTWSMVPGRFKDYISTPKQNDYRSIHTTIIGPSRQRIEL
QIRTREMDEIAEFGVAAHSIYKDRGSANNPHKISTETNAYAWLRQTIEQLSEGDNPEEFLEHTKLELFQDQVFCFTPKGR
LIALPRGATPIDFAYAVHTDIGDSCVGAKVNGRIMPLMTELKNGDEVDIIRSKAQVPPAAWESLVATGKARAAIRRATRS
AVRKQYSGLGMRILERAFERAGKPFSKDILKPGLPRLARKDVEDVLAAVGRGELPSTDVVKAVYPDYQDTRVTTQNNPAK
AGEKGWFNIQNAAGMIFKVPEGGEGAAAKVDPAATTPKPGKRALPIRGTNPDLPVRFAPEGAVPGDRIVGILQPGAGITI
YPIQSPALTAYDDQPERWIDVRWDIDDQMSERFPARISVSAINSPGSLAKIAQIAAANDANIHNLSMVRTAPDFTEMIID
VEVWDLKHLNRIISQLKESASVSSAKRVNG
>O05403 ~~~rsiV~~~Anti-sigma-V factor RsiV~~~
MDKRLQQLREEYKNVQIPKELDIIVEKALQQEPKKKRIVMWPTSAAIAAAILFTALVNINPDAAQAMSKIPVIGKIVKAI
TFIEIKEEKDQSSIDVKTPALSGLSNKELENSINEKYLKESQQLYKEFIQSTSKNKKGHLSIYSDYETVTDTPDLLSIRR
NIETTQASSYTQSRYITIDKKNDILLTLKSLFKDERYIKVISQNIKEQMKQQMKEDPNKIYWLTDEDAEPFKTILPDQTF
YITEDHKLVISFDEYEVAPGYMGVTEFTIPTGVISNLLVGERYIR
>Q45588 ~~~rsiW~~~Anti-sigma-W factor RsiW~~~COG5662
MSCPEQIVQLMHMHLDGDILPKDEHVLNEHLETCEKCRKHFYEMEKSIALVRSTSHVEAPADFTANVMAKLPKEKKRASV
KRWFRTHPVIAAAAVFIILMGGGFFNSWHNDHNFSVSKQPNLVVHNHTVTVPEGETVKGDVTVKNGKLIIKGKIDGDVTV
VNGEKYMASAGQVTGQIEEINQLFDWTWYKMKSAGKSVLDAFNPNGEE
>P35166 ~~~rsiX~~~Anti-sigma-X factor RsiX~~~
MMKSEWNEEQIKELLSQLPAVKDHRSPQDIYKRLTMAKRKNKPAVRWIGPACAAAIAVYIAFIISPHFFDQAQPQQKEAS
QENAVTKTETEDSPKAASSLDQTSFVVPEKEQDNYITVAVADADTSAIIPVSIQKTNADQTIQDMLFESSELGILDHAIT
IPTFIDEVEIKEKPKQKELSIRVHQPATAFSIKDDTLLKKLLKESLKWSPYEKVKFLSDQNETGVRIGSYGTFTEISIPK
QSKRSYYLYQNKQGQDFLVPSNHSFDTVKEAIKEMESSSQEDTTPLIQAGAVQSVTKKQKHLYIRFSKESEVDDSIAGIL
MIEGLLLTAKEFGFTEVTFTETRTKKIGKYDISDAIPVPAAPNPISLN
>Q7U1Z7 ~~~rskA~~~Dysfunctional anti-sigma-K factor RskA~~~
MTEHTDFELLELATPYALNAVSDDERADIDRRVAAAPSPVAAAFNDEVRAVRETMAVVSAATTAEPPAHLRTAILDATKP
EVRRQSRWRTAAFASAAAIAVGLGAFDLGVLTRPSPPPTVAEQVLTAPDVRTVSRPLGAGTATVVFSRDRNTGLLVMNNV
APPSRGTVYQMWLLGGAKGPRSAETMGTAAVTPSTTATLTDLGASTALAFTVEPGTGSPQPTGTILAELPLG
>H8EVS9 ~~~rskA~~~Anti-sigma-K factor RskA~~~
MTEHTDFELLELATPYALNAVSDDERADIDRRVAAAPSPVAAAFNDEVRAVRETMAVVSAATTAEPPAHLRTAILDATKP
EVRRQSRWRTAAFASAAAIAVGLGAFGLGVLTRPSPPPTVAEQVLTAPDVRTVSRPLGAGTATVVFSRDRNTGLLVMNNV
APPSRGTVYQMWLLGGAKGPRSAGTMGTAAVTPSTTATLTDLGASTALAFTVEPGTGSPQPTGTILAELPLG
>P9WGX5 ~~~rskA~~~Anti-sigma-K factor RskA~~~COG5343
MTEHTDFELLELATPYALNAVSDDERADIDRRVAAAPSPVAAAFNDEVRAVRETMAVVSAATTAEPPAHLRTAILDATKP
EVRRQSRWRTAAFASAAAIAVGLGAFGLGVLTRPSPPPTVAEQVLTAPDVRTVSRPLGAGTATVVFSRDRNTGLLVMNNV
APPSRGTVYQMWLLGGAKGPRSAGTMGTAAVTPSTTATLTDLGASTALAFTVEPGTGSPQPTGTILAELPLG
>H8EXN2 ~~~rslA~~~Anti-sigma-L factor RslA~~~
MTMPLRGLGPPDDTGVREVSTGDDHHYAMWDAAYVLGALSAADRREFEAHLAGCPECRGAVTELCGVPALLSQLDRDEVA
AISESAPTVVASGLSPELLPSLLAAVHRRRRRTRLITWVASSAAAAVLAIGVLVGVQGHSAAPQRAAVSALPMAQVGTQL
LASTVSISGEPWGTFINLRCVCLAPPYASHDTLAMVVVGRDGSQTRLATWLAEPGHTATPAGSISTPVDQIAAVQVVAAD
TGQVLLQRSL
>P9WJ67 ~~~rslA~~~Anti-sigma-L factor RslA~~~COG1595
MTMPLRGLGPPDDTGVREVSTGDDHHYAMWDAAYVLGALSAADRREFEAHLAGCPECRGAVTELCGVPALLSQLDRDEVA
AISESAPTVVASGLSPELLPSLLAAVHRRRRRTRLITWVASSAAAAVLAIGVLVGVQGHSAAPQRAAVSALPMAQVGTQL
LASTVSISGEPWGTFINLRCVCLAPPYASHDTLAMVVVGRDGSQTRLATWLAEPGHTATPAGSISTPVDQIAAVQVVAAD
TGQVLLQRSL
>H8F2P5 ~~~rsmA~~~Anti-sigma-M factor RsmA~~~
MSAADKDPDKHSADADPPLTVELLADLQAGLLDDATAARIRSRVRSDPQAQQILRALNRVRRDVAAMGADPAWGPAARPA
VVDSISAALRSARPNSSPGAAHAARPHVHPVRMIAGAAGLCAVATAIGVGAVVDAPPPAPSAPTTAQHITVSKPAPVIPL
SRPQVLDLLHHTPDYGPPGGPLGDPSRRTSCLSGLGYPASTPVLGAQPIDIDARPAVLLVIPADTPDKLAVFAVAPHCSA
ADTGLLASTVVPRA
>O67680 2.1.1.182~~~rsmA~~~Ribosomal RNA small subunit methyltransferase A~~~COG0030
MVRLKKSFGQHLLVSEGVLKKIAEELNIEEGNTVVEVGGGTGNLTKVLLQHPLKKLYVIELDREMVENLKSIGDERLEVI
NEDASKFPFCSLGKELKVVGNLPYNVASLIIENTVYNKDCVPLAVFMVQKEVAEKLQGKKDTGWLSVFVRTFYDVNYVMT
VPPRFFVPPPKVQSAVIKLVKNEKFPVKDLKNYKKFLTKIFQNRRKVLRKKIPEELLKEAGINPDARVEQLSLEDFFKLY
RLIEDSGE
>P37468 2.1.1.182~~~rsmA~~~Ribosomal RNA small subunit methyltransferase A~~~COG0030
MNKDIATPIRTKEILKKYGFSFKKSLGQNFLIDTNILNRIVDHAEVTEKTGVIEIGPGIGALTEQLAKRAKKVVAFEIDQ
RLLPILKDTLSPYENVTVIHQDVLKADVKSVIEEQFQDCDEIMVVANLPYYVTTPIIMKLLEEHLPLKGIVVMLQKEVAE
RMAADPSSKEYGSLSIAVQFYTEAKTVMIVPKTVFVPQPNVDSAVIRLILRDGPAVDVENESFFFQLIKASFAQRRKTLL
NNLVNNLPEGKAQKSTIEQVLEETNIDGKRRGESLSIEEFAALSNGLYKALF
>Q63X76 2.1.1.182~~~rsmA~~~Ribosomal RNA small subunit methyltransferase A~~~COG0030
MSNSRQHQGHFARKRFGQNFLVDHGVIDAIVAAIRPERGERMVEIGPGLGALTGPVIARLATPGSPLHAVELDRDLIGRL
EQRFGELLELHAGDALTFDFGSIARPGDEPSLRIIGNLPYNISSPLLFHLMSFAPVVIDQHFMLQNEVVERMVAEPGTKA
FSRLSVMLQYRYVMDKLIDVPPESFQPPPKVDSAIVRMIPHAPHELPAVDPAVLGEVVTAAFSQRRKMLRNTLGGYRDLV
DFDALGFDLARRAEDIGVDEYVRVAQAVASARASG
>Q83AC2 2.1.1.182~~~rsmA~~~Ribosomal RNA small subunit methyltransferase A~~~COG0030
MKKMPMRKRFGQHFLHDSFVLQKIVSAIHPQKTDTLVEIGPGRGALTDYLLTECDNLALVEIDRDLVAFLQKKYNQQKNI
TIYQNDALQFDFSSVKTDKPLRVVGNLPYNISTPLLFHLFSQIHCIEDMHFMLQKEVVRRITAEVGSHDYGRLSVMAQYF
CDNTYLFTVSPQAFTPPPRVESAIIRLIPRHNFTPVAKNLDQLSHVVKEAFSYRRKTVGNALKKLINPSQWPLLEINPQL
RPQELTVEDFVKISNILN
>P06992 2.1.1.182~~~rsmA~~~Ribosomal RNA small subunit methyltransferase A~~~COG0030
MNNRVHQGHLARKRFGQNFLNDQFVIDSIVSAINPQKGQAMVEIGPGLAALTEPVGERLDQLTVIELDRDLAARLQTHPF
LGPKLTIYQQDAMTFNFGELAEKMGQPLRVFGNLPYNISTPLMFHLFSYTDAIADMHFMLQKEVVNRLVAGPNSKAYGRL
SVMAQYYCNVIPVLEVPPSAFTPPPKVDSAVVRLVPHATMPHPVKDVRVLSRITTEAFNQRRKTIRNSLGNLFSVEVLTG
MGIDPAMRAENISVAQYCQMANYLAENAPLQES
>P9WH07 2.1.1.182~~~ksgA~~~Ribosomal RNA small subunit methyltransferase A~~~COG0030
MCCTSGCALTIRLLGRTEIRRLAKELDFRPRKSLGQNFVHDANTVRRVVAASGVSRSDLVLEVGPGLGSLTLALLDRGAT
VTAVEIDPLLASRLQQTVAEHSHSEVHRLTVVNRDVLALRREDLAAAPTAVVANLPYNVAVPALLHLLVEFPSIRVVTVM
VQAEVAERLAAEPGSKEYGVPSVKLRFFGRVRRCGMVSPTVFWPIPRVYSGLVRIDRYETSPWPTDDAFRRRVFELVDIA
FAQRRKTSRNAFVQWAGSGSESANRLLAASIDPARRGETLSIDDFVRLLRRSGGSDEATSTGRDARAPDISGHASAS
>Q1RK29 2.1.1.182~~~rsmA~~~Ribosomal RNA small subunit methyltransferase A~~~COG0030
MLPSIAKHAASHQIHPLKKHGQNFIFDGSLCDKIVRASGLEENSNVLEIGPGTGGLTRSILHKNPKLLTVIETDERCIPL
LNEIKQYHPNLNIIKQDALKLKLSDLNTNKITIISNLPYHIGTELVIRWLKESSLVASMTLMLQKEVVERICAKPSTKAY
GRLSVICSLIATVEKCFDVAPTAFYPPPKVYSAIVKLTPLENIPNSDLISKVELITKMAFAGRRKMIKSSLKNLAPNISE
LLAKLNISDNCRAENLTPNDYLSLASLI
>P66662 2.1.1.182~~~rsmA~~~Ribosomal RNA small subunit methyltransferase A~~~
MLDNKDIATPSRTRALLDKYGFNFKKSLGQNFLIDVNIINNIIDASDIDAQTGVIEIGPGMGSLTEQLARHAKRVLAFEI
DQRLIPVLNDTLSPYDNVTVINEDILKANIKEAVENHLQDCEKIMVVANLPYYITTPILLNLMQQDIPIDGYVVMMQKEV
GERLNAEVGSKAYGSLSIVVQYYTETSKVLTVPKSVFMPPPNVDSIVVKLMQRTEPLVTVDNEEAFFKLAKAAFAQRRKT
INNNYQNYFKDGKQHKEVILQWLEQAGIDPRRRGETLSIQDFAKLYEEKKKFPQLEN
>Q5SM60 2.1.1.182~~~rsmA~~~Ribosomal RNA small subunit methyltransferase A~~~COG0030
MSKLASPQSVRALLERHGLFADKRFGQNFLVSEAHLRRIVEAARPFTGPVFEVGPGLGALTRALLEAGAEVTAIEKDLRL
RPVLEETLSGLPVRLVFQDALLYPWEEVPQGSLLVANLPYHIATPLVTRLLKTGRFARLVFLVQKEVAERMTARPKTPAY
GVLTLRVAHHAVAERLFDLPPGAFFPPPKVWSSLVRLTPTGALDDPGLFRLVEAAFGKRRKTLLNALAAAGYPKARVEEA
LRALGLPPRVRAEELDLEAFRRLREGLEGAV
>P36929 2.1.1.176~~~rsmB~~~Ribosomal RNA small subunit methyltransferase B~~~COG0144
MKKQRNLRSMAAQAVEQVVEQGQSLSNILPPLQQKVSDKDKALLQELCFGVLRTLSQLDWLINKLMARPMTGKQRTVHYL
IMVGLYQLLYTRIPPHAALAETVEGAIAIKRPQLKGLINGVLRQFQRQQEELLAEFNASDARYLHPSWLLKRLQKAYPEQ
WQSIVEANNQRPPMWLRINRTHHSRDSWLALLDEAGMKGFPHADYPDAVRLETPAPVHALPGFEDGWVTVQDASAQGCMT
WLAPQNGEHILDLCAAPGGKTTHILEVAPEAQVVAVDIDEQRLSRVYDNLKRLGMKATVKQGDGRYPSQWCGEQQFDRIL
LDAPCSATGVIRRHPDIKWLRRDRDIPELAQLQSEILDAIWPHLKTGGTLVYATCSVLPEENSLQIKAFLQRTADAELCE
TGTPEQPGKQNLPGAEEGDGFFYAKLIKK
>Q5SK01 2.1.1.176~~~rsmB~~~Ribosomal RNA small subunit methyltransferase B~~~COG0144
MRAGTPRALAVAVLLEVDRGGRAQLLLDRALRRSPWPERDRAYATFLVYGALRRLRLLDHLLAPLLPRPEGLPPEVRWIL
RLGALEWLEGKPDHARVSPWVEEAKRRYPGLAGLVNAVLRRLAPREAPECVRLSLPDWLCEAWRGFFGDVAFAEGFNEPA
PLFVTAYREVDLRPGPVPGSYLWEGPKTDFSALGLQPQNPASLFAAKLLEARPGERVLDLCGGAGLKAFYLAAQGAEVVS
YDLNRRRQEAGARTARRLGLWVHYRTQDLTRPVPERAKKVLLDAPCTGTGTFRAHPELRYRLSPEDPARMAALQLQLLET
AAQATEEGGVLVYSVCTLTEEEGEGVVRAFLARHPEFRPEPVQPPFPVLARGLGVYVDPRGGLDGFYYAKLRRVNSQA
>P39406 2.1.1.172~~~rsmC~~~Ribosomal RNA small subunit methyltransferase C~~~COG2813
MSAFTPASEVLLRHSDDFEQSRILFAGDLQDDLPARLDTAASRAHTQQFHHWQVLSRQMGDNARFSLVATADDVADCDTL
IYYWPKNKPEAQFQLMNLLSLLPVGTDIFVVGENRSGVRSAEQMLADYAPLNKVDSARRCGLYFGRLEKQPVFDAEKFWG
EYSVDGLTVKTLPGVFSRDGLDVGSQLLLSTLTPHTKGKVLDVGCGAGVLSVAFARHSPKIRLTLCDVSAPAVEASRATL
AANGVEGEVFASNVFSEVKGRFDMIISNPPFHDGMQTSLDAAQTLIRGAVRHLNSGGELRIVANAFLPYPDVLDETFGFH
EVIAQTGRFKVYRAIMTRQAKKG
>P0ADX9 2.1.1.171~~~rsmD~~~Ribosomal RNA small subunit methyltransferase D~~~COG0742
MKKPNHSGSGQIRIIGGQWRGRKLPVPDSPGLRPTTDRVRETLFNWLAPVIVDAQCLDCFAGSGALGLEALSRYAAGATL
IEMDRAVSQQLIKNLATLKAGNARVVNSNAMSFLAQKGTPHNIVFVDPPFRRGLLEETINLLEDNGWLADEALIYVESEV
ENGLPTVPANWSLHREKVAGQVAYRLYQREAQGESDAD
>P44869 2.1.1.171~~~rsmD~~~Ribosomal RNA small subunit methyltransferase D~~~COG0742
MKKIQTPNAKGEVRIIAGLWRGRKLPVLNSEGLRPTGDRVKETLFNWLMPYIHQSECLDGFAGSGSLGFEALSRQAKKVT
FLELDKTVANQLKKNLQTLKCSSEQAEVINQSSLDFLKQPQNQPHFDVVFLDPPFHFNLAEQAISLLCENNWLKPNALIY
VETEKDKPLITPENWTLLKEKTTGIVSYRLYQN
>O66552 2.1.1.193~~~rsmE~~~Ribosomal RNA small subunit methyltransferase E~~~COG1385
MHVFYSEERRGNLLILREGEVKHFRVRRIEKDEEFGVIHEGKIYVCKVRREDKREISCEIVEELETKLPPKDITLYQSVT
VDLKTMDTIVRQATELGVLTFVPIISERSFQKEEAILKKTEKWKRIVIEAMKQSRRPIPMEIKKPVRLSDLIPESEENII
LDNFYEGVKPKDVNLEAKTYSVVVGPEGGFSKRESQILREKGFKSVLLEPYTLRTETAVVSIVSILMNF
>P54461 2.1.1.193~~~rsmE~~~Ribosomal RNA small subunit methyltransferase E~~~COG1385
MQRYFIELTKQQIEEAPTFSITGEEVHHIVNVMRMNEGDQIICCSQDGFEAKCELQSVSKDKVSCLVIEWTNENRELPIK
VYIASGLPKGDKLEWIIQKGTELGAHAFIPFQAARSVVKLDDKKAKKKRERWTKIAKEAAEQSYRNEVPRVMDVHSFQQL
LQRMQDFDKCVVAYEESSKQGEISAFSAIVSSLPKGSSLLIVFGPEGGLTEAEVERLTEQDGVTCGLGPRILRTETAPLY
ALSAISYQTELLRGDQ
>P0AGL7 2.1.1.193~~~rsmE~~~Ribosomal RNA small subunit methyltransferase E~~~COG1385
MRIPRIYHPEPLTSHSHIALCEDAANHIGRVLRMGPGQALQLFDGSNQVFDAEITSASKKSVEVKVLEGQIDDRESPLHI
HLGQVMSRGEKMEFTIQKSIELGVSLITPLFSERCGVKLDSERLNKKLQQWQKIAIAACEQCGRNRVPEIRPAMDLEAWC
AEQDEGLKLNLHPRASNSINTLPLPVERVRLLIGPEGGLSADEIAMTARYQFTDILLGPRVLRTETTALTAITALQVRFG
DLG
>P44627 2.1.1.193~~~rsmE~~~Ribosomal RNA small subunit methyltransferase E~~~COG1385
MRIPRIYHPISLENQTQCYLSEDAANHVARVLRMTEGEQLELFDGSNHIYPAKIIESNKKSVKVEILGRELADKESHLKI
HLGQVISRGERMEFTIQKSVELGVNVITPLWSERCGVKLDAERMDKKIQQWQKIAIAACEQCGRNIVPEIRPLMKLQDWC
AENDGALKLNLHPRAHYSIKTLPTIPAGGVRLLIGSEGGLSAQEIAQTEQQGFTEILLGKRVLRTETASLAAISALQICF
GDLGE
>P9WGX1 2.1.1.193~~~rsmE~~~Ribosomal RNA small subunit methyltransferase E~~~COG1385
MVAMLFYVDTLPDTGAVAVVDGDEGFHAATVRRIRPGEQLVLGDGVGRLARCVVEQAGRGGLRARVLRRWSVPPVRPPVT
VVQALPKSERSELAIELATEAGADAFLAWQAARCVANWDGARVDKGLRRWRAVVRSAARQSRRARIPPVDGVLSTPMLVQ
RVREEVAAGAAVLVLHEEATERIVDIAAAQAGSLMLVVGPEGGIAPDELAALTDAGAVAVRLGPTVLRTSTAAAVALGAV
GVLTSRWDASASDCEYCDVTRR
>P76273 2.1.1.178~~~rsmF~~~Ribosomal RNA small subunit methyltransferase F~~~COG0144
MAQHTVYFPDAFLTQMREAMPSTLSFDDFLAACQRPLRRSIRVNTLKISVADFLQLTAPYGWTLTPIPWCEEGFWIERDN
EDALPLGSTAEHLSGLFYIQEASSMLPVAALFADGNAPQRVMDVAAAPGSKTTQISARMNNEGAILANEFSASRVKVLHA
NISRCGISNVALTHFDGRVFGAAVPEMFDAILLDAPCSGEGVVRKDPDALKNWSPESNQEIAATQRELIDSAFHALRPGG
TLVYSTCTLNQEENEAVCLWLKETYPDAVEFLPLGDLFPGANKALTEEGFLHVFPQIYDCEGFFVARLRKTQAIPALPAP
KYKVGNFPFSPVKDREAGQIRQAATGVGLNWDENLRLWQRDKELWLFPVGIEALIGKVRFSRLGIKLAETHNKGYRWQHE
AVIALASPDNMNAFELTPQEAEEWYRGRDVYPQAAPVADDVLVTFQHQPIGLAKRIGSRLKNSYPRELVRDGKLFTGNA
>Q5SII2 2.1.1.-~~~rsmF~~~Ribosomal RNA small subunit methyltransferase F~~~COG0144
MLPKAFLSRMAELLGEEFPAFLKALTEGKRTYGLRVNTLKLPPEAFQRISPWPLRPIPWCQEGFYYPEEARPGPHPFFYA
GLYYIQEPSAQAVGVLLDPKPGERVLDLAAAPGGKTTHLAARMGGKGLLLANEVDGKRVRGLLENVERWGAPLAVTQAPP
RALAEAFGTYFHRVLLDAPCSGEGMFRKDREAARHWGPSAPKRMAEVQKALLAQASRLLGPGGVLVYSTCTFAPEENEGV
VAHFLKAHPEFRLEDARLHPLFAPGVPEWGEGNPELLKTARLWPHRLEGEGHFLARFRKEGGAWSTPRLERPSPLSQEAL
RAFRGFLEEAGLTLEGPVLDRAGHLYLLPEGLPTLLGLKAPAPGLYLGKVQKGRFLPARALALAFGATLPWPEGLPRLAL
TPEDPRALAFATGEGVAWEGEDHPLALVVLKTAAGEFPLDFGKAKRGVLRPVGVGL
>P25813 2.1.1.-~~~rsmG~~~Ribosomal RNA small subunit methyltransferase G~~~COG0357
MNIEEFTSGLAEKGISLSPRQLEQFELYYDMLVEWNEKINLTSITEKKEVYLKHFYDSITAAFYVDFNQVNTICDVGAGA
GFPSLPIKICFPHLHVTIVDSLNKRITFLEKLSEALQLENTTFCHDRAETFGQRKDVRESYDIVTARAVARLSVLSELCL
PLVKKNGLFVALKAASAEEELNAGKKAITTLGGELENIHSFKLPIEESDRNIMVIRKIKNTPKKYPRKPGTPNKSPIEG
>P0A6U5 2.1.1.170~~~rsmG~~~Ribosomal RNA small subunit methyltransferase G~~~COG0357
MLNKLSLLLKDAGISLTDHQKNQLIAYVNMLHKWNKAYNLTSVRDPNEMLVRHILDSIVVAPYLQGERFIDVGTGPGLPG
IPLSIVRPEAHFTLLDSLGKRVRFLRQVQHELKLENIEPVQSRVEEFPSEPPFDGVISRAFASLNDMVSWCHHLPGEQGR
FYALKGQMPEDEIALLPEEYQVESVVKLQVPALDGERHLVVIKANKI
>P9WGW9 2.1.1.-~~~rsmG~~~Ribosomal RNA small subunit methyltransferase G~~~COG0357
MSPIEPAASAIFGPRLGLARRYAEALAGPGVERGLVGPREVGRLWDRHLLNCAVIGELLERGDRVVDIGSGAGLPGVPLA
IARPDLQVVLLEPLLRRTEFLREMVTDLGVAVEIVRGRAEESWVQDQLGGSDAAVSRAVAALDKLTKWSMPLIRPNGRML
AIKGERAHDEVREHRRVMIASGAVDVRVVTCGANYLRPPATVVFARRGKQIARGSARMASGGTA
>P64237 2.1.1.170~~~rsmG~~~Ribosomal RNA small subunit methyltransferase G~~~
MLNKLSRLLADAGISLTDHQKTLLVAYVDMLHKWNKAYNLTSVRDPNEMLVRHILDSIVVAPYLQGQRFIDVGTGPGLPG
IPLAIVLPDAHFTLLDSLGKRVRFLRQVQHELKLENITPVQSRVEAYPSEPPFDGVISRAFASLNDMVSWCHHLPGEKGR
FYALKGQLPGDEIASLPDNFSVESVEKLRVPQLEGERHLVIIKSNKV
>O54571 2.1.1.-~~~rsmG~~~Ribosomal RNA small subunit methyltransferase G~~~COG0357
MSEAAELPPVPEQARDVFGDRYADAVRYAELLAEAGVKRGLIGPREVPRLWERHLLNCAVLSEVVPEGVTVCDVGSGAGL
PGIPLALVREDLKITLLEPLLRRTNFLTEVVELLGLDHVTVVRGRAEEVMGKLPPVHVVTARAVAPLDRLATWGIPLLRP
YGEMLALKGDTAEEELKAATAALSKLGAEQTSILHVGEGVVSPLSTVVRVEVGESPGGVRFAAKRAKAARTGRTRRRRG
>Q97QD4 2.1.1.-~~~rsmG~~~Ribosomal RNA small subunit methyltransferase G~~~COG0357
MKPKTFYNLLAEQNLPLSDQQKEQFERYFELLVEWNEKINLTAITDKEEVYLKHFYDSIAPILQGLIPNETIKLLDIGAG
AGFPSLPMKILYPELDVTIIDSLNKRINFLQLLAQELDLNGVHFYHGRAEDFAQDKNFRAQYDFVTARAVARMQVLSELT
IPYLKVGGKLLALKASNAPEELLEAKNALNLLFSKVEDNLSYALPNRDPRYITVVEKKKETPNKYPRKAGMPNKRPL
>Q9LCY2 2.1.1.170~~~rsmG~~~Ribosomal RNA small subunit methyltransferase G~~~COG0357
MFHGKHPGGLSERGRALLLEGGKALGLDLKPHLEAFSRLYALLQEASGKVNLTALRGEEEVVVKHFLDSLTLLRLPLWQG
PLRVLDLGTGAGFPGLPLKIVRPELELVLVDATRKKVAFVERAIEVLGLKGARALWGRAEVLAREAGHREAYARAVARAV
APLCVLSELLLPFLEVGGAAVAMKGPRVEEELAPLPPALERLGGRLGEVLALQLPLSGEARHLVVLEKTAPTPPAYPRRP
GVPERHPLC
>Q07876 2.1.1.199~~~rsmH~~~Ribosomal RNA small subunit methyltransferase H~~~COG0275
MFQHKTVLLRETVDGLNIKPDGTYVDCTLGGAGHSTYLLQQLSEKGRLIAFDQDDTALQHAKEVLSDYKGQLILIKSNFR
YLKEYLNEQGVTEVDGILFDLGVSSPQLDTPERGFSYHHDAPLDMRMDQSATLSAKEVVNEWRYEDLVRIFFKYGEEKFS
KQIARKIEEARMKSPIQTTGQLVDLIKDAIPAPARRSGGHPAKRVFQAIRIAVNDELRVFEEALEQAIEVLKPGGRVSVI
TFHSLEDRICKTTFKEKSSLPELPPGLPVIPEEFEPELKLITRKPITASQEELEENNRARSAKLRIAEKRK
>P60390 2.1.1.199~~~rsmH~~~Ribosomal RNA small subunit methyltransferase H~~~COG0275
MMENYKHTTVLLDEAVNGLNIRPDGIYIDGTFGRGGHSRLILSQLGEEGRLLAIDRDPQAIAVAKTIDDPRFSIIHGPFS
ALGEYVAERDLIGKIDGILLDLGVSSPQLDDAERGFSFMRDGPLDMRMDPTRGQSAAEWLQTAEEADIAWVLKTYGEERF
AKRIARAIVERNREQPMTRTKELAEVVAAATPVKDKFKHPATRTFQAVRIWVNSELEEIEQALKSSLNVLAPGGRLSIIS
FHSLEDRIVKRFMRENSRGPQVPAGLPMTEEQLKKLGGRQLRALGKLMPGEEEVAENPRARSSVLRIAERTNA
>P45057 2.1.1.199~~~rsmH~~~Ribosomal RNA small subunit methyltransferase H~~~COG0275
MNSENSFSSSEHITVLLHEAVNGLALKENGIYIDGTFGRGGHSRFILSQLSSNGRLIAVDRDPRAIAEAHKIQDLRFQIE
HNSFSHIPEICDKLNLVGKIDGILLDLGVSSPQLDEAERGFSFMKDGPLDMRMDTTQGLSAEEWLKQVSIEDLTWVLKTF
GEERFAKRIATAIVDFNKSAVKNGTEFLSRTSQLAELISQAVPFKDKHKHPATRSFQAIRIFINSELDELESLLNSALDM
LAPEGRLSIISFHSLEDRMVKHFMKKQSKGEDIPKGLPLREDQIQRNQKLRIIGKAIQPSDAEIQANPRSRSAILRVAER
I
>P9WJP1 2.1.1.199~~~rsmH~~~Ribosomal RNA small subunit methyltransferase H~~~COG0275
MQTRAPWSLPEATLAYFPNARFVSSDRDLGAGAAPGIAASRSTACQTWGGITVADPGSGPTGFGHVPVLAQRCFELLTPA
LTRYYPDGSQAVLLDATIGAGGHAERFLEGLPGLRLIGLDRDPTALDVARSRLVRFADRLTLVHTRYDCLGAALAESGYA
AVGSVDGILFDLGVSSMQLDRAERGFAYATDAPLDMRMDPTTPLTAADIVNTYDEAALADILRRYGEERFARRIAAGIVR
RRAKTPFTSTAELVALLYQAIPAPARRVGGHPAKRTFQALRIAVNDELESLRTAVPAALDALAIGGRIAVLAYQSLEDRI
VKRVFAEAVASATPAGLPVELPGHEPRFRSLTHGAERASVAEIERNPRSTPVRLRALQRVEHRAQSQQWATEKGDS
>P60392 2.1.1.199~~~rsmH~~~Ribosomal RNA small subunit methyltransferase H~~~
MFHHISVMLNETIDYLNVKENGVYIDCTLGGAGHALYLLNQLNDDGRLIAIDQDQTAIDNAKEVLKDHLHKVTFVHSNFR
ELTQILKDLNIEKVDGIYYDLGVSSPQLDIPERGFSYHHDATLDMRMDQTQELTAYEIVNNWSYEALVKIFYRYGEEKFS
KQIARRIEAHREQQPITTTLELVDIIKEGIPAKARRKGGHPAKRVFQALRIAVNDELSAFEDSIEQAIELVKVDGRISVI
TFHSLEDRLCKQVFQEYEKGPEVPRGLPVIPEAYTPKLKRVNRKPITATEEDLDDNNRARSAKLRVAEILK
>Q9WZX6 2.1.1.199~~~rsmH~~~Ribosomal RNA small subunit methyltransferase H~~~COG0275
MRKYSQRHIPVMVREVIEFLKPEDEKIILDCTVGEGGHSRAILEHCPGCRIIGIDVDSEVLRIAEEKLKEFSDRVSLFKV
SYREADFLLKTLGIEKVDGILMDLGVSTYQLKGENRGFTFEREEPLDMRMDLESEVTAQKVLNELPEEELARIIFEYGEE
KRFARRIARKIVENRPLNTTLDLVKAVREALPSYEIRRRKRHFATKTFQAIRIYVNRELENLKEFLKKAEDLLNPGGRIV
VISFHSLEDRIVKETFRNSKKLRILTEKPVRPSEEEIRENPRARSGRLRAAERIEEGGD
>Q5SJD8 2.1.1.199~~~rsmH~~~Ribosomal RNA small subunit methyltransferase H~~~COG0275
MRPMTHVPVLYQEALDLLAVRPGGVYVDATLGGAGHARGILERGGRVIGLDQDPEAVARAKGLHLPGLTVVQGNFRHLKR
HLAALGVERVDGILADLGVSSFHLDDPSRGFSYQKEGPLDMRMGLEGPTAKEVVNRLPLEALARLLRELGEEPQAYRIAR
AIVAAREKAPIETTTQLAEIVRKAVGFRRAGHPARKTFQALRIYVNDELNALKEFLEQAAEVLAPGGRLVVIAFHSLEDR
VVKRFLRESGLKVLTKKPLVPSEKEAAQNPRARSAKLRAAEKEAP
>P67087 2.1.1.198~~~rsmI~~~Ribosomal RNA small subunit methyltransferase I~~~COG0313
MKQHQSADNSQGQLYIVPTPIGNLADITQRALEVLQAVDLIAAEDTRHTGLLLQHFGINARLFALHDHNEQQKAETLLAK
LQEGQNIALVSDAGTPLINDPGYHLVRTCREAGIRVVPLPGPCAAITALSAAGLPSDRFCYEGFLPAKSKGRRDALKAIE
AEPRTLIFYESTHRLLDSLEDIVAVLGESRYVVLARELTKTWETIHGAPVGELLAWVKEDENRRKGEMVLIVEGHKAQEE
DLPADALRTLALLQAELPLKKAAALAAEIHGVKKNALYKYALEQQG
>P9WGW7 2.1.1.198~~~rsmI~~~Ribosomal RNA small subunit methyltransferase I~~~COG0313
MSSGRLLLGATPLGQPSDASPRLAAALATADVVAAEDTRRVRKLAKALDIRIGGRVVSLFDRVEALRVTALLDAINNGAT
VLVVSDAGTPVISDPGYRLVAACIDAGVSVTCLPGPSAVTTALVMSGLPAEKFCFEGFAPRKGAARRAWLAELAEERRTC
VFFESPRRLAACLNDAVEQLGGARPAAICRELTKVHEEVVRGSLDELAIWAAGGVLGEITVVVAGAAPHAELSSLIAQVE
EFVAAGIRVKDACSEVAAAHPGVRTRQLYDAVLQSRRETGGPAQP
>P68568 2.1.1.242~~~rsmJ~~~Ribosomal RNA small subunit methyltransferase J~~~COG0742
MKICLIDETGTGDGALSVLAARWGLEHDEDNLMALVLTPEHLELRKRDEPKLGGIFVDFVGGAMAHRRKFGGGRGEAVAK
AVGIKGDYLPDVVDATAGLGRDAFVLASVGCRVRMLERNPVVAALLDDGLARGYADAEIGGWLQERLQLIHASSLTALTD
ITPRPQVVYLDPMFPHKQKSALVKKEMRVFQSLVGPDLDADGLLEPARLLATKRVVVKRPDYAPPLANVATPNAVVTKGH
RFDIYAGTPV
>P68567 2.1.1.242~~~rsmJ~~~Ribosomal RNA small subunit methyltransferase J~~~COG0742
MKICLIDETGTGDGALSVLAARWGLEHDEDNLMALVLTPEHLELRKRDEPKLGGIFVDFVGGAMAHRRKFGGGRGEAVAK
AVGIKGDYLPDVVDATAGLGRDAFVLASVGCRVRMLERNPVVAALLDDGLARGYADAEIGGWLQERLQLIHASSLTALTD
ITPRPQVVYLDPMFPHKQKSALVKKEMRVFQSLVGPDLDADGLLEPARLLATKRVVVKRPDYAPPLANVATPNAVVTKGH
RFDIYAGTPV
>P72077 2.1.1.242~~~rsmJ~~~Ribosomal RNA small subunit methyltransferase J~~~
MTDILIDDTATEAVRTLIRAFPLVPVSQPPEQGSYLLAEHDTVSLRLVGEKSNVIVDFTSGAAQYRRTKGGGELIAKAVN
HTAHPTVWDATAGLGRDSFVLASLGLTVTAFEQHPAVACLLSDGIRRALLNPETQDTAARINLHFGNAAEQMPALVKTQG
KPDIVYLDPMYPERRKSAAVKKEMAYFHRLVGEAQDEVVLLHTARQTAKKRVVVKRPRLGEHLAGQAPAYQYTGKSTRFD
VYLPYGADKG
>Q9X6G2 2.1.1.242~~~rsmJ~~~Ribosomal RNA small subunit methyltransferase J~~~
MQICLMDETGATDGALSVLAARWGLEHDEDNPMALVMTPQHLELRKRDEPKLGGIFVDFVGGAMAHRRKFGGGRGEAVAK
AVGIKGDYLPDVVDATAGLGRDAFVLASVGCRVRMLERNPVVAALLDDGLTRGYADADIGGWLQERLQLIHASSLTALTD
ITPRPQVVYLDPMFPHRQKSALVKKEMRVFQSLVGPDLDADGLLEPARQLATKRVVVKRPDYAPPLADVATPNAIVTKGH
RFDIYAGTPLTE
>Q7UAV7 2.1.1.242~~~rsmJ~~~Ribosomal RNA small subunit methyltransferase J~~~
MKICLIDETGAGDGALSVLAARWGLEHDEDNLMALVLTPEHLELRKRDEPKLGGIFVDFVGGAMAHRRKFGGGRGEAVAK
AVGIKGDYLPDVVDATAGLGRDAFVLASVGCRVRMLERNPVVAALLDDGLARGYADAEIGGWLQERLQLIHASSLTALTD
ITPRPQVVYLDPMFPHKQKSALVKKEMRVFQSLVGPDLDADGLLEPARLLATKRVVVKRPDYAPPLANVATPNAVVTKGH
RFDIYAGTPV
>C0H3R4 ~~~rsoA~~~Sigma-O factor regulatory protein RsoA~~~
MDGQFEQKKKQKDETYDIEHLIACFSPMIRKKLSNTSYQEREDLEQELKIKMFEKADMLLCQDVPGFWEFILYMVDENS
>P38104 ~~~rspA~~~Starvation-sensing protein RspA~~~COG4948
MKIVKAEVFVTCPGRNFVTLKITTEDGITGLGDATLNGRELSVASYLQDHLCPQLIGRDAHRIEDIWQFFYKGAYWRRGP
VTMSAISAVDMALWDIKAKAANMPLYQLLGGASREGVMVYCHTTGHSIDEALDDYARHQELGFKAIRVQCGIPGMKTTYG
MSKGKGLAYEPATKGQWPEEQLWSTEKYLDFMPKLFDAVRNKFGFNEHLLHDMHHRLTPIEAARFGKSIEDYRMFWMEDP
TPAENQECFRLIRQHTVTPIAVGEVFNSIWDCKQLIEEQLIDYIRTTLTHAGGITGMRRIADFASLYQVRTGSHGPSDLS
PVCMAAALHFDLWVPNFGVQEYMGYSEQMLEVFPHNWTFDNGYMHPGDKPGLGIEFDEKLAAKYPYEPAYLPVARLEDGT
LWNW
>P38105 1.1.1.-~~~rspB~~~Starvation-sensing protein RspB~~~COG1063
MKSILIEKPNQLAIVEREIPTPSAGEVRVKVKLAGICGSDSHIYRGHNPFAKYPRVIGHEFFGVIDAVGEGVESARVGER
VAVDPVVSCGHCYPCSIGKPNVCTTLAVLGVHADGGFSEYAVVPAKNAWKIPEAVADQYAVMIEPFTIAANVTGHGQPTE
NDTVLVYGAGPIGLTIVQVLKGVYNVKNVIVADRIDERLEKAKESGADWAINNSQTPLGEIFTEKGIKPTLIIDAACHPS
ILKEAVTLASPAARIVLMGFSSEPSEVIQQGITGKELSIFSSRLNANKFPIVIDWLSKGLIKPEKLITHTFDFQHVADAI
SLFEQDQKHCCKVLLTFSE
>P0ACM2 ~~~rspR~~~HTH-type transcriptional repressor RspR~~~COG1802
MTVETQLNPTQPVNQQIYRILRRDIVHCLIAPGTPLSEKEVSVRFNVSRQPVREAFIKLAENGLIQIRPQRGSYVNKISM
AQVRNGSFIRQAIECAVARRAASMITESQCYQLEQNLHQQRIAIERKQLDDFFELDDNFHQLLTQIADCQLAWDTIENLK
ATVDRVRYMSFDHVSPPEMLLRQHLDIFSALQKRDGDAVERAMTQHLQEISESVRQIRQENSDWFSEE
>Q7AKG8 ~~~rsrA~~~Anti-sigma factor RsrA~~~COG5662
MSCGEPHETDCSEILDHLYEFLDKEMPDSDCVKFEHHFEECSPCLEKYGLEQAVKKLVKRCCGQDDVPGDLRAKVMGRLD
LIRSGQSVPEHDVAAAPSSSAPQES
>Q8GP19 2.7.13.3~~~rssA~~~Swarming motility regulation sensor protein RssA~~~
MIGFKSFFMRTIIFQVLAILLLWGLLVAWVKYWYYPDMEKYFDNQQRIVAAGIANILDETGTDNIDYRGIIKTIEGMYID
SINNGMQDEIDYHPLFVVYDRDNRVLYSSQTQGEPLRLPPSVLSGSVNYAGANWHLAGSWKEKRQYRVIVGESFNDRTTL
FGNPADVPLLGILAAIIVTLLFTAYFSLRPLRQIARTISDRQPGNLSPINVSEQYQEIRPVVMEVNKLMARIDAANQREK
RFMADAAHELRTPIAAVLAQLHLLTQVTEQQERREIIGDMQQGLDRAASLSRQLINLAKLEAEDFPLKIEAVDIYAEIGK
CIAQHVPYALEKDVELSLDGSEDVVVSTDRRALIAIFTNLLDNALKYAPPGSRIEANIRSLAPLGCYITLRDNGPGVSEE
HRSRLFERFYRVPGTQQTGSGLGLAIARNLADKIGAQLRVTEGLDDRGIGFIIDLPESYRPQTESEPRP
>P0AEV1 ~~~rssB~~~Regulator of RpoS~~~COG0745
MTQPLVGKQILIVEDEQVFRSLLDSWFSSLGATTVLAADGVDALELLGGFTPDLMICDIAMPRMNGLKLLEHIRNRGDQT
PVLVISATENMADIAKALRLGVEDVLLKPVKDLNRLREMVFACLYPSMFNSRVEEEERLFRDWDAMVDNPAAAAKLLQEL
QPPVQQVISHCRVNYRQLVAADKPGLVLDIAALSENDLAFYCLDVTRAGHNGVLAALLLRALFNGLLQEQLAHQNQRLPE
LGALLKQVNHLLRQANLPGQFPLLVGYYHRELKNLILVSAGLNATLNTGEHQVQISNGVPLGTLGNAYLNQLSQRCDAWQ
CQIWGTGGRLRLMLSAE
>Q8GP20 ~~~rssB~~~Swarming motility regulation protein RssB~~~
MNILLVEDDLQLGKALCRALELAGFNLCWVRLIADAENKLSSGGFDLMLLDLTLPDGDGLQKLIAWRAAGQNIPIIILTA
RDRIESLVNSLDSGANDFLAKPFALPELISRVKAVNRRMAGFASQTWSLGALYLDPVNHQVMLDNELLMLSKKEYHLLHE
LMRCAGTVVRKAVLEQRLFGHGDSVESNSLEVHMHNLRRKIGKDRVITVRGIGYLLKKE
>P52108 ~~~rstA~~~Transcriptional regulatory protein RstA~~~COG0745
MNTIVFVEDDAEVGSLIAAYLAKHDMQVTVEPRGDQAEETILRENPDLVLLDIMLPGKDGMTICRDLRAKWSGPIVLLTS
LDSDMNHILALEMGACDYILKTTPPAVLLARLRLHLRQNEQATLTKGLQETSLTPYKALHFGTLTIDPINRVVTLANTEI
SLSTADFELLWELATHAGQIMDRDALLKNLRGVSYDGLDRSVDVAISRLRKKLLDNAAEPYRIKTVRNKGYLFAPHAWE
>P18392 2.7.13.3~~~rstB~~~Sensor protein RstB~~~COG2205
MKKLFIQFYLLLFVCFLVMSLLVGLVYKFTAERAGKQSLDDLMNSSLYLMRSELREIPPHDWGKTLKEMDLNLSFDLRVE
PLSKYHLDDISMHRLRGGEIVALDDQYTFLQRIPRSHYVLAVGPVPYLYYLHQMRLLDIALIAFIAISLAFPVFIWMRPH
WQDMLKLEAAAQRFGDGHLNERIHFDEGSSFERLGVAFNQMADNINALIASKKQLIDGIAHELRTPLVRLRYRLEMSDNL
SAAESQALNRDISQLEALIEELLTYARLDRPQNELHLSEPDLPLWLSTHLADIQAVTPDKTVRIKTLVQGHYAALDMRLM
ERVLDNLLNNALRYCHSTVETSLLLSGNRATLIVEDDGPGIAPENREHIFEPFVRLDPSRDRSTGGCGLGLAIVHSIALA
MGGTVNCDTSELGGARFSFSWPLWHNIPQFTSA
>P0AA43 5.4.99.19~~~rsuA~~~Ribosomal small subunit pseudouridine synthase A~~~COG1187
MRLDKFIAQQLGVSRAIAGREIRGNRVTVDGEIVRNAAFKLLPEHDVAYDGNPLAQQHGPRYFMLNKPQGYVCSTDDPDH
PTVLYFLDEPVAWKLHAAGRLDIDTTGLVLMTDDGQWSHRITSPRHHCEKTYLVTLESPVADDTAEQFAKGVQLHNEKDL
TKPAVLEVITPTQVRLTISEGRYHQVKRMFAAVGNHVVELHRERIGGITLDADLAPGEYRPLTEEEIASVV
>P45124 5.4.99.19~~~rsuA~~~Ribosomal small subunit pseudouridine synthase A~~~COG1187
MRLDKFIAENVGLTRSQATKAIRQSAVKINGEIVKSGSVQISQEDEIYFEDELLTWIEEGQYFMLNKPQGCVCSNDDGDY
PTIYQFFDYPLAGKLHSAGRLDVDTTGLVLLTDDGQWSHRITSPKHHCEKTYLVTLADPVEENYSAACAEGILLRGEKEP
TKPAKLEILDDYNVNLTISEGRYHQVKRMFAALGNKVVGLHRWKIGDVVLDESLEEGEYRPLTQSEIEKLVK
>P0A766 7.-.-.-~~~rsxA~~~Ion-translocating oxidoreductase complex subunit A~~~COG4657
MTDYLLLFVGTVLVNNFVLVKFLGLCPFMGVSKKLETAMGMGLATTFVMTLASICAWLIDTWILIPLNLIYLRTLAFILV
IAVVVQFTEMVVRKTSPVLYRLLGIFLPLITTNCAVLGVALLNINLGHNFLQSALYGFSAAVGFSLVMVLFAAIRERLAV
ADVPAPFRGNAIALITAGLMSLAFMGFSGLVKL
>P77223 7.-.-.-~~~rsxB~~~Ion-translocating oxidoreductase complex subunit B~~~COG2878
MNAIWIAVAAVSLLGLAFGAILGYASRRFAVEDDPVVEKIDEILPQSQCGQCGYPGCRPYAEAISCNGEKINRCAPGGEA
VMLKIAELLNVEPQPLDGEAQEITPARMVAVIDENNCIGCTKCIQACPVDAIVGATRAMHTVMSDLCTGCNLCVDPCPTH
CISLQPVAETPDSWKWDLNTIPVRIIPVEHHA
>P77611 7.-.-.-~~~rsxC~~~Ion-translocating oxidoreductase complex subunit C~~~COG4656
MLKLFSAFRKNKIWDFNGGIHPPEMKTQSNGTPLRQVPLAQRFVIPLKQHIGAEGELCVSVGDKVLRGQPLTRGRGKMLP
VHAPTSGTVTAIAPHSTAHPSALAELSVIIDADGEDCWIPRDGWADYRTRSREELIERIHQFGVAGLGGAGFPTGVKLQG
GGDKIETLIINAAECEPYITADDRLMQDCAAQVVEGIRILAHILQPREILIGIEDNKPQAISMLRAVLADSNDISLRVIP
TKYPSGGAKQLTYILTGKQVPHGGRSSDIGVLMQNVGTAYAVKRAVIDGEPITERVVTLTGEAIARPGNVWARLGTPVRH
LLNDAGFCPSADQMVIMGGPLMGFTLPWLDVPVVKITNCLLAPSANELGEPQEEQSCIRCSACADACPADLLPQQLYWFS
KGQQHDKATTHNIADCIECGACAWVCPSNIPLVQYFRQEKAEIAAIRQEEKRAAEAKARFEARQARLEREKAARLERHKS
AAVQPAAKDKDAIAAALARVKEKQAQATQPIVIKAGERPDNSAIIAAREARKAQARAKQAELQQTNDAATVADPRKTAVE
AAIARAKARKLEQQQANAEPEQQVDPRKAAVEAAIARAKARKLEQQQANAEPEEQVDPRKAAVEAAIARAKARKLEQQQA
NAEPEQQVDPRKAAVEAAIARAKARKREQQPANAEPEEQVDPRKAAVEAAIARAKARKLEQQQANAVPEEQVDPRKAAVA
AAIARAQAKKAAQQKVVNED
>P76182 7.-.-.-~~~rsxD~~~Ion-translocating oxidoreductase complex subunit D~~~COG4658
MVFRIASSPYTHNQRQTSRIMLLVLLAAVPGIAAQLWFFGWGTLVQILLASVSALLAEALVLKLRKQSVAATLKDNSALL
TGLLLAVSIPPLAPWWMVVLGTVFAVIIAKQLYGGLGQNPFNPAMIGYVVLLISFPVQMTSWLPPHEIAVNIPGFIDAIQ
VIFSGHTASGGDMNTLRLGIDGISQATPLDTFKTSVRAGHSVEQIMQYPIYSGILAGAGWQWVNLAWLAGGVWLLWQKAI
RWHIPLSFLVTLALCAMLGWLFSPETLAAPQIHLLSGATMLGAFFILTDPVTASTTNRGRLIFGALAGLLVWLIRSFGGY
PDGVAFAVLLANITVPLIDYYTRPRVYGHRKG
>P77179 7.-.-.-~~~rsxE~~~Ion-translocating oxidoreductase complex subunit E~~~COG4660
MSEIKDVIVQGLWKNNSALVQLLGLCPLLAVTSTATNALGLGLATTLVLTLTNLTISTLRHWTPAEIRIPIYVMIIASVV
SAVQMLINAYAFGLYQSLGIFIPLIVTNCIVVGRAEAFAAKKGPALSALDGFSIGMGATCAMFVLGSLREIIGNGTLFDG
ADALLGSWAKVLRVEIFHTDSPFLLAMLPPGAFIGLGLMLAGKYLIDERMKKRRAEAAAERALPNGETGNV
>P77285 7.-.-.-~~~rsxG~~~Ion-translocating oxidoreductase complex subunit G~~~COG4659
MLKTIRKHGITLALFAAGSTGLTAAINQMTKTTIAEQASLQQKALFDQVLPAERYNNALAQSCYLVTAPELGKGEHRVYI
AKQDDKPVAAVLEATAPDGYSGAIQLLVGADFNGTVLGTRVTEHHETPGLGDKIELRLSDWITHFAGKKISGADDAHWAV
KKDGGDFDQFTGATITPRAVVNAVKRAGLYAQTLPAQLSQLPACGE
>P71276 2.7.7.49~~~ret~~~Retron Ec48 reverse transcriptase~~~
MGRPYVTLNLNGMFMDKFKPYSKSNAPITTLEKLSKALSISVEELKAIAELPLDEKYTLKEIPKIDGSKRIVYSLHPKMR
LLQSRINKRIFKELVVFPSFLFGSVPSKNDVLNSNVKRDYVSCAKAHCGAKTVLKVDISNFFDNIHRDLVRSVFEEILHI
KDEALEYLVDICTKDDFVVQGALTSSYIATLCLFAVEGDVVRRAQRKGLVYTRLVDDITVSSKISNYDFSQMQSHIERML
SEHDLPINKRKTKIFHCSSEPIKVHGLRVDYDSPRLPSDEVKRIRASIHNLKLLAAKNNTKTSVAYRKEFNRCMGRVNKL
GRVAHEKYESFKKQLQAIKPMPSKRDVAVIDAAIKSLELSYSKGNQNKHWYKRKYDLTRYKMIILTRSESFKEKLECFKS
RLASLKPL
>P21325 ~~~ret~~~Retron Ec67 protein~~~
MTKTSKLDALRAATSREDLAKILDVKLVFLTNVLYRIGSDNQYTQFTIPKKGKGVRTISAPTDRLKDIQRRICDLLSDCR
DEIFAIRKISNNYSFGFERGKSIILNAYKHRGKQIILNIDLKDFFESFNFGRVRGYFLSNQDFLLNPVVATTLAKAACYN
GTLPQGSPCSPIISNLICNIMDMRLAKLAKKYGCTYSRYADDITISTNKNTFPLEMATVQPEGVVLGKVLVKEIENSGFE
INDSKTRLTYKTSRQEVTGLTVNRIVNIDRCYYKKTRALAHALYRTGEYKVPDENGVLVSGGLDKLEGMFGFIDQVDKFN
NIKKKLNKQPDRYVLTNATLHGFKLKLNAREKAYSKFIYYKFFHGNTCPTIITEGKTDRIYLKAALHSLETSYPELFREK
TDSKKKEINLNIFKSNEKTKYFLDLSGGTADLKKFVERYKNNYASYYGSVPKQPVIMVLDNDTGPSDLLNFLRNKVKSCP
DDVTEMRKMKYIHVFYNLYIVLTPLSPSGEQTSMEDLFPKDILDIKIDGKKFNKNNDGDSKTEYGKHIFSMRVVRDKKRK
IDFKAFCCIFDAIKDIKEHYKLMLNS
>P0DV89 2.7.7.49~~~ret~~~Retron Se72 reverse transcriptase~~~
MNKPRFNGTPVASLDSLSAMLGIERKRLDWIVKSVSMSYKQFKVETGKNKKERQIFEPKRSLKGIQKKINKEIFEKIDYP
HYLHGALSGRDYISNAAVHTRKRTVICLDITNFYPSISKKDVCSIFKNLMRFSPDVSLCLTELVTLNNKVPQGGCCSSYI
ANLLFFNSEYNLYNRLKSMGLSYSRLLDDITISSDKDLSSEEKTKVIKLVHGMVNQYRLSINESKTTIEHSKDSSSKLSV
TGLWVKHGVPKLTKENRRYIRYLVYICKKQGAYERHTKEYHDLWNRCSGKVAQMSRLGHVQAVELRAILSEIMPVYDDYK
ISKLKLMAKHYLNKFTPPLTDDQIRKIDRMLYDFDIVGRTNKNLAKLYRRKLVALLPDR
>P0DV86 2.7.7.49~~~ret~~~Retron Ec73 reverse transcriptase~~~
MRIYSLIDSQTLMTKGFASEVMRSPEPPKKWDIAKKKGGMRTIYHPSSKVKLIQYWLMNNVFSKLPMHNAAYAFVKNRSI
KSNALLHAESKNKYYVKIDLKDFFPSIKFTDFEYAFTRYRDRIEFTTEYDKELLQLIKTICFISDSTLPIGFPTSPLIAN
FVARELDEKLTQKLNAIDKLNATYTRYADDIIVSTNMKGASKLILDCFKRTMKEIGPDFKINIKKFKICSASGGSIVVTG
LKVCHDFHITLHRSMKDKIRLHLSLLSKGILKDEDHNKLSGYIAYAKDIDPHFYTKLNRKYFQEIKWIQNLHNKVE
>Q46666 2.7.7.49~~~ret~~~Retron Ec78 reverse transcriptase~~~
MSVIRRLAAVLRQSDSGISAFLVTAPRKYKVYKIPKRTTGFRVIAQPAKGLKDIQRAFVQLYNFPVHDASMAYMKGKGIR
DNAAAHAGNQYLLKADLEDFFNSITPAIFWRCIEMSSALTPQFEPQDKFFIEKILFWQPIKHRKTKLILSVGAPSSPVIS
NFCMYEFDNRIHAACNKLEITYTRYADDLTFSCNIPNVLKAVPSTIEALLKDLFGSELRLNHSKTVFSSKAHNRHVTGVT
INNEETLSLGRDRKRFIKHLINQYKYGLLDNEDKAYLTGLLAFASHIEPGFITRMNEKYSLELMGRLRGQR
>Q47526 2.7.7.49~~~ret~~~Retron Ec83 reverse transcriptase~~~
MSIDIETTLQKAYPDFDVLLKSRPATHYKVYKIPKRTIGYRIIAQPTPRVKAIQRDIIEILKQHTHIHDAATAYVDGKNI
LDNAKIHQSSVYLLKLDLVNFFNKITPELLFKALARQKVDISDTNKNLLKQFCFWNRTKRKNGALVLSVGAPSSPFISNI
VMSSFDEEISSFCKENKISYSRYADDLTFSTNERDVLGLAHQKVKTTLIRFFGTRIIINNNKIVYSSKAHNRHVTGVTLT
NNNKLSLGRERKRYITSLVFKFKEGKLSNVDINHLRGLIGFAYNIEPAFIERLEKKYGESTIKSIKKYSEGG
>P23070 2.7.7.49~~~ret~~~Retron Ec86 reverse transcriptase~~~
MKSAEYLNTFRLRNLGLPVMNNLHDMSKATRISVETLRLLIYTADFRYRIYTVEKKGPEKRMRTIYQPSRELKALQGWVL
RNILDKLSSSPFSIGFEKHQSILNNATPHIGANFILNIDLEDFFPSLTANKVFGVFHSLGYNRLISSVLTKICCYKNLLP
QGAPSSPKLANLICSKLDYRIQGYAGSRGLIYTRYADDLTLSAQSMKKVVKARDFLFSIIPSEGLVINSKKTCISGPRSQ
RKVTGLVISQEKVGIGREKYKEIRAKIHHIFCGKSSEIEHVRGWLSFILSVDSKSHRRLITYISKLEKKYGKNPLNKAKT
>P0DV59 2.7.7.49~~~ret~~~Retron Eco8 reverse transcriptase~~~
MKTKKMILVDKVFYEKILSVESFKENIITQSAIPKISNKEVRLISSGSKIFYAINNTSPHSHVQLRLNRFFLSHIPLNSA
AKAFVRGGSYLKYLEPHIYGSSYCRLDISSFFNNISFDDVKQSLSPYIKDEYLIGTEQKLIDAILNSVGYESPIRKDKGM
IIPMGFRTSPAISNIVFRKMDLLIQDFCAKKGVIYSRYADDMLFSNPRESKLLMSDYFIDEISSLLSIMGFNINQSKYIS
REKEISINGYVIENKGGNGSIGTIRLSKSKLNTVLKVTHALAQNIPYKNICNKYIKVRLKEKNIKYESKKDEFEKKYYRD
QLINYLGGYRSYLISLVKFHSEYKCVNSDFIIQINGILNDIQNHIQKIKKNRRL
>P0DV94 2.7.7.49~~~ret~~~Retron Vc95 reverse transcriptase~~~
MNILTTLREQLLTNNVIMPQEFERLEVRGSHAYKVYSIPKRKAGRRTIAHPSSKLKICQRHLNAILNPLLKVHDSSYAYV
KGRSIKDNALVHSHSAYVLKMDFQNFFNSITPTILRQCLIQNDILLSVNELEKLEQLIFWNPSKKRNGKLILSVGSPISP
LISNAIMYPFDKIINDICTKHGINYTRYADDITFSTNIKNTLNKLPEIVEQLIIQTYAGRIIINKRKTVFSSKKHNRHVT
GITLTNDSKISIGRSRKRYISSLVFKYINKNLDIDEINHMKGMLAFAYNIEPIYIHRLSHKYKVNIVEKILRGSN
>P46849 6.5.1.4~~~rtcA~~~RNA 3'-terminal phosphate cyclase~~~COG0430
MKRMIALDGAQGEGGGQILRSALSLSMITGQPFTITSIRAGRAKPGLLRQHLTAVKAATEICGATVEGAELGSQRLLFRP
GTVRGGDYRFAIGSAGSCTLVLQTVLPALWFADGPSRVEVSGGTDNPSAPPADFIRRVLEPLLAKIGIHQQTTLLRHGFY
PAGGGVVATEVSPVASFNTLQLGERGNIVQMRGEVLLAGVPRHVAEREIATLAGSFSLHEQNIHNLPRDQGPGNTVSLEV
ESENITERFFVVGEKRVSAEVVAAQLVKEVKRYLASTAAVGEYLADQLVLPMALAGAGEFTVAHPSCHLLTNIAVVERFL
PVRFSLIETDGVTRVSIE
>P46850 6.5.1.8~~~rtcB~~~RNA-splicing ligase RtcB~~~COG1690
MNYELLTTENAPVKMWTKGVPVEADARQQLINTAKMPFIFKHIAVMPDVHLGKGSTIGSVIPTKGAIIPAAVGVDIGCGM
NALRTALTAEDLPENLAELRQAIETAVPHGRTTGRCKRDKGAWENPPVNVDAKWAELEAGYQWLTQKYPRFLNTNNYKHL
GTLGTGNHFIEICLDESDQVWIMLHSGSRGIGNAIGTYFIDLAQKEMQETLETLPSRDLAYFMEGTEYFDDYLKAVAWAQ
LFASLNRDAMMENVVTALQSITQKTVRQPQTLAMEEINCHHNYVQKEQHFGEEIYVTRKGAVSARAGQYGIIPGSMGAKS
FIVRGLGNEESFCSCSHGAGRVMSRTKAKKLFSVEDQIRATAHVECRKDAEVIDEIPMAYKDIDAVMAAQSDLVEVIYTL
RQVVCVKG
>P9WGW4 6.5.1.8~~~rtcB~~~RNA-splicing ligase RtcB~~~
MQVVNVATLPGIVRASYAMPDVHWGYGFPIGGVAATDVDNDGVVSPGGVGFDISCGVRLLVGEGLDREELQPRLPAVMDR
LDRAIPRGVGTAGVWRLPDRNTLQEVLTGGARFAVEQGHGVALDLERCEDGGVMTGADAAKISDRALQRGLGQIGSLGSG
NHFLEVQAVDRVYDPVAAAPMGLAEGTVCVMIHTGSRGLGHQICTDHVRQMEQAMGRYGIAVPDRQLACVPVHSPDGQAY
LAAMAAAANYGRANRQLLTEATRRVFADATGTPLDLLYDVSHNLAKIETHPIDGQLRSVCVHRKGATRSLPPHHHELPAE
LAAVGQPVLIPGTMGTASYVLAGVTGNPAFFSTAHGAGRVLSRHQAARHTSGEAIRASLAKRGIIVRGTSRRDIAEEKPE
AYKDVDEVIEASHQSGLARKVARLVPLGCVKG
>P9WGW5 6.5.1.8~~~rtcB~~~RNA-splicing ligase RtcB~~~COG1690
MQVVNVATLPGIVRASYAMPDVHWGYGFPIGGVAATDVDNDGVVSPGGVGFDISCGVRLLVGEGLDREELQPRLPAVMDR
LDRAIPRGVGTAGVWRLPDRNTLQEVLTGGARFAVEQGHGVALDLERCEDGGVMTGADAAKISDRALQRGLGQIGSLGSG
NHFLEVQAVDRVYDPVAAAPMGLAEGTVCVMIHTGSRGLGHQICTDHVRQMEQAMGRYGIAVPDRQLACVPVHSPDGQAY
LAAMAAAANYGRANRQLLTEATRRVFADATGTPLDLLYDVSHNLAKIETHPIDGQLRSVCVHRKGATRSLPPHHHELPAE
LAAVGQPVLIPGTMGTASYVLAGVTGNPAFFSTAHGAGRVLSRHQAARHTSGEAIRASLAKRGIIVRGTSRRGIAEEKPE
AYKDVDEVIEASHQSGLARKVARLVPLGCVKG
>O31466 ~~~rtpA~~~Tryptophan RNA-binding attenuator protein inhibitory protein~~~
MVIATDDLEVACPKCERAGEIEGTPCPACSGKGVILTAQGYTLLDFIQKHLNK
>Q59490 1.17.4.2~~~rtpR~~~Adenosylcobalamin-dependent ribonucleoside-triphosphate reductase~~~
MSEEISLSAEFIDRVKASVKPHWGKLGWVTYKRTYARWLPEKGRSENWDETVKRVVEGNINLDPRLQDSPSLELKQSLTE
EAERLYKLIYGLGATPSGRNLWISGTDYQRRTGDSLNNCWFVAIRPQKYGDSKIVPSYLGKQEKAVSMPFSFLFDELMKG
GGVGFSVARSNISQIPRVDFAIDLQLVVDETSESYDASVKVGAVGKNELVQDADSIYYRLPDTREGWVLANALLIDLHFA
QTNPDRKQKLILDLSDIRPYGAEIHGFGGTASGPMPLISMLLDVNEVLNNKAGGRLTAVDAADICNLIGKAVVAGNVRRS
AELALGSNDDQDFISMKQDQEKLMHHRWASNNSVAVDSAFSGYQPIAAGIRENGEPGIVNLDLSKNYGRIVDGYQAGIDG
DVEGTNPCGEISLANGEPCNLFEVFPLIAEEQGWDLQEVFALAARYAKRVTFSPYDWEISREIIQKNRRIGISMSGIQDW
LLTRLGNRVVTGFKDDFDPETHEAIKVPVYDKRAIKMVDQLYKAVVKADQDYSKTLGCNESIKHTTVKPSGTVAKLAGAS
EGMHFHYGAYLIQRIRFQDSDPLLPALKACGYRTEADIYTENTTCVEFPIKAVGADNPNFASAGTVSIAEQFATQAFLQT
YWSDNAVSCTITFQDSEGDQVESLLRQYRFITKSTSLLPYFGGSLQQAPKEPIDKETYEKRSQEITGNVEEVFSQLNSDV
KDLELVDQTDCEGGACPIK
>P0CI76 ~~~rtp~~~Replication termination protein~~~COG1695
MKEEKRSSTGFLVKQRAFLKLYMITMTEQERLYGLKLLEVLRSEFKEIGFKPNHTEVYRSLHELLDDGILKQIKVKKEGA
KLQEVVLYQFKDYEAAKLYKKQLKVELDRCKKLIEKALSDNF
>P55128 ~~~apxIA~~~RTX-I toxin determinant A from serotypes 1/9~~~
MANSQLDRVKGLIDSLNQHTKSAAKSGAGALKNGLGQVKQAGQKLILYIPKDYQASTGSSLNDLVKAAEALGIEVHRSEK
NGTALAKELFGTTEKLLGFSERGIALFAPQFDKLLNKNQKLSKSLGGSSEALGQRLNKTQTALSALQSFLGTAIAGMDLD
SLLRRRRNGEDVSGSELAKAGVDLAAQLVDNIASATGTVDAFAEQLGKLAMPYLTLALSGLASKLNNLPDLSLAGPGFDA
VSGILSVVSASFILSNKDADAGTKAAAGIEISTKILGNIGKAVSQYIIAQRVAAGLSTTAATGGLIGSVVALAISPLSFL
NVADKFERAKQLEQYSERFKKFGYEGDSLLASFYRETGAIEAALTTINSVLSARSAGVGAAATGSLVGAPVAALVSAITG
IISGILDASKQAIFERVATKLANKIDEWEKKHGKNYFENGYDARHSAFLEDTFELLSQYNKEYSVERVVAITQQRWDVNI
GELAGITRKGSDTKSGKAYVDFFEEGKLLEKEPDRFDKKVFDPLEGKIDLSSINKTTLLKFVTPVFTAGEEIRERKQTGK
YQYMTELFVKGKEKWVVTGVQSHNAIYDYTNLIQLAIDKKGEKRQVTIESHLGEKNDRIYLSSGSSIVYAGNGHDVAYYD
KTDTGYLTFDGQSAQKAGEYIVTKELKADVKVLKEVVKTQDISVGKTCSEKLEYRDYELSPFELGNGIRAKDELHSVEEI
IGSNRKDKFFGSRFTDIFHGAKGDDEIYGNDGHDILYGDDGNDVIHGGDGNDHLVGGNGNDRLIGGKGNNFLNGGDGDDE
LQVFEGQYNVLLGGAGNDILYGSDGTNLFDGGVGNDKIYGGLGKDIYRYSKEYGRHIIIEKGGDDDTLLLSDLSFKDVGF
IRIGDDLLVNKRIGGTLYYHEDYNGNALTIKDWFKEGKEGQNNKIEKIVDKDGAYVLSQYLTELTAPGRGINYFNGLEEK
LYYGEGYNALPQLRKDIEQIISSTGAFTGDHGKVSVGSGGPLVYNNSANNVANSLSYSLAQAA
>P26760 ~~~apxIB~~~Toxin RTX-I translocation ATP-binding protein~~~
MDFYREEDYGLYALTILAQYHNIAVNPEELKHKFDLEGKGLDLTAWLLAAKSLELKAKQVKKAIDRLAFIALPALVWRED
GKHFILTKIDNEAKKYLIFDLETHNPRILEQAEFESLYQGKLILVASRASIVGKLAKFDFTWFIPAVIKYRKIFIETLIV
SIFLQIFALITPLFFQVVMDKVLVHRGFSTLNVITVALAIVVLFEIVLNGLRTYIFAHSTSRIDVELGARLFRHLLALPI
SYFENRRVGDTVARVRELDQIRNFLTGQALTSVLDLMFSFIFFAVMWYYSPKLTLVILGSLPFYMGWSIFISPILRRRLD
EKFARGADNQSFLVESVTAINTIKALAVTPQMTNTWDKQLASYVSAGFRVTTLATIGQQGVQFIQKVVMVITLWLGAHLV
ISGDLSIGQLIAFNMLSGQVIAPVIRLAQLWQDFQQVGISVTRLGDVLNSPTESYQGKLALPEIKGDITFRNIRFRYKPD
APVILNDVNLSIQQGEVIGIVGRSGSGKSTLTKLIQRFYIPENGQVLIDGHDLALADPNWLRRQVGVVLQDNVLLNRSIR
DNIALADPGMPMEKIVHAAKLAGAHEFISELREGYNTIVGEQGAGLSGGQRQRIAIARALVNNPKILIFDEATSALDYES
EHIIMRNMHQICKGRTVIIIAHRLSTVKNADRIIVMEKGQIVEQGKHKELLADPNGLYHYLHQLQSE
>P55132 2.3.1.-~~~apxIC~~~RTX-I toxin-activating lysine-acyltransferase ApxIC~~~
MSKKINGFEVLGEVAWLWASSPLHRKWPLSLLAINVLPAIESNQYVLLKRDGFPIAFCSWANLNLENEIKYLDDVASLVA
DDWTSGDRRWFIDWIAPFGDSAALYKHMRDNFPNELFRAIRVDPDSRVGKISEFHGGKIDKKLASKIFQQYHFELMSELK
NKQNFKFSLVNS
>A1YKW7 ~~~rtxA~~~Cytolysin RtxA~~~
MNLATTKAKLKSGAQAGVQALNKAGHAAKTGTVAAGKATVAGAKSLYLTIPKDYDIEKGGSLNELIKAADELGIARLQED
ANNIESAKKSIDTVEKLLSFTQTGVAVSAKKLDELLQKYSSSQLAKSLGSSANIDSKLTKTNHILSTLSSFLGTALAGMD
LDSLVKQGDASATDLAKASLDLINELVNNISNSVQSIEAFSEQLGRLGAAISQTKGLSGLGNKLQNLPNFGKANLALEMI
SGLLSGISAGFTLADKNASTEKKVAAGFELSNQVIGNVTKAISSYVLAQRAAAGLSTTGAVASLITASIMLAISPLAFMN
AADKFKNASLIDEFAKQFKKFGYDGDSLLAEYQRGAGTIEASLTAINTALGAVSAGVSAAAVGSVVGSPVALLVAGVTGL
ISGILEASKQAMFESVANRLQSKILAWEKENGGKNYFENGYDARHAHYLERNLKLLSELNKELQAERVIAITQQRWDANI
GELAGITKLGDRISSGKAYADAFEDGKKLDGASNVTVDTRTGVVDISNANGKKTQALHFTSPLLTAGTETRERVQNGKYS
YINQLKFNRVKSWTVKDGEANSRLDFSKVIQHVAFNDEDGRLSGKTEEIALNVNAGSGNDDIFAGQGKMNVDGGTGHDRV
FYSKDGGLGQVNVDGTKATEAGSYTVNRSINNGSFYHEVIKRQTTQVGKRTETLEYRDFELKRPEHGYQTTDTLKSVEEI
VGSQFSDTFKGSKFADIFHGGDGNDTLEGNDGDDRLFGGNGDDHLYGGNGDDLLDGGKGNDVINGGDGNDVYISRKGDGN
DTLYDSHGSDKLAFADADLSELTIERTAQGIMIKRNDGSGSINMAEWYKTLSQQNYHGNATDDKIEQIIGKNGDYITSEQ
IDKLLKDKQTGTITSAQLQQLAQENKSKSIDSGNLASTLNKLIESMASFGSRGATASNYLQPAHKSPQNVLAPSAV
>Q93HS4 1.14.19.61~~~rtxC~~~Dihydrorhizobitoxine desaturase~~~
MNQADAWKLRSSRYEAVQFSREINKQLSELRPDNVMGAIYIAKDYAVIAACTLATLCVSWWLYPLAVLLIGAYQRGLTTI
AHDAAHRTLAKNTTWNYVLGILFAAYPLFQRHWAYRISHVYLHHPYLGDPEKDPDLKFFMANGVYDVQPPKRYAFNIIWK
PIFGGATLAYLKYLWTNRFSITDSEDQSRSSILVDKYGFYLFWIGILAGSYALGLLHIVILFWIVPYLTTFQVLGWFVEL
AEHSPMCESETKNVYLTRNRKGNFLERAILGQNLDEYHLEHHLSPGIPFWLLHKAQKIRMQDPGYAKVAASWGGLFVKGP
QGQPSVITQLKERNRRLYEQSLADAHAKGHVA
>A1YKW8 2.3.1.-~~~rtxC~~~Protein-lysine myristoyltransferase RtxC~~~
MDKFSELGSIAWLWTNSELHQNWPLSLFSTNVIPAIETQQYVLLVRDGMPIAYCSWARLNLETEVKYINDVTSLKLEDWQ
SGDRYWFIDWIAPFGDSYLLTKHMRKLFSDGLFRAIRVDAGSPNGKISEFYGRNVDAKLAMQSFEQYQKELMNALSQQDN
FIISTSK
>P04170 ~~~rd1~~~Rubredoxin-1~~~COG1773
MQKYVCNVCGYEYDPAEHDNVPFDQLPDDWCCPVCGVSKDQFSPA
>Q9HTK7 ~~~rubA1~~~Rubredoxin-1~~~
MKKWQCVVCGLIYDEAKGWPEEGIEAGTRWEDVPEDWLCPDCGVGKLDFEMIEIG
>P12692 ~~~alkF~~~Rubredoxin-1~~~
MSRYQCPDCQYIYDENKGEPHEGFHPNTSWNDIPKDWACPDCAVRDKVDFIFLADSPSKETQLGVNSQLANSESGISDAT
PTGMAVLAAELVIPLNQENKNEGCAAKTEVLDQASTPQVVRKSSTRKKMRNK
>P14072 ~~~rubR2~~~Rubredoxin-2~~~
MKKFICDVCGYIYDPAVGDPDNGVEPGTEFKDIPDDWVCPLCGVDKSQFSETE
>Q93PP8 ~~~rd2~~~Rubredoxin-2~~~COG1773
MAEPQDMWRCQMVNCGYVYDPDRGDKRRKVPAGTKFEDLPEDWRCPVCGAGKKSFRRLSDEA
>Q9HTK8 ~~~rubA2~~~Rubredoxin-2~~~
MRKWQCVVCGFIYDEALGLPEEGIPAGTRWEDIPADWVCPDCGVGKIDFEMIEIA
>P00272 ~~~alkG~~~Rubredoxin-2~~~
MASYKCPDCNYVYDESAGNVHEGFSPGTPWHLIPEDWCCPDCAVRDKLDFMLIESGVGEKGVTSTHTSPNLSEVSGTSLT
AEAVVAPTSLEKLPSADVKGQDLYKTQPPRSDAQGGKAYLKWICITCGHIYDEALGDEAEGFTPGTRFEDIPDDWCCPDC
GATKEDYVLYEEK
>P58025 ~~~rub3~~~Rubredoxin 3~~~COG1773
MQKWVCVPCGYEYDPADGDPENGIEPGTAFEDLPEDWVCPVCGVDKSFFEPVS
>P23474 ~~~~~~Rubredoxin~~~COG1773
MTKYVCTVCGYVYDPEVGDPDNNINPGTSFQDIPEDWVCPLCGVGKDQFEEEA
>P42453 ~~~rubA~~~Rubredoxin~~~COG1773
MKKYQCIVCGWIYDEAEGWPQDGIAPGTKWEDIPDDWTCPDCGVSKVDFEMIEV
>P14071 ~~~~~~Rubredoxin~~~
MQKYVCDICGYVYDPAVGDPDNGVAPGTAFADLPEDWVCPECGVSKDEFSPEA
>P09947 ~~~rub~~~Rubredoxin~~~
MQKYVCSVCGYVYDPADGEPDDPIDPGTGFEDLPEDWVCPVCGVDKDLFEPES
>Q9AL94 1.-.-.-~~~rd~~~Rubredoxin~~~COG1773
MKKYVCVVCGYIYDPAEGDPDNGVNPGTSFEDIPDDWVCPLCGVGKDQFEPSEE
>P00268 ~~~~~~Rubredoxin~~~
MKKYTCTVCGYIYNPEDGDPDNGVNPGTDFKDIPDDWVCPLCGVGKDQFEEVEE
>P00269 ~~~rub~~~Rubredoxin~~~COG1773
MKKYVCTVCGYEYDPAEGDPDNGVKPGTSFDDLPADWVCPVCGAPKSEFEAA
>P15412 ~~~rub~~~Rubredoxin~~~COG1773
MKKYVCTVCGYEYDPAEGDPDNGVKPGTAFEDVPADWVCPICGAPKSEFEPA
>P56263 ~~~~~~Rubredoxin~~~
MKKYGCLVCGYVYDPAKGDPDHGIAPGTAFEDLPADWVCPLCGVSKDEFEPL
>P00271 ~~~~~~Rubredoxin~~~
MDKYECSICGYIYDEAEGDDGNVAAGTKFADLPADWVCPTCGADKDAFVKMD
>P00270 ~~~~~~Rubredoxin~~~
MDIYVCTVCGYEYDPAKGDPDSGIKPGTKFEDLPDDWACPVCGASKDAFEKQ
>P00267 ~~~~~~Rubredoxin~~~
MQKFECTLCGYIYDPALVGPDTPDQDGAFEDVSENWVCPLCGAGKEDFEVYED
>P19500 ~~~~~~Rubredoxin~~~COG1773
MEKWQCTVCGYIYDPEVGDPTQNIPPGTKFEDLPDDWVCPDCGVGKDQFEKI
>Q97FZ9 1.11.1.1~~~rbr1~~~Rubrerythrin-1~~~COG1592
MKSLKGTKTAENLMKAFAGESQARNRYTFYSNTAKKEGYVQISNIFLETAENERMHAKRFFKFLSEGLDDEAVEINGASY
PTTLGDTKKNLIAAAKGENEEWTDLYPSFAKTAEDEGFKGVAAAFRLIAAVEKEHEKRYNALLKNIEENKVFEKDEVKFW
KCIKCGYIFEGKTAPKVCPACLHPQAYFEILSENY
>Q97ET8 1.11.1.1~~~rbr2~~~Rubrerythrin-2~~~COG1592
MSVKNAMTADFLRSAYGGESMAHMRYLIWGEEAENSNYPNIGRLFKAIAYSEHIHAKNHFNVLKEDLYDSSVVAGAVFGS
TNLIDNLQGAINGELHEIKQMYPVYLETARYQEEKEAERTFHYALEAEKIHAKLFQDAQDSAKENKDINIGKVYICPVCG
FTTLDENIEQCPICGVKKDKFQAF
>P24931 ~~~rbr~~~Rubrerythrin~~~COG1592
MKSLKGSRTEKNILTAFAGESQARNRYNYFGGQAKKDGFVQISDIFAETADQEREHAKRLFKFLEGGDLEIVAAFPAGII
ADTHANLIASAAGEHHEYTEMYPSFARIAREEGYEEIARVFASIAVAEEFHEKRFLDFARNIKEGRVFLREQATKWRCRN
CGYVHEGTGAPELCPACAHPKAHFELLGINW
>Q9AGG3 ~~~rbr~~~Rubrerythrin~~~COG1592
MSIKKKTEMNKSIKGSKTEKHLLMAFAGESQARSRYTFFASVAKKEGYEQIAGVFMETAEQEKEHAKRFFSFLEGGMLEI
TASFPAGIIGSTAENLRAAAAGENEEWTDLYPAFAETAEEEGFKEIAAVFRQIAKVEAEHERRYLALLAHVEDGSVFERT
EEIAWQCRNCGYVITSKKAPKLCPACAHPQAYFEPMKTNY
>P42454 1.18.1.1~~~rubB~~~Rubredoxin-NAD(+) reductase~~~COG1251
MHPIVIIGSGMAGYTLAREFRKLNPEHELVMICADDAVNYAKPTLSNALSGNKAPEQIPLGDAEKMSTQLKLQILSETWV
KAINPETHELKLEKNGQETIQPYSKLVLAVGANPTRLAIAGDGSDDIHVVNSLIDYRAFRENLAKRQDKRVVILGAGLIG
CEFANDLQHTGHQVTVIDLSPRPLGRLLPAHIADAFQKNLEESGIHFVLSTTVEKVSKINDGQDYAVTLANGQTLVADIV
LSAIGLQPNIDLAKHAGVHTSRGILTNSLLETNLEDIYAIGDCAEVNGTLLPYVMPIMQQARALAKTLSGETTHVHYPAM
PVAVKTPAAPLTVLPVPVDVDVNWETEEFEDGMLAKAIDNTDTLRGFVLLGATAGKQRLTLTKLVPDLIPAQL
>Q0VTB0 1.18.1.1~~~rubB~~~Rubredoxin-NAD(+) reductase~~~COG1251
MHPIVIVGTGLAGFNTVKEFRKLDKETPIVMLTADDGRNYSKPMLSAGFSKGKTADDLCMATPEKVAEQLNVDVRTGVHV
AGIDATNKRVLLPDDHLDYSKLVLALGADTWTPPLEGDAVGEVFSVNDLMDYGKFRAAVEGKKTVTILGGGLIGCEFAND
LSNGGFKVSLVEPMGRCLPLLLPEQASEAVGRGLADLGVQFHFGPLAKAVHHGDNGQLVTELSDGSQLESDVVLSAIGLR
PRISLAKEAGLDTNRGILTDKSLRTSAEHIYALGDCAEVQGHVLPYVLPLMASARALAKTLAGETTEVSYGVMPVTIKTP
ACPVVVCPAAEGSEGAWEVEAEGNTVQALFRSKDGSLLGYALTGEAVKEKMKLNKELPAIMP
>Q9HTK9 1.18.1.1~~~alkT~~~Rubredoxin-NAD(+) reductase~~~
MSERAPLVIIGTGLAGYNLAREWRKLDGETPLLMITADDGRSYSKPMLSTGFSKNKDADGLAMAEPGAMAEQLNARILTH
TRVTGIDPGHQRIWIGEEEVRYRDLVLAWGAEPIRVPVEGDAQDALYPINDLEDYARFRQAAAGKRRVLLLGAGLIGCEF
ANDLSSGGYQLDVVAPCEQVMPGLLHPAAAKAVQAGLEGLGVRFHLGPVLASLKKAGEGLEAHLSDGEVIPCDLVVSAVG
LRPRTELAFAAGLAVNRGIVVDRSLRTSHANIYALGDCAEVDGLNLLYVMPLMACARALAQTLAGNPSQVAYGPMPVTVK
TPACPLVVSPPPRGMDGQWLVEGSGTDLKVLCRDTAGRVIGYALTGAAVNEKLALNKELPGLMA
>P17052 1.18.1.1~~~alkT~~~Rubredoxin-NAD(+) reductase~~~
MAIVVVGAGTAGVNAAFWLRQYGYKGEIRIFSRESVAPYQRPPLSKAFLTSEIAESAVPLKPEGFYTNNNITISLNTPIV
SIDVGRKIVSSKDGKEYAYEKLILATPASARRLTCEGSELSGVCYLRSMEDAKNLRRKLVESASVVVLGGGVIGLEVASA
AVGLGKRVTVIEATPRVMARVVTPAAANLVRARLEAEGIEFKLNAKLTSIKGRNGHVEQCVLESGEEIQADLIVVGIGAI
PELELATEAALEVSNGVVVDDQMCTSDTSIYAIGDCAMARNPFWGTMVRLETIHNAVTHAQIVASSICGTSTPAPTPPRF
WSDLKGMALQGLGALKDYDKLVVAINNETLELEVLAYKQERLIATETINLPKRQGALAGSIKLPD
>P23742 ~~~rus~~~Rusticyanin~~~
GALDGSWKEATLPQVKAMLQKDTGKASGDTVTYSGKTVHVVAAAVLPGFPFPSFEVHDKKNPTLDIPAGATVDVTFINTN
KGFGHSFDITKKGPPFAVMPNIKPIVAGTGFSPVPKDGKFGYSEFTWHPTAGTYYYVCQIPGHAATGMFGKIIVK
>P0C918 ~~~rus~~~Rusticyanin~~~
MYTQNTMKKNWYVTVGAAAALAATVGMGTAMAGTLDSTWKEATLPQVKAMLEKDTGKVSGDTVTYSGKTVHVVAAAVLPG
FPFPSFEVHDKKNPTLEIPAGATVDVTFINTNKGFGHSFDITKKGPPYAVMPVIDPIVAGTGFSPVPKDGKFGYTDFTWH
PTAGTYYYVCQIPGHAATGMFGKIIVK
>P0AG74 3.1.21.10~~~rusA~~~Crossover junction endodeoxyribonuclease RusA~~~COG4570
MNTYSITLPWPPSNNRYYRHNRGRTHVSAEGQAYRDNVARIIKNAMLDIGLAMPVKIRIECHMPDRRRRDLDNLQKAAFD
ALTKAGFWLDDAQVVDYRVVKMPVTKGGRLELTITEMGNE
>B7JAQ0 ~~~rus~~~Rusticyanin~~~COG4454
MYTQNTMKKNWYVTVGAAAALAATVGMGTAMAGTLDTTWKEATLPQVKAMLEKDTGKVSGDTVTYSGKTVHVVAAAVLPG
FPFPSFEVHDKKNPTLEIPAGATVDVTFINTNKGFGHSFDITKKGPPYAVMPVIDPIVAGTGFSPVPKDGKFGYTDFTWH
PTAGTYYYVCQIPGHAATGMFGKIVVK
>P75898 1.14.99.46~~~rutA~~~Pyrimidine monooxygenase RutA~~~COG2141
MQDAAPRLTFTLRDEERLMMKIGVFVPIGNNGWLISTHAPQYMPTFELNKAIVQKAEHYHFDFALSMIKLRGFGGKTEFW
DHNLESFTLMAGLAAVTSRIQIYATAATLTLPPAIVARMAATIDSISGGRFGVNLVTGWQKPEYEQMGIWPGDDYFSRRY
DYLTEYVQVLRDLWGTGKSDFKGDFFTMNDCRVSPQPSVPMKVICAGQSDAGMAFSARYADFNFCFGKGVNTPTAFAPTA
ARMKQAAEQTGRDVGSYVLFMVIADETDDAARAKWEHYKAGADEEALSWLTEQSQKDTRSGTDTNVRQMADPTSAVNINM
GTLVGSYASVARMLDEVASVPGAEGVLLTFDDFLSGIETFGERIQPLMQCRAHLPALTQEVA
>P75897 3.5.1.110~~~rutB~~~Ureidoacrylate amidohydrolase RutB~~~COG1335
MTTLTARPEAITFDPQQSALIVVDMQNAYATPGGYLDLAGFDVSTTRPVIANIQTAVTAARAAGMLIIWFQNGWDEQYVE
AGGPGSPNFHKSNALKTMRKQPQLQGKLLAKGSWDYQLVDELVPQPGDIVLPKPRYSGFFNTPLDSILRSRGIRHLVFTG
IATNVCVESTLRDGFFLEYFGVVLEDATHQAGPKFAQKAALFNIETFFGWVSDVETFCDALSPTSFAHIA
>P0AFQ6 3.5.-.-~~~rutC~~~3-aminoacrylate deaminase RutC~~~COG0251
MPKSVIIPAGSSAPLAPFVPGTLADGVVYVSGTLAFDQHNNVLFADDPKAQTRHVLETIRKVIETAGGTMADVTFNSIFI
TDWKNYAAINEIYAEFFPGDKPARFCIQCGLVKPDALVEIATIAHIAK
>P0AFQ5 3.5.-.-~~~rutC~~~3-aminoacrylate deaminase RutC~~~COG0251
MPKSVIIPAGSSAPLAPFVPGTLADGVVYVSGTLAFDQHNNVLFADDPKAQTRHVLETIRKVIETAGGTMADVTFNSIFI
TDWKNYAAINEIYAEFFPGDKPARFCIQCGLVKPDALVEIATIAHIAK
>C8U5H1 3.5.1.-~~~rutD~~~Putative carbamate hydrolase RutD~~~
MKLSLSPPPYADAPVVVLISGLGGSGSYWLQQLAVLEQEYQVVCYDQRGTGNNPDTLAEDYSIAQMAAELHQALVAAGIE
HYAVVGHALGALVGMQLALDYPASVTVLISVNGWLRINAHTRRCFQVRERLLYSGGAQAWVEAQPLFLYPADWMAARAPR
LEAEDALALAHFQGKNNLLRRLNALKRADFSHHADRIRCPVQIICASDDLLVPTACSSELHAALPDSQKMVMPYGGHACN
VTDPETFNALLLNGLASLLHHREAAL
>P75895 3.5.1.-~~~rutD~~~Putative carbamate hydrolase RutD~~~COG2021
MKLSLSPPPYADAPVVVLISGLGGSGSYWLPQLAVLEQEYQVVCYDQRGTGNNPDTLAEDYSIAQMAAELHQALVAAGIE
HYAVVGHALGALVGMQLALDYPASVTVLISVNGWLRINAHTRRCFQVRERLLYSGGAQAWVEAQPLFLYPADWMAARAPR
LEAEDALALAHFQGKNNLLRRLNALKRADFSHHADRIRCPVQIICASDDLLVPTACSSELHAALPDSQKMVMPYGGHACN
VTDPETFNALLLNGLASLLHHREAAL
>B6I985 3.5.1.-~~~rutD~~~Putative carbamate hydrolase RutD~~~
MKLSLSPPPYADAPVVVLISGLGGSGSYWLPQLAVLEQEYQVVCYDQRGTGNNPDTLAEDYSIAQMAAELHQALVAAGIE
HYAVVGHALGALVGMQLALDYPASVTVLISVNGWLRINAHTRRCFQVRERLLYSGGAQAWVEAQPLFLYPADWMAARAPR
LEAEDALALAHFQGKNNLLRRLNALKRADFSHHADRIRCPVQIICASDDLLVPTACSSELHAALPDSQKMVMPYGGHACN
VTDPETFNALLLNGLASLLHHREAAL
>P75894 1.1.1.298~~~rutE~~~Probable malonic semialdehyde reductase RutE~~~COG0778
MNEAVSPGALSTLFTDARTHNGWRETPVSDETLREIYALMKWGPTSANCSPARIVFTRTAEGKERLRPALSSGNLQKTLT
APVTAIVAWDSEFYERLPLLFPHGDARSWFTSSPQLAEETAFRNSSMQAAYLIVACRALGLDTGPMSGFDRQHVDDAFFT
GSTLKSNLLINIGYGDSSKLYARLPRLSFEEACGLL
>P75893 1.5.1.42~~~rutF~~~FMN reductase (NADH) RutF~~~COG1853
MNIVDQQTFRDAMSCMGAAVNIITTDGPAGRAGFTASAVCSVTDTPPTLLVCLNRGASVWPAFNENRTLCVNTLSAGQEP
LSNLFGGKTPMEHRFAAARWQTGVTGCPQLEEALVSFDCRISQVVSVGTHDILFCAIEAIHRHTTPYGLVWFDRSYHALM
RPAC
>P75892 ~~~rutG~~~Putative pyrimidine permease RutG~~~COG2233
MAMFGFPHWQLKSTSTESGVVAPDERLPFAQTAVMGVQHAVAMFGATVLMPILMGLDPNLSILMSGIGTLLFFFITGGRV
PSYLGSSAAFVGVVIAATGFNGQGINPNISIALGGIIACGLVYTVIGLVVMKIGTRWIERLMPPVVTGAVVMAIGLNLAP
IAVKSVSASAFDSWMAVMTVLCIGLVAVFTRGMIQRLLILVGLIVACLLYGVMTNVLGLGKAVDFTLVSHAAWFGLPHFS
TPAFNGQAMMLIAPVAVILVAENLGHLKAVAGMTGRNMDPYMGRAFVGDGLATMLSGSVGGSGVTTYAENIGVMAVTKVY
STLVFVAAAVIAMLLGFSPKFGALIHTIPAAVIGGASIVVFGLIAVAGARIWVQNRVDLSQNGNLIMVAVTLVLGAGDFA
LTLGGFTLGGIGTATFGAILLNALLSRKLVDVPPPEVVHQEP
>P0ACU2 ~~~rutR~~~HTH-type transcriptional regulator RutR~~~COG1309
MTQGAVKTTGKRSRAVSAKKKAILSAALDTFSQFGFHGTRLEQIAELAGVSKTNLLYYFPSKEALYIAVLRQILDIWLAP
LKAFREDFAPLAAIKEYIRLKLEVSRDYPQASRLFCMEMLAGAPLLMDELTGDLKALIDEKSALIAGWVKSGKLAPIDPQ
HLIFMIWASTQHYADFAPQVEAVTGATLRDEVFFNQTVENVQRIIIEGIRPR
>P0A809 ~~~ruvA~~~Holliday junction branch migration complex subunit RuvA~~~COG0632
MIGRLRGIIIEKQPPLVLIEVGGVGYEVHMPMTCFYELPEAGQEAIVFTHFVVREDAQLLYGFNNKQERTLFKELIKTNG
VGPKLALAILSGMSAQQFVNAVEREEVGALVKLPGIGKKTAERLIVEMKDRFKGLHGDLFTPAADLVLTSPASPATDDAE
QEAVAALVALGYKPQEASRMVSKIARPDASSETLIREALRAAL
>P40832 ~~~ruvA~~~Holliday junction branch migration complex subunit RuvA~~~COG0632
MIFSVRGEVLEVALDHAVIEAAGIGYRVNATPSALATLRQGSQARLVTAMVVREDSMTLYGFSDAENRDLFLALLSVSGV
GPRLAMATLAVHDAAALRQALADSDVASLTRVPGIGKRGAERIVLELRDKVGPVGASGLTVGTAADGNAVRGSVVEALVG
LGFAAKQAEEATDQVLDGELGKDGAVATSSALRAALSLLGKTR
>A0QWH5 ~~~ruvA~~~Holliday junction branch migration complex subunit RuvA~~~COG0632
MIASVRGEVIDIALDHAVIEAAGVGYKVMATPSTLATLRRGTEARLITAMIVREDSMTLYGFVDGDARDLFLTLLGVSGV
GPKIALATLAVYDPQALRQALADGDVTALTRVPGIGKRGAERMVLELRDKIGPVSAGGGAAVGGHAIRGPVVEALVGLGF
AAKQAEEATDKVLANDPEATTSSALRAALSMLGKK
>P9WGW3 ~~~ruvA~~~Holliday junction branch migration complex subunit RuvA~~~COG0632
MIASVRGEVLEVALDHVVIEAAGVGYRVNATPATLATLRQGTEARLITAMIVREDSMTLYGFPDGETRDLFLTLLSVSGV
GPRLAMAALAVHDAPALRQVLADGNVAALTRVPGIGKRGAERMVLELRDKVGVAATGGALSTNGHAVRSPVVEALVGLGF
AAKQAEEATDTVLAANHDATTSSALRSALSLLGKAR
>Q51425 ~~~ruvA~~~Holliday junction branch migration complex subunit RuvA~~~
MIGRLRGTLAEKQPPHLILDVNGVGYEVEVPMTTLYRLPSVGEPVTLHTHLVVREDAHLLYGFAEKRERELFRELIRLNG
VGPKLALALMSGLEVDELVRCVQAQDTSTLVKIPGVGKKTAERLLVELKDRFKAWENMPTIAPLVMEPRASATVSSAEAD
AVSALIALGFKPQEASRAVAAVPGEDLSSEEMIRQALKGMV
>P66746 ~~~ruvA~~~Holliday junction branch migration complex subunit RuvA~~~
MIGRLRGIILEKQPPIVLLETGGVGYEVHMPMTCFYELPEAGQEAIVFTHFVVREDAQLLYGFNNKQERTLFKELIKTNG
VGPKLALAILSGMSAQQFVNAVEREELGALVKLPGIGKKTAERLIVEMKDRFKGLHGDLFTPAVDLVLTSPASPTSEDAE
QEAVAALVALGYKPQEASRMVSKIARPDASSETLIRDALRAAL
>Q9F1Q3 ~~~ruvA~~~Holliday junction branch migration complex subunit RuvA~~~COG0632
MIRYLRGLVLKKEAGGFVLLAGGVGFFLQAPTPFLQALEEGKEVGVHTHLLLKEEGLSLYGFPDEENLALFELLLSVSGV
GPKVALALLSALPPRLLARALLEGDARLLTSASGVGRRLAERIALELKGKVPPHLLAGEKVESEAAEEAVMALAALGFKE
AQARAVVLDLLAQNPKARAQDLIKEALKRLR
>O32055 3.6.4.12~~~ruvB~~~Holliday junction branch migration complex subunit RuvB~~~COG2255
MDERLVSSEADNHESVIEQSLRPQNLAQYIGQHKVKENLRVFIDAAKMRQETLDHVLLYGPPGLGKTTLASIVANEMGVE
LRTTSGPAIERPGDLAAILTALEPGDVLFIDEIHRLHRSIEEVLYPAMEDFCLDIVIGKGPSARSVRLDLPPFTLVGATT
RVGLLTAPLRDRFGVMSRLEYYTQEELADIVTRTADVFEVEIDKPSALEIARRSRGTPRVANRLLRRVRDFAQVLGDSRI
TEDISQNALERLQVDRLGLDHIDHKLLMGMIEKFNGGPVGLDTISATIGEESHTIEDVYEPYLLQIGFIQRTPRGRIVTP
AVYHHFQMEAPRYD
>Q9PMT7 3.6.4.12~~~ruvB~~~Holliday junction branch migration complex subunit RuvB~~~COG2255
MDRIVEIEKYSFDETYETSLRPSNFDGYIGQESIKKNLNVFIAAAKKRNECLDHILFSGPAGLGKTTLANIISYEMSANI
KTTAAPMIEKSGDLAAILTNLSEGDILFIDEIHRLSPAIEEVLYPAMEDYRLDIIIGSGPAAQTIKIDLPKFTLIGATTR
AGMLSNPLRDRFGMQFRLEFYKDSELALILQKAALKLNKTCEEKAALEIAKRSRSTPRIALRLLKRVRDFADVNDEEIIT
EKRANEALNSLGVNELGFDAMDLRYLELLTAAKQKPIGLASIAAALSEDENTIEDVIEPYLLANGYIERTAKGRIASAKS
YSALKLNYEKTLFEE
>P0A812 3.6.4.12~~~ruvB~~~Holliday junction branch migration complex subunit RuvB~~~COG2255
MIEADRLISAGTTLPEDVADRAIRPKLLEEYVGQPQVRSQMEIFIKAAKLRGDALDHLLIFGPPGLGKTTLANIVANEMG
VNLRTTSGPVLEKAGDLAAMLTNLEPHDVLFIDEIHRLSPVVEEVLYPAMEDYQLDIMIGEGPAARSIKIDLPPFTLIGA
TTRAGSLTSPLRDRFGIVQRLEFYQVPDLQYIVSRSARFMGLEMSDDGALEVARRARGTPRIANRLLRRVRDFAEVKHDG
TISADIAAQALDMLNVDAEGFDYMDRKLLLAVIDKFFGGPVGLDNLAAAIGEERETIEDVLEPYLIQQGFLQRTPRGRMA
TTRAWNHFGITPPEMP
>A0QWH6 3.6.4.12~~~ruvB~~~Holliday junction branch migration complex subunit RuvB~~~COG2255
MGRFEDDAEVEDREVSPALTVGEGDIDASLRPRSLGEFIGQPRVREQLQLVLEGAKKRGGTPDHILLSGPPGLGKTSLAM
IIAAELGSSLRVTSGPALERAGDLAAMLSNLVEHDVLFIDEIHRIARPAEEMLYLAMEDFRVDVVVGKGPGATSIPLEVA
PFTLVGATTRSGALTGPLRDRFGFTAHMDFYEPVELERVLARSAGILGIELGAEAGAEIARRSRGTPRIANRLLRRVRDF
AEVRADGIITRDIAKSALEVYDVDELGLDRLDRAVLSALTRSFNGGPVGVSTLAVAVGEEATTVEEVCEPFLVRAGMIAR
TPRGRVATPLAWTHLGLQPPVTGIGQAGLFD
>P9WGW1 3.6.4.12~~~ruvB~~~Holliday junction branch migration complex subunit RuvB~~~COG2255
MTERSDRDVSPALTVGEGDIDVSLRPRSLREFIGQPRVREQLQLVIEGAKNRGGTPDHILLSGPPGLGKTSLAMIIAAEL
GSSLRVTSGPALERAGDLAAMLSNLVEHDVLFIDEIHRIARPAEEMLYLAMEDFRVDVVVGKGPGATSIPLEVAPFTLVG
ATTRSGALTGPLRDRFGFTAHMDFYEPAELERVLARSAGILGIELGADAGAEIARRSRGTPRIANRLLRRVRDFAEVRAD
GVITRDVAKAALEVYDVDELGLDRLDRAVLSALTRSFGGGPVGVSTLAVAVGEEAATVEEVCEPFLVRAGMVARTPRGRV
ATALAWTHLGMTPPVGASQPGLFE
>Q51426 3.6.4.12~~~ruvB~~~Holliday junction branch migration complex subunit RuvB~~~
MIEPDRLISAVSGRERDEQLDRAIRPLKLADYIGQPSVREQMELFIHAARGRQEALDHTLIFGPPGLGKTTLANIIAQEM
GVSIKSTSGPVLERPGDLAALLTNLEAGDVLFVDEIHRLSPIVEEVLYPAMEDFQLDIMIGEGPAARSIKLDLPPFTLVG
ATTRAGMLTNPLRDRFGIVQRLEFYNVEDLATIVSRSAGILGLEIEPQGAAEIAKRARGTPRIANRLLRRVRDFAEVRGQ
GDITRVIADKALNLLDVDERGFDHLDRRLLLTMIDKFDGGPVGIDNLAAALSEERHTIEDVLEPYLIQQGYIMRTPRGRV
VTRHAYLHFGLNIPKRLGPGVTTDLFTSEDGN
>P66758 3.6.4.12~~~ruvB~~~Holliday junction branch migration complex subunit RuvB~~~
MNERMVDQSMHSEETDFELSLRPTRLRQYIGQNSIKSNLEVFIKAAKLRHEPLDHVLLFGPPGLGKTTLSNIIANEMEVN
IRTVSGPSLERPGDLAAILSGLQPGDVLFIDEIHRLSSVVEEVLYPAMEDFFLDIIIGKGDEARSIRIDLPPFTLVGATT
RAGSLTGPLRDRFGVHLRLEYYNESDLKEIIIRTAEVLGTGIDEESAIELAKRSRGTPRVANRLLKRVRDFQQVNEDEQI
YIETTKHALGLLQVDQHGLDYIDHKMMNCIIKQYNGGPVGLDTIAVTIGEERITIEDVYEPFLIQKGFLERTPRGRKATP
LAYEHFAKSNEERG
>Q97SR6 3.6.4.12~~~ruvB~~~Holliday junction branch migration complex subunit RuvB~~~COG2255
MSRILDNEMMGDEELVERTLRPQYLREYIGQDKVKDQLQIFIEAAKMRDEALDHVLLFGPPGLGKTTMAFVIANELGVNL
KQTSGPVIEKAGDLVAILNDLEPGDVLFIDEIHRLPMSVEEVLYSAMEDFYIDIMIGAGEGSRSVHLELPPFTLIGATTR
AGMLSNPLRARFGITGHMEYYAHADLTEIVERTADIFEMEITHEAASELALRSRGTPRIANRLLKRVRDFAQIMGNGVID
DVITDKALTMLDVDHEGLDYVDQKILRTMIEMYSGGPVGLGTLSVNIAEERETVEDMYEPYLIQKGFIMRTRSGRVATAK
AYEHLGYEYSEK
>Q5M2B1 3.6.4.12~~~ruvB~~~Holliday junction branch migration complex subunit RuvB~~~COG2255
MARILDNDLLGDEEYVERTLRPQYFKEYIGQDKVKDQLKIFIEAAKLRDEALDHTLLFGPPGLGKTTMAFVIANEMGVNL
KQTSGPAIEKAGDLVAILNDLEPGDILFIDEIHRMPMAVEEVLYSAMEDYYIDIMIGAGETSRSVHLDLPPFTLVGATTR
AGMLSNPLRARFGINGHMEYYELPDLTEIVERTSEIFEMTITPEAALELARRSRGTPRIANRLLKRVRDYAQIMGDGVID
DKIADQALTMLDVDHEGLDYVDQKILRTMIEMYGGGPVGLGTLSVNIAEERETVEDMYEPYLIQKGFIMRTRTGRVATAK
AYEHMGYDYTRDN
>Q57396 3.6.4.12~~~ruvB~~~Holliday junction branch migration complex subunit RuvB~~~COG2255
MAIKRSGNNNLSPNVKSDLLSPEVIPQERSTSPELEQQEASLRPQRLADYIGQRDLKEVLRIAIQAAQGRQEAIDHLLLY
GPPGLGKTTLALILAEEMQVRCKITAAPALERPRDITGLLLALQPGDILFIDEIHRLNRLTEELLYPAMEDFRLDITMGK
GQSAKVRSLKLAHFTLVGATTKVGSLTSPLRDRFGLIQRLRFYEVDELQQIILRTAGILSVSISPTGAEAIAMRARGTPR
IANRLLKRVRDYAQVKQQPEIDPALASEALDLYQVDKRGLDWTDRLVLQTLIEQFQGGPTGLEAIAAATGEDAKTIEEVY
EPYLLQIGYLARTSRGRIATTAAYEHLGLTPPTPLLPWKES
>Q56313 3.6.4.12~~~ruvB~~~Holliday junction branch migration complex subunit RuvB~~~COG2255
MSEFLTPERTVYDSGVQFLRPKSLDEFIGQENVKKKLSLALEAAKMRGEVLDHVLLAGPPGLGKTTLAHIIASELQTNIH
VTSGPVLVKQGDMAAILTSLERGDVLFIDEIHRLNKAVEELLYSAIEDFQIDIMIGKGPSAKSIRIDIQPFTLVGATTRS
GLLSSPLRSRFGIILELDFYTVKELKEIIKRAASLMDVEIEDAAAEMIAKRSRGTPRIAIRLTKRVRDMLTVVKADRINT
DIVLKTMEVLNIDDEGLDEFDRKILKTIIEIYRGGPVGLNALAASLGVEADTLSEVYEPYLLQAGFLARTPRGRIVTEKA
YKHLKYEVPENRLF
>Q5SL87 3.6.4.12~~~ruvB~~~Holliday junction branch migration complex subunit RuvB~~~COG2255
MEDLALRPKTLDEYIGQERLKQKLRVYLEAAKARKEPLEHLLLFGPPGLGKTTLAHVIAHELGVNLRVTSGPAIEKPGDL
AAILANSLEEGDILFIDEIHRLSRQAEEHLYPAMEDFVMDIVIGQGPAARTIRLELPRFTLIGATTRPGLITAPLLSRFG
IVEHLEYYTPEELAQGVMRDARLLGVRITEEAALEIGRRSRGTMRVAKRLFRRVRDFAQVAGEEVITRERALEALAALGL
DELGLEKRDREILEVLILRFGGGPVGLATLATALSEDPGTLEEVHEPYLIRQGLLKRTPRGRVATELAYRHLGYPPPVGP
LLEP
>Q56214 3.6.4.12~~~ruvB~~~Holliday junction branch migration complex subunit RuvB~~~
MEDLALRPKTLDEYIGQERLKQKLRVYLEAAKARKEPLEHLLLFGPPGLGKTTLAHVIAHELGVNLRVTSGPAIEKPGDL
AAILANSLEEGDILFIDEIHRLSRQAEEHLYPAMEDFVMDIVIGQGPAARTIRLELPRFALIGATTRPGLITAPLLSRFG
IVEHLEYYTPEELAQGVMRDARLLGVRITEEAALEIGRRSRGTMRVAKRLFRRVRDFAQVEGEEVITRERALEALAALGL
DELGLEKRDREILEVLILRFGAGPVGLATLATALSEDPGTLEEVHEPYLIRQGLLKRTPRGRVATELAYRHLGYPPPVGP
LLEP
>Q9RX75 3.1.21.10~~~ruvC~~~Crossover junction endodeoxyribonuclease RuvC~~~COG0817
MRVLGIDPGLANLGLGLVEGDVRRAKHLYHVCLTTESAWLMPRRLQYLHEELTRLLTEYRPDAVAIEDQILRRQADVAFK
VGQAFGVVQLACAQAGVPIHAYGPMQVKKSLVGTGRADKEQVIYMVKASLGIRELFNNHAADALALALTHLAHAPMQERS
ERLAAAGRAARTGDAPLRR
>P0A814 3.1.21.10~~~ruvC~~~Crossover junction endodeoxyribonuclease RuvC~~~COG0817
MAIILGIDPGSRVTGYGVIRQVGRQLSYLGSGCIRTKVDDLPSRLKLIYAGVTEIITQFQPDYFAIEQVFMAKNADSALK
LGQARGVAIVAAVNQELPVFEYAARQVKQTVVGIGSAEKSQVQHMVRTLLKLPANPQADAADALAIAITHCHVSQNAMQM
SESRLNLARGRLR
>B5Z7N7 3.1.21.10~~~ruvC~~~Crossover junction endodeoxyribonuclease RuvC~~~
MRILGIDPGSRKCGYAIISHASNKLSLITAGFINITTTRLQEQILDLIEALDCLLDRYEVNEVAIEDIFFAYNPKSVIKL
AQFRGALSLKILERIGNFSEYTPLQVKKALTGNGKAAKEQVAFMVKRLLNITSEIKPLDISDAIAVAITHAQRLKLH
>O25544 3.1.21.10~~~ruvC~~~Crossover junction endodeoxyribonuclease RuvC~~~COG0817
MRILGIDPGSRKCGYAIISHASNKLSLITAGFINITTTRLQEQILDLIEALDCLLDRYEVNEVAIEDIFFGYNPKSVIKL
AQFRGALSLKILERIGNFSEYTPLQVKKALTGNGKAAKEQVAFMVKRLLNITSEIKPLDISDAIAVAITHAQRLKLH
>Q51424 3.1.21.10~~~ruvC~~~Crossover junction endodeoxyribonuclease RuvC~~~
MTLILGIDPGSRITGFGVVRETARGCEYVASGCIRTGNGPLHERLHVVFRSVREVIRTHGPTALSIEQVFMARNADSALK
LGQARGAAIVAAMEEGLSVAEYTASQVKQAVVGTGGADKQQVQMMVMHLLKLTQKPQIDASDALAIALCHAHTQQSLVPH
GLVGARRRGGRLRL
>Q5SJC4 3.1.21.10~~~ruvC~~~Crossover junction endodeoxyribonuclease RuvC~~~COG0817
MVVAGIDPGITHLGLGVVAVEGKGALKARLLHGEVVKTSPQEPAKERVGRIHARVLEVLHRFRPEAVAVEEQFFYRQNEL
AYKVGWALGAVLVAAFEAGVPVYAYGPMQVKQALAGHGHAAKEEVALMVRGILGLKEAPRPSHLADALAIALTHAFYARM
GTAKPL
>I6X8R5 ~~~~~~Heme-binding protein Rv0203~~~
MKTGTATTRRRLLAVLIALALPGAAVALLAEPSATGASDPCAASEVARTVGSVAKSMGDYLDSHPETNQVMTAVLQQQVG
PGSVASLKAHFEANPKVASDLHALSQPLTDLSTRCSLPISGLQAIGLMQAVQGARR
>P46350 ~~~rulS~~~RNA-binding protein YbxF~~~COG1358
MSYDKVSQAKSIIIGTKQTVKALKRGSVKEVVVAKDADPILTSSVVSLAEDQGISVSMVESMKKLGKACGIEVGAAAVAI
IL
>P0DV65 4.1.1.-~~~~~~L-serine phosphate decarboxylase~~~
MVDTMNARNTQFTKAFHALKQNAGSHSPSMEDLKKMFPTLEIKIDACYLSNPYASELVLDYIDRELIQTNAYKKVLTHYP
SQQRSLQKVMAESLHVKPENIFIGNGATEIIQMLLQQEEVQKVALMIPTFSSYYEFVGKGCEVVYFPLNERDDYSFDADK
YCQFIENEQPDTVVLINPNNPNGAYLSLEKMHILLKRLAFVPRIIIDESFIHFAYEDEALTCLSSTVLFDMYPNVIIVKS
LSKDFGIAGVRLGYALMDSRKIDALLEHGFLWNINGIGEYCLRLFVREDFLKRYEEARKQYIKEMCRFKEALLGIENVYV
YPSMANFVMLKLPSRIKASFVISALLVEYGIYVRTMADKIGVEGECIRIAGRTREENNCIVMALKSILKDSK
>P0AG05 2.7.7.47~~~aadA~~~Aminoglycoside (3'') (9) adenylyltransferase~~~
MREAVIAEVSTQLSEVVGVIERHLEPTLLAVHLYGSAVDGGLKPHSDIDLLVTVTVRLDETTRRALINDLLETSASPGES
EILRAVEVTIVVHDDIIPWRYPAKRELQFGEWQRNDILAGIFEPATIDIDLAILLTKAREHSVALVGPAAEELFDPVPEQ
DLFEALNETLTLWNSPPDWAGDERNVVLTLSRIWYSAVTGKIAPKDVAADWAMERLPAQYQPVILEARQAYLGQEEDRLA
SRADQLEEFVHYVKGEITKVVGK
>P14511 2.7.7.47~~~~~~Aminoglycoside (3'') (9) adenylyltransferase~~~
MSNVRHHEGSVTIEISNQLSEVLSVIERHSGINVAGRAFVRSAVDGGLKPYSDIDLLVTVAVKLDETTRRALLNDLMEAS
AFPGESETLRAIEVTLVVHDDIIPWRYPAKRELQFGEWQRNDILAGIFEPAMIDIDLAILLTKAREHSVALVGPAAEEFF
DPVPEQDLFEALRETLKLWNSQPDWAGDERNVVLTLSRIWYSAITGKIAPKDVAADWAIKRLPAQYQPVLLEAKQAYLGQ
KEDHLASRADHLEEFIRFVKGEIIKSVGK
>Q8ZPX9 2.7.7.47~~~aadA~~~Aminoglycoside (3'') (9) adenylyltransferase~~~
MTLSIPPSIQCQTEAACRLITRVTGDTLRAIHLYGSAVAGGLKPNSDIDLLVTICQPLTEAQRATLMQELLALSSPPGAS
AEKRALEVTVVLYSQLVPWCFPPSREMQFGEWLREDICQGIYEPAQQDWDMVLLITQILETSIPLKGERAERLFTPAPAA
QLLKALRYPLDLWQSTADVQGDEYHIVLTLARIWYTLSTGRFTSKDAAADWLLPQLPEDYAATLRAAQREYLGLEQQDWH
ILLPAVVRFVDFAKAHIPTQFT
>P0AG06 2.7.7.47~~~aadA~~~Aminoglycoside (3'') (9) adenylyltransferase~~~
MREAVIAEVSTQLSEVVGVIERHLEPTLLAVHLYGSAVDGGLKPHSDIDLLVTVTVRLDETTRRALINDLLETSASPGES
EILRAVEVTIVVHDDIIPWRYPAKRELQFGEWQRNDILAGIFEPATIDIDLAILLTKAREHSVALVGPAAEELFDPVPEQ
DLFEALNETLTLWNSPPDWAGDERNVVLTLSRIWYSAVTGKIAPKDVAADWAMERLPAQYQPVILEARQAYLGQEDRLAS
RADQLEEFVHYVKGEITKVVGK
>Q4VR99 ~~~aad9~~~Spectinomycin 9-adenylyltransferase~~~
MNINEFPQQVNQVISIAETILQGQILGIYLYGSATMNGLRPDSDIDILIITKQELSNSIRADLTKQLLKISGSVGCIEKR
PLEVTIINQSDIVPLQFPPKCQYMYGEWLRGEMEAGEYPQACNDPDIMILLWQARKNSITLKGAESKELIPAIPFHEIKK
AIRFSLPGLISSFKGDERNVLLTLSRMWFTLVTEEITTKDVAAKWVILKLPERFPPLLTTAKEAYLGNLSDEWETVEKEA
MALVEYMKKQIEELLRTE
>Q07448 ~~~spc~~~Spectinomycin 9-adenylyltransferase~~~
MRRIYLNTYEQINKVKKILRKHLKNNLIGTYMFGSGVESGLKPNSDLDFLVVVSEPLTDQSKEILIQKIRPISKKIGDKS
NLRYIELTIIIQQEMVPWNHPPKQEFIYGEWLQELYEQGYIPQKELNSDLTIMLYQAKRKNKRIYGNYDLEELLPDIPFS
DVRRAIMDSSEELIDNYQDDETNSILTLCRMILTMDTGKIIPKDIAGNAVAESSPLEHRERILLAVRSYLGENIEWTNEN
VNLTINYLNNRLKKL
>P0A0D1 ~~~ant1~~~Spectinomycin 9-adenylyltransferase~~~
MSNLINGKIPNQAIQTLKIVKDLFGSSIVGVYLFGSAVNGGLRINSDVDVLVVVNHSLPQLTRKKLTERLMTISGKIGNT
DSVRPLEVTVINRSEVVPWQYPPKREFIYGEWLRGEFENGQIQEPSYDPDLAIVLAQARKNSISLFGPDSSSILVSVPLT
DIRRAIKDSLPELIEGIKGDERNVILTLARMWQTVTTGEITSKDVAAEWAIPLLPKEHVTLLDIARKGYRGECDDKWEGL
YSKVKALVKYMKNSIETSLN
>P0A0D2 ~~~ant~~~Spectinomycin 9-adenylyltransferase~~~
MSNLINGKIPNQAIQTLKIVKDLFGSSIVGVYLFGSAVNGGLRINSDVDVLVVVNHSLPQLTRKKLTERLMTISGKIGNT
DSVRPLEVTVINRSEVVPWQYPPKREFIYGEWLRGEFENGQIQEPSYDPDLAIVLAQARKNSISLFGPDSSSILVSVPLT
DIRRAIKDSLPELIEGIKGDERNVILTLARMWQTVTTGEITSKDVAAEWAIPLLPKEHVTLLDIARKGYRGECDDKWEGL
YSKVKALVKYMKNSIETSLN
>A0A0U1RGY0 5.1.3.14~~~sacA~~~UDP-N-acetylglucosamine 2-epimerase~~~
MKVLTVFGTRPEAIKMAPVILELQKHNTITSKVCITAQHREMLDQVLSLFEIKADYDLNIMKPNQSLQEITTNIISSLTD
VLEDFKPDCVLVHGDTTTTFAASLAAFYQKIPVGHIEAGLRTYNLYSPWPEEANRRLTSVLSQWHFAPTEDSKNNLLSES
IPSDKVIVTGNTVIDALMVSLEKLKITTIKKQMEQAFPFIQDNSKVILITAHRRENHGEGIKNIGLSILELAKKYPTFSF
VIPLHLNPNVRKPIQDLLSSVHNVHLIEPQEYLPFVYLMSKSHIILSDSGGIQEEAPSLGKPVLVLRDTTERPEAVAAGT
VKLVGSETQNIIESFTQLIEYPEYYEKMANIENPYGIGNASKIIVETLLKNR
>O68215 2.7.-.-~~~sacB~~~Capsular polysaccharide phosphotransferase SacB~~~
MFILNNRKWRKLKRDPSAFFRDSKFNFLRYFSAKKFAKNFKNSSHIHKTNISKAQSNISSTLKENRKQDMLIPINFFNFE
YIVKKLNNQNAIGVYILPSNLTLKPALCILESHKEDFLNKFLLTISSENLKLQYKFNGQIKNPKSVNEIWTDLFSIAHVD
MKLSTDRTLSSSISQFWFRLEFCKEDKDFILFSTANRYSRKLWKHSIKNNQLFKEGIRNYSEISSLPYEEDHNFDIDLVF
TWVNSEDKNWQELYKKYKPDFNSDATSTSRFLSRDELKFALRSWEMSGSFIRKIFIVSNCAPPAWLDLNNPKIQWVYHEE
IMPQSALPTFSSHAIETSLHHIPGISNYFIYSNDDFLLTKPLNKDNFFYSNGIAKLRLEAWGNVNGECTEGEPDYLNGAR
NANTLLEKEFKKFTTKLHTHSPQSMRTDILFEMEKKYPEEFNRTLHNKFRSLDDIAVTGYLYHHYALLSGRALQSSDKTE
LVQQNHDFKKKLNNVVTLTKERNFDKLPLSVCINDGADSHLNEEWNVQVIKFLETLFPLPSSFEK
>P21130 2.4.1.10~~~sacB~~~Levansucrase~~~COG1621
MNIKKIVKQATVLTFTTALLAGGATQAFAKENNQKAYKETYGVSHITRHDMLQIPKQQQNEKYQVPQFDQSTIKNIESAK
GLDVWDSWPLQNADGTVAEYNGYHVVFALAGSPKDADDTSIYMFYQKVGDNSIDSWKNAGRVFKDSDKFDANDPILKDQT
QEWSGSATFTSDGKIRLFYTDYSGKHYGKQSLTTAQVNVSKSDDTLKINGVEDHKTIFDGDGKTYQNVQQFIDEGNYTSG
DNHTLRDPHYVEDKGHKYLVFEANTGTENGYQGEESLFNKAYYGGGTNFFRKESQKLQQSAKKRDAELANGALGIIELNN
DYTLKKVMKPLITSNTVTDEIERANVFKMNGKWYLFTDSRGSKMTIDGINSNDIYMLGYVSNSLTGPYKPLNKTGLVLQM
GLDPNDVTFTYSHFAVPQAKGNNVVITSYMTNRGFFEDKKATFGPSFLMNIKGNKTSVVKNSILEQGQLTVN
>P05655 2.4.1.10~~~sacB~~~Levansucrase~~~COG1621
MNIKKFAKQATVLTFTTALLAGGATQAFAKETNQKPYKETYGISHITRHDMLQIPEQQKNEKYQVPEFDSSTIKNISSAK
GLDVWDSWPLQNADGTVANYHGYHIVFALAGDPKNADDTSIYMFYQKVGETSIDSWKNAGRVFKDSDKFDANDSILKDQT
QEWSGSATFTSDGKIRLFYTDFSGKHYGKQTLTTAQVNVSASDSSLNINGVEDYKSIFDGDGKTYQNVQQFIDEGNYSSG
DNHTLRDPHYVEDKGHKYLVFEANTGTEDGYQGEESLFNKAYYGKSTSFFRQESQKLLQSDKKRTAELANGALGMIELND
DYTLKKVMKPLIASNTVTDEIERANVFKMNGKWYLFTDSRGSKMTIDGITSNDIYMLGYVSNSLTGPYKPLNKTGLVLKM
DLDPNDVTFTYSHFAVPQAKGNNVVITSYMTNRGFYADKQSTFAPSFLLNIKGKKTSVVKDSILEQGQLTVNK
>P94468 ~~~sacB~~~Inactive levansucrase~~~
MNIKKFAKQATVLTFTTALLAGGATQAFAKETNQKPYKETYGISHITRHDMLQIPEQQKNEKYQVPEFDSSTIKNISSAK
GLDVWDSWPLQNADGTVANYHGYHIVFALAGDPKNADDTSIYMFYQKVGETSIDSWKTPGRVFKDSDKFDANDSILKDQT
QEWSGSATFTSDGKIRLFYTDFSGKHYGKQTLTTAQVNVSASDSSLNINGVEDYKSIFDGDSKTYQNVQQFIDEGNYSSG
DNHTLRDPHYVEDKGHKYLVFEANTGTEDGYQGEESLFNKAYYGKSTSFFRQESQKLLQSDKNRTAELANGALGMIELND
DYTLKKVMKPLIASNTVTDEIERANVFKMNGKWYLSTDSRGSQMTIDGITSNDIYMLGYVSNSLTGPYKPLNKTGLVLKM
DLDPNDVTFTYSHFAVPQATGNNVVITSYMTNRGFYADKQSTFAPSFLLNIQGKKTSVVKASILDQGQLTVNQ
>Q43998 2.4.1.10~~~lsdA~~~Levansucrase~~~
MAHVRRKVATLNMALAGSLLMVLGAQSALAQGNFSRQEAARMAHRPGVMPRGGPLFPGRSLAGVPGFPLPSIHTQQAYDP
QSDFTARWTRADALQIKAHSDATVAAGQNSLPAQLTMPNIPADFPVINPDVWVWDTWTLIDKHADQFSYNGWEVIFCLTA
DPNAGYGFDDRHVHARIGFFYRRAGIPASRRPVNGGWTYGGHLFPDGASAQVYAGQTYTNQAEWSGSSRLMQIHGNTVSV
FYTDVAFNRDANANNITPPQAIITQTLGRIHADFNHVWFTGFTAHTPLLQPDGVLYQNGAQNEFFNFRDPFTFEDPKHPG
VNYMVFEGNTAGQRGVANCTEADLGFRPNDPNAETLQEVLDSGAYYQKANIGLAIATDSTLSKWKFLSPLISANCVNDQT
ERPQVYLHNGKYYIFTISHRTTFAAGVDGPDGVYGFVGDGIRSDFQPMNYGSGLTMGNPTDLNTAAGTDFDPSPDQNPRA
FQSYSHYVMPGGLVESFIDTVENRRGGTLAPTVRVRIAQNASAVDLRYGNGGLGGYGDIPANRADVNIAGFIQDLFGQPT
SGLAAQASTNNAQVLAQVRQFLNQ
>P0DJA3 2.4.1.10~~~sacB~~~Levansucrase~~~COG1621
MLNKAGIAEPSLWTRADAMKVHTDDPTATMPTIDYDFPVMTDKYWVWDTWPLRDINGQVVSFQGWSVIFALVADRTKYGW
HNRNDGARIGYFYSRGGSNWIFGGHLLKDGANPRSWEWSGCTIMAPGTANSVEVFFTSVNDTPSESVPAQCKGYIYADDK
SVWFDGFDKVTDLFQADGLYYADYAENNFWDFRDPHVFINPEDGKTYALFEGNVAMERGTVAVGEEEIGPVPPKTETPDG
ARYCAAAIGIAQALNEARTEWKLLPPLVTAFGVNDQTERPHVVFQNGLTYLFTISHHSTYADGLSGPDGVYGFVSENGIF
GPYEPLNGSGLVLGNPSSQPYQAYSHYVMTNGLVTSFIDTIPSSDPNVYRYGGTLAPTIKLELVGHRSFVTEVKGYGYIP
PQIEWLAEDESSNSAAALSLLNK
>P05656 3.2.1.80~~~sacC~~~Levanase~~~COG1621
MKKRLIQVMIMFTLLLTMAFSADAADSSYYDEDYRPQYHFTPEANWMNDPNGMVYYAGEYHLFYQYHPYGLQWGPMHWGH
AVSKDLVTWEHLPVALYPDEKGTIFSGSAVVDKNNTSGFQTGKEKPLVAIYTQDREGHQVQSIAYSNDKGRTWTKYAGNP
VIPNPGKKDFRDPKVFWYEKEKKWVMVLAAGDRILIYTSKNLKQWTYASEFGQDQGSHGGVWECPDLFELPVDGNPNQKK
WVMQVSVGNGAVSGGSGMQYFVGDFDGTHFKNENPPNKVLWTDYGRDFYAAVSWSDIPSTDSRRLWLGWMSNWQYANDVP
TSPWRSATSIPRELKLKAFTEGVRVVQTPVKELETIRGTSKKWKNLTISPASHNVLAGQSGDAYEINAEFKVSPGSAAEF
GFKVRTGENQFTKVGYDRRNAKLFVDRSESGNDTFNPAFNTGKETAPLKPVNGKVKLRIFVDRSSVEVFGNDGKQVITDI
ILPDRSSKGLELYAANGGVKVKSLTIHPLKKVWGTTPFMSNMTGWTTVNGTWADTIEGKQGRSDGDSFILSSASGSDFTY
ESDITIKDGNGRGAGALMFRSDKDAKNGYLANVDAKHDLVKFFKFENGAASVIAEYKTPIDVNKKYHLKTEAEGDRFKIY
LDDRLVIDAHDSVFSEGQFGLNVWDATAVFQNVTKES
>P15400 ~~~sacX~~~Negative regulator of SacY activity~~~COG1263
MHKEIAKELLLLAGGKNNIISISHCTTRLRFDVKDETKIDIHAIENLQGVQGTFFRYGLFQIIFGAGVVNKIYKEVVHVW
ETAPSEEPVHQKKASRKLNPAAAFAKTLSDIFVPIIPAITASGLLMGLIGMIKVFHWFAAGSPWIKMLDLVSSTAFILLP
ILVGFSAARQFGSNPYLGAVIAGLLTHPDLLDPSMLGSKTPSSLDIWGLHIPMMGYQGSMIPILLSVFVMSKIEKLLKSI
VPKSLDVVIIPFITVMVTGCLALIVMNPAASIIGQIMTQSIVYIYDHAGIAAGALFGGIYSTIVLSGLHHSFYAIEATLL
ANPHVGVNFLVPIWSMANVAQGGAGLAVFLKTKQSSLKKIALPASLTAFLGIVEPIVFGVNLKLIRPFIGAAIGGAIGGA
YVVAVQVVANSYGLTGIPMISIVLPFGAANFVHYMIGFLIAAVSAFIATLFLGFKEETE
>P15401 ~~~sacY~~~Levansucrase and sucrase synthesis operon antiterminator~~~COG3711
MKIKRILNHNAIVVKDQNEEKILLGAGIAFNKKKNDIVDPSKIEKTFIRKDTPDYKQFEEILETLPEDHIQISEQIISHA
EKELNIKINERIHVAFSDHLSFAIERLSNGMVIKNPLLNEIKVLYPKEFQIGLWARALIKDKLGIHIPDDEIGNIAMHIH
TARNNAGDMTQTLDITTMIRDIIEIIEIQLSINIVEDTISYERLVTHLRFAIQHIKAGESIYELDAEMIDIIKEKFKDAF
LCALSIGTFVKKEYGFEFPEKELCYIAMHIQRFYQRSVAR
>Q8ZL64 ~~~sadA~~~Autotransporter adhesin SadA~~~
MNRIFKVLWNAATGTFVVTSETAKSRGKKNGRRKLAVSALIGLSSIMVSADALANAGNDTGDGVTPTGTQTGGKGWIAIG
TDATANTYTNVDGASAAMGYKASAMGKWSTAIGSYSQSTGDSSLALGVKSVSAGDRAIAMGASSSASGSYSMAMGVYANS
SGAKSVALGYKSVASGATSSALGYQATASGDDSAAFGNGAKAIGTNSVALGSGSVAQEDNSVAVGNSTTQRQITYVAKGD
INSTSTDAVTGAQIYSLSQSVADRLGGGASVNSDGTVNAPLYEVGTGIYNNVGSALSALNTSITNTEASVAGLAEDALLW
DESISAFSASHTGNASKITNLAAGTLAADSTDAVNGSQLFDTNEKVDKNTADIATNTGSINQNTADITANTDSINQNTTD
IAANTTSINQNTTDIATNTTNINSLSDSVTTLTDDALLWDAASGAFSAKHNGSDSKITNLAAGTLAADSTDAVNGSQLFD
TNEKVDQNTADITTNTNSINQNTTDIATNTTNINNLSDSITTLTDDALLWDAASGAFSANHNGSASKITNLAAGTLAADS
TDAVNGSQLFATNENVSQNTADITTNTNSINQNTTDIATNTTSINNLSDSITTLTDDALLWDAASGTFSASRSGSASKIT
NLAAGTLAADSTDAVNGSQLYETNQKVDQNTSAIADINTSITNLSSDNLSWNETTSSFSASHGSSTTNKITNVAAGELSE
ESTDAVNGSQLFETNEKVDQNTTDIAANTTNITQNSTAIENLNTSVSDINTSITGLTDNALLWDEDTGAFSANHGGSTSK
ITNVAAGALSEDSTDAVNGSQLYETNQKVDQNTSAIADINTSITNLGTDALSWDDEEGAFSASHGTSGTNKITNVAAGEI
ASDSTDAVNGSQLYETNMLISQYNESISQLAGDTSETYITENGTGVKYIRTNDNGLEGQDAYATGNGATAVGYDAVASGA
GSLALGQNSSSSIEGSIALGSGSTSNRAITTGIRETSATSDGVVIGYNTTDRELLGALSLGTDGESYRQITNVADGSEAQ
DAVTVRQLQNAIGAVTTTPTKYYHANSTEEDSLAVGTDSLAMGAKTIVNADAGIGIGLNTLVMADAINGIAIGSNARANH
ANSIAMGNGSQTTRGAQTDYTAYNMDTPQNSVGEFSVGSEDGQRQITNVAAGSADTDAVNVGQLKVTDAQVSRNTQSITN
LNTQVSNLDTRVTNIENGIGDIVTTGSTKYFKTNTDGADANAQGADSVAIGSGSIAAAENSVALGTNSVADEANTVSVGS
STQQRRITNVAAGVNNTDAVNVAQLKASEAGSVRYETNADGSVNYSVLNLGDGSGGTTRIGNVSAAVNDTDAVNYAQLKR
SVEEANTYTDQKMGEMNSKIKGVENKMSGGIASAMAMAGLPQAYAPGANMTSIAGGTFNGESAVAIGVSMVSESGGWVYK
LQGTSNSQGDYSAAIGAGFQW
>Q8ZL65 ~~~sadB~~~Inner membrane lipoprotein SadB~~~
MHKNGKFIPLLALGFTFFLSGCDYFADKHLVEEMKEQQKEQETKINLLEKQQKEQEAKINLLEKQQATIINTTKKVTEVV
GRVERKQRLFDYTELDPSQTHYFIINNGNIGLAGRILSIEPIDNGSVIHLDLVNLLSIPVSNLAFNMTWGTKKPSEAKDL
PRWKQLLLNTKMDSTIELLPGAWTNVTLTLKGVSPNNLKYLKIGIDMENVIFDSIQPINDTKKKPKK
>P9WGP9 1.-.-.-~~~sadH~~~Putative oxidoreductase SadH~~~COG4221
MSSFEGKVAVITGAGSGIGRALALNLSEKRAKLALSDVDTDGLAKTVRLAQALGAQVKSDRLDVAEREAVLAHADAVVAH
FGTVHQVYNNAGIAYNGNVDKSEFKDIERIIDVDFWGVVNGTKAFLPHVIASGDGHIVNISSLFGLIAVPGQSAYNAAKF
AVRGFTEALRQEMLVARHPVKVTCVHPGGIKTAVARNATVADGEDQQTFAEFFDRRLALHSPEMAAKTIVNGVAKGQARV
VVGLEAKAVDVLARIMGSSYQRLVAAGVAKFFPWAK
>P76149 1.2.1.16~~~sad~~~Succinate semialdehyde dehydrogenase [NAD(P)+] Sad~~~COG1012
MTITPATHAISINPATGEQLSVLPWAGADDIENALQLAAAGFRDWRETNIDYRAEKLRDIGKALRARSEEMAQMITREMG
KPINQARAEVAKSANLCDWYAEHGPAMLKAEPTLVENQQAVIEYRPLGTILAIMPWNFPLWQVMRGAVPIILAGNGYLLK
HAPNVMGCAQLIAQVFKDAGIPQGVYGWLNADNDGVSQMIKDSRIAAVTVTGSVRAGAAIGAQAGAALKKCVLELGGSDP
FIVLNDADLELAVKAAVAGRYQNTGQVCAAAKRFIIEEGIASAFTERFVAAAAALKMGDPRDEENALGPMARFDLRDELH
HQVEKTLAQGARLLLGGEKMAGAGNYYPPTVLANVTPEMTAFREEMFGPVAAITIAKDAEHALELANDSEFGLSATIFTT
DETQARQMAARLECGGVFINGYCASDARVAFGGVKKSGFGRELSHFGLHEFCNIQTVWKDRI
>Q2G2G2 ~~~saeR~~~Response regulator SaeR~~~COG0745
MTHLLIVDDEQDIVDICQTYFEYEGYKVTTTTSGKEAISLLSNDIDIMVLDIMMPEVNGYDIVKEMKRQKLDIPFIYLTA
KTQEHDTIYALTLGADDYVKKPFSPRELVLRINNLLTRMKKYHHQPVEQLSFDELTLINLSKVVTVNGHEVPMRIKEFEL
LWYLASRENEVISKSELLEKVWGYDYYEDANTVNVHIHRIREKLEKESFTTYTITTVWGLGYKFERSR
>Q5HHW4 ~~~saeR~~~Response regulator SaeR~~~
MTHLLIVDDEQDIVDICQTYFEYEGYKVTTTTSGKEAISLLSNDIDIMVLDIMMPEVNGYDIVKEMKRQKLDIPFIYLTA
KTQEHDTIYALTLGADDYVKKPFSPRELVLRINNLLTRMKKYHHQPVEQLSFDELTLINLSKVVTVNGHEVPMRIKEFEL
LWYLASRENEVISKSELLEKVWGYDYYEDANTVNVHIHRIREKLEKESFTTYTITTVWGLGYKFERSR
>Q840P8 ~~~saeR~~~Response regulator SaeR~~~
MTHLLIVDDEQDIVDICQTYFEYEGYKVTTTTSGKEAISLLSNDIDIMVLDIMMPEVNGYDIVKEMKRQKLDIPFIYLTA
KTQEHDTIYALTLGADDYVKKPFSPRELVLRINNLLTRMKKYHHQPVEQLSFDELTLINLSKVVTVNGHEVPMRIKEFEL
LWYLASRENEVISKSELLEKVWGYDYYEDANTVNVHIHRIREKLEKESFTTYTITTVWGLGYKFERSR
>Q7A6V3 ~~~saeR~~~Response regulator SaeR~~~
MTHLLIVDDEQDIVDICQTYFEYEGYKVTTTTSGKEAISLLSNDIDIMVLDIMMPEVNGYDIVKEMKRQKLDIPFIYLTA
KTQEHDTIYALTLGADDYVKKPFSPRELVLRINNLLTRMKKYHHQPVEQLSFDELTLINLSKVVTVNGHEVPMRIKEFEL
LWYLASRENEVISKSELLEKVWGYDYYEDANTVNVHIHRIREKLEKESFTTYTITTVWGLGYKFERSR
>Q8CQ17 ~~~saeR~~~Response regulator SaeR~~~COG0745
MTHLLIVDDEKDIVDICQTYFEYEGYQVTTTTCGKEALKLLSSDIDIMILDIMMPEVSGYDIVKKMKDMQLDIPFIYLTA
KTQEHDTIYALTLGADDYIKKPFSPRELVLRTNNLLARMSKSNHSNKIEQLEFDGLVLKNLSKTLTINNIEIPMRIKEFE
LLWYLASREGEVISKSELLEKVWGYDYYEDANTVNVHIHRIREKLEKHDFLPYTITTVWGLGYKFERSR
>Q2G2U1 2.7.13.3~~~saeS~~~Histidine protein kinase SaeS~~~COG5002
MVLSIRSQIIIGVVSSILLTSTILAIAYILMWFNGHMTLTLTLTTIITSCLTLLICSIFINPLIQKIKQFNIKTKQFANG
NYASNDKTFNSPKEIYELNQSFNKMASEITQQMNQIKSEQQEKTELIQNLAHDLKTPLASIISYSEGLRDGIITKDHEIK
ESYDILIKQANRLSTLFDDMTHIITLNTGKTYPPELIQLDQLLVSILQPYEQRIKHENRTLEVNFCNEIDAFYQYRTPLE
RILTNLLDNALKFSNVGSRIDINISENEDQDTIDIAISDEGIGIIPELQERIFERTFRVENSRNTKTGGSGLGLYIANEL
AQQNNAKISVSSDIDVGTTMTVTLHKLDITS
>Q5HHW5 2.7.13.3~~~saeS~~~Histidine protein kinase SaeS~~~
MVLSIRSQIIIGVVSSILLTSTILAIAYILMWFNGHMTLTLTLTTIITSCLTLLICSIFINPLIQKIKQFNIKTKQFANG
NYASNDKTFNSPKEIYELNQSFNKMASEITQQMNQIKSEQQEKTELIQNLAHDLKTPLASIISYSEGLRDGIITKDHEIK
ESYDILIKQANRLSTLFDDMTHIITLNTGKTYPPELIQLDQLLVSILQPYEQRIKHENRTLEVNFCNEIDAFYQYRTPLE
RILTNLLDNALKFSNVGSRIDINISENEDQDTIDIAISDEGIGIIPELQERIFERTFRVENSRNTKTGGSGLGLYIANEL
AQQNNAKISVSSDIDVGTTMTVTLHKLDITS
>Q840P7 2.7.13.3~~~saeS~~~Histidine protein kinase SaeS~~~
MVLSIRSQIIIGVVSSIPLTSTILAIAYILMWFNGHMTLTLTLTTIITSCLTLLICSIFINPLIQKIKQFNIKTKQFANG
NYASNDKTFNSPKEIYELNQSFNKMASEITQQMNQIKSEQQEKTELIQNLAHDLKTPLASIISYSEGLRDGIITKDHEIK
ESYDILIKQANRLSTLFDDMTHIITLNTGKTYPPELIQLDQLLVSILQPYEQRIKHENRTLEVNFCNEIDAFYQYRTPLE
RILTNLLDNALKFSNVGSRIDINISENEDQDTIDIAISDEGIGIIPELQERIFERTFRVENSRNTKTGGSGLGLYIANEL
AQQNNAKISVSSDIDVGTTMTVTLHKLDITS
>Q7A6V4 2.7.13.3~~~saeS~~~Histidine protein kinase SaeS~~~
MVLSIRSQIIIGVVSSILLTSTILAIAYILMWFNGHMTLTLTLTTIITSCLTLLICSIFINPLIQKIKQFNIKTKQFANG
NYASNDKTFNSPKEIYELNQSFNKMASEITQQMNQIKSEQQEKTELIQNLAHDLKTPLASIISYSEGLRDGIITKDHEIK
ESYDILIKQANRLSTLFDDMTHIITLNTGKTYPPELIQLDQLLVSILQPYEQRIKHENRTLEVNFCSEIDAFYQYRTPLE
RILTNLLDNALKFSNVGSRIDINISENKDQDTIDIAISDEGIGIIPELQERIFERTFRVENSRNTKTGGSGLGLYIANEL
AQQNNAKISVSSDIDVGTTMTVTLHKLDITS
>O32062 ~~~safA~~~SpoIVD-associated factor A~~~COG1388
MKIHIVQKGDSLWKIAEKYGVDVEEVKKLNTQLSNPDLIMPGMKIKVPSEGVPVRKEPKAGKSPAAGSVKQEHPYAKEKP
KSVVDVEDTKPKEKKSMPYVPPMPNLQENVYPEADVNDYYDMKQLFQPWSPPKPEEPKKHHDGNMDHMYHMQDQFPQQEA
MSNMENANYPNMPNMPKAPEVGGIEEENVHHTVPNMPMPAVQPYYHYPAHFVPCPVPVSPILPGSGLCYPYYPAQAYPMH
PMHGYQPGFVSPQYDPGYENQHHENSHHGHYGSYGAPQYASPAYGSPYGHMPYGPYYGTPQVMGAYQPAAAHGYMPYKDH
DDCGCDGDHQPYFSAPGHSGMGAYGSPNMPYGTANPNPNPYSAGVSMPMTNQPSVNQMFGRPEEENE
>P76136 ~~~safA~~~Two-component-system connector protein SafA~~~
MHATTVKNKITQRDNYKEIMSAIVVVLLLTLTLIAIFSAIDQLSISEMGRIARDLTHFIINSLQG
>Q2BN77 1.2.1.73~~~safD~~~Sulfoacetaldehyde dehydrogenase~~~COG1012
MSNTYSLVNPFDNSPLGEYEYTPWATLENQLAMLKEGQLSQRKTAAFQRAGVLNKLAALLKEHSEEMATLITQETGKTIL
DSRVEMMRAYNAAIASAEEARQIQGESLDSDAYAPAGGKIGVVCWKPLGTILCITPFNFPINIAIHKIGPAYAAGNTILF
KPGPQNTASAQLLVKLCYEAGMPENTLQLCMPEFSDLDRLNAHPDVNAINFTGGTAAANAISAAAGYKKLLLELGGNDPL
IVMDDGDLEAATTAAINHRFATAGQRCTAAKRLFIHANVYEAFRDLLVEKSSKLVVGDPMKDDTFVGPVINQGAADQIRT
LIEQAIEDGASVALGNQYEGAFVYPTILENVSPTSEIMVEEAFGPVMPLYKFESVEEIIPIINNTAYGLQAGVFSQNLAT
IKELYEQLDVGTLAANDGPGFRTEHFPFGGVKESGIGREGIKYAIREMSYTKTLVI
>A0A087WNH6 3.13.2.1~~~ahcY~~~Adenosylhomocysteinase~~~COG0499
MNAKPGFTDYIVKDIALADFGRKEISLAETEMPGLMATREEYGPKQPLKGARIAGSLHMTIQTAVLIETLAALGADIRWV
SCNIYSTQDHAAAAIAAAGIPVFAVKGETLTEYWDYTAKLFDWHGGGTPNMILDDGGDATMLVHAGYRAEQGDTAFLDKP
GSEEEEIFYALVKRLLKEKPKGWFAEIAKNIKGVSEETTTGVHRLYEMANKGTLLFPAINVNDSVTKSKFDNLYGCRESL
VDGIRRGTDVMLSGKVAMVAGFGDVGKGSAASLRQAGCRVMVSEVDPICALQAAMEGYEVVTMEDAAPRADIFVTATGNK
DIITIEHMRAMKDRAIVCNIGHFDNEIQIASLRNLKWTNIKPQVDEIEFPDKHRIIMLSEGRLVNLGNAMGHPSFVMSAS
FTNQTLAQIELFANNKDSKYAKKVYVLPKTLDEKVARLHLAKIGVKLTELRKDQADYIGVKQEGPYKSDHYRY
>Q2YQX8 3.13.2.1~~~ahcY~~~Adenosylhomocysteinase~~~
MTASQDFVVKDISLADWGRKELDIAETEMPGLMAAREEFGKSQPLKGARISGSLHMTIQTAVLIETLKVLGAEVRWASCN
IFSTQDHAAAAIAATGTPVFAVKGETLEEYWTYTDQIFQWPDGEPSNMILDDGGDATMYILIGARAEAGEDVLSNPQSEE
EEVLFAQIKKRMAATPGFFTKQRAAIKGVTEETTTGVNRLYQLQKKGLLPFPAINVNDSVTKSKFDNKYGCKESLVDGIR
RGTDVMMAGKVAVVCGYGDVGKGSAQSLAGAGARVKVTEVDPICALQAAMDGFEVVTLDDAASTADIVVTTTGNKDVITI
DHMRKMKDMCIVGNIGHFDNEIQVAALRNLKWTNVKPQVDLIEFPDGKRLILLSEGRLLNLGNATGHPSFVMSASFTNQV
LGQIELFTRTDAYKNEVYVLPKHLDEKVARLHLDKLGAKLTVLSEEQAAYIGVTPQGPFKSEHYRY
>Q3JY79 3.13.2.1~~~ahcY~~~Adenosylhomocysteinase~~~
MNAAVIDSHSAQDYVVADIALAGWGRKELNIAETEMPGLVQIRDEYKAQQPLKGARIAGSLHMTIQTGVLIETLKALGAD
VRWASCNIFSTQDHAAAAIVEAGTPVFAFKGESLDEYWEFSHRIFEWPNGEFANMILDDGGDATLLLILGSKAEKDRSVI
ARPTNEEEVALFKSIERHLEIDGSWYSKRLAHIKGVTEETTTGVHRLYQMEKDGRLPFPAFNVNDSVTKSKFDNLYGCRE
SLVDGIKRATDVMIAGKIAVVAGYGDVGKGCAQSLRGLGATVWVTEIDPICALQAAMEGYRVVTMEYAADKADIFVTATG
NYHVINHDHMKAMRHNAIVCNIGHFDSEIDVASTRQYQWENIKPQVDHIIFPDGKRVILLAEGRLVNLGCATGHPSFVMS
NSFTNQTLAQIELFTRGGEYANKVYVLPKHLDEKVARLHLARIGAQLSELSDDQAAYIGVSKAGPFKPDHYRY
>Q7TWW7 3.13.2.1~~~ahcY~~~Adenosylhomocysteinase~~~
MTGNLVTKNSLTPDVRNGIDFKIADLSLADFGRKELRIAEHEMPGLMSLRREYAEVQPLKGARISGSLHMTVQTAVLIET
LTALGAEVRWASCNIFSTQDHAAAAVVVGPHGTPDEPKGVPVFAWKGETLEEYWWAAEQMLTWPDPDKPANMILDDGGDA
TMLVLRGMQYEKAGVVPPAEEDDPAEWKIFLNLLRTRFETDKDKWTKIAESVKGVTEETTTGVLRLYQFAAAGDLAFPAI
NVNDSVTKSKFDNKYGTRHSLIDGINRGTDALIGGKKVLICGYGDVGKGCAEAMKGQGARVSVTEIDPINALQAMMEGFD
VVTVEEAIGDADIVVTATGNKDIIMLEHIKAMKDHAILGNIGHFDNEIDMAGLERSGATRVNVKPQVDLWTFGDTGRSII
VLSEGRLLNLGNATGHPSFVMSNSFANQTIAQIELWTKNDEYDNEVYRLPKHLDEKVARIHVEALGGHLTKLTKEQAEYL
GVDVEGPYKPDHYRY
>A4ZHR8 3.13.2.1~~~ahcY~~~Adenosylhomocysteinase~~~COG0499
MTELKADVRNGIDYKVADLSLADFGRKEIRLAEHEMPGLMALRREYHDVQPLKGARISGSLHMTVQTAVLIETLVSLGAE
VRWASCNIFSTQDHAAAAVVVGPNGTPEEPKGVSVFAWKGETLEEYWWAAEQMLTWDGEPANMILDDGGDATMMVLRGAQ
YEKAGVVPPAEDDDPAEWKVFLGVLRERFEQDKTKWTKIAESVKGVTEETTTGVLRLYQFAAAGELAFPAINVNDSVTKS
KFDNKYGTRHSLIDGINRGTDVLIGGKKVLICGYGDVGKGCAESLAGQGARVSVTEIDPINALQALMDGFDVRTVEEAIG
EADIVITATGNKDIITLDHMKAMKNQAILGNIGHFDNEIDMAALERSGAKKINIKPQVDQWIFDDGKSIIVLSEGRLLNL
GNATGHPSFVMSNSFSNQVIAQIELWTKNDEYDNEVYRLAKHLDEKVARIHVEALGGTLTKLSKDQAEYIGVDVEGPYKP
EHYRY
>P9WGV3 3.13.2.1~~~ahcY~~~Adenosylhomocysteinase~~~COG0499
MTGNLVTKNSLTPDVRNGIDFKIADLSLADFGRKELRIAEHEMPGLMSLRREYAEVQPLKGARISGSLHMTVQTAVLIET
LTALGAEVRWASCNIFSTQDHAAAAVVVGPHGTPDEPKGVPVFAWKGETLEEYWWAAEQMLTWPDPDKPANMILDDGGDA
TMLVLRGMQYEKAGVVPPAEEDDPAEWKVFLNLLRTRFETDKDKWTKIAESVKGVTEETTTGVLRLYQFAAAGDLAFPAI
NVNDSVTKSKFDNKYGTRHSLIDGINRGTDALIGGKKVLICGYGDVGKGCAEAMKGQGARVSVTEIDPINALQAMMEGFD
VVTVEEAIGDADIVVTATGNKDIIMLEHIKAMKDHAILGNIGHFDNEIDMAGLERSGATRVNVKPQVDLWTFGDTGRSII
VLSEGRLLNLGNATGHPSFVMSNSFANQTIAQIELWTKNDEYDNEVYRLPKHLDEKVARIHVEALGGHLTKLTKEQAEYL
GVDVEGPYKPDHYRY
>Q9I685 3.13.2.1~~~ahcY~~~Adenosylhomocysteinase~~~
MSAVMTPAGFTDYKVADITLAAWGRRELIIAESEMPALMGLRRKYAGQQPLKGAKILGCIHMTIQTGVLIETLVALGAEV
RWSSCNIFSTQDQAAAAIAAAGIPVFAWKGETEEEYEWCIEQTILKDGQPWDANMVLDDGGDLTEILHKKYPQMLERIHG
ITEETTTGVHRLLDMLKNGTLKVPAINVNDSVTKSKNDNKYGCRHSLNDAIKRGTDHLLSGKQALVIGYGDVGKGSSQSL
RQEGMIVKVAEVDPICAMQACMDGFEVVSPYKNGINDGTEASIDAALLGKIDLIVTTTGNVNVCDANMLKALKKRAVVCN
IGHFDNEIDTAFMRKNWAWEEVKPQVHKIHRTGKDGFDAHNDDYLILLAEGRLVNLGNATGHPSRIMDGSFANQVLAQIH
LFEQKYADLPAAEKAKRLSVEVLPKKLDEEVALEMVKGFGGVVTQLTPKQAEYIGVSVEGPFKPDTYRY
>P74008 3.13.2.1~~~ahcY~~~Adenosylhomocysteinase~~~COG0499
MVATPVKQKYDIKDISLAPQGRQRIEWAAREMPVLKQIRERFAQEKPFAGIRLVACCHVTTETANLAIALHAGGADSLLI
ASNPLSTQDDVAACLVADYGIPVYAIKGEDNETYHRHVQIALDHRPNIIIDDGSDVVATLVQERQHQLSDIIGTTEETTT
GIVRLRAMFNDGVLTFPAMNVNDADTKHFYDNRYGTGQSTLDGIIRATNILLAGKTIVVAGYGWCGKGVAMRAKGMGADV
IVTEISPVPAIEAAMDGFRVMPMAEAAHQGDIFITVTGNKHVIRPEHFAVMKDGAIVCNSGHFDIEIDLKSLKEQAKEVK
EVRNFTEQYILPNGKSIIVIGEGRLVNLAAAEGHPSAVMDMSFANQALACEHLVKNKGQLEPGMHSIPVEVDQEIARLKL
QAMGIAIDSLTPEQVEYINSWASGT
>O51933 3.13.2.1~~~ahcY~~~Adenosylhomocysteinase~~~COG0499
MNTGEMKINWVSRYMPLLNKIAEEYSREKPLSGFTVGMSIHLEAKTAYLAITLSKLGAKVVITGSNPLSTQDDVAEALRS
KGITVYARRTHDESIYRENLMKVLDERPDFIIDDGGDLTVISHTEREEVLENLKGVSEETTTGVRRLKALEETGKLRVPV
IAVNDSKMKYLFDNRYGTGQSTWDAIMRNTNLLVAGKNVVVAGYGWCGRGIALRAAGLGARVIVTEVDPVKAVEAIMDGF
TVMPMKEAVKIADFVITASGNTDVLSKEDILSLKDGAVLANAGHFNVEIPVRVLEEIAVEKFEARPNVTGYTLENGKTVF
LLAEGRLVNLAAGDGHPVEIMDLSFALQIFAVLYLLENHRKMSPKVYMLPDEIDERVARMKLDSLGVKIDELTEKQRRYL
RSWQ
>Q48864 ~~~saiA~~~Sakacin-A immunity factor~~~
MKADYKKINSILTYTSTALKNPKIIKDKDLVVLLTIIQEEAKQNRIFYDYKRKFRPAVTRFTIDNNFEIPDCLVKLLSAV
ETPKAWSGFS
>P0A311 ~~~curA~~~Bacteriocin curvacin-A~~~
MNNVKELSMTELQTITGGARSYGNGVYCNNKKCWVNRGEATQSIIGGMISGWASGLAGM
>P0A310 ~~~sapA~~~Bacteriocin sakacin-A~~~
MNNVKELSMTELQTITGGARSYGNGVYCNNKKCWVNRGEATQSIIGGMISGWASGLAGM
>P35618 ~~~sakP~~~Bacteriocin sakacin-P~~~
MEKFIELSLKEVTAITGGKYYGNGVHCGKHSCTVDWGTAIGNIGNNAAANWATGGNAGWNK
>Q99SU7 ~~~sak~~~Staphylokinase~~~
MLKRSLLFLTVLLLLFSFSSITNEVSASSSFDKGKYKKGDDASYFEPTGPYLMVNVTGVDSKGNELLSPHYVEFPIKPGT
TLTKEKIEYYVEWALDATAYKEFRVVELDPSAKIEVTYYDKNKKKEETKSFPITEKGFVVPDLSEHIKNPGFNLITKVVI
EKK
>P68802 ~~~sak~~~Staphylokinase~~~
MLKRSLLFLTVLLLLFSFSSITNEVSASSSFDKGKYKKGDDASYFEPTGPYLMVNVTGVDGKGNELLSPHYVEFPIKPGT
TLTKEKIEYYVEWALDATAYKEFRVVELDPSAKIEVTYYDKNKKKEETKSFPITEKGFVVPDLSEHIKNPGFNLITKVVI
EKK
>A4X3Q0 2.5.1.94~~~salL~~~Adenosyl-chloride synthase~~~COG1912
MQHNLIAFLSDVGSADEAHALCKGVMYGVAPAATIVDITHDVAPFDVREGALFLADVPHSFPAHTVICAYVYPETGTATH
TIAVRNEKGQLLVGPNNGLLSFALDASPAVECHEVLSPDVMNQPVTPTWYGKDIVAACAAHLAAGTDLAAVGPRIDPKQI
VRLPYASASEVEGGIRGEVVRIDRAFGNVWTNIPTHLIGSMLQDGERLEVKIEALSDTVLELPFCKTFGEVDEGQPLLYL
NSRGRLALGLNQSNFIEKWPVVPGDSITVSPRVPDSNLGPVLG
>A8D7K2 4.2.3.152~~~salQ~~~2-epi-5-epi-valiolone synthase~~~
MTGTSLTDTSSGLYFRDHSQGWLLRAQKQISYEVRLRDGIFRPECTDLLEQGAGTPGRSRRFVVVDSNVDLMYGNRIRSY
FDYHGVDCSIMVVEANETLKNLETATRIVDEIDAFGIARRKEPLIVIGGGVLMDIVGLVASLYRRGAPFVRVPTTLIGLV
DAGVGVKTGVNFNGHKNRLGTYTPADLTLLDRQFLATLDRRHIGNGLAEILKIALIKDLSLFAALEEHGPTLLDEKFQGS
TAAGDRAARSVLHSAIHGMLDELQPNLWEAELERCVDYGHTFSPTVEMRALPELLHGEAVCVDMALTTVIAWRRGLLTEA
QRDRIFAVMAALELPSWHPILDPDVLVNALQDTVRHRDGLQRLPLPVGIGGVTFVNDVTPRELEAAVTLQQELGDARTPK
TSGDRGGRNL
>B0B8F4 ~~~~~~S-adenosylmethionine/S-adenosylhomocysteine transporter~~~
MVAVKALLFACTLRTCVFKPCCDMAIFLIFLNAFIWSSSFALSKSAMEAAAPLFVTGSRMVLAGVVLFGLLLCKRESLRL
PRPAIMPIVLLSVIGFYLTNVLEFIGLQRLSSSTACFIYGFSPFTAAFCSYVQLREVVTWKKLGGLSLGLVSYLVYLLFG
GSEDVAEWGWQLGLPELLLIAATCLSSYGWTLLRKLGRRCESLSMTAINAYAMVIAGVLSLIHSAVTEVWNPVPVENPLL
FLQAIGALVIFSNLICYNLFAKLLRSFSSTFLSFCNLVMPLFASFFGWLLLGESFPPGLLFAVGFMVLGCRLIYHEEFRQ
GYVLTSE
>D5EID4 4.2.-.-~~~~~~S-adenosylmethionine-dependent nucleotide dehydratase~~~COG0535
MKPAIPPTINLHTIRACNYGCKYCFAGFQDCDTGVMPQADLHEILRQFAATTGMAIHPAKVNFAGGEPMLSPTFVEDICY
AKSLGLTTSLVTNGSLLSERLLDKLSGQLDLLTISIDSLKPGTNRAIGRTNRQNPLTVSEYLDRILKARTRGITVKLNTV
VNRLNLDEDMTDFIREAQPIRWKLFKVLKIQNENSAHFDSWAIRDEEFVHFVERHRKVESSGVTLVPESNEQMYGTYGII
SPDGRFIDNSQGTHRYSPRIVDVGITQAFADVNFSMAGFQQRGGIYSIKRSTTNRSLQTSALHPKRELTK
>A0A1M7D0R2 4.2.-.-~~~~~~S-adenosylmethionine-dependent nucleotide dehydratase~~~
MNIKTIVINWHITESCNYKCKYCFAKWNRVKEIWTNPDNVRKILENLKSIRLEDCLFTQKRLNIVGGEPILQQERLWQVI
KMAHEMDFEISIITNGSHLEYICPFVHLISQVGVSIDSFDHKTNVRIGRECNGKTISFQQLKEKLEELRTLNPGLNIKIN
TVVNEYNFNEILVDRMAELKIDKWKILRQLPFDGKEGISDFKFNTFLFNNLKEEKMPKKDPLSNFLAAFSAPQKQNNVIF
VEDNDVMTESYLMIAPDGRLFQNGHKEYEYSHPLTEISIDEALEEINFDQEKFNNRYENYATEEAKYRMEEFFLMNEYED
VSFDCCCPFGDKD
>A0A1S1YUU1 4.2.-.-~~~~~~S-adenosylmethionine-dependent nucleotide dehydratase~~~
MNKLVSGNNIIPSVNFHLWEPCNMRCKFCFAKFQDVKSTILPKGHLKKEQTLEIVEQLAEYGFQKITFVGGEPTLCPWIS
ELIKKANLLGMTTMIVTNGSNLSKDFLVQNQSYLDWITLSIDSINSSTNKVVGRSTNSIHPDRIYYNQLIQTIYEYGYRL
KINTVVTKANLNEDLNDFVNDAKPERWKVFQVLPVRGQNDNDIDELLISEKEFNEYVNRHSKNKFLITETNTDMTNTYVM
VDPAGRFFNNQNGNYMYSDHILEVGVQKAFEEMGYNYDKFIDRKGIYQWK
>P0DW53 4.2.-.-~~~~~~S-adenosylmethionine-dependent nucleotide dehydratase~~~
MKTKITLSGFAGTGKSTVGKRIQEQLNFEFVSVGNYSRQYAMEKYGLTINEFQEQCKAQPELDNEIDEKFRLECNSKENL
VIDYRLGFHFIKNAFHVLLKVSDESASKRIRLANRSDEVTSTKAIQQRNQKMRDRFQDNYGVDFTNDKNYDLVIDTDDLT
ANEVADLIIEHYQKSNAVSKIPSVNFHLWQPCNMRCKFCFATFLDVKQEYVPKGHLPEDEALEVVRKIAAAGFEKITFAG
GEPLLCKWLPKLIKTAKQLGMTTMIVTNGSKLTDSFLKENKAYLDWIAVSIDSLDEENNIKIGRAITGKKPLSKAFYYDL
IDKIHQYGYGLKINTVVNKVNYKDNLASFIAKAKPKRWKVLQVLPIKGQNDNKIDAFKITDEEYANFLDTHKDVETIVPE
SNDEIKGSYVMVDPAGRFFDNAAGTHNYSKPILEVGIQEALKTMNYDLDKFLNRGGVYNWNTNKNQDLRKEEVSYE
>P0DW52 4.2.-.-~~~~~~S-adenosylmethionine-dependent nucleotide dehydratase~~~
MTTPSFIPSVNFHLIKPCNMGCKYCFARFNDVASKSLTRGGLPKEDALAVVSALADFGFEKITFAGGEPTLYPWLTDVIE
LAKNKGMTTMLVTNGSRLNEAFYLRHAGLLDWITVSIDSLSVGTNLAIGRAKHGNQVFAREDYEVIAAMIHDYSYRLKIN
TVVSRYNHEEDMNDFIAHAKPERWKVFQALPIVGENDEYLEEFEITAEEFQQFLGRHGSQAKLVKENNDEMRGSYAMVDP
KGCFFTNVNGQLEASSPILTVGCDAALREMNYDLTKFHDRGGRYDW
>A0A244CMP0 4.2.-.-~~~~~~S-adenosylmethionine-dependent nucleotide dehydratase~~~
MEIHMTSIQELVINFHMTEACNYRCGYCYATWQDNSSDTELHHASENIHSLLLKLADYFFADNSLRQTLKYQSVRINFAG
GEPVMLGSRFIDAILFAKQLGFATSIITNGHLLSTVMLKKIAVHLDMLGISFDTGDYLIAQSIGRVDRKKSWLSPARVLD
VVTQYRALNPKGKVKINTVVNAYNWRENLTQTITQLKPDKWKLLRVLPVYSKEMTVLQWQYESYVHKHQVHADVIVVEDN
DDMWQSYLMINPEGRFYQNAGACKGLTYSPPVLEVGVEEALKYINFNAEAFSKRYQSIHLPLAMSAGA
>P0DW49 4.2.-.-~~~vip8~~~S-adenosylmethionine-dependent nucleotide dehydratase~~~
MHNHNKIANKELVVNWHITEACNYRCGYCFAKWGKQKGELIQDVASISQLMDAISGLPAVLNQMHAANFEGVRLNLVGGE
TFLNYRKIKEVVKQAKKRGLKLSAITNGSRINNDFINLIANNFASIGFSVDSVDNSTNLNIGRVEKNAVMNPEKIIHTIA
SIRAINPKIEIKVNTVVSDLNKSEDLSDFIGQVMPNKWKIFKVLPVVANHHLISEEQFTRFLRRHQRFGEIIYAEDNTEM
VDSYIMIDPIGRFFQNSDFNNGYYYSRPILQVGIHQAFNEINFNANKFYSRYKRASLN
>A0A1H0NKS3 4.2.-.-~~~vip6~~~S-adenosylmethionine-dependent nucleotide dehydratase~~~
MAYKVNLHITQKCNYACKYCFAHFDHHNDLTLGQWKHIIDNLKTSGLVDAINFAGGEPVLHRDFAAIVNYAYDQGFKLSI
ITNGSLMLNPKLMPPELFAKFDTLGISVDSINPKTLIALGACNNSQEVLSYDKLSHLITLARSVNPTIRIKLNTVITNLN
ADEDLTIIGQELDIARWKMLRMKLFIHEGFNNAPLLVSQADFDGFVERHAEVSHDIVPENDLTRSYIMVDNQGRLLDDET
EEYKVVGSLLAEDFGTVFDRYHFDEATYASRYAG
>P36634 ~~~sapA~~~Peptide transport periplasmic protein SapA~~~
MRLVLSSLIVIAGLLSSQATAATAPEQTASADIRDSGFVYCVSGQVNTFNPQKASSGLIVDTLAAQLYDRLLDVDPYTYR
LVPELAESWEVLDNGATYRFHLRRDVSFQKTAWFTPTRKLNADDVVFTFQRIFDRRHPWHNINGSSFPYFDSLQFADNVK
SVRKLDNNTVEFRLTQPDASFLWHLATHYASVMSAEYAAQLSRKDRQELLDRQPVGTGPFQLSEYRAGQFIRLQRHDGFW
RGKPLMPQVVVDLGSGGTGRLSKLLTGECDVLAWPAASQLTILRDDPRLRLTLRPGMNIAYLAFNTDKPPLNNPAVRHAL
ALSINNQRLMQSIYYGTAETAASILPRASWAYDNDAKITEYNPQKSREQLKALGIENLTLHLWVPTSSQAWNPSPLKTAE
LIQADMAQVGVKVVIVPVEGRFQEARLMDMNHDLTLSGWATDSNDPDSFFRPLLSCAAINSQTNFAHWCNPEFDSVLRKA
LSSQQLASRIEAYEEAQNILEKELPILPLASSLRLQAYRYDIKGLVLSPFGNASFAGVSREKHEEVKKP
>P12690 ~~~sapA~~~Spore-associated protein A~~~COG0739
MQAVGATLTAVGAIGAGLLVTAPAAGAATAGATASYNGVCGSGYKVVNSMPIGSTGTVYLTYNSATGKNCTVTIRNTTGT
PTYMVAYVRNIESGADQYDEGDYRSYAGPVYVSARGACVEWGGVIGNLQAWNYGSNCGALAAKAPQKDWFAGQR
>Q45514 ~~~sapB~~~Protein SapB~~~COG1285
MLLSWYIDPDILLKLGIATLIGMVIGLERELKNKPLGLKTCIVIAVSSCMLTIVSINAAYHFPKYYRIMMDPLRLPAQII
SGVGFIGAGVILRKSNDVISGLTTSAMIWGAAGLGLATGAGFYKEAFASLLFILISVEFLPWVVRKIGPDRLQEKDIRIR
MSLSDKDKMTEILKEMKRRDIKAHSVRIDDLGEKDFPIMEVKVRVHKNRYTTDVYYDIKAIEGVVGVKCDTL
>P0AGH3 ~~~sapB~~~Putrescine export system permease protein SapB~~~COG4168
MIIFTLRRILLLIVTLFLLTFVGFSLSYFTPHAPLQGASLWNAWVFWFNGLIHWDFGVSSINGQPIAEQLKEVFPATMEL
CILAFGFALIVGIPVGMIAGITRHKWQDNLINAIALLGFSIPVFWLALLLTLFCSLTLGWLPVSGRFDLLYEVKPITGFA
LIDAWLSDSPWRDEMIMSAIRHMILPVITLSVAPTTEVIRLMRISTIEVYDQNYVKAAATRGLSRFTILRRHVLHNALPP
VIPRLGLQFSTMLTLAMITEMVFSWPGLGRWLINAIRQQDYAAISAGVMVCGSLVIIVNVISDILGAMANPLKHKEWYAL
R
>P0A2J3 ~~~sapB~~~Peptide transport system permease protein SapB~~~
MIIFTLRRLLLLLVTLFFLTFIGFSLSYFTPHAPLQGASLWNAWVFWFNGLLHWDFGVSSINGQLISEQLKEVFPATMEL
CILAFGFALMVGIPVGMLAGVTRSKWPDRFISALALLGFSIPVFWLALLLTLFFSLTLGWLPVSGRFDLLYEVKPVTGFA
IIDAWISDSPWRDEMVMSAIRHMVLPVLTLSVAPTTEVIRLMRISTIEVYDQNYVKAAATRGLSRFTILRRHVLHNALPP
VIPRLGLQFSTMLTLAMITEMVFSWPGLGRWLIHAIRQQDYAAISAGVMVIGSLVIVVNVISDILGAMANPLKHKEWYAL
R
>P0AGH5 ~~~sapC~~~Putrescine export system permease protein SapC~~~COG4171
MPYDSVYSEKRPPGTLRTAWRKFYSDASAMVGLYGCAGLAVLCIFGGWFAPYGIDQQFLGYQLLPPSWSRYGEVSFFLGT
DDLGRDVLSRLLSGAAPTVGGAFVVTLAATICGLVLGTFAGATHGLRSAVLNHILDTLLAIPSLLLAIIVVAFAGPSLSH
AMFAVWLALLPRMVRSIYSMVHDELEKEYVIAARLDGASTLNILWFAVMPNITAGLVTEITRALSMAILDIAALGFLDLG
AQLPSPEWGAMLGDALELIYVAPWTVMLPGAAIMISVLLVNLLGDGVRRAIIAGVE
>P0A2J5 ~~~sapC~~~Peptide transport system permease protein SapC~~~
MPYDSVYSEKRPPGTLRTAWRKFYSDAPAMVGLYGCAGLALLCIFGGWIAPYGIDQQFLGYQLLPPSWSRYGEVSFFLGT
DDLGRDVLSRLLSGAAPTVGGAFIVTLAATLCGLVLGVVAGATHGLRSAVLNHILDTLLSIPSLLLAIIVVAFAGPHLSH
AMFAVWLALLPRMVRSVYSMVHDELEKEYVIAARLDGATTLNILWFAILPNITAGLVTEITRALSMAILDIAALGFLDLG
AQLPSPEWGAMLGDALELIYVAPWTVMLPGAAITLSVLLVNLLGDGIRRAIIAGVE
>P0AAH4 ~~~sapD~~~Putrescine export system ATP-binding protein SapD~~~COG4172
MPLLDIRNLTIEFKTGDEWVKAVDRVSMTLTEGEIRGLVGESGSGKSLIAKAICGVNKDNWRVTADRMRFDDIDLLRLSA
RERRKLVGHNVSMIFQEPQSCLDPSERVGRQLMQNIPAWTYKGRWWQRFGWRKRRAIELLHRVGIKDHKDAMRSFPYELT
EGECQKVMIAIALANQPRLLIADEPTNSMEPTTQAQIFRLLTRLNQNSNTTILLISHDLQMLSQWADKINVLYCGQTVET
APSKELVTMPHHPYTQALIRAIPDFGSAMPHKSRLNTLPGAIPLLEQLPIGCRLGPRCPYAQRECIVTPRLTGAKNHLYA
CHFPLNMEKE
>H8ZPX2 1.2.1.83~~~ald~~~3-succinoylsemialdehyde-pyridine dehydrogenase~~~
MRDYREFYIDGQWVRPKGAREAEVINPATEKIVGLISLGTEEHVDLAVRAARRAFDGWSRTSKDQRLELLEQVCRAFESK
LDEIAKAITEEMGAPLVQLALPLQAPAGLGHFLTAASILRDYDFEESLGTTRVVREPAGVCGLITPWNWPLNQIAAKVAP
ALAAGCTMVLKPSEIAPFSAYLLARIFDEVGVPPGVFNLVNGDGPGVGAPLAAHPEVDLVSFTGSTRAGTLVSTAAAPTV
KRVALELGGKSANIILDDADLETAVKHGVRTMMLNTGQSCNAPSRMLVPLSKLDEVEHLAEHFCKEIVVGDPMHSDTNIG
PLASGMQYEKVQDCIRQGVAEGAKLICGGLGRPDGLESGYFAQPTIFSAVNKQMYIAREEIFGPVLCIMPYGDENEAIQI
ANDSCYGLSGYVSSGSLERARNVAKQLRTGAVHLNGAALDFTAPFGGYKQSGNGREWGKYGFEEFLEIKAVMGYEGS
>P36636 ~~~sapD~~~Peptide transport system ATP-binding protein SapD~~~
MPLLDIRNLTIEFKTSEGWVKAVDRVSMTLSEGEIRGLVGESGSGKSLIAKAICGVAKDNWRVTADRMRFDDIDLLRLSS
RERRKLVGHNVSMIFQEPQSCLDPSERVGRQLMQNIPAWTYKGRWWQRLGWRKRRAIELLHRVGIKDHKDAMRSFPYELT
DGECQKVMIAIALANQPRLLIADEPTNSMEPTTQAQIFRLLTRLNQNSNTTILLISHDLQMLSQWADKINVLYCGQTVET
APSKDLVTMPHHPYTQALIRAIPDFGSAMPHKSRLNTLPGAIPLLEQLPIGCRLGPRCPYAQRECIITPRLTGAKNHLYA
CHFPLNMERE
>P0AAH8 ~~~sapF~~~Putrescine export system ATP-binding protein SapF~~~COG4172
MIETLLEVRNLSKTFRYRTGWFRRQTVEAVKPLSFTLREGQTLAIIGENGSGKSTLAKMLAGMIEPTSGELLIDDHPLHF
GDYSFRSQRIRMIFQDPSTSLNPRQRISQILDFPLRLNTDLEPEQRRKQIIETMRMVGLLPDHVSYYPHMLAPGQKQRLG
LARALILRPKVIIADEALASLDMSMRSQLINLMLELQEKQGISYIYVTQHIGMMKHISDQVLVMHQGEVVERGSTADVLA
SPLHELTKRLIAGHFGEALTADAWRKDR
>P36638 ~~~sapF~~~Peptide transport system ATP-binding protein SapF~~~
MVETLLEVRNLSKTFRYRTGWFRRQTVDAVKPLSFTLRERQTLAIIGENGSGKSTLAKMLAGMIEPTSGELLIDDHPLHY
GDYSFRSQRIRMIFQDPSTSLNPRQRISQILDFPLRLNTDLEPEQRRKQIVETMRMVGLLPDHVSYYPHMLAPGQKQRLG
LARALILRPKVIIADEALASLDMSMRSQLINLMLELQEKQGISYIYVTQHIGMMKHISDQVLVMHQGEVVERGSTADVLA
SPLHELTRRLIAGHFGEALTADAWRKDR
>O53361 3.1.3.64~~~sapM~~~Phosphatidylinositol-3-phosphatase~~~COG3511
MLRGIQALSRPLTRVYRALAVIGVLAASLLASWVGAVPQVGLAASALPTFAHVVIVVEENRSQAAIIGNKSAPFINSLAA
NGAMMAQAFAETHPSEPNYLALFAGNTFGLTKNTCPVNGGALPNLGSELLSAGYTFMGFAEDLPAVGSTVCSAGKYARKH
VPWVNFSNVPTTLSVPFSAFPKPQNYPGLPTVSFVIPNADNDMHDGSIAQGDAWLNRHLSAYANWAKTNNSLLVVTWDED
DGSSRNQIPTVFYGAHVRPGTYNETISHYNVLSTLEQIYGLPKTGYATNAPPITDIWGD
>O07802 ~~~sap~~~Sulfolipid-1 exporter Sap~~~
MWSTVLVLALSVICEPVRIGLVVLMLNRRRPLLHLLTFLCGGYTMAGGVAMVTLVVLGATPLAGHFSVAEVQIGTGLIAL
LIAFALTTNVIGKHVRRATHARVGDDGGRVLRESVPPSGAHKLAVRARCFLQGDSLYVAGVSGLGAALPSANYMGAMAAI
LASGATPATQALAVVTFNVVAFTVAEVPLVSYLAAPRKTRAFMAALQSWLRSRSRRDAALLVAAGGCLMLTLGLSNL
>A0A0F6B506 ~~~sarA~~~Salmonella anti-inflammatory response activator~~~
MMRFVYIYILVIYGSYLWFSLGGNMFTINSTNRVASTIAPYACVSDVNLEDKATFLDEHTSIHANDSSLQCFVLNDQHVP
QNTLATDVEGYNRGLQERISLEYQPLESIVFLLGTPAVLETKESLSLPVSPDALTQKLLSISSNDECKLSGSTSCTTPAS
HNPPSGYIAQYRHSAEVFPDE
>Q2G2U9 ~~~sarA~~~Transcriptional regulator SarA~~~COG1846
MAITKINDCFELLSMVTYADKLKSLIKKEFSISFEEFAVLTYISENKEKEYYLKDIINHLNYKQPQVVKAVKILSQEDYF
DKKRNEHDERTVLILVNAQQRKKIESLLSRVNKRITEANNEIEL
>Q5HI51 ~~~sarA~~~Transcriptional regulator SarA~~~
MAITKINDCFELLSMVTYADKLKSLIKKEFSISFEEFAVLTYISENKEKEYYLKDIINHLNYKQPQVVKAVKILSQEDYF
DKKRNEHDERTVLILVNAQQRKKIESLLSRVNKRITEANNEIEL
>A6QES8 ~~~sarA~~~Transcriptional regulator SarA~~~
MAITKINDCFELLSMVTYADKLKSLIKKEFSISFEEFAVLTYISENKEKEYYLKDIINHLNYKQPQVVKAVKILSQEDYF
DKKRNEHDERTVLILVNAQQRKKIESLLSRVNKRITEANNEIEL
>Q7A732 ~~~sarA~~~Transcriptional regulator SarA~~~
MAITKINDCFELLSMVTYADKLKSLIKKEFSISFEEFAVLTYISENKEKEYYLKDIINHLNYKQPQVVKAVKILSQEDYF
DKKRNEHDERTVLILVNAQQRKKIESLLSRVNKRITEANNEIEL
>P0C1U6 ~~~sarA~~~Transcriptional regulator SarA~~~
MAITKINDCFELLSMVTYADKLKSLIKKEFSISFEEFAVLTYISENKEKEYYLKDIINHLNYKQPQVVKAVKILSQEDYF
DKKRNEHDERTVLILVNAQQRKKIESLLSRVNKRITEANNEIEL
>Q7A1N5 ~~~sarA~~~Transcriptional regulator SarA~~~
MAITKINDCFELLSMVTYADKLKSLIKKEFSISFEEFAVLTYISENKEKEYYLKDIINHLNYKQPQVVKAVKILSQEDYF
DKKRNEHDERTVLILVNAQQRKKIESLLSRVNKRITEANNEIEL
>P31306 ~~~sarA~~~Oligopeptide-binding protein SarA~~~COG4166
MKKGKILALAGVALLATGVLAACSNSTSNSSNSSSSGADQVFNYIYEVDPENLNYLISSKAATTDLTANLIDGLLENDNY
GNLVPSMAEDWTVSKDGLTYTYTLRKDAKWYTSDGEEYADVKAQDFVAGLKYAADNKSETLYLVQSSIKGLDDYVNGKTK
DFSSVGVKAVDDHTVQYTLNEPESFWNSKTTMGILYPVNEEFLKSKGDKFAQSADPTSLLYNGPFLLKSITSKSSIEFAK
NPNYWDKDNVHVSDVKLTYFDGQDQGKPAEQFAKGALSAARLAPTSATFSKVEKEFKDNIVYTPQDSTSYLVGVNIDRQA
YNHTAKSSDAQKSSTKKALMNKDFRQALSFAFDRTAYASQVNGKEGATKMLRNLYIPPTFVQADGKSFGELVKEKVASYG
DEWKDVNFDDAQDGLYNKEKAKAEFAKAKKALQEEGVEFPIHLDMPVDQTATAKVQRVQSLKQSIESSLGTDNVVVDIHQ
MKTDDVLNITYYAASAAEEDWDISDNVGWSPDYQDPSTYLEIIKPGGENTKTFLGFDGKENAAAEQVGLKEYAKLVDEAA
AEKTDVNKRYEKYATAQAWLTDSALLIPTTSRTGRPVLTKIVPFTAPFAWSGAKGRDMASYKYLKLQDKAVTAKEYQKAQ
EKWNKERAESNKKAQEELEKHVK
>E5Y946 1.1.1.-~~~sarD~~~Sulfoacetaldehyde reductase~~~COG1454
MAFVHYTVKKIVHGLGAIKEAANEVKNLKGSKAFIVTDPGLAKIGVQKPLEEALTAGGIEWKLYAEAQLEPSMDSIQHCT
DEAKAFGADVIIGFGGGSALDTTKAASVLLSNEGPIDKYFGINLVPNPSLPCILIPTTSGTGSEMTNISVLADTKNGGKK
GVVSEYMYADTVILDAELTFGLPPRVTAMTGVDAFVHAMESFCGIAATPITDALNLQAMKLVGANIRQAYANGKNAAARD
AMMYASALAGMGFGNTQNGIIHAIGTTLPVECHIPHGLAMSFCAPFSVGFNYIANPEKYAIVADILRGDDRSGCMSVMDR
AADVEDAFRDLLNDLDIATGLSNYGVKREDLPACADRAFAAKRLLNNNPRAASRDQILALLEANFEA
>Q9F0R1 ~~~sarR~~~HTH-type transcriptional regulator SarR~~~COG1846
MSKINDINDLVNATFQVKKFFRDTKKKFNLNYEEIYILNHILRSESNEISSKEIAKCSEFKPYYLTKALQKLKDLKLLSK
KRSLQDERTVIVYVTDTQKANIQKLISELEEYIKN
>Q7A425 ~~~sarR~~~HTH-type transcriptional regulator SarR~~~
MSKINDINDLVNATFQVKKFFRDTKKKFNLNYEEIYILNHILRSESNEISSKEIAKCSEFKPYYLTKALQKLKDLKLLSK
KRSLQDERTVIVYVTDTQKANIQKLISELEEYIKN
>P0C0R2 ~~~sarS~~~HTH-type transcriptional regulator SarS~~~
MKYNNHDKIRDFIIIEAYMFRFKKKVKPEVDMTIKEFILLTYLFHQQENTLPFKKIVSDLCYKQSDLVQHIKVLVKHSYI
SKVRSKIDERNTYISISEEQREKIAERVTLFDQIIKQFNLADQSESQMIPKDSKEFLNLMMYTMYFKNIIKKHLTLSFVE
FTILAIITSQNKNIVLLKDLIETIHHKYPQTVRALNNLKKQGYLIKERSTEDERKILIHMDDAQQDHAEQLLAQVNQLLA
DKDHLHLVFE
>Q2G1N7 ~~~sarS~~~HTH-type transcriptional regulator SarS~~~COG1846
MKYNNHDKIRDFIIIEAYMFRFKKKVKPEVDMTIKEFILLTYLFHQQENTLPFKKIVSDLCYKQSDLVQHIKVLVKHSYI
SKVRSKIDERNTYISISEEQREKIAERVTLFDQIIKQFNLADQSESQMIPKDSKEFLNLMMYTMYFKNIIKKHLTLSFVE
FTILAIITSQNKNIVLLKDLIETIHHKYPQTVRALNNLKKQGYLIKERSTEDERKILIHMDDAQQDHAEQLLAQVNQLLA
DKDHLHLVFE
>Q5HJQ7 ~~~sarS~~~HTH-type transcriptional regulator SarS~~~
MKYNNHDKIRDFIIIEAYMFRFKKKVKPEVDMTIKEFILLTYLFHQQENTLPFKKIVSDLCYKQSDLVQHIKVLVKHSYI
SKVRSKIDERNTYISISEEQREKIAERVTLFDQIIKQFNLADQSESQMIPKDSKEFLNLMMYTMYFKNIIKKHLTLSFVE
FTILAIITSQNKNIVLLKDLIETIHHKYPQTVRALNNLKKQGYLIKERSTEDERKILIHMDDAQQDHAEQLLAQVNQLLA
DKDHLHLVFE
>Q7A872 ~~~sarS~~~HTH-type transcriptional regulator SarS~~~
MKYNNHDKIRDFIIIEAYMFRFKKKVKPEVDMTIKEFILLTYLFHQQENTLPFKKIVSDLCYKQSDLVQHIKVLVKHSYI
SKVRSKIDERNTYISISEEQREKIAERVTLFDQIIKQFNLADQSESQMIPKDSKEFLNLMMYTMYFKNIIKKHLTLSFVE
FTILAIITSQNKNIVLLKDLIETIHHKYPQTVRALNNLKKQGYLIKERSTEDERKILIHMDDAQQDHAEQLLAQVNQLLA
DKDHLHLVFE
>Q2G2B1 ~~~sarT~~~HTH-type transcriptional regulator SarT~~~COG1846
MNDLKSKSNIKLMKRVLTTYELRKYLKKYFCLTLDNYLVLAYLDVFKNDEGKYFMRDIISYIGIDQSRIVKSVKDLSKKG
YLNKCRDPHDSRNVIIVVSVKQHNYIKNLLSEININET
>Q2G1T7 ~~~sarU~~~HTH-type transcriptional regulator SarU~~~COG1846
MDYQTFEKVNKFINVEAYIFFLTQELKQQYKLSLKELLILAYFYYKNEHSISLKEIIGDILYKQSDVVKNIKSLSKKGFI
NKSRNEADERRIFVSVTPIQRKKIACVINELDKIIKGFNKERDYIKYQWAPKYSKEFFILFMNIMYSKDFLKYRFNLTFL
DLSILYVISSRKNEILNLKDLFESIRFMYPQIVRSVNRLNNKGMLIKERSLADERIVLIKINKIQYNTIKSIFTDTSKIL
KPRKFFF
>Q7A3K0 ~~~sarU~~~HTH-type transcriptional regulator SarU~~~
MDYQTFEKVNKFINVEAYIFFLTQELKQQYKLSLKELLILAYFYYKNEHSISLKEIIGDILYKQSDVVKNIKSLSKKGFI
NKSRNEADERRIFVSVTPIQRKKIACVINELDKIIKGFNKERDYIKYQWAPKYSKEFFILFMNIMYSKDFLKYRFNLTFL
DLSILYVISSRKNEILNLKDLFESIRFMYPQIVRSVNRLNNKGMLIKERSLADERIVLIKINKIQYNTIKSIFTDTSKIL
KPRKFFF
>Q2FVY9 ~~~sarV~~~HTH-type transcriptional regulator SarV~~~
MSNKVQRFIEAERELSQLKHWLKTTHKISIEEFVVLFKVYEAEKISGKELRDTLHFEMLWDTSKIDVIIRKIYKKELISK
LRSETDERQVFYFYSTSQKKLLDKITKEIEVLSVTN
>Q2G0D1 ~~~sarX~~~HTH-type transcriptional regulator SarX~~~
MNTEKLETLLGFYKQYKALSEYIDKKYKLSLNDLAVLDLTMKHCKDEKVLMQSFLKTAMDELDLSRTKLLVSIRRLIEKE
RLSKVRSSKDERKIYIYLNNDDISKFNALFEDVEQFLNI
>Q2FVN3 ~~~sarZ~~~HTH-type transcriptional regulator SarZ~~~COG1846
MYVENSYLSKQLCFLFYVSSKEIIKKYTNYLKEYDLTYTGYIVLMAIENDEKLNIKKLGERVFLDSGTLTPLLKKLEKKD
YVVRTREEKDERNLQISLTEQGKAIKSPLAEISVKVFNEFNISEREASDIINNLRNFVSKNFDYSDKK
>Q5HDG9 ~~~sarZ~~~HTH-type transcriptional regulator SarZ~~~
MYVENSYLSKQLCFLFYVSSKEIIKKYTNYLKEYDLTYTGYIVLMAIENDEKLNIKKLGERVFLDSGTLTPLLKKLEKKD
YVVRTREEKDERNLQISLTEQGKAIKSPLAEISVKVFNEFNISEREASDIINNLRNFVSKNFDYSDKK
>P84583 ~~~~~~Small, acid-soluble spore protein 1~~~
MPNQSGSNSSNQLLVPGAAQAIDQMKFEIASEFGVNLGAETTSRANGSVGGEITKRLVSFAQQQMGGGVQ
>P21886 ~~~sspC1~~~Small, acid-soluble spore protein C1~~~
MSQHLVPEAKNGLSKFKNEVAAEMGVPFSDYNGDLSSKQCGSVGGEMVKRMVEQYEKGI
>P22065 ~~~~~~Small, acid-soluble spore protein alpha~~~
TTNNNNTKAVPEAKAALKQMKLEIANELGISNYDTADKGNMTARQNGYVGGYMTKKLVEMAEQQMSGQQR
>P84584 ~~~~~~Small, acid-soluble spore protein 2~~~
MAQNSQNGNSSNQLLVPGAAQAIDQMKFEIASEFGVNLGAETTSRANGSVGGEITKRLVSFAQQNMSGQQF
>P21887 ~~~sspC2~~~Small, acid-soluble spore protein C2~~~
MSQHLVPEAKNGLSKFKNEVANEMGVPFSDYNGDLSSRQCGSVGGEMVKRMVEKYEQSMK
>P22066 ~~~~~~Small, acid-soluble spore protein beta~~~
STKKAVPEAKAALNQMKLEIANELGLSNYESVDKGNLTARQNGYVGGYMTKKLVEMAERQMSGK
>P02959 ~~~sasP-A~~~Small, acid-soluble spore protein A~~~
MANTNKLVAPGSAAAIDQMKYEIASEFGVNLGPEATARANGSVGGEITKRLVQMAEQQLGGK
>Q06904 2.7.13.3~~~sasA~~~Adaptive-response sensory kinase SasA~~~COG2205
MGESLSPQALAQPLLLQLFVDTRPLSQHIVQRVKNILAAVEATVPISLQVINVADQPQLVEYYRLVVTPALVKIGPGSRQ
VLSGIDLTDQLANQLPQWLVQQEAFFADREPPEVNIPFTELGQPETPALQQADAFFQLQQQYADLSERTKFLEQVIALVA
HDLRNPLTAALLAVDTIQIRSQSFSVATAKEMQGLCSLFDQARSQLREIERMIAEILEATRHSGESLRINPREVVFEPLL
QQVLEQLHERWRSKQQQLITDVPGDLPTLYADPDRLRQVLVNLLDNAIKYTPPGGTITIAALHRTSQKVQISISDTGSGI
PRDQLSVIFKNLVRLSRDSSQEGYGIGLSVCQRIVQAHFGRIWVASELGQGSTFHFTMPVYRYTMPC
>Q55630 2.7.13.3~~~sasA~~~Adaptive-response sensory kinase SasA~~~COG2205
MSSSSELGNASSVPLQFLLFIDDRPNSQDSVQEIGQCLTNLLDGHSHDLQILQISKHPHLVEHFRLVATPSLIKLQPEPR
QVLAGSNIIQQLQKWWPRWQQELAMDPNPEDTGQSPSCPREISSVGYSGELMKMSDELFLLKKDKEELLQQIQFKDQILA
MLAHDLRSPLTAASIAVDTLELLQHKPIEEQKPALRSQLLYQARKQFKIMDRLIEDILQASKNLNSQFQVHGRPLAIADL
CQEVLELYQAKFSKKNLTITYDIPKDLPNVFADEELIRQVIANLLDNAIKYTPAHGSITVGALHRTTQKVQVSITDNGPG
IPNSKQETIFEGHFRLQRDEQTDGYGLGLSLCRKIIQAHYGQIWVDSRPKQGSSFHFTLPVYR
>Q8DMT2 2.7.13.3~~~sasA~~~Adaptive-response sensory kinase SasA~~~COG2205
MKASADASSPQETTPPLSLLLFVANRPGDEEETAAIQAHIQQLPSNFSFELKVVPIGEQPYLLEEYKLVATPALIKVRPE
PRQTLAGRKLLQKVDYWWPRWQREVALGLQADMQKSAAEQSDCSMELSRLKDELFQLRQERDRLAEQLQFKDRIISLLAH
ELRNPLTAGGIALETLESNLQEESSQQLPIEDIQRLFHHARSQTQTMGQLITDLLLAARGPQDKLQIMARQLDLRQLCQE
TVEDVRLNFERKKQHFTTDIPLDLPLVYGDGDRIRQVLVNLLDNACKYTPEGGKIHLSAFHRMTQKVQVTVSDTGPGIPI
EQQEKIFGETVRLDRDRAIEGYGIGLALCRQIIRMHYGQIWVDSQPGKGSCFHFTLPVYS
>P02960 ~~~sasP-C~~~Small, acid-soluble spore protein C~~~
MANYQNASNRNSSNKLVAPGAQAAIDQMKFEIASEFGVNLGPDATARANGSVGGEITKRLVQLAEQNLGGKY
>P84585 ~~~~~~Small, acid-soluble spore protein gamma-type~~~
MANSNNKTNAQQVRKQNQQSASGQGQFGTEFASETNVQQVRKQNQQSAAGQGQFGTEFASETDAQQVRQQNQSAEQNKQQ
NS
>P02961 ~~~sasP-B~~~Small, acid-soluble spore protein gamma-type~~~
MAKQTNKTASGTSTQHVKQQNAQASKNNFGTEFGSETNVQEVKQQNAQAAANKSQNAQASKNNFGTEFASETSAQEVRQQ
NAQAQAKKNQNSGKYQG
>Q2G2B2 ~~~sasG~~~Surface protein G~~~COG3583
MRDKKGPVNKRVDFLSNKLNKYSIRKFTVGTASILIGSLMYLGTQQEAEAAENNIENPTTLKDNVQSKEVKIEEVTNKDT
APQGVEAKSEVTSNKDTIEHEPSVKAEDISKKEDTPKEVADVAEVQPKSSVTHNAETPKVRKARSVDEGSFDITRDSKNV
VESTPITIQGKEHFEGYGSVDIQKKPTDLGVSEVTRFNVGNESNGLIGALQLKNKIDFSKDFNFKVRVANNHQSNTTGAD
GWGFLFSKGNAEEYLTNGGILGDKGLVNSGGFKIDTGYIYTSSMDKTEKQAGQGYRGYGAFVKNDSSGNSQMVGENIDKS
KTNFLNYADNSTNTSDGKFHGQRLNDVILTYVASTGKMRAEYAGKTWETSITDLGLSKNQAYNFLITSSQRWGLNQGINA
NGWMRTDLKGSEFTFTPEAPKTITELEKKVEEIPFKKERKFNPDLAPGTEKVTREGQKGEKTITTPTLKNPLTGVIISKG
EPKEEITKDPINELTEYGPETIAPGHRDEFDPKLPTGEKEEVPGKPGIKNPETGDVVRPPVDSVTKYGPVKGDSIVEKEE
IPFEKERKFNPDLAPGTEKVTREGQKGEKTITTPTLKNPLTGEIISKGESKEEITKDPINELTEYGPETITPGHRDEFDP
KLPTGEKEEVPGKPGIKNPETGDVVRPPVDSVTKYGPVKGDSIVEKEEIPFEKERKFNPDLAPGTEKVTREGQKGEKTIT
TPTLKNPLTGVIISKGEPKEEITKDPINELTEYGPETITPGHRDEFDPKLPTGEKEEVPGKPGIKNPETGDVVRPPVDSV
TKYGPVKGDSIVEKEEIPFKKERKFNPDLAPGTEKVTREGQKGEKTITTPTLKNPLTGEIISKGESKEEITKDPINELTE
YGPETITPGHRDEFDPKLPTGEKEEVPGKPGIKNPETGDVVRPPVDSVTKYGPVKGDSIVEKEEIPFEKERKFNPDLAPG
TEKVTREGQKGEKTITTPTLKNPLTGEIISKGESKEEITKDPINELTEYGPETITPGHRDEFDPKLPTGEKEEVPGKPGI
KNPETGDVVRPPVDSVTKYGPVKGDSIVEKEEIPFKKERKFNPDLAPGTEKVTREGQKGEKTITTPTLKNPLTGEIISKG
ESKEEITKDPINELTEYGPETITPGHRDEFDPKLPTGEKEEVPGKPGIKNPETGDVVRPPVDSVTKYGPVKGDSIVEKEE
IPFEKERKFNPDLAPGTEKVTREGQKGEKTITTPTLKNPLTGEIISKGESKEEITKDPINELTEYGPETITPGHRDEFDP
KLPTGEKEEVPGKPGIKNPETGDVVRPPVDSVTKYGPVKGDSIVEKEEIPFEKERKFNPDLAPGTEKVTREGQKGEKTIT
TPTLKNPLTGEIISKGESKEEITKDPVNELTEFGGEKIPQGHKDIFDPNLPTDQTEKVPGKPGIKNPDTGKVIEEPVDDV
IKHGPKTGTPETKTVEIPFETKREFNPKLQPGEERVKQEGQPGSKTITTPITVNPLTGEKVGEGQPTEEITKQPVDKIVE
FGGEKPKDPKGPENPEKPSRPTHPSGPVNPNNPGLSKDRAKPNGPVHSMDKNDKVKKSKIAKESVANQEKKRAELPKTGL
ESTQKGLIFSSIIGIAGLMLLARRRKN
>O34764 2.7.7.4~~~sat~~~Sulfate adenylyltransferase~~~COG2046
MSLAPHGGTLVNRVDESYDVSGIQKEIELDLISFADLELIGIGAYSPIEGFFNEKDYVSVVENMRLSSGVVWSLPITLPV
DAQKAAELSLGETVKLTYEGETYGVIQIEDLYVPDKQKEAVNVYKTDEQEHPGVKKLFSRGNTYVGGPITLIKKASKQFP
EFTFEPSETRRQFAEKGWETIVGFQTRNPVHRAHEYIQKTALETVDGLFLNPLVGETKSDDIPADVRMESYQVLLDHYYP
KDRVFLGVFLAAMRYAGPREAIFHALVRKNYGCTHFIVGRDHAGVGDYYGTYEAQELFDTFKPEELGITPLKFEHSFFCK
KCGNMGTAKTCPHGREHHVILSGTKVRGMLRDGVLPPAEFSRKEVVEVLIKGMKKKEEVGVS
>O67174 ~~~sat/cysC~~~Probable bifunctional SAT/APS kinase~~~COG0529
MEKIKYLKSIQISQRSVLDLKLLAVGAFTPLDRFMGEEDYRNVVESMRLKSGTLFPIPITLPMEKEIAKDLKEGEWIVLR
DPKNVPLAIMRVEEVYKWNLEYEAKNVLGTTDPRHPLVAEMHTWGEYYISGELKVIQLPKYYDFPEYRKTPKQVREEIKS
LGLDKIVAFQTRNPMHRVHEELTKRAMEKVGGGLLLHPVVGLTKPGDVDVYTRMRIYKVLYEKYYDKKKTILAFLPLAMR
MAGPREALWHGIIRRNYGATHFIVGRDHASPGKDSKGKPFYDPYEAQELFKKYEDEIGIKMVPFEELVYVPELDQYVEIN
EAKKRNLKYINISGTEIRENFLKQGRKLPEWFTRPEVAEILAETYVPKHKQGFCVWLTGLPCAGKSTIAEILATMLQARG
RKVTLLDGDVVRTHLSRGLGFSKEDRITNILRVGFVASEIVKHNGVVICALVSPYRSARNQVRNMMEEGKFIEVFVDAPV
EVCEERDVKGLYKKAKEGLIKGFTGVDDPYEPPVAPEVRVDTTKLTPEESALKILEFLKKEGFIKD
>P0AC98 ~~~satP~~~Succinate-acetate/proton symporter SatP~~~COG1584
MGNTKLANPAPLGLMGFGMTTILLNLHNVGYFALDGIILAMGIFYGGIAQIFAGLLEYKKGNTFGLTAFTSYGSFWLTLV
AILLMPKLGLTDAPNAQFLGVYLGLWGVFTLFMFFGTLKGARVLQFVFFSLTVLFALLAIGNIAGNAAIIHFAGWIGLIC
GASAIYLAMGEVLNEQFGRTVLPIGESH
>O66036 2.7.7.4~~~sat~~~Sulfate adenylyltransferase~~~COG2046
MIKPVGSDELRPRFVYDPEQHHRLSSEAESLPSVIVSSQAAGNAVMLGAGYFSPLDGFMNLADALSSAQSMTLTDGRFFP
VPLLCLLESADAIAGATRIALRDPNVEGNPVLAVMDVTAVEQVSDAQMALMTEQVYGTSDPKHPGVETFNSQGRTAISGP
IQVLNFSYFQTDFPDTFRTAVEIRHEIQERGWQKIVAFQTRNPMHRAHEELCKMAMEAVEADGVVIHMLLGQLKPGDIPA
PVRDAAIRTMAELYFPPNTVMVTGYGFDMLYAGPREAVLHAYFRQNMGATHFIIGRDHAGVGDYYGPFDAQTIFDDAVPT
DVLAIEIFRADNTAYSKKLGRVVMMRDAPDHTPDDFIQLSGTRVREMLGQGEAPPPEFSRPEVAQILMDYYRSLPQS
>Q8FDW4 3.4.21.-~~~sat~~~Serine protease sat autotransporter~~~COG3468
MNKIYSLKYSAATGGLIAVSELAKRVSGKTNRKLVATMLSLAVAGTVNAANIDISNVWARDYLDLAQNKGIFQPGATDVT
ITLKNGDKFSFHNLSIPDFSGAAASGAATAIGGSYSVTVAHNKKNPQAAETQVYAQSSYRVVDRRNSNDFEIQRLNKFVV
ETVGATPAETNPTTYSDALERYGIVTSDGSKKIIGFRAGSGGTSFINGESKISTNSAYSHDLLSASLFEVTQWDSYGMMI
YKNDKTFRNLEIFGDSGSGAYLYDNKLEKWVLVGTTHGIASVNGDQLTWITKYNDKLVSELKDTYSHKINLNGNNVTIKN
TDITLHQNNADTTGTQEKITKDKDIVFTNGGDVLFKDNLDFGSGGIIFDEGHEYNINGQGFTFKGAGIDIGKESIVNWNA
LYSSDDVLHKIGPGTLNVQKKQGANIKIGEGNVILNEEGTFNNIYLASGNGKVILNKDNSLGNDQYAGIFFTKRGGTLDL
NGHNQTFTRIAATDDGTTITNSDTTKEAVLAINNEDSYIYHGNINGNIKLTHNINSQDKKTNAKLILDGSVNTKNDVEVS
NASLTMQGHATEHAIFRSSANHCSLVFLCGTDWVTVLKETESSYNKKFNSDYKSNNQQTSFDQPDWKTGVFKFDTLHLNN
ADFSISRNANVEGNISANKSAITIGDKNVYIDNLAGKNITNNGFDFKQTISTNLSIGETKFTGGITAHNSQIAIGDQAVV
TLNGATFLDNTPISIDKGAKVIAQNSMFTTKGIDISGELTMMGIPEQNSKTVTPGLHYAADGFRLSGGNANFIARNMASV
TGNIYADDAATITLGQPETETPTISSAYQAWAETLLYGFDTAYRGAITAPKATVSMNNAIWHLNSQSSINRLETKDSMVR
FTGDNGKFTTLTVNNLTIDDSAFVLRANLAQADQLVVNKSLSGKNNLLLVDFIEKNGNSNGLNIDLVSAPKGTAVDVFKA
TTRSIGFSDVTPVIEQKNDTDKATWTLIGYKSVANADAAKKATLLMSGGYKAFLAEVNNLNKRMGDLRDINGESGAWARI
ISGTGSAGGGFSDNYTHVQVGADNKHELDGLDLFTGVTMTYTDSHAGSDAFSGETKSVGAGLYASAMFESGAYIDLIGKY
VHHDNEYTATFAGLGTRDYSSHSWYAGAEVGYRYHVTDSAWIEPQAELVYGAVSGKQFSWKDQGMNLTMKDKDFNPLIGR
TGVDVGKSFSGKDWKVTARAGLGYQFDLFANGETVLRDASGEKRIKGEKDGRMLMNVGLNAEIRDNLRFGLEFEKSAFGK
YNVDNAINANFRYSF
>A0A1D3PCK2 2.3.1.30~~~metA~~~Serine O-acetyltransferase~~~
MANKVKIGILNLMHDKLDTQSHFIKVLPNADLTFFYPRMHYQNRPIPPEVNMTSEPLDINRVSEFDGFIITGAPIDQIDF
SKITYIEEIRYLLQALDNHKIQQLYFCWGAMAALNYFYGIKKKILAEKIFGVFPHLITEPHPLLSGLSQGFMAPHARYAE
MDKKQIMQDERLAINAVDDNSHLFMVSAKDNPERNFIFSHIEYGKDSLRDEYNREINAHPERHYKKPINYSMSNPSFQWQ
DTQKIFFNNWLKKVKDNKLVLN
>Q54506 2.7.7.4~~~sat~~~Sulfate adenylyltransferase~~~
MIKPVGSDELKPLFVYDPEEHHKLSHEAESLPSVVISSQGPRVSSMMGAGYFSPAGFMNVADAMGAAEKMTLSDGSSSCS
VLCLLENTDAIGDAKRIALRDPNVEGNPVLAVMDIEAIEEVSDEQMAVMTDKVYRTTDMDHIGVKTFNSQGRVAVSGPIQ
VLNFSYFQADFPDTFRTAVEIRNEIKEHGWSKVVAFQTRNPMHRAHEELCRMPMESLDADGVVVHMLLGKLKKGDIPAPV
RDAAIRTMAEVYFPPNTVMVTGYGFDMLYAGPREAVLHAYFRQNMGATHFIIGREPPAWVTTTVPSTPRPSSMTKCQRAP
WRSRSSCRPHGLLQEAEQDCDDARRAGSHQGRLRTALRHQGREMLGQGIAPPPEFSRPEVAKILMDLLPVHQQLILIWFS
GKTRPGVGRWRVFLCAAGALWPEAVAVANMEKRSSTG
>Q9X1C0 2.6.1.51~~~~~~Serine-pyruvate aminotransferase~~~COG0075
MGKFLKKHYIMAPGPTPVPNDILTEGAKETIHHRTPQFVSIMEETLESAKYIFQTKHNVYAFASTGTGAMEAAVANLVSP
GDKVIVVVAGKFGERWRELCQAYGADIVEIALEWGDAVTPEQIEEALNKNPDAKVVFTTYSETSTGTVIDLEGIARVTKE
KDVVLVTDAVSALGAEPLKMDEWGVDLVVTGSQKGLMLPPGLALISLNDKAWGLVEKSRSPRYYFDLRAYRKSYPDNPYT
PAVNMIYMLRKALQMIKEEGIENVWERHRILGDATRAAVKALGLELLSKRPGNVVTAVKVPEGIDGKQIPKIMRDKYGVT
IAGGQAKLKGKIFRIAHLGYMSPFDTITAISALELTLKELGYEFELGVGVKAAEAVFAKEFIGE
>Q0K845 1.2.1.81~~~sauS~~~Sulfoacetaldehyde dehydrogenase (acylating)~~~COG1012
MSVQILHRRQSNNSDLPLPTASLPVQPAQAAAEAVAAVVARARQAQREFARADQATVDTAVAAAAWAIMEPARNRQLAER
AVADTGLGNVDDKIRKNHRKTLGLLRDLHGRRTVGVIAQDAAAGITEIARPVGVVAAITPSTNPAATPANKIINALKCGN
SVILAPSPKGQDTCALLLSFIHAEFARAGLPADLVQMLPAPVSKTATAELMRQADLVVATGSQANVRMAYTCGTPAFGVG
AGNVASIIDASATLDDAAAKVARSKTFDNATSCSSENSLVVVDAVYTPMLDALAAVGGVLLTASEKARLQALMWRDAKLA
GSFTGQSATRIAELAGLERVRALQPAMLLVEETGVGSDYPFSGEKLSPVLTLYRATDFAAAVERVASLYAYMGAGHSVSL
HSSNPRHALQLGQELPVARVIVNQAHCFATGGNFDNGLPFSLSMGCGTWGGNNFSDNLGWRQYLNITRIAVPIAEHVPDE
ADLLGDYFARVGK
>Q0K844 6.2.1.-~~~sauT~~~Probable sulfoacetate--CoA ligase~~~COG0318
MNARTEPEVFDTLAALIAVRAAQWPDKPYLLSPDSGHALTFGALATDAGTLGRSYAAAGLGSGQTVSVYLPNGEQTARLL
LGTMACGLVVNPINLLCQPAQLRYILAHSDTRLVFTWPDGEAAIREALREAGLDVPVLVTAPDANSLPALPATHDAASPL
PPPQPDAPALLMYTSGTTGTPKGVLLTQRNLVANGTNVSREHCLGPADRVLATLPLYHINGLVVTAIAPLVHGGSVVMPM
RFSASAFWQDSARHGCTWLNVVPTIIAYLLNDPHGQAPAGVRFCRSASAALPPEHHRAFEARFGIGVIETMGMTETAAPA
FSNPLDPGQRRIGSIGRPSGTRARVLGRDGKPAPDGQVGEIVLQGESVMAGYYKAPDITREAFTHDGWLRTGDLGYRDAD
GYFYISGRAKELIIKGGENIAPREIDEALLRHPGVLEAAAVGVPDPAYGQEIVAYVVMREAARCDDAALRAHCLRELGRY
KTPKEFRFIAELPRGPSGKVQRLKLLNHA
>Q0K843 ~~~sauU~~~Probable sulfoacetate transporter SauU~~~COG2271
MKQRIKTRHMILGVMCLMYFIAYIDRVNISVAAPLIREEMGLTTSQLGLVFSAFAYPYAAMQILGGWMADKFGPKKVLIV
LSLIWGVATVLTGFAGSVLILVVLRFVLGIGEGGAFPTATRAFTYWMPVAERGFAQGITHSFARLGGAITPPVVLVIVAA
AGWREAFIVLGAVSLGWTLLYAFFFKDSPDKHSRVTAQELQEIGYRHGDSRQAAKAATPWRRLFRRMWLVTFVDFCYGWS
LWVYLTWLPSYLKEARGFDLKQLALFTALPLMAGVVGDTLGGVLSDRIYKRTGNLRLARGAVLFVGLAGSLMFIAPMTFT
ADAVNAVILLSLSFFFLELTNAVLWSLPLDIAGKYAGTAGGMMNTGFGVAGMVSPVVFGYLIERTGSYDLPFMISGALLG
VGALASLFINPLLTVDSPDEKAGEVRHALP
>P22629 ~~~~~~Streptavidin~~~
MRKIVVAAIAVSLTTVSITASASADPSKDSKAQVSAAEAGITGTWYNQLGSTFIVTAGADGALTGTYESAVGNAESRYVL
TGRYDSAPATDGSGTALGWTVAWKNNYRNAHSATTWSGQYVGGAEARINTQWLLTSGTTEANAWKSTLVGHDTFTKVKPS
AASIDAAKKAGVNNGNPLDAVQQ
>P13458 ~~~sbcC~~~Nuclease SbcCD subunit C~~~COG0419
MKILSLRLKNLNSLKGEWKIDFTREPFASNGLFAITGPTGAGKTTLLDAICLALYHETPRLSNVSQSQNDLMTRDTAECL
AEVEFEVKGEAYRAFWSQNRARNQPDGNLQVPRVELARCADGKILADKVKDKLELTATLTGLDYGRFTRSMLLSQGQFAA
FLNAKPKERAELLEELTGTEIYGQISAMVFEQHKSARTELEKLQAQASGVTLLTPEQVQSLTASLQVLTDEEKQLITAQQ
QEQQSLNWLTRQDELQQEASRRQQALQQALAEEEKAQPQLAALSLAQPARNLRPHWERIAEHSAALAHIRQQIEEVNTRL
QSTMALRASIRHHAAKQSAELQQQQQSLNTWLQEHDRFRQWNNEPAGWRAQFSQQTSDREHLRQWQQQLTHAEQKLNALA
AITLTLTADEVATALAQHAEQRPLRQHLVALHGQIVPQQKRLAQLQVAIQNVTQEQTQRNAALNEMRQRYKEKTQQLADV
KTICEQEARIKTLEAQRAQLQAGQPCPLCGSTSHPAVEAYQALEPGVNQSRLLALENEVKKLGEEGATLRGQLDAITKQL
QRDENEAQSLRQDEQALTQQWQAVTASLNITLQPLDDIQPWLDAQDEHERQLRLLSQRHELQGQIAAHNQQIIQYQQQIE
QRQQLLLTTLTGYALTLPQEDEEESWLATRQQEAQSWQQRQNELTALQNRIQQLTPILETLPQSDELPHCEETVVLENWR
QVHEQCLALHSQQQTLQQQDVLAAQSLQKAQAQFDTALQASVFDDQQAFLAALMDEQTLTQLEQLKQNLENQRRQAQTLV
TQTAETLAQHQQHRPDDGLALTVTVEQIQQELAQTHQKLRENTTSQGEIRQQLKQDADNRQQQQTLMQQIAQMTQQVEDW
GYLNSLIGSKEGDKFRKFAQGLTLDNLVHLANQQLTRLHGRYLLQRKASEALEVEVVDTWQADAVRDTRTLSGGESFLVS
LALALALSDLVSHKTRIDSLFLDEGFGTLDSETLDTALDALDALNASGKTIGVISHVEAMKERIPVQIKVKKINGLGYSK
LESTFAVK
>Q2FYT3 ~~~sbcC~~~Nuclease SbcCD subunit C~~~COG0419
MKPLHLKLNNFGPFLKEEIDFSKIDNNELFLISGKTGSGKTMIFDAMTYALFGKASTEQREENDLRSHFADGKQPMSVTF
EFQLNHRIYKVHRQGPYIKEGNTTKTNAKFDVFEMVDGKYEIRESKVISGTQFIIELLGVNADQFRQLFILPQGEFKRFL
ISNSREKQGILRTLFDSEKFEAIREILKEEVKKEKAQIENRYQQIDLLWQEIESFDDDNIKGLLEVATQQIDKLIENIPL
LQARSKEILASVNESKETAIKEFEIIEKKTLENNILKDNINQLNKNKIDFVQLNEQQPEIEGIEAKLKLLQDITNLLNYI
ENREKIETKIANSKKDISKTNNKILNLDCDKRNIDKEKKMLEENGDLIESKISFIDKTRVLFNDINKYQQSYLNIERLRT
EGEQLGDELNDLIKGLETVEDSIGNNQSDYEKIIELNNTITNINNEINIIKENEKAKAELDKLLGSKQELENQINEETSI
LKNLEIKLDRYDKTKLDLNDKESFISEIKSAVNIGDQCPICGNEIQDLGHHIDFDSIAKRQNEIKEIEANIHAIKSNIAV
HNSEIKFVNEKISNINIKTQSDFSLEVLNKRLLENENALNNQRDLNKFIEQMKEEKDNLTLQIHNKQLRLNKNESELKLC
RDLITEFETLSKYNNITNFEVDYKKYVQDVNQHQELSKEIEDKLMQLSQRKLIEQNNLNHYENQLETYNNDLELNEQSIE
MEMSRLNLTDDNDIDEIIAWRGEQEELEQKRDTYKKRYHEFEMEIARLESLTKDKELLDSDKLKDEYELKKGKMNTLIDE
YSAVHYQCQNNINKTQSIVSHINYLNQELKDQQEIFQLAEIVSGKNNKNLTLENFVLIYYLDQIIAQANLRLATMSDNRY
QLIRREAVSHGLSGLEIDVFDLHSNKSRHISSLSGGETFQSSLALALGLSEIVQQQSGGISLTSIFIDEGFGTLDQETLE
TALDTLLNLKSTGRMVGIISHVSELKNRIPLVLEVKSDQYQSSTRFKRN
>A6QGP8 ~~~sbcC~~~Nuclease SbcCD subunit C~~~
MKPLHLKLNNFGPFLKEEIDFSKIDNNELFLISGKTGSGKTMIFDAMTYALFGKASTEQREENDLRSHFADGKQPMSVTF
EFQLNHRIYKVHRQGPYIKEGNTTKTNAKFDVFEMVDGKYEIRESKVISGTQFIIELLGVNADQFRQLFILPQGEFKRFL
ISNSREKQGILRTLFDSEKFEAIREILKEEVKKEKAQIENRYQQIDLLWQEIESFDDDNIKGLLEVATQQIDKLIENIPL
LQARSKEILASVNESKETAIKEFEIIEKKTLENNILKDNINQLNKNKIDFVQLKEQQPEIEGIEAKLKLLQDITNLLNYI
ENREKIETKIANSKKDISKTNNKILNLDCDKRNIDKEKKMLEENGDLIESKISFIDKTRVLFNDINKYQQSYLNIERLRT
EGEQLGDELNDLIKGLETVEDSIGNNQSDYEKIIELNNTITNINNEINIIKENEKAKAELDKLLGSKQELENQINEETSI
LKNLEIKLDRYDKTKLDLNDKESFISEIKSAVNIGDQCPICGNEIQDLGHHIDFDSIAKRQNEIKEIEANIHAIKSNIAV
HNSEIKFVNEKISNINIKTQSDFSLEVLNKRLLENENALNNQRDLNKFIEQMKEEKDNLTLQIHNKQLRLNKNESELKLC
RDLITEFETLSKYNNITNFEVDYKKYVQDVNQHQELSKEIEDKLMQLSQRKLIEQNNLNHYENQLETYNNDLELNEQSIE
MEMSRLNLTDDNDIDEIIAWRGEQEELEQKRDTYKKRYHEFEMEIARLESLTKDKELLDSDKLKDEYELKKGKMNTLIDE
YSAVHYQCQNNINKTQSIVSHINYLNQELKDQQEIFQLAEIVSGKNNKNLTLENFVLIYYLDQIIAQANLRLATMSDNRY
QLIRREAVSHGLSGLEIDVFDLHSNKSRHISSLSGGETFQSSLALALGLSEIVQQQSGGISLESIFIDEGFGTLDQETLE
TALDTLLNLKSTGRMVGIISHVSELKNRIPLVLEVKSDQYQSSTRFKRN
>Q7A5S6 ~~~sbcC~~~Nuclease SbcCD subunit C~~~
MKPLHLKLNNFGPFLKEEIDFSKIDNNELFLISGKTGSGKTMIFDAMTYALFGKASTEQREENDLRSHFADGKQPMSVTF
EFQLNHRIYKVHRQGPYIKEGNTTKTNAKFDVFEMVDGKYEIRESKVISGTQFIIELLGVNADQFRQLFILPQGEFKRFL
ISNSREKQGILRTLFDSEKFEAIREILKEELKKEKAQIENRYQQIDLLWQEIESFDDDKIKGLLELATQQIDKLIENIPL
LQARSKEILAFVNESKETAIKEYEIIEKKTLENNILKDNINQLNKNKIDFVQLKEQQPEIDEIEAKLKLLQDITNLLNYI
ENREKIETKIANSKKDISKTNNKILNLDCDKRNIDKEKKMLEENGDLIESKTSFIDKTRVLFNDINKYQQSYLNIECLIT
EGEQLGDELNNLIKGLEKVEDSIGNNESDYEKIIELNNAITNINNEINIIKENEKAKAELDKLLGSKQELENQINEETTI
MKNLEIKLDHYDKSKLDLNDKESFISEIKSAVKIGDQCPICGNEIQDLGHHIDFDSIAKRQNEIKEIEANIHAIKSNIAV
HNSEIKFVNEKISNINIKTQSDFSLEVLNKRLLENENALNNQRDLNKFIEQMKEEKDNLTLQIHNKQLRLNKNESELKLC
RDLITEFETLSKYNNITNFEVDYKKYVQDVNQHQELSKEIEDKLMQLSQRKLIEQNNLNHYENQLETYNNDLELNEQSIE
MEMSRLNLTDDNDINEIIAWRGEQEELEQKRDTYKKRYHEFEMEIARLESLTKDKELLDSDKLKDEYEQKKEKMNTLIDE
YSAVHYQCQNNINKTQSIVSHINYLNQELKDQQEIFQLAEIVSGKNNKNLTLENFVLIYYLDQIIAQANLRLATMSDNRY
QLIRREAVSHGLSGLEIDVFDLHSNKSRHISSLSGGETFQSSLALALGLSEIVQQQSGGISLESIFIDEGFGTLDQETLE
TALDTLLNLKSTGRMVGIISHVSELKNRIPLVLEVKSDQYQSSTRFKRN
>P0AG76 ~~~sbcD~~~Nuclease SbcCD subunit D~~~COG0420
MRILHTSDWHLGQNFYSKSREAEHQAFLDWLLETAQTHQVDAIIVAGDVFDTGSPPSYARTLYNRFVVNLQQTGCHLVVL
AGNHDSVATLNESRDIMAFLNTTVVASAGHAPQILPRRDGTPGAVLCPIPFLRPRDIITSQAGLNGIEKQQHLLAAITDY
YQQHYADACKLRGDQPLPIIATGHLTTVGASKSDAVRDIYIGTLDAFPAQNFPPADYIALGHIHRAQIIGGMEHVRYCGS
PIPLSFDECGKSKYVHLVTFSNGKLESVENLNVPVTQPMAVLKGDLASITAQLEQWRDVSQEPPVWLDIEITTDEYLHDI
QRKIQALTESLPVEVLLVRRSREQRERVLASQQRETLSELSVEEVFNRRLALEELDESQQQRLQHLFTTTLHTLAGEHEA
>Q2FYT4 ~~~sbcD~~~Nuclease SbcCD subunit D~~~COG0420
MKIIHTADWHLGKILNGKQLLEDQAYILDMFVEKMKEEEPDIIVIAGDLYDTTYPSKDAIMLLEQAIGKLNLELRIPIII
ISGNHDGKERLNYGASWFEHNQLFIRTDFTSINSPIEINGVNFYTLPYATVSEMKHYFEDDTIETHQQGITRCIETIAPE
IDEDAVNILISHLTVQGGKTSDSERPLTIGTVESVQKGVFDIFDYVMLGHLHHPFSIEDDKIKYSGSLLQYSFSEAGQAK
GYRRVTINDGIINDVFIPLKPLRQLEIISGEYNDVINEKVHVKNKDNYLHFKLKNMSHITDPMMSLKQIYPNTLALTNET
FNYNEENNAIEISEKDDMSIIEMFYKHITDKELSDIQSKKIKNILENELRKED
>A6QGP7 ~~~sbcD~~~Nuclease SbcCD subunit D~~~
MKIIHTADWHLGKILNGKQLLEDQAYILDMFVEKMKEEEPDIIVIAGDLYDTTYPSKDAIMLLEQAIGKLNLELRIPIII
ISGNHDGKERLNYGASWFEHNQLFIRTDFTSINSPIEINGVNFYTLPYATVSEMKHYFEDDTIETHQQGITRCIETIAPE
IDEDAVNILISHLTVQGGKTSDSERPLTIGTVESVQKGVFDIFDYVMLGHLHHPFSIEDDKIKYSGSLLQYSFSEAGQAK
GYRRVTINDGIINDVFIPLKPLRQLEIISGEYNDVINEKVHVKNKDNYLHFKLKNMSHITDPMMSLKQIYPNTLALTNET
FNYNEENNAIEISEKDDMSIIEMFYKHITDKELSDIQSKKIKNILENELRKED
>Q99UD1 ~~~sbcD~~~Nuclease SbcCD subunit D~~~
MKIIHTADWHLGKILNGKQLLEDQAYILDMFVEKMKEEEPDIIVIAGDLYDTTYPSKDAIMLLEQAIGKLNLELRIPIIM
ISGNHDGKERLNYGASWFEHNQLFIRTDFTSINSPIEINGVNFYTLPYATVSEMKHYFEDDTIETHQQGITRCIETIAPE
IDEDAVNILISHLTVQGGKTSDSERPLTIGTVESVQKGVFDIFDYVMLGHLHHPFSIEDDKIKYSGSLLQYSFSEAGQAK
GYRRLTINDGIINDVFIPLKPLRQLEIISGEYNDVINEKVHVKNKDNYLHFKLKNMSHITDPMMSLKQIYPNTLALTNET
FNYNEENNAIEISEKDDMSIIEMFYKHITDKELSDIQSKKIKNILENELRKED
>Q2FVK5 ~~~sbi~~~Immunoglobulin-binding protein Sbi~~~COG1388
MKNKYISKLLVGAATITLATMISNGEAKASENTQQTSTKHQTTQNNYVTDQQKAFYQVLHLKGITEEQRNQYIKTLREHP
ERAQEVFSESLKDSKNPDRRVAQQNAFYNVLKNDNLTEQEKNNYIAQIKENPDRSQQVWVESVQSSKAKERQNIENADKA
IKDFQDNKAPHDKSAAYEANSKLPKDLRDKNNRFVEKVSIEKAIVRHDERVKSANDAISKLNEKDSIENRRLAQREVNKA
PMDVKEHLQKQLDALVAQKDAEKKVAPKVEAPQIQSPQIEKPKVESPKVEVPQIQSPKVEVPQSKLLGYYQSLKDSFNYG
YKYLTDTYKSYKEKYDTAKYYYNTYYKYKGAIDQTVLTVLGSGSKSYIQPLKVDDKNGYLAKSYAQVRNYVTESINTGKV
LYTFYQNPTLVKTAIKAQETASSIKNTLSNLLSFWK
>A6QJQ7 ~~~sbi~~~Immunoglobulin-binding protein Sbi~~~
MKNKYISKLLVGAATITLATMISNGEAKASENTQQTSTKHQTTQNNYVTDQQKAFYQVLHLKGITEEQRNQYIKTLREHP
ERAQEVFSESLKDSKNPDRRVAQQNAFYNVLKNDNLTEQEKNNYIAQIKENPDRSQQVWVESVQSSKAKERQNIENADKA
IKDFQDNKAPHDKSAAYEANSKLPKDLRDKNNRFVEKVSIEKAIVRHDERVKSANDAISKLNEKDSIENRRLAQREVNKA
PMDVKEHLQKQLDALVAQKDAEKKVAPKVEAPQIQSPQIEKPKVESPKVEVPQIQSPKVEVPQSKLLGYYQSLKDSFNYG
YKYLTDTYKSYKEKYDTAKYYYNTYYKYKGAIDQTVLTVLGSGSKSYIQPLKVDDKNGYLAKSYAQVRNYVTESINTGKV
LYTFYQNPTLVKTAIKAQETASSIKNTLSNLLSFWK
>Q931F4 ~~~sbi~~~Immunoglobulin-binding protein Sbi~~~
MKNKYISKLLVGAATITLATMISNGEAKASENTQQTSTKHQTTQNNYVTDQQKAFYQVLHLKGITEEQRNQYIKTLREHP
ERAQEVFSESLKDSKNPDRRVAQQNAFYNVLKNDNLTEQEKNNYIAQIKENPDRSQQVWVESVQSSKAKERQNIENADKA
IKDFQDNKAPHDKSAAYEANSKLPKDLRDKNNRFVEKVSIEKAIVRHDERVKSANDAISKLNEKDSIENRRLAQREVNKA
PMDVKEHLQKQLDALVAQKDAEKKVAPKVEAPQIQSPQIEKPKAESPKVEVPQSKLLGYYQSLKDSFNYGYKYLTDTYKS
YKEKYDTAKYYYNTYYKYKGAIDQTVLTVLGSGSKSYIQPLKVDDKNGYLAKSYAQVRNYVTESINTGKVLYTFYQNPTL
VKTAIKAQETASSIKNTLSNLLSFWK
>Q99RL2 ~~~sbi~~~Immunoglobulin-binding protein Sbi~~~
MKNKYISKLLVGAATITLATMISNGEAKASENTQQTSTKHQTTQNNYVTDQQKAFYQVLHLKGITEEQRNQYIKTLREHP
ERAQEVFSESLKDSKNPDRRVAQQNAFYNVLKNDNLTEQEKNNYIAQIKENPDRSQQVWVESVQSSKAKERQNIENADKA
IKDFQDNKAPHDKSAAYEANSKLPKDLRDKNNRFVEKVSIEKAIVRHDERVKSANDAISKLNEKDSIENRRLAQREVNKA
PMDVKEHLQKQLDALVAQKDAEKKVAPKVEAPQIQSPQIEKPKAESPKVEVPQIQSPKVEVPQSKLLGYYQSLKDSFNYG
YKYLTDTYKSYKEKYDTAKYYYNTYYKYKGAIDQTVLTVLGSGSKSYIQPLKVDDKNGYLAKSYAQVRNYVTESINTGKV
LYTFYQNPTLVKTAIKAQETASSIKNTLSNLLSFWK
>P0AFY6 ~~~sbmA~~~Peptide antibiotic transporter SbmA~~~COG1133
MFKSFFPKPGTFFLSAFVWALIAVIFWQAGGGDWVARITGASGQIPISAARFWSLDFLIFYAYYIVCVGLFALFWFIYSP
HRWQYWSILGTALIIFVTWFLVEVGVAVNAWYAPFYDLIQTALSSPHKVTIEQFYREVGVFLGIALIAVVISVLNNFFVS
HYVFRWRTAMNEYYMANWQQLRHIEGAAQRVQEDTMRFASTLENMGVSFINAIMTLIAFLPVLVTLSAHVPELPIIGHIP
YGLVIAAIVWSLMGTGLLAVVGIKLPGLEFKNQRVEAAYRKELVYGEDDATRATPPTVRELFSAVRKNYFRLYFHYMYFN
IARILYLQVDNVFGLFLLFPSIVAGTITLGLMTQITNVFGQVRGAFQYLINSWTTLVELMSIYKRLRSFEHELDGDKIQE
VTHTLS
>P33012 ~~~sbmC~~~DNA gyrase inhibitor~~~COG3449
MNYEIKQEEKRTVAGFHLVGPWEQTVKKGFEQLMMWVDSKNIVPKEWVAVYYDNPDETPAEKLRCDTVVTVPGYFTLPEN
SEGVILTEITGGQYAVAVARVVGDDFAKPWYQFFNSLLQDSAYEMLPKPCFEVYLNNGAEDGYWDIEMYVAVQPKHH
>Q2G1N3 2.5.1.140~~~sbnA~~~N-(2-amino-2-carboxyethyl)-L-glutamate synthase~~~COG0031
MIEKSQACHDSLLDSVGQTPMVQLHQLFPKHEVFAKLEYMNPGGSMKDRPAKYIIEHGIKHGLITENTHLIESTSGNLGI
ALAMIAKIKGLKLTCVVDPKISPTNLKIIKSYGANVEMVEEPDAHGGYLMTRIAKVQELLATIDDAYWINQYANELNWQS
HYHGAGTEIVETIKQPIDYFVAPVSTTGSIMGMSRKIKEVHPNAQIVAVDAKGSVIFGDKPINRELPGIGASRVPEILNR
SEINQVIHVDDYQSALGCRKLIDYEGIFAGGSTGSIIAAIEQLITSIEEGATIVTILPDRGDRYLDLVYSDTWLEKMKSR
QGVKSE
>A6QDA0 2.5.1.140~~~sbnA~~~N-(2-amino-2-carboxyethyl)-L-glutamate synthase~~~
MIEKSQACHDSLLDSVGQTPMVQLHQLFPKHEVFAKLEYMNPGGSMKDRPAKYIIEHGIKHGLITENTHLIESTSGNLGI
ALAMIAKIKGLKLTCVVDPKISPTNLKIIKSYGANVEMVEEPDAHGGYLMTRIAKVQELLATIDDAYWINQYANELNWQS
HYHGAGTEIVETIKQPIDYFVAPVSTTGSIMGMSRKIKEVHPNAQIVAVDAKGSVIFGDKPINRELPGIGASRVPEILNR
SEINQVIHVDDYQSALGCRKLIDYEGIFAGGSTGSIIAAIEQLITSIEEGATIVTILPDRGDRYLDLVYSDTWLEKMKSR
QGVKSE
>Q2G1N2 1.5.1.51~~~sbnB~~~N-((2S)-2-amino-2-carboxyethyl)-L-glutamate dehydrogenase~~~COG2423
MNREMLYLNRSDIEQAGGNHSQVYVDALTEALTAHAHNDFVQPLKPYLRQDPENGHIADRIIAMPSHIGGEHAISGIKWI
GSKHDNPSKRNMERASGVIILNDPETNYPIAVMEASLISSMRTAAVSVIAAKHLAKKGFKDLTIIGCGLIGDKQLQSMLE
QFDHIERVFVYDQFSEACARFVDRWQQQRPEINFIATENAKEAVSNGEVVITCTVTDQPYIEYDWLQKGAFISNISIMDV
HKEVFIKADKVVVDDWSQCNREKKTINQLVLEGKFSKEALHAELGQLVTGDIPGREDDDEIILLNPMGMAIEDISSAYFI
YQQAQQQNIGTTLNLY
>Q2G1N1 6.3.2.56~~~sbnC~~~Staphyloferrin B synthase~~~COG4264
MQNHTAVNTAQAIILRDLVDALLFEDIAGIVSNSEITKENGQTLLIYERETQQIKIPVYFSALNMFRYESSQPITIEGRV
SKQPLTAAEFWQTIANMNCDLSHEWEVARVEEGLTTAATQLAKQLSELDLASHPFVMSEQFASLKDRPFHPLAKEKRGLR
EADYQVYQAELNQSFPLMVAAVKKTHMIHGDTANIDELENLTVPIKEQATDMLNDQGLSIDDYVLFPVHPWQYQHILPNV
FAKEISEKLVVLLPLKFGDYLSSSSMRSLIDIGAPYNHVKVPFAMQSLGALRLTPTRYMKNGEQAEQLLRQLIEKDEALA
KYVMVCDETAWWSYMGQDNDIFKDQLGHLTVQLRKYPEVLAKNDTQQLVSMAALAANDRTLYQMICGKDNISKNDVMTLF
EDIAQVFLKVTLSFMQYGALPELHGQNILLSFEDGRVQKCVLRDHDTVRIYKPWLTAHQLSLPKYVVREDTPNTLINEDL
ETFFAYFQTLAVSVNLYAIIDAIQDLFGVSEHELMSLLKQILKNEVATISWVTTDQLAVRHILFDKQTWPFKQILLPLLY
QRDSGGGSMPSGLTTVPNPMVTYD
>Q2G1N0 ~~~sbnD~~~Staphyloferrin B transporter~~~COG2814
MINQSIWRSNFRILWLSQFIAIAGLTVLVPLLPIYMASLQNLSVVEIQLWSGIAIAAPAVTTMIASPIWGKLGDKISRKW
MVLRALLGLAVCLFLMALCTTPLQFVLVRLLQGLFGGVVDASSAFASAEAPAEDRGKVLGRLQSSVSAGSLVGPLIGGVT
ASILGFSALLMSIAVITFIVCIFGALKLIETTHMPKSQTPNINKGIRRSFQCLLCTQQTCRFIIVGVLANFAMYGMLTAL
SPLASSVNHTAIDDRSVIGFLQSAFWTASILSAPLWGRFNDKSYVKSVYIFATIACGCSAILQGLATNIEFLMAARILQG
LTYSALIQSVMFVVVNACHQQLKGTFVGTTNSMLVVGQIIGSLSGAAITSYTTPATTFIVMGVVFAVSSLFLICSTITNQ
INDHTLMKLWELKQKSAK
>Q2G1M9 6.3.2.54~~~sbnE~~~L-2,3-diaminopropanoate--citrate ligase~~~COG4264
MQNKELIQHAAYAAIERILNEYFREENLYQVPPQNHQWSIQLSELETLTGEFRYWSAMGHHMYHPEVWLIDGKSKKITTY
KEAIARILQHMAQSADNQTAVQQHMAQIMSDIDNSIHRTARYLQSNTIDYVEDRYIVSEQSLYLGHPFHPTPKSASGFSE
ADLEKYAPECHTSFQLHYLAVHQDVLLTRYVEGKEDQVEKVLYQLADIDISEIPKDFILLPTHPYQINVLRQHPQYMQYS
EQGLIKDLGVSGDSVYPTSSVRTVFSKALNIYLKLPIHVKITNFIRTNDLEQIERTIDAAQVIASVKDEVETPHFKLMFE
EGYRALLPNPLGQTVEPEMDLLTNSAMIVREGIPNYHADKDIHVLASLFETMPDSPMSKLSQVIEQSGLAPEAWLECYLN
RTLLPILKLFSNTGISLEAHVQNTLIELKDGIPDVCFVRDLEGICLSRTIATEKQLVPNVVAASSPVVYAHDEAWHRLKY
YVVVNHLGHLVSTIGKATRNEVVLWQLVAHRLMTWKKEYANNAVFVDCVEDLYQTPTIAAKANLMSKLNDCGANPIYTHI
PNPICHNKEVSYCESNNS
>Q2G1M8 6.3.2.55~~~sbnF~~~2-[(L-alanin-3-ylcarbamoyl)methyl]-3-(2-aminoethylcarbamoyl)-2-hydroxypropanoate synthase~~~COG4264
MIVVQTLFIHIYQIQFVITRRYRIVNQTILNRVKTRVMHQLVSSLIYENIVVYKASYQDGVGHFTIEGHDSEYRFTAEKT
HSFDRIRITSPIERVVGDEADTTTDYTQLLREVVFTFPKNDEKLEQFIVELLQTELKDTQSMQYRESNPPATPETFNDYE
FYAMEGHQYHPSYKSRLGFTLSDNLKFGPDFVPNVKLQWLAIDKDKVETTVSRNVVVNEMLRQQVGDKTYEHFVQQIEAS
GKHVNDVEMIPVHPWQFEHVIQVDLAEERLNGTVLWLGESDELYHPQQSIRTMSPIDTTKYYLKVPISITNTSTKRVLAP
HTIENAAQITDWLKQIQQQDMYLKDELKTVFLGEVLGQSYLNTQLSPYKQTQVYGALGVIWRENIYHMLIDEEDAIPFNA
LYASDKDGVPFIENWIKQYGSEAWTKQFLAVAIRPMIHMLYYHGIAFESHAQNMMLIHENGWPTRIALKDFHDGVRFKRE
HLSEAASHLTLKPMPEAHKKVNSNSFIETDDERLVRDFLHDAFFFINIAEIILFIEKQYGIDEELQWQWVKGIIEAYQEA
FPELNNYQHFDLFEPTIQVEKLTTRRLLSDSELRIHHVTNPLGVGGINDATTISET
>Q2G1M6 4.1.1.117~~~sbnH~~~2-[(L-alanin-3-ylcarbamoyl)methyl]-2-hydroxybutanedioate decarboxylase~~~COG0019
MRIVQPVIEQLKAQSHPVCHYIYDLVGLEHHLQHITSSLPSNCQMYYAMKANSERKILDTISQYVEGFEVASQGEIAKGL
AFKPANHIIFGGPGKTDEELRYAVSEGVQRIHVESMHELQRLNAILEDEDKTQHILLRVNLAGPFPNATLHMAGRPTQFG
ISEDEVDDVIEAALAMPKIHLDGFHFHSISNNLDSNLHVDVVKLYFKKAKAWSEKHRFPLKHINLGGGIGVNYADLTNQF
EWDNFVERFKTLIVEQEMEDVTLNFECGRFIVAHIGYYVTEVLDIKKVHGAWYAILRGGTQQFRLPVSWQHNHPFDIYRY
KDNPYSFEKVSISRQDTTLVGQLCTPKDVFAREVQIDAISTGDVIVFKYAGAYGWSISHHDFLSHPHPEFIYLTQTKEDE
>Q2G1M5 2.7.1.225~~~sbnI~~~L-serine kinase SbnI~~~COG1475
MNHIHEHLKLVPVDKIDLHETFEPLRLEKTKSSIEADDFIRHPILVTAMQHGRYMVIDGVHRYTSLKALGCKKVPVQEIH
ETQYSISTWQHKVPFGVWWETLQQEHRLPWTTETRQEAPFITMCHGDTEQYLYTKDLGEAHFQVWEKVVASYSGCCSVER
IAQGTYPCLSQQDVLMKYQPLSYKEIEAVVHKGETVPAGVTRFNISGRCLNLQVPLALLKQDDDVEQLRNWKQFLADKFA
NMRCYTEKVYLVEQ
>O07623 ~~~sboA~~~Subtilosin-A~~~
MKKAVIVENKGCATCSIGAACLVDGPIPDFEIAGATGLFGLWG
>C0HLK6 ~~~~~~Subtilosin-A~~~
NKGCATCSIGIACLVDGPIPDFECAGATGLGLWG
>Q7WY57 ~~~sboX~~~Bacteriocin-like protein SboX~~~
MKLPVQQVYSVYGGKDLPKGHSHSTMPFLSKLQFLTKIYLLDIHTQPFFI
>A0A0H2UMY0 ~~~~~~Carbohydrate ABC transporter substrate-binding protein~~~COG1653
MKNWKKYAFASASVVALAAGLAACGNLTGNSKKAADSGDKPVIKMYQIGDKPDNLDELLANANKIIEEKVGAKLDIQYLG
WGDYGKKMSVITSSGENYDIAFADNYIVNAQKGAYADLTELYKKEGKDLYKALDPAYIKGNTVNGKIYAVPVAANVASSQ
NFAFNGTLLAKYGIDISGVTSYETLEPVLKQIKEKAPDVVPFAIGKVFIPSDNFDYPVANGLPFVIDLEGDTTKVVNRYE
VPRFKEHLKTLHKFYEAGYIPKDVATSDTSFDLQQDTWFVREETVGPADYGNSLLSRVANKDIQIKPITNFIKKNQTTQV
ANFVISNNSKNKEKSMEILNLLNTNPELLNGLVYGPEGKNWEKIEGKENRVRVLDGYKGNTHMGGWNTGNNWILYINENV
TDQQIENSKKELAEAKESPALGFIFNTDNVKSEISAIANTMQQFDTAINTGTVDPDKAIPELMEKLKSEGAYEKVLNEMQ
KQYDEFLKNKK
>P54083 ~~~sbpA~~~Multiple sugar-binding periplasmic protein SbpA~~~
MSSSFTTTLAGMAVGMLVLATGTNPTLAQDKPTVGIAMPTKSSARWIDDGNNMVKQFQAKGYKTDLQYAEDDIPNQLAQI
ETMVAKNSKVLVIAAIDGTTLTDVLQQAKDRGVKVIAYDRLIRGSENVDYYATFDNFQVGVLQGSYIVDALGLKDGKGPF
NIELFGGSPDDNNAYFFYNGAMSVLQPYIDSGKLTVGSGQVGMDKVSTLRWDGATAQARMDNLLSAFYGNRRVDAVLSPY
DGISIGIISSLKGVGYGSPSQPMPVVTGQDAEVPSIKSILAGEQRATVFKDTRELARITVEMVDAVLGGGTAVNDTKTYD
NGKKVVPAYLLKPVSVDASNWKGTLVDSGYYTEAQFK
>Q92JF7 ~~~sca2~~~Putative surface cell antigen sca2~~~
MNLQNSHSKKYVLTFFMSTCLLTSSFLSTSARAASFKDLVSKTPAWEKHNSTQQQNIWKDLTPNEKIKKWQEAALVPSFT
QAQNDLGIKYKETDLSSFLDNTRHKARQARAEILLYIERVKQQDFDTKKQAYINQGVVPTDIEAATNLGISYDPSKIDNN
VEHDQKVRRAEKDKKAVIELYVSSINRGIKYKHYVDNDIIPEIQEVRTALNMNKDDAQSFVASIRTEIMENAKGQYIADS
HIPTEKELKKKFGISRDDNRDGYIKSIRLKVMDKEKPQYIADSHIPTEKELEQKFGADKGEATNYIASIATQMMLDKKSY
YIDNNIIPNADELMNEFKIGPVKATSYINQIRAGIEANQFLNNNDTTKPSTGRSQKKSGSKNDHWYMSNQSINNTGTSAR
IVTGREKKQRYFFDPISTFKTYFNTKASKGNLTQSQHNINRIIQQEENIEEFKNLIKTDPIAALTLQVDSSYKQEAVTTI
LSDFNDDTIQRVLFSNDKGKLDFNTNIDVKNRPILQELLENSSSEEKTKFAERIQDYATRNISNSQFEEKARLDLIKLAA
SKDKSSVENFLTLQLELKNRMQPYVVNSVYILTPEIVKEINIELKNKGLIRDSLTKDYMIKLAKEVNNHTLNSVIKVILS
DSKILSNETNKILGLAVSNNANNLEQTQSGIPNPPPLPLNGGIPNPPPLPLNGSMPPPPLHSQGFSSNSKHFDLNQLQTE
YPHIHSLYVQFTHNTTVQSKAPLQPTASSATSTGRSTPETAYAKLYAEYRTETGGTKANDLQDQLIKRQADLTNVIRQIL
TESYANQGADEKTLLNLFSISTPEIAEKAKEAFNTLAQDQYIKDITVNGKKTITSEEIIKNLFNEDTDDAIKRILLSSCK
ISEELKRPIKLEFNKSELIRELQGKQNPFKQLEFAYINTKNFDQDIFGNRIDELINNPNILTIVQQATFLTKEDTNLRKT
INSDQAQAKLDDLRTAILSTIKIEELITANLPQHDFIAIVKEKDPELLKEFLKATTLTVTGNNNLDQLRLALPSFTGMSN
EQIRILSNKLKMSIILKALKECSQEKATQYIHTGNMPPPPPPPPPLPDSQDLELAYLKSLGITKANTSTFKTTPKTYHFS
SDIALRYKEFTLSGQKSAGYKAKYSDADLLKKAIVESVAFEHSKNLSKAHQNNKYFEQIQKAVNTMYSSFIGHRTELEQK
IHNIYTSKLLELTKDKEFIKYVEDNIILNKKLTKAFTSADSDFIDSRTELEQKIHNIYIQQLTKYPEEEVKEAFNTASLD
FIGPRTEIGQEVHNIYKSQLLELTKDTELCLFTQQVLAEATELEQKYGSDIQSENSNNEKKVERLDQEKLQLFKQENEAT
NDESSTKDDTQPEDSNKKSEQSDSKTALSPRLLSSNDSKNDKSSDDKKSLLALRSSDEDDTGYATDEEELEESNSTTDEE
LKKDVVLESEDEAIDVSFKTEAITEQDEVTQRQQVSDDTSGKVAILVQATSTLHKPVHYNINDRLTVAAIGAGDEETSIN
RGVWISGLYGINKQRIWKNIPKYQNRTTGITIGTDAEFINSHDVIGIAYSRLESQIKYNKKLGKTTVNGHLLSIYSLKEL
IKGFSLQTITSYGHNYIKNRSKNINNIIGKYQNNSLSFQTLLNYKYRTKYDLHFIPNIGFQYDYSRASNYKEYNVDIENL
MIQKKSNQLFESSLGGKIVFKPIVTTNNIVLTPSLYGNIEHHFNNKNTKVNAKATFKGQTLQETIITLKQPKLGYNIGSN
ILMSRKNINVLLEYNYYTHRKYQSHQGLIKLKVNL
>B3EY95 2.8.3.18~~~~~~Succinyl-CoA:acetate CoA-transferase~~~
MTERIRNVALRSKVCPAETASELIKHGDVVGTSGFTGAGYPKEVPKALAQRMEAAHDRGEKYQISLITGASTGPQLDGEL
AKANGVYFRSPFNTDATMRNRINAGETEYFDNHLGQVAGRAVQGNYGKFNIALVEATAITEDGGIVPTSSVGNSQTFLNL
AEKVIIEVNEWQNPMLEGIHDIWDGNVSGVPTRDIVPIVRADQRVGGPVLRVNPDKIAAIVRTNDRDRNAPFAAPDETAK
AIAGYLLDFFGHEVKQNRLPPSLLPLQSGVGNVANAVLEGLKEGPFENLVGYSEVIQDGMLAMLDSGRMRIASASSFSLS
PEAAEEINNRMDFFRSKIILRQQDVSNSPGIIRRLGCIAMNGMIEADIYGNVNSTRVMGSKMMNGIGGSGDFARSSYLSI
FLSPSTAKGGKISAIVPMAAHVDHIMQDAQIFVTEQGLADLRGLSPVQRAREIISKCAHPDYRPMLQDYFDRALKNSFGK
HTPHLLTEALSWHQRFIDTGTMLPS
>Q9L1E4 2.4.2.-~~~~~~Guanine-specific ADP-ribosyl transferase~~~COG1396
MITTSLRRRTAAAVLSLSAVLATTAATAPGAAPAPSAAPAKAAPACPQFDDRTKAAADRGVDVDRITPEPVWRTTCGTLY
RSDSRGPQVVFEEGFHAKDVQNGQYDVEKYVLVNQPSPYVSTSYDHDLYKTWYKSGYNYYVDAPGGIDVNKTIGDTHKWA
DQVEVAFPGGIQRKYIIGVCPVDRQTKTEIMSDCESNPHYQPWH
>Q7AKF0 2.3.1.277~~~scbA~~~2-oxo-3-(phosphooxy)propyl 3-oxoalkanoate synthase~~~
MPEAVVLINSASDANSIEQTALPVPMALVHRTRVQDAFPVSWIPKGGDRFSVTAVLPHDHPFFAPVHGDRHDPLLIAETL
RQAAMLVFHAGYGVPVGYHFLMATLDYTCHLDHLGVSGEVAELEVEVACSQLKFRGGQPVQGQVDWAVRRAGRLAATGTA
TTRFTSPQVYRRMRGDFATPTASVPGTAPVPAARAGRTRDEDVVLSASSQQDTWRLRVDTSHPTLFQRPNDHVPGMLLLE
AARQAACLVTGPAPFVPSIGGTRFVRYAEFDSPCWIQATVRPGPAAGLTTVRVTGHQDGSLVFLTTLSGPAFSG
>P72360 ~~~scdA~~~Iron-sulfur cluster repair protein ScdA~~~COG2846
MINKNDIVADVVTDYPKAADIFRSVGIDFCCGGQVSIEAAALEKKNVDLNELLQRLNDVEQTNTPGSLNPKFLNVSSLIQ
YIQSAYHEPLREEFKNLTPYVTKLSKVHGPNHPYLVELKETYDTFKNGMLEHMQKEDDVDFPKLIKYEQGEVVDDINTVI
DDLVSDHIATGELLVKMSELTSSYEPPIEACGTWRLVYQRLKALEVLTHEHVHLENHVLFKKVS
>Q5HJB7 ~~~scdA~~~Iron-sulfur cluster repair protein ScdA~~~
MINKNDIVADVVTDYPKAADIFRSVGIDFCCGGQVSIEAAALEKKNVDLNELLQRLNDVEQTNTPGSLNPKFLNVSSLIQ
YIQSAYHEPLREEFKNLTPYVTKLSKVHGPNHPYLVELKETYDTFKNGMLEHMQKEDDVDFPKLIKYEQGEVVDDINTVI
DDLVSDHIATGELLVKMSELTSSYEPPIEACGTWRLVYQRLKALEVLTHEHVHLENHVLFKKVS
>Q7A7U6 ~~~scdA~~~Iron-sulfur cluster repair protein ScdA~~~
MINKNDIVADIVIDYPKAADIFRSVGIDFCCGGQVSIEAASLEKKNVDLNELLQRLNDVEQTNTPGSLNPKFLNVSSLIQ
YIQAAYHEPLREEFKNLTPYVTKLSKVHGPNHPYLVELKETYDTFKSGMLEHMQKEDDVDFPKLIKYEQGEVVNDINTVI
DDLVSDHIATGQLLVKMSDLTSSYEPPIEACGTWRLVYQRLKALEVLTHEHVHLENHVLFKKVS
>Q2FWF8 3.2.-.-~~~sceD~~~Probable transglycosylase SceD~~~COG1388
MKKTLLASSLAVGLGIVAGNAGHEAHASEADLNKASLAQMAQSNDQTLNQKPIEAGAYNYTFDYEGFTYHFESDGTHFAW
NYHATGTNGADMSAQAPATNNVAPSAVQANQVQSQEVEAPQNAQTQQPQASTSNNSQVTATPTESKSSEGSSVNVNAHLK
QIAQRESGGNIHAVNPTSGAAGKYQFLQSTWDSVAPAKYKGVSPANAPESVQDAAAVKLYNTGGAGHWVTA
>Q5HEA4 3.2.-.-~~~sceD~~~Probable transglycosylase SceD~~~
MKKTLLASSLAVGLGIVAGNAGHEAHASEADLNKASLAQMAQSNDQTLNQKPIEAGAYNYTFDYEGFTYHFESDGTHFAW
NYHATGTNGADMSAQAPTTNNVAPSAVQANQVQSQEVEAPQNAQTQQPQASTSNNSQVTATPTESKSSEGSSVNVNAHLK
QIAQRESGGNIHAVNPTSGAAGKYQFLQSTWDSVAPAKYKGVSPANAPESVQDAAAVKLYNTGGAGHWVTA
>Q99SG2 3.2.-.-~~~sceD~~~Probable transglycosylase SceD~~~
MKKTLLASSLAVGLGIVAGNAGHEAHASEADLNKASLAQMAQSNDQTLNQKPIEAGAYNYTFDYEGFTYHFESDGTHFAW
NYHATGANGANMSAQAPATNNVEPSAVQANQVQSQEVEAPQNAQTQQPQASTSNNSQVTATPTESKASEGSSVNVNAHLK
QIAQRESGGNIHAVNPTSGAAGKYQFLQSTWDSVAPAKYKGVSPANAPESVQDAAAVKLYNTGGAGHWVTA
>Q8YQ15 ~~~schE~~~Schizokinen exporter SchE~~~COG2814
MLPKLILLATLYISQFIPTTFFIQALPVFMRQQKMSLDVIGFLGLLILPSGLKFLWSPFIDRYRLGKLGHYRGWIICFQL
LLISTMLVTAFIDIQDNLNAFLTCMFLASLFSSSQDIATDALAVNLLEPQERGLGNAIQSGGNIFGAIIGGGVMLILLDK
IGWRYSLITLSIFMLINLVPILIYREKSQHQLENSTFFRSYFQPFISFLSRPKALPWLFVVLLYMMGDSVTSLMIRPLLV
DRGLSLPDIGWILGIVSYSARIVSALIAGLVIVKLGRIKSLIIFGFIADLTTLLYIIPAIGVSSLLVLYTVCIIVNATQS
MAYTALLSAMMDKCEKNTAATDYTMQVSVMFLGGIAATVLSGMLATTMGYSFIFIMSAAVSLLSVFLITQEYGVSS
>Q8YZR0 ~~~schT~~~Schizokinen transporter SchT~~~COG4771
MDCVTSHNPVATFRCEVKMKPGKILFLLLLTGSVWSLISHPGKTQEAPSPTQLNTQSPAPNAQELTQVTGVRVVPTVQGL
EVILDSTAAEKLQVSTQNQGNSLIADITNAQLNLSEGNTFSQNNPATGVTNVTVVNHNDNTIRVTVTGEKSLPKFELFDS
DTGLILAFTATEVAQDSPAEVDEPIELVVTATRTETPIQNVPRSITVIDREQIAAQASTSRNLIEILGKTVPGLAPPAQG
ASNFGLTLRGRNPQVLIDGVPQSTTRNASRDLRTIDAAAIERIEVVRGPSAIYGDGATGGVINIITRRPTEEKLTSRTEV
GVSAALGNLEGDSFSTNLQHFISAKQGNFDFTFNFAVAKNGGFFDAQGDRIPSDPNAQGGFADASSINLFGKFGIDIDAN
QRLQLTFNRFDEKQDTDIASDPRVNTIPGRQKARALEGLSLDERPGNENTFINLQYTHDDLFNSKLQAQLYYRDYLTRFF
PFDGRSFASLGNEIFQSRVESEKYGGRLQIETPLFNQGAAKLLWGVDYSQEDTSQPVSVFDQAAFVASGGLAFRKTGDRS
WTPPLELRSLGLFAQLNWEISDRFVFNGGVRYENADVSVNDFRTLANPNVTIGGGDLNFNATLFNVGAVYALNPQLSVFA
NYAQGFSLSDIGLALRNAPPGFSVESLNPEPQKVDNYEIGIRGQWDTVQASLSAFYNESDLGTTFTAPGTVIRAPERIYG
LEAAIDAQPSSTWQVGGTFTLIGGEIDSNNDGDYESLDGFRIPPLKLTAYVENETLPGWRNRLQALYSGNREVFGNNNTA
FGRRPVESYLTVDYISSIKLGAGTLQLGLENLFNSQYFPVVSQLQANDSAYAAARGRTLSIKYSFDW
>Q2FWV6 ~~~scn~~~Staphylococcal complement inhibitor~~~
MKIRKSILAGTLAIVLASPLVTNLDKNEAQASTSLPTSNEYQNEKLANELKSLLDELNVNELATGSLNTYYKRTIKISGL
KAMYALKSKDFKKMSEAKYQLQKIYNEIDEALKSKY
>Q931M7 ~~~scn~~~Staphylococcal complement inhibitor~~~
MKIRKSILAGTLAIVLASPLVTNLDKNEAQASTSLPTSNEYQNEKLANELKSLLDELNVNELATGSLNTYYKRTIKISGQ
KAMYALKSKDFKKMSEAKYQLQKIYNEIDEALKSKY
>Q6GFB4 ~~~scn~~~Staphylococcal complement inhibitor~~~
MKIRKSILAGTLAIVLASPLVTNLDKNEAQASTSLPTSNEYQNEKLANELKSLLDELNVNELATGSLNTYYKRTIKISGQ
KAMYALKSKDFKKMSEAKYQLQKIYNEIDEALKSKY
>P54950 1.14.13.-~~~scmK~~~N-acetyl-S-(2-succino)cysteine monooxygenase~~~COG2141
MTSKKKQIKLGVFLAGTGHHVASWRHPDAPSDASMNLDYFKELAKTAERGKLDMLFLADSLSIDSKSHPNVLTRFEPFTL
LSALAQVTSKIGLTATASTTYSEPFHIARQFASLDHLSNGRAGWNVVTSSIESTALNFSGEKHLEHHLRYQRAEEFVEIV
KGLWDSWEEDAFIRNKETGEFFDKEKMHELNHKGEYFSVRGPLNVSRTPQGQPVIIQAGSSGDGKALAAKTAEVIFTAQN
HLESAQEFYQSIKEQAAEFGRDPEKIAIMPGIFPIIADTEEAAQAKYKELQDLIIPSVGLQILQNYLGGIDLSAYPLDGP
LPKLDAEASNAVKSRFKLVQEMAERDNMTIRELYKYVAGSRGHHIFVGTPEQLADKMQEWVDTKACDGFNIMPPLLPEGI
EVFVDQVVPILQERGVFRKEYEGTTLREHFGLEKPVNRYAK
>P54951 2.3.1.-~~~scmL~~~S-(2-succino)cysteine N-acetyltransferase~~~COG0456
MKPRYRLAVERDAEQLLELTLRAYEPIRKLGIRFAAAHADLDLVLKNIRENACYVMEEDGRIIATITLRMPWGKQPGPYG
VPHIWWFAVDPDTGKKGIGTKLLQWLEETILRDTLKVPFVSLGTADKHPWLIEMYERKGYVRSGEQDLGKGHITVYMKKQ
LRHDL
>P54955 3.5.1.-~~~scmP~~~N-acetylcysteine deacetylase~~~COG1473
MADKAFHTRLINMRRDLHEHPELSFQEVETTKKIRRWLEEEQIEILDVPQLKTGVIAEIKGREDGPVIAIRADIDALPIQ
EQTNLPFASKVDGTMHACGHDFHTASIIGTAMLLNQRRAELKGTVRFIFQPAEEIAAGARKVLEAGVLNGVSAIFGMHNK
PDLPVGTIGVKEGPLMASVDRFEIVIKGKGGHAGIPNNSIDPIAAAGQIISGLQSVVSRNISSLQNAVVSITRVQAGTSW
NVIPDQAEMEGTVRTFQKEARQAVPEHMRRVAEGIAAGYGAQAEFKWFPYLPSVQNDGTFLNAASEAAARLGYQTVHAEQ
SPGGEDFALYQEKIPGFFVWMGTNGTEEWHHPAFTLDEEALTVASQYFAELAVIVLETIK
>A0QU81 5.4.99.5~~~~~~Secreted chorismate mutase~~~COG1605
MLASVALAALAGVGTPHATADDASPLVPLVDAAAQRLQTADPVAASKFRSGGAIDDPDREQQVIAAVTGDATRHNIDPGY
VHDVFRNQIDATSSVEHTRFAQWKLDPAAAPSSAPDLSESRQKIDTLNRTMVDEIARQWPVLHSPVCRPDLDRALDAVAT
ARGFDPVYRHALEYATHSYCR
>P9WIB9 5.4.99.5~~~~~~Secreted chorismate mutase~~~COG1605
MLTRPREIYLATAVSIGILLSLIAPLGPPLARADGTSQLAELVDAAAERLEVADPVAAFKWRAQLPIEDSGRVEQQLAKL
GEDARSQHIDPDYVTRVFDDQIRATEAIEYSRFSDWKLNPASAPPEPPDLSASRSAIDSLNNRMLSQIWSHWSLLSAPSC
AAQLDRAKRDIVRSRHLDSLYQRALTTATQSYCQALPPA
>Q7CHH5 5.4.99.5~~~pheA2~~~Secreted chorismate mutase~~~COG1605
MQPTHTLTRLTVIGKLIIASSFFLSLAVQAQQCGQTAPLINERLSYMKDVAGYKAENHLPIEDRIQEEKVINSAMAQAES
LGLNGESIKPLMVAQINAAKAIQYRYRADWLSQPEPGWQPKPLDDVRANIGELSTKILEQIAEELKTCKPAEMGDKAHFI
NTIRQHNLTSADVEAIFSTFNQVKLK
>O66187 3.5.5.8~~~scnA~~~Thiocyanate hydrolase subunit alpha~~~
MSDSHHKPVWDRTHHAKMATGIGDPQCFKGMAGKSKFNVGDRVRIKDLPDLFYTRTMTYTRGATGTIVRLVYESPAAEDE
AFGNEENVEWFYSIVFAQKDLWPEYSDTFANDTLETEIPERYLEKA
>O66186 3.5.5.8~~~scnB~~~Thiocyanate hydrolase subunit beta~~~
MSSSIREEVHRHLGTVALMQPALHQQTHAPAPTEITHTLFRAYTRVPHDVGGEADVPIEYHEKEEEIWELNTFATCECLA
WRGVWTAEERRRKQNCDVGQTVYLGMPYYGRWLLTAARILVDKQFVTLTELHNKIVEMRERVASGQGLGEYLPPKAK
>O66188 3.5.5.8~~~scnC~~~Thiocyanate hydrolase subunit gamma~~~
MSADHDHDHDHDHDHKPAPMVEEVSDFEILEMAVRELAIEKGLFSAEDHRVWKDYVHTLGPLPAARLVAKAWLDPEYKKL
CIEDGVEASKAVGVNWVTSPPTQFGTPSDYCNLRVLADSPTLKHVVVCTLCSCYPRPILGQSPEWYRSPNYRRRLVRWPR
QVLAEFGLQLPSEVQIRVADSNQKTRYIVMPVRPEGTDGWTEDQLAEIVTRDCLIGVAVPKPGITVNAKRPVLKANRPVH
HDH
>P54178 ~~~ypmQ~~~SCO1 protein homolog~~~COG1999
MKVIKGLTAGLIFLFLCACGGQQIKDPLNYEVEPFTFQNQDGKNVSLESLKGEVWLADFIFTNCETICPPMTAHMTDLQK
KLKAENIDVRIISFSVDPENDKPKQLKKFAANYPLSFDNWDFLTGYSQSEIEEFALKSFKAIVKKPEGEDQVIHQSSFYL
VGPDGKVLKDYNGVENTPYDDIISDVKSASTLK
>P42315 2.8.3.5~~~scoA~~~Probable succinyl-CoA:3-ketoacid coenzyme A transferase subunit A~~~COG1788
MGKVLSSSKEAAKLIHDGDTLIAGGFGLCGIPEQLILSIRDQGVKDLTVVSNNCGVDDWGLGLLLANKQIKKMIASYVGE
NKIFERQFLSGELEVELVPQGTLAERIRAGGAGIPGFYTATGVGTSIAEGKEHKTFGGRTYVLERGITGDVAIVKAWKAD
TMGNLIFRKTARNFNPIAAMAGKITIAEAEEIVEAGELDPDHIHTPGIYVQHVVLGASQEKRIEKRTVQQASGKGEAK
>P56006 2.8.3.5~~~scoA~~~Succinyl-CoA:3-ketoacid coenzyme A transferase subunit A~~~COG1788
MNKVITDLDKALSALKDGDTILVGGFGLCGIPEYAIDYIYKKGIKDLIVVSNNCGVDDFGLGILLEKKQIKKIIASYVGE
NKIFESQMLNGEIEVVLTPQGTLAENLHAGGAGIPAYYTPTGVGTLIAQGKESREFNGKEYILERAITGDYGLIKAYKSD
TLGNLVFRKTARNFNPLCAMAAKICVAEVEEIVPAGELDPDEIHLPGIYVQHIYKGEKFEKRIEKITTRSTK
>P9WPW5 2.8.3.5~~~scoA~~~Probable succinyl-CoA:3-ketoacid coenzyme A transferase subunit A~~~COG1788
MDKVVATAAEAVADIANGSSLAVGGFGLCGIPEALIAALVDSGVTDLETVSNNCGIDGVGLGLLLQHKRIRRTVSSYVGE
NKEFARQFLAGELEVELTPQGTLAERLRAGGMGIPAFYTPAGVGTQVADGGLPWRYDASGGVAVVSPAKETREFDGVTYV
LERGIRTDFALVHAWQGDRHGNLMYRHAAANFNPECASAGRITIAEVEHLVEPGEIDPATVHTPGVFVHRVVHVPNPAKK
IERETVRQ
>P42316 2.8.3.5~~~scoB~~~Probable succinyl-CoA:3-ketoacid coenzyme A transferase subunit B~~~COG2057
MKEARKRMVKRAVQEIKDGMNVNLGIGMPTLVANEIPDGVHVMLQSENGLLGIGPYPLEGTEDADLINAGKETITEVTGA
SYFDSAESFAMIRGGHIDLAILGGMEVSEQGDLANWMIPGKMVKGMGGAMDLVNGAKRIVVIMEHVNKHGESKVKKTCSL
PLTGQKVVHRLITDLAVFDFVNGRMTLTELQDGVTIEEVYEKTEADFAVSQSVLNS
>P56007 2.8.3.5~~~scoB~~~Succinyl-CoA:3-ketoacid coenzyme A transferase subunit B~~~COG2057
MREAIIKRAAKELKEGMYVNLGIGLPTLVANEVSGMNIVFQSENGLLGIGAYPLEGSVDADLINAGKETITVVPGASFFN
SADSFAMIRGGHIDLAILGGMEVSQNGDLANWMIPKKLIKGMGGAMDLVHGAKKVIVIMEHCNKYGESKVKKECSLPLTG
KGVVHQLITDLAVFEFSNNAMKLVELQEGVSLDQVKEKTEAEFEVRL
>P9WPW3 2.8.3.5~~~scoB~~~Probable succinyl-CoA:3-ketoacid coenzyme A transferase subunit B~~~COG2057
MSAPGWSRDEMAARVAAEFEDGQYVNLGIGMPTLIPNHIPDGVHVVLHSENGILGVGPYPRREDVDADLINAGKETVTTL
PGAAFFSSSTSFGIIRGGHLDVAVLGAMQVSVTGDLANWMIPGKMVKGMGGAMDLVHGARKVIVMMEHTAKDGSPKILER
CTLPLTGVGCVDRIVTELAVIDVCADGLHLVQTAPGVSVDEVVAKTQPPLVLRDLATQ
>P68575 ~~~yorD~~~SPbeta prophage-derived stress response protein SCP1~~~
MEFKVGQDVSEIWNIHGSILPEVLMYMFPRSDESYDWEFVNDNGRHIFTAWRKSEPIPTLEEIEKAAIELEEKKNAPKPK
TLEERVADLEKQVAYLTSKVEGTN
>P81100 ~~~yceC~~~Stress response protein SCP2~~~COG2310
MAISLEKGQRIDLTKGKAGLSKLMVGLGWDPVSSGGGFFSKLLGGGGPNIDCDASVLMLENGKFTDKKNLIYFGNLKSRC
GGVQHTGDNLTGDGAGDDEQIMIDLDKVPGNIDKLVFVVNIYDCVRRKQDFGMIQNAFIRVVDQSNHEEMLKYNLRDNYA
GRTSLITAEIYRSGSEWKFAAVGEGTNDTRLEDIISRYV
>P35154 ~~~scpA~~~Segregation and condensation protein A~~~COG1354
MEEYQVKIDTFEGPLDLLLHLINRLEIDIYDIPVAKITEQYLLYVHTMRVLELDIASEYLVMAATLLSIKSRMLLPKQEE
ELFEDELLEEEDPREELIEKLIEYRKYKDAAKDLKEREEERQKSFTKPPSDLSEYAKEVKQSEQKLSVTVYDMIGAFQKV
LKRKKINRPMETTITRQDIPIEARMNEIVHSLKSRGTRINFMDLFPYEQKEHLVVTFLAVLELMKNQLVLIEQEHNFSDI
YITGSESIHGA
>P27253 5.4.99.2~~~scpA~~~Methylmalonyl-CoA mutase~~~COG1884
MSNVQEWQQLANKELSRREKTVDSLVHQTAEGIAIKPLYTEADLDNLEVTGTLPGLPPYVRGPRATMYTAQPWTIRQYAG
FSTAKESNAFYRRNLAAGQKGLSVAFDLATHRGYDSDNPRVAGDVGKAGVAIDTVEDMKVLFDQIPLDKMSVSMTMNGAV
LPVLAFYIVAAEEQGVTPDKLTGTIQNDILKEYLCRNTYIYPPKPSMRIIADIIAWCSGNMPRFNTISISGYHMGEAGAN
CVQQVAFTLADGIEYIKAAISAGLKIDDFAPRLSFFFGIGMDLFMNVAMLRAARYLWSEAVSGFGAQDPKSLALRTHCQT
SGWSLTEQDPYNNVIRTTIEALAATLGGTQSLHTNAFDEALGLPTDFSARIARNTQIIIQEESELCRTVDPLAGSYYIES
LTDQIVKQARAIIQQIDEAGGMAKAIEAGLPKRMIEEASAREQSLIDQGKRVIVGVNKYKLDHEDETDVLEIDNVMVRNE
QIASLERIRATRDDAAVTAALNALTHAAQHNENLLAAAVNAARVRATLGEISDALEVAFDRYLVPSQCVTGVIAQSYHQS
EKSASEFDAIVAQTEQFLADNGRRPRILIAKMGQDGHDRGAKVIASAYSDLGFDVDLSPMFSTPEEIARLAVENDVHVVG
ASSLAAGHKTLIPELVEALKKWGREDICVVAGGVIPPQDYAFLQERGVAAIYGPGTPMLDSVRDVLNLISQHHD
>Q97NX5 ~~~scpA~~~Segregation and condensation protein A~~~COG1354
MDIKLKDFEGPLDLLLHLVSKYQMDIYDVPITEVIEQYLAYVSTLQAMRLEVTGEYMVMASQLMLIKSRKLLPKVAEVTD
LGDDLEQDLLSQIEEYRKFKLLGEHLEAKHQERAQYYSKAPTELIYEDAELVHDKTTIDLFLTFSNILAKKKEEFAQNHT
TILRDEYKIEDMMIIVKESLIGRDQLRLQDLFKEAQNVQEVITLFLATLELIKTQELILVQEESFGDIYLMEKKEESQVP
QS
>Q9EUQ7 ~~~scpA~~~Segregation and condensation protein A~~~COG1354
MDIKLKDFEGPLDLLLHLVSKYQMDIYDVPITEVIEQYLAYVSTLQAMRLEVTGEYMVMASQLMLIKSRKLLPKVAEVTD
LGDDLEQDLLSQIEEYRKFKLLGEHLEAKHQERAQYYSKAPTELIYEDAELVHDKTTIDLFLAFSNILAKKKEEFAQNHT
TILRDEYKIEDMMIIVKESLIGRDQLRLQDLFKEAQNVQEVITLFLATLELIKTQELILVQEESFGDIYLMEKKEESQVP
QS
>C1CMI6 ~~~scpA~~~Segregation and condensation protein A~~~
MDIKLKDFEGPLDLLLHLVSKYQMDIYDVPITEVIEQYLAYVSTLQAMRLEVTGEYMVMASQLMLIKSRKLLPKVAEVTD
LGDDLEQDLLSQIEEYRKFKLLGEHLEAKHQERAQYYSKAPTELIYEDAELVHDKTTIDLFLAFSNILAKKKEEFAQNHT
TILRDEYKIEDMMIIVKESLIGRDQLRLQDLFKEAQNVQEVITLFLATLELIKTQELILVQEESFGDIYLMEKKEESQVP
QS
>Q83CP9 ~~~scpB~~~Segregation and condensation protein B homolog~~~COG1386
MSEVNPKIVIEAALFASAEPLTPERLQQLFDEQNPISLSEIKNLLSELKEDYRERGVDLQEVASGYRFQARPDFSPWLQR
LWEKKPARYSRALLETLALIVYRQPISRGEIEEVRGVAVSSDIIKKLLDREWISVVAHRDVPGKPALFGTTKTFLDYFNL
KSLEELPPLEDVVDLEKIEAQFGEQLALVVEKKAAEGTEPSALPLAEED
>P35155 ~~~scpB~~~Segregation and condensation protein B~~~COG1386
MGLDIVNWKAIVEALLYAAGDEGLTKKQLLTVLEIEEPELNTIMADVADEYRGDTRGIELIEYADTYMLSTKKDFAPYLK
KLIEVPSKGLSQASLEVLAIVSYKQPITRAEIEEIRGVKSERILHSLVAKALLCEVGRADGPGRAILYGTTPTFLEQFGL
KTLDELPPLPENAEEDVLQEEADLFFENFNQTFEDIK
>P52045 4.1.1.-~~~scpB~~~Methylmalonyl-CoA decarboxylase~~~COG1024
MSYQYVNVVTINKVAVIEFNYGRKLNALSKVFIDDLMQALSDLNRPEIRCIILRAPSGSKVFSAGHDIHELPSGGRDPLS
YDDPLRQITRMIQKFPKPIISMVEGSVWGGAFEMIMSSDLIIAASTSTFSMTPVNLGVPYNLVGIHNLTRDAGFHIVKEL
IFTASPITAQRALAVGILNHVVEVEELEDFTLQMAHHISEKAPLAIAVIKEELRVLGEAHTMNSDEFERIQGMRRAVYDS
EDYQEGMNAFLEKRKPNFVGH
>Q97NX6 ~~~scpB~~~Segregation and condensation protein B~~~COG1386
MSTLAKIEALLFVAGEDGIRVRQLAELLSLPPTGIQQSLGKLAQKYEKDPDSSLALIETSGAYRLVTKPQFAEILKEYSK
APINQSLSRAALETLSIIAYKQPITRIEIDAIRGVNSSGALAKLQAFDLIKEDGKKEVLGRPNLYVTTDYFLDYMGINHL
EELPVIDELEIQAQESQLFGERIEEDENQ
>Q8DNI9 ~~~scpB~~~Segregation and condensation protein B~~~COG1386
MSTLAKIEALLFVAGEDGIRVRQLAELLSLPPTGIQQSLGKLAQKYEKDPDSSLALIETSGAYRLVTKPQFAEILKEYSK
APINQSLSRAALETLSIIAYKQPITRIEIDAIRGVNSSGALAKLQAFDLIKEDGKKEVLGRPNLYVTTDYFLDYMGINHL
EELPVIDELEIQAQESQLFGERIEEDENQ
>C1CMI5 ~~~scpB~~~Segregation and condensation protein B~~~
MSTLAKIEALLFVAGEDGIRVRQLAELLSLPPTGIQQSLGKLAQKYEKDPDSSLALIETSGAYRLVTKPQFAEILKEYSK
APINQSLSRAALETLSIIAYKQPITRIEIDAIRGVNSSGALAKLQAFDLIKEDGKKEVLGRPNLYVTTDYFLDYMGINHL
EELPVIDELEIQAQESQLFGERIEEDENQ
>P52043 2.8.3.-~~~scpC~~~Propionyl-CoA:succinate CoA transferase~~~COG0427
METQWTRMTANEAAEIIQHNDMVAFSGFTPAGSPKALPTAIARRANEQHEAKKPYQIRLLTGASISAAADDVLSDADAVS
WRAPYQTSSGLRKKINQGAVSFVDLHLSEVAQMVNYGFFGDIDVAVIEASALAPDGRVWLTSGIGNAPTWLLRAKKVIIE
LNHYHDPRVAELADIVIPGAPPRRNSVSIFHAMDRVGTRYVQIDPKKIVAVVETNLPDAGNMLDKQNPMCQQIADNVVTF
LLQEMAHGRIPPEFLPLQSGVGNINNAVMARLGENPVIPPFMMYSEVLQESVVHLLETGKISGASASSLTISADSLRKIY
DNMDYFASRIVLRPQEISNNPEIIRRLGVIALNVGLEFDIYGHANSTHVAGVDLMNGIGGSGDFERNAYLSIFMAPSIAK
EGKISTVVPMCSHVDHSEHSVKVIITEQGIADLRGLSPLQRARTIIDNCAHPMYRDYLHRYLENAPGGHIHHDLSHVFDL
HRNLIATGSMLG
>P27217 3.2.1.26~~~scrB~~~Sucrose-6-phosphate hydrolase~~~
MSLPSRLPAILQAVMQGQPQALADSHYPQWHLAPVNGLLNDPNGFCQVAGRYHLFYQWNPLACDHTYKCWGHWSSADLLH
WRHEPIALMPDEEYDRNGCYSGSAVEFEGALTLCYTGNVKFPDGGRTAWQCLATENADGTFRKLGPVLPLPEGYTGHVRD
PKVWRQDGRWYMVLGAQDVQQRGKVLLFTASDLREWRLVGEIAGHDVNGLANAGYMWECPDLFPLADTHLLICCPQGLAR
EAQRFLNTYPAVWMAGRFDAERGIFDHGPLHELDSGFEFYAPQTMQADDGRRLLVGWMGVPDGDEMHQPTRAQGWIHQMT
CVRELEWQAGTLYQRPLRELVALRGEAQGWCGQTLPLAPMELAFDLSPDSTLGLDFAGALQLTVNRDGLRLSRRGLQTAE
MHHRYWRGEARRLRIFIDRSSVEIFINDGEGVMSSRFFPGYPGQLIFSGATPVAFCRWLLRPCMVE
>Q04937 3.2.1.26~~~scrB~~~Sucrose-6-phosphate hydrolase~~~
MKWSTKQRYRTYDSYSESDLESLRKLALKSPWKSNFHIEPETGLLNDPNGFSYFNEKWHLFYQHFPFGPVHGLKSWVHLV
SDDLVHFEKTGLVLYPDTKYDNAGVYSGSALAFENFLFLIYTGNHRGEDWVRTPYQLGAKIDKNNQLVKFTEPLIYPDFS
QTTDHFRDPQIFSFQGQIYCLIGAQSSQKNGIIKLYKAIENNLTDWKDLGNLDFSKEKMGYMIECPNLIFINGRSVLVFC
PQGLDKSIVKYDNIYPNVYVIADDFTTGSKNQLKNAGQLINLDEGFDCYATQSFNAPDGSAYAISWLGLPETSYPTDKYN
VQGVLSMVKKLSIKDNKLYQYPVEKMKELRQMEQDLLLADNNIITSNSYELEVDFRQQTSTLLSLMTNEKGDSALKVEID
KENNTITLIRNYEKRLAHVKIEKMNVFIDQSIFEIFINDGEKVLSDCRVFPNKNQYSIRSQNPIKIKLWELKK
>P13394 3.2.1.26~~~scrB~~~Sucrose-6-phosphate hydrolase~~~COG1621
MSLNNRWTVEQRYRRLEQIPQCDIEEMTLSRQQDKGFPSFHIAPKFGLLNDPNGLCYFNGEHHIFYQWTPVGPVHGMKYW
YHLSTKDFIHFTDHGVGLHPDQDYDSHGVYSGGALVENNQVLLFFTGNKRDQNWNRIPTQCFATMDSDGSIEKHGVVIEN
EHYTEHFRDPKVWKKGDDYLMVVGAQTKTEHGSMALYQSKDLKTWQHKGPIKTKFSDLGYMWECPDFFEINGQSVMLFSP
QGVSSSNPYDFKNIYSVAYIVGDQLNLESMTLENHQDILQPDYGFDFYAPQTYLDESGRRILIAWIGLPEIDTPSVTHQW
AGMLSLPRELTLKDGFLVQTPLPELKSLRKEEVVFAQSHTLESTSCLIQLDLVGDGFELELSNLKGDNIVFSATEHEFML
DRRYMSHLYAEEFGGIRKAPRLDAKQTIDIYIDNSVIEIFINGGKHTMTSRFFIDDLNKVTLKGLEQARLFPLKGITGLF
ESAK
>O05510 2.7.1.4~~~gmuE~~~Putative fructokinase~~~COG1940
MLGGIEAGGTKFVCAVGREDGTIIDRIEFPTKMPDETIEKVIQYFSQFSLQAIGIGSFGPVDNDKTSQTYGTITATPKAG
WRHYPFLQTVKNEMKIPVGFSTDVNAAALGEFLFGEAKGLDSCLYITIGTGIGAGAIVEGRLLQGLSHPEMGHIYIRRHP
DDVYQGKCPYHGDCFEGLASGPAIEARWGKKAADLSDIAQVWELEGYYIAQALAQYILILAPKKIILGGGVMQQKQVFSY
IYQYVPKIMNSYLDFSELSDDISDYIVPPRLGSNAGIIGTLVLAHQALQAEAASGEVRS
>P26420 2.7.1.4~~~scrK~~~Fructokinase~~~
MNGKIWVLGDAVVDLLPDGEGRLLQCPGGAPANVAVGVARLGGDSGFIGRVGDDPFGRFMRHTLAQEQVDVNYMRLDAAQ
RTSTVVVDLDSHGERTFTFMVRPSADLFLQPEDLPPFAAGQWLHVCSIALSAEPSRSTTFAALEAIKRAGGYVSFDPNIR
SDLWQDPQDLRDCLDRALALADAIKLSEEELAFISGSDDIVSGIARLNARFQPTLLLVTQGKAGVQAALRGQVSHFPARP
VVAVDTTGAGDAFVAGLLAGLAAHGIPDNLAALAPDLALAQTCGALATTAKGAMTALPYKDDLQRSL
>P22340 ~~~scrY~~~Sucrose porin~~~
MYRKSTLAMLIALLTSAASAHAQTDISTIEARLNALEKRLQEAENRAQTAENRAGAAEKKVQQLTAQQQKNQNSTQEVAQ
RTARLEKKADDKSGFEFHGYARSGVIMNDSGASTKSGAYITPAGETGGAIGRLGNQADTYVEMNLEHKQTLDNGATTRFK
VMVADGQTSYNDWTASTSDLNVRQAFVELGNLPTFAGPFKGSTLWAGKRFDRDNFDIHWIDSDVVFLAGTGGGIYDVKWN
DGLRSNFSLYGRNFGDIDDSSNSVQNYILTMNHFAGPLQMMVSGLRAKDNDERKDSNGNLAKGDAANTGVHALLGLHNDS
FYGLRDGSSKTALLYGHGLGAEVKGIGSDGALRPGADTWRIASYGTTPLSENWSVAPAMLAQRSKDRYADGDSYQWATFN
LRLIQAINQNFALAYEGSYQYMDLKPEGYNDRQAVNGSFYKLTFAPTFKVGSIGDFFSRPEIRFYTSWMDWSKKLNNYAS
DDALGSDGFNSGGEWSFGVQMETWF
>P0DJA7 3.2.1.26~~~sacA~~~Sucrose-6-phosphate hydrolase~~~COG1621
MESPSYKNLIKAEDAQKKAGKRLLSSEWYPGFHVTPLTGWMNDPNGLIFFKGEYHLFYQYYPFAPVWGPMHWGHAKSRDL
VHWETLPVALAPGDSFDRDGCFSGCAVDNNGVLTLIYTGHIVLSNDSLDAIREVQCMATSIDGIHFQKEGIVLEKAPMPQ
VAHFRDPRVWKENNHWFMVVGYRTDDEKHQGIGHVALYRSENLKDWIFVKTLLGDNSQLPLGKRAFMWECPDFFSLGNRS
VLMFSPQGLKASGYKNRNLFQNGYILGKWQAPQFTPETSFQELDYGHDFYAAQRFEAKDGRQILIAWFDMWENQKPSQRD
GWAGCMTLPRKLDLIDNKIVMTPVREMEILRQSEKIESVVTLSDAEHPFTMDSPLQEIELIFDLEKSSAYQAGLALRCNG
KGQETLLYIDRSQNRIILDRNRSGQNVKGIRSCPLPNTSKVRLHIFLDRSSIEIFVGDDQTQGLYSISSRIFPDKDSLKG
RLFAIEGYAVFDSFKRWTLQDANLAAFSSDAC
>E1WAC8 ~~~sctB1~~~SPI-1 type 3 secretion system translocon protein SctB~~~
MLISNVGINPAAYLNNHSVENSSQTASQSVSAKDILNSIGISSSKVSDLGLSPTLSAPAPGVLTQTPGTITSFLKASIQN
TDMNQDLNALANNVTTKANEVVQTQLREQQAEVGKFFDISGMSSSAVALLAAANTLMLTLNQADSKLSGKLSLVSFDAAK
TTASSMMREGMNALSGSISQSALQLGITGVGAKLEYKGLQNERGALKHNAAKIDKLTTESHSIKNVLNGQNSVKLGAEGV
DSLKSLNMKKTGTDATKNLNDATLKSNAGTSATESLGIKDSNKQISPEHQAILSKRLESVESDIRLEQNTMDMTRIDARK
MQMTGDLIMKNSVTVGGIAGASGQYAATQERSEQQISQVNNRVASTASDEARESSRKSTSLIQEMLKTMESINQSKASAL
AAIAGNIRA
>P0CL47 ~~~sctB1~~~SPI-1 type 3 secretion system translocon protein SctB~~~
MLISNVGINPAAYLNNHSVENSSQTASQSVSAKDILNSIGISSSKVSDLGLSPTLSAPAPGVLTQTPGTITSFLKASIQN
TDMNQDLNALANNVTTKANEVVQTQLREQQAEVGKFFDISGMSSSAVALLAAANTLMLTLNQADSKLSGKLSLVSFDAAK
TTASSMMREGMNALSGSISQSALQLGITGVGAKLEYKGLQNERGALKHNAAKIDKLTTESHSIKNVLNGQNSVKLGAEGV
DSLKSLNMKKTGTDATKNLNDATLKSNAGTSATESLGIKDSNKQISPEHQAILSKRLESVESDIRLEQNTMDMTRIDARK
MQMTGDLIMKNSVTVGGIAGASGQYAATQERSEQQISQVNNRVASTASDEARESSRKSTSLIQEMLKTMESINQSKASAL
AAIAGNIRA
>Q9R803 ~~~sctB2~~~SPI-2 type 3 secretion system translocon protein SctB~~~
MEASNVALVLPAPSLLTPSSTPSPSGEGMGTESMLLLFDDIWMKLMELAKKLRDIMRSYNVEKQRLAWELQVNVLQTQMK
TIDEAFRASMITAGGAMLSGVLTIGLGAVGGETGLIAGQAVGHTAGGVMGLGAGVAQRQSDQDKAIADLQQNGAQSYNKS
LTEIMEKATEIMQQIIGVGSSLVTVLAEILRALTR
>Q05129 ~~~sctB~~~Type 3 secretion system translocon protein SctB~~~
MNTIDNNNAAIAVNSVLSSTTDSTSSTTTSTSSISSSLLTDGRVDISKLLLEVQKLLREMVTTLQDYLQKQLAQSYDIQK
AVFESQNKAIDEKKAGATAALIGGAISSVLGILGSFAAINSATKGASDVAQQAASTSAKSIGTVSEASTKALAKASEGIA
DAADDAAGAMQQTIATAAKAASRTSGITDDVATSAQKASQVAEEAADAAQELAQKAGLLSRFTAAAGRISGSTPFIVVTS
LAEGTKTLPTTISESVKSNHDINEQRAKSVENLQASNLDTYKQDVRRAQDDISSRLRDMTTTARDLTDLINRMGQAARLA
G
>P18012 ~~~sctB~~~Type 3 secretion system translocon protein SctB~~~
MEIQNTKPTQTLYTDISTKQTQSSSETQKSQNYQQIAAHIPLNVGKNPVLTTTLNDDQLLKLSEQVQHDSEIIARLTDKK
MKDLSEMSHTLTPENTLDISSLSSNAVSLIISVAVLLSALRTAETKLGSQLSLIAFDATKSAAENIVRQGLAALSSSITG
AVTQVGITGIGAKKTHSGISDQKGALRKNLATAQSLEKELAGSKLGLNKQIDTNITSPQTNSSTKFLGKNKLAPDNISLS
TEHKTSLSSPDISLQDKIDTQRRTYELNTLSAQQKQNIGRATMETSAVAGNISTSGGRYASALEEEEQLISQASSKQAEE
ASQVSKEASQATNQLIQKLLNIIDSINQSKNSAASQIAGNIRA
>P37132 ~~~sctB~~~Type 3 secretion system translocon protein SctB~~~
MTINIKTDSPIITTGSQLDAITTETVGQSGEVKKTEDTRHEAQAIKSSEASLSRSQVPELIKPSQGINVALLSKSQGDLN
GTLSILLLLLELARKAREMGLQQRDIENKATISAQKEQVAEMVSGAKLMIAMAVVSGIMAATSTVASAFSIAKEVKIVKQ
EQILNSNIAGRDQLIDTKMQQMSNAGDKAVSREDIGRIWKPEQVADQNKLALLDKEFRMTDSKANAFNAATQPLGQMANS
AIQVHQGYSQAEVKEKEVNASIAANEKQKAEEAMNYNDNFMKDVLRLIEQYVSSHTHAMKAAFGVV
>Q06131 ~~~sctB~~~Type 3 secretion system translocon protein SctB~~~
MTINIKTDSPIITTGSQLDAITTETVKQSGEIKKTEDTRHEAQAIKSSEASLSRSQVPELIKPSQGINVALLSKSQGDLN
GTLSILLLLLELARKAREMGLQQRDIENKATITAQKEQVAEMVSGAKLMIAMAVVSGIMAATSTVASAFSIAKEVKIVKQ
EQILNSNIAGREQLIDTKMQQMSNIGDKAVSREDIGRIWKPEQVADQNKLALLDKEFRMTDSKANAFNAATQPLGQMANS
AIQVHQGYSQAEVKEKEVNASIAANEKQKAEEAMNYNDNFMKDVLRLIEQYVSSHTHAMKAAFGVV
>P35672 ~~~sctC1~~~SPI-1 type 3 secretion system secretin~~~
MKTHILLARVLACAALVLVTPGYSSEKIPVTGSGFVAKDDSLRTFFDAMALQLKEPVIVSKMAARKKITGNFEFHDPNAL
LEKLSLQLGLIWYFDGQAIYIYDASEMRNAVVSLRNVSLNEFNNFLKRSGLYNKNYPLRGDNRKGTFYVSGPPVYVDMVV
NAATMMDKQNDGIELGRQKIGVMRLNNTFVGDRTYNLRDQKMVIPGIATAIERLLQGEEQPLGNIVSSEPPAMPAFSANG
EKGKAANYAGGMSLQEALKQNAAAGNIKIVAYPDTNSLLVKGTAEQVHFIEMLVKALDVAKRHVELSLWIVDLNKSDLER
LGTSWSGSITIGDKLGVSLNQSSISTLDGSRFIAAVNALEEKKQATVVSRPVLLTQENVPAIFDNNRTFYTKLIGERNVA
LEHVTYGTMIRVLPRFSADGQIEMSLDIEDGNDKTPQSDTTTSVDALPEVGRTLISTIARVPHGKSLLVGGYTRDANTDT
VQSIPFLGKLPLIGSLFRYSSKNKSNVVRVFMIEPKEIVDPLTPDASESVNNILKQSGAWSGDDKLQKWVRVYLDRGQEA
IK
>D0ZWR9 ~~~sctC2~~~SPI-2 type 3 secretion system secretin~~~
MVVNKRLILILLFILNTAKSDELSWKGNDFTLYARQMPLAEVLHLLSENYDTAITISPLITATFSGKIPPGPPVDILNNL
AAQYDLLTWFDGSMLYVYPASLLKHQVITFNILSTGRFIHYLRSQNILSSPGCEVKEITGTKAVEVSGVPSCLTRISQLA
SVLDNALIKRKDSAVSVSIYTLKYATAMDTQYQYRDQSVVVPGVVSVLREMSKTSVPTSSTNNGSPATQALPMFAADPRQ
NAVIVRDYAANMAGYRKLITELDQRQQMIEISVKIIDVNAGDINQLGIDWGTAVSLGGKKIAFNTGLNDGGASGFSTVIS
DTSNFMVRLNALEKSSQAYVLSQPSVVTLNNIQAVLDKNITFYTKLQGEKVAKLESITTGSLLRVTPRLLNDNGTQKIML
NLNIQDGQQSDTQSETDPLPEVQNSEIASQATLLAGQSLLLGGFKQGKQIHSQNKIPLLGDIPVVGHLFRNDTTQVHSVI
RLFLIKASVVNNGISHG
>Q04641 ~~~sctC~~~Type 3 secretion system secretin~~~
MKKFNIKSLTLLIVLLPLIVNANNIDSHLLEQNDIAKYVAQSDTVGSFFERFSALLNYPIVVSKQAAKKRISGEFDLSNP
EEMLEKLTLLVGLIWYKDGNALYIYDSGELISKVILLENISLNYLIQYLKDANLYDHRYPIRGNISDKTFYISGPPALVE
LVANTATLLDKQVSSIGTDKVNFGVIKLKNTFVSDRTYNMRGEDIVIPGVATVVERLLNNGKALSNRQAQNDPMPPFNIT
QKVSEDSNDFSFSSVTNSSILEDVSLIAYPETNSILVKGNDQQIQIIRDIITQLDVAKRHIELSLWIIDIDKSELNNLGV
NWQGTASFGDSFGASFNMSSSASISTLDGNKFIASVMALNQKKKANVVSRPVILTQENIPAIFDNNRTFYVSLVGERNSS
LEHVTYGTLINVIPRFSSRGQIEMSLTIEDGTGNSQSNYNYNNENTSVLPEVGRTKISTIARVPQGKSLLIGGYTHETNS
NEIISIPFLSSIPVIGNVFKYKTSNISNIVRVFLIQPREIKESSYYNTAEYKSLISEREIQKTTQIIPSETTLLEDEKSL
VSYLNY
>Q01244 ~~~sctC~~~Type 3 secretion system secretin~~~
MAFPLHSFFKRVLTGTLLLLSSYSWAQELDWLPIPYVYVAKGESLRDLLTDFGANYDATVVVSDKINDKVSGQFEHDNPQ
DFLQHIASLYNLVWYYDGNVLYIFKNSEVASRLIRLQESEAAELKQALQRSGIWEPRFGWRPDASNRLVYVSGPPRYLEL
VEQTAAALEQQTQIRSEKTGALAIEIFPLKYASASDRTIHYRDDEVAAPGVATILQRVLSDATIQQVTVDNQRIPQAATR
ASAQARVEADPSLNAIIVRDSPERMPMYQRLIHALDKPSARIEVALSIVDINADQLTELGVDWRVGIRTGNNHQVVIKTT
GDQSNIASNGALGSLVDARGLDYLLARVNLLENEGSAQVVSRPTLLTQENAQAVIDHSETYYVKVTGKEVAELKGITYGT
MLRMTPRVLTQGDKSEISLNLHIEDGNQKPNSSGIEGIPTISRTVVDTVARVGHGQSLIIGGIYRDELSVALSKVPLLGD
IPYIGALFRRKSELTRRTVRLFIIEPRIIDEGIAHHLALGNGQDLRTGILTVDEISNQSTTLNKLLGGSQCQPLNKAQEV
QKWLSQNNKSSYLTQCKMDKSLGWRVVEGACTPAQSWCVSAPKRGVL
>Q56974 ~~~sctC~~~Type 3 secretion system secretin~~~COG1450
MAFPLHSFFKRVLTGTLLLLSNYSWAQELDWLPIPYVYVAKGESLRDLLIDFSANYDATVVVSDKINDKVSGQFEHDNPQ
DFLQHIASLYNLVWYYDGNVLYIFKNSEVASRLIRLQESEAAELKLALQRSGIWEPRFGWRPDASNRLVYVSGPPRYLEL
VEQTAAALEQQTQIRSEKTGALAIEIFPLKYASASDRTIHYRDDEVAAPGVATILQRVLSDATIQQVTVDNQRIPQAATR
ASAQAKVEADPSLNAIIVRDSPERMPMYQRLIHALDKPSARIEVALSIVDINADQLTELGVDWRVGIRTGNNHQVVIKTT
GDQSNIASNGALGSLIDARGLDYLLARVNLLENEGSAQVVSRPTLLTQENAQAVIDHHETYYVKVTGKEVAELKGITYGT
MLRMTPRVLTQGDKSEISLNLHIEDGNQKPNSSGIDGIPTISRTVVDTVARVGHGQSLIIGGIYRDELSVALSKVPLLGD
IPYLGALFRRKSELTRRTVRLFIIEPRIIDEGIAHHLALGNGRDLRTGILAVDEISNQSTTLNKLLGGFQCQPLNKAQEV
QKWLSQNNKSSYLTQCKMDKSLGWRVVEGACTPAESWCVSAPKRGVL
>Q56134 ~~~sctE1~~~SPI-1 type 3 secretion system translocon protein SctE~~~
MVNDASSISRSGYTQNPRLAEAAFEGVRKNTDFLKAADKAFKDVVATKAGDLKAGTKSGESAINTVGLKPPTDAAREKLS
SEGQLTLLLGKLMTLLGDVSLSQLESRLAVWQAMIESQKEMGIQVSKEFQTALGEAQEATDLYEASIKKTDTAKSVYDAA
AKKLTQAQNKLQSLDPADPGYAQAEAAVEQAGKEATEAKEALDKATDATVKAGTDAKAKAEKADNILTKFQGTANAASQN
QVSQGEQDNLSNVARLTMLMAMFIEIVGKNTEESLQNDLALFNALQEGRQAEMEKKSAEFQEETRKAEETNRIMGCIGKV
LGALLTIVSVVAAVFTGGASLALAAVGLAVMVADEIVKAATGVSFIQQALNPIMEHVLKPLMELIGKAITKALEGLGVDK
KTAEMAGSIVGAIVAAIAMVAVIVVVAVVGKGAAAKLGNALSKMMGETIKKLVPNVLKQLAQNGSKLFTQGMQRITSGLG
NVGSKMGLQTNALSKELVGNTLNKVALGMEVTNTAAQSAGGVAEGVFIKNASEALADFMLARFAMDQIQQWLKQSVEIFG
ENQKVTAELQKAMSSAVQQNADASRFILRQSRA
>Q56019 ~~~sctE1~~~SPI-1 type 3 secretion system translocon protein SctE~~~
MVNDASSISRSGYTQNPRLAEAAFEGVRKNTDFLKAADKAFKDVVATKAGDLKAGTKSGESAINTVGLKPPTDAAREKLS
SEGQLTLLLGKLMTLLGDVSLSQLESRLAVWQAMIESQKEMGIQVSKEFQTALGEAQEATDLYEASIKKTDTAKSVYDAA
TKKLTQAQNKLQSLDPADPGYAQAEAAVEQAGKEATEAKEALDKATDATVKAGTDAKAKAEKADNILTKFQGTANAASQN
QVSQGEQDNLSNVARLTMLMAMFIEIVGKNTEESLQNDLALFNALQEGRQAEMEKKSAEFQEETRKAEETNRIMGCIGKV
LGALLTIVSVVAAVFTGGASLALAAVGLAVMVADEIVKAATGVSFIQQALNPIMEHVLKPLMELIGKAITKALEGLGVDK
KTAEMAGSIVGAIVAAIAMVAVIVVVAVVGKGAAAKLGNALSKMMGETIKKLVPNVLKQLAQNGSKLFTQGMQRITSGLG
NVGSKMGLQTNALSKELVGNTLNKVALGMEVTNTAAQSAGGVAEGVFIKNASEALADFMLARFAMDQIQQWLKQSVEIFG
ENQKVTAELQKAMSSAVQQNADASRFILRQSRA
>O84947 ~~~sctE2~~~SPI-2 type 3 secretion system translocon protein SctE~~~
MNRIHSNSDSAAGVTALTHHHLSNVSCVSSGSLGKRQHRVNSTFGDGNAACLLSGKISLQEASNALKQLLDAVPGNHKRP
SLPDFLQTNPAVLSMMMTSLILNVFGNNAQSLCQQLERATEVQNALRNKQVKEYQEQIQKAIEQEDKARKAGIFGAIFDW
ITGIFETVIGALKVVEGFLSGNPAEMASGVAYMAAGCAGMVKAGAETAMMCGADHDTCQAIIDVTSKIQFGCEAVALALD
VFQIGRAFMATRGLSGAAAKVLDSGFGEEVVERMVGAGEAEIEELAEKFGEEVSESFSKQFEPLEREMAMANEMAEEAAE
FSRNVENNMTRSAGKSFTKEGVKAMAKEAAKEALEKCVQEGGKFLLKKFRNKVLFNMFKKILYALLRDCSFKGLQAIRCA
TEGASQMNTGMVNTEKAKIEKKIEQLITQQRFLDFIMQQTENQKKIEQKRLEELYKGSGAALRDVLDTIDHYSSVQARIA
GYRA
>P18011 ~~~sctE~~~Type 3 secretion system translocon protein SctE~~~
MHNVSTTTTGFPLAKILTSTELGDNTIQAANDAANKLFSLTIADLTANQNINTTNAHSTSNILIPELKAPKSLNASSQLT
LLIGNLIQILGEKSLTALTNKITAWKSQQQARQQKNLEFSDKINTLLSETEGLTRDYEKQINKLKNADSKIKDLENKINQ
IQTRLSELDPESPEKKKLSREEIQLTIKKDAAVKDRTLIEQKTLSIHSKLTDKSMQLEKEIDSFSAFSNTASAEQLSTQQ
KSLTGLASVTQLMATFIQLVGKNNEESLKNDLALFQSLQESRKTEMERKSDEYAAEVRKAEELNRVMGCVGKILGALLTI
VSVVAAAFSGGASLALAAVGLALMVTDAIVQAATGNSFMEQALNPIMKAVIEPLIKLLSDAFTKMLEGLGVDSKKAKMIG
SILGAIAGALVLVAAVVLVATVGKQAAAKLAENIGKIIGKTLTDLIPKFLKNFSSQLDDLITNAVARLNKFLGAAGDEVI
SKQIISTHLNQAVLLGESVNSATQAGGSVASAVFQNSASTNLADLTLSKYQVEQLSKYISEAIEKFGQLQEVIADLLASM
SNSQANRTDVAKAILQQTTA
>P37131 ~~~sctE~~~Type 3 secretion system translocon protein SctE~~~
MSALITHDRSTPVTGSLVPYIETPAPAPLQTQQVAGELKDKNGGVSSQGVQLPAPLAVVASQVTEGQQQEITKLLESVTR
GTAGSQLISNYVSVLTNFTLASPDTFEIELGKLVSNLEEVRKDIKIADIQRLHEQNMKKIEENQEKIKETEENAKQVKKS
GMASKIFGWLIAIASVVIGAIMVASGVGAVAGAMMIASGVIGMANMAVKQAAEDGLISQEAMQVLGPILTAIEVALTVVS
TVMTFGGSALKCLADIGAKLGANTASLAAKGAEFSAKVAQISTGISNTVGSAVTKLGGSFGSLTMSHVIRTGSQATQVAV
GVGSGITQTINNKKQADLQHNNADLALNKADMAALQSIIDRLKEELSHLSESHRQVMELIFQMINAKGDMLHNLAGRPHT
V
>Q06114 ~~~sctE~~~Type 3 secretion system translocon protein SctE~~~
MSALITHDRSTPVTGSLLPYVETPAPAPLQTQQVAGELKDKNGGVSSQGVQLPAPLAVVASQVTEGQQQEVTKLLESVTR
GAAGSQLISNYVSVLTKFTLASPDTFEIELGKLVSNLEEVRKDIKIADIQRLHEQNMKKIEENQEKIKETEENAKQVKKS
GIASKIFGWLSAIASVIVGAIMVASGVGAVAGAMMVASGVIGMANMAVKQAAEDGLISQEAMKILGPILTAIEVALTVVS
TVMTFGGSALKCLANIGAKLGANTASLAAKGAEFSAKVAQISTGISNTVGSAVTKLGGSFAGLTMSHAIRTGSQATQVAV
GVGSGITQTINNKKQADLQHNNADLALNKADMAALQSIIDRLKEELSHLSESHQQVMELIFQMINAKGDMLHNLAGRPHT
V
>P41784 ~~~sctF1~~~SPI-1 type 3 secretion system needle filament protein~~~
MATPWSGYLDDVSAKFDTGVDNLQTQVTEALDKLAAKPSDPALLAAYQSKLSEYNLYRNAQSNTVKVFKDIDAAIIQNFR
>H9L4C5 ~~~sctF2~~~SPI-2 type 3 secretion system needle filament protein~~~
MDIAQLVDMLSHMAHQAGQAINDKMNGNDLLNPESMIKAQFALQQYSTFINYESSLIKMIKDMLSGIIAKI
>P95434 ~~~sctF~~~Type 3 secretion system needle filament protein~~~
MAQIFNPNPGNTLDTVANALKEQANAANKDVNDAIKALQGTDNADNPALLAELQHKINKWSVIYNINSTVTRALRDLMQG
ILQKI
>P0A223 ~~~sctF~~~Type 3 secretion system needle filament protein~~~
MSVTVPNDDWTLSSLSETFDDGTQTLQGELTLALDKLAKNPSNPQLLAEYQSKLSEYTLYRNAQSNTVKVIKDVDAAIIQ
NFR
>Q01247 ~~~sctF~~~Type 3 secretion system needle filament protein~~~
MSNFSGFTKGNDIADLDAVAQTLKKPADDANKAVNDSIAALKDTPDNPALLADLQHSINKWSVIYNISSTIVRSMKDLMQ
GILQKFP
>O68691 ~~~sctF~~~Type 3 secretion system needle filament protein~~~
MSNFSGFTKGTDIADLDAVAQTLKKPADDANKAVNDSIAALKDKPDNPALLADLQHSINKWSVIYNINSTIVRSMKDLMQ
GILQKFP
>P0CL43 ~~~invH~~~SPI-1 type 3 secretion system pilotin~~~
MKKFYSCLPVFLLIGCAQVPLPSSVSKPVQQPGAQKEQLANANSIDECQSLPYVPSDLAKNKSLSNHNADNSASKNSAIS
SSIFCEKYKQTKEQALTFFQEHPQYMRSKEDEEQLMTEFKKVLLEPGSKNLSIYQTLLAAHERLQAL
>P0A1X2 ~~~mxiM~~~Type 3 secretion system pilotin~~~
MIRHGSNKLKIFILSILLLTLSGCALKSSSNSEKEWHIVPVSKDYFSIPNDLLWSFNTTNKSINVYSKCISGKAVYSFNA
GKFMGNFNVKEVDGCFMDAQKIAIDKLFSMLKDGVVLKGNKINDTILIEKDGEVKLKLIRGI
>Q56851 ~~~yscW~~~Type 3 secretion system pilotin~~~
MSRIIALIISFLLVGCATPPMPAQRIVGEVRMSRPLSRTAHIDVSIFGLYEGKVREVQRTRFETGNLPLFFSIKLNPAQR
GEGELYLRSTLSFPERGVQAVAQQKLIGKNKVVLQMIPKTCYPNCQSPNTR
>E1WAB4 ~~~sctL1~~~SPI-1 type 3 secretion system stator protein~~~
MLKNIPIPSPLSPVEGILIKRKTLERYFSIERLEQQAHQRAKRILREAEEEAKTLRMYAYQEGYEQGMIDALQQVAAYLT
DNQTMAWKWMEKIQIYARELFSAAVDHPETLLTVLDEWLRDFDKPEGQLFLTLPVNAKKDHQKLMVLLMENWPGTFNLKY
HQEQRFIMSCGDQIAEFSPEQFVETAVGVIKHHLDELPQDCRTISDNAINALIDEWKTKTQAEVIR
>P0CL45 ~~~sctL1~~~SPI-1 type 3 secretion system stator protein~~~
MLKNIPIPSPLSPVEGILIKRKTLERYFSIERLEQQAHQRAKRILREAEEEAKTLRMYAYQEGYEQGMIDALQQVAAYLT
DNQTMAWKWMEKIQIYARELFSAAVDHPETLLTVLDEWLRDFDKPEGQLFLTLPVNAKKDHQKLMVLLMENWPGTFNLKY
HQEQRFIMSCGDQIAEFSPEQFVETAVGVIKHHLDELPQDCRTISDNAINALIDEWKTKTQAEVIR
>P74853 ~~~sctL2~~~SPI-2 type 3 secretion system stator protein~~~
MSFTSLPLTEINHKLPARNIIESQWITLQLTLFAQEQQAKRVSHAIVSSAYRKAEKIIRDAYRYQREQKVEQQQELACLR
KNTLEKMEVEWLEQHVKHLQDDENQFRSLVDHAAHHIKNSIEQVLLAWFDQQSVDSVMCHRLARQATAMAEEGALYLRIH
PEKEALMRETFGKRFTLIIEPGFSPDQAELSSTRYAVEFSLSRHFNALLKWLRNGEDKRGSDEY
>Q99PY0 ~~~sctL~~~Type 3 secretion system stator protein~~~
MKVCNMQKGTLPVSRHHAYDGVVIKRIEKELCKTIKDRDTESKKKAICVIKEATKKAESLRIDAVCDGYQIGIQTAFEHI
IDYICEWKLKQNENRRNIEDYITSLLSENLHDERIISTLLEQWLSSLRNTVTELKVVLPKCNLALRKKLELDLHKYRSDV
KIILKYSEGNNYIFCSGNQVVEFSPQDVISGVKIELAEKLTKNDKKYFKELAHKKLRQIAEDLLKENPVND
>Q01253 ~~~sctL~~~Type 3 secretion system stator protein~~~
MSQTCQTGYAYMQPFVQIIPSNLSLACGLRILRAEDYQSSLTTEELISAAKQDAEKILADAQEVYEQQKQLGWQAGMDEA
RTLQATLIHETQLQCQQFYRHVEQQMSEVVLLAVRKILNDYDQVDMTLQVVREALALVSNQKQVVVRVNPDQAGTIREQI
AKVHKDFPEISYLEVTADARLDQGGCILETEVGIIDASIDGQIEALSRAISTTLGQMKVTEEE
>P69976 ~~~sctL~~~Type 3 secretion system stator protein~~~COG1317
MSQTCQTGYAYMQPFVQIIPSNLSLACGLRILRAEDYQSSLTTEELISAAKQDAEKILADAQEVYEQQKQLGWQAGMDEA
RTLQATLIHETQLQCQQFYRHVEQQMSEVVLLAVRKILNDYDQVAMTLQVVREALALVSNQKQVVVRVNPDQAGAIREQI
AKVHKDFPEISYLEVTADARLDQGGCILETEVGIIDASIDGQIEALSRAISTTLGQMKVTE
>P69977 ~~~sctL~~~Type 3 secretion system stator protein~~~
MSQTCQTGYAYMQPFVQIIPSNLSLACGLRILRAEDYQSSLTTEELISAAKQDAEKILADAQEVYEQQKQLGWQAGMDEA
RTLQATLIHETQLQCQQFYRHVEQQMSEVVLLAVRKILNDYDQVAMTLQVVREALALVSNQKQVVVRVNPDQAGAIREQI
AKVHKDFPEISYLEVTADARLDQGGCILETEVGIIDASIDGQIEALSRAISTTLGQMKVTE
>P0A1B9 7.4.2.8~~~sctN1~~~SPI-1 type 3 secretion system ATPase~~~
MKTPRLLQYLAYPQKITGPIIEAELRDVAIGELCEIRRGWHQKQVVARAQVVGLQRERTVLSLIGNAQGLSRDVVLYPTG
RALSAWVGYSVLGAVLDPTGKIVERFTPEVAPISEERVIDVAPPSYASRVGVREPLITGVRAIDGLLTCGVGQRMGIFAS
AGCGKTMLMHMLIEQTEADVFVIGLIGERGREVTEFVDMLRASHKKEKCVLVFATSDFPSVDRCNAAQLATTVAEYFRDQ
GKRVVLFIDSMTRYARALRDVALASGERPARRGYPASVFDNLPRLLERPGATSEGSITAFYTVLLESEEEADPMADEIRS
ILDGHLYLSRKLAGQGHYPAIDVLKSVSRVFGQVTTPTHAEQASAVRKLMTRLEELQLFIDLGEYRPGENIDNDRAMQMR
DSLKAWLCQPVAQYSSFDDTLSGMNAFADQN
>P74857 7.4.2.8~~~sctN2~~~SPI-2 type 3 secretion system ATPase~~~
MKNELMQRLRLKYPPPDGYCRWGRIQDVSATLLNAWLPGVFMGELCCIKPGEELAEVVGINGSKALLSPFTSTIGLHCGQ
QVMALRRRHQVPVGEALLGRVIDGFGRPLDGRELPDVCWKDYDAMPPPAMVRQPITQPLMTGIRAIDSVATCGEGQRVGI
FSAPGVGKSTLLAMLCNAPDADSNVLVLIGERGREVREFIDFTLSEETRKRCVIVVATSDRPALERVRALFVATTIAEFF
RDNGKRVVLLADSLTRYARAAREIALAAGETAVSGEYPPGVFSALPRLLERTGMGEKGSITAFYTVLVEGDDMNEPLADE
VRSLLDGHIVLSRRLAERGHYPAIDVLATLSRVFPVVTSHEHRQLAAILRRCLALYQEVELLIRIGEYQRGVDTDTDKAI
DTYPDICTFLRQSKDEVCGPELLIEKLHQILTE
>Q8RK01 7.4.2.8~~~sctN~~~Type 3 secretion system ATPase~~~
MNAALSQWKDAHAARLRDYSAVRVSGRVSAVRGILLECKIPAAKVGDLCEVSKADGSFLLAEIVGFTQECTLLSALGAPD
GIQVGAPIRPLGIAHRIGVDDSLLGCVLDGFGRPLLGDCLGAFAGPDDRRETLPVIADALPPTQRPRITNALPTGVRAID
SAILLGEGQRVGLFAGAGCGKTTLMAELARNMGCDVIVFGLIGERGRELREFLDHELDETLRRRSVLVCATSDRSSMERA
RAAFTATAIAEAFRARGQKVLLLLDSLTRFARAQREIGIASGEPLGRGGLPPSVYTLLPRLVERAGMSENGSITALYTVL
IEQDSMNDPVADEVRSLLDGHIVLSRKLAERGHYPAIDVSASISRILSNVTGREHQRANNRLRQLLAAYKQVEMLLRLGE
YQAGADPVTDCAVQLNDDINEFLRQDLREPVPLQETLDGLLRLTSRLPE
>Q52371 7.4.2.8~~~sctN~~~Type 3 secretion system ATPase~~~
MNAALNLWKDAHAKRLSQYCAVRVIGRVSAVRRILLECRIPSAKVGDLCEVSKADGSLLLAEIVGFTQECTLLSALGPPD
GIQVGAPIRPLGVAHRIGVDDSLLGCVLDGFGRPLMGDCLGAFAGPEDRRTTLPVIADALPPTQRPRITRALPTGIRAID
SAILLGEGQRVGLFAGAGCGKTTLMAELARNMDCDVIVFGLIGERGRELREFLDHELDETLRRRSVLVCATSDRSSMERA
RAAFTATAIAEAFRARGQKVLLLLDSLTRFARAQREIGIASGEPLGRGGLPPSVYTLLPRLVERAGMSENGSITALYTVL
IEQDSMNDPVADEVRSLLDGHIVLSRKLAERGHYPAIDVSASISRILSNVTGRKHQRANNRLRQLLAAYKQVEMLLRLGE
YQAGADPVTDCAVQLNEAINAFLRQDLREPVPLQETLDRLLQLTSQLPE
>P0A1C1 7.4.2.8~~~sctN~~~Type 3 secretion system ATPase~~~
MSYTKLLTQLSFPNRISGPILETSLSDVSIGEICNIQAGIESNEIVARAQVVGFHDEKTILSLIGNSRGLSRQTLIKPTA
QFLHTQVGRGLLGAVVNPLGEVTDKFAVTDNSEILYRPVDNAPPLYSERAAIEKPFLTGIKVIDSLLTCGEGQRMGIFAS
AGCGKTFLMNMLIEHSGADIYVIGLIGERGREVTETVDYLKNSEKKSRCVLVYATSDYSSVDRCNAAYIATAIAEFFRTE
GHKVALFIDSLTRYARALRDVALAAGESPARRGYPVSVFDSLPRLLERPGKLKAGGSITAFYTVLLEDDDFADPLAEEVR
SILDGHIYLSRNLAQKGQFPAIDSLKSISRVFTQVVDEKHRIMAAAFRELLSEIEELRTIIDFGEYKPGENASQDKIYNK
ISVVESFLKQDYRLGFTYEQTMELIGETIR
>P40290 7.4.2.8~~~sctN~~~Type 3 secretion system ATPase~~~
MLSLDQIPHHIRHGIVGSRLIQIRGRVTQVTGTLLKAVVPGVRIGELCYLRNPDNSLSLQAEVIGFAQHQALLIPLGEMY
GISSNTEVSPTGTMHQVGVGEHLLGQVLDGLGQPFDGGHLPEPAAWYPVYQDAPAPMSRKLITTPLSLGIRVIDGLLTCG
EGQRMGIFAAAGGGKSTLLASLIRSAEVDVTVLALIGERGREVREFIESDLGEEGLRKAVLVVATSDRPSMERAKAGFVA
TSIAEYFRDQGKRVLLLMDSVTRFARAQREIGLAAGEPPTRRGYPPSVFAALPRLMERAGQSSKGSITALYTVLVEGDDM
TEPVADETRSILDGHIILSRKLAAANHYPAIDVLRSASRVMNQIVSKEHKTWAGDLRRLLAKYEEVELLLQIGEYQKGQD
KEADQAIERIGAIRGWLCQGTHELSHFNETLNLLETLTQ
>Q7ARI8 7.4.2.8~~~sctN~~~Type 3 secretion system ATPase~~~COG1157
MLSLDQIPHHIRHGIVGSRLIQIRGRVTQVTGTLLKAVVPGVRIGELCYLRNPDNSLSLQAEVIGFAQHQALLIPLGEMY
GISSNTEVSPTGTMHQVGVGEHLLGQVLDGLGQPFDGGHLPEPAAWYPVYQDAPAPMSRKLITTPLSLGIRVIDGLLTCG
EGQRMGIFAAAGGGKSTLLASLIRSAEVDVTVLALIGERGREVREFIESDLGEEGLRKAVLVVATSDRPSMERAKAGFVA
TSIAEYFRDQGKRVLLLMDSVTRFARAQREIGLAAGEPPTRRGYPPSVFAALPRLMERAGQSSKGSITALYTVLVEGDDM
TEPVADETRSILDGHIILSRKLAAANHYPAIDVLRSASRVMNQIVSKEHKTWAGDLRRLLAKYEEVELLLQIGEYQKGQD
KEADQAIERMGAIRGWLCQGTHELSHFNETLNLLETLTQ
>P40291 7.4.2.8~~~sctN~~~Type 3 secretion system ATPase~~~
MLSLDQIPHHIRHGIVGSRLIQIRGRVTQVTGTLLKAVVPGVRIGELCYLRNPDNSLSLQAEVIGFAQHQALLIPLGEMY
GISSNTEVSPTGTMHQVGVGEHLLGQVLDGLGQPFDGGHLPEPAAWYPVYQDAPAPMSRKLITTPLSLGIRVIDGLLTCG
EGQRMGIFAAAGGGKSTLLASLIRSAEVDVTVLALIGERGREVREFIESDLGEEGLRKAVLVVATSDRPSMERAKAGFVA
TSIAEYFRDQGKRVLLLMDSVTRFARAQREIGLAAGEPPTRRGYPPSVFAALPRLMERAGQSSKGSITALYTVLVEGDDM
TEPVADETRSILDGHIILSRKLAAANHYPAIDVLRSASRVMNQIVSKEHKTWAGDLRRLLAKYEEVELLLQIGEYQKGQD
KEADQAIERMGAIRGWLCQGTHELSHFNETLNLLETLTQ
>Q45966 ~~~scvA~~~Protein ScvA~~~
MERQNVQQQRGKDQRPQRPGASNPRRPNQR
>P0AAD6 ~~~sdaC~~~Serine transporter SdaC~~~COG0814
METTQTSTIASKDSRSAWRKTDTMWMLGLYGTAIGAGVLFLPINAGVGGMIPLIIMAILAFPMTFFAHRGLTRFVLSGKN
PGEDITEVVEEHFGIGAGKLITLLYFFAIYPILLVYSVAITNTVESFMSHQLGMTPPPRAILSLILIVGMMTIVRFGEQM
IVKAMSILVFPFVGVLMLLALYLIPQWNGAALETLSLDTASATGNGLWMTLWLAIPVMVFSFNHSPIISSFAVAKREEYG
DMAEQKCSKILAFAHIMMVLTVMFFVFSCVLSLTPADLAAAKEQNISILSYLANHFNAPVIAWMAPIIAIIAITKSFLGH
YLGAREGFNGMVIKSLRGKGKSIEINKLNRITALFMLVTTWIVATLNPSILGMIETLGGPIIAMILFLMPMYAIQKVPAM
RKYSGHISNVFVVVMGLIAISAIFYSLFS
>Q7WY62 ~~~sda~~~Sporulation inhibitor sda~~~
MNWVPSMRKLSDELLIESYFKATEMNLNRDFIELIENEIKRRSLGHIISVSS
>Q99SX1 ~~~sdcS~~~Sodium-dependent dicarboxylate transporter SdcS~~~
MAYFNQHQSMISKRYLTFFSKSKKKKPFSAGQLIGLILGPLLFLLTLLFFHPQDLPWKGVYVLAITLWIATWWITEAIPI
AATSLLPIVLLPLGHILTPEQVSSEYGNDIIFLFLGGFILAIAMERWNLHTRVALTIINLIGASTSKILLGFMVATGFLS
MFVSNTAAVMIMIPIGLAIIKEAHDLQEANTNQTSIQKFEKSLVLAIGYAGTIGGLGTLIGTPPLIILKGQYMQHFGHEI
SFAKWMIVGIPTVIVLLGITWLYLRYVAFRHDLKYLPGGQTLIKQKLDELGKMKYEEKVVQTIFVLASLLWITREFLLKK
WEVTSSVADGTIAIFISILLFIIPAKNTEKHRRIIDWEVAKELPWGVLILFGGGLALAKGISESGLAKWLGEQLKSLNGV
SPILIVIVITIFVLFLTEVTSNTATATMILPILATLSVAVGVHPLLLMAPAAMAANCAYMLPVGTPPNAIIFGSGKISIK
QMASVGFWVNLISAIIIILVVYYVMPIVLGIDINQPLPLK
>Q7A4P8 ~~~sdcS~~~Sodium-dependent dicarboxylate transporter SdcS~~~
MAYFNQHQSMISKRYLTFFSKSKKKKPFSAGQLIGLILGPLLFLLTLLFFHPQDLPWKGVYVLAITLWIATWWITEAIPI
AATSLLPIVLLPLGHILTPEQVSSEYGNDIIFLFLGGFILAIAMERWNLHTRVALTIINLIGASTSKILLGFMVATGFLS
MFVSNTAAVMIMIPIGLAIIKEAHDLQEANTNQTSIQKFEKSLVLAIGYAGTIGGLGTLIGTPPLIILKGQYMQHFGHEI
SFAKWMIVGIPTVIVLLGITWLYLRYVAFRHDLKYLPGGQTLIKQKLDELGKMKYEEKVVQTIFVLASLLWITREFLLKK
WEVTSSVADGTIAIFISILLFIIPAKNTEKHRRIIDWEVAKELPWGVLILFGGGLALAKGISESGLAKWLGEQLKSLNGV
SPILIVIVITIFVLFLTEVTSNTATATMILPILATLSVAVGVHPLLLMAPAAMAANCAYMLPVGTPPNAIIFGSGKISIK
QMASVGFWVNLISAIIIILVVYYVMPIVLGIDINQPLPLK
>Q5ZTK4 ~~~sdeA~~~Ubiquitinating/deubiquitinating enzyme SdeA~~~COG1196
MPKYVEGVELTQEGMHAIFARMGYGDITSGSIYNGVPTIDTGALNRQGFMPVLTGVGPHRDSGHWIMLIKGPGNQYYLFD
PLGKTSGEGYQNILAAQLPMGSTLSVIPNGSGLNMGLCGYWVASAGLRAHQALNQHNPPTLLNVGQTITNEMRNELDHDG
YRKITGWLRAVADEFPEGDPQLDGKALRENTEKDLKIEIPTLVLPGKDTSPKEMSVKPTAPQDKSVPVWNGFSLYTDDTV
KAAAQYAYDNYLGKPYTGSVESAPANFGGRMVYRQHHGLSHTLRTMAYAELIVEEARKAKLRGETLGKFKDGRTIADVTP
QELKKIMIAQAFFVAGRDDEASDAKNYQKYHEQSRDAFLKYVKDNESTLIPDVFKDQEDVNFYARVIEDKSHDWESTPAH
VLINQGHMVDLVRVKQPPESFLQRYFSSMQRWIGSQATEAVFGIQRQFFHATYEVVAGFDSDNKEPHLVVSGLGRYVIGE
DGQPIREAPKKGQKEGDLKVFPQTYKLKENERLMRVDEFLKLPEIQNTFPGSGKHLQGGMPGMNEMDYWNRLNSLNRARC
ENDVDFCLKQLQTAHDKAKIEPIKQAFQSSKGKERRQPNVDEIAAARIIQQILANPDCIHDDHVLINGQKLEQQFFRDLL
AKCEMAVVGSLLNDTDIGNIDTLMRHEKDTEFHSTNPEAVPVKIGEYWINDQRINNSSGNITQKKHDLIFLMQNDAWYFS
RVNAIAQNRDKGSTFKEVLITTLMTPLTSKALVDTSQAKPPTRLFRGLNLSEEFTKGLIDQANAMIANTTERLFTDHSPE
AFKQIKLNDLSKMSGRTNASTTTEIKLVKETWDSNVIFEMLDPDGLLHSKQVGRHGEGTESEFSVYLPEDVALVPVKVTL
DGKTQKGENRYVFTFVAVKSPDFTPRHESGYAVEPFLRMQAAKLAEVKSSIEKAQRAPDLETIFNLQNEVEAVQYSHLST
GYKNFLKNTVGPVLENSLSGLMESDTDTLSKALAAFPSDTQWSAFNFEEARQAKRQMDAIKQMVGNKVVLDALTQCQDAL
EKQNIAGALDALKKIPSEKEMGTIRRELREQIQSARQELESLQRAVVTPVVTDEKKVRERYDALIENTSKKITELETGKL
PNLDAVKKGISNLSNLKQEVTVLRNEKIRMHVGTDKVDFSDVEKLEQQIQVIDTKLADAYLLEVTKQISALDNTKPKNQT
ELKTKIAAFLDRTTDIEMLRNERIKKHGSSKDPLDLSDLDKLSGSLQRINQSLVSDLITTIRVSINQMEAKTFHEQEKEI
QQNFELLAKLEKTLDKSKTSEKLREDIPKLNDLLVAKQKAYPQMVQMQLKSEVFVTQLREVCQANHDDLDKTRNARLREL
DRLDREAGITRMVGNLIWGLTNKVGLTTDERLDIRTKQQSLARFKNELFNDKIDTDQLISNLARKRPSELQEGLGISTDN
AMELHLLLTELAGKTTSPDELEERMKAIDDISTKIGREPEHLKFVMVEEDESNKKTIGF
>Q7X279 6.2.1.65~~~sdgA~~~Salicylyl-CoA synthase / salicylate adenylyltransferase~~~
MTREGFVPWPKEAADRYREAGYWRGRPLGSYLHEWAETYGDTVAVVDGDTRLTYRQLVDRADGLACRLLDSGLNPGDAML
VQLPNGWEFVTLTLACLRAGIAPVMAMPAHRGHELRYLAAHAEVTSIAVPDRLGDFDHQALGREVAEDTPSVGLLLVAGG
TVGTDATDLRALAEPADDPVTARARLDRIAPDSGDIAVFLLSGGTTGLPKLITRTHDDYEYNARRSAEVCGLDSDSVYLV
ALPAGHNFPLACPGILGTLMNGGRVVLARTPEPGKVLPLMAAEGVTATAAVPAVVQRWIDAVASGRHPAPPALRLLQVGG
ARLAPEVARRAEPVLGGTLQQVFGMAEGLLNYTRPDDPDDIKIETQGRPMCPDDEILVVDASDNPVPPGEMGALLTRGPY
TPRGYYRAAEHNARAFTPDGWYRTGDVVRLHPSGNLVVEGRDKDLINRGGEKISAEEVENLIYRLPGVARVAAVAKADPD
LGERVCAVVVVEPGTQLSLESVRAALTAMQVARYKLPEDLLVVDELPLTKVGKIDKKRLRDVVRGKADSVEAV
>Q7X281 1.14.13.209~~~sdgC~~~Salicyloyl-CoA 5-hydroxylase~~~
MKVACIGAGPGGLFFATLLKRSRPDAEVVVFERNRPDDTFGFGVVFSDATLDAIDAADPVLSEALEKHGRHWDDIEVRVH
GERERVGGMGMAAVVRKTLLSLLQERARAEGVQMRFQDEVRDPAELDDFDLVVVCDGANSRFRTLFADDFGPTAEVASAK
FIWFGTTYMFDGLTFVHQDGPHGVFAAHAYPISDSLSTFIVETDADSWARAGLDAFDPATPLGMSDEKTKSYLEDLFRAQ
IDGHPLVGNNSRWANFATRRARSWRSGKWVLLGDAAHTAHFSVGSGTKMAMEDAVALAETLGEASRSVPEALDLYEERRR
PKVERIQNSARPSLSWWEHFGRYVRSFDAPTQFAFHFLTRSIPRGKLAVRDAAYVDRVDGWWLRHHEAGPLKTPFRVGPY
RLPTRRVTVGDDLLTGTDGTGIPMVPFSGQPFGAGVWIDAPDTEEGLPLALDQVRETAEAGVLLVGVRGGTALTRVLVAE
EARLAHSLPAAIVGAYDDDTATTLVLSGRADLVGGTK
>Q7X284 1.13.11.4~~~sdgD~~~Gentisate 1,2-dioxygenase~~~
MTDTTSEQTERKLLDELYADFEDAGLIPLWTQVDGLMPMSPQPAAVPHLWRWAELLPIAQRSGELVPVGRGGERRAMALS
NPGFPGLPYATPTLWTAIQYLGPREVAPSHRHSQGAFRFVVEGEGVWTNVDGDAVAMRRGDLLLTPSWAFHEHQNVTDEP
MAWLDGLDIPLVSKLDAGFFEFGPDELSTRETPERSRGERLWGHPGLRPIGRPDQPNSPLNAYRWEHTDAALTAQLELEQ
EGVPGVLEPGHAGVRFSNPTTGRDALVTMRTEMRRLRAGTRTAPVRTVGSAIWQVFEGEAVARVGDKVFEIAKGDLFVVP
SWCEVSLSARTQVDLFRFSDEPVYEALGLARTSRGEHK
>P08065 1.3.5.1~~~sdhA~~~Succinate dehydrogenase flavoprotein subunit~~~COG1053
MSQSSIIVVGGGLAGLMATIKAAESGMAVKLFSIVPVKRSHSVCAQGGINGAVNTKGEGDSPWEHFDDTVYGGDFLANQP
PVKAMCEAAPSIIHLLDRMGVMFNRTPEGLLDFRRFGGTQHHRTAYAGATTGQQLLYALDEQVRRYEVAGLVTKYEGWEF
LGAVLDDDRTCRGIVAQNLTNMQIESFRSDAVIMATGGPGIIFGKSTNSMINTGSAASIVYQQGAYYANGEFIQIHPTAI
PGDDKLRLMSESARGEGGRVWTYKDGKPWYFLEEKYPAYGNLVPRDIATREIFDVCVNQKLGINGENMVYLDLSHKDPKE
LDIKLGGIIEIYEKFMGDDPRKLPMKIFPAVHYSMGGLWVDYDQMTNIPGLFAAGECDYSMHGGNRLGANSLLSAIYGGM
VAGPNAVKYVNGLESSAEDMSSSLFDAHVKKEEEKWADIMSMDGTENAYVLHKELGEWMTANVTVVRHNDKLLKTDDKIQ
ELMERFKKININDTTKWSNQGAMFTRQFSNMLQLARVITLGAYNRNESRGAHYKPDYPERNDDEWLKTTMAKHVSPYEAP
EFEYQDVDVSLITPRKRDYSKKKVAK
>P0AC41 1.3.5.1~~~sdhA~~~Succinate dehydrogenase flavoprotein subunit~~~COG1053
MKLPVREFDAVVIGAGGAGMRAALQISQSGQTCALLSKVFPTRSHTVSAQGGITVALGNTHEDNWEWHMYDTVKGSDYIG
DQDAIEYMCKTGPEAILELEHMGLPFSRLDDGRIYQRPFGGQSKNFGGEQAARTAAAADRTGHALLHTLYQQNLKNHTTI
FSEWYALDLVKNQDGAVVGCTALCIETGEVVYFKARATVLATGGAGRIYQSTTNAHINTGDGVGMAIRAGVPVQDMEMWQ
FHPTGIAGAGVLVTEGCRGEGGYLLNKHGERFMERYAPNAKDLAGRDVVARSIMIEIREGRGCDGPWGPHAKLKLDHLGK
EVLESRLPGILELSRTFAHVDPVKEPIPVIPTCHYMMGGIPTKVTGQALTVNEKGEDVVVPGLFAVGEIACVSVHGANRL
GGNSLLDLVVFGRAAGLHLQESIAEQGALRDASESDVEASLDRLNRWNNNRNGEDPVAIRKALQECMQHNFSVFREGDAM
AKGLEQLKVIRERLKNARLDDTSSEFNTQRVECLELDNLMETAYATAVSANFRTESRGAHSRFDFPDRDDENWLCHSLYL
PESESMTRRSVNMEPKLRPAFPPKIRTY
>P33073 4.3.1.17~~~sdhA~~~L-serine dehydratase, alpha chain~~~
MLNTAREIIDVCNERGIKIYDLVLEEEIKNSHTTEEEIRKKLDAVIDVMHASATKNLTQSDVTEYKMIDGFAKRTYEYAN
SGKSIVGDFLAKAMAMAFSTSEVNASMGKIVAAPTAGSSGIMPAMLVAATEKYNFDRTTIQNGFLTSIGIGQVITKYATF
AGAEGGCQAECGSASAMAAAALVEMLGGTVEQALHAASITIINVLGLVCDPIAGLVQYPCTFRNASGVINAFISADLALA
GVESLVPFDEVVIAMGEVGNSMIEALRETGLGGLAGSKTGQKIRRDFLKEGD
>G4V4G6 1.3.5.1~~~sdhA~~~Succinate dehydrogenase flavoprotein subunit~~~COG1053
MNLPVREFDAVVIGAGGAGMRAALQISQMGLSCALLSKVFPTRSHTVSAQGGITVALGNTHEDNWEWHMYDTVKGSDYIG
DQDAIEYMCKTGPEAVLELEHMGLPFSRLDDGSIYQRPFGGQSRNFGGEQAARTAAAADRTGHALLHTLYQQNLKNHTTI
FSEWYALDLVKNQDGAVMGCTAICIETGEVVYFKARATVLATGGAGRIYQSTTNAHINTGDGVGMALRAGVPVQDMEMWQ
FHPTGIAGAGVLVTEGCRGEGGYLLNKHGERFMERYAPNAKDLAGRDVVARSIMIEIREGRGCDGPWGPHAKLKLDHLGK
DVLESRLPGILELSRTFAHVDPVKEPIPVIPTCHYMMGGIPTKVTGQVLAVNEQGEDVVIPGLFAVGEIACVSVHGANRL
GGNSLLDLIVFGRSAGMHLQESLEEQGATRDASNSDIEASLDRLNRWNSTRSGEDPVEIRKALQACMQHNFSVFREGDAM
AKGLEELKVIRERLKNARLDDTSNDFNTQRIECLELDNLMETAYATAVSANFRTESRGAHSRFDYPDRDDDKWLCHTLYQ
PQTESMTRRKVNMQPKLRPAFPPKVRTY
>P07014 1.3.5.1~~~sdhB~~~Succinate dehydrogenase iron-sulfur subunit~~~COG0479
MRLEFSIYRYNPDVDDAPRMQDYTLEADEGRDMMLLDALIQLKEKDPSLSFRRSCREGVCGSDGLNMNGKNGLACITPIS
ALNQPGKKIVIRPLPGLPVIRDLVVDMGQFYAQYEKIKPYLLNNGQNPPAREHLQMPEQREKLDGLYECILCACCSTSCP
SFWWNPDKFIGPAGLLAAYRFLIDSRDTETDSRLDGLSDAFSVFRCHSIMNCVSVCPKGLNPTRAIGHIKSMLLQRNA
>P33074 4.3.1.17~~~sdhB~~~L-serine dehydratase, beta chain~~~
MTDYSAFEVMGPIMVGPSSSHTAGACKIANVATSIVSNNYNQVEFQLHGSFAHTFKGHGTDRALVGGILGFEPDDDRIKT
SFELAKQAGLNYIFTTTNLGDNYHPNSVKIVFSYPNGEEEYVIGSSIGGGAMKIVNINGIAIEFRGEYSTILLEYPEQRG
VISYVSSLLTGSEYNIESLNTKKNKLTNIVTLTVEIDKPLTESLKSAILGVERFTTAKYVEV
>O53368 ~~~sdhC~~~Succinate dehydrogenase 2 membrane subunit SdhC~~~COG2009
MNTTATTVSRGRRPPRTLYRGDPGMWSWVCHRISGATIFFFLFVHVLDAAMLRVSPQTYNAVLATYKTPIVGLMEYGLVA
AVLFHALNGIRVILIDFWSEGPRYQRLMLWIIGSVFLLLMVPAGVVVGIHMWEHFR
>P00926 4.3.1.18~~~dsdA~~~D-serine dehydratase~~~COG3048
MENAKMNSLIAQYPLVKDLVALKETTWFNPGTTSLAEGLPYVGLTEQDVQDAHARLSRFAPYLAKAFPETAATGGIIESE
LVAIPAMQKRLEKEYQQPISGQLLLKKDSHLPISGSIKARGGIYEVLAHAEKLALEAGLLTLDDDYSKLLSPEFKQFFSQ
YSIAVGSTGNLGLSIGIMSARIGFKVTVHMSADARAWKKAKLRSHGVTVVEYEQDYGVAVEEGRKAAQSDPNCFFIDDEN
SRTLFLGYSVAGQRLKAQFAQQGRIVDADNPLFVYLPCGVGGGPGGVAFGLKLAFGDHVHCFFAEPTHSPCMLLGVHTGL
HDQISVQDIGIDNLTAADGLAVGRASGFVGRAMERLLDGFYTLSDQTMYDMLGWLAQEEGIRLEPSALAGMAGPQRVCAS
VSYQQMHGFSAEQLRNTTHLVWATGGGMVPEEEMNQYLAKGR
>Q8ZL08 4.3.1.18~~~dsdA~~~D-serine dehydratase~~~
MENIQKLIARYPLVEDLVALKETTWFNPGATSLAQGLPYVGLTEQDVNAAHDRLARFAPYLAKAFPQTAAAGGMIESDVV
AIPAMQKRLEKEYGQTINGEMLLKKDSHLAISGSIKARGGIYEVLTHAEKLALEAGLLTTDDDYSVLLSPEFKQFFSQYS
IAVGSTGNLGLSIGIMSACIGFKVTVHMSADARAWKKAKLRSHGVTVVEYEDDYGVAVEQGRKAAQSDPNCFFIDDENSR
TLFLGYAVAGQRLKAQFAQQGRVVDASHPLFVYLPCGVGGGPGGVAFGLKLAFGDNVHCFFAEPTHSPCMLLGVYTGLHD
AISVQDIGIDNLTAADGLAVGRASGFVGRAMERLLDGLYTLDDQTMYDMLGWLAQEEGIRLEPSALAGMAGPQRICASVA
YQQRHGFSQTQLGNATHLVWATGGGMVPEDEMEQYLAKGR
>P64559 ~~~sdhE~~~FAD assembly factor SdhE~~~COG2938
MDINNKARIHWACRRGMRELDISIMPFFEHEYDSLSDDEKRIFIRLLECDDPDLFNWLMNHGKPADAELEMMVRLIQTRN
RERGPVAI
>G4V4G2 ~~~sdhE~~~FAD assembly factor SdhE~~~COG2938
MDIDNKPRIHWACRRGMRELDISIMPFFEHDYDTLSDDDKRNFIRLLQCDDPDLFNWLMNHGEPTDQGLKHMVSLIQTRN
KNRGPVAM
>Q9KPA2 ~~~sdhE~~~FAD assembly factor SdhE~~~COG2938
MYTAEQKARIKWACRRGMLELDVVIMPFFEECFDSLTESEQDDFVALLESDDPDLFAWVMGHGRCENLGLAAMVDKIVAH
NLSKVR
>P16095 4.3.1.17~~~sdaA~~~L-serine dehydratase 1~~~COG1760
MISLFDMFKVGIGPSSSHTVGPMKAGKQFVDDLVEKGLLDSVTRVAVDVYGSLSLTGKGHHTDIAIIMGLAGNEPATVDI
DSIPGFIRDVEERERLLLAQGRHEVDFPRDNGMRFHNGNLPLHENGMQIHAYNGDEVVYSKTYYSIGGGFIVDEEHFGQD
AANEVSVPYPFKSATELLAYCNETGYSLSGLAMQNELALHSKKEIDEYFAHVWQTMQACIDRGMNTEGVLPGPLRVPRRA
SALRRMLVSSDKLSNDPMNVIDWVNMFALAVNEENAAGGRVVTAPTNGACGIVPAVLAYYDHFIESVSPDIYTRYFMAAG
AIGALYKMNASISGAEVGCQGEVGVACSMAAAGLAELLGGSPEQVCVAAEIGMEHNLGLTCDPVAGQVQVPCIERNAIAS
VKAINAARMALRRTSAPRVSLDKVIETMYETGKDMNAKYRETSRGGLAIKVQCD
>P9WGT5 4.3.1.17~~~sdaA~~~L-serine dehydratase~~~COG1760
MTISVFDLFTIGIGPSSSHTVGPMRAANQFVVALRRRGHLDDLEAMRVDLFGSLAATGAGHGTMSAILLGLEGCQPETIT
TEHKERRLAEIAASGVTRIGGVIPVPLTERDIDLHPDIVLPTHPNGMTFTAAGPHGRVLATETYFSVGGGFIVTEQTSGN
SGQHPCSVALPYVSAQELLDICDRLDVSISEAALRNETCCRTENEVRAALLHLRDVMVECEQRSIAREGLLPGGLRVRRR
AKVWYDRLNAEDPTRKPEFAEDWVNLVALAVNEENASGGRVVTAPTNGAAGIVPAVLHYAIHYTSAGAGDPDDVTVRFLL
TAGAIGSLFKERASISGAEVGCQGEVGSAAAMAAAGLAEILGGTPRQVENAAEIAMEHSLGLTCDPIAGLVQIPCIERNA
ISAGKAINAARMALRGDGIHRVTLDQVIDTMRATGADMHTKYKETSAGGLAINVAVNIVEC
>P30744 4.3.1.17~~~sdaB~~~L-serine dehydratase 2~~~COG1760
MISVFDIFKIGIGPSSSHTVGPMKAGKQFTDDLIARNLLKDVTRVVVDVYGSLSLTGKGHHTDIAIIMGLAGNLPDTVDI
DSIPSFIQDVNTHGRLMLANGQHEVEFPVDQCMNFHADNLSLHENGMRITALAGDKVVYSQTYYSIGGGFIVDEEHFGQQ
DSAPVEVPYPYSSAADLQKHCQETGLSLSGLMMKNELALHSKEELEQHLANVWEVMRGGIERGISTEGVLPGKLRVPRRA
AALRRMLVSQDKTTTDPMAVVDWINMFALAVNEENAAGGRVVTAPTNGACGIIPAVLAYYDKFIREVNANSLARYLLVAS
AIGSLYKMNASISGAEVGCQGEVGVACSMAAAGLAELLGASPAQVCIAAEIAMEHNLGLTCDPVAGQVQVPCIERNAIAA
VKAVNAARMALRRTSEPRVCLDKVIETMYETGKDMNAKYRETSRGGLAMKIVACD
>Q59787 1.1.1.-~~~polS~~~Sorbitol dehydrogenase~~~
MRLDGKTALITGSARGIGRAFAEAYVREGARVAIADINLEAARATAAEIGPAACAIALDVTDQASIDRCVAELLDRWGSI
DILVNNAALFDLAPIVEITRESYDRLFAINVSGTLFMMQAVARAMIAGGRGGKIINMASQAGRRGEALVGVYCATKAAVI
SLTQSAGLNLIRHGINVNAIAPGVVDGEHWDGVDAKFADYENLPRGEKKRQVGAAVPFGRMGRAEDLTGMAIFLATPEAD
YIVAQTYNVDGGNWMS
>Q9KWN1 1.1.1.276~~~sdh~~~Serine 3-dehydrogenase~~~COG4221
MSGTILITGATSGFGQATAQRFVKEGWKVIGTGRRAERLEALSAELGSAFHGVAFDITDEEATKKALAGLPDGFRDIDIL
VNNAGLALGTAPAPQVPLKDWQTMVDTNITGLLNVTHHLLPTLIERKGIVINLSSVAAHYPYLGGNVYGGTKAFLRQFSL
GLRSDLHGKGVRVTSIEPGMCETEFTLVRTGGNQEASDNLYKGVNPITADDIANTIHWVASQPKHININSLELMPVNQSF
AGFQVYRES
>P07026 ~~~sdiA~~~Regulatory protein SdiA~~~COG2197
MQDKDFFSWRRTMLLRFQRMETAEEVYHEIELQAQQLEYDYYSLCVRHPVPFTRPKVAFYTNYPEAWVSYYQAKNFLAID
PVLNPENFSQGHLMWNDDLFSEAQPLWEAARAHGLRRGVTQYLMLPNRALGFLSFSRCSAREIPILSDELQLKMQLLVRE
SLMALMRLNDEIVMTPEMNFSKREKEILRWTAEGKTSAEIAMILSISENTVNFHQKNMQKKINAPNKTQVACYAAATGLI
>P00947 5.3.3.1~~~ksi~~~Steroid Delta-isomerase~~~
MNTPEHMTAVVQRYVAALNAGDLDGIVALFADDATVEDPVGSEPRSGTAAIREFYANSLKLPLAVELTQEVRAVANEAAF
AFTVSFEYQGRKTVVAPIDHFRFNGAGKVVSMRALFGEKNIHAGA
>P07445 5.3.3.1~~~ksi~~~Steroid Delta-isomerase~~~
MNLPTAQEVQGLMARYIELVDVGDIEAIVQMYADDATVEDPFGQPPIHGREQIAAFYRQGLGGGKVRACLTGPVRASHNG
CGAMPFRVEMVWNGQPCALDVIDVMRFDEHGRIQTMQAYWSEVNLSVREPQ
>Q60A55 1.14.13.246~~~sdmA~~~4beta-methylsterol monooxygenase~~~COG4638
MSRSIRNQDVPELPRRRQVRTVGMSGNYWYVVEIDGRLKPRQVKRVRFWGQDIALFRDAAGELHAVEDRCPHRQLPLSQG
FVEGGNLVCTYHGWKFDGCGRCTEIHHELGKGRTRLPRIRIRTYPVKAQWGLIWLFPGDPALADGTPLPTIPQLEGGRPW
PFFPIDVTIKAHFSMIVENVCDFNHEYLHRHKRPFLQPILREWKQDADSVRVYYDTRFDGSPVAKLFMEGGARDLNEIEI
WYQYPYQGSDIGGKYIHWLFMLPEDERTTRCFFVFLFGPIHVPIVNWKMPEFLRKPILWFTNKWYIEPLLGEDKWALELE
QDGFERHPDAPQIELNPAISSFQRLSLEKWKAYQQSMERAGPKPAADPA
>Q60A54 1.1.1.270~~~sdmB~~~Sterol demethylase protein B~~~COG0451
MTTLVTGATGHLGANLVRALLARGEKVRAFIRRQSDVAALDGLAVERAYGDLRDRRSIRDALEGVERLYHTAAFVSIRDG
DRQELFDVNVVGTRMLMQEARRAGVRRVVHTSSFGAVGINPQGASNEHWTVSPFEPGTDYERTKAVSEHDVILEAVRGLD
VTIVNPAAIVGPWDFRPSLVGRTILDFAHGRMRAFVPGAFDFVPMRDVVAVELLAMDKGIRGERYLVTGEHCTIGQILQW
LEELTGHPRPRLAIPPRLMQGIALLKDPLERRFFPRRTPRFNYHSIRLLNSGKRGDSSRSRRELGLVPTSTRAAFADAVA
WFRERGMI
>Q83WC3 2.1.1.157~~~~~~Sarcosine/dimethylglycine N-methyltransferase~~~
MTKADAVAKQAQDYYDSGSADGFYYRIWGGEDLHIGIYNTPDEPIYDASVRTVSRICDKIKNWPAGTKVLDLGAGYGGSA
RYMAKHHGFDVDCLNISLVQNERNRQMNQEQGLADKIRVFDGSFEELPFENKSYDVLWSQDSILHSGNRRKVMEEADRVL
KSGGDFVFTDPMQTDNCPEGVLEPVLARIHLDSLGSVGFYRQVAEELGWEFVEFDEQTHQLVNHYSRVLQELEAHYDQLQ
PECSQEYLDRMKVGLNHWINAGKSGYMAWGILKFHKP
>Q9KJ21 2.1.1.157~~~~~~Sarcosine/dimethylglycine N-methyltransferase~~~
MATRYDDQAIETARQYYNSEDADNFYAIIWGGEDIHIGLYNDDEEPIADASRRTVERMSSLSRQLGPDSYVLDMGAGYGG
SARYLAHKYGCKVAALNLSERENERDRQMNKEQGVDHLIEVVDAAFEDVPYDDGVFDLVWSQDSFLHSPDRERVLREASR
VLRSGGEFIFTDPMQADDCPEGVIQPILDRIHLETMGTPNFYRQTLRDLGFEEITFEDHTHQLPRHYGRVRRELDRREGE
LQGHVSAEYIERMKNGLDHWVNGGNKGYLTWGIFYFRKG
>Q7U4Z9 2.1.1.161~~~bsmB~~~Dimethylglycine N-methyltransferase~~~COG2230
MGTTNGCAADSVAATYYDSQDADQFYEQVWGGEDIHIGLYATPDEAIATASDRTVHALLDLADPLPQGGCVVDLGAGYGG
ASRRLARWSERPVHAINISAVENDRHRRLNVDAGLEQQITVHDASFEQVPMADASADLVWSQDAILHAGDRAKVLAEVSR
LLKPGGCFVFTDPMAADGVEMGLLQPILDRIHLPDLASPSRYKAWGEAVGLTMEVWDERTEMLVRHYDRVRQDTRLRRAE
LETSISSGYLDRMDVGLGHWVDGGQQGRLSWGLMRLRKPG
>O34889 ~~~sdpA~~~Sporulation-delaying protein SdpA~~~
MTICFLLFSSYYFSNISPQNPLFKKNFLQQLSPQGFGFYSKSPTEENISFHTKENLKLPNALPNNFFGIKREGRVQAIEL
GKIVENIDPKNWKTCENNNSCTNLEKQIKPIKVIKNEDYIHLSKGEYLIYRQKPLSWYWIDFKQTTSFERKVLKIKIV
>P83309 1.14.11.43~~~sdpA~~~(S)-phenoxypropionate/alpha-ketoglutarate-dioxygenase~~~
MQTTLQITPTGATLGATVTGVHLATLDDAGFAALHAAWLQHALLIFPGQHLSNDQQITFAKRFGAIERIGGGDIVAISNV
KADGTVRQHSPAEWDDMMKVIVGNMAWHADSTYMPVMAQGAVFSAEVVPAVGGRTCFADMRAAYDALDEATRALVHQRSA
RHSLVYSQSKLGHVQQAGSAYIGYGMDTTATPLRPLVKVHPETGRPSLLIGRHAHAIPGMDAAESERFLEGLVDWACQAP
RVHAHQWAAGDVVVWDNRCLLHRAEPWDFKLPRVMWHSRLAGRPETEGAALV
>Q700X4 1.14.11.43~~~sdpA~~~(S)-phenoxypropionate/alpha-ketoglutarate-dioxygenase~~~COG2175
MSPAFDIAPLDATFGAVVTGVKLADLDDAGWLDLQAAWLEYALLVFPDQHLTREQQIAFARRFGPLEFEMAAISNVRPDG
SLRVESDNDDMMKILKGNMGWHADSTYMPVQAKGAVFSAEVVPSVGGQTGFADMRAAYDALDEDLKARVETLQARHSLHY
SQSKLGHQTKAADGEYSGYGLHDGPVPLRPLVKIHPETGRKSLLIGRHAHAIPGLEPAESERLLQQLIDFACQPPRIYHH
DWAPGDAVLWDNRCLLHQATPWDMTQKRIMWHSRIAGDPASETALAH
>O34616 ~~~sdpB~~~Sporulation-delaying protein SdpB~~~
MKILNSLEGYIDTYNPWKNTYALFRSLLGFSTLLVLLFNSTDILFSYSANNVTCENVYIPTAFCFAKEYSINFEIIRYLM
IFILTLVVIGWRPRFTGLFHWYICYSIQTSALTIDGGEQIATVLSFLILPVTLLDSRRNHWNIKKNNNESFTKKTVLFYI
MTIIKIQVFIIYLNAALERLKNKEWAEGTAIYYFFSDPVFGLPEYQLNLMNPLLESNFIVVITWLVTIFELFLAASIISN
IRIKRIALVLGILFHIGIIFSIGIVSFGLIMISALIIYLHPVQQNITMNWCSPLFKYIYVKGKRNFKRIGGESVKFLTKL
FHS
>O34344 ~~~sdpC~~~Sporulation delaying protein C~~~
MKSKLLRLLIVSMVTILVFSLVGLSKESSTSAKENHTFSGEDYFRGLLFGQGEVGKLISNDLDPKLVKEANSTEGKKLVN
DVVKFIKKDQPQYMDELKQSIDSKDPKKLIENMTKADQLIQKYAKKNENVKYSSNKVTPSCGLYAVCVAAGYLYVVGVNA
VALQTAAAVTTAVWKYVAKYSSSASNNSDLEAAAAKTLKLIHQ
>O32241 ~~~sdpI~~~Immunity protein SdpI~~~COG5658
MKKNIISIIIVCLSFLTSIILYQYLPEEIPIQWSGNKPAAIVSKPLTIFIIPVVMLIYYLTFYMLTIKSTQKNKALLFLA
SNNMLILLYILQLSTLLISLGYEVNIDLIIGLGVGIFLIIGGNSMQLAEQNHLIGLRTPWTLKDETVWKLGNRFASKVLV
VCGFIIAVLSFFTGEYIILIMIVLVLLALVISTLASYHYYKKLNGSR
>O32242 ~~~sdpR~~~Transcriptional repressor SdpR~~~COG0640
MNNVFKAISDPTRRKILDLLKGGDMTAGDIAEHFNISKPSISHHLNILKQAEVISDHRKGQFIYYSLNTTVLQDSINWML
NFINKGDNDL
>Q2G0L5 ~~~sdrC~~~Serine-aspartate repeat-containing protein C~~~COG3266
MNNKKTATNRKGMIPNRLNKFSIRKYSVGTASILVGTTLIFGLSGHEAKAAEHTNGELNQSKNETTAPSENKTTKKVDSR
QLKDNTQTATADQPKVTMSDSATVKETSSNMQSPQNATANQSTTKTSNVTTNDKSSTTYSNETDKSNLTQAKDVSTTPKT
TTIKPRTLNRMAVNTVAAPQQGTNVNDKVHFSNIDIAIDKGHVNQTTGKTEFWATSSDVLKLKANYTIDDSVKEGDTFTF
KYGQYFRPGSVRLPSQTQNLYNAQGNIIAKGIYDSTTNTTTYTFTNYVDQYTNVRGSFEQVAFAKRKNATTDKTAYKMEV
TLGNDTYSEEIIVDYGNKKAQPLISSTNYINNEDLSRNMTAYVNQPKNTYTKQTFVTNLTGYKFNPNAKNFKIYEVTDQN
QFVDSFTPDTSKLKDVTDQFDVIYSNDNKTATVDLMKGQTSSNKQYIIQQVAYPDNSSTDNGKIDYTLDTDKTKYSWSNS
YSNVNGSSTANGDQKKYNLGDYVWEDTNKDGKQDANEKGIKGVYVILKDSNGKELDRTTTDENGKYQFTGLSNGTYSVEF
STPAGYTPTTANVGTDDAVDSDGLTTTGVIKDADNMTLDSGFYKTPKYSLGDYVWYDSNKDGKQDSTEKGIKGVKVTLQN
EKGEVIGTTETDENGKYRFDNLDSGKYKVIFEKPAGLTQTGTNTTEDDKDADGGEVDVTITDHDDFTLDNGYYEEETSDS
DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS
DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDNDSDSDSDSDSDSDSDSDSDSDS
DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDNDSDSDSDSDSDAGKHTPAKPMSTVKDQHKTAKALPE
TGSENNNSNNGTLFGGLFAALGSLLLFGRRKKQNK
>O86487 ~~~sdrC~~~Serine-aspartate repeat-containing protein C~~~
MNNKKTATNRKGMIPNRLNKFSIRKYSVGTASILVGTTLIFGLSGHEAKAAEHTNGELNQSKNETTAPSENKTTKKVDSR
QLKDNTQTATADQPKVTMSDSATVKETSSNMQSPQNATANQSTTKTSNVTTNDKSSTTYSNETDKSNLTQAKDVSTTPKT
TTIKPRTLNRMAVNTVAAPQQGTNVNDKVHFSNIDIAIDKGHVNQTTGKTEFWATSSDVLKLKANYTIDDSVKEGDTFTF
KYGQYFRPGSVRLPSQTQNLYNAQGNIIAKGIYDSTTNTTTYTFTNYVDQYTNVRGSFEQVAFAKRKNATTDKTAYKMEV
TLGNDTYSEEIIVDYGNKKAQPLISSTNYINNEDLSRNMTAYVNQPKNTYTKQTFVTNLTGYKFNPNAKNFKIYEVTDQN
QFVDSFTPDTSKLKDVTDQFDVIYSNDNKTATVDLMKGQTSSNKQYIIQQVAYPDNSSTDNGKIDYTLDTDKTKYSWSNS
YSNVNGSSTANGDQKKYNLGDYVWEDTNKDGKQDANEKGIKGVYVILKDSNGKELDRTTTDENGKYQFTGLSNGTYSVEF
STPAGYTPTTANVGTDDAVDSDGLTTTGVIKDADNMTLDSGFYKTPKYSLGDYVWYDSNKDGKQDSTEKGIKGVKVTLQN
EKGEVIGTTETDENGKYRFDNLDSGKYKVIFEKPAGLTQTGTNTTEDDKDADGGEVDVTITDHDDFTLDNGYYEEETSDS
DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSNSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS
DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDNDSDS
DSDSDSDAGKHTPAKPMSTVKDQHKTAKALPETGSENNNSNNGTLFGGLFAALGSLLLFGRRKKQNK
>Q7A781 ~~~sdrC~~~Serine-aspartate repeat-containing protein C~~~
MNNKKTATNRKGMIPNRLNKFSIRKYSVGTASILVGTTLIFGLSGHEAKAAEHTNGELNQSKNETTAPSENKTTEKVDSR
QLKDNTQTATADQPKVTMSDSATVKETSSNMQSPQNATASQSTTQTSNVTTNDKSSTTYSNETDKSNLTQAKNVSTTPKT
TTIKQRALNRMAVNTVAAPQQGTNVNDKVHFTNIDIAIDKGHVNKTTGNTEFWATSSDVLKLKANYTIDDSVKEGDTFTF
KYGQYFRPGSVRLPSQTQNLYNAQGNIIAKGIYDSKTNTTTYTFTNYVDQYTNVSGSFEQVAFAKRENATTDKTAYKMEV
TLGNDTYSKDVIVDYGNQKGQQLISSTNYINNEDLSRNMTVYVNQPKKTYTKETFVTNLTGYKFNPDAKNFKIYEVTDQN
QFVDSFTPDTSKLKDVTGQFDVIYSNDNKTATVDLLNGQSSSDKQYIIQQVAYPDNSSTDNGKIDYTLETQNGKSSWSNS
YSNVNGSSTANGDQKKYNLGDYVWEDTNKDGKQDANEKGIKGVYVILKDSNGKELDRTTTDENGKYQFTGLSNGTYSVEF
STPAGYTPTTANAGTDDAVDSDGLTTTGVIKDADNMTLDSGFYKTPKYSLGDYVWYDSNKDGKQDSTEKGIKGVKVTLQN
EKGEVIGTTETDENGKYRFDNLDSGKYKVIFEKPAGLTQTGTNTTEDDKDADGGEVDVTITDHDDFTLDNGYYEEETSDS
DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSESDSDSDSDSDSDSDSDSDSDSDSDSDS
DSDSDSDSDSDSDSDSDSDSDSDNDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS
DSDSDSDSDSDSDAGKHTPTKPMSTVKDQHKTAKALPETGSENNNSNNGTLFGGLFAALGSLLLFGRRKKQNK
>Q2G0L4 ~~~sdrD~~~Serine-aspartate repeat-containing protein D~~~COG4932
MLNRENKTAITRKGMVSNRLNKFSIRKYTVGTASILVGTTLIFGLGNQEAKAAESTNKELNEATTSASDNQSSDKVDMQQ
LNQEDNTKNDNQKEMVSSQGNETTSNGNKLIEKESVQSTTGNKVEVSTAKSDEQASPKSTNEDLNTKQTISNQEALQPDL
QENKSVVNVQPTNEENKKVDAKTESTTLNVKSDAIKSNDETLVDNNSNSNNENNADIILPKSTAPKRLNTRMRIAAVQPS
STEAKNVNDLITSNTTLTVVDADKNNKIVPAQDYLSLKSQITVDDKVKSGDYFTIKYSDTVQVYGLNPEDIKNIGDIKDP
NNGETIATAKHDTANNLITYTFTDYVDRFNSVQMGINYSIYMDADTIPVSKNDVEFNVTIGNTTTKTTANIQYPDYVVNE
KNSIGSAFTETVSHVGNKENPGYYKQTIYVNPSENSLTNAKLKVQAYHSSYPNNIGQINKDVTDIKIYQVPKGYTLNKGY
DVNTKELTDVTNQYLQKITYGDNNSAVIDFGNADSAYVVMVNTKFQYTNSESPTLVQMATLSSTGNKSVSTGNALGFTNN
QSGGAGQEVYKIGNYVWEDTNKNGVQELGEKGVGNVTVTVFDNNTNTKVGEAVTKEDGSYLIPNLPNGDYRVEFSNLPKG
YEVTPSKQGNNEELDSNGLSSVITVNGKDNLSADLGIYKPKYNLGDYVWEDTNKNGIQDQDEKGISGVTVTLKDENGNVL
KTVTTDADGKYKFTDLDNGNYKVEFTTPEGYTPTTVTSGSDIEKDSNGLTTTGVINGADNMTLDSGFYKTPKYNLGNYVW
EDTNKDGKQDSTEKGISGVTVTLKNENGEVLQTTKTDKDGKYQFTGLENGTYKVEFETPSGYTPTQVGSGTDEGIDSNGT
STTGVIKDKDNDTIDSGFYKPTYNLGDYVWEDTNKNGVQDKDEKGISGVTVTLKDENDKVLKTVTTDENGKYQFTDLNNG
TYKVEFETPSGYTPTSVTSGNDTEKDSNGLTTTGVIKDADNMTLDSGFYKTPKYSLGDYVWYDSNKDGKQDSTEKGIKDV
KVTLLNEKGEVIGTTKTDENGKYCFDNLDSGKYKVIFEKPAGLTQTVTNTTEDDKDADGGEVDVTITDHDDFTLDNGYFE
EDTSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS
DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS
DSDSDSDSDAGKHTPVKPMSTTKDHHNKAKALPETGSENNGSNNATLFGGLFAALGSLLLFGRRKKQNK
>O86488 ~~~sdrD~~~Serine-aspartate repeat-containing protein D~~~
MLNRENKTAITRKGMVSNRLNKFSIRKYTVGTASILVGTTLIFGLGNQEAKAAESTNKELNEATTSASDNQSSDKVDMQQ
LNQEDNTKNDNQKEMVSSQGNETTSNGNKLIEKESVQSTTGNKVEVSTAKSDEQASPKSTNEDLNTKQTISNQEALQPDL
QENKSVVNVQPTNEENKKVDAKTESTTLNVKSDAIKSNDETLVDNNSNSNNENNADIILPKSTAPKRLNTRMRIAAVQPS
STEAKNVNDLITSNTTLTVVDADKNNKIVPAQDYLSLKSQITVDDKVKSGDYFTIKYSDTVQVYGLNPEDIKNIGDIKDP
NNGETIATAKHDTANNLITYTFTDYVDRFNSVQMGINYSIYMDADTIPVSKNDVEFNVTIGNTTTKTTANIQYPDYVVNE
KNSIGSAFTETVSHVGNKENPGYYKQTIYVNPSENSLTNAKLKVQAYHSSYPNNIGQINKDVTDIKIYQVPKGYTLNKGY
DVNTKELTDVTNQYLQKITYGDNNSAVIDFGNADSAYVVMVNTKFQYTNSESPTLVQMATLSSTGNKSVSTGNALGFTNN
QSGGAGQEVYKIGNYVWEDTNKNGVQELGEKGVGNVTVTVFDNNTNTKVGEAVTKEDGSYLIPNLPNGDYRVEFSNLPKG
YEVTPSKQGNNEELDSNGLSSVITVNGKDNLSADLGIYKPKYNLGDYVWEDTNKNGIQDQDEKGISGVTVTLKDENGNVL
KTVTTDADGKYKFTDLDNGNYKVEFTTPEGYTPTTVTSGSDIEKDSNGLTTTGVINGADNMTLDSGFYKTPKYNLGNYVW
EDTNKDGKQDSTEKGISGVTVTLKNENGEVLQTTKTDKDGKYQFTGLENGTYKVEFETPSGYTPTQVGSGTDEGIDSNGT
STTGVIKDKDNDTIDSGFYKPTYNLGDYVWEDTNKNGVQDKDEKGISGVTVTLKDENDKVLKTVTTDENGKYQFTDLNNG
TYKVEFETPSGYTPTSVTSGNDTEKDSNGLTTTGVIKDADNMTLDSGFYKTPKYSLGDYVWYDSNKDGKQDSTEKGIKDV
KVTLLNEKGEVIGTTKTDENGKYCFDNLDSGKYKVIFEKPAGLTQTGTNTTEDDKDADGGEVDVTITDHDDFTLDNGYYE
EETSDSDSDSDSDSDSDRDSDSDSDSDSDSDSDSDSDSDSDSDSDSDRDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS
DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDAGKHTPVKPMSTTKDHHNKAKALPE
TGNENSGSNNATLFGGLFAALGSLLLFGRRKKQNK
>Q7A780 ~~~sdrD~~~Serine-aspartate repeat-containing protein D~~~
MLNRENKTAITRKGMVSNRLNKFSIRKYTVGTASILVGTTLIFGLGNQEAKAAESTNKELNEATTSASDNQSSDKVDMQQ
LNQEDNTKNDNQKEMVSSQGNETTSNGNKSIEKESVQSTTGNKVEVSTAKSDEQASPKSTNEDLNTKQTISNQEGLQPDL
LENKSVVNVQPTNEENKKVDAKTESTTLNVKSDAIKSNAETLVDNNSNSNNENNADIILPKSTAPKSLNTRMRMAAIQPN
STDSKNVNDLITSNTTLTVVDADNSKTIVPAQDYLSLKSQITVDDKVKSGDYFTIKYSDTVQVYGLNPEDIKNIGDIKDP
NNGETIATAKHDTANNLITYTFTDYVDRFNSVKMGINYSIYMDADTIPVDKKDVPFSVTIGNQITTTTADITYPAYKEAD
NNSIGSAFTETVSHVGNVEDPGYYNQVVYVNPMDKDLKGAKLKVEAYHPKYPTNIGQINQNVTNIKIYRVPEGYTLNKGY
DVNTNDLVDVTDEFKNKMTYGSNQSVNLDFGDITSAYVVMVNTKFQYTNSESPTLVQMATLSSTGNKSVSTGNALGFTNN
QSGGAGQEVYKIGNYVWEDTNKNGVQELGEKGVGNVTVTVFDNNTNTKVGEAVTKEDGSYLIPNLPNGDYRVEFSNLPKG
YEVTPSKQGNNEELDSNGLSSVITVNGKDNLSADLGIYKPKYNLGDYVWEDTNKNGIQDQDEKGISGVTVTLKDENGNVL
KTVTTDADGKYKFTDLDNGNYKVEFTTPEGYTPTTVTSGSDIEKDSNGLTTTGVINGADNMTLDSGFYKTPKYNLGNYVW
EDTNKDGKQDSTEKGISGVTVTLKNENGEVLQTTKTDKDGKYQFTGLENGTYKVEFETPSGYTPTQVGSGTDEGIDSNGT
STTGVIKDKDNDTIDSGFYKPTYNLGDYVWEDTNKNGVQDKDEKGISGVTVTLKDENDKVLKTVTTDENGKYQFTDLNNG
TYKVEFETPSGYTPTSVTSGNDTEKDSNGLTTTGVIKDADNMTLDSGFYKTPKYSLGDYVWYDSNKDGKQDSTEKGIKDV
KVILLNEKGEVIGTTKTDENGKYRFDNLDSGKYKVIFEKPTGLTQTGTNTTEDDKDADGGEVDVTITDHDDFTLDNGYYE
EETSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS
DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS
DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDAGKHTPVKPMSTTKDHHNKAKALPETGNENSGSNN
ATLFGGLFAALGSLLLFGRRKKQNK
>O86489 ~~~sdrE~~~Serine-aspartate repeat-containing protein E~~~
MINRDNKKAITKKGMISNRLNKFSIRKYTVGTASILVGTTLIFGLGNQEAKAAENTSTENAKQDDATTSDNKEVVSETEN
NSTTENNSTNPIKKETNTDSQPEAKKESTSSSTQKQQNNVTATTETKPQNIEKENVKPSTDKTATEDTSVILEEKKAPNN
TNNDVTTKPSTSEPSTSEIQTKPTTPQESTNIENSQPQPTPSKVDNQVTDATNPKEPVNVSKEELKNNPEKLKELVRNDS
NTDHSTKPVATAPTSVAPKRVNAKMRFAVAQPAAVASNNVNDLIKVTKQTIKVGDGKDNVAAAHDGKDIEYDTEFTIDNK
VKKGDTMTINYDKNVIPSDLTDKNDPIDITDPSGEVIAKGTFDKATKQITYTFTDYVDKYEDIKSRLTLYSYIDKKTVPN
ETSLNLTFATAGKETSQNVTVDYQDPMVHGDSNIQSIFTKLDEDKQTIEQQIYVNPLKKSATNTKVDIAGSQVDDYGNIK
LGNGSTIIDQNTEIKVYKVNSDQQLPQSNRIYDFSQYEDVTSQFDNKKSFSNNVATLDFGDINSAYIIKVVSKYTPTSDG
ELDIAQGTSMRTTDKYGYYNYAGYSNFIVTSNDTGGGDGTVKPEEKLYKIGDYVWEDVDKDGVQGTDSKEKPMANVLVTL
TYPDGTTKSVRTDANGHYEFGGLKDGETYTVKFETPTGYLPTKVNGTTDGEKDSNGSSVTVKINGKDDMSLDTGFYKEPK
YNLGDYVWEDTNKDGIQDANEPGIKDVKVTLKDSTGKVIGTTTTDASGKYKFTDLDNGNYTVEFETPAGYTPTVKNTTAD
DKDSNGLTTTGVIKDADNMTLDRGFYKTPKYSLGDYVWYDSNKDGKQDSTEKGIKDVTVTLQNEKGEVIGTTKTDENGKY
RFDNLDSGKYKVIFEKPAGLTQTVTNTTEDDKDADGGEVDVTITDHDDFTLDNGYFEEDTSDSDSDSDSDSDSDSDSDSD
SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD
SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDAGKHTPVKPMSTTK
DHHNKAKALPETGSENNGSNNATLFGGLFAALGSLLLFGRRKKQNK
>Q932F7 ~~~sdrE~~~Serine-aspartate repeat-containing protein E~~~
MINRDNKKAITKKGMISNRLNKFSIRKYTVGTASILVGTTLIFGLGNQEAKAAENTSTENAKQDDATTSDNKEVVSETEN
NSTTENDSTNPIKKETNTDSQPEAKEESTTSSTQQQQNNVTATTETKPQNIEKENVKPSTDKTATEDTSVILEEKKAPNY
TNNDVTTKPSTSEIQTKPTTPQESTNIENSQPQPTPSKVDNQVTDATNPKEPVNVSKEELKNNPEKLKELVRNDNNTDRS
TKPVATAPTSVAPKRLNAKMRFAVAQPAAVASNNVNDLITVTKQTIKVGDGKDNVAAAHDGKDIEYDTEFTIDNKVKKGD
TMTINYDKNVIPSDLTDKNDPIDITDPSGEVIAKGTFDKATKQITYTFTDYVDKYEDIKARLTLYSYIDKQAVPNETSLN
LTFATAGKETSQNVSVDYQDPMVHGDSNIQSIFTKLDENKQTIEQQIYVNPLKKTATNTKVDIAGSQVDDYGNIKLGNGS
TIIDQNTEIKVYKVNPNQQLPQSNRIYDFSQYEDVTSQFDNKKSFSNNVATLDFGDINSAYIIKVVSKYTPTSDGELDIA
QGTSMRTTDKYGYYNYAGYSNFIVTSNDTGGGDGTVKPEEKLYKIGDYVWEDVDKDGVQGTDSKEKPMANVLVTLTYPDG
TTKSVRTDANGHYEFGGLKDGETYTVKFETPAGYLPTKVNGTTDGEKDSNGSSITVKINGKDDMSLDTGFYKEPKYNLGD
YVWEDTNKDGIQDANEPGIKDVKVTLKDSTGKVIGTTTTDASGKYKFTDLDNGNYTVEFETPAGYTPTVKNTTAEDKDSN
GLTTTGVIKDADNMTLDSGFYKTPKYSLGDYVWYDSNKDGKQDSTEKGIKDVKVTLLNEKGEVIGTTKTDENGKYRFDNL
DSGKYKVIFEKPAGLTQTVTNTTEDDKDADGGEVDVTITDHDDFILDNGYFEEDTSDSDSDSDSDSDSDSDSDSDSDSDS
DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS
DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDAGKHTPVKPMSTTKDHHNKAKALPETGSENNGSNNATLF
GGLFAALGSLLLFGRRKKQNK
>Q99W46 ~~~sdrE~~~Serine-aspartate repeat-containing protein E~~~
MINRDNKKAITKKGMISNRLNKFSIRKYTVGTASILVGTTLIFGLGNQEAKAAENTSTENAKQDDATTSDNKEVVSETEN
NSTTENDSTNPIKKETNTDSQPEAKEESTTSSTQQQQNNVTATTETKPQNIEKENVKPSTDKTATEDTSVILEEKKAPNY
TNNDVTTKPSTSEIQTKPTTPQESTNIENSQPQPTPSKVDNQVTDATNPKEPVNVSKEELKNNPEKLKELVRNDNNTDRS
TKPVATAPTSVAPKRLNAKMRFAVAQPAAVASNNVNDLITVTKQTIKVGDGKDNVAAAHDGKDIEYDTEFTIDNKVKKGD
TMTINYDKNVIPSDLTDKNDPIDITDPSGEVIAKGTFDKATKQITYTFTDYVDKYEDIKARLTLYSYIDKQAVPNETSLN
LTFATAGKETSQNVSVDYQDPMVHGDSNIQSIFTKLDENKQTIEQQIYVNPLKKTATNTKVDIAGSQVDDYGNIKLGNGS
TIIDQNTEIKVYKVNPNQQLPQSNRIYDFSQYEDVTSQFDNKKSFSNNVATLDFGDINSAYIIKVVSKYTPTSDGELDIA
QGTSMRTTDKYGYYNYAGYSNFIVTSNDTGGGDGTVKPEEKLYKIGDYVWEDVDKDGVQGTDSKEKPMANVLVTLTYPDG
TTKSVRTDANGHYEFGGLKDGETYTVKFETPAGYLPTKVNGTTDGEKDSNGSSITVKINGKDDMSLDTGFYKEPKYNLGD
YVWEDTNKDGIQDANEPGIKDVKVTLKDSTGKVIGTTTTDASGKYKFTDLDNGNYTVEFETPAGYTPTVKNTTAEDKDSN
GLTTTGVIKDADNMTLDSGFYKTPKYSLGDYVWYDSNKDGKQDSTEKGIKDVKVTLLNEKGEVIGTTKTDENGKYRFDNL
DSGKYKVIFEKPAGLTQTVTNTTEDDKDADGGEVDVTITDHDDFTLDNGYFEEDTSDSDSDSDSDSDSDSDSDSDSDSDS
DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS
DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDAGKHTPVKPMSTTKDHHNKAKALPETGSENNGSNNATLF
GGLFAALGSLLLFGRRKKQNK
>Q6GBS4 ~~~sdrE~~~Serine-aspartate repeat-containing protein E~~~
MINRDNKKAITKKGMISNRLNKFSIRKYTVGTASILVGTTLIFGLGNQEAKAAENTSTENAKQDDATTSDNKEVVSEAEN
NSTTENDSTNPIKKETNTDSQPEAKEESTKSSTQQQQNNVTATTETKPQNIEKENVKPSTDKTATEDTSVILEEKKAPNN
TNNDVTTKPSTSEIQTKPTTPQESTNIENSQPQPTPSKVDNQVTDATNPKEPVNVSKEELKNNPEKLKELVRNDSNTDHS
TKPVATAPTSVAPKRVNAKMRFAVAQPAAVASNNVNDLIKVTKQTIKVGDGKDNVAAAHDGKDIEYDTEFTIDNKVKKGD
TMTINYDKNVIPSDLTDKNDPIDITDPSGEVIAKGTFDKATKQITYTFTDYVDKYEDIKSRLTLYSYIDKKTVPNETSLN
LTFATAGKETSQNVTVDYQDPMVHGDSNIQSIFTKLDEDKQTIEQQIYVNPLKKSATNTKVDIAGSQVDDYGNIKLGNGS
TIIDQNTEIKVYKVNSDQQLPQSNRIYDFSQYEDVTSQFDNKKSFSNNVATLDFGDINSAYIIKVVSKYTPTSDGELDIA
QGTSMRTTDKYGYYNYAGYSNFIVTSNDSGGGDGTVKPEEKLYKIGDYVWEDVDKDGVQGTDSKEKPMANVLVTLTYPDG
TTKSVRTDAKGHYEFGGLKDGETYTVKFETPTGYLPTKVNGTTDGEKDSNGSSVTVKINGKDDMSLDTGFYKEPKYNLGD
YVWEDTNKDGIQDANEPGIKDVKVTLKDSTGKVIGTTTTDASGKYKFTDLDNGNYTVEFETPAGYTPTVKNTTAEDKDSN
GLTTTGVIKDADNMTLDSGFYKTPKYSLGDYVWYDSNKDGKQDSTEKGIKDVTVTLQNEKGEVIGTTKTDENGKYRFDNL
DSGKYKVIFEKPAGLTQTVTNTTEDDKDADGGEVDVTITDHDDFTLDNGYFEEDTSDSDSDSDSDSDSDSDSDSDSDSDS
DSDSDSESDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS
DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDAGKHTPVKPMSTTKDHHNKAKALPETGSENNGSNNATLF
GGLFAALGSLLLFGRRKKQNK
>Q9KI14 ~~~sdrF~~~Serine-aspartate repeat-containing protein F~~~
MKKRRQGPINKRVDFLSNKVNKYSIRKFTVGTASILVGATLMFGAADNEAKAAEDNQLESASKEEQKGSRDNENSKLNQV
DLDNGSHSSEKTTNVNNATEVKKVEAPTTSDVSKPKANEAVVTNESTKPKTTEAPTVNEESIAETPKTSTTQQDSTEKNN
PSLKDNLNSSSTTSKESKTDEHSTKQAQMSTNKSNLDTNDSPTQSEKTSSQANNDSTDNQSAPSKQLDSKPSEQKVYKTK
FNDEPTQDVEHTTTKLKTPSVSTDSSVNDKQDYTRSAVASLGVDSNETEAITNAVRDNLDLKAASREQINEAIIAEALKK
DFSNPDYGVDTPLALNRSQSKNSPHKSASPRMNLMSLAAEPNSGKNVNDKVKITNPTLSLNKSNNHANNVIWPTSNEQFN
LKANYELDDSIKEGDTFTIKYGQYIRPGGLELPAIKTQLRSKDGSIVANGVYDKTTNTTTYTFTNYVDQYQNITGSFDLI
ATPKRETAIKDNQNYPMEVTIANEVVKKDFIVDYGNKKDNTTTAAVANVDNVNNKHNEVVYLNQNNQNPKYAKYFSTVKN
GEFIPGEVKVYEVTDTNAMVDSFNPDLNSSNVKDVTSQFAPKVSADGTRVDINFARSMANGKKYIVTQAVRPTGTGNVYT
EYWLTRDGTTNTNDFYRGTKSTTVTYLNGSSTAQGDNPTYSLGDYVWLDKNKNGVQDDDEKGLAGVYVTLKDSNNRELQR
VTTDQSGHYQFDNLQNGTYTVEFAIPDNYTPSPANNSTNDAIDSDGERDGTRKVVVAKGTINNADNMTVDTGFYLTPKYN
VGDYVWEDTNKDGIQDDNEKGISGVKVTLKNKNGDTIGTTTTDSNGKYEFTGLENGDYTIEFETPEGYTPTKQNSGSDEG
KDSNGTKTTVTVKDADNKTIDSGFYKPTYNLGDYVWEDTNKDGIQDDSEKGISGVKVTLKDKNGNAIGTTTTDASGHYQF
KGLENGSYTVEFETPSGYTPTKANSGQDITVDSNGITTTGIINGADNLTIDSGFYKTPKYSVGDYVWEDTNKDGIQDDNE
KGISGVKVTLKDEKGNIISTTTTDENGKYQFDNLDSGNYIIHFEKPEGMTQTTANSGNDDEKDADGEDVRVTITDHDDFS
IDNGYFDDDSDSDSDADSDSDSDSDSDADSDSDADSDSDADSDSDSDSDSDADSDSDSDSDSDSDSDSDADSDSDSDSDS
DADSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDADSDSDADSDSDSDSDSDADSDSDSDSDSDADSDS
DSDSDSDSDSDSDADSDSDSDSDSDSDSDSDSDSDSDSDSDSDADSDSDSDSDSDSDSDSDSDSDSDSDSDSDADSDADS
DSDADSDSDADSDSDSDSDSDADSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDADSDSDSDSDSDSDSDSDADSDS
DSDSDSDADSDSDSDSDSDADSDSDSDSDSDADSDSDSDSDSDSDSDSDADSDSDSDSDSDSDSDSDSDSDSDSDSDSDS
DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS
DSDSDSDSDSDSDSDSDSDSDADSDSDSDSDSDADSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS
DSDSDSDKNAKDKLPDTGANEDHDSKGTLLGTLFAGLGALLLGRRRKKDNKEK
>Q9KI13 ~~~sdrG~~~Serine-aspartate repeat-containing protein G~~~
MIKKNNLLTKKKPIANKSNKYAIRKFTVGTASIVIGAALLFGLGHNEAKAEENTVQDVKDSNMDDELSDSNDQSSNEEKN
DVINNSQSINTDDDNQIKKEETNSNDAIENRSKDITQSTTNVDENEATFLQKTPQDNTQLKEEVVKEPSSVESSNSSMDT
AQQPSHTTINSEASIQTSDNEENSRVSDFANSKIIESNTESNKEENTIEQPNKVREDSITSQPSSYKNIDEKISNQDELL
NLPINEYENKVRPLSTTSAQPSSKRVTVNQLAAEQGSNVNHLIKVTDQSITEGYDDSDGIIKAHDAENLIYDVTFEVDDK
VKSGDTMTVNIDKNTVPSDLTDSFAIPKIKDNSGEIIATGTYDNTNKQITYTFTDYVDKYENIKAHLKLTSYIDKSKVPN
NNTKLDVEYKTALSSVNKTITVEYQKPNENRTANLQSMFTNIDTKNHTVEQTIYINPLRYSAKETNVNISGNGDEGSTII
DDSTIIKVYKVGDNQNLPDSNRIYDYSEYEDVTNDDYAQLGNNNDVNINFGNIDSPYIIKVISKYDPNKDDYTTIQQTVT
MQTTINEYTGEFRTASYDNTIAFSTSSGQGQGDLPPEKTYKIGDYVWEDVDKDGIQNTNDNEKPLSNVLVTLTYPDGTSK
SVRTDEEGKYQFDGLKNGLTYKITFETPEGYTPTLKHSGTNPALDSEGNSVWVTINGQDDMTIDSGFYQTPKYSLGNYVW
YDTNKDGIQGDDEKGISGVKVTLKDENGNIISTTTTDENGKYQFDNLNSGNYIVHFDKPSGMTQTTTDSGDDDEQDADGE
EVHVTITDHDDFSIDNGYYDDDSDSDSDSDSDSDDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSGL
DNSSDKNTKDKLPDTGANEDHDSKGTLLGALFAGLGALLLGKRRKNRKNKN
>Q8KWM1 ~~~sdrI~~~Serine-aspartate repeat-containing protein I~~~
MNFKGVKLLKNSKKRLDFLPNTLNKYSIRKFTVGTASILVGATLFLGVSNEAEAAEKIDSPTKEKVATTEEAATKEEAAT
TEEPATKEEAATTEEPATKEEAAIAEEPATKEEAATTEEPATKEEAAIAEEPATKEEAATTEEPATKEEAATTEEPATKE
EAAIAEEPATKEEAATTEEPATKEEAAIAEEPATKEEAVTSEEAATKEKAAIAEEPATKEEAAIAEEPETKEEAATTEEP
ATKEEAAIAEEAATKEKAVTSEEAATKEKAAIAEEAATKEKAAIAEEPETKEEAATTEEPETKEEAAIAEEPATKEKAVT
SEEAHGINNKNKQLLDMDKNSTIDEKFDYAKQAINELNINQKDISNIEASIKNNSDLKNLSKEELNNEILRAALVNESNN
NDYGLQTLSAIEPLTTNVRNKNNSLSPVSRLKMLATATSGQNVNDKINITNASLTLNQKNNQHDDNTVWPTSNEQLRLSA
DYELDNSIKEGDTFTIKYGDYIRPGALELPAKNTQLRSKEGSIVANGVYDENTTTTTYTFTNYVDQYQNITGSFNLLATP
KRETVTTDKQTYPMNVTIANQEVSENFVVDYGNHEDHLTNAAVVNVDNVNNQHNEVVYLNQSGDRIYDAKYFSIVQNGTF
IPNEVKVYEVLDDNVLVDSFNPDLNGPAVKDVTSEFTPQYSLNNTRVDIDLNRSNMNKGSRYIITQAVKPSGTGNVNTTY
ELTRYGNEASRYPTGTKSTTVSYINGSSTAQGDNPTYNLGDYVWLDKNKDGIQNDDEKGISGVYVILKDSNNKELQRATT
DDTGRYQFNNLQNGTYNVEFVIPNNYTPSPSNTIDNDTIDSDGQKDGDSNVVVAKGTINNADNMTVDTGFYETPKYSLGD
YVWKDTNKDGVQDSDEKGIQGVTVTLKDKNGNVLKTTTTDENGSYRFDNLDSGDYIVHFEKPEGLTQTTTNSDSDENKDA
DGEEVHVTITDHDDFSIDNGYFDEDSDADADSDADADSDADADADADSDADADADADSDADSDADADADSDADADADADA
DSDSDADADADADADSDADADADSDADADADADSDADADADSDADSDADADADSDADADADADADSDADADSDADADADA
DADSDADADADADSDADADSDSDADADADADSDADADADADSDADADADSDADADADSDADADADSDADSDADADADSDA
DADADADADSDADADADSDADADADADADSDADSDADSDADSDADADADSDADADADADADSDADADSDADADSDADADA
DADADSDADADSDADADSDADADSDADADADADSDADADADADSDADADADSDSDADADADADSDADADADADSDADADS
DADADSDADADADADSDADADSDADADADADSDADADADSDADADADADSDADADSDADADADADADSDADADADADSDA
DADADADADADSDADADADSDADADADSDADSDADADADSDADADADADSDADADSDADADSDADADSDADADADADSDA
DADSDADADSDADADADADSDADADSDADADADSDADADADSDADADSDADADADADADSDADADSDADSDADADADSDS
DADADADADSDADADADADSDADADSDADADADSDSDADADADADSDADADSDADADADADSDADADADSDADADADSDA
DADADSDADSDADADADSDADADADADSDADADSDADADADADSDADADSDADADADADSDADADSDADADSDADADSDA
DADSDADADSDADADADADSDADADSDADADADADSDADADADSDADADADSDADADADSDADADADSDADADADSDADK
YHNDTADKSNDNELPDTGNNTQNNGTLFGSLFAALGGLFLVGSRRKNKNNEEK
>Q99S97 ~~~sdrM~~~Multidrug efflux pump SdrM~~~
MRLKSIITVIALILIMFMSAIESSIISLALPTIKQDLNAGNLISLIFTAYFIALVIANPIVGELLSRFKIIYVAIAGLLL
FSIGSFMCGLSTNFTMLIISRVIQGFGSGVLMSLSQIVPKLAFEIPLRYKIMGIVGSVWGISSIIGPLLGGGILEFATWH
WLFYINIPIAIIAIILVIWTFHFPEEETVAKSKFDTKGLTLFYVFIGLIMFALLNQQLLLLNFLSFILAIVVAMCLFKVE
KHVSSPFLPVVEFNRSITLVFITDLLTAICLMGFNLYIPVYLQEQLGLSPLQSGLVIFPLSVAWITLNFNLHRIEAKLSR
KVIYLLSFTLLLVSSIIISFGIKLPVLIAFVLILAGLSFGYIYTKDSVIVQEETSPLQMKKMMSFYGLTKNLGASIGSTI
MGYLYAIQSGIFGPNLHNVLSAVAVISIGLIVLWVVFFKEQSSQSKE
>Q5SIL0 ~~~~~~Transcriptional regulator SdrP~~~COG0664
MTQVRETVSFKAGDVILYPGVPGPRDRAYRVLEGLVRLEAVDEEGNALTLRLVRPGGFFGEEALFGQERIYFAEAATDVR
LEPLPENPDPELLKDLAQHLSQGLAEAYRRIERLATQRLKNRMAAALLELSETPLAHEEEGKVVLKATHDELAAAVGSVR
ETVTKVIGELAREGYIRSGYGKIQLLDLKGLKELAESRGQGR
>O24743 2.5.1.84~~~sdsA~~~All-trans-nonaprenyl-diphosphate synthase (geranyl-diphosphate specific)~~~
MAIDFKQDILAPVAQDFAAMDQFINEGISSKVALVMSVSKHVVEAGGKRMRPIMCLLAAYACGETNLKHAQKLAAIIEML
HTATLVHDDVVDESGLRRGRPTANATWNNQTAVLVGDFLIARAFDLLVDLDNMILLKDFSTGTCEIAEGEVLQLQAQHQP
DTTEDIYLQIIHGKTSRLFELATEGAAILAGKPEYREPLRRFAGHFGNAFQIIDDILDYTSDADTLGKNIGDDLMEGKPT
LPLIAAMQNTQGEQRDLIRRSIATGGTSQLEQVIAIVQNSGALDYCHKRATEETERALQALEILPESTYRQALVNLTRLA
LDRIQ
>P0A5Y9 7.4.2.8~~~secA1~~~Protein translocase subunit SecA 1~~~
MLSKLLRLGEGRMVKRLKKVADYVGTLSDDVEKLTDAELRAKTDEFKRRLADQKNPETLDDLLPEAFAVAREAAWRVLDQ
RPFDVQVMGAAALHLGNVAEMKTGEGKTLTCVLPAYLNALAGNGVHIVTVNDYLAKRDSEWMGRVHRFLGLQVGVILATM
TPDERRVAYNADITYGTNNEFGFDYLRDNMAHSLDDLVQRGHHYAIVDEVDSILIDEARTPLIISGPADGASNWYTEFAR
LAPLMEKDVHYEVDLRKRTVGVHEKGVEFVEDQLGIDNLYEAANSPLVSYLNNALKAKELFSRDKDYIVRDGEVLIVDEF
TGRVLIGRRYNEGMHQAIEAKEHVEIKAENQTLATITLQNYFRLYDKLAGMTGTAQTEAAELHEIYKLGVVSIPTNMPMI
REDQSDLIYKTEEAKYIAVVDDVAERYAKGQPVLIGTTSVERSEYLSRQFTKRRIPHNVLNAKYHEQEATIIAVAGRRGG
VTVATNMAGRGTDIVLGGNVDFLTDQRLRERGLDPVETPEEYEAAWHSELPIVKEEASKEAKEVIEAGGLYVLGTERHES
RRIDNQLRGRSGRQGDPGESRFYLSLGDELMRRFNGAALETLLTRLNLPDDVPIEAKMVTRAIKSAQTQVEQQNFEVRKN
VLKYDEVMNQQRKVIYAERRRILEGENLKDQALDMVRDVITAYVDGATGEGYAEDWDLDALWTALKTLYPVGITADSLTR
KDHEFERDDLTREELLEALLKDAERAYAAREAELEEIAGEGAMRQLERNVLLNVIDRKWREHLYEMDYLKEGIGLRAMAQ
RDPLVEYQREGYDMFMAMLDGMKEESVGFLFNVTVEAVPAPPVAPAAEPAELAEFAAAAAAAAQQRSAVDGGARERAPSA
LRAKGVASESPALTYSGPAEDGSAQVQRNGGGAHKTPAGVPAGASRRERREAARRQGRGAKPPKSVKKR
>P71533 7.4.2.8~~~secA1~~~Protein translocase subunit SecA 1~~~COG0653
MLSKLLRLGEGRMVKRLRKVADYVNALSDDVEKLSDAELRAKTEEFKKRVADGEDLDDLLPEAFAVAREAAWRVLNQRHF
DVQVMGGAALHFGNVAEMKTGEGKTLTAVLPSYLNALSGKGVHVVTVNDYLARRDSEWMGRVHRFLGLDVGVILSGMTPD
ERRAAYAADITYGTNNEFGFDYLRDNMAHSVDDMVQRGHNFAIVDEVDSILIDEARTPLIISGPADGASHWYQEFARIVP
MMEKDVHYEVDLRKRTVGVHELGVEFVEDQLGIDNLYEAANSPLVSYLNNALKAKELFQRDKDYIVRNGEVLIVDEFTGR
VLMGRRYNEGMHQAIEAKERVEIKAENQTLATITLQNYFRLYDKLSGMTGTAETEAAELHEIYKLGVVPIPTNKPMVRQD
QSDLIYKTEEAKFLAVVDDVAERHAKGQPVLIGTTSVERSEYLSKMLTKRRVPHNVLNAKYHEQEANIIAEAGRRGAVTV
ATNMAGRGTDIVLGGNVDFLADKRLRERGLDPVETPEEYEAAWHEVLPQVKAECAKEAEQVIEAGGLYVLGTERHESRRI
DNQLRGRSGRQGDPGESRFYLSLGDELMRRFNGATLETLLTRLNLPDDVPIEAKMVSRAIKSAQTQVEQQNFEVRKNVLK
YDEVMNQQRKVIYAERRRILEGENLAEQAHKMLVDVITAYVDGATAEGYAEDWDLETLWTALKTLYPVGIDHRDLIDSDA
VGEPGELTREELLDALIKDAERAYAEREKQIEAIAGEGAMRQLERNVLLNVIDRKWREHLYEMDYLKEGIGLRAMAQRDP
LVEYQREGYDMFVGMLEALKEESVGFLFNVQVEAAPQQPQVAPQAPPPTLSEFAAAAAAKASDSAAKPDSGSVATKERAE
AERPAPALRAKGIDNEAPPLTYTGPSEDGTAQVQRSGNGGRHAAPAGGSRRERREAARKQAKADRPAKSHRKG
>P9WGP5 7.4.2.8~~~secA1~~~Protein translocase subunit SecA 1~~~COG0653
MLSKLLRLGEGRMVKRLKKVADYVGTLSDDVEKLTDAELRAKTDEFKRRLADQKNPETLDDLLPEAFAVAREAAWRVLDQ
RPFDVQVMGAAALHLGNVAEMKTGEGKTLTCVLPAYLNALAGNGVHIVTVNDYLAKRDSEWMGRVHRFLGLQVGVILATM
TPDERRVAYNADITYGTNNEFGFDYLRDNMAHSLDDLVQRGHHYAIVDEVDSILIDEARTPLIISGPADGASNWYTEFAR
LAPLMEKDVHYEVDLRKRTVGVHEKGVEFVEDQLGIDNLYEAANSPLVSYLNNALKAKELFSRDKDYIVRDGEVLIVDEF
TGRVLIGRRYNEGMHQAIEAKEHVEIKAENQTLATITLQNYFRLYDKLAGMTGTAQTEAAELHEIYKLGVVSIPTNMPMI
REDQSDLIYKTEEAKYIAVVDDVAERYAKGQPVLIGTTSVERSEYLSRQFTKRRIPHNVLNAKYHEQEATIIAVAGRRGG
VTVATNMAGRGTDIVLGGNVDFLTDQRLRERGLDPVETPEEYEAAWHSELPIVKEEASKEAKEVIEAGGLYVLGTERHES
RRIDNQLRGRSGRQGDPGESRFYLSLGDELMRRFNGAALETLLTRLNLPDDVPIEAKMVTRAIKSAQTQVEQQNFEVRKN
VLKYDEVMNQQRKVIYAERRRILEGENLKDQALDMVRDVITAYVDGATGEGYAEDWDLDALWTALKTLYPVGITADSLTR
KDHEFERDDLTREELLEALLKDAERAYAAREAELEEIAGEGAMRQLERNVLLNVIDRKWREHLYEMDYLKEGIGLRAMAQ
RDPLVEYQREGYDMFMAMLDGMKEESVGFLFNVTVEAVPAPPVAPAAEPAELAEFAAAAAAAAQQRSAVDGGARERAPSA
LRAKGVASESPALTYSGPAEDGSAQVQRNGGGAHKTPAGVPAGASRRERREAARRQGRGAKPPKSVKKR
>O06446 7.4.2.8~~~secA1~~~Protein translocase subunit SecA 1~~~COG0653
MGFLSKILDGNNKEIKQLGKLADKVIALEEKTAILTDEEIRNKTKQFQTELADIDNVKKQNDYLDKILPEAYALVREGSK
RVFNMTPYKVQIMGGIAIHKGDIAEMRTGEGKTLTATMPTYLNALAGRGVHVITVNEYLSSVQSEEMAELYNFLGLTVGL
NLNSKTTEEKREAYAQDITYSTNNELGFDYLRDNMVNYSEDRVMRPLHFAIIDEVDSILIDEARTPLIISGEAEKSTSLY
TQANVFAKMLKQDEDYKYDEKTKAVHLTEQGADKAERMFKVENLYDVQNVDVISHINTALRAHVTLQRDVDYMVVDGEVL
IVDQFTGRTMPGRRFSEGLHQAIEAKEGVQIQNESKTMASITFQNYFRMYNKLAGMTGTAKTEEEEFRNIYNMTVTQIPT
NKPVQRNDKSDLIYISQKGKFDAVVEDVVEKHKAGQPVLLGTVAVETSEYISNLLKKRGIRHDVLNAKNHEREAEIVAGA
GQKGAVTIATNMAGRGTDIKLGEGVEELGGLAVIGTERHESRRIDDQLRGRSGRQGDKGDSRFYLSLQDELMIRFGSERL
QKMMSRLGLDDSTPIESKMVSRAVESAQKRVEGNNFDARKRILEYDEVLRKQREIIYNERNSIIDEEDSSQVVDAMLRST
LQRSINYYINTADDEPEYQPFIDYINDIFLQEGDITEDDIKGKDAEDIFEVVWAKIEAAYQSQKDILEEQMNEFERMILL
RSIDSHWTDHIDTMDQLRQGIHLRSYAQQNPLRDYQNEGHELFDIMMQNIEEDTCKFILKSVVQVEDNIEREKTTEFGEA
KHVSAEDGKEKVKPKPIVKGDQVGRNDDCPCGSGKKFKNCHGK
>Q7A6R5 7.4.2.8~~~secA1~~~Protein translocase subunit SecA 1~~~
MGFLSKILDGNNKEIKQLGKLADKVIALEEKTAILTDEEIRNKTKQFQTELADIDNVKKQNDYLDKILPEAYALVREGSK
RVFNMTPYKVQIMGGIAIHKGDIAEMRTGEGKTLTATMPTYLNALAGRGVHVITVNEYLSSVQSEEMAELYNFLGLTVGL
NLNSKTTEEKREAYAQDITYSTNNELGFDYLRDNMVNYSEDRVMRPLHFAIIDEVDSILIDEARTPLIISGEAEKSTSLY
TQANVFAKMLKQDEDYKYDEKTKAVHLTEQGADKAERMFKVENLYDVQNVDVISHINTALRAHVTLQRDVDYMVVDGEVL
IVDQFTGRTMPGRRFSEGLHQAIEAKEGVQIQNESKTMASITFQNYFRMYNKLAGMTGTAKTEEEEFRNIYNMTVTQIPT
NKPVQRNDKSDLIYISQKGKFDAVVEDVVEKHKAGQPVLLGTVAVETSEYISNLLKKRGIRHDVLNAKNHEREAEIVAGA
GQKGAVTIATNMAGRGTDIKLGEGVEELGGLAVIGTERHESRRIDDQLRGRSGRQGDKGDSRFYLSLQDELMIRFGSERL
QKMMSRLGLDDSTPIESKMVSRAVESAQKRVEGNNFDARKRILEYDEVLRKQREIIYNERNSIIDEEDSSQVVDAMLRST
LQRSINYYINTADDEPEYQPFIDYINDIFLQEGDITEDDIKGKDAEDIFEVVWAKIEAAYQSQKDILEEQMNEFERMILL
RSIDSHWTDHIDTMDQLRQGIHLRSYAQQNPLRDYQNEGHELFDIMMQNIEEDTCKFILKSVVQVEDNIEREKTTEFGEA
KHVSAEDGKEKVKPKPIVKGDQVGRNDDCPCGSGKKFKNCHGK
>Q183M9 7.4.2.8~~~secA2~~~Protein translocase subunit SecA 2~~~COG0653
MSVIDSILDKADEQEIKKLNVIVDKIDALEDSMKNLSYEELKDMTAIFKNRLKKGETLDDILPEAFAVVREVSKRKLGMR
QYRVQLIGGIVIHQGKIAEMKTGEGKTLVEVAPVYLNALTGKGVHVITVNDYLAERDKELMSPVYESLGMTVGVIISNQD
PNIRKQQYKCDITYGTNSEFGFDYLRDNMVPDLSHKVQRELNFAIVDEVDSILIDEARTPLIIAGDGDEDLKLYELANSF
VKTVKEEDFELDRKDKTIALTASGISKAESFFGITNLTDIKNIELYHHINQALRGHKLMEKDVDYVISNGEVMIVDEFTG
RVMDGRRYTDGLHQAIEAKEGVEIKNESKTMATVTYQNFFRLYEKLSGMTGTAKTEEGEFESIYKLNVVQIPTNRPVIRA
DLHDKVFKTEEEKYSAVVEEIIRIHKTRQPILVGTVSVEKSEKLSKMLKKQGIKHQVLNAKQHDKEAEIISKAGKLDAIT
IATNMAGRGTDISLGAGDKEEEQEVKDLGGLYVIGTERHESRRIDNQLRGRSGRQGDPGTSRFFVSLEDDVIKLYGGKTI
EKLMKRTSSNENTAIESKALTRAIERAQKGVEGKNFEIRKNVLKYDDTINEQRKVIYNERNKVLNDEDIQEDIQKMVKDI
IQEAGETYLIGRKRDYYGYFKHLYSTFMPADTLLIPGVDKKSVQEIIDSTYEISKRVYDLKKMMLGIDKVAELEKTVLLK
VVDQYWIDHIDAMEQLKQYIGLKSYAQKDPFKEYALEGYDMFEALNKNIREATVQYLYKFN
>P0DJP3 7.4.2.8~~~secA2~~~Protein translocase subunit SecA 2~~~COG0653
MRQNYDDRKIVKQYREIARQIVKKEGLYKNMEQAELCEQTNFWREKFKTKPMTDRDKINIFALAREAASRIIGLDAVVVQ
LIGALVLGDGKVAEMKTGEGKTLMSLFVMFIEVMRGNRVHLVTANEYLARRDREEIGQVLEYLGVSVALNESGLDIAQKK
AIYTADVIYGTASEFGFDYLRDNMVRQKEDKVQSGLDFVLIDEADSILIDEARTPLLISDRKEEDLSLYHTANKLVKKMM
KDDYEMEEHKRFVWLNDAGIEKAQKFWGVESLYSAEAQSELRITMLLMRAHFLMHKDKDYVVLDDEVLIIDPHTGRALPG
RRFNDGLHQAIEAKEGVEVKEESRTLATITIQNYFRMYKKISGMTGTAKTEEEEFRQIYNMDVVVIPTNLRVNREDMQDD
IFYTKKEKGRAIVYEVSWRYEKGQPTLIGTSSIKSNEWISGLLDAAGIPHQVLNAKNHAQEAEIIAKAGKRGMVTLATNM
AGRGTDIKLDPDVHKLGGLAVIGTERHESRRIDLQLMGRSGRRGDPGFSKFMISLEDDLLEQFESKSWEKLSAKLKRKAP
RDGKPVNSRKIHAVVVDAQKRLEGANYDIRKDLLSYDEVIDLQRKMVYKERDLLLERNKLGVSSEKILREVAEYSFIHPS
DIPEEELEIYYSRQKELLGGTKFPISFDQVTLMEPREVVEEIVSWHKKERNKFPAETIAAIEREVYLNLMDQMWVMHLDA
MVQLREGIHLRAYGQQDPLVMYQKEGAQLFEKFQADYHFYFAHALLELDPDGLIQG
>A0QYG9 7.4.2.8~~~secA2~~~Protein translocase subunit SecA 2~~~COG0653
MPKTSSAKPGRLSSKFWKLLGASTERNQARSLSEVKGAADFEKKAADLDDEQLTKAAKLLKLEDLAGASDITQFLAIARE
AAERTTGLRPFDVQLLAALRMLAGDVVEMATGEGKTLAGAIAAAGYALGGRRVHVITINDYLARRDAEWMGPLLKALGLT
VGWITADSTADERREAYQCDVTYASVNEIGFDVLRDQLVTDVADLVSPNPDVALIDEADSVLVDEALVPLVLAGTSHREQ
PRVEIIRMVGELEAGKHYDTDAESRNVHLTEAGARVMEAKLGGIDLYSEEHVGTTLTEINVALHAHVLLQRDVHYIVRDD
AVHLINASRGRIASLQRWPDGLQAAVEAKEGIETTETGEVLDTITVQALINRYPRVCGMTGTALAAGEQLRQFYKLGVSP
IPPNTPNIRKDEPDRVYITAAAKIDAIVEHIAEVHKTGQPVLVGTHDVAESEELHEKLLKAGVPAVVLNAKNDAEEAAVI
AEAGKLGAVTVSTQMAGRGTDIRLGGSDVGDDDAEKKKVAELGGLHVVGTGRHHTERLDNQLRGRAGRQGDPGSSVFFSS
WEDDVVAAHLERSKLPMETDPDAGDGRIIAPRAASLLDHAQRVAEGRLLDVHANTWRYNQLIAQQRAIIVERRETLLRTD
TAREELKERSPERYAKLAEELGEDAEERLEKICRLIMLYHLDRGWCEHLAFLADIRESIHLRALGRQNPLDEFHRMAVDA
FASLAADAIEAAQQTFETAESVADEPGVDLSKLARPTSTWTYMVHDNPLADDTMSALSLPGVFR
>P9WGP3 7.4.2.8~~~secA2~~~Protein translocase subunit SecA 2~~~COG0653
MNVHGCPRIAACRCTDTHPRGRPAFAYRWFVPKTTRAQPGRLSSRFWRLLGASTEKNRSRSLADVTASAEYDKEAADLSD
EKLRKAAGLLNLDDLAESADIPQFLAIAREAAERRTGLRPFDVQLLGALRMLAGDVIEMATGEGKTLAGAIAAAGYALAG
RHVHVVTINDYLARRDAEWMGPLLDAMGLTVGWITADSTPDERRTAYDRDVTYASVNEIGFDVLRDQLVTDVNDLVSPNP
DVALIDEADSVLVDEALVPLVLAGTTHRETPRLEIIRLVAELVGDKDADEYFATDSDNRNVHLTEHGARKVEKALGGIDL
YSEEHVGTTLTEVNVALHAHVLLQRDVHYIVRDDAVHLINASRGRIAQLQRWPDGLQAAVEAKEGIETTETGEVLDTITV
QALINRYATVCGMTGTALAAGEQLRQFYQLGVSPIPPNKPNIREDEADRVYITTAAKNDGIVEHITEVHQRGQPVLVGTR
DVAESEELHERLVRRGVPAVVLNAKNDAEEARVIAEAGKYGAVTVSTQMAGRGTDIRLGGSDEADHDRVAELGGLHVVGT
GRHHTERLDNQLRGRAGRQGDPGSSVFFSSWEDDVVAANLDHNKLPMATDENGRIVSPRTGSLLDHAQRVAEGRLLDVHA
NTWRYNQLIAQQRAIIVERRNTLLRTVTAREELAELAPKRYEELSDKVSEERLETICRQIMLYHLDRGWADHLAYLADIR
ESIHLRALGRQNPLDEFHRMAVDAFASLAADAIEAAQQTFETANVLDHEPGLDLSKLARPTSTWTYMVNDNPLSDDTLSA
LSLPGVFR
>P28366 7.4.2.8~~~secA~~~Protein translocase subunit SecA~~~COG0653
MLGILNKMFDPTKRTLNRYEKIANDIDAIRGDYENLSDDALKHKTIEFKERLEKGATTDDLLVEAFAVVREASRRVTGMF
PFKVQLMGGVALHDGNIAEMKTGEGKTLTSTLPVYLNALTGKGVHVVTVNEYLASRDAEQMGKIFEFLGLTVGLNLNSMS
KDEKREAYAADITYSTNNELGFDYLRDNMVLYKEQMVQRPLHFAVIDEVDSILIDEARTPLIISGQAAKSTKLYVQANAF
VRTLKAEKDYTYDIKTKAVQLTEEGMTKAEKAFGIDNLFDVKHVALNHHINQALKAHVAMQKDVDYVVEDGQVVIVDSFT
GRLMKGRRYSEGLHQAIEAKEGLEIQNESMTLATITFQNYFRMYEKLAGMTGTAKTEEEEFRNIYNMQVVTIPTNRPVVR
DDRPDLIYRTMEGKFKAVAEDVAQRYMTGQPVLVGTVAVETSELISKLLKNKGIPHQVLNAKNHEREAQIIEEAGQKGAV
TIATNMAGRGTDIKLGEGVKELGGLAVVGTERHESRRIDNQLRGRSGRQGDPGITQFYLSMEDELMRRFGAERTMAMLDR
FGMDDSTPIQSKMVSRAVESSQKRVEGNNFDSRKQLLQYDDVLRQQREVIYKQRFEVIDSENLREIVENMIKSSLERAIA
AYTPREELPEEWKLDGLVDLINTTYLDEGALEKSDIFGKEPDEMLELIMDRIITKYNEKEEQFGKEQMREFEKVIVLRAV
DSKWMDHIDAMDQLRQGIHLRAYAQTNPLREYQMEGFAMFEHMIESIEDEVAKFVMKAEIENNLEREEVVQGQTTAHQPQ
EGDDNKKAKKAPVRKVVDIGRNAPCHCGSGKKYKNCCGRTE
>C0QZS7 7.4.2.8~~~secA~~~Protein translocase subunit SecA~~~COG0653
MGAMDLVFKLIFGSKEQNDAKILKPIAEKTLTFEEEIKKLSNEELTNKTKEFRERVEKYIGCKTEELDLSKEENKKKLQN
ILDEILPEAFAVVREASIRTTGMRHFDVQVMGGAVLHQGRIAEMKTGEGKTLVATLAVYLNALTGLGVHVVTVNDYLAKR
DAEWMTPIYSMLGISVGILDNTRPHSPERRAVYNCDVVYGTNNEFGFDYLRDNMVTRKEDKVQRKFYFAIVDEVDSILID
EARTPLIISGPAEKNIKMYYEIDRIIPMLKQAEVDERMREVAGTGDYVLDEKDKNVYLTEEGVHKVEKLLNVENLYGAQS
STIVHHVNQALKAHKVFKKDVDYMVTDGEVLIVDEFTGRVLEGRRYSDGLHQAIEAKEKVAIQNESQTYATITFQNYFRM
YPKLSGMTGTAETEAEEFYKIYKLDVAVIPTNKPIARQDLSDRIYRTRKAKFEALAKYIKELQDAGKPALVGTVSVEMNE
ELSKVFKRHKINHEVLNAKNHSREAAIIAQAGEPGAVTLATNMAGRGTDIVLGGNPVAKGVAEIEQILVLMRDKAFKERD
PYKKEELTKKIKSIDLYKEAFVRSVISGKIEEAKELAQKNNADEMIEKIDRIIQINEKAKVDKERVLAAGGLHVIGSERH
EARRIDNQLRGRSGRQGDPGLSVFFLSLEDDLMRLFGGERVSKMMLAMGMGEEEELGHKWLNKSIENAQRKVEGRNFDIR
KHLLEYDDVMNQQRMAVYGERDYILYSDDISPRVEEIISEVTEETIEDISGNKKNVDALEVTKWLNSYLIGIDEDAANKA
VEGGVDNAVKNLTNLLLEAYRKKASEIDEKIFREVEKNIFLSIIDNRWKDHLFAMDSLREGIGLRGYAEKNPLTEYKLEG
YKMFMATMNVIHNELVNLIMRVRIIPNSFDTIERESAFDGGVEEKSSASAMNGGNAQAIQSKVKNAQPNVKMAQKIGRND
PCPCGSGKKYKHCHGKDNPQ
>P10408 7.4.2.8~~~secA~~~Protein translocase subunit SecA~~~COG0653
MLIKLLTKVFGSRNDRTLRRMRKVVNIINAMEPEMEKLSDEELKGKTAEFRARLEKGEVLENLIPEAFAVVREASKRVFG
MRHFDVQLLGGMVLNERCIAEMRTGEGKTLTATLPAYLNALTGKGVHVVTVNDYLAQRDAENNRPLFEFLGLTVGINLPG
MPAPAKREAYAADITYGTNNEYGFDYLRDNMAFSPEERVQRKLHYALVDEVDSILIDEARTPLIISGPAEDSSEMYKRVN
KIIPHLIRQEKEDSETFQGEGHFSVDEKSRQVNLTERGLVLIEELLVKEGIMDEGESLYSPANIMLMHHVTAALRAHALF
TRDVDYIVKDGEVIIVDEHTGRTMQGRRWSDGLHQAVEAKEGVQIQNENQTLASITFQNYFRLYEKLAGMTGTADTEAFE
FSSIYKLDTVVVPTNRPMIRKDLPDLVYMTEAEKIQAIIEDIKERTAKGQPVLVGTISIEKSELVSNELTKAGIKHNVLN
AKFHANEAAIVAQAGYPAAVTIATNMAGRGTDIVLGGSWQAEVAALENPTAEQIEKIKADWQVRHDAVLEAGGLHIIGTE
RHESRRIDNQLRGRSGRQGDAGSSRFYLSMEDALMRIFASDRVSGMMRKLGMKPGEAIEHPWVTKAIANAQRKVESRNFD
IRKQLLEYDDVANDQRRAIYSQRNELLDVSDVSETINSIREDVFKATIDAYIPPQSLEEMWDIPGLQERLKNDFDLDLPI
AEWLDKEPELHEETLRERILAQSIEVYQRKEEVVGAEMMRHFEKGVMLQTLDSLWKEHLAAMDYLRQGIHLRGYAQKDPK
QEYKRESFSMFAAMLESLKYEVISTLSKVQVRMPEEVEELEQQRRMEAERLAQMQQLSHQDDDSAAAAALAAQTGERKVG
RNDPCPCGSGKKYKQCHGRLQ
>P43803 7.4.2.8~~~secA~~~Protein translocase subunit SecA~~~COG0653
MSILTRIFGSRNERVLRKLKKQVVKINKMEPAFEALSDDELKAKTQEFRDRLSGGETLQQILPEAFATVREASKRVLGMR
HFDVQLIGGMVLTNRCIAEMRTGEGKTLTATLPCYLIALEGKGVHVVTVNDYLARRDAETNRPLFEFLGMSVGVNIPGLS
PEEKRAAYAADITYATNSELGFDYLRDNLAHSKEERFQRTLGYALVDEVDSILIDEARTPLIISGQAENSSELYIAVNKL
IPSLIKQEKEDTEEYQGEGDFTLDLKSKQAHLTERGQEKVEDWLIAQGLMPEGDSLYSPSRIVLLHHVMAALRAHTLFEK
DVDYIVKDGEIVIVDEHTGRTMAGRRWSDGLHQAIEAKEGVDVKSENQTVASISYQNYFRLYERLAGMTGTADTEAFEFQ
QIYGLETVVIPTNRPMIRDDRTDVMFENEQYKFNAIIEDIKDCVERQQPVLVGTISVEKSEELSKALDKAGIKHNVLNAK
FHQQEAEIVAEAGFPSAVTIATNMAGRGTDIILGGNWKAQAAKLENPTQEQIEALKAEWEKNHEIVMKAGGLHIIGTERH
ESRRIDNQLRGRSGRQGDPGSSRFYLSLEDGLMRIYLNEGKLNLMRKAFTVAGEAMESKMLAKVIASAQAKVEAFHFDGR
KNLLEYDDVANDQRHAIYEQRNHLLDNDDISETINAIRHDVFNGVIDQYIPPQSLEEQWDIKGLEERLSQEFGMELPISN
WLEEDNNLHEESLRERIVEIAEKEYKEKEALVGEDAMRHFEKGVMLQTLDELWKEHLASMDYLRQGIHLRGYAQKDPKQE
YKKESFRMFTEMLDSLKHQVITALTRVRVRTQEEMEEAERARQEMAARINQNNLPVDENSQTTQNSETEDYSDRRIGRNE
PCPCGSGKKYKHCHGSRVARQ
>P52966 7.4.2.8~~~secA~~~Protein translocase subunit SecA~~~
MLGLGYIGRKLFGTPNDRKVKRTRPLVAKINALEPAFEKLSDAEIVAKTRELQARAQAGESLDALLVEAFANCREAARRA
LGLRAFDTQLMGGIFLHQGNIAEMKTGEGKTLVATFPAYLNALAGKGVHIVTVNDYLARRDSEWMGKVYRHLGLTCGVVY
PFQPDDEKRAAYGADITYATNNELGFDYLRDNMKSSVAEMYQRDHFFAIVDEVDSILIDEARTPLIISGPSQDRSDMYRT
LDAYIPFLTEEHYKLDEKQRNATFTEEGNEFLEQKLQADGLLPEGQSLYDPESTTIVHHIGQALRAHKLFFKDQNYVVTD
DEIVLIDEFTGRMMKGRRLSDGLHQAIEAKERVTIQPENVTLASVTFQNYFRLYEKLAGMTGTAVTEAEEFGDIYKLGVV
EVPTNRPVARKDEHDRVYRTAKEKYAAVIEAIKTAHEKGQPTLVGTTSIEKSEMLSEMLKAEGLPHNVLNARQHEQEAQI
VADAGRLGAITIATNMAGRGTDIQLGGNVEMKVQEEIAANPEAAPEEIRARIEAEHAAEKQKVIEAGGLFVLATERHESR
RIDNQLRGRSGRQGDPGRSLFFLSLEDDLMRIFGSDRLEGVLSKLGMKEGEAIIHPWVNKSLERAQAKVEGRNFDWRKQL
LKFDDVMNDQRKAVFGQRREIMETDEISEIVADMRQQVIDDLIDDFAPPKSYVDQWDIEGMRAAFIDHAGVDLPLADWAA
EEGVDQDVLRERVTAALDAVMAQKTEAFGAETMRVIEKQILLQTIDAKWREHLVTLEHLRSVVGFRGYAQRDPLSEYKTE
SFQLFESMLDSLRYEVTKRLGQIRPMSDEERAEMLRQQAAALAAAEGAADPAEAPAPQPAAQVALAAAPGFVESDPTTWG
EPSRNDPCPCGSGEKFKHCHGRLA
>P47994 7.4.2.8~~~secA~~~Protein translocase subunit SecA~~~COG0653
MGFLTKIVDGNKREIKRLSKQADKVISLEEEMSILTDEEIRNKTKAFQERLQAEEDVSKQDKILEEILPEAFALVREGAK
RVFNMTPYPVQIMGGIAIHNGDISEMRTGEGKTLTATMPTYLNALAGRGVHVITVNEYLASSQSEEMAELYNFLGLSVGL
NLNSLSTEQKREAYNADITYSTNNELGFDYLRDNMVNYSEERVMRPLHFAIIDEVDSILIDEARTPLIISGEAEKSTSLY
TQANVFAKMLKAEDDYNYDEKTKSVQLTDQGADKAERMFKLDNLYDLKNVDIITHINTALRANYTLQRDVDYMVVDGEVL
IVDQFTGRTMPGRRFSEGLHQAIEAKEGVQIQNESKTMASITFQNYFRMYNKLAGMTGTAKTEEEEFRNIYNMTVTQIPT
NRPVQREDRPDLIFISQKGKFDAVVEDVVEKHKKGQPILLGTVAVETSEYISQLLKKRGVRHDVLNAKNHEREAEIVSTA
GQKGAVTIATNMAGRGTDIKLGEGVEELGGLAVIGTERHESRRIDDQLRGRSGRQGDRGESRFYLSLQDELMVRFGSERL
QKMMGRLGMDDSTPIESKMVSRAVESAQKRVEGNNFDARKRILEYDEVLRKQREIIYGERNNIIDSESSSELVITMIRST
LDRAISYYVNEELEEIDYAPFINFVEDVFLHEGEVKEDEIKGKDREDIFDTVWAKIEKAYEAQKANIPDQFNEFERMILL
RSIDGRWTDHIDTMDQLRQGIHLRSYGQQNPLRDYQNEGHQLFDTMMVNIEEDVSKYILKSIITVDDDIERDKAKEYQGQ
HVSAEDGKEKVKPQPVVKDNHIGRNDPCPCGSGKKYKNCCGK
>P95759 7.4.2.8~~~secA~~~Protein translocase subunit SecA~~~
MSVFNKLMRAGEGKILRKLHRIADQVSSIEEDFVNLSDAELRALTDEYKERYADGESLDDLLPEAFATVREAAKRVLGQR
HYDVQMMGGVALHLGYVAEMKTGEGKTLVGTLPAYLNALSGKGVHLITVNDYLAERDSELMGRVHKFLGLSVGCIVANMT
PAQRREQYGCDITYGTNNEFGFDYLRDNMAWSKDELVQRGHNFAVVDEVDSILVDEARTPLIISGPADQPPSGTADFAKL
VTRLTKGEAGNQLKGIEETGDYEVDEKKRTVAIHEAGVAKVEDWLGIDNLYESVNTPLVGYLNNAIKAKELFKKDKDYVV
IDGEVMIVDEHTGRILAGRRYNEGMHQAIEAKEGVDIKDENQTLATITLQQNFFRLYDKLSGMTGTAMTEAAEFHQIYKL
GVVPIPTNRPMVRADQSDLIYRTEVAKFAAVVDDIAEKHEKGQPILVGTTSVEKSEYLSQQLSKRGVQHEVLNAKQHDRE
ATIVAQAGRKGAVTVATNMAGRGTDIKLGGNPDDLAEAELRQRGLDPVENVEEWAAALPAALETAEQAVKAEFEEVKDLG
GLYVLGTERHESRRIDNQLRGRSGRQGDPGESRFYLSLGDDLMRLFKAQMVERVMSMANVPDDVPIENKMVTRAIASAQS
QVEQQNFETRKNVLKYDEVLNRQREVIYGERRRVLEGEDLQEQIRHFMDDTIDDYIRQETAEGFAEEWDLDRLWGAFKQL
YPVKVTVDELEEAAGDLAGVTAEFIAESVKNDIHEQYEERENTLGSDIMRELEPRWVLSVLDRKWREHLYEMDYLQEGIG
LRAMAQKDPLVEYQREGFDMFNAMMEGIKEESVGYLFNLEVQVEQQVEEVPVQDGAERPSLEKEGATAAPQIRAKGLEAP
QRPDRLHFSAPTVDGEGGVVEGDFANDEATGDTRSGSADGMTRADAARRRKGGGGRRRKK
>P0A4G7 7.4.2.8~~~secA~~~Protein translocase subunit SecA~~~
MSVLSKLMRAGEGKILRKLHRIADQVNSIEEDFADLSDAELRALTDEYKQRYADGESLDDLLPEAFATVREAAKRVLGQR
HYDVQIMGGAALHMGYVAEMKTGEGKTLVGTLPAYLNALSGEGVHIVTVNDYLAERDSELMGRVHKFLGLNVGCILANQT
PAQRREMYACDITYGTNNEFGFDYLRDNMAWSKDELVQRGHNFAIVDEVDSILVDEARTPLIISGPADQATKWYGDFAKL
VTRLKKGEAGNTLKGIEETGDYEVDEKKRTVAIHESGVAKVEDWLGIDNLYESVNTPLVGYLNNAIKAKELFKKDKDYVV
LDGEVMIVDEHTGRILAGRRYNEGMHQAIEAKEGVDIKDENQTLATITLQNFFRLYKRHDHDGKEQPGLSGMTGTAMTEA
AEFHQIYKLGVVPIPTNRPMVRKDQSDLIYRTEVAKFEAVVDDIEEKHRKGQPILVGTTSVEKSEYLSQQLSKRGVQHEV
LNAKQHDREATIVAQAGRKGSVTVATNMAGRGTDIKLGGNPEDLAEAELRQRGLDPEEHIEEWAAALPAALERAEQAVKA
EFEEVKELGGLYVLGTERHESRRIDNQLRGRSGRQGDPGESRFYLSLGDDLMRLFKAQMVERVMSMANVPDDVPIENKMV
TRAIASAQSQVETQNFETRKNVLKYDEVLNRQREVIYGERRRVLEGEDLQEQIQHFTNDTIDAYVQAETAEGFPEDWDLD
RLWGAFKQLYPVKVTVEELEEAAGDRAGLTADYIAESIKDDVREQYEAREKQLGSEIMRELERRVVLSVLDRKWREHLYE
MDYLQEGIGLRAMAQKDPLVEYQREGFDMFQAMMDGIKEESVGYLFNLEVQVEQQVEEVPVEDAAPSLDKGAQDAVPAQA
GARPEIRAKGLDAPQRRDLHFSAPTVDGEGGVVEGEFTDGEPAQAQSDGLTRAERRKQAKGGRRRKK
>Q9X1R4 7.4.2.8~~~secA~~~Protein translocase subunit SecA~~~COG0653
MILFDKNKRILKKYAKMVSKINQIESDLRSKKNSELIRLSMVLKEKVNSFEDADEHLFEAFALVREAARRTLGMRPFDVQ
VMGGIALHEGKVAEMKTGEGKTLAATMPIYLNALIGKGVHLVTVNDYLARRDALWMGPVYLFLGLRVGVINSLGKSYEVV
WKNPDLARKAIEENWSVWPDGFNGEVLKEESMNKEAVEAFQVELKEITRKEAYLCDVTYGTNNEFGFDYLRDNLVLDYND
KVQRGHFYAIVDEADSVLIDEARTPLIISGPSKESPSVYRRFAQIAKKFVKDKDFTVDEKARTIILTEEGVAKAEKIIGV
ENLYDPGNVSLLYHLINALKALHLFKKDVDYVVMNGEVIIVDEFTGRLLPGRRYSGGLHQAIEAKEGVPIKEESITYATI
TFQNYFRMYEKLAGMTGTAKTEESEFVQVYGMEVVVIPTHKPMIRKDHDDLVFRTQKEKYEKIVEEIEKRYKKGQPVLVG
TTSIEKSELLSSMLKKKGIPHQVLNAKYHEKEAEIVAKAGQKGMVTIATNMAGRGTDIKLGPGVAELGGLCIIGTERHES
RRIDNQLRGRAGRQGDPGESIFFLSLEDDLLRIFGSEQIGKVMNILKIEEGQPIQHPMLSKLIENIQKKVEGINFSIRKT
LMEMDDVLDKQRRAVYSLRDQILLEKDYDEYLKDIFEDVVSTRVEEFCSGKNWDIESLKNSLSFFPAGLFDLDEKQFSSS
EELHDYLFNRLWEEYQRKKQEIGEDYRKVIRFLMLRIIDDHWRRYLEEVEHVKEAVQLRSYGQKDPIVEFKKETYYMFDE
MMRRINDTIANYVLRVVKVSEKDEKEAKEELGKIRLVHEEFNLVNRAMRRATEKKKKKDGLHSFGRIRVKR
>Q5SIW3 7.4.2.8~~~secA~~~Protein translocase subunit SecA~~~COG0653
MLGLLRRLFDNNEREIARYYKQVVEPVNRLEAEVEKLPDLAAAYRELKEKHEKGASLDELLPMAFALTRESAKRYLGMRH
FDVQLIGGAVLHEGKIAEMKTGEGKTLVATLAVALNALTGKGVHVVTVNDYLARRDAEWMGPVYRGLGLSVGVIQHASTP
AERRKAYLADVTYVTNSELGFDYLRDNMAISPDQLVLRHDHPLHYAIIDEVDSILIDEARTPLIISGPAEKATDLYYKMA
EIAKKLERGLPAEPGVRKEPTGDYTVEEKNRSVHLTLQGIAKAEKLLGIEGLFSPENMELAHMLIQAIRAKELYHRDRDY
IVQDGQVIIVDEFTGRLMPGRRYGEGLHQAIEAKEGVRIERENQTLATITYQNFFRLYEKRAGMTGTAKTEEKEFQEIYG
MDVVVVPTNRPVIRKDFPDVVYRTEKGKFYAVVEEIAEKYERGQPVLVGTISIEKSERLSQMLKEPRLYLPRLEMRLELF
KKASQKQQGPEWERLRKLLERPAQLKDEDLAPFEGLIPPKGNLRTAWEGLKRAVHTLAVLRQGIPHQVLNAKHHAREAEI
VAQAGRSKTVTIATNMAGRGTDIKLGGNPEYLAAALLEKEGFDRYEWKVELFIKKMVAGKEEEARALAQELGIREELLER
IREIREECKQDEERVRALGGLFIIGTERHESRRIDNQLRGRAGRQGDPGGSRFYVSFDDDLMRLFASDRVIAMLDRMGFD
DSEPIEHPMVTRSIERAQKRVEDRNFAIRKQLLQFDDVLSRQREVIYAQRRLILLGKDEEVKEAAIGMVEETVASLAENF
LNPEVHPEDWDLEGLKATLLDTAPQLQDFPFAELRALKAEEAVERLVEAALKAYEAREAELSPPLMRAVERFVILNVVDN
AWKEHLHNLDVLRQGIFLRGYGQKDPFQEYKIEATRLFNEMVAFIKSEVAKFLFRLKVEAEPVRPVREAPYVPVPEAKPE
PSEVFGVERKRATPPPQPGLSRAERRRLMRQEKKRKK
>P95257 ~~~secBL~~~SecB-like chaperone Rv1957~~~
MTDRTDADDLDLQRVGARLAARAQIRDIRLLRTQAAVHRAPKPAQGLTYDLEFEPAVDADPATISAFVVRISCHLRIQNQ
AADDDVKEGDTKDETQDVATADFEFAALFDYHLQEGEDDPTEEELTAYAATTGRFALYPYIREYVYDLTGRLALPPLTLE
ILSRPMPVSPGAQWPATRGTP
>P0AG86 ~~~secB~~~Protein-export protein SecB~~~COG1952
MSEQNNTEMTFQIQRIYTKDISFEAPNAPHVFQKDWQPEVKLDLDTASSQLADDVYEVVLRVTVTASLGEETAFLCEVQQ
GGIFSIAGIEGTQMAHCLGAYCPNILFPYARECITSMVSRGTFPQLNLAPVNFDALFMNYLQQQAGEGTEEHQDA
>P44853 ~~~secB~~~Protein-export protein SecB~~~COG1952
MSEQKQDVAATEEQQPVLQIQRIYVKDVSFEAPNLPHIFQQEWKPKLGFDLSTETTQVGDDLYEVVLNISVETTLEDSGD
VAFICEVKQAGVFTISGLEDVQMAHCLTSQCPNMLFPYARELVSNLVNRGTFPALNLSPVNFDALFVEYMNRQQAENAEE
KSEEEQTKH
>O32047 ~~~secDF~~~Protein translocase subunit SecDF~~~COG0341
MKKGRLIAFFLFVLLIGTGLGYFTKPAANNITLGLDLQGGFEVLYDVQPVKKGDKITKDVLVSTVEALNRRANVLGVSEP
NIQIEGNNRIRVQLAGVTNQNRAREILATEAQLSFRDANDKELLNGADLVENGAKQTYDSTTNEPIVTIKLKDADKFGEV
TKKVMKMAPNNQLVIWLDYDKGDSFKKEVQKEHPKFVSAPNVSQELNTTDVKIEGHFTAQEAKDLASILNAGALPVKLTE
KYSTSVGAQFGQQALHDTVFAGIVGIAIIFLFMLFYYRLPGLIAVITLSVYIYITLQIFDWMNAVLTLPGIAALILGVGM
AVDANIITYERIKEELKLGKSVRSAFRSGNRRSFATIFDANITTIIAAVVLFIFGTSSVKGFATMLILSILTSFITAVFL
SRFLLALLVESRWLDRKKGWFGVNKKHIMDIQDTDENTEPHTPFQKWDFTSKRKYFFIFSSAVTVAGIIILLVFRLNLGI
DFASGARIEVQSDHKLTTEQVEKDFESLGMDPDTVVLSGEKSNIGVARFVGVPDKETIAKVKTYFKDKYGSDPNVSTVSP
TVGKELARNALYAVAIASIGIIIYVSIRFEYKMAIAAIASLLYDAFFIVTFFSITRLEVDVTFIAAILTIIGYSINDTIV
TFDRVREHMKKRKPKTFADLNHIVNLSLQQTFTRSINTVLTVVIVVVTLLIFGASSITNFSIALLVGLLTGVYSSLYIAA
QIWLAWKGRELKKDSAQ
>Q5SKE6 ~~~secDF~~~Protein translocase subunit SecDF~~~COG0341
MNRKNLTSLFLLGVFLLALLFVWKPWAPEEPKVRLGLDLKGGLRIVLEADVENPTLDDLEKARTVLENRINALGVAEPLI
QIQGQKRIVVELPGLSQADQDRALKLIGQRAVLEFRIVKEGATGTTVAQINQALRENPRLNREELEKDLIKPEDLGPPLL
TGADLADARAVFDQFGRPQVSLTFTPEGAKKFEEVTRQNIGKRLAIVLDGRVYTAPVIRQAITGGQAVIEGLSSVEEASE
IALVLRSGSLPVPLKVAEIRAIGPTLGQDAIQAGIRSALIGTLAIFLLIFAYYGPHLGLVASLGLLYTSALILGLLSGLG
ATLTLPGIAGLVLTLGAAVDGNVLSFERIKEELRAGKKLRQAIPEGFRHSTLTIMDVNIAHLLAAAALYQYATGPVRGFA
VILAIGVVASVFSNLVFSRHLLERLADRGEIRPPMWLVDPRFNFMGPARYVTAATLLLAALAAGVVFAKGFNYSIDFTGG
TAYTLRAEPNVEVETLRRFLEEKGFPGKEAVITQVQAPTAAYREFLVKLPPLSDERRLELERLFASELKATVLASETVGP
AIGEELRRNAVMAVLVGLGLILLYVAFRFDWTFGVASILAVAHDVAIVAGMYSLLGLEFSIPTIAALLTIVGYSINDSIV
VSDRIRENQKLLRHLPYAELVNRSINQTLSRTVMTSLTTLLPILALLFLGGSVLRDFALAIFVGIFVGTYSSIYVVSALV
VAWKNRRKAQEASKA
>P0AG90 ~~~secD~~~Protein translocase subunit SecD~~~COG0342
MLNRYPLWKYVMLIVVIVIGLLYALPNLFGEDPAVQITGARGVAASEQTLIQVQKTLQEEKITAKSVALEEGAILARFDS
TDTQLRAREALMGVMGDKYVVALNLAPATPRWLAAIHAEPMKLGLDLRGGVHFLMEVDMDTALGKLQEQNIDSLRSDLRE
KGIPYTTVRKENNYGLSITFRDAKARDEAIAYLSKRHPDLVISSQGSNQLRAVMSDARLSEAREYAVQQNINILRNRVNQ
LGVAEPVVQRQGADRIVVELPGIQDTARAKEILGATATLEFRLVNTNVDQAAAASGRVPGDSEVKQTREGQPVVLYKRVI
LTGDHITDSTSSQDEYNQPQVNISLDSAGGNIMSNFTKDNIGKPMATLFVEYKDSGKKDANGRAVLVKQEEVINIANIQS
RLGNSFRITGINNPNEARQLSLLLRAGALIAPIQIVEERTIGPTLGMQNIEQGLEACLAGLLVSILFMIIFYKKFGLIAT
SALIANLILIVGIMSLLPGATLSMPGIAGIVLTLAVAVDANVLINERIKEELSNGRTVQQAIDEGYRGAFSSIFDANITT
LIKVIILYAVGTGAIKGFAITTGIGVATSMFTAIVGTRAIVNLLYGGKRVKKLSI
>P9WGP1 ~~~secD~~~Protein translocase subunit SecD~~~COG0342
MASSSAPVHPARYLSVFLVMLIGIYLLVFFTGDKHTAPKLGIDLQGGTRVTLTARTPDGSAPSREALAQAQQIISARVNG
LGVSGSEVVVDGDNLVITVPGNDGSEARNLGQTARLYIRPVLNSMPAQPAAEEPQPAPSAEPQPPGQPAAPPPAQSGAPA
SPQPGAQPRPYPQDPAPSPNPTSPASPPPAPPAEAPATDPRKDLAERIAQEKKLRQSTNQYMQMVALQFQATRCESDDIL
AGNDDPKLPLVTCSTDHKTAYLLAPSIISGDQIQNATSGMDQRGIGYVVDLQFKGPAANIWADYTAAHIGTQTAFTLDSQ
VVSAPQIQEAIPGGRTQISGGDPPFTAATARQLANVLKYGSLPLSFEPSEAQTVSATLGLSSLRAGMIAGAIGLLLVLVY
SLLYYRVLGLLTALSLVASGSMVFAILVLLGRYINYTLDLAGIAGLIIGIGTTADSFVVFFERIKDEIREGRSFRSAVPR
GWARARKTIVSGNAVTFLAAAVLYFLAIGQVKGFAFTLGLTTILDLVVVFLVTWPLVYLASKSSLLAKPAYNGLGAVQQV
ARERRAMARTGRG
>E9RGS3 ~~~secD~~~Protein translocase subunit SecD~~~COG0342
MLNRYPLWKYLMVFFAIITAALYALPNIYGEDPAIQVTGARGASVDMSTLDAVTTALDKEQLSHKSIALENGSILVRFND
TDTQISARDVISEALGKDTIVALNLAPSTPDWLEAIGASPLKLGLDLRGGVHFLMEVDMDAAMEKLVGQQEEAFRSELRE
AKIRYRAIRPSGKEAVEVLLRNEEQLAEAKAMLEKNHPDMLFVESDSNGRYALTATFTEQRLQEIRNYAVEQNITILRNR
VNELGVAEPLVQRQGASRIVVELPGVQDTARAKEILGATATLEFREVDDKADLSAAANGRAPAGSEIKMDRDGRPVVLKK
RVILGGQSITDASSSVDEYGRPQVNISLDSEGGNKMSAFSKKNIGKLMATVFAEYKDSGRRTPEGKVILDKHEEVINQAT
IQSALGRNFRITGIDSAAEAHNLALLLRAGALIAPISIVEERTIGPSMGQQNIDMGIQACIWGMVAVMLFTLLYYRGFGL
IANIALMANLVLIIGVMSMIPGATMTLPGIAGIVLTVGMAVDANVLIFERIREELRDGRSPQQAIHQGYANAFSTIADAN
ITTLITAIILFAVGTGAVKGFAVTLSIGILTSMFTAIIGTRCIVNLIYGGKRVDKLSI
>D0VWU4 ~~~secE~~~Protein translocase subunit SecE~~~
MEKLKEFLKGVRDELKRVVWPSRELVVKATISVIIFSLAIGVYLWILDLTFTKIISFILSLRGSL
>P0AG96 ~~~secE~~~Protein translocase subunit SecE~~~COG0690
MSANTEAQGSGRGLEAMKWVVVVALLLVAIVGNYLYRDIMLPLRALAVVILIAAAGGVALLTTKGKATVAFAREARTEVR
KVIWPTRQETLHTTLIVAAVTAVMSLILWGLDGILVRLVSFITGLRF
>P9WGN7 ~~~secE~~~Protein translocase subunit SecE~~~COG0690
MSDEGDVADEAVADGAENADSRGSGGRTALVTKPVVRPQRPTGKRSRSRAAGADADVDVEEPSTAASEATGVAKDDSTTK
AVSKAARAKKASKPKARSVNPIAFVYNYLKQVVAEMRKVIWPNRKQMLTYTSVVLAFLAFMVALVAGADLGLTKLVMLVF
G
>P35874 ~~~secE~~~Protein translocase subunit SecE~~~COG0690
MEKLRKFFREVIAEAKKISWPSRKELLTSFGVVLVILAVTSVYFFVLDFIFSGVVSAIFKALGIG
>P38383 ~~~secE~~~Protein translocase subunit SecE~~~COG0690
MFARLIRYFQEARAELARVTWPTREQVVEGTQAILLFTLAFMVILGLYDTVFRFLIGLLR
>P0AG93 ~~~secF~~~Protein translocase subunit SecF~~~COG0341
MAQEYTVEQLNHGRKVYDFMRWDYWAFGISGLLLIAAIVIMGVRGFNWGLDFTGGTVIEITLEKPAEIDVMRDALQKAGF
EEPMLQNFGSSHDIMVRMPPAEGETGGQVLGSQVLKVINESTNQNAAVKRIEFVGPSVGADLAQTGAMALMAALLSILVY
VGFRFEWRLAAGVVIALAHDVIITLGILSLFHIEIDLTIVASLMSVIGYSLNDSIVVSDRIRENFRKIRRGTPYEIFNVS
LTQTLHRTLITSGTTLMVILMLYLFGGPVLEGFSLTMLIGVSIGTASSIYVASALALKLGMKREHMLQQKVEKEGADQPS
ILP
>P9WGN9 ~~~secF~~~Protein translocase subunit SecF~~~COG0341
MASKAKTGRDDEATSAVELTEATESAVARTDGDSTTDTASKLGHHSFLSRLYTGTGAFEVVGRRRLWFGVSGAIVAVAIA
SIVFRGFTFGIDFKGGTTVSFPRGSTQVAQVEDVYYRALGSEPQSVVIVGAGASATVQIRSETLTSDQTAKLRDALFEAF
GPKGTDGQPSKQAISDSAVSETWGGQITKKAVIALVVFLVLVALYITVRYERYMTISAITAMLFDLTVTAGVYSLVGFEV
TPATVIGLLTILGFSLYDTVIVFDKVEENTHGFQHTTRRTFAEQANLAINQTFMRSINTSLIGVLPVLALMVVAVWLLGV
GTLKDLALVQLIGIIIGTYSSIFFATPLLVTLRERTELVRNHTRRVLKRRNSGSPAGSEDASTDGGEQPAAADEQSLVGI
TQASSQSAPRAAQGSSKPAPGARPVRPVGTRRPTGKRNAGRR
>A8GQT5 ~~~secF~~~Protein translocase subunit SecF~~~
MQIYPLRLLPNKIDFDFMNFKKVSYTFSIILSLISFIWIGIYKFNFGIDFAGGIVIEVRLDQAPDLPKMRGVLGKLGIGE
VVLQNFGSERDLSIRFGSNSEENLMKNIELIKGSLQSNFPYKFEYRKVDFVGPQVGRQLIEAGAMAMLFSFLAIMVYIWV
RFEWYFGFGILIALVHDVILALGFMSMTKLDFNLSTIAAVLTIIGYSVNDSVVIYDRIRENLRKYHKKNITEIINLSINE
TLSRTILTVITTLLANLALILFGGEAIRSFSILVFFGIIVGTYSSIFISAPILTMFVNRKFNKKVIER
>E9RGS4 ~~~secF~~~Protein translocase subunit SecF~~~COG0341
MFQILKAEKTIGFMRWSKVAFVFSIFMIAASIFTLSTKWLNWGLDFTGGTLIEVGFEKPANLEKIRTALDAKGFGDATVQ
NFGSAREVMVRLRPRDDVSGETLGNQIIGAIKDGTGESVEMRRIEFVGPNVGDELTEAGGLAILVSLICILLYVSMRFEW
RLAAGAVMALAHDIIITLGVFSFLQIEVDLTIVAALLTVVGYSLNDTIVVFDRIRENFRKMRKGEPADIMDASITQTLSR
TLITSGTTLFVVIALFMQGGAMIHGFATALLLGITVGTYSSIYVASALALKLGIQKEHLMPPQVEKEGAEFDEMP
>O66505 ~~~secG~~~Protein-export membrane protein SecG~~~COG1314
MYYALLTLFVIIAVVLIISTLLQKGRGDVGAAFGGGMGQSIFGVGGVETILTKATYWLGALFLVLALLLSVIPKEKGSVV
EKSVQTEQSEGKGTTQESGK
>O32233 ~~~secG~~~Probable protein-export membrane protein SecG~~~COG1314
MHAVLITLLVIVSIALIIVVLLQSSKSAGLSGAISGGAEQLFGKQKARGLDLILHRITVVLAVLFFVLTIALAYIL
>P0AG99 ~~~secG~~~Protein-export membrane protein SecG~~~COG1314
MYEALLVVFLIVAIGLVGLIMLQQGKGADMGASFGAGASATLFGSSGSGNFMTRMTALLATLFFIISLVLGNINSNKTNK
GSEWENLSAPAKTEQTQPAAPAKPTSDIPN
>P9WGN5 ~~~secG~~~Probable protein-export membrane protein SecG~~~COG1314
MELALQITLIVTSVLVVLLVLLHRAKGGGLSTLFGGGVQSSLSGSTVVEKNLDRLTLFVTGIWLVSIIGVALLIKYR
>Q9WYU9 ~~~secG~~~Protein-export membrane protein SecG~~~COG1314
MKTFFLIVHTIISVALIYMVQVQMSKFSELGGASEVEDFTPFLEEEKASTPVERSLLSCLYSFSFPA
>P62395 ~~~secM~~~Secretion monitor~~~
MSGILTRWRQFGKRYFWPHLLLGMVAASLGLPALSNAAEPNAPAKATTRNHEPSAKVNFGQLALLEANTRRPNSNYSVDY
WHQHAIRTVIRHLSFAMAPQTLPVAEESLPLQAQHLALLDTLSALLTQEGTPSEKGYRIDYAHFTPQAKFSTPVWISQAQ
GIRAGPQRLT
>O66491 ~~~secY~~~Protein translocase subunit SecY~~~COG0201
MSEYLKALFELKELRQKFIFTLLMFVIYRLGSHIPIPGINPEALRDFLKAFEGSVFALYDIFSGGNLGRLTVFALGVMPY
ISASIMMQLLTVAIPSLQRLAKEEGDYGRYKINEYTKYLTLFVATVQSLGIAFWIRGQVSPKGIPVVENPGISFILITVL
TLVAGTMFLVWIADRITEKGIGNGASLIIFAGIVANFPNAVIQFYEKVKTGDIGPLTLLLIIALIIAIIVGIVYVQEAER
RIPIQYPGRQVGRQLYAGRKTYLPIKINPAGVIPIIFAQALLLIPSTLLNFVQNPFIKVIADMFQPGAIFYNFLYVTFIV
FFTYFYTAVLINPVELAENLHKAGAFIPGVRPGQDTVKYLERIINRLIFFGALFLSVIALIPILISVWFNIPFYFGGTTA
LIVVGVALDTFRQIETYLIQKKYKSYVRR
>P16336 ~~~secY~~~Protein translocase subunit SecY~~~COG0201
MFKTISNFMRVSDIRNKIIFTLLMLIVFRIGAFIPVPYVNAEALQAQSQMGVFDLLNTFGGGALYQFSIFAMGITPYITA
SIIIQLLQMDVVPKFTEWSKQGEVGRRKLAQFTRYFTIVLGFIQALGMSYGFNNLANGMLIEKSGVSTYLIIALVLTGGT
AFLMWLGEQITSHGVGNGISIIIFAGIVSSIPKTIGQIYETQFVGSNDQLFIHIVKVALLVIAILAVIVGVIFIQQAVRK
IAIQYAKGTGRSPAGGGQSTHLPLKVNPAGVIPVIFAVAFLITPRTIASFFGTNDVTKWIQNNFDNTHPVGMAIYVALII
AFTYFYAFVQVNPEQMADNLKKQGGYIPGVRPGKMTQDRITSILYRLTFVGSIFLAVISILPIFFIQFAGLPQSAQIGGT
SLLIVVGVALETMKQLESQLVKRNYRGFMKN
>P0AGA2 ~~~secY~~~Protein translocase subunit SecY~~~COG0201
MAKQPGLDFQSAKGGLGELKRRLLFVIGALIVFRIGSFIPIPGIDAAVLAKLLEQQRGTIIEMFNMFSGGALSRASIFAL
GIMPYISASIIIQLLTVVHPTLAEIKKEGESGRRKISQYTRYGTLVLAIFQSIGIATGLPNMPGMQGLVINPGFAFYFTA
VVSLVTGTMFLMWLGEQITERGIGNGISIIIFAGIVAGLPPAIAHTIEQARQGDLHFLVLLLVAVLVFAVTFFVVFVERG
QRRIVVNYAKRQQGRRVYAAQSTHLPLKVNMAGVIPAIFASSIILFPATIASWFGGGTGWNWLTTISLYLQPGQPLYVLL
YASAIIFFCFFYTALVFNPRETADNLKKSGAFVPGIRPGEQTAKYIDKVMTRLTLVGALYITFICLIPEFMRDAMKVPFY
FGGTSLLIVVVVIMDFMAQVQTLMMSSQYESALKKANLKGYGR
>P9WGN3 ~~~secY~~~Protein translocase subunit SecY~~~COG0201
MLSAFISSLRTVDLRRKILFTLGIVILYRVGAALPSPGVNFPNVQQCIKEASAGEAGQIYSLINLFSGGALLKLTVFAVG
VMPYITASIIVQLLTVVIPRFEELRKEGQAGQSKMTQYTRYLAIALAILQATSIVALAANGGLLQGCSLDIIADQSIFTL
VVIVLVMTGGAALVMWMGELITERGIGNGMSLLIFVGIAARIPAEGQSILESRGGVVFTAVCAAALIIIVGVVFVEQGQR
RIPVQYAKRMVGRRMYGGTSTYLPLKVNQAGVIPVIFASSLIYIPHLITQLIRSGSGVVGNSWWDKFVGTYLSDPSNLVY
IGIYFGLIIFFTYFYVSITFNPDERADEMKKFGGFIPGIRPGRPTADYLRYVLSRITLPGSIYLGVIAVLPNLFLQIGAG
GTVQNLPFGGTAVLIMIGVGLDTVKQIESQLMQRNYEGFLK
>Q7A468 ~~~secY~~~Protein translocase subunit SecY~~~
MIQTLVNFFRTKEVRNKIFFTLAMLVIFKIGTYIPAPGVNPAAFDNPQGSQGATELLNTFGGGALKRFSIFAMGIVPYIT
ASIVMQLLQMDIVPKFSEWAKQGEVGRRKLNNVTRYLAISLAFIQSIGMAFQFNNYLKGALIINQSIMSYLLIALVLTAG
TAFLIWLGDQITQFGVGNGISIIIFAGILSTLPASLIQFGQTAFVGQEDTSLAWLKVLGLLVSLILLTVGAIYVLEAVRK
IPIQYAKKQTAQRLGSQATYLPLKVNSAGVIPVIFAMAFFLLPRTLTLFYPDKEWAQNIANAANPSSNVGMVVYIVLIIL
FTYFYAFVQVNPEKMADNLKKQGSYVPGIRPGEQTKKYITKVLYRLTFVGSIFLAVISILPILATKFMGLPQSIQIGGTS
LLIVIGVAIETMKSLEAQVSQKEYKGFGGR
>Q9X1I9 ~~~secY~~~Protein translocase subunit SecY~~~COG0201
MWQAFKNAFKIPELRDRIIFTFLALIVFRMGIYIPVPGLNLEAWGEIFRRIAETAGVAGILSFYDVFTGGALSRFSVFTM
SVTPYITASIILQLLASVMPSLKEMLREGEEGRKKFAKYTRRLTLLIGGFQAFFVSFSLARSNPDMVAPGVNVLQFTVLS
TMSMLAGTMFLLWLGERITEKGIGNGISILIFAGIVARYPSYIRQAYLGGLNLLEWIFLIAVALITIFGIILVQQAERRI
TIQYARRVTGRRVYGGASTYLPIKVNQGGVIPIIFASAIVSIPSAIASITNNETLKNLFRAGGFLYLLIYGLLVFFFTYF
YSVVIFDPREISENIRKYGGYIPGLRPGRSTEQYLHRVLNRVTFIGAVFLVVIALLPYLVQGAIKVNVWIGGTSALIAVG
VALDIIQQMETHMVMRHYEGFIKKGKIRGRR
>Q5SHQ8 ~~~secY~~~Protein translocase subunit SecY~~~COG0201
MLKAFWSALQIPELRQRVLFTLLVLAAYRLGAFIPTPGVDLDKIQEFLRTAQGGVFGIINLFSGGNFERFSIFALGIMPY
ITAAIIMQILVTVVPALEKLSKEGEEGRRIINQYTRIGGIALGAFQGFFLATAFLGAEGGRFLLPGWSPGPFFWFVVVVT
QVAGIALLLWMAERITEYGIGNGTSLIIFAGIVVEWLPQILRTIGLIRTGEVNLVAFLFFLAFIVLAFAGMAAVQQAERR
IPVQYARKVVGRRVYGGQATYIPIKLNAAGVIPIIFAAAILQIPIFLAAPFQDNPVLQGIANFFNPTRPSGLFIEVLLVI
LFTYVYTAVQFDPKRIAESLREYGGFIPGIRPGEPTVKFLEHIVSRLTLWGALFLGLVTLLPQIIQNLTGIHSIAFSGIG
LLIVVGVALDTLRQVESQLMLRSYEGFLSRGRLRGRNR
>B5HDJ6 4.2.3.181~~~~~~Selina-4(15),7(11)-diene synthase ((2E,6E)-farnesyl diphosphate cyclizing)~~~COG0664
MEPELTVPPLFSPIRQAIHPKHADIDVQTAAWAETFRIGSEELRGKLVTQDIGTFSARILPEGREEVVSLLADFILWLFG
VDDGHCEEGELGHRPGDLAGLLHRLIRVAQNPEAPMMQDDPLAAGLRDLRMRVDRFGTAGQTARWVDALREYFFSVVWEA
AHRRAGTVPDLNDYTLMRLYDGATSVVLPMLEMGHGYELQPYERDRTAVRAVAEMASFIITWDNDIFSYHKERRGSGYYL
NALRVLEQERGLTPAQALDAAISQRDRVMCLFTTVSEQLAEQGSPQLRQYLHSLRCFIRGAQDWGISSVRYTTPDDPANM
PSVFTDVPTDDSTEPLDIPAVSWWWDLLAEDARSVRRQVPAQRSA
>O67140 2.9.1.1~~~selA~~~L-seryl-tRNA(Sec) selenium transferase~~~COG1921
MKSLLRQIPQISKVVEIFKKKYPEIYVVKAAREVAEKYRKEIIEGKRKDLNGFLEDVERKIKSLMKPNIKRVINATGVVI
NTNLGRAPLSKDVINFISEIANGYSNLEYNLEEGKRGSRIAHIEKYLNELTGAESSFVVNNNAGAVFLVLNTLAEGKEVI
ISRGELVEIGGSFRIPDIMKKSGAILREVGTTNKTKVSDYEGAINQNTALLMKVHKSNFYMEGFVEEVKLEDLVKLGHKY
GIPTYYDAGSGLLINLKEFGISVDEPNFRDCISLGIDLVSGSGDKLLGGPQAGIIVGKKNLIEKIKKNPIARALRIDKLT
LSGLEMTLKLYFEKRYEDIPVIRMLTQDEKALRQKAKRLEKLLKDIPGLKISVIKDKAKPGGGSLPELELPTYCVAIRHD
RLSSQELSRRLRLAEPPIVCRIREDQLLFDMRTVFHEDLKTIKKTLQELLSI
>P0A821 2.9.1.1~~~selA~~~L-seryl-tRNA(Sec) selenium transferase~~~COG1921
MTTETRSLYSQLPAIDRLLRDSSFLSLRDTYGHTRVVELLRQMLDEAREVIRGSQTLPAWCENWAQEVDARLTKEAQSAL
RPVINLTGTVLHTNLGRALQAEAAVEAVAQAMRSPVTLEYDLDDAGRGHRDRALAQLLCRITGAEDACIVNNNAAAVLLM
LAATASGKEVVVSRGELVEIGGAFRIPDVMRQAGCTLHEVGTTNRTHANDYRQAVNENTALLMKVHTSNYSIQGFTKAID
EAELVALGKELDVPVVTDLGSGSLVDLSQYGLPKEPMPQELIAAGVSLVSFSGDKLLGGPQAGIIVGKKEMIARLQSHPL
KRALRADKMTLAALEATLRLYLHPEALSEKLPTLRLLTRSAEVIQIQAQRLQAPLAAHYGAEFAVQVMPCLSQIGSGSLP
VDRLPSAALTFTPHDGRGSHLESLAARWRELPVPVIGRIYDGRLWLDLRCLEDEQRFLEMLLK
>P14081 ~~~selB~~~Selenocysteine-specific elongation factor~~~COG3276
MIIATAGHVDHGKTTLLQAITGVNADRLPEEKKRGMTIDLGYAYWPQPDGRVPGFIDVPGHEKFLSNMLAGVGGIDHALL
VVACDDGVMAQTREHLAILQLTGNPMLTVALTKADRVDEARVDEVERQVKEVLREYGFAEAKLFITAATEGRGMDALREH
LLQLPEREHASQHSFRLAIDRAFTVKGAGLVVTGTALSGEVKVGDSLWLTGVNKPMRVRALHAQNQPTETANAGQRIALN
IAGDAEKEQINRGDWLLADVPPEPFTRVIVELQTHTPLTQWQPLHIHHAASHVTGRVSLLEDNLAELVFDTPLWLADNDR
LVLRDISARNTLAGARVVMLNPPRRGKRKPEYLQWLASLARAQSDADALSVHLERGAVNLADFAWARQLNGEGMRELLQQ
PGYIQAGYSLLNAPVAARWQRKILDTLATYHEQHRDEPGPGRERLRRMALPMEDEALVLLLIEKMRESGDIHSHHGWLHL
PDHKAGFSEEQQAIWQKAEPLFGDEPWWVRDLAKETGTDEQAMRLTLRQAAQQGIITAIVKDRYYRNDRIVEFANMIRDL
DQECGSTCAADFRDRLGVGRKLAIQILEYFDRIGFTRRRGNDHLLRDALLFPEK
>Q46455 ~~~selB~~~Selenocysteine-specific elongation factor~~~
MDYIVVGTAGHVDHGKTVLVKALTGVDTDRLKEEKERGISIELGFAPLTLPSGRQLGLVDVPGHERFIRQMLAGVGGMDL
VMLVVAADEGVMPQTREHLAIIDLLQIKKGIIVITKIDLVEADWLELVREEVRQAVKGTVLEDAPLVEVSALTGEGIAEL
REQLDALAAVTPPRPAAGRVRLPIDRVFSVTGFGTVVTGTLWSGTIKVGDELEVQPEGLKTRARNLQVHGRTVKEARAGQ
RVAVNLAGIETEAVHRGSSLLTPGFLTPTYRLDASFKLLNGARPLANRDRVHFYLGTSEALGRVVLLDRDELNGGEEALI
QLLMEKPVVASREDRFILRSYSPMETIGGGIIIDPVPPKHRRFQPEVLVSLQRRLEGSPEKILAQIIQEHREGLDWQEAA
TRASLSLEETRKLLQSMAAAGQVTLLRVENDLYAISTERYQAWWQAVTRALEEFHSRYPLRPGLAREELRSRYFSRLPAR
VYQALLEEWSREGRLQLAANTVALAGFTPSFSETQKKLLKDLEDKYRVSRWQPPSFKEVAGSFNLDPSELEELLHYLVRE
GVLVKINDEFYWHRQALGEAREVIKNLASTGPFGLAEARDALGSSRKYVLPLLEYLDQVKFTRRVGDKRVVVGN
>O67139 2.7.9.3~~~selD~~~Selenide, water dikinase~~~COG0709
MVELLKLVRSSGUAAKVGPGDLQEILKGFNIYTDESTLVSIGDDAGVYEHNGIIWVYTVDIITPVVNDPYLWGAISTANA
LSDVYAMGGIPVNALAISCFNNCELDIEIFREVIRGALDKLREAKTVLLGGHTIDDKEPKFGLSVAGICPEGKYITQSGA
QVGQLLILTKPIGTGILIKGLKEGILKEEDINEAIENMLALNDKARNLMLSLDATACTDVTGFGLLGHAWNICKNSNIGA
RIFFEKVPYYQLSENLVKKKIYPKGAIENLNFVKNYLKSNLDNWKLILLSDPVTSGGLLFTINKEKLEKIDETAKELEVN
YWIIGETIAENVLEVL
>P16456 2.7.9.3~~~selD~~~Selenide, water dikinase~~~COG0709
MSENSIRLTQYSHGAGCGCKISPKVLETILHSEQAKFVDPNLLVGNETRDDAAVYDLGNGTSVISTTDFFMPIVDNPFDF
GRIAATNAISDIFAMGGKPIMAIAILGWPINKLSPEIAREVTEGGRYACRQAGIALAGGHSIDAPEPIFGLAVTGIVPTE
RVKKNSTAQAGCKLFLTKPLGIGVLTTAEKKSLLKPEHQGLATEVMCRMNIAGASFANIEGVKAMTDVTGFGLLGHLSEM
CQGAGVQARVDYEAIPKLPGVEEYIKLGAVPGGTERNFASYGHLMGEMPREVRDLLCDPQTSGGLLLAVMPEAENEVKAT
AAEFGIELTAIGELVPARGGRAMVEIR
>P77649 2.7.7.108~~~selO~~~Protein adenylyltransferase SelO~~~COG0397
MTLSFVTRWRDELPETYTALSPTPLNNARLIWHNTELANTLSIPSSLFKNGAGVWGGEALLPGMSPLAQVYSGHQFGVWA
GQLGDGRGILLGEQLLADGTTMDWHLKGAGLTPYSRMGDGRAVLRSTIRESLASEAMHYLGIPTTRALSIVTSDSPVYRE
TAEPGAMLMRVAPSHLRFGHFEHFYYRRESEKVRQLADFAIRHYWSHLADDEDKYRLWFSDVVARTASLIAQWQTVGFAH
GVMNTDNMSLLGLTLDYGPFGFLDDYEPGFICNHSDHQGRYSFDNQPAVALWNLQRLAQTLSPFVAVDALNEALDSYQQV
LLTHYGERMRQKLGFMTEQKEDNALLNELFSLMARERSDYTRTFRMLSLTEQHSAASPLRDEFIDRAAFDDWFARYRGRL
QQDEVSDSERQQLMQSVNPALVLRNWLAQRAIEAAEKGDMTELHRLHEALRNPFSDRDDDYVSRPPDWGKRLEVSCSS
>Q87VB1 2.7.7.108~~~selO~~~Protein adenylyltransferase SelO~~~COG0397
MKALDELVFDNRFARLGDAFSTHVLPEPIDAPRLVVASESALALLDLAPEQSELPLFAEIFSGHKLWAEAEPRAMVYSGH
QFGSYNPRLGDGRGLLLGEVYNDAGEHWDLHLKGAGRTPYSRMGDGRAVLRSSIREFLASEALHALGIPSSRAACVVSSN
TPVWREKQEYAAMVLRLAQSHVRFGSLEYLFYTKQPEHLKTLAEHVLTMHYPHCQEQPEPYLAMFREIVERNAELIAKWQ
AYGFCHGVMNTDNMSILGITFDFGPFAFLDDFDEHFICNHSDHEGRYSFSNQVPIAQWNLSALGQALTPFVSVEALRETI
GLFLPLYQAHYLDLMRRRLGLTVAQDQDDKLVSQLLQLMQNSGVDYTLFFRRLGDQPAAQALRALRDDFVDIKVFDDWAQ
AYQARIAAEENGTEQARKERMHAVNPLYILRNYLAQNAIEAAEKGDYEEVRRLHQVLCTPFTEQPGMEGYAQRPPDWGKH
LEISCSS
>P33667 2.9.1.3~~~selU~~~tRNA 2-selenouridine synthase~~~COG2603
MQERHTEQDYRALLIADTPIIDVRAPIEFEHGAMPAAINLPLMNNDERAAVGTCYKQQGSDAALALGHKLVAGEIRQQRM
DAWRAACLQNPQGILCCARGGQRSHIVQSWLHAAGIDYPLVEGGYKALRQTAIQATIELAQKPIVLIGGCTGSGKTLLVQ
QQPNGVDLEGLARHRGSAFGRTLQPQLSQASFENLLAAEMLKTDARQNLRLWVLEDESRMIGSNHLPECLRERMTQAAIA
VVEDPFEIRLERLNEEYFLRMHHDFTHAYGDEQGWQEYCEYLHHGLSAIKRRLGLQRYNELAARLDAALTTQLTTGSTDG
HLAWLVPLLEEYYDPMYRYQLEKKAEKVVFRGEWAEVAEWVKAR
>Q8ZR88 2.9.1.3~~~selU~~~tRNA 2-selenouridine synthase~~~
MQDRQKAQDYRALLLADTPLIDVRAPIEFEQGAMPGAINLPLMMDDERAAVGTCYKRQGADAALALGHRLVCGDIRQQRL
EAWKAAYQRFPNGYLCCARGGQRSHIVQRWLQETGIDCPLIEGGYKALRQTAIQATWQLAQKPILLIGGCTGSGKTQLVR
QQPNGVDLEGLARHRGSSFGRTLNPQLSQASFENKLAVELLKINARQTLKRWVLEDEGRTIGANHLPECLRERMAQAPIA
VVEDPFALRLERLREEYFIRMHHDFTHAYGDEAGWQAYSEYLHHGLFAIRRRLGLQRFAELTDTLDRALAEQLSSGSTDG
HMAWLVPLLNEYYDPMYRYQLEKKAANIVFRGTWQDVANWLKAQ
>A0A0H3K6X4 ~~~selX~~~Enterotoxin-like toxin X~~~
MFKKYDSKNSIVLKSILSLGIIYGGTFGIYPKADASTQNSSSVQDKQLQKVEEVPNNSEKALVKKLYDRYSKDTINGKSN
KSRNWVYSERPLNENQVRIHLEGTYTVAGRVYTPKRNITLNKEVVTLKELDHIIRFAHISYGLYMGEHLPKGNIVINTKD
GGKYTLESHKELQKDRENVKINTADIKNVTFKLVKSVNDIEQV
>G0Z026 ~~~selX~~~Enterotoxin-like toxin X~~~
MFKKYDSKNSIVLKSILSLGIIYSGSFGIYPKADASTQNSSSVQDKQLQKVEEVPNNSEKALVKKLYDRYSKDTINGKSN
KSRNWVYSERPLNENQVRIHLEGTYTVAGRVYTPKRNITLNKEVVTLKELDHIIRFAHISYGLYMGEHLPKGNIVINTKN
GGKYTLESHKELQKNRENVEINTDDIKNVTFELVKSVNDIEQV
>P0A601 2.7.13.3~~~senX3~~~Sensor-like histidine kinase SenX3~~~
MTVFSALLLAGVLSALALAVGGAVGMRLTSRVVEQRQRVATEWSGITVSQMLQCIVTLMPLGAAVVDTHRDVVYLNERAK
ELGLVRDRQLDDQAWRAARQALGGEDVEFDLSPRKRSATGRSGLSVHGHARLLSEEDRRFAVVFVHDQSDYARMEAARRD
FVANVSHELKTPVGAMALLAEALLASADDSETVRRFAEKVLIEANRLGDMVAELIELSRLQGAERLPNMTDVDVDTIVSE
AISRHKVAADNADIEVRTDAPSNLRVLGDQTLLVTALANLVSNAIAYSPRGSLVSISRRRRGANIEIAVTDRGIGIAPED
QERVFERFFRGDKARSRATGGSGLGLAIVKHVAANHDGTIRVWSKPGTGSTFTLALPALIEAYHDDERPEQAREPELRSN
RSQREEELSR
>A0QR01 2.7.13.3~~~senX3~~~Sensor-like histidine kinase SenX3~~~COG5002
MSLLTLIAGVAVGVTVVPRIVARRQRRAAYAAGMTVSQMLQHITSLSPMGVAVVDTFNDVVYSNDRAVELNVVRDRILDD
RAWQAAQRVFETGQDVEVDLSPLKVANPGRSGISVRGKVRLLTDDDRRFAVVYIDDQSEHARMEATRRDFVANVSHELKT
PVGAMSVLAEALLASADDPDTVRRFAEKMVAESHRLADMIGELIELSRLQGAERLPDLDAVDVDSIVSEAVSRHKVAADN
SQISITTDAPTGYRVLGDEGLLVTAIANLVSNAIAYSPNGTDVSISRRKRGGNIEIAVTDRGIGIAKDDQERVFERFFRV
DKARSRATGGTGLGLAIVKHVAANHNGSIRLWSQPGTGSTFTLSIPEYPDPESHSDEREDQRER
>P9WGK4 2.7.13.3~~~senX3~~~Sensor-like histidine kinase SenX3~~~
MTVFSALLLAGVLSALALAVGGAVGMRLTSRVVEQRQRVATEWSGITVSQMLQCIVTLMPLGAAVVDTHRDVVYLNERAK
ELGLVRDRQLDDQAWRAARQALGGEDVEFDLSPRKRSATGRSGLSVHGHARLLSEEDRRFAVVFVHDQSDYARMEAARRD
FVANVSHELKTPVGAMALLAEALLASADDSETVRRFAEKVLIEANRLGDMVAELIELSRLQGAERLPNMTDVDVDTIVSE
AISRHKVAADNADIEVRTDAPSNLRVLGDQTLLVTALANLVSNAIAYSPRGSLVSISRRRRGANIEIAVTDRGIGIAPED
QERVFERFFRGDKARSRATGGSGLGLAIVKHVAANHDGTIRVWSKPGTGSTFTLALPALIEAYHDDERPEQAREPELRSN
RSQREEELSR
>P9WGK5 2.7.13.3~~~senX3~~~Sensor-like histidine kinase SenX3~~~COG5002
MTVFSALLLAGVLSALALAVGGAVGMRLTSRVVEQRQRVATEWSGITVSQMLQCIVTLMPLGAAVVDTHRDVVYLNERAK
ELGLVRDRQLDDQAWRAARQALGGEDVEFDLSPRKRSATGRSGLSVHGHARLLSEEDRRFAVVFVHDQSDYARMEAARRD
FVANVSHELKTPVGAMALLAEALLASADDSETVRRFAEKVLIEANRLGDMVAELIELSRLQGAERLPNMTDVDVDTIVSE
AISRHKVAADNADIEVRTDAPSNLRVLGDQTLLVTALANLVSNAIAYSPRGSLVSISRRRRGANIEIAVTDRGIGIAPED
QERVFERFFRGDKARSRATGGSGLGLAIVKHVAANHDGTIRVWSKPGTGSTFTLALPALIEAYHDDERPEQAREPELRSN
RSQREEELSR
>Q8VSL2 3.4.21.-~~~sepA~~~Serine protease SepA autotransporter~~~
MNKIYYLKYCHITKSLIAVSELARRVTCKSHRRLSRRVILTSVAALSLSSAWPALSATVSAEIPYQIFRDFAENKGQFTP
GTTNISIYDKQGNLVGKLDKAPMADFSSATITTGSLPPGDHTLYSPQYVVTAKHVSGSDTMSFGYAKNTYTAVGTNNNSG
LDIKTRRLSKLVTEVAPAEVSDIGAVSGAYQAGGRFTEFYRLGGGMQYVKDKNGNRTQVYTNGGFLVGGTVSALNSYNNG
QMITAQTGDIFNPANGPLANYLNMGDSGSPLFAYDSLQKKWVLIGVLSSGTNYGNNWVVTTQDFLGQQPQNDFDKTIAYT
SGEGVLQWKYDAANGTGTLTQGNTTWDMHGKKGNDLNAGKNLLFTGNNGEVVLQNSVNQGAGYLQFAGDYRVSALNGQTW
MGGGIITDKGTHVLWQVNGVAGDNLHKTGEGTLTVNGTGVNAGGLKVGDGTVILNQQADADGKVQAFSSVGIASGRPTVV
LSDSQQVNPDNISWGYRGGRLELNGNNLTFTRLQAADYGAIITNNSEKKSTVTLDLQTLKASDINVPVNTVSIFGGRGAP
GDLYYDSSTKQYFILKASSYSPFFSDLNNSSVWQNVGKDRNKAIDTVKQQKIEASSQPYMYHGQLNGNMDVNIPQLSGKD
VLALDGSVNLPEGSITKKSGTLIFQGHPVIHAGTTTSSSQSDWETRQFTLEKLKLDAATFHLSRNGKMQGDINATNGSTV
ILGSSRVFTDRSDGTGNAVFSVEGSATATTVGDQSDYSGNVTLENKSSLQIMERFTGGIEAYDSTVSVTSQNAVFDRVGS
FVNSSLTLGKGAKLTAQSGIFSTGAVDVKENASLTLTGMPSAQKQGYYSPVISTTEGINLEDNASFSVKNMGYLSSDIHA
GTTAATINLGDSDADAGKTDSPLFSSLMKGYNAVLRGSITGAQSTVNMINALWYSDGKSEAGALKAKGSRIELGDGKHFA
TLQVKELSADNTTFLMHTNNSRADQLNVTDKLSGSNNSVLVDFLNKPASEMSVTLITAPKGSDEKTFTAGTQQIGFSNVT
PVISTEKTDDATKWVLTGYQTTADAGASKAAKDFMASGYKSFLTEVNNLNKRMGDLRDTQGDAGVWARIMNGTGSADGDY
SDNYTHVQIGVDRKHELDGVDLFTGALLTYTDSNASSHAFSGKNKSVGGGLYASALFNSGAYFDLIGKYLHHDNQHTANF
ASLGTKDYSSHSWYAGAEVGYRYHLTKESWVEPQIELVYGSVSGKAFSWEDRGMALSMKDKDYNPLIGRTGVDVGRAFSG
DDWKITARAGLGYQFDLLANGETVLQDASGEKRFEGEKDSRMLMTVGMNAEIKDNMRLGLELEKSAFGKYNVDNAINANF
RYVF
>P0C0Q3 3.4.24.-~~~sepA~~~Extracellular elastase~~~
MKNFSKFALTSIAALTVASPLVNTEVDAKDKVSATQNIDAKVTQESQATDALKELPKSENIKKHYKDYKVTDTEKDNKGF
THYTLQPKVGNTYAPDKEVKVHTNKEGKVVLVNGDTDAKKVQPTNKVSISKESATDKAFEAIKIDRQKAKNLKSDVIKTN
KVEIDGEKNKYVYNIEIITTSPKISHWNVKIDAETGQVVDKLNMIKEAATTGTGKGVLGDTKQININSVSGGYALQDLTQ
QGTLSAYNYDANTGQAYLMQDKDRNFDDDEQRAGVDANYYAKETYDYYKNTFGRESYDNQGSPIISLAHVNNFQGQDNRN
NAAWIGDKMIYGDGDGRTFTALSGANDVVAHEITHGVTQQTANLVYRSQSGALNESFSDVFGYFVDDEDFLMGEDVYTPG
VGGDALRSMSNPERFGQPSHMNDFVYTNSDNGGVHTNSGIPNKAAYNTIRSIGKQRSEQIYYRALTVYLTSNSDFQDAKA
SLQQAALDLYGEGIAQQVGQAWDSVGV
>Q5SJ85 3.5.3.24~~~~~~N(1)-aminopropylagmatine ureohydrolase~~~COG0010
MRLVFGEKDTPYEEARVVVLPVPYDLSLSFLPGARRGPEAILLASRELEPFLLELGAAPEEVGIHAAEPVPWVAGMAEES
HRLIREEALRHLRAGKWVVALGGDHSVTHPLVQAHREALGDFSLLHVDAHADLYPEWQGSVYSHASPFYRLLTEGFPLVQ
VGIRAMDRDSLRLARKKGVALFPAHRIHREGLPLDEILRALGKRVYISLDFDALDPSLMPSVGTPLPGGLSYRQVVDLLE
AVFREKEVVGMDFVELSPNGQFHAEMTAAQLVYHAIGLKGLQAGWLSREVDHI
>O31728 ~~~sepF~~~Cell division protein SepF~~~COG1799
MSMKNKLKNFFSMEDEEYEYEYIETERESHEEHEQKEKPAYNGNKPAGKQNVVSLQSVQKSSKVVLSEPRVYAEAQEIAD
HLKNRRAVVVNLQRIQHDQAKRIVDFLSGTVYAIGGDIQRIGSDIFLCTPDNVDVSGTISELISEDEHQRW
>P9WGJ5 ~~~sepF~~~Cell division protein SepF~~~COG1799
MSTLHKVKAYFGMAPMEDYDDEYYDDRAPSRGYARPRFDDDYGRYDGRDYDDARSDSRGDLRGEPADYPPPGYRGGYADE
PRFRPREFDRAEMTRPRFGSWLRNSTRGALAMDPRRMAMMFEDGHPLSKITTLRPKDYSEARTIGERFRDGSPVIMDLVS
MDNADAKRLVDFAAGLAFALRGSFDKVATKVFLLSPADVDVSPEERRRIAETGFYAYQ
>Q7A615 ~~~sepF~~~Cell division protein SepF~~~
MSHLALKDLFSGFFVIDDEEEVEVPDKQQQVNEAPAKEQSQQTTKQNAIKSVPQKSASRYTTTSEERNNRMSNYSKNNSR
NVVTMNNATPNNASQESSKMCLFEPRVFSDTQDIADELKNRRATLVNLQRIDKVSAKRIIDFLSGTVYAIGGDIQRVGTD
IFLCTPDNVEVAGSITDHIENMEHSFD
>P0CB77 ~~~sepF~~~Cell division protein SepF~~~COG1799
MSLKDRFDRFIDYFTEDEDSSLPYEKRDEPVFTSVNSSQEPALPMNQPSQSAGTKENNITRLHARQQELANQSQRATDKV
IIDVRYPRKYEDATEIVDLLAGNESILIDFQYMTEVQARRCLDYLDGACHVLAGNLKKVASTMYLLTPVNVIVNVEDIRL
PDEDQQGEFGFDMKRNRVR
>Q31LI0 ~~~sepF~~~Cell division protein SepF~~~COG1799
MSFVNRIRDIVGLNESLDYDEEYETYDVAADSYNGYNDAAETSSRRRQRNHTPTASIEPVSTASNVIGLPGLSSSSEVVV
MEPRSFEEMPQAIQALRERKTIVLNLTMMEPDQAQRAVDFVAGGTFAIDGHQERVGESIFLFTPSCVHVTTQGGEQYLNE
SPAQPVQTTTSFGRTATPTPAWGTDSRYAAQ
>P73376 ~~~sepF~~~Cell division protein SepF~~~COG1799
MILTKLKDFVGISEHDEYEEDYDEEMEFPQSVASQPPAEEVAPPPRISREPLSLNNETSIGTGVRNNVIGMPGINNSVAE
VVVVEPHSFDEMPQVIQTLRERKSVVLNLNVMDPEEAQRAVDFVAGGTFAMDGHQERIGESIFLFTPNCVKVSSLAGRSQ
ESNETSSSVSSDNFPTWGYETSRLAQ
>P80146 3.4.21.-~~~~~~Extracellular serine proteinase~~~
MKRGGLWLLLGLLVLSACSSNPPAASTQEAPLLGLEAPEAIPGRYIVVYKENADVLPALEALKAALEPGLMQPQGLQAQA
LRTLGLEGARVDKVYTAALRGVAVEVPDQELARLRQDPRVAYIEADQEVRAFAVQSPATWGLDRIDQRTLPLDGRYTYTA
TGAGVHAYVVDTGILLSHQEFTGRIGKGYDAITPGGSAQDCNGHGTHVAGTIGGTTYGVAKGVTLHPVRVLDCNGSGSNS
SVIAGLDWVTQNHVKPAVINMSLGGGASTALDTAVMNAINAGVTVVVAAGNDNRDACFYSPARVTAAITVGATTSTDYRA
SFSNYGRCLDLFAPGQSITSAWYTSSTATNTISGTSMATPHVTGAAALYLQWYPTATPSQVASALLYYATPNVVKNAGRY
SPNLLLYTPF
>P0AFY8 ~~~seqA~~~Negative modulator of initiation of replication~~~COG3057
MKTIEVDDELYSYIASHTKHIGESASDILRRMLKFSAASQPAAPVTKEVRVASPAIVEAKPVKTIKDKVRAMRELLLSDE
YAEQKRAVNRFMLLLSTLYSLDAQAFAEATESLHGRTRVYFAADEQTLLKNGNQTKPKHVPGTPYWVITNTNTGRKCSMI
EHIMQSMQFPAELIEKVCGTI
>P0A9T0 1.1.1.95~~~serA~~~D-3-phosphoglycerate dehydrogenase~~~COG0111
MAKVSLEKDKIKFLLVEGVHQKALESLRAAGYTNIEFHKGALDDEQLKESIRDAHFIGLRSRTHLTEDVINAAEKLVAIG
CFCIGTNQVDLDAAAKRGIPVFNAPFSNTRSVAELVIGELLLLLRGVPEANAKAHRGVWNKLAAGSFEARGKKLGIIGYG
HIGTQLGILAESLGMYVYFYDIENKLPLGNATQVQHLSDLLNMSDVVSLHVPENPSTKNMMGAKEISLMKPGSLLINASR
GTVVDIPALCDALASKHLAGAAIDVFPTEPATNSDPFTSPLCEFDNVLLTPHIGGSTQEAQENIGLEVAGKLIKYSDNGS
TLSAVNFPEVSLPLHGGRRLMHIHENRPGVLTALNKIFAEQGVNIAAQYLQTSAQMGYVVIDIEADEDVAEKALQAMKAI
PGTIRARLLY
>P9WNX3 1.1.1.95~~~serA~~~D-3-phosphoglycerate dehydrogenase~~~COG0111
MSLPVVLIADKLAPSTVAALGDQVEVRWVDGPDRDKLLAAVPEADALLVRSATTVDAEVLAAAPKLKIVARAGVGLDNVD
VDAATARGVLVVNAPTSNIHSAAEHALALLLAASRQIPAADASLREHTWKRSSFSGTEIFGKTVGVVGLGRIGQLVAQRI
AAFGAYVVAYDPYVSPARAAQLGIELLSLDDLLARADFISVHLPKTPETAGLIDKEALAKTKPGVIIVNAARGGLVDEAA
LADAITGGHVRAAGLDVFATEPCTDSPLFELAQVVVTPHLGASTAEAQDRAGTDVAESVRLALAGEFVPDAVNVGGGVVN
EEVAPWLDLVRKLGVLAGVLSDELPVSLSVQVRGELAAEEVEVLRLSALRGLFSAVIEDAVTFVNAPALAAERGVTAEIC
KASESPNHRSVVDVRAVGADGSVVTVSGTLYGPQLSQKIVQINGRHFDLRAQGINLIIHYVDRPGALGKIGTLLGTAGVN
IQAAQLSEDAEGPGATILLRLDQDVPDDVRTAIAAAVDAYKLEVVDLS
>Q9S1H0 1.97.1.9~~~serA~~~Selenate reductase subunit alpha~~~
MRKVMNSPDDGNGRRRFLQFSMAALASAAAPSSVWAFSKIQPIEDPLKSYPYRDWEDLYRKEWTWDSTGFITHSNGCVAG
CAWRVFVKNGVPMREEQVSEYPQLPGVPDMNPRGCQKGAVYCSWSKQPDFLKYPLKRVGERGERKWKRISWDEAFTEIAD
KIIDTTVKRGPGNVCMPKRPFAVITSAGYSRLANLIGAIKPDVSSMTGDLYPGIQTVRMPARTVSTFDDWFTSDLILMWH
KNPIVTRIPDAHFLTEARYNGARLVNISPDYNPSSVHADLHLPVTTGTDSHLAAAIVNVLIADKKYKADYLKEQTDLPFL
VRTDNGKFLREKDFNKDGSDEVFYIWDSKSGKAVLAPGSMGSKDKTLKLGAVEPALEGTFDANGIEVTTVFARLKAEIAP
YTPEATHKTTGIHPSVVRQLAGWIGDCKALRILDGYNNQKHFDGFQCGRLKILILTLIGHHGTTGSIDTTYEGWVLEGNK
ALGGVKGRPGRSVSMVLAQWVWGEQYRRSKAYFDDTELREQIGFGVDEMEALRKESEANGWMPNWQSIKDPVVYINAGIN
TFATSTGYQHLRENFLKRCELYVVVDFRLNSGAMYADIVLPAATNLEKLDIRETSSTRFIHAFGQPIKPMYDRRTDWQIS
VGLARKIQERARARGITRVDDPEIKSFIDFDKVYDEFTMNGAVEKDEDALRFVMEKSKALGPGSYEEVLKRGFVGVGPSA
GKTGPVPADKPYRPFTVNVSEKVPYKTLTGRLQFYIDHDWYQRFGATVPKPQYGGGVLGPKKYPFVYNTPHTRWGVHSFA
RTDQWMLRHQRGEPDVRLNPAAMARKGIKDGDQVRIFNSSGEFFAMAKAWPGLPENMLFSEHGWEQYLYKNMTHYNSVNA
ELINPLELVGGYGHVKFAAGGFNPNRIFHETTVDVEKA
>P9WGJ3 3.1.3.3~~~serB1~~~Phosphoserine phosphatase SerB1~~~COG0560
MMVSSHLGSPDQAGHVDLASPADPPPPDASASHSPVDMPAPVAAAGSDRQPPIDLTAAAFFDVDNTLVQGSSAVHFGRGL
AARHYFTYRDVLGFLYAQAKFQLLGKENSNDVAAGRRKALAFIEGRSVAELVALGEEIYDEIIADKIWDGTRELTQMHLD
AGQQVWLITATPYELAATIARRLGLTGALGTVAESVDGIFTGRLVGEILHGTGKAHAVRSLAIREGLNLKRCTAYSDSYN
DVPMLSLVGTAVAINPDARLRSLARERGWEIRDFRIARKAARIGVPSALALGAAGGALAALASRRQSR
>O53289 3.1.3.3~~~serB2~~~Phosphoserine phosphatase SerB2~~~COG0560
MPAKVSVLITVTGMDQPGVTSALFEVLAQHGVELLNVEQVVIRGRLTLGVLVSCPLDVADGTALRDDVAAAIHGVGLDVA
IERSDDLPIIRQPSTHTIFVLGRPITAGAFSAVARGVAALGVNIDFIRGISDYPVTGLELRVSVPPGCVGPLQIALTKVA
AEEHVDVAVEDYGLAWRTKRLIVFDVDSTLVQGEVIEMLAARAGAQGQVAAITEAAMRGELDFAESLQRRVATLAGLPAT
VIDDVAEQLELMPGARTTIRTLRRLGFRCGVVSGGFRRIIEPLARELMLDFVASNELEIVDGILTGRVVGPIVDRPGKAK
ALRDFASQYGVPMEQTVAVGDGANDIDMLGAAGLGIAFNAKPALREVADASLSHPYLDTVLFLLGVTRGEIEAADAGDCG
VRRVEIPAD
>Q21YU0 3.1.3.3~~~serB~~~Phosphoserine phosphatase~~~COG0560
MHPTAIAPGLIIQGFAPPLRLSDFKLIAFDMDSTLINIECVDEIADAVGRKREVAAITEAAMRGEITDYKESLRQRVALL
QGVTEVQMNQIYQERMQFNPGAAELVAACKAAGLKVLLVSGGFTHFTDRVAQRLGIDYTRSNVLQIENGVLTGRMVDQPW
GDICDGAEKRKMLLETCALLGIAPKQAIAVGDGANDLPMMREAGLSVAFHAKSAVRELANVSIESGGLDRLLELFQP
>A7H590 3.1.3.3~~~serB~~~Phosphoserine phosphatase~~~
MIKLCVFDFDATLMDGETIDILATAHGKGNQISEITRYAMAGELDFFESLQKRVSFLKGMSYKKVLELGSTLPLMHGAHE
LIQYLKSKNIQIVIFSGGFHEGIDPAMQKLGINLGFANYLHHKNDILTGLIGGEIMFSNSKGLMLQRLKSFLNLKTDEVM
CVGDGANDLAMFNESGLKIAFCAKEILRSQADICIDIKDLKEIIKVI
>P0AGB0 3.1.3.3~~~serB~~~Phosphoserine phosphatase~~~COG0560
MPNITWCDLPEDVSLWPGLPLSLSGDEVMPLDYHAGRSGWLLYGRGLDKQRLTQYQSKLGAAMVIVAAWCVEDYQVIRLA
GSLTARATRLAHEAQLDVAPLGKIPHLRTPGLLVMDMDSTAIQIECIDEIAKLAGTGEMVAEVTERAMRGELDFTASLRS
RVATLKGADANILQQVRENLPLMPGLTQLVLKLETLGWKVAIASGGFTFFAEYLRDKLRLTAVVANELEIMDGKFTGNVI
GDIVDAQYKAKTLTRLAQEYEIPLAQTVAIGDGANDLPMIKAAGLGIAYHAKPKVNEKAEVTIRHADLMGVFCILSGSLN
QK
>Q5QXU4 3.1.3.3~~~serB_2~~~Phosphoserine phosphatase~~~COG0560
MKHSSGLIVFDMDSTLIHIECIDEIARLNNRYTKVSAITEAAMRGEIDFAESLTQRVACLEGIKESDLESLFSPIPFNPG
AKELIQALQAAGWKTALVSGGFTWFANRVQAALNLDAVVANQLEVADGCLTGKVLGDIVDAQVKAEQLQQLAGHWNIPPD
RTVAVGDGANDGLMLKAAAVGIAFNAKPALQAIADYSVNSNNLLEILGCLKQSELIEPVI
>Q9CHW3 3.1.3.3~~~serB~~~Phosphoserine phosphatase~~~COG0560
MAAKGLLVMDVDSTLIEEEVIDLLGEKAGMGDKISEITAAAMSGEIDFKESLRERVALLSGLPTTIFDDVYKEIHLTKGA
TGLIETLHAKGWKVGLVSGGFHEIVDKIARDLKIDYVFANRLSVENGHLTGKTHGTVVDKDFKVDRLKQWANENKLNLSE
VIAVGDGANDIPMLNTAGIGIAFCAKPAVKAAVSYHIDKRNLLTVLEFVDKLADKE
>A0QJI1 3.1.3.3~~~serB~~~Phosphoserine phosphatase~~~
MNSPPKVSVLITVTGVDQPGVTATLFEVLSGHGVELLNVEQVVIRHRLTLGVLVCCPADVADGPALRHDVEAAIRKVGLD
VSIERSDDVPIIREPSTHTIFVLGRPITAAAFGAVAREVAALGVNIDLIRGVSDYPVIGLELRVSVPPGADGALRTALNR
VSSEEHVDVAVEDYTLERRAKRLIVFDVDSTLVQGEVIEMLAAKAGAEGQVAAITDAAMRGELDFAQSLQQRVATLAGLP
ATVIDEVAGQLELMPGARTTLRTLRRLGYACGVVSGGFRRIIEPLAEELMLDYVAANELEIVDGTLTGRVVGPIIDRAGK
ATALREFAQRAGVPMAQTVAVGDGANDIDMLAAAGLGIAFNAKPALREVADASLSHPYLDTVLFLLGVTRGEIEAADAID
GEVRRVEIPPE
>Q12A06 3.1.3.3~~~serB~~~Phosphoserine phosphatase~~~COG0560
MQPTEISPGLVVNVATPDLKLSDFKLIAFDMDSTLINIECVDEIADAAGRKAEVAAITEAAMRGEISDYKESLRQRVALL
KGVSVASMDEVYRTRLRLNPGAARLVQACKDAGLKVLLVSGGFTFFTDRIRDELGIDYTRSNVLETTDGLLTGRMVDQPW
GDICDGEEKRKMLLETCGQLGISPRQAIAMGDGANDLPMMGEAGLSVAYHAKPRVREQAMVAINEGGLDRLLELVK
>Q9S281 3.1.3.3~~~~~~Phosphoserine phosphatase~~~COG0560
MSASQTSDVPTLLVKIFGKDRPGITAGLFDTLAAYSVDVVDIEQVVTRGRIVLCALVTEPPRGLEGDLRATVHSWAESLK
LQAEIISGIGDNRPRGFGRSLVTVLGHPLTAEATAAIAARITESGSNIDRIFRLAKYPVTAVEFAVSGVETEPLRTALAT
EAAALGVDIAVVAAGLHRRAQRLVVMDVDSTLIQDEVIELFAAHAGCEDEVAEVTAAAMRGELDFEQSLHARVALLAGLD
ASVVDKVRAEVRLTPGARTLIRTLKRLGYQVGVVSGGFTQVTDALQEQLGLDFAQANTLEIVDGRLTGRVTGEIVDRAGK
ARLLRRFAAAAGVPLSQTVAIGDGANDLDMLNAAGLGVAFNAKPVVREAAHTAVNVPFLDTVLYLLGITREEVEAADTLA
DDLGDGPGRP
>Q5M3B3 3.1.3.3~~~serB~~~Phosphoserine phosphatase~~~COG0560
MSEVKGLLVMDVDSTLVQEEVIDLLGEEAGVGREVAEITERAMRGELDFRQALNERVATLKGLPDSIFEKVYARIHFNKG
AKELVDELHSRGFKVGLVSGGFHETVDRLAKEAGIDYVKANHLEVIDGFLTGKVYGEIVTKDVKVAKLKDWAAENGLKLS
QTIAMGDGANDLPMIKTAGIGIAFCAKPIVRVQAPYQITEPDLYKVIEILDEVGK
>Q9S1G9 1.97.1.9~~~serB~~~Selenate reductase subunit beta~~~
MSQRQLAYVFDLNKCIGCHTCTMACKQLWTNRDGREYMYWNNVESRPGKGYPKNWEQKGGGFDKDGKLKTNGIIPIRADY
GGTWNYNLLETLVEGKSNQVVPDEKPTWGPNWDEDEGKGEFPNNHYFYLPRICNHCSNPACLAACPTKAIYKREEDGLVV
VDQSRCKGYRYCVKACPYGKMYFNLQKGTSEKCIGCYPRVEKGEAPACVKQCSGRIRFWGYRDDKDGPIYKLVDQWKVAL
PLHAEYGTEPNVFYVPPMNTTPPPFEEDGRLGDKPRIPIEDLEALFGPGVKQALATLGGEMAKRRKAQASELTDILIGYT
NKDRYGI
>Q9KPM2 3.1.3.3~~~~~~Phosphoserine phosphatase~~~COG0560
MDMDALTTLPIKKHTALLNRFPETRFVTQLAKKRASWIVFGHYLTPAQFEDMDFFTNRFNAILDMWKVGRYEVALMDGEL
TSEHETILKALELDYARIQDVPDLTKPGLIVLDMDSTAIQIECIDEIAKLAGVGEEVAEVTERAMQGELDFEQSLRLRVS
KLKDAPEQILSQVRETLPLMPELPELVATLHAFGWKVAIASGGFTYFSDYLKEQLSLDYAQSNTLEIVSGKLTGQVLGEV
VSAQTKADILLTLAQQYDVEIHNTVAVGDGANDLVMMAAAGLGVAYHAKPKVEAKAQTAVRFAGLGGVVCILSAALVAQQ
KLSWKSKP
>Q7M7U5 3.1.3.3~~~SERB~~~Phosphoserine phosphatase~~~COG0560
MKLAVFDFDSTLMDGETIDILAHHYGVGEEVDRITKGAMEGGLDFYESLKRRVALLRGMELSLVEEICANLTLMEGAKEL
IQELKRRDYKVVVFSGGFKNATSKARETLGLDADFSNILHHKEGKLTGEVGGEMMFGSSKGEMMQTLQRLLGISPELTMA
VGDGANDASMFPFAKQRVAFCAKPILREKANIIIEKKDLREILAHL
>Q9RME2 2.6.1.52~~~serC~~~Phosphoserine aminotransferase~~~
MVKQVFNFNAGPSALPKPALERAQKELLNFNDTQMSVMELSHRSQSYEEVHEQAQNLLRELLQIPNDYQILFLQGGASLQ
FTMLPMNLLTKGTIGNYVLTGSWSEKALKEAKLLGETHIAASTKANSYQSIPDFSEFQLNENDAYLHITSNNTIYGTQYQ
NFPEINHAPLIADMSSDILSRPLKVNQFGMIYAGAQKNLGPSGVTVVIVKKDLLNTKVEQVPTMLQYATHIKSDSLYNTP
PTFSIYMLRNVLDWIKDLGGAEAIAKQNEEKAKIIYDTIDESNGFYVGHAEKGSRSLMNVTFNLRNEELNQQFLAKAKEQ
GFVGLNGHRSVGGCRASIYNAVPIDACIALRELMIQFKENA
>P80862 2.6.1.52~~~serC~~~Phosphoserine aminotransferase~~~COG1932
MERTTNFNAGPAALPLEVLQKAQKEFIDFNESGMSVMELSHRSKEYEAVHQKAKSLLIELMGIPEDYDILFLQGGASLQF
SMLPMNFLTPEKTAHFVMTGAWSEKALAETKLFGNTSITATSETDNYSYIPEVDLTDVKDGAYLHITSNNTIFGTQWQEF
PNSPIPLVADMSSDILSRKIDVSKFDVIYGGAQKNLGPSGVTVVIMKKSWLQNENANVPKILKYSTHVKADSLYNTPPTF
AIYMLSLVLEWLKENGGVEAVEQRNEQKAQVLYSCIDESNGFYKGHARKDSRSRMNVTFTLRDDELTKTFVQKAKDAKMI
GLGGHRSVGGCRASIYNAVSLEDCEKLAAFMKKFQQENE
>Q9PIH3 2.6.1.52~~~serC~~~Phosphoserine aminotransferase~~~COG1932
MRKINFSAGPSTLPLEILEQAQKELCDYQGRGYSIMEISHRTKVFEEVHFGAQEKAKKLYELNDDYEVLFLQGGASLQFA
MIPMNLALNGVCEYANTGVWTKKAIKEAQILGVNVKTVASSEESNFDHIPRVEFSDNADYAYICSNNTIYGTQYQNYPKT
KTPLIVDASSDFFSRKVDFSNIALFYGGVQKNAGISGLSCIFIRKDMLERSKNKQIPSMLNYLTHAENQSLFNTPPTFAI
YMFNLEMDWLLNQGGLDKVHEKNSQKATMLYECIDLSNGFYKGHADKKDRSLMNVSFNIAKNKDLEPLFVKEAEEAGMIG
LKGHRILGGIRASIYNALNLDQVKTLCEFMKEFQGKYA
>P23721 2.6.1.52~~~serC~~~Phosphoserine aminotransferase~~~COG1932
MAQIFNFSSGPAMLPAEVLKQAQQELRDWNGLGTSVMEVSHRGKEFIQVAEEAEKDFRDLLNVPSNYKVLFCHGGGRGQF
AAVPLNILGDKTTADYVDAGYWAASAIKEAKKYCTPNVFDAKVTVDGLRAVKPMREWQLSDNAAYMHYCPNETIDGIAID
ETPDFGADVVVAADFSSTILSRPIDVSRYGVIYAGAQKNIGPAGLTIVIVREDLLGKANIACPSILDYSILNDNGSMFNT
PPTFAWYLSGLVFKWLKANGGVAEMDKINQQKAELLYGVIDNSDFYRNDVAKANRSRMNVPFQLADSALDKLFLEESFAA
GLHALKGHRVVGGMRASIYNAMPLEGVKALTDFMVEFERRHG
>A6T700 2.6.1.52~~~serC~~~Phosphoserine aminotransferase~~~
MAQVYNFSSGPAMLPAEVLKLAQQELCDWHGLGTSVMEISHRGKEFIQVAEEAEQDFRALLNIPSNYKVLFCHGGGRGQF
AGIPLNILGDKKVADYVDAGYWAASAVKEAKKYCTPNVIDAKITVDGKRAVKPMSEWQLTPGAAYLHYCPNETIDGIAID
ETPNFGDDVIVTADFSSTILSREIDVNRFGVIYAGAQKNIGPAGLTLVIVREDLLGKASVACPSILDYTVLSENDSMFNT
PPTFAWYLAGLVFKWLKQQGGVAAMDKINQQKAELLYGVIDNSGFYRNDVAQANRSRMNVPFQLADSALDKLFLEESFAA
GLHALKGHRVVGGMRASIYNAMPLDGVKTLTDFMLDFERRHG
>P9WQ73 2.6.1.52~~~serC~~~Phosphoserine aminotransferase~~~COG1932
MADQLTPHLEIPTAIKPRDGRFGSGPSKVRLEQLQTLTTTAAALFGTSHRQAPVKNLVGRVRSGLAELFSLPDGYEVILG
NGGATAFWDAAAFGLIDKRSLHLTYGEFSAKFASAVSKNPFVGEPIIITSDPGSAPEPQTDPSVDVIAWAHNETSTGVAV
AVRRPEGSDDALVVIDATSGAGGLPVDIAETDAYYFAPQKNFASDGGLWLAIMSPAALSRIEAIAATGRWVPDFLSLPIA
VENSLKNQTYNTPAIATLALLAEQIDWLVGNGGLDWAVKRTADSSQRLYSWAQERPYTTPFVTDPGLRSQVVGTIDFVDD
VDAGTVAKILRANGIVDTEPYRKLGRNQLRVAMFPAVEPDDVSALTECVDWVVERL
>Q59196 2.6.1.52~~~serC~~~Phosphoserine aminotransferase~~~
MSKRAYNFNAGPAALPLEVLERAQAEFVDYQHTGMSIMEMSHRGAVYEAVHNEAQARLLALLGNPTGYKVLFIQGGASTQ
FAMIPMNFLKEGQTANYVMTGSWASKALKEAKLIGDTHVAASSEASNYMTLPKLQEIQLQDNAAYLHLTSNETIEGAQFK
AFPDTGSVPLIGDMSSDILSRPFDLNQFGLVYAGAQKNLGPSGVTVVIVREDLVAESPKHLPTMLRYDTYVKNNSLYNTP
PSFGIYMVNEVLKWIEERGGLEGVQQANRKKASLIYDAIDQSGGFYRGCVDVDSRSDMNITFRLASEELEKEFVKASEQE
GFVGLKGHRSVGGLRASIYNAVPYESCEALVQFMEHFKRSRG
>Q9HZ66 2.6.1.52~~~serC~~~Phosphoserine aminotransferase~~~
MSKRAFNFCAGPAALPDAVLQRAQAELLDWRGKGLSVMEMSHRSDDYVAIASKAEQDLRDLLDIPSDYKVLFLQGGASQQ
FAEIPLNLLPEDGVADYIDTGIWSKKAIEEARRYGTVNVAASAKEYDYFAIPGQNEWTLTKDAAYVHYASNETIGGLEFD
WIPETGDVPLVTDMSSDILSRPLDVSRFGLIYAGAQKNIGPSGLVVVIVREDLLGRARSVCPTMLNYKTAADNGSMYNTP
ATYSWYLSGLVFEWLKEQGGVTAMEQRNRAKKDLLYKTIDASDFYTNPIQPSARSWMNVPFRLADERLDKPFLEGAEARG
LLNLKGHRSVGGMRASIYNALGLDAVEALVAYMAEFEKEHG
>P55900 2.6.1.52~~~serC~~~Phosphoserine aminotransferase~~~
MAQVFNFSSGPAMLPAEVLKLAQQELCDWHGLGTSVMEISHRGKEFIQVAEEAEQDFRDLLNIPSNYKVLFCHGGGRGQF
AGVPLNLLGDKTTADYVDAGYWAASAIKEAKKYCAPQIIDAKITVDGKRAVKPMREWQLSDNAAYLHYCPNETIDGIAID
ETPDFGPEVVVTADFSSTILSAPLDVSRYGVIYAGAQKNIGPAGLTLVIVREDLLGKAHESCPSILDYTVLNDNDSMFNT
PPTFAWYLSGLVFKWLKAQGGVAAMHKINQQKAELLYGVIDNSDFYRNDVAQANRSRMNVPFQLADNTLDKVFLEESFAA
GLHALKGHRVVGGMRASIYNAMPIEGVKALTDFMIDFERRHG
>B2FKF0 2.6.1.52~~~serC~~~Phosphoserine aminotransferase~~~COG1932
MTRAFNFSAGPATLPESVLRQAQAEMLDWHGSGASIVEMSHRGAEFMSVAAEAEADLRRLLDIPDDYAVLFLSGGATTQQ
ALIPLNFAAPGQRADYVVSGHWGKTAVKQAGVYVDVNIAASSEANGYRELPARADWQLSRDAAYVHITANETIHGVEFRD
VPDTGNVPLIADFSSSIASEPLDVRRYGVIYAGAQKNLGPVGVAVMIIRRDLLERSGQPRADIFDYRSHVARDSMLNTPP
TWNWYLAGLVFKWMLAEGGVTEFAKRNAAKAALVYGAIDGSGGFYRNEVAYAARSRMNIPFFLPDAELDARFVAEAKAAG
LLALKGHKVVGGIRASLYNAMPLAGAEALVAFMADFQQRHG
>Q9S1G7 1.97.1.9~~~serC~~~Selenate reductase subunit gamma~~~
MRTSSMMKRMAAMSLAAAAAWATGAAAAADGAPAAQRTIQVLSVKGGDAASPQAAVWKKAPTGQVALQTAFPGHASIVGT
ALTQQMTAQAVRAGDRLFVRLAWRDATANTEIKDTDQFVDGAAVQFPVNGKDTTLAFMGDPDNPVNVWHWRADGRTRNLV
AKGFGTATPVPAEGLRSTATRTRDGWEVVISRPLRVKAEEGADLQGRRTMPIAFAAWDGENQERDGLKAVTMEWWQLNF
>Q9I5I6 1.1.1.387~~~~~~NAD-dependent L-serine dehydrogenase~~~
MKQIAFIGLGHMGAPMATNLLKAGYLLNVFDLVQSAVDGLVAAGASAARSARDAVQGADVVISMLPASQHVEGLYLDDDG
LLAHIAPGTLVLECSTIAPTSARKIHAAARERGLAMLDAPVSGGTAGAAAGTLTFMVGGDAEALEKARPLFEAMGRNIFH
AGPDGAGQVAKVCNNQLLAVLMIGTAEAMALGVANGLEAKVLAEIMRRSSGGNWALEVYNPWPGVMENAPASRDYSGGFM
AQLMAKDLGLAQEAAQASASSTPMGSLALSLYRLLLKQGYAERDFSVVQKLFDPTQGQ
>Q9S1G8 ~~~serD~~~Selenate reductase assembly chaperone protein~~~
MNALIDNPEALASGYLAMAQVFSYPDAGAWSRLTERGLVDPALTHETLEAEYLAAFEMGGGKATVSLYEGQNRPDLGRDG
ILQELLRFYEFFDAQLSEDDREYPDHLVTELEFLAWLCLQEHAAVRDGRDAEPFRRAARDFLDRHLAAWLPEFRRRLEAT
DSAYAQYGPALGELVEAHRSRLGEQAPQLGELQ
>A2RI87 ~~~serP1~~~Serine permease SerP1~~~COG1113
MEDIQKNHEAQRGLQNRHIQLIAIAGTIGTGLFLGAGKTIQMTGPSVIFAYILIGIAMFFFLRTIGEMLYNDPSQHSFLN
FVTKYSGIRTGYFTQWSYWLVIVFVCISELTAIGTYIQFWLPHLPLWLIEIVMLALLFGLNTLNSRFFGETEFWFAMIKV
AAILGMIVTAIILVASNFHYTTVLSGKTVNDTASLNNIFDGFQLFPHGAWNFVGALQMVMFAFTSMEFIGMTAAETVNPK
KSLPKAINQIPVRILLFYVGALLAIMAIFNWHYIPADKSPFVIVFQLIGIKWAAALINFVVLTSAASALNSSLFSATRNM
YSLAKQHDKGRLTAFTKLSKAGIPINALYMATALSLLAPVLTLIPQIKNAFNFAASCTTNLFLVVYFITLYTYWQYRKSD
DYNPNGFLTPKPTIAVPFIAIIFAIVFASLFFNADTFYPALGAIVWTLIFGLYSHFKKI
>A2RI86 ~~~serP2~~~DL-alanine permease SerP2~~~COG1113
MNTNQNDENIEKQPSQRGLKNRHIQLIAIAGTIGTGLFLGAGKSIHLTGPSIIFVYLIIGALMYILLRAIGEMLYQDPSQ
HSFLNFVSRYMGAKPGYFIQWSYLLVVVFVAMAELIAIGTYINFWLPDLPIWMTEVFVLVLLTLLNTLNPKFFGETEFWF
GMIKIVAIIGLILTAIILIFSHYHTGTDTVSLTNITKGFEFFPNGVSSFFESFQMVMFAFVSMEFIGMTAAETDNPRPTL
KKAINQIPIRIVLFYIGALLAIMSIYQWRDIPADKSPFVTIFQLIGIKWAAALVNFVVLTSAASALNSALFSITRNLYSL
SQLNDDKILKPFTKFSKAGVPVNALLFTSLLILFTPFISMIPAISNSFVFITSVATNLFLVVYLMTLITYLKYRKSKDFD
PSGFTLPAAHIFIPLAIAGFVLIFISLFCFKDTIIPAIGSVIWVLIFGLFTFFRKIKTAD
>A0A1J1EM40 2.1.5.1~~~sesA~~~Sesamin methylene transferase~~~
MTAEQAINEGAFSLAASFGFVPLEYRGYEAEVLASKETAYIGTALNGAMSPIYDVTGPDALEFLRSVCINSFRGFQVGQI
RHAVLCNDKGQILTDGVVARIDEDTYRTYWLAPALEYRLINSGLDVKGEDQSSNEFFFQLAGPRSLEVLEAAAHEDLHDI
AFGRHRMSTIAGIPVRILRLGMAGGLAYEVHGAAADTETAYRAIWEAGQPFGLVKQGLNAYLMQHTEAGFPNINLHYPLP
WYEDPDMAAFFDTRPTQNFYNKYRFFYGSVGPDAEARFVTPYQIGLGKMVDFNHDFIGKEALQREAEADHWAAVTLVWNE
DDVADVVASKYRGRDVEPYDKIDDRPFDIYHNLGQPGFAYHADWVLADGERIGTSTGRINSVYYRRMISLGFIDKRHAAE
GTELTVLWGRPGTPQKEIRVTVGRYPYFDLEKNNAIDVASIPRPALDVSAGA
>P31675 ~~~setA~~~Sugar efflux transporter A~~~COG2814
MIWIMTMARRMNGVYAAFMLVAFMMGVAGALQAPTLSLFLSREVGAQPFWIGLFYTVNAIAGIGVSLWLAKRSDSQGDRR
KLIIFCCLMAIGNALLFAFNRHYLTLITCGVLLASLANTAMPQLFALAREYADNSAREVVMFSSVMRAQLSLAWVIGPPL
AFMLALNYGFTVMFSIAAGIFTLSLVLIAFMLPSVARVELPSENALSMQGGWQDSNVRMLFVASTLMWTCNTMYIIDMPL
WISSELGLPDKLAGFLMGTAAGLEIPAMILAGYYVKRYGKRRMMVIAVAAGVLFYTGLIFFNSRMALMTLQLFNAVFIGI
VAGIGMLWFQDLMPGRAGAATTLFTNSISTGVILAGVIQGAIAQSWGHFAVYWVIAVISVVALFLTAKVKDV
>Q5ZU30 ~~~setA~~~Subversion of eukaryotic traffic protein A~~~COG3774
MYKIYSYLGWRIDMKTENLPQAGQEAQIDKKIHFIWVGHIMPQKNIQVVSEWAEKNPGYETIIWVDKKIAPAKELDLFIL
DMKSKGITVKDINEEGVCRDSIRHELDQESPNYGMVSDMLRLNILAAEGGIYLDSDILCSAPFPDEIYAPFGFLLSPWSQ
GANNTLCNDIILCSKGNQIIQQLADAIEQSYIARDSFEFTHEYASMKETKGERIAKTLGVTGPGFLFHQLKKMGILNDKS
EMEAIHWELQDQRYLIDGSVKEPDYFYVPQNNTNDASWVPSIKRPGIENMSFQERLENAVQLIAFDIQKTGLFNLDHYAN
ELKVKQNSWCIAAETSPELKPDSYLLIRPRDKTGEWTLYYVDEDKKLNPVTLPVIKGAIKLSEVSDPLRKFHTLLSQVSD
PVNPTAHELKQIGRALIELKPRQDEWHCKNKWSGAEEIAQELWQRITSNETLRAQIKQCFTQFESLKPRVAELGLTRASG
AGTEVEAHESTVKEQEIISQNTVGEEGTKEKNSVQLASENSSDEKIKTAHDLIDEIIQDVIQLDGKLGLLGGNTRQLEDG
RVINIPNGAAMIFDDYKKYKQGELTAESALESMIKIAKLSNQLNRHTFFNQRQPETGQFYKKVAAIDLQTTIAAEYDNNH
GLRI
>P33026 ~~~setB~~~Sugar efflux transporter B~~~COG2814
MHNSPAVSSAKSFDLTSTAFLIVAFLTGIAGALQTPTLSIFLTDEVHARPAMVGFFFTGSAVIGILVSQFLAGRSDKRGD
RKSLIVFCCLLGVLACTLFAWNRNYFVLLFVGVFLSSFGSTANPQMFALAREHADKTGREAVMFSSFLRAQVSLAWVIGP
PLAYALAMGFSFTVMYLSAAVAFIVCGVMVWLFLPSMRKELPLATGTIEAPRRNRRDTLLLFVICTLMWGSNSLYIINMP
LFIINELHLPEKLAGVMMGTAAGLEIPTMLIAGYFAKRLGKRFLMRVAAVGGVCFYAGMLMAHSPVILLGLQLLNAIFIG
ILGGIGMLYFQDLMPGQAGSATTLYTNTSRVGWIIAGSVAGIVAEIWNYHAVFWFAMVMIIATLFCLLRIKDV
>P31436 ~~~setC~~~Sugar efflux transporter C~~~COG2814
MQKTATTPSKILDLTAAAFLLVAFLTGIAGALQTPTLSIFLADELKARPIMVGFFFTGSAIMGILVSQFLARHSDKQGDR
KLLILLCCLFGVLACTLFAWNRNYFILLSTGVLLSSFASTANPQMFALAREHADRTGRETVMFSTFLRAQISLAWVIGPP
LAYELAMGFSFKVMYLTAAIAFVVCGLIVWLFLPSIQRNIPVVTQPVEILPSTHRKRDTRLLFVVCSMMWAANNLYMINM
PLFIIDELHLTDKLTGEMIGIAAGLEIPMMLIAGYYMKRIGKRLLMLIAIVSGMCFYASVLMATTPAVELELQILNAIFL
GILCGIGMLYFQDLMPEKIGSATTLYANTSRVGWIIAGSVDGIMVEIWSYHALFWLAIGMLGIAMICLLFIKDI
>P12730 ~~~sfaA~~~S-fimbrial protein subunit SfaA~~~
MKLKFISMAVFSALTLGVATNASAVTTVNGGTVHFKGEVVDAACAVNTNSANQTVLLGQVRSAKLANDGEKSSPVGFSIE
LNDCSSATAGHASIIFAGNVIATHNDVLSLQNSAAGSATNVGIQILDHTGTAVQFDGVTASTQFTLTDGTNKIPFQAVYY
ATGKSTPGIANADATFKVQYQ
>P13429 ~~~sfaG~~~S-fimbrial protein subunit SfaG~~~
MVKDIIKTVTFSCMLAGSMFVTCHVCAAGSVVNITGNVQDNTCDVDINSRNFDVSLGSYDSRQFTAAGDTTPASVFHVGL
TSCGSAVRAVKLTFTGTPDNQEAGLIQINSINGARGVGIQLLDKDKHELKINVPTTIALMPGTQTIAFYARLKATYLPVK
AGNVDAVVNFVLDYQ
>P13431 ~~~sfaH~~~S-fimbrial protein subunit SfaH~~~
MAYSQPSFALLCRNNQTGQEFNSGDTSFRVNVSPVVQYDKSISVLDLSQLVSCQNEDSTGQNYDYLKILKGSGFSPALDT
KTYGRLDFTSRPTGYARQLPLQFDLQVTEAFYQYGVWKPFPAKLYLYPEPGVFGKVINNGDLLATLYVNKFSTKGQEAGE
RNFTWRFYATNDVHIQTGTCRVSSNNVKVDLPSYPGGPVTVPLTVRCDQTQSVSYTLSGPVTGSGNTVFANTAASGAGGV
GVQLSDKAGPVPAGQPRSLGQVGSSPVSLGLKASYALTGQASLTPGAVQSVINVTFSYN
>Q03424 3.4.21.-~~~~~~Serine protease 1~~~
MRRTTRARTGLSALLLAASLGLGAAPAGADAPQRPAPTPASDSAAALHALDAAVERTLGDDSAGTYVDAGTGELVVTVTT
EAAAAKVRAAGATPRRVQRGAAELDAAMAALEARAKIPGTSWGLDPRTNRIAVEADSSVSARDLARLRKVAASLDGAVSV
TRVPGVFQREVAGGDAIYGGGSRCSAAFNVTKNGVRYFLTAGHCTNLSSTWSSTSGGTSIGVREGTSFPTNDYGIVRYTT
TTNVDGRVNLYNGGYQDIASAADAVVGQAIKKSGSTTKVTSGTVSAVNVTVNYSDGPVYGMVRTTACSAGGDSGGAHFAG
SVALGIHSGSSGCTGTNGSAIHQPVREALSAYGVNVY
>P41140 3.4.21.-~~~~~~Serine protease 2~~~
IAGGEAIYAAGGGRCSLGFNVRSSSGATYALTAGHCTEIASTWYTNSGQTSLLGTRAGTSFPGNDYGLIRHSNASAADGR
VYLYNGSYRDITGAGNAYVGQTVQRSGSTTGLHSGRVTGLNATVNYGGGDIVSGLIQTNVCAEPGDSGGALFAGSTALGL
TSGGSGNCRTGGTT
>P13430 ~~~sfaS~~~S-fimbrial adhesin protein SfaS~~~
MKLKAIILATGLINCIAFSAQAVDTTITVTGNVLQRTCNVPGNVDVSLGNLYVSDFPNAGSGSPWVNFDLSLTGCQNMNT
VRATFSGTADGQTYYANTGNAGGIKIEIQDRDGSNASYHNGMFKTLNVQNNNATFNLKARAVSKGQVTPGNISSVITVTY
TYA
>P51025 3.1.2.12~~~frmB~~~S-formylglutathione hydrolase FrmB~~~COG0627
MELIEKHVSFGGWQNMYRHYSQSLKCEMNVGVYLPPKAANEKLPVLYWLSGLTCNEQNFITKSGMQRYAAEHNIIVVAPD
TSPRGSHVADADRYDLGQGAGFYLNATQAPWNEHYKMYDYIRNELPDLVMHHFPATAKKSISGHSMGGLGALVLALRNPD
EYVSVSAFSPIVSPSQVPWGQQAFAAYLAENKDAWLDYDPVSLISQGQRVAEIMVDQGLSDDFYAEQLRTPNLEKICQEM
NIKTLIRYHEGYDHSYYFVSSFIGEHIAYHANKLNMR
>P33018 3.1.2.12~~~yeiG~~~S-formylglutathione hydrolase YeiG~~~COG0627
MEMLEEHRCFEGWQQRWRHDSSTLNCPMTFSIFLPPPRDHTPPPVLYWLSGLTCNDENFTTKAGAQRVAAELGIVLVMPD
TSPRGEKVANDDGYDLGQGAGFYLNATQPPWATHYRMYDYLRDELPALVQSQFNVSDRCAISGHSMGGHGALIMALKNPG
KYTSVSAFAPIVNPCSVPWGIKAFSSYLGEDKNAWLEWDSCALMYASNAQDAIPTLIDQGDNDQFLADQLQPAVLAEAAR
QKAWPMTLRIQPGYDHSYYFIASFIEDHLRFHAQYLLK
>P44556 3.1.2.12~~~~~~S-formylglutathione hydrolase~~~COG0627
MKLIEQHQIFGGSQQVWAHNAQTLQCEMKFAVYLPNNPENRPLGVIYWLSGLTCTEQNFITKSGFQRYAAEHQVIVVAPD
TSPRGEQVPNDAAYDLGQGAGFYLNATEQPWATNYQMYDYILNELPDLIEANFPTNGKRSIMGHSMGGHGALVLALRNRE
RYQSVSAFSPILSPSLVPWGEKAFSAYLGEDREKWQQYDASSLIQQGYKVQGMRIDQGLEDEFLPTQLRTEDFIETCRVA
NQPVDVRFHKGYDHSYYFIASFIGEHIAYHAEFLK
>A1AXZ2 3.1.2.12~~~fghA~~~S-formylglutathione hydrolase~~~COG0627
MTLAYETVSENRSFGGIQGVYRHQSQATGTPMTFAIYLPPDARHGKVPVLWYLSGLTCTHENAMTKAGAQEWAAEYGIAV
IFPDTSPRGEGVANDETYDLGQGAGFYVDATEAPWAPHFRMWHYVTHELPELVFNNFPLDREAQGITGHSMGGHGALTIA
MTFPERYRSVSAFAPIAHPSESDWGRKQFAAYLGDDKAAWKRHDSTILMREKGYPGEVLIDQGASDQFLDLLKPEALAHA
MAERRQPGTFRMQQGYDHSYFFVQSFMADHIRWHAERLG
>P0ABW5 ~~~sfmA~~~Uncharacterized fimbrial-like protein SfmA~~~COG3539
MKLRFISSALAAALFAATGSYAAVVDGGTIHFEGELVNAACSVNTDSADQVVTLGQYRTDIFNAVGNTSALIPFTIQLND
CDPVVAANAAVAFSGQADAINDNLLAIASSTNTTTATGVGIEILDNTSAILKPDGNSFSTNQNLIPGTNVLHFSARYKGT
GTSASAGQANADATFIMRYE
>B0CN26 6.2.1.66~~~sfmB~~~Nonribosomal peptide synthase SfmB~~~
MNEMAGKAYGLAVRLDLRGALERTALQSALSAVVERHEALRTGLRQIDGTLTQVVVPGVTVSLPVVDLGGRGPDPAQLDR
EVRRLARQEAQRGWNLAQPPLLRGLLARLADDHHVLLLCVHHAVCDGLSLQIVLRELLENYTGGGPAGADEPLQFADYVV
WNNGGEEFPDPEWSARRQAAREHWSSTLAGAPQVLDLPTDRRRPPLQSYAGARVPVRLDTAFADRVRDWSAQRGVTPFTT
LLAAYTVVLARNGGADDLLVGLPVANRSHADLAGTVGYLANTCPLRADLRADPTLGELVRDLYDRLTGVLEHADLPFGEL
VELLAPPRMPERNPVFQVMFGLQQDVRRGWDLPGLRVDVEDVDCGNARVDLSLFLFEEADGAIDGFLEYASALFDRATAE
RFADQLHTVLRQILRDARIPVSAVDLVGNGSATTIDSFLDGGPLEQPWPLVWPRIRELAARRPSAEAVRDDAEALDYASL
VDRVDAAAARLTAAGAGPGDRVAVLAERGVRAVVAMLACWRAGGVYVPVDPAAPLPRRELILEQAAPAVLVCEDPDEQPP
HHRSRAVAIGDLTAEADAGAGTPAEPAPRPHDPAYLMFTSGSTGRPKGVAVSHANLSSFLHALTGRLALGPADRLLALTT
TAFDISLLELLGPLVTGGTVVVAPSSAQRGAADLAARLSSPGITTAQATPAVWRLALSAGWRPREGFTLLCGGEALPPDL
ADLLAATPAEAHNLYGPTETTIWSCAARIRPGEPVTIGRPIPGTRVLVADAALRPVPPGVCGELLVGGPGVALGYLDDPA
RTAARFVPDPYHPGERLYRTGDVVRLRSDGLIEFVGRVDEQVKVRGHRIELGEIESALRALPGVRDAAATVLDPRGNARI
AGYLVADDGALDTAGRAARLRQDLSEALPASMVPSELYAVPAIPLNPNGKVDRRALPGTGRRLEGGSERVAPSTDAEHAV
AALWCELLSLPEVGVREDFFGLGGHSLLAADLLQRLERDLGARVPVAEFFMEPTVARLAATVSQLSGAVHQTPTQDPDGR
AEQTAEPSPEDRGADGDGWDFPTVRRSVTVPGSDPVPLEVSP
>A0LNN5 ~~~~~~L-lactate transporter~~~COG2223
MADQQTTMPRWVPLLLGLLGSTTCGMLLYAWSVFIKPLNAEFGWSRAEIAMAFAICCLIFGLMTFPAGRLSDKMGPRKVV
MTGGVLLAIGFILSGFIQSKYQLYITYGVIAGFGGGMIYLPPIATAPKWWPDRRALATGFAVVGLGLGSFLMGPLATYII
EKPGMGWRYVFWYCGVAMGIMALIAGAFLEPPPAGWKPAGYTPPAPPAGAAAPKVTRDWTYEEAKGDTKFWLLYLAYFCG
SFAGLMVIGHLAGFGRDAGLTAMAAAGAVSSLAFSNAATRILSGWFVDKIGIRVYFAALFALQTAAMIAIFQLGGSVVGL
SIVAIVIGWNYGAMFTLFPATCLQFYGPTAQGSNYGLLFTACGLAGFAGPWVGGWLKDTTGTYYLPFLCAAALCALGTAI
VFMTKPPEKKHA
>P77249 ~~~sfmC~~~Probable fimbrial chaperone SfmC~~~COG3121
MMTKIKLLMLIIFYLIISASAHAAGGIALGATRIIYPADAKQTAVWIRNSHTNERFLVNSWIENSSGVKEKSFIITPPLF
VSEPKSENTLRIIYTGPPLAADRESLFWMNVKTIPSVDKNALNGRNVLQLAILSRMKLFLRPIQLQELPAEAPDTLKFSR
SGNYINVHNPSPFYVTLVNLQVGSQKLGNAMAAPRVNSQIPLPSGVQGKLKFQTVNDYGSVTPVREVNLN
>P77468 ~~~sfmD~~~Outer membrane usher protein SfmD~~~COG3188
MKIPTTTDIPQRYTWCLAGICYSSLAILPSFLSYAESYFNPAFLLENGTSVADLSRFERGNHQPAGVYRVDLWRNDEFIG
SQDIVFESTTENTGDKSGGLMPCFNQVLLERIGLNSSAFPELAQQQNNKCINLLKAVPDATINFDFAAMRLNITIPQIAL
LSSAHGYIPPEEWDEGIPALLLNYNFTGNRGNGNDSYFFSELSGINIGPWRLRNNGSWNYFRGNGYHSEQWNNIGTWVQR
AIIPLKSELVMGDGNTGSDIFDGVGFRGVRLYSSDNMYPDSQQGFAPTVRGIARTAAQLTIRQNGFIIYQSYVSPGAFEI
TDLHPTSSNGDLDVTIDERDGNQQNYTIPYSTVPILQREGRFKFDLTAGDFRSGNSQQSSPFFFQGTALGGLPQEFTAYG
GTQLSANYTAFLLGLGRNLGNWGAVSLDVTHARSQLADASRHEGDSIRFLYAKSMNTFGTNFQLMGYRYSTQGFYTLDDV
AYRRMEGYEYDYDGEHRDEPIIVNYHNLRFSRKDRLQLNVSQSLNDFGSLYISGTHQKYWNTSDSDTWYQVGYTSSWVGI
SYSLSFSWNESVGIPDNERIVGLNVSVPFNVLTKRRYTRENALDRAYASFNANRNSNGQNSWLAGVGGTLLEGHNLSYHV
SQGDTSNNGYTGSATANWQAAYGTLGGGYNYDRDQHDVNWQLSGGVVGHENGITLSQPLGDTNVLIKAPGAGGVRIENQT
GILTDWRGYAVMLYATVYRYNRIALDTNTMGNSIDVEKNISSVVPTQGALVRANFDTRIGVRALITVTQGGKPVPFGSLV
RENSTGITSMVGDDGQVYLSGAPLSGELLVQWGDGANSRCIAHYVLPKQSLQQAVTVISAVCTHPGS
>B0CN28 1.11.2.5~~~sfmD~~~3-methyl-L-tyrosine peroxygenase~~~
MTAPADTVHPAGQPDYVAQVATVPFRLGRPEELPGTLDELRAAVSARAGEAVRGLNRPGARTDLAALLAATERTRAALAP
VGAGPVGDDPSESEANRDNDLAFGIVRTRGPVAELLVDAALAALAGILEVAVDRGSDLEDAAWQRFIGGFDALLGWLADP
HSAPRPATVPGAGPAGPPVHQDALRRWVRGHHVFMVLAQGCALATACLRDSAARGDLPGAEASAAAAEALMRGCQGALLY
AGDANREQYNEQIRPTLMPPVAPPKMSGLHWRDHEVLIKELAGSRDAWEWLSAQGSERPATFRAALAETYDSHIGVCGHF
VGDQSPSLLAAQGSTRSAVGVIGQFRKIRLSALPEQPATQQGEPS
>P38052 ~~~sfmF~~~Uncharacterized fimbrial-like protein SfmF~~~COG3539
MRRVLFSCFCGLLWSSSGWAVDPLGTININLHGNVVDFSCTVNTADIDKTVDLGRWPTTQLLNAGDTTALVPFSLRLEGC
PPGSVAILFTGTPASDTNLLALDDPAMAQTVAIELRNSDRSRLALGEASPTEEVDANGNVTLNFFANYRALASGVRPGVA
KADAIFMINYN
>P75715 ~~~sfmH~~~Uncharacterized fimbrial-like protein SfmH~~~COG3539
MAMACLCLANISWATVCANSTGVAEDEHYDLSNIFNSTNNQPGQIVVLPEKSGWVGVSAICPPGTLVNYTYRSYVTNFIV
QETIDNYKYMQLHDYLLGAMSLVDSVMDIQFPPQNYIRMGTDPNVSQNLPFGVMDSRLIFRLKVIRPFINMVEIPRQVMF
TVYVTSTPYDPLVTPVYTISFGGRVEVPQNCELNAGQIVEFDFGDIGASLFSAAGPGNRPAGVMPQTKSIAVKCTNVAAQ
AYLTMRLEASAVSGQAMVSDNQDLGFIVADQNDTPITPNDLNSVIPFRLDAAAAANVTLRAWPISITGQKPTEGPFSALG
YLRVDYQ
>B0CN31 2.1.1.304~~~sfmM2~~~L-tyrosine C(3)-methyltransferase~~~
MTISLENTTVGQNPAGGPPTGKAPLDMEGLAWILFGASAFQYLNAACELNLFELLENKPGLTKPQIGAELGLADRANDIL
LLGATATGMLTVEDGRYQLATVLAELLKTDDWQRFKDTVGFEQYVCYEGQIDFTESLRSNSNVGLRRVRGSGRDLYHRLH
ENPQMEQAFYKYMRSWSELANQHLVEVLDLSGTSKLLDCGGGDAVNSIALAQANPHIEAGILEIPPTAPLTEKKIAEAGL
SDRITVKPGDMHTDEFPTGYDTVMFAHQLVIWTPEENTALLRKAYNALPEGGRVIIFNSMSNDEGDGPVVAALDSVYFAA
LPAEGGMIYSWATYEESLTKAGFNPETFQRIDFPGWTPHGVIIATK
>Q2FW72 6.3.2.57~~~sfnaB~~~Staphyloferrin A synthase~~~COG4264
MVYLEWAKADRNIQYRVINAIIKERIYPEQTFISQKGSLIEIQYHMHVLTIEVVRKSALERYEFTGDITYLNKGETSLII
TLEGLLDVLNHDFDIPISERLREELIHSRDSLVETYKQMSHRQTLISQSFKFSRLPQDINFFSWLQHVKDSDKTDDLTYS
ESLVPEGHPTHPLTKTKLPLTMEEVRAYAPEFEKEIPLQIMMIEKDHVVCTAMDGNDQFIIDEIIPEYYNQIRVFLKSLG
LKSEDYRAILVHPWQYDHTIGKYFEAWIAKKILIPTPFTILSKATLSFRTMSLIDKPYHVKLPVDAQATSAVRTVSTVTT
VDGPKLSYALQNMLNQYPGFKVAMEPFGEYANVDKDRARQLACIIRQKPEIDGKGATVVSASLVNKNPIDQKVIVDSYLE
WLNQGITKESITTFIERYAQALIPPLIAFIQNYGIALEAHMQNTVVNLGPHFDIQFLVRDLGGSRIDLETLQHRVSDIKI
TNDSLIADSIDAVIAKFQHAVIQNQMAELIHHFNQYDCVEETELFNIVQQVVAHAINPTLPHANELKDILFGPTITVKAL
LNMRMENKVKQYLNIELDNPIKKEV
>Q2FW70 6.3.2.58~~~sfnaD~~~D-ornithine--citrate ligase~~~COG4264
MNLNLIFKEQTLKFNKEEQETYLFLQQHNSDWANIFKEMILQGRDKVTQRLVTSMHRENLVKARTQSKKILSRDLIMLDI
STTHILEIQFPQAKQTLYAPITGEHAFDRIDVEGPFYIKDDITNTITRVHHPNEILECILIEAPDLKNAASDQFQQDLIN
SATNMTFAISYQALSMQHDSAPLFNIIENSEDSYLRSEQAVIEGHPLHPGAKLRKGLNALQTFLYSSEFNQPIKLKIVLI
HSKLSRTMSLSKDYDTTVHQLFPDLIKQLENEFTPKFNFNDYHIMIVHPWQLDDVLHSDYQAEVDKELIIEAKHTLDYYA
GLSFRTLVPKYPAMSPHIKLSTNVHITGEIRTLSEQTTHNGPLMTRILNDILEKDVIFKSYASTIIDEVAGIHFYNEQDE
ADYQTERSEQLGTLFRKNIYQMIPQEVTPLIPSSLVATYPFNNESPIVTLIKRYQSAASLSDFESSAKSWVETYSKALLG
LVIPLVTKYGIALEAHLQNAIATFRKDGLLDTMYIRDFEGLRIDKAQLNEMVYSTSHFHEKSRILTDSKTSVFNKAFYST
VQNHLGELILTISKASNDSNLERHMWYIVRDVLDNIFDQLVLSTHKSNQVNENRINEIKDTMFAPFIDYKCVTTMRLEDE
AHHYTYIKVNNPLYRENN
>Q845S8 1.14.14.-~~~sfnC~~~Probable FMNH2-dependent monooxygenase SfnC~~~
MNAPVNTPPRPALAIARELAGQFAQTAVERDDRGGTPKAERDALRDSGLLSLVIPQAFGGQGASWHDTFAVVREFARVDS
SIAHVFGFHHLMLATVRLFSRPDQWQPWFEQTARKQWFWGNALNPLDTRTVVKHFDGWCEFSGKKSFCSGASDSEMLIAS
AVDERAGGKLLIAAIPSGRTGISLHNDWNNIGQRQTDSGSATFERVRVEHNELLLDPGPLSTPFAALRPLIAQLHFANLF
LGIAEGAFEEARQYTLKESRPWFRSSAASSAEDPYVLRHYGEFWVGLESVRLLIERAARQLDAAWAKEHALTAQERGDLA
LAIGTAKVAASRHGLDICNRLFEVTGARATHASLRFDRHWRNLRTQTLHDPVDYRIHELGEWALNDKRPAPSFYS
>Q845S9 1.5.1.42~~~sfnE~~~NADH-dependent FMN reductase SfnE~~~
MSTPLNVVALSGGTSRPSRTLALTEAILAELAEHLHIKPHLIELGEIARPLGSALWRSELPEAVEQQLRLVEKADLLVVT
TPVYRGSFTGHFKHLFDLIGQDALVDTPVLLAATGGSERHALVLDHQLRPLFSFLQALTLPIGVFASQAEMADYRVSSAA
LAARIRLAAERAVPLFGAHHALRKSA
>Q3K9A2 1.5.1.42~~~sfnF~~~NADH-dependent FMN reductase SfnF~~~COG0431
MSRPLKVVALSGGTWRPSRTLVLTQALLAELSGHLPIESHLIELGDIARPLGAALSRQELPAEIEAELQAIEQADLLIVA
APVYRGSYPGLLKHLFDLIDLNALIDTPVLLAATGGSERHALVLDHQLRPLFSFFQAVTLPIGVYATEADFADYQITSEP
LKARIRLAAERAAPLFGTHLKPLLKIA
>Q65YX0 1.5.1.42~~~sfnF~~~NADH-dependent FMN reductase SfnF~~~
MNTSVIRVVVVSGSLRAPSRTHGLLQALVEKLQARLSNLDVHWVRIAELCSALSGSLERDTASADLQLHLQAIEQADLLL
VGSPVYRASYTGLFKHLFDLVDHQSLRGVPVVLAATGGSERHALMIDHQLRPLFAFFQAHTLPYGLYASVEAFDQHHLVE
PAQFERIERVLDTVSAFFQIPVASAA
>Q3KC85 1.14.14.35~~~sfnG~~~FMNH(2)-dependent dimethylsulfone monooxygenase~~~COG2141
MSQQAVKFAYWVPNVSGGLVVSRIEQRTDWGIDYNRKLAQLAEAAGFEYALTQIRFTAGYGAEFQHESVAFSHALLAATS
QLKVIAAILPGPWQPALAAKQLATIDQLTNGRIAVNIVSGWFRGEFQAIGEHWLEHDERYRRSEEFIRSLRGIWSQDNFT
FRGDFYRFDNYSLKPKPLGRPEIFQGGSSRAARDMAARVSDWYFTNGNSVEGIKAQVDDIRAKAAANHHSVKIGVNAFVI
ARDTEEEAKAVLAQIIDQADPEAVNAFGDAAKQAGRASPEGEGNWAKSTFEDLVQYNDGFKTNLIGTPQQIAERIVALKA
VGVDLVLAGFLHFQEEVEYFGQRVLPLVRELEAKAQSARTAEVA
>Q65YW9 1.14.14.35~~~sfnG~~~FMNH(2)-dependent dimethylsulfone monooxygenase~~~
MSQPIKFAYWVPNVSGGLVVSKIEQRTSWDIDYNRKLAQIAERSGFEYALSQIRFTAGYGADNQHESVTISHALLAATEK
LKVIAAILPGPWSPALAAKQLATIDQFTGGRIAVNVVSGWFKGEFRAIGEPWLEHDERYRRSEEFIRALKGIWTQDNFSF
HGDFYRFNDYTLKPKPLQRPHPEIFQGGSSRAARDMASRVSDWYFTNGNSVEGIKAQVDDIRAKAAANGHAVKIGVNAFV
IARDTEEEARAVLAEIIAKADPEAVNGFGSEVKNAGAASPEGEGNWAKSTFEDLVQYNDGFKTNLIGTPRQIAERIVALK
AIGVDLILSGFLHFQEEVEYFGRHVLPLVRELEQERRAAVAVA
>Q845S7 ~~~sfnR~~~Sigma54-dependent transcriptional activator SfnR~~~
MQLLTLPPSPTLATSIRATAQVFEDPRSQALLAHLQQVAPSEASVLIIGETGTGKELVARHIHNLSGRRNGPFVAVNCGA
FSESLVEAELFGHEKGAFTGALAAKAGWFEEANGGTLFLDEIGDLPLPIQVKLLRVLQEREVVRLGSRKSIPINVRVLAA
TNVQLEKAINAGHFREDLYYRLNVVTLQLHPLRDRPGDILPLARHFIRSYSDRLGYGPVELSAKAQAKLVEYSWPGNIRE
LENVIHHSLLTCGDGTVQAQDLRLSNLRIERQEEEPAGNGVEDLLQRAFSRLYEEQSGDLYEKVENALLRSAYRFCHYNQ
VHTAQLLGLSRNITRTRLIAIGELVVNKRRGQEQQVLDNRVVRLSI
>P39135 2.7.8.7~~~sfp~~~4'-phosphopantetheinyl transferase Sfp~~~
MKIYGIYMDRPLSQEENERFMSFISPEKREKCRRFYHKEDAHRTLLGDVLVRSVISRQYQLDKSDIRFSTQEYGKPCIPD
LPDAHFNISHSGRWVICAFDSQPIGIDIEKTKPISLEIAKRFFSKTEYSDLLAKDKDEQTDYFYHLWSMKESFIKQEGKG
LSLPLDSFSVRLHQDGQVSIELPDSHSPCYIKTYEVDPGYKMAVCAAHPDFPEDITMVSYEELL
>Q74FU6 1.-.-.-~~~sfrA~~~NADPH-Fe(3+) oxidoreductase subunit alpha~~~COG3383
MVSLTIDGKDITVAKETTILDAAALLGITIPTLCWLKKVSPTGACRVCAVEIEGVDRPMTACNTPVKDGIKVTTQSEKLS
RIRQKIMELMLVNHPLDCPVCDAGGECDLQNACYGLGAAKQEYGAVLERRKIRYDWPLIESDPNRCILCEKCVKVDHEIV
GCNAIRVVNRGEATIIDTVDGNPLNCEFCGNCVAACPTGTLISKPFKFRGRPWAFTTTPSVCPFCATGCQIEYHSRNGRV
ERVTSDDSTYNSGNLCINGRFGYSYINSPDRLAEPMVKGQKADWNTAMGTAATALKQIVASHGADAVAGFGSPRVTNEDN
YLFQKLMRSAIGTGNIDSEARLGFAATQKVLREMLGIAGASTTIDAIDRATAVLVVGCDLNAEATGMEYRVIKAATKNNA
KLVLAAMRDIKLKKFANSHLKYRPGNETLLINALTKAVLEEGLENKEFCSANISNLSDLTAALAGVSIADAAAATGVTEA
DLRAAARLVGGKKGVAVIFGAELMRGGNTDAVKALINLALILGATAGDTGGLFPVYEKTNIRGLLDMGVAPDHFPGHQTD
GTTFEKAWGKKLPAAAGKDLWQIIEGIEQGSVKALYLLGCDPVASFPEGERIRKALEKLELLIVQDPFPGEAAKMAHVVF
PSSVAAEKNGTFTTIDGRVQPLAKAVAPSGDAREDWDILTELYNRLTGESRIHSPAAVLDEVAALVPAYASVGRTGGTIT
AQPRSGGLALAPVSARAVAGSPTTLLVGTILYHSGTTTTWSKNNLEIIPKGYIEIHPNDAAKLGIAEGGKVRLSAGSVKV
EGTAKITPRVQPGLLFAPSHFRGMNVNALLSRDGGVVPVTVEKA
>Q74FU5 1.-.-.-~~~sfrB~~~NADPH-Fe(3+) oxidoreductase subunit beta~~~COG0493
MAQVVFSSWGRTIVDNRKGGEAQDVSFRLPTTLDGERQIAAFMGWDGIILYDLKVDVPAMAAEYMKRVQTQYCCGKCTPG
KKGTKVLADVLAAIIEGRATEADLDTIDDLADLLTNCKCTLCQSSTIPVLDAVKHFREDFLAYITGIRKPANVHRFIDKY
TAPCMDRCPAHIDIPAYIEAIKEYRFDESLDIIRDNMPLPSVCGRVCPHPCETHCRRKNVDDSVNIMVLKRSASDYEWMH
NAAPPMQPKPQKNKKVAIVGAGPAGLACAYYLALEGYPCTIYEALPEGYGGGMIAVGIPPYRQPRHLLQRDIDIISSMGV
DIIYDTRIGKDISLEELKQKFDAVFLAPGAHRSKPMGVEGEDKGYKGFLKGGIDFLREAYMGRPTGMGKKVVVVGGGNTA
IDCVRVALREGAEESTLLYRRSRKEMPADVWEVDGADEEGVRFEFQVLPTRVLVDENEQVTGVECVRMALGEPDASGRRR
PEPVPGSEFVVECDTVIPAIGQDPDLSFIPDNLGIDITKWNTVVTKYVPLKDAAGKDLKDGMGNPLARVLITDLEGVFAG
GDAEIGPLTVVACIGNAHRAARVIQRWLEEGKAYLTEDELMEDILTNMPVYDKNEKVPWLDSRERAHQAEVHGQERASKG
NYQEVELGFVDTQAVEEAERCLRCYRVAMAAI
>P0A823 ~~~sfsA~~~Sugar fermentation stimulation protein A~~~COG1489
MEFSPPLQRATLIQRYKRFLADVITPDGRELTLHCPNTGAMTGCATPGDTVWYSTSDNTKRKYPHTWELTQSQSGAFICV
NTLWANRLTKEAILNESISELSGYSSLKSEVKYGAERSRIDFMLQADSRPDCYIEVKSVTLAENEQGYFPDAVTERGQKH
LRELMSVAAEGQRAVIFFAVLHSAITRFSPARHIDEKYAQLLSEAQQRGVEILAYKAEISAEGMALKKSLPVTL
>C0SP86 ~~~sftA~~~DNA translocase SftA~~~COG1674
MSWLHKFFDLFLGESEEDAERETKPAQIPQQQEVHHPEGQLKRLEDPKIYYEYPKGKFRFPVVPDGYKNHDLRRRRTPSD
EPKSAPRPSAAPYRERPRNEEEQHTYQAAEPAKKPFKPTNIPSPVYGFNQKPSVKKDVPKKPSETLNEPDKSVKEKVTLL
SEEIERERGYPASDTQAHSKIESPFFPDTQFEKQPSGVLNRKDTEHDEALAKRPAEPSGNKVPFESGVQQPEKEEPFFPA
EQAEEQTPPEMLTDTAAEGLSDSEVGREEPATAEEEQREQQPEKFEEPVFSAELDEEQTAPESQTEAVSEDEKAKEPSDS
PVYNHHENAAEGAESPFVQEEQMDIRQEEPLFTDHEYSSEALAQAETVAKESEEPSESIINNHYDTLGEAQETKIDVQPD
SHTELEKTEHMEQGSKSSTATLENRQEIRADKPREASEEPKKRPGVQEKRTEQSASSQKGPSVPFNVMMLKRDTHKQQKA
EERRGSYVFPNVALLDVPPAQVQDDTAWIEEQRQLLDLTLKNFNVRANVVHVTQGPSVTRFEVHPEPGVKVNKITNLSDD
IKLSLSAKDIRIEAPIPGKNTIGIEVPNRTSKVVDLRQMIRSSAFRTSKSPLTAALGLDISGNPVVIDLKKMPHGLIAGA
TGSGKSVCINTILVSLLYKADPSEVKVLLIDPKMVELAPYNKIPHLVSPVITDAKAATAALKWVVEEMERRYELFAHSGV
RDIDRFNQLTAEHQMGEKLPYLVVIIDELADLMMVAPNDVEESIARIAQKARACGIHLLVATQRPSVDVITGLIKANIPT
RIAFSVSSQVDSRTIIDIAGAEKLLGKGDMLFLENGSGKPVRLQGNFVSDREIDRVVSHVRSQMPPTYLFEQEELVRQGS
ALKEEDELFYEACEFVVEQNSASTSSLQRRFRIGYNRAARLIDMMEAEGMISEAKGSKPREVLITASDLINE
>O08374 2.6.1.45~~~sgaA~~~Serine--glyoxylate aminotransferase~~~
MTVTPHLFIPGPTNIPDAVRMAMNIPMEDMRSPEFPKFTLPLFEDLKKAFKMKDGRVFIFPSSGTGAWESAVENTLATGD
KVLMSRFGQFSLLWVDMCERLGLKVEVCDEEWGTGVPVEKYADILAKDKNHEIKAVFVTHNETATGVSSDVAGVRKALDA
AKHPALLMVDGVSSVGSLDMRMGEWGVDCCVSGSQKGFMLPTGLGILAVSQKALDINKSKNGRMNRCFFSFEDMIKTNDQ
GFFPYTPATQLLRGLRTSLDLLFAEGLDNVFARHTRLASGVRAAVDAWGLKLCAKEPKWYSDTVSAILVPEGIDSNAITK
TAYYRYNTSFGLGLNKVAGKVFRIGHLGMLDEVMIGGALFAAEMALKDNGVNLKLGSGTGAAAEYFSKNATKSATALTPK
QAKAA
>P55819 2.6.1.45~~~sgaA~~~Serine--glyoxylate aminotransferase~~~COG0075
MAATRRPGRNHLFVPGPTNIPDRVMRAMMVQSEDHRSVDFPSLTKPLFEDTKKVFGSTEGTIFLFPASGTGIWESALSNT
LARGDKVLAARFGQFSHLWIDMAQRLGLDVVVQEEEWGTGAKPEKIEEALRADKNHEIKAVMVVHNETATGVTSNIGAVR
KAIDAAGHPALLFVDGVSSIGSLPFKADEWKVDCAIAGSQKGLMLPAGLGVICVSQKALKAAEGQSGRNDRLARVYFDWE
DQKKQNPTGYFPYTPPLPLLYGLREALACLFEEGLENVYHRHAVLGEATRQAVAAWGLKTCAKSPEWNSDTVTAILAPEG
VDAAKIIKHAYVRYNLALGAGLSQVAGKVFRIGHVGDLNELSLLGAIAGAEMSLIDNGVKVTPGSGVAAASSYLRENPLA
KA
>P37680 5.1.3.4~~~sgbE~~~L-ribulose-5-phosphate 4-epimerase SgbE~~~COG0235
MLEQLKADVLAANLALPAHHLVTFTWGNVSAVDETRQWMVIKPSGVEYDVMTADDMVVVEIASGKVVEGSKKPSSDTPTH
LALYRRYAEIGGIVHTHSRHATIWSQAGLDLPAWGTTHADYFYGAIPCTRQMTAEEINGEYEYQTGEVIIETFEERGRSP
AQIPAVLVHSHGPFAWGKNAADAVHNAVVLEECAYMGLFSRQLAPQLPAMQNELLDKHYLRKHGANAYYGQ
>P37678 4.1.1.85~~~sgbH~~~3-keto-L-gulonate-6-phosphate decarboxylase SgbH~~~COG0269
MSRPLLQLALDHSSLEAAQRDVTLLKDSVDIVEAGTILCLNEGLGAVKALREQCPDKIIVADWKVADAGETLAQQAFGAG
ANWMTIICAAPLATVEKGHAMAQRCGGEIQIELFGNWTLDDARDWHRIGVRQAIYHRGRDAQASGQQWGEADLARMKALS
DIGLELSITGGITPADLPLFKDIRVKAFIAGRALAGAANPAQVAGDFHAQIDAIWGGARA
>Q8GMG6 1.14.14.15~~~sgcC~~~(3S)-3-amino-3-(3-chloro-4-hydroxyphenyl)propanoyl-[peptidyl-carrier protein SgcC2] monooxygenase~~~
MPHGAEREASPAEESAGTRPLTGEEYLESLRDAREVYLDGSRVKDVTAHPAFHNPARMTARLYDSLHDPAQKAVLTAPTD
AGDGFTHRFFTAPRSVDDLVKDQAAIASWARKSYGWMGRSPDYKASFLGTLGANADFYEPFADNARRWYRESQEKVLYWN
HAFLHPPVDRSLPADEVGDVFIHVERETDAGLVVSGAKVVATGSALTHAAFISHWGLPIKDRKFALVATVPMDADGLKVI
CRPSYSANAATTGSPFDNPLSSRLDENDAILVLDQVLIPWENVFVYGNLGKVHLLAGQSGMIERATFHGCTRLAVKLEFI
AGLLAKALDITGAKDFRGVQTRLGEVLAWRNLFWSLSDAAARNPVPWKNGTLLPNPQAGMAYRWFMQIGYPRVLEIVQQD
VASGLMYVNSSTEDFRNPETGPYLEKYLRGSDGAGAVERVKVMKLLWDAVGSDFGGRHELYERNYSGNHENTRIELLLSQ
TASGKLDSYMDFAQACMDEYDLDGWTAPDLESFHAMRSASRDLLGGL
>Q8GMH4 2.6.1.86~~~sgcD~~~2-amino-4-deoxychorismate synthase~~~
MTDQCVVSAPVRVRTRRLDVKETGALPAYRALAEHFGPDEVYLLESAAGPARDRRHQFVGFGALLSLSVTDRVVRVEGVP
ALRGLLLERAGALLEDGPQGLRLRTAGGLWPLLRAMRDMFDAEGSASGFRFGFLGFFGYDTARYIEDLPHLIENRPGLPD
VRMVLHRGSVVTDLATGRCELLLHESPYWPGLAPETVTGLLADVEQAWPDPSADGFPASAVTDDSAPEVFANDVERCLKH
IAVGDIYQVQIGHELSIRSTADPADVYQRLRGRNASPYMYLAGIDGHRLIGASPELFVRIEDGEVTMRPIAGTVPRSGAD
GGIAAGVRLRSDPKEIAEHTMLVDLCRNDIGRIARPNTLDVPDQLDVEGYSHVLHLVSTVVGRARVDTDAFDTIAALFPA
GTMTGAPKIRAMEIIESVERSRRGLYAGALGLLDVGGYTNLALCIRTLFHHEGVYRTRASAGIVADSEPGAEWTETLAKM
SATHWAVTGEELL
>Q8GME2 1.5.1.37~~~sgcE6~~~NADH-dependent FAD reductase~~~
MSPIIAPPAELVDPKDRVQLRRVFGDFPTGVTVVTVGGSEPRGMTANSFTSVSLSPPLVLICVGKDAVMHQRLTALPTFA
VSVLEAGQEKAARHFADHSRPPGVDQFDTVDWVLGEESGAPLIAGAVAHLECAIHRLYEGGDHTIFLGEVITATRWPARE
GMLFSGGRFRRFAPDADEGRAA
>Q8GMH2 1.3.8.16~~~sgcG~~~2-amino-4-deoxychorismate dehydrogenase~~~
MSAQLKILAINGSERDGNTADVLRHAARVAENRGVDFEAVDLRSIRMERCGPCGDCNDRPVACTLADGVPEVVAKMVAAD
GIIFAAPVHGFGTASLMQTFIERAGVGYLRFDRPLSNKVAGIISVARRYSAGEVWAQLTVNALLNRMILVGSGFPATVHA
LHRGDALKDEEGLTNVSRLVERMTDMIELLDEHRRLTGRSDVLASNEVNERVGLALNELQAQP
>Q7CXU0 2.8.3.-~~~caiB~~~Succinate--glutarate CoA-transferase~~~COG1804
MTDMPNRKPPLSGIRVIELARVLAGPWAGQMLADMGADVIKVENPEGGDDTRAWGPPFVESADGENLSAAYYHATNRGKR
SIVADLKTPEGCALVRRLVRTADVVIENFKRDGLAKYGLDYESLRVLNPKLIYCSITGFGQTGPYADFAGYDYIVQGMSG
FMSITGEPDGQPMKAGVAVADIFTGIYSVSAIQAALIHAMRSGEGQHIDMALLDVQSAVLANQNMNYLISGRPPIRLGNA
HPNISPYEVVPTADGFLILAVGNDGQFRRLCNILGIGAIADDERYATNKARVAHKVEVRQIISTETLKWNKRDLLTACET
NAVPAGPINSIEEMFADPQVQARGLRVDLEAEDGTVIPGVRTPIIMSQTPLRYERPSPKLGEHQAQVLAELETIERTATP
>P0DOV7 4.2.1.162~~~~~~6-deoxy-6-sulfo-D-gluconate dehydratase~~~
MSEKHKKIEELRSQRWFAPDTIRAFAHRQRLQQIGLRREEFMGKPVIAILNTWSEMSPCHSHLRDRAEAVKRGVWAAGGF
PVELPVQSVGEVMVKPTTMLYRNLLAMEAEELLRSLPIDGAVLLGGCDKSTPGLLMGALSMDLPVIYCPAGPMSNGQWRG
VKTGAGTHTKKYWDERRLGLIDTVAWEELEGAMTRSIGTCNTVGTASTMTSIADAMGFTLPGASSIPAADGAHPRMASQC
GSAIVDLVWRDRRPSTWLTDKHVANGVAVYMAMGGSTNAAIHLIAIARRAGIDLTLDQLAAAAAKIPVLLNLFPSGTALM
EDYHFAGGLRALMRKIEPHLHLECEGATGQSWDSLLADAPCYDDDIIRSLDNPVVSLEQGATLALLRGNLCPDGAVMKSS
AAEPRLRRHSGPALVFDDHETLSRMIDDPALEVTADTVLILRNAGPVGAPGMPEWGNLPIPKRLLEAGVRDLLRISDSRM
SGTHYGSCVLHVAPEAAVGGPLALVRTGDIIDLDVAAGTLNMRVSDDELARRRAGHVPQHKTYGRSFAALYQQHVTQANE
GCDFDFLQAGEAVPEPPIH
>P96169 ~~~sglT~~~Sodium/glucose cotransporter~~~
MSNIEHGLSFIDIMVFAIYVAIIIGVGLWVSRDKKGTQKSTEDYFLAGKSLPWWAVGASLIAANISAEQFIGMSGSGYSI
GLAIASYEWMSAITLIIVGKYFLPIFIEKGIYTIPEFVEKRFNKKLKTILAVFWISLYIFVNLTSVLYLGGLALETILGI
PLMYSILGLALFALVYSIYGGLSAVVWTDVIQVFFLVLGGFMTTYMAVSFIGGTDGWFAGVSKMVDAAPGHFEMILDQSN
PQYMNLPGIAVLIGGLWVANLYYWGFNQYIIQRTLAAKSVSEAQKGIVFAAFLKLIVPFLVVLPGIAAYVITSDPQLMAS
LGDIAATNLPSAANADKAYPWLTQFLPVGVKGVVFAALAAAIVSSLASMLNSTATIFTMDIYKEYISPDSGDHKLVNVGR
TAAVVALIIACLIAPMLGGIGQAFQYIQEYTGLVSPGILAVFLLGLFWKKTTSKGAIIGVVASIPFALFLKFMPLSMPFM
DQMLYTLLFTMVVIAFTSLSTSINDDDPKGISVTSSMFVTDRSFNIAAYGIMIVLAVLYTLFW
>P0DOV6 3.1.1.99~~~~~~6-deoxy-6-sulfogluconolactonase~~~
MNETLKCVVRQPSVLGECPVWSVREQVLYWADILAGRLHRLDPRDGSVSTLQLPEELGCFGLREQGGFIVALRSGIYLLD
AHGQLGERLAENPTGAEHSRFNDGRVDPWGRFWAGTLWQPRDRNGGQLLRVDAEHRAQVMAGDVMVSNGLAFSPDRAWAY
HSDTPNHVLYRYPLDEDGQPGTRQLLREFARGSGGRPDGAAFDSAGCYWSAQFDGGRVLRLSPDGQVLDEIQLPTRWPTM
VAFGGEDLRTLYITSSRENRSAEELADWPLSGCVFATRVNVPGCAEPLFAG
>Q7M0R2 2.1.1.179~~~sgm~~~16S rRNA (guanine(1405)-N(7))-methyltransferase~~~
MTAPAADDRIDEIERAITKSRRYQTVAPATVRRLARAALVAARGDVPDAVKRTKRGLHEIYGAFLPPSPPNYAALLRHLD
SAVDAGDDEAVRAALLRAMSVHISTRERLPHLDEFYRELFRHLPRPNTLRDLACGLNPLAAPWMGLPAETVYIASDIDAR
LVGFVDEALTRLNVPHRTNVADLLEDRLDEPADVTLLLKTLPCLETQQRGSGWEVIDIVNSPNIVVTFPTKSLGQRSKGM
FQNYSQSFESQARERSCRIQRLEIGNELIYVIQK
>O52057 ~~~sgpA~~~Sulfur globule protein CV1~~~
MIKSNRITACALAALFAGASFSASAWWGGPGYGNGLWDNMGDMFGDGYGDFNMSMGGGGRGYGRGYGRGNGYGYGAPYGY
GAPYGYGAPYGYGAPYGYGAPYGAMPYGAMPPQMPAAPAQPQAAPSR
>O52179 ~~~sgpB~~~Sulfur globule protein CV2~~~
MKKLATAAAVAALLGASASASAWWGPGWGGPGYGSGMGDWMNDMFGDGYGDFNMSMSGGGRGYGRGNGYGRGYGYGNPYY
GYGYPYYGGYGAPYGAYGAPYAFPYGAPYGAPVAPAAPAQSESK
>O52055 ~~~sgpC~~~Sulfur globule protein CV3~~~
MTMKRLLLVSTLAGASALATLPANAFWGWNPFGWGGGPWDGPWGGGPWGSPWYGGYPYHGGYYPYGLYGVPYGWGAPVYG
YPGYVYPGYAYPAPTQPSTKSQ
>Q5X3A8 4.1.2.27~~~~~~Probable sphingosine-1-phosphate lyase~~~
MFGFISDLLTAAVSSLDELLQDTPAHQIILGTAALYFLYNQYHNPSISRWCRSRNNASMKQRIIDSAYALAKNLPGVNQI
IEKELNKELSSTREKLRIQRSGMTLREEIPEEGLSPQDILSAFDVDVEKCHFDFLSVTNDSPEREFLVGRGDGKDSGALY
AIHPKELTELLKEVYGATALTNPLHDKWPRINAMQAEVIRWCQNLFHGSKEGYGLLTHGGTTSIIEAMAAYVIRARAKGI
DYPEIVVPETAHAAFKKAAELTGAILITVPVDKKTGAVNPNVMSSYITRNTAVMVGSAPSFMNGIHDPISELGQLAKKKN
VPFHVDACLGGFLTAFLDTSSEPMDFRVPGVTSISADLHKYGCCPKGTSVCLFSEDSPALSVYAALNWSGGLYATPGILD
GSTSGARVAEVYATLSYYGKNKYQEIAKSIIRLRNAIQKELTALVEEGNGLTSEDIYVYGNPQWSILGFRSNTCNAHFIA
DELEKRGWKLNLLQNPDGFHLCLTHVHTLVGSFETQFIKDLREAVIDVKNYPPGKKPSGNVKVYGAVGMMPVELQKEICK
QYQKARLDFTAASHGSLGIFAPSSTEEDDGLRNRKVGEQKVQTSL
>P33595 ~~~sgrR~~~HTH-type transcriptional regulator SgrR~~~COG4533
MPSARLQQQFIRLWQCCEGKSQDTTLNELAALLSCSRRHMRTLLNTMQDRGWLTWEAEVGRGKRSRLTFLYTGLALQQQR
AEDLLEQDRIDQLVQLVGDKATVRQMLVSHLGRSFRQGRHILRVLYYRPLRNLLPGSALRRSETHIARQIFSSLTRINEE
NGELEADIAHHWQQISPLHWRFFLRPGVHFHHGRELEMDDVIASLKRINTLPLYSHIADIVSPTPWTLDIHLTQPDRWLP
LLLGQVPAMILPREWETLSNFASHPIGTGPYAVIRNSTNQLKIQAFDDFFGYRALIDEVNVWVLPEIADEPAGGLMLKGP
QGEEKEIESRLEEGCYYLLFDSRTHRGANQQVRDWVSYVLSPTNLVYFAEEQYQQLWFPAYGLLPRWHHARTIKSEKPAG
LESLTLTFYQDHSEHRVIAGIMQQILASHQVTLKIKEIDYDQWHTGEIESDIWLNSANFTLPLDFSVFAHLCEVPLLQHC
IPIDWQADAARWRNGEMNLANWCQQLVASKAMVPLLHHWLIIQGQRSMRGLRMNTLGWFDFKSAWFAPPDP
>C1P5Z7 ~~~sgrT~~~Putative inhibitor of glucose uptake transporter SgrT~~~
MRQFYQHYFTATAKLCWLRWLSVPQRLTMLEGLMQWDDRNSES
>B1W3T1 ~~~shbA~~~ECF RNA polymerase sigma factor ShbA~~~COG1595
MRDDETTVIGALVHRAVEGDAQATHDLLAHVHPLALRYCRSRLNRLPGDARHFVEDLAQEVCVAVLMALPRYKDTGRPFE
AFVFAIAGHKVADLQRAAMRHPGSTAVPSDEMPERPDDSLGPEERALLSSDAAWAKKLLANLPENQRELLVLRVAVGLTA
EETGQMLGMSPGAVRVAQHRALSRLRALAEQ
>Q4R102 ~~~shdD~~~Protein ShdD~~~
MKCHRCGSDNVRKMVDSPVGDAWEVYVCEKCCYSWRSTENPVVMEKFKLDDNKIANMGVIPPIPPLKK
>P44774 1.1.1.25~~~sdhL~~~Shikimate dehydrogenase-like protein HI_0607~~~COG0169
MINKDTQLCMSLSGRPSNFGTTFHNYLYDKLGLNFIYKAFTTQDIEHAIKGVRALGIRGCAVSMPFKETCMPFLDEIHPS
AQAIESVNTIVNDNGFLRAYNTDYIAIVKLIEKYHLNKNAKVIVHGSGGMAKAVVAAFKNSGFEKLKIYARNVKTGQYLA
ALYGYAYINSLENQQADILVNVTSIGMKGGKEEMDLAFPKAFIDNASVAFDVVAMPVETPFIRYAQARGKQTISGAAVIV
LQAVEQFELYTHQRPSDELIAEAAAFARTKF
>A4QH18 ~~~shiA~~~Shikimate transporter~~~
MTINSPAPKVGDTMSHEAYSAKAHPTKGTPQAKRAALSAFLGSTLEYYDFFIYGSAAALVFSHVFFPEGGANSVLLSIST
LGVAYVARPAGAVLFGHLGDTIGRRKTLMIILFTMGLSTLAIGLLPTYGQVGILAPIMLVVLRLLQGLSAGGESPGAAAL
SMEHAPQRRRGFYSSFTISGVMFGIVLSSVVFIPIAALPDEQLFSWGWRIPFILSIILTGVALWIRRHLEESETFEEEVE
EAKVPETPVVELFKNHWAAVFRVMFSSTYAMMNTILNVFGLAYAVYSDIERTTMLGVIAVANLMAVFSQPFFGLLSDKIG
RKPVFITGALGAGGMMFVFFNAISTGNIAMIYLSAMVLMGLFYAAPNATYMAAFPEQFPAKVRYSGMAIGLMLGLLVSGF
APAVAEMLTGGQAENWQPVAWMCLGFAVLSAVAFATGKETYKTPTHLLGK
>P76350 ~~~shiA~~~Shikimate transporter~~~COG0477
MDSTLISTRPDEGTLSLSRARRAALGSFAGAVVDWYDFLLYGITAALVFNREFFPQVSPAMGTLAAFATFGVGFLFRPLG
GVIFGHFGDRLGRKRMLMLTVWMMGIATALIGILPSFSTIGWWAPILLVTLRAIQGFAVGGEWGGAALLSVESAPKNKKA
FYSSGVQVGYGVGLLLSTGLVSLISMMTTDEQFLSWGWRIPFLFSIVLVLGALWVRNGMEESAEFEQQQHYQAAAKKRIP
VIEALLRHPGAFLKIIALRLCELLTMYIVTAFALNYSTQNMGLPRELFLNIGLLVGGLSCLTIPCFAWLADRFGRRRVYI
TGTLIGTLSAFPFFMALEAQSIFWIVFFSIMLANIAHDMVVCVQQPMFTEMFGASYRYSGAGVGYQVASVVGGGFTPFIA
AALITYFAGNWHSVAIYLLAGCLISAMTALLMKDSQRA
>A4QH19 ~~~shiR~~~HTH-type transcriptional regulator ShiR~~~
MEIRWLEGFIAVAEELHFSNAAIRLGMPQSPLSQLIRRLESELGQKLFDRSTRSVELTAAGRAFLPHARGIVASAAVARE
AVNAAEGEIVGVVRIGFSGVLNYSTLPLLTSEVHKRLPNVELELVGQKLTREAVSLLRLGALDITLMGLPIEDPEIETRL
ISLEEFCVVLPKDHRLAGEDVVDLVDLAEDGFVTTPEFAGSVFRNSTFQLCAEAGFVPRISQQVNDPYMALLLVGAGVGV
AITTHGTGLLAPPNTVHLPIKQHSVELRHGIAWMKGSGRVARDAVIDIALDIFKP
>C1P611 ~~~shoB~~~Small toxic protein ShoB~~~
MTDCRYLIKRVIKIIIAVLQLILLFL
>Q08002 3.4.24.-~~~shpI~~~Neutral metalloprotease ShpI~~~
MINKKKLVTSLVTSSLLATFTLGSFADAHTYIINNEDINKNAQESSIGTLKQNNFKQSTIDSMKPRNLQSFQEDKVFKAP
KEKTPITERARKSENALSNSKLNDVRSFTTVNMRTNENERTAAKLKYNGKNTNVWVADNYITDKQAKNIGEEFDNKIDPL
VKEKFGEPSDVDHDGKVNILVYDIKDDFETTGSYTGGYFHPRDLYDVPHSNKAEVFYMDTYPSMGTDKNNLNEKKVYSTL
AHEYQHMVNANQKLLKEQKEDGMDVWLDEAFAMASEHMYLQKPLDHRIEYYNNSTSIANGHSLIKWNHRGDVLSNYALSY
LFSQYLSAQSDNGDKIFKEILQDPANTSEALENAIHKHVDPKMSLGEFMTNFRVALEKKEATGLHGFNGAPGLNSISPKP
VRELPQTLAPQGSVMFETTSPIKVPKDKDEKVNYVKVK
>P81238 ~~~shp~~~Cytochrome c-type protein SHP~~~COG1858
MTRFLILSAVLAGPALAGDTSPAQLIAGYEAAAGAPADAERGRALFLSTQTGGKPDTPSCTTCHGADVTRAGQTRTGKEI
APLAPSATPDRFTDSARVEKWLGRNCNSVIGRDCTPGEKADLLAWLAAQ
>H2VFI5 3.2.1.183~~~siaA~~~UDP-N-acetylglucosamine 2-epimerase~~~
MKRILCITGTRADFGKLKPLLAYIENHPDLELHLIVTGMHMMKTYGRTYKEVTRENYQHTYLFSNQIQGEPMGAVLGNTI
TFISRLSDEIEPDMVMIHGDRLEALAGAAVGALSSRLVCHIEGGELSGTVDDSIRHSISKLSHIHLVANEQAVTRLVQMG
EKRKHIHIIGSPDLDVMASSTLPSLEEVKEYYGLPYENYGISMFHPVTTEAHLMPQYAAQYFKALELSGQNIISIYPNND
TGTESILQELLKYQSDKFIAFPSIRFEYFLVLLKHAKFMVGNSSAGIREAPLYGVPSIDVGTRQSNRHMGKSIIHTDYET
KNIFDAIQQACSLGKFEADDTFNGGDTRTSTERFAEVINNPETWNVSAQKRFIDLNL
>Q9KR66 ~~~siaM~~~Sialic acid TRAP transporter large permease protein SiaM~~~COG1593
MVGSIFGWLGLLFAGMPVGFSLIFVALAFLILTNSTGINFAAQQMLGGIDNFTLLAVPFFVLTGHLMNSAGITERIFNFA
KSLVGHITGSLGHVNIMASLLFSGMSGSALADAGGLGQLEIKSMRDAKYHDDFAGGLTAASCIIGPLVPPSVPLVIYGVV
SNTSIGALFLAGAIPGLLCCIALMVMSYFICKKRGYMTLPKASRREQFKSLKEAFLSLLTPVIIIGGIFSGKFTPTEAAA
VSSLYALFLGTVVYNTLTLQGFIEILKETVNTTAVVALMVMGVTVFGWIVAREQLPQMLADYFLTISDNPLVLLLLINLL
LLFLGTFIESLALLLLLVPFLVPVASAVGIDPVHFGVMAILNLMIGILTPPMGMALYVVSRVGDIPFHTLTRGVLPLLVP
LFIVLALVAVFPQFTLLLPELFLGYGQ
>P44542 ~~~siaP~~~Sialic acid-binding periplasmic protein SiaP~~~COG1638
MMKLTKLFLATAISLGVSSAVLAADYDLKFGMNAGTSSNEYKAAEMFAKEVKEKSQGKIEISLYPSSQLGDDRAMLKQLK
DGSLDFTFAESARFQLFYPEAAVFALPYVISNYNVAQKALFDTEFGKDLIKKMDKDLGVTLLSQAYNGTRQTTSNRAINS
IADMKGLKLRVPNAATNLAYAKYVGASPTPMAFSEVYLALQTNAVDGQENPLAAVQAQKFYEVQKFLAMTNHILNDQLYL
VSNETYKELPEDLQKVVKDAAENAAKYHTKLFVDGEKDLVTFFEKQGVKITHPDLVPFKESMKPYYAEFVKQTGQKGESA
LKQIEAINP
>Q9KR64 ~~~siaP~~~Sialic acid-binding periplasmic protein SiaP~~~COG1638
MKTINKITIAILTLSAAASVNAATTLKMGMQASVGSVEYNSAKMLADTLEEMSQGEIKLALYPSAQLGDDRAMLQQLTLG
DLDITYAEFGRMGLWIPRAEAVMLPYVAKDFDHLRRMFESDFGQGVRDEMLQKFNWRALDTWYNGTRETTSNRPLNSIED
FKGLKLRVPNAKQNLNYAKLSGASPTPMSFSEVYLALQTNAVDGQENPLPTIKTMKFYEVQKNLAMTHHIVNDQMVIISE
STWQKLSDTDKDIIQKAVQKVGDAHTQTVKTQEAELVSFFKSEGINVTYPDLEPFREAMQPLYKEFDSNIGQPIVSKLAA
M
>Q9KR65 ~~~siaQ~~~Sialic acid TRAP transporter small permease protein SiaQ~~~COG3090
MELKMLRKIINNIEEIITVPLMAALLAVLTWQIGTRWLLNDPSLWSEELARLLFMYMCLVGCAIAIKRSSHVNITFFSDK
LPEKARLSLVLSLEIAVLVSIGAIIVLGYQHAQRNAFFELITLGISSSWMNYSLPVGGVFMVFRQLEKIFNLMKLLLGVS
SSASLIDQQVTER
>P44543 ~~~siaT~~~Sialic acid TRAP transporter permease protein SiaT~~~COG1593
MKYINKLEEWLGGALFIAIFGILIAQILSRQVFHSPLIWSEELAKLLFVYVGMLGISVAVRKQEHVFIDFLTNLMPEKIR
KFTNTFVQLLVFICIFLFIHFGIRTFNGASFPIDALGGISEKWIFAALPVVAILMMFRFIQAQTLNFKTGKSYLPATFFI
ISAVILFAILFFAPDWFKVLRISNYIKLGSSSVYVALLVWLIIMFIGVPVGWSLFIATLLYFSMTRWNVVNAATEKLVYS
LDSFPLLAVPFYILTGILMNTGGITERIFNFAKALLGHYTGGMGHVNIGASLLFSGMSGSALADAGGLGQLEIKAMRDAG
YDDDICGGITAASCIIGPLVPPSIAMIIYGVIANESIAKLFIAGFIPGVLITLALMAMNYRIAKKRGYPRTPKATREQLC
SSFKQSFWAILTPLLIIGGIFSGLFSPTESAIVAAAYSVIIGKFVYKELTLKSLFNSCIEAMAITGVVALMIMTVTFFGD
MIAREQVAMRVADVFVAVADSPLTVLIMINALLLFLGMFIDALALQFLVLPMLIPIAMQFNIDLIFFGVMTTLNMMVGIL
TPPMGMALFVVARVGNMSVSTVTKGVLPFLIPVFVTLVLITIFPQIITFVPNLLIP
>C0LTM1 1.14.13.223~~~sibG~~~3-hydroxy-4-methyl-anthranilyl-[aryl-carrier protein] 5-monooxygenase~~~
MRILVNGGGPAGMAFAMFAARSGRGDEITVRDWTGPGDTYGFGVILPPAAVEVFRDAEPDLADELNSHITAWDRLSVHRH
GRTASIPAPRLGAMDRRTLLKVLRRRCAERGVRFEHGAVDPALGDHDLVVAADGARSVTRRHRAAAFGTTTREIGPAYIW
LGADRALEHLRFLVAETPDGPAVAHAYPYSPDRSTFLVEADGAPPPAVLAEWFAGPLGGARLLENRSRWSRFQEIHNRTW
SAGNVVLIGDAAHTAHYSIGSGTRLALDDARALADALCAQPRLADALKGYEDARRPIVEHTQRIGRLSATWFTRLPDVPM
ERLLDDLATRGGQISWRDLATEGSGAVPVRG
>P69066 ~~~sicA~~~Chaperone protein SicA~~~
MDYQNNVSEERVAEMIWDAVSEGATLKDVHGIPQDMMDGLYAHAYEFYNQGRLDEAETFFRFLCIYDFYNPDYTMGLAAV
CQLKKQFQKACDLYAVAFTLLKNDYRPVFFTGQCQLLMRKAAKARQCFELVNERTEDESLRAKALVYLEALKTAETEQHS
EQEKE
>P0CL16 ~~~sicP~~~Chaperone protein SicP~~~
MGLPLTFDDNNQCLLLLDSDIFTSIEAKDDIWLLNGMIIPLSPVCGDSIWRQIMVINGELAANNEGTLAYIDAAETLLLI
HAITDLTNTYHIISQLESFVNQQEALKNILQEYAKV
>B8GWS7 ~~~sidA~~~Cell division inhibitor SidA~~~
MIRVARESFALVSVIGFVWMMCTVANLVA
>Q5ZSQ2 3.1.4.-~~~sidD~~~Adenosine monophosphate-protein hydrolase SidD~~~
MVYYEIIKDIVFTYNLQFTHLIHNDRISEVNLGGVTMRSIITQICNGVLHGQSYQSGSNDLDKGNSEIFASSLFVHLNEQ
GKEIIKHKDSDDKIVIGYTKDGMAFQIVVDGFYGCERQAVFSFIDNYVLPLIDNFSLDLTRYPDSKKVTESLIHTIYSLR
SKHAPLAEFTMSLCVTYQKDEQLFCAGFGIGDTGIAIKRNEGTIEQLVCHTEVDGFKDAFDNYSSANIDLVIERNSVFNT
KVMPGDELVGYTYVPPMLEMTEKEFEVETVDGKKINKRIVRHLNLDPGNFDDKDPLFSQLLQVVKSKQKQLVEQAKETGQ
IQRFGDDFTVGRLVIPDQLLINQLRIHALSIGVSDGLLSYIKNENENKGFLGIYGFFTGADKNIEKATLYKNLIAKYQNN
HFISLIILSALVSDSKTPLMTQYLVGYLDFPSKALLANKITELLLKELENPDMREILGSRLATDVIEELETKIIRYIHNP
AGSDIHSTLNLWTADKIKAATNSSLTI
>Q5ZYX7 ~~~sidE~~~Ubiquitinating enzyme SidE~~~COG1196
MPKYVEGIELTQEGMHAIFERMGHPNITSGTIYNGEPTIDKGALDRQGFMPVLTGVSPRQDSGHWIMLIKGQGNQYFLFD
PLGESSGKYYQNILAKKLPGATLSVIPNNAGLNMGLCGYWVASVGLRAHAALTQPIPPSLRNLGQTITQEMRDELTQDGS
EKITQWLRAVGNEFPDGDIQPDATALRRATEKNVRIDEFQPVLTGTSPKEISINPTAPQEVSVPTWNGFSLYTDETVRNA
ARYAYDNYLGKPYTGTVEATPVNFGGQMVYRQHHGLAHTLRTMAYAEIIVEEARKAKLRGESLKTFADGRTLADVTPEEL
RKIMIAQAFFVTGRDDEESSKNYEKYHEQSRDAFLKYVEENKSTLIPDVFKDEKDVKFYADVIEDKDHKWADSPAHVLVN
QGHMVDLVRVKQPPESYLEYYFSQLQPWIGSTATEAVFATQRQFFHATYEAVAGFDSENKEPHLVVDGLGRYVIGQDGNP
IREESDDEDEEESGELKFFSQKKKLEENQRYMRVDEYLKLDEVQKRFPGAGKKLDGGLPGLKEYQYLQRLNSINRARCEN
DVDFCLGQLQTAHHQTKITPIKRAFQSSSEKARRQPNMDEIAAARIVQQIMANPDCIHDDHVFLNGQKLEEKFFRDLLAK
CDMAIVGSLLNDTDIRNIDTLMQHERNTEFHSTDAKAKPVKLGETWEKTIRSGGGVTQIKHDLIFLMQNDAWYHTRVNAI
AQNRDKDSTFKEVLITALMTPLTNKSLMDTSRSPAPKTLFRGLDLSEEFKNKLINQAETIIANTTEHLFTDLSTEAFKQI
KLNDFSQVSARTCASTSTNIEVPRTIFGSNTIFEILDPDGLLHPKQVGTHVSGSESEYSIYLPEDVALVPIKVSFDGKTG
KGKDRHIFTLVAVKSPDFTPRHESGYAVGPLLKMQTPKLEEIQRLVEQAREEPDLERVFNLQSRVARQAKFSTESGYKTF
LNEKVAPVLEQSLNGLLDNNVTILGKVLSAFPSDGQWSAFNSVEARQMKIQMDAIKQMVEKKAVLEGQILPALAQCQNAL
EKQNIAGALQALRNIPSEKEMQTMLSISGGLRGQIQRAKQDLTETLEPLQRAITAKLVSDQEKVKVRYEKLIAGIPQQIA
DLEKAELADLAKVKKVVSRFNHLQEELKLLRNEKIRMHTGSEKVDFSDIAQLEAQLQKIHTKLYDAYLVELTKEISALVK
EKPKNLADVKRMVSNFYAMSADIEQLRQEKIKEHGESKDPIDMSDIDKLKEELQKINQFLVKAMGTNIRVSLNQMEVKTF
DAQEKEAQQNLKQLDALINKLESSDAVQKQKEELEKLNQLLVEKRKAYPAMVQLQFRSEALIIHLRELCEAHQAQMAKTR
NVRAQEITNGRWKVQWLTDWVGLTTDERVTLANKEKELAKFKEDLNNDEYDLQELISNLAEKNPSELEEAIGISKESAQK
LHKLLTHLNHSTTFMSKIEQRLQSIDELLNEFGKQAPRTEMIKTVEEKQGTLLRL
>Q5ZTK6 6.-.-.-~~~sidJ~~~Calmodulin-dependent glutamylase SidJ~~~COG1413
MFGFIKKVLDFFGVDQSEDNPSETAVETTDVSTKIKTTDTTQEESSVKTKTVVPTQPGGSVKPETIAPDQQKKHQIKTET
TTSTTKQKGPKVTLMDGHVKQYYFARRGETSTHDTSLPPPVKVLSGRSIPLKEIPFEATRNELVQIYLTSIDKLIKSNKL
NSIPSQQIASHYLFLRSLANSETDGIKKNQILSLAKPLGTYLASKEPHVWKMINELIEKSEYPIIHYLKNNRAHSNFMLA
LIHEYHKEPLTKNQSAFVQKFRDSSVFLFPNPIYTAWLAHSYDEDSSFNPMFRERLSTNFYHSTLTDNLLLRTEPKEVTL
SSEHHYKKEKGPIDSSFRYQMSSDRLLRIQGRTLLFSTPQNDVVAVKVQKKGEPKSTLEEEFEMADYLLKHQRRLDVHSK
LPQPLGQYSVKKSEILEISRGSLDFERFKTLIDDSKDLEVYVYKAPQSYFTYLHDKNQDLEDLTASVKTNVHDLFVLLRE
GIVFPQLADIFHTHFGEDEREDKGRYQALVQLLNVLQFQLGRIDKWQKAVEYVNLRSSGLADLGDSLPITSLFTSSDFTK
HYFSELLTGGYHPTFFDKSSGTANSLFTGKRRLFGNYLYLNTIAEYLLVIQLTLGSYGDKVTRDMMDKPKKEAVWRELAN
VMFTSCAEAIHIMTGIPQSRALTLLKQRANIEKHFRQTQFWMTPDYSKLDEDTLQMEQYSIYSGEPEYEFTDKLVSGVGL
SVDGVHQDLGGYNRESPLRELEKLLYATVTLIEGTMQLDKEFFKQLEQVEKILSGEIKTDANSCFEAVAQLLDLARPGCH
FQKRLVLSYYEEAKLKYPSAPTDAYDSRFQVVARTNAAITIQRFWREARKNLSEKSDIDSEKPESERTTDKRL
>Q56061 ~~~sifA~~~Secreted effector protein SifA~~~
MPITIGNGFLKSEILTNSPRNTKEAWWKVLWEKIKDFFFSTGKAKADRCLHEMLFAERAPTRERLTEIFFELKELACASQ
RDRFQVHNPHENDATIILRIMDQNEENELLRITQNTDTFSCEVMGNLYFLMKDRPDILKSHPQMTAMIKRRYSEIVDYPL
PSTLCLNPAGAPILSVPLDNIEGYLYTELRKGHLDGWKAQEKATYLAAKIQSGIEKTTRILHHANISESTQQNAFLETMA
MCGLKQLEIPPPHTHIPIEKMVKEVLLADKTFQAFLVTDPSTSQSMLAEIVEAISDQVFHAIFRIDPQAIQKMAEEQLTT
LHVRSEQQSGCLCCFL
>Q31ME3 ~~~sigA2~~~RNA polymerase sigma factor SigA2~~~COG0568
MSTSSAQYSPDLVRAYLQEIGRVRLLTAEEELCFGRQVQRLMMLLDAQTELRDRLGHEPSKEEWAAAVDLNLEDLDRQIE
QGQRAKRKMIEANLRLVVSIAKKYQKRHMEFLDLIQEGTLGLERGVEKFDPSKGYKFSTYAYWWIRQAITRAIAQQSRTI
RLPIHITEKLNKLKKTQRELSQQLGRSATASELAEVLELPLEQVREYIQMNRQPVSLDVKVGDSQDTELQELLEDEQSSP
SDYVEQESLRRDLRNLMAELTPQQQAVIALRYGLDEGDSLSLAKVGERLNISRERVRKLERQAMDHLRRRSRLLAEYAAS
>P06224 ~~~sigA~~~RNA polymerase sigma factor SigA~~~COG0568
MADKQTHETELTFDQVKEQLTESGKKRGVLTYEEIAERMSSFEIESDQMDEYYEFLGEQGVELISENEETEDPNIQQLAK
AEEEFDLNDLSVPPGVKINDPVRMYLKEIGRVNLLSAKEEIAYAQKIEEGDEESKRRLAEANLRLVVSIAKRYVGRGMLF
LDLIQEGNMGLMKAVEKFDYRKGYKFSTYATWWIRQAITRAIADQARTIRIPVHMVETINKLIRVQRQLLQDLGREPTPE
EIAEDMDLTPEKVREILKIAQEPVSLETPIGEEDDSHLGDFIEDQEATSPSDHAAYELLKEQLEDVLDTLTDREENVLRL
RFGLDDGRTRTLEEVGKVFGVTRERIRQIEAKALRKLRHPSRSKRLKDFLE
>P18333 ~~~sigA~~~RNA polymerase sigma factor SigA~~~
MRMDTLDSQAAEAAQEEEIQRKLEELVTLAKDQGFITYEEINEILPPSFDTPEQIDQVLIFLAGMDVQVLNQADVERQKE
RKKEAKELEGLAKRSEGTPDDPVRMYLKEMGTVPLLTREEEVEISKRIEKAQVQIERIILRFRYSTKEAVSIAQYLINGK
ERFDKIVSEKEVEDKTHFLNLLPKLISLLKEEDAYLEERLLALKDPALSKPDQARLNDELEKCRIRTQAYLRCFHCRHNV
TEDFGEVVFKAYDSFLQLEQQINDLKARAERNKFAAAKLAAARRKLHKREVAAGRTLEEFKKDVRMLQRWMDKSQEAKKE
MVESNLRLVISIAKKYTNRGLSFLDLIQEGNMGLMKAVEKFEYRRGYKFSTYATWWIRQAVTRAIADQARTIRIPVHMIE
TINKVLRGAKKLMMETGKEPTPEELGEELGFTPDRVREIYKIAQHPISLQAEVGDGGESSFGDFLEDTAVESPAEATGYS
MLKDKMKEVLKTLTDRERFVLIHRFGLLDGRPKTLEEVGSAFNVTRERIRQIEAKALRKMRHPIRSKQLRAFLDLLEEEK
IGSGKIKSYKN
>P9WGI1 ~~~sigA~~~RNA polymerase sigma factor SigA~~~COG0568
MAATKASTATDEPVKRTATKSPAASASGAKTGAKRTAAKSASGSPPAKRATKPAARSVKPASAPQDTTTSTIPKRKTRAA
AKSAAAKAPSARGHATKPRAPKDAQHEAATDPEDALDSVEELDAEPDLDVEPGEDLDLDAADLNLDDLEDDVAPDADDDL
DSGDDEDHEDLEAEAAVAPGQTADDDEEIAEPTEKDKASGDFVWDEDESEALRQARKDAELTASADSVRAYLKQIGKVAL
LNAEEEVELAKRIEAGLYATQLMTELSERGEKLPAAQRRDMMWICRDGDRAKNHLLEANLRLVVSLAKRYTGRGMAFLDL
IQEGNLGLIRAVEKFDYTKGYKFSTYATWWIRQAITRAMADQARTIRIPVHMVEVINKLGRIQRELLQDLGREPTPEELA
KEMDITPEKVLEIQQYAREPISLDQTIGDEGDSQLGDFIEDSEAVVAVDAVSFTLLQDQLQSVLDTLSEREAGVVRLRFG
LTDGQPRTLDEIGQVYGVTRERIRQIESKTMSKLRHPSRSQVLRDYLD
>P0A0J0 ~~~sigA~~~RNA polymerase sigma factor SigA~~~COG0568
MSDNTVKIKKQTIDPTLTLEDVKKQLIEKGKKEGHLSHEEIAEKLQNFDIDSDQMDDFFDQLNDNDISLVNEKDSSDTDE
KLNPSDLSAPPGVKINDPVRMYLKEIGRVNLLSAQEEIELAKRIEQGDEVAKSRLAEANLRLVVSIAKRYVGRGMLFLDL
IQEGNMGLIKAVEKFDFNKGFKFSTYATWWIRQAITRAIADQARTIRIPVHMVETINKLIRVQRQLLQDLGRDPAPEEIG
EEMDLPAEKVREILKIAQEPVSLETPIGEEDDSHLGDFIEDQEAQSPSDHAAYELLKEQLEDVLDTLTDREENVLRLRFG
LDDGRTRTLEEVGKVFGVTRERIRQIEAKALRKLRHPSRSKRLKDFMD
>Q99TT5 ~~~sigA~~~RNA polymerase sigma factor SigA~~~
MSDNTVKIKKQTIDPTLTLEDVKKQLIEKGKKEGHLSHEEIAEKLQNFDIDSDQMDDFFDQLNDNDISLVNEKDSSDTDE
KLNPSDLSAPPGVKINDPVRMYLKEIGRVNLLSAQEEIELAKRIEQGDEVAKSRLAEANLRLVVSIAKRYVGRGMLFLDL
IQEGNMGLIKAVEKFDFNKGFKFSTYATWWIRQAITRAIADQARTIRIPVHMVETINKLIRVQRQLLQDLGRDPAPEEIG
EEMDLPAEKVREVLKIAQEPVSLETPIGEEDDSHLGDFIEDQEAQSPSDHAAYELLKEQLEDVLDTLTDREENVLRLRFG
LDDGRTRTLEEVGKVFGVTRERIRQIEAKALRKLRHPSRSKRLKDFMD
>P18183 ~~~hrdB~~~RNA polymerase principal sigma factor HrdB~~~COG0568
MSASTSRTLPPEIAESVSVMALIERGKAEGQIAGDDVRRAFEADQIPATQWKNVLRSLNQILEEEGVTLMVSAAEPKRTR
KSVAAKSPAKRTATKAVAAKPVTSRKATAPAAPAAPATEPAAVEEEAPAKKAAAKKTTAKKATAKKTTAKKAAAKKTTAK
KEDGELLEDEATEEPKAATEEPEGTENAGFVLSDEDEDDAPAQQVAAAGATADPVKDYLKQIGKVPLLNAEQEVELAKRI
EAGLFAEDKLANSDKLAPKLKRELEIIAEDGRRAKNHLLEANLRLVVSLAKRYTGRGMLFLDLIQEGNLGLIRAVEKFDY
TKGYKFSTYATWWIRQAITRAMADQARTIRIPVHMVEVINKLARVQRQMLQDLGREPTPEELAKELDMTPEKVIEVQKYG
REPISLHTPLGEDGDSEFGDLIEDSEAVVPADAVSFTLLQEQLHSVLDTLSEREAGVVSMRFGLTDGQPKTLDEIGKVYG
VTRERIRQIESKTMSKLRHPSRSQVLRDYLD
>B1VXR4 ~~~hrdB~~~RNA polymerase principal sigma factor HrdB~~~COG0568
MSASTSRTLPPEIAESESVMALIERGKADGQIAGDDVRRAFEADQIPPTQWKNVLRSLNQILEEEGVTLMVSAAESPKRA
RKSVAAKSPVKRTATKTVAAKTTVTRTVAATAAPAVESADAADDAVAAAPAKKTAAKKATAKKAAAKKTTAKKTAAKKSG
KQDDEILDGDEAAEEVKAGKGEEEEGEGENKGFVLSDDDEDDAPAQQVAVAGATADPVKDYLKQIGKVPLLNAEQEVELA
KRIEAGLFAEDKLANADKLAPKLKRELEIIAEDGRRAKNHLLEANLRLVVSLAKRYTGRGMLFLDLIQEGNLGLIRAVEK
FDYTKGYKFSTYATWWIRQAITRAMADQARTIRIPVHMVEVINKLARVQRQMLQDLGREPTPEELAKELDMTPEKVIEVQ
KYGREPISLHTPLGEDGDSEFGDLIEDSEAVVPADAVSFTLLQEQLHSVLDTLSEREAGVVSMRFGLTDGQPKTLDEIGK
VYGVTRERIRQIESKTMSKLRHPSRSQVLRDYLD
>P77951 ~~~sigA~~~RNA polymerase sigma factor SigA~~~
MSASTSRTLPPEIAESESVMALIERGKADGQIAGDDVRRAFEADQIPPTQWKNVLRSLNQILEEEGVTLMVSAAESPKRA
RKSVAAKSPVKRTATKTVAAKTTVTRTVAATAAPAVESADAADDAVAAAPAKKTAAKKATAKKAAAKKTTAKKTAAKKSG
KQDDEILDGDEAAEEVKAGKGEEEEGEGENKGFVLSDDDEDDAPAQQVAVAGATADPVKDYLKQIGKVPLLNAEQEVELA
KRIEAGLFAEDKLANADKLAPKLKRELEIIAEDGRRAKNHLLEANLRLVVSLAKRYTGRGMLFLDLIQEGNLGLIRAVEK
FDYTKGYKFSTYATWWIRQAITRAMADQARTIRIPVHMVEVINKLARVQRQMLQDLGREPTPEELAKELDMTPEKVIEVQ
KYGREPISLHTPLGEDGDSEFGDLIEDSEAVVPADAVSFTLLQEQLHSVLDTLSEREAGVVSMRFGLTDGQPKTLDEIGK
VYGVTRERIRQIESKTMSKLRHPSRSQVLRDYLD
>P74565 ~~~sigA~~~RNA polymerase sigma factor SigA~~~COG0568
MTQTKEPLTKAESAELEQEIELSQYINTDDIDDDDIDVEDLEQEVAATEGKEKKVRKIRKDAVKKKPYTEDSIRIYLQEI
GRIRLLRAEEEIELARQIADLLELELIRDNLTLQLERQPSELEWGKQVWKLETAKQRLVGDKKKEPKKKDIDSYLANPDN
ELSLENEWSQQPNKNFAAFRRRLFLDRRAKDKMVQSNLRLVVSIAKKYMNRGLSFQDLIQEGSLGLIRAAEKFDHEKGYK
FSTYATWWIRQAITRAIADQSRTIRLPVHLYETISRIKKTTKLLSQEMRRKPTEEEIAEKMEMTIEKLRFIAKSAQLPIS
LETPIGKEEDSRLGDFIEADGETPEDEVSKNLLREDLENVLDTLSPRERDVLRLRYGLDDGRMKTLEEIGQIFNVTRERI
RQIEAKALRKLRHPNRNSILKEYIR
>Q9EZJ8 ~~~sigA~~~RNA polymerase sigma factor SigA~~~
MKKSKSKKKAAKAQEVEVKEPVKEPEPLPELEAAEDLQDLPEPDPELLASEPELEDLADPLDLEGPLEADLLPEEGLLEE
EEEELSLPKVSTSDPVRQYLHEIGQVPLLTLEEEIDLARKVEEGMEAIKKLSEATGLDQELIREVVRAKILGTARIQKIP
GLKEKPDPKTVEEVDGKLKSLPKELKRYLHIAREGEAARQHLIEANLRLVVSIAKKYTGRGLSFLDLIQEGNQGLIRAVE
KFEYKRRFKFSTYATWWIRQAINRAIADQARTIRIPVHMVETINKLSRTARQLQQELGREPSYEEIAEAMGPGWDAKRVE
ETLKIAQEPVSLETPIGDEKDSFYGDFIPDENLPSPVEAAAQSLLSEELEKALSKLSEREAMVLKLRKGLIDGREHTLEE
VGAYFGVTRERIRQIENKALRKLKYHESRTRKLRDFLE
>P77994 ~~~sigA~~~RNA polymerase sigma factor SigA~~~COG0568
MNEEQQVLQEQHQEQTQEQTQEQKETLPPQIERRIKKLISLGKKKGYITYEDIDKAFPPDFEGFDTNLIERIHEELEKHG
INIVENEPEEEEISASSDEQELEELLEKESPEIHDSSNVRDSIKMYLKEIGKIPLLTPAQERELARRAQMGDKKAKEKLI
TSNLRLVVSIAKRYMGRGLSFQDLIQEGNIGLLKAVEKFDWRKGYKFSTYATWWIRQAITRAIADQARTIRIPVHMVETI
NKLNRLRREYYQKHGEEPSIEELAKMMGKPPEKIKEILEAAKETISLESPIGEDEDSSIEDFVADDSIASPKKEAMRMLM
REELEKVLKTLSPREAMVLRMRYGLLDGKPKTLEEVGQYFNVTRERIRQIEVKALRKLRHPSRSKYLKSLLSLMDENEG
>Q72L95 ~~~sigA~~~RNA polymerase sigma factor SigA~~~COG0568
MKKSKRKNAQAQEAQETEVLVQEEAEELPEFPEGEPDPDLEDPDLALEDDLLDLPEEGEGLDLEEEEEDLPIPKISTSDP
VRQYLHEIGQVPLLTLEEEVELARKVEEGMEAIKKLSEITGLDPDLIREVVRAKILGSARVRHIPGLKETLDPKTVEEID
QKLKSLPKEHKRYLHIAREGEAARQHLIEANLRLVVSIAKKYTGRGLSFLDLIQEGNQGLIRAVEKFEYKRRFKFSTYAT
WWIRQAINRAIADQARTIRIPVHMVETINKLSRTARQLQQELGREPTYEEIAEAMGPGWDAKRVEETLKIAQEPVSLETP
IGDEKDSFYGDFIPDEHLPSPVDAATQSLLSEELEKALSKLSEREAMVLKLRKGLIDGREHTLEEVGAFFGVTRERIRQI
ENKALRKLKYHESRTRKLRDFLD
>P9WGI5 ~~~sigB~~~RNA polymerase sigma factor SigB~~~COG0568
MADAPTRATTSRVDSDLDAQSPAADLVRVYLNGIGKTALLNAAGEVELAKRIEAGLYAEHLLETRKRLGENRKRDLAAVV
RDGEAARRHLLEANLRLVVSLAKRYTGRGMPLLDLIQEGNLGLIRAMEKFDYTKGFKFSTYATWWIRQAITRGMADQSRT
IRLPVHLVEQVNKLARIKREMHQHLGREATDEELAAESGIPIDKINDLLEHSRDPVSLDMPVGSEEEAPLGDFIEDAEAM
SAENAVIAELLHTDIRSVLATLDEREHQVIRLRFGLDDGQPRTLDQIGKLFGLSRERVRQIERDVMSKLRHGERADRLRS
YAS
>A0R2D4 ~~~sigE~~~ECF RNA polymerase sigma factor SigE~~~COG1595
MEHDDRRASQVRAGNNRVVIRVEQIENNAVHEQEGSTTIASQPVTATHAAPVSMAHLEQFTDSDWVEPSDEPTGTAVFDA
TGDQAAMPSWDELVRQHADRVYRLAYRLSGNQHDAEDLTQETFIRVFRSVQNYQPGTFEGWLHRITTNLFLDMVRRRGRI
RMEALPEDYDRVPAEDPNPEQIYHDSRLGADLQAALDSLPPEFRAAVVLCDIEGLSYEEIGATLGVKLGTVRSRIHRGRQ
QLRDYLAKHSSETAQSA
>H8F0N6 ~~~sigE~~~ECF RNA polymerase sigma factor SigE~~~
MELLGGPRVGNTESQLCVADGDDLPTYCSANSEDLNITTITTLSPTSMSHPQQVRDDQWVEPSDQLQGTAVFDATGDKAT
MPSWDELVRQHADRVYRLAYRLSGNQHDAEDLTQETFIRVFRSVQNYQPGTFEGWLHRITTNLFLDMVRRRARIRMEALP
EDYDRVPADEPNPEQIYHDARLGPDLQAALASLPPEFRAAVVLCDIEGLSYEEIGATLGVKLGTVRSRIHRGRQALRDYL
AAHPEHGECAVHVNPVR
>P9WGG7 ~~~sigE~~~ECF RNA polymerase sigma factor SigE~~~COG1595
MELLGGPRVGNTESQLCVADGDDLPTYCSANSEDLNITTITTLSPTSMSHPQQVRDDQWVEPSDQLQGTAVFDATGDKAT
MPSWDELVRQHADRVYRLAYRLSGNQHDAEDLTQETFIRVFRSVQNYQPGTFEGWLHRITTNLFLDMVRRRARIRMEALP
EDYDRVPADEPNPEQIYHDARLGPDLQAALASLPPEFRAAVVLCDIEGLSYEEIGATLGVKLGTVRSRIHRGRQALRDYL
AAHPEHGECAVHVNPVR
>O30917 ~~~sigE~~~Chaperone protein SigE~~~
MESLLNRLYDALGLDAPEDEPLLIIDDGIQVYFNESDHTLEMCCPFMPLPDDILTLQHFLRLNYTSAVTIGADADNTALV
ALYRLPQTSTEEEALTGFELFISNVKQLKEHYA
>G8QM61 ~~~sigF~~~ECF RNA polymerase sigma factor SigF~~~COG1595
MQRINREESFRAKEERLKDLFVRGLSGNNAAYQTFLGELSSYLRAFLRKRLIRLPDEVEDLVQEALLAVHNQRHTYDPSQ
PLSAWVQAIARYKLVDLFRRRAIYEQRNDTLDDGMDLFSSADAEAAEARRDLNKLLADLPDHFRLPIMHTKLEGLSVREA
ADVSGMSESAIKVGVHRGLKALAAKIRGAL
>A0A0H3CCX2 ~~~sigF~~~ECF RNA polymerase sigma factor SigF~~~
MTDTETRLKALMLRGLDGDTAAYREGLALLGVRLRAYFMRRMSGAPGDVEDLVQETLLAVHLKRSTWDSAQSFTAWAHAV
ARYKLIDHWRRRKIRQTLPLEDHVDFLADDAPDPGVALELDRALASLPQRQRMLVSDVKLTGLSLAEAGARAGISEGAAK
VALHRALKALAERMRRADG
>P9WGI3 ~~~sigF~~~RNA polymerase sigma factor SigF~~~COG1191
MTARAAGGSASRANEYADVPEMFRELVGLPAGSPEFQRHRDKIVQRCLPLADHIARRFEGRGEPRDDLIQVARVGLVNAA
VRFDVKTGSDFVSFAVPTIMGEVRRHFRDNSWSVKVPRRLKELHLRLGTATADLSQRLGRAPSASELAAELGMDRAEVIE
GLLAGSSYHTLSIDSGGGSDDDARAITDTLGDVDAGLDQIENREVLRPLLEALPERERTVLVLRFFDSMTQTQIAERVGI
SQMHVSRLLAKSLARLRDQLE
>P9WGG5 ~~~sigG~~~ECF RNA polymerase sigma factor SigG~~~COG1595
MRTSPMPAKFRSVRVVVITGSVTAAPVRVSETLRRLIDVSVLAENSGREPADERRGDFSAHTEPYRRELLAHCYRMTGSL
HDAEDLVQETLLRAWKAYEGFAGKSSLRTWLHRIATNTCLTALEGRRRRPLPTGLGRPSADPSGELVERREVSWLEPLPD
VTDDPADPSTIVGNRESVRLAFVAALQHLSPRQRAVLLLRDVLQWKSAEVADAIGTSTVAVNSLLQRARSQLQTVRPSAA
DRLSAPDSPEAQDLLARYIAAFEAYDIDRLVELFTAEAIWEMPPYTGWYQGAQAIVTLIHQQCPAYSPGDMRLISLIANG
QPAAAMYMRAGDVHLPFQLHVLDMAADRVSHVVAFLDTTLFPKFGLPDSL
>P66808 ~~~sigH~~~ECF RNA polymerase sigma factor SigH~~~
MADIDGVTGSAGLQPGPSEETDEELTARFERDAIPLLDQLYGGALRMTRNPADAEDLLQETMVKAYAGFRSFRHGTNLKA
WLYRILTNTYINSYRKKQRQPAEYPTEQITDWQLASNAEHSSTGLRSAEVEALEALPDTEIKEALQALPEEFRMAVYYAD
VEGFPYKEIAEIMDTPIGTVMSRLHRGRRQLRGLLADVARDRGFARGEQAHEGVSS
>A0QTP2 ~~~sigH~~~ECF RNA polymerase sigma factor SigH~~~COG1595
MTDVDRVEPETPPEREETDAELTARFERDAIPLLDQLYGGALRMTRNPADAEDLLQETMVKAYAGFRSFREGTNLKAWLY
RILTNTYINSYRKKQRQPSEYPTDEITDWQLASNAEHSSTGLRSAEVEALEALPDTEIKAALQALPEEFRMAVYYADVEG
FPYKEIAEIMETPIGTVMSRLHRGRRQLRDLLAGVARDRGFIRGPQLGEPEEVTS
>P9WGH8 ~~~sigH~~~ECF RNA polymerase sigma factor SigH~~~
MADIDGVTGSAGLQPGPSEETDEELTARFERDAIPLLDQLYGGALRMTRNPADAEDLLQETMVKAYAGFRSFRHGTNLKA
WLYRILTNTYINSYRKKQRQPAEYPTEQITDWQLASNAEHSSTGLRSAEVEALEALPDTEIKEALQALPEEFRMAVYYAD
VEGFPYKEIAEIMDTPIGTVMSRLHRGRRQLRGLLADVARDRGFARGEQAHEGVSS
>P9WGH9 ~~~sigH~~~ECF RNA polymerase sigma factor SigH~~~COG1595
MADIDGVTGSAGLQPGPSEETDEELTARFERDAIPLLDQLYGGALRMTRNPADAEDLLQETMVKAYAGFRSFRHGTNLKA
WLYRILTNTYINSYRKKQRQPAEYPTEQITDWQLASNAEHSSTGLRSAEVEALEALPDTEIKEALQALPEEFRMAVYYAD
VEGFPYKEIAEIMDTPIGTVMSRLHRGRRQLRGLLADVARDRGFARGEQAHEGVSS
>A3DBH0 ~~~sigI1~~~RNA polymerase sigma factor SigI1~~~COG1191
MEVRKINTHNREKKLDDIVYDDFISTLRQIKEGNHQLREEFISEYKPFILKVTSNATGKYIDTRNSDEFSIALSAFNEAI
DKFDIEKGYNFFLFSEQVIRRRLIDYSRSNKDDKEYPFSFFDDEYFYNNEKLLSKSYIGFEDIEAREDIEELKKKLQEFG
ITFLDLVLNVPKHRDSRQLCIRLAKMLAEDEQMYNALMKNKNIPRNELKKKAKVHGRTIGNNRKYIIALCLIFRSNLNLS
KRYLEYYTMDGESDLI
>A3DC28 ~~~sigI2~~~RNA polymerase sigma factor SigI2~~~COG1191
MIDLFSPKGKKDTVSTTNKDKSFEDGIVNIINKIKAGDKLLREEFINSYTPYIIRTVSNLTGKYVDVENSDEFSVGLAAF
NEAIDSFDEGKNMFFFKFSTLVIKRRLTDYARHNKKHCHVYPFTYFEDKNNSYFEQIHLKSEIDLQNTYEISREIELYEQ
KLRDFGISLEHLAKCAPKHKDSKSLCIKIAKVIADNKELFSKLERTRNIPKTELLKLLKINKKTIERNRTFIIAAALIFG
NDFNLLKDFLDISEPGGDNIERSTK
>A3DC74 ~~~sigI3~~~RNA polymerase sigma factor SigI3~~~COG1191
MHGLFVNKKKNDTGSTALVLKKIQSGDTKLKEEFIKDNVPYIIRTISNILGIVVDDRNSEEFSIGLAAFNEAIDRYDADK
NGNFYTYSFVVIKSRLYDFIRRNRKHNNVLPFSYIEESTRVDERLLMSDASGQFEKIEVRQELVSFEKSLKEFGISLEDL
VLSSPKHKDSRLLLIKIARIIADDDNMFRKLVEKKYIPMKEVLSRIKVNHKTIQRNRKFIIAVSLILRSNLYDLKEYVQG
FEREGKYHG
>A3DCG2 ~~~sigI4~~~RNA polymerase sigma factor SigI4~~~COG1191
MLNVQLKIFCHAALIFLEVVYLFEPNLIYGVKKREKKSRSDSINHIIIKIKNGDIELKEKFIKKYKPYLLKIISSTLGRY
VDPEVSEEYSVGLMAFNEAIDGFNPEINGNFTNYCNMVVNHRIIDYIRKNKKYSNVIPFSYFEERNDFEEKYLVSDSHYL
YENIEVKEEILQFEQQLKQFGITLEDLVMNSPKHKDSRELCISIARILSENDKLFEKMIRKKCIPLSELMGLVNVHRKTV
ERNRKFIIAVSLILRSGLDEIKQFFRASEERREK
>A3DEX5 ~~~sigI5~~~RNA polymerase sigma factor SigI5~~~COG1191
MLFVSAIIDYAGETATHIVEKIKNGDKHLKEKFIKDYIPFILNIVSSFYSSKTGDLKSSDEYSIGLMAFNEAIEKFDVNK
SKSFLKFAEMVIKKRMIDYFRKTSSIGRKEIPFSYFNSNNETEFRKKINNLDIEEEFNSYEFICELRDFSKKLESFGLSI
NNLPDYMPKHKDSREMCINIAKKIVENKSIFDKLKTKKYIQMKELSKIIDVHPKTVERNRAFIICLCILFDNDYGNFKSY
LNKIF
>A3DH98 ~~~sigI6~~~RNA polymerase sigma factor SigI6~~~COG1191
MDWHFQGTNDDREHTKRIIIEYLNRIKAGDDSAREEFILRFRPFILKLVYKATDRHVEPENSEEYSVALLAFNEAINAYD
EEKHSNFLVFSEQVINRRLIDYKRKNHKNKMVYPFSYFENEDIKLERTLSDADGNNAIERLEFTDEIRLFKSELASFDIT
FKDLLSCTPKHRDSRELLINIAKKIASNDGLYEKLKKTKKLPTLELLKLAKVSRRTIERNKKYIIAVSLILRSNLEIFKE
YAAGIQEKEVDLR
>A3DIE4 ~~~sigI7~~~RNA polymerase sigma factor SigI7~~~COG1191
MYSVTINQRVEAIKNNEEEINLFVEEYKPFIAACTQKVVGRYVAYGQDDELSIALMAFVETIRSYDVSKGNFLSFSQNVI
KRRIIDYYRKEKKHSVVVNINGHLEDEEEETDLGIAMSIDKYSEEEISEYRRLELEQLKKELKEWDISFFDLVNISPKHK
RTKKIYSKIIKFVLSRPDIMEKIKQKKYLPVAEIEQSLKIPRKTIERARKYIITVVIIFTGDYEFIRDYVNWEVE
>P0DO98 ~~~sigI~~~RNA polymerase sigma factor SigI~~~COG1191
MLSLVMKILRKPKIEDIVCNIQNNEEDKEAFIVQYQPFIRKSISSVCRRYITEQDDEYSIGLFAFNEAIEQYSYKKGKSF
LAFADLLIKRDVIDYIRKESKHNLVFLKEDEQEEMLEMQVSLTEYMKEIENGNRKEEILHFQSVLADFKITFSELAKESP
KHRDTREHLIEIVKVIIKEEEMMEELFRKKKLPLKHIEPRVRVSRKTLERHRKYIIAMCIIFANNYTYILDYIRGGKHDE
>O31654 ~~~sigI~~~RNA polymerase sigma factor SigI~~~COG1191
MKPVLSLLFKLGKKKQTLEKAVESIQKGNKDLQNELIQQYKPFIAKTVSSVCKRYIDEKDDEFSIGLIAFNEAIEKYSPE
KGNSLLAFAELIIKRKVIDYIRKEARSAQNINIDLQEGDDQESSQSLIEAELSIDEYRRQIEQEQRREEILYFQKQLKDY
GLSFKELLENSPKHTDARQNAIKVAMTLVEHEELAAILYTKKQLPVKQLEQLVSVSRKTIERNRKYIIAMCIIITGDYIY
LKDYLKGVLHS
>P9WGH3 ~~~sigI~~~Probable ECF RNA polymerase sigma factor SigI~~~COG1595
MSQHDPVSAAWRAHRAYLVDLAFRMVGDIGVAEDMVQEAFSRLLRAPVGDIDDERGWLIVVTSRLCLDHIKSASTRRERP
QDIAAWHDGDASVSSVDPADRVTLDDEVRLALLIMLERLGPAERVVFVLHEIFGLPYQQIATTIGSQASTCRQLAHRARR
KINESRIAASVEPAQHRVVTRAFIEACSNGDLDTLLEVLDPGVAGEIDARKGVVVVGADRVGPTILRHWSHPATVLVAQP
VCGQPAVLAFVNRALAGVLALSIEAGKITKIHVLVQPSTLDPLRAELGGG
>L0TCG5 ~~~sigJ~~~ECF RNA polymerase sigma factor SigJ~~~COG1595
MEVSEFEALRQHLMSVAYRLTGTVADAEDIVQEAWLRWDSPDTVIADPRAWLTTVVSRLGLDKLRSAAHRRETYTGTWLP
EPVVTGLDATDPLAAVVAAEDARFAAMVVLERLRPDQRVAFVLHDGFAVPFAEVAEVLGTSEAAARQLASRARKAVTAQP
ALISGDPDPAHNEVVGRLMAAMAAGDLDTVVSLLHPDVTFTGDSNGKAPTAVRAVRGSDKVVRFILGLVQRYGPGLFGAN
QLALVNGELGAYTAGLPGVDGYRAMAPRITAITVRDGKVCALWDIANPDKFTGSPLKERRAQPTGRGRHHRN
>A1KFR7 ~~~sigK~~~ECF RNA polymerase sigma factor SigK~~~
MTGPPRLSSDLDALLRRVAGHDQAAFAEFYDHTKSRVYGLVMRVLRDTGYSEETTQEIYLEVWRNASEFDSAKGSALAWL
LTMAHRRAVDRVRCEQAGNQREVRYGAANVDPASDVVADLAIAGDERRRVTECLKALTDTQRQCIELAYYGGLTYVEVSR
RLAANLSTIKSRMRDALRSLRNCLDVS
>P9WGH7 ~~~sigK~~~ECF RNA polymerase sigma factor SigK~~~COG1595
MTGPPRLSSDLDALLRRVAGHDQAAFAEFYDHTKSRVYGLVMRVLRDTGYSEETTQEIYLEVWRNASEFDSAKGSALAWL
LTMAHRRAVDRVRCEQAGNQREVRYGAANVDPASDVVADLAIAGDERRRVTECLKALTDTQRQCIELAYYGGLTYVEVSR
RLAANLSTIKSRMRDALRSLRNCLDVS
>P9WGH5 ~~~sigL~~~ECF RNA polymerase sigma factor SigL~~~COG1595
MARVSGAAAAEAALMRALYDEHAAVLWRYALRLTGDAAQAEDVVQETLLRAWQHPEVIGDTARPARAWLFTVARNMIIDE
RRSARFRNVVGSTDQSGTPEQSTPDEVNAALDRLLIADALAQLSAEHRAVIQRSYYRGWSTAQIATDLGIAEGTVKSRLH
YAVRALRLTLQELGVTR
>O07582 ~~~sigM~~~ECF RNA polymerase sigma factor SigM~~~COG1595
MTIDEIYQMYMNDVYRFLLSMTKDKHLAEDLLQETFMRAYIHIHSYDHSKVKPWLFQVARNAFIDYVRKHKKEVTISDDL
IGSLFQNAVQSPAHQVEIKEVLTGYMSELPDNYREALTLYYLKELNYKEASHIMNISEANFKSVLFRARQRLKALYNRGV
NDE
>O53590 ~~~sigM~~~ECF RNA polymerase sigma factor SigM~~~COG1595
MPPPIGYCPAVGFGGRHERSDAELLAAHVAGDRYAFDQLFRRHHRQLHRLARLTSRTSEDADDALQDAMLSAHRGAGSFR
YDAAVSSWLHRIVVNACLDRLRRAKAHPTAPLEDVYPVADRTAQVETAIAVQRALMRLPVEQRAAVVAVDMQGYSIADTA
RMLGVAEGTVKSRCARARARLARLLGYLNTGVNIRR
>O34843 ~~~sigO~~~RNA polymerase sigma factor SigO~~~COG1191
MKHPIVKHFLSNPQHYRLFKNVMESPNEKDARSLDELFKQFYKEIRIVKYMNSMIRIFSIDFDKRVRKNQKRYPLTVDHP
EAGDRLSSETGSDAFEEFLDRQDDLSQHVQDYQLYQAIQKLTDKQKSVLTKVYLHGATMQEIADSLGESRQNISNIHKKG
LENIRKQLAAQKKGEK
>Q7AKG9 ~~~sigR~~~ECF RNA polymerase sigma factor SigR~~~COG1595
MGPVTGTDAGTEHGQAEQPEGRGTGAESTAERSARFERDALEFLDQMYSAALRMTRNPADAEDLVQETYAKAYASFHQFR
EGTNLKAWLYRILTNTFINSYRKKQREPQRSAAEEIEDWQLARAESHMSTGLRSAESQALDHLPDSDVKQALQAIPEEFR
IAVYLADVEGFAYKEIADIMGTPIGTVMSRLHRGRRQLRGMLEDYARDRGLVPAGAGESNEAKGSGS
>Q2FXF2 ~~~sigS~~~RNA polymerase sigma factor SigS~~~COG1595
MKFNDVYNKHQKIIHYLLKKYNISYNYDEYYQLLLIKMWQLSQIYKPSSKQSLSSFLFTRLNFYLIDLFRQQNQLKDVIL
CENNSPTLTEQPTYFNEHDLRLQDIFKLLNQRERLWLKLYLEGYKQFEIAEIMSLSLSTIKLIKMSVKRKCQHNFN
>O05404 ~~~sigV~~~RNA polymerase sigma factor SigV~~~COG1595
MKKKQTTKALLVTCITDHKQDFYRLAFSYVKNQDDALDIVQESIKKALSSVETVRNPETIKSWFYKILVRTAIDFLRKQK
KIRVMDDETIEFLSKGKEDHYKDTDLHEALDELPYRYKTIIILRFFEDLKLEEIAEITGENTNTVKTRLYRALKLMRIQL
TKEDLS
>Q45585 ~~~sigW~~~ECF RNA polymerase sigma factor SigW~~~COG1595
MEMMIKKRIKQVKKGDQDAFADIVDIYKDKIYQLCYRMLGNVHEAEDIAQEAFIRAYVNIDSFDINRKFSTWLYRIATNL
TIDRIRKKKPDYYLDAEVAGTEGLTMYSQIVADGVLPEDAVVSLELSNTIQQKILKLPDKYRTVIVLKYIDELSLIEIGE
ILNIPVGTVKTRIHRGREALRKQLRDL
>P35165 ~~~sigX~~~ECF RNA polymerase sigma factor SigX~~~COG1595
MEETFQLLYDTYHQDLYQFLFYMVKDKNQTEDLLQEVYIRVLNSYHTFEGRSSEKTWLLSIARHVAIDWFRKQQTIRQRI
LGTFDWDTQDVRDQQLLPDELAVQHENVREIYAALDQCTIDQRAVIILRFIQGYSIQETAKALRFSESKVKTTQHRGLKV
LRKHMELLREELMDDEVRMERRTDKGVVKSTSGS
>P94370 ~~~sigY~~~RNA polymerase sigma factor SigY~~~COG1595
MDTQEEQRLIQQAKEGNDEAFTALFHYHYSFLYKYLLKLSLHPDLSEELVQETFLKGYIHLRSFQGRSKFSTWLISIASR
LYLDHQKKRKREWKRNQTVTEETIRKIKWDVSAKGAEWSETLDLFSKLDPKLRTPVLLRHYYGYTYAEIGVMLQIKEGTV
KSRVHKGLQQIRKEWDDE
>Q9Z4N3 ~~~silE~~~Silver-binding protein SilE~~~
MKNIVLASLLGFGLISSAWATETVNIHERVNNAQAPAHQMQSAAAPVGIQGTAPRMAGMDQHEQAIIAHETMTNGSADAH
QKMVESHQRMMGSQTVSPTGPSKSLAAMNEHERAAVAHEFMNNGQSGPHQAMAEAHRRMLSAG
>Q9ZHC7 7.2.2.15~~~silP~~~Silver exporting P-type ATPase~~~
MLQICIRRVTVKNDNAVEHNNQDCFLSRTSSRDESHALHKVRESVCGMVILPDKAHSSIRYQDHQLYFCSASCESKFKAH
PDHYFTEDASEHHHHHDHHEVSPDKIKQSHRQAEKEISEGVWTCPMHPEIRRSGPGSCPVCGMALEPLVATASTGTSDEL
RDMTRRFWLGLLLAFPVLILEMGSHLFPALRNTVPPQYNTWLQLLLASPVVLWCGWPFFARAGMSLRNRSLNMFTLVAMG
TGVAWVYSVIATVFPSWFPASFRNMDGLVAIYFEAAAVITVLVLLGQVLELRAREQTSGAITALLNLAPKTARRLDQDGH
ETDINAEDVLPGDKLRIRPGESIPVDGIVVEGKTTVDESMVTGESMPVTKTEGEPVIGGTINQTGSLIIRAEKVGDETML
SRIVQMVADAQRSRAPIQRMADSVSGWFVPLVILIAVVAFMIWSVWGPEPRMAHGLIAAVSVLIIACPCALGLATPMSIM
VGVGKGAQAGVLIKNAEALERLEKVDTLVVDKTGTLTEGSPTVTGIISLNPGGETSLLRVTAAVDKGSQHPLGMAVVKAA
QEKGIAIPAVTHFNAPSGKGVSGDVEGQRVVIGNELAMQENSIVIDNQKAVADTLRMEGTTVIYVATDGHLAGLIAISDP
VKATTPDALKALRQAGIRIVMLTGDNQLTAEAVARKLGIDEVEAGILPDGKKAVITRLKASGHVVAMAGDGVNDAPALAA
ADVGIAMGTGTDVAIESAGVTLLKGDLMILNRARHLSEITMKNIRQNLFFAFIYNALGVPVAAGLLYPVYGILLSPVIAA
AAMALSSVSVIVNALRLKSVRLGK
>P23308 ~~~sinI~~~Protein SinI~~~
MKNAKQEHFELDQEWVELMVEAKEANISPEEIRKYLLLNKKSAHPGPAARSHTVNPF
>P06533 ~~~sinR~~~HTH-type transcriptional regulator SinR~~~COG1396
MIGQRIKQYRKEKGYSLSELAEKAGVAKSYLSSIERNLQTNPSIQFLEKVSAVLDVSVHTLLDEKHETEYDGQLDSEWEK
LVRDAMTSGVSKKQFREFLDYQKWRKSQKEE
>E1WAC6 ~~~sipA~~~Cell invasion protein SipA~~~
MVTSVRTQPPVIMPGMQTEIKTQATNLAANLSAVRESATATLSGEIKGPQLEDFPALIKQASLDALFKCGKDAEALKEVF
TNSNNVAGKKAIMEFAGLFRSALNATSDSPEAKTLLMKVGAEYTAQIIKDGLKEKSAFGPWLPETKKAEAKLENLEKQLL
DIIKNNTGGELSKLSTNLVMQEVMPYIASCIEHNFGCTLDPLTRSNLTHLVDKAAAKAVEALDMCHQKLTQEQGTSVGRE
ARHLEMQTLIPLLLRNVFAQIPADKLPDPKIPEPAAGPVPDGGKKAEPTGINININIDSSNHSVDNSKHINNSRSHVDNS
QRHIDNSNHDNSRKTIDNSRTFIDNSQRNGESHHSTNSSNVSHSHSRVDSTTHQTETAHSASTGAIDHGIAGKIDVTAHA
TAEAVTNASSESKDGKVVTSEKGTTGETTSFDEVDGVTSKSIIGKPVQATVHGVDDNKQQSQTAEIVNVKPLASQLAGVE
NVKTDTLQSDTTVITGNKAGTTDNDNSQTDKTGPFSGLKFKQNSFLSTVPSVTNMHSMHFDARETFLGVIRKALEPDTST
PFPVRRAFDGLRAEILPNDTIKSAALKAQCSDIDKHPELKAKMETLKEVITHHPQKEKLAEIALQFAREAGLTRLKGETD
YVLSNVLDGLIGDGSWRAGPAYESYLNKPGVDRVITTVDGLHMQR
>P0CL52 ~~~sipA~~~Cell invasion protein SipA~~~
MVTSVRTQPPVIMPGMQTEIKTQATNLAANLSAVRESATATLSGEIKGPQLEDFPALIKQASLDALFKCGKDAEALKEVF
TNSNNVAGKKAIMEFAGLFRSALNATSDSPEAKTLLMKVGAEYTAQIIKDGLKEKSAFGPWLPETKKAEAKLENLEKQLL
DIIKNNTGGELSKLSTNLVMQEVMPYIASCIEHNFGCTLDPLTRSNLTHLVDKAAAKAVEALDMCHQKLTQEQGTSVGRE
ARHLEMQTLIPLLLRNVFAQIPADKLPDPKIPEPAAGPVPDGGKKAEPTGINININIDSSNHSVDNSKHINNSRSHVDNS
QRHIDNSNHDNSRKTIDNSRTFIDNSQRNGESHHSTNSSNVSHSHSRVDSTTHQTETAHSASTGAIDHGIAGKIDVTAHA
TAEAVTNASSESKDGKVVTSEKGTTGETTSFDEVDGVTSKSIIGKPVQATVHGVDDNKQQSQTAEIVNVKPLASQLAGVE
NVKTDTLQSDTTVITGNKAGTTDNDNSQTDKTGPFSGLKFKQNSFLSTVPSVTNMHSMHFDARETFLGVIRKALEPDTST
PFPVRRAFDGLRAEILPNDTIKSAALKAQCSDIDKHPELKAKMETLKEVITHHPQKEKLAEIALQFAREAGLTRLKGETD
YVLSNVLDGLIGDGSWRAGPAYESYLNKPGVDRVITTVDGLHMQR
>Q56026 ~~~sipD~~~Cell invasion protein SipD~~~
MLNIQNYSASPHPGIVAERPQTPSASEHVETAVVPSTTEHRGTDIISLSQAATKIHQAQQTLQSTPPISEENNDERTLAR
QQLTSSLNALAKSGVSLSAEQNENLRSAFSAPTSALFSASPMAQPRTTISDAEIWDMVSQNISAIGDSYLGVYENVVAVY
TDFYQAFSDILSKMGGWLLPGKDGNTVKLDVTSLKNDLNSLVNKYNQINSNTVLFPAQSGSGVKVATEAEARQWLSELNL
PNSCLKSYGSGYVVTVDLTPLQKMVQDIDGLGAPGKDSKLEMDNAKYQAWQSGFKAQEENMKTTLQTLTQKYSNANSLYD
NLVKVLSSTISSSLETAKSFLQG
>P45707 ~~~sirA~~~Sporulation inhibitor of replication protein SirA~~~
MERHYYTYLIKEEFANHYFGRESVMFELFQDYHWTSLEKQQYEMTEKQIQYITQPIPILHMHQRLKMNLNKTDYRQLDYI
YRIALPKAKGHATFMMKEHMIEIVASGDYEAETIFFEVLRKVSPCFLAMDFNSKRYGWLNPVKERNFV
>Q46755 ~~~ychQ~~~Protein YchQ~~~COG3094
MTSFSTLLSVHLISIALSVGLLTLRFWLRYQKHPQAFARWTRIVPPVVDTLLLLSGIALMAKAHILPFSGQAQWLTEKLF
GVIIYIVLGFIALDYRRMHSQQARIIAFPLALVVLYIIIKLATTKVPLLG
>O34632 4.99.1.4~~~sirB~~~Sirohydrochlorin ferrochelatase~~~COG2138
MKQAILYVGHGSRVKKAQQEAAAFLEGCKAHISVPVQEISFLELQEPTIETGFEACVKQGATHIAVVPLLLLTAAHAKHD
IPEEIVRVASRYPSVRISYGKPIGIDEEVVKAVYHRMKDIGVPYENARVVLIGRGSSDPDVKRDVTGIANLLQEMVPVKE
VIPCFLTACGPNYKEVFSELEKDDGITTFIVPYLLFTGMLMNEIEREVQKLKAHNPNVYLSSYIGFHPHVKNAFLNRVRE
TAANSEGQFDFDGGSYASAAH
>O34813 1.3.1.76~~~sirC~~~Precorrin-2 dehydrogenase~~~COG1648
MLPLHISLEKKKVVIAGGGSIALRRLKTVISEGADITLVSPDVEPEIKQMAEERRIKWEKRTIEKEDYLNAFFIIAATDN
AAVNKEIAQSASPFQLVNCVSDAELGNVYMPKIVKRGHVTVSVSTSGASPKHTKELAENVDKLIDGDFVAEVNRLYQMRR
KK
>P61818 1.3.1.76~~~sirC~~~Precorrin-2 dehydrogenase~~~
MYTVMLDLKGRSVLVVGGGTIATRRIKGFLQEGAAITVVAPTVSAEINEWEAKGQLRVKRKKVGEEDLLNVFFIVVATND
QAVNKFVKQHIKNDQLVNMASSFSDGNIQIPAQFSRGRLSLAISTDGASPLLTKRIKEDLSSNYDESYTQYTQFLYECRV
LIHRLNVSKSRKHELLTEIIDDQYRLSLVKQREFLQQIEKYK
>A0A0H3JQ59 2.4.2.-~~~~~~Protein ADP-ribosyltransferase~~~
MMQSSKWNAMSLLMDEKTKQAEVLRTAIDEADAIVIGIGAGMSASDGFTYVGERFTENFPDFIEKYRFFDMLQASLHPYG
SWQEYWAFESRFITLNYLDQPVGQSYLALKSLVEGKQYHIITTNADNAFDAAEYDMTHVFHIQGEYILQQCSQHCHAQTY
RNDDLIRKMVVAQQDMLIPWEMIPRCPKCDAPMEVNKRKAEVGMVEDAEFHAQLQRYNAFLEQHQDDKVLYLEIGIGYTT
PQFVKHPFQRMTRKNENAIYMTMNKKAYRIPNSIQERTIHLTEDISTLITTALRNDSTTQNNNIGETEDVLNRTD
>P0DN71 2.4.2.-~~~~~~Protein ADP-ribosyltransferase~~~
MSNWTTYPQKNLTQAEQLAQLIKEADALVVGIGAGMSAADGFTYIGPRFETAFPDFIAKYQFLDMLQASLFDFEDWQEYW
AFQSRFVALNYLDQPVGQSYLDLKEILETKDYHIITTNADNAFWVAGYDPHNIFHIQGEYGLWQCSQHCHQQTYKDDTVI
RQMIAEQKNMKVPGQLIPHCPECEAPFEINKRNEEKGMVEDADFHAQKARYEAFLSEHKEGKVLYLEIGVGHTTPQFIKH
PFWKRVSENPNALFVTLNHKHYRIPLSIRRQSLELTEHIAQLISATKTIYQKS
>P9WJ03 1.8.7.1~~~sir~~~Sulfite reductase [ferredoxin]~~~COG0155
MTTARPAKARNEGQWALGHREPLNANEELKKAGNPLDVRERIENIYAKQGFDSIDKTDLRGRFRWWGLYTQREQGYDGTW
TGDDNIDKLEAKYFMMRVRCDGGALSAAALRTLGQISTEFARDTADISDRQNVQYHWIEVENVPEIWRRLDDVGLQTTEA
CGDCPRVVLGSPLAGESLDEVLDPTWAIEEIVRRYIGKPDFADLPRKYKTAISGLQDVAHEINDVAFIGVNHPEHGPGLD
LWVGGGLSTNPMLAQRVGAWVPLGEVPEVWAAVTSVFRDYGYRRLRAKARLKFLIKDWGIAKFREVLETEYLKRPLIDGP
APEPVKHPIDHVGVQRLKNGLNAVGVAPIAGRVSGTILTAVADLMARAGSDRIRFTPYQKLVILDIPDALLDDLIAGLDA
LGLQSRPSHWRRNLMACSGIEFCKLSFAETRVRAQHLVPELERRLEDINSQLDVPITVNINGCPNSCARIQIADIGFKGQ
MIDDGHGGSVEGFQVHLGGHLGLDAGFGRKLRQHKVTSDELGDYIDRVVRNFVKHRSEGERFAQWVIRAEEDDLR
>P72854 1.8.7.1~~~sir~~~Sulfite reductase [ferredoxin]~~~COG0155
MVTTPTAAPRKPSKVEGIKERSNYLREPLATELLNDANYFTDDAVQILKFHGSYQQDNRDNRVKGQEKDYQFMLRTRNPG
GLIPAQLYTALDDLSKTHGNQTLRVTTRQGLQIHGIVKKDLKMAIATVVNNLGSTLGACGDINRNVMAPAAPFRDKPEYG
YAWDYANKVADLLSPQSGAYYEIWLDGEKVISGEEAPEVKAARQKDLNGTNLNDPKEPIYGQQFMPRKFKISVTVPGDNS
IDVYTHDISLVVITDRHGELRGFNVLAGGGLGRTHNKEETFARAADPIGYVSKDDVYDLVKAIVATQRDYGDRHNRRHAR
MKYLLADWGVEKFRKQVETYMGKPFQSFKPLPAWRYQDYLGWHEQGDGKLFFGLSVENGRIKDEGDFQLKTALRKVVDQF
QLPLRLTANHNILLYDINAQDKAAIEQIFQQHGVVTDPEAIDTLVRYSMACPALPTCGLAVTESERIMPSVNARLRDLLN
SLDLPNESIVTRMTGCPNGCARPYMAEIGFVGSAPNSYQVWLGGSPNQERLAAAYTEKMPLEQLESLFEPLFVYFKQSRK
GKESFGDFCHRVGFTALREFSHGYTAPAKGGKNRKNQRRVSLSDEMYAQLKARSERDNCPMNQIVQQALTAYLGK
>P76502 3.1.3.-~~~sixA~~~Phosphohistidine phosphatase SixA~~~COG2062
MQVFIMRHGDAALDAASDSVRPLTTNGCDESRLMANWLKGQKVEIERVLVSPFLRAEQTLEEVGDCLNLPSSAEVLPELT
PCGDVGLVSAYLQALTNEGVASVLVISHLPLVGYLVAELCPGETPPMFTTSAIASVTLDESGNGTFNWQMSPCNLKMAKA
I
>O31422 ~~~skfA~~~Sporulation killing factor~~~
MKRNQKEWESVSKKGLMKPGGTSIVKAAGCMGCWASKSIAMTRVCALPHPAMRAI
>O31423 1.21.98.-~~~skfB~~~Sporulation killing factor maturation protein SkfB~~~COG0535
MSYDRVKDFDLPELAVHLQPHGAVMIDRKSMFYFRLSGRGAQLAFLLSKNKNLHKTARIWEIMKKEEMSADQLKEELSAH
PFTEAWTEGLLDQPLHVSGSLDSYLPISCTLQLTNACNLSCSFCYASSGKPYPEELSSEQWILVMQKLAAHGVADITLTG
GEAKLIKGFKELVVVASSLFTNVNVFSNGLNWRDEEVELLSHLGNVSVQISIDGMDNTHDQLRGRKGGFKESMNTIKKLS
EANIPVIVAMTINESNADEVSDVVEQCANAGAFIFRAGKTLSVGRATEGFKALDIDFEEMVQIQLREARHKWGDRLNIID
WEHEESSFTTDFCTPGYLAWYIRADGYVTPCQLEDLPLGHILEDSMADIGSPARLLQLKCEAKNCKCIGKIELSEPDLPF
QKEVKAGIQE
>O31425 ~~~skfC~~~Sporulation-killing factor biosynthesis protein SkfC~~~COG1266
MNSLSLVFWSILAVVGLLLFIKFKPPTIASLLLSKDEAKEISIQFIKEFVGIDVENWDFYSVYWYDHDTVNKLHHLGILK
KNRKVLYDVGLVESWRVRFVHQNQSFVVGVNANREITFFYADVPKKTLSGKFEQVSPETLKQRLMASPDGLWSRANMTGT
GKKEEDFREVSTYWYIAEAGDIRLKVTVELQGGRISYIGTEQEILTDQMSKVIRDEQVESTFGVSGMLGSALAMILAILI
LVFMDVQTSIIFSLVLGLLIIICQSLTLKEDIQLTIVNAYDARMSVKTVSLLGILSTLLTGLLTGFVVFICSLAGNALAG
DFGWKTFEQPIVQIFYGIGAGLISLGVTSLLFNLLEKKQYLRISPELSNRTVFLSGFTFRQGLNMSIQSSIGEEVIYRLL
MIPVIWWMSGNILISIIVSSFLWAVMHQVTGYDPRWIRWLHLFIFGCFLGVLFIKFGFICVLVAHFIHNLVLVCMPLWQF
KLQKHMHHDQPKHTSL
>O31427 7.3.2.3~~~skfE~~~SkfA peptide export ATP-binding protein SkfE~~~COG1131
MQLMQVQNLSKCYRNGDGVEHLSFSIQRGEIVALLGPNGAGKTTTIRCLTGLYKPDKGDILIEGSPPGDINVQKKVALIP
DQPYLYPALTAAEHIQFRARGYHPGKKDVKERVYHALKEVHLEEKANQLCGQLSRGQKQRVVLAGAIVQDALLYILDEPT
VGLDIPSKQWLSNWLKTKTDQGCSAFVSTHSLEFVIETADRVILIRDGKLMQDLYVPQFEEQAEWRKEVIRLLGEWSDE
>O31428 ~~~skfF~~~Putative bacteriocin-SkfA transport system permease protein SkfF~~~
MPFLIMLLFVGAIGFQVSFVSRSTTWDMSIAGWVLTGVFILYTAFGLFSNRLPSQMADIIWLYGTATSFSKVVYSVLFFS
VTWKALLWIISAIFGDVLIVLLSGDHINLLGRSIIFVGLFFIAEVWLMSVSCARTVKKMKRVYVLVFLLMLGIYSICLYR
FFFLQHSSGIWESIARFISGVGLVFDTLSPLYVVVFIGIITVSFMTIAFTSRQVEMKESLVKEAEFWEEFQERQFGSGQI
IQKPKTTWWGLQGLNGIWSFLWLELLLFKKYLFFHSIHTVMLSGVFYVVIFMYPEWFYLLFFLIVSAVMLSSYYSGIVRH
SQSGTLHLFPGALWKKIIILELTNTVWLYILYCVSITFMAVGNLVYWYIYGLGIYIWFMTIRLFAFTHTNRNDIKLSLPQ
YYKSFFMALGLSGICLYVIHLLTADWYTLVVVVCIGSLSWCLFYRFR
>O31429 ~~~skfG~~~Uncharacterized protein SkfG~~~COG1413
MNSNGDKLSLSVQNLANTNEITIVQAIGELKKSGKDAIPVLVEALKEEGSLCNIAAAVLGEFGEDASEAAEELSCLLKSH
AEDTRMAAAISLMRIGKPSLPFVIKIAQESEGQSCFWASWCIAWIDPSCIEPKMYKCLKYEHEHPSGIVAPFAAEEALGK
LIAFQLKDKED
>O31430 ~~~skfH~~~Thioredoxin-like protein SkfH~~~COG0526
MKDEQMLTEWPSHLPWLNQSQNDFTFPSDTYLLLYFWSMSCPNCHQLTDKVLQDIKDMNVKVIGVHVPYIEEEKSMEVVL
TYALDRGLAIPIVLDQNYEIVTTCHVQGIPSFCLLSQYGQIITKTMGDVGWDKMLKKIAGL
>P0CAV4 ~~~skgA~~~HTH-type transcriptional regulator SkgA~~~COG0789
MSVYTVKQMARLSGVSVRALHHYDAIGLLKPRAVGANGYRYYDRQDLLRLQQILFHRALETPLKDIQAALDQPGFDLAAA
LRAQRERLAAQAERYARLVDVVDRTLADLEGDETMDDKHLFEGFDPEKQARHEAWLVEHYGDEATRRIADAKAGMKSWGK
KDWSQFQEEAKAIEHDLAKALTQGLPVDSAPVTAIMRRHWAWVGRSWNREPTPDAFAGLGHLYQANPEFTARYEAIAPGL
TEYFSEAMRAFARGR
>P0AEU7 ~~~skp~~~Chaperone protein Skp~~~COG2825
MKKWLLAAGLGLALATSAQAADKIAIVNMGSLFQQVAQKTGVSNTLENEFKGRASELQRMETDLQAKMKKLQSMKAGSDR
TKLEKDVMAQRQTFAQKAQAFEQDRARRSNEERGKLVTRIQTAVKSVANSQDIDLVVDANAVAYNSSDVKDITADVLKQV
K
>P0A1Z2 ~~~skp~~~Chaperone protein Skp~~~
MKKWLLAAGLGLAMVTSAQAADKIAIVNMGNLFQQVAQKTGVSNTLENEFKGRAAELQKMETDLQSKMQRLQSMKAGSDR
TKLEKDVMSQRQTFAQKAQAFEKDRARRSNEERNKLVTRIQTAVKKVANDQSIDLVVDANTVAYNSSDVKDITADVLKQV
K
>P31519 ~~~skp~~~Chaperone protein Skp~~~COG2825
MKKWLCAASLGLALAASASVQAAKIAIVNVSRIFQQLPESETVAKQLENEFKGRATELQGMESDLQTKMQKLQRDGSTMK
ASDRTKLENDVMKQRETFSTKAQAFEQDNRRRQMEERNKILSRIQDAVKSVASKGGYDVVIDANAVAYADPSKDITADVL
KQVK
>P0DOV9 1.2.1.97~~~~~~3-sulfolactaldehyde dehydrogenase~~~
MLELKDPSLLKQQAFIDGLWVSADSGETFAVTNPATGDELARIPQMGAAEAERAVLAAHRAFKPWKRKTAKERAELLQRW
YALMLENQEDLARLLTAEQGKPLAEAHGELGNGMSFVQWFAEEAKRVYGDTIPQPSADKRLIVTKEPIGVTAAITPWNFP
HAMITRKVAPALAAGCSMVLRPASQTPLSALALVALAERAGIPAGVFSVVTGSATQIGSVLTGHPLVRKFSFTGSTPVGK
LLIGQCAETVKKVSMELGGNAPFIVFDDADLDLAVEGAMLSKFRNAGQTCVCANRIYVQDGIYERFAEKLAAAASGLRLG
NGVEAGVTQGPMIDENAVRKVEEHISDALEKGARLIAGGQRHALGGSFFEPTVLTEVTAQMKVAHEETFGPLAPLFRFSS
EDEVVELANATEFGLASYFYSRDIGRVLRVSEDLEYGMVGVNTAAIANEMAPFGGVKQSGLGREGSRYGIEDYLEIKYVC
LGGVDR
>Q9K165 ~~~~~~Surface lipoprotein assembly modifier 1~~~
MVIFYFCGKTFMPARNRWMLLLPLLASAAYAEETPREPDLRSRPEFRLHEAEVKPIDREKVPGQVREKGKVLQIDGETLL
KNPELLSRAMYSAVVSNNIAGIRVILPIYLQQAQQDKMLALYAQGILAQADGRVKEAISHYRELIAAQPDAPAVRMRLAA
ALFENRQNEAAADQFDRLKAENLPPQLMEQVELYRKALRERDAWKVNGGFSVTREHNINQAPKRQQYGKWTFPKQVDGTA
VNYRLGAEKKWSLKNGWYTTAGGDVSGRVYPGNKKFNDMTAGVSGGIGFADRRKDAGLAVFHERRTYGNDAYSYTNGARL
YFNRWQTPKWQTLSSAEWGRLKNTRRARSDNTHLQISNSLVFYRNARQYWMGGLDFYRERNPADRGDNFNRYGLRFAWGQ
EWGGSGLSSLLRLGAAKRHYEKPGFFSGFKGERRRDKELNTSLSLWHRALHFKGITPRLTLSHRETRSNDVFNEYEKNRA
FVEFNKTF
>P49051 ~~~sap~~~S-layer protein sap~~~COG0860
MAKTNSYKKVIAGTMTAAMVAGVVSPVAAAGKTFPDVPADHWGIDSINYLVEKGAVKGNDKGMFEPGKELTRAEAATMMA
QILNLPIDKDAKPSFADSQGQWYTPFIAAVEKAGVIKGTGNGFEPNGKIDRVSMASLLVEAYKLDTKVNGTPATKFKDLE
TLNWGKEKANILVELGISVGTGDQWEPKKTVTKAEAAQFIAKTDKQFGTEAAKVESAKAVTTQKVEVKFSKAVEKLTKED
IKVTNKANNDKVLVKEVTLSEDKKSATVELYSNLAAKQTYTVDVNKVGKTEVAVGSLEAKTIEMADQTVVADEPTALQFT
VKDENGTEVVSPEGIEFVTPAAEKINAKGEITLAKGTSTTVKAVYKKDGKVVAESKEVKVSAEGAAVASISNWTVAEQNK
ADFTSKDFKQNNKVYEGDNAYVQVELKDQFNAVTTGKVEYESLNTEVAVVDKATGKVTVLSAGKAPVKVTVKDSKGKELV
SKTVEIEAFAQKAMKEIKLEKTNVALSTKDVTDLKVKAPVLDQYGKEFTAPVTVKVLDKDGKELKEQKLEAKYVNKELVL
NAAGQEAGNYTVVLTAKSGEKEAKATLALELKAPGAFSKFEVRGLEKELDKYVTEENQKNAMTVSVLPVDANGLVLKGAE
AAELKVTTTNKEGKEVDATDAQVTVQNNSVITVGQGAKAGETYKVTVVLDGKLITTHSFKVVDTAPTAKGLAVEFTSTSL
KEVAPNADLKAALLNILSVDGVPATTAKATVSNVEFVSADTNVVAENGTVGAKGATSIYVKNLTVVKDGKEQKVEFDKAV
QVAVSIKEAKPATK
>Q06853 ~~~~~~Cell surface glycoprotein 2~~~COG1361
MKKNNVLTIAAMIALLLTSLLTSITFGETSSIPSRISMELDKTKANIGDIIIATIRIDNINNFSGYQLNIKYDPSYLQAV
NPLTGEPIKKRTMPAVNGTVLLKGDQYSITEVVENNVDEGILNFGKGYANLTEYRKSGKPETTGIIGKIGFKALKLGKTE
IKFENTPVMPGAKEGTLLFDWDAETITEYNVIQPKELAITLPDDAHIALELDKTKVKVGDVIVATVKAKNMTSMAGIQVN
IKYDPEVLQAIDPATGKPFTKETLLVDPELLSNREYNPLLTAVNDINSGIINYASCYVYWDSYRESGVSESTGIIGKVGF
KVLKAANTTVKLEETRFTPNSIDGTLVIDWYGQQIVGYKVIQPDKITVISEPEVPTQTPTQTPPTTTAPSQTPTQTPPTT
TAPSQTPTQTPAVTPTQSATPSDPGGGGGGLPGGGGGAVNPSASPTPTPTSKPTPTATKKPEPTEIEEPEPEIPGTVGIH
YSYLTGYPDKMFRPEKSITRAEAAVIFAKLLGANENTKINYNVSYTDVDSSHWASWAIKFVSYKKLFTGYPDGSFKPNQN
ITRAEFSTVVFKLLVSEKGLKEEKIEKSKFGDTKGHWAQQFIEQLSDLGYINGYPDGTFKPNNNIKRSESVALINRAMGR
GPLHGAPQVFEDVPQTHWAFKDIAEGVLNHRYKLDNEGKEQLLEIIDN
>P38538 ~~~~~~Surface layer protein~~~
MQDSGFKKKDRSTNIPQEQFVYTRGGEHKVMKKVVNSVLASALAITVAPMAFAAEDTTTAPKMDAAMEKTVKRLEALGLV
AGYGNGDFGADKTITRAEFATLIVRARGLEQGAKLAQFNTTYTDVRSTDWFAGFVNVASGEEIVKGFPDKSFKPQNQVTY
AEAVTMIVRALGYEPSVRGVWPNSMISKGSELNIAKGINNPNMQQFAATIFKMLDNALRVKLMEQIEYGTDIRLNVTDET
LLTKYLKVTVRDMDWAHEKGNNSDELPLVTNVPAIGLGSLKANEVTLNGKDADLGSNTTYKVAEGINPNAFDGQKVQVWI
KDDRENVIVWMEGSEDEDVVMDRVSALYLKGKAFTDDIVKDLSKSDLDDVKIEMDGSEKSYRLTEDTKITYNFTRFNDPV
DALSKIYKDNDTFGVKVVLNDNNEVAYLHIIDDQTIDKSVKGVKYGSKVISKIDADKKKITNLDNSKFSDLEDQDEGKDF
LVFLDGQPAKLGDLKESDVYSVYYADGDKDKYLVFANRNVAEGKVEKVVSRNKTDIRLTVGGKTYKVYPDASYSENANKD
VKKVNSDLDLISNLDGEEVKLLLDPSGRVRHIETKDAIDDRKPLAIITKGATYNSSKDTYDFTVMTQKGKTQIVSLDQKD
IYDRYGVNYDKSNDKRQAFEKDLVELLQPKVVKEDSATDANQTVLLEVNFDSKGEVDKVKVLDSKLKYSEKSTWDKLADE
DDDVVGDYEVTDKTAVFKMTGDLTPATGTKRGELKNAGTAKFKDVAKKSDLKVWYSVDEDKGEVQAIFVVDGSGLGGDHQ
FGMVKQYGTASKQDTITIVTKDGDSVTEKEYKLDGDADDLKVDQDIRRGDVISFTLNSDGEVIVDDVVEVVNNNHIDNTA
SKSATLMPEDERQKAGIDKLVVARVDEVDGNTISLNYADGKTQKYYTKASTAFIDVYDGLEGIDGVDEGDYIVMIDSADI
DGTRFDYVLVVSSDDEIRTQHISTKAVTDFLNKPTRLCTKSWRWGRSSHGTKVNTVNDEAVVDGIVTLPADASVRNFNIA
FDQEINSKDATVTVTNEDTLGNVTVSEVATDAKVLSFKTAKLDTTKTYIITVKGLKDKNGKAVKDVTLYVEFVAGV
>P09333 ~~~~~~Outer cell wall protein~~~COG0544
MNKKVVLSVLSTTLVASVAASAFAAPKDGIYIGGNIKKYYSTDVLFEMTPQAKATYASELNAMASDFNNVVFVDYKGKGA
SIEELFTKGSKVALGEPLKKEDFADLYKVVNKDGSSTATEDARAKVDPTPTGDLNVESVSANNLKEVVVTFDKAVDADTA
GDKAYYTFTANKLAVDKVTVSGKTVVLTLAAKAENQASYELNVDGIKGLVKTTKEVKFFDNTTPTVAAVAAIGPKQVKVT
FSEPLSAKPSFSVNNGAIAVVADNFVEGTKEVILTLGAQPTASTNTVTVEGGADYASYKVEKVTKDFTVVADTTPPTVSV
KKASAKQVVLEFSEDVQNVQDKNVVFYHTTKGHEGYKGTILGVDGKEVTISFVNPLPEGQFKIFVDYVVDNGTQISDLWG
NKLPEQVITGTFAADTTPPTVTKVEAKTNTEIHVTFSETVNGADNKANFTLKGVTGNVIPLTKAEVVDAAKNIYKVVTTE
PLNGGSYYLTVKGIEDASKNKLVEYTATVAVADTVPPNVKDLDPATPGTDAQLISPTKVKIAFTEPMDKASIENKNNYMF
NGFNLDSKVTLTATDSNTAVVVDFTNVVGFNGFKNGDAISVGRVLDTAGNPKTEMQTKVNLPNSVSAPLFDKAEVTGKNT
VKLYFKELIINAKADDFAVDNGEGYKAVNSISNDVVENKSVITLTTGNDLPTTAAGVKVKTVGEVDAKNQYGVAVALTDV
PADDKIGPNWLKAETVDTNNNGKIDQFKLTFSEALYVASVQDSDFRIEGYTIAGVETKGEVVTIKVTELDIDDSDATPTV
AVIGSVEDLKRNASGPFEPQKAIDGVSAPDKEAPVVTGVEAGKTYNTAVTPDSADKDIKTVVLKKDGKELAGYALKTPIS
ENGSYELVVTDNAGNTTTVKFKVDIPAEDKKAPEIKTVTDDKVAVADAPKWEAPKATATDDVDGDISDKIAVTYSSEDAG
SKVTDLASAQTHLGTAGNTVKVTYNVTDKAGNPATAVSATFTAI
>P35823 ~~~vapA~~~S-layer protein~~~
MFKKTLIAAAIVVGSAAPAFADVVISPNDNTFVTTSLASVTKQPVLDFSTAQQNLTLNFSEVGDLKNNGFIVLEIQGEGQ
FNDAEIRQWLSNGFWRRPFTGLLVNPNDHGNFANSGEVNDVRKFFKIISDGTQLTIVHTIDSNGKRLRLALASDVEETIN
FADAEVELKLNLANQAFKLTSGSQGTVALTAGALWNASYTADPVATKPLFKLGKLFQLSLTNAGKATALVSEGFLKLNIG
DANISATDFAITNVTTNQTIQRDKVNLTLTGDVSAFKKDANGNLVNKAGASIGWKAAADGQSATAVLGAGNMAGGVQNAL
AAFGTLYVAADNTVPVPAVNFNVKAEIQGDSQATYNYFKDELADLFILTRDGMKFDTITTGTTSANLIHIRDVSNILPTE
GGKIFVTITEYADHAANGRGEGTVLVTRKALSVTLPSGGAVTLKPADVAADVGASITAGRQARLVFEVETNQGEVAVKKS
NAEGVDIQNGTRGTAPLVDFTL
>Q9ZES5 ~~~ctc~~~S-layer protein~~~
MAKTNSYKKVIAGTMTAAMVAGVVSPVAAAGKSFPDVPADHWGIDSINYLVEKGAVTGNDKGMFEPGKELTRAEAATMMA
QILNLPIDKDAKPSFADSQGQWYTPFIAAVEKAGVIKGTGNGFEPNGKIDRVSMASLLVEAYKLDTKVNGTPATKFKDLE
TLNWGKEKANILVELGISVGTTADKWEPKKTVTKAEAAQFIAKTDKQFGTEVAKVESAKAVTTQKVEVKFSKAVEKLTKE
DVKLANKANNDKVLVKDVKLSEDKKSATVELYSNLAAKQTYTVDVNKVGKVEVTVGSLEAKTIEMADQTVVADEPTALKY
TVKDENGTEVVSPAGIEFVTPAAEKINAKGEITLAKGTSTTVKAVYKKDGKVVAESKEVKVSAEGTAVASISNWTVAAEK
ADFTSKDFKQNDKVYEGDNVSVQVELKDQFNNVVNNVKAEYESLNTEVAVVDKATGKVTVLSAGKAPVKVTVKDSKGKEL
VSKTVEIEAFAQKAMKEIKLEKTNVALSTKDVTDFKVKAPVLDQYGKEFAAPVEVKVLDKDGKELKEQKLVAKYENKELV
LNAHGQEAGKYTVELTAKSGKKEVKSKLALELKAPGVFSKFDVRGLENELDKYVTEENKKNEMVVSVLPVDANGLVLREK
EAATLKVTTTDKDGKVVDATSGQVAVNDAAGTITVGNEAKAGETYKVTVVADGKLITTHSFKVVDTAPAAKKLAVDFTST
SLNEVAQGSELKTALLNILSVDGVPATTAGATVTDVKFVSADTNVVSEETAKFGTKGSTSIFVKELTVKKGEQTQKVELD
KPVRVDVSIKEVKEVK
>P35827 ~~~sapA~~~S-layer protein~~~
MLNKTDVSMLYITIMGMASEGDGNKYWLDYANNNSLGVSSLANIMLDSPGAAKFFGDSLLAGNEKDFVTKIYSIALGNTS
DVDGINYWTKAITGGGEFTDSKGNVISVASLSKGDLIGAMINSMVNGGSAESKAIFEAKAAASDYFADATLVRDISGLDE
GTTSKLISEINSASDLDKVKSEIDALKSELPNPGSTYDLTEGNDNLKGTDLDDTFNGTTYVGNGTNKSTLSAFDKIDGGA
GRDTLNAIFTANNNAAAATKLDQAEIDKSLKGVTNVENINIISDLETSGDFVFNGYEKVGFNVLGDIVSFATDASKSVNV
ETTGTITAFTAAGTGKVDVVAGKISALTADSATSVNLTATNDTITLTSANAATSVNLKTSGAAKSATITSANAAKNITID
ATGVAAVTSATAVENLTVKHATNVTLAGNMDKLATVTLDNAALTAAIDIKSASTLNLINSSVNGHNISTAAKDVTVNLSG
SAAKVKLNTTAATDQTVTLKANATDNSLEFDSATAKTTSVTASGSGKTLVIKGAEVETLVNIDTTAFNGAADVSFGKDGQ
GGKFSVKTGTGDDKIEFVGTTLTEGSVIDGAGNDTIAMKSAALTSANFTMIKNIENVAISDAVATADLSSSAFKNSVIIT
TKEAADTTLTINKDQVINFTAADAGSVKLITVKLNDVTGANDVVKIVLDAAAKDASIALGTAAADKALVIDTGIETLNIT
SLVKATSPETTANTVNAKLTDVTSIIIDGDAKITLGHAGTAGTDYSKVSMIDASALKAGLTFDASAITLGANATIKGGSG
ADSITVKGGNIVVDLVAGGDDTITLKKGAEKTDITTVNNFNAGDKIDIADAKNGTFTFNKITMNSDANLDDYITKAVAGD
GSTNSAVSYFLHNGYTYVVVDGTAGATFTKATDTIIKLSGTLDLKLSGDNVVVDDGSVI
>P35828 ~~~rsaA~~~S-layer protein~~~COG2931
MAYTTAQLVTAYTNANLGKAPDAATTLTLDAYATQTQTGGLSDAAALTNTLKLVNSTTAVAIQTYQFFTGVAPSAAGLDF
LVDSTTNTNDLNDAYYSKFAQENRFINFSINLATGAGAGATAFAAAYTGVSYAQTVATAYDKIIGNAVATAAGVDVAAAV
AFLSRQANIDYLTAFVRANTPFTAAADIDLAVKAALIGTILNAATVSGIGGYATATAAMINDLSDGALSTDNAAGVNLFT
AYPSSGVSGSTLSLTTGTDTLTGTANNDTFVAGEVAGAATLTVGDTLSGGAGTDVLNWVQAAAVTALPTGVTISGIETMN
VTSGAAITLNTSSGVTGLTALNTNTSGAAQTVTAGAGQNLTATTAAQAANNVAVDGGANVTVASTGVTSGTTTVGANSAA
SGTVSVSVANSSTTTTGAIAVTGGTAVTVAQTAGNAVNTTLTQADVTVTGNSSTTAVTVTQTAAATAGATVAGRVNGAVT
ITDSAAASATTAGKIATVTLGSFGAATIDSSALTTVNLSGTGTSLGIGRGALTATPTANTLTLNVNGLTTTGAITDSEAA
ADDGFTTINIAGSTASSTIASLVAADATTLNISGDARVTITSHTAAALTGITVTNSVGATLGAELATGLVFTGGAGADSI
LLGATTKAIVMGAGDDTVTVSSATLGAGGSVNGGDGTDVLVANVNGSSFSADPAFGGFETLRVAGAAAQGSHNANGFTAL
QLGATAGATTFTNVAVNVGLTVLAAPTGTTTVTLANATGTSDVFNLTLSSSAALAAGTVALAGVETVNIAATDTNTTAHV
DTLTLQATSAKSIVVTGNAGLNLTNTGNTAVTSFDASAVTGTGSAVTFVSANTTVGEVVTIRGGAGADSLTGSATANDTI
IGGAGADTLVYTGGTDTFTGGTGADIFDINAIGTSTAFVTITDAAVGDKLDLVGISTNGAIADGAFGAAVTLGAAATLAQ
YLDAAAAGDGSGTSVAKWFQFGGDTYVVVDSSAGATFVSGADAVIKLTGLVTLTTSAFATEVLTLA
>P35829 ~~~slpA~~~S-layer protein~~~
MKKNLRIVSAAAAALLAVAPVAASAVSTVSAATTINASSSAINTNTNAKYDVDVTPSVSAVAANTANNTPAIAGNLTGTI
SASYNGKTYTANLKADTENATITAAGSTTAVKPAELAAGVAYTVTVNDVSFNFGSENAGKTVTLGSANSNVKFTGTNSDN
QTETNVSTLKVKLDQNGVASLTNVSIANVYAINTTDNSNVNFYDVTSGATVTNGAVSVNADNQGQVNVANVVAAINSKYF
AAQYADKKLNTRTANTEDAIKAALKDQKIDVNSVGYFKAPHTFTVNVKATSNTNGKSATLPVVVTVPNVAEPTVASVSKR
IMHNAYYYDKDAKRVGTDSVKRYNSVSVLPNTTTINGKTYYQVVENGKAVDKYINAANIDGTKRTLKHNAYVYASSKKRA
NKVVLKKGEVVTTYGASYTFKNGQKYYKIGDNTDKTYVKVANFR
>P38059 ~~~slpH~~~S-layer protein~~~
MKKNLRIVSAAAAALLAVAPIAATAMPVNAATTINADSAINANTNAKYDVDVTPSISAIAAVAKSDTMPAIPGSLTGSIS
ASYNGKSYTANLPKDSGNATITDSNNNTVKPAELEADKAYTVTVPDVSFNFGSENAGKEITIGSANPNVTFTEKTGDQPA
STVKVTLDQDGVAKLSSVQIKNVYAIDTTYNSNVNFYDVTTGATVTTGAVSIDADNQGQLNITSVVAAINSKYFAAQYDK
KQLTNVTFDTETAVKDALKAQKIEVSSVGYFKAPHTFTVNVKATSNKNGKSATLPVTVTVPNVADPVVPSQSKTIMHNAY
FYDKDAKRVGTDKVTRYNTVTVAMNTTKLANGISYYEVIENGKATGKYINADNIDGTKRTLKHNAYVYKTSKKRANKVVL
KKGTEVTTYGGSYKFKNGQRYYKIGANTEKTYVKVANFE
>Q05044 ~~~~~~S-layer protein~~~
MQSSLKKSLYLGLAALSFAGVAAVSTTASAKSYATAGAYSTLKTDAATRNVEATGTNALYTKPGTVKGAKVVASKATMAK
LASSKKSADYFRAYGVKTTNRGSVYYRVVTMDGKYRGYVYGGKSDTAFAGGIKSAETTTKADMPARTTGFYLTDTSKNTL
WTAPKYTQYKASKVSLYGVAKDTKFTVDQAATKTREGSLYYHVTATNGSGISGWIYAGKGFSTTATGTQVLGGLSTDKSV
TATNDNSVKIVYRTTDGTQVGSNTWVTSTDGTKAGSKVSDKAADQTALEAYINANKPSGYTVTNPNAADATYGNTVYATV
SQAATSKVALKVSGTPVTTALTTADANDKVAANDTTANGSSVAGSTVYAAGTKLAQLTTDLTGEKGQVVTLTAIDTDLED
ATFTGTTTYYSDLGKAYHYTYTYNKDSAASSNASTQFGSNVTGTLTATLVMGKSTATANGTTWFN
>P73817 ~~~~~~S-layer protein~~~COG2931
MALSPNVIAALQIMYTGRGVSASDLNWWATDGANITYAEAVALFASSPDAAIKYPFFQAPQTADKRQYVAQVFANLYNID
INDTSLVPTEELDYWINWLSLSPDNYLDFPNALNNASAAAGLTDRLEALTNKADVSLSYTEALSTAGVNTFTEAQYAEAA
GIIATVDDTNASVLAAEAQIVEIAASLSVFTIAQAQATPNLPPAYTISDTADNLIAGADDPVVTGANNVIANQSPAAPLS
VEDANILLATADELAAGVTWDILDTAADVLAGGAAVSGAASVGITDIVDVATASQLLALGNFDGVYAIADTSANIVADPG
VSGGATAITLSDPDVPVSVASATFLQGLGIPVGPSYIVEDTSANILAALSTPAIVNAAEVIVNNTDVPLSVAQAEDLLSL
PNLNAGFTYIIADTLDNLSAAPSTLLDGAVSYSLTNTNPDLGVITEAEAVIVNGATNASDFNFLVADVILTPQADIRSGN
SFLSVAVVEGGSIFNTLNSNDRLTGTGEDPTLSLTWQEATFGNINTIFPVLDGIETLVATLIENDLTLVSNDFDVVGQGF
ITGLKNVAASGTKGGDLELINLQTALETVSVTNYFFGDDVSFSIADPELAGDNDLLLLTVDQVTEDGPDVTSIKISDFSG
NGGYETLGLTSGVTTSSKGNTNTVDIEGIVAVESIGITGIENLTLSTSLIGSVVKVDATGSALIPEFEGREVFTGDLKAF
FDDRPGGDITFLSGSGNDEISIARDAFTLSEDLKDVISKGHILDGGAGNDELTITGDAFSDTDAGHTVIGGEGNDSILLT
GVAEGPIAGHVVNSFDLINEVGGAGDDDINISGDAIGDSAGHVVFGGAGEDDIFIGFDKTLAVSGNGAALGVDLAGHVVF
AGDDDDTVRITGDSFTSDSANGSGHSVEGGTGDDLIEISGDALTADPDSETIANPFFDDSEPSDLDLFIAADQPIPTTEE
QYQVLLAQLGLPADYNPRNFIRGVAAISGAHTVRGGEGNDVILFGPIAGEPGNGDGQHLAFGDEGDDFIEMTGIGSVEFN
GGAGDDTLVGGDGDPILGFGNDILNGDEGNDFLFGGKGNDNLQGGEGDDIMSGGEGDDFFFVDAGFDVIEDLGDANSETG
DQFQVSEDAEAEIRVVQDWEATGLTFNLGIATLTIENPGGGSVDLSASNVPPNTNGYTVIGNIGDDEIIGSRDDDSIFGG
RGEDSIAGLGGDDIIEGNDDDDFISGDSLLLPLLPLEEILPFGNDDIDAGSGNDVIAGDLLVVTGDDIDLNLFNGGKDTI
EAGLGSDITVGDWSIGAFGDIDLNASLERTAIGGDDTITTKQGDNGIVFPIGQVAIDNFLVGDLAAAVDGVGNDIFLTET
LTVIGGDDTMTGADGLDVIVGDVGLFGFEFNDSEINLTNFKLGQVNGSTVSAGDDSITGEGGNDILVGDLFVGVINNNGI
IIDGGKGFQLGKDGTTSFIGGDDSISGGDGNDFLAGDFVLVDQLSAPFDPLDPNDWTFVNPYATLQGQAGDSKAQAAQAA
INLAQLRLEFRAVGGDDELVGGRGNDTFYGGLGADTIDIGNDVTVGGVGVNGANEIWYMNGAFENAAVNGANVDNITGFN
VNNDKFVFAAGANNFLSGDATSGLAVQRVLNLQAGNTVFNLNDPILNASANNINDVFLAVNADNSVGASLSFSLLPGLPS
LVEMQQINVSSGALAGREFLFINNGVAAVSSQDDFLVELTGISGTFGLDLTPNFEVREFYA
>P22258 ~~~~~~Cell surface protein~~~COG2373
MKNLKKLIAVVSTFALVFSAMAVGFAATTPFTDVKDDAPYASAVARLYALNITNGVGDPKFGVDQPVTRAQMITFVNRML
GYEDLAEMAKSEKSAFKDVPQNHWAVGQINLAYKLGLAQGVGNGKFDPNSELRYAQALAFVLRALGFKDLDWPYGYLAKA
QDLGLVHGLNLAYNGVIKRGDLALILDRALEVPMVKYVDGKEVLGEPLISKVATKAEYTVIATNAQDRSVEEGKVAVLDK
DGKLTTINAGLVDFSEYLGKKVIVYSERFGDPVYVAEGDNDVVSFTEGQDSVGTTVYKNDDNKTAIKVDDNAYVLYNGYL
TKVSKVTVKEGAEVTIINNNYLIVNGSYDNSTIVYNDVQSGDKYLNRDSNYELKGTVTVTGAVSKVTDIKANDYIYYGKQ
YDVNGNVVGTVIYVVRNQVTGTVTEKSVSGSTYKASIDNVSYTVADNNVWNQLEPGKKVTVILNKDNVIVGISSTTTTTA
VNYAIFKEKSDPFTAWFAKVKLILPDAAEKVFDAVYSDVYDKVNLAEGTIVTYTVDANGKLNDIQRANDQPFSSASYKAD
AKVLTEGSTTYYITDNTVLLNNTSDGYKALKLTDLKDATNLNVKIVADNYNVAKVVVFNNASFVSTTTSTVYAYVTGTAD
VYVNGSTFTRLTVLENGQTKTYDANAQLATNYTHKAVVLTLTNAKIANIALPTVASGVKLTNIDQANLRITDTTNKGYLL
DPNFIVVDTNGNLKGLSDITKDTGVNLYTNDVGKVFVIEIVQ
>Q1QWN6 1.1.1.310~~~slcC~~~(S)-sulfolactate dehydrogenase~~~COG0111
MSDVLISEFMDEAAVADLERDCSVTFDATLVDDRARLLSSGAGVRALIVRNRTRVDRELLARFPDLRAVGRLGVGLDNID
VDACRESDIAVLPATGGNTVSVAEYVLTGIFMLRRGAYLSTPRVLAGEWPRQALMGHETQGATLGLVGFGGIARDLARRA
QCLGMQVMAHDPFVPADDAAWQTVERAERLATLLEKADAVSLHVPLSEGTRHLIDGEALATMKPGSLLINTARGGIVDER
ALAASLRDRHLGGAMLDVFEEEPLTADSVLSGVEGLIATPHIAGVTHESNERISWITVDNVRRALGVRA
>Q8KIL1 1.1.99.22~~~sldA~~~Glycerol dehydrogenase large subunit~~~
MRRPYLLATAAGLALACSPLIAHAQFAPAGAGGEPSSSVPGPGNASEPTENSPKSQSYFAGPSPYAPQAPGVNAANLPDI
ESIDPSQVPAMAPQQSANPARGDWVAYGRDDHQTRYSPLSEITPENASKLKVAFVYHTGSYPRPGQVNKWAAETTPIKVG
DGLYTCSAMNDIIKLDPATGKQIWRRNVDVKYHSIPYTAACKGVTYFTSSVVPEGQPCHNRLIEGTLDMRLIAVDAETGD
FCPNFGHGGQVNLMQGLGESVPGFVSMTAPPPVINGVVVVNHEVLDGQRRWAPSGVIRGYDAESGKFVWAWDVNNSDDHS
QPTGNRHYSRGTPNSWATMTGDNEEGLVYVPTGNSAADYYSALRSDAENKVSSAVVAIDVKTGSPRWVFQTAHKDVWDYD
IGSQATLMDMPGPDGQTVPALIMPTKRGQTFVLDRRTGKPILPVEERPAPSPGVIPGDPRSPTQPWSVGMPALRVPDLKE
TDMWGMSPIDQLFCRIKFRRANYVGEFTPPSVDKPWIEYPGYNGGSDWGSMSYDPQSGILIANWNITPMYDQLVTRKKAD
SLGLMPIDDPNFKPGGGGAEGNGAMDGTPYGIVVTPFWDQYTGMMCNRPPYGMITAIDMKHGQKVLWQHPLGTARANGPW
GLPTGLPWEIGTPNNGGSVVTGGGLIFIGAATDNQIRAIDEHTGKVVWSAVLPGGGQANPMTYEANGHQYVAIMAGGHHF
MMTPVSDQLVVYALPDAIKQ
>Q8L1D5 1.1.99.22~~~sldB~~~Glycerol dehydrogenase small subunit~~~
MPNLQGNRTLTEWLTLLLGVIVLLVGLFFVIGGADLAMLGGSTYYVLCGILLVASGVFMLMGRTLGAFLYLGALAYTWVW
SFWEVGFSPIDLLPRAFGPTILGILVALTIPVLRRMESRRTLRGAV
>Q2G0U9 3.5.1.28~~~sle1~~~N-acetylmuramoyl-L-alanine amidase sle1~~~COG1388
MQKKVIAAIIGTSAISAVAATQANAATTHTVKPGESVWAISNKYGISIAKLKSLNNLTSNLIFPNQVLKVSGSSNSTSNS
SRPSTNSGGGSYYTVQAGDSLSLIASKYGTTYQNIMRLNGLNNFFIYPGQKLKVSGTASSSNAASNSSRPSTNSGGGSYY
TVQAGDSLSLIASKYGTTYQKIMSLNGLNNFFIYPGQKLKVTGNASTNSGSATTTNRGYNTPVFSHQNLYTWGQCTYHVF
NRRAEIGKGISTYWWNANNWDNAAAADGYTIDNRPTVGSIAQTDVGYYGHVMFVERVNNDGSILVSEMNYSAAPGILTYR
TVPAYQVNNYRYIH
>Q7A7E0 3.5.1.28~~~sle1~~~N-acetylmuramoyl-L-alanine amidase sle1~~~
MQKKVIAAIIGTSAISAVAATQANAATTHTVKPGESVWAISNKYGISIAKLKSLNNLTSNLIFPNQVLKVSGSSNSTSNS
SRPSTNSGGGSYYTVQAGDSLSLIASKYGTTYQNIMRLNGLNNFFIYPGQKLKVSGTASSSNAASNSSRPSTNSGGGSYY
TVQAGDSLSLIASKYGTTYQKIMSLNGLNNFFIYPGQKLKVTGNASTNSGSATTTNRGYNTPVFSHQNLYTWGQCTYHVF
NRRAEIGKGISTYWWNANNWDNAAAADGYTIDNRPTVGSIAQTDVGYYGHVMFVERVNNDGSILVSEMNYSAAPGILTYR
TVPAYQVNNYRYIH
>P0A3V1 ~~~sleB~~~Spore cortex-lytic enzyme~~~COG3409
MRQKAIFKIAVLLAFIGLSLMVSSIQLKNVEAFSNQVIQRGASGEDVIELQSRLKYNGFYTGKVDGVFGWGTYWALRNFQ
EKFGLPVDGLAGAKTKQMLVKATKYDKSTANKGTTTNKGNSGGTAQENKPPQNKGTNVPNGYSQNDIQLMANAVYGESRG
EPYLGQVAVAAVILNRVTSASFPNTVSGVIFEPRAFTAVADGQIYLTPNETAKKAVLDAINGWDPTGNALYYFNPDTATS
KWIWTRPQIKKIGKHIFCK
>P0A3V0 ~~~sleB~~~Spore cortex-lytic enzyme~~~
MRQKAIFKIAVLLAFIGLSLMVSSIQLKNVEAFSNQVIQRGASGEDVIELQSRLKYNGFYTGKVDGVFGWGTYWALRNFQ
EKFGLPVDGLAGAKTKQMLVKATKYDKSTANKGTTTNKGNSGGTAQENKPPQNKGTNVPNGYSQNDIQLMANAVYGESRG
EPYLGQVAVAAVILNRVTSASFPNTVSGVIFEPRAFTAVADGQIYLTPNETAKKAVLDAINGWDPTGNALYYFNPDTATS
KWIWTRPQIKKIGKHIFCK
>P50739 ~~~sleB~~~Spore cortex-lytic enzyme~~~COG3409
MKSKGSIMACLILFSFTITTFINTETISAFSNQVIQRGATGDDVVELQARLQYNGYYNGKIDGVYGWGTYWAVRNFQDQF
GLKEVDGLVGAKTKQTLICKSKYYREYVMEQLNKGNTFTHYGKIPLKYQTKPSKAATQKARQQAEARQKQPAEKTTQKPK
ANANKQQNNTPAKARKQDAVAANMPGGFSNNDIRLLAQAVYGEARGEPYEGQVAIAAVILNRLNSPLFPNSVAGVIFEPL
AFTAVADGQIYMQPNETAREAVLDAINGWDPSEEALYYFNPDTATSPWIWGRPQIKRIGKHIFCE
>P0DPJ9 3.2.1.-~~~sleL~~~Cortical fragment-lytic enzyme~~~
MIQIVTVRSGDSVYSLASKYGSTPDEIVKDNGLNPAETLVVGQALIVNTKGNNYYVQPGDSLYRISQTYNVPLASLAKVN
NLSLKSILHVGQQLYIPKGTKRAVESIAYLQPSTIPIKESLVNATRAINPFLTYLAYFSFEAKRDGTLKEPTETAKIANI
ATQGNTIPMLVITNIENGNFSADLTSVILRDATIQNKFITNILQTAEKYGMRDIHFDFESVAPEDREAYNRFLRNVKTRL
PSGYTLSTTLVPKTSSNQKGKFFETHDYKAQGQIVDFVVIMTYDWGWQGGPPMAISPIGPVKEVLQYAKSQMPPQKIMMG
QNLYGFDWKLPFKEGNPPAKAISSVAAVALARKYNVPIRYDFTAQAPHFNYFDENGVQHEVWFEDSRSVQSKFNLMKEQG
IGGISYWKIGLPFPQNWRLLVENFTITKKG
>Q9K3E4 3.2.1.-~~~sleL~~~Cortical fragment-lytic enzyme~~~
MIQIVTVRSGDSVYSLASKYGSTPDEIVKDNGLNPAETLVVGQALIVNTKGNNYYVQPGDSLYRISQTYNVPLASLAKVN
NLSLKSILHVGQQLYVPKGTKRAVESIAYLQPSTIPIKESLVNATRAINPFLTYLAYFSFEAKRDGTLKEPTETAKIANI
ATQGKTIPMLVITNIENGNFSADLTSVILRDATIQNKFITNILQTAQKYGMRDIHFDFESVAPEDREAYNRFLRNVKTRL
PSGYTLSTTLVPKTSSNQKGKFFEAHDYKAQGQIVDFVVNMTYDWGWQGGPPMAISPIGPVKEVLQYAKSQMPPQKIMMG
QNLYGFDWKLPFKQGNPPAKAISSVAAVALARKYNVPIRYDFTAQAPHFNYFDENGVQHEVWFEDSRSVQSKFNLMKEQG
IGGISYWKIGLPFPQNWRLLVENFTITKKG
>P37531 3.2.1.-~~~sleL~~~Cortical fragment-lytic enzyme~~~COG1388
MQIYVVKQGDTLSAIASQYRTTTNDITETNEIPNPDSLVVGQTIVIPIAGQFYDVKRGDTLTSIARQFNTTAAELARVNR
IQLNTVLQIGFRLYIPPAPKRDIESNAYLEPRGNQVSENLQQAAREASPYLTYLGAFSFQAQRNGTLVAPPLTNLRSITE
SQNTTLMMIITNLENQAFSDELGRILLNDETVKRRLLNEIVENARRYGFRDIHFDFEYLRPQDREAYNQFLREARDLFHR
EGLEISTALAPKTSATQQGRWYEAHDYRAHGEIVDFVVLMTYEWGYSGGPPQAVSPIGPVRDVIEYALTEMPANKIVMGQ
NLYGYDWTLPYTAGGTPARAVSPQQAIVIADQNNASIQYDQTAQAPFFRYTDAENRRHEVWFEDARSIQAKFNLIKELNL
RGISYWKLGLSFPQNWLLLSDQFNVVKKTFR
>P0C093 ~~~slmA~~~Nucleoid occlusion factor SlmA~~~COG1309
MAEKQTAKRNRREEILQSLALMLESSDGSQRITTAKLAASVGVSEAALYRHFPSKTRMFDSLIEFIEDSLITRINLILKD
EKDTTARLRLIVLLLLGFGERNPGLTRILTGHALMFEQDRLQGRINQLFERIEAQLRQVLREKRMREGEGYTTDETLLAS
QILAFCEGMLSRFVRSEFKYRPTDDFDARWPLIAAQLQ
>B5XTG2 ~~~slmA~~~Nucleoid occlusion factor SlmA~~~
MAEKQTAKRNRREEILQSLALMLESSDGSQRITTAKLAASVGVSEAALYRHFPSKTRMFDSLIEFIEDSLITRINLILKD
EKDTTARLRLIVLLILGFGERNPGLTRILTGHALMFEQDRLQGRINQLFERIEVQLRQVMREKKMREGEGYTLDETLLAS
QLLAFCEGMLSRFVRSEFKYRPTDDFEARWPLVAAQLQ
>A6TFN2 ~~~slmA~~~Nucleoid occlusion factor SlmA~~~
MAEKQTAKRNRREEILQSLALMLESSDGSQRITTAKLAASVGVSEAALYRHFPSKTRMFDSLIEFIEDSLITRINLILKD
EKDTTARLRLIVLLILGFGERNPGLTRILTGHALMFEQDRLQGRINQLFERIEAQLRQVMREKKMREGEGYTLDETLLAS
QLLAFCEGMLSRFVRSEFKYRPTDDFDARWPLVAAQLQ
>Q9KVD2 ~~~slmA~~~Nucleoid occlusion factor SlmA~~~COG1309
MAGNKKINRREEILQALAEMLESNEGASRITTAKLAKQVGVSEAALYRHFPSKARMFEGLIEFIEESLMSRINRIFDEEK
DTLNRIRLVMQLLLAFAERNPGLTRILSGHALMFENERLRDRINQLFERIETSLRQILRERKLREGKSFPVDENILAAQL
LGQVEGSLNRFVRSDFKYLPTANFDEYWALLSAQIK
>A6XIG6 3.1.1.92~~~~~~4-sulfomuconolactone hydrolase~~~
MSEQAVEVSPKCLGPQHHINPLRFVMPPGSWDTHFHVFGPTTKYPYSETRKYTPPDSPFEEYVKLMLALGIERGVCVHPN
IHGPDNSVTLDAVERSEGRFLAIVKIAPDVTLPQLKEMKKKGACGVRFAFNPEHGSGELDTALFDRVVQWCGELDWCVNL
HFASNAIHSLAERLSQLTIPTLIDHFGRVHPTKGVDQPDFKTLVDLMRLPHMWVKLTGADRISRNSPSYQDVVPLARTLV
DVAPDRVIWGTDWPHSGYFDVKRMPNDGDLTNLLLDFAPSEEQRRRILVDNPSRLFGQVAKGA
>A6XIG7 3.1.1.92~~~~~~4-sulfomuconolactone hydrolase~~~
MLPADQAGIPPCQGPRARSAPISFAIPKGAWDTHLHVFGPTAVFPYAEKRPYTPPDSPLEDYLALMERLGIERGVCVHPN
VHGIDNSVTIDAVERSDRRLLGIIKPHRVMTFTELRDLKTRGVRGVRFAFNPQHGSGALDTELFERMHGWCRELDWCINM
HFAPDALEGLCDLIAGAETPIIIDHFGRVETAAGVNQLPFKILRDLATLDHVWIKLTGADRISHSGVPYDDVVPFAHALS
EIAPDRLLWGSDWPHSGYFDPKRMPDDGDLLNLVARFAPDVALRHKILVDNPARLFGVI
>Q9RRB6 ~~~slpA~~~Outer membrane protein SlpA~~~COG3206
MKKSLIALTTALSFGLAAAQTAAPVSAPQVPALTDVPAGHWAKDAIDRLVSRGVILGYPDGTFRGTQNLTRYEAAIIIAR
LLDQMRDGETPAGMTAEDMTALQNAIQELAADLAALGVRVSDLEANAVSKDDFARLEARIEEVAAAGGEQGATEALQGQI
DDLTARVDEYDALRADVDDNASSIAALNDLTVLLNQDILDLQDRVSAVEAAQADFVQRSDFDALGGRVTTVETRVETVNN
SLTGRIAALERNAFSVKPSLTIGYSVSRTSRNFDVDRLFPLNADGTVANNAFTSGGIDTDTGAQRRDFGDFGNASDPVVA
GAAGLYGFADGVSYTVYFTDGSTATFDGLNPADYKVPTGKVIDTTKGRNGFGFNNLARYKEGSTDIGISLGFDTSGQFSQ
VTSGTGGSLFSTAGRLQVNQIDLNFGLVTGLPSDAYVDTNGNGKKDDGEATGRGTYLGSGGTAAILRDPAGNVYRPVFFR
FKNATTQFSVGNNPVIVTLGQQQKFYFSDYVFDNNYDGRGDGFTVTVDGSNVPVIGAWKPQIKGVYGSRSGLDGTAEAGY
GVYYRGVRAQITPVGTLTAGIHYAQEGRDMFGAAQNTTSTPSDVTTYGADLHGKAFGVELHSEYATSRVRPNTANAAVQT
SNAFYARVATRKDNLAFDLNTPAAKFGNDTFGVSLYDLNYRKIDAGYNNVAGISEYGYGSYSRTSAQNIAYNPDTGVTAP
FANLDRQAYTDANNDGTSDRNADGTVVATNTKIGQMGFGVKAAANLGPVAIGGYYDTSTGANGDNANRMTEAGGSAKVAY
SIFSLRGTYNTLDSNRPQIYRDAAGTQIIGDAKVRRYAVQADVTPGLGLFVGAYYRDVNVNGVRSTTDRGLLGRGYLASS
FEPGVGNNAYRTGLRCADNNFGTGTRDIDGVGGVLNPAVNLDQSRTATCFTSYGVEAGHAGDNANALVKDLFFRVGYSRV
YVPTTATATTGDFSGSVTYGDARYDRKVGVANVRLAGSFSTTNTQLDSRPAGTRGAVGLIVRTDPLENVPFRPQFNGQVG
YYTADNRVAAGNYNANATKYGAGVVLNDFLLPQTKIGVRYDGYMAQNRQYTPFDGDGTQGYFSDANNNRRTNLNGVYVEG
AYQDLIFSYGTYTLSQKDLNGVEYGSGINNGQPARGQTFKISYKVNF
>P37194 ~~~slp~~~Outer membrane protein Slp~~~COG3065
MNMTKGALILSLSFLLAACSSIPQNIKGNNQPDIQKSFVAVHNQPGLYVGQQARFGGKVINVINGKTDTLLEISVLPLDS
YAKPDIEANYQGRLLARQSGFLDPVNYRNHFVTILGTIQGEQPGFINKVPYNFLEVNMQGIQVWHLREVVNTTYNLWDYG
YGAFWPEPGWGAPYYTNAVSQVTPELVK
>P0C8M5 ~~~slrA~~~Transcriptional regulator SlrA~~~
MKTHVKKDLDKGWHMLIQEARSIGLGIHDVRQFLESETASRKKNHKKTVRQD
>D0ZRB2 2.3.2.27~~~slrP~~~E3 ubiquitin-protein ligase SlrP~~~
MFNITNIQSTARHQSISNEASTEVPLKEEIWNKISAFFSSEHQVEAQNCIAYLCHPPETASPEEIKSKFECLRMLAFPAY
ADNIQYSRGGADQYCILSENSQEILSIVFNTEGYTVEGGGKSVTYTRVTESEQASSASGSKDAVNYELIWSEWVKEAPAK
EAANREEAVQRMRDCLKNNKTELRLKILGLTTIPAYIPEQITTLILDNNELKSLPENLQGNIKTLYANSNQLTSIPATLP
DTIQEMELSINRITELPERLPSALQSLDLFHNKISCLPENLPEELRYLSVYDNSIRTLPAHLPSEITHLNVQSNSLTALP
ETLPPGLKTLEAGENALTSLPASLPPELQVLDVSKNQITVLPETLPPTITTLDVSRNALTNLPENLPAALQIMQASRNNL
VRLPESLPHFRGEGPQPTRIIVEYNPFSERTIQNMQRLMSSVDYQGPRVLFAMGDFSIVRVTRPLHQAVQGWLTSLEEED
VNQWRAFEAEANAAAFSGFLDYLGDTQNTRHPDFKEQVSAWLMRLAEDSALRETVFIIAMNATISCEDRVTLAYHQMQEA
TLVHDAERGAFDSHLAELIMAGREIFRLEQIESLAREKVKRLFFIDEVEVFLGFQNQLRESLSLTTMTRDMRFYNVSGIT
ESDLDEAEIRIKMAENRDFHKWFALWGPWHKVLERIAPEEWREMMAKRDECIETDEYQSRVNAELEDLRIADDSDAERTT
EVQMDAERAIGIKIMEEINQTLFTEIMENILLKKEVSSLMSAYWR
>Q8ZQQ2 2.3.2.27~~~slrP~~~E3 ubiquitin-protein ligase SlrP~~~
MFNITNIQSTARHQSISNEASTEVPLKEEIWNKISAFFSSEHQVEAQNCIAYLCHPPETASPEEIKSKFECLRMLAFPAY
ADNIQYSRGGADQYCILSENSQEILSIVFNTEGYTVEGGGKSVTYTRVTESEQASSASGSKDAVNYELIWSEWVKEAPAK
EAANREEAVQRMRDCLKNNKTELRLKILGLTTIPAYIPEQITTLILDNNELKSLPENLQGNIKTLYANSNQLTSIPATLP
DTIQEMELSINRITELPERLPSALQSLDLFHNKISCLPENLPEELRYLSVYDNSIRTLPAHLPSGITHLNVQSNSLTALP
ETLPPGLKTLEAGENALTSLPASLPPELQVLDVSKNQITVLPETLPPTITTLDVSRNALTNLPENLPAALQIMQASRNNL
VRLPESLPHFRGEGPQPTRIIVEYNPFSERTIQNMQRLMSSVDYQGPRVLFAMGDFSIVRVTRPLHQAVQGWLTSLEEED
VNQWRAFEAEANAAAFSGFLDYLGDTQNTRHPDFKEQVSAWLMRLAEDSALRETVFIIAMNATISCEDRVTLAYHQMQEA
TLVHDAERGAFDSHLAELIMAGREIFRLEQIESLAREKVKRLFFIDEVEVFLGFQNQLRESLSLTTMTRDMRFYNVSGIT
ESDLDEAEIRIKMAENRDFHKWFALWGPWHKVLERIAPEEWREMMAKRDECIETDEYQSRVNAELEDLRIADDSDAERTT
EVQMDAERAIGIKIMEEINQTLFTEIMENILLKKEVSSLMSAYWR
>O34321 ~~~yocM~~~Salt stress-responsive protein YocM~~~COG0071
MDFEKMKQWMEFAQQMYGGDFWKQVFDEDQKTPFMTNGQSPFPFAQQDQRGKGDASFPSMDIVDTVAEVQFLIYLPGYRK
QDVHILSYGDYLVVKGQRFSYFNEQDFRQKEGKYGSFEKKIPLSDHLHGKMNAIFKDGILYITIQKDEGQAKTIVIDD
>P0AGC3 4.2.2.n1~~~slt~~~Soluble lytic murein transglycosylase~~~COG0741
MEKAKQVTWRLLAAGVCLLTVSSVARADSLDEQRSRYAQIKQAWDNRQMDVVEQMMPGLKDYPLYPYLEYRQITDDLMNQ
PAVTVTNFVRANPTLPPARTLQSRFVNELARREDWRGLLAFSPEKPGTTEAQCNYYYAKWNTGQSEEAWQGAKELWLTGK
SQPNACDKLFSVWRASGKQDPLAYLERIRLAMKAGNTGLVTVLAGQMPADYQTIASAIISLANNPNTVLTFARTTGATDF
TRQMAAVAFASVARQDAENARLMIPSLAQAQQLNEDQIQELRDIVAWRLMGNDVTDEQAKWRDDAIMRSQSTSLIERRVR
MALGTGDRRGLNTWLARLPMEAKEKDEWRYWQADLLLERGREAEAKEILHQLMQQRGFYPMVAAQRIGEEYELKIDKAPQ
NVDSALTQGPEMARVRELMYWNLDNTARSEWANLVKSKSKTEQAQLARYAFNNQWWDLSVQATIAGKLWDHLEERFPLAY
NDLFKRYTSGKEIPQSYAMAIARQESAWNPKVKSPVGASGLMQIMPGTATHTVKMFSIPGYSSPGQLLDPETNINIGTSY
LQYVYQQFGNNRIFSSAAYNAGPGRVRTWLGNSAGRIDAVAFVESIPFSETRGYVKNVLAYDAYYRYFMGDKPTLMSATE
WGRRY
>P40676 ~~~slyA~~~Transcriptional regulator SlyA~~~
MESPLGSDLARLVRIWRALIDHRLKPLELTQTHWVTLHNIHQLPPDQSQIQLAKAIGIEQPSLVRTLDQLEDKGLISRQT
CASDRRAKRIKLTEKADALIAEMEEVIHKTRGEILAGISSEEIELLIKLIAKLEHNIMELHSHD
>P69989 ~~~slyA~~~Transcriptional regulator SlyA~~~
MESTLGSDLARLVRVWRALIDHRLKPLELTQTHWVTLYNINRLPPEQSQIQLAKAIGIEQPSLVRTLDQLEEKGLITRHT
CANDRRAKRIKLTEQSSPIIEQVDGVICSTRKEILGGISSDEIAVLSGLIDKLEKNIIQLQTK
>B1JJ73 ~~~slyA~~~Transcriptional regulator SlyA~~~
MESTLGSDLARLVRVWRALIDHRLKPLELTQTHWVTLYNINRLPPEQSQIQLAKAIGIEQPSLVRTLDQLEEKGLITRHT
CANDRRAKRIKLTEQSSPIIEQVDGVICSTRKEILGGISSDEIAVLSGLIDKLEKNIIQLQTK
>P0A905 ~~~slyB~~~Outer membrane lipoprotein SlyB~~~COG3133
MIKRVLVVSMVGLSLVGCVNNDTLSGDVYTASEAKQVQNVSYGTIVNVRPVQIQGGDDSNVIGAIGGAVLGGFLGNTVGG
GTGRSLATAAGAVAGGVAGQGVQSAMNKTQGVELEIRKDDGNTIMVVQKQGNTRFSPGQRVVLASNGSQVTVSPR
>P0A1X0 ~~~slyB~~~Outer membrane lipoprotein SlyB~~~
MIKRVLAVSLMGLSLAGCVNNDSLSGDVYTASEAKQVQNVTYGTIVNVRPVQIQGGDDSNVIGAIGGAVLGGFLGNTIGG
GTGRSLATAAGAVAGGVAGQGVQSAMNKTQGVELEIRKDDGNTIMVVQKQGNTRFSAGQRVVLASNGSQVTVSPR
>P0A9K9 5.2.1.8~~~slyD~~~FKBP-type peptidyl-prolyl cis-trans isomerase SlyD~~~COG1047
MKVAKDLVVSLAYQVRTEDGVLVDESPVSAPLDYLHGHGSLISGLETALEGHEVGDKFDVAVGANDAYGQYDENLVQRVP
KDVFMGVDELQVGMRFLAETDQGPVPVEITAVEDDHVVVDGNHMLAGQNLKFNVEVVAIREATEEELAHGHVHGAHDHHH
DHDHDGCCGGHGHDHGHEHGGEGCCGGKGNGGCGCH
>O25748 5.2.1.8~~~slyD~~~FKBP-type peptidyl-prolyl cis-trans isomerase SlyD~~~COG1047
MQNHDLESIKQAALIEYEVREQGSSIVLDSNISKEPLEFIIGTNQIIAGLEKAVLKAQIGEWEEVVIAPEEAYGVYESSY
LQEVPRDQFEGIELEKGMSVFGQTEDNQTIQAIIKDFSATHVMVDYNHPLAGKTLAFRFKVLGFREVSEEEILASHHGGG
TGCCGGHGGHGGKKGGGCGCSCSHG
>Q9CKP2 5.2.1.8~~~slyD~~~FKBP-type peptidyl-prolyl cis-trans isomerase SlyD~~~
MKIAKNVVVSIAYQVRTEDGVLVDEAPVNQPLEYLQGHNNLVIGLENALEGKAVGDKFEVRVKPEEAYGEYNENMVQRVP
KDVFQGVDELVVGMRFIADTDIGPLPVVITEVAENDVVVDGNHMLAGQELLFSVEVVATREATLEEIAHGHIHQEGGCCG
GHHHDSDEEGHGCGCGSHHHHEHEHHAHDGCCGNGGCKH
>O83369 5.2.1.8~~~slyD~~~FKBP-type peptidyl-prolyl cis-trans isomerase SlyD~~~COG1047
MKIANECVVNIEYTLRDDTGEIIDSSDVMGALEYVQGHGMIIPGLETALINREEGEEFSVTIPPVGAYGEVQEDLRMTVG
RDQFPPNVPIEVGMRFDAGSGGDSRPVTVTDVQGETIIVDGNHPLAGKTLHFEVAVRSVREATDDDLAALLFRESTSGGG
CGSGAGGCGSCGAGCH
>Q9KNX6 5.2.1.8~~~slyD~~~FKBP-type peptidyl-prolyl cis-trans isomerase SlyD~~~COG1047
MKIEKNTVASLAYQLTIEDGVVVDQSTVDAPLDYLHGHNNLITGLERELEGKVAGDKFTVTIAPEDAYGEHNEDLVQRVP
ADVFQGVDELEVGMRFLADTDQGPIPVEITEVDGDEVVVDGNHMLAGQSLTFTVEVVAVRAATEDEIAHGHIHQAGGCGH
DHDHDHDHEGGCCGGEGHGHDHHGHGKKEGGCCGGGGCGSH
>Q7CFU4 5.2.1.8~~~slyD~~~FKBP-type peptidyl-prolyl cis-trans isomerase SlyD~~~COG1047
MKVTKDLVVSLAYQVRTEDGVLVDESPVSAPLDYLHGHGSLIAGLENALEGHEAGDSFDVRVNADEGYGSYDENLVQRVP
KDVFMGVDELEVGMRFLADTDQGPVPVEITAVEDEHVVVDGNHMLAGQDLNFHVEVVAVREATEEELQHGHVHGEHDHHH
EHGDGCCGGHGHDDHEHEHEHGKGGCGKSGGCGCH
>Q8PAH9 ~~~slyX~~~Protein SlyX homolog~~~COG2900
MHEQLSPRDQELEARLVELETRLSFQEQALTELSEALADARLTGARNAELIRHLLEDLGKVRSTLFADAADEPPPPHY
>P9WKQ1 3.1.4.12~~~spmT~~~Sphingomyelinase~~~COG3021
MDYAKRIGQVGALAVVLGVGAAVTTHAIGSAAPTDPSSSSTDSPVDACSPLGGSASSLAAIPGASVPQVGVRQVDPGSIP
DDLLNALIDFLAAVRNGLVPIIENRTPVANPQQVSVPEGGTVGPVRFDACDPDGNRMTFAVRERGAPGGPQHGIVTVDQR
TASFIYTADPGFVGTDTFSVNVSDDTSLHVHGLAGYLGPFHGHDDVATVTVFVGNTPTDTISGDFSMLTYNIAGLPFPLS
SAILPRFFYTKEIGKRLNAYYVANVQEDFAYHQFLIKKSKMPSQTPPEPPTLLWPIGVPFSDGLNTLSEFKVQRLDRQTW
YECTSDNCLTLKGFTYSQMRLPGGDTVDVYNLHTNTGGGPTTNANLAQVANYIQQNSAGRAVIVTGDFNARYSDDQSALL
QFAQVNGLTDAWVQVEHGPTTPPFAPTCMVGNECELLDKIFYRSGQGVTLQAVSYGNEAPKFFNSKGEPLSDHSPAVVGF
HYVADNVAVR
>Q82S91 ~~~smbP~~~Metal-binding protein SmbP~~~
MKTTLIKVIAASVTALFLSMQVYASGHTAHVDEAVKHAEEAVAHGKEGHTDQLLEHAKESLTHAKAASEAGGNTHVGHGI
KHLEDAIKHGEEGHVGVATKHAQEAIEHLRASEHKSH
>P51834 ~~~smc~~~Chromosome partition protein Smc~~~COG1196
MFLKRLDVIGFKSFAERISVDFVKGVTAVVGPNGSGKSNITDAIRWVLGEQSARSLRGGKMEDIIFAGSDSRKRLNLAEV
TLTLDNDDHFLPIDFHEVSVTRRVYRSGESEFLINNQPCRLKDIIDLFMDSGLGKEAFSIISQGKVEEILSSKAEDRRSI
FEEAAGVLKYKTRKKKAENKLFETQDNLNRVEDILHELEGQVEPLKIQASIAKDYLEKKKELEHVEIALTAYDIEELHGK
WSTLKEKVQMAKEEELAESSAISAKEAKIEDTRDKIQALDESVDELQQVLLVTSEELEKLEGRKEVLKERKKNAVQNQEQ
LEEAIVQFQQKETVLKEELSKQEAVFETLQAEVKQLRAQVKEKQQALSLHNENVEEKIEQLKSDYFELLNSQASIRNELQ
LLDDQMSQSAVTLQRLADNNEKHLQERHDISARKAACETEFARIEQEIHSQVGAYRDMQTKYEQKKRQYEKNESALYQAY
QYVQQARSKKDMLETMQGDFSGFYQGVKEVLKAKERLGGIRGAVLELISTEQKYETAIEIALGASAQHVVTDDEQSARKA
IQYLKQNSFGRATFLPLSVIRDRQLQSRDAETAARHSSFLGVASELVTFDPAYRSVIQNLLGTVLITEDLKGANELAKLL
GHRYRIVTLEGDVVNPGGSMTGGAVKKKNNSLLGRSRELEDVTKRLAEMEEKTALLEQEVKTLKHSIQDMEKKLADLRET
GEGLRLKQQDVKGQLYELQVAEKNINTHLELYDQEKSALSESDEERKVRKRKLEEELSAVSEKMKQLEEDIDRLTKQKQT
QSSTKESLSNELTELKIAAAKKEQACKGEEDNLARLKKELTETELALKEAKEDLSFLTSEMSSSTSGEEKLEEAAKHKLN
DKTKTIELIALRRDQRIKLQHGLDTYERELKEMKRLYKQKTTLLKDEEVKLGRMEVELDNLLQYLREEYSLSFEGAKEKY
QLETDPEEARKRVKLIKLAIEELGTVNLGSIDEFERVNERYKFLSEQKEDLTEAKNTLFQVIEEMDEEMTKRFNDTFVQI
RSHFDQVFRSLFGGGRAELRLTDPNDLLHSGVEIIAQPPGKKLQNLNLLSGGERALTAIALLFSILKVRPVPFCVLDEVE
AALDEANVFRFAQYLKKYSSDTQFIVITHRKGTMEEADVLYGVTMQESGVSKVISVKLEETKEFVQ
>B8GZ28 ~~~smc~~~Chromosome partition protein Smc~~~
MQFQRLRLSGFKSFVEPTEFRIEPGLTGIVGPNGCGKSNLLEALRWVMGANSAKAMRAGGMDDVIFAGSGARPARNHADV
TLTIDNADRTAPAQFNDDPILEVVRRIDRGEGSTYRINGREVRARDVQLLFADASTGANSPALVRQGQISELIGAKPQNR
RRILEEAAGVSGLHTRRHEAELRLRAAETNLSRLEDVARELETALNRLRREARQAEKYKRLSSEIRAVQGAVLYARWTEA
RDHLERTTSEATAAARLVEETARASAAAQVAITEAEAAMPPLREEATIAQAILGQLAIQKDRAEREAEAAAAEFERLSND
LSRIDADRAREAQAKDDAAAALARIAPELEEVRALVAAAPERGPELAAVAKAAEEARAAAEAAVEQLAARVAAEEAQGRA
AAARLSEAEARANRTNRALEQARAERAAVGPEVDPAAADARQRFANAEAALAAARAALEEAETARVKAAEQEAQARQLAR
SVEDQLGRLRTEARGLAQLTAPRSKSGHAPALDSVSPDKGYGAALAAALGDDLDAALDPKAPSYWGGAEAPAPVWPEGAE
PLAPLVKAPPALAARLSHVAVVTRANGDRLQKELKPGMRLVSKDGDLWRWDGFVARADAPRPAAVRLEQRTRLAEVEAEI
DVMAPRAEATTIALKAAADRLRAAEDLLRDKRRGPPDAERLLTQAREQVAKFEREQALRAARAQSLDDTIGRFEAEKVEA
DAALGEAREAHAAAQTSGDLQPQLAEARQAAAQAREAAGAARTALDVETRERAGRQRRLESLERDRADWSKRAEAAAKRA
ESLEGDRVKAAAALEAAREAPAALQEKLVALLDEFAAAEARRAKASDALETAETTRLNADRAARAAEQAAGEAREKRAAL
VAHLDGARQRFAEVASAIREQARMEPEELGRHVAGEAVAVPKDAAGVEAHLFALERERDAIGPVNLRAEEEAQEYAGRLE
TMRSERADLSGAVTKLRAGIEELNAEGRERLLAAFDVINANFQTLFQALFGGGQAELKLIESDDPLEAGLEIFACPPGKR
MASMSLMSGGEQALTASALIFGVFLANPAPICVLDEVDAPLDDANVDRYCNMLDEMRRRTQTRFIAITHNPVTMSRMDRL
FGVTMAERGVSQLVSVDLSTAEKLVAA
>P41508 ~~~smc~~~Chromosome partition protein Smc~~~
MLKLIKIEIEGFKSFADPISINFDGSVVGIVGPNGSGKSNINDAIRWVLGEQSAKQLRGLNMDDVIFAGSKTVKPQEKAM
VKLTFKNEDAIEETKQIFTISRLLKRGQGTNEYFYNDQPVRYKDIKNLAVESGISKSSLAIISQGTISEIAEATPEQRKA
VIEEAAGTSKYKLDKEEAQKKLIRTNDAIDKLQGAIKELERQVNSLDKQASKAKIYLEKSKALESVEVGLIVNDLNFFNE
KLNNLNTSLLEVEQQRNDLELNIQTYESSISQTVHFKTEVESSIQEITSKLDNLKNALSEINLQEARIEERRKLIISGEI
VVDQKTKIEEIKKQVESLKIQINASKQREIELDQQLTRLNAKANSLKLQENDINKEIGVLLEKKSAAAANINILKQQFEN
KSFLSKGIKTIKDNSFLFDGYIGLASELFKVESEFSLAIETVLGAALNQIVMKTSEDVLQAIDFLKKNLSGKATFIPLTS
IKEREVREDHLLVLKGQKGFLGVAKELIEFDTQFNKLFGFLLGNILVVDNVDNANRIAKILDHKYTIVSLEGDLFRPGGT
ITGGSKLERTSILNYDIKIKEHTNTLKFAEDQIHDLKIKQQTIYNEIETVNSTIQQVKIEANSINSKLNILNEELNNLKL
NASEIFKEQQEDQESLNLSFDSEKLNIEKQISTLTIELNSKKDRLTNLISEQGKGETKKQELDAKLRKLNTQHSDSITEQ
NRAKFLVEQNQKRLSEHYKLTLEAASEQYSLDLDIEQARHFVDSLKKELKELGNVNLEAITEFEEVNQRYQEKKQYIEEL
TTAKSKIEEAISDLDKIIINKTTEIVNLVNNEFNMVFQKMFGGGKAEIHFTDKNDILNSGVEISAQPPGKTIKNLRLFSG
GEKAIIAISLLFAILKARPIPLCILDEVEAALDESNVIRYVEFLKLLKENTQFLIITHRSGTMSRVDQLLGVTMQKRGVT
SIFSVELSKAKEMLKDELK
>P9WGF3 ~~~smc~~~Chromosome partition protein Smc~~~COG1196
MYLKSLTLKGFKSFAAPTTLRFEPGITAVVGPNGSGKSNVVDALAWVMGEQGAKTLRGGKMEDVIFAGTSSRAPLGRAEV
TVSIDNSDNALPIEYTEVSITRRMFRDGASEYEINGSSCRLMDVQELLSDSGIGREMHVIVGQGKLEEILQSRPEDRRAF
IEEAAGVLKHRKRKEKALRKLDTMAANLARLTDLTTELRRQLKPLGRQAEAAQRAAAIQADLRDARLRLAADDLVSRRAE
REAVFQAEAAMRREHDEAAARLAVASEELAAHESAVAELSTRAESIQHTWFGLSALAERVDATVRIASERAHHLDIEPVA
VSDTDPRKPEELEAEAQQVAVAEQQLLAELDAARARLDAARAELADRERRAAEADRAHLAAVREEADRREGLARLAGQVE
TMRARVESIDESVARLSERIEDAAMRAQQTRAEFETVQGRIGELDQGEVGLDEHHERTVAALRLADERVAELQSAERAAE
RQVASLRARIDALAVGLQRKDGAAWLAHNRSGAGLFGSIAQLVKVRSGYEAALAAALGPAADALAVDGLTAAGSAVSALK
QADGGRAVLVLSDWPAPQAPQSASGEMLPSGAQWALDLVESPPQLVGAMIAMLSGVAVVNDLTEAMGLVEIRPELRAVTV
DGDLVGAGWVSGGSDRKLSTLEVTSEIDKARSELAAAEALAAQLNAALAGALTEQSARQDAAEQALAALNESDTAISAMY
EQLGRLGQEARAAEEEWNRLLQQRTEQEAVRTQTLDDVIQLETQLRKAQETQRVQVAQPIDRQAISAAADRARGVEVEAR
LAVRTAEERANAVRGRADSLRRAAAAEREARVRAQQARAARLHAAAVAAAVADCGRLLAGRLHRAVDGASQLRDASAAQR
QQRLAAMAAVRDEVNTLSARVGELTDSLHRDELANAQAALRIEQLEQMVLEQFGMAPADLITEYGPHVALPPTELEMAEF
EQARERGEQVIAPAPMPFDRVTQERRAKRAERALAELGRVNPLALEEFAALEERYNFLSTQLEDVKAARKDLLGVVADVD
ARILQVFNDAFVDVEREFRGVFTALFPGGEGRLRLTEPDDMLTTGIEVEARPPGKKITRLSLLSGGEKALTAVAMLVAIF
RARPSPFYIMDEVEAALDDVNLRRLLSLFEQLREQSQIIIITHQKPTMEVADALYGVTMQNDGITAVISQRMRGQQVDQL
VTNSS
>B2FNJ0 ~~~smf-1~~~Major fimbrial subunit SMF-1~~~COG3539
MLAAAPLAANAADGTITFNGKVTDKTCTISTPGGKDFAVNLPTVSKNTLATAGAVAGRTPFAINLTKCSAGNVATYFEPG
STVDFNTGRLLNQASANAATNVQLQLLGSNNQVLPIKAAGAGLAQTNSQWVTVGTDGSADLNYYAEYYATAAATPGDVTS
SVKYTIIYN
>P0AGC7 ~~~ytjB~~~Probable inner membrane protein Smp~~~COG3726
MARTKLKFRLHRAVIVLFCLALLVALMQGASWFSQNHQRQRNPQLEELARTLARQVTLNVAPLMRTDSPDEKRIQAILDQ
LTDESRILDAGVYDEQGDLIARSGESVEVRDRLALDGKKAGGYFNQQIVEPIAGKNGPLGYLRLTLDTHTLATEAQQVDN
TTNILRLMLLLSLAIGVVLTRTLLQGKRTRWQQSPFLLTASKPVPEEEESEKKE
>Q06517 3.4.24.-~~~smp~~~Extracellular minor metalloprotease~~~
MPAQRMRSVIPPYMLRALLTRYAPQRDCALHTLNHVQSLLGNKPLRSPTEKNARAGERSAISTTPERHPTARQTGAQGGA
AQQPRRAVDEAYDHLGVTYDFFWQAYRRNSVDNKGLPLVQRALRQGLPEQLSGTASRWCSETATARSSTVSPSPSTLVGH
ELTHGSDRERSRLIYYQQSGALNESLSDVFGSLVKQFHLQQTADKADWLIGAGLLAKGIKGKGLRSMSAPGTAYDDPLLG
KDPQPASMKDYIQTKEDNGGVHLNSGIPNRAFYLAATVLGGFAGKKPVTSGMTRCATKRCRKTPTSDHLRPRHGETRAGL
RTKRGDKVQQAWASGWQWSNETAADAQSGYGH
>P76053 3.1.-.-~~~smrA~~~Probable DNA endonuclease SmrA~~~COG2840
MNLDDKSLFLDAMEDVQPLKRATDVHWHPTRNQRAPQRIDTLQLDNFLTTGFLDIIPLSQPLEFRREGLQHGVLDKLRSG
KYPQQASLNLLRQPVEECRKMVFSFIQQALADGLRNVLIIHGKGRDDKSHANIVRSYVARWLTEFDDVQAYCTALPHHGG
SGACYVALRKTAQAKQENWERHAKRSR
>Q1KLK1 2.8.3.22~~~smtA~~~Succinyl-CoA--L-malate CoA-transferase alpha subunit~~~
MPPTGEEPSGHAESKPPASDPMSTPGTGQEQLPLSGIRVIDVGNFLAGPYAASILGEFGAEVLKIEHPLGGDPMRRFGTA
TARHDATLAWLSEARNRKSVTIDLRQQEGVALFLKLVAKSDILIENFRPGTMEEWGLSWPVLQATNPGLIMLRVSGYGQT
GPYRRRSGFAHIAHAFSGLSYLAGFPGETPVLPGTAPLGDYIASLFGAIGILIALRHKEQTGRGQLIDVGIYEAVFRILD
EIAPAYGLFGKIREREGAGSFIAVPHGHFRSKDGKWVAIACTTDKMFERLAEAMERPELASPELYGDQRKRLAARDIVNQ
ITIEWVGSLTRDEVMRRCLEKEVPVGPLNSIADMFNDEHFLARGNFACIEAEGIGEVVVPNVIPRLSETPGRVTNLGPPL
GNATYEVLRELLDISAEEIKRLRSRKII
>Q1KLK0 2.8.3.22~~~smtB~~~Succinyl-CoA--L-malate CoA-transferase beta subunit~~~
MDGTTTTLPLAGIRVIDAATVIAAPFCATLLGEFGADVLKVEHPIGGDALRRFGTPTARGDTLTWLSESRNKRSVTLNLQ
HPEGARVFKELIAHSDVLCENFRPGTLEKWGLGWDVLSKINPRLIMLRVTGYGQTGPYRDRPGFARIAHAVGGIAYLAGM
PKGTPVTPGSTTLADYMTGLYGCIGVLLALRHREQTGRGQYIDAALYESVFRCSDELVPAYGMYRKVRERHGSHYNEFAC
PHGHFQTKDGKWVAISCATDKLFARLANAMGRPELASSSVYGDQKVRLAHASDVNEIVRDWCSSLTRAEVLERCYATATP
AAPLNDIADFFGDRHVHARRNLVAIDAEDLGETLIMPNVVPKLSETPGSIRSLGPKLGEHTEEVLKEILGMCDEQINDLR
SKRVI
>P9WMI5 ~~~smtB~~~HTH-type transcriptional repressor SmtB~~~COG0640
MVTSPSTPTAAHEDVGADEVGGHQHPADRFAECPTFPAPPPREILDAAGELLRALAAPVRIAIVLQLRESQRCVHELVDA
LHVPQPLVSQHLKILKAAGVVTGERSGREVLYRLADHHLAHIVLDAVAHAGEDAI
>P30340 ~~~smtB~~~Transcriptional repressor SmtB~~~COG0640
MTKPVLQDGETVVCQGTHAAIASELQAIAPEVAQSLAEFFAVLADPNRLRLLSLLARSELCVGDLAQAIGVSESAVSHQL
RSLRNLRLVSYRKQGRHVYYQLQDHHIVALYQNALDHLQECR
>D0ZXQ3 ~~~smvA~~~Methyl viologen resistance protein SmvA~~~
MFRQWLTLVIIVLVYIPVAIDATVLHVAAPTLSMTLGASGNELLWIIDIYSLVMAGMVLPMGALGDRIGFKRLLMLGGTL
FGLASLAAAFSHTASWLIATRVLLAIGAAMIVPATLAGIRATFCEEKHRNMALGVWAAVGSGGAAFGPLIGGILLEHFYW
GSVFLINVPIVLVVMGLTARYVPRQAGRRDQPLNLGHAVMLIIAILLLVYSAKTALKGHLSLWVISFTLLTGALLLGLFI
RTQLATSRPMIDMRLFTHRIILSGVVMAMTAMITLVGFELLMAQELQFVHGLSPYEAGVFMLPVMVASGFSGPIAGVLVS
RLGLRLVATGGMALSALSFYGLAMTDFSTQQWQAWGLMALLGFSAASALLASTSAIMAAAPAEKAAAAGAIETMAYELGA
GLGIAIFGLLLSRSFSASIRLPAGLEAQEIARASSSMGEAVQLANSLPPTQGQAILDAARHAFIWSHSVALSSAGSMLLL
LAVGMWFSLAKAQRR
>O34350 2.3.1.80~~~snaA~~~S-alkylcysteine N-acetyltransferase~~~COG0454
MSDDIFRLATVEDASELLKLVNSAFQPIRQLDIDWPSTRADIQMVSENIEHHSAIVLERDGKLISTITIRFPWESETPPS
KYPFVWWFATLPEYKGQGAGSKLLTYVEEKVLRDMLKAPALTLGTSARKHPWLADMYRRRGYEVYFEQEKDGDIGVMMHK
VLIPERFNPTLLGAPSWA
>P54991 ~~~snaA~~~Pristinamycin IIA synthase subunit A~~~
MTAPRRRITLAGIIDGPGGHVAAWRHPATKADAQLDFEFHRDNARTLERGLFDAVFIADIVAVWGTRLDSLCRTSRTEHF
EPLTLLAAYAAVTEHIGLCATATTTYNEPAHIAARFASLDHLSGGRAGWNVVTSAAPWESANFGFPEHLEHGKRYERAEE
FIDVVKKLWDSDGRPVDHRGTHFEAPGPLGIARPPQGRPVIIQAGSSPVGREFAARHAEVIFTRHNRLSDAQDFYGDLKA
RVARHGRDPEKVLVWPTLAPIVAATDTEAKQRLQELQDLTHDHVALRTLQDHLGDVDLSAYPIDGPVPDIPYTNQSQSTT
ERLIGLARRENLSIRELALRLMGDIVVGTPEQLADHMESWFTGRGADGFNIDFPYLPGSADDFVDHVVPELQRRGLYRSG
YEGTTLRANLGIDAPRKAGAAA
>P54993 ~~~snaB~~~Pristinamycin IIA synthase subunit B~~~
MTAPILVATLDTRGPAATLGTITRAVRAAEAAGFDAVLIDDRAAAGVQGRFETTTLTAALAAVTEHIGLITAPLPADQAP
YHVSRITASLDHLAHGRTGWLASTDTTDPEGRTGELIDVVRGLWDSFDDDAFVHDRADGLYWRLPAVHQLDHQGRHFDVA
GPLNVARPPQGHPVVAVTGPALAAAADLVLLDEAADAASVKQQAPHAKILLPLPGPAAELPADSPADGFTVALTGSDDPV
LAALAARPGRPDRTAATTLRERLGLARPESRHALTTA
>P54994 ~~~snaC~~~NADH:riboflavin 5'-phosphate oxidoreductase~~~
MTGADDPARPAVGPQSFRDAMAQLASPVTVVTVLDAAGRRHGFTAGSVVSVSLDPPLVMVGIALTSSCHTAMAAAAEFCV
SILGEDQRAVAKRCATHGADRFAGGEFAAWDGTGVPYLPDAKVVLRCRTTDVVRAGDHDLVLGTPVEIRTGDPAKPPLLW
YRRDFHTPTPTTPALA
>O34980 3.5.1.-~~~sndA~~~N-acetyl-L-cysteine deacetylase~~~COG1473
MSLDYWRNIEGSYPYQTTGNDILTLKEESNPVNLSTLEKQLIGIRRHLHQYPELSKEEFETTAFIKKCLKEKGIQIRPTA
LKTGVFADIAGESEGPAIALRADIDALPIEEKTGLPYASKHKGIMHACGHDFHTAALLGAAFLLKENQDSLKGKIRLLFQ
PAEEAGAGATKVIEDGQLDGIDAVIGLHNKPDIAVGTVGLKTGPLMAAVDRFKVEIEGKGAHAALPHNGFDPIIGASQLI
VALQTIVSRNVNPLQSAILTVGKINGGSTWNVIPDTVVIEGTVRTFDSEVRNQVKQRFFAVTEQISAAFSLKANVKWHSG
PPPLCNDEAITGLVRDAAHKAKLQVIDPAPSTAGEDFAYYLEHIPGSFAFFGTDGDHDWHHPAFTIDETAIIKASYFLYE
SAKRLLDSNEESKISD
>O54259 1.13.12.22~~~snoaB~~~Deoxynogalonate monooxygenase~~~
MPTRVNDGVDADEVTFVNRFTVHGGPAEFESVFARTAAFFARQPGFVRHTLLRERDKDNSYVNIAVWTDHDAFRRALAQP
GFLPHATALRALSTSEHGLFTARQTLPEGGDTTGSGHR
>Q9RN59 5.5.1.26~~~snoaL~~~Nogalonic acid methyl ester cyclase~~~
MVSAFNTGRTDDVDEYIHPDYLNPATLEHGIHTGPKAFAQLVGWVRATFSEEARLEEVRIEERGPWVKAYLVLYGRHVGR
LVGMPPTDRRFSGEQVHLMRIVDGKIRDHRDWPDFQGTLRQLGDPWPDDEGWRP
>P0A3Z8 3.4.24.77~~~snpA~~~Extracellular small neutral protease~~~
MRITLPLLSTAVGLGLTAAVLGTGPAATAAAPQEPVRAAQLGYQPSAGSGEDAAANRAFFEAVVKSVAEKRAANPSAAAA
VTVYYSATNAPSFRSQISRSAQIWNSSVSNVRLAESSSGADFAYYEGNDSRGSYASTDGHGSGYIFLDYRQNQQYDSTRV
TAHETGHVLGLPDHYSGPCSELMSGGGPGPSCTNPYPNSTERSRVNQLWAYGFQAALDKALEKASQR
>P56406 3.4.24.77~~~snpA~~~Extracellular small neutral protease~~~
TVTVTYDPSNAPSFQQEIANAAQIWNSSVRNVQLRAGGNADFSYYEGNDSRGSYAQTDGHGRGYIFLDYQQNQQYDSTRV
TAHETGHVLGLPDHYQGPCSELMSGGGPGPSCTNPYPNAQERSRVNALWANG
>P43163 3.4.24.77~~~snpA~~~Extracellular small neutral protease~~~
MRMPLSVLTAAGLSLATLGLGTAGPASATPTAEGAPVVAYDGSPSAGSPADAKAEAAANRAFFEAVLRSVAEKRAANPKS
TAAVTVVYNASGAPSFATQIARGTQIWNSSVSNVRLQAGSSGVDFTYREGNDPRGSYASTDGHGRGYIFLDYRQNQTYDS
TRVTAHETGHVLGLPDHYSGPCSELMSGGGPGPSCTNAYPNSAERSRVNQLWANGFAAAMDKALEKSAR
>A0A0H3CCP8 ~~~socA~~~Antitoxin SocA~~~
MPPLTQDPQSVDARAVANLLLDKAAALDIPISNLALQKLLYFAHGRFLVDKGRPLVNGFFEAWKFGPVHPVVYRCFSANG
PKYIINRAIKKDILSGLHIIVSPPRDQDIHEGIERVLLTMGRMSASQLVAVSHASGGPWDVIANGPGTNLGLGLRICDKV
IKDRFRFQKVSVSVPPGLGDTLEEAPPS
>E2FZM4 ~~~socA~~~Uncharacterized protein SocA~~~
MHRRARRMPMRPRRSKRVRNRYTMGTFALHGLTHRLPSASLQTTAARHPDVTQFSMPGHYR
>A0A0H3CC30 ~~~socB~~~DNA replication inhibitor toxin SocB~~~
MKKHRLPDTELARIAPLAPDARRKALLKFKNGFPDFSYEPTRRRLPELTNAQPSLLSLGDTEWSKIESGLKRLKNEKEAA
SNIEVAELLYNFIREEKYIAVMEPFGKLQLGAGVAISYWSDAIFFGPDGPTIFGFDFRRAGGFNDSARRFAFSAQHEHIR
QRGDDYATAKLGLVQFPALRNGTRKVRVEFADQVELIPYDELIQMARETYSVWFEILEQREDEARKTGTGGSWWDGD
>E2FZM5 ~~~socB~~~Uncharacterized protein SocB~~~
MWTLKARKEHTGISGKPTARTDRHGSTRSGDSELQASARRFSRLPDRCGAQGVT
>E8XDJ8 1.15.1.1~~~sodC1~~~Superoxide dismutase [Cu-Zn] 1~~~
MKYTILSLVAGALISCSAMAENTLTVKMNDALSSGTGENIGEITVSETPYGLLFTPHLNGLTPGIHGFHVHTNPSCMPGM
KDGKEVPALMAGGHLDPEKTGKHLGPYNDKGHLGDLPGLVVNADGTATYPLLAPRLKSLSELKGHSLMIHKGGDNYSDKP
APLGGGGARFACGVIEK
>P0CW86 1.15.1.1~~~sodC1~~~Superoxide dismutase [Cu-Zn] 1~~~
MKYTILSLVAGALISCSAMAENTLTVKMNDALSSGTGENIGEITVSETPYGLLFTPHLNGLTPGIHGFHVHTNPSCMPGM
KDGKEVPALMAGGHLDPEKTGKHLGPYNDKGHLGDLPGLVVNADGTATYPLLAPRLKSLSELKGHSLMIHKGGDNYSDKP
APLGGGGARFACGVIEK
>P24702 1.15.1.1~~~sodC~~~Superoxide dismutase [Cu-Zn]~~~
MKLTNLALAFTLFGASAVAFAHADHDHKKADNSSVEKLVVQVQQLDPVKGNKDVGTVEITESAYGLVFTPHLHGLAQGLH
GFHIHQNPSCEPKEKDGKLVAGLGAGGHWDPKETKQHGYPWSDNAHLGDLPALFVEHDGSATNPVLAPRLKKLDEVKGHS
LMIHEGGDNHSDHPAPLGGGGPRMACGVIK
>P15453 1.15.1.1~~~sodC~~~Superoxide dismutase [Cu-Zn]~~~
MKSLFIASTMVLMAFPAFAESTTVKMYEALPTGPGKEVGTVVISEAPGGLHFKVNMEKLTPGYHGFHVHENPSCAPGEKD
GKIVPALAAGGHYDPGNTHHHLGPEGDGHMGDLPRLSANADGKVSETVVAPHLKKLAEIKQRSLMVHVGGDNYSDKPEPL
GGGGARFACGVIE
>P20379 1.15.1.1~~~sodC~~~Superoxide dismutase [Cu-Zn]~~~COG2032
MIRLSAAAALGLAAALAASPALAQTSATAVVKAGDGKDAGAVTVTEAPHGVLLKLELKGLTPGWHAAHFHEKGDCGTPDF
KSAGAHVHTAATTVHGLLNPDANDSGDLPNIFAAADGAATAEIYSPLVSLKGAGGRPALLDADGSSIVVHANPDDHKTQP
IGGAGARVACGVIK
>P0AGD1 1.15.1.1~~~sodC~~~Superoxide dismutase [Cu-Zn]~~~COG2032
MKRFSLAILALVVATGAQAASEKVEMNLVTSQGVGQSIGSVTITETDKGLEFSPDLKALPPGEHGFHIHAKGSCQPATKD
GKASAAESAGGHLDPQNTGKHEGPEGAGHLGDLPALVVNNDGKATDAVIAPRLKSLDEIKDKALMVHVGGDNMSDQPKPL
GGGGERYACGVIK
>Q59452 1.15.1.1~~~sodC~~~Superoxide dismutase [Cu-Zn]~~~COG2032
MKLTKVALFSLGLFGFSSMALAHGDHMHNHDTKMDTMSKDMMSMEKIVVPVQQLDPQNGNKDVGTVEITESAYGLVFTPK
LHDLAHGLHGFHIHEKPSCEPKEKDGKLVAGLGAGGHWDPKQTQKHGYPWSDDAHMGDLPALFVMHDGSATTPVLAPRLK
KLAEVKGHSLMIHAGGDNHSDHPAPLGGGGPRMACGVIK
>P25841 ~~~sodC~~~Superoxide dismutase [Cu-Zn]-like~~~
MMKMKTLLALAISGICAAGVANAHDHMAKPAGPSIEVKVQQLDPANGNKDVGTVTITESNYGLVFTPNLQGLAEGLHGFH
IYENPSCEPKEKDGKLIAGLAAGGHWDSKGAKQHGYPWQDDAHLGDLPALTVLHDGTATNPVLAPRLKKLDEVRGHSIMI
HAGGDNHSDHPAPLGGGGPRMACGVIK
>P25842 1.15.1.1~~~sodC~~~Superoxide dismutase [Cu-Zn]~~~
MMKMKTLLALAISGICAAGVANAHDHMAKPAGPSIEVKVQQLDPANGNKDVGTVTITESNYGLVFTPNLQGLAEGLHGFH
IHENPSCDPKEKDGKLTSGLAAGGHWDPKGAKQHGYPWQDDAHLGDLPALTVLHDGTATNPVLAPRLKKLDEVRGHSIMI
HAGGDNHSDHPAPLGGGGPRMACGVIK
>P0A609 1.15.1.1~~~sodC~~~Superoxide dismutase [Cu-Zn]~~~
MPKPADHRNHAAVSTSVLSALFLGAGAALLSACSSPQHASTVPGTTPSIWTGSPAPSGLSGHDEESPGAQSLTSTLTAPD
GTKVATAKFEFANGYATVTIATTGVGKLTPGFHGLHIHQVGKCEPNSVAPTGGAPGNFLSAGGHYHVPGHTGTPASGDLA
SLQVRGDGSAMLVTTTDAFTMDDLLSGAKTAIIIHAGADNFANIPPERYVQVNGTPGPDETTLTTGDAGKRVACGVIGSG
>P9WGE9 1.15.1.1~~~sodC~~~Superoxide dismutase [Cu-Zn]~~~COG2032
MPKPADHRNHAAVSTSVLSALFLGAGAALLSACSSPQHASTVPGTTPSIWTGSPAPSGLSGHDEESPGAQSLTSTLTAPD
GTKVATAKFEFANGYATVTIATTGVGKLTPGFHGLHIHQVGKCEPNSVAPTGGAPGNFLSAGGHYHVPGHTGTPASGDLA
SLQVRGDGSAMLVTTTDAFTMDDLLSGAKTAIIIHAGADNFANIPPERYVQVNGTPGPDETTLTTGDAGKRVACGVIGSG
>P57005 1.15.1.1~~~sodC~~~Superoxide dismutase [Cu-Zn]~~~
MNMKTLLALAVSAVCSVSVAQAHEHNTIPKGASIEVKVQQLDPVNGNKDVGTVTITESNYGLVFTPDLQGLSEGLHGFHI
HENPSCEPKEKEGKLTAGLGAGGHWDPKGAKQHGYPWQDDAHLGDLPALTVLHDGTATNPVLAPRLKHLDDVRGHSIMIH
TGGDNHSDHPAPLGGGGPRMACGVIK
>Q59623 1.15.1.1~~~sodC~~~Superoxide dismutase [Cu-Zn]~~~
MNMKTLLALAVSAVCSVGVAQAHEHNTIPKGASIEVKVQQLDPVNGNKDVGTVTITESNYGLVFTPDLQGLSEGLHGFHI
HENPSCEPKEKEGKLTAGLGAGGHWDPKGAKQHGYPWQDDAHLGDLPALTVLHDGTATNPVLAPRLKHLDDVRGHSIMIH
TGGDNHSDHPAPLGGGGPRMACGVIK
>P00446 1.15.1.1~~~sodC~~~Superoxide dismutase [Cu-Zn]~~~
MNKAKTLLFTALAFGLSHQALAQDLTVKMTDLQTGKPVGTIELSQNKYGVVFTPELADLTPGMHGFHIHQNGSCASSEKD
GKVVLGGAAGGHYDPEHTNKHGFPWTDDNHKGDLPALFVSANGLATNPVLAPRLTLKELKGHAIMIHAGGDNHSDMPKAL
GGGGARVACGVIQ
>Q9X6W9 1.15.1.1~~~sodB~~~Superoxide dismutase [Fe]~~~
MGVHKLEPKDHLKPQNLEGISNEQIEPHFEAHYKGYVAKYNEIQEKLADQNFADRSKANQNYSEYRELKVEETFNYMGVV
LHELYFGMLTPGGKGEPSEALKKKIEEDIGGLDACTNELKAAAMAFRGWAILGLDIFSGRLVVNGLDAHNVYNLTGLIPL
IVIDTYEHAYYVDYKNKRPPYIDAFFKNINWDVVNERFEKAMKAYEALKDFIK
>P53638 1.15.1.1~~~sodB~~~Superoxide dismutase [Fe]~~~
MTYEMPKLPYANNALEPVISQQTIDYHYGKHLQTYVNNLNSLVPGTEYEGKTVEAIVASAPDGAIFNNAGQVLNHTLYFL
QFAPKPAKNEPAGKLGEAIKRDFGSFENFKKEFNAASVGLFGSGWAWLSVDKDGKLHITKEPNGSNPVRAGLKPLLGFDV
WEHAYYLDYQNRRADHVNKLWEIIDWDVVEKRL
>D3KVM5 1.15.1.1~~~~~~Superoxide dismutase [Fe]~~~
MEHTLPPLPFDKNALAPHMSEETLEYHYGKHHQTYVTNLNKLIPGTEFENLSLEEIVKKSSGGVFNNSAQVWNHTFFWNS
LSPKGGGAPTGALADAINAKYGSFDKFKEEFAKVATGTFGSGWTWLVKKTDGTVDIVSTSNAATPLTTDAKALLTIDVWE
HAYYIDYRNARPKFIEAYWNIANWDFAAKNFGA
>P19685 1.15.1.1~~~sodB~~~Superoxide dismutase [Fe]~~~COG0605
MAFELPDLPYKLNALEPHISQETLEYHHGKHHRAYVNKLNKLIEGTPFEKEPLEEIIRKSDGGIFNNAAQHWNHTFYWHC
MSPDGGGDPSGELASAIDKTFGSLEKFKALFTDSANNHFGSGWAWLVKDNNGKLEVLSTVNARNPMTEGKKPLMTCDVWE
HAYYIDTRNDRPKYVNNFWQVVNWDFVMKNFKS
>P0AGD3 1.15.1.1~~~sodB~~~Superoxide dismutase [Fe]~~~COG0605
MSFELPALPYAKDALAPHISAETIEYHYGKHHQTYVTNLNNLIKGTAFEGKSLEEIIRSSEGGVFNNAAQVWNHTFYWNC
LAPNAGGEPTGKVAEAIAASFGSFADFKAQFTDAAIKNFGSGWTWLVKNSDGKLAIVSTSNAGTPLTTDATPLLTVDVWE
HAYYIDYRNARPGYLEHFWALVNWEFVAKNLAA
>P43312 1.15.1.1~~~sodB~~~Superoxide dismutase [Fe]~~~COG0605
MFTLRELPFAKDSMGDFLSPVAFDFHHGKHHQTYVNNLNNLIKGTDFEKSSLFDILTKSSGGVFNNAAQIYNHDFYWDCL
SPKATALSDELKGALEKDFGSLEKFKEDFIKSATTLFGSGWNWAAYNLDTQKIEIIQTSNAQTPVTDKKVPLLVVDVWEH
AYYIDHKNARPVYLEKFYGHINWHFVSQCYEWAKKEGLGSVDYYINELVHKKA
>P50061 1.15.1.1~~~sodB~~~Superoxide dismutase [Fe]~~~
MAFTQPPLPFPKDALEPYGMKAETFDYHYGKHHAAYVTNLNKLVEGTPMESLSLEDVIKQSFGDSSKVGVFNNAAQVWNH
TFFWNCLKAGGGGAPTGELAAKIDAAFGSLDKFKEEFSNAAATQFGSGWAWLVDDGGTLKVTKTPNAENPLVHGQKPLLT
LDVWEHAYYLDFQNARPAFIKNFLDNLVNWDFVAQNLAA
>P23744 1.15.1.1~~~sodB~~~Superoxide dismutase [Mn/Fe]~~~
AYTLPPLDYAYTALEPHIDAQTMEIHHTKHHQTYINNVNAALEGTSFANEPVEALLQKLDSLPENLRGPVRNNGGGHANH
SLFWKVLTPNGGGEPKGALADAIKSDIGGLDTFKEAFTKAALTRFGSGWAWLSVTPEKKLVVESTGNQDSPLSTGNTPIL
GLDVWEHAYYLKYQNRRPEYIGAFFNVVNWDEVSRRYQEALA
>P9WGE7 1.15.1.1~~~sodB~~~Superoxide dismutase [Fe]~~~COG0605
MAEYTLPDLDWDYGALEPHISGQINELHHSKHHATYVKGANDAVAKLEEARAKEDHSAILLNEKNLAFNLAGHVNHTIWW
KNLSPNGGDKPTGELAAAIADAFGSFDKFRAQFHAAATTVQGSGWAALGWDTLGNKLLIFQVYDHQTNFPLGIVPLLLLD
MWEHAFYLQYKNVKVDFAKAFWNVVNWADVQSRYAAATSQTKGLIFG
>P09213 1.15.1.1~~~sodB~~~Superoxide dismutase [Fe]~~~
AFELPALPFAMNALEPHISQETLEYHYGKHHNTYVVKLNGLVEGTELAEKSLEEIIKTSTGGVFNNAAQVWNHTFYWNCL
APNAGGEPTGEVAAAIEKAFGSFAEFKAKFTDSAINNFGSSWTWLVKNANGSLAIVNTSNAGCPITEEGVTPLLTVDLWE
HAYYIDYRNLRPSYMDGFWALVNWDFVSKNLAA
>P19665 1.15.1.1~~~sodB~~~Superoxide dismutase [Mn/Fe]~~~COG0605
MTHELISLPYAVDALAPVISKETVEFHHGKHLKTYVDNLNKLIIGTEFENADLNTIVQKSEGGIFNNAGQTLNHNLYFTQ
FRPGKGGAPKGKLGEAIDKQFGSFEKFKEEFNTAGTTLFGSGWVWLASDANGKLSIEKEPNAGNPVRKGLNPLLGFDVWE
HAYYLTYQNRRADHLKDLWSIVDWDIVESRY
>P53641 1.15.1.1~~~sodB~~~Superoxide dismutase [Fe]~~~
MAFELPPLPYEKNALEPHISAETLEYHHDKHHNTYVVNLNNLIPGTEFEGKSLEEIVKSSSGGIFNNAAQVWNHTFYWNC
LSPNGGGQPTGALADAINAAFGSFDKFKEEFTKTSVGTFGSGWGWLVKKADGSLALASTIGAGNPLTSGDTPLLTCDVWE
HAYYIDYRNLRPKYVEAFWNLVNWDFVAKNFAA
>P09223 1.15.1.1~~~sodB~~~Superoxide dismutase [Fe]~~~COG0605
MAFELPPLPYAHDALQPHISKETLEFHHDKHHNTYVVNLNNLVPGTEFEGKTLEEIVKTSSGGIFNNAAQVWNHTFYWNC
LAPNAGGQPTGALADAINAAFGSFDKFKEEFTKTSVGTFGSGWGWLVKKADGSLALASTIGAGCPLTSGDTPLLTCDVWE
HAYYIDYRNLRPKYVEAFWNLVNWAFVAEQFEGKTFKA
>P84612 1.15.1.1~~~sodB~~~Superoxide dismutase [Fe]~~~COG0605
AFELPSLPYAIDALEPHISKETLEFHHGKHHNTYVVKLNGLIPGTKFENKSLEEIVCSSDGGVFNNAAQIWNHTFYWNSL
SPNGGGAPTGAVADAINAKWGSFDAFKEALNDKAVNNFGSSWTWLVKLADGSLDIVNTSNAATPLTDDGVTPILTVDLWE
HAYYIDYRNVRPDYLKGFWSLVNWEFANANFA
>Q9XD74 1.15.1.1~~~sodB~~~Superoxide dismutase [Mn]~~~COG0605
MAFELPNLPYDYDALAPYMSRETLEYHHDKHHLAYVTNGNKLAEEAGLSDLSLEDIVKKSYGTNQPLFNNAGQHYNHVHF
WKWMKKGGGGTSLPGKLDAAIKSDLGGYDKFRADFSAAGAGQFGSGWAWLSVKNGKLEISKTPNGENPLVHGATPILGVD
VWEHSYYIDYRNARPKYLEAFVDNLINWDYVLELYEAAAK
>O30970 1.15.1.1~~~sodB~~~Superoxide dismutase [Fe]~~~
MAFELPALPYAHDALASLGMSKETLEYHHDLHHKAYVDNGNKLIAGTEWEGKSVEEIVKGTYCAGAVAQSGIFNNASQHW
NHAQFWEMMGPGEDKKMPGALEKALVESFGSVAKFKEDFAAAGAGQFGSGWAWLVKDSDGALKITKTENGVNPLCFGQTA
LLGCDVWEHSYYIDFRNKRPAYLTNFLDKLVNWENVASRM
>O51917 1.15.1.1~~~sodF1~~~Superoxide dismutase [Fe-Zn] 1~~~COG0605
MSVYTLPELPYDYSALAPVISPEIIELHHDKHHAAYVKGANDTLEQLAEARDKETWGSINGLEKNLAFHLSGHILHSIYW
HNMTGDGGGEPLDKDGVGELADAIAESFGSFAGFRAQLTKAAATTQGSGWGVLAYEPLSGRLIVEQIYDHQGNVGQGSTP
ILVFDAWEHAFYLQYKNQKVDFIDAMWAVVNWQDVARRYEAAKSRTNTLLLAP
>P77968 1.15.1.1~~~sodB~~~Superoxide dismutase [Fe]~~~COG0605
MAYALPNLPYDYTALEPCISKSTLEFHHDKHHAAYVNNFNNAVAGTDLDNQSIEDVIKAVAGDASKAGIFNNAAQAWNHS
FYWNCMKPGGGGQPSGALADKINADFGSFDAFVEAFKQAGATQFGSGWAWLVLDNGTLKVTKTGNAENPMTAGQTPLLTM
DVWEHAYYLDYQNRRPDYIADFLGKLVNWDFVAANLAAA
>Q81LW0 1.15.1.1~~~sodA1~~~Superoxide dismutase [Mn] 1~~~COG0605
MAKHELPNLPYAYDALEPHFDKETMNIHHTKHHNTYITNLNAALEGHAELADKSVEELVANLNEVPEAIRTAVRNNGGGH
ANHTFFWTILSPNGGGQPVGELATAIEAKFGSFDAFKEEFAKAGATRFGSGWAWLVVNNGELEVTSTPNQDSPLTEGKTP
VIGLDVWEHAYYLNYQNRRPDYIGAFWNVVDWNAAEKRYQEAK
>P50058 1.15.1.1~~~sodA1~~~Superoxide dismutase [Mn] 1~~~
MQTTFRRILILFVGLLVPLFFACQSNSQVDAAPSAAPQLSASPAKLDPLPYDYAALEPYIDAQTMRLHHDKHHATYVNNI
NETLKAYPDLQKQSVDSLIQNLNQVPEAIRTKIRNNGGGHVNHTMFWQIMAPKAGGTPTGAVAKAIDQTFGSFDAFKQQF
NKAGADRFGSGWAWLVSDRQGKLSITSTANQDNPLMSNPNAYPILGNDVWEHAYYLKYQNRRAEYLTNWWNVVNWQAVNQ
RYAQAQRK
>P0A0J3 1.15.1.1~~~sodA~~~Superoxide dismutase [Mn] 1~~~COG0605
MAFELPKLPYAFDALEPHFDKETMEIHHDRHHNTYVTKLNAAVEGTDLESKSIEEIVANLDSVPANIQTAVRNNGGGHLN
HSLFWELLSPNSEEKGTVVEKIKEQWGSLEEFKKEFADKAAARFGSGWAWLVVNNGQLEIVTTPNQDNPLTEGKTPILGL
DVWEHAYYLKYQNKRPDYIGAFWNVVNWEKVDELYNATK
>P99098 1.15.1.1~~~sodA~~~Superoxide dismutase [Mn/Fe] 1~~~
MAFELPKLPYAFDALEPHFDKETMEIHHDRHHNTYVTKLNAAVEGTDLESKSIEEIVANLDSVPANIQTAVRNNGGGHLN
HSLFWELLSPNSEEKGTVVEKIKEQWGSLEEFKKEFADKAAARFGSGWAWLVVNNGQLEIVTTPNQDNPLTEGKTPILGL
DVWEHAYYLKYQNKRPDYIGAFWNVVNWEKVDELYNATK
>Q81JK8 1.15.1.1~~~sodA2~~~Superoxide dismutase [Mn] 2~~~COG0605
MSSFQLPKLSYDYDELEPYIDSNTLSIHHGKHHATYVNNLNAALENYSELHNKSLEELLCNLETLPKEIVTAVRNNGGGH
YCHSLFWEVMSPRGGGEPNGDVAKVIDYYFNTFDNLKDQLSKAAISRFGSGYGWLVLDGEELSVMSTPNQDTPLQEGKIP
LLVIDVWEHAYYLKYQNRRPEFVTNWWHTVNWDRVNEKYLQAIQSQKH
>P50059 1.15.1.1~~~sodA2~~~Superoxide dismutase [Mn] 2~~~
MAFELKPLPYAYDALEPYIDATTMQLHHDKHHAAYVNNLNAAIEKYSDLQSMSVEDLVTHLDRVPEDVRTTVRNNAGGHV
NHTMFWEIMGANGSGAPTGAISEAINNSFGSFDAFKQQFNDAGTKRFGSGWVWLVRSQQGDLQILSTPNQDSPLIEGHTP
IMGNDVWEHAYYLKYQNRRPEYLNAWWNVLNWEEINRRFDAAMSGH
>Q2G261 1.15.1.1~~~sodM~~~Superoxide dismutase [Mn/Fe] 2~~~COG0605
MAFKLPNLPYAYDALEPYIDQRTMEFHHDKHHNTYVTKLNATVEGTELEHQSLADMIANLDKVPEAMRMSVRNNGGGHFN
HSLFWEILSPNSEEKGGVIDDIKAQWGTLDEFKNEFANKATTLFGSGWTWLVVNDGKLEIVTTPNQDNPLTEGKTPILLF
DVWEHAYYLKYQNKRPDYMTAFWNIVNWKKVDELYQAAK
>P66831 1.15.1.1~~~sodM~~~Superoxide dismutase [Mn/Fe] 2~~~
MAFKLPNLPYAYDALEPYIDQRTMEFHHDKHHNTYVTKLNATVEGTELEHQSLADMIANLDKVPEAMRMSVRNNGGGHFN
HSLFWEILSPNSEEKGGVIDDIKAQWGTLDEFKNEFANKATTLFGSGWTWLVVNDGKLEIVTTPNQDNPLTEGKTPILLF
DVWEHAYYLKYQNKRPDYMTAFWNIVNWKKVDELYQAAK
>P54375 1.15.1.1~~~sodA~~~Superoxide dismutase [Mn]~~~COG0605
MAYELPELPYAYDALEPHIDKETMTIHHTKHHNTYVTNLNKAVEGNTALANKSVEELVADLDSVPENIRTAVRNNGGGHA
NHKLFWTLLSPNGGGEPTGALAEEINSVFGSFDKFKEQFAAAAAGRFGSGWAWLVVNNGKLEITSTPNQDSPLSEGKTPI
LGLDVWEHAYYLNYQNRRPDYISAFWNVVNWDEVARLYSEAK
>P17550 1.15.1.1~~~chrC~~~Superoxide dismutase [Fe]~~~
MLYEMKPLGCEPAKLTGLSEKLIFSHYENNYGGAVKRLNAITATLAELDMATAPVFTLNGLKREELIATNSMILHEVYFD
SLGDGGSLDGALKTAIERDFGSVERWQAEFTAMGKALGGGSGWVLLTYSPRDGRLVNQWASDHAHTLAGGTPVLALDMYE
HSYHMDYGAKAAAYVDAFMQNIHWQRAATRFAAAVRD
>Q9RUV2 1.15.1.1~~~sodA~~~Superoxide dismutase [Mn]~~~COG0605
MAYTLPQLPYAYDALEPHIDARTMEIHHTKHHQTYVDNANKALEGTEFADLPVEQLIQQLDRVPADKKGALRNNAGGHAN
HSMFWQIMGQGQGQNGANQPSGELLDAINSAFGSFDAFKQKFEDAAKTRFGSGWAWLVVKDGKLDVVSTANQDNPLMGEA
IAGVSGTPILGVDVWEHAYYLNYQNRRPDYLAAFWNVVNWDEVSKRYAAAK
>P00448 1.15.1.1~~~sodA~~~Superoxide dismutase [Mn]~~~COG0605
MSYTLPSLPYAYDALEPHFDKQTMEIHHTKHHQTYVNNANAALESLPEFANLPVEELITKLDQLPADKKTVLRNNAGGHA
NHSLFWKGLKKGTTLQGDLKAAIERDFGSVDNFKAEFEKAAASRFGSGWAWLVLKGDKLAVVSTANQDSPLMGEAISGAS
GFPIMGLDVWEHAYYLKFQNRRPDYIKEFWNVVNWDEAAARFAAKK
>P00449 1.15.1.1~~~sodA~~~Superoxide dismutase [Mn]~~~
MPFELPALPYPYDALEPHIDKETMNIHHTKHHNTYVTNLNAALEGHPDLQNKSLEELLSNLEALPESIRTAVRNNGGGHA
NHSLFWTILSPNGGGEPTGELADAINKKFGSFTAFKDEFSKAAAGRFGSGWAWLVVNNGELEITSTPNQDSPIMEGKTPI
LGLDVWEHAYYLKYQNRRPEYIAAFWNVVNWDEVAKRYSEAKAK
>P43725 1.15.1.1~~~sodA~~~Superoxide dismutase [Mn]~~~COG0605
MSYTLPELGYAYNALEPHFDAQTMEIHHSKHHQAYVNNANAALEGLPAELVEMYPGHLISNLDKIPAEKRGALRNNAGGH
TNHSLFWKSLKKGTTLQGALKDAIERDFGSVDAFKAEFEKAAATRFGSGWAWLVLTAEGKLAVVSTANQDNPLMGKEVAG
CEGFPLLGLDVWEHAYYLKFQNRRPDYIKEFWNVVNWDFVAERFEQKTAHSNCAK
>P53647 1.15.1.1~~~sodA~~~Superoxide dismutase [Mn]~~~COG0605
MAEYTLPDLDWDYAALEPHISGQINEIHHTKHHATYVKGVNDALAKLEEARANEDHAAIFLNEKNLAFHLGGHVNHSIWW
KNLSPDGGDKPTGELAAAIDDAFGSFDKFRAQFSAAANGLQGSGWAVLGYDTVGSRLLTFQLYDQQANVPLGIIPLLQVD
MWEHAFYLQYKNVKADYVKAFWNVVNWADVQKRYAAATSKAQGLIFG
>A0R652 1.15.1.1~~~sodA~~~Superoxide dismutase [Mn]~~~COG0605
MAEYTLPDLDYDYGALEPHISGQINELHHSKHHATYVKGVNDAIAKLEEARANGDHAAIFLNEKNLAFHLGGHINHSIWW
KNLSPNGGDKPTGELAAAIDDQFGSFDKFQAQFTAAANGLQGSGWAVLGYDSLGGRLLTFQLYDQQANVPLGIIPLLQVD
MWEHAFYLQYKNVKADYVKAFWNVVNWDDVQNRFAAATSKTSGLIFG
>P53649 1.15.1.1~~~sodA~~~Superoxide dismutase [Mn]~~~
MAEYTLPDLDYDYGALEPHISGQINELHHSKHHATYVKGVNDAIAKLEEARANGDHAAIFLNEKNLAFHLGGHINHSIWW
KNLSPNGGDKPTGELAAAIDDQFGSFDKFQAQFTAAANGLQGSGWAVLGYDSLGGRLLTFQLYDQQANVPLGIIPLLQVD
MWEHAFYLQYKNVKADYVKAFWNVVNWDDVQNRFAAATSKTSGLIFG
>P80293 1.15.1.1~~~sodA~~~Superoxide dismutase [Mn/Fe]~~~
AVYTLPELPYDYSALEPYISGEIMELHHDKHHKAYVDGANTALDKLAEARDKADFGAINKLEKDLAFNLAGHVNHSVFWK
NMAPKGSAPERPTDELGAAIDEFFGSFDNMKAQFTAAATGIQGSGWASLVWDPLGKRINTLQFYDHQNNLPAGSIPLLQL
DMWEHAFYLQYKNVKGDYVKSWWNVVNWDDVALRFSEARVA
>P53652 1.15.1.1~~~sodA~~~Superoxide dismutase [Mn]~~~
MPHALPPLPYAYDALEPHIDALTMEIHHSKHHQTYVNNLNAALEGTPYAEQPVESLLRQLAGLPEKLRTPVVNNGGGHAN
HSLFWTVMSPQGGGRPDGDLGRAIDEQLGGFEAFKDAFTKAALTRFGSGWAWLSVTPQGSLLVESSGNQDSPLMNGNTPI
LGLDVWEHAYYLKYQNRRPEYIGAFYNVIDWREVARRYAQALA
>Q8VQ15 1.15.1.1~~~sodA~~~Superoxide dismutase [Mn/Fe]~~~
MAFELPKLPYAFDALEPHIDKETMEIHHDKHHNTYVTKLNAVVEGTDLEAKSIEEIVANLDSVPSDIQTAVRNNGGGHLN
HSLFWELLTPNSEEKGEVVEKIKEQWGSLDEFKKEFADKAAARFGSGWAWLVVNNGQLEIVTTPNQDNPLTEGKTPILGL
DVWEHAYYLKYQNKRPDYISAFWNVVNWEKVDELYNAAK
>P0C0Q6 1.15.1.1~~~sodA~~~Superoxide dismutase [Mn/Fe]~~~COG0605
MAFELPNLPYAYDALEPHIDKQTMEIHHDKHHNTYVTKLNSAVEGTDLEAKSIEEIVANLDSVPSNIQTAVRNNGGGHLN
HSLFWELLSPNSEEKGEVVDKIKEQWGSLDEFKKEFADKAAARFGSGWAWLVVNNGQLEIVTTPNQDNPITEGKTPILGL
DVWEHAYYLKYQNKRPDYINAFWNVVNWEKVNELYNATK
>Q9K4V3 1.15.1.1~~~sodA~~~Superoxide dismutase [Mn/Fe]~~~COG0605
MAFELPNLPYGFDALEPHIDQQTMEIHHGKHHNTYVTKLNAAVEGTDLESKSIEEIVANLDSVPENIQTAVRNNGGGHLN
HSLFWELLTPNSEEKGTVVDKIKEQWGSLDAFKEEFADKAAARFGSGWAWLVVNNGNLEIVTTPNQDNPITEGKTPILGL
DVWEHAYYLKYQNKRPDYISAFWNVVNWEKVDELYNAAK
>P09738 1.15.1.1~~~sodA~~~Superoxide dismutase [Mn/Fe]~~~COG0605
MAILLPDLPYAYDALEPYIDAETMTLHHDKHHATYVANANAALEKHPEIGENLEVLLADVEQIPADIRQSLINNGGGHLN
HALFWELLSPEKTKVTAEVAAAINEAFGSFDDFKAAFTAAATTRFGSGWAWLVVDKEGKLEVTSTANQDTPISQGLKPIL
ALDVWEHAYYLNYRNVRPNYIKAFFEVINWNTVARLYAEALTK
>P0A4J6 1.15.1.1~~~sodA~~~Superoxide dismutase [Mn]~~~COG0605
MAIILPELPYAYDALEPYIDAETMHLHHDKHHQTYVNNANAALEKHPEIGEDLEALLADVESIPADIRQALINNGGGHLN
HALFWELMTPEKTAPSAELAAAIDATFGSFEEFQAAFTAAATTRFGSGWAWLVVNKEGKLEVTSTANQDTPISEGKKPIL
GLDVWEHAYYVKYRNVRPDYIKAFFSVINWNKVDELYAAAK
>P0C0I0 1.15.1.1~~~sodA~~~Superoxide dismutase [Mn]~~~COG0605
MAIILPELPYAYDALEPQFDAETMTLHHDKHHATYVANTDAALEKHPEIGENLEELLADVPKIPEDIRQALINNGGGHLN
HALFWELLSPEKQDVTPDVAQAIDDAFGSFDAFKEQFTAAATGRFGSGWAWLVVNKEGQLEITSTANQDTPISEGKKPIL
ALDVWEHAYYLNYRNVRPNYIKAFFEIINWKKVSALYQAAK
>P61503 1.15.1.1~~~sodA~~~Superoxide dismutase [Mn]~~~COG0605
MPYPFKLPDLGYPYEALEPHIDAKTMEIHHQKHHGAYVTNLNAALEKYPYLHGVEVEVLLRHLAALPQDIQTAVRNNGGG
HLNHSLFWRLLTPGGAKEPVGELKKAIDEQFGGFQALKEKLTQAAMGRFGSGWAWLVKDPFGKLHVLSTPNQDNPVMEGF
TPIVGIDVWEHAYYLKYQNRRADYLQAIWNVLNWDVAEEFFKKA
>P80735 1.15.1.1~~~sodN~~~Superoxide dismutase [Ni]~~~
MLSRLFAPKVTVSAHCDLPCGVYDPAQARIEAESVKAVQEKMAGNDDPHFQTRATVIKEQRAELAKHHVSVLWSDYFKPP
HFEKYPELHQLVNDTLKALSAAKGSKDPATGQKALDYIAQIDKIFWETKKA
>P80734 1.15.1.1~~~sodN~~~Superoxide dismutase [Ni]~~~
MLSRLFAPKVKVSAHCDLPCGVYDPAQARIEAESVKAIQEKMAANDDLHFQIRATVIKEQRAELAKHHLDVLWSDYFKPP
HFESYPELHTLVNEAVKALSAAKASTDPATGQKALDYIAQIDKIFWETKKA
>D3RNN8 1.8.5.6~~~soeA~~~Sulfite dehydrogenase subunit A~~~COG0243
MQDPASHSDSLVGRVEVKETTCYMCACRCGIRVHLRDGEVRYIDGNPNHPLNKGVICAKGSSGIMKQYSPGRLTQPLRRK
AGAERGESAFEVISWDEAFAMLEERLAKLRAEDPKKFALFTGRDQMQALTGLFAKQYGTPNYAAHGGFCSVNMAAGLIYT
IGGSFWEFGGPDLERAKLFVMIGTAEDHHSNPLKMAISEFKRRGGRFISVNPVRTGYSAVADEWVPIKPGTDGALLLAIT
REILDKGLFDRDFLVRYTNAAELVIDDPSRDDHGLFYRAEMHVEPDCFDPQNKLWWDRDIDGPISTHTPGADPRLMGRYV
LPDGTPVKPSFQLLKERLEQYTPEWAAPITGIPADTIRRLAHEMGVMARDQKIELPIKWTDCWDDEHESVTGNPVAFHAM
RGLAAHSNGFQTIRALGVLMTVLGTIDRPGGFRHKAPYPRPIPPCPKPPHGPEAVQPNTPLDGMPLGWPSKPEDLFVDAE
GEAVRLDKAFSWEYPLSVHGLMHNVITNAWRGDPYPIDTLFLFMANMAWNSTMNTVEVRKMLVDKHPNGDYKIPFLVVCD
TFASETVAFADLVLPDTSYLERHDVLSMLDRPISEFDGPVDSVRIPVLPPKGECKPFQEVLVELGSRLKLPAFTNADGSR
KYRNYPDFIVNYETSPGSGIGFLAGWRGKGGDQFLKGEPNPHQWEMYAQNNCVYHHELPRSYQYMRNWNKGYLHWARAHG
MIRYAEPITLHLYSEVLQRFRLAAQGKRPGRQPPERLRQRVETYFDPLPFYYEPLESRFTDTQRYPLNALTQRPMAMYHS
WDSQNAWLRQIHSHNYLFLSPKVGLAQGFADGDWVWVESPHGKVRCMCRFSEAVEPGTVWTWNAIGKGAGAWGLAPNADE
ARKGFLLNHVIAEELPAHEAGEHLSNSDPVTGQAAWFDVRVRVYKAEAGEPEVTSPQFKPMPRLPGQEKKRGKWQAYVAG
IFGKQAS
>D3RNN7 ~~~soeB~~~Sulfite dehydrogenase subunit B~~~COG0437
MTQLALVIDLNVCVGCHACVTSCKEWNTSGWAGPLVDQNPYEGSPTGTFFNRVQTFEIGTFPNTETVHFPKSCLHCEEPP
CVPVCPTGASYKRPDNGVVLVDYDKCIGCKYCSWACPYGARELDAQQKVMKKCTLCIDRITDAKLSERDRKPSCVLACPA
NARLFGDVHDPDSEVSIAIRERGGYQLMPEWGTKPANHYLPRRKTRMHIDPEELTRVDNPWRKEDLTDYTGEETLDDVAW
>D3RNN6 ~~~soeC~~~Sulfite dehydrogenase subunit C~~~COG3302
MHPAFSVIFLTTLLGAGQGLYLAMVTGQLYAVARFLPAQADQFYAVGSLVALLLLIAGLGASFFHLGRPERAWRAAAMWR
TSWLSREVIVLPIVMALVFAYGVAHWFEWTQPLFQVGAALQVDLTLLLGVLGTIASLALFVCTAMIYAAVRFLQEWHTPL
TVSNFLFLGAASGFMLAAAYSAYIGNPLVTFYGTWAVILTLVGLASRLAHLRRNARLKHKSTVQTAIGVRHASVVQKAQG
ATGGSFNTREFFHGRSQSLLERLRTVYLVLVFPIPVLLIGLSYLIGSSNLPIIAFFVQFAGLLIERWSFFAEARHPQNLY
YQSVA
>Q8E9K5 2.7.7.108~~~fic~~~Protein adenylyltransferase SoFic~~~COG3177
MEWQAEQAYNHLPPLPLDSKLAELAETLPILKACIPARAALAELKQAGELLPNQGLLINLLPLLEAQGSSEIENIVTTTD
KLFQYAQEDSQADPMTKEALRYRTALYQGFTQLSNRPLCVTTALEICSTIKSVQMDVRKVPGTSLTNQATGEVIYTPPAG
ESVIRDLLSNWEAFLHNQDDVDPLIKMAMAHYQFEAIHPFIDGNGRTGRVLNILYLIDQQLLSAPILYLSRYIVAHKQDY
YRLLLNVTTQQEWQPWIIFILNAVEQTAKWTTHKIAAARELIAHTTEYVRQQLPKIYSHELVQVIFEQPYCRIQNLVESG
LAKRQTASVYLKQLCDIGVLEEVQSGKEKLFVHPKFVTLMTKDSNQFSRYAL
>P0AG14 3.4.21.-~~~sohB~~~Probable protease SohB~~~COG0616
MELLSEYGLFLAKIVTVVLAIAAIAAIIVNVAQRNKRQRGELRVNNLSEQYKEMKEELAAALMDSHQQKQWHKAQKKKHK
QEAKAAKAKAKLGEVATDSKPRVWVLDFKGSMDAHEVNSLREEITAVLAAFKPQDQVVLRLESPGGMVHGYGLAASQLQR
LRDKNIPLTVTVDKVAASGGYMMACVADKIVSAPFAIVGSIGVVAQMPNFNRFLKSKDIDIELHTAGQYKRTLTLLGENT
EEGREKFREELNETHQLFKDFVKRMRPSLDIEQVATGEHWYGQQAVEKGLVDEINTSDEVILSLMEGREVVNVRYMQRKR
LIDRFTGSAAESADRLLLRWWQRGQKPLM
>P37522 3.6.-.-~~~soj~~~Sporulation initiation inhibitor protein Soj~~~COG1192
MGKIIAITNQKGGVGKTTTSVNLGACLAYIGKRVLLVDIDPQGNATSGLGIEKADVEQCVYDILVDDADVIDIIKATTVE
NLDVIPATIQLAGAEIELVPTISREVRLKRALEAVKQNYDYIIIDCPPSLGLLTINALTASDSVVIPVQCEYYALEGLSQ
LLNTVRLVQKHLNTDLMIEGVLLTMLDARTNLGIQVIEEVKKYFRDKVYKTVIPRNVRLSEAPSHGKPIILYDPRSRGAE
VYLDLAKEVAANG
>Q72H90 3.6.-.-~~~soj~~~Chromosome-partitioning ATPase Soj~~~COG1192
MLRAKVRRIALANQKGGVGKTTTAINLAAYLARLGKRVLLVDLDPQGNATSGLGVRAERGVYHLLQGEPLEGLVHPVDGF
HLLPATPDLVGATVELAGAPTALREALRDEGYDLVLLDAPPSLSPLTLNALAAAEGVVVPVQAEYYALEGVAGLLATLEE
VRAGLNPRLRLLGILVTMYDGRTLLAQQVEAQLRAHFGEKVFWTVIPRNVRLAEAPSFGKTIAQHAPTSPGAHAYRRLAE
EVMARVQEA
>Q9S4P4 2.3.2.26~~~sopA~~~E3 ubiquitin-protein ligase SopA~~~
MKISSGAINFSTIPNQVKKLITSIREHTKNGLASKITSVKNTHTSLNEKFKTGKDSPIEFALPQKIKDFFQPKDKNTLNK
TLITVKNIKDTNNAGKKNISAEDVSKMNAAFMRKHIANQTCDYNYRMTGAAPLPGGVSVSANNRPTVSEGRTPPVSPSLS
LQATSSPSSPADWAKKLTDAVLRQKAGETLTAADRDFSNADFRNITFSKILPPSFMERDGDIIKGFNFSNSKFTYSDISH
LHFDECRFTYSTLSDVVCSNTKFSNSDMNEVVLQYSITTQQQPSFIHTTLKNTLIRHKANLSGVILNEPHNSSPPSVSGG
GNFIRLGDIWLQMPLLWTENAVDGFLNHEHNNGKSILMTIDSLPDKYSQEKVQAMEDLVKSLRGGRLTEACIRPVESSLV
SVLAHPPYTQSALIREWLGPVQERFFAHQCQTYNDVPLPTPDTYYQQRILPVLLDSFDRNSAAMTTHSGLFNQVILHCMT
GVDCTDGTRQKAAALYEQYLAHPAVSPHIHNGLFGNYDGSSDWTTRAADNFLLLSSQDSDTAMMLSTDTLLTMLNPTPDT
AWDNFYLQRAGENVSTAQISPVELFRHDFPVFLAAFNQQATQRRFGELIDIILSTEEHGELNQQFLAATNQKHSTVKLID
DASVSRLATIFDPLLPEGKLSPAHYQHILSAYHLTDATPQKQAETLFCLSTAFARYSSSAIFGTEHDSPPALRGYAEALM
QKAWELSPAIFPTSEQFTDWSDRFHGLHGAFTCTCVVADSMQRHARKYFPSVLSSILPLSWA
>Q8ZNR3 2.3.2.26~~~sopA~~~E3 ubiquitin-protein ligase SopA~~~
MKISSGAINFSTIPNQVKKLITSIREHTKNGLTSKITSVKNTHTSLNEKFKTGKDSPIEFALPQKIKDFFQPKDKNTLNK
TLITVKNIKDTNNAGKKNISAEDVSKMNAAFMRKHIANQTCDYNYRMTGAAPLPGGVSVSANNRPTVSEGRTPPVSPSLS
LQATSSPSSPADWAKKLTDAVLRQKAGETLTAADRDFSNADFRNITFSKILPPSFMERDGDIIKGFNFSNSKFTYSDISH
LHFDECRFTYSTLSDVVCSNTKFSNSDMNEVFLQYSITTQQQPSFIDTTLKNTLIRHKANLSGVILNEPDNSSPPSVSGG
GNFIRLGDIWLQMPLLWTENAVDGFLNHEHNNGKSILMTIDSLPDKYSQEKVQAMEDLVKSLRGGRLTEACIRPVESSLV
SVLAHPPYTQSALISEWLGPVQERFFAHQCQTYNDVPLPAPDTYYQQRILPVLLDSFDRNSAAMTTHSGLFNQVILHCMT
GVDCTDGTRQKAAALYEQYLAHPAVSPHIHNGLFGNYDGSPDWTTRAADNFLLLSSQDSDTAMMLSTDTLLTMLNPTPDT
AWDNFYLLRAGENVSTAQISPVELFRHDFPVFLAAFNQQATQRRFGELIDIILSTEEHGELNQQFLAATNQKHSTVKLID
DASVSRLATIFDPLLPEGKLSPAHYQHILSAYHLTDATPQKQAETLFCLSTAFARYSSSAIFGTEHDSPPALRGYAEALM
QKAWELSPAIFPSSEQFTEWSDRFHGLHGAFTCTSVVADSMQRHARKYFPSVLSSILPLAWA
>P62558 ~~~sopB~~~Protein SopB~~~
MKRAPVIPKHTLNTQPVEDTSLSTPAAPMVDSLIARVGVMARGNAITLPVCGRDVKFTLEVLRGDSVEKTSRVWSGNERD
QELLTEDALDDLIPSFLLTGQQTPAFGRRVSGVIEIADGSRRRKAAALTESDYRVLVGELDDEQMAALSRLGNDYRPTSA
YERGQRYASRLQNEFAGNISALADAENISRKIITRCINTAKLPKSVVALFSHPGELSARSGDALQKAFTDKEELLKQQAS
NLHEQKKAGVIFEAEEVITLLTSVLKTSSASRTSLSSRHQFAPGATVLYKGDKMVLNLDRSRVPTECIEKIEAILKELEK
PAP
>O34105 3.1.3.-~~~sopB~~~Inositol phosphate phosphatase SopB~~~
MQIQSFYHSASLKTQEAFKSLQKTLYNGMQILSGQGKAPAKAPDARPEIIVLREPGATWGNYLQHQKTSNHSLHNLYNLR
RDLLTVGATVLGKQDPVLTSMANQMELAKVKADRPATKQEEAAAKALKKNLIELIAARTQQQDGLPAKEAHRFAAVAFRD
AQDKQLNNQPWQTIKNTLTHNGHHYTNTQLPAAEMKIGAKDIFPSAYEGKGVCSWDTKNIHHANNLWMSTVSVHEDGKDK
TLFCGIRHGVLSPYHEKDPLLRQVGAENKAKEVLTAALFSKPELLNKALAGEAVSLKLVSVGLLTASNIFGKEGTMVEDQ
MRAWQSLTQPGKMIHLKIRNKDGDLQTVKIKPDVAAFNVGVNELALKLGFGLKASDSYNAEALYQLLGNDLRPEARPGGW
VGEWLAQYPDNYEVVNTLARQIKDIWKNNQHHKDGGEPYKLAQRLAMLAHEIDAVPAWNCKSGKDRTGMMDSEIKREIIS
LHQTHMLSAPGSLPDSGGQKIFQKVLLNSGNPGDSEPNTGGAGNKVMKNLSPEVLNLSYQKRVGDENIWQSVKGISSLIT
S
>O30916 3.1.3.-~~~sopB~~~Inositol phosphate phosphatase SopB~~~
MQIQSFYHSASLKTQEAFKSLQKTLYNGMQILSGQGKAPAKAPDARPEIIVLREPGATWGNYLQHQKASNHSLHNLYNLQ
RDLLTVAATVLGKQDPVLTSMANQMELAKVKADRPATKQEEAAAKALKKNLIELIAARTQQQDGLPAKEAHRFAAVAFRD
AQVKQLNNQPWQTIKNTLTHNGHHYTNTQLPAAEMKIGAKDIFPSAYEGKGVCSWDTKNIHHANNLWMSTVSVHEDGKDK
TLFCGIRHGVLSPYHEKDPLLRHVGAENKAKEVLTAALFSKPELLNKALAGEAVSLKLVSVGLLTASNIFGKEGTMVEDQ
MRAWQSLTQPGKMIHLKIRNKDGDLQTVKIKPDVAAFNVGVNELALKLGFGLKASDSYNAEALHQLLGNDLRPEARPGGW
VGEWLAQYPDNYEVVNTLARQIKDIWKNNQHHKDGGEPYKLAQRLAMLAHEIDAVPAWNCKSGKDRTGMMDSEIKREIIS
LHQTHMLSAPGSLPDSGGQKIFQKVLLNSGNLEIQKQNTGGAGNKVMKNLSPEVLNLSYQKRVGDENIWQSVKGISSLIT
S
>Q8ZQC8 ~~~sopD2~~~Secreted effector protein sopD2~~~
MPVTLSFGNRHNYEINHSRLARLMSPDKEEALYMGVWDRFKDCFRTHKKQEVLEVLYTLIHGCERENQAELNVDITGMEK
IHAFTQLKEYANPSQQDRFVMRFDMNQTQVLFEIDGKVIDKCNLHRLLNVSENCIFKVMEEDEEELFLKICIKYGEKISR
YPELLEGFANKLKDAVNEDDDVKDEVYKLMRSGEDRKMECVEWNGTLTEEEKNKLRCLQMGSFNITTQFFKIGYWELEGE
VLFDMVHPTLSYLLQAYKPSLSSDLIETNTMLFSDVLNKDYDDYQNNKREIDAILRRIYRSHNNTLFISEKSSCRNMLI
>P40722 ~~~sopD~~~Secreted effector protein SopD~~~
MPVTLSFGNHQNYTLNESRLAHLLSADKEKAIHMGGWDKVQDHFRAEKKDHALEVLHSIIHGQGRGEPGEMEVNVEDINK
IYAFKRLQHLACPAHQDLFTIKMDASQTQFLLMVGDTVISQSNIKDILNISDDAVIESMSREERQLFLQICEVIGSKMTW
HPELLQESISTLRKEVTGNAQIKTAVYEMMRPAEAPDHPLVEWQDSLTADEKSMLACINAGNFEPTTQFCKIGYQEVQGE
VAFSMMHPCISYLLHSYSPFSEFKPTNSGFLKKLNQDYNDYHAKKMFIDVILEKLYLTHERSLHIGKDGCSRNILLT
>Q7CQD4 ~~~sopE2~~~Guanine nucleotide exchange factor sopE2~~~
MTNITLSTQHYRIHRSDVEPVKEKTTEKDIFAKSITAVRNSFISLSTSLSDRFSLHQQTDIPTTHFHRGNASEGRAVLTS
KTVKDFMLQKLNSLDIKGNASKDPAYARQTCEAILSAVYSNNKDQCCKLLISKGVSITPFLKEIGEAAQNAGLPGEIKNG
VFTPGGAGANPFVVPLIASASIKYPHMFINHNQQVSFKAYAEKIVMKEVTPLFNKGTMPTPQQFQLTIENIANKYLQNAS
>O06949 ~~~sopE~~~Guanine nucleotide exchange factor SopE~~~
MTKITLFPHNFRIQKQEATPLKEKSTEKNSLAKSILAVKNHFIKLNSKLSERFISHKNTESSATHFHRGSASEGRAVLTN
KVVKNFMLQTLHDIDIRGSASKDPAYASQTREAILSAVYSKYKDQYCNLLISKGIDIAPFLKEIGEAAQNAGLPGATKND
VFSPSGAGANPFITPLITSAYSKYPHMFTSQHQKASFNIYAEKIIMTEVVPLFNECAMPTPQQFQQILENIANKYIQNTP
>O52623 ~~~sopE~~~Guanine nucleotide exchange factor SopE~~~
MTKITLSPQNFRIQKQETTLLKEKSTEKNSLAKSILAVKNHFIELRSKLSERFISHKNTESSATHFHRGSASEGRAVLTN
KVVKDFMLQTLNDIDIRGSASKDPAYASQTREAILSAVYSKNKDQCCNLLISKGINIAPFLQEIGEAAKNAGLPGTTKND
VFTPSGAGANPFITPLISSANSKYPRMFINQHQQASFKIYAEKIIMTEVAPLFNECAMPTPQQFQLILENIANKYIQNTP
>A6LGF6 3.2.1.214~~~~~~Exo beta-1,2-glucooligosaccharide sophorohydrolase (non-reducing end)~~~COG5368
MKHIALLTTLLLSASLQAVEKPYDYVFFENSLMKGDYFYSQAKYTSPSWIKNARHHLPVAGSVAFTPGNSLELTYVSAPG
GDWYSEIQYCPVRGNDFFREPSTLSMQVRLRESMNAAALPNIAIRYADSTYTQYLNLRNYLKDTRPGVWHPVSIPLEDFG
LNAVNDTNIKKLAAVALRPGTADGNEYTIYLDDIELLPASLPSVSALNAPVLQEAKAYERHIDIKWIPQSKEDIKYYRIY
RSFDGITYQPVAVRRPWMNRYTDFLGEVGKKAYYKVTAVDYALNESNDSQTVSATTYPMTDEQLLDMVQEANFRYYWEGA
EPNSGLARENIPGRNDMIATGASGFGIMAIVAGIERGFITREEGVQRFLKITSFLEKADKFHGAVSHFIDGTTGKTVAFF
GPKDNGGDLVETSFLFQGLLTARQYFNQENDKEKQIRKSIDNLWKNVEWSWYKQFKDSPYLYWHWSPDQAWVINHKLIGW
NETMITYMLAIMGPKYGISPEMYYSGWASQEEYAQEYRADWGRVEDGKMYTNGNTYYGENLKVGVSNGGPLFFIHYSYLG
LDPHKFTDKYTNYFENNQKMAKINQRYCIENQGGYVGYGEDCWGLTASDFAWNYQAQEPMPHRDNGTMAPTGALASFPYT
PDASMKALRNYYRNHGSFLWGEYGFRDAFNLTVNWVSPLFMGLNQAPVTVMIENYRTNLLWNLFMSHPDVQKGIQKIQSI
K
>P37078 ~~~sorC~~~Sorbitol operon regulator~~~
MENSDDIRLIVKIAQLYYEQDMTQAQIARELGIYRTTISRLLKRGREQGIVTIAINYDYNENLWLEQQLKQKFGLKEAVV
ASSDGLLEEEQLSAMGQHGALLVDRLLEPGDIIGFSWGRAVRSLVENLPQRSQSRQVICVPIIGGPSGKLESRYHVNTLT
YGAAARLKAESHLADFPALLDNPLIRNGIMQSQHFKTISSYWDSLDVALVGIGSPAIRDGANWHAFYGSEESDDLNARHV
AGDICSRFYDINGGLVDTNMSEKTLSIEMAKLRQARYSIGIAMGEEKYSGILGALHGRYINCLVTNRETAELLLK
>Q9WZC6 1.15.1.2~~~~~~Putative superoxide reductase~~~COG2033
MKLSDFIKTEDFKKEKHVPVIEAPEKVKKDEKVQIVVTVGKEIPHPNTTEHHIRWIKVFFQPDGDPYVYEVGRYEFNAHG
ESVQGPNIGAVYTEPTVTTVVKLNRSGTIIALSYCNIHGLWESSQKITVEE
>Q9S3K0 ~~~sotA~~~Sugar efflux transporter A~~~
MTISSARTARRLPDLTSSAFLVIAFLTGIAGALQLPTLSLFLSTEVQVRPFMVGLFYTGSAVIGIVVSQILATYSDRQGD
RKTLILQCCLLGALACLLYAWNRNYFVLLFIGVLLSSFGSTANPQLFALAREHADRTGRGAAMFSSVMRAQISLSWVIGP
PVAFALALGFGFPAMYLTAAVVFVLCGLLVWLLLPSMPKTRVKSAATLESPRQNRRDTLLLFTACTLMWTCNGIYLINMP
LYLVNELRLPEKLAGVMMGTAAGLEIPVMLLAGYLTSRLGKRLLMRLAVIAGLIFYTGLTLLNGSWALLALQLLNAIFIG
ILAGMGMLYFQDLMPGQAGAATTLFTNTTRVGWIISGSLAGIVAEVWSYHAGFVIAIAMLAGAAVCMWRIRDV
>Q9S3J9 ~~~sotB~~~Sugar efflux transporter B~~~
MTSASPSRSTAWLRVVTLAIAAFIFNTTEFIPVGLLSDIANSFAMKTEDVGLMITIYAWIVAVASLICMLLTSGIERRKL
LIGLFSLFILSHLLSAVAWNFTVLVISRAGVALAHSVFWSITASLAIRMAPPGKRAQALGLIATGSSLAMVLGLPLGRVI
GQYLGWRVTFLTIAAGATVAMILLARLLPLLPSEHSGSLGSVPKLFRRPALVGIYLLTVVVVTAHFTAYSYIEPFIQTVA
GLPENFTTLILLLFGCAGIAGSMLYSRYSDRFPIGFLVTAMLLLLACLTLLMPLSGYPFGLTLLCLVWGLAMMSIGLAMQ
AKVLSLAPDASDVAMSIFSGLFNLGIGGGALLGSQVSLHLGMDKIGYVGAPLVLVALFATLLSVYRSVRLVHSRV
>P31122 ~~~sotB~~~Sugar efflux transporter~~~COG2814
MTTNTVSRKVAWLRVVTLAVAAFIFNTTEFVPVGLLSDIAQSFHMQTAQVGIMLTIYAWVVALMSLPFMLMTSQVERRKL
LICLFVVFIASHVLSFLSWSFTVLVISRIGVAFAHAIFWSITASLAIRMAPAGKRAQALSLIATGTALAMVLGLPLGRIV
GQYFGWRMTFFAIGIGALITLLCLIKLLPLLPSEHSGSLKSLPLLFRRPALMSIYLLTVVVVTAHYTAYSYIEPFVQNIA
GFSANFATALLLLLGGAGIIGSVIFGKLGNQYASALVSTAIALLLVCLALLLPAANSEIHLGVLSIFWGIAMMIIGLGMQ
VKVLALAPDATDVAMALFSGIFNIGIGAGALVGNQVSLHWSMSMIGYVGAVPAFAALIWSIIIFRRWPVTLEEQTQ
>E1WKT5 2.1.3.11~~~argF'~~~N-succinylornithine carbamoyltransferase~~~
MKKFTCVQDIGDLKSALAESFEIKKDRFKYVELGRNKTLLMIFFNSSLRTRLSTQKAALNLGMNVIVLDINQGAWKLETE
RGVIMDGDKPEHLLEAIPVMGCYCDIIGVRSFARFENREYDYNEVIINQFIQHSGRPVFSMEAATRHPLQSFADLITIEE
YKKTARPKVVMTWAPHPRPLPQAVPNSFAEWMNATDYEFVITHPEGYELDPKFVGNARVEYDQMKAFEGADFIYAKNWAA
YTGDNYGQILSTDRNWTVGDRQMAVTNNAYFMHCLPVRRNMIVTDDVIESPQSIVIPEAANREISATVVLKRLLENLP
>D3DJG4 2.8.5.2~~~soxA1~~~L-cysteine S-thiosulfotransferase subunit SoxA~~~COG3258
MGKWVTIIFVLFLYAIAQQENPAEEVKKQKELLLKEMGILPGDVYAEQGRDMFNKPMGNAGKSCSSCHGQDGRYLRGAYA
HMPRYYKDMDAVADLDTRIKYCMEKYMGVGNVKHDLNFKSIATYVATLSNGMKMDVKLTHPKEREMYEKGRELWYARVGK
MDFSCAICHDSEAGKRVFLQTVVAVKEDKVATHWPAYRFSNDQLWTMEDRIRGCFGDMRVAPPEHFHWAVVALNLYLSYK
AKGGVVRVPGFIY
>D3DJG5 2.8.5.2~~~soxA2~~~L-cysteine S-thiosulfotransferase subunit SoxA~~~COG3258
MRKLWFLPILLGAVGGVSLYAIAQQENPAEEVKKQKELLLKEMGILPGDVYAEQGRDMFNKPMGNAGKSCSSCHGQDGRY
LRGAYAHMPRYYKDMDAVADLDTRIKYCMEKYMGVGNVKHDLNFKSIATYVATLSNGMKMDVKLTHPKEREMYEKGRELW
YARVGKMDFSCAICHDTFGGQRIRLQTLAKVKEDKVATHWPAYRFSNDQLWTMEDRIRGCYNQIRVTPPPHFSWPQIALS
LYMAYESKGGTIETPGFVR
>Q1W3E4 2.8.5.2~~~soxA~~~L-cysteine S-thiosulfotransferase subunit SoxA~~~
MTKHGFLLATLVLAGATLPIGPVTAATPEEEQAAFQAYFKQRFPNVPEDEFKNGTYAIDPVTRENWEAIEEFPPYENAIS
QGETLWNTPFADGQGYADCFPDGPAIMNHYPRWDRERGQVMTLPLALNACRTAHGETPLKYKKGPIADLLAYIAFESRGQ
ITRVEIPQDDPRALAAYEQGKRFYFARRGQLNFACAHCHLATSGTKLRTETLSPAYGHTTHWPVYRSEWGEMGTLHRRFA
GCNEQVRAKAFEPQGEEYRNLEYFLTYMNNGLELNGPGARK
>Q9AGP1 1.5.3.1~~~soxA~~~Sarcosine oxidase subunit alpha~~~
MSQNKSYRLPAEQSPAARIDRGEALVLSVDGKQLDAFRGDTVASAMLANGQRSCGNSMYLDRPRGIFSAGVEEPNALVTV
AARHEEDINESMLAATTVPVTANLSATLLRGLGVLDPSTDPAYYDHVHVHTDVLVVGAGPAGLAAAREASRSGARVLLLD
ERAEAGGSLRDAAGEQIDGQDAAAWIDATAAELASAAETTHLQRTTVLGSYDANYVIAAQRRTVHLDAPSGAGVSRERIW
HIRANQVVLATGAHERPIVFQNNDRPGIMLAGAVRSYLNRYGVRAGTRIVVATTNDSAYPLVADLAASGGVVAVVDARTT
VSAAAAEAVGAGVRVITGSVVADTEANESGELSAVVVAELGEDRELGAPQRFEADVLAVAGGFNPVVHLHSQRQGKLVWD
TSIHAFVPDTAVANQHLAGALTGLFDTASALSTGAATGAAAATAAGFERIAQVPQALAVPAGEARPVWLVPSLNGDQAAN
YTTHFVDLQRDQTVSDVLRATGAGLESVEHIKRYTSISTANDQGKTSGVAAIGVIAAVLGIENPAEIGTTTFRAPYTPVS
FAALAGRTRGALLDPARITPMHSWHLAHGAAFEDVGQWKRAWYFPQDGEDMDAAVYRESKAVRDSVGMLDATTLGKIEIR
GKDAAEFLNRIYTNGYTKLKVGMARYGVMCKADGMVFDDGVTLRLAEDRFLMHTTTGGAAGVLDWLEEWLQTEWPELDVT
CTSVTEQLATVAVVGPRSRDVVAKLASGLDVSNEAFKFMSFRDVTLDSGIEARISRISFSGELAYEIAIPSWHGLRVWKD
VFAAGQEFNITPYGTETMHVLRAEKGFIIVGQDTDGTVTPQDAGMEWVVSKLKDFVGKRSFARADNLREDRKHLVSVLPV
DTTLRLAEGAALVADGAVETGGCTPMEGWVTSSYNSPALGRTFGLALIKNGRSRIGEVLKTPVNGQLVDVLVSDLVLFDP
EGSRRDG
>Q8KDM7 2.8.5.2~~~soxA~~~L-cysteine S-thiosulfotransferase subunit SoxA~~~COG3258
MKKTIQRGLFTGALVLMTAMTAKPANAEVNYQALVDADVKAFQGFFRKEFPDVKLEDFGNGVYALDEDARKQWKEMEEFP
PYELDVEAGKALFNKPFANGKSLASCFPNGGAVRGMYPYFDEKRKEVVTLEMAINECRVANGEKPYAWEKGDIARVSAYI
ASISRGQKVDVKVKSKAAYDAYMKGKKFFYAKRGQLNMSCSGCHMEYAGRHLRAEIISPALGHTTHFPVFRSKWGEIGTL
HRRYAGCSNNIGAKPFAPQSEEYRDLEFFQTVMSNGLKYNGPASRK
>Q8RLX0 2.8.5.2~~~soxA~~~L-cysteine S-thiosulfotransferase subunit SoxA~~~
MKKTIQRGLFTGALVLLTAMTSKPAHAAVNYQALVDADVKKFQGYFLKEFPGVKLEDFGDGVYALDEDSRKQWKEMEEFP
PYELDVEAGKALFNKPFANGKSLGSCFSNGGAVRGMYPYFDEKRKEVITLEMAINECRVANGEKPYAPKKGDIARVSAYI
ASISRGQKIDVKVKSKAAYDAYMKGKEMFYAKRGQLNMSCSGCHMEYSGRHLRAEIISPALGHTTHFPVFRSKWGEIGTL
HRRYAGCNENIGAKPFPAQSKEYRDLEFFQTVMSNGLKFNGPASRK
>Q46337 1.5.3.1~~~soxA~~~Sarcosine oxidase subunit alpha~~~
MSQNTSYRLPAEQSPAARIDRGEALVLTVDGKQLEAFRGDTVASAMLANGQRACGNSMYLDRPRGIFSAGVEEPNALVTV
EARHEQDINESMLAATTVPVTANLSATLLRGLGVLDPSTDPAYYDHVHVHTDVLVVGAGPAGLAAAREASRSGARVLLLD
ERAEAGGSLRDAAGEQIDGQDAAAWIDATVAELAAAEETTHLQRTTVLGSYDANYVVAVQRRTVHLDGPSGAGVSRERIW
HIRANQVVLATGAHERPIVFENNDRPGIMLAGAVRSYLNCYGVRAGSQIVVATTNDSAYPLVADLAASGGVVAVIDARTT
VSAAAAEAVGAGVRVITGSVVVDTEANESGELSAVIVAELGEDRELGEPQRFEADVLAVAGGFNPVVHLHSQRQGKLVWD
TSIHAFVPDAAVANQHLAGALTGLFDTASALSTGAAVGAAAATAAGFERIAQVPQALPVPAGETRPVWLVPSLNGDQATN
YTTHFVDLQRDQTVSDVLRATGAGLESVEHIKRYTSISTANDQGKTSGVAAIGVIAAVLGIENPAEIGTTTFRAPYTPVA
FAALAGRTRGELLDPARITPMHSWHLAQGAKFEDVGQWKRAWYFPQDGEDMDAAVYRECAAVRESVGMLDATTLGKIEIR
GADAAEFLNRIYTNGYTKLKVGMARYGVMCKADGMVFDDGVTLRLAEDRFLMHTTTGGAAGVLDWLEEWLQTEWPELDVT
CTSVTEQLATVAVVGPRSRDVVAKLVTGLDVSNDAFKFMSFQDVTLDSGIEARISRISFSGELAYEIAIPSWHGLRVWED
VYAAGQEFNITPYGTETMHVLRAEKGFIIVGQDTDGTVTPQDAGMEWVVSKLKDFVGKRSFSREDNLREDRKHLVSVLPV
DTALRLAEGAALVADGAVETEGCTPMEGWVTSSYNSPALGRTFGLALIKNGRSRIGEVLKTPVNGQLVDVLVSDLVLFDP
EGSRRDG
>Q08IS0 2.8.5.2~~~soxA~~~L-cysteine S-thiosulfotransferase subunit SoxA~~~
MKKTVTAVALLCALSSTAIAPTFAADDDEAARLKAIEEYRKQIADGNPSDLLAMEGEELWRTPYGPKNQSLEQCDLGLGP
GVVKGAAAKLPRYFPDTGKVEDLESRLMTCMERLQGVERQKVIDSPWRKGEKLRMDKIVAYIVTESNGEKIDVDMSHPKM
KEMYELGKRMFFYRTGPFDFSCATCHGKDGQRIRLQELPNLTKHEGAAAGWGSWPAYRVSNSQFWTMQMRLNDCFRQQRT
AEPIYGSDATIALSVYMAANGNGGVMLTPGIKR
>O33434 2.8.5.2~~~soxA~~~L-cysteine S-thiosulfotransferase subunit SoxA~~~COG3258
MPRFTKTKGTLAATALGLALAGAAFADPVEDGLVIETDSGPVEIVTKTAPPAFLADTFDTIYSGWHFRDDSTRDLERDDF
DNPAMVFVDRGLDKWNAAMGVNGESCASCHQGPESMAGLRAVMPRVDEHTGKLMIMEDYVNACVTERMGLEKWGVTSDNM
KDMLSLISLQSRGMAVNVKIDGPAAPYWEHGKEIYYTRYGQLEMSCANCHEDNAGNMIRADHLSQGQINGFPTYRLKDSG
MVTAQHRFVGCVRDTRAETFKAGSDDFKALELYVASRGNGLSVEGVSVRH
>Q9K4M4 2.8.5.2~~~soxA~~~L-cysteine S-thiosulfotransferase subunit SoxA~~~
MTVSKRFLAPVFAMVGGLVLAFSANADPVDEELVIDDIPMVTRAAAPEGHPFDEGLSGWLFREAETRETEADSFANPGML
AVERGADIWNTVEGSAGKSCASCHDDAATSMKNVGAQYPKWDADAKRPINIELQIDKCRVENMGAEPYKFDAEGQVALTS
YIKHQSLGTPVKMDLSDGELQDWWEKGKELYYTRTGQLNFACASCHEDSMGKYIRADHLSQGQANGFPTYRFNTGGMVSL
HNRFRGCIRDTRAEMPKAFSDELMALEVYVTWRGTGLSVETPAVRQ
>Q939U1 2.8.5.2~~~soxA~~~L-cysteine S-thiosulfotransferase subunit SoxA~~~COG3258
MKTMTGRLVAAALVCGGAFSGAAVSAGPDDPLVINGEIEIVTRAPTPAHLADRFDEIRSGWTFRTDDTQALEMDDFENSG
MVFVEEARAVWDRPEGTEGKACADCHGAVDDGMYGLRAVYPKYVESAGKVRTVEQMINACRTSRMGAPEWDYIGPDMTAM
VALIASVSRGMPVSVAIDGPAQSTWEKGREIYYTRYGQLDLSCASCHEQYFDHYIRADHLSQGQINGFPSYRLKNARLNA
VHDRFRGCIRDTRGVPFAVGSPEFVALELYVASRGNGLSVEGPSVRN
>D7A6E5 2.8.5.2~~~soxA~~~L-cysteine S-thiosulfotransferase subunit SoxA~~~COG3258
MRRFAAGCLALALLVLPFVLTGARAAEDESEKEIERYRQMIEDPMANPGFLNVDRGEVLWSEPRGTRNVSLETCDLGEGP
GKLEGAYAHLPRYFADTGKVMDLEQRLLWCMETIQGRDTKPLVAKPFSGPGRTSDMEDLVAFIANKSDGVKIKVALATPQ
EKEMYAIGEALFFRRSSINDFSCSTCHGAAGKRIRLQALPQLDVPGKDAQLTMATWPTYRVSQSALRTMQHRMWDCYRQM
RMPAPDYASEAVTALTLYLTKQAEGGELKVPSIKR
>Q9AGP3 1.5.3.1~~~soxB~~~Sarcosine oxidase subunit beta~~~
MADLLPEHPEFLWANPEPKKSYDVVIVGGGGHGLATAYYLAKNHGITNVAVLEKGWLAGGNMARNTTIIRSNYLWDESAG
IYEKSLKLWEQLPEELDYDFLFSQRGVLNLAHTLGDVRESIRRVEANKFNGVDAEWLTPEQVKEVCPIINIGDDIRYPVM
GATYQPRAGIAKHDHVAWAFARKANEMGVDIIQNCEVTGFLKDGEKVTGVKTTRGTIHAGKVALAGAGHSSVLAELAGFE
LPIQSHPLQALVSELFEPVHPTVVMSNHIHVYVSQAHKGELVMGAGIDTYNGYGQRGAFHVIEEQMAAAVELFPIFARAH
VLRTWGGIVDTTMDALPIISKTPIQNLYVNCGWGTGGFKGTPGAGFTLAHTIANDEAHALNAPFSLERFETGHLIDEHGA
AAVAH
>P40875 1.5.3.1~~~soxB~~~Sarcosine oxidase subunit beta~~~
MADLLPEHPEFLWANPEPKKSYDVVIVGGGGHGLATAYYLAKNHGITNVAVLEKGWLAGGNMARNTTIIRSNYLWDESAG
IYEKSLKLWEQLPEELDYDFLFSQRGVLNLAHTLGDVRESVRRVEANKFNGVDAEWLTPEQVKEVCPIINIGDDIRYPVM
GATYQPRAGIAKHDHVAWAFARKANEMGVDIIQNCEVTGFLKDGEKVTGVKTTRGTIHAGKVALAGAGHSSVLAELAGFE
LPIQSHPLQALVSELFEPVHPTVVMSNHIHVYVSQAHKGELVMGAGIDSYNGYGQRGAFHVIEEQMAAAVELFPIFARAH
VLRTWGGIVDTTMDASPIISKTPIQNLYVNCGWGTGGFKGTPGAGFTLAHTIANDEAHALNAPFSLERFETGHLIDEHGA
AAVAH
>Q9AGP2 1.5.3.1~~~soxD~~~Sarcosine oxidase subunit delta~~~
MMLIECPNCGPRNETEFSYGGQAHVAYPEDPNTLSDKEWSRYLFYRGNSKGIFAERWVHSGGCRKWFNALRDTATYEFKA
VYRAGEPRPELNTQGGSR
>Q46336 1.5.3.1~~~soxD~~~Sarcosine oxidase subunit delta~~~
MMLIECPNCGPRNETEFSYGGQAHVAYPEDPNALSDKEWSRYLFYRENSKGIFAERWVHSGGCRKWFNALRDTATYEFKA
VYRAGEPRPELNTQGGSR
>Q9AGP0 1.5.3.1~~~soxG~~~Sarcosine oxidase subunit gamma~~~
MASNTLIESTSLRRSPAEHLAEAMAQGSTAGAVVLREIAFATQVGVRAVPGSAGHAALAAALGTGLPQQVGEVAGAAEGT
AVLWLGPDEFLAIAPEGSGLAGELVAALGGEPGQVIDLSANRSVLELSGPAAPLVLRKSCPADLHPRAFGVNRAIATTLA
NIPVLLWRTGEQSWYVLPRVFFTEHTVHWLIDAMTEFASPTVA
>Q46338 1.5.3.1~~~soxG~~~Sarcosine oxidase subunit gamma~~~
MASNTLIESTSVRRSPAEHLAEAMAQGSTAGTVQLREIAFATQVGVRAVPGSGGFAALAEAVGTGLPQQVGVVAGSVEGT
AVLWLGPDEFLAIAPEGAELAAELVAALGDEPGQVLDLSANRSVLELSGPAAPLVLRKSCPADLHPRAFGVNLAITTTLA
NIPVLLWRTGEQSWYILPRASFTEHTVHWLIDAMSEFASEPVA
>P0ACS2 ~~~soxR~~~Redox-sensitive transcriptional activator SoxR~~~COG0789
MEKKLPRIKALLTPGEVAKRSGVAVSALHFYESKGLITSIRNSGNQRRYKRDVLRYVAIIKIAQRIGIPLATIGEAFGVL
PEGHTLSAKEWKQLSSQWREELDRRIHTLVALRDELDGCIGCGCLSRSDCPLRNPGDRLGEEGTGARLLEDEQN
>P0A9E2 ~~~soxS~~~Regulatory protein SoxS~~~COG2207
MSHQKIIQDLIAWIDEHIDQPLNIDVVAKKSGYSKWYLQRMFRTVTHQTLGDYIRQRRLLLAAVELRTTERPIFDIAMDL
GYVSQQTFSRVFRRQFDRTPSDYRHRL
>P06534 ~~~spo0A~~~Stage 0 sporulation protein A~~~COG0745
MEKIKVCVADDNRELVSLLSEYIEGQEDMEVIGVAYNGQECLSLFKEKDPDVLVLDIIMPHLDGLAVLERLRESDLKKQP
NVIMLTAFGQEDVTKKAVDLGASYFILKPFDMENLVGHIRQVSGNASSVTHRAPSSQSSIIRSSQPEPKKKNLDASITSI
IHEIGVPAHIKGYLYLREAISMVYNDIELLGSITKVLYPDIAKKFNTTASRVERAIRHAIEVAWSRGNIDSISSLFGYTV
SMTKAKPTNSEFIAMVADKLRLEHKAS
>P52934 ~~~spo0A~~~Stage 0 sporulation protein A~~~
MSIKVCIADDNRELVSLLDEYISSQPDMEVIGTAYNGQDCLQMLEEKRPDILLLDIIMPHLDGLAVLERIRAGFEHQPNV
IMLTAFGQEDVTKKAVELGASYFILKPFDMENLAHHIRQVYGKTTPVVRKAAPAPQVRDNKPKNLDASITSIIHEIGVPA
HIKGYLYLREAIAMVYHDIELLGSITKVLYPDIAKKYNTTASRVERAIRHAIEVAWSRGNLESISSLFGYTVSVSKAKPT
NSEFIAMVADKLRLEHKAS
>P06535 2.7.-.-~~~spo0B~~~Sporulation initiation phosphotransferase B~~~COG3290
MKDVSKNQEENISDTALTNELIHLLGHSRHDWMNKLQLIKGNLSLQKYDRVFEMIEEMVIDAKHESKLSNLKTPHLAFDF
LTFNWKTHYMTLEYEVLGEIKDLSAYDQKLAKLMRKLFHLFDQAVSRESENHLTVSLQTDHPDRQLILYLDFHGAFADPS
AFDDIRQNGYEDVDIMRFEITSHECLIEIGLD
>P05043 3.1.3.-~~~spo0E~~~Aspartyl-phosphate phosphatase Spo0E~~~
MGGSSEQERLLVSIDEKRKLMIDAARKQGFTGHDTIRHSQELDCLINEYHQLMQENEHSQGIQGLVKKLGLWPRRDVMPA
YDANK
>P06628 2.7.-.-~~~spo0F~~~Sporulation initiation phosphotransferase F~~~COG2204
MMNEKILIVDDQYGIRILLNEVFNKEGYQTFQAANGLQALDIVTKERPDLVLLDMKIPGMDGIEILKRMKVIDENIRVII
MTAYGELDMIQESKELGALTHFAKPFDIDEIRDAVKKYLPLKSN
>P26497 ~~~spo0J~~~Stage 0 sporulation protein J~~~COG1475
MAKGLGKGINALFNQVDLSEETVEEIKIADLRPNPYQPRKHFDDEALAELKESVLQHGILQPLIVRKSLKGYDIVAGERR
FRAAKLAGLDTVPAIVRELSEALMREIALLENLQREDLSPLEEAQAYDSLLKHLDLTQEQLAKRLGKSRPHIANHLRLLT
LPENIQQLIAEGTLSMGHGRTLLGLKNKNKLEPLVQKVIAEQLNVRQLEQLIQQLNQNVPRETKKKEPVKDAVLKERESY
LQNYFGTTVNIKRQKKKGKIEIEFFSNEDLDRILELLSERES
>Q72H91 ~~~spo0C~~~Chromosome-partitioning protein Spo0J~~~COG1475
MSRKPSGLGRGLEALLPKTGAGVVRLPLASIRPNPRQPRKRFAEESLKELADSIREKGLLQPLLVRPQGDGYELVAGERR
YRAALMAGLQEVPAVVKDLTDREALELALVENLQREDLSPVEEARGYQALLEMGLTQEEVARRVGKARSTVANALRLLQL
PPEALEALERGEITAGHARALLMLEPEDRLWGLKEILEKGLSVRQAEALRERLAMAPKRSAEPSPLSLELSRHLGLPVRV
VGGKKGKVVIQYRSLEELEALLRRLGYQA
>P71088 ~~~spo0M~~~Sporulation-control protein spo0M~~~COG4326
MSFFKKLAASAGIGAAKVDTILEKDAYFPGEEVQGTVHVKGGKIAQDIRYIDLQLSTRYVIVKDDEEHRKYATIHSFRVT
GSFTIQPGEEHQFPFTFTLPLDTPITVGKVEVAVVTDLDIQGGIDKSDHDRIFVEAHPWIENVLEAIENLGFRLNEADCE
QAPYFQRRLPFVQEFEFVPTSGYYRQMLDELELIFLLDEDGLEIIFEVDRRARGLRGWLEEMYNDGEQLVRVRFSQSELE
DTEELEEVLEEILDQYAE
>P19471 ~~~~~~55.5 kDa and 49.5 kDa sporulation proteins~~~
MSREQRGPNEKLGTVLALAGISNAGLARRVNDLGAQRGLTLRYDKTSVARWVAKGMVPQGAAPHLIAAAIGAKLGRPVPL
HEIGLADADPAPEVGLAFPRDVGEAVRSATELYRLDLAGRRGGGGIWQSLAGSFSVSAYATPASRWLITPADPSVARDPT
AAQAAILGARGGGSGSPGGAGARGPGAAPDARGAHLLHPGGATPGAAGSVPLQPGPEAVADASPLRVGHSDVSKLREAAQ
DARRWDSKYGGGDWRSSMVPECLRVDAAPLLLGSYTDEVGRALFGASAELTRLAGWMAFDTGQQEAAQRYYIQALRLARA
AADVPLGGYVLASMSLQATYRGFADEGVDLAQAAVERNRGLATARTMSFFRLVEARAHAKAGDAPAAGAALKGAESWLER
SRAGDSDPPWLGFYGYDRFAADAAECYRDLKAPRQVRRFTEQALSRPTEEFVRSHGLRLVVSAVAELESGNLDAACAAGT
RAVEVAGRISSARTTEYVRDLLHRLEPYGDEPRVAELRERARPLLVTPG
>Q05308 3.4.21.-~~~~~~Serine protease 1~~~
MKCKKPSALFSALALVGALGAASVLGAASANSASPVAAATVQASSGSAKTSVAATSKSQDGDVLAAIVRDLKITKTQAKK
RIKLEEKARQLEPRLQKKLGKKFAGLWISKNGKKIVVGVTTKKAAKVVKKAGATPKIVKSNLTTLKKRATKISKNAPSDI
KNVNSWWVDPATNKVVIEARSKKAAKAAATAAGLTAGTYEITVSDDVIVPVRDYWGGDALSGCTLAFPVYGGFLTAGHCA
VEGKGHILKTEMTGGQIGTVEASQFGDGIDAAWAKNYGDWNGRGRVTHWNGGGGVDIKGSNEAAVGAHMCKSGRTTKWTC
GYLLRKDVSVNYGNGHIVTLNETSACALGGDSGGAYVWNDQAQGITSGSNMDTNNCRSFYQPVNTVLNKWKLSLVTSTDV
TTSYVQGYQNNCIDVPNSDFTDGKQLQVWNCNGTNAQKVSFHPDGTLRINGKCLDARWAWTHNGTEVQLMNCNGHIAQKF
TLNGAGDLVNVHANKCVDVKDWGGQGGKLQLWECSGGANQKWWRR
>Q06823 ~~~hspA~~~Spore protein SP21~~~COG0071
MADLSVRRGTGSTPQRTREWDPFQQMQELMNWDPFELANHPWFANRQGPPAFVPAFEVRETKEAYIFKADLPGVDEKDIE
VTLTGDRVSVSGKREREKREESERFYAYERSFGSFSRAFTLPEGVDGDNVRADLKNGVLTLTLPKRPEVQPKRIQVASSG
TEQKEHIKA
>P10727 ~~~spoIIAA~~~Anti-sigma F factor antagonist~~~COG1366
MSLGIDMNVKESVLCIRLTGELDHHTAETLKQKVTQSLEKDDIRHIVLNLEDLSFMDSSGLGVILGRYKQIKQIGGEMVV
CAISPAVKRLFDMSGLFKIIRFEQSEQQALLTLGVAS
>O32726 ~~~spoIIAA~~~Anti-sigma F factor antagonist~~~
MSLAIDLEVKQDELIVRLSGELDHHTAENCMNKCRMCLEKRAIRHIVLNLGQLTFMDSSGLGVILGRYKQIKNVGGQMVV
CAVSPAVKRLFDMSGLFKIIRVEADEQFALQALGVA
>O32723 ~~~spoIIAA~~~Anti-sigma F factor antagonist~~~
MHFQLEMVTRETVVIRLFGELDHHAVEQIRAKISAAIFQGTVTTIIWNLEGLSFMDSSGVGLVLGRMRELEAVAGRTILL
NPSPTMRKVFQFSGLGPWMMDATEEQAIDRVRGIVNG
>P10728 2.7.11.1~~~spoIIAB~~~Anti-sigma F factor~~~COG2172
MKNEMHLEFSALSQNESFARVTVASFIAQLDPTMDELTEIKTVVSEAVTNAIIHGYEENCEGKVYISVTLEDHVVYMTIR
DEGLGITDLEEARQPLFTTKPELERSGMGFTIMENFMDDVSIDSSPEMGTTIRLTKHLSKSKALCN
>O32727 2.7.11.1~~~spoIIAB~~~Anti-sigma F factor~~~
MRNEMHLQFSARSENESFARVTVAAFVAQLDPTTDELTEIKTVVSEAVTNAIIHGYNNDPNGIVSISVIIEDGVVHLTVR
DEGVGIPDIEEARQPLFTTKPELERSGMGFTIMENFMDEVIVESEVNKGTTVYLKKAYCEKQTLCN
>P37575 ~~~spoIIB~~~Stage II sporulation protein B~~~
MKKRKNKKNSKAAEKALKVTINGKEETVYEQETPETEANKSMTFSNWEEKRQAEQEVAASQEHPDEDEFNWDSEEDKVFK
EDPKVVPPFQKKKTKLYAKGKTGAAKPVKRVAATIAFAAVIGTGLGLFALNISGNKEASAPASLEDSLGSQTAKAGDTSA
DKQTSGAEKQAAQTEGTYKTYAVQAGKFSNEKGAETLTEQLTEKGYSAVSLSKDDGYTYVIAGLASEKEVSQQLGQVLID
SDFEAWGGKELSLSIESDMTDSFKETAELAAKAILDEDITKASVEKIEKSLGETKASETGEKKAILQALKELEDPSAEAG
WKAQQELLAVVK
>P37475 3.1.3.16~~~spoIIE~~~Stage II sporulation protein E~~~COG2208
MEKAERRVNGPMAGQALEKLQSFFNRGTKLVTHHLHSLFFYKGFIYVVIGFLLGRAFILSEVLPFALPFFGAMLLIRRDK
AFYAVLAVLAGALTISPKHSLLILAALLAFFVFSKVAAFITDDRVKALPIVVFFSMAAARAGFVYAQNGVFTTYDYVMAI
VEAGLSFILTLIFLQSLPIFTVKKVKQSLKIEEIICFMILIASVLTGLAGLSYQGMQAEHILARYVVLSFSFIGGASIGC
TVGVVTGLILGLANIGNLYQMSLLAFSGLLGGLLKEGKKAGAAIGLIVGSLLISLYGEGSAGLMTTLYESLIAVCLFLLT
PQSITRKVARYIPGTVEHLQEQQQYARKIRDVTAQKVDQFSNVFHALSESFATFYQASDEQTDDSEVDLFLSKITEHSCQ
TCYKKNRCWVQNFDKTYDLMKQVMLETEEKEYASNRRLKKEFQQYCSKSKQVEELIEDELAHHHAHLTLKKKVQDSRRLV
AEQLLGVSEVMADFSREIKREREQHFLQEEQIIEALQHFGIEIQHVEIYSLEQGNIDIEMTIPFSGHGESEKIIAPMLSD
ILEEQILVKAEQHSPHPNGYSHVAFGSTKSYRVSTGAAHAAKGGGLVSGDSYSMMELGARKYAAAISDGMGNGARAHFES
NETIKLLEKILESGIDEKIAIKTINSILSLRTTDEIYSTLDLSIIDLQDASCKFLKVGSTPSFIKRGDQVMKVQASNLPI
GIINEFDVEVVSEQLKAGDLLIMMSDGIFEGPKHVENHDLWMKRKMKGLKTNDPQEIADLLMEEVIRTRSGQIEDDMTVV
VVRIDHNTPKWASIPVPAIFQNKQEIS
>P13801 3.4.23.-~~~spoIIGA~~~Sporulation sigma-E factor-processing peptidase~~~
MKIYLDVIWLLNFCFDALLLLLTAFILKRHVKKRRLVGGAFIGSSIVLLMFTPFSPIVEHPAGKLAFSVVIVVVTFGFKR
FRFFFQNLFSFYFATFLMGGGIIGAHSLLQSNSIVQNGVMITNQTGFGDPISWLFIVGGFPALWFFSKRRIEDIETKNIQ
YEERVSVQADLGSQTLHVRGLIDSGNQLYDPLTKTPVMIIYIDKLEPIFGTAETMIIRNTDPLEAIEQLDDSFRFLDKMR
LIPYRGVGQQNQFLLCVKPDHVTIMTKEEMISADKCLIGISTTKLSADGEFDAIIHPKMLSGKAVKHVS
>P37873 ~~~spoIIM~~~Stage II sporulation protein M~~~COG1300
MRKISYKDMFLRHVKDHLSLYIFVSVLFFMGVIFGAIIVNSMTISQKEDLYYYLSQFFGQLSDGKQASSADMFGQSIFHN
AKYLGLMWILGISVIGMPIIFIMIFLKGIVVGFTVGFLVNQMGVSGFFLSFVSVLPQNVLLIPAYLIMGTCAIAFSLKLI
RQLFVKRSLHDAPIQWFGRYAFVLLVILFLALISSLFEAYLSPVLMEKLTSRLF
>P71044 ~~~spoIIQ~~~Stage II sporulation protein Q~~~COG0739
MREEEKKTSQVKKLQQFFRKRWVFPAIYLVSAAVILTAVLWYQSVSNDEVKDQLADNGGNSAYDNNDDAVEVGKSMENVA
MPVVDSENVSVVKKFYETDAAKEEKEAALVTYNNTYSLSKGIDLAEKDGKDFDVSASLSGTVVKAEKDPVLGYVVEVEHA
DGLSTVYQSLSEVSVEQGDKVKQNQVIGKSGKNLYSEDSGNHVHFEIRKDGVAMNPLNFMDKPVSSIEKAATQETEESIQ
QSSEKKDGSTEKGTEEKSGEKKDDSTDKSGSKESSTTEDTEQS
>Q81QD6 ~~~spoIISA~~~Stage II sporulation protein SA~~~
MSLVISNIRIGLFILAIVFLVLVFFYWRNEELYEEKKQRIRKTWYGLFIVSVTVYFMIKGIDLTLWKNLLMFTAMVIFVD
IAFILTPNISEIWGAKFSDIGKTVQSIKRSLIASKARGEIYTTIIQNVNPAVFGTMEWHTEEEYTKSLNAFLDSYGEKIG
AKIVVFEAAKELNTNFRGIRSQFSTIIPLEYIEQLNEQRAVQVENVGIIPAKIVSDVFIVIDGKKNNLQDRDFENVYNLT
IHHSYFSK
>Q81DD1 ~~~spoIISA~~~Stage II sporulation protein SA~~~
MISNIRIGLFVLAIVFVVLVFFYWKNEELYEEKKQRIRKTWYGLFIISVTVYFMIKGIDLTLWKNLLMFTAMVIFVDIAF
ILTPNISEIWGAKFSDIGKTVQSIKRSLIASKARGEIYTTIIQNVNPAVFGTMEWHTEEEYTKSLNAFLDSYGEKIGAKI
VVFEAAKELNTNFRGIRSQFSIIVPLEHIEQLNEQKAVQVENVGIIPAKIVSDVFIVIDGKKNNLQDRDFENVYNLTIHH
SYFSK
>O34853 ~~~spoIISA~~~Stage II sporulation protein SA~~~
MVLFFQIMVWCIVAGLGLYVYATWRFEAKVKEKMSAIRKTWYLLFVLGAMVYWTYEPTSLFTHWERYLIVAVSFALIDAF
IFLSAYVKKLAGSELETDTREILEENNEMLHMYLNRLKTYQYLLKNEPIHVYYGSIDAYAEGIDKLLKTYADKMNLTASL
CHYSTQADKDRLTEHMDDPADVQTRLDRKDVYYDQYGKVVLIPFTIETQNYVIKLTSDSIVTEFDYLLFTSLTSIYDLVL
PIEEEGEG
>Q81DD0 ~~~spoIISB~~~Stage II sporulation protein SB~~~
MAEVNVQKSSFFKEKKEESNTDFSLVKGALTENINRLEKLMNNSSSKYIQVKRTKENA
>O34800 ~~~spoIISB~~~Stage II sporulation protein SB~~~
MERAFQNRCEPRAAKPFKILKKRSTTSVASYQVSPHTARIFKENERLIDEYKRKKA
>O06875 ~~~~~~Probable sugar-binding periplasmic protein~~~
MHKLLKLAAMGTAACALLAGMAPVANAQEKQNVEVLHWWTSGGEASALEVLKKDLESKGISWTDMPVAGGGGTEAMTVLR
ARVTAGNAPTAVQMLGFDIRDWAEQGALGNLDTVASKEGWEKVIPAPLQEFAKYDGHWIAAPVNIHSTNWMWINKAALDK
AGGKEPTNWDELIALLDNFKAQGITPIAHGGQPWQDATIFDAVVLSFGPDFYKKAFIDLDPEALGSDTMKQAFDRMSKLR
TYVDDNFSGRDWNLASAMVIEGKAGVQFMGDWAKGEFLKAGKKPGEDFVCMRYPGTQGAVTFNSDMFAMFKVSEDKVPAQ
LEMASAIESPAFQSAFNVVKGSAPARTDVPDTAFDACGKKAIADVKEANSKGTLLGSMAHGYANPAAVKNAIYDVVTRQF
NGQLSSEDAVKELVAAVEAAK
>Q01368 ~~~spoIIIAB~~~Stage III sporulation protein AB~~~
MLKLLGAVFIVVATTWTGFEMAKIYTERPRQIRQLRAALQSLEAEIMYGHTPLHTASQQIAKQLAQPVSTLFSAFSDQLD
KGSDSAKTAWEQSLKKVWDTLSLKKSEYEVLKQFGETLGIHDRISQQKHIKLALTHLEASEADAEQAQAKNEKMIKSLGF
LAGLLLILLLM
>P49782 ~~~spoIIIAE~~~Stage III sporulation protein AE~~~
MKRFQWVLLLAVLIIAGRAEIVQAAGNAEQTEDHAETAEQLAERTAASLETDKIGEFWNDIMTEYGGLLPESQKGSLMEF
INGDKSFSPQEWLKALFSYLFHEVLANGKLLGTLILLTIFCVILQLLQNAFQQSTVSKVAYSIVYMVLIILALNSFHVAI
NYATEAIQTMTSFILALIPLLLALLASSGGAVSAAFFHPVILFLMNTSGLLIQNIVMPLIFLSAILSIVSTMTEQYKVTQ
LANLLRNIAIGALAVFLTIFLGVISVQGASAAVTDGITLRTAKFITGNFIPVLGRMFTDATDTVISASLLLKNTVGILGV
AILICIAAFPAIKVLSLAFIYKLAAAILQPLGGGPVITCLDVISKSVIYIFAALAIVSLMFFLSLTVIITAGNLTMMMK
>P49783 ~~~spoIIIAF~~~Stage III sporulation protein AF~~~
MSFLTEWLTTIVLFILFAIVIDMLLPSSSMQKYAKMVVSLLLIVVMLTPIFKLFKTDPEVIFEYLTKNGQSESADIKNQI
NSKKIEIQASQRAYILEEMAVQLKKKAEERFSHDEYKVGRIKLTAGEKVDSEEDIKTISVYMAPSSEKTVQTVAPVHIDT
DHAYVTKEAAEQKEAKQIQTQLADIWEIGSEKITVHMEGGESVGNE
>P49785 ~~~spoIIIAH~~~Stage III sporulation protein AH~~~
MLKKQTVWLLTMLSLVVVLSVYYIMSPESKNAVQMQSEKSASDSGEVATEKAPAKQDTKEKSGTETEKGKEDGTKGTKDS
SADKETSAEASEKGTVVTETADDDLFTTYRLDLEDARSKEREELNAIVSSDDATAKEKSEAYDKMTALSEVEGTEKQLET
LIKTQGYEDALVNAEGDKINITVKSDKHSKSKATAIIDLVAKEIKTMKDVAVTFEPSK
>P15281 ~~~spoIIID~~~Stage III sporulation protein D~~~COG1609
MHDYIKERTIKIGKYIVETKKTVRVIAKEFGVSKSTVHKDLTERLPEINPDLANEVKEILDYHKSIRHLRGGEATKLKYK
KDEILEGEPVQQS
>P21458 ~~~spoIIIE~~~DNA translocase SpoIIIE~~~COG1674
MAKKKRKSRKKQAKQLNIKYELNGLLCIAISIIAILQLGVVGQTFIYLFRFFAGEWFILCLLGLLVLGVSLFWKKKTPSL
LTRRKAGLYCIIASILLLSHVQLFKNLTHKGSIESASVVRNTWELFLMDMNGSSASPDLGGGMIGALLFAASHFLFASTG
SQIMAIVMILIGMILVTGRSLQETLKKWMSPIGRFIKEQWLAFIDDMKSFKSNMQSSKKTKAPSKKQKPARKKQQMEPEP
PDEEGDYETVSPLIHSEPIISSFSDRNEEEESPVIEKRAEPVSKPLQDIQPETGDQETVSAPPMTFTELENKDYEMPSLD
LLADPKHTGQQADKKNIYENARKLERTFQSFGVKAKVTQVHLGPAVTKYEVYPDVGVKVSKIVNLSDDLALALAAKDIRI
EAPIPGKSAIGIEVPNAEVAMVSLKEVLESKLNDRPDAKLLIGLGRNISGEAVLAELNKMPHLLVAGATGSGKSVCVNGI
ITSILMRAKPHEVKMMMIDPKMVELNVYNGIPHLLAPVVTDPKKASQALKKVVNEMERRYELFSHTGTRNIEGYNDYIKR
ANNEEGAKQPELPYIVVIVDELADLMMVASSDVEDSITRLSQMARAAGIHLIIATQRPSVDVITGVIKANIPSRIAFSVS
SQTDSRTILDMGGAEKLLGRGDMLFLPVGANKPVRVQGAFLSDDEVEKVVDHVITQQKAQYQEEMIPEETTETHSEVTDE
LYDEAVELIVGMQTASVSMLQRRFRIGYTRAARLIDAMEERGVVGPYEGSKPREVLLSKEKYDELSS
>Q81SW4 3.6.1.-~~~spoIVA~~~Stage IV sporulation protein A~~~COG0699
MEKVDIFKDIAERTGGDIYFGVVGAVRTGKSTFIKKFMELVVIPNIENESDRQRAQDELPQSAAGRTIMTTEPKFVPNQA
VSIEVDEGLEVNIRLVDCVGYTVPGAKGYEDENGPRMINTPWYEEPIPFHEAAEIGTRKVIQEHSTIGVVITTDGTIGEI
PRRDYIEAEERVVNELKEVGKPFIMIINTVQPYHPDTEQLRQSLSEEYDIPVIAMSVESLRETDVYNVLREALFEFPVLE
VNVNLPSWVMVLNEGHWLRQSYQEAVQETVKDIKRLRDVDRVVWQFSQYEFIDRASLAGIDMGQGVAEIDLYAPDELYDQ
ILKEVVGVEIRGKDHLLKLMLDLSHAKIEYDQVADALRMVKQTGYGVAAPALADMSLDEPEIIRHGSRFGVKLKAVAPSI
HMIKVDVESTFEPIIGTEKQSEELVRYLMQDFEDDPLSIWNSDIFGRSLSSIVREGIQAKLSLMPENARYKLKETLERII
NEGSGGLIAIIL
>P35149 3.6.1.-~~~spoIVA~~~Stage IV sporulation protein A~~~COG0699
MEKVDIFKDIAERTGGDIYLGVVGAVRTGKSTFIKKFMELVVLPNISNEADRARAQDELPQSAAGKTIMTTEPKFVPNQA
MSVHVSDGLDVNIRLVDCVGYTVPGAKGYEDENGPRMINTPWYEEPIPFHEAAEIGTRKVIQEHSTIGVVITTDGTIGDI
ARSDYIEAEERVIEELKEVGKPFIMVINSVRPYHPETEAMRQDLSEKYDIPVLAMSVESMRESDVLSVLREALYEFPVLE
VNVNLPSWVMVLKENHWLRESYQESVKETVKDIKRLRDVDRVVGQFSEFEFIESAGLAGIELGQGVAEIDLYAPDHLYDQ
ILKEVVGVEIRGRDHLLELMQDFAHAKTEYDQVSDALKMVKQTGYGIAAPALADMSLDEPEIIRQGSRFGVRLKAVAPSI
HMIKVDVESEFAPIIGTEKQSEELVRYLMQDFEDDPLSIWNSDIFGRSLSSIVREGIQAKLSLMPENARYKLKETLERII
NEGSGGLIAIIL
>Q182W3 3.6.1.-~~~spoIVA~~~Stage IV sporulation protein A~~~COG0699
MNNNIYEDISKRTQGDIYIGVVGPVRTGKSTFIRKFMEKLVIPNIDNEFKKDRTRDEIPQSGSGKTIMTVEPKFVPADGV
EIKIKDTVSLKVRMVDCVGYIVEGALGHEEGGKQRLVSTPWSQEAMTFEKAAEIGTKKVIKDHSTIGIVVLTDGSVTGID
RKSYVEPEERVIQELKNLKKPFAVVLNTLSPKSEETSMLRSELEEKYEVPVLPMNVVEMEEEDIEEVMEAVLYDFPLTEI
RINLPKWVEGLERNHWIKSSIITTLKQSIIDIGKIRDIEGIIQGFSELEFLEDTGVDNVELGEGVINIDLQTKQDLFYNV
LEEKSGFKIEGDYQLLSLITRLSKVKNEYDKIESALIDAKIKGYGVVAPSLEELSLEEPEIMKQGKQYGIKLKANAPSLH
IIKADISTEVSPIVGNQNQGEEMIKYLMEVFEEQPADLWESNMFGKSLHDLVKEQLQSKLYTMPEEIRVKMQKTLQKIVN
EGSSNIITILL
>P17896 3.4.21.116~~~spoIVB~~~SpoIVB peptidase~~~COG0750
MPDNIRKAVGLILLVSLLSVGLCKPLKEYLLIPTQMRVFETQTQAIETSLSVNAQTSESSEAFTVKKDPHEIKVTGKKSG
ESELVYDLAGFPIKKTKVHVLPDLKVIPGGQSIGVKLHSVGVLVVGFHQINTSEGKKSPGETAGIEAGDIIIEMNGQKIE
KMNDVAPFIQKAGKTGESLDLLIKRDKQKIKTKLIPEKDEGEGKYRIGLYIRDSAAGIGTMTFYEPKTKKYGALGHVISD
MDTKKPIVVENGEIVKSTVTSIEKGTGGNPGEKLARFSSERKTIGDINRNSPFGIFGTLHQPIQNNISDQALPVAFSTEV
KKGPAEILTVIDDDKVEKFDIEIVSTTPQKFPATKGMVLKITDPRLLKETGGIVQGMSGSPIIQNGKVIGAVTHVFVNDP
TSGYGVHIEWMLSEAGIDIYGKEKAS
>P26936 ~~~spoIVFA~~~Stage IV sporulation protein FA~~~COG0739
MSHRADEIRKRLEKRRKQLSGSKRFSTQTVSEKQKPPSWVMVTDQEKHGTLPVYEDNMPTFNGKHPLVKTDSIILKCLLS
ACLVLVSAIAYKTNIGPVSQIKPAVAKTFETEFQFASASHWFETKFGNPLAFLAPEHKNKEQQIEVGKDLIAPASGKVQQ
DFQDNGEGIKVETSSDKIDSVKEGYVVEVSKDSQTGLTVKVQHADNTYSIYGELKDVDVALYDFVDKGKKLGSIKLDDHN
KGVYYFAMKDGDKFIDPIQVISFE
>P26937 3.4.24.-~~~spoIVFB~~~Stage IV sporulation protein FB~~~COG1994
MNKWLDLILKIHVHPFLWIIAALGLLTGHMKALLCLLLIVLIHELGHAALAVFFSWRIKRVFLLPFGGTVEVEEHGNRPL
KEEFAVIIAGPLQHIWLQFAAWMLAEVSVIHQHTFELFTFYNLSILFVNLLPIWPLDGGKLLFLLFSKQLPFQKAHRLNL
KTSLCFCLLLGCWVLFVIPLQISAWVLFVFLAVSLFEEYRQRHYIHVRFLLERYYGKNRELEKLLPLTVKAEDKVYHVMA
EFKRGCKHPIIIEKSGQKLSQLDENEVLHAYFADKRTNSSMEELLLPY
>P40869 ~~~spoVAD~~~Stage V sporulation protein AD~~~COG0332
MKLTGKQTWVFEHPIFVNSAGTAAGPKEKDGPLGSLFDKTYDEMHCNQKSWEMAERQLMEDAVNVALQKNNLTKDDIDLL
LAGDLLNQNVTANYVARHLKIPFLCMFGACSTSMETVAVASALVDGGFAKRALAATSSHNATAERQFRYPTEYGGQKPDT
ATSTVTGSGAVVISQTPGDIQITSATVGKVSDLGITDPFDMGSAMAPAAADTIKQHFKDLNRTADDYDLILTGDLSGVGS
PIVKDILKEDGYPVGTKHDDCGLLIYTPDQQVFAGGSGCACSAVVTYSHIFKQLREGKLNRVFVVATGALLSPTMIQQKE
TIPTIAHGVVFERAGGAS
>Q00758 ~~~spoVB~~~Stage V sporulation protein B~~~COG2244
MAKQTFLKGTLILIAAGMVTRMLGFVNRVVIARFIGEEGVGLYMMAAPTFFLATTLTQFGLPVAISKLVAEASARGDHQK
TKNILVMSLTITGVLSLIFTPLFLFFAPVMAETMLTDKRTLYPLLAITPVVPIIAISSVLRGYFQGKQNMNPLAMSQVLE
QVVRISLVAVCTTIFLPYGIEYAAAGAMLSSVAGELASLLYLFVCFKYKKTIKIRKHFLQSIKNGKQTFTQLMSVSLPTT
GSRFIGNLSWFFEPIVVAQSLAIAGVATVAATKQYGELTGFAMTLLTLPSFITYSLSTALVPAISEGMEQKKLQVVEYRL
EQAMRLCLLSGGISVVILFVFADELMRVMYGSSGAAVFIKVMAPFFLLYYFQGPLQAVLQALNLAGAAMMNSLIGALVKT
GLIFVLATRPSLGIMGAALAIVTGMVLVTLLHAATVSKVLPISIKIKEYALSFAVIVICGFISSAIKQYISFGASEAVNL
AGWIAASAAIYMILLLVFRLIKKDELRRIPIIGRLIIR
>Q03524 3.4.16.4~~~spoVD~~~Stage V sporulation protein D~~~COG0768
MRVSNVTVRKRLLFVLLFGVIVFLIIDTRLGYVQFVMGEKLTSLAKDSWSRNLPFEPERGEILDRNGVKLATNKSAPTVF
VVPRQVQNPMKTSKQLAAVLNMSEEKVYKHVTKKASIEKITPEGRKISNEKAKEIKALDLKGVYVAEDSIRHYPFGSFLS
HVLGFAGIDNQGLLGLEAYYDDDLKGEKGSVKFYTDAKGKKMPDEADDYTPPKDGLDMKLTVDSKVQTIMERELDNAEAK
YHPDGMIAVAMNPKNGEILGMSSRPDFDPADYQSVDPSVYNRNLPVWSTYEPGSTFKIITLAAALEEQKVNLKRDQFYDK
GHAEVDGARLRCWKRGGHGLQTYLEVVQNSCNPGFVELGERLGKEKLFKYIKDFGFGQKTGIDLQGEGTGILFPLERVGP
VEQATTAFGQGVSVTPIQQVAAVSAAVNGGTLYTPYIAKEWIDPVTKKTVKKQSPIAKKQVISEETSKQIRYALESVVAE
GTGRNAFVEGYRVGGKTGTAQKVKDGKYLENNHIVSFIGFAPADDPSLVVYVAVDNPKGTIQFGGTVAAPIVGNIMRDSL
PELGVKPRKNQIEKKYQWNDTKTIEVPNVVGMSVSDLESLLVNLNVDASGKGSKIVKQSPAAGTKVKEGSKIRVYLTEED
EKEAAD
>P28015 ~~~spoVG~~~Putative septation protein SpoVG~~~COG2088
MEVTDVRLRRVNTDGRMRAIASITLDHEFVVHDIRVIDGNNGLFVAMPSKRTPDGEFRDITHPINSSTRGKIQDAVLNEY
HRLGDTEALEFEEAGAS
>P28016 ~~~spoVG~~~Putative septation protein SpoVG~~~COG2088
MEVTDVRLRRVNTEGRMRAIASITLDGEFVVHDIRVIDGNNGLFVAMPSKRTPDGEFRDIAHPINSNHRGKIQDAVLAEY
HRLGEVEVEFEEAGAS
>Q7A7B5 ~~~spoVG~~~Putative septation protein SpoVG~~~
MKVTDVRLRKIQTDGRMKALVSITLDEAFVIHDLRVIEGNSGLFVAMPSKRTPDGEFRDIAHPINSDMRQEIQDAVMKVY
DETDEVVPDKNATSEDSEEA
>Q8CML1 ~~~spoVG~~~Putative septation protein SpoVG~~~COG2088
MKVTDVRLRKIQTDGRMKALVSITLDEAFVIHDLRVIEGNSGLFVAMPSKRTPDGEFRDIAHPINSDMRQEIQDAVMKVY
DETDEVIPDKNATSDNEESDEA
>P27643 ~~~spoVK~~~Stage V sporulation protein K~~~COG0464
MLERAVTYKNNGQINIILNGQKQVLTNAEAEAEYQAALQKNEAKHGILKEIEKEMSALVGMEEMKRNIKEIYAWIFVNQK
RAEQGLKVGKQALHMMFKGNPGTGKTTVARLIGKLFFEMNVLSKGHLIEAERADLVGEYIGHTAQKTRDLIKKSLGGILF
IDEAYSLARGGEKDFGKEAIDTLVKHMEDKQHEFILILAGYSREMDHFLSLNPGLQSRFPISIDFPDYSVTQLMEIAKRM
IDEREYQLSQEAEWKLKDYLMTVKSTTSPIKFSNGRFVRNVIEKSIRAQAMRLLMGDQYLKSDLMTIKSQDLSIKEEASG
SA
>P45693 ~~~spoVS~~~Stage V sporulation protein S~~~COG2359
MEILKVSAKSSPNSVAGALAGVLRERGAAEIQAIGAGALNQAVKAVAIARGFVAPSGVDLICIPAFTDIQIDGEERTAIK
LIVEPR
>P37963 ~~~spoVID~~~Stage VI sporulation protein D~~~COG1388
MPQNHRLQFSVEESICFQKGQEVSELLSISLDPDIRVQEVNDYVSIIGSLELTGEYNIDQNKHTEEIYTDKRFVEQVRKR
EDGSAELTHCFPVDITIPKNKVSHLQDVFVFIDAFDYQLTDSRILTIQADLAIEGLLDDTQDKEPEIPLYEAPAAFREEE
LSEPPAHSVVEEPGASSAEEAVLQHEPPAEPPELFISKAGLREELETEKAESEPPESVASEPEAREDVKEEEESEELAVP
ETEVRAESETEESEPEPDPSEIEIQEIVKAKKETAEPAAAIADVREEADSPAETELREHVGAEESPALEAELHSETVIAK
EKEETTVSPNHEYALRQEAQNEEAAQSDQADPALCQEEAEPDEALESVSEAALSIEDSRETASAVYMENDNADLHFHFNQ
KTSSEEASQEELPEPAYRTFLPEQEEEDSFYSAPKLLEEEEQEEESFEIEVRKTPSAEEPKEETPFQSFQLPESSETERK
ETDAVPRVAPAAETKEPQTKESDNSLYLTKLFTKEADEFSRMKICIVQQEDTIERLCERYEITSQQLIRMNSLALDDELK
AGQILYIPQYKNSHA
>P0A1N0 ~~~spaK~~~Surface presentation of antigens protein SpaK~~~
MQHLDIAELVRSALEVSGCDPSLIGGIDSHSTIVLDLFALPSICISVKDDDVWIWAQLGADSMVVLQQRAYEILMTIMEG
CHFARGGQLLLGEQNGELTLKALVHPDFLSDGEKFSTALNGFYNYLEVFSRSLMR
>P35530 ~~~spaK~~~Surface presentation of antigens protein SpaK~~~
MSNINLVQLVRDSLFTIGCPPSIITDLDSHSAITISLDSMPAINIALVNEQVMLWANFDAPSDVKLQSSAYNILNLMLMN
FSYSINELVELHRSDEYLQLRVVIKDDYVHDGIVFAEILHEFYQRMEILNGVL
>P0A1K5 ~~~spaN~~~Surface presentation of antigens protein SpaN~~~
MALDNINLNFSSDKQIEKCEKLSSIDNIDSLVLKKKRKVEIPEYSLIASNYFTIDKHFEHKHDKGEIYSGIKNAFELRNE
RATYSDIPESMAIKENILIPDQDIKAREKINIGDMRGIFSYNKSGNADKNFERSHTSSVNPDNLLESDNRNGQIGLKNHS
LSIDKNIADIISLLNGSVAKSFELPVMNKNTADITPSMSLQEKSIVENDKNVFQKNSEMTYHFKQWGAGHSVSISVESGS
FVLKPSDQFVGNKLDLILKQDAEGNYRFDSSQHNKGNKNNSTGYNEQSEEEC
>P40699 ~~~spaO~~~Surface presentation of antigens protein SpaO~~~
MSLRVRQIDRREWLLAQTATECQRHGREATLEYPTRQGMWVRLSDAEKRWSAWIKPGDWLEHVSPALAGAAVSAGAEHLV
VPWLAATERPFELPVPHLSCRRLCVENPVPGSALPEGKLLHIMSDRGGLWFEHLPELPAVGGGRPKMLRWPLRFVIGSSD
TQRSLLGRIGIGDVLLIRTSRAEVYCYAKKLGHFNRVEGGIIVETLDIQHIEEENNTTETAETLPGLNQLPVKLEFVLYR
KNVTLAELEAMGQQQLLSLPTNAELNVEIMANGVLLGNGELVQMNDTLGVEIHEWLSESGNGE
>P0A1K9 ~~~spaO~~~Surface presentation of antigens protein SpaO~~~
MLRIKHFDANEKLQILYAKQLCERFSIQTFKNKFTGSESLVTLTSVCGDWVIRIDTLSFLKKKYEVFSGFSTQESLLHLS
KCVFIESSSVFSIPELSDKITFRITNEIQYATTGSHLCCFSSSLGIIYFDKMPVLRNQVSLDLLHHLLEFCLGSSNVRLA
TLKRIRTGDIIIVQKLYNLLLCNQVIIGDYIVNDNNEAKINLSESNGESEHTEVSLALFNYDDINVKVDFILLEKNMTIN
ELKMYVENELFKFPDDIVKHVNIKVNGSLVGHGELVSIEDGYGIEISSWMVKE
>P40700 ~~~spaP~~~Surface presentation of antigens protein SpaP~~~
MGNDISLIALLAFSTLLPFIIASGTCFVKFSIVFVMVRNALGLQQIPSNMTLNGVALLLSMFVMWPIMHDAYVYFEDEDV
TFNDISSLSKHVDEGLDGYRDYLIKYSDRELVQFFENAQLKRQYGEETETVKRDKDEIEKPSIFALLPAYALSEIKSAFK
IGFYLYLPFVVVDLVVSSVLLALGMMMMSPVTISTPIKLVLFVALDGWTLLSKGLILQYMDIAT
>P0A1L3 ~~~spaP~~~Surface presentation of antigens protein SpaP~~~
MLSDMSLIATLSFFTLLPFLVAAGTCYIKFSIVFVMVRNALGLQQVPSNMTLNGIALIMALFVMKPIIEAGYENYLNGPQ
KFDTISDIVRFSDSGLMEYKQYLKKHTDLELARFFQRSEEENADLKSAENNDYSLFSLLPAYALSEIKDAFKIGFYLYLP
FVVVDLVISSILLALGMMMMSPITISVPIKLVLFVALDGWGILSKALIEQYINIPA
>P23504 ~~~spaP~~~Cell surface antigen I/II~~~COG3064
MKVKKTYGFRKSKISKTLCGAVLGTVAAVSVAGQKVFADETTTTSDVDTKVVGTQTGNPATNLPEAQGSASKEAEQSQNQ
AGETNGSIPVEVPKTDLDQAAKDAKSAGVNVVQDADVNKGTVKTAEEAVQKETEIKEDYTKQAEDIKKTTDQYKSDVAAH
EAEVAKIKAKNQATKEQYEKDMAAHKAEVERINAANAASKTAYEAKLAQYQADLAAVQKTNAANQAAYQKALAAYQAELK
RVQEANAAAKAAYDTAVAANNAKNTEIAAANEEIRKRNATAKAEYETKLAQYQAELKRVQEANAANEADYQAKLTAYQTE
LARVQKANADAKAAYEAAVAANNAKNAALTAENTAIKQRNENAKATYEAALKQYEADLAAVKKANAANEADYQAKLTAYQ
TELARVQKANADAKAAYEAAVAANNAANAALTAENTAIKKRNADAKADYEAKLAKYQADLAKYQKDLADYPVKLKAYEDE
QASIKAALAELEKHKNEDGNLTEPSAQNLVYDLEPNANLSLTTDGKFLKASAVDDAFSKSTSKAKYDQKILQLDDLDITN
LEQSNDVASSMELYGNFGDKAGWSTTVSNNSQVKWGSVLLERGQSATATYTNLQNSYYNGKKISKIVYKYTVDPKSKFQG
QKVWLGIFTDPTLGVFASAYTGQVEKNTSIFIKNEFTFYDEDGKPINFDNALLSVASLNRENNSIEMAKDYTGKFVKISG
SSIGEKNGMIYATDTLNFRQGQGGARWTMYTRASEPGSGWDSSDAPNSWYGAGAIRMSGPNNSVTLGAISSTLVVPADPT
MAIETGKKPNIWYSLNGKIRAVNVPKVTKEKPTPPVKPTAPTKPTYETEKPLKPAPVAPNYEKEPTPPTRTPDQAEPNKP
TPPTYETEKPLEPAPVEPSYEAEPTPPTRTPDQAEPNKPTPPTYETEKPLEPAPVEPSYEAEPTPPTPTPDQPEPNKPVE
PTYEVIPTPPTDPVYQDLPTPPSVPTVHFHYFKLAVQPQVNKEIRNNNDINIDRTLVAKQSVVKFQLKTADLPAGRDETT
SFVLVDPLPSGYQFNPEATKAASPGFDVTYDNATNTVTFKATAATLATFNADLTKSVATIYPTVVGQVLNDGATYKNNFT
LTVNDAYGIKSNVVRVTTPGKPNDPDNPNNNYIKPTKVNKNENGVVIDGKTVLAGSTNYYELTWDLDQYKNDRSSADTIQ
KGFYYVDDYPEEALELRQDLVKITDANGNEVTGVSVDNYTNLEAAPQEIRDVLSKAGIRPKGAFQIFRADNPREFYDTYV
KTGIDLKIVSPMVVKKQMGQTGGSYENQAYQIDFGNGYASNIIINNVPKINPKKDVTLTLDPADTNNVDGQTIPLNTVFN
YRLIGGIIPADHSEELFEYNFYDDYDQTGDHYTGQYKVFAKVDITFKDGSIIKSGAELTQYTTAEVDTAKGAITIKFKEA
FLRSVSIDSAFQAESYIQMKRIAVGTFENTYINTVNGVTYSSNTVKTTTPEDPTDPTDPQDPSSPRTSTVINYKPQSTAY
QPSSVQETLPNTGVTNNAYMPLLGIIGLVTSFSLLGLKAKKD
>P0A1L7 ~~~spaQ~~~Surface presentation of antigens protein SpaQ~~~
MDDLVFAGNKALYLVLILSGWPTIVATIIGLLVGLFQTVTQLQEQTLPFGIKLLGVCLCLFLLSGWYGEVLLSYGRQVIF
LALAKG
>P0A1M4 ~~~spaQ~~~Surface presentation of antigens protein SpaQ~~~
MSDIVYMGNKALYLILIFSLWPVGIATVIGLSIGLLQTVTQLQEQTLPFGIKLIGVSISLLLLSGWYGEVLLSFCHEIMF
LIKSGV
>P40701 ~~~spaR~~~Surface presentation of antigens protein SpaR~~~
MFYALYFEIHHLVASAALGFARVAPIFFFLPFLNSGVLSGAPRNAIIILVALGVWPHALNEAPPFLSVAMIPLVLQEAAV
GVMLGCLLSWPFWVMHALGCIIDNQRGATLSSSIDPANGIDTSEMANFLNMFAAVVYLQNGGLVTMVDVLNKSYQLCDPM
NECTPSLPPLLTFINQVAQNALVLASPVVLVLLLSEVFLGLLSRFAPQMNAFAISLTVKSGIAVLIMLLYFSPVLPDNVL
RLSFQATGLSSWFYERGATHVLE
>P0A1M6 ~~~spaR~~~Surface presentation of antigens protein SpaR~~~
MDISSWFESIHVFLILLNGVFFRLAPLFFFLPFLNNGIISPSIRIPVIFLVASGLITSGKVDIGSSVFEHVYFLMFKEII
VGLLLSFCLSLPFWIFHAVGSIIDNQRGATLSSSIDPANGVDTSELAKFFNLFSAVVFLYSGGMVFILESIQLSYNICPL
FSQCSFRISNILTFLTLLASQAVILASPVMIVLLLSEVLLGVLSRFAPQMNAFSVSLTIKSLLAIFIIFICSSTIYFSKV
QFFLGEHKFFTNLFVR
>P10946 ~~~spaS~~~Lantibiotic subtilin~~~
MSKFDDFDLDVVKVSKQDSKITPQWKSESLCTPGCVTGALQTCFLQTLTCNCKISK
>P40702 ~~~spaS~~~Surface presentation of antigens protein SpaS~~~
MSSNKTEKPTKKRLEDSAKKGQSFKSKDLIIACLTLGGIAYLVSYGSFNEFMGIIKIIIADNFDQSMADYSLAVFGIGLK
YLIPFMLLCLVCSALPALLQAGFVLATEALKPNLSALNPVEGAKKLFSMRTVKDTVKTLLYLSSFVVAAIICWKKYKVEI
FSQLNGNIVGIAVIWRELLLALVLTCLACALIVLLLDAIAEYFLTMKDMKMDKEEVKREMKEQEGNPEVKSKRREVHMEI
LSEQVKSDIENSRLIVANPTHITIGIYFKPELMPIPMISVYETNQRALAVRAYAEKVGVPVIVDIKLARSLFKTHRRYDL
VSLEEIDEVLRLLVWLEEVENAGKDVIQPQENEVRH
>P0A1M8 ~~~spaS~~~Surface presentation of antigens protein SpaS~~~
MANKTEKPTPKKLKDAAKKGQSFKFKDLTTVVIILVGTFTIISFFSLSDVMLLYRYVIINDFEINEGKYFFAVVIVFFKI
IGFPLFFCVLSAVLPTLVQTKFVLATKAIKIDFSVLNPVKGLKKIFSIKTIKEFFKSILLLIILALTTYFFWINDRKIIF
SQVFSSVDGLYLIWGRLFKDIILFFLAFSILVIILDFVIEFILYMKDMMMDKQEIKREYIEQEGHFETKSRRRELHIEIL
SEQTKSDIRNSKLVVMNPTHIAIGIYFNPEIAPAPFISLIETNQCALAVRKYANEVGIPTVRDVKLARKLYKTHTKYSFV
DFEHLDEVLRLIVWLEQVENTH
>P02976 ~~~spa~~~Immunoglobulin G-binding protein A~~~COG1388
MKKKNIYSIRKLGVGIASVTLGTLLISGGVTPAANAAQHDEAQQNAFYQVLNMPNLNADQRNGFIQSLKDDPSQSANVLG
EAQKLNDSQAPKADAQQNNFNKDQQSAFYEILNMPNLNEAQRNGFIQSLKDDPSQSTNVLGEAKKLNESQAPKADNNFNK
EQQNAFYEILNMPNLNEEQRNGFIQSLKDDPSQSANLLSEAKKLNESQAPKADNKFNKEQQNAFYEILHLPNLNEEQRNG
FIQSLKDDPSQSANLLAEAKKLNDAQAPKADNKFNKEQQNAFYEILHLPNLTEEQRNGFIQSLKDDPSVSKEILAEAKKL
NDAQAPKEEDNNKPGKEDNNKPGKEDNNKPGKEDNNKPGKEDNNKPGKEDGNKPGKEDNKKPGKEDGNKPGKEDNKKPGK
EDGNKPGKEDGNKPGKEDGNGVHVVKPGDTVNDIAKANGTTADKIAADNKLADKNMIKPGQELVVDKKQPANHADANKAQ
ALPETGEENPFIGTTVFGGLSLALGAALLAGRRREL
>A0A0H3K686 ~~~spa~~~Immunoglobulin G-binding protein A~~~
MKKKNIYSIRKLGVGIASVTLGTLLISGGVTPAANAAQHDEAQQNAFYQVLNMPNLNADQRNGFIQSLKDDPSQSANVLG
EAQKLNDSQAPKADAQQNNFNKDQQSAFYEILNMPNLNEAQRNGFIQSLKDDPSQSTNVLGEAKKLNESQAPKADNNFNK
EQQNAFYEILNMPNLNEEQRNGFIQSLKDDPSQSANLLSEAKKLNESQAPKADNKFNKEQQNAFYEILHLPNLNEEQRNG
FIQSLKDDPSQSANLLAEAKKLNDAQAPKADNKFNKEQQNAFYEILHLPNLTEEQRNGFIQSLKDDPSVSKEILAEAKKL
NDAQAPKEEDNNKPGKEDNNKPGKEDNNKPGKEDNNKPGKEDGNKPGKEDNKKPGKEDGNKPGKEDNKKPGKEDGNKPGK
EDGNKPGKEDGNGVHVVKPGDTVNDIAKANGTTADKIAADNKLADKNMIKPGQELVVDKKQPANHADANKAQALPETGEE
NPFIGTTVFGGLSLALGAALLAGRRREL
>P0A015 ~~~spa~~~Immunoglobulin G-binding protein A~~~
MKKKNIYSIRKLGVGIASVTLGTLLISGGVTPAANAAQHDEAQQNAFYQVLNMPNLNADQRNGFIQSLKDDPSQSANVLG
EAQKLNDSQAPKADAQQNNFNKDQQSAFYEILNMPNLNEAQRNGFIQSLKDDPSQSTNVLGEAKKLNESQAPKADNNFNK
EQQNAFYEILNMPNLNEEQRNGFIQSLKDDPSQSANLLSEAKKLNESQAPKADNKFNKEQQNAFYEILHLPNLNEEQRNG
FIQSLKDDPSVSKEILAEAKKLNDAQAPKEEDNKKPGKEDGNKPGKEDGNKPGKEDNKKPGKEDGNKPGKEDNNKPGKED
GNKPGKEDNNKPGKEDGNKPGKEDGNKPGKEDGNGVHVVKPGDTVNDIAKANGTTADKIAADNKLADKNMIKPGQELVVD
KKQPANHADANKAQALPETGEENPFIGTTVFGGLSLALGAALLAGRRREL
>P99134 ~~~spa~~~Immunoglobulin G-binding protein A~~~
MKKKNIYSIRKLGVGIASVTLGTLLISGGVTPAANAAQHDEAQQNAFYQVLNMPNLNADQRNGFIQSLKDDPSQSANVLG
EAQKLNDSQAPKADAQQNNFNKDQQSAFYEILNMPNLNEAQRNGFIQSLKDDPSQSTNVLGEAKKLNESQAPKADNNFNK
EQQNAFYEILNMPNLNEEQRNGFIQSLKDDPSQSANLLSEAKKLNESQAPKADNKFNKEQQNAFYEILHLPNLNEEQRNG
FIQSLKDDPSVSKEILAEAKKLNDAQAPKEEDNKKPGKEDGNKPGKEDGNKPGKEDNKKPGKEDGNKPGKEDNNKPGKED
GNKPGKEDNNKPGKEDGNKPGKEDGNKPGKEDGNGVHVVKPGDTVNDIAKANGTTADKIAADNKLADKNMIKPGQELVVD
KKQPANHADANKAQALPETGEENPFIGTTVFGGLSLALGAALLAGRRREL
>P38507 ~~~spa~~~Immunoglobulin G-binding protein A~~~
MKKKNIYSIRKLGVGIASVTLGTLLISGGVTPAANAAQHDEAQQNAFYQVLNMPNLNADQRNGFIQSLKDDPSQSANVLG
EAQKLNDSQAPKADAQQNKFNKDQQSAFYEILNMPNLNEEQRNGFIQSLKDDPSQSTNVLGEAKKLNESQAPKADNNFNK
EQQNAFYEILNMPNLNEEQRNGFIQSLKDDPSQSANLLAEAKKLNESQAPKADNKFNKEQQNAFYEILHLPNLNEEQRNG
FIQSLKDDPSQSANLLAEAKKLNDAQAPKADNKFNKEQQNAFYEILHLPNLTEEQRNGFIQSLKDDPSVSKEILAEAKKL
NDAQAPKEEDNNKPGKEDGNKPGKEDGNKPGKEDNKKPGKEDGNKPGKEDNKKPGKEDGNKPGKEDGNKPGKEDGNKPGK
EDGNKPGKEDGNGVHVVKPGDTVNDIAKANGTTADKIAADNKLADKNMIKPGQELVVDKKQPANHADANKAQALPETGEE
NPFIGTTVFGGLSLALGAALLAGRRREL
>K4L7X3 3.13.1.4~~~acd~~~3-sulfinopropanoyl-CoA desulfinase~~~COG1960
MYELTPEQRTLQTQARELAQSVFASTAVQTDLTEQYPWDNVAQLRDAGFMGMMLPTSVGGRGLSTLDTVIVIEEMAKACA
TMGRITVDSNLGAIGAITKYGSEEQIKLAADLVLAGDKPAICISEPNAGSAASEMTTRADKNGDHYILNGEKYWITGGGV
SKLHLIFARVFDDGVEQGIGAFITVLDDHGPEGLKVGRRLYAMGVRGIPETHLEFHDLKIHKSMMITFPDGLKRGFAALM
SAYNAQRVGAGAVALGIAQCAFEEGVAYLKRREQFGRPLAEFQGLQWMVADMSVQLEAARLMLRSAAVSGETFPDINKAA
QAKIFAAETANKVTNDALQFFGSSGYGRHNPMERHVRDARMFTIAGGTAQILRTQVASKILDMKLPQTRDGYLKAAQNSK
R
>F8GVD3 3.13.1.4~~~acd~~~3-sulfinopropanoyl-CoA desulfinase~~~
MYSLTNAQKDLQLKARDLAQCAFAPTAANTDVTEAYPWANVDRLLTEGFMGMTIPKEYGGQGRSYHDTVIVIEEMAKACA
TMGRITVEANMGAIGAIMNYGTEEQKKIAAAAVLSGDKPAICISEPNAGSAASEMTTRADRKGDRYILNGEKYWITGGGV
SRLHLIFARVFDDGVDQGICAFICVREGNSPENLVVGRRLYAMGVRGIPETHLEFRDLQVHKSMLVVPPGGLKRGFASLM
NAYNAQRVGAGTVALGIAQGAFEEAVTYAKERQQFGRPIAEFQGLQWMISDMSIQLEAARLLLHAAACSGESFPDIAMAA
RAKIFAAETANKVTNDSLQIYGSSGYGRHNPMERHVRDARMFTIAGGTAQILRTQVAGSILDMKLPQTRGGFLPK
>Q13PC1 3.13.1.4~~~acd~~~3-sulfinopropanoyl-CoA desulfinase~~~COG1960
MFELTDAQRQLQQSARRLALEAIAPHAAQTDRSEQYPWHTVEALREQRLMGMTLPPEYGGKGASYFDTVLVIEELSKVCA
ASGRIMVESNMGAIGAIMKYGSDAQKQLAARLVLAGDKPAICITEPQAGSAASDMQTRAERRGDTWHLSGCKHWITGGGV
SKLHFVFARAIEDGKDTGIAGFIVVGPDVPGMTIQRIPAMGIRGVPEARIEFDDMRVRHDMKVTPPGRTEAGFAGLMNAY
NAQRVGAATVALGIAQGAFDLALDYAKRREQFGRPIAEFQGLQWMLADMSIQLEAARLMVWKAAASGSEFPSMFAAAQAK
IAAGEAAIKVTNDALQIHGAVGYGRDLPLERMVRDARMFTISGGTAQILRTQVAGTLLGQKLSQRRSA
>B9U6P5 3.13.1.4~~~acd~~~3-sulfinopropanoyl-CoA desulfinase~~~
MYDLTSAQLDLQARARELAQTKFAPTAAQTDQTEEYPWKNVELLRDAGFMGMTLPKSIGGQGLSYLDAVIVVEEMAKACA
TMGRITVEANMGAIGAIAKYGTPEQLKIAADLVLAGDKPAICISEPNAGSAASEMTTRADRQGDHYIINGEKYWITGGGV
SKVHLIFARVLEDGVDQGIGGFICVRDGENSPAGLVIGRRLYAMGVRGIPETHIEFHDLKVHKSMLVVPPGGLKRGFASL
MTAYNAQRVGAGTVALGIAQGAFEEGLERLKTRHQFGRPIAEFQGLQWMAADMSTQLEAARLLLRHAAASGEEFPDIDKA
ARAKIFAAETANKVTNDALQFWGSSGYGRENPMERHVRDARMFTIAGGTAQILRTQVAGKLLGMKLPQTRDGFAKVAAR
>P21885 4.1.1.19~~~speA~~~Arginine decarboxylase~~~COG1982
MSQHETPLYTGLKKHASRQPVQFHIPGHKKGAGMDPEFRQFIGENALSIDLINIEPLDDLHAPKGIIKQAQDLAAEAFGA
DHTFFSVQGTSGAIMTMVMAVCGPGDKIIIPRNVHKSIMTAIVFSGAVPIFIHPEIDNELGISHGITLESAKRALTEHPD
AKGLLVINPTYFGVAADLKSIVELAHSFDVPVLVDEAHGVHIHFHDELPLSAMQAGADIAATSVHKLGGSLTQSSILNMR
EGLVSKDRVQSILSMLTTTSTSYLLLASLDVARKRLATEGRQLAEETLKLANQTRDRLNQIEGIYCVGSEILGSKAAYSY
DPTKLIISVKSLGLTGHDVEKWLRESFNIEVELSDLYNILCIFTPGDSQNDADRLVEALTEIAQQMSEQDVTHQQTEVLL
PEIPLLAMTPRDAFYANTEVIPLKEASGRIIAEFVMVYPPGIPIFIPGEIITEENISYIFKNLDAGLPVQGPEDSTLHMI
RVIKEQKAIQ
>P21170 4.1.1.19~~~speA~~~Biosynthetic arginine decarboxylase~~~COG1166
MSDDMSMGLPSSAGEHGVLRSMQEVAMSSQEASKMLRTYNIAWWGNNYYDVNELGHISVCPDPDVPEARVDLAQLVKTRE
AQGQRLPALFCFPQILQHRLRSINAAFKRARESYGYNGDYFLVYPIKVNQHRRVIESLIHSGEPLGLEAGSKAELMAVLA
HAGMTRSVIVCNGYKDREYIRLALIGEKMGHKVYLVIEKMSEIAIVLDEAERLNVVPRLGVRARLASQGSGKWQSSGGEK
SKFGLAATQVLQLVETLREAGRLDSLQLLHFHLGSQMANIRDIATGVRESARFYVELHKLGVNIQCFDVGGGLGVDYEGT
RSQSDCSVNYGLNEYANNIIWAIGDACEENGLPHPTVITESGRAVTAHHTVLVSNIIGVERNEYTVPTAPAEDAPRALQS
MWETWQEMHEPGTRRSLREWLHDSQMDLHDIHIGYSSGIFSLQERAWAEQLYLSMCHEVQKQLDPQNRAHRPIIDELQER
MADKMYVNFSLFQSMPDAWGIDQLFPVLPLEGLDQVPERRAVLLDITCDSDGAIDHYIDGDGIATTMPMPEYDPENPPML
GFFMVGAYQEILGNMHNLFGDTEAVDVFVFPDGSVEVELSDEGDTVADMLQYVQLDPKTLLTQFRDQVKKTDLDAELQQQ
FLEEFEAGLYGYTYLEDE
>P62561 ~~~speA~~~Exotoxin type A~~~
MENNKKVLKKMVFFVLVTFLGLTISQEVFAQQDPDPSQLHRSSLVKNLQNIYFLYEGDPVTHENVKSVDQLLSHDLIYNV
SGPNYDKLKTELKNQEMATLFKDKNVDIYGVEYYHLCYLCENAERSACIYGGVTNHEGNHLEIPKKIVVKVSIDGIQSLS
FDIETNKKMVTAQELDYKVRKYLTDNKQLYTNGPSKYETGYIKFIPKNKESFWFDFFPEPEFTQSKYLMIYKDNETLDSN
TSQIEVYLTTK
>P0DJY7 ~~~speA~~~Exotoxin type A~~~
MENNKEVLKKMVFFVLMKFLGLTILPKGICSTRPKPSQLQRSNLVKTFKIYIFFMRVTLVTHENVKSVDQLLSHDLIYNV
SGPNYDKLKTELKNQEMATLFKDKNVDIYGVEYYHLCYLCENAERSACLYGGVTNHEGNHLEIPKKIVVKVSIDGIQSLS
FDIEQIKNGNCSRISYTVRKYLTDNKQLYTNGPSKYETGYIKFIPKNKESFWFDFFPEPEFTQSKYLMIYKDNETLDSNT
SQIEVYLTTK
>Q7MK24 4.1.1.19~~~speA~~~Biosynthetic arginine decarboxylase~~~COG1166
MRLDVEQTSKLDRVRADYNVHYWSQGFYGIDDQGEMYVSPRSDNAHQIQLSKIVKQLEERQLNVPVLVRFPQILHQRVHS
ICDAFNQAIEEYQYPNKYLLVYPIKVNQQREVVDEILASQAQLETKQLGLEAGSKPELLAVLAMAQHASSVIVCNGYKDR
EYIRLALIGEKLGHKVFIVLEKMSELDLVLREAKSLGVTPRLGIRIRLASQGAGKWQASGGEKSKFGLSASQVLNVISRL
KKENQLDTLQLVHFHLGSQMANIRDVRNGVNESARFYCELRTLGANITYFDVGGGLAIDYDGTRSQSSNSMNYGLVEYAR
NIVNTVGDVCKDYKQPMPVIISESGRSLTAHHAVLISNVIGTETYKPETVTEPEEDFPLLLNNMWRSWLNLHNGTDARAL
IEIYNDTQSDLAEVHSQFATGVLTLEHRAWAEQTSLRIYYELNRLMSTKNRFHRPILDELSERLADKFFVNFSLFQSLPD
SWGIDQVFPVLPLSGLQNAADRRAVMLDITCDSDGAIDAYVDGQGIESTLPVPAWNEDEPYLMGFFLVGAYQEILGDMHN
LFGDTHSVVVNVGDQGEINIDFINEGDTVEDMMRYVHIDVDQIRKNYHSLVSQRVDQEEQQQILAELEQGLSGYTYLEDF
>P73270 3.5.3.11~~~speB2~~~Probable agmatinase 2~~~COG0010
MSDATPFRPPSEAEEALIKETRLPLTGWQQEVDQGLTYGLEAAASIKDRSIPTFSRGELPHYAGINTFMKAPYLEDVREV
GKYDVAIVGVPHDSGTTYRPGTRFGPQGIRRISALYTPYNFEMGVDLREQISLCDVGDIFTIPANNEKSFDQISKGIAHI
FSSGAFPIILGGDHSIGFPTVRGICRHLGDKKVGIIHFDRHVDTQETDLDERMHTCPWFHATNMANAPAKNLVQLGIGGW
QVPRQGVKVCRERATNILTVTDITEMSLDAAADFAIARATDGTDCVWISFDIDCIDAGFVPGTGWPEPGGLLPREALYLL
KRIIRETNVCGMEVVEVSPPYDISDMTSLMATRVICDTMAHLVVSGQLPRTEKPAYIHAEANMAVDEPWQ
>Q72JK8 3.5.3.24~~~~~~N(1)-aminopropylagmatine ureohydrolase~~~COG0010
MRLVFGEKDAPYEEARVVVLPVPYDLSLSFLPGARRGPEAILLASRELEPFLLELGAAPEEVGIHAAEPVPWVAGMAEES
HRLIREEALKHLRAGKWLVALGGDHSVTHPLVQAHREALGEFSLLHVDAHADLYPEWQGSVYSHASPFYRLLTEGFPLVQ
VGIRAMDRDSLRLARKRGVALFPAHRIHREGLPLDEILEALGKRVYISLDFDALDPSLMPSVGTPLPGGLSYRQVVDLLE
AVFREKEVVGMDFVELSPNGQFHAEMTAAQLVYHAIGLKGLQAGWLSREVDHI
>P70999 3.5.3.11~~~speB~~~Agmatinase~~~COG0010
MRFDEAYSGKVFIASRPEWEEADAILYGMPMDWTVSYRPGSRFGPSRIREVSIGLEEYSPYLDRDLADLNFFDAGDIPLP
FGNPQRSLDMIEEYVDSILEKGKFPMGMGGEHLVSWPVIKAMYKKYPDLAIIHFDAHTDLRVDYEGEPLSHSTPIRKAAE
LIGPHNVYSFGIRSGMKEEFEWAKENGMHISKFEVLEPLKEVLPKLAGRPVYVTIDIDVLDPAHAPGTGTVDAGGITSKE
LLASVHEIARSEVNVKGADLVEVAPVYDHSEQTANTASKIIREMLLGFVK
>P60651 3.5.3.11~~~speB~~~Agmatinase~~~COG0010
MSTLGHQYDNSLVSNAFGFLRLPMNFQPYDSDADWVITGVPFDMATSGRAGGRHGPAAIRQVSTNLAWEHNRFPWNFDMR
ERLNVVDCGDLVYAFGDAREMSEKLQAHAEKLLAAGKRMLSFGGDHFVTLPLLRAHAKHFGKMALVHFDAHTDTYANGCE
FDHGTMFYTAPKEGLIDPNHSVQIGIRTEFDKDNGFTVLDACQVNDRSVDDVIAQVKQIVGDMPVYLTFDIDCLDPAFAP
GTGTPVIGGLTSDRAIKLVRGLKDLNIVGMDVVEVAPAYDQSEITALAAATLALEMLYIQAAKKGE
>P0C0J1 3.4.22.10~~~speB~~~Streptopain~~~
MNKKKLGVRLLSLLALGGFVLANPVFADQNFARNEKEAKDSAITFIQKSAAIKAGARSAEDIKLDKVNLGGELSGSNMYV
YNISTGGFVIVSGDKRSPEILGYSTSGSFDANGKENIASFMESYVEQIKENKKLDTTYAGTAEIKQPVVKSLLDSKGIHY
NQGNPYNLLTPVIEKVKPGEQSFVGQHAATGCVATATAQIMKYHNYPNKGLKDYTYTLSSNNPYFNHPKNLFAAISTRQY
NWNNILPTYSGRESNVQKMAISELMADVGISVDMDYGPSSGSAGSSRVQRALKENFGYNQSVHQINRGDFSKQDWEAQID
KELSQNQPVYYQGVGKVGGHAFVIDGADGRNFYHVNWGWGGVSDGFFRLDALNPSALGTGGGAGGFNGYQSAVVGIKP
>P0C0J0 3.4.22.10~~~speB~~~Streptopain~~~
MNKKKLGIRLLSLLALGGFVLANPVFADQNFARNEKEAKDSAITFIQKSAAIKAGARSAEDIKLDKVNLGGELSGSNMYV
YNISTGGFVIVSGDKRSPEILGYSTSGSFDANGKENIASFMESYVEQIKENKKLDTTYAGTAEIKQPVVKSLLDSKGIHY
NQGNPYNLLTPVIEKVKPGEQSFVGQHAATGCVATATAQIMKYHNYPNKGLKDYTYTLSSNNPYFNHPKNLFAAISTRQY
NWNNILPTYSGRESNVQKMAISELMADVGISVDMDYGPSSGSAGSSRVQRALKENFGYNQSVHQINRSDFSKQDWEAQID
KELSQNQPVYYQGVGKVGGHAFVIDGADGRNFYHVNWGWGGVSDGFFRLDALNPSALGTGGGAGGFNGYQSAVVGIKP
>Q8NKX2 ~~~speC~~~Exotoxin type C~~~
MKKINIIKIVFIITVILISTISPIIKSDSKKDISNVKSDLLYAYTITPYDYKDCRVNFSTTHTLNIDTQKYRGKDYYISS
EMSYEASQKFKRDDHVDVFGLFYILNSHTGEYIYGGITPAQNNKVNHKLLGNLFISGESQQNLNNKIILEKDIVTFQEID
FKIRKYLMDNYKIYDATSPYVSGRIEIGTKDGKHEQIDLFDSPNEGTRSDIFAKYKDNRIINMKNFSHFDIYLEK
>P0A7F6 4.1.1.50~~~speD~~~S-adenosylmethionine decarboxylase proenzyme~~~COG1586
MKKLKLHGFNNLTKSLSFCIYDICYAKTAEERDGYIAYIDELYNANRLTEILSETCSIIGANILNIARQDYEPQGASVTI
LVSEEPVDPKLIDKTEHPGPLPETVVAHLDKSHICVHTYPESHPEGGLCTFRADIEVSTCGVISPLKALNYLIHQLESDI
VTIDYRVRGFTRDINGMKHFIDHEINSIQNFMSDDMKALYDMVDVNVYQENIFHTKMLLKEFDLKHYMFHTKPEDLTDSE
RQEITAALWKEMREIYYGRNMPAV
>P70998 2.5.1.16~~~speE~~~Polyamine aminopropyltransferase~~~COG0421
MSELWYTEKQTKNFGITMKVNKTLHTEQTEFQHLEMVETEEFGNMLFLDGMVMTSEKDEFVYHEMVAHVPLFTHPNPEHV
LVVGGGDGGVIREILKHPSVKKATLVDIDGKVIEYSKKFLPSIAGKLDDPRVDVQVDDGFMHIAKSENQYDVIMVDSTEP
VGPAVNLFTKGFYAGIAKALKEDGIFVAQTDNPWFTPELITNVQRDVKEIFPITKLYTANIPTYPSGLWTFTIGSKKYDP
LAVEDSRFFDIETKYYTKDIHKAAFVLPKFVSDLIK
>P09158 2.5.1.-~~~speE~~~Polyamine aminopropyltransferase~~~COG0421
MAEKKQWHETLHDQFGQYFAVDNVLYHEKTDHQDLIIFENAAFGRVMALDGVVQTTERDEFIYHEMMTHVPLLAHGHAKH
VLIIGGGDGAMLREVTRHKNVESITMVEIDAGVVSFCRQYLPNHNAGSYDDPRFKLVIDDGVNFVNQTSQTFDVIISDCT
DPIGPGESLFTSAFYEGCKRCLNPGGIFVAQNGVCFLQQEEAIDSHRKLSHYFSDVGFYQAAIPTYYGGIMTFAWATDND
ALRHLSTEIIQARFLASGLKCRYYNPAIHTAAFALPQYLQDALASQPS
>O25503 2.5.1.16~~~speE~~~Polyamine aminopropyltransferase~~~COG0421
MWITQEITPYLRKEYTIEAKLLDVRSEHNILEIFKSKDFGEIAMLNRQLLFKNFLHIESELLAHMGGCTKKELKEVLIVD
GFDLELAHQLFKYDTHIDFVQADEKILDSFISFFPHFHEVKNNKNFTHAKQLLDLDIKKYDLIFCLQEPDIHRIDGLKRM
LKEDGVFISVAKHPLLEHVSMQNALKNMGGVFSVAMPFVAPLRILSNKGYIYASFKTHPLKDLMTPKIEALTSVRYYNED
IHRAAFALPKNLQEVFKDNIKS
>Q31QK9 2.5.1.16~~~speE~~~Polyamine aminopropyltransferase~~~COG0421
MSADAPVWIDEVFEDRVRYGLRGQILWEETSPFQKITIVDTEHYGRGLLLDDCWMTAERCEVCYHEYLVHPPLTTAASIA
RVLVIGGGDGGTVREVLRYAEVEQVDLVEIDGRVVELSQEYLGAIGTAWADPRLNVKIGDGIAFVQTAPDASYDVILVDG
SDPAGPAAGLFNREFYENCRRVLKPGGVFASQAESPDSFLAVHLEMIETLSAVFAEAKPYYGWVPMYPSGWWSWLYASDT
PGQFQKPQSDRLAAIEPQVEIYNRDIHQAAFAQPNFVRRGLSARQG
>Q9WZC2 2.5.1.16~~~speE~~~Polyamine aminopropyltransferase~~~COG0421
MRTLKELERELQPRQHLWYFEYYTGNNVGLFMKMNRVIYSGQSDIQRIDIFENPDLGVVFALDGITMTTEKDEFMYHEML
AHVPMFLHPNPKKVLIIGGGDGGTLREVLKHDSVEKAILCEVDGLVIEAARKYLKQTSCGFDDPRAEIVIANGAEYVRKF
KNEFDVIIIDSTDPTAGQGGHLFTEEFYQACYDALKEDGVFSAETEDPFYDIGWFKLAYRRISKVFPITRVYLGFMTTYP
SGMWSYTFASKGIDPIKDFDPEKVRKFNKELKYYNEEVHVASFALPNFVKKELGLM
>Q5SK28 2.5.1.104~~~speE~~~Polyamine aminopropyltransferase~~~COG0421
MDYGMYFFEHVTPYETLVRRMERVIASGKTPFQDYFLFESKGFGKVLILDKDVQSTERDEYIYHETLVHPAMLTHPEPKR
VLIVGGGEGATLREVLKHPTVEKAVMVDIDGELVEVAKRHMPEWHQGAFDDPRAVLVIDDARAYLERTEERYDVVIIDLT
DPVGEDNPARLLYTVEFYRLVKAHLNPGGVMGMQAGMILLTHHRVHPVVHRTVREAFRYVRSYKNHIPGFFLNFGFLLAS
DAFDPAAFSEGVIEARIRERNLALRHLTAPYLEAMFVLPKDLLEALEKETMVSTDQNPFYVTPEGEARQAPYKG
>P0DTV7 ~~~speFL~~~Leader peptide SpeFL~~~
MENNSRTMPHIRRTTHIMKFAHRNSFDFHFFNAR
>P0DTV8 ~~~speFL~~~Leader peptide SpeFL~~~
MENNNRFMPHIRRTTHIMMFAHRNSFDFHFFNAR
>O66615 4.1.1.50~~~speH~~~S-adenosylmethionine decarboxylase proenzyme~~~COG1586
MAKTLGLHILADLYGVDADKIDRVEDIRELLEGAVKYANLTKISSHYYQFQPHGATGVVLLAESHISIHTWPEHGLATVD
VYTCGDPSKAYRAMDYIITQLNPKRIDKQVHERGIVEEESNQSEAEKLRSILLQV
>O34426 4.1.1.50~~~speH~~~S-adenosylmethionine decarboxylase proenzyme~~~COG1586
METMGRHVISELWGCDFDKLNDMDFIEKTFVNAALKSGAEVREVAFHKFAPQGVSGVVIISESHLTIHSFPEHGYASIDV
YTCGDLDPNVAADYIAEALHADTRENIEIPRGMGPVQIKQAQAKVL
>P0C0I6 ~~~speH~~~Exotoxin type H~~~
MRYNCRYSHIDKKIYSMIICLSFLLYSNVVQANSYNTTNRHNLESLYKHDSNLIEADSIKNSPDIVTSHMLKYSVKDKNL
SVFFEKDWISQEFKDKEVDIYALSAQEVCECPGKRYEAFGGITLTNSEKKEIKVPVNVWDKSKQQPPMFITVNKPKVTAQ
EVDIKVRKLLIKKYDIYNNREQKYSKGTVTLDLNSGKDIVFDLYYFGNGDFNSMLKIYSNNERIDSTQFHVDVSIS
>Q9WZC3 4.1.1.50~~~speH~~~S-adenosylmethionine decarboxylase proenzyme~~~COG1586
MKSLGRHLVAEFYECDREVLDNVQLIEQEMKQAAYESGATIVTSTFHRFLPYGVSGVVVISESHLTIHTWPEYGYAAIDL
FTCGEDVDPWKAFEHLKKALKAKRVHVVEHERGRYDEIGIPEDSPHKAAV
>P06654 ~~~spg~~~Immunoglobulin G-binding protein G~~~
MEKEKKVKYFLRKSAFGLASVSAAFLVGSTVFAVDSPIEDTPIIRNGGELTNLLGNSETTLALRNEESATADLTAAAVAD
TVAAAAAENAGAAAWEAAAAADALAKAKADALKEFNKYGVSDYYKNLINNAKTVEGIKDLQAQVVESAKKARISEATDGL
SDFLKSQTPAEDTVKSIELAEAKVLANRELDKYGVSDYHKNLINNAKTVEGVKELIDEILAALPKTDTYKLILNGKTLKG
ETTTEAVDAATAEKVFKQYANDNGVDGEWTYDDATKTFTVTEKPEVIDASELTPAVTTYKLVINGKTLKGETTTKAVDAE
TAEKAFKQYANDNGVDGVWTYDDATKTFTVTEMVTEVPGDAPTEPEKPEASIPLVPLTPATPIAKDDAKKDDTKKEDAKK
PEAKKDDAKKAETLPTTGEGSNPFFTAAALAVMAGAGALAVASKRKED
>P19909 ~~~spg~~~Immunoglobulin G-binding protein G~~~
MEKEKKVKYFLRKSAFGLASVSAAFLVGSTVFAVDSPIEDTPIIRNGGELTNLLGNSETTLALRNEESATADLTAAAVAD
TVAAAAAENAGAAAWEAAAAADALAKAKADALKEFNKYGVSDYYKNLINNAKTVEGVKDLQAQVVESAKKARISEATDGL
SDFLKSQTPAEDTVKSIELAEAKVLANRELDKYGVSDYHKNLINNAKTVEGVKDLQAQVVESAKKARISEATDGLSDFLK
SQTPAEDTVKSIELAEAKVLANRELDKYGVSDYYKNLINNAKTVEGVKALIDEILAALPKTDTYKLILNGKTLKGETTTE
AVDAATAEKVFKQYANDNGVDGEWTYDDATKTFTVTEKPEVIDASELTPAVTTYKLVINGKTLKGETTTEAVDAATAEKV
FKQYANDNGVDGEWTYDDATKTFTVTEKPEVIDASELTPAVTTYKLVINGKTLKGETTTKAVDAETAEKAFKQYANDNGV
DGVWTYDDATKTFTVTEMVTEVPGDAPTEPEKPEASIPLVPLTPATPIAKDDAKKDDTKKEDAKKPEAKKEDAKKAETLP
TTGEGSNPFFTAAALAVMAGAGALAVASKRKED
>P39663 ~~~sphR~~~Alkaline phosphatase synthesis transcriptional regulatory protein SphR~~~COG0745
MTSNPLDESMESLTVVPPAASPASRILVVEDEAVIRDMVALVLQQEGFTVDVAADGRTALNYFRSDSPEAGSVTENPDLV
VLDLMLPAVNGLDFCRLLRRQGVTVPILMLSAKDTETDRVVGLEIGADDYLTKPFGTRELVARCRALLRRSQNQPAETPA
VLRYEGLKLFPEECRVLLDDRELTLSPKEFRLLELFMRHPRRVWSRDQLLEKIWGIDFMGDSKTIDVHIRWLREKIEANP
SNPSYLLTVRGFGYRLG
>P39664 2.7.13.3~~~sphS~~~Sensor protein SphS~~~COG5002
MAAWEFALGLLTASLWRWARKWRSPVKVKPMLAAVSSLEPQLEQITTDLRDRDRLLEDLPVSFLLLDADNLVLEANRSAR
VLLALPPEDYCRPLLEVVRSYELDRLVARCRAANAPQTDRWTLTPVNPDPLQVVPQTPRPVQGQAIPLSNGQIGVLIEDR
QELVDLAQQRNRWVSDVAHELKTPLTSIRLLAEALRDRLQDEPQVWVDRLLGETQRLGQLVQDLLELSRLEQGPSGLQKL
EAVDLVALLTSVRNSLEPLAEPLRLGWAYQGPEQGFVRGDRQRLFRLWLNLVDNAIRHSPSGGCLYVELRQRGDTWICDL
YDDGPGFADADLPYLFERFYRGDPSRVRPAAASSSSPGSGLGLAIARQVVEAHQGRISARNHPVTGGAWLRVQLPQEPSL
TPALKIGTGRRSG
>P39665 ~~~sphX~~~Protein SphX~~~COG0226
MTTLKPALRRAAVLLPIAAVASSLFPIQEASAQRALVTADGSSTVFPISEAVAEEFQKRNKNINVTVGVSGTGGGFKRFC
NGEIDIANASRPIKKEEVEACRKKGIRYIELPVAFDALTVVVNKSNPVNSITTAELAKIFGRDAEKKTTNWRQVKSSFPN
LPLRVYAPGTDSGTYDYFNEAILNKKGTRGDLTASEDDNILVQGVSRDRGGIGFFGFSYYEENKGKLKALAVVNSNGKAV
MPSVQNVLNGTYDPLARPVFIYVSEQAAKKANVRSFVNFYLQNAGKLSREVGFVPLPAKAYTAATQRFRSNKTGTVFAGK
SLVGGSIEDLLKAEGIN
>P50470 ~~~~~~Immunoglobulin G-binding protein H~~~
MTRQQTKKNYSLRKLKTGTASVAVALTVLGAGFANQTTVKAEGAKIDWQEEYKKLDEDNAKLVEVVETTSLENEKLKSEN
EENKKNLDKLSKENQGKLEKLELDYLKKLDHEHKEHQKEQQEQEERQKNQEQLERKYQREVEKRYQEQLQKQQQLETEKQ
ISEASRKSLSRDLEASRAAKKDLEAEHQKLEAEHQKLKEDKQISDASRQGLSRDLEASRAAKKELEANHQKLEAEHQKLK
EDKQISDASRQGLSRDLEASRAAKKELEANHQKLEAEAKALKEQLAKQAEELAKLRAGKASDSQTPDTKPGNKAVPGKGQ
APQAGTKPNQNKAPMKETKRQLPSTGETANPFFTAAALTVMATAGVAAVVKRKEEN
>D0ZWR8 ~~~spiC~~~Salmonella pathogenicity island 2 protein C~~~
MSEEGFMLAVLKGIPLIQDIRAEGNSRSWIMTIDGHPARGEIFSEAFSISLFLNDLESLPKPCLAYVTLLLAAHPDVHDY
AIQLTADGGWLNGYYTTSSSSELIAIEIEKHLALTCILKNVIRNHHKLYSGGV
>P21625 ~~~spi~~~Spiralin~~~
MKKLLSILAVFGVSAVGTTSVVACNKTESNNLSRVKTIAAPATVAASTPKAVTKPEIKTALEANVLKAVQGVVKTATAAD
FQFEVYKNSKGTALETIDLEAGKVEVYLQITPAKDKTVVIGETRYIKVTLPKHGEVTKVDIKDVTVTEQTVGIKASTPKA
VKKDELNAVNTYATLAKAVLDAIQNIAPNAGASDFEITNNGAEGDYEAAKEVEVTVKAKNDSANISGQFKFKAKVTATAP
TE
>P43131 ~~~~~~Protease inhibitor~~~
MKTIRTGMMTLAALAVLGTNVVSATSEPVKELSVNVNGQHIEQAAIFDKGQQTVLVPLRDVAESLGFQVKWNAETKAAEV
NKGAIFSYAKVGEDRYPFAKMYKTLGAEPRLLNGNTYVPVAFVDEILQAEVNVTDDAVTVVDEESDVAPVRTGTITTLNK
REDGGVSFQLNGYETGIILHVDKETKITTADGKELKPEDLQLGMEVEATHQKFMAMSMPQSGAVSIVVKSGLETPEVLGT
AGKVASIDKDQEGSYKMLVEGQALAENAPEKVALIVGKDTKIVSAKDNKELAPEDLKAEMKVFAYYGPKLTRSLPPIGVA
EKIVVE
>O50393 ~~~~~~Serine protease inhibitor Rv3364c~~~COG2018
MKARLPDSPLDWLVSKFAREVPGVAHALLVSVDGLPVAASEHLPRERADQLAAVTSGLASLAGGAAQLFDGGQVLQSVVE
MQNGYLLLMQVGDGSALAALAATGCDIGQIGYEMAILVERVGGVVQSCRR
>Q9FAB3 2.7.11.1~~~spkA~~~Serine/threonine-protein kinase A~~~COG0515
MTPDSRHRRLLANRYQLVELVGSGAMGQVYRAEDKLLGGVTVAVKFLSQALLNPRMKERFEREATISALLGEKSIHIVRV
RDYGLDEKEIPFYVMEYLQGENISDVIKYRPLKVERFLKIARQICFGLDCAHKGIIYQGEACPVVHRDIKPSNVLLVEDP
ALGELVKILDFGIAKLVQAAEESKTQAFMGTLAYCSPEQMEGKELDSRSDIYSLGVMMYEMLTGEMPLFPDNSSFGGWYE
AHHHTKPHPFSARYKIPASLEALVMNCLAKSPKGRPQSVDVIIRAIDAIEAEIKAPPISDTEKTQIAPHLLNTDMEATVV
AQRGPGIVPETRLPLVSELCKQLEWPSDKPKQKIVFPYVLNAAEGKLASLWVMLNQEDILTRMSSIRYNQFLLMTSPHPM
VLWITVLYHREYGPRWLPCYLDLKTRSGQSFAQMLGESGTYWLLFFALENPTRCQHMLTATVAPNQCKLLKEWAQTSQSM
PGGKPQVTKRLLKQELDRLKPKIEAKLSQVKPSFNKEVSGL
>P74297 2.7.11.1~~~spkB~~~Serine/threonine-protein kinase B~~~COG0515
MSFCVNPNCPHPKNPNNVQVCQACGNSLRLNGRYQTLGLLGKGGFGATFAAADVALPGTPICVVKQLRPQTDDPNVFRMA
KELFEREAQTLGRVGNHPQVPRLLDYFEDDHQFYLVQEYVKGHNLHQEVKKNGTFTEGSVKQFLTEILPILDYIHSQKVI
HRDIKPANLIRRQTDQKLVLIDFGAVKNQIDSVLSSNTSAQTALTAFAVGTAGFAPPEQMAMRPVYASDIYATGVTCLYL
LTGKTPKEIDCNSQTGEMDWEKHVTVSSKFAEVIRKMLELSVRHRYKSAQQVLDALEMPTYEDGMMQGMVSTPFTTLTGA
GDEPATGIRMGNSSSPDYGDPSTRFNTNVQPRDPSSTSLNTGIKTRTAKPRQSPRDRATSNIESPTTRVRPASNMADGGS
VGAGGIDYNMVNPKPFSRREEEKQAIANQPETKRWNGKTFLAEYAQGKRDFADQNLVGIVLAKAFVPGINCYQANLTNAN
FEQAELTRADFGKARLKNVIFKGANLSDAYFGYADLRGADLRGANLNGVNFKYANLQGANFSGADLGSAKVSPEQLKLAK
TNWRTVMPGSGRRR
>P74745 2.7.11.1~~~spkC~~~Serine/threonine-protein kinase C~~~COG0515
MVTPLKLLNNRYRIIETLGRGGFGETFLAQDTHMPSARKCVIKHLKPVLENPEIPSWLRERFHREAATLEELGENHPQIP
QLYAYFSEGEDFYLVQEWIPGLTLTQAHAQKGNFSSTAVEELLLGILPVLEFIHQRRIIHRDIKPDNIILREADGKPILI
DFGIIKETMGTLVNPDGRSAYSVALGTPGYMASEQAAGRPVFSSDLYSLGLTAIFLLTGKTPQYLTSDSRTGEILWRQGA
PQVSPTLAKVIDQAVRYHPRERFNSATAMAQTLQGNFSNVPMTKGDRPGNTVANGKTKSNHQPTAPTLVVGTPYNANDTQ
ATKVYTQEFTGYTETQEGSPLMKWVVMPLVVLLVIGGGMAAGFWVTSQRRNNPPPAVEEPTEETPIPLPSLEPRPNLFET
PSPIPTPATPSPEPTPSPSPSPETTSSPTEDTITPMEPEPSLDEPAPIPEPKPSPSPTISPQPSPTISIPVTPAPVPKPS
PSPTPKPTVPPQISPTPQPSNTVPVIPPPENPSAETEPNLPAPPVGEKPIDPEQN
>P54735 2.7.11.1~~~spkD~~~Serine/threonine-protein kinase D~~~COG0515
MNVQVLDRYEIVKSLGSGGFGDTFLAKDTQIPSQKLVVIKRLKPANANSNTSTELIQKLFEKEASVLEDLGEHNSQIPKL
YSYFSNDNEFYLVQEYIQGVSLNEIAPISSEQAKTILSSLLTTLKYIHSKGIIHRDIKPENIILRDSDHLPVLIDFGAVK
ETMGAVTLGSGSTVSSVVIGTRGFMAPEQSSGRSVFSTDLYALGLTIIYTLTKKLPVEFSSDQQTGQLDWQSHVSKIDSV
LAKVINKAIEMEPSRRYSSAEAMYQALHSLISSGAEPALPMETVRVAPSNEFLVTRSSTKTAETVVKPVGNSHNNYSNNN
GKSKIATLLTVLIGIIVVTAGLGGGFIITQQIKEAEARAAQAEKEKQEAEQKRIEAEQKIAENEKRQRELEQKRVEEERQ
RLAAEAERAKQERQRLAAERQRVQVLANQAKAMASGASATIGGIPGSKNIRSGPGTDYGVITQGYTGEGLDILDSSTDSS
GHVWYKVYHYGSGSTGWIASQLVNF
>P73469 2.7.11.1~~~spkF~~~Serine/threonine-protein kinase F~~~COG0515
MDLLCTRPGCARLNSFPDLDNRNTLQTVQQRFCTSCRMPLILAGRYLPVKLLGQGGFGAAYLALDRFTPTMRFCVVKQFQ
PSGNLNQEQLDLALSLFEREAVVLEKLGNRHDQIPDLFAYFPLLVDDPRTGKQDQFFYLVQEFINGQDLEKTVEKHGPLS
EAEVRWVLTEMLKILSFVHGTGAIHRDIKPSNLMRDQEGKLYLLDFGAVKQATAGVGASNEGSTGIYSMGFAPPEQMAGN
QVYPATDLYALAVTCLYLLTGKTAQDLYDAYHNQWNWRSPGLKVSQPLADVIDRLLLPTPKDRYASAEEVLAVLNGGKGN
QGKAPPGATVSTPQGTNTQIQPTPASSASPLTAPKTPGKISQAVQNLPVLKVLFQGALTGSALVFWGIIAVSLFPQTNIS
LGILGMVVAGIILAQFKRWLEVTEMLSLNTLTILALLAVPGLSRWPKIVELATQLDFPVLVTVIIAAIAGAIAVVATIAL
FLLILKLLFAVLTRV
>Q2FXC2 3.4.21.-~~~splA~~~Serine protease SplA~~~COG3591
MNKNVMVKGLTALTILTSLGFAENISNQPHSIAKAEKNVKEITDATKEPYNSVVAFVGGTGVVVGKNTIVTNKHIAKSND
IFKNRVSAHHSSKGKGGGNYDVKDIVEYPGKEDLAIVHVHETSTEGLNFNKNVSYTKFADGAKVKDRISVIGYPKGAQTK
YKMFESTGTINHISGTFMEFDAYAQPGNSGSPVLNSKHELIGILYAGSGKDESEKNFGVYFTPQLKEFIQNNIEK
>Q5HEW0 3.4.21.-~~~splA~~~Serine protease SplA~~~
MNKNVMVKGLTALTILTILTSLGFAENISNQPHSIAKAEKNVKEITDATKEPYNSVVAFVGGTGVVVGKNTIVTNKHIAK
SNDIFKNRVSAHHSSKGKGGGNYDVKDIVEYPGKEDLAIVHVHETSTEGLNFNKNVSYTKFADGAKVKDRISVIGYPKGA
QTKYKMFESTGTINHISGTFMEFDAYAQPGNSGSPVLNSKHELIGILYAGSGKDESEKNFGVYFTPQLKEFIQNNIEK
>Q97L63 4.1.99.14~~~splB~~~Spore photoproduct lyase~~~COG1533
MENMFRRVIFEKKALDYPMGRDILRQFENTDIEIRYSETGRITGIPGKDEAQSFFEGKNTLVVGVRRELDFQTCKPSANY
QLPIVSGCAAMCEYCYLNTHGGKKPYVKINVNLDDILSKAGEYIEKRKPDITVFEGAAISDPVPVERYSGALKKAIEYFG
KNEYSRFRFVTKYADISELLAVQHNNHTTIRFSINTPRVIKNYEHRTSSLEDRIESAYNILNSGYKTGFIVGPVFLYENW
KKEYEELLKKASDKLGDKELEFEIISHRFTTSAKNKILKVFPNTKLPMDDEARKFKFGQFGYGKYVYDKDDMQEIKEFFI
NNINLYFNKATIKYII
>Q2FXC3 3.4.21.-~~~splB~~~Serine protease SplB~~~COG3591
MNKNVVIKSLAALTILTSVTGIGTTLVEEVQQTAKAENNVTKVKDTNIFPYTGVVAFKSATGFVVGKNTILTNKHVSKNY
KVGDRITAHPNSDKGNGGIYSIKKIINYPGKEDVSVIQVEERAIERGPKGFNFNDNVTPFKYAAGAKAGERIKVIGYPHP
YKNKYVLYESTGPVMSVEGSSIVYSAHTESGNSGSPVLNSNNELVGIHFASDVKNDDNRNAYGVYFTPEIKKFIAENIDK
>Q2FXC4 3.4.21.-~~~splC~~~Serine protease SplC~~~COG3591
MNKNIVIKSMAALAILTSVTGINAAVVEETQQIANAEKNVTQVKDTNIFPYNGVVSFKDATGFVIGKNTIITNKHVSKDY
KVGDRITAHPNGDKGNGGIYKIKSISDYPGDEDISVMNIEEQAVERGPKGFNFNENVQAFNFAKDAKVDDKIKVIGYPLP
AQNSFKQFESTGTIKRIKDNILNFDAYIEPGNSGSPVLNSNNEVIGVVYGGIGKIGSEYNGAVYFTPQIKDFIQKHIEQ
>Q5HEW2 3.4.21.-~~~splC~~~Serine protease SplC~~~
MNKNIVIKSMAALAILTSVTGINAAVVEETQQIANAEKNVTQVKDTNIFPYNGVVSFKDATGFVIGKNTIITNKHVSKDY
KVGDRITAHPNGDKGNGGIYKIKSISDYPGDEDISVMNIEEQAVERGPKGFNFNENVQAFNFAKDAKVDDKIKVIGYPLP
AQNSFKQFESTGTIKRIKDNILNFDAYIEPGNSGSPVLNSNNEVIGVVYGGIGKIGSEYNGAVYFTPQIKDFIQKHIEQ
>Q53782 3.4.21.-~~~splC~~~Serine protease SplC~~~
MNKNIVIKSMAALAILTSVTGINAAVVEETQQIANAEKNVTQVKDTNNFPYNGVVSFKDATGFVIGKNTIITNKHVSKDY
KVGDRITAHPNGDKGNGGIYKIKSISDYPGDEDISVMNIEEQAVERGPKGFNFNENVQAFNFAKDAKVDDKIKVIGYPLP
AQNSFKQFESTGTIKRIKDNILNFDAYIEPGNSGSPVLNSNNEVIGVVYGGIGKIGSEYNGAVYFTPQIKDFIQKHIEQ
>Q2FXC5 3.4.21.-~~~splD~~~Serine protease SplD~~~COG3591
MNKNIIIKSIAALTILTSITGVGTTVVDGIQQTAKAENSVKLITNTNVAPYSGVTWMGAGTGFVVGNHTIITNKHVTYHM
KVGDEIKAHPNGFYNNGGGLYKVTKIVDYPGKEDIAVVQVEEKSTQPKGRKFKDFTSKFNIASEAKENEPISVIGYPNPN
GNKLQMYESTGKVLSVNGNIVTSDAVVQPGSSGSPILNSKREAIGVMYASDKPTGESTRSFAVYFSPEIKKFIADNLDK
>Q2FXC7 3.4.21.-~~~splE~~~Serine protease SplE~~~COG3591
MNKNIIIKSIAALTILTSVTGVGTTVVEGIQQTAKAEHNVKLIKNTNVAPYNGVVSIGSGTGFIVGKNTIVTNKHVVAGM
EIGAHIIAHPNGEYNNGGFYKVKKIVRYSGQEDIAILHVEDKAVHPKNRNFKDYTGILKIASEAKENERISIVGYPEPYI
NKFQMYESTGKVLSVKGNMIITDAFVEPGNSGSAVFNSKYEVVGVHFGGNGPGNKSTKGYGVYFSPEIKKFIADNTDK
>Q2FXC8 3.4.21.-~~~splF~~~Serine protease SplF~~~COG3591
MNKNIIIKSIAALTILTSITGVGTTMVEGIQQTAKAENTVKQITNTNVAPYSGVTWMGAGTGFVVGNHTIITNKHVTYHM
KVGDEIKAHPNGFYNNGGGLYKVTKIVDYPGKEDIAVVQVEEKSTQPKGRKFKDFTSKFNIASEAKENEPISVIGYPNPN
GNKLQMYESTGKVLSVNGNIVSSDAIIQPGSSGSPILNSKHEAIGVIYAGNKPSGESTRGFAVYFSPEIKKFIADNLDK
>P37956 4.1.99.14~~~splB~~~Spore photoproduct lyase~~~COG1533
MQNPFVPQLVYIEPRALEYPLGQELQDKFENMGIEIRETTSHNQVRNIPGKNHLQQYRNAKSTLVIGVRKTLKFDSSKPS
AEYAIPFATGCMGHCHYCYLQTTMGSKPYIRTYVNVEEILDQADKYMKERAPEFTRFEASCTSDIVGIDHLTHTLKRAIE
HFGQSDLGKLRFVTKFHHVDHLLDAKHNGKTRFRFSINADYVIKNFEPGTSPLDKRIEAAVKVAKAGYPLGFIVAPIYIH
EGWEEGYRHLFEKLDAALPQDVRHDITFELIQHRFTKPAKRVIEKNYPKTKLELDEEKRRYKWGRYGIGKYIYQKDEEHA
LREALESYIDTFFPNAKIEYFT
>C9RZ55 4.1.99.14~~~splG~~~Spore photoproduct lyase~~~
MKPFVPKLVYFEPEALSYPLGQELYEKFTQMGIEIRETTSHNQVRGIPGETELARYRNAKSTLVVGVRRTLKFDSSKPSA
EYAIPLATGCMGHCHYCYLQTTLGSKPYIRVYVNLDDIFAQAQKYIDERAPEITRFEAACTSDIVGIDHLTHSLKKAIEF
IGATDYGRLRFVTKYEHVDHLLDAKHNGKTRFRFSVNSRYVINHFEPGTSSFDARLQAARKVAGAGYKLGFVVAPIYRHD
GWEQGYFELFQELARQLEGVDLSDLTFELIQHRFTKPAKRVIEQRYPKTKLDLDESKRKYKWGRYGIGKYVYRDKEAREL
EETMRSYIARFFPSAQVQYFT
>P35157 ~~~spmA~~~Spore maturation protein A~~~COG2715
MVNIIWVSLTVIGLVFAMCNGTLQDVNEAVFKGAKEAITISFGLMSVLVFWLGLMKIAEQSGLLDIFSRMCRPFISKLFP
DIPPDHPAMGYILSNLMANFFGLGNAATPLGIKAMEQMKKLNGNRSEASRSMITFLAVNTSCITLIPTTVIAVRMAYSSK
TPTDIVGPSILATLISGIGAIIIDRYFYYRRKKKGR
>P35158 ~~~spmB~~~Spore maturation protein B~~~COG0700
MEIINWLSLAMIPIIIAGILLYGTIKKVPTYESFVEGGKEGIEIAFSIIPYLVGMLVAITVFRSSGALDFIMDLLKPAFS
AIGIPAEVVPLALIRPISGTAALGMTTDLIAVYGPDSFIGRLASVMQGSTDTTLYVLTVYFGAVGIKKMGDALKVGLLAD
LIGVVASIIIVTLLFGSA
>Q9ALN5 1.1.1.384~~~spnN~~~dTDP-3,4-didehydro-2,6-dideoxy-alpha-D-glucose 3-reductase~~~
MRKPVRIGVLGCASFAWRRMLPAMCDVAETEVVAVASRDPAKAERFAARFECEAVLGYQRLLERPDIDAVYVPLPPGMHA
EWIGKALEADKHVLAEKPLTTTASDTARLVGLARRKNLLLRENYLFLHHGRHDVVRDLLQSGEIGELREFTAVFGIPPLP
DTDIRYRTELGGGALLDIGVYPARAARHFLLGPLTVLGASSHEAQESGVDLSGSVLLQSEGGTVAHLGYGFVHHYRSAYE
LWGSRGRIVVDRAFTPPAEWQAVIRIERKGVVDELSLPAEDQVRKAVTAFARDIRAGTGVDDPAVAGDSGESMIQQAALV
EAIGQARRCGST
>Q9ALN6 4.2.1.159~~~spnO~~~dTDP-4-dehydro-6-deoxy-alpha-D-glucopyranose 2,3-dehydratase~~~
MSSSVEAEASAAAPLGSNNTRRFVDSALSACNGMIPTTEFHCWLADRLGENSFETNRIPFDRLSKWKFDASTENLVHADG
RFFTVEGLQVETNYGAAPSWHQPIINQAEVGILGILVKEIDGVLHCLMSAKMEPGNVNVLQLSPTVQATRSNYTQAHRGS
VPPYVDYFLGRGRGRVLVDVLQSEQGSWFYRKRNRNMVVEVQEEVPVLPDFCWLTLGQVLALLRQDNIVNMDTRTVLSCI
PFHDSATGPELAASEEPFRQAVARSLSHGIDSSSISEAVGWFEEAKARYRLRATRVPLSRVDKWYRTDTEIAHQDGKYFA
VIAVSVSATNREVASWTQPMIEPREQGEIALLVKRIGGVLHGLVHARVEAGYKWTAEIAPTVQCSVANYQSTPSNDWPPF
LDDVLTADPETVRYESILSEEGGRFYQAQNRYRIIEVHEDFAARPPSDFRWMTLGQLGELLRSTHFLNIQARSLVASLHS
LWALGR
>Q9ALN8 4.2.1.164~~~spnQ~~~dTDP-4-dehydro-2,6-dideoxy-D-glucose 3-dehydratase~~~
MQSRKTRALGKGRARVTSCDDTCATATEMVPDAKDRILASVRDYHREQESPTFVAGSTPIRPSGAVLDEDDRVALVEAAL
ELRIAAGGNARRFESEFARFFGLRKAHLVNSGSSANLLALSSLTSPKLGEARLRPGDEVITAAVGFPTTINPAVQNGLVP
VFVDVELGTYNATPDRIKAAVTERTRAIMLAHTLGNPFAADEIAEIAKEHELFLVEDNCDAVGSTYRGRLTGTFGDLTTV
SFYPAHHITSGEGGCVLTGSLELARIIESLRDWGRDCWCEPGVDNTCRKRFDYHLGTLPPGYDHKYTFSHVGYNLKTTDL
QAALALSQLSKISAFGSARRRNWRRLREGLSGLPGLLLPVATPHSDPSWFGFAITISADAGFTRAALVNFLESRNIGTRL
LFGGNITRHPAFEQVRYRIADALTNSDIVTDRTFWVGVYPGITDQMIDYVVESIAEFVAKSS
>Q9ALN9 2.6.1.110~~~spnR~~~dTDP-4-dehydro-2,3,6-trideoxy-D-glucose 4-aminotransferase~~~
MINLHQPILGTEELDAIAEVFASNWIGLGPRTRTFEAEFAHHLGVDPEQVVFLNSGTAALFLTVQVLDLGPGDDVVLPSI
SFVAAANAIASSGARPVFCDVDPRTLNPTLDDVARAITPATKAVLLLHYGGSPGEVTAIADFCREKGLMLIEDSACAVAS
SVHGTACGTFGDLATWSFDAMKILVTGDGGMFYAADPELAHRARRLAYHGLEQMSGFDSAKSSNRWWDIRVEDIGQRLIG
NDMTAALGSVQLRKLPEFINRRREIATQYDRLLSDVPGVLLPPTLPDGHVSSHYFYWVQLAPEIRDQVAQQMLERGIYTS
YRYPPLHKVPIYRADCKLPSAEDACRRTLLLPLHPSLDDAEVRTVADEFQKAVEHHISQRSPLRK
>Q9ALP0 2.1.1.324~~~spnS~~~dTDP-4-amino-2,3,4,6-tetradeoxy-D-glucose N,N-dimethyltransferase~~~
MSRVSDTFAETSSVYSPDHADIYDAIHSARGRDWAAEAGEVVQLVRTRLPEAQSLLDVACGTGAHLERFRAEYAKVAGLE
LSDAMREIAIRRVPEVPIHIGDIRDFDLGEPFDVITCLCFTAAYMRTVDDLRRVTRNMARHLAPGGVAVIEPWWFPDKFI
DGFVTGAVAHHGERVISRLSHSVLEGRTSRMTVRYTVAEPTGIRDFTEFEILSLFTEDEYTAALEDAGIRAEYLPGAPNG
RGLFVGIRN
>P0AG24 ~~~spoT~~~Bifunctional (p)ppGpp synthase/hydrolase SpoT~~~COG0317
MYLFESLNQLIQTYLPEDQIKRLRQAYLVARDAHEGQTRSSGEPYITHPVAVACILAEMKLDYETLMAALLHDVIEDTPA
TYQDMEQLFGKSVAELVEGVSKLDKLKFRDKKEAQAENFRKMIMAMVQDIRVILIKLADRTHNMRTLGSLRPDKRRRIAR
ETLEIYSPLAHRLGIHHIKTELEELGFEALYPNRYRVIKEVVKAARGNRKEMIQKILSEIEGRLQEAGIPCRVSGREKHL
YSIYCKMVLKEQRFHSIMDIYAFRVIVNDSDTCYRVLGQMHSLYKPRPGRVKDYIAIPKANGYQSLHTSMIGPHGVPVEV
QIRTEDMDQMAEMGVAAHWAYKEHGETSTTAQIRAQRWMQSLLELQQSAGSSFEFIESVKSDLFPDEIYVFTPEGRIVEL
PAGATPVDFAYAVHTDIGHACVGARVDRQPYPLSQPLTSGQTVEIITAPGARPNAAWLNFVVSSKARAKIRQLLKNLKRD
DSVSLGRRLLNHALGGSRKLNEIPQENIQRELDRMKLATLDDLLAEIGLGNAMSVVVAKNLQHGDASIPPATQSHGHLPI
KGADGVLITFAKCCRPIPGDPIIAHVSPGKGLVIHHESCRNIRGYQKEPEKFMAVEWDKETAQEFITEIKVEMFNHQGAL
ANLTAAINTTTSNIQSLNTEEKDGRVYSAFIRLTARDRVHLANIMRKIRVMPDVIKVTRNRN
>Q9KNM2 3.1.7.2~~~spoT~~~Guanosine-3',5'-bis(diphosphate) 3'-pyrophosphohydrolase~~~COG0317
MYLFDSLKDVAQEYLTEPQIEALRQSYVVARDAHEGQTRSSGEPYIIHPVAVARILAEMRLDLETLQAALLHDVIEDCDV
TKEDLDAHFGSSVAELVDGVSKLDKLKFRDRKEAQAENFRKMVLAMVQDIRVILIKLADRTPNMRTLGALRPDKKRRIAR
ETLEIYAPLAHRLGIHNIKTELEELGFEALYPNRYRVLKEVVKAARGNRKEMIQRIHSEIEGRLQEVGLPARVVGREKNL
FSIYNKMKTKEQRFHTIMDIYAFRIVVDTADTCYRVLGQVHSLYKPRPARMKDYIAVPKANGYQSLHTSMVGPHGVPVEV
QIRTEDMDQMADKGVAAHWSYKANSERGGTTAQIKAQRWMQSLLELQQSAGNSFEFIENVKSDLFPDEIYVFTPKGRIVE
LPMGATAVDFAYAVHTDIGNTCVGARVDRTPYPLSQSLKSGQTVEIISAPGARPNAAWLNYVVTSRARTKIRQVLKTMRR
EDSITLGRRLLNHALGEHSVNEIAPENISKVLSDLKIASMDDLLAAIGLGELMSIVIARRLLGNADELTEPSKSGGNKNK
LPIRGAEGILLTFANCCHPIPDDHIIAHVSPGRGLVVHRETCPNVRGYQKEPDKYMAVEWTKDYDQEFITELKVDMHNRQ
GALAELTNVISKTGSNIHGLSTEERDGRLYTVTVLLTTKDRVHLAGIMRKIRTMPHALKVRRRKN
>P37817 ~~~spoVM~~~Stage V sporulation protein M~~~
MKFYTIKLPKFLGGIVRAMLGSFRKD
>P37554 ~~~spoVT~~~Stage V sporulation protein T~~~COG2002
MKATGIVRRIDDLGRVVIPKEIRRTLRIREGDPLEIFVDRDGEVILKKYSPISELGDFAKEYADALYDSLGHSVLICDRD
VYIAVSGSSKKDYLNKSISEMLERTMDQRSSVLESDAKSVQLVNGIDEDMNSYTVGPIVANGDPIGAVVIFSKDQTMGEV
EHKAVETAAGFLARQMEQ
>O34525 3.4.21.-~~~sppA~~~Putative signal peptide peptidase SppA~~~COG0616
MNAKRWIALVIALGIFGVSIIVSISMSFFESVKGAQTDLTSLTDESQEKTLENGSPSSKIAVLEVSGTIQDNGDSSSLLG
ADGYNHRTFLKNLERAKDDKTVKGIVLKVNSPGGGVYESAEIHKKLEEIKKETKKPIYVSMGSMAASGGYYISTAADKIF
ATPETLTGSLGVIMESVNYSKLADKLGISFETIKSGAHKDIMSPSREMTKEEKNIMQSMVDNSYEGFVDVISKGRGMPKA
EVKKIADGRVYDGRQAKKLNLVDELGFYDDTITAMKKDHKDLKNASVISYEESFGLGSLFSMGANKMFKSEIDFLNMREI
LSQSGSPRMMYLYAK
>P08395 3.4.21.-~~~sppA~~~Protease 4~~~COG0616
MRTLWRFIAGFFKWTWRLLNFVREMVLNLFFIFLVLVGVGIWMQVSGGDSKETASRGALLLDISGVIVDKPDSSQRFSKL
SRQLLGASSDRLQENSLFDIVNTIRQAKDDRNITGIVMDLKNFAGGDQPSMQYIGKALKEFRDSGKPVYAVGENYSQGQY
YLASFANKIWLSPQGVVDLHGFATNGLYYKSLLDKLKVSTHVFRVGTYKSAVEPFIRDDMSPAAREADSRWIGELWQNYL
NTVAANRQIPAEQVFPGAQGLLEGLTKTGGDTAKYALENKLVDALASSAEIEKALTKEFGWSKTDKNYRAISYYDYALKT
PADTGDSIGVVFANGAIMDGEETQGNVGGDTTAAQIRDARLDPKVKAIVLRVNSPGGSVTASEVIRAELAAARAAGKPVV
VSMGGMAASGGYWISTPANYIVANPSTLTGSIGIFGVITTVENSLDSIGVHTDGVSTSPLADVSITRALPPEAQLMMQLS
IENGYKRFITLVADARHSTPEQIDKIAQGHVWTGQDAKANGLVDSLGDFDDAVAKAAELAKVKQWHLEYYVDEPTFFDKV
MDNMSGSVRAMLPDAFQAMLPAPLASVASTVKSESDKLAAFNDPQNRYAFCLTCANMR
>P45243 3.4.21.-~~~sppA~~~Protease 4~~~COG0616
MFQVLKFCWKVLCFIRDLVMNVVFLGFVLLLVAIISFSSGGKKSTALTSEGALLLNLDGYLADNRDETLRWQDALSELNG
EHVPRKISTFDVVFAIQQAEDDPKIKGLVLDLNYFEGADLPALDFIGGAISHFKDAGKPVIAYADNYSQGQYYLASFADE
IYLNSIGSVDIHGLSQENLYFKEMLDKLAVTPHIFRVGTYKSAVEPFLRNDMSAEAKANMQRWLGEMWNNYVLSVSENRN
IKKDRILPNAKQYLAELKALKGNSTAYAQQRGLVTDVVTRLDLDKKLSALFGKGSDGKANLIEFDDYLTQLPDRLEHYNV
PNKIAVVNVEGTIIDGESDEENAGGDTIARILRKAHDDNSVKAVILRVNSPGGSAFASEIIRQETENLQKIGKPVIVSMG
AMAASGGYWISSTADYIIADSNTITGSIGIFTMFPTFENSIKKIGVHADGVSTTELANTSAFSPLAKPVQDIYQTEIEHG
YDRFLEIVSKGRQLSKTQVDKLAQGQVWLGSDAFQNGLVDEIGSFNEAVNKAEQLVNQRQDTAVQDFSVEWFTDDNVSLI
STLLSDTKKGAQEQLVKWLGLPAPIQKLQKELNILTKFNDPKGQYLYCLNCGKVK
>Q8KES3 1.1.1.325~~~~~~Sepiapterin reductase~~~COG4221
MKHILLITGAGKGIGRAIALEFARAARHHPDFEPVLVLSSRTAADLEKISLECRAEGALTDTITADISDMADVRRLTTHI
VERYGHIDCLVNNAGVGRFGALSDLTEEDFDYTMNTNLKGTFFLTQALFALMERQHSGHIFFITSVAATKAFRHSSIYCM
SKFGQRGLVETMRLYARKCNVRITDVQPGAVYTPMWGKVDDEMQALMMMPEDIAAPVVQAYLQPSRTVVEEIILRPTSGD
IQDD
>P39621 ~~~spsA~~~Spore coat polysaccharide biosynthesis protein SpsA~~~COG0463
MPKVSVIMTSYNKSDYVAKSISSILSQTFSDFELFIMDDNSNEETLNVIRPFLNDNRVRFYQSDISGVKERTEKTRYAAL
INQAIEMAEGEYITYATDDNIYMPDRLLKMVRELDTHPEKAVIYSASKTYHLNENRDIVKETVRPAAQVTWNAPCAIDHC
SVMHRYSVLEKVKEKFGSYWDESPAFYRIGDARFFWRVNHFYPFYPLDEELDLNYITDQSIHFQLFELEKNEFVRNLPPQ
RNCRELRESLKKLGMG
>P39625 ~~~spsE~~~Spore coat polysaccharide biosynthesis protein SpsE~~~COG2089
MAAFQIANKTVGKDAPVFIIAEAGINHDGKLDQAFALIDAAAEAGADAVKFQMFQADRMYQKDPGLYKTAAGKDVSIFSL
VQSMEMPAEWILPLLDYCREKQVIFLSTVCDEGSADLLQSTSPSAFKIASYEINHLPLLKYVARLNRPMIFSTAGAEISD
VHEAWRTIRAEGNNQIAIMHCVAKYPAPPEYSNLSVIPMLAAAFPEAVIGFSDHSEHPTEAPCAAVRLGAKLIEKHFTID
KNLPGADHSFALNPDELKEMVDGIRKTEAELKQGITKPVSEKLLGSSYKTTTAIEGEIRNFAYRGIFTTAPIQKGEAFSE
DNIAVLRPGQKPQGLHPRFFELLTSGVRAVRDIPADTGIVWDDILLKDSPFHE
>P74873 ~~~sptP~~~Secreted effector protein SptP~~~
MLKYEERKLNNLTLSSFSKVGVSNDARLYIAKENTDKAYVAPEKFSSKVLTWLGKMPLFKNTEVVQKHTENIRVQDQKIL
QTFLHALTEKYGETAVNDALLMSRINMNKPLTQRLAVQITECVKAADEGFINLIKSKDNVGVRNAALVIKGGDTKVAEKN
NDVGAESKQPLLDIALKGLKRTLPQLEQMDGNSLRENFQEMASGNGPLRSLMTNLQNLNKIPEAKQLNDYVTTLTNIQVG
VARFSQWGTCGGEVERWVDKASTHELTQAVKKIHVIAKELKNVTAELEKIEAGAPMPQTMSGPTLGLARFAVSSIPINQQ
TQVKLSDGMPVPVNTLTFDGKPVALAGSYPKNTPDALEAHMKMLLEKECSCLVVLTSEDQMQAKQLPPYFRGSYTFGEVH
TNSQKVSSASQGEAIDQYNMQLSCGEKRYTIPVLHVKNWPDHQPLPSTDQLEYLADRVKNSNQNGAPGRSSSDKHLPMIH
CLGGVGRTGTMAAALVLKDNPHSNLEQVRADFRDSRNNRMLEDASQFVQLKAMQAQLLMTTAS
>A7BFV8 2.3.1.50~~~spt~~~Serine palmitoyltransferase~~~
MKHNLQDNLQGEQMANTNSNGGKKPFSDAKIIERANLLRDNDLYFFFRAIEETEASTVTVKGKKQIMIGSNNYLGLTHHP
AVKEAAIKAVEKYGTGCTGSRFLNGNLNIHEELDEKLAAYLGHEKAIVFSTGMQANLGALSAICGPKDLMLFDSENHASI
IDASRLSLGTTFKYKHNDMASLEELLESNMSRFNRVIIVADGVFSMTGDILRLPEVVKLAKKYGAYVYVDDAHGLGVMGP
QGRGTMAHFDVTKDVDFNMGTFSKSFASIGGVISGSKDAIDYVRHSARSFMFSASMPPAAVATVSACIDVVQNDETILNN
LWSNVEFMRNGFKELGFFTYGSQTPIIPLFIGDDMKALKMTKWLESKGVFCTPVLPPAVPKGETLIRTSYMASHNREDLS
TVLEVFAEAKKIFDIPNHLH
>Q8A9E5 2.3.1.50~~~spt~~~Serine palmitoyltransferase~~~COG0156
MGLLQEKLAKYDLPQKFMAQGVYPYFREIEGKQGTEVEMGGQHVLMFGSNAYTGLTGDERVIEAGIKAMRKYGSGCAGSR
FLNGTLDLHVQLEKELAAFVGKDEALCFSTGFTVNSGVISCLTDRNDYIICDDRDHASIVDGRRLSFSQQLKYKHNDMAD
LEKQLQKCNPDSVKLIIVDGVFSMEGDLANLPEIVRLKHKYNATIMVDEAHGLGVFGKQGRGVCDHFGLTHEVDLIMGTF
SKSLASIGGFIAADSSIINWLRHNARTYIFSASNTPAATAAALEALHIIQNEPERLNALWEATNYALRRFREAGFEIGAT
ESPIIPLYVRDTEKTFMVTKLAFDEGVFINPVIPPACAPQDTLVRVALMATHTKEQIDSAVEKLVKAFKALDLL
>A0A0H3C7E9 2.3.1.50~~~spt~~~Serine palmitoyltransferase~~~
MGLFDKHLAYRDAYKAIQDVGANPFKVRFDAVHSPTEGVVDGRPTILLGTNNYLGLTFDEQAIAASVKAVQERGTGTTGS
RIANGSFESHVELEQELAKFYGRKHAMVFTTGYQANLGVLSTLVGRGDHLILDADSHASIYDGSRLGHAEVIRFRHNDPE
DLAKRLRRLGDAPGERLIVVEGIYSMIGDVAPLKEIAAVKREMGGYLLVDEAHSMGVLGATGRGLAEAAGVEEDVDFIVG
TFSKSLGAIGGFCVSDHDDFDVMRVICRPYMFTASLPPAVAASTVTALRRMIEQPELRDRLNRNAKRLYDGLTAMGFLTG
PSASPIVAATMPDQERAIAMWNGLLQAGVYLNLALPPATPDSRPLLRASVSAAHTDEQIDAVLKTYGEIGAALGVIEPLK
RARA
>A7BFV6 2.3.1.50~~~spt~~~Serine palmitoyltransferase~~~
MSKGKLGEKISQFKIVEELKAKGLYAYFRPIQSKQDTEVKIDGRRVLMFGSNSYLGLTTDTRIIKAAQDALEKYGTGCAG
SRFLNGTLDIHVELEEKLSAYVGKEAAILFSTGFQSNLGPLSCLMGRNDYILLDERDHASIIDGSRLSFSKVIKYGHNNM
EDLRAKLSRLPEDSAKLICTDGIFSMEGDIVNLPELTSIANEFDAAVMVDDAHSLGVIGHKGAGTASHFGLNDDVDLIMG
TFSKSLASLGGFVAGDADVIDFLKHNARSVMFSASMTPASVASTLKALEIIQNEPEHIEKLWKNTDYAKAQLLDHGFDLG
ATESPILPIFIRSNEKTFWVTKMLQDDGVFVNPVVSPAVPAEESLIRFSLMATHTYDQIDEAIEKMVKVFKQAEVETLI
>Q93UV0 2.3.1.50~~~spt~~~Serine palmitoyltransferase~~~
MTEAAAQPHALPADAPDIAPERDLLSKFDGLIAERQKLLDSGVTDPFAIVMEQVKSPTEAVIRGKDTILLGTYNYMGMTF
DPDVIAAGKEALEKFGSGTNGSRMLNGTFHDHMEVEQALRDFYGTTGAIVFSTGYMANLGIISTLAGKGEYVILDADSHA
SIYDGCQQGNAEIVRFRHNSVEDLDKRLGRLPKEPAKLVVLEGVYSMLGDIAPLKEMVAVAKKHGAMVLVDEAHSMGFFG
PNGRGVYEAQGLEGQIDFVVGTFSKSVGTVGGFVVSNHPKFEAVRLACRPYIFTASLPPSVVATATTSIRKLMTAHEKRE
RLWSNARALHGGLKAMGFRLGTETCDSAIVAVMLEDQEQAAMMWQALLDGGLYVNMARPPATPAGTFLLRCSICAEHTPA
QIQTVLGMFQAAGRAVGVIG
>A7BFV7 2.3.1.50~~~spt~~~Serine palmitoyltransferase~~~
MSKGKLSERISHFNIVEELKSKGLYAYFRPIQSKQDTEVMIDGKRVLMFGSNSYLGLTIDPRIIEAAQDALSKYGTGCAG
SRFLNGTLDIHIELEHKLSQLVGKEASILFSTGFQSNLGPISCLMGRNDYILLDERDHASIIDGSRLSFSKVIKYGHNDM
DDLRAKLSRLPSESAKLIVTDGIFSMEGDIVNLPEMVKIADEYDAALMVDDAHSLGVIGEHGAGTASHFGLTDKVDLIMG
TFSKSLASLGGFVAGDADVIDYLKHNARSVMFSASMTPASVASTLKALEIMISEPEHMENLWKNTNYAKQQLLESGFDLG
ATESPILPIFIRNNEKTFWVTKMLQDDGVFVNPVVSPAVPSEESLIRFSLMATHTFDQIDEAVEKMVRVFKQAEIESLI
>Q9I6J2 2.6.1.113~~~spuC~~~Putrescine--pyruvate aminotransferase~~~
MNSQITNAKTREWQALSRDHHLPPFTDYKQLNEKGARIITKAEGVYIWDSEGNKILDAMAGLWCVNVGYGREELVQAATR
QMRELPFYNLFFQTAHPPVVELAKAIADVAPEGMNHVFFTGSGSEANDTVLRMVRHYWATKGQPQKKVVIGRWNGYHGST
VAGVSLGGMKALHEQGDFPIPGIVHIAQPYWYGEGGDMSPDEFGVWAAEQLEKKILEVGEENVAAFIAEPIQGAGGVIVP
PDTYWPKIREILAKYDILFIADEVICGFGRTGEWFGSQYYGNAPDLMPIAKGLTSGYIPMGGVVVRDEIVEVLNQGGEFY
HGFTYSGHPVAAAVALENIRILREEKIIEKVKAETAPYLQKRWQELADHPLVGEARGVGMVAALELVKNKKTRERFTDKG
VGMLCREHCFRNGLIMRAVGDTMIISPPLVIDPSQIDELITLARKCLDQTAAAVLA
>Q02UB7 ~~~spuD~~~Putrescine-binding periplasmic protein SpuD~~~
MMKRFGKTLLALTLAGSVAGMAQAADNKVLHVYNWSDYIAPDTLEKFTKETGIKVVYDVYDSNEVLEAKLLAGKSGYDVV
VPSNSFLAKQIKAGVYQKLDKSKLPNWKNLNKDLMHTLEVSDPGNEHAIPYMWGTIGIGYNPDKVKAAFGDNAPVDSWDL
VFKPENIQKLKQCGVSFLDSPTEILPAALHYLGYKPDTDNPKELKAAEELFLKIRPYVTYFHSSKYISDLANGNICVAIG
YSGDIYQAKSRAEEAKNKVTVKYNIPKEGAGSFFDMVAIPKDAENTEGALAFVNFLMKPEIMAEITDVVQFPNGNAAATP
LVSEAIRNDPGIYPSEEVMKKLYTFPDLPAKTQRAMTRSWTKIKSGK
>Q9I6J1 ~~~spuD~~~Putrescine-binding periplasmic protein SpuD~~~
MMKRFGKTLLALTLAGSVAGMAQAADNKVLHVYNWSDYIAPDTLEKFTKETGIKVVYDVYDSNEVLEAKLLAGKSGYDVV
VPSNSFLAKQIKAGVYQKLDKSKLPNWKNLNKDLMHTLEVSDPGNEHAIPYMWGTIGIGYNPDKVKAAFGDNAPVDSWDL
VFKPENIQKLKQCGVSFLDSPTEILPAALHYLGYKPDTDNPKELKAAEELFLKIRPYVTYFHSSKYISDLANGNICVAIG
YSGDIYQAKSRAEEAKNKVTVKYNIPKEGAGSFFDMVAIPKDAENTEGALAFVNFLMKPEIMAEITDVVQFPNGNAAATP
LVSEAIRNDPGIYPSEEVMKKLYTFPDLPAKTQRAMTRSWTKIKSGK
>Q9I6J0 ~~~spuE~~~Spermidine-binding periplasmic protein SpuE~~~
MQHSIGKTLLVAALATAIAGPVQAEKKSLHIYNWTDYIAPTTLKDFTKESGIDVSYDVFDSNETLEGKLVSGHSGYDIVV
PSNNFLGKQIQAGAFQKLDKSKLPNWKNLDPALLKQLEVSDPGNQYAVPYLWGTNGIGYNVAKVKEVLGDQPIDSWAILF
EPENMKKLAKCGVAFMDSGDEMLPAALNYLGLDPNTHDPKDYKKAEEVLTKVRPYVSYFHSSKYISDLANGNICVAFGYS
GDVFQAAARAEEAGKGIDIQYVIPKEGANLWFDLMAIPADAKAADNAYAFIDYLLRPEVIAKVSDYVGYANAIPGARPLM
DKSVSDSEEVYPPQAVLDKLYVSAVLPAKVLRLQTRTWTRIKTGK
>P24419 2.4.2.31~~~spvB~~~Mono(ADP-ribosyl)transferase SpvB~~~
MLILNGFSSATLALITPPFLPKGGKALSQSGPDGLASITLPLPISAERGFAPALALHYSSGGGNGPFGVGWSCATMSIAR
RTSHGVPQYNDSDEFLGPDGEVLVQTLSTGDAPNPVTCFAYGDVSFPQSYTVTRYQPRTESSFYRLEYWVGNSNGDDFWL
LHDSNGILHLLGKTAAARLSDPQAASHTAQWLVEESVTPAGEHIYYSYLAENGDNVDLNGNEAGRDRSAMRYLSKVQYGN
ATPAADLYLWTSATPAVQWLFTLVFDYGERGVDPQVPPAFTAQNSWLARQDPFSLYNYGFEIRLHRLCRQVLMFHHFPDE
LGEADTLVSRLLLEYDENPILTQLCAARTLAYEGDGYRRAPVNNMMPPPPPPPPPMMGGNSSRPKSKWAIVEESKQIQAL
RYYSAQGYSVINKYLRGDDYPETQAKETLLSRDYLSTNEPSDEEFKNAMSVYINDIAEGLSSLPETDHRVVYRGLKLDKP
ALSDVLKEYTTIGNIIIDKAFMSTSPDKAWINDTILNIYLEKGHKGRILGDVAHFKGEAEMLFPPNTKLKIESIVNCGSQ
DFASQLSKLRLSDDATADTNRIKRIINMRVLNS
>P21454 2.4.2.31~~~spvB~~~Mono(ADP-ribosyl)transferase SpvB~~~
MLILNGFSSATLALITPPFLPKGGKALSQSGPDGLASITLPLPISAERGFAPALALPYSSGGGNGPFGVGWSCATMSIAR
RTSHGVPQYNDSDEFLGPDGEVLVQTLSTGDAPNPVTCFAYGDVSFPQSYTVTRYQPRTESSFYRLEYWVGNSNGDDFWL
LHDSNGILHLLGKTAAARLSDPQAASHTAQWLVEESVTPAGEHIYYSYLAENGDNVDLNGNEAGRDRSAMRYLSKVQYGN
ATPAADLYLWTSATPAVQWLFTLVFDYGERGVDPQVPPAFTAQNSWLARQDPFSLYNYGFEIRLLRLCRQVLMFHHFPDE
LGEADTLVSRLLLEYDENPIRTQLCAARTLAYEGDGYRRAPVNNMMPPPPPPPMMGGNSSRPKSKWAIVEESKQIQALRY
YSAQGYSVINKYLRGDDYPETQAKETLLSRDYLSTNEPSDEEFKNAMSVYINDIAEGLSSLPETDHRVVYRGLKLDKPAL
SDVLKEYTTIVNIIIDKAFMSTSPDKAWINDTILNIYLEKGHKGRILGDVAHFKGEAEMLFPPNTKLKIESIVNCGSQDF
ASQLSKLRLSDDATADTNRIKRIINMRVLNS
>P0A2M9 4.2.3.-~~~spvC~~~MAPK phosphothreonine lyase~~~
MPINRPNLNLNIPPLNIVAAYDGAEIPSTNKHLKNNFNSLHNQMRKMPVSHFKEALDVPDYSGMRQSGFFAMSQGFQLNN
HGYDVFIHARRESPQSQGKFAGDKFHISVLRDMVPQAFQALSGLLFSEDSPVDKWKVTDMEKVVQQARVSLGAQFTLYIK
PDQENSQYSASFLHKTRQFIECLESRLSENGVISGQCPESDVHPENWKYLSYRNELRSGRDGGEMQRQALREEPFYRLMT
E
>P0DPK6 4.2.3.158~~~~~~Spiroviolene synthase~~~
MTVNEIDLPPIFCPLESARHPRAHLVDERAREWIRTSPMCTTDEERTWVAASCSTDFFARFAPDAATDDRLLWTSLWVYW
GFAFDDHRCDNGPFSNRPAAFSALAGRVQRALEAPSARDESDGFIPALQEIAAQFRSFGTPLQVRRFAAAHRAWLSGVTW
QIGNAAAGRMPGLDEYVAMRLLSAGGEPPFAMLELATGLEVPAQDLERPAVRALTEMAIMVAALDNDRHSLRKELARGQT
DQNVYSVLMQETGLPLQEAVAAATRLRDRVLLRFMAVHDRVRPGAGLELSTYLQGLRYGIRGNAEWGLRVPRYLSLGRVP
DPMDEAPLEWAESPADDDRSAPRGLPTVAWWWDDALLGV
>O31606 ~~~spxH~~~ClpXP adapter protein SpxH~~~COG2761
MTNYQHELYFAHCHGHPKKPLEIYMFVDPLCPECWSLEPVIKKLKIRYGRFFTLRIIASASLTALNKKRKKHLLAEAWEK
IASRSGMSCDGNVWFEQDQPLSSPYMAALAFKAAELQGRKAGMQFLRNMQESLFVSKKNITDENVLLEIAENTSLDLEEF
KKDLHSQSAVKALQCDMKIAAEMDVSVNPTLTFFNTQHEDEGLKVPGSYSYDVYEEILFEMLGDEPKPSETPPLECFIEY
FRFVASKEIALVYDLSLEEVEKEMKKLAFAKKVAKVEAKHGMFWKSLSTYSDEYQSCEK
>Q5L1S1 ~~~spxH~~~ClpXP adapter protein SpxH~~~COG2761
MSEKFAGKTTSTCYPSQPLGNTNKPLELYLFIDPLCPECWGLEPVIKKLTIEYGRFFTLRHILSGTWATWSARKGTKPEA
MAKAWEWAANRTGMSCDGSVWLENPISSPFAPSLAIKAAEMQGKRAGLRFLRKLQEQLFLEKQNVADLSVLAECAVKAGL
DVDEFLRDMHSPGAAKAFQCDLKITSEMDVDEIPTLVLFNENIEDEGIKISGCYPYDIYVELIAEMLGFHPEPSSPPPLE
SFLSHFKFVATKEVAVVYNWTIQEAETEMKKLQLKQKVERVPVKHGTFWRYIDDSRP
>Q2G203 ~~~spxH~~~ClpXP adapter protein SpxH~~~COG2761
MAGELRIMENKSREDINLSPVSKIEIYSFFDPFSSDCFKLSAILSKLRIEYNQYIRIRHILNPSLKVLTKCQAQSTSNFD
NIALAYKAAELQGRVRAERFIHLMQNEIIPKRDIITESMICDCIQNAGIDLEVFKDDLQKSKLTESLKIDLHIAREMEIE
QAPSLVFFSEDVHEEGLKVEGLYPYHIYTYIINELMGKPIEKNLPPKLETYIQQQQLVTMEELLTIYEWPEKLLNKELKK
LAIQQKIEKLKYPDGDFWKSKMPKIKSK
>Q99V89 ~~~spxH~~~ClpXP adapter protein SpxH~~~
MAGELRIMENKSREDINLSPVSKIEIYSFFDPFSSDCFKLSAILSKLRIEYNQYIRIRHILNPSLKVLTKCQAQSTSNFD
NIALAYKAAELQGRVRAERFIHLMQNEIIPKRDIITESMICDCIQNAGIDLEVFKDDLQKSKLTESLKIDLHIAREMEIE
QAPSLVFFSEDVHEEGLKVEGLYPYHIYTYIINELMGKPIEKNLPPKLETYIQQQQLVTMEELLTIYEWPEKLLNKELKK
LAIQQKIEKLKYPDGDFWKSKMPKIKSK
>O32302 ~~~spxO~~~Anti-adapter protein SpxO~~~
MRELDEMISRLRNRGIKVEKVKYPKQTLSEKKWVHQCKQPLKTNYRDFNGYSFT
>O31602 ~~~spx~~~Global transcriptional regulator Spx~~~COG1393
MVTLYTSPSCTSCRKARAWLEEHEIPFVERNIFSEPLSIDEIKQILRMTEDGTDEIISTRSKVFQKLNVNVESMPLQDLY
RLINEHPGLLRRPIIIDEKRLQVGYNEDEIRRFLPRKVRSFQLREAQRLAN
>P60379 ~~~spx~~~Global transcriptional regulator Spx~~~
MVTLFTSPSCTSCRKAKAWLQEHDIPYTERNIFSEHLTIDEIKQILKMTEDGTDEIISTRSKTYQKLNVDIDSLPLQDLY
SIIQDNPGLLRRPIILDNKRLQVGYNEDEIRRFLPRKVRTFQLQEAQRMVD
>Q8DU17 ~~~spx~~~Global transcriptional regulator Spx~~~COG1393
MVTLFLSPSCTSCRKARAWLNRHDVVFQEHNIMTSPLSRDELLKILSYTENGTEDIISTRSKVFQKLDIDVDELSVSELI
NLISKNPSLLRRPIIMDNKRMQIGFNEDEIRAFLPRDYRKQELRQATIRAEVEGEDD
>Q8XDZ4 ~~~spy~~~Periplasmic chaperone Spy~~~COG3678
MRKLTALFVASTLALGAANLAHAADTTTAAPADAKPMMHHKGKFGPHQDMMFKDLNLTDAQKQQIREIMKGQRDQMKRPP
LEERRAMHDIIASDTFDKAKAEAQIAKMEEQRKANMLAHMETQNKIYNILTPEQKKQFNANFEKRLTERPAAKGKMPATA
E
>C6ECL5 ~~~spy~~~Periplasmic chaperone Spy~~~COG3678
MRKLTALFVASTLALGAANLAHAADTTTAAPADAKPMMHHKGKFGPHQDMMFKDLNLTDAQKQQIREIMKGQRDQMKRPP
LEERRAMHDIITSDTFDKVKAEAQIAKMEEQRKANMLAHMETQNKIYNILTPEQKKQFNANFEKRLTERPAAKGKMPATA
E
>P77754 ~~~spy~~~Periplasmic chaperone Spy~~~COG3678
MRKLTALFVASTLALGAANLAHAADTTTAAPADAKPMMHHKGKFGPHQDMMFKDLNLTDAQKQQIREIMKGQRDQMKRPP
LEERRAMHDIIASDTFDKVKAEAQIAKMEEQRKANMLAHMETQNKIYNILTPEQKKQFNANFEKRLTERPAAKGKMPATA
E
>P32138 3.2.1.199~~~yihQ~~~Sulfoquinovosidase~~~COG1501
MDTPRPQLLDFQFHQNNDSFTLHFQQRLILTHSKDNPCLWIGSGIADIDMFRGNFSIKDKLQEKIALTDAIVSQSPDGWL
IHFSRGSDISATLNISADDQGRLLLELQNDNLNHNRIWLRLAAQPEDHIYGCGEQFSYFDLRGKPFPLWTSEQGVGRNKQ
TYVTWQADCKENAGGDYYWTFFPQPTFVSTQKYYCHVDNSCYMNFDFSAPEYHELALWEDKATLRFECADTYISLLEKLT
ALLGRQPELPDWIYDGVTLGIQGGTEVCQKKLDTMRNAGVKVNGIWAQDWSGIRMTSFGKRVMWNWKWNSENYPQLDSRI
KQWNQEGVQFLAYINPYVASDKDLCEEAAQHGYLAKDASGGDYLVEFGEFYGGVVDLTNPEAYAWFKEVIKKNMIELGCG
GWMADFGEYLPTDTYLHNGVSAEIMHNAWPALWAKCNYEALEETGKLGEILFFMRAGSTGSQKYSTMMWAGDQNVDWSLD
DGLASVVPAALSLAMTGHGLHHSDIGGYTTLFEMKRSKELLLRWCDFSAFTPMMRTHEGNRPGDNWQFDGDAETIAHFAR
MTTVFTTLKPYLKEAVALNAKSGLPVMRPLFLHYEDDAHTYTLKYQYLLGRDILVAPVHEEGRSDWTLYLPEDNWVHAWT
GEAFRGGEVTVNAPIGKPPVFYRADSEWAALFASLKSI
>P0DOV5 1.1.1.390~~~~~~Sulfoquinovose 1-dehydrogenase~~~
MNRHTDTHYPSLADKVVLISGGASGIGRAFVEAFVAQGSRVAFLDLDAEAGQGLAHALGANSLFLPCDVRDIERLKACVA
EVERTWGAVDVLINNAARDDRHALADVSVEYWDERMQTNLRHAFFAAQAVAPGMARRGSGAIINMGSISWMRGRPGMVCY
TTAKAALNGMTRTLARELGGQGIRINSLVPGAIRTERQDAMWAADPAGLEAASQAFIDQQMLKFRLDASDCARLALFLAS
DDSRGCTGQNFVVDAGLSIQ
>P33247 4.2.1.129~~~shc~~~Squalene--hopene cyclase~~~COG1657
MAEQLVEAPAYARTLDRAVEYLLSCQKDEGYWWGPLLSNVTMEAEYVLLCHILDRVDRDRMEKIRRYLLHEQREDGTWAL
YPGGPPDLDTTIEAYVALKYIGMSRDEEPMQKALRFIQSQGGIESSRVFTRMWLALVGEYPWEKVPMVPPEIMFLGKRMP
LNIYEFGSWARATVVALSIVMSRQPVFPLPERARVPELYETDVPPRRRGAKGGGGWIFDALDRALHGYQKLSVHPFRRAA
EIRALDWLLERQAGDGSWGGIQPPWFYALIALKILDMTQHPAFIKGWEGLELYGVELDYGGWMFQASISPVWDTGLAVLA
LRAAGLPADHDRLVKAGEWLLDRQITVPGDWAVKRPNLKPGGFAFQFDNVYYPDVDDTAVVVWALNTLRLPDERRRRDAM
TKGFRWIVGMQSSNGGWGAYDVDNTSDLPNHIPFCDFGEVTDPPSEDVTAHVLECFGSFGYDDAWKVIRRAVEYLKREQK
PDGSWFGRWGVNYLYGTGAVVSALKAVGIDTREPYIQKALDWVEQHQNPDGGWGEDCRSYEDPAYAGKGASTPSQTAWAL
MALIAGGRAESEAARRGVQYLVETQRPDGGWDEPYYTGTGFPGDFYLGYTMYRHVFPTLALGRYKQAIERR
>Q796C3 4.2.1.137~~~sqhC~~~Sporulenol synthase~~~COG1657
MGTLQEKVRRFQKKTITELRDRQNADGSWTFCFEGPIMTNSFFILLLTSLDEGENEKELISSLAAGIHAKQQPDGTFINY
PDETRGNLTATVQGYVGMLASGCFHRTEPHMKKAEQFIISHGGLRHVHFMTKWMLAANGLYPWPALYLPLSLMALPPTLP
IHFYQFSSYARIHFAPMAVTLNQRFVLINRNISSLHHLDPHMTKNPFTWLRSDAFEERDLTSILLHWKRVFHAPFAFQQL
GLQTAKTYMLDRIEKDGTLYSYASATIYMVYSLLSLGVSRYSPIIRRAITGIKSLVTKCNGIPYLENSTSTVWDTALISY
ALQKNGVTETDGSVTKAADFLLERQHTKIADWSVKNPNSVPGGWGFSNINTNNPDCDDTTAVLKAIPRNHSPAAWERGVS
WLLSMQNNDGGFSAFEKNVNHPLIRLLPLESAEDAAVDPSTADLTGRVLHFLGEKVGFTEKHQHIQRAVKWLFEHQEQNG
SWYGRWGVCYIYGTWAALTGMHACGVDRKHPGIQKALRWLKSIQNDDGSWGESCKSAEIKTYVPLHRGTIVQTAWALDAL
LTYENSEHPSVVKGMQYLTDSSSHSADSLAYPAGIGLPKQFYIRYHSYPYVFSLLAVGKYLDSIEKETANET
>B7JBP8 1.8.5.4~~~~~~Sulfide-quinone reductase~~~COG0446
MAHVVILGAGTGGMPAAYEMKEALGSGHEVTLISANDYFQFVPSNPWVGVGWKERDDIAFPIRHYVERKGIHFIAQSAEQ
IDAEAQNITLADGNTVHYDYLMIATGPKLAFENVPGSDPHEGPVQSICTVDHAERAFAEYQALLREPGPIVIGAMAGASC
FGPAYEYAMIVASDLKKRGMRDKIPSFTFITSEPYIGHLGIQGVGDSKGILTKGLKEEGIEAYTNCKVTKVEDNKMYVTQ
VDEKGETIKEMVLPVKFGMMIPAFKGVPAVAGVEGLCNPGGFVLVDEHQRSKKYANIFAAGIAIAIPPVETTPVPTGAPK
TGYMIESMVSAAVHNIKADLEGRKGEQTMGTWNAVCFADMGDRGAAFIALPQLKPRKVDVFAYGRWVHLAKVAFEKYFIR
KMKMGVSEPFYEKVLFKMMGITRLKEEDTHRKAS
>O67931 1.8.5.4~~~sqr~~~Sulfide-quinone reductase~~~COG0446
MAKHVVVIGGGVGGIATAYNLRNLMPDLKITLISDRPYFGFTPAFPHLAMGWRKFEDISVPLAPLLPKFNIEFINEKAES
IDPDANTVTTQSGKKIEYDYLVIATGPKLVFGAEGQEENSTSICTAEHALETQKKLQELYANPGPVVIGAIPGVSCFGPA
YEFALMLHYELKKRGIRYKVPMTFITSEPYLGHFGVGGIGASKRLVEDLFAERNIDWIANVAVKAIEPDKVIYEDLNGNT
HEVPAKFTMFMPSFQGPEVVASAGDKVANPANKMVIVNRCFQNPTYKNIFGVGVVTAIPPIEKTPIPTGVPKTGMMIEQM
AMAVAHNIVNDIRNNPDKYAPRLSAICIADFGEDAGFFFADPVIPPRERVITKMGKWAHYFKTAFEKYFLWKVRNGNIAP
SFEEKVLEIFLKVHPIELCKDCEGAPGSRC
>P32140 5.3.1.31~~~yihS~~~Sulfoquinovose isomerase~~~COG2942
MKWFNTLSHNRWLEQETDRIFDFGKNSVVPTGFGWLGNKGQIKEEMGTHLWITARMLHVYSVAAAMGRPGAYSLVDHGIK
AMNGALRDKKYGGWYACVNDEGVVDASKQGYQHFFALLGAASAVTTGHPEARKLLDYTIEIIEKYFWSEEEQMCLESWDE
AFSKTEEYRGGNANMHAVEAFLIVYDVTHDKKWLDRAIRVASVIIHDVARNNHYRVNEHFDTQWNPLPDYNKDNPAHRFR
AFGGTPGHWIEWGRLMLHIHAALEARCEQPPAWLLEDAKGLFNATVRDAWAPDGADGIVYTVDWEGKPVVRERVRWPIVE
AMGTAYALYTVTGDRQYETWYQTWWEYCIKYLMDYENGSWWQELDADNKVTTKVWDGKQDIYHLLHCLVIPRIPLAPGMA
PAVAAGLLDINAK
>Q8ZKT7 5.3.1.31~~~yihS~~~Sulfoquinovose isomerase~~~
MKWFNTLSHNRWLEQETDRIFNFGKNAVVPTGFGWLGNKGQIKEEMGTHLWITARMLHVYSVAASMGRPGAYDLVDHGIK
AMNGALRDKKYGGWYACVNDQGVVDASKQGYQHFFALLGAASAVTTGHPEARKLLDYTIEVIEKYFWSEEEQMCLESWDE
AFSQTEDYRGGNANMHAVEAFLIVYDVTHDKKWLDRALRIASVIIHDVARNGDYRVNEHFDSQWNPIRDYNKDNPAHRFR
AYGGTPGHWIEWGRLMLHLHAALEARFETPPAWLLEDAKGLFHATIRDAWAPDGADGFVYSVDWDGKPIVRERVRWPIVE
AMGTAYALYTLTDDSQYEEWYQKWWDYCIKYLMDYENGSWWQELDADNKVTTKVWDGKQDIYHLLHCLVIPRLPLAPGLA
PAVAAGLLDINAK
>P32141 4.1.2.57~~~yihT~~~Sulfofructosephosphate aldolase~~~COG3684
MNKYTINDITRASGGFAMLAVDQREAMRMMFAAAGAPAPVADSVLTDFKVNAAKALSPYASAILVDQQFCYRQVVEQNAI
AKSCAMIVAADEFIPGNGIPVDSVVIDRKINPLQIKQDGGKALKLLVLWRSDEDAQQRLDMVKEFNELCHSHGLVSIIEP
VVRPPRRGDKFDREQAIIDAAKELGDSGADLYKVEMPLYGKGPQQELLCASQRLNDHINMPWVILSSGVDEKLFPRAVRV
AMTAGASGFLAGRAVWASVVGLPDNELMLRDVCAPKLQQLGDIVDEMMAKRR
>Q9L7R9 4.1.2.57~~~yihT~~~Sulfofructosephosphate aldolase~~~
MNNYTIKDITRASGGFAMLAVDQREAMRLMFAAAGAKTPVADSVLTDFKVNAAKILSPYASAVLLDQQFCYRQAVEQNAV
AKSCAMIVAADDFIPGNGIPVDNVVLDKKINAQAVKRDGAKALKLLVLWRSDEDAQQRLNMVKEFNELCHSNGLLSIIEP
VVRPPRCGDKFDREQAIIDAAKELGDSGADLYKVEMPLYGKGARSDLLTASQRLNGHINMPWVILSSGVDEKLFPRAVRV
AMEAGASGFLAGRAVWSSVIGLPDTELMLRDVSAPKLQRLGEIVDEMMAKRR
>P0A9V8 1.1.1.373~~~yihU~~~3-sulfolactaldehyde reductase~~~COG2084
MAAIAFIGLGQMGSPMASNLLQQGHQLRVFDVNAEAVRHLVDKGATPAANPAQAAKDAEFIITMLPNGDLVRNVLFGENG
VCEGLSTDALVIDMSTIHPLQTDKLIADMQAKGFSMMDVPVGRTSANAITGTLLLLAGGTAEQVERATPILMAMGSELIN
AGGPGMGIRVKLINNYMSIALNALSAEAAVLCEALNLPFDVAVKVMSGTAAGKGHFTTSWPNKVLSGDLSPAFMIDLAHK
DLGIALDVANQLHVPMPLGAASREVYSQARAAGRGRQDWSAILEQVRVSAGMTAKVKM
>P32143 2.7.1.184~~~yihV~~~Sulfofructose kinase~~~COG0524
MIRVACVGITVMDRIYYVEGLPTESGKYVARNYTEVGGGPAATAAVAAARLGAQVDFIGRVGDDDTGNSLLAELESWGVN
TRYTKRYNQAKSSQSAIMVDTKGERIIINYPSPDLLPDAEWLEEIDFSQWDVVLADVRWHDGAKKAFTLARQAGVMTVLD
GDITPQDISELVALSDHAAFSEPGLARLTGVKEMASALKQAQTLTNGHVYVTQGSAGCDWLENGGRQHQPAFKVDVVDTT
GAGDVFHGALAVALATSGDLAESVRFASGVAALKCTRPGGRAGIPDCDQTRSFLSLFV
>Q2FUW1 ~~~sraP~~~Serine-rich adhesin for platelets~~~COG5492
MSKRQKAFHDSLANEKTRVRLYKSGKNWVKSGIKEIEMFKIMGLPFISHSLVSQDNQSISKKMTGYGLKTTAVIGGAFTV
NMLHDQQAFAASDAPLTSELNTQSETVGNQNSTTIEASTSTADSTSVTKNSSSVQTSNSDTVSSEKSEKVTSTTNSTSNQ
QEKLTSTSESTSSKNTTSSSDTKSVASTSSTEQPINTSTNQSTASNNTSQSTTPSSVNLNKTSTTSTSTAPVKLRTFSRL
AMSTFASAATTTAVTANTITVNKDNLKQYMTTSGNATYDQSTGIVTLTQDAYSQKGAITLGTRIDSNKSFHFSGKVNLGN
KYEGHGNGGDGIGFAFSPGVLGETGLNGAAVGIGGLSNAFGFKLDTYHNTSKPNSAAKANADPSNVAGGGAFGAFVTTDS
YGVATTYTSSSTADNAAKLNVQPTNNTFQDFDINYNGDTKVMTVKYAGQTWTRNISDWIAKSGTTNFSLSMTASTGGATN
LQQVQFGTFEYTESAVTQVRYVDVTTGKDIIPPKTYSGNVDQVVTIDNQQSALTAKGYNYTSVDSSYASTYNDTNKTVKM
TNAGQSVTYYFTDVKAPTVTVGNQTIEVGKTMNPIVLTTTDNGTGTVTNTVTGLPSGLSYDSATNSIIGTPTKIGQSTVT
VVSTDQANNKSTTTFTINVVDTTAPTVTPIGDQSSEVYSPISPIKIATQDNSGNAVTNTVTGLPSGLTFDSTNNTISGTP
TNIGTSTISIVSTDASGNKTTTTFKYEVTRNSMSDSVSTSGSTQQSQSVSTSKADSQSASTSTSGSIVVSTSASTSKSTS
VSLSDSVSASKSLSTSESNSVSSSTSTSLVNSQSVSSSMSDSASKSTSLSDSISNSSSTEKSESLSTSTSDSLRTSTSLS
DSLSMSTSGSLSKSQSLSTSISGSSSTSASLSDSTSNAISTSTSLSESASTSDSISISNSIANSQSASTSKSDSQSTSIS
LSTSDSKSMSTSESLSDSTSTSGSVSGSLSIAASQSVSTSTSDSMSTSEIVSDSISTSGSLSASDSKSMSVSSSMSTSQS
GSTSESLSDSQSTSDSDSKSLSQSTSQSGSTSTSTSTSASVRTSESQSTSGSMSASQSDSMSISTSFSDSTSDSKSASTA
SSESISQSASTSTSGSVSTSTSLSTSNSERTSTSMSDSTSLSTSESDSISESTSTSDSISEAISASESTFISLSESNSTS
DSESQSASAFLSESLSESTSESTSESVSSSTSESTSLSDSTSESGSTSTSLSNSTSGSTSISTSTSISESTSTFKSESVS
TSLSMSTSTSLSDSTSLSTSLSDSTSDSKSDSLSTSMSTSDSISTSKSDSISTSTSLSGSTSESESDSTSSSESKSDSTS
MSISMSQSTSGSTSTSTSTSLSDSTSTSLSLSASMNQSGVDSNSASQSASNSTSTSTSESDSQSTSSYTSQSTSQSESTS
TSTSLSDSTSISKSTSQSGSVSTSASLSGSESESDSQSISTSASESTSESASTSLSDSTSTSNSGSASTSTSLNNSASAS
ESDLSSTSLSDSTSASMQSSESDSQSTSASLSDSLSTSTSNRMSTIASLSTSVSTSESGSTSESTSESDSTSTSLSDSQS
TSRSTSASGSASTSTSTSDSRSTSASTSTSMRTSTSDSQSMSLSTSTSTSMSDSTSLSDSVSDSTSDSTSASTSGSMSVS
ISLSDSTSTSTSASEVMSASISDSQSMSESVNDSESVSESNSESDSKSMSGSTSVSDSGSLSVSTSLRKSESVSESSSLS
CSQSMSDSVSTSDSSSLSVSTSLRSSESVSESDSLSDSKSTSGSTSTSTSGSLSTSTSLSGSESVSESTSLSDSISMSDS
TSTSDSDSLSGSISLSGSTSLSTSDSLSDSKSLSSSQSMSGSESTSTSVSDSQSSSTSNSQFDSMSISASESDSMSTSDS
SSISGSNSTSTSLSTSDSMSGSVSVSTSTSLSDSISGSTSVSDSSSTSTSTSLSDSMSQSQSTSTSASGSLSTSISTSMS
MSASTSSSQSTSVSTSLSTSDSISDSTSISISGSQSTVESESTSDSTSISDSESLSTSDSDSTSTSTSDSTSGSTSTSIS
ESLSTSGSGSTSVSDSTSMSESNSSSVSMSQDKSDSTSISDSESVSTSTSTSLSTSDSTSTSESLSTSMSGSQSISDSTS
TSMSGSTSTSESNSMHPSDSMSMHHTHSTSTSRLSSEATTSTSESQSTLSATSEVTKHNGTPAQSEKRLPDTGDSIKQNG
LLGGVMTLLVGLGLMKRKKKKDENDQDDSQA
>Q7A362 ~~~sraP~~~Serine-rich adhesin for platelets~~~
MSKRQKAFHDSLANEKTRVRLYKSGKNWVKSGIKEIEMFKIMGLPFISHSLVSQDNQSISKKMTGYGLKTTAVIGGAFTV
NMLHDQQAFAASDAPLTSELNTQSETVGNQNSTTIEASTSTADSTSVTKNSSSVQTSNSDTVSSEKSEKVTSTTNSTSNQ
QEKLTSTSESTSSKNTTSSSDTKSVASTSSTEQPINTSTNQSTASNNTSQSTTPSSVNLNKTSTTSTSTAPVKLRTFSRL
AMSTFASAATTTAVTANTITVNKDNLKQYMTTSGNATYDQSTGIVTLTQDAYSQKGAITLGTRIDSNKSFHFSGKVNLGN
KYEGHGNGGDGIGFAFSPGVLGETGLNGAAVGIGGLSNAFGFKLDTYHNTSKPNSAAKANADPSNVAGGGAFGAFVTTDS
YGVATTYTSSSTADNAAKLNVQPTNNTFQDFDINYNGDTKVMTVKYAGQTWTRNISDWIAKSGTTNFSLSMTASTGGATN
LQQVQFGTFEYTESAVTQVRYVDVTTGKDIIPPKTYSGNVDQVVTIDNQQSALTAKGYNYTSVDSSYASTYNDTNKTVKM
TNAGQSVTYYFTDVKAPTVTVGNQTIEVGKTMNPVVLTTTDNGTGTVTNTVTGLPSGLSYDSATNSIIGTPTKIGQSTVT
VVSTDQANNKSTTTFTINVVDTTAPTVTPIGDQSSEVYSPISPIKIATQDNSGNAVTNTVTGLPSGLTFDSTNNTISGTP
TNIGTSTISIVSTDASGNKTTTTFKYEVTRNSMSDSVSTSGSTQQSQSVSTSKADSQSASTSTSGSIVVSTSASTSKSTS
VSLSDSVSASKSLSTSESNSVSSSTSTSLVNSQSVSSSMSGSVSKSTSLSDSISNSNSTEKSESLSTSTSDSLRTSTSLS
DSLSMSTSGSLSKSQSLSTSISGSSSTSASLSDSTSNAISTSTSLSESASTSDSISISNSIANSQSASTSKSDSQSTSIS
LSTSDSKSMSTSESLSDSTSTSGSVSGSLSIAASQSVSTSTSDSMSTSEIVSDSISTSGSLSASDSKSMSVSSSMSTSQS
GSTSESLSDSQSTSDSDSKSLSLSTSQSGSTSTSTSTSASVRTSESQSTSGSMSASQSDSMSISTSFSDSTSDSKSASTA
SSESISQSASTSTSGSVSTSTSLSTSNSERTSTSVSDSTSLSTSESDSISESTSTSDSISEAISASESTSISLSESNSTS
DSESQSASAFLSESLSESTSESTSESVSSSTSESTSLSDSTSESGSTSTSLSNSTSGSASISTSTSISESTSTFKSESVS
TSLSMSTSTSLSNSTSLSTSLSDSTSDSKSDSLSTSMSTSDSISTSKSDSISTSTSLSGSTSESESDSTSSSESKSDSTS
MSISMSQSTSGSTSTSTSTSLSDSTSTSLSLSASMNQSGVDSNSASQSASNSTSTSTSESDSQSTSTYTSQSTSQSESTS
TSTSLSDSTSISKSTSQSGSTSTSASLSGSESESDSQSISTSASESTSESASTSLSDSTSTSNSGSASTSTSLSNSASAS
ESDSSSTSLSDSTSASMQSSESDSQSTSASLSDSLSTSTSNRMSTIASLSTSVSTSESGSTSESTSESDSTSTSLSDSQS
TSRSTSASGSASTSTSTSDSRSTSASTSTSMRTSTSDSQSMSLSTSTSTSMSDSTSLSDSVSDSTSDSTSASTSGSMSVS
ISLSDSTSTSTSASEVMSASISDSQSMSESVNDSESVSESNSESDSKSMSGSTSVSDSGSLSVSTSLRKSESVSESSSLS
GSQSMSDSVSTSDSSSLSVSTSLRSSESVSESDSLSDSKSTSGSTSTSTSGSLSTSTSLSGSESVSESTSLSDSISMSDS
TSTSDSDSLSGSISLSGSTSLSTSDSLSDSKSLSSSQSMSGSESTSTSVSDSQSSSTSNSQFDSMSISASESDSMSTSDS
SNISGSNSTSTSLSTSDSMSGSVSVSTSTSLSDSISGSTSVSDSSSTSTSTSLSDSMSQSQSTSTSASGSLSTSISTSMS
MSASTSSSQSTSVSTSLSTSDSISDSTSISISGSQSTVESESTSDSTSISDSESLSTSDSDSTSTSTSDSTSGSTSTSIS
ESLSTSGSGSTSVSDSTSMSESDSTSVSMSQDKSDSTSISDSESVSTSTSTSLSTSDSTSTSESLSTSMSGSQSISDSTS
TSMSGSTSTSESNSMHPSDSMSMHHTHSTSTSRLSSEATTSTSESQSTLSATSEVTKHNGTPAQSEKRLPDTGDSIKQNG
LLGGVMTLLVGLGLMKRKKKKDENDQDDSQA
>Q8NUJ3 ~~~sraP~~~Serine-rich adhesin for platelets~~~
MSKRQKEFHDSLANEKTRVRLYKSGKNWVKSGIKEIEMFKIMGLPFISHSLVSQDNQSISKKMTGYGLKTTAVIGGAFTV
NMLHDQQAFAASDAPLTSELNTQSETVGNQNSTTIEASTSTADSTSVTKNSSSVQTSNSDTVSSEKSEKVTSTTNSTSNQ
QEKLTSTSESTSSKNTTSSSDTKSVASTSSTEQPINTSTNQSTASNNTSQSTTPSSVNLNKTSTTSTSTAPVKLRTFSRL
AMSTFASAATTTAVTANTITVNKDNLKQYMTTSGNATYDQSTGIVTLTQDAYSQKGAITLGTRIDSNKSFHFSGKVNLGN
KYEGNGNGGDGIGFAFSPGVLGETGLNGAAVGIGGLSNAFGFKLDTYHNTSKPNSAAKANADPSNVAGGGAFGAFVTTDS
YGVATTYTSSSTADNAAKLKVQPTNNTFQDFDINYNGDTKVMTVTYAGQTWTRNISDWIAKSGTTNFSLSMTASTGGATN
LQQVQFGTFEYTESAVTQVRYVDVTTGKDIIPPKTYSGNVDQVVTIDNQQSALTAKGYNYTSVDSSYASTYNDTNKTVKM
TNAGQSVTYYFTDVKAPTVTVGNQTIEVGKTMNPVVLTTTDNGTGTVTNTVTGLPSGLSYDSATNSIIGTPTKIGQSTVT
VVSTDQANNKSTTTFTINVVDTTAPTVTPIGDQSSEVYSPISPIKIATQDNSGNAVTNTVTGLPSGLTFDSTNNTISGTP
TNIGTSTITIVSTDASGNKTTTTFKYEVTRNSMSDSVSTSGSTQQSQSVSTSKADSQSASTSTSGSIVVSTSASTSKSTS
VSLSDSVSASKSLSTSESNSVSSSTSTSLVNSQSVSSSMSDSASKSTSLSDSISNSSSTEKSESLSTSTSDSLRTSTSLS
DSLSMSTSGSLSKSQSLSTSTSESSSTSASLSDSTSNAISTSESLSESASTSDSISISNSIANSQSASTSKSDSQSTSIS
LSTSDSKSMSTSESLSDSTSTSGSVSGSLSIAASQSVSTSTSDSMSTSEIVSDSISTSGSLSASDSKSMSVSSSMSTSQS
GSTSESLSDSQSTSDSDSKSLSLSTSQSGSTSTSTSTSASVRTSESQSTSGSMSASQSDSMSISTSFSDSTSDSKSASTA
SSESISQSASTSTSGSVSTSTSLSTSNSERTSTSMSDSTSLSTSESDSISESTSTSDSISEAISASESTFISLSESNSTS
DSESQSASAFLSESLSESTSESTSESVSSSTSESTSLSDSTSESGSTSTSLSNSTSGSASISTSTSISESTSTFKSESVS
TSLSMSTSTSLSDSTSLSTSLSDSTSDSKSDSLSTSMSTSDSISTSKSDSISTSTSLSGSTSESESDSTSSSESKSDSTS
MSISMSQSTSGSTSTSTSTSLSDSTSTSLSLSASMNQSGVDSNSASQSASNSTSTSTSESDSQSTSSYTSQSTSQSESTS
TSTSLSDSTSISKSTSQSGSVSTSASLSGSESESDSQSISTSASESTSESASTSLSDSTSTSNSGSASTSTSLSNSASAS
ESDSSSTSLSDSTSASMQSSESDSQSTSASLSDSLSTSTSNRMSTIASLSTSVSTSESGSTSESTSESDSTSTSLSDSQS
TSRSTSASGSASTSTSTSDSRSTSASTSTSMRTSTSDSQSMSLSTSTSTSMSDSTSLSDSVSDSTSDSTSASTSGSMSVS
ISLSDSTSTSTSASEVMSASISDSQSMSESVNDSESVSESNSESDSKSMSGSTSVSDSGSLSVSTSLRKSESVSESSSLS
GSQSMSDSVSTSDSSSLSVSTSLRSSESVSESDSLSDSKSTSGSTSTSTSGSLSTSTSLSGSESVSESTSLSDSISMSDS
TSTSDSDSLSGSISLSGSTSLSTSDSLSDSKSLSSSQSMSGSESTSTSVSDSQSSSTSNSQFDSMSISASESDSMSTSDS
SSISGSNSTSTSLSTSDSMSGSVSVSTSTSLSDSISGSTSLSDSSSTSTSTSLSDSMSQSQSTSTSASGSLSTSISTSMS
MSASTSSSQSTSVSTSLSTSDSISDSTSISISGSQSTVESESTSDSTSISDSESLSTSDSDSTSTSTSDSTSGSTSTSIS
ESLSTSGSGSTSVSDSTSMSESDSTSVSMSQSMSGSTYNSTSVSDSESVSTSTSTSLSTSDSTSTSESLSTSMSGSQSIS
DSTSTSMSGSTSTSESNSMHPSDSMSMHHTHSTSTSRLSSEATTSTSESQSTLSATSEVTKHNGTPAQSEKRLPDTGDSI
KQNGLLGGVMTLLVGLGLMKRKKKKDENDQDDSQA
>P68191 ~~~sra~~~Stationary-phase-induced ribosome-associated protein~~~
MKSNRQARHILGLDHKISNQRKIVTEGDKSSVVNNPTGRKRPAEK
>P27206 ~~~srfAA~~~Surfactin synthase subunit 1~~~COG1020
MEITFYPLTDAQKRIWYTEKFYPHTSISNLAGIGKLVSADAIDYVLVEQAIQEFIRRNDAMRLRLRLDENGEPVQYISEY
RPVDIKHTDTTEDPNAIEFISQWSREETKKPLPLYDCDLFRFSLFTIKENEVWFYANVHHVISDGISMNILGNAIMHIYL
ELASGSETKEGISHSFIDHVLSEQEYAQSKRFEKDKAFWNKQFESVPELVSLKRNASAGGSLDAERFSKDVPEALHQQIL
SFCEANKVSVLSVFQSLLAAYLYRVSGQNDVVTGTFMGNRTNAKEKQMLGMFVSTVPLRTNIDGGQAFSEFVKDRMKDLM
KTLRHQKYPYNLLINDLRETKSSLTKLFTVSLEYQVMQWQKEEDLAFLTEPIFSGSGLNDVSIHVKDRWDTGKLTIDFDY
RTDLFSREEINMICERMITMLENALTHPEHTIDELTLISDAEKEKLLARAGGKSVSYRKDMTIPELFQEKAELLSDHPAV
VFEDRTLSYRTLHEQSARIANVLKQKGVGPDSPVAVLIERSERMITAIMGILKAGGAYVPIDPGFPAERIQYILEDCGAD
FILTESKVAAPEADAELIDLDQAIEEGAEESLNADVNARNLAYIIYTSGTTGRPKGVMIEHRQVHHLVESLQQTIYQSGS
QTLRMALLAPFHFDASVKQIFASLLLGQTLYIVPKKTVTNGAALTAYYRKNSIEATDGTPAHLQMLAAAGDFEGLKLKHM
LIGGEGLSSVVADKLLKLFKEAGTAPRLTNVYGPTETCVDASVHPVIPENAVQSAYVPIGKALGNNRLYILDQKGRLQPE
GVAGELYIAGDGVGRGYLHLPELTEEKFLQDPFVPGDRMYRTGDVVRWLPDGTIEYLGREDDQVKVRGYRIELGEIEAVI
QQAPDVAKAVVLARPDEQGNLEVCAYVVQKPGSEFAPAGLREHAARQLPDYMVPAYFTEVTEIPLTPSGKVDRRKLFALE
VKAVSGTAYTAPRNETEKAIAAIWQDVLNVEKAGIFDNFFETGGHSLKAMTLLTKIHKETGIEIPLQFLFEHPTITALAE
EADHRESKAFAVIEPAEKQEHYPLSLAQQRTYIVSQFEDAGVGYNMPAAAILEGPLDIQKLERAFQGLIRRHESLRTSFV
LENSTPRQKIHDSVDFNIEMIERGGRSDEAIMASFVRTFDLAKAPLFRIGLLGLEENRHMLLFDMHHLISDGVSIGIMLE
ELARIYKGEQLPDLRLQYKDYAVWQSRQAAEGYKKDQAYWKEVFAGELPVLQLLSDYPRPPVQSFEGDRVSIKLDAGVKD
RLNRLAEQNGATLYMVMLSAYYTLLSKYTGQDDIIVGTPSAGRNHSDTEGIIGMFVNTLAIRSEVKQNETFTQLISRVRK
RVLDAFSHQDYPFEWLVEDLNIPRDVSRHPLFDTMFSLQNATEGIPAVGDLSLSVQETNFKIAKFDLTVQARETDEGIEI
DVDYSTKLFKQSTADRLLTHFARLLEDAAADPEKPISEYKLLSEEEAASQIQQFNPGRTPYPKDKTIVQLFEEQAANTPD
HTALQYEGESLTYRELNERANRLARGILSLGAGEGRTAAVLCERSMDMIVSILAVLKSGSAYVPIDPEHPIQRMQHFFRD
SGAKVLLTQRKLKALAEEAEFKGVIVLADEEESYHADARNLALPLDSAAMANLTYTSGTTGTPKGNIVTHANILRTVKET
NYLSITEQDTILGLSNYVFDAFMFDMFGSLLNGAKLVLIPKETVLDMARLSRVIERENISILMITTALFHLLVDLNPACL
STLRKIMFGGERASVEHVRKALQTVGKGKLLHMYGPSESTVFATYHPVDELEEHTLSVPIGKPVSNTEVYILDRTGHVQP
AGIAGELCVSGEGLVKGYYNRPELTEEKFVPHPFTSGERMYKTGDLARWLPNGDIEFIGRIDHQVKIRGQRIELGEIEHQ
LQTHDRVQESVVLAVDQGAGDKLLCAYYVGEGDISSQEMREHAAKDLPAYMVPAVFIQMDELPLTGNGKIDRRALPIPDA
NVSRGVSYVAPRNGTEQKVADIWAQVLQAEQVGAYDHFFDIGGHSLAGMKMLALVHQELGVELSLKDLFQSPTVEGLAQV
IASAEKGTAASISPAEKQDTYPVSSPQKRMYVLQQLEDAQTSYNMPAVLRLTGELDVERLNSVMQQLMQRHEALRTTFEI
KDGETVQRIWEEAECEIAYFEAPEEETERIVSEFIKPFKIDQLPLFRIGLIKHSDTEHVLLFDMHHIISDGASVGVLIEE
LSKLYDGETLEPLRIQYKDYAVWQQQFIQSELYKKQEEHWLKELDGELPVLTLPTDYSRPAVQTFEGDRIAFSLEAGKAD
ALRRLAKETDSTLYMVLLASYSAFLSKISGQDDIIVGSPVAGRSQADVSRVIGMFVNTLALRTYPKGEKTFADYLNEVKE
TALSAFDAQDYPLEDLIGNVQVQRDTSRNPLFDAVFSMQNANIKDLTMKGIQLEPHPFERKTAKFDLTLTADETDGGLTF
VLEYNTALFKQETIERWKQYWMELLDAVTGNPNQPLSSLSLVTETEKQALLEAWKGKALPVPTDKTVHQLFEETAQRHKD
RPAVTYNGQSWTYGELNAKANRLARILMDCGISPDDRVGVLTKPSLEMSAAVLGVLKAGAAFVPIDPDYPDQRIEYILQD
SGAKLLLKQEGISVPDSYTGDVILLDGSRTILSLPLDENDEENPETAVTAENLAYMIYTSGTTGQPKGVMVEHHALVNLC
FWHHDAFSMTAEDRSAKYAGFGFDASIWEMFPTWTIGAELHVIEEAIRLDIVRLNDYFETNGVTITFLPTQLAEQFMELE
NTSLRVLLTGGDKLKRAVKKPYTLVNNYGPTENTVVATSAEIHPEEGSLSIGRAIANTRVYILGEGNQVQPEGVAGELCV
AGRGLARGYLNREDETAKRFVADPFVPGERMYRTGDLVKWTGGGIEYIGRIDQQVKVRGYRIELSEIEVQLAQLSEVQDA
AVTAVKDKGGNTAIAAYVTPESADIEALKSALKETLPDYMIPAFWVTLNELPVTANGKVDRKALPEPDIEAGSGEYKAPT
TDMEELLAGIWQDVLGMSEVGVTDNFFSLGGDSIKGIQMASRLNQHGWKLEMKDLFQHPTIEELTQYVERAEGKQADQGP
VEGEVILTPIQRWFFEKNFTNKHHWNQSVMLHAKKGFDPERVEKTLQALIEHHDALRMVYREGQEDVIQYNRGLEAASAQ
LEVIQIEGQAADYEDRIEREAERLQSSIDLQEGGLLKAGLFQAEDGDHLLLAIHHLVVDGVSWRILLEDFAAVYTQLEQG
NEPVLPQKTHSFAEYAERLQDFANSKAFLKEKEYWRQLEEQAVAAKLPKDRESGDQRMKHTKTIEFSLTAEETEQLTTKV
HEAYHTEMNDILLTAFGLAMKEWTGQDRVSVHLEGHGREEIIEDLTISRTVGWFTSMYPMVLDMKHADDLGYQLKQMKED
IRHVPNKGVGYGILRYLTAPEHKEDVAFSIQPDVSFNYLGQFDEMSDAGLFTRSELPSGQSLSPETEKPNALDVVGYIEN
GKLTMSLAYHSLEFHEKTVQTFSDSFKAHLLRIIEHCLSQDGTELTPSDLGDDDLTLDELDKLMEIF
>Q04747 ~~~srfAB~~~Surfactin synthase subunit 2~~~COG1020
MSKKSIQKVYALTPMQEGMLYHAMLDPHSSSYFTQLELGIHGAFDLEIFEKSVNELIRSYDILRTVFVHQQLQKPRQVVL
AERKTKVHYEDISHADENRQKEHIERYKQDVQRQGFNLAKDILFKVAVFRLAADQLYLVWSNHHIMMDGWSMGVLMKSLF
QNYEALRAGRTPANGQGKPYSDYIKWLGKQDNEEAESYWSERLAGFEQPSVLPGRLPVKKDEYVNKEYSFTWDETLVARI
QQTANLHQVTGPNLFQAVWGIVLSKYNFTDDVIFGTVVSGRPSEINGIETMAGLFINTIPVRVKVERDAAFADIFTAVQQ
HAVEAERYDYVPLYEIQKRSALDGNLLNHLVAFENYPLDQELENGSMEDRLGFSIKVESAFEQTSFDFNLIVYPGKTWTV
KIKYNGAAFDSAFIERTAEHLTRMMEAAVDQPAAFVREYGLVGDEEQRQIVEVFNSTKAELPEGMAVHQVFEEQAKRTPA
STAVVYEGTKLTYRELNAAANRLARKLVEHGLQKGETAAIMNDRSVETVVGMLAVLKAGAAYVPLDPALPGDRLRFMAED
SSVRMVLIGNSYTGQAHQLQVPVLTLDIGFEESEAADNLNLPSAPSDLAYIMYTSGSTGKPKGVMIEHKSILRLVKNAGY
VPVTEEDRMAQTGAVSFDAGTFEVFGALLNGAALYPVKKETLLDAKQFAAFLREQSITTMWLTSPLFNQLAAKDAGMFGT
LRHLIIGGDALVPHIVSKVKQASPSLSLWNGYGPTENTTFSTSFLIDREYGGSIPIGKPIGNSTAYIMDEQQCLQPIGAP
GELCVGGIGVARGYVNLPELTEKQFLEDPFRPGERIYRTGDLARWLPDGNIEFLGRIDNQVKVRGFRIELGEIETKLNMA
EHVTEAAVIIRKNKADENEICAYFTADREVAVSELRKTLSQSLPDYMVPAHLIQMDSLPLTPNGKINKKELPAPQSEAVQ
PEYAAPKTESEKKLAEIWEGILGVKAGVTDNFFMIGGHSLKAMMMTAKIQEHFHKEVPIKVLFEKPTIQELALYLEENES
KEEQTFEPIRQASYQQHYPVSPAQRRMYILNQLGQANTSYNVPAVLLLEGEVDKDRLENAIQQLINRHEILRTSFDMIDG
EVVQTVHKNISFQLEAAKGREEDAEEIIKAFVQPFELNRAPLVRSKLVQLEEKRHLLLIDMHHIITDGSSTGILIGDLAK
IYQGADLELPQIHYKDYAVWHKEQTNYQKDEEYWLDVFKGELPILDLPADFERPAERSFAGERVMFGLDKQITAQIKSLM
AETDTTMYMFLLAAFNVLLSKYASQDDIIVGSPTAGRTHPDLQGVPGMFVNTVALRTAPAGDKTFAQFLEEVKTASLQAF
EHQSYPLEELIEKLPLTRDTSRSPLFSVMFNMQNMEIPSLRLGDLKISSYSMLHHVAKFDLSLEAVEREEDIGLSFDYAT
ALFKDETIRRWSRHFVNIIKAAAANPNVRLSDVDLLSSAETAALLEERHMTQITEATFAALFEKQAQQTPDHSAVKAGGN
LLTYRELDEQANQLAHHLRAQGAGNEDIVAIVMDRSAEVMVSILGVMKAGAAFLPIDPDTPEERIRYSLEDSGAKFAVVN
ERNMTAIGQYEGIIVSLDDGKWRNESKERPSSISGSRNLAYVIYTSGTTGKPKGVQIEHRNLTNYVSWFSEEAGLTENDK
TVLLSSYAFDLGYTSMFPVLLGGGELHIVQKETYTAPDEIAHYIKEHGITYIKLTPSLFHTIVNTASFAKDANFESLRLI
VLGGEKIIPTDVIAFRKMYGHTEFINHYGPTEATIGAIAGRVDLYEPDAFAKRPTIGRPIANAGALVLNEALKLVPPGAS
GQLYITGQGLARGYLNRPQLTAERFVENPYSPGSLMYKTGDVVRRLSDGTLAFIGRADDQVKIRGYRIEPKEIETVMLSL
SGIQEAVVLAVSEGGLQELCAYYTSDQDIEKAELRYQLSLTLPSHMIPAFFVQVDAIPLTANGKTDRNALPKPNAAQSGG
KALAAPETALEESLCRIWQKTLGIEAIGIDDNFFDLGGHSLKGMMLIANIQAELEKSVPLKALFEQPTVRQLAAYMEASA
VSGGHQVLKPADKQDMYPLSSAQKRMYVLNQLDRQTISYNMPSVLLMEGELDISRLRDSLNQLVNRHESLRTSFMEANGE
PVQRIIEKAEVDLHVFEAKEDEADQKIKEFIRPFDLNDAPLIRAALLRIEAKKHLLLLDMHHIIADGVSRGIFVKELALL
YKGEQLPEPTLHYKDFAVWQNEAEQKERMKEHEAYWMSVLSGELPELDLPLDYARPPVQSFKGDTIRFRTGSETAKAVEK
LLAETGTTLHMVLHAVFHVFLSKISGQRDIVIGSVTAGRTNADVQDMPGMFVNTLALRMEAKEQQTFAELLELAKQTNLS
ALEHQEYPFEDLVNQLDLPRDMSRNPLFNVMVTTENPDKEQLTLQNLSISPYEAHQGTSKFDLTLGGFTDENGIGLQLEY
ATDLFAKETAEKWSEYVLRLLKAVADNPNQPLSSLLLVTETEKQALLEAWKGKALPVPTDKTVHQLFEETVQRHKDRPAV
TYNGQSWTYGELNAKANRLARILMDCGISPDDRVGVLTKPSLEMSAAVLGVLKAGAAFVPIDPDYPDQRIEYILQDSGAK
LLLKQEGISVPDSYTGDVILLDGSRTILSLPLDENDEGNPETAVTAENLAYMIYTSGTTGQPKGVMVEHHALVNLCFWHH
DAFSMTAEDRSAKYAGFGFDASIWEMFPTWTIGAELHVIDEAIRLDIVRLNDYFETNGVTITFLPTQLAEQFMELENTSL
RVLLTGGDKLKRAVKKPYTLVNNYGPTENTVVATSAEIHPEEGSLSIGRAIANTRVYILGEGNQVQPEGVAGELCVAGRG
LARGYLNREDETAKRFVADPFVPGERMYRTGDLVKWVNGGIEYIGRIDQQVKVRGYRIELSEIEVQLAQLSEVQDAAVTA
VKDKGGNTAIAAYVTPETADIEALKSTLKETLPDYMIPAFWVTLNELPVTANGKVDRKALPEPDIEAGSGEYKAPTTDME
ELLAGIWQDVLGMSEVGVTDNFFSLGGDSIKGIQMASRLNQHGWKLEMKDLFQHPTIEELTQYVERAEGKQADQGPVEGE
VILTPIQRWFFEKNFTNKHHWNQSVMLHAKKGFDPERVEKTLQALIEHHDALRMVYREENGDIVQVYKPIGESKVSFEIV
DLYGSDEEMLRSQIKLLANKLQSSLDLRNGPLLKAEQYRTEAGDHLLIAVHHLVVDGVSWRILLEDFASGYMQAEKEESL
VFPQKTNSFKDWAEELAAFSQSAHLLQQAEYWSQIAAEQVSPLPKDCETEQRIVKDTSSVLCELTAEDTKHLLTDVHQPY
GTEINDILLSALGLTMKEWTKGAKIGINLEGHGREDIIPNVNISRTVGWFTAQYPVVLDISDADASAVIKTVKENLRRIP
DKGVGYGILRYFTETAETKGFTPEISFNYLGQFDSEVKTDFFEPSAFDMGRQVSGESEALYALSFSGMIRNGRFVLSCSY
NEKEFERATVEEQMERFKENLLMLIRHCTEKEDKEFTPSDFSAEDLEMDEMGDIFDMLEENLK
>Q08787 ~~~srfAC~~~Surfactin synthase subunit 3~~~COG1020
MSQFSKDQVQDMYYLSPMQEGMLFHAILNPGQSFYLEQITMKVKGSLNIKCLEESMNVIMDRYDVFRTVFIHEKVKRPVQ
VVLKKRQFHIEEIDLTHLTGSEQTAKINEYKEQDKIRGFDLTRDIPMRAAIFKKAEESFEWVWSYHHIILDGWCFGIVVQ
DLFKVYNALREQKPYSLPPVKPYKDYIKWLEKQDKQASLRYWREYLEGFEGQTTFAEQRKKQKDGYEPKELLFSLSEAET
KAFTELAKSQHTTLSTALQAVWSVLISRYQQSGDLAFGTVVSGRPAEIKGVEHMVGLFINVVPRRVKLSEGITFNGLLKR
LQEQSLQSEPHQYVPLYDIQSQADQPKLIDHIIVFENYPLQDAKNEESSENGFDMVDVHVFEKSNYDLNLMASPGDEMLI
KLAYNENVFDEAFILRLKSQLLTAIQQLIQNPDQPVSTINLVDDREREFLLTGLNPPAQAHETKPLTYWFKEAVNANPDA
PALTYSGQTLSYRELDEEANRIARRLQKHGAGKGSVVALYTKRSLELVIGILGVLKAGAAYLPVDPKLPEDRISYMLADS
AAACLLTHQEMKEQAAELPYTGTTLFIDDQTRFEEQASDPATAIDPNDPAYIMYTSGTTGKPKGNITTHANIQGLVKHVD
YMAFSDQDTFLSVSNYAFDAFTFDFYASMLNAARLIIADEHTLLDTERLTDLILQENVNVMFATTALFNLLTDAGEDWMK
GLRCILFGGERASVPHVRKALRIMGPGKLINCYGPTEGTVFATAHVVHDLPDSISSLPIGKPISNASVYILNEQSQLQPF
GAVGELCISGMGVSKGYVNRADLTKEKFIENPFKPGETLYRTGDLARWLPDGTIEYAGRIDDQVKIRGHRIELEEIEKQL
QEYPGVKDAVVVADRHESGDASINAYLVNRTQLSAEDVKAHLKKQLPAYMVPQTFTFLDELPLTTNGKVNKRLLPKPDQD
QLAEEWIGPRNEMEETIAQIWSEVLGRKQIGIHDDFFALGGHSLKAMTAASRIKKELGIDLPVKLLFEAPTIAGISAYLK
NGGSDGLQDVTIMNQDQEQIIFAFPPVLGYGLMYQNLSSRLPSYKLCAFDFIEEEDRLDRYADLIQKLQPEGPLTLFGYS
AGCSLAFEAAKKLEEQGRIVQRIIMVDSYKKQGVSDLDGRTVESDVEALMNVNRDNEALNSEAVKHGLKQKTHAFYSYYV
NLISTGQVKADIDLLTSGADFDMPEWLASWEEATTGVYRVKRGFGTHAEMLQGETLDRNAEILLEFLNTQTVTVS
>Q08788 3.1.2.-~~~srfAD~~~Surfactin synthase thioesterase subunit~~~COG3208
MSQLFKSFDASEKTQLICFPFAGGYSASFRPLHAFLQGECEMLAAEPPGHGTNQTSAIEDLEELTDLYKQELNLRPDRPF
VLFGHSMGGMITFRLAQKLEREGIFPQAVIISAIQPPHIQRKKVSHLPDDQFLDHIIQLGGMPAELVENKEVMSFFLPSF
RSDYRALEQFELYDLAQIQSPVHVFNGLDDKKCIRDAEGWKKWAKDITFHQFDGGHMFLLSQTEEVAERIFAILNQHPII
QP
>P0C0K3 2.7.11.1~~~srkA~~~Stress response kinase A~~~COG2334
MNNSAFTFQTLHPDTIMDALFEHGIRVDSGLTPLNSYENRVYQFQDEDRRRFVVKFYRPERWTADQILEEHQFALQLVND
EVPVAAPVAFNGQTLLNHQGFYFAVFPSVGGRQFEADNIDQMEAVGRYLGRMHQTGRKQLFIHRPTIGLNEYLIEPRKLF
EDATLIPSGLKAAFLKATDELIAAVTAHWREDFTVLRLHGDCHAGNILWRDGPMFVDLDDARNGPAVQDLWMLLNGDKAE
QRMQLETIIEAYEEFSEFDTAEIGLIEPLRAMRLVYYLAWLMRRWADPAFPKNFPWLTGEDYWLRQTATFIEQAKVLQEP
PLQLTPMY
>Q83IV7 2.7.11.1~~~srkA~~~Stress response kinase A~~~
MNNSAFTFQTLHPDTIMDALFKQGIRVDSGLTPLNSYENRVYQFQDEERRRFVVKFYRPERWTADQILEEHQFALQLVND
EVPVAAPVAFNGQTLLNHQGFYFAVFPSVGGRQFEADNIDQMEAVGRYLGRMHQTGRKQLFIHRPTIGLNEYLIEPRKLF
EDATLIPSGLKAAFLKATDELIAAVTAHWREDFTVLRLHGDCHAGNILWRDGPMFVDLDDARNGPAIQDLWMLLNGDKAQ
QRMQLETIIEAYEEFSEFDTAEIGLIEPLRAMRLVYYLAWLMRHWADPAFPKNFPWLTGEDYWLRQTATFIEQAKVLQEP
PLQLTPMY
>P05707 1.1.1.140~~~srlD~~~Sorbitol-6-phosphate 2-dehydrogenase~~~COG1028
MNQVAVVIGGGQTLGAFLCHGLAAEGYRVAVVDIQSDKAANVAQEINAEYGESMAYGFGADATSEQSVLALSRGVDEIFG
RVDLLVYSAGIAKAAFISDFQLGDFDRSLQVNLVGYFLCAREFSRLMIRDGIQGRIIQINSKSGKVGSKHNSGYSAAKFG
GVGLTQSLALDLAEYGITVHSLMLGNLLKSPMFQSLLPQYATKLGIKPDQVEQYYIDKVPLKRGCDYQDVLNMLLFYASP
KASYCTGQSINVTGGQVMF
>P15082 ~~~srlR~~~Glucitol operon repressor~~~COG1349
MKPRQRQAAILEYLQKQGKCSVEELAQYFDTTGTTIRKDLVILEHAGTVIRTYGGVVLNKEESDPPIDHKTLINTHKKEL
IAEAAVSFIHDGDSIILDAGSTVLQMVPLLSRFNNITVMTNSLHIVNALSELDNEQTILMPGGTFRKKSASFHGQLAENA
FEHFTFDKLFMGTDGIDLNAGVTTFNEVYTVSKAMCNAAREVILMADSSKFGRKSPNVVCSLESVDKLITDAGIDPAFRQ
ALEEKGIDVIITGESNE
>P21507 3.6.4.13~~~srmB~~~ATP-dependent RNA helicase SrmB~~~COG0513
MTVTTFSELELDESLLEALQDKGFTRPTAIQAAAIPPALDGRDVLGSAPTGTGKTAAYLLPALQHLLDFPRKKSGPPRIL
ILTPTRELAMQVSDHARELAKHTHLDIATITGGVAYMNHAEVFSENQDIVVATTGRLLQYIKEENFDCRAVETLILDEAD
RMLDMGFAQDIEHIAGETRWRKQTLLFSATLEGDAIQDFAERLLEDPVEVSANPSTRERKKIHQWYYRADDLEHKTALLV
HLLKQPEATRSIVFVRKRERVHELANWLREAGINNCYLEGEMVQGKRNEAIKRLTEGRVNVLVATDVAARGIDIPDVSHV
FNFDMPRSGDTYLHRIGRTARAGRKGTAISLVEAHDHLLLGKVGRYIEEPIKARVIDELRPKTRAPSEKQTGKPSKKVLA
KRAEKKKAKEKEKPRVKKRHRDTKNIGKRRKPSGTGVPPQTTEE
>P37105 3.6.5.4~~~ffh~~~Signal recognition particle protein~~~COG0541
MAFEGLADRLQQTISKIRGKGKVSEQDVKEMMREVRLALLEADVNFKVVKDFVKKVSERAVGQDVMKSLTPGQQVIKVVQ
EELTELMGGEESKIAVAKRPPTVIMMVGLQGAGKTTTSGKLANLLRKKHNRKPMLVAADIYRPAAIKQLETLGKQLDMPV
FSLGDQVSPVEIAKQAIEKAKEEHYDYVILDTAGRLHIDHELMDELTNVKEIANPEEIFLVVDSMTGQDAVNVAKSFNEQ
LGLTGVVLTKLDGDTRGGAALSIRAVTNTPIKFAGLGEKLDALEPFHPERMASRILGMGDVLTLIEKAQASVDEDKAKEL
EQKMRTMSFTLDDFLEQLGQVRNMGPLDELLQMMPGAGKMKGLKNIQVDEKQLNHVEAIIKSMTVLEKEQPDIINASRRK
RIAKGSGTSVQEVNRLLKQFDEMKKMMKQMTNMSKGKKKGFKLPFM
>P0AGD7 3.6.5.4~~~ffh~~~Signal recognition particle protein~~~COG0541
MFDNLTDRLSRTLRNISGRGRLTEDNVKDTLREVRMALLEADVALPVVREFINRVKEKAVGHEVNKSLTPGQEFVKIVRN
ELVAAMGEENQTLNLAAQPPAVVLMAGLQGAGKTTSVGKLGKFLREKHKKKVLVVSADVYRPAAIKQLETLAEQVGVDFF
PSDVGQKPVDIVNAALKEAKLKFYDVLLVDTAGRLHVDEAMMDEIKQVHASINPVETLFVVDAMTGQDAANTAKAFNEAL
PLTGVVLTKVDGDARGGAALSIRHITGKPIKFLGVGEKTEALEPFHPDRIASRILGMGDVLSLIEDIESKVDRAQAEKLA
SKLKKGDGFDLNDFLEQLRQMKNMGGMASLMGKLPGMGQIPDNVKSQMDDKVLVRMEAIINSMTMKERAKPEIIKGSRKR
RIAAGCGMQVQDVNRLLKQFDDMQRMMKKMKKGGMAKMMRSMKGMMPPGFPGR
>Q01442 3.6.5.4~~~ffh~~~Signal recognition particle protein~~~
MGFGDFLSKRMQKSIEKNMKNSTLNEENIKETLKEIRLSLLEADVNIEAAKEIINNVKQKALGGYISEGASAHQQMIKIV
HEELVNILGKENAPLDINKKPSVVMMVGLQGSGKTTTANKLAYLLNKKNKKKVLLVGLDIYRPGAIEQLVQLGQKTNTQV
FEKGKQDPVKTAEQALEYAKENNFDVVILDTAGRLQVDQVLMKELDNLKKKTSPNEILLVVDGMSGQEIINVTNEFNDKL
KLSGVVVTKLDGDARGGATLSISYLTKLPIKFIGEGEGYNALAAFYPKRMADRLMGMGDIETLFERAVENIDERSIQKTM
NRMFLGQFDLEDLRNQLAQIAKMGSLNKLMKMLPINKVSESQIQEAQRKLAVFSILMDSMTLKERRDPRVLKAISRKNRI
IKGSGRSEKEFNELINSFEKGKKQVLEITKMIKSGRMPNLSKGGFKF
>P9WGD7 3.6.5.4~~~ffh~~~Signal recognition particle protein~~~COG0541
MFESLSDRLTAALQGLRGKGRLTDADIDATTREIRLALLEADVSLPVVRAFIHRIKERARGAEVSSALNPAQQVVKIVNE
ELISILGGETRELAFAKTPPTVVMLAGLQGSGKTTLAGKLAARLRGQGHTPLLVACDLQRPAAVNQLQVVGERAGVPVFA
PHPGASPESGPGDPVAVAAAGLAEARAKHFDVVIVDTAGRLGIDEELMAQAAAIRDAINPDEVLFVLDAMIGQDAVTTAA
AFGEGVGFTGVALTKLDGDARGGAALSVREVTGVPILFASTGEKLEDFDVFHPDRMASRILGMGDVLSLIEQAEQVFDAQ
QAEEAAAKIGAGELTLEDFLEQMLAVRKMGPIGNLLGMLPGAAQMKDALAEVDDKQLDRVQAIIRGMTPQERADPKIINA
SRRLRIANGSGVTVSEVNQLVERFFEARKMMSSMLGGMGIPGIGRKSATRKSKGAKGKSGKKSKKGTRGPTPPKVKSPFG
VPGMPGLAGLPGGLPDLSQMPKGLDELPPGLADFDLSKLKFPGKK
>O07347 3.6.5.4~~~ffh~~~Signal recognition particle protein~~~
MFQQLSARLQEAIGRLRGRGRITEEDLKATLREIRRALMDADVNLEVARDFVERVREEALGKQVLESLTPAEVILATVYE
ALKEALGGEARLPVLKDRNLWFLVGLQGSGKTTTAAKLALYYKGKGRRPLLVAADTQRPAAREQLRLLGEKVGVPVLEVM
DGESPESIRRRVEEKARLEARDLILVDTAGRLQIDEPLMGELARLKEVLGPDEVLLVLDAMTGQEALSVARAFDEKVGVT
GLVLTKLDGDARGGAALSARHVTGKPIYFAGVSEKPEGLEPFYPERLAGRILGMGDVASLAEKVRAAGLEAEAPKSAKEL
SLEDFLKQMQNLKRLGPFSEILGLLPGVPQGLKVDEKAIKRLEAIVLSMTPEERKDPRILNGSRRKRIAKGSGTSVQEVN
RFIKAFEEMKALMKSLEKKKGRGLMGMFRR
>O31099 ~~~srpA~~~Solvent efflux pump periplasmic linker SrpA~~~
MRQIRSPRALRVIPLTALMLISGCGEKEQVSSATPPPDVGVYTVRAQALTLTTDLPGRTSAFRVAEVRPQVSGILQKRSF
VEGAEVKLGQQLYQIDPRTYEAQLRRAEANRTSAQNLARRYETLLKTKAVSKQQYDDALAAWKQAEADYQVARIDVQYTR
VLSPISGRIGRSTVTEGALVTNGQAQSLATVTQLDPIYVDVTQPITKLLGLQKALESGRLQKTGENQAEVSLTLDDGSAY
PLPGTLKFSEVSVDPTTGSVTLRAEFPNPNRKLLPGMFVHALLKEGVQNAAILVPQQAISRDTRGVPSVWVVKADNTVES
REIQTLRTVGNAWLISNGVTEGERIITEGVQRVRSGIAVNAVEAKNVNLVDGFAATTEASAN
>Q55025 1.11.1.-~~~srpA~~~Catalase-related peroxidase~~~COG0753
MIRIRNRWFRWLAIALASLVASIGIATVGFAATGVTPDQVLSAIEGTFGVNVGQRRNHIKGTCAVGNFVATTEAKTYSRS
PLFSGQSIPVVARFSLAGGNPKAPDTAKNPRGLGLQFQLPNNRFLNMALLNTPVFGVASPEGFYENILAIRPDPTTGKPD
PEKVKAFREKYPENKAQAAFLASNNPPTSYANTSYFGLHAFKFINQTNQTRLVRWQFVPQDGEKRLTDAELQAAPANFLE
QKLIERTQDSPVKWDFWITLGQPGDAEDNPTIAWPSDRQQVKVGTLTLTAASPQPGAACEGINYDPLVLSDGIEPTNDPV
LQFRSGVYALSYSKRTRGL
>O31100 ~~~srpB~~~Solvent-resistant pump membrane transporter SrpB~~~
MSRFFIDRPIFAWVLAIVAMLAGALSLAKMPISQYPNIAAPAVSIQVSYPGASAQTVQDTVVQVIEQQLSGLDGFRYMSA
ESASDGSMTIIVTFEQGTDPDIAQVQVQNKLQLATPRLPEEVQRQGLRVVKYQMNFFLVMSLVDRSGKLDNFDLGNLIAS
QLQDPISRIPGVGDFQLFGSPYAMRIWLDPGKLNSYQLTPTDVASAIREQNVQVSSGQLGGLPTRSGVQLNATVLGKTRM
TTPSQFDEILVKVNPDGSQVRVKDVGRAELGADSFAISAQYKDSPTASLALRLSTGGNLLETVDAVKKLMEQQKAYLPDG
VEVIYPYDTTPVVEASIESVVHTIFEAVVLVFLVMYLFLQSFRATLIPTLAVPVVLLATFALLPYFGLNINVLTMYAMVL
AIGLLVDDAIVVVENVERLMHDEGLSPLEATRKSMDQISGALVGIGMVLSAVFVPMAFFGGSAGIIYQQFAITIVVCMGL
SILVALVFTPALCVTILKAPEGNSHHERKGFFGWFNRIFDRGTRRFERGVGAMLKGRGRYLLAFLLITGGTGYLFTQIPK
AFLPNEDQGLMMIEVRTPANASAERTEGVLQEVRDYLANDEGALVEHFMTVNGFNFAGRGQNSGLVLITFKDWKERHGAG
QDVFSIAQRANQHFAKIKDASVMAFVPPAILEMGNAMGFNLYLQDNLGLGHEALMAARNQFLQLASQNPKLQAVRPNGKD
DEPQFQVNIDDEKARALQVSIASINETMSAAWGSMYVNDFIDRGRVKRVYVQGEDISRISPEDFDKWYVRNSLGQMVPFS
AFATGEWVNGSPKLERYGGISSLNILGEPAPGYSTGDAMIAIAEIMQQLPAGIGLSYTGLSYEEIQTGDQAPLLYALTVL
IVFLCLAALYESWSVPVSVIMVVPLGILGAVLATLWRDLTADVYFQVGLMTTVGLSAKNAILIVEFAKELYEKEGYPIVK
AAIEAAKLRLRPILMTSLAFTFGVLPMAIASGAGAGSQHSIATGVVGGMITATVLAVFFVPLFYVVVVKLFEGLMKRKPN
AVKEVTHEV
>Q55026 ~~~srpB~~~Protein SrpB~~~COG1285
MNVIFLPQGDWLGLSFRLLLAMLVGAVIGLNRQRGGRPAGMRTFTLVAMGSALFVMVPIQAEGDSSFAAINALSRTVQGV
AAGVGFLGAGLILQRAPKTKRSGRPRVSGLTTAATIWITAALGAVIGCGLWQLGLIGTFFTLLTLSGFKRLQRIAWLRQS
WERLIAWEAKTLPPDAEEEDDD
>O31101 ~~~srpC~~~Solvent efflux pump outer membrane protein SrpC~~~
MKFKSLPMFALLMLGGCSLIPDYQQPAAPMQAQWPTGQAYGGQGDQRSIATALPKAKEFFKDPALVRLLDAALENNRDLR
IAAKNVESYRALYRIQRAERFPTLDGQASGNRTRLPDDLSPTGDSRIDSQYQVGLVTAYELDLFGRIRSLSNQALEKYLA
TEEAQRSVQIALIGDVATTYFLWRTDQALLELTEATLTSYVESLAMIESSAWAGTSSELDVRQARTLVNQAQAQQALYTR
RIAQDVNALELLLGSKIPTDLPKNSPLAMSALGKVPAGLPADLLLNRPDIRSAEHQLMAANANIGAARAAFFPRISLTAS
AGSASSDLDGLFNSGSDSWSFAPQISVPIFNAGKLRANLDYAELQKDVGVATYEKSIQTAFREVADGLAARGTYGKQLSA
QSELVDNYKAYFSLAQQRYDQGVDSYLTVLDAQRELFSSQQKLLNDQLDQINSEVQLYKALGGGWSVSQN
>Q55027 ~~~srpC~~~Probable chromate transport protein~~~COG2059
MKDSDSLLHVHPAYSLKQLTQYFLKLGALGFGGPIALVGYMHRDLVEERQWVSEAEYQEGLTLAQVAPGPLAAQLSFYLG
YVHYGFLGSALVGLAFVLPSFLIVVALGWAYTLYGGLNWMQAVFYGVGAAVIGIIAISAYKLTRKTVGTSWLLWSIYLVN
AATTIVTESERVELILGSGALVLLVKFPPKHWIKQNRLNSFIGLPLIPLFAAVPTATTSLLGQIALFFTQAGAFVFGSGL
AIVPFLYGGVVKDFGWLNSQQFLDAVAVAMITPGPVVITTGFIGFLVAGFPGACVAAIAMFIPCYLLTVIPAPYFKKHGK
NPKISTFVNGVTVAATGAIAGAVVVLGRQSLHDLPTFLIGLIALISSWKLGKKLPEPLIIVIAAIAGVIFWSK
>Q59967 2.3.1.30~~~srpH~~~Serine acetyltransferase, plasmid~~~COG1045
MSLSPRSDRTEIRRSWGLDSIVSALSQASTDPLPHHLLSDQFYPLPSRESLGLILHGLRSVLFPRHFGDPELSVETTHYF
IGNTLDKTLNLLNEQIRRELWLQHVTQGTPEATPAVLSQHASELTQAFAQALPEIKRLLDSDVNAAYLGDPAAQSISEIL
FCYPGITAITFHRLAHRLYQLGLPLLARITAEVSHSETGIDIHPGAAIGGSFFIDHGTGVVIGETCVIGDRVRIYQAVTL
GAKSFPRDETGALIKGQARHPVIEDDVVIYAGATLLGRITVGRGSTIGGNVWLTRSVPAGSFISQAQIRSDNFESGGGI
>Q9R9T9 ~~~srpR~~~HTH-type transcriptional regulator SrpR~~~
MARKTAAEAEETRQRIIDAALEVFVAQGVSDATLDQIARKAGVTRGAVYWHFNGKLEVLQAVLASRQHPLELDFTPDLGI
ERSWEAVVVAMLDAVHSPQSKQFSEILIYQGLDESGLIHNRMVQASDRFLQYIHQVLRHAVTQGELPINLDLQTSIGVFK
GLITGLLYEGLRSKDQQAQIIKVALGSFWALLREPPRFLLCEEAQIKQVKSFE
>Q9R9U0 ~~~srpS~~~HTH-type transcriptional regulator SrpS~~~
MNQSDENVGKAGGIQVIARAASIMRALGSHPHGLSLAAIAQLVGLPRSTVQRIINALEEEFLVEALGPAGGFRLGPALGQ
LINQAQSDILSLVKPYLRSLAEELEESVCLASLAGDKIYVLDRIVSERELRVVFPIGINVPAAATAAGKVLLAALPDETL
QAALGEQLPVFTSNTLRRKALVKQLSEVRQSGFASDLDEHIDGVCSFATLLDTYLGYYSLAVVMPSSRASKQSDLIKKAL
LQSKQNIERAIGRASKKAP
>Q5HFT0 ~~~srrA~~~Transcriptional regulatory protein SrrA~~~
MSNEILIVDDEDRIRRLLKMYLERESFEIHEASNGQEAYELAMENNYACILLDLMLPEMDGIQVATKLREHKQTPIIMLT
AKGEETNRVEGFESGADDYIVKPFSPREVVLRVKALLRRTQSTTVEQSEPHARDVIEFKHLEIDNDAHRVLADNQEVNLT
PKEYELLIYLAKTPNKVFDREQLLKEVWHYEFYGDLRTVDTHVKRLREKLNRVSSEAAHMIQTVWGVGYKFEVKSNDEPA
K
>Q7A5H6 ~~~srrA~~~Transcriptional regulatory protein SrrA~~~
MSNEILIVDDEDRIRRLLKMYLERESFEIHEASNGQEAYELAMENNYACILLDLMLPEMDGIQVATKLREHKQTPIIMLT
AKGEETNRVEGFESGADDYIVKPFSPREVVLRVKALLRRTQSTTVEQSEPHARDVIEFKHLEIDNDAHRVLADNQEVNLT
PKEYELLIYLAKTPNKVFDREQLLKEVWHYEFYGDLRTVDTHVKRLREKLNRVSSEAAHMIQTVWGVGYKFEVKSNDEPA
K
>Q9L524 ~~~srrA~~~Transcriptional regulatory protein SrrA~~~
MSNEILIVDDEDRIRRLLKMYLERESFEIHEASNGQEAYELAMENNYACILLDLMLPEMDGIQVATKLREHKQTPIIMLT
AKGEETNRVEGFESGADDYIVKPFSPREVVLRVKALLRRTQSTTVEQSEPHARDVIEFKHLEIDNDAHRVLADNQEVNLT
PKEYELLIYLAKTPNKVFDREQLLKEVWHYEFYGDLRTVDTHVKRLREKLNRVSSEAAHMIQTVWGVGYKFEVKSNDEPA
K
>Q5HFT1 2.7.13.3~~~srrB~~~Sensor protein SrrB~~~
MMSRLNSVVIKLWLTIILIVTTVLILLSIALITFMQYYFTQETENAIREDARRISSLVEQSHNKEEAIKYSQTLIENPGG
LMIINNKHRQSTASLSNIKKQMLNEVVNNDHFDDVFDKGKSVTRNVTIKEKGSSQTYILLGYPTKAQKNSHSKYSGVFIY
KDLKSIEDTNNAITIITIITAVIFLTITTVFAFFLSSRITKPLRRLRDQATRVSEGDYSYKPSVTTKDEIGQLSQAFNQM
STEIEEHVDALSTSKNIRDSLINSMVEGVLGINESRQIILSNKMANDIMDNIDEDAKAFLLRQIEDTFKSKQTEMRDLEM
NARFFVVTTSYIDKIEQGGKSGVVVTVRDMTNEHNLDQMKKDFIANVSHELRTPISLLQGYTESIVDGIVTEPDEIKESL
AIVLDESKRLNRLVNELLNVARMDAEGLSVNKEVQPIAALLDKMKIKYRQQADDLGLNMTFNYCKKRVWSYDMDRMDQVL
TNLIDNASRYTKPGDEIAITCDENESEDILYIKDTGTGIAPEHLQQVFDRFYKVDAARTRGKQGTGLGLFICKMIIEEHG
GSIDVKSELGKGTTFIIKLPKPE
>Q7A5H7 2.7.13.3~~~srrB~~~Sensor protein SrrB~~~
MMSRLNSVVIKLWLTIILIVTTVLILLSIALITFMQYYFTQETENAIREDARRISSLVEQSHNKEEAIKYSQTLIENPGG
LMIINNKHRQSTASLSNIKKQMLNEVVNNDHFDDVFDKGKSVTRNVTIKEKGSSQTYILLGYPTKAQKNSHSKYSGVFIY
KDLKSIEDTNNAITIITIITAVIFLTITTVFAFFLSSRITKPLRRLRDQATRVSEGDYSYKPSVTTKDEIGQLSQAFNQM
STEIEEHVDALSTSKNIRDSLINSMVEGVLGINESRQIILSNKMANDIMDNIDEDAKAFLLRQIEDTFKSKQTEMRDLEM
NARFFVVTTSYIDKIEQGGKSGVVVTVRDMTNEHNLDQMKKDFIANVSHELRTPISLLQGYTESIVDGIVTEPDEIKESL
AVVLDESKRLNRLVNELLNVARMDAEGLSVNKEVQPIAALLDKMKIKYRQQADDLGLNMTFNYCKKRVWSYDMDRMDQVL
TNLIDNASRYTKPGDEIAITCDENESEDILYIKDTGTGIAPEHLQQVFDRFYKVDAARTRGKQGTGLGLFICKMIIEEHG
GSIDVKSELGKGTTFIIKLPKPE
>Q9L523 2.7.13.3~~~srrB~~~Sensor protein SrrB~~~
MMSRLNSVVIKLWLTIILIVTTVLILLSIALITFMQYYFTQETENAIREDARRISSLVEQSHNKEEAIKYSQTLIENPGG
LMIINNKHRQSTASLSNIKKQMLNEVVNNDHFDDVFDKGKSVTRNVTIKEKGSSQTYILLGYPTKAQKNSHSKYSGVFIY
KDLKSIEDTNNAITIITIITAVIFLTITTVFAFFLSSRITKPLRRLRDQATRVSEGDYSYKPSVTTKDEIGQLSQAFNQM
STEIEEHVDALSTSKNIRDSLINSMVEGVLGINESRQIILSNKMANDIMDNIDEDAKAFLLRQIEDTFKSKQTEMRDLEM
NTRFFVVTTSYIDKIEQGGKSGVVVTVRDMTNEHNLDQMKKDFIANVSHELRTPISLLQGYTESIVDGIVTEPDEIKESL
AIVLDESKRLNRLVNELLNVARMDAEGLSVNKEVQPIAALLDKMKIKYRQQADDLGLNMTFNYCKKRVWSYDMDRMDQVL
TNLIDNASRYTKPGDEIAITCDENESEDILYIKDTGTGIAPEHLQQVFDRFYKVDAARTRGKQGTGLGLFICKMIIEEHG
GSIDVKSELGKGTTFIIKLPKPE
>P0DPQ5 3.4.22.-~~~srtA~~~Sortase A~~~
MNKQRIYSIVAILLFVVGGVLIGKPFYDGYQAEKKQTENVQAVQKMDYEKHETEFVDASKIDQPDLAEVANASLDKKQVI
GRISIPSVSLELPVLKSSTEKNLLSGAATVKENQVMGKGNYALAGHNMSKKGVLFSDIASLKKGDKIYLYDNENEYEYAV
TGVSEVTPDKWEVVEDHGKDEITLITCVSVKDNSKRYVVAGDLVGTKAKK
>Q8Y8H5 3.4.22.-~~~srtA~~~Sortase A~~~COG3764
MLKKTIAIIILIIGLLLIFSPFIKNGIVKYMSGHETIEQYKASDIKKNNEKDATFDFESVQLPSMTSVIKGAANYDKDAV
VGSIAVPSVDVNLLVFKGTNTANLLAGATTMRSDQVMGKGNYPLAGHHMRDESMLFGPIMKVKKGDKIYLTDLENLYEYT
VTETKTIDETEVSVIDNTKDARITLITCDKPTETTKRFVAVGELEKTEKLTKELENKYFPSK
>Q2FV99 3.4.22.70~~~srtA~~~Sortase A~~~COG3764
MKKWTNRLMTIAGVVLILVAAYLFAKPHIDNYLHDKDKDEKIEQYDKNVKEQASKDKKQQAKPQIPKDKSKVAGYIEIPD
ADIKEPVYPGPATPEQLNRGVSFAEENESLDDQNISIAGHTFIDRPNYQFTNLKAAKKGSMVYFKVGNETRKYKMTSIRD
VKPTDVGVLDEQKGKDKQLTLITCDDYNEKTGVWEKRKIFVATEVK
>Q8E5N2 3.4.22.-~~~strA~~~Sortase A~~~COG3764
MRNKKKLHGFFNFVRWLLVVLLIIVGLALVFNKPIRNAFIAHQSNHYQISRVSKKTIEKNKKSKTSYDFSSVKSISTESI
LSAQTKSHNLPVIGGIAIPDVEINLPIFKGLGNTELSYGAGTMKENQIMGGQNNYALASHHVFGLTGSSKMLFSPLEHAK
KGMKVYLTDKSKVYTYTITEISKVTPEHVEVIDDTPGKSQLTLVTCTDPEATERIIVHAELEKTGEFSTADESILKAFSK
KYNQINL
>P0C0H8 ~~~srtA~~~Lantibiotic streptin~~~
MNNTIKDFDLDLKTNKKDTATPYVGSRYLCTPGSCWKLVCFTTTVK
>Q8Y588 3.4.22.-~~~srtB~~~Sortase B~~~COG4509
MKIKSFLGKSLTLVVLGVFLFSGWKIGMELYENKHNQTILDDAKAVYTKDAATTNVNGEVRDELRDLQKLNKDMVGWLTI
IDTEIDYPILQSKDNDYYLHHNYKNEKARAGSIFKDYRNTNEFLDKNTIIYGHNMKDGSMFADLRKYLDKDFLVAHPTFS
YESGLTNYEVEIFAVYETTTDFYYIETEFPETTDFEDYLQKVKQQSVYTSNVKVSGKDRIITLSTCDTEKDYEKGRMVIQ
GKLVTK
>Q2FZE3 3.4.22.71~~~srtB~~~Sortase B~~~COG4509
MRMKRFLTIVQILLVVIIIIFGYKIVQTYIEDKQERANYEKLQQKFQMLMSKHQEHVRPQFESLEKINKDIVGWIKLSGT
SLNYPVLQGKTNHDYLNLDFEREHRRKGSIFMDFRNELKNLNHNTILYGHHVGDNTMFDVLEDYLKQSFYEKHKIIEFDN
KYGKYQLQVFSAYKTTTKDNYIRTDFENDQDYQQFLDETKRKSVINSDVNVTVKDRIMTLSTCEDAYSETTKRIVVVAKI
IKVS
>P54603 3.4.22.-~~~srtD~~~Sortase D~~~COG3764
MKKVIPLFIIAAGLVIAGYGGFKLIDTNTKTEQTLKEAKLAAKKPQEASGTKNSTDQAKNKASFKPETGQASGILEIPKI
NAELPIVEGTDADDLEKGVGHYKDSYYPDENGQIVLSGHRDTVFRRTGELEKGDQLRLLLSYGEFTYEIVKTKIVDKDDT
SIITLQHEKEELILTTCYPFSYVGNAPKRYIIYGKRVT
>Q9XA14 3.4.22.70~~~srtE1~~~Sortase SrtE1~~~COG3764
MTALRPERDSGTAGDQGSSYGQPYGDSGAFGGGRYEESAAGEENRPPLLDDETVALRIPEPPAPRTAAGTGPIGGGPDGG
GRAARRKAAKRRHGRRGAPRDQAPEEEAEQAPKAPLSRVEARRQARARKPGAAVVASRAIGEIFITTGVLMLLFVTYQLW
WTNVRAHAQANQAASNLQDDWANGKRSPGSFEPGQGFALLHIPKLDVVVPIAEGISSKKVLDRGMVGHYAEDGLKTAMPD
AKAGNFGLAGHRNTHGEPFRYINKLEPGDPIVVETQDKYFVYKMASILPVTSPSNVSVLDPVPKQSGFKGPGRYITLTTC
TPEFTSKYRMIVWGKMVEERPRSKGKPDALVS
>Q9XA15 3.4.22.70~~~srtE2~~~Sortase SrtE2~~~COG3764
MAATTDTEHQEQAGTGGRGRRRPGRIAAQAVSVLGELLITAGLVMGLFVVYSLWWTNVVADRAADKQAEKVRDDWAQDRV
GGSGQDGPGALDTKAGIGFLHVPAMSEGDILVEKGTSMKILNDGVAGYYTDPVKATLPTSDEKGNFSLAAHRDGHGARFH
NIDKIEKGDPIVFETKDTWYVYKTYAVLPETSKYNVEVLGGIPKESGKKKAGHYITLTTCTPVYTSRYRYVVWGELVRTE
KVDGDRTPPKELR
>P31631 3.4.21.-~~~ssa1~~~Serotype-specific antigen 1~~~
MYKIKHSFNKTLIAISISSFLSIAYATESIENPQPIIQLSESLSSKYSGKGVKLGVMDEGFMVKHPRHSSHLHPLIHQLT
TPEGEVRIYDASYPQFEVNPVEKEDGIDLIPSLETHGAGVAGIIAAQADKTLGDGYSGGIAKGAELYVATKSYKRTLEKV
IQDAKKELENAKDEEDEKTPSLDQMAKNDLLASKEKEMAIERAEWASGLNKLLDNNVFAINNSWNPFSISDDINVVDKFY
QSIKQNKHNPLLQAIMRAKNSNTLLVFAAGNESKKQPGVMALLPRYFPELEKNLISAVAVDKEQKIASYSNHCGASKNWC
VAAPGDLHVLIGVADEHKKPQYGLTKEQGTSFSAPAITASLAVLKERFDYLTATQIRDTLLTTATDLGEKGVDNVYGWGL
INLKKAVNGPTQFLNDETITVTRDDHWSNPLASQFKITKKGDKSLHLDGENHLDTVAVEEGRLALNGKTKVKTISNHANL
AVNGTEVEQNYSSSGQSQLEVLGKSGLIANAQANIHLAGSLKIDDKLTEKTEAGDVSATVVQLKDKATYQGGFTQLVENE
NLAKRGLIQDLYFKESEIIAKVNKPLTDEKADTNGQAGLALLNALRTTPIAYRRSWYNGWLQSALEQRKLDNLHYAVSNN
IYADSLELLRSQNRKGLTQAQQHLFTAYHTPLQTTVWAEHLNQKQSASSKHTDVKHHQSQLGVNHKLADKTVLSATLSQQ
KNRLEKPFAQATLKQTALNIGLRYHLDNAWFSEATLQFARQKYQQSRRFASHQLGTAETRGSTLGGEMRIGYQFMPNQWI
IEPSLGVQWIQTKMNGLNESGELATQTAAMRYRDVNIVPSVKLQRTFQLEQGSISPYIGLNYLHRLNGKITKITSNIAGK
TLHSEATTKRNRQLNGEVGVKLHYKNWFTAMNLDYSRVKSCKPIWLESKCWL
>Q2G2J2 ~~~ssaA2~~~Staphylococcal secretory antigen ssaA2~~~COG3942
MKKIATATIATAGFATIAIASGNQAHASEQDNYGYNPNDPTSYSYTYTIDAQGNYHYTWKGNWHPSQLNQDNGYYSYYYY
NGYNNYNNYNNGYSYNNYSRYNNYSNNNQSYNYNNYNSYNTNSYRTGGLGASYSTSSNNVQVTTTMAPSSNGRSISSGYT
SGRNLYTSGQCTYYVFDRVGGKIGSTWGNASNWANAAARAGYTVNNTPKAGAIMQTTQGAYGHVAYVESVNSNGSVRVSE
MNYGYGPGVVTSRTISASQAAGYNFIH
>Q7A423 ~~~ssaA2~~~Staphylococcal secretory antigen ssaA2~~~
MKKIATATIATAGFATIAIASGNQAHASEQDNYGYNPNDPTSYSYTYTIDAQGNYHYTWKGNWHPSQLNQDNGYYSYYYY
NGYNNYNNYNNGYSYNNYSRYNNYSNNNQSYNYNNYNSYNTNSYRTGGLGASYSTSSNNVQVTTTMAPSSNGRSISSGYT
SGRNLYTSGQCTYYVFDRVGGKIGSTWGNASNWANAAARAGYTVNNTPKAGAIMQTTQGAYGHVAYVESVNSNGSVRVSE
MNYGYGPGVVTSRTISASQAAGYNFIH
>Q2G0D4 3.5.1.28~~~~~~Probable autolysin SsaALP~~~COG1388
MKKLAFAITATSGAAAFLTHHDAQASTQHTVQSGESLWSIAQKYNTSVESIKQNNQLDNNLVFPGQVISVGGSDAQNTSN
TSPQAGSASSHTVQAGESLNIIASRYGVSVDQLMAANNLRGYLIMPNQTLQIPNGGSGGTTPTATTGSNGNASSFNHQNL
YTAGQCTWYVFDRRAQAGSPISTYWSDAKYWAGNAANDGYQVNNTPSVGSIMQSTPGPYGHVAYVERVNGDGSILISEMN
YTYGPYNMNYRTIPASEVSSYAFIH
>Q2FV55 ~~~ssaA~~~Staphylococcal secretory antigen SsaA~~~COG3942
MKKIVTATIATAGLATIAFAGHDAQAAEQNNNGYNSNDAQSYSYTYTIDAQGNYHYTWTGNWNPSQLTQNNTYYYNNYNT
YSYNNASYNNYYNHSYQYNNYTNNSQTATNNYYTGGSGASYSTTSNNVHVTTTAAPSSNGRSISNGYASGSNLYTSGQCT
YYVFDRVGGKIGSTWGNASNWANAAASSGYTVNNTPKVGAIMQTTQGYYGHVAYVEGVNSNGSVRVSEMNYGHGAGVVTS
RTISANQAGSYNFIH
>P74856 ~~~ssaV~~~Secretion system apparatus protein SsaV~~~
MRSWLGEGVRAQQWLSVCAGRQDMVLATVLLIAIVMMLLPLPTWMVDILITINLMFSVILLLIAIYLSDPLDLSVFPSLL
LITTLYRLSLTISTSRLVLLQHNAGNIVDAFGKFVVGGNLTVGLVVFTIITIVQFIVITKGIERVAEVSARFSLDGMPGK
QMSIDGDLRAGVIDADHARTLRQHVQQESRFLGAMDGAMKFVKGDTIAGIIVVLVNIIGGIIIAIVQYDMSMSEAVHTYS
VLSIGDGLCGQIPSLLISLSAGIIVTRVPGEKRQNLATELSSQIARQPQSLILTAVVLMLLALIPGFPFITLAFFSALLA
LPIILIRRKKSVVSANGVEAPEKDSMVPGACPLILRLSPTLHSADLIRDIDAMRWFLFEDTGVPLPEVNIEVLPEPTEKL
TVLLYQEPVFSLSIPAQADYLLIGADASVVGDSQTLPNGMGQICWLTKDMAHKAQGFGLDVFAGSQRISALLKCVLLRHM
GEFIGVQETRYLMNAMEKNYSELVKELQRQLPINKIAETLQRLVSERVSIRDLRLIFGTLIDWAPREKDVLMLTEYVRIA
LRRHILRRLNPEGKPLPILRIGEGIENLVRESIRQTAMGTYTALSSRHKTQILQLIEQALKQSAKLFIVTSVDTRRFLRK
ITEATLFDVPILSWQELGEESLIQVVESIDLSEEELADNEE
>P0A2F6 ~~~ssb~~~Single-stranded DNA-binding protein 1~~~
MASRGVNKVILVGNLGQDPEVRYMPSGGAVANLTLATSESWRDKQTGEMKEQTEWHRVVMFGKLAEVAGEYLRKGSQVYI
EGQLRTRKWTDQSGQERYTTEINVPQIGGVMQMLGGRQGGGAPAGGQQQGGWGQPQQPQQPQGGNQFSGGAQSRPQQSAP
APSNEPPMDFDDDIPF
>Q9KYI9 ~~~ssb1~~~Single-stranded DNA-binding protein 1~~~COG0629
MNETMICAVGNVATTPVFRDLANGPSVRFRLAVTARYWDREKNAWTDGHTNFFTVWANRQLATNASGSLAVGDPVVVQGR
LKVRTDVREGQSRTSADIDAVAIGHDLARGTAAFRRTARTEASTSPPRPEPNWEVPAGGTPGEPVPEQRPDPVPVG
>Q9X8U3 ~~~ssb2~~~Single-stranded DNA-binding protein 2~~~COG0629
MAGETVITVVGNLVDDPELRFTPSGAAVAKFRVASTPRTFDRQTNEWKDGESLFLTCSVWRQAAENVAESLQRGMRVIVQ
GRLKQRSYEDREGVKRTVYELDVDEVGASLRSATAKVTKTSGQGRGGQGGYGGGGGGQGGGGWGGGPGGGQQGGGAPADD
PWATGGAPAGGQQGGGGQGGGGWGGGSGGGGGYSDEPPF
>P73145 ~~~~~~Thylakoid-associated single-stranded DNA-binding protein slr1034~~~COG0629
MNSFVLMATVIREPELRFTKENQTPVCEFLVEFPGMRDDSPKESLKVVGWGNLANTIKETYHPGDRLIIEGRLGMNMIER
QEGFKEKRAELTASRISLVDSGNGINPGELSSPPEPEAVDLSNTDDIPF
>P28044 ~~~ssb~~~Plasmid-derived single-stranded DNA-binding protein~~~
MAVRGINKVILVGRLGKDPEVRYIPNGGAVANLQVATSESWRDKQTGEIREQTEWHRVVLFGKLAEVAGEYLRKGAQVYI
EGQLRTRSWEDNGITRYVTEILVKTTGTMQMLGRAAGTQTQPEEAQQFSGQPQPESQPEPKKGGAKTKGRERKAAQPEPR
QPSEPAYDFDDDIPF
>P37455 ~~~ssbA~~~Single-stranded DNA-binding protein A~~~COG0629
MLNRVVLVGRLTKDPELRYTPNGAAVATFTLAVNRTFTNQSGEREADFINCVTWRRQAENVANFLKKGSLAGVDGRLQTR
NYENQQGQRVFVTEVQAESVQFLEPKNGGGSGSGGYNEGNSGGGQYFGGGQNDNPFGGNQNNQRRNQGNSFNDDPFANDG
KPIDISDDDLPF
>C0SPB6 ~~~ssbB~~~Single-stranded DNA-binding protein B~~~COG0629
MFNQVMLVGRLTKDPDLRYTSAGAAVAHVTLAVNRSFKNASGEIEADYVNCTLWRKTAENTALYCQKGSLVGVSGRIQTR
SYENEEGVNVYVTEVLADTVRFMDPKPREKAAD
>Q83EP4 ~~~ssb~~~Single-stranded DNA-binding protein~~~COG0629
MARGVNKVILIGNLGQDPEVRYTPNGNAVANVTLATSTTWRDKQTGELQERTEWHRIAFFNRLAEIVGEYLRKGSKIYIE
GSLRTRKWQDKNGVDRYTTEIIANEMHMLDNRGGGNSGNYGNHSEGGASNKQSAPTSSQTPTAGDDSSVADFDDDIPF
>Q9RY51 ~~~ssb~~~Single-stranded DNA-binding protein~~~COG0629
MARGMNHVYLIGALARDPELRYTGNGMAVFEATVAGEDRVIGNDGRERNLPWYHRVSILGKPAEWQAERNLKGGDAVVVE
GTLEYRQWEAPEGGKRSAVNVKALRMEQLGTQPELIQDAGGGVRMSGAMNEVLVLGNVTRDPEIRYTPAGDAVLSLSIAV
NENYQDRQGQRQEKVHYIDATLWRDLAENMKELRKGDPVMIMGRLVNEGWTDKDGNKRNSTRVEATRVEALARGAGNANS
GYAAATPAAPRTQTASSAARPTSGGYQSQPSRAANTGSRSGGLDIDQGLDDFPPEEDDLPF
>P0AGE0 ~~~ssb~~~Single-stranded DNA-binding protein~~~COG0629
MASRGVNKVILVGNLGQDPEVRYMPNGGAVANITLATSESWRDKATGEMKEQTEWHRVVLFGKLAEVASEYLRKGSQVYI
EGQLRTRKWTDQSGQDRYTTEVVVNVGGTMQMLGGRQGGGAPAGGNIGGGQPQGGWGQPQQPQGGNQFSGGAQSRPQQSA
PAAPSNEPPMDFDDDIPF
>O25841 ~~~ssb~~~Single-stranded DNA-binding protein~~~COG0629
MFNKVIMVGRLTRNVELKYLPSGSAAATIGLATSRRFKKQDGTLGEEVCFIDARLFGRTAEIANQYLSKGSSVLIEGRLT
YESWMDQTGKKNSRHTITADSLQFMDKKSDNPQANAMQDSIMHENSNNAYPANHNAPSQDPFNQAYAQNAYAKENLQAQP
SKYQNSVPEINIDEEEIPF
>P46390 ~~~ssb~~~Single-stranded DNA-binding protein~~~COG0629
MAGDTTITIVGNLTADPELRFTSSGAAVVNFTVASTPRIYDRQSGEWKDGEALFLRCNIWREAAENVAESLTRGARVIVT
GRLKQRSFETREGEKRTVVEVEVDEIGPSLRYATAKVNKASRSGGGGGGFGSGSRQAPAQMSGGVGDDPWGSAPTSGSFG
VGDEEPPF
>Q9AFI5 ~~~ssb~~~Single-stranded DNA-binding protein~~~COG0629
MAGDTTITVVGNLTADPELRFTPSGAAVANFTVASTPRMFDRQSGEWKDGEALFLRCNIWREAAENVAESLTRGSRVIVT
GRLKQRSFETREGEKRTVVEVEVDEIGPSLRYATAKVNKASRSGGGGGGFGSGGGGSRQSEPKDDPWGSAPASGSFSGAD
DEPPF
>P9WGD5 ~~~ssb~~~Single-stranded DNA-binding protein~~~COG0629
MAGDTTITIVGNLTADPELRFTPSGAAVANFTVASTPRIYDRQTGEWKDGEALFLRCNIWREAAENVAESLTRGARVIVS
GRLKQRSFETREGEKRTVIEVEVDEIGPSLRYATAKVNKASRSGGFGSGSRPAPAQTSSASGDDPWGSAPASGSFGGGDD
EPPF
>P28046 ~~~ssb~~~Single-stranded DNA-binding protein~~~
MASRGVNKVILIGNLGQDPEIRYMPSGGAVANLTLATSESWRDKQTGEMKEKTEWHRVVIFGKLAEIAGEYLRKGSQVYI
EGQLQTRKWQDQSGQDRYSTEVVVNIGGTMQMLGGRGGQDNAPSQGQGGWGQPQQPQASQQFSGGAPSRPAQQPAAAPAP
SNEPPMDFDDDIPF
>P40947 ~~~ssb~~~Single-stranded DNA-binding protein~~~
MARGVNKVILVGNVGGDPETRYMPNGNAVTNITLATSESWKDKQTGQQQERTEWHRVVFFGRLAEIAGEYLRKGSQVYVE
GSLRTRKWQGQDGQDRYTTEIVVDINGNMQLLGGRPSGDDSQRAPREPMQRPQQAPQQQSRPAPQQQPAPQPAQDYDSFD
DDIPF
>P25762 ~~~ssb~~~Single-stranded DNA-binding protein~~~
MASRGVNKVILVGNLGQDPEVRYMPNGGAVANITLATSESWRDKATGEQKEKTEWHRVVLFGKLAEVAGEYLRKGSQVYI
EGSLQTRKWQDQSGQDRYTTEIVVNVGGTMQMLGGRQGGGAPAGQSAGGQGGWGQPQQPQGGNQFSGGQQQSRPAQNSAP
ATSNEPPMDFDDDIPF
>P66854 ~~~ssb~~~Single-stranded DNA-binding protein~~~COG0629
MINNVVLVGRMTRDAELRYTPSNVAVATFTLAVNRTFKSQNGEREADFINVVMWRQQAENLANWAKKGSLIGVTGRIQTR
SYDNQQGQRVYVTEVVAENFQMLESRSVREGHTGGAYSAPTANYSAPTNSVPDFSRNENPFGATNPLDISDDDLPF
>Q9KH06 ~~~ssb~~~Single-stranded DNA-binding protein~~~
MARGLNQVFLIGTLTARPDMRYTPGGLAILDLNLAGQDAFTDESGQEREVPWYHRVRLLGRQAEMWGDLLEKGQLIFVEG
RLEYRQWEKDGEKKSEVQVRAEFIDPLEGRGRETLEDARGQPRLRRALNQVILMGNLTRDPDLRYTPQGTAVVRLGLAVN
ERRRGQEEERTHFLEVQAWRELAEWASELRKGDGLLVIGRLVNDSWTSSSGERRFQTRVEALRLERPTRGPAQAGGSRPP
TVQTGGVDIDEGLEDFPPEEDLPF
>Q9WZ73 ~~~ssb~~~Single-stranded DNA-binding protein~~~COG0629
MSFFNKIILIGRLVRDPEERYTLSGTPVTTFTIAVDRVPRKNAPDDAQTTDFFRIVTFGRLAEFARTYLTKGRLVLVEGE
MRMRRWETPTGEKRVSPEVVANVVRFMDRKPAETVSETEEELEIPEEDFSSDTFSEDEPPF
>Q5SLP9 ~~~ssb~~~Single-stranded DNA-binding protein~~~COG0629
MARGLNRVFLIGALATRPDMRYTPAGLAILDLTLAGQDLLLSDNGGEREVSWYHRVRLLGRQAEMWGDLLDQGQLVFVEG
RLEYRQWEREGERRSELQIRADFLDPLDDRGKERAEDSRGQPRLRAALNQVFLMGNLTRDPELRYTPQGTAVARLGLAVN
ERRQGAEERTHFVEVQAWRDLAEWAAELRKGDGLFVIGRLVNDSWTSSSGERRFQTRVEALRLERPTRGPAQAGGSRSRE
VQTGGVDIDEGLEDFPPEEELPF
>C0H3Y2 ~~~sscA~~~Small spore coat assembly protein A~~~
MSGGYSNGFALLVVLFILLIIVGAAYIY
>L8EBJ6 ~~~sscB~~~Probable small spore coat assembly protein B~~~
MGEVFAGGFALLVVLFILLIIIGASWLY
>Q8GAI8 1.2.1.16~~~sad~~~Succinate-semialdehyde dehydrogenase~~~
MLATLASATSEDAVAALEAACAAQTSWARTAPRVRAEILRRAFDLVTARSEDFALLMTLEMGKPLAEARGEVAYGAEFLR
WFSEETVRDYGRYLTTPEGKNKILVQHKPVGPCLLITPWNFPLAMATRKVAPAVAAGCTMVLKPAKLTPLTSQLFAQTMM
EAGLPAGVLNVVSSSSASGISGPLLKDSRLRKVSFTGSTPVGKRLMSDASRHVLRTSMELGGNAPFVVFEDADLDKAVEG
AMAAKMRNMGEACTAANRFLVQESVAQEFTRKFAAAMGALSTGRGTDPASQVGPLINNGARDDIHALVTAAVDAGAVAVT
GGAPVDGPGYFYQPTVLADVPNNAAILGQEIFGPVAPVTTFTTEQDAIKLANASEYGLAAYLYSRDFNRLLRVAEQIEFG
MVGFNAGIISNAAAPFGGVKQSGLGREGGSEGIAEYTTTQYIGIADPYEN
>O84944 ~~~sseA~~~Type III secretion system chaperone SseA~~~
MMIKKKAAFSEYRDLEQSYMQLNHCLKKFHQIRAKVSQQLAERAESPKNSRETESILHNLFPQGVAGVNQEAEKDLKKIV
SLFKQLEVRLKQLNAQAPVEIPSGKTKR
>Q7BVH7 ~~~sseB~~~Secreted effector protein SseB~~~
MSSGNILWGSQNPIVFKNSFGVSNADTGSQDDLSQQNPFAEGYGVLLILLMVIQAIANNKFIEVQKNAERARNTQEKSNE
MDEVIAKAAKGDAKTKEEVPEDVIKYMRDNGILIDGMTIDDYMAKYGDHGKLDKGGLQAIKAALDNDANRNTDLMSQGQI
TIQKMSQELNAVLTQLTGLISKWGEISSMIAQKTYS
>Q8ZQ79 ~~~sseI~~~Secreted effector protein SseI~~~
MPFHIGSGCLPAIISNRRIYRIAWSDTPPEMSSWEKMKEFFCSTHQAEALECIWTICHPPAGTTREDVVSRFELLRTLAY
DGWEENIHSGLHGENYFCILDEDSQEILSVTLDDVGNYTVNCQGYSETHHLTMATEPGVERTDITYNLTSDIDAAAYLEE
LKQNPIINNKIMNPVGQCESLMTPVSNFMNEKGFDNIRYRGIFIWDKPTEEIPTNHFAVVGNKEGKDYVFDVSAHQFENR
GMSNLNGPLILSADEWVCKYRMATRRKLIYYTDFSNSSIAANAYDALPRELESESMAGKVFVTSPRWFNTFKKQKYSLIG
KM
>Q9FD10 3.-.-.-~~~sseJ~~~Secreted effector protein SseJ~~~
MPLSVGQGYFTSSISSEKFNAIKESARLPELSLWEKIKAYFFTTHHAEALECIFNLYHHQELNLTPVQVRGAYIKLRALA
SQGCKEQFIIESQEHADKLIIKDDNGENILSIEVECHPEAFGLAKEINKSHPKPKNISLGDITRLVFFGDSLSDSLGRMF
EKTHHILPSYGQYFGGRFTNGFTWTEFLSSPHFLGKEMLNFAEGGSTSASYSCFNCIGDFVSNTDRQVASYTPSHQDLAI
FLLGANDYMTLHKDNVIMVVEQQIDDIEKIISGGVNNVLVMGIPDLSLTPYGKHSDEKRKLKDESIAHNALLKTNVEELK
EKYPQHKICYYETADAFKVIMEAASNIGYDTENPYTHHGYVHVPGAKDPQLDICPQYVFNDLVHPTQEVHHCFAIMLESF
IAHHYSTE
>A0A0H3NK84 2.4.1.-~~~sseK1~~~Protein-arginine N-acetylglucosaminyltransferase SseK1~~~
MIPPLNRYVPALSKNELVKTVTNRDIQFTSFNGKDYPLCFLDEKTPLLFQWFERNPARFGKNDIPIINTEKNPYLNNIIK
AATIEKERLIGIFVDGDFFPGQKDAFSKLEYDYENIKVIYRNDIDFSMYDKKLSEIYMENISKQESMPEEKRDCHLLQLL
KKELSDIQEGNDSLIKSYLLDKGHGWFDFYRNMAMLKAGQLFLEADKVGCYDLSTNSGCIYLDADMIITEKLGGIYIPDG
IAVHVERIDGRASMENGIIAVDRNNHPALLAGLEIMHTKFDADPYSDGVCNGIRKHFNYSLNEDYNSFCDFIEFKHDNII
MNTSQFTQSSWARHVQ
>Q9L9J3 2.4.1.-~~~sseK1~~~Protein-arginine N-acetylglucosaminyltransferase SseK1~~~
MIPPLNRYVPALSKNELVKTVTNRDIQFTSFNGKDYPLCFLDEKTPLLFQWFERNPARFGKNDIPIINTEKNPYLNNIIK
AATIEKERLIGIFVDGDFFPGQKDAFSKLEYDYENIKVIYRNDIDFSMYDKKLSEIYMENISKQESMPEEKRDCHLLQLL
KKELSDIQEGNDSLIKSYLLDKGHGWFDFYRNMAMLKAGQLFLEADKVGCYDLSTNSGCIYLDADMIITEKLGGIYIPDG
IAVHVERIDGRASMENGIIAVDRNNHPALLAGLEIMHTKFDADPYSDGVCNGIRKHFNYSLNEDYNSFCDFIEFKHDNII
MNTSQFTQSSWARHVQ
>P0DUJ8 2.4.1.-~~~sseK2~~~Protein-arginine N-acetylglucosaminyltransferase SseK2~~~
MARFNAAFTRIKIMFSRIRGLISCQSNTQTIAPTLSPPSSGHVSFAGIDYPLLPLNHQTPLVFQWFERNPDRFGQNEIPI
INTQKNPYLNNIINAAIIEKERIIGIFVDGDFSKGQRKALGKLEQNYRNIKVIYNSDLNYSMYDKKLTTIYLENITKLEA
QSASERDEVLLNGVKKSLEDVLKNNPEETLISSHNKDKGHLWFDFYRNLFLLKGSDAFLEAGKPGCHHLQPGGGCIYLDA
DMLLTDKLGTLYLPDGIAIHVSRKDNHVSLENGIIAVNRSEHPALIKGLEIMHSKPYGDPYNDWLSKGLRHYFDGSHIQD
YDAFCDFIEFKHENIIMNTSSLTASSWR
>P0DUJ7 2.4.1.-~~~sseK3~~~Protein-arginine N-acetylglucosaminyltransferase SseK3~~~
MFSRVRGFLSCQNYSHTATPAITLPSSGSANFAGVEYPLLPLDQHTPLLFQWFERNPSRFGENQIPIINTQQNPYLNNII
NAAIIEKERTIGVLVDGNFSAGQKKALAKLEKQYENIKVIYNSDLDYSMYDKKLSDIYLENIAKIEAQPANVRDEYLLGE
IKKSLNEVLKNNPEESLVSSHDKRLGHVRFDFYRNLFLLKGSNAFLEAGKHGCHHLQPGGGCIYLDADMLLTGKLGTLYL
PDGIAVHVSRKGNSMSLENGIIAVNRSEHPALKKGLEIMHSKPYGDPYIDGVCGGLRHYFNCSIRHNYEEFCNFIEFKHE
HIFMDTSSLTISSWR
>Q8ZNG2 3.4.22.-~~~sseL~~~Deubiquitinase SseL~~~
MNICVNSLYRLSIPQFHSLYTEEVSDEALTLLFSAVENGDQNCIDLLCNLALRNDDLGHRVEKFLFDLFSGKRTGSSDID
KKINQACLVLHQIANNDITKDNTEWKKLHAPSRLLYMAGSATTDLSKKIGIAHKIMGDQFAQTDQEQVGVENLWCGARML
SSDELAAATQGLVQESPLLSVNYPIGLIHPTTKENILSTQLLEKIAQSGLSHNEVFLVNTGDHWLLCLFYKLAEKIKCLI
FNTYYDLNENTKQEIIEAAKIAGISESDEVNFIEMNLQNNVPNGCGLFCYHTIQLLSNAGQNDPATTLREFAENFLTLSV
EEQALFNTQTRRQIYEYSLQ
>Q9L268 ~~~ssgB~~~Sporulation-specific cell division protein SsgB~~~
MNTTVSCELHLRLVVSSESSLPVPAGLRYDTADPYAVHATFHTGAEETVEWVFARDLLAEGLHRPTGTGDVRVWPSRSHG
QGVVCIALSSPEGEALLEAPARALESFLKRTDAAVPPGTEHRHFDLDQELSHILAES
>Q47N25 ~~~ssgB~~~Sporulation-specific cell division protein SsgB~~~
MSSSGTSITCEVGLQLIVPDRAPVPLVARLDYSVDDPYAIRAAFHVGDDEPVEWIFARELLTVGIIRETGEGDVRIWPSQ
DGKERMVNIALSSPFGQARFHAQVAPLSEFLHRTYELVPAGQESDYIDIDAEIAEHLS
>Q9R641 ~~~~~~Subtilisin inhibitor-like protein 12~~~
SLYPASALVLTVGHGADAATAEVQRAVTLSCRPTPTGTHPAPAQACAELHSVGGALGLLRTGAEPGRMCTKEWRPITVTA
EGVWDGRRVSYEHTFANNCFKNAAPTTVFEF
>Q9R645 ~~~~~~Subtilisin inhibitor-like protein 15~~~
SLYAPSAVVLSIGKGDASGPVTVLRATTLSCAPVPGGTHPAPEAACAELKAGFAGGGFGGLLASPDPDRACPQHFDPVTV
TLDGVWEGARTSWQHTFSNACVMGTTLDGGEAF
>P29606 ~~~~~~Subtilisin inhibitor-like protein 1~~~
SLYAPSAVVISKTQGASADAPAQRAVTLRCLPVGGDHPAPEKACAALREAGGDPAALPRYVEDTGRVCTREYRPVTVSVQ
GVWDGRRIDHAQTFSNSCELEKQTASVYAF
>P35706 ~~~sti2~~~Trypsin inhibitor STI2~~~
MRNTARWAATLALTATAVCGPLTGAALATPAAAPASLYAPSALVLTVGHGTSAAAASPLRAVTLNCAPTASGTHPAPALA
CADLRGVGGDIDALKARDGVICNKLYDPVVVTVDGVWQGKRVSYERTFGNECVKNSYGTSLFAF
>P29607 ~~~~~~Subtilisin inhibitor-like protein 2~~~
TAPASLYAPSALVLTIGQGESAAATSPLRAVTLTCAPKATGTHPAADAACAELRRAGGDFDALSAADGVMCTREYAPVVV
TVDGVWQGRRLSYERTFANECVKNAGSASVFTF
>P29608 ~~~~~~Subtilisin inhibitor-like protein 3~~~
YAPSALVLTVGHGESAATAAPLRAVTLTCAPTASGTHPAADAACAELRAAHGDPSALAADDAVMCTREYAPVVVTVDGVW
QGRRLSYERTFANECVKNAGSASVFTF
>P29609 ~~~~~~Subtilisin inhibitor-like protein 4~~~
APDAAPASLYAPSALVLTIGHGGAAATATPERAVTLTCAPTSSGTHPAASAACAELRGVGGDFAALKARDDVWCNKLYDP
VVVTAQGVWQGQRVSYERTFGNSCERDAVGGSLFAF
>Q9R643 ~~~~~~Subtilisin inhibitor-like protein 5~~~
HAPNALVLTVAKGETARTATPLRAVTLTCAPTPGGTHPAPEAACAELRAVDGRFSALRGDQDRACIKIYDPLVVTAEGVW
EGQRVRYERTFGNSCTLQTEAGPVFSF
>Q9R642 ~~~~~~Subtilisin inhibitor-like protein 7~~~
SLHAPSALVLTVGHGESAATAVPLRAVTLTCAPTASGTHPATVSACAELRGAGGDFDALAADAGVMCTREYAPVVVTVDG
VWQGRRLSYERTFANECVKNAGSSSVFTF
>P80388 ~~~sil8~~~Subtilisin inhibitor-like protein 8~~~
SLYAPSAMVFSVAQGDDVAAPTVVRATTVSCAPGARGTHPDPKAACAALKSTGGAFDRLLSEPNPDRACPMHYAPVTVSA
VGVWEGRRVAWDHTFANSCTMAATLDGNAVF
>O33702 ~~~~~~Kexstatin-1~~~
MRYITGAVALGAALVLGTLATTAQAAAPAQPARTGGLYAPTELVLTVGQGESRATATVQRAVTLSCMPGARGSHPNPLGA
CTQLRAVAGDFNAITAATSDRLCTKEWNPLVVTADGVWQGKRVSYSYTFANRCEMNIDSDTVFNF
>P83544 ~~~sti~~~Transglutaminase-activating metalloprotease inhibitor~~~
MRYITGGIALGSALILGSLVGAGATASATPAPAPAAQQSLYAPSALVLTVGQGDKAASAGVQRAVTLNCMPKPSGTHPDA
RGACDQLRAASGNFAEITKIKSGTACTKEWNPFVVTAEGVWEGQRVKYEHTFANPCEMKAGKGTVFEF
>P01007 ~~~~~~Plasminostreptin~~~
GLYAPSALVLTMGHGNSAATVNPERAVTLNCAPTASGTHPAALQACAELRGAGGDFDALTVRGDVACTKQFDPVVVTVDG
VWQGKRVSYERTFANECVKNSYGMTVFTF
>P01006 ~~~ssi~~~Subtilisin inhibitor~~~
MRNTGAGPSPSVSRPPPSAAPLSGAALAAPGDAPSALYAPSALVLTVGKGVSATTAAPERAVTLTCAPGPSGTHPAAGSA
CADLAAVGGDLNALTRGEDVMCPMVYDPVLLTVDGVWQGKRVSYERVFSNECEMNAHGSSVFAF
>P61152 ~~~sti1~~~Subtilase-type protease inhibitor~~~
MRNTARWAATLGLTATAVCGPLAGASLASPATAPASLYAPSALVLTVGHGESAATAAPLRAVTLTCAPTASGTHPAAAAA
CAELRAAHGDPSALAAEDSVMCTREYAPVVVTVDGVWQGRRLSYERTFANECVKNAGSASVFTF
>P80598 ~~~~~~Protease inhibitor SIL-V3~~~
YAPSALVLTIGQGATAAESGVQRAVTLTCTPKSSGTHPDAKGACTQLRAAGGDFDKVTRIKSDTVCTKEWNPTVVTAEGV
WDGRRISYEHTFANPCMAKAGKGLVFEF
>P28592 ~~~~~~Alkaline protease inhibitor 2C'~~~
YAPSALVLTVGKGVSAATVTPERAVTLTCAPGPSGTHPAADSACADLAAVGGDLDALTRSEGVMCPMIYDPVLLTVDGVW
QGERVSYERVFSNECEMNAHGSSVLAF
>P61153 ~~~sti1~~~Protease inhibitor~~~
MRNTARWAATLGLTATAVCGPLAGASLASPATAPASLYAPSALVLTVGHGESAATAAPLRAVTLTCAPTASGTHPAAAAA
CAELRAAHGDPSALAAEDSVMCTREYAPVVVTVDGVWQGRRLSYERTFANECVKNAGSASVFTF
>P80600 ~~~~~~Protease inhibitor SIL-V5~~~
YAPSALVLTIGQGDSAATAGVQRAVTLTCTPKAAGSHPNTSGACAQLRLSNGDFDKLVKIKDGTMCTREWNPSTVTAEGV
WEGRRVSFEHTFANPCEMKAGKGTVFEF
>P80596 ~~~~~~Protease inhibitor SIL-V1/SIL-V4~~~
SLFAPSALVLTVGEGESAADSGVQRAVTLTCTPKASGTHPAARAACDQLRAVDGDFKALVTTKSDRVCTKEYRPIVITAE
GVWDGHRVSYEHKFANPCMASDGKGVVFEF
>P80597 ~~~~~~Protease inhibitor SIL-V2~~~
SLYAPSALVLTIGQGDSASAGIQRAVTLSCMPTPSGTHPDARDACAQLRQADGKFDELTATKAGTYCTKEWNPVTVTATG
VWEGQRVNYSHTFGNPCMAKAAKSTVFSF
>O06871 ~~~vsi~~~Subtilisin inhibitor~~~
MRRTLKAVGAAAAAATCVLAATAGTAQAEAPKAESLYAPSALVLTVGQGENAESAAVERAVTLTCAPRPGGTHPSAAAAC
AELAKVNGQFARLVGASSDAICTKEWRPVTVSVVGAWNGKHVNWTSTFANQCTMKAGLGEGAALTF
>Q2G2X7 ~~~~~~Staphylococcal superantigen-like 10~~~
MKFTALAKATLALGILTTGTLTTEVHSGHAKQNQKSVNKHDKEALYRYYTGKTMEMKNISALKHGKNNLRFKFRGIKIQV
LLPGNDKSKFQQRSYEGLDVFFVQEKRDKHDIFYTVGGVIQNNKTSGVVSAPILNISKEKGEDAFVKGYPYYIKKEKITL
KELDYKLRKHLIEKYGLYKTISKDGRVKISLKDGSFYNLDLRSKLKFKYMGEVIESKQIKDIEVNLK
>A0A0H3K7M3 ~~~~~~Superantigen-like protein 13~~~
MNNNITKKIILSTTLLLLGTASTQFPNTPINSSSEAKAYYINQNETNVNELTKYYSQKYLTFSNSTLWQKDNGTIHATLL
QFSWYSHIQVYGPESWGNINQLRNKSVDIFGIKDQETIDSFALSQETFTGGVTPAATSNDKHYKLNVTYKDKAETFTGGF
PVYEGNKPVLTLKELDFRIRQTLIKSKKLYNNSYNKGQIKITGADNNYTIDLSKRLPSTDANRYVKKPQNAKIEVILEKS
N
>Q2G0X9 3.4.21.-~~~ssl1~~~Staphylococcal superantigen-like 1~~~
MKFKAIAKASLALGMLATGVITSNVQSVQAKAEVKQQSESELKHYYNKPILERKNVTGFKYTDEGKHYLEVTVGQQHSRI
TLLGSDKDKFKDGENSNIDVFILREGDSRQATNYSIGGVTKSNSVQYIDYINTPILEIKKDNEDVLKDFYYISKEDISLK
ELDYRLRERAIKQHGLYSNGLKQGQITITMNDGTTHTIDLSQKLEKERMGESIDGTKINKILVEMK
>A0A0H3KAV3 3.4.21.-~~~ssl1~~~Staphylococcal superantigen-like 1~~~
MKFKAIAKASLALGMLATGVITSNVQSVQAKAEVKQQSESELKHYYNKPILERKNVTGFKYTDEGKHYLEVTVGQQHSRI
TLLGSDKDKFKDGENSNIDVFILREGDSRQATNYSIGGVTKSNSVQYIDYINTPILEIKKDNEDVLKDFYYISKEDISLK
ELDYRLRERAIKQHGLYSNGLKQGQITITMNDGTTHTIDLSQKLEKERMGESIDGTKINKILVEMK
>Q2G0X7 ~~~ssl3~~~Staphylococcal superantigen-like 3~~~
MKMRTIAKTSLALGLLTTGAITVTTQSVKAEKIQSTKVDKVPTLKAERLAMINITAGANSATTQAANTRQERTPKLEKAP
NTNEEKTSASKIEKISQPKQEEQKTLNISATPAPKQEQSQTTTESTTPKTKVTTPPSTNTPQPMQSTKSDTPQSPTIKQA
QTDMTPKYEDLRAYYTKPSFEFEKQFGFMLKPWTTVRFMNVIPNRFIYKIALVGKDEKKYKDGPYDNIDVFIVLEDNKYQ
LKKYSVGGITKTNSKKVNHKVELSITKKDNQGMISRDVSEYMITKEEISLKELDFKLRKQLIEKHNLYGNMGSGTIVIKM
KNGGKYTFELHKKLQEHRMADVIDGTNIDNIEVNIK
>Q2G1S8 ~~~ssl4~~~Staphylococcal superantigen-like 4~~~
MKITTIAKTSLALGLLTTGVITTTTQAANATTLSSTKVEAPQSTPPSTKIEAPQSKPNATTPPSTKVEAPQQTANATTPP
STKVTTPPSTNTPQPMQSTKSDTPQSPTTKQVPTEINPKFKDLRAYYTKPSLEFKNEIGIILKKWTTIRFMNVVPDYFIY
KIALVGKDDKKYGEGVHRNVDVFVVLEENNYNLEKYSVGGITKSNSKKVDHKAGVRITKEDNKGTISHDVSEFKITKEQI
SLKELDFKLRKQLIEKNNLYGNVGSGKIVIKMKNGGKYTFELHKKLQENRMADVIDGTNIDNIEVNIK
>A0A0H3K6A3 ~~~ssl4~~~Staphylococcal superantigen-like 4~~~
MKITTIAKTSLALGLLTTGVITTTTQAANATTPSSTKVEAPQSTPPSTKIEAPQSKPNATTPPSTKVEAPQQTANATTPP
STKVTTPPSTNTPQPMQSTKSDTPQSPTTKQVPTEINPKFKDLRAYYTKPSLEFKNEIGIILKKWTTIRFMNVVPDYFIY
KIALVGKDDKKYGEGVHRNVDVFVVLEENNYNLEKYSVGGITKSNSKKVDHKAGVRITKEDNKGTISHDVSEFKITKEQI
SLKELDFKLRKQLIEKNNLYGNVGSGKIVIKMKNGGKYTFELHKKLQENRMADVIDGTNIDNIEVNIK
>Q2G1S6 ~~~ssl5~~~Staphylococcal superantigen-like 5~~~
MKMTAIAKASLALGILATGTITSLHQTVNASEHKAKYENVTKDIFDLRDYYSGASKELKNVTGYRYSKGGKHYLIFDKNR
KFTRVQIFGKDIERFKARKNPGLDIFVVKEAENRNGTVFSYGGVTKKNQDAYYDYINAPRFQIKRDEGDGIATYGRVHYI
YKEEISLKELDFKLRQYLIQNFDLYKKFPKDSKIKVIMKDGGYYTFELNKKLQTNRMSDVIDGRNIEKIEANIR
>A0A0H3K6Z6 ~~~ssl5~~~Staphylococcal superantigen-like 5~~~
MKMTAIAKASLALGILATGTITSLHQTVNASEHKAKYENVTKDIFDLRDYYSGASKELKNVTGYRYSKGGKHYLIFDKNR
KFTRVQIFGKDIERFKARKNPGLDIFVVKEAENRNGTVFSYGGVTKKNQDAYYDYINAPRFQIKRDEGDGIATYGRVHYI
YKEEISLKELDFKLRQYLIQNFDLYKKFPKDSKIKVIMKDGGYYTFELNKKLQTNRMSDVIDGRNIEKIEANIR
>Q2G2Y0 ~~~ssl7~~~Staphylococcal superantigen-like 7~~~
MKLKTLAKATLALGLLTTGVITSEGQAVQAKEKQERVQHLYDIKDLHRYYSSESFEFSNISGKVENYNGSNVVRFNQENQ
NHQLFLLGKDKEKYKEGIEGKDVFVVKELIDPNGRLSTVGGVTKKNNKSSETNTHLFVNKVYGGNLDASIDSFSINKEEV
SLKELDFKIRQHLVKNYGLYKGTTKYGKITINLKDGEKQEIDLGDKLQFERMGDVLNSKDINKIEVTLKQI
>A0A0H3K6Z8 ~~~ssl7~~~Staphylococcal superantigen-like 7~~~
MKLKTLAKATLALGLLTTGVITSEGQAVQAKEKQERVQHLYDIKDLHRYYSSESFEFSNISGKVENYNGSNVVRFNQENQ
NHQLFLLGKDKEKYKEGIEGKDVFVVKELIDPNGRLSTVGGVTKKNNKSSETNTHLFVNKVYGGNLDASIDSFSINKEEV
SLKELDFKIRQHLVKNYGLYKGTTKYGKITINLKDGEKQEIDLGDKLQFERMGDVLNSKDINKIEVTLKQI
>Q46812 3.-.-.-~~~ssnA~~~Putative aminohydrolase SsnA~~~COG0402
MLILKNVTAVQLHPAKVQEGVDIAIENDVIVAIGDALTQRYPDASFKEMHGRIVMPGIVCSHNHFYSGLSRGIMANIAPC
PDFISTLKNLWWRLDRALDEESLYYSGLICSLEAIKSGCTSVIDHHASPAYIGGSLSTLRDAFLKVGLRAMTCFETTDRN
NGIKELQEGVEENIRFARLIDEAKKATSEPYLVEAHIGAHAPFTVPDAGLEMLREAVKATGRGLHIHAAEDLYDVSYSHH
WYGKDLLARLAQFDLIDSKTLVAHGLYLSKDDITLLNQRDAFLVHNARSNMNNHVGYNHHLSDIRNLALGTDGIGSDMFE
EMKFAFFKHRDAGGPLWPDSFAKALTNGNELMSRNFGAKFGLLEAGYKADLTICDYNSPTPLLADNIAGHIAFGMGSGSV
HSVMVNGVMVYEDRQFNFDCDSIYAQARKAAASMWRRMDALA
>Q83AY0 ~~~sspA~~~Stringent starvation protein A homolog~~~COG0625
MLKRSIMTLYSGPLDIYSHQVRIVLAEKGVTVDIHNVDANHPSEDLIELNPYATLPTLVDRDLVLFESRVIMEYLDERFP
HPPLLPVYPVARSRCRLLMYRIERNFYHSMKIIEEGTPKQAETEREFLTKELIELDPVFGEKTYFMNDDFTLVDCVMAPL
LWRLPHLGVHVPPRAAKSMYKYKKLIFERESFKASLSESESELREVDGI
>P0ACA3 ~~~sspA~~~Stringent starvation protein A~~~COG0625
MAVAANKRSVMTLFSGPTDIYSHQVRIVLAEKGVSFEIEHVEKDNPPQDLIDLNPNQSVPTLVDRELTLWESRIIMEYLD
ERFPHPPLMPVYPVARGESRLYMHRIEKDWYTLMNTIINGSASEADAARKQLREELLAIAPVFGQKPYFLSDEFSLVDCY
LAPLLWRLPQLGIEFSGPGAKELKGYMTRVFERDSFLASLTEAEREMRLGRS
>P45207 ~~~sspA~~~Stringent starvation protein A homolog~~~COG0625
MSSASSKRSVMTLFSNKDDIYCHQVKIVLAEKGVLYENAEVDLQALPEDLMELNPYGTVPTLVDRDLVLFNSRIIMEYLD
ERFPHPPLMQVYPVSRAKDRLLMLRIEQDWYPTLAKAENGTEKEKTSALKQLKEELLGIAPIFQQMPYFMNEEFGLVDCY
VAPLLWKLKHLGVEFTGTGSKAIKAYMERVFTRDSFLQSVGEAAPKNLMDDK
>Q2FZL2 3.4.21.19~~~sspA~~~Glutamyl endopeptidase~~~COG3591
MKGKFLKVSSLFVATLTTATLVSSPAANALSSKAMDNHPQQTQSSKQQTPKIQKGGNLKPLEQREHANVILPNNDRHQIT
DTTNGHYAPVTYIQVEAPTGTFIASGVVVGKDTLLTNKHVVDATHGDPHALKAFPSAINQDNYPNGGFTAEQITKYSGEG
DLAIVKFSPNEQNKHIGEVVKPATMSNNAETQVNQNITVTGYPGDKPVATMWESKGKITYLKGEAMQYDLSTTGGNSGSP
VFNEKNEVIGIHWGGVPNEFNGAVFINENVRNFLKQNIEDIHFANDDQPNNPDNPDNPNNPDNPNNPDEPNNPDNPNNPD
NPDNGDNNNSDNPDAA
>Q99V45 3.4.21.19~~~sspA~~~Glutamyl endopeptidase~~~
MKGKFLKVSSLFVATLTTATLVSSPAANALSSKAMDNHPQQTQSSKQQTPKIKKGGNLKPLEQREHANVILPNNDRHQIT
DTTNGHYAPVTYIQVEAPTGTFIASGVVVGKDTLLTNKHVVDATHGDPHALKAFPSAINQDNYPNGGFTAEQITKYSGEG
DLAIVKFSPNEQNKHIGEVVKPATMSNNAETQVNQNITVTGYPGDKPVATMWESKGKITYLKGEAMQYDLSTTGGNSGSP
VFNEKNEVIGIHWGGVPNEFNGAVFINENVRNFLKQNIEDIHFANDDQPNNPDNPDNPNNPDNPNNPDNPNNPDEPNNPD
NPNNPDNPDNGDNNNSDNPDAA
>P0C1U8 3.4.21.19~~~sspA~~~Glutamyl endopeptidase~~~
MKGKFLKVSSLFVATLTTATLVSSPAANALSSKAMDNHPQQTQSSKQQTPKIQKGGNLKPLEQREHANVILPNNDRHQIT
DTTNGHYAPVTYIQVEAPTGTFIASGVVVGKDTLLTNKHVVDATHGDPHALKAFPSAINQDNYPNGGFTAEQITKYSGEG
DLAIVKFSPNEQNKHIGEVVKPATMSNNAETQVNQNITVTGYPGDKPVATMWESKGKITYLKGEAMQYDLSTTGGNSGSP
VFNEKNEVIGIHWGGVPNEFNGAVFINENVRNFLKQNIEDIHFANDDQPNNPDNPDNPNNPDNPNNPDEPNNPDNPNNPD
NPDNGDNNNSDNPDAA
>A8AUS0 ~~~sspA~~~Streptococcal surface protein A~~~COG3064
MNKRKEVFGFRKSKVAKTLCGAVLGAALIAIADQQVLADEVTETNSTANVAVTTTGNPATNLPEAQGEATEAASQSQAQA
GSKEGALPVEVSADDLNQAVTDAKAAGVNVVQDQTSDKGTATTAAENAQKQAEIKSDYAKQAEEIKKTTEAYKKEVEAHQ
AETDKINAENKAAEDKYQEDLKAHQAEVEKINTANATAKAEYEAKLAQYQKDLAAVQKANEDSQLDYQNKLSAYQAELAR
VQKANAEAKEAYEKAVKENTAKNAALQAENEAIKQRNETAKANYDAAMKQYEADLAAIKKAKEDNDADYQAKLAAYQAEL
ARVQKANADAKAAYEKAVEENTAKNTAIQAENEAIKQRNAAAKATYEAALKQYEADLAAAKKANEDSDADYQAKLAAYQT
ELARVQKANADAKAAYEKAVEDNKAKNAALQAENEEIKQRNAAAKTDYEAKLAKYEADLAKYKKELAEYPAKLKAYEDEQ
AQIKAALVELEKNKNQDGYLSKPSAQSLVYDSEPNAQLSLTTNGKMLKASAVDEAFSHDTAQYSKKILQPDNLNVSYLQQ
ADDVTSSMELYGNFGDKAGWTTTVGNNTEVKFASVLLERGQSVTATYTNLEKSYYNGKKISKVVFKYTLDSDSKFKNVDK
AWLGVFTDPTLGVFASAYTGQEEKDTSIFIKNEFTFYDENDQPINFDNALLSVASLNRENNSIEMAKDYSGTFVKISGSS
VGEKDGKIYATETLNFKQGQGGSRWTMYKNSQPGSGWDSSDAPNSWYGAGAISMSGPTNHVTVGAISATQVVPSDPVMAV
ATGKRPNIWYSLNGKIRAVNVPKITKEKPTPPVAPTEPQAPTYEVEKPLEPAPVAPTYENEPTPPVKTPDQPEPSKPEEP
TYETEKPLEPAPVVPTYENEPTPPVKTPDQPEPSKPEEPTYETEKPLEPAPVAPTYENEPTPPVKTPDQPEPSKPEEPTY
DPLPTPPVAPTPKQLPTPPVVPTVHFHYSSLLAQPQINKEIKNEDGVDIDRTLVAKQSIVKFELKTEALTAGRPKTTSFV
LVDPLPTGYKFDLDATKAASTGFDTTYDEASHTVTFKATDETLATYNADLTKPVETLHPTVVGRVLNDGATYTNNFTLTV
NDAYGIKSNVVRVTTPGKPNDPDNPNNNYIKPTKVNKNKEGLNIDGKEVLAGSTNYYELTWDLDQYKGDKSSKEAIQNGF
YYVDDYPEEALDVRPDLVKVADEKGNQVSGVSVQQYDSLEAAPKKVQDLLKKANITVKGAFQLFSADNPEEFYKQYVSTG
TSLVITDPMTVKSEFGKTGGKYENKAYQIDFGNGYATEVVVNNVPKITPKKDVTVSLDPTSENLDGQTVQLYQTFNYRLI
GGLIPQNHSEELEDYSFVDDYDQAGDQYTGNYKTFSSLNLTMKDGSVIKAGTDLTSQTTAETDAANGIVTVRFKEDFLQK
ISLDSPFQAETYLQMRRIAIGTFENTYVNTVNKVAYASNTVRTTTPIPRTPDKPTPIPTPKPKDPDKPETPKEPKVPSPK
VEDPSAPIPVSVGKELTTLPKTGTNDSSYMPYLGLAALVGVLGLGQLKRKEDESN
>P0AFZ3 ~~~sspB~~~Stringent starvation protein B~~~COG2969
MDLSQLTPRRPYLLRAFYEWLLDNQLTPHLVVDVTLPGVQVPMEYARDGQIVLNIAPRAVGNLELANDEVRFNARFGGIP
RQVSVPLAAVLAIYARENGAGTMFEPEAAYDEDTSIMNDEEASADNETVMSVIDGDKPDHDDDTHPDDEPPQPPRGGRPA
LRVVK
>P45206 ~~~sspB~~~Stringent starvation protein B homolog~~~COG2969
MEYKSSPKRPYLLRAYYDWLVDNSFTPYLVVDATYLGVNVPVEYVKDGQIVLNLSASATGNLQLTNDFIQFNARFKGVSR
ELYIPMGAALAIYARENGDGVMFEPEEIYDELNIEPDTEQPTGFYEAVDKPKKREEKKKTKSVSHLRIVD
>Q2FZL3 3.4.22.-~~~sspB~~~Staphopain B~~~
MNSSCKSRVFNIISIIMVSMLILSLGAFANNNKAKADSHSKQLEINVKSDKVPQKVKDLAQQQFAGYAKALDKQSNAKTG
KYELGEAFKIYKFNGEEDNSYYYPVIKDGKIVYTLTLSPKNKDDLNKSKEDMNYSVKISNFIAKDLDQIKDKNSNITVLT
DEKGFYFEEDGKVRLVKATPLPGNVKEKESAKTVSAKLKQELKNTVTPTKVEENEAIQEDQVQYENTLKNFKIREQQFDN
SWCAGFSMAALLNATKNTDTYNAHDIMRTLYPEVSEQDLPNCATFPNQMIEYGKSQGRDIHYQEGVPSYEQVDQLTKDNV
GIMILAQSVSQNPNDPHLGHALAVVGNAKINDQEKLIYWNPWDTELSIQDADSSLLHLSFNRDYNWYGSMIGY
>P0C1S6 3.4.22.-~~~sspB~~~Staphopain B~~~
MNSSCKTRVFNIISIIMVSMLILSLGAFANNNKAKADSHSKQLEINVKSDKVPQKVKDLAQQQFAGYAKALDKQSNAKTG
KYELGEAFKIYKFNGEEDNSYYYPVIKDGKIVYTLTLSPKNKDDLNKSKEDMNYSVKISNFIAKDLDQIKDKNSNITVLT
DEKGFYFEEDGKVRLVKATPLANNIKEKESAKTVSPQLKQELKTTVTPTKVEENEAIQEDQVQYENTLKNFKIREQQFDN
SWCAGFSMAALLNATKNTDTYNAHDIMRTLYPEVSEQDLPNCATFPNQMIEYGKSQGRDIHYQEGVPSYNQVDQLTKDNV
GIMILAQSVSQNPNDPHLGHALAVVGNAKINDQEKLIYWNPWDTELSIQDADSSLLHLSFNRDYNWYGSMIGY
>A8AUS1 ~~~sspB~~~Streptococcal surface protein B~~~COG3064
MQKREVFGFRKSKVAKTLCGAVLGAALIAIADQQVLADEVTETNSTANVAVTTTGNPATNLPEAQGEATEAASQSQAQAG
SKDGALPVEVSADDLNKAVTDAKAAGVNVVQDQTSDKGTATTAAENAQKQAEIKSDYAKQAEEIKKTTEAYKKEVEAHQA
ETDKINAENKAAEDKYQEDLKAHQAEVEKINTANATAKAEYEAKLAQYQKDLAAVQKANEDSQLDYQNKLSAYQAELARV
QKANAEAKEAYEKAVKENTAKNAALQAENEAIKQRNETAKANYDAAMKQYEADLAAIKKAKEDNDADYQAKLAAYQAELA
RVQKANADAKAAYEKAVEENTAKNTAIQAENEAIKQRNETAKATYEAAVKQYEADLAAVKQANATNEADYQAKLAAYQTE
LARVQKANADAKATYEKAVEDNKAKNAALQAENEEIKQRNAAAKTDYEAKLAKYEADLAKYKKDFAAYTAALAEAESKKK
QDGYLSEPRSQSLNFKSEPNAIRTIDSSVHQYGQQELDALVKSWGISPTNPDRKKSTAYSYFNAINSNNTYAKLVLEKDK
PVDVTYTGLKNSSFNGKKISKVVYTYTLKETGFNDGTKMTMFASSDPTVTAWYNDYFTSTNINVKVKFYDEEGQLMNLTG
GLVNFSSLNRGNGSGAIDKDAIESVRNFNGRYIPISGSSIKIHENNSAYADSSNAEKSRGARWDTSEWDTTSSPNNWYGA
IVGEITQSEISFNMASSKSGNIWFAFNSNINAIGVPTKPVAPTAPTQPMYETEKPLEPAPVVPTYENEPTPPVKTPDQPE
PSKPEEPTYETEKPLEPAPVAPTYENEPTPPVKIPDQPEPSKPEEPTYETEKPLEPAPVAPTYENEPTPPVKTPDQPEPS
KPEEPTYDPLPTPPLAPTPKQLPTPPVVPTVHFHYSSLLAQPQINKEIKNEDGVDIDRTLVAKQSIVKFELKTEALTAGR
PKTTSFVLVDPLPTGYKFDLDATKAASTGFDTTYDEASHTVTFKATDETLATYNADLTKPVETLHPTVVGRVLNDGATYT
NNFTLTVNDAYGIKSNVVRVTTPGKPNDPDNPNNNYIKPTKVNKNKEGLNIDGKEVLAGSTNYYELTWDLDQYKGDKSSK
EAIQNGFYYVDDYPEEALDVRPDLVKVADEKGNQVSGVSVQQYDSLEAAPKKVQDLLKKANITVKGAFQLFSADNPEEFY
KQYVSTGTSLVITDPMTVKSEFGKTGGKYENKAYQIDFGNGYATEVVVNNVPKITPKKDVTVSLDPTSENLDGQTVQLYQ
TFNYRLIGGLIPQNHSEELEDYSFVDDYDQAGDQYTGNYKTFSSLNLTMKDGSVIKAGTDLTSQTTAETDATNGIVTVRF
KEDFLQKISLDSPFQAETYLQMRRIAIGTFENTYVNTVNKVAYASNTVRTTTPIPRTPDKPTPIPTPKPKDPDKPETPKE
PKVPSPKVEDPSAPIPVSVGKELTTLPKTGTNDATYMPYLGLAALVGFLGLGLAKRKED
>P16952 ~~~sspB~~~Agglutinin receptor~~~
MKNKKEVYGFRKSKVAKTLCGAVLGTALIAFADKAVFADEVTETTSTSTVEVATTGNPATNLPEAQGEMSQVAKESQAKA
GSKESALPVEVSSADLDKAVADAKSAGVKVVQDETKDKGTATTATDNAQKQDEIKSDYAKQAEEIKTTTEAYKKEVAAHQ
AETDKINAENKAADDKYQKDLKSHQEEVEKINTANATAKAEYEAKLAQYQKDLATVKKANEDSQQDYQNKLSAYQTELAR
VQKANAEAKEAYEKAVKENTAKNEALKVENEAIKQRNETAKATYEAAMKQYEADLAAIKKANEDNDADYQAKLAAYQTEL
ARVQKANAEAKEAYDKAVKENTAKNTAIQAENEAIKQRNETAKATYDAAVKKYEADLAAVKQANATNEADYQAKLAAYQT
ELARVQKANADAKATYEKAVEDNKAKNAAIKAENEEIKQRNAVAKTDYEAKLAKYEADLAKYKKEFAAYTAALAEAESKK
KQDGYLSEPRSQSLNFKSEPNAIRTIDSSVHQYGQQELDALVKSWGISPTNPDRKKSRAYSYFNAINSNNTYAKLVLEKD
KPVDVTYTGLKNSSFNGKKISKVVYTYTLKETGFNDGTKMTMFASSDPTVTAWYNDYFTSTNINVKVKFYDEEGQLMNLT
GGLVNFSSLNRGNGSGAIDKDAIESVRNFNGRYIPISGSSIKIHENNSAYADSSNAEKSLGARWNTSEWDTTSSPNNWYG
AIVGEITQSEISFNMASSKSGNIWFAFNSNINAIGVPTKPVAPTAPTQPMYETEKPLEPAPVAPSYENEPTPPVKTPDQP
EPSKPEEPTYETEKPLEPAPVAPSYENEPTPPVKTPDQPEPSKPEEPNYETEKPLEPAPVAPSYENEPTPPVKIPDQPEP
SKPEEPTYDPLPTPPLAPTPKQLPTPPVVPTVHFHYSSLLAQPQINKEIKNEDGVDIDRTLVAKQSIVKFELKTEALTAG
RPKTTSFVLVDPLPTGYKFDLDATKAASTGFDTTYDEASHTVTFKATDETLATYNADLTKPVETLHPTVVGRVLNDGATY
TNNFTLTVNDAYGIKSNVVRVTTPGKPNDPDNPNNNYIKPTKVNKNKEGLNIDGKEVLAGSTNYYELTWDLDQYKGDKSS
KEAIQNGFYYVDDYPEEALDVRPDLVKVADEKGNQVSGVSVQQYDSLEAAPKKVQDLLKKANITVKGAFQLFSADNPEEF
YKQYVATGTSLVITDPMTVKSEFGKTGGKYENKAYQIDFGNGYATEVVVNNVPKITPKKDVTVSLDPTSENLDGQTVQLY
QTFNYRLIGGLIPQNHSEELEDYSFVDDYDQAGDQYTGNYKTFSSLNLTMKDGSVIKAGTDLTSQTTAETDATNGIVTVR
FKEDFLQKISLDSPFQAETYLQMRRIAIGTFENTYVNTVNKVAYASNTVRTTTPIPRTPDKPTPIPTPKPKDPDKPETPK
EPKVPSPKVEDPSAPIPVSVGKELTTLPKTGTNDATYMPYLGLAALVGFLGLGLAKRKED
>P02958 ~~~sspC~~~Small, acid-soluble spore protein C~~~
MAQQSRSRSNNNNDLLIPQAASAIEQMKLEIASEFGVQLGAETTSRANGSVGGEITKRLVRLAQQNMGGQFH
>Q9EYW6 ~~~sspC~~~Staphostatin B~~~
MYQLQFINLVYDTTKLTHLEQTNINLFIGNWSNHQLQKSICIRHGDDTSHNQYHILFIDTAHQRIKFSSIDNEEIIYILD
YDDTQHILMQTSSKQGIGTSRPIVYERLV
>Q7A189 ~~~sspC~~~Staphostatin B~~~
MYQLQFINLVYDTTKLTHLEQTNINLFIGNWSNHQLQKSICIRHGDDTSHNQYHILFIDTAHQRIKFSSIDNEEIIYILD
YDDTQHILMQTSSKQGIGTSRPIVYERLV
>P04833 ~~~sspD~~~Small, acid-soluble spore protein D~~~
MASRNKLVVPGVEQALDQFKLEVAQEFGVNLGSDTVARANGSVGGEMTKRLVQQAQSQLNGTTK
>P07784 ~~~sspE~~~Small, acid-soluble spore protein gamma-type~~~
MANSNNFSKTNAQQVRKQNQQSAAGQGQFGTEFASETNAQQVRKQNQQSAGQQGQFGTEFASETDAQQVRQQNQSAEQNK
QQNS
>Q7WY59 ~~~sspG~~~Small, acid-soluble spore protein G~~~
MSENRHENEENRRDAAVAKVQNSGNAKVVVSVNTDQDQAQAQSQDGED
>D0ZVG2 2.3.2.27~~~sspH1~~~E3 ubiquitin-protein ligase SspH1~~~
MFNIRNTQPSVSMQAIAGAAAPEASPEEIVWEKIQVFFPQENYEEAQQCLAELCHPARGMLPDHISSQFARLKALTFPAW
EENIQCNRDGINQFCILDAGSKEILSITLDDAGNYTVNCQGYSEAHDFIMDTEPGEECTEFAEGASGTSLRPATTVSQKA
AEYDAVWSKWERDAPAGESPGRAAVVQEMRDCLNNGNPVLNVGASGLTTLPDRLPPHITTLVIPDNNLTSLPELPEGLRE
LEVSGNLQLTSLPSLPQGLQKLWAYNNWLASLPTLPPGLGDLAVSNNQLTSLPEMPPALRELRVSGNNLTSLPALPSGLQ
KLWAYNNRLTSLPEMSPGLQELDVSHNQLTRLPQSLTGLSSAARVYLDGNPLSVRTLQALRDIIGHSGIRIHFDMAGPSV
PREARALHLAVADWLTSAREGEAAQADRWQAFGLEDNAAAFSLVLDRLRETENFKKDAGFKAQISSWLTQLAEDAALRAK
TFAMATEATSTCEDRVTHALHQMNNVQLVHNAEKGEYDNNLQGLVSTGREMFRLATLEQIAREKAGTLALVDDVEVYLAF
QNKLKESLELTSVTSEMRFFDVSGVTVSDLQAAELQVKTAENSGFSKWILQWGPLHSVLERKVPERFNALREKQISDYED
TYRKLYDEVLKSSGLVDDTDAERTIGVSAMDSAKKEFLDGLRALVDEVLGSYLTARWRLN
>D0ZPH9 2.3.2.27~~~sspH2~~~E3 ubiquitin-protein ligase SspH2~~~
MPFHIGSGCLPATISNRRIYRIAWSDTPPEMSSWEKMKEFFCSTHQTEALECIWTICHPPAGTTREDVINRFELLRTLAY
AGWEESIHSGQHGENYFCILDEDSQEILSVTLDDAGNYTVNCQGYSETHRLTLDTAQGEEGTGHAEGASGTFRTSFLPAT
TAPQTPAEYDAVWSAWRRAAPAEESRGRAAVVQKMRACLNNGNAVLNVGESGLTTLPDCLPAHITTLVIPDNNLTSLPAL
PPELRTLEVSGNQLTSLPVLPPGLLELSIFSNPLTHLPALPSGLCKLWIFGNQLTSLPVLPPGLQELSVSDNQLASLPAL
PSELCKLWAYNNQLTSLPMLPSGLQELSVSDNQLASLPTLPSELYKLWAYNNRLTSLPALPSGLKELIVSGNRLTSLPVL
PSELKELMVSGNRLTSLPMLPSGLLSLSVYRNQLTRLPESLIHLSSETTVNLEGNPLSERTLQALREITSAPGYSGPIIR
FDMAGASAPRETRALHLAAADWLVPAREGEPAPADRWHMFGQEDNADAFSLFLDRLSETENFIKDAGFKAQISSWLAQLA
EDEALRANTFAMATEATSSCEDRVTFFLHQMKNVQLVHNAEKGQYDNDLAALVATGREMFRLGKLEQIAREKVRTLALVD
EIEVWLAYQNKLKKSLGLTSVTSEMRFFDVSGVTVTDLQDAELQVKAAEKSEFREWILQWGPLHRVLERKAPERVNALRE
KQISDYEETYRMLSDTELRPSGLVGNTDAERTIGARAMESAKKTFLDGLRPLVEEMLGSYLNVQWRRN
>P0CE12 2.3.2.27~~~sspH2~~~E3 ubiquitin-protein ligase SspH2~~~
MPFHIGSGCLPATISNRRIYRIAWSDTPPEMSSWEKMKEFFCSTHQTEALECIWTICHPPAGTTREDVINRFELLRTLAY
AGWEESIHSGQHGENYFCILDEDSQEILSVTLDDAGNYTVNCQGYSETHRLTLDTAQGEEGTGHAEGASGTFRTSFLPAT
TAPQTPAEYDAVWSAWRRAAPAEESRGRAAVVQKMRACLNNGNAVLNVGESGLTTLPDCLPAHITTLVIPDNNLTSLPAL
PPELRTLEVSGNQLTSLPVLPPGLLELSIFSNPLTHLPALPSGLCKLWIFGNQLTSLPVLPPGLQELSVSDNQLASLPAL
PSELCKLWAYNNQLTSLPMLPSGLQELSVSDNQLASLPTLPSELYKLWAYNNRLTSLPALPSGLKELIVSGNRLTSLPVL
PSELKELMVSGNRLTSLPMLPSGLLSLSVYRNQLTRLPESLIHLSSETTVNLEGNPLSERTLQALREITSAPGYSGPIIR
FDMAGASAPRETRALHLAAADWLVPAREGEPAPADRWHMFGQEDNADAFSLFLDRLSETENFIKDAGFKAQISSWLAQLA
EDEALRANTFAMATEATSSCEDRVTFFLHQMKNVQLVHNAEKGQYDNDLAALVATGREMFRLGKLEQIAREKVRTLALVD
EIEVWLAYQNKLKKSLGLTSVTSEMRFFDVSGVTVTDLQDAELQVKAAEKSEFREWILQWGPLHRVLERKAPERVNALRE
KQISDYEETYRMLSDTELRPSGLVGNTDAERTIGARAMESAKKTFLDGLRPLVEEMLGSYLNVQWRRN
>O31552 ~~~sspH~~~Small, acid-soluble spore protein H~~~
MNIQRAKEIVESPDMKKVTYNGVPIYIQHVNEETGTARIYPLDEPQEEHEVQLNSLKED
>P94537 ~~~sspI~~~Small, acid-soluble spore protein I~~~
MDLNLRHAVIANVTGNNQEQLEHTIVDAIQSGEEKMLPGLGVLFEVIWQHASESEKNEMLKTLEGGLKPAE
>Q7WY58 ~~~sspJ~~~Small, acid-soluble spore protein J~~~
MGFFNKDKGKRSEKEKNVIQGALEDAGSALKDDPLQEAVQKKKNNR
>Q7WY75 ~~~sspK~~~Small, acid-soluble spore protein K~~~
MVRNKEKGFPYENENKFQGEPRAKDDYASKRADGSINQHPQERMRASGKR
>Q7WY66 ~~~sspL~~~Small, acid-soluble spore protein L~~~
MKKKDKGRLTGGVTPQGDLEGNTHNDPKTELEERAKKSNTKR
>Q7WY65 ~~~sspM~~~Small, acid-soluble spore protein M~~~
MKTRPKKAGQQKKTESKAIDSLDKKLGGPNRPST
>Q7WY69 ~~~sspN~~~Small, acid-soluble spore protein N~~~
MGNNKKNGQPQYAPSHLGTKPVKYKANKGEKMHDTSGQRPIIMQTKGE
>P71031 ~~~sspO~~~Small, acid-soluble spore protein O~~~
MVKRKANHVINGMNDAKSQGKGAGYIENDQLVLTEAERQNNKKRKTNQ
>P71032 ~~~sspP~~~Small, acid-soluble spore protein P~~~
MTNKNTSKDMHKNAPKGHNPGQPEPLSGSKKVKNRNHTRQKHNSSHDM
>Q2G2R8 3.4.22.48~~~sspP~~~Staphopain A~~~
MKRNFPKLIALSLIFSLSVTPIANAESNSNIKAKDKKHVQVNVEDKSVPTDVRNLAQKDYLSYVTSLDKIYNKEKASYTL
GEPFKIYKFNKKSDGNYYFPVLNTEGNIDYIVTISPKITKYSSSSSKYTINVSPFLSKVLNQYKDQQITILTNSKGYYVV
TQNHKAKLVLKTPRLEDKKLKKTESIPTGNNVTQLKQKASVTMPTSQFKSNNYTYNEQYINKLENFKIRETQGNNGWCAG
YTMSELLNATYNTNKYHAEAVMRFLHPNLQGQRFQFTGLTPREMIYFGQTQGRSPQLLNRMTTYNEVDNLTKNNKGIAVL
GSRVESRNGMHAGHAMAVVGNAKLDNGQEVIIIWNPWDNGFMTQDAKNNVIPVSNGDHYRWYSSIYGY
>P81297 3.4.22.48~~~sspP~~~Staphopain A~~~
MKRNFPKLIALSLIFSLSITPIANAESNSNIKAKDKRHVQVNVEDKSVPTDVRNLAQKDYLSYVTSLDKIYNKEKASYTL
GEPFKIYKFNKKSDGNYYFPVLNTEGNIDYIVTISPKVTKDSSSSSKYTINVSSFLSKALNEYKDQQITILTNSKGYYVV
TQNHKAKLVLKTPRLEDKKAKKTESIPTGNNVTQLKQKASVTMPTSQFKSNNYTYNEQYVNKLENFKIRETQGNNGWCAG
YTMSALLNATYNTNKYHAEAVMRFLHPNLQGQQFQFTGLTPREMIYFEQTQGRSPQLLNRMTTYNEVDNLTKNNKGIAIL
GSRVESRNGMHAGHAMAVVGNAKLNNGQEVIIIWNPWDNGFMTQDAKNNVIPVSNGDHYQWYSSIYGY
>O66640 ~~~smpB~~~SsrA-binding protein~~~COG0691
MGKSDKIIPIAENKEAKAKYDILETYEAGIVLKGSEVKSLREKGTVSFKDSFVRIENGEAWLYNLYIAPYKHATIENHDP
LRKRKLLLHKREIMRLYGKVQEKGYTIIPLKLYWKNNKVKVLIALAKGKKLYDRRRELKEKAMKRELEREFKGKIHL
>O32230 ~~~smpB~~~SsrA-binding protein~~~COG0691
MPKGSGKVLSQNKKANHDYFIEETYETGIVLQGTEIKSIRAGRVNLKDSFAKIERGEVFLHNMHVSPYEQGNRYNHDPLR
TRKLLMHRKEINKLIGLTKEKGYSLVPLKLYLKNGFAKVLLGLGKGKKNYDKREDLKRKDAKREIERAFRDSQKGF
>B8H482 ~~~smpB~~~SsrA-binding protein~~~
MSKPIAENRRARFDYFIEETFEAGIMLTGTEVKSLRTGRANIAESYASVEGREIVLINADIPPYGHANRFNHEPRRHRKL
LLHRRQIDKLIGAVQREGRTLVPIKLYWNDKGLAKLEVGLAKGKKLHDKRDTAAERDWQRDKARLMKGDRGD
>P0A832 ~~~smpB~~~SsrA-binding protein~~~COG0691
MTKKKAHKPGSATIALNKRARHEYFIEEEFEAGLALQGWEVKSLRAGKANISDSYVLLRDGEAFLFGANITPMAVASTHV
VCDPTRTRKLLLNQRELDSLYGRVNREGYTVVALSLYWKNAWCKVKIGVAKGKKQHDKRSDIKEREWQVDKARIMKNAHR
>O25985 ~~~smpB~~~SsrA-binding protein~~~COG0691
MKLIASNKKAYFDYEILETLEAGLALLGSEVKALRQTRVNLKDNFVKIIKGEAFLFGVHISYLDTIHAYYKPNERRERKL
LLHKKQLLKWQIEASKERLSIVGLKLYFNQRNRAKIQIALVKGKRLHDKRQSLKEKALNKEILADLKHHFKG
>A0QU63 ~~~smpB~~~SsrA-binding protein~~~COG0691
MTKKSASSNNKVVATNRKARHNYTILDTYEAGIVLMGTEVKSLREGQASLADAFATVDDGEIWLRNVHIAEYHHGTWTNH
APRRNRKLLLHRKQIDNLIGKIRDGNLTLVPLSIYFTDGKVKVELALARGKQAHDKRQDLARRDAQREVIRELGRRAKGK
I
>P9WGD3 ~~~smpB~~~SsrA-binding protein~~~COG0691
MSKSSRGGRQIVASNRKARHNYSIIEVFEAGVALQGTEVKSLREGQASLADSFATIDDGEVWLRNAHIPEYRHGSWTNHE
PRRNRKLLLHRRQIDTLVGKIREGNFALVPLSLYFAEGKVKVELALARGKQARDKRQDMARRDAQREVLRELGRRAKGMT
>P56944 ~~~smpB~~~SsrA-binding protein~~~COG0691
MVKVVATNKKAYTDYEILETYEAGIVLTGTEVKSLRNGSVNFKDSFCRFKNGELYLLNLHIPPYSHGGVYNHDPERPRKL
LLHKRELKRLMGKVQEEGVTIVPLKIYFNDRGIAKVEIAVARGKKKYDKREAIKKREMERKIREYMKYSR
>Q8RR57 ~~~smpB~~~SsrA-binding protein~~~COG0691
MAPVLENRRARHDYEILETYEAGIALKGTEVKSLRAGKVDFTGSFARFEDGELYLENLYIAPYEKGSYANVDPRRKRKLL
LHKHELRRLLGKVEQKGLTLVPLKIYFNERGYAKVLLGLARGKKAYEKRREDKKEAVRRALEEL
>P0AGE4 ~~~sstT~~~Serine/threonine transporter SstT~~~COG3633
MTTQRSPGLFRRLAHGSLVKQILVGLVLGILLAWISKPAAEAVGLLGTLFVGALKAVAPILVLMLVMASIANHQHGQKTN
IRPILFLYLLGTFSAALAAVVFSFAFPSTLHLSSSAGDISPPSGIVEVMRGLVMSMVSNPIDALLKGNYIGILVWAIGLG
FALRHGNETTKNLVNDMSNAVTFMVKLVIRFAPIGIFGLVSSTLATTGFSTLWGYAQLLVVLVGCMLLVALVVNPLLVWW
KIRRNPFPLVLLCLRESGVYAFFTRSSAANIPVNMALCEKLNLDRDTYSVSIPLGATINMAGAAITITVLTLAAVNTLGI
PVDLPTALLLSVVASLCACGASGVAGGSLLLIPLACNMFGISNDIAMQVVAVGFIIGVLQDSCETALNSSTDVLFTAAAC
QAEDDRLANSALRN
>H8L374 2.3.1.-~~~~~~Serine O-succinyltransferase~~~COG2021
MSDARRYHALPSPFPMKRGGQLQGARLAYETWGRLSPGCDNAVLILTGLSPSAHAASHPDGDTSPGWWEGMLGPGKAIDT
DRWYVICVNSLGSDKGSTGPASPDPATGAPYRLSFPELALEDVASAAHDLVKALGIARLACLIGCSMGGMSALAYMLQYE
GEVEAHISVDTAPQAQPFAIAIRSLQREAIRLDPNWQDGHYTDAAYPETGMAMARKLGVITYRSAMEWNGRFARIRLDGD
QRPDEPFAREFQVESYLEHHAQRFVRRFDPACYLYLTRASDWFDVAEYGEGSVMQGLARIHVRRALVIGVSTDILFPLEQ
QQQIAEGLQAAGAEVDFVALDSPQGHDAFLVDIENYSAAIGGFLRTL
>S2KHP1 2.3.1.-~~~~~~Serine O-succinyltransferase~~~COG2021
MPDARRFIELPGPVRMYRGGELPSVTIAYETWGELRGQGDNALLLFTGLSPSAHAASSMADPSPGWWEYMIGPGKPIDTE
RFFVIAINSLGSCFGSTGPASINPATGQPYRLDFPKLSVEDIVAAARGACRALGIDHVHTVAGASLGGMDALAYAVMYPG
TYRDIISISAAAHATPFTIALRSIQREAVRADPAWAGGNYAPGEGPKDGMRVARQLGILTYRSAEEWLQRFDRERLEGSD
DSANPFAMAFQVQSYMEANARKFADRFDANCYLYLSQAMDLFDMAEHGDGSLEAAVRRIDAKRALVAGVTTDWLFPLWQQ
RQVAELLEHAGVAVSYHELGSIQGHDAFLVDSERFAPMVAEFLAHSS
>A0A0I9RJ56 2.3.1.-~~~~~~Serine O-succinyltransferase~~~COG2021
MTEFIPPGTRFHALPSPFPFKRGGALHGARVAYETWGTLAADASNAILIVTGLSPDAHAAANDANPAAGWWEGMVGPGKA
IDTDRWFVVCVNSLGSCRGSTGPASLNPATGQPYRLDFPELSIEDGARAAIEVVRAQGIEQLACVVGNSMGGMTALAVLM
LHPGIARSHVNISGSAQALPFSIAIRSLQREAIRLDPRWNGGHYDDDAYPESGMRMARKLGVITYRSALEWDGRFGRVRL
DSDQTDDDPFGLEFQVESYLEGHARRFVRFFDPNCYLYLSRSMDWFDLAEYADGDVLAGLAKIRVEKALAIGANTDILFP
VQQQQQVADGLRAGGADARFIGLESPQGHDAFLVDFERFCPAVRGFLDAL
>Q8P8L2 2.3.1.-~~~~~~Serine O-succinyltransferase~~~COG2021
MTEFIPTGTRFHALPSPLPMKRGGVLHQARVAYETWGTLDADHGNAVLIVTGLSPNAHAAANADNPEPGWWEAMVGPGKP
IDTDRWFVVCVNSLGSCKGSTGPASIDPATGAPYRLSFPELSIEDVADAAADVVRALGIAQLACLIGNSMGGMTALALLL
RHPGIARSHINISGSAQALPFSIAIRSLQREAIRLDPHWNGGHYDDVQYPESGMRMARKLGVITYRSALEWDGRFGRVRL
DSELTAEDPFGLEFQVESYLEGHARRFVRFFDPNCYLYLSRSMDWFDLAEYAPDTRADAAAPESGVLAGLAQIRIARALA
IGANTDILFPVQQQEQIAEGLRAGGADAQFLGLDSPQGHDAFLVDFARFGPAVRAFLADC
>P40400 ~~~ssuA~~~Putative aliphatic sulfonates-binding protein~~~COG0715
MKKGLIVLVAVIFLLAGCGANGASGDHKQLKEIRIGIQQSLSPLLIAKEKGWFEDAFEKEGIKVKWVEFQSGPPQFEGLA
ADKLDFSQVGNSPVIAGQAAGIDFKEIGLSQDGLKANGILVNQNSGIQDVKGLKGKKIAVAKGSSGFDFLYKALDQVGLS
ANDVTIIQLQPDEAASAFENGSVDAWSIWEPYLSLETMKHGAKILVNGESTDLYSPGFTLVRTKFSEEHPDEVVRFLKVF
NKAVVWQKEHLDEAADLYSDIKDLDKKVVENVLKNTEPLNEIISDDIVKAQQETADFQFRTKAIDKKIDVKDVVDNTFIK
KALEEHSSGGDQ
>P75853 ~~~ssuA~~~Putative aliphatic sulfonates-binding protein~~~COG0715
MRNIIKLALAGLLSVSTFAVAAESSPEALRIGYQKGSIGMVLAKSHQLLEKRYPESKISWVEFPAGPQMLEALNVGSIDL
GSTGDIPPIFAQAAGADLVYVGVEPPKPKAEVILVAENSPIKTVADLKGHKVAFQKGSSSHNLLLRALRQAGLKFTDIQP
TYLTPADARAAFQQGNVDAWAIWDPYYSAALLQGGVRVLKDGTDLNQTGSFYLAARPYAEKNGAFIQGVLATFSEADALT
RSQREQSIALLAKTMGLPAPVIASYLDHRPPTTIKPVNAEVAALQQQTADLFYENRLVPKKVDIRQRIWQPTQLEGKQL
>P97027 7.6.2.14~~~ssuB~~~Aliphatic sulfonates import ATP-binding protein SsuB~~~COG1116
MAVTISIKEKAFVQEGRKNTVLENIELSIAPGEFLTLIGPSGCGKSTLLKIIAGLDSEYDGSVEINGRSVTAPGIQQGFI
FQEHRLFPWLTVEQNIAADLNLKDPKVKQKVDELIEIVRLKGSEKAYPRELSGGMSQRVAIARALLREPEVLLLDEPFGA
LDAFTRKHLQDVLLDIWRKKKTTMILVTHDIDESVYLGNELAILKAKPGKIHKLMPIHLAYPRNRTTPDFQAIRQRVLSE
FEKTEDLEYAEGSGI
>Q8NR42 7.6.2.14~~~ssuB~~~Aliphatic sulfonates import ATP-binding protein SsuB~~~COG1116
MTATLSLKPAATVRGLRKSYGTKEVLQGIDLTINCGEVTALIGRSGSGKSTILRVLAGLSKEHSGSVEISGNPAVAFQEP
RLLPWKTVLDNVTFGLNRTDISWSEAQERASALLAEVKLPDSDAAWPLTLSGGQAQRVSLARALISEPELLLLDEPFGAL
DALTRLTAQDLLLKTVNTRNLGVLLVTHDVSEAIALADHVLLLDDGAITHSLTVDIPGDRRTHPSFASYTAQLLEWLEIT
TPA
>P0AAI1 7.6.2.14~~~ssuB~~~Aliphatic sulfonates import ATP-binding protein SsuB~~~COG1116
MNTARLNQGTPLLLNAVSKHYAENIVLNQLDLHIPAGQFVAVVGRSGGGKSTLLRLLAGLETPTAGDVLAGTTPLAEIQE
DTRMMFQDARLLPWKSVIDNVGLGLKGQWRDAARRALAAVGLENRAGEWPAALSGGQKQRVALARALIHRPGLLLLDEPL
GALDALTRLEMQDLIVSLWQEHGFTVLLVTHDVSEAVAMADRVLLIEEGKIGLDLTVDIPRPRRLGSVRLAELEAEVLQR
VMQRGESETRLRKQG
>Q8KZQ6 7.6.2.14~~~ssuB~~~Aliphatic sulfonates import ATP-binding protein SsuB~~~COG1116
MTVLKEQPPRLLRGIPLASSGLRKTFGQREVLKGIELHIPAGQFVAIVGRSGCGKSTLLRLLAGLDQPTAGQLLAGAAPL
EQAREETRLMFQDARLLPWKKVIDNVGLGLSGDWRPRALEALESVGLADRANEWPAALSGGQKQRVALARALIHQPRLLL
LDEPLGALDALTRIEMQQLIERLWRQHGFTVLLVTHDVSEAVAVADRVILIEDGEVGLDLTVDLARPRARGSHRLAALES
EVLNRVLSAPGAAPEPDPVAPLPTQLRWAH
>P40401 ~~~ssuC~~~Putative aliphatic sulfonates transport permease protein SsuC~~~COG0600
MMKAEAAGSLPKTNAEAVRKKPGRKRYGWMKGLLLPAVIIAIWQVIGGLGVVSATVLPTPVTIVLTFKELILSGELFGHL
QISIYRAALGFLLGAGLGLMIGILAGFSKRTELYLDPSLQMLRTVPHLAVTPLFILWFGFDEVSKILLIALGAFFPVYIN
TFNGIRGVDAKLFEVARVLEFKWHQQISKVILPAALPNILLGIRLSLGIAWLGLVVAELMGSSSGVGYMIMDARQFSQTN
KVFAGIIIFAVVGKLTDSFVRLLERKLLKWRNSYEG
>P75851 ~~~ssuC~~~Putative aliphatic sulfonates transport permease protein SsuC~~~COG0600
MATPVKKWLLRVAPWFLPVGIVAVWQLASSVGWLSTRILPSPEGVVTAFWTLSASGELWQHLAISSWRALIGFSIGGSLG
LILGLISGLSRWGERLLDTSIQMLRNVPHLALIPLVILWFGIDESAKIFLVALGTLFPIYINTWHGIRNIDRGLVEMARS
YGLSGIPLFIHVILPGALPSIMVGVRFALGLMWLTLIVAETISANSGIGYLAMNAREFLQTDVVVVAIILYALLGKLADV
SAQLLERLWLRWNPAYHLKEATV
>P40402 1.14.14.5~~~ssuD~~~Alkanesulfonate monooxygenase~~~COG2141
MEILWFIPTHGDARYLGSESDGRTADHLYFKQVAQAADRLGYTGVLLPTGRSCEDPWLTASALAGETKDLKFLVAVRPGL
MQPSLAARMTSTLDRISDGRLLINVVAGGDPYELAGDGLFISHDERYEATDEFLTVWRRLLQGETVSYEGKHIKVENSNL
LFPPQQEPHPPIYFGGSSQAGIEAAAKHTDVYLTWGEPPEQVKEKIERVKKQAAKEGRSVRFGIRLHVIARETEQEAWEA
AERLISHLDDDTIAKAQAALSRYDSSGQQRMAVLHQGDRTKLEISPNLWAGIGLVRGGAGTALVGDPQTIADRIAEYQAL
GIESFIFSGYPHLEEAYYFAELVFPLLPFENDRTRKLQNKRGEAVGNTYFVKEKNA
>P80645 1.14.14.5~~~ssuD~~~Alkanesulfonate monooxygenase~~~COG2141
MSLNMFWFLPTHGDGHYLGTEEGSRPVDHGYLQQIAQAADRLGYTGVLIPTGRSCEDAWLVAASMIPVTQRLKFLVALRP
SVTSPTVAARQAATLDRLSNGRALFNLVTGSDPQELAGDGVFLDHSERYEASAEFTQVWRRLLQRETVDFNGKHIHVRGA
KLLFPAIQQPYPPLYFGGSSDVAQELAAEQVDLYLTWGEPPELVKEKIEQVRAKAAAHGRKIRFGIRLHVIVRETNDEAW
QAAERLISHLDDETIAKAQAAFARTDSVGQQRMAALHNGKRDNLEISPNLWAGVGLVRGGAGTALVGDGPTVAARINEYA
ALGIDSFVLSGYPHLEEAYRVGELLFPLLDVAIPEIPQPQPLNPQGEAVANDFIPRKVAQS
>Q9HYG2 1.14.14.5~~~ssuD~~~Alkanesulfonate monooxygenase~~~
MSLEIFWFLPTHGDGHYLGTTQGARAVDHGYLQQIAQAADRLGFGGVLIPTGRSCEDSWLVAASLIPVTQRLKFLVALRP
GIISPTVAARQAATLDRLSNGRALFNLVTGGDPDELAGDGLHLSHAERYEASVEFTRIWRRVLEGETVDYAGKHIQVKGA
KLLYPPLQQPRPPLYFGGSSEAAQDLAAEQVELYLTWGEPPAAVAEKIAQVREKAARQGRQVRFGIRLHVIVRETSEEAW
QAADRLIAHLDDDTIARAQASLARFDSVGQQRMAALHGGSRDNLEVSPNLWAGVGLVRGGAGTALVGDGPTVAARVREYA
ELGIDTFIFSGYPHLEESYRVAELLFPHLDVQRPAQPEGRGYVSPFGEMVANDILPRQAAQS
>O85764 1.14.14.5~~~ssuD~~~Alkanesulfonate monooxygenase~~~COG2141
MSLNIFWFLPTHGDGKYLGTSEGARAVDHGYLQQIAQAADRLGFGGVLIPTGRSCEDSWLVAASLIPVTQRLKFLVALRP
GIISPTVAARQAATLDRLSNGRALFNLVTGGDPDELAGDGLHLNHQERYEASVEFTRIWRKVLEGEVVDYDGKHIQVKGA
KLLYPPIQQPRPPLYFGGSSEAAQDLAAEQVELYLTWGEPPSAVAEKIAQVREKAAAQGREVRFGIRLHVIVRETNEEAW
AAADKLISHLDDDTIARAQASLARFDSVGQQRMAALHNGNRDKLEVSPNLWAGVGLVRGGAGTALVGDGPTVAARVKEYA
ELGIDTFIFSGYPHLEESYRVAELLFPHLDVQRPEQAKTSGYVSPFGEMVANDILPKSVAQS
>P80644 1.5.1.38~~~ssuE~~~FMN reductase (NADPH)~~~COG0431
MRVITLAGSPRFPSRSSSLLEYAREKLNGLDVEVYHWNLQNFAPEDLLYARFDSPALKTFTEQLQQADGLIVATPVYKAA
YSGALKTLLDLLPERALQGKVVLPLATGGTVAHLLAVDYALKPVLSALKAQEILHGVFADDSQVIDYHHRPQFTPNLQTR
LDTALETFWQALHRRDVQVPDLLSLRGNAHA
>B4E8S9 ~~~ssuR~~~HTH-type transcriptional regulator SsuR~~~COG0583
MNFQQLRFVREAVRQNMNLTEVANVLYTSQSGVSKQIKDLEDELGVDIFIRRGKRLTGLTEPGKAVHQLIERMLLDAENL
RRVARQFADQDSGHLVVATTHTQARYALPKVIRQFTDVFPKVHLALRQGSPQQIAQMILNGEADLGISTEALDRYPDIVT
FPCYSWHHTVVVPKGHPLVGRENLTLEEIAEYPIITYDQDFTGRSHIDQAFTQAGAVPDVVLTAIDADVIKTYVELGMGI
GVVAAMAYDPQRDTGLVALDTQHLFEASTTRVGLRKGAFLRAYAYRLIEMFAPHLNEAEIAGLLREAV
>Q8KLM3 2.8.2.36~~~staL~~~Desulfo-A47934 sulfotransferase~~~
MNGMCWIASYPKAGGHWLRCMLTSYVTGEPVETWPGIQAGVPHLEGLLRDGEAPSADPDEQVLLATHFTADRPVLRFYRE
STAKVVCLIRNPRDAMLSLMRMKGIPPEDVEACRKIAETFIADEGFSSVRIWAGEGSWPENIRSWTDSVHESFPNAAVLA
VRYEDLRKDPEGELWKVVDFLELGGRDGVADAVANCTLERMREMEERSKLLGLETTGLMTRGGKQLPFVGKGGQRKSLKF
MGDDIEKAYADLLHGETDFAHYARLYGYAE
>P37506 2.3.-.-~~~satA~~~Streptothricin acetyltransferase A~~~COG0456
MIMKMTHLNMKDFNKPNEPFVVFGRMIPAFENGVWTYTEERFSKPYFKQYEDDDMDVSYVEEEGKAAFLYYLENNCIGRI
KIRSNWNGYALIEDIAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHFIIGAVDTMLYSNFP
TANEIAIFWYYKF
>P08457 2.3.-.-~~~sta~~~Streptothricin acetyltransferase~~~
MTTTHGSTYEFRSARPGDAEAIEGLDGSFTTSTVFEVDVTGDGFALREVPADPPLVKVFPDDGGSDGEDGAEGEDADSRT
FVAVGADGDLAGFAAVSYSAWNQRLTIEDIEVAPGHRGKGIGRVLMRHAADFARERGAGHLWLEVTNVNAPAIHAYRRMG
FAFCGLDSALYQGTASEGEHALYMSMPCP
>P11906 ~~~stbB~~~Protein StbB~~~
MMDKRRTIAFKLNPDVNQTDKIVCDTLDSIPQGERSRLNRAALTAGLALYRQDPRTPFLLCELLTKETTFSDIVNILRSL
FPKEMADFNSSIVTQSSSQQEQKSDEETKKNAMKLIN
>P07767 ~~~~~~Staphylocoagulase~~~
MKKQIISLGALAVASSLFTWDNKADAIVTKDYSKESRVNEKSKKGATVSDYYYWKIIDSLEAQFTGAIDLLENYKYGDPI
YKEAKDRLMTRVLGEDQYLLKKKIDEYELYKKWYKSSNKNTNMLTFHKYNLYNLTMNEYNDIFNSLKDAVYQFNKEVKEI
EHKNVDLKQFDKDGEDKATKEVYDLVSEIDTLVVTYYADKDYGEHAKELRAKLDLILGDTDNPHKITNERIKKEMIDDLN
SIIDDFFMETKQNRPNSITKYDPTKHNFKEKSENKPNFDKLVEETKKAVKEADESWKNKTVKKYEETVTKSPVVKEEKKV
EEPQLPKVGNQQEVKTTAGKAEETTQPVAQPLVKIPQETIYGETVKGPEYPTMENKTLQGEIVQGPDFLTMEQNRPSLSD
NYTQPTTPNPILEGLEGSSSKLEIKPQGTESTLKGIQGESSDIEVKPQATETTEASQYGPRPQFNKTPKYVKYRDAGTGI
REYNDGTFGYEARPRFNKPSETNAYNVTTNQDGTVSYGARPTQNKPSETNAYNVTTHANGQVSYGARPTQNKPSKTNAYN
VTTHANGQVSYGARPTQKKPSKTNAYNVTTHANGQVSYGARPTYKKPSETNAYNVTTHANGQVSYGARPTQKKPSETNAY
NVTTHADGTATYGPRVTK
>P17855 ~~~~~~Staphylocoagulase~~~
MKKQIISLGALAVASSLFTWDNKADAIVTKDYSKESRVNENSKYGTLISDWYLKGRLTSLESQFINALDILETYHYGEKE
YKDAKDKLMTRILGEDQYLLERKKVQYEEYKKLYQKYKEENPTSKGLKLKTFDQYTIEDLTMREYNELTESLKSAVKDFE
KDVEKIENQHHDLKPFTDEMEEKATSRVDDLANKAYSVYFAFVRDTQHKTEALELKAKVDLVLGDEDKPHRISNERIEKE
MIKDLESIIEDFFIETGLNKPGNITSYDSSKHHYKNHSEGFEALVKETREAVANADESWKTKTVKKYGESETKSPVVKEE
NKVEDPQSPKFDNQQEVKTTAGKAEETTQPVAQPLVKIPQGTITGEIVKGPEYPTMENKTLQGEIVQGPDFPTMEQSGPS
LSDNYTQPTTPNPILEGLEGSSSKLEIKPQGTESTLKGIQGESSDIEVKPQATETTEASQYGPRPQFNKTPKYVKYRDAG
TGIREYNDGTFGYEARPRFNKPSETNAYNVTTNQDGTVSYGARPTQNKASETNAYNVTTHANGQVSYGARPTQKKPSETN
AYNVTTHANGQVSYGARPTYNKPSETNAYNVTTHGNGQVSYGARPTYKKPSKTNAYNVTTHANGQVSYGARPTQNKPSET
NAYNVTTHANGQVSYGARPTQNKPSETNAYNVTTHGNGQVSYGARPTYNKPSKTNAYNVTTHADGTATYGPRVTK
>O87278 1.-.-.-~~~stcD~~~Probable N-methylproline demethylase~~~COG0446
MPNDPLLQPYQLKHLTLRNRIIVTAHEPAYPEDGMPKERYRAYTVERARGGVAMTMTAGSAAVSKDSPPVFNNLLAYRDE
IVPWIREMTDAVHEEGAVIMIQLTHLGRRTRWDKGDWLPVVAPSHHREAAHRAFPKKIEDWDIDRIIKDFADAAERMKAG
GMDGVELEAYGHLIDQFASPLTNELDGPYGGSLDNRMRFCFDVLKAIRARVGDEFILGVRYTADECLPGGTDKAEGLEIS
KRLKESGLIDYLNIIRGHIDTDPGLTDVIPIQGMANSPHLDFAGEIRAATNFPTFHAAKIPDVATARHAIASGKVDMVGM
TRAHMTDPHIVRKIIEKREEDIRPCVGANYCLDRIYQGGAAYCIHNAATGRELTMPHSIAKAHCRRKVVVVGTGPAGLEA
ARVAGERGHEVIVFEAASDPGGQVRLTAQSPRRREMISIIDWRMSQCEKLGVTFHFNTWAEAEAIQAESPDVVIIATGGL
PHTEVLSRGNELVVSAWDIISGDAKPGTNVLIFDDAGDHAALQAAEFLATAGARVEIMTPDRSFAPEVMAMNLVPYMRCL
QKLDVTFTVTYRLEAVEKSGNELVAHVGSDYGGISKQRTFDQVVVNHGTIPLDELYFELKPFSSNLGEIAHDQMIAGEPQ
SVVRNAEGKFQLFRIGDAVAARNTHAAIYDALRLLKDI
>O82882 3.4.24.-~~~stcE~~~Metalloprotease StcE~~~
MNTKMNERWRTPMKLKYLSCTILAPLAIGVFSATAADNNSAIYFNTSQPINDLQGSLAAEVKFAQSQILPAHPKEGDSQP
HLTSLRKSLLLVRPVKADDKTPVQVEARDDNNKILGTLTLYPPSSLPDTIYHLDGVPEGGIDFTPHNGTKKIINTVAEVN
KLSDASGSSIHSHLTNNALVEIHTANGRWVRDIYLPQGPDLEGKMVRFVSSAGYSSTVFYGDRKVTLSVGNTLLFKYVNG
QWFRSGELENNRITYAQHIWSAELPAHWIVPGLNLVIKQGNLSGRLNDIKIGAPGELLLHTIDIGMLTTPRDRFDFAKDK
EAHREYFQTIPVSRMIVNNYAPLHLKEVMLPTGELLTDMDPGNGGWHSGTMRQRIGKELVSHGIDNANYGLNSTAGLGEN
SHPYVVAQLAAHNSRGNYANGIQVHGGSGGGGIVTLDSTLGNEFSHEVGHNYGLGHYVDGFKGSVHRSAENNNSTWGWDG
DKKRFIPNFYPSQTNEKSCLNNQCQEPFDGHKFGFDAMAGGSPFSAANRFTMYTPNSSAIIQRFFENKAVFDSRSSTGFS
KWNADTQEMEPYEHTIDRAEQITASVNELSESKMAELMAEYAVVKVHMWNGNWTRNIYIPTASADNRGSILTINHEAGYN
SYLFINGDEKVVSQGYKKSFVSDGQFWKERDVVDTREARKPEQFGVPVTTLVGYYDPEGTLSSYIYPAMYGAYGFTYSDD
SQNLSDNDCQLQVDTKEGQLRFRLANHRANNTVMNKFHINVPTESQPTQATLVCNNKILDTKSLTPAPEGLTYTVNGQAL
PAKENEGCIVSVNSGKRYCLPVGQRSGYSLPDWIVGQEVYVDSGAKAKVLLSDWDNLSYNRIGEFVGNVNPADMKKVKAW
NGQYLDFSKPRSMRVVYK
>D0ZIB5 2.7.-.-~~~steC~~~Secreted effector kinase SteC~~~
MPFTFQIGNHSCQISERYLRDIIDNKREHVFSTCEKFIDFFRNIFTRRSLISDYREIYNLLCQKKEHPDIKGPFSPGPFS
KRDEDCTRWRPLLGYIKLIDASRPETIDKYTVEVLAHQENMLLLQMFYDGVLVTETECSERCVDFLKETMFNYNNGEITL
AALGNDNLPPSEAGSNGIYEAFEQRLIDFLTTPATASGYESGAIDQTDASQPAAIEAFINSPEFQKNIRMRDIEKNKIGS
GSYGTVYRLHDDFVVKIPVNERGIKVDVNSPEHRNCHPDRVSKYLNMANDDKNFSRSAIMNINGKDVTVLVSKYIQGQEF
DVEDEDNYRMAEALLKSRGVYMHDINILGNILVKEGVLFFVDGDQIVLSQESRQQRSVSLATRQLEEQIKAHHMIKLKRA
ETEGNTEDVEYYKSLITDLDALIGEEEQTPAPGRRFKLAAPEEGTLVAKVLKDELKK
>Q8ZP57 2.7.-.-~~~steC~~~Secreted effector kinase SteC~~~
MPFTFQIGNHSCQISERYLRDIIDNKREHVFSTCEKFIDFFRNIFTRRSLISDYREIYNLLCQKKEHPDIKGPFSPGPFS
KRDEDCTRWRPLLGYIKLIDASRPETIDKYTVEVLAHQENMLLLQMFYDGVLVTETECSERCVDFLKETMFNYNNGEITL
AALGNDNLPPSEAGSNGIYEAFEQRLIDFLTTPATASGYESGAIDQTDASQPAAIEAFINSPEFQKNIRMRDIEKNKIGS
GSYGTVYRLHDDFVVKIPVNERGIKVDVNSPEHRNCHPDRVSKYLNMANDDKNFSRSAIMNINGKDVTVLVSKYIQGQEF
DVEDEDNYRMAEALLKSRGVYMHDINILGNILVKEGVLFFVDGDQIVLSQESRQQRSVSLATRQLEEQIKAHHMIKLKRA
ETEGNTEDVEYYKSLITDLDALIGEEEQTPAPGRRFKLAAPEEGTLVAKVLKDELKK
>O34739 ~~~steT~~~Serine/threonine exchanger SteT~~~COG0531
MHTEDNGLKKEIGLLFALTLVIGTIIGSGVFMKPGAVLAYSGDSKMALFAWLLGGILTLAGGLTIAEIGTQIPKTGGLYT
YLEEVYGEFWGFLCGWVQIIIYGPAIIGALGLYFGSLMANLFGWGSGLSKVIGIIAVLFLCVINIIGTKYGGFVQTLTTI
GKLIPIACIIVFGLWKGDQHIFTAVNESISDMNFGAAILATLFAYDGWILLAALGGEMKNPEKLLPRAMTGGLLIVTAIY
IFINFALLHILSANEIVTLGENATSTAATMLFGSIGGKLISVGIIVSIFGCLNGKVLSFPRVSFAMAERKQLPFAEKLSH
VHPSFRTPWIAISFQIALALIMMLISNPDKLSEISIFMIYIFYVMAFFAVFILRKRAKGEKRAYSVPLYPFMPILAIAGS
FFVLGSTLITDTMSCGLSILIGLAGLPVYYGMKKRKAS
>A0QQ53 2.8.2.37~~~stf0~~~Trehalose 2-sulfotransferase~~~COG4424
MSDHPTAYLVLASQRSGSTLLVESLRATGVAGEPQEFFQYLPNTSMSPQPREWFADVEDQSILRLLDPLIEGKPDLAPAT
IWRDYIQTVGRTPNGVWGGKLMWNQTPLLVQRAKDLPDRSGSGLLSAIRDVVGSDPVLIHIHRPDVVSQAVSFWRAVQTR
VWRGRPDPVRDARAEYHAGAIAHVITMLRAQEEGWRAWFTEENVEPIDVDYPYLWRNLTEVVGTVLEALGQDPRLAPKPV
LERQADQRSDEWVERYRRDAQRDGLPL
>O53699 2.8.2.37~~~stf0~~~Trehalose 2-sulfotransferase~~~COG4424
MSRAVRPYLVLATQRSGSTLLVESLRATGCAGEPQEFFQYLPSTGMAPQPREWFAGVDDDTILQLLDPLDPGTPDTATPV
AWREHVRTSGRTPNGVWGGKLMWNQTALLQQRAAQLPDRSGDGLRAAIRDVIGNEPVFVHVHRPDVVSQAVSFWRAVQTQ
VWRGHPDPKRDSQAVYHAGAIAHIIRNLRDQENGWRAWFAEEGIDPIDIAYPVLWRNLTAIVASVLDAIGQDPKLAPAPM
LERQANQRSDEWVDRYRAEAPRLGLPT
>P9WLG1 2.8.2.40~~~stf3~~~Omega-hydroxy-beta-dihydromenaquinone-9 sulfotransferase Stf3~~~COG0446
MKALRSSSRLSRWREWAAPLWVGCNFSAWMRLLIRNRFAVHHSRWHFAVLYTFLSMVNSCLGLWQKIVFGRRVAETVIAD
PPIFIVGHWRTGTTLLHELLVVDDRHTGPTGYECLAPHHFLLTEWFAPYVEFLVSKHRAMDNMDLSLHHPQEDEFVWCMQ
GLPSPYLTIAFPNRPPQYEEYLDLEQVAPRELEIWKRTLFRFVQQVYFRRRKTVILKNPTHSFRIKVLLEVFPQAKFIHI
VRDPYVVYPSTIHLHKALYRIHGLQQPTFDGLDDKVVSTYVDLYRKLDEGRELVDPTRFYELRYEDLIGDPEGQLRRLYQ
HLGLGDFECYLPRLRQYLADHADYKTNSYQLTVEQRAIVDEHWGEIIDRYGYDRHTPEPARLRPAVGG
>Q9XBQ9 1.6.1.1~~~sthA~~~Soluble pyridine nucleotide transhydrogenase~~~
MAVYNYDVVVIGTGPAGEGAAMNAVKAGRKVAVVDDRPQVGGNCTHLGTIPSKALRHSVRQIMQYNNNPLFRQIGEPRWF
SFADVLKSAEQVIAKQVSSRTGYYARNRIDTFFGTASFCDEHTIEVVHLNGMVETLVAKQFVIATGSRPYRPADVDFTHP
RIYDSDTILSLGHTPRRLIIYGAGVIGCEYASIFSGLGVLVDLIDNRDQLLSFLDDEISDSLSYHLRNNNVLIRHNEEYE
RVEGLDNGVILHLKSGKKIKADAFLWSNGRTGNTDKLGLENIGLKANGRGQIQVDEHYRTEVSNIYAAGDVIGWPSLASA
AYDQGRSAAGSITENDSWRFVDDVPTGIYTIPEISSVGKTERELTQAKVPYEVGKAFFKGMARAQIAVEKAGMLKILFHR
ETLEILGVHCFGYQASEIVHIGQAIMNQKGEANTLKYFINTTFNYPTMAEAYRVAAYDGLNRLF
>P27306 1.6.1.1~~~sthA~~~Soluble pyridine nucleotide transhydrogenase~~~COG1249
MPHSYDYDAIVIGSGPGGEGAAMGLVKQGARVAVIERYQNVGGGCTHWGTIPSKALRHAVSRIIEFNQNPLYSDHSRLLR
SSFADILNHADNVINQQTRMRQGFYERNHCEILQGNARFVDEHTLALDCPDGSVETLTAEKFVIACGSRPYHPTDVDFTH
PRIYDSDSILSMHHEPRHVLIYGAGVIGCEYASIFRGMDVKVDLINTRDRLLAFLDQEMSDSLSYHFWNSGVVIRHNEEY
EKIEGCDDGVIMHLKSGKKLKADCLLYANGRTGNTDSLALQNIGLETDSRGQLKVNSMYQTAQPHVYAVGDVIGYPSLAS
AAYDQGRIAAQALVKGEATAHLIEDIPTGIYTIPEISSVGKTEQQLTAMKVPYEVGRAQFKHLARAQIVGMNVGTLKILF
HRETKEILGIHCFGERAAEIIHIGQAIMEQKGGGNTIEYFVNTTFNYPTMAEAYRVAALNGLNRLF
>P9WHH5 1.6.1.1~~~sthA~~~Probable soluble pyridine nucleotide transhydrogenase~~~COG1249
MREYDIVVIGSGPGGQKAAIASAKLGKSVAIVERGRMLGGVCVNTGTIPSKTLREAVLYLTGMNQRELYGASYRVKDRIT
PADLLARTQHVIGKEVDVVRNQLMRNRVDLIVGHGRFIDPHTILVEDQARREKTTVTGDYIIIATGTRPARPSGVEFDEE
RVLDSDGILDLKSLPSSMVVVGAGVIGIEYASMFAALGTKVTVVEKRDNMLDFCDPEVVEALKFHLRDLAVTFRFGEEVT
AVDVGSAGTVTTLASGKQIPAETVMYSAGRQGQTDHLDLHNAGLEVQGRGRIFVDDRFQTKVDHIYAVGDVIGFPALAAT
SMEQGRLAAYHAFGEPTDGITELQPIGIYSIPEVSYVGATEVELTKSSIPYEVGVARYRELARGQIAGDSYGMLKLLVST
EDLKLLGVHIFGTSATEMVHIGQAVMGCGGSVEYLVDAVFNYPTFSEAYKNAALDVMNKMRALNQFRR
>P57112 1.6.1.1~~~sthA~~~Soluble pyridine nucleotide transhydrogenase~~~
MAVYNYDVVILGTGPAGEGAAMNASKYGRKLAVVDSRRVVGGNCTHLGTIPSKALRHSVKQIIEFNTNPMFRQIGEPRWF
SFPDVLKSADRVISKQVASRTGYYARNRIDMFTGTASFVDERTVEVVTPSGAVERLVADQFVIATGSRPYRPSDINFNHP
RVYDSDTILSLSHTPRRLIIYGAGVIGCEYASIFSGLGVLVDLIDTRDQLLSFLDDEISDALSYHLRNNNVLIRHNEEYE
RVEGLDNGVILHLKSGKKIKADALLWCNGRTGNTDKLGLENVGIKVNSRGQIEVDENYRTSVSNIFAAGDVIGWPSLASA
AYDQGRSAAGNIVESDSWRFVNDVPTGIYTIPEISSIGKNESELTAAKIPYEVGKAFFKGMARAQISNEPVGMLKILFHR
ETLEILGVHCFGDQASEIVHIGQAIMNQPGELNTLKYFVNTTFNYPTMAEAYRVAAFDGLNRLF
>O05139 1.6.1.1~~~sthA~~~Soluble pyridine nucleotide transhydrogenase~~~COG1249
MAVYNYDVVVLGSGPAGEGAAMNAAKAGRKVAMVDSRRQVGGNCTHLGTIPSKALRHSVRQIMQFNTNPMFRAIGEPRWF
SFPDVLKSAEKVISKQVASRTGYYARNRVDLFFGTGSFADEQTVEVVCANGVVEKLVAKHIIIATGSRPYRPADIDFHHP
RIYDSDTILSLGHTPRKLIIYGAGVIGCEYASIFSGLGVLVELVDNRDQLLSFLDSEISQALSYHFSNNNITVRHNEEYD
RVEGLDNGVILHLKSGKKIKADALLWCNGRTGNTDKLGMENIGVKVNSRGQIEVDENYRTCVTNIYGAGDVIGWPSLASA
AHDQGRSAAGSIVDNGSWRYVNDVPTGIYTIPEISSIGKNEHELTKAKVPYEVGKAFFKSMARAQIAGEPQGMLKILFHR
ETLEVLGVHCFGYQASEIVHIGQAIMNQPGEQNTLKYFVNTTFNYPTMAEAYRVAAYDGLNRLF
>Q6FAX7 3.4.22.-~~~stiP~~~Cysteine protease StiP~~~COG1358
MAIINKDKATELILKQGFSGSYQSEQVTFLLKRTHIEPTDTAEKERLIQSGEKHYSQMISLENAPTARHLELFEQAMQQG
QQRLAQEVQQLAQTLVVEFNEPIVLVSFVRAGVPLGVLLYHAIQDLGRDCVHYGISIIRDRGIDFAALETIIARHGHASI
VFVDGWTGKGAIRQELQRSLGNDTRFIGKPLPLVVLSDIAGCAWLAASGDDWLIPSGILGSTISGLISRSICEGETLSAD
EITAENIDQWHRCIEYHHLKEFDISQQFIQRINQIRLKLNPQSNAVWAETQQQAQQDQSQQVVHKLAQEYDIQNINRIKP
SIAEATRAILRRVPDLVLLRDADDEDTRLLRHLTQITKTPVQVVGDQIAPYRAITLIQKLGKG
>Q8KY50 2.7.11.1~~~stkP~~~Serine/threonine-protein kinase StkP~~~
MIQIGKIFAGRYRIVKQIGRGGMADVYLAKDLILDGEEVAVKVLRTNYQTDPIAVARFQREARAMADLDHPHIVRITDIG
EEDGQQYLAMEYVAGLDLKRYIKEHYPLSNEEAVRIMRQILLAMRLAHTRGIVHRDLKPQNILLTPDGTAKVTDFGIAVA
FAETSLTQTNSMLGSVHYLSPEQARGSKATVQSDIYAMGIIFYEMLTGHIPYDGDSAVTIALQHFQNPLPSVIAENSSVP
QALENVIIKATAKKLTNRYRSVSEMYVDLSSSLSYNRRNESKLIFDETSKADTKTLPKVSQSTLTSIPKVQAQTEHKSIK
NPSQAVTEETYQPQAPKKHRFKMRYLILLASLVLVAASLIWILSRSPATIAIPDVAGQTVAEAKATLKKANFEIGEEKTE
ASEKVEEGRIIRTDPGAGTGRKEGTKINLVVSSGKQSFQISNYVGRKSSDVIAELKEKKVPDNLIKIEEEESNESEAGTV
LKQSLPEGTTYDLSKATQIVLTVAKKATTIQLGNYIGRNSTEVISELKQKKVPENLIKIEEEESSESEPGTIMKQSPGAG
TTYDVSKPTQIVLTVAKKVTSVAMPSYIGSSLEFTKNNLIQIVGIKEANIEVVEVTTAPAGSVEGMVVEQSPRAGEKVDL
NKTRVKISIYKPKTTSATP
>Q04J43 2.7.11.1~~~stkP~~~Serine/threonine-protein kinase StkP~~~COG0515
MIQIGKIFAGRYRIVKQIGRGGMADVYLAKDLILDGEEVAVKVLRTNYQTDPIAVARFQREARAMADLDHPHIVRITDIG
EEDGQQYLAMEYVAGLDLKRYIKEHYPLSNEEAVRIMGQILLAMRLAHTRGIVHRDLKPQNILLTPDGTAKVTDFGIAVA
FAETSLTQTNSMLGSVHYLSPEQARGSKATVQSDIYAMGIIFYEMLTGHIPYDGDSAVTIALQHFQNPLPSVIAENSSVP
QALENVIIKATAKKLTNRYRSVSEMYVDLSSSLSYNRRNESKLIFDETSKADTKTLPKVSQSTLTSIPKVQAQTEHKSIK
NPSQAVTEETYQPQAPKKHRFKMRYLILLASLVLVAASLIWILSRTPATIAIPDVAGQTVAEAKATLKKANFEIGEEKTE
ASEKVEEGRIIRTDPGAGTGRKEGTKINLVVSSGKQSFQISNYVGRKSSDVIAELKEKKVPDNLIKIEEEESNESEAGTV
LKQSLPEGTTYDLSKATQIVLTVAKKATTIQLGNYIGRNSTEVISELKQKKVPENLIKIEEEESSESEPGTIMKQSPGAG
TTYDVSKPTQIVLTVAKKVTSVAMPSYIGSSLEFTKNNLIQIVGIKEANIEVVEVTTAPAGSVEGMVVEQSPRAGEKVDL
NKTRVKISIYKPKTTSATP
>Q97PA9 2.7.11.1~~~stkP~~~Serine/threonine-protein kinase StkP~~~COG0515
MIQIGKIFAGRYRIVKQIGRGGMADVYLAKDLILDGEEVAVKVLRTNYQTDPIAVARFQREARAMADLDHPHIVRITDIG
EEDGQQYLAMEYVAGLDLKRYIKEHYPLSNEEAVRIMGQILLAMRLAHTRGIVHRDLKPQNILLTPDGTAKVTDFGIAVA
FAETSLTQTNSMLGSVHYLSPEQARGSKATVQSDIYAMGIIFYEMLTGHIPYDGDSAVTIALQHFQKPLPSVIAENPSVP
QALENVIIKATAKKLTNRYRSVSEMYVDLSSSLSYNRRNESKLIFDETSKADTKTLPKVSQSTLTSIPKVQAQTEHKSIK
NPSQAVTEETYQPQAPKKHRFKMRYLILLASLVLVAASLIWILSRTPATIAIPDVAGQTVAEAKATLKKANFEIGEEKTE
ASEKVEEGRIIRTDPGAGTGRKEGTKINLVVSSGKQSFQISNYVGRKSSDVIAELKEKKVPDNLIKIEEEESNESEAGTV
LKQSLPEGTTYDLSKATQIVLTVAKKATTIQLGNYIGRNSTEVISELKQKKVPENLIKIEEEESSESEPGTIMKQSPGAG
TTYDVSKPTQIVLTVAKKVTSVAMPSYIGSSLEFTKNNLIQIVGIKEANIEVVEVTTAPAGSAEGMVVEQSPRAGEKVDL
NKTRVKISIYKPKTTSATP
>Q8DNS0 2.7.11.1~~~stkP~~~Serine/threonine-protein kinase StkP~~~COG0515
MIQIGKIFAGRYRIVKQIGRGGMADVYLAKDLILDGEEVAVKVLRTNYQTDPIAVARFQREARAMADLDHPHIVRITDIG
EEDGQQYLAMEYVAGLDLKRYIKEHYPLSNEEAVRIMGQILLAMRLAHTRGIVHRDLKPQNILLTPDGTAKVTDFGIAVA
FAETSLTQTNSMLGSVHYLSPEQARGSKATVQSDIYAMGIIFYEMLTGHIPYDGDSAVTIALQHFQNPLPSVIAENSSVP
QALENVIIKATAKKLTNRYRSVSEMYVDLSSSLSYNRRNESKLIFDETSKADTKTLPKVSQSTLTSIPKVQAQTEHKSIK
NPSQAVTEETYQPQAPKKHRFKMRYLILLASLVLVAASLIWILSRTPATIAIPDVAGQTVAEAKATLKKANFEIGEEKTE
ASEKVEEGRIIRTDPGAGTGRKEGTKINLVVSSGKQSFQISNYVGRKSSDVIAELKEKKVPDNLIKIEEEESNESEAGTV
LKQSLPEGTTYDLSKATQIVLTVAKKATTIQLGNYIGRNSTEVISELKQKKVPENLIKIEEEESSESEPGTIMKQSPGAG
TTYDVSKPTQIVLTVAKKVTSVAMPSYIGSSLEFTKNNLIQIVGIKEANIEVVEVTTAPAGSVEGMVVEQSPRAGEKVDL
NKTRVKISIYKPKTTSATP
>O31687 ~~~stoA~~~Sporulation thiol-disulfide oxidoreductase A~~~COG0526
MLTKRLLTIYIMLLGLIAWFPGAAQAEEKQPAVPAVFLMKTIEGEDISIPNKGQKTILHFWTSWCPPCKKELPQFQSFYD
AHPSDSVKLVTVNLVNSEQNQQVVEDFIKANKLTFPIVLDSKGELMKEYHIITIPTSFLLNEKGEIEKTKIGPMTAEQLK
EWTEE
>Q8Y678 3.1.3.16~~~stp~~~Serine/threonine phosphatase stp~~~COG0631
MHAEFRTDRGRIRHHNEDNGGVFENKDNQPIVIVADGMGGHRAGDVASEMAVRLLSDAWKETTALLTAEEIETWLRKTIQ
EVNKEIVLYAESEMDLNGMGTTLVAAIMAQSQVVIANVGDSRGYLLQNHVLRQLTEDHSLVHELLRTGEISKEDAMNHPR
KNILLRALGVEGKVEVDTFVVPFQTSDTLLLCSDGLTNMVPETEMEEILKSKRTLSEKADVFITKANSYGGEDNITVLLV
ERDLTQKGRDAS
>P0ACG1 ~~~stpA~~~DNA-binding protein StpA~~~COG2916
MSVMLQSLNNIRTLRAMAREFSIDVLEEMLEKFRVVTKERREEEEQQQRELAERQEKISTWLELMKADGINPEELLGNSS
AAAPRAGKKRQPRPAKYKFTDVNGETKTWTGQGRTPKPIAQALAEGKSLDDFLI
>P0ACG3 ~~~stpA~~~DNA-binding protein StpA~~~
MSVMLQSLNNIRTLRAMAREFSIDVLEEMLEKFRVVTKERREEEEQQQRELAERQEKISTWLELMKADGINPEELLGNSS
AAAPRAGKKRQPRPAKYKFTDVNGETKTWTGQGRTPKPIAQALAEGKSLDDFLI
>Q55034 3.1.3.69~~~stpA~~~Glucosylglycerol-phosphate phosphatase~~~
MVLHQQRFSLDHGAFCQTLAQTENLLIVQDLDGVCMELVQDPLSRRLDADYVRATTLFAEHFYVLTNGEHVGKRGVQGIV
EQSFGDASFVQQEGLYLPGLAAGGVQWQDRHGKVSHPGVGQTELEFLAAVPEKITNCLKTFFGDRPHSLSPEQLQTGIEA
SVLDNVASPTANLNTLANLLQDFPQIYRDLQETMAQLLDQLMAEAVAQGLGNSFFVHYAPNLGRDERGKEIIRWAKAGDS
GTTDFQFMLRGGVKEAGVLALLNRYYHNRTGQYPLGESFSARQAPPSHQDLLHLVKAQFDPALMPLIIGVGDTVTSQVDE
ATGEIRRGGSDRQFLQLIQDLGDWGNHGNLVVYVDSSQGEVKNRQPLQLETVAGQTQVVAGPGDMRDREEPLKINVAFPG
GHDQYVAAFKQAAQRRRVHFSQ
>P9WG91 ~~~stp~~~Multidrug resistance protein Stp~~~COG0477
MNRTQLLTLIATGLGLFMIFLDALIVNVALPDIQRSFAVGEDGLQWVVASYSLGMAVFIMSAATLADLDGRRRWYLIGVS
LFTLGSIACGLAPSIAVLTTARGAQGLGAAAVSVTSLALVSAAFPEAKEKARAIGIWTAIASIGTTTGPTLGGLLVDQWG
WRSIFYVNLPMGALVLFLTLCYVEESCNERARRFDLSGQLLFIVAVGALVYAVIEGPQIGWTSVQTIVMLWTAAVGCALF
VWLERRSSNPMMDLTLFRDTSYALAIATICTVFFAVYGMLLLTTQFLQNVRGYTPSVTGLMILPFSAAVAIVSPLVGHLV
GRIGARVPILAGLCMLMLGLLMLIFSEHRSSALVLVGLGLCGSGVALCLTPITTVAMTAVPAERAGMASGIMSAQRAIGS
TIGFAVLGSVLAAWLSATLEPHLERAVPDPVQRHVLAEIIIDSANPRAHVGGIVPRRHIEHRDPVAIAEEDFIEGIRVAL
LVATATLAVVFLAGWRWFPRDVHTAGSDLSERLPTAMTVECAVSHMPGATWCRLWPA
>P08078 2.1.4.2~~~strB1~~~Inosamine-phosphate amidinotransferase 1~~~
MSLVSVHNEWDPLEEVIVGTAVGARVPTADRSVFAVEYAGDYESQEQIPSGAYPDRVLKETEEELHVLAAELTKLGVTVR
RPGPRDHSALIKTPDWETDGFHDYCPRDGLLSVGQTIIETPMALRSRFLESLAYKDLLLEYFASGSRWLSAPKPRLTDDS
YAPQAPAGERLTDEEPVFDAANVLRFGTDLLYLVSDSGNELGAKWLQSAVGDTYTVHPCRKLYASTHVDSTIVPLRPGLV
LTNPSRVNDENMPDFLRSWENITCPELVDIGFTGDKPHCSVWIGMNLLVVRPDLAVVDRRQTALIRLLEKHGMNVLPLQL
THSRTLGGGFHCATLDVRRTARETYQF
>P49610 3.2.1.52~~~strH~~~Beta-N-acetylhexosaminidase~~~COG3064
MKHEKQQRFSIRKYAVGAASVLIGFAFQAQTVAADGVTPTTTENQPTIHTVSDSPQSSENRTEETPKAVLQPEAPKTVET
ETPATDKVASLPKTEEKPQEEVSSTPSDKAEVVTPTSAEKETANKKAEEASPKKEEAKEVDSKESNTDKTDKDKPAKKDE
AKAEADKPATEAGKERAATVNEKLAKKKIVSIDAGRKYFSPEQLKEIIDKAKHYGYTDLHLLVGNDGLRFMLDDMSITAN
GKTYASDDVKRAIEKGTNDYYNDPNGNHLTESQMTDLINYAKDKGIGLIPTVNSPGHMDAILNAMKELGIQNPNFSYFGK
KSARTVDLDNEQAVAFTKALIDKYAAYFAKKTEIFNIGLDEYANDATDAKGWSVLQADKYYPNEGYPVKGYEKFIAYAND
LARIVKSHGLKPMAFNDGIYYNSDTSFGSFDKDIIVSMWTGGWGGYDVASSKLLAEKGHQILNTNDAWYYVLGRNADGQG
WYNLDQGLNGIKNTPITSVPKTEGADIPIIGGMVAAWADTPSARYSPSRLFKLMRHFANANAEYFAADYESAEQALNEVP
KDLNRYTAESVTAVKEAEKAIRSLDSNLSRAQQDTIDQAIAKLQETVNNLTLTPEAQKEEEAKREVEKLAKNKVISIDAG
RKYFTLNQLKRIVDKASELGYSDVHLLLGNDGLRFLLDDMTITANGKTYASDDVKKAIIEGTKAYYDDPNGTALTQAEVT
ELIEYAKSKDIGLIPAINSPGHMDAMLVAMEKLGIKNPQAHFDKVSKTTMDLKNEEAMNFVKALIGKYMDFFAGKTKIFN
FGTDEYANDATSAQGWYYLKWYQLYGKFAEYANTLAAMAKERGLQPMAFNDGFYYEDKDDVQFDKDVLISYWSKGWWGYN
LASPQYLASKGYKFLNTNGDWYYILGQKPEDGGGFLKKAIENTGKTPFNQLASTKYPEVDLPTVGSMLSIWADRPSAEYK
EEEIFELMTAFADHNKDYFRANYNALREELAKIPTNLEGYSKESLEALDAAKTALNYNLNRNKQAELDTLVANLKAALQG
LKPAVTHSGSLDENEVAANVETRPELITRTEEIPFEVIKKENPNLPAGQENIITAGVKGERTHYISVLTENGKTTETVLD
SQVTKEVINQVVEVGAPVTHKGDESGLAPTTEVKPRLDIQEEEIPFTTVTCENPLLLKGKTQVITKGVNGHRSNFYSVST
SADGKEVKTLVNSVVAQEAVTQIVEVGTMVTHVGDENGQAAIAEEKPKLEIPSQPAPSTAPAEESKVLPQDPAPVVTEKK
LPETGTHDSAGLVVAGLMSTLAAYGLTKRKED
>P00779 ~~~skc~~~Streptokinase C~~~
MKNYLSFGMFALLFALTFGTVNSVQAIAGPEWLLDRPSVNNSQLVVSVAGTVEGTNQDISLKFFEIDLTSRPAHGGKTEQ
GLSPKSKPFATDSGAMSHKLEKADLLKAIQEQLIANVHSNDDYFEVIDFASDATITDRNGKVYFADKDGSVTLPTQPVQE
FLLSGHVRVRPYKEKPIQNQAKSVDVEYTVQFTPLNPDDDFRPGLKDTKLLKTLAIGDTITSQELLAQAQSILNKNHPGY
TIYERDSSIVTHDNDIFRTILPMDQEFTYRVKNREQAYRINKKSGLNEEINNTDLISEKYYVLKKGEKPYDPFDRSHLKL
FTIKYVDVDTNELLKSEQLLTASERNLDFRDLYDPRDKAKLLYNNLDAFGIMDYTLTGKVEDNHDDTNRIITVYMGKRPE
GENASYHLAYDKDRYTEEEREVYSYLRYTGTPIPDNPNDK
>Q1MW86 3.5.2.19~~~sttH~~~Streptothricin hydrolase~~~
MIRPDRCPWQPCPSGRYLSRPSGRVPRSTMMTPMTAMPAVTAMPPETAAPPETAAPARPLRPVQALLVVDVQTAFVSGAE
AVPEAARVLDRTRGLLARARTAGALVVHLQNDGAPGAVDAPHTPGWELHLPVEPGPREHVVRKTEDDGFADTGLGALLDA
AGVTELAVCGVLSEMCVAATARTALELGHRVVLPHDAHATYDIPAAPDISDTVPAAMSSRAAEWALGDEVEIVPRAAAVP
FVAPPLAPAPEAPAAAAAPAAGTGLSPAGPPPAPAR
>C5NU54 3.5.2.19~~~sttH~~~Streptothricin hydrolase~~~
MIRPGRCLWQPCLSGRKRSRPSGPVSRHGMMAAMTAPTAAAIRPVQALLVVDVQAAFVSGWEAVPDADRVLRCTRDLLSR
ARAAGALVVHLQNDGEPGAVDAPHTPGWELHLPVEPGPRERVVRKTEDDGFADTPLGDLLTDAGVTELAVCGVLSEMCVA
ATARTALVRGHRVVLPHDAHATYDIPAAPGISDTVPAAMSSRAAEWALGDEVEIVPHAAAVPFAAAPRPAVGPAAAPGLP
VSPAAPPPSPVR
>Q9FBI2 3.2.2.22~~~stxA~~~Shiga toxin subunit A~~~
MKIIIFRVLTFFFVIFSVNVVAKEFTLDFSTAKTYVDSLNVIRSAIGTPLQTISSGGTSLLMIDSGTGDNLFAVDVRGID
PEEGRFNNLRLIVERNNLYVTGFVNRTNNVFYRFADFSHVTFPGTTAVTLSGDSSYTTLQRVAGISRTGMQINRHSLTTS
YLDLMSHSGTSLTQSVARAMLRFVTVTAEALRFRQIQRGFRTTLDDLSGRSYVMTAEDVDLTLNWGRLSSVLPDYHGQDS
VRVGRISFGSINAILGSVALILNCHHHASRVARMASDEFPSMCPADGRVRGITHNKILWDSSTLGAILMRRTISS
>Q7BQ98 ~~~stxB~~~Shiga toxin subunit B~~~
MKKTLLIAASLSFFSASALATPDCVTGKVEYTKYNDDDTFTVKVGDKELFTNRWNLQSLLLSAQITGMTVTIKTNACHNG
GGFSEVIFR
>O06834 1.14.14.11~~~styA~~~Styrene monooxygenase StyA~~~
MKKRIGIVGAGTAGLHLGLFLRQHDVDVTVYTDRKPDEYSGLRLLNTVAHHAVTVQREVALDVNEWPSEEFGYFGHYYYV
GGSQPMRFYGDLKAPSRAVDYRLYQPMLMRALEARGGKSCYDDVSTEDLEGLSEQYDLLVVCTGKNALGKVFEKQSENSP
FEKPQRALCVGLFKGIKEAPIRAVTMSFSPGHGELIEIPTLSFNGMSTALVLENHIGSDLEVLAHTKYDDDPRAFLDLML
EKLRKHHPSVAERIDPAEFDLANSSLDILQGGVVPAFRDGHATLSNGKTIIGLGDVQATVDPVLGQGANMASHAAWILGE
EILAHSVYDLRFSEHLERRRQDRVLCATRWTNFILSALSALPPEFLAFLQILSQSREMADEFTDNFNYPERQWDRFSSPE
RIGQWCNQYAPTIAA
>O50214 1.14.14.11~~~styA~~~Styrene monooxygenase StyA~~~
MKKRIGIVGAGTAGLHLGLFLRQHDVDVTVYTDRKPDEYSGLRLLNTVAHNAVTVQREVALDVNEWPSEEFGYFGHYYYV
GGPQPMRFYGDLKAPSRAVDYRLYQPMLMRALEARGGKFCYDAVSAEDLEGLSEQYDLLVVCTGKYALGKVFEKQSENSP
FEKPQRALCVGLFKGIKEAPIRAVTMSFSPGHGELIEIPTLSFNGMSTALVLENHIGSDLEVLAHTKYDDDPRAFLDLML
EKLGKHHPSVAERIDPAEFDLANSSLDILQGGVVPAFRDGHATLNNGKTIIGLGDIQATVDPVLGQGANMASYAAWILGE
EILAHSVYDLRFSEHLERRRQDRVLCATRWTNFTLSALSALPPEFLAFLQILSQSREMADEFTDNFNYPERQWDRFSSPE
RIGQWCSQFAPTIAA
>O06836 5.3.99.7~~~styC~~~Styrene-oxide isomerase~~~
MLHAFERKMAGHGILMIFCTLLFGVGLWMHLVGGFEIIPGYILEFHVPGSPEGWARAHSGPALNGMMVIAVAFVLPSLGF
ADKKPHLLGNIIILDGWANVGFYFFSNFSPNRGLTFGPNHFGPGDIFSFLALAPAYLFGVLAMGALAVIGYQALKSVGSR
KAVPHATAE
>O06837 1.2.1.39~~~styD~~~Phenylacetaldehyde dehydrogenase~~~
MTRSLTMNSSLPAIDGLRLPHQMLIGGQWVNAQSDKTLNVYNPATGDTLTDVPDGDVEDVNAAVESAAATLQSDAWRRMP
PSARERILLRLADLLEAHGDELARLETLNNGKLLIYSKMMEVGASAQWLRYMAGWATKLTGSTLDLSLPLPPDVRSRAST
QRVPVGVVAAIIPWNFPLLMAVWKIAPALACGNTVVLKPAEETPLTALRLAELAMEAGLPAGALNVVTGRGETAGDALVR
HPKVAKVAFTGSTEVGRIIGSACGRSLKAVSLELGGKSPVIVLADCDPQEAAEGAAAAIFFNHGQVCTAGSRLYVHESIY
EDVIQRLAVIGESIVVGSGLEQGVHMGPMVSKKHHENVLRHIRNGIEDGADLICGGTEAPCAQGFFVKPTIFANREKKDI
RLLSQEVFGPVLVATPFSDIAEVVNEANRSVYGLGASIWTNDLSAALRINDELEAGTVWVNTHNMVDPNLPFGGFKDSGV
GREHGAAAIEHYTTTRSLVIAY
>P39153 2.7.7.87~~~ywlC~~~Threonylcarbamoyl-AMP synthase~~~COG0009
MKTKRWFVDVTDELSTNDPQIAQAAALLRENEVVAFPTETVYGLGANAKNTDAVKKIYEAKGRPSDNPLIVHIADISQLE
DLTGPAPEKAKTLMKRFWPGALTLILPCKPDALSPRVTAGLETVAIRMPDHPLALALIRESGLPIAAPSANLSGKPSPTK
AEHVAHDLDGRIAGIVDGGPTGIGVESTVLSCADDIPVLLRPGGITKEQIEAVIGPIHVDKGLSDQNEKPISPGMKYTHY
APTAPLAICEGSPERIQHLIQEYQQGGRRVGVLTTEEKAGVYSADYVKSCGRRAQLETVAAGLYDALRSFDENKVDFIIA
ESFPDTGVGLAIMNRLMKAAGGRVIR
>Q9RYM8 3.4.21.-~~~~~~Probable subtilase-type serine protease DR_A0283~~~COG1404
MPGALPMKKISLAVLSLTTLLAACGQPQTSPQSPAASAPSVAVPRTHALDIDPAQVVTTKNGDMYVRNQLVVNLRGHSAD
ALADQLGGRVLDQLPELDVALIELPQGKDARSVGVALMREGQVLYAAAQTVQRQIEPVRTAQDQLGAQAVNQVFDTLPQY
ALDSNHLHAKAAWDAGFTGKGVKVGVIDDPSDVSHPDLRPNWAGKAYDPATNTTYTTVQGWIDAIDGFDGKVDNKVDPGI
EHGTAVASTIAAAKNGQGIVGVAPDSKFYTAAIFQPGFIGDYLVARSVIWTVNQGAQVINNSWGGTGYSPLLKQAFDYAL
ERDITVVVSAGNSYREEWRNPAQLPGVIASAALDINNDKAGFSTYGRHVSVAAPGVDVMLASPLFINADGTRKTGGYTKD
GGSGYQLISGTSFSGPYTSGVAAVILGAKPDLDPHQVRRLMEETADGSVGSNKAGFDRETGYGLIRMDKLADRLKGSNMP
QKGGAGRVKVEIQTPGGYVPGILADVILEGDGADGAVYAVQTDSDGYANFVSIAPGTYTLRVATPDLTLTGGQSDERDTY
VGKLTVTSGSVLSTVAPQRVVLVKGAVDLNPVDPYEPNDTMAEAKPISYGKMTELAYIFGKPRDVDFFSFTGKAGDNIQA
DVHARTAIGGSLDSFLVLRDASGKNLAYNDDANGQDSVITFKLPADGTYFLEVSSCNILCKSNGDDASKGQDDDSPFNKY
VLELQLLK
>Q6EZC2 3.4.21.-~~~subA~~~Subtilase cytotoxin subunit A~~~
MLKILWTYILFLLFISASARAEKPWYFDAIGLTETTMSLTDKNTPVVVSVVDSGVAFIGGLSDSEFAKFSFTQDGSPFPV
KKSEALYIHGTAMASLIASRYGIYGVYPHALISSRRVIPDGVQDSWIRAIESIMSNVFLAPGEEKIINISGGQKGVASAS
VWTELLSRMGRNNDRLIVAAVGNDGADIRKLSAQQRIWPAAYHPVSSVNKKQDPVIRVAALAQYRKGETPVLHGGGITGS
RFGNNWVDIAAPGQNITFLRPDAKTGTGSGTSEATAIVSGVLAAMTSCNPRATATELKRTLLESADKYPSLVDKVTEGRV
LNAEKAISMFCKKNYIPVRQGRMSEEL
>Q6EZC3 ~~~subB~~~Subtilase cytotoxin subunit B~~~
MTIKRFFVCAGIMGCLSLNPAMAEWTGDARDGMFSGVVITQFHTGQIDNKPYFCIEGKQSAGSSISACSMKNSSVWGASF
STLYNQALYFYTTGQPVRIYYKPGVWTYPPFVKALTSNALVGLSTCTTSTECFGPDRKKNS
>P29599 3.4.21.62~~~~~~Subtilisin BL~~~
AQSVPWGISRVQAPAAHNRGLTGSGVKVAVLDTGISTHPDLNIRGGASFVPGEPSTQDGNGHGTHVAGTIAALNNSIGVL
GVAPSAELYAVKVLGADGRGAISSIAQGLEWAGNNGMHVANLSLGSPSPSATLEQAVNSATSRGVLVVAASGNSGASSIS
YPARYANAMAVGATDQNNNRASFSQYGAGLDIVAPGVNVQSTYPGSTYASLNGTSMATPHVAGAAALVKQKNPSWSNVQI
RNHLKNTATSLGSTNLYGSGLVNAEAATR
>P00780 3.4.21.62~~~subC~~~Subtilisin Carlsberg~~~
MMRKKSFWLGMLTAFMLVFTMAFSDSASAAQPAKNVEKDYIVGFKSGVKTASVKKDIIKESGGKVDKQFRIINAAKAKLD
KEALKEVKNDPDVAYVEEDHVAHALAQTVPYGIPLIKADKVQAQGFKGANVKVAVLDTGIQASHPDLNVVGGASFVAGEA
YNTDGNGHGTHVAGTVAALDNTTGVLGVAPSVSLYAVKVLNSSGSGSYSGIVSGIEWATTNGMDVINMSLGGASGSTAMK
QAVDNAYAKGVVVVAAAGNSGSSGNTNTIGYPAKYDSVIAVGAVDSNSNRASFSSVGAELEVMAPGAGVYSTYPTNTYAT
LNGTSMASPHVAGAAALILSKHPNLSASQVRNRLSSTATYLGSSFYYGKGLINVEAAAQ
>P00781 3.4.21.62~~~apr~~~Subtilisin DY~~~
AQTVPYGIPLIKADKVQAQGYKGANVKVGIIDTGIAASHTDLKVVGGASFVSGESYNTDGNGHGTHVAGTVAALDNTTGV
LGVAPNVSLYAIKVLNSSGSGTYSAIVSGIEWATQNGLDVINMSLGGPSGSTALKQAVDKAYASGIVVVAAAGNSGSSGS
QNTIGYPAKYDSVIAVGAVDSNKNRASFSSVGAELEVMAPGVSVYSTYPSNTYTSLNGTSMASPHVAGAAALILSKYPTL
SASQVRNRLSSTATNLGDSFYYGKGLINVEAAAQ
>P16396 3.4.21.-~~~epr~~~Minor extracellular protease Epr~~~COG1404
MKNMSCKLVVSVTLFFSFLTIGPLAHAQNSSEKEVIVVYKNKAGKETILDSDADVEQQYKHLPAVAVTADQETVKELKQD
PDILYVENNVSFTAADSTDFKVLSDGTDTSDNFEQWNLEPIQVKQAWKAGLTGKNIKIAVIDSGISPHDDLSIAGGYSAV
SYTSSYKDDNGHGTHVAGIIGAKHNGYGIDGIAPEAQIYAVKALDQNGSGDLQSLLQGIDWSIANRMDIVNMSLGTTSDS
KILHDAVNKAYEQGVLLVAASGNDGNGKPVNYPAAYSSVVAVSATNEKNQLASFSTTGDEVEFSAPGTNITSTYLNQYYA
TGSGTSQATPHAAAMFALLKQRDPAETNVQLREEMRKNIVDLGTAGRDQQFGYGLIQYKAQATDSAYAAAEQAVKKAEQT
KAQIDINKARELISQLPNSDAKTALHKRLDKVQSYRNVKDAKDKVAKAEKYKTQQTVDTAQTAINKLPNGTDKKNLQKRL
DQVKRYIASKQAKDKVAKAEKSKKKTDVDSAQSAIGKLPASSEKTSLQKRLNKVKSTNLKTAQQSVSAAEKKSTDANAAK
AQSAVNQLQAGKDKTALQKRLDKVKKKVAAAEAKKVETAKAKVKKAEKDKTKKSKTSAQSAVNQLKASNEKTKLQKRLNA
VKPKK
>P16397 3.4.21.-~~~bpr~~~Bacillopeptidase F~~~COG1404
MRKKTKNRLISSVLSTVVISSLLFPGAAGASSKVTSPSVKKELQSAESIQNKISSSLKKSFKKKEKTTFLIKFKDLANPE
KAAKAAVKKAKSKKLSAAKTEYQKRSAVVSSLKVTADESQQDVLKYLNTQKDKGNADQIHSYYVVNGIAVHASKEVMEKV
VQFPEVEKVLPNEKRQLFKSSSPFNMKKAQKAIKATDGVEWNVDQIDAPKAWALGYDGTGTVVASIDTGVEWNHPALKEK
YRGYNPENPNEPENEMNWYDAVAGEASPYDDLAHGTHVTGTMVGSEPDGTNQIGVAPGAKWIAVKAFSEDGGTDADILEA
GEWVLAPKDAEGNPHPEMAPDVVNNSWGGGSGLDEWYRDMVNAWRAADIFPEFSAGNTDLFIPGGPGSIANPANYPESFA
TGATDINKKLADFSLQGPSPYDEIKPEISAPGVNIRSSVPGQTYEDGWDGTSMAGPHVSAVAALLKQANASLSVDEMEDI
LTSTAEPLTDSTFPDSPNNGYGHGLVNAFDAVSAVTDGLGKAEGQVSVEGDDQEPPVYQHEKVTEAYEGGSLPLTLTAED
NVSVTSVKLSYKLDQGEWTEITAKRISGDHLKGTYQAEIPDIKGTKLSYKWMIHDFGGHVVSSDVYDVTVKPSITAGYKQ
DFETAPGGWVASGTNNNWEWGVPSTGPNTAASGEKVYGTNLTGNYANSANMNLVMPPIKAPDSGSLFLQFKSWHNLEDDF
DYGYVFVLPEGEKNWEQAGVYNGKTSSWTDEEIDLSAYKGQNIQVMFNLQSDESIAKEGWYIDDVVLSDKSAGKTVKKNK
LGVEKPSGKQKKKPVNPKKAKPSANTAVKHQNKAIQPQVLPLKAQVSVVETGKSTYSDQSTGQYTLKHKAGDYTLMAEAY
GYQSKTQKVSLKTDQTTQANFTLEEMKKGTLKGTVINKTTGEPVTGASVYVVEDAAVEPAMTNDKGEYMLEAYEGAYTIK
VAAPGYYSDEFSVELKGDVTKETALKPFVGYPGEIAYDDGTAENANSYFAAGNGWAVKMTLADGKDKGMLTGGLFRFWDT
EFPDPGGTEFKVEVYDATGKDGAPGKKIAGPFNAEALRNGEWTKVDLSSKGIMVDKDFYLVYIQSKPDPYSPGLAMDETG
QNSGRNWQYIDGKWQPGDKADGNYMIRALVDYEAAVPEITSPTDKSYTNKDSVTVKGNASPGTTVHIYNGEKEAGETKAA
ADGTFHAGIILNKGENELTATASTDNGTTDASSPITVTLDQEKPELTLDNPKDGGKTNKETLTVKGAVSDDNLKDVKVNG
KKATVADGSYSARILLENGRNEIKVIATDLAGNKTTKKTVIDVNFDKPVISGLIPGEDKNLKAGESVKIAFSSAEDLDAT
FTIRMPLTNARASVQNATELPLREISPGRYEGYWTATSSIKAKGAKVEVIVRDDYGNETRKTANGKLNMNTEN
>P0AG78 ~~~sbp~~~Sulfate-binding protein~~~COG1613
MNKWGVGLTFLLAATSVMAKDIQLLNVSYDPTRELYEQYNKAFSAHWKQQTGDNVVIRQSHGGSGKQATSVINGIEADVV
TLALAYDVDAIAERGRIDKEWIKRLPDNSAPYTSTIVFLVRKGNPKQIHDWNDLIKPGVSVITPNPKSSGGARWNYLAAW
GYALHHNNNDQAKAQDFVRALYKNVEVLDSGARGSTNTFVERGIGDVLIAWENEALLAANELGKDKFEIVTPSESILAEP
TVSVVDKVVEKKGTKEVAEAYLKYLYSPEGQEIAAKNYYRPRDAEVAKKYENAFPKLKLFTIDEEFGGWTKAQKEHFANG
GTFDQISKR
>P02906 ~~~sbp~~~Sulfate-binding protein~~~
MKKWGVGFTLLLASTSILAKDIQLLNVSYDPTRELYEQYNKAFSAHWKQETGDNVVIRQSHGGSGKQATSVINGIEADVV
TLALAYDVDAIAERGRIDKNWIKRLPDNSAPYTSTIVFLVRKGNPKQIHDWNDLIKPGVSVITPNPKSSGGARWNYLAAW
GYALHHNNNDQAKAQDFVKALFKNVEVLDSGARGSTNTFVERGIGDVLIAWENEALLATNELGKDKFEIVTPSESILAEP
TVSVVDKVVEKKDTKAVAEAYLKYLYSPEGQEIAAKNFYRPRDADVAKKYDDAFPKLKLFTIDEVFGGWAKAQKDHFANG
GTFDQISKR
>P27366 ~~~sbpA~~~Sulfate-binding protein~~~COG1613
MKTAWTRRSFLQSAALATATVITIAACGGNNQSSSGGSGQPVEVTLVSYAVTQAAYEQIIPKFAAQWKEKTGQEVRFNQS
YGGSGSQTRAVIDGLEADVVALALESDINQIEKAGLIQPGWQQRVPNNGIITNSVVALVTQEGNPKGIKDWTDLTKPGVR
IVTANPKTSGGARWNFLGAWGSVTQTGGTEEQALQFTTDIYKNVPILAKDARESTDVFTKGQADVLLNYENELILAQQKG
EKVDYAIPPVNINIQGPVAVVDTYTDKHGTRKVSEAFVQFLFTPEAQAEFAKVGFRPALPEGVDPQLLAPFPKIQTWFTV
ADLGGWAKVQPEFFGDGGWFDKVQQAVAGR
>Q01903 ~~~sbpA~~~Sulfate-binding protein~~~COG1613
MARSAFGWGFSVIAVLMVGSITACNTTTTTEPGQGENASQAPANLTLVSYAVTRDAFEKIIPKFTEEWKSKTGQDVTFEQ
SYGGSGSQTRAVVDGLEADIVALALSSDVQKIESAGLIQPGWEQEAPNGSIVTNSVIAFVTKASDNIKVEKWADLANPEV
KVITANPKTSGGARWNFLGIWGSVTKTGGTEEQAFDFAGKVLANAPVLPKDARESTDVFYKQGQGNVLLNYENEVLLAKQ
KGENQPYIIPQDFNVSISGPVAVVDTTVDKKGTREVRDAFVQYLFTPEAQQIFAETGFRPVNEEVLAKFASQYPKVENLA
TIEEFGGWKKAQAEFFDEGGIFDKVITKIGRQ
>P35835 3.4.21.62~~~aprN~~~Subtilisin NAT~~~
MRSKKLWISLLFALTLIFTMAFSNMSAQAAGKSSTEKKYIVGFKQTMSAMSSAKKKDVISEKGGKVQKQFKYVNAAAATL
DEKAVKELKKDPSVAYVEEDHIAHEYAQSVPYGISQIKAPALHSQGYTGSNVKVAVIDSGIDSSHPDLNVRGGASFVPSE
TNPYQDGSSHGTHVAGTIAALNNSIGVLGVAPSASLYAVKVLDSTGSGQYSWIINGIEWAISNNMDVINMSLGGPTGSTA
LKTVVDKAVSSGIVVAAAAGNEGSSGSTSTVGYPAKYPSTIAVGAVNSSNQRASFSSVGSELDVMAPGVSIQSTLPGGTY
GAYNGTSMATPHVAGAAALILSKHPTWTNAQVRDRLESTATYLGNSFYYGKGLINVQAAAQ
>P29600 3.4.21.62~~~~~~Subtilisin Savinase~~~
AQSVPWGISRVQAPAAHNRGLTGSGVKVAVLDTGISTHPDLNIRGGASFVPGEPSTQDGNGHGTHVAGTIAALNNSIGVL
GVAPSAELYAVKVLGASGSGSVSSIAQGLEWAGNNGMHVANLSLGSPSPSATLEQAVNSATSRGVLVVAASGNSGAGSIS
YPARYANAMAVGATDQNNNRASFSQYGAGLDIVAPGVNVQSTYPGSTYASLNGTSMATPHVAGAAALVKQKNPSWSNVQI
RNHLKNTATSLGSTNLYGSGLVNAEAATR
>P00782 3.4.21.62~~~apr~~~Subtilisin BPN'~~~COG1404
MRGKKVWISLLFALALIFTMAFGSTSSAQAAGKSNGEKKYIVGFKQTMSTMSAAKKKDVISEKGGKVQKQFKYVDAASAT
LNEKAVKELKKDPSVAYVEEDHVAHAYAQSVPYGVSQIKAPALHSQGYTGSNVKVAVIDSGIDSSHPDLKVAGGASMVPS
ETNPFQDNNSHGTHVAGTVAALNNSIGVLGVAPSASLYAVKVLGADGSGQYSWIINGIEWAIANNMDVINMSLGGPSGSA
ALKAAVDKAVASGVVVVAAAGNEGTSGSSSTVGYPGKYPSVIAVGAVDSSNQRASFSSVGPELDVMAPGVSIQSTLPGNK
YGAYNGTSMASPHVAGAAALILSKHPNWTNTQVRSSLENTTTKLGDSFYYGKGLINVQAAAQ
>P07518 3.4.21.62~~~apr~~~Subtilisin~~~
AQSVPYGISQIKAPALHSQGYTGSNVKVAVIDSGIDSSHPDLNVRGGASFVPSETNPYQDGSSHGTHVAGTIAALNNSIG
VLGVAPSSALYAVKVLDSTGSGQYSWIINGIEWAISNNMDVINMSLGGPTGSTALKTVVDKAVSSGIVVAAAAGNEGSSG
STSTVGYPAKYPSTIAVGAVNSANQRASFSSAGSELDVMAPGVSIQSTLPGGTYGAYNGTSMATPHVAGAAALILSKHPT
WTNAQVRDRLESTATYLGSSFYYGKGLINVQAAAQ
>P28842 3.4.21.62~~~sub1~~~Subtilisin~~~
MKRSGKIFTTAMLAVTLMMPAMGVSANEGNAAAEGNEKFRVLVDSVDQKNLKNAKQQYGVHWDFAGEGFTTDMNEKQFNA
LKKNKNLTVEKVPELEIATATDKPEALYNAMAASQSTPWGIKAIYNNSSITQTSGGGGINIAVLDTGVNTNHPDLRNNVE
QCKDFTVGTTYTNNSCTDRQGHGTHVAGSALADGGTGNGVYGVAPDADLWAYKVLGDDGSGYADDIAAAIRHAGDQATAL
NTKVVINMSLGSSGESSLITNAVNYSYNKGVLIIAAAGNSGPYQGSIGYPGALVNAVAVAALENKVENGTYRVADFSSRG
YSWTDGDYAIQKGDVEISAPGAAIYSTWFDGGYATISGTSMASPHAAGLAAKIWAQYPSASNVDVRGELQYRAYENDILS
GYYAGYGDDFASGFGFATVQ
>P00783 3.4.21.62~~~apr~~~Subtilisin amylosacchariticus~~~
MRSKKLWISLLFALTLIFTMAFSNMSAQAAGKSSTEKKYIVGFKQTMSAMSSAKKKDVISEKGGKVQKQFKYVNAAAATL
DEKAVKELKKDPSVAYVEEDHIAHEYAQSVPYGISQIKAPALHSQGYTGSNVKVAVIDSGIDSSHPDLNVRGGASFVPSE
TNPYQDGSSHGTHVAGTIAALNNSIGVLGVSPSASLYAVKVLDSTGSGQYSWIINGIEWAISNNMDVINMSLGGPSGSTA
LKTVVDKAVSSGIVVAAAAGNEGSSGSSSTVGYPAKYPSTIAVGAVNSSNQRASFSSAGSELDVMAPGVSIQSTLPGGTY
GAYNGTSMATPHVAGAAALILSKHPTWTNAQVRDRLESTATYLGNSFYYGKGLINVQAAAQ
>P04189 3.4.21.62~~~aprE~~~Subtilisin E~~~COG1404
MRSKKLWISLLFALTLIFTMAFSNMSAQAAGKSSTEKKYIVGFKQTMSAMSSAKKKDVISEKGGKVQKQFKYVNAAAATL
DEKAVKELKKDPSVAYVEEDHIAHEYAQSVPYGISQIKAPALHSQGYTGSNVKVAVIDSGIDSSHPDLNVRGGASFVPSE
TNPYQDGSSHGTHVAGTIAALNNSIGVLGVAPSASLYAVKVLDSTGSGQYSWIINGIEWAISNNMDVINMSLGGPTGSTA
LKTVVDKAVSSGIVVAAAAGNEGSSGSTSTVGYPAKYPSTIAVGAVNSSNQRASFSSAGSELDVMAPGVSIQSTLPGGTY
GAYNGTSMATPHVAGAAALILSKHPTWTNAQVRDRLESTATYLGNSFYYGKGLINVQAAAQ
>P29141 3.4.21.-~~~vpr~~~Minor extracellular protease Vpr~~~COG1404
MKKGIIRFLLVSFVLFFALSTGITGVQAAPASSKTSADLEKAEVFGDIDMTTSKKTTVIVELKEKSLAEAKEAGESQSKS
KLKTARTKAKNKAIKAVKNGKVNREYEQVFSGFSMKLPANEIPKLLAVKDVKAVYPNVTYKTDNMKDKDVTISEDAVSPQ
MDDSAPYIGANDAWDLGYTGKGIKVAIIDTGVEYNHPDLKKNFGQYKGYDFVDNDYDPKETPTGDPRGEATDHGTHVAGT
VAANGTIKGVAPDATLLAYRVLGPGGSGTTENVIAGVERAVQDGADVMNLSLGNSLNNPDWATSTALDWAMSEGVVAVTS
NGNSGPNGWTVGSPGTSREAISVGATQLPLNEYAVTFGSYSSAKVMGYNKEDDVKALNNKEVELVEAGIGEAKDFEGKDL
TGKVAVVKRGSIAFVDKADNAKKAGAIGMVVYNNLSGEIEANVPGMSVPTIKLSLEDGEKLVSALKAGETKTTFKLTVSK
ALGEQVADFSSRGPVMDTWMIKPDISAPGVNIVSTIPTHDPDHPYGYGSKQGTSMASPHIAGAVAVIKQAKPKWSVEQIK
AAIMNTAVTLKDSDGEVYPHNAQGAGSARIMNAIKADSLVSPGSYSYGTFLKENGNETKNETFTIENQSSIRKSYTLEYS
FNGSGISTSGTSRVVIPAHQTGKATAKVKVNTKKTKAGTYEGTVIVREGGKTVAKVPTLLIVKEPDYPRVTSVSVSEGSV
QGTYQIETYLPAGAEELAFLVYDSNLDFAGQAGIYKNQDKGYQYFDWDGTINGGTKLPAGEYYLLAYAANKGKSSQVLTE
EPFTVE
>P80886 6.2.1.5~~~sucC~~~Succinate--CoA ligase [ADP-forming] subunit beta~~~COG0045
MNIHEYQGKEVLRKYGVSVPEGKVAFTAEEAVESAKSLSSSVYVVKAQIHAGGRGKAGGVKIAKSLDEVKAYAEELLGKT
LVTHQTGPDGQVIKRLLIEEGCDIKKEYYIGLVLDRATSRIVLMASEEGGTEIEEVAEKTPEKIKKAVIDPAVGLQGYQA
REIAFAINIPKELVGKAAKFMLGLYKAFVEKDCSIAEINPLVVTGDGNVMALDAKLNFDSNALYRQKDIMEYRDLDEEDP
KEIEASKYDLSYISLDGNIGCMVNGAGLAMSTMDIIKHYGGEPANFLDVGGGATAEKVTEAFKIILSDQNVKGIFVNIFG
GIMKCDVIAEGVVEATRQVGLTLPLVVRLEGTNVDLGKKILSESGLNITSAESMADGAQKIVSLV
>Q5HVN3 6.2.1.5~~~sucC~~~Succinate--CoA ligase [ADP-forming] subunit beta~~~
MNIHEYQAKAIFVDNGIPTLKGKVAFSVDEAVANAKELGGSVWAVKAQIHAGGRGLGGGVKIAKNLDEVKDYASKILGMN
LVTHQTGPEGKLVQKLYIESGANIVKEYYLAILFNRMAEQITIIASSEGGMDIEKVAKESPEKIAKVGIDPQIGFKMFHG
LEVARVLGLDKDEGKKLISMIAKLYKLYMDKDMNMLEINPLIKTAEGDFYALDAKCSFDDSALYRHPEIAELRDTTEENP
AEREAAEFGLSYVKLDGDVACMVNGAGLAMATMDIINYSGAKPANFLDVGGGASPETVAKAFEIILRDKNVKVIFINIFG
GIVRCDRIANGILEATKNVEVNIPIVVRLDGTNAAEAKTILDNSNLKNIKAATNLKNGAELVKSLVG
>P0A836 6.2.1.5~~~sucC~~~Succinate--CoA ligase [ADP-forming] subunit beta~~~COG0045
MNLHEYQAKQLFARYGLPAPVGYACTTPREAEEAASKIGAGPWVVKCQVHAGGRGKAGGVKVVNSKEDIRAFAENWLGKR
LVTYQTDANGQPVNQILVEAATDIAKELYLGAVVDRSSRRVVFMASTEGGVEIEKVAEETPHLIHKVALDPLTGPMPYQG
RELAFKLGLEGKLVQQFTKIFMGLATIFLERDLALIEINPLVITKQGDLICLDGKLGADGNALFRQPDLREMRDQSQEDP
REAQAAQWELNYVALDGNIGCMVNGAGLAMGTMDIVKLHGGEPANFLDVGGGATKERVTEAFKIILSDDKVKAVLVNIFG
GIVRCDLIADGIIGAVAEVGVNVPVVVRLEGNNAELGAKKLADSGLNIIAAKGLTDAAQQVVAAVEGK
>Q5NHF3 6.2.1.5~~~sucC~~~Succinate--CoA ligase [ADP-forming] subunit beta~~~COG0045
MNLHEYQAKDLLESYGLKVQKGIVAHNPNEAAQAFDQLGGKFAVVKAQVHAGGRGKAGGVKVVKSSQEAREVAESLIGKN
LVTFQTDAEGQPVNSVGVFEDVYPVTRELYLGAVVDRSSRKVTFMASTEGGVDIEEVAHNSPEKILKVEVDPLVGLQPFQ
AREVAFKLGLEGKQINDFVKTMLGAYKAFIECDFALFEINPLAVRENGEIVCVDGKINLDSNALYRHPKLLALRDKSQEN
AKELKASEHELNYVALEGNIGCMVNGAGLAMATMDIIQLYGGKPANFLDVGGGATKERVIEAFKLILDDENVKAILINIF
GGIVRCDMIAEAIIEAVKEVNVTVPVVVRLEGNNAEKGAKILADSGLKLIPADGLADAADKVVKSLG
>A0R3M4 6.2.1.5~~~sucC~~~Succinate--CoA ligase [ADP-forming] subunit beta~~~COG0045
MDLFEYQAKELFAKHNVPTSPGRVTDSAEDAKTIAEEIGRPVMVKAQVKTGGRGKAGGVKYAATPDDAFTHANNILGLDI
KGHVVKKLLVAEASDIAEEYYISFLLDRANRTYLAMCSVEGGMEIEEVAATKPERLAKVPVDAVKGVDLAFARSIAEQGH
LPAEVLDAAAVTIQKLWEVFVKEDATLVEVNPLVRTPDDQILALDGKVTLDENAGFRQPGHAEFEDRDATDPLELKAKEN
DLNYVKLDGQVGIIGNGAGLVMSTLDVVAYAGENHGGVKPANFLDIGGGASAAVMAAGLDVILGDSQVKSVFVNVFGGIT
ACDAVANGIVQALQILGDEANKPLVVRLDGNNVEEGRRILAEANHPLVIQAETMDAGADKAAELANK
>P9WGC5 6.2.1.5~~~sucC~~~Succinate--CoA ligase [ADP-forming] subunit beta~~~COG0045
MDLFEYQAKELFAKHNVPSTPGRVTDTAEGAKAIATEIGRPVMVKAQVKIGGRGKAGGVKYAATPQDAYEHAKNILGLDI
KGHIVKKLLVAEASDIAEEYYLSFLLDRANRTYLAMCSVEGGMEIEEVAATKPERLAKVPVNAVKGVDLDFARSIAEQGH
LPAEVLDTAAVTIAKLWELFVAEDATLVEVNPLVRTPDHKILALDAKITLDGNADFRQPGHAEFEDRAATDPLELKAKEH
DLNYVKLDGQVGIIGNGAGLVMSTLDVVAYAGEKHGGVKPANFLDIGGGASAEVMAAGLDVVLGDQQVKSVFVNVFGGIT
SCDAVATGIVKALGMLGDEANKPLVVRLDGNNVEEGRRILTEANHPLVTLVATMDEAADKAAELASA
>P53593 6.2.1.5~~~sucC~~~Succinate--CoA ligase [ADP-forming] subunit beta~~~
MNLHEYQGKQLFAEYGLPVSKGFAVDTPEEAAEACDKIGGSEWVVKAQVHAGGRGKAGGVKLVKSKEDAKAFAQQWLGKN
LVTYQTDANGQPVSKILVESCTDIDKELYLGAVVDRSSRRIVFMASTEGGVDIEKVAHDTPEKILKATIDPLVGAQPYQG
RELAFQLGLKGDQIKQFTHIFVGLAKLFQDYDLALLEVNPLVIKKDGNLHCLDAKINIDSNALYRQPKLRAMHDPSQDDA
REAHAQKWELNYVALEGNIGCMVNGAGLAMGTMDIVNLHGGKPANFLDVGGGATKERVTEAFKIILSDSNVKAVLVNIFG
GIVRCDMIAEGIIGAVKEVGVKVPVVVRLEGNNAELGAKVLAESGLNIIAATSLTDAAQQVVKAAEGK
>P99071 6.2.1.5~~~sucC~~~Succinate--CoA ligase [ADP-forming] subunit beta~~~
MNIHEYQGKEIFRSMGVAVPEGRVAFTAEEAVEKAKELNSDVYVVKAQIHAGGRGKAGGVKIAKSLSEVETYAKELLGKT
LVTHQTGPEGKEIKRLYIEEGCAIQKEYYVGFVIDRATDQVTLMASEEGGTEIEEVAAKTPEKIFKETIDPVIGLSPFQA
RRIAFNINIPKESVNKAAKFLLALYNVFIEKDCSIVEINPLVTTADGDVLALDAKINFDDNALFRHKDVVELRDLEEEDP
KEIEASKHDLSYIALDGDIGCMVNGAGLAMATMDTINHFGGNPANFLDAGGSATREKVTEAFKIILGDENVKGIFVNIFG
GIMKCDVIAEGIVEAVKEVDLTLPLVVRLEGTNVELGKKILKDSGLAIEPAATMAEGAQKIVKLVKEA
>P25126 6.2.1.4~~~sucC~~~Succinate--CoA ligase [GDP-forming] subunit beta~~~
MNLHEYQAKEILARYGVPVPPGKVAYTPEEAKRIAEEFGKRVVIKAQVHVGGRGKAGGVKLADTPQEAYEKAQAILGMNI
KGLTVKKVLVAEAVDIAKEYYAGLILDRAKKRVVLMLSKEGGVDIEEVAAERPEAIHKFWIDPHKGFRPFEAREMVKRAG
LEGNLNKLAQVLVALYRAYEGVDASIAEINPLVVTTDGGIVAADAKIVLDDNALFRHPDLAELREVEAEHPLEVEASNYG
FAYVKLDGNIGIIGNGAGLVMYTLDLVNRVGGKPANFLDIGGGAKADVVYNALKVVLKDPDVKGVFINIFGGITRADEVA
KGVIRALEEGLLTKPVVMRVAGTAEEEAKKLLEGKPVYMYPTSIEAAKAIVAMVGGAA
>P80865 6.2.1.5~~~sucD~~~Succinate--CoA ligase [ADP-forming] subunit alpha~~~COG0074
MSVFINKDTRVIVQGITGSTALFHTKQMLEYGTNIVGGVTPGKGGTEAEGVPVFNTVAEAVQTTGANASVIYVPAPFAAD
AIMEAVDAELDLVICITEHIPVLDMVKVKRFMEGKKTRLIGPNCPGVITPEECKIGIMPGYIHKKGHVGVVSRSGTLTYE
AVHQLSEAGVGQSTAVGIGGDPVNGTNFIDVLKAFNEDPDTHAVIMIGEIGGTAEEEAAEWVKANMTKPVVGFIGGKTAP
PGKRMGHAGAIISGGKGTADEKIKTLNACGIEVAETPSVMGETLIKVLKEKNLFETCKTH
>P38947 1.2.1.76~~~sucD~~~Succinate-semialdehyde dehydrogenase (acetylating)~~~COG1012
MSNEVSIKELIEKAKVAQKKLEAYSQEQVDVLVKALGKVVYDNAEMFAKEAVEETEMGVYEDKVAKCHLKSGAIWNHIKD
KKTVGIIKEEPERALVYVAKPKGVVAATTPITNPVVTPMCNAMAAIKGRNTIIVAPHPKAKKVSAHTVELMNAELKKLGA
PENIIQIVEAPSREAAKELMESADVVIATGGAGRVKAAYSSGRPAYGVGPGNSQVIVDKGYDYNKAAQDIITGRKYDNGI
ICSSEQSVIAPAEDYDKVIAAFVENGAFYVEDEETVEKFRSTLFKDGKINSKIIGKSVQIIADLAGVKVPEGTKVIVLKG
KGAGEKDVLCKEKMCPVLVALKYDTFEEAVEIAMANYMYEGAGHTAGIHSDNDENIRYAGTVLPISRLVVNQPATTAGGS
FNNGFNPTTTLGCGSWGRNSISENLTYEHLINVSRIGYFNKEAKVPSYEEIWG
>P0AGE9 6.2.1.5~~~sucD~~~Succinate--CoA ligase [ADP-forming] subunit alpha~~~COG0074
MSILIDKNTKVICQGFTGSQGTFHSEQAIAYGTKMVGGVTPGKGGTTHLGLPVFNTVREAVAATGATASVIYVPAPFCKD
SILEAIDAGIKLIITITEGIPTLDMLTVKVKLDEAGVRMIGPNCPGVITPGECKIGIQPGHIHKPGKVGIVSRSGTLTYE
AVKQTTDYGFGQSTCVGIGGDPIPGSNFIDILEMFEKDPQTEAIVMIGEIGGSAEEEAAAYIKEHVTKPVVGYIAGVTAP
KGKRMGHAGAIIAGGKGTADEKFAALEAAGVKTVRSLADIGEALKTVLK
>P9WGC7 6.2.1.5~~~sucD~~~Succinate--CoA ligase [ADP-forming] subunit alpha~~~COG0074
MTHMSIFLSRDNKVIVQGITGSEATVHTARMLRAGTQIVGGVNARKAGTTVTHEDKGGRLIKLPVFGSVAEAMEKTGADV
SIIFVPPTFAKDAIIEAIDAEIPLLVVITEGIPVQDTAYAWAYNLEAGHKTRIIGPNCPGIISPGQSLAGITPANITGPG
PIGLVSKSGTLTYQMMFELRDLGFSTAIGIGGDPVIGTTHIDAIEAFERDPDTKLIVMIGEIGGDAEERAADFIKTNVSK
PVVGYVAGFTAPEGKTMGHAGAIVSGSSGTAAAKQEALEAAGVKVGKTPSATAALAREILLSL
>Q51567 6.2.1.5~~~sucD~~~Succinate--CoA ligase [ADP-forming] subunit alpha~~~
MSVLINKDTKVICQGFTGSQGTFHSEQAIAYGTKMVGGVTPGKGGTTHLGLPVFNTVKEAVEATGAEASVIYVPAPFCKD
SILEAAFGGIKLIVCITEGIPTLDMLDAKVKCDELGVRLIGPNCPGVITPGECKIGIMPGHIHLPGKVGIVSRSGTLTYE
AVKQTTDAGFGQSTCVGIGGDPIPGSNFIDILKLFQEDPQTEAIVMIGEIGGSAEEEAAAFIKANVTKPVVSYIAGVTAP
PGKRMGHAGAIISGGKGTADEKFAALQDAGVKTVRSLADIGKALAELTGWEVKKA
>P99070 6.2.1.5~~~sucD~~~Succinate--CoA ligase [ADP-forming] subunit alpha~~~
MSVFIDKNTKVMVQGITGSTALFHTKQMLDYGTKIVAGVTPGKGGQVVEGVPVFNTVEEAKNETGATVSVIYVPAPFAAD
SILEAADADLDMVICITEHIPVLDMVKVKRYLQGRKTRLVGPNCPGVITADECKIGIMPGYIHKKGHVGVVSRSGTLTYE
AVHQLTEEGIGQTTAVGIGGDPVNGTNFIDVLKAFNEDDETKAVVMIGEIGGTAEEEAAEWIKANMTKPVVGFIGGQTAP
PGKRMGHAGAIISGGKGTAEEKIKTLNSCGVKTAATPSEIGSTLIEAAKEAGIYESLLTVNK
>P09143 6.2.1.4~~~sucD~~~Succinate--CoA ligase [GDP-forming] subunit alpha~~~
MILVNRETRVLVQGITGREGQFHTKQMLDYGTKIVAGVTPGKGGTEVLGVPVYDTVKEAVAHHEVDASIIFVPAPAAADA
ALEAAHAGIPLIVLITEGIPTLDMVRAVEEIKALGSRLIGGNCPGIISAEETKIGIMPGHVFKRGRVGIISRSGTLTYEA
AAALSQAGLGTTTTVGIGGDPVIGTTFKDLLPLFNEDPETEAVVLIGEIGGSDEEEAAAWVKDHMKKPVVGFIGGRSAPK
GKRMGHAGAIIMGNVGTPESKLRAFAEAGIPVADTIDEIVELVKKALG
>A0A6C7EEG6 2.4.1.329~~~~~~Sucrose 6(F)-phosphate phosphorylase~~~
MTHAGMKASLPNRVMLNAYPDSIDGDLAGTVRMLQRPEFTDAFGLFYVLPSIFNSDLDRGFSIIDYDLNSDLASAEDLAA
LDELGIMLKFDMVLNHLSVGSPQFQDLLKHGDDSAFRDFFIDWNEFWEGEGELHADGHVVPSPEHLDRLFMRKPGLPILQ
VRFPDGSDRFYWNTFYQRVETIDGERSYLGQMDLNAESPRVWTFYRETFEKLARYGAKIVRLDAFAYLHKAVGDTNFFNT
PGTWDHLDRLRTISEENGLVLLPEIHGEYGTKIHEELSDRDYPVYDFFFPGLVIDAIDSASNTHLLRWIDEIIERDIATV
NMLGCHDGIPVIDLKGGPTGQGLLPDATIEAMISRLLERGGRVKNLYGADGTKVSYYQVNATFFSALGESDARLRLARAI
QLFVPGTPQVWYLDLFAGANDVEAADRAGADGHKEINRTNLSAADVEAGLARPIVLDQLEMIRLRNASPAFDGRFEVVPT
DDTRLQLRWQNGSTVALLDADLATERFTITHEHDGHTEILGYD
>D9TT09 2.4.1.329~~~spp~~~Sucrose 6(F)-phosphate phosphorylase~~~COG0366
MALKNKVQLITYPDSLGGNLKTLNDVLEKYFSDVFGGVHILPPFPSSGDRGFAPITYSEIEPKFGTWYDIKKMAENFDIL
LDLMVNHVSRRSIYFQDFLKKGRKSEYADMFITLDKLWKDGKPVKGDIEKMFLRRTLPYSTFKIEETGEEEKVWTTFGKT
DPSEQIDLDVNSHLVREFLLEVFKTFSNFGVKIVRLDAVGYVIKKIGTSCFFVEPEIYEFLDWAKGQAASYGIELLLEVH
SQFEVQYKLAERGFLIYDFILPFTVLYTLINKSNEMLYHYLKNRPINQFTMLDCHDGIPVKPDLDGLIDTKKAKEVVDIC
VQRGANLSLIYGDKYKSEDGFDVHQINCTYYSALNCDDDAYLAARAIQFFTPGIPQVYYVGLLAGVNDFEAVKKTKEGRE
INRHNYGLKEIEESVQKNVVQRLLKLIRFRNEYEAFNGEFFIEDCRKDEIRLTWKKDDKRCSLFIDLKTYKTTIDYINEN
GEEVKYLV
>A0ZZH6 2.4.1.7~~~sucP~~~Sucrose phosphorylase~~~
MKNKVQLITYADRLGDGTIKSMTDILRTRFDGVYDGVHILPFFTPFDGADAGFDPIDHTKVDERLGSWDDVAELSKTHNI
MVDAIVNHMSWESKQFQDVLAKGEESEYYPMFLTMSSVFPNGATEEDLAGIYRPRPGLPFTHYKFAGKTRLVWVSFTPQQ
VDIDTDSDKGWEYLMSIFDQMAASHVSYIRLDAVGYGAKEAGTSCFMTPKTFKLISRLREEGVKRGLEILIEVHSYYKKQ
VEIASKVDRVYDFALPPLLLHALSTGHVEPVAHWTDIRPNNAVTVLDTHDGIGVIDIGSDQLDRSLKGLVPDEDVDNLVN
TIHANTHGESQAATGAAASNLDLYQVNSTYYSALGCNDQHYIAARAVQFFLPGVPQVYYVGALAGKNDMELLRKTNNGRD
INRHYYSTAEIDENLKRPVVKALNALAKFRNELDAFDGTFSYTTDDDTSISFTWRGETSQATLTFEPKRGLGVDNTTPVA
MLEWEDSAGDHRSDDLIANPPVVA
>Q59495 2.4.1.7~~~~~~Sucrose phosphorylase~~~
MEIQNKAMLITYADSLGKNLKDVHQVLKEDIGDAIGGVHLLPFFPSTGDRGFAPADYTRVDAAFGDWADVEALGEEYYLM
FDFMINHISRESVMYQDFKKNHDDSKYKDFFIRWEKFWAKAGENRPTQADVDLIYKRKDKAPTQEITFDDGTTENLWNTF
GEEQIDIDVNSAIAKEFIKTTLEDMVKHGANLIRLDAFAYAVKKVDTNDFFVEPEIWDTLNEVREILTPLKAEILPEIHE
HYSIPKKINDHGYFTYDFALPMTTLYTLYSGKTNQLAKWLKMSPMKQFTTLDTHDGIGVVDARDILTDDEIDYASEQLYK
VGANVKKTYSSASYNNLDIYQINSTYYSALGNDDAAYLLSRVFQVFAPGIPQIYYVGLLAGENDIALLESTKEGRNINRH
YYTREEVKSEVKRPVVANLLKLLSWRNESPAFDLAGSITVDTPTDTTIVVTRQDENGQNKAVLTADAANKTFEIVENGQT
VMSSDNLTQN
>P10249 2.4.1.7~~~gtfA~~~Sucrose phosphorylase~~~COG0366
MPIINKTMLITYADSLGKNLKELNENIENYFGDAVGGVHLLPFFPSTGDRGFAPIDYHEVDSAFGDWDDVKCLGEKYYLM
FDFMINHISRQSKYYKDYQEKHEASAYKDLFLNWDKFWPKNRPTQEDVDLIYKRKDRAPKQEIQFADGSVEHLWNTFGEE
QIDLDVTKEVTMDFIRSTIENLAANGCDLIRLDAFAYAVKKLDTNDFFVEPEIWTLLDKVRDIAAVSGAEILPEIHEHYT
IQFKIADHDYYVYDFALPMVTLYSLYSSKVDRLAKWLKMSPMKQFTTLDTHDGIGVVDVKDILTDEEITYTSNELYKVGA
NVNRKYSTAEYNNLDIYQINSTYYSALGDDDQKYFLARLIQAFAPGIPQVYYVGFLAGKNDLELLESTKEGRNINRHYYS
SEEIAKEVKRPVVKALLNLFTYRNQSAAFDLDGRIEVETPNEATIVIERQNKDGSHIAKAEINLQDMTYRVTENDQTISF
E
>P77667 ~~~sufA~~~Protein SufA~~~COG0316
MDMHSGTFNPQDFAWQGLTLTPAAAIHIRELVAKQPGMVGVRLGVKQTGCAGFGYVLDSVSEPDKDDLLFEHDGAKLFVP
LQAMPFIDGTEVDFVREGLNQIFKFHNPKAQNECGCGESFGV
>P77522 ~~~sufB~~~FeS cluster assembly protein SufB~~~COG0719
MSRNTEATDDVKTWTGGPLNYKEGFFTQLATDELAKGINEEVVRAISAKRNEPEWMLEFRLNAYRAWLEMEEPHWLKAHY
DKLNYQDYSYYSAPSCGNCDDTCASEPGAVQQTGANAFLSKEVEAAFEQLGVPVREGKEVAVDAIFDSVSVATTYREKLA
EQGIIFCSFGEAIHDHPELVRKYLGTVVPGNDNFFAALNAAVASDGTFIYVPKGVRCPMELSTYFRINAEKTGQFERTIL
VADEDSYVSYIEGCSAPVRDSYQLHAAVVEVIIHKNAEVKYSTVQNWFPGDNNTGGILNFVTKRALCEGENSKMSWTQSE
TGSAITWKYPSCILRGDNSIGEFYSVALTSGHQQADTGTKMIHIGKNTKSTIISKGISAGHSQNSYRGLVKIMPTATNAR
NFTQCDSMLIGANCGAHTFPYVECRNNSAQLEHEATTSRIGEDQLFYCLQRGISEEDAISMIVNGFCKDVFSELPLEFAV
EAQKLLAISLEHSVG
>P80866 ~~~sufC~~~Vegetative protein 296~~~COG0396
MAASTLTIKDLHVEIEGKEILKGVNLEIKGGEFHAVMGPNGTGKSTLSAAIMGHPKYEVTKGSITLDGKDVLEMEVDERA
QAGLFLAMQYPSEISGVTNADFLRSAINARREEGDEISLMKFIRKMDENMEFLEMDPEMAQRYLNEGFSGGEKKRNEILQ
LMMIEPKIAILDEIDSGLDIDALKVVSKGINKMRSENFGCLMITHYQRLLNYITPDVVHVMMQGRVVKSGGAELAQRLEA
EGYDWIKQELGIEDETVGQEA
>P77499 ~~~sufC~~~Probable ATP-dependent transporter SufC~~~COG0396
MLSIKDLHVSVEDKAILRGLSLDVHPGEVHAIMGPNGSGKSTLSATLAGREDYEVTGGTVEFKGKDLLALSPEDRAGEGI
FMAFQYPVEIPGVSNQFFLQTALNAVRSYRGQETLDRFDFQDLMEEKIALLKMPEDLLTRSVNVGFSGGEKKRNDILQMA
VLEPELCILDESDSGLDIDALKVVADGVNSLRDGKRSFIIVTHYQRILDYIKPDYVHVLYQGRIVKSGDFTLVKQLEEQG
YGWLTEQQ
>P77689 ~~~sufD~~~FeS cluster assembly protein SufD~~~COG0719
MAGLPNSSNALQQWHHLFEAEGTKRSPQAQQHLQQLLRTGLPTRKHENWKYTPLEGLINSQFVSIAGEISPQQRDALALT
LDSVRLVFVDGRYVPALSDATEGSGYEVSINDDRQGLPDAIQAEVFLHLTESLAQSVTHIAVKRGQRPAKPLLLMHITQG
VAGEEVNTAHYRHHLDLAEGAEATVIEHFVSLNDARHFTGARFTINVAANAHLQHIKLAFENPLSHHFAHNDLLLAEDAT
AFSHSFLLGGAVLRHNTSTQLNGENSTLRINSLAMPVKNEVCDTRTWLEHNKGFCNSRQLHKTIVSDKGRAVFNGLINVA
QHAIKTDGQMTNNNLLMGKLAEVDTKPQLEIYADDVKCSHGATVGRIDDEQIFYLRSRGINQQDAQQMIIYAFAAELTEA
LRDEGLKQQVLARIGQRLPGGAR
>Q9EXP1 ~~~sufE~~~Cysteine desulfuration protein SufE~~~COG2166
MAQLPDPQKLLRNFSRCSNWEEKYLYIIELGAGLAPLSDAQRQDGNRVSGCQSQVWIDLASNEQGNVVLHGDSDAAIVKG
LIAIVFSLYQGLSVREIVELDVRPFFASLALTQHLTPSRSQGLEAMLRAVRARASALI
>P76194 ~~~sufE~~~Cysteine desulfuration protein SufE~~~COG2166
MALLPDKEKLLRNFLRCANWEEKYLYIIELGQRLPELRDEDRSPQNSIQGCQSQVWIVMRQNAQGIIELQGDSDAAIVKG
LIAVVFILYDQMTPQDIVNFDVRPWFEKMALTQHLTPSRSQGLEAMIRAIRAKAAALS
>Q8ZPQ1 ~~~sufE~~~Cysteine desulfuration protein SufE~~~
MAALPDKEKLLRNFTRCANWEEKYLYIIELGQRLAELNPQDRNPQNTIHGCQSQVWIVMRRNANGIIELQGDSDAAIVKG
LMAVVFILYHQMTAQDIVHFDVRPWFEKMALAQHLTPSRSQGLEAMIRAIRAKAATLS
>O32164 2.8.1.7~~~sufS~~~Cysteine desulfurase SufS~~~COG0520
MNITDIREQFPILHQQVNGHDLVYLDSAATSQKPRAVIETLDKYYNQYNSNVHRGVHTLGTRATDGYEGAREKVRKFINA
KSMAEIIFTKGTTTSLNMVALSYARANLKPGDEVVITYMEHHANIIPWQQAVKATGATLKYIPLQEDGTISLEDVRETVT
SNTKIVAVSHVSNVLGTVNPIKEMAKIAHDNGAVIVVDGAQSTPHMKIDVQDLDCDFFALSSHKMCGPTGVGVLYGKKAL
LENMEPAEFGGEMIDFVGLYESTWKELPWKFEAGTPIIAGAIGLGAAIDFLEEIGLDEISRHEHKLAAYALERFRQLDGV
TVYGPEERAGLVTFNLDDVHPHDVATVLDAEGIAVRAGHHCAQPLMKWLDVTATARASFYLYNTEEEIDKLVEALQKTKE
YFTNVF
>Q9EXP2 2.8.1.7~~~sufS~~~Cysteine desulfurase~~~COG0520
MNYPVEHYPIDRVRADFPILQQSVNGQPLAYLDSAASAQKPLAVIDRERDFYLHEYAAVHRGIHTLSARATSAMEEVRAK
VATFIHAASAEDIVFVRGTTEAINLVANSYGRTAFQPGDNLVISEMEHHANIVPWQMLAQARGLTLRVLPITDDGELDMA
QLPALLDERTRLVAVTQVSNVLGTVNPLAEIIRQAHACGAKVLVDGAQAVMHQAVDVQALDCDFYAFSGHKLYGPSGIGV
LYGKSELLQAMPPWEGGGAMIREVSLTQGTTYADPPWRFEAGSPHVAGIIGLGAALDYVSALGVDAIQAHEGLLMRYALA
SLAEVPTLRLYGPVHRQGVIAFNLGRHHAFDVGSFLDQYGIAIRTGHHCAMPLMSRYGVPSMCRASLALYSCQDEIDRLV
AGLHRIHRLLGE
>P77444 2.8.1.7~~~sufS~~~Cysteine desulfurase~~~COG0520
MIFSVDKVRADFPVLSREVNGLPLAYLDSAASAQKPSQVIDAEAEFYRHGYAAVHRGIHTLSAQATEKMENVRKRASLFI
NARSAEELVFVRGTTEGINLVANSWGNSNVRAGDNIIISQMEHHANIVPWQMLCARVGAELRVIPLNPDGTLQLETLPTL
FDEKTRLLAITHVSNVLGTENPLAEMITLAHQHGAKVLVDGAQAVMHHPVDVQALDCDFYVFSGHKLYGPTGIGILYVKE
ALLQEMPPWEGGGSMIATVSLSEGTTWTKAPWRFEAGTPNTGGIIGLGAALEYVSALGLNNIAEYEQNLMHYALSQLESV
PDLTLYGPQNRLGVIAFNLGKHHAYDVGSFLDNYGIAVRTGHHCAMPLMAYYNVPAMCRASLAMYNTHEEVDRLVTGLQR
IHRLLG
>A0A0H2XI17 ~~~sufT~~~Fe-S protein maturation auxiliary factor SufT~~~
MVIDPELGIDIVNLGLVYKVNVDDEGVCTVDMTLTSMGCPMGPQIIDQVKTVLAEIPEIQDTEVNIVWSPPWTKDMMSRY
AKIALGVS
>O32163 2.-.-.-~~~sufU~~~Zinc-dependent sulfurtransferase SufU~~~COG0822
MSFNANLDTLYRQVIMDHYKNPRNKGVLNDSIVVDMNNPTCGDRIRLTMKLDGDIVEDAKFEGEGCSISMASASMMTQAI
KGKDIETALSMSKIFSDMMQGKEYDDSIDLGDIEALQGVSKFPARIKCATLSWKALEKGVAKEEGGN
>P9WG03 ~~~sugA~~~Trehalose transport system permease protein SugA~~~COG1175
MTSVEQRTATAVFSRTGSRMAERRLAFMLVAPAAMLMVAVTAYPIGYALWLSLQRNNLATPNDTAFIGLGNYHTILIDRY
WWTALAVTLAITAVSVTIEFVLGLALALVMHRTLIGKGLVRTAVLIPYGIVTVVASYSWYYAWTPGTGYLANLLPYDSAP
LTQQIPSLGIVVIAEVWKTTPFMSLLLLAGLALVPEDLLRAAQVDGASAWRRLTKVILPMIKPAIVVALLFRTLDAFRIF
DNIYVLTGGSNNTGSVSILGYDNLFKGFNVGLGSAISVLIFGCVAVIAFIFIKLFGAAAPGGEPSGR
>P9WG01 ~~~sugB~~~Trehalose transport system permease protein SugB~~~COG0395
MGARRATYWAVLDTLVVGYALLPVLWIFSLSLKPTSTVKDGKLIPSTVTFDNYRGIFRGDLFSSALINSIGIGLITTVIA
VVLGAMAAYAVARLEFPGKRLLIGAALLITMFPSISLVTPLFNIERAIGLFDTWPGLILPYITFALPLAIYTLSAFFREI
PWDLEKAAKMDGATPGQAFRKVIVPLAAPGLVTAAILVFIFAWNDLLLALSLTATKAAITAPVAIANFTGSSQFEEPTGS
IAAGAIVITIPIIVFVLIFQRRIVAGLTSGAVKG
>P9WQI3 7.5.2.-~~~sugC~~~Trehalose import ATP-binding protein SugC~~~COG3842
MAEIVLDHVNKSYPDGHTAVRDLNLTIADGEFLILVGPSGCGKTTTLNMIAGLEDISSGELRIAGERVNEKAPKDRDIAM
VFQSYALYPHMTVRQNIAFPLTLAKMRKADIAQKVSETAKILDLTNLLDRKPSQLSGGQRQRVAMGRAIVRHPKAFLMDE
PLSNLDAKLRVQMRGEIAQLQRRLGTTTVYVTHDQTEAMTLGDRVVVMYGGIAQQIGTPEELYERPANLFVAGFIGSPAM
NFFPARLTAIGLTLPFGEVTLAPEVQGVIAAHPKPENVIVGVRPEHIQDAALIDAYQRIRALTFQVKVNLVESLGADKYL
YFTTESPAVHSVQLDELAEVEGESALHENQFVARVPAESKVAIGQSVELAFDTARLAVFDADSGANLTIPHRA
>P9WKZ3 2.1.2.-~~~~~~dTDP-4-amino-4,6-dideoxyglucose formyltransferase~~~COG0223
MTILILTDNVHAHALAVDLQARHGDMDVYQSPIGQLPGVPRCDVAERVAEIVERYDLVLSFHCKQRFPAALIDGVRCVNV
HPGFNPYNRGWFPQVFSIIDGQKVGVTIHEIDDQLDHGPIIAQRECAIESWDSSGSVYARLMDIERELVLEHFDAIRDGS
YTAKSPATEGNLNLKKDFEQLRRLDLNERGTFGHFLNRLRALTHDDFRNAWFVDASGRKVFVRVVLEPEKPAEA
>Q6MWZ7 3.1.3.2~~~gpm2~~~Acid phosphatase~~~COG0406
MGVRNHRLLLLRHGETAWSTLGRHTGGTEVELTDTGRTQAELAGQLLGELELDDPIVICSPRRRTLDTAKLAGLTVNEVT
GLLAEWDYGSYEGLTTPQIRESEPDWLVWTHGCPAGESVAQVNDRADSAVALALEHMSSRDVLFVSHGHFSRAVITRWVQ
LPLAEGSRFAMPTASIGICGFEHGVRQLAVLGLTGHPQPIAAG
>Q45499 3.1.3.25~~~suhB~~~Inositol-1-monophosphatase~~~COG0483
MTNWTEIDEIAKKWIREAGARITQSMHESLTIETKSNPNDLVTNIDKETEKFFIDRIQETFPGHRILGEEGQGDKIHSLE
GVVWIIDPIDGTMNFVHQQRNFAISIGIFENGEGKIGLIYDVVHDELYHAFSGRGAYMNETKLAPLKETVIEEAILAINA
TWVTENRRIDQSVLAPLVKRVRGTRSYGSAALELANVAAGRIDAYITMRLAPWDYAAGCVLLNEVGGTYTTIEGEPFTFL
ENHSVLAGNPSIHKTIFEEYLHARK
>P0ADG4 3.1.3.25~~~suhB~~~Nus factor SuhB~~~COG0483
MHPMLNIAVRAARKAGNLIAKNYETPDAVEASQKGSNDFVTNVDKAAEAVIIDTIRKSYPQHTIITEESGELEGTDQDVQ
WVIDPLDGTTNFIKRLPHFAVSIAVRIKGRTEVAVVYDPMRNELFTATRGQGAQLNGYRLRGSTARDLDGTILATGFPFK
AKQYATTYINIVGKLFNECADFRRTGSAALDLAYVAAGRVDGFFEIGLRPWDFAAGELLVREAGGIVSDFTGGHNYMLTG
NIVAGNPRVVKAMLANMRDELSDALKR
>P9WKI9 3.1.3.25~~~suhB~~~Inositol-1-monophosphatase SuhB~~~COG0483
MTRPDNEPARLRSVAENLAAEAAAFVRGRRAEVFGISRAGDGDGAVRAKSSPTDPVTVVDTDTERLLRDRLAQLRPGDPI
LGEEGGGPADVTATPSDRVTWVLDPIDGTVNFVYGIPAYAVSIGAQVGGITVAGAVADVAARTVYSAATGLGAHLTDERG
RHVLRCTGVDELSMALLGTGFGYSVRCREKQAELLAHVVPLVRDVRRIGSAALDLCMVAAGRLDAYYEHGVQVWDCAAGA
LIAAEAGARVLLSTPRAGGAGLVVVAAAPGIADELLAALQRFNGLEPIPD
>Q9HXI4 3.1.3.25~~~suhB~~~Nus factor SuhB~~~
MQPMLNIALRAARSAGELIFRSIERLDVISVNEKDAKDYVTEVDRAAEQTIVAALRKAYPTHAIMGEEGGFIEGSGEGAD
YLWVIDPLDGTTNFIHGVPHFAVSIACKYKGRLEHAVVLDPVRQEEFTASRGRGAALNGRRLRVSGRKSLEGALLGTGFP
FRDNQIDNLDNYLNMFRSLVGQTAGIRRAGAASLDLAYVAAGRYDAFWEFGLSEWDMAAGALLVQEAGGLVSDFTGSHEF
LEKGHIVAGNTKCFKALLTTIQPHLPPSLKR
>A0A0F6B4W4 3.1.3.25~~~suhB~~~Nus factor SuhB~~~
MHPMLTIAVRAARKAGNVIAKNYETPDAVEASQKGSNDFVTNVDKAAEAVIIDTIRKSYPQHTIITEESGEHVGTDQDVQ
WVIDPLDGTTNFIKRLPHFAVSIAVRIKGRTEVAVVYDPMRNELFTATRGQGAQLNGYRLRGSTARDLDGTILATGFPFK
AKQYATTYINIIGKLFTECADFRRTGSAALDLAYVAAGRVDGFFEIGLRPWDFAAGELLVREAGGIVSDFTGGHNYMMTG
NIVAGNPRVVKAMLANMRDELSDALKR
>P0AFZ5 ~~~sulA~~~Cell division inhibitor SulA~~~COG5404
MYTSGYAHRSSSFSSAASKIARVSTENTTAGLISEVVYREDQPMMTQLLLLPLLQQLGQQSRWQLWLTPQQKLSREWVQA
SGLPLTKVMQISQLSPCHTVESMVRALRTGNYSVVIGWLADDLTEEEHAELVDAANEGNAMGFIMRPVSASSHATRQLSG
LKIHSNLYH
>Q9HZJ8 ~~~sulA~~~Cell division inhibitor SulA~~~
MQTSHSLPSAQLPLFQEAFWASNGAPLLDDVIDSPSSASIEEPAAFSELSLSGLPGHCLTLLAPILRELSEEQDARWLTL
IAPPASLTHEWLRRAGLNRERILLLQAKDNAAALALSCEALRLGRSHTVVSWLEPLSRAARKQLSRAAQLGQAQSLNIRL
G
>P22291 ~~~sulD~~~Bifunctional folate synthesis protein~~~COG0801
MDQLQIKDLEMFAYHGLFPSEKELGQKFVVSAILSYDMTKAATDLDLTASVHYGELCQQWTTWFQETSEDLIETVAYKLV
ERTFEFYPLVQEMKLELKKPWAPVHLSLDTCSVTIHRRKQRAFIALGSNMGDKQANLKQAIDKLRARGIHILKESSVLAT
EPWGGVEQDSFANQVVEVETWLPAQDLLETLLAIESELGRVREVHWGPRLIDLDLLFVEDQILYTDDLILPHPYIAERLF
VLESLQEIAPHFIHPILKQPIRNLYDALKK
>P59657 ~~~sulD~~~Bifunctional folate synthesis protein~~~COG0801
MDQLQIKDLEMFAYHGLFPSEKELGQKFIVSAILSYDMTKAATDLDLTASVHYGELCQQWTTWFQETSEDLIETVAYKLV
ERTFESYPLVQEMKLELKKPWAPVHLSLDTCSVTIHRRKQRAFIALGSNMGDKQANLKQAIDKLRARGIHILKESSVLAT
EPWGGVEQDSFANQVVEVETWLPAQDLLETLLAIESELGRVREVHWGPRLIDLDLLFVEDQILYTDDLILPHPYIAERLF
VLESLQEIAPHFIHPILKQPIRNLYDALKK
>Q0TUK6 3.1.6.1~~~~~~Arylsulfatase~~~COG3119
MKPNIVLIMVDQMRGDCLGVNGNEFIETPNLDMMATEGYNFENAYTAVPSCIASRASILTGMSQKSHGRVGYEDGVSWNY
ENTIASEFSKAGYHTQCIGKMHVYPERNLCGFHNIMLHDGYLHFARNKEGKASTQIEQCDDYLKWFREKKGHNVDLIDIG
LDCNSWVSRPWGYEENLHPTNWVVNESIDFLRRKDPSKPFFLKMSFVRPHSPLDPPKFYFDMYKDEDLPEPLMGDWANKE
DEENRGKDINCVKGIINKKALKRAKAAYYGSITHIDHQIGRFLIALSEYGELNNTIFLFVSDHGDMMGDHNWFRKGIPYE
GSSRVPFFIYDPGNLLKGKKGKVFDEVLELRDIMPTLLDFAHISIPDSVEGLSLKNLIEERNSTWRDYIHGEHSFGEDSN
HYIVTKRDKFLWFSQRGEEQYFDLENDPKELTNLIDSEEYKERIDYLRKILIKELEGREEGYTDGNRLLKGHPVSTLKHI
R
>O34734 ~~~cysP~~~Sulfate permease CysP~~~COG0306
MELAAILFSLFFAMNIGASGAAASMGVAYGSGAIKKKTYALILCAVGVFAGAVIGGGEVVKTISSGIIPEQTITLTIVCI
IIGAAALSLFTANLLGIPLSTSEVTVGAVVGVGVAYKVLFVNNLLIIVSFWVFVPLFAFGFTYFVSKLFRYFKIEVKSSK
KQKILGIVLLVAGFFEAFSAGMNNVANAVGPLVAAGVLDVGKGTLYGGAFVALGALLLGRRVLETNGKKITRFSKGEGIL
LSGTGAGLVIISSVFGMPVPLAQVTSSSIIGIGMAKNGPNVFHKQVVQTMLKVWIVSPFLSLSISYLLVSLFLKADYYSI
FIMVSVLLAAGGAISLTKAIRKERRSVHEQGGGI
>O34744 2.1.1.107~~~sumT~~~Uroporphyrinogen-III C-methyltransferase~~~COG0007
MGKVYIVGAGPGDPDLLTIKALKAIEKADVILYDRLVNKEILQYAKEQADLIYCGKLPDFHTMKQETINRFLVKYAQKGK
MVVRLKGGDPFVFGRGGEEAECLSENGIPFEIIPGITSGIAAAAYAGIPVTHRDAGSNVAFVTGHYKKEEDFEEKWKALA
TGIDTLVIYMGIKNVQQIERKLLENGRDGSTPAAFIHWGTTDKQKSVFCTVDTLSETVIKENITNPSLIVIGNVVNYHYK
LEWFESELKKQDLSEAL
>P29928 2.1.1.107~~~cobA~~~Uroporphyrinogen-III C-methyltransferase~~~
MGKVYLVGAGPGDPDLITLKGLKAIQQADVILYDRLVNKDLLEYAKSDADIIYCGKLPNYHTLKQETINNFLVKFAKKGK
IVTRLKGGDPFVFGRGGEEAEALVQQGISFEIVPGITSGIAAAAYAGIPVTHREYSASFAFVAGHRKDSKHDAIKWDSLA
KGVDTLAIYMGVRNLPYICQQLMKHGKTSATPIALIHWGTCADQRTVTGTLGTIVDIVKEEQIENPSMIIVGEVVNFS
>P21631 2.1.1.107~~~cobA~~~Uroporphyrinogen-III C-methyltransferase~~~
MIDDLFAGLPALEKGSVWLVGAGPGDPGLLTLHAANALRQADVIVHDALVNEDCLKLARPGAVLEFAGKRGGKPSPKQRD
ISLRLVELARAGNRVLRLKGGDPFVFGRGGEEALTLVEHQVPFRIVPGITAGIGGLAYAGIPVTHREVNHAVTFLTGHDS
SGLVPDRINWQGIASGSPVIVMYMAMKHIGAITANLIAGGRSPDEPVAFVCNAATPQQAVLETTLARAEADVAAAGLEPP
AIVVVGEVVRLRAALDWIGALDGRKLAADPFANRILRNPA
>P68577 ~~~sunA~~~SPbeta prophage-derived bacteriocin sublancin-168~~~
MEKLFKEVKLEELENQKGSGLGKAQCAALWLQCASGGTIGCGGGAVACQNYRQFCR
>O31989 ~~~sunI~~~Sublancin immunity protein SunI~~~
MEYVVMIIILLALFFIFTVFLNTRYSFDEKCLVLKFGLSKTEIPINQIVSIKESDKYGVADNIDYKIGMPYAQPDRIVIE
TTNKRFLVFLNGAQQFIQKYKRVSV
>O31986 2.4.1.-~~~sunS~~~SPbeta prophage-derived glycosyltransferase SunS~~~COG0463
MKLSDIYLELKKGYADSLLYSDLSLLVNIMEYEKDIDVMSIQSLVAGYEKSDTPTITCGIIVYNESKRIKKCLNSVKDDF
NEIIVLDSYSTDDTVDIIKCDFPDVEIKYEKWKNDFSYARNKIIEYATSEWIYFIDADNLYSKENKGKIAKVARVLEFFS
IDCVVSPYIEEYTGHLYSDTRRMFRLNGKVKFHGKVHEEPMNYNHSLPFNFIVNLKVYHNGYNPSENNIKSKTRRNINLT
EEMLRLEPENPKWLFFFGRELHLLDKDEEAIDYLKKSINNYKKFNDQRHFIDALVLLCTLLLQRNNYVDLTLYLDILETE
YPRCVDVDYFRSAILLVDMQNKLTSLSNMIDEALTDERYSAINTTKDHFKRILISLNIQLENWERVKEISGEIKNDNMKK
EIKQYLANSLHNIEHVLKGIEV
>P75792 3.1.3.23~~~ybiV~~~Sugar phosphatase YbiV~~~COG0561
MSVKVIVTDMDGTFLNDAKTYNQPRFMAQYQELKKRGIKFVVASGNQYYQLISFFPELKDEISFVAENGALVYEHGKQLF
HGELTRHESRIVIGELLKDKQLNFVACGLQSAYVSENAPEAFVALMAKHYHRLKPVKDYQEIDDVLFKFSLNLPDEQIPL
VIDKLHVALDGIMKPVTSGFGFIDLIIPGLHKANGISRLLKRWDLSPQNVVAIGDSGNDAEMLKMARYSFAMGNAAENIK
QIARYATDDNNHEGALNVIQAVLDNTSPFNS
>P0ABZ8 5.2.1.8~~~surA~~~Chaperone SurA~~~COG0760
MKNWKTLLLGIAMIANTSFAAPQVVDKVAAVVNNGVVLESDVDGLMQSVKLNAAQARQQLPDDATLRHQIMERLIMDQII
LQMGQKMGVKISDEQLDQAIANIAKQNNMTLDQMRSRLAYDGLNYNTYRNQIRKEMIISEVRNNEVRRRITILPQEVESL
AQQVGNQNDASTELNLSHILIPLPENPTSDQVNEAESQARAIVDQARNGADFGKLAIAHSADQQALNGGQMGWGRIQELP
GIFAQALSTAKKGDIVGPIRSGVGFHILKVNDLRGESKNISVTEVHARHILLKPSPIMTDEQARVKLEQIAADIKSGKTT
FAAAAKEFSQDPGSANQGGDLGWATPDIFDPAFRDALTRLNKGQMSAPVHSSFGWHLIELLDTRNVDKTDAAQKDRAYRM
LMNRKFSEEAASWMQEQRASAYVKILSN
>P0ABZ6 5.2.1.8~~~surA~~~Chaperone SurA~~~COG0760
MKNWKTLLLGIAMIANTSFAAPQVVDKVAAVVNNGVVLESDVDGLMQSVKLNAAQARQQLPDDATLRHQIMERLIMDQII
LQMGQKMGVKISDEQLDQAIANIAKQNNMTLDQMRSRLAYDGLNYNTYRNQIRKEMIISEVRNNEVRRRITILPQEVESL
AQQVGNQNDASTELNLSHILIPLPENPTSDQVNEAESQARAIVDQARNGADFGKLAIAHSADQQALNGGQMGWGRIQELP
GIFAQALSTAKKGDIVGPIRSGVGFHILKVNDLRGESKNISVTEVHARHILLKPSPIMTDEQARVKLEQIAADIKSGKTT
FAAAAKEFSQDPGSANQGGDLGWATPDIFDPAFRDALTRLNKGQMSAPVHSSFGWHLIELLDTRNVDKTDAAQKDRAYRM
LMNRKFSEEAASWMQEQRASAYVKILSN
>O67004 3.1.3.5~~~surE~~~5'-nucleotidase SurE~~~COG0496
MPTFLLVNDDGYFSPGINALREALKSLGRVVVVAPDRNLSGVGHSLTFTEPLKMRKIDTDFYTVIDGTPADCVHLGYRVI
LEEKKPDLVLSGINEGPNLGEDITYSGTVSGAMEGRILGIPSIAFSAFGRENIMFEEIAKVCVDIVKKVLNEGIPEDTYL
NVNIPNLRYEEIKGIKVTRQGKRAYKERVFKYIDPYGKPFYWIAAEEFGWHAEEGTDYWAVLNGYVSVTPLHLDLTNYKV
MKSIKYLEDSP
>B2S5B9 3.1.3.5~~~surE~~~5'-nucleotidase SurE~~~
MRILLTNDDGIHAEGLAVLERIARKLSDDVWVVAPETDQSGLAHSLTLLEPLRLRQIDARHFALRGTPTDCVIMGVRHVL
PGAPDLVLSGVNSGANMADDVTYSGTVAGAMEGTLLGVRAIALSQEYEYAGDRRIVPWETAEAHAPELIGRLMEAGWPEG
VLLNLNFPNCAPEEVKGVRVTAQGKLSHDARLDERRDGRGFPYFWLHFGRGKAPVADDSDIAAIRSGCISMTPLHLDLTA
HKVRAELGAALGVEA
>Q9KI21 3.1.3.5~~~surE~~~5'-nucleotidase SurE~~~COG0496
MKKTATPKLRLLLSNDDGVYAKGLAILAKTLADLGEVDVVAPDRNRSGASNSLTLNAPLHIKNLENGMISVEGTPTDCVH
LAITGVLPEMPDMVVAGINAGPNLGDDVWYSGTVAAAMEGRFLGLPALAVSLGGELFRYYETAAKVVYQLIQRIEKDPLP
PSTILNINVPDLPYEELKGFEVTRLGTRHRAEPTIRQIDPRGHPIYWVGAAGPEQDSGPGTDFFAMNHHCVSITPLRVDL
THYEAFDQLASWVKRLEM
>P0A840 3.1.3.5~~~surE~~~5'/3'-nucleotidase SurE~~~COG0496
MRILLSNDDGVHAPGIQTLAKALREFADVQVVAPDRNRSGASNSLTLESSLRTFTFENGDIAVQMGTPTDCVYLGVNALM
RPRPDIVVSGINAGPNLGDDVIYSGTVAAAMEGRHLGFPALAVSLDGHKHYDTAAAVTCSILRALCKEPLRTGRILNINV
PDLPLDQIKGIRVTRCGTRHPADQVIPQQDPRGNTLYWIGPPGGKCDAGPGTDFAAVDEGYVSITPLHVDLTAHSAQDVV
SDWLNSVGVGTQW
>P66881 3.1.3.5~~~surE~~~5'/3'-nucleotidase SurE~~~
MRILLSNDDGVHAPGIQTLAKALREFADVQVVAPDRNRSGASNSLTLESSLRTFTFDNGDIAVQMGTPTDCVYLGVNALM
RPRPDIVVSGINAGPNLGDDVIYSGTVAAAMEGRHLGFPALAVSLNGYQHYDTAAAVTCALLRGLSREPLRTGRILNVNV
PDLPLAQVKGIRVTRCGSRHPADKVIPQEDPRGNTLYWIGPPGDKYDAGPDTDFAAVDEGYVSVTPLHVDLTAHSAHDVV
SDWLDSVGVGTQW
>P96112 3.1.3.5~~~surE~~~5'-nucleotidase SurE~~~COG0496
MRILVTNDDGIQSKGIIVLAELLSEEHEVFVVAPDKERSATGHSITIHVPLWMKKVFISERVVAYSTTGTPADCVKLAYN
VVMDKRVDLIVSGVNRGPNMGMDILHSGTVSGAMEGAMMNIPSIAISSANYESPDFEGAARFLIDFLKEFDFSLLDPFTM
LNINVPAGEIKGWRFTRQSRRRWNDYFEERVSPFGEKYYWMMGEVIEDDDRDDVDYKAVREGYVSITPIHPFLTNEQCLK
KLREVYD
>Q53W92 3.1.3.5~~~surE~~~5'-nucleotidase SurE~~~
MRILVTNDDGIYSPGLWALAEAASQFGEVFVAAPDTEQSAAGHAITIAHPVRAYPHPSPLHAPHFPAYRVRGTPADCVAL
GLHLFGPVDLVLSGVNLGSNLGHEIWHSGTVAAAKQGYLFGLSAAAFSVPLNGEVPDFAGLRPWLLRTLETLLRLERPFL
VNVNLPLRPKGFLWTRQSVRAYEGVVIPGEDPMGRPFYWFAPRPLKEAEEGTDRWAVAQGFVSATPLRLDLTDETRLQPT
LAHD
>Q9PF20 3.1.3.5~~~surE~~~5'-nucleotidase SurE~~~COG0496
MRVLVSNDDGVDAPGIKILADALRNAGHEVMVVAPDRDRSGASNSLTLDTPIRAKQIDMHTYSVAGTPTDCVHLALTGLL
NYDPDIVVSGINNTGNLGDDVIYSGTVSAAMEGRFLGLPAVAVSLVTLYREGQQAPQYETAAHAAINIVAQLKTDPLPAD
TILNVNVPDVTWQQMRGFKVTRLGNRHRSAPCLTQTDPRGHTIYWIGPAGPEQDAGPGTDFDAVRNTYISITPIHVDLTR
YQALENVTRWTDRLTAHMDWPT
>G8JZS4 3.2.1.3~~~susB~~~Glucan 1,4-alpha-glucosidase SusB~~~COG1082
MKKRKILSLIAFLCISFIANAQQKLTSPDNNLVMTFQVDSKGAPTYELTYKNKVVIKPSTLGLELKKEDNTRTDFDWVDR
RDLTKLDSKTNLYDGFEVKDTQTATFDETWQPVWGEEKEIRNHYNELAVTLYQPMNDRSIVIRFRLFNDGLGFRYEFPQQ
KSLNYFVIKEEHSQFGMNGDHIAFWIPGDYDTQEYDYTISRLSEIRGLMKEAITPNSSQTPFSQTGVQTALMMKTDDGLY
INLHEAALVDYSCMHLNLDDKNMVFESWLTPDAKGDKGYMQTPCNTPWRTIIVSDDARNILASRITLNLNEPCKIADAAS
WVKPVKYIGVWWDMITGKGSWAYTDELTSVKLGETDYSKTKPNGKHSANTANVKRYIDFAAAHGFDAVLVEGWNEGWEDW
FGNSKDYVFDFVTPYPDFDVKEIHRYAARKGIKMMMHHETSASVRNYERHMDKAYQFMADNGYNSVKSGYVGNIIPRGEH
HYGQWMNNHYLYAVKKAADYKIMVNAHEATRPTGICRTYPNLIGNESARGTEYESFGGNKVYHTTILPFTRLVGGPMDYT
PGIFETHCNKMNPANNSQVRSTIARQLALYVTMYSPLQMAADIPENYERFMDAFQFIKDVALDWDETNYLEAEPGEYITI
ARKAKDTDDWYVGCTAGENGHTSKLVFDFLTPGKQYIATVYADAKDADWKENPQAYTIKKGILTNKSKLNLHAANGGGYA
ISIKEVKDKSEAKGLKRL
>Q8A1G1 ~~~susC~~~TonB-dependent receptor SusC~~~COG1629
MKKGNFMFKVLLMLIAGIFLSIDAFAQQITVKGIVKDTTGEPVIGANVVVKGTTTGTITDFDGNFQLSAKQGDIIVVSFI
GYQPQELPVAAQMNVILKDDTEILDEVVVIGYGQVKKNDMTGSVMAIKPDELSKGITTNAQDMLSGKIAGVSVISNDGTP
GGGAQIRIRGGSSLNASNDPLIVIDGLAIDNEGIKGMANGLSMVNPADIETLTVLKDASATAIYGSRASNGVIIITTKKG
KNGQAPSVSYNGSVSFSKTQKRYDVLSGDEYRAYANQLWGDKLPADLGTANTDWQDQIFRTAVSTDHHVSINGGFKNLPY
RVSLGYTDDNGIVKTSNFRRFTASVNLAPSFFEDHLKFNINAKFMNGKNRYADTGAAIGGALAIDPTRPVYSNEDPYQFT
GGYWQNINSTTGFSNPDWKYTSNPNSPQNPLAALELKNDKANSNDFVGNVDVDYKFHFLPDLRLHASIGGEYAEGTQTTI
VSPYSFGNNYYGWNGDVTQYKYNLSYNIYVQYIKSLGANDFDIMVGGEEQHFHRNGFEEGQGWDSYTQEPHDAKLREQTA
YATRNTLVSYFGRLNYSLLNRYLFTFTMRWDGSSRFSKDNRWGTFPSLALGWKIKEENFLKDVNVLSDLKLRLGWGITGQ
QNIGDDFAYLPLYVVNNEYAQYPFGDTYYSTSRPKAFNENLKWEKTTTWNAGLDFGFLNGRITGGIDGYFRKTDDLLNSV
KIPVGTNFNAQMTQNIGSLENYGMEFSINAKPIVTKDFTWDLSYNITWNHNEITKLTGGDDSDYYVEAGDKISRGNNTKV
QAHKVGYAANSFYVYQQVYDENGKPIENMFVDRNGNGTIDSGDKYIYKKPAGDVLMGLTSKMQYKNFDFSFSLRASLNNY
VYYDFLSNKANVSTSGLFSNNAYSNTSAEAVALGLSGQGDYMSDYFIHNASFLRCDNITLGYSFQNLWKTQTYKGVGGRV
YATVQNPFIISKYKGLDPEVKSGIDANPYPRAMTFLLGLSLQF
>A7LXT5 ~~~~~~SusD-like protein BACOVA_02651~~~COG0561
MRIFMKSKLLVIATTALLFAACSDSFLDRAPEGNYVDATFYTSDEALEAATAPLYNRAWFDYNQRSIVPIGSGRANDMYS
PWNYPQFVTFQVTALDENLSGAWSGFYSVVTMANSVINAVETQTQGSVSEAAKTKAIAEARLMRACAYFYMLRIWGPVIL
IEDNQKLVDNPVRPLNREEDVFQFIINDLNYAVDNLSEQSDKGRATSWAAKGILAKVYLARSGWNNGGTRDEGDLELARQ
YASDVCENSGLDLMTNYEDLFKYKNNNNQESLLAMQWVPLGEWYECNTLLSDLAFSTEVTGGVNCWSSYNGSIDMLQQYE
LADTLRRNATFFTKGSYYSYICIKDGGYTYKGTASPIKKGVPGGPDDDNDGKVKQMNSPLNTYILRLADVYLTYAEACLG
NNSTLSDGRGLYFFNRVRERAKINKKSSITLDDIIRERRVEFGMEYSNWYDMVTWFRYLPDKMLNYFNNQWRGYRTDAII
KDEDGKLHFGKYDTDGTTFLEGPEYYTAPEFTINIEAEDIFLPYPESDVIQNPLLNEPPVPYTFNE
>Q8A1G2 ~~~susD~~~Starch-binding protein SusD~~~COG3637
MKTKYIKQLFSAALIAVLSSGVTSCINDLDISPIDPQTGGSFDQQGVFVKGYAMLGVTGQKGIDGSPDLDGQDEGESGFY
RTTFNCNELPTDECLWAWQENQDIPQLTSISWSPSSQRTEWVYVRLGYDITQYNFFLDQTEGMTDAETLRQRAEIRFLRA
LHYWYFLDLFGKAPFKEHFSNDLPVEKKGTELYTYIQNELNEIEADMYEPRQAPFGRADKAANWLLRARLYLNAGVYTGQ
TDYAKAEEYASKVIGSAYKLCTNYSELFMADNDENENAMQEIILPIRQDGVKTRNYGGSTYLVCGTRVAGMPRMGTTNGW
SCIFARAAMVQKFFSNLEDVPMLPADVEIPTKGLDTDEQIDAFDAEHGIRTEDMIKAAGDDRALLYSGVGGGRRKIQTDA
ISGFTDGLSIVKWQNYRSDGKPVSHATYPDTDIPLFRLAEAYLTRAEAIFRQGGDATGDINELRKRANCTRKVQTVTEQE
LIDEWAREFYLEGRRRSDLVRFGMFTTNKYLWDWKGGAMNGTSVASYYNKYPIPVSDINNNRNMSQNEGYK
>G8JZT0 ~~~susE~~~Outer membrane protein SusE~~~
MKKISNILLAVTFALPLFTACETDNDSNPILNEPDTFTLNTPAYAANNVYDLKNAQTVELTCSQPDYGFPAATTYTVQAS
FEQDFIEATDESKANYTVLESTSPTAKINVDASELNNALLDLWTAVNGEQAELPTEPVAVYIRLKANITSSGKGVCFSNV
IELPNVLISKSTSSLTPPKTMFIVGSMLDTDWKVWKPMAGVYGMDGQFYSMIYFDANSEFKFGTKENEYIGINDNRVTVT
DKAGAGVSGSDNFVVENAGWYLFYVKAAVKGDDYQFTITFYPAEVYLFGNTTGGSWAFNDEWKFTVPATKDGNFVSPAMT
ASGEVRMCFKTDLDWWRTEFTLHDGEIFYRDFNLIDSWTEKGDGYSIQGSAGNVIHLNFTAGTGEKK
>G8JZS6 ~~~susF~~~Outer membrane protein SusF~~~
MKKHLIYTGMFLAAIGFSACNEDFKDWADPQSNPQEESAGQLTATFTAGKDASIVMDAATADSVEIAKLSSTTAEEGSKI
AVNSLTLNENHTIPFSMTEDHVFKVALAQLDSVTQEAYKSRASVVRELKISINASAVTPSGEGIQLVGNEVSITLQPATT
PAVDPDGYYIVGDFTGWDGNSAQQMKKDALDENLYILEAEIESTSNFKIFPASAINGNDIDWTKALGSSVDGDDSGDNFV
SWTNAGAINTALDGKIKISFDAFNYRFTVKDNSAPTELYMTGSAYNWGTPAGDPNAWKALVPVNGTKGTFWGIFYFAAND
QVKFAPQANWGNDFGFVDAISQESKDLAGLSDEGGNLKVGIAGWYLVYVSVIGDDKVIEFEKPNVYLMGDTSYNGWDAQL
VEQDLFTVPGTADGEFVSPAFLKDGAVRICVNPKAVSAGDWWKTEFIIFDGQIAYRGNGGDQAAVQGKTGQKVYLNFGNG
TGRIE
>Q8A1G3 3.2.1.1~~~susG~~~Alpha-amylase SusG~~~COG0366
MNKHLHFLSLLWLSMLMAFMTACSDDKNITDPAPEPEPPVEGQWTALTASPDTWDETKRADISYQLLLYSFADSDGDGYG
DLNGVTQKLDYLNQLGVKALWLSPIHPCMSYHGYDVTDYTKVNPQLGTESDFDRLVTEAHNRGIKIYLDYVMNHTGTAHP
WFTEASSSSESPYRNYYSFSEDPKTDIAAGKIAMITQEGAAGYNAAEWFQVSDETAAVKGLLKFTLDWSNAPSPILVVST
GTKADEDNPDTGTDNAKYLYYGEDICKKFYDKGNNIYELTVDFESTWGLLIRTSNASFWPSGTKYGASSSSEKLALNKDF
KLTNAGNPANIMFDSQQITYFHSHFCTDWFADLNYGPVDQAGESPAYQAIADAAKGWIARGVDGLRLDAVKHIYHSETSE
ENPRFLKMFYEDMNAYYKQKGHTDDFYMIGEVLSEYDKVAPYYKGLPALFEFSFWYRLEWGINNSTGCYFAKDILSYQQK
YANYRSDYIEATKLSNHDEDRTSSKLGKSADKCKLAAAVLLTSAGHPYIYYGEELGLYGTKDNGDEYVRSPMLWGDSYTT
NYTDKTDATVSKNVKTVADQQADTHSLLNIYFSLTRLRNTYPALAEGNMTKHSVYNESQEKDYKPIAAWYMTKDNEKLLV
IHNFGGTAMQLPLTDKIEKVLFVNGETQQNTDSDSYTLKLGGYASVVFKLGN
>A0A059ZV61 2.4.1.13~~~~~~Sucrose synthase~~~COG0438
MIEALRQQLLDDPRSWYAFLRHLVASQRDSWLYTDLQRACADFREQLPEGYAEGIGPLEDFVAHTQEVIFRDPWMVFAWR
PRPGRWIYVRIHREQLALEELSTDAYLQAKEGIVGLGAEGEAVLTVDFRDFRPVSRRLRDESTIGDGLTHLNRRLAGRIF
SDLAAGRSQILEFLSLHRLDGQNLMLSNGNTDFDSLRQTVQYLGTLPRETPWAEIREDMRRRGFAPGWGNTAGRVRETMR
LLMDLLDSPSPAALESFLDRIPMISRILIVSIHGWFAQDKVLGRPDTGGQVVYILDQARALEREMRNRLRQQGVDVEPRI
LIATRLIPESDGTTCDQRLEPVVGAENVQILRVPFRYPDGRIHPHWISRFKIWPWLERYAQDLEREVLAELGSRPDLIIG
NYSDGNLVATLLSERLGVTQCNIAHALEKSKYLYSDLHWRDHEQDHHFACQFTADLIAMNAADIIVTSTYQEIAGNDREI
GQYEGHQDYTLPGLYRVENGIDVFDSKFNIVSPGADPRFYFSYARTEERPSFLEPEIESLLFGREPGADRRGVLEDRQKP
LLLSMARMDRIKNLSGLAELYGRSSRLRGLANLVIIGGHVDVGNSRDAEEREEIRRMHEIMDHYQLDGQLRWVGALLDKT
VAGELYRVVADGRGVFVQPALFEAFGLTVIEAMSSGLPVFATRFGGPLEIIEDGVSGFHIDPNDHEATAERLADFLEAAR
ERPKYWLEISDAALARVAERYTWERYAERLMTIARIFGFWRFVLDRESQVMERYLQMFRHLQWRPLAHAVPME
>D4H6M0 2.4.1.13~~~~~~Sucrose synthase~~~COG0438
MNLSNKELEGLDEIISDHREDFCPFLGRIEEEDKQFFLSSEMKEMYAGDTVPDFIASLQEAVKMPGQIYFATRASIGEWA
FVTVFTDTLDYMEVSPTEYQEAKEKTVLGENAAWMPSVDLKPFNRDFPKPSSADFIGKGVEFLNRHQSSRIFMNPEKGLK
QLLDFLRVHKYDGRQLMLNNRIDSVDKLKKALKKAQALLKNKSDETEWEEVESDMAHLGFEPGWGKKLGYVKEFLALLSD
ILAAPEPVVLEKFLDRIPMIFSLVVLSPHGFFGQAGVFGKPDTGGQVVYILDQVKALEHELKSRLDEKGLDITPKILVVT
RLIPEAEGTNCDMEEELIRGTDNCHIVRVPFRDESGEVVRQWISRFRIWPYLERFSTEAQNIILSKLQGNPDLIIGNYSD
GNLVASLIAQRLGVTQCTIAHALEKTKYLYSDLYWQDNNDKYHFACQYTADLISMNYSDFIITSTYQEIAGTNDSVGQYE
SYMNYTLPGLYRVVNGIDVFDPKFNVVSPGAAPDIFFSYKSKDRFPEHIEEIESILFEDNLEGSRGSLADPDKPLIFTMA
RLDKIKNLTGLVRWFGENEELRKTANLLVIGGFVDESLSSDDEEREQIRIMHSVIDELGLDGSVRWVGAHLGKRMTGEFY
RYVADRKGVFVQPALFEAFGLTIIEAMSSGLPVFATVYGGPSEIIEDGKSGFTLDPNKGDECAEKLLEFIQKCQSDPGHW
IKISDNALKRVEERYNWPLYAKRLMTFARVYGFWKFVTNLEREETVRYLEMLYGMVYRRLADPKEY
>I7A3T6 2.4.1.13~~~~~~Sucrose synthase~~~COG0438
MIKDIYKTAETFHNDFYDFLKAVSTQPKKLMITGELINLYVASGYDKNSGLYEFIEKIQETISLDHSVILDVRIKIASIK
FYRISLEEFLIEEISSKEFLIYKETVAKPDTLNTTLNLNFKPFYDKSPAVRDIKYIGSGVEYLNRFLSSQMFTNEERWKK
NLFDFIRLHNFNGEQLILNDRIKDTKHLNNQINAALAKLGNHPANTPYENIKHILQELGFEKGLGKDAGTITHNLNLLDQ
LLNSPDHNALAEFISSIPMILNIAIISPHGFFGQEGVLGLPDTGGQVVYILDQVKALEKQLIDSLKKSGLNLLPKIIVLT
RLIPNARGTTCNQRLEKIYGAKNSWILRVPFREYNKRVTDEWISRFEIWPYLEDFAEDSYTALLAEFKKRPDLIIGNYSD
GNLVAYLLAKKFKVTQCGIAHALEKSKYLYSALYWYDLEKYYHFSMQFTADLLAINSADFLITSSFQEIAGTEKSIGQYE
SYMHFTMPGLYRVENGVNPFHVKFNIVSPGVNEKIYFPYPKTKWRLKETKRRIENLFFSNSEDPDVIGWLDNPEKTPIFT
MSRLDRIKNISFLVRCFGESEELQQTSNLIVVAGKIDETMTDDYEEKEQIRLMHELITKYKLHNKIRWIGKLLPKDESGE
AYRIIAERRGIFVQPALFEGFGLTVLEAMTSGLPVFATKYGGPLEIIQNGVNGFHIDPVNQEETTEKIVRFLSDSYIDSS
VWDKLSKAAIKRVTEKYSWKLYSKRLLSLAKLYGFWKYATNLEHEDINAYLDLIYHTIYKSRAKILLEEHMKR
>Q820M5 2.4.1.13~~~ss2~~~Sucrose synthase~~~COG0438
MTTIDTLATCTQQNRDAVYTLLRRYFTANRTLLLQSDLREGLLQTEQDCGQSDMLRAFVFRLQEGIFSSPWAYLALRPEI
AKWEFMRIHQEHLIPEKLTISEFLKFKETVVKGEATESVLEVDFGPFNRGFPRLKESRSIGQGVIFLNRKLSSEMFSRIE
AGHTSLLHFLGVHAIEGQQLMFSNNSHDIHAVRNQLRQALEMLETLDGTTPWIELAPKMNQLGFAPGWGHNANRVAETMN
MLMDILEAPSPSALEEFLACIPMISRLLILSPHGYFGQDNVLGLPDTGGQVVYILDQVRALEKEMHDRLQLQGVQVEPKI
LIVTRLIPDAGDTTCNQRLEKVSGCTNTWILRVPFRKHNGEIIPHWISRFEIWPHLEIFAGDVEREALAELGGHPDLIIG
NYSDGNLVATLLSRRLGVTQCNIAHALEKTKYLHSDIYWQENEDKYHFSCQYTADLLAMNSADFIVTSTYQEIAGTREAE
GQYESYQAFSMPDLYRVIHGIDLFDPKFNIVSPGANADIYFPYSDPNRRLHSLIPEIESLIFDDATNLPARGYLQDPDKP
LIFTMARLDRIKNITGLVELYAASPRLRSLANLVIVGGKIDPQHSSDHEEQEQIHRMHQLMDEHELDQQVRWLGMRLDKN
LAGELYRYIADKRGIFVQPALFEAFGLTIIEAMASGLPTFATRYGGPLEIIQNNRSGFHIDPNQGAATADLIADFFEKNL
ENPQEWERISQGALDRVASRYTWKLYAERMMTLSRIYGFWKFVSGLEREETDRYLNMFYHLQFRPLANRLAHEI
>Q8DK23 2.4.1.13~~~susA~~~Sucrose synthase~~~COG0438
MTCVLLKAVVESDERADLRQFSRILQLGEKRYLLRNDILDAFADYCRDQERPVPPPSESRLSKLVFYTQEIIVDNESLCW
IVRPRIAQQEVCRLLVEDLTIVPMTIPELLDLRDRLVNHYHPNEGDVFEIDVQPFYDYSPIIRDAKNIGKGVEFLNRYLS
SKLFQDPRQWQQNLFNFLRIHRYNGYQLLINERIRSPQHLSEQVKQALVVLSDRPPTEAYSEFRFELQNLGFEPGWGNTV
ARVRDTLEILDQLLDSPDHQVLEAFVSRIPMLFRIALISPHGWFGQEGVLGRPDTGGQVVYILDQVKSLEKQMREDLELA
GLGVLEAQPKIIVLTRLIPNAEGTLCNQRLEKIYGTNDAWILRVPFREFNPKVTQNWISRFEIWPYLETFAIDAERELRA
EFGHVPDLIIGNYSDGNLVAFLLARRLKVTQCNIAHALEKSKYLFSNLYWQDLEDKYHFSLQFTADLIAMNAANFIISST
YQEIVGTPDSIGQYESYQSFTMPDLYHVVNGIELFSPKFNVVPPGVNEQVYFPYYHYTERLEGDRQRLEELLFTLEDPQQ
IYGYLEAPEKRPLFSMARLDRIKNLTGLAEAFGRSKALQERCNLILVAGKLRTADSSDREEIAEIEKLYQIIHQYNLHGK
IRWLGIRLPKADSGEIYRIIADRQGIFVQPALFEAFGLTILEAMISGLPTFGTRFGGPLEIIQDGVNGFYINPTHLEEMA
ETIVRFLEACDRDPQEWQRISKAGIERVYSTYTWKIHCTRLLSLAKIYGFWNFSSQENREDMMRYMEALFHLLYKPRAQA
LLAEHLQR
>A0A0H2ZJC1 ~~~sutA~~~Transcriptional regulator SutA~~~
MSEEELEQDELDGADEDDGEELAAADDGEADSSDGDEAPAPGKKAKAAVVEEELPSVEAKQKERDALAKAMEEFLSRGGK
VQEIEPNVVADPPKKPDSKYGSRPI
>P77626 ~~~sutR~~~HTH-type transcriptional regulator SutR~~~COG1396
MENLARFLSTTLKQLRQQRGWSLSRLAEATGVSKAMLGQIERNESSPTVATLWKIATGLNVPFSTFISPPQSATPSVYDP
QQQAMVITSLFPYDPQLCFEHFSIQMASGAISESTPHEKGVIEHVVVIDGQLDLCVDGEWQTLNCGEGVRFAADVTHIYR
NGGEQTVHFHSLIHYPRS
>Q1QWP1 4.4.1.24~~~suyA~~~(2R)-sulfolactate sulfo-lyase subunit alpha~~~COG2721
MSIDFVVHDADDAVGVVVVEGVEAGQMLTGWVMDQDRTLQFEVKDAIPIGHKLAIRDLAEDETVIKYSVDIGRVVQSIRQ
GEHVHVHNVKTKRW
>Q58Y44 4.4.1.24~~~suyA~~~(2R)-sulfolactate sulfo-lyase subunit alpha~~~
MLCVVTSDNSDFRLTAKADIPIGHKVALKALKAGDTVIKYHEDIGKMVGDAEVGGHVHTHNCKTKRW
>Q1QWP0 4.4.1.24~~~suyB~~~(2R)-sulfolactate sulfo-lyase subunit beta~~~COG2721
MELKGRTFLGYRRDNGRVGIRNHVIVLPVDDISNAAAEAVANNIKGTLALPHPYGRLQFGADLDLHFRTLIGTGCNPNVA
AVIVIGIEPGWTGKVVDGIRATGKPVEGFWIEQNGDHNTIANASRKAREFVQYASELQREPCDVSELWVSTKCGESDTTS
GCGANPTVGEAFDKLYEQGCTLVFGETSELTGGEHLVAARCANDDVRERFQAMFDRYSAMIDRHKTSDLSESQPTKGNIE
GGLTTIEEKALGNIQKIGKRCRVDGVLDKAETPTGPGLWFMDSSSAAAEMVTLCAASGYVAHFFPTGQGNVIGNPILPVI
KLCANPRTVRTMSEHIDVDVSGVLRREINLQEAGDQLLEMLLRTANGRHTNAEALGHREFVLTRLYESA
>Q58Y43 4.4.1.24~~~suyB~~~(2R)-sulfolactate sulfo-lyase subunit beta~~~
MALDFSNATVKAWRRENGRVGVRNHVLILPVDDISNAACEAVANNVKGTLAIPHAYGRLQFGEDLELHFRTIIGTGANPN
VAAVVVIGIEPEWTQVIVDGIAKTGKPVTGFSIEQKGDFETIRQAGWKAKEYVHWASELQKEDCPISDLWISTKCGESDT
TTGLSSCPTVGNMYDKLLPQGIYGCFGETSEITGAEHICEKRAANPETARKFKEIWQAYSDDVIFAHQTDDLSDSQPTKG
NILGGLTTIEEKALGNLEKIGRTSTYIDAMGPAETPSKGPGLYFMDSSSAAAECVTLMAAGGYVIHTFPTGQGNVVGNPI
VPVIKISGNPRTLRTMSEHIDVDVTGVLTREMTIDQAGDALIEMIIRTANGRMTAAEALGHREFSMTKLYRSA
>P86041 ~~~~~~Scytovirin~~~
GSGPTYCWNEANNPGGPNRCSNNKQCDGARTCSSSGFCQGTSRKPDPGPKGPTYCWDEAKNPGGPNRCSNSKQCDGARTC
SSSGFCQGTAGHAAA
>Q89G85 ~~~~~~Sugar transporter SemiSWEET~~~COG4095
MDPFLIKLIGFAAATCTTVAYAPQFIKVLKTRSARDISLGMFLVMVLGLALWLIYGLLSGDAPLIASNAVTMLLAGGILV
MKLRYG
>P0DMV3 ~~~~~~Sugar transporter SemiSWEET~~~
MDTILLTGLFAAFFTTFAFAPQSIKTIRTRNTEGISVVMYIMFLTGVISWIAYGIMRSDFAVLIANIVTLFLAAPVLVIT
LINRRKKHV
>B0SR19 ~~~~~~Sugar transporter SemiSWEET~~~
MENLIGYVAAFLTTVSFLPQVLRVVMTKQTRDISRNMYIMFFLGVVLWFVYGILRSDLPIILANVVTLFFVTIILYYKLT
EGNQT
>F9RBV9 ~~~~~~Sugar transporter SemiSWEET~~~
MALIERIGKALEPLMLVMGLISPLATMPQLYKLYVSHSEHALGLSLTTWLLYSFIALLWTIYGIYHKNPTIWVGNCLGFL
MYVAMVVGIIAHTGGTY
>O31501 ~~~swrC~~~Swarming motility protein SwrC~~~COG0841
MNHVINFVLKNKFAVWLMTIIVTAAGLYAGMNMKQESIPDVNMPYLTISTTYPGATPSQVADEVTKPVEQAVQNLDGVSV
VTSTSYENASSVMIEYDYEKDMDKAKTEAAEALENVNLPDDAKDPEISRYSLNSFPILTLSVSSDKDNLQELTKQVEDSL
VSKLEGIEGVASVQVSGQQVEEVEFSFKEDKLKEYGLDEDTVKQVIQGSDVTTPLGLYTFGNEEKSVVVSGDIETIKDLK
NMRIPTASASSAGSSAASQAGAQSAQAAQSAQAAAQVQQSASTAVPTVKLSDIATIKDVKKAESVSRTNGKDSIGINIVK
ANDANTVEVADDVKAELKKFKEDHKGFNYSATLDMAEPITQSVDTMLSKAIFGAIFAIVIILLFLRDIKSTLISVVSIPL
SLLIALLVLQQLDITLNIMTLGAMTVAIGRVVDDSIVVIENIYRRMRLKDEPLRGKALVREATKEMFKPIMSSTIVTIAV
FLPLALVGGQIGELFIPFALTIVFALAASLVISITLVPMLAHSLFKKSLTGAPIKAKEHKPGRLANIYKKVLNWALSHKW
ITSIIAVLMLLGSLFLVPLIGASYLPSEEEKTMQLTYSPEPGETKKEAENEAEKAEKILLDRKHVDTVQYSLGSGSPLAG
GDSNGALFYIKYESDTPDFDKEKDNVLKEIQKQSDRGEWKSQDFSSSGNNNELTYYVYGDSENDIKDTVKDIEKIMKDEK
DLKNVNSGLSSTYDEYTFVADQEKLSKLGLTASQISQALMSQTSQEPLTTVKKDGKELDVNIKTEKDEYKSVKDLENKKI
TSATGQEVKIGDVAKVKEGSTSDTVSKRDGKVYADVTGEVTSDNVTAVSAAIQKKIDKLDHPDNVSIDTGGVSADIADSF
TKLGLAMLAAIAIVYLVLVITFGGALAPFAILFSLPFTVIGALVGLYVSGETISLNAMIGMLMLIGIVVTNAIVLIDRVI
HKEAEGLSTREALLEAGSTRLRPILMTAIATIGALIPLALGFEGGSQVISKGLGVTVIGGLISSTLLTLLIVPIVYEVLA
KFRKKKPGTEEE
>C0H412 ~~~swrD~~~Swarming motility protein SwrD~~~COG1582
MIKVTRLNGQPFTLNALFIEQIECFPDTTITLSNGKKFVVKEDEEAVLEKIAAFYRKIQIFAMDQGIEEPE
>P75869 ~~~sxy~~~Protein Sxy~~~COG3070
MKSLSYKRIYKSQEYLATLGTIEYRSLFGSYSLTVDDTVFAMVSDGELYLRACEQSAQYCVKHPPVWLTYKKCGRSVTLN
YYRVDESLWRNQLKLVRLSKYSLDAALKEKSTRNTRERLKDLPNMSFHLEAILGEVGIKDVRALRILGAKMCWLRLRQQN
SLVTEKILFMLEGAIIGIHEAALPVARRQELAEWADSLTPKQEFPAELE
>O67323 6.1.1.7~~~alaS~~~Alanine--tRNA ligase~~~COG0013
MSLSAHEIRELFLSFFEKKGHTRVKSAPLVPENDPTLLFVNAGMVPFKNVFLGLEKRPYKRATSCQKCLRVSGKHNDLEQ
VGYTSRHHTFFEMLGNFSFGDYFKKEAIEYAWEFVTEVLKLPKEKLYVSVYKDDEEAYRIWNEHIGIPSERIWRLGEEDN
FWQMGDVGPCGPSSEIYVDRGEEYEGDERYLEIWNLVFMQYNRDENGVLTPLPHPNIDTGMGLERIASVLQGKNSNFEID
IIFPLIQFGEEVSGKKYGEKFETDVALRVIADHLRAITFAISDGVIPSNEGRGYVIRRILRRAMRFGYKLGIENPFLYKG
VDLVVDIMKEPYPELELSREFVKGIVKGEEKRFIKTLKAGMEYIQEVIQKALEEGRKTLSGKEVFTAYDTYGFPVDLIDE
IAREKGLGIDLEGFQCELEEQRERARKHFKVEAKKVKPVYSHLKELGKTSAFVGYEHMEWESQVVGLVKGEGLVSELKEG
EEGEVVLKETPFYPEGGGQIGDAGIIESDKALFKVEDTQKPTEGIIVHIGKVLKGTLKVGDTVHARVDKERRWDIMRNHT
ATHLLHAALRNVLGEHVRQAGSLVADKYLRFDFTHFSALTEEELKRVEELVNEKIRENLPVNVMEMAYDEALKTGAIAIF
EEKYGERVRVISCGEFSKELCGGTHVSATGDIGYFKIISESSVGAGVRRIVAQTGRWSVETAFKEHQTLKKASSALGVGE
EEVIQKIEELKEEIKDREREIQRLKQELLKLQIREVVKEENVGDFTLHYGVFEEVEPEELRNLADMLRQRTKKDVVFIAS
RKGDKINFVIGVSKEISDKVNAKEVIREVGKVLKGGGGGRADLAQGGGKAPDKFPEAVKLLKEILSG
>P00957 6.1.1.7~~~alaS~~~Alanine--tRNA ligase~~~COG0013
MSKSTAEIRQAFLDFFHSKGHQVVASSSLVPHNDPTLLFTNAGMNQFKDVFLGLDKRNYSRATTSQRCVRAGGKHNDLEN
VGYTARHHTFFEMLGNFSFGDYFKHDAIQFAWELLTSEKWFALPKERLWVTVYESDDEAYEIWEKEVGIPRERIIRIGDN
KGAPYASDNFWQMGDTGPCGPCTEIFYDHGDHIWGGPPGSPEEDGDRYIEIWNIVFMQFNRQADGTMEPLPKPSVDTGMG
LERIAAVLQHVNSNYDIDLFRTLIQAVAKVTGATDLSNKSLRVIADHIRSCAFLIADGVMPSNENRGYVLRRIIRRAVRH
GNMLGAKETFFYKLVGPLIDVMGSAGEDLKRQQAQVEQVLKTEEEQFARTLERGLALLDEELAKLSGDTLDGETAFRLYD
TYGFPVDLTADVCRERNIKVDEAGFEAAMEEQRRRAREASGFGADYNAMIRVDSASEFKGYDHLELNGKVTALFVDGKAV
DAINAGQEAVVVLDQTPFYAESGGQVGDKGELKGANFSFAVEDTQKYGQAIGHIGKLAAGSLKVGDAVQADVDEARRARI
RLNHSATHLMHAALRQVLGTHVSQKGSLVNDKVLRFDFSHNEAMKPEEIRAVEDLVNTQIRRNLPIETNIMDLEAAKAKG
AMALFGEKYDERVRVLSMGDFSTELCGGTHASRTGDIGLFRIISESGTAAGVRRIEAVTGEGAIATVHADSDRLSEVAHL
LKGDSNNLADKVRSVLERTRQLEKELQQLKEQAAAQESANLSSKAIDVNGVKLLVSELSGVEPKMLRTMVDDLKNQLGST
IIVLATVVEGKVSLIAGVSKDVTDRVKAGELIGMVAQQVGGKGGGRPDMAQAGGTDAAALPAALASVKGWVSAKLQ
>A0QWQ4 6.1.1.7~~~alaS~~~Alanine--tRNA ligase~~~COG0013
MQTHEIRKRFLDHFVKAGHTEVPSASVILDDPNLLFVNAGMVQFVPYFLGQRTPPWNRATSIQKCIRTPDIDEVGITTRH
NTFFQMAGNFSFGDYFKRGAIELAWTLLTNPVEEGGYGFDPERLWATVYLDDDEAIGLWQEVAGLPAERIQRRGMADNYW
SMGIPGPCGPSSEIYYDRGPEYGVEGGPEANEDRYIEIWNLVFMQNERGEGTSKEDFEILGPLPRKNIDTGMGIERVACL
LQGVDNVYETDLLRPVIDKVAAVAPRGYGAGNHDDDVRYRIIADHTRTAAIIIADGVSPGNEGRGYVLRRLLRRIIRAAK
LLGVEQPVMGDLIATVRDAMGPSYPELVTDFERINRIAVAEETAFNRTLASGSKLFEDAARATKKSGATVLSGSDAFTLH
DTYGFPIDLTLEMAAEAGLSVDQEGFRTLMAEQRQRAKADAAARKQAHTDLSAYRELVDAGPTEFTGFDELTSEATILGI
FVDGKRVPVVSHDGLEADRVELILDRTPFYAEAGGQIADEGTISGTGASGTARAAVTDVQKIARTLWAHRVNVESGEFVE
GDTVTAAVDPKWRRGATQGHSGTHMVHAALREVLGPNAVQAGSLNRPGYLRFDFNWQGPLSDDQRTQIEEVTNQAVEADY
EVHTFVTELEKAKAMGAMAMFGERYPDQVRVVEIGGPFSLELCGGTHVHNSAQIGPVTILGESSVGSGVRRVEAYVGLDS
FRHLAKERALMAGLASSLKVPSEEVPARVANLVERLKAAEKELDRMRLANARAAAVNAVAGAETVGKVRLVAQRMSGGMS
ANDLRSLVGDIRGKLGSEPAVVALIAEGENDAVPFVVAVNPAAQDLGLRANDLVKQFAAPVNGRGGGKADLAQGSGKGAA
GIDAALAALRAEIGRS
>P9WFW7 6.1.1.7~~~alaS~~~Alanine--tRNA ligase~~~COG0013
MQTHEIRKRFLDHFVKAGHTEVPSASVILDDPNLLFVNAGMVQFVPFFLGQRTPPYPTATSIQKCIRTPDIDEVGITTRH
NTFFQMAGNFSFGDYFKRGAIELAWALLTNSLAAGGYGLDPERIWTTVYFDDDEAVRLWQEVAGLPAERIQRRGMADNYW
SMGIPGPCGPSSEIYYDRGPEFGPAGGPIVSEDRYLEVWNLVFMQNERGEGTTKEDYQILGPLPRKNIDTGMGVERIALV
LQDVHNVYETDLLRPVIDTVARVAARAYDVGNHEDDVRYRIIADHSRTAAILIGDGVSPGNDGRGYVLRRLLRRVIRSAK
LLGIDAAIVGDLMATVRNAMGPSYPELVADFERISRIAVAEETAFNRTLASGSRLFEEVASSTKKSGATVLSGSDAFTLH
DTYGFPIELTLEMAAETGLQVDEIGFRELMAEQRRRAKADAAARKHAHADLSAYRELVDAGATEFTGFDELRSQARILGI
FVDGKRVPVVAHGVAGGAGEGQRVELVLDRTPLYAESGGQIADEGTISGTGSSEAARAAVTDVQKIAKTLWVHRVNVESG
EFVEGDTVIAAVDPGWRRGATQGHSGTHMVHAALRQVLGPNAVQAGSLNRPGYLRFDFNWQGPLTDDQRTQVEEVTNEAV
QADFEVRTFTEQLDKAKAMGAIALFGESYPDEVRVVEMGGPFSLELCGGTHVSNTAQIGPVTILGESSIGSGVRRVEAYV
GLDSFRHLAKERALMAGLASSLKVPSEEVPARVANLVERLRAAEKELERVRMASARAAATNAAAGAQRIGNVRLVAQRMS
GGMTAADLRSLIGDIRGKLGSEPAVVALIAEGESQTVPYAVAANPAAQDLGIRANDLVKQLAVAVEGRGGGKADLAQGSG
KNPTGIDAALDAVRSEIAVIARVG
>P67011 6.1.1.7~~~alaS~~~Alanine--tRNA ligase~~~
MKKLKASEIRQKYLDFFVEKGHMVEPSAPLVPIDDDTLLWINSGVATLKKYFDGRETPKKPRIVNSQKAIRTNDIENVGF
TARHHTFFEMLGNFSIGDYFKQEAIEFAWEFLTSDKWMGMEPDKLYVTIHPEDMEAYNIWHKDIGLEESRIIRIEGNFWD
IGEGPSGPNTEIFYDRGEAYGQDDPAEEMYPGGENERYLEVWNLVFSEFNHNKDHSYTPLPNKNIDTGMGLERMASVSQN
VRTNYETDLFMPIMNEIEKVSGKQYLVNNEQDVAFKVIADHIRTIAFAISDGALPANEGRGYVLRRLLRRAVRFSQTLGI
NEPFMYKLVDIVADIMEPYYPNVKEKADFIKRVIKSEEERFHETLEDGLAILNELIKKAKATTNEINGKDAFKLYDTYGF
PIELTEEIAVQAGLKVDMTTFESEMQQQRDRARQARQNSQSMQVQSEVLKNITSASTFVGYDTATAQTTLTHLIYNGEEV
SQVEAGETVYFMLTETPFYAVSGGQVADTGIVYNDNFEIAVSEVTKAPNGQNLHKGVVQFGQVNVGATVSAEVNQNDRRD
IQKNHSATHLLHAALKSVLGDHVNQAGSLVEADRLRFDFSHFGPMTNDEIDQVERLVNEEIWKGIDVNIQEMDIASAKEM
GAMALFGEKYGDVVRVVNMAPFSIELCGGIHVRNTSEIGLFKIVSESGTGAGVRRIEALTGKAAFLYLEDIQEKFNTMKS
QMKVKSDDQVVEKLTQLQDEEKALLKQLEQRDKEITSLKMGNIEDQVEEINGYKVLVTEVDVPNAKAIRSTMDDFKSKLQ
DTIIILASNVDDKVSMVATVPKSLTNNVKAGDLIKQMAPIVGGKGGGRPDMAQGGGTQPENISKSLSFIKDYIKNL
>P0C2V8 ~~~sycN~~~Chaperone protein SycN~~~
MSWIEPIISHFCQDLGVPTSSPLSPLIQLEMAQSGTLQLEQHGATLTLWLARSLAWHQCEDAMVKALTLTAAQKSGALPL
RAGWLGENQLVLFVSLDERSLTLPLLHQAFEQLLRLQQEVLAP
>P61380 ~~~sycN~~~Chaperone protein SycN~~~
MSWIEPIISHFCQDLGVPTSSPLSPLIQLEMAQSGTLQLEQHGATLTLWLARSLAWHRCEDAMVKALTLTAAQKSGALPL
RAGWLGESQLVLFVSLDERSLTLPLLHQAFEQLLRLQQEVLAP
>P0C2V9 ~~~sycT~~~Chaperone protein SycT~~~
MQTTFTELMQQLFLKLGLNHQVNENDVYTFEVDGHIQVLIACYHQQWVQLFSELGADLPTNDNLFGEHWPAHVQGRLDGK
SILWSQQSLVGLDIDEMQAWLERFIDDIEQRKEPQNTKFQPNSTSPILFI
>Q06752 6.1.1.16~~~cysS~~~Cysteine--tRNA ligase~~~COG0215
MTITLYNTLTRQKETFVPLEEGKVKMYVCGPTVYNYIHIGNARPAIVYDTVRNYLEYKGYDVQYVSNFTDVDDKLIKAAN
ELGEDVPTISERFIKAYFEDVGALGCRKADLHPRVMENMDAIIEFVDQLVKKGYAYESEGDVYFKTRAFEGYGKLSQQSI
DELRSGARIRVGEKKEDALDFALWKAAKEGEISWDSPWGKGRPGWHIECSAMVKKYLGDQIDIHAGGQDLTFPHHENEIA
QSEALTGKTFAKYWLHNGYINIDNEKMSKSLGNFVLVHDIIKQHDPQLLRFFMLSVHYRHPINYSEELLENTKSAFSRLK
TAYSNLQHRLNSSTNLTEDDDQWLEKVEEHRKAFEEEMDDDFNTANAISVLFDLAKHANYYLQKDHTADHVITAFIEMFD
RIVSVLGFSLGEQELLDQEIEDLIEKRNEARRNRDFALSDQIRDQLKSMNIILEDTAQGTRWKRGE
>O51545 6.1.1.16~~~cysS~~~Cysteine--tRNA ligase~~~
MILKLYNTRTKDFSELTNFENVKVYACGPTVYNYAHIGNFRTYIFGDLLIKTLRFLGYKVNYAMNITDIGHLTGDLDDGE
DKVAKTAREKGLTVYEISEFFTEAFFNDCRKLNIVYPDKVLVASKHIPIMIEVVKILEEKKITYFSNGNVYFDTSCFKSY
GEMAGIDLIDKDMTLPRVDVDKFKRNKTDFVLWFTNSKFKDQEMKWDSPWGFGYPSWHLECAAMNLEYFKDALDIHLGGV
DHIGVHHINEIAIAECFLNKKWCDVFVHGEFLIMDYNKMSKSRGNFITVKDLEDQNFSPLDFRYLCLTSHYRNQLKFSLD
NLQASKIARENLINKLSYFYESLDPVDLNTLNKDLKNFGFSVEKEYYDSFVEKISFDLNVAQGLALLWEIIKSDNLSFVS
KLRLAFIFDEIMSLNLREEILKNLQNHDVVIDENMKALIEERRIAKCEKNFKRADEIRDFFAKKGFVLVDTKEGTKVKRG
>Q83BL7 6.1.1.16~~~cysS~~~Cysteine--tRNA ligase~~~COG0215
MSVKIFNSLTKQKEIFKPIESGKVKLYVCGMTVYDYMHIGHGRSWIIFDMVVRYLRMRGYEVTFVRNITDIDDKIIKRAG
ENKESPAALAERFIQILHEDEKALRVLSPDQEPRATQYVPEIIKLIQKLLDNQYAYTGQNGDVFFDVRRFKDYGKLSHRH
LDELQAGARVEVSDSKRDPLDFVLWKKAKPGEPKWDSPWGEGRPGWHIECSAMSSSILGQPFDIHGGGLDLKFPHHENEI
AQSEAGEEKPFVKLWMHAGLLEINKEKMSKSLGNIISIREALKESDVEVLRYFLLSGHYRNPLSYSKENLENGRLALERF
YLALRGLPVVNHEKTSSYTDRFYEAMDDDFNTPIAFALLFEMVREINRFRDNNQIEKAAVLAAELKCLGNIFGLLQYSPE
QFLQGAKKEADVQEIKKLIDQRNEARAKKDWKTADQIRDQLTDLGVAIEDSSDGTSWRQE
>P21888 6.1.1.16~~~cysS~~~Cysteine--tRNA ligase~~~COG0215
MLKIFNTLTRQKEEFKPIHAGEVGMYVCGITVYDLCHIGHGRTFVAFDVVARYLRFLGYKLKYVRNITDIDDKIIKRANE
NGESFVAMVDRMIAEMHKDFDALNILRPDMEPRATHHIAEIIELTEQLIAKGHAYVADNGDVMFDVPTDPTYGVLSRQDL
DQLQAGARVDVVDDKRNPMDFVLWKMSKEGEPSWPSPWGAGRPGWHIECSAMNCKQLGNHFDIHGGGSDLMFPHHENEIA
QSTCAHDGQYVNYWMHSGMVMVDREKMSKSLGNFFTVRDVLKYYDAETVRYFLMSGHYRSQLNYSEENLKQARAALERLY
TALRGTDKTVAPAGGEAFEARFIEAMDDDFNTPEAYSVLFDMAREVNRLKAEDMAAANAMASHLRKLSAVLGLLEQEPEA
FLQSGAQADDSEVAEIEALIQQRLDARKAKDWAAADAARDRLNEMGIVLEDGPQGTTWRRK
>P9WFW1 6.1.1.16~~~cysS~~~Cysteine--tRNA ligase~~~COG0215
MTDRARLRLHDTAAGVVRDFVPLRPGHVSIYLCGATVQGLPHIGHVRSGVAFDILRRWLLARGYDVAFIRNVTDIEDKIL
AKAAAAGRPWWEWAATHERAFTAAYDALDVLPPSAEPRATGHITQMIEMIERLIQAGHAYTGGGDVYFDVLSYPEYGQLS
GHKIDDVHQGEGVAAGKRDQRDFTLWKGEKPGEPSWPTPWGRGRPGWHLECSAMARSYLGPEFDIHCGGMDLVFPHHENE
IAQSRAAGDGFARYWLHNGWVTMGGEKMSKSLGNVLSMPAMLQRVRPAELRYYLGSAHYRSMLEFSETAMQDAVKAYVGL
EDFLHRVRTRVGAVCPGDPTPRFAEALDDDLSVPIALAEIHHVRAEGNRALDAGDHDGALRSASAIRAMMGILGCDPLDQ
RWESRDETSAALAAVDVLVQAELQNREKAREQRNWALADEIRGRLKRAGIEVTDTADGPQWSLLGGDTK
>Q99W73 6.1.1.16~~~cysS~~~Cysteine--tRNA ligase~~~
MITLYNTLTRQKEVFKPIEPGKVKMYVCGPTVYNYIHIGNARPAINYDVVRRYFEYQGYNVEYVSNFTDVDDKLIKRSQE
LNQSVPEIAEKYIAAFHEDVGALNVRKATSNPRVMDHMDDIIQFIKDLVDQGYAYESGGDVYFRTRKFEGYGKLSHQSID
DLKVGARIDAGEHKEDALDFTLWKKAKPGEISWNSPFGEGRPGWHIECSVMAFHELGPTIDIHAGGSDLQFPHHENEIAQ
SEAHNHAPFANYWMHNGFINIDNEKMSKSLGNFILVHDIIKEVDPDVLRFFMISVHYRSPINYNLELVESARSGLERIRN
SYQLIEERAQIATNIENQQTYIDQIDAILNRFETVMNDDFNTANAITAWYDLAKLANKYVLENTTSTEVIDKFKAVYQIF
SDVLGVPLKSKNADELLDEDVEKLIEERNEARKNKDFARADEIRDMLKSQNIILEDTPQGVRFKRG
>O84546 6.1.1.23~~~aspS~~~Aspartate--tRNA(Asp/Asn) ligase~~~
MKYRTHKCNELSLDHVGEHVRLSGWVHRYRNHGGVVFIDLRDCFGITQIVCRQEENPELHQLMDQVRSEWVLCVEGLVCA
RLEGMENPNLVTGSIEVEVSSLEVLSRAQNLPFSISDEHINVNEELRLTYRYLDMRRGDILDRLMCRHKVMLACRQYLDE
QGFTEVVTPILGKSTPEGARDYLVPSRIYPGNFYALPQSPQLFKQILMVGGLDRYFQIATCFRDEDLRADRQPEFTQIDM
EMSFGGPEDLFPVVEELVARLFAVKGIELKAPFLRMTYQEAKDSYGTDKPDLRFGLRLKNCCEYARKFTFSIFLDQLAYG
GTVKGFCVPGGADMSRKQLDIYTDFVKRYGAMGLVWIKKQDGGVSSNVAKFASEDVFQEMFEAFEAKDQDILLLIAAPEA
VANQALDHLRRLIARERQLYDSTQYNFVWITDFPLFAKEEGELCPEHHPFTAPLDEDISLLDSDPFAVRSSSYDLVLNGY
EIASGSQRIHNPDLQNKIFALLKLSQESVKEKFGFFIDALSFGTPPHLGIALGLDRIMMVLTGAETIREVIAFPKTQKAG
DLMMSAPSEILPIQLKELGLKL
>Q9RVH4 6.1.1.23~~~aspS2~~~Aspartate--tRNA(Asp/Asn) ligase~~~COG0017
MTSPLKRTLTRELPQHEGQTVKLQGFVHARRDLGGVQFLVLRDVTGVTQCVGSGLTLPLAESSVEVVGKVKAHPKAPGGF
EVQVEDFRVISAATEATPVEIPKMEWNVNPETMLDYRVVTVRGLKERAALKVQAELVDAFRAHLRGEGFTEISTPKIVSA
GAEGGANLFPIDYFGHPAYLAQSPQLYKQIMVGVFERVFEVAAVYRAEEHATSRHLNEYLSLDVEMGFIEDEEDVMGLEN
RLLASIMERLRATSQAEFELLGATIPDVPAHIPRITLMDARQLVTEKYGHPVGGKDLDPEAERLLSQHFAETEGSDFVFV
TKYPRAARPFYAHPELNEDGSVNGEVTRGFDLLFRGIEITSGGQRIHDYGMLMDSIAAYKLKPESLEGYTEVFKYGMPPH
GGFAIGAERLTAKLLGIANVRYARAFPRDRHRLTP
>P56459 6.1.1.23~~~aspS~~~Aspartate--tRNA(Asp/Asn) ligase~~~COG0173
MRSHFCTEISEKDVGKIVKVAGWCNTYRDHGGVVFIDLRDKSGLVQLVCDPSSKAYEKALEVRSEFVLVAKGKVRLRGAG
LENPKLKTGKIEIVLEELIIENKSATPPIEIGNKHVNEDLRLKYRYLDLRSPNSYEIFKLRSEVALITRNTLAQKGFLEI
ETPILSKTTPEGARDYLVPSRVHEGEFFALPQSPQLFKQLLMVGGMDRYFQIARCFRDEDLRADRQPEFTQIDAEMSFCD
ENDVMGVVEDLLQEIFKAVGHTISKPFKRMPYKEAMENYGSDKPDLRFELPLIEVGDCFRDSSNAIFSNTAKDPKNKRIK
ALNVKGADALFSRSVLKELEEFVRQFGAKGLAYLQIKEDEIKGPLVKFLSEKGLKNILERTDAQVGDIVFFGAGDKKIVL
DYMGRLRLKVAETLDLIDKDALNFLWVVNFPMFEKTENGYHAAHHPFTMPKNIECEDIEEVEAHAYDVVLNGVELGGGSI
RIHKEEMQKKVFEKINIHEEEAQKKFGFLLEALKFGAPPHGGFAIGFDRLIMLMTKSHSIRDVIAFPKTQKASCLLTNAP
SPINEEQLRELHIRLRK
>A0QWN3 6.1.1.23~~~aspS~~~Aspartate--tRNA(Asp/Asn) ligase~~~COG0173
MLRTHAAGSLRPADAGQTVTLAGWVARRRDHGGVIFIDLRDASGVSQVVFREGDVLAAAHRLRAEFCVAVTGVVEVRPEG
NENPEIPTGQIEVNATELTVLGESAPLPFQLDEQAGEEARLKYRYLDLRREGPGNALRLRSKVNAAARSVLAEHDFVEIE
TPTLTRSTPEGARDFLVPARLQPGSFYALPQSPQLFKQLLMVAGMERYYQIARCYRDEDFRADRQPEFTQLDMEMSFVEA
DDVIAISEQVLKAVWATIGYDLPLPLPRISYEEAMRRFGSDKPDLRFGIELVECTEYFKDTTFRVFQAPYVGAVVMPGGA
SQPRRTLDGWQEFAKQRGHKGLAYVLVGEDGTLGGPVAKNLSDAERDGLVAHVGANPGDCIFFAAGPAKGARALLGATRI
EIAKRLDLIDPNAWAFTWVVDFPMFEAADEATAAGDVAVGSGAWTAMHHAFTAPKPDSVDTFDSDPGNALSDAYDIVCNG
NEIGGGSIRIHRRDIQERVFAMMGIDHDEAQEKFGFLLDAFSYGAPPHGGIAFGWDRITALLAGVDSIREVIAFPKSGGG
VDPLTDAPAPITPQQRKESGIDAKPREDKPKEDAKSKA
>P9WFW3 6.1.1.23~~~aspS~~~Aspartate--tRNA(Asp/Asn) ligase~~~COG0173
MLRSHAAGLLREGDAGQQVTLAGWVARRRDHGGVIFIDLRDASGIAQVVFRDPQDTEVLAQAHRLRAEFCVSVAGVVEIR
PEGNANPEIATGEIEVNATSLTVLGECAPLPFQLDEPAGEELRLKYRYLDLRRDDPAAAIRLRSRVNAAARAVLARHDFV
EIETPTITRSTPEGARDFLVPARLHPGSFYALPQSPQLFKQLLMVAGMERYYQIARCYRDEDFRADRQPEFTQLDMEMSF
VDAEDIIAISEEVLTELWALIGYRIPTPIPRIGYAEAMRRFGTDKPDLRFGLELVECTDFFSDTTFRVFQAPYVGAVVMP
GGASQPRRTLDGWQDWAKQRGHRGLAYVLVAEDGTLGGPVAKNLTEAERTGLADHVGAKPGDCIFFSAGPVKSSRALLGA
ARVEIANRLGLIDPDAWAFVWVVDPPLFEPADEATAAGEVAVGSGAWTAVHHAFTAPKPEWEDRIESDTGSVLADAYDIV
CNGHEIGGGSVRIHRRDIQERVFAVMGLDKAEAEEKFGFLLEAFMFGAPPHGGIAFGWDRTTALLAGMDSIREVIAFPKT
GGGVDPLTDAPAPITAQQRKESGIDAQPKRVQQA
>Q51422 6.1.1.23~~~aspS~~~Aspartate--tRNA(Asp/Asn) ligase~~~
MMRSHYCGQLNESLDGQEVTLCGWVHRRRDHGGVIFLDVRDREGLAQVVFDPDRAETFAKADRVRSEFVVKITGKVRLRP
EGARNPNMASGSIEVLGYELEVLNQAETPPFPLDEYSDVGEETRLRYRFIDLRRPEMAAKLKLRARITSSIRRYLDDNGF
LDVETPILGRPTPEGARDYLVPSRTYPGHFFALPQSPQLFKQLLMVAGFDRYYQIAKCFRDEDLRADRQPEFTQIDIETS
FLDESDIIGITEKMVRQLFKEVLDVEFDEFPHMPFEEAMRRYGSDKPDLRIPLELVDVADQLKEVEFKVFSGPANDPKGR
VAALRVPGAASMPRSQIDDYTKFVGIYGAKGLAYIKVNERAKGVEGLQSPIVKFIPEANLNVILDRVGAVDGDIVFFGAD
KAKIVCDALGALRIKVGHDLKLLTREWAPMWVVDFPMFEENDDGSLSALHHPFTSPKCTPAELEANPGAALSRAYDMVLN
GTELGGGSIRIHDKSMQQAVFRVLGIDEAEQEEKFGFLLDALKYGAPPHGGLAFGLDRLVMLMTGASSIREVIAFPKTQS
AGDVMTQAPGSVDGKALRELHIRLREQPKAE
>Q5SIC2 6.1.1.23~~~aspS2~~~Aspartate--tRNA(Asp/Asn) ligase~~~COG0017
MRVLVRDLKAHVGQEVELLGFLHWRRDLGRIQFLLLRDRSGVVQVVTGGLKLPLPESALRVRGLVVENAKAPGGLEVQAK
EVEVLSPALEPTPVEIPKEEWRANPDTLLEYRYVTLRGEKARAPLKVQAALVRGFRRYLDRQDFTEIFTPKVVRAGAEGG
SGLFGVDYFEKRAYLAQSPQLYKQIMVGVFERVYEVAPVWRMEEHHTSRHLNEYLSLDVEMGFIADEEDLMRLEEALLAE
MLEEALNTAGDEIRLLGATWPSFPQDIPRLTHAEAKRILKEELGYPVGQDLSEEAERLLGEYAKERWGSDWLFVTRYPRS
VRPFYTYPEEDGTTRSFDLLFRGLEITSGGQRIHRYEELLESLKAKGMDPEAFHGYLEVFKYGMPPHGGFAIGAERLTQK
LLGLPNVRYARAFPRDRHRLTP
>P0A8U0 ~~~syd~~~Protein Syd~~~
MDDLTAQALKDFTARYCDAWHEEHKSWPLSEELYGVPSPCIISTTEDAVYWQPQPFTGEQNVNAVERAFDIVIQPTIHTF
YTTQFAGDMHAQFGDIKLTLLQTWSEDDFRRVQENLIGHLVTQKRLKLPPTLFIATLEEELEVISVCNLSGEVCKETLGT
RKRTHLASNLAEFLNQLKPLL
>Q9RUN7 6.1.1.12~~~aspS1~~~Aspartate--tRNA(Asp) ligase~~~COG0173
MMKRTSLIGQLGQAQQQQTVTLQGWVSRRRDLGGLIFLELRDRSGTVQVQVEPDSPAFAEADRLRAEYVAEIEGTFQPRP
ESQRKGGLADFEVIASRVKVLNAAKTPPFELDKGESVAEDIRLKYRYLDLRRPEMQRALMLRSKAVTAVTEFLDAEGFIQ
VETPMLTKSTPEGARDFLVPSRLNPGEFYALPQSPQLFKQLLMIAGFDRYYQLARCFRDEDLRADRQPDFTQLDMEMSFV
EQDDVLEVQERLLAHVFRRVLDVELPLPFPRMSYFDAMDRYGSDKPDLRFDSAFTDVTGLFRGGEFAAFASAPSVKVLVA
PELTRKQIDELERVAKQNGAGGLAWLRRDGEGFTGGISKFVGGIAPQLIEQTGVAQGGTLLFAAGEWKKAVTALGAVRLA
LRDLFDLAAGGPQFHVSWVVDFPQLEFDEDSQSWTYMHHPFTAPHPGDVALFGTERQGEMRAQAYDLVMNGFEIGGGSVR
IHDPEVQAKMFQAIGFSEEAAREKFGFFLDALEYGTPPHGGIAWGFDRLLMLMSGAGSIREVIAFPKNNRGADLMAQAPS
PVEDAQLAEVGVQVRGE
>P21889 6.1.1.12~~~aspS~~~Aspartate--tRNA ligase~~~COG0173
MRTEYCGQLRLSHVGQQVTLCGWVNRRRDLGSLIFIDMRDREGIVQVFFDPDRADALKLASELRNEFCIQVTGTVRARDE
KNINRDMATGEIEVLASSLTIINRADVLPLDSNHVNTEEARLKYRYLDLRRPEMAQRLKTRAKITSLVRRFMDDHGFLDI
ETPMLTKATPEGARDYLVPSRVHKGKFYALPQSPQLFKQLLMMSGFDRYYQIVKCFRDEDLRADRQPEFTQIDVETSFMT
APQVREVMEALVRHLWLEVKGVDLGDFPVMTFAEAERRYGSDKPDLRNPMELTDVADLLKSVEFAVFAGPANDPKGRVAA
LRVPGGASLTRKQIDEYGNFVKIYGAKGLAYIKVNERAKGLEGINSPVAKFLNAEIIEDILDRTAAQDGDMIFFGADNKK
IVADAMGALRLKVGKDLGLTDESKWAPLWVIDFPMFEDDGEGGLTAMHHPFTSPKDMTAAELKAAPENAVANAYDMVING
YEVGGGSVRIHNGDMQQTVFGILGINEEEQREKFGFLLDALKYGTPPHAGLAFGLDRLTMLLTGTDNIRDVIAFPKTTAA
ACLMTEAPSFANPTALAELSIQVVKKAENN
>P67015 6.1.1.12~~~aspS~~~Aspartate--tRNA ligase~~~
MSKRTTYCGLVTEAFLGQEITLKGWVNNRRDLGGLIFVDLRDREGIVQVVFNPAFSEEALKIAETVRSEYVVEVQGTVTK
RDPETVNPKIKTGQVEVQVTNIKVINKSETPPFSINEENVNVDENIRLKYRYLDLRRQELAQTFKMRHQITRSIRQYLDD
EGFFDIETPVLTKSTPEGARDYLVPSRVHDGEFYALPQSPQLFKQLLMISGFDKYYQIVKCFRDEDLRADRQPEFTQVDI
EMSFVDQEDVMQMGEEMLKKVVKEVKGVEINGAFPRMTYKEAMRRYGSDKPDTRFEMELIDVSQLGRDMDFKVFKDTVEN
DGEIKAIVAKGAAEQYTRKDMDALTEFVNIYGAKGLAWVKVVEDGLTGPIGRFFETENVETLLTLTGAEAGDLVMFVADK
PNVVAQSLGALRVKLAKELGLIDETKLNFLWVTDWPLLEYDEDAKRYVAAHHPFTSPKEADIAKLGTAPEEAEANAYDIV
LNGYELGGGSIRIHDGELQEKMFEVLGFTKEQAQEQFGFLLDAFKYGAPPHGGIALGLDRLVMLLTNRTNLRDTIAFPKT
ASATCLLTNAPGEVSDKQLEELSLRIRH
>Q5SKD2 6.1.1.12~~~aspS1~~~Aspartate--tRNA(Asp) ligase~~~COG0173
MRRTHYAGSLRETHVGEEVVLEGWVNRRRDLGGLIFLDLRDREGLVQLVAHPASPAYATAERVRPEWVVRAKGLVRLRPE
PNPRLATGRVEVELSALEVLAEAKTPPFPVDAGWRGEEEKEASEELRLKYRYLDLRRRRMQENLRLRHRVIKAIWDFLDR
EGFVQVETPFLTKSTPEGARDFLVPYRHEPGLFYALPQSPQLFKQMLMVAGLDRYFQIARCFRDEDLRADRQPDFTQLDL
EMSFVEVEDVLELNERLMAHVFREALGVELPLPFPRLSYEEAMERYGSDKPDLRFGLELKEVGPLFRQSGFRVFQEAESV
KALALPKALSRKEVAELEEVAKRHKAQGLAWARVEEGGFSGGVAKFLEPVREALLQATEARPGDTLLFVAGPRKVAATAL
GAVRLRAADLLGLKREGFRFLWVVDFPLLEWDEEEEAWTYMHHPFTSPHPEDLPLLEKDPGRVRALAYDLVLNGVEVGGG
SIRIHDPRLQARVFRLLGIGEEEQREKFGFFLEALEYGAPPHGGIAWGLDRLLALMTGSPSIREVIAFPKNKEGKDPLTG
APSPVPEEQLRELGLMVVRP
>P36419 6.1.1.12~~~aspS~~~Aspartate--tRNA(Asp) ligase~~~
MRRTHYAGSLRETHVGEEVVLEGWVNRRRDLGGLIFLDLRDREGLVQLVAHPASPAYATAERVRPEWVVRAKGLVRLRPE
PNPRLATGRVEVELSALEVLAEAKTPPFPVDAGWRGEEEKEASEELRLKYRYLDLRRRRMQENLRLRHRVIKAIWDFLDR
EGFVQVETPFLTKSTPEGARDFLVPYRHEPGLFYALPQSPQLFKQMLMVAGLDRYFQIARCFRDEDLRADRQPDFTQLDL
EMSFVEVEDVLELNERLMAHVFREALGVELPLPFPRLSYEEAMERYGSDKPDLRFGLELKEVGPLFRQSGFRVFQEAESV
KALALPKALSRKEVAELEEVAKRHKAQGLAWARVEEGGFSGGVAKFLEPVREALLQATEARPGDTLLFVAGPRKVAATAL
GAVRLRAADLLGLKREGFRFLWVVDFPLLEWDEEEEAWTYMHHPFTSPHPEDLPLLEKDPGRVRALAYDLVLNGVEVGGG
SIRIHDPRLQARVFRLLGIGEEEQREKFGFFLEALEYGAPPHGGIAWGLDRLLALMTGSPSIREVIAFPKNKEGKDPLTG
APSPVPEEQLRELGLMVVRP
>B5Z6J9 6.1.1.17~~~gltX1~~~Glutamate--tRNA ligase 1~~~
MSLIVTRFAPSPTGYLHIGGLRTAIFNYLFARANQGKFFLRIEDTDLSRNSIEAANAIIEAFKWVGLEYDGEILYQSKRF
EIYKEYIQKLLDEDKAYYCYMSKDELDALREEQKARKETPRYDNRYRDFKGTPPKGIEPVVRIKVPQNEVIGFNDGVKGE
VKVNTNELDDFIIARSDGTPTYNFVVIVDDALMGITDVIRGDDHLSNTPKQIVLYKALNFKIPNFFHVPMILNEEGQKLS
KRHGATNVMDYQEMGYLKEALVNFLVRLGWSYQDKEIFSMQELLECFDPKDLNSSPSCFSWHKLNWLNAHYLKNQSAQKL
LELLKPFSFSDLSHLNPAQLDRLLDALKERSQTLKELALKIDEVLIAPVEYEEKVFKKLNQALIMPLLEKFKLELKEANF
NDESALENAMHKIIEEEKIKAGSFMQPLRLALLGKGGGIGLKEALFILGKTESVKRIENFLKN
>Q9X172 6.1.1.17~~~gltX1~~~Glutamate--tRNA ligase 1~~~COG0008
MVRVRFAPSPTGFLHVGGARTALFNFLFARKEKGKFILRIEDTDLERSEREYEEKLMESLRWLGLLWDEGPDVGGDHGPY
RQSERVEIYREHAERLVKEGKAYYVYAYPEEIEEMREKLLSEGKAPHYSQEMFEKFDTPERRREYEEKGLRPAVFFKMPR
KDYVLNDVVKGEVVFKTGAIGDFVIMRSNGLPTYNFACVVDDMLMEITHVIRGDDHLSNTLRQLALYEAFEKAPPVFAHV
STILGPDGKKLSKRHGATSVEAFRDMGYLPEALVNYLALLGWSHPEGKELLTLEELISSFSLDRLSPNPAIFDPQKLKWM
NGYYLRNMPIEKLAELAKPFFEKAGIKIIDEEYFKKVLEITKERVEVLSEFPEESRFFFEDPAPVEIPEEMKEVFSQLKE
ELQNVRWTMEEITPVFKKVLKQHGVKPKEFYMTLRRVLTGREEGPELVNIIPLLGKEIFLRRIERSLGG
>Q9X2I8 6.1.1.17~~~gltX2~~~Glutamate--tRNA ligase 2~~~COG0008
MFITGAFFDILEVGPKKIRRCFELVRVRFAPSPTGHLHVGGARTALFNWMFARKEGGKFILRIEDTDTERSSREYEQQIL
ESLRWCGLDWDEGPDIGGDFGPYRQSERLEIYREYAEKLVEDKRAYYVVYDKEDPSKELFTTYEYPHEYKEKGHPVTIKF
KVLPGKTSFEDLLKGYMEFDNSTLEDFIIMKSNGFPTYNFAVVVDDHLMRISHVFRGEDHLSNTPKQLMIYEAFGWEAPV
FMHIPLILGSDRTPLSKRHGATSVEHFRREGILSRALMNYLALLGWRVEGDEIFTIEEKLQSFDPKDISNKGVIFDYQKL
EWVNGKHMRRIDLEDLKREFIEWAKYAGKEIPSVDERYFSETLRICREKVNTLSQLYDIMYPFMNDDYEYEKDYVEKFLK
REEAERVLEEAKKAFKDLNSWNMEEIEKTLRDLSEKGLASKKVVFQLIRGAVTGKLVTPGLFETIEVLGKERTLKRLERT
LQFLKKT
>O51345 6.1.1.17~~~gltX~~~Glutamate--tRNA ligase~~~
MSTRVRYAPSPTGLQHIGGIRTALFNYFFAKSCGGKFLLRIEDTDQSRYSPEAENDLYSSLKWLGISFDEGPVVGGDYAP
YVQSQRSAIYKQYAKYLIESGHAYYCYCSPERLERIKKIQNINKMPPGYDRHCRNLSNEEVENALIKKIKPVVRFKIPLE
GDTSFDDILLGRITWANKDISPDPVILKSDGLPTYHLANVVDDYLMKITHVLRAQEWVSSGPLHVLLYKAFKWKPPIYCH
LPMVMGNDGQKLSKRHGSTALRQFIEDGYLPEAIINYVTLLGWSYDDKREFFSKNDLEQFFSIEKINKSPAIFDYHKLDF
FNSYYIREKKDEDLFNLLLPFFQKKGYVSKPSTLEENQKLKLLIPLIKSRIKKLSDALNMTKFFYEDIKSWNLDEFLSRK
KTAKEVCSILELIKPILEGFEKRSSEENDKIFYDFAESNGFKLGEILLPIRIAALGSKVSPPLFDSLKLIGKSKVFERIK
LAQEFLRINE
>Q2SX36 6.1.1.17~~~gltX~~~Glutamate--tRNA ligase~~~
MTRPVRTRFAPSPTGFIHLGNIRSALYPWAFARKMKGTFVLRIEDTDVERSSQEAVDAILEGMAWLGLDYDEGPYYQMQR
MDRYREVLAQMQEKGLVYPCYMSTEELDALRERQRAAGEKPRYDGTWRPEPGKVLPEPPAGVAPVLRFRNPLTGTVAWDD
AVKGRVEISNEELDDLVVARPDGTPMYNFCVVVDDLDMGITHVIRGDDHVNNTPRQINILRALGGEVPVYAHLPTVLNEQ
GEKMSKRHGAMSVMGYRDAGYLPEAVLNYLARLGWSHGDAEIFTREQFVEWFDLEHLGKSPAQYDHNKLNWLNNHYIKEA
DDARLAGLAKPFFAALGIDAGAIEQGPDLVSVMGLMKDRASTVKEIAENSAMFYRAPAPGADALAQHVTDAVRPALVEFA
AALKTVEWTKEAIAAALKAVLGAHKLKMPQLAMPVRLLVAGTTHTPSIDAVLLLFGRDVVVSRIEAALA
>P04805 6.1.1.17~~~gltX~~~Glutamate--tRNA ligase~~~COG0008
MKIKTRFAPSPTGYLHVGGARTALYSWLFARNHGGEFVLRIEDTDLERSTPEAIEAIMDGMNWLSLEWDEGPYYQTKRFD
RYNAVIDQMLEEGTAYKCYCSKERLEALREEQMAKGEKPRYDGRCRHSHEHHADDEPCVVRFANPQEGSVVFDDQIRGPI
EFSNQELDDLIIRRTDGSPTYNFCVVVDDWDMEITHVIRGEDHINNTPRQINILKALKAPVPVYAHVSMINGDDGKKLSK
RHGAVSVMQYRDDGYLPEALLNYLVRLGWSHGDQEIFTREEMIKYFTLNAVSKSASAFNTDKLLWLNHHYINALPPEYVA
THLQWHIEQENIDTRNGPQLADLVKLLGERCKTLKEMAQSCRYFYEDFAEFDADAAKKHLRPVARQPLEVVRDKLAAITD
WTAENVHHAIQATADELEVGMGKVGMPLRVAVTGAGQSPALDVTVHAIGKTRSIERINKALDFIAERENQQ
>A0QUY7 6.1.1.17~~~gltX~~~Glutamate--tRNA ligase~~~COG0008
MTTKKVRVRFCPSPTGTPHVGLVRTALFNWAYARHTGGDFVFRIEDTDAARDSDESYAAILDALRWLGMDWDEGPEVGGP
YEPYRQSQRREIYRDVVARLLEAGEVYEAYSTPEEVEARHLAAGRNPKLGYDNYDRDLTDAQRKAFADEGRRPVLRLRMP
DEDLSWNDLVRGPTTFAAGSVPDFAITRSNGDPLYTLVNPVDDALMKITHVLRGEDILPSTPRQIALYRALMRIGVAEFV
PEFAHLPSVLGEGNKKLSKRDPQSNLFLHRDRGFIPEGLLNYLALLGWGIADDHDVFSLDEMVAAFDVADVNSNPARFDQ
KKADAINAEHIRMLAPEDFTARLREYFVTHGYDTTLDDAAFAEAAALVQTRVVVLGDAWGLLKFFNDDAYEIDEKSAAKE
LKPESAAVLDAALSALEAVGDWTTPAIEAALKTALLEGLELKPRKAFGPIRVAVTGAAVSPPLFESMELLGRDRSLARLR
AARDRV
>P9WFV9 6.1.1.17~~~gltX~~~Glutamate--tRNA ligase~~~COG0008
MTATETVRVRFCPSPTGTPHVGLVRTALFNWAYARHTGGTFVFRIEDTDAQRDSEESYLALLDALRWLGLDWDEGPEVGG
PYGPYRQSQRAEIYRDVLARLLAAGEAYHAFSTPEEVEARHVAAGRNPKLGYDNFDRHLTDAQRAAYLAEGRQPVVRLRM
PDDDLAWNDLVRGPVTFAAGSVPDFALTRASGDPLYTLVNPCDDALMKITHVLRGEDLLPSTPRQLALHQALIRIGVAER
IPKFAHLPTVLGEGTKKLSKRDPQSNLFAHRDRGFIPEGLLNYLALLGWSIADDHDLFGLDEMVAAFDVADVNSSPARFD
QKKADALNAEHIRMLDVGDFTVRLRDHLDTHGHHIALDEAAFAAAAELVQTRIVVLGDAWELLKFFNDDQYVIDPKAAAK
ELGPDGAAVLDAALAALTSVTDWTAPLIEAALKDALIEGLALKPRKAFSPIRVAATGTTVSPPLFESLELLGRDRSMQRL
RAARQLVGHA
>Q9XCL6 6.1.1.17~~~gltX~~~Glutamate--tRNA ligase~~~
MTTVRTRIAPSPTGDPHVGTAYIALFNLCFARQHGGQFILRIEDTDQLRSTRESEQQIYDALRWLGIEWDEGPDVGGPHG
PYRQSERGHIYKKYSDELVEKGHAFTCFCTPERLDAVRAEQMARKETPRYDGHCMHLPKDEVQRRLAAGESHVTRMKVPT
EGVCVVPDMLRGDVEIPWDRMDMQVLMKADGLPTYFLANVVDDHLMGITHVLRGEEWLPSAPKLIKLYEYFGWEQPQLCY
MPLLRNPDKSKLSKRKNPTSITFYERMGYLPQALLNYLGRMGWSMPDEREKFTLAEMIEHFDLSRVSLGGPIFDLEKLSW
LNGQWIREQSVEEFAREVQKWALNPEYLMKIAPHVQGRVENFSQIAPLAGFFFSGGVPLDASLFEHKKLDPTQVRQVLQL
VLWKLESLRQWEKERITGCIQAVAEHLQLKLRDVMPLMFPAITGHASSVSVLDAMEILGADLSRYRLRQALELLGGASKK
ETKEWEKIRDAIPG
>P99170 6.1.1.17~~~gltX~~~Glutamate--tRNA ligase~~~
MSDRIRVRYAPSPTGYLHIGNARTALFNYLYAKHYNGDFVIRIEDTDKKRNLEDGETSQFDNLKWLGLDWDESVDKDNGY
GPYRQSERQHIYQPLIDQLLAEDKAYKCYMTEEELEAEREAQIARGEMPRYGGQHAHLTEEQRQQFEAEGRQPSIRFRVP
QNQTYSFDDMVKGNISFDSNGIGDWVIVKKDGIPTYNFAVAIDDHYMQISDVIRGDDHISNTPKQIMIYEAFGWEPPRFG
HMSLIVNEERKKLSKRDGQILQFIEQYRDLGYLPEALFNFIALLGWSPEGEEEIFSKEEFIKIFDEKRLSKSPAFFDKQK
LAWVNNQYMKQKDTETVFQLALPHLIKANLIPEVPSEEDLSWGRKLIALYQKEMSYAGEIVPLSEMFFKEMPALGEEEQQ
VINGEQVPELMTHLFSKLEALEPFEAAEIKKTIKEVQKETGIKGKQLFMPIRVAVTGQMHGPELPNTIEVLGKEKVLNRL
KQYK
>B2FHI7 6.1.1.17~~~gltX~~~Glutamate--tRNA ligase~~~COG0008
MTCRTRFAPSPTGYLHIGGARTALYCWLEARHRGGEFVLRIEDTDRERSTQGAIDAILEAMEWLGLDYDEGPIYQTDRVA
RYLEVAEQLVADGKAYYAYETREELDAMREAAMARQEKPRYNGAARDLGLPRRDDPNRVIRFKNPLEGTVVFDDLIKGRI
EIANSELDDMVIFRPDGYPTYNFAVVVDDWDMGITEVIRGDDHINNTPRQINLYEGIGAPVPKFGHMPMILDEQGAKLSK
RTGAADVMQYKDAGYLPDALLSYLARLGWSHGDQELFSRQELIELFDVKDCNSKASRLDMAKLGWVNQHFLKTEDVAAIV
PHLVYQLQKLGLDVAAGPAPEDVVVALRERVQTLKEMAEKAVVWYQPLTEYDEAAVAKHFKAGAEVALGKARELLAALPE
WTAESVGVALHDAAAALEIGMGKVAQPLRVAITGTQVSPDISHTVYLAGREQALKRIDVAITKVATA
>Q97NG1 6.1.1.17~~~gltX~~~Glutamate--tRNA ligase~~~COG0008
MSKDIRVRYAPSPTGLLHIGNARTALFNYLYARHHGGTFLIRIEDTDRKRHVEDGERSQLENLRWLGMDWDESPESHENY
RQSERLDLYQKYIDQLLAEGKAYKSYVTEEELAAERERQEVAGETPRYINEYLGMSEEEKAAYIAEREAAGIIPTVRLAV
NESGIYKWHDMVKGDIEFEGGNIGGDWVIQKKDGYPTYNFAVVIDDHDMQISHVIRGDDHIANTPKQLMVYEALGWEAPE
FGHMTLIINSETGKKLSKRDTNTLQFIEDYRKKGYLPEAVFNFIALLGWNPGGEDEIFSREEFIKLFDENRLSKSPAAFD
QKKLDWMSNDYIKNADLETIFEMAKPFLEEAGRLTDKAEKLVELYKPQMKSVDEIIPLTDLFFSDFPELTEAEREVMTGE
TVPTVLEAFKAKLEAMTDDEFVTENIFPQIKAVQKETGIKGKNLFMPIRIAVSGEMHGPELPDTIFLLGREKSIQHIENM
LKEISK
>P27000 6.1.1.17~~~gltX~~~Glutamate--tRNA ligase~~~COG0008
MVVTRIAPSPTGDPHVGTAYIALFNYAWARRNGGRFIVRIEDTDRARYVPGAEERILAALKWLGLSYDEGPDVGGPHGPY
RQSERLPLYQKYAEELLKRGWAYRAFETPEELEQIRKEKGGYDGRARNIPPEEAEERARRGEPHVIRLKVPRPGTTEVKD
ELRGVVVYDNQEIPDVVLLKSDGYPTYHLANVVDDHLMGVTDVIRAEEWLVSTPIHVLLYRAFGWEAPRFYHMPLLRNPD
KTKISKRKSHTSLDWYKAEGFLPEALRNYLCLMGFSMPDGREIFTLEEFIQAFTWERVSLGGPVFDLEKLRWMNGKYIRE
VLSLEEVAERVKPFLREAGLSWESEAYLRRAVELMRPRFDTLKEFPEKARYLFTEDYPVSEKAQRKLEEGLPLLKELYPR
LRAQEEWTEAALEALLRGFAAEKGVKLGQVAQPLRAALTGSLETPGLFEILALLGKERALRRLERALA
>Q8DLI5 6.1.1.17~~~gltX~~~Glutamate--tRNA ligase~~~COG0008
MTVRVRLAPSPTGNLHIGTARTAVFNWLYARHRGGKFILRIEDTDRERSRPEYTENILEGLQWLGLTWDEGPYFQSDRLD
LYRQAIQTLLDKGLAYYCYCTPEELEALRAEQKAKGQAPRYDNRHRHLTPEEQAAFEAAGRTPVIRFKIEDDRQIEWQDL
VRGRVSWQGADLGGDMVIARAAPRGEIGYPLYNLVVVVDDIAMGITDVIRGEDHIGNTPKQILLYEALGATPPNFAHTPL
ILNSTGQKLSKRDGVTSISDFRAMGYLAPALANYMTLLGWSPPEGVGELFTLDLAAKHFSFERINKAGARFDWDKLNWLN
RQYIQQLEPEEFLAELIPLWQGAGYAFDEERDRPWLFDLAQLLQPGLNTLREAIDQGAVFFIPSVTFDSEAMAQLGQPQS
ATILAYLLEHLPAEPALTVAMGQQLIQQAAKAAGVKKGATMRTLRAALTGAVHGPDLMAAWQILHQRGWDEPRLAAALKQ
AQTTS
>Q5H2R3 6.1.1.17~~~gltX~~~Glutamate--tRNA ligase~~~
MACRTRFAPSPTGYLHIGGARTALYCWLEARRRGGQFVLRIEDTDRQRSTQAAIDAILEAMQWLGLGYDEGPIYQTQRVA
RYQEVAEQLLAQGKAYYAYETREELDAMREAAMAKQEKPRYDGAAREQNLPYRDDPNRVIRFKNPIGGTVVFDDLIKGRI
EIANSELDDMVIFRPDGLPTYNFAVVVDDWDMGITEVIRGDDHINNTPRQINIYAALGAPVPKFAHMPMILDEQGTKLSK
RTGAADVMQYKDAGYLPHALINYLARLGWSHGDQELFTPQELLDLFDVKDVNSKAARLDMAKLGWVNQHYLKTDDPASIA
PQLEYQLAKLGVDLAAGPAAADVVVALRERVHTLKEMAEKAVVWYQPLETYDAAAVMKHLKLGAEVPLGKARELLAAVDQ
WSVDSVSAALHDAAAALELGMGKVAQPLRVAITGTQVSPDISQTVYLAGREGALKRIDAALTKIGAA
>Q728R9 6.1.1.20~~~pheS~~~Phenylalanine--tRNA ligase alpha subunit~~~COG0016
MDLLKELESLVPELERGLGQASSLQELEEQRIAFLGRKGRLAQVMAHLPELAPAERPRVGQAANTVKQALTELFEQRKVV
LEQAKEAAALSRFDPSLPGRMPWRGSLHPVTLVMEEICSVFGALGYDIVTGPEVENDYHNFEALNMPPEHPARDMQDTLY
ITESILMRTHTSPLQVRTMKSLRPPVAAIAPGKVYRRDSDITHTPMFHQIEGFMVDHNVSMAELRGTLTSFLRTVFGGDT
RVRFRPSFFPFTEPSAEVDISCCICGGKGHVGNEPCRVCKTTGWVEILGCGMIDPEVFKSVGYDPEVYTGFAFGLGVERV
AMLKYGIGDLRMFFENDVRFLSQFA
>P08312 6.1.1.20~~~pheS~~~Phenylalanine--tRNA ligase alpha subunit~~~COG0016
MSHLAELVASAKAAISQASDVAALDNVRVEYLGKKGHLTLQMTTLRELPPEERPAAGAVINEAKEQVQQALNARKAELES
AALNARLAAETIDVSLPGRRIENGGLHPVTRTIDRIESFFGELGFTVATGPEIEDDYHNFDALNIPGHHPARADHDTFWF
DTTRLLRTQTSGVQIRTMKAQQPPIRIIAPGRVYRNDYDQTHTPMFHQMEGLIVDTNISFTNLKGTLHDFLRNFFEEDLQ
IRFRPSYFPFTEPSAEVDVMGKNGKWLEVLGCGMVHPNVLRNVGIDPEVYSGFAFGMGMERLTMLRYGVTDLRSFFENDL
RFLKQFK
>P75564 6.1.1.20~~~pheS~~~Phenylalanine--tRNA ligase alpha subunit~~~
MIDQSKLIERWKTTFETAQNPTELLAFKNSFRNADLKPLLSQIKETTDIETKRHLGQLYKQLESTLQTLHDTQLQVFTQA
QSSSVLTHGDVMLLATSFAPGSSNIIYQVIDELVNYFKKFLFTVNYDSELTTIADCFDLLNIPKDHPSRNLTDTFYLDKN
RLLRTHCTAATLRAVKETKKSNNPDIRIASFGAVFRKDDDDATHSHQFNQLDFMWIKKDFSLTNLKWFMQNMINHIFGEN
TSARFRLSHFPFTEPSFEIDIRCWLCQNGCGVCKKTRWIEVLGAGILHPQVMANMGFSDTDNIRGIAAGIGIERLVMLKH
GISDIRDLYDNNFKFLAQFTD
>P9WFU3 6.1.1.20~~~pheS~~~Phenylalanine--tRNA ligase alpha subunit~~~COG0016
MLSPEALTTAVDAAQQAIALADTLDVLARVKTEHLGDRSPLALARQALAVLPKEQRAEAGKRVNAARNAAQRSYDERLAT
LRAERDAAVLVAEGIDVTLPSTRVPAGARHPIIMLAEHVADTFIAMGWELAEGPEVETEQFNFDALNFPADHPARGEQDT
FYIAPEDSRQLLRTHTSPVQIRTLLARELPVYIISIGRTFRTDELDATHTPIFHQVEGLAVDRGLSMAHLRGTLDAFARA
EFGPSARTRIRPHFFPFTEPSAEVDVWFANKIGGAAWVEWGGCGMVHPNVLRATGIDPDLYSGFAFGMGLERTLQFRNGI
PDMRDMVEGDVRFSLPFGVGA
>Q9I0A3 6.1.1.20~~~pheS~~~Phenylalanine--tRNA ligase alpha subunit~~~
MENLDALVSQALEAVRHTEDVNALEQIRVHYLGKKGELTQVMKTLGDLPAEERPKVGALINVAKEKVQDVLNARKTELEG
AALAARLAAERIDVTLPGRGQLSGGLHPVTRTLERIEQCFSRIGYEVAEGPEVEDDYHNFEALNIPGHHPARAMHDTFYF
NANMLLRTHTSPVQVRTMESQQPPIRIVCPGRVYRCDSDLTHSPMFHQVEGLLVDEGVSFADLKGTIEEFLRAFFEKQLE
VRFRPSFFPFTEPSAEVDIQCVICSGNGCRVCKQTGWLEVMGCGMVHPNVLRMSNIDPEKFQGFAFGMGAERLAMLRYGV
NDLRLFFDNDLRFLGQFR
>Q4L5E3 6.1.1.20~~~pheS~~~Phenylalanine--tRNA ligase alpha subunit~~~COG0016
MTQNDSMAELKQQALVDINEAQNERELQDVKVKYLGKKGSVSGLMKNMKDLPNEEKPAYGQKVNELRQTIQKELDEKQEL
LKNEKLNQQLAEETIDVTLPSRQISIGSKHPLTRTVEEIEDLFLGLGYEIVDGYEVEQDYYNFEALNLPKSHPARDMQDS
FYITDEILMRTHTSPVQARTMEKRNGQGPVKIICPGKVYRRDSDDATHSHQFTQIEGLVVDKNIKMSDLKGTLELVAKKL
FGADREIRLRPSYFPFTEPSVEVDVSCFKCKGKGCNVCKHTGWIEILGAGMVHPNVLEMAGFDSNEYSGFAFGMGPDRIA
MLKYGIEDIRYFYTNDVRFLEQFKAVEDRGEA
>Q5SGX2 6.1.1.20~~~pheS~~~Phenylalanine--tRNA ligase alpha subunit~~~COG0016
MLEEALAAIQNARDLEELKALKARYLGKKGLLTQEMKGLSALPLEERRKRGQELNAIKAALEAALEAREKALEEAALKEA
LERERVDVSLPGASLFSGGLHPITLMERELVEIFRALGYQAVEGPEVESEFFNFDALNIPEHHPARDMWDTFWLTGEGFR
LEGPLGEEVEGRLLLRTHTSPMQVRYMVAHTPPFRIVVPGRVFRFEQTDATHEAVFHQLEGLVVGEGIAMAHLKGAIYEL
AQALFGPDSKVRFQPVYFPFVEPGAQFAVWWPEGGKWLELGGAGMVHPKVFQAVDAYRERLGLPPAYRGVTGFAFGLGVE
RLAMLRYGIPDIRYFFGGRLKFLEQFKGVL
>P27001 6.1.1.20~~~pheS~~~Phenylalanine--tRNA ligase alpha subunit~~~
MLEEALAAIQNARDLEELKALKARYLGKKGLLTQEMKGLSALPLEERRKRGQELNAIKAALEAALEAREKALEEAALKEA
LERERVDVSLPGASLFSGGLHPITLMERELVEIFRALGYQAVEGPEVESEFFNFDALNIPEHHPARDMWDTFWLTGEGFR
LEGPLGEEVEGRLLLRTHTSPMQVRYMVAHTPPFRIVVPGRVFRFEQTDATHEAVFHQLEGLVVGEGIAMAHLKGAIYEL
AQALFGPDSKVRFQPVYFPFVEPGAQFAVWWPEGGKWLELGGAGMVHPKVFQAVDAYRERLGLPPAYRGVTGFAFGLGVE
RLAMLRYGIPDIRYFFGGRLKFLEQFKGVL
>Q5LC76 6.1.1.20~~~pheT~~~Phenylalanine--tRNA ligase beta subunit~~~COG0072
MNISYNWLKEYVNFDLTPDEVAAALTSIGLETGGVEEVQTIKGGLEGLVIGEVLTCVEHPNSDHLHITTVNLGNGEPTQI
VCGAPNVAAGQKVVVATLGTKLYDGDECFTIKKSKIRGVESIGMICAEDEIGIGTSHDGIIVLPEDAVPGTLAKDYYNVK
SDYVLEVDITPNRADACSHYGVARDLYAYLVQNGKQAALTRPSVDAFAVENHDLDIKVTVENSEACPRYAGVTVKGVTVK
ESPEWLQNKLRIIGLRPINNVVDITNYIVHAFGQPLHCFDANKIKGGEVIVKTMPEGTTFVTLDGVERKLNERDLMICNK
EDAMCIAGVFGGLDSGSTEATTDVFLESAYFHPTWVRKTARRHGLNTDASFRFERGIDPNITIYCLKLAAMMVKELAGGT
ISSEIKDVCAAPAQDFIVELTYEKVHSLIGKVIPVETIKSIVTSLEMKIMDETAEGLTLAVPPYRVDVQRDCDVIEDILR
IYGYNNVEIPSTLKSSLTTKGDCDKSNKLQNLVAEQLVGCGFNEILNNSLTRAAYYDGLESYPSKNLVMLLNPLSADLNC
MRQTLLFGGLESIAHNANRKNADLKFFEFGNCYHFDAEKKNPEKVLAPYSEDYHLGLWVTGKMVSNSWAHADENTSVYEL
KAYVENIFKRLGLDLHSLVVGNLSDDIYSTALTVNTKGGKRLATFGVVTKKMLKAFDVDNEVYYADLNWKELMKAIRSVK
VSYKEISKFPAVKRDLALLLDKKVQFAEIEKIAYETEKKLLKEVSLFDVYEGKNLEAGKKSYAVSFLLQDESQTLNDKMI
DKIMSKLVKNLEDKLGAKLR
>Q728S0 6.1.1.20~~~pheT~~~Phenylalanine--tRNA ligase beta subunit~~~COG0072
MLLSLKWLREFVPFEGTAAELGDRLTMLGLELEEIIRPFDAIEPIVVGHVVSRERHPEADKLSVCKVDVGQGEPLDIVCG
APNVAAGQRVPVALVGTTMPGGMVIKKAKLRGQPSHGMICSERELGLGEDHDGIMVLPESFRIGARLVDELDLDREVCDI
AITPNRADCLSVLGLAREVALAFGLPLTMPALDLKEEGADASNALRIDIPDASLCPLFHGRVLEGAAVRKSPAWMRYRLI
AVGVRPISNIVDVTNYILMELGQPLHAYDLDLLAGGRIEVSAAREGERLTTLDGVERVLTSNDLLIRDGEKPVGLAGVMG
GAETEISDKSSRVFLEAAVFRPGTIRKTARRLGLSSEASYRFERGVDQVVCTYAMNRAAQLIAGLSGATLRPGICHNEPL
PWQAPVLRFRRARGEALLGISLDETFCRETLERLGCKVDAADAADWKVTAPSHRRDFEREADLIEEVARVRGMDTIEPVL
PKVMRPLDRAGAPESKYSFWLRLKHWAAGLGLNEAINYSFVGQKDLDHLNLAVDGRIPIMNPLTADQNVLRTELAPGLLQ
NLRHNIAQGNAGLRLFEVAHIFEADATSDTTARERARLDILVYGSRYDSQWPHVEADADYADIKGIVEHCLAFLHLEGAT
FTLAASHPFLMPCVDVAVQGRQVGVIGRVRPEIADAYHARKDAWLADLDLDVLRELHDAARIAFRSLPVYPPVRRDITVA
APGSLQVGAVLDHILGLRLPLLCGVELIDVFEPEGKDERNLTFRMTFRHASRTLKDAEVDKERDKVAHSLVEKLPVRI
>P07395 6.1.1.20~~~pheT~~~Phenylalanine--tRNA ligase beta subunit~~~COG0072
MKFSELWLREWVNPAIDSDALANQITMAGLEVDGVEPVAGSFHGVVVGEVVECAQHPNADKLRVTKVNVGGDRLLDIVCG
APNCRQGLRVAVATIGAVLPGDFKIKAAKLRGEPSEGMLCSFSELGISDDHSGIIELPADAPIGTDIREYLKLDDNTIEI
SVTPNRADCLGIIGVARDVAVLNQLPLVQPEIVPVGATIDDTLPITVEAPEACPRYLGRVVKGINVKAPTPLWMKEKLRR
CGIRSIDAVVDVTNYVLLELGQPMHAFDKDRIEGGIVVRMAKEGETLVLLDGTEAKLNADTLVIADHNKALAMGGIFGGE
HSGVNDETQNVLLECAFFSPLSITGRARRHGLHTDASHRYERGVDPALQHKAMERATRLLIDICGGEAGPVIDITNEATL
PKRATITLRRSKLDRLIGHHIADEQVTDILRRLGCEVTEGKDEWQAVAPSWRFDMEIEEDLVEEVARVYGYNNIPDEPVQ
ASLIMGTHREADLSLKRVKTLLNDKGYQEVITYSFVDPKVQQMIHPGVEALLLPSPISVEMSAMRLSLWTGLLATVVYNQ
NRQQNRVRIFESGLRFVPDTQAPLGIRQDLMLAGVICGNRYEEHWNLAKETVDFYDLKGDLESVLDLTGKLNEVEFRAEA
NPALHPGQSAAIYLKGERIGFVGVVHPELERKLDLNGRTLVFELEWNKLADRVVPQAREISRFPANRRDIAVVVAENVPA
ADILSECKKVGVNQVVGVNLFDVYRGKGVAEGYKSLAISLILQDTSRTLEEEEIAATVAKCVEALKERFQASLRD
>P9WFU1 6.1.1.20~~~pheT~~~Phenylalanine--tRNA ligase beta subunit~~~COG0072
MRLPYSWLREVVAVGASGWDVTPGELEQTLLRIGHEVEEVIPLGPVDGPVTVGRVADIEELTGYKKPIRACAVDIGDRQY
REIICGATNFAVGDLVVVALPGATLPGGFTISARKAYGRNSDGMICSAAELNLGADHSGILVLPPGAAEPGADGAGVLGL
DDVVFHLAITPDRGYCMSVRGLARELACAYDLDFVDPASNSRVPPLPIEGPAWPLTVQPETGVRRFALRPVIGIDPAAVS
PWWLQRRLLLCGIRATCPAVDVTNYVMLELGHPMHAHDRNRISGTLGVRFARSGETAVTLDGIERKLDTADVLIVDDAAT
AAIGGVMGAASTEVRADSTDVLLEAAIWDPAAVSRTQRRLHLPSEAARRYERTVDPAISVAALDRCARLLADIAGGEVSP
TLTDWRGDPPCDDWSPPPIRMGVDVPDRIAGVAYPQGTTARRLAQIGAVVTHDGDTLTVTPPSWRPDLRQPADLVEEVLR
LEGLEVIPSVLPPAPAGRGLTAGQQRRRTIGRSLALSGYVEILPTPFLPAGVFDLWGLEADDSRRMTTRVLNPLEADRPQ
LATTLLPALLEALVRNVSRGLVDVALFAIAQVVQPTEQTRGVGLIPVDRRPTDDEIAMLDASLPRQPQHVAAVLAGLREP
RGPWGPGRPVEAADAFEAVRIIARASRVDVTLRPAQYLPWHPGRCAQVFVGESSVGHAGQLHPAVIERSGLPKGTCAVEL
NLDAIPCSAPLPAPRVSPYPAVFQDVSLVVAADIPAQAVADAVRAGAGDLLEDIALFDVFTGPQIGEHRKSLTFALRFRA
PDRTLTEDDASAARDAAVQSAAERVGAVLRG
>Q7MXR4 6.1.1.20~~~pheT~~~Phenylalanine--tRNA ligase beta subunit~~~COG0072
MNISYKWLLEYLPCTLSPQEIADTLTSIGLETGGVEEIETIRGGLRGLVIGHVLTCEEHPNSDHLHITTVDVGADAPLQI
VCGAPNVAAGQKVVVATVGTTLYHGEEEFAIKKSKIRGVESFGMICSEVEIGVGSSNDGTIVLPSDAPVGMPAAEYYHVE
SDYCIEVDITPNRVDATSHYGVARDLAASLKRNGVPAELKLPEVNLPTDIIDSRIEVKVADATACPRYQGLVIRDITVGE
SPEWLRNRLQAIGLRPINNIVDITNYVLHEFGQPLHAFDLAFIKGDRVHVQTVAEGTPFVTLDGVERKLTAEDLMICDSN
GDPMCVAGVFGGLHSGVTEKTTDIFLESANFNPTMVRRTARRLGLNTDSSFRFERGLDPERTDWALRRAASLILEIAGGH
LGGMTDVYSNPLKPHLISLSFEKVNSVIGRTIEPEMVRSILNSLEIRISKEEDGVMTLEVPRYRTDVTRDVDVIEEIMRI
YGYNQVELTGYIRASLGHETETDRRYKWQTVVSEQLVGAGFNEILNNSLTAGSYYEGLKSHPREMAVELMNPLSQELNCM
RQTLLFGGLETLSHNLRRKHLSLYLFEWGKCYRFHAAKRTDETPLAAYAEDDRLGIWICGQRVHNSWAHPEEPTSVFELK
AVVEQVLCRVGIETGAYTLKTADNDLYASAMEVKTRSGKLLGTFGTVSTELIKRFEIEQPVYFAELLWDALMSESARYKL
EARDLPRFPEVKRDLALLLDKAVSFAEIESLARGCEKKLLRRVELFDVYEGKNLPAGKKSYAVSFFLRNDEKTLNDKQIE
AIMAKIRTTLEQKLGAQLR
>Q9I0A4 6.1.1.20~~~pheT~~~Phenylalanine--tRNA ligase beta subunit~~~
MKFSEKWLRSWANPQVSHDELVARLSMVGLEVDADLPVAGAFSGVVVGEVLSTEQHPDADKLRVCQVSNGSETFQVVCGA
PNVRAGLKIPFAMIGAELPDDFKIKKAKLRGVESFGMLCSAKELQISEENAGLLELPADAPVGQDVRTYLELADYTIEVG
LTPNRGDCLSLAGLAREVSAIYDVPLAPVAVDAVAAQHDETRPVELAAPAACPRYLGRVIRNVDLSRPTPLWMVERLRRS
DIRSIDPVVDVTNYVMIELGQPMHAFDLAEINGGVRVRMAEDGEKLVLLDGQEITLRADTLVIADHQRALAIAGVMGGEH
SGVSDSTRDLFLEAAFFDTIALAGKARSYGLHTDSSHRFERGVDSQLARKAMERATRLILDIVGGEPGPIVEQVSEAHLP
KVAPITLRAERVTQMLGMPLDAAEIVRLLQALELTVVADGEGQWSVGVPSHRFDISLEVDLIEELARLYGYNRLPVRYPQ
ARLAPNNKPEARAALPLLRRLLVARGYQEAITFSFIDPALFELFDPGTQPLTLANPISADMAAMRSSLWPGLVKALQHNL
NRQQSRVRLFESGLRFVGQLEGLKQEAMLAGAICGKRLPEGWANGRDGVDFFDAKADVEAVLASAGALGDFSFVPGEHPA
LHPGQTARIEREGRLVGYLGALHPELAKKLDLEQPVFLFELLLAEVVDGHLPKFRELSRFPEVRRDLALLVDQDVPAQDI
LTQIRAAAGEWLTDLRLFDVYHGKGIDPHRKSLAVGLTWQHPSRTLNDDEVNSTTQNIVTSLEERFNATLRK
>P67041 6.1.1.20~~~pheT~~~Phenylalanine--tRNA ligase beta subunit~~~
MLISNEWLKEYVTIDDSVSDLAERITRTGIEVDDLIDYTKDIKNLVVGFVKSKEKHPDADKLNVCQVDIGEDEPVQIVCG
APNVDAGQYVIVAKVGGRLPGGIKIKRAKLRGERSEGMICSLQEIGISSNYIPKSFESGIYVFSESQVPGTDALQALYLD
DQVMEFDLTPNRADALSMIGTAYEVAALYNTKMTKPETTSNELELSANDELTVTIENEDKVPYYSARVVHDVTIEPSPIW
MQARLIKAGIRPINNVVDISNYVLLEYGQPLHMFDQDAIGSQQIVVRQANEGEKMTTLDDTERELLTSDIVITNGQTPIA
LAGVMGGDFSEVKEQTSNIVIEGAIFDPVSIRHTSRRLNLRSESSSRFEKGIATEFVDEAVDRACYLLQTYANGKVLKDR
VSSGELGAFITPIDITADKINRTIGFDLSQNDIVTIFNQLGFDTEINDDVITVLVPSRRKDITIKEDLIEEVARIYGYDD
IPSTLPVFDKVTSGQLTDRQYKTRMVKEVLEGAGLDQAITYSLVSKEDATAFSMQQRQTIDLLMPMSEAHASLRQSLLPH
LIEAASYNVARKNKDVKLFEIGNVFFANGEGELPDQVEYLSGILTGDYVVNQWQGKKETVDFYLAKGVVDRVSEKLNLEF
SYRRADIDGLHPGRTAEILLENKVVGFIGELHPTLAADNDLKRTYVFELNFDALMAVSVGYINYQPIPRFPGMSRDIALE
VDQNIPAADLLSTIHAHGGNILKDTLVFDVYQGEHLEKGKKSIAIRLNYLDTEETLTDERVSKVQAEIEAALIEQGAVIR
>Q4L5E4 6.1.1.20~~~pheT~~~Phenylalanine--tRNA ligase beta subunit~~~COG0072
MLISNEWLKDYVDAGVKVEDLAERITRTGIEVDDMIDYSKDIKNLVVGYIQSKEKHPDADKLNICQVDIGEEEPVQIVCG
APNVDAGQHVIVAKVGGRLPGGIKIKRAKLRGERSEGMICSLQEIGISSNVVPKAYENGIFVFQTEVEPGTDALTALYLN
DQVMEFDLTPNRADALSMVGTAYEVAALYQTEMTKPETQSNETSESATNELSVTIDNPEKVPYYSARVVKNVSIEPSPIW
VQARLIKAGIRPINNVVDISNYVLLEYGQPLHMFDQDHIGSKEIVVRQAKDEETMTTLDNNERKLVDTDIVISNGQEPIA
LAGVMGGDFSEVTEQTTNVVIEGAIFDPVSIRHTSRRLNLRSEASSRFEKGIATEFVDEAVDRACYLLQELASGEVLQDR
VSSGDLGSFVTPIDITAEKVNKTIGFNLSNDEIQSIFRQLGFETTLKGETLTVNVPSRRKDITIKEDLIEEVARIYGYDE
IPSSLPVFGEVTSGELTDRQHKTRTLKETLEGAGLNQAITYSLVSKDHAKDFALQERPTISLLMPMSEAHATLRQSLLPH
LIEATAYNVARKNKDVRLYEIGRVFFGNGEGELPDEVEYLSGILTGEYVVNAWQGKKEEIDFFIAKGVVDRVAEKLNLEF
SYKAGKIEGLHPGRTAIVSLEGQDIGFIGELHPQVAADNDLKRTYVFELNYDAMMQVAVGYINYEQIPKFPGVTRDIALE
VNHDVPSSELKQIIHNNGEDILQSTLVFDVYEGEHLEKGKKSVAIRLNYLDTEDTLTDERVSKIHDKILEALQAQGATIR
>Q5XCX3 6.1.1.20~~~pheT~~~Phenylalanine--tRNA ligase beta subunit~~~
MLVSYKWLKELVDIDVTPAALAEKMSTTGIEVEGIEVPAEGLSKLVVGHVLSCEDVPETHLHLCQVDTGDETPRQIVCGA
PNVKAGIKVIVAVPGARIADNYKIKKGKIRGMESLGMICSLQELGLSDSIIPKEFSDGIQILPEEAVPGDAIFKYLDLDD
HIIELSITPNRADALSMRGVAHEVAAIYGKSVSFPQKNLQESDKATSEAIEVAIASDKVLTYASRVVENVKVKPSPQWLQ
NLLMNAGIRPINNVVDVTNYVLLYFGQPMHAFDYDKFEDHKIVARAARQGESLVTLDGEKRDLTTEDLVITVADKPVALA
GVMGGQATEIDGNSQTVVLEAAVFDGKSIRKTSGRLNLRSESSSRFEKGVNYATVLEALDFAAAMLQELAEGQVLSGHVQ
AGQLPTEPVEVSTSLDYVNVRLGTELTFADIQRIFNQLGFGLTGDETRFTVAVPRRRWDISIPADLVEEIARIYGYDKLP
TTLPEAGGTAAELTPTQALRRKVRGLAEGLGLTEIISYALTTPEKAIEFAVAPSHLTELMWPMSVERSALRQNMVSGMLD
TVAYNVARKQSNLALYEIGKIFEQEVNPKEDLPNEVNHFAFAICGLVAQKDFQTQAQAVDFYHAKGILDTLFANLNLKVQ
YVPTKDLANMHPGRTALILLDEQVIGFVGQVHPGTAKAYSIPETYVAELDMAALEAALPSDQTFVEITKFPAMTRDVALL
LDREVSHQTIVTAIESAGVKRLTKIKLFDVYEGATIQAGQKSMAYSLTFQNPNDNLTDEEVAKYMEKITKSLTEQVGAEV
R
>Q5SGX1 6.1.1.20~~~pheT~~~Phenylalanine--tRNA ligase beta subunit~~~COG0072
MRVPFSWLKAYVPELESPEVLEERLAGLGFETDRIERVFPIPRGVVFARVLEAHPIPGTRLKRLVLDAGRTVEVVSGAEN
ARKGIGVALALPGTELPGLGQKVGERVIQGVRSFGMALSPRELGVGEYGGGLLEFPEDALPPGTPLSEAWPEEVVLDLEV
TPNRPDALGLLGLARDLHALGYALVEPEAALKAEALPLPFALKVEDPEGAPHFTLGYAFGLRVAPSPLWMQRALFAAGMR
PINNVVDVTNYVMLERAQPMHAFDLRFVGEGIAVRRAREGERLKTLDGVERTLHPEDLVIAGWRGEESFPLGLAGVMGGA
ESEVREDTEAIALEVACFDPVSIRKTARRHGLRTEASHRFERGVDPLGQVPAQRRALSLLQALAGARVAEALLEAGSPKP
PEAIPFRPEYANRLLGTSYPEAEQIAILKRLGCRVEGEGPTYRVTPPSHRLDLRLEEDLVEEVARIQGYETIPLALPAFF
PAPDNRGVEAPYRKEQRLREVLSGLGFQEVYTYSFMDPEDARRFRLDPPRLLLLNPLAPEKAALRTHLFPGLVRVLKENL
DLDRPERALLFEVGRVFREREETHLAGLLFGEGVGLPWAKERLSGYFLLKGYLEALFARLGLAFRVEAQAFPFLHPGVSG
RVLVEGEEVGFLGALHPEIAQELELPPVHLFELRLPLPDKPLAFQDPSRHPAAFRDLAVVVPAPTPYGEVEALVREAAGP
YLESLALFDLYQGPPLPEGHKSLAFHLRFRHPKRTLRDEEVEEAVSRVAEALRARGFGLRGLDTP
>P27002 6.1.1.20~~~pheT~~~Phenylalanine--tRNA ligase beta subunit~~~
MRVPFSWLKAYVPELESPEVLEERLAGLGFETDRIERVFPIPRGVVFARVLEAHPIPGTRLKRLVLDAGRTVEVVSGAEN
ARKGIGVALALPGTELPGLGQKVGERVIQGVRSFGMALSPRELGVGEYGGGLLEFPEDALPPGTPLSEAWPEEVVLDLEV
TPNRPDALGLLGLARDLHALGYALVEPEAALKAEALPLPFALKVEDPEGAPHFTLGYAFGLRVAPSPLWMQRALFAAGMR
PINNVVDVTNYVMLERAQPMHAFDLRFVGEGIAVRRAREGERLKTLDGVERTLHPEDLVIAGWRGEESFPLGLAGVMGGA
ESEVREDTEAIALEVACFDPVSIRKTARRHGLRTEASHRFERGVDPLGQVPAQRRALSLLQALAGARVAEALLEAGSPKP
PEAIPFRPEYANRLLGTSYPEAEQIAILKRLGCRVEGEGPTYRVTPPSHRLDLRLEEDLVEEVARIQGYETIPLALPAFF
PAPDNRGVEAPYRKEQRLREVLSGLGFQEVYTYSFMDPEDARRFRLDPPRLLLLNPLAPEKAALRTHLFPGLVRVLKENL
DLDRPERALLFEVGRVFREREETHLAGLLFGEGVGLPWAKERLSGYFLLKGYLEALFARLGLAFRVEAQAFPFLHPGVSG
RVLVEGEEVGFLGALHPEIAQELELPPVHLFELRLPLPDKPLAFQDPSRHPAAFRDLAVVVPAPTPYGEVEALVREAAGP
YLESLALFDLYQGPPLPEGHKSLAFHLRFRHPKRTLRDEEVEEAVSRVAEALRARGFGLRGLDTP
>O67081 6.1.1.14~~~glyQ~~~Glycine--tRNA ligase alpha subunit~~~COG0752
MYFQDIIMTLHKFWAEKGCLIWQPYDVEVGAGTMNPATFLKVLGKKPWNVAYVEPSRRPQDGRYGENPNRLQHYYQFQVI
LKPAPRNPQEIYLESLERLGINPLEHDIRFVEDDWESPTLGAWGLGWEVWLDGMEITQFTYFQQAGGLDLDEISVEITYG
LERIAMYIQDKDSVFDIEWKEGITYGEIFKRSEWEWSKYNFELADTDMLFQVYEMFEKESKRMVEEGLIFPAYDYLLKCS
HVFNILDARGAISVQERARYIRRMNNLAREIAKLYLQVFENVGAT
>Q9PPK3 6.1.1.14~~~glyQ~~~Glycine--tRNA ligase alpha subunit~~~COG0752
MTFSQMILNLQNYWQEQGCAIMQPYDMPAGAGTFHPATFLRSLGKKPWAAAYVAPSRRPTDGRYGENPNRLGAYYQFQVL
IKPSPDNIQELYLKSLENLGFDLKSHDIRFVEDNWESPSLGAWGLGWEVWLDGMEVTQFTYFQQVGGIAVDLVSAEITYG
LERIAMYLQNVDNVYDIVWSEFNGEKIKYADVHKQSEYEFSKYNFEVSDVKILNEQFENSYKECKNILEQGLALPAYDYC
MLAAHTFNLLDARGAISVAQRQDYMLKIRELSKNCAEIYKKNLNEAE
>A1VZ59 6.1.1.14~~~glyQ~~~Glycine--tRNA ligase alpha subunit~~~COG0752
MTFSQMILNLQNYWQEQGCAIMQPYDMPAGAGTFHPATFLRSLGKKPWAAAYVAPSRRPTDGRYGENPNRLGAYYQFQVL
IKPSPDNIQELYLKSLENLGFDLKSHDIRFVEDNWESPSLGAWGLGWEVWLDGMEVTQFTYFQQVGGIAVDLVSAEITYG
LERIAMYLQNVDNVYDIVWSEFNGEKIKYADVHKQSEYEFSKYNFEVSDVKILNEQFENSYKECKNILEQGLALPAYDYC
MLAAHTFNLLDARGAISVAQRQDYMLKIRELSKNCAEIYKKNLNEAE
>P00960 6.1.1.14~~~glyQ~~~Glycine--tRNA ligase alpha subunit~~~COG0752
MQKFDTRTFQGLILTLQDYWARQGCTIVQPLDMEVGAGTSHPMTCLRELGPEPMAAAYVQPSRRPTDGRYGENPNRLQHY
YQFQVVIKPSPDNIQELYLGSLKELGMDPTIHDIRFVEDNWENPTLGAWGLGWEVWLNGMEVTQFTYFQQVGGLECKPVT
GEITYGLERLAMYIQGVDSVYDLVWSDGPLGKTTYGDVFHQNEVEQSTYNFEYADVDFLFTCFEQYEKEAQQLLALENPL
PLPAYERILKAAHSFNLLDARKAISVTERQRYILRIRTLTKAVAEAYYASREALGFPMCNKDK
>Q9WY59 6.1.1.14~~~glyQ~~~Glycine--tRNA ligase alpha subunit~~~COG0752
MYLQDVIMKLNDFWASKGCLLEQPYDMEVGAGTFHPATFFGSLRKGPWKVAYVQPSRRPTDGRYGENPNRLQRYFQYQVI
IKPSPENSQELYLESLEYLGINLKEHDIRFVEDNWESPTLGAWGVGWEVWLDGMEITQFTYFQQIGGISLKDIPLEITYG
LERIAMYLQGVDNVYEVQWNENVKYGDVFLENEREFSVFNFEEANVGLLFRHFDEYEKEFYRLVEKNLYLPAYDYILKCS
HTFNLLDARGAISVSQRQTYVKRIQAMARKAARVFLEVQANENSPA
>P00961 6.1.1.14~~~glyS~~~Glycine--tRNA ligase beta subunit~~~COG0751
MSEKTFLVEIGTEELPPKALRSLAESFAANFTAELDNAGLAHGTVQWFAAPRRLALKVANLAEAQPDREIEKRGPAIAQA
FDAEGKPSKAAEGWARGCGITVDQAERLTTDKGEWLLYRAHVKGESTEALLPNMVATSLAKLPIPKLMRWGASDVHFVRP
VHTVTLLLGDKVIPATILGIQSDRVIRGHRFMGEPEFTIDNADQYPEILRERGKVIADYEERKAKIKADAEEAARKIGGN
ADLSESLLEEVASLVEWPVVLTAKFEEKFLAVPAEALVYTMKGDQKYFPVYANDGKLLPNFIFVANIESKDPQQIISGNE
KVVRPRLADAEFFFNTDRKKRLEDNLPRLQTVLFQQQLGTLRDKTDRIQALAGWIAEQIGADVNHATRAGLLSKCDLMTN
MVFEFTDTQGVMGMHYARHDGEAEDVAVALNEQYQPRFAGDDLPSNPVACALAIADKMDTLAGIFGIGQHPKGDKDPFAL
RRAALGVLRIIVEKNLNLDLQTLTEEAVRLYGDKLTNANVVDDVIDFMLGRFRAWYQDEGYTVDTIQAVLARRPTRPADF
DARMKAVSHFRTLDAAAALAAANKRVSNILAKSDEVLSDRVNASTLKEPEEIKLAMQVVVLRDKLEPYFTEGRYQDALVE
LAELREPVDAFFDKVMVMVDDKELRINRLTMLEKLRELFLRVADISLLQ
>P9WFV7 6.1.1.14~~~glyQS~~~Glycine--tRNA ligase~~~COG0441
MHHPVAPVIDTVVNLAKRRGFVYPSGEIYGGTKSAWDYGPLGVELKENIKRQWWRSVVTGRDDVVGIDSSIILPREVWVA
SGHVDVFHDPLVESLITHKRYRADHLIEAYEAKHGHPPPNGLADIRDPETGEPGQWTQPREFNMMLKTYLGPIETEEGLH
YLRPETAQGIFVNFANVVTTARKKPPFGIGQIGKSFRNEITPGNFIFRTREFEQMEMEFFVEPATAKEWHQYWIDNRLQW
YIDLGIRRENLRLWEHPKDKLSHYSDRTVDIEYKFGFMGNPWGELEGVANRTDFDLSTHARHSGVDLSFYDQINDVRYTP
YVIEPAAGLTRSFMAFLIDAYTEDEAPNTKGGMDKRTVLRLDPRLAPVKAAVLPLSRHADLSPKARDLGAELRKCWNIDF
DDAGAIGRRYRRQDEVGTPFCVTVDFDSLQDNAVTVRERDAMTQDRVAMSSVADYLAVRLKGS
>P99129 6.1.1.14~~~glyQS~~~Glycine--tRNA ligase~~~
MAKDMDTIVSLAKHRGFVFPGSDIYGGLSNTWDYGPLGVELKNNVKKAWWQKFITQSPFNVGIDAAILMNPKVWEASGHL
NNFNDPMIDNKDSKIRYRADKLIEDYMQDVKGDENFIADGLSFEQMKKIIDDEGIVCPVSKTANWTEIRQFNLMFKTFQG
VTEDSTNEIFLRPETAQGIFVNYKNVQRSMRKKLPFGIGQIGKSFRNEITPGNFIFRTREFEQMELEFFCKPGEEIEWQN
YWKTFASDWLTSLNMSSENMRLRDHDEDELSHYSNATTDIEYKFPFGWGELWGIASRTDFDLRKHAEHSGEDFRYHDPET
NEKYIPYCIEPSLGADRVTLAFLCDAYDEEGVEGSKDARTVLHFHPALAPYKAAILPLSKKLSGEAIKIFEQLSSKFSID
FDESQSIGKRYRRQDEIGTPYCVTFDFDSLEDNQVTVRDRDSMEQVRMPISELEAFLTEKTKF
>P56206 6.1.1.14~~~glyQS~~~Glycine--tRNA ligase~~~COG0423
MPASSLDELVALCKRRGFIFQSSEIYGGLQGVYDYGPLGVELKNNLKQAWWRRNVYERDDMEGLDASVLTHRLVLHYSGH
EATFADPMVDNRITKKRYRLDHLLKEQPEEVLKRLYRAMEVEEENLHALVQAMMQAPERAGGAMTAAGVLDPASGEPGDW
TPPRYFNMMFKTYVGPVEDEASLAYLRPETAQGIFVNFKNVLDATSRKLPFGIAQIGKAFRNEITPRNFIFRVREFEQME
IEYFVRPGEDEYWHRYWVEERLKWWQEMGLSRENLVPYQQPPEELAHYAKATVDILYRFPHGLEELEGIANRTDFDLGSH
TKDQEALGITARVLRNEHSTQRLAYRDPETGKWFVPYVIEPSAGVDRGVLALLAEAFTREELPNGEERIVLKLKPQLAPI
KVAVIPLVKNRPEITEYAKRLKARLLALGLGRVLYEDTGNIGKAYRRHDEVGTPFAVTVDYDTIGQSKDGTTRLKDTVTV
RDRDTMEQIRLHVDELEGFLRERLRW
>B0VKR7 6.1.1.21~~~hisS~~~Histidine--tRNA ligase~~~
MSSIVAIKGFNDVLPTQTAAWRRLEQHLASLMDAYGYQQIRLPIVEQTGLFKRAIGDATDIVEKEMYTFFDKGNPPESLT
LRPEGTAGCVRALVEHNLLRGATPRVWYMGPMFRYEKPQKGRYRQFHQFGVETFGVATPDIDAELIMLTARLWKRMGVDH
MVQLELNTLGETDERTEYRNALVAFLNEHKDALDEDSQRRLTTNPLRILDSKIESTQKILENAPKLHDFLKEDSLSHFQQ
LQDYLTAAGIKFVINQKLVRGLDYYNKTVFEWTTTALGSQGTVCAGGRYDGLVGQLKGKADQSVPAVGFAMGMERLLLLL
EQVEQAEIVRHCEAFLVAEPAYQSKALVLAEQLRDQLEAANSNIRIKTGSQSSMKSQMKKADQAGAVYAIILGEREWEAQ
QLAVKELATAEQSQVALAELVPFLIEKFTK
>Q2SWE3 6.1.1.21~~~hisS~~~Histidine--tRNA ligase~~~
MTEQKRKLEKLTGVKGMNDILPQDAGLWEFFEATVKSLLRAYGYQNIRTPIVEHTPLFTRGIGEVTDIVEKEMYSFVDAL
NGENLTLRPENTAAVVRAAIEHNMLYDGPKRLWYIGPMFRHERPQRGRYRQFHQVGVEALGFAGPDADAEIVMMCQRLWE
DLGLTGIKLEINSLGLAEERAAHRVELIKYLEQHADKLDDDAQRRLYTNPLRVLDTKNPALQEIVRNAPKLIDFLGDVSR
AHFEGLQRLLKANNVPFTINPRLVRGLDYYNLTVFEWVTDKLGAQGTVAAGGRYDPLIEQLGGKPTAACGWAMGIERILE
LLKEEHLVPEQEGVDVYVVHQGDAAREQAFIVAERLRDTGLDVILHCSADGAGASFKSQMKRADASGAAFAVIFGEDEVT
NGTASVKPLRGTGDDGEKSVQQSVPVESLTEFLINAMVATAEDGDD
>P60906 6.1.1.21~~~hisS~~~Histidine--tRNA ligase~~~COG0124
MAKNIQAIRGMNDYLPGETAIWQRIEGTLKNVLGSYGYSEIRLPIVEQTPLFKRAIGEVTDVVEKEMYTFEDRNGDSLTL
RPEGTAGCVRAGIEHGLLYNQEQRLWYIGPMFRHERPQKGRYRQFHQLGCEVFGLQGPDIDAELIMLTARWWRALGISEH
VTLELNSIGSLEARANYRDALVAFLEQHKEKLDEDCKRRMYTNPLRVLDSKNPEVQALLNDAPALGDYLDEESREHFAGL
CKLLESAGIAYTVNQRLVRGLDYYNRTVFEWVTNSLGSQGTVCAGGRYDGLVEQLGGRATPAVGFAMGLERLVLLVQAVN
PEFKADPVVDIYLVASGADTQSAAMALAERLRDELPGVKLMTNHGGGNFKKQFARADKWGARVAVVLGESEVANGTAVVK
DLRSGEQTAVAQDSVAAHLRTLLG
>P75069 6.1.1.21~~~hisS~~~Histidine--tRNA ligase~~~
MSVLQKPRGVKDWYGEELIYFNWTVHQITNLAWKWGFSEVKTPLLEYAEAFKRTNANADIVKKELYEFHDKSNRLLALRP
EATAGIVRLVCENKLLQPQNYPLRLFTIGTMYRYERPQSNRYREHYQFSCEVIGDTNPTVLLDTLLLGHAIIQQLGIEGV
ILKLNNLGNSATIQQWNQALQAYLTQFKAQLTELSQSRLSTNPLRILDDKVDGQLPFISDAPQIEQFLDAEQQALNTWLQ
QQLTQQQVPFEWNPTLVRGLDYYTGVVFEFVKDDTTVLAGGVYDNLVEELGGTPTKALGFACGIERSINCLSAVKKQAIL
ANQPPRLLVIGLTEAALEKLLQLSLGWRAYHPVTIYPKVIRIINGIRAAQRLGYRFLGVIGGNNLEQQTITVKDLATEQQ
TTYTWDEFRQRQVL
>P9WFV5 6.1.1.21~~~hisS~~~Histidine--tRNA ligase~~~COG0124
MTEFSSFSAPKGVPDYVPPDSAQFVAVRDGLLAAARQAGYSHIELPIFEDTALFARGVGESTDVVSKEMYTFADRGDRSV
TLRPEGTAGVVRAVIEHGLDRGALPVKLCYAGPFFRYERPQAGRYRQLQQVGVEAIGVDDPALDAEVIAIADAGFRSLGL
DGFRLEITSLGDESCRPQYRELLQEFLFGLDLDEDTRRRAGINPLRVLDDKRPELRAMTASAPVLLDHLSDVAKQHFDTV
LAHLDALGVPYVINPRMVRGLDYYTKTAFEFVHDGLGAQSGIGGGGRYDGLMHQLGGQDLSGIGFGLGVDRTVLALRAEG
KTAGDSARCDVFGVPLGEAAKLRLAVLAGRLRAAGVRVDLAYGDRGLKGAMRAAARSGARVALVAGDRDIEAGTVAVKDL
TTGEQVSVSMDSVVAEVISRLAG
>Q8YMC2 6.1.1.21~~~hisS~~~Histidine--tRNA ligase~~~COG0124
MAKNDKINFSTPSGFPEFLPSEKRLELYLLDTIRRVYESYGFTPIETPAVERLEVLQAKGNQGDNIIYGLEPILPPNRQA
EKDKSGDTGSEARALKFDQTVPLAAYIARHLNDLTFPFARYQMDVVFRGERAKDGRFRQFRQCDIDVVGREKLSLLYDAQ
MPAIITEIFEAVNIGDFVIRINNRKVLTGFFQSLNISETQIKSCISIIDNLEKIGEAKVKLELEKEGINPEQTQKIIDFV
KIDGSVDDVLDKLKHLSQTLPESEQFNLGVSELETVITGVRNLGVPDKRFCIDLAIARGLNYYTGTVYETTLIGHEALGS
ICSGGRYEELVGTFIGEKMPGVGISIGLTRLISRLLKAGILNTLPPTPAQVVVVNMQDELMPTYLKVSQQLRQAGLNVIT
NFEKRQLGKQFQAADKQGIRFCVIIGADEAAAQKSSLKDLQSGEQVEVALADLAEEIKRRLT
>O52765 6.1.1.21~~~hisS~~~Histidine--tRNA ligase~~~
MAKNIQAIRGMNDYLPGETAIWQRIEGTLKNVLGSYGYSEIRLPIVEQTPLFKRAIGEVTDVVEKEMYTFEDRNGDSLTL
RPEGTAGCVRAGIEHGLLYNQEQRLWYIGPMFRHERPQKGRYRQFHQLGAEVFGLQGPDIDAELIMLTARWWRALGIAEH
VSLELNSIGSLEARANYRDALVAFLEQHQETLDEDCKRRMYTNPLRVLDSKNPDVQALLNDAPALGDYLDDDSREHFAGL
CKLLDAAGIAYTVNQRLVRGLDYYNRTVFEWVTNSLGSQGTVCAGGRYDGLVEQLGGRATPAVGFAMGLERLVLLVQAVN
PEFIASPVVDIYLVAAGAQTQSAAMTLAERLRDEMPGVKLMTNHGGGNFKKQFARADKWGARIALVLGESEVADGTVVVK
DLRSGEQTAVAQDSVAAHLRTLLG
>P60910 6.1.1.21~~~hisS~~~Histidine--tRNA ligase~~~
MIKIPRGTQDILPEDSKKWRYIENQLDELMTFYNYKEIRTPIFESTDLFARGVGDSTDVVQKEMYTFKDKGDRSITLRPE
GTAAVVRSYIEHKMQGNPNQPIKLYYNGPMFRYERKQKGRYRQFNQFGVEAIGAENPSVDAEVLAMVMHIYQSFGLKHLK
LVINSVGDMASRKEYNEALVKHFEPVIHEFCSDCQSRLHTNPMRILDCKVDRDKEAIKTAPRITDFLNEESKAYYEQVKA
YLDDLGIPYIEDPNLVRGLDYYTHTAFELMMDNPNYDGAITTLCGGGRYNGLLELLDGPSETGIGFALSIERLLLALEEE
GIELDIEENLDLFIVTMGDQADRYAVKLLNHLRHNGIKADKDYLQRKIKGQMKQADRLGAKFTIVIGDQELENNKIDVKN
MTTGESETIELDALVEYFKK
>P60911 6.1.1.21~~~hisS~~~Histidine--tRNA ligase~~~
MIKIPRGTQDILPEDSKKWRYIENQLDELMTFYNYKEIRTPIFESTDLFARGVGDSTDVVQKEMYTFKDKGDRSITLRPE
GTAAVVRSYIEHKMQGNPNQPIKLYYNGPMFRYERKQKGRYRQFNQFGVEAIGAENPSVDAEVLAMVMHIYQSFGLKHLK
LVINSVGDMASRKEYNEALVKHFEPVIHEFCSDCQSRLHTNPMRILDCKVDRDKEAIKTAPRITDFLNEESKAYYEQVKA
YLDDLGIPYIEDPNLVRGLDYYTHTAFELMMDNPNYDGAITTLCGGGRYNGLLELLDGPSETGIGFALSIERLLLALEEE
GIELDIEENLDLFIVTMGDQADRYAVKLLNHLRHNGIKADKDYLQRKIKGQMKQADRLGAKFTIVIGDQELENNKIDVKN
MTTGESETIELDALVEYFKK
>P30053 6.1.1.21~~~hisS~~~Histidine--tRNA ligase~~~
MKLQKPKGTQDILSVAAAKWQYVEGVARETFKQYHYGEIRTPMFEHYEVISRSVGDTTDIVTKEMYDFYDKGDRHITLRP
EGTAPVVRSYVENKLFAPEVQKPVKLYYIGSMFRYERPQAGRLREFHQIGVECFGSANPATDVETIAMAYHLFERLGIKG
VTLHLNSLGNAASRAAYRQALIDYLSPMRDTLSKDSQRRLDENPLRVLDSKEKEDKIAVANAPSILDYQDEESQAHFDAV
RSMLEALAIPYVIDTNMVRGLDYYNHTIFEFITEVDQSELTICAGGRYDGLVEYFGGPATPGFGFGLGLERLLLILDKQG
VELPVEEGLDVYIAVLGADANVAALALTQAIRRQGFTVERDYLGRKIKAQFKSADTFKAKVVITLGESEIKAGQAVLKHN
QTRQEMTVSFDQIQTDFASIFAECVQ
>P62374 6.1.1.21~~~hisS~~~Histidine--tRNA ligase~~~COG0124
MTARAVRGTKDLFGKELRMHQRIVATARKVLEAAGALELVTPIFEETQVFEKGVGAATDIVRKEMFTFQDRGGRSLTLRP
EGTAAMVRAYLEHGMKVWPQPVRLWMAGPMFRAERPQKGRYRQFHQVNYEALGSENPILDAEAVVLLYECLKELGLRRLK
VKLSSVGDPEDRARYNAYLREVLSPHREALSEDSKERLELNPMRILDSKSERDQALLKELGVRPMLDFLGEEARAHLKEV
ERHLERLSVPYELEPALVRGLDYYVRTAFEVHHEEIGAQSALGGGGRYDGLSELLGGPRVPGVGFAFGVERVALALEAEG
FGLPEEKGPDLYLIPLTEEAVAEAFYLAEALRPRLRAEYALAPRKPAKGLEEALKRGAAFAGFLGEDELRAGEVTLKRLA
TGEQVRLSREEVPGYLLQALG
>P56194 6.1.1.21~~~hisS~~~Histidine--tRNA ligase~~~COG0124
MTARAVRGTKDLFGKELRMHQRIVATARKVLEAAGALELVTPIFEETQVFEKGVGAATDIVRKEMFTFQDRGGRSLTLRP
EGTAAMVRAYLEHGMKVWPQPVRLWMAGPMFRAERPQKGRYRQFHQVNYEALGSENPILDAEAVVLLYECLKELGLRRLK
VKLSSVGDPEDRARYNAYLREVLSPHREALSEDSKERLELNPMRILDSKSERDQALLKELGVRPMLDFLGEEARAHLKEV
ERHLERLSVPYELEPALVRGLDYYVRTAFEVHHEEIGAQSALGGGGRYDGLSELLGGPRVPGVGFAFGVERVALALEAEG
FGLPEEKGPDLYLIPLTEEAVAEAFYLAEALRPRLRAEYALAPRKPAKGLEEALKRGAAFAGFLGEDELRAGEVTLKRLA
TGEQVRLSREEVPGYLLQALG
>P41972 6.1.1.5~~~ileS~~~Isoleucine--tRNA ligase~~~
MDYKETLLMPKTDFPMRGGLPNKEPQIQEKWDAEDQYHKALEKNKGNETFILHDGPPYANGNLHMGHALNKILKDFIVRY
KTMQGFYAPYVPGWDTHGLPIEQALTKKGVDRKKMSTAEFREKCKEFALEQIELQKKDFRRLGVRGDFNDPYITLKPEYE
AAQIRIFGEMADKGLIYKGKKPVYWSPSSESSLAEAEIEYHDKRSASIYVAFNVKDDKGVVDADAKFIIWTTTPWTIPSN
VAITVHPELKYGQYNVNGEKYIIAEALSDAVAEALDWDKASIKLEKEYTGKELEYVVAQHPFLDRESLVINGDHVTTDAG
TGCVHTAPGHGEDDYIVGQKYELPVISPIDDKGVFTEEGGQFEGMFYDKANKAVTDLLTEKGALLKLDFITHSYPHDWRT
KKPVIFRATPQWFASISKVRQDILDAIENTNFKVNWGKTRIYNMVRDRGEWVISRQRVWGVPLPVFYAENGEIIMTKETV
NHVADLFAEHGSNIWFEREAKDLLPEGFTHPGSPNGTFTKETDIMDVWFDSGSSHRGVLETRPELSFPADMYLEGSDQYR
GWFNSSITTSVATRGVSPYKFLLSHGFVMDGEGKKMSKSLGNVIVPDQVVKQKGADIARLWVSSTDYLADVRISDEILKQ
TSDVYRKIRNTLRFMLGNINDFNPDTDSIPESELLEVDRYLLNRLREFTASTINNYENFDYLNIYQEVQNFINVELSNFY
LDYGKDILYIEQRDSHIRRSMQTVLYQILVDMTKLLAPILVHTAEEVWSHTPHVKEESVHLADMPKVVEVDQALLDKWRT
FMNLRDDVNRALETARNEKVIGKSLEAKVTIASNDKFNASEFLTSFDALHQLFIVSQVKVVDKLDDQATAYEHGDIVIEH
ADGEKCERCWNYSEDLGAVDELTHLCPRCQQVVKSLV
>Q8L1B1 6.1.1.5~~~ileS2~~~Isoleucine--tRNA ligase 2~~~
MSTEGSGPVRFPAMEDAVLERWEKEKTFEQSISAREGKPVYVFYDGPPFATGLPHYGHILTSYIKDVIPRYQTMLGKQVP
RRWGWDCHGLPVEFEVEKAMGFKSKRDILEFGVEQFNDECRELVLKYADDWRGFVNRMGRWVDFDGAYKTMDNDYMESVL
WGFKTLHDKGHVYEGGKIVPYCVRCQTVLSNFEARLDDAFRPRRDMSAYVKFRQQDRPDTFFLAWTTTPWTLPANVALAV
AADENYVCIEHGEERLWLAEGCLGGLFDEPVILERCTGAELAGLRYLPVVGEVIDASAHRVVTADFVQMGDGSGIVHIAP
AFGEDDALLGQQYELPAPNPVRDDGTFSDAVAQYAGQNIFEATPRILADLKSSGLLFKQEQIEHNYPHCWRCDNPLIYRA
VESWFIRASALREQLVENNSQVNWVPEHVKEGRFGDWIRNARDWAVSRNRFWGAPIPVWRCDQCGTVEVMGSIAQIEARS
GRKVEDLHVPHIDEHRFACQCCEGTMSRVTGVFDCWFESGAMPFASRHYPFENKQEFEQTFPADFIVEYLAQTRGWFYTM
MVISTGCFEQNPFKNAMCHGVILAKDGRKMSKRLKNYPNPMDLMQTHGSDALRVALLASPVCKGEDIKFSEESVRDVVRR
YHLLFWNCLQFYKTFTEIDQFSPSGDLGQPLDNVLDHYLLHELAALESDIKMWMESLDFSKIYSRIEVFINVLSTWYLRL
NKARIWRDGLDDDKRQCYEVLHYALSNFARLLAPFMPFLAEAVYTELGYADSVHLQDWPSIDRQYLSYELADEMSSLRNL
IASVRNVRETNGVSQKFPLRSIRVAGIEQAVLERYAQFLEEELNVKQVQWAADADEWAQPVVVLIFSLLGKRLGPAMKAV
TTAVKAGEYVIDEQGGLVAAGQTIQPHEFERRLTVRDTLNNVGIVENMVVWLDLDIDASLKREGAVRELNRRLQDLRKKA
KLGYTEKVDIAVLGGAYVDEILVHHEDWLKSQLLVQSLLRSDLEAPLAVDEVELPEGDPVRIQLRRSVLA
>Q72AR5 6.1.1.5~~~ileS~~~Isoleucine--tRNA ligase~~~COG0060
MSDYKKTLQLPETKFPMKANLTQREPEMLRKWEKDDAYGAMVRASGQQGTYVLHDGPPYANGNIHMGHALNKILKDIIVK
SRNLQGFKAEYVPGWDCHGLPIELKVEHELGEKKRTMPAHAVRKRCRQYAEKYLDIQRKEFKRLGVFGAWDKPYVTMHPS
YEAATARELGNFAAKGGLVRSKKPIYWCCSCQTALAEAEVEYHDHTSPSVHVRFPLRDPRVAEVLPGVDPAHAYIVIWTT
TPWTLPDNMAVAVHPDFDYVVVRHGGDFHIVAEGLLEACLKAFKWDEHEVVARIGGRALEGLKATHPFYDRPSPIVLADY
VTLESGTGCVHTAPGHGREDYETGLRYGLDIYSPLTDEGRYLDCVEFFAGMTIFEANPKVIEKLREVGNLLAEGRITHSY
PHCWRCKKPVIFRATTQWFIAMERNDLRQKALDAIRDDVRWIPSWGQERIHNMIEFRPDWCISRQRMWGVPIVALLCEDC
GEAWNDADWMRDIAERFAKHATGCDYWYETDLSDIVPAGLRCPKCGGDHWKKETDILDVWFDSGTSFAAVVEQREECGFP
ADLYLEGSDQHRGWFHSSLLASIGTRGVPPYRSVLTHGYVVDGDGRKMSKSVGNVVAPQEIIDKHGAEVLRLWVASVDYR
EDIRISEEILNRLVDAYRRIRNTCRYLLGNISDLTPETMVPFEAMDPLDRFALDLASRAHERIQDAYTEYEFHKVFHTLH
NLCVTDLSAFYLDILKDRLYSSAADSHARRSAQTALYRILMLMVRDMAPVLSFTAEEVFGYVPAALRPDVISVFALPATD
APAFTLDTTSRAAWEKLLAVRSETTKAIEPLRKSGEVGHSLDTHVTLFADPSLKATLEGLGSDLRAMFIVSRLEVMDLAD
APADAWTSEELPELKVTVRKAEGEKCERCWIISADLGTDAAHPTLCPRCTAVLTGTGA
>P00956 6.1.1.5~~~ileS~~~Isoleucine--tRNA ligase~~~COG0060
MSDYKSTLNLPETGFPMRGDLAKREPGMLARWTDDDLYGIIRAAKKGKKTFILHDGPPYANGSIHIGHSVNKILKDIIVK
SKGLSGYDSPYVPGWDCHGLPIELKVEQEYGKPGEKFTAAEFRAKCREYAATQVDGQRKDFIRLGVLGDWSHPYLTMDFK
TEANIIRALGKIIGNGHLHKGAKPVHWCVDCRSALAEAEVEYYDKTSPSIDVAFQAVDQDALKAKFAVSNVNGPISLVIW
TTTPWTLPANRAISIAPDFDYALVQIDGQAVILAKDLVESVMQRIGVTDYTILGTVKGAELELLRFTHPFMGFDVPAILG
DHVTLDAGTGAVHTAPGHGPDDYVIGQKYGLETANPVGPDGTYLPGTYPTLDGVNVFKANDIVVALLQEKGALLHVEKMQ
HSYPCCWRHKTPIIFRATPQWFVSMDQKGLRAQSLKEIKGVQWIPDWGQARIESMVANRPDWCISRQRTWGVPMSLFVHK
DTEELHPRTLELMEEVAKRVEVDGIQAWWDLDAKEILGDEADQYVKVPDTLDVWFDSGSTHSSVVDVRPEFAGHAADMYL
EGSDQHRGWFMSSLMISTAMKGKAPYRQVLTHGFTVDGQGRKMSKSIGNTVSPQDVMNKLGADILRLWVASTDYTGEMAV
SDEILKRAADSYRRIRNTARFLLANLNGFDPAKDMVKPEEMVVLDRWAVGCAKAAQEDILKAYEAYDFHEVVQRLMRFCS
VEMGSFYLDIIKDRQYTAKADSVARRSCQTALYHIAEALVRWMAPILSFTADEVWGYLPGEREKYVFTGEWYEGLFGLAD
SEAMNDAFWDELLKVRGEVNKVIEQARADKKVGGSLEAAVTLYAEPELSAKLTALGDELRFVLLTSGATVADYNDAPADA
QQSEVLKGLKVALSKAEGEKCPRCWHYTQDVGKVAEHAEICGRCVSNVAGDGEKRKFA
>P9WFV3 6.1.1.5~~~ileS~~~Isoleucine--tRNA ligase~~~COG0060
MTDNAYPKLAGGAPDLPALELEVLDYWSRDDTFRASIARRDGAPEYVFYDGPPFANGLPHYGHLLTGYVKDIVPRYRTMR
GYKVERRFGWDTHGLPAELEVERQLGITDKSQIEAMGIAAFNDACRASVLRYTDEWQAYVTRQARWVDFDNDYKTLDLAY
MESVIWAFKQLWDKGLAYEGYRVLPYCWRDETPLSNHELRMDDDVYQSRQDPAVTVGFKVVGGQPDNGLDGAYLLVWTTT
PWTLPSNLAVAVSPDITYVQVQAGDRRFVLAEARLAAYARELGEEPVVLGTYRGAELLGTRYLPPFAYFMDWPNAFQVLA
GDFVTTDDGTGIVHMAPAYGEDDMVVAEAVGIAPVTPVDSKGRFDVTVADYQGQHVFDANAQIVRDLKTQSGPAAVNGPV
LIRHETYEHPYPHCWRCRNPLIYRSVSSWFVRVTDFRDRMVELNQQITWYPEHVKDGQFGKWLQGARDWSISRNRYWGTP
IPVWKSDDPAYPRIDVYGSLDELERDFGVRPANLHRPYIDELTRPNPDDPTGRSTMRRIPDVLDVWFDSGSMPYAQVHYP
FENLDWFQGHYPGDFIVEYIGQTRGWFYTLHVLATALFDRPAFKTCVAHGIVLGFDGQKMSKSLRNYPDVTEVFDRDGSD
AMRWFLMASPILRGGNLIVTEQGIRDGVRQVLLPLWNTYSFLALYAPKVGTWRVDSVHVLDRYILAKLAVLRDDLSESME
VYDIPGACEHLRQFTEALTNWYVRRSRSRFWAEDADAIDTLHTVLEVTTRLAAPLLPLITEIIWRGLTRERSVHLTDWPA
PDLLPSDADLVAAMDQVRDVCSAASSLRKAKKLRVRLPLPKLIVAVENPQLLRPFVDLIGDELNVKQVELTDAIDTYGRF
ELTVNARVAGPRLGKDVQAAIKAVKAGDGVINPDGTLLAGPAVLTPDEYNSRLVAADPESTAALPDGAGLVVLDGTVTAE
LEAEGWAKDRIRELQELRKSTGLDVSDRIRVVMSVPAEREDWARTHRDLIAGEILATDFEFADLADGVAIGDGVRVSIEK
T
>P67509 6.1.1.5~~~ileS~~~Isoleucine--tRNA ligase~~~
MDYKETLLMPKTDFPMRGGLPNKEPQIQEKWDAEDQYHKALEKNKGNETFILHDGPPYANGNLHMGHALNKILKDFIVRY
KTMQGFYAPYVPGWDTHGLPIEQALTKKGVDRKKMSTAEFREKCKEFALEQIELQKKDFRRLGVRGDFNDPYITLKPEYE
AAQIRIFGEMADKGLIYKGKKPVYWSPSSESSLAEAEIEYHDKRSASIYVAFDVKDDKGVVDADAKFIIWTTTPWTIPSN
VAITVHPELKYGQYNVNGEKYIIAEALSDAVAEALDWDKASIKLEKEYTGKELEYVVAQHPFLDRESLVINGDHVTTDAG
TGCVHTAPGHGEDDYIVGQKYELPVISPIDDKGVFTEEGGQFEGMFYDKANKAVTDLLTEKGALLKLDFITHSYPHDWRT
KKPVIFRATPQWFASISKVRQDILDAIENTNFKVNWGKTRIYNMVRDRGEWVISRQRVWGVPLPVFYAENGEIIMTKETV
NHVADLFAEHGSNIWFEREAKDLLPEGFTHPGSPNGTFTKETDIMDVWFDSGSSHRGVLETRPELSFPADMYLEGSDQYR
GWFNSSITTSVATRGVSPYKFLLSHGFVMDGEGKKMSKSLGNVIVPDQVVKQKGADIARLWVSSTDYLADVRISDEILKQ
TSDVYRKIRNTLRFMLGNINDFNPDTDSIPESELLEVDRYLLNRLREFTASTINNYENFDYLNIYQEVQNFINVELSNFY
LDYGKDILYIEQRDSHIRRSMQTVLYQILVDMTKLLAPILVHTAEEVWSHTPHVKEESVHLADMPKVVEVDQALLDKWRT
FMNLRDDVNRALETARNEKVIGKSLEAKVTIASNDKFNASEFLTSFDALHQLFIVSQVKVVDKLDDQATAYEHGDIVIEH
ADGEKCERCWNYSEDLGAVDELTHLCPRCQQVVKSLV
>P56690 6.1.1.5~~~ileS~~~Isoleucine--tRNA ligase~~~COG0060
MFKEVGEPNFPKLEEEVLAFWKREKIFQKSVENRKGGPRYTVYEGPPTANGLPHVGHAQARSYKDLFPRYKTMRGYYAPR
RAGWDTHGLPVELEVEKKLGLKSKREIEAYGIERFNQACRESVFTYEKEWEAFTERIAYWVDLENAYATLEPTYIESIWW
SLKNLFDRGLLYRDHKVVPYCPRCGTPLSSHEVALGYKEIQDPSVYVRFPLKEPKKLGLEKASLLIWTTTPWTLPGNVAA
AVHPEYTYAAFQVGDEALILEEGLGRKLLGEGTPVLKTFPGKALEGLPYTPPYPQALEKGYFVVLADYVSQEDGTGIVHQ
APAFGAEDLETARVYGLPLLKTVDEEGKLLVEPFKGLYFREANRAILRDLRGRGLLFKEESYLHSYPHCWRCSTPLMYYA
TESWFIKNTLFKDELIRKNQEIHWVPPHIKEGRYGEWLKNLVDWALSRNRYWGTPLPIWVCQACGKEEAIGSFQELKARA
TKPLPEPFDPHRPYVDQVELACACGGTMRRVPYVIDVWYDSGAMPFASLHYPFEHEEVFRESFPADFIAEGIDQTRGWFN
SLHQLGVMLFGSIAFKNVICHGLILDEKGQKMSKSKGNVVDPWDIIREFGADALRWYIYVSAPPEADRRFGPNLVRETVR
DYFLTLWNVYSFFVTYANLDRPDLKNPPPPEKRPEMDRWLLARMQDLIQRVTEALEAYDPTTSARALRDFVVEDLSQWYV
RRNRRRFWKNEDALDREAAYATLYEALVLVATLAAPFTPFLAEVLWQNLVRSVRPEAKESVHLADWPEADPALADEALVA
QMRAVLKVVDLARAARAKSGVKTRTPLPLLLVTAPTALEREGLKRFAHEIAEELNVKEVRVLEPGEEILSYRVLPNLKLL
GRKYGKLVPKIREALQRERERAAALALKGEAIPLEVEGEALTLLPEEVLLEAEAPKGYQALEKDGYVAALKVEVTEALRM
EGLARDLIRLLQQARKDMGLKVSDRIRVGYEAEGPYLEALKRHGPWIAEEVLATAFGEGLFGGFEARVEDEEGKAVFHLA
RAE
>P0A8N3 6.1.1.6~~~lysS~~~Lysine--tRNA ligase~~~COG1190
MSEQHAQGADAVVDLNNELKTRREKLANLREQGIAFPNDFRRDHTSDQLHAEFDGKENEELEALNIEVAVAGRMMTRRIM
GKASFVTLQDVGGRIQLYVARDDLPEGVYNEQFKKWDLGDILGAKGKLFKTKTGELSIHCTELRLLTKALRPLPDKFHGL
QDQEARYRQRYLDLISNDESRNTFKVRSQILSGIRQFMVNRGFMEVETPMMQVIPGGAAARPFITHHNALDLDMYLRIAP
ELYLKRLVVGGFERVFEINRNFRNEGISVRHNPEFTMMELYMAYADYKDLIELTESLFRTLAQDILGKTEVTYGDVTLDF
GKPFEKLTMREAIKKYRPETDMADLDNFDSAKAIAESIGIHVEKSWGLGRIVTEIFEEVAEAHLIQPTFITEYPAEVSPL
ARRNDVNPEITDRFEFFIGGREIGNGFSELNDAEDQAQRFLDQVAAKDAGDDEAMFYDEDYVTALEHGLPPTAGLGIGID
RMVMLFTNSHTIRDVILFPAMRPVK
>P9WFU9 6.1.1.6~~~lysS1~~~Lysine--tRNA ligase 1~~~COG1190
MSAADTAEDLPEQFRIRRDKRARLLAQGRDPYPVAVPRTHTLAEVRAAHPDLPIDTATEDIVGVAGRVIFARNSGKLCFA
TLQDGDGTQLQVMISLDKVGQAALDAWKADVDLGDIVYVHGAVISSRRGELSVLADCWRIAAKSLRPLPVAHKEMSEESR
VRQRYVDLIVRPEARAVARLRIAVVRAIRTALQRRGFLEVETPVLQTLAGGAAARPFATHSNALDIDLYLRIAPELFLKR
CIVGGFDKVFELNRVFRNEGADSTHSPEFSMLETYQTYGTYDDSAVVTRELIQEVADEAIGTRQLPLPDGSVYDIDGEWA
TIQMYPSLSVALGEEITPQTTVDRLRGIADSLGLEKDPAIHDNRGFGHGKLIEELWERTVGKSLSAPTFVKDFPVQTTPL
TRQHRSIPGVTEKWDLYLRGIELATGYSELSDPVVQRERFADQARAAAAGDDEAMVLDEDFLAALEYGMPPCTGTGMGID
RLLMSLTGLSIRETVLFPIVRPHSN
>P0A8N5 6.1.1.6~~~lysU~~~Lysine--tRNA ligase, heat inducible~~~COG1190
MSEQETRGANEAIDFNDELRNRREKLAALRQQGVAFPNDFRRDHTSDQLHEEFDAKDNQELESLNIEVSVAGRMMTRRIM
GKASFVTLQDVGGRIQLYVARDSLPEGVYNDQFKKWDLGDIIGARGTLFKTQTGELSIHCTELRLLTKALRPLPDKFHGL
QDQEVRYRQRYLDLIANDKSRQTFVVRSKILAAIRQFMVARGFMEVETPMMQVIPGGASARPFITHHNALDLDMYLRIAP
ELYLKRLVVGGFERVFEINRNFRNEGISVRHNPEFTMMELYMAYADYHDLIELTESLFRTLAQEVLGTTKVTYGEHVFDF
GKPFEKLTMREAIKKYRPETDMADLDNFDAAKALAESIGITVEKSWGLGRIVTEIFDEVAEAHLIQPTFITEYPAEVSPL
ARRNDVNPEITDRFEFFIGGREIGNGFSELNDAEDQAERFQEQVNAKAAGDDEAMFYDEDYVTALEYGLPPTAGLGIGID
RMIMLFTNSHTIRDVILFPAMRPQK
>O51603 6.1.1.6~~~lysS~~~Lysine--tRNA ligase~~~
MKTAHWADFYAEKIKKEKGPKNLYTVASGITPSGTVHIGNFREVISVDLVARALRDSGSKVRFIYSWDNYDVFRKVPKNM
PEQELLTTYLRQAITRVPDTRSHKTSYARANEIEFEKYLPVVGINPEFIDQSKQYTSNAYASQIKFALDHKKELSEALNE
YRTSKLEENWYPISVFCTKCNRDTTTVNNYDNHYSVEYSCECGNQESLDIRTTWAIKLPWRIDWPMRWKYEKVDFEPAGK
DHHSSGGSFDTSKNIVKIFQGSPPVTFQYDFISIKGRGGKISSSSGDVISLKDVLEVYTPEVTRFLFAATKPNTEFSISF
DLDVIKIYEDYDKFERIYYGVEDVKEEKKRAFKRIYELSQPYMPSKRIPYQVGFRHLSVISQIFENNINKILNYLKNVQE
DQKDKLINKINCAINWIRDFAPEDFKFSLRSKFDNMEILEENSKKAINELLDFLKKNFEVATEQDIQNEIYKISRENNIE
PALFFKQIYKILIDKEKGPKLAGFIKIIGIDRFEKITSKYV
>Q2SXD6 6.1.1.6~~~lysS~~~Lysine--tRNA ligase~~~
MTEPIQPQAAVAADENQIVAERRDKLRALRDQGIAYPNDFQPTHHAADLQTAYADADKEALEAKSLEVAIAGRMMLKRVM
GKASFATVQDGSGQIQFFVTPADVGAETYDAFKKWDLGDIVAARGVLFRTNKGELSVKCTQLRLLAKALRPLPDKFHGLA
DQETRYRQRYVDLIVTPETRTTFRARTKAIASIRKFMGDADFMEVETPMLHPIPGGAAAKPFVTHHNALDMEMFLRIAPE
LYLKRLIVGGFERVFEINRNFRNEGVSPRHNPEFTMMEFYAAYTDYRWLMDFTERLIRQAAVDALGTATIQYQGRELDLA
QPFHRLTITQAIQKYAPSYTDGQLSDDAFLRSELKRLGVDVTQPAFLNAGIGALQLALFEETAEAQLWEPTFIIDYPIEV
SPLARESDTVAGITERFELFITGREIANGFSELNDPEDQAARFKKQVEQKDAGDEEAMFFDADYIRALEYGMPPTGGCGI
GIDRLVMLLTDSPTIRDVLLFPHLRRED
>B0BAN6 6.1.1.6~~~lysS~~~Lysine--tRNA ligase~~~
MSVEVEYLQHEDYLYRTSKLKEIRDLGINPYPYQYTDCLEVQEIRNQFVDNELGDSEAAFRKETPKVRFAGRLVLFRSMG
KNAFGQILDNDAKIQVMFNRDFSAVAGLAADAGISPIKFIEKKLDLGDILGLEGYLFFTHSGELTVLVETVTLLCKSLIS
LPDKHAGLADKEIRYRKRWADLISSEDVRKTFLTRSRILKLIREYMDQQSFLEVETPILQTVYGGAEATPFVTTLQALHA
EMFLRISLEIALKKLLVGGMSRVYEIGKVFRNEGIDRTHNPEFTMIEAYAAYWDYNDVMKCVENLVEYIVRALNNGETQV
QYSHLKSGPQVVDFKAPWIRMTMKESISVYGGVDVDLHADHELRKILETQTSLPEKTYVHASRGELIALLFDELVCDKLI
APHHITDHPLETTPLCKTLRSGDETLVERFESFCLGKELCNAYSELNDPLQQRKLLEEQMRKKALNPDSEYHPIDEEFLE
ALCQGMPPAGGFGIGIDRLVMMLTDAASIRDVLFFPVMRRIEAKKD
>Q9RHV9 6.1.1.6~~~lysS~~~Lysine--tRNA ligase~~~
MSHEELNDQLRVRREKLKKIEELGVDPFGKRFERTHKAEELFELYGDLSKEELEEQQIEVAVAGRIMTKRGMGKAGFAHI
QDVTGQIQIYVRQDDVGEQQYELFKISDLGDIVGVRGTMFKTKVGELSIKVSSYEFLTKALRPLPEKYHGLKDIEQRYRQ
RYLDLIMNPESKKTFITRSLIIQSMRRYLDSHGYLEVETPMMHAVAGGAAARPFITHHNALDMTLYMRIAIELHLKRLIV
GGLEKVYEIGRVFRNEGISTRHNPEFTMLELYEAYADFRDIMKLTENLIAHIATEVLGTTKIQYGEHLVDLTPEWRRLHM
VDAIKEYVGVDFWRQMSDEEARELAKEHGVEVAPHMTFGHIVNEFFEQKVEDKLIQPTFIYGHPVEISPLAKKNPDDPRF
TDRFELFIVGREHANAFTELNDPIDQRQRFEEQLKEREQGNDEAHEMDEDFLEALEYGMPPTGGLGIGVDRLVMLLTNSP
SIRDVLLFPQMRHK
>P67610 6.1.1.6~~~lysS~~~Lysine--tRNA ligase~~~
MSEEMNDQMLVRRQKLQELYDLGIDPFGSKFDRSGLSSDLKEEWDQYSKEELVEKEADSHVAIAGRLMTKRGKGKAGFAH
VQDLAGQIQIYVRKDQVGDDEFDLWKNADLGDIVGVEGVMFKTNTGELSVKAKKFTLLTKSLRPLPDKFHGLQDIEQRYR
QRYLDLITNEDSTRTFINRSKIIQEMRNYLNNKGFLEVETPMMHQIAGGAAARPFVTHHNALDATLYMRIAIELHLKRLI
VGGLEKVYEIGRVFRNEGVSTRHNPEFTMIELYEAYADYHDIMDLTESMVRHIANEVLGSAKVQYNGETIDLESAWTRLH
IVDAVKEATGVDFYEVKSDEEAKALAKEHGIEIKDTMKYGHILNEFFEQKVEETLIQPTFIYGHPTEISPLAKKNPEDPR
FTDRFELFIVGREHANAFTELNDPIDQKGRFEAQLVEKAQGNDEAHEMDEDYIEALEYGMPPTGGLGIGIDRLVMLLTDS
PSIRDVLLFPYMRQK
>Q53638 6.1.1.6~~~lysS~~~Lysine--tRNA ligase~~~
MSEEMNDQMLVRRQKLQELYDLGIDPFGSKFDRSGLSSDLKEEWDQYSKEELVEKEADSHVAIAGRLMTKRGKGKAGFAH
VQDLAGQIQIYVRKDQVGDDEFDLWKNADLGDIVGVEGVMFKTNTGELSVKAKKFTLLTKSLRPLPDKFHGLQDIEQRYR
QRYLDLITNEDSTRTFINRSKIIQEMRNYLNNKGFLEVETPMMHQIAGGAAARPFVTHHNALDATLYMRIAIELHLKRLI
VGGLEKVYEIGRVFRNEGVSTRHNPEFTMIELYEAYADYHDIMDLTESMVRHIANEVLGSAKVQYNGETIDLESAWTRLH
IVDAVKEATGVDFYEVKSDEERKALAKEHGIEIKDTMKYGHILNEFFEQKVEETLIQPTFIYGHPTEISPLAKKNPEDPR
FTDRFELFIVGREHANRFTELNDPIDQKGRFEAQLVEKAQGNDEAHEMDEDYIEALEYGMPPTGGLGIGIDRLVMLLTDS
PSIRDVLLFPYMRQK
>Q97RS9 6.1.1.6~~~lysS~~~Lysine--tRNA ligase~~~COG1190
MSTEHMEELNDQQIVRREKMAALREQGIDPFGKRFERTANSQELKDKYANLDKEQLHDKNETATIAGRLITKRGKGKVGF
AHLQDREGQIQIYVRKDAVGEENYEIFKKADLGDFLGVEGEVMRTDMGELSIKATHITHLSKALRPLPEKFHGLTDVETI
YRKRYLDLISNRESFERFVTRSKIISEIRRYLDQKGFLEVETPVLHNEAGGAAARPFITHHNAQNIDMVLRIATELHLKR
LIVGGMERVYEIGRIFRNEGMDATHNPEFTSIEVYQAYADFQDIMDLTEGIIQHAAKSVKGDGPVNYQGTEIKINEPFKR
VHMVDAIREITGVDFWQDMTLEEAKAIAAEKKVPVEKHYTEVGHIINAFFEEFVEETLIQPTFVYGHPVAVSPLAKKNPE
DQRFTDRFELFIMTKEYGNAFTELNDPIDQLSRFEAQAKAKELGDDEATGIDYDYIEALEYGMPPTGGLGIGIDRLCMLL
TDTTTIRDVLLFPTMK
>P41255 6.1.1.6~~~lysS~~~Lysine--tRNA ligase~~~
MNDQTRQRLLNLEALVEAGFAPYPYRFPKTHSAEAILKAKRGAPPESEWPEEEVAVAGRLVALRRMGKVTFAHLLDETGR
IQLYFQRDLTPKYELLKKLDVGDILGVRGHPFTTKTGEVTVKVLDWTPLVKSLHPLPDKWHGLRDKEVRYRQRYLDLIVN
PEVREVFRRRSEIVRYIRRFFEAKGFLEVETPILQPTTGGAEARPFKTYHNALDHEFYLRISLELYLKRLLVGGYEKVFE
IGRNFRNEGIDHNHNPEFTMLEAYWAYADYQDMAGLVEELLSGLVLHLFGSHEVPYQGRVLNFKPPFRRISFVEALKEKA
GLPFDPLDLERLRLWADAHHPELSQVPNYKLLDKLFGIYVEPELQDPTFVFDFPLAISPLAKRHREKPGLVERWDLYAGG
MELAPCYSELNDPLDQRERFLEQARRRKEGDEEAPEPDEDFLLALEYGMPPAAGLGLGIDRLAMLLTDQPSLRDVLLFPL
LKPKKEAVEEGV
>O66680 6.1.1.4~~~leuS~~~Leucine--tRNA ligase subunit alpha~~~COG0495
MMKEFNPREIEKKWQKRWEEAGVFKAQEGKPNKFYVLEMFPYPSGRIHMGHVRNYTIGDAIARYLKMRGKNILHPMGWDA
FGLPAENAAIKHGIHPAKWTYENIDYMKKQLKILGFSYDWDREIATCDPEYYKWNQWIFLKMLERGIAYRKTAKVNWCPH
DQTVLANEQVIEGKCWRCGTPIVQKEVPSWFLRITAYADRLLEDLKKLEGKWPERVIAQQRNWIGRSEGALIRFYVEIEE
PEKFLNCVPEELKETLLKEKRIYIDVFTTRPDTVFGATFVVLAPEHPLVPVLACIGERLGNACYSDVENFVEKMKKMSTR
ERTMEEDKEGVFLGVYATNPANGEKIPVWSANYVLYEYGTGAIMCVPAHDQRDWEFAKKYDLPIKVVVKPEGAWDFEKGA
YEGKGTLVNSDGFDGLDSETAKRKITEWLQDRGLGEKKVSYRLRDWNISRQRYWGTPIPVVYCEKCGMVPVPEDQLPVKL
PLDVKFTGQGNPLETSEEFVNTTCPKCGGKARRETDTMDTFFDSSWYFLRFCDPKNDREPFSREKVDYWMPVDVYIGGIE
HAVLHLLYARFFQKFLKDLGLVRDDEPFEKLITQGMVLKKWVSVKKLLDYLGLSEEDEVEELKKRLEELGARRA
>P07813 6.1.1.4~~~leuS~~~Leucine--tRNA ligase~~~COG0495
MQEQYRPEEIESKVQLHWDEKRTFEVTEDESKEKYYCLSMLPYPSGRLHMGHVRNYTIGDVIARYQRMLGKNVLQPIGWD
AFGLPAEGAAVKNNTAPAPWTYDNIAYMKNQLKMLGFGYDWSRELATCTPEYYRWEQKFFTELYKKGLVYKKTSAVNWCP
NDQTVLANEQVIDGCCWRCDTKVERKEIPQWFIKITAYADELLNDLDKLDHWPDTVKTMQRNWIGRSEGVEITFNVNDYD
NTLTVYTTRPDTFMGCTYLAVAAGHPLAQKAAENNPELAAFIDECRNTKVAEAEMATMEKKGVDTGFKAVHPLTGEEIPV
WAANFVLMEYGTGAVMAVPGHDQRDYEFASKYGLNIKPVILAADGSEPDLSQQALTEKGVLFNSGEFNGLDHEAAFNAIA
DKLTAMGVGERKVNYRLRDWGVSRQRYWGAPIPMVTLEDGTVMPTPDDQLPVILPEDVVMDGITSPIKADPEWAKTTVNG
MPALRETDTFDTFMESSWYYARYTCPQYKEGMLDSEAANYWLPVDIYIGGIEHAIMHLLYFRFFHKLMRDAGMVNSDEPA
KQLLCQGMVLADAFYYVGENGERNWVSPVDAIVERDEKGRIVKAKDAAGHELVYTGMSKMSKSKNNGIDPQVMVERYGAD
TVRLFMMFASPADMTLEWQESGVEGANRFLKRVWKLVYEHTAKGDVAALNVDALTENQKALRRDVHKTIAKVTDDIGRRQ
TFNTAIAAIMELMNKLAKAPTDGEQDRALMQEALLAVVRMLNPFTPHICFTLWQELKGEGDIDNAPWPVADEKAMVEDST
LVVVQVNGKVRAKITVPVDATEEQVRERAGQEHLVAKYLDGVTVRKVIYVPGKLLNLVVG
>A0R7H5 6.1.1.4~~~leuS~~~Leucine--tRNA ligase~~~COG0495
MTEPATTPTTPDEQIPRHRYNADLAGQIERAWQETWSDRGTFNVANPVGSLAPTDGSDVPADKMFVQDMFPYPSGDGLHV
GHPLGYIATDVYARYYRMLGRNVLHALGFDAFGLPAEQYAVQTGTHPRTRTEANIVNFRRQLGRLGLGHDTRRSFSTTDV
DYYKWTQWIFLQIYNAWFDRDQNKARRISELVEEFESGKRTLDDGRNWADLSKGERADVIDGYRLVYRADSMVNWCPGLG
TVLANEEVTSEGRSDRGNFPVFRKRLRQWMMRITAYSDRLLEDLDVLDWPEKVKTMQRNWIGRSTGASVLFATAADDIEV
FTTRPDTLFGATYLVLAPEHDLVDTLVTDAWPDGTDERWTYGAATPREAVAAYRTDIAAKSDLERQENKTKTGVFLGAYA
TNPADGKQVPIFIADYVLAGYGTGAIMAVPGGDQRDWDFAKEFGLPIIEVVTGGDISEAAYAGDGTMVNSGFLDGMDVAS
AKEAIIARLEADGRGKRRVEYKLRDWLFARQRYWGEPFPIVYDADGRAHPLPESALPVELPDVPDYSPVLFDPDDADSEP
SPPLNKATEWVHVELDLGDGLQSYTRDTNVMPQWAGSSWYELRYTDPHNPDEMCAKENEAYWMGPRPDEHGPEDPGGVDL
YVGGVEHAVLHLLYSRFWHKVLYDLGYVSSREPYRRLVNQGYIQAFAYTDSRGTYVPAAEVIERDGKFFWPGPDGEIEVN
QEFGKIGKSLKNSVSPDEICDNYGADTLRVYEMSMGPLEASRPWATKDVVGAHRFLQRVWRVVIDETSGNVRVVEHEALS
DETLRLLHRTIEGVREDYAALRNNTAAAKLIEYTNHLTKEGVAARAAIEPLVLMVAPLAPHLAEELWKRLGHDTSLAHGP
FPEADPQYLVEDTIEFPVQVNGKVRGKIVVAADADKAALEAAALADEKVQAFLAGATPKKVIVVPGRLVNLVV
>P9WFV1 6.1.1.4~~~leuS~~~Leucine--tRNA ligase~~~COG0495
MTESPTAGPGGVPRADDADSDVPRYRYTAELAARLERTWQENWARLGTFNVPNPVGSLAPPDGAAVPDDKLFVQDMFPYP
SGEGLHVGHPLGYIATDVYARYFRMVGRNVLHALGFDAFGLPAEQYAVQTGTHPRTRTEANVVNFRRQLGRLGFGHDSRR
SFSTTDVDFYRWTQWIFLQIYNAWFDTTANKARPISELVAEFESGARCLDGGRDWAKLTAGERADVIDEYRLVYRADSLV
NWCPGLGTVLANEEVTADGRSDRGNFPVFRKRLRQWMMRITAYADRLLDDLDVLDWPEQVKTMQRNWIGRSTGAVALFSA
RAASDDGFEVDIEVFTTRPDTLFGATYLVLAPEHDLVDELVAASWPAGVNPLWTYGGGTPGEAIAAYRRAIAAKSDLERQ
ESREKTGVFLGSYAINPANGEPVPIFIADYVLAGYGTGAIMAVPGHDQRDWDFARAFGLPIVEVIAGGNISESAYTGDGI
LVNSDYLNGMSVPAAKRAIVDRLESAGRGRARIEFKLRDWLFARQRYWGEPFPIVYDSDGRPHALDEAALPVELPDVPDY
SPVLFDPDDADSEPSPPLAKATEWVHVDLDLGDGLKPYSRDTNVMPQWAGSSWYELRYTDPHNSERFCAKENEAYWMGPR
PAEHGPDDPGGVDLYVGGAEHAVLHLLYSRFWHKVLYDLGHVSSREPYRRLVNQGYIQAYAYTDARGSYVPAEQVIERGD
RFVYPGPDGEVEVFQEFGKIGKSLKNSVSPDEICDAYGADTLRVYEMSMGPLEASRPWATKDVVGAYRFLQRVWRLVVDE
HTGETRVADGVELDIDTLRALHRTIVGVSEDFAALRNNTATAKLIEYTNHLTKKHRDAVPRAAVEPLVQMLAPLAPHIAE
ELWLRLGNTTSLAHGPFPKADAAYLVDETVEYPVQVNGKVRGRVVVAADTDEETLKAAVLTDEKVQAFLAGATPRKVIVV
AGRLVNLVI
>Q5FAJ3 6.1.1.4~~~leuS~~~Leucine--tRNA ligase~~~
MQEHYQPAAIEPAAQKKWDDARISNVSEDASKPKYYCLSMFPYPSGKLHMGHVRNYTIGDVLSRFKLLNGFNVMQPMGWD
AFGMPAENAAMKNNVAPAAWTYDNIEYMKTQLKSLGFAIDWEREVATCKPEYYRWEQWLFTKLFEKGIVYRKNGTVNWDP
VDQTVLANEQVIDGRGWRSGALIEKREIPMYYFKITDYAEELLNDLDKLEHWPEQVKTMQRNWIGKSRGMTVRFAVSDDS
KQGLEGDYAKFLQVYTTRPDTLMGATYVAVAAEHPLATAAAADKPELQAFIAECKAGSVAEADMATMEKKGVPTGRYVVN
PLNGDKLEVWIANYVLWGYGDGAVMAVPAHDERDFEFAAKYNLPKKQVIAVGDNAFDANRWQEWYGDKENGVLVNSGDLD
GLDFQTAFDAVAAKLQSQGAGEPKTQYRLRDWGISRQRYWGCPIPIVHCEKCGDVPVPADQLPVVLPENVVPDGMGSPLA
KMPEFYETSCPCCGGAAKRETDTMDTFMESSWYFFRYMSPKFSDGMVSAESAKYWGAVDQYIGGIEHAILHLLYARFFTK
LMRDEGLVNVDEPFERLLTQGMVVCETYYRENDKGGKDWINPADVELTFDDKGRPVSAVLKADGLPVVISGTEKMSKSKN
NGVDPQELINAYGADTARLFMMFAAPPEQSLEWSDSGVEGAHRFLRRLWRTVYEYLKQGGAVKAFAGNQDGLSKELKDLR
HKLHSTTAKVSDDYGRRQQFNTAIAAVMELLNQYDKTDTGSEQGRAVAQEVLEAAVRLLWPIVPHICETLWSELNGAKLW
EAGWPTVDEAALVKSEIEVMVQVNGKLRGKITVAADASKADLEAAALANEGAVKFMEGKPAKKIIVVPGRLVNIVV
>B4RNT1 6.1.1.4~~~leuS~~~Leucine--tRNA ligase~~~
MQEHYQPAAIEPAAQKKWDDARISNVSEDASKPKYYCLSMFPYPSGKLHMGHVRNYTIGDVLSRFKLLNGFNVMQPMGWD
AFGMPAENAAMKNNVAPAAWTYDNIEYMKTQLKSLGFAVDWEREVATCKPEYYRWEQWLFTKLFEKGIVYRKNGTVNWDP
VDQTVLANEQVIDGRGWRSGALIEKREIPMYYFKITDYAEELLNDLDKLEHWPEQVKTMQRNWIGKSRGMTVRFAVSDDS
KQGLEGDYAKFLQVYTTRPDTLMGATYVAVAAEHPLATAAAADKPELQAFIAECKAGSVAEADMATMEKKGVPTGRYVVN
PLNGDKLEVWIANYVLWGYGDGAVMAVPAHDERDFEFAAKYNLPKKQVIAVGDNAFDANRWQEWYGDKENGVLVNSGDLD
GLDFQTAFDAVAAKLQSQGAGEPKTQYRLRDWGISRQRYWGCPIPIVHCEKCGDVPVPADQLPVVLPENVVPDGMGSPLA
KMPEFYETSCPCCGGAAKRETDTMDTFMESSWYFFRYMSPKFSDGMVSAESAKYWGAVDQYIGGIEHAILHLLYARFFTK
LMRDEGLVNVDEPFERLLTQGMVVCETYYRENDKGGKDWINPADVELTFDDKGRPVSAVLKADGLPVVISGTEKMSKSKN
NGVDPQELINAYGADTARLFMMFAAPPEQSLEWSDSGVEGAHRFLRRLWRTVYEYLKQGGAVKAFAGNQDGLSKELKDLR
HKLHSTTAKVSDDYGRRQQFNTAIAAVMELLNQYDKTDTGSEQGRAVAQEVLEAAVRLLWPIVPHICETLWSELNGAKLW
EAGWPTVDEAALVKSEIEVMVQVNGKLRGKITVAADASKADLEAAALANEGAVKFMEGKPAKKIIVVPGRLVNIVV
>Q9JXT2 6.1.1.4~~~leuS~~~Leucine--tRNA ligase~~~
MQEQYRPAAIEPAAQKKWDDARIFNVSEDASKPKYYCLSMFPYPSGKLHMGHVRNYTIGDVLSRFKLLNGFNVMQPMGWD
AFGMPAENAAMKNNVAPAAWTYDNIEYMKTQLKSLGFAIDWARETATCKPEYYRWEQWLFTKLFEKGIVYRKNGTVNWDP
VDQTVLANEQVIDGRGWRSGALIEKREIPMYYFKITDYAEELLNDLDKLEHWPEQVKTMQRNWIGKSRGMTVRFAVSDDS
KQGLEGDYAKFLQVYTTRPDTLMGATYVAVAAEHPLAAAAAADKPELQAFIAECKAGSVAEADMATMEKKGVPTGRYVVN
PLNGDKLEVWIANYVLWGYGDGAVMAVPAHDERDFEFATKYNLPKKQVIAVGDNAFDENQWQEWYGDKENGVLVNSGDLD
GLDFQTAFDAVAAKLQSQGAGEPKTQYRLRDWGISRQRYWGCPIPIVHCEQCGDVPVPADQLPVVLPENVVPDGMGSPLA
KMPEFYETACPCCGGAAKRETDTMDTFMESSWYFFRYMSPKFSDGMVDPAAAKYWGAVDQYIGGIEHAILHLLYARFFTK
LMRDEGLVNVDEPFERLLTQGMVVCETYYRENDKGGKDWINPADVELTFDDKGRPISAVLKADGLPVVISGTEKMSKSKN
NGVDPQELINAYGADTARLFMMFAAPPEQSLEWSDSGVEGAHRFLRRLWRTVYEYLKQGEAVKAFAGSQDGLSKELKDLR
HKLHATTAKVSDDYGRRQQFNTAIAAVMELLNQYDKTDTGGEQGRAVAQEVLETAVRLLWPIVPHICETLWSELNGAKLW
EAGWPTVDEAALVKSEIEVMVQVNGKLRGKITVAADASKADLEAAALATEGAVKFMEGKPAKKIIVVPGRLVNIVV
>P67513 6.1.1.4~~~leuS~~~Leucine--tRNA ligase~~~
MNYNHNQIEKKWQDYWDENKTFKTNDNLGQKKFYALDMFPYPSGAGLHVGHPEGYTATDIISRYKRMQGYNVLHPMGWDA
FGLPAEQYALDTGNDPREFTKKNIQTFKRQIKELGFSYDWDREVNTTDPEYYKWTQWIFIQLYNKGLAYVDEVAVNWCPA
LGTVLSNEEVIDGVSERGGHPVYRKPMKQWVLKITEYADQLLADLDDLDWPESLKDMQRNWIGRSEGAKVSFDVDNTEGK
VEVFTTRPDTIYGASFLVLSPEHALVNSITTDEYKEKVKAYQTEASKKSDLERTDLAKDKSGVFTGAYAINPLSGEKVQI
WIADYVLSTYGTGAIMAVPAHDDRDYEFAKKFDLPIIEVIEGGNVEEAAYTGEGKHINSGELDGLENEAAITKAIQLLEQ
KGAGEKKVNYKLRDWLFSRQRYWGEPIPVIHWEDGTMTTVPEEELPLLLPETDEIKPSGTGESPLANIDSFVNVVDEKTG
MKGRRETNTMPQWAGSCWYYLRYIDPKNENMLADPEKLKHWLPVDLYIGGVEHAVLHLLYARFWHKVLYDLGIVPTKEPF
QKLFNQGMILGEGNEKMSKSKGNVINPDDIVQSHGADTLRLYEMFMGPLDAAIAWSEKGLDGSRRFLDRVWRLMVNEDGT
LSSKIVTTNNKSLDKVYNQTVKKVTEDFETLGFNTAISQLMVFINECYKVDEVYKPYIEGFVKMLAPIAPHIGEELWSKL
GHEESITYQPWPTYDEALLVDDEVEIVVQVNGKLRAKIKIAKDTSKEEMQEIALSNDNVKASIEGKDIMKVIAVPQKLVN
IVAK
>B8ZKS5 6.1.1.4~~~leuS~~~Leucine--tRNA ligase~~~
MSFYNHKEIEPKWQGYWAEHHTFKTGTDASKPKFYALDMFPYPSGAGLHVGHPEGYTATDILSRYKRAQGYNVLHPMGWD
AFGLPAEQYAMDTGNDPAEFTAENIANFKRQINALGFSYDWDREVNTTDPNYYKWTQWIFTKLYEKGLAYEAEVPVNWVE
ELGTAIANEEVLPDGTSERGGYPVVRKPMRQWMLKITAYAERLLNDLDELDWSESIKDMQRNWIGKSTGANVTFKVKGTD
KEFTVFTTRPDTLFGATFTVLAPEHELVDAITSSEQAEAVADYKHQASLKSDLVRTDLAKEKTGVWTGAYAINPVNGKEM
PIWIADYVLASYGTGAVMAVPAHDQRDWEFAKQFDLPIVEVLEGGNVEEAAYTEDGLHVNSDFLDGLNKEDAIAKIVACL
EEKGCGQEKVTYRLRDWLFSRQRYWGEPIPIIHWEDGTSTAVPETELPLVLPVTKDIRPSGTGESPLANLTDWLEVTRED
GVKGRRETNTMPQWAGSSWYYLRYIDPHNTEKLADEDLLKQWLPVDIYVGGAEHAVLHLLYARFWHKFLYDLGVVPTKEP
FQKLFNQGMILGTSYRDHRGALVATDKVEKRDGSFFHIETGEELEQAPAKMSKSLKNVVNPDDVVEQYGADTLRVYEMFM
GPLDASIAWSEEGLEGSRKFLDRVYRLITSKEILAENNGALDKAYNETVKAVTEQIESLKFNTAIAQLMVFVNAANKEDK
LYVDYAKGFIQLIAPFAPHLAEELWQTVAETGESISYVAWPTWDESKLVEDEIEIVVQIKGKVRAKLMVAKDLSREELQE
IALADEKVKAEIDGKEIVKVISVPNKLVNIVVK
>P39394 3.1.-.-~~~symE~~~Toxic protein SymE~~~
MTDTHSIAQPFEAEVSPANNRHVTVGYASRYPDYSRIPAITLKGQWLEAAGFATGTAVDVKVMEGCIVLTAQPPAAEESE
LMQSLRQVCKLSARKQKQVQAFIGVIAGKQKVA
>O67298 6.1.1.10~~~metG~~~Methionine--tRNA ligase~~~COG0143
MTLMKKFYVTTPIYYVNDVPHLGHAYTTIAADTIARYYRLRDYDVFFLTGTDEHGLKIQKKAEELGISPKELVDRNAERF
KKLWEFLKIEYTKFIRTTDPYHVKFVQKVFEECYKRGDIYLGEYEGWYCVGCEEFKSEAELAEDHTCPIHQKKCEYIKEP
SYFFRLSKYQDKLLELYEKNPEFIQPDYRRNEIISFVKQGLKDLSVTRPRSRVKWGIPVPFDPEHTIYVWFDALFNYISA
LEDKVEIYWPADLHLVGKDILRFHTVYWPAFLMSLGYELPKKVFAHGWWTVEGKKMSKTLGNVVDPYEVVQEYGLDEVRY
FLLREVPFGQDGDFSKKAILNRINGELANEIGNLYSRVVNMAHKFLGGEVSGARDEEYAKIAQESIKNYENYMEKVNFYK
AIEEILKFTSYLNKYVDEKQPWALNKERKKEELQKVLYALVDGLFVLTHLLYPITPNKMKEALQMLGEKEFLKELKPYSK
NTYKLGERKILFPKREG
>P59078 6.1.1.10~~~metG~~~Methionine--tRNA ligase~~~
MSREKYYITTAIAYPNGKPHIGHAYELIATDAMARFQRLNGMDVYFLTGTDEHGIKMLQSARKEGITPRELADRNTSAFR
RMAEVLNSSNDDYIRTSEERHYKASQAIWQAMVANGDIYKGGYAGWYSVRDEAYYGEEETEVRADGVRYGPQGTPVEWVE
EESYFFRLSAYQDKLLDLYENNPGFIMPAERRNEIVSFVKSGLKDLSISRTTFDWGIPVPGDEKHVMYVWVDALTNYITA
LGYPDTTDERWAYWPANAHIIGKDISRFHAVYWPAFLMSAQLPLPKRVFAHGFLFNRGEKMSKSVGNVIDPFELVERYGL
DQLRYFLMREVPFGQDGSYSHEAIVNRTNADLANDLGNLAQRSLSMIAKNCEGKVPQPGAFSEADKAILDQADAALETAR
KAMDDQALHLALGAIFAVVAEANRYFAGQEPWALRKTDPARMGTVLYVTAEVLRRVGIMVQPFIPQSAEKLLDILAVPAD
KRQFADVLASPLAGGTDLPAPQPVFPRYVEADEQN
>P00959 6.1.1.10~~~metG~~~Methionine--tRNA ligase~~~COG0073
MTQVAKKILVTCALPYANGSIHLGHMLEHIQADVWVRYQRMRGHEVNFICADDAHGTPIMLKAQQLGITPEQMIGEMSQE
HQTDFAGFNISYDNYHSTHSEENRQLSELIYSRLKENGFIKNRTISQLYDPEKGMFLPDRFVKGTCPKCKSPDQYGDNCE
VCGATYSPTELIEPKSVVSGATPVMRDSEHFFFDLPSFSEMLQAWTRSGALQEQVANKMQEWFESGLQQWDISRDAPYFG
FEIPNAPGKYFYVWLDAPIGYMGSFKNLCDKRGDSVSFDEYWKKDSTAELYHFIGKDIVYFHSLFWPAMLEGSNFRKPSN
LFVHGYVTVNGAKMSKSRGTFIKASTWLNHFDADSLRYYYTAKLSSRIDDIDLNLEDFVQRVNADIVNKVVNLASRNAGF
INKRFDGVLASELADPQLYKTFTDAAEVIGEAWESREFGKAVREIMALADLANRYVDEQAPWVVAKQEGRDADLQAICSM
GINLFRVLMTYLKPVLPKLTERAEAFLNTELTWDGIQQPLLGHKVNPFKALYNRIDMRQVEALVEASKEEVKAAAAPVTG
PLADDPIQETITFDDFAKVDLRVALIENAEFVEGSDKLLRLTLDLGGEKRNVFSGIRSAYPDPQALIGRHTIMVANLAPR
KMRFGISEGMVMAAGPGGKDIFLLSPDAGAKPGHQVK
>P75091 6.1.1.10~~~metG~~~Methionine--tRNA ligase~~~
MKRCYITTPIYYASGKPHIGHAFTTILADVIKRYKQQNGYEAYFLTGTDEHGNKIESKAKSLGLDPQTFVDQNVAYFQQM
WKQLDINFDHFIRTTDLSHKAQVQHAFQLLYDKGLIYQSNWEGAYCVECEQNYFTYDKQTMLCEIGHQLTLVQEPSLFIA
FKDSKDWIGEMIATNKLNITPESRAAELKNNFLDGGLNDLALTRQNVTWGIPVPFDNKQTIYVWFDALFNYITNLGFAHN
DPKFNKWWNNNDEEHEVIHLISREITRFHCIYWPIFLHQLGFKLPTQFLSHGWIVDGNGHKMSKSLGNVISPEELLAQFG
VDGTRYCLLKEMRLDKDNRCSMAIFKDIYNADLANSFGNHASRTFGMIKKYLGGQLDFIEVQDPQVKQLMDQANQAMVQF
DTAWNNFQFYKGINGLLQLVFQASKLIDQLKPWELVKQTDYTLLKQLLFACVRCTQVCFVLLAPILVHTSTQIFDLFNFS
AQARSKTHLADPQQLQKISLAPVIQPLFKRLD
>P9WFU5 6.1.1.10~~~metG~~~Methionine--tRNA ligase~~~COG0143
MKPYYVTTAIAYPNAAPHVGHAYEYIATDAIARFKRLDRYDVRFLTGTDEHGLKVAQAAAAAGVPTAALARRNSDVFQRM
QEALNISFDRFIRTTDADHHEASKELWRRMSAAGDIYLDNYSGWYSVRDERFFVESETQLVDGTRLTVETGTPVTWTEEQ
TYFFRLSAYTDKLLAHYHANPDFIAPETRRNEVISFVSGGLDDLSISRTSFDWGVQVPEHPDHVMYVWVDALTNYLTGAG
FPDTDSELFRRYWPADLHMIGKDIIRFHAVYWPAFLMSAGIELPRRIFAHGFLHNRGEKMSKSVGNIVDPVALAEALGVD
QVRYFLLREVPFGQDGSYSDEAIVTRINTDLANELGNLAQRSLSMVAKNLDGRVPNPGEFADADAALLATADGLLERVRG
HFDAQAMHLALEAIWLMLGDANKYFSVQQPWVLRKSESEADQARFRTTLYVTCEVVRIAALLIQPVMPESAGKILDLLGQ
APNQRSFAAVGVRLTPGTALPPPTGVFPRYQPPQPPEGK
>Q9HYC7 6.1.1.10~~~metG~~~Methionine--tRNA ligase~~~
MSEPRKILVTSALPYANGSIHLGHMLEYIQTDMWVRFQKMRGNQAVYVCADDAHGSAIMLRAEREGITSEQLIDAVRAEH
MGDFADFLVDFDNYHSTHSEENRELSSAIYLKLRDAGHIDTRPVTQYFDPEKQMFLADRFIKGTCPKCGTADQYGDNCEA
CGATYAPTELKDPKSAISGATPVLKESLHYFFKLPDFEAMLKQWTRSGALQESVANKLAEWLDSGLQQWDISRDAPYFGF
EIPDAPGKYFYVWLDAPIGYMASFKNLCARRPELDFDAFWGKDSGAELYHFIGKDIVNFHALFWPAMLEGAGYRKPTALN
VHGYLTVNGQKMSKSRGTFVKARTYLDHLDPEYLRYYYASKLGRGVEDLDLNLEDFVQKVNSDLVGKVVNIASRCAGFIH
KGNAGVLVGADPAPELLAAFREAAPGIAEAYEARDFNRAMREIMALADRANAWIAEQAPWALAKQEGQQDKVQAVCGLGI
NLFRQLVIFLKPVLPKLAAAAEAFLNVAPLTWADHQTLLANHQLNPFQPLMTRIEPAKVEAMIEASKEDLAAASQPAGNG
ELVKEPIAAEIDFDAFAAVDLRIALIEKCEFVEGADKLLRLSLDIGDAKRNVFSGIKSAYPDPSALEGRLTLYVANLAPR
KMKFGVSEGMVLAAGPGGEEIYLLSPDSGAKPGQRVK
>Q5HII6 6.1.1.10~~~metG~~~Methionine--tRNA ligase~~~
MAKETFYITTPIYYPSGNLHIGHAYSTVAGDVIARYKRMQGYDVRYLTGTDEHGQKIQEKAQKAGKTEIEYLDEMIAGIK
QLWAKLEISNDDFIRTTEERHKHVVEQVFERLLKQGDIYLGEYEGWYSVPDETYYTESQLVDPQYENGKIIGGKSPDSGH
EVELVKEESYFFNISKYTDRLLEFYDQNPDFIQPPSRKNEMINNFIKPGLADLAVSRTSFNWGVHVPSNPKHVVYVWIDA
LVNYISALGYLSDDESLFNKYWPADIHLMAKEIVRFHSIIWPILLMALDLPLPKKVFAHGWILMKDGKMSKSKGNVVDPN
ILIDRYGLDATRYYLMRELPFGSDGVFTPEAFVERTNFDLANDLGNLVNRTISMVNKYFDGELPAYQGPLHELDEEMEAM
ALETVKSYTESMESLQFSVALSTVWKFISRTNKYIDETTPWVLAKDDSQKDMLGNVMAHLVENIRYAAVLLRPFLTHAPK
EIFEQLNINNPQFMEFSSLEQYGVLNESIMVTGQPKPIFPRLDSEAEIAYIKESMQPPATKEEKEEIPSKPQIDIKDFDK
VEIKAATIIDAEHVKKSDKLLKIQVDLDSEQRQIVSGIAKFYTPDDIIGKKVAVVTNLKPAKLMGQKSEGMILSAEKDGV
LTLVSLPSAIPNGAVIK
>P67579 6.1.1.10~~~metG~~~Methionine--tRNA ligase~~~
MAKETFYITTPIYYPSGNLHIGHAYSTVAGDVIARYKRMQGYDVRYLTGTDEHGQKIQEKAQKAGKTEIEYLDEMIAGIK
QLWAKLEISNDDFIRTTEERHKHVVEQVFERLLKQGDIYLGEYEGWYSVPDETYYTESQLVDPQYENGKIIGGKSPDSGH
EVELVKEESYFFNISKYTDRLLEFYDQNPDFIQPPSRKNEMINNFIKPGLADLAVSRTSFNWGVHVPSNPKHVVYVWIDA
LVNYISALGYLSDDESLFNKYWPADIHLMAKEIVRFHSIIWPILLMALDLPLPKKVFAHGWILMKDGKMSKSKGNVVDPN
ILIDRYGLDATRYYLMRELPFGSDGVFTPEAFVERTNFDLANDLGNLVNRTISMVNKYFDGELPAYQGPLHELDEEMEAM
ALETVKSYTESMESLQFSVALSTVWKFISRTNKYIDETTPWVLAKDDSQKDMLGNVMAHLVENIRYAAVLLRPFLTHAPK
EIFEQLNINNPQFMEFSSLEQYGVLTESIMVTGQPKPIFPRLDSEAEIAYIKESMQPPATEEEKEEIPSKPQIDIKDFDK
VEIKAATIIDAEHVKKSDKLLKIQVDLDSEQRQIVSGIAKFYTPDDIIGKKVAVVTNLKPAKLMGQKSEGMILSAEKDGV
LTLVSLPSAIPNGAVIK
>P23395 6.1.1.10~~~metG~~~Methionine--tRNA ligase~~~COG0073
MEKVFYVTTPIYYVNAEPHLGHAYTTVVADFLARWHRLDGYRTFFLTGTDEHGETVYRAAQAAGEDPKAFVDRVSGRFKR
AWDLLGIAYDDFIRTTEERHKKVVQLVLKKVYEAGDIYYGEYEGLYCVSCERFYTEKELVEGLCPIHGRPVERRKEGNYF
FRMEKYRPWLQEYIQENPDLIRPEGYRNEVLAMLAEPIGDLSISRPKSRVPWGIPLPWDENHVTYVWFDALLNYVSALDY
PEGEAYRTFWPHAWHLIGKDILKPHAVFWPTMLKAAGIPMYRHLNVGGFLLGPDGRKMSKTLGNVVDPFALLEKYGRDAL
RYYLLREIPYGQDTPVSEEALRTRYEADLADDLGNLVQRTRAMLFRFAEGRIPEPVAGEELAEGTGLAGRLRPLVRELKF
HVALEEAMAYVKALNRYINEKKPWELFKKEPEEARAVLYRVVEGLRIASILLTPAMPDKMAELRRALGLKEEVRLEEAER
WGLAEPRPIPEEAPVLFPKKEAKVEAKPKEEAWIGIEDFAKVELRVAEVLAAEKHPNADRLLVLRLSLGNEERTVVSGIA
KWYRPEELVGKKVVLVANLKPAKLRGIESQGMILAAQEGEALALVTVEGEVPPGAVVK
>Q8PMP0 6.1.1.10~~~metG~~~Methionine--tRNA ligase~~~COG0073
MTRTALVTTALPYANGPLHLGHLVGYIQADIWVRARRLRGDKTWFVCADDTHGTPIMLAAEKAGVTPEAFIANVQASHER
DFAAFGVTFDHYDSTNSPVNRELTEAFYAKLEAAGHISRRSVAQFYDTAKGMFLPDRYIKGICPNCGSPDQYGDNCEVCG
ATYAPTELKEPKSVISGATPELRDSEHFFFEVGHFDGFLREWLAGDVALPGVKAKLKEWLDAEGGLRAWDISRDAPYFGF
QIPGQPGKYFYVWLDAPIGYLCSFKTLCAQMGENFEAHLVAGTQTELHHFIGKDIVNFHGLFWPAVLHGTGHRAPTRLHV
NGYLTVDGAKMSKSRGTFVMARTFLDVGLEPEALRYYFAAKSSGGVDDLDLNLGDFIARVNADLVGKFVNLASRCAGFIG
KRFDGKLADALPDAAQYDRFVAALAPIREAYERNDAASAIRQTMALADEANKYIDDTKPWVIAKQDGADAQLQSVCTQGL
NLFRILVAALKPILPRTCAEAEAFLSAPMTSWEDVIGPLTAHTIQPYTALFTRIDPKLIDAMTDASKDTLAAPATPATAS
KPAPAKADAKPAAAANPQSPIATPGFIGMDDFAKLDLRIGKVLACEFVEGSDKLLRFELDAGELGTRQIFSGIRASYREP
ETLVGRSVVFIANLAPRKMRFGISEGMILSAGFDGGALALLDADSGAQPGMPVR
>P0A8M0 6.1.1.22~~~asnS~~~Asparagine--tRNA ligase~~~COG0017
MSVVPVADVLQGRVAVDSEVTVRGWVRTRRDSKAGISFLAVYDGSCFDPVQAVINNSLPNYNEDVLRLTTGCSVIVTGKV
VASPGQGQQFEIQASKVEVAGWVEDPDTYPMAAKRHSIEYLREVAHLRPRTNLIGAVARVRHTLAQALHRFFNEQGFFWV
STPLITASDTEGAGEMFRVSTLDLENLPRNDQGKVDFDKDFFGKESFLTVSGQLNGETYACALSKIYTFGPTFRAENSNT
SRHLAEFWMLEPEVAFANLNDIAGLAEAMLKYVFKAVLEERADDMKFFAERVDKDAVSRLERFIEADFAQVDYTDAVTIL
ENCGRKFENPVYWGVDLSSEHERYLAEEHFKAPVVVKNYPKDIKAFYMRLNEDGKTVAAMDVLAPGIGEIIGGSQREERL
DVLDERMLEMGLNKEDYWWYRDLRRYGTVPHSGFGLGFERLIAYVTGVQNVRDVIPFPRTPRNASF
>P67572 6.1.1.22~~~asnS~~~Asparagine--tRNA ligase~~~
MKTTIKQAKDHLNQDVTIGAWLTNKRSSGKIAFLQLRDGTGFMQGVVVKSEVDEEVFKLAKEIAQESSLYVTGTITEDNR
SDLGYEMQVKSIEVISEAHDYPITPKNHGTEFLMDHRHLWLRSKKQHAVMKIRNEVIRATYEFFNKDGFTKVDPPILTAS
APEGTSELFHTKYFDQDAFLSQSGQLYLEAAAMAHGKVFSFGPTFRAEKSKTRRHLIEFWMIEGEMAFTNHAESLEIQEQ
YVTHVVKSVLENCKLELKILERDTSKLEKVATPFPRISYDDAIEFLKAEGFDDIEWGEDFGAPHETAIANHYDLPVFITN
YPTKIKPFYMQPNPENEETVLCADLIAPEGYGEIIGGSERVDDLELLEQRVKEHGLDEEAYSYYLDLRRYGSVPHCGFGL
GLERTVAWISGVEHVRETAPFPRLLNRLYP
>Q97PR0 6.1.1.22~~~asnS~~~Asparagine--tRNA ligase~~~COG0017
MTKRVTIIDVKDYVGQEVTIGAWVANKSGKGKIAFLQLRDGTAFFQGVAFKPNFVEKFGEEVGLEKFDVIKRLSQETSVY
VTGIVKEDERSKFGYELDITDIEVIGESQDYPITPKEHGTDFLMDNRHLWLRSRKQVAVLQIRNAIIYATYEFFDKNGFM
KFDSPILSGNAAEDSTELFETDYFGTPAYLSQSGQLYLEAGAMALGRVFDFGPVFRAEKSKTRRHLTEFWMMDAEYSYLT
HDESLDLQEAYVKALLQGVLDRAPQALETLERDTELLKRYIAEPFKRITYDQAIDLLQEHENDEDADYEHLEHGDDFGSP
HETWISNHFGVPTFVMNYPAAIKAFYMKPVPGNPERVLCADLLAPEGYGEIIGGSMREEDYDALVAKMDELGMDRTEYEF
YLDLRKYGTVPHGGFGIGIERMVTFAAGTKHIREAIPFPRMLHRIKP
>P54263 6.1.1.22~~~asnS~~~Asparagine--tRNA ligase~~~COG0017
MRVFIDEIARHVDQEVELRGWLYQRRSKGKIHFLILRDGTGFLQATVVQGEVPEAVFREADHLPQETALRVWGRVREDRR
APGGFELAVRDLQVVSRPQGEYPIGPKEHGIDFLMDHRHLWLRHRRPFAVMRIRDELERAIHEFFGERGFLRFDAPILTP
SAVEGTTELFEVELFDGEKAYLSQSGQLYAEAGALAFAKVYTFGPTFRAERSKTRRHLLEFWMVEPEVAFMTHEENMALQ
EELVSFLVARVLERRSRELEMLGRDPKALEPAAEGHYPRLTYKEAVALVNRIAQEDPEVPPLPYGEDFGAPHEAALSRRF
DRPVFVERYPARIKAFYMEPDPEDPELVLNDDLLAPEGYGEIIGGSQRIHDLELLRRKIQEFGLPEEVYDWYLDLRRFGS
VPHSGFGLGLERTVAWICGLAHVREAIPFPRMYTRMRP
>Q9L4Q8 6.1.1.15~~~proS~~~Proline--tRNA ligase~~~COG0442
MAKKDQEFVKDITNMDEDFPQWYTDVITKTDLVDYSPVKGFMVIKPYGYAIWENIQAFLDRRFKETGHQNCYFPLLIPES
LLNKEKEHVEGFAPEVAWVTHGGSEKLAERLCVRPTSETIICSMYSKWLTSYRELPYLYNQWCSVVRWEKSTRPFLRTSE
FLWQEGHTLHETAEEAQAETLQMLAIYKEMAEDLLAIPVVDGRKSDRERFAGAAATYTIEALMHDGKALQSGTSHNLAQH
FTKAFDITFQGRTGELEYPHHTSWGASTRLIGGIIMVHGDNRGLVLPPRVAPTQVVIIPIAQNKEGVLDKAYEIKKELEA
KGIRVTLDDDTNYSPGWKFNQYEMKGVPLRLEIGPRDIENNVAMIARRDTLSKDSYSLDNIGDTVKNLLDTVHTDMLERA
RAHRDSKTFTFKDYEEFKRKMIETPGFAKGMWCGEEECEAKIKEDTGVTIRCIPFVQENLGETCQFCGKPAKHMVYLAKA
Y
>O66690 6.1.1.15~~~proS~~~Proline--tRNA ligase~~~COG0442
MRWSRYFLYTEKEEPKEAEAPSHRLLLKAGFIKQVSAGIYELLPPAYKVLKKVESIIRKEMDRSGAQELLLTVLNPKELW
EETGRWETYGEELFKLKDRNGREYCLGPTHEEEITDLVRRVVRSYRQLPVILYQIQVKFRDEKRPRFGLIRAREFIMKDA
YSFDTDDMSAMISYEAMKFAYQRIFNKLRLNVIMAEADVGQIGGKMSHEFIAFTDYGEAKVAYCENCGYAANAEIVPLPK
PEEEKEEEKPMEKVHTPNVHTIEELSKFLDVHPSKIMKAVLYIVNEKEPVLVLIRGDREIDENKLEKVLGTDNFRLATDE
EVQELLGTKKGFIGIFNLPENIKVLWDNSLYGVKNLVVALNEPDWHYINVNPGRDFQYGEFVDVAEVREGDPCPKCGSPL
KVRRGLELGHIFLLGTRYSEPMKAYFTDRDGKEKPIIMGCYGIGVSRILAALVEQYHDDKGIKWPTPVAPFELDIILLNT
KDEEMKNVAEKLYLEAEEKGIDVIFDDREESPGFKFADADLVGFPYRIVVGKKVKEGKVEVQSRHTGEKWDVEIEKAIDF
VKEKIEEDKK
>O51363 6.1.1.15~~~proS~~~Proline--tRNA ligase~~~
MSDFIASKEDDYSKWYLDIVQKAKLADYSPVKGCMVIMPYGYSIWSKIQSILDKKFKETGHENAYFPMLIPYSFLEREKD
HIDGFSPEFAIIKDAGGESLAEPLVLRPTSETIIWNMYSKWIKSYRDLPLKINQWANVVRWEKRTRPFLRTTEFLWQEGH
TAHATEEEALEETLLILDVYKRFIEDYLAIPVFCGKKSEKEKFAGAVSTYSIEALMQDKKALQAATSHYLGLNFAKAFDV
KFQDKDGKMRHVFASSWGVSTRLIGALIMVHSDEKGLVLPPRIAPIEIIVIPIFKKEDEINKKILDYSDCVVDALKKAEF
RVEIDKDVRSSPGFRFSSAEFKGIPIRLEVGINDVLLNSVTISRRDKDRKFKYQISLDSLISKVKVELDLMQKDLFQRAL
NFRILNTKEIFRSSKDSYETFKAYVNDYSGFVLSCWCGSLNCENIIKNETKATIRCIPDDFKARDLTGMTCIYCSSKAKY
FVLFAKSY
>Q9RUW4 6.1.1.15~~~proS~~~Proline--tRNA ligase~~~COG0442
MTKDGGKKDNQGQDKKAQQYGVTPQSVDFNDWYNEVVKKADLADNSPVAGAMVVRPYGSALWENIQRWLDDRFKASGHES
LIFPTLIPMNFIMKEADHVEGFAPELFTVNKIGTEELAEPYVMRPTSETIIGHMWSGWLNSYRDLPFLHYQWGSVFRAEL
RTKAFLRTSEFFWHEGHTAHADEAEARAEVRQQLDLYHEFCRDVLALPVVRGEKTASERFAGAVATYSIEGMMRDGKALQ
SGTSHYLGQNFSRAFDVKYQTREQKEEFAHTTSWAISSRIIGAIIMTHGDDSGLMMPPRIAPIQVVVIPVGRKDNFDQMV
QEGEKLAAELRAQGLRVKVDGRDGVTNGFKYNDWELKGVPVRIELGPRDLESGVLVVKNRHSEDKETLPRAEAVSGMSAR
LDTIHDFLMKRATDFLLANTAEVDSYDAFQREIEAGHWVRAYHCGEPACEKSIKEDTKATARNVPFDDAEFFAERGEGQC
VKCGQPSAYGKRVLFGRQY
>P16659 6.1.1.15~~~proS~~~Proline--tRNA ligase~~~COG0442
MRTSQYLLSTLKETPADAEVISHQLMLRAGMIRKLASGLYTWLPTGVRVLKKVENIVREEMNNAGAIEVSMPVVQPADLW
QESGRWEQYGPELLRFVDRGERPFVLGPTHEEVITDLIRNELSSYKQLPLNFYQIQTKFRDEVRPRFGVMRSREFLMKDA
YSFHTSQESLQETYDAMYAAYSKIFSRMGLDFRAVQADTGSIGGSASHEFQVLAQSGEDDVVFSDTSDYAANIELAEAIA
PKEPRAAATQEMTLVDTPNAKTIAELVEQFNLPIEKTVKTLLVKAVEGSSFPQVALLVRGDHELNEVKAEKLPQVASPLT
FATEEEIRAVVKAGPGSLGPVNMPIPVVIDRTVAAMSDFAAGANIDGKHYFGINWDRDVATPEVADIRNVVAGDPSPDGQ
GRLLIKRGIEVGHIFQLGTKYSEALKASVQGEDGRNQILTMGCYGIGVTRVVAAAIEQNYDERGIVWPDAIAPFQVAILP
MNMHKSFRVQELAEKLYSELRAQGIEVLLDDRKERPGVMFADMELIGIPHTIVLGDRNLDNDDIEYKYRRNGEKQLIKTG
DIVEYLVKQIKG
>Q831W7 6.1.1.15~~~proS~~~Proline--tRNA ligase~~~COG0442
MKQSKMLIPTLREVPNDAEVLSHQILLRAGYIRQVAAGIYSYLPLANRVLEKLKTIMREEFEKIDAVEMLMPALLPAELW
KESGRYETYGPNLYRLKDRNDRDYILGPTHEETFTELIRDEINSYKRLPLNLYQIQTKYRDEKRSRSGLLRGREFIMKDG
YSFHADEASLDQSYRDYEKAYSRIFERCGLEFRAIIGDGGAMGGKDSKEFMAISEIGEDTICYSTESDYAANLEMATSLY
TPKKSHETQLDLEKIATPEVGTIAEVANFFEVEPQRIIKSVLFIADEEPVMVLVRGDHDVNDVKLKNFLGADFLDEATEE
DARRVLGAGFGSIGPVNVSEDVKIYADLAVQDLANAIVGANEDGYHLTNVNPDRDFQPISYEDLRFVQEGDPSPDGNGVL
AFTKGIEIGHIFKLGTRYSDAMGATVLDENGREKSVIMGCYGIGVSRLLSAIVEQNADERGINWPTGIAPFDLHVVQMNV
KDEYQTKLSQEVEAMMTEAGYEVLVDDRNERAGVKFADADLIGCPIRITVGKKAVDGVVEVKIKRTGEMLEVRKEELEST
LSILMNTTSEVE
>P43830 6.1.1.15~~~proS~~~Proline--tRNA ligase~~~COG0442
MRTSQYLFSTLKETPNDAQVVSHQLMLRAGMIRPMASGLYNWLPTGIRVLKKVEKVVREEMNKGGAIEVLMPVVQPAELW
EESGRWDQYGPELLRFEDRGNRNFVLGPTHEEVITDLVRREVSSYKQLPLNLYQIQTKFRDEVRPRFGVMRSREFIMKDA
YSFHTTQESLQATYDVMYQVYSNIFNRLGLDFRAVQADTGSIGGSASHEFQVLASSGEDDVVFSTESDFAANIELAEAIA
IGERQAPTAEMCLVDTPNAKTIAELVEQFNLPIEKTVKTLIVKGADENQPLVALIIRGDHELNEIKAQKHPLVADPLEFA
DETEIKAKIGSGVGSLGAVNLNIPAIIDRTVALMSDFSCGANIDGKHYFNVNWERDVAIPKVFDLRNVVEGDPSPDGKGT
LQIKRGIEVGHIFQLGKKYSEAMKATVQGEDGKPLVMTMGCYGIGVTRVVASAIEQHHDERGIIWPSDEIAPFTVAIVPM
NMHKSEAVQKYAEELYRTLQSQGVDVIFDDRKERPGVMFADMELIGVPHMVVIGEKNLDNGEIEYKNRRTGEKEMISKDK
LLSVLNEKLGNL
>A0QVM0 6.1.1.15~~~proS~~~Proline--tRNA ligase~~~COG0442
MITRMSELFLRTLRDDPADAEVPSHKLLIRAGYVRAVGPGIYSWLPLGLRVLRKIENVVRSEMNAIGAQEILLPALLPRG
PYETTNRWTEYGDTLFRLQDRRNNDYLLGPTHEELFTLTVKGEYSSYKDFPVILYQIQTKYRDEARPRAGILRGREFVMK
DSYSFDVDDDGLKNAYYQHREAYQRIFARLGVRYVIVSAVSGAMGGSASEEFLAESEVGEDTFVRCVESGYAANVEAVIT
RAPEAQPTEGLPEAKVYDTPDTPTIATLVEWANSASLPQFEGRTVTAADTLKNVLLKTREPGGEWELLAVGVPGDREVDE
KRLGAALEPAEFALLDDADFAANPFLVKGYVGPKALQDNGVRYLVDPRVVHGSSWITGADAPNRHVVGLVAGRDFTPDGT
IEAAEVRDGDPSPDGAGVLTSARGIEIGHIFQLGRKYTDAFSADVLGEDGKPLRLTMGSYGIGVSRLVAVIAEQQHDQLG
LRWPSSVAPFDVHVVVANKDAGARAGAAELVADLDRLGHEVLFDDRQASPGVKFKDAELLGMPWIVVVGRGWADGVVELR
NRFTGETREIAADGAAAEISSVLAG
>P9WFT9 6.1.1.15~~~proS~~~Proline--tRNA ligase~~~COG0442
MITRMSELFLRTLRDDPADAEVASHKLLIRAGYIRPVAPGLYSWLPLGLRVLRNIERVIRDEMNAIGGQEILFPALLPRA
PYETTNRWTQYGDSVFRLKDRRGNDYLLGPTHEELFTLTVKGEYSSYKDFPLTLYQIQTKYRDEARPRAGILRAREFVMK
DSYSFDIDAAGLKAAYHAHREAYQRIFDRLQVRYVIVSAVSGAMGGSASEEFLAESPSGEDAFVRCLESGYAANVEAVVT
ARPDTLPIDGLPEAVVHDTGDTPTIASLVAWANEADLGRTVTAADTLKNVLIKVRQPGGDTELLAIGVPGDREVDDKRLG
AALEPADYALLDDDDFAKHPFLVKGYIGPKALRENNVRYLVDPRIVDGTSWITGADQPGRHVVGLVAGRDFTADGTIEAA
EVREGDPSPDGAGPLVMARGIEIGHIFQLGSKYTDAFTADVLGEDGKPVRLTMGSYGIGVSRLVAVVAEQHHDELGLRWP
STVAPFDVHLVIANKDAQARAGATALAADLDRLGVEVLLDDRQASPGVKFKDAELLGMPWIVVVGRGWADGVVELRDRFS
GQTRELVAGASLATDIAAAVTG
>Q9I502 6.1.1.15~~~proS~~~Proline--tRNA ligase~~~
MRTSQYLLSTLKETPADAVVISHQLLLRAGMIRRLASGLYTWLPMGLRVLRKVETIVREEMNAAGALEVLMPAVQPAELW
QESGRWEQYGPELLRLKDRHEREFCVGPTHEEVITDLARNELNSYKQLPINFYQIQTKFRDEIRPRFGLMRGREFIMKDA
YSFHLSQDSLQQTYDGMYQAYSKIFSRLGLDFRPVQADNGSIGGSGSHEFHVLANSGEDDIVFSDSSDYAANIEKAEAVP
RESARGSATEDMRLVDTPNTKTIAALVDGFQLPIEKTIKTLVVHGAEEGTLVALIVRGDHELNEIKAANQPLVASPLVFA
SEAEIRAAIGAGPGSLGPVNLPIACIVDRSVALMSDFAAGANIEDKHYFGVNWERDLPLPEVADLRNVVEGDPSPDGKGT
LVIKRGIEVGHIFQLGTKYSEAMKLSVLSEQGKPVNLIMGCYGIGVSRVVAAAIEQNHDERGILWPSALAPFQIALVPLK
YETESVKQATDKLYAELTAAGFEVLLDDRDKKTSPGVKFADMELIGIPHRIVISDRGLSEGVLEYKGRRDSESQNLPIGE
LMSFITEKLSR
>Q6N5P6 6.1.1.15~~~proS~~~Proline--tRNA ligase~~~COG0442
MRLSRFFLPILKENPKEAEIVSHRLMLRAGMLRQEAAGIYAWLPLGHRVLKKIEQIVREEQNRAGAIELLMPTLQLADLW
RESGRYDAYGPEMLRIADRHKRELLYGPTNEEMITEIFRAYIKSYKSLPLNLYHIQWKFRDEQRPRFGVMRGREFLMKDA
YSFDVDEAGARKSYNKMFVAYLRTFARMGLKAIPMRAETGPIGGDLSHEFIVLAETGESGVYIDRDVLNLPVPDENVDYD
GDLTPIIKQWTSVYAATEDVHEPARYESEVPEANRLNTRGIEVGQIFYFGTKYSDSMKANVTGPDGTDAPIHGGSYGVGV
SRLLGAIIEACHDDNGIIWPEAVAPFRVTILNLKQGDAATDAACDQLYRELSAKGVDVLYDDTDQRAGAKFATADLIGIP
WQIHVGPRGLAEGKVELKRRSDGARENLALADVVARLT
>Q7A5Y3 6.1.1.15~~~proS~~~Proline--tRNA ligase~~~
MKQSKVFIPTMRDVPSEAEAQSHRLLLKSGLIKQSTSGIYSYLPLATRVLNNITAIVRQEMERIDSVEILMPALQQAELW
EESGRWGAYGPELMRLQDRHGRQFALGPTHEELVTSIVRNELKSYKQLPMTLFQIQSKFRDEKRPRFGLLRGREFIMKDA
YSFHADEASLDQTYQDMYQAYSRIFERVGINARPVVADSGAIGGSHTHEFMALSAIGEDTIVYSKESDYAANIEKAEVVY
EPNHKHSTVQPLEKIETPNVKTAQELADFLGRPVDEIVKTMIFKVDGEYIMVLVRGHHEINDIKLKSYFGTDNIELATQD
EIVNLVGANPGSLGPVIDKEIKIYADNFVQDLNNLVVGANEDGYHLINVNVGRDFNVDEYGDFRFILEGEKLSDGSGVAH
FAEGIEVGQVFKLGTKYSESMNATFLDNQGKAQPLIMGCYGIGISRTLSAIVEQNHDDNGIVWPKSVTPFDLHLISINPK
KDDQRELADALYAEFNTKFDVLYDDRQERAGVKFNDADLIGLPLRIVVGKRASEGIVEVKERLTGDSEEVHIDDLMTVIT
NKYDNLK
>Q5SM28 6.1.1.15~~~proS~~~Proline--tRNA ligase~~~COG0442
MAKEKGLTPQSQDFSEWYLEVIQKAELADYGPVRGTIVVRPYGYAIWENIQQVLDRMFKETGHQNAYFPLFIPMSFLRKE
AEHVEGFSPELAVVTHAGGEELEEPLAVRPTSETVIGYMWSKWIRSWRDLPQLLNQWGNVVRWEMRTRPFLRTSEFLWQE
GHTAHATREEAEEEVRRMLSIYARLAREYAAIPVIEGLKTEKEKFAGAVYTTTIEALMKDGKALQAGTSHYLGENFARAF
DIKFQDRDLQVKYVHTTSWGLSWRFIGAIIMTHGDDRGLVLPPRLAPIQVVIVPIYKDESRERVLEAAQGLRQALLAQGL
RVHLDDRDQHTPGYKFHEWELKGVPFRVELGPKDLEGGQAVLASRLGGKETLPLAALPEALPGKLDAFHEELYRRALAFR
EDHTRKVDTYEAFKEAVQEGFALAFHCGDKACERLIQEETTATTRCVPFEAEPEEGFCVRCGRPSAYGKRVVFAKAY
>P56926 6.1.1.18~~~glnS~~~Glutamine--tRNA ligase~~~COG0008
MGAFGWEQDRGAPFSGRSPRILTRMTDAPRPTAGADAPARPPAAPLVAPNFITEIIERDLEAGKYPRVVTRFPPDPSGYA
HLGHVFASLLDFNTARQYGGQFNLRMDDTNPELARQEYVDSIADDLKWLGLDWGEHFYYASDYFDRYYAYAEQLIRQGDA
YVESVSPEELSRLRGNATTPGTPSPYRDRSVEENLDLLRRMKAGEFADGEHVLRAKIDLTAPNMKLRDPVLYRIVNKPHF
RTSDEWHIYPAYDFEHPLQDAIEGVTHSMCSLEFVDNRAIYDWLMEKLNFDPRPHQYEFGRRGLEYTITSKRKLRELVQA
GRVSGWDDPRMPTLRAQRRLGVTPEAVRAFAAQIGVSRTNRTVDIAVYENAVRDDLNHRAPRVMAVLDPVKVTLTNLDGE
KTLSLPYWPHDVVRDSPDGLVGMPGGGRVAPEEAVRDVPLTRELYIERDDFSPAPPKGFKRLTPGGTVRLRGAGIIRADD
FGTDEAGQVTHIRATLLGEDAKAAGVIHWVSAERALPAEFRLYDRLFRVPHPEGENADVEDDSAGPAEHEAEPGAGQETA
PVSQGFMRYLTPDSLRVLRGYVEPSVAGDPADTRYQFERQGYFWRDPVELERVDSREDALVFGRIITLKDTWGKQGGGTQ
QKAEGKKRPSTKGRGPDEVRGEGSSSPAKAHAPKAQPLTPEQDAEFTRLLGLGASEGDARTIARDPALLAFVGGAAPGDT
FAQVASWTVNELVAGLRAGEVKVRAADLAPLAEGVASGQLSARIAREALARAAASGDAPLTIIEREGLNAGLSAEALQQV
VAQVIAANPDKAEAYRGGKTALLGFFTGQVMRATAGKADPQALAAALKDALA
>P00962 6.1.1.18~~~glnS~~~Glutamine--tRNA ligase~~~COG0008
MSEAEARPTNFIRQIIDEDLASGKHTTVHTRFPPEPNGYLHIGHAKSICLNFGIAQDYKGQCNLRFDDTNPVKEDIEYVE
SIKNDVEWLGFHWSGNVRYSSDYFDQLHAYAIELINKGLAYVDELTPEQIREYRGTLTQPGKNSPYRDRSVEENLALFEK
MRAGGFEEGKACLRAKIDMASPFIVMRDPVLYRIKFAEHHQTGNKWCIYPMYDFTHCISDALEGITHSLCTLEFQDNRRL
YDWVLDNITIPVHPRQYEFSRLNLEYTVMSKRKLNLLVTDKHVEGWDDPRMPTISGLRRRGYTAASIREFCKRIGVTKQD
NTIEMASLESCIREDLNENAPRAMAVIDPVKLVIENYQGEGEMVTMPNHPNKPEMGSRQVPFSGEIWIDRADFREEANKQ
YKRLVLGKEVRLRNAYVIKAERVEKDAEGNITTIFCTYDADTLSKDPADGRKVKGVIHWVSAAHALPVEIRLYDRLFSVP
NPGAADDFLSVINPESLVIKQGFAEPSLKDAVAGKAFQFEREGYFCLDSRHSTAEKPVFNRTVGLRDTWAKVGE
>Q9I2U8 6.1.1.18~~~glnS~~~Glutamine--tRNA ligase~~~
MSKPETTAAPNFLRQIVQADLDAGKHAKIVTRFPPEPNGYLHIGHAKSICLNFGLAQEFAGDCHLRFDDTNPAKEDQEYI
DAIEADIKWLGFQWSGEVCYASNYFDQLHAWAVELIKAGKAFVCDLGPEEMREYRGTLTEPGRNSPYRDRSVEENLDLFA
RMKAGEFPDGARSLRAKIDMGSPNMNLRDPILYRIRHAHHHQTGDKWCIYPSYDFTHGQSDAIEGITHSICTLEFEDHRP
LYEWFLANLPVPAQPRQYEFSRLNLNYTVTSKRKLKQLVDEGHVSGWDDPRMSTLSGYRRRGYTPESIRNFCEMIGVNRA
SGVVDIGMLEFSIRDHLDATAPRAMCVLKPLKVVITNYPEGQVENLELPRHPKEDMGVRVLPFGRELFIDAGDFEEVPPA
GYKRLIPGGEVRLRGSYVIRADEAIKDADGNIVELRCSYDPDTLGKNPEGRKVKGVIHWVPAEGSVECEVRLYDRLFRSA
NPEKAEEGGSFLDNINADSLQVLAGCRAEPSLGQANPEDRFQFEREGYFVADLKDSRPGKPVFNRTVTLRDSWGQG
>Q52400 6.2.1.70~~~syrB1~~~Syringomycin synthase SyrB1~~~
MPITNTDESLSAASAPLKPGAFLHEIFSDRARQFPERTAVSDAARTLSYAQLDALSTKLAARLRDEGVTYGTRVGMYLPR
SVDLVTSLLGILKAGGTYVPVDPQYPGKRVEHIVRDSELSLIIGDAANLPKISSLRVLALDELLSAPALQPAAQDTRIDP
NNSTAYIIYTSGSTGEPKGVQVSHGNVSRLLESTQRAYGFNAQDVWSMFHSIGFDFSVWEIWGALAHGGQVAVVPYDISR
SPAALRQWLADQRITVLSQTPSAFRGLDEADRGNTAPLALRYVVLGGEALPASVLRPWVERHGDQKPALINMYGITEATV
HTTFKRVLAQDLETAAMVSLGKPLDGWRLHLLDANQAPVAAGTTGELYIEGAGVAQGYLNREALNVERFVELPGAVRAYR
TGDLMTLESNGEYRYAGRCDEQLKISGFRIEPGEIEASLQTSPSVAAAHVGVHDYGDGDLRLVAYVVPGQGVDAWTEQAR
SEVAALMAENLPGYMRPSVYVPLAELPVTHHGKIDKQQLPSPAAGTALSGAADVKGLSEQEHFVLKVWSEDLGLKNIGVN
DDFFDSGGTSLALIRSLSKLKTHYKINLDPGILADGATAKVLADHITRSLVQAH
>Q9RBY6 1.14.20.15~~~syrB2~~~L-threonyl-[L-threonyl-carrier protein] 4-chlorinase~~~
MSKKFALTAEQRASFEKNGFIGPFDAYSPEEMKETWKRTRLRLLDRSAAAYQDLDAISGGTNIANYDRHLDDDFLASHIC
RPEICDRVESILGPNVLCWRTEFFPKYPGDEGTDWHQADTFANASGKPQIIWPENEEFGGTITVWTAFTDANIANGCLQF
IPGTQNSMNYDETKRMTYEPDANNSVVKDGVRRGFFGYDYRQLQIDENWKPDEASAVPMQMKAGQFIIFWSTLMHASYPH
SGESQEMRMGFASRYVPSFVHVYPDSDHIEEYGGRISLEKYGAVQVIGDETPEYNRLVTHTTRGKKFEAV
>P55619 ~~~syrM1~~~HTH-type transcriptional regulator SyrM 1~~~COG0583
MDEPACNGTSNEWQRPQLIVAGPRAARRRQMLASLDLNTLLALEALLEHRNVTQAARHLGLSQPSVSRALIRLRGVFNDD
LLVRGSSGMVPTPHAQRLGQMLPPVLDSIRGMVDPGLDQGEWRLTARMAMPDHQAIVLLPPFLPLMRERAPNLDIVTDSL
LALRRLEQGEIDLAVGQIGEAPPGYFRRRLYNDRFACLLRNGHPALEQESIIDTFSALRHAAIASDTKDGFGRVHDDLVK
LDLQDPDPVLVSNVLTAGLAIVSTDLVLVVPRRVATRNAALLPLVIVDPPVELPPYEVALIWHERCHRDPDHRWLRQEIA
AAATATEQGQSTDAASRQ
>Q9PNC0 6.1.1.19~~~argS~~~Arginine--tRNA ligase~~~COG0018
MKSIIFNEIKKILECDFALENPKDKNLAHFATPLAFSLAKELKKSPMLIASDLASKFQNHDCFESVEAVNGYLNFRISKT
FLNELANQALTNPNDFTKGEKKQESFLLEYVSANPTGPLHIGHARGAVFGDTLTRLARHLGYKFNTEYYVNDAGNQIYLL
GLSILLSVKESILHENVEYPEQYYKGEYIVDLAKEAFEKFGKEFFSEENIPSLADWAKDKMLVLIKQNLEQAKIKIDSYV
SERSYYDALNATLESLKEHKGIYEQEGKIWLASSQKGDEKDRVIIREDGRGTYLAADIVYHKDKMSRGYGKCINIWGADH
HGYIPRMKAAMEFLGFDSNNLEIILAQMVSLLKDGEPYKMSKRAGNFILMSDVVDEIGSDALRYIFLSKKCDTHLEFDIS
DLQKEDSSNPVYYINYAHARIHQVFAKAGKKIDDVMKADLQSLNQDGVNLLFEALNLKAVLNDAFEARALQKIPDYLKNL
AANFHKFYNENKVVGSANENDLLKLFSLVALSIKTAFSLMGIEAKNKMEH
>P35868 6.1.1.19~~~argS~~~Arginine--tRNA ligase~~~COG0018
MTPADLATLIKETAVEVLTSRELDTSVLPEQVVVERPRNPEHGDYATNIALQVAKKVGQNPRDLATWLAEALAADDAIDS
AEIAGPGFLNIRLAAAAQGEIVAKILAQGETFGNSDHLSHLDVNLEFVSANPTGPIHLGGTRWAAVGDSLGRVLEASGAK
VTREYYFNDHGRQIDRFALSLLAAAKGEPTPEDGYGGEYIKEIAEAIVEKHPEALALEPAATQELFRAEGVEMMFEHIKS
SLHEFGTDFDVYYHENSLFESGAVDKAVQVLKDNGNLYENEGAWWLRSTEFGDDKDRVVIKSDGDAAYIAGDIAYVADKF
SRGHNLNIYMLGADHHGYIARLKAAAAALGYKPEGVEVLIGQMVNLLRDGKAVRMSKRAGTVVTLDDLVEAIGIDAARYS
LIRSSVDSSLDIDLGLWESQSSDNPVYYVQYGHARLCSIARKAETLGVTEEGADLSLLTHDREGDLIRTLGEFPAVVKAA
ADLREPHRIARYAEELAGTFHRFYDSCHILPKVDEDTAPIHTARLALAAATRQTLANALHLVGVSAPEKM
>P11875 6.1.1.19~~~argS~~~Arginine--tRNA ligase~~~COG0018
MNIQALLSEKVRQAMIAAGAPADCEPQVRQSAKVQFGDYQANGMMAVAKKLGMAPRQLAEQVLTHLDLNGIASKVEIAGP
GFINIFLDPAFLAEHVQQALASDRLGVATPEKQTIVVDYSAPNVAKEMHVGHLRSTIIGDAAVRTLEFLGHKVIRANHVG
DWGTQFGMLIAWLEKQQQENAGEMELADLEGFYRDAKKHYDEDEEFAERARNYVVKLQSGDEYFREMWRKLVDITMTQNQ
ITYDRLNVTLTRDDVMGESLYNPMLPGIVADLKAKGLAVESEGATVVFLDEFKNKEGEPMGVIIQKKDGGYLYTTTDIAC
AKYRYETLHADRVLYYIDSRQHQHLMQAWAIVRKAGYVPESVPLEHHMFGMMLGKDGKPFKTRAGGTVKLADLLDEALER
ARRLVAEKNPDMPADELEKLANAVGIGAVKYADLSKNRTTDYIFDWDNMLAFEGNTAPYMQYAYTRVLSVFRKAEIDEEQ
LAAAPVIIREDREAQLAARLLQFEETLTVVAREGTPHVMCAYLYDLAGLFSGFYEHCPILSAENEEVRNSRLKLAQLTAK
TLKLGLDTLGIETVERM
>A6TB43 6.1.1.19~~~argS~~~Arginine--tRNA ligase~~~
MNIQALLSEKVSQALIAAGAPADCEPQVRQSAKVQFGDYQANGVMAVAKKLGMAPRQLAEQVLSHLDLNGIANKVEIAGP
GFINIFLDPAFLADNVNRALQSERLGVTKPQAQTIVVDYSAPNVAKEMHVGHLRSTIIGDASVRTLEFLGHKVIRANHVG
DWGTQFGMLIAYLEKQQQENAGEMALADLEGFYREAKKHYDEDEAFAERARSYVVKLQGGDEYFLQMWRKLVDITMSQNQ
ITYDRLNVTLTRDDVMGESLYNPMLPGIVADLKAKGLAVESEGATVVFLDEYKNKEGEPMGVIIQKKDGGYLYTTTDIAC
AKYRYETLHADRVLYYIDSRQHQHLMQAWTIVRKAGYVPDSVPLEHHMFGMMLGKDGKPFKTRAGGTVKLADLLDEALER
ARRLVAEKNPDMSADELENLAKVVGIGAVKYADLSKNRTTDYVFDWDNMLAFEGNTAPYMQYAYTRVLSVFRKAGIDENA
MIDAPVVIAEDREAQLAARLLQFEETLSVVAREGTPHVMCAYLYDLAGLFSGFYEHCPILSAESEETRNSRLKLALLTAK
TLKLGLDTLGIETVERM
>P9WFW5 6.1.1.19~~~argS~~~Arginine--tRNA ligase~~~COG0018
MTPADLAELLKATAAAVLAERGLDASALPQMVTVERPRIPEHGDYASNLAMQLAKKVGTNPRELAGWLAEALTKVDGIAS
AEVAGPGFINMRLETAAQAKVVTSVIDAGHSYGHSLLLAGRKVNLEFVSANPTGPIHIGGTRWAAVGDALGRLLTTQGAD
VVREYYFNDHGAQIDRFANSLIAAAKGEPTPQDGYAGSYITNIAEQVLQKAPDALSLPDAELRETFRAIGVDLMFDHIKQ
SLHEFGTDFDVYTHEDSMHTGGRVENAIARLRETGNIYEKDGATWLRTSAFGDDKDRVVIKSDGKPAYIAGDLAYYLDKR
QRGFDLCIYMLGADHHGYIARLKAAAAAFGDDPATVEVLIGQMVNLVRDGQPVRMSKRAGTVLTLDDLVEAIGVDAARYS
LIRSSVDTAIDIDLALWSSASNENPVYYVQYAHARLSALARNAAELALIPDTNHLELLNHDKEGTLLRTLGEFPRVLETA
ASLREPHRVCRYLEDLAGDYHRFYDSCRVLPQGDEQPTDLHTARLALCQATRQVIANGLAIIGVTAPERM
>B4RL22 6.1.1.19~~~argS~~~Arginine--tRNA ligase~~~
MNLHQTVEHEAAAAFAAAGIAGSPVVLQPTKNAEHGDFQINGVMGAAKKAKQNPRELAQKVADALAGNAVIESAEVAGPG
FINLRLRHEFLAQNIHAALNDARFGVAKTAQPQTVVIDYSSPNLAKEMHVGHLRSSIIGDSISRVLEFTGNTVIRQNHVG
DWGTQFGMLVAYLVEQQKDNAAFELADLEQFYRAAKVRFDEDPAFADTAREYVVKLQGGDETVLALWKQFVDISLSHAQA
VYDTLGLKLRPEDVAGESKYNDDLQPVADDLVQKGLAVEDDGAKVVFLDEFKNKEGEPAAFIVQKQGGGFLYASTDLACL
RYRIGRLKAGRLLYVVDHRQALHFEQLFTTSRKAGYLPENAKAEFIGFGTMMGKDGKPFKTRSGDTVKLVDLLTEAVERA
TALVKEKNPELGADEAAKIGKTVGIGAVKYADLSKNRTSDYVFDWDAMLSFEGNTAPYLQYAYTRVQSVFRKAGEWDATA
PTVLTEPLEKQLAAELLKFENVLQSVADTAYPHYLAAYLYQAATLFSRFYEACPILKAEGASRNSRLQLAKLTGNTLKQG
LDLLGIDVLDVM
>Q99W05 6.1.1.19~~~argS~~~Arginine--tRNA ligase~~~
MNIIDQVKQTLVEEIAASINKAGLADEIPDIKIEVPKDTKNGDYATNIAMVLTKIAKRNPREIAQAIVDNLDTEKAHVKQ
IDIAGPGFINFYLDNQYLTAIIPEAIEKGDQFGHVNESKGQNVLLEYVSANPTGDLHIGHARNAAVGDALANILTAAGYN
VTREYYINDAGNQITNLARSIETRFFEALGDNSYSMPEDGYNGKDIIEIGKDLAEKHPEIKDYSEEARLKEFRKLGVEYE
MAKLKNDLAEFNTHFDNWFSETSLYEKGEILEVLAKMKELGYTYEADGATWLRTTDFKDDKDRVLIKNDGTYTYFLPDIA
YHFDKVKRGNDILIDLFGADHHGYINRLKASLETFGVDSNRLEIQIMQMVRLMENGKEVKMSKRTGNAITLREIMDEVGV
DAARYFLTMRSPDSHFDFDMELAKEQSQDNPVYYAQYAHARICSILKQAKEQGIEVTAANDFTTITNEKAIELLKKVADF
EPTIESAAEHRSAHRITNYIQDLAAHFHKFYNAEKVLTDDIEKTKAHVAMIEAVRITLKNALAMVGVSAPESM
>Q54869 6.1.1.19~~~argS~~~Arginine--tRNA ligase~~~COG0018
MNTKELIASELSSIIDSLDQEAILKLLETPKNSEMGDIAFPAFSLAKVERKAPQMIAAELAEKMNSQAFEKVVATGPYVN
FFLDKSAISAQVLQAVTTEKEHYADQNIGKQENVVIDMSSPNIAKPFSIGHLRSTVIGDSLSHIFQKIGYQTVKVNHLGD
WGKQFGMLIVAYKKWGDEEAVKAHPIDELLKLYVRINAEAENDPSLDEEAREWFRKLENGDEEALALWQWFRDESLVEFN
RLYNELKVEFDSYNGEAFYNDKMDAVVDILSEKGLLLESEGAQVVNLEKYGIEHPALIKKSDGATLYITRDLAAALYRKN
EYQFAKSIYVVGQEQSAHFKQLKAVLQEMGYDWSDDITHVPFGLVTKEGKKLSTRKGNVILLEPTVAEAVSRAKVQIEAK
NPELENKDQVAHAVGIGAIKFYDLKTDRTNGYDFDLEAMVSFEGETGPYVQYAYARIQSILRKADFKPETAGNYSLNDTE
SWEIIKLIQDFPRIINRAADNFEPSIIAKFAISLAQSFNKYYAHTRILDESPERDSRLALSYATAVVLKEALRLLGVEAP
EKM
>Q5SM45 6.1.1.19~~~argS~~~Arginine--tRNA ligase~~~COG0018
MLRRALEEAIAQALKEMGVPVRLKVARAPKDKPGDYGVPLFALAKELRKPPQAIAQELKDRLPLPEFVEEAVPVGGYLNF
RLRTEALLREALRPKAPFPRRPGVVLVEHTSVNPNKELHVGHLRNIALGDAIARILAYAGREVLVLNYIDDTGRQAAETL
FALRHYGLTWDGKEKYDHFAGRAYVRLHQDPEYERLQPAIEEVLHALERGELREEVNRILLAQMATMHALNARYDLLVWE
SDIVRAGLLQKALALLEQSPHVFRPREGKYAGALVMDASPVIPGLEDPFFVLLRSNGTATYYAKDIAFQFWKMGILEGLR
FRPYENPYYPGLRTSAPEGEAYTPKAEETINVVDVRQSHPQALVRAALALAGYPALAEKAHHLAYETVLLEGRQMSGRKG
LAVSVDEVLEEATRRARAIVEEKNPDHPDKEEAARMVALGAIRFSMVKTEPKKQIDFRYQEALSFEGDTGPYVQYAHARA
HSILRKAGEWGAPDLSQATPYERALALDLLDFEEAVLEAAEERTPHVLAQYLLDLAASWNAYYNARENGQPATPVLTAPE
GLRELRLSLVQSLQRTLATGLDLLGIPAPEVM
>O66647 6.1.1.11~~~serS~~~Serine--tRNA ligase~~~COG0172
MIDINLIREKPDYVKERLATRDKELVSLVDKVLELDKRRREIIKRLEALRSERNKLSKEIGKLKREGKDTTEIQNRVKEL
KEEIDRLEEELRKVEEELKNTLLWIPNLPHPSVPVGEDEKDNVEVRRWGEPRKFDFEPKPHWEIGERLGILDFKRGAKLS
GSRFTVIAGWGARLERALINFMLDLHTKKGYKEICPPHLVKPEILIGTGQLPKFEEDLYKCERDNLYLIPTAEVPLTNLY
REEILKEENLPIYLTAYTPCYRREAGAYGKDIRGIIRQHQFDKVELVKIVHPDTSYDELEKLVKDAEEVLQLLGLPYRVV
ELCTGDLGFSAAKTYDIEVWFPSQNKYREISSCSNCEDFQARRMNTRFKDSKTGKNRFVHTLNGSGLAVGRTLAAILENY
QQEDGSVVVPEVLRDYVGTDVIRPE
>P0A8L1 6.1.1.11~~~serS~~~Serine--tRNA ligase~~~COG0172
MLDPNLLRNEPDAVAEKLARRGFKLDVDKLGALEERRKVLQVKTENLQAERNSRSKSIGQAKARGEDIEPLRLEVNKLGE
ELDAAKAELDALQAEIRDIALTIPNLPADEVPVGKDENDNVEVSRWGTPREFDFEVRDHVTLGEMHSGLDFAAAVKLTGS
RFVVMKGQIARMHRALSQFMLDLHTEQHGYSENYVPYLVNQDTLYGTGQLPKFAGDLFHTRPLEEEADTSNYALIPTAEV
PLTNLVRGEIIDEDDLPIKMTAHTPCFRSEAGSYGRDTRGLIRMHQFDKVEMVQIVRPEDSMAALEEMTGHAEKVLQLLG
LPYRKIILCTGDMGFGACKTYDLEVWIPAQNTYREISSCSNVWDFQARRMQARCRSKSDKKTRLVHTLNGSGLAVGRTLV
AVMENYQQADGRIEVPEVLRPYMNGLEYIG
>A6T6Z0 6.1.1.11~~~serS~~~Serine--tRNA ligase~~~
MLDPNLLRTEPDAVAEKLARRGFKLDVDKLRALEERRKVLQVQTENLQAERNSRSKSIGQAKARGEDIEPLRLEVNKLGE
QLDAAKSELETLLAEIRDIALAIPNIPHDDVPVGRDENDNVEVSRWGTPRQFDFEVRDHVTLGEMHGGLDFAAAVKLTGS
RFVVMKGQLARLHRALAQFMLDLHTEQHGYSENYVPYLVNQDTLYGTGQLPKFAGDLFHTRPLEEEADSSNYALIPTAEV
PLTNLVRDEIIDEDDLPIKMTAHTPCFRSEAGSYGRDTRGLIRMHQFDKVEMVQIVRPEDSMAALEEMTGHAEKVLQLLG
LPYRKVALCTGDMGFSACKTYDLEVWVPAQNTYREISSCSNVWDFQARRMQARCRSKSDKKTRLVHTLNGSGLAVGRTLV
ALMENYQQADGRIEIPEVLRPYMRGLEYIG
>A0R638 6.1.1.11~~~serS~~~Serine--tRNA ligase~~~COG0172
MIDLRLLRENPDIVRASQRARGEDPALVDALLAADTARRSAVSAADNLRAEQKAASKLVGKASPDERPALLQRAKDLAEQ
VKAAEAAQAEAEQAFTAAHMAISNVIFEGVPAGGEDDFVVLDTVGEPRAIENPKDHLELGESLGLIDMERGAKVSGSRFY
FLTGAGALLQLGLLQLATQVAVQNGFTLMIPPVLVRPEVMRGTGFLGAHADEVYRLEADDMYLVGTSEVPLAGYHADEIL
DLSAGPRRYAGWSSCFRREAGSYGKDTRGIIRVHQFDKVEGFIYCKPEDAEAEHQRLLGWQREMLAAIEVPYRVIDVAAG
DLGSSAARKYDCEAWVPTQQTYRELTSTSNCTTFQARRLSTRYRDDNGKPQIAATLNGTLATTRWLVAILENHQQPDGSV
RVPAALVPYVRTEVLEP
>P9WFT7 6.1.1.11~~~serS~~~Serine--tRNA ligase~~~COG0172
MIDLKLLRENPDAVRRSQLSRGEDPALVDALLTADAARRAVISTADSLRAEQKAASKSVGGASPEERPPLLRRAKELAEQ
VKAAEADEVEAEAAFTAAHLAISNVIVDGVPAGGEDDYAVLDVVGEPSYLENPKDHLELGESLGLIDMQRGAKVSGSRFY
FLTGRGALLQLGLLQLALKLAVDNGFVPTIPPVLVRPEVMVGTGFLGAHAEEVYRVEGDGLYLVGTSEVPLAGYHSGEIL
DLSRGPLRYAGWSSCFRREAGSHGKDTRGIIRVHQFDKVEGFVYCTPADAEHEHERLLGWQRQMLARIEVPYRVIDVAAG
DLGSSAARKFDCEAWIPTQGAYRELTSTSNCTTFQARRLATRYRDASGKPQIAATLNGTLATTRWLVAILENHQRPDGSV
RVPDALVPFVGVEVLEPVA
>Q9I0M6 6.1.1.11~~~serS~~~Serine--tRNA ligase~~~
MLDPKLVRTQPQEVAARLATRGFQLDVARIEALEEQRKSVQTRTEQLQAERNARSKAIGQAKQRGEDIAPLLADVDRMGS
ELEEGKRQLDAIQGELDAMLLGIPNLPHESVPVGADEDANVEVRRWGTPKTFDFEVKDHVALGERHGWLDFETAAKLSGA
RFALMRGPIARLHRALAQFMINLHTAEHGYEEAYTPYLVQAPALQGTGQLPKFEEDLFKIGRDGEADLYLIPTAEVSLTN
IVSGQILDAKQLPLKFVAHTPCFRSEAGASGRDTRGMIRQHQFDKVEMVQIVDPATSYEALEGLTANAERVLQLLELPYR
VLALCTGDMGFGATKTYDLEVWVPSQDKYREISSCSNCGDFQARRMQARYRNPETGKPELVHTLNGSGLAVGRTLVAVLE
NYQQADGSIRVPEVLKPYMAGIEVIG
>P95689 6.1.1.11~~~serS~~~Serine--tRNA ligase~~~COG0172
MLDIRLFRNEPDTVKSKIELRGDDPKVVDEILELDEQRRKLISATEEMKARRNKVSEEIALKKRNKENADDVIAEMRTLG
DDIKEKDSQLNEIDNKMTGILCRIPNLISDDVPQGESDEDNVEVKKWGTPREFSFEPKAHWDIVEELKMADFDRAAKVSG
ARFVYLTNEGAQLERALMNYMITKHTTQHGYTEMMVPQLVNADTMYGTGQLPKFEEDLFKVEKEGLYTIPTAEVPLTNFY
RNEIIQPGVLPEKFTGQSACFRSEAGSAGRDTRGLIRLHQFDKVEMVRFEQPEDSWNALEEMTTNAEAILEELGLPYRRV
ILCTGDIGFSASKTYDLEVWLPSYNDYKEISSCSNCTDFQARRANIRFKRDKAAKPELAHTLNGSGLAVGRTFAAIVENY
QNEDGTVTIPEALVPFMGGKTQISKPVK
>P99178 6.1.1.11~~~serS~~~Serine--tRNA ligase~~~
MLDIRLFRNEPDTVKSKIELRGDDPKVVDEILELDEQRRKLISATEEMKARRNKVSEEIALKKRNKENADDVIAEMRTLG
DDIKEKDSQLNEIDNKMTGILCRIPNLISDDVPQGESDEDNVEVKKWGTPREFSFEPKAHWDIVEELKMADFDRAAKVSG
ARFVYLTNEGAQLERALMNYMITKHTTQHGYTEMMVPQLVNADTMYGTGQLPKFEEDLFKVEKEGLYTIPTAEVPLTNFY
RNEIIQPGVLPEKFTGQSACFRSEAGSAGRDTRGLIRLHQFDKVEMVRFEQPEDSWNALEEMTTNAEAILEELGLPYRRV
ILCTGDIGFSASKTYDIEVWLPSYNDYKEISSCSNCTDFQARRANIRFKRDKAAKPELAHTLNGSGLAVGRTFAAIVENY
QNEDGTVTIPEALVPFMGGKTQISKPVK
>P34945 6.1.1.11~~~serS~~~Serine--tRNA ligase~~~COG0172
MVDLKRLRQEPEVFHRAIREKGVALDLEALLALDREVQELKKRLQEVQTERNQVAKRVPKAPPEEKEALIARGKALGEEA
KRLEEALREKEARLEALLLQVPLPPWPGAPVGGEEANREIKRVGGPPEFSFPPLDHVALMEKNGWWEPRISQVSGSRSYA
LKGDLALYELALLRFAMDFMARRGFLPMTLPSYAREKAFLGTGHFPAYRDQVWAIAETDLYLTGTAEVVLNALHSGEILP
YEALPLRYAGYAPAFRSEAGSFGKDVRGLMRVHQFHKVEQYVLTEASLEASDRAFQELLENAEEILRLLELPYRLVEVAT
GDMGPGKWRQVDIEVYLPSEGRYRETHSCSALLDWQARRANLRYRDPEGRVRYAYTLNNTALATPRILAMLLENHQLQDG
RVRVPQALIPYMGKEVLEPCG
>Q5SJX7 6.1.1.11~~~serS~~~Serine--tRNA ligase~~~COG0172
MVDLKRLRQEPEVFHRAIREKGVALDLEALLALDQEVQELKKRLQEVQTERNQVAKRVPKAPPEEKEALIARGRALGEEA
KRLEEALREKEAQLEALLLQVPLPPWPGAPVGGEEANREIKRVGSPPEFSFPPLDHVALMEKNGWWEPRISQVSGSRSYA
LKGDLALYELALLRFAMDFMARRGFLPMTLPSYAREKAFLGTGHFPAYRDQVWAIAETDLYLTGTAEVVLNALHSGEILP
YEALPLRYAGYAPAFRSEAGSFGKDVRGLMRVHQFHKVEQYVLTEASLEASDRAFQELLENAEEILRLLELPYRLVEVAT
GDMGPGKWRQVDVEVYLPSEGRYRETHSCSALLDWQARRANLRYRDPEGRVRYAYTLNNTALATPRILAMLLENHQLQDG
RVRVPKALVPYMGKEVLEPCG
>P18255 6.1.1.3~~~thrS~~~Threonine--tRNA ligase 1~~~COG0441
MSDMVKITFPDGAVKEFAKGTTTEDIAASISPGLKKKSLAGKLNGKEIDLRTPINEDGTVEIITEGSEEGLQIMRHSAAH
LLAQAIKRIYKDVKFGVGPVIENGFYYDVEMDEAITPEDLPKIEKEMKKIVNANLPIVRKEVSREEAKARFAEIGDDLKL
ELLDAIPEGETVSIYEQGEFFDLCRGVHVPSTGKIKEFKLLSLAGAYWRGDSKNQMLQRVYGTAFFKKADLEEHLRMLEE
AKERDHRKLGKELKLFANSQKVGQGLPLWLPKGATIRRVIERYIVDKEISLGYEHVYTPVLGSKELYETSGHWDHYQEGM
FPPMEMDNETLVLRPMNCPHHMMIYKQDIHSYRELPIRIAELGTMHRYEMSGALSGLQRVRGMTLNDAHIFVRPDQIKDE
FIRTVRLIQDVYEDFGLSDYTFRLSYRDPEDTEKYFDDDEMWNKAQSMLKEAMDEIGHDYYEAEGEAAFYGPKLDVQVKT
AIGKEETLSTVQLDFLLPERFDLTYIGEDGKQHRPVVIHRGVVSTMERFVAFLIEEHKGALPTWLAPVQFQVIPVSPAVH
LDYAKKVQERLQCEGLRVEVDSRDEKIGYKIREAQMQKIPYMLVVGDQEAENGAVNVRKYGEQNSETISLDEFVKKAVAE
AKK
>P18256 6.1.1.3~~~thrZ~~~Threonine--tRNA ligase 2~~~COG0441
MSKHVHIQLPDGQIQEYPKGITIKEAAGSISSSLQKKAAAGQVNGKLVDLSFKLEEDAELSIVTLDSQEGLQVLRHTTAH
VLAQAVKRLYGEVSLGVGPVILDGFYYDMKLGKSLASGDLEAIEKEMKNIINENLEIKRIEVSYEEAEELFAQKDERLKL
EILKDIPRGEDITLYQQGEFVDLCRGPHLPSTGMIKAFKLTRVSGAYWRGDSKNEVLQRVYGVAFQKKKDLDAHLHMLEE
AAKRDHRKLGKQLGLFMFSEEAPGMPFYLPKGQIVRNELERFSRELQTNAGYDEVRTPFMMNQRLWEQSGHWDHYRDNMY
FSEVDDTRFAMKPMNCPGHMLIFKNSLYSYRDLPIRMAEFGQVHRHEYSGALNGMLRVRTFCQDDAHIFVREDQIESEIK
EAIRLIDEVYRTFGFEYSVELSTRPEDSLGDDSLWEASERALARVLEELGLSYEINEGDGAFYGPKIDFHIKDALKRSHQ
CATIQLDFQMPEKFDLTYINELNEKVRPVVIHRAVFGSIDRFFGILIEHYGGAFPVWLAPIQVQIIPVSHVHLDYCRKVQ
AELKQAGIRAGIDERNEKLGYKIRESQVQKIPYVLVLGDHEEQENAVNVRRFGHQQNEHVPFQTFKDKLVKQVENRGM
>P0A8M3 6.1.1.3~~~thrS~~~Threonine--tRNA ligase~~~COG0441
MPVITLPDGSQRHYDHAVSPMDVALDIGPGLAKACIAGRVNGELVDACDLIENDAQLSIITAKDEEGLEIIRHSCAHLLG
HAIKQLWPHTKMAIGPVIDNGFYYDVDLDRTLTQEDVEALEKRMHELAEKNYDVIKKKVSWHEARETFANRGESYKVSIL
DENIAHDDKPGLYFHEEYVDMCRGPHVPNMRFCHHFKLMKTAGAYWRGDSNNKMLQRIYGTAWADKKALNAYLQRLEEAA
KRDHRKIGKQLDLYHMQEEAPGMVFWHNDGWTIFRELEVFVRSKLKEYQYQEVKGPFMMDRVLWEKTGHWDNYKDAMFTT
SSENREYCIKPMNCPGHVQIFNQGLKSYRDLPLRMAEFGSCHRNEPSGSLHGLMRVRGFTQDDAHIFCTEEQIRDEVNGC
IRLVYDMYSTFGFEKIVVKLSTRPEKRIGSDEMWDRAEADLAVALEENNIPFEYQLGEGAFYGPKIEFTLYDCLDRAWQC
GTVQLDFSLPSRLSASYVGEDNERKVPVMIHRAILGSMERFIGILTEEFAGFFPTWLAPVQVVIMNITDSQSEYVNELTQ
KLSNAGIRVKADLRNEKIGFKIREHTLRRVPYMLVCGDKEVESGKVAVRTRRGKDLGSMDVNEVIEKLQQEIRSRSLKQL
EE
>P9WFT5 6.1.1.3~~~thrS~~~Threonine--tRNA ligase~~~COG0441
MSAPAQPAPGVDGGDPSQARIRVPAGTTAATAVGEAGLPRRGTPDAIVVVRDADGNLRDLSWVPDVDTDITPVAANTDDG
RSVIRHSTAHVLAQAVQELFPQAKLGIGPPITDGFYYDFDVPEPFTPEDLAALEKRMRQIVKEGQLFDRRVYESTEQARA
ELANEPYKLELVDDKSGDAEIMEVGGDELTAYDNLNPRTRERVWGDLCRGPHIPTTKHIPAFKLTRSSAAYWRGDQKNAS
LQRIYGTAWESQEALDRHLEFIEEAQRRDHRKLGVELDLFSFPDEIGSGLAVFHPKGGIVRRELEDYSRRKHTEAGYQFV
NSPHITKAQLFHTSGHLDWYADGMFPPMHIDAEYNADGSLRKPGQDYYLKPMNCPMHCLIFRARGRSYRELPLRLFEFGT
VYRYEKSGVVHGLTRVRGLTMDDAHIFCTRDQMRDELRSLLRFVLDLLADYGLTDFYLELSTKDPEKFVGAEEVWEEATT
VLAEVGAESGLELVPDPGGAAFYGPKISVQVKDALGRTWQMSTIQLDFNFPERFGLEYTAADGTRHRPVMIHRALFGSIE
RFFGILTEHYAGAFPAWLAPVQVVGIPVADEHVAYLEEVATQLKSHGVRAEVDASDDRMAKKIVHHTNHKVPFMVLAGDR
DVAAGAVSFRFGDRTQINGVARDDAVAAIVAWIADRENAVPTAELVKVAGRE
>P67585 6.1.1.3~~~thrS~~~Threonine--tRNA ligase~~~
MEQINIQFPDGNKKAFDKGTTTEDIAQSISPGLRKKAVAGKFNGQLVDLTKPLETDGSIGIVTPGSEEALEVLRHSTAHL
MAHAIKRLYGNVKFGVGPVIEGGFYYDFDIDQNISSDDFEQIEKTMKQIVNENMKIERKVVSRDEAKELFSNDEYKLELI
DAIPEDENVTLYSQGDFTDLCRGVHVPSTAKIKEFKLLSTAGAYWRGDSNNKMLQRIYGTAFFDKKELKAHLQMLEERKE
RDHRKIGKELELFTNSQLVGAGLPLWLPNGATIRREIERYIVDKEVSMGYDHVYTPVLANVDLYKTSGHWDHYQEDMFPP
MQLDETESMVLRPMNCPHHMMIYANKPHSYRELPIRIAELGTMHRYEASGAVSGLQRVRGMTLNDSHIFVRPDQIKEEFK
RVVNMIIDVYKDFGFEDYSFRLSYRDPEDKEKYFDDDDMWNKAENMLKEAADELGLSYEEAIGEAAFYGPKLDVQVKTAM
GKEETLSTAQLDFLLPERFDLTYIGQDGEHHRPVVIHRGVVSTMERFVAFLTEETKGAFPTWLAPKQVQIIPVNVDLHYD
YARQLQDELKSQGVRVSIDDRNEKMGYKIREAQMQKIPYQIVVGDKEVENNQVNVRQYGSQDQETVEKDEFIWNLVDEIR
LKKHR
>Q8NW68 6.1.1.3~~~thrS~~~Threonine--tRNA ligase~~~
MEQINIQFPDGNKKAFDKGTTTEDIAQSISPGLRKKAVAGKFNGQLVDLTKPLETDGSIEIVTPGSEEALEVLRHSTAHL
MAHAIKRLYGNVKFGVGPVIEGGFYYDFDIDQNISSDDFEQIEKTMKQIVNENMKIERKVVSRDEAKELFSNDEYKLELI
DAIPEDENVTLYSQGDFTDLCRGVHVPSTAKIKEFKLLSTAGAYWRGDSNNKMLQRIYGTAFFDKKELKAHLQMLEERKE
RDHRKIGKELELFTNSQLVGAGLPLWLPNGATIRREIERYIVDKEVSMGYDHVYTPVLANVDLYKTSGHWDHYQEDMFPP
MQLDETESMVLRPMNCPHHMMIYANKPHSYRELPIRIAELGTMHRYEASGAVSGLQRVRGMTLNDSHIFVRPDQIKEEFK
RVVNMIIDVYKDFGFEDYSFRLSYRDPEDKEKYFDDDDMWNKAENMLKEAADELGLSYEEAIGEAAFYGPKLDVQVKTAM
GKEETLSTAQLDFLLPERFDLTYIGQDGEHHRPVVIHRGVVSTMERFVAFLTEETKGAFPTWLAPKQVQIIPVNVDLHYD
YARQLQDELKSQGVRVSIDDRNEKMGYKIREAQMQKIPYQIVVGDKEVENNQVNVRQYGSQDQETVEKDEFIWNLVDEIR
LKKHR
>B2FN79 6.1.1.3~~~thrS~~~Threonine--tRNA ligase~~~COG0441
MINITLPDGSRREFENPVSVMEVAQSIGAGLAKATIAGAVDGVLVDASDVIDHDASLRIITAKDEEGVEIIRHSCAHLVG
HAVKQLYPDVKMVIGPVIAEGFYYDIYSERPFTPDDMAAIEKRMGELIAQDYDVIKKMTPRAEVIEIFKARGEDYKLRLI
EDMSEDIQAMGMYYHQEYVDMCRGPHVPNTRFLKAFKLTRISGAYWRGDAQNEQLQRIYGTAWADKKQLEAYIKRIEEAE
MRDHRRIGKQQDLFHLQEEAPGLVFWHPKGWALWQVVEQYMRKVYRNSGYGEVRCPQILDVSLWKKSGHWDNYQDNMFFT
ESEKRTYAVKPMNCPGHIQVFNQGLHSYRDLPIRYGEFGSCHRNEPSGALHGILRVRGFTQDDGHVFCTENQIESEVTAF
HQQALAVYQHFGFDEIQIKIALRPESRLGDDATWDKAEGALRSALTACGVEWQELPGEGAFYGPKIEYHLKDAIGRTWQL
GTMQVDFMMPGRLGAEYVDENSQKKHPVMLHRAIVGSMERFLGILIEHHAGQFPAWLAPTQVVVANITDAQADYVSGVTK
TLAEQGFRVSSDLRNEKIGYKIREHTLQRVPYLLVIGDREKENGAVAVRTRSGEDLGSMSLQAFIERLHAEGA
>Q97PI4 6.1.1.3~~~thrS~~~Threonine--tRNA ligase~~~COG0441
MINITFPDGAVREFESGVTTFEIAQSISNSLAKKALAGKFNGKLIDTTRAITEDGSIEIVTPDHEDALPILRHSAAHLFA
QAARRLFPDIHLGVGPAIEDGFYYDTDNTAGQISNEDLPRIEEEMQKIVKENFPSIREEVTKDEAREIFKNDPYKLELIE
EHSEDEGGLTIYRQGEYVDLCRGPHVPSTGRIQIFHLLHVAGAYWRGNSDNAMMQRIYGTAWFDKKDLKNYLQMREEAKE
RDHRKLGKELDLFMISQEVGQGLPFWLPNGATIRRELERYIVNKELVSGYQHVYTPPLASVELYKTSGHWDHYQEDMFPT
MDMGDGEEFVLRPMNCPHHIQVFKHHVHSYRELPIRIAEIGMMHRYEKSGALTGLQRVREMSLNDGHLFVTPEQIQEEFQ
RALQLIIDVYEDFNLTDYRFRLSLRDPQDTHKYFDNDEMWENAQTMLRAALDEMGVDYFEAEGEAAFYGPKLDIQIKTAL
GKEETLSTIQLDFLLPERFDLKYIGADGEDHRPVMIHRGVISTMERFTAILIENYKGAFPTWLAPHQVTLIPVSNEKHVD
YAWEVAKKLRDRGVRADVDERNEKMQFKIRASQTSKIPYQLIVGDKEMEDETVNVRRYGQKETQTVSVDNFVQAILADIA
NKSRVEK
>P56881 6.1.1.3~~~thrS~~~Threonine--tRNA ligase~~~COG0441
MTVYLPDGKPLELPEGATAKDVARALGEGWERRAVGAIVDGELYDLLKPLPQGAKVRLLTEKDPEFQTLFRHTLAHVLAQ
AVKEFFREKGYDPESVRLGVGPVIEKGFYYDIEAPEPLSDEDLPAIEAKMREILKRDLPLRRFVLSREEALARYRGKDPY
KTELVLEIPEGEEISFYQQGDEAYGFTDLCRGPHVPSTGRIPPHFKLTHVAGAYWRGDENRPMLQRVYGVAFRTAEELKE
YLWQLEEAKKRDHRRLGRELELFLIDPLVGKGLVLWLPKGNVVREELMAFMREEQVRRGYQLVTTPHIGSLELYKTSGHY
PYYAESQFPPISFKERGEEEEYLLKPMNCPHHIRIYAYRKRSYRELPLRLAEFGTVYRYEKAGELLGLTRVRGFTQDDAH
IFCTPEEVKGEFLGVLDLVLKVFATLGLKDYRARIGVRDPKSDKYVGDEAKWALAERQIEEAAAEAGLRYTVEEGDAAFY
GPKLDFVVKDALGREWQLGTIQVDYNLPERFGLTYVGKDGEEHRPVMLHRAPFGSLERFIGILIEHFAGDFPLWLAPVQA
VVVPVSEKQEDYAREVAGRLKEAGLRAEADTRPERMQARIRDAEVQKVPYILVVGEREKAEGAVSVRRRKKGNLGTMPLA
AFLEGALREYRERRLEPVF
>Q05873 6.1.1.9~~~valS~~~Valine--tRNA ligase~~~COG0525
METNEQTMPTKYDPAAVEKDRYDFWLKGKFFEAGSDQTKEPYSVVIPPPNVTGRLHLGHAWDTTLQDIVTRMKRMQGYDV
LWLPGMDHAGIATQAKVEAKLREEGKSRYDLGREKFLEETWKWKEEYADFIRSQWAKLGLGLDYSRERFTLDEGLSKAVR
EVFVKLYEKGLIYRGEYIINWDPATKTALSDIEVIYKDVQGAFYHMSYPLADGSGSIEIATTRPETMLGDTAVAVHPEDE
RYKHLIGKTVILPIVNREIPIVGDDYVDMEFGSGAVKITPAHDPNDFELGNRHNLERILVMNEDGTMNENALQYQGMDRF
ECRKKLVKDLQEAGVLFKIEDHMHSVGHSERSGAVVEPYLSTQWFVRMQPLADAAIELQKKEEKVNFVPDRFEKTYLHWM
ENIRDWCISRQLWWGHRIPAWYHKETGELYVGLEAPEDSENWEQDTDVLDTWFSSALWPFSTMGWPDVTAEDFKRYYPTD
VLVTGYDIIFFWVSRMIFQGIEFTGERPFKDVLIHGLIRDEQGRKMSKSLGNGVDPMDVIDKYGADSLRYFLATGSSPGQ
DLRFSYEKVESTWNFANKIWNASRFALMNMDGMTYDELDLSGEKSVADKWILTRLNETIEHVTQLADRYEFGEVGRHLYN
FIWDDFCDWYIEMAKLPLYGEDEAAKKTTRSILAYVLDQTMRLLHPFMPFLTEEIWQHLPHQGESITVSQWPAVVPEHTD
TEAAADMKLLVELIRSVRNIRSEVNTPMSKQVELYIKTSTDEIASRLEANRSYVERFTNPSVLKIGTDIEAVDKAMTAVV
SGAEVILPLEGLINIDEEIARLQKEFDKLTKEVERVQKKLGNEGFMKKAPAHVIDEEREKEKDYVAKRDAVQKRMAELKG
>Q72E47 6.1.1.9~~~valS~~~Valine--tRNA ligase~~~COG0525
MAENALPKGYEPRDVEERWRRHWEDNRTFTPDMDAPGEPYSIVIPPPNVTGALHIGHALNHVLIDVLCRNARQQGKKVLW
LPGTDHAGIATQNVVERALAKEGLSRHDLGREAFIERVWQWKEEYGNRILNQIRMLGDSVDWTRERFTMDEGLSKAVRKV
FVDLYNGGYIYRGNYIINWCNRCHTALADDEVDHMPEQGHLYHVRYDFEDGSGSVVIATTRPETIMADTGVCVHPEDERY
AGLIGKKILVPVIGRAVPLFADTYVDREFGTGALKVTPCHDPNDWTLGERHGLAFIQCIDEDGNMTAEAGPYAGLTKEEC
RKRIVADLEASGQLVRVEELNHSVGHCYRCKTVVEPHMSEQWFVASTKLAPRARAAVPQMTQIFPESWMKTYFNWLDNIR
DWCISRQIWWGHRIPAWTCGKCGKLIVSEQDPTACPDCGCTDLTQDPDVLDTWFSSALWPFSTMGWPDKTKDLATFYPTS
VLVTGFDILFFWVARMMMLGMHFMDEVPFKHVYLHALVRDGEGRKMSKSTGNVIDPLAMIDKYGTDSLRFTLAAFAAMGR
DIKLSEDRIEGYRHFVNKVWNAARFSLMNLPEEAPAALDLDNVKGMHHKWILHRLEELKASQAAGIDGYRFNEVAQGLYR
FWWNEFCDWYLELIKPDMQAGGERQATAQYVLWTVLREALLLLHPFMPFVTAEVWQALPGHAGDDIATKLYPAARPGCRD
VKDAEHMELVQATISAVRTIRAELNIAPSYRLTTLVRPASAEDAATLEEGREMLMTLARLDGLTVAVDVEAPKASASSVV
AGNEVIVPLTGAVDFEAELARLDKELGKIEKDFVQVNKKLANESFVSKAPADVVAKERARAEELSDAKAKLEALQQRFRD
AIGK
>P07118 6.1.1.9~~~valS~~~Valine--tRNA ligase~~~COG0525
MEKTYNPQDIEQPLYEHWEKQGYFKPNGDESQESFCIMIPPPNVTGSLHMGHAFQQTIMDTMIRYQRMQGKNTLWQVGTD
HAGIATQMVVERKIAAEEGKTRHDYGREAFIDKIWEWKAESGGTITRQMRRLGNSVDWERERFTMDEGLSNAVKEVFVRL
YKEDLIYRGKRLVNWDPKLRTAISDLEVENRESKGSMWHIRYPLADGAKTADGKDYLVVATTRPETLLGDTGVAVNPEDP
RYKDLIGKYVILPLVNRRIPIVGDEHADMEKGTGCVKITPAHDFNDYEVGKRHALPMINILTFDGDIRESAQVFDTKGNE
SDVYSSEIPAEFQKLERFAARKAVVAAVDALGLLEEIKPHDLTVPYGDRGGVVIEPMLTDQWYVRADVLAKPAVEAVENG
DIQFVPKQYENMYFSWMRDIQDWCISRQLWWGHRIPAWYDEAGNVYVGRNEDEVRKENNLGADVVLRQDEDVLDTWFSSA
LWTFSTLGWPENTDALRQFHPTSVMVSGFDIIFFWIARMIMMTMHFIKDENGKPQVPFHTVYMTGLIRDDEGQKMSKSKG
NVIDPLDMVDGISLPELLEKRTGNMMQPQLADKIRKRTEKQFPNGIEPHGTDALRFTLAALASTGRDINWDMKRLEGYRN
FCNKLWNASRFVLMNTEGQDCGFNGGEMTLSLADRWILAEFNQTIKAYREALDSFRFDIAAGILYEFTWNQFCDWYLELT
KPVMNGGTEAELRGTRHTLVTVLEGLLRLAHPIIPFITETIWQRVKVLCGITADTIMLQPFPQYDASQVDEAALADTEWL
KQAIVAVRNIRAEMNIAPGKPLELLLRGCSADAERRVNENRGFLQTLARLESITVLPADDKGPVSVTKIIDGAELLIPMA
GLINKEDELARLAKEVAKIEGEISRIENKLANEGFVARAPEAVIAKEREKLEGYAEAKAKLIEQQAVIAAL
>P11931 6.1.1.9~~~valS~~~Valine--tRNA ligase~~~
MAQHEVSMPPKYDHRAVEAGRYEWWLKGKFFEATGDPNKRPFTIVIPPPNVTGKLHLGHAWDTTLQDIITRMKRMQGYDV
LWLPGMDHAGIATQAKVEEKLRQQGLSRYDLGREKFLEETWKWKEEYAGHIRSQWAKLGLGLDYTRERFTLDEGLSKAVR
EVFVSLYRKGLIYRGEYIINWDPVTKTALSDIEVVYKEVKGALYHMRYPLADGSGFIEVATTRPETMLGDTAVAVHPDDE
RYKHLIGKMVKLPIVGREIPIIADEYVDMEFGSGAVKITPAHDPNDFEIGNRHNLPRILVMNEDGTMNENAMQYQGLDRF
ECRKQIVRDLQEQGVLFKIEEHVHSVGHSERSGAVIEPYLSTQWFVKMKPLAEAAIKLQQTDGKVQFVPERFEKTYLHWL
ENIRHWCISRQLWWGHRIPAWYHKETGEIYVDHEPPKDIENWEQDPDVLDTWFSSALWPFSTMGWPDTDSPDYKRYYPTD
VLVTGYDIIFFWVSRMIFQGLEFTGKRPFKDVLIHGLVRDAQGRKMSKSLGNGVDPMDVIDQYGADALRYFLATGSSPGQ
DLRFSTEKVEATWNFANKIWNASRFALMNMGGMTYEELDLSGEKTVADHWILTRLNETIETVTKLAEKYEFGERGRTLYN
FIWDDLCDWYIEMAKLPLYGDDEAAKKTTRSVLAYVLDNTMRLLHPFMPFITEEIWQNLPHEGESITVAPWPQVRPELSN
EEAAEEMRMLVDIIRAVRNVRAEVNTPPSKPIALYIKTKDEHVRAALLKNRAYLERFCNPSELLIDTNVPAPDKAMTAVV
TGAELIMPLEGLINIEEEIKRLEKELDKWNKEVERVEKKLANEGFLAKAPAHVVEEERRKRQDYIEKREAVKARLAELKR
>P9WFS9 6.1.1.9~~~valS~~~Valine--tRNA ligase~~~COG0525
MTASPHPAADMLPKSWDPAAMESAIYQKWLDAGYFTADPTSTKPAYSIVLPPPNVTGSLHMGHALEHTMMDALTRRKRMQ
GYEVLWQPGTDHAGIATQSVVEQQLAVDGKTKEDLGRELFVDKVWDWKRESGGAIGGQMRRLGDGVDWSRDRFTMDEGLS
RAVRTIFKRLYDAGLIYRAERLVNWSPVLQTAISDLEVNYRDVEGELVSFRYGSLDDSQPHIVVATTRVETMLGDTAIAV
HPDDERYRHLVGTSLAHPFVDRELAIVADEHVDPEFGTGAVKVTPAHDPNDFEIGVRHQLPMPSILDTKGRIVDTGTRFD
GMDRFEARVAVRQALAAQGRVVEEKRPYLHSVGHSERSGEPIEPRLSLQWWVRVESLAKAAGDAVRNGDTVIHPASMEPR
WFSWVDDMHDWCISRQLWWGHRIPIWYGPDGEQVCVGPDETPPQGWEQDPDVLDTWFSSALWPFSTLGWPDKTAELEKFY
PTSVLVTGYDILFFWVARMMMFGTFVGDDAAITLDGRRGPQVPFTDVFLHGLIRDESGRKMSKSKGNVIDPLDWVEMFGA
DALRFTLARGASPGGDLAVSEDAVRASRNFGTKLFNATRYALLNGAAPAPLPSPNELTDADRWILGRLEEVRAEVDSAFD
GYEFSRACESLYHFAWDEFCDWYLELAKTQLAQGLTHTTAVLAAGLDTLLRLLHPVIPFLTEALWLALTGRESLVSADWP
EPSGISVDLVAAQRINDMQKLVTEVRRFRSDQGLADRQKVPARMHGVRDSDLSNQVAAVTSLAWLTEPGPDFEPSVSLEV
RLGPEMNRTVVVELDTSGTIDVAAERRRLEKELAGAQKELASTAAKLANADFLAKAPDAVIAKIRDRQRVAQQETERITT
RLAALQ
>Q9HXH0 6.1.1.9~~~valS~~~Valine--tRNA ligase~~~
MDKTYQPHAIETSWYETWESNDYFAPSGEGQPYTIMIPPPNVTGSLHMGHGFNNAIMDALIRYRRMQGRNTLWQPGTDHA
GIATQMVVERQLGAQGVSRHDLGREKFLEKVWEWKEQSGGNITRQIRRLGSSVDWSRERFTMDDGLSEAVKEAFVRLHED
GLIYRGKRLVNWDTKLHTAISDLEVENHDEKGHLWHLRYPLVNGAKTSEGLDYLVVATTRPETLLGDAAVAVHPEDERYA
KLIGQFAELPIVGRHIPIIADEYVDREFGTGCVKITPAHDFNDYEVGKRHDLPLINIFDKNAAVLAQAQVFHLDGSVNPN
LDPSLPQSYAGMDRFAARKAIVAEFEAMGLLEKVDDHALKVPKGDRSGTVIEPWLTDQWYVSTKPLAEDAIAAVEDGRIQ
FVPKQYENMYFSWMRDIQDWCISRQLWWGHRIPAWYDEAGNVYVGRDEVEVRTKHKLGNEAELRQDEDVLDTWFSSGLWT
FSTLGWPQQTEFLKTFHPTDVLVTGFDIIFFWVARMIMLTMHLVKNPDGTPQIPFKTVYVHGLVRDGQGQKMSKSKGNVL
DPLDIVDGIDLDTLLQKRTSGMMQPKLAEKIAKQTRAEFPEGIASYGTDALRFTFCSLASTGRDIKFDMGRVEGFRNFCN
KIWNAANFVIENTDGQDTGVNGEPVELSSVDRWIISQLQRTEQEVTRQLDAFRFDLAAQALYEFIWDEYCAWYLELVKPV
LWDENAPIERQRGTRRTLIRVLETALRLAHPFMPFITEEIWQRIKGQAGKEGPTLMLQPWPVADEGRIDAAAEGDIEWVK
ALMLGVRQIRGEMNISMAKRIDIILKNASPSDHRRLADNEPLLMKLAKLESIRVLEAGEEAPMSATALVGDMEVLVPMAG
LIDKSAELGRLDKEIQRLEGEVKRVGGKLSNEGFVAKAPADVIEKERAKLAEAEQALAKLAEQRQKIAAL
>Q99TJ8 6.1.1.9~~~valS~~~Valine--tRNA ligase~~~
MEMKPKYDPREVEAGRYEEWVKNGYFKPSEDKSKETYTIVIPPPNVTGKLHLGHAWDTTLQDIITRMKRMQGYDTLYLPG
MDHAGIATQAKVEAKLNEQGITRYDLGREKFLEQAWDWKEEYASFIRAQWAKLGLGLDYSRERFTLDEGLSKAVKKVFVD
LYNKGIIYRGERIINWDPKARTALSDIEVIHEDVQGAFYHFKYPYADGEGFIEIATTRPETMLGDTAIVVNPNDERYKDV
IGKTVILPIVGRELPILADEYVDIDFGSGAMKVTPAHDPNDFEIGQRHQLENIIVMDENGKMNDKAGKYEGMDRFDCRKQ
LVKDLKEQDLVIKIEDHVHSVGHSERSGAVVEPYLSTQWFVRMEDLAKRSLDNQKTDDRIDFYPQRFEHTFNQWMENIRD
WTISRQLWWGHQIPAWYHKETGEIYVGEEAPTDIENWQQDEDVLDTWFSSALWPFSTLGWPDLESEDFKRYYPTNALVTG
YDIIFFWVARMIFQGLEFTDRRPFNDVLLHGLVRAEDGRKMSKSLGNGVDPMDVIDEYGADSLRYFLATGSSPGHDLRYS
TEKVESVWNFINKIWNGARFSLMNIGEDFKVEDIDLSGNLSLADKWILTRLNETIATVTDLSDKYEFGEVGRALYNFIWD
DFCDWYIEMSKIPMNSNDEEQKQVTRSVLSYTLDNIMRMLHPFMPFVTEKIWQSLPHEGDTIVKASWPEVRESLIFEESK
QTMQQLVEIIKSVRQSRVEVNTPLSKEIPILIQAKDKEIETTLSQNKDYLIKFCNPSTLNISTDVEIPEKAMTSVVIAGK
VVLPLEGLIDMDKEISRLEKELAKLQSELDRVDKKLSNENFVSKAPEKVINEEKRKKQDYQEKYDGVKARIEQLKA
>P96142 6.1.1.9~~~valS~~~Valine--tRNA ligase~~~
MDLPKAYDPKSVEPKWAEKWAKNPFVANPKSGKPPFVIFMPPPNVTGSLHMGHALDNSLQDALIRYKRMRGFEAVWLPGT
DHAGIATQVVVERLLLKEGKTRHDLGREKFLERVWQWKEESGGTILKQLKRLGASADWSREAFTMDEKRSRAVRYAFSRY
YHEGLAYRAPRLVNWCPRCETTLSDLEVETEPTPGKLYTLRYEVEGGGFIEIATVRPETVFADQAIAVHPEDERYRHLLG
KRARIPLTEVWIPILADPAVEKDFGTGALKVTPAHDPLDYEIGERHGLKPVSVINLEGRMEGERVPEALRGLDRFEARRK
AVELFREAGHLVKEEDYTIALATCSRCGTPIEYAIFPQWWLRMRPLAEEVLKGLRRGDIAFVPERWKKVNMDWLENVKDW
NISRQLWWGHQIPAWYCEDCQAVNVPRPERYLEDPTSCEACGSPRLKRDEDVFDTWFSSALWPLSTLGWPEETEDLKAFY
PGDVLVTGYDILFLWVSRMEVSGYHFMGERPFKTVLLHGLVLDEKGQKMSKSKGNVIDPLEMVERYGADALRFALIYLAT
GGQDIRLDLRWLEMARNFANKLYNAARFVLLSREGFQAKEDTPTLADRFMRSRLSRGVEEITALYEALDLAQAAREVYEL
VWSEFCDWYLEAAKPALKAGNAHTLRTLEEVLAVLLKLLHPMMPFLTSELYQALTGKEELALEAWPEPGGRDEEAERAFE
ALKQAVTAVRALKAEAGLPPAQEVRVYLEGETAPVEENLEVFRFLSRADLLPERPAKALVKAMPRVTARMPLEGLLDVEE
WRRRQEKRLKELLALAERSQRKLASPGFREKAPKEVVEAEEARLKENLEQAERIREALSQIG
>Q9RVD6 6.1.1.2~~~trpS2~~~Tryptophan--tRNA ligase 2~~~COG0180
MPFVDLEVPTMTTPTPAATPARPRVLTGDRPTGALHLGHLAGSLQNRVRLQDEAELFVLLADVQALTDHFDRPEQVRENV
LAVALDYLAAGLDPQKTTCVVQSAVPELAELTVYFLNLVTVSHLRQNPTVKAEIAQKGYGERVPAGFFVYPVSQAADIAA
FGATLVPVGDDQLPMLEQTREIVRRFNALYAPVLAEPQAQLSRVPRLPGLDGQAKMSKSLGNAIALGDSADEVARKVMGM
YTDPGHLRASDPGRVEGNPVFTFLDAFDPDPARVQALKDQYRAGGLGDVKVKKHLIDVLNGVLAPIRTRRAEYERDPDAV
LRFVTEGTARGREVAAQTLGQVRRAMRLFGH
>P21656 6.1.1.2~~~trpS~~~Tryptophan--tRNA ligase~~~COG0180
MKQTIFSGIQPSGSVTLGNYIGAMKQFVELQHDYNSYFCIVDQHAITVPQDRLELRKNIRNLAALYLAVGLDPEKATLFI
QSEVPAHAQAGWMMQCVAYIGELERMTQFKDKSKGNEAVVSGLLTYPPLMAADILLYGTDLVPVGEDQKQHLELTRNLAE
RFNKKYNDIFTIPEVKIPKVGARIMSLNDPLKKMSKSDPNQKAYITLLDEPKQLEKKIKSAVTDSEGIVKFDKENKPGVS
NLLTIYSILGNTTIEELEAKYEGKGYGEFKGDLAEVVVNALKPIQDRYYELIESEELDRILDEGAERANRTANKMLKKME
NAMGLGRKRR
>Q9PIB4 6.1.1.2~~~trpS~~~Tryptophan--tRNA ligase~~~COG0180
MRVLTGLQPSGDLHIGNYFGAIKQMVDAQEKSQMFMFIANYHAMTSSQDGEKLKQNSLKAAAAFLSLGIDPQKSVFWLQS
DVKEVMELYWILSQFTPMGLLERAHSYKDKVAKGLSASHGLFSYPVLMAADILLFDTRIVPVGKDQIQHVEIARDIALKV
NNEWGEIFTLPEARVNEEVAVVVGTDGAKMSKSYQNTIDIFSSEKTLKKQISSIVTDSTALEDPKDHENCNIFKIAKLFL
DESGQKELQIRYEKGGEGYGHFKIYLNELVNAYFKEAREKYNELLEKPSHLKEILDFGATKARKIAQEKMQKIYEKIGL
>O84589 6.1.1.2~~~trpS~~~Tryptophan--tRNA ligase~~~
MKKKRVLTGDRPTGKLHLGHWIGSIMNRLQLQNDSRYDCFFIIADLHTLTTKTRKEEILQIDNHIYDVLADWLSVGIDPE
KSAIYLQSAIPEIYELNLIFSMLTPLNHIMGIPSIKEMARNASLNEESLSHGLIGYPVLQSADILLAKAHLVPVGKDNEA
HVELTRDIAKTFNRLYGEVFPEPDILQGELTALVGTNGQGKMSKSANNAIYLSDDAKTVQEKIRKLYTDPNRIHATTPGR
VEGNPLFIYHDLFNPHKEEVEEFKTRYRQGCIRDVEVKARLAEEINLFLNPFREKRSELVAQPKFLEEALQQGTEKMRTV
ARETMEEVHDHLGLSRKWRTILASSK
>P00954 6.1.1.2~~~trpS~~~Tryptophan--tRNA ligase~~~COG0180
MTKPIVFSGAQPSGELTIGNYMGALRQWVNMQDDYHCIYCIVDQHAITVRQDAQKLRKATLDTLALYLACGIDPEKSTIF
VQSHVPEHAQLGWALNCYTYFGELSRMTQFKDKSARYAENINAGLFDYPVLMAADILLYQTNLVPVGEDQKQHLELSRDI
AQRFNALYGEIFKVPEPFIPKSGARVMSLLEPTKKMSKSDDNRNNVIGLLEDPKSVVKKIKRAVTDSDEPPVVRYDVQNK
AGVSNLLDILSAVTGQSIPELEKQFEGKMYGHLKGEVADAVSGMLTELQERYHRFRNDEAFLQQVMKDGAEKASAHASRT
LKAVYEAIGFVAKP
>P00953 6.1.1.2~~~trpS~~~Tryptophan--tRNA ligase~~~
MKTIFSGIQPSGVITIGNYIGALRQFVELQHEYNCYFCIVDQHAITVWQDPHELRQNIRRLAAKYLAVGIDPTQATLFIQ
SEVPAHAQAAWMLQCIVYIGELERMTQFKEKSAGKEAVSAGLLTYPPLMAADILLYNTDIVPVGEDQKQHIELTRDLAER
FNKRYGELFTIPEARIPKVGARIMSLVDPTKKMSKSDPNPKAYITLLDDAKTIEKKIKSAVTDSEGTIRYDKEAKPGISN
LLNIYSTLSGQSIEELERQYEGKGYGVFKADLAQVVIETLRPIQERYHHWMESEELDRVLDEGAEKANRVASEMVRKMEQ
AMGLGRRR
>P43835 6.1.1.2~~~trpS~~~Tryptophan--tRNA ligase~~~COG0180
MAKPIVFSGVQPSGELTIGNYLGALRNWVKMQEDYECIFCVVDLHAITVRQDPVALRKATLDVLALYLACGIDPNKSTIF
VQSHVPEHTQLSWVLNCYTYFGEMSRMTQFKDKSARYAENINVGLFDYPVLMAADILLYQAKSVPVGDDQKQHLEITRDI
ANRFNALYGNIFTIPEIFIGKAGARIMSLQDPEKKMSKSDDNRNNVVTLLEDPKSVAKKIKRAVTDSDEPPVVRYDVQNK
AGVSNLLDILSAVTDKPIADLEKEFEGKMYGHLKTAVADEVSTLLASLQERFHQYRNDETLLDNILRQGAEKARAKAQET
LAKVYEAVGFVAAK
>P75510 6.1.1.2~~~trpS~~~Tryptophan--tRNA ligase~~~
MMKRALTGIQASGKQHLGNYLGVMQSLIELQEQCQLFVFVADLHSITVDFQPQALKQNNFDLVRTLLAVGLDPQKACLFL
QSDLLEHSMMGYLMMVQSNLGELQRMTQFKAKKAEQTRNPNGTLNIPTGLLTYPALMAGDILLYQPDIVPVGNDQKQHLE
LTRDLAQRIQKKFKLKLRLPQFVQNKDTNRIMDLFDPTKKMSKSSKNQNGVIYLDDPKEVVVKKIRQATTDSFNKIRFAP
KTQPGVTNMLTILKALLKEPVNQSLTNQLGNDLEAYFSTKSYLDLKNALTEATVNLLVNIQRKREQISREQVFNCLQAGK
NQAQATARTTLALFYDGFGLGSQNIK
>P9WFT3 6.1.1.2~~~trpS~~~Tryptophan--tRNA ligase~~~COG0180
MSTPTGSRRIFSGVQPTSDSLHLGNALGAVAQWVGLQDDHDAFFCVVDLHAITIPQDPEALRRRTLITAAQYLALGIDPG
RATIFVQSQVPAHTQLAWVLGCFTGFGQASRMTQFKDKSARQGSEATTVGLFTYPVLQAADVLAYDTELVPVGEDQRQHL
ELARDVAQRFNSRFPGTLVVPDVLIPKMTAKIYDLQDPTSKMSKSAGTDAGLINLLDDPALSAKKIRSAVTDSERDIRYD
PDVKPGVSNLLNIQSAVTGTDIDVLVDGYAGHGYGDLKKDTAEAVVEFVNPIQARVDELTADPAELEAVLAAGAQRAHDV
ASKTVQRVYDRLGFLL
>P67593 6.1.1.2~~~trpS~~~Tryptophan--tRNA ligase~~~
METLFSGIQPSGIPTIGNYIGALKQFVDVQNDYDCYFCIVDQHAITMPQDRLKLRKQTRQLAAIYLASGIDPDKATLFIQ
SEVPAHVQAGWMLTTIASVGELERMTQYKDKAQKAVEGIPAGLLTYPPLMAADIVLYNTNIVPVGDDQKQHIELTRNLVD
RFNSRYNDVLVKPEIRMPKVGGRVMSLQDPTRKMSKSDDNAKNFISLLDEPNVAAKKIKSAVTDSDGIIKFDRDNKPGIT
NLISIYAGLTDMPIKDIEAKYEGEGYGKFKGDLAEIVKAFLVEFQEKYESFYNSDKLDDILDQGRDKAHKVSFKTVKKME
KAMGLGRKR
>Q9WYW2 6.1.1.2~~~trpS~~~Tryptophan--tRNA ligase~~~COG0180
MRILSGMRPTGKLHIGHLVGALENWVKLQEEGNECFYFVADWHALTTHYDDVSKLKEYTRDLVRGFLACGIDPEKSVIFV
QSGVKEHAELALLFSMIVSVSRLERVPTYKEIKSELNYKDLSTAGFLIYPVLQAADILIYKAEGVPVGEDQVYHIELTRE
IARRFNYLYDEVFPEPEAILSRVPKLPGTDGRKMSKSYGNIINLEISEKELEQTILRMMTDPARVRRSDPGNPENCPVWK
YHQAFDISEEESKWVWEGCTTASIGCVDCKKLLLKNMKRKLAPIWENFRKIDEDPHYVDDVIMEGTKKAREVAAKTMEEV
RRAMNLMF
>Q9KNV7 6.1.1.2~~~trpS~~~Tryptophan--tRNA ligase~~~COG0180
MSKPIVLSGVQPSGELSIGNYLGALRQWQQMQDDYDCQYCVVDLHAITVRQDPQALHEATLDALAICLAVGVDPKKSTLF
VQSHVPEHAQLGWVLNCYTQMGELSRMTQFKDKSARYANDVNAGLFGYPVLMAADILLYGAHQVPVGSDQKQHLELARDI
ATRFNNIYSPEQPIFTIPEPYIPTVNARVMSLQDATKKMSKSDDNRKNVITLLEDPKSIIKKINKAQTDAETPPRIAYDV
ENKAGIANLMGLYSAATGKTFAEIEAQYAGVEMYGPFKKDVGEAVVAMLEPVQAEYQRIRNDREYLNSVMRDGAEKASAK
ALQTLKKVYAAVGFVARP
>Q8ZJF2 6.1.1.2~~~trpS~~~Tryptophan--tRNA ligase~~~COG0180
MVLSKPTVSSKPIVFSGAQPSGELTIGNYMGALRQWVQMQDDYDCIYCIVDLHAITARQDPALLRKRTLDTLALYLACGI
DPKKSTIFVQSHVPEHSQLSWALNCYTYFGELSRMTQFKDKSARYAENINAGLFDYPVLMAADILLYQTNQVPVGEDQKQ
HLELSRDIASRFNNLYGDIFKIPEPFIPKAGARVMSLQDPTKKMSKSDDNRNNVIELLEDPKSVVKKIKRAMTDSDEPAL
IRYDVEKKAGVSNLLDILSGVTGQSIPELEAQFTGQMYGHLKGAVADAVSGMLSELQERYRTYREDEALLQDVMREGAAK
ARARAQVTLAKVYEAIGFVAQP
>P22326 6.1.1.1~~~tyrS1~~~Tyrosine--tRNA ligase 1~~~COG0162
MTNLLEDLSFRGLIQQMTDEEGLNKQLNEEKIRLYSGFDPTADSLHIGHLLPILTLRRFQLAGHHPIALVGGATGLIGDP
SGKKAERTLNTADIVSEWSQKIKNQLSRFLDFEAAENPAVIANNFDWIGKMNVIDFLRDVGKNFGINYMLAKDTVSSRIE
SGISYTEFSYMILQSYDFLNLYRDKNCKLQIGGSDQWGNITAGLELIRKSEEEGAKAFGLTIPLVTKADGTKFGKTEGGA
IWLDKEKTSPYEFYQFWINTDDRDVVKYLKYFTFLSKEEIEAYAEKTETAPEKREAQKRLAEEVTSLVHGREALEQAINI
SQALFSGNIKELSAQDVKVGFKDVPSMEVDSTQELSLVDVLVQSKLSPSKRQAREDIQNGAVYINGERQTEINYTLSGED
RIENQFTVLRRGKKKYFLVTYK
>P41256 6.1.1.1~~~tyrS~~~Tyrosine--tRNA ligase~~~
MTMKHQDAFEQIAFGTVDMLPEGEMLARLAAAQRDNRPLRIKLGMDPTAPDLHLGAYVLLHKARQFQDLGHRLLFVIGDF
TAMIGDPTGKSVTRKALSREEVVANAATYRPQVFKILDPERTEVMFNSEWLGALRPEELIQIAACYTVARMLERDDFNKR
YSANQPIAIHEFLYPLLQGYDSVAIKADVELGGTDQRFNLLVGRELQREYGQKPQLVLTMPILEGLDGVQKMSKSLGNFI
AVEDPPAEMFGKIMSISDFLMWRYYALLSRVPAVEQTRLQKEAASGARNPRDIKLDLAGELVRRFHGTAAAQEAHIAFLA
RFQRHETPEDLPLQAIKLSEAPRLSQLLVQVHLAASTSEAMRKMKEGAVRVDWRRVVDPATILALDAVYLLQFGKRHFAR
VALQKGE
>P0AGJ9 6.1.1.1~~~tyrS~~~Tyrosine--tRNA ligase~~~COG0162
MASSNLIKQLQERGLVAQVTDEEALAERLAQGPIALYCGFDPTADSLHLGHLVPLLCLKRFQQAGHKPVALVGGATGLIG
DPSFKAAERKLNTEETVQEWVDKIRKQVAPFLDFDCGENSAIAANNYDWFGNMNVLTFLRDIGKHFSVNQMINKEAVKQR
LNREDQGISFTEFSYNLLQGYDFACLNKQYGVVLQIGGSDQWGNITSGIDLTRRLHQNQVFGLTVPLITKADGTKFGKTE
GGAVWLDPKKTSPYKFYQFWINTADADVYRFLKFFTFMSIEEINALEEEDKNSGKAPRAQYVLAEQVTRLVHGEEGLQAA
KRITECLFSGSLSALSEADFEQLAQDGVPMVEMEKGADLMQALVDSELQPSRGQARKTIASNAITINGEKQSDPEYFFKE
EDRLFGRFTLLRRGKKNYCLICWK
>P00952 6.1.1.1~~~tyrS~~~Tyrosine--tRNA ligase~~~
MDLLAELQWRGLVNQTTDEDGLRKLLNEERVTLYCGFDPTADSLHIGHLATILTMRRFQQAGHRPIALVGGATGLIGDPS
GKKSERTLNAKETVEAWSARIKEQLGRFLDFEADGNPAKIKNNYDWIGPLDVITFLRDVGKHFSVNYMMAKESVQSRIET
GISFTEFSYMMLQAYDFLRLYETEGCRLQIGGSDQWGNITAGLELIRKTKGEARAFGLTIPLVTKADGTKFGKTESGTIW
LDKEKTSPYEFYQFWINTDDRDVIRYLKYFTFLSKEEIEALEQELREAPEKRAAQKTLAEEVTKLVHGEEALRQAIRISE
ALFSGDIANLTAAEIEQGFKDVPSFVHEGGDVPLVELLVSAGISPSKRQAREDIQNGAIYVNGERLQDVGAILTAEHRLE
GRFTVIRRGKKKYYLIRYA
>P9WFT1 6.1.1.1~~~tyrS~~~Tyrosine--tRNA ligase~~~COG0162
MSGMILDELSWRGLIAQSTDLDTLAAEAQRGPMTVYAGFDPTAPSLHAGHLVPLLTLRRFQRAGHRPIVLAGGATGMIGD
PRDVGERSLNEADTVAEWTERIRGQLERFVDFDDSPMGAIVENNLEWTGSLSAIEFLRDIGKHFSVNVMLARDTIRRRLA
GEGISYTEFSYLLLQANDYVELHRRHGCTLQIGGADQWGNIIAGVRLVRQKLGATVHALTVPLVTAADGTKFGKSTGGGS
LWLDPQMTSPYAWYQYFVNTADADVIRYLRWFTFLSADELAELEQATAQRPQQRAAQRRLASELTVLVHGEAATAAVEHA
SRALFGRGELARLDEATLAAALRETTVAELKPGSPDGIVDLLVASGLSASKGAARRTIHEGGVSVNNIRVDNEEWVPQSS
DFLHGRWLVLRRGKRSIAGVERIG
>B4RP13 6.1.1.1~~~tyrS~~~Tyrosine--tRNA ligase~~~
MSVIQDLQSRGLIAQTTDIEALDALLNEQKIALYCGFDPTADSLHIGHLLPVLALRRFQQAGHTPIALVGGATGMIGDPS
FKAAERSLNSAETVAGWVGSIRSQLTPFLSFEGGNAAIMANNADWFGSMNCLDFLRDIGKHFSVNAMLNKESVKQRIDRD
GAGISFTEFAYSLLQGYDFAELNKRHGAVLEIGGSDQWGNITAGIDLTRRLNQKQVFGLTLPLVTKSDGTKFGKTEGGAV
WLNAKKTSPYQFYQFWLKVADADVYKFLKYFTFLSIEEIGVVEAKDKASGSKPEAQRILAEEMTRLIHGEEALAAAQRIS
ESLFAEDQSRLTESDFEQLALDGLPAFEVSDGINAVEALVKTGLAASNKEARGFVNAKAVLLNGKPAEANNPNHAAERPD
DAYLLIGEYKRFGKYTILRRGKRNHALLVWK
>A6QHR2 6.1.1.1~~~tyrS~~~Tyrosine--tRNA ligase~~~
MTNVLIEDLKWRGLIYQQTDEQGIEDLLNKEQVTLYCGADPTADSLHIGHLLPFLTLRRFQEHGHRPIVLIGGGTGMIGD
PSGKSEERVLQTEEQVDKNIEGISKQMHNIFEFGTDHGAVLVNNRDWLGQISLISFLRDYGKHVGVNYMLGKDSIQSRLE
HGISYTEFTYTILQAIDFGHLNRELNCKIQVGGSDQWGNITSGIELMRRMYGQTDAYGLTIPLVTKSDGKKFGKSESGAV
WLDAEKTSPYEFYQFWINQSDEDVIKFLKYFTFLGKEEIDRLEQSKNEAPHLREAQKTLAEEVTKFIHGEDALNDAIRIS
QALFSGDLKSLSAKELKDGFKDVPQVTLSNDTTNIVEVLIETGISPSKRQAREDVNNGAIYINGERQQDVNYALAPEDKI
DGEFTIIRRGKKKYFMVNYQ
>Q7A537 6.1.1.1~~~tyrS~~~Tyrosine--tRNA ligase~~~
MTNVLIEDLKWRGLIYQQTDEQGIEDLLNKEQVTLYCGADPTADSLHIGHLLPFLTLRRFQEHGHRPIVLIGGGTGMIGD
PSGKSEERVLQTEEQVDKNIEGISKQMHNIFEFGTDHGAVLVNNRDWLGQISLISFLRDYGKHVGVNYMLGKDSIQSRLE
HGISYTEFTYTILQAIDFGHLNRELNCKIQVGGSDQWGNITSGIELMRRMYGQTDAYGLTIPLVTKSDGKKFGKSESGAV
WLDAEKTSPYEFYQFWINQSDEDVIKFLKYFTFLGKEEIDRLEQSKNEAPHLREAQKTLAEEVTKFIHGEDALNDAIRIS
QALFSGDLKSLSAKELKDGFKDVPQVTLSNDTTNIVEVLIETGISPSKRQAREDVNNGAIYINGERQQDVNYALAPEDKI
DGEFTIIRRGKKKYFMVNYQ
>Q97NE3 6.1.1.1~~~tyrS~~~Tyrosine--tRNA ligase~~~COG0162
MHIFDELKERGLIFQTTDEEALRKALEEGQVSYYTGYDPTADSLHLGHLVAILTSRRLQLAGHKPYALVGGATGLIGDPS
FKDAERSLQTKDTVDGWVKSIQGQLSRFLDFENGENKAVMVNNYDWFGSISFIDFLRDIGKYFTVNYMMSKESVKKRIET
GISYTEFAYQIMQGYDFFVLNQDHNVTLQIGGSDQWGNMTAGTELLRRKADKTGHVITVPLITDATGKKFGKSEGNAVWL
NPEKTSPYEMYQFWMNVMDADAVRFLKIFTFLSLDEIEDIRKQFEAAPHERLAQKVLAREVVTLVHGEEAYKEALNITEQ
LFAGNIKNLSVKELKQGLRGVPNYQVQADENNNIVELLVSSGIVNSKRQAREDVQNGAIYVNGDRIQELDYVLSDADKLE
NELTVIRRGKKKYFVLTY
>P83453 6.1.1.1~~~tyrS~~~Tyrosine--tRNA ligase~~~COG0162
MAGTGHTPEEALALLKRGAEEIVPEEELLAKLKEGRPLTVKLGADPTRPDLHLGHAVVLRKMRQFQELGHKVVLIIGDFT
GMIGDPSGRSKTRPPLTLEETRENAKTYVAQAGKILRQEPHLFELRYNSEWLEGLTFKEVVRLTSLMTVAQMLEREDFKK
RYEAGIPISLHELLYPFAQAYDSVAIRADVEMGGTDQRFNLLVGREVQRAYGQSPQVCFLMPLLVGLDGREKMSKSLDNY
IGLTEPPEAMFKKLMRVPDPLLPSYFRLLTDLEEEEIEALLKAGPVPAHRVLARLLTAAYALPQIPPRIDRAFYESLGYA
WEAFGRDKEAGPEEVRRAEARYDEVAKGGIPEEIPEVTIPASELKEGRIWVARLFTLAGLTPSNAEARRLIQNRGLRLDG
EVLTDPMLQVDLSRPRILQRGKDRFVRVRLSD
>A0A411EW25 2.1.1.381~~~sznE~~~Arginine N(omega)-methyltransferase~~~
MRHVQEARAVPAEHEARPAPVTMPANGSPYRLGAALLQSLAQEMNALAEQASALLSMPPESLSLDADAFAQIARRNVPRW
HFAMLNDTERNTALMTALERGIPAGATVLDIGSGSGLLAMAAARAGAGRVFTCEMNPLLAEIARNVISAHGMSDVITVIG
KPSTALDPVRDLGGPVDVLVSEIVDCGLIGEGLLPSVRHAREHLLKPDGIMLPSAARLHGRLVSSDEVLKLNQVTTAGGF
DVSLMNTVATRGHFPVRLDTWPHRFLSEAAPLVEFDLARSALEPGERPLALTATADGEVQALAVWFELDMGSGITLSNPP
DNPRSHWMQGWVPLDKPVPVKAGETLALRLGWSDFTLRVSI
>A0A411MR89 1.14.13.250~~~sznF~~~Nitrosourea synthase~~~
MSHVPPHVPFELSGAELRDAIVQYATNPIYHDNLDWLNHDNPYRRQLRPQVLPHLDYDKVPGRENILNYASLAVQRLLTS
VYEADLVFFPKSGLKGKEEDFRAFYSPANRALGERIRPALERYAFGFLDDEVETSGTWTAQSLDAYLDSLDTAGGAEQSP
VEKAILGSADRERAARMWLVQFAPDFLSEASPMMRNVLGYYGPAQSEWFKVVIDEYGYGVHDTKHSTLFERTLESVGLES
DLHRYWQYYLNSSLLLNNYFHYLGKNHELFFRYVGALYYTESSLVDFCRRADHLLREVFGDTVDTTYFTEHIHIDQHHGR
MAREKIIKPLVEAHGDGIIPEIVRGIEEYRVLLEIGDFDFSEQIAWMDAQPELKKLHDPVFEGLKQGKVDAPVAHLVEPR
GELSNTHCHDGDELCHIVSGTMRFESGLGSSLTLQAGEGVVIKRNRLHGANIESDECVYEIHSVGDYRKCL
>P10484 2.1.1.72~~~hsdM~~~Type I restriction enzyme EcoR124I/EcoR124II methylase subunit~~~
MKMTSIQQRAELHRQIWQIANDVRGSVDGWDFKQYVLGALFYRFISENFSSYIEAGDDSICYAKLDDSVITDDIKDDAIK
TKGYFIYPSQLFCNVAAKANTNDRLNADLNSIFVAIESSAYGYPSEADIKGLFADFDTTSNRLGNTVKDKNARLAAVLKG
VEGLKLGDFNEHQIDLFGDAYEFLISNYAANAGKSGGEFFTPQHVSKLIAQLAMHGQTHVNKIYDPAAGSGSLLLQAKKQ
FDNHIIEEGFFGQEINHTTYNLARMNMFLHNINYDKFDIKLGNTLTEPHFRDEKPFDAIVSNPPYSVKWIGSDDPTLIND
ERFAPAGVLAPKSKADFAFVLHALNYLSAKGRAAIVCFPGIFYRGGAEQKIRQYLVDNNYVETVISLAPNLFFGTTIAVN
ILVLSKHKTDTNVQFIDASELFKKETNNNILTDAHIEQIMQVFASKEDVAHLAKSVAFETVVANDYNLSVSSYVEAKDNR
EIIDIAELNAELKTTVSKIDQLRKDIDAIVAEIEGCEVQK
>P75436 2.1.1.72~~~~~~Type I restriction enzyme MpnII methylase subunit~~~
MEKKRTEQRNGVEKKIWEIADKLRGTIDGWDFKSYVLIGLFYRFLSENLCKYFNDSERRNNPDFSYENLTDDYEAIDALK
DAAIASKGFFIKPSQLFQNVVKSIRENKNNEDLNTTLRDIFDDIEKSTELGDGRSKESFKGLFKDFNVSEVKLGSTLTIR
TEKLKELLTSIDTMELDEFEKNSIDAFGDAYEFLISMYAQNAGKSGGEFFTPQDISELLARIAIGKKDTVDDVYDMACGS
GSLLLQVIKVLGKEKTSLVSYYGQEINHTTYNLCRMNMILHNIDYANFNIINADTLTTKEWEKHYVNCSNENGFEVVVSN
PPYSISWAGDKKSNLVSDVRFKDAGTLAPNSKADLAFVLHALYVLGQEGTAAIVCFPGILYREGKEQTIRKYLVDQNFVD
AVIQLPSNLFSTTSIATSILVLKKNRDKKDPIFFIDGSNEFVREKKNNRLSPKNIEKIVDCFNSKKEEANFAKSVERDKI
RESNYDLTVGKYVNSEAEKEELDIKVLNHSIDEIVDKQKDLRTKIKDIIQDIKVDFDNIDINN
>Q57168 2.1.1.72~~~hsdM~~~Type I restriction enzyme HindI methylase subunit~~~COG0286
MPASARWQALQEVSILNTGAELPWGGKFSGVAKLIDDAFDAIEKDNEKLKGVLQRISGYAVNEDTLRGLIILFSDTHFTR
PTYNGEPVHLGAKDILGHVYEYFLSRFAQAEGKRSGQYFTPKSIVSLIVEMLEPYSGRVYDPAMGSGGFFVQTERFITAH
QGNINNVSIYGQEFNPTTWKLAAMNMAIRGIDYDFGKYNADSFTQPQHIDKKMDFIMANPHFNDKEWWNESLADDPRWAY
GTPPKGNANFAWLQHMIYHLSPNGKIALLLANGSMSSQTNNEGEIRKAIINADLVECMVALPGQLFTNTKIPACIWFLNR
NKKRKGEVLFIDARQIGYMKDRVLRDFTADDIAKIADTLHAWQTSDGYEDQAAFCKSATLEEIKNNDFVLTPGRYVGTAE
QEDDGVPFAEKMQNLTALLKEQFAKSAELEAEIKKNLGGLGYE
>P08957 2.1.1.72~~~hsdM~~~Type I restriction enzyme EcoKI methylase subunit~~~COG0286
MNNNDLVAKLWKLCDNLRDGGVSYQNYVNELASLLFLKMCKETGQEAEYLPEGYRWDDLKSRIGQEQLQFYRKMLVHLGE
DDKKLVQAVFHNVSTTITEPKQITALVSNMDSLDWYNGAHGKSRDDFGDMYEGLLQKNANETKSGAGQYFTPRPLIKTII
HLLKPQPREVVQDPAAGTAGFLIEADRYVKSQTNDLDDLDGDTQDFQIHRAFIGLELVPGTRRLALMNCLLHDIEGNLDH
GGAIRLGNTLGSDGENLPKAHIVATNPPFGSAAGTNITRTFVHPTSNKQLCFMQHIIETLHPGGRAAVVVPDNVLFEGGK
GTDIRRDLMDKCHLHTILRLPTGIFYAQGVKTNVLFFTKGTVANPNQDKNCTDDVWVYDLRTNMPSFGKRTPFTDEHLQP
FERVYGEDPHGLSPRTEGEWSFNAEETEVADSEENKNTDQHLATSRWRKFSREWIRTAKSDSLDISWLKDKDSIDADSLP
EPDVLAAEAMGELVQALSELDALMRELGASDEADLQRQLLEEAFGGVKE
>Q89Z59 2.1.1.72~~~hsdM~~~Type I restriction enzyme BthVORF4518P methylase subunit~~~COG0286
MATNSSTEQSLTKKVWNLATTLAGQGIGFTDYITQLTYLLFLKMDAENVEMFGEESAIPTGYQWADLIAFDGLDLVKQYE
ETLKLLSELDNLIGTIYTKAQNKIDKPVYLKKVITMIDEEQWLIMDGDVKGAIYESILEKNGQDKKSGAGQYFTPRPLIQ
AMVDCINPQMGETVCDPACGTGGFLLTAYDYMKGQSASKEKRDFLRDKALHGVDNTPLVVTLASMNLYLHGIGTDRSPIV
CEDSLEKEPSTLVDVILANPPFGTRPAGSVDINRPDFYVETKNNQLNFLQHMMLMLKTGGRAAVVLPDNVLFEAGAGETI
RKRLLQDFNLHTILRLPTGIFYAQGVKANVLFFSKGQPTKEIWFYDYRTDIKHTLATNKLERHHLDDFVSCYNNRVEIYD
AENNPQGRWRKYPVDEIIARDKTSLDITWIKPGGEVDDRSLAELMADIKDKSQTISRAVTELEKLLANIEEN
>P10486 3.1.21.3~~~hsdR~~~Type I restriction enzyme EcoR124I/EcoR124II endonuclease subunit~~~
MTHQTHTIAESNNFIVLDKYIKAEPTGDSYQSESDLERELIQDLRNQGYEFISVKSQSAMLANVREQLQNLNGVVFNDSE
WRRFTEQYLDNPSDGILDKTRKIHIDYICDFIFDDERLENIYLIDKKNLMRNKVQIIQQFEQAGSHANRYDVTILVNGLP
LVQIELKKRGVAIREAFNQIHRYSKESFNSENSLFKYLQLFVISNGTDTRYFANTTKRDKNSFDFTMNWAKSDNTLIKDL
KDFTATCFQKHTLLNVLVNYSVFDSSQTLLVMRPYQIAATERILWKIKSSFTAKNWSKPESGGYIWHTTGSGKTLTSFKA
ARLATELDFIDKVFFVVDRKDLDYQTMKEYQRFSPDSVNGSENTAGLKRNLDKDDNKIIVTTIQKLNNLMKAESDLPVYN
QQVVFIFDECHRSQFGEAQKNLKKKFKRYYQFGFTGTPIFPENALGSETTASVFGRELHSYVITDAIRDEKVLKFKVDYN
DVRPQFKSLETETDEKKLSAAENQQAFLHPMRIQEITQYILNNFRQKTHRTFPGSKGFNAMLAVSSVDAAKAYYATFKRL
QEEAANKSATYKPLRIATIFSFAANEEQNAIGEISDETFDTSAMDSSAKEFLDAAIREYNSHFKTNFSTDSNGFQNYYRD
LAQRVKNQDIDLLIVVGMFLTGFDAPTLNTLFVDKNLRYHGLMQAFSRTNRIYDATKTFGNIVTFRDLERSTIDAITLFG
DKNTKNVVLEKSYTEYMEGFTDAATGEAKRGFMTVVSELEQRFPDPTSIESEKEKKDFVKLFGEYLRAENILQNYDEFAT
LKALQQIDLSDPVAVEKFKAEHYVDDEKFAELQTIRLPADRKIQDYRSAYNDIRDWQRREKEAEKKEKSTTDWDDVVFEV
DLLKSQEINLDYILGLIFEHNRQNKGKGEMIEEVKRLIRSSLGNRAKEGLVVDFIQQTNLDDLPDKASIIDAFFTFAQRE
QQREAEALIKEENLNEDAAKRYIRTSLKREYATENGTELNETLPKLSPLNPQYKTKKQAVFQKIVSFIEKFKGVGGKI
>Q07736 3.1.21.3~~~hsdR~~~Type I restriction enzyme EcoAI endonuclease subunit~~~
MAELNLSNLTEADIITKCVMPAILNAGWDNTTQIRQEVKLRDGKVIVRGKVAARRTVKSADIVLYHKPGIPLAVIEAKAN
KHEIGKGMQQGIEYARLLDVPFVFATNGDGFIFRDATAAEGECLEKQITLDDFPSPAELWQKFCLWKGYTQAQLPVITQD
YYDDGSGKSPRYYQLQAINKTIEAVSNGQNRVLLVMATGTGKTYTAFQIIWRLWKSKNKKRILFLADRNILVDQTKNNDF
QPFGTAMTKVSGRTIDPAYEIHLALYQAITGPEEDQKAFKQVAPDFFDLIVIDECHRGSASEDSAWREILDYFSSATQIG
LTATPKETHEVSSTDYFGDPVYVYSLKEGIEDGFLAPYKVVRVDIDVDLQGWRPTKGQTDLNGEVIDDRIYNQKDFDRTM
VIDERTELVARTITDYLKRTNPMDKTIVFCNDIDHAERMRRALVNLNPEQVKKNDKYVMKITGDDEIGKAQLDNFINPKK
PYPVIATTSELMTTGVDAKTCKLVVLDQNIQSMTKFKQIIGRGTRIDERYGKLWFTILDFKKATELFADERFDGIPEKVM
DTTPEDIADPESDFEEKLEEISEHDEEQVTGVDEPPAPPYQVTDTDDVGPLPEEDEKKIRKFHVNGVAVGVIAQRVQYYD
ADGKLVTESFKDYTRKTLLKEYASLDDFTRKWQDADRKEAIIHELEQQGIIWEVLAEEVGKDLDPFDMLCHVVYGQPPLT
RKERAENVRKRNYFTKYSEAAQAVLDNLLDKYADAGVQEIESIQVLKLKPFDSMGTLPEIIKTGFGDRNGYNQALSELEN
EIYQLPPRSA
>P08956 3.1.21.3~~~hsdR~~~Type I restriction enzyme EcoKI endonuclease subunit~~~COG4096
MMNKSNFEFLKGVNDFTYAIACAAENNYPDDPNTTLIKMRMFGEATAKHLGLLLNIPPCENQHDLLRELGKIAFVDDNIL
SVFHKLRRIGNQAVHEYHNDLNDAQMCLRLGFRLAVWYYRLVTKDYDFPVPVFVLPERGENLYHQEVLTLKQQLEQQVRE
KAQTQAEVEAQQQKLVALNGYIAILEGKQQETEAQTQARLAALEAQLAEKNAELAKQTEQERKAYHKEITDQAIKRTLNL
SEEESRFLIDAQLRKAGWQADSKTLRFSKGARPEPGVNKAIAEWPTGKDETGNQGFADYVLFVGLKPIAVVEAKRNNIDV
PARLNESYRYSKCFDNGFLRETLLEHYSPDEVHEAVPEYETSWQDTSGKQRFKIPFCYSTNGREYRATMKTKSGIWYRDV
RDTRNMSKALPEWHRPEELLEMLGSEPQKQNQWFADNPGMSELGLRYYQEDAVRAVEKAIVKGQQEILLAMATGTGKTRT
AIAMMFRLIQSQRFKRILFLVDRRSLGEQALGAFEDTRINGDTFNSIFDIKGLTDKFPEDSTKIHVATVQSLVKRTLQSD
EPMPVARYDCIVVDEAHRGYILDKEQTEGELQFRSQLDYVSAYRRILDHFDAVKIALTATPALHTVQIFGEPVYRYTYRT
AVIDGFLIDQDPPIQIITRNAQEGVYLSKGEQVERISPQGEVINDTLEDDQDFEVADFNRGLVIPAFNRAVCNELTNYLD
PTGSQKTLVFCVTNAHADMVVEELRAAFKKKYPQLEHDAIIKITGDADKDARKVQTMITRFNKERLPNIVVTVDLLTTGV
DIPSICNIVFLRKVRSRILYEQMKGRATRLCPEVNKTSFKIFDCVDIYSTLESVDTMRPVVVRPKVELQTLVNEITDSET
YKITEADGRSFAEHSHEQLVAKLQRIIGLATFNRDRSETIDKQVRRLDELCQDAAGVNFNGFASRLREKGPHWSAEVFNK
LPGFIARLEKLKTDINNLNDAPIFLDIDDEVVSVKSLYGDYDTPQDFLEAFDSLVQRSPNAQPALQAVINRPRDLTRKGL
VELQEWFDRQHFEESSLRKAWKETRNEDIAARLIGHIRRAAVGDALKPFEERVDHALTRIKGENDWSSEQLSWLDRLAQA
LKEKVVLDDDVFKTGNFHRRGGKAMLQRTFDDNLDTLLGKFSDYIWDELA
>P10485 ~~~hsdS~~~Type I restriction enzyme EcoR124I/EcoR124II specificity subunit~~~
MSEMSYLEKLLDGVEVEWLPLGEITKYEQPTKYLVKAKDYHDTYTIPVLTAGKTFILGYTNETHGIYQASKAPVIIFDDF
TTANKWVDFDFKAKSSAMKMVTSCDDNKTLLKYVYYWLNTLPSEFAEGDHKRQWISNYSQKKIPIPCPDNPEKSLAIQSE
IVRILDKFTALTAELTAELNMRKKQYNYYRDQLLSFKEGEVEWKTLGEIGKWYGGGTPSKNKIEFWENGSIPWISPKDMG
RTLVDSSEDYITEEAVLHSSTKLIPANSIAIVVRSSILDKVLPSALIKVPATLNQDMKAVIPHENILVKYIYHMIGSRGS
DILRAAKKTGGSVASIDSKKLFSFKIPVPNINEQQRIVEILDKFDTLTNSITEGLPREIELRQKQYEYYRDLLFSFPKPE
TVSN
>P19704 ~~~hsdS~~~Type I restriction enzyme EcoAI specificity subunit~~~COG0732
MSVEKLIVDHMETWTSALQTRSTAGRGSSGKIDLYGIKKLRELILELAVRGKLVPQDPNDEPASELLKRIAAEKAELVKQ
GKIKKQKPLPEISEEEKPFELPDGWEWTTLTRIAEINPKIDVSDDEQEISFIPMPLISTKFDGSHEFEIKKWKDVKKGYT
HFANGDIAIAKITPCFENSKAAIFSGLKNGIGVGTTELHVARPFSDIINRKYLLLNFKSPNFLKSGESQMTGSAGQKRVP
RFFFENNPIPFPPLQEQERIIIRFTQLMSLCDQLEQQSLTSLDAHQQLVETLLGTLTDSQNVEELAENWARISEHFDTLF
TTEASVDALKQTILQLAVMGKLVPQDPNDEPASELLKRIAQEKAQLVKEGKIKKQKPLPPISDEEKPFELPEGWEWCRLG
SIYNFLNGYAFKSEWFTSVGLRLLRNANIAHGVTNWKDVVHIPNDMISDFENYILSENDIVISLDRPIINTGLKYAIISK
SDLPCLLLQRVAKFKNYANTVSNSFLTIWLQSYFFINSIDPGRSNGVPHISTKQLEMTLFPLLPQSEQDRIISKMDELIQ
TCNKLKYIIKTAKQTQLHLADALTDAAIN
>P06990 ~~~hsdS~~~Type I restriction enzyme EcoBI specificity subunit~~~
MSFNSTSKELIEQNINGLLSIHDSWLRISMDSVANITNGFAFKSSEFNNRKDGVPLIRIRDVLKGNTSTYYSGQIPEGYW
VYPEDLIVGMDGDFNATIWCSEPALLNQRVCKIEVQEDKYNKRFFYHALPGYLSAINANTSSVTVKHLSSRTLQDTLLPL
PPLAEQKIIAEKLDTLLAQVDSTKARLEQIPQILKRFRQAVLAAAVTGRLTKEDKDFITKKVELDNYKILIPEDWSETIL
NNIINTQRPLCYGVVQPGDDIKDGIELIRVCDINDGEVDLNHLRKISKEIDLQYKRSKVRKNDILVTIVGAIGRIGIVRE
DINVNIARAVARISPEYKIIVPMFLHIWLSSPVMQTWLVQSSKEVARKTLNLKDLKNAFVPLPSIEEQHEIVRRVEQLFA
YADSIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAALLEKIKAERAASGGKKASRKKF
>P75279 ~~~~~~Putative type I specificity subunit S.MpnORF507P~~~
MQIRTYKIKDICDIQRGRGITKEYIKNNSGKYPVYSAATTNNGELGFINTYDFAGEYVTWTTNGYAGVVFYRNGKFSASQ
DCGVLKVRNKEINAQFLAFALSLKTPQFVHNLGSRPKLNRKVVAEISLDFPPLEVQEKIAHFLKSFNELSSQLKAELIKR
QKQYAFYSDYLLNPKHSQGEEYKLFKLKDIAKKILVGGEKPSDFQKEKDQVYKYPILSNSRKADDFLGYSKTFRIAEKSI
TVSARGTIGAVFYRDFSYLPAVSLICFIPKPEFNINFLFHALKATKFHKQGSGTGQLTMAQFKEYQVYIPSLKKQQEIAA
TLDPLYYIFANSNWGIYKEIELRKKQMQYYQERLFQWIENQKV
>P75416 ~~~~~~Putative type I specificity subunit S.MpnORF365P~~~
MEAPKHVNNACVIPNLTLKKMREIELDFPSKKIQEKIATILDTFTELSAELRERKKQYAFYRDYLLNQENIRKIYGANIP
FETFQVKDICEIRRGRAITKAYIRNNPGENPVYSAATTNDGELGRIKDCDFDGEYITWTTNGYAGVVFYRNGKFNASQDC
GVLKVKNKKICTKFLSFLLKIEAPKFVHNLASRPKLSQKVMAEIELSFPPLEIQEKIADILFAFEKLCNDLVEGIPAEVE
MRKKQLDYYQNFLFNWVQEQKTQLEQIM
>P75435 ~~~~~~Type I restriction enzyme MpnII specificity subunit~~~
MEAPKFVNNACPIPNLNLSRTEEIELDFPPLQIQQKIATILDTFTELSAELSAELSAELSAELSAELSAELSAELSAELS
AELSAELSAELSAELSAELSAELSAELSAELRERKKQYAFYRDYLLNQENIRKIYGANIPFETFQVKDICEIRRGRAITK
AYIRNNPGENPVYSAATTNDGELGRIKDCDFDGEYITWTTNGYAGVVFYRNGKFNASQDCGVLKVKNKKICTKFLSFLLK
IEAPKFVHNLASRPKLSQKVMAEIELSFPPLEIQEKIADILFAFEKLCNDLVEGIPAEIELRKKQLDYYQNFLFNWVQEQ
KKNSLSTNLN
>P44152 ~~~hsdS~~~Type I restriction enzyme HindI specificity subunit~~~COG0732
MSDWKEYSLGDISRNISRRFDFNAYPNVVFINTGDVLNNKFLHCEISNVKDLPGQAKKAIKKGDILYSEIRPGNGRYLFV
DNDLDNYVVSTKFMVIEPNANIVLPEFLFLLLISNETTEYFKMIAESRSGTFPQITFDSVSSLSLNIPDKETQQKILDII
TPLDDKIELNTQINQTLEQIAQALFKSWFVDFDPVRAKAQALSDGMSLEQAELAAMQAISGKTPEELTALSQTQPDRYAE
LAETAKAFPCEMVEVDGVEVDGVEVPRGWEMKALSDLGQIICGKTPSKSNKEFYGDDVPFIKIPDMHNQVFITQTTDNLS
VVGANYQSKKYIPAKSICVSCIATVGLVSMTSKPSHTNQQINSIIPDDEQSCEFLYLSLKQPSMTKYLKDLASGGTATLN
LNTSTFSKIEIITPSKEIIYIFQKKVVSIFEKTLSNSIENKRLTEIRDLLLPRLLNGEI
>P75180 ~~~~~~Putative type I specificity subunit S.MpnORF615P~~~
MQGILAEIELDFPPLQIQEKIATILDTFTELSAELRERKKQYAFYRDYLLNQENIRKIYGANIPFETFQVKDICEIRRGR
AITKAYIRNNPGENPVYSAATTNDGELGRIKDCDFDGEYITWTTNGYAGVVFYRNGKFNASQDCGVLKVKNKKICTKFLS
FLLKIEAPKFVHNLASRPKLSQKVMAEIELSFPPLEIQEKIADILFAFEKLCNDLVEGIPAEIEMRKKQLDYYYHLIFSK
IAHFSKQLA
>P05719 ~~~hsdS~~~Type I restriction enzyme EcoKI specificity subunit~~~COG0732
MSAGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKNLVKESQKISPEDIVI
AMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIP
PLAEQKIIAEKLDTLLAQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLS
SKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKL
IRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVN
NALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAALLEKIKAERAASGGKKASRKKS
>Q49434 ~~~~~~Putative type I specificity subunit S.MgeORF438P~~~COG0732
MTPKLKLNNNINWTKRTIDSLFDLKKGEMLEKELITPEGKYEYFNGGVKNSGRTDKFNTFKNTISVIVGGSCGYVRLADK
NFFCGQSNCTLNLLDPLELDLKFAYYALKSQQERIEALAFGTTIQNIRISDLKELEIPFTSNKNEQHAIANTLSVFDERL
ENLASLIEINRKLRDEYAHKLFSLDEAFLSHWKLEALQSQMHEITLGEIFNFKSGKYLKSEERLEEGKFPYYGAGIDNTG
FVAEPNTEKDTISIISNGYSLGNIRYHEIPWFNGTGSIALEPMNNEIYVPFFYCALKYLQKDIKERMKSDDSPFLSLKLA
GEIKVPYVKSFQLQRKAGKIVFLLDQKLDQYKKELSSLTVIRDTLLKKLFPDMTERTKSIKDY
>P75159 ~~~~~~Putative type I specificity subunit S.MpnORF638P~~~
MTPKLKLNTNSNWTKKTLGSLFELKKGEMLEKELLAPDGKYEYFNGGIKASGRTNEFNTFKNTISIIIGGSCGYVRLADK
DYFCGQSSCTLTVLDPLEIDLKFAYYALKSQEEKITSLASGTTIKNIRLSDLKDLPIPLVKSIQDQRTIAHALSVFDLRI
EHLNELIEVNRKLRDEYAHKLFTLDPDFLTHWNLHELHEQMGEISLGEVFHLKSGKYLKADERFEDGKFPYYGAGIESTS
FVNEPNTKGDTLSMIANGYSIGNIRYHTIPWFNGTGGIAMEALKPNKTYVPFFYCALKYMQKDLKERFKRDESPFISLKL
AGEIKVPFVKSFALQRKAGKIIYLLDKTLEECKEEAKSLISIRDNLLGKLFPTLS
>Q1LK00 1.13.11.11~~~kynA~~~Tryptophan 2,3-dioxygenase~~~COG3483
MSEFKGCPMGHGAAPQNGDGGDSGDTGNGWHGAQMDFARDMSYGDYLGLDQILSAQHPLSPDHNEMLFIVQHQTTELWMK
LMLHELRAARDGVKSDQLQPAFKMLARVSRIMDQLVQAWNVLATMTPPEYSAMRPYLGASSGFQSYQYREIEFILGNKNA
AMLRPHAHRPEHLELVETALHTPSMYDEAIRLMARRGFQIDPEVVERDWTQPTQYNASVEAAWLEVYRNPSAHWELYELG
EKFVDLEDAFRQWRFRHVTTVERVIGFKRGTGGTEGVSYLRRMLDVVLFPELWKLRTDL
>Q8PDA8 1.13.11.11~~~kynA~~~Tryptophan 2,3-dioxygenase~~~COG3483
MPVDKNLRDLEPGIHTDLEGRLTYGGYLRLDQLLSAQQPLSEPAHHDEMLFIIQHQTSELWLKLLAHELRAAIVHLQRDE
VWQCRKVLARSKQVLRQLTEQWSVLETLTPSEYMGFRDVLGPSSGFQSLQYRYIEFLLGNKNPQMLQVFAYDPAGQARLR
EVLEAPSLYEEFLRYLARFGHAIPQQYQARDWTAAHVADDTLRPVFERIYENTDRYWREYSLCEDLVDVETQFQLWRFRH
MRTVMRVIGFKRGTGGSSGVGFLQQALALTFFPELFDVRTSVGVDNRPPQGSADAGKR
>P25239 3.1.21.4~~~~~~Type II restriction enzyme and methyltransferase RM.Eco57I~~~
MNKKDQLQRLIDKYKSDIDYYRSARYNETQLRTDFLDQLFLILGWDITNAAGKPTNEREVLVEEGLKARAGENTKKPDYT
FRLFSERKFFLEAKKPSVDISTTIEPALQVRRYGFTAKLKISVLSNFEYTAIYDCSNQVKETDSVANSRIKLYHFTELVD
KFDEINNLIGRESVYTGHFDNEWSEIENKILRFSVDDLFLKQINDWRLLLANEFLQIKNELPEEKLNDLVQNYINSIVFL
RVCEDRDLEEYETLYHFAQDKDFQSLVKKLKSSDKKYNSGLFSLEYIDELLSNANSCIWSIIEQLYFPQSTYSFSVFSSD
ILGNIYEIFLSEKVRIDELGNVKIQPKEEHIDRDVVTTPTHIVKEIIRNTVVEYCKGKSDIEILNSKFADIACGSGAFII
EAFQFIQDILIDYYIQNDKSKLQQISEHTYKLKFEVKREILCKCIYGIDKDYNATKACTFGLLLKLLEGETTETIGKDTP
ILPALDTNILFGNSLIDSGDKVKQEDIFSINPFDLTNYQFDVIVGNPPYMATEHMNQLTPKELDIYKRKYKSAYKQFDKY
FLFIERSIQILKEYGYLGYILPSRFIKVDAGKKLRKFLSENKYLSKLISFGSHQVFKNKTTYTCLLFLNKENHDNFSFYE
VKDFKKWLTREDKYLLSSTYQTSSLDSDTWVLEKKINDILKLMFSKSEQLGNIVGKSNVANGIQTSANKYYIHKEIKSEN
GFIYFEYDGIEYHIEKELTRPYFETNRSGDDSFYTYKDVEPNSFVVYPYKKVGERIQFIEYDELKRQYPKLFEFLQVVKV
HLNDKKRSIKPDPTGPNEWYRYGRSQALENCDVDQKLIVGILSNGYKYSIDNHRTFVSSGGTAGYSIINIPNNVRYSIYY
IQAILTSKYLEWFASIYGDIFRGRFVARGTKVQTRMPIPTIDFDDPKQKEIHDTISSKQQYLNKLYSQTQKSADRDKIIF
ERQFEQEKIQMDYLIKNLFDLGDLDSEIPTVEDLYKNL
>P24546 3.1.21.4~~~accIR~~~Type II restriction enzyme AccI~~~
MDYYDRIRELTKNVPVELVDFEQPRDLARTPTQASSNFITNKEQGDWAEDLVTRAINENSKNFVAVKYGKSDNLVAGENG
FDTFYQDFQTELDTIGKRPDLLIFKKTDFDTTLGFDVSQIPHHQITDYVKKAIAGIEVRSSAFLIDKYEEAMQVRTQRFT
EIAFQTRDKILAEFLDVLDHPSRSKYITLLNTLTLETISIFDFKVPGWRSNERLIEVNNLFKRLKVAIKEIQKRDYLSIT
PKVEDIKVVYKWIETFNVPHFYFQVFFDKVYGISFEQILTIISNSDNDGVIFSVEKDVQNQNKTTIKINSKTGYPIASKV
DEPTHESIRKEMDRGRLLFYVTFKGGTAYLDLDNLRTILGIEEAEF
>Q9KHV6 3.1.21.4~~~ageIR~~~Type II restriction enzyme AgeI~~~
MRLDLDFGRGLVAHVMLDNVSEEQYQQISDYFVPLVNKPKLKSRDAIGQAFVMATEVCPDANPSDLWHHVLYRIYIREKI
GTDPSQSWVRTSGEAFEVALVERYNPVLARHGIRLTALFKGQKGLALTRMGVADRVGSRKVDVMIEKQGGGRSPDAEGFG
VVGGIHAKVSLAERVSDDIPASRIMMGEGLLSVLSTLDVKSFPPPHGDLVNRGELGTPDRPSDKRNYIEGHGDFSACFSY
NLRTSPSNATTPSGRHIYVSGFSGQDDEFTDYLVAQLA
>D4ZX34 3.1.21.4~~~aplIR~~~Type II restriction enzyme AplI~~~COG0827
MSSNNPTPNLEQLMLEARQLMKDLGLPDKMQGDTPIFVLLVMLDMKPAKSWSEANNQKWGITPLMNKMRELGFKNLAPNT
RENIRDDCVGQLVDAELATENPDKPRPKNSPKYCYQINQEVLYLVKKIGSADYPIALNNFLSNYQTIKHKYQAKRQSQRL
NVKIAHNFSVSIAPGGQGVLIKSVLQDFCKYFNIDKVLYIDNTVDTARGYSPFIDENLINYLGIDIDKFKNSYDKPDIVL
YKSDNKYLIIIEAVKTGGAINVERRDRLLSLFENVDVKLSFVNAFESFKELKRLTKEITRETHAWIMEFPDHMIHFNGDQ
YLFH
>O68557 3.1.21.4~~~bglIR~~~Type II restriction enzyme BglI~~~
MYNLHREKIFMSYNQNKQYLEDNPEIQEKIELYGLNLLNEVISDNEEEIRADYNEANFLHPFWMNYPPLDRGKMPKGDQI
PWIEVGEKAVGSKLTRLVSQREDITVREIGLPTGPDERYLLTSPTIYSLTNGFTDSIMMFVDIKSVGPRDSDYDLVLSPN
QVSGNGDWAQLEGGIQNNQQTIQGPRSSQIFLPTIPPLYILSDGTIAPVVHLFIKPIYAMRSLTKGDTGQSLYKIKLASV
PNGLGLFCNPGYAFDSAYKFLFRPGKDDRTKSLLQKRVRVDLRVLDKIGPRVMTIDMDK
>P70985 3.1.21.4~~~bsoBIR~~~Type II restriction enzyme BsoBI~~~
MNTQKPFENHLKSVDDLKTTYEEYRAGFIAFALEKNKRSTPYIERARALKVAASVAKTPKDLLYLEDIQDALLYASGISD
KAKKFLTEDDKKESINNLIENFLEPAGEEFIDELIFRYLLFQGDSLGGTMRNIAGALAQQKLTRAIISALDIANIPYKWL
DSRDKKYTNWMDKPEDDYELETFAKGISWTINGKHRTLMYNITVPLVKKNVDICLFNCEPEIYTPQKVHQQPEKYLLLGE
LKGGIDPAGADEHWKTANTALTRIRNKFSEKGLSPKTIFIGAAIEHSMAEEIWDQLQSGSLTNSANLTKTEQVGSLCRWI
INI
>P25257 3.1.21.4~~~hgiBIR~~~Type II restriction enzyme HgiBI~~~
MAINPITRNKIKDYLNSFIQQQLSVYSQRSLREFQDVDSYPSSLSKDGDLKPFHASLIPASIMRLNRFERSLSTGLGSTF
EECTRLIALDHHAVALRNYDIQAALDQAQWAAIDQLISTIDRGLKHQTPSLNQMLEQIQSIPLTGILETHIVRADLYIQR
HDGSELFFEIKSPKPSKGQCLEVMQRLLRIYTIKQQSAVPVKAFYAMAYNPWGISRASYRSSITKKYTDFSNAVVIGQEF
WSLIGEPSTYTELLEIYHEVGLAKSAEITQKLLQ
>Q45488 3.1.21.4~~~bglIIR~~~Type II restriction enzyme BglII~~~
MKIDITDYNHADEILNPQLWKEIEETLLKMPLHVKASDQASKVGSLIFDPVGTNQYIKDELVPKHWKNNIPIPKRFDFLG
TDIDFGKRDTLVEVQFSNYPFLLNNTVRSELFHKSNMDIDEEGMKVAIIITKGHMFPASNSSLYYEQAQNQLNSLAEYNV
FDVPIRLVGLIEDFETDIDIVSTTYADKRYSRTITKRDTVKGKVIDTNTPNTRRRKRGTIVTY
>P19887 3.1.21.4~~~banIR~~~Type II restriction enzyme BanI~~~
MAQLKYNKDIDELERNAAKWWPDFLAKKESSTSIIPKLVESQDAFISLLNLSKNNPFDIFQLIDASKFPPNLFLKHLVVL
TDFGGEPLNRLNQNFDSLFPMIPYGIHYITKVLGKFEFFWNEKKYEYVFQELPVTSLTNSKLKIDGASISKTVPLSDLYK
DVIVLLMFGANAVNSEVSEVLMKCEVGNLIGKTDELKKFIKERYIFVSRITGGAEANTLGQVAQTHVIDFLRTRFGSKGH
DIKSNGHIEGVTHNDGQTLTTFDVVIKKGSKSVAIEISFQVTTNSTIERKAGQAKARYDMVSDTGNYIAYIIDGAGNFQR
KNAITTICNNSHCTVAYTEEELNVLLKFILEKLE
>P23940 3.1.21.4~~~bamHIR~~~Type II restriction enzyme BamHI~~~
MEVEKEFITDEAKELLSKDKLIQQAYNEVKTSICSPIWPATSKTFTINNTEKNCNGVVPIKELCYTLLEDTYNWYREKPL
DILKLEKKKGGPIDVYKEFIENSELKRVGMEFETGNISSAHRSMNKLLLGLKHGEIDLAIILMPIKQLAYYLTDRVTNFE
ELEPYFELTEGQPFIFIGFNAEAYNSNVPLIPKGSDGMSKRSIKKWKDKVENK
>P33562 3.1.21.4~~~hsdBR~~~Type II restriction enzyme BsuBI~~~
MTEGMHSNVKEAIKILKELGLPKGQQNERSALCLLSLMNITQDKTWSEAESPLIGITPMMEFCRINYGKEYAPNSRETFR
RFTMHQFVDAGIALYNPDKPTRPVNSPKAVYQIEAETLELIKCYNTEEWSELLARYLSNRQTLVERYAKERQQNKIPVQI
AEGKEIYITPGEHSELIKAIIEEFAPRYVPGGRLIYAGDTGEKMGYFDEELLRQLGVVIDSHGKMPDVVIYFPEKKWLLL
IESVTSHGPVDHKRHEELAKLFNGSTAGIVYVTAFPNRSLMARYLNNISWETEVWVADAPSHLIHFNGVRFLGPYE
>P25217 3.1.21.4~~~hsdFR~~~Type II restriction enzyme BsuFI~~~
MNKDNQIKNESGKQAKILVSEIVNNLKNELGINIEIEEGYSIGYPNQEKQFKMDFLVQFTDFDNEQWLIKSTNSIRERIY
GTEFFAQNIRLIDEKVKNIYVVVPDSISSAEMKKKRNYSVKINGTTYTSFLTDVLTVNELRQKIVEKASQNIAQGLRANV
LGNDAETSIVNLLNDLKNKALWNDYQNAQQTIKSSTYKIYKEILEKIDLKEGFDKILEVTATNDIPLLSNRGKPKTDVSV
TIKTNTKELIRNISIKNTREKTVTIHEGSVSDLISRLKLSETDPLSQALIHFEKVGSKKKLIAEHPNSDKILEENLKLYN
RELIEFLHSPLLNDKIQMVDLIIFTNKFAVWNRDDYIKHYIEEYSGKGQFGTPFKWTYPSKKRGQKIQIKGFSNN
>Q9LAI1 3.1.21.4~~~bslIRalpha~~~Type II restriction enzyme BslI subunit alpha~~~
MERQLKSIAYAFVANDIDVYIPDGESNCIVVTKLVCKDCGQYWHTSLSECYFCGTLNFYLYECNSCGKKYSLTSSSKSCD
TDGCNGKLIKRCSNPECISRTNEEIQRATDEQGGVFDLNSSFNVSLNHCVTCGSKENYYKTYRIYSYRTEVEPNIEALRE
FANNNKLNSDEDVIIIKHLVDNVIHYGYIPYSKLDETTEITTTFSRFSDLVSELFPVNVPPNVTE
>Q9LAI0 3.1.21.4~~~bslIRbeta~~~Type II restriction enzyme BslI subunit beta~~~
MEQQKFPNPRIFEDIDATDFSKHNKKHVTEDFVAENFKDVGWRVYRPFNDTGIDLIAKKFVCPDGHTKWNQNLTKEMTCS
ECGKSLIEITRFIQVKTREVKQVKTREAKGEKFFFGYTLKSKDFRTDPRHVFLLYSDFTMDFIILPMYDYLNLFYTNQSL
GSTHFSTPSFRQGNNKLNGLSKDKNDNWVWSGVSFNEFVNEKGMDKLSCPIYDIELESYTKKIQELKFSLFYRYSPGRKN
QVSAPTVEFINNHFSIFISLPKEAIASKRKAHLESLRQDLPEDLKKSVNEGYLVKFKGVDL
>P25258 3.1.21.4~~~hgiCIR~~~Type II restriction enzyme HgiCI~~~
MNYQRSFEDLEFNAIKWWPQELSATVAEASVLPILISSQDLFISILKLSGTHPEQIFDVINAAQISANLFLKHLVVLADY
GGEMIKRLGKEFQEIFPRMDSTLEYYMNYTFKGEQYTYIFKKLPIKGLDNSKLAIDGKAIIEIKPLSDLYRDMIMILLYG
STTEQFNLAGLEKCEIGTILGKNEIIYTYITQKYLYVSRITNGANTNSLGQIAQTYVCDILSKYLPNDYSVTRNGKILLS
DLNSQDSTKTSFDILVEFADKKVGIEVSFQVTTNSTIERKAGQARDRQNRMHAHYYWIAYVIDGAGNFERSGAVRAICRY
SDCTVAYSESEIAVLAAFIQEKFNA
>P17743 3.1.21.4~~~hincIIR~~~Type II restriction enzyme HincII~~~
MSFIKPIYQDINSILIGQKVKRPKSGTLSGHAAGEPFEKLVYKFLKENLSDLTFKQYEYLNDLFMKNPAIIGHEARYKLF
NSPTLLFLLSRGKAATENWSIENLFEEKQNDTADILLVKDQFYELLDVKRRNISKSAQAPNIISAYKLAQTCAKMIDNKE
FDLFDINYLEVDSELNGEDLVCVSTSFAELFKSEPSELYINWAAAMQIQFHVRDLDQGFNGTREEWAKSYLKHFVTQAEQ
RAISMIDKFVKPFKKYIL
>Q60132 3.1.21.4~~~cfr9IR~~~Type II restriction enzyme Cfr9I~~~
MTNKIVFPEPKQQVDFAFSLKRFRGIYLQNALLETVRDMDIVALDTQLAEYVNKADLATLATYGLRAELLFPVPVLLETN
PFLLGYYRLLMGYSQKEFYGKDKGFNAGCFKSMEVKGNIGKVAKPKISELCHAFCSVASSLLQGVGPLRISRELLDDLTL
LTVGPQLRGGANNQRGADGIVLVFEIIKEIVSHAVAEVRENAIEVNSATGRNVLIEFAPDPDIIIREEMSLDNYRNVVAI
EVKSGTDVSNIHNRIGEAEKSHQKARGHGYTECWTVVNVSRLDMDKARKESPSTNRFYSITDLSLREGEQYEDFRRRVLS
LTAISAAPTP
>P56200 3.1.21.4~~~~~~Type II restriction enzyme Cfr10I~~~
MDIISKSGEGNKYTINSAIAFVAYASHIDINTTEFSKVLSGLRDFINDEAIRLGGKISDGSFNKCNGDWYEWLIGIRAIE
FFLESETNFIVVKMPNATSFDVMSIYKSCLSEFIYDLRSKLSLNNVNLITSNPDFSIIDIRGRREELKSMLKDISFSNIS
LSTISEIDNLYKNFIDYAELEHIKSFLSVKTTFRPDRRLQLAHEGSLMKALYTHLQTRTWTINPTGIRYYAAATSIGNAD
VIGLKTVATHSITDVKSLPQSAVDEIFKINSVLDVDSCLSHILSS
>P0A459 3.1.21.4~~~dpnC~~~Type II methyl-directed restriction enzyme DpnI~~~
MELHFNLELVETYKSNSQKARILTEDWVYRQSYCPNCGNNPLNHFENNRPVADFYCNHCSEEFELKSKKGNFSSTINDGA
YATMMKRVQADNNPNFFFLTYTKNFEVNNFLVLPKQFVTPKSIIQRKPLAPTARRAGWIGCNIDLSQVPSKGRIFLVQDG
QVRDPEKVTKEFKQGLFLRKSSLSSRGWTIEILNCIDKIEGSEFTLEDMYRFESDLKNIFVKNNHIKEKIRQQLQILRDK
EIIEFKGRGKYRKL
>P0A460 3.1.21.4~~~dpnC~~~Type II Methyl-directed restriction enzyme DpnI~~~
MELHFNLELVETYKSNSQKARILTEDWVYRQSYCPNCGNNPLNHFENNRPVADFYCNHCSEEFELKSKKGNFSSTINDGA
YATMMKRVQADNNPNFFFLTYTKNFEVNNFLVLPKQFVTPKSIIQRKPLAPTARRAGWIGCNIDLSQVPSKGRIFLVQDG
QVRDPEKVTKEFKQGLFLRKSSLSSRGWTIEILNCIDKIEGSEFTLEDMYRFESDLKNIFVKNNHIKEKIRQQLQILRDK
EIIEFKGRGKYRKL
>P09357 3.1.21.4~~~dpnB~~~Type II restriction enzyme DpnII~~~
MKQTRNFDEWLSTMTDTVADWTYYTDFPKVYKNVSSIKVALNIMNSLIGSKNIQEDFLDLYQNYPEILKVVPLLIAKRLR
DTIIVKDPIKDFYFDFSKRNYSIEEYTMFLEKSGIFDLLQNHLVSNLVDYVTGVEVGMDTNGRKNRTGDAMENIVQSYLE
AEGYILGENLFKEIEQNEIEEIFSVDLSAITNDGNTVKRFDFVIKNEQVLYLIEVNFYSGSGSKLNETARSYKMIAEETK
AIPNVEFMWITDGQGWYKAKNNLRETFDILPFLYNINDLEHNILKNLK
>P43870 3.1.21.4~~~hindIIIR~~~Type II restriction enzyme HindIII~~~
MKKSALEKLLSLIENLTNQEFKQATNSLISFIYKLNRNEVIELVRSIGILPEAIKPSSTQEKLFSKAGDIVLAKAFQLLN
LNSKPLEQRGNAGDVIALSKEFNYGLVADAKSFRLSRTAKNQKDFKVKALSEWREDKDYAVLTAPFFQYPTTKSQIFKQS
LDENVLLFSWEHLAILLQLDLEETNIFSFEQLWNFPKKQSKKTSVSDAENNFMRDFNKYFMDLFKIDKDTLNQLLQKEIN
FIEERSLIEKEYWKKQINIIKNFTREEAIEALLKDINMSSKIETIDSFIKGIKSNDRLYL
>P00642 3.1.21.4~~~ecoRIR~~~Type II restriction enzyme EcoRI~~~
MSNKKQSNRLTEQHKLSQGVIGIFGDYAKAHDLAVGEVSKLVKKALSNEYPQLSFRYRDSIKKTEINEALKKIDPDLGGT
LFVSNSSIKPDGGIVEVKDDYGEWRVVLVAEAKHQGKDIINIRNGLLVGKRGDQDLMAAGNAIERSHKNISEIANFMLSE
SHFPYVLFLEGSNFLTENISITRPDGRVVNLEYNSGILNRLDRLTAANYGMPINSNLCINKFVNHKDKSIMLQAASIYTQ
GDGREWDSKIMFEIMFDISTTSLRVLGRDLFEQLTSK
>P25260 3.1.21.4~~~hgiEIR~~~Type II restriction enzyme HgiEI~~~
MAINPITRNKIKDYLNSFIQQQLSVYRERSLREFQDVDSYLPSLSKDGDLKPFHASLIPASIMRLNRFERSLSTGLGSTF
EECTRLIALDHHAVALRNYDIQAALDQAQWAAIDQLISIIDRGLKHQTPSLNQMLEQIQSIPLTGILETHIVRADLYIQR
HDGSELFFEIKSPKPNKGQCLEVMQRLLRIYTIKQQSAVPVKAFYAMAYNPWGISRASYRSSNTKKYTDFSNAVVIGQEF
WSLIGEPSTYTELLEIYHEVGLAKSAEITQKLLQ
>P14633 3.1.21.4~~~ecoRIIR~~~Type II restriction enzyme EcoRII~~~
MLMSVFHNWLLEIACENYFVYIKRLSANDTGATGGHQVGLYIPSGIVEKLFPSINHTRELNPSVFLTAHVSSHDCPDSEA
RAIYYNSRHFGKTRNEKRITRWVEAAHFRILKITGALTLLAFKLDEQGGDCKEVNIWVCASTDEEDVIETAIGEVIPGAL
ISGPAGQILGGLSLQQAPVNHKYILPEDWHLRFPSGSEIIQYAASHYVKNSLDPDEQLLDRRRVEYDIFLLVEELHVLDI
IRKGFGSVDEFIALANSVSNRRKSRAGKSLELHLEHLFIEHGLRHFATQAITEGNKKPDFLFPSAGAYHDTEFPVENLRM
LAVKTTCKDRWRQILNEADKIHQVHLFTLQEGVSLAQYREMRESGVRLVVPSSLHKKYPEAVRAELMTLGAFIAELTGLY
ADIP
>P04390 3.1.21.4~~~ecoRVR~~~Type II restriction enzyme EcoRV~~~
MSLRSDLINALYDENQKYDVCGIISAEGKIYPLGSDTKVLSTIFELFSRPIINKIAEKHGYIVEEPKQQNHYPDFTLYKP
SEPNKKIAIDIKTTYTNKENEKIKFTLGGYTSFIRNNTKNIVYPFDQYIAHWIIGYVYTRVATRKSSLKTYNINELNEIP
KPYKGVKVFLQDKWVIAGDLAGSGNTTNIGSIHAHYKDFVEGKGIFDSEDEFLDYWRNYERTSQLRNDKYNNISEYRNWI
YRGRK
>P14870 3.1.21.4~~~fokIR~~~Type II restriction enzyme FokI~~~
MVSKIRTFGWVQNPGKFENLKRVVQVFDRNSKVHNEVKNIKIPTLVKESKIQKELVAIMNQHDLIYTYKELVGTGTSIRS
EAPCDAIIQATIADQGNKKGYIDNWSSDGFLRWAHALGFIEYINKSDSFVITDVGLAYSKSADGSAIEKEILIEAISSYP
PAIRILTLLEDGQHLTKFDLGKNLGFSGESGFTSLPEGILLDTLANAMPKDKGEIRNNWEGSSDKYARMIGGWLDKLGLV
KQGKKEFIIPTLGKPDNKEFISHAFKITGEGLKVLRRAKGSTKFTRVPKRVYWEMLATNLTDKEYVRTRRALILEILIKA
GSLKIEQIQDNLKKLGFDEVIETIENDIKGLINTGIFIEIKGRFYQLKDHILQFVIPNRGVTKQLVKSELEEKKSELRHK
LKYVPHEYIELIEIARNSTQDRILEMKVMEFFMKVYGYRGKHLGGSRKPDGAIYTVGSPIDYGVIVDTKAYSGGYNLPIG
QADEMQRYVEENQTRNKHINPNEWWKVYPSSVTEFKFLFVSGHFKGNYKAQLTRLNHITNCNGAVLSVEELLIGGEMIKA
GTLTLEEVRRKFNNGEINF
>P43418 3.1.21.4~~~hgaIR~~~Type II restriction enzyme HgaI~~~
MEKILMLNDDQIWIFKKHTNNIQLLIEVALYLKSNKSSVSKKDKDAMYDIFSESELYNPRESLRDKPLDTINHKLDGLSY
FMFGYSDRINDENKFIFSPLGNLFLKYLHDKDKLSKIFSCMLISMQFPHPYSKPSECFLLYPFRLIFKLLLDKRLQGRLY
HYEVYKIIIHTISIDEAKYEFLVKSILNSRKKSWNEKLNELSEIQHKVVKSVYEWQYYIVPLLGSLHIFKINNGDIEQKL
YHPQKDGSKSPPTARKANNGYVEINDNLTNFIDKLLNKYSFLDTPILLSDSQRKSNDVTKEIYSFYPELLLAEIGETISF
ESHILNIPKLITEYSKNPDNSTSGKFEKILEEAFNLFIDVEAQWLAGAGRTDIECMYLPINEKFSIEAKSTKNKLSMINS
GRLKRHRTLISANYTIVITPRYVPSVRYDIEAQDIVLITADTLAEYLYNNIISNNRDISYADIQAIIVANLGKDISTQIS
NLTLSKFG
>P29537 3.1.21.4~~~hpaIR~~~Type II restriction enzyme HpaI~~~
MKYEEINFKVPVESPYYPNYSQCVIERIYSILRNQKDMGDDRIIINTNLKKGLPLENINKIAGPMIEAWAEEVFSGIRDN
RDNQYNLINVEAQERLGISDIILQFQVNNNVITGNVDVKATSNDIPDSGKSPNITSFSRIRTAYVKDPNFIFIILSIKHS
VYVKRNEYTNLMDGIMQIIDFNVYDLKYISDSDISYNPALGTGQIQIKDIHYVSSQKRTTWQMCQLLDLKYLRSKKRTIE
QFYNEAKRNKWIKD
>P25237 3.1.21.4~~~kpnIR~~~Type II restriction enzyme KpnI~~~
MDVFDKVYSDDNNSYDQKTVSQRIEALFLNNLGKVVTRQQIIRAATDPKTGKQPENWHQRLSELRTDKGYTILSWRDMKV
LAPQEYIMPHATRRPKAAKRVLPTKETWEQVLDRANYSCEWQEDGQHCGLVEGDIDPIGGGTVKLTPDHMTPHSIDPATD
VNDPKMWQALCGRHQVMKKNYWDSNNGKINVIGILQSVNEKQKNDALEFLLNYYGLKR
>P50189 3.1.21.4~~~mamIR~~~Type II restriction enzyme MamI~~~
MQNAVSQAISQGIHVRREILGSLTYEQRVFLLEDLFVDLFGHQHVMLQRWAALTGQSAQVDTGYIAQFVASIVLGEPGQG
FRGKGDDLADGSEVKSAANISGVDRPRWNHNLGSLDDDEHRRSRGLPTAGEEYLGVPYMFYLLVDRPHGVSDPAPIRIRA
WCIDAQEDGDWRDLFETFLTSRRGRTYNFQLHPPVGYDDDVVVNTLGNLDFSNVLVFDARLSLADRDRPEIDWHVPLPTQ
VIPVTGRTRALRYGGRGARPTRLTNTADIVLGTNDLGALFPGVLAPRDSYDLATVSEIETEAEVEEYS
>P34719 3.1.21.4~~~mboIR~~~Type II restriction enzyme MboI~~~
MKLAFDDFLNSMSETNTTLDYFTDFDKVKKNVAQIEIHLNQLNYLLGKDDLKQAVYDLYAECPNAFSILEILIAVRKKEQ
KKSLDEKGQVVTLNSYFQSADKIIDFLNNTGLADVFRDKNIKNLVDYVFGIEVGLDTNARKNRGGDNMSKAVQLLFDNAD
IYYKKEVRNTIFTDIESLGADVKQFDFVIKTKRKTYVIETNYYNSGGSKLNEVARAYTDVAPKINQYSQYEFVWITDGQG
WKTAKNKLQEAYTHIPSVYNLYTLHGFIEQLNSEGVIKDW
>P11405 3.1.21.4~~~mspIR~~~Type II restriction enzyme MspI~~~
MRTELLSKLYDDFGIDQLPHTQHGVTSDRLGKLYEKYILDIFKDIESLKKYNTNAFPQEKDISSKLLKALNLDLDNIIDV
SSSDTDLGRTIAGGSPKTDATIRFTFHNQSSRLVPLNIKHSSKKKVSIAEYDVETICTGVGISDGELKELIRKHQNDQSA
KLFTPVQKQRLTELLEPYRERFIRWCVTLRAEKSEGNILHPDLLIRFQVIDREYVDVTIKNIDDYVSDRIAEGSKARKPG
FGTGLNWTYASGSKAKKMQFKG
>P23191 3.1.21.4~~~mboIIR~~~Type II restriction enzyme MboII~~~
MKNYVSNINLGNSSLKFIDERLQSENYRGIHLSQHNRYDLPKLIDILTLLNKHAPNQSLMQIRTTDISKRPQNIPEEQSY
AEFCNEAKSLTNIGTQDAMRKNLFVDFARMGLINRYNDKKVLTDPFKRGVTKYVALSDMGVKLIDPKLDILSKNLIFSKS
LNKLLTGFVEDVLSLLTNSDLKEISFDEFMLFVSAMNCNFNFSISTEQCESLIKEYRLLSRVQKNAVIDTLKSELIPDNF
NGDKKDKRDYHNWANENQQIWTLFENIPFFIMEKDSRKLILITSDVDLSKYSKSKMKRSQQAKNDYFKHHKVNKIKGYEL
DHIIPLLEAESVDEYRYLDNWLNLLYIDGKTHAIKSQSGSKYYIFTFDDNDYNQIYFLDTQGDKLSINNDDTALFDKNKV
PKIYEYNQNFINAKTS
>P31032 3.1.21.4~~~ngoMIVR~~~Type II restriction enzyme NgoMIV~~~
MNPLFTQERRIFHKKLLDGNILATNNRGVVSNADGSNTRSFNIAKGIADLLHSETVSERLPGQTSGNAFEAICSEFVQSA
FEKLQHIRPGDWNVKQVGSRNRLEIARYQQYAHLTALAKAAEENPELAAALGSDYTITPDIIVTRNLIADAEINRNEFLV
DENIATYASLRAGNGNMPLLHASISCKWTIRSDRAQNARSEGLNLVRNRKGRLPHIVVVTAEPTPSRISSIALGTGEIDC
VYHFALYELEQILQSLNYEDALDLFYIMVNGKRLKDISDLPLDLAV
>P43642 3.1.21.4~~~munIR~~~Type II restriction enzyme MunI~~~
MGKSELSGRLNWQALAGLKASGAEQNLYNVFNAVFEGTKYVLYEKPKHLKNLYAQVVLPDDVIKEIFNPLIDLSTTQWGV
SPDFAIENTETHKILFGEIKRQDGWVEGKDPSAGRGNAHERSCKLFTPGLLKAYRTIGGINDEEILPFWVVFEGDITRDP
KRVREITFWYDHYQDNYFMWRPNESGEKLVQHFNEKLKKYLD
>P50187 3.1.21.4~~~naeIR~~~Type II restriction enzyme NaeI~~~
MTELPLQFAEPDDDLERVRATLYSLDPDGDRTAGVLRDTLDQLYDGQRTGRWNFDQLHKTEKTHMGTLVEINLHREFQFG
DGFETDYEIAGVQVDCKFSMSQGAWMLPPESIGHICLVIWASDQQCAWTAGLVKVIPQFLGTANRDLKRRLTPEGRAQVV
KLWPDHGKLQENLLLHIPGDVRDQIFSAKSSRGNQHGQARVNELFRRVHGRLIGRAVIATVAQQDDFMKRVRGSGGARSI
LRPEGIIILGHQDNDPKVANDLGLPVPRKGQVVAARVVPADEGDQRQTAEIQGRRWAVAVPGDPIVEAPVVPRKSAE
>P50183 3.1.21.4~~~nlaIVR~~~Type II restriction enzyme NlaIV~~~
MIKLTAQQIFDKLLDEEKILSANGQIRFFLGDVDIIVKQKDVVGNIIQEWLGGWLRKREIEFDVSTNTQMPPDFFLNKKD
RSRELLEVKAFNRNASPGFDIADFKMYSDEIIHKPYMLDVDYLIFGYDMDDNGNVTIKDLWLKKVWQITRSMDGWAINLQ
VKKGVVHKIRPGVWYSINKKNMPMFECLEDFVSAIEETVYQNPATRHNASLWKRKFEEAYKKHYNRSISIPRWHEIAHKY
KKK
>P35677 3.1.21.4~~~nspVR~~~Type II restriction enzyme NspV~~~COG0827
MTILTIEALRTEAAIFSAAESIHPEPLLYGVTDGKAVGTYIEQKFRLYLKEHYEFVQGNSASGIDFPGLLVDVKVTSIRQ
PQSSCPFKSARQKIFGLGYSLLIFVYDKIDNSTNRTATLNILHTIYVSAERTADFQMTRGIRNILANEGNKDDLIAFMSD
RNLPVDEIEAGNVADEILRNPPMQGFLTISNALQWRLQYGRVIERAGQEDGILTVYRNNP
>P23657 3.1.21.4~~~pvuIIR~~~Type II restriction enzyme PvuII~~~
MSHPDLNKLLELWPHIQEYQDLALKHGINDIFQDNGGKLLQVLLITGLTVLPGREGNDAVDNAGQEYELKSINIDLTKGF
STHHHMNPVIIAKYRQVPWIFAIYRGIAIEAIYRLEPKDLEFYYDKWERKWYSDGHKDINNPKIPVKYVMEHGTKIY
>P05104 3.1.21.4~~~paeR7IR~~~Type II restriction enzyme PaeR7I~~~
MALDLVDYEQKARDAVKAFWGNREAARQKQIESGKADQGERAGVTGGKNMDGFLALVLDVIKANGLAHAEIHQNRAMLTL
PGYFRPTKLWDLLVIYKGELIAAIELKSHVGPSFSNNFNNRTEEAIGTAHDLWTAYREEAFGKQPRPFVGWLMMVEDAPE
SRRPVRDSSPHFPVFEEFKGASYLTRYDLLCQRLVQEQLYTTAAVIAAERSAVDTGNFTELSSMTSLKTFVSALAGHIAA
EAARLG
>P00640 3.1.21.4~~~pstIR~~~Type II restriction enzyme PstI~~~
MKELKLKEAKEILKALGLPPQQYNDRSGWVLLALANIKPEDSWKEAKAPLLPTVSIMEFIRTEYGKDYKPNSRETIRRQT
LHQFEQARIVDRNRDLPSRATNSKDNNYSLNQVIIDILHNYPNGNWKELIQQFLTHVPSLQELYERALARDRIPIKLLDG
TQISLSPGEHNQLHADIVHEFCPRFVGDMGKILYIGDTASSRNEGGKLMVLDSEYLKKLGVPPMSHDKLPDVVVYDEKRK
WLFLIEAVTSHGPISPKRWLELEAALSSCTVGKVYVTAFPTRTEFRKNAANIAWETEVWIADNPDHMVHFNGDRFLGPHD
KKPELS
>P21763 3.1.21.4~~~rsrIR~~~Type II restriction enzyme RsrI~~~
MAGEVEFKGKGQALRLGIQQELGGGPLSIFGAAAQKHDLSIREVTAGVLTKLAEDFPNLEFQLRTSLTKKAINEKLRSFD
PRLGQALFVESASIRPDGGITEVKDRHGNWRVILVGESKHQGNDVEKILAGVLQGKAKDQDFMAAGNAIERMHKNVLELR
NYMLDEKHFPYVVFLQGSNFATESFEVTRPDGRVVKIVHDSGMLNRIDRVTASSLSREINQNYCENIVVRAGSFDHMFQI
ASLYCKAAPWTAGEMAEAMLAVAKTSLRIIADDLDQN
>P75453 ~~~~~~Putative type II restriction enzyme and methyltransferase RM.MpnORF109P N-terminus~~~
MKFKLNFHEKINQKDCWQSLIDHKERSYSLDFVNNTEKELPLIYGYEVKDFENHGVKIGYTTCKPSDKIQSAIEERILSQ
EKEFRFLDENIEKIEEVKVIFWAIAINEKDESFKDYSLHSFIKEKNLLKESQAGGEWFIVDENKDKFEYLSQIFRQFRAP
SLFKK
>O52512 3.1.21.4~~~sfiIR~~~Type II restriction enzyme SfiI~~~
MHQDYRELSLDELESVEKQTLRTIVQALQQYSKEAKSIFETTAADSSGEVIVLAEDITQYALEVAETYPINRRFAGFIDY
KRVRWLPSPHGLLPQVLLVDAKASTEKNRDTLQRSQLPMDAEFRNTSSGEVVTMEAGVIPHLMLQSANDGVLPAVTTSIF
VHFYYRELKDVEGRYRELKSIYVLSLPHARLKQRYNPDPDTSFFGAGKHSPARGEVARIRVYFDRLKEACPWRLQELHYS
ADSEYTQPRWRDLNDAGHEVTKEFLFLER
>P29346 3.1.21.4~~~stsIR~~~Type II restriction enzyme StsI~~~
MTISINEYSDLNNLAFGLGQDVSQDLKELVKVASIFMPDSKIHKWLIDTRLEEVVTDLNLRYELKSVITNTPISVTWKQL
TGTRTKREANSLVQAVFPGQCSRLAIVDWAAKNYVSVAVAFGLLKFHRADKTFTISEIGIQAVKLYDSEELAELDKFLYE
RLLEYPYAAWLIRLLGNQPSKQFSKFDLGEHFGFIDELGFETAPIEIFLNGLAQAEIDGDKTAAQKIKSNFESTSDKYMR
WLAGVLVTAGLATSTTKKVTHTYKNRKFELTLGTVYQITAKGLTALKEVNGKSRYPRSRKRVMWEFLATKDKEAIAKKTS
RSLMLKHLTEKKNPIQAEVIATLINTDYPTLEITPEEVIDDCIGLNRIGIEILIDGDKLTLNDKLFDFEIPVQKDVVLEK
SDIEKFKNQLRTELTNIDHSYLKGIDIASKKKTSNVENTEFEAISTKIFTDELGFSGKHLGGSNKPDGLLWDDDCAIILD
SKAYSEGFPLTASHTDAMGRYLRQFTERKEEIKPTWWDIAPEHLDNTYFAYVSGSFSGNYKEQLQKFRQDTNHLGGALEF
VKLLLLANNYKTQKMSKKEVKKSILDYNISYEEYAPLLAEIE
>P16667 3.1.21.4~~~sau3AIR~~~Type II restriction enzyme Sau3AI~~~
MESYLTKQAVHNRAKEAVGKSVLELNGGESIKQSKSSVGDAFENWFGKKKDSDSKPDMAEAGVELKATPFKKLKNGKYSS
KERLVLNIINYEKVANENFETSSFLSKNNTIELAFYEYIKGTPSDNWIIKEAVLYEMHKNPIDYEIIKQDWEIINQYINE
GKAHELSEGLTSYLAPCTKGANASSLRNQPYSDIKAKQRAFSLKSGYMTSILRKYVLGDEKIDSIVKDPFEIKEKSIEDI
VFEKFQPYINWSIDKLCEHFSINKGEKGLNYRIASAILNLKGKTTKSKPFPEVEEFEKSSIVVKTVHFNKKNVNKESMSF
GAFKFEELANEEWEDSEGYPSAQWRNFLLETRFLFFVVKEDEDGVDIFKGIKFFSMPEEDINGPVKRMWDDTVKKLKEGV
TLEAVPDKSTKDGWRIKNNFVDKSDDLICHVRPHTNNRDYRGGSNADKLPKKINWINRPDSDDYSDEWMTKQSFWINNDY
IKKQVEDLL
>P23736 3.1.21.4~~~~~~Type II restriction enzyme Sau96I~~~
MTNKYLSFITDEDLFECIEFLYTEYEKALEGIDFDKFFKNRIDTFKMTFDMGINNLSEQDWLAAELQRQVEKTITNHVGT
FHEKLIGKIEGYTNYPVGYDYDVAKDDNTLFAEIKNKHNTLTGTHTKSLFQKICGYAEKYPDAICYYVRIIDTKSRNDIW
EFRSGSIDENTREKPRFSHPRVRIASGDQFYKIVTGEEDAFKQLAYNIPIALDDWIETKRAKKGSSLGLFAELYQQAKAN
NRTLSEEIISINYPKSNYISF
>P14229 3.1.21.4~~~smaIR~~~Type II restriction enzyme SmaI~~~
MSRDDQLFTLWGKLNDRQKDNFLKWMKAFDVEKTYQKTSGDIFNDDFFDIFGDRLITHHFSSTQALTKTLFEHAFNDSLN
ESGVISSLAESRTNPGHDITIDSIKVALKTEAAKNISKSYIHVSKWMELGKGEWILELLLERFLEHLENYERIFTLRYFK
ISEYKFSYQLVEIPKSLLLEAKNAKLEIMSGSKQSPKPGYGYVLDENENKKFSLYFDGGAERKLQIKHLNLEHCIVHGVW
DFILPPP
>P14386 3.1.21.4~~~taqIR~~~Type II restriction enzyme TaqI~~~
MASTQAQKALETFERFLASLDLESYQQKYRPIKTVEQDLPRELNPLPDLYEHYWKALEDNPSFLGFEEFFDHWWEKRLRP
LDEFIRKYFWGCSYAFVRLGLEARLYRTAVSIWTQFHFCYRWNASCELPLEAAPELDAQGIDALIHTSGSSTGIQIKKET
YRSEAKSENRFLRKQRGTALIEIPYTLQTPEELEEKAKRARVNGETYRLWAKVAHHLDRLENGFVIFRESYVKSIELFLQ
KNAPTLSGLIRWDRVAQEALTAP
>B9K4G4 4.2.1.77~~~~~~Trans-3-hydroxy-L-proline dehydratase~~~COG3938
MRTNKVIHVIGVHAEGEVGDVIVGGVSPPPGDTLWEQSRFIASDETLRNFVLNEPRGGVFRHVNLLVPPKDPRAQMGFII
MEPADTPPMSGSNSICVSTAILDSGIISMQEPLTHMVLEAPGGVIEVTAECANGKAERINVLNVASFVTRLAAALEVEGL
GTLTVDTAYGGDSFVIVDAIGLGFSLKPDEARELAELGMKITAAANEQLGFVHPCNADWNHISFCQMTTPITRENGILTG
KSAVAIRPGKIDRSPTGTGCSARLAVMHARGEIGIGETYIGRSIIDSEFKCHIDSLTEIGGLSAIRPVISGRAWITGVSQ
LMLDPTDPWPSGYQLSDTWPAI
>V5YXI5 4.2.1.77~~~lhpH~~~Trans-3-hydroxy-L-proline dehydratase~~~
MKITRSLSTVEVHTGGEAFRIVTSGLPRAPGDTIVQRRAWLKENADEIRRALMFEPRGHADMYGGYLTEPVSPNADFGVI
FVHNEGYSDHCGHGVIALSTAAVELGWVQRTVPETRVGIDAPCGFIEAFVKWDGEHAGPVRFVNVPSFIWQRDVSVETPS
FGTVTGDIAYGGAFYFYVDGAPFDLPVREAAVEKLIRFGAEVKAAANAKYPVVHPEIPEINHIYGTIIANAPRHPGSTQA
NCCVFADREVDRSPTGSGTGGRVAQLYQRGLLAAGDTLVNESIVGTVFKGRVLRETTVGDIPAVIPEVEGSAHICGFANW
IVDERDPLTYGFLVR
>Q73CS0 4.2.1.77~~~~~~Trans-3-hydroxy-L-proline dehydratase~~~
MKVSKVYTTIDAHVAGEPLRIITGGVPEIKGETQLERRAYCMEHLDHLREILMYEPRGHHGMYGCIITPPASAHADFGVL
FMHNEGWSTMCGHGIIAVITVGIETGMFEVTGEKQKFIIDSPAGEVIAYATYRGSEVESVSFENVPSFVYKKDVPIKIDD
YEFQVDIAFGGAFYAVVDSKEFGLKVDFNDLPAIQTWGGKIKHYIESKMEVKHPLEEGLKGIYGVIFSDEPKGKDATLRN
VTIFADGQVDRSPCGTGTSARIATLFEKGILQKGEIFIHECITEGKFEGEVLSVTAVHTYEAVVPKITGNAFITGFHQFV
VDPRDDLNRGFLLG
>Q81HB1 4.2.1.77~~~~~~Trans-3-hydroxy-L-proline dehydratase~~~
MKVSKIYTTIDAHVAGEPLRIITGGVPEIKGETQLERRAYCMEHLDYLREILMYEPRGHHGMYGCIITPPASAHADFGVL
FMHNEGWSTMCGHGIIAVITVGIETGMFEVTGEKQKFIIDSPAGEVIAYATYSGSEVESVSFENVPSFVYKKDVPIKIDS
YEFQVDIAFGGAFYAVVDSKEFGLKVDFKDLSAIQMWGGKIKHYIESKMEVKHPLEEGLKGIYGVIFSDDPKEKDATLRN
VTIFADGQVDRSPCGTGTSARIATLFAKDALQKGEIFVHECITDGKFEGEVLSVTAVHTYEAVVPKVTGNAFITGFHQFV
VDPRDDLNRGFLLG
>Q6HMS9 4.2.1.77~~~prdF~~~Trans-3-hydroxy-L-proline dehydratase~~~
MKVSKVYTTIDAHVAGEPLRIITGGVPEIKGDTQLERRAYCMEHLDHLREVLMYEPRGHHGMYGCIITPPASAHADFGVL
FMHNEGWSTMCGHGIIAVITVGIETGMFEVTGEKQNFIIDSPAGEVIAYAKYNGSEVESVSFENVPSFVYKKDVPIIIDD
YEFQVDIAFGGAFYAVVDSKEFGLKVDFKDLSAIQMWGGKIKHYIESKMEVKHPLEEGLKGIYGVIFSDEPKGEDATLRN
VTIFADGQVDRSPCGTGTSARIATLFEKDALQKGEIFVHECITDGKFEGEVLSVTAVDTYEAVVPKVTGHAFITGFHQFV
VDPRDDLKRGFLLG
>Q8G2I3 4.2.1.77~~~~~~Trans-3-hydroxy-L-proline dehydratase~~~
MRSTKVIHIVGCHAEGEVGDVIVGGVAPPPGETVWEQSRFIANDETLRNFVLNEPRGGVFRHVNLLVPPKDPRAQMGFII
MEPADTPPMSGSNSICVSTVLLDSGIIAMQEPVTHMVLEAPGGIIEVEAECRNGKAERISVRNVPSFADRLDAPLDVTGL
GTIMVDTAYGGDSFVIVDAAQIGMKIEPGQARELAEIGVKITKAANEQLGFRHPERDWRHISFCQITEPVTREGDVLTGV
NTVAIRPAKLDRSPTGTGCSARMAVLHAKGQMKAGERFIGKSVLGTEFHCRLDKVLELGGKPAISPIISGRAWVTGTSQL
MLDPSDPFPHGYRLSDTWPRDE
>A0B0B8 4.2.1.77~~~lhpH~~~Trans-3-hydroxy-L-proline dehydratase~~~
MKISRSLSTVEVHTGGEAFRIVTSGLPRLPGDTIVQRRAWLKAHADEIRRALMFEPRGHADMYGGYLTEPVSPNADFGVI
FVHNEGYSDHCGHGVIALSTAAVELGWVQRTVPETRVGIDAPCGFIEAFVQWDGEHAGPVRFVNVPSFIWRRDVSVDTPS
FGTVTGDIAYGGAFYFYVDGAPFDLPVRESAVEKLIRFGAEVKAAANATYPVVHPEIPEINHIYGTIIANAPRHAGSTQA
NCCVFADREVDRSPTGSGTGGRVAQLYQRGLLAAGDTLVNESIVGTVFKGRVLRETTVGDFPAVIPEVEGSAHICGFANW
IVDERDPLTYGFLVR
>Q0B950 4.2.1.77~~~~~~Trans-3-hydroxy-L-proline dehydratase~~~COG3938
MKISRSLSTVEVHTGGEAFRIVTSGLPRLPGDTIVQRRAWLKENADEIRRALMFEPRGHADMYGGYLTEPVSPTADFGVI
FLHNEGYSDHCGHGVIALSTAAVELGWVQRTVPETRVGIDAPCGFIEAFVQWDGEHAGPVRFVNVPSFIWRRDVSVDTPS
FGTVTGDIAYGGAFYFYVDGAPFDLPVREAAVEKLIRFGAEVKAAANAKYPVVHPEIPEINHIYGTIIANAPRHPGSTQA
NCCVFADREVDRSPTGSGTGGRVAQLYQRGVLAAGDTLVNESIVGTVFKGRVLRETMVGDIPAVIPEVEGSAHICGFANW
IVDERDPLTYGFLVR
>Q485S0 4.2.1.77~~~lhpH~~~Trans-3-hydroxy-L-proline dehydratase~~~
MTKNIAQAAVKFEQWQPKIEQESYLTINSLECHTGGEPLRIITSGFPVLKGNTILAKANDCKQNYDQLRRALMFEPRGHA
DMYGAIITDAERDDSHFGAVFIHNEGYSSMCGHAVIALTKTAVESGVVARTGDVTQVVIDVPCGQIYAMAYSHNNVVKHV
SFQCVPSFVYAKDQQVEVDGIGMVQFDIAYGGAFYAYVQASSLGLSLVPEQQEKLIAYGRKIKQAIIPQFEINHPTTAEL
SFLYGVIFIDDSPNQDVHSRNVCIFADGELDRSPTGSGVSGRIALHHAKQQIVLNETITIESILASSFSVRAIETVCFAG
FDAVIPEVTGDAYVCGKGQWFINAEDPLKYGFLLR
>A1B195 4.2.1.77~~~~~~Trans-3-hydroxy-L-proline dehydratase~~~COG3938
MRTSRVVHVVSCHAEGEVGDVIVGGVAPPPGETVWDQSRWIARDETLRNFVLNEPRGGVFRHVNLLVPPKDPRAQMGWII
MEPADTPPMSGSNAICVATVLLDTGIIPMQEPITRMVLEPPGGLIEVEAECRGGKAERIRVRNVPSFADRLDARIEVEGL
GTITVDTAYGGDSFVLVDAASVGMRIAPDQARDLAEMGVRITRAANEQLGFRHPANDWSHISFCQFTDPLSERDGVLYGR
NAVAIRPGKIDRSPTGTGCSARMAVLHARGRMKPGDRFVGRSIIDTEFHCSIADEVELNGKRAIRPIISGRAWVIGTKQL
MVDPDDPFQNGYRLSDTWPMDL
>Q9I489 5.1.1.-~~~lhpL~~~Bifunctional trans-3-hydroxy-L-proline dehydratase/2-epimerase~~~
MRSQRIVHIVSCHAEGEVGDVIVGGVAAPPGATLWEQSRWIARDQDLRNFVLNEPRGGVFRHANLLVPAKDPRAQMGWII
MEPADTPPMSGSNSLCVATVLLDSGILPMREPLTRLLLEAPGGLIEARAECRDGKAERVEIRNVPSFADRLDAWIEVEGL
GSLQVDTAYGGDSFVIADARRLGFALRADEAAELVATGLKITHAANEQLGFRHPTNPDWDHLSFCQLAAPPERRDGVLGA
NNAVVIRPGKIDRSPCGTGCSARMAVLQAKGQLRVGERFVGRSIIGSEFHCHIESLTELGGRPAILPCLSGRAWITGIHQ
YLLDPDDPWPQGYRLSDTWPGGHC
>Q92WR9 4.2.1.77~~~~~~Trans-3-hydroxy-L-proline dehydratase~~~COG3938
MRSTKTIHVISAHAEGEVGDVIVGGVAPPPGDTIWEQSRWIAREQTLRNFVLNEPRGGVFRHVNLLVPPKHPDADAAFII
MEPEDTPPMSGSNSICVSTVLLDSGILPMKEPVTEITLEAPGGLVRVRAECRDGKAERIFVENLPSFAERLDAKLEVEGL
GTLTVDTAYGGDSFVIVDAAAMGFALKPDEAHDIARLGVRITNAANAKLGFHHPENPDWRHFSFCLFAGPVERTAEGLRA
GAAVAIQPGKVDRSPTGTALSARMAVLHARGQMGLSDRLTAVSLIGSTFSGRILGTTEVGGRPAVLPEISGRAWITGTHQ
HMLDPSDPWPEGYRLTDTWGAR
>A0NXQ9 4.2.1.77~~~~~~Trans-3-hydroxy-L-proline dehydratase~~~COG3938
MRSTKTIHVISCHAEGEVGDVIVGGVAPPPGETLWEQRSFIARDQTLRNFVLNEPRGGVFRHVNLLVPPRHPEADAAFII
MEPEDTPPMSGSNSICVSTVLLDGGIVPMIEPITEMVLEAPGGLVRVKAECRNGKAERIFVQNVTSFADKLSVPLDVEGI
GTLTVDTAYGGDSFVVVDAEALGFAIVEDEAKDIARLGVRITNAANEQLGFSHPENPDWNHISFCAFCGPLSQTPTGLTG
RSAVAIQPGKVDRSPTGTAVSARMALMAARGQMTIGDTFEAVSIIGSSFTGRIVSQQMAGDRPGIVPEISGRGWITGIHQ
HMLDPSDPWPGGYKLSDTWGA
>B1KJ76 4.2.1.77~~~~~~Trans-3-hydroxy-L-proline dehydratase~~~COG3938
MKLNKLPLEVVGNRQQFITLDAHTEGEPLRIIISGYPEILGETILEMKEYVAQHLDRYRTLLMHEPRGHADMYGALITRP
VSKEADFGVLFLHNEGYSSMCGHGILALVKVMCETNTILLGCEPRVIKIDAPAGLITATASLDEEGRVQASFENVDSWAE
AINCSVMVEGLGEVNYDIGFGGAYYAYIDADALGLSCGRENVAQLIDLGRRIKHAVMDSHPLVHPLESDLSFLYGTIFIS
KEVTEKAAHSRHVCIFADGEVDRSPTGTGVAARAALLYAKGEIGLNQPLVIESIVDGKMTVSALREQDFHGKKAIIPQVS
GRSYITGQHQFIVDPDDQFQDGFILR
>D7A0Y1 4.2.1.77~~~~~~Trans-3-hydroxy-L-proline dehydratase~~~COG3938
MRSSKVIHVVGCHAEGEVGDVIVGGVAPPPGETVWAQSRFVASDNTLRNFVLQEPRGGVFRHVNLLVPPKNKEAVAAWII
MEPEDTPPMSGSNSICVSTVLLDTGIVPMVEPETHMVLEAPGGLIEATAYCKNGKAERIRVKNHPSFADKLDAKLELEGY
GTLTVDTAYGGDSFCIVDAHALGFSIKPDEAKDFADLGMKIVKAANQQLGFQHPTNKDWSHISFCQFAAPLTDDNGTPSG
ANAVAIRPGKIDRSPCGTGCSARMAVLHAKGILKVGDAFVGRSIIGSRFDCRVEAETSIGGRPAIVPSIMGRAFITHTAQ
LMVDPDDPWQTGYRLSDTWPVWKQD
>P12364 2.1.1.72~~~mod~~~Type III restriction-modification enzyme EcoP15I Mod subunit~~~
MKKETIFSEVETANSKQLAVLKANFPQCFDKNGAFIQEKLLEIIRASEVELSKESYSLNWLGKSYARLLANLPPKTLLAE
DKTHNQQEENKNSQHLLIKGDNLEVLKHMVNAYAEKVKMIYIDPPYNTGKDGFVYNDDRKFTPEQLSELAGIDLDEAKRI
LEFTTKGSSSHSAWLTFIYPRLYIARELMREDGTIFISIDHNEFSQLKLVCDEIFGEQNHVGDLVWKNATDNNPSNIAVE
HEYIIVYTKNKEQLISEWKSNISDVKNLLVNIGEEFASKYTGNELQEKYTQWFREHRSELWPLDRYKYIDKDGIYTGSQS
VHNPGKEGYRYDIIHPKTKKPCKQPLMGYRFPLDTMDRLLSEEKIIFGDDENKIIELKVYAKDYKQKLSSVIHLDGRVAT
NELKELFPEMTQPFTNAKTIKLVEDLISFACDGEGIVLDFFAGSGTTAHTVFNLNNKNKTSYQFITVQLDEPTKDKSDAM
KHGYNTIFDLTKERLIRASKKNRDQGFKVYQLMPDFRAKDESELTLSNHTFFDDVVLTPEQYDTLLTTWCLYDGSLLTTP
IEDVDLGGYKAHLCDGRLYLIAPNFTSEALKALLQKVDSDKDFAPNKVVFYGSNFESAKQMELNEALKSYANKKSIELDL
VVRN
>P40814 2.1.1.72~~~mod~~~Type III restriction-modification enzyme StyLTI Mod subunit~~~
MLKDNQKHNESVAPNSAFLSELQRALPEFFTADRYNEQGELIAKGGFDLARFERALKARNIDELTSGYQIDFIGKDYAKK
QAGEKSVTVIVPDVEHNTLAENKNSHNLFLTGDNLDVLRHLQNNYADTVDMIYIDPPYNTGSDGFVYPDHFEYSDRALQD
MFGLNDTELARLKSIQGKSTHSAWLSFMYPRLFLARKLLKDTGFIFISIDDNEYANLKLMMDEIFGEGGFVTNVMWKRKK
EISNDSDNVSIQGEYILVYAKTGQGALRLEPLSKEYIQKSYKEPTEQFPEGKWRPVPLTVSKGLSGGGYTYKITTPNGTV
HERLWAYPEASYQKLVADNLVYFGKDNGGIPQRVMYAHHSKGQPTTNYWDNVASNKEGKKEILDLFGDNVFDTPKPTALL
KKIIKLAIDKDGVVLDFFAGSGTTAHAVMALNEEDGGQRTFILCTIDQALSNNTIAKKAGYNTIDEISRERITRVAAKIR
ANNPATNSDLGFKHYRFATPTQQTLDDLDSFDIATGHFINTSGQLAAFTESGFTDMINPFSARGLGVPGGASGEETLLTT
WLVADGYKMDIDVQTVDFSGYCARYVDNTRLYLIDERWGTEQTRDLLNHIGTHQLPVQTIVIYGYSFDLESIRELEIGLK
QLDQKVNLVKRY
>Q5ZND2 3.1.21.5~~~res~~~Type III restriction-modification enzyme EcoP15I Res subunit~~~
MSKGFTLEKNLPHQKAGVDAVMNVFVSATPHLTDNVAVRLLANPELKLSEQQYYNNIKNVQAFNGIAHSKDNHNAKSNII
DVSMETGTGKTYTYIKTIFDLNKSFGINKFIIIVPTLSIKAGTVNFLKSDALKEHFRDDYKRELRTYVVESQKNAGKNTK
SYMPQAIHDFVEASNFNKKYIHVLVINSGMINSKSLTDTYDTGLLDNQFNTPVDALRAVKPFIIIDEPHRFPTGKKTWEN
IEKFNAQYIIRYGATFSEGYKNLVYRLTAVDAFNDDLVKGIDAYIEDIVGDGNANLKFVKSDGKEATFELNENNNKKSFK
LAKGESLSKTHSAIHDLTLDALNKSTAVLSNGIELKIGSSINPYSYDQTLADNMMRKAVKEHFKLEKELLTQRPRIKPLT
LFFIDDIEGYRDGNDISGSLKTKFEEYVLAEANELLKTEQDAFYKNYLEKTVTNISSVHGGYFSKDNSDKDDKIEQEINE
ILHDKELLLSLDNPRRFIFSKWTLREGWDNPNVFQICKLRSSGSTTSKLQEVGRGLRLPVNEYMCRVKDRNFTLKYYVDF
TEKDFVDSLVKEVNESSFKERVPSKFTQELKEQIMAQYPELSSRALMNELFNDEIIDDNDNFKDSDAYSRLKSKYPAAFP
IGVKPGKIKKATDGKRRTKMRVGKFSELKELWDLINQKAVIEYKINSESEFLSIFKSFMLEETERFTKSGVHTRIDKIYI
HNDMAMSKSIVSDDDDFAKLNTMSYREFLDNLSQTIFVKHGTLHKVFCDIKDTINITEYLNIQTIRKIKSGFSKYLLNNS
FNKFSLGYNLISGSIHPTKFTNADGNPLGEVLSSDLGVLQDNAKAPLDTYLFEEVFYDSELERRNITDREIQSVVVFSKI
PKNSIKIPVAGGYTYSPDFAYVVKTAEGDYLNFIIETKNVDSKDSLRLEEKRKIEHAQALFNQISQSVKVEFRTQFANDD
IYQLIKSALP
>Q07605 3.1.21.4~~~bcgIA~~~Type II restriction enzyme and methyltransferase RM.BcgI~~~
MVNEKTSTDQLVRRFIEDLGVTYEEQGSSNIQIKQALRSASKSKSQSGVGKPEFIFFSGNHLIIVEDKLAIDKLEYKNND
GIIDTEFPFRRDYAVNGAVHYARHIIEKTNSYKEVIAIGIAGDGLHYQIHPYFVSETELKKLPEMKSLEDISPENIEEFY
KVAVLGELPKEERELREVNKIAADMHEDLRNYGQLEGEKKATVVSAILLALEEPTFSLNQLIGSDRVGGTDGEIIFNAVR
VYLENAGIVPYAKVGEMLDQFIFIQRDVTLNTVNSNLEMTPLKYFATTLEAEIMDKIKSNTDFDILGNFYGEFVKYGGND
GNPLGIVLTPRHITSLMAELIGINKSDFVLDPACGTGAFLISAMNRMLGQAENDDERRDIKQNRLYGIEIQQKLFTIATT
NMILRGDGKSNLIRDNCLTFDNTIMNGYGINKILMNPPYSQAKNDQTQHLSELSFIQQALEMLVVGGKLCAIVPQSTMVG
KNRHDKARKKQILKQHTLETVITLNKDTFHGVGVNPCIVIFKAGIKHPENKRVSFVNFEDDGHVVRKHVGLVGDGTEKGK
REHLLAVLAGDEDDGTDLIVKTAIKDTDEWLHSFYYFNDGIPSEDDFYKTVANYLTFQFDMTANGKGYLFEGVEENE
>Q07606 3.1.21.4~~~bcgIB~~~Type II restriction enzyme BgcI specificity subunit S.BcgI~~~
MNNLIKYSTFLISDLFDVVIGKTIDGNKAQRNENGTPYITRKATRNGFEFMIDGEKEKLYSGKLPVITIGNETSKPFVQE
FHFFTGTKVNICIPKLDLNRNHLLYITTMIENATKMFSYSYTINSTRLKSLKILLPIKGEEPDWDYMNTYISKILSNMEK
NFDVQQNDGVSDLRSLKDLSWSQFKMDEIFSINSGVRLTKADMKPGNIPFIGATDSNNGITEFTSSTNASFDGNVLGVNY
NGSVVENFYHPYKAVFSDDVKRLKLKNYPNNKHVLLFMKVVILQQKVKYAYGYKFNATRMKEQIILLPTKADGTPDYEFM
EQYMMRMENKVVGRTTEKEAD
>P29722 ~~~~~~17 kDa lipoprotein~~~COG3015
MKGSVRALCAFLGVGALGSALCVSCTTVCPHAGKAKAEKVECALKGGIFRGTLPAADCPGIDTTVTFNADGTAQKVELAL
EKKSAPSPLTYRGTWMVREDGIVELSLVSSEQSKAPHEKELYELIDSNSVRYMGAPGAGKPSKEMAPFYVLKKTKK
>P19478 ~~~tpd~~~34 kDa membrane antigen~~~COG3470
MKRVSLLGSAAIFALVFSACGGGGEHQHGEEMMAAVPAPDAEGAAGFDEFPIGEDRDVGPLHVGGVYFQPVEMHPAPGAQ
PSKEEADCHIEADIHANEAGKDLGYGVGDFVPYLRVVAFLQKHGSEKVQKVMFAPMNAGDGPHYGANVKFEEGLGTYKVR
FEIAAPSHDEYSLHIDEQTGVSGRFWSEPLVAEWDDFEWKGPQW
>P29723 3.4.-.-~~~~~~Putative DD-carboxypeptidase TP_0574~~~
MKVKYALLSAGALQLLVVGCGSSHHETHYGYATLSYADYWAGELGQSRDVLLAGNAEADRAGDLDAGMFDAVSRATHGHG
AFRQQFQYAVEVLGEKVLSKQETEDSRGRKKWEYETDPSVTKMVRASASFQDLGEDGEIKFEAVEGAVALADRASSFMVD
SEEYKITNVKVHGMKFVPVAVPHELKGIAKEKFHFVEDSRVTENTNGLKTMLTEDSFSARKVSSMESPHDLVVDTVGTGY
HSRFGSDAEASVMLKRADGSELSHREFIDYVMNFNTVRYDYYGDDASYTNLMASYGTKHSADSWWKTGRVPRISCGINYG
FDRFKGSGPGYYRLTLIANGYRDVVADVRFLPKYEGNIDIGLKGKVLTIGGADAETLMDAAVDVFADGQPKLVSDQAVSL
GQNVLSADFTPGTEYTVEVRFKEFGSVRAKVVAQ
>P0AF96 ~~~tabA~~~Toxin-antitoxin biofilm protein TabA~~~COG2731
MIIGNIHNLQPWLPQELRQAIEHIKAHVTAETPKGKHDIEGNRLFYLISEDMTEPYEARRAEYHARYLDIQIVLKGQEGM
TFSTQPAGAPDTDWLADKDIAFLPEGVDEKTVILNEGDFVVFYPGEVHKPLCAVGAPAQVRKAVVKMLMA
>Q0TUS0 ~~~pfo~~~Perfringolysin O~~~
MIRFKKTKLIASIAMALCLFSQPVISFSKDITDKNQSIDSGISSLSYNRNEVLASNGDKIESFVPKEGKKAGNKFIVVER
QKRSLTTSPVDISIIDSVNDRTYPGALQLADKAFVENRPTILMVKRKPININIDLPGLKGENSIKVDDPTYGKVSGAIDE
LVSKWNEKYSSTHTLPARTQYSESMVYSKSQISSALNVNAKVLENSLGVDFNAVANNEKKVMILAYKQIFYTVSADLPKN
PSDLFDDSVTFNDLKQKGVSNEAPPLMVSNVAYGRTIYVKLETTSSSKDVQAAFKALIKNTDIKNSQQYKDIYENSSFTA
VVLGGDAQEHNKVVTKDFDEIRKVIKDNATFSTKNPAYPISYTSVFLKDNSVAAVHNKTDYIETTSTEYSKGKINLDHSG
AYVAQFEVAWDEVSYDKEGNEVLTHKTWDGNYQDKTAHYSTVIPLEANARNIRIKARECTGLAWEWWRDVISEYDVPLTN
NINVSIWGTTLYPGSSITYN
>P0C2E9 ~~~pfo~~~Perfringolysin O~~~
MIRFKKTKLIASIAMALCLFSQPVISFSKDITDKNQSIDSGISSLSYNRNEVLASNGDKIESFVPKEGKKTGNKFIVVER
QKRSLTTSPVDISIIDSVNDRTYPGALQLADKAFVENRPTILMVKRKPININIDLPGLKGENSIKVDDPTYGKVSGAIDE
LVSKWNEKYSSTHTLPARTQYSESMVYSKSQISSALNVNAKVLENSLGVDFNAVANNEKKVMILAYKQIFYTVSADLPKN
PSDLFDDSVTFNDLKQKGVSNEAPPLMVSNVAYGRTIYVKLETTSSSKDVQAAFKALIKNTDIKNSQQYKDIYENSSFTA
VVLGGDAQEHNKVVTKDFDEIRKVIKDNATFSTKNPAYPISYTSVFLKDNSVAAVHNKTDYIETTSTEYSKGKINLDHSG
AYVAQFEVAWDEVSYDKEGNEVLTHKTWDGNYQDKTAHYSTVIPLEANARNIRIKARECTGLAWEWWRDVISEYDVPLTN
NINVSIWGTTLYPGSSITYN
>Q893D9 ~~~~~~Tetanolysin~~~
MNKNVLKFVSRSLLIFSMTGLISNYNSSNVLAKGNVEEHSLINNGQVVTSNTKCNLAKDNSSDIDKNIYGLSYDPRKILS
YNGEQVENFVPAEGFENPDKFIVVKREKKSISDSTADISIIDSINDRTYPGAIQLANRNLMENKPDIISCERKPITISVD
LPGMAEDGKKVVNSPTYSSVNSAINSILDTWNSKYSSKYTIPTRMSYSDTMVYSQSQLSAAVGCNFKALNKALNIDFDSI
FKGEKKVMLLAYKQIFYTVSVDPPNRPSDLFGDSVTFDELALKGINNNNPPAYVSNVAYGRTIYVKLETTSKSSHVKAAF
KALINNQDISSNAEYKDILNQSSFTATVLGGGAQEHNKIITKDFDEIRNIIKNNSVYSPQNPGYPISYTTTFLKDNSIAS
VNNKTEYIETTATEYTNGKIVLDHSGAYVAQFQVTWDEVSYDEKGNEIVEHKAWEGNNRDRTAHFNTEIYLKGNARNISV
KIRECTGLAWEWWRTIVDVKNIPLAKERTFYIWGTTLYPKTSIETKM
>P13128 ~~~hly~~~Listeriolysin O~~~
MKKIMLVFITLILVSLPIAQQTEAKDASAFNKENSISSMAPPASPPASPKTPIEKKHADEIDKYIQGLDYNKNNVLVYHG
DAVTNVPPRKGYKDGNEYIVVEKKKKSINQNNADIQVVNAISSLTYPGALVKANSELVENQPDVLPVKRDSLTLSIDLPG
MTNQDNKIVVKNATKSNVNNAVNTLVERWNEKYAQAYPNVSAKIDYDDEMAYSESQLIAKFGTAFKAVNNSLNVNFGAIS
EGKMQEEVISFKQIYYNVNVNEPTRPSRFFGKAVTKEQLQALGVNAENPPAYISSVAYGRQVYLKLSTNSHSTKVKAAFD
AAVSGKSVSGDVELTNIIKNSSFKAVIYGGSAKDEVQIIDGNLGDLRDILKKGATFNRETPGVPIAYTTNFLKDNELAVI
KNNSEYIETTSKAYTDGKINIDHSGGYVAQFNISWDEVNYDPEGNEIVQHKNWSENNKSKLAHFTSSIYLPGNARNINVY
AKECTGLAWEWWRTVIDDRNLPLVKNRNISIWGTTLYPKYSNKVDNPIE
>P31830 ~~~lso~~~Seeligeriolysin~~~
MKIFGLVIMSLLFVSLPITQQPEARDVPAYDRSEVTISPAETPESPPATPKTPVEKKHAEEINKYIWGLNYDKNSILVYQ
GEAVTNVPPKKGYKDGSEYIVVEKKKKGINQNNADISVINAISSLTYPGALVKANRELVENQPNVLPVKRDSLTLSVDLP
GMTKKDNKIFVKNPTKSNVNNAVNTLVERWNDKYSKAYPNINAKIDYSDEMAYSESQLIAKFGTAFKAVNNSLNVNFEAI
SDGKVQEEVISFKQIYYNINVNEPTSPSKFFGGSVTKEQLDALGVNAENPPAYISSVAYGRQVYVKLSSSSHSNKVKTAF
EAAMSGKSVKGDVELTNIIKNSSFKAVIYGGSAKEEVEIIDGNLGELRDILKKGSTYDRENPGVPISYTTNFLKDNDLAV
VKNNSEYIETTSKSYTDGKINIDHSGGYVAQFNISWDEVSYDENGNEIKVHKKWGENYKSKLAHFTSSIYLPGNARNINI
YARECTGLFWEWWRTVIDDRNLPLVKNRNVSIWGTTLYPRHSNNVDNPIQ
>P23564 ~~~alv~~~Alveolysin~~~
MKKKSNHLKGRKVLVSLLVSLQVFAFASISSAAPTEPNDIDMGIAGLNYNRNEVLAIQGDQISSFVPKEGIQSNGKFIVV
ERDKKSLTTSPVDISIVDSITNRTYPGAIQLANKDFADNQPSLVMAARKPLDISIDLPGLKNENTISVQNPNYGTVSSAI
DQLVSTWGEKYSSTHTLPARLQYAESMVYSQNQISSALNVNAKVLNGTLGIDFNAVANGEKKVMVAAYKQIFYTVSAGLP
NNPSDLFDDSVTFAELARKGVSNEAPPLMVSNVAYGRTIYVKLETTSKSNDVQTAFKLLLNNPSIQASGQYKDIYENSSF
TAVVLGGDAQTHNQVVTKDFNVIQSVIKDNAQFSSKNPAYPISYTSVFLKDNSIAAVHNNTEYIETKTTEYSKGKIKLDH
SGAYVAQFEVYWDEFSYDADGQEIVTRKSWDGNWRDRSAHFSTEIPLPPNAKNIRIFARECTGLAWEWWRTVVDEYNVPL
ASDINVSIWGTTLYPKSSITH
>P0C0I3 ~~~slo~~~Streptolysin O~~~
MSNKKTFKKYSRVAGLLTAALIIGNLVTANAESNKQNTASTETTTTNEQPKPESSELTTEKAGQKTDDMLNSNDMIKLAP
KEMPLESAEKEEKKSEDKKKSEEDHTEEINDKIYSLNYNELEVLAKNGETIENFVPKEGVKKADKFIVIERKKKNINTTP
VDISIIDSVTDRTYPAALQLANKGFTENKPDAVVTKRNPQKIHIDLPGMGDKATVEVNDPTYANVSTAIDNLVNQWHDNY
SGGNTLPARTQYTESMVYSKSQIEAALNVNSKILDGTLGIDFKSISKGEKKVMIAAYKQIFYTVSANLPNNPADVFDKSV
TFKELQRKGVSNEAPPLFVSNVAYGRTVFVKLETSSKSNDVEAAFSAALKGTDVKTNGKYSDILENSSFTAVVLGGDAAE
HNKVVTKDFDVIRNVIKDNATFSRKNPAYPISYTSVFLKNNKIAGVNNRTEYVETTSTEYTSGKINLSHQGAYVAQYEIL
WDEINYDDKGKEVITKRRWDNNWYSKTSPFSTVIPLGANSRNIRIMARECTGLAWEWWRKVIDERDVKLSKEINVNISGS
TLSPYGSITYK
>Q04IN8 ~~~ply~~~Pneumolysin~~~
MANKAVNDFILAMNYDKKKLLTHQGESIENRFIKEGNQLPDEFVVIERKKRSLSTNTSDISVTATNDSRLYPGALLVVDE
TLLENNPTLLAVDRAPMTYSIDLPGLASSDSFLQVEDPSNSSVRGAVNDLLAKWHQDYGQVNNVPARMQYEKITAHSMEQ
LKVKFGSDFEKTGNSLDIDFNSVHSGEKQIQIVNFKQIYYTVSVDAVKNPGDVFQDTVTVEDLKQRGISAERPLVYISSV
AYGRQVYLKLETTSKSDEVEAAFEALIKGVKVAPQTEWKQILDNTEVKAVILGGDPSSGARVVTGKVDMVEDLIQEGSRF
TADHPGLPISYTTSFLRDNVVATFQNSTDYVETKVTAYRNGDLLLDHSGAYVAQYYITWDELSYDHQGKEVLTPKAWDRN
GQDLTAHFTTSIPLKGNVRNLSVKIRECTGLAWEWWRTVYEKTDLPLVRKRTISIWGTTLYPQVEDKVEND
>P0DF96 ~~~slo~~~Streptolysin O~~~
MSNKKTFKKYSRVAGLLTAALIIGNLVTANAESNKQNTASTETTTTNEQPKPESSELTTEKAGQKTDDMLNSNDMIKLAP
KEMPLESAEKEEKKSEDKKKSEEDHTEEINDKIYSLNYNELEVLAKNGETIENFVPKEGVKKADKFIVIERKKKNINTTP
VDISIIDSVTDRTYPAALQLANKGFTENKPDAVVTKRNPQKIHIDLPGMGDKATVEVNDPTYANVSTAIDNLVNQWHDNY
SGGNTLPARTQYTESMVYSKSQIEAALNVNSKILDGTLGIDFKSISKGEKKVMIAAYKQIFYTVSANLPNNPADVFDKSV
TFKELQRKGVSNEAPPLFVSNVAYGRTVFVKLETSSKSNDVEAAFSAALKGTDVKTNGKYSDILENSSFTAVVLGGDAAE
HNKVVTKDFDVIRNVIKDNATFSRKNPAYPISYTSVFLKNNKIAGVNNRTEYVETTSTEYTSGKINLSHQGAYVAQYEIL
WDEINYDDKGKEVITKRRWDNNWYSKTSPFSTVIPLGANSRNIRIMARECTGLAWEWWRKVIDERDVKLSKEINVNISGS
TLSPYGSITYK
>P0C2J9 ~~~ply~~~Pneumolysin~~~
MANKAVNDFILAMNYDKKKLLTHQGESIENRFIKEGNQLPDEFVVIERKKRSLSTNTSDISVTATNDSRLYPGALLVVDE
TLLENNPTLLAVDRAPMTYSIDLPGLASSDSFLQVEDPSNSSVRGAVNDLLAKWHQDYGQVNNVPARMQYEKITAHSMEQ
LKVKFGSDFEKTGNSLDIDFNSVHSGEKQIQIVNFKQIYYTVSVDAVKNPGDVFQDTVTVEDLKQRGISAERPLVYISSV
AYGRQVYLKLETTSKSDEVEAAFEALIKGVKVAPQTEWKQILDNTEVKAVILGGDPSSGARVVTGKVDMVEDLIQEGSRF
TADHPGLPISYTTSFLRDNVVATFQNSTDYVETKVTAYRNGDLLLDHSGAYVAQYYITWNELSYDHQGKEVLTPKAWDRN
GQDLTAHFTTSIPLKGNVRNLSVKIRECTGLAWEWWRTVYEKTDLPLVRKRTISIWGTTLYPQVEDKVEND
>P0DF97 ~~~slo~~~Streptolysin O~~~
MSNKKTFKKYSRVAGLLTAALIIGNLVTANAESNKQNTASTETTTTNEQPKPESSELTTEKAGQKTDDMLNSNDMIKLAP
KEMPLESAEKEEKKSEDKKKSEEDHTEEINDKIYSLNYNELEVLAKNGETIENFVPKEGVKKADKFIVIERKKKNINTTP
VDISIIDSVTDRTYPAALQLANKGFTENKPDAVVTKRNPQKIHIDLPGMGDKATVEVNDPTYANVSTAIDNLVNQWHDNY
SGGNTLPARTQYTESMVYSKSQIEAALNVNSKILDGTLGIDFKSISKGEKKVMIAAYKQIFYTVSANLPNNPADVFDKSV
TFKELQRKGVSNEAPPLFVSNVAYGRTVFVKLETSSKSNDVEAAFSAALKGTDVKTNGKYSDILENSSFTAVVLGGDAAE
HNKVVTKDFDVIRNVIKDNATFSRKNPAYPISYTSVFLKNNKIAGVNNRTEYVETTSTEYTSGKINLSHQGAYVAQYEIL
WDEINYDDKGKEVITKRRWDNNWYSKTSPFSTVIPLGANSRNIRIMARECTGLAWEWWRKVIDERDVKLSKEINVNISGS
TLSPYGSITYK
>P0DW58 ~~~tad1~~~Thoeris anti-defense 1~~~
MKELSTIQKREKLNTVERIGSEGPGGAYHEYVIKSNSMDSQGNYDVYETIKFQKGARKEEKSQHGVIDSDLLEIVRDRLK
SFQAGPFSSRENACALTHVEEALMWMNRRVEDRIERNVLGTNTK
>P0DW61 ~~~tad1~~~Thoeris anti-defense 1~~~
MEIKNGLCTQKYTKVYAEDKEKWKFNAPHHFIVGKADCEDEYIEPIEYVNFQEGPIKEYGINGVNNEDLILMVITRLQAF
QDSPYKCRENAMAITKLQECLMWLGKRTLDREVKGIEGTSEI
>A9CK16 3.5.4.33~~~tadA~~~tRNA-specific adenosine deaminase~~~COG0590
MAERTHFMELALVEARSAGERDEVPIGAVLVLDGRVIARSGNRTRELNDVTAHAEIAVIRMACEALGQERLPGADLYVTL
EPCTMCAAAISFARIRRLYYGAQDPKGGAVESGVRFFSQPTCHHAPDVYSGLAESESAEILRQFFREKRLDD
>O67050 3.5.4.33~~~tadA~~~tRNA-specific adenosine deaminase~~~COG0590
MGKEYFLKVALREAKRAFEKGEVPVGAIIVKEGEIISKAHNSVEELKDPTAHAEMLAIKEACRRLNTKYLEGCELYVTLE
PCIMCSYALVLSRIEKVIFSALDKKHGGVVSVFNILDEPTLNHRVKWEYYPLEEASELLSEFFKKLRNNII
>P21335 3.5.4.33~~~tadA~~~tRNA-specific adenosine deaminase~~~COG0590
MTQDELYMKEAIKEAKKAEEKGEVPIGAVLVINGEIIARAHNLRETEQRSIAHAEMLVIDEACKALGTWRLEGATLYVTL
EPCPMCAGAVVLSRVEKVVFGAFDPKGGCSGTLMNLLQEERFNHQAEVVSGVLEEECGGMLSAFFRELRKKKKAARKNLS
E
>P68398 3.5.4.33~~~tadA~~~tRNA-specific adenosine deaminase~~~COG0590
MSEVEFSHEYWMRHALTLAKRAWDEREVPVGAVLVHNNRVIGEGWNRPIGRHDPTAHAEIMALRQGGLVMQNYRLIDATL
YVTLEPCVMCAGAMIHSRIGRVVFGARDAKTGAAGSLMDVLHHPGMNHRVEITEGILADECAALLSDFFRMRRQEIKAQK
KAQSSTD
>Q99W51 3.5.4.33~~~tadA~~~tRNA-specific adenosine deaminase~~~
MTNDIYFMTLAIEEAKKAAQLGEVPIGAIITKDDEVIARAHNLRETLQQPTAHAEHIAIERAAKVLGSWRLEGCTLYVTL
EPCVMCAGTIVMSRIPRVVYGADDPKGGCSGSLMNLLQQSNFNHRAIVDKGVLKEACSTLLTTFFKNLRANKKSTN
>Q5XE14 3.5.4.33~~~tadA~~~tRNA-specific adenosine deaminase~~~
MPYSLEEQTYFMQEALKEAEKSLQKAEIPIGCVIVKDGEIIGRGHNAREESNQAIMHAEMMAINEANAHEGNWRLLDTTL
FVTIEPCVMCSGAIGLARIPHVIYGASNQKFGGADSLYQILTDERLNHRVQVERGLLAADCANIMQTFFRQGRERKKIAK
HLIKEQSDPFD
>P9WGV5 1.3.1.-~~~~~~Trans-acting enoyl reductase~~~COG3268
MSPAEREFDIVLYGATGFSGKLTAEHLAHSGSTARIALAGRSSERLRGVRMMLGPNAADWPLILADASQPLTLEAMAARA
QVVLTTVGPYTRYGLPLVAACAKAGTDYADLTGELMFCRNSIDLYHKQAADTGARIILACGFDSIPSDLNVYQLYRRSVE
DGTGELCDTDLVLRSFSQRWVSGGSVATYSEAMRTASSDPEARRLVTDPYTLTTDRGAEPELGAQPDFLRRPGRDLAPEL
AGFWTGGFVQAPFNTRIVRRSNALQEWAYGRRFRYSETMSLGKSMAAPILAAAVTGTVAGTIGLGNKYFDRLPRRLVERV
TPKPGTGPSRKTQERGHYTFETYTTTTTGARYRATFAHNVDAYKSTAVLLAQSGLALALDRDRLAELRGVLTPAAAMGDA
LLARLPGAGVVMGTTRLS
>Q8ZL58 4.2.1.156~~~~~~L-talarate/galactarate dehydratase~~~
MALSANSDAVTYAKAANTRTAAETGDRIEWVKLSLAFLPLATPVSDAKVLTGRQKPLTEVAIIIAEIRSRDGFEGVGFSY
SKRAGGQGIYAHAKEIADNLLGEDPNDIDKIYTKLLWAGASVGRSGMAVQAISPIDIALWDMKAKRAGLPLAKLLGAHRD
SVQCYNTSGGFLHTPLDQVLKNVVISRENGIGGIKLKVGQPNCAEDIRRLTAVREALGDEFPLMVDANQQWDRETAIRMG
RKMEQFNLIWIEEPLDAYDIEGHAQLAAALDTPIATGEMLTSFREHEQLILGNASDFVQPDAPRVGGISPFLKIMDLAAK
HGRKLAPHFAMEVHLHLSAAYPLEPWLEHFEWLNPLFNEQLELRDGRMWISDRHGLGFTLSEQARRWTQLTCEFGKRP
>P27620 2.4.1.187~~~tagA~~~N-acetylglucosaminyldiphosphoundecaprenol N-acetyl-beta-D-mannosaminyltransferase~~~COG1922
MQTETIHNIPYVNSNLTSFIDYLEKHYIDQKIGAVISTVNPEIAFAAIKDRDYFDVLSSSNFILPDGIGVVMMSRLTNNR
LQSRIAGYDVFKELLGVANKKKKRIFLYGAKKDVIKGVVSKISSEYPNIKIAGYSDGYVQDRTLVAKQIARANPDMVFVA
LGYPHQEKFIHNYRHLFPKAVSIGLGGSFDVFSGNVKRAPSWMIRLNLEWFYRLILNPWRWKRMLSIPKYALTVLKEEKN
KKTFYPKPEKDHTKQI
>P27621 2.7.8.44~~~tagB~~~Teichoic acid glycerol-phosphate primase~~~COG1887
MKIRSLLANCYLYFLSAIAFFLQWVKPESKVTLLISFEANAKAILEEYEQGQYSYKLNILYTQQASAIAESFPNVDAYLL
QEKNPIHLIKAVYLMFNSKVIITDNYFLLTSVLNKRKQTKCIQVWHANGSLKKFGLEDITNMQRTKTDIKRFQKVYSSYD
YLTVGSEEMANIFKKSFGIKDNQLLKIGVPLTDPYYRENKKKISDTLNIQRKKIILYAPTFRDYNMQSIQLPFTEEQLIH
QLKEEYVLFVKLHPAIQNNIDIKYSSDYIKDVSNYALFDLLMAADILITDYSSVPFEFSILNKPILFYTYDLKLYQQKRG
LVDNYLSIIPGRACYDSESLINEIQTPFNYSKIKVFSDRWNKYSDGNSSQNLLNFIENLIS
>P27623 2.7.7.39~~~tagD~~~Glycerol-3-phosphate cytidylyltransferase~~~COG0615
MKKVITYGTFDLLHWGHIKLLERAKQLGDYLVVAISTDEFNLQKQKKAYHSYEHRKLILETIRYVDEVIPEKNWEQKKQD
IIDHNIDVFVMGDDWEGKFDFLKDQCEVVYLPRTEGISTTKIKEEIAGL
>P13484 2.4.1.52~~~tagE~~~Poly(glycerol-phosphate) alpha-glucosyltransferase~~~COG0438
MSLHAVSESNIKQIPDMDYYFISGGLPSNYGGLTKSLLLRSKLFGEECNQNTFFLTFRFDLELSSKIDELYSNGKIDKKF
TSVINLFDDFLSVRTNGKRSYEERIGLDQIKKQVGMGKFAKTLLRLFGKKNNEMSVVYYGDGETIRYVDYWNDKNQLIKR
EEYTKNGNLVLVTHYDVQLNKMYLQEYINDQNQVYLDKLYVWNNEEKDVQLSHIIWYSLEGEIKVKDESELRQYWIEYLQ
KQNDKPKLFLVDSRPQDKHVFKVKKSPSSYYGAIIHNKHYGSNKYQIKGRYKEVFSQMYNLDAVFFITEEQLEDFKLISG
EQETFFFTPHTIDKPLDPAVLNVPSEKYKAVIISRLASMKNLIHAVKAFSLVVKEIPEAKLDIFGSGEDFEKIKKEIEDT
KLQNNVFLKGYTDNPDSEFQKAWLTISTSHFEGFGLSNMEALSNGCPVVTYDYDYGARSLVTDGANGYVIEQYNIEKLGQ
AIISLMKDESTHQKFSEQAFKMAEKYSRPNYIENWAFALNQMIEVRIEREKFSKKVGKKDPSISSYTEDFDKTKIEIDIE
NFDHNDIKKIRLVGLDRKNKAEIISTNLQNDQLFVIDLEKDVNIEKIAANKTQVIDFYIVFNANGHIKTMRRLSSEETKL
SGNSIDTNNGYRVEPYTTVKGNFSWRVTEIKES
>P13485 2.7.8.12~~~tagF~~~Teichoic acid poly(glycerol phosphate) polymerase~~~COG1887
MSLVVDTNKRKQKGKSFYTEEQKKVMIENTVIKCILKSLKNNLGSLELLISIDSEHQFLEDYQLFLKLKERRSGTESEFP
LQNTGSLEYKTEINAHVLPMPVEMGQTYDFYVEFRKKYEDAEQEPLLKRLSAEVNSIERAFHVDQTTELLILPYTTDKGN
FSIKVKREAKIIRFDQIEISSEEISITGYAGYLSSENQYRIKNLNLILKKGGETPIEEKFPIKLERKTHGLENMRADGFV
PELYDFEVKVPLKEIPFSNEKRYVYRLFMEYICNDDEGTDIQFNSTALVLGDRKNKLKGLVSIIKTNNAPVRYEVFKKKK
KQTLGIRVNDYSLKTRMKYFIKGKKKRLVSKIKKITKMRNKLITKTYKSLFMMASRMPVKRKTVIFESFNGKQYSCNPRA
IYEYMRENHPEYKMYWSVNKQYSAPFDEKGIPYINRLSLKWLFAMARAEYWVVNSRLPLWIPKPSHTTYLQTWHGTPLKR
LAMDMEEVHMPGTNTKKYKRNFIKEASNWDYLISPNGYSTEIFTRAFQFNKTMIESGYPRNDFLHNDNNEETISLIKSRL
NIPRDKKVILYAPTWRDDQFYAKGRYKFDLDLDLHQLRQELGNEYIVILRMHYLVAENFDLGPFEGFAYDFSAYEDIREL
YMVSDLLITDYSSVFFDFANLKRPMLFFVPDIETYRDKLRGFYFDFEKEAPGPLVKTTEETIEAIKQISSPDYKLPVSFG
PFYDKFCYLESGRSSEKVVNTVFKAE
>Q5HLM5 2.7.8.12~~~tagF~~~Teichoic acid poly(glycerol phosphate) polymerase~~~COG0463
MNKLTIIVTYYNAEEYITGCLESIKQQRTQDFNLIIVNDGSTDQSKKLMDEAIKDYDKNIRFIDLDENSGHAHARNIALE
EVETPYFMFLDADDELASYAITFYLEKFNNTDGLIAPIHSFTTQRPQFVDLDRVRVEYFNAKENINSFLRKQSACNIIFR
TAIVRAHHIRFNENLNTYVDWSFVLEYMKYVNKFVRIFNFPFYFRGEVYDPFETLTLSEQNFDILFKDYVNSFYDAIKRA
TNPKVREFIVTKMGNKIANEFEPTRYDINERYQTHKDTLVELSKFLHVHLVKNQKLINKIETILLMNNETDKAFKVNQFR
KTLRHVKNIVLRRKNKERSLYDLTDKEDNVKPKTIVFESFGGKNYSDSPKYIYEYMQKYYPNYRYIWSFKNPDKNVVPGS
AEKVKRNSAEYYQAYSEASHWVSNARTPLYLNKKENQTYIQTWHGTPLKRLANDMKVVRMPGTTTPKYKRNFNRETSRWD
YLISPNRYSTEIFRSAFWMDEERILEIGYPRNDVLVNRANDQEYLDEIRTHLNLPSDKKVIMYAPTWRDDEFVSKGKYLF
ELKIDLDNLYKELGDDYVILLRMHYLISNALDLSGYENFAIDVSNYNDVSELFLISDCLITDYSSVMFDYGILKRPQFFF
AYDIDKYDKGLRGFYMNYMEDLPGPIYTEPYGLAKELKNLDKVQQQYQEKIDAFYDRFCSVDNGKASQYIGDLIHKDIKE
Q
>P42953 ~~~tagG~~~Teichoic acid translocation permease protein TagG~~~COG1682
MNDLLRILREQITSFPLILRLAAYETKSKYQMNYLGVLWQFLNPLIQMLAYWFVFGMGIRKGGPVTTGAGEVPFIIWMLA
GLIPWFFISPTILDGSNSVFKRINMVAKMNFPISSLPSVAIASNLFSYMIMMVIYIIVLLVNGVFPSVHWLQYIYYFICM
IAFMFSFSLFNSTISVLIRDYQFLLQAVTRLLFFLLPIFWDVNAKLGQSHPELVPVLKLNPLFYIIEGFRNSFLDGAWFF
HDMKYTLYFWLFTFLLLLVGSILHMKFRDKFVDFL
>P42954 7.5.2.4~~~tagH~~~Teichoic acids export ATP-binding protein TagH~~~COG1134
MKLKVSFRNVSKQYHLYKKQSDKIKGLFFPAKDNGFFAVRNVSFDVYEGETIGFVGINGSGKSTMSNLLAKIIPPTSGEI
EMNGQPSLIAIAAGLNNQLTGRDNVRLKCLMMGLTNKEIDDMYDSIVEFAEIGDFINQPVKNYSSGMKSRLGFAISVHID
PDILIIDEALSVGDQTFYQKCVDRINEFKKQGKTIFFVSHSIGQIEKMCDRVAWMHYGELRMFDETKTVVKEYKAFIDWF
NKLSKKEKETYKKEQTEERKKEDPEAFARFRKKKKKPKSLANAIQIAILSILTVFMAGTMFFNAPLRTIASFGAIPQNEV
KNHHGDAKGKSEERLTAINKQGFIANEKAAAYKDQGLKQKADVTLPFGTKVTVAAKGKQAAKIKFDGHSYYVKQSAVATN
MKHAELHATAFTSYVSQNAASSYEYFLKFLGDSSTSIQSKLNGYTEGNKADGRKTLNFDYEKISYVLENDKATELIFHNI
SPINPASLSLSDSDVLYDSSKKRFLVNTDDQVFAVDNEEHTLTLMLK
>Q7A713 7.5.2.4~~~tagH~~~Teichoic acids export ATP-binding protein TagH~~~
MNVSVNIKNVTKEYRIYRTNKERMKDALIPKHKNKTFFALDDISLKAYEGDVIGLVGINGSGKSTLSNIIGGSLSPTVGK
VDRNGEVSVIAISAGLSGQLTGIENIEFKMLCMGFKRKEIKAMTPKIIEFSELGEFIYQPVKKYSSGMRAKLGFSINITV
NPDILVIDEALSVGDQTFAQKCLDKIYEFKEQNKTIFFVSHNLGQVRQFCTKIAWIEGGKLKDYGELDDVLPKYEAFLND
FKKKSKAEQKEFRNKLDEARFVIK
>Q9I746 ~~~tagJ~~~Type VI secretion system accessory component TagJ~~~
MADPSFASGRLGSRLQGSIAMIAEELLRAGRLDDALKALQEQVRSQPSNATLRIFLFQLLAVMGQWARAQNQLKVVGELD
ASALPMVQTYSTAIDCEALRREVFAGRLTPVILGQPAEWIAPLLQALSLDAEGHGEAAQALREQAFDAAPAVPGRIGEAP
FAWLADADTRLGPVLEVIVNGRYAWLPMSNLRSLKVEAPSDLRDLVWLPAELTLANGGATVALLPARYAETVEHGDDAAR
LGRKTEWLDSGLPVGQRLFVTDAGETALFDLRELDFEPTDA
>A9CES5 2.7.1.101~~~~~~Tagatose kinase~~~COG0524
MRQASVLAPHTNGPTVTAGEILVEIMATTVGDGFLEPQTLIGPFPSGAPAIFIDQVARCGGTAGIIAAVGDDDFGRLNIE
RLRRDGVDVSAITTIADRPTGSAFVRYRENGARDFVFNIAHSAASETRMTDEAQALIEKAGHVHIMGSAFAIAGIGAIIL
EAVKSVKARGGSVSFDPNIRKELAQGDEGRRLIDDLLAVTDLLLPSGEELQAASGLDDEEAAIEKLLAAGIGEIVLKRGA
NGASHFSRAHGRIDAPGLEVEEIDPTGAGDCFGATYLTCRRLGMRPGKALVYANASGAHNVTKRGPMEGAASLAELDAFM
AAQLSEEVL
>O34753 2.7.8.33~~~tagO~~~Probable undecaprenyl-phosphate N-acetylglucosaminyl 1-phosphate transferase~~~COG0472
MLDERMIRIVVAFIVSLLTVLIITPIVKRIAIKIGAVDQPSNRKVHDKIMPRMGGLAIFIGVVAGVLASGIYTETRMTAI
TVGAFIIIVLGILDDKYQLSAKVKFLIQLGVAIMIVSTGLKMDFFSVPFLTERFELGWMAYPLTVLWIVGITNAINLIDG
LDGLAAGLSVIGLSTIAVMALSGGKVLILSLSLVVIASTLGFLFYNFHPAKIFMGDTGSLFLGYSISILSLLGLYKSVTL
FSIVIPIIILGVPIFDTTFAIIRRILNKQPISAPDKSHIHHRLMAFGLSHRMSVLVIYLIGFIFSISAIVLKSATIWLSL
FIIFILIIFMQIIAEVTGLVNEKFKPFTKFYKRLVKRN
>A9CES6 5.1.3.40~~~~~~D-tagatose 6-phosphate 4-epimerase~~~COG4573
MTAILENLAAARRAGKPAGITSVCSAHPVVLRAAIRRAAASQTAVLIEATCNQVNHLGGYTGMTPRDFVAFVNSIAAEEG
LPAELLIFGGDHLGPNPWRREKAEDALTKAAAMVDAYVTAGFRKIHLDASMGCAGEPAALDDVTIAHRAAKLTAVAEKAA
TEAGLPKPLYILGTEVPVPGGADHVLETVAPTEPQAARNTIDLHREIFAQHGLSDAFERVIAFVVQPGVEFGSDNVVAYD
PQAAQSLSAVLDGEPRLVFEAHSTDYQTEPALAALVRDGYPILKVGPGLTFAYREALYALDMIASEMVGTYGDRPLARTM
EKLMLSAPGDWQGHYHGDDITLRLQRHYSYSDRIRYYWTRPEALAAVSTLHKALDGKTIPETLLRQYLGELPLAAVAGKE
PEEVLVAAVDQVLATYHAATGEGRH
>Q7WY78 2.7.8.-~~~tagT~~~Polyisoprenyl-teichoic acid--peptidoglycan teichoic acid transferase TagT~~~COG1316
MEERSQRRKKKRKLKKWVKVVAGLMAFLVIAAGSVGAYAFVKLNNASKEAHVSLARGEQSVKRIKEFDPGKDSFSVLLLG
IDAREKNGETVDQARSDANVLVTFNRKEKTAKMLSIPRDAYVNIPGHGYDKFTHAHAYGGVDLTVKTVEEMLDIPVDYVV
ESNFTAFEDVVNELNGVKVTVKSDKVIQQIKKDTKGKVVLQKGTHTLDGEEALAYVRTRKADSDLLRGQRQMEVLSAIID
KSKSLSSIPAYDDIVDTMGQNLKMNLSLKDAIGLFPFITSLKSVESIQLTGYDYEPAGVYYFKLNQQKLQEVKKELQNDL
GV
>Q02115 2.7.8.-~~~tagU~~~Polyisoprenyl-teichoic acid--peptidoglycan teichoic acid transferase TagU~~~COG1316
MRNERRKKKKTLLLTILTIIGLLVLGTGGYAYYLWHKAASTVASIHESIDKSKKRDKEVSINKKDPFSVLIMGVDERDGD
KGRADTLIYMTVNPKTNTTDMVSIPRDTYTKIIGKGTMDKINHSYAFGGTQMTVDTVENFLDVPVDYFVKVNMESFRDVV
DTLGGITVNSTFAFSYDGYSFGKGEITLNGKEALAYTRMRKEDPRGDFGRQDRQRQVIQGIINKGANISSITKFGDMFKV
VENNVKTNLTFDNMWDIQSDYKGARKHIKQHELKGTGTKINGIYYYQADESALSDITKELKESLEK
>P96499 2.7.8.-~~~tagV~~~Polyisoprenyl-teichoic acid--peptidoglycan teichoic acid transferase TagV~~~COG1316
MAERVRVRVRKKKKSKRRKILKRIMLLFALALLVVVGLGGYKLYKTINAADESYDALSRGNKSNLRNEVVDMKKKPFSIL
FMGIEDYATKGQKGRSDSLIVVTLDPKNKTMKMLSIPRDTRVQLAGDTTGSKTKINAAYSKGGKDETVETVENFLQIPID
KYVTVDFDGFKDVINEVGGIDVDVPFDFDEKSDVDESKRIYFKKGEMHLNGEEALAYARMRKQDKRGDFGRNDRQKQILN
ALIDRMSSASNIAKIDKIAEKASENVETNIRITEGLALQQIYSGFTSKKIDTLSITGSDLYLGPNNTYYFEPDATNLEKV
RKTLQEHLDYTPDTSTGTSGTEDGTDSSSSSGSTGSTGTTTDGTTNGSSYSNDSSTSSNNSTTNSTTDSSY
>Q7A711 2.4.-.-~~~tagX~~~Putative glycosyltransferase TagX~~~
MRLTIIIPTCNNEATIRQLLISIESKEHYRILCIDGGSTDQTIPMIERLQRELKHISLIQLQNASIATCINKGLMDIKMT
DPHDSDAFMVINPTSIVLPGKLDRLTAAFKNNDNIDMVIGQRAYNYHGEWKLKSADEFIKDNRIVTLTEQPDLLSMMSFD
GKLFSAKFAELQCDETLANTYNHAILVKAMQKATDIHLVSQMIVGDNDIDTHATSNDEDFNRYIIEIMKIRQRVMEMLLL
PEQRLLYSDMVDRILFNNSLKYYMNEHPAVTHTTIQLVKDYIMSMQHSDYVSQNMFDIINTVEFIGENWDREIYELWRQT
LIQVGINRPTYKKFLIQLKGRKFAHRTKSMLKR
>Q3J1R2 ~~~takP~~~Alpha-keto acid-binding periplasmic protein TakP~~~COG4663
MDRRSFITKAAVGGAAASALAAPALAQSAPKVTWRLASSFPKSLDTIFGGAEVLSKMLSEATDGNFQIQVFSAGELVPGL
QAADAVTEGTVECCHTVGYYYWGKDPTFALAAAVPFSLSARGINAWHYHGGGIDLYNEFLSQHNIVAFPGGNTGVQMGGW
FRREINTVADMQGLKMRVGGFAGKVMERLGVVPQQIAGGDIYPALEKGTIDATEWVGPYDDEKLGFFKVAPYYYYPGWWE
GGPTVHFMFNKSAYEGLTPTYQSLLRTACHAADANMLQLYDWKNPTAIKSLVAQGTQLRPFSPEILQACFEAANEVYAEM
EASNPAFKKIWDSIKAFRSEHYTWAQIAEYNYDTFMMVQQNAGKL
>P0A870 2.2.1.2~~~talB~~~Transaldolase B~~~COG0176
MTDKLTSLRQYTTVVADTGDIAAMKLYQPQDATTNPSLILNAAQIPEYRKLIDDAVAWAKQQSNDRAQQIVDATDKLAVN
IGLEILKLVPGRISTEVDARLSYDTEASIAKAKRLIKLYNDAGISNDRILIKLASTWQGIRAAEQLEKEGINCNLTLLFS
FAQARACAEAGVFLISPFVGRILDWYKANTDKKEYAPAEDPGVVSVSEIYQYYKEHGYETVVMGASFRNIGEILELAGCD
RLTIAPALLKELAESEGAIERKLSYTGEVKARPARITESEFLWQHNQDPMAVDKLAEGIRKFAIDQEKLEKMIGDLL
>Q3IWB0 4.3.1.23~~~hutH~~~Tyrosine ammonia-lyase~~~COG2986
MLAMSPPKPAVELDRHIDLDQAHAVASGGARIVLAPPARDRCRASEARLGAVIREARHVYGLTTGFGPLANRLISGENVR
TLQANLVHHLASGVGPVLDWTTARAMVLARLVSIAQGASGASEGTIARLIDLLNSELAPAVPSRGTVGASGDLTPLAHMV
LCLQGRGDFLDRDGTRLDGAEGLRRGRLQPLDLSHRDALALVNGTSAMTGIALVNAHACRHLGNWAVALTALLAECLRGR
TEAWAAALSDLRPHPGQKDAAARLRARVDGSARVVRHVIAERRLDAGDIGTEPEAGQDAYSLRCAPQVLGAGFDTLAWHD
RVLTIELNAVTDNPVFPPDGSVPALHGGNFMGQHVALTSDALATAVTVLAGLAERQIARLTDERLNRGLPPFLHRGPAGL
NSGFMGAQVTATALLAEMRATGPASIHSISTNAANQDVVSLGTIAARLCREKIDRWAEILAILALCLAQAAELRCGSGLD
GVSPAGKKLVQALREQFPPLETDRPLGQEIAALATHLLQQSPV
>Q1LRV9 4.3.1.23~~~~~~Tyrosine ammonia-lyase~~~COG2986
MPHAHPADIDGHHLTPDTVAAIARGQRAAIVPEPVLGKVADARARFEQVAAANVPIYGVSTGFGELVHNWVDIEHGRALQ
ENLLRSHCAGVGPLFSRDEVRAMMVARANALARGYSAVRPAVIEQLLKYLEAGITPAVPQVGSLGASGDLAPLSHVAITL
IGEGKVLTEDGGTAPTAEVLRERGITPLALAYKEGLALINGTSAMTGVSCLLLETLRAQVQQAEIIAALALEGLSASADA
FMAHGHDIAKPHPGQIRSAANMRALLADSARLSGHGELSAEMKTRAGEAKNTGTGVFIQKAYTLRCIPQVLGAVRDTLDH
CATVVERELNSSNDNPLFFEDGELFHGGNFHGQQVAFAMDFLAIAATQLGVVSERRLNRLLSPHLNNNLPAFLAAANEGL
SCGFAGAQYPATALIAENRTICSPASIQSVPSNGDNQDVVSMGLIAARNARRILDNNQYILALELLASCQAAELAGAVEQ
LAPAGRAVFAFVRERVPFLSIDRYMTDDIEAMAALLRQGALVEVVRGAGIELA
>P19669 2.2.1.2~~~tal~~~Transaldolase~~~COG0176
MLFFVDTANIDEIREANELGILAGVTTNPSLVAKEANVSFHDRLREITDVVKGSVSAEVISLKAEEMIEEGKELAKIAPN
ITVKIPMTSDGLKAVRALTDLGIKTNVTLIFNANQALLAARAGATYVSPFLGRLDDIGHNGLDLISEVKQIFDIHGLDTQ
IIAASIRHPQHVTEAALRGAHIGTMPLKVIHALTKHPLTDKGIEQFLADWNK
>E5F146 1.1.1.344~~~tal~~~dTDP-6-deoxy-L-talose 4-dehydrogenase (NAD(P)(+))~~~
MTAPHTAPARVAVLGGTGFIGRVLGARLLAQGAEVLSLARKAPAEPAPGRFVAFDLSNGDPAELTALLDRERIDTVVNAA
GGMWGLNDEQMYQANVVLTERLIEAVAAMASPARLVHLGTVHEYGMAPVGTSQRESDPAAPVMEYGKLKLAATEAVVRAV
EAGRISGVVLRLGNVVGAGQPGHSLLGVMAAKLDAARAAGETAQLSLQPLTALRDFVDLTDTLDAVLLAAADRSAPPVVN
VGTGSASTARHLVELLIEESGVPTEITEVPAPDGTGPETEWQQLDVTVARDSLGWTPRRTLREAVRELWTAQSTAPVA
>A0QWX9 2.2.1.2~~~tal~~~Transaldolase~~~COG0176
MAQNPNLAALSAAGVSVWLDDLSRDRLQTGNLTELINTRSVVGVTTNPSIFQAALSKGTAYDAQVNELAARGADVDATIR
TVTTDDVRNACDLLAKEYEASDGVDGRVSIEVDPRLAHDTDKTILQAIELWKIVDRPNLLIKIPATMAGLPAISAVIAEG
ISVNVTLIFSVERHRLVMDAYLEGLEKAKEAGHDLSKIHSVASFFVSRVDTEIDARLEKIGSDEALALRGKAGVANARLA
YAAYEEVFGSDRFAKLKADGARVQRPLWASTGVKNPEYSDTLYVTELVAPNTVNTMPEKTLEAVADHGEITGNTIAGTAA
SSQETFDKLAAIGIDLPDVFRVLEDEGVEKFEKSWQELLDATQGQLDAAKK
>P9WG33 2.2.1.2~~~tal~~~Transaldolase~~~COG0176
MTAQNPNLAALSAAGVSVWLDDLSRDRLRSGNLQELIDTKSVVGVTTNPSIFQKALSEGHTYDAQIAELAARGADVDATI
RTVTTDDVRSACDVLVPQWEDSDGVDGRVSIEVDPRLAHETEKTIQQAIELWKIVDRPNLFIKIPATKAGLPAISAVLAE
GISVNVTLIFSVQRYREVMDAYLTGMEKARQAGHSLSKIHSVASFFVSRVDTEIDKRLDRIGSRQALELRGQAGVANARL
AYATYREVFEDSDRYRSLKVDGARVQRPLWASTGVKNPDYSDTLYVTELVAPHTVNTMPEKTIDAVADHGVIQGDTVTGT
ASDAQAVFDQLGAIGIDLTDVFAVLEEEGVRKFEASWNELLQETRAHLDTAAQ
>Q5F6E9 2.2.1.2~~~tal~~~Transaldolase~~~
MTILSDVKALGQQIWLDNLSRSLVQSGELAQMLKQGVCGVTSNPAIFQKAFAGDALYADEVAALKRQNLSPKQRYETMAV
ADVRAACDVCLAEHESTGGKTGFVSLEVSPELAKDAQGTVEEARRLHAAIARKNAMIKVPATDAGIDALETLVSDGISVN
LTLLFSRAQTLKAYAAYARGIAKRLAAGQSVAHIQVVASFFISRVDSALDATLPDRLKGKTAIALAKAAYQDWEQYFTAP
EFAALEAQGANRVQLLWASTGVKNPAYPDTLYVDSLIGVHTVNTVPDATLKAFIDHGTAKATLTESADEARARLAEIAAL
GIDVETLAARLQEDGLKQFEEAFEKLLAPLV
>Q31C15 2.2.1.2~~~tal~~~Transaldolase~~~COG0176
MKSILEQLSSMTVVVADTGDLDSIKKFQPRDATTNPSLILAAAKNPDYVKLIDKAIESSENTLPNGFSEIELIKETVDQV
SVFFGKEILKIISGRVSTEVDARLSFDTEATVKKARKLINLYKNFGIEKERILIKIAATWEGIKAAEILEKEGIKCNLTL
LFNFCQAVTCANANITLISPFVGRILDWHKAKTGKTSFIGAEDPGVISVTQIYKYFKEKGFKTEVMGASFRNLDEIKELA
GCDLLTIAPKFLEELKREKGVLIRKLDASTKINNSIDYKFEEKDFRLSMLEDQMASEKLSEGITGFSKAIEELEELLIER
LSEMKNHKLISAN
>Q9WYD1 2.2.1.2~~~tal~~~Transaldolase~~~COG0176
MKIFLDTANLEEIKKGVEWGIVDGVTTNPTLISKEGAEFKQRVKEICDLVKGPVSAEVVSLDYEGMVREARELAQISEYV
VIKIPMTPDGIKAVKTLSAEGIKTNVTLVFSPAQAILAAKAGATYVSPFVGRMDDLSNDGMRMLGEIVEIYNNYGFETEI
IAASIRHPMHVVEAALMGVDIVTMPFAVLEKLFKHPMTDLGIERFMEDWKKYLENLKK
>Q5SJE8 2.2.1.2~~~tal~~~Probable transaldolase~~~COG0176
MELYLDTASLEEIREIAAWGVLSGVTTNPTLVAKAFAAKGEALTEEAFAAHLRAICETVGGPVSAEVTALEAEAMVAEGR
RLAAIHPNIVVKLPTTEEGLKACKRLSAEGIKVNMTLIFSANQALLAARAGASYVSPFLGRVDDISWDGGELLREIVEMI
QVQDLPVKVIAASIRHPRHVTEAALLGADIATMPHAVFKQLLKHPLTDIGLKRFLEDWEKVKP
>D2TN56 ~~~tamA~~~Translocation and assembly module subunit TamA~~~COG0729
MPHIRQLCWVSLMCLSSSAFAANVRLQVEGLSGELERNVRAQLSTIQSDEVTPDRRFRARVDDAIREGLKALGYYEPTID
FDLRPPPAKGRQVLLARVSPGEPVRIGGTGVILRGGARTDRDYLDLLKTRPAIGTVLNHGDYDNFKKSLTSVALRKGYFD
SEFNKSQLGVALDRHQAFWDIDYNSGERYRFGHVTFEGSQIRDEYLQNLVPFKEGDEYESKDLGELNRRLSATGWFNSVV
VAPEFDKARETKVLPLKGVVSPRTENTVETGVGYSTDVGPRVKATWKKPWMNSYGHSLTTSASISAPEQVLDFSYKIPLL
KNPLEQYYLVQGGFKRTDLNDTEQDSTTLALSRYWDLSSGWQRAINLRWSLDHFTQGEVTNTTMLLYPGVMISRTRSRGG
LMPTWGDSQRYSVDYSNTAWGSDVDFSVIQAQNVWIRTLYDRHRFVMRGNLGWIETGDFDKVPPDLRFFAGGDRSIRGYK
YKSISPKDSNGDLKGASKLATGSLEYQYNVTGKWWGAVFVDSGEAVSDIRRSDFKTGAGVGVRWQSPVGPIKLDFAAPIG
DKDEHGLQFYIGLGPEL
>P0ADE4 ~~~tamA~~~Translocation and assembly module subunit TamA~~~COG0729
MRYIRQLCCVSLLCLSGSAVAANVRLQVEGLSGQLEKNVRAQLSTIESDEVTPDRRFRARVDDAIREGLKALGYYQPTIE
FDLRPPPKKGRQVLIAKVTPGVPVLIGGTDVVLRGGARTDKDYLKLLDTRPAIGTVLNQGDYENFKKSLTSIALRKGYFD
SEFTKAQLGIALGLHKAFWDIDYNSGERYRFGHVTFEGSQIRDEYLQNLVPFKEGDEYESKDLAELNRRLSATGWFNSVV
VAPQFDKARETKVLPLTGVVSPRTENTIETGVGYSTDVGPRVKATWKKPWMNSYGHSLTTSTSISAPEQTLDFSYKMPLL
KNPLEQYYLVQGGFKRTDLNDTESDSTTLVASRYWDLSSGWQRAINLRWSLDHFTQGEITNTTMLFYPGVMISRTRSRGG
LMPTWGDSQRYSIDYSNTAWGSDVDFSVFQAQNVWIRTLYDRHRFVTRGTLGWIETGDFDKVPPDLRFFAGGDRSIRGYK
YKSIAPKYANGDLKGASKLITGSLEYQYNVTGKWWGAVFVDSGEAVSDIRRSDFKTGTGVGVRWESPVGPIKLDFAVPVA
DKDEHGLQFYIGLGPEL
>P44038 ~~~tama~~~Translocation and assembly module subunit TamA~~~COG0729
MKKKSLKLTALFLALSCFPAFAEQTVDIEVQGIRGFRAVRNTDLNVNLINKEEMDGSERYQHLVTKAVDRGLRVFGYYES
SVRFERKQRQGKRDLLIAHVTPGEPTKIAGTDVQIEGEAAQDENFNALRKNLPKDGVLVEHQTYDDYKTAISRLALNRGY
FDGNFKISRLEISPETHQAWWRMLFDSGVRYHYGNITFSHSQIRDDYLNNILNIKSGDPYLMNNLSDLTSDFPSSNWFSS
VLVQPNVNHKSKTVDVEIILYPRKKNAMELGVGFSTDGGVHGQIGWTKPWINSRGHSLRSNLYLSAPKQTLEATYRMPLL
KNPLNYYYDFAVGWEGEKENDTNTRVLTLSALRYWNNAHGWQYFGGLRMRYDSFTQADITDKTLLLYPTVGFTRTRLRGG
SFATWGDVQKITFDLSKRIWLSESSFIKVQASSAWVRTYAENHRVVARAEIGYLHTKGIEKIPPTLRFFAGGDRSVRGYG
YKKIAPKNRNGKLVGGSRLLTTSLEYQYQVYPNWWAATFADSGLAADNYTAKELRYGTGVGVRWASPVGAIKFDIATPIR
DKDNSKNIQFYIGLGTEI
>D2TN57 ~~~tamB~~~Translocation and assembly module subunit TamB~~~COG2911
MSLWKKISLGVLIFILLLLATVGFLVGTTTGLHLVFSAANRWVPGLEIGQVTGGWRDLSLKNIRYDQPGVAVNAGEVHLA
VGLECLWKSSLCVNDLSLKDINVVIDSKKMPPGEQVEEEEESGPLNLSTPYPVTLSRVALENINVKIDDTTVSVMDFTSG
LNWQEKNLTLKPTALQGLLIALPKVAEVAQEEVVEPKIQNPQPDEKPLGETLQDLFSKPVLPEMTDVHLPLNLNIEEFKG
EQLRLTGDTDLTVFSLLLKVSSIDGNMKLDALDIDSSQGAVNATGTAQLANNWPVDITLNSTLNVEPLKGEKIKLKVGGA
LREQLEVGVNLSGPLDVNLRAQARLAEAGLPLNLEVVSEQISWPFTGDRQFQADNTRLKLTGKMTDYTLSMRTAVKGQDV
PPATITLDAKGNEQQINLDKLTVAALEGKTELKALVDWRQAISWRGELTLDGINTAKEVPDWPSKLNGLIKTRGSLYGGS
WQMEVPELKLTGNVKQNKVNVNGSLKGNSYMQWTIPGLHLVLGPNSADVKGELGVKDLNLDATIDAPGLDNALPGLGGTA
KGLVKVRGTVDAPQLLADITARGLRWQELSIAQVRVDGDIKSTDQIAGKLDVRVERISQPDVNINLVTLHAKGSEKQHEL
QLRIQGEPVSGQLDLAGSFDREEMRWKGTLSNTRFRTPVGPWSQTRAIALDYRGQEQKISIGPHCWTNPNAELCVPQTID
AGAEGRAVVNLNRFDLAMLKPFMPETTQASGVFSGKADVAWDTTKEGLPQGNVTLSGRSVKVTQTVNDAPLPLAFDTLNV
SADLHDNRAELGWQIRLSNNGQLDGQVQVTDPQGRRNLGGNVSIRNLNLAMVNPIFARGEKAAGLLNANLRLGGDVQSPQ
MFGQLQLNGVDIDGNFMPFDMQPSQLAMNFNGTRSTLTGVVRTQQGEINLSGDADWSQIENWRARIAAKGSRVRITVPPM
VRLDVSPDVVFEATPSLFTLDGRVDVPWARIVVHELPESAVGVSSDEVMLNNQLQPEEPQTAAIPINSNLIVHVGNNVRM
DAFGLRARLTGDLKVAQDKQGLGLNGQINIPEGRFHAYGQDLLVRKGELLFSGPPDQPILNIEAIRNPEATEDDVIAGVR
VTGSADEPKAEIFSDPAMSQQEALSYLLRGQGLDSNQSDSAAMTSMLIGLGVAQSGQVVGKIGETFGVSNLALDTQGVGD
SSQVVVSGYVLPGLQVKYGVGIFDSLATLTLRYRLMPKLYLEAVSGVDQALDLLYQFEF
>P39321 ~~~tamB~~~Translocation and assembly module subunit TamB~~~COG2911
MSLWKKISLGVVIVILLLLGSVAFLVGTTSGLHLVFKAADRWVPGLDIGKVTGGWRDLTLSDVRYEQPGVAVKAGNLHLA
VGLECLWNSSVCINDLALKDIQVNIDSKKMPPSEQVEEEEDSGPLDLSTPYPITLTRVALDNVNIKIDDTTVSVMDFTSG
LNWQEKTLTLKPTSLKGLLIALPKVAEVAQEEVVEPKIENPQPDEKPLGETLKDLFSRPVLPEMTDVHLPLNLNIEEFKG
EQLRVTGDTDITVSTMLLKVSSIDGNTKLDALDIDSSQGIVNASGTAQLSDNWPVDITLNSTLNVEPLKGEKVKLKMGGA
LREQLEIGVNLSGPVDMDLRAQTRLAEAGLPLNVEVNSKQLYWPFTGEKQYQADDLKLKLTGKMTDYTLSMRTAVKGQEI
PPATITLDAKGNEQQVNLDKLTVAALEGKTELKALLDWQQAISWRGELTLNGINTAKEFPDWPSKLNGLIKTRGSLYGGT
WQMDVPELKLTGNVKQNKVNVDGTLKGNSYMQWMIPGLHLELGPNSAEVKGELGVKDLNLDATINAPGLDNALPGLGGTA
KGLVKVRGTVEAPQLLADITARGLRWQELSVAQVRVEGDIKSTDQIAGKLDVRVEQISQPDVNINLVTLNAKGSEKQHEL
QLRIQGEPVSGQLNLAGSFDRKEERWKGTLSNTRFQTPVGPWSLTRDIALDYRNKEQKISIGPHCWLNPNAELCVPQTID
AGAEGRAVVNLNRFDLAMLKPFMPETTQASGIFTGKADVAWDTTKEGLPQGSITLSGRNVQVTQTVNDAALPVAFQTLNL
TAELRNNRAELGWTIRLTNNGQFDGQVQVTDPQGRRNLGGNVNIRNFNLAMINPIFTRGEKAAGMVSANLRLGGDVQSPQ
LFGQLQVTGVDIDGNFMPFDMQPSQLAVNFNGMRSTLAGTVRTQQGEIYLNGDADWSQIENWRARVTAKGSKVRITVPPM
VRMDVSPDVVFEATPNLFTLDGRVDVPWARIVVHDLPESAVGVSSDVVMLNDNLQPEEPKTASIPINSNLIVHVGNNVRI
DAFGLKARLTGDLNVVQDKQGLGLNGQINIPEGRFHAYGQDLIVRKGELLFSGPPDQPYLNIEAIRNPDATEDDVIAGVR
VTGLADEPKAEIFSDPAMSQQAALSYLLRGQGLESDQSDSAAMTSMLIGLGVAQSGQIVGKIGETFGVSNLALDTQGVGD
SSQVVVSGYVLPGLQVKYGVGIFDSIATLTLRYRLMPKLYLEAVSGVDQALDLLYQFEF
>Q57523 ~~~tamB~~~Translocation and assembly module subunit TamB~~~COG2911
MTEQIQPSETSPKSPEKPNKKHWVRKAVCIGSAVILVPVLGVAGALSFDAGQKSLIQLVDKMLDSFSVEQVEGGLQNGLV
LKNVRYQTAGIETHIAQARLQLDFGCLFSREVCLRDFTLNKPTIAINTALLPPSAPDNSKSGSMKRISLPISINAENLVM
QDLSVNIDQTSITLGNFKSAVSLNNEKGLTIAPTEINDISVIAKKLSEVKSEPKAEQPNKPVDWAAIEQSLTPAFLGNVS
EIILPFDLHIPEISGKNWQYQAVNEKGETLQSVEMSSLIAQADTVDNQLQLQKLAVESSLGNLSSQGKLQLDGDMPLDLT
LKSHLEPLKSDGKEILPASDVDLTLSGSLKKSTALSLKTKGVLDAELNGNVQLAQDKMPLNLTLNVAKGQYTFVNTMTPL
KINDVTLKLTGDLLNYHAELKGDVAGMNYIPASQVELNADGKLYEVTVNKLGIDSLDGKSEFVGNANWKNGANWDIQADL
EKMNIAFFVPVMPATLSGKLHSRGFAGSQGWQVEVPVADLNGMLSAKPISLKGSATLNQNVLLTVPDLQIKYGENYLKAS
GVLDDHSDFALDINAPNLRGLWSDLKGRVKGRVAISGQITTPNLDLDLTSSNLHLQGFQLAKASIKGHINNASLSSGKLN
IKAEQLHYGGNIKLHLLDLDLSGDEQNHKLILKSQGEPVAANLQINGHFDRTLEQWKGTISQVKFETPIGDVKSNQAIAV
SYDNKQTQANIASHCWQNTDVELCFPQAFNAGKQGNIPFQFKRVNLDLVNKLIEQNSLKGNLQVQGNVAWFTDKPFQFTA
NVDGNHLAFSQKLDYRTFKLYIPKLTLNADIQNNNLVLKTDINVHNQGRIVGDIHLNDLAKNRQLGGTLAIERLNLSIAN
QLLTSGESVNGEVVSKLSFGGNLEKPLLNGDFNIRNIRTKLKSMPVNITDGDIALRFNDNRSTLQGKIKTVDSHLNLTGR
ANWANIEHWTTELNAQANNFNVDIPSMAKLRFSPNITIKANPKELNLSGTVDIPWARIKIDSLPDTAEPVSEDEVILNGP
HKSKEELIKREFAAKTKSGMEIRSDLRINIGKDVSLDAYGLKTNLDGLLSVKQDKGNLGLFGQINLTKGRYASFGQDLLI
RKGLISFSGQATQPTLNIEAIRNPETMEDSKITAGVRVIGIADSPEVTIFSEPSKPQDQALSYLLTGRSLESSGEVGSTG
SVGAALIGLGISKSGKLVGSIGEVFGIQDLNLGTSGVGDKSKVTVSGNITNRLQIKYGVGLFDGLAEVTLRYRLMPQLYF
QSVSSTNQVFDLLYKFEF
>P83543 3.4.-.-~~~~~~Transglutaminase-activating metalloprotease~~~
MRPTPQRRAVATGALVAVTAMLAVGVQTTSANAGQDKAAHPAPRQSIHKPDPGAEPVKLTPSQRAELIRDANATKAETAK
NLGLGAKEKLVVKDVVKDKNGTLHTRYERTYDGLPVLGGDLVVDATRSGQVKTAAKATKQRIAVASTTPSLAASAAEKDA
VKAARAKGSKAGKADKAPRKVVWAAKGTPVLAYETVVGGVQDDGTPSQLHVITDAKTGKKLFEFQGVKQGTGNSQHSGQV
QIGTTKSGSSYQMNDTTRGGHKTYNLNHGSSGTGTLFTDSDDVWGNGTNSDPATAGVDAHYGAQLTWDYYKNVHGRNGIR
GDGVGAYSRVHYGNNYVNAFWDDSCFCMTYGDGNGIPLTSIDVAAHEMTHGVTSATANLTYSGESGGLNEATSDMMATAV
EFWANNPADPGDYLIGEKININGDGTPLRYMDKPSKDGASKDAWYSGLGGIDVHYSSGPANHWFYLASEGSGPKDIGGVH
YDSPTSDGLPVTGVGRDNAAKIWFKALTERMQSNTDYKGARDATLWAAGELFGVNSDTYNNVANAWAAINVGPRASSGVS
VTSPGDQTSIVNQAVSLQIKATGSTSGALTYSATGLPAGLSINASTGLISGTPTTTGTSNVTVTVKDSAGKTGSTSFKWT
VNTTGGGSVFENTTQVAIPDAGAAVTSPIVVTRSGNGPSALKVDVNITHTYRGDLTIDLVAPNGKTWRLKNSDAWDSAAD
VSETYTVDASSVSANGTWKLKVQDVYSGDSGTIDKWRLTF
>Q8UH15 2.1.1.144~~~tam~~~Trans-aconitate 2-methyltransferase~~~COG4106
MAWSAQQYLKFEDERTRPARDLLAQVPLERVLNGYDLGCGPGNSTELLTDRYGVNVITGIDSDDDMLEKAADRLPNTNFG
KADLATWKPAQKADLLYANAVFQWVPDHLAVLSQLMDQLESGGVLAVQMPDNLQEPTHIAMHETADGGPWKDAFSGGGLR
RKPLPPPSDYFNALSPKSSRVDVWHTVYNHPMKDADSIVEWVKGTGLRPYLAAAGEENREAFLADYTRRIAAAYPPMADG
RLLLRFPRLFVVAVKK
>Q0VZ68 5.4.3.6~~~cmdF~~~Tyrosine 2,3-aminomutase~~~
MKITGSNLSIYDVADVCMKRATVELDPSQLERVAVAHERTQAWGEAQHPIYGVNTGFGELVPVMIPRQHKRELQENLIRS
HAAGGGEPFADDVVRAIMLARLNCLMKGYSGASVETVKLLAEFINRGIHPVIPQQGSLGASGDLSPLSHIALALIGEGTV
SFKGQVRKTGDVLREEGLKPLELGFKGGLTLINGTSAMTGAACVALGRAYHLFRLALLATADFVQCLGGSTGPFEERGHL
PKNHSGQVIVAREIRKLLAGSQLTSDHQDLMKEMVARSGVGNDVVDTGVYLQDAYTLRAVPQILGPVLDTLDFARKLIEE
ELNSTNDNPLIFDVPEQTFHGANFHGQYVAMACDYLNIAVTEIGVLAERQLNRLVDPNINGKLPPFLASAHSGLLCGFEG
GQYLATSIASENLDLAAPSSIKSLPSNGSNQDVVSMGTTSARKSLRLCENVGTIVSTLIAACNQAGHILGNERFSPPIRE
LHGELSRSVPLYQDDSPIFELFQTVRAFVGGDGFRAHLVTHLDLAATTASS
>P76145 2.1.1.144~~~tam~~~Trans-aconitate 2-methyltransferase~~~COG4106
MSDWNPSLYLHFSAERSRPAVELLARVPLENVEYVADLGCGPGNSTALLQQRWPAARITGIDSSPAMIAEARSALPDCQF
VEADIRNWQPVQALDLIFANASLQWLPDHYELFPHLVSLLNPQGVLAVQMPDNWLEPTHVLMREVAWEQNYPDRGREPLA
GVHAYYDILSEAGCEVDIWRTTYYHQMPSHQAIIDWVTATGLRPWLQDLTESEQQLFLKRYHQMLEEQYPLQENGQILLA
FPRLFIVARRME
>P9WGA3 2.1.1.144~~~tam~~~Probable trans-aconitate 2-methyltransferase~~~COG4106
MWDPDVYLAFSGHRNRPFYELVSRVGLERARRVVDLGCGPGHLTRYLARRWPGAVIEALDSSPEMVAAAAERGIDATTGD
LRDWKPKPDTDVVVSNAALHWVPEHSDLLVRWVDELAPGSWIAVQIPGNFETPSHAAVRALARREPYAKLMRDIPFRVGA
VVQSPAYYAELLMDTGCKVDVWETTYLHQLTGEHPVLDWITGSALVPVRERLSDESWQQFRQELIPLLNDAYPPRADGST
IFPFRRLFMVAEVGGARRSGG
>B8ZV93 5.4.3.6~~~tam~~~Tyrosine 2,3-aminomutase~~~
MDIYAVAVGRVGVELDAAQLERVRATHLRVQGWGMEKYPMYGVNTGFGELINVIIPPQFKSDLQHNLLRSHAAGGGEPFP
DEVVRAIMTVRINCLMKGYSGISPEALQLLATMLNRGIHPVIPMQGSLGASGDLAPLSHMALPLIGDGHVRKNGVTRPTM
EVFQEEGLTPLKLGFKEGLALVNGTSAMTGAASLALYRARHLLRLSLLASADIVQAMNASTRPFSHTGNALKNHPGQVVI
ARLMRDLTQGTGLMRDHQDIMRAISERTSHSNDVEETEIYLQNAYSLRCMPQVLGVVLETLQMCQRFIEEEANSVNDNPV
ILDTPAETYHGANFHGQYVAMACDYLSIAVAEMGVLAERQLNRLLDPHINKPLPGFLAHAKTGLFCGFEGGQYLATSIAS
ENLDLAAPSSIKSIPSNGQNQDIVSMGLIAARKTLALCENVGTILSVLMAALNQASHFTEAAKYSAPIRSIHEKLGKVAP
RYEDERPMSTVIAQVRGVLLQEQGLALAQSLVNLDLTPDLSLEPRA
>B8ZV94 5.4.3.6~~~tam~~~Tyrosine 2,3-aminomutase~~~
MDIYAVAVGRVGVELDAAQLERVRATHLRVQGWGMEKYPMYGVNTGFGELINVIIPPQFKSDLQHNLLRSHAAGGGEPFP
DEVVRAIMTVRINCLMKGYSGISPEALQLLATMLNRGIHPVIPMQGSLGASGDLAPLSHMALPLIGDGHVRKNGVTRPTM
EVFQEEGLTPLKLGFKEGLALVNGTSAMTGAASLALYRARHLLRLSLLASADIVQAMNASTRPFSHTGNAVKNHPGQVVI
ARLMRDLTEGTGLMRDHQDIMRAISERTSHSNDVEETEIYLQNAYSLRCMPQVLGVVLETLQMCQRFIEEEANSVNDNPV
ILDTPAETYHGANFHGQYVAMACDYLSIAVAEMGVLAERQLNRLLDPHINKPLPGFLAHAKTGLFCGFEGGQYLATSIAS
ENLDLAAPSSIKSIPSNGQNQDIVSMGLIAARKTLALCENVGTILSVLMAALNQASHFTEAAKYSAPIRSIHEKLGKVAP
RYEDERPMSTVIAQVRGVLLQEQGLALAQSLVNLDLTPDLSLEPRA
>Q8GMG0 5.4.3.6~~~~~~MIO-dependent tyrosine 2,3-aminomutase~~~
MALTQVETEIVPVSVDGETLTVEAVRRVAEERATVDVPAESIAKAQKSREIFEGIAEQNIPIYGVTTGYGEMIYMQVDKS
KEVELQTNLVRSHSAGVGPLFAEDEARAIVAARLNTLAKGHSAVRPIILERLAQYLNEGITPAIPEIGSLGASGDLAPLS
HVASTLIGEGYVLRDGRPVETAQVLAERGIEPLELRFKEGLALINGTSGMTGLGSLVVGRALEQAQQAEIVTALLIEAVR
GSTSPFLAEGHDIARPHEGQIDTAANMRALMRGSGLTVEHADLRRELQKDKEAGKDVQRSEIYLQKAYSLRAIPQVVGAV
RDTLYHARHKLRIELNSANDNPLFFEGKEIFHGANFHGQPIAFAMDFVTIALTQLGVLAERQINRVLNRHLSYGLPEFLV
SGDPGLHSGFAGAQYPATALVAENRTIGPASTQSVPSNGDNQDVVSMGLISARNARRVLSNNNKILAVEYLAAAQAVDIS
GRFDGLSPAAKATYEAVRRLVPTLGVDRYMADDIELVADALSRGEFLRAIARETDIQLR
>P40949 ~~~tapA~~~TasA anchoring/assembly protein~~~
MFRLFHNQQKAKTKLKVLLIFQLSVIFSLTAAICLQFSDDTSAAFHDIETFDVSLQTCKDFQHTDKNCHYDKRWDQSDLH
ISDQTDTKGTVCSPFALFAVLENTGEKLKKSKWKWELHKLENARKPLKDGNVIEKGFVSNQIGDSLYKIETKKKMKPGIY
AFKVYKPAGYPANGSTFEWSEPMRLAKCDEKPTVPKKETKSDVKKENETTQKDIPEKTMKEETSQEAVTKEKETQSDQKE
SGEEDEKSNEADQ
>Q47319 2.5.1.25~~~tapT~~~tRNA-uridine aminocarboxypropyltransferase~~~COG3148
MTENAVLQLRAERIARATRPFLARGNRVRRCQRCLLPEKLCLCSTITPAQAKSRFCLLMFDTEPMKPSNTGRLIADILPD
TVAFQWSRTEPSQDLLELVQNPDYQPMVVFPASYADEQREVIFTPPAGKPPLFIMLDGTWPEARKMFRKSPYLDNLPVIS
VDLSRLSAYRLREAQAEGQYCTAEVAIALLDMAGDTGAAAGLGEHFTRFKTRYLAGKTQHLGSITAEQLESV
>P9WJX9 ~~~tap~~~Multidrug efflux pump Tap~~~COG2271
MRNSNRGPAFLILFATLMAAAGDGVSIVAFPWLVLQREGSAGQASIVASATMLPLLFATLVAGTAVDYFGRRRVSMVADA
LSGAAVAGVPLVAWGYGGDAVNVLVLAVLAALAAAFGPAGMTARDSMLPEAAARAGWSLDRINGAYEAILNLAFIVGPAI
GGLMIATVGGITTMWITATAFGLSILAIAALQLEGAGKPHHTSRPQGLVSGIAEGLRFVWNLRVLRTLGMIDLTVTALYL
PMESVLFPKYFTDHQQPVQLGWALMAIAGGGLVGALGYAVLAIRVPRRVTMSTAVLTLGLASMVIAFLPPLPVIMVLCAV
VGLVYGPIQPIYNYVIQTRAAQHLRGRVVGVMTSLAYAAGPLGLLLAGPLTDAAGLHATFLALALPIVCTGLVAIRLPAL
RELDLAPQADIDRPVGSAQ
>Q54410 3.4.14.-~~~tap~~~Tripeptidyl aminopeptidase~~~
MRKSSIRRRATAFGTAGALVTATLIAGAVSAPAASAAPADGHGHGRSWDREARGAAIAAARAARAGIDWEDCAADWNLPK
PIQCGYVTVPMDYAKPYGKQIRLAVDRIGNTGTRSERQGALIYNPGGPGGSGLRFPARVTNKSAVWANTAKAYDFVGFDP
RGVGHSAPISCVDPQEFVKAPKADPVPGSEADKRAQRKLAREYAEGCFERSGEMLPHMTTPNTARDLDVIRAALGEKKLN
YLGVSYGTYLGAVYGTLFPDHVRRMVVDSVVNPSRDKIWYQANLDQDVAFEGRWKDWQDWVAANDAAYHLGDTRAEVQDQ
WLKLRAAAAKKPLGGVVGPAELISFFQSAPYYDSAWAPTAEIFSKYVAGDTQALVDAAAPDLSDTAGNASAENGNAVYTA
VECTDAKWPANWRTWDRDNTRLHRDHPFMTWANAWMNLPCATWPVKQQTPLNVKTGKGLPPVLIVQSERDAATPYEGAVE
LHQRFRGSRLITERDAGSHGVTGLVNPCINDRVDTYLLTGRTDARDVTCAPHATPRP
>Q2G2L3 2.4.1.187~~~tarA~~~N-acetylglucosaminyldiphosphoundecaprenol N-acetyl-beta-D-mannosaminyltransferase~~~COG1922
MTVEERSNTAKVDILGVDFDNTTMLQMVENIKTFFANQSTNNLFIVTANPEIVNYATTHQAYLELINQASYIVADGTGVV
KASHRLKQPLAHRIPGIELMDECLKIAHVNHQKVFLLGATNEVVEAAQYALQQRYPNISFAHHHGYIDLEDETVVKRIKL
FKPDYIFVGMGFPKQEEWIMTHENQFESTVMMGVGGSLEVFAGAKKRAPYIFRKLNIEWIYRALIDWKRIGRLKSIPIFM
YKIAKAKRKIKKAK
>Q7A714 2.4.1.187~~~tarA~~~N-acetylglucosaminyldiphosphoundecaprenol N-acetyl-beta-D-mannosaminyltransferase~~~
MTVEERSNTAKVDILGVDFDNTTMLQMVENIKTFFANQSTNNLFIVTANPEIVNYATTHQAYLELINQASYIVADGTGVV
KASHRLKQPLAHRIPGIELMDECLKIAHVNHQKVFLLGATNEVVEAAQYALQQRYPNISFAHHHGYIDLEDETVVKRIKL
FKPDYIFVGMGFPKQEEWIMTHENQFESTVMMGVGGSLEVFAGAKKRAPYIFRKLNIEWIYRALIDWKRIGRLKSIPIFM
YKIAKAKRKIKKAK
>Q2G2X4 2.7.8.44~~~tarB~~~Teichoic acid glycerol-phosphate primase~~~COG1887
MNVLIKKFYHLVVRILSKMITPQVIDKPHIVFMMTFPEDIKPIIKALNNSSYQKTVLTTPKQAPYLSELSDDVDVIEMTN
RTLVKQIKALKSAQMIIIDNYYLLLGGYNKTSNQHIVQTWHASGALKNFGLTDHQVDVSDKAMVQQYRKVYQATDFYLVG
CEQMSQCFKQSLGATEEQMLYFGLPRINKYYTADRATVKAELKDKYGITNKLVLYVPTYREDKADNRAIDKAYFEKCLPG
YTLINKLHPSIEDSDIDDVSSIDTSTLMLMSDIIISDYSSLPIEASLLDIPTIFYVYDEGTYDQVRGLNQFYKAIPDSYK
VYTEEDLIMTIQEKEHLLSPLFKDWHKYNTDKSLHQLTEYIDKMVTK
>Q89FH0 4.2.1.81~~~tarD~~~D(-)-tartrate dehydratase~~~COG4948
MSVRIVDVREITKPISSPIRNAYIDFTKMTTSLVAVVTDVVREGKRVVGYGFNSNGRYGQGGLIRERFASRILEADPKKL
LNEAGDNLDPDKVWAAMMINEKPGGHGERSVAVGTIDMAVWDAVAKIAGKPLFRLLAERHGVKANPRVFVYAAGGYYYPG
KGLSMLRGEMRGYLDRGYNVVKMKIGGAPIEEDRMRIEAVLEEIGKDAQLAVDANGRFNLETGIAYAKMLRDYPLFWYEE
VGDPLDYALQAALAEFYPGPMATGENLFSHQDARNLLRYGGMRPDRDWLQFDCALSYGLCEYQRTLEVLKTHGWSPSRCI
PHGGHQMSLNIAAGLGLGGNESYPDLFQPYGGFPDGVRVENGHITMPDLPGIGFEGKSDLYKEMKALAE
>Q2G2X2 2.7.7.39~~~tarD~~~Glycerol-3-phosphate cytidylyltransferase~~~COG0615
MKRVITYGTYDLLHYGHIELLRRAREMGDYLIVALSTDEFNQIKHKKSYYDYEQRKMMLESIRYVDLVIPEKGWGQKEDD
VEKFDVDVFVMGHDWEGEFDFLKDKCEVIYLKRTEGISTTKIKQELYGKDAK
>Q8RKI5 2.7.8.12~~~tarF~~~Teichoic acid poly(glycerol phosphate) polymerase~~~
MKSKILMKYRSLLVRIYSIVFRIIGLLPRNEKLIIFESYSGKQFSCNPRAIFEYLEENKDKYDYQLIWSIDKRNKDLFDN
SDVNYLRRFSLKWLWYMATAKYWVTNSRLPLWIPKPRNTTYVQTWHGTPLKKLANDMDEVHMPGTTTEQYKRNFLKEASK
WDYLISPNAYSTEIFRSAFQFKKTFIESGYPRNDFLHKKNRNEEMLKIKERLGINKDKKIILYAPTWRDNSFYAKGKYKF
NMVLDLESLKNQLCNEYILILRMHYLVSENINLTEYKEFAYDFSDHNDIRELYLISDILITDYSSVFFDFAGLKRPILFY
VPDIEFYRDNLRGFYYDFEKCAPGPLLKTTEKVIEAIHKTKNYKQDENITSFYDQFCYLEKGDSSKKVVEELLG
>Q2G1C1 2.7.8.45~~~tarF~~~Teichoic acid glycerol-phosphate transferase~~~COG1887
MIKNTIKKLIEHSIYTTFKLLSKLPNKNLIYFESFHGKQYSDNPKALYEYLTEHSDAQLIWGVKKGYEHIFQQHNVPYVT
KFSMKWFLAMPRAKAWMINTRTPDWLYKSPRTTYLQTWHGTPLKKIGLDISNVKMLGTNTQNYQDGFKKESQRWDYLVSP
NPYSTSIFQNAFHVSRDKILETGYPRNDKLSHKRNDTEYINGIKTRLNIPLDKKVIMYAPTWRDDEAIREGSYQFNVNFD
IEALRQALDDDYVILLRMHYLVVTRIDEHDDFVKDVSDYEDISDLYLISDALVTDYSSVMFDFGVLKRPQIFYAYDLDKY
GDELRGFYMDYKKELPGPIVENQTALIDALKQIDETANEYIEARTVFYQKFCSLEDGQASQRICQTIFK
>Q2G1C0 2.7.7.40~~~tarI~~~Ribitol-5-phosphate cytidylyltransferase 1~~~COG1211
MKYAGILAGGIGSRMGNVPLPKQFLDLDNKPILIHTLEKFILINDFEKIIIATPQQWMTHTKDTLRKFKISDERIEVIQG
GSDRNDTIMNIVKHIESTNGINDDDVIVTHDAVRPFLTHRIIKENIQAALEYGAVDTVIDAIDTIVTSKDNQTIDAIPVR
NEMYQGQTPQSFNINLLKESYAQLSDEQKSILSDACKIIVETNKPVRLVKGELYNIKVTTPYDLKVANAIIRGGIADD
>Q7A7V0 2.7.7.40~~~tarI1~~~Ribitol-5-phosphate cytidylyltransferase 1~~~
MKYAGILAGGIGSRMGNVPLPKQFLDLDNKPILIHTLEKFILINDFEKIIIATPQQWMTHTKDTLRKFKISDERIEVIQG
GSDRNDTIMNIVKHIESTNGINDDDVIVTHDAVRPFLTHRIIKENIQAALEYGAVDTVIDAIDTIVTSKDNQTIDAIPVR
NEMYQGQTPQSFNINLLKESYAQLSDEQKSILSDACKIIVETNKPVRLVKGELYNIKVTTPYDLKVANAIIRGGIADD
>Q2G2C4 2.7.7.40~~~tarI'~~~Ribitol-5-phosphate cytidylyltransferase 2~~~COG1211
MIYAGILAGGIGSRMGNVPLPKQFLDIDNKPILIHTIEKFILVSEFNEIIIATPAQWISHTQDILKKYNITDQRVKVVAG
GTDRNETIMNIIDHIRNVNGINNDDVIVTHDAVRPFLTQRIIKENIEVAAKYGAVDTVIEAIDTIVMSKDKQNIHSIPVR
NEMYQGQTPQSFNIKLLQDSYRALSSEQKEILSDACKIIVESGHAVKLVRGELYNIKVTTPYDLKVANAIIQGDIADD
>P65177 2.7.7.40~~~tarI2~~~Ribitol-5-phosphate cytidylyltransferase 2~~~
MIYAGILAGGIGSRMGNVPLPKQFLDIDNKPILIHTIEKFILVSEFNEIIIATPAQWISHTQDILKKYNITDQRVKVVAG
GTDRNETIMNIIDHIRNVNGINNDDVIVTHDAVRPFLTQRIIKENIEVAAKYGAVDTVIEAIDTIVMSKDKQNIHSIPVR
NEMYQGQTPQSFNIKLLQDSYRALSSEQKEILSDACKIIVESGHAVKLVRGELYNIKVTTPYDLKVANAIIQGDIADD
>Q8RKI9 2.7.7.40~~~tarI~~~Ribitol-5-phosphate cytidylyltransferase~~~
MIYAEILAGGKGSRMGNVNMPKQFLPLNKRPIIIHTVEKFLLNDRFDKILIVSPKEWINHTKDILKKFIGQDDRLVVVEG
GSDRNESIMSGIRYIEKEFGIQDNDVIITHDSVRPFLTHRIIDENIDAVLQYGAVDTVISAIDTIIASEDQEFISDIPVR
DNMYQGQTPQSFRISKLVELYNKLSDEQKAVLTDACKICSLAGEKVKLVRGEVFNIKVTTPYDLKVANAILQERISQ
>Q720Y7 2.7.7.40~~~tarI~~~Ribitol-5-phosphate cytidylyltransferase~~~
MIYAQILAGGKGTRMGNVSMPKQFLPLNGKPIIVHTVEKFILNTRFDKILISSPKEWMNHAEDNIKKYISDDRIVVIEGG
EDRNETIMNGIRFVEKTYGLTDDDIIVTHDAVRPFLTHRIIEENIDAALETGAVDTVIEALDTIVESSNHEVITDIPVRD
HMYQGQTPQSFNMKKVFNHYQNLTPEKKQILTDACKICLLAGDDVKLVKGEIFNIKITTPYDLKVANAIIQERIAND
>Q8DPI2 2.7.7.40~~~tarI~~~Ribitol-5-phosphate cytidylyltransferase~~~COG1211
MIYAGILAGGTGTRMGISNLPKQFLELGDRPILIHTIEKFVLEPSIEKIVVGVHGDWVSHAEDLVDKYLPLYKERIIITK
GGADRNTSIKNIIEAIDAYRPLTPEDIVVTHDSVRPFITLRMIQDNIQLAQNHDAVDTVVEAVDTIVESTNGQFITDIPN
RAHLYQGQTPQTFRCKDFMDLYGSLSDEEKEILTDACKIFVIKGKDVALAKGEYSNLKITTVTDLKIAKSMIEKD
>Q2G1B9 1.1.1.405~~~tarJ~~~Ribulose-5-phosphate reductase 1~~~COG1063
MINQVYQLVAPRQFEVTYNNVDIYSDYVIVRPLYMSICAADQRYYTGSRDENVLSQKLPMSLIHEGVGEVVFDSKGVFNK
GTKVVMVPNTPTEKDDVIAENYLKSSYFRSSGHDGFMQDFVLLNHDRAVPLPDDIDLSIISYTELVTVSLHAIRRFEKKS
ISNKNTFGIWGDGNLGYITAILLRKLYPESKIYVFGKTDYKLSHFSFVDDVFFINKIPEGLTFDHAFECVGGRGSQSAIN
QMIDYISPEGSIALLGVSEFPVEVNTRLVLEKGLTLIGSSRSGSKDFQDVVDLYIQYPDIVDKLALLKGQEFEIATINDL
TEAFEADLSTSWGKTVLKWIM
>Q2G1C4 1.1.1.405~~~tarJ'~~~Ribulose-5-phosphate reductase 2~~~COG1063
MINQVYQLVAPRQFDVTYNNVDIYGNHVIVRPLYLSICAADQRYYTGRRDENVLRKKLPMSLVHEAVGEVVFDSKGVFEK
GTKVVMVPNTPTEQHHIIAENYLASSYFRSSGYDGFMQDYVVMAHDRIVPLPNDIDLSTISYTELVSVSYHAIQRFERKS
IPLKTSFGIWGDGNLGYITAILLRKLYPEAKTYVFGKTDYKLSHFSFVDDIFTVNQIPDDLKIDHAFECVGGKGSQVALQ
QIVEHISPEGSIALLGVSELPVEVNTRLVLEKGLTLIGSSRSGSKDFEQVVDLYRKYPDIVEKLALLKGHEINVCTMQDI
VQAFEMDLSTSWGKTVLKWTI
>Q8DPI3 1.1.1.405~~~tarJ~~~Ribulose-5-phosphate reductase~~~COG1063
MRKTSKMINQIYQLTKPKFINVKYQEEAIDQENHILIRPNYMAVCHADQRYYQGKRDPKILNKKLPMAMIHESCGTVISD
PTGTYEVGQKVVMIPNQSPMQSDEEFYENYMTGTHFLSSGFDGFMREFVSLPKDRVVAYDAIEDTVAAITEFVSVGMHAM
NRLLTLAHSKRERIAVIGDGSLAFVVANIINYTLPEAEIVVIGRHWEKLELFSFAKECYITDNIPEDLAFDHAFECCGGD
GTGPAINDLIRYIRPQGTILMMGVSEYKVNLNTRDALEKGLILVGSSRSGRIDFENAIQMMEVKKFANRLKNILYLEEPV
REIKDIHRVFATDLNTAFKTVFKWEV
>Q8RKJ1 2.7.8.46~~~tarK~~~Teichoic acid ribitol-phosphate primase~~~
MKTFLTRIVKGVFGTAYKLLSALLPVQHDKVVIASYREDQLSDNFRGVYEKLKQDPSLRITLLFRKMDKGLIGRAAYLLH
LFCSLYHLATCRVLLLDDYYFPLYVVPKRKETVAIQLWHACGAFKKFGYSIVNKPFGPSSDYLKIVPVHSNYDYAIVSAP
AAVPHFAEAFQMEEKQILPLGIPRTDYFYHKEHIRTVLDEFHQAYPELKHKKKLLYAPTFRGSGHHQEGDATPLDLLQLK
SALHHKDYVVMLHLHPYMRKHAHTEEDDFVLDLTDSYSLYDLMAISDGLITDYSSVIFEYSLLKRPMYFYCPDLEDYLKE
RDFYYPFESFVPGPISKDVPSLVHDIESDHEADTKRIEAFSQTYITHQDGKSSERVADFISSFLTSGAD
>Q2G1C2 2.7.8.14~~~tarK~~~Teichoic acid ribitol-phosphate polymerase TarK~~~COG1887
MKIDGNTFICRFNVAILDDGYYLPMDKYLFVYHDQLEYIGQLNPNIIDQAYAALNEEQIEEYNELTTQNGKVNYLLAYDA
KVFRKGGVSQHTVYTITPEIASDVNEFVFDIEITLPQEKSGVIATSAHWLHKQGHKASFESRSFLFKAIFNITKLLHIKR
SKTILFTSDSRPNLSGNFKYVYDELLRQKVDFDYDIKTVFKENITDRRKWRDKFRLPYLLGKADYIFVDDFHPLIYTVRF
RPSQEIIQVWHAVGAFKTVGFSRTGKKGGPFIDSLNHRSYTKAYVSSETDIPFYAEAFGIREENVVPTGVPRTDVLFDEA
YATQIKQEMEDELPIIKGKKVILFAPTFRGNGHGTAHYPFFKIDFERLARYCEKHNAVVLFKMHPFVKNRLNISREHRQY
FIDVSDHREVNDILFVTDLLISDYSSLIYEYAVFKKPMIFYAFDLEDYITTRDFYEPFESFVPGKIVQSFDALMDALDNE
DYEVEKVVPFLDKHFKYQDGRSSERLVKDLFRR
>Q8RKJ2 2.7.8.47~~~tarL~~~Teichoic acid poly(ribitol-phosphate) polymerase~~~
MKLARKIKNRLFRSKKKTQKENTAVIVHPADNRVFSLFDKTKRIEENQQVPVRKISEFSWNGSILKIAGYMYIKGLPLQK
EDQVRKRLLLVNNGVLFTAVSLRDIPVDQLSIDTSNVPGAYKWAGFSQQINFSKLMNDKPLPQGEYKLFLEIEAVDDQNV
KHQEVHTVGNVSNFLSNDVYATKMEFHSAKKLMKFNLIVNYDEGEKTINLSCNKLQEIDPSLLELDTGKEANRFIRKLNT
SLFHFAYDVFRLLPIKSNKIVFASDSRLDVTGNFEFVYEELLKREENFDFKFFLKSSIRDRKSLSELMSMAYHFATSKII
FIDDFYPIIYPLKIRKNADLVQLWHAVGAFKTFGYSRIGLPGGPSPHSKNHRNYTKVIVSSENIRKHYAEGFGVDIENVI
ATGVPRTDFFFDEAKKAFVKERLYTEYPFLKDKKVILFAPTFRGNGQQSAHYPFEVLDFDRLYRELKDEYIFLFKIHPFV
RNDANIPYQYSDFFYDFSSFREINELLLVTDVLITDYSSVCFEYALLNKPMIFFSYDVDDYIRKRDFYYDYFDFIPGPLA
KTSDQMISIIKEEKYNFEQIDSFVHYFFDDLDGKASERVVDQIVFPQEEEPVDDKVLKR
>Q2G1B8 2.7.8.14~~~tarL~~~Teichoic acid ribitol-phosphate polymerase TarL~~~COG1887
MVKSKIYIDKIYWERVQLFVEGHSENLDLEDSNFVLRNLTETRTMKANDVKIDGNQFVCRFNVAILDNGYYLPEDKYLLV
NEQELDYIAQLNPDVINDAYQNLKPEQEEEYNELETQNGKINFLLQTYLKEFRKGGISKKTVYTVTPEISSDVNEFVLDV
VVTTPEVKSIYIVRKYKELRKYFRKQSFNTRQFIFKAIFNTTKFFHLKKGNTVLFTSDSRPTMSGNFEYIYNEMLRQNLD
KKYDIHTVFKANITDRRGIIDKFRLPYLLGKADYIFVDDFHPLIYTVRFRRSQEVIQVWHAVGAFKTVGFSRTGKKGGPF
IDSLNHRSYTKAYVSSETDIPFYAEAFGIKEKNVVPTGVPRTDVLFDEAYATQIKQEMEDELPIIKGKKVILFAPTFRGS
GHGTAHYPFFKIDFERLARYCEKNNAVVLFKMHPFVKNRLNIADKHKQYFVDVSDFREVNDILFITDLLISDYSSLIYEY
AVFKKPMIFYAFDLEDYITTRDFYEPYESFVPGKIVQSFDALMDALDNEDYEGEKVIPFLDKHFKYQDGRSSERLVRNLF
GS
>A0A0H2WWV6 2.4.1.70~~~tarM~~~Poly(ribitol-phosphate) alpha-N-acetylglucosaminyltransferase~~~
MKKIFMMVHELDVNKGGMTSSMFNRSKEFYDADIPADIVTFDYKGNYDEIIKALKKQGKMDRRTKMYNVFEYFKQISNNK
HFKSNKLLYKHISERLKNTIEIEESKGISRYFDITTGTYIAYIRKSKSEKVIDFFKDNKRIERFSFIDNKVHMKETFNVD
NKVCYQVFYDEKGYPYISRNINANNGAVGKTYVLVNKKEFKNNLALCVYYLEKLIKDSKDSIMICDGPGSFPKMFNTNHK
NAQKYGVIHVNHHENFDDTGAFKKSEKYIIENANKINGVIVLTEAQRLDILNQFDVENIFTISNFVKIHNAPKHFQTEKI
VGHISRMVPTKRIDLLIEVAELVVKKDNAVKFHIYGEGSVKDKIAKMIEDKNLERNVFLKGYTTTPQKCLEDFKLVVSTS
QYEGQGLSMIEAMISKRPVVAFDIKYGPSDFIEDNKNGYLIENHNINDMADKILQLVNNDVLAAEFGSKARENIIEKYST
ESILEKWLNLFNS
>Q6GX35 ~~~tarP~~~Translocated actin-recruiting phosphoprotein~~~
MTNSISGDQPTVTTFTSSTTSASGASGSLGASSVSTTANATVTQTANATNSAATSSIQTTGETVVNYTNSASAPTVTVST
SSSSTQATATSNKTSQAVAGKITSPDTSESSETSSTSSSDHIPSDYEPISTTENIYENIYESIDDSSTSGPENTSGGAAA
LNSLRGSSYSNYDDAAADYEPISTTENIYESIDDSSTSDPENTSGGAAALNSLRGSSYSNYDDAAADYEPISTTENIYEN
IYESIDDSSTSGPENTSGGAAALNSLRGSSYSNYDDAAADYEPISTTENIYESIDDSSTSDPENTSGGAAAALNSLRGSS
YSNYDDAAADYEPISTTENIYESIDDSSTSDPENTSGGAAALNSLRGSSYSNYDDAAADYEPISTTENIYENIYESIDGS
STSDPENTSGGAAAALNSLRGSSYTTGPRNEGVFGPGPEGLPDMSLPSYDPTNKTSLLTFLSNPHVKSKMLENSGHFVFI
DTDRSSFILVPNGNWDQVCSIKVQNGKTKEDLDIKDLENMCAKFCTGFNKFSGDWDSRVEPMMSAKAGVASGGNLPNTVI
INNKFKTCVAYGPWNSREASSGYTPSAWRRGHQVNFGEIFEKANDFNKINWGTQAGPSSEDDGISFSNETPGAGPAAAPS
PTPSSIPVINVNVNVGGTNVNIRDTNVNTTNTTPTTQSTDASTDTSDIDNINTNNQTDDINTTDKDSDGAGGVNGDISET
ESSSGDDSGSVSSSESDKNASVGNDGPAMKDILSAVRKHLDVVYPGDNGGSTEGPLQANQTLGDIVQDMETTGTSQETVV
SPWKGSTSSTGSAGGSGSVQTLLPSPPPTPSTTTLRTGTGATTTSLMMGGPIKADIITTGGGGRIPGGGTLEKLLPRIRA
HLDISFDGQGDLVSTEEPQLGSIVNKFRKETGSGGIVASVESAPGKPGSAQVLTGTGGDKGNLFQAAAAVTQALGNVAGK
VNLAIQGQKLSSLVNDDGKGSVGRDLFQAATQTTQALSSLIDTVG
>A0A0H3JNB0 2.4.1.-~~~tarP~~~Poly(ribitol-phosphate) beta-N-acetylglucosaminyltransferase TarP~~~
MKKVSVIMPTFNNGEKLHRTISSVLNQTMKSTDYELIIIDDHSNDNGETLNVIKKYKGLVRFKQLKKNSGNASVPRNTGL
KMSKAEYVFFLDSDDLLHERALEDLYNYGKENNSDLIIGKYGVEGKGRSVPKAIFEKGNVAKADIIDNSIFYALSVLKMF
KKSVIDKNKIKFKTFSKTAEDQLFTIEFLMNSKNYSIKTDYEYYIVVNDFESSNHLSVNKSTGNQYFATINEIYKAIYKS
PIYKNQEKRHQLAGKYTTRLLRHGQKKNFANSKMKYEDKIEWLNNFSKTINKVPRDSDKYVTQIFNLKLEAIRQNDLLAV
MIADKLL
>E0U4V7 2.4.1.53~~~tarQ~~~Poly(ribitol-phosphate) beta-glucosyltransferase~~~
MKISIVIPVYNSEDLISECLDSLVNQTMPKEDYEIICVDDKSTDSSLDILNQYKKKYENVVVIERTVNSGGPGAPRNDAI
KIAKGEYILFVDSDDYIGSEALLRWYNFSKENQSDITLGKLKGINGRGVPKSMFKETNPDVDLVDSKIVFTLGPQKLFKA
SLLKENKITFPTHIKAAEDQVFTMNAYLKAKKISVSADYDYYYLVKRDGEHMSVAYVPPENFYGAMEDIISAIKASDLEE
ARKIKLMAVFLNRHFDFSRTKNVTIKMKTDEERAEWFRYLSSFIHAVPEEADQFVLPHIKLRLLFIRNNDLRGLTQYERE
EQDIKKFCTVNNGELIARYPSLERYSISEELLKVNYKNKLEHYLQNIEFSDHSLSIQGTITHKLLDDETNKNQSLTGVFV
HRDTKAEKYIAPASYDNSTFTFECKFDELASAEEDLGVWDFFIESSIDGYKLRARIGNKRAAYKYSTKTMYLGHNALFVY
SARPYFTMNYDNLSIDIKKHAYTEAELSYETESKDLSFIFKDKQIYLPNHSKIIVNTGQSEISLPVKRIDLEPNCTKLTV
NVQSLLEQLAHVKKERLIEFAINTSQNKISAKVDNQAIILDTKSVERKSMLFFNKMVEVQYKLLTSKSKFYFQY
>A0A0H3JPC6 2.4.1.355~~~tarS~~~Poly(ribitol-phosphate) beta-N-acetylglucosaminyltransferase TarS~~~
MMKFSVIVPTYNSEKYITELLNSLAKQDFPKTEFEVVVVDDCSTDQTLQIVEKYRNKLNLKVSQLETNSGGPGKPRNVAL
KQAEGEFVLFVDSDDYINKETLKDAAAFIDEHHSDVLLIKMKGVNGRGVPQSMFKETAPEVTLLNSRIIYTLSPTKIYRT
ALLKDNDIYFPEELKSAEDQLFTMKAYLNANRISVLSDKAYYYATKREGEHMSSAYVSPEDFYEVMRLIAVEILNADLEE
AHKDQILAEFLNRHFSFSRTNGFSLKVKLEEQPQWINALGDFIQAVPERVDALVMSKLRPLLHYARAKDIDNYRTVEESY
RQGQYYRFDIVDGKLNIQFNEGEPYFEGIDIAKPKVKMTAFKFDNHKIVTELTLNEFMIGEGHYDVRLKLHSRNKKHTMY
VPLSVNANKQYRFNIMLEDIKAYLPKEKIWDVFLEVQIGTEVFEVRVGNQRNKYAYTAETSALIHLNNDFYRLTPYFTKD
FNNISLYFTAITLTDSISMKLKGKNKIILTGLDRGYVFEEGMASVVLKDDMIMGMLSQTSENEVEILLSKDIKKRDFKNI
VKLNTAHMTYSLK
>A0A0H3JVA1 2.4.1.355~~~tarS~~~Poly(ribitol-phosphate) beta-N-acetylglucosaminyltransferase TarS~~~
MMKFSVIVPTYNSEKYITELLNSLAKQDFPKTEFEVVVVDDCSTDQTLQIVEKYRNKLNLKVSQLETNSGGPGKPRNVAL
KQAEGEFVLFVDSDDYINKETLKDAAAFIDEHHSDVLLIKMKGVNGRGVPQSMFKETAPEVTLLNSRIIYTLSPTKIYRT
TLLKDNDIYFPEELKSAEDQLFTMKAYLNANRISVLSDKAYYYATKREGEHMSSAYVSPEDFYEVMRLIAVEILNADLEE
AHKDQILAEFLNRHFSFSRTNGFSLKVKLEDQPQWINALGDFIQAVPERVDALVMSKLRPLLHYARAKDIDNYRTVEESY
RQGQYYRFDIVDGKLNIQFNEGEPYFEGIDIAKPKVKMTAFKFDNHKIVTELTLNEFMIGEGHYDVRLKLHSRNKKHTMY
VPLSVNANKQYRFNIMLEDIKAYLPKEKIWDVFLEVQIGTEVFEVRVGNQRNKYAYTAETSALIHLNNDFYRLTPYFTKD
FNNISLYFTAITLTDSISLKLKGKNKIILTGLDRGYVFEEGMASVVLKDDMIMGMLSQTSENEVEILLSKDIKKRDFKNI
VKLNTAHMTYSLK
>A0A0H2ZJS4 ~~~tas1~~~(p)ppApp synthetase toxin Tas1~~~
MVAGAVAGALIGAAVVAATAATGGLAAVILAGSIAAGGLSMFQIVKGLTTIFELPEPTTGVLIRGSFNVYVNSRNAMRAG
DDVSATCSGLPLNHPLWPFPVLIAEGSATVYINGKPAARLQSKMVCGAHIKTGSQNTFIGGPTERVAFVLDLEEWLHTGL
EALGLAALAGGLLLAAMAGVAALVGVVAIGGLMMGGMALLGDLGDRLGPGYRDLFQGVAGMALLGFGPKMARLGNAPKGA
PKTQVPKGFEKVYGKAPAAKAEIDAVADGLAAKHGGRVAKAPIKSRERAMQKINNDYKGDPTKIKDLARNTIIVEGDKVN
TVAAELANRGAKVKVIDGNADPLGYSGVNSTMNTKAGIPGEIQVNSPEMIYAKESEDMARILLGNDTYDAVAAKAGVPGG
QGHKYYEDWRVLDPKSPEAQAIAEKSRAYYDAVRKGNGN
>P54507 ~~~tasA~~~Major biofilm matrix component~~~
MGMKKKLSLGVASAALGLALVGGGTWAAFNDIKSKDATFASGTLDLSAKENSASVNLSNLKPGDKLTKDFQFENNGSLAI
KEVLMALNYGDFKANGGSNTSPEDFLSQFEVTLLTVGKEGGNGYPKNIILDDANLKDLYLMSAKNDAAAAEKIKKQIDPK
FLNASGKVNVATIDGKTAPEYDGVPKTPTDFDQVQMEIQFKDDKTKDEKGLMVQNKYQGNSIKLQFSFEATQWNGLTIKK
DHTDKDGYVKENEKAHSEDKN
>P0A9T4 ~~~tas~~~Protein tas~~~COG0667
MQYHRIPHSSLEVSTLGLGTMTFGEQNSEADAHAQLDYAVAQGINLIDVAEMYPVPPRPETQGLTETYVGNWLAKHGSRE
KLIIASKVSGPSRNNDKGIRPDQALDRKNIREALHDSLKRLQTDYLDLYQVHWPQRPTNCFGKLGYSWTDSAPAVSLLDT
LDALAEYQRAGKIRYIGVSNETAFGVMRYLHLADKHDLPRIVTIQNPYSLLNRSFEVGLAEVSQYEGVELLAYSCLGFGT
LTGKYLNGAKPAGARNTLFSRFTRYSGEQTQKAVAAYVDIARRHGLDPAQMALAFVRRQPFVASTLLGATTMDQLKTNIE
SLHLELSEDVLAEIEAVHQVYTYPAP
>O31467 ~~~tatAd~~~Sec-independent protein translocase protein TatAd~~~COG1826
MFSNIGIPGLILIFVIALIIFGPSKLPEIGRAAGRTLLEFKSATKSLVSGDEKEEKSAELTAVKQDKNAG
>O05522 ~~~tatAy~~~Sec-independent protein translocase protein TatAy~~~COG1826
MPIGPGSLAVIAIVALIIFGPKKLPELGKAAGDTLREFKNATKGLTSDEEEKKKEDQ
>P69428 ~~~tatA~~~Sec-independent protein translocase protein TatA~~~COG1826
MGGISIWQLLIIAVIVVLLFGTKKLGSIGSDLGASIKGFKKAMSDDEPKQDKTSQDADFTAKTIADKQADTNQEQAKTED
AKRHDKEQV
>P9WGA1 ~~~tatA~~~Sec-independent protein translocase protein TatA~~~COG1826
MGSLSPWHWAILAVVVIVLFGAKKLPDAARSLGKSLRIFKSEVRELQNENKAEASIETPTPVQSQRVDPSAASGQDSTEA
RPA
>P69425 ~~~tatB~~~Sec-independent protein translocase protein TatB~~~COG1826
MFDIGFSELLLVFIIGLVVLGPQRLPVAVKTVAGWIRALRSLATTVQNELTQELKLQEFQDSLKKVEKASLTNLTPELKA
SMDELRQAAESMKRSYVANDPEKASDEAHTIHNPVVKDNEAAHEGVTPAAAQTQASSPEQKPETTPEPVVKPAADAEPKT
AAPSPSSSDKP
>P42252 ~~~tatC1~~~Sec-independent protein translocase protein TatCd~~~COG0805
MDKKETHLIGHLEELRRRIIVTLAAFFLFLITAFLFVQDIYDWLIRDLDGKLAVLGPSEILWVYMMLSGICAIAASIPVA
AYQLWRFVAPALTKTERKVTLMYIPGLFALFLAGISFGYFVLFPIVLSFLTHLSSGHFETMFTADRYFRFMVNLSLPFGF
LFEMPLVVMFLTRLGILNPYRLAKARKLSYFLLIVVSILITPPDFISDFLVMIPLLVLFEVSVTLSAFVYKKRMREETAA
AA
>O05523 ~~~tatC2~~~Sec-independent protein translocase protein TatCy~~~COG0805
MTRMKVNQMSLLEHIAELRKRLLIVALAFVVFFIAGFFLAKPIIVYLQETDEAKQLTLNAFNLTDPLYVFMQFAFIIGIV
LTSPVILYQLWAFVSPGLYEKERKVTLSYIPVSILLFLAGLSFSYYILFPFVVDFMKRISQDLNVNQVIGINEYFHFLLQ
LTIPFGLLFQMPVILMFLTRLGIVTPMFLAKIRKYAYFTLLVIAALITPPELLSHMMVTVPLLILYEISILISKAAYRKA
QKSSAADRDVSSGQ
>O67305 ~~~tatC~~~Sec-independent protein translocase protein TatC~~~COG0805
MPLTEHLRELRYRLIISIIAFLIGSGIAFYFAKYVFEILKEPILKSYPEVELITLSPTEPLFILIKISLAVGFIIASPVI
LYQFWRFIEPALYSHEKRAFIPLLLGSILLFMLGALFAYFIVLPLALKFLLGLGFTQLLATPYLSVDMYISFVLKLVVAF
GIAFEMPIVLYVLQKAGVITPEQLASFRKYFIVIAFVIGAIIAPDVSTQVLMAIPLLLLYEISIFLGKLATRKKKEIQKA
>P69423 ~~~tatC~~~Sec-independent protein translocase protein TatC~~~COG0805
MSVEDTQPLITHLIELRKRLLNCIIAVIVIFLCLVYFANDIYHLVSAPLIKQLPQGSTMIATDVASPFFTPIKLTFMVSL
ILSAPVILYQVWAFIAPALYKHERRLVVPLLVSSSLLFYIGMAFAYFVVFPLAFGFLANTAPEGVQVSTDIASYLSFVMA
LFMAFGVSFEVPVAIVLLCWMGITSPEDLRKKRPYVLVGAFVVGMLLTPPDVFSQTLLAIPMYCLFEIGVFFSRFYVGKG
RNREEENDAEAESEKTEE
>P9WG97 ~~~tatC~~~Sec-independent protein translocase protein TatC~~~COG0805
MRAAGLLKRLNPRNRRSRVNPDATMSLVDHLTELRTRLLISLAAILVTTIFGFVWYSHSIFGLDSLGEWLRHPYCALPQS
ARADISADGECRLLATAPFDQFMLRLKVGMAAGIVLACPVWFYQLWAFITPGLYQRERRFAVAFVIPAAVLFVAGAVLAY
LVLSKALGFLLTVGSDVQVTALSGDRYFGFLLNLLVVFGVSFEFPLLIVMLNLAGLLTYERLKSWRRGLIFAMFVFAAIF
TPGSDPFSMTALGAALTVLLELAIQIARVHDKRKAKREAAIPDDEASVIDPPSPVPAPSVIGSHDDVT
>P27859 3.1.11.-~~~tatD~~~3'-5' ssDNA/RNA exonuclease TatD~~~COG0084
MFDIGVNLTSSQFAKDRDDVVACAFDAGVNGLLITGTNLRESQQAQKLARQYSSCWSTAGVHPHDSSQWQAATEEAIIEL
AAQPEVVAIGECGLDFNRNFSTPEEQERAFVAQLRIAADLNMPVFMHCRDAHERFMTLLEPWLDKLPGAVLHCFTGTREE
MQACVAHGIYIGITGWVCDERRGLELRELLPLIPAEKLLIETDAPYLLPRDLTPKPSSRRNEPAHLPHILQRIAHWRGED
AAWLAATTDANVKTLFGIAF
>O08343 3.1.-.-~~~tatD~~~Uncharacterized metal-dependent hydrolase TatD~~~COG0084
MVDAHTHLDACGARDADTVRSLVERAAAAGVTAVVTVADDLESARWVTRAAEWDRRVYAAVALHPTRADALTDAARAELE
RLVAHPRVVAVGETGIDMYWPGRLDGCAEPHVQREAFAWHIDLAKRTGKPLMIHNRQADRDVLDVLRAEGAPDTVILHCF
SSDAAMARTCVDAGWLLSLSGTVSFRTARELREAVPLMPVEQLLVETDAPYLTPHPHRGLANEPYCLPYTVRALAELVNR
RPEEVALITTSNARRAYGLGWMRQ
>P0A843 ~~~tatE~~~Sec-independent protein translocase protein TatE~~~COG1826
MGEISITKLLVVAALVVLLFGTKKLRTLGGDLGAAIKGFKKAMNDDDAAAKKGADVDLQAEKLSHKE
>Q47537 ~~~tauA~~~Taurine-binding periplasmic protein~~~COG4521
MAISSRNTLLAALAFIAFQAQAVNVTVAYQTSAEPAKVAQADNTFAKESGATVDWRKFDSGASIVRALASGDVQIGNLGS
SPLAVAASQQVPIEVFLLASKLGNSEALVVKKTISKPEDLIGKRIAVPFISTTHYSLLAALKHWGIKPGQVEIVNLQPPA
IIAAWQRGDIDGAYVWAPAVNALEKDGKVLTDSEQVGQWGAPTLDVWVVRKDFAEKHPEVVKAFAKSAIDAQQPYIANPD
VWLKQPENISKLARLSGVPEGDVPGLVKGNTYLTPQQQTAELTGPVNKAIIDTAQFLKEQGKVPAVANDYSQYVTSRFVQ
>Q47538 7.6.2.7~~~tauB~~~Taurine import ATP-binding protein TauB~~~COG4525
MLQISHLYADYGGKPALEDINLTLESGELLVVLGPSGCGKTTLLNLIAGFVPYQHGSIQLAGKRIEGPGAERGVVFQNEG
LLPWRNVQDNVAFGLQLAGIEKMQRLEIAHQMLKKVGLEGAEKRYIWQLSGGQRQRVGIARALAANPQLLLLDEPFGALD
AFTRDQMQTLLLKLWQETGKQVLLITHDIEEAVFMATELVLLSSGPGRVLERLPLNFARRFVAGESSRSIKSDPQFIAMR
EYVLSRVFEQREAFS
>P37610 1.14.11.17~~~tauD~~~Alpha-ketoglutarate-dependent taurine dioxygenase~~~COG2175
MSERLSITPLGPYIGAQISGADLTRPLSDNQFEQLYHAVLRHQVVFLRDQAITPQQQRALAQRFGELHIHPVYPHAEGVD
EIIVLDTHNDNPPDNDNWHTDVTFIETPPAGAILAAKELPSTGGDTLWTSGIAAYEALSVPFRQLLSGLRAEHDFRKSFP
EYKYRKTEEEHQRWREAVAKNPPLLHPVVRTHPVSGKQALFVNEGFTTRIVDVSEKESEALLSFLFAHITKPEFQVRWRW
QPNDIAIWDNRVTQHYANADYLPQRRIMHRATILGDKPFYRAG
>Q88RA3 1.14.11.17~~~tauD~~~Alpha-ketoglutarate-dependent taurine dioxygenase~~~COG2175
MSLTITPLSPALGAQISGVDISRDISAEERDAIEQALLQHQVLFLRDQPINPEQQARFAARFGDLHIHPIYPNVPDTPQV
LVLDTAVTDVRDNAVWHTDVTFLPTPALGAVLSAKQLPAYGGDTLWASGIAAFEALSAPLREMLDGLTATHDFTKSFPLE
RFGTTPQDLARWEATRRNNPPLSHPVVRTHPVSGRKALFVNEGFTTRINELSELESDALLRLLFAHATRPEFSIRWRWQE
NDVAFWDNRVTQHFAVDDYRPNRRVMHRATILGDAPF
>Q0K020 ~~~tauE~~~Probable sulfite/organosulfonate exporter TauE~~~COG0730
MKAELLLPLLGLQALLGAGTYFQTVTGFGLGMIVMGVTSGLGLAPVATVAAVVSLVTLANSACALPGKLQHIDWRAVAAA
AIGILPSVVVGVLVLEYLSSSAATLLQLLLGAVILYGGLSAALKPAPLAQRSGDGTFFVSGVFGGLLSGMFGVSGPPLIF
QFYRQPMKPVEIRCALILVFTVTSTVRTLFSAWQGQLDAAVCVQAAIAVPVVVIATLLGRRFPPPFSPATTRRVAFGVLI
GIGASLMLPAISAWVL
>D5AKX9 ~~~tauR~~~HTH-type transcriptional regulator TauR~~~COG1167
MAIPVESFFLAPGGQGSLQHRLRQMVTEGILSGRFRPGDRMPSTRALAAHLGVARITVTLAYADLVASDYLLARGRSGTF
VSAAAPDARKARPLPRDGARTDWARLLHPRAQGLPRPDRPRDWSLYRYPFIYGQADPELFDHQNWRACALQALGRREFHR
LSADCYDEDDPLLVEYILRHILPRRGIAAVPSEVLITMGAQNGLWLAAQVLLGPGERAAMENPGYPGTRAVLGTTGAEVL
SVDVDDRGLVPAQLPARLKLVVTTASHHCPTNATLPVERRLALLAAAEAGDFLILEDDYEFEMSFLQSAAPSLKSLDAGG
RVVHVGSFSKSLFPGLRLGYLVAPAPFVAAVRALRATVLRHPPGQLQRTLALFLSLGHYDALVARMKAAYRLRREVMTKA
IEDNGLQIAGQGGFGGSSFWMQAPGAVDTEDLALRLRAEGVLIEPGRVFFDPARERRNFYRLAYSSIGPAAIPEGIARIA
RALR
>Q09056 ~~~tbp1~~~Transferrin-binding protein A~~~
MQQQHLFRLNILCLSLMTALPAYAENVQAGQAQEKQLDTIQVKAKKQKTRRDNEVTGLGKLVKTADTLSKEQVLDIRDLT
RYDPGIAVVEQGRGASSGYSIRGMDKNRVSLTVDGLAQIQSYTAQAALGGTRTAGSSGAINEIEYENVKAVEISKGSNSV
EQGSGALAGSVAFQTKTADDVIGEGRQWGIQSKTAYSGKNRGLTQSIALAGRIGGAEALLIHTGRRAGEIRAHEDAGRGV
QSFNRLVPVEDSSEYAYFIVEDECEGKNYETCKSKPKKDVVGKDERQTVSTRDYTGPNRFLADPLSYESRSWLFRPGFRF
ENKRHYIGGILEHTQQTFDTRDMTVPAFLTKAVFDANSKQAGSLPGNGKYAGNHKYGGLFTNGENGALVGAEYGTGVFYD
ETHTKSRYGLEYVYTNADKDTWADYARLSYDRQGIGLDNHFQQTHCSADGSDKYCRPSADKPFSYYKSDRVIYGESHRLL
QAAFKKSFDTAKIRHNLSVNLGFDRFDSNLRHQDYYYQHANRAYSSKTPPKTANPNGDKSKPYWVSIGGGNVVTGQICLF
GNNTYTDCTPRSINGKSYYAAVRDNVRLGRWADVGAGLRYDYRSTHSDDGSVSTGTHRTLSWNAGIVLKPADWLDLTYRT
STGFRLPSFAEMYGWRSGVQSKAVKIDPEKSFNKEAGIVFKGDFGNLEASWFNNAYRDLIVRGYEAQIKNGKEEAKGDPA
YLNAQSARITGINILGKIDWNGVWDKLPEGWYSTFAYNRVHVRDIKKRADRTDIQSHLFDAIQPSRYVVGLGYDQPEGKW
GVNGMLTYSKAKEITELLGSRALLNGNSRNTKATARRTRPWYIVDVSGYYTIKKHFTLRAGVYNLLNYRYVTWENVRQTA
GGAVNQHKNVGVYNRYAAPGRNYTFSLEMKF
>Q06987 ~~~tbp1~~~Transferrin-binding protein A~~~
MQQQHLFRLNILCLSLMTALPVYAENVQAEQAQEKQLDTIQVKAKKQKTRRDNEVTGLGKLVKSSDTLSKEQVLNIRDLT
RYDPGIAVVEQGRGASSGYSIRGMDKNRVSLTVDGVSQIQSYTAQAALGGTRTAGSSGAINEIEYENVKAVEISKGSNSS
EYGNGALAGSVAFQTKTAADIIGEGKQWGIQSKTAYSGKDHALTQSLALAGRSGGAEALLIYTKRRGREIHAHKDAGKGV
QSFNRLVLDEDKKEGGSQYRYFIVEEECHNGYAACKNKLKEDASVKDERKTVSTQDYTGSNRLLANPLEYGSQSWLFRPG
WHLDNRHYVGAVLERTQQTFDTRDMTVPAYFTSEDYVPGSLKGLGKYSGDNKAERLFVQGEGSTLQGIGYGTGVFYDERH
TKNRYGVEYVYHNADKDTWADYARLSYDRQGIDLDNRLQQTHCSHDGSDKNCRPDGNKPYSFYKSDRMIYEESRNLFQAV
FKKAFDTAKIRHNLSINLGYDRFKSQLSHSDYYLQNAVQAYDLITPKKPPFPNGSKDNPYRVSIGKTTVNTSPICRFGNN
TYTDCTPRNIGGNGYYAAVQDNVRLGRWADVGAGIRYDYRSTHSEDKSVSTGTHRNLSWNAGVVLKPFTWMDLTYRASTG
FRLPSFAEMYGWRAGESLKTLDLKPEKSFNREAGIVFKGDFGNLEASYFNNAYRDLIAFGYETRTQNGQTSASGDPGYRN
AQNARIAGINILGKIDWHGVWGGLPDGLYSTLAYNRIKVKDADIRADRTFVTSYLFDAVQPSRYVLGLGYDHPDGIWGIN
TMFTYSKAKSVDELLGSQALLNGNANAKKAASRRTRPWYVTDVSGYYNIKKHLTLRAGVYNLLNYRYVTWENVRQTAGGA
VNQHKNVGVYNRYAAPGRNYTFSLEMKF
>Q9K0U9 ~~~tbp1~~~Transferrin-binding protein A~~~
MQQQHLFRFNILCLSLMTALPAYAENVQAGQAQEKQLDTIQVKAKKQKTRRDNEVTGLGKLVKSSDTLSKEQVLNIRDLT
RYDPGIAVVEQGRGASSGYSIRGMDKNRVSLTVDGVSQIQSYTAQAALGGTRTAGSSGAINEIEYENVKAVEISKGSNSV
EQGSGALAGSVAFQTKTADDVIGEGRQWGIQSKTAYSGKNRGLTQSIALAGRIGGAEALLIHTGRRAGEIRAHEDAGRGV
QSFNRLVPVEDSSNYAYFIVKEECKNGSYETCKANPKKDVVGKDERQTVSTRDYTGPNRFLADPLSYESRSWLFRPGFRF
ENKRHYIGGILEHTQQTFDTRDMTVPAFLTKAVFDANKKQAGSLPGNGKYAGNHKYGGLFTNGENGALVGAEYGTGVFYD
ETHTKSRYGLEYVYTNADKDTWADYARLSYDRQGIGLDNHFQQTHCSADGSDKYCRPSADKPFSYYKSDRVIYGESHRLL
QAAFKKSFDTAKIRHNLSVNLGFDRFGSNLRHQDYYYQHANRAYSSNTPPQNNGKKISPNGSETSPYWVTIGRGNVVTGQ
ICRLGNNTYTDCTPRSINGKSYYAAVRDNVRLGRWADVGAGLRYDYRSTHSDDGSVSTGTHRTLSWNAGIVLKPTDWLDL
TYRTSTGFRLPSFAEMYGWRAGVQSKAVKIDPEKSFNKEAGIVFKGDFGNLEASWFNNAYRDLIVRGYEAQIKDGKEEAK
GDPAYLNAQSARITGINILGKIDWNGVWDKLPEGWYSTFAYNRVRVRDIKKRADRTDIQSHLFDAIQPSRYVVGLGYDQP
EGKWGVNGMLTYSKAKEITELLGSRALLNGNSRNTKATARRTRPWYIVDVSGYYTVKKHFTLRAGVYNLLNYRYVTWENV
RQTAGGAVNQHKNVGVYNRYAAPGRNYTFSLEMKF
>Q09057 ~~~tbpB~~~Transferrin-binding protein B~~~
MNNPLVNQAAMVLPVFLLSACLGGGGSFDLDSVDTEAPRPAPKYQDVSSEKPQAQKDQGGYGFAMRLKRRNWYPGAEESE
VKLNESDWEATGLPTKPKELPKRQKSVIEKVETDGDSDIYSSPYLTPSNHQNGSAGNGVNQPKNQATGHENFQYVYSGWF
YKHAASEKDFSNKKIKSGDDGYIFYHGEKPSRQLPASGKVIYKGVWHFVTDTKKGQDFREIIQPSKKQGDRYSGFSGDGS
EEYSNKNESTLKDDHEGYGFTSNLEVDFGNKKLTGKLIRNNASLNNNTNNDKHTTQYYSLDAQITGNRFNGTATATDKKE
NETKLHPFVSDSSSLSGGFFGPQGEELGFRFLSDDQKVAVVGSAKTKDKLENGAAASGSTGAAASGGAAGTSSENSKLTT
VLDAVELTLNDKKIKNLDNFSNAAQLVVDGIMIPLLPKDSESGNTQADKGKNGGTEFTRKFEHTPESDKKDAQAGTQTNG
AQTASNTAGDTNGKTKTYEVEVCCSNLNYLKYGMLTRKNSKSAMQAGGNSSQADAKTEQVEQSMFLQGERTDEKEIPTDQ
NVVYRGSWYGHIANGTSWSGNASDKEGGNRAEFTVNFADKKITGKLTAENRQAQTFTIEGMIQGNGFEGTAKTAESGFDL
DQKNTTRTPKAYITDAKVKGGFYGPKAEELGGWFAYPGDKQTEKATATSSDGNSASSATVVFGAKRQQPVQ
>Q06988 ~~~tbpB~~~Transferrin-binding protein B~~~
MNNPLVNQAAMVLPVFLLSACLGGGGSFDLDSVETVQDMHSKPKYEDEKSQPESQQDVSENSGAAYGFAVKLPRRNAHFN
PKYKEKHKPLGSMDWKKLQRGEPNSFSERDELEKKRGSSELIESKWEDGQSRVVGYTNFTYVRSGYVYLNKNNIDIKNNI
VLFGPDGYLYYKGKEPSKELPSEKITYKGTWDYVTDAMEKQRFEGLGSAAGGDKSGALSALEEGVLRNQAEASSGHTDFG
MTSEFEVDFSDKTIKGTLYRNNRITQNNSENKQIKTTRYTIQATLHGNRFKGKALAADKGATNGSHPFISDSDSLEGGFY
GPKGEELAGKFLSNDNKVAAVFGAKQKDKKDGENAAGPATETVIDAYRITGEEFKKEQIDSFGDVKKLLVDGVELSLLPS
EGNKAAFQHEIEQNGVKATVCCSNLDYMSFGKLSKENKDDMFLQGVRTPVSDVAARTEANAKYRGTWYGYIANGTSWSGE
ASNQEGGNRAEFDVDFSTKKISGTLTAKDRTSPAFTITAMIKDNGFSGVAKTGENGFALDPQNTGNSHYTHIEATVSGGF
YGKNAIEMGGSFSFPGNAPEGKQEKASVVFGAKRQQLVQ
>Q4QLR5 ~~~tbpB~~~Transferrin-binding protein B~~~
MKSVPLITGGLSFLLSACSGGGGSFDVDDVSNPSSSKPRYQDDTSSSRTKSNLEKLSIPSLGGGMKLVAQNLSGNKEPSF
LNENGYISYFSSPSTIEDDVKNVKTENKIHTNPIGLEPNRALQDPNLQKYVYSGLYYIENWKDFSKLATEKKAYSGHYGY
AFYYGNKTATDLPVSGVATYKGTWDFITATKYGQNYSLFSNARGQAYFRRSATRGDIDLENNSKNGDIGLISEFSADFGT
KKLTGQLSYTKRKTDIQQYEKEKLYDIDAHIYSNRFRGKVTPTKSTSDEHPFTSEGTLEGGFYGPNAEELGGKFLARDKR
VFGVFSAKETPETEKEKLSKETLIDGKLITFSTKTADATTSTTASTTADVKTDEKNFTTKDISSFGEADYLLIDNYPVPL
FPEGDTDDFVTSKHHDIGNKTYKVEACCKNLSYVKFGMYYEDKEKKNTNQTGQYHQFLLGLRTPSSQIPVTGNVKYLGSW
FGYIGDDKTSYSTTGNKQQDKNAPAEFDVNFDNKTLTGKLKRADSQNTVFNIEATFKNGSNAFEGKATANVVIDPKNTQA
TSKVNFTTTVNGAFYGPHATELGGYFTYNGNNPTATNSESSSTVPSPPNSPNARAAVVFGAKRQVEKTNK
>P0DTW7 ~~~tbpB~~~Transferrin-binding protein B~~~
MKHIPLTTLCVAISAVLLTACGGSGGSNPPAPTPIPNASGSGNTGNTGNAGGTDNTANAGNTGGASSGTGSANTPEPKYQ
DVPTDKNEKEQVSPIQEPAMGYGMALSKINLHNRQDTPLDEKNIITLDGKKQVAEGKKSPLPFSLDVENKLLDGYIAKMD
KADKNAIRRRIESENKAKPLSEAELAEKIKEAVRKSYEFQQVMTSLENKIFHSNDGTTKATTRDLQYVDYGYYLANDANY
LTVKTDKPKLWNSGPVGGVFYNGSTTAKELPTQDAVKYKGHWDFMTDVANKGNRFSEVKGTRQAGWWYGASSKDEYNRLL
TDEKNKPDGYNGEYGHSSEFTVNFKEKKLTGGLFSNLQDSHKQKVTKTKRYDIDANIHGNRFRGSAIASDKEKDSETKHP
FTSDAKDRLEGGFYGPKGEELAGKFLTDDNKLFGVFGAKQESKADKTEAILDAYALGAFNKNDANTFTPFTKKQLDNFGN
AKKLVLGSTVINLVSTDATKNEFTEDKPKSATNKAGETLMVNDKVSVKTYGYGRNFEYLKFGELSVGGSHSVFLQGERTA
TTGDKAVPTEGKAKYLGNWVGYITGTGTGKSFNEAQDIADFDIDFKNKTVNGKLTTKGRTDPVFNITGEISGNGWTGKAS
TAKADAGGYNIDSNGTNKSIVIRDADVTGGFYGPNATEMGGSFTHNTNDSKASVVFGTKRQEEVKP
>Q9K0V0 ~~~tbpB~~~Transferrin-binding protein B~~~
MNNPLVNQAAMVLPVFLLSACLGGGGSFDLDSVDTEAPRPAPKYQDVFSEKPQAQKDQGGYGFAMRLKRRNWYPQAKEDE
VKLDESDWEATGLPDEPKELPKRQKSVIEKVETDSDNNIYSSPYLKPSNHQNGNTGNGINQPKNQAKDYENFKYVYSGWF
YKHAKREFNLKVEPKSAKNGDDGYIFYHGKEPSRQLPASGKITYKGVWHFATDTKKGQKFREIIQPSKSQGDRYSGFSGD
DGEEYSNKNKSTLTDGQEGYGFTSNLEVDFHNKKLTGKLIRNNANTDNNQATTTQYYSLEAQVTGNRFNGKATATDKPQQ
NSETKEHPFVSDSSSLSGGFFGPQGEELGFRFLSDDQKVAVVGSAKTKDKPANGNTAAASGGTDAAASNGAAGTSSENGK
LTTVLDAVELKLGDKEVQKLDNFSNAAQLVVDGIMIPLLPEASESGNNQANQGTNGGTAFTRKFDHTPESDKKDAQAGTQ
TNGAQTASNTAGDTNGKTKTYEVEVCCSNLNYLKYGMLTRKNSKSAMQAGESSSQADAKTEQVEQSMFLQGERTDEKEIP
SEQNIVYRGSWYGYIANDKSTSWSGNASNATSGNRAEFTVNFADKKITGTLTADNRQEATFTIDGNIKDNGFEGTAKTAE
SGFDLDQSNTTRTPKAYITDAKVQGGFYGPKAEELGGWFAYPGDKQTKNATNASGNSSATVVFGAKRQQPVR
>Q01551 1.14.13.7~~~tbuD~~~Phenol 2-monooxygenase~~~
MTKYNEAYCDVLIVGAGPAGVMAAAHLLSYGTTARPHRVRIFDATKEVNGSDESTESLSTDVIADALNSGASGPEKDAAS
TTEDLPMLVTTLQVSDVLHDTGDDTKIAYRETATEQQVLLLADTTANTSSTMNPRSMCEAGCRFHQIYQGHCFPEYELDS
ERLRSVDGRAQVLEDEHETGQLRLERLGRPEELLELDEENSMSVVTNLKAAPYKFLMKDVDENFPGELSTSGGKTTSISA
DESAIDAALHAVWDADDLGAAWHLDEASGLRAVDWNAAQWFKSGQPWTPDAAKSLQEGRVFLAGDARHRHPPLTGIGKNT
SIADCYNLTWKLLGVLLGVARADPARTYVAERVYIRMRAATDIAVDAEMESLAAKWITVQLTLSRSWISSAKEAERWDAV
LRDSAMSASKPMWTTSDMRASFDAGLMGHGHAHDHVTPTIKEFASSSISRSISELASTSWWESRGWGNGGPFESLMEDAR
WTGAVESNCRYAAYDRDAPVLHEHVAWVTRFTSRARTAVLEAAVGQAHVVDCWDVGLVEPALDDLDSAGAGLHVAHHADQ
WPAQLDEAVWPRESLSDWRIVTDTSATGEGYQTSPREAPGDYADLNADNAKAHFNGQFAGHKAYGDAAAADGGGCHGRIL
VGPAVRGRHLHREIPLGEECQRAAQPLFKEV
>A6U483 ~~~tcaA~~~Membrane-associated protein TcaA~~~
MKSCPKCGQQAQDDVQICTQCGHKFDSRQALYRKSTDEDIQTNNIKMRKMVPWAIVFFILILIIILFFLLRNFNSPEAQT
KILVNAIENNDKQKVATLLSTKDNKVDSEEAKVYINYIKDEVGLKQFVSDLKNTVHKLNKSKTSVASYIQTRSGQNILRV
SKNGTRYIFFDNMSFTAPTKQPIVKPKEKTKYEFKSGGKKKMVIAEANKVTPIGNFIPGTYRIPAMKSTENGDFAGYLKF
DFRQSNSETVDVTEDFEEANITVTLKGDTKLNDSSKKVTINDREMAFSSSKTYGPYPQNKDITISASGKAKGKTFTTQTK
TIKASDLKYNTEITLNFDSEDIEDYVEKKEKEENSLKNKLIEFFAGYSLANNAAFNQSDFDFVSSYIKKGSSFYDDVKKR
VSKGSLMMISSPQIIDAEKHGDKITATVRLINENGKQVDKEYELEQGSQDRLQLIKTSEK
>Q2G2I1 ~~~tcaA~~~Membrane-associated protein TcaA~~~COG4640
MKSCPKCGQQAQDDVQICTQCGHKFDSRQALYRKSTDEDIQTNNIKMRKMVPWAIGFFILILIIILFFLLRNFNSPEAQT
KILVNAIENNDKQKVATLLSTKDNKVDSEEAKVYINYIKDEVGLKQFVSDLKNTVHKLNKSKTSVASYIQTRSGQNILRV
SKNGTRYIFFDNMSFTAPTKQPIVKPKEKTKYEFKSGGKKKMVIAEANKVTPIGNFIPGTYRIPAMKSTENGDFAGHLKF
DFRQSNSETVDVTEDFEEANISVTLKGDTKLNDSSKKVTINDHEMAFSSSKTYGPYPQNKDITISASGKAKDKTFTTQTK
TIKASDLKYNTEITLNFDSEDIEDYVEKKEKEENSLKNKLIEFFAGYSLANNAAFNQSDFDFVSSYIKKGSSFYDDVKKR
VSKGSLMMISSPQIIDAEKHGDKITATVRLINENGKQVDKEYELEQGSQDRLQLIKTSEK
>A5IVD7 ~~~tcaA~~~Membrane-associated protein TcaA~~~
MKSCPKCGQQAQDDVQICTQCGHKFDSRQALYRKSTDEDIQTNNIKMRKMVPWAIVFFILILIIILFFLLRNFNSPEAQT
KILVNAIENNDKQKVATLLSTKDNKVDSEEAKVYINYIKDEVGLKQFVSDLKNTVHKLNKSKTSVASYIQTRSGQNILRV
SKNGTRYIFFDNMSFTAPTKQPIVKPKEKTKYEFKSGGKKKMVIAEANKVTPIGNFIPGTYRIPAMKSTENGDFAGYLKF
DFRQSNSETVDVTEDFEEANITVTLKGDTKLNDSSKKVTINDREMAFSSSKTYGPYPQNKDITISASGKAKGKTFTTQTK
TIKASDLKYNTEITLNFDSEDIEDYVEKKEKEENSLKNKLIEFFAGYSLANNAAFNQSDFDFVSSYIKKGSSFYDDVKKR
VSKGSLMMISSPQIIDAEKHGDKITATVRLINENGKQVDKEYELEQGSQDRLQLIKTSEK
>Q5HDJ9 ~~~tcaA~~~Membrane-associated protein TcaA~~~
MKSCPKCGQQAQDDVQICTQCGHKFDSRQAFYRKSTDEDIQTNNIKMRKMVPWAIGFFILILIIILFFLLRNFNSPEAQT
KILVNAIENNDKQKVATLLSTKDNKVDSEEAKVYINYIKDEVGLKQFVSDLKNTVHKLNKSKTSVASYIQTRSGQNILRV
SKNGTRYIFFDNMSFTAPTKQPIVKPKEKTKYEFKSGGKKKMVIAEANKVTPIGNFIPGTYRIPAMKSTENGDFAGHLKF
DFRQSNSETVDVTEDFEEANISVTLKGDTKLNDSSKKVTINDHEMAFSSSKTYGPYPQNKDITISASGKAKDKTFTTQTK
TIKASDLKYNTEITLNFDSEDIEDYVEKKEKEENSLKNKLIEFFAGYSLANNAAFNQSDFDFVSSYIKKGSSFYDDVKKR
VSKGSLMMISSPQIIDAEKHGDKITATVRLINENGKQVDKEYELEQGSQDRLQLIKTSEK
>A6QJJ7 ~~~tcaA~~~Membrane-associated protein TcaA~~~
MKSCPKCGQQAQDDVQICTQCGHKFDSRQALYRKSTDEDIQTNNIKMRKMVPWAIGFFILILIIILFFLLRNFNSPEAQT
KILVNAIENNDKQKVATLLSTKDNKVDSEEAKVYINYIKDEVGLKQFVSDLKNTVHKLNKSKTSVASYIQTRSGQNILRV
SKNGTRYIFFDNMSFTAPTKQPIVKPKEKTKYEFKSGGKKKMVIAEANKVTPIGNFIPGTYRIPAMKSTENGDFAGHLKF
DFRQSNSETVDVTEDFEEANISVTLKGDTKLNDSSKKVTINDHEMAFSSSKTYGPYPQNKDITISASGKAKDKTFTTQTK
TIKASDLKYNTEITLNFDSEDIEDYVEKKEKEENSLKNKLIEFFAGYSLANNAAFNQSDFDFVSSYIKKGSSFYDDVKKR
VSKGSLMMISSPQIIDAEKHGDKITATVRLINENGKQVDKEYELEQGSQDRLQLIKTSEK
>Q99RS0 ~~~tcaA~~~Membrane-associated protein TcaA~~~
MKSCPKCGQQAQDDVQICTQCGHKFDSRQALYRKSTDEDIQTNNIKMRKMVPWAIGFFILILIIILFFLLRNFNSPEAQT
KILVNAIENNDKQKVATLLSTKDNKVDSEEAKVYINYIKDEVGLKQFVSDLKNTVHKLNKSKTSVASYIQTRSGQNILRV
SKNGTRYIFFDNMSFTAPTKQPIVKPKEKTKYEFKSGGKKKMVIAEANKVTPIGNFILGTYRIPAMKSTENGDFAGYLKF
DFRQSNSETVDVTEDFEEANITVTLKGDTKLNDSSKKVTINDREMAFSSSKTYGPYPQNKDITISASGKAKGKTFTTQTK
TIKASDLKYNTEITLNFDSEDIEDYVEKKEKEENSLKNKLIEFFAGYSLANNAAFNQSDFDFVSSYIKKGSSFYDDVKKR
VSKGSLMMISSPQIIDAEKHGDKITATVRLINENGKQVDKEYELEQGSQDRLQLIKTSEK
>Q7A3X6 ~~~tcaA~~~Membrane-associated protein TcaA~~~
MKSCPKCGQQAQDDVQICTQCGHKFDSRQALYRKSTDEDIQTNNIKMRKMVPWAIGFFILILIIILFFLLRNFNSPEAQT
KILVNAIENNDKQKVATLLSTKDNKVDSEEAKVYINYIKDEVGLKQFVSDLKNTVHKLNKSKTSVASYIQTRSGQNILRV
SKNGTRYIFFDNMSFTAPTKQPIVKPKEKTKYEFKSGGKKKMVIAEANKVTPIGNFILGTYRIPAMKSTENGDFAGYLKF
DFRQSNSETVDVTEDFEEANITVTLKGDTKLNDSSKKVTINDREMAFSSSKTYGPYPQNKDITISASGKAKGKTFTTQTK
TIKASDLKYNTEITLNFDSEDIEDYVEKKEKEENSLKNKLIEFFAGYSLANNAAFNQSDFDFVSSYIKKGSSFYDDVKKR
VSKGSLMMISSPQIIDAEKHGDKITATVRLINENGKQVDKEYELEQGSQDRLQLIKTSEK
>A6QJJ8 ~~~tcaR~~~HTH-type transcriptional regulator TcaR~~~
MVKHLQDHIQFLEQFINNVNALTAKMLKDLQNEYEISLEQSNVLGMLNKEPLTISEITQRQGVNKAAVSRRIKKLIDAKL
VKLDKPNLNIDQRLKFITLTDKGRAYLKERNAIMTDIAQDITNDLNSEDIENVRQVLEVINHRIKTYSNHK
>P27098 1.13.11.-~~~tcbC~~~Chlorocatechol 1,2-dioxygenase~~~
MNERVKQVASALVDAIQKTLTEQRVTEEEWRAGVGYMMKLAEAKEVAVLLDAFFNHTIVDLKAQATRGSRPAMQGPYFLE
GAPVVAGALKTYEDDSHHPLVIRGAVRTDDGAPAAGAVIDVWHSTPDGKYSGIHDQIPTDMYRGKVVADAQGKYAVRTTM
PAPYQIPNKGPTGVLLEMMGSHTWRPAHVHFKVRKDGFAPLTTQYYFEGGDWVDSDCCKGVAPDLVMPTKTEGGAQVMDI
DFVIERAREHV
>P27099 5.5.1.7~~~tcbD~~~Chloromuconate cycloisomerase~~~
MKIEAISTTIVDVPTRRPLQMSFTTVHKQSYVIVQVKAGGLVGIGEGGSVGGPTWGSESAETIKVIIDNYLAPLLVGKDA
SNLSQARVLMDRAVTGNLSAKAAIDIALHDLKARALNLSIADLIGGTMRTSIPIAWTLASGDTARDIDSALEMIETRRHN
RFKVKLGARTPAQDLEHIRSIVKAVGDRASVRVDVNQGWDEQTASIWIPRLEEAGVELVEQPVPRANFGALRRLTEQNGV
AILADESLSSLSSAFELARDHAVDAFSLKLCNMGGIANTLKVAAVAEAAGISSYGGTMLDSTVGTAAALHVYATLPSLPY
GCELIGPWVLGDRLTQQDLEIKDFEVHLPLGSGLGVDLDHDKVRHYTRAA
>P16154 3.4.22.-~~~tcdA~~~Toxin A~~~
MSLISKEELIKLAYSIRPRENEYKTILTNLDEYNKLTTNNNENKYLQLKKLNESIDVFMNKYKTSSRNRALSNLKKDILK
EVILIKNSNTSPVEKNLHFVWIGGEVSDIALEYIKQWADINAEYNIKLWYDSEAFLVNTLKKAIVESSTTEALQLLEEEI
QNPQFDNMKFYKKRMEFIYDRQKRFINYYKSQINKPTVPTIDDIIKSHLVSEYNRDETVLESYRTNSLRKINSNHGIDIR
ANSLFTEQELLNIYSQELLNRGNLAAASDIVRLLALKNFGGVYLDVDMLPGIHSDLFKTISRPSSIGLDRWEMIKLEAIM
KYKKYINNYTSENFDKLDQQLKDNFKLIIESKSEKSEIFSKLENLNVSDLEIKIAFALGSVINQALISKQGSYLTNLVIE
QVKNRYQFLNQHLNPAIESDNNFTDTTKIFHDSLFNSATAENSMFLTKIAPYLQVGFMPEARSTISLSGPGAYASAYYDF
INLQENTIEKTLKASDLIEFKFPENNLSQLTEQEINSLWSFDQASAKYQFEKYVRDYTGGSLSEDNGVDFNKNTALDKNY
LLNNKIPSNNVEEAGSKNYVHYIIQLQGDDISYEATCNLFSKNPKNSIIIQRNMNESAKSYFLSDDGESILELNKYRIPE
RLKNKEKVKVTFIGHGKDEFNTSEFARLSVDSLSNEISSFLDTIKLDISPKNVEVNLLGCNMFSYDFNVEETYPGKLLLS
IMDKITSTLPDVNKNSITIGANQYEVRINSEGRKELLAHSGKWINKEEAIMSDLSSKEYIFFDSIDNKLKAKSKNIPGLA
SISEDIKTLLLDASVSPDTKFILNNLKLNIESSIGDYIYYEKLEPVKNIIHNSIDDLIDEFNLLENVSDELYELKKLNNL
DEKYLISFEDISKNNSTYSVRFINKSNGESVYVETEKEIFSKYSEHITKEISTIKNSIITDVNGNLLDNIQLDHTSQVNT
LNAAFFIQSLIDYSSNKDVLNDLSTSVKVQLYAQLFSTGLNTIYDSIQLVNLISNAVNDTINVLPTITEGIPIVSTILDG
INLGAAIKELLDEHDPLLKKELEAKVGVLAINMSLSIAATVASIVGIGAEVTIFLLPIAGISAGIPSLVNNELILHDKAT
SVVNYFNHLSESKKYGPLKTEDDKILVPIDDLVISEIDFNNNSIKLGTCNILAMEGGSGHTVTGNIDHFFSSPSISSHIP
SLSIYSAIGIETENLDFSKKIMMLPNAPSRVFWWETGAVPGLRSLENDGTRLLDSIRDLYPGKFYWRFYAFFDYAITTLK
PVYEDTNIKIKLDKDTRNFIMPTITTNEIRNKLSYSFDGAGGTYSLLLSSYPISTNINLSKDDLWIFNIDNEVREISIEN
GTIKKGKLIKDVLSKIDINKNKLIIGNQTIDFSGDIDNKDRYIFLTCELDDKISLIIEINLVAKSYSLLLSGDKNYLISN
LSNTIEKINTLGLDSKNIAYNYTDESNNKYFGAISKTSQKSIIHYKKDSKNILEFYNDSTLEFNSKDFIAEDINVFMKDD
INTITGKYYVDNNTDKSIDFSISLVSKNQVKVNGLYLNESVYSSYLDFVKNSDGHHNTSNFMNLFLDNISFWKLFGFENI
NFVIDKYFTLVGKTNLGYVEFICDNNKNIDIYFGEWKTSSSKSTIFSGNGRNVVVEPIYNPDTGEDISTSLDFSYEPLYG
IDRYINKVLIAPDLYTSLININTNYYSNEYYPEIIVLNPNTFHKKVNINLDSSSFEYKWSTEGSDFILVRYLEESNKKIL
QKIRIKGILSNTQSFNKMSIDFKDIKKLSLGYIMSNFKSFNSENELDRDHLGFKIIDNKTYYYDEDSKLVKGLININNSL
FYFDPIEFNLVTGWQTINGKKYYFDINTGAALTSYKIINGKHFYFNNDGVMQLGVFKGPDGFEYFAPANTQNNNIEGQAI
VYQSKFLTLNGKKYYFDNNSKAVTGWRIINNEKYYFNPNNAIAAVGLQVIDNNKYYFNPDTAIISKGWQTVNGSRYYFDT
DTAIAFNGYKTIDGKHFYFDSDCVVKIGVFSTSNGFEYFAPANTYNNNIEGQAIVYQSKFLTLNGKKYYFDNNSKAVTGL
QTIDSKKYYFNTNTAEAATGWQTIDGKKYYFNTNTAEAATGWQTIDGKKYYFNTNTAIASTGYTIINGKHFYFNTDGIMQ
IGVFKGPNGFEYFAPANTDANNIEGQAILYQNEFLTLNGKKYYFGSDSKAVTGWRIINNKKYYFNPNNAIAAIHLCTINN
DKYYFSYDGILQNGYITIERNNFYFDANNESKMVTGVFKGPNGFEYFAPANTHNNNIEGQAIVYQNKFLTLNGKKYYFDN
DSKAVTGWQTIDGKKYYFNLNTAEAATGWQTIDGKKYYFNLNTAEAATGWQTIDGKKYYFNTNTFIASTGYTSINGKHFY
FNTDGIMQIGVFKGPNGFEYFAPANTDANNIEGQAILYQNKFLTLNGKKYYFGSDSKAVTGLRTIDGKKYYFNTNTAVAV
TGWQTINGKKYYFNTNTSIASTGYTIISGKHFYFNTDGIMQIGVFKGPDGFEYFAPANTDANNIEGQAIRYQNRFLYLHD
NIYYFGNNSKAATGWVTIDGNRYYFEPNTAMGANGYKTIDNKNFYFRNGLPQIGVFKGSNGFEYFAPANTDANNIEGQAI
RYQNRFLHLLGKIYYFGNNSKAVTGWQTINGKVYYFMPDTAMAAAGGLFEIDGVIYFFGVDGVKAPGIYG
>Q46149 3.4.22.-~~~tcdA~~~Toxin A~~~
MLITREQLMKIASIPLKRKEPEYNLILDALENFNRDIEGTSVKEIYSKLSKLNELVDNYQTKYPSSGRNLALENFRDSLY
SELRELIKNSRTSTIASKNLSFIWIGGPISDQSLEYYNMWKMFNKDYNIRLFYDKNSLLVNTLKTAIIQESSKVIIEQNQ
SNILDGTYGHNKFYSDRMKLIYRYKRELKMLYENMKQNNSVDDIIINFLSNYFKYDIGKLNNQKENNNNKMIAIGATDIN
TENILTNKLKSYYYQELIQTNNLAAASDILRIAILKKYGGVYCDLDFLPGVNLSLFNDISKPNGMDSNYWEAAIFEAIAN
EKKLMNNYPYKYMEQVPSEIKERILSFVRNHDINDLILPLGDIKISQLEILLSRLKAATGKKTFSNAFIISNNDSLTLNN
LISQLENRYEILNSIIQEKFKICETYDSYINSVSELVLETTPKNLSMDGSSFYQQIIGYLSSGFKPEVNSTVFFSGPNIY
SSATCDTYHFIKNTFDMLSSQNQEIFEASNNLYFSKTHDEFKSSWLLRSNIAEKEFQKLIKTYIGRTLNYEDGLNFNKWK
RVTTSELLKVIEEVNSTKIYENYDLNMILQIQGDDISYESAVNVFGKNPNKSILIQGVDDFANVFYFENGIVQSDNINNI
LSRFNDIKKIKLTLIGHGENVFNPKLFGGKTVNDLYTNIIKPKLQHLLEREGVILKNKYLKINILGCYMFTPKVDINSTF
VGKLFNKISRDLQPKGFSKNQLEISANKYAIRINREGKREVLDYFGKWVSNTDLIAEQISNKYVVYWNEVENTLSARVEQ
LNKVAEFAKDINSIIQTTNNQELKQSLVNTYADLITTLYSELLKEDIPFELDNIQIKERIILNEISRLHDFSNIILDFYQ
KNNISNNMIILFDSIIKEKDYYNVKLANKITGETSVIKTYSDSLWNFTNKYKKIVDDIKGIIVKDINGEFIKKADFEIEQ
NPSLLNSAMLMQLLIDYKPYTEILTNMNTSLKVQAYAQIFQLSIGAIQEATEIVTIISDALNANFNILSKLKVGSSVASV
IIDGINLIAALTELKNVKTNFERKLIEAKVGMYSIGFILESSSLISGLLGATAVSEILGVISVPVAGILVGLPSLVNNIL
VLGEKYNQILDYFSKFYPIVGKNPFSIQDNIIIPYDDIAITELNFKYNKFKYGYAKISGLKVGLVTHIGENIDHYFSAPS
LDHYIELSIYPALKLNDTNLPKGNVVLLPSGLNKVYKPEISAIAGANSQEGNGVEVLNLIRNYYVDSNGNTKFPWKYEAP
FEYSFSYMRVEYFDTKVNVILDNENKTLIIPVLTIDEMRNKISYEILGDGGQYNVILPVNQTNINIVSNKNDIWNFDVSY
IVKESKIEDNKFVLDGFINNIFSTLKVSNDGFKIGKQFISIKNTPRAINLSFKINNNIVIVSIYLNHEKSNSITIISSDL
NDIKNNFDNLLDNINYIGLGSISDNTINCIVRNDEVYMEGKIFLNEKKLVFIQNELELHLYDSVNKDSQYLINNPINNVV
KYKDGYIVEGTFLINSTENKYSLYIENNKIMLKGLYLESSVFKTIQDKIYSKEKVNDYILSLIKKFFTVNIQLCPFMIVS
GVDENNRYLEYMLSTNNKWIINGGYWENDFNNYKIVDFEKCNVIVSGSNKLNSEGDLADTIDVLDKDLENLYIDSVIIIP
KVYTKKIIIHPIPNNPQINIINTQSIHDKCHLIIDSVLTNNYHWESDGDDLIITNGLDINIRILQGLSFGFKYKNIYLKF
SNYDELSLNDFLLQNYNVKGLYYINGELHYKNIPGDTFEYGWINIDSRWYFFDSINLIAKKGYQEIEGERYYFNPNTGVQ
ESGVFLTPNGLEYFTNKHASSKRWGRAINYTGWLTLDGNKYYFQSNSKAVTGLQKISDKYYYFNDNGQMQIKWQIINNNK
YYFDGNTGEAIIGWFNNNKERYYFDSEGRLLTGYQVIGDKSYYFSDNINGNWEEGSGVLKSGIFKTPSGFKLFSSEGDKS
AINYKGWLDLNGNKYYFNSDSIAVTGSYNIKGIQYYFNPKTAVLTNGWYTLDNNNYYVSNGHNVLGYQDIDGKGYYFDPS
TGIQKAGVFPTPNGLRYFTMKPIDGQRWGQCIDYTGWLHLNGNKYYFGYYNSAVTGWRVLGGKRYFFNIKTGAATTGLLT
LSGKRYYFNEKGEQLTLV
>Q46927 6.1.-.-~~~tcdA~~~tRNA threonylcarbamoyladenosine dehydratase~~~COG1179
MSVVISDAWRQRFGGTARLYGEKALQLFADAHICVVGIGGVGSWAAEALARTGIGAITLIDMDDVCVTNTNRQIHALRDN
VGLAKAEVMAERIRQINPECRVTVVDDFVTPDNVAQYMSVGYSYVIDAIDSVRPKAALIAYCRRNKIPLVTTGGAGGQID
PTQIQVTDLAKTIQDPLAAKLRERLKSDFGVVKNSKGKLGVDCVFSTEALVYPQSDGTVCAMKATAEGPKRMDCASGFGA
ATMVTATFGFVAVSHALKKMMAKAARQG
>Q9EXR0 3.4.22.-~~~tcdB~~~Toxin B~~~
MSLVNRKQLEKMANVRFRVQEDEYVAILDALEEYHNMSENTVVEKYLKLKDINSLTDTYIDTYKKSGRNKALKKFKEYLV
TEILELKNSNLTPVEKNLHFIWIGGQINDTAINYINQWKDVNSDYNVNVFYDSNAFLINTLKKTIIESASNDTLESFREN
LNDPEFNHTAFFRKRMQIIYDKQQNFINYYKAQKEENPDLIIDDIVKTYLSNEYSKDIDELNAYIEESLNKVTENSGNDV
RNFEEFKTGEVFNLYEQELVERWNLAGASDILRVAILKNIGGVYLDVDMLPGIHPDLFKDINKPDSVKTAVDWEEMQLEA
IMKYKEYIPEYTSKHFDTLDEEVQSSFESVLASKSDKSEIFLPLGGIEVSPLEVKVAFAKGSIIDQALISAKDSYCSDLL
IKQIQNRYKILNDTLGPIISQGNDFNTTMNNFGESLGAIANEENISFIAKIGSYLRVGFYPEANTTITLSGPTIYAGAYK
DLLTFKEMSIDTSILSSELRNFEFPKVNISQATEQEKNSLWQFNEERAKIQFEEYKKNYFEGALGEDDNLDFSQNTVTDK
EYLLEKISSSTKSSERGYVHYIVQLQGDKISYEAACNLFAKNPYDSILFQKNIEDSEVAYYYNPTDSEIQEIDKYRIPDR
ISDRPKIKLTLIGHGKAEFNTDIFAGLDVDSLSSEIETILDLAKADISPKSIEINLLGCNMFSYSVNVEETYPGKLLLRV
KDKVSELMPSISQDSIIVSANQYEVRINSEGRRELLDHSGEWINKEESIIKDISSKEYISFNPKENKIIVKSKNLPELST
LLQEIRNNSNSSDIELEEKVMLAECEINVISNIETQVVEERIEEAKSLTSDSINYIKNEFKLIESISDALYDLKQQNELE
ESHFISFEDISKTDEGFSIRFIDKETGESIFVETEKAIFSEYANHITEEISKLKDTIFDTVNGKLVKKVTLDATHEVNTL
NAAFFIQSLIGYNSSKESLSNLSVAMKVQVYAQLFSTGLNTITDAAKVVELVSTALDETIDLLPTLSEGLPVIATIIDGV
SLGASIKELSETSDPLLRQEIEAKIGIMAVNLTAATTAIITSSLGIASGFSILLVPLAGISAGIPSLVNNELILRAEAKN
VVDYFGHISLAESEGAFTLLDDKIMMPQDDLVISEIDFNNNSITLGKCEIWRMEGGSGHTVTDDIDHFFSAPSTTYREPY
LSIYDVLDVKEEELDLSKDLMVLPNAPDRIFGWERGWTPGLRSLENDGTKLLDRIRDHYEGQFYWRFFAFIADSVITKLK
PRYEDTNIRISLDSNTRSFIVPVITTEYIREKLSYSFYGSGGTYALSLSQYNMNINIELNENDTWVIDVDNVVRDVTIES
DKIKKGDLIENILSKLSIEDNKIILDNHEINFSGTLNGGNGFVSLTFSILEGINAVIEVDLLSKSYKVLISGELKTLMAN
SNSVQQKIDYIGLNSELQKNIPYSFMDDEGKENGFINCFTKEGLFVSELSDVVLIIKVYMDNSKPPFGYYSNDLKDVKVI
TKDDVIIITGYYLKDDIKISLSFTIQDKNTIKLNGVYLDENGVAEILKFMNKKGSTNTSDSLMSFLESMNIKSIFIKSLK
SNAKLILDTNFIISGTTFIGQFEFICDKDNNIQPYFIKFNTLETKYTLYVGNRQNMIVEPNYNLDDSGDISSTVINFSQK
YLYGIDSCVNKVVISPGIYTDEINITPVHEANNTYPEVIVLDTNYISEKINININDLSIRYVWSNDGSDFILMSTDEENK
VSQVKIRFTNVFKGNTISDKISFNFSDKQDISINKIISTFTPSYYVEGLLNYDLGLISLYNEKFYINNLGMMVSGLVYIN
DSLYYFKPPIKNLITGFTTIGDDKYYFNPDNGGPASVGETIIDGKNYYFSQNGVLQTGVFSTEDGFKYFAPADTLDENLE
GEAIDFTGKLIIDENVYYFGDNYRAAIEWQTLDDEVYYFSTDTGRAFKGLNQIGDDKFYFNSDGIMQKGFVNINDKTFYF
DDSGVMKSGYTEIDGRYFYFAENGEMQIGVFNTADGFKYFAHHDEDLGNEEGEALSYSGILNFNNKIYYFDDSFTAVVGW
KDLEDGSKYYFDENTAEASIGISIINDGKYYFNDSGIMQIGFVTINNEVFYFSDSGIVESGMQNIDDNYFYISENGLVQI
GVFDTSDGYKYFAPANTVNDNIYGQAVEYSGLVRVNEDVYSFGESYTIETGWIYDSENESDKYYFDPEAKKAYKGINVID
DIKYYFDENGIMRTGLITFEDNHYYFNEDGEMQYGYLNIEDKMFYFSEDGIMQIGVFNTPDGFKYFAHQNTLDENFEGES
INYTGWLDLDEKRYYFTDEYIAATGSVIIDGEEYYFDPDTAQLVISE
>P18177 3.4.22.-~~~tcdB~~~Toxin B~~~
MSLVNRKQLEKMANVRFRTQEDEYVAILDALEEYHNMSENTVVEKYLKLKDINSLTDIYIDTYKKSGRNKALKKFKEYLV
TEVLELKNNNLTPVEKNLHFVWIGGQINDTAINYINQWKDVNSDYNVNVFYDSNAFLINTLKKTVVESAINDTLESFREN
LNDPRFDYNKFFRKRMEIIYDKQKNFINYYKAQREENPELIIDDIVKTYLSNEYSKEIDELNTYIEESLNKITQNSGNDV
RNFEEFKNGESFNLYEQELVERWNLAAASDILRISALKEIGGMYLDVDMLPGIQPDLFESIEKPSSVTVDFWEMTKLEAI
MKYKEYIPEYTSEHFDMLDEEVQSSFESVLASKSDKSEIFSSLGDMEASPLEVKIAFNSKGIINQGLISVKDSYCSNLIV
KQIENRYKILNNSLNPAISEDNDFNTTTNTFIDSIMAEANADNGRFMMELGKYLRVGFFPDVKTTINLSGPEAYAAAYQD
LLMFKEGSMNIHLIEADLRNFEISKTNISQSTEQEMASLWSFDDARAKAQFEEYKRNYFEGSLGEDDNLDFSQNIVVDKE
YLLEKISSLARSSERGYIHYIVQLQGDKISYEAACNLFAKTPYDSVLFQKNIEDSEIAYYYNPGDGEIQEIDKYKIPSII
SDRPKIKLTFIGHGKDEFNTDIFAGFDVDSLSTEIEAAIDLAKEDISPKSIEINLLGCNMFSYSINVEETYPGKLLLKVK
DKISELMPSISQDSIIVSANQYEVRINSEGRRELLDHSGEWINKEESIIKDISSKEYISFNPKENKITVKSKNLPELSTL
LQEIRNNSNSSDIELEEKVMLTECEINVISNIDTQIVEERIEEAKNLTSDSINYIKDEFKLIESISDALCDLKQQNELED
SHFISFEDISETDEGFSIRFINKETGESIFVETEKTIFSEYANHITEEISKIKGTIFDTVNGKLVKKVNLDTTHEVNTLN
AAFFIQSLIEYNSSKESLSNLSVAMKVQVYAQLFSTGLNTITDAAKVVELVSTALDETIDLLPTLSEGLPIIATIIDGVS
LGAAIKELSETSDPLLRQEIEAKIGIMAVNLTTATTAIITSSLGIASGFSILLVPLAGISAGIPSLVNNELVLRDKATKV
VDYFKHVSLVETEGVFTLLDDKIMMPQDDLVISEIDFNNNSIVLGKCEIWRMEGGSGHTVTDDIDHFFSAPSITYREPHL
SIYDVLEVQKEELDLSKDLMVLPNAPNRVFAWETGWTPGLRSLENDGTKLLDRIRDNYEGEFYWRYFAFIADALITTLKP
RYEDTNIRINLDSNTRSFIVPIITTEYIREKLSYSFYGSGGTYALSLSQYNMGINIELSESDVWIIDVDNVVRDVTIESD
KIKKGDLIEGILSTLSIEENKIILNSHEINFSGEVNGSNGFVSLTFSILEGINAIIEVDLLSKSYKLLISGELKILMLNS
NHIQQKIDYIGFNSELQKNIPYSFVDSEGKENGFINGSTKEGLFVSELPDVVLISKVYMDDSKPSFGYYSNNLKDVKVIT
KDNVNILTGYYLKDDIKISLSLTLQDEKTIKLNSVHLDESGVAEILKFMNRKGNTNTSDSLMSFLESMNIKSIFVNFLQS
NIKFILDANFIISGTTSIGQFEFICDENDNIQPYFIKFNTLETNYTLYVGNRQNMIVEPNYDLDDSGDISSTVINFSQKY
LYGIDSCVNKVVISPNIYTDEINITPVYETNNTYPEVIVLDANYINEKINVNINDLSIRYVWSNDGNDFILMSTSEENKV
SQVKIRFVNVFKDKTLANKLSFNFSDKQDVPVSEIILSFTPSYYEDGLIGYDLGLVSLYNEKFYINNFGMMVSGLIYIND
SLYYFKPPVNNLITGFVTVGDDKYYFNPINGGAASIGETIIDDKNYYFNQSGVLQTGVFSTEDGFKYFAPANTLDENLEG
EAIDFTGKLIIDENIYYFDDNYRGAVEWKELDGEMHYFSPETGKAFKGLNQIGDYKYYFNSDGVMQKGFVSINDNKHYFD
DSGVMKVGYTEIDGKHFYFAENGEMQIGVFNTEDGFKYFAHHNEDLGNEEGEEISYSGILNFNNKIYYFDDSFTAVVGWK
DLEDGSKYYFDEDTAEAYIGLSLINDGQYYFNDDGIMQVGFVTINDKVFYFSDSGIIESGVQNIDDNYFYIDDNGIVQIG
VFDTSDGYKYFAPANTVNDNIYGQAVEYSGLVRVGEDVYYFGETYTIETGWIYDMENESDKYYFNPETKKACKGINLIDD
IKYYFDEKGIMRTGLISFENNNYYFNENGEMQFGYINIEDKMFYFGEDGVMQIGVFNTPDGFKYFAHQNTLDENFEGESI
NYTGWLDLDEKRYYFTDEYIAATGSVIIDGEEYYFDPDTAQLVISE
>P16153 ~~~tcdE~~~Holin-like protein TcdE~~~
MHSSSPFYISNGNKIFFYINLGGVMNMTISFLSEHIFIKLVILTISFDTLLGCLSAIKSRKFNSSFGIDGGIRKVAMIAC
IFFLSVVDILTKFNFLFMLPQDCINFLRLKHLGISEFFSILFILYESVSILKNMCLCGLPVPKRLKEKIAILLDAMTDEM
NAKDEK
>Q3ZAB8 1.21.99.-~~~tceA~~~Trichloroethene reductive dehalogenase~~~COG2768
MSEKYHSTVTRRDFMKRLGLAGAGAGALGAAVLAENNLPHEFKDVDDLLSAGKALEGDHANKVNNHPWWVTTRDHEDPTC
NIDWSLIKRYSGWNNQGAYFLPEDYLSPTYTGRRHTIVDSKLEIELQGKKYRDSAFIKSGIDWMKENIDPDYDPGELGYG
DRREDALIYAATNGSHNCWENPLYGRYEGSRPYLSMRTMNGINGLHEFGHADIKTTNYPKWEGTPEENLLIMRTAARYFG
ASSVGAIKITDNVKKIFYAKVQPFCLGPWYTITNMAEYIEYPVPVDNYAIPIVFEDIPADQGHYSYKRFGGDDKIAVPNA
LDNIFTYTIMLPEKRFKYAHSIPMDPCSCIAYPLFTEVEARIQQFIAGLGYNSMGGGVEAWGPGSAFGNLSGLGEQSRVS
SIIEPRYGSNTKGSLRMLTDLPLAPTKPIDAGIREFCKTCGICAEHCPTQAISHEGPRYDSPHWDCVSGYEGWHLDYHKC
INCTICEAVCPFFTMSNNSWVHNLVKSTVATTPVFNGFFKNMEGAFGYGPRYSPSRDEWWASENPIRGASVDIF
>P39888 1.14.13.200~~~tcmG~~~Tetracenomycin A2 monooxygenase-dioxygenase~~~COG0654
MSTEEVPVLIVGGGLTGLSAALFLSQHGVSCRLVEKHRGTTVLTRASGISSRTMELLRGVGLERTVIERGPKLVEGARWR
ELGQPADQIPWVVIRANGLHDLENAVTVEEPSLDVGHLSPTRPYWCGQDRLEPILRDEAVRRGARIDFNTRMEAFTADES
GVTATIVDQATGEQSTVRARYLIAADGVRSPVRETLGITRTGHGTIGNAMSVLFKADLRDTVKGRRFVICYLPNPDEPGV
LQLPEVPAVLQLFDEDRWIFGFFFDPRETSPEQFTDERCAQIIRTATGLPGLPVEVQMARPWEMSHNSARSYRSGRVFLA
GDAAHVHPPAGAFGANGGIQDAHNLAWKLAAVLKGTAGDALLDTYEQERLPIGAAVADQAWIRHTWRLNDSEELRRALRE
STLVATGYRYTSDAVLGAAYPEPIPAAHDLTGRPGYRVPHVWLGRGGERVSTVDLGGDAFVVLAGPDGGEWQAAADKVAQ
DLGVPVHCHPVGGDGQLTDPDGAFLGTTGLTRNGALLIRPDGFVAWRAEYLPEDAAGELRSALERILARTSGTPGGTALE
G
>P39889 1.13.12.21~~~tcmH~~~Tetracenomycin-F1 monooxygenase~~~COG2329
MATISPSPDLFTLVNVFGVAPEKQRELRDHLVQVTEDLIRHMPGFVSATFHLSRDGEQVVNYAQWRSEADFRAMHADPRL
QPHFDYCRSVSRPKPIFCEVTHSFGATSPEGA
>P39890 4.2.1.154~~~tcmI~~~Tetracenomycin F2 cyclase~~~
MAYRALMVLRMDPADAEHVAAAFAEHDTTELPLEIGVRRRVLFRFHDLYMHLIEADDDIMERLYQARSHPLFQEVNERVG
QYLTPYAQDWEELKDSKAEVFYSWTAPDS
>P16558 2.3.1.235~~~tcmJ~~~Tetracenomycin polyketide synthase protein TcmJ~~~COG0662
MTTAQHEGEQTVAAAAKIAMSSVAPSHRQGGSTRALLTPSSVGATSGFLGHIELAPGESVTEHYHPFSDKYLYLIEGSLV
VRVNGEEVRLERDEALFVTRGQRHRIENRGNVPARVVFQISPLAPRPELGHVDTEPVPNPAAAPPKVGG
>P16538 2.3.1.235~~~tcmK~~~Tetracenomycin polyketide synthase ketoacyl synthase alpha subunit~~~COG0304
MTRHAEKRVVITGIGVRAPGGAGTAAFWDLLTAGRTATRTISLFDAAPYRSRIAGEIDFDPIGEGLSPRQASTYDRATQL
AVVCAREALKDSGLDPAAVNPERIGVSIGTAVGCTTGLDREYARVSEGGSRWLVDHTLAVEQLFDYFVPTSICREVAWEA
GAEGPVTVVSTGCTSGLDAVGYGTELIRDGRADVVVCGATDAPISPITVACFDAIKATSANNDDPAHASRPFDRNRDGFV
LGEGSAVFVLEELSAARRRGAHAYAEVRGFATRSNAFHMTGLKPDGREMAEAITAALDQARRTGDDLHYINAHGSGTRQN
DRHETAAFKRSLGQRAYDVPVSSIKSMIGHSLGAIGSLELAACALAIEHGVIPPTANYEEPDPECDLDYVPNVAREQRVD
TVLSVGSGFGGFQSAAVLARPKETRS
>P16539 2.3.1.235~~~tcmL~~~Tetracenomycin polyketide synthase ketoacyl synthase beta subunit~~~COG0304
MSAPAPVVVTGLGIVAPNGTGTEEYWAATLAGKSGIDVIQRFDPHGYPVRVGGEVLAFDAAAHLPGRLLPQTDRMTQHAL
VAAEWALADAGLEPEKQDEYGLGVLTAAGAGGFEFGQREMQKLWGTGPERVSAYQSFAWFYAVNTGQISIRHGMRGHSSV
FVTEQAGGLDAAAHAARLLRKGTLNTALTGGCEASLCPWGLVAQIPSGFLSEATDPHDAYLPFDARAAGYVPGEGGAMLV
AERADSARERDAATVYGRIAGHASTFDARPGTGRPTGPARAIRLALEEARVAPEDVDVVYADAAGVPALDRAEAEALAEV
FGPGAVPVTAPKTMTGRLYAGGAALDVATALLSIRDCVVPPTVGTGAPAPGLGIDLVLHQPRELRVDTALVVARGMGGFN
SALVVRRHG
>P12884 2.3.1.235~~~tcmM~~~Tetracenomycin polyketide synthase acyl carrier protein~~~COG0236
MPQIGLPRLVEIIRECAGDPDERDLDGDILDVTYQDLGYDSIALLEISAKLEQDLGVSIPGEELKTPRHTLHLVNTETAG
EVA
>P16559 2.1.1.-~~~tcmN~~~Tetracenomycin biosynthesis bifunctional cyclase/O-methyl transferase TcmN~~~COG2226
MAARTDNSIVVNAPFELVWDVTNDIEAWPELFSEYAEAEILRQDGDGFDFRLKTRPDANGRVWEWVSHRVPDKGSRTVRA
HRVETGPFAYMNLHWTYRAVAGGTEMRWVQEFDMKPGAPFDNAHMTAHLNTTTRANMERIKKIIEDRHREGQRTPASVLP
TELHAQQLLLLAASGRLARIVHVLTELRIADLLADGPRHVAELAKETDTHELSLYRVLRSAASVGVFAEGPVRTFSATPL
SDGLRTGNPDGVLPLVKYNNMELTRRPYDEIMHSVRTGEPAFRRVFGSSFFEHLEANPEAGEFFERFMAHWSRRLVLDGL
ADQGMERFSRIADLGGGDGWFLAQILRRHPHATGLLMDLPRVAASAGPVLEEAKVADRVTVLPGDFFTDPVPTGYDAYLF
KGVLHNWSDERAVTVLRRVREAIGDDDARLLIFDQVMAPENEWDHAKLLDIDMLVLFGGRERVLAEWRQLLLEADFDIVN
TPSHTWTTLECRPV
>Q8DUY4 ~~~~~~Putative two-component membrane permease complex subunit SMU_746c~~~COG3689
MIRFLILAGYFELGMYLQLSGKLDRYINSHYSYLAYISMALSFILALVQLTIWMKRLKMHSHLSGKAAKFFSPIILAIPV
FIGLLVPTVPLDSTTVSAKGYHFPLAAGSTTSGTSSDGTRVQYLKPDTSLYFTKSAYQKEMRATLKKYKGSGKLQITTQN
YMEVMEIIYLFPDEFKNRQIEYVGFIYNDPKDKNSQFLFRFGIIHCIADSGVYGLLTTGGQTHYQNNTWVTVSGKLAIEY
NQNLKQTLPVLHISQSSQTMQPKNPYVYRVF
>Q8DUY3 ~~~~~~Putative two-component membrane permease complex subunit SMU_747c~~~COG0701
MAIFNHLPSSVLQCLAIFLSIIIEALPFILLGAILSGFIEVYLTPDIVQKYLPKNKIGRILFGTFVGFIFPSCECGIVPI
VNRFLEKKVPSYTAIPFLATAPIINPIVLFATFSAFGNSWRFVFLRLFGAIIVAISLGILLGFIVDEHIIKESAKPCHFH
DYSHKKAYQKIFYALAHAVDELFDTGRYLIFGSFVAASMQIYVPTRILTSIGHNPLTAILIMMLLAFILSLCSEADAFIG
TSLLATFGVAPVVAFLLIGPMVDIKNLMMMKNAFKTKFILQFVGTSSLIIIIYCLIVGVMQ
>Q471I2 1.14.14.173~~~tcpA~~~2,4,6-trichlorophenol monooxygenase~~~COG2368
MIRTGKQYLESLNDGRNVWVGNEKIDNVATHPKTRDYAQRHADFYDLHHRPDLQDVMTFVDKDGERRTMQWFGHYDKEQL
RRKRKYHETIMREMAGASFPRTPDVNNYVLQTYIDDPSPWETQTIGAEGKVKAKNIVDFVNFAKKHDLNCAPQFVDPQMD
RSNPDAQQRSPGLRVIEKNDKGIVVSGVKAIGTGVAFADWIHIGVFFRPGIPGDQIIFAATPVNTPGVTIVCRESVVKED
PIEHPLASQGDELDGMTVFDNVFIPWSHVFHLGNPEHAKLYPQRVFDWLHYHALIRQSVRAELMAGLAILITEHIGTNKI
PAVQTRVAKLIGFHQAMLAHIVASEELGFHTPGGAYKPNILIYDFGRALYLENFSQMIYELVDLSGRSALIFASEDQWND
EALNGWFERMNNGPVGQPHDRVKIGRVIRDLFLTDWGNRLFVFENFNGTPLQAIRMLTMQRAEFSAAGPYGTLARKVCGI
ELTEGHDSEYKATAGYAQALDSARHQEKLALSGTMTV
>Q60153 ~~~tcpA~~~Toxin coregulated pilin~~~COG2165
MQLLKQLFKKKFVKEEHDKKTGQEGMTLLEVIIVLGIMGVVSAGVVTLAQRAIDSQNMTKAAQNLNSVQIAMTQTYRSLG
NYPATANANAATQLANGLVSLGKVSADEAKNPFTGTAMGIFSFPRNSAANKAFAITVGGLTQAQCKTLVTSVGDMFPFIN
VKEGAFAAVADLGDFETSVADAATGAGVIKSIAPGSANLNLTNITHVEKLCTGTAPFTVAFGNS
>P23024 ~~~tcpA~~~Toxin coregulated pilin~~~
MQLLKQLFKKKFVKEEHDKKTGQEGMTLLEVIIVLGIMGVVSAGVVTLAQRAIDSQIMTKAAQSLNSIQVALTQTYRGLG
NYPATADATAASKLTSGLVSLGKISSDEAKNPFNGTNMNIFSFPRNAAANKAFAISVDGLTQAQCKTLITSVGDMFPYIA
IKAGGAVALADLGDFENSAAAAETGVGVIKSIAPASKNLDLTNITHVEKLCKGTAPFGVAFGNS
>Q8YF53 3.2.2.-~~~tcpB~~~Probable 2' cyclic ADP-D-ribose synthase TcpB~~~COG4916
MSKEKQAQSKAHKAQQAISSAKSLSTQKSKMSELERATRDGAAIGKKRADIAKKIADKAKQLSSYQAKQFKADEQAVKKV
AQEQKRLSDERTKHEAFIKQSLSSMRTTASATMEAEEEYDFFISHASEDKEAFVQDLVAALRDLGAKIFYDAYTLKVGDS
LRRKIDQGLANSKFGIVVLSEHFFSKQWPARELDGLTAMEIGGQTRILPIWHKVSYDEVRRFSPSLADKVALNTSLKSVE
EIAKELHSLI
>Q471I0 1.13.11.-~~~tcpC~~~6-chlorohydroxyquinol 1,2-dioxygenase~~~COG3485
MQEYDQHNLTKAVIARLADTPNARTKQIMTSLVRHLHDFAREVRLTEAEWKQGIDYLTATGQMCDDKRQEFILLSDVLGL
SMLTVAMNQEKPEGCTEPTVFGPFHVEGAPHYAHGADVANGAKGEPCMVYGRVTGVDGRPVAGAVVETWQADADGHYDVQ
YEGLEVAQGRGVLKSGEDGRFHFRTIVAQAYPIPDDGPVGELLRATGRHPWRPAHLHFMIKAPGYETLVTHVFRRGDKYL
DSDAVFGVRTSLIGDWVRQTDGTYRLDFDFVLNPTL
>A0A0H2V8B5 3.2.2.6~~~TcpC~~~NAD(+) hydrolase TcpC~~~COG4916
MIAYENIEFFICLVNVLGNNMYNILFFIFLSIAIPFLLFLAWKQHLKTKEIRSYLLKEGYNIIFNGEGNSYLAFNISNAT
FRAGNLTSNDYFQASISYIHDYRWEWKEVEAKKINNIFIIYISNIDFPSQKLFYRNNKSLAEIDWAKLQAIFHQPYEIQN
DVMQDNNNTHYDFFISHAKEDKDTFVRPLVDELNRLGVIIWYDEQTLEVGDSLRRNIDLGLRKANYGIVILSHNFLNKKW
TQYELDSLINRAVYDDNKIILPIWHNINAQEVSKYSHYLADKMALQTSLYSVKEIARELAEIAYRRR
>P0C6C9 ~~~tcpE~~~Toxin coregulated pilus biosynthesis protein E~~~COG1459
MKIISKKYRLELYSMLVDLLNDNIPLYDALNKIQNEGVGIYDKNFIKSIELIKDRMKSNSSLTDALTGLIPDKEVLMINV
AENSGKISSGIAAIRKNIIDADEIKSKAISSMITPSVMLIVTMVVIAGYSVKVFPTFESVLPVSRWPGVTQALYNLGFSL
YEGLWIKVLIFVAIFITILVFMSKNITGNFRDGFLDKLPPFNFVKHIAATEFLANMSMLLDSRVPFKEGLDIVDHKTTRW
LSSHLQRMKANMQEGLDYKQALDTNLLDKKMLLTMAVYSELPNFSDVMQKLAIEANINLHKKIATLAGVMKNISLITLAL
SVIWIFGAIFSLVDKLSSSL
>Q833J1 3.2.2.6~~~tcpF~~~NAD(+) hydrolase TcpF~~~
MSNGKKIFISHSSKDQEYVDAFIQLLKKFGFRTQDIFYSSTIETGVQPGELIFDTIKRELTNQPVMLYFLSDHYYQSIPC
LNEMGASWMLSDKHYPIALNNFSMKDMKGVISSERLAIAFNDKTSTNEINCLLKKLSHDTDVQAEPDFELNVEKNIQPFQ
NKLTQLIRQASYLKPDEKGYFETILSTHRPVYGTAKGVYDCFKLPSLIEPKSLGLDTLSEDESHWLFFFLTWGTFQEGEK
VRFKLKKDKAYNNREFSDIGKCKNIYVSYLEKVE
>A5F383 ~~~tcpF~~~Toxin coregulated pilus biosynthesis protein F~~~
MRYKKTLMLSIMITSFNSFAFNDNYSSTSTVYATSNEATDSRGSEHLRYPYLECIKIGMSRDYLENCVKVSFPTSQDMFY
DAYPSTESDGAKTRTKEDFSARLLAGDYDSLQKLYIDFYLAQTTFDWEIPTRDQIETLVNYANEGKLSTALNQEYITGRF
LTKENGRYDIVNVGGVPDNTPVKLPAIVSKRGLMGTTSVVNAIPNEIYPHIKVYEGTLSRLKPGGAMIAVLEYDVNELSK
HGYTNLWDVQFKVLVGVPHAETGVIYDPVYEETVKPYQPSNNLTGKKLYNVSTNDMHNGYKWSNTMFSNSNYKTQILLTK
GDGSGVKLYSKAYSENFK
>P0C6Q5 ~~~tcpF~~~Toxin coregulated pilus biosynthesis protein F~~~
MRYKKTLMLSIMITSFNSFAFNDNYSSTSTVYATSNEATDSRGSEHLRYPYLECIKIGMSRDYLENCVKVSFPTSQDMFY
DAYPSTESDGAKTRTKEDFSARLLAGDYDSLQKLYIDFYLAQTTFDWEIPTRDQIETLVNYANEGKLSTALNQEYITGRF
LTKENGRYDIVNVGGVPDNTPVKLPAIVSKRGLMGTTSVVNAIPNEIYPHIKVYEGTLSRLKPGGAMIAVLEYDVNELSK
HGYTNLWDVQFKVLVGVPHAETGVIYDPVYEETVKPYQPSNNLTGKKLYNVSTNDMHNGYKWSNTMFSNSNYKTQILLTK
GDGSGVKLYSKAYSENFK
>A5F384 ~~~tcpN~~~TCP pilus virulence regulatory protein~~~COG2207
MIGKKSFQTNVYRMSKFDTYIFNNLYINDYKMFWIDSGIAKLIDKNCLVSYEINSSSIILLKKNSIQRFSLTSLSDENIN
VSVITISDSFIRSLKSYILGDLMIRNLYSENKDLLLWNCEHNDIAVLSEVVNGFREINYSDEFLKVFFSGFFSKVEKKYN
SIFITDDLDAMEKISCLVKSDITRNWRWADICGELRTNRMILKKELESRGVKFRELINSIRISYSISLMKTGEFKIKQIA
YQSGFASVSYFSTVFKSTMNVAPSEYLFMLTGVAEK
>P0C6D6 ~~~tcpN~~~TCP pilus virulence regulatory protein~~~COG2207
MIGKKSFQTNVYRMSKFDTYIFNNLYINDYKMFWIDSGIAKLIDKNCLVSYEINSSSIILLKKNSIQRFSLTSLSDENIN
VSVITISDSFIRSLKSYILGDLMIRNLYSENKDLLLWNCEHNDIAVLSEVVNGFREINYSDEFLKVFFSGFFSKVEKKYN
SIFITDDLDAMEKISCLVKSDITRNWRWADICGELRTNRMILKKELESRGVKFRELINSIRISYSISLMKTGEFKIKQIA
YQSGFASVSYFSTVFKSTMNVAPSEYLFMLTGVAEK
>P0C6D5 ~~~tcpR~~~Toxin coregulated pilus biosynthesis protein R~~~
MTSIWLHESDFRYVNLDVERYQKKYRLTLTNGNKYVFIKDKEDISDVNPFIPYILMEEGMNNVLVKSDDYKDILIYQGVN
IQCLDFLNDNDISVEKLVDFETVELKADELNKIKARRLDAQLIEDEVKNNKVFIIGFIAIVIISIGVFWLM
>P29480 ~~~tcpT~~~Toxin coregulated pilus biosynthesis protein T~~~COG2804
MSIDIKYLSRIDIDREEFFFKDSRLMCKKFDEEREVLTLLEFDTKFRVNLLKKDKVYKYFLVSDANHKLLIANLVTEQQA
KDLSFIEKDIMKIASSATAYGASDIHFIREDRICKIKFRVNGTMIDYREILSSEADALMFVLYNVMATTKETTWNRKLPQ
DANIILVINEKAYRFRYAHMPLFGEGGKNYHAVVRIIYPSNNFVCTNYQDIGYNEADTDAIARILNTSYGLFIVSGTTGS
GKSTSLKKYIELLFFNKYKGKGCFVTVEDPVEYLISGAQQSSIVADNDDKTKNPFADAVRSAMRRDPDVIMIGEIRDKPT
VEALSSAVESGHYCLTTIHAGSVVSVLQRLSGLGMKADKIASPGFLAGITSQKLIPELCPSCKVSFVDERYQRAVFSANE
NGCEACNHSGFKGRLLLLETLVPTVEDLELVASENWVSLYRKYRERRFIKTGKKGLGEGFSIKDKAYYNVLKGKVCHEYF
MLHFGQLDHEDENIIYENYLQEV
>P02980 ~~~tetA~~~Tetracycline resistance protein, class B~~~
MNSSTKIALVITLLDAMGIGLIMPVLPTLLREFIASEDIANHFGVLLALYALMQVIFAPWLGKMSDRFGRRPVLLLSLIG
ASLDYLLLAFSSALWMLYLGRLLSGITGATGAVAASVIADTTSASQRVKWFGWLGASFGLGLIAGPIIGGFAGEISPHSP
FFIAALLNIVTFLVVMFWFRETKNTRDNTDTEVGVETQSNSVYITLFKTMPILLIIYFSAQLIGQIPATVWVLFTENRFG
WNSMMVGFSLAGLGLLHSVFQAFVAGRIATKWGEKTAVLLGFIADSSAFAFLAFISEGWLVFPVLILLAGGGIALPALQG
VMSIQTKSHQQGALQGLLVSLTNATGVIGPLLFAVIYNHSLPIWDGWIWIIGLAFYCIIILLSMTFMLTPQAQGSKQETS
A
>P02981 ~~~tetA~~~Tetracycline resistance protein, class C~~~
MKSNNALIVILGTVTLDAVGIGLVMPVLPGLLRDIVHSDSIASHYGVLLALYALMQFLCAPVLGALSDRFGRRPVLLASL
LGATIDYAIMATTPVLWILYAGRIVAGITGATGAVAGAYIADITDGEDRARHFGLMSACFGVGMVAGPVAGGLLGAISLH
APFLAAAVLNGLNLLLGCFLMQESHKGERRPMPLRAFNPVSSFRWARGMTIVAALMTVFFIMQLVGQVPAALWVIFGEDR
FRWSATMIGLSLAVFGILHALAQAFVTGPATKRFGEKQAIIAGMAADALGYVLLAFATRGWMAFPIMILLASGGIGMPAL
QAMLSRQVDDDHQGQLQGSLAALTSLTSITGPLIVTAIYAASASTWNGLAWIVGAALYLVCLPALRRGAWSRATST
>O07776 ~~~tcrA~~~Transcriptional regulatory protein TcrA~~~COG0745
MADETTMRAGRGPGRACGRVSGVRILVVEDEPKMTALLARALTEEGHTVDTVADGRHAVAAVDGGDYDAVVLDVMLPGID
GFEVCARLRRQRVWTPVLMLTARGAVTDRIAGLDGGADDYLTKPFNLDELFARLRALSRRGPIPRPPTLEAGDLRLDPSE
HRVWRADTEIRLSHKEFTLLEALIRRPGIVHTRAQLLERCWDAAYEARSNIVDVYIRYLRDKIDRPFGVTSLETIRGAGY
RLRKDGGRHALPR
>O69730 ~~~tcrX~~~Probable transcriptional regulatory protein TcrX~~~COG0745
MRRADGQPVTVLVVDDEPVLAEMVSMALRYEGWNITTAGDGSSAIAAARRQRPDVVVLDVMLPDMSGLDVLHKLRSENPG
LPVLLLTAKDAVEDRIAGLTAGGDDYVTKPFSIEEVVLRLRALLRRTGVTTVDSGAQLVVGDLVLDEDSHEVMRAGEPVS
LTSTEFELLRFMMHNSKRVLSKAQILDRVWSYDFGGRSNIVELYISYLRKKIDNGREPMIHTLRGAGYVLKPAR
>O69729 2.7.13.3~~~tcrY~~~Probable sensor histidine kinase TcrY~~~COG2205
MGITAATEMALRRHLVAQLDNQLGGTSYRSVLMYPEKMPRPPWRHETHNYIRSGPGPRFLDAPGQPAGMVAAVVSDGTTV
AAGYLTGSGSRAALTSTGRSQLERIAGSRTPLTLDLDGLGRYRVLAAPSRNGHDVIVTGLSMGNVDATMLQMLIIFGIVT
VIALVAATTAGIVIIKRALAPLRRVAQTASEVVDLPLDRGEVKLPVRVPEPDANPSTEVGQLGSALNRMLDHIAAALSAR
QASETCVRQFVADASHELRTPLAAIRGYTELTQRIGDDPEAVAHAMSRVASETERITRLVEDLLLLARLDSGRPLERGPV
DMSRLAVDAVSDAHVAGPDHQWALDLPPEPVVIPGDAARLHQVVTNLLANARVHTGPGTIVTTRLSTGPTHVVLQVIDNG
PGIPAALQSEVFERFARGDTSRSRQAGSTGLGLAIVSAVVKAHNGTITVSSSPGYTEFAVRLPLDGWQPLESSPR
>Q46342 3.4.22.-~~~tcsL~~~Cytotoxin-L~~~
MNLVNKAQLQKMVYVKFRIQEDEYVAILNALEEYHNMSESSVVEKYLKLKDINNLTDNYLNTYKKSGRNKALKKFKEYLT
MEVLELKNNSLTPVEKNLHFIWIGGQINDTAINYINQWKDVNSDYTVKVFYDSNAFLINTLKKTIVESATNNTLESFREN
LNDPEFDYNKFYRKRMEIIYDKQKHFIDYYKSQIEENPEFIIDNIIKTYLSNEYSKDLEALNKYIEESLNKITANNGNDI
RNLEKFADEDLVRLYNQELVERWNLAAASDILRISMLKEDGGVYLDVDILPGIQPDLFKSINKPDSITNTSWEMIKLEAI
MKYKEYIPGYTSKNFDMLDEEVQRSFESALSSKSDKSEIFLPLDDIKVSPLEVKIAFANNSVINQALISLKDSYCSDLVI
NQIKNRYKILNDNLNPSINEGTDFNTTMKIFSDKLASISNEDNMMFMIKITNYLKVGFAPDVRSTINLSGPGVYTGAYQD
LLMFKDNSTNIHLLEPELRNFEFPKTKISQLTEQEITSLWSFNQARAKSQFEEYKKGYFEGALGEDDNLDFAQNTVLDKD
YVSKKILSSMKTRNKEYIHYIVQLQGDKISYEASCNLFSKDPYSSILYQKNIEGSETAYYYYVADAEIKEIDKYRIPYQI
SNKRNIKLTFIGHGKSEFNTDTFANLDVDSLSSEIETILNLAKADISPKYIEINLLGCNMFSYSISAEETYPGKLLLKIK
DRVSELMPSISQDSITVSANQYEVRINEEGKREILDHSGKWINKEESIIKDISSKEYISFNPKENKIIVKSKYLHELSTL
LQEIRNNANSSDIDLEKKVMLTECEINVASNIDRQIVEGRIEEAKNLTSDSINYIKNEFKLIESISDSLYDLKHQNGLDD
SHFISFEDISKTENGFRIRFINKETGNSIFIETEKEIFSEYATHISKEISNIKDTIFDNVNGKLVKKVNLDAAHEVNTLN
SAFFIQSLIEYNTTKESLSNLSVAMKVQVYAQLFSTGLNTITDASKVVELVSTALDETIDLLPTLSEGLPIIATIIDGVS
LGAAIKELSETNDPLLRQEIEAKIGIMAVNLTAASTAIVTSALGIASGFSILLVPLAGISAGIPSLVNNELILQDKATKV
IDYFKHISLAETEGAFTLLDDKIIMPQDDLVLSEIDFNNNSITLGKCEIWRAEGGSGHTLTDDIDHFFSSPSITYRKPWL
SIYDVLNIKKEKIDFSKDLMVLPNAPNRVFGYEMGWTPGFRSLDNDGTKLLDRIRDHYEGQFYWRYFAFIADALITKLKP
RYEDTNVRINLDGNTRSFIVPVITTEQIRKNLSYSFYGSGGSYSLSLSPYNMNIDLNLVENDTWVIDVDNVVKNITIESD
EIQKGELIENILSKLNIEDNKIILNNHTINFYGDINESNRFISLTFSILEDINIIIEIDLVSKSYKILLSGNCMKLIENS
SDIQQKIDHIGFNGEHQKYIPYSYIDNETKYNGFIDYSKKEGLFTAEFSNESIIRNIYMPDSNNLFIYSSKDLKDIRIIN
KGDVKLLIGNYFKDDMKVSLSFTIEDTNTIKLNGVYLDENGVAQILKFMNNAKSALNTSNSLMNFLESINIKNIFYNNLD
PNIEFILDTNFIISGSNSIGQFELICDKDKNIQPYFINFKIKETSYTLYVGNRQNLIVEPSYHLDDSGNISSTVINFSQK
YLYGIDRYVNKVIIAPNLYTDEINITPVYKPNYICPEVIILDANYINEKINVNINDLSIRYVWDNDGSDLILIANSEEDN
QPQVKIRFVNVFKSDTAADKLSFNFSDKQDVSVSKIISTFSLAAYSDGFFDYEFGLVSLDNDYFYINSFGNMVSGLIYIN
DSLYYFKPPKNNLITGFTTIDGNKYYFDPTKSGAASIGEITIDGKDYYFNKQGILQVGVINTSDGLKYFAPAGTLDENLE
GESVNFIGKLNIDGKIYYFEDNYRAAVEWKLLDDETYYFNPKTGEALKGLHQIGDNKYYFDDNGIMQTGFITINDKVFYF
NNDGVMQVGYIEVNGKYFYFGKNGERQLGVFNTPDGFKFFGPKDDDLGTEEGELTLYNGILNFNGKIYFFDISNTAVVGW
GTLDDGSTYYFDDNRAEACIGLTVINDCKYYFDDNGIRQLGFITINDNIFYFSESGKIELGYQNINGNYFYIDESGLVLI
GVFDTPDGYKYFAPLNTVNDNIYGQAVKYSGLVRVNEDVYYFGETYKIETGWIENETDKYYFDPETKKAYKGINVVDDIK
YYFDENGIMRTGLISFENNNYYFNEDGKMQFGYLNIKDKMFYFGKDGKMQIGVFNTPDGFKYFAHQNTLDENFEGESINY
TGWLDLDGKRYYFTDEYIAATGSLTIDGYNYYFDPDTAELVVSE
>P0DUB4 3.4.22.-~~~tcsL~~~Cytotoxin-L~~~
MSLVNKAQLQKMAYVKFRIQEDEYVAILNALEEYHNMSESSVVEKYLKLKDINNLTDNYLNTYKKSGRNKALKKFKEYLT
MEVLELKNNSLTPVEKNLHFIWIGGQINDTAINYINQWKDVNSDYTVKVFYDSNAFLINTLKKTIVESATNNTLESFREN
LNDPEFDYNKFYRKRMEIIYDKQKHFIDYYKSQIEENPEFIIDNIIKTYLSNEYSKDLEALNKYIEESLNKITANNGNDI
RNLEKFADEDLVRLYNQELVERWNLAAASDILRISMLKEDGGVYLDVDMLPGIQPDLFKSINKPDSITDTSWEMIKLEAI
MKYKEYIPGYTSKNFDMLDEEVQSSFESALSSTSDKSEIFLPLDDIKVSPLEVKIAFANNSVINQALISLKDSYGSDLVI
SQIKNRYKILNDNLNPAINEGNDFNTTMKTFNDNLVSISNEDNIMFMIKIADYLKVGFAPDVRSTINLSGPGVYTGAYQD
LLMFKDNSINIHLLEPELRNFEFPKTKISQLTEQEITSLWSFNQARAKSQFEEYKKGYFEGALGEDDILDFSQNTVLDKD
YVVEKISSSMRTPNKEYVHYIVQLQGDNVSYEAACNLFAKNPYYNILFQKNIENSETAYYYNLIYNKLQEIDKYRIPNLI
SNRHKIKLTFIGHGKSEFNTDTFANLDVNSLSSEIETILNLAKEDISPKSIEINLLGCNMFSYNVNVEETYPGKLLLKIK
DIVSKLMPSISQDSITVSANQYEVRINKEGRRELLDHSGKWINKEESIIKDISSKEYISFNPKENKIIVKSKNLHELSTL
LQEIKNNSNSSDIDLEKKVMLTECEINVASNIDTQIVEERIEEAKNLTSDSINYIKNEFKLIESISDALYDLKHQNGLDD
SHFISFEDISKTENGFRIRFINKETGNSIFIETEKEIFSEYAAHISKEISNIKDTIFDNVNGKLVKKVNLDAAHEVNTLN
SAFFIQSLIEYNTTKESLSNLSVAMKVQVYAQLFSTGLNTITDASKVVELVSTALDETIDLLPTLSEGLPIIATIIDGVS
LGAAIKELSETNDPLLRQEIEAKIGIMAVNLTAASTAIVTSALGIASGFSILLVPLAGISAGIPSLVNNELILQDKATKV
IDYFKHISLAETEGAFTLLDDKIIMPQDDLVLSEIDFNNNSITLGKCEIWRTEGGSGHTFTDDIDHFFSSPSITYRKPWL
SIYDVLNIKKEKIDFSKDLMVLPNAPNRVFSYEMGWTPGFRSLDNDGTKLLDRIRDHYEGQFYWRYFAFIADALITKLKP
RYEDTNVRINLDGNTRSFIVPVITTEQIRKNLSYSFYGSGGSYSLSLSPYNMNIDLNLVENDTWVIDVDNVVKNITIESD
EIQKGELIENILSKLNIEDNKIVLNNHTINFYGAINESNRFISLTFSILEDINIIIEIDLVSKSYKILLSGNCIKLIENS
SDIQQKIDHIGFNGEHQKYIPYSYIDNETKYNGFIDYSKKEGLFTAEFSNESIIRNIYMPDSNNLFIYSSKDLKDIRIIN
KGDVKLLIGNYFKDNMKVSLSFTIEDTNTIKLNGVYLDENGVAQILKFMNNAKSALNTSNSLMNFLESINIKNIFYNNLD
PNIKFILDTNFIISGSNSIGQFELICDKDKNIQPYFIKFKIKETSYTLYAGNRQNLIVEPSYHLDDSGNISSTVINFSQK
YLYGIDRYVNKVIITPNLYTDEINITPVYKPNYICPEVIILDANYINEKINININDLSIRYVWDNDGSDLILIANSEEDN
QPQVKIRFVNVFKSDTAADKLSFNFSDKQDVSVSKIISTFSLAAYSDGVFDYEFGLVSLDNECFYINSFGNMVSGLIYIN
DSLYYFKPPKNNLITGFTTIDDNKYYFDPTKSGAASIGEITIDGKDYYFNKQGILQVGVINTSDGLKYFAPAGTLDENLE
GESVNFIGKLNIDGKIYYFEDNYRAAVEWKSLDGETYYFNPKTGEALKGLHQIGDNKYYFDNNGIMQTGFITINDKVFYF
NNDGVMQVGYIEVNGKYFYFGKNGERQLGVFNTPDGFKFFGPKDDDLGTEEGELTLYNGILNFNGKIYFFDISNTAVVGW
GILDDGSTYYFDDNTAEACIGLTVINDCKYYFDDNGIRQLGFITINDNIFYFSESGKIELGYQNINGNYFYIDESGLVLI
GVFDTPDGYKYFAPLNTVNDNIYGQAVEYSGLVRLNEDVYYFGETYKIETGWIENETDKYYFDPETKKAYKGINVVDDIK
YYFDENGIMKTGLISFENNNYYFNEDGKMQFGYLNIKDKMFYFGKDGKMQIGVFNTPDGFKYFAHQNTLDENFEGESINY
TGWLDLDGKRYYFTDEYIAATSSLTIDGYNYYFDPDTAELVVSE
>T0D3N5 3.4.22.-~~~tcsL~~~Cytotoxin-L~~~
MNLVNKAQLQKMAYVKFRIQEDEYVAILNALEEYHNMSESSVVEKYLKLKDINNLTDNYLNTYKKSGRNKALKKFKEYLT
MEVLELKNNSLTPVEKNLHFIWIGGQINDTAINYINQWKDVNSDYTVKVFYDSNAFLINTLKKTIVESATNNTLESFREN
LNDPEFDYNKFYRKRMEIIYDKQKHFIDYYKSQIEENPEFIIDNIIKTYLSNEYSKDLEALNKYIEESLNKITANNGNDI
RNLEKFADEDLVRLYNQELVERWNLAAASDILRISMLKEDGGVYLDVDMLPGIQPDLFKSINKPDSITNTSWEMIKLEAI
MKYKEYIPGYTSKNFDMLDEEVQRSFESALSSKSDKSEIFLPLDDIKVSPLEVKIAFANNSVINQALISLKDSYCSDLVI
NQIKNRYKILNDNLNPSINEGTDFNTTMKIFSDKLASISNEDNMMFMIKITNYLKVGFAPDVRSTINLSGPGVYTGAYQD
LLMFKDNSTNIHLLEPELRNFEFPKTKISQLTEQEITSLWSFNQARAKSQFEEYKKGYFEGALGEDDNLDFAQNTVLDKD
YVSKKILSSMKTRNKEYIHYIVQLQGDKISYEASCNLFSKDPYSSILYQKNIEGSETAYYYSVADAEIKEIDKYRIPYQI
SNKRKIKLTFIGHGKSEFNTDTFANLDVDSLSSEIETILNLAKADISPKYIEINLLGCNMFSYSISAEETYPGKLLLKIK
DRVSELMPSISQDSITVSANQYEVRINEEGKREILDHSGKWINKEESIIKDISSKEYISFNPKENKIIVKSKYLHELSTL
LQEIRNNANSSDIDLEKKVMLTECEINVASNIDRQIVEGRIEEAKNLTSDSINYIKNEFKLIESISDSLYDLKHQNGLDD
SHFISFEDISKTENGFRIRFINKETGNSIFIETEKEIFSEYATHISKEISNIKDTIFDNVNGKLVKKVNLDAAHEVNTLN
SAFFIQSLIEYNTTKESLSNLSVAMKVQVYAQLFSTGLNTITDASKVVELVSTALDETIDLLPTLSEGLPIIATIIDGVS
LGAAIKELSETNDPLLRQEIEAKIGIMAVNLTAASTAIVTSALGIASGFSILLVPLAGISAGIPSLVNNELILQDKATKV
IDYFKHISLAETEGAFTLLDDKIIMPQDDLVLSEIDFNNNSITLGKCEIWRAEGGSGHTLTDDIDHFFSSPSITYRKPWL
SIYDVLNIKKEKIDFSKDLMVLPNAPNRVFGYEMGWTPGFRSLDNDGTKLLDRIRDHYEGQFYWRYFAFIADALITKLKP
RYEDTNVRINLDGNTRSFIVPVITTEQIRKNLSYSFYGSGGSYSLSLSPYNMNIDLNLVENDTWVIDVDNVVKNITIESD
EIQKGELIENILSKLNIEDNKIILNNHTINFYGDINESNRFISLTFSILEDINIIIEIDLVSKSYKILLSGNCMKLIENS
SDIQQKIDHIGFNGEHQKYIPYSYIDNETKYNGFIDYSKKEGLFTAEFSNESIIRNIYMPDSNNLFIYSSKDLKDIRIIN
KGDVKLLIGNYFKDDMKVSLSFTIEDTNTIKLNGVYLDENGVAQILKFMNNAKSALNTSNSLMNFLESINIKNIFYNNLD
PNIEFILDTNFIISGSNSIGQFELICDKDKNIQPYFIKFKIKETSYTLYVGNRQNLIVEPSYHLDDSGNISSTVINFSQK
YLYGIDRYVNKVIIAPNLYTDEINITPVYKPNYICPEVIILDANYINEKINVNINDLSIRYVWDNDGSDLILIANSEEDN
QPQVKIRFVNVFKSDTAADKLSFNFSDKQDVSVSKIISTFSLAAYSDGFFDYEFGLVSLDNDYFYINSFGNMVSGLIYIN
DSLYYFKPPKNNLITGFTTIDGNKYYFDPTKSGAASIGEITIDGKDYYFNKQGILQVGVINTSDGLKYFAPAGTLDENLE
GESVNFIGKLNIDGKIYYFEDNYRAAVEWKLLDDETYYFNPKTGEALKGLHQIGDNKYYFDDNGIMQTGFITINDKVFYF
NNDGVMQVGYIEVNGKYFYFGKNGERQLGVFNTPDGFKFFGPKDDDLGTEEGELTLYNGILNFNGKIYFFDISNTAVVGW
GTLDDGSTYYFDDNTAEACIGLTVINDCKYYFDDNGIRQLGFITINDNIFYFSESGKIELGYQNINGNYFYIDESGLVLI
GVFDTPDGYKYFAPLNTVNDNIYGQAVKYSGLVRVNEDVYYFGETYKIETGWIENETDKYYFDPETKKAYKGINVVDDIK
YYFDENGIMRTGLISFENNNYYFNEDGKMQFGYLNIKDKMFYFGKDGKMQIGVFNTPDGFKYFAHQNTLDENFEGESINY
TGWLDLDGKRYYFTDEYIAATGSLTIDGYNYYFDPDTAELVVSE
>P0CL17 ~~~tctD~~~Transcriptional regulatory protein TctD~~~
MRLLLAEDNRELAHWLEKALVQNGFAVDCVFDGLAADHLLHSEMYALAVLDINMPGMDGLEVVQRLRKRGQTLPVLLLTA
RSAVADRVKGLNVGADDYLPKPFELEELDARLRALLRRSAGQVHEVQQLGELIFHDEGYFLLQGQPLALTPREQALLTVL
MYRRTRPVSRQQLFEQVFSLNDEVSPESIELYIHRLRKKLQGSDVRITTLRGLGYVLERGDEVG
>P42199 ~~~tcyA~~~L-cystine-binding protein TcyA~~~COG0834
MKKALLALFMVVSIAALAACGAGNDNQSKDNAKDGDLWASIKKKGVLTVGTEGTYEPFTYHDKDTDKLTGYDVEVITEVA
KRLGLKVDFKETQWDSMFAGLNSKRFDVVANQVGKTDREDKYDFSDKYTTSRAVVVTKKDNNDIKSEADVKGKTSAQSLT
SNYNKLATNAGAKVEGVEGMAQALQMIQQGRVDMTYNDKLAVLNYLKTSGNKNVKIAFETGEPQSTYFTFRKGSGEVVDQ
VNKALKEMKEDGTLSKISKKWFGEDVSK
>P42200 ~~~tcyB~~~L-cystine transport system permease protein TcyB~~~COG0765
MFLNNLPALTLGTAIPWDLVQQSFWPILSGGIYYTIPLTILSFIFGMILALITALARMSKVRPLRWVFSVYVSAIRGTPL
LVQLFIIFYLFPAFNVTLDPFPSAVIAFSLNVGAYASEIIRASILSVPKGQWEAGYTIGMTHQKTLFRVILPQAFRVSIP
PLSNTFISLIKDTSLASQILVAELFRKAQEIGARNLDQILVIYIEAAFIYWIICFLLSLVQHVIERRLDRYVAK
>P39456 7.4.2.-~~~tcyC~~~L-cystine import ATP-binding protein TcyC~~~COG1126
MLTVKGLNKSFGENEILKKIDMKIEKGKVIAILGPSGSGKTTLLRCLNALEIPNRGELAFDDFSIDFSKKVKQADILKLR
RKSGMVFQAYHLFPHRTALENVMEGPVQVQKRNKEEVRKEAIQLLDKVGLKDKMDLYPFQLSGGQQQRVGIARALAIQPE
LMLFDEPTSALDPELVGEVLKVIKDLANEGWTMVVVTHEIKFAQEVADEVIFIDGGVIVEQGPPEQIFSAPKEERTQRFL
NRILNPL
>O34406 ~~~tcyJ~~~L-cystine-binding protein TcyJ~~~COG0834
MNKRKGLVLLLSVFALLGGGCSQTNNKTDRQAQTVIVGTGTDFPNIAFLNEKGELTGYDIEVMKAIDKELPQYTFEFKTM
DFSNLLTSLGNKKIDVIAHNMAKNKEREKRFLYHKVPYNYSPMYITVKEDNHKIHTLKDLHGKTVIVGATSNAADYITKY
NKTHGSPIHLKYAGQGSNDTANQIETGRADATIATPFAVDFQNKTHAFRQKTVGDVLLDTEVYFMFNKGSQTLADDTDQA
IKKLEKNGTLKKLSRKWLGADYSKSSFEK
>P0AEM9 ~~~tcyJ~~~L-cystine-binding protein TcyJ~~~COG0834
MKLAHLGRQALMGVMAVALVAGMSVKSFADEGLLNKVKERGTLLVGLEGTYPPFSFQGDDGKLTGFEVEFAQQLAKHLGV
EASLKPTKWDGMLASLDSKRIDVVINQVTISDERKKKYDFSTPYTISGIQALVKKGNEGTIKTADDLKGKKVGVGLGTNY
EEWLRQNVQGVDVRTYDDDPTKYQDLRVGRIDAILVDRLAALDLVKKTNDTLAVTGEAFSRQESGVALRKGNEDLLKAVN
DAIAEMQKDGTLQALSEKWFGADVTK
>O34852 ~~~tcyK~~~L-cystine-binding protein TcyK~~~COG0834
MKTKTAFMAILFSLITVLSACGAGSQTTGAGQKKVQTITVGTGTQFPNICFIDEKGDLTGYDVELIKELDKRLPHYKFTF
KTMEFSNLLVSLGQHKVDIVAHQMEKSKEREKKFLFNKVAYNHFPLKITVLQNNDTIRGIEDLKGKRVITSATSNGALVL
KKWNEDNGRPFEIAYEGQGANETANQLKSGRADATISTPFAVDFQNKTSTIKEKTVGNVLSNAKVYFMFNKNEQTLSDDI
DKALQEIIDDGTLKRLSLKWLGDDYSKEQY
>O34315 ~~~tcyL~~~L-cystine transport system permease protein TcyL~~~COG0765
MEKAFDMNMIGDFVPTLTAYLPVTLYILTLSLLFGFVLGLFLALPRIYNIPIVNQLAKVYISFFRGTPIMVQLFIVFYGI
PALTGLIGIDTSKMDPFYAAVATYALSNAAAAAEIIRAGVGSVDKGQTEAAYSIGLSGSQAFRRIVLPQALVQAFPNMGN
MVISSLKDTSLAFSIGVMDMSGRGQTLITSSNHSLEVYIALSIVYYAVAVLFEWFFRVAEKRIKKNQTRIVTVFDMNIH
>P0AFT2 ~~~tcyL~~~L-cystine transport system permease protein TcyL~~~COG0765
MQESIQLVIDSLPFLLKGAGYTLQLSIGGMFFGLLLGFILALMRLSPIWPVRWLARFYISIFRGTPLIAQLFMIYYGLPQ
FGIELDPIPSAMIGLSLNTAAYAAETLRAAISSIDKGQWEAAASIGMTPWQTMRRAILPQAARVALPPLSNSFISLVKDT
SLAATIQVPELFRQAQLITSRTLEVFTMYLAASLIYWIMATVLSTLQNHFENQLNRQEREPK
>O34931 ~~~tcyM~~~L-cystine transport system permease protein TcyM~~~COG0765
MQFDFPFIVSAMKEMVKTIPLTLMMAVLPIVFGFLVALGNIIVRIFRIKGLVACSRFYVSFFRSTPAILHIMLIYLGIPP
VADQVSSFFHLGWSANEIPVSMFVIMALSLTAGAYLTEIIRSGILAMDTGQVEAAYSIGLTYSQTFRRVILPQALKVSIP
NFTNLGIGFLHTTSIAAIVAVPEITGTATIVASDNYAFLEAFIGAAIIYWVLTLILESANGVLERKAARFQGGTL
>O34900 7.4.2.-~~~tcyN~~~L-cystine import ATP-binding protein TcyN~~~COG1126
MIEIKNIHKQFGIHHVLKGINLTVRKGEVVTIIGPSGSGKTTFLRCLNLLERPDEGIISIHDKVINCRFPSKKEVHWLRK
QTAMVFQQYHLFAHKTVIENVMEGLTIARKMRKQDAYAVAENELRKVGLQDKLNAYPSQLSGGQKQRVGIARALAIHPDV
LLFDEPTAALDPELVGEVLEVMLEIVKTGATMIVVTHEMEFARRVSDQVVFMDEGVIVEQGTPEEVFRHTKKDRTRQFLR
RVSPEYLFEPKEHIKEPVI
>P37774 7.4.2.12~~~tcyN~~~L-cystine transport system ATP-binding protein TcyN~~~COG1126
MSAIEVKNLVKKFHGQTVLHGIDLEVKPGEVVAIIGPSGSGKTTLLRSINLLEQPEAGTITVGDITIDTARSLSQQKSLI
RQLRQHVGFVFQNFNLFPHRTVLENIIEGPVIVKGEPKEEATARARELLAKVGLAGKETSYPRRLSGGQQQRVAIARALA
MRPEVILFDEPTSALDPELVGEVLNTIRQLAQEKRTMVIVTHEMSFARDVADRAIFMDQGRIVEQGAAKALFADPEQPRT
RQFLEKFLLQ
>P54596 ~~~tcyP~~~L-cystine uptake protein TcyP~~~COG1823
METLLVVLHVFILFLLILGLFVMQKKHVSFSKRVFTALGLGIVFGFALQLIYGPTSNIVIQTADWFNIAGGGYVKLLQMV
VMPLVFISILGAFTKLKLTKNLGKISGLIIGILVATTAVAAAVGIASALSFDLQAIQVDQGSTELSRGQELQQKSEDMTA
KTLPQQIVELLPGNPFLDFTGARPTSTIAVVIFAAFLGVAFLGVKHKQPEQAETFKKLVDAVYAIVMRVVTLILRLTPYG
VLAIMTKTIATSDLDSILKLGMFVIASYAALITMFIIHLLLLTFSGLNPVIYLKKAVPVLVFAFTSRSSAGALPLNIKTQ
RSMGVPEGIANFAGSFGLSIGQNGCAGIYPAMLAMMIAPTVGQNPFDPVFIITVIAVVAISSFGVAGVGGGATFAALLVL
SSLNMPVALAGLLISIEPLIDMGRTALNVSGSMTSGLITSKVTKEIDQGAFHDQSRVIEAEEA
>P77529 ~~~tcyP~~~L-cystine transporter TcyP~~~COG1823
MNFPLIANIVVFVVLLFALAQTRHKQWSLAKKVLVGLVMGVVFGLALHTIYGSDSQVLKDSVQWFNIVGNGYVQLLQMIV
MPLVFASILSAVARLHNASQLGKISFLTIGTLLFTTLIAALVGVLVTNLFGLTAEGLVQGGAETARLNAIESNYVGKVSD
LSVPQLVLSFIPKNPFADLTGANPTSIISVVIFAAFLGVAALKLLKDDAPKGERVLAAIDTLQSWVMKLVRLVMQLTPYG
VLALMTKVVAGSNLQDIIKLGSFVVASYLGLLIMFAVHGILLGINGVSPLKYFRKVWPVLTFAFTSRSSAASIPLNVEAQ
TRRLGVPESIASFAASFGATIGQNGCAGLYPAMLAVMVAPTVGINPLDPMWIATLVGIVTVSSAGVAGVGGGATFAALIV
LPAMGLPVTLVALLISVEPLIDMGRTALNVSGSMTAGTLTSQWLKQTDKAILDSEDDAELAHH
>P0AGF6 4.3.1.19~~~tdcB~~~L-threonine dehydratase catabolic TdcB~~~COG1171
MHITYDLPVAIDDIIEAKQRLAGRIYKTGMPRSNYFSERCKGEIFLKFENMQRTGSFKIRGAFNKLSSLTDAEKRKGVVA
CSAGNHAQGVSLSCAMLGIDGKVVMPKGAPKSKVAATCDYSAEVVLHGDNFNDTIAKVSEIVEMEGRIFIPPYDDPKVIA
GQGTIGLEIMEDLYDVDNVIVPIGGGGLIAGIAVAIKSINPTIRVIGVQSENVHGMAASFHSGEITTHRTTGTLADGCDV
SRPGNLTYEIVRELVDDIVLVSEDEIRNSMIALIQRNKVVTEGAGALACAALLSGKLDQYIQNRKTVSIISGGNIDLSRV
SQITGFVDA
>P11954 4.3.1.19~~~tdcB~~~L-threonine dehydratase catabolic TdcB~~~
MHITYDLPVAIEDILEAKKRLAGKIYKTGMPRSNYFSERCKGEIFLKFENMQRTGSFKIRGAFNKLSSLTEAEKRKGVVA
CSAGNHAQGVSLSCAMLGIDGKVVMPKGAPKSKVAATCDYSAEVVLHGDNFNDTIAKVSEIVETEGRIFIPPYDDPKVIA
GQGTIGLEIMEDLYDVDNVIVPIGGGGLIAGIAIAIKSINPTIKVIGVQAENVHGMAASYYTGEITTHRTTGTLADGCDV
SRPGNLTYEIVRELVDDIVLVSEDEIRNSMIALIQRNKVITEGAGALACAALLSGKLDSHIQNRKTVSIISGGNIDLSRV
SQITGLVDA
>Q7A5L8 4.3.1.19~~~tdcB~~~L-threonine dehydratase catabolic TdcB~~~
MTTNTVTLQTAHIVSLGDIEEAKASIKPFIRRTPLIKSMYLSQNITKGNVYLKLENMQFTGSFKFRGASNKINHLSDEQK
AKGIIGASAGNHAQGVALTAKLLGIDATIVMPETAPIAKQNATKGYGAKVILKGKNFNETRLYMEELAKENGMTIVHPYD
DKFVMAGQGTIGLEILDDIWNVNTVIVPVGGGGLIAGIATALKSFNPSIHIIGVQAENVHGMAESFYKRALTEHREDSTI
ADGCDVKVPGEKTYEVVKHLVDEFILVSEEEIEHAMQDLMQRAKIITEGAGALPTAAILSGKIDKKWLEGKNVVALVSGG
NVDLTRVSGVIEHGLNIADTSKGVVG
>P0AAD8 ~~~tdcC~~~Threonine/serine transporter TdcC~~~COG0814
MSTSDSIVSSQTKQSSWRKSDTTWTLGLFGTAIGAGVLFFPIRAGFGGLIPILLMLVLAYPIAFYCHRALARLCLSGSNP
SGNITETVEEHFGKTGGVVITFLYFFAICPLLWIYGVTITNTFMTFWENQLGFAPLNRGFVALFLLLLMAFVIWFGKDLM
VKVMSYLVWPFIASLVLISLSLIPYWNSAVIDQVDLGSLSLTGHDGILITVWLGISIMVFSFNFSPIVSSFVVSKREEYE
KDFGRDFTERKCSQIISRASMLMVAVVMFFAFSCLFTLSPANMAEAKAQNIPVLSYLANHFASMTGTKTTFAITLEYAAS
IIALVAIFKSFFGHYLGTLEGLNGLVLKFGYKGDKTKVSLGKLNTISMIFIMGSTWVVAYANPNILDLIEAMGAPIIASL
LCLLPMYAIRKAPSLAKYRGRLDNVFVTVIGLLTILNIVYKLF
>P11868 2.7.2.15~~~tdcD~~~Propionate kinase~~~COG0282
MNEFPVVLVINCGSSSIKFSVLDASDCEVLMSGIADGINSENAFLSVNGGEPAPLAHHSYEGALKAIAFELEKRNLNDSV
ALIGHRIAHGGSIFTESAIITDEVIDNIRRVSPLAPLHNYANLSGIESAQQLFPGVTQVAVFDTSFHQTMAPEAYLYGLP
WKYYEELGVRRYGFHGTSHRYVSQRAHSLLNLAEDDSGLVVAHLGNGASICAVRNGQSVDTSMGMTPLEGLMMGTRSGDV
DFGAMSWVASQTNQSLGDLERVVNKESGLLGISGLSSDLRVLEKAWHEGHERAQLAIKTFVHRIARHIAGHAASLRRLDG
IIFTGGIGENSSLIRRLVMEHLAVLGLEIDTEMNNRSNSCGERIVSSENARVICAVIPTNEEKMIALDAIHLGKVNAPAE
FA
>O06961 2.7.2.15~~~tdcD~~~Propionate kinase~~~
MNEFPVVLVINCGSSSIKFSVLDVATCDVLMAGIADGMNTENAFLSINGDKPINLAHSNYEDALKAIAFELEKRDLTDSV
ALIGHRIAHGGELFTQSVIITDEIIDNIRRVSPLAPLHNYANLSGIDAARHLFPAVRQVAVFDTSFHQTLAPEAYLYGLP
WEYFSSLGVRRYGFHGTSHRYVSRRAYELLDLDEKDSGLIVAHLGNGASICAVRNGQSVDTSMGMTPLEGLMMGTRSGDV
DFGAMAWIAKETGQTLSDLERVVNKESGLLGISGLSSDLRVLEKAWHEGHERARLAIKTFVHRIARHIAGHAASLHRLDG
IIFTGGIGENSVLIRQLVIEHLGVLGLTLDVEMNKQPNSHGERIISANPSQVICAVIPTNEEKMIALDAIHLGNVKAPVE
FA
>P42632 2.3.1.-~~~tdcE~~~PFL-like enzyme TdcE~~~COG1882
MKVDIDTSDKLYADAWLGFKGTDWKNEINVRDFIQHNYTPYEGDESFLAEATPATTELWEKVMEGIRIENATHAPVDFDT
NIATTITAHDAGYINQPLEKIVGLQTDAPLKRALHPFGGINMIKSSFHAYGREMDSEFEYLFTDLRKTHNQGVFDVYSPD
MLRCRKSGVLTGLPDGYGRGRIIGDYRRVALYGISYLVRERELQFADLQSRLEKGEDLEATIRLREELAEHRHALLQIQE
MAAKYGFDISRPAQNAQEAVQWLYFAYLAAVKSQNGGAMSLGRTASFLDIYIERDFKAGVLNEQQAQELIDHFIMKIRMV
RFLRTPEFDSLFSGDPIWATEVIGGMGLDGRTLVTKNSFRYLHTLHTMGPAPEPNLTILWSEELPIAFKKYAAQVSIVTS
SLQYENDDLMRTDFNSDDYAIACCVSPMVIGKQMQFFGARANLAKTLLYAINGGVDEKLKIQVGPKTAPLMDDVLDYDKV
MDSLDHFMDWLAVQYISALNIIHYMHDKYSYEASLMALHDRDVYRTMACGIAGLSVATDSLSAIKYARVKPIRDENGLAV
DFEIDGEYPQYGNNDERVDSIACDLVERFMKKIKALPTYRNAVPTQSILTITSNVVYGQKTGNTPDGRRAGTPFAPGANP
MHGRDRKGAVASLTSVAKLPFTYAKDGISYTFSIVPAALGKEDPVRKTNLVGLLDGYFHHEADVEGGQHLNVNVMNREML
LDAIEHPEKYPNLTIRVSGYAVRFNALTREQQQDVISRTFTQAL
>P0AGL2 3.5.4.-~~~tdcF~~~Putative reactive intermediate deaminase TdcF~~~COG0251
MKKIIETQRAPGAIGPYVQGVDLGSMVFTSGQIPVCPQTGEIPADVQDQARLSLENVKAIVVAAGLSVGDIIKMTVFITD
LNDFATINEVYKQFFDEHQATYPTRSCVQVARLPKDVKLEIEAIAVRSA
>P42630 4.3.1.17~~~tdcG~~~L-serine dehydratase TdcG~~~COG1760
MISAFDIFKIGIGPSSSHTVGPMNAGKSFIDRLESSGLLTATSHIVVDLYGSLSLTGKGHATDVAIIMGLAGNSPQDVVI
DEIPAFIELVTRSGRLPVASGAHIVDFPVAKNIIFHPEMLPRHENGMRITAWKGQEELLSKTYYSVGGGFIVEEEHFGLS
HDVETSVPYDFHSAGELLKMCDYNGLSISGLMMHNELALRSKAEIDAGFARIWQVMHDGIERGMNTEGVLPGPLNVPRRA
VALRRQLVSSDNISNDPMNVIDWINMYALAVSEENAAGGRVVTAPTNGACGIIPAVLAYYDKFRRPVNERSIARYFLAAG
AIGALYKMNASISGAEVGCQGEIGVACSMAAAGLTELLGGSPAQVCNAAEIAMEHNLGLTCDPVAGQVQIPCIERNAINA
VKAVNAARMAMRRTSAPRVSLDKVIETMYETGKDMNDKYRETSRGGLAIKVVCG
>Q2EHL7 ~~~tdeA~~~Toxin and drug export protein A~~~COG1538
MFTIKKLTLTIVVATTLTGCANIGDSYRASLKNYKQYEEITKQYNIKNDWWKLYKDAQLNRVVEKALLNNKDLAKATISV
NRALYSANLAGANLVPAFSGSTRSTAQKNIKTGGNSTISHTGSLNVSYTLDLWFRLADTADAAEWAHKATVQDMESTKLS
LINSVVTTYYQIAYLNDAISTTKESIKYYTDISNIMRNRLAQGVADSISVDQAQQAVLTARNNLITYQLNRKTAEQTLRN
LLNLKPDETLKITFPHILKVKSVGVNLNVPVSVIANRPDIKGYQARLSSAFKNVKATEKSWFPEITLGGSLNSSGKKLNS
ATNTLIGGGALGISLPFLNWNTVKWNVKISEADYETARLNYEKSITVALNDVDTNYFSFTQAKKRFTNAQKTYIYNQRIT
QYYRNRYNAGVSELREWLTAANTEKNSQLSILQAKYNVIQAENAVYSSMAGYYSVKK
>Q7VNU1 ~~~tdhA~~~TonB-dependent heme receptor A~~~COG1629
MKMKKQCATLTFFIGLHGYTIAEDNPKNISLSVITVPGHHERQPDRSIITQNEIDQKQSDNVADLVNTVPGVSMAGGFRP
SGQTLNIRGMGDTEDIRVQVDGATKNFEKYQQGSLFIEPELLRRVSIDKGNHYPQYGNGGFAGTIKLETKNAKDFLQENQ
LLGGLLKYGYNTNNNQRTFSGAIFMQNDQKNIDALVYATVRRAHDYKRADKTPIKYSANNQANFLAKVNWWLTPSQLLAF
SKVHGNHNGWEPFAAKRDLLPGPTEAEITKYGLDLAWKRKLVAREQQDRSYSLQYQFLPENNPWINTVAQLSHSSTYQHD
TRSEQASKTYLASLGNESWTRYTDLTFDVNNTSLFNVAKTSHTLLVGLQWVKHKRQTLIFDPSKLQKAEYNHGYFQPSYM
PSGHQYTHAFYAQDKIKIHNLTVSIGARYDYVKNIGKPNIATIYNDPTAGHDYSSKHYPGWSSYLGLNYKLTPYLNLFSN
ISNTWRAPVIDEQYETQYAKATLSPTASSLDLKKERITQLRVGKQIHFDHILSNNDQLSFNSTFFYYKGKDEIFKTRGVR
CFESAQNNNNEVCSKKIGNYRNLPGYQIKGFELEANYDSTYWFTNLSYSHTIGKRLASPRNPWLASTSWIAEIPPRKAVV
TLGSHIPDTNLTLGWKSEFVRRQDRSPTDQDKDAGHWALPKSSGYALHGIFATWQPKQIKHLRIQFTVDNLLNRSYRPYL
SELAAGTGRNIKLSISKQF
>Q2T9E1 1.1.1.103~~~tdh~~~L-threonine 3-dehydrogenase~~~
MKALAKLERGPGLTLTRVKKPEVGHNDVLIKIRRTAICGTDIHIWKWDDWAQKTIPVPMHVGHEYVGEIVEMGQEVRGFS
IGDRVSGEGHITCGFCRNCRAGRRHLCRNTVGVGVNREGAFAEYLAIPAFNAFKIPPEISDDLAAIFDPFGNATHTALSF
NLVGEDVLITGAGPIGVMAVAIAKHVGARNVVITDINDYRLDLARRMGATRAVNVSRESLRDVMADLHMTEGFDVGLEMS
GVPSAFTSLLESMNHGGKVALLGIPPAQTAIDWNQVIFKGLEIKGIYGREMFETWYKMVAMLQSGLDLSPIITHRFAVDD
YEKGFAAMLSGESGKVILDWAAA
>P07913 1.1.1.103~~~tdh~~~L-threonine 3-dehydrogenase~~~COG1063
MKALSKLKAEEGIWMTDVPVPELGHNDLLIKIRKTAICGTDVHIYNWDEWSQKTIPVPMVVGHEYVGEVVGIGQEVKGFK
IGDRVSGEGHITCGHCRNCRGGRTHLCRNTIGVGVNRPGCFAEYLVIPAFNAFKIPDNISDDLAAIFDPFGNAVHTALSF
DLVGEDVLVSGAGPIGIMAAAVAKHVGARNVVITDVNEYRLELARKMGITRAVNVAKENLNDVMAELGMTEGFDVGLEMS
GAPPAFRTMLDTMNHGGRIAMLGIPPSDMSIDWTKVIFKGLFIKGIYGREMFETWYKMAALIQSGLDLSPIITHRFSIDD
FQKGFDAMRSGQSGKVILSWD
>Q5SKS4 1.1.1.103~~~tdh~~~L-threonine 3-dehydrogenase~~~COG1063
MRALAKLAPEEGLTLVDRPVPEPGPGEILVRVEAASICGTDLHIWKWDAWARGRIRPPLVTGHEFSGVVEAVGPGVRRPQ
VGDHVSLESHIVCHACPACRTGNYHVCLNTQILGVDRDGGFAEYVVVPAENAWVNPKDLPFEVAAILEPFGNAVHTVYAG
SGVSGKSVLITGAGPIGLMAAMVVRASGAGPILVSDPNPYRLAFARPYADRLVNPLEEDLLEVVRRVTGSGVEVLLEFSG
NEAAIHQGLMALIPGGEARILGIPSDPIRFDLAGELVMRGITAFGIAGRRLWQTWMQGTALVYSGRVDLSPLLTHRLPLS
RYREAFGLLASGQAVKVILDPKA
>O87940 ~~~tdiR~~~Transcriptional regulatory protein TdiR~~~
MQATKTGNASTVFVVDDEASVRDSLTWLLNSISLDVRTFESAKDFLDADISCTHGCVVLDVRMQNVSGLQLQQALSERGF
KLPIIFLSAYGDAQMGAQAVKKGAFDFLQKPYRNQDLLDAVNAALALNREMADKQNEKQKHLDLLATLSQREMEILDKVV
AGSSSKEIAKLLGISYKTVEAHRGRIISKLGLKSTGDLMHFVMRGSSHCSDCGRQPLPGSSPCRPAA
>Q5LT52 4.1.2.32~~~tdm~~~Trimethylamine-oxide aldolase~~~COG0404
MLDTKYPEIIPGPPKPSAILQPRVFSLPPGTERYVVPGAGAILLRLETGDRIEIENTEGGQPCEIVCADLLGGFDAGLIG
ARAQGAPSGLMALLTSDDPSLRGLRMGIEARGLNLSQAGAVHLFEHSTPAGTTESFTASREGIVIIAAPGGIMDFQAQDT
ATPLTVFIRRAVLKSAARFELPDPLADPVADIRVHSATAEAYFVKAGDYIQILDVDGRQCTDFQCFAARKLDKGIAHALD
VTTTRTLMGHAYPMPGLHAKYYDQEMQPLVEVIQDTCGRHDAFALACAAKYYDDIGYPGHINCSDNFNAALAPHGVAGRA
GWMAINFFFNTGLDDHGVMYADEPWSRPGDYVLLRALTDLVCVSSACPDDTSAANGWNPTDIHVRTYSGKETFQRAVAYR
PTPDAEPKMTKQTGFHDRFARFTENFIEYNGFWLANCMSTAGPIEEYHACREKCVVLDLSALRKFEITGPDSEALCQYIF
TRNMKTLPVGGVVYTAMCYPHGGMIDDGTVFRLGKDNFRWIGGSDYGGEWIREKAAELGLKVLVRSSTDMQHNIAVQGPE
SRELLKKVIWTAPHQPKFEELGWFRFAPARIGDDQGVPVVVSRTGYTGELGYEIFCHPKHAGAVFDAVWEAGQAHGIRPM
GLEALDMVRIEAGLIFAGYDFSDQTDPFEAGIGFTVPLKSKPDDFIGREALIRRKEHPARVLVGLDIDSNVDVGHGDCVH
IGRAQIGEVTSSMRSPILGKNIALARVDVAHHEVGTRVEIGKLDGHQKRLPATIVPFAHYDPQKTRPRS
>O67024 1.11.1.24~~~~~~Peroxiredoxin~~~COG0450
MEVVSLPRLGEPAPAFEAQTTFGPVKFPDDFKGQWVVLFSHPADFTPVCTTEFVAFAKNYEEFKKRNVQLIGLSVDSNFS
HIAWVMNIKEKFGIEIPFPIIADHNMEVAKKYGMIHPAQSTTFTVRALFVIDDKGILRAMIYYPLTTGRNIREVIRLVDA
LQTADREGVATPADWVPEPQTWEFTEENTKVIVPPPTTYEDAVKRLQEGYECADWYICKKKV
>E1VBK1 ~~~teaA~~~Ectoine-binding periplasmic protein TeaA~~~COG1638
MKAYKLLTTASIGALMLGMSTAAYSDNWRYAHEEYEGDVQDVFAQAFKGYVEDNSDHTVQVYRFGELGESDDIMEQTQNG
ILQFVNQSPGFTGSLIPSAQIFFIPYLMPTDMDTVLEFFDESKAINEMFPKLYAEHGLELLKMYPEGEMVVTADEPITSP
EDFDNKKIRTMTNPLLAETYKAFGATPTPLPWGEVYGGLQTGIIDGQENPIFWIESGGLYEVSPNLTFTSHGWFTTAMMA
NQDFYEGLSEEDQQLVQDAADAAYDHTIEHIKGLSEESLEKIKAASDEVTVTRLNDEQIQAFKERAPQVEEKFIEMTGEQ
GQELLDQFKADLKAVQSESEG
>E1VBK2 ~~~teaB~~~Ectoine TRAP transporter small permease protein TeaB~~~COG3090
MTDEEEAEKHYHSGLPGILGTIDTLISKLEAIILALGVLLMATNTVANVIGRFALGESLFFTGEVNRILIIMITFAGIGY
AARHGRHIRMSAIYDALPVGGRRALMIVISLFTSLVMFFLMYYSVHYVLDLYDKGRILPALGFPIFIIYVWVPLGFLITG
IQYLFTAIKNLTSRDVYLSTSVVDGYKDTETEV
>E1VBK3 ~~~teaC~~~Ectoine TRAP transporter large permease protein TeaC~~~COG1593
MTTIMVATMIGLLLLGFPMMIPLATASIIGFFMMFGGLGQMETLIQQLMAGIRPASLIAVPMFILAADIMTRGQSANRLI
NMVMAFIGHIKGGLAVSTAASCTLFGAVSGSTQATVVAVGSPLRPRMLKAGYSDSFSLALIINSSDIAFLIPPSIGMIIY
GIISGTSIGELFIAGIGPGLMILVMFAIYCVIYAIVRGVPTEPKASWGERFSAVRLALWPLGFPVIIIGGIYGGIFSPTE
AAAACVLYAVLLEFVVFRSLKISDIYAIAKSTGLITAVVFILVAVGNSFSWIISFAQIPQAILEAVGINEAGPTGVLIAI
CVAFFVACMFVDPIVVILVLTPVFAPAIEATGLDPVLVGILITLQVAIGSATPPFGCDIFTAIAIFKRPYLDVIKGTPPF
IFMLVLAAALLILFPQIALFLRDLAFR
>E1VBK4 ~~~teaD~~~TRAP-T-associated universal stress protein TeaD~~~COG0589
MFNRIMVPVDGSKGAVKALEKGVGLQQLTGAELYILCVFKHHSLLEASLSMVRPEQLDIPDDALKDYATEIAVQAKTRAT
ELGVPADKVRAFVKGGRPSRTIVRFARKRECDLVVIGAQGTNGDKSLLLGSVAQRVAGSAHCPVLVV
>O24676 1.14.12.26~~~tecA1~~~Chlorobenzene dioxygenase subunit alpha~~~
MNHTDTSPIKLRKNWNAREMQALFDERAGRTDPRIYTDEDLYQIELERVFGRSWLLLGHETQIKKPGDYTTNYMGEDPVL
VVRQKDGSIAVFLNQCRHRGMRICRSDAGNAKAFTCSYHGWAYDTAGNLVNVPFEAESFPCLDKKEWSPLKARVATYKGL
IFANWDHDAPDLDTYLGEAKFYMDHMLDRTEAGTEAIPGVQKWVIPCNWKLAAEQFCWDAYHAATTAHLSGILAGLPDGV
ELADLAPPTVGKQYRAPWGGHGSGFFIGEPDLLLAIMGPKITSYWTEGPASEKAAQRLGSVERGSKLTVEHMTVFPTCSF
LLGANTVRTWHPRGPNEVEVWAFTVVDADAPDDIKEEFRRQTVRTFSAGGVFEQDDGENWVEIQHVLRGHKARSRPFNAE
MSMGQTIDDDPVYPGRISNVYSDEAARGFYAQWLRMMTSSDWAALNATR
>O24677 1.14.12.26~~~tecA2~~~Chlorobenzene dioxygenase subunit beta~~~
MLDSVKRADVFLRKPAPVAPELQHEIEQFYYWEAKLLNDRRFEEWFALLAADIHYFMPIRTTRIMRDARLEYSGTGEHAH
FDDDAAMMKGRLRKVTSDVGWSENPASRTRHLVSNVMIADGPVEGEYEISSAFIVYRNRLERQLDIFAGERRDTLRRNKT
ETGFEIVNRTILIDQSTILANNLSFFF
>O24678 ~~~tecA3~~~Chlorobenzene dioxygenase, ferredoxin component~~~
MAWTYIMRQSDLPPGEMQRHEGGPEPVMVCNVDGEFFAVQDTCTHGNWALSDGYLDGGVVECTLHFGKFCVRTGKVKALP
ACKPIKVFPIKVEGGDVHVDLDAGEVK
>O24679 1.18.1.3~~~tecA4~~~Chlorobenzene dioxygenase, ferredoxin reductase component~~~
MATHVAIIGNGVAGFTTAQALRAEGFEGRISLIGNEPHLPYDRPSLSKAVLGGSLEHPPVLAEADWYGEARIDMLSGRSV
TNLNVDARTISLDDGSTFAADAIVIATGSRARTLALPGSQLTGVVTLRTNDDVRPLCRGWTPATRLVIAGGGLIGCEVAT
TARKLGLAVTILESADELLVRVLGRRIGAWLRGLLTELGVRVELGTGVAGFSGDDRLEEVLASDGRRFAADNALVCIGAE
PEDQLARQAGLSCDRGVIVDDHGATHAEGVFAVGDAASWPLRDGGRRSLETYMNAQRQAAAVAAAILGKHGSAPQVPVSW
TEIAGHRMQMAGDIEGPGEFVLRGTLGDGAALLFRLRDGRIQAVVAVDAPRDFAMAARLVEARAAIEPARLADFSNSMRD
LVRAQQGDSA
>O69264 1.3.1.119~~~tecB~~~Chlorobenzene dihydrodiol dehydrogenase~~~
MKLKGEVALVTGGGAGLGRAIVDRYVAEGARVAVLDKSAAGLEEIRKRHGDAVVGIEGDVRSLDSHREAVARCVETFGKL
DCLIGNAGVWDYQTQLADIPDNGISEAFDEMFAIIVKGYILAAKAALPALYKSKGSAIFTVSNAGFYPGGGGVLYTAGKH
AVIGLVKQLAHEWGPRIRVNGIAPGGILGSDIRGLKTLGLQDQTIATMPLADMLGPVLPTGRVATAEEYAGAYVFFATRA
DTVPLTGSVLNIDGGMGVRGLFEASLGAQLDKHFA
>P18481 ~~~tee6~~~Trypsin-resistant surface T6 protein~~~
MLACLAILAVVGLGMTRVSALSKDDTAQLKITNIEGGPTVTLYKIGEGVYNTNGDSFINFKYAEGVSLTETGPTSQEITT
IANGINTGKIKPFSTENVSISNGTATYNARGASVYIALLTGATDGRTYNPILLAASYNGEGNLVTKNIDSKSNYLYGQTS
VAKSSLPSITKKVTGTIDDVNKKTTSLGSVLSYSLTFELPSYTKEAVNKTVYVSDNMSEGLTFNFNSLTVEWKGKMANIT
EDGSVMVENTKIGIAKEVNNGFNLSFIYDSLESISPNISYKAVVNNKAIVGEEGNPNKAEFFYSNNPTKGNTYDNLDKKP
DKGNGITSKEDSKIVYTYQIAFRKVDSVSKTPLIGAIFGVYDTSNKLIDIVTTNKNGYAISTQVSSGKYKIKELKAPKGY
SLNTETYEITANWVTATVKTSANSKSTTYTSDKNKATDNSEQVGWLKNGIFYSIDSRPTGNDVKEAYIESTKALTDGTTF
SKSNEGSGTVLLETDIPNTKLGELPSTGSIGTYLFKAIGSAAMIGAIGIYIVKRRKA
>P25396 ~~~tehA~~~Tellurite resistance protein TehA~~~COG1275
MQSDKVLNLPAGYFGIVLGTIGMGFAWRYASQVWQVSHWLGDGLVILAMIIWGLLTSAFIARLIRFPHSVLAEVRHPVLS
SFVSLFPATTMLVAIGFVPWFRPLAVCLFSFGVVVQLAYAAWQTAGLWRGSHPEEATTPGLYLPTVANNFISAMACGALG
YTDAGLVFLGAGVFSWLSLEPVILQRLRSSGELPTALRTSLGIQLAPALVACSAWLSVNGGEGDTLAKMLFGYGLLQLLF
MLRLMPWYLSQPFNASFWSFSFGVSALATTGLHLGSGSDNGFFHTLAVPLFIFTNFIIAILLIRTFALLMQGKLLVRTER
AVLMKAEDKE
>P44741 ~~~tehA~~~Tellurite resistance protein TehA homolog~~~COG1275
MLHFAHIFQNKVHTMNITKPFPLPTGYFGIPLGLAALSLAWFHLENLFPAARMVSDVLGIVASAVWILFILMYAYKLRYY
FEEVRAEYHSPVRFSFIALIPITTMLVGDILYRWNPLIAEVLIWIGTIGQLLFSTLRVSELWQGGVFEQKSTHPSFYLPA
VAANFTSASSLALLGYHDLGYLFFGAGMIAWIIFEPVLLQHLRISSLEPQFRATMGIVLAPAFVCVSAYLSINHGEVDTL
AKILWGYGFLQLFFLLRLFPWIVEKGLNIGLWAFSFGLASMANSATAFYHGNVLQGVSIFAFVFSNVMIGLLVLMTIYKL
TKGQFFLK
>P25397 2.1.1.265~~~tehB~~~Tellurite methyltransferase~~~COG0500
MIIRDENYFTDKYELTRTHSEVLEAVKVVKPGKTLDLGCGNGRNSLYLAANGYDVDAWDKNAMSIANVERIKSIENLDNL
HTRVVDLNNLTFDRQYDFILSTVVLMFLEAKTIPGLIANMQRCTKPGGYNLIVAAMDTADYPCTVGFPFAFKEGELRRYY
EGWERVKYNEDVGELHRTDANGNRIKLRFATMLARKK
>E1X791 2.1.1.-~~~tehB~~~Probable S-adenosyl-L-methionine-dependent methyltransferase TehB~~~
MKNELICYKQMPVWTKDKLPQMFQEKHNTKVGTWGKLTVLKGKIKFYELTENGDVVAEHIFTPESHIPFVEPQAWHRVEA
LSDDLECTLGFYCKKEDYFSKKYNMTAIHGDVVDAAKIISPCKVLDLGCGQGRNSLYLSLLGYDVTSWDHNENSIAFLNE
TKEKENLNISTALYDINAANIQENYDFIVSTVVFMFLNRERVPSIIKNMQEHTNVGGYNLIVAAMSTDDVPCPLPFSFTF
AENELKEYYKDWEFLEYNENMGELHKTDENGNRIKMKFATMLARKK
>P45134 2.1.1.-~~~tehB~~~Probable S-adenosyl-L-methionine-dependent methyltransferase TehB~~~COG0500
MKNELICYKQMPVWTKDNLPQMFQEKHNTKVGTWGKLTVLKGKLKFYELTENGDVIAEHIFTPESHIPFVEPQAWHRVEA
LSDDLECTLGFYCKKEDYFSKKYNTTAIHGDVVDAAKIISPCKVLDLGCGQGRNSLYLSLLGYDVTSWDHNENSIAFLNE
TKEKENLNISTALYDINAANIQENYDFIVSTVVFMFLNRERVPSIIKNMKEHTNVGGYNLIVAAMSTDDVPCPLPFSFTF
AENELKEYYKDWEFLEYNENMGELHKTDENGNRIKMKFATMLARKK
>P60108 ~~~~~~TelA-like protein SA1238~~~
MTENKSFKESHPLDDFISDKELSNTTIQKEKLTIEQQKQVDTISKQINPLDNEGLLAFGSDLQKQMSQFSHQMLDEVQSK
DVGPIGDTLSDLMSKLKSVNPNELNTDKPSMLKRIFSRAKSSINEIFSRMQSVSAQVDRITIQLQKHQTHLTRDIELLDT
LYDKNKQYFDDLSLHIIAAQQKKLQLENEKLPQLQQQAQQSTNQMDIQQVSDMQQFIDRLDKRIYDLQLSRQIALQTAPQ
IRMIQNVNQALAEKIQSSILTSIPLWKNQMAIALTLMRQRNAVAAQRAVTDTTNDLLTANAEMLKQNAIETATENERGIV
DLDTLKRTQRNIIETIEETLIIQQHGREERQLAEKELQQLEQDLKSHLVNIKGPNKQS
>P25052 3.5.99.2~~~tenA~~~Aminopyrimidine aminohydrolase~~~COG0819
MKFSEECRSAAAEWWEGSFVHPFVQGIGDGTLPIDRFKYYVLQDSYYLTHFAKVQSFGAAYAKDLYTTGRMASHAQGTYE
AEMALHREFAELLEISEEERKAFKPSPTAYSYTSHMYRSVLSGNFAEILAALLPCYWLYYEVGEKLLHCDPGHPIYQKWI
GTYGGDWFRQQVEEQINRFDELAENSTEEVRAKMKENFVISSYYEYQFWGMAYRKEGWSDSAIKEVEECGASRHNG
>Q9K9G8 3.5.99.2~~~tenA~~~Aminopyrimidine aminohydrolase~~~COG0819
MSFAASLYEKAQPIWEAGYNHPFVQGIGDGSLEKSKFQFFMKQDYLYLIDYARLFALGTLKGNDLQTMSTFSKLLHATLN
VEMDLHRAYAKRLGISAEELEAIEPAATTLAYTSYMLNVAQRGSLLDLIAAVLPCTWSYYEIGVKLKGIPGASDHPFYGE
WIKLYASDEFKELADWLIQMLDEEAKGLSSKEKAKLETIFLTTSRLENEFWDMAYNERMWNYNG
>A8KRL3 3.5.99.2~~~tenA~~~Aminopyrimidine aminohydrolase~~~COG0819
MQVSQYLYQNAQSIWGDCISHPFVQGIGRGTLERDKFRFYIIQDYLFLLEYAKVFALGVVKACDEAVMREFSNAIQDILN
NEMSIHNHYIRELQITQKELQNACPTLANKSYTSYMLAEGFKGSIKEVAAAVLSCGWSYLVIAQNLSQIPNALEHAFYGH
WIKGYSSKEFQACVNWNINLLDSLTLASSKQEIEKLKEIFITTSEYEYLFWDMAYQS
>Q7A4F3 3.5.99.2~~~tenA~~~Aminopyrimidine aminohydrolase~~~
MEFSQKLYQAAKPIINDIYEDDFIQKMLSGDIGADALRHYLKADAAYLKEFTNLYALLIPKMNSMNDVKFLVEQIEFMVE
GEVLAHDILAQIVGESYEEIIKTKVWPPSGDHYIKHMYFQAHSRENAIYTIAAMAPCPYIYAELAKRSQSDHKLNREKDT
AKWFDFYSTEMDDIINVFEALMNKLAESMSDKELEQVKQVFLESCIHERRFFNMAMTLEQWEFGGKVND
>Q6GEY1 3.5.99.2~~~tenA~~~Aminopyrimidine aminohydrolase~~~
MEFSQKLYQAAKPIINDIYEDDFIQKMLLGNIQADALRHYLQADAAYLKEFTNLYALLIPKMNSMNDVKFLVEQIEFMVE
GEVLAHDILAQIVGESYEEIIKTKVWPPSGDHYIKHMYFQAHSRENAIYTIAAMAPCPYIYAELAKRSQSDHKLNREKDT
AKWFDFYSTEMDDIINVFESLMNKLAESMSDKELEQVKQVFLESCIHERRFFNMAMTLEQWEFGGKVND
>Q8CNK1 3.5.99.2~~~tenA~~~Aminopyrimidine aminohydrolase~~~COG0819
MTFSKELREASRPIIDDIYNDGFIQDLLAGKLSNQAVRQYLRADASYLKEFTNIYAMLIPKMSSMEDVKFLVEQIEFMLE
GEVEAHEVLADFINEPYEEIVKEKVWPPSGDHYIKHMYFNAFARENAAFTIAAMAPCPYVYAVIGKRAMEDPKLNKESVT
SKWFQFYSTEMDELVDVFDQLMDRLTKHCSETEKKEIKENFLQSTIHERHFFNMAYINEKWEYGGNNNE
>P25053 5.3.99.10~~~tenI~~~Thiazole tautomerase~~~COG0352
MELHAITDDSKPVEELARIIITIQNEVDFIHIRERSKSAADILKLLDLIFEGGIDKRKLVMNGRVDIALFSTIHRVQLPS
GSFSPKQIRARFPHLHIGRSVHSLEEAVQAEKEDADYVLFGHVFETDCKKGLEGRGVSLLSDIKQRISIPVIAIGGMTPD
RLRDVKQAGADGIAVMSGIFSSAEPLEAARRYSRKLKEMRYEKAL
>Q99171 ~~~tepA~~~Translocation-enhancing protein TepA~~~COG0740
MDHRMENTEEERPEKNDAKDSIMNKIQQLGETTLPQLPQDTNIHCLTIIGQIEGHVQLPPQNKTTKYEHVIPQIVAIEQN
PKIEGLLIILNTVGGDVEAGLAIAEMLASLSKPTVSIVLGGGHSIGVPIAVSCDYSYIAETATMTIHPVRLTGLVIGVPQ
TFEYLDKMQERVVKFVTSHSNITEEKFKELMFSKGNLTRDIGTNVVGKDAVKYGLIDHAGGVGQAINKLNELIDEARKEE
GRMIQ
>P33007 ~~~terPB~~~Terpredoxin~~~
MPRVVFIDEQSGEYAVDAQDGQSLMEVATQNGVPGIVAECGGSCVCATCRIEIEDAWVEIVGEANPDENDLLQSTGEPMT
AGTRLSCQVFIDPSMDGLIVRVPLPA
>P0DX03 ~~~terW~~~Probable tellurium resistance transcriptional regulator TerW~~~
MQLNTRQARIFKLANLLGTGKPVSAADIITSLECSEPTLTRALKELRESYSAEIKYSKAGHSYHLVNPGQLDKKTLRRMN
EALAQNAELKTGESTGKVVLDKDKKTAVSLSLRMRILRKIDRLAALSGSTRSEAVEKLALHSVDELIKEYSAKKS
>P0ADA1 3.1.2.2~~~tesA~~~Thioesterase 1/protease 1/lysophospholipase L1~~~COG2755
MMNFNNVFRWHLPFLFLVLLTFRAAAADTLLILGDSLSAGYRMSASAAWPALLNDKWQSKTSVVNASISGDTSQQGLARL
PALLKQHQPRWVLVELGGNDGLRGFQPQQTEQTLRQILQDVKAANAEPLLMQIRLPANYGRRYNEAFSAIYPKLAKEFDV
PLLPFFMEEVYLKPQWMQDDGIHPNRDAQPFIADWMAKQLQPLVNHDS
>B2HIN3 3.1.2.-~~~tesA~~~Thioesterase TesA~~~COG3208
MNGRSSNSKSDEKLTAPTLYIFPHAGGTAKDYVPFAKEFSGEVKRVAVQYPGQQDGYGLPPLESIPGLAEEIFAIMKPAA
RIDTPVALFGHSMGGMLAFEVALRFEAAGYRVLALFLSACSAPGHIKYKQLKGYSDNEMLDLVARATGTDPEFFNDEEFR
VGVLPTLRAVRAIAGYSCPPENKLSCPIYTFIGSKDWIATREDMEPWRERTTGDFSLREFPGDHFYLNKNLPELVSDIEI
GTLQQFDQI
>P9WQD5 3.1.2.-~~~tesA~~~Thioesterase TesA~~~COG3208
MLARHGPRYGGSVNGHSDDSSGDAKQAAPTLYIFPHAGGTAKDYVAFSREFSADVKRIAVQYPGQHDRSGLPPLESIPTL
ADEIFAMMKPSARIDDPVAFFGHSMGGMLAFEVALRYQSAGHRVLAFFVSACSAPGHIRYKQLQDLSDREMLDLFTRMTG
MNPDFFTDDEFFVGALPTLRAVRAIAGYSCPPETKLSCPIYAFIGDKDWIATQDDMDPWRDRTTEEFSIRVFPGDHFYLN
DNLPELVSDIEDKTLQWHDRA
>P0AGG2 3.1.2.20~~~tesB~~~Acyl-CoA thioesterase 2~~~COG1946
MSQALKNLLTLLNLEKIEEGLFRGQSEDLGLRQVFGGQVVGQALYAAKETVPEERLVHSFHSYFLRPGDSKKPIIYDVET
LRDGNSFSARRVAAIQNGKPIFYMTASFQAPEAGFEHQKTMPSAPAPDGLPSETQIAQSLAHLLPPVLKDKFICDRPLEV
RPVEFHNPLKGHVAEPHRQVWIRANGSVPDDLRVHQYLLGYASDLNFLPVALQPHGIGFLEPGIQIATIDHSMWFHRPFN
LNEWLLYSVESTSASSARGFVRGEFYTQDGVLVASTVQEGVMRNHN
>Q83VZ5 4.2.1.132~~~tesE~~~2-hydroxyhexa-2,4-dienoate hydratase~~~
MSDKQFIETQGQRLYEALRSARTLAPLTDNHPEMTVEDAYHISLHMLRLREASGERVIGKKIGVTSKPVQDMLNVHQPDF
GFLTDSMEYEDGAAVSLKAAGLIQPRAEGEIAFMLKKDLQGPGVTREDVLAATEWVAPCFEIVDSRINDWKIKIQDTVAD
NASCGVFVIGKQHTDPASLDLAAAAMQMSKNGQPAGSGLGSAVQGHPAEAVAWLANTLGAFGIPFKAGEVILSGSLAPLV
PAAAGDRFDMVIEGMGTCSIQFTE
>Q47810 ~~~tetM~~~Tetracycline resistance protein TetM from transposon TnFO1~~~
MKIINIGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGITSFQWENTKVNIIDTPGHMD
FLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHALRKMGIPTIFFINKIDQNGIDLSTVYQDIKEKLSAEIVIKQKVEL
YPNVCVTNFTESEQWDTVIEGNDDLLEKYMSGKSLEALELEQEESIRFQNCSLFPLYHGSAKSNIGIDNLIEVITNKFYS
STHRGPSELCGNVFKIEYTKKRQRLAYIRLYSGVLHLRDSVRVSEKEKIKVTEMYTSINGELCKIDRAYSGEIVILQNEF
LKLNSVLGDTKLLPQRKKIENPHPLLQTTVEPSKPEQREMLLDALLEISDSDPLLRYYVDSTTHEIILSFLGKVQMEVIS
ALLQEKYHVEIELKEPTVIYMERPLKNAEYTIHIEVPPNPFWASIGLSVSPLPLGSGMQYESSVSLGYLNQSFQNAVMEG
IRYGCEQGLYGWNVTDCKICFKYGLYYSPVSTPADFRMLAPIVLEQVLKKAGTELLEPYLSFKIYAPQEYLSRAYNDAPK
YCANIVDTQLKNNEVILSGEIPARCIQEYRSDLTFFTNGRSVCLTELKGYHVTTGEPVCQPRRPNSRIDKVRYMFNKIT
>A0A059WYP6 1.14.13.-~~~~~~Flavin-dependent monooxygenase~~~
MTKHIKILVIGVGVAGPAVAYWLKRFGFSPVLIEKSAAVRKGGQALDIRGIATHIAKEMGIYDQICNMRTQIKCGRYVDV
KGNVLHEEQGETFGFRQDDEVEILRGDLVEILMKAIADIPCEFKQSVIKIEQNEDSVTVTYKDGRVENYDLVIAADGIHS
ATRGMVFSKNEYQLINLGSYVSAFTIPNYLGLDHMELLCESNHKLVTLQSDSQADKAMAGFMFRSKHVLEDIRDEQEQKH
FLHASFQNFGWETQNILNRMPESDDFYFDAITQIKMKSWTKGRIALIGDAAYCPSPLSGQGNNLAFVGAYILAGELKKAD
GDYIQAFTRYNELLHPFVEANQQFGVWVSESFLLKDDEVSKEIAEARSNKILAMIKSVSNSINLPQYE
>A0A0H4TXY1 1.14.13.-~~~~~~Flavin-dependent monooxygenase~~~
MPHTKKILVIGASIAGPALCYWLNHYGFQPTLVEKNQSTRKGGYAIDLRGIAVDVAKQMGIYDSVCAMRTSLQCVRYVDA
AGNLLFEEHGEKGGFRQGDEVEIVRGDLVDILMKTITDIPCFYDHAIESLTQHDDHVTVQFKNGKTENYDLVIAADGLHS
ATRRMVFSKDDYHLRNLGCYISVFSIPNYLQLDHCETLLEAKQKLVSITSDKDSTKAFAGFMFRSSNSPNYIRDEASQKD
FLRENFTNHGWESNKLLSLMNDANDFYFDAIMQVKMKDWTKGRIALVGDAGYTPSPLSGQGTSLALVGAYILAGELKTAT
DHVAAFARYNELLKPYVEANQAFGVWVSESFLADEPLSAEQAEERNNIVLGIMKKATHAIELPEY
>D3HKY4 1.14.13.-~~~~~~Flavin-dependent monooxygenase~~~COG0654
MSKNIKILVIGAGVAGPAVCYWLRRFGFSPVLIEKYASIRKGGQALDVRGIATHIAREMGIYDQICEMRTRIERGRFVDS
SGKVLHEEQGEKFGFRQDDEVEILRGDLVEILMKTIADVPCYFNQSIISIEQNADNVTVIFMDGRIEQYDLVIAADGIHS
AIRRMIFEKNEYQLIHLGAYLSTFTIPNYLGLSHIDLECEANNKLVSINSDNNPEIARAGFMFRSQHLLNDIRDEQEQKQ
FLRDTFRDFGWETQNILNRMPESNDFYFDAITQVKMNSWTKGRIALVGDAGYCPSPLSGQGNNLAFVGAYILAGELKVAN
GNYTRAFTRYNALLRSFVDANQKFGVWVSESFLVKDEVSKEIAEERSNKILAMIKSISNGITLPQYESS
>P21598 ~~~tetM~~~Tetracycline resistance protein TetM from transposon Tn916~~~
MKIINIGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGITSFQWENTKVNIIDTPGHMD
FLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHALRKMGIPTIFFINKIDQNGIDLSTVYQDIKEKLSAEIVIKQKVEL
YPNVCVTNFTESEQWDTVIEGNDDLLEKYMSGKSLEALELEQEESIRFQNCSLFPLYHGSAKSNIGIDNLIEVITNKFYS
STHRGPSELCGNVFKIEYTKKRQRLAYIRLYSGVLHLRDSVRVSEKEKIKVTEMYTSINGELCKIDRAYSGEIVILQNEF
LKLNSVLGDTKLLPQRKKIENPHPLLQTTVEPSKPEQREMLLDALLEISDSDPLLRYYVDSTTHEIILSFLGKVQMEVIS
ALLQEKYHVEIEITEPTVIYMERPLKNAEYTIHIEVPPNPFWASIGLSVSPLPLGSGMQYESSVSLGYLNQSFQNAVMEG
IRYGCEQGLYGWNVTDCKICFKYGLYYSPVSTPADFRMLAPIVLEQVLKKAGTELLEPYLSFKIYAPQEYLSRAYNDAPK
YCANIVDTQLKNNEVILSGEIPARCIQEYRSDLTFFTNGRSVCLTELKGYHVTTGEPVCQPRRPNSRIDKVRYMFNKIT
>P28815 ~~~tetC~~~Transposon Tn10 TetC protein~~~
MENKNHQQENFKSTYQSLVNSARILFVEKGYQAVSIDEISGKALVTKGAFYHHFKNKKQLLSACYKQQLIMIDAYITTKT
DLTNGWSALESIFEHYLDYIIDNNKNLIPIQEVMPIIGWNELEKISLEYITGKVNAIVSKLIQENQLKAYDSDVLKNLLN
GWFMHIAIHAKNLKELADKKGQFIAIYRGFLLSLKDK
>P28816 ~~~tetD~~~Transposon Tn10 TetD protein~~~
MYIEQHSRYQNKANNIQLEYDDRQFHTTVIKDVLLWIEHNLDQSLLLDDVANKAGYTKWYFQRLFKKVTGVTLASYIRAR
RLTKAAVELRLTKKTILEIALKYQFDSQQSFTRRFKYIFKVTPSYYRRNKLWELEAMH
>P10952 ~~~tetO~~~Tetracycline resistance protein TetO~~~
MKIINLGILAHVDAGKTTLTESLLYTSGAIAELGSVDEGTTRTDTMNLERQRGITIQTAVTSFQWEDVKVNIIDTPGHMD
FLAEVYRSLSVLDGAVLLVSAKDGIQAQTRILFHALQIMKIPTIFFINKIDQEGIDLPMVYREMKAKLSSEIIVKQKVGQ
HPHINVTDNDDMEQWDAVIMGNDELLEKYMSGKPFKMSELEQEENRRFQNGTLFPVYHGSAKNNLGTRQLIEVIASKFYS
STPEGQSELCGQVFKIEYSEKRRRFVYVRIYSGTLHLRDVIRISEKEKIKITEMYVPTNGELYSSDTACSGDIVILPNDV
LQLNSILGNEILLPQRKFIENPLPMIQTTIAVKKSEQREILLGALTEISDCDPLLKYYVDTTTHEIILSFLGNVQMEVIC
AILEEKYHVEAEIKEPTVIYMERPLRKAEYTIHIEVPPNPFWASVGLSIEPLPIGSGVQYESRVSLGYLNQSFQNAVMEG
VLYGCEQGLYGWKVTDCKICFEYGLYYSPVSTPADFRLLSPIVLEQALKKAGTELLEPYLHFEIYAPQEYLSRAYHDAPR
YCADIVSTQIKNDEVILKGEIPARCIQEYRNDLTYFTNGQGVCLTELKGYQPAIGKFICQPRRPNSRIDKVRHMFHKLA
>P03038 ~~~tetR~~~Tetracycline repressor protein class A from transposon 1721~~~
MTKLQPNTVIRAALDLLNEVGVDGLTTRKLAERLGVQQPALYWHFRNKRALLDALAEAMLAENHTHSVPRADDDWRSFLI
GNARSFRQALLAYRDGARIHAGTRPGAPQMETADAQLRFLCEAGFSAGDAVNALMTISYFTVGAVLEEQAGDSDAGERGG
TVEQAPLSPLLRAAIDAFDEAGPDAAFEQGLAVIVDGLAKRRLVVRNVEGPRKGDD
>P04483 ~~~tetR~~~Tetracycline repressor protein class B from transposon Tn10~~~
MSRLDKSKVINSALELLNEVGIEGLTTRKLAQKLGVEQPTLYWHVKNKRALLDALAIEMLDRHHTHFCPLEGESWQDFLR
NNAKSFRCALLSHRDGAKVHLGTRPTEKQYETLENQLAFLCQQGFSLENALYALSAVGHFTLGCVLEDQEHQVAKEERET
PTTDSMPPLLRQAIELFDHQGAEPAFLFGLELIICGLEKQLKCESGS
>P0ACT4 ~~~tetR~~~Tetracycline repressor protein class D~~~
MARLNRESVIDAALELLNETGIDGLTTRKLAQKLGIEQPTLYWHVKNKRALLDALAVEILARHHDYSLPAAGESWQSFLR
NNAMSFRRALLRYRDGAKVHLGTRPDEKQYDTVETQLRFMTENGFSLRDGLYAISAVSHFTLGAVLEQQEHTAALTDRPA
APDENLPPLLREALQIMDSDDGEQAFLHGLESLIRGFEVQLTALLQIVGGDKLIIPFC
>P21337 ~~~tetR~~~Tetracycline repressor protein class E~~~
MARLSLDDVISMALTLLDSEGLEGLTTRKLAQSLKIEQPTLYWHVRNKQTLMNMLSEAILAKHHTRSAPLPTESWQQFLQ
ENALSFRKALLVHRDGARLHIGTSPTPPQFEQAEAQLRCLCDAGFSVEEALFILQSISHFTLGAVLEEQATNQIENNHVI
DAAPPLLQEAFNIQARTSAEMAFHFGLKSLIFGFSAQLDEKKHTPIEDGNK
>P51560 ~~~tetR~~~Tetracycline repressor protein class G~~~
MTKLDKGTVIAAGLELLNEVGMDSLTTRKLAERLKVQQPALYWHFQNKRALLDALPEAMLRERHTRSLPEENEDWRVFLK
ENALSFRTALLSYRDGARIHAGTRPTEPNFGTAETQIRFLCAEGFCPKRAVWALRAVSHYVVGSVLEQQASDADERVPDR
PDVSEQAPSSFLHVLFHELETDGMDAAFNFGLDSLIAGFERLRAAVLATD
>P51561 ~~~tetR~~~Tetracycline repressor protein class H~~~
MAKLDKEQVIDDALILLNEVGIEGLTTRNVAQKIGVEQPTLYWHVKNKRALLDALAETILQKHHHHVLPLPNETWQDFLR
NNAKSFRQALLMYRDGGKIHAGTRPSESQFETSEQQLQFLCDAGFSLSQAVYALSSIAHFTLGSVLETQEHQESQKEREK
VETDTVAYPPLLTQAVAIMDSDNGDAAFLFVLDVMISGLETVLKSAK
>P51562 ~~~tetR~~~Tetracycline repressor protein class H~~~
MARLNRESVIDAALELLNETGIDGLTTRKLAQKLGIEQPTLYWHVKNKRALLDALAVEILARHHDYSLPAAGESWQSFLR
NNAMSFRRALLRYRDGAKVHLGTRPDEKQYDTVETQLRFMTENGFSLRDGLYAISAVSHFTLGAVLEQQEHTAALTDRPA
APDENLPPLLREALQIMDSDDGEQAFLHGLESLIRGFEVQLTALLQIVGGDKLIIPFC
>B1MM06 ~~~tetR~~~DNA-binding transcriptional repressor TetR~~~
MVARGETRSAALLAVALQVLIRDGYDRFSMDSVAALAHASKTTIYRRWSNKAELIKAALDAHDASFNDEVFDTGGLRTDL
IATMGMLRRKAQALPPTLYPDLIRAMEHDETLSDAIRRHLADPGLSPFDAPLSRAVGRGEAGVDVDRQLIHDVAEAMLTH
RLTLGGPLDDAFIGRLVDDVLLVLIRPGAR
>Q01911 1.14.13.231~~~tetX~~~Flavin-dependent monooxygenase~~~
MTMRIDTDKQMNLLSDKNVAIIGGGPVGLTMAKLLQQNGIDVSVYERDNDREARIFGGTLDLHKGSGQEAMKKAGLLQTY
YDLALPMGVNIADKKGNILSTKNVKPENRFDNPEINRNDLRAILLNSLENDTVIWDRKLVMLEPGKKKWTLTFENKPSET
ADLVILANGGMSKVRKFVTDTEVEETGTFNIQADIHQPEINCPGFFQLCNGNRLMASHQGNLLFANPNNNGALHFGISFK
TPDEWKNQTQVDFQNRNSVVDFLLKEFSDWDERYKELIHTTLSFVGLATRIFPLEKPWKSKRPLPITMIGDAAHLMPPFA
GQGVNSGLVDALILSDNLADGKFNSIEEAVKNYEQQMFMYGKEAQEESTQNEIEMFKPDFTFQQLLNV
>Q93L51 1.14.13.231~~~tetX2~~~Flavin-dependent monooxygenase~~~
MTMRIDTDKQMNLLSDKNVAIIGGGPVGLTMAKLLQQNGIDVSVYERDNDREARIFGGTLDLHKGSGQEAMKKAGLLQTY
YDLALPMGVNIADEKGNILSTKNVKPENRFDNPEINRNDLRAILLNSLENDTVIWDRKLVMLEPGKKKWTLTFENKPSET
ADLVILANGGMSKVRKFVTDTEVEETGTFNIQADIHQPEINCPGFFQLCNGNRLMASHQGNLLFANPNNNGALHFGISFK
TPDEWKNQTQVDFQNRNSVVDFLLKEFSDWDERYKELIHTTLSFVGLATRIFPLEKPWKSKRPLPITMIGDAAHLMPPFA
GQGVNSGLVDALILSDNLADGKFNSIEEAVKNYEQQMFIYGKEAQEESTQNEIEMFKPDFTFQQLLNV
>P04958 3.4.24.68~~~tetX~~~Tetanus toxin~~~
MPITINNFRYSDPVNNDTIIMMEPPYCKGLDIYYKAFKITDRIWIVPERYEFGTKPEDFNPPSSLIEGASEYYDPNYLRT
DSDKDRFLQTMVKLFNRIKNNVAGEALLDKIINAIPYLGNSYSLLDKFDTNSNSVSFNLLEQDPSGATTKSAMLTNLIIF
GPGPVLNKNEVRGIVLRVDNKNYFPCRDGFGSIMQMAFCPEYVPTFDNVIENITSLTIGKSKYFQDPALLLMHELIHVLH
GLYGMQVSSHEIIPSKQEIYMQHTYPISAEELFTFGGQDANLISIDIKNDLYEKTLNDYKAIANKLSQVTSCNDPNIDID
SYKQIYQQKYQFDKDSNGQYIVNEDKFQILYNSIMYGFTEIELGKKFNIKTRLSYFSMNHDPVKIPNLLDDTIYNDTEGF
NIESKDLKSEYKGQNMRVNTNAFRNVDGSGLVSKLIGLCKKIIPPTNIRENLYNRTASLTDLGGELCIKIKNEDLTFIAE
KNSFSEEPFQDEIVSYNTKNKPLNFNYSLDKIIVDYNLQSKITLPNDRTTPVTKGIPYAPEYKSNAASTIEIHNIDDNTI
YQYLYAQKSPTTLQRITMTNSVDDALINSTKIYSYFPSVISKVNQGAQGILFLQWVRDIIDDFTNESSQKTTIDKISDVS
TIVPYIGPALNIVKQGYEGNFIGALETTGVVLLLEYIPEITLPVIAALSIAESSTQKEKIIKTIDNFLEKRYEKWIEVYK
LVKAKWLGTVNTQFQKRSYQMYRSLEYQVDAIKKIIDYEYKIYSGPDKEQIADEINNLKNKLEEKANKAMININIFMRES
SRSFLVNQMINEAKKQLLEFDTQSKNILMQYIKANSKFIGITELKKLESKINKVFSTPIPFSYSKNLDCWVDNEEDIDVI
LKKSTILNLDINNDIISDISGFNSSVITYPDAQLVPGINGKAIHLVNNESSEVIVHKAMDIEYNDMFNNFTVSFWLRVPK
VSASHLEQYGTNEYSIISSMKKHSLSIGSGWSVSLKGNNLIWTLKDSAGEVRQITFRDLPDKFNAYLANKWVFITITNDR
LSSANLYINGVLMGSAEITGLGAIREDNNITLKLDRCNNNNQYVSIDKFRIFCKALNPKEIEKLYTSYLSITFLRDFWGN
PLRYDTEYYLIPVASSSKDVQLKNITDYMYLTNAPSYTNGKLNIYYRRLYNGLKFIIKRYTPNNEIDSFVKSGDFIKLYV
SYNNNEHIVGYPKDGNAFNNLDRILRVGYNAPGIPLYKKMEAVKLRDLKTYSVQLKLYDDKNASLGLVGTHNGQIGNDPN
RDILIASNWYFNHLKDKILGCDWYFVPTDEGWTND
>B1MM05 1.14.13.231~~~tetX~~~Flavin-dependent monooxygenase~~~
MTVVIAGAGPTGLTLACELTRRGIACRVLDKAPDLFPGSRGKGLSPRTQEVFDDLGIAPAINSGGMAMPPFRIYAGHEVV
AERSLVEMLGTDIPSGPGIPYPGFWLVPQWRTDEILLDRLRQFGGDVEFNCEVVGFTQRSDAVSVMVSQGGAPELLHASY
LVGADGGRSTVRKMLGVGFAGETFERERTLIGDVRADGLEGSFCHVLTRDGQVSERFSLWNLPGSEHYQFVANMATEDVP
ALTLDAVQKLVVDRSGRDDIVLRDLRWISLYRVNARMVDRFRVGRVILAGDAAHVHSSAGGQGLNTSVQDAYNLGWKLAA
VIYGAPEKLLDTYEEERMPVAASVLGLSTDLHHRNFAPAKGPAPQLHQMDITYRGCSLAVDDRVFQGNLRAGDRAPDALL
DNGVRLFDVLRGTHFTLLTFGAQAPVIADVCIQQMTPSPDYDVTATTLVLVRPDGYIGVMTESGRTVLEYLARVV
>Q06DK7 1.14.13.231~~~tet(X)~~~Flavin-dependent monooxygenase~~~
MTMRIDTDKQMNLLSDKNVAIIGGGPVGLTMAKLLQQNGIDVSVYERDNDREARIFGGTLDLHKGSGQEAMKKAGLLQTY
YDLALPMGVNIADKKGNILSTKNVKPENRFDNPEINRNDLRAILLNSLENDTVIWDRKLVMLEPGKKKWTLTFENKPSET
ADLVILANGGMSKVRKFVTDTEVEETGTFNIQADIHQPEINCPGFFQLCNGNRLMASHQGNLLFANPNNNGALHFGISFK
TPDEWKNQTQVDFQNRNSVVDFLLKEFSDWDERYKELIHTTLSFVGLATRIFPLEKPWKSKRPLPITMIGDAAHLMPPFA
GQGVNSGLVDALILSDNLADGKFNSIEEAVKNYEQQMFMYGKEAQEESTQNEIEMFKPDFTFQQLLNV
>P09153 ~~~tfaE~~~Prophage tail fiber assembly protein homolog TfaE~~~COG2110
MHKAILNSDLIATKAGDVTVYNYDGETREYISTSNEYLAVGVGIPACSCLDAPGTHKAGYAICRSADFNSWEYVPDHRGE
TVYSTKTGESKEIKAPGDYPENTTTIAPLSPYDKWDGEKWVTDTEAQHSAAVDAAEAQRQSLIDAAMASISLIQLKLQAG
RKLTQAETTRLNAVLDYIDAVTATDTSTAPDVIWPELPEA
>P10088 1.14.11.-~~~tfdA~~~Alpha-ketoglutarate-dependent 2,4-dichlorophenoxyacetate dioxygenase~~~
MSVVANPLHPLFAAGVEDIDLREALGSTEVREIERLMDEKSVLVFRGQPLSQDQQIAFARNFGPLEGGFIKVNQRPSRFK
YAELADISNVSLDGKVAQRDAREVVGNFANQLWHSDSSFQQPAARYSMLSAVVVPPSGGDTEFCDMRAAYDALPRDLQSE
LEGLRAEHYALNSRFLLGDTDYSEAQRNAMPPVNWPLVRTHAGSGRKFLFIGAHASHVEGLPVAEGRMLLAELLEHATQR
EFVYRHRWNVGDLVMWDNRCVLHRGRRYDISARRELRRATTLDDAVV
>Q8KN28 1.14.13.20~~~tfdB~~~2,4-dichlorophenol 6-monooxygenase~~~
MNEKATVIETDVLVVGSGPAGAASTLLLATYGVKTLCVSKYATTSRTPRSHITNQRTMEVMRDLGLELECEAMASPAELM
GENVYCTSLVGDELGRVLTWGTHPQRRADYELASPTHMCDLPQNLLEPIMINHAARRGADVRFHTEFVSLKQDETGVTAT
VRDHLLDRQYDIRAKYLIGADGANSQVVDQVGLPMEGKMGVSGSINVVFEADLTKYVGHRPSVLYWVIQPGSSVGGLGIG
VIRMVRPWNKWLCIWGYDIAGGPPDLNEAHARQIVHSLLGDSTIPVKIESTSTWTVNDMYATRLFDNRVFCMGDAVHRHP
PTNGLGSNTSIQDAFNLCWKLSHVLQGKAGPELLATYNEERAPVARQVVQRANKSLGDFPPILAALGLFDTKDPEQMQRN
IARLKEQSPEAQEQRAALRAAIDGTQYVYNAHGVEMNQRYQSAAIVPDGTPDPGFRRDSELYHAHSGRPGAPVPHVWVTR
HGRRVSTLDLCGKGRFSLLSGIAGSPWVEAAVHAAESLGIDLDVHIIGPGQELEDLYGDFARVREIEESGALLVRPDNFI
CWRAMRWQEGSGDELRAALKRVLSVH
>P05404 5.5.1.7~~~tfdDI~~~Chloromuconate cycloisomerase~~~
MKIDAIEAVIVDVPTKRPIQMSITTVHQQSYVIVRVYSEGLVGVGEGGSVGGPVWSAECAETIKIIVERYLAPHLLGTDA
FNVSGALQTMARAVTGNASAKAAVEMALLDLKARALGVSIAELLGGPLRSAIPIAWTLASGDTKRDLDSAVEMIERRRHN
RFKVKLGFRSPQDDLIHMEALSNSLGSKAYLRVDVNQAWDEQVASVYIPELEALGVELIEQPVGRENTQALRRLSDNNRV
AIMADESLSTLASAFDLARDRSVDVFSLKLCNMGGVSATQKIAAVAEASGIASYGGTMLDSTIGTSVALQLYSTVPSLPF
GCELIGPFVLADTLSHEPLEIRDYELQVPTGVGHGMTLDEDKVRQYARVS
>Q9RNZ9 5.5.1.7~~~tfdD~~~Chloromuconate cycloisomerase~~~
MKIEAISTTIVDVPTRRPLQMSFTTVHKQSYVIVQVTAGGLVGIGEGGSVGGPTWGSESAETIKVIIDNYLAPLLIGKDA
SNLSEARALMDRAVTGNLSAKAAIDIALHDLKARALNLSIADLIGGTMRKSIPIAWTLASGDTARDIDSALEMIEARRHN
RFKVKLGARTPAQDLEHIRSIVKAVGDKASVRVDVNQGWDEQTASIWIPRLEEAGVELVEQPVPRANFGALRRLTEQNGV
AILADESLSSLSSAFELARDRAVDAFSLKLCNMGGIANTLKVAAIAEAAGISSYGGTMLDSTVGTAAALHVYATLPSLPY
GCELIGPWVLSDRLTQQDLEIKDFEVHLPVGSGLGVDLDHDKVRHYTRAA
>P94135 1.3.1.32~~~tfdFII~~~Maleylacetate reductase 2~~~
MTGDLNEFVAHFWPVRVVFGAGSTERIPAEVKRLGARRALVLCTPDQRDLAQRVLGDLGDLGAGFHDGAVMHVPEASVTR
AAQAARDADADLLVAVGGGSTIGLAKALALHHGMRFVALPTTYAGSEMTPIWGLTADGAKRTGRDPRVLPSTVLYDPHHL
TSLPPEVTGPSGMNAIAHAVESMYAPDRNPITMLLAEESIRAMAQGLPVAVDSPGDLDARTRTLYAAWLAGTVLGMVSMG
LHHKLCHVLGGRFNLPHAPMHAVLLPHVAAFNEVAAPAELGRVAAALGAPGPGGAGAALHALLRFTCTERSLAAIGMPAQ
GIYDAAEHALADAYANPRQASREDIARLLRAAFTGEMPA
>Q93T12 1.3.1.32~~~tfdF~~~Maleylacetate reductase~~~
MNFIHDPLTPRVLFGAGRLQSLGEELKLLGIRRVLVISTPEQRELANQVAALIPGSVAGFFDRATMHVPSQIVDQAASVA
RELGVDSYVAPGGGSTIGLAKMLALHSSLPIVAIPTTYAGSEMTSIYGVTENELKKTGRDRRVLARTVIYDPELTFGLPT
GISVTSGLNAIAHAVEGLYAPEVNPILAIMAQQGIAALAKSIPTIRSAPTDLEARSQAQYGAWLCGSVLGNVSMALHHKL
CHTLGGTFNLPHAETHTVVLPHALAYNTPAIPRANAWLQEALATREPAQALFDLAKSNGAPVSLQSIGMKEADLDRACEL
VMSAQYPNPRPLEKHAIANLLRRAYLGEPPQP
>Q46M57 ~~~tfdR~~~HTH-type transcriptional regulator TdfR~~~
MEFRQLRYFVAAAEEGNVGAAARRLHISQPPVTRQIHALEQHLGVLLFERSARGVQLTPAGAAFLEDARRMLELGRTSVD
RSRAASRGEIGQLDIGYLGTAIYQTVPALLHAFTQAVPGATLSLALMPKVRQIEALRAGTIHLGVGRFYPQEPGITVEHL
HYERLYIAAGSSIARQLRQDPTLLRLKSESLVLFPKEGRPSFADEVIALMRRAGVEPRVTAIVEDVNAALGLVAAGAGVT
LVPASVAAIRRPFVRTMEMADASDKVPVSLTYLTDSRVPVLRAFLDVARRGKGQK
>Q46M54 ~~~tfdS~~~HTH-type transcriptional regulator TfdS~~~
MEFRQLRYFVAAAEEGNVGAAARRLHISQPPVTRQIHALEQHLGVLLFERSARGVQLTPAGAAFLEDARRMLELGRTSVD
RSRAASRGEIGQLDIGYLGTAIYQTVPALLHAFTQAVPGATLSLALMPKVRQIEALRAGTIHLGVGRFYPQEPGITVEHL
HYERLYIAAGSSIARQLRQDPTLLRLKSESLVLFPKEGRPSFADEVIALMRRAGVEPRVTAIVEDVNAALGLVAAGAGVT
LVPASVAAIRRPFVRTMEMADASAKVPVSLTYLTDSRVPVLRAFLDVARRGKGQK
>Q5E6F5 ~~~tfoX1~~~DNA transformation protein TfoX1~~~COG3070
MTGTGDVNLKIKEIEMSHLEERFFNFVKKLGRFNKRSMFGGVGLFNEDAMFSLVTDGRLYVRGGGEVDARFEELGCERFK
HVKKTTIATVNYFDVTELFEKKPASLLSIVQEAMENSRLEREFKKSKENRRLRDLPNMQLTLERMVKKAGVPDVDSFLEI
GAVDVFKKVNHTYGGDVDVKLLWKFAGAESGVHWTLLQEPAKQALLQQV
>P43779 ~~~tfoX~~~DNA transformation protein TfoX~~~COG3070
MNIKDEHIDSVCSLLDQLVGNVSFKNLFTGYGLFHKEETMFAIWQNKKLYLRGEGVLAIQLTKLGCEPFTTNELNKRFVL
SQYYALSDQILRSNRLCRKLIILSIKQILEQKLECTLRKLNRLKDLPNLTIKHERALIKVGITNVAMLREIGAENALVEL
KKSGSGATLDFYWKLVCALQNKNSQMLSQAEKERLLKKLNEVWRKNGLKGYRKLDDE
>O87008 1.5.1.37~~~tftC~~~NADH:FAD oxidoreductase~~~
MHAGEAVQQLKKAFETVASFDFRDALSKASTPVTVVATNGPFGLAGLTCSAVCSVCDRPPTVLLCINRKSYAAGIIKSNG
VLSVNWLAAGQAVISQTFAGVGSVPMEERFADKGWQTIATGAPYRMDAAVSFDCTIANIVDVGSHSVIFAEVVARNHAEE
CTPLIYHRRQYATTRSLAE
>O87009 1.14.14.-~~~tftD~~~FADH(2)-dependent monooxygenase TftD~~~
MRTGKQYLESLNDGRVVWVGNEKIDNVATHPLTRDYAERVAQFYDLHHRPDLQDVLTFVDADGVRRSRQWQDPKDAAGLR
VKRKYHETILREIAAGSYGRLPDAHNYTFTTYADDPEVWEKQSIGAEGRNLTQNIHNFLKLLREKDLNCPLNFVDPQTDR
SSDAAQARSPNLRIVEKTDDGIIVNGVKAVGTGIAFGDYMHIGCLYRPGIPGEQVIFAAIPTNTPGVTVFCRESTVKNDP
AEHPLASQGDELDSTTVFDNVFIPWEQVFHIGNPEHAKLYPQRIFDWVHYHILIRQVLRAELIVGLAILITEHIGTSKLP
TVSARVAKLVAFHLAMQAHLIASEETGFHTKGGRYKPNPLIYDFGRAHFLQNQMSVMYELLDLAGRSSLMIPSEGQWDDS
QSGQWFVKLNNGPKGNPRERVQIGRVIRDLYLTDWGGRQFMFENFNGTPLFAVFAATMTRDDMSAAGTYGKFASQVCGIE
FGGAEPTAYAATADYAKALDKGLAPEPAAAESATS
>P42723 ~~~tfxA~~~Trifolitoxin~~~
MDNKVAKNVEVKKGSIKATFKAAVLKSKTKVDIGGSRQGCVA
>P81453 2.3.2.13~~~~~~Protein-glutamine gamma-glutamyltransferase~~~
MRIRRRALVFATMSAVLCTAGFMPSAGEAAADNGAGEETKSYAETYRLTADDVANINALNESAPAASSAGPSFRAPDSDD
RVTPPAEPLDRMPDPYRPSYGRAETVVNNYIRKWQQVYSHRDGRKQQMTEEQREWLSYGCVGVTWVNSGQYPTNRLAFAS
FDEDRFKNELKNGRPRSGETRAEFEGRVAKESFDEEKGFQRAREVASVMNRALENAHDESAYLDNLKKELANGNDALRNE
DARSPFYSALRNTPSFKERNGGNHDPSRMKAVIYSKHFWSGQDRSSSADKRKYGDPDAFRPAPGTGLVDMSRDRNIPRSP
TSPGEGFVNFDYGWFGAQTEADADKTVWTHGNHYHAPNGSLGAMHVYESKFRNWSEGYSDFDRGAYVITFIPKSWNTAPD
KVKQGWP
>P40746 2.3.2.13~~~tgl~~~Protein-glutamine gamma-glutamyltransferase~~~
MIIVSGQLLRPQDIENWQIDQDLNPLLKEMIETPVQFDYHSIAELMFELKLRMNIVAAAKTLHKSGAKFATFLKTYGNTT
YWRVSPEGALELKYRMPPSKAIRDIAENGPFYAFECATAIVIIYYLALIDTIGEDKFNASFDRIILYDWHYEKLPIYTET
GHHFFLGDCLYFKNPEFDPQKAQWRGENVILLGEDKYFAHGLGILNGKQIIDKLNSFRKKGALQSAYLLSQATRLDVPSL
FRIVR
>Q6F9F5 1.5.1.36~~~tgnA~~~Flavin-dependent trigonelline monooxygenase, reductase component~~~COG1853
MSEMDAVNKIRELRDAFGSFMTGVTVVTTCKDDGTPLGFTANSFASVSLDPALLLVSIAKTSSNYHNFADASHFAINILA
EEQKDVSNIFARPSDDRFAQLVWAKSEYQNPLIDGVSAWFDCTTYQVVDAGDHAILIGKVENFTSAGFAGLGYYRGAYFT
PAKSSTDVISSMKVMMMALIGHENKILLEQTADHKWALPHLMVEKDGAEKALEKIFATYQPEASPSFIYSVYDDVTTQQQ
YIAFLCNTPVPTAHKGQFVDLNDLEKLTFTDSALQSMLMRYRKENYLKTYGVYYGNHTSGTVRQIVKEGV
>Q6F9F6 1.14.14.-~~~tgnB~~~Flavin-dependent trigonelline monooxygenase, oxygenase component~~~COG2141
MRFSLFVHMERVSDQQTQKQLYDEMIELCQIADRGGMHAIWTGEHHAMNFTIAPNPFLNIADLANKTKHVRLGTGTVVAP
FWHPIKLAGEAAMTDIISNGRLDIGIARGAYSFEYERMVPGMDAWSAGQRLREMIPAIKNLWKGDYEHNGEFWQFPKTTS
APQPLQQPNPPIWVAARDPNSHEFAVQNGCNVQVTPLHLGDEEVEKLMGHFNSACEKFQDIERPEIMLLRHTYVADSEED
AQVAANEMNVFYNYFGAWFKNEREINQGLIAPLSDEEIAAHPFYTPEAMRKNNVIGQAQEVIDRLKAYEAMGYDEYSFWI
DTGMSFERKKASLERMINEVMPAFSESKVDRRHATISAVY
>Q6F9F7 1.2.1.-~~~tgnC~~~(Z)-2-((N-methylformamido)methylene)-5-hydroxybutyrolactone dehydrogenase~~~COG1012
MQQFQLYINGKFEDGAAQFDSINPATGEIWAKMPEARTDQVNRAVDAAEQAFYDSSWSGLTASQRGKLLYKLADLVEKSA
PRLAALETTDTGKIIRETSSQIAYVAEYYRYYAGLADKIEGSFIPVDKPDMQAWLVREPVGVVAAIVPWNSQLFLSAVKV
GPALAAGCTVVLKASEEAPAPLLEFAKLIDEAGFPAGVVNVITGFGPECGAVLSAHPKVAHIAFTGGPETAKHIVRNSAE
NLAKVSLELGGKSPFIVFADTDINSALNAQIAAIFAATGQSCVAGSRLLIEESIKDEFLQRLAERVQSIKMGLPDDMQTE
YGPLCTLKQREKIQQVVQRSVEQGAKLITGGQVCDGAGYYYPPTILDCSGVSDAQSIHTELFGPVLSVDTFSTEAEAIQK
ANSTPYGLASGVFTSNLTRAHRMTRAIRSGIVWLNTYRVVSPLAPFGGYGLSGHGREGGLSAALEYTTTKTVWLRMSDQP
IDDPFVMR
>Q6F9F4 3.5.1.-~~~tgnD~~~(E)-2-((N-methylformamido)methylene)succinate hydrolase~~~COG2267
MISKTLQLSNNRTAHYFEQGEGEPLVLIHGVGMQAEAWYPQIEYFSKHYHVISLDMPGHGQSTALAADAQLQDFVDWAIE
CIHTLNLGPVNLAGHSMGSLITTGVSVTRPDLVKRMAVLNGVYKRTHAAREAVIQRAEALKQGHLDIETPLQRWFGQSEI
EKIASERVKLWLENVNMSGYTTAYRAFAQGDLVYADGWSDIECPALVLTGTDDPNSTAEMTIQMAHQAKHGTAIVIENER
HMVNLTAPEKVNQAMQAWLETTP
>Q6F9G0 1.2.1.24~~~tgnE~~~Succinate semialdehyde dehydrogenase~~~COG1012
MDLQQQPLFRQKALVAGQWCDADNAEKTPIFNPATQELIGYVPNMGRAETERAIEAAYASWEMWKTKTAKERSALLKKWY
DLILLNLDVLAEILTTEQGKPFNEAKGEIIYAASFIEWFAEEAKRIYGDIIPSPYPDARIVVNKQPIGVVAAITPWNFPA
AMITRKVAPALAAGCPCIVKPAPETPFTALALADLAIQAGIPAEIMSVVTGDAAQIGDAIFASDHVRKFTFTGSTPIGKL
LLEKSAKTLKKVSLELGGNAPFIVFDDADIEAAVEGALIAKFRNAGQTCVCVNRFLVQSGVYEKFIQVFKAKIESLKIGN
GLEAGSEIGPLINAQAVAKVQSHIEDALSKNGRLITGGQVHATGELFFEPTLIADANTEMMVATQETFGPLAAIFKFDTE
QQAIQMANDTEFGLAAYCYTRDLGRAWRMSEQLEYGMVGINKGLISNEVAPFGGIKHSGLGREGSKYGIEDYLEIKYTLF
GGL
>Q9HZX3 2.3.2.13~~~tgpA~~~Protein-glutamine gamma-glutamyltransferase~~~
MNAIPRVALVWLLVAQVLVILPHLAYMPLWIAAMWLGCAAWRVQVFRMRAGYPRAWVKLALALLAGAGVWLSRGSLVGLD
AGAVLLIAAFILKLVEMKTRRDALVLVFLGFFAVVVGYLFDDGFLAALYSLLPVTALLAALIGLQQSAFASRPWPTLRLA
GGLLLQALPLMLLLFLFFPRLGPLWSLPMPGNKGVTGLSESMAPGDIAELGRSAELAFRVRFEGALPPREQLYWRALTME
RFDGRRWAQAPQWSGEDALHWQKRGPELRYDVIMQPSSQPWLFALDVAQTDQTDTRLMSDFHLQRRQPVEQRLFYRVSSW
PQALRESSIDPRTRWRNLQLPMHGNPRARALADELRQAHAQPQALVAALLQRFNHEPFAYTLKPPATGADGVDDFLFDTR
SGFCAHYAGAMAFVLRAAGIPARVVAGYQGGELNPAGNYLLVHQFDAHAWVEYWQPEQGWLSVDPTYQVAPERIEQGLEQ
ALAGDSEYLADAPLSPLRYRGLPWLNDMRLAWDSLNYGWQRWVLAYQGEQQGAFLQRWFGGLDPTRLGLLLGAAAILSVG
LLALFLLKPWQGRGDLRSRQLRRFERLLEMHGLRRSPGEGLRSYGERAARVLPAQAPAIAAFVGAFEAQRYGHGGADDPG
LRLRALRRALPWRLVRTPTRDGRGEEQA
>P9WKC8 2.3.1.20~~~tgs1~~~Probable diacyglycerol O-acyltransferase tgs1~~~
MNHLTTLDAGFLKAEDVDRHVSLAIGALAVIEGPAPDQEAFLSSLAQRLRPCTRFGQRLRLRPFDLGAPKWVDDPDFDLG
RHVWRIALPRPGNEDQLFELIADLMARRLDRGRPLWEVWVIEGLADSKWAILTKLHHCMADGIAATHLLAGLSDESMSDS
FASNIHTTMQSQSASVRRGGFRVNPSEALTASTAVMAGIVRAAKGASEIAAGVLSPAASSLNGPISDLRRYSAAKVPLAD
VEQVCRKFDVTINDVALAAITESYRNVFIQRGERPRFDSLRTLVPVSTRSNSALSKTDNRVSLMLPNLPVDQENPLQRLR
IVHSRLTRAKAGGQRQFGNTLMAIANRLPFPMTAWAVGLLMRLPQRGVVTVATNVPGPRRPLQIMGRRVLDLYPVSPIAM
QLRTSVAMLSYADDLYFGILADYDVVADAGQLARGIEDAVARLVAISKRRKVTRRRGALSLVV
>P9WKC9 2.3.1.20~~~tgs1~~~Probable diacyglycerol O-acyltransferase tgs1~~~COG1020
MNHLTTLDAGFLKAEDVDRHVSLAIGALAVIEGPAPDQEAFLSSLAQRLRPCTRFGQRLRLRPFDLGAPKWVDDPDFDLG
RHVWRIALPRPGNEDQLFELIADLMARRLDRGRPLWEVWVIEGLADSKWAILTKLHHCMADGIAATHLLAGLSDESMSDS
FASNIHTTMQSQSASVRRGGFRVNPSEALTASTAVMAGIVRAAKGASEIAAGVLSPAASSLNGPISDLRRYSAAKVPLAD
VEQVCRKFDVTINDVALAAITESYRNVLIQRGERPRFDSLRTLVPVSTRSNSALSKTDNRVSLMLPNLPVDQENPLQRLR
IVHSRLTRAKAGGQRQFGNTLMAIANRLPFPMTAWAVGLLMRLPQRGVVTVATNVPGPRRPLQIMGRRVLDLYPVSPIAM
QLRTSVAMLSYADDLYFGILADYDVVADAGQLARGIEDAVARLVAISKRRKVTRRRGALSLVV
>P9WKC7 2.3.1.20~~~tgs2~~~Probable diacyglycerol O-acyltransferase tgs2~~~COG1020
MDLMMPNDSMFLFIESREHPMHVGGLSLFEPPQGAGPEFVREFTERLVANDEFQPMFRKHPATIGGGIARVAWAYDDDID
IDYHVRRSALPSPGRVRDLLELTSRLHTSLLDRHRPLWELHVVEGLNDGRFAMYTKMHHALIDGVSAMKLAQRTLSADPD
DAEVRAIWNLPPRPRTRPPSDGSSLLDALFKMAGSVVGLAPSTLKLARAALLEQQLTLPFAAPHSMFNVKVGGARRCAAQ
SWSLDRIKSVKQAAGVTVNDAVLAMCAGALRYYLIERNALPDRPLIAMVPVSLRSKEDADAGGNLVGSVLCNLATHVDDP
AQRIQTISASMDGNKKVLSELPQLQVLALSALNMAPLTLAGVPGFLSAVPPPFNIVISNVPGPVDPLYYGTARLDGSYPL
SNIPDGQALNITLVNNAGNLDFGLVGCRRSVPHLQRLLAHLESSLKDLEQAVGI
>P9WKC5 2.3.1.20~~~tgs3~~~Probable diacyglycerol O-acyltransferase tgs3~~~COG1020
MVTRLSASDASFYQLENTATPMYVGLLLILRRPRAGLSYEALLETVEQRLPQIPRYRQKVQEVKLGLARPVWIDDRDFDI
TYHVRRSALPSPGSDEQLHELIARLAARPLDKSRPLWEMYLVEGLEKNRIALYTKSHQALINGVTALAIGHVIADRTRRP
PAFPEDIWVPERDPGTTRLLLRAVGDWLVRPGAQLQAVGSAVAGLVTNSGQLVETGRKVLDIARTVARGTAPSSPLNATV
SRNRRFTVARASLDDYRTVRARYDCDSTTWC
>P9WKC3 2.3.1.20~~~tgs4~~~Probable diacyglycerol O-acyltransferase Tgs4~~~COG1020
MTRINPIDLSFLLLERANRPNHMAAYTIFEKPKGQKSSFGPRLFDAYRHSQAAKPFNHKLKWLGTDVAAWETVEPDMGYH
IRHLALPAPGSMQQFHETVSFLNTGLLDRGHPMWECYIIDGIERGRIAILLKVHHALIDGEGGLRAMRNFLSDSPDDTTL
AGPWMSAQGADRPRRTPATVSRRAQLQGQLQGMIKGLTKLPSGLFGVSADAADLGAQALSLKARKASLPFTARRTLFNNT
AKSAARAYGNVELPLADVKALAKATGTSVNDVVMTVIDDALHHYLAEHQASTDRPLVAFMPMSLREKSGEGGGNRVSAEL
VPMGAPKASPVERLKEINAATTRAKDKGRGMQTTSRQAYALLLLGSLTVADALPLLGKLPSANVVISNMKGPTEQLYLAG
APLVAFSGLPIVPPGAGLNVTFASINTALCIAIGAAPEAVHEPSRLAELMQRAFTELQTEAGTTSPTTSKSRTP
>P0A847 2.4.2.29~~~tgt~~~Queuine tRNA-ribosyltransferase~~~COG0343
MKFELDTTDGRARRGRLVFDRGVVETPCFMPVGTYGTVKGMTPEEVEATGAQIILGNTFHLWLRPGQEIMKLHGDLHDFM
QWKGPILTDSGGFQVFSLGDIRKITEQGVHFRNPINGDPIFLDPEKSMEIQYDLGSDIVMIFDECTPYPADWDYAKRSME
MSLRWAKRSRERFDSLGNKNALFGIIQGSVYEDLRDISVKGLVDIGFDGYAVGGLAVGEPKADMHRILEHVCPQIPADKP
RYLMGVGKPEDLVEGVRRGIDMFDCVMPTRNARNGHLFVTDGVVKIRNAKYKSDTGPLDPECDCYTCRNYSRAYLHHLDR
CNEILGARLNTIHNLRYYQRLMAGLRKAIEEGKLESFVTDFYQRQGREVPPLNVD
>P66905 2.4.2.29~~~tgt~~~Queuine tRNA-ribosyltransferase~~~
MPAVTYEHIKTCKQSGARLGIVHTPHGSFETPMFMPVGTKATVKTMSPEELRQIEAKIILGNTYHLWLQPGNDIIKHAGG
LHKFMNWDGPILTDSGGFQVFSLSNLRKITEEGVEFRHHTNGSKLFLSPEKSMQIQNDLGSDIMMAFDECPPMPAEYDYV
KKSIERTTRWAKRCLDAHQRPEDQALFGIIQGGEYEDLREQSAKDLVELDFPGYAIGGLSVGEPKPVMYKMVEHTEQFMP
KDKPRYLMGVGSPDALIECSIRGMDMFDCVLPTRIARNGTCMTSQGRLVIKNAKFADDLRPLDENCDCYTCQNYSRAYIR
HLIKAEETFGIRLTTIHNLHFLLKLMEDIRQAIREDRLLDFKEEFFEQYGLNVENPKNF
>Q9X1P7 2.4.2.29~~~tgt~~~Queuine tRNA-ribosyltransferase~~~COG0343
MEFEVKKTFGKARLGVMKLHHGAVETPVFMPVGTNASVKLLTPRDLEEAGAEIILSNTFHLMLKPGVEIIKLHRGLHNFM
GWKRPILTDSGGFQVFSLPKIRIDDEGVVFRSPIDGSKVFLNPEISMEVQIALGSDICMVFDHCPVPDADYEEVKEATER
TYRWALRSKKAFKTENQALFGIVQGGIYPDLRRESALQLTSIGFDGYAIGGLSIGEERSLTLEMTEVTVEFLPEDKPRYF
MGGGSPELILELVDRGVDMFDSVFPTRIARHGTALTWNGKLNLKASYNKRSLEPVDERCGCYTCKNFTRSYIHHLFDRGE
VLGQILLTIHNINFMISLMKEVRRSIESGTFKELKSKVVEVYSSGGVNV
>P28720 2.4.2.29~~~tgt~~~Queuine tRNA-ribosyltransferase~~~COG0343
MVEATAQETDRPRFSFSIAAREGKARTGTIEMKRGVIRTPAFMPVGTAATVKALKPETVRATGADIILGNTYHLMLRPGA
ERIAKLGGLHSFMGWDRPILTDSGGYQVMSLSSLTKQSEEGVTFKSHLDGSRHMLSPERSIEIQHLLGSDIVMAFDECTP
YPATPSRAASSMERSMRWAKRSRDAFDSRKEQAENAALFGIQQGSVFENLRQQSADALAEIGFDGYAVGGLAVGEGQDEM
FRVLDFSVPMLPDDKPHYLMGVGKPDDIVGAVERGIDMFDCVLPTRSGRNGQAFTWDGPINIRNARFSEDLTPLDSECHC
AVCQKWSRAYIHHLIRAGEILGAMLMTEHNIAFYQQLMQKIRDSISEGRFSQFAQDFRARYFARNS
>Q2T4N1 ~~~thaG~~~Polyketide synthase ThaG~~~
MGIEQLNTTRAQHDDIAVIGIACRFPGASDYRQFWDNLCERRVSIGEIPGERWDWRAYWGDPQQDANACNSRWGGFVDEV
GAFDLGFFGMSAREVDKMDPQQRLALELAWHCFEDAGLRPSAVAKRDVGVFVGIANLDYKEIVEAHASEIDAYYASGVAA
SVASNRLSYWFDLRGPSVTVDTACSGSLYALHLAREALRRGECEMALAGGVSLLLTPRRYLGFARARMLSPTGAIRAFDD
AADGMVRGEGGGLVLLKPLSRALDDGDRIFGVLAGSAVNHSGRTYSLTYPDADAQARVIRAAFDAAGVSPAQVSYVEAHG
TGTPKGDPIELEGLLRVFGAQPRDALAPRCAIGSVKANIGHLESAAGIAGVIKTLLALHHGMLPPMPHFAALNHRVDAAR
FAAAGLAVADALAPWEAGASADGASAARIAGVSAFGFAGTNAHVVLREAPAAARAAVAAGRADGAGVLCLSAKTDAALER
VRAGFVEWLARDGARHAFADICAALVSRREHFASRIAVAACDVDGALAALRARADTASSVYRAGRVALDDDGNLVDASGP
HDEQADHDGSGEHGEHGERARAGADDLSRARRRASRYVRGASPDAWGPSDGPTPHVPLPDYPFERTVCWTGPRPSRAVAG
PARGPAVDEAVDEAVDEAGAADGDVFVLEPRWHLAPAADARPAPARPEDAAVVVCVVADAAQRAAVERALARIATAPACM
FVEPDATMGEPARAVDALVRTLDGIADRARADVSGVDTSGAAAPISVWYCADLARDARAPIDVRAYDRLLTLLQALAKTR
APLRRVLLAGVSESLAADAWPALVTVLAPRRPALAVTPVLFDHGTPEALWVSALLQEARAAGGAAVRYAAGERSVAVLAR
AANAASENAARLAGGAPLRTGGCYLLTGGLGGLGRLFAVRLMRRYGARVVALGRSAHDAGVQASVAALCAEAGAGTLRYL
QADVCDAAAMAAVLDDIGRHEGRLNGVIHAAGCEDRASLADKSLDDVHAVLAPKLAAATVLDRLTAGLPLDFVCLFSSLA
GIVGDFGCGAYALGNRLLMSYAQRADGRVDAHGRRRRVVAIAWPLWRDGAMGFDDPVKRDRYLAASGQRMLDADEGFAWF
ERLLASPSAAPVVLAGERARIVSWLGSIDGAPVADASPGAASSGAAAPNAASDASRDTERAEATDAAAPIVAAGAPAHAH
ASAPRERDAHGSARLPALRGIAARVLGADAALLDEQASFADLGFDSIGLMDFARAVGEAFGIDMSPAMLFSHVTLKRLAG
HLATRLDDAPRAVAGARSGAAHASAGRACAAAGRADGARASRIDVRPPEAGAGERAADARPPLTAPEIAPEIAPESASGF
ASAPHPAPIPPAYEPIAIVGMSGRFPDARDIDAFWSLLMSGRGAARRVDPHDRFAAPRDEAAAWLAPVPGIDEFDPLFFD
IAPAEAQRMDPRERLLLQHAWLALEDAGIGARALADHALGMFVGAEAGEYGGLSREPRSIVSDHNGVLAARLSYFLDLTG
PNLSVNTACSSGLVALHLAIRSLHARECGIALAAGVNLMLSAELHDKMARAGMLSQHGVCRALDRDADGFVPGEAVVVVA
LKRLADALADGDAIHGVIRASGVNYDGRTNGMTAPSGLAQARLLTDAWRTAGIDPAELGYVVAHGTGTRLGDPVELNALG
DALRASTGRRHFCAVTSNKPNVGHTLAASGLVSLVNLVEALRRAVIPPSAHWREGNEFVAWGDSPLRVNTVASAWPDGAR
RLGGASAFGMSGTNAHVIVEAAEPARARAPHAPRGGVLFVVSAKTGDALAQRLRDLADCLASDAWSDGDLADVAHTLLHG
RQHFEHRSAVFARGKADAIDALRAAADAVRNAPARRTGALRLLDDYAAHLSARYAAWRDAPARDVDEARRLLEAIGDAYR
EGATPSALVVAPAPGRTVRLPGYPFACERHWHPAPARAGGASAGANAREPGARERAEAAAAPHVVVLTLTGSEPYLRDHL
IDGQRVLPGAAHLDMARAAFERVRAAGGRAARFVALRDVVWRAPLVAGDAGAEVHVELTPSGTAFAYRILSRPADRAAAV
GAALTLHSEGVVDELNEPAPEPLDLDGWRAACAGERVDGAALHARFASLGIGYGPSHRGVAHVRLGEHAALARLDLGALA
DADFDGFGLHPGIVDSAFQASAALSAPREDAAVPFMLRALHVYSRAARTMWAAIRVAAPAGGGAGGVDTRPIDIDLVHDD
GAICASLRGWVSARWRKLAGAPPAARATAALRGWRWRAAEPDPRAAEAATRVVLLAPDFGALAAELAAHDAVRCEMLDDV
SSHALPDAFAARATQAFAAVRRELAAKPGRPVLFQAVCREADSATAWGLAGLFKTAADENPLASGHVVLVSESFDRARLA
AALLAERAWTGHLRYGGAGREAFAIDADEDDRDGREPAGGPPLRDDGVYAISGGAGGIGRAIARDIVARTRAARVYLLGR
SAHAPSDLFATADERARIRYCRVDVADREQVHAWIAGLKRDGEALRGVVHAAGVVDDAFVVAKEPSRVHAVLRPKVAGAV
WLDEATRDAPLDFFVAFSSLASAFGNAGQADYAMANAFVDGYMTERRARVAGGVSRGASLSIQWPLWADAGMRMPAAIAS
RLAAATGLAPLPSAEGIAAWRAALARDDANVAVLYGERAAIAAWLATSDAMPPRARAARPMPAARVDLHALRALAASLIG
VDAHDINVDADIDEYGLDAVALAHLAREIGERAGRADVGLGFVRGERTLRAIARALDASAPGGDAEVDIGADADADADNA
PDAEACVECAGIDARATPPDAGAPQRGAHAAAAEGSARDEAARSIAPLAGAGSPDGASALRARVGARLSALLADVLKVPV
ARLEPDAPFERYGIDSLTVVALNESLGRHVDALPKTLFFEYRTLAELTDYFVRRHAGAAWFAPDARAEHGAAGALATLGA
ATRATQTEQAEQSRVAPLAPPRRVFAAATAATRSRAASAPPAATADAIAVIGLAGRYPQARDLDAFWRNLRDGRDCITEI
PAERWNHGDFFDPQKGVAGKTYSKWGGFIEGVDEFDAAFFNIAPRDAERMDPQERLFLQASYQAIEDAGYARASLGAGRV
GVFAGVMYSEYQLYGIEESAAGRPAALSGSAASIANRVSYHLDLHGPSMAVDTMCSSSLTALHLACRSLQRGECELALAG
GVNVSVHPNKYLMLADNRFVSSAGRCESFGAGGDGYVPAEGVGVAVLKPLRAAIADGDAIHGVICATALNHGGKNNGYTV
PNPAWQAAVIEAALGEAGVAPGDVSYVEAHGTGTTLGDPIEIAGLARAFGEPGARRGAPCAIGSVKSNIGHAESAAGIAG
LTKVLLQMRHRMLVPSLHADTLNPNIAFETTPFCVQRALERWERPGDGERVAGVSSFGAGGANAHVIVREYRSDDERDGR
DEPASTAPARARPAWIVLSARNEDGLRARAAQLRELAAGCEGDADLHAIAYTLQTGRDAMDERLAFEATSIADLIASLDA
FARGEPGKKQLRGNGRARRGDAPPAGVADARLARGEHRAALDDWVRGASIDWRRAYGPGAPFGPAPRRMHLPVYPFARTR
HWLPAPLDARRRAARAGGGLHPMLDANRSEFGRQRFVVEFDGREPWLADHRVDGRRVLPGVAYLEMARAALAASAPDMHA
DGGATLEDVRWLRPCIVPDGGATLEIALERDGDGIAFSISQTSAHAQPALCCTGRSPSRAARGGGERIDVDAMRDAFRAA
PAFDADACYRAFARRGVQYGPSHRTIERVWADGDRALARLRSARPADPRLVMWPGLLDGALQSLIGLHGLDGDLLAAPYR
LARLDVHGACGPAMWALARRAGDGALDLVLCDDAGEACVTMRGFASRAAASWRTAAQAAGAPHEAVAGGDARAAFGPAAE
SPSAATSTSAATSPAISTSAATPAAADGDDWLLLPRWLACDLDPERDAGARAVAPRSVLAIADDACVHDLSAAAFGGASV
RRMPASDAADAARLAAILGDAPRLDALVFVAPGAHARSAQALIDAQESGAIALFRIVKALLAHGYRDDALTLAIVTEQAV
ALYPGEPIDPAHAALHGMAGVLGRDLPRWRVLCADVERARAYAGAALVAQAGPAGEGPRINRLGRWHRRALARVDAPAAR
ESAFRRGGVYVIVGGAGGLGRVLTEHLIRRADARVVWVGRRAADGRIAADCASFESIGAPPLYLQADASDAGAMGAVRDA
VKARYGAIHGVVHSALALRDATLATMTEADFRRVLAAKLDTSVRLAEAFADEPLDFMLFFSSFAAFSFPQGQANYAAGCA
FQDAFAQHLAARAPFAVKTVNWGFWGHAGVVATPAHRDRMARLGIGSIEPDAAMRSLECLLACDVGQLALINVRRDDAIA
PLMTPRRVALHAPGAGPRALDSVARVRPDAERTAAARLHGGLQRDEWRDVLLSLLDATFDALGARRAAEAGGDALRAALG
IAPRHARWFAASLDWLRALREGAHAAAAMSADDAWRAWARLGAACDGDAGRSAQYALVDATLRHLADIVTGRRKATEIMF
PKGSMALVENVYRHHPVADYFNLAVAEIVAARAADVVRERGVRILEVGAGTGGTTARVLAVLRERGVRVDDYRFTDVSPG
FLDHAAARFGAGAPFMSYGLFDVMRAPAAQGIAPHGFDIVVATNVLHATADVRASLRHCRDALRAGGLLVVNEISDKSLF
THLTFGLLDGWWAYEDAALRIDGSPALDAHHWAFALRSEGYADVAFPWADAHDLGQQIVLADAGDVVDAADAQAMRAREA
EPWAWATGAESESVLEAVSETETALTPAPTQSPAAPSDATRTAALLRSLLGEALKVEPVSIEGDGAFGDYGLDSIIGMGF
VDAINRALDLSLDVTAVFEHNTVDALAAYVGSQLAARAPAAAGARDVEPASSLASSSASDFVSARLPAVDAAASSAFDAA
PRARTGADAPDTSLASSASSISSARASSPASPARDAASFDVAIVGASCRFPGADGLDALWRCIVDERSCVRDVGAKWLRT
RDPADAGADYRAAVLDGIDRFDAARFGISPREARRMDPQQRLLLTEAWRALQDAGDAARAAAHRTGVFVAAGANEYGAGL
DAADNPFSMTSMAPALMPNRISYALDLRGPSEMTDTACSSSLVALHRAVRSLRDGECDQAVVAAVNLLLSAEKFEGFAEL
GFLSPSGRTRSFDAAGDGFVRGEGAAALVLKPLAAARRDGDFVYACIKGTAVHHGGRGAALTAPNAAGIREAMSAAYRNA
GIDARTVSYLEAHGVGSPVGDAIELNAIRDAYAALSGEPAPAAPAASCRIGSVKPVFGHVELASGLLAVCKVLMALRHGV
LPGVPGFERPNPHANLAGSPLVVARAAAPWPAPRDDANGAAVPRRASVNSFGFGGVNAHVVLEEDVSSRHAPRVFAQPPL
DGARHWECPREIAGAPAARPGRPAPDAPHARIEAVIRDALAQALGVAPDAFELAVPLGEYGADSMLDLHLATRIEETLGV
QLSVRELFAHRTLGALRDHVAERIARDGGRRAAEAARAPDAAMASDVAEASVVSEATEASDASEASDASEASEASEASEA
SKAPADLAALLERFRAGQMDLDDIVDLV
>C0HLR2 ~~~~~~Bacteriocin thailandicin~~~
KKNMLLVNPIVGIGGLFVGAPMLTANLGISSYAAKKVIDDINTGSAVATIIALVTAVVGGGLITAGIVATTKSLIKKYGA
KYSAAW
>P46369 1.2.1.3~~~thcA~~~EPTC-inducible aldehyde dehydrogenase~~~
MTKYARPGTADAIMSFQSRYDNWIGNEWVAPVKGQYFENPTPVTGQNFCDVARSTAEDIELALDAAHAAAPAWGKTSVAE
RAIILNKIADRMEENLESIALAESWDNGKPIRETLNADIPLAIDHFRYFAGAIRAQEGSLSEINSDTVAYHFHEPLGVVG
QIIPWNFPILMAVWKLAPALAAGNAIVLKPAEQTPVSILHLIGIIGDLLPAGVLNIVNGFGVEAGKPLASSPRIKKIAFT
GETTTGRLIMQYASQNLIPVTLELGGKSPNVFFSDVLASNDDYQDKALEGFTMFALNQGEVCTAPSRALIQEDIFDEFLA
MAAIRTKAVRQGDPLDTDTMIGAQASNDQLEKILSYIEIGKAEGAKVITGGERAELGGDLSGGYYVQPTVFTGNNKMRIF
QEIFGPVVSVTSFKDYDEAIEIANDTLYGLGAGVWSRDGGVAYRAGRDIQAGRVWTNTYHQYPAHAAFGGYKQSGIGREN
HLMMLSHYQQTKNLLVSYAQKAQGFF
>P43492 1.14.-.-~~~thcB~~~Cytochrome P450 116~~~
MTVDHAPEGVKSPTGCPVSGMAADFDPFRGAYQVDPSSSLRQARKDEPVFFSPLLDYWVVTRYEDIKQIFKTPSVFSPSI
TVDQITPISDEALQILGSYQFAAGRMLVNEDEPIHTERRRLLMQPFEADNVATLEPKIREVVNTYLDRVIKDGRADLIGD
LLYEVPCIVALIFLGVPDEDIETCRQYGMQQTLFTWGHPTGDEQTRVATGMGKFWEFAGGLVDKLKADPNAKGWIPHAIE
MQRQHPDLFDDNYLQNIMFGGVFAAHETTTNATGNAFRTLLENRSSWDEICADPTLIPKAIEECLRYSGSVVAWRRKAVV
DTTVGEVDIPAGGRLLIVMASANRDDSMFPEPDDFDIHRGNAQRHLTFGIGSHTCLGATLARLEMKVFLEEVSRRLPHMS
LVAGQEFSYLPNTSFRGPEHVLVEWDPQQNPVPADRP
>P43493 ~~~thcC~~~Rhodocoxin~~~
MPTVTYVHPDGTKHEVEVPTGKRVMQAAIGAGIDGIVAECGGQAMCATCHVYVESPWADKFPSISEEEDEMLDDTVSPRT
EASRLSCQLVVSDDVDGLIVRLPEEQV
>P43494 1.18.1.-~~~thcD~~~Rhodocoxin reductase~~~
MSIVIIGSGQAGFEAAVSLRSHGFSGTITLVGDEPGVPYQRPPLSKAYLHSDPDRESLALRPAQYFDDHRITLTCGKPVV
RIDRDAQRVELIDATAIEYDHLILATGARNRLLPVPGANLPGVHYLRTAGEAESLTSSMASCSSLVVIGAGFIGLEVAAA
ARKKGLDVTVVEAMDRPMARALSSVMSGYFSTAHTEHGVHMRLSTGVKTINAADGRAAGVTTNSGDVIHADAVVVGIGVV
PNIELAALTGLPVDNGIVVDEYLRTPDENISAIGDCAAYPIPGKAGLVRLESVQNAVDQARCLAAQLTGTSTHYRSVPWF
WSEQYESKLQMAGLTAGADTHVVRGSVDSGVFSIFCFLGTRLLGVESVNKPRDHMAARKILATEMPLTPEQAADTDFDLK
LAIARHKDTHKDEVASADIGERQVVAS
>Q812G9 ~~~~~~Thiocillin~~~
MSEIKKALNTLEIEDFDAIEMVDVDAMPENEALEIMGASCTTCVCTCSCCTT
>P0C8P8 ~~~tpdA~~~Thiostrepton~~~
MDATAIHERWSVMSNASIGQEIGVEGLTGLDVDALEISDYVDETLLDGEDLTVTMIASASCTTCICTCSCSS
>P0C8P9 ~~~getA~~~Thiocillin GE37468~~~
MGNNEEYFIDVNDLSIDVFDVVEQGGAVTALTADHGMPEVGASTNCFCYICCSCSSN
>Q43880 3.4.24.27~~~~~~Thermolysin~~~
MNKRAMLGAIGLAFGLMAWPFGASAKEKSMVWNEQWKTPSFVSGSLLKGEDAPEELVYRYLDQEKNTFQLGGQARERLSL
IGKQTDELGHTVMRFEQRYRGIPVYGAVLVAHVNDGELSSLSGTLIPNLDKRTLKTEAAISIQQAEMIAKQDVADAVTKE
RPAAEEGKPTRLVIYPDGETPRLAYEVNVRFLTPVPGNWIYMIDAADGKVLNKWNQMDEAKPGGGQPVAGTSTVGVGRGV
LGDQKYINTTYSSYYGYYYLQDNTRGSGIFTYDGRNRTVLPGSLWADGDNQFFASYDAAAVDAHYYAGVVYDYYKNVHGR
LSYDGSNAAIRSTVHYGRGYNNAFWNGSQMVYGDGDGQTFLPFSGGIDVVGHELTHAVTDYTAGLVYQNESGAINEAMSD
IFGTLVEFYANRNPDWEIGEDIYTPGIAGDALRSMSDPAKYGDPDHYSKRYTGTQDNGGVHTNSGIINKAAYLLSQGGVH
YGVSVTGIGRDKMGKIFYRALVYYLTPTSNFSQLRAACVQAAADLYGSTSQEVNSVKQAFNAVGVY
>Q59193 3.4.24.27~~~npr~~~Thermolysin~~~
MDKRAMLGAIGLAFGLMAWPFGASAKEKSMVWNEQWKTPSFVSGSLLKGEDAPEELVYRYLDQEKNTFQLGGQARERLSL
IGKQTDELGHTVMRFEQRYRGIPVYGAVLVAHVNDGELSSLSGTLIPNLDKRTLKTEAAISIQQAEMIAKQDVADAVTKE
RPAAEEGKPTRLVIYPDGETPRLAYEVNVRFLTPVPGNWIYMIDAADGKVLNKWNQMDEAKPGGGQPVAGTSTVGVGRGV
LGDQKYINTTYSSYYGYYYLQDNTRGSGIFTYDGRNRTVLPGSLWADGDNQFFASYDAAAVDAHYYAGVVYDYYKNVHGR
LSYDGSNAAIRSTVHYGRGYNNAFWNGSQMVYGDGDGQTFLPFSGGIDVVGHELTHAVTDYTAGLVYQNESGAINEAMSD
IFGTLVEFYANRNPDWEIGEDIYTPGIAGDALRSMSDPAKYGDPDHYSKRYTGTQDNGGVHTNSGIINKAAYLLSQGGVH
YGVSVTGIGRDKMGKIFYRALVYYLTPTSNFSQLRAACVQAAADLYGSTSQEVNSVKQAFNAVGVY
>Q59223 3.4.24.27~~~npr~~~Thermolysin~~~
MDKRAMLGAIGLAFGLMAWPFGASAKEKSMVWNEQWKTPSFVSGSLLKGEDAPEELVYRYLDQEKNTFQLGGQARERLSL
IGKQTDELGHTVMRFEQRYRGIPVYGAVLVAHVNDGELSSLSGTLIPNLDKRTLKTEAAISIQQAEMIAKQDVADAVTKE
RPAAEEGKPTRLVIYPDGETPRLAYEVNVRFLTPVPGNWIYMIDAADGKVLNKWNQMDEAKPGGGQPVAGTSTVGVGRGV
LGDQKYINTTYSSYYGYYYLQDNTRGSGIFTYDGRNRTVLPGSLWADVDNQFFASYDAAAVDAHYYAGVVYDYYKNVHGR
LSYDGSNAAIRSTVHYGRGYNNAFWNGSQMVYGDGDGQTFLPFSGGIDVVGHELTHAVTDYTAGLVYQNESGAINEAMSD
IFGTLVEFYANRNPDWEIGEDIYTPGIAGDALRSMSDPAKYGDPDHYSKRYTGTQDNGGVHTNSGIINKAAYLLSQGGVH
YGVSVTGIGRDKMGKIFYRALVYYLTPTSNFSQLRAACVQAAADLYGSTSQEVNSVKQAFNAVGVY
>P00800 3.4.24.27~~~npr~~~Thermolysin~~~
MKMKMKLASFGLAAGLAAQVFLPYNALASTEHVTWNQQFQTPQFISGDLLKVNGTSPEELVYQYVEKNENKFKFHENAKD
TLQLKEKKNDNLGFTFMRFQQTYKGIPVFGAVVTSHVKDGTLTALSGTLIPNLDTKGSLKSGKKLSEKQARDIAEKDLVA
NVTKEVPEYEQGKDTEFVVYVNGDEASLAYVVNLNFLTPEPGNWLYIIDAVDGKILNKFNQLDAAKPGDVKSITGTSTVG
VGRGVLGDQKNINTTYSTYYYLQDNTRGNGIFTYDAKYRTTLPGSLWADADNQFFASYDAPAVDAHYYAGVTYDYYKNVH
NRLSYDGNNAAIRSSVHYSQGYNNAFWNGSQMVYGDGDGQTFIPLSGGIDVVAHELTHAVTDYTAGLIYQNESGAINEAI
SDIFGTLVEFYANKNPDWEIGEDVYTPGISGDSLRSMSDPAKYGDPDHYSKRYTGTQDNGGVHINSGIINKAAYLISQGG
THYGVSVVGIGRDKLGKIFYRALTQYLTPTSNFSQLRAAAVQSATDLYGSTSQEVASVKQAFDAVGVK
>P43133 3.4.24.27~~~nprS~~~Thermolysin~~~
MKRKMKMKLVRFGLAAGLAAQVFFLPYNALASTEHVTWNQQFQTPQFISGDLLKVNGTSPEELVYQYVEKNENKFKFHEN
AKDTLQLKEKKNDNLGFTFMRFQQTYKGIPVFGAVVTAHVKDGTLTALSGTLIPNLDTKGSLKSGKKLSEKQARDIAEKD
LVANVTKEVPEYEQGKDTEFVVYVNGDEASLAYVVNLNFLTPEPGNWLYIIDAVDGKILNKFNQLDAAKPGDVKSITGTS
TVGVGRGVLGDQKNINTTYSTYYYLQDNTRGNGIFTYDAKYRTTLPGSLWADADNQFFASYDAPAVDAHYYAGVTYDYYK
NVHNRLSYDGNNAAIRSSVHYSQGYNNAFWNGSQMVYGDGDGQTFIPLSGGIDVVAHELTHAVTDYTAGLIYQNESGAIN
EAISDIFGTLVEFYANKNPDWEIGEDVYTPGISGDSLRSMSDPAKYGDPDHYSKRYTGTQDNGGVHINSGIINKAAYLIS
QGGTHYGVSVVGIGRDKLGKIFYRALTQYLTPTSNFSQLRAAAVQSATDLYGSTSQEVASVKQAFDAVGVK
>Q45670 3.4.21.-~~~~~~Thermophilic serine proteinase~~~
MKFKAIVSLSLAVSMSLFPFLVEAASNDGVESPKTVSEINVSHEKGAYVQGEVIVQFKEQVNAEEKAKALKEVGATAVPD
NDRVKSKFNVLKVGNVEAVVKALNNNPLVEYAEPNYLFNAAWTPNDTYYQGYQYGPQNTYTDYAWDVTKGSSGQEIAVID
TGVDYTHPDLDGKVIKGYDFVDNDYDPMDLNNHGTHVAGIAAAETNNATGIAGMAPNTRILAVRALDRNGSGTLSDIADA
IIYAADSGAEVINLSLGCDCHTTTLENAVNYAWNKGSVVVAAAGNNGSSTTFEPASYENVIAVGAVDQYDRLASFSNYGT
WVDVVAPGVDIVSTITGNRYAYMSGTSMASPHVAGLAALLASQGRNNIEIRQAIEQTADKISGTGTYFKYGRINSYNAVT
Y
>P04072 3.4.21.66~~~~~~Thermitase~~~
YTPNDPYFSSRQYGPQKIQAPQAWDIAEGSGAKIAIVDTGVQSNHPDLAGKVVGGWDFVDNDSTPQNGNGHGTHCAGIAA
AVTNNSTGIAGTAPKASILAVRVLDNSGSGTWTAVANGITYAADQGAKVISLSLGGTVGNSGLQQAVNYAWNKGSVVVAA
AGNAGNTAPNYPAYYSNAIAVASTDQNDNKSSFSTYGSVVDVAAPGSWIYSTYPTSTYASLSGTSMATPHVAGVAGLLAS
QGRSASNIRAAIENTADKISGTGTYWAKGRVNAYKAVQY
>Q8DJT8 ~~~thf1~~~Protein Thf1~~~
MQNPRTVSDTKRAFYAAHTRPIHSIYRRFIEELLVEIHLLRVNVDFRYSPLFALGVVTAFDQFMEGYQPEGDRDRIFHAL
CVAEEMNPQQLKEDAASWQQYQGRPLSQILDELNSGQPSAPLNSLNHTGKYSRLHAVGLYAFLQELAGEVTIHLNETLDQ
LAPVIPLPIEKVKRDLELYRSNLDKINQARSLMKELVEQERKRRAQQTSAPPAVDASSDAPA
>P07464 2.3.1.18~~~lacA~~~Galactoside O-acetyltransferase~~~COG0110
MNMPMTERIRAGKLFTDMCEGLPEKRLRGKTLMYEFNHSHPSEVEKRESLIKEMFATVGENAWVEPPVYFSYGSNIHIGR
NFYANFNLTIVDDYTVTIGDNVLIAPNVTLSVTGHPVHHELRKNGEMYSFPITIGNNVWIGSHVVINPGVTIGDNSVIGA
GSIVTKDIPPNVVAAGVPCRVIREINDRDKHYYFKDYKVESSV
>P45741 2.5.1.2~~~~~~Thiaminase-1~~~
MSKVKGFIYKPLMVMLALLLVVVSPAGAGAAHSDASSDITLKVAIYPYVPDPARFQAAVLDQWQRQEPGVKLEFTDWDSY
SADPPDDLDVFVLDSIFLSHFVDAGYLLPFGSQDIDQAEDVLPFALQGAKRNGEVYGLPQILCTNLLFYRKGDLKIGQVD
NIYELYKKIGTSHSEQIPPPQNKGLLINMAGGTTKASMYLEALIDVTGQYTEYDLLPPLDPLNDKVIRGLRLLINMAGEK
PSQYVPEDGDAYVRASWFAQGSGRAFIGYSESMMRMGDYAEQVRFKPISSSAGQDIPLFYSDVVSVNSKTAHPELAKKLA
NVMASADTVEQALRPQADGQYPQYLLPARHQVYEALMQDYPIYSELAQIVNKPSNRVFRLGPEVRTWLKDAKQVLPEALG
LTDVSSLAS
>Q5ZV75 2.-.-.-~~~thi5~~~4-amino-5-hydroxymethyl-2-methylpyrimidine phosphate synthase~~~COG0715
MAMSSLKSRVTLLLNWYTNPYHTPILVAQQLGFYSEEDIKLAILEPADPSDVTEIVGLGTVDFGVKAMIHTVAAKAKGYP
VTSIGTLLDEPPTGLIALKSSGINSFQDIVGKRVGYIGEFGKKIIDDLASLAGIDPTSYKTVRIGMNVTDAIYRDVIDTG
IGFINFQKVELEHLCGETVFLRIDQLAGLGCCCFCSIQFIVPEITLQQPELVKGFLRATQRGAAYTTEKPEEAYELLCQA
KPQLRTPLYQKIFTRTLPFFSRTLINVDRDWDKVGRYTKHLKIIDEHFDISQCYTNRFLPDTPYSDLKPIACCLEN
>P31550 ~~~thiB~~~Thiamine-binding periplasmic protein~~~COG4143
MLKKCLPLLLLCTAPVFAKPVLTVYTYDSFAADWGPGPVVKKAFEADCNCELKLVALEDGVSLLNRLRMEGKNSKADVVL
GLDNNLLDAASKTGLFAKSGVAADAVNVPGGWNNDTFVPFDYGYFAFVYDKNKLKNPPQSLKELVESDQNWRVIYQDPRT
STPGLGLLLWMQKVYGDDAPQAWQKLAKKTVTVTKGWSEAYGLFLKGESDLVLSYTTSPAYHILEEKKDNYAAANFSEGH
YLQVEVAARTAASKQPELAQKFLQFMVSPAFQNAIPTGNWMYPVANVTLPAGFEKLTKPATTLEFTPAEVAAQRQAWISE
WQRAVSR
>Q7CR85 ~~~thiB~~~Thiamine-binding periplasmic protein~~~
MLKKYLPLLLLCAAPAFAKPVLTVYTYDSFAADWGPGPAVKKAFEADCNCELKLVALEDGVSLLNRLRMEGKNSKADVVL
GLDNNLLEAATQTKLFAKSGVANEAVKVPGGWKNDTFVPFDYGYFAFVYDKSKLKNPPKSLKELVESDQKWRVIYQDPRT
STPGLGLLLWMRKVYGDNAPQAWQKLAAKTVTVTKGWSEAYGLFLKGESDLVLSYTTSPAYHIIEEKKDNYAAANFSEGH
YLQVEVAARTVASKQPELAEKFLKFMVSPAFQNAIPTGNWMYPVADVALPAGFESLAKPATTLEFTPQQVAAQRQAWISE
WQRAVSR
>P45740 4.1.99.17~~~thiC~~~Phosphomethylpyrimidine synthase~~~COG0422
MQNNSVQQANISIMSSFSGSKKVYVEGSSSDIQVPMREIALSPTTGSFGEEENAPVRVYDTSGPYTDPEVTINIQEGLKP
LRQKWITERGDVEEYEGRAIKPEDNGYKKAKPNVSYPGLKRKPLRAKAGQNVTQMHYAKKGIITPEMEFIAIREHVSPEF
VRDEVASGRAIIPSNINHPESEPMIIGRNFHVKINANIGNSAVTSSIEEEVEKMTWAIRWGADTMMDLSTGKDIHTTREW
IIRNCPVPVGTVPIYQALEKVNGVAEDLTWEIYRDTLIEQAEQGVDYFTIHAGVLLRYVPLTAKRTTGIVSRGGAIMAQW
CLAHHQESFLYTHFEEICEIMKMYDIAFSLGDGLRPGSIADANDEAQFAELETLGELTQIAWKHDVQVMIEGPGHVPMHK
IKENVDKQMDICKEAPFYTLGPLTTDIAPGYDHITSAIGAAMIGWYGTAMLCYVTPKEHLGLPNRDDVREGVITYKIAAH
AADLAKGHPGAQIRDDALSKARFEFRWRDQFNLSLDPERALEYHDETLPAEGAKTAHFCSMCGPKFCSMRISQDIRDYAK
KNDLSEAEAINKGLKEKAKEFVDTGSNLYQ
>Q9A6Q5 4.1.99.17~~~thiC~~~Phosphomethylpyrimidine synthase~~~COG0422
MNIQSTIKAVAETISTGPIPGSRKVYQAGELFPELRVPFREVAVHPSANEPPVTIYDPSGPYSDPAIQIDIEKGLPRTRE
ALVVARGDVEEVADPRQVKPEDNGFAQGKHLAPEFPDTGRKIYRAKPGKLVTQLEYARAGIITAEMEYVAIRENLRREQD
RPCVRDGEDFGASIPDFVTPEFVRQEIARGRAIIPANINHGELEPMAIGRNFLVKINANIGNSAVLSTVADEVDKLVWAT
RWGADTVMDLSTGRNIHNIRDWIIRNSSVPIGTVPIYQALEKVNGVAEDLNWEVFRDTLIEQCEQGVDYFTIHAGVRLPF
IPMTAKRVTGIVSRGGSIMAKWCLAHHKENFLYERFDEICEIMRAYDVSFSLGDGLRPGSTADANDEAQFSELRTLGELT
KVAWKHGVQVMIEGPGHVAMHKIKANMDEQLKHCHEAPFYTLGPLTTDIAPGYDHITSAIGAAMIGWFGTAMLCYVTPKE
HLGLPDRDDVKTGVITYKLAAHAADLAKGHPGAAMWDDAISRARFEFRWEDQFNLGLDPETARKFHDETLPKEAHKTAHF
CSMCGPKFCSMKISQEVRDFAAGKAPNSAELGMAEMSEKFREQGSEIYLKTE
>P30136 4.1.99.17~~~thiC~~~Phosphomethylpyrimidine synthase~~~COG0422
MSATKLTRREQRARAQHFIDTLEGTAFPNSKRIYITGTHPGVRVPMREIQLSPTLIGGSKEQPQYEENEAIPVYDTSGPY
GDPQIAINVQQGLAKLRQPWIDARGDTEELTVRSSDYTKARLADDGLDELRFSGVLTPKRAKAGRRVTQLHYARQGIITP
EMEFIAIRENMGRERIRSEVLRHQHPGMSFGAHLPENITAEFVRDEVAAGRAIIPANINHPESEPMIIGRNFLVKVNANI
GNSAVTSSIEEEVEKLVWSTRWGADTVMDLSTGRYIHETREWILRNSPVPIGTVPIYQALEKVNGIAEDLTWEAFRDTLL
EQAEQGVDYFTIHAGVLLRYVPMTAKRLTGIVSRGGSIMAKWCLSHHQENFLYQHFREICEICAAYDVSLSLGDGLRPGS
IQDANDEAQFAELHTLGELTKIAWEYDVQVMIEGPGHVPMQMIRRNMTEELEHCHEAPFYTLGPLTTDIAPGYDHFTSGI
GAAMIGWFGCAMLCYVTPKEHLGLPNKEDVKQGLITYKIAAHAADLAKGHPGAQIRDNAMSKARFEFRWEDQFNLALDPF
TARAYHDETLPQESGKVAHFCSMCGPKFCSMKISQEVRDYAATQTIEMGMADMSENFRARGGEIYLRKEEA
>P9WG79 4.1.99.17~~~thiC~~~Phosphomethylpyrimidine synthase~~~COG0422
MTITVEPSVTTGPIAGSAKAYREIEAPGSGATLQVPFRRVHLSTGDHFDLYDTSGPYTDTDTVIDLTAGLPHRPGVVRDR
GTQLQRARAGEITAEMAFIAAREDMSAELVRDEVARGRAVIPANHHHPESEPMIIGKAFAVKVNANIGNSAVTSSIAEEV
DKMVWATRWGADTIMDLSTGKNIHETREWILRNSPVPVGTVPIYQALEKVKGDPTELTWEIYRDTVIEQCEQGVDYMTVH
AGVLLRYVPLTAKRVTGIVSRGGSIMAAWCLAHHRESFLYTNFEELCDIFARYDVTFSLGDGLRPGSIADANDAAQFAEL
RTLGELTKIAKAHGAQVMIEGPGHIPMHKIVENVRLEEELCEEAPFYTLGPLATDIAPAYDHITSAIGAAIIAQAGTAML
CYVTPKEHLGLPDRKDVKDGVIAYKIAAHAADLAKGHPRAQERDDALSTARFEFRWNDQFALSLDPDTAREFHDETLPAE
PAKTAHFCSMCGPKFCSMRITQDVREYAAEHGLETEADIEAVLAAGMAEKSREFAEHGNRVYLPITQ
>Q9L9I7 4.1.99.17~~~thiC~~~Phosphomethylpyrimidine synthase~~~
MSTTTLTRREQRAKAQHFIDTLEGTAFPNSKRIYVTGSQHDIRVPMREIQLSPTLIGGSKDNPQFEENEAVPVYDTSGPY
GDPEVAINVQQGLAKLRQPWIDARNDSEELDDRSSAYTRERLADDGLDDLRFTGLLTPKRAKAGKRVTQLHYARKGIVTP
EMEFIAIRENMGRERIRSEVLRHQHPGMNFGARLPENITPEFVRDEVAAGRAIIPANINHPESEPMIIGRNFLVKVNANI
GNSAVTSSIEEEVEKLVWSTRWGADTVMDLSTGRYIHETREWILRNSPVPIGTVPIYQALEKVNGIAEDLTWEAFRDTLL
EQAEQGVDYFTIHAGVLLRYVPMTAKRLTGIVSRGGSIMAKWCLSHHKENFLFEHFREICEICAAYDVSLSLGDGLRPGS
IQDANDEAQFSELHTLGELTKIAWEYDVQVMIEGPGHVPMHMIQRNMTEELESCHEAPFYTLGPLTTDIAPGYDHFTSGI
GAAMIGWFGCAMLCYVTPKEHLGLPNKEDVKQGLITYKIAAHAADLAKGHPGAQIRDNAMSKARFEFRWEDQFNLALDPF
TARAYHDETLPQESGKVAHFCSMCGPKFCSMKISQEVRDYAAAQTIEIGMADMSEDFRAKGGEIYLKREEA
>Q9WZP7 ~~~thiDN~~~Bifunctional thiamine biosynthesis protein ThiDN~~~COG0351
MVLVVAGFDPSGGAGIIQDVKVLSALGVKTHAVISALTVQNENRVFSVNFRDWEEMRKEIEVLTPPRVIKVGLSAPETVK
RLREMFPDSAIVWNVVLESSSGFGFQDPEEVKKFVEYADYVILNSEEAKKLGEYNNFIVTGGHEKGNTVKVKYRDFVFEI
PRVPGEFHGTGCAFSSAVSGFLAMSYPVEEAIRSAMELLKKILERSSGVVETEKLLRDWYRYDTLNTLDEILPEFLEIGH
LTVPEVGQNVSYALPWAKNEFEVGKFPGRIRLKEGKAVAVSCASFKDRSHTARMAVTMMRYHPHMRCVVNVRYEREYVER
AKKRGLKVFHYDRSKEPKEVQEKEGQSMVWMIEQAIAELKSPPDVIYDEGWWGKEAMIRVFGRNPKEVLEKIKLMVRE
>O31620 2.7.1.49~~~thiD~~~Hydroxymethylpyrimidine/phosphomethylpyrimidine kinase~~~COG0351
MSIYKALTIAGSDSGGGAGIQADIKTFQELDVFGMSAITAVTAQNTLGVHGVHPLTVETLRQQIDAVAEDLRPDAVKTGM
LWNADMIEEVARKIDEYGFNRVIVDPVMIAKGGASLLRDESVATLKELLIPRSYAITPNVPEAETLTGMTISSLDDRKKA
AEQLVKMGAQHVIIKGGHQPEDNHITDLLFDGSMFMQITHPYINTKHTHGTGCTFAAALTAQTAKGDSIHQAFEVAANFV
REAVENTLGIGSGHGPTNHFAFKRNSLNTSR
>P76422 2.7.1.49~~~thiD~~~Hydroxymethylpyrimidine/phosphomethylpyrimidine kinase~~~COG0351
MKRINALTIAGTDPSGGAGIQADLKTFSALGAYGCSVITALVAQNTRGVQSVYRIEPDFVAAQLDSVFSDVRIDTTKIGM
LAETDIVEAVAERLQRYQIQNVVLDTVMLAKSGDPLLSPSAVATLRSRLLPQVSLITPNLPEAAALLDAPHARTEQEMLE
QGRSLLAMGCGAVLMKGGHLDDEQSPDWLFTREGEQRFTAPRIMTKNTHGTGCTLSAALAALRPRHTNWADTVQEAKSWL
SSALAQADTLEVGHGIGPVHHFHAWW
>P9WG77 2.7.1.49~~~thiD~~~Hydroxymethylpyrimidine/phosphomethylpyrimidine kinase~~~COG0351
MNYLPLAPPGMTPPRVLSIAGSDSGGGAGIQADMRTMALLGVHACVAVTAVTVQNTLGVKDIHEVPNDVVAGQIEAVVTD
IGVQAAKTGMLASSRIVATVAATWRRLELSVPLVVDPVCASMHGDPLLAPSALDSLRGQLFPLATLLTPNLDEARLLVDI
EVVDAESQRAAAKALHALGPQWVLVKGGHLRSSDGSCDLLYDGVSCYQFDAQRLPTGDDHGGGDTLATAIAAALAHGFTV
PDAVDFGKRWVTECLRAAYPLGRGHGPVSPLFRLS
>P55882 2.7.1.49~~~thiD~~~Hydroxymethylpyrimidine/phosphomethylpyrimidine kinase~~~
MQRINALTIAGTDPSGGAGIQADLKTFSALGAYGCSVITALVAQNTCGVQSVYRIEPDFVAAQLDSVFSDVRIDTTKIGM
LAETDIVEAVAERLQRHHVRNVVLDTVMLAKSGDPLLSPSAIETLRVRLLPQVSLITPNLPEAAALLDAPHARTEQEMLA
QGRALLAMGCEAVLMKGGHLEDAQSPDWLFTREGEQRFSAPRVNTKNTHGTGCTLSAALAALRPRHRSWGETVNEAKAWL
SAALAQADTLEVGKGIGPVHHFHAWW
>P99124 2.7.1.49~~~thiD~~~Hydroxymethylpyrimidine/phosphomethylpyrimidine kinase~~~
MIKPKIALTIAGTDPTGGAGVMADLKSFHSCGVYGMGVVTSIVAQNTLGVQHIHNLNHQWVDEQLDSVFNDTLPHAIKTG
MIATADTMETIRHYLMQHESIPYVIDPVMLAKSGDSLMDNDTKQNLQHTLLPLADVVTPNLPEAEEITGLTIDSEEKIMQ
AGRIFINEIGSKGIIIKGGHSNDTDIAKDYLFTNEGVQTFENERFKTKHTHGTGCTFSAVITAELAKGRPLFEAVHKAKK
FISMSIQYTPEIGRGRGPVNHFAYLKKEGLDDELSK
>P39594 2.5.1.3~~~thiE~~~Thiamine-phosphate synthase~~~COG0352
MTRISREMMKELLSVYFIMGSNNTKADPVTVVQKALKGGATLYQFREKGGDALTGEARIKFAEKAQAACREAGVPFIVND
DVELALNLKADGIHIGQEDANAKEVRAAIGDMILGVSAHTMSEVKQAEEDGADYVGLGPIYPTETKKDTRAVQGVSLIEA
VRRQGISIPIVGIGGITIDNAAPVIQAGADGVSMISAISQAEDPESAARKFREEIQTYKTGR
>P30137 2.5.1.3~~~thiE~~~Thiamine-phosphate synthase~~~COG0352
MYQPDFPPVPFRSGLYPVVDSVQWIERLLDAGVRTLQLRIKDRRDEEVEADVVAAIALGRRYNARLFINDYWRLAIKHQA
YGVHLGQEDLQATDLNAIRAAGLRLGVSTHDDMEIDVALAARPSYIALGHVFPTQTKQMPSAPQGLEQLARHVERLADYP
TVAIGGISLARAPAVIATGVGSIAVVSAITQAADWRLATAQLLEIAGVGDE
>P71350 2.5.1.3~~~thiE~~~Thiamine-phosphate synthase~~~COG0352
MKNIQKILPLYFVAGTQDCRHLGENLSENLLFVLKQALEGGITCFQFRDKGKFSLEHTPSAQKALAINCRDLCREYGVPF
IVDDNVDLALEIEADGIHVGQSDMPVQEIRAKTDKPLIIGWSVNRLDEAKIGENLAEIDYFGIGPIFPTQSKENPKPTLG
MAFIQTLRNAGITKPLVAIGGVKLAHVKTLREFGADGVAVITAITHADNVQAATKALREASDEYAK
>P9WG75 2.5.1.3~~~thiE~~~Thiamine-phosphate synthase~~~COG0352
MHESRLASARLYLCTDARRERGDLAQFAEAALAGGVDIIQLRDKGSPGELRFGPLQARDELAACEILADAAHRYGALFAV
NDRADIARAAGADVLHLGQRDLPVNVARQILAPDTLIGRSTHDPDQVAAAAAGDADYFCVGPCWPTPTKPGRAAPGLGLV
RVAAELGGDDKPWFAIGGINAQRLPAVLDAGARRIVVVRAITSADDPRAAAEQLRSALTAAN
>P66919 2.5.1.3~~~thiE~~~Thiamine-phosphate synthase~~~
MFNQSYLNVYFICGTSDVPSHRTIHEVLEAALKAGITLFQFREKGESALKGNDKLVLAKELQHLCHQYDVPFIVNDDVSL
AKEINADGIHVGQDDAKVKEIAQYFTDKIIGLSISDLDEYAKSDLTHVDYIGVGPIYPTPSKHDAHIPVGPEMIATFKEM
NPQLPIVAIGGINTNNVAPIVEAGANGISVISAISKSENIEKTVNRFKDFFNN
>P30138 2.7.7.73~~~thiF~~~Sulfur carrier protein ThiS adenylyltransferase~~~COG0476
MNDRDFMRYSRQILLDDIALDGQQKLLDSQVLIIGLGGLGTPAALYLAGAGVGTLVLADDDDVHLSNLQRQILFTTEDID
RPKSQVSQQRLTQLNPDIQLTALQQRLTGEALKDAVARADVVLDCTDNMATRQEINAACVALNTPLITASAVGFGGQLMV
LTPPWEQGCYRCLWPDNQEPERNCRTAGVVGPVVGVMGTLQALEAIKLLSGIETPAGELRLFDGKSSQWRSLALRRASGC
PVCGGSNADPV
>O31618 2.8.1.10~~~thiG~~~Thiazole synthase~~~COG2022
MSMLTIGGKSFQSRLLLGTGKYPSFDIQKEAVAVSESDILTFAVRRMNIFEASQPNFLEQLDLSKYTLLPNTAGASTAEE
AVRIARLAKASGLCDMIKVEVIGCSRSLLPDPVETLKASEQLLEEGFIVLPYTSDDVVLARKLEELGVHAIMPGASPIGS
GQGILNPLNLSFIIEQAKVPVIVDAGIGSPKDAAYAMELGADGVLLNTAVSGADDPVKMARAMKLAVEAGRLSYEAGRIP
LKQYGTASSPGEGLPV
>P30139 2.8.1.10~~~thiG~~~Thiazole synthase~~~COG2022
MLRIADKTFDSHLFTGTGKFASSQLMVEAIRASGSQLVTLAMKRVDLRQHNDAILEPLIAAGVTLLPNTSGAKTAEEAIF
AAHLAREALGTNWLKLEIHPDARWLLPDPIETLKAAETLVQQGFVVLPYCGADPVLCKRLEEVGCAAVMPLGAPIGSNQG
LETRAMLEIIIQQATVPVVVDAGIGVPSHAAQALEMGADAVLVNTAIAVADDPVNMAKAFRLAVEAGLLARQSGPGSRSY
FAHATSPLTGFLEASA
>A0QQL0 2.8.1.10~~~thiG~~~Thiazole synthase~~~COG2022
MADSVLRIGGREFGSRLIMGTGGAPNLSVLEEALIASGTELTTVAMRRVDAETGTGVLDLLNRLGIAALPNTAGCRGAAE
AVLTAQLAREALGTDMVKLEVIADERTLLPDAVELVKAAEQLVDDGFTVLPYTNDDPVLARRLEDIGCAAVMPLGSPIGT
GLGISNPHNIEMIVAAAGVPVVLDAGIGTASDAALAMELGCDAVLLATAVTRASDPPTMAAAMASAVTAGHLARQAGRIP
KRFWAQASSPAL
>P9WG73 2.8.1.10~~~thiG~~~Thiazole synthase~~~COG2022
MAESKLVIGDRSFASRLIMGTGGATNLAVLEQALIASGTELTTVAIRRVDADGGTGLLDLLNRLGITPLPNTAGSRSAAE
AVLTAQLAREALNTNWVKLEVIADERTLWPDAVELVRAAEQLVDDGFVVLPYTTDDPVLARRLEDTGCAAVMPLGSPIGT
GLGIANPHNIEMIVAGARVPVVLDAGIGTASDAALAMELGCDAVLLASAVTRAADPPAMAAAMAAAVTAGYLARCAGRIP
KRFWAQASSPAR
>Q9I6B4 2.8.1.10~~~thiG~~~Thiazole synthase~~~
MSQASSTDTPFVIAGRTYGSRLLVGTGKYKDLDETRRAIEASGAEIVTVAVRRTNIGQNPDEPNLLDVIPPDRYTILPNT
AGCYDAVEAVRTCRLARELLDGHNLVKLEVLADQKTLFPNVVETLKAAEQLVKDGFDVMVYTSDDPIIARQLAEIGCIAV
MPLAGLIGSGLGICNPYNLRIILEEAKVPVLVDAGVGTASDAAIAMELGCEAVLMNTAIAHAKDPVMMAEAMKHAIVAGR
LAYLAGRMPRKLYASASSPLDGLID
>Q5SKG7 2.8.1.10~~~thiG~~~Thiazole synthase~~~COG2022
MDTWKVGPVELKSRLILGSGKYEDFGVMREAIAAAKAEVVTVSVRRVELKAPGHVGLLEALEGVRLLPNTAGARTAEEAV
RLARLGRLLTGERWVKLEVIPDPTYLLPDPLETLKAAERLIEEDFLVLPYMGPDLVLAKRLAALGTATVMPLAAPIGSGW
GVRTRALLELFAREKASLPPVVVDAGLGLPSHAAEVMELGLDAVLVNTAIAEAQDPPAMAEAFRLAVEAGRKAYLAGPMR
PREAASPSSPVEGVPFTPTGPRPGRGPQ
>P30140 4.1.99.19~~~thiH~~~2-iminoacetate synthase~~~COG0502
MKTFSDRWRQLDWDDIRLRINGKTAADVERALNASQLTRDDMMALLSPAASGYLEQLAQRAQRLTRQRFGNTVSFYVPLY
LSNLCANDCTYCGFSMSNRIKRKTLDEADIARESAAIREMGFEHLLLVTGEHQAKVGMDYFRRHLPALREQFSSLQMEVQ
PLAETEYAELKQLGLDGVMVYQETYHEATYARHHLKGKKQDFFWRLETPDRLGRAGIDKIGLGALIGLSDNWRVDSYMVA
EHLLWLQQHYWQSRYSVSFPRLRPCTGGIEPASIMDERQLVQTICAFRLLAPEIELSLSTRESPWFRDRVIPLAINNVSA
FSKTQPGGYADNHPELEQFSPHDDRRPEAVAAALTAQGLQPVWKDWDSYLGRASQRL
>Q9S498 4.1.99.19~~~thiH~~~2-iminoacetate synthase~~~
MKTFTDRWRQLEWDDIRLRINGKTAADVERALNAAHLSRDDLMALLSPAAADYLEPIAQRAQRLTRQRFGNTVSFYVPLY
LSNLCANDCTYCGFSMSNRIKRKTLDEVDIQRECDAIRKLGFEHLLLVTGEHQAKVGMDYFRRHLPTIRRQFSSLQMEVQ
PLSQENYAELKTLGIDGVMVYQETYHEAIYAQHHLKGKKQDFFWRLETPDRLGRAGIDKIGLGALIGLSDNWRVDCYMVA
EHLLWMQKHYWQSRYSVSFPRLRPCTGGVEPASVMDEKQLVQTICAFRLLAPEIELSLSTRESPWFRDHVIPLAINNVSA
FSKTQPGGYADDHPELEQFSPHDARRPETVASALSAQGLQPVWKDWDSWLGRASQTR
>Q81KU0 2.8.1.4~~~thiI~~~Probable tRNA sulfurtransferase~~~COG0301
MTYEYILVRYGEMTTKGKNRSKFVSTLKDNVKFKLKKFPNIKIDATHDRMYIQLNGEDHEAVSERLKDVFGIHKFNLAMK
VPSELEDIKKGALAAFLQVKGDVKTFKITVHRSYKHFPMRTMELLPEIGGHILENTEDITVDVHNPDVNVRVEIRSGYSY
IMCDERMGAGGLPVGVGGKVMVLLSGGIDSPVAAYLTMKRGVSVEAVHFHSPPFTSERAKQKVIDLAQELTKYCKRVTLH
LVPFTEVQKTINKEIPSSYSMTVMRRMMMRITERIAEERNALAITTGESLGQVASQTLDSMHTINEVTNYPVIRPLITMD
KLEIIKIAEEIGTYDISIRPYEDCCTVFTPASPATKPKREKANRFEAKYDFTPLIDEAVANKETMVLQTVEVVAEEEKFE
ELF
>Q8XE74 2.8.1.4~~~thiI~~~tRNA sulfurtransferase~~~COG0301
MKFIIKLFPEITIKSQSVRLRFIKILTGNIRNVLKHYDETLAVVRHWDNIEVRAKDENQRLTIRDALTRIPGIHHILEVE
DVPFTDMHDIFEKALVQYRDQLDGKTFCVRVKRRGKHDFSSIDVERYVGGGLNQHIESARVKLTNPDVTVHLEVEDDRLL
LIKGRYEGIGGFPIGTQEDVLSLISGGFDSGVSSYMLMRRGCRVHYCFFNLGGAAHEIGVRQVAHYLWNRFGSSHRVRFV
AINFEPVVGEILEKIDDGQMGVILKRMMVRAASKVAERYGVQALVTGEALGQVSSQTLTNLRLIDNVSDTLILRPLISYD
KEHIINLARQIGTEDFARTMPEYCGVISKSPTVKAVKSKIEAEEEKFDFSILDKVVEEANNVDIREIAQQTGQEVVEVET
VNDFGPNDVILDIRSVDEQEDKPLKVEGIDVVSLPFYKLSTKFGDLDQNRTWLLWCERGVMSRLQALYLREQGFKNVKVY
RP
>P77718 2.8.1.4~~~thiI~~~tRNA sulfurtransferase~~~COG0301
MKFIIKLFPEITIKSQSVRLRFIKILTGNIRNVLKHYDETLAVVRHWDNIEVRAKDENQRLAIRDALTRIPGIHHILEVE
DVPFTDMHDIFEKALVQYRDQLEGKTFCVRVKRRGKHDFSSIDVERYVGGGLNQHIESARVKLTNPDVTVHLEVEDDRLL
LIKGRYEGIGGFPIGTQEDVLSLISGGFDSGVSSYMLMRRGCRVHYCFFNLGGAAHEIGVRQVAHYLWNRFGSSHRVRFV
AINFEPVVGEILEKIDDGQMGVILKRMMVRAASKVAERYGVQALVTGEALGQVSSQTLTNLRLIDNVSDTLILRPLISYD
KEHIINLARQIGTEDFARTMPEYCGVISKSPTVKAVKSKIEAEEEKFDFSILDKVVEEANNVDIREIAQQTEQEVVEVET
VNGFGPNDVILDIRSIDEQEDKPLKVEGIDVVSLPFYKLSTKFGDLDQNKTWLLWCERGVMSRLQALYLREQGFNNVKVY
RP
>Q9X220 2.8.1.4~~~thiI~~~Probable tRNA sulfurtransferase~~~COG0301
MKELRVYIVRYSEIGLKGKNRKDFEEALRRNIERVTGMKVKRQWGRFLIPIDENVTLDDKLKKIFGIQNFSKGFLVSHDF
EEVKKYSLIAVKEKLEKGNYRTFKVQAKKAYKEYKKGVYEINSELGALILKNFKELSVDVRNPDFVLGVEVRPEGVLIFT
DRVECYGGLPVGTGGKAVLLLSGGIDSPVAGWYALKRGVLIESVTFVSPPFTSEGAVEKVRDILRVLREFSGGHPLRLHI
VNLTKLQLEVKKRVPDKYSLIMYRRSMFRIAEKIAEETGAVAFYTGENIGQVASQTLENLWSIESVTTRPVIRPLSGFDK
TEIVEKAKEIGTYEISIKPYQDSCVFFAPKNPATRSHPSILEKLEQQVPDLPVLEEEAFTSRKVEVIE
>P75948 2.7.1.89~~~thiK~~~Thiamine kinase~~~COG0510
MPFRSNNPITRDELLSRFFPQYHPVTTFNSGLSGGSFLIEHQGQRFVVRQPHDPDAPQSAFLRQYRALSQLPACIAPKPH
LYLRDWMVVDYLPGAVKTYLPDTNELAGLLYYLHQQPRFGWRITLLPLLELYWQQSDPARRTVGWLRMLKRLRKAREPRP
LRLSPLHMDVHAGNLVHSASGLKLIDWEYAGDGDIALELAAVWVENTEQHRQLVNDYATRAKIYPAQLWRQVRRWFPWLL
MLKAGWFEYRWRQTGDQQFIRLADDTWRQLLIKQ
>O67883 2.7.4.16~~~thiL~~~Thiamine-monophosphate kinase~~~COG0611
MRLKELGEFGLIDLIKKTLESKVIGDDTAPVEYCSKKLLLTTDVLNEGVHFLRSYIPEAVGWKAISVNVSDVIANGGLPK
WALISLNLPEDLEVSYVERFYIGVKRACEFYKCEVVGGNISKSEKIGISVFLVGETERFVGRDGARLGDSVFVSGTLGDS
RAGLELLLMEKEEYEPFELALIQRHLRPTARIDYVKHIQKYANASMDISDGLVADANHLAQRSGVKIEILSEKLPLSNEL
KMYCEKYGKNPIEYALFGGEDYQLLFTHPKERWNPFLDMTEIGRVEEGEGVFVDGKKVEPKGWKHF
>P14611 2.3.1.9~~~phaA~~~Acetyl-CoA acetyltransferase~~~COG0183
MTDVVIVSAARTAVGKFGGSLAKIPAPELGAVVIKAALERAGVKPEQVSEVIMGQVLTAGSGQNPARQAAIKAGLPAMVP
AMTINKVCGSGLKAVMLAANAIMAGDAEIVVAGGQENMSAAPHVLPGSRDGFRMGDAKLVDTMIVDGLWDVYNQYHMGIT
AENVAKEYGITREAQDEFAVGSQNKAEAAQKAGKFDEEIVPVLIPQRKGDPVAFKTDEFVRQGATLDSMSGLKPAFDKAG
TVTAANASGLNDGAAAVVVMSAAKAKELGLTPLATIKSYANAGVDPKVMGMGPVPASKRALSRAEWTPQDLDLMEINEAF
AAQALAVHQQMGWDTSKVNVNGGAIAIGHPIGASGCRILVTLLHEMKRRDAKKGLASLCIGGGMGVALAVERK
>P0AGG0 2.7.4.16~~~thiL~~~Thiamine-monophosphate kinase~~~COG0611
MACGEFSLIARYFDRVRSSRLDVELGIGDDCALLNIPEKQTLAISTDTLVAGNHFLPDIDPADLAYKALAVNLSDLAAMG
ADPAWLTLALTLPDVDEAWLESFSDSLFDLLNYYDMQLIGGDTTRGPLSMTLGIHGFVPMGRALTRSGAKPGDWIYVTGT
PGDSAAGLAILQNRLQVADAKDADYLIKRHLRPSPRILQGQALRDLANSAIDLSDGLISDLGHIVKASDCGARIDLALLP
FSDALSRHVEPEQALRWALSGGEDYELCFTVPELNRGALDVALGHLGVPFTCIGQMTADIEGLCFIRDGEPVTLDWKGYD
HFATP
>P9WG71 2.7.4.16~~~thiL~~~Thiamine-monophosphate kinase~~~COG0611
MTTKDHSLATESPTLQQLGEFAVIDRLVRGRRQPATVLLGPGDDAALVSAGDGRTVVSTDMLVQDSHFRLDWSTPQDVGR
KAIAQNAADIEAMGARATAFVVGFGAPAETPAAQASALVDGMWEEAGRIGAGIVGGDLVSCRQWVVSVTAIGDLDGRAPV
LRSGAKAGSVLAVVGELGRSAAGYALWCNGIEDFAELRRRHLVPQPPYGHGAAAAAVGAQAMIDVSDGLLADLRHIAEAS
GVRIDLSAAALAADRDALTAAATALGTDPWPWVLSGGEDHALVACFVGPVPAGWRTIGRVLDGPARVLVDGEEWTGYAGW
QSFGEPDNQGSLG
>P55881 2.7.4.16~~~thiL~~~Thiamine-monophosphate kinase~~~
MACGEFSLIARYFDRVRSSRLDVETGIGDDCALLNIPEKQTLAISTDTLVAGNHFLPDIDPADLAYKALAVNLSDLAAMG
ADPAWLTLALTLPEVDEPWLEAFSDSLFALLNYYDMQLIGGDTTRGPLSMTLGIHGYIPAGRALKRSGAKPGDWIYVTGT
PGDSAAGLAVLQNRLQVSEETDAHYLIQRHLRPTPRILHGQALRDIASAAIDLSDGLISDLGHIVKASGCGARVDVDALP
KSDAMMRHVDDGQALRWALSGGEDYELCFTVPELNRGALDVAIGQLGVPFTCIGQMSADIEGLNFVRDGMPVTFDWKGYD
HFATP
>P07097 2.3.1.9~~~phaA~~~Acetyl-CoA acetyltransferase~~~
MSTPSIVIASARTAVGSFNGAFANTPAHELGATVISAVLERAGVAAGEVNEVILGQVLPAGEGQNPARQAAMKAGVPQEA
TAWGMNQLCGSGLRAVALGMQQIATGDASIIVAGGMESMSMAPHCAHLAGGVKMGDFKMIDTMIKDGLTDAFYGYHMGTT
AENVAKQWQLSRDEQDAFAVASQNKAEAAQKDGRFKDEIVPFIVKGRKGDITVDADEYIRHGATLDSMAKLRPAFDKEGT
VTAGNASGLNDGAAAALLMSEAEASRRGIQPLGRIVSWATVGVDPKVMGTGPIPASRKALERAGWKIGDLDLVEANEAFA
AQACAVNKDLGWDPSIVNVNGGAIAIGHPIGASGARILNTLLFEMKRRGARKGLATLCIGGGMGVAMCIESL
>P39593 2.7.1.50~~~thiM~~~Hydroxyethylthiazole kinase~~~COG2145
MDAQSAAKCLTAVRRHSPLVHSITNNVVTNFTANGLLALGASPVMAYAKEEVADMAKIAGALVLNIGTLSKESVEAMIIA
GKSANEHGVPVILDPVGAGATPFRTESARDIIREVRLAAIRGNAAEIAHTVGVTDWLIKGVDAGEGGGDIIRLAQQAAQK
LNTVIAITGEVDVIADTSHVYTLHNGHKLLTKVTGAGCLLTSVVGAFCAVEENPLFAAIAAISSYGVAAQLAAQQTADKG
PGSFQIELLNKLSTVTEQDVQEWATIERVTVS
>P76423 2.7.1.50~~~thiM~~~Hydroxyethylthiazole kinase~~~COG2145
MQVDLLGSAQSAHALHLFHQHSPLVHCMTNDVVQTFTANTLLALGASPAMVIETEEASQFAAIASALLINVGTLTQPRAQ
AMRAAVEQAKSSQTPWTLDPVAVGALDYRRHFCHELLSFKPAAIRGNASEIMALAGIANGGRGVDTTDAAANAIPAAQTL
ARETGAIVVVTGEMDYVTDGHRIIGIHGGDPLMTKVVGTGCALSAVVAACCALPGDTLENVASACHWMKQAGERAVARSE
GPGSFVPHFLDALWQLTQEVQA
>Q830K4 2.7.1.50~~~thiM~~~Hydroxyethylthiazole kinase~~~COG2145
MKTSVKFETIFPLTTAPLIQCITNEITCESMANALLYIDAKPIMADDPREFPQMFQQTSALVLNLGHLSQEREQSLLAAS
DYARQVNKLTVVDLVGYGASDIRNEVGEKLVHNQPTVVKGNLSEMRTFCQLVSHGRGVDGSPLDQSEEAIEELIQALRQQ
TQKFPQTVFLATGIQDVLVSQEQVIVLQNGVPELDCFTGTGDLVGALVAALLGEGNAPMTAAVAAVSYFNLCGEKAKTKS
QGLADFRQNTLNQLSLLMKEKDWFEAVKGRVL
>A6TBJ8 2.7.1.50~~~thiM~~~Hydroxyethylthiazole kinase~~~
MPELLNPAPVAHLRHLLRAHSPLVHCMTNDVVQTFTANVLLAVGASPAMVIDPREAAQFAAIADALLINVGTLTEDRAVA
MRAAVEHARQAGKPWTLDPVAVGALTVRTAFCHELLALQPAAIRGNASEILALAGMSAGGRGVDTTDTAAAALPAAQALA
RRLATVVAVTGEVDYVTDGERVLSVAGGNPLMTRVVGTGCALSAVVAASAALPGDRLENVAAACGLMKQAGAIAARQGGP
GSFIPAFLDALYQEVQG
>P66923 2.7.1.50~~~thiM~~~Hydroxyethylthiazole kinase~~~
MNYLNKIRIENPLTICYTNDVVKNFTANGLLSIGASPAMSEAPEEAEEFYKVAQALLINIGTLTAQNEQDIIAIAQTANE
AGLPIVFDPVAVGASTYRKQFCKLLLKSAKVSVIKGNASEILALIDDTATMKGTDSDANLDAVAIAKKAYAIYKTAIVIT
GKEDVIVQDNKAIVLANGSPLLARVTGAGCLLGGVIAGFLFRETEPDIEALIEAVSVFNIAAEVAAENENCGGPGTFSPL
LLDTLYHLNETTYQQRIRIQEVE
>Q6GEY3 2.7.1.50~~~thiM~~~Hydroxyethylthiazole kinase~~~
MNYLNNIRIENPLTICYTNDVVKNFTANGLLSIGASPAMSEAPEEAEEFYKVAQALLINIGTLTAQNEQDIIAIAQTANE
AGLPIVFDPVAVGASTYRKQFCKLLLKSAKVSVIKGNASEILALIDDTATMKGTDSDANLDAVTIAKKAYAIYKTAIVIT
GKEDVIVQGDKAIVLANGSPLLARVTGAGCLLGGIIAGFLFRETEPDIEALIEAVSVFNIAAEVAAENENCGGPGTFSPL
LLDTLYHLNETTYQQRIRIQEVE
>O34664 2.7.6.2~~~thiN~~~Thiamine pyrophosphokinase~~~COG1564
MKTINIVAGGPKNLIPDLTGYTDEHTLWIGVDKGTVTLLDAGIIPVEAFGDFDSITEQERRRIEKAAPALHVYQAEKDQT
DLDLALDWALEKQPDIIQIFGITGGRADHFLGNIQLLYKGVKTNIKIRLIDKQNHIQMFPPGEYDIEKDENKRYISFIPF
SEDIHELTLTGFKYPLNNCHITLGSTLCISNELIHSRGTFSFAKGILIMIRSTD
>P00275 ~~~~~~Thioredoxin C-1~~~
ATVKVDNSNFQSDVLQSSEPVVVDFWAEWCGPCKMIAPALDEIATEMAGQVKIAKVNIDENPELAAQFGVRSIPTLLMFK
DGELAANMVGAAPKSRLADWIKASA
>P0A4L2 ~~~trxA~~~Thioredoxin 1~~~
MSAAAQVTDSTFKQEVLDSDVPVLVDFWAPWCGPCRMVAPVVDEIAQQYEGKIKVVKVNTDENPQVASQYGIRSIPTLMI
FKGGQKVDMVVGAVPKTTLSQTLEKHL
>P52230 ~~~trxA~~~Thioredoxin 1~~~COG3118
MAGTLKHVTDDSFEQDVLKNDKPVLVDFWAAWCGPCRQIAPSLEAIAAEYGDKIEIVKLNIDENPGTAAKYGVMSIPTLN
VYQGGEVAKTIVGAKPKAAIVRDLEDFIAD
>P52232 ~~~~~~Thioredoxin-like protein slr0233~~~COG3118
MAVKKQFANFAEMLAGSPKPVLVDFYATWCGPCQMMAPILEQVGSHLRQQIQVVKIDTDKYPAIATQYQIQSLPTLVLFK
QGQPVHRMEGVQQAAQLIQQLQVFV
>P07887 ~~~~~~Thioredoxin C-2~~~
MSATIVNTTDENFQADVLDAETPVLVDFWAGWCAPCKAIAPVLEELSNEYAGKVKIVKVDVTSCEDTAVKYNIRNIPALL
MFKDGEVVAQQVGAAPRSKLAAFIDQNI
>P0AGG4 1.8.1.8~~~trxC~~~Thioredoxin 2~~~COG3118
MNTVCTHCQAINRIPDDRIEDAAKCGRCGHDLFDGEVINATGETLDKLLKDDLPVVIDFWAPWCGPCRNFAPIFEDVAQE
RSGKVRFVKVNTEAERELSSRFGIRSIPTIMIFKNGQVVDMLNGAVPKAPFDSWLNESL
>P20857 ~~~trxB~~~Thioredoxin 2~~~COG3118
MSKGVITITDAEFESEVLKAEQPVLVYFWASWCGPCQLMSPLINLAANTYSDRLKVVKLEIDPNPTTVKKYKVEGVPALR
LVKGEQILDSTEGVISKDKLLSFLDTHLNNN
>Q9RD25 ~~~trxC~~~Putative thioredoxin 2~~~COG0526
MTSTVELTKENFDQTVTDNEFVLIDFWAEWCGPCKQFGPVYEKAAEANPDLVFGKVDTEAQPELAQAFGISSIPTLMIVR
EQVAVFAQPGALPEAALTDVIGQARKLDMDEVRKAVAEQQAQAGQNGQEGQEGQ
>P73263 ~~~~~~Thioredoxin-like protein slr1139~~~COG3118
MSLLEITDAEFEQETQGQTKPVLVYFWASWCGPCRLMAPAIQAIAKDYGDKLKVLKLEVDPNPAAVAQCKVEGVPALRLF
KNNELVMTHEGAIAKPKLLELLKEELDFI
>P81109 ~~~trxA~~~Thioredoxin~~~COG3118
MFELDKDTFETEVLQGTGYVLVDFWSEGCEPCKALMPDIQEMEKTYGEQVRFTKLDTTKARRLAIKEKVLGLPTIAIYKD
GQKIDELTKEDATAANVEAMVKKYI
>P80579 ~~~trxA~~~Thioredoxin~~~
ATMTLTDANFQQAIQGDKPVLVDFWAAWCGPCRMMAPVLEEFAEAHADKVTVAKLNVDENPETTSQFGIMSIPTLILFKG
GRPVKQLIGYQPKEQLEAQLADVLQ
>P09857 ~~~trxA~~~Thioredoxin~~~
SDSIVHVTDDSFEEEVXKSPDPVLVDYWADWCGPCKMXAPVXDEIADEYAGRVKXAKXNXDENPNTPPRYGXRGIPTLML
FRGGEVEATKVGAVSKSQLTAFLDSNX
>P14949 ~~~trxA~~~Thioredoxin~~~COG3118
MAIVKATDQSFSAETSEGVVLADFWAPWCGPCKMIAPVLEELDQEMGDKLKIVKIDVDENQETAGKYGVMSIPTLLVLKD
GEVVETSVGFKPKEALQELVNKHL
>P08058 ~~~trxA~~~Thioredoxin~~~
MSTVPVTDATFDTEVRKSDVPVVVDFWAEWCGPCRQIGPALEELSKEYAGKVKIVKVNVDENPESPAMLGVRGIPALFLF
KNGQVVSNKVGAAPKAALATWIASAL
>Q7M1B9 ~~~trxA~~~Thioredoxin~~~COG3118
MAKPIEVHDSDFAEKVLKSKTPVVVDFWAPWCGPCRVIAPILDKLAGEYAGRLTIAKVNTDDNVQYASQLGIQGIPTLVI
FKDGREVGRLVGARPEAMYREIFDKVLAMA
>P10472 ~~~trxA~~~Thioredoxin~~~
AGKYFEATDKNFQTEILDSDKAVLVDFWASWCGPCMMLGPVIEQLADDYEGKAIIAKLNVDENPNIAGQYGIRSIPTMLI
IKGGKVVDQMVGALPKNMIAKKIDEHIG
>P0AA27 ~~~trxA~~~Thioredoxin 1~~~COG3118
MSDKIIHLTDDSFDTDVLKADGAILVDFWAEWCGPCKMIAPILDEIADEYQGKLTVAKLNIDQNPGTAPKYGIRGIPTLL
LFKNGEVAATKVGALSKGQLKEFLDANLA
>P0AA25 ~~~trxA~~~Thioredoxin 1~~~COG3118
MSDKIIHLTDDSFDTDVLKADGAILVDFWAEWCGPCKMIAPILDEIADEYQGKLTVAKLNIDQNPGTAPKYGIRGIPTLL
LFKNGEVAATKVGALSKGQLKEFLDANLA
>P66928 ~~~trxA~~~Thioredoxin~~~COG3118
MSHYIELTEENFESTIKKGVALVDFWAPWCGPCKMLSPVIDELASEYEGKAKICKVNTDEQEELSAKFGIRSIPTLLFTK
DGEVVHQLVGVQTKVALKEQLNKLLG
>O30974 ~~~trxA~~~Thioredoxin~~~
MSEDSATVAVTDDSFSTDVLGSSKPVLVDFWATWCGPCKMVAPVLEEIAAEKGDQLTVAKIDVDVDANPATARDFQVVSI
PTMILFKDGAPVKRIVGAKGKAALLRELSDAL
>P9WG67 ~~~trxA~~~Thioredoxin~~~COG3118
MTDSEKSATIKVTDASFATDVLSSNKPVLVDFWATWCGPCKMVAPVLEEIATERATDLTVAKLDVDTNPETARNFQVVSI
PTLILFKDGQPVKRIVGAKGKAALLRELSDVVPNLN
>P21610 ~~~trxA~~~Thioredoxin~~~
MSALLVEIDKDQFQAEVLEAEGYVLVDYFSDGCVPCKALMPDVEELAAKYEGKVAFRKFNTSSARRLAISQKILGLPTIT
LYKGGQKVEEVTKDDATRENIDAMIAKHVG
>P21609 ~~~trxA~~~Thioredoxin~~~
MLMLDKDTFKTEVLEGTGYVLVDYFSDGCVPCKALMPAVEELSKKYEGRVVFAKLNTTGARRLAISQKILGLPTLSLYKD
GVKVDEVTKDDATIENIEAMVEEHISK
>P10473 ~~~trxA~~~Thioredoxin~~~
MKQVSDASFEEDVLKADGPNXVDFWAEWCGPCRQXAPALEELATALGDKVTVAKINIDENPQTPSKYGVRGIPTLMIFKD
GQVAATKIGALPKTKLFEWVEASV
>Q9ZEE0 ~~~trxA~~~Thioredoxin~~~COG3118
MVNNVTDSSFKNEVLESDLPVMVDFWAEWCGPCKMLIPIIDEISKELQDKVKVLKMNIDENPKTPSEYGIRSIPTIMLFK
NGEQKDTKIGLQQKNSLLDWINKSI
>P0AA29 ~~~trxA~~~Thioredoxin 1~~~COG3118
MSDKIIHLTDDSFDTDVLKADGAILVDFWAEWCGPCKMIAPILDEIADEYQGKLTVAKLNIDQNPGTAPKYGIRGIPTLL
LFKNGEVAATKVGALSKGQLKEFLDANLA
>P0AA28 ~~~trxA~~~Thioredoxin 1~~~
MSDKIIHLTDDSFDTDVLKADGAILVDFWAEWCGPCKMIAPILDEIADEYQGKLTVAKLNIDQNPGTAPKYGIRGIPTLL
LFKNGEVAATKVGALSKGQLKEFLDANLA
>Q2FZD2 ~~~trxA~~~Thioredoxin~~~COG3118
MAIVKVTDADFDSKVESGVQLVDFWATWCGPCKMIAPVLEELAADYEGKADILKLDVDENPSTAAKYEVMSIPTLIVFKD
GQPVDKVVGFQPKENLAEVLDKHL
>P99122 ~~~trxA~~~Thioredoxin~~~
MAIVKVTDADFDSKVESGVQLVDFWATWCGPCKMIAPVLEELAADYEGKADILKLDVDENPSTAAKYEVMSIPTLIVFKD
GQPVDKVVGFQPKENLAEVLDKHL
>P0A0K6 ~~~trxA~~~Thioredoxin~~~
MAIVKVTDADFDSKVESGVQLVDFWATWCGPCKMIAPVLEELAADYEGKADILKLDVDENPSTAAKYEVMSIPTLIVFKD
GQPVDKVVGFQPKENLAEVLDKHL
>Q05739 ~~~trxA~~~Thioredoxin~~~COG3118
MAGVLKNVTDDTFEADVLKSEKPVLVDFWAEWCGPCRQIAPSLEAITEHGGQIEIVKLNIDQNPATAAKYGVMSIPTLNV
YQGGEVVKTIVGAKPKAALLRPGPVPR
>P52231 ~~~trxA~~~Thioredoxin~~~COG3118
MSATPQVSDASFKEDVLDSELPVLVDFWAPWCGPCRMVAPVVDEISQQYEGKVKVVKLNTDENPNTASQYGIRSIPTLMI
FKGGQRVDMVVGAVPKTTLASTLEKYL
>P31549 ~~~thiP~~~Thiamine transport system permease protein ThiP~~~COG1178
MATRRQPLIPGWLIPGVSATTLVVAVALAAFLALWWNAPQDDWVAVWQDSYLWHVVRFSFWQAFLSALLSVIPAIFLARA
LYRRRFPGRLALLRLCAMTLILPVLVAVFGILSVYGRQGWLATLCQSLGLEWTFSPYGLQGILLAHVFFNLPMASRLLLQ
ALENIPGEQRQLAAQLGMRSWHFFRFVEWPWLRRQIPPVAALIFMLCFASFATVLSLGGGPQATTIELAIYQALSYDYDP
ARAAMLALLQMVCCLGLVLLSQRLSKAIAPGTTLLQGWRDPDDRLHSRICDTVLIVLALLLLLPPLLAVIVDGVNRQLPE
VLAQPVLWQALWTSLRIALAAGVLCVVLTMMLLWSSRELRARQKMLAGQVLEMSGMLILAMPGIVLATGFFLLLNNTIGL
PQSADGIVIFTNALMAIPYALKVLENPMRDITARYSMLCQSLGIEGWSRLKVVELRALKRPLAQALAFACVLSIGDFGVV
ALFGNDDFRTLPFYLYQQIGSYRSQDGAVTALILLLLCFLLFTVIEKLPGRNVKTD
>Q8ZRV1 ~~~thiP~~~Thiamine transport system permease protein ThiP~~~
MATRRQPLIPGWLIPGLCAAALMITVSLAAFLALWLNAPSGAWSTIWRDSYLWHVVRFSFWQAFLSAVLSVVPAVFLARA
LYRRRFPGRLALLRLCAMTLILPVLVAVFGILSVYGRQGWLASLWQMLGLQWTFSPYGLQGILLAHVFFNLPMASRLLLQ
SLESIPGEQRQLAAQLGMRGWHFFRFVEWPWLRRQIPPVAALIFMLCFASFATVLSLGGGPQATTIELAIFQALSYDYDP
ARAAMLALIQMVCCLALVLLSQRLSKAIAPGMTLTQGWRDPDDRLHSRLTDALLIVLALLLLLPPLVAVVVDGVNRSLPE
VLAQPILWQAVWTSLRIALAAGVLCVVLTMMLLWSSRELRQRQQLFAGQTLELSGMLILAMPGIVLATGFFLLLNNSVGL
PESADGIVIFTNALMAIPYALKVLENPMRDITARYGMLCQSLGIEGWSRLKIVELRALKRPLAQALAFACVLSIGDFGVV
ALFGNDNFRTLPFYLYQQIGSYRSQDGAVTALILLLLCFTLFTLIEKLPGRHAKTD
>P31548 7.6.2.15~~~thiQ~~~Thiamine import ATP-binding protein ThiQ~~~COG3840
MLKLTDITWLYHHLPMRFSLTVERGEQVAILGPSGAGKSTLLNLIAGFLTPASGSLTIDGVDHTTMPPSRRPVSMLFQEN
NLFSHLTVAQNIGLGLNPGLKLNAVQQGKMHAIARQMGIDNLMARLPGELSGGQRQRVALARCLVREQPILLLDEPFSAL
DPALRQEMLTLVSTSCQQQKMTLLMVSHSVEDAARIATRSVVVADGRIAWQGMTNELLSGKASASALLGITG
>P44986 7.6.2.15~~~thiQ~~~Thiamine import ATP-binding protein ThiQ~~~COG3840
MIYLNNVILNDKTLPMCFNLSVNAGERVAIIGESGAGKSTLLNLIAGFEFPAQGEIWLNDKNHTRSAPYERPVSMLFQEN
NLFPHLTVQQNLALGIKPSLKLTALEQEKIEQVACSVGLGDYLERLPNSLSGGQKQRVALARCLLRDKPILLLDEPFSAL
DQKLRVEMLALIAKLCDEKDLTLLLVTHQPSELIGSIDQVLVVENGQISQLQKGV
>Q8ZRV2 7.6.2.15~~~thiQ~~~Thiamine import ATP-binding protein ThiQ~~~
MLKLIDITWLYHHLPMRFTLAVERGEQVAILGPSGAGKSTLLNLIAGFLAPASGTLLIAGDDHTLTPPSRRPVSMLFQEN
NLFSHLNVQQNIGLGLNPGLTLNASQREKRDAIAHQMGIESLMTRLPGELSGGQRQRVALARCLVREQPVLLLDEPFSAL
DPALRQEMLTLVSDICRERQLTLLMVSHSVEDAARIAPRSIVVADGRIAWQGKTDELLSGQASASALLGIKSHIL
>O31617 ~~~thiS~~~Sulfur carrier protein ThiS~~~COG2104
MLQLNGKDVKWKKDTGTIQDLLASYQLENKIVIVERNKEIIGKERYHEVELCDRDVIEIVHFVGGG
>O32583 ~~~thiS~~~Sulfur carrier protein ThiS~~~COG2104
MQILFNDQAMQCAAGQTVHELLEQLDQRQAGAALAINQQIVPREQWAQHIVQDGDQILLFQVIAGG
>Q72KL7 ~~~thiS~~~Sulfur carrier protein ThiS~~~COG2104
MVWLNGEPRPLEGKTLKEVLEEMGVELKGVAVLLNEEAFLGLEVPDRPLRDGDVVEVVALMQGG
>O32074 ~~~thiT~~~Thiamine transporter ThiT~~~COG3859
MNQSKQLVRLIEIAIMTAAAVILDIVSGMFLSMPQGGSVSIMMIPIFLISFRWGVKAGLTTGLLTGLVQIAIGNLFAQHP
VQLLLDYIVAFAAIGISGCFASSVRKAAVSKTKGKLIVSVVSAVFIGSLLRYAAHVISGAVFFGSFAPKGTPVWIYSLTY
NATYMVPSFIICAIVLCLLFMTAPRLLKSDKA
>A2RI47 ~~~thiT~~~Thiamine transporter ThiT~~~COG3859
MSNSKFNVRLLTEIAFMAALAFIISLIPNTVYGWIIVEIACIPILLLSLRRGLTAGLVGGLIWGILSMITGHAYILSLSQ
AFLEYLVAPVSLGIAGLFRQKTAPLKLAPVLLGTFVAVLLKYFFHFIAGIIFWSQYAWKGWGAVAYSLAVNGISGILTAI
AAFVILIIFVKKFPKLFIHSNY
>Q037U3 ~~~thiT~~~Thiamine transporter ThiT~~~
MQQHKQLVVILETAIIAAFAMALTYIPHTTGVSAIELNYGLIPIAVLAMRRGLVPAAWAGFVWGILDLILRGIGGGSVLN
PLQGILEYPIAFTLVGLMGLTFASFQKAVRGSEKVKASGYAFAGIIIGTFAKYFIHFIAGVVFWGAYAPKGTNVWVYSLI
VNGGSALFSTVLTIVVVGVLLTVAPQLFVAKDGKSFSTKAA
>P42461 ~~~thiX~~~Thiamine biosynthesis protein X~~~
MSISRTVFGIAATAALSAALVACSPPHQQDSPVQRTNEILTTSQNPTSASSTSTSSATTTSSAPVEEDVEIVVSPAALVD
GEQVTFEISGLDPEGGYYAAICDSVANPGNPVPSCTGEMADFTSQAWLSNSQPGATVEIAEDGTATVELEATATGTGLDC
TTQACVAKVFGDHTEGFRDVAEVPVTFAAA
>Q9K9G5 ~~~thiY~~~Formylaminopyrimidine-binding protein~~~COG0715
MKSFKIISLLLAILFLASCQTNTASDDEALETVEVMLDWYPNAVHTFLYVAIENGYFAEEGLDVDIVFPTNPTDPIQLTA
SGAIPLALSYQPDVILARSKDLPVVSVASVVRSPLNHVMFLAEQDFDSPADLVGLTVGYPGIPVNEPILKTMVEAAGGDY
EQVHLMDVGFELGASIVSGRADAVVGTYINHEYPVLKHEGHDISYFNPVDYGVPEYDELVLISNEAYVEESGEVLAAFWR
AALKGYEWMVENPDEALNVLLTNQDEANFPLIQEVEEESLSILLEKMENPNGPFGGQDAESWEEVISWLDAHDWLEQPVV
AEDAFSSITD
>P45359 2.3.1.9~~~thlA~~~Acetyl-CoA acetyltransferase~~~COG0183
MKEVVIASAVRTAIGSYGKSLKDVPAVDLGATAIKEAVKKAGIKPEDVNEVILGNVLQAGLGQNPARQASFKAGLPVEIP
AMTINKVCGSGLRTVSLAAQIIKAGDADVIIAGGMENMSRAPYLANNARWGYRMGNAKFVDEMITDGLWDAFNDYHMGIT
AENIAERWNISREEQDEFALASQKKAEEAIKSGQFKDEIVPVVIKGRKGETVVDTDEHPRFGSTIEGLAKLKPAFKKDGT
VTAGNASGLNDCAAVLVIMSAEKAKELGVKPLAKIVSYGSAGVDPAIMGYGPFYATKAAIEKAGWTVDELDLIESNEAFA
AQSLAVAKDLKFDMNKVNVNGGAIALGHPIGASGARILVTLVHAMQKRDAKKGLATLCIGGGQGTAILLEKC
>Q18AR0 2.3.1.9~~~thlA~~~Acetyl-CoA acetyltransferase~~~COG0183
MREVVIASAARTAVGSFGGAFKSVSAVELGVTAAKEAIKRANITPDMIDESLLGGVLTAGLGQNIARQIALGAGIPVEKP
AMTINIVCGSGLRSVSMASQLIALGDADIMLVGGAENMSMSPYLVPSARYGARMGDAAFVDSMIKDGLSDIFNNYHMGIT
AENIAEQWNITREEQDELALASQNKAEKAQAEGKFDEEIVPVVIKGRKGDTVVDKDEYIKPGTTMEKLAKLRPAFKKDGT
VTAGNASGINDGAAMLVVMAKEKAEELGIEPLATIVSYGTAGVDPKIMGYGPVPATKKALEAANMTIEDIDLVEANEAFA
AQSVAVIRDLNIDMNKVNVNGGAIAIGHPIGCSGARILTTLLYEMKRRDAKTGLATLCIGGGMGTTLIVKR
>Q7A7L2 2.3.1.9~~~~~~Probable acetyl-CoA acyltransferase~~~
MTRVVLAAAYRTPIGVFGGAFKDVPAYDLGATLIEHIIKETGLNPSEIDEVIIGNVLQAGQGQNPARIAAMKGGLPETVP
AFTVNKVCGSGLKSIQLAYQSIVTGENDIVLAGGMENMSQSPMLVNNSRFGFKMGHQSMVDSMVYDGLTDVFNQYHMGIT
AENLVEQYGISREEQDTFAVNSQHKAVRAQQNGEFDSEIVPVSIPQRKGEPILVTKDEGVRENVSVEKLSRLRPAFKKDG
TVTAGNASGINDGAAMMLVMSEDKAKELNIEPLAVLDGFGSHGVDPSIMGIAPVGAVEKALKRSKKELSDIDVFELNEAF
AAQLLAVDRELKLPPEKVNVKGGAIALGHPIGASGARVLVTLLHQLNDEVETGLTSLCIGGGQAIAAVVSKYK
>Q0AVM3 2.3.1.9~~~~~~Acetyl-CoA acetyltransferase~~~COG0183
MTREVVLVGACRTPVGTFGGTLKDVGSAQLGAIVMGEAIKRAGIKAEQIDEVIFGCVLQAGLGQNVARQCMINAGIPKEV
TAFTINKVCGSGLRAVSLAAQVIKAGDADIIMAGGTENMDKAPFILPNARWGYRMSMPKGDLIDEMVWGGLTDVFNGYHM
GITAENINDMYGITREEQDAFGFRSQTLAAQAIESGRFKDEIVPVVIKGKKGDIVFDTDEHPRKSTPEAMAKLAPAFKKG
GSVTAGNASGINDAAAAVIVMSKEKADELGIKPMAKVVSYASGGVDPSVMGLGPIPASRKALEKAGLTIDDIDLIEANEA
FAAQSIAVARDLGWADKMEKVNVNGGAIAIGHPIGSSGARILVTLLYEMQKRGSKKGLATLCIGGGMGTALIVEAL
>P45855 2.3.1.9~~~mmgA~~~Acetyl-CoA acetyltransferase~~~COG0183
MRKTVIVSAARTPFGKFGGVLKEVKAAELGGIVMKEALQQAGVSGDDVEGNVMGMVVQAGSGQIPSRQAARLAGMPWSVP
SETLNKVCASGLRAVTLCDQMIRAQDADILVAGGMESMSNIPYAVPAGRWGARMGDGELRDLMVYDGLTCAFDEVHMAVH
GNTAAKEYAISRREQDEWALRSHARAAKAADEGKFQDEIVPVNWIGRKGKPNVVDKDEAIRRDTSLDQLAKLAPIYASDG
SITAGNAPGVNDGAGAFVLMSEEKAAELGKRPLATILGFSTTGMPAHELAAAPGFAINKLLKKNGLTVQDIDLFEVNEAF
ASVVLTCEKIVGFDLEKVNVNGGAIALGHPIGASGARILMTLVYELKRRGGGLGVAAICSGAAQGDAVLVQVH
>A0QYB5 ~~~thpA~~~D-threitol-binding protein~~~COG1879
MRLGTTAFAIASATALGLGLTACGAGDPAANSDTTRIGVTVYDMSSFITAGKEGMDAYAKDNNIELIWNSANLDVSTQAS
QVDSMINQGVDAIIVVPVQADSLAPQVASAKAKGIPLVPVNAALDSKDIAGNVQPDDVAAGAQEMQMMADRLGGKGNIVI
LQGPLGQSGELDRSKGIEQVLAKYPDIKVLAKDTANWKRDEAVNKMKNWISGFGPQIDGVVAQNDDMGLGALQALKESGR
TGVPIVGIDGIEDGLNAVKSGDFIGTSLQNGTVELAAGLAVANRLAKGEPVNKEPVYIMPAITKDNVDVAIEHVVTERQQ
FLDGLTELINKNLETGDIAYEGIPGQKQP
>O34570 3.1.4.58~~~ytlP~~~RNA 2',3'-cyclic phosphodiesterase~~~COG1514
MPDIRPHYFIGVPIPEGIANPIYQAAKNEPILTFQKWVHPLDYHITLIFLGAADETQIKKLEGSLAEIASEIDPFSIKFG
KIDVFGDRRKPRVLHLEPKKNKTLDRLREHTKQAVLQAGFQVEKRPYHPHMTLARKWTGEDGFPAHVPFESGEVSMMAER
FSLFQIHLNQSPKYEEIFKFQLS
>P37025 3.1.4.58~~~thpR~~~RNA 2',3'-cyclic phosphodiesterase~~~COG1514
MSEPQRLFFAIDLPAEIREQIIHWRATHFPPEAGRPVAADNLHLTLAFLGEVSAEKEKALSLLAGRIRQPGFTLTLDDAG
QWLRSRVVWLGMRQPPRGLIQLANMLRSQAARSGCFQSNRPFHPHITLLRDASEAVTIPPPGFNWSYAVTEFTLYASSFA
RGRTRYTPLKRWALTQ
>Q5SHB1 3.1.4.58~~~~~~RNA 2',3'-cyclic phosphodiesterase~~~COG1514
MRLFYAVFLPEEVRAALVEAQTKVRPFRGWKPVPPHQLHLTLLFLGERPEEELPDYLALGHRLARLEAPFRARLRGTGYF
PNEGTPRVWFAKAEAEGFLRLAEGLRAGVEELLGEEAVRIPGWDKPFKPHITLARRKAPAPRVPPVLFGLEWPVEGFALV
RSELKPKGPVYTVLEKFSLRGEHGREQAQGPGERPEGD
>P23669 4.2.3.1~~~thrC~~~Threonine synthase~~~COG0498
MDYISTRDASRTPARFSDILLGGLAPDGGLYLPATYPQLDDAQLSKWREVLANEGYAALAAEVISLFVDDIPVEDIKAIT
ARAYTYPKFNSEDIVPVTELEDNIYLGHLSEGPTAAFKDMAMQLLGELFEYELRRRNETINILGATSGDTGSSAEYAMRG
REGIRVFMLTPAGRMTPFQQAQMFGLDDPNIFNIALDGVFDDCQDVVKAVSADAEFKKDNRIGAVNSINWARLMAQVVYY
VSSWIRTTTSNDQKVSFSVPTGNFGDICAGHIARQMGLPIDRLIVATNENDVLDEFFRTGDYRVRSSADTHETSSPSMDI
SRASNFERFIFDLLGRDATRVNDLFGTQVRQGGFSLADDANFEKAAAEYGFASGRSTHADRVATIADVHSRLDVLIDPHT
ADGVHVARQWRDEVNTPIIVLETALPVKFADTIVEAIGEAPQTPERFAAIMDAPFKVSDLPNDTDAVKQYIVDAIANTSV
K
>P00934 4.2.3.1~~~thrC~~~Threonine synthase~~~COG0498
MKLYNLKDHNEQVSFAQAVTQGLGKNQGLFFPHDLPEFSLTEIDEMLKLDFVTRSAKILSAFIGDEIPQEILEERVRAAF
AFPAPVANVESDVGCLELFHGPTLAFKDFGGRFMAQMLTHIAGDKPVTILTATSGDTGAAVAHAFYGLPNVKVVILYPRG
KISPLQEKLFCTLGGNIETVAIDGDFDACQALVKQAFDDEELKVALGLNSANSINISRLLAQICYYFEAVAQLPQETRNQ
LVVSVPSGNFGDLTAGLLAKSLGLPVKRFIAATNVNDTVPRFLHDGQWSPKATQATLSNAMDVSQPNNWPRVEELFRRKI
WQLKELGYAAVDDETTQQTMRELKELGYTSEPHAAVAYRALRDQLNPGEYGLFLGTAHPAKFKESVEAILGETLDLPKEL
AERADLPLLSHNLPADFAALRKLMMNHQ
>A0R220 4.2.3.1~~~thrC~~~Threonine synthase~~~COG0498
MSAAKAAVHQPWPGLIEAYRDRLPIGDDWTTVTLLEGGTPLIHAKRISELTGCTVHLKVEGLNPTGSFKDRGMTVAVTES
LARGQQAVLCASTGNTSASAAAYAARAGITCAVLIPQGKIAMGKLAQAVMHGAKIIQVDGNFDDCLELARKLTADFPTIA
LVNSVNPYRIEGQKTAAFEIVDALGTAPDVHALPVGNAGNITAYWKGYSEYHRDGVSDRLPRMLGTQAAGAAPLVTGAPV
KDPETIATAIRIGSPASWNSAVEAQQQSDGRFLAATDEEILAAYHLVARTEGVFVEPASAASIAGLLKSVEDGWVKRGST
VVCTVTGNGLKDPDTALKGMPQVTPVPVDPSAVVAELGLS
>P9WG59 4.2.3.1~~~thrC~~~Threonine synthase~~~COG0498
MTVPPTATHQPWPGVIAAYRDRLPVGDDWTPVTLLEGGTPLIAATNLSKQTGCTIHLKVEGLNPTGSFKDRGMTMAVTDA
LAHGQRAVLCASTGNTSASAAAYAARAGITCAVLIPQGKIAMGKLAQAVMHGAKIIQIDGNFDDCLELARKMAADFPTIS
LVNSVNPVRIEGQKTAAFEIVDVLGTAPDVHALPVGNAGNITAYWKGYTEYHQLGLIDKLPRMLGTQAAGAAPLVLGEPV
SHPETIATAIRIGSPASWTSAVEAQQQSKGRFLAASDEEILAAYHLVARVEGVFVEPASAASIAGLLKAIDDGWVARGST
VVCTVTGNGLKDPDTALKDMPSVSPVPVDPVAVVEKLGLA
>H7C6B6 ~~~thrE~~~Threonine/serine exporter~~~
MLSFATLRGRISTVDAAKAAPPPSPLAPIDLTDHSQVAGVMNLAARIGDILLSSGTSNSDTKVQVRAVTSAYGLYYTHVD
ITLNTITIFTNIGVERKMPVNVFHVVGKLDTNFSKLSEVDRLIRSIQAGATPPEVAEKILDELEQSPASYGFPVALLGWA
MMGGAVAVLLGGGWQVSLIAFITAFTIIATTSFLGKKGLPTFFQNVVGGFIATLPASIAYSLALQFGLEIKPSQIIASGI
VVLLAGLTLVQSLQDGITGAPVTASARFFETLLFTGGIVAGVGLGIQLSEILHVMLPAMESAAAPNYSSTFARIIAGGVT
AAAFAVGCYAEWSSVIIAGLTALMGSAFYYLFVVYLGPVSAAAIAATAVGFTGGLLARRFLIPPLIVAIAGITPMLPGLA
IYRGMYATLNDQTLMGFTNIAVALATASSLAAGVVLGEWIARRLRRPPRFNPYRAFTKANEFSFQEEAEQNQRRQRKRPK
TNQRFGNKR
>O69704 ~~~~~~Probable threonine/serine exporter~~~COG2966
MDQDRSDNTALRRGLRIALRGRRDPLPVAGRRSRTSGGIDDLHTRKVLDLTIRLAEVMLSSGSGTADVVATAQDVAQAYQ
LTDCVVDITVTTIIVSALATTDTPPVTIMRSVRTRSTDYSRLAELDRLVQRITSGGVAVDQAHEAMDELTERPHPYPRWL
ATAGAAGFALGVAMLLGGTWLTCVLAAVTSGVIDRLGRLLNRIGTPLFFQRVFGAGIATLVAVAAYLIAGQDPTALVATG
IVVLLSGMTLVGSMQDAVTGYMLTALARLGDALFLTAGIVVGILISLRGVTNAGIQIELHVDATTTLATPGMPLPILVAV
SGAALSGVCLTIASYAPLRSVATAGLSAGLAELVLIGLGAAGFGRVVATWTAAIGVGFLATLISIRRQAPALVTATAGIM
PMLPGLAVFRAVFAFAVNDTPDGGLTQLLEAAATALALGSGVVLGEFLASPLRYGAGRIGDLFRIEGPPGLRRAVGRVVR
LQPAKSQQPTGTGGQRWRSVALEPTTADDVDAGYRGDWPATCTSATEVR
>Q9I2Y2 3.1.3.3~~~thrH~~~Phosphoserine phosphatase ThrH~~~
MEIACLDLEGVLVPEIWIAFAEKTGIDALKATTRDIPDYDVLMKQRLRILDEHGLKLGDIQEVIATLKPLEGAVEFVDWL
RERFQVVILSDTFYEFSQPLMRQLGFPTLLCHKLEIDDSDRVVGYQLRQKDPKRQSVIAFKSLYYRVIAAGDSYNDTTML
SEAHAGILFHAPENVIREFPQFPAVHTYEDLKREFLKASSRSLSL
>Q883R9 3.1.3.3~~~~~~Phosphoserine phosphatase~~~COG0560
MAAMVTKSAVQWPLPEARVPLPLILNAVGECAVEIACLDLEGVLVPEIWIAFAEKTGIESLRATTRDIPDYDVLMKQRLR
ILDEHGLKLADIQAVISTLKPLEGAVEFVDWLRERFQVVILSDTFYEFSQPLMRQLGFPTLLCHRLITDETDRVVSYQLR
QKDPKRQSVLAFKSLYYRIIAAGDSYNDTTMLGEADAGILFHAPDNVIREFPQFPAVHTFDELKKEFIKASNRELVL
>P0DW59 3.2.2.5~~~thsA~~~NAD(+) hydrolase ThsA~~~
MFEHEQKIMIDRIVKELEENNFAIFAGAGLSAPAGYVNWKELLRPLSIELNLDIDKETDLVSLAQYYVNENHGRNRLTER
LIDEVGVAREPTPNHKILAKLPISTYWTTNYDDLIEKALDNEGKIADKKFTKNHLSQTKKGRSAVVYKMHGDASLPDQAI
ITKDQYESYPLHFAPFVTALSGDLVSKTFLFLGFSFNDPNLDYILSRIRIHFEQNQRQHYCIFRKVNRADYSNDEDFSYN
LLKQQFVIKDLARFSIKVVLIDAWNDLTRILEEITKRFRCKNVFLSGSAHEFGSWGQTATELFLSKLGEVLIQEGFKITS
GLGLGIGNAFISGAIKEIYNRKYTKIDDYLTMKVFPQFVADPTERKNIWTAWRKDLLSQTGIALFFMGNKIIKDPESGKQ
TIVLADGMDEEFHIAHELGLKLIPIGASGYKAKELFNQIISDFDHYYPNSSPKFREAFEKLNEEVDEPVKLLSKIHDVIK
LI
>I2C645 ~~~thsA~~~Thoeris protein ThsA~~~
MNIIGSDRNTNIGGFMWKFKELIRDKTFRRWALSIILTIPTSVSTFISFLDLDARCRLIILLILVGLSLVIIIVQFIRLL
FMNNITLNLNGSEVEIKKGDIFEVPRNNYKVIAFNEYFDTQVDDVIIARETLNGQYIKRYYSHQDITELDQKIKDDVKLK
IEEKNVERPFGGKTTRYSLGSVFKDMDFFLVAFSKFDRENRAQLKLNEYASCMLNVWNEINTLHASKEVFIPLLGSGITR
HVDSDVGVNELLHIMLWTFQISKVKFREPAKVTILLYKNDHKKINFYKLKEFEKNGL
>J8G6Z1 3.2.2.5~~~thsA~~~NAD(+) hydrolase ThsA~~~
MKMNPIVELFIKDFTKEVMEENAAIFAGAGLSMSVGYVSWAKLLEPIAQEIGLDVNKENDLVSLAQYYCNENQGNRGRIN
QIILDEFSRKVDLTENHKILARLPIHTYWTTNYDRLIEKALEEENKIADVKYTVKQLATTKVKRDAVVYKMHGDVEHPSE
AVLIKDDYEKYSIKMDPYIKALSGDLVSKTFLFVGFSFTDPNLDYILSRVRSAYERDQRRHYCLIKKEERRPDELEADFE
YRVRKQELFISDLSRFNIKTIVLNNYNEITEILQRIENNIKTKTVFLSGSAVEYNHWETEHAEQFIHQLSKELIRKDFNI
VSGFGLGVGSFVINGVLEELYMNQGTIDDDRLILRPFPQGKKGEEQWDKYRRDMITRTGVSIFLYGNKIDKGQVVKAKGV
QSEFNISFEQNNYVVPVGATGYIAKDLWNKVNEEFETYYPGADARMKKLFGELNNEALSIEELINTIIEFVEILSN
>A0A5B8Z1N3 3.2.2.5~~~thsA~~~NAD(+) hydrolase ThsA~~~
MKIVLEEIAMATDKEVLIKEFLKALHEDNAAIFAGAGLSAASGFVNWKGLLKEAADELELDIEKETDLISLAQYFFNKNG
RQRLSQLVIDNFSAEAQLNENHRILAQLPIDTYWTTNYDRLIEKSLTDVGKNPDVKIKQSDFALLKPKRDAIVYKMHGDI
ERASETVLIKDEYEMFHENNQLFSIGLKGDLISKTFLFIGYSFEDPDLEYILSRIRVLMGQDGRNHYCFFRKVNRNQYNH
LPKEEGDEKFRYDSIKQELKCADLERYHIKPVLVDKYEDITEILQTILQRYCRSKILISGSAVEYKQFVPDHNTAQMFIH
TLSREMVKAGFKIASGFGLGVGSAVINGSLDYVYSTNKRKISDYLILRPFPQYATNGLELMDLWDQYRRDFISDVGCAVF
IFGNKEVNGKVVDAGGVRKEFDIAVAQGIKVIPVGATGYMSKTLWEETITNYDKYYSDFPALKADFEFIGDASHNHHEII
TRIIKIITALRAGR
>P0DW60 3.2.2.5~~~thsA~~~NAD(+) hydrolase ThsA~~~
MDKKVLIKRFSEAIEKGNAAIFAGAGLSMSQGYVSWPELLNDPATEIGLDSKKETDLVTLAQYYKNENGGSRGILNQILM
DNFGEELEISENHRILASLPIETYWTTNYDHLIEKSIREAYKNPQVKKNYTQLATTNPNVDTIVYKMHGDIDDVSSTVIT
RDDYEKYDDDSYALFKETLKGDLLTKTFLFLGFSFTDPNLERILSDIRWVLRENQRPHYCIMRKILKENFVDSEDFFDQE
RYNYELTKRRLQINDLSRFSINVVEVDDYSEITDILKSIRKKYLRKTIFISGSAVDYTPFTSESKGLKFVEKLAYRLSES
GYRIVSGYGLGIGNSIVSGVLKQRRNLRKNNIQDVLSLRPLPLDMPHEWRRYRENIIAESGISIFVFGNKLESGEIVTAD
GMIEEFELSVQNENIVVPIGFTKGASKVLFDKIQENFSDYFDDSLKSKFQALEKLDTEGEVDKKVEQIVSLINSIQED
>C0MAL8 3.2.2.5~~~thsA~~~NAD(+) hydrolase ThsA~~~
MFIIKGEVSRKDLIREIEKAIKSDELGAFIGAGLSIPAGFCSWKELLREPAEEIGLDVEKESDLVNLAQYYSNSKKRTSI
DDLIKGQFSQLVKPTENHKLLSQLPISTFWTTNYDKLIEKALENNMKKPYVKTKDEQLRGTNHNFDAIVYKLHGDVETPE
DAVITRSDYEEFGYNKRKLFREVLEGDLLTKTFLFLGFSFEDPNFNYVIGRLRVLLDEKNTRKHYCIMKRVQDADEDYEY
KKARQELQIEDLNRYGIFTYLVNKYDEITEILSTLVDRFRRKTIFISGSAYSYSAYSQKTGENFIHKLSFELSKNGYHIV
NGYGKGVGEFVLNGVADYCLTHKSKINDFLTLMPFPQNSSLGIDLDKLYKENREQMIESCGIAIFLFGNKEAEDIASGVM
DEYELSKKHGLVCLPIEYTGGASKEIYDQTTQEISDKNTISAIEQANKQCDGDIDMSVKNIVQAVKILNKEEF
>J8G8J6 3.2.2.-~~~thsB1~~~Putative cyclic ADP-D-ribose synthase ThsB1~~~
MAKRVFFSFHYQDVIDFRVNVVRNHWVTKLNQSAAGVFDASLWEDAKKTSDIALKRLINGGLNNTSVTCVLIGSQTFNRR
WVRYEIMKSIEKGNKIIGIHINAFKDKYGNIKSKGPNPFDYLGYQYSSDGKQLHLYEWTGGKWEEYKDLAPYRVNQIAPE
SLRGKFYSLSSVYRVYDWVADDGYNKFSSWVN
>A0A5B8Z670 3.2.2.-~~~thsB1~~~Putative cyclic ADP-D-ribose synthase TIR1~~~
MGRKIFISYKYSDSGVYPLNGNYSTTVRDYVDELQSKLKEGDHINKGEADGEDLSEFKDETIWTKLKDKIFDSSITIIFI
SKNMKALFQSEEDQWIPWEISYSLRELTRNGRTSGANALLAVVLPDQYNSYEYFIHDNSCPYCNCITLKTNTLFTILREN
MFNIKSPTYNDCSNHSESNKVYTGESSYVRSVKWVDFIDNFDIYLQVAYNINENKDEYTLHKSV
>J8CSK2 3.2.2.-~~~thsB'~~~Probable 3' cyclic ADP-D-ribose synthase ThsB'~~~
MTLFTENDLLNNSYKSIQKSYHFSENQAAKNILEQAYKNYDKNKIYDIFLSHSFLDARKILGLKNYIEGLGYSVYVDWIE
DKQLDRSKVSKETAGILRERMQSCKSLFFAISENSDHSLWMPWELGYFDGIKQKVAILPVLKSSYDDSYNGQEYLGLYPY
VAKGTIINSTQEEIWIHSSQKQYVRFRNWLQQN
>A0A5B8Z260 3.2.2.-~~~thsB2~~~Putative cyclic ADP-D-ribose synthase TIR2~~~
MPGVAKRKYVGETMRINKNLSQVLRRLSDVLPYNYNAETLLELFQQLYPHEWRELNQRFDQYKEKDEFLLKKGKKIRYKP
NPPKEHFFKLPIVKNILSKGRIAKHNANFDELAYQERFAKFKAKRENAIRSRNEKIAKANELIQNVEPLFIDTFIAAYHK
RGISFDEKMEIFKELQKYKSKKTAEFFYKLSESERNNQIRNMAFKHLQVTGNYVKLRKNFNGKKKEYMTESSEFFMTPLD
LLKRIESNNVQNKKVYDVFISHSYKDSSVIKKIIKAFNKLSISIYCDWTSDSDFLKRELVSEYTKVVLKKRIEQSKNIVF
VKTDNSLESHWVRFELDYSRELGKTLFCINLSDEAEGECNVLQFDVKNETISWTTGLVK
>Q9I3L9 ~~~cysP~~~Thiosulfate-binding protein~~~
MKRLFSASLLAAGLALGGAAHAAQPLLNVSYDVMRDFYKEYNPAFQKYWKAEKGENITIQMSHGGSSKQARSVIDGLPAD
VITMNQATDIDALADNGGLVPKDWATRLPNNSAPFTSATVFIVRKGNPKALKDWPDLLKDGVQVVVPNPKTSGNGRYTYL
SAWGYVLKNGGDENKAKEFVGKLFKQVPVLDTGGRAATTTFMQNQIGDVLVTFENEAEMIAREFGRGGFEVVYPSVSAEA
EPPVAVVDKVVEKKGSRAQAEAYLKYLWSDEGQTIAANNYLRPRNPEILAKFADRFPKVDFFSVEKTFGDWRSVQKTHFI
DGGVFDQIYSPN
>I2C644 3.2.2.-~~~thsB~~~Putative cyclic ADP-D-ribose synthase ThsB~~~
MGYRNGNYAAFYVSEPFSESSLGANATKDFVSYNMLRAWKGKDNNYPFNDSHDKTYNVRDGSDWEKTLKPRLRKRLDQSK
NIIFFLSKHTENSKALREEIDYGINVKGLPVIVVYPELSEKSDIIDCTTKVFRSEVVNLWSRVPVFKDSMLKVPTIHIPY
KKDQIKKALENKDFMINSKISAGSVYFYPC
>P9WHF7 2.8.1.1~~~sseA~~~Putative thiosulfate sulfurtransferase SseA~~~COG2897
MPLPADPSPTLSAYAHPERLVTADWLSAHMGAPGLAIVESDEDVLLYDVGHIPGAVKIDWHTDLNDPRVRDYINGEQFAE
LMDRKGIARDDTVVIYGDKSNWWAAYALWVFTLFGHADVRLLNGGRDLWLAERRETTLDVPTKTCTGYPVVQRNDAPIRA
FRDDVLAILGAQPLIDVRSPEEYTGKRTHMPDYPEEGALRAGHIPTAVHIPWGKAADESGRFRSREELERLYDFINPDDQ
TVVYCRIGERSSHTWFVLTHLLGKADVRNYDGSWTEWGNAVRVPIVAGEEPGVVPVV
>P9WHF5 2.8.1.1~~~sseB~~~Putative thiosulfate sulfurtransferase SseB~~~COG2897
MQARGQVLITAAELAGMIQAGDPVSILDVRWRLDEPDGHAAYLQGHLPGAVFVSLEDELSDHTIAGRGRHPLPSGASLQA
TVRRCGIRHDVPVVVYDDWNRAGSARAWWVLTAAGIANVRILDGGLPAWRSAGGSIETGQVSPQLGNVTVLHDDLYAGQR
LTLTAQQAGAGGVTLLDARVPERFRGDVEPVDAVAGHIPGAINVPSGSVLADDGTFLGNGALNALLSDHGIDHGGRVGVY
CGSGVSAAVIVAALAVIGQDAELFPGSWSEWSSDPTRPVGRGTA
>P31142 2.8.1.2~~~sseA~~~3-mercaptopyruvate sulfurtransferase~~~COG2897
MSTTWFVGADWLAEHIDDPEIQIIDARMASPGQEDRNVAQEYLNGHIPGAVFFDIEALSDHTSPLPHMLPRPETFAVAMR
ELGVNQDKHLIVYDEGNLFSAPRAWWMLRTFGVEKVSILGGGLAGWQRDDLLLEEGAVELPEGEFNAAFNPEAVVKVTDV
LLASHENTAQIIDARPAARFNAEVDEPRPGLRRGHIPGALNVPWTELVREGELKTTDELDAIFFGRGVSYDKPIIVSCGS
GVTAAVVLLALATLDVPNVKLYDGAWSEWGARADLPVEPVK
>P52197 2.8.1.1~~~rhdA~~~Thiosulfate sulfurtransferase~~~
MDDFASLPLVIEPADLQARLSAPELILVDLTSAARYAEGHIPGARFVDPKRTQLGQPPAPGLQPPREQLESLFGELGHRP
EAVYVVYDDEGGGWAGRFIWLLDVIGQQRYHYLNGGLTAWLAEDRPLSRELPAPAGGPVALSLHDEPTASRDYLLGRLGA
ADLAIWDARSPQEYRGEKVLAAKGGHIPGAVNFEWTAAMDPSRALRIRTDIAGRLEELGITPDKEIVTHCQTHHRSGLTY
LIAKALGYPRVKGYAGSWGEWGNHPDTPVEL
>Q9RXT9 2.8.1.1~~~~~~Thiosulfate sulfurtransferase~~~COG2897
MDYAKDVLVSTEWAAQNLQTPGVRFIEVDEDILLYETGHLPGAVKLDWQTDLWHPVERDFIEPQQVSELLGKLGIKADDT
IVLYGDKSNWWASYAYWFLTYSGVSNLKIMNGGRQKWVAEGREMTTEAPTVTATTYPALQRDESLRAYRDEVRAHLESVN
NGQGAMVDVRSPDEFSGKVTHMPNYPQEGVLRGGHIPGARNIPWAKATNEDGTFKSADELKALYEGEGVTADKDVIAYCR
IAERSSHSWFVLRELLGYPKVRNYDGSWTEWGNGVGLPIEKTYSEE
>A0R4C9 2.8.1.1~~~~~~Putative thiosulfate sulfurtransferase~~~COG2897
MARSDVLVSTDWAESNLKAPKTVFVEVDEDTSAYDTGHIEGAVKLDWKTDLQDPIRRDFVDAQQFSKLLSERGIANDDTV
ILYGGNNNWFAAYAYWYFKLYGHQDVKLLDGGRKKWELDARPLSAEKVERPQTSYTAKEPDNSIRAFRDEVIAAIGTKNL
VDVRSPDEFSGKILAPAHLPQEQSQRPGHIPGAINVPWSKAANEDGTFKSDEELAKLYAEAGLDGEKETIAYCRIGERSS
HTWFVLQELLGHKNVKNYDGSWTEYGSLVGAPIELGS
>P9WHF9 2.8.1.1~~~cysA1~~~Putative thiosulfate sulfurtransferase~~~COG2897
MARCDVLVSADWAESNLHAPKVVFVEVDEDTSAYDRDHIAGAIKLDWRTDLQDPVKRDFVDAQQFSKLLSERGIANEDTV
ILYGGNNNWFAAYAYWYFKLYGHEKVKLLDGGRKKWELDGRPLSSDPVSRPVTSYTASPPDNTIRAFRDEVLAAINVKNL
IDVRSPDEFSGKILAPAHLPQEQSQRPGHIPGAINVPWSRAANEDGTFKSDEELAKLYADAGLDNSKETIAYCRIGERSS
HTWFVLRELLGHQNVKNYDGSWTEYGSLVGAPIELGS
>Q9HUK9 2.8.1.1~~~rhdA~~~Thiosulfate sulfurtransferase~~~
MSVFSDLPLVIEPSDLAPRLGAPELILVDLTSAARYAEGHIPGARFVDPKRTQWGQPPAPGLLPAKADLEALFGELGHRP
EATYVVYDDEGGGWAGRFIWLLDVIGHHHYHYLNGGLPAWIADAQALDREVPAPVGGPLPLTLHDEPSATREYLQSRLGA
ADLAVWDARNPSEYAGTKVLAAKAGHVPGAINFEWTAGMDPARALRIRADIAEVLEDLGITPDKEVITHCQTHHRSGFTY
LVAKALGYPRVKGYAGSWSEWGNHPDTPVEV
>P16385 2.8.1.1~~~cysA~~~Putative thiosulfate sulfurtransferase~~~
MSREEVLVSTDWAEQNLNTDGVVFAEVDEDTTAYDGGHIPGAIKLDWKNELQDHVRRDFVNREGFEKLLSAKGIGNDDTV
ILYGGNNNWFAAYAYWYFKLYGHSDVKLLDGGRKKWELDGRELTKEEPNRAATAYKAQEPDASIRAFRDEVVDAIGNKNL
VDVRSPDEFAGKLLAPAHLPQESAQRAGHIPSAINVPWSKAANEDGTFKSDEELKQVYGEAGLDTDKDTIAYCRIGERSS
HTWFVLRELLGHTNVKNYDGSWTEYGSLVGVPIENPQEQGA
>P27477 2.8.1.1~~~rhdA~~~Putative thiosulfate sulfurtransferase~~~COG2897
MSVRSLRWPRQKAFLAVISLVVAVLLAVPGWLTPATAASQATVQFVAPTWAAERLNNKQLKILDVRTNPLAYIEGHLPGA
VNIADAAYRGPNGFLPVQIWDPEKLASLFGRAGVSNNDTVLVYSDGNDVLGATLVAYLLERSGVQNIAVLDGGYKGYKDA
GLPVTKEYPRYQAARFAPKDNRAFRVDIKQVEQLTGKSTFVDPRPPALFSGEQQVFIRNGHIPGARNIPWPTFTEANNAN
ESLKNPHKLKPLSELKAILEAKGVTPDKDVIVTCSTGREASLQYLVLKHLLKYPKVRIYEGSWTEYSASNLPVETGPDRV
>P40111 2.1.1.148~~~thyX~~~Flavin-dependent thymidylate synthase~~~COG1351
MAEQVKLSVELIACSSFTPPADVEWSTDVEGAEALVEFAGRACYETFDKPNPRTASNAAYLRHIMEVGHTALLEHANATM
YIRGISRSATHELVRHRHFSFSQLSQRFVHSGESEVVVPTLIDEDPQLRELFMHAMDESRFAFNELLNALEEKLGDEPNA
LLRKKQARQAARAVLPNATESRIVVSGNFRTWRHFIGMRASEHADVEIREVAVECLRKLQVAAPTVFGDFEIETLADGSQ
MATSPYVMDF
>O26061 2.1.1.148~~~thyX~~~Flavin-dependent thymidylate synthase~~~COG1351
MEVICKHYTPLDIASQAIRTCWQSFEYSDDGGCKDKELIHRVGNIFRHSSTLEHLYYNFEIKGLSRGALQELSRHRIASL
SVKSSRYTLRELKEVESFLPLNETNLERAKEFLVFVDNEKVNAMSVLALENLRILLSEHNIKNDLAKYAMPESYKTHLAY
SINARSLQNFLTLRSSNKALKEMQDLAKALFDALPGEHQYLFEDCLKH
>P9WG57 2.1.1.148~~~thyX~~~Flavin-dependent thymidylate synthase~~~COG1351
MAETAPLRVQLIAKTDFLAPPDVPWTTDADGGPALVEFAGRACYQSWSKPNPKTATNAGYLRHIIDVGHFSVLEHASVSF
YITGISRSCTHELIRHRHFSYSQLSQRYVPEKDSRVVVPPGMEDDADLRHILTEAADAARATYSELLAKLEAKFADQPNA
ILRRKQARQAARAVLPNATETRIVVTGNYRAWRHFIAMRASEHADVEIRRLAIECLRQLAAVAPAVFADFEVTTLADGTE
VATSPLATEA
>Q9WYT0 2.1.1.148~~~thyX~~~Flavin-dependent thymidylate synthase~~~COG1351
MKIDILDKGFVELVDVMGNDLSAVRAARVSFDMGLKDEERDRHLIEYLMKHGHETPFEHIVFTFHVKAPIFVARQWFRHR
IASYNELSGRYSKLSYEFYIPSPERLEGYKTTIPPERVTEKISEIVDKAYRTYLELIESGVPREVARIVLPLNLYTRFFW
TVNARSLMNFLNLRADSHAQWEIQQYALAIARIFKEKCPWTFEAFLKYAYKGDILKEVQV
>Q5SJB8 2.1.1.148~~~thyX~~~Flavin-dependent thymidylate synthase~~~COG1351
MEGPLTIPVLDKGFVRLVDQMGDDRAIVQAARVSYGEGTKTVREDAALIDYLMRHRHTSPFEMVVFKFHVKAPIFVARQW
FRHRTASVNEISGRYSILKEEFYEPEAFRKQAKRNKQASEGALLDEEALALLRKVQQEAYGAYRALLEKGVAREMARMVL
PLNLYTEFYWKQDLHNLFHFLKLRLAPEAQWEIRQYARAIAEIVKERVPLAWAAFEEHLLEGAFLSRTELRALRGLLTPE
VYEKALSSLGLGGSRLKEALEKVFGPGEAL
>Q9XD84 ~~~tibA~~~Autotransporter adhesin/invasin TibA~~~
MNKVYNTVWNESTGTWVVTSELTRKGGLRPRQIKRTVLAGLIAGLLMPSMPALAAAYDNQTIGRGETSKSMHLSAGDTAK
NTTINSGGKQYVSSGGSATSTTINIGGVQHVSSGGSATSSTINSGGHQHVSSGGSATNTTVNNGGRQTVFSGGSAMGTII
NSGGDQYVISGGSATSASVTSGARQFVSSGGIVKATSVNSGGRQYVRDGGSATDTVLNNTGRQFVSSGGSAAKTTINSGG
GMYLYGGSATGTSIYNGGRQYVSSGGSATNTTVYSGGRQHVYIDGNVTETTITSGGMLQVEAGGSASKVIQNSGGAVITN
TSAAVSGTNDNGSFSIAGGSAVNMLLENGGYLTVFDGHQASDTMVGSDGTLDVRSGGVLYGTTTLTDKGALVGDVVTNEG
NLYYLNNSTATFTGTLTGTGTLTQEGGNTRFSGLLSQDGGIFLQSGGAMTMDALQAKANVTTQSGTTLTLDNGTILTGNV
AGDSTGAGDMAVKGASVWHLDGDSTVGALTLDNGTVDFRPSTTTRMTPAFQAVSLALGSLSGSGTFQMNTDIASHTGDML
NVAGNASGNFVLDIKNTGLEPVSAGAPLQVVQTGGGDAAFTLKGGKVDAGTWEYGLSKENTNWYLKADTPPPVTPPTNPD
ADNPDAGNPDAGNPDAGNPDAGNPDAGKPGTGKPDAGTSSSPVRRTTKSVDAVLGMATAPAYVFNSELDNLRFRHGDVMQ
NTRAPGGVWGRYTGSDNRISGGASSGYTLTQNGFETGADMVFDLSDSSLAVGTFFSYSDNSIKHARGGKSNVDSSGGGLY
ATWFDNDGYYVDGVLKYNRFNNELRTWMSDGTAVKGDYSQNGFGGSLEAGRTFSLNENAWAQPYVRTTAFRADKKEIRLN
NGMKASIGATKSLQAEAGLKLGMTLDVAGKEVKPYLSAAVSHEFSDNNKVRINDTYDFRNDISGTTGKYGLGVNAQLTPN
AGVWAEARYENGKQTESPITGGVGFRINF
>Q9S4K6 2.4.99.-~~~tibC~~~Autotransporter heptosyltransferase TibC~~~
MSTLKNTFFITPPDTPTQAGPENIFYDFNDGARVLLPEGKWHVRLLDADSENILFCCDVDKGWVTSSKKYFVRFRIQVFR
QGEETPLLDETLKLKDRPVLISFPTGTLGDLLGWFPYAERFQSLHKCRLECTMSQDIIDLLAPQYPQIQFSTPDKPRTVA
PYATYRVGLYFGGDTNNQPVDFRKVGFHRSAGYILGVDPREAPVRLDLSAPRVIQEPYVCIATQSTCQAKYWNNGTGWSE
VIAHLKSLGYRVMCIDRDAHYGQGFVWNHIPWGAEDFTGKLPLQERVNLLRHASFFIGLPSGLSWLAWATRIPVVLISGF
SLPNSEFYTPWRVFNSHGCYGCWDDTSLNFDHHDFLWCPRHKNTDRQFECTRLITGAQVNGVINKLHRSLTEQGVEATLK
KGVSNE
>P80698 5.2.1.8~~~tig~~~Trigger factor~~~COG0544
MSVKWEKQEGNEGVLTVEVDAETFKTALDDAFKKVVKQVSIPGFRKGKIPRGLFEQRFGVEALYQDALDILLPVEYPKAV
EEAGIEPVDRPEIDVEKIEKGESLIFTAKVTVKPEVKLGEYKGLGIEKDDTTVTDEDVQNELKALQERQAELVVKEEGAV
EEGNTVVLDFEGFVDGEAFEGGKAENYSLEVGSGSFIPGFEDQLVGLEAGAEKDVEVTFPEEYHAEDLAGKPAVFKVKIH
EIKAKELPELDDEFAKDIDEEVETLAELTEKTKKRLEEAKENEADAKLREELVLKASENAEIDVPQAMVDTELDRMLKEF
EQRLQMQGMNLELYTQFSGQDEAALKEQMKEDAEKRVKSNLTLEAIAKAENLEVSDEEVDAELTKMAEAYNMPVENIKQA
IGSTDAMKEDLKVRKAIDFLVENR
>Q83DJ3 5.2.1.8~~~tig~~~Trigger factor~~~COG0544
MRGNFMSSIEKLGGLKQRLTITVPAEEVDKAYKSRLLKVARTAKIPKFRPGKASPAVVEKLYGKAILQEVGSELIQSSLR
EAVEEHQLRVAGAPDIKMDKILRGEPFKYVVNFEVYPEITLESLAGETIERTQVEITEEDLDKMLEALRKQYAEWKEMDR
PAKADDRVIIDFEGTLDGKPFERGSAKDFQLELGSKRMIAGFEEGIEGMKPGESKALDITFPADYPSEDLAGKAAVFNIT
LQKVMAPELPVLDEQFAERLGIKEGGLEALRQKVRTNMEKEVHHHMENKLKMAVLDKLIERNPIEVPESLIEAEIDHLQQ
MTRQQVAMQTHKPDEAKKMELPRDPYREQATKRVKLGLLLAEVVKQHKIKADPEQLRARVEEVAASYQDPEKVISWYYTN
KQMLSEIESVVLEDQAVAQLMSELEVKDQAIPYEEAVKQIQQ
>Q9RT21 5.2.1.8~~~tig~~~Trigger factor~~~COG0544
MAELISKEGNKVEFKVSVPAAEVNRAYDQVWAGLARDVRVPGFRPGKAPRKVIENRVGKGYVESQVRDRLLETHYSQGLR
ELGLNLVDATVDPQDVQSGQAFEFTVKGETYPEVKLGDWQGLKVSAQAPEITDEVLEQTLSDLRERNASFEKAERPIEAA
DQVTIQELGEGDSEEGGSYPIYLDMAEEHVRNALLGKSAGDVVDITVPAHQHGDHEHAEHTVRVKVVEVSSKKLQDLNDE
FATSLNYESMDKLRTDLREELERRAQQEGDNLRREELVGHLVEGMTVEIPQALIDRRREGMMSEIQDDLRRQGVQWKEYE
AFMQEQGKLDEFEADLTKNAETRVRRDLALEQLATDLNAQVNEAEFNQTLMNLAQANGMNVQQLVQQLGQDGVQSYYISL
LRERGLQRALAQLSGEGQSTEAASPKATGTEAAGTEQSEPAQTETAQNDAGQTETAQSEGEQQSE
>C4ZTJ3 5.2.1.8~~~tig~~~Trigger factor~~~
MQVSVETTQGLGRRVTITIAADSIETAVKSELVNVAKKVRIDGFRKGKVPMNIVAQRYGASVRQDVLGDLMSRNFIDAII
KEKINPAGAPTYVPGEYKLGEDFTYSVEFEVYPEVELQGLEAIEVEKPIVEVTDADVDGMLDTLRKQQATWKEKDGAVEA
EDRVTIDFTGSVDGEEFEGGKASDFVLAMGQGRMIPGFEDGIKGHKAGEEFTIDVTFPEEYHAENLKGKAAKFAINLKKV
EERELPELTAEFIKRFGVEDGSVEGLRAEVRKNMERELKSAIRNRVKSQAIEGLVKANDIDVPAALIDSEIDVLRRQAAQ
RFGGNEKQALELPRELFEEQAKRRVVVGLLLGEVIRTNELKADEERVKGLIEEMASAYEDPKEVIEFYSKNKELMDNMRN
VALEEQAVEAVLAKAKVTEKETTFNELMNQQA
>P0A850 5.2.1.8~~~tig~~~Trigger factor~~~COG0544
MQVSVETTQGLGRRVTITIAADSIETAVKSELVNVAKKVRIDGFRKGKVPMNIVAQRYGASVRQDVLGDLMSRNFIDAII
KEKINPAGAPTYVPGEYKLGEDFTYSVEFEVYPEVELQGLEAIEVEKPIVEVTDADVDGMLDTLRKQQATWKEKDGAVEA
EDRVTIDFTGSVDGEEFEGGKASDFVLAMGQGRMIPGFEDGIKGHKAGEEFTIDVTFPEEYHAENLKGKAAKFAINLKKV
EERELPELTAEFIKRFGVEDGSVEGLRAEVRKNMERELKSAIRNRVKSQAIEGLVKANDIDVPAALIDSEIDVLRRQAAQ
RFGGNEKQALELPRELFEEQAKRRVVVGLLLGEVIRTNELKADEERVKGLIEEMASAYEDPKEVIEFYSKNKELMDNMRN
VALEEQAVEAVLAKAKVTEKETTFNELMNQQA
>P47480 5.2.1.8~~~tig~~~Trigger factor~~~COG0544
MKLYKVLNSKTTDKSLCLEVEIDPNYWQATQKKLVGEMAKSIKIKGFRPGKIPPNLASQSINKAELMQKSAQNVMNSIYE
SVQQEEIVASNDNVIDDYPTIDFKTITEQNCVLLFYFDLIPNFQLPDYKKIKDLTPLTKLTEAEFNNEIEKLAKTKSTMV
DVSDKKLANGDIAIIDFTGIVDNKKLASASAQNYELTIGSNSFIKGFETGLIAMKVNQKKTLALTFPSDYHVKELQSKPV
TFEVVLKAIKKLEFTPMDETNFKSFLPEQFQSFTSLKAFKSYFHKLMENKKQETILQENNQKIRQFLLTNTKLPFLPEAL
IKLEANRLLKLQQSQAEQYKIPFEKLLSASNITLTELQDRNIKEAKENVTFALVMKKIADIEKIKVDNNKIKAEIENVIA
VEYPFASDEMKKQLFFNMEQQKEFVESIIINRLTTTKIVSYSTH
>A0R199 5.2.1.8~~~tig~~~Trigger factor~~~COG0544
MKSTVEQLSPTRVRINVEVPFTELEPDFDRAFKELAKQVRLPGFRPGKAPRKLLEARIGRGAVLEQVVNDALPSRYSEAV
STSDLKPLGQPEIEITKLEDNEELVFTAEVDIRPEITLPELESLKITVDPIEVTDEEVDAELQSLRARFGTLKGVERGVQ
EGDFVSIDLSATVDGNEVPEAATEGLSHEVGSGQLIDGLDEAIIGLKADESKTFTTKLVAGEYAGQDAEVTVTVKSVKER
ELPEPDDEFAQLASEYDTIEELRNSLVDQVRRLKSVQQAEQIRDKAIEALLEQTEVPLPEKIVQAQIDEVVHNAIHGLDH
DEEKFAEQLAEQGSSREEFDAETRTEAEKAVKTQLLMDAVADKLEIQVSQNDLTERLVLMSRQYGLEPQQLIQILQQNNQ
LPAMFADVRRGLTIAAVVHAATVTDTDGNVIDTMEFFGPSGEQAAEDSAEESTDAAEGEAAEDADDTDK
>P9WG55 5.2.1.8~~~tig~~~Trigger factor~~~COG0544
MKSTVEQLSPTRVRINVEVPFAELEPDFQRAYKELAKQVRLPGFRPGKAPAKLLEARIGREAMLDQIVNDALPSRYGQAV
AESDVQPLGRPNIEVTKKEYGQDLQFTAEVDIRPKISPPDLSALTVSVDPIEIGEDDVDAELQSLRTRFGTLTAVDRPVA
VGDVVSIDLSATVDGEDIPNAAAEGLSHEVGSGRLIAGLDDAVVGLSADESRVFTAKLAAGEHAGQEAQVTVTVRSVKER
ELPEPDDEFAQLASEFDSIDELRASLSDQVRQAKRAQQAEQIRNATIDALLEQVDVPLPESYVQAQFDSVLHSALSGLNH
DEARFNELLVEQGSSRAAFDAEARTASEKDVKRQLLLDALADELQVQVGQDDLTERLVTTSRQYGIEPQQLFGYLQERNQ
LPTMFADVRRELAIRAAVEAATVTDSDGNTIDTSEFFGKRVSAGEAEEAEPADEGAARAASDEATT
>Q9JZ37 5.2.1.8~~~tig~~~Trigger factor~~~
MMSVTVETLENLERKVVLSLPWSEINAETDKKLKQTQRRAKIDGFRPGKAPLKMIAQMYGASAQNDVINELVQRRFYDVA
VAQELKVAGFPRFEGVEEQDDKESFKVAAIFEVFPEVVIGDLSAQEVEKVTASVGDAEVDQTVEILRKQRTRFNHVEREA
RNGDRVIIDFEGKIDGEPFAGGASKNYAFVLGASQMLPEFEAGVVGMKAGESKDVTVNFPEDYHGKDVAGKTAVFTITLN
NVSEATLPEVDADFAKALGIADGDVAKMREEVQKNVSREVERRVNEQTKESVMNALLKAVELKAPVALVNEEAARLANEM
KQNFVNQGMADAANLDLPLDMFKEQAERRVSLGLILAKLVDENKLEPTEEQIKAVVANFAESYEDPQEVIDWYYADPSRL
QAPTSLAVESNVVDFVLGKAKVNEKALSFDEVMGAQA
>P99080 5.2.1.8~~~tig~~~Trigger factor~~~
MTATWEKKEGNEGLLTVTVPAEKVNKALDQAFKKVVKQINVPGFRKGKVPRPIFEQRFGVEALYQDAIDILLPDAYGEAI
DETDIKPVAQPEVSVTQIEKGKDFIFEATVTVEPEVKLGDYKGLEIEKQETELSDDELQEAIDHSLGHLAEMVVKEDGVV
ENGDTVNIDFSGSVDGEEFEGGQAEGYDLEIGSGSFIPGFEEQLEGMKVDEEKDVVVTFPEEYHAEELAGKEATFKTKVN
EIKFKEVPELTDEIANELDAEANTVDEYKENLRKRLAEQKATDAENVEKEEAITKATDNTTIDIPEAMVNTELDRMVSEF
AQRIQQQGLDLQTYFQISGQDETQLREQMKDDAEQRVKTNLTLTAIAEAEKIEATDEDIDKELEKMSKQFNISVEDIKNT
LGNTDIIKNDVRIQKVIDLLRDNAKFVEGTKED
>Q9WZF8 5.2.1.8~~~tig~~~Trigger factor~~~COG0544
MEVKELERDKNRVVLEYVFGAEEIAQAEDKAVRYLNQRVEIPGFRKGRIPKNVLKMKLGEEFQEYTLDFLMDLIPDTLKD
RKLILSPIVTERELKDVTARVVVEVHEEPEVRIGDISKIEVEKVDEEKVLEKYVERRIEDLRESHALLEPKEGPAEAGDL
VRVNMEVYNEEGKKLTSREYEYVISEDEDRPFVKDLVGKKKGDVVEIEREYEGKKYTYKLEVEEVYKRTLPEIGDELAKS
VNNEFETLEQLKESLKKEGKEIYDVEMKESMREQLLEKLPEIVEIEISDRTLEILVNEAINRLKREGRYEQIVSSYESEE
KFREELKERILDDIKRDRVIEVLAQEKGISVNDEELEKEAEELAPFWGISPDRAKSLVKARQDLREELRWAILKRKVLDL
LLQEVKVKVVEPKGEGDDSEGKEDN
>Q9KQS5 5.2.1.8~~~tig~~~Trigger factor~~~COG0544
MQVTVETLEGLQRRLNITVPAANIEDAVAAELRNIAKNRRFDGFRKGKVPMKMVAKMYGKAVRQDVLGEVMQRHFIEAIV
KEKINPAGAPTFAPVEIGEGKDLVFTATFEVYPEVELKGLENIAVEKPAAEVTDADVAEMLETLRKQQATWKEVDEAAEN
GKRVSIDFVGSIDGVEFEGGKAENFPLEMGAGRMIPGFEDGIVGKTKGMEFVIDVTFPEDYHAENLKGKAAKFAIKVNKV
EARELPELNDEFVARFGVAEGGVDALKAEVRKNMERELKQAIKARIKEQAIEGLVKENEIQVPSALIDQEINVLRQQAAQ
RFGGNVEAAAQLPRELFEEQAKRRVVVGLLLGEVIRTHELKADEEKVKALITEMATAYEDPSEVVSYYEQNQQLMNNMRN
VALEEQAVDAIIAKAKVTEKAISFSELMNPVAA
>O67728 6.3.4.19~~~tilS~~~tRNA(Ile)-lysidine synthase~~~COG0037
MNPESRVIRKVLALQNDEKIFSGERRVLIAFSGGVDSVVLTDVLLKLKNYFSLKEVALAHFNHMLRESAERDEEFCKEFA
KERNMKIFVGKEDVRAFAKENRMSLEEAGRFLRYKFLKEILESEGFDCIATAHHLNDLLETSLLFFTRGTGLDGLIGFLP
KEEVIRRPLYYVKRSEIEEYAKFKGLRWVEDETNYEVSIPRNRIRHRVIPELKRINENLEDTFLKMVKVLRAEREFLEEE
AQKLYKEVKKGNCLDVKKLKEKPLALQRRVIRKFIGEKDYEKVELVRSLLEKGGEVNLGKGKVLKRKERWLCFSPEV
>P37563 6.3.4.19~~~tilS~~~tRNA(Ile)-lysidine synthase~~~COG0037
MKSVKDFLNKHNLTLKGATIIVGVSGGPDSMALLHALHTLCGRSANVIAAHVDHRFRGAESEEDMRFVQAYCKAEQLVCE
TAQINVTAYAQEKGLNKQAAARDCRYQFFEEIMSKHQADYLALAHHGDDQVETMLMKLAKGTLGTGLAGMQPVRRFGTGR
IIRPFLTITKEEILHYCHENGLSYRTDESNAKDDYTRNRFRKTVLPFLKQESPDVHKRFQKVSEALTEDEQFLQSLTKDE
MNKVITSQSNTSVEINSSQLLALPMPLQRRGVQLILNYLYENVPSSFSAHHIQQFLDWAENGGPSGVLDFPKGLKVVKSY
QTCLFTFEQWQCKNVPFEYQISGAADETAVLPNGYLIEARHYADSPEEHGNAVFITSEKKVRFPLTIRTRKAGDRIKLKG
MNGSKKVKDIFIDKKLPLQERDNWPIVTDASGEIIWIPGLKKSIFEDLVIPNSDRIVLQYRQHEKCRGQAKS
>P52097 6.3.4.19~~~tilS~~~tRNA(Ile)-lysidine synthase~~~COG0037
MTLTLNRQLLTSRQILVAFSGGLDSTVLLHQLVQWRTENPGVALRAIHVHHGLSANADAWVTHCENVCQQWQVPLVVERV
QLAQEGLGIEAQARQARYQAFARTLLPGEVLVTAQHLDDQCETFLLALKRGSGPAGLSAMAEVSEFAGTRLIRPLLARTR
GELVQWARQYDLRWIEDESNQDDSYDRNFLRLRVVPLLQQRWPHFAEATARSAALCAEQESLLDELLADDLAHCQSPQGT
LQIVPMLAMSDARRAAIIRRWLAGQNAPMPSRDALVRIWQEVALAREDASPCLRLGAFEIRRYQSQLWWIKSVTGQSENI
VPWQTWLQPLELPAGLGSVQLNAGGDIRPPRADEAVSVRFKAPGLLHIVGRNGGRKLKKIWQELGVPPWLRDTTPLLFYG
ETLIAAAGVFVTQEGVAEGENGVSFVWQKTLS
>Q5L3T3 6.3.4.19~~~tilS~~~tRNA(Ile)-lysidine synthase~~~COG0037
MIDKVRAFIHRHQLLSEGAAVIVGVSGGPDSLALLHVFLSLRDEWKLQVIAAHVDHMFRGRESEEEMEFVKRFCVERRIL
CETAQIDVPAFQRSAGLGAQEAARICRYRFFAELMEKHQAGYVAVGHHGDDQVETILMRLVRGSTSKGYAGIPVKRPFHG
GYLIRPFLAVSRAEIEAYCRQMGLSPRCDPSNEKDDYTRNRFRHHIVPLLRQENPRLHERFQQYSEMMAEDEQFLEELAA
DALNKVMEKQHRDAALSIGPFLELPRPLQRRVLQLLLLRLYGGVPPTLTSVHIGHILMLCERGRPSGMIDLPKGLKVIRS
YDRCLFTFDAESGEKGYWFELPVPALLPLPNGYAIISEFGEHYPRKQAGNDWFVVDPASVSLPLRVRTRRRGDRMVLKGT
GGTKKLKEIFIEAKIPRMERDRWPIVEDADGRILWVPGLKKSAFEAQNRGQARYILLQYQAMNS
>P9WG53 6.3.4.19~~~tilS~~~tRNA(Ile)-lysidine synthase~~~COG0037
MDRQSAVAQLRAAAEQFARVHLDACDRWSVGLSGGPDSLALTAVAARLWPTTALIVDHGLQPGSATVAETARIQAISLGC
VDARVLCVQVGAAGGREAAARSARYSALEEHRDGPVLLAHTLDDQAETVLLGLGRGSGARSIAGMRPYDPPWCRPLLGVR
RSVTHAACRELGLTAWQDPHNTDRRFTRTRLRTEVLPLLEDVLGGGVAEALARTATALREDTDLIDTIAAQALPGAAVAG
SRGQELSTSALTALPDAVRRRVIRGWLLAGGATGLTDRQIRGVDRLVTAWRGQGGVAVGSTLRGQRLVAGRRDGVLVLRR
EPV
>P0DUM2 ~~~timP~~~Toxic protein TimP~~~
MKVRCFCVVLLVSGTLCLHADRSYPGNSVPVTLNVQSR
>P0A4T9 ~~~tipA~~~HTH-type transcriptional activator TipA~~~
MSYSVGQVAGFAGVTVRTLHHYDDIGLLVPSERSHAGHRRYSDADLDRLQQILFYRELGFPLDEVAALLDDPAADPRAHL
RRQHELLSARIGKLQKMAAAVEQAMEARSMGINLTPEEKFEVFGDFDPDQYEEEVRERWGNTDAYRQSKEKTASYTKEDW
QRIQDEADELTRRFVALMDAGEPADSEGAMDAAEDHRQGIARNHYDCGYEMHTCLGEMYVSDERFTRNIDAAKPGLAAYM
RDAILANAVRHTP
>P0DJ90 ~~~tir~~~Translocated intimin receptor Tir~~~
MPIGNLGHNPNVRALIPPAPPLPSQTDGAGGARNQLINSNGPMGSRLLFTPIRNSVADAADSRASDIPGLPTNPLRFAAS
EVSLHGALEVLHDKGGLDTLNSAIGSSLFRVETRDDGSHVAIGQKNGLETTVVLSEQEFSSLQSLDPEGKNKFVFTGGRG
GAGHAMVTVASDIAEARQRIIDKLEPKDTKETKEPGDPNSGEGKIIEIHTSTSTSSLRADPKLWLSLGTIAAGLIGMAAT
GIAQAVALTPEPDDPITTDPDAAANTAEAAAKDQLTKEAFQNPDNQKVNIDENGNAIPSGELKDDVVAQIAEQAKAAGEQ
ARQEAIESNSQAQQKYDEQHAKREQEMSLSSGVGYGISGALILGGGIGAGVTAALHRKNQPAEQTITTRTVVDNQPTNNA
SAQGNTDTSGPEESPASRRNSNASLASNGSDTSSTGTVENPYADVGMPRNDSLARISEEPIYDEVAADPNYSVIQHFSGN
SPVTGRLVGTPGQGIQSTYALLASSGGLRLGMGGLTGGGESAVSTANAAPTPGPARFV
>P0DJ91 ~~~tir~~~Translocated intimin receptor Tir~~~
MPIGNLGHNPNVRALIPPAPPLPSQTDGAGGARNQLINSNGPMGSRLLFTPIRNSVADAADSRASDIPGLPTNPLRFAAS
EVSLHGALEVLHDKGGLDTLNSAIGSSLFRVETRDDGSHVAIGQKNGLETTVVLSEQEFSSLQSLDPEGKNKFVFTGGRG
GAGHAMVTVASDIAEARQRIIDKLEPKDTKETKEPGDPNSGEGKIIEIHTSTSTSSLRADPKLWLSLGTIAAGLIGMAAT
GIAQAVALTPEPDDPITTDPDAAANTAEAAAKDQLTKEAFQNPDNQKVNIDENGNAIPSGELKDDVVAQIAEQAKAAGEQ
ARQEAIESNSQAQQKYDEQHAKREQEMSLSSGVGYGISGALILGGGIGAGVTAALHRKNQPAEQTITTRTVVDNQPTNNA
SAQGNTDTSGPEESPASRRNSNASLASNGSDTSSTGTVENPYADVGMPRNDSLARISEEPIYDEVAADPNYSVIQHFSGN
SPVTGRLVGTPGQGIQSTYALLASSGGLRLGMGGLTGGGESAVSTANAAPTPGPARFV
>P0DTS9 3.2.2.6~~~tirS~~~NAD(+) hydrolase TirS~~~
MSVLETKLKSQMSKSAKIARNMNKLPDEIDRLRKRIERINKKRKPTSSNIRDLEKSNKQLVTKQQKLADLQVEYTKIEKK
INETKINLQKEQSRNQKKLSSMLDKNTKGNEEIMEKLLTNSDQINEISNQIKKAVNQKEIIEYDVFLSHSSLDKEDYVSK
ISEKLIEKGLKVFEDVKVFEIGKSQTETMNMGILNSRFVVVFLSPNFIESGWSRYEFLSFLNREINEEHVIILPIWHKVS
VEDVRAYNPYLVDKYALNTSDFSIEEIVEKIYQVIVNSKN
>B7UM99 ~~~tir~~~Translocated intimin receptor Tir~~~
MPIGNLGNNVNGNHLIPPAPPLPSQTDGAARGGTGHLISSTGALGSRSLFSPLRNSMADSVDSRDIPGLPTNPSRLAAAT
SETCLLGGFEVLHDKGPLDILNTQIGPSAFRVEVQADGTHAAIGEKNGLEVSVTLSPQEWSSLQSIDTEGKNRFVFTGGR
GGSGHPMVTVASDIAEARTKILAKLDPDNHGGRQPKDVDTRSVGVGSASGIDDGVVSETHTSTTNSSVRSDPKFWVSVGA
IAAGLAGLAATGIAQALALTPEPDDPTTTDPDQAANAAESATKDQLTQEAFKNPENQKVNIDANGNAIPSGELKDDIVEQ
IAQQAKEAGEVARQQAVESNAQAQQRYEDQHARRQEELQLSSGIGYGLSSALIVAGGIGAGVTTALHRRNQPAEQTTTTT
THTVVQQQTGGNTPAQGGTDATRAEDASLNRRDSQGSVASTHWSDSSSEVVNPYAEVGGARNSLSAHQPEEHIYDEVAAD
PGYSVIQNFSGSGPVTGRLIGTPGQGIQSTYALLANSGGLRLGMGGLTSGGESAVSSVNAAPTPGPVRFV
>Q7DB77 ~~~tir~~~Translocated intimin receptor Tir~~~
MPIGNLGHNPNVNNSIPPAPPLPSQTDGAGGRGQLINSTGPLGSRALFTPVRNSMADSGDNRASDVPGLPVNPMRLAASE
ITLNDGFEVLHDHGPLDTLNRQIGSSVFRVETQEDGKHIAVGQRNGVETSVVLSDQEYARLQSIDPEGKDKFVFTGGRGG
AGHAMVTVASDITEARQRILELLEPKGTGESKGAGESKGVGELRESNSGAENTTETQTSTSTSSLRSDPKLWLALGTVAT
GLIGLAATGIVQALALTPEPDSPTTTDPDAAASATETATRDQLTKEAFQNPDNQKVNIDELGNAIPSGVLKDDVVANIEE
QAKAAGEEAKQQAIENNAQAQKKYDEQQAKRQEELKVSSGAGYGLSGALILGGGIGVAVTAALHRKNQPVEQTTTTTTTT
TTTSARTVENKPANNTPAQGNVDTPGSEDTMESRRSSMASTSSTFFDTSSIGTVQNPYADVKTSLHDSQVPTSNSNTSVQ
NMGNTDSVVYSTIQHPPRDTTDNGARLLGNPSAGIQSTYARLALSGGLRHDMGGLTGGSNSAVNTSNNPPAPGSHRFV
>C6UYL8 ~~~tir~~~Translocated intimin receptor Tir~~~
MPIGNLGHNPNVNNSIPPAPPLPSQTDGAGGRGQLINSTGPLGSRALFTPVRNSMADSGDNRASDVPGLPVNPMRLAASE
ITLNDGFEVLHDHGPLDTLNRQIGSSVFRVETQEDGKHIAVGQRNGVETSVVLSDQEYARLQSIDPEGKDKFVFTGGRGG
AGHAMVTVASDITEARQRILELLEPKGTGESKGAGESKGVGELRESNSGAENTTETQTSTSTSSLRSDPKLWLALGTVAT
GLIGLAATGIVQALALTPEPDSPTTTDPDAAASATETATRDQLTKEAFQNPDNQKVNIDELGNAIPSGVLKDDVVANIEE
QAKAAGEEAKQQAIENNAQAQKKYDEQQAKRQEELKVSSGAGYGLSGALILGGGIGVAVTAALHRKNQPVEQTTTTTTTT
TTTSARTVENKPANNTPAQGNVDTPGSEDTMESRRSSMASTSSTFFDTSSIGTVQNPYADVKTSLHDSQVPTSNSNTSVQ
NMGNTDSVVYSTIQHPPRDTTDNGARLLGNPSAGIQSTYARLALSGGLRHDMGGLTGGSNSAVNTSNNPPAPGSHRFV
>A5A627 ~~~tisB~~~Small toxic protein TisB~~~
MNLVDIAILILKLIVAALQLLDAVLKYLK
>P27302 2.2.1.1~~~tktA~~~Transketolase 1~~~COG0021
MSSRKELANAIRALSMDAVQKAKSGHPGAPMGMADIAEVLWRDFLKHNPQNPSWADRDRFVLSNGHGSMLIYSLLHLTGY
DLPMEELKNFRQLHSKTPGHPEVGYTAGVETTTGPLGQGIANAVGMAIAEKTLAAQFNRPGHDIVDHYTYAFMGDGCMME
GISHEVCSLAGTLKLGKLIAFYDDNGISIDGHVEGWFTDDTAMRFEAYGWHVIRDIDGHDAASIKRAVEEARAVTDKPSL
LMCKTIIGFGSPNKAGTHDSHGAPLGDAEIALTREQLGWKYAPFEIPSEIYAQWDAKEAGQAKESAWNEKFAAYAKAYPQ
EAAEFTRRMKGEMPSDFDAKAKEFIAKLQANPAKIASRKASQNAIEAFGPLLPEFLGGSADLAPSNLTLWSGSKAINEDA
AGNYIHYGVREFGMTAIANGISLHGGFLPYTSTFLMFVEYARNAVRMAALMKQRQVMVYTHDSIGLGEDGPTHQPVEQVA
SLRVTPNMSTWRPCDQVESAVAWKYGVERQDGPTALILSRQNLAQQERTEEQLANIARGGYVLKDCAGQPELIFIATGSE
VELAVAAYEKLTAEGVKARVVSMPSTDAFDKQDAAYRESVLPKAVTARVAVEAGIADYWYKYVGLNGAIVGMTTFGESAP
AELLFEEFGFTVDNVVAKAKELL
>P33570 2.2.1.1~~~tktB~~~Transketolase 2~~~COG0021
MSRKDLANAIRALSMDAVQKANSGHPGAPMGMADIAEVLWNDFLKHNPTDPTWYDRDRFILSNGHASMLLYSLLHLTGYD
LPLEELKNFRQLHSKTPGHPEIGYTPGVETTTGPLGQGLANAVGLAIAERTLAAQFNQPDHEIVDHFTYVFMGDGCLMEG
ISHEVCSLAGTLGLGKLIGFYDHNGISIDGETEGWFTDDTAKRFEAYHWHVIHEIDGHDPQAVKEAILEAQSVKDKPSLI
ICRTVIGFGSPNKAGKEEAHGAPLGEEEVALARQKLGWHHPPFEIPKEIYHAWDAREKGEKAQQSWNEKFAAYKKAHPQL
AEEFTRRMSGGLPKDWEKTTQKYINELQANPAKIATRKASQNTLNAYGPMLPELLGGSADLAPSNLTIWKGSVSLKEDPA
GNYIHYGVREFGMTAIANGIAHHGGFVPYTATFLMFVEYARNAARMAALMKARQIMVYTHDSIGLGEDGPTHQAVEQLAS
LRLTPNFSTWRPCDQVEAAVGWKLAVERHNGPTALILSRQNLAQVERTPDQVKEIARGGYVLKDSGGKPDIILIATGSEM
EITLQAAEKLAGEGRNVRVVSLPSTDIFDAQDEEYRESVLPSNVAARVAVEAGIADYWYKYVGLKGAIVGMTGYGESAPA
DKLFPFFGFTAENIVAKAHKVLGVKGA
>Q60103 2.2.1.1~~~cbbT~~~Transketolase 2~~~
MTAHAAALAAADAPAPVDRSPGALGWPVTAALRALAMDGVEQAKSGHPGAPMGMAEIAAVLWREHLRHNPADPSWPDRDR
FVLSNGHGSMLIYALLHLTGYDLPIAELKRFRQLHSRTPGHPELGMTPGVETTTGPLGQGLANAVGMAIAEKTLAAQFNR
PGLSIVDHRTFVFLGDGCLMEGVSHEACSLAGRLGLGKLVAFYDDNGISIDGKVEEWFPDDTPARFAAYGWHVIRNVDGH
DPAMLRDAVEAALSETGKPTLICCKTTIGRGAPTKEGHQDTHGAPLGAEEIARTRAAMGWDHAPFEVPEDIYALWDARRS
GAARQSAWDARMEAYERAYPAEAAEFRRRLKGDLSPAFAATYAAALKATVEKAETVATRKASQLALAALAPAVPEFLGGS
ADLAHSNLTTFPGAVPITRDPAGNQIFYGVREFGMSAIANGIALHGGFIPFVATFLVFSDYARNAMRMSALMGQRVIYIL
THDSIGLGEDGPTHQPVEHVESLRLIPNLDVWRPADTVETLAAWHAALTRTNGPSAFILSRQNLPCWPRDAAQIEGIEAG
AYVLRESEGLARAVLVATGSEVKLAAAAADLLDTAGIPTRIVSMPCRERFEALTETERAALFPKGVPVVAVEAGVTRGWR
GLSGTRADGIIAIGIDRFGESAPEKDLWPLFGFTPEAVADAVRRAVG
>A0A0I9QGZ2 2.2.1.1~~~tkt~~~Transketolase~~~
MAHSIEELAITTIRTLSIDAIEKAKSGHPGMPMGAAPMAYTLWTKFMNHNPANPNWFNRDRFVLSAGHGSMLLYSLLHLS
GYDVSMDDLKQFRQWGSKTPGHPEYGHTPGVEATTGPLGQGIAMAVGMAMAERHLAATYNRDGFEIINHYTYAICGDGDL
MEGVASEAASLAGHLKLGRLIVLYDSNDISLDGELNLSFSENVAQRFQAYGWQYLRVEDGNNIEEIAKALEEARADLSRP
TLIEVKTTIGYGAPNKAGTSGVHGAPLGAQEAKLTKEAYRWTFAEDFYVPEEVYAHFRATVQEPGAKKEAKWNEQLAAYE
QAHPELAAQLKRAIEGKLPDGWEASLPVYEAGKSLATRSSSGEVINAIAKAVPQLFGGSADLASSNKTLIKGGGNFLPDS
YEGRNVWFGVREFAMGAALNGMALHGGLKVFGGTFFVFSDYLRPAIRLAALMGLPVIYVLTHDSIAVGEDGPTHEPIEHL
ASLRAMPNLSVIRPADANETAAAWRLALESTDKPTALVLTRQDVPTLAATAELAYEGVKKGAYVVSPAKNGAPEALLLAT
GSEVGLAVKAQEALAAEGIHVSVISMPSWDRFEAQPKSYRDEVLPPAVTKRLAIEMGASLGWERYVGAEGDILAIDRFGA
SAPGEKIMAEYGFTVDNVVRRTKALLGK
>P75611 2.2.1.1~~~tkt~~~Transketolase~~~
MKNLFACQHLALSAIQHAKGGHVGMALGASPILYTLWTKHIQFNPNCPKWINRDRLVMSAGHGSMALYPILHFAGLITKQ
EMLHHKYGQVNTSSHPEYAPNNFIDASTGPLGQGLGMAVGMALTQRVLAAEFKALSPKLFDHFTYVVVGDGDLQEGVSYE
VAHLAGVYQLNKLIVLHDSNRVQMDSVVRDVSLENLQTRFTNMGWNYLETSDAVADIDAAIKQAKKSDKPTFIEVHTTIA
KNTTLEDQPAGHWFIPTDKDFARFNSNTKTNFTPFEYPQTVYDFFHKQVIARQAKPVQAYKELLEKLKDKPLYTKFINWT
ENDYQALYLNQLDERKVAQANAATRNYLKDFLGQINNSNSNLYCLNADVARSCNIKLGDDNLHTNPHSRNIQVGIREFGM
STIMNGMALHGGVKVMGGTFLAFADYSKPAIRLGALMNLPTFYVYTHDSYQVGGDGPTHQPYDQLPMLRAIENVQVWRPC
DEKETAAGVNYGLLSQDQTNVLILTRQALPSLEQSDSVQTLKGGYIISNRKQPDVIVAASGSEVQLALQLEQALNEQQLK
TRVVSVPNINMLLSQPQSYLQQLFDPNSVLLTLEASASMEWYALAKYVKKHTHLGAFSFGESNDGQVVYEHKGFNVTNLL
KLIKTLKS
>P9WG25 2.2.1.1~~~tkt~~~Transketolase~~~COG0021
MTTLEEISALTRPRHPDYWTEIDSAAVDTIRVLAADAVQKVGNGHPGTAMSLAPLAYTLFQRTMRHDPSDTHWLGRDRFV
LSAGHSSLTLYIQLYLGGFGLELSDIESLRTWGSKTPGHPEFRHTPGVEITTGPLGQGLASAVGMAMASRYERGLFDPDA
EPGASPFDHYIYVIASDGDIEEGVTSEASSLAAVQQLGNLIVFYDRNQISIEDDTNIALCEDTAARYRAYGWHVQEVEGG
ENVVGIEEAIANAQAVTDRPSFIALRTVIGYPAPNLMDTGKAHGAALGDDEVAAVKKIVGFDPDKTFQVREDVLTHTRGL
VARGKQAHERWQLEFDAWARREPERKALLDRLLAQKLPDGWDADLPHWEPGSKALATRAASGAVLSALGPKLPELWGGSA
DLAGSNNTTIKGADSFGPPSISTKEYTAHWYGRTLHFGVREHAMGAILSGIVLHGPTRAYGGTFLQFSDYMRPAVRLAAL
MDIDTIYVWTHDSIGLGEDGPTHQPIEHLSALRAIPRLSVVRPADANETAYAWRTILARRNGSGPVGLILTRQGVPVLDG
TDAEGVARGGYVLSDAGGLQPGEEPDVILIATGSEVQLAVAAQTLLADNDILARVVSMPCLEWFEAQPYEYRDAVLPPTV
SARVAVEAGVAQCWHQLVGDTGEIVSIEHYGESADHKTLFREYGFTAEAVAAAAERALDN
>Q8YRU9 2.2.1.1~~~tkt~~~Transketolase~~~COG0021
MAVATQSLEELSINAIRFLAVDAIEKAKSGHPGLPMGAAPMAFVLWNRFMRYNPKNPKWFNRDRFVLSAGHGSMLQYALL
YLTGYDSVSIEDIKQFRQWESKTPGHPENFMTAGVEVTTGPLGQGIANGVGLAIAEAHLAAKFNKPDAKIVDHYTYVILG
DGCNMEGVSGEAASFAGHLGLGKLIALYDDNHISIDGSTDVAFTEDVSKRFESYGWHVIHVKDGNTDLEAIHKAIEEAKA
VTDKPTMIKVTTIIGYGSPNKSNTAGVHGAALGGDEVALTRQNLGWSHDPFVVPEDVLNYTRKAVERGAGYESDWNKTYA
DYKAKYPQEAAEFERYLSGKLADGWDKVLPSYTPEDKGLPTRKHSETCLNKLAAVLPELIGGSADLTHSNLTEIKGKGDF
QKGQYQNPNIHFGVREHGMGAICNGIALHGSGLIPYGATFLIFSDYMRAPIRLSALSQAGSIWVMTHDSIGQGEDGPTHQ
PIETLASLRAIPNLTVIRPADGNETSGAYKVAIERAKNNAPTLLAFTRQNVPNLAGTSIDDVAKGGYIVVDTDGTPDLIL
IGTGSELSLCVTAAEKLKAEGKKVRVVSLAAWDLFDAQDAAYKESVLPKAVTKRLAVEAASSFGWHKYIGSEGDAVTIDR
FGASAPGGVCLEKFGFSVDNVLAKAKQLLG
>P99161 2.2.1.1~~~tkt~~~Transketolase~~~
MFNEKDQLAVDTLRALSIDTIEKANSGHPGLPMGAAPMAYTLWTRHLNFNPQSKDYFNRDRFVLSAGHGSALLYSLLHVS
GSLELEELKQFRQWGSKTPGHPEYRHTDGVEVTTGPLGQGFAMSVGLALAEDHLAGKFNKEGYNVVDHYTYVLASDGDLM
EGISHEAASFAGHNKLSKLVVLYDSNDISLDGELNKAFSENTKARFEAYGWNYLLVKDGNDLEEIDKAITTAKSQEGPTI
IEVKTTIGFGSPNKAGTNGVHGAPLGEVERKLTFENYGLDPEKRFNVSEEVYEIFQNTMLKRANEDESQWNSLLEKYAET
YPELAEEFKLAISGKLPKNYKDELPRFELGHNGASRADSGTVIQAISKTVPSFFGGSADLAGSNKSNVNDATDYSSETPE
GKNVWFGVREFAMGAAVNGMAAHGGLHPYGATFFVFSDYLKPALRLSSIMGLNATFIFTHDSIAVGEDGPTHEPIEQLAG
LRAIPNMNVIRPADGNETRVAWEVALESESTPTSLVLTRQNLPVLDVPEDVVEEGVRKGAYTVYGSEETPEFLLLASGSE
VSLAVEAAKDLEKQGKSVRVVSMPNWNAFEQQSEEYKESVIPSSVTKRVAIEMASPLGWHKYVGTAGKVIAIDGFGASAP
GDLVVEKYGFTKENILNQVMSL
>Q5XAK5 2.2.1.1~~~tkt~~~Transketolase~~~
MATVSTGSLIFIVKKNSPMMSKLVFFWQNREKEFRDFGGFSEKSVYFCDTIDNRKRLILVVINREVLLMTFDAIDQLAVN
TVRTLSMDAIQAANSGHPGLPMGAAPMAYVLWNHFMNINPKTSRNWSNRDRFILSAGHGSAMLYSLLHLAGYDLSVEDLK
NFRQWGSKTPGHPEVNHTDGVEATTGPLGQGIANAVGMAMAEAHLAAKFNKPGFDIVDHYTFALNGDGDLMEGVSQEAAS
MAGHLKLGKLVLLYDSNDISLDGPTSMAFTEDVKGRFEAYGWQHILVKDGNDLEEIAAAIEAAKAETEKPTIIEVKTIIG
FGAEKQGTSAVHGAPLGAEGIAFAKKAYQWTHQDFEVPAEVTERFAQGLQARGEKAEQAWNDLFAAYEAEYPELAAEYQK
AFANEAAQVELEAHELGSSMASRVSSQQAIQQISEQVASFWGGSADLSASNNTMVKAETDFQPGHYEGRNVWFGVREFAM
AAAMNGIALHGGTRVYGGTFFVFSNYLLPAVRMAALQNLPTVYVMTHDSIAVGEDGPTHEPIEQLASVRSMPNLNVIRPA
DGNETNAAWKRAIAETDRPTMLVLTRQNLPVLEGTKELAEDGLNKGAYILSEAKGDLEGILIATGSEVKLAMDTQEALEA
EGIHVRVVSMPSQNIFDEQSAEYKESILPAAVTKRLAIEAGSSFGWAKYVGLAGKTLTIDTWGASAPGNRIFEEYGFTVA
NATELYKSL
>Q5SK82 ~~~~~~Lactate-binding periplasmic protein TTHA0766~~~COG4663
MKRVSRRAFLRRLGVGVAATAAFSPLAVAQARRYRWRIQTAWDAGTVGYSLFQKFTERVKELTDGQLEVQPFPAGAVVGT
FDMFDAVKTGVLDGMNPFTLYWAGRMPVTAFLSSYALGLDRPDQWETWFYSLGGLDIARRAFAEQGLFYVGPVQHDLNII
HSKKPIRRFEDFKGVKLRVPGGMIAEVFAAAGASTVLLPGGEVYPALERGVIDAADFVGPAVNYNLGFHQVAKYIIMGPP
ETPAIHQPVDLMDFTINLNRWRSLPKPLQERFIAAVHEYSWIHYAGIQKANLEAWPKYRQAGVEVIRLSNEDVRKFRRLA
IPIWFKWAKMDKYSREAFASQLEYMKGIGYVTDEELKGLSL
>P19568 ~~~tlcA~~~ADP,ATP carrier protein 1~~~COG3202
MSTSKSENYLSELRKIIWPIEQYENKKFLPLAFMMFCILLNYSTLRSIKDGFVVTDIGTESISFLKTYIVLPSAVIAMII
YVKLCDILKQENVFYVITSFFLGYFALFAFVLYPYPDLVHPDHKTIESLSLAYPNFKWFIKIVGKWSFASFYTIAELWGT
MMLSLLFWQFANQITKIAEAKRFYSMFGLLANLALPVTSVVIGYFLHEKTQIVAEHLKFVPLFVIMITSSFLIILTYRWM
NKNVLTDPRLYDPALVKEKKTKAKLSFIESLKMIFTSKYVGYIALLIIAYGVSVNLVEGVWKSKVKELYPTKEAYTIYMG
QFQFYQGWVAIAFMLIGSNILRKVSWLTAAMITPLMMFITGAAFFSFIFFDSVIAMNLTGILASSPLTLAVMIGMIQNVL
SKGVKYSLFDATKNMAYIPLDKDLRVKGQAAVEVIGGRLGKSGGAIIQSTFFILFPVFGFIEATPYFASIFFIIVILWIF
AVKGLNKEYQVLVNKNEK
>P0AGG8 3.4.-.-~~~tldD~~~Metalloprotease TldD~~~COG0312
MSLNLVSEQLLAANGLKHQDLFAILGQLAERRLDYGDLYFQSSYHESWVLEDRIIKDGSYNIDQGVGVRAISGEKTGFAY
ADQISLLALEQSAQAARTIVRDSGDGKVQTLGAVEHSPLYTSVDPLQSMSREEKLDILRRVDKVAREADKRVQEVTASLS
GVYELILVAATDGTLAADVRPLVRLSVSVLVEEDGKRERGASGGGGRFGYEFFLADLDGEVRADAWAKEAVRMALVNLSA
VAAPAGTMPVVLGAGWPGVLLHEAVGHGLEGDFNRRGTSVFSGQVGELVASELCTVVDDGTMVDRRGSVAIDDEGTPGQY
NVLIENGILKGYMQDKLNARLMGMTPTGNGRRESYAHLPMPRMTNTYMLPGKSTPQEIIESVEYGIYAPNFGGGQVDITS
GKFVFSTSEAYLIENGKVTKPVKGATLIGSGIETMQQISMVGNDLKLDNGVGVCGKEGQSLPVGVGQPTLKVDNLTVGGT
A
>Q9JRN7 1.1.1.135~~~tld~~~GDP-6-deoxy-D-talose 4-dehydrogenase~~~COG0451
MKILVTGGSGFIGKNLIYLLREKREFEVFGATVEETMDLTNPCSVQSVLEKTKPDFIVHLAALTFVPNNNPITFYLVNTI
GTENLLRSIVDLNVAKLGVLCFSTAGIYGIQETKLLSESLTPKPVNHYSMSKHCMEHIVNKYRCFRGITVVRPFNVLGLG
QNINFLVPKMVSAFVKKDKTIELGNLDSVRDFISVNDCCDIIYRLISKLIENETINICTGIGYSVYQIFQLLCEISMHQM
EIKQNELFVRHDDIPQMIGDPSKLLNVLGNDYRFTSVRAILEEMYKNRLLELSI
>O66256 1.1.1.339~~~tll~~~dTDP-6-deoxy-L-talose 4-dehydrogenase (NAD(+))~~~
MNIIITGANGYIGRYVVKELLNKGHKVIAILFDGESPHSFLSGAELFYGDIFALSQEKKVDLVQNAECLLHLAWQAGFNH
RDPSHLNNVMKHYQFLTSMAELGIKNISVAGTMHEVGYFVGPIDANTPCNPRNPYGIAKNFLRQAMFDFASVTPELNLRW
LRFYYITGDDRFNNSIFTKILKAEDEVQEYFPLNSGEMLYDFVDIKDLSLQIEERIISKESGIFNCCSGKPKSLRTAVEE
FIAEHNLKIKPKYNVFPARSYDSMAVWGAK
>P43221 ~~~tlpA~~~Thiol:disulfide interchange protein TlpA~~~COG0526
MLDTKPSATRRIPLVIATVAVGGLAGFAALYGLGLSRAPTGDPACRAAVATAQKIAPLAHGEVAALTMASAPLKLPDLAF
EDADGKPKKLSDFRGKTLLVNLWATWCVPCRKEMPALDELQGKLSGPNFEVVAINIDTRDPEKPKTFLKEANLTRLGYFN
DQKAKVFQDLKAIGRALGMPTSVLVDPQGCEIATIAGPAEWASEDALKLIRAATGKAAAAL
>Q9I0I4 ~~~tlpQ~~~Methyl-accepting chemotaxis protein TlpQ~~~
MFLRRLSIQWKITLLAGLCLLGVVALLVGLSVYRMQHSSVLVKSASTQMLDESARLRLEARGELQALRIQRYFMDAFQYG
KGFSRQILFLRDQAQKRFLDAYDLREDLTRQVRTALAANPEVLGLYVVFEPNALDGKDELFVDQPALGSNDKGRFSLYWA
QATPGQLESESMIESELADTSSGPSGAAYNAWYTCPKESGQPCVLDPYFDKVGERQLLMTSIAFPLELDGKVIGVMGLDI
NLSNLQALSEQGNRELYDGVGQVGILSPAGLFAGNSRDAGLLGKNLAKADPQHAGELLQLLAAGKSRLFNENDDLKVLQP
LQPIPGAKPWGVLLEVPKSALLGPALALERQLDDMRREGTWVELGLGLGAAVLGLLVLWLSARGVTRPILGVAHMLRDIA
SGEGDLTQRLPHTGRDELGELAGWFNRFLDKLQPIIRDVKVSVRDARSTADQSAAISSQTSAGMQQQFREIDQVATASHE
MTATAQDVARSAAQAADAARGADQATRDGLALIDRTTQSIDSLAANLTSAMGQVEQLASSSEEIGSVLEVIRAIAEQTNL
LALNAAIEAARAGDAGRGFAVVADEVRNLARRTQDSVEQIRGVIEGLQQGTRDVVDAMHGSHRQAQGSVEQVDEAVAALQ
RIGEAVTVINDMNLQIASAAEEQSSVAEEINRNVAAIRDVTESLSSQAEESAQVSQSLNRLANHQQGLMEQFKA
>Q45060 ~~~tlp~~~Small, acid-soluble spore protein Tlp~~~
MTKNQNQYQQPNPDDRSDNVEKLQDMVQNTIENIEEAEASMEFASGEDKQRIKEKNARREQSIEAFRNEIQDESAARQNG
YRS
>A0A0H3PEK7 2.1.1.226~~~tlyA~~~23S rRNA (cytidine-2'-O)-methyltransferase TlyA~~~COG1189
MRFDFFVSKRLNISRNKALELIENEEVLLNGKSFKASFDVKNFLENLKKTQDLNPEDILLTDGLKLDLLSEIYVSRAALK
LKNFLEENGIEIKHKNCLDIGSSTGGFVQILLENQALKITALDVGNNQLHLSLRTNEKIILHENTDLRTFKSEEKFELIT
CDVSFISLINLLYYIDNLALKEIILLFKPQFEVGKNIKRDKKGVLKDDKAILKARMDFEKACAKLGWLLKNTQKSSIKGK
EGNVEYFYYYIKN
>P9WJ62 2.1.1.226~~~tlyA~~~16S/23S rRNA (cytidine-2'-O)-methyltransferase TlyA~~~
MARRARVDAELVRRGLARSRQQAAELIGAGKVRIDGLPAVKPATAVSDTTALTVVTDSERAWVSRGAHKLVGALEAFAIA
VAGRRCLDAGASTGGFTEVLLDRGAAHVVAADVGYGQLAWSLRNDPRVVVLERTNARGLTPEAIGGRVDLVVADLSFISL
ATVLPALVGCASRDADIVPLVKPQFEVGKGQVGPGGVVHDPQLRARSVLAVARRAQELGWHSVGVKASPLPGPSGNVEYF
LWLRTQTDRALSAKGLEDAVHRAISEGP
>P9WJ63 2.1.1.226~~~tlyA~~~16S/23S rRNA (cytidine-2'-O)-methyltransferase TlyA~~~COG1189
MARRARVDAELVRRGLARSRQQAAELIGAGKVRIDGLPAVKPATAVSDTTALTVVTDSERAWVSRGAHKLVGALEAFAIA
VAGRRCLDAGASTGGFTEVLLDRGAAHVVAADVGYGQLAWSLRNDPRVVVLERTNARGLTPEAIGGRVDLVVADLSFISL
ATVLPALVGCASRDADIVPLVKPQFEVGKGQVGPGGVVHDPQLRARSVLAVARRAQELGWHSVGVKASPLPGPSGNVEYF
LWLRTQTDRALSAKGLEDAVHRAISEGP
>K9UJK2 ~~~~~~Potassium channel Cha6605_3372~~~COG3548
MVEAPEQSETGRIEAFSDGVFAIAITLLVLEIKVPQHKIVETVGLVSSLLSLWPSYLAFLTSFASILVMWVNHHRIFSLV
ARTDHAFFYWNGLLLMLVTFVPFPTALLAEYLIHPQARVAASVYAGIFLAIAIVFNRLWKHAATADRLLAQKADRHEVDA
ITKQYRFGPGLYLVAFALSFISVWLSVGVCFVLAIYFALRSNA
>A0A086F3E3 ~~~~~~Potassium channel HX13_20290~~~
MTKGRLEAFSDGVLAIIITIMVLELKVPEGSSWASLQPILPRFLAYIFSFIYVGIYWNNHHHLFQTVKKVNGSILWANLH
LLFWLSLMPIATEWIGTSHFAQNPVATYGIGLIMSAIAYTILENVIIRCEGENSKLKEAIHSKFKEYISIIFYVLGIATS
FFYPYIAIGFYYLVALIWLIPDKRIEKSLKEN
>E4TN31 ~~~~~~Potassium channel Ftrac_2467~~~COG3548
MRKVFETVVGLNPNFSFRGKQQTRIETFSDAVFALAITLLVLSSTIPETFEDLWASMRDVIPFAICVALIIVIWYQHYIF
FLKYGLQDKVTILLNTILLFVLLVYVYPLKFLARFLSEIYGGIFGIIETDLSRFGEYSHQNLKLLMVNYGLGAFAIFLVF
SLMYWRAYKMKSLLDLNSYEIFDTKSSIIANLLMCSVPLLSLIITLIDPWGNFRTTILSGFLYFLYVPIMIVFGRITSKK
SRRLLQD
>S5VBU1 ~~~~~~Potassium channel B446_29190~~~COG3548
MNESGRVEAFSDGVFAIAITLLILDIKVPKADGPGGLWHALGAQWPSYAAYVVSFLVIGIMWVNHHQVFSYVARVDRALM
FLNLLVLMVVAAVPWPTAMLAEYLREDRASHVAAAVYSLVMVAMALAFQALWWHLTRTGHLFDPRVDAPAARATRIRFAL
GSLGYPLTVGLAFVSAPLTLAAHGLLALYYGFNQVPVPTREAAAPS
>P0DV85 ~~~~~~Retron Ec48 transmembrane protein~~~
MNANIRLLKYIVGVSSALFLIFSLISLFETIQNEKLYERDICFDSQCLKFFAEKTSGIVMYFQAFGWLITTFVTVFGVMI
ALMTYNAGVKNNNNSNYTSHLTMFREFASAELTKRSSIYPEKVNFFRWYRVMFPEAQGGDISVSRDYLEIISRIKCVIEE
ANAHITEENKDYKYKTHQRKMMAVLDEIGISISNGPKNIFIEVESQILDYIDTINLSFCHSSSVIELSRVKRKYI
>A0A0B5RUB0 ~~~tmaT~~~Trimethylamine transporter~~~
MFKKLLDNKNLVINPPVFITSILLIVALILTCVLFPEKVGVWFPAAQLAVTSNFGWFFVVTVNVILIFAIYLAFSKFGRI
RLGGDDAEPEFTKASWFAMLFSTGMGIGIMFFSIAEPVSHFFNTPRPVDTDIEAAVQAMQFTSLHWGLHAWGIYAMVGLA
LAFFGFNRKLPMTFRSLFYPFWGERIHGWWGHIIDILSALATVFGLSTSLGLGVIQITAGLEYLYGWEISPMMQAGIILF
VIGIATISVFSGLDKGVKILSNANMYIAASFMLLIFILGPTLFIMKGYVENTGAYLANFIDISTWNDTYLGSGWQNVWTI
FYWAWWIAWSPFVGSFIARISKGRTVKEFVLGVLIVPGLITLLWMNVFGGSALHTILSGDVTMIAAVKADVSTALFVFLE
NFPFTKFLSIVAIILIFSFFITSSDSGSLVVDNITSGSNGESPVWQRVFWSFAQGIIAIVLLWGGGLDALQTAVIITGLP
FAVILLVMCYSLQKGLKEELAKSSKKAKSKEEKSYKEIIAELLDEPQSK
>Q8YSQ6 ~~~~~~Monocarboxylate 2-oxoacid-binding periplasmic protein all3028~~~COG4663
MKRREVLNTAAIATATTALVSCTQTNTSSVQAGLPNVRWRMTTSWPKSLGTFIGAETVAKRVAEMTNGRFKITPFAAGEL
VPGLQVLDAVQAGTVECGHTSSYYYIGKSPALAFATSVPFGLNAQQQYAWLYQGGGLAAIQKIYANFNVINFPAGSTGAQ
MGGWFKKEIKSVSDLKGLKMRIPGLGGQVMSRLGVNVQVLPGGEIYLALDRGAIDAAEWVGPYDDEKLGLNKAAQFYYYP
GWWEPGPTLDVLVNLNAWNRLPKEYQEIFKTATVEANLTMLNQYDALNGEALTRLLAGGTKLVPYSQEIMQAAQKISFDI
FEENASKDAAFKQVYEQWKAFRKQIFAWNRVNELSYENFASSSQ
>O34513 6.3.4.-~~~tmcAL~~~tRNA(Met) cytidine acetate ligase~~~COG1323
MKAVGLVVEYNPFHNGHLYHAQTAKLQTGCDTAVAVMSGHFLQRGEPAVVSKWARTKMALQSGVDLVIELPYLYAVQKAD
IFARGSVSILNELECEALFFGSENGDIKPFLETAQLIDEHKHILNDRIKEELKKGASYPAAAAIAFSSILHTESALDLSK
PNNILGYQYVTSILTGGYPMKPYTTARISSDYHDADLPEGENHIASATSIRKAMIGQNLEACLRFLPAASARELAAYRKS
FGLWHTPESYFSYLKYSLSTVTARELQQVYEVEEGLEHRIIRSIRKSSSYQEFMELLKTKRYTWTRLQRMNTHILTRTKK
QDMQKLLDNDKAPYIRLLGMTKKGQAYLSEKKKALSVPLVSKLSSFSHPALDLDVKASRIYSLPIEEPLRTEFDLQEYGH
APIRYDEDEQHFLNV
>Q97P99 6.3.4.-~~~tmcAL~~~tRNA(Met) cytidine acetate ligase~~~COG1323
MTITGIIAEFNPFHNGHKYLLDQAEGLKIVAMSGNFMQRGEPAIVDKWTRTQMALENGADLVVELPFLVSVQAADFFGQG
AMDILDRLGIDSLVFGTEEVRDYQKIADLYTEKGAEMEKFVENLPDSLSYPQKTQAMWKEFAGLDFSGNTPNHVLALAYA
KAVAGRNIKLHPIQRQGAGYHSVNKDVDFASATALRQHQKDQDFLERFMPSVALFEQASKVIWEDYFPLLRYQILSNPDL
TTIYQVNQEMAVRIKEAIKTAQSVEELVELVTTKRYTKARVRRLLTYILMQARESDLPEAIHVLGFTEKGRQHLKSLKGQ
VSLVSRIGKEPWDAMTQKADQIYQLGKPSIAEQNFGRVPIRIETN
>Q9X1K1 6.3.4.-~~~tmcAL~~~tRNA(Met) cytidine acetate ligase~~~COG1323
MEYNPFHNGHLYHLTSARELVKPDYTIAVMSGNFCQRGEPAVIDKFARAEIALRMGVDVVLELPVVFATQDAGGFAFGAV
CVLDATGVVTDVVFGSESNDIEFLQRVARILYEQPDEYQKFLHEELKKGYSFPNARKYALMRYFSMKGWNEEEVLKLEKS
NDILGVEYIHSALKIGSNIRFHTIKRVGAEEKDTSFRGRFSSATAIRNLMREKRWEEVRDSLPEDSFEILMREINEGRGP
VFLENMGDFLLSFFRLKNMDFFEKIHGFSEGLEKRFHVCARQTGSYRDFLECVKAKRFTFSRIRRLALFSVFEVNKEFVE
KSNTKGPQYIRILGFTEKGREILSLMRKKAKLPIVTNMSLYRKVLEKTDLPVDKQLFLEQIDLDVKATNFYSMFFPSVEQ
RCGERDFSIHPIFLRTEM
>P76562 2.3.1.193~~~tmcA~~~tRNA(Met) cytidine acetyltransferase TmcA~~~COG1444
MAELTALHTLTAQMKREGIRRLLVLSGEEGWCFEHTLKLRDALPGDWLWISPRPDAENHCSPSALQTLLGREFRHAVFDA
RHGFDAAAFAALSGTLKAGSWLVLLLPVWEEWENQPDADSLRWSDCPDPIATPHFVQHLKRVLTADNEAILWRQNQPFSL
AHFTPRTDWYPATGAPQPEQQQLLKQLMTMPPGVAAVTAARGRGKSALAGQLISRIAGRAIVTAPAKASTDVLAQFAGEK
FRFIAPDALLASDEQADWLVVDEAAAIPAPLLHQLVSRFPRTLLTTTVQGYEGTGRGFLLKFCARFPHLHRFELQQPIRW
AQGCPLEKMVSEALVFDDENFTHTPQGNIVISAFEQTLWQSDPETPLKVYQLLSGAHYRTSPLDLRRMMDAPGQHFLQAA
GENEIAGALWLVDEGGLSQQLSQAVWAGFRRPRGNLVAQSLAAHGNNPLAATLRGRRVSRIAVHPARQREGTGRQLIAGA
LQYTQDLDYLSVSFGYTGELWRFWQRCGFVLVRMGNHREASSGCYTAMALLPMSDAGKQLAEREHYRLRRDAQALAQWNG
ETLPVDPLNDAVLSDDDWLELAGFAFAHRPLLTSLGCLLRLLQTSELALPALRGRLQKNASDAQLCTTLKLSGRKMLLVR
QREEAAQALFALNDVRTERLRDRITQWQLFH
>F8J3D9 6.2.1.60~~~tmlU~~~Marinolic acid--CoA ligase~~~
MNESALNPNYLFHPFVKRFPGDPEHVRNISDIQQLEKYPISERFPCQTVTDALFIAAKKYQHARAMSYLPNGRADDVLQS
WSYLEFLEQCIGAANLFHSLGIESDHSVAFLLPNMPEMVFGLWGAQAVAISTPINPFLAVNHICGIVTETKTTILVTLSP
DTNPSLFEKALAVKHNTTHTMTLVTIGTPCEDAIDWHTELKKQPFNRYLFNRQLTGMETSAYFHTGGTTGTPKIARHTHR
GAMINACQMLIVGPTETELDTKSKVSLCALPLFHVNAIVVSSLTSLLNGSELLLAGQQGFRNKALMSDFWRIVERFKVNF
FAGVPTVYAALLEQPVEQHNIDSLFYCGCGSSPMPQVLIKEFTQRTGADICEGYGMTETTACASTHYYYGDRKVGSVGMR
VPYQHIRAVHLDDNGQIIKECDCDEVGVLLIQGPNVIPEYKQAFANEQAWPEPGWLNTGDLGKFDADGYLWLTGRQKDLI
IRGGHNIDPLIIENTLVSHSDVVMAAAVGKPDAYAGELPVAYVTLTSGATLSADELKQYCKDYISEPAASPVEIYITAEL
PMTPIGKIFKLPLKHDVIVRFVRELIQALDDTLDFSIDIIDDVSTGNIVSINFAATREKMEAVASQLQQELDKLHFQWKC
TFTPVLAEQTVLES
>B8EIZ7 1.14.13.148~~~tmm~~~Trimethylamine monooxygenase~~~COG2072
MTRVAIIGAGPSGLAQLRAFQSAGKKGAAIPELVCFEKQSDWGGLWNYTWRTGVDEYGEPVHGSMYRYLWSNGPKECLEF
ADYSFEEHFGRPIPSYPPRAVLHDYIMGRVEKSDVRKFVRFSTVVRWIDFDETTQLFTVTVKDLKKDELYSETFDYVVVA
SGHFSTPNVPHFPGIEVFPGRVLHAHDFRDANEFVGKNLLVVGSSYSAEDIASQCYKYGAKSITFSYRSKPLNFDWPECF
TVKPLLTKLTGKTAHFKDGSEAVVDAVLLCTGYLHHFPFLADNLRLKTNNRLYPAGLYKGIFWQDNPKLIYLGMQDQYFT
FNMFDAQAWYARDVILGRIKLPAAEERQADIDHWRGLEEKLETAFDGIDFQTEYMRDLIPATDYPMFDLDKVAALFKEWE
EDKVKSIMGYRDNSYVSIMTGNKAPPHHTKWMEALDDSFDAFQNRPEAAAE
>A0A0B5RNJ4 1.14.13.148~~~Mptmm~~~Trimethylamine monooxygenase~~~
MLNLKVGIIGAGPSGLAMLRAFESEQKKGNPIPEIKCYEKQDNWGGMWNYTWRTGVGKYGEPIHGSMYKYLWSNGPKECL
EFSDYTFMEHFKQPISSYPPREVLFDYIQGRIKQSNARDFIKFNTVARWVDYLEDKKQFRVIFDDLVKNETFEEYFDYLV
VGTGHFSTPNMPYFKGIDSFPGTVMHAHDFRGADQFIDKDILLIGSSYSAEDIGVQCFKHGSKSVTISYRTNPIGAKWPK
GIEEKPIVTHFEDNVAHFKDGSKKEYDAVILCTGYQHKFPFLPDNLRLKTKNNLYPDNLYKGVVFNENERLIFLGMQDQY
YTFNMFDTQAWFARDYMLGRIALPNKEIRDKDIAKWVELEKTSVTGEEHVDFQTDYIKELIEMTDYPTFDLDRVAEMFKS
WLNDKETNILNYRDKVYTSVMTGVTAEEHHTPWMKELDDSLERYLDEVEVDELELSKENYY
>B6BQB2 1.14.13.148~~~tmm~~~Trimethylamine monooxygenase~~~COG2072
MSKVAIIGAGPCGLSILRAFEHLEKKGEKIPEIVCFEKQESWGGLWNYNWRTGSDQYGDPVPNSMYRYLWSNGPKECLEF
ADYSFDQHFGKSIPSFPPREVLQDYILGRVSKGNIKNKIKFNTRVINTVYRNDKFEINYQDKVNDKTLSDTFDYLVVSTG
HFSVPFIPEYEGMSSFPGRIMHSHDFRDAEEFRGKNVIVLGSSYSAEDVALQCNKYGAKSVTIGYRHNPMGFKWPKGMKE
VHYLDKLDGKKAIFKDGTEQDADVVILCTGYLHHFPFLDESLKLKTHNRLYPPKLYKGVVWQDNHKLLYLGMQDQFHTFN
MFDCQAWFARDVIMDKIKMPSDDEIDKDINKWVSMEEKLENPDQMIDFQTEYTKELHNISDYPKIDFELIRKHFKEWEHH
KVEDILTYRNKSFSSPVTGSVAPVHHTPWEKAMDDSMKTFLNKR
>Q1V023 1.14.13.148~~~tmm~~~Trimethylamine monooxygenase~~~
MTKVAIIGAGPCGLSALRSFEQAEKNGEKIPEIVCFDKQEDWGGLWNYSWRTGSDQYGDPVPNSMYRYLWSNGPKECLEF
ADYSFDEHFGKPIPSFPPREVLYNYILGRVKKGNLKSKIKFNTTVTNVSYDNENFEVTYRDKKNDKISKDIFDYVIVSTG
HFSVPFIPEYPGMKAFPGRIMHSHDFRDAEEFRGKNVVVLGSSYSAEDVALQCHKYGAKSVTIGYRHNPMGFKWPEGMKE
VFHLDRLEGNKAIFKDGHVQETDAVILCTGYLHHFPFMSEDLKLKTGNRLYPPMLYKGVVWQNNHKLMYLGMQDQFHTFN
MFDCQAWFARDVIMGKIKTPNDSEIEKDINKWVSMEEKLENADQMIDFQTEYTKELHELSDYPKIDFELIRKTFKEWEHH
KVENIMTYRNKSFASPVTGSVGPIHHTPWEEAMDDSLKTFLNK
>A3SLM3 1.14.13.148~~~tmm~~~Trimethylamine monooxygenase~~~COG2072
MTKRVAVIGAGPSGLAQLRAFQSAADQGAEIPEIVCFEKQANWGGLWNYTWRTGLDENGEPVHCSMYRYLWSNGPKEGLE
FADYSFEEHFGKQIASYPPRAVLFDYIEGRVHKADVRKWIRFNSPVRWVSYDAETAKFTVTAHNHETDSTYSEDFDHVIC
ASGHFSTPNVPFYEGFDTFNGRIVHAHDFRDAREFEGKDVLVMGASYSAEDIGSQCWKYGAKSITSCYRSAPMGYAWPDN
WEEKPALEKLTGKTAHFADGSTRDVDAIILCTGYKHFFSFLPDDLRLKTANRLATADLYKGVAYVHNPAMFYLGMQDQWF
TFNMFDAQAWWVRDAILGRITLPKDKAAMLADVAERETREEASDDVKYAIRYQADYVKELVAETDYPSFDIDGACDAFFE
WKKHKAKDIMAFRDNSYKSVITGTMAPVHHTPWKEALDDSMEAYLQN
>A3VVZ4 1.14.13.148~~~tmm~~~Trimethylamine monooxygenase~~~COG2072
MTKKRIAIIGAGPSGLAQLRAFQSAAAKGAEIPEIVCFEKQDNWGGLWNYTWRTGLDQYGEPVHGSMYRYLWSNGPKEGL
EFADYSFEEHFGKQIASYPPRAVLFDYIEGRVLKAGVRNLIRFSTAVRWVEKAGDKFNVTVCHLPEDRTYTEEFDHVIVC
SGHFSTPNVPYFPGFENFKGRVLHAHDFRDALEFKDKDILIVGTSYSAEDIGSQCWKYGCKSVTVSHRTAPMGFNWPDNW
QEVPLLQKVEGNTAYFKDGTTKDVDAVILCTGYKHHFPFLPDDLRLKTANRLATADLYKGVAFVREPALFYLGMQDQWFT
FNMFDAQAWWVRDVIMGRIALPDQATMEADVIDRVTREDAGEDDYAAIWYQGDYVKELIDETDYPSFDVEGACKAFKEWK
GHKKKDIMGFRNNAYKSVITGTMAPMHHTPWKDALDDSLEVYLQN
>Q5LT63 1.14.13.148~~~tmm~~~Trimethylamine monooxygenase~~~COG2072
MTTSKRVAIIGAGPSGLAQLRAFQSAAAKGEEIPEIVCFEKQDNWGGLWNYTWRTGLDENGEPVHCSMYRYLWSNGPKEG
LEFADYSFEEHFGKQIASYPPRAVLFDYIEGRVLKANVRDWIRFSTAVRWIDYNDETGLFKVTVHDHTNDRVYSEEFDHV
ICASGHFSTPNVPHYEGFETFNGRLVHAHDFRDAREFAGKDILVVGSSYSAEDIGSQCWKYGAKSITSCYRSAPMGFKWP
DNWEEKPALVKVDKNTVFFSDGTSREVDAIILCTGYKHFFNFLPDDLRLKTANRLATADLYKGVVYVHNPKMFYLGMQDQ
WFTFNMFDAQAWWVRDAIMGKIDLSNVTKEQMLADVTERETREEASDDVKYAIRYQADYVKELVAETDYPSFDIDGACEA
FFQWKKHKGEDIMAFRNNSYTSVITGTLAPVHHTPWKDALDDSMEAYLRN
>Q00456 1.14.13.236~~~tmoA~~~Toluene-4-monooxygenase system, hydroxylase component subunit alpha~~~
MAMHPRKDWYELTRATNWTPSYVTEEQLFPERMSGHMGIPLEKWESYDEPYKTSYPEYVSIQREKDAGAYSVKAALERAK
IYENSDPGWISTLKSHYGAIAVGEYAAVTGEGRMARFSKAPGNRNMATFGMMDELRHGQLQLFFPHEYCKKDRQFDWAWR
AYHSNEWAAIAAKHFFDDIITGRDAISVAIMLTFSFETGFTNMQFLGLAADAAEAGDYTFANLISSIQTDESRHAQQGGP
ALQLLIENGKREEAQKKVDMAIWRAWRLFAVLTGPVMDYYTPLEDRSQSFKEFMYEWIIGQFERSLIDLGLDKPWYWDLF
LKDIDELHHSYHMGVWYWRTTAWWNPAAGVTPEERDWLEEKYPGWNKRWGRCWDVITENVLNDRMDLVSPETLPSVCNMS
QIPLVGVPGDDWNIEVFSLEHNGRLYHFGSEVDRWVFQQDPVQYQNHMNIVDRFLAGQIQPMTLEGALKYMGFQSIEEMG
KDAHDFAWADKCKPAMKKSA
>Q00457 1.14.13.236~~~tmoB~~~Toluene-4-monooxygenase system, hydroxylase component subunit gamma~~~
MSAFPVHAAFEKDFLVQLVVVDLNDSMDQVAEKVAYHCVNRRVAPREGVMRVRKHRSTELFPRDMTIAESGLNPTEVIDV
VFEE
>Q00458 ~~~tmoC~~~Toluene-4-monooxygenase system, ferredoxin component~~~
MSFEKICSLDDIWVGEMETFETSDGTEVLIVNSEEHGVKAYQAMCPHQEILLSEGSYEGGVITCRAHLWTFNDGTGHGIN
PDDCCLAEYPVEVKGDDIYVSTKGILPNKAHS
>Q00459 ~~~tmoD~~~Toluene-4-monooxygenase system, effector component~~~
MSTLADQALHNNNVGPIIRAGDLVEPVIETAEIDNPGKEITVEDRRAYVRIAAEGELILTRKTLEEQLGRPFNMQELEIN
LASFAGQIQADEDQIRFYFDKTM
>Q00460 1.14.13.236~~~tmoE~~~Toluene-4-monooxygenase system, hydroxylase component subunit beta~~~
MSFESKKPMRTWSHLAEMRKKPSEYDIVSRKLHYSTNNPDSPWELSPDSPMNLWYKQYRNASPLKHDNWDAFTDPDQLVY
RTYNLMQDGQESYVQSLFDQFNEREHDQMVREGWEHTMARCYSPLRYLFHCLQMSSAYVQQMAPASTISNCCILQTADSL
RWLTHTAYRTHELSLTYPDAGLGEHERELWEKEPGWQGLRELMEKQLTAFDWGEAFVSLNLVVKPMIVESIFKPLQQQAW
ENNDTLLPLLIDSQLKDAERHSRWSKALVKHALENPDNHAVIEGWIEKWRPLADRAAEAYLSMLSSDILPAQYLERSTSL
RASILTV
>Q03304 1.18.1.3~~~tmoF~~~Toluene-4-monooxygenase system, ferredoxin--NAD(+) reductase component~~~
MFNIQSDDLLHHFEADSNDTLLSAALRAELVFPYECNSGGCGACKIELLEGEVSNLWPDAPGLAARELRKNRFLACQCKP
LSDLKIKVINRAEGRASHPPKRFSTRVVSKRFLSDEMFELRLEAEQKVVFSPGQYFMVDVPELGTRAYSAANPVDGNTLT
LIVKAVPNGKVSCALANETIETLQLDGPYGLSVLKTADETQSVFIAGGSGIAPMVSMVNTLIAQGYEKPITVFYGSRLEA
ELEAAETLFGWKENLKLINVSSSVVGNSEKKYPTGYVHEIIPEYMEGLLGAEFYLCGPPQMINSVQKLLMIENKVPFEAI
HFDRFF
>Q8KIY1 2.7.13.3~~~tmoS~~~Sensor histidine kinase TmoS~~~
MSSLDKRKTQNRSKKNSYSICLKEKASAELKREELARIIFDGLYEFVGLLDAQGNVLEVNQAALNGAGVTLEEIRGKPFW
KARWWQISKESVANQKRLVEAASSGEFVRCDIEILGKSGGREVIAVDFSLLPIRDEQENIVFLLAEGRNITDKKKAEAML
ALKNHELEQLVERIRKLDNAKSDFFAKVSHELRTPLSLILGPLETIMEAESGRGSPYWKKFEVIQRNAMTLLKQVNTLLD
LAKMDAQQMGLSYRRADLSQLTRVISSNFDGIAQQKSITLDAELPPHLIAEVDCEKYERIILNLLSNAFKFTPDGGLIRC
HLSLSQPAHALITVSDSGPGIPQNLRKEIFERFHQLNQEGQQANQGTGLGLSIVKEFVELHHGTISVSDAPGGGALFQVK
LPLNAPEGAYVANNAMSRSDNPQTVNPDEYLLPIPTAGSGAELPQFQSDQPRVLIVEDNPDMRCFIRDCLSTDYQVYVAP
DGAKALELMCSAPPDLLITDLMMPVMSGDTLVHKVREKNEFAHIPIMVLSAKPDEKLRVKLLSESVQDYLLKPFSAHELR
ARVSNLISMKIAGDALRKELSDQSNDIALLTHRLIKSRHRLQQSNIALTASEARWKAVYENSAAGIVLTDTENRILNANP
AFQRITGYTEKDLAQLSMEQLTPPNERTQMKQRLARLLQSGGAEYSVECSYLCKNGSTIWANASVSLMSPRVDEPQVILQ
IIDDITEKKQAQETLNQLQQELVQVSRSATMGEFAAYIAHEINQPLSAIMTNANAGTRWIGNEPPNIMEAKEALARIIRD
SDRAADIIRMVRSFLKRQGPVLKPIDLKALVADTTLILKAPSQSNGVSLNVIAGDTLPAIMGDAVQIQQLVINLAMNSIE
AMSQVGCETRQLALSFSSNASNDALIICVKDTGPGIPEDQIGQLFNAFYTTKKEGLGMGLAICLTIAEVHNGKIWAESPP
AGGACFFVSIPVS
>Q8KR08 ~~~tmoT~~~Response regulator protein TmoT~~~
MFLRKYPGQLGRQHMNDQESVIYIVDDDNAVLEALSSLVRSIGLRVKCFSSATAFLNDVGQLACGCLILDVRMPEMSGLD
VQRKLAELGEQIPIIFISGHGDIPMAVKAIKAGAIDFFTKPFREEDLLGAIRTALKLAPQQKENAPQISELKASYESLSK
REQQVLKFVLQGFLNKQTALELDISEATVKVHRHNIMKKMKVSSVQDLVRVTERLKDSLK
>Q4FL38 ~~~tmoV~~~Trimethylamine N-oxide transport system permease protein TmoV~~~COG4176
MELLKKYPKFFQWLFLLIVFFSLCFAIEVPETYNFIRGQAEFIKDPNQSTYTLFGAEVRYYAFDVFWRLPPLLGWLPIWI
NDSLFFLMNEWMPMEFWNEDIQEFRTQPLLLQITRNLTSFMTFLIELIREILLGGVETIVSFSSWDWIDANPWAELPGLP
WTIVTAGAVILGYKLSGKGLALFAGLVMIYISVFGQWKPSMQTLSFILVAAPLSFLFGLTFGVMAFKSKRVEKFLYPILL
VMQTMPQYAVLVPAIVLFGIGDHAAVIITMVVAVPPMILLTLLGLRGIPSEVIEAGRMSGCNNWQLMTKVLIPTARRDIL
IGVNQVIMVCFSMAVISAFIGAKGLGFNLLLALNQLNIGLALEAGLCISLIAILLDKMSLAWANKQIDYFGNLTYFQRNK
NILFFAAAVILGIIFSYLGSFYFKDGSNYLFEVPHNKGISTADFWNKGVDWIWDTFFHTLKIFNTWLIVDVLQPMRALYL
RMPAVATLVLVIGAGYIIGGIRSALVVGGLTLFIALSPWWDRALVTLYMATFGVFISTIIGFTVGIISFQNKHTANFMLG
VCDIFQTFPSFVYLIPVMMLFGVTDTSVLIAVIVYATIPATRYTIEGLRSVPEALHDAATMSGVNKVQRLLKIEFPLAFP
HMMLGLNQTIVFALFMVIIGAFIGTEDLGQYILKALSDKKGAGIGLTLGLCVAFIGLIFDHLIRTWVGKRKKHLGIG
>Q5LT64 ~~~tmoV~~~Trimethylamine N-oxide transport system permease protein TmoV~~~COG4176
MSVTTTETTGTAPALPRKTLGLAMIGLAVMMTLLHYAGLLPAWLHRLPEAIIPPFATWLDAIFNFVKDDLGLLALTRLLT
DGLEVVLDATANLLFGKRRWPNIGPIPWSAIAAMTAVVGYYLGGWRMALLAGGTFVWTAMIGQWDIAMQTMSVLVVAAPL
AFAIGLVLGISAWKSPSFDAVLRPVLAVLQTLPFFTYLLPAVIFFKVGPTAGAVATTVYAIPPMILMTTLGLQKVSPEVV
EAGKMSGCTRWQMLRHVYIPAARTEILVGVNQVIMLCLAMVVLTAFIGMPGLGAKLLAMMGSFKIGRSFEIGVTIVLLAV
TLDRMSKAWVVKLPEHFERGTPFWIRHKFLLMAIGAFVGFTLIAQVVPILSEVGRKQSWSQGKEIDTLIKGFLAIDAVQA
ITNSIRYVLNIWVLNPLRDFMLSIPTVAFVLFISAAALLVAGRREAVLAAAFFGLVALTGWWDRSVITLYSVLAAVSIAL
LLGVPIGVVAARKEKTANAVLLACDTAQTFPSFIYLIPAIMLFGITATSVVMSILIFSMVPLVRYTIEGLRNVPDEMTEA
ADMAGATRMQKLWNVQLPLALPTMAVGFNQAIMFAFFMVIIAAFIGTQDLGQELQRTLAGTDLGKNFVLGICVTLMALTF
DMVIMKWADDKKARLGLN
>Q4FL37 7.6.2.9~~~tmoW~~~Trimethylamine N-oxide transport system ATP-binding protein TmoW~~~COG4175
MSDPVIKCESVYKIFGSNAKKMLHEANGNVDAKTFQDNGCIVGVNNASFEVVKGEMLVVMGLSGSGKSTLLRCISRLTDA
TSGKIYIDGQDLLTLNNKELIELRRNKMGMVFQSFALLPHKTVVENIAFPLQIKGIKTQDSINKAMEMVKLVGLDGRENY
FPRELSGGQQQRVGIARSLAVEPDIWFLDEPFSALDPLIRKEMQDEFLRLQEKLQKTIMFITHDFDEALRLADRIAIMKD
GVIEQLDTPANIVLNPATEYVRKFTEEVPRGKVLKIADLMEKPETENLSDFKVSKNEIIENVAEKILTQEKSVAVTDENN
KIVGSVHPSKIIHTVFSREKK
>Q5LT65 7.6.2.9~~~tmoW~~~Trimethylamine N-oxide transport system ATP-binding protein TmoW~~~COG4175
MRFMGSPVISARNVWKIFGKDPVGYLKTLQPGRSFDDIRADGYIAGVRDVSLDVARGEMLVIMGLSGSGKSTLVRCFSRL
HEITGGSIEVDGQIIGDLSEKDLIELRRNKMGMVFQSFGLLPHRTVLDNVAFPLEMRGQDRHTRRKRALEVIELVGLAGR
EDYFPRELSGGQQQRVGIARSLAIEPDIWFLDEPFSALDPLIRREMQDEFLRLQAMLGKTIVFITHDFDEALRLADRIAI
MKDGAVEQCDTPDQIVMNPTTGYVAKFTEEIDKARVVHAGVLARAGVVGEGQPVEAGATVQQLARLLVNDSRDLIPVADK
GQVIGALDRQGALDILLKAS
>Q4FL33 ~~~tmoX~~~Trimethylamine N-oxide-binding protein~~~COG2113
MKKIVSLMSALVISVVSFAGISNAADSKKPIVIPTHNWSSQIVMAHVIGGIFESMGNNVKYVNTDSQAVYESIRLGDVSL
SHEVWESAFGKSFTTALDKGGLVDWGDHEARTLEDMGYPNWVAEKGLCPGLPDWTALKNPACAKNFTTPDSGGKGRMLEG
PQTWHGDLIPQRVDALGLGDLWTVKFAGSADALWAELVAAEKEGRGTIIFNWTPNFTDGAGFTFIDFPPYTAGCRPEDGG
DGKCGSPDGYLKKAVNADFPKTHPAAAATFKKMSFSTSHIGAMAALVDVDKMTHEDAAKKWLADNKSVWTPFTK
>Q5LT66 ~~~tmoX~~~Trimethylamine N-oxide-binding protein~~~COG2113
MRLFREIAANDPGPTGRMKNMKTFTTALATGVLALCPLAALADSSDPIVIPIHNWSSQIVMSNVVGQIFEEMGVAVEFVT
TDSQAVYESVRLGDVTLELEVWEGAFGASFRAALEKGGIVDVGDHDAVTREDWWYPMWTKDACPGLPDWKALNDCAAVFA
TAETGDKGRYLDGPVDWLKHGKERVEALGMNFEVINAGSAAALWAEIGAAEADKRPVVVFNWTPNFAEAVWPGEFVEFPE
WVDGCDKDPAVGPNPDALYDCGNPATGYLKKAAWEGMEAKWPDAYAVLTRISFTNPQIAEMAKLVDVDEMEPDEAAEAWL
EANEDVWRPWLDG
>A0A4V8H042 1.14.11.72~~~tmpA~~~[2-(trimethylamino)ethyl]phosphonate dioxygenase~~~
MPRSVTADASGSFLTLTFEDGSESRFHAIWLRDNALDPETRSPGNGQRLITIGDIPADTRISTALVDDGALTVTFAPEGK
TVTFPGKWLKSNAYDTDQSSEVGRTSPDVETWDSSQPAPAFDWNEVQSDPKAKRDWLDAIARLGFAKLVNGPVREGALIE
CASMFGFVRETNYGKYFEVRTEVNPTNLAYTGLGLQAHTDNPYRDPVPSLQILYCLENSAEGGDSIVVDGFRAAERLRDE
DPEGFALLAGNPARFEYKGSDGVHLRARRPMIELSPDGEMIAIRFNNRSSAPFVDIPFEKMEAYYAAYRRLGEFIDDPEM
GVSFKLEPGESFIVDNTRVLHARLGYSGSGSRWLQGCYADKDGLFSTLNVLNAQLGG
>P07643 ~~~tmpA~~~Treponemal membrane protein A~~~
MNAHTLVYSGVALACAAMLGSCASGAKEEAEKKAAEQRALLVESAHADRRLMEARIGAQESGADTQHPELFSQIQDVERQ
STDAKIEGDLKKAAGVASEAADKYEILRNRVEVADLQSKIQTHQLAQYDGDSANAAEESWKKALELYETDSAQCLQSTVE
ALESYRKVAHEGFGRLLPDMKARAGAAKTDVGGLKVAVELRPQLEEADSQYQEAREAEEVNARAKAFSGYHRALEIYTEL
GKVVRLKKTEAEKALQSAKTKQKASSDLARSADKSAPLPENAQGFSKEPIEVEPLPNDRLNTTQADESAPIPISDTSSPS
RVQSRGVEDGGRSPKSSMNEEGASR
>A0A4V8H040 1.13.11.90~~~tmpB~~~[1-hydroxy-2-(trimethylamino)ethyl]phosphonate dioxygenase (glycine-betaine-forming)~~~
MSKPDVSKLNRGNIVEFIGGIFDRRGDEEYLGEPVTMAEHMLQGATIAEQNGQPEEIIVGALLHDIGHFTSEFGMFSMDD
TEDRYHEEAGAEVLEQFFPSVITDCVRYHVAAKRYLCATKPEYFNRLSEASIHSLKLQGGPMDAEEVAEFEKNPNLKQII
AVRYLDEAGKRADMETPDYWHFAPMVQRMVDKHMGA
>P29724 ~~~tmpC~~~Membrane lipoprotein TmpC~~~COG1744
MREKWVRAFAGVFCAMLLIGCSKSDRPQMGNAGGAEGGDFVVGMVTDSGDIDDKSFNQQVWEGISRFAQENNAKCKYVTA
STDAEYVPSLSAFADENMGLVVACGSFLVEAVIETSARFPKQKFLVIDAVVQDRDNVVSAVFGQNEGSFLVGVAAALKAK
EAGKSAVGFIVGMELGMMPLFEAGFEAGVKAVDPDIQVVVEVANTFSDPQKGQALAAKLYDSGVNVIFQVAGGTGNGVIK
EARDRRLNGQDVWVIGVDRDQYMDGVYDGSKSVVLTSMVKRADVAAERISKMAYDGSFPGGQSIMFGLEDKAVGIPEENP
NLSSAVMEKIRSFEEKIVSKEIVVPVRSARMMN
>P12921 ~~~tmrB~~~Tunicamycin resistance protein~~~COG1660
MIIWINGAFGSGKTQTAFELHRRLNPSYVYDPEKMGFALRSMVPQEIAKDDFQSYPLWRAFNYSLLASLTDTYRGILIVP
MTIVHPEYFNEIIGRLRQEGRIVHHFTLMASKETLLKRLRTRAEGKNSWAAKQIDRCVEGLSSPIFEDHIQTDNLSIQDV
AENIAARAELPLDPDTRGSLRRFADRLMVKLNHIRIK
>A7NH01 4.2.3.98~~~~~~(+)-T-muurolol synthase ((2E,6E)-farnesyl diphosphate cyclizing)~~~COG0664
MDQDYRARLVYPFSGAISPHADIVDQATLAWAAMFGLLTDSLRHKSRRLQYGLLAARAYPRADREMLQIAADWIAWLFFM
DDQCDETGIGRDLQRMIALHERFLAILDGATPEAHDCALTYALADLRRRLALRAPDNWLRRFSEHVRLYFTANRWETVNR
QRGATPNVATYCAARLFSGAVYACFDLIELAEQIELPFYARHHSIVQQLEQAANNIICWCNDVLSYPKEMQHGDRHNLVL
VIQGEHQCSLPEAIDRALDLHAREVATFVRKRTCVPYFDAAVNTALEKYVTGLQFWICANRDWSLTATRYAPTHKSQEMV
MAVAQQ
>B5GW45 4.2.3.98~~~~~~(+)-T-muurolol synthase ((2E,6E)-farnesyl diphosphate cyclizing)~~~COG3170
MSLNHSDLMFYCPVDDLPHPAASGVNDRTLDWASGQGIPTADRDAGRLRAMAPGLLAARIAPDARGPVLDAFADHHTWLF
AFDDEYCDRADGSGITEWASFLARLHRVVETGESALLPGNPYGLALRDIACRLSTYTTPAQLAEWLEALRSYFAALVWER
SRRRDDDRLQSLDDYLLLRLRNGAMHTSITLLDTVNGYVLPRELRETPGVRALVEMTALLVSVDNDILSHHKESTSGTRE
ANLLDVLGRTGHTTPGEAVAQAVALRNEIMRQFVRVAERVRTPAAVPELYRFTTGLARWIRANLDFSLTTTRYTGPVTER
AALSPHEVPPLSGQGPAPARSDVIGWWWRIPEPLPEPGSDGADTPVRKRRAGDRPPTAGRGGAPHHQRTGPPPPVLPGGI
TASRSSGLQQSTWRREHR
>Q46731 3.1.-.-~~~tnpA~~~Transposase for transposon Tn5~~~
MITSALHRAADWAKSVFSSAALGDPRRTARLVNVAAQLAKYSGKSITISSEGSEAMQEGAYRFIRNPNVSAEAIRKAGAM
QTVKLAQEFPELLAIEDTTSLSYRHQVAEELGKLGSIQDKSRGWWVHSVLLLEATTFRTVGLLHQEWWMRPDDPADADEK
ESGKWLAAAATSRLRMGSMMSNVIAVCDREADIHAYLQDKLAHNERFVVRSKHPRKDVESGLYLYDHLKNQPELGGYQIS
IPQKGVVDKRGKRKNRPARKASLSLRSGRITLKQGNITLNAVLAEEINPPKGETPLKWLLLTSEPVESLAQALRVIDIYT
HRWRIEEFHKAWKTGAGAERQRMEEPDNLERMVSILSFVAVRLLQLRESFTLPQALRAQGLLKEAEHVESQSAETVLTPD
ECQLLGYLDKGKRKRKEKAGSLQWAYMAIARLGGFMDSKRTGIASWGALWEGWEALQSKLDGFLAAKDLMAQGIKI
>P31014 4.1.99.1~~~tnaA1~~~Tryptophanase 1~~~COG3033
MPKGEPFKIKMVEPIRLIPREDRERALKEAHYNPFFLRSSDVYIDLLTDSGTGAMSQFQWSAMMLGDESYAGASSYYRLK
ETVTDITGYEYVIPTHQGRGAEKVAFSQLITRPGMYVLSNMFFDTTRGHVQLAGGRPVDLLIDVPTEEYHPFKGNMDTER
LEQFIREHGAENIACIVMTVTNNSAGGQPVSMANIRETYQIARKYGVLVLFDVARYAENCHFIRMREEGYADKHPIDIAR
EMFSYGDGLMMSAKKDALVNIGGLLAFRDEELFTKVGAAVVPFEGFLTYGGLAGRDLEAMAVGLREALDPDYLAYRVGQV
QYLGEMLRNAGIPIQWPVGGHAVFIDAAKFLPHVPWDQFPGHALTLALYLEGGVRTVEVGSLMIGRDPETGENVRGPFEF
TRLAIPRRVYTNLHLEDVAETVINAFQKREEIRGVKFAREPKVLRHFTAWFDPA
>P31015 4.1.99.1~~~tnaA2~~~Tryptophanase 2~~~COG3033
MPKGEPFKIKMVEPIRLIPREDREAAIKAAHYNPFLLRSSDVYIDLLTDSGTGAMSQFQWSAMMLGDESYAGASSYYRLK
EAVTDITGYEYVLPTHQGRGAEKSAFAQLITRPGMYVLSNMFFDTTRGHVQLAGGRPIDLLLDVPTEEYHPFKGNMDTAR
LEAFIQEHGAENIACIVMTVTNNSAGGQPVSMANIRETSRIARKYGILLLFDVARYAENCHFIRMREEGYADKAPIDIAR
EMFSYGDGLMMSAKKDALVNIGGLLAFKDEELYTRVGGTVVPFEGFLTYGGLAGRDLEAMAVGLREALDPDYLAYRVGQV
EYLGNLLRSAGIPIQWPVGGHAVFIDAAKFLPHIPWDQFPGHALTVALYQEGGVRTVEVGSLVMGRDPETGENVRSPFEF
TRLAIPRRVYTNLHLEDVAETVINAFQKREQIRGVKFTREPKVLRHFTAHFDLV
>P0A853 4.1.99.1~~~tnaA~~~Tryptophanase~~~COG3033
MENFKHLPEPFRIRVIEPVKRTTRAYREEAIIKSGMNPFLLDSEDVFIDLLTDSGTGAVTQSMQAAMMRGDEAYSGSRSY
YALAESVKNIFGYQYTIPTHQGRGAEQIYIPVLIKKREQEKGLDRSKMVAFSNYFFDTTQGHSQINGCTVRNVYIKEAFD
TGVRYDFKGNFDLEGLERGIEEVGPNNVPYIVATITSNSAGGQPVSLANLKAMYSIAKKYDIPVVMDSARFAENAYFIKQ
REAEYKDWTIEQITRETYKYADMLAMSAKKDAMVPMGGLLCMKDDSFFDVYTECRTLCVVQEGFPTYGGLEGGAMERLAV
GLYDGMNLDWLAYRIAQVQYLVDGLEEIGVVCQQAGGHAAFVDAGKLLPHIPADQFPAQALACELYKVAGIRAVEIGSFL
LGRDPKTGKQLPCPAELLRLTIPRATYTQTHMDFIIEAFKHVKENAANIKGLTFTYEPKVLRHFTAKLKEV
>P28796 4.1.99.1~~~tnaA~~~Tryptophanase~~~COG3033
MAKRIVEPFRIKMVEKIRVPSREEREAALKEAGYNPFLLPSSAVYIDLLTDSGTNAMSDHQWAAMITGDEAYAGSRNYYD
LKDKAKELFNYDYIIPAHQGRGAENILFPVLLKYKQKEGKAKNPVFISNFHFDTTAAHVELNGCKAINIVTEKAFDSETY
DDWKGDFDIKKLKENIAQHGADNIVAIVSTVTCNSAGGQPVSMSNLKEVYEIAKQHGIFVVMDSARFCENAYFIKARDPK
YKNATIKEVIFDMYKYADALTMSAKKDPLLNIGGLVAIRDNEEIFTLARQRCVPMEGFVTYGGLAGRDMAAMVQGLEEGT
EEEYLHYRIGQVKYLGDRLREAGIPIQYPTGGHAVFVDCKKLVPQIPGDQFPAQAVINALYLESGVRAVEIGSFLLGRDP
ATGEQKHADMEFMRLTIARRVYTNDHMDYIADALIGLKEKFATLKGLEFEYEPPVLRHFTARLKPIE
>P23173 ~~~tnaB~~~Low affinity tryptophan permease~~~COG0814
MTDQAEKKHSAFWGVMVIAGTVIGGGMFALPVDLAGAWFFWGAFILIIAWFSMLHSGLLLLEANLNYPVGSSFNTITKDL
IGNTWNIISGITVAFVLYILTYAYISANGAIISETISMNLGYHANPRIVGICTAIFVASVLWLSSLAASRITSLFLGLKI
ISFVIVFGSFFFQVDYSILRDATSSTAGTSYFPYIFMALPVCLASFGFHGNIPSLIICYGKRKDKLIKSVVFGSLLALVI
YLFWLYCTMGNIPRESFKAIISSGGNVDSLVKSFLGTKQHGIIEFCLLVFSNLAVASSFFGVTLGLFDYLADLFKIDNSH
GGRFKTVLLTFLPPALLYLIFPNGFIYGIGGAGLCATIWAVIIPAVLAIKARKKFPNQMFTVWGGNLIPAIVILFGITVI
LCWFGNVFNVLPKFG
>P28785 ~~~tnaB~~~Low affinity tryptophan permease~~~COG0814
MENTSVNKEPSIIVGGFVLGGAMIGAGMFSLPTIMSGAWFINSLFILFIVCFFMFHSGIYILECISKYGAGTNYFDISKE
LLPKWACYIANASLIFVLYILIYAYISAAGSIIYEASLLYGINFNLRAIFFIFTIALGATIWWGGACASRLTSIFLFIKI
VLFILAFSGLFFKAKGDLLFSATFAGKSQLYLYPFIFIIIPYAITSFGYHGNVCSLYKLYNQNERKVVKSCIIGCLLALV
IYLLWMIGTMGNLPREQFITIIQKGGNLDAFIDSLYTVLNSKYIEGFLLWFSISAVFCSFLGVAIGLFDYILASLKFKDN
KTGRLKSGVLCFTPPLLLCLFFPNGFLIAIAYAGTAACVWAIICPAVMALKARQKFPNSGFKVWGGKKLIYAVIAFGVVG
IICQSWRNLIYCLFIVK
>P10021 ~~~tnpA~~~Transposase for transposon Tn4430~~~
MGVKQLLSEAQRNELMDLSRLTEWDLVTFRTFSKHDLHLILKHRRGYNRLGFALRLVLIRYPGWSLTEYKDIPQYVVAYV
TSRLRIPPEEFLVYAKRGNTLWEHLGEIRTEYGYQNFSSEYKETLLQFLVQQAMDNNNTLYLIEITISTLRKTKVILPAM
YVIEDIVWEAKQQADQKVYSILHDGLVQEQKDQLDALLLPTINGKSPLAWLKDVPAQPSPESFLKVIDRLQFVQKIGLTI
DTTKINTNRLRQLARLGSKYEPYAFRRFNEVKRYSMLVSFLLEITQDLIDYAIEIHDRLMMNLQTKGKKEQDEIQQANGK
KLNEKILQFITVCGTLIEAKETGKDAFAALDEVMSWNEMVESVEEAKQLSRPLNYDYLDLLNTRYSYVRRYAPTLLRSLH
FRATKSGEPVLQALDTIHELNETGKRKVPHGAPLHFVSNRWQKHVYDDDGNINRHYYELAALTELRNHIRSGDIFVSGSR
HHKAFDDYLIPYDEWNEVSNIPNGLTAPLKAEDYITDRINRLNEHLEWLSKNSEKLEGVDISQGKLHVERLDRGTPEEAK
AFSKLLHSMLPRIKLTDLLIEVASWTGFHDQFIHASTNQSPDQEEQNIVLATLMAMGTNIGLTKMAEATPGISYRQMANA
SQWRMYDDAMVRAQSILVNFQKEQKLSSYWGDGTTSSSDGMRLSIAVRSLHADSNPHYGTGKGGTIYRFVSDQLSAYHVK
VITTNARDALHVLDGLLHHETDLKIEEHYTDTAGYTDQVFALTHLLGFRFAPRIRDLADTKLFSIPGGEEYENVQALLKG
KINVKLIKENYEDIRRLAYSVQTGKVSSALIMGKLGSYARQNKLATALGEMGRIEKTLFTLDYISNKAVRRRVQKGLNKG
EAINALARTIFFGQRGEFRERALQDQLQRASALNIIINAISVWNTVYMEKAVEELKARGEFREDLMPYAWPLGWEHINFL
GEYKFEGLHDTGQMNLRPLRIKEPFYS
>P03012 ~~~tnpR~~~Transposon gamma-delta resolvase~~~
MRLFGYARVSTSQQSLDIQVRALKDAGVKANRIFTDKASGSSSDRKGLDLLRMKVEEGDVILVKKLDRLGRDTADMIQLI
KEFDAQGVSIRFIDDGISTDGEMGKMVVTILSAVAQAERQRILERTNEGRQEAMAKGVVFGRKRKIDRDAVLNMWQQGLG
ASHISKTMNIARSTVYKVINESN
>P0ADI2 ~~~tnpR~~~Transposon Tn3 resolvase~~~
MRIFGYARVSTSQQSLDIQIRALKDAGVKANRIFTDKASGSSTDREGLDLLRMKVEEGDVILVKKLDRLGRDTADMIQLI
KEFDAQGVAVRFIDDGISTDGDMGQMVVTILSAVAQAERRRILERTNEGRQEAKLKGIKFGRRRTVDRNVVLTLHQKGTG
ATEIAHQLSIARSTVYKILEDERAS
>P22886 ~~~Int-Tn~~~Transposase from transposon Tn916~~~COG0582
MSEKRRDNRGRILKTGESQRKDGRYLYKYIDSFGEPQFVYSWKLVATDRVPAGKRDCISLREKIAELQKDIHDGIDVVGK
KMTLCQLYAKQNAQRPKVRKNTETGRKYLMDILKKDKLGVRSIDSIKPSDAKEWAIRMSENGYAYQTINNYKRSLKASFY
IAIQDDCVRKNPFDFQLKAVLDDDTVPKTVLTEEQEEKLLAFAKADKTYSKNYDEILILLKTGLRISEFGGLTLPDLDFE
NRLVNIDHQLLRDTEIGYYIETPKTKSGERQVPMVEEAYQAFKRVLANRKNDKRVEIDGYSDFLFLNRKNYPKVASDYNG
MMKGLVKKYNKYNEDKLPHITPHSLRHTFCTNYANAGMNPKALQYIMGHANIAMTLNYYAHATFDSAMAEMKRLNKEKQQ
ERLVA
>Q45666 ~~~tnrA~~~HTH-type transcriptional regulator TnrA~~~COG0789
MTTEDHSYKDKKVISIGIVSELTGLSVRQIRYYEERKLIYPQRSSRGTRKYSFADVERLMDIANKREDGVQTAEILKDMR
KKEQMLKNDPQVRKKMLEGQLNAHFRYKNR
>G2RUZ1 ~~~tnrA~~~HTH-type transcriptional regulator TnrA~~~
MSTNEASYRDKKVMSIGIVKELTGLSERQIRYYEKRSLLFPDRTNTGIRKYSFSDVERLMDIADRIEEGVQTSEIRTELA
KKDEARKMKEVKNQMLQGQLNAHFKRKL
>P13988 3.1.21.-~~~tnsA~~~Transposon Tn7 transposition protein TnsA~~~
MAKANSSFSEVQIARRIKEGRGQGHGKDYIPWLTVQEVPSSGRSHRIYSHKTGRVHHLLSDLELAVFLSLEWESSVLDIR
EQFPLLPSDTRQIAIDSGIKHPVIRGVDQVMSTDFLVDCKDGPFEQFAIQVKPAAALQDERTLEKLELERRYWQQKQIPW
FIFTDKEINPVVKENIEWLYSVKTEEVSAELLAQLSPLAHILQEKGDENIINVCKQVDIAYDLELGKTLSEIRALTANGF
IKFNIYKSFRANKCADLCISQVVNMEELRYVAN
>P13989 ~~~tnsB~~~Transposon Tn7 transposition protein TnsB~~~
MWQINEVVLFDNDPYRILAIEDGQVVWMQISADKGVPQARAELLLMQYLDEGRLVRTDDPYVHLDLEEPSVDSVSFQKRE
EDYRKILPIINSKDRFDPKVRSELVEHVVQEHKVTKATVYKLLRRYWQRGQTPNALIPDYKNSGAPGERRSATGTAKIGR
AREYGKGEGTKVTPEIERLFRLTIEKHLLNQKGTKTTVAYRRFVDLFAQYFPRIPQEDYPTLRQFRYFYDREYPKAQRLK
SRVKAGVYKKDVRPLSSTATSQALGPGSRYEIDATIADIYLVDHHDRQKIIGRPTLYIVIDVFSRMITGFYIGFENPSYV
VAMQAFVNACSDKTAICAQHDIEISSSDWPCVGLPDVLLADRGELMSHQVEALVSSFNVRVESAPPRRGDAKGIVESTFR
TLQAEFKSFAPGIVEGSRIKSHGETDYRLDASLSVFEFTQIILRTILFRNNHLVMDKYDRDADFPTDLPSIPVQLWQWGM
QHRTGSLRAVEQEQLRVALLPRRKVSISSFGVNLWGLYYSGSEILREGWLQRSTDIARPQHLEAAYDPVLVDTIYLFPQV
GSRVFWRCNLTERSRQFKGLSFWEVWDIQAQEKHNKANAKQDELTKRRELEAFIQQTIQKANKLTPSTTEPKSTRIKQIK
TNKKEAVTSERKKRAEHLKPSSSGDEAKVIPFNAVEADDQEDYSLPTYVPELFQDPPEKDES
>P05846 ~~~tnsC~~~Transposon Tn7 transposition protein TnsC~~~
MSATRIQAVYRDTGVEAYRDNPFIEALPPLQESVNSAASLKSSLQLTSSDLQKSRVIRAHTICRIPDDYFQPLGTHLLLS
ERISVMIRGGYVGRNPKTGDLQKHLQNGYERVQTGELETFRFEEARSTAQSLLLIGCSGSGKTTSLHRILATYPQVIYHR
ELNVEQVVYLKIDCSHNGSLKEICLNFFRALDRALGSNYERRYGLKRHGIETMLALMSQIANAHALGLLVIDEIQHLSRS
RSGGSQEMLNFFVTMVNIIGVPVMLIGTPKAREIFEADLRSARRGAGFGAIFWDPIQQTQRGKPNQEWIAFTDNLWQLQL
LQRKDALLSDEVRDVWYELSQGVMDIVVKLFVLAQLRALALGNERITAGLLRQVYQDELKPVHPMLEALRSGIPERIARY
SDLVVPEIDKRLIQLQLDIAAIQEQTPEEKALQELDTEDQRHLYLMLKEDYDSSLLIPTIKKAFSQNPTMTRQKLLPLVL
QWLMEGETVVSELEKPSKSKKVSAIKVVKPSDWDSLPDTDLRYIYSQRQPEKTMHERLKGKGVIVDMASLFKQAG
>P05845 ~~~tnsE~~~Transposon Tn7 transposition protein TnsE~~~
MVRLATFNDNVQVVHIGHLFRNSGHKEWRIFVWFNPMQERKWTRFTHLPLLSRAKVVNSTTKQINKADRVIEFEASDLQR
AKIIDFPNLSSFASVRNKDGAQSSFIYEAETPYSKTRYHIPQLELARSLFLINSYFCRSCLSSTALQQEFDVQYEVERDH
LEIRILPSSSFPKGALEQSAVVQLLVWLFSDQDVMDSYESIFRHYQQNREIKNGVESWCFSFDPPPMQGWKLHVKGRSSN
EDKDYLVEEIVGLEINAMLPSTTAISHASFQEKEAGDGSTQHIAVSTESVVDDEHLQLDDEETANIDTDTRVIEAEPTWI
SFSRPSRIEKSRRARKSSQTILEKEEATTSENSNLVSTDEPHLGGVLAAADVGGKQDATNYNSIFANRFAAFDELLSILK
TKFACRVLFEETLVLPKVGRSRLHLCKDGSPRVIKAVGVQRNGSEFVLLEVDASDGVKMLSTKVLSGVDSETWRNDFEKI
RRGVVKSSLNWPNSLFDQLYGQDGHRGVNHPKGLGELQVSREDMEGWAERVVREQFTH
>Q70IY1 6.1.2.2~~~tobZ~~~nebramycin 5' synthase~~~
MRVLGLNGWPRDFHDASAALLVDGRIAAFAEEERFTRKKHGYNTAPVQAAAFCLAQAGLTVDDLDAVAFGWDLPAMYRER
LGGWPHSDSEALDILLPRDVFPRRTDPPLHFVQHHLAHAASAYYFSGEDRGAVLIVDGQGEEECVTLAHAEGGKITVLDT
VPGAWSLGFFYEHVSEYTGLGGDNPGKLMGLAAHGTTVDETLSAFAFDSDGYRLNLIDPQARDPEDWDEYSVTERAWFAH
LERIYRLPPNEFVRRYDPAKGRVVRDTRRDPYEYRDLAATAQAALERAVFGLADSVLARTGERTLFVAGGVGLNATMNGK
LLTRSTVDKMFVPPVASDIGVSLGAAAAVAVELGDRIAPMGDTAAWGPEFSPDQVRAALDRTGLAYREPANLEREVAALI
ASGKVVGWAQGRGEVGPRALGQRSLLGSAHSPTMRDHINLRVKDREWWRPFAPSMLRSVSDQVLEVDADFPYMIMTTKVR
AAYAERLPSVVHEDWSTRPQTVTEASNPRYHRMLTELGDLVGDPVCLNTSFNDRGEPIVSSPADALLTFSRLPIDALAVG
PYLVTKDLRH
>A5W4E9 1.18.1.3~~~todA~~~Toluene 1,2-dioxygenase system ferredoxin--NAD(+) reductase component~~~COG1251
MATHVAIIGNGVGGFTTAQALRAEGFEGRISLIGDEPHLPYDRPSLSKAVLDGSLERPPILAEADWYGEARIDMLTGPEV
TALDVQTRTISLDDGTTLSADAIVIATGSRARTMALPGSQLPGVVTLRTYGDVQVLRDSWTSATRLLIVGGGLIGCEVAT
TARKLGLSVTILEAGDELLVRVLGRRIGAWLRGLLTELGVQVELGTGVVGFSGEGQLEQVMASDGRSFVADSALICVGAE
PADQLARQAGLACDRGVIVDHCGATLAKGVFAVGDVASWPLRAGGRRSLETYMNAQRQAAAVAAAILGKNVSAPQLPVSW
TEIAGHRMQMAGDIEGPGDFVSRGMPGSGAALLFRLQERRIQAVVAVDAPRDFALATRLVEARAAIEPARLADLSNSMRD
FVRANEGDLT
>A5W4F0 ~~~todB~~~Toluene 1,2-dioxygenase system ferredoxin subunit~~~COG2146
MTWTYILRQGDLPPGEMQRYEGGPEPVMVCNVDGEFFAVQDTCTHGDWALSDGYLDGDIVECTLHFGKFCVRTGKVKALP
ACKPIKVFPIKVEGDEVHVDLDNGELK
>P13859 1.3.1.-~~~todD~~~Cis-toluene dihydrodiol dehydrogenase~~~COG1028
MRLEGEVALVTGGGAGLGRAIVDRYVAEGARVAVLDKSAAGLEALRKLHGDAIVGVEGDVRSLDSHREAVARCVEAFGKL
DCLVGNAGVWDYLTQLVDIPDDLISEAFEEMFEVNVKGYILAAKAALPALYQSKGSAIFTVSNAGFYPGGGGVLYTAGKH
AVIGLIKQLAHEWGPRIRVNGIAPGGILGSDLRGLKSLDLQDKSISTFPLDDMLKSVLPTGRAATAEEYAGAYVFFATRG
DTVPLTGSVLNFDGGMGVRGLFEASLGAQLDKHFG
>P13453 1.13.11.-~~~todE~~~3-methylcatechol 2,3-dioxygenase~~~COG0346
MSIQRLGYLGFEVADVRSWRTFATTRLGMMEASASETEATFRIDSRAWRLSVSRGPADDYLFAGFEVDSEQGLQEVKESL
QAHGVTVKVEGGELIAKRGVLGLISCTDPFGNRVEIYYGATELFERPFASPTGVSGFQTGDQGLGHYVLSVADVDAALAF
YTKALGFQLADVIDWTIGDGLSVTLYFLYCNGRHHSFAFAKLPGSKRLHHFMLQANGMDDVGLAYDKFDAERAVVMSLGR
HTNDHMISFYGATPSGFAVEYGWGAREVTRHWSVVRYDRISIWGHKFQAPA
>P23133 3.7.1.25~~~todF~~~2-hydroxy-6-oxo-2,4-heptadienoate hydrolase~~~COG2267
MTNVNAEIGRMVLAGGIETNLHDVGAGNPVVLVHGSGPGVTAWANWRTVMPELSRHRRVIAPDMVGFGFTQRPHGIHYGV
ESWVAHLAGILDALELDRVDLVGNSFGGALSLAFAIRFPHRVRRLVLMGAVGVSFELTDGLDAVWGYEPSVPNMRKVMDY
FAYDRSLVSDELAELRYKASTRPGFQEAFASMFPAPRQRWVDALASSDQDIRDIRHETLILHGRDDRVIPLETSLRLNQL
IEPSQLHVFGRCGHWVQIEQNRGFIRLVNDFLAAED
>A5W4E3 2.7.13.3~~~todS~~~Sensor histidine kinase TodS~~~COG0745
MSSLDRKKPQNRSKNNYYNICLKEKGSEELTCEEHARIIFDGLYEFVGLLDAHGNVLEVNQVALEGGGITLEEIRGKPFW
KARWWQISKKTEATQKRLVETASSGEFVRCDVEILGKSGGREVIAVDFSLLPICNEEGSIVYLLAEGRNITDKKKAEAML
ALKNQELEQSVECIRKLDNAKSDFFAKVSHELRTPLSLILGPLEAVMAAEAGRESPYWKQFEVIQRNAMTLLKQVNTLLD
LAKMDARQMGLSYRRANLSQLTRTISSNFEGIAQQKSITFDTKLPVQMVAEVDCEKYERIILNLLSNAFKFTPDGGLIRC
CLSLSRPNYALVTVSDSGPGIPPALRKEIFERFHQLSQEGQQATRGTGLGLSIVKEFVELHRGTISVSDAPGGGALFQVK
LPLNAPEGAYVASNTAPRRDNPQVVDTDEYLLLAPNAENEAEVLPFQSDQPRVLIVEDNPDMRGFIKDCLSSDYQVYVAP
DGAKALELMSNMPPDLLITDLIMPVMSGDMLVHQVRKKNELSHIPIMVLSAKSDAELRVKLLSESVQDFLLKPFSAHELR
ARVSNLVSMKVAGDALRKELSDQGDDIAILTHRLIKSRHRLQQSNIALSASEARWKAVYENSAAGIVLTDPENRILNANP
AFQRITGYGEKDLEGLSMEQLTPSDESPQIKQRLANLLQGGGAEYSVERSYLCKNGSTIWANASVSLMPQRVGESPVILQ
IIDDITEKKQAQENLNQLQQQLVYVSRSATMGEFAAYIAHEINQPLSAIMTNANAGTRWLGNEPSNIPEAKEALARIIRD
SDRAAEIIRMVRSFLKRQETVLKPIDLKALVTDTSLILKAPSQNNSVNLDVVADDELPEIWGDGVQIQQLIINLAMNAIE
AISQADCETRQLTLSFSGNDTGDALVISVKDTGPGISERQMAQLFNAFYTTKKEGLGMGLAICLTITEVHNGKIWVECPP
AGGACFLVSIPARQGSGT
>E0X9C7 2.7.13.3~~~todS~~~Sensor histidine kinase TodS~~~
MSSLDRKKPQNRSKNNYYNICLKEKGSEELTCEEHARIIFDGLYEFVGLLDAHGNVLEVNQVALEGAGITLEEIRGKPFW
KARWWQISKKTEATQKRLVETASSGEFVRCDVEILGKSGGREVIAVDFSLLPICNEEGSIVYLLAEGRNITDKKKAEAML
ALKNQELEQSVERIRKLDNAKSDFFAKVSHELRTPLSLILGPLEAVMAAEAGRESPYWKQFEVIQRNAMTLLKQVNTLLD
LAKMDARQMGLSYRRANLSQLTRTISSNFEGIAQQKSITFDTKLPVQMVAEVDCEKYERIILNLLSNAFKFTPDGGLIRC
CLSLSRPNYALVTVSDSGPGIPPALRKEIFERFHQLSQEGQQATRGTGLGLSIVKEFVELHRGTISVSDAPGGGALFQVK
LPLNAPEGAYVASNTAPRRDNPQVVDTDEYLLLAPNAENEAEVLPFQSDQPRVLIVEDNPDMRGFIKDCLSSDYQVYVAP
DGAKALELMSNMPPDLLITDLMMPVMSGDMLVHQVRKKNELSHIPIMVLSAKSDAELRVKLLSESVQDFLLKPFSAHELR
ARVSNLVSMKVAGDALRKELSDQGDDIAILTHRLIKSRHRLQQSNIALSASEARWKAVYENSAAGIVLTDPENRILNANP
AFQRITGYGEKDLEGLSMEQLTPSDESPQIKQRLANLLQGGGAEYSVERSYLCKNGSTIWANASVSLMPQRVGESPVILQ
IIDDITEKKQAQENLNQLQQQLVYVSRSATMGEFAAYIAHEINQPLSAIMTNANAGTRWLGNEPSNIPEAKEALARIIRD
SDRAAEIIRMVRSFLKRQETVLKPIDLKALVTDTSLILKAPSQNNSVNLDVVADDELPEIWGDGVQIQQLIINLAMNAIE
AISQADCETRQLTLSFSGNDTGDALVISVKDTGPGISERQMAQLFNAFYTTKKEGLGMGLAICLTITEVHNGKIWVECPP
AGGACFLVSIPARQGSGT
>A5W4E2 ~~~todT~~~Response regulator protein TodT~~~COG4566
MPARWGCLFPGKYPCQTGLRHMSDRASVIYILDDDNAVLEALSSLVRSIGLSVECFSSASVFLNDVNRSACGCLILDVRM
PEMSGLDVQRQLKELGEQIPIIFISGHGDIPMAVKAIKAGAVDFFTKPFREEELLGAIRAALKLAPQQRSNAPRVSELKE
NYESLSKREQQVLKFVLRGYLNKQTALELDISEATVKVHRHNIMRKMKVSSIQDLVRVTERLKDSLE
>I7CA98 ~~~todT~~~Response regulator protein TodT~~~
MPARWGCLFPGKYPCQTGLRHMSDRASVIYILDDDNAVLEALSSLVRSIGLSVECFSSASVFLNDVNRSACGCLILDVRM
PEMSGLDVQRQLKELGEQIPIIFISGHGDIPMAVKAIKAGAVDFFTKPFREEELLGAIRAALKLAPQQRSNAPRVSELKE
NYESLSKREQQVLKFVLRGYLNKQTALELDISEATVKVHRHNIMRKMKVSSIQDLVRVTERLKDSLE
>P19934 ~~~tolA~~~Tol-Pal system protein TolA~~~COG3064
MSKATEQNDKLKRAIIISAVLHVILFAALIWSSFDENIEASAGGGGGSSIDAVMVDSGAVVEQYKRMQSQESSAKRSDEQ
RKMKEQQAAEELREKQAAEQERLKQLEKERLAAQEQKKQAEEAAKQAELKQKQAEEAAAKAAADAKAKAEADAKAAEEAA
KKAAADAKKKAEAEAAKAAAEAQKKAEAAAAALKKKAEAAEAAAAEARKKAATEAAEKAKAEAEKKAAAEKAAADKKAAA
EKAAADKKAAEKAAAEKAAADKKAAAEKAAADKKAAAAKAAAEKAAAAKAAAEADDIFGELSSGKNAPKTGGGAKGNNAS
PAGSGNTKNNGASGADINNYAGQIKSAIESKFYDASSYAGKTCTLRIKLAPDGMLLDIKPEGGDPALCQAALAAAKLAKI
PKPPSQAVYEVFKNAPLDFKP
>P50600 ~~~tolA~~~Tol-Pal system protein TolA~~~
MKQQFERSPSESYFWPVVLAVVLHVLIFAMLFVSWAFAPELPPSKPIVQATLYQLKSKSQATTQTNQKIAGEAKKTASKQ
YEVEQLEQKKLEQQKLEQQKLEQQQVAAAKAAEQKKADEARKAEAQKAAEAKKADEAKKAAEAKAAEQKKQADIAKKRAE
DEAKKKAAEDAKKKAAEDAKKKAAEEAKKKAAAEAAKKKAAVEAAKKKAAAAAAAARKAAEDKKARALAELLSDTTERQQ
ALADEVGSEVTGSLDDLIVNLVSQQWRRPPSARNGMSVEVLIEMLPDGTITNASVSRSSGDKPFDSSAVAAVRNVGRIPE
MQQLPRATFDSLYRQRRIIFKPEDLSL
>Q83F59 ~~~tolB~~~Tol-Pal system protein TolB~~~COG0823
MQKRHPIIYLLITLLIFVPVSYGALDLELTKGVTAAIPIAIMPFAGPSVLAPGDETIPQVIKNDLQNSGQFRVTGSGNLD
QTAASLEQIDYSYWRKQKVNALVVGAINPLGMRRYRVAFTLINVFDSNNRLLSESFNVNAKELRNLAHHISDLIYQKLTG
VRGVFSTKIAYVLVQASSENTAAKYTLEVADADGFNPQPLLVSDMPIMSPTWSPDGNKIAYVSFEGHRAAIYLQDLATGR
RQRLSEAPGINGAPAFSPDGKQLALVLTKTGNPKIYILNLADGHLREITKGWSIDTEPAWSPDGKSLLFTSNRDGTPQIY
NYSFADGSINRVTYRGDYNARPSFMPDGKSIIMMHRENGLFGIARQDLSTGQVQILSESGTDESPSLAPNGKMAIYAMEY
GGRGVLAQVSIDGQIKLRLPARNGNVQEPAWSPYLGV
>P0A855 ~~~tolB~~~Tol-Pal system protein TolB~~~COG0823
MKQALRVAFGFLILWASVLHAEVRIVIDSGVDSGRPIGVVPFQWAGPGAAPEDIGGIVAADLRNSGKFNPLDRARLPQQP
GSAQEVQPAAWSALGIDAVVVGQVTPNPDGSYNVAYQLVDTGGAPGTVLAQNSYKVNKQWLRYAGHTASDEVFEKLTGIK
GAFRTRIAYVVQTNGGQFPYELRVSDYDGYNQFVVHRSPQPLMSPAWSPDGSKLAYVTFESGRSALVIQTLANGAVRQVA
SFPRHNGAPAFSPDGSKLAFALSKTGSLNLYVMDLASGQIRQVTDGRSNNTEPTWFPDSQNLAFTSDQAGRPQVYKVNIN
GGAPQRITWEGSQNQDADVSSDGKFMVMVSSNGGQQHIAKQDLATGGVQVLSSTFLDETPSLAPNGTMVIYSSSQGMGSV
LNLVSTDGRFKARLPATDGQVKFPAWSPYL
>P50601 ~~~tolB~~~Tol-Pal system protein TolB~~~
MSTLIRIALFALALMAGAAQAADPLVISSGNDRAIPIAVVPFGFQGGNVLPEDMSNIIGNDLRNSGYFEPLPRQNMISQP
AQASEVIFRDWKAVGVNYVMVGNIVPAGGRLQVQYALFDVGTEQQVLTGSVTGSTDQLRDMSHYIADQSFEKLTGIKGAF
STKMLYVTAERFSVDNTRYTLQRSDYDGARPVTLLQSREPIVSPRFSPDGRRIAYVSFEQKRPRIFIQYVDTGRREQITN
FEGLNGAPAFSPDGNRLAFVLSRDGNPEIYVMDLGSRALRRLTNNLAIDTEPFWGKDGSTLYFTSDRGGKPQIYKMNVNS
GAVDRVTFIGNYNANPKLSADEKTLVMVHRQQGYTNFQIAAQDLQRGNLRVLSNTTLDDSPTVAPNGTMLIYATRQQDRG
VLMLVSINGRVRIPLPTAQGDVREPSWSPYLN
>Q8ZQT5 ~~~tolB~~~Tol-Pal system protein TolB~~~
MKQALRVAFGFLMLWAAVLHAEVRIEITQGVDSARPIGVVPFKWAGPGAAPEDIGGIVAADLRNSGKFNPLDRSRLPQQP
ATAQEVQPTAWSALGIDAVVVGQVTPNPDGSYNVAYQLVDTGGAPGTVLAQNSYKVNKQWLRYAGHTASDEVFEKLTGIK
GAFRTRIAYVVQTNGGQFPYELRVSDYDGYNQFVVHRSPQPLMSPAWSPDGSKLAYVTFESGRSALVIQTLANGAVRQVA
SFPRHNGAPAFSPDGTKLAFALSKTGSLNLYVMDLASGQIRQITDGRSNNTEPTWFPDSQTLAFTSDQAGRPQVYKMNIN
GGAAQRITWEGSQNQDADVSSDGKFMVMVSSNNGQQHIAKQDLVTGGVQVLSSTFLDETPSLAPNGTMVIYSSSQGMGSV
LNLVSTDGRFKARLPATDGQVKSPAWSPYL
>Q8ZGZ1 ~~~tolB~~~Tol-Pal system protein TolB~~~COG0823
MKQAFRVALGFLVLWASVLHAEVRIEITQGVDSARPIGVVPFKWMGPGTPPEEIGAIVGADLRNSGKFNPIDAARMPQQP
STAAEVTPAAWTALGIDAVVVGQVQPSADGSYVVSYQLVDTSGSAGSILAQNQYKVTKQWLRYSAHTVSDEVFEKLTGIK
GAFRTRIAYVVKTNGGKFPHELRVSDYDGYNQFVVHRSPEPLMSPAWSPDGSKIAYVTFESGKSALVIQTLANGAIRQVA
SFPRHNGAPAFSPDGTKLAFALSKSGSLNLYVMDLASGQISQVTDGRSNNTEPSWFPDSQNLAYTSDQGGRPQVYKVNIN
GGVPQRITWEGSQNQNADVSPDGKFLVLVSSNGGAQHIAKQDLETGAVQVLTDTLLDETPSIAPNGTMVIYSSTQGLGSV
LQLVSTDGRFKARLPATDGQVKFPAWSPYL
>P02930 ~~~tolC~~~Outer membrane protein TolC~~~COG1538
MKKLLPILIGLSLSGFSSLSQAENLMQVYQQARLSNPELRKSAADRDAAFEKINEARSPLLPQLGLGADYTYSNGYRDAN
GINSNATSASLQLTQSIFDMSKWRALTLQEKAAGIQDVTYQTDQQTLILNTATAYFNVLNAIDVLSYTQAQKEAIYRQLD
QTTQRFNVGLVAITDVQNARAQYDTVLANEVTARNNLDNAVEQLRQITGNYYPELAALNVENFKTDKPQPVNALLKEAEK
RNLSLLQARLSQDLAREQIRQAQDGHLPTLDLTASTGISDTSYSGSKTRGAAGTQYDDSNMGQNKVGLSFSLPIYQGGMV
NSQVKQAQYNFVGASEQLESAHRSVVQTVRSSFNNINASISSINAYKQAVVSAQSSLDAMEAGYSVGTRTIVDVLDATTT
LYNAKQELANARYNYLINQLNIKSALGTLNEQDLLALNNALSKPVSTNPENVAPQTPEQNAIADGYAPDSPAPVVQQTSA
RTTTSNGHNPFRN
>Q54001 ~~~tolC~~~Outer membrane protein TolC~~~
MQMKKLLPILIGLSLSGFSTLSQAENLMQVYQQARLSNPELRKSAADRDAAFEKINEARSPLLPQLGLGADYTYSNGYRD
ANGINSNETSASLQLTQTLFDMSKWRGLTLQEKAAGIQDVTYQTDQQTLILNTANAYFKVLNAIDVLSYTQAQKEAIYRQ
LDQTTQRFNVGLVAITDVQNARAQYDTVLANEVTVRNNLDNAVEELRQVTGNYYPELASLNVEHFKTDKPKAVNALLKEA
ENRNLSLLQARLSQDLAREQIRQAQDGHLPTLNLTPSTGISDTSYSGSKTNAAQYDDSNMGQNKIGLNFSLPLYQGGMVN
SQVKQAQYNFVGASEQLESAHRSVVQTVRSSFNNINASISSINAYKQAVVSAQSSLDAMEAGYSVGTRTIVDVLDATTTL
YDAKQQLANARYTYLINQLNIKYALGTLNEQDLLALNSTLGKPIPTSPESVAPETPDQDCAADGYNAHSAAPAVQPTAAR
ANSNNGNPFRH
>Q9K2Y1 ~~~tolC~~~Outer membrane protein TolC~~~COG1538
MKKLLPLFVSAALGTLSSAVWAENLAEIYNQAKENDPQLLSVAAQRDAAFEAVTSSRSALLPQINLTAGYNINRSDQAPR
ESDLLSAGINFSQELYQRSSWVSLDTAEKKARQADSQYAATQQGLILRVAKAYFEVLRAQDNLEFVRAEKAAVGRQLEQT
KQRFEVGLSAITDVHDAQAQFDGVLADEVLAENSLTNSYEALREITGQEYSKLAVLDTKRFAASRTTESSEALIEKAQQQ
NLSLLAARISQDVARDNISLASSGHLPSLTLDGGYNYGNNSNDNAKNTSGEEYNDFKIGVNLKVPLYTGGNTTSLTKQAE
FAYVAASQDLEAAYRSVVKDVRAYNNNINASIGALRAYEQAVISAKSALEATEAGFDVGTRTIVDVLDATRRLYDANKNL
SNARYDYILSVLQLRQAIGTLSEQDVMDVNAGLKVAKK
>P0ABU9 ~~~tolQ~~~Tol-Pal system protein TolQ~~~COG0811
MTDMNILDLFLKASLLVKLIMLILIGFSIASWAIIIQRTRILNAAAREAEAFEDKFWSGIELSRLYQESQGKRDNLTGSE
QIFYSGFKEFVRLHRANSHAPEAVVEGASRAMRISMNRELENLETHIPFLGTVGSISPYIGLFGTVWGIMHAFIALGAVK
QATLQMVAPGIAEALIATAIGLFAAIPAVMAYNRLNQRVNKLELNYDNFMEEFTAILHRQAFTVSESNKG
>P0ABV6 ~~~tolR~~~Tol-Pal system protein TolR~~~COG0848
MARARGRGRRDLKSEINIVPLLDVLLVLLLIFMATAPIITQSVEVDLPDATESQAVSSNDNPPVIVEVSGIGQYTVVVEK
DRLERLPPEQVVAEVSSRFKANPKTVFLIGGAKDVPYDEIIKALNLLHSAGVKSVGLMTQPI
>P43769 ~~~tolR~~~Tol-Pal system protein TolR~~~COG0848
MARRQRKAIKSEINIVPFLDVLLVLVLIFMATAPIISQSVQVELPDSVQSQEVSNEDKVPVILEVAGIGKYAISIGGERQ
EGLTEEMVTQLSRQEFDKDNNTLFLVGGAKEVPYEEVIKALNLLHLAGIKSVGLMTNPI
>P0AAR0 ~~~tomB~~~Hha toxicity modulator TomB~~~
MDEYSPKRHDIAQLKFLCETLYHDCLANLEESNHGWVNDPTSAINLQLNELIEHIATFALNYKIKYNEDNKLIEQIDEYL
DDTFMLFSSYGINMQDLQKWRKSGNRLFRCFVNATKENPASLSC
>P02929 ~~~tonB~~~Protein TonB~~~COG0810
MTLDLPRRFPWPTLLSVCIHGAVVAGLLYTSVHQVIELPAPAQPISVTMVTPADLEPPQAVQPPPEPVVEPEPEPEPIPE
PPKEAPVVIEKPKPKPKPKPKPVKKVQEQPKRDVKPVESRPASPFENTAPARLTSSTATAATSKPVTSVASGPRALSRNQ
PQYPARAQALRIEGQVKVKFDVTPDGRVDNVQILSAKPANMFEREVKNAMRRWRYEPGKPGSGIVVNILFKINGTTEIQ
>O25899 ~~~tonB~~~Protein TonB~~~COG0810
MKISPSPRKLSKVSTSVSFLISFALYAIGFGYFLLREDAPEPLAQAGTTKVTMSLASINTNSNTKTNAESAKPKEEPKEK
PKKEEPKKEEPKKEVTKPKPKPKPKPKPKPKPKPEPKPEPKPEPKPEPKVEEVKKEEPKEEPKKEEAKEEAKEKSAPKQV
TTKDIVKEKDKQEESNKTSEGATSEAQAYNPGVSNEFLMKIQTAISSKNRYPKMAQIRGIEGEVLVSFTINADGSVTDIK
VVKSNTTDILNHAALEAIKSAAHLFPKPEETVHLKIPIAYSLKED
>Q51368 ~~~tonB~~~Protein TonB~~~
MSPQPSRSPDRFSLAALAEDHPTAPAQGDESESLPCVNAQRGEPNLRVVDCSGARRDEEVAVEEVLIPYAHGSDPEDVPG
EPPKSRWWLSSGAAVAMHVAIIGALVWVMPTPAELNLGHGELPKTMQVNFVQLEKKAEPTPQPPAAAPEPTPPKIEEPKP
EPPKPKPVEKPKPKPKPKPKPVENAIPKAKPKPEPKPKPEPEPSTEASSQPSPSSAAPPPAPTVGQSTPGAQTAPSGSQG
PAGLPSGSLNDSDIKPLRMDPPVYPRMAQARGIEGRVKVLFTITSDGRIDDIQVLESVPSRMFDREVRQAMAKWRFEPRV
SGGKIVARQATKMFFFKIEKRR
>P39814 5.6.2.1~~~topA~~~DNA topoisomerase 1~~~COG0550
MSDYLVIVESPAKAKTIERYLGKKYKVKASMGHVRDLPKSQMGVDIEQNFEPKYITIRGKGPVLKELKTAAKKAKKVYLA
ADPDREGEAIAWHLAHSLDLDLNSDCRVVFNEITKDAIKESFKHPRMINMDLVDAQQARRILDRLVGYKISPILWKKVKK
GLSAGRVQSVALRLIIDREKEINDFKPEEYWTIDGTFLKGQETFEASFFGKNGKKLPLNSEADVKEILSQLKGNQYTVEK
VTKKERKRNPALPFTTSTLQQEAARKLNFRAKKTMMIAQQLYEGIDLGREGTVGLITYMRTDSTRISNTAVDEAAAFIDQ
TYGKEFLGGKRKPAKKNENAQDAHEAIRPTSVLRKPSELKAVLGRDQMRLYKLIWERFVASQMAPAVLDTMSVDLTNNGL
TFRANGSKVKFSGFMKVYVEGKDDQMEEKDRMLPDLQEGDTVLSKDIEPEQHFTQPPPRYTEARLVKTLEERGIGRPSTY
APTLDTIQRRGYVALDNKRFVPTELGQIVLDLIMEFFPEIINVEFTAKMERDLDHVEEGNTEWVKIIDNFYTDFEKRVKK
AESEMKEVEIEPEYAGEDCELCSSPMVYKMGRYGKFLACSNFPDCRNTKPIVKQIGVKCPSCGEGNIVERKSKKKRVFYG
CDRYPDCEFVSWDKPIERKCPKCGKMLVEKKLKKGIQVQCVECDYKEEPQK
>P06612 5.6.2.1~~~topA~~~DNA topoisomerase 1~~~COG0550
MGKALVIVESPAKAKTINKYLGSDYVVKSSVGHIRDLPTSGSAAKKSADSTSTKTAKKPKKDERGALVNRMGVDPWHNWE
AHYEVLPGKEKVVSELKQLAEKADHIYLATDLDREGEAIAWHLREVIGGDDARYSRVVFNEITKNAIRQAFNKPGELNID
RVNAQQARRFMDRVVGYMVSPLLWKKIARGLSAGRVQSVAVRLVVEREREIKAFVPEEFWEVDASTTTPSGEALALQVTH
QNDKPFRPVNKEQTQAAVSLLEKARYSVLEREDKPTTSKPGAPFITSTLQQAASTRLGFGVKKTMMMAQRLYEAGYITYM
RTDSTNLSQDAVNMVRGYISDNFGKKYLPESPNQYASKENSQEAHEAIRPSDVNVMAESLKDMEADAQKLYQLIWRQFVA
CQMTPAKYDSTTLTVGAGDFRLKARGRILRFDGWTKVMPALRKGDEDRILPAVNKGDALTLVELTPAQHFTKPPARFSEA
SLVKELEKRGIGRPSTYASIISTIQDRGYVRVENRRFYAEKMGEIVTDRLEENFRELMNYDFTAQMENSLDQVANHEAEW
KAVLDHFFSDFTQQLDKAEKDPEEGGMRPNQMVLTSIDCPTCGRKMGIRTASTGVFLGCSGYALPPKERCKTTINLVPEN
EVLNVLEGEDAETNALRAKRRCPKCGTAMDSYLIDPKRKLHVCGNNPTCDGYEIEEGEFRIKGYDGPIVECEKCGSEMHL
KMGRFGKYMACTNEECKNTRKILRNGEVAPPKEDPVPLPELPCEKSDAYFVLRDGAAGVFLAANTFPKSRETRAPLVEEL
YRFRDRLPEKLRYLADAPQQDPEGNKTMVRFSRKTKQQYVSSEKDGKATGWSAFYVDGKWVEGKK
>P55991 5.6.2.1~~~topA~~~DNA topoisomerase 1~~~COG0550
MKHLIIVESPAKAKTIKNFLDKNYEVIASKGHVRDLSKFALGIKIDETGFTPNYVVDKDHKELVKQIIELSKKASITYIA
TDEDREGEAIGYHVACLIGGKLESYPRIVFHEITQNAILNALKTPRKIDMSKVNAQQARRFLDRIVGFKLSSLIASKITK
GLSAGRVQSAALKLVIDKEREIKAFKPLTYFTLDAYFESHLEAQLISYKGNKLKAQELIDEKKAQEIKNELEKESYAISS
IVKKSKKSPTPPPFMTSTLQQSASSLLGFSPTKTMSIAQKLYEGVATPQGVMGVITYMRTDSLNIAKEALEEARNKILKD
YGKDYLPPKAKVYSSKNKNAQEAHEAIRPTSIILEPNALKDYLKPEELRLYTLIYKRFLASQMQDALFESQSVVVACEKG
EFKASGRKLLFDGYYKILGNDDKDKLLPNLKENDPIKLEKLESNAHVTEPPARYSEASLIKVLESLGIGRPSTYAPTISL
LQNRDYIKVEKKQISALESAFKVIEILEKHFEEIVDSKFSASLEEELDNIAQNKADYQQVLKDFYYPFMDKIEAGKKNII
SQKVHEKTGQSCPKCGGELVKKNSRYGEFIACNNYPKCKYVKQTESANDEADQELCEKCGGEMVQKFSRNGAFLACNNYP
ECKNTKSLKNTPNAKETIEGVKCPECGGDIALKRSKKGSFYGCNNYPKCNFLSNHKPINKRCEKCHYLMSERIYRKKKAH
ECIKCKERVFLEEDNG
>A0R5D9 5.6.2.1~~~topA~~~DNA topoisomerase 1~~~COG0550
MAGGDRGSGGTGNVRRLVIVESPTKARKIAGYLGSNYVVESSRGHIRDLPRNAADVPAKFKSEPWARLGVNVDQNFEPLY
IVSPEKKSTVTELKGLLKDVDELYLATDGDREGEAIAWHLLETLKPRVPVKRMVFHEITEPAIRNAAENPRDLDIALVDA
QETRRILDRLYGYEVSPVLWKKVAPKLSAGRVQSVATRIIVQRERERMAFHSASYWDVTAELDASVSDPSASPPKFTAKL
NTVDGRRVATGRDFDSLGQLKRPDEVLVLDEASAGALASGLRGAQLAVTSVEQKPYTRRPYAPFMTSTLQQEAARKLRFS
SERTMSIAQRLYENGYITYMRTDSTTLSESAINAARTQARQLYGEEYVHPSPRQYTRKVKNAQEAHEAIRPAGDVFQTPG
QLHSALDTDEFRLYELIWQRTVASQMADARGTTLSLRIGGSASSGEQVVFNASGRTITFPGFLKAYVESIDELAGGESDD
AESRLPNLTQGQRVDAADLSADGHQTSPPARYTEASLIKALEELGIGRPSTYSSIIKTIQDRGYVQKKGSALVPSWVAFA
VVGLLEQHFGRLVDYDFTAAMEDELDEIANGQEQRTNWLNNFYFGGEHGVEGSIARAGGLKQLVGGNLEGIDAREVNSIK
VFDDSEGRPVYVRVGRNGPYLERMVDDPDNPGEQKPQRANLKEDLTPDELTPELAEKLFATPQEGRSLGIDPETGHEIVA
KDGRFGPYVTEVLPEPEDGGDDGTAGTPAKKGKKPTGPKPRTGSLFRSMDLETVTLEDALKLLSLPRVVGVDPTTNEEIT
AQNGRYGPYLKRGTDSRSLATEDQIFTITLDEALKIYAEPKRRGRQAASAPPLRELGNDPVSGKPMVIKDGRFGPYVTDG
ETNASLRKGDDVLTITDERASELLADRRARGPVKKKAPAKKAAKKAPAKKAAAKKA
>A5U8X0 5.6.2.1~~~topA~~~DNA topoisomerase 1~~~COG0550
MADPKTKGRGSGGNGSGRRLVIVESPTKARKLASYLGSGYIVESSRGHIRDLPRAASDVPAKYKSQPWARLGVNVDADFE
PLYIISPEKRSTVSELRGLLKDVDELYLATDGDREGEAIAWHLLETLKPRIPVKRMVFHEITEPAIRAAAEHPRDLDIDL
VDAQETRRILDRLYGYEVSPVLWKKVAPKLSAGRVQSVATRIIVARERDRMAFRSAAYWDILAKLDASVSDPDAAPPTFS
ARLTAVAGRRVATGRDFDSLGTLRKGDEVIVLDEGSATALAAGLDGTQLTVASAEEKPYARRPYPPFMTSTLQQEASRKL
RFSAERTMSIAQRLYENGYITYMRTDSTTLSESAINAARTQARQLYGDEYVAPAPRQYTRKVKNAQEAHEAIRPAGETFA
TPDAVRRELDGPNIDDFRLYELIWQRTVASQMADARGMTLSLRITGMSGHQEVVFSATGRTLTFPGFLKAYVETVDELVG
GEADDAERRLPHLTPGQRLDIVELTPDGHATNPPARYTEASLVKALEELGIGRPSTYSSIIKTIQDRGYVHKKGSALVPS
WVAFAVTGLLEQHFGRLVDYDFTAAMEDELDEIAAGNERRTNWLNNFYFGGDHGVPDSVARSGGLKKLVGINLEGIDARE
VNSIKLFDDTHGRPIYVRVGKNGPYLERLVAGDTGEPTPQRANLSDSITPDELTLQVAEELFATPQQGRTLGLDPETGHE
IVAREGRFGPYVTEILPEPAADAAAAAQGVKKRQKAAGPKPRTGSLLRSMDLQTVTLEDALRLLSLPRVVGVDPASGEEI
TAQNGRYGPYLKRGNDSRSLVTEDQIFTITLDEALKIYAEPKRRGRQSASAPPLRELGTDPASGKPMVIKDGRFGPYVTD
GETNASLRKGDDVASITDERAAELLADRRARGPAKRPARKAARKVPAKKAAKRD
>P9WG49 5.6.2.1~~~topA~~~DNA topoisomerase 1~~~COG0550
MADPKTKGRGSGGNGSGRRLVIVESPTKARKLASYLGSGYIVESSRGHIRDLPRAASDVPAKYKSQPWARLGVNVDADFE
PLYIISPEKRSTVSELRGLLKDVDELYLATDGDREGEAIAWHLLETLKPRIPVKRMVFHEITEPAIRAAAEHPRDLDIDL
VDAQETRRILDRLYGYEVSPVLWKKVAPKLSAGRVQSVATRIIVARERDRMAFRSAAYWDILAKLDASVSDPDAAPPTFS
ARLTAVAGRRVATGRDFDSLGTLRKGDEVIVLDEGSATALAAGLDGTQLTVASAEEKPYARRPYPPFMTSTLQQEASRKL
RFSAERTMSIAQRLYENGYITYMRTDSTTLSESAINAARTQARQLYGDEYVAPAPRQYTRKVKNAQEAHEAIRPAGETFA
TPDAVRRELDGPNIDDFRLYELIWQRTVASQMADARGMTLSLRITGMSGHQEVVFSATGRTLTFPGFLKAYVETVDELVG
GEADDAERRLPHLTPGQRLDIVELTPDGHATNPPARYTEASLVKALEELGIGRPSTYSSIIKTIQDRGYVHKKGSALVPS
WVAFAVTGLLEQHFGRLVDYDFTAAMEDELDEIAAGNERRTNWLNNFYFGGDHGVPDSVARSGGLKKLVGINLEGIDARE
VNSIKLFDDTHGRPIYVRVGKNGPYLERLVAGDTGEPTPQRANLSDSITPDELTLQVAEELFATPQQGRTLGLDPETGHE
IVAREGRFGPYVTEILPEPAADAAAAAQGVKKRQKAAGPKPRTGSLLRSMDLQTVTLEDALRLLSLPRVVGVDPASGEEI
TAQNGRYGPYLKRGNDSRSLVTEDQIFTITLDEALKIYAEPKRRGRQSASAPPLRELGTDPASGKPMVIKDGRFGPYVTD
GETNASLRKGDDVASITDERAAELLADRRARGPAKRPARKAARKVPAKKAAKRD
>Q7A5Y5 5.6.2.1~~~topA~~~DNA topoisomerase 1~~~
MADNLVIVESPAKAKTIEKYLGKKYKVIASMGHVRDLPRSQMGVDTEDNYEPKYITIRGKGPVVKELKKHAKKAKNVFLA
SDPDREGEAIAWHLSKILELEDSKENRVVFNEITKDAVKESFKNPREIEMNLVDAQQARRILDRLVGYNISPVLWKKVKK
GLSAGRVQSVALRLVIDRENEIRNFKPEEYWTIEGEFRYKKSKFNAKFLHYKNKPFKLKTKKDVEKITAALDGDQFEITN
VTKKEKTRNPANPFTTSTLQQEAARKLNFKARKTMMVAQQLYEGIDLKKQGTIGLITYMRTDSTRISDTAKAEAKQYITD
KYGESYTSKRKASGKQGDQDAHEAIRPSSTMRTPDDMKSFLTKDQYRLYKLIWERFVASQMAPAILDTVSLDITQGDIKF
RANGQTIKFKGFMTLYVETKDDSDSEKENKLPKLEQGDKVTATQIEPAQHYTQPPPRYTEARLVKTLEELKIGRPSTYAP
TIDTIQKRNYVKLESKRFVPTELGEIVHEQVKEYFPEIIDVEFTVNMETLLDKIAEGDITWRKVIDGFFSSFKQDVERAE
EEMEKIEIKDEPAGEDCEVCGSPMVIKMGRYGKFMACSNFPDCRNTKAIVKSIGVKCPKCNDGDVVERKSKKNRVFYGCS
KYPECDFISWDKPIGRDCPKCNQYLVENKKGKTTQVICSNCDYKEAAQK
>P46799 5.6.2.1~~~topA~~~DNA topoisomerase 1~~~COG0550
MSKKVKKYIVVESPAKAKTIKSILGNEYEVFASMGHIIDLPKSKFGVDLEKDFEPEFAVIKGKEKVVEKLKDLAKKGELL
IASDMDREGEAIAWHIARVTNTLGRKNRIVFSEITPRVIREAVKNPREIDMKKVRAQLARRILDRIVGYSLSPVLWRNFK
SNLSAGRVQSATLKLVCDREREILRFVPKKYHRITVNFDGLTAEIDVKEKKFFDAETLKEIQSIDELVVEEKKVSVKKFA
PPEPFKTSTLQQEAYSKLGFSVSKTMMIAQQLYEGVETKDGHIAFITYMRTDSTRVSDYAKEEARNLITEVFGEEYVGSK
RERRKSNAKIQDAHEAIRPTNVFMTPEEAGKYLNSDQKKLYELIWKRFLASQMKPSQYEETRFVLRTKDGKYRFKGTVLK
KIFDGYEKVWKTERNTGEFPFEEGESVKPVVVKIEEQETKPKPRYTEGSLVKEMERLGIGRPSTYASTIKLLLNRGYIKK
IRGYLYPTIVGSVVMDYLEKKYSDVVSVSFTAEMEKDLDEVEQGKKTDKIVLREFYESFSSVFDRNDRIVVDFPTNQKCS
CGKEMRLSFGKYGFYLKCECGKTRSVKNDEIAVIDDGKIFLGRKDSESGSPDGRSVEGKGNLSEKRRKGKKGS
>P14294 5.6.2.1~~~topB~~~DNA topoisomerase 3~~~COG0550
MRLFIAEKPSLARAIADVLPKPHRKGDGFIECGNGQVVTWCIGHLLEQAQPDAYDSRYARWNLADLPIVPEKWQLQPRPS
VTKQLNVIKRFLHEASEIVHAGDPDREGQLLVDEVLDYLQLAPEKRQQVQRCLINDLNPQAVERAIDRLRSNSEFVPLCV
SALARARADWLYGINMTRAYTILGRNAGYQGVLSVGRVQTPVLGLVVRRDEEIENFVAKDFFEVKAHIVTPADERFTAIW
QPSEACEPYQDEEGRLLHRPLAEHVVNRISGQPAIVTSYNDKRESESAPLPFSLSALQIEAAKRFGLSAQNVLDICQKLY
ETHKLITYPRSDCRYLPEEHFAGRHAVMNAISVHAPDLLPQPVVDPDIRNRCWDDKKVDAHHAIIPTARSSAINLTENEA
KVYNLIARQYLMQFCPDAVFRKCVIELDIAKGKFVAKARFLAEAGWRTLLGSKERDEENDGTPLPVVAKGDELLCEKGEV
VERQTQPPRHFTDATLLSAMTGIARFVQDKDLKKILRATDGLGTEATRAGIIELLFKRGFLTKKGRYIHSTDAGKALFHS
LPEMATRPDMTAHWESVLTQISEKQCRYQDFMQPLVGTLYQLIDQAKRTPVRQFRGIVAPGSGGSADKKKAAPRKRSAKK
SPPADEVGSGAIA
>O67108 5.6.2.2~~~gyrA~~~Type 2 topoisomerase subunit A~~~COG0188
MENRENLVEIPIEEEVKQAYIDYAMSVIVGRAIPDVRDGLKPVQRRILYSMYEMGLLPDKPFKKSARIVGETLGKYHPHG
DQAVYEALVRMAQDFTMRYPLIIGQGNFGSIDGDPAAQMRYTEAKLSPLAVEMLTDIDKDTVDFQPNFDDTLMEPEVLPS
KFPNLLCNGTSGIAVGLATSIPPHNLTEVGNALVKLAQNPQISVDEIMEILKGPDFPTGGVIENFAQVKEIYKTGRGIIK
VKGKAHVEKVQGGRERIVITEIPYQVNKAELIKKIADNVRNGKIKEISDIRDETDKEGIRIVVELKRDAKGEEVLKKLYK
YTPLEKGFPVNLVVLIDKEPKLVDIKTLLREFIKHRLEVILRRSKYFLKKVQDRLHIVEGLLKAINFIDDIIERIRRSKD
ASEARNYLMEEFGLSEKQAQAVLDLRLQRLTSLEREKLLEEEKELREKIEYYKKLVASEGERIKVFIEETEELVKKYGDK
RRTFIGGVKEVKEGSITVAVLQDGSIIPVEELPLEKAPVVNILRVPFTEGLFLVSNRGRVYWIAGSQALQGSKVSLKSRE
EKIVGAFIREKFGNRLLLATKKGYVKKIPLAEFEYKAQGMPIIKLTEGDEVVSIASSVDETHILLFTKKGRVARFSVREV
PPSTPGARGVQGIKLEKNDETSGLRIWNGEPYLLVITAKGRVKKISHEEIPKTNRGVKGTEVSGTKDTLVDLIPIKEEVE
LLITTKNGKAFYDKINQKDIPLSTKKSIPRTRWKLEDDEIIKVVIKKSE
>O67137 5.6.2.2~~~gyrB~~~Type 2 topoisomerase subunit B~~~COG0187
MKKRQSQTPQEYTAEAIKAVSGLEHVRLRPAMYIGDIGERGLHHLIWEILDNAVDEAVAGYARNISVTIHRDNSVTVEDD
GRGIPVDIHPETGKPAVEMVFTMLGAGGKFEKKVYTYSGGLHGVGASVVNALSEWLIVEVYRDGKIYRMAFKRGEVVEPL
HVVGETKKRGTKVSFKPDPEIFETTEIKFDIVEKRVRELAYLNPEVKFELTDERLGKHLIYKFDRGIEELVKYLNEGKEP
LFKDIIRIQGEKEGVIVDIAFQYVKDYKERIESFVNNIKTVEGGTHVTGFRSGLSKAVIRMAQGLKLAKELKKSFTGEDV
REGLTAVVACKVPNPQFEGQTKTKLGNQNVKQIVESITYDFLTSYFEKKRDVLKAIVEKAIEAALAREAAKKAKELVRRK
SPLEEGVLPGKLADCSETDPSKCEIFLVEGDSAGGSAKQARDRRYQAILPLRGKIINVEKARIDKVLSNDEIKAIVSALG
CGIGEDLDLKKLRYHKIILMTDADVDGSHIRTLLLTFFYRFMPKLVEEGYVYIAEPPLYRVKKGKKEIYIKDDKEFEHFL
LNEIREKGRLVDAREKEFKGEELVRLLIDLKDYEDAYRALVKSKGENLVNFLLTHRVREEDLRNPARVKEITHLMEEELG
DYRVDTKYNELEGAYDIIFYDDKLGTKTIIDVNFLSSLSYREVLEGIHLHLPVQVFFENKKVEINSLGEIYDKFMDFARS
GMEVQRYKGLGEMNPEQLWETTMNPKTRRLKKVKIEDAAEADRIFTILMGEQVEPRREFIEAYAKEVKHLDV
>A0QPN1 5.6.2.2~~~topoM~~~Topoisomerase subunit TopoM~~~COG0188
MTATLDVPEQNPDLVLDQSADDYWNHYQLTFALYSVSDRAIPSAYDGLKPGQRRLLYQMHDSKLLPGNKPQKSSKVCSAV
TGNLHPHGGASMYGAAALMAADFQRVKVIDGQGAFPRIQGDIPAADRYTEMRLSAPGAALTAELNDHAVPMVPTFDGEWV
EPTVLPAQWPVLLCNGAVGIAEGWATKVPAHNPREVMAACRALLKTPNMTDDRLCKLIPGPDWGSGASVVGTAGLREYIT
TGRGHFTVRGTISVEGKNCIITELPPGVASNTVQDRIRALVESGEMSGVADMSDLTDRRNGLRIVVTAKRGHNAEQIRDQ
LLALTPLESTFAASLVALDENRVPRWWSVRDLIMAFLQLRDSVVLHRSEYRLEKVTARRHLVAGLMKIHLDIDAAVAVIR
GSETVDEARKGLQERFKIDAEQADYVLSLQLRRLTKLDVIELQAEAEKLDAEFAELNDLVTNPESRRKVIDKELVETAKL
FKGPEYDRRTVLDFDATPVSRGDEDGSRERKVNTAWRLDDRGVFSDSHGELLTSGLGWAVWSDGRIKFTTGNGLPFKIRD
IPVAPDITGLVRSGVLPEGYHLALVTRRGKILRVDPASVNPQGVAGNGVAGVKLAADGDEVIAALPVSCANGEAILSIAE
KSWKVTEVADIPVKGRGGAGVAFHPFVKGEDALLAASISKTGYVRGKRAVRPENRAKASIKGSGADVTPAPAE
>A0QPN2 5.6.2.2~~~topoN~~~Topoisomerase subunit TopoN~~~COG0187
MSDPASTIPPAHHPTFWRERAVSYTAADITELDDVQHTRLRPAVNLGLDVLNTALREIVDNAIEEVADPGHGGSTVTITL
HADGSVSVADDGRGLPVDTDPTTGKNGIVKTLGTARAGGKFSAHKDATSTGAGLNGIGAAAAVFISARTDVTVRRDGKTF
LQSFGRGYPGVFEGKEFDPEAPFTRNDTQKLRGVSNRKPDLHGTEVRILFDPAIAPDSTLDIGEVLLRAHAAARMSPGVH
LVVVDEGWPGEEVPPAVLEPFSGPWGTDTLLDLMCTAAGTPLPEVRAVVEGRGEYTTGRGPTPFRWSLTAGPAEPATVAA
FCNTVRTPGGGSHLTAAIKGLSEALAERASRMRDLGLAKNEEGPEPQDFAAVTALAVDTRAPDVAWDSQAKTAVSSRSLN
LAMAPDVARSVTIWAANPANADTVTLWSKLALESARARRSAEGAKARARAASKAKGLGTNLSLPPKLLPSRESGRGSGAE
LFLCEGDSALGTIKAARDATFQAAFPLKGKPPNVYGFPLNKARAKDEFDAIERILGCGVRDHCDPELCRYDRILFASDAD
PDGGNINSSLISMFLDFYRPLVEAGMVYVTMPPLFVVKAGDERIYCQDESERDAAVAQLKASSNRRVEVQRNKGLGEMDA
DDFWNTVLDPQRRTVIRVRPDESEKKLHHTLFGGPPEGRRTWMADVAARVDTSALDLT
>P33225 1.7.2.3~~~torA~~~Trimethylamine-N-oxide reductase 1~~~COG0243
MNNNDLFQASRRRFLAQLGGLTVAGMLGPSLLTPRRATAAQAATDAVISKEGILTGSHWGAIRATVKDGRFVAAKPFELD
KYPSKMIAGLPDHVHNAARIRYPMVRVDWLRKRHLSDTSQRGDNRFVRVSWDEALDMFYEELERVQKTHGPSALLTASGW
QSTGMFHNASGMLAKAIALHGNSVGTGGDYSTGAAQVILPRVVGSMEVYEQQTSWPLVLQNSKTIVLWGSDLLKNQQANW
WCPDHDVYEYYAQLKAKVAAGEIEVISIDPVVTSTHEYLGREHVKHIAVNPQTDVPLQLALAHTLYSENLYDKNFLANYC
VGFEQFLPYLLGEKDGQPKDAAWAEKLTGIDAETIRGLARQMAANRTQIIAGWCVQRMQHGEQWAWMIVVLAAMLGQIGL
PGGGFGFGWHYNGAGTPGRKGVILSGFSGSTSIPPVHDNSDYKGYSSTIPIARFIDAILEPGKVINWNGKSVKLPPLKMC
IFAGTNPFHRHQQINRIIEGLRKLETVIAIDNQWTSTCRFADIVLPATTQFERNDLDQYGNHSNRGIIAMKQVVPPQFEA
RNDFDIFRELCRRFNREEAFTEGLDEMGWLKRIWQEGVQQGKGRGVHLPAFDDFWNNKEYVEFDHPQMFVRHQAFREDPD
LEPLGTPSGLIEIYSKTIADMNYDDCQGHPMWFEKIERSHGGPGSQKYPLHLQSVHPDFRLHSQLCESETLRQQYTVAGK
EPVFINPQDASARGIRNGDVVRVFNARGQVLAGAVVSDRYAPGVARIHEGAWYDPDKGGEPGALCKYGNPNVLTIDIGTS
QLAQATSAHTTLVEIEKYNGTVEQVTAFNGPVEMVAQCEYVPASQVKS
>O87948 1.7.2.3~~~torA~~~Trimethylamine-N-oxide reductase~~~
MNRRDFLKGIASSSFVVLGGSSVLTPLNALAKAGINEDEWLTTGSHFGAFKMKRKNGVIAEVKPFDLDKYPTDMINGIRG
MVYNPSRVRYPMVRLDFLLKGHKSNTHQRGDFRFVRVTWDKALTLFKHSLDEVQTQYGPSGLHAGQTGWRATGQLHSSTS
HMQRAVGMHGNYVKKIGDYSTGAGQTILPYVLGSTEVYAQGTSWPLILEHSDTIVLWSNDPYKNLQVGWNAETHESFAYL
AQLKEKVKQGKIRVISIDPVVTKTQAYLGCEQLYVNPQTDVTLMLAIAHEMISKKLYDDKFIQGYSLGFEEFVPYVMGTK
DGVAKTPEWAAPICGVEAHVIRDLAKTLVKGRTQFMMGWCIQRQQHGEQPYWMAAVLATMIGQIGLPGGGISYGHHYSSI
GVPSSGAAAPGAFPRNLDENQKPLFDSSDFKGASSTIPVARWIDAILEPGKTIDANGSKVVYPDIKMMIFSGNNPWNHHQ
DRNRMKQAFHKLECVVTVDVNWTATCRFSDIVLPACTTYERNDIDVYGAYANRGILAMQKMVEPLFDSLSDFEIFTRFAA
VLGKEKEYTRNMGEMEWLETLYNECKAANAGKFEMPDFATFWKQGYVHFGDGEVWTRHADFRNDPEINPLGTPSGLIEIF
SRKIDQFGYDDCKGHPTWMEKTERSHGGPGSDKHPIWLQSCHPDKRLHSQMCESREYRETYAVNGREPVYISPVDAKARG
IKDGDIVRVFNDRGQLLAGAVVSDNFPKGIVRIHEGAWYGPVGKDGSTEGGAEVGALCSYGDPNTLTLDIGTSKLAQACS
AYTCLVEFEKYQGKVPKVSSFDGPIEVEI
>P33226 ~~~torC~~~Cytochrome c-type protein TorC~~~COG3005
MRKLWNALRRPSARWSVLALVAIGIVIGIALIVLPHVGIKVTSTTEFCVSCHSMQPVYEEYKQSVHFQNASGVRAECHDC
HIPPDIPGMVKRKLEASNDIYQTFIAHSIDTPEKFEAKRAELAEREWARMKENNSATCRSCHNYDAMDHAKQHPEAARQM
KVAAKDNQSCIDCHKGIAHQLPDMSSGFRKQFDELRASANDSGDTLYSIDIKPIYAAKGDKEASGSLLPASEVKVLKRDG
DWLQIEITGWTESAGRQRVLTQFPGKRIFVASIRGDVQQQVKTLEKTTVADTNTEWSKLQATAWMKKGDMVNDIKPIWAY
ADSLYNGTCNQCHGAPEIAHFDANGWIGTLNGMIGFTSLDKREERTLLKYLQMNASDTAGKAHGDKKEEK
>P36662 ~~~torD~~~Chaperone protein TorD~~~COG3381
MTTLTAQQIACVYAWLAQLFSRELDDEQLTQIASAQMAEWFSLLKSEPPLTAAVNELENRIATLTVRDDARLELAADFCG
LFLMTDKQAALPYASAYKQDEQEIKRLLVEAGMETSGNFNEPADHLAIYLELLSHLHFSLGEGTVPARRIDSLRQKTLTA
LWQWLPEFVARCRQYDSFGFYAALSQLLLVLVECDHQNR
>O87949 ~~~torD~~~Chaperone protein TorD~~~
MSQVDINHARALVYQLLSSLFAREVDEQRLKELTSEAAQQFWEQLSLEANFTQSVDKIRSTLNGIKDDEALLELAADYCG
LFLVGTKHSASPYASLYLSGEDEPLLFGEQHQQMSEFLHQSKLQVQSHFPEPADHLAVMLAYMAHLFCHSENSVQLSFLQ
TCFNSWLAKFINHLTQCNKNGFYSAVATLTLAWVKQDIAQLEPAVAIIS
>Q2EES9 ~~~torI~~~Response regulator inhibitor for tor operon~~~COG3311
MQHELQPDSLVDLKFIMADTGFGKTFIYDRIKSGDLPKAKVIHGRARWLYRDHCEFKNKLLSRANG
>P38684 ~~~torR~~~TorCAD operon transcriptional regulatory protein TorR~~~COG0745
MPHHIVIVEDEPVTQARLQSYFTQEGYTVSVTASGAGLREIMQNQSVDLILLDINLPDENGLMLTRALRERSTVGIILVT
GRSDRIDRIVGLEMGADDYVTKPLELRELVVRVKNLLWRIDLARQAQPHTQDNCYRFAGYCLNVSRHTLERDGEPIKLTR
AEYEMLVAFVTNPGEILSRERLLRMLSARRVENPDLRTVDVLIRRLRHKLSADLLVTQHGEGYFLAADVC
>P39453 2.7.13.3~~~torS~~~Sensor protein TorS~~~COG0784
MNLTLTRRLWMGFALMALLTLTSTLVGWYNLRFISQVEKDNTQALIPTMNMARQLSEASAWELFAAQNLTSADNEKMWQA
QGRMLTAQSLKINALLQALREQGFDTTAIEQQEQEISRSLRQQGELVGQRLQLRQQQQQLSQQIVAAADEIARLAQGQAN
NATTSAGATQAGIYDLIEQDQRQAAESALDRLIDIDLEYVNQMNELRLSALRVQQMVMNLGLEQIQKNAPTLEKQLNNAV
KILQRRQIRIEDPGVRAQVATTLTTVSQYSDLLALYQQDSEISNHLQTLAQNNIAQFAQFSSEVSQLVDTIELRNQHGLA
HLEKASARGQYSLLLLGMVSLCALILILWRVVYRSVTRPLAEQTQALQRLLDGDIDSPFPETAGVRELDTIGRLMDAFRS
NVHALNRHREQLAAQVKARTAELQELVIEHRQARAEAEKASQAKSAFLAAMSHEIRTPLYGILGTAQLLADNPALNAQRD
DLRAITDSGESLLTILNDILDYSAIEAGGKNVSVSDEPFEPRPLLESTLQLMSGRVKGRPIRLATAIADDMPCALMGDPR
RIRQVITNLLSNALRFTDEGYIILRSRTDGEQWLVEVEDSGCGIDPAKLAEIFQPFVQVSGKRGGTGLGLTISSRLAQAM
GGELSATSTPEVGSCFCLRLPLRVATAPVPKTVNQAVRLDGLRLLLIEDNPLTQRITIEMLKTSGAQIVAVGNAAQALET
LQNSEPFAAALVDFDLPDIDGITLARQLAQQYPSLVLIGFSAHVIDETLRQRTSSLFRGIIPKPVPREVLGQLLAHYLQL
QVNNDQSLDVSQLNEDAQLMGTEKIHEWLVLFTQHALPLLDEIDIARASQDSEKIKRAAHQLKSSCSSLGMHIASQLCAQ
LEQQPLSAPLPHEEITRSVAALEAWLHKKDLNAI
>P38683 ~~~torT~~~Periplasmic protein TorT~~~COG1879
MRVLLFLLLSLFMLPAFSADNLLRWHDAQHFTVQASTPLKAKRAWKLCALYPSLKDSYWLSLNYGMQEAARRYGVDLKVL
EAGGYSQLATQQAQIDQCKQWGAEAILLGSSTTSFPDLQKQVASLPVIELVNAIDAPQVKSRVGVPWFQMGYQPGRYLVQ
WAHGKPLNVLLMPGPDNAGGSKEMVEGFRAAIAGSPVRIVDIALGDNDIEIQRNLLQEMLERHPEIDVVAGTAIAAEAAM
GEGRNLKTPLTVVSFYLSHQVYRGLKRGRVIMAASDQMVWQGELAVEQAIRQLQGQSVSDNVSPPILVLTPKNADREHIR
RSLSPGGFRPVYFYQHTSAAKK
>P52005 ~~~torY~~~Cytochrome c-type protein TorY~~~COG3005
MRGKKRIGLLFLLIAVVVGGGGLLLAQKVLHKTSDTAFCLSCHSMSKPFEEYQGTVHFSNQKGIRAECADCHIPKSGMDY
LFAKLKASKDIYHEFVSGKIDSDDKFEAHRQEMAETVWKELKATDSATCRSCHSFDAMDIASQSESAQKMHNKAQKDSET
CIDCHKGIAHFPPEIKMDDNAAHELESQAATSVTNGAHIYPFKTSHIGELATVNPGTDLTVVDASGKQPIVLLQGYQMQG
SENTLYLAAGQRLALATLSEEGIKALTVNGEWQADEYGNQWRQASLQGALTDPALADRKPLWQYAEKLDDTYCAGCHAPI
AADHYTVNAWPSIAKGMGARTSMSENELDILTRYFQYNAKDITEKQ
>P46923 1.7.2.3~~~torZ~~~Trimethylamine-N-oxide reductase 2~~~COG0243
MTLTRREFIKHSGIAAGALVVTSAAPLPAWAEEKGGKILTAGRWGAMNVEVKDGKIVSSTGALAKTIPNSLQSTAADQVH
TTARIQHPMVRKSYLDNPLQPAKGRGEDTYVQVSWEQALKLIHEQHDRIRKANGPSAIFAGSYGWRSSGVLHKAQTLLQR
YMNLAGGYSGHSGDYSTGAAQVIMPHVVGSVEVYEQQTSWPLILENSQVVVLWGMNPLNTLKIAWSSTDEQGLEYFHQLK
KSGKPVIAIDPIRSETIEFFDDNATWIAPNMGTDVALMLGIAHTLMTQGKHDKVFLEKYTTGYPQFEEYLTGKSDNTPKS
AVWAAEITGVPEAQIVKLAELMAANRTMLMAGWGIQRQQYGEQKHWMLVTLAAMLGQIGTPGGGFGFSYHYSNGGNPTRV
GGVLPEMSAAIAGHASEAADDGGMTAIPVARIVDALENPGGKYQHNGKEQTYPNIKMIWWAGGGNFTHHQDTNRLIKAWQ
KPEMIVVSECYWTAAAKHADIVLPITTSFERNDLTMTGDYSNQHIVPMKQAVAPQFEARNDFDVFADLAELLKPGGKEIY
TEGKDEMAWLKFFYDAAQKGARAQRVTMPMFNAFWQQNKLIEMRHSEKNEQYVRYGDFRADPVKNALGTPSGKIEIYSKT
LEKFGYKDCPAHPTWLAPDEWKGTADEKQLQLLTAHPAHRLHSQLNYAELRKKYAIADREPITIHTEDAARFGIANGDLV
RVWNKRGQILTGAVVTDGIKKGVVCVHEGAWPDLENGLCKNGSANVLTADIPSSQLANACAGNSALVYIEKYTGNAPKLT
AFDQPAVQA
>P44798 1.7.2.3~~~torZ~~~Trimethylamine-N-oxide reductase~~~COG0243
MKKNNVNEQRRDFLKKTSLGVAGSALSGGMVGVVSKSAVAKEAEMKTVVTAAHWGSIGVVVQDGKVVKSGPAIEPAVPNE
LQTVVADQLYSERRVKCPMVRKGFLANPGKSDTTMRGRDEWVRVSWDEALDLVHNQLKRVRDEHGSTGIFAGSYGWFSCG
SLHASRTLLQRYMNATGGFVGHKGDYSTGAAQVIMPHVLGTIEVYEQQTSWESILESSDIIVLWSANPLTTMRIAWMSTD
QKGIEYFKKFQASGKRIICIDPQKSETCQMLNAEWIPVNTATDVPLMLGIAHTLVEQGKHDKDFLKKYTSGYAKFEEYLL
GKTDGQPKTAEWAAKICGVPAETIKQLAADFASKRTMLMGGWGMQRQRHGEQTHWMLVTLASMLGQIGLPGGGFGLSYHY
SNGGVPTATGGIIGSITASPSGKAGAKTWLDDTSKSAFPLARIADVLLHPGKKIQYNGTEITYPDIKAVYWAGGNPFVHH
QDTNTLVKAFQKPDVVIVNEVNWTPTARMADIVLPATTSYERNDLTMAGDYSMMSVYPMKQVVPPQFEAKNDYDIFVELA
KRAGVEEQYTEGKTEMEWLEEFYNAAFSAARANRVAMPRFDKFWAENKPLSFEAGEAAKKWVRYGEFREDPLLNPLGTPS
GKIEIFSDVVEKMNYNDCKGHPSWMEPEEFAGNVTEEYPLALVTPHPYYRLHSQLAHTSLRQKYAVNDREPVMIHPEDAA
ARGIKDGDIVRIHSKRGQVLAGAAVTENIIKGTVALHEGAWYDPMYLGESEKPLCKNGCANVLTRDEGTSKLAQGNSPNT
CIVQIEKFIGVAPEVTVFKQPKQVA
>Q813X6 3.1.-.-~~~~~~Toxin BC_0920~~~
MSLNMYLGEVQGQTQSMNAVCNATIQGMEQVIQSIDAFAIDTVLQGQTYSSAKSFFVQTFRPLAQGIIYLCEELIRQNDA
FPSQFQSQVASTDVIEQEILEQIREIDRMKASMEAISQAMPIPGMDAMANLFTVMRKKLQEKLDHLYQFNQTSSNNYSTA
LQLAASIAAGLAEVQSGKGFSPASGTFSTQGLNMEWTTSIQAITEERARQAANSIEEGEMCGKLPEKSTGEKIWDGIVEG
TGQAVSDTIDGIKALGDWETWENMGNAALHPIDTLSTMYNTLSDSFINDVINGDAESRAKWGSYALTQVGLGLIGDKGLS
KASKLGQAGKVTKLAKNKIPQAVSHITSNLQMGDRFAFAGGNSLRFRFDTPDFKKAEEKLSTYQFARGESNYGGSNFVNE
NHRSSLSNREIISNLQHTEKFRPNTLKHILEGEINWRGDAMGYHTEVLENTPGKIISGTEEILNDQGIYKARVEVNGTPK
TGNRGFSTFFPKDWSPQKIVDNINEAYNNRTYEFGNTYSGIGSEGIRISMYIDGNGKIISAFPAE
>P04977 2.4.2.-~~~ptxA~~~Pertussis toxin subunit 1~~~
MRCTRAIRQTARTGWLTWLAILAVTAPVTSPAWADDPPATVYRYDSRPPEDVFQNGFTAWGNNDNVLDHLTGRSCQVGSS
NSAFVSTSSSRRYTEVYLEHRMQEAVEAERAGRGTGHFIGYIYEVRADNNFYGAASSYFEYVDTYGDNAGRILAGALATY
QSEYLAHRRIPPENIRRVTRVYHNGITGETTTTEYSNARYVSQQTRANPNPYTSRRSVASIVGTLVRMAPVIGACMARQA
ESSEAMAAWSERAGEAMVLVYYESIAYSF
>P04978 ~~~ptxB~~~Pertussis toxin subunit 2~~~
MPIDRKTLCHLLSVLPLALLGSHVARASTPGIVIPPQEQITQHGGPYGRCANKTRALTVAELRGSGDLQEYLRHVTRGWS
IFALYDGTYLGGEYGGVIKDGTPGGAFDLKTTFCIMTTRNTGQPATDHYYSNVTATRLLSSTNSRLCAVFVRSGQPVIGA
CTSPYDGKYWSMYSRLRKMLYLIYVAGISVRVHVSKEEQYYDYEDATFETYALTGISICNPGSSLC
>P04979 ~~~ptxC~~~Pertussis toxin subunit 3~~~
MLINNKKLLHHILPILVLALLGMRTAQAVAPGIVIPPKALFTQQGGAYGRCPNGTRALTVAELRGNAELQTYLRQITPGW
SIYGLYDGTYLGQAYGGIIKDAPPGAGFIYRETFCITTIYKTGQPAADHYYSKVTATRLLASTNSRLCAVFVRDGQSVIG
ACASPYEGRYRDMYDALRRLLYMIYMSGLAVRVHVSKEEQYYDYEDATFQTYALTGISLCNPAASIC
>P0A3R5 ~~~ptxD~~~Pertussis toxin subunit 4~~~
MLRRFPTRTTAPGQGGARRSRVRALAWLLASGAMTHLSPALADVPYVLVKTNMVVTSVAMKPYEVTPTRMLVCGIAAKLG
AAASSPDAHVPFCFGKDLKRPGSSPMEVMLRAVFMQQRPLRMFLGPKQLTFEGKPALELIRMVECSGKQDCP
>P04981 ~~~ptxE~~~Pertussis toxin subunit 5~~~
MQRQAGLPLKANPMHTIASILLSVLGIYSPADVAGLPTHLYKNFTVQELALKLKGKNQEFCLTAFMSGRSLVRACLSDAG
HEHDTWFDTMLGFAISAYALKSRIALTVEDSPYPGTPGDLLELQICPLNGYCE
>P17452 ~~~toxA~~~Dermonecrotic toxin~~~
MKTKHFFNSDFTVKGKSADEIFRRLCTDHPDKQLNNVKWKEVFINRFGQMMLDTPNPRKIVEKIINEGLEKQGLKNIDPE
TTYFNIFSSSDSSDGNVFHYNSLSESYRVTDACLMNIFVERYFDDWDLLNSLASNGIYSVGKEGAYYPDHDYGPEYNPVW
GPNEQIYHSRVIADILYARSVWDEFKKYFMEYWQKYAQLYTEMLSDTFLAMAIQQYTRQTLTDEGFLMVCNTYYGNKEEV
QITLLDIYGYPSTDIICIEQKGLPTPKVILYIPGGTQPFVEFLNTDDLKQWIAWHLKDNKHMVAFRKHFSLKQRQEGETF
TGIDKALQYIAEESPEWPANKYILYNPTHLETENLFNIMMKRTEQRMLEDSDVQIRSNSEATRDYALSLLETFISQLSAI
DMLVPAVGIPINFALSATALGLSSDIVVNGDSYEKRKYGIGSLVQSALFTGINLIPVISETAEILSSFSRTEEDIPAFFT
EEQALAQRFEIVEEELHSISPDDPPREITDENLHKIRLVRLNNENQPLVVLRRLGGNKFIRIEPITFQEIKGSLVSEVIN
PVTNKTYYVSNAKLLGGSPYSPFRIGLEGVWTPEVLKARASVIGKPIGESYKRILAKLQRIHNSNILDERQGLMHELMEL
IDLYEESQPSSERLNAFRELRTQLEKALYLPEMEALKKQILQIPNKGSGAARFLLRTAMNEMAGKTSESTADLIRFALQD
TVISAPFRGYAGAIPEAIDFPVKYVIEDISVFDKIQTNYWELPAYESWNEGSNSALLPGLLRESQSKGMLSKCRIIENSL
YIGHSYEEMFYSISPYSNQVGGPYELYPFTFFSMLQEVQGDLGFEQAFATRNFFNTLVSDRLSLMENTMLLTESFDYTPW
DAIYGDINYDEQFAAMSINERIEKCMNTYRGVAFQNSSKSIDFFLNNLTTFIDNGLTEIAISDLPYDIVQQEISQFLQGS
NEWKTLDAMLFNLDKGDINGAFRKLLQSAKDNNIKFRAIGHSDNSVPPFNNPYKSLYYKGNIIAEAIEKLDREGQKFVVF
ADSSLLNSTPGTGRPMPGLVQYLKIPATVVDSDGAWQFLPDVASSRVPIEVTELENWQVLTPPQGKILGLKQFKLTAGFP
TEQSRLPLLENSVSEDLREELMQKIDAIKNDVKMNSLVCMEAGSCDSVSPKVAARLKDMGLEAGMGASITWWRREGGMEF
SHQMHTTASFKFAGKEFAVDASHLQFVHDQLDTTILILPVDDWALEIAQRNRAINPFVEYVSKTGNMLALFMPPLFTKPR
LTRAL
>P11439 2.4.2.36~~~eta~~~Exotoxin A~~~
MHLTPHWIPLVASLGLLAGGSFASAAEEAFDLWNECAKACVLDLKDGVRSSRMSVDPAIADTNGQGVLHYSMVLEGGNDA
LKLAIDNALSITSDGLTIRLEGGVEPNKPVRYSYTRQARGSWSLNWLVPIGHEKPSNIKVFIHELNAGNQLSHMSPIYTI
EMGDELLAKLARDATFFVRAHESNEMQPTLAISHAGVSVVMAQAQPRREKRWSEWASGKVLCLLDPLDGVYNYLAQQRCN
LDDTWEGKIYRVLAGNPAKHDLDIKPTVISHRLHFPEGGSLAALTAHQACHLPLETFTRHRQPRGWEQLEQCGYPVQRLV
ALYLAARLSWNQVDQVIRNALASPGSGGDLGEAIREQPEQARLALTLAAAESERFVRQGTGNDEAGAASADVVSLTCPVA
AGECAGPADSGDALLERNYPTGAEFLGDGGDISFSTRGTQNWTVERLLQAHRQLEERGYVFVGYHGTFLEAAQSIVFGGV
RARSQDLDAIWRGFYIAGDPALAYGYAQDQEPDARGRIRNGALLRVYVPRSSLPGFYRTGLTLAAPEAAGEVERLIGHPL
PLRLDAITGPEEEGGRLETILGWPLAERTVVIPSAIPTDPRNVGGDLDPSSIPDKEQAISALPDYASQPGKPPREDLK
>Q3YN09 3.1.-.-~~~toxN~~~Endoribonuclease ToxN~~~
MTNKDNPKFHTISTEYIDYLREADSKVPFNKDEQHSRPYVGVLEKINGHDYFVPLTSRNDKNFNSQVSVKLFDNDEKRIG
VLLVNNMIPVPEKECKEIDIAEKTAADPQYGNLMLKQYLFLKENMDRVTNKVEKVYKDVTVQGKPSHKQKFLKGVCCDFP
KLEEKCQEYKERDQAKERDKARRIAYMRQMGRER
>B8X8Z0 3.1.-.-~~~toxN~~~Endoribonuclease ToxN~~~
MKFYTISSKYIEYLKEFDDKVPNSEDPTYQNPKAFIGIVLEIQGHKYLAPLTSPKKWHNNVKESSLSCFKLHENGVPENQ
LGLINLKFMIPIIEAEVSLLDLGNMPNTPYKRMLYKQLQFIRANSDKIASKSDTLRNLVLQGKMQGTCNFSLLEEKYRDF
GKEAEDTEEGE
>P09852 ~~~toxR~~~Exotoxin A regulatory protein~~~
MTATDRTPPPLKWLCLGNRDANDGFELFAHGIYARNGALVGSKLSLRERRQRVDLSAFLSGAPPLLAEAAVKHLLARLLC
VHRHNTDLELLGKNFIPLHASSLGNAGVCERILASARQLQQHQVELCLLLAIDEQEPASAEYLTSLARLRDSGVRIALHP
QRIDTDARQCFAEVDAGLCDYLGLDARLLAPGPLTRNLRQRKSIEYLNRLLVAQDIQMLCLNVDNEELHQQANALPFAFR
HGRHYSEPFQAWPFSSPAC
>P15795 ~~~toxR~~~Cholera toxin transcriptional activator~~~COG3710
MFGLGHNSKEISMSHIGTKFILAEKFTFDPLSNTLIDKEDSEEIIRLGSNESRILWLLAQRPNEVISRNDLHDFVWREQG
FEVDDSSLTQAISTLRKMLKDSTKSPQYVKTVPKRGYQLIARVETVEEEMARESEAAHDISQPESVNEYAESSSVPSSAT
VVNTPQPANVVTNKSAPNLGNRLLILIAVLLPLAVLLLTNPSQTSFKPLTVVDGVAVNMPNNHPDLSNWLPSIELCVKKY
NEKHTGGLKPIEVIATGGQNNQLTLNYIHSPEVSGENITLRIVANPNDAIKVCE
>P24003 ~~~toxS~~~Transmembrane regulatory protein ToxS~~~
MQNRHIAMGILLLSLLLSSWLYWGSDFKLEQVLTSREWQSKMVSLIKTNSNRPAMGPLSRVDVTSNVKYLPNGTYLRVSI
VKLFSDDNSAESVINISEFGEWDISDNYLLVTPVEFKDISSNQSKDFTDEQLQLITQLFKMDAQQSRRVDIVNERTILFT
SLSHGSTVLFSNS
>O83346 ~~~~~~Putative outer membrane protein assembly factor TP_0326~~~COG4775
MLKKASAFLIASCCVMSLAWAQANDNWYEGKPISAISFEGLEYIARGQLDTIFSQYKGQKWTYELYLEILQKVYDLEYFS
EVSPKAVPTDPEYQYVMLQFTVKERPSVKGIKMVGNSQIRSGDLLSKILLKKGDIYNEVKMKVDQESLRRHYLDQGYAAV
KISCEAKTEAGGVVVQFTIQEGKQTVVSRIQFKGNKAFTESVLKKVLSTQEARFLTSGVFKENALEADKAAVHSYYAERG
YIDARVEGVAKTVDKKTDASRNLVTLTYTVVEGEQYRYGGVTIVGNQIFSTEELQAKIRLKRGAIMNMVAFEQGFQALAD
AYFENGYTSNYLNKEEHRDTAEKTLSFKITVVERERSHVEHIIIKGTKNTKDEVILREMLLKPGDVFSKSKFTDSLRNLF
NLRYFSSLVPDVRPGSEQDLVDIILNVEEQSTANVQFGVTFSGVGEAGTFPLSLFCQWEEKNFLGKGNEISVNATLGSEA
QSLKLGYVERWFLGSPLTVGFDFELTHKNLFVYRAGSYGNGLPHPYTSREQWASSPGLAESFRLKYSRFESAIGAHTGYQ
WYPRYAVIRVNGGVDFRVVKNFYDKDNNQPFDLTVKEQLNWTSINSFWTSVSFDGRDFAYDPSSGWFLGQRCTFNGLVPF
LEKEHSFRSDTKAEFYVTLLNYPVSAVWNLKFVLAFYTGVSVQTYYGRRKSENGKGNGVRSGALVIDGVLVGRGWSEDAK
KNTGDLLLHHWIEFRWPLAHGIVSFDFFFDAAMVYNIESQSPNGSSSASSSSSSSSSSSRTTSSEGLYKMSYGPGLRFTL
PQFPLKLAFANTFTSPGGIPKTKKNWNFVLSFTVNNL
>O67998 ~~~~~~Outer membrane protein TP0453~~~
MIRRRYRGCTQGAWIVSVGMLFASCTSGAWKASVDPLGVVGSGADVYLYFPVAGNENLISRIIENHESKADIKKIVDRTT
AVYGAFFARSKEFRLFGSGSYPYAFTNLIFSRSDGWASTKTEHGITYYESEHTDVSIPAPHFSCVIFGSSKRERMSKMLS
RLVNPDRPQLPPRFEKECTSEGTSQTVALYIKNGGHFITKLLNFPQLNLPLGAMELYLTARRNEYLYTLSLQLGNAKINF
PIQFLISRVLNAHIHVEGDRLIIEDGTISAERLASVISSLYSKKGSS
>P83615 3.4.11.-~~~ptp~~~Prolyl tri/tetrapeptidyl aminopeptidase~~~
MRKALRSLLAASMLIGAIGAGSATAEAASITAPQADIKDRILKIPGMKFVEEKPYQGYRYLVMTYRQPVDHRNPGKGTFE
QRFTLLHKDTDRPTVFFTSGYNVSTNPSRSEPTRIVDGNQVSMEYRFFTPSRPQPADWSKLDIWQAASDQHRLYQALKPV
YGKNWLATGGSKGGMTATYFRRFYPNDMNGTVAYVAPNDVNDKEDSAYDKFFQNVGDKACRTQLNSVQREALVRRDEIVA
RYEKWAKENGKTFKVVGSADKAYENVVLDLVWSFWQYHLQSDCASVPATKASTDELYKFIDDISGFDGYTDQGLERFTPY
YYQAGTQLGAPTVKNPHLKGVLRYPGINQPRSYVPRDIPMTFRPGAMADVDRWVREDSRNMLFVYGQNDPWSGEPFRLGK
GAAARHDYRFYAPGGNHGSNIAQLVADERAKATAEVLKWAGVAPQAVQKDEKAAKPLAPFDAKLDRVKNDKQSALRP
>E5Y945 2.6.1.77~~~tpa~~~Taurine--pyruvate aminotransferase~~~COG0161
MTYDKAELVALDKKYVWHHLTQHKNFEPAIYVKGEGMRITDIDGKTYLDAVSGGVWTVNVGYGRKEIVDAVAKQMMEMCY
FANGIGNVPTIKFSEKLISKMPGMSRVYLSNSGSEANEKAFKIVRQIGQLKHGGKKTGILYRARDYHGTTIGTLSACGQF
ERKVQYGPFAPGFYEFPDCDVYRSKFGDCADLGVKMAKQLEEVILTVGPDELGAVIVEPMTAGGGILVPPAGYYETIREI
CDKYELLLIIDEVVCGLGRTGKWFGYQHFNVQPDIVTMAKGVASGYAPISCTVTTEKVFQDFVNDPADTDAYFRDISTFG
GCTSGPAAALANIEIIERENLLENCTKMGDRLLEGLKGLMAKHPIIGDVRGKGLFAGIEIVKDRATKEPIAEAVANAMVG
AAKQAGVLIGKTSRSFREFNNTLTLCPALIATEADIDEIVAGIDKAFTTVEQKFGL
>Q9APM5 2.6.1.77~~~tpa~~~Taurine--pyruvate aminotransferase~~~
MTYDKAELVALDKKYVWHHLTQHKNFEPAIYVKGEGMRITDIDGKTYLDAVSGGVWTVNVGYGRKEIVDAVAKQMMEMCY
FANGIGNVPTIKFSEKLISKMPGMSRVYLSNSGSEANEKAFKIVRQIGQLKHGGKKTGILYRARDYHGTTIGTLSACGQF
ERKVQYGPFAPGFYEFPDCDVYRSKFGDCADLGVKMAKQLEEVILTVGPDELGAVIVEPMTAGGGILVPPAGYYETIREI
CDKYELLLIIDEVVCGLGRTGKWFGYQHFNVQPDIVTMAKGVASGYAPISCTVTTEKVFQDFVNDPADTDAYFRDISTFG
GCTSGPAAALANIEIIERENLLENCTKMGDRLLEGLKGLMAKHPIIGDVRGKGLFAGIEIVKDRATKEPIAEAVANAMVG
AAKQAGVLIGKTSRSFREFNNTLTLCPALIATEADIDEIVAGIDKAFTTVEQKFGL
>D5AKY0 2.6.1.77~~~tpa~~~Taurine--pyruvate aminotransferase~~~COG0161
MHAAPIPQDAAPIIAADRAHVWHHLSQHKPYETSDPRVFVEGRGMRLWDATGREFLDATSGGVWTVNLGYGRKDVVEAVA
AQLLALPYYAGAAGTVPGARYAEALIAKMPGLSRVYYSNSGSEANEKVYKMVRQISHRHHGGRKGKILFRERDYHGTTIA
ALATSGQAQRAEHYGPFPDGFVSVPHCLEYRAQWDCANYGERAADAIEEVILREGPDSIGCLVLEPITAGGGVIVPPAGY
WEKVSHICRKYNILLHLDEVVCGLGRTGAWFGYQHYGIQPDFVTMAKGVAAGYAAISCTVTTEAVFELFKDAPSDPLCHF
RDISTFGGCTAGPAAALETLRIIEEEGLLQNTAQMGERLLANLRDLAERHAVIGDVRGKGLFCGAELVADRRTKEPLAEA
KVQAVVADCAAQGVLIGATNRSIPGLNTTLCLAPALIASEAEIDRITETIDAALRRLAA
>A0A0H2ZFK2 3.1.3.16~~~tpbA~~~Dual specificity protein phosphatase TpbA~~~
MHRSPLAWLRLLLAAVLGAFLLGGPLHAAETAAPRSPAWAQAVDPSINLYRMSPTLYRSALPNAQSVALLQRLQVKTVVS
FIKDDDRAWLGQAPVRVVSLPTHADRVDDAEVLSVLRQLQAAEREGPVLMHCKHGNNRTGLFAAMYRIVVQGWDKQAALE
EMQRGGFGDEDDMRDASAYVRGADVDGLRLAMANGECSPSRFALCHVREWMAQALDRP
>Q9HXC7 3.1.3.16~~~tpbA~~~Dual specificity protein phosphatase TpbA~~~
MHRSPLAWLRLLLAAVLGAFLLGGPLHAAETAATRSPAWAQAVDPSINLYRMSPTLYRSALPNAQSVALLQRLQVKTVVS
FIKDDDRAWLGQAPVRVLSLPTHADRVDDAEVLSVLRQLQAAEREGPVLMHCKHGNNRTGLFAAMYRIVVQGWDKQAALE
EMQHGGFGDEDDMRDASAYVRGADVDGLRLAMANGECSPSRFAVCHVREWMAQALDRP
>A0A0H2Z7X0 2.7.7.65~~~tpbB~~~Diguanylate cyclase TpbB~~~
MNRRRRYTGSNPSLRRVLYRAHLGVALVAVFTAGLAVTLVGLLTLRAYADPNQQLIARSISYTVEAAVVFGDAQAAEESL
ALIASSEEVSSAIVYDRQGQPLASWHRESTGPLHLLEQQLAHWLLSAPTEQPILHDGQKIGSVEVKGSGGSLLRFLLTGF
AGMVLCLLLTALGAFYLSRRLVRGIVGPLDQLAKVAHTVRRERDFEKRVPEAGIAELSQLGEDFNALLDELESWQARLQD
ENASLAHQAHHDSLTSLPNRAFFEGRLSRALRDASEHREQLAVLFIDSDRFKEINDRLGHAAGDTVLVNIAMRIRGQLRE
SDLVARLGGDEFAVLLAPLASGADALRIADNIIASMQAPIRLSDGSTVSTSLTIGIALYPEHADTPAALLHDADMAMYIA
KRQARGSRRLAELNDPRILQEEKEIDSATPEAPPK
>Q9I4L5 2.7.7.65~~~tpbB~~~Diguanylate cyclase TpbB~~~
MNRRRRYTGSNPSLRRVLYRAHLGVALVAVFTAGLAVTLVGLLTLRAYADPNQQLIARSISYTVEAAVVFGDAQAAEESL
ALIASSEEVSSAIVYDRQGQTLASWHRESTGPLHLLEQQLAHWLLSAPTEQPILHDGQKIGSVEVKGSGGSLLRFLLTGF
AGMVLCLLLTALGAFYLSRRLVRGIVGPLDQLAKVAHTVRRERDFEKRVPEAGIAELSQLGEDFNALLDELESWQARLQD
ENASLAHQAHHDSLTSLPNRAFFEGRLSRALRDANEHREQLAVLFIDSDRFKEINDRLGHAAGDTVLVNIAMRIRGQLRE
SDLVARLGGDEFAVLLAPLASGADALRIADNIIASMQAPIRLSDGSTVSTSLTIGIALYPEHADTPAALLHDADMAMYIA
KRQARGSRRLAELNDPRILQEEKEIDSATPEAPPK
>Q82RR7 4.2.3.96~~~tpc1~~~Avermitilol synthase~~~COG2124
MPQDIDFGLPAPAGISPGLEATRRHNLGWVRRLGLVGDGPSLAWYTSWDMPRLAACGFPHARGAALDLCADAMAFFFVFD
DQFDGPLGRDPARAARVCRRLTGIVHGAGPGPGADACSAAFADVWARSTDGAHPGWVARTAHEWEYYFAAQAHEAINRLR
GTPGDMESYLQVRRGIAGTDLPLSLGERAAGITVPAAAFHSPQLRIMREAAIDVTLMCNDVYSLEKEEARGDMDNLVLVI
EHARRCTRDEAVTAARGEVARRVIRFEQLAREVPALCAQLGLSAVERAHVDTYLGVMEAWMSGYHAWQTQTRRYTGAPHV
LPSTGPGYFDEVLPT
>Q3C1E3 1.14.12.15~~~tphA2I~~~Terephthalate 1,2-dioxygenase, terminal oxygenase component subunit alpha 1~~~
MQESIIQWHGATNTRVPFGIYTDTANADQEQQRIYRGEVWNYLCLESEIPGAGDFRTTFAGETPIVVVRDADQEIYAFEN
RCAHRGALIALEKSGRTDSFQCVYHAWSYNRQGDLTGVAFEKGVKGQGGMPASFCKEEHGPRKLRVAVFCGLVFGSFSED
VPSIEDYLGPEICERIERVLHKPVEVIGRFTQKLPNNWKLYFENVKDSYHASLLHMFFTTFELNRLSQKGGVIVDESGGH
HVSYSMIDRSAKDDSYKDQAIRSDNERYRLKDPSLLEGFEEFEDGVTLQILSVFPGFVLQQIQNSIAVRQLLPKSISSSE
LNWTYLGYADDSTEQRKVRLKQANLIGPAGFISMEDGAVGGFVQRGIAGAANLDAVIEMGGDHEGSSEGRATETSVRGFW
KAYRKHMGQEMQA
>Q3C1D5 1.14.12.15~~~tphA2II~~~Terephthalate 1,2-dioxygenase, terminal oxygenase component subunit alpha 2~~~
MQESIIQWHGATNTRVPFGIYTDTANADQEQQRIYRGEVWNYLCLESEIPGAGDFRTTFAGETPIVVVRDADQEIYAFEN
RCAHRGALIALEKSGRTDSFQCVYHAWSYNRQGDLTGVAFEKGVKGQGGMPASFCKEEHGPRKLRVAVFCGLVFGSFSED
VPSIEDYLGPEICERIERVLHKPVEVIGRFTQKLPNNWKLYFENVKDSYHASLLHMFFTTFELNRLSQKGGVIVDESGGH
HVSYSMIDRGAKDDSYKDQAIRSDNERYRLKDPSLLEGFEEFEDGVTLQILSVFPGFVLQQIQNSIAVRQLLPKSISSSE
LNWTYLGYADDSAEQRKVRLKQANLIGPAGFISMEDGAVGGFVQRGIAGAANLDAVIEMGGDHEGSSEGRATETSVRGFW
KAYRKHMGQEMQA
>B2KZE7 3.4.13.23~~~tpdA~~~Cysteinylglycine-S-conjugate dipeptidase~~~
MSNDKAATSTNFNLTPNRERIFQELSELISHYSPHSMPEHADTHEEAAKWVTAKLEELGLDVTRHPTVDDADTIIGVKEP
VGDAPTILLYSHYDVVPAQNPAVWTNDPLELDERDGRWYGRGAADCKGNVIMHLEALRMVQENGGTDLGLKVVMEGSEEL
GGEDGLGKLIDANPELFTADVIFIGDGGNVAVGIPTLTTHLRGGAQLRFKVDTLEGPVHSGGWGGAAPDAAHALIRIIDS
FFDEHGRTTIEGVDTTAKWEGDPYDRETFRKDARVLDGVQLLGTVDDEPADMVWARPAITVIGFTSVPVEDATNIVNPTA
EAQFNLRVPAPQSAAEVAKKVEEQIRARAPWGAKVEVSITGVNEPFSTDPNGPAVQHFGKCLQDAYGAEHLTVVGTGGSI
PLTVTLQKHFPDAEFALYGVADPAANIHGVDESVDPTEIEHVAIAEAEFLLTYGK
>Q87NI7 3.1.4.52~~~tpdA~~~Cyclic di-GMP phosphodiesterase TpdA~~~COG5001
MIRFELGNQICVCLSENDISLEINNTLHHAIPVSSNEYAILSTIATYGSLNAPISQRVIERKITQHYKMALPENGFKNAV
AALRKKFRKLTEDHVSPTRNIIENIHRTGYFIPFTMLHTHQSGIYQQKRINQHTKNSVRKALRICLRNKRIYTDIAWVLL
VTTAIFFSVCYYAINSIVKHNYLDSALDIADSLSQMSCYADEDQLKGLFDNVKLVESSMMLDRFNIRCLVTPEAVVPVSQ
KAFNEWSDNSNYTTQSFDINNATILVRVKNINLQNNVESHISRFFLSGMKLYTNTGTSFEIGNTNGRYFHYQIKDTGYKE
VYYISGPLKSIILLSLFFLVILRHRSLQAFITYLFAIREFHIKLEPIYNTSTQQNIHYEALSRFKVKNTQRFIETLISNG
LLLIHTILVIRAIYAKQPTLLVPISINVCPSLLRGRNFSTLYQELASRDCRLLTIEITENASMYYTSEIYDNVAKLKLLN
CKISIDDFGTGNNNVSLISKINPDYLKIDREFVIGLKSDDKKVETLRQLIAMGNTYRCTVIVEGVETADSAHLLTTLGAY
IHQGYFYPLHF
>Q3C1E2 1.14.12.15~~~tphA3I~~~Terephthalate 1,2-dioxygenase, terminal oxygenase component subunit beta 1~~~
MINEIQIAAFNAAYAKTVDSDAMEQWPTFFTKDCHYRVTNVDNHAEGLAAGIVWADSQDMLTDRISALREANIYERHRYR
HILGLPSIQSGDATQASASTPFMVLRIMHTGETEVFASGEYLDKFTTIDGKLRLQERIAVCDSTVTDTLMALPL
>Q3C1D4 1.14.12.15~~~tphA3II~~~Terephthalate 1,2-dioxygenase, terminal oxygenase component subunit beta 2~~~
MINEIQIAAFNAAYAKTIDSDAMEQWPTFFTKDCHYCVTNVDNHDEGLAAGIVWADSQDMLTDRISALREANIYERHRYR
HILGLPSIQSGDATQASASTPFMVLRIMHTGETEVFASGEYLDKFTTIDGKLRLQERIAVCDSTVTDTLMALPL
>Q3C1E0 1.14.12.15~~~tphA1I~~~Terephthalate 1,2-dioxygenase, reductase component 1~~~
MNHQIHIHDSDIAFPCAPGQSVLDAALQAGIELPYSCRKGSCGNCASALLDGNITSFNGMAVRSELCTSEQVLLCGCTAA
SDIRIQPSSFRRLDPEARKRFTAKVYSNTLAAPDVSLLRLRLPVGKRAKFEAGQYLLIHLDDGESRSYSMANPPHESDGI
TLHVRHVPGGRFSTIVQQLKSGDTLEIELPFGSIALKPDDTRPLICVAGGTGFAPIKSVLDDLAKRKVQRDITLIWGARN
PSGLYLPSAIDKWRKTWPQFRYIAAITDLGNVPADAHAGRVDDALRTHFGNLHDHVVHCCGSPSLVQSVRTAASDMGLLA
QNFHADVFATSPTGSH
>Q3C1D2 1.14.12.15~~~tphA1II~~~Terephthalate 1,2-dioxygenase, reductase component 2~~~
MNHQIHIHDSDIAFPCAPGQSVLDAALQAGIELPYSCRKGSCGNCASTLLDGNIASFNGMAVRNELCASEQVLLCGCTAA
SDIRIHPSSFRRLDPEARKRFTAKVYSNTLAAPDVSLLRLRLPVGKRAKFEAGQYLLIHLDDGESRSYSMANPPHESDGI
TLHVRHVPGGRFSTIVQQLKSGDTLDIELPFGSIALKPDDARPLICVAGGTGFAPIKSVLDDLAKRKVQRDITLIWGARN
PSGLYLPSAIDKWRKVWPQFRYIAAITDLGDMPADAHAGRVDDALRTHFGNLHDHVVHCCGSPALVQSVRTAASDMGLLA
QDFHADVFATGPTGHH
>P07383 3.1.1.10~~~~~~Tropinesterase~~~
EIIPVPDQAAWNASKKSIQINDAIKMRYVEWGNPSGDPVLLLHGYTDTSRAFSSLAPFLSKDKRYLALDLRGHGGTSIPK
CCYYVSDFAEDVSDFIDKMGLHNTTVIGHSMGSMTAGVLASIHPDKVSRLVLISTALKTGPVLEWVYDTVLQKDFPLDDP
SEFAKEWVAAPGKHDNGMAKNLKTEELAVPKHVWLSAARGFSIINWTAASKYLTAKTLILWGNQNQPMTESMQNDIRAAL
PKAKFIQYNGFGHSMFWEDPEMVAKDLNEFLK
>P16665 ~~~tpf1~~~Antigen TpF1~~~COG0783
MNMCTDGKKYHSTATSAAVGASAPGVPDARAIAAICEQLRQHVADLGVLYIKLHNYHWHIYGIEFKQVHELLEEYYVSVT
EAFDTIAERLLQLGAQAPASMAEYLALSGIAEETEKEITIVSALARVKRDFEYLSTRFSQTQVLAAESGDAVTDGIITDI
LRTLGKAIWMLGATLKA
>A0A160PB22 ~~~tpgA~~~Trimeric autotransporter adhesin- and peptidogylcan-associated protein A~~~
MKAFNKKIMFGVFSGLVMSLSHAAEVESANTQEIHFPEIKDSYLKQVNRYEYDDVARLDKGLTKDQIRHILGNPQFSEGL
FAVKTWNYVLDIREPNSNQYKRCQLRIDFDKQYRSDNLYWKGEQCQGLMAWGINNQSETEQTTLAPGGQSASVLFYFDHA
DKNGVKNAEVIRKIADQIKQSDANSPVFVAGYTDRLGSFQYNQRLSAQRANTVVELLKQQGIRGEQIQYSAENKTDVYQK
CAGINKKIQLVECLAPNRRVNITW
>Q3C1E1 1.3.1.53~~~tphBI~~~1,2-dihydroxy-3,5-cyclohexadiene-1,4-dicarboxylate dehydrogenase~~~
MTIVHRRLALAIGDPHGIGPEIALKALRQLSANERSLIKVYGPWSALEQAAQICQMESLLQDLIHEEAGSLAQPAQWGEI
TPQAGLSTVQSATAAIRACENGEVDAVIACPHHETAIHRAGIAFSGYPSLLANVLGMNEDQVFLMLVGAGLRIVHVTLHE
SVRSALERLSPQLVVNAVQAAVQTCTLLGVPKPQVAVFGINPHASEGQLFGLEDSQITAPAVETLRKCGLAVDGPMGADM
VLAQRKHDLYVAMLHDQGHIPIKLLAPNGASALSIGGRVVLSSVGHGSAMDIAGRGVADSTALLRTIALLGAQPG
>Q5D0X4 1.3.1.53~~~tphB~~~1,2-dihydroxy-3,5-cyclohexadiene-1,4-dicarboxylate dehydrogenase~~~
MTIVHRRLALAIGDPHGIGPEIALKALQQLSATERSLIKVYGPWSALEQAAQICQMESLLQDLIHEEAGSLAQPAQWGEI
TPQAGLSTVQSATAAIRACENGEVDAVIACPHHETAIHRAGIAFSGYPSLLANVLGMNEDEVFLMLVGAGLRIVHVTLHE
SVRSALERLSPQLVVDAVDAAVQTCTLLGVPKPQVAVFGINPHASEGQLFGLEDSQITAPAVETLRKCGLAVDGPMGADM
VLAQRKHDLYVAMLHDQGHIPIKLLAPNGASALSIGGRVVLSSVGHGSAMDIAGRGVADSTALLRTIALLGAQPG
>Q92EU4 5.3.1.1~~~tpi-2~~~Triosephosphate isomerase 2~~~COG0149
MRKPLVGINMKNYINTRAQTSEWLEATIPLLENFSDVDTFIFPSMGTLETTANLLAGTSFGFGPQNMAPEKSGPLTGEFS
VESIIDLNANYVEIGHAERKNLFHEKTSEIAKKIKLALDEKITPVVCVGEEVHANDTNELKNALKKQIEALFQTINLAQW
ENVVLAYEPEWAIGKASSAETNYIESAHQALREIIRELGGDETLVRIIYGGSVSKENAAEIVRQKNVDGLFVGRFGHKPQ
NFADIVSIVSKTKG
>Q4MQ55 5.3.1.1~~~tpiA~~~Triosephosphate isomerase~~~COG0149
MRKPIIAGNWKMNKTLSEAVSFVEEVKGQIPAASAVDAVVCSPALFLERLVAATEGTDLQVGAQNMHFEKNGAFTGEISP
VALSDLKVGYVVLGHSERREMFAETDESVNKKTIAAFEHGLTPIVCCGETLEERESGKTFDLVAGQVTKALAGLTEEQVK
ATVIAYEPIWAIGTGKSSSSADANEVCAHIRKVVAEAVSPEAAEAVRIQYGGSVKPENIKEYMAQSDIDGALVGGASLEP
ASFLGLLGAVK
>P27876 5.3.1.1~~~tpiA~~~Triosephosphate isomerase~~~COG0149
MRKPIIAGNWKMNKTLGEAVSFVEEVKSSIPAADKAEAVVCAPALFLEKLASAVKGTDLKVGAQNMHFEESGAFTGEISP
VALKDLGVDYCVIGHSERREMFAETDETVNKKAHAAFKHGIVPIICVGETLEEREAGKTNDLVADQVKKGLAGLSEEQVA
ASVIAYEPIWAIGTGKSSTAKDANDVCAHIRKTVAESFSQEAADKLRIQYGGSVKPANIKEYMAESDIDGALVGGASLEP
QSFVQLLEEGQYE
>Q8A0U2 5.3.1.1~~~tpiA~~~Triosephosphate isomerase~~~COG0149
MRKNIVAGNWKMNKTLQEGIALAKELNEALANEKPNCDVIICTPFIHLASVTPLVDAAKIGVGAENCADKASGAYTGEVS
AEMVASTGAKYVILGHSERRAYYGETVAILEEKVKLALANGLTPIFCIGEVLEEREANKQNEVVAAQMESVFSLSAEDFS
KIILAYEPVWAIGTGKTASPEQAQEIHAFIRSIVADKYGKEIADNTSILYGGSCKPSNAKELFSNPDVDGGLIGGAALKV
SDFKGIIDAFNA
>Q8L1Z5 5.3.1.1~~~tpiA~~~Triosephosphate isomerase~~~COG0149
MSPNIRPFIAGNWKMNGTGESLGELRAIAAGISSDLGRLFEALICVPATLLSRAFDILGGENILLGGQNCHFDDYGPYTG
DISAFMLKEAGASHVIIGHSERRTVYQESDAIVRAKVQAAWRAGLVALICVGETLEERKSNKVLDVLTRQLEGSLPDGAT
AENIIIAYEPVWAVGTGNTATSADVAEVHAFIHHKMHSRFGDEGAKIRLLYGGSVKPSNAFELLSTAHVNGALIGGASLK
AIDFLTICDVYRKL
>Q8XKU1 5.3.1.1~~~tpiA~~~Triosephosphate isomerase~~~
MRTPIIAGNWKMHYTIDEAVKLVEELKPLVKDAKCEVVVCPTFVCLDAVKKAVEGTNIKVGAQNMHFEEKGAFTGEIAPR
MLEAMNIDYVIIGHSERREYFNETDETCNKKVKAAFAHNLTPILCCGETLEQRENGTTNDVIKAQITADLEGLTKEQAEK
VVIAYEPIWAIGTGKTATSDQANETIAAIRAMVAEMFGQEVADKVRIQYGGSVKPNTIAEQMAKSDIDGALVGGASLVAA
DFAQIVNY
>Q9RUP5 5.3.1.1~~~tpiA~~~Triosephosphate isomerase~~~COG0149
MQTLLALNWKMNKTPTEARSWAEELTTKYAPAEGVDLAVLAPALDLSALAANLPAGIAFGGQDVSAHESGAYTGEISAAM
LKDAGASCVVVGHSERREYHDESDAXVAAKARQAQANGLLPIVCVGENLDVRERGEHVPQTLAQLRGSLEGVGADVVVAY
EPVWAIGTGKTATADDAEELAAAIRGALREQYGARAEGIRVLYGGSVKPENIAEICGKPNVNGALVGGASLKVPDVLGML
DALR
>B1XB85 5.3.1.1~~~tpiA~~~Triosephosphate isomerase~~~
MRHPLVMGNWKLNGSRHMVHELVSNLRKELAGVAGCAVAIAPPEMYIDMAKREAEGSHIMLGAQNVDLNLSGAFTGETSA
AMLKDIGAQYIIIGHSERRTYHKESDELIAKKFAVLKEQGLTPVLCIGETEAENEAGKTEEVCARQIDAVLKTQGAAAFE
GAVIAYEPVWAIGTGKSATPAQAQAVHKFIRDHIAKVDANIAEQVIIQYGGSVNASNAAELFAQPDIDGALVGGASLKAD
AFAVIVKAAEAAKQA
>P0A858 5.3.1.1~~~tpiA~~~Triosephosphate isomerase~~~COG0149
MRHPLVMGNWKLNGSRHMVHELVSNLRKELAGVAGCAVAIAPPEMYIDMAKREAEGSHIMLGAQNVDLNLSGAFTGETSA
AMLKDIGAQYIIIGHSERRTYHKESDELIAKKFAVLKEQGLTPVLCIGETEAENEAGKTEEVCARQIDAVLKTQGAAAFE
GAVIAYEPVWAIGTGKSATPAQAQAVHKFIRDHIAKVDANIAEQVIIQYGGSVNASNAAELFAQPDIDGALVGGASLKAD
AFAVIVKAAEAAKQA
>Q5NII7 5.3.1.1~~~tpiA~~~Triosephosphate isomerase~~~COG0149
MQKLIMGNWKMNGNSTSIKELCSGISQVQYDTSRVAIAVFPSSVYVKEVISQLPEKVGVGLQNITFYDDGAYTGEISARM
LEDIGCDYLLIGHSERRSLFAESDEDVFKKLNKIIDTTITPVVCIGESLDDRQSGKLKQVLATQLSLILENLSVEQLAKV
VIAYEPVWAIGTGVVASLEQIQETHQFIRSLLAKVDERLAKNIKIVYGGSLKAENAKDILSLPDVDGGLIGGASLKAAEF
NEIINQANKICTE
>P00943 5.3.1.1~~~tpiA~~~Triosephosphate isomerase~~~
MRKPIIAGNWKMHKTLAEAVQFVEDVKGHVPPADEVISVVCAPFLFLDRLVQAADGTDLKIGAQTMHFADQGAYTGEVSP
VMLKDLGVTYVILGHSERRQMFAETDETVNKKVLAAFTRGLIPIICCGESLEEREAGQTNAVVASQVEKALAGLTPEQVK
QAVIAYEPIWAIGTGKSSTPEDANSVCGHIRSVVSRLFGPEAAEAIRIQYGGSVKPDNIRDFLAQQQIDGPLVGGASLEP
ASFLQLVEAGRHE
>P43727 5.3.1.1~~~tpiA~~~Triosephosphate isomerase~~~COG0149
MARRPLVMGNWKLNGSKAFTKELIEGLKAELHDVTGCDVAIAPPVMYLGTAEAALSGCGCSCGGKSVIQLGAQNVDINVK
GAFTGDISTEMLKDFGAKYIIIGHSERRTYHKESDEFVAKKFGALKEAGLVPVLCIGESEAENEAGKTEEVCARQIDAVI
NALGVEAFNGAVIAYEPIWAIGTGKSATPAQAQAVHAFIRGHIAAKSQAVAEQVIIQYGGSVNDANAAELFTQPDIDGAL
VGGASLKAPAFAVIVKAAAAAKN
>P56076 5.3.1.1~~~tpiA~~~Triosephosphate isomerase~~~COG0149
MTKIAMANFKSAMPIFKSHAYLKELEKTLKPQHFDRVFVFPDFFGLLPNSFLHFTLGVQNAYPRDCGAFTGEITSKHLEE
LKIHTLLIGHSERRTLLKESPSFLKEKFDFFKSKNFKIVYCIGEELTTREKGFKAVKEFLSEQLENIDLNYPNLVVAYEP
IWAIGTKKSASLEDIYLTHGFLKQILNQKTPLLYGGSVNTQNAKEILGIDSVDGLLIGSASWELENFKTIISFL
>Q93GB7 5.3.1.1~~~tpiA~~~Triosephosphate isomerase~~~
MSRTPIIAGNWKLNMNPKETVEFVNAVKDQLPDPSKVESVICAPAVDLDALLKAAEGSNLHVGAENCYWENSGAFTGETS
PAVLKEMGVQYVIIGHSERRDYFHETDEDINKKAKAIFANGLTPILCCGESLEIREAGKEKEWVVSQIKADLEGLTSEQV
SKLVIAYEPIWAIGTGKTASSDQAEEMCKTIRETVKDLYNEETAENVRIQYGGSVKPANIKELMAKPNIDGGLVGGASLV
PDSYLALVNYQD
>P50918 5.3.1.1~~~tpiA~~~Triosephosphate isomerase~~~COG0149
MSRKPIIAGNWKMNKTLSEAQAFVEAVKNNLPSSDNVESVIGAPALFLAPMAYLRQGSELKLAAENSYFENAGAFTGENS
PAAIVDLGVEYIIIGHSERREYFHETDEDINKKAKAIFAAGATPILCCGETLETFEAGKTAEWVSGQIEAGLAGLSAEQV
SNLVIAYEPIWAIGTGKTATNEIADETCGVVRSTVEKLYGKEVSEAVRIQYGGSVKPETIEGLMAKENIDGALVGGASLE
ADSFLALLEMYK
>Q8F5I5 5.3.1.1~~~tpiA~~~Triosephosphate isomerase~~~
MRKTIIAGNWKMNLSLKEAVFLAHSIREKIPSISKDKVSMVFPSTLHLENVSKILEGSSVIVGAQNCYHSGLAAFTGETS
PDQLKEIGVKVVMVGHSERRQFLGESNFFCNDKIRFLLKNEFTVLYCVGETLSERESGKTLEVLSSQIREGLKGIDSVFF
SNLILAYEPVWAIGTGKVATPSQAQEVHSFIRKEISGLFVGASSISESISILYGGSVKPDNIQDLLKEKDIDGGLVGGAS
QKISSFAELF
>P50921 5.3.1.1~~~tpiA~~~Triosephosphate isomerase~~~
MRHPVVMGNWKLNGSKEMVVDLLNGLNAELEGVTGVDVAVAPPALFVDLAERTLTEAGSAIILGAQNTDLNNSGAFTGDM
SPAMLKEFGATHIIIGHSERREYHAESDEFVAKKFAFLKENGLTPVLCIGESDAQNEAGETMAVCARQLDAVINTQGVEA
LEGAIIAYEPIWAIGTGKAATAEDAQRIHAQIRAHIAEKSEAVAKNVVIQYGGSVKPENAAAYFAQPDIDGALVGGAALD
AKSFAAIAKAAAEAKA
>P9WG43 5.3.1.1~~~tpiA~~~Triosephosphate isomerase~~~COG0149
MSRKPLIAGNWKMNLNHYEAIALVQKIAFSLPDKYYDRVDVAVIPPFTDLRSVQTLVDGDKLRLTYGAQDLSPHDSGAYT
GDVSGAFLAKLGCSYVVVGHSERRTYHNEDDALVAAKAATALKHGLTPIVCIGEHLDVREAGNHVAHNIEQLRGSLAGLL
AEQIGSVVIAYEPVWAIGTGRVASAADAQEVCAAIRKELASLASPRIADTVRVLYGGSVNAKNVGDIVAQDDVDGGLVGG
ASLDGEHFATLAAIAAGGPLP
>P99133 5.3.1.1~~~tpiA~~~Triosephosphate isomerase~~~
MRTPIIAGNWKMNKTVQEAKDFVNALPTLPDSKEVESVICAPAIQLDALTTAVKEGKAQGLEIGAQNTYFEDNGAFTGET
SPVALADLGVKYVVIGHSERRELFHETDEEINKKAHAIFKHGMTPIICVGETDEERESGKANDVVGEQVKKAVAGLSEDQ
LKSVVIAYEPIWAIGTGKSSTSEDANEMCAFVRQTIADLSSKEVSEATRIQYGGSVKPNNIKEYMAQTDIDGALVGGASL
KVEDFVQLLEGAK
>Q6GIL6 5.3.1.1~~~tpiA~~~Triosephosphate isomerase~~~
MRTPIIAGNWKMNKTVQEAKDFVNALPTLPDSKEVESVICAPAIQLDALTTAVKEGKAQGLEIGAQNTYFEDNGAFTGET
SPVALADLGVKYVVIGHSERRELFHETDEEINKKAHAIFKHGMTPIICVGETDEERESGKANDVVGEQVKKAVAGLSEDQ
LKSVVIAYEPIWAIGTGKSSTSEDANEMCAFVRQTIADLSSKEVSEATRIQYGGSVKPNNIKEYMAQTDIDGALVGGASL
KVEDFVQLLEGAK
>Q9Z520 5.3.1.1~~~tpiA~~~Triosephosphate isomerase~~~COG0149
MTTRTPLMAGNWKMNLNHLEAIAHVQKLAFALADKDYDAVEVAVLAPFTDLRSVQTLVDGDKLKIKYGAQDISAHDGGAY
TGEISGPMLAKLKCTYVAVGHSERRQYHAETDEIVNAKVKAAYKHGLTPILCVGEELDVREAGNHVEHTLAQVEGGLKDL
AAEQAESVVIAYEPVWAIGTGKVCGADDAQEVCAAIRGKLAELYSQELADKVRIQYGGSVKSGNVAEIMAKPDIDGALVG
GASLDSDEFVKIVRFRDQ
>B2FNY1 5.3.1.1~~~tpiA~~~Triosephosphate isomerase~~~COG0149
MRRKIVAGNWKLHGSRQFANELLGQVAAGLPLEGVDVVILPPLPYLGELVEDFGETGLAFGAQDVSSNEKGAYTGEVCAA
MLVEVGARYGLVGHSERRQYHHESSELVARKFAAAQHAGLVPVLCVGETLEQREAGQTEAVIASQLAPVLELVGAAGFAQ
AVVAYEPVWAIGTGRTATKEQAQQVHAFIRGEVARIDARIADSLPIVYGGSVKPDNAGELFAQPDVDGGLVGGASLVAAD
FLAIARAAAAN
>Q04JH4 5.3.1.1~~~tpiA~~~Triosephosphate isomerase~~~COG0149
MSRKPFIAGNWKMNKNPEEAKAFVEAVASKLPSSDLVEAGIAAPALDLTTVLAVAKGSNLKVAAQNCYFENAGAFTGETS
PQVLKEIGTDYVVIGHSERRDYFHETDEDINKKAKAIFANGMLPIICCGESLETYEAGKAAEFVGAQVSAALAGLTAEQV
AASVIAYEPIWAIGTGKSASQDDAQKMCKVVRDVVAADFGQEVADKVRVQYGGSVKPENVASYMACPDVDGALVGGASLE
AESFLALLDFVK
>Q5XD48 5.3.1.1~~~tpiA~~~Triosephosphate isomerase~~~
MSRKPIIAGNWKMNKNPQEARAFVEAVASKLPSTDLVDVAVAAPAVDLVTTIEAAKDSVLKVAAQNCYFENTGAFTGETS
PKVLAEMGADYVVIGHSERRDYFHETDEDINKKAKAIFANGLTPIVCCGESLETYEAGKAVEFVGAQVSAALAGLSAEQV
ASLVLAYEPIWAIGTGKSATQDDAQNMCKAVRDVVAADFGQEVADKVRVQYGGSVKPENVKDYMACPDVDGALVGGASLE
AGSFLALLDFLN
>Q59994 5.3.1.1~~~tpiA~~~Triosephosphate isomerase~~~COG0149
MRKIIIAGNWKMHKTQAEAQAFLQGFKPLIEDAAESREVVLCVPFTDLSGMSQQLHGGRVRLGAQNVHWEASGAYTGEIS
AAMLTEIGIHYVVIGHSERRQYFGETDETANLRVLAAQKAGLIPILCVGESKAQRDAGETEQVIVDQVKKGLVNVDQSNL
VIAYEPIWAIGTGDTCAATEANRVIGLIREQLTNSQVTIQYGGSVNANNVDEIMAQPEIDGALVGGASLEPQSFARIVNF
QP
>Q5SJR1 5.3.1.1~~~tpiA~~~Triosephosphate isomerase~~~COG0149
MRRVLVAGNWKMHKTPSEARVWFAELKRLLPPLQSEAAVLPAFPILPVAKEVLAETQVGYGAQDVSAHKEGAYTGEVSAR
MLSDLGCRYAIVGHSERRRYHGETDALVAEKAKRLLEEGITPILCVGEPLEVREKGEAVPYTLRQLRGSLEGVEPPGPEA
LVIAYEPVWAIGTGKNATPEDAEAMHQAIRKALSERYGEAFASRVRILYGGSVNPKNFADLLSMPNVDGGLVGGASLELE
SFLALLRIAG
>P31013 4.1.99.2~~~tpl~~~Tyrosine phenol-lyase~~~
MNYPAEPFRIKSVETVSMIPRDERLKKMQEAGYNTFLLNSKDIYIDLLTDSGTNAMSDKQWAGMMMGDEAYAGSENFYHL
ERTVQELFGFKHIVPTHQGRGAENLLSQLAIKPGQYVAGNMYFTTTRYHQEKNGAVFVDIVRDEAHDAGLNIAFKGDIDL
KKLQKLIDEKGAENIAYICLAVTVNLAGGQPVSMANMRAVRELTEAHGIKVFYDATRCVENAYFIKEQEQGFENKSIAEI
VHEMFSYADGCTMSGKKDCLVNIGGFLCMNDDEMFSSAKELVVVYEGMPSYGGLAGRDMEAMAIGLREAMQYEYIEHRVK
QVRYLGDKLKAAGVPIVEPVGGHAVFLDARRFCEHLTQDEFPAQSLAASIYVETGVRSMERGIISAGRNNVTGEHHRPKL
ETVRLTIPRRVYTYAHMDVVADGIIKLYQHKEDIRGLKFIYEPKQLRFFTARFDYI
>P31011 4.1.99.2~~~tpl~~~Tyrosine phenol-lyase~~~
MNYPAEPFRIKSVETVSMISRDERVKKMQEAGYNTFLLNSKDIYIDLLTDSGTNAMSDKQWAGMMIGDEAYAGSENFYHL
EKTVKELFGFKHIVPTHQGRGAENLLSQLAIKPGQYVAGNMYFTTTRFHQEKNGATFVDIVRDEAHDASLNLPFKGDIDL
NKLATLIKEKGAENIAYICLAVTVNLAGGQPVSMANMRAVHEMASTYGIKIFYDATRCVENAYFIKEQEAGYENVSIKDI
VHEMFSYADGCTMSGKKDCLVNIGGFLCMNDEEMFSAAKELVVVYEGMPSYGGLAGRDMEAMAIGLREAMQYEYIEHRVK
QVRYLGDKLREAGVPIVEPTGGHAVFLDARRFCPHLTQDQFPAQSLAASIYMETGVRSMERGIVSAGRSKETGENHRPKL
ETVRLTIPRRVYTYAHMDVVADGIIKLYQHKEDIRGLTFVYEPKQLRFFTARFDFI
>Q08897 4.1.99.2~~~tpl~~~Tyrosine phenol-lyase~~~COG3033
MQRPWAEPYKIKAVEPIRMTTREYREQAIREAGYNTFLLRSEDVYIDLLTDSGTNAMSDRQWGALMMGDEAYAGARSFFR
LEEAVREIYGFKYVVPTHQGRGAEHLISRILIKPGDYIPGNMYFTTTRTHQELQGGTFVDVIIDEAHDPQASHPFKGNVD
IAKFEALIDRVGADKIPYINVALTVNMAGGQPVSMANLREVRKVCDRHGIRMWSDATRAVENAYFIKEREEGYQDKPVRE
ILKEMMSYFDGCTMSGKKDCLVNIGGFLAMNEEWILQKAREQVVIFEGMPTYGGLAGRDMEAIAQGIYEMVDDDYIAHRI
HQVRYLGEQLLEAGIPIVQPIGGHAVFLDARAFLPHIPQDQFPAQALAAALYVDSGVRAMERGIVSAGRNPQTGEHNYPK
LELVRLTIPRRVYTDRHMDVVAYSVKHLWKERDTIRGLRMVYEPPTLRFFTARFEPIS
>O86262 2.1.1.67~~~tpm~~~Thiopurine S-methyltransferase~~~
MKADFWLQRWSAGQIGFHQSEVNKDLQQYWSSLNVVPGARVLVPLCGKSQDMSWLSGQGYHVVGAELSEAAVERYFTERG
EQPHITSQGDFKVYAAPGIEIWCGDFFALTARDIGHCAAFYDRAAMIALPADMRERYVQHLEALMPQACSGLLITLEYDQ
ALLEGPPFSVPQTWLHRVMSGNWEVTKVGGQDTLHSSARGLKAGLERMDEHVYVLERV
>O07950 ~~~~~~Membrane lipoprotein TpN32~~~COG1464
MKGKTVSAALVGKLIALSVGVVACTQVKDETVGVGVLSEPHARLLEIAKEEVKKQHIELRIVEFTNYVALNEAVMRGDIL
MNFFQHVPHMQQFNQEHNGDLVSVGNVHVEPLALYSRTYRHVSDFPAGAVIAIPNDSSNEARALRLLEAAGFIRMRAGSG
LFATVEDVQQNVRNVVLQEVESALLPRVFDQVDGAVINGNYAIMAGLSARRDGLAVEPDASAYANVLVVKRGNEADARVQ
AVLRALCGGRVRTYLKERYKGGEVAPAL
>O66780 1.11.1.24~~~tpx~~~Thiol peroxidase~~~COG2077
MARTVNLKGNPVTLVGPELKVGDRAPEAVVVTKDLQEKIVGGAKDVVQVIITVPSLDTPVCETETKKFNEIMAGMEGVDV
TVVSMDLPFAQKRFCESFNIQNVTVASDFRYRDMEKYGVLIGEGALKGILARAVFIIDKEGKVAYVQLVPEITEEPNYDE
VVNKVKELI
>P80864 1.11.1.24~~~tpx~~~Thiol peroxidase~~~COG2077
MAEITFKGGPVTLVGQEVKVGDQAPDFTVLTNSLEEKSLADMKGKVTIISVIPSIDTGVCDAQTRRFNEEAAKLGDVNVY
TISADLPFAQARWCGANGIDKVETLSDHRDMSFGEAFGVYIKELRLLARSVFVLDENGKVVYAEYVSEATNHPNYEKPIE
AAKALVK
>P0A862 1.11.1.24~~~tpx~~~Thiol peroxidase~~~COG2077
MSQTVHFQGNPVTVANSIPQAGSKAQTFTLVAKDLSDVTLGQFAGKRKVLNIFPSIDTGVCAASVRKFNQLATEIDNTVV
LCISADLPFAQSRFCGAEGLNNVITLSTFRNAEFLQAYGVAIADGPLKGLAARAVVVIDENDNVIFSQLVDEITTEPDYE
AALAVLKA
>Q57549 1.11.1.24~~~tpx~~~Thiol peroxidase~~~COG2077
MTVTLAGNPIEVGGHFPQVGEIVENFILVGNDLADVALNDFASKRKVLNIFPSIDTGVCATSVRKFNQQAAKLSNTIVLC
ISADLPFAQARFCGAEGIENAKTVSTFRNHALHSQLGVDIQTGPLAGLTSRAVIVLDEQNNVLHSQLVEEIKEEPNYEAA
LAVLA
>O25151 1.11.1.24~~~tpx~~~Thiol peroxidase~~~COG2077
MQKVTFKEETYQLEGKALKVGDKAPDVKLVNGDLQEVNLLKQGVRFQVVSALPSLTGSVCLLQAKHFNEQTGKLPSVSFS
VISMDLPFSQGQICGAEGIKDLRILSDFRYKAFGENYGVLLGKGSLQGLLARSVFVLDDKGVVIYKEIVQNILEEPNYEA
LLKVLK
>P66953 1.11.1.24~~~tpx~~~Thiol peroxidase~~~
MAQITLRGNAINTVGELPAVGSPAPAFTLTGGDLGVISSDQFRGKSVLLNIFPSVDTPVCATSVRTFDERAAASGATVLC
VSKDLPFAQKRFCGAEGTENVMPASAFRDSFGEDYGVTIADGPMAGLLARAIVVIGADGNVAYTELVPEIAQEPNYEAAL
AALGA
>P9WG35 1.11.1.24~~~tpx~~~Thiol peroxidase~~~COG2077
MAQITLRGNAINTVGELPAVGSPAPAFTLTGGDLGVISSDQFRGKSVLLNIFPSVDTPVCATSVRTFDERAAASGATVLC
VSKDLPFAQKRFCGAEGTENVMPASAFRDSFGEDYGVTIADGPMAGLLARAIVVIGADGNVAYTELVPEIAQEPNYEAAL
AALGA
>P99146 1.11.1.24~~~tpx~~~Thiol peroxidase~~~
MTEITFKGGPIHLKGQQINEGDFAPDFTVLDNDLNQVTLADYAGKKKLISVVPSIDTGVCDQQTRKFNSEASKEEGIVLT
ISADLPFAQKRWCASAGLDNVITLSDHRDLSFGENYGVVMEELRLLARAVFVLDVDNKVVYKEIVSEGTDFPDFDAALAA
YKNI
>P0C2J8 1.11.1.24~~~tpx~~~Thiol peroxidase~~~COG2077
MVTFLGNPVSFTGKQLQVGDKALDFSLTTTDLSKKSLADFDGKKKVLSVVPSIDTGICSTQTRRFNEELAGLDNTVVLTV
SMDLPFAQKRWCGAEGLDNAIMLSDYFDHSFGRDYALLINEWHLLARAVFVLDTDNTIRYVEYVDNINSEPNFEAAIAAA
KAL
>P31308 1.11.1.24~~~tpx~~~Thiol peroxidase~~~
MTTFLGNPVTFTGKQLQVGDTAHDFSLTATDLSKKTLADFAGKKKVLSIIPSIDTGVCSTQTRRFNQELSDLDNTVVITV
SVDLPFAQGKWCAAEGIENAVMLSDYFDHSFGRDYAVLINEWHLLARAVLVLDENNTVTYAEYVDNINTEPDYDAAIAAV
KSL
>P0AFS5 ~~~tqsA~~~AI-2 transport protein TqsA~~~COG0628
MAKPIITLNGLKIVIMLGMLVIILCGIRFAAEIIVPFILALFIAVILNPLVQHMVRWRVPRVLAVSILMTIIVMAMVLLL
AYLGSALNELTRTLPQYRNSIMTPLQALEPLLQRVGIDVSVDQLAHYIDPNAAMTLLTNLLTQLSNAMSSIFLLLLTVLF
MLLEVPQLPGKFQQMMARPVEGMAAIQRAIDSVSHYLVLKTAISIITGLVAWAMLAALDVRFAFVWGLLAFALNYIPNIG
SVLAAIPPIAQVLVFNGFYEALLVLAGYLLINLVFGNILEPRIMGRGLGLSTLVVFLSLIFWGWLLGPVGMLLSVPLTII
VKIALEQTAGGQSIAVLLSDLNKE
>P06617 1.13.12.3~~~iaaM~~~Tryptophan 2-monooxygenase~~~
MYDHFNSPSIDILYDYGPFLKKCEMTGGIGSYSAGTPTPRVAIVGAGISGLVAATELLRAGVKDVVLYESRDRIGGRVWS
QVFDQTRPRYIAEMGAMRFPPSATGLFHYLKKFGISTSTTFPDPGVVDTELHYRGKRYHWPAGKKPPELFRRVYEGWQSL
LSEGYLLEGGSLVAPLDITAMLKSGRLEEAAIAWQGWLNVFRDCSFYNAIVCIFTGRHPPGGDRWARPEDFELFGSLGIG
SGGFLPVFQAGFTEILRMVINGYQSDQRLIPDGISSLAARLADQSFDGKALRDRVCFSRVGRISREAEKIIIQTEAGEQR
VFDRVIVTSSNRAMQMIHCLTDSESFLSRDVARAVRETHLTGSSKLFILTRTKFWIKNKLPTTIQSDGLVRGVYCLDYQP
DEPEGHGVVLLSYTWEDDAQKMLAMPDKKTRCQVLVDDLAAIHPTFASYLLPVDGDYERYVLHHDWLTDPHSAGAFKLNY
PGEDVYSQRLFFQPMTANSPNKDTGLYLAGCSCSFAGGWIEGAVQTALNSACAVLRSTGGQLSKGNPLDCINASYRY
>Q45618 ~~~~~~Putative transposase for insertion sequence element IS5376~~~
MITRGEFFMIKEMYERGMSISDIARELGIDRKTVRKYIHSPNPPSKSKRKQRKSKLDPFKPYLQKRMLEDGVFNSEKLFF
EIRQQGYTGGKTILKDYMKPFRETAKKKYTVRYETLPGEQMQVDWKEVGEVVIEGKKVKLSLFVATLGYSRMKYAVFTTS
QDQEHLMECLIQSFKYFGGVPKKVLFDNMKTVTDGREQGVVKWNQRFSEFASYYGFIPKVCRPYRAQTKGKVERAIQYIM
DHFYVGTAFESIEELNFLLHRWLDQVANRKPNATTGISPQERWAEESLKPLPLKDYDTSYLSYRKVHWDGSFSYKGEQWL
LSAEYAGKEILVKERLNGDIRLYFRGEEISHVDQQKKVISFAEKIKKKQTEMAATISPVSVEVDTRPLSVYDAFLRGESS
>P27188 ~~~traB~~~Protein TraB~~~
MNKVQIGAPRTSASPGVIMKPEGSVKIAVMNGSRQVDQVVNGEWLTMKVLPEAGLPKGIHQLSDAKDASKNVHPHKHVGQ
VLHDDGRNVYQFSEGGIVKHSRGIFEKPPVVGKNYEIAYSRGQGKVIGEVSQEQAAKAEQKRSRSI
>P27189 2.7.7.-~~~traC~~~DNA primase TraC~~~
MAEVKKPFHEQVAERLIEQLKAGTAPWQKPWEPGMPGSFIPLNPTTGKRYKGINAIQLMAQGHADPRWMTYKQAAAAGAQ
VRRGEKGTPIQYWKFSEEQTKTDEQTGKPVLDANGDPVKVTVQLERPRVFFATVFNAEQIDGLPPLERKEQTWSAVERAE
HILAASGATIRHGEHDRAFYRPSTDSIHLPDKGQFPSADNYYATALHELGHWTGHPSRLDRDLAHPFGSEGYAKEELRAE
IASMILGDELGIGHDPGQHAAYVGSWIKALQEDPLEIFRAAADAEKIQDFVLAFEQKQIQEQTTQQAIEPAQGATMEQQQ
DQVARPAIAPADELIAQTLRMYRAGAEPAEGNQSLAALTETTLGFELPADWTGRVQVQANVEVEHDGERSVVPAGDREPE
FWGVYANHAWGGHQWLADFAGPDAQTNAEALADRLAVIDAYATANEYEQAAKFARIHEERVRRDPNSTDEDRVAAKEARK
AAEGTAMLHDEDLQRRIADYEREQQEMAQAMNAAEQPAAAQAPAKPERAYLNVPFKEKDEVKALGARWDRQERAWYVPAG
VDPAPFAKWAREGATAAVEARAEAQPTQPTAERPNAAQERVYLAVPYGERQVAKAAGAQWDKVAKSWYAGPNADMGKLQR
WLPDNVPTQQSPAVTPEDEFAEALKSMGCVVTPGGEHPIMDGKKHRIETEGDKKGEKSGFYVGHLDGHPAGYIKNNRTGV
EMKWKAKGYALDPAEKAKMQAEAAAKLAARAEEQERQHEATAQRIGRQAQSLVPITEPTPYLRDKGLQVHAGVLTDQEGQ
KTYIPAYDADGKQWTMQYIQEDGTKRFAKDSRKEGCFHVVGGMDALAAAPALVIGEGYATAATVAEALGHATVAAFDSGN
LQAVAEALHAKFPDKPVVIAGDDDRQVQITQGVNPGRTKAQEAAKAVGGKAIFPIFAPGENAYPKELPPITPENYRNHLH
AEKRLADAAAGKVQLSEADTAKLKESLLNDGQLAALSNMKKHTDFNDLSERSSLGKDGVERQVRSAVGKVLLDEGQRQKV
QQLKQQDIEQQEQRQRRARTY
>P27190 2.7.7.-~~~traC~~~DNA primase TraC~~~
MPITKAEAQGVTRAFVRDYPGALELAYKFREDAAELYGPRAAEVPADMKGGYVPKETLHAGRAYRGRVDVPLQNVESASD
LLMTLRHEVLGHYGANTFAPGEKRALLDGLAAARNEPTLKPLWDDVNRRYAGQSLDVRAEEVFALHCEGIEPSQHQVADQ
VQQRGQQSFTETCIARVRPMQADDLHNIVCMVAQGLRDRSRTQQNFPQFNELFRRDENMEPKKPFHEVVAEKLIEQLKAG
TAPWQKPWEPGEPNAYLPMNPTTGKRYKGINAIHLMAQGRSDARWMTYKQAAAVGAQVRKGEKGTPVQYWKFSEEQDKLD
DSGRPVLDAKGQPVKETVMLERPRVFFATVFNGEQIDGLPPLQPKKEQTWNAVERAEHILKASGATITHAAGDRAFYRPS
TDSITLPERGQFPSSDRYYATALHELGHWTGHASRLDRDLAHPFGSEGYAKEELRAEIASMIVGDELGIGHDPGQHAAYV
GSWIKALQDEPLEVFRAAADAEKIHDYVLAFEQKQVQEQDQQQSQAQDEAVAQALAVDIAEVLDNPDVSFSHYQAFQGDT
LEDALRSRGLETVGSITGTDPEQFYAVAHDRLSPVFGIDPSHTDTDNAYLERKGLAQEFANMAEQLHLAQQLQQHGEQIV
SSIDAEARWSDGQRIFAFHDQDGEPHQVRSLDELNNYAPDQLMALPALTQQQAAVADQEANMTPPTIDQAAALLAAHPAD
AVQGIEQAAAARRQLAAGEIDGQAFADATRQHLGVELPPDWSGELRIVGVAEQDGQTVDAAQAGIEPQAFQVYARKADAQ
FGEDAFAFVAGTRTEGQAEALAERLHLVDALGTDNQHERAAKLARVQEERVRRDPNATEEDISAAKEARKTAEASAMLND
SDAQRRAAELERQERDRQQAQPQAEKPERQYINVPYKEKDEAKSLGARWDRQQQSWYVPPGTDAAPFAKWAQGAATAAVE
PRSAQPAPEAQGEAQKPAQQAQQARQYLAVPYEQRNAAKAAGALWDKAAKSWYVGPRADAAKLERWKPENVQAQQGPAMT
PREEFAEAMRSAGLFTGSNAQGDHPIMDGKRHRVPVEGGKKGALDGFYVGHLDGHPAGRIINNKTGTDITWKSKGYALSD
QEKAKLQAEAAEKLAQRAVEQDKAQEATAQRVGRQMADLVPIEQPTPYLQAKGIQAQAGVMTDREGQKTYIPAFDAEGKQ
WTMQYIQEDGTKRFAKDSKKEGCFHPVGGMDALAAAPALVISEGYATAAQVAEAVGHATVAAFDSGNLEAVAKALHAKFP
DKPVIIAGDDDRHLVMTHGNNPGREKAEAAAQAVGGKAIFPIFAPAENTYPRDLPAITPDSFKTHLRAEQRLADAAAGKV
ELAGDEAAKLKASMLSGAQIAALSTMKQHTDFNDLAHKSELGIEGVKRQIGAAISQVQRDEQQHQEQKHVEKKQQQIEQR
PRRAARIG
>P18004 ~~~traC~~~Protein TraC~~~
MNNPLEAVTQAVNSLVTALKLPDESAKANEVLGEMSFPQFSRLLPYRDYNQESGLFMNDTTMGFMLEAIPINGANESIVE
ALDHMLRTKLPRGIPLCIHLMSSQLVGDRIEYGLREFSWSGEQAERFNAITRAYYMKAAATQFPLPEGMNLPLTLRHYRV
FISYCSPSKKKSRADILEMENLVKIIRASLQGASITTQTVDAQAFIDIVGEMINHNPDSLYPKRRQLDPYSDLNYQCVED
SFDLKVRADYLTLGLRENGRNSTARILNFHLARNPEIAFLWNMADNYSNLLNPELSISCPFILTLTLVVEDQVKTHSEAN
LKYMDLEKKSKTSYAKWFPSVEKEAKEWGELRQRLGSGQSSVVSYFLNITAFCKDNNETALEVEQDILNSFRKNGFELIS
PRFNHMRNFLTCLPFMAGKGLFKQLKEAGVVQRAESFNVANLMPLVADNPLTPAGLLAPTYRNQLAFIDIFFRGMNNTNY
NMAVCGTSGAGKTGLIQPLIRSVLDSGGFAVVFDMGDGYKSLCENMGGVYLDGETLRFNPFANITDIDQSAERVRDQLSV
MASPNGNLDEVHEGLLLQAVRASWLAKENRARIDDVVDFLKNASDSEQYAESPTIRSRLDEMIVLLDQYTANGTYGQYFN
SDEPSLRDDAKMVVLELGGLEDRPSLLVAVMFSLIIYIENRMYRTPRNLKKLNVIDEGWRLLDFKNHKVGEFIEKGYRTA
RRHTGAYITITQNIVDFDSDKASSAARAAWGNSSYKIILKQSAKEFAKYNQLYPDQFLPLQRDMIGKFGAAKDQWFSSFL
LQVENHSSWHRLFVDPLSRAMYSSDGPDFEFVQQKRKEGLSIHEAVWQLAWKKSGPEMASLEAWLEEHEKYRSVA
>P09130 ~~~traD~~~Coupling protein TraD~~~
MSFNAKDMTQGGQIASMRIRMFSQIANIMLYCLFIFFWILVGLVLWIKISWQTFVNGCIYWWCTTLEGMRDLIKSQPVYE
IQYYGKTFRMNAAQVLHDKYMIWCSEQLWSAFVLAAVVALVICLITFFVVSWILGRQGKQQSENEVTGGRQLTDNPKDVA
RMLKKDGKDSDIRIGDLPIIRDSEIQNFCLHGTVGAGKSEVIRRLANYARQRGDMVVIYDRSGEFVKSYYDPSIDKILNP
LDARCAAWDLWKECLTQPDFDNTANTLIPMGTKEDPFWQGSGRTIFAEAAYLMRNDPNRSYSKLVDTLLSIKIEKLRTYL
RNSPAANLVEEKIEKTAISIRAVLTNYVKAIRYLQGIEHNGEPFTIRDWMRGVREDQKNGWLFISSNADTHASLKPVISM
WLSIAIRGLLAMGENRNRRVWFFCDELPTLHKLPDLVEILPEARKFGGCYVFGIQSYAQLEDIYGEKAAASLFDVMNTRA
FFRSPSHKIAEFAAGEIGEKEHLKASEQYSYGADPVRDGVSTGKDMERQTLVSYSDIQSLPDLTCYVTLPGPYPAVKLSL
KYQTRPKVAPEFIPRDINPEMENRLSAVLAAREAEGRQMASLFEPDVPEVVSGEDVTQAEQPQQPVSPAINDKKSDSGVN
VPAGGIEQELKMKPEEEMEQQLPPGISESGEVVDMAAYEAWQQENHPDIQQQMQRREEVNINVHRERGEDVEPGDDF
>P27192 ~~~traD~~~Protein TraD~~~
MNEQTTTNTAAHDEPLAVLPPVDDDAAGREAVREKMADALTPGFQVEFDPDEAERVGAFVEDALSEQDAAASGDDLVEVD
GALEPAFLDDEGPSADIPPFITTTNARELYDLRPGETVAQAAARKASEG
>Q00185 ~~~traG~~~Conjugal transfer protein TraG~~~
MKNRNNAVGPQIRAKKPKASKTVPILAGLSLGAGLQTATQYFAHSFQYQAGLGWNINHVYTPWSILQWAGKWYGQYPDDF
MRAASMGMVVSTVGLLGTAVTQMVKANTGKANDYLHGSARWADKKDIQAAGLLPRPRTVVELVSGKHPPTSSGVYVGGWQ
DKDGKFHYLRHNGPEHVLTYAPTRSGKGVGLVVPTLLSWAHSAVITDLKGELWALTAGWRKKHARNKVVRFEPASAQGSA
CWNPLDEIRLGTEYEVGDVQNLATLIVDPDGKGLESHWQKTSQALLVGVILHALYKAKNEGTPATLPSVDGMLADPNRDV
GELWMEMTTYGHVDGQNHPAVGSAARDMMDRPEEESGSVLSTAKSYLALYRDPVVARNVSKSDFRIKQLMHHDDPVSLFI
VTQPNDKARLRPLVRVMVNMIVRLLADKMDFENGRPVAHYKHRLLMMLDEFPSLGKLEILQESLAFVAGYGIKCYLICQD
INQLKSRETGYGHDESITSNCHVQNAYPPNRVETAEHLSKLTGTTTIVKEQITTSGRRTSALLGNVSRTFQEVQRPLLTP
DECLRMPGPKKSADGSIEEAGDMVVYVAGYPAIYGKQPLYFKDPIFQARAAVPAPKVSDKLIQTATVEEGEGITI
>Q00190 ~~~traH~~~Protein TraH~~~
MSNPNEMTDEEIAAAMEAFDLPQPEPPSTPQAATATDGTLAPSAPAEPSHSASPTLDALDESRRPKAKTVCERCPNSVWF
ASPAELKCYCRVMFLVTWSSKEPNQLTHCDGEFLGQEEG
>P14565 ~~~traI~~~Multifunctional conjugation protein TraI~~~
MMSIAQVRSAGSAGNYYTDKDNYYVLGSMGERWAGRGAEQLGLQGSVDKDVFTRLLEGRLPDGADLSRMQDGSNRHRPGY
DLTFSAPKSVSMMAMLGGDKRLIDAHNQAVDFAVRQVEALASTRVMTDGQSETVLTGNLVMALFNHDTSRDQEPQLHTHA
VVANVTQHNGEWKTLSSDKVGKTGFIENVYANQIAFGRLYREKLKEQVEALGYETEVVGKHGMWEMPGVPVEAFSGRSQT
IREAVGEDASLKSRDVAALDTRKSKQHVDPEIKMAEWMQTLKETGFDIRAYRDAADQRADLRTLTPGPASQDGPDVQQAV
TQAIAGLSERKVQFTYTDVLARTVGILPPENGVIERARAGIDEAISREQLIPLDREKGLFTSGIHVLDELSVRALSRDIM
KQNRVTVHPEKSVPRTAGYSDAVSVLAQDRPSLAIVSGQGGAAGQRERVAELVMMAREQGREVQIIAADRRSQMNMKQDE
RLSGELITGRRQLLEGMAFTPGSTVIVDQGEKLSLKETLTLLDGAARHNVQVLITDSGQRTGTGSALMAMKDAGVNTYRW
QGGEQRPATIISEPDRNVRYARLAGDFAASVKAGEESVAQVSGVREQAILTQAIRSELKTQGVLGLPEVTMTALSPVWLD
SRSRYLRDMYRPGMVMEQWNPETRSHDRYVIDRVTAQSHSLTLRDAQGETQVVRISSLDSSWSLFRPEKMPVADGERLRV
TGKIPGLRVSGGDRLQVASVSEDAMTVVVPGRAEPATLPVSDSPFTALKLENGWVETPGHSVSDSATVFASVTQMAMDNA
TLNGLARSGRDVRLYSSLDETRTAEKLARHPSFTVVSEQIKTRAGETSLETAISHQKSALHTPAQQAIHLALPVVESKKL
AFSMVDLLTEAKSFAAEGTGFTELGGEINAQIKRGDLLYVDVAKGYGTGLLVSRASYEAEKSILRHILEGKEAVMPLMER
VPGELMEKLTSGQRAATRMILETSDRFTVVQGYAGVGKTTQFRAVMSAVNMLPESERPRVVGLGPTHRAVGEMRSAGVDA
QTLASFLHDTQLQQRSGETPDFSNTLFLLDESSMVGNTDMARAYALIAAGGGRAVASGDTDQLQAIAPGQPFRLQQTRSA
ADVAIMKEIVRQTPELREAVYSLINRDVERALSGLESVKPSQVPRQEGAWAPEHSVTEFSHSQEAKLAEAQQKAMLKGEA
FPDVPMTLYEAIVRDYTGRTPEAREQTLIVTHLNEDRRVLNSMIHDVREKAGELGKEQVMVPVLNTANIRDGELRRLSTW
ETHRDALVLVDNVYHRIAGISKDDGLITLQDAEGNTRLISPREAVAEGVTLYTPDTIRVGTGDRMRFTKSDRERGYVANS
VWTVTAVSGDSVTLSDGQQTREIRPGQEQAEQHIDLAYAITAHGAQGASETFAIALEGTEGNRKLMAGFESAYVALSRMK
QHVQVYTDNRQGWTDAINNAVQKGTAHDVFEPKPDREVMNAERLFSTARELRDVAAGRAVLRQAGLAGGDSPARFIAPGR
KYPQPYVALPAFDRNGKSAGIWLNPLTTDDGNGLRGFSGEGRVKGSGDAQFVALQGSRNGESLLADNMQDGVRIARDNPD
SGVVVRIAGEGRPWNPGAITGGRVWGDIPDNSVQPGAGNGEPVTAEVLAQRQAEEAIRRETERRADEIVRKMAENKPDLP
DGKTEQAVREIAGQERDRAAITEREAALPEGVLREPQRVREAVREIARENLLQERLQQMERDMVRDLQKEKTLGGD
>Q00191 ~~~traI~~~Protein TraI~~~
MIAKHVPMRSIKKSDFAELVKYITDEQGKTERLGHVRVTNCEANTLPAVMAEVMATQHGNTRSEADKTYHLLVSFRAGEK
PDAETLRAIEDRICAGLGFAEHQRVSAVHHDTDNLHIHIAINKIHPTRNTIHEPYRAYRALADLCATLERDYGLERDNHE
TRQRVSENRANDMERHAGVESLVGWIKRECLPELQAAQSWEDLHRVLRENGLKLRERGNGFIFEAGDGTTVKASTVSRDL
SKPKLEARFGAFTPAEGGEAPRRREYRAKPLKTRIDTTELYARYQSERQEMGAVRKGELDTLRRRRDRLIEAAMRSNRLR
RAAIKLLGEGRIAKRLMYAQAHKALRADLDKINREYRQGRQAVQERTQRRAWADWLKAEAMKGDDKALAALRAREGRSDL
KGNTIQGSGEAKPGHAAVTDNITKKGTIIYRVGSSAVRDDGDRLQVSREATTDGLDAALRLAMERFGDRITVNGTAEFKE
RIAQAAAAGRLAITFDDAALERRRQELLTKEQAHEQPERNDGRRDRGGDGGIRPAAARTTLNATGGDGDRRDARAVSAGG
TVALRKPNVGRIGRKPPPQSQNRLRALSQLGVVRIAGGAEMLLPRDVPGHVEQQGAEPAHALRRGVSGPGRGLKPEQIAA
AEKYVAEREQKRLNGFDIPKHARYTDYVGALSYAGTRNVEDQALALLRKENDEILVLPVDKATVQRMKRLAIGDPVTVTP
RGSLKTTRGRSR
>P06626 ~~~traJ~~~Protein TraJ~~~
MYPMDRIQQKHARQIDLLENLTAVIQDYPNPACIRDETGKFIFCNTLFHESFLTQDQSAEKWLLSQRDFCELISVTEMEA
YRNEHTHLNLVEDVFIQNRFWTISVQSFLNGHRNIILWQFYDAAHVRHKDSYNQKTIVSDDIRNIIRRMSDDSSVSSYVN
DVFYLYSTGISHNAIARILNISISTSKKHASLICDYFSVSNKDELIILLYNKKFIYYLYEKAMCIINTR
>P05837 ~~~traJ~~~Protein TraJ~~~
MCALDRRERPLNSQSVNKYILNVQNIYRNSPVPVCVRNKNRKILYANGAFIELFSREDKPLSGESYIRLQVEIFLSSLEL
ECQALGHGSAFCRRFNFHGEIYQIRMENVSFYNDESVVLWQINPFPDYPFFALNQSGSNTNTSDKLTIWNDLSPGTLVVF
SFYMLGVGHATIARELGITDRASEDRIKPVKRKIKEFFEHFDLFRVSCIYKGEIDSLLSIIREFYGVK
>P17909 ~~~traJ~~~Protein TraJ~~~
MADETKPTRKGSPPIKVYCLPDERRAIEEKAAAAGMSLSAYLLAVGQGYKITGVVDYEHVRELARINGDLGRLGGLLKLW
LTDDPRTARFGDATILALLAKIEEKQDELGKVMMGVVRPRAEP
>P33786 ~~~traJ~~~Protein TraJ~~~
MTVVDPARFMYERNHFPSLTDKEFETLVLYCQMMNVQMVADYQNRKPDVIIKHLKSCRQKIGVESDFELYFIVINKFVNF
ERVFPELTSEQINILAAFSFYPKRSTIARRFDIYRCDIYDELIKIRNNLGIEDLESLRMLFFMKITVFL
>P17910 ~~~traK~~~Protein TraK~~~
MPKSYTDELAEWVESRAAKKRRRDEAAVAFLAVRADVEAALASGYALVTIWEHMRETGKVKFSYETFRSHARRHIKAKPA
DVPAPQAKAAEPAPAPKTPEPRRPKQGGKAEKPAPAAAPTGFTFNPTPDKKDLL
>Q00188 ~~~traL~~~Protein TraL~~~
MAKIHMVLQGKGGVGKSAIAAIIAQYKMDKGQTPLCIDTDPVNATFEGYKALNVRRLNIMAGDEINSRNFDTLVELIAPT
KDDVVIDNGASSFVPLSHYLISNQVPALLQEMGHELVIHTVVTGGQALLDTVSGFAQLASQFPAEALFVVWLNPYWGPIE
HEGKSFEQMKAYTANKARVSSIIQIPALKEETYGRDFSDMLQERLTFDQALADESLTIMTRQRLKIVRRGLFEQLDAAAV
L
>P10026 ~~~traM~~~Relaxosome protein TraM~~~
MAKVNLYISNDAYEKINAIIEKRRQEGAREKDVSFSATASMLLELGLRVHEAQMERKESAFNQTEFNKLLLECVVKTQSS
VAKILGIESLSPHVSGNSKFEYANMVEDIREKVSSEMERFFPKNDDE
>P07294 ~~~traM~~~Relaxosome protein TraM~~~
MAKVQAYVSDEIVYKINKIVERRRAEGAKSTDVSFSSISTMLLELGLRVYEAQMERKESAFNQAEFNKVLLECAVKTQST
VAKILGIESLSPHVSGNPKFEYANMVEDIRDKVSSEMERFFPENDEE
>P13973 ~~~traM~~~Relaxosome protein TraM~~~
MARVILYISNDVYDKVNAIVEQRRQEGARDKDISVSGTASMLLELGLRVYEAQMERKESAFNQTEFNKLLLECVVKTQSS
VAKILGIESLSPHVSGNPKFEYANMVEDIREKVSSEMERFFPKNDEE
>Q00186 ~~~traM~~~Protein TraM~~~
MSDQIEELIREIAAKHGIAVGRDDPVLILHTINARLMADSAAKQEEILAAFKEELEGIAHRWGEDAKAKAERMLNAALAA
SKDAMAKVMKDSAAQAAEAIRREIDDGLGRQLAAKVADARRVAMMNMIAGGMVLFAAALVVWASL
>P33788 ~~~traM~~~Relaxosome protein TraM~~~
MPKIQTYVNNNVYEQITDLVTIRKQEGIEEASLSNVSSMLLELGLRVYMIQQEKREGGFNQMEYNKLMLENVSRVRAMCT
EILKMSVLNQESIASGNFDYAVIKPAIDKFAREQVSIFFPDDEDDQE
>Q57471 ~~~traM~~~Transcriptional repressor TraM~~~
MELEDANVTKKVELRPLIGLTRGLPPTDLETITIDAIRTHRRLVEKADELFQALPETYKTGQACGGPQHIRYIEASIEMH
AQMSALNTLYSILGFIPKVVVN
>P55408 ~~~traM~~~Probable transcriptional repressor TraM~~~
MNDMGSSEVNDENKEKEARYSVMTKSELEALAVSAIREHRRLLWADQAVYEEWLRASDDPSISGPVLQTLQDEYVARQKR
SEAQQEELSDILDALGFVPDVPFDDDN
>P24082 ~~~traN~~~Mating pair stabilization protein TraN~~~
MKRILPLILALVAGMAQADSNSDYRAGSDFAHQIKGQGSSSIQGFKPQESIPGYNANPDETKYYGGVTAGGDGGLKNDGT
TEWATGETGKTITESFMNKPKDILSPDAPFIQTGRDVVNRADSIVGNTGQQCSAQEISRSEYTNYTCERDLQVEQYCTRT
ARMELQGSTTWETRTLEYEMSQLPAREVNGQYVVSITSPVTGEIVDAHYSWSRTYLQKSVPMTITVLGTPLSWNAKYSAD
ASFTPVQKTLTAGVAFTSSHPVRVGNTKFKRHTAMKLRLVVRVKKASYTPYVVWSESCPFSKELGKLTKTECTEAGGNRT
LVKDGQSYSMYQSCWAYRDTYVTQSADKGTCQTYTDNPACTLVSHQCAFYSEEGACLHEYATYSCESKTSGKVMVCGGDV
FCLDGECDKAQSGKSNDFAEAVSQLAALAAAGKDVAALNGVDVRAFTGQAKFCKKAAAGYSNCCKDSGWGQDIGLAKCSS
DEKALAKAKSNKLTVSVGEFCSKKVLGVCLEKKRSYCQFDSKLAQIVQQQGRNGQLRISFGSAKHPDCRGITVDELQKIQ
FNRLDFTNFYEDLMNNQKIPDSGVLTQKVKEQIADQLKQAGQ
>Q2G2F3 ~~~traP~~~Signal transduction protein TRAP~~~COG2329
MKKLYTSYGTYGFLHQIKINNPTHQLFQFSASDTSVIFEETDGETVLKSPSIYEVIKEIGEFSEHHFYCAIFIPSTEDHA
YQLEKKLISVDDNFRNFGGFKSYRLLRPAKGTTYKIYFGFADRHAYEDFKQSDAFNDHFSKDALSHYFGSSGQHSSYFER
YLYPIKE
>Q7A4W3 ~~~traP~~~Signal transduction protein TRAP~~~
MKKLYTSYGTYGFLHQIKINNPTHQLFQFSASDTSVIFEETDGETVLKSPSIYEVIKEIGEFSEHHFYCAIFIPSTEDHA
YQLEKKLISVDDNFRNFGGFKSYRLLRPAKGTTYKIYFGFADRHAYEDFKQSDAFNDHFSKDALSHYFGSSGQHSSYFER
YLYPIKE
>Q8GQQ1 ~~~traP~~~Signal transduction protein TRAP~~~
MYLYTSYGTYQFLNQIKLNHQERSLFQFSTNDSSIILEESEGKSILKHPSSYQVIDSTGEFNEHHFYSAIFVPTSEDHRQ
QLEKKLLHVDVPLSNFGGFKSYRLLKPTEGSTYKIYFGFANRTAYEDFKASDIFNENFSKDALSQYFGASGQHSSYFERY
LYPIEDH
>Q5HNA3 ~~~traP~~~Signal transduction protein TRAP~~~COG2329
MYLYTSYGTYQFLNQIKLNHQERSLFQFSTNDSSIILEESEGKSILKHPSSYQVIDSTGEFNEHHFYSAIFVPTSEDHRQ
QLEKKLLHVDVPLSNFGGFKSYRLLKPTEGSTYKIYFGFANRTAYEDFKASDIFNENFSKDALSQYFGASGQHSSYFERY
LYPIEDH
>P18033 ~~~traQ~~~Protein TraQ~~~
MISKRRFSLPRLDITGMWVFSLGVWFHIVARLVYSKPWMAFFLAELIAAILVLFGAYQVLDAWIARVSREEREALEARQQ
AMMEGQQEGGHVSH
>P41065 ~~~traR~~~Protein TraR~~~
MSDEADEAYSVTEQLTMTGINRIRQKINAHGIPVYLCEACGNPIPEARRKIFPGVTLCVECQAYQERQRKHYA
>P33905 ~~~traR~~~Transcriptional activator protein TraR~~~
MQHWLDKLTDLAAIEGDECILKTGLADIADHFGFTGYAYLHIQHRHITAVTNYHRQWQSTYFDKKFEALDPVVKRARSRK
HIFTWSGEHERPTLSKDERAFYDHASDFGIRSGITIPIKTANGFMSMFTMASDKPVIDLDREIDAVAAAATIGQIHARIS
FLRTTPTAEDAAWLDPKEATYLRWIAVGKTMEEIADVEGVKYNSVRVKLREAMKRFDVRSKAHLTALAIRRKLI
>P55407 ~~~traR~~~Probable transcriptional activator protein TraR~~~COG2771
MSVNGNLRSLIDMLEAAQDGHMIKIALRSFAHSCGYDRFAYLQKDGTQVRTFHSYPGPWESIYLGSDYFNIDPVLAEAKR
RRDVFFWTADAWPARGSSPLRRFRDEAISHGIRCGVTIPVEGSYGSAMMLTFASPERKVDISGVLDPKKAVQLLMMVHYQ
LKIIAAKTVLNPKQMLSPREMLCLVWASKGKTASVTANLTGINARTVQHYLDKARAKLDAESVPQLVAIAKDRGLV
>P09129 ~~~traS~~~Protein TraS~~~
MITQQIISSELEVLKKHIDSGDIRIPSLWQGLKPGLIIMGWMIFCPLLMSFLITQKTSETLTAVLAGGWLGLIILFIVAR
IRMLYFSLPEEFLKTSSVMRVISSKLKVYFIVYMGVIFLWSFLGGGIIYGFGAILVTVIMAFLIQLDIGRYQFVGVIDAI
NSYVKNKKLSRVK
>P41069 ~~~traV~~~Protein TraV~~~
MKQTSFFIPLLGTLLLYGCAGTSTEFECNATTSDTCMTMEQANEKAKKLERSSEAKPVAASLPRLAEGNFRTMPVQTVTA
TTPSGSRPAVTAHPEQKLLAPRPLFTAAREVKTVVPVSSVTPVTPPRPLRTGEQTAALWIAPYIDNQDVYHQPSSVFFVI
KPSAWGKPRIN
>P06627 ~~~traY~~~Relaxosome protein TraY~~~
MKRFGTRSATGKMVKLKLPVDVESLLIEASNRSGRSRSFEAVIRLKDHLHRYPKFNRAGNIYGKSLVKYLTMRLDDETNQ
LLIAAKNRSGWCKTDEAADRVIDHLIKFPDFYNSEIFREADKEEDITFNTL
>L7N689 ~~~trcR~~~Transcriptional regulatory protein TrcR~~~COG0745
MTTMSGYTRSQRPRQAILGQLPRIHRADGSPIRVLLVDDEPALTNLVKMALHYEGWDVEVAHDGQEAIAKFDKVGPDVLV
LDIMLPDVDGLEILRRVRESDVYTPTLFLTARDSVMDRVTGLTSGADDYMTKPFSLEELVARLRGLLRRSSHLERPADEA
LRVGDLTLDGASREVTRDGTPISLSSTEFELLRFLMRNPRRALSRTEILDRVWNYDFAGRTSIVDLYISYLRKKIDSDRE
PMIHTVRGIGYMLRPPE
>P96368 2.7.13.3~~~trcS~~~Sensor histidine kinase TrcS~~~COG2205
MIPDRNTRSRKAPCWRPRSLRQQLLLGVLAVVTVVLVAVGVVSVLSLSGYVTAMNDAELVESLHALNHSYTRYRDSAQTS
TPTGNLPMSQAVLEFTGQTPGNLIAVLHDGVVIGSAVFSEDGARPAPPDVIRAIEAQVWDGGPPRVESLGSLGAYQVDSS
AAGADRLFVGVSLSLANQIIARKKVTTVALVGAALVVTAALTVWVVGYALRPLRRVAATAAEVATMPLTDDDHQISVRVR
PGDTDPDNEVGIVGHTLNRLLDNVDGALAHRVDSDLRMRQFITDASHELRTPLAAIQGYAELTRQDSSDLPPTTEYALAR
IESEARRMTLLVDELLLLSRLSEGEDLETEDLDLTDLVINAVNDAAVAAPTHRWVKNLPDEPVWVNGDHARLHQLVSNLL
TNAWVHTQPGVTVTIGITCHRTGPNAPCVELSVTDDGPDIDPEILPHLFDRFVRASKSRSNGSGHGLGLAIVSSIVKAHR
GSVTAESGNGQTVFRVRLPMIEQQIATTA
>A8GG78 2.4.2.31~~~tre1~~~NAD(+)--protein-arginine ADP-ribosyltransferase Tre1~~~COG4104
MSELSAARELDEIAHTASEGWMIAGLIGGAIVGAALIAVTGGTAAVAVAAVVAGASAGGGLGEVLGSMSWAPRHVTGVLA
DGSPNVYINGRPAIRAHISTGECSEDGPAKKVVAQGSAKVYINDFPAARINDLLACSAEIHTGSPNVIIGGEKEQTDDIE
PEIPDWVNWTLLAAGAGAAAVLATPAIAILGTLGGLGGGFAGSLIGGAFFGEGSDGQKWSMLAGGFVGGFAGGKGGAKFD
AWRNTKIVEPPPRVTTKVDPISPPRMTLAEAVGQEQAKVWTQTARANAEKNNAQLSTLLTDDQIGAIYGYTTNEGYTALN
PALRGQTPLTPELEAFTGHVTDGLNKLPAYNGETYRGTTLPAHILEQNQIGGTVSDGGFMSTSAKTPFDGDVSISVRGNS
GKQIDFLSKYKNEAEVLYPPNTRFEVINRIEQNGTTHLLYREIP
>P13482 3.2.1.28~~~treA~~~Periplasmic trehalase~~~COG1626
MKSPAPSRPQKMALIPACIFLCFAALSVQAEETPVTPQPPDILLGPLFNDVQNAKLFPDQKTFADAVPNSDPLMILADYR
MQQNQSGFDLRHFVNVNFTLPKEGEKYVPPEGQSLREHIDGLWPVLTRSTENTEKWDSLLPLPEPYVVPGGRFREVYYWD
SYFTMLGLAESGHWDKVADMVANFAHEIDTYGHIPNGNRSYYLSRSQPPFFALMVELLAQHEGDAALKQYLPQMQKEYAY
WMDGVENLQAGQQEKRVVKLQDGTLLNRYWDDRDTPRPESWVEDIATAKSNPNRPATEIYRDLRSAAASGWDFSSRWMDN
PQQLNTLRTTSIVPVDLNSLMFKMEKILARASKAAGDNAMANQYETLANARQKGIEKYLWNDQQGWYADYDLKSHKVRNQ
LTAAALFPLYVNAAAKDRANKMATATKTHLLQPGGLNTTSVKSGQQWDAPNGWAPLQWVATEGLQNYGQKEVAMDISWHF
LTNVQHTYDREKKLVEKYDVSTTGTGGGGGEYPLQDGFGWTNGVTLKMLDLICPKEQPCDNVPATRPTVKSATTQPSTKE
AQPTP
>P39795 3.2.1.93~~~treA~~~Trehalose-6-phosphate hydrolase~~~COG0366
MKTEQTPWWKKAVVYQIYPKSFNDTTGNGVGDLNGIIEKLDYLKTLQVDVLWLTPIYDSPQHDNGYDIRDYYSIYPEYGT
MEDFERLVSEAHKRDLKVVMDLVVNHTSTEHKWFREAISSIDSPYRDFYIWKKPQENGSVPTNWESKFGGSAWELDEASG
QYYLHLFDVTQADLNWENEEVRKHVYDMMHFWFEKGIDGFRLDVINLISKDQRFPNAEEGDGRSFYTDGPRVHEFLHEMN
EKVFSHYDSMTVGEMSSTTVDHCIRYTNPDNKELDMTFSFHHLKVDYPNGEKWALAPFDFLKLKEILSDWQTGMHAGGGW
NALFWCNHDQPRVVSRYGDDGAYRVKSAKMLATAIHMMQGTPYIYQGEELGMTNPKFTDISSYRDVESLNMYHAFKEKGM
ADQDITAILQAKSRDNSRTPVQWDATENGGFTTGTPWIPVAGNYREINAEAALRDQNSVFYHYQKLIQIRKMYDIVTEGT
YEIIAKDDPNIFAYLRHGSNEKLLVINNFYGTEAAFTLPDSLAPDEWKAEVLLTNDEAREGLQNMTLRPYESIVYRLTKP
C
>P28904 3.2.1.93~~~treC~~~Trehalose-6-phosphate hydrolase~~~COG0366
MTHLPHWWQNGVIYQIYPKSFQDTTGSGTGDLRGVIQHLDYLHKLGVDAIWLTPFYVSPQVDNGYDVANYTAIDPTYGTL
DDFDELVTQAKSRGIRIILDMVFNHTSTQHAWFREALNKESPYRQFYIWRDGEPETPPNNWRSKFGGSAWRWHAESEQYY
LHLFAPEQADLNWENPAVRAELKKVCEFWADRGVDGLRLDVVNLISKDPRFPEDLDGDGRRFYTDGPRAHEFLHEMNRDV
FTPRGLMTVGEMSSTSLEHCQRYAALTGSELSMTFNFHHLKVDYPGGEKWTLAKPDFVALKTLFRHWQQGMHNVAWNALF
WCNHDQPRIVSRFGDEGEYRVPAAKMLAMVLHGMQGTPYIYQGEEIGMTNPHFTRITDYRDVESLNMFAELRNDGRDADE
LLAILASKSRDNSRTPMQWSNGDNAGFTAGEPWIGLGDNYQQINVEAALADDSSVFYTYQKLIALRKQEAILTWGNYQDL
LPNSPVLWCYRREWKGQTLLVIANLSREIQPWQAGQMRGNWQLVMHNYEEASPQPCAMNLRPFEAVWWLQK
>P62601 3.2.1.28~~~treF~~~Cytoplasmic trehalase~~~COG1626
MLNQKIQNPNPDELMIEVDLCYELDPYELKLDEMIEAEPEPEMIEGLPASDALTPADRYLELFEHVQSAKIFPDSKTFPD
CAPKMDPLDILIRYRKVRRHRDFDLRKFVENHFWLPEVYSSEYVSDPQNSLKEHIDQLWPVLTREPQDHIPWSSLLALPQ
SYIVPGGRFSETYYWDSYFTMLGLAESGREDLLKCMADNFAWMIENYGHIPNGNRTYYLSRSQPPVFALMVELFEEDGVR
GARRYLDHLKMEYAFWMDGAESLIPNQAYRHVVRMPDGSLLNRYWDDRDTPRDESWLEDVETAKHSGRPPNEVYRDLRAG
AASGWDYSSRWLRDTGRLASIRTTQFIPIDLNAFLFKLESAIANISALKGEKETEALFRQKASARRDAVNRYLWDDENGI
YRDYDWRREQLALFSAAAIVPLYVGMANHEQADRLANAVRSRLLTPGGILASEYETGEQWDKPNGWAPLQWMAIQGFKMY
GDDLLGDEIARSWLKTVNQFYLEQHKLIEKYHIADGVPREGGGGEYPLQDGFGWTNGVVRRLIGLYGEP
>A0R0W9 3.2.1.28~~~~~~Trehalase~~~COG3387
MVLQQTEPTDGADRKASDGPLTVTAPVPYAAGPTLRNPFPPIADYGFLSDCETTCLISSAGSVEWLCVPRPDSPSVFGAI
LDRGAGHFRLGPYGVSVPAARRYLPGSLILETTWQTHTGWLIVRDALVMGPWHDIDTRSRTHRRTPMDWDAEHILLRTVR
CVSGTVELVMSCEPAFDYHRVSATWEYSGPAYGEAIARASRNPDSHPTLRLTTNLRIGIEGREARARTRLTEGDNVFVAL
SWSKHPAPQTYEEAADKMWKTSEAWRQWINVGDFPDHPWRAYLQRSALTLKGLTYSPTGALLAAPTTSLPETPQGERNWD
YRYSWIRDSTFALWGLYTLGLDREADDFFSFIADVSGANNGERHPLQVMYGVGGERSLVEEELHHLSGYDNSRPVRIGNG
AYNQRQHDIWGTMLDSVYLHAKSREQIPDALWPVLKNQVEEAIKHWKEPDRGIWEVRGEPQHFTSSKIMCWVALDRGSKL
AELQGEKSYAQQWRAIAEEIKADVLARGVDKRGVLTQRYGDDALDASLLLAVLTRFLPADDPRIRATVLAIADELTEDGL
VLRYRVEETDDGLAGEEGTFTICSFWLVSALVEIGEISRAKHLCERLLSFASPLHLYAEEIEPRTGRHLGNFPQAFTHLA
LINAVVHVIRAEEEADSSGVFVPANAPM
>P71741 3.2.1.28~~~~~~Trehalase~~~COG3387
MVLHAQPPDQSTETAREAKALAGATDGATATSADLHAPMALSSSSPLRNPFPPIADYAFLSDWETTCLISPAGSVEWLCV
PRPDSPSVFGAILDRSAGHFRLGPYGVSVPSARRYLPGSLIMETTWQTHTGWLIVRDALVMGKWHDIERRSRTHRRTPMD
WDAEHILLRTVRCVSGTVELMMSCEPAFDYHRLGATWEYSAEAYGEAIARANTEPDAHPTLRLTTNLRIGLEGREARART
RMKEGDDVFVALSWTKHPPPQTYDEAADKMWQTTECWRQWINIGNFPDHPWRAYLQRSALTLKGLTYSPTGALLAASTTS
LPETPRGERNWDYRYAWIRDSTFALWGLYTLGLDREADDFFAFIADVSGANNNERHPLQVMYGVGGERSLVEAELHHLSG
YDHARPVRIGNGAYNQRQHDIWGSILDSFYLHAKSREQVPENLWPVLKRQVEEAIKHWREPDRGIWEVRGEPQHFTSSKV
MCWVALDRGAKLAERQGEKSYAQQWRAIADEIKADILEHGVDSRGVFTQRYGDEALDASLLLVVLTRFLPPDDPRVRNTV
LAIADELTEDGLVLRYRVHETDDGLSGEEGTFTICSFWLVSALVEIGEVGRAKRLCERLLSFASPLLLYAEEIEPRSGRH
LGNFPQAFTHLALINAVVHVIRAEEEADSSGMFQPANAPM
>Q9CID5 2.4.1.216~~~trePP~~~Trehalose 6-phosphate phosphorylase~~~COG1554
MTEKDWIIQYDKKEVGKRSYGQESLMSLGNGYLGLRGAPLWSTCSDNHYPGLYVAGVFNRTSTEVAGHDVINEDMVNWPN
PQLIKVYIDGELVDFEASVEKQATIDFKNALQIERYQVKLAKGNLTLVTTKFVDPINFHDFGFVGEIIADFSCKLRIETF
TDGSVLNQNVERYRAFDSKEFEVTKISKGLLVAKTRTSEIELAIASKSFLNGLAFPKIDSENDEILAEAIEIDLQKNQEV
QFDKTIVIASSYESKNPVEFVLTELSATSVSKIQENNTNYWEKVWSDADIVIESDHEDLQRMVRMNIFHIRQAAQHGANQ
FLDASVGSRGLTGEGYRGHIFWDEIFVLPYYAANEPETARDLLLYRINRLTAAQENAKVDGEKGAMFPWQSGLIGDEQSQ
FVHLNTVNNEWEPDNSRRQRHVSLAIVYNLWIYSQLTEDESILTDGGLDLIIETTKFWLNKAELGDDGRYHIDGVMGPDE
YHEAYPGQEGGICDNAYTNLMLTWQLNWLTELSEKGFEIPKELLEKAQKVRKKLYLDIDENGVIAQYAKYFELKEVDFAA
YEAKYGDIHRIDRLMKAEGISPDEYQVAKQADTLMLIYNLGQEHVTKLVKQLAYELPENWLKVNRDYYLARTVHGSTTSR
PVFAGIDVKLGDFDEALDFLITAIGSDYYDIQGGTTAEGVHIGVMGETLEVIQNEFAGLSLREGQFAIAPYLPKSWTKLK
FNQIFRGTKVEILIENGQLLLTASADLLTKVYDDEVQLKAGVQTKFDLK
>Q8GRC3 2.4.1.64~~~treP~~~Alpha,alpha-trehalose phosphorylase~~~
MSWSISSNQLNIENLLNEESLFFTGNGYIGVRGNFEEKYYDGASSIRGTYINAFHDITDINYGEKLYAFPETQQKLVNVI
DAQTVQIYFGEEEERFSLFEGEVIQYERHLHMDKGFSERVIHWRSPGGKEVKLKFKRLTSFIYKELFIQEITIEPVNFFG
KTKVVSTVNGDVSNFVDPSDPRVGSGHAKLLTVSDTVIEGDFVSIETKTKRSNLYAACTSTCRLNIDFQREYVKNEKSVE
TVLTFELTEKAIMTKINIYTDTLRHGDRPLRTGLDLCQKLSCLTFNDLKEQQKHYLDKFWLYADVEISGDQALQEGIRFN
LFHLLQSAGRDRFSNIAAKGLSGEGYEGHYFWDTEIYMVPVFLMTNPELAKQLLIYRYSILDKARERAREMGHRKGALFP
WRTISGGECSSYFPAGTAQYHISADIAYSYVQYYLVTKDLDFLKSYGAELLIETARLWMDTGHYHEGKFKIDAVTGPDEY
TCIVNNNYYTNVMAKHNLRWAAKSVAELEKHAPDTLASLKAKLEITDEEIAEWIKAAEAMYLPYDPTLNINPQDDTFLQK
QVWDFDNTPKEHYPLLLHYHPLTLYRYQVCKQADTVLAHFLLEDEQDESVIRDSYHYYEKITTHDSSLSSCVFSIMAAKI
GELDKAYEYFIETARLDLDNTHGNTKDGLHMANMGGTWMAIVYGFAGLRIKESGLSLAPVIPKQWQSYRFSIQYLGRHIS
VSVDTKGTKVNLLNGEELTIKLYGKKHQLTKDEPLEITFNNGRVD
>Q8L164 2.4.1.64~~~treP~~~Alpha,alpha-trehalose phosphorylase~~~
MANKTKKPIYPFEDWVIRETQFSIDTNYRNETIFTLANGYIGMRGTFEERYSGPKNTSFNGTYINGFYEIHDIVYPEGGY
GFAKIGQTMLNVADSKIIKLYVDGEEFDLLQGKILFYERVLDMKKGFVERKVKWESPTGKILEVKIKRIVSLNRQHLAAI
SFTMQPVNFTGKIRFVSAIDGNVSNINDSEDVRVGSNLKGKVLKTIDKSVEGLKGWIVQKTQKSNFSYACAIDNVLVADS
KYEVSNSLEEDGVKVIVDLEAEKGTSYTLNKFISYYTSKDFDENKLVALALEEIEKAKNDGFETIEKEQEEFLNSFWKDA
DVIIEGDKALQQGIRFNEFHLLQSVGRDGKTNIAAKGLTGGGYEGHYFWDSDIYIMPFFLYTKPEIAKALVMYRYNLLDA
ARSRAKELGHKGALYPWRTIDGPECSAYFPAGTAQYHINADIVYALKRYVEATNDVDFLYDYGCEILFETARFWEDLGAY
IPLKGNKFCINCVTGPDEYTALVDNNAYTNYMAKMNLEYAYDIANKMKKEVPQKYQKVASKLNLKDEEIVAWKKAADNMY
LPYSKELDIIPQDDSFLYKERITVDEIPEDQFPLLLHWHYLNIYRYQICKQPDVLLLMFLQREKFTKDELKKNYDYYEPI
TTHDSSLSPAIFSILANEIGYTDKAYKYFMMTARMDLDDYNDNVKDGIHAASMAGTWSAVVNGFGGMRVYTNELHFEPRL
PKEWNLLSFNVRYKGRKINVKLTKENVVFALLEGEPIEIYYFDKKILLEKGEIK
>P39796 ~~~treR~~~HTH-type transcriptional regulator TreR~~~COG2188
MKVNKFITIYKDIAQQIEGGRWKAEEILPSEHELTAQYGTSRETVRKALHMLAQNGYIQKIRGKGSVVLNREKMQFPVSG
LVSFKELAQTLGKETKTTVHKFGLEPPSELIQKQLRANLDDDIWEVIRSRKIDGEHVILDKDYFFRKHVPHLTKEICENS
IYEYIEGELGLSISYAQKEIVAEPCTDEDRELLDLRGYDHMVVVRNYVFLEDTSLFQYTESRHRLDKFRFVDFARRGK
>P36673 ~~~treR~~~HTH-type transcriptional regulator TreR~~~COG1609
MQNRLTIKDIARLSGVGKSTVSRVLNNESGVSQLTRERVEAVMNQHGFSPSRSARAMRGQSDKVVAIIVTRLDSLSENLA
VQTMLPAFYEQGYDPIMMESQFSPQLVAEHLGVLKRRNIDGVVLFGFTGITEEMLAHWQSSLVLLARDAKGFASVCYDDE
GAIKILMQRLYDQGHRNISYLGVPHSDVTTGKRRHEAYLAFCKAHKLHPVAALPGLAMKQGYENVAKVITPETTALLCAT
DTLALGASKYLQEQRIDTLQLASVGNTPLMKFLHPEIVTVDPGYAEAGRQAACQLIAQVTGRSEPQQIIIPATLS
>A0R6E0 3.2.1.1~~~treS~~~Trehalose synthase/amylase TreS~~~COG0366
MEEHTQGSHVEAGIVEHPNAEDFGHARTLPTDTNWFKHAVFYEVLVRAFYDSNADGIGDLRGLTEKLDYIKWLGVDCLWL
PPFYDSPLRDGGYDIRDFYKVLPEFGTVDDFVTLLDAAHRRGIRIITDLVMNHTSDQHEWFQESRHNPDGPYGDFYVWSD
TSDRYPDARIIFVDTEESNWTFDPVRRQFYWHRFFSHQPDLNYDNPAVQEAMLDVLRFWLDLGIDGFRLDAVPYLFEREG
TNCENLPETHAFLKRCRKAIDDEYPGRVLLAEANQWPADVVAYFGDPDTGGDECHMAFHFPLMPRIFMAVRRESRFPISE
ILAQTPPIPDTAQWGIFLRNHDELTLEMVTDEERDYMYAEYAKDPRMKANVGIRRRLAPLLENDRNQIELFTALLLSLPG
SPVLYYGDEIGMGDIIWLGDRDSVRTPMQWTPDRNAGFSKATPGRLYLPPNQDAVYGYHSVNVEAQLDSSSSLLNWTRNM
LAVRSRHDAFAVGTFRELGGSNPSVLAYIREVTRQQGDGGAKTDAVLCVNNLSRFPQPIELNLQQWAGYIPVEMTGYVEF
PSIGQLPYLLTLPGHGFYWFQLREPDPEPGAQQ
>P9WQ19 3.2.1.1~~~treS~~~Trehalose synthase/amylase TreS~~~COG0366
MNEAEHSVEHPPVQGSHVEGGVVEHPDAKDFGSAAALPADPTWFKHAVFYEVLVRAFFDASADGSGDLRGLIDRLDYLQW
LGIDCIWLPPFYDSPLRDGGYDIRDFYKVLPEFGTVDDFVALVDAAHRRGIRIITDLVMNHTSESHPWFQESRRDPDGPY
GDYYVWSDTSERYTDARIIFVDTEESNWSFDPVRRQFYWHRFFSHQPDLNYDNPAVQEAMIDVIRFWLGLGIDGFRLDAV
PYLFEREGTNCENLPETHAFLKRVRKVVDDEFPGRVLLAEANQWPGDVVEYFGDPNTGGDECHMAFHFPLMPRIFMAVRR
ESRFPISEIIAQTPPIPDMAQWGIFLRNHDELTLEMVTDEERDYMYAEYAKDPRMKANVGIRRRLAPLLDNDRNQIELFT
ALLLSLPGSPVLYYGDEIGMGDVIWLGDRDGVRIPMQWTPDRNAGFSTANPGRLYLPPSQDPVYGYQAVNVEAQRDTSTS
LLNFTRTMLAVRRRHPAFAVGAFQELGGSNPSVLAYVRQVAGDDGDTVLCVNNLSRFPQPIELDLQQWTNYTPVELTGHV
EFPRIGQVPYLLTLPGHGFYWFQLTTHEVGAPPTCGGERRL
>Q1ARU5 2.4.1.245~~~treT~~~Trehalose synthase~~~COG0438
MLQRVNPGHKALADYRSIIRRELYGELQELAGRLRGARVLHINATSFGGGVAEILYTLVPLARDAGLEVEWAIMFGAEPF
FNVTKRFHNALQGADYELTIEDRAIYEEYNRRTAQALAESGEEWDIVFVHDPQPALVREFSGGLGEGTRWIWRCHIDTST
PNRQVLDYLWPYIADYDAQVYTMREYTPPGVEMPGLTLIPPAIDPLSPKNMALSRDDASYIVSQFGVDVERPFLLQVSRF
DPWKDPLGVIDVYRMVKEEVGEVQLVLVGSMAHDDPEGWDYWYKTVNYAGGDPDIFLFSNLTNVGAIEVNAFQSLADVVI
QKSIREGFGLVVSEALWKARPVVASRVGGIPMQITAGGGILIDTIPEAAAACAKLLSDPEFAREMGRRGKEHVRANFLTP
RLLRDDLRLFAKLLGV
>Q44315 5.4.99.15~~~treY~~~Maltooligosyl trehalose synthase~~~
MRTPVSTYRLQIRKGFTLFDAAKTVPYLHSLGVDWVYLSPVLTAEQGSDHGYDVTDPSAVDPERGGPEGLAAVSKAARAA
GMGVLIDIVPNHVGVATPAQNPWWWSLLKEGRQSRYAEAFDVDWDLAGGRIRLPVLGSDDDLDQLEIRDGELRYYDHRFP
LAEGTYAEGDAPRDVHARQHYELIGWRRADNELNYRRFFAVNTLAGVRVEIPAVFDEAHQEVVRWFREDLADGLRIDHPD
GLADPEGYLKRLREVTGGAYLLIEKILEPGEQLPASFECEGTTGYDALADVDRVLVDPRGQEPLDRLDASLRGGEPADYQ
DMIRGTKRRITDGILHSEILRLARLVPGDANVSIDAGADALAEIIAAFPVYRTYLPEGAEVLKEACELAARRRPELDQAI
QALQPLLLDTDLELARRFQQTSGMVMAKGVEDTAFFRYNRLGTLTEVGADPTEFAVEPDEFHARLARRQAELPLSMTTLS
THDTKRSEDTRARISVISEVAGDWEKALNRLRDLAPLPDGPLSALLWQAIAGAWPASRERLQYYALKAAREAGNSTNWTD
PAPAFEEKLKAAVDAVFDNPAVQAEVEALVELLEPYGASNSLAAKLVQLTMPGVPDVYQGTEFWDRSLTDPDNRRPFSFD
DRRAALEQLDAGDLPASFTDERTKLLVTSRALRLRRDRPELFTGYRPVLASGPAAGHLLAFDRGTAAAPGALTLATRLPY
GLEQSGGWRDTAVELNTAMKDELTGAGFGPGAVKIADIFRSFPVALLVPQTGGES
>P9WQ21 5.4.99.15~~~treY~~~Putative maltooligosyl trehalose synthase~~~COG3280
MAFPVISTYRVQMRGRSNGFGFTFADAENLLDYLDDLGVSHLYLSPILTAVGGSTHGYDVTDPTTVSPELGGSDGLARLS
AAARSRGMGLIVDIVPSHVGVGKPEQNAWWWDVLKFGRSSAYAEFFDIDWELGDGRIILPLLGSDSDVANLRVDGDLLRL
GDLALPVAPGSGDGTGPAVHDRQHYRLVGWRHGLCGYRRFFSITSLAGLRQEDRAVFDASHAEVARWFTEGLVDGVRVDH
LDGLSDPSGYLAQLRELLGPNAWIVVEKILAVDEALEPTLPVDGSTGYDVLREIGGVLVDPQGESPLTALVESAGVDYQE
MPAMLADLKVHAAVHTLASELRRLRRCIAAAAGADHPLLPAAVAALLRHIGRYRCDYPGQAAVLPCALAETHSTTPQLAP
GLQLIAAAVARGGEPAVRLQQLCGAVSAKAVEDCMFYRDARLVSLNEVGGEPRRFGVGAAEFHHRAATRARLWPRSMTTL
STHDTKRGEDVRARIGVLSQVPWLWAKFIGHAQAIAPAPDAVTGQFLWQNVFGVWPVSGEVSAALRGRLHTYAEKAIREA
AWHTSWHNPNRAFEDDVHGWLDLVLDGPLASELTGLVAHLNSHAESDALAAKLLALTVPGVPDVYQGSELWDDSLVDPDN
RRPVDYGTRRVALKALQHPKIRVLAAALRLRRTHPESFLGGAYHPVFAAGPAADHVVAFRRGDDILVAVTRWTVRLQQTG
WDHTVLPLPDGSWTDALTGFTASGHTPAVELFADLPVVLLVRDNA
>Q9RX51 3.2.1.141~~~treZ~~~Malto-oligosyltrehalose trehalohydrolase~~~COG0296
MTQTQPVTPTPPASFQTQHDPRTRLGATPLPGGAGTRFRLWTSTARTVAVRVNGTEHVMTSLGGGIYELELPVGPGARYL
FVLDGVPTPDPYARFLPDGVHGEAEVVDFGTFDWTDADWHGIKLADCVFYEVHVGTFTPEGTYRAAAEKLPYLKELGVTA
IQVMPLAAFDGQRGWGYDGAAFYAPYAPYGRPEDLMALVDAAHRLGLGVFLDVVYNHFGPSGNYLSSYAPSYFTDRFSSA
WGMGLDYAEPHMRRYVTGNARMWLRDYHFDGLRLDATPYMTDDSETHILTELAQEIHELGGTHLLLAEDHRNLPDLVTVN
HLDGIWTDDFHHETRVTLTGEQEGYYAGYRGGAEALAYTIRRGWRYEGQFWAVKGEEHERGHPSDALEAPNFVYCIQNHD
QIGNRPLGERLHQSDGVTLHEYRGAAALLLPMTPLLFQGQEWAASTPFQFFSDHAGELGQAVSEGRKKEFGGFSGFSGED
VPDPQAEQTFLNSKLNWAEREGGEHARTLRLYRDLLRLRREDPVLHNRQRENLTTGHDGDVLWVRTVTGAGERVLLWNLG
QDTRAVAEVKLPFTVPRRLLLHTEGREDLTLGAGEAVLVG
>P9WQ23 3.2.1.141~~~treZ~~~Malto-oligosyltrehalose trehalohydrolase~~~COG0296
MPEFRVWAPKPALVRLDVNGAVHAMTRSADGWWHTTVAAPADARYGYLLDDDPTVLPDPRSARQPDGVHARSQRWEPPGQ
FGAARTDTGWPGRSVEGAVIYELHIGTFTTAGTFDAAIEKLDYLVDLGIDFVELMPVNSFAGTRGWGYDGVLWYSVHEPY
GGPDGLVRFIDACHARRLGVLIDAVFNHLGPSGNYLPRFGPYLSSASNPWGDGINIAGADSDEVRHYIIDCALRWMRDFH
ADGLRLDAVHALVDTTAVHVLEELANATRWLSGQLGRPLSLIAETDRNDPRLITRPSHGGYGITAQWNDDIHHAIHTAVS
GERQGYYADFGSLATLAYTLRNGYFHAGTYSSFRRRRHGRALDTSAIPATRLLAYTCTHDQVGNRALGDRPSQYLTGGQL
AIKAALTLGSPYTAMLFMGEEWGASSPFQFFCSHPEPELAHSTVAGRKEEFAEHGWAADDIPDPQDPQTFQRCKLNWAEA
GSGEHARLHRFYRDLIALRHNEADLADPWLDHLMVDYDEQQRWVVMRRGQLMIACNLGAEPTCVPVSGELVLAWESPIIG
DNSTELAAYSLAILRAAEPA
>P07676 ~~~trfA~~~Plasmid replication initiator protein TrfA~~~
MNRTFDRKAYRQELIDAGFSAEDAETIASRTVMRAPRETFQSVGSMVQQATAKIERDSVQLAPPALPAPSAAVERSRRLE
QEAAGLAKSMTIDTRGTMTTKKRKTAGEDLAKQVSEAKQAALLKHTKQQIKEMQLSLFDIAPWPDTMRAMPNDTARSALF
TTRNKKIPREALQNKVIFHVNKDVKITYTGVELRADDDELVWQQVLEYAKRTPIGEPITFTFYELCQDLGWSINGRYYTK
AEECLSRLQATAMGFTSDRVGHLESVSLLHRFRVLDRGKKTSRCQVLIDEEIVVLFAGDHYTKFIWEKYRKLSPTARRMF
DYFSSHREPYPLKLETFRLMCGSDSTRVKKWREQVGEACEELRGSGLVEHAWVNDDLVHCKR
>P0A593 ~~~glbN~~~Group 1 truncated hemoglobin GlbN~~~
MGLLSRLRKREPISIYDKIGGHEAIEVVVEDFYVRVLADDQLSAFFSGTNMSRLKGKQVEFFAAALGGPEPYTGAPMKQV
HQGRGITMHHFSLVAGHLADALTAAGVPSETITEILGVIAPLAVDVTSGESTTAPV
>P9WN25 ~~~glbN~~~Group 1 truncated hemoglobin GlbN~~~COG2346
MGLLSRLRKREPISIYDKIGGHEAIEVVVEDFYVRVLADDQLSAFFSGTNMSRLKGKQVEFFAAALGGPEPYTGAPMKQV
HQGRGITMHHFSLVAGHLADALTAAGVPSETITEILGVIAPLAVDVTSGESTTAPV
>P73925 ~~~glbN~~~Group 1 truncated hemoglobin GlbN~~~COG2346
MSTLYEKLGGTTAVDLAVDKFYERVLQDDRIKHFFADVDMAKQRAHQKAFLTYAFGGTDKYDGRYMREAHKELVENHGLN
GEHFDAVAEDLLATLKEMGVPEDLIAEVAAVAGAPAHKRDVLNQ
>O31607 ~~~yjbI~~~Group 2 truncated hemoglobin YjbI~~~COG2346
MGQSFNAPYEAIGEELLSQLVDTFYERVASHPLLKPIFPSDLTETARKQKQFLTQYLGGPPLYTEEHGHPMLRARHLPFP
ITNERADAWLSCMKDAMDHVGLEGEIREFLFGRLELTARHMVNQTEAEDRSS
>P9WN23 ~~~glbO~~~Group 2 truncated hemoglobin GlbO~~~COG2346
MPKSFYDAVGGAKTFDAIVSRFYAQVAEDEVLRRVYPEDDLAGAEERLRMFLEQYWGGPRTYSEQRGHPRLRMRHAPFRI
SLIERDAWLRCMHTAVASIDSETLDDEHRRELLDYLEMAAHSLVNSPF
>Q0PB48 ~~~ctb~~~Group 3 truncated hemoglobin ctb~~~COG2346
MKFETINQESIAKLMEIFYEKVRKDKDLGPIFNNAIGTSDEEWKEHKAKIGNFWAGMLLGEGDYNGQPLKKHLDLPPFPQ
EFFEIWLKLFEESLNIVYNEEMKNVILQRAQMIASHFQNMLYKYGGH
>P24188 1.14.-.-~~~trhO~~~tRNA uridine(34) hydroxylase~~~COG1054
MPVLHNRISNDALKAKMLAESEPRTTISFYKYFHIADPKATRDALYQLFTALNVFGRVYLAHEGINAQISVPASNVETFR
AQLYAFDPALEGLRLNIALDDDGKSFWVLRMKVRDRIVADGIDDPHFDASNVGEYLQAAEVNAMLDDPDALFIDMRNHYE
YEVGHFENALEIPADTFREQLPKAVEMMQAHKDKKIVMYCTGGIRCEKASAWMKHNGFNKVWHIEGGIIEYARKAREQGL
PVRFIGKNFVFDERMGERISDEIIAHCHQCGAPCDSHTNCKNDGCHLLFIQCPVCAEKYKGCCSEICCEESALPPEEQRR
RRAGRENGNKIFNKSRGRLNTTLCIPDPTE
>Q5ZRP2 1.14.-.-~~~trhO~~~tRNA uridine(34) hydroxylase~~~COG1054
MKDIIIASFYKFIPLNDFESLREPILTKMHEIGIKGTIILAHEGVNGGFAGNREQMNVFYDYLRSDSRFADLHFKETYDN
KNPFDKAKVKLRKEIVTMGVQKVDPSYNAGTYLSPEEWHQFIQDPNVILLDTRNDYEYELGTFKNAINPDIENFREFPDY
VQRNLIDKKDKKIAMFCTGGIRCEKTTAYMKELGFEHVYQLHDGILNYLESIPESESLWEGKCFVFDDRVAVDQKLDRVY
PQLPQDYKYEREQK
>P67330 1.14.-.-~~~trhO~~~tRNA uridine(34) hydroxylase~~~COG1054
MAKDIRVLLYYLYTPIENAEQFAADHLAFCKSIGLKGRILVADEGINGTVSGDYETTQKYMDYVHSLPGMEELWFKIDEE
NEQAFKKMFVRYKKEIVHLGLEDNDFDNDINPLETTGAYLSPKEFKEALLDKDTVVLDTRNDYEYDLGHFRGAIRPDIRN
FRELPQWVRDNKEKFMDKRVVVYCTGGVRCEKFSGWMVREGYKDVGQLHGGIATYGKDPEVQGELWDGKMYVFDERIAVD
VNHVNPTIVGKDWFDGTPCERYVNCGNPFCNRRILTSEENEDKYLRGCSHECRVHPRNRYVSKNELTQAEVIERLAAIGE
SLDQAATV
>P76403 3.4.-.-~~~trhP~~~tRNA hydroxylation protein P~~~COG0826
MFKPELLSPAGTLKNMRYAFAYGADAVYAGQPRYSLRVRNNEFNHENLQLGINEAHALGKKFYVVVNIAPHNAKLKTFIR
DLKPVVEMGPDALIMSDPGLIMLVREHFPEMPIHLSVQANAVNWATVKFWQQMGLTRVILSRELSLEEIEEIRNQVPDME
IEIFVHGALCMAYSGRCLLSGYINKRDPNQGTCTNACRWEYNVQEGKEDDVGNIVHKYEPIPVQNVEPTLGIGAPTDKVF
MIEEAQRPGEYMTAFEDEHGTYIMNSKDLRAIAHVERLTKMGVHSLKIEGRTKSFYYCARTAQVYRKAIDDAAAGKPFDT
SLLETLEGLAHRGYTEGFLRRHTHDDYQNYEYGYSVSDRQQFVGEFTGERKGDLAAVAVKNKFSVGDSLELMTPQGNINF
TLEHMENAKGEAMPIAPGDGYTVWLPVPQDLELNYALLMRNFSGETTRNPHGK
>B0KTG8 3.2.2.19~~~tri1~~~ADP-ribosylarginine hydrolase Tri1~~~COG1397
MIDLRSPNALLSDYVERYAHLSPEPSRQLQQRMDYNVRADAPAEPASKPRWLQSRACTLTPEQALDRAKGALLGLAIGDA
VGTTLEFLPRDREHVNDMVGGGPFRLAAGEWTDDTSMALCLADTYVSQGKFDYATYANALVRWYRHGENSVNGRCFDIGN
ATRNALEGWLREGIGWQGNYDPSTAGNGSIIRLAPTAIFRRHSLSASWWESVTQSSVTHNADEAVNCCQLLAAQLHLALN
GADKEETLAPAVRSLRPRPMIINAGEYKQKSRDQIRSSGYVVDTLEAALWAVWNSNNFHDAILLAANLADDADSVAATAG
QLAGALYGVSGMPPEWVEKVAWSQHIQKLAQELFDRAPQVDELDALLYGKR
>A8GG79 3.2.2.19~~~tri1~~~ADP-ribosylarginine hydrolase Tri1~~~COG1397
MIDLREDTWTLQLYAQRYKGLSPKNSRELQLRMEYDPLKPNLPTSGEEQNSKPEWLNTPPCLIPESESLDKAKGALVGLA
IGDAIGTTLEFLPRDKLHVNDMVGGGPFRLQPGEWTDDTSMALCLAESYISAGRLDITLFREKLVRWYRHGENSSNGRCF
DIGNTTRNALEQYLKHGASWFGNTEPETAGNAAIIRQAPTSIFRRKSLQRTFADSDSQSMATHCAPESMASCQFLGFILN
YLINGSSREKAFSPHVMPLPVRVLLINAGEYKEKKRDEIRSSGYVIDTLEAAMWAVWNTDNFHDAILLAANLGDDADSVA
ATTGQIAGALYGYSNIPKPWLDKLVQQERISNLAEQLFYMAPEEDF
>Q9RDE2 3.4.21.-~~~tri1~~~Tricorn protease homolog 1~~~COG0793
MGVTQPAAPAYLRFPHPHGELVAFTAEDDVWLAPLDGGRAWRVSADNVPVNHPRISPDGTKVAWTSTRDGAPEVHVAPVE
GGPAKRLTHWGSIRTQVRGWTADGRVLALSTYGQASLRRSWARALPLDGGPATTLPYGPVGDVAQGPHTVLLSAPMGREA
AWWKRYRGGTAGKLWIDREDDGEFVRLHDGLDGNIEYPFWVGDRIAFLSDHEGTGALYSSLADGSDLRRHTPVDGFYARH
AATDGSRVVYASAGELWTLDDLDGAEPRRLDIRLGGARVDLQSYPVNAARWFGSASPDHTARGSAVAVRGGVHWVTHRSG
PARALAATPGVRNRLPRTFRVDGEEWVVWVTDAEGDDALEFAPATGLAPGATARRLAAGQLGRVLHLAVAPDGSRVAVAS
HDGRVLLVERESGEVREVDRSEDGDASGLVFSPDSSWLAWSHPGPEPLRQLKLANTTDLSVSEATPLRFKDYSPAFTLDG
KHLAFLSTRSFDPVYDEHVFDLAFVEGARPYLITLAATTPSPFGPQRHGRPFETPDREETPDSEGTPTTRIDIEGLADRI
VPFPVEAARYSRLRAAKDGVLWLRHPLTGVLGASRANPEDPDPNTELERYDLAQQRVEHLGGDADHFEVSGDGKRVLLWT
DGRLKVVPSDRRASGDEDSDTNITVDLGRVRQTVEPAAEWRQMFDETGRIMRDHYWRADMNGVDWDGVLDRYRPVLDRVA
THDDLVDLLWEVHGELGTSHAYVTPRGGHGSGARQGLLGADLSRHEDGAWRIDRVLPSETSDPDARSPLAAPGVAVRAGD
AIVAVAGQAVDPVTGPGPLLVGTAGKPVELTVSPSGGGEVRHAVVVPLADEEPLRYHAWVADRRAYVHEKSGGRLGYLHV
PDMQAPGWAQIHRDLRVEVAREGLVVDVRENRGGHTSQLVVEKLARRIVGWDLPRGMRPTSYPQDAPRGPVVAVANEFSG
SDGDIVNAAIKALGIGPVVGVRTWGGVIGIDSRYRLVDGTLITQPKYAFWLEGYGWGVENHGVDPDVEVPQRPQDHAAGR
DPQLDEAIALALAALEETPAKTPPSLP
>Q9EYU0 3.5.4.45~~~triA~~~Melamine deaminase~~~
MQTLSIQHGTLVTMDQYRRVLGDSWVHVQDGRIVALGVHAESVPPPADRVIDARGKVVLPGFINAHTHVNQILLRGGPSH
GRQLYDWLFNVLYPGQKAMRPEDVAVAVRLYCAEAVRSGITTINDNADSAIYPGNIEAAMAVYGEVGVRVVYARMFFDRM
DGRIQGYVDALKARSPQVELCSIMEETAVAKDRITALSDQYHGTAGGRISVWPAPAITPAVTVEGMRWAQAFARDRAVMW
TLHMAESDHDERLHWMSPAEYMECYGLLDERLQVAHCVYFDRKDVRLLHRHNVKVASQVVSNAYLGSGVAPVPEMVERGM
AVGIGTDDGNCNDSVNMIGDMKFMAHIHRAVHRDADVLTPEKILEMATIDGARSLGMDHEIGSIETGKRADLILLDLRHP
QTTPHHHLAATIVFQAYGNEVDTVLIDGNVVMENRRLSFLPPERELAFLEEAQSRATAILQRANMVANPAWRSL
>P0AGI8 ~~~trkA~~~Trk system potassium uptake protein TrkA~~~COG0569
MKIIILGAGQVGGTLAENLVGENNDITVVDTNGERLRTLQDKFDLRVVQGHGSHPRVLREAGADDADMLVAVTSSDETNM
VACQVAYSLFNTPNRIARIRSPDYVRDADKLFHSDAVPIDHLIAPEQLVIDNIYRLIEYPGALQVVNFAEGKVSLAVVKA
YYGGPLIGNALSTMREHMPHIDTRVAAIFRHDRPIRPQGSTIVEAGDEVFFIAASQHIRAVMSELQRLEKPYKRIMLVGG
GNIGAGLARRLEKDYSVKLIERNQQRAAELAEKLQNTIVFFGDASDQELLAEEHIDQVDLFIAVTNDDEANIMSAMLAKR
MGAKKVMVLIQRRAYVDLVQGSVIDIAISPQQATISALLSHVRKADIVGVSSLRRGVAEAIEAVAHGDESTSRVVGRVID
EIKLPPGTIIGAVVRGNDVMIANDNLRIEQGDHVIMFLTDKKFITDVERLFQPSPFFL
>P9WFZ3 ~~~trkA~~~Trk system potassium uptake protein TrkA~~~COG0569
MKVAVAGAGAVGRSVTRELVENGHDITLIERNPDHLDAAAIPEAHWRLGDACELSLLESIHLEEFDVVVAATGDDKVNVV
LSLLAKTEFAVPRVVARVNDPRNEWLFNDAWGVDVAVSTPRMLASLIEEAVTIGDLVRLMEFRTGQANLVEITLPDNTPW
GGKPVRKLQLPRDAALVTILRGPRVIVPEADEPLEGGDELLFVAVTEAEEELSRLLLPSM
>P23849 ~~~trkG~~~Trk system potassium uptake protein TrkG~~~COG0168
MNTSHVRVVTHMCGFLVWLYSLSMLPPMVVALFYKEKSLFVFFITFVIFFCIGGGAWYTTKKSGIQLRTRDGFIIIVMFW
ILFSVISAFPLWIDSELNLTFIDALFEGVSGITTTGATVIDDVSSLPRAYLYYRSQLNFIGGLGVIVLAVAVLPLLGIGG
AKLYQSEMPGPFKDDKLTPRLADTSRTLWITYSLLGIACIVCYRLAGMPLFDAICHGISTVSLGGFSTHSESIGYFNNYL
VELVAGSFSLLSAFNFTLWYIVISRKTIKPLIRDIELRFFLLIALGVIIVTSFQVWHIGMYDLHGSFIHSFFLASSMLTD
NGLATQDYASWPTHTIVFLLLSSFFGGCIGSTCGGIKSLRFLILFKQSKHEINQLSHPRALLSVNVGGKIVTDRVMRSVW
SFFFLYTLFTVFFILVLNGMGYDFLTSFATVAACINNMGLGFGATASSFGVLNDIAKCLMCIAMILGRLEIYPVIILFSG
FFWRS
>P0AFZ7 ~~~trkH~~~Trk system potassium uptake protein TrkH~~~COG0168
MHFRAITRIVGLLVILFSGTMIIPGLVALIYRDGAGRAFTQTFFVALAIGSMLWWPNRKEKGELKSREGFLIVVLFWTVL
GSVGALPFIFSESPNLTITDAFFESFSGLTTTGATTLVGLDSLPHAILFYRQMLQWFGGMGIIVLAVAILPILGVGGMQL
YRAEMPGPLKDNKMRPRIAETAKTLWLIYVLLTVACALALWFAGMDAFDAIGHSFATIAIGGFSTHDASIGYFDSPTINT
IIAIFLLISGCNYGLHFSLLSGRSLKVYWRDPEFRMFIGVQFTLVVICTLVLWFHNVYSSALMTINQAFFQVVSMATTAG
FTTDSIARWPLFLPVLLLCSAFIGGCAGSTGGGLKVIRILLLFKQGNRELKRLVHPNAVYSIKLGNRALPERILEAVWGF
FSAYALVFIVSMLAIIATGVDDFSAFASVVATLNNLGPGLGVVADNFTSMNPVAKWILIANMLFGRLEVFTLLVLFTPTF
WRE
>E1V6C5 ~~~trkH~~~Trk system potassium uptake protein TrkH~~~COG0168
MSLRMILRILGLLLMMFSLTMVPPILISLLFADGMWQAFVVALGITVGTGALMYLPNRHARKELRTRDGFLIAALFWSVL
GLFGSLPLMLTGAAALSPTDAVFESFSGLTTTGATVITGIDLLPEAILYYRQQLQWLGGMGIVVLAVAILPTLGVGGMAL
YRTEIPGPLKDSKLTPRITETAKALWYIYATLTVTCALAYMAAGMNWFDALGHSFSTVAIGGFSTHDASIGYFDSAAIEL
ICSAFLLISAFSFSLHFVAWRERRLTHYFQDPEARFLMLFLAGLIIITSVSLWLTSDYETLQGLRHAVFEVVSIATTAGF
SVADFSTWPGALPFLLFVAAFVGGCSGSTGGGMKVIRIILILKQGMREVMRLIHPSAVIAVKIGKVSVPDGIAQAVWGFF
SAYVLLFFLMLVGVMATGVDQVTAWSTVGATLNNLGPALGEASAHYGDLPSLAKWILVVAMLLGRLEIFTVVVLFTPAFW
RK
>Q87TN7 ~~~trkH~~~Trk system potassium uptake protein TrkH~~~COG0168
MQFRSIIRIVGLLLALFSVTMLAPALVALLYRDGAGVPFVTTFFVLLFCGAMCWFPNRRHKHELKSRDGFLIVVLFWTVL
GSAGSLPFLIADNPNISVTDAFFESFSALTTTGATVIVGLDELPKAILFYRQFLQWFGGMGIIVLAVAILPVLGIGGMQL
YRAEIPGPVKDTKMTPRIAETAKALWYIYLSLTIACAVAFWLAGMTPFDAISHSFSTIAIGGFSTHDASMGYFDSYAINL
ITVVFLLISACNFTLHFAAFASGGVHPKYYWKDPEFRAFIFIQVLLFLVCFLLLLKHHSYTSPYDAFDQALFQTVSISTT
AGFTTTGFADWPLFLPVLLLFSSFIGGCAGSTGGGMKVIRILLLTLQGARELKRLVHPRAVYTIKVGGSALPQRVVDAVW
GFFSAYALVFVVCMLGLIATGMDELSAFSAVAATLNNLGPGLGEVALHFGDVNDKAKWVLIVSMLFGRLEIFTLLILLTP
TFWRS
>E1V6K4 ~~~trkI~~~Trk system potassium uptake protein TrkI~~~COG0168
MADTRRVTDTLRRWAPILKVLAVLWLVLAIFMAIPLLVLIVESEPDALAFGLSIAIVLAAATLSWIVTWRIPVSLKPWQM
FVLTTLSWVTISSFASLPLVLGAPQLSLTNAVFESVSAITTTGSTILVHIEDLSDGLKLWRGIMQWLGGIGIIVMGIAIL
PFLKVGGMRLFHTESSDWSDKVMPRTGGIAKATLSIYCGFTLLAAMAYYLGGMSPLDAVVHAMTSLATGGFANSDASFGA
YAEQPQLLWMGSLFMLCGALPFVLYIRFLRGSRMALLRDQQVQGLLLLLLLVILALTIWRVSQGTPAFTSLTQVTFNVVS
VVTTTGYASDDYSAWGATAYVAFFYLTFVGGCSGSTSGGMKIFRFQVAMLLLRDQLRYLIHASGVFVSRYNNQPLTDDIT
RGVVAFSFFFFLTVAGLALGLSLLGLDFTTALSGAATAVANVGPGLGETIGPAGNFAPLPDAAKWLLCVGMLMGRLEILT
VLVLLTPMFWRQ
>O53193 ~~~~~~Thioredoxin-like reductase Rv2466c~~~COG2761
MLEKAPQKSVADFWFDPLCPWCWITSRWILEVAKVRDIEVNFHVMSLAILNENRDDLPEQYREGMARAWGPVRVAIAAEQ
AHGAKVLDPLYTAMGNRIHNQGNHELDEVITQSLADAGLPAELAKAATSDAYDNALRKSHHAGMDAVGEDVGTPTIHVNG
VAFFGPVLSKIPRGEEAGKLWDASVTFASYPHFFELKRTRTEPPQFD
>O67010 2.1.1.215~~~trm1~~~tRNA (guanine(26)-N(2)/guanine(27)-N(2))-dimethyltransferase~~~COG1867
MEIVQEGIAKIIVPEIPKTVSSDMPVFYNPRMRVNRDLAVLGLEYLCKKLGRPVKVADPLSASGIRAIRFLLETSCVEKA
YANDISSKAIEIMKENFKLNNIPEDRYEIHGMEANFFLRKEWGFGFDYVDLDPFGTPVPFIESVALSMKRGGILSLTATD
TAPLSGTYPKTCMRRYMARPLRNEFKHEVGIRILIKKVIELAAQYDIAMIPIFAYSHLHYFKLFFVKERGVEKVDKLIEQ
FGYIQYCFNCMNREVVTDLYKFKEKCPHCGSKFHIGGPLWIGKLWDEEFTNFLYEEAQKREEIEKETKRILKLIKEESQL
QTVGFYVLSKLAEKVKLPAQPPIRIAVKFFNGVRTHFVGDGFRTNLSFEEVMKKMEELKEKQKEFLEKKKQG
>P23003 2.1.1.-~~~trmA~~~tRNA/tmRNA (uracil-C(5))-methyltransferase~~~COG2265
MTPEHLPTEQYEAQLAEKVVRLQSMMAPFSDLVPEVFRSPVSHYRMRAEFRIWHDGDDLYHIIFDQQTKSRIRVDSFPAA
SELINQLMTAMIAGVRNNPVLRHKLFQIDYLTTLSNQAVVSLLYHKKLDDEWRQEAEALRDALRAQNLNVHLIGRATKTK
IELDQDYIDERLPVAGKEMIYRQVENSFTQPNAAMNIQMLEWALDVTKGSKGDLLELYCGNGNFSLALARNFDRVLATEI
AKPSVAAAQYNIAANHIDNVQIIRMAAEEFTQAMNGVREFNRLQGIDLKSYQCETIFVDPPRSGLDSETEKMVQAYPRIL
YISCNPETLCKNLETLSQTHKVERLALFDQFPYTHHMECGVLLTAK
>P22038 2.1.1.-~~~trmA~~~tRNA/tmRNA (uracil-C(5))-methyltransferase~~~
MTPEHLPTEQYEAQLAEKVARLQSMMAPFSGLVPEVFRSPVSHYRMRAEFRLWHDGDDLYHIMFDQQTKSRIRVDTFPAA
SQLINTLMKAMIAGVRDNHALRHKLFQIDYLTTLSNQAVVSLLYHKKLDEEWREAATALRDALRAQGLNVHLIGRATKTK
IELDQDYIDERLPVAGKEMIYRQVENSFTQPNAAMNIQMLEWALEVTKDSKGDLLELYCGNGNFSLALARNFNRVLATEI
AKPSVAAAQYNIAANHIDNVQIIRMAAEEFTQAMNGVREFNRLQGIDLKRYQCETIFVDPPRSGLDSETEKMVQAYPRIL
YISCNPETLCKNLETLSQTHTVSRLALFDQFPYTHHMECGVLLTAR
>O66479 2.1.1.33~~~trmB~~~tRNA (guanine-N(7)-)-methyltransferase~~~COG0220
MLCYVNYKRVKRPVEIPNLEVEIGFGRGDFIVKLAKENPDKNFFGIEISQISIEKLMKRVGKKGLKNVYCTNVDAYWGFY
FLFRDNYVENIYMNYPDPWFKKRHHKRRLTKPERLYMFAKKLKLGGEIRIRTDNYEFLEFTKESAKVLDCFEVEEGTLNV
KEPLTKYEQKWLSMGKTLYKLILRKVKEPKFVEHPEVEEVRELFPVKVKVESVDPKKIESREIKLDEEVYFKTFKVWQRD
KDFLVECLLSEKGYLQKFFIQIKRKEDGYVIDVSPYSEVLRTRNLQRSIQTVAQLLS
>O34522 2.1.1.33~~~trmB~~~tRNA (guanine-N(7)-)-methyltransferase~~~COG0220
MRMRHKPWADDFLAENADIAISNPADYKGKWNTVFGNDNPIHIEVGTGKGQFISGMAKQNPDINYIGIELFKSVIVTAVQ
KVKDSEAQNVKLLNIDADTLTDVFEPGEVKRVYLNFSDPWPKKRHEKRRLTYSHFLKKYEEVMGKGGSIHFKTDNRGLFE
YSLKSFSEYGLLLTYVSLDLHNSNLEGNIMTEYEEKFSALGQPIYRAEVEWRT
>P0A8I5 2.1.1.33~~~trmB~~~tRNA (guanine-N(7)-)-methyltransferase~~~COG0220
MKNDVISPEFDENGRPLRRIRSFVRRQGRLTKGQEHALENYWPVMGVEFSEDMLDFPALFGREAPVTLEIGFGMGASLVA
MAKDRPEQDFLGIEVHSPGVGACLASAHEEGLSNLRVMCHDAVEVLHKMIPDNSLRMVQLFFPDPWHKARHNKRRIVQVP
FAELVKSKLQLGGVFHMATDWEPYAEHMLEVMSSIDGYKNLSESNDYVPRPASRPVTKFEQRGHRLGHGVWDLMFERVK
>P44648 2.1.1.33~~~trmB~~~tRNA (guanine-N(7)-)-methyltransferase~~~COG0220
MTQTFADQKRKTVETAEFTEDGRYKRKVRSFVLRTGRLSEFQRNMMNDNWGTLGLDYQTEPFDFAKIYGNDNPVVLEIGF
GMGKSLVDMAFANPDKNYLGIEVHTPGVGACIAYAVEKGVTNLRVICHDATEILRDSIADGALGGLQLFFPDPWHKAKHH
KRRIVQPHFVTQVIQKLGENGFIHMATDWENYAEQMLEVLSANTDLVNTSKNGDYIPRPDFRPLTKFEARGYKLGHGVWD
LYFVKK
>P9WFY9 2.1.1.33~~~trmB~~~tRNA (guanine-N(7)-)-methyltransferase~~~COG0220
MVHHGQMHAQPGVGLRPDTPVASGQLPSTSIRSRRSGISKAQRETWERLWPELGLLALPQSPRGTPVDTRAWFGRDAPVV
LEIGSGSGTSTLAMAKAEPHVDVIAVDVYRRGLAQLLCAIDKVGSDGINIRLILGNAVDVLQHLIAPDSLCGVRVFFPDP
WPKARHHKRRLLQPATMALIADRLVPSGVLHAATDHPGYAEHIAAAGDAEPRLVRVDPDTELLPISVVRPATKYERKAQL
GGGAVIELLWKKHGCSERDLKIR
>P67506 2.1.1.33~~~trmB~~~tRNA (guanine-N(7)-)-methyltransferase~~~COG0220
MRVRNRKGATELLEANPQYVVLNPLEAKAKWRDLFGNDNPIHVEVGSGKGAFVSGMAKQNPDINYIGIDIQKSVLSYALD
KVLEVGVPNIKLLWVDGSDLTDYFEDGEIDRLYLNFSDPWPKKRHEKRRLTYKTFLDTFKRILPENGEIHFKTDNRGLFE
YSLVSFSQYGMKLNGVWLDLHASDFEGNVMTEYEQKFSNKGQVIYRVEAEF
>B0V8J1 2.1.1.228~~~trmD~~~tRNA (guanine-N(1)-)-methyltransferase~~~
MFFAVITLFPEMFDAITAYGISGRAAKRDIVQVTCINPRDFAEGNYRRVDERPFGGGPGMVMMAEPLAKAINHAKQLASR
AGCVHVPVVYMSPQGKTLNEQAVQQFVDYDGLIVLCGRYEGVDERLIQHYVDQEWSIGDYVLSGGELPAMVLLDSIIRRL
PNVMSDEQSAIQDSFVDGLLDCPQYTKPDQFEGLDVPEILKSGHHANIEKWRFLQRYQRTLERRPELIEQVTLTKQQKKW
LSDEQG
>Q2GIL5 2.1.1.228~~~trmD~~~tRNA (guanine-N(1)-)-methyltransferase~~~COG0336
MIFNVLTIFPQMFPGPLGVSNLGSALKKGLWTLNVFDIRAFANNKHNTVDDTPYGGGPGMLLRADVLGRCIDEVLSLHPN
TKLMFTSPRGVSFTQDIARQTMNFDNITLLCGRFEGIDERVVDFYKLQEVSIGDYVLSGGELAAMVIIDTCVRMVPGVIG
NAESLKQESMEGSLEYPQYTRPASWKGMEVPEVLLTGNHGEIEKWRRNASLSITAARRPDLLKDRYGENDVE
>O67463 2.1.1.228~~~trmD~~~tRNA (guanine-N(1)-)-methyltransferase~~~COG0336
MSSNPLRFFVLTIFPHIISCYSEYGIVKQAIKKGKVEVYPIDLREFAPKGQVDDVPYGGLPGMVLKPEPIYEAYDYVVEN
YGKPFVLITEPWGEKLNQKLVNELSKKERIMIICGRYEGVDERVKKIVDMEISLGDFILSGGEIVALAVIDAVSRVLPGV
LSEPQSIQEDSFQNRWLGYPVYTRPREYRGMKVPEELLSGHHKLIELWKLWHRIENTVKKRPDLIPKDLTELEKDILNSI
LSGKSFKEWLKEHKHLL
>Q6G1R9 2.1.1.228~~~trmD~~~tRNA (guanine-N(1)-)-methyltransferase~~~COG0336
MKFQARVLTLYPEMFPGFLGCSLAGQALKQGIWSLETVQIRDFALDKHHSVDDTPAGGGAGMVMRADVLAAALDSCPNDS
PRLLMSPRGRLLNQAYARSLARSSGVTLVCGRFEGVDERIIEARELEEVSIGDYILSGGETAALVLLDAIVRLLPGVMGN
EISAKCESFENGLLEHPQYTRPAVFEGRGIPPVLTSGHHKAIANWRQQQAESLTRQRRPDLYALYNKNRQKT
>Q6NGI5 2.1.1.228~~~trmD~~~tRNA (guanine-N(1)-)-methyltransferase~~~
MRLDVITIFPEYLDPLRHALLGKAIEKDLLSVGVHDLRLWAEDAHKSVDDSPFGGGPGMVMKPTVWGPALDDVATMSGKA
HMGAQLDSARVHVDKPRHDELEGIQFAGYDAAEVAEADKPLLLVPTPAGAPFTQEDARAWSNEEHIVFACGRYEGIDQRV
IEDAKKTYRVREVSIGDYVLIGGEVAVLVIAEAVVRLIPGVLGNTQSHQDDSFSDGLLEGPSYTKPREWRGLEVPEVLTS
GNHAKIERWRREQSLKRTWEVRPELLDGMELDRHDQAYVEGLRRGNTSDNLN
>Q72DU3 2.1.1.228~~~trmD~~~tRNA (guanine-N(1)-)-methyltransferase~~~COG0336
MRCTILTLFPEFFDSPLDAGLMGKARESGLIDVALVNPRAYTTDRHSTVDDRPYGGGPGMVMRVEPWEKALQGIEEPGRI
LMMAPKGRPFTQAMARELAQEESLTILCGRYEGFDARLEEIYPIEAVSMGDFVLNGGETAALAVLEAVSRLVPGFMGKEE
SGTEESFSAGLLEYPHYTRPEDYAGHVVPEVLRSGDHGRIAAWRKECSLRLTLSQRPDILPEAQLDEADMDFLRGLSRNR
PGRNLYCALVHYPVVLKEKNSGATSLTNLDIHDIGRSSCTYGLGGFYVTTPLEDQRRLLDTLLRHWTLGPGSRSNPDRAE
ALGRIKGVDDVRAAIEDIARRTGQVPYVVGTSAKGAGNATPASVRAMLEERPVLLVFGTGHGLAPEVLEGCDAILRPLRW
MDGYNHLSVRAAAAIIMDRLLGDCY
>P0A873 2.1.1.228~~~trmD~~~tRNA (guanine-N(1)-)-methyltransferase~~~COG0336
MWIGIISLFPEMFRAITDYGVTGRAVKNGLLSIQSWSPRDFTHDRHRTVDDRPYGGGPGMLMMVQPLRDAIHAAKAAAGE
GAKVIYLSPQGRKLDQAGVSELATNQKLILVCGRYEGIDERVIQTEIDEEWSIGDYVLSGGELPAMTLIDSVSRFIPGVL
GHEASATEDSFAEGLLDCPHYTRPEVLEGMEVPPVLLSGNHAEIRRWRLKQSLGRTWLRRPELLENLALTEEQARLLAEF
KTEHAQQQHKHDGMA
>P43912 2.1.1.228~~~trmD~~~tRNA (guanine-N(1)-)-methyltransferase~~~COG0336
MWIGVISLFPEMFKAITEFGVTGRAVKHNLLKVECWNPRDFTFDKHKTVDDRPYGGGPGMLMMVQPLRDAIHTAKAAAGE
GAKVIYLSPQGRKLDQGGVTELAQNQKLILVCGRYEGIDERLIQTEIDEEWSIGDYVLTGGELPAMTLIDAVARFIPGVL
GKQASAEEDSFADGLLDCPHYTRPEVLEGLTVPPVLMSGHHEEIRKWRLKQSLQRTWLRRPELLEGLALTDEQRKLLKEA
QAEHNS
>P9WFY7 2.1.1.228~~~trmD~~~tRNA (guanine-N(1)-)-methyltransferase~~~COG0336
MRIDIVTIFPACLDPLRQSLPGKAIESGLVDLNVHDLRRWTHDVHHSVDDAPYGGGPGMVMKAPVWGEALDEICSSETLL
IVPTPAGVLFTQATAQRWTTESHLVFACGRYEGIDQRVVQDAARRMRVEEVSIGDYVLPGGESAAVVMVEAVLRLLAGVL
GNPASHQDDSHSTGLDGLLEGPSYTRPASWRGLDVPEVLLSGDHARIAAWRREVSLQRTRERRPDLSHPD
>B2JF31 2.1.1.228~~~trmD~~~tRNA (guanine-N(1)-)-methyltransferase~~~COG0336
MQFDIVTLFPDMFRALTDWGITSRAAKQERYGLRTWNPRDFTTDNYRTIDDRPYGGGPGMVMLARPLEDAINAAKAAQAE
QGIGGARVVMMSPQGATLNHDKVMRFAAEPGLILLCGRYEAIDQRLIDRVVDEEVSLGDFVLSGGELPAMALIDAVVRHL
PGVLNDAQSAVQDSFVDGLLDCPHYTRPEEYDGVRVPDVLLGGHHAEIEQWRRREALRNTWLKRPDLIVQARKNKLLSRA
DEAWLASLAKDASKH
>Q02RL6 2.1.1.228~~~trmD~~~tRNA (guanine-N(1)-)-methyltransferase~~~
MDKRLWVGVVSIFPEMFRAISDYGITSRAVKQGLLTLTCWNPRVYTEDRHQTVDDRPFGGGPGMVMKIKPLEGALADARQ
AAGGRKAKVIYLSPQGRQLTQAGVRELAEEEALILIAGRYEGIDERFIEEHVDEEWSIGDYVLSGGELPAMVLVDAVTRL
LPGALGHADSAEEDSFTDGLLDCPHYTRPEVYADKRVPEVLLSGNHEHIRRWRLQQALGRTWERRADLLDSRSLSGEEQK
LLAEYIRQRDDS
>Q9HXQ1 2.1.1.228~~~trmD~~~tRNA (guanine-N(1)-)-methyltransferase~~~
MDKRLWVGVVSIFPEMFRAISDYGITSRAVKQGLLTLTCWNPRDYTEDRHQTVDDRPFGGGPGMVMKIKPLEGALADARQ
AAGGRKAKVIYLSPQGRQLTQAGVRELAEEEALILIAGRYEGIDERFIEEHVDEEWSIGDYVLSGGELPAMVLVDAVTRL
LPGALGHADSAEEDSFTDGLLDCPHYTRPEVYADKRVPEVLLSGNHEHIRRWRLQQALGRTWERRADLLDSRSLSGEEQK
LLAEYIRQRDDS
>Q6GHJ5 2.1.1.228~~~trmD~~~tRNA (guanine-N(1)-)-methyltransferase~~~
MKIDYLTLFPEMFDGVLNHSIMKRAQENNKLQINTVNFRDYAINKHNQVDDYPYGGGQGMVLKPEPVFNAMEDLDVTEQA
RVILMCPQGEPFSHQKAVELSKADHIVFICGHYEGYDERIRTHLVTDEISMGDYVLTGGELPAMTMTDAIVRLIPGVLGN
EQSHQDDSFSDGLLEFPQYTRPREFKGLTVPDVLLSGNHANIDAWRHEQKLIRTYNKRPDLIEKYPLTNADKQILERYKI
GLKKG
>P39815 2.1.1.74~~~trmFO~~~Methylenetetrahydrofolate--tRNA-(uracil-5-)-methyltransferase TrmFO~~~COG1206
MNQQTVNVIGAGLAGSEAAWQLAKRGIQVKLYEMRPVKQTPAHHTDKFAELVCSNSLRSNTLANAVGVLKEEMRALDSAI
IAAADECSVPAGGALAVDRHEFAASVTNRVKNHPNVTVINEEVTEIPEGPTIIATGPLTSESLSAQLKELTGEDYLYFYD
AAAPIVEKDSLDMDKVYLKSRYDKGEAAYLNCPMTEEEFDRFHEALTSAETVPLKEFEKEIFFEGCMPIEVMAKRGKKTM
LFGPMKPVGLEHPVTGKRPYAVVQLRQDDAAGTLYNIVGFQTHLKWGDQKEVLKLIPGLENVEIVRYGVMHRNTFINSPS
LLKPTYQFKNRSDLFFAGQMTGVEGYVESAASGLVAGINAAKLVLGEELVIFPQETAIGSMAHYITTTNQKNFQPMNANF
GLLKELPVKIKNKKERNEQYANRAIETIQTISKTI
>Q9S449 2.1.1.74~~~trmFO~~~Methylenetetrahydrofolate--tRNA-(uracil-5-)-methyltransferase TrmFO~~~COG1206
MADQKQRVTVIGGGLAGTECAYQLSRRGVPVVLREMKPQKRSPAHKSDTLAELVCSNSLRSDNPESAIGLLHAELRALGS
LVLSAADANRVPAGDALAVERERFSAAITESLLRQPGVELVAGEVEQLPEDGPVVIATGPLTSDALTRELERHVGTRLYF
YDSIAPILSADSIDMNVAFRQSRYGKGGGDDYLNLPMTKDEYYRFIAEVKAGQKVVPHAFEEPKYFEGCLPIEVMAERGD
DTLAYGPMKPVGLRDPRTGQEPYAVVQLRMEDVGGTSWNMVGFQTRLTWGEQKRIFSSFIPGLQQAEFLRMGQIHRNTFI
DSPRLLAKDLSLKTEPRLYFAGQISGVEGYVESAACGYLVALALHARLTGTEFVPPPATTAMGALLRHVTGEAHPPDYPH
QPSNISFGIFSPLTGRMKKAEKRAAYSARAKQDLAAWLPHAGVPAAGAPEHVDQRSA
>P64235 2.1.1.74~~~trmFO~~~Methylenetetrahydrofolate--tRNA-(uracil-5-)-methyltransferase TrmFO~~~
MTQTVNVIGAGLAGSEAAYQLAERGIKVNLIEMRPVKQTPAHHTDKFAELVCSNSLRGNALTNGVGVLKEEMRRLNSIII
EAADKARVPAGGALAVDRHDFSGYITETLKNHENITVINEEINAIPDGYTIIATGPLTTETLAQEIVDITGKDQLYFYDA
AAPIIEKESIDMDKVYLKSRYDKGEAAYLNCPMTEDEFNRFYDAVLEAEVAPVNSFEKEKYFEGCMPFEVMAERGRKTLL
FGPMKPVGLEDPKTGKRPYAVVQLRQDDAAGTLYNIVGFQTHLKWGAQKEVIKLIPGLENVDIVRYGVMHRNTFINSPDV
LNEKYELISQPNIQFAGQMTGVEGYVESAASGLVAGINLAHKILGKGEVVFPRETMIGSMAYYISHAKNNKNFQPMNANF
GLLPSLETRIKDKKERYEAQANRALDYLENFKKTL
>Q9WZJ3 2.1.1.74~~~trmFO~~~Methylenetetrahydrofolate--tRNA-(uracil-5-)-methyltransferase TrmFO~~~COG1206
MIVNVIGAGLAGSEVAYNLGKRGIRVRLFEMRPKKMTEVHKTGYFAELVCSNSLKSEDITNAEGLLKAEMRLMGSITLEA
AEKARVPSGKALAVDRNIFAKEVTEVIERLESVEIIREEVTEFDPEEGIWVVATGPATSDGLLPFLKKLLGDDLLFFFDA
VSPIVTFESIDMECAFWGDRFGKGKDYINCPLTKEEYEEFWKALVEAEVIEMEDFDRKLLFERCQPIEEIARSGKDALRY
GPLRPTGLVDPRTGKEPYAVVQLRREDKEGRFYSLVGFQTRLKWSEQKRVLRKIPCLRNAEIVRYGVMHRNVYINSPKLL
DIFFRLKKHPNIFFAGQITGVEGYMESAASGIYVAYNVHRILKGLSPLKLPEETMMGALFSYIIEKVEGDLKPMYANFGL
LPPLKVRVKDKFEKRKKLAERAIETMKKFLEENPW
>Q5SID2 2.1.1.74~~~trmFO~~~Methylenetetrahydrofolate--tRNA-(uracil-5-)-methyltransferase TrmFO~~~COG1206
MERVNVVGAGLAGSEAAWTLLRLGVPVRLFEMRPKRMTPAHGTDRFAEIVCSNSLGGEGETNAKGLLQAEMRRAGSLVME
AADLARVPAGGALAVDREEFSGYITERLTGHPLLEVVREEVREIPPGITVLATGPLTSEALAEALKRRFGDHFLAYYDAA
SPIVLYESIDLTKCFRAGRYGQSADYLNCPMTEEEYRRFHQALLEAQRHTPHDWEKLEFFEACVPVEELARRGYQTLLFG
PMKPVGLVDPRTGKEPFAVVQLRQEDKAGRMWSLVGFQTGLKWPEQKRLIQMIPGLENAEIVRYGVMHRNTYLNAPRLLG
ETLEFREAEGLYAAGVLAGVEGYLESAATGFLAGLNAARKALGLPPVAPPEESMLGGLVRYLATANPEGFQPMYANWGLV
PPVEGRMGKKEKRQAMYRRGLEAFSAWLSGLNPPLPRPEAALV
>Q7A794 2.1.1.-~~~~~~Putative TrmH family tRNA/rRNA methyltransferase~~~
MEDTVIVGRHAVREAIITGHPINKILIQEGIKKQQINEILKNAKDQKIIVQTVPKSKLDFLANAPHQGVAALIAPYEYAD
FDQFLKQQKEKEGLSTVLILDGLEDPHNLGSILRTADATGVDGVIIPKRRSVTLTQTVAKASTGAIEHVPVIRVTNLAKT
IDELKDNGFWVAGTEANNATDYRNLEADMSLAIVIGSEGQGMSRLVSDKCDFYIKIPMVGHVNSLNASVAASLMMYEVFR
KRHDVGEI
>O67577 2.1.1.34~~~trmH~~~tRNA (guanosine(18)-2'-O)-methyltransferase~~~COG0566
MVMEYLVLEKRLKRLREVLEKRQKDLIVFADNVKNEHNFSAIVRTCDAVGVLYLYYYHAEGKKAKINEGITQGSHKWVFI
EKVDNPVQKLLEFKNRGFQIVATWLSKESVNFREVDYTKPTVLVVGNELQGVSPEIVEIADKKIVIPMYGMAQSLNVSVA
TGIILYEAQRQREEKGMYSRPSLSEEEIQKILKKWAYEDVIKERKRTLSTS
>P0AGJ2 2.1.1.34~~~trmH~~~tRNA (guanosine(18)-2'-O)-methyltransferase~~~COG0566
MNPTRYARICEMLARRQPDLTVCMEQVHKPHNVSAIIRTADAVGVHEVHAVWPGSRMRTMASAAAGSNSWVQVKTHRTIG
DAVAHLKGQGMQILATHLSDNAVDFREIDYTRPTCILMGQEKTGITQEALALADQDIIIPMIGMVQSLNVSVASALILYE
AQRQRQNAGMYLRENSMLPEAEQQRLLFEGGYPVLAKVAKRKGLPYPHVNQQGEIEADADWWATMQAAG
>Q72GI1 2.1.1.34~~~trmH~~~tRNA (guanosine(18)-2'-O)-methyltransferase~~~COG0566
MRERTEARRRRIEEVLRRRQPDLTVLLENVHKPHNLSAILRTCDAVGVLEAHAVNPTGGVPTFNETSGGSHKWVYLRVHP
DLHEAFRFLKERGFTVYATALREDARDFREVDYTKPTAVLFGAEKWGVSEEALALADGAIKIPMLGMVQSLNVSVAAAVI
LFEAQRQRLKAGLYDRPRLDPELYQKVLADWLRK
>Q5SM16 2.1.1.34~~~trmH~~~tRNA (guanosine(18)-2'-O)-methyltransferase~~~COG0566
MRERTEARRRRIEEVLRRRQPDLTVLLENVHKPHNLSAILRTCDAVGVLEAHAVNPTGGVPTFNETSGGSHKWVYLRVHP
DLHEAFRFLKERGFTVYATALREDARDFREVDYTKPTAVLFGAEKWGVSEEALALADGAIKIPMLGMVQSLNVSVAAAVI
LFEAQRQRLKAGLYDRPRLDPELYQKVLADWLRK
>P9WFZ1 2.1.1.220~~~trmI~~~tRNA (adenine(58)-N(1))-methyltransferase TrmI~~~COG2519
MSATGPFSIGERVQLTDAKGRRYTMSLTPGAEFHTHRGSIAHDAVIGLEQGSVVKSSNGALFLVLRPLLVDYVMSMPRGP
QVIYPKDAAQIVHEGDIFPGARVLEAGAGSGALTLSLLRAVGPAGQVISYEQRADHAEHARRNVSGCYGQPPDNWRLVVS
DLADSELPDGSVDRAVLDMLAPWEVLDAVSRLLVAGGVLMVYVATVTQLSRIVEALRAKQCWTEPRAWETLQRGWNVVGL
AVRPQHSMRGHTAFLVATRRLAPGAVAPAPLGRKREGRDG
>Q8GBB2 2.1.1.220~~~trmI~~~tRNA (adenine(58)-N(1))-methyltransferase TrmI~~~COG2519
MAWPGPLLLKDRKGRAYLVFPKEGGVFHHHKGSVPHEALLEAGPGGVVRTHLGEELSVHRPTLEEYLLHMKRSATPTYPK
DASAMVTLLDLAPGMRVLEAGTGSGGLTLFLARAVGEKGLVESYEARPHHLAQAERNVRAFWQVENVRFHLGKLEEAELE
EAAYDGVALDLMEPWKVLEKAALALKPDRFLVAYLPNITQVLELVRAAEAHPFRLERVLEVGWREWEVRLPVAHPRFQQV
GHTAFLVALRRWKAS
>P0AE01 2.1.1.200~~~trmJ~~~tRNA (cytidine/uridine-2'-O-)-methyltransferase TrmJ~~~COG0565
MLQNIRIVLVETSHTGNMGSVARAMKTMGLTNLWLVNPLVKPDSQAIALAAGASDVIGNAHIVDTLDEALAGCSLVVGTS
ARSRTLPWPMLDPRECGLKSVAEAANTPVALVFGRERVGLTNEELQKCHYHVAIAANPEYSSLNLAMAVQVIAYEVRMAW
LATQENGEQVEHEETPYPLVDDLERFYGHLEQTLLATGFIRENHPGQVMNKLRRLFTRARPESQELNILRGILASIEQQN
KGNKAE
>A0A0H2ZF87 2.1.1.-~~~trmJ~~~tRNA (cytidine/uridine/adenosine-2'-O-)-methyltransferase TrmJ~~~
MLDRIRVVLVNTSHPGNIGGAARAMKNMGLSQLVLVQPESFPHGDAVARASGATDILDAARVVDTLEEALSGCSVVLGTS
ARDRRIPWPLLDPRECATTCLEHLEANGEVALVFGREYAGLTNEELQRCQFHVHIPSDPEFGSLNLAAAVQVLTYEVRMA
WLAAQGKPTKMEKFESTSMLNTELVTADELELYYAHLERTLIDIGFLDPEKPRHLMSRLRRLYGRSAISKLEMNILRGIL
TETQKVARGLSYKRSDD
>P54471 2.1.1.217~~~trmK~~~tRNA (adenine(22)-N(1))-methyltransferase~~~COG2384
MNELKLSKRLQTVAEYIPNGAVMADIGSDHAYLPCYAVLNHKASGAIAGEITDGPFLSAKRQVEKSGLNSHISVRQGDGL
EVIKKGEADAITIAGMGGALIAHILEAGKDKLTGKERLILQPNIHAVHIREWLYKERYALIDEVILEEDGKCYEVLVAEA
GDRDAAYDGISLSAGMLVGPFLAKEKNAVFLKKWTQELQHTQSIYEQISQAADTEQNKQKLKELADRMELLKEVIDHG
>P0AGJ7 2.1.1.207~~~trmL~~~tRNA (cytidine(34)-2'-O)-methyltransferase~~~COG0219
MLNIVLYEPEIPPNTGNIIRLCANTGFRLHIIEPMGFAWDDKRLRRAGLDYHEFTAVTRHHDYRAFLEAENPQRLFALTT
KGTPAHSAVSYQDGDYLMFGPETRGLPASILDALPAEQKIRIPMVPDSRSMNLSNAVSVVVYEAWRQLGYPGAVLRD
>P44868 2.1.1.207~~~trmL~~~tRNA (cytidine(34)-2'-O)-methyltransferase~~~COG0219
MLDIVLYEPEIPQNTGNIIRLCANTGFRLHLIEPLGFTWDDKRLRRSGLDYHEFAEIKRHKTFEAFLESEKPKRLFALTT
KGCPAHSQVKFKLGDYLMFGPETRGIPMSILNEMPMEQKIRIPMTANSRSMNLSNSVAVTVYEAWRQLGYKGAVNLPEVK
>Q74Y93 2.1.1.207~~~trmL~~~tRNA (cytidine(34)-2'-O)-methyltransferase~~~COG0219
MLNIVLFEPEIPPNTGNIIRLCANTGCQLHLIKPLGFTWDDKRLRRAGLDYHEFADIKHHHDYQAFLDSEKLDSTQPARL
FALTTKGTPAHSAVSYQANDYLLFGPETRGLPAYILDALPAQQKIRIPMQADSRSMNLSNAVSVVVYEAWRQLGYPGALL
KE
>P31825 2.1.1.223~~~yfiC~~~tRNA1(Val) (adenine(37)-N6)-methyltransferase~~~COG4123
MSQSTSVLRRNGFTFKQFFVAHDRCAMKVGTDGILLGAWAPVAGVKRCLDIGAGSGLLALMLAQRTDDSVMIDAVELESE
AAAQAQENINQSPWAERINVHTADIQQWITQQTVRFDLIISNPPYYQQGVECSTPQREQARYTTTLDHPSLLTCAAECIT
EEGFFCVVLPEQIGNGFTELALSMGWHLRLRTDVAENEARLPHRVLLAFSPQAGECFSDRLVIRGPDQNYSEAYTALTQA
FYLFM
>Q72IH5 2.1.1.256~~~trmN~~~tRNA (guanine(6)-N2)-methyltransferase~~~COG0116
MWLEATTHPGLEDLLLEELSALYPGEGAEVDARKGRVRIPRAWVGEEALGLRLAHHLVLFRARLLLSREDPLGALERAAL
ALPWPELEGAGSFRVEARREGEHPFTSPEVERRVGEALHRAYGVPVDLKRPAVRVRVDVRGEEAFLGVQLTERPLSRRFP
KAALRGSLTPVLAQALLRLADARPGMRVLDPFTGSGTIALEAASTLGPTSPVYAGDLDEKRLGLAREAALASGLSWIRFL
RADARHLPRFFPEVDRILANPPHGLRLGRKEGLFHLYWDFLRGALALLPPGGRVALLTLRPALLKRALPPGFALRHARVV
EQGGVYPRVFVLEKL
>P28634 2.1.1.-~~~trmO~~~tRNA (adenine(37)-N6)-methyltransferase~~~COG1720
MSSFQFEQIGVIRSPYKEKFAVPRQPGLVKSANGELHLIAPYNQADAVRGLEAFSHLWILFVFHQTMEGGWRPTVRPPRL
GGNARMGVFATRSTFRPNPIGMSLVELKEVVCHKDSVILKLGSLDLVDGTPVVDIKPYLPFAESLPDASASYAQSAPAAE
MAVSFTAEVEKQLLTLEKRYPQLTLFIREVLAQDPRPAYRKGEETGKTYAVWLHDFNVRWRVTDAGFEVFALEPR
>P44740 2.1.1.-~~~trmO~~~tRNA (adenine(37)-N6)-methyltransferase~~~COG1720
MNDLTLSPIAIIHTPYKEKFSVPRQPNLVEDGVGIVELLPPYNSPEAVRGLEQFSHLWLIFQFDQIQQGKWQPTVRPPRL
GGNQRVGVFASRATHRPNPLGLSKVELRQVECINGNIFLHLGAVDLVDGTPIFDIKPYIAYADSEPNAQSSFAQEKLPVK
LTVEFTEQAKSAVKKREEKRPHLSRFIRQVLEQDPRPAYQQGKPSDRIYGMSLYEFNVKWRIKAGTVNCVEVIEIEKDK
>O32036 2.1.1.-~~~trmR~~~tRNA 5-hydroxyuridine methyltransferase~~~COG4122
MTDRYEQINDYIEALLKPRPDNVKRLEAYAEEHHVPIMEKAGMEVLLQILSVKQPKKILEIGTAIGYSAIRMALELPSAE
IYTIERNEKRHEEAVNNIKEFQLDDRIHVFYGDALELADAVHVTAPYDVIFIDAAKGQYQNFFHLYEPMLSPDGVIITDN
VLFKGLVAEDYSKIEPKRRRRLVAKIDEYNHWLMNHPDYQTAIIPVGDGLAISKKKR
>Q9KKP3 2.1.1.-~~~~~~Putative pseudouridine methyltransferase~~~COG1901
MRSFILRARSAPTDSQRLLDEIGGKCHTEILAHCMMNSLFTAQSHREDVVIHLVLESTRDYSRTITVEANEISDVGGFHE
AALIALLVKALDASVGMGKEQTRVVQPGLTVRTISFEALLGELAEHHSLYMMDKKGDSIRDIKIGPNPCFILTDHIPMPK
KSGNSMKRLGVEKISLGPKMLFASQCVTLIHNEIDHQEAGW
>P96116 ~~~troA~~~Zinc-binding protein TroA~~~COG0803
MIRERICACVLALGMLTGFTHAFGSKDAAADGKPLVVTTIGMIADAVKNIAQGDVHLKGLMGPGVDPHLYTATAGDVEWL
GNADLILYNGLHLETKMGEVFSKLRGSRLVVAVSETIPVSQRLSLEEAEFDPHVWFDVKLWSYSVKAVYESLCKLLPGKT
REFTQRYQAYQQQLDKLDAYVRRKAQSLPAERRVLVTAHDAFGYFSRAYGFEVKGLQGVSTASEASAHDMQELAAFIAQR
KLPAIFIESSIPHKNVEALRDAVQARGHVVQIGGELFSDAMGDAGTSEGTYVGMVTHNIDTIVAALAR
>O67502 4.2.1.20~~~trpA~~~Tryptophan synthase alpha chain~~~COG0159
MGRISDKFTELKEKREKALVSYLMVGYPDYETSLKAFKEVLKNGTDILEIGFPFSDPVADGPTIQVAHEVALKNGIRFED
VLELSETLRKEFPDIPFLLMTYYNPIFRIGLEKFCRLSREKGIDGFIVPDLPPEEAEELKAVMKKYVLSFVPLGAPTSTR
KRIKLICEAADEMTYFVSVTGTTGAREKLPYERIKKKVEEYRELCDKPVVVGFGVSKKEHAREIGSFADGVVVGSALVKL
AGQKKIEDLGNLVKELKEGLRE
>Q9PIF1 4.2.1.20~~~trpA~~~Tryptophan synthase alpha chain~~~COG0159
MVDFRKFYKENANVAYTVLGYPNLQTSEAFLQRLDQSPIDILELGVAYSDPIADGEIIADAAKIALDQGVDIHSVFELLA
RIKTKKALVFMVYYNLIFSYGLEKFVKKAKSLGICALIVPELSFEESDDLIKECERYNIALITLVSVTTPKERVKKLVKH
AKGFIYLLASIGITGTKSVEEAILQDKVKEIRSFTNLPIFVGFGIQNNQDVKRMRKVADGVIVGTSIVKCFKQGNLDIIM
KDIEEIFKK
>O84173 4.2.1.20~~~trpA~~~Tryptophan synthase alpha chain~~~
MSKLTQVFKQTKLCIGYLTAGDGGTSYTIEAAKALIQGGVDILELGFPFSDPVADNPEIQVSHDRALAENLTSETLLEIV
EGIRAFNQEVPLILYSYYNPLLQRDLDYLRRLKDAGINGVCVIDLPAPLSHGEKSPFFEDLLAVGLDPILLISAGTTPER
MSLIQEYARGFLYYIPCQATRDSEVGIKEEFRKVREHFDLPIVDRRDICDKKEAAHVLNYSDGFIVKTAFVHQTTMDSSV
ETLTALAQTVIPG
>Q72EU7 4.2.1.20~~~trpA~~~Tryptophan synthase alpha chain~~~COG0159
MSASRLERRIREAQAAGRPALIPFLTAGFPTKERFWDELEALDAAGADIIEVGVPFSDPVADGPVVAAASQRALESGVTL
RWIMDGLAARKGRLRAGLVLMGYLNPFMQYGFERFVSDAADAGVAGCIIPDLPLDEDADLRALLAARDMDLIALVGPNTG
EGRMREYAAVASGYVYVVSVMGTTGVRDGLPVEVADTLARARQCFSIPVALGFGISRPAQLEGLSHPPDAVIFGSALLRH
LDAGGDAASFMKAWAER
>P0A877 4.2.1.20~~~trpA~~~Tryptophan synthase alpha chain~~~COG0159
MERYESLFAQLKERKEGAFVPFVTLGDPGIEQSLKIIDTLIEAGADALELGIPFSDPLADGPTIQNATLRAFAAGVTPAQ
CFEMLALIRQKHPTIPIGLLMYANLVFNKGIDEFYAQCEKVGVDSVLVADVPVEESAPFRQAALRHNVAPIFICPPNADD
DLLRQIASYGRGYTYLLSRAGVTGAENRAALPLNHLVAKLKEYNAAPPLQGFGISAPDQVKAAIDAGAAGAISGSAIVKI
IEQHINEPEKMLAALKVFVQPMKAATRS
>Q5NE80 4.2.1.20~~~trpA~~~Tryptophan synthase alpha chain~~~COG0159
MTNRYTTLFANLEKRNEGAFIPFVTIGDPNKALSFEIIDTLVSSGADALELGIPFSDPLADGPTIQEANIRALESGITPK
DCFDILTKIRAKYPHIPIGLLLYANLVYANGIENFYQKCLDAGVDSILIADVPAHESKEFRDIAKKVGIAQIFIAPPDAS
ESTLKQISELGSGYTYLLSRVGVTGTETAANMPVEDVLTKLREYNAPKPVLGFGISKPEQVQQAIKAGAAGAISGSATVK
IIQNNISNKQKMLNELTYFVKEMKAATLN
>P00930 4.2.1.20~~~trpA~~~Tryptophan synthase alpha chain~~~
MERYETLFAQLKNRQEGAFVPFVTLGDPGPEQSLKIIDALIEGGADALELGIPFSDPLADGPTIQGAALRAFAAGVTPAQ
CFEMLAAIRQKHPTIPIGLLMYANLVFSPGIDAFYAQCARVGVDSVLVADVPVEESAPFRQAAMRHNIAPIFICPPNADD
DLLRQIASYGRGYTYLLSRAGVTGAENRAALPLHHLVEKLAEYHAAPPLQGFGISAPEQVSAAIDAGAAGAISGSAIVKI
IERHLDEPQTMLDELKAFVQSLKAATKTA
>Q5X5Q1 4.2.1.20~~~trpA~~~Tryptophan synthase alpha chain~~~
MNRIDKTLEKLKANRKKMLSPYITAGDPYPELTVSLMHQLVKSGADVLELGIPFSDPMAEGPVIQRAMERALAHSIHCDD
VLNMVRQFRKTDTETPVILMGYLNPIEQYGYDLFAQQAVEAGADGTILVDLPPEEADGVSRVWQKHGLYSIYLCSPTTSA
ERMNFINQHANGYLYYVSLKGVTGSDALKLPELKAQYLQRKAQSKLPLMVGFGIKTPEMAAQVAEFADGVIVGAALINEI
IEAYEAKKDPLQASGALLSSMRQAIDNIGSMV
>Q5ZVY3 4.2.1.20~~~trpA~~~Tryptophan synthase alpha chain~~~COG0159
MNRIDKTLEKLKANRKKMLSPYITAGDPYPELTVSLMHQLVKSGADVLELGIPFSDPMAEGPVIQRAMERALAHSIHCDD
VLNMVRQFRKTDTETPVILMGYLNPIEQYGYDLFAQQAVEAGVDGTILVDLPPEEADGVSRVWQKHGLYSIYLCSPTTSA
ERMNYINQHANGYLYYVSLKGVTGSDALKLPELKAQYLQRKAQSKLPLMVGFGIKTPEMAAQVAEFADGVIVGAALINEI
IEAYEAKKDPLQASGALLSSMRQAIDNIGSMV
>P9WFY1 4.2.1.20~~~trpA~~~Tryptophan synthase alpha chain~~~COG0159
MVAVEQSEASRLGPVFDSCRANNRAALIGYLPTGYPDVPASVAAMTALVESGCDIIEVGVPYSDPGMDGPTIARATEAAL
RGGVRVRDTLAAVEAISIAGGRAVVMTYWNPVLRYGVDAFARDLAAAGGLGLITPDLIPDEAQQWLAASEEHRLDRIFLV
APSSTPERLAATVEASRGFVYAASTMGVTGARDAVSQAAPELVGRVKAVSDIPVGVGLGVRSRAQAAQIAQYADGVIVGS
ALVTALTEGLPRLRALTGELAAGVRLGMSA
>P00929 4.2.1.20~~~trpA~~~Tryptophan synthase alpha chain~~~
MERYENLFAQLNDRREGAFVPFVTLGDPGIEQSLKIIDTLIDAGADALELGVPFSDPLADGPTIQNANLRAFAAGVTPAQ
CFEMLALIREKHPTIPIGLLMYANLVFNNGIDAFYARCEQVGVDSVLVADVPVEESAPFRQAALRHNIAPIFICPPNADD
DLLRQVASYGRGYTYLLSRSGVTGAENRGALPLHHLIEKLKEYHAAPALQGFGISSPEQVSAAVRAGAAGAISGSAIVKI
IEKNLASPKQMLAELRSFVSAMKAASRA
>Q97P33 4.2.1.20~~~trpA~~~Tryptophan synthase alpha chain~~~COG0159
MPKTLTEKLNAIKAAGKGIFVPYIMAGDHEKGLDGLAETIHFLEDLGVSAIEVGIPFSDPVADGPVIEEAGLRSLAHGTS
TQALVETLKTIETEIPLVIMTYFNPLFQYGVENFVKDLADTAVKGLIIPDLPHEHANFVEPFLANTDIALIPLVSLTTGI
ERQKELIEGAEGFIYAVAINGVTGKSGNYRADLDKHLAQLHQVADIPVLTGFGVSSQADLERFNAVSDGVIVGSKIVKAL
HQGEPIQDFIRQAVAYQK
>P16608 4.2.1.20~~~trpA~~~Tryptophan synthase alpha chain~~~COG0159
MTTLEAFAKARSEGRAALIPYLTAGFPSREGFLQAVEEVLPYADLLEIGLPYSDPLGDGPVIQRASELALRKGMSVQGAL
ELVREVRALTEKPLFLMTYLNPVLAWGPERFFGLFKQAGATGVILPDLPPDEDPGLVRLAQEIGLETVFLLAPTSTDARI
ATVVRHATGFVYAVSVTGVTGMRERLPEEVKDLVRRIKARTALPVAVGFGVSGKATAAQAAVADGVVVGSALVRALEEGR
SLAPLLQEIRQGLQRLEANPGLKESSKKPLS
>Q9KST7 4.2.1.20~~~trpA~~~Tryptophan synthase alpha chain~~~COG0159
MNRYQALFQRLSAAQQGAFVPFVTIGDPNPEQSLAIMQTLIDAGADALELGMPFSDPLADGPTIQGANLRALAAKTTPDI
CFELIAQIRARNPETPIGLLMYANLVYARGIDDFYQRCQKAGVDSVLIADVPTNESQPFVAAAEKFGIQPIFIAPPTASD
ETLRAVAQLGKGYTYLLSRAGVTGAETKANMPVHALLERLQQFDAPPALLGFGISEPAQVKQAIEAGAAGAISGSAVVKI
IETHLDNPAKQLTELANFTQAMKKATKI
>Q9HVS4 ~~~~~~Tripeptide-binding protein~~~
MRPRSALRYSLLLLALAASAAIQAQPKTLAVCTEAAPEGFDPARYTSGYTFDASAHPLYNALAAFAPGSATVIPALAESW
DVSADGLVYTFRLRQGVKFHSTDYFKPTREFDADDVLFSFQRMLDPQHPAHDLSPSGYPYADAMQLRDIIERIEKIDEHQ
VRFVLKHPEAPFLADLAMPFGSILSAEYAGQLIARGKGDELNSKPIGTGPFVFTRYRKDAQVRYAANPDYWKGKPAIDHL
VLAITLDPNVRVQRLRRNECQIALTPKPEDVAALRQDPQLTVLEEAAMITSHAAINTRHEPFDDPRVRRAIAMGFNKSSY
LKIVFGDQARPAIGPYPPMLLGYDDSIRDWPYDPERAKALLKEAGVTPDTPLNLYISTGSGPGGNPARVAQLIQSDLAAI
GIRVNIRQFEWGEMVKRTKAGEHDMMLYSWIGDNGDPDNFLTHNLGCASVESGENRARWCDKGFDEAIRKARMSNDESQR
VALYKEAQRIFHEQMPWLPLAHPLMFDAQRKNVSGYRMSPMSARDFSRVKLD
>Q81TL8 4.2.1.20~~~trpB~~~Tryptophan synthase beta chain~~~COG0133
MNYAYPDEKGHYGIYGGRYVPETLMQSVLELEEAYKEAMEDEAFQKELNHYLKTYVGRETPLYFAENMTEYCGGAKIYLK
REDLNHTGAHKINNTIGQALLAVRMGKKKVVAETGAGQHGVATATVCALLGLECVIFMGEEDVRRQKLNVFRMELLGAKV
ESVAAGSGTLKDAVNEALRYWVSHVHDTHYIMGSVLGPHPFPQIVRDFQSVIGNETKKQYEALEGKLPEAVVACIGGGSN
AMGMFYPFVHDEEVALYGVEAAGKGVHTEKHAATLTKGSVGVLHGSMMYLLQNEEGQIQEAHSISAGLDYPGVGPEHSLL
KDIGRVSYHSITDDEALEAFQLLTKKEGIIPALESSHAVAYALKLAPQMKEDEGLVICLSGRGDKDVESIKRYMEEV
>O84172 4.2.1.20~~~trpB~~~Tryptophan synthase beta chain~~~
MFKHKHPFGGAFLPEELLAPIQNLKAEWEILKTQQSFLSELDCILKNYAGRQTPLTEVKNFARAIDGPRVFLKREDLLHT
GAHKLNNALGQCLLAKYLGKTRVVAETGAGQHGVATATACAYLGLDCVVYMGAKDVERQKPNVEKMRFLGAEVVSVTKGS
CGLKDAVNQALQDWATTHSFTHYCLGSALGPLPYPDIVRFFQSVISAEVKEQIHAVAGRDPDILIACIGGGSNAIGFFHH
FIPNPKVQLIGVEGGGLGISSGKHAARFATGRPGVFHGFYSYLLQDDDGQVLQTHSISAGLDYPSVGPDHAEMHESGRAF
YTLATDEEALRAFFLLTRNEGIIPALESSHALAHLVSIAPSLPKEQIVIVNLSGRGDKDLPQIIRRNRGIYE
>P0A879 4.2.1.20~~~trpB~~~Tryptophan synthase beta chain~~~COG0133
MTTLLNPYFGEFGGMYVPQILMPALRQLEEAFVSAQKDPEFQAQFNDLLKNYAGRPTALTKCQNITAGTNTTLYLKREDL
LHGGAHKTNQVLGQALLAKRMGKTEIIAETGAGQHGVASALASALLGLKCRIYMGAKDVERQSPNVFRMRLMGAEVIPVH
SGSATLKDACNEALRDWSGSYETAHYMLGTAAGPHPYPTIVREFQRMIGEETKAQILEREGRLPDAVIACVGGGSNAIGM
FADFINETNVGLIGVEPGGHGIETGEHGAPLKHGRVGIYFGMKAPMMQTEDGQIEESYSISAGLDFPSVGPQHAYLNSTG
RADYVSITDDEALEAFKTLCLHEGIIPALESSHALAHALKMMRENPDKEQLLVVNLSGRGDKDIFTVHDILKARGEI
>Q5NE79 4.2.1.20~~~trpB~~~Tryptophan synthase beta chain~~~COG0133
MSKLNAYFGEYGGQFVPQILVPALDQLEQEFIKAQADESFKQEFKELLQEYAGRPTALTKTRNIVKNTRTKLYLKREDLL
HGGAHKTNQVLGQALLAKRMGKKEIIAETGAGQHGVATALACALLDLKCRVYMGAKDVERQSPNVFRMKLMGAEVIPVHS
GSATLKDACNEALRDWSANYSKAHYLLGTAAGPHPFPTIVREFQRMIGEETKQQMLAKEGRLPDAVIACVGGGSNAIGMF
ADFIDEKNVKLIGVEPAGKGIETGEHGAPLKHGKTGIFFGMKAPLMQNSDGQIEESYSISAGLDFPSVGPQHAHLLAIGR
AKYASATDDEALDAFKLLCKKEGIIPALESSHALAHALKLAYEDPNKEQLLVVNLSGRGDKDIFTVHDILKEKGEI
>P9WFX9 4.2.1.20~~~trpB~~~Tryptophan synthase beta chain~~~COG0133
MTDLSTPDLPRMSAAIAEPTSHDPDSGGHFGGPSGWGGRYVPEALMAVIEEVTAAYQKERVSQDFLDDLDRLQANYAGRP
SPLYEATRLSQHAGSARIFLKREDLNHTGSHKINNVLGQALLARRMGKTRVIAETGAGQHGVATATACALLGLDCVIYMG
GIDTARQALNVARMRLLGAEVVAVQTGSKTLKDAINEAFRDWVANADNTYYCFGTAAGPHPFPTMVRDFQRIIGMEARVQ
IQGQAGRLPDAVVACVGGGSNAIGIFHAFLDDPGVRLVGFEAAGDGVETGRHAATFTAGSPGAFHGSFSYLLQDEDGQTI
ESHSISAGLDYPGVGPEHAWLKEAGRVDYRPITDSEAMDAFGLLCRMEGIIPAIESAHAVAGALKLGVELGRGAVIVVNL
SGRGDKDVETAAKWFGLLGND
>Q2KE82 4.2.1.20~~~trpB~~~Tryptophan synthase beta chain~~~COG0133
MNETPKPNSFRSGPDEDGRFGIYGGRFVAETLMPLILDLQDEWNKAKSDPAFQAELKHLGAHYIGRPSPLYFAERLTAEL
GGAKIYFKREELNHTGSHKINNCIGQILLAKRMGKTRIIAETGAGQHGVASATVAARFGLPCVVYMGATDVERQAPNVFR
MKLLGAEVKPVTAGSGTLKDAMNEALRDWVTNVEDTYYLIGTAAGPHPYPEMVRDFQSVIGAEAKEQMLAAEGRLPDLVV
AAVGGGSNAIGIFHPFLDDSSVKIVGVEAGGKGLQGDEHCASITAGSPGVLHGNRTYLLQDGDGQIKEGHSISAGLDYPG
IGPEHSWLSDIGRVDYVPIMDHEALEAFQTLTRLEGIIPALEAAHAIAEVIKRAPKMGKDEIILMNLSGRGDKDIFTVGK
ILGMGL
>P0A2K1 4.2.1.20~~~trpB~~~Tryptophan synthase beta chain~~~
MTTLLNPYFGEFGGMYVPQILMPALNQLEEAFVSAQKDPEFQAQFADLLKNYAGRPTALTKCQNITAGTRTTLYLKREDL
LHGGAHKTNQVLGQALLAKRMGKSEIIAETGAGQHGVASALASALLGLKCRIYMGAKDVERQSPNVFRMRLMGAEVIPVH
SGSATLKDACNEALRDWSGSYETAHYMLGTAAGPHPYPTIVREFQRMIGEETKAQILDKEGRLPDAVIACVGGGSNAIGM
FADFINDTSVGLIGVEPGGHGIETGEHGAPLKHGRVGIYFGMKAPMMQTADGQIEESYSISAGLDFPSVGPQHAYLNSIG
RADYVSITDDEALEAFKTLCRHEGIIPALESSHALAHALKMMREQPEKEQLLVVNLSGRGDKDIFTVHDILKARGEI
>Q97P32 4.2.1.20~~~trpB~~~Tryptophan synthase beta chain~~~COG0133
MAYQEPNKDGFYGKFGGRFVPETLMTAVLELEKAYRESQADPSFQEELNQLLRQYVGRETPLYYAKNLTQHIGGAKIYLK
REDLNHTGAHKINNALGQVWLAKRMGKKKIIAETGAGQHGVATATAAALFNMECTIYMGEEDVKRQALNVFRMELLGAKV
EAVTDGSRVLKDAVNAALRSWVANIDDTHYILGSALGPHPFPEIVRDFQSVIGREAKQQYRDLTGRDLPDALVACVGGGS
NAIGLFHPFVEDESVAMYGTEAAGLGVDTEHHAATLTKGRPGVLHGSLMDVLQDAHGQILEAFSISAGLDYPGIGPEHSH
YHDIKRASYVPVTDEEALEGFQLLSRVEGIIPALESSHAIAFAVKLAKELGPEKSMIVCLSGRGDKDVVQVKDRLEADAA
KKGEAHA
>P16609 4.2.1.20~~~trpB~~~Tryptophan synthase beta chain~~~COG0133
MLTLPDFPLPDARGRFGPYGGRYVPETLIPALEELEAAYREAKKDPAFLEELDHYLRQFAGRPTPLYHAKRLSEYWGGAQ
VFLKREDLLHTGAHKINNTLGQALLARRMGKRRVIAETGAGQHGVSVATVAALFGLECVVYMGEEDVRRQALNVFRMKLL
GAEVRPVAAGSRTLKDATNEAIRDWITNVRTTFYILGSVVGPHPYPMMVRDFQSVIGEEVKRQSLELFGRLPDALIAAVG
GGSNAIGLFAPFAYLPEGRPKLIGVEAAGEGLSTGRHAASIGAGKRGVLHGSYMYLLYDHDGQITPAHSVSAGLDYPGVG
PEHSYYADAGVAEYASVTDEEALEGFKLLARLEGIIPALESAHAIAYAAKVVPEMDKDQVVVINLSGRGDKDVTEVMRLL
GGEL
>Q2YRR4 4.1.1.48~~~trpC~~~Indole-3-glycerol phosphate synthase~~~
MSTDILRKIEAYKREEIAAAKARLALDELKARTRDQSAPRGFLKALEAKRAAGQFALIAEIKKASPSKGLIRPDFDPPAL
AKAYEEGGAACLSVLTDTPSFQGAPEFLTAARQACSLPALRKDFLFDPYQVYEARSWGADCILIIMASVDDDLAKELEDT
AFALGMDALIEVHDEAEMERALKLSSRLLGVNNRNLRSFEVNLAVSERLAKMAPSDRLLVGESGIFTHEDCLRLEKSGIG
TFLIGESLMRQHDVAAATRALLTGAEKL
>Q9PI11 4.1.1.48~~~trpC~~~Indole-3-glycerol phosphate synthase~~~COG0134
MILDKIFEKTKEDLKERKLKLPYDMLGRSLASNPFFPKDVIKALKRVEKEVKIIAEVKKASPSKGVIREDFDPLSIALNY
EKNKAAAISVLTEPHFFKGSLEYLSLIRRYTQIPLLRKDFIFDEYQILEALVYGADFVLLIAKMLSMKELKKLLEFARHL
GLEALVEIHDKEDLSKAIFAGADIIGINHRNLEDFTMDMSLCEKLIPQIPNSKIIIAESGLENKEFLEHLQNLGVDAFLI
GEYFMREKDEGKALKALL
>P06560 ~~~trpC~~~Tryptophan biosynthesis protein TrpCF~~~COG0134
MTSNNLPTVLESIVEGRRGHLEEIRARIAHVDVDALPKSTRSLFDSLNQGRGGARFIMECKSASPSLGMIREHYQPGEIA
RVYSRYASGISVLCEPDRFGGDYDHLATVAATSHLPVLCKDFIIDPVQVHAARYFGADAILLMLSVLDDEEYAALAAEAA
RFDLDILTEVIDEEEVARAIKLGAKIFGVNHRNLHDLSIDLDRSRRLSKLIPADAVLVSESGVRDTETVRQLGGHSNAFL
VGSQLTSQENVDLAARELVYGPNKVCGLTSPSAAQTARAAGAVYGGLIFEEASPRNVSRETLQKIIAAEPNLRYVAVSRR
TSGYKDLLVDGIFAVQIHAPLQDSVEAEKALIAAVREEVGPQVQVWRAISMSSPLGAEVAAAVEGDVDKLILDAHEGGSG
EVFDWATVPAAVKAKSLLAGGISPDNAAQALAVGCAGLDINSGVEYPAGAGTWAGAKDAGALLKIFATISTFHY
>P00909 ~~~trpC~~~Tryptophan biosynthesis protein TrpCF~~~COG0134
MMQTVLAKIVADKAIWVEARKQQQPLASFQNEVQPSTRHFYDALQGARTAFILECKKASPSKGVIRDDFDPARIAAIYKH
YASAISVLTDEKYFQGSFNFLPIVSQIAPQPILCKDFIIDPYQIYLARYYQADACLLMLSVLDDDQYRQLAAVAHSLEMG
VLTEVSNEEEQERAIALGAKVVGINNRDLRDLSIDLNRTRELAPKLGHNVTVISESGINTYAQVRELSHFANGFLIGSAL
MAHDDLHAAVRRVLLGENKVCGLTRGQDAKAAYDAGAIYGGLIFVATSPRCVNVEQAQEVMAAAPLQYVGVFRNHDIADV
VDKAKVLSLAAVQLHGNEEQLYIDTLREALPAHVAIWKALSVGETLPAREFQHVDKYVLDNGQGGSGQRFDWSLLNGQSL
GNVLLAGGLGADNCVEAAQTGCAGLDFNSAVESQPGIKDARLLASVFQTLRAY
>A0QX95 4.1.1.48~~~trpC~~~Indole-3-glycerol phosphate synthase~~~COG0134
MSSATVLDSIIEGVRADVAAREAVISLDEIKERAKAAPPPLNVMAALREPGIGVIAEVKRASPSRGALASIGDPAELAQA
YQDGGARVISVLTEQRRFNGSLDDLDAVRAAVSIPVLRKDFIVRPYQIHEARAHGADMLLLIVAALDQPVLESLLERTES
LGMTALVEVHTEEEADRALKAGASVIGVNARDLKTLEVDRTVFSRIAPGLPSNVIRVAESGVRGTADLLAYAGAGADAVL
VGEGLVTSGDPRSAVADLVTAGTHPSCPKPAR
>P9WFX7 4.1.1.48~~~trpC~~~Indole-3-glycerol phosphate synthase~~~COG0134
MSPATVLDSILEGVRADVAAREASVSLSEIKAAAAAAPPPLDVMAALREPGIGVIAEVKRASPSAGALATIADPAKLAQA
YQDGGARIVSVVTEQRRFQGSLDDLDAVRASVSIPVLRKDFVVQPYQIHEARAHGADMLLLIVAALEQSVLVSMLDRTES
LGMTALVEVHTEQEADRALKAGAKVIGVNARDLMTLDVDRDCFARIAPGLPSSVIRIAESGVRGTADLLAYAGAGADAVL
VGEGLVTSGDPRAAVADLVTAGTHPSCPKPAR
>Q56319 4.1.1.48~~~trpC~~~Indole-3-glycerol phosphate synthase~~~COG0134
MRRLWEIVEAKKKDILEIDGENLIVQRRNHRFLEVLSGKERVKIIAEFKKASPSAGDINADASLEDFIRMYDELADAISI
LTEKHYFKGDPAFVRAARNLTCRPILAKDFYIDTVQVKLASSVGADAILIIARILTAEQIKEIYEAAEELGMDSLVEVHS
REDLEKVFSVIRPKIIGINTRDLDTFEIKKNVLWELLPLVPDDTVVVAESGIKDPRELKDLRGKVNAVLVGTSIMKAENP
RRFLEEMRAWSE
>Q8YXQ9 2.4.2.18~~~trpD2~~~Anthranilate phosphoribosyltransferase 2~~~COG0547
MTSSPTSTQESSTSWYLLLQQLIDGESLSRSQAAELMQGWLSEAVPPELSGAILTALNFKGVSADELTGMAEVLQSQSKM
GTGENYSQLPITNSPFSIIDTCGTGGDGSSTFNISTAVAFVAAAYGVPVAKHGNRSASSLTGSADVLEALGVNLGASPEK
VQAALQEVGITFLFAPGWHPALKAVATLRRTLRIRTVFNLLGPLVNPLRPTGQVVGLFTPKLLTTVAQALDNLGKQKAIV
LHGRERLDEAGLGDLTDLAVLSDGELQLTTINPQEVGVTPAPIGALRGGDVQENAEILKAVLQGKGTQAQQDAVALNAAL
ALQVAGAVPLLDHAQGVSVAKEILQTGTAWAKLAQLVYFLGN
>J7SZ64 4.1.1.105~~~~~~Tryptophan decarboxylase~~~
MKFWRKYTQQEMDEKITESLEKTLNYDNTKTIGIPGTKLDDTVFYDDHSFVKHSPYLRTFIQNPNHIGCHTYDKADILFG
GTFDIERELIQLLAIDVLNGNDEEFDGYVTQGGTEANIQAMWVYRNYFKKERKAKHEEIAIITSADTHYSAYKGSDLLNI
DIIKVPVDFYSRKIQENTLDSIVKEAKEIGKKYFIVISNMGTTMFGSVDDPDLYANIFDKYNLEYKIHVDGAFGGFIYPI
DNKECKTDFSNKNVSSITLDGHKMLQAPYGTGIFVSRKNLIHNTLTKEATYIENLDVTLSGSRSGSNAVAIWMVLASYGP
YGWMEKINKLRNRTKWLCKQLNDMRIKYYKEDSMNIVTIEEQYVNKEIAEKYFLVPEVHNPTNNWYKIVVMEHVELDILN
SLVYDLRKFNKEHLKAM
>A7B1V0 4.1.1.105~~~~~~Tryptophan decarboxylase~~~COG0076
MSQVIKKKRNTFMIGTEYILNSTQLEEAIKSFVHDFCAEKHEIHDQPVVVEAKEHQEDKIKQIKIPEKGRPVNEVVSEMM
NEVYRYRGDANHPRFFSFVPGPASSVSWLGDIMTSAYNIHAGGSKLAPMVNCIEQEVLKWLAKQVGFTENPGGVFVSGGS
MANITALTAARDNKLTDINLHLGTAYISDQTHSSVAKGLRIIGITDSRIRRIPTNSHFQMDTTKLEEAIETDKKSGYIPF
VVIGTAGTTNTGSIDPLTEISALCKKHDMWFHIDGAYGASVLLSPKYKSLLTGTGLADSISWDAHKWLFQTYGCAMVLVK
DIRNLFHSFHVNPEYLKDLENDIDNVNTWDIGMELTRPARGLKLWLTLQVLGSDLIGSAIEHGFQLAVWAEEALNPKKDW
EIVSPAQMAMINFRYAPKDLTKEEQDILNEKISHRILESGYAAIFTTVLNGKTVLRICAIHPEATQEDMQHTIDLLDQYG
REIYTEMKKA
>B2IXH4 1.4.1.19~~~~~~L-tryptophan dehydrogenase~~~COG0334
MLLFETVREMGHEQVLFCHGKNPEIKAIIAIHDTTLGPAMGATRLMPYVNEEAALKDALRLSRGMTYKAACANIPAGGGK
AVIIANPENKTDDLLRAYGRFVNSLNGRFITGQDVNITPDDVRTISQETNHVVGVSEKSGGPAPITSLGVFLGIKAAVES
RWQSKRLDGMKVAVQGLGNVGKNLCRHLHEHDVKLFVSDVDPAKAEEAKRLFGATVVEPTEIYSLDVDIFSPCALGGILN
SHTIPFLQAQIIAGAANNQLENEQLHSQMLTRKGILYSPDYVINAGGLINVYNEMIGYDEEKAFKQVHNIYDTLLAIFDI
AKEQGVTTNDAAKRLAEDRISNSKRSKSKAIAA
>W8CV45 1.4.1.19~~~nptrpdh~~~L-tryptophan dehydrogenase~~~
MLLFETVREMGHEQVLFCHSKNPEIKAIIAIHDTTLGPAMGATRILPYINEEAALKDALRLSRGMTYKAACANIPAGGGK
AVIIANPENKTDDLLRAYGRFVDSLNGRFITGQDVNITPDDVRTISQETKYVVGVSEKSGGPAPITSLGVFLGIKAAVES
RWQSKRLDGMKVAVQGLGNVGKNLCRHLHEHDVQLFVSDVDPIKAEEVKRLFGATVVEPTEIYSLDVDIFAPCALGGILN
SHTIPFLQASIIAGAANNQLENEQLHSQMLAKKGILYSPDYVINAGGLINVYNEMIGYDEEKAFKQVHNIYDTLLAIFEI
AKEQGVTTNDAARRLAEDRINNSKRSKSKAIAA
>P00500 2.4.2.18~~~trpD~~~Anthranilate phosphoribosyltransferase~~~COG0547
MNIQQALNHITKNIHLTQAQMEDVMRSIMQGEATEAQIGALMMGLRMKGESIDEITAAARVMRELAIKIDVSDIQYLVDI
VGTGGDGQNLFNVSTASSFVIAAAGATIAKHGNRGVSSKSGSSDLLEQAGINLDLDMQQTERCIREMGVGFLFAPNHHKA
MKYAVGPRRELGIRSIFNLLGPLTNPAGVKRFVIGVFSDELCRPIAEVMKQLGAEHVMVVHSKDGLDEISLASQTYIAEL
KNGEVTEWVLNPEDVNIPSQTLSGLIVEDSNASLKLIKDALGRKKSDIGEKAANMIALNAGAGIYVSGLATSYKQGVALA
HDIIYGGQALEKMSILSEFTKALKEYANN
>A0R048 2.4.2.18~~~trpD~~~Anthranilate phosphoribosyltransferase~~~COG0547
MTSGPSQPFPSASGPDDGPSWPRILGRLTTGQNLPNGHAAWAMDQIMTGAATPAQISGFAVSMKMKRPTASEVRELADIM
LTHARRVPTDEIGTDTVDIVGTGGDGANTVNLSTMASIVVAACGVPVVKHGNRAASSLSGGADTLEALGVRIDLGPDDVA
RSVREVGIGFAFAPQFHPSYKHASIVRREIGVPTVFNLLGPLTNPAGPRAGLIGCAWADLAEVMAGVFAARGSSVLVVHG
DDGLDELTTTTTSTIWRVQGGTMERLTFDPAAFGFKRAEISELVGGDASENAAEARAVLGGAKGPVRDAVVLNAAGAMVA
HAGLASDAQWLPAWEAGLARAVEAIDSGAAEQLLARWVRFGQQL
>A5U4M0 2.4.2.18~~~trpD~~~Anthranilate phosphoribosyltransferase~~~COG0547
MALSAEGSSGGSRGGSPKAEAASVPSWPQILGRLTDNRDLARGQAAWAMDQIMTGNARPAQIAAFAVAMTMKAPTADEVG
ELAGVMLSHAHPLPADTVPDDAVDVVGTGGDGVNTVNLSTMAAIVVAAAGVPVVKHGNRAASSLSGGADTLEALGVRIDL
GPDLVARSLAEVGIGFCFAPRFHPSYRHAAAVRREIGVPTVFNLLGPLTNPARPRAGLIGCAFADLAEVMAGVFAARRSS
VLVVHGDDGLDELTTTTTSTIWRVAAGSVDKLTFDPAGFGFARAQLDQLAGGDAQANAAAVRAVLGGARGPVRDAVVLNA
AGAIVAHAGLSSRAEWLPAWEEGLRRASAAIDTGAAEQLLARWVRFGRQI
>P9WFX5 2.4.2.18~~~trpD~~~Anthranilate phosphoribosyltransferase~~~COG0547
MALSAEGSSGGSRGGSPKAEAASVPSWPQILGRLTDNRDLARGQAAWAMDQIMTGNARPAQIAAFAVAMTMKAPTADEVG
ELAGVMLSHAHPLPADTVPDDAVDVVGTGGDGVNTVNLSTMAAIVVAAAGVPVVKHGNRAASSLSGGADTLEALGVRIDL
GPDLVARSLAEVGIGFCFAPRFHPSYRHAAAVRREIGVPTVFNLLGPLTNPARPRAGLIGCAFADLAEVMAGVFAARRSS
VLVVHGDDGLDELTTTTTSTIWRVAAGSVDKLTFDPAGFGFARAQLDQLAGGDAQANAAAVRAVLGGARGPVRDAVVLNA
AGAIVAHAGLSSRAEWLPAWEEGLRRASAAIDTGAAEQLLARWVRFGRQI
>Q5SH88 2.4.2.18~~~trpD~~~Anthranilate phosphoribosyltransferase~~~COG0547
MDAVKKAILGEVLEEEEAYEVMRALMAGEVSPVRAAGLLVALSLRGERPHEIAAMARAMREAARPLRVHRRPLLDIVGTG
GDGKGLMNLSTLAALVAAAGGVAVAKHGNRAASSRAGSADLLEALGVDLEAPPERVGEAIEELGFGFLFARVFHPAMRHV
APVRAELGVRTVFNLLGPLTNPAGADAYVLGVFSPEWLAPMAEALERLGARGLVVHGEGADELVLGENRVVEVGKGAYAL
TPEEVGLKRAPLEALKGGGPEENAALARRLLKGEEKGPLADAVALAAGAGFYAAGKTPSLKEGVALAREVLASGEAYLLL
ERYVAFLRA
>P83827 2.4.2.18~~~trpD~~~Anthranilate phosphoribosyltransferase~~~
MDAVKKAILGEVLEEEEAYEVMRALMAGEVSPVRAAGLLVALSLRGERPHEIAAMARAMREAARPLRVHRRPLLDIVGTG
GDGKGLMNLSTLAALVAAAGGVAVAKHGNRAASSRAGSADLLEALGVDLEAPPERVGEAIEELGFGFLFARVFHPAMRHV
APVRAELGVRTVFNLLGPLTNPAGADAYVLGVFSPEWLAPMAEALERLGARGLVVHGEGADELVLGENRVVEVGKGAYAL
TPEEVGLKRAPLEALKGGGPEENAALARRLLKGEEKGPLADAVALAAGAGFYAAGKTPSLKEGVALAREVLASGEAYLLL
ERYVAFLRA
>Q8PD71 2.4.2.18~~~trpD~~~Anthranilate phosphoribosyltransferase~~~COG0547
MPITPQQALQRTIEHREIFHDEMVDLMRQIMRGEVSDAMVSAILTGLRVKKETIGEIAGAATVMREFSRRVEVTDRRHMV
DIVGTGGDGSHTFNISTCAMFVAAAGGAKVAKHGNRSVSSKSGSADALEALGAVIELQPEQVAASLAQTGIGFMYAPVHH
PAMKVVAPVRREMGVRTIFNILGPLTNPAGSPNILMGVFHPDLVGIQARVLQELGAERALVVWGRDGMDELSLGAGTLVG
ELRDGQVHEYEVHPEDFGIAMSASRNLKVADAAESRAMLLQVLDNVPGPALDIVALNAGAALYVAGVADSIADGIVRARQ
VLADGSARACLDAYVAFTQQATAQG
>P00895 4.1.3.27~~~trpE~~~Anthranilate synthase component 1~~~COG0147
MQTQKPTLELLTCEGAYRDNPTALFHQLCGDRPATLLLESADIDSKDDLKSLLLVDSALRITALGDTVTIQALSGNGEAL
LALLDNALPAGVESEQSPNCRVLRFPPVSPLLDEDARLCSLSVFDAFRLLQNLLNVPKEEREAMFFGGLFSYDLVAGFED
LPQLSAENNCPDFCFYLAETLMVIDHQKKSTRIQASLFAPNEEEKQRLTARLNELRQQLTEAAPPLPVVSVPHMRCECNQ
SDEEFGGVVRLLQKAIRAGEIFQVVPSRRFSLPCPSPLAAYYVLKKSNPSPYMFFMQDNDFTLFGASPESSLKYDATSRQ
IEIYPIAGTRPRGRRADGSLDRDLDSRIELEMRTDHKELSEHLMLVDLARNDLARICTPGSRYVADLTKVDRYSYVMHLV
SRVVGELRHDLDALHAYRACMNMGTLSGAPKVRAMQLIAEAEGRRRGSYGGAVGYFTAHGDLDTCIVIRSALVENGIATV
QAGAGVVLDSVPQSEADETRNKARAVLRAIATAHHAQETF
>A0QX93 4.1.3.27~~~trpE~~~Anthranilate synthase component 1~~~COG0147
MQTTANHSSRSTQTGTRAHGAALAETTSREDFRALATEHRVVPVIRKVLADSETPLSAYRKLAANRPGTFLLESAENGRS
WSRWSFIGAGAPSALTVRDNAAAWLGTAPEGAPSGGDPLDALRATLDLLKTEAMAGLPPLSSGLVGFFAYDMVRRLERLP
ELAVDDLGLPDMLLLLATDIAAVDHHEGTITLIANAVNWNGTDERVDWAYDDAVARLDVMTKALGQPLTSAVATFSRPAP
DHRAQRTMEEYTEIVDKLVGDIEAGEAFQVVPSQRFEMDTAADPLDVYRILRVTNPSPYMYLLNIPDADGGLDFSIVGSS
PEALVTVKDGRATTHPIAGTRWRGATEEEDVLLEKELLADEKERAEHLMLVDLGRNDLGRVCRPGTVRVDDYSHIERYSH
VMHLVSTVTGELAEDKTALDAVTACFPAGTLSGAPKVRAMELIEEVEKTRRGLYGGVVGYLDFAGNADFAIAIRTALMRN
GTAYVQAGGGVVADSNGPYEYTEAANKARAVLNAIAAAATLAEP
>P9WFX2 4.1.3.27~~~trpE~~~Anthranilate synthase component 1~~~
MHADLAATTSREDFRLLAAEHRVVPVTRKVLADSETPLSAYRKLAANRPGTFLLESAENGRSWSRWSFIGAGAPTALTVR
EGQAVWLGAVPKDAPTGGDPLRALQVTLELLATADRQSEPGLPPLSGGMVGFFAYDMVRRLERLPERAVDDLCLPDMLLL
LATDVAAVDHHEGTITLIANAVNWNGTDERVDWAYDDAVARLDVMTAALGQPLPSTVATFSRPEPRHRAQRTVEEYGAIV
EYLVDQIAAGEAFQVVPSQRFEMDTDVDPIDVYRILRVTNPSPYMYLLQVPNSDGAVDFSIVGSSPEALVTVHEGWATTH
PIAGTRWRGRTDDEDVLLEKELLADDKERAEHLMLVDLGRNDLGRVCTPGTVRVEDYSHIERYSHVMHLVSTVTGKLGEG
RTALDAVTACFPAGTLSGAPKVRAMELIEEVEKTRRGLYGGVVGYLDFAGNADFAIAIRTALMRNGTAYVQAGGGVVADS
NGSYEYNEARNKARAVLNAIAAAETLAAPGANRSGC
>P9WFX3 4.1.3.27~~~trpE~~~Anthranilate synthase component 1~~~COG1169
MHADLAATTSREDFRLLAAEHRVVPVTRKVLADSETPLSAYRKLAANRPGTFLLESAENGRSWSRWSFIGAGAPTALTVR
EGQAVWLGAVPKDAPTGGDPLRALQVTLELLATADRQSEPGLPPLSGGMVGFFAYDMVRRLERLPERAVDDLCLPDMLLL
LATDVAAVDHHEGTITLIANAVNWNGTDERVDWAYDDAVARLDVMTAALGQPLPSTVATFSRPEPRHRAQRTVEEYGAIV
EYLVDQIAAGEAFQVVPSQRFEMDTDVDPIDVYRILRVTNPSPYMYLLQVPNSDGAVDFSIVGSSPEALVTVHEGWATTH
PIAGTRWRGRTDDEDVLLEKELLADDKERAEHLMLVDLGRNDLGRVCTPGTVRVEDYSHIERYSHVMHLVSTVTGKLGEG
RTALDAVTACFPAGTLSGAPKVRAMELIEEVEKTRRGLYGGVVGYLDFAGNADFAIAIRTALMRNGTAYVQAGGGVVADS
NGSYEYNEARNKARAVLNAIAAAETLAAPGANRSGC
>P20580 4.1.3.27~~~trpE~~~Anthranilate synthase component 1~~~
MNREEFLRLAADGYNRIPLSFETLADFDTPLSIYLKLADAPNSYLLESVQGGEKWGRYSIIGLPCRTVLRVYDHQVRISI
DGVETERFDCADPLAFVEEFKARYQVPTVPGLPRFDGGLVGYFGYDCVRYVEKRLATCPNPDPLGNPDILLMVSDAVVVF
DNLAGKIHAIVLADPSEENAYERGQARLEELLERLRQPITPRRGLDLEAAQGREPAFRASFTREDYENAVGRIKDYILAG
DCMQVVPSQRMSIEFKAAPIDLYRALRCFNPTPYMYFFNFGDFHVVGSSPEVLVRVEDGLVTVRPIAGTRPRGINEEADL
ALEQDLLSDAKEIAEHLMLIDLGRNDVGRVSDIGAVKVTEKMVIERYSNVMHIVSNVTGQLREGLSAMDALRAILPAGTL
SGAPKIRAMEIIDELEPVKRGVYGGAVGYLAWNGNMDTAIAIRTAVIKNGELHVQAGGGIVADSVPALEWEETINKRRAM
FRAVALAEQSVE
>P00898 4.1.3.27~~~trpE~~~Anthranilate synthase component 1~~~
MQTPKPTLELLTCDAAYRENPTALFHQVCGDRPATLLLESADIDSKDDLKSLLLVDSALRITALGDTVTIQALSDNGASL
LPLLDTALPAGVENDVLPAGRVLRFPPVSPLLDEDARLCSLSVFDAFRLLQGVVNIPTQEREAMFFGGLFAYDLVAGFEA
LPHLEAGNNCPDYCFYLAETLMVIDHQKKSTRIQASLFTASDREKQRLNARLAYLSQQLTQPAPPLPVTPVPDMRCECNQ
SDDAFGAVVRQLQKAIRAGEIFQVVPSRRFSLPCPSPLAAYYVLKKSNPSPYMFFMQDNDFTLFGASPESSLKYDAASRQ
IEIYPIAGTRPRGRRADGTLDRDLDSRIELDMRTDHKELSEHLMLVDLARNDLARICTPGSRYVADLTKVDRYSYVMHLV
SRVVGELRHDLDALHAYRACMNMGTLSGAPKVRAMQLIADAEGQRRGSYGGAVGYFTAHGDLDTCIVIRSALVENGIATV
QAGAGIVLDSVPQSEADETRNKARAVLRAIATAHHAQETF
>P00897 4.1.3.27~~~trpE~~~Anthranilate synthase component 1~~~
MNTKPQLTLLKVQASYRGDPTTLFHQLCGARPATLLLESAEINDKQNLQSLLVIDSALPITALGHTVSVQALTANGPALL
PVLDEALPPEVRNQARPNGRELTFPAIDAVQDEDARLRSLSVFDALRTLLTLVDSPADEREAVMLGGLFAYDLVAGFENL
PAVRQDQRCPDFCFYLAETLLVLDHQRGSARLQASVFSEQASEAQRLQHRLEQLQAELQQPPQPIPHQKLENMQLSCNQS
DEEYGAVVSELQEAIRQGEIFQVVPSRRFSLPCPAPLGPYQTLKDNNPSPYMFFMQDDDFTLFGASPESALKYDAGNRQI
EIYPIAGTRPRGRRADGSLDLDLDSRIELEMRTDHKELAEHLMLVDLARNDLARICQAGSRYVADLTKVDRYSFVMHLVS
RVVGTLRADLDVLHAYQACMNMGTLSGAPKVRAMQLIAALRSTRRGSYGGRVGYFTAHRHLDTCIVIRSAYVEDGHRTVQ
AGAGVVQDSIRRREADETRNKARAVLRAIATAHHAKEVF
>Q56320 5.3.1.24~~~trpF~~~N-(5'-phosphoribosyl)anthranilate isomerase~~~COG0135
MVRVKICGITNLEDALFSVESGADAVGFVFYPKSKRYISPEDARRISVELPPFVFRVGVFVNEEPEKILDVASYVQLNAV
QLHGEEPIELCRKIAERILVIKAVGVSNERDMERALNYREFPILLDTKTPEYGGSGKTFDWSLILPYRDRFRYLVLSGGL
NPENVRSAIDVVRPFAVDVSSGVEAFPGKKDHDSIKMFIKNAKGL
>P00904 ~~~trpGD~~~Bifunctional protein TrpGD~~~COG0512
MADILLLDNIDSFTYNLADQLRSNGHNVVIYRNHIPAQTLIERLATMSNPVLMLSPGPGVPSEAGCMPELLTRLRGKLPI
IGICLGHQAIVEAYGGYVGQAGEILHGKASSIEHDGQAMFAGLTNPLPVARYHSLVGSNIPAGLTINAHFNGMVMAVRHD
ADRVCGFQFHPESILTTQGARLLEQTLAWAQQKLEPANTLQPILEKLYQAQTLSQQESHQLFSAVVRGELKPEQLAAALV
SMKIRGEHPNEIAGAATALLENAAPFPRPDYLFADIVGTGGDGSNSINISTASAFVAAACGLKVAKHGNRSVSSKSGSSD
LLAAFGINLDMNADKSRQALDELGVCFLFAPKYHTGFRHAMPVRQQLKTRTLFNVLGPLINPAHPPLALIGVYSPELVLP
IAETLRVLGYQRAAVVHSGGMDEVSLHAPTIVAELHDGEIKSYQLTAEDFGLTPYHQEQLAGGTPEENRDILTRLLQGKG
DAAHEAAVAANVAMLMRLHGHEDLQANAQTVLEVLRSGSAYDRVTALAARG
>P00905 ~~~trpGD~~~Bifunctional protein TrpGD~~~
MADILLLDNIDSFTWNLADQLRTNGHNVVIYRNHIPAQTLIDRLATMKNPVLMLSPGPGVPSEAGCMPELLTRLRGKLPI
IGICLGHQAIVEAYGGYVGQAGEILHGKASSIEHDGQAMFAGLANPLPVARYHSLVGSNVPAGLTINAHFNGMVMAVRHD
ADRVCGFQFHPESILTTQGARLLEQTLAWAQQKLEPTNTLQPILEKLYQAQTLTQQESHQLFSAVVRGELKPEQLAAALV
SMKIRGEHPNEIAGAATALLENAAPFPRPEYLFADIVGTGGDGSNSINISTASAFVAAACGLKVAKHGNRSVSSKSGSSD
LLAAFGINLDMNADKSRQALDELGVCFLFAPKYHTGLRHAMPVRQQLKTRTLFNVLGPLINPAHPPLALIGVYSPELVLP
IAETLRVLGYQRAAVVHSGGMDEVSLHAPTIVAELHDGEIKSYQLTAEDFGLTPYHQDQLAGGTPEENRDILTRLLQGKG
DAAHEAAVAANVAMLMRLHGQEDLKANAQTVLDVLRNGTAYDRVTALAARG
>P26922 4.1.3.27~~~trpG~~~Anthranilate synthase component 2~~~
MLLLIDNYDSFTYNLVHYLGELGAELDVRRNDSLTVEEAMALRPEGIVLSPGPCDPDKAGICLPLIDAAAKAAVPLMGVC
LGHQAIGQPFGGTVVRAPVPMHGKVDRMFHQGRGVLKDLPSPFRATRYHSLIVERATLPACLEVTGETEDGLIMALSHRE
LPIHGVQFHPESIESEHGHKILENFLNTTRRLETAA
>P9WN35 4.1.3.27~~~trpG~~~Anthranilate synthase component 2~~~COG0512
MRILVVDNYDSFVFNLVQYLGQLGIEAEVWRNDDHRLSDEAAVAGQFDGVLLSPGPGTPERAGASVSIVHACAAAHTPLL
GVCLGHQAIGVAFGATVDRAPELLHGKTSSVFHTNVGVLQGLPDPFTATRYHSLTILPKSLPAVLRVTARTSSGVIMAVQ
HTGLPIHGVQFHPESILTEGGHRILANWLTCCGWTQDDTLVRRLENEVLTAISPHFPTSTASAGEATGRTSA
>P20576 4.1.3.27~~~trpG~~~Anthranilate synthase component 2~~~
MLLMIDNYDSFTYNLVQYFGELKAEVKVVRNDELSVEQIEALAPERIVLSPGPCTPNEAGVSLAVIERFAGKLPLLGVCL
GHQSIGQAFGGEVVRARQVMHGKTSPIHHKDLGVFAGLANPLTVTRYHSLVVKRESLPECLEVTAWTQHADGSLDEIMGV
RHKTLNVEGVQFHPESILTEQGHELLANFLRQQGGVRGEGN
>P00901 4.1.3.27~~~trpG~~~Anthranilate synthase component 2~~~COG0512
MLLMMIDNYDSFTYNVVQYLGELGAEVKVIRNDEMTIAQIEALNPERIVVSPGPCTPSEAGVSIEAILHFAGKLPILGVC
LGHQSIGQAFGGDVVRARQVMHGKTSPVHHRDLGVFTGLNNPLTVTRYHSLVVKRETLPDCLEVTAWTAHEDGSVDEIMG
LRHKTLNIEGVQFHPESILTEQGHELFANFLKQTGGRR
>P00900 4.1.3.27~~~trpG~~~Anthranilate synthase component 2~~~
MADILLLDNVDSFTYNLVDQLRASGHQVVIYRNQIGAEVIIERLQHMEQPVLMLSPGPGTPSEAGCMPELLQRLRGQLPI
IGICLGHQAIVEAYGGQVGQAGEILHGKASAIAHDGEGMFAGMANPLPVARYHSLVGSNIPADLTVNARSGEMVMAVRDD
RRRVCGFQFHPESILTTHGARLLEQTLAWALAK
>P11720 ~~~trpI~~~HTH-type transcriptional regulator TrpI~~~
MSRDLPSLNALRAFEAAARLHSISLAAEELHVTHGAVSRQVRLLEDDLGVALFGKDGRGVKLTDSGVRLRDACGDAFERL
RGVCAELRRQTAEAPFVLGVPGSLLARWFIPRLDQLNRALPDLRLQLSTSEGEFDPRRPGLDAMLWFAEPPWPADMQVFE
LAPERMGPVVSPRLAQETGLAQAPAARLLQEPLLHTASRPQAWPAWAASQGLAAEALRYGQGFEHLYYLLEAAVAGLGVA
IAPEPLVRDDLAAGRLAAPWGFIETDARLALWVPARLHDPRAGRLAQWLREQLAG
>P24908 ~~~~~~Putative transcriptional regulator~~~
MSKVLIVDDHPAIRLAVRLLFERDGFTIVGEADNGAEALQVARKKSPDLAILDIGIPKIDGLEVIARLKSLKLDTKVLVL
TRQNPAQFAPRCLQAGAMGFVSKRENLSELLLAAKAVLAGYIHFPTGALRSINQQSRDNEARMLESLSDREMTVLQYLAN
GNTNKAIAQQLFLSEKTVSTYKSRIMLKLNAHSLAGLIDFARRHELI
>O07515 ~~~trpP~~~Probable tryptophan transport protein~~~
MKTKELVIMALFAAIGAALHSIIPPFLGGMKPDMMLIMMFMGILLFPRVQNVLVIGIVTGIISALTTAFPAGQIPNIIDK
PVSAFLFFSLFLLFRKSRKTGAAAVLTVIGTILSGIVFLSSALLIVGLPGGFAALFAAVVLPAAVLNTISMIIIYPIVQT
ILRRSSFMEAAK
>P0A882 ~~~trpR~~~Trp operon repressor~~~COG2973
MAQQSPYSAAMAEQRHQEWLRFVDLLKNAYQNDLHLPLLNLMLTPDEREALGTRVRIVEELLRGEMSQRELKNELGAGIA
TITRGSNSLKAAPVELRQWLEEVLLKSD
>P0A881 ~~~trpR~~~Trp operon repressor~~~COG2973
MAQQSPYSAAMAEQRHQEWLRFVDLLKNAYQNDLHLPLLNLMLTPDEREALGTRVRIVEELLRGEMSQRELKNELGAGIA
TITRGSNSLKAAPVELRQWLEEVLLKSD
>P80436 6.3.2.-~~~trsA~~~Triostin synthetase I~~~
MLDGFVPWPDHLADEYRRRGIWLGRPLGDLLHDSCRRHADRVAVVCDGHRMTYAELSRRADRLAGGLIGLGIRPLDRVVV
HLPNIPEFVVLVFALLRAGAIPVLALPGHRKSEISHLCAHSGAVAYAVKDEFGGFDYRELAREIPPVRHVLVSGDAQEFT
ALESVGGDDVPLPRVDPSDPALFLLSGGTTGLPKLIPRAHDDYAYVMRATAEAMHVGEEVAYLAVNPVAHQAALACPGVF
GSLLLGGKAVLTSSVRPDEVFPLIRREHVTVTTVVPSVLRLWADSGQRPDLSHLLVQVGSAPLDPALARRAGEVLGCRIM
RWYGISEGLLTHTRFDDPEDVIMGTDGRPMSRDDEVRIVDESLNPVPEGEAGEMIARGPYTIRGYYRAPEENTRSFTPDG
FFRTGDLVRRSPEGDITIVGRIKDVINRAGEKVSAEEVERQLRTHPSVQDAAVVGVPDTVLGERTYAFLVLTGAQIRTSA
VKEFLRGCGLATYKIPDRIVPLDQLPRTPMGKVDKKTLRALAVSSAR
>A0A2N6JFX7 3.5.1.-~~~trtA~~~Triuret hydrolase TrtA~~~
MIRIDATPYPYQFHPRSTALVVIDMQRDFIEEGGFGSALGNDVRPLAAIVPTVAALLQLAREAGMLVVHTRESHLPDLSD
CPRSKRLRGNPTLGIGDVGPMGRILVQGEPGNQILPQLAPVEGELVIDKPGKGAFYATDLHAQLQERRITHLLVAGVTTE
VCVQTSMREANDRGYECLVIEDACASYFPDFHRITLEMLTAQGGIVGWRTPLAQLQAGVAAYTGENP
>Q72DL8 5.4.99.12~~~truA~~~tRNA pseudouridine synthase A~~~COG0101
MARLRLTIAYKGTDLHGWQVQEHATRPRPRTVQGVLEPIVSRMAGEQVRLHAAGRTDAGVHADGQVAHVDIPDHKLGVDW
QKAINAQLPDDICILDVRRAADDFHARFDALGKRYTYRLWLTRRFIPPKLHGQVWATGPLDVYAMDRAARHLAGTHDFAA
FQNQGTDVTSTVRTVHAIRRCPSGTLPAGALLTCSEPYTSWRCTGTHPDQPPATAGHPLAGIGLELVWSFEGDGFLKQMV
RNMMGLLVAVGRGALAADDVPGIMATLDRSRAPATAPACGLTLSEVYYPPCDYPYAR
>P07649 5.4.99.12~~~truA~~~tRNA pseudouridine synthase A~~~COG0101
MSDQQQPPVYKIALGIEYDGSKYYGWQRQNEVRSVQEKLEKALSQVANEPITVFCAGRTDAGVHGTGQVVHFETTALRKD
AAWTLGVNANLPGDIAVRWVKTVPDDFHARFSATARRYRYIIYNHRLRPAVLSKGVTHFYEPLDAERMHRAAQCLLGEND
FTSFRAVQCQSRTPWRNVMHINVTRHGPYVVVDIKANAFVHHMVRNIVGSLMEVGAHNQPESWIAELLAAKDRTLAAATA
KAEGLYLVAVDYPDRYDLPKPPMGPLFLAD
>P9WHP9 5.4.99.12~~~truA~~~tRNA pseudouridine synthase A~~~COG0101
MSLTRRPPKSPPQRPPRISGVVRLRLDIAYDGTDFAGWAAQVGQRTVAGDLDAALTTIFRTPVRLRAAGRTDAGVHASGQ
VAHVDVPADALPNAYPRAGHVGDPEFLPLLRRLGRFLPADVRILDITRAPAGFDARFSALRRHYVYRLSTAPYGVEPQQA
RYITAWPRELDLDAMTAASRDLMGLHDFAAFCRHREGATTIRDLQRLDWSRAGTLVTAHVTADAFCWSMVRSLVGALLAV
GEHRRATTWCRELLTATGRSSDFAVAPAHGLTLIQVDYPPDDQLASRNLVTRDVRSG
>Q5SHU9 5.4.99.12~~~truA~~~tRNA pseudouridine synthase A~~~COG0101
MRRLLLLCEYDGTLFAGLQRQGRGLRTVQGELERALPGIGALPKAVAAGRTDAGVHALAMPFHVDVESAIPVEKVPEALN
RLLPEDLKVVGAREVAPDFHARKDALWRAYRYRILVRPHPSPLLRHRALWVRRPLDLEAMEEALSLLLGRHNFLGFAKEE
TRPGERELLEARLQVAEGEAGLEVRLYFRGKSFLRGQVRGMVGTLLEVGLGKRPPESLKAILKTADRRLAGPTAPAHGLY
FVEAAYPEEKLSP
>P60340 5.4.99.25~~~truB~~~tRNA pseudouridine synthase B~~~COG0130
MSRPRRRGRDINGVLLLDKPQGMSSNDALQKVKRIYNANRAGHTGALDPLATGMLPICLGEATKFSQYLLDSDKRYRVIA
RLGQRTDTSDADGQIVEERPVTFSAEQLAAALDTFRGDIEQIPSMYSALKYQGKKLYEYARQGIEVPREARPITVYELLF
IRHEGNELELEIHCSKGTYIRTIIDDLGEKLGCGAHVIYLRRLAVSKYPVERMVTLEHLRELVEQAEQQDIPAAELLDPL
LMPMDSPASDYPVVNLPLTSSVYFKNGNPVRTSGAPLEGLVRVTEGENGKFIGMGEIDDEGRVAPRRLVVEYPA
>P9WHP7 5.4.99.25~~~truB~~~tRNA pseudouridine synthase B~~~COG0130
MSATGPGIVVIDKPAGMTSHDVVGRCRRIFATRRVGHAGTLDPMATGVLVIGIERATKILGLLTAAPKSYAATIRLGQTT
STEDAEGQVLQSVPAKHLTIEAIDAAMERLRGEIRQVPSSVSAIKVGGRRAYRLARQGRSVQLEARPIRIDRFELLAARR
RDQLIDIDVEIDCSSGTYIRALARDLGDALGVGGHVTALRRTRVGRFELDQARSLDDLAERPALSLSLDEACLLMFARRD
LTAAEASAAANGRSLPAVGIDGVYAACDADGRVIALLRDEGSRTRSVAVLRPATMHPG
>P65855 5.4.99.25~~~truB~~~tRNA pseudouridine synthase B~~~
MYNGILPVYKERGLTSHDVVFKLRKILKTKKIGHTGTLDPEVAGVLPVCIGNATRVSDYVMDMGKAYEATVSIGRSTTTE
DQTGDTLETKGVHSADFNKDDIDRLLENFKGVIEQIPPMYSSVKVNGKKLYEYARNNETVERPKRKVNIKDIGRISELDF
KENECHFKIRVICGKGTYIRTLATDIGVKLGFPAHMSKLTRIESGGFVLKDSLTLEQIKELHEQDSLQNKLFPLEYGLKG
LPSIKIKDSHIKKRILNGQKFNKNEFDNKIKDQIVFIDDDSEKVLAIYMVHPTKESEIKPKKVFN
>Q97QJ3 5.4.99.25~~~truB~~~tRNA pseudouridine synthase B~~~COG0130
MNGIINLKKEAGMTSHDAVFKLRKILGTKKIGHGGTLDPDVVGVLPIAVGKATRMVEFMQDEGKIYEGEITLGYSTKTED
ASGEVVAETPVLSLLDEKLVDEAIASLTGPITQIPPMYSAVKVNGRKLYEYARAGQEVERPERQVTIYQFERTSPISYDG
QLARFTFRVKCSKGTYIRTLSVDLGEKLGYAAHMSHLTRTSAAGLQLEDALALEEIAEKVEAGQLDFLHPLEIGTGDLVK
VFLSPEEATEVRFGRFIELDQTDKELAAFEDDKLLAILEKRGNLYKPRKVFS
>Q9WZW0 5.4.99.25~~~truB~~~tRNA pseudouridine synthase B~~~COG0130
MKHGILVAYKPKGPTSHDVVDEVRKKLKTRKVGHGGTLDPFACGVLIIGVNQGTRILEFYKDLKKVYWVKMRLGLITETF
DITGEVVEERECNVTEEEIREAIFSFVGEYDQVPPAYSAKKYKGERLYKLAREGKIINLPPKRVKIFKIWDVNIEGRDVS
FRVEVSPGTYIRSLCMDIGYKLGCGATAVELVRESVGPHTIEESLNVFEAAPEEIENRIIPLEKCLEWLPRVVVHQESTK
MILNGSQIHLEMLKEWDGFKKGEVVRVFNEEGRLLALAEAERNSSFLETLRKHERNERVLTLRKVFNTR
>P0AA41 5.4.99.26~~~truC~~~tRNA pseudouridine synthase C~~~COG0564
MLEILYQDEWLVAVNKPSGWLVHRSWLDRDEKVVVMQTVRDQIGQHVFTAHRLDRPTSGVLLMGLSSEAGRLLAQQFEQH
QIQKRYHAIVRGWLMEEAVLDYPLVEELDKIADKFAREDKGPQPAVTHYRGLATVEMPVATGRYPTTRYGLVELEPKTGR
KHQLRRHLAHLRHPIIGDSKHGDLRQNRSGAEHFGLQRLMLHASQLSLTHPFTGEPLTIHAGLDDTWMQALSQFGWRGLL
PENERVEFSAPSGQDGEISS
>Q57261 5.4.99.27~~~truD~~~tRNA pseudouridine synthase D~~~COG0585
MIEFDNLTYLHGKPQGTGLLKANPEDFVVVEDLGFEPDGEGEHILVRILKNGCNTRFVADALAKFLKIHAREVSFAGQKD
KHAVTEQWLCARVPGKEMPDLSAFQLEGCQVLEYARHKRKLRLGALKGNAFTLVLREVSNRDDVEQRLIDICVKGVPNYF
GAQRFGIGGSNLQGAQRWAQTNTPVRDRNKRSFWLSAARSALFNQIVAERLKKADVNQVVDGDALQLAGRGSWFVATTEE
LAELQRRVNDKELMITAALPGSGEWGTQREALAFEQAAVAAETELQALLVREKVEAARRAMLLYPQQLSWNWWDDVTVEI
RFWLPAGSFATSVVRELINTTGDYAHIAE
>P44039 5.4.99.27~~~truD~~~tRNA pseudouridine synthase D~~~COG0585
MLEQLPYLALKTPPKTTALLKAECADFIVKEHLGYEMSGDGEFVALYVRKTDCNTLFVGEKLAKFAGVSERNMGYAGLKD
RRAVTEQWFCLQMPGMETPDFSQFELDGVEILTVTRHNRKIRTGSLEGNYFDILLRGAEESDELKARLDFVANFGFPNYF
TEQRFGRDGHNLTQALRWAQGEIKVKDRKKRSFYLSAARSEIFNLVVAARIEKSTINQVLPNDIVQLAGSHSWFKADEKE
DLTALQVRLENQDILLTAPLIGEDILAASEIENEIVNQHSAFDPLMKQERMKAARRPLLMKAKGFSWAFEPEGLRLKFYL
PAGSYATALVRELVNYTEE
>P80880 1.8.1.9~~~trxB~~~Thioredoxin reductase~~~COG0492
MSEEKIYDVIIIGAGPAGMTAAVYTSRANLSTLMIERGIPGGQMANTEDVENYPGFESILGPELSNKMFEHAKKFGAEYA
YGDIKEVIDGKEYKVVKAGSKEYKARAVIIAAGAEYKKIGVPGEKELGGRGVSYCAVCDGAFFKGKELVVVGGGDSAVEE
GVYLTRFASKVTIVHRRDKLRAQSILQARAFDNEKVDFLWNKTVKEIHEENGKVGNVTLVDTVTGEESEFKTDGVFIYIG
MLPLSKPFENLGITNEEGYIETNDRMETKVEGIFAAGDIREKSLRQIVTATGDGSIAAQSVQHYVEELQETLKTLK
>P0A9P4 1.8.1.9~~~trxB~~~Thioredoxin reductase~~~COG0492
MGTTKHSKLLILGSGPAGYTAAVYAARANLQPVLITGMEKGGQLTTTTEVENWPGDPNDLTGPLLMERMHEHATKFETEI
IFDHINKVDLQNRPFRLNGDNGEYTCDALIIATGASARYLGLPSEEAFKGRGVSACATCDGFFYRNQKVAVIGGGNTAVE
EALYLSNIASEVHLIHRRDGFRAEKILIKRLMDKVENGNIILHTNRTLEEVTGDQMGVTGVRLRDTQNSDNIESLDVAGL
FVAIGHSPNTAIFEGQLELENGYIKVQSGIHGNATQTSIPGVFAAGDVMDHIYRQAITSAGTGCMAALDAERYLDGLADA
K
>P43788 1.8.1.9~~~trxB~~~Thioredoxin reductase~~~COG0492
MSDIKHAKLLILGSGPAGYTAAIYAARANLKPVLVTGLQQGGQLTTTDEIENWPGDFEMTTGSGLMQRMLQHAEKFETEI
VFDHINRVDLSSRPFKLFGDVQNFTCDALIIATGASARYIGLPSEENYKGRGVSACATCDGFFYRNKPVGVIGGGNTAVE
EALYLANIASTVHLIHRRDSFRAEKILIDRLYKKVEEGKIVLHTDRTLDEVLGDNMGVTGLRLANTKTGEKEELKLDGLF
VAIGHSPNTEIFQGQLELNNGYIVVKSGLDGNATATSVEGVFAAGDVMDHNYRQAITSAGTGCMAALDAERYLDAQEA
>P56431 1.8.1.9~~~trxB~~~Thioredoxin reductase~~~COG0492
MIDCAIIGGGPAGLSAGLYATRGGVKNAVLFEKGMPGGQITGSSEIENYPGVKEVVSGLDFMQPWQEQCFRFGLKHEMTA
VQRVSKKDSHFVILAEDGKTFEAKSVIIATGGSPKRTGIKGESEYWGKGVSTCATCDGFFYKNKEVAVLGGGDTAVEEAI
YLANICKKVYLIHRRDGFRCAPITLEHAKNNDKIEFLTPYVVEEIKGDASGVSSLSIKNTATNEKRELVVPGFFIFVGYD
VNNAVLKQEDNSMLCKCDEYGSIVVDFSMKTNVQGLFAAGDIRIFAPKQVVCAASDGATAALSVISYLEHH
>P9WHH1 1.8.1.9~~~trxB~~~Thioredoxin reductase~~~COG0492
MTAPPVHDRAHHPVRDVIVIGSGPAGYTAALYAARAQLAPLVFEGTSFGGALMTTTDVENYPGFRNGITGPELMDEMREQ
ALRFGADLRMEDVESVSLHGPLKSVVTADGQTHRARAVILAMGAAARYLQVPGEQELLGRGVSSCATCDGFFFRDQDIAV
IGGGDSAMEEATFLTRFARSVTLVHRRDEFRASKIMLDRARNNDKIRFLTNHTVVAVDGDTTVTGLRVRDTNTGAETTLP
VTGVFVAIGHEPRSGLVREAIDVDPDGYVLVQGRTTSTSLPGVFAAGDLVDRTYRQAVTAAGSGCAAAIDAERWLAEHAA
TGEADSTDALIGAQR
>P50971 1.8.1.9~~~trxB~~~Thioredoxin reductase~~~
MENVYDLAIIGSGPAGLAAALYGARAKMKTIMIEGQKVGGQIVITHEVANYPGSVREATGPSLIERMEEQANEFGAEKVM
DKIVDVDLDGKIKVIKGEKAEYKAKSVILATGAAPRLAGCPGEQELTGKGVSYCATCDADFFEDMEVFVVGGGDTAVEEA
MYLAKFARKVTIVHRRDELRAAKSIQEKAFKNPKLDFMWNSAIEEIKGDGIVESAVFKNLVTGETTEYFANEEDGTFGIF
VFIGYIPKSDVFKGKITLDDAGYIITDDNMKTNVEGVFAAGDIRVKSLRQVVTACADGAIAATQAEKYVEANFEE
>P99101 1.8.1.9~~~trxB~~~Thioredoxin reductase~~~
MTEIDFDIAIIGAGPAGMTAAVYASRANLKTVMIERGIPGGQMANTEEVENFPGFEMITGPDLSTKMFEHAKKFGAVYQY
GDIKSVEDKGEYKVINFGNKELTAKAVIIATGAEYKKIGVPGEQELGGRGVSYCAVCDGAFFKNKRLFVIGGGDSAVEEG
TFLTKFADKVTIVHRRDELRAQRILQDRAFKNDKIDFIWSHTLKSINEKDGKVGSVTLTSTKDGSEETHEADGVFIYIGM
KPLTAPFKDLGITNDVGYIVTKDDMTTSVPGIFAAGDVRDKGLRQIVTATGDGSIAAQSAAEYIEHLNDQA
>P66011 1.8.1.9~~~trxB~~~Thioredoxin reductase~~~
MTEIDFDIAIIGAGPAGMTAAVYASRANLKTVMIERGIPGGQMANTEEVENFPGFEMITGPDLSTKMFEHAKKFGAVYQY
GDIKSVEDKGEYKVINFGNKELTAKAVIIATGAEYKKIGVPGEQELGGRGVSYCAVCDGAFFKNKRLFVIGGGDSAVEEG
TFLTKFADKVTIVHRRDELRAQRILQDRAFKNDKIDFIWSHTLKSINEKDGKVGSVTLTSTKDGSEETHEADGVFIYIGM
KPLTAPFKDLGITNDVGYIVTKDDMTTSVPGIFAAGDVRDKGLRQIVTATGDGSIAAQSAAEYIEHLNDQA
>Q05741 1.8.1.9~~~trxB~~~Thioredoxin reductase~~~COG0492
MSDVRNVIIIGSGPAGYTAALYTARASLQPLVFEGAVTAGGALMNTTDVENFPGFRDGIMGPDLMDNMRAQAERFGAELI
PDDVVSVDLTGDIKTVTDSAGTVHRAKAVIVTTGSQHRKLGLPREDALSGRGVSWCATCDGFFFKDQDIVVVGGGDTAME
EATFLSRFAKSVTIVHRRDSLRASKAMQDRAFADPKISFAWNSEVATIHGEQKLTGLTLRDTKTGETRELAATGLFIAVG
HDPRTELFKGQLDLDDEGYLKVASPSTRTNLTGVFAAGDVVDHTYRQAITAAGTGCSAALDAERYLAALADSEQIAEPAP
AV
>P52215 1.8.1.9~~~trxB~~~Thioredoxin reductase~~~COG0492
MSDVRNVIIIGSGPAGYTAALYTARASLKPLVFEGAVTAGGALMNTTEVENFPGFQDGIMGPELMDNMRAQAERFGAELI
PDDVVAVDLSGEIKTVTDTAGTVHRAKAVIVTTGSQHRKLGLPNEDALSGRGVSWCATCDGFFFKDQDIAVIGGGDTAME
EATFLSRFAKSVTIVHRRDTLRASKAMQERAFADPKISFVWDSEVAEVQGDQKLAGLKLRNVKTGELSDLPVTGLFIAIG
HDPRTELFKGQLDLDPEGYLKVDAPSTRTNLTGVFGAGDVVDHTYRQAITAAGTGCSAAVDAEPFLAALSDEDKAEPEKT
AV
>P24664 3.4.21.4~~~~~~Trypsin~~~
IVGGEDANVQDHPFTVALVTPDGQQFCGGTLAAPNKVVTAAHCTVGSQPADINVVSGRTVMSSNIGTVSKVTNVWVHPEY
QDAAKGFDVSVLTLEAPVKEAPIELAKADDAGYAPDTAATILGWGNTSEGGQQADHLQKATVPVNSDDTCKQAYGEYTPN
AMVCAGVPEGGVDTCQGDSGGPMVVNNKLIGVTSWGEGCARPGKPGVYARVGAYYDVLMEQINAGAV
>P80420 3.4.21.-~~~tlp~~~Trypsin-like protease~~~
MLTVTTLVQLMKRTLAVGAVALAAVSLQPGTATAGPAPVVGGTRAAQGEFPFMVRLSMGCGGALYTQQIVLTAAHCVSGS
GNNTSITATAGVVDLNSSSAIKVKSTKVLQAPGYNGKGKDWALIKLAKPINLPTLKIADTKAYDNGTFTVAGWGAAREGG
GQQRYLLKANVPFVSDASCQSSYGSDLVPSEEICAGLPQGGVDTCQGDSGGPMFRRDNNNAWIQVGIVSWGEGCARPNYP
GVYTEVSTFAAAIKSAAAGM
>P00775 3.4.21.4~~~sprT~~~Trypsin~~~
MKHFLRALKRCSVAVATVAIAVVGLQPVTASAAPNPVVGGTRAAQGEFPFMVRLSMGCGGALYAQDIVLTAAHCVSGSGN
NTSITATGGVVDLQSSSAVKVRSTKVLQAPGYNGTGKDWALIKLAQPINQPTLKIATTTAYNQGTFTVAGWGANREGGSQ
QRYLLKANVPFVSDAACRSAYGNELVANEEICAGYPDTGGVDTCQGDSGGPMFRKDNADEWIQVGIVSWGYGCARPGYPG
VYTEVSTFASAIASAARTL
>Q52725 3.8.1.-~~~trzA~~~S-triazine hydrolase~~~
MTRIAITGGRVLTMDPERRVLEPGTVVVEDQFIAQVGSPTTSTSAAPKSSTPPGWQCSPASSTPTPTSHKSSSGVVHPMT
ATSSNGCTTCSIPASLPTQTTTSESEHCCTAPKPFVLASPLSSTTRTSDPTTSPAPGPPGSPFTDAGIRAIYARMYFDAP
RAELEELVATIHAKAPGAVRMDESASTDHVLADLDQLITRHDRTADGRIRVWPAPAIPFMVSEKGMKAAQEIAASRTDGW
TMHVSEDPIEARVHSMNAPEYLHHLGCLDDRLLAAHCVHIDSRDIRLFRQHDVKISTQPVSNSYLAAGIAPVPEMLAHGV
TVGIGTDDANCNDSVNLISDMKVLALIHRAAHRDASIITPEKIIEMATIDGARCIGMADQIGSLEAGKRADIITLDLRHA
QTTPAHDLAATIVFQAYGNEVNDVLVNGSVVMRDRVLSFLPTPQEEKALYDDASERSAAMLARAGLTGTRTWQTLGS
>P94680 1.5.1.-~~~tsaB1~~~Toluene-4-sulfonate monooxygenase system reductase subunit TsaB1~~~
MSADVPVTVAAVRAVARDVLALELRHANGQPLPGASAGAHIDLALPNGLVRQYSLVNATGQATMDCYQVAVGWDANSRGG
SVWIHEKLKVGQALRVTHRATCSEMAPEHRRVLLLAGGIGVTPIYAMAQACAQQGVDVELWASARSAPRLAYLEELKALL
GQRLHLHADDEQGGPMNLTERLATQRWDAVYACGPAPMLDALTAATAHWAPGSVRMERFKGAEQPASERQPFELVLQRAG
LSTTVDAHESVLDAMERVGVDFPWSCREGICGTCEAPVLEGEVQHLDYVLSPEERAEQRRMMVCVSRCGGGRLVLDI
>O05516 ~~~tsaB~~~tRNA threonylcarbamoyladenosine biosynthesis protein TsaB~~~COG1214
MTILAIDTSNYTLGIALLREDTVIAEYITYLKKNHSVRAMPAVHSLLNDCDMAPQDLSKIVVAKGPGSYTGVRIGVTLAK
TLAWSLDIPISAVSSLETLAANGRHFDGLISPIFDARRGQVYTGLYQYKNGLLEQVVPDQNVMLADWLEMLKEKDRPVLF
LGHDTSLHKQMIEDVLGTKGFIGTAAQHNPRPSELAFLGKEKEAADVHGLVPDYLRLAEAEAKWIESQK
>P76256 ~~~tsaB~~~tRNA threonylcarbamoyladenosine biosynthesis protein TsaB~~~COG1214
MRILAIDTATEACSVALWNDGTVNAHFELCPREHTQRILPMVQDILTTSGTSLTDINALAYGRGPGSFTGVRIGIGIAQG
LALGAELPMIGVSTLMTMAQGAWRKNGATRVLAAIDARMGEVYWAEYQRDENGIWHGEETEAVLKPEIVHERMQQLSGEW
VTVGTGWQAWPDLGKESGLVLRDGEVLLPAAEDMLPIACQMFAEGKTVAVEHAEPVYLRNNVAWKKLPGKE
>P43990 ~~~tsaB~~~tRNA threonylcarbamoyladenosine biosynthesis protein TsaB~~~COG1214
MQNLTLLALDTSTEACSVALLYRGEKTHINELAQRTHTKRILPMIDEILANSGLGLNQVDALAFGRGPGSFTGVRVGAGI
AQGLAFGADLPVIPISNLTAMAQAAFELHQAENVVAAIDARMNEVYFSQVVREKVRSDFGEFFQWREIISEQVCSPEQAI
NQLQNDNAFRVGTGWAAYSQFTEKNLTGSDIALPNALYMLELAQVEYLQKHTISALEIEPIYLRNEVTWKKLPGRE
>Q7CQE0 ~~~tsaB~~~tRNA threonylcarbamoyladenosine biosynthesis protein TsaB~~~
MRILAIDTATEACSVALWNNGTINAHFELCPREHTQRILPMVQEILAASGASLNEIDALAFGRGPGSFTGVRIGIGIAQG
LALGANLPMIGVSTLATMAQGAWRKTGATRVLAAIDARMGEVYWAEYQRDAQGVWQGEETEAVLKPERVGERLKQLSGEW
ATVGTGWSAWPDLAKECGLTLHDGEVSLPAAEDMLPIASQKLAAGETVAVEHAEPVYLRNEVAWKKLPGKE
>Q9WZX7 ~~~tsaB~~~tRNA threonylcarbamoyladenosine biosynthesis protein TsaB~~~COG1214
MNVLALDTSQRIRIGLRKGEDLFEISYTGEKKHAEILPVVVKKLLDELDLKVKDLDVVGVGIGPGGLTGLRVGIATVVGL
VSPYDIPVAPLNSFEMTAKSCPADGVVLVARRARKGYHYCAVYLKDKGLNPLKEPSVVSDEELEEITKEFSPKIVLKDDL
LISPAVLVEESERLFREKKTIHYYEIEPLYLQKSIAELNWEKKKRG
>Q87RD1 ~~~tsaB~~~tRNA threonylcarbamoyladenosine biosynthesis protein TsaB~~~COG1214
MSAKILAIDTATENCSVALLVNDQVISRSEVAPRDHTKKVLPMVDEVLKEAGLTLQDLDALAFGRGPGSFTGVRIGIGIA
QGLAFGAELPMIGVSTLAAMAQASYRLHGATDVAVAIDARMSEVYWARYSRQENGEWIGVDEECVIPPARLAEEAQADSK
TWTTAGTGWSAYQEELAGLPFNTADSEVLYPDSQDIVILAKQELEKGNTVPVEESSPVYLRDNVTWKKLPGRE
>P94681 1.2.1.62~~~tsaC1~~~4-formylbenzenesulfonate dehydrogenase TsaC1/TsaC2~~~
MNLNKQVAIVTGGASGFGAAIARRLSQAGAAVLVADLNAEGAQRMATELNAAGGRALGMACDVSKEADYRAVVDAAIAQL
GGLHIVVNNAGTTHRNKPALAVTEDEFDRVYRVNLKSVYWSAQCALPHFAQQGHGVMVNVASTTGVRPGPGLTWYSGSKA
AMINLTKGLALEFARSGVRINAVNPMIGETPMMADFMGMEDTPANRERFLSRIPLGRFTRPDDVASAVAFLASDDASFLT
GVCLDVDGGRNI
>P45748 2.7.7.87~~~tsaC~~~Threonylcarbamoyl-AMP synthase~~~COG0009
MNNNLQRDAIAAAIDVLNEERVIAYPTEAVFGVGCDPDSETAVMRLLELKQRPVDKGLILIAANYEQLKPYIDDTMLTDV
QRETIFSRWPGPVTFVFPAPATTPRWLTGRFDSLAVRVTDHPLVVALCQAYGKPLVSTSANLSGLPPCRTVDEVRAQFGA
AFPVVPGETGGRLNPSEIRDALTGELFRQG
>P44807 2.7.7.87~~~tsaC~~~Threonylcarbamoyl-AMP synthase~~~COG0009
MNREQIADALRQNQVVAYPTEAVFGLGCNPQSESAVKKLLDLKQRPVEKGLILVAPSLDFFRPFVDFEQINDEQLSRLQG
KYERPTTWIVPAKSTTPHFLTGKFDSIAIRLCDHPSVKALCELTGFALTSTSANLTGEPPCRTADEVRLQFGSDFPVLNE
MVGRAHNPSEIRDLRTNQLFRQG
>P9WGC9 2.7.7.87~~~~~~Putative threonylcarbamoyl-AMP synthase~~~COG0009
MTETFDCADPEQRSRGIVSAVGAIKAGQLVVMPTDTVYGIGADAFDSSAVAALLSAKGRGRDMPVGVLVGSWHTIEGLVY
SMPDGARELIRAFWPGALSLVVVQAPSLQWDLGDAHGTVMLRMPLHPVAIELLREVGPMAVSSANISGHPPPVDAEQARS
QLGDHVAVYLDAGPSEQQAGSTIVDLTGATPRVLRPGPVSTERIAEVLGVDAASLFG
>P94682 1.1.1.257~~~tsaD1~~~4-(hydroxymethyl)benzenesulfonate dehydrogenase TsaD1~~~
MSTVLYRCPELLIGGEWRPGRHEQRLVVRNPATGEPLDELRLASADDLQLALQTTQQAFEHWRQVPAHERCARLERGVAR
LRENTERIAHLLTLEQGKTLAEARMECAMAADLIKWYAEEARRVYGRVIPARLPNSRMEVFKFPVGPVAAFSPWNFPLVL
SARKLGGAIAAGCSIVLKAAEETPASVAAMVDCLNQELPPGVVQLLYGVPAEVSQALIASPVVRKVTFTGSVPVGRHLAE
LSARHLKRITLELGGHAPVIVCGDADIARTVNLMVQHKFRNAGQACLAPTRFFVDRRIYGDFVDAFGATQALRVGAGMAA
ETQMGPVASARRQAAVQDLIARSVAAGARPVASAVPEAGYFVAPTLLADVPLDAPVMSEEPFGPVACAVPFDSLDQAIAQ
ANHNPYGLAGYLFTDSAKAILAVSERLEVGSLAVNGMGVSVPEAPFGGVKDSGYGSESGTEGMEAFLDTKFMHYVA
>O05518 2.3.1.234~~~tsaD~~~tRNA N6-adenosine threonylcarbamoyltransferase~~~COG0533
MSEQKDMYVLGIETSCDETAAAIVKNGKEIISNVVASQIESHKRFGGVVPEIASRHHVEQITLVIEEAFRKAGMTYSDID
AIAVTEGPGLVGALLIGVNAAKALSFAYNIPLVGVHHIAGHIYANRLVEDIVFPALALVVSGGHTELVYMKEHGSFEVIG
ETLDDAAGEAYDKVARTMGLPYPGGPQIDKLAEKGNDNIPLPRAWLEEGSYNFSFSGLKSAVINTLHNASQKGQEIAPED
LSASFQNSVIDVLVTKTARAAKEYDVKQVLLAGGVAANRGLRAALEKEFAQHEGITLVIPPLALCTDNAAMIAAAGTIAF
EKGIRGAYDMNGQPGLELTSYQSLTR
>Q72B00 2.3.1.234~~~tsaD~~~tRNA N6-adenosine threonylcarbamoyltransferase~~~COG0533
MLCIGIETSCDETGLALVRDGRLVHAVMSSQADIHALFGGVVPEIASREHYRLIGPLYDLLLREAGVGPTELDAVAVARG
PGLLGSLLVGMAFAKGLALGFDIPLVGVNHLHAHLLAAGLERELVFPALGLLVSGGHTHIYRIESPVSFRLLGRTLDDAA
GEAYDKVAKMLRLPYPGGRILDQQGRRGIADPRLFPRPYTDNDNLDFSFSGLKTAVQTHLSRHPELVPSPDAAQAVAGGH
DAPEGLRNMCASFNEAVAETLCIKLGRALDGDDGVGVRALIVAGGVAANSYVREHTARLAVSRGLELIVPSPPLCTDNGS
MIAYAGWLLRQGGLSHSLDLEAVPRGRAIPDDYRQGPAGCS
>P05852 2.3.1.234~~~tsaD~~~tRNA N6-adenosine threonylcarbamoyltransferase~~~COG0533
MRVLGIETSCDETGIAIYDDEKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAALKESGLTAKDIDAVAYTA
GPGLVGALLVGATVGRSLAFAWDVPAIPVHHMEGHLLAPMLEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDD
AAGEAFDKTAKLLGLDYPGGPLLSKMAAQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRDNGTDDQTRADIARAFE
DAVVDTLMIKCKRALDQTGFKRLVMAGGVSANRTLRAKLAEMMKKRRGEVFYARPEFCTDNGAMIAYAGMVRFKAGATAD
LGVSVRPRWPLAELPAA
>P36175 2.3.1.234~~~tsaD~~~tRNA N6-adenosine threonylcarbamoyltransferase~~~
MRILGIETSCDETGVAIYDEDKGLVANQLYSQIDMHADYGGVVPELASRDHIRKTLPLIQEALKEANLQPSDIDGIAYTA
GPGLVGALLVGSTIARSLAYAWNVPALGVHHMEGHLLAPMLEENAPEFPFVALLISGGHTQLVKVDGVGQYELLGESIDD
AAGEAFDKTGKLLGLDYPAGVAMSKLAESGTPNRFKFPRPMTDRPGLDFSFSGLKTFAANTIKANLNENGELDEQTKCDI
AHAFQQAVVDTILIKCKRALEQTGYKRLVMAGGVSANKQLRADLAEMMKKLKGEVFYPRPQFCTDNGAMIAYTGFLRLKT
MNKPT
>P9WHT7 2.3.1.234~~~tsaD~~~tRNA N6-adenosine threonylcarbamoyltransferase~~~COG0533
MTTVLGIETSCDETGVGIARLDPDGTVTLLADEVASSVDEHVRFGGVVPEIASRAHLEALGPAMRRALAAAGLKQPDIVA
ATIGPGLAGALLVGVAAAKAYSAAWGVPFYAVNHLGGHLAADVYEHGPLPECVALLVSGGHTHLLHVRSLGEPIIELGST
VDDAAGEAYDKVARLLGLGYPGGKALDDLARTGDRDAIVFPRGMSGPADDRYAFSFSGLKTAVARYVESHAADPGFRTAD
IAAGFQEAVADVLTMKAVRAATALGVSTLLIAGGVAANSRLRELATQRCGEAGRTLRIPSPRLCTDNGAMIAAFAAQLVA
AGAPPSPLDVPSDPGLPVMQGQVR
>P40731 2.3.1.234~~~tsaD~~~tRNA N6-adenosine threonylcarbamoyltransferase~~~
MRVLGIETSCDETGIAIYDDKKGLLANQLYSQVKLHADYGGVVPELASRDHVRKTVPLIQAALKEAGLTASDIDAVAYTA
GPGLVGALLVGATVGRSLAFAWNVPAIPVHHMEGHLLAPMLEDNPPEFPFVALLVSGGHTQLISVTGIGQYELLGESIDD
AAGEAFDKTAKLLGLDYPGGPMLSKMASQGTAGRFVFPRPMTDRPGLDFSFSGLKTFAANTIRSNGGDEQTRADIARAFE
DAVVDTLMIKCKRALESTGFKRLVMAGGVSANRTLRAKLAEMMQKRRGEVFYARPEFCTDNGAMIAYAGMVRFKAGVTAD
LGVTVRPRWPLAELPAA
>Q2FWL2 2.3.1.234~~~tsaD~~~tRNA N6-adenosine threonylcarbamoyltransferase~~~COG0533
MTKDILILAVETSCDETSVSVIKNGRDILSNTVLSQIESHKRFGGVVPEVASRHHVEGITATINEALGDADVSIEDIDAI
AVTEGPGLIGALLIGVNAAKALAFAYDKPLIPVHHIAGHIYANHIEEPLTFPLIALIVSGGHTELVYMKDHLSFEVIGET
RDDAVGEAYDKVARTIGLNYPGGPQVDRLAAEGEDTYSFPRVWLDKDSYDFSFSGLKSAVINQLHNQRQKNIPIIEANVA
TSFQNSVVEVLTFKAIQACKEYGVQRLIVAGGVASNKGLRQSLADQCKVNDIQLTIPSPKLCTDNAAMIGVAGHYLYQQG
RFADLALNGHSNIDLEEYSAE
>Q7A4H8 2.3.1.234~~~tsaD~~~tRNA N6-adenosine threonylcarbamoyltransferase~~~
MTKDILILAVETSCDETSVSVIKNGRDILSNTVLSQIESHKRFGGVVPEVASRHHVEGITTTINEALVDADVSMEDIDAI
AVTEGPGLIGALLIGVNAAKALAFAYDKPLIPVHHIAGHIYANHIEEPLTFPLIALIVSGGHTELVYMKDHLSFEVIGET
RDDAVGEAYDKVARTIGLNYPGGPQVDRLAAEGEDTYSFPRVWLDKDSYDFSFSGLKSAVINQLHNQRQKNIPIIEANVA
TSFQNSVVEVLTFKAIQACKEYSVQRLIVAGGVASNKGLRQSLADQCKVNDIQLTIPSPKLCTDNAAMIGVAGHSLYQQG
RFADLALNGHSNIDLEEYSAE
>Q9WXZ2 2.3.1.234~~~tsaD~~~tRNA N6-adenosine threonylcarbamoyltransferase~~~COG0533
MRVLGIETSCDETAVAVLDDGKNVVVNFTVSQIEVHQKFGGVVPEVAARHHLKNLPILLKKAFEKVPPETVDVVAATYGP
GLIGALLVGLSAAKGLAISLEKPFVGVNHVEAHVQAVFLANPDLKPPLVVLMVSGGHTQLMKVDEDYSMEVLGETLDDSA
GEAFDKVARLLGLGYPGGPVIDRVAKKGDPEKYSFPRPMLDDDSYNFSFAGLKTSVLYFLQREKGYKVEDVAASFQKAVV
DILVEKTFRLARNLGIRKIAFVGGVAANSMLREEVRKRAERWNYEVFFPPLELCTDNALMVAKAGYEKAKRGMFSPLSLN
ADPNLNV
>O05515 ~~~tsaE~~~tRNA threonylcarbamoyladenosine biosynthesis protein TsaE~~~COG0802
MKQLKWRTVNPEETKAIAKLTAAFAKPGDVLTLEGDLGAGKTTFTKGFAEGLGITRIVNSPTFTIIKEYNDGVLPLYHMD
VYRMEDESEDLGLDEYFHGQGVCLVEWAHLIEEQLPQERLQIVIKRAGDDEREITFTAVGNRYEMLCEELSRHDNISN
>P0AF67 ~~~tsaE~~~tRNA threonylcarbamoyladenosine biosynthesis protein TsaE~~~COG0802
MMNRVIPLPDEQATLDLGERVAKACDGATVIYLYGDLGAGKTTFSRGFLQALGHQGNVKSPTYTLVEPYTLDNLMVYHFD
LYRLADPEELEFMGIRDYFANDAICLVEWPQQGTGVLPDPDVEIHIDYQAQGREARVSAVSSAGELLLARLAG
>P44492 ~~~tsaE~~~tRNA threonylcarbamoyladenosine biosynthesis protein TsaE~~~COG0802
MESLTQYIPDEFSMLRFGKKFAEILLKLHTEKAIMVYLNGDLGAGKTTLTRGMLQGIGHQGNVKSPTYTLVEEYNIAGKM
IYHFDLYRLADPEELEFMGIRDYFNTDSICLIEWSEKGQGILPEADILVNIDYYDDARNIELIAQTNLGKNIISAFSN
>P9WFS7 ~~~tsaE~~~tRNA threonylcarbamoyladenosine biosynthesis protein TsaE~~~COG0802
MSREGIRRRPKARAGLTGGGTATLPRVEDTLTLGSRLGEQLCAGDVVVLSGPLGAGKTVLAKGIAMAMDVEGPITSPTFV
LARMHRPRRPGTPAMVHVDVYRLLDHNSADLLSELDSLDLDTDLEDAVVVVEWGEGLAERLSQRHLDVRLERVSHSDTRI
ATWSWGRS
>P22940 ~~~~~~56 kDa type-specific antigen~~~
MKKIMLIASAMSALSLPFSASAIELGEEGGLECGPYGKVGIVGGMITGAESTRLDSTDSEGKKHLSLTTGLPFGGTLAAG
MTIAPGFRAELGVMYLRNISAEVEVGKGKVDSKGEIKADSGGGTDTPIRKRFKLTPPQPTIMPISIADRDVGVDTDILAQ
AAAGQPQLTVEQRAADRIAWLKNYAGIDYMVPDPQNPNARVINPVLLNITQGPPNVQPRPRQNLDILDHGQWRHLVVGVT
ALSHANKPSVTPVKVLSDKITKIYSDIKPFADIAGIDVPDTGLPNSASVEQIQSKMQELNDVLEDLRDSFDGYMGNAFAN
QIQLNFVMPQQAQQQQGQGQQQQAQATAQEAVAAAAVRLLNGNDQIAQLYKDLVKLQRHAGVKKAMEKLAAQQEEDAKNQ
GEGDCKQQQGASEKSKEGKGKETEFDLSMIVGQVKLYADLFTTESFSIYAGVGAGLAHTYGKIDDKDIKGHTGMVASGAL
GVAINAAEGVYVDLEGSYMHSFSKIEEKYSINPLMASVGVRYNF
>Q74FW6 4.3.1.19~~~tdcB~~~L-threonine ammonia-lyase~~~COG1171
MLPYTLIQEADDRLRKRVRRTELIHSHHFSEKLGIPIYFKCENLQRTGAFKIRGALNFMTSQPREALAKGVITASAGNHA
QGVAFSADLLGVPSTVFMPESTPPQKVFATRDYGAEVVLTGRNFDEAYAAAVQAQEERGALFVHPFDDPLVMAGQGTIGL
EVLQELPDVANILVPIGGGGLIAGIATAIRETHPHVRIIGVETAAAPSAHYSLQKGKIVQVPVTVTLADGIAVKKPGVNT
FPIIRDLVDEVVLVEEEEIALAIVALLERTKLLVEGAGAVPLAALLNRRVTDLSGKTVCVLSGGNIDVKTISVVVERGLV
AAGRYLKLKVELDDLPGALARLATEIAEAKANISIITHDRRSKSLPIGKTEVLIELETRGFEHIQEVISHLQGVGYLVDV
LK
>Q9WYJ1 4.3.1.19~~~~~~L-threonine ammonia-lyase~~~
MITLEDIKEAQRTLKNVVHRTALAYSSVLSEVTGGEIYLKMENLQKTGSFKIRGAYNKIAHLSEEERKRGVVAASAGNHA
QGVALAAQIFGIPATIVMPRYAPLSKITKTRNLGAQVILEGNIFDEAYEAALRIQEKTGAVFVHPFNDPHVIAGQGTIGL
EIMEDLPDVEVVVVPVGGGGLISGVSVAIKSMNPEVKVIGVQTENMPSMIASLRRGRAERVEGKPTLADGIAVKKPGDLT
FELVKKYVDEMVAVNEEEIADAILFLLEQAKVVAEGAGAVGVAAVLNKLDVKGKKVAIVISGGNIDVNMIDRIINKGLVK
SGRKVFIETFVMDRPGALKELLGIVAELGANVLSVFHNRSAKEVPIGFAKIELELETVDEKHVEEIERVLIAKGYEVRIV
G
>P94679 1.14.14.-~~~tsaM1~~~Toluene-4-sulfonate monooxygenase system iron-sulfur subunit TsaM1~~~
MFIRNCWYVAAWDTEIPAEGLFHRTLLNEPVLLYRDTQGRVVALENRCCHRSAPLHIGRQEGDCVRCLYHGLKFNPSGAC
VEIPGQEQIPPKTCIKSYPVVERNRLVWIWMGDPARANPDDIVDYFWHDSPEWRMKPGYIHYQANYKLIVDNLLDFTHLA
WVHPTTLGTDSAASLKPVIERDTTGTGKLTITRWYLNDDMSNLHKGVAKFEGKADRWQIYQWSPPALLRMDTGSAPTGTG
APEGRRVPEAVQFRHTSIQTPETETTSHYWFCQARNFDLDDEALTEKIYQGVVVAFEEDRTMIEAHEKILSQVPDRPMVP
IAADAGLNQGRWLLDRLLKAENGGTAP
>Q6XL52 ~~~tsaQ1~~~HTH-type transcriptional regulator TsaQ1/TsaQ2~~~
MATHVKTPDQLQESGTGLVHSLAKGLEILSCFSEGELLGNQQLVELTGLPKATVSRLTSTLVKLGYLQVDPRSRKLAMGA
RVLGLGVSVQRKLGLQRIARPHMEALSQRFGLTVTMGTRDRLSVVLLEVCRPPSLAQLVVNFDAGTHMPLSQTALGLASL
VNSPVKDREQVIEGLRKQLGDQWVEARNRIERAHQEHERYGYIVSQRSLGRDVSGVAVGMVPMGSNTPYVFHMAGPSNQM
PLSLMRSDMGPALKQMVQDIQAEMRAARPPKLVVPKEF
>P94678 ~~~tsaR~~~HTH-type transcriptional regulator TsaR~~~
MKLQTLQALICIEEVGSLRAAAQLLHLSQPALSAAIQQLEDELKAPLLVRTKRGVSLTSFGQAFMKHARLIVTESRRAQE
EIGQLRGRWEGHITFAASPAIALAALPLALASFAREFPDVTVNVRDGMYPAVSPQLRDGTLDFALTAAHKHDIDTDLEAQ
PLYVSDVVIVGQRQHPMANATRLAELQECRWAFSSAPRGPGAIIRNAFARYGLPEPKLGLVCESFLALPGVVAHSDLLTT
MPRTLYERNAFKDQLCSIPLQDALPNPTIYVLRRHDLPVTPAAAGLIRWIQHHALQTG
>Q6A553 ~~~tsaS~~~Probable inner membrane transporter protein TsaS~~~
MAVPRQRAAGGLCGDGHHRFWLGPGGGALLAGSGRCRGGGLVILLDVPASMLHGGLNFRQVRWRAIGAILPGMAVGALIG
LWLMGQLDKRWPLFLLGLYITWVGWRTLRHGQQAARALPGWTHHAGSGLVGVLEVMFATAGPMVIALLQRRLREVAEIRA
TVPVVMVVAASIAIAVLFGAGQIDRAHTFERWLVALPIAFMGVVLGNRLARHIPPPAMRRAMAVLLIASGLSLTQHLWR
>Q8KR68 ~~~tsaT~~~Outer membrane transporter protein TsaT~~~
MNFRRRLCTAALIAALPLASQAQNNTLLLNSVLAPQNPMTKMIVKPWAEKIAQVTEGRVKVDVAPSSLAAPQQQLASVNK
GVFDIAYQFHGLLTDQVKLNQIAQLPFVNTTARGSSVALWRTYQKHFAKANELGEVQVLALFVQPPGVMFGMKGPIDGMD
KLKGRKVYALPGVPSAMMESAGAAVVAAPGARSYEIVSGKTVDAFVGYPTSDAEGLKTLSYATDVTDIPGNLTAVSWVLF
MNKKRWAALSEKDRKAIESISGEAFAQGMKQYDDLETKVRSEAAAKGIKFHMANDAFVKELQTLATPITQAWLKDASSRG
VNGQEALDFYRAQAAANR
>P37918 ~~~~~~56 kDa type-specific antigen~~~
MKKIMLIASAMSALSLPFSASAIELGDEGGLECGPYAKVGVVGGMITGVESTRLDPADAGGKKQLPLTTSMPFGGTLAAG
MTIAPGFRAELGVMYLANVKAEVESGKTGSDADIRSGADSPMPQRYKLTPPQPTIMPISIADRDLGVDIPNVPQGGANHL
GDNLGANDIRRADDRITWLKNYAGVDYMVPDPNNPQARIVNPVLLNIPQGPPNANPRQAMQPCSILNHDHWRHLVVGITA
MSNANKPSVSPIKVLSEKIVQIYRDVKPFARVAGIEVPSDPLPNSASVEQIQNKMQELNDILDEIRDSFDGCIGGNAFAN
QIQLNFRIPQAQQQGQGQQQQQAQATAQEAAAAAAVRVLNNNDQIIKLYKDLVKLKRHAGIKKAMEELAAQDGGCNGGGD
NKKKRGASEDSDAGGASKGGKGKETKETEFDLSMIVGQVKLYADLFTTESFSIYAGLGAGLAYTSGKIDGVDIKANTGMV
ASGALGVAINAAEGVYVDIEGSYMHSFSKIEEKYSINPLMASFGVRYNF
>D3RVD4 1.8.2.2~~~tsdA~~~Thiosulfate dehydrogenase~~~COG3258
MRGDVRVHTASPIAAAWLLAVGLVAHAEEPPTVALTVPAAALLPDGALGESIVRGRRYLSDTPAQLPDFVGNGLACRHCH
PGRDGEVGTEANAAPFVGVVGRFPQYSARHGRLITLEQRIGDCFERSLNGRALALDHPALIDMLAYMSWLSQGVPVGAVV
AGHGIPTLTLEREPDGVHGEALYQARCLACHGADGSGTLDADGRYLFPPLWGPRSFNTGAGMNRQATAAGFIKHKMPLGA
DDSLSDEEAWDVAGFVLTHPRPLFQEPTGD
>Q4FQB7 1.8.2.2~~~tsdA~~~Thiosulfate dehydrogenase~~~COG3258
MKIIPYRKRSVLIATIFAISAVGITGCSDNTETKAVERVEEAAALARVKDLEARAEALKSNMPAANDMTATAGGTDTASG
KPTIKMPDESTIPDDEFGAAVRRGLQISNHTYKELPNNVGNQLNCTSCHLGNGSEAYAAPWNNTPSVYPNYSKRTGRINT
IQERINGCFERSLNGKALDLNSDDMNAMVSYMSWLSQDMPFGVSPEGSGFVKVDKTLEPNTDNGKKLFAEKCSVCHGATG
EGQYNDDGTYVYPAIAGDKSFNDGAGMARTYTAASFIKGKMPFGQGGSLSDQEAVDIASYFTHLPRPIKANKDKDWPNGD
APKDVRR
>A4VND8 1.8.2.2~~~tsdA~~~Thiosulfate dehydrogenase~~~COG3258
MNTQLLVTLLAMSIGGVALAAEIKMDDQSQLTQKAGKGAGESYFQPPQEKDLPANAYGELVQQGRAIFVDTQKYAAEYVG
NGMNCTNCHLEQGRKANSAPLWGAYPMYPAYRKKNDKVNSYAERVQGCFQFSMNGTPPAADSHVINALTAYSYWLSTGAP
TGQELPGRAYPEVPQPQGGFDIAKGKQIYAEQCAVCHGDDGQGQKAGGGYVFPPLWGKDSFNWGAGMHRINTAAAFIKES
MPLGKGGSLSDADAWHVAAYMNSHERPQDPRLIEGSVEKTRLKYHANDGVNLYGQQVDGALLGQGVK
>D5WYQ5 1.8.2.2~~~tsdA~~~Thiosulfate dehydrogenase~~~COG3258
MTRKQTPMTLALCALGLFAASAASALAADAPMAPPKSEINAAVGTGAKFTPPPESAIPDDDFGKMVKLGRDIMLDTPKYA
KDYVGNTLSCVNCHTDAGRMAGSAPLWAAYVSYPAYRGKNKKVNTFEERLQGCFKFSQNGKAPPLGSKTLVALESYSYWL
SKGLPVDEKVAGRGYPNLPEPQQAPDYVRGQKVYEAKCILCHAANGEGQYVNGETVFPPLWGPKSFNWGAGMGSYKNAAK
FIYANMPYGMSYSLSPQEAWDVAYFMDAQERPQDPRWQGSVAATRAKFHDSKFSLYGTKVNGKLLGDIGAPKPR
>Q0SFL5 1.14.13.220~~~tsdB~~~Probable NADH-specific resorcinol 4-hydroxylase~~~COG0654
MSAFAQPPGAGRKAVDVAIVGSGPTGMALAALLGCQGRSVVVLERYTGLYNLPRAAAFDDETMRTFQKLGVAEKMLPGTN
VQRGYVWVNGDDEVLLDIEFDNPGRCGWPAQYMMYQPHLESVLDELITSLPTVEIRRGMTVESVDQQDGDDVLVRATDVE
GSAYLVRARYVVGCDGGNGVVRQFAGGELDDYGFFENWLVCDFQLNRDVPDLPTFRQVCDPAEPIAIVNIGPRFHRFSFR
LESAANREEVVHPDKVWPRVATYLTPEDAELVRVANYTFRSCITTQWRHRRILLAGDAAHQMPPFLAQGMVSGIRDARNL
AWKLDMVLAGHPDSLLDTYQAEREPHVRYITEKAIELGRVQTMRDTALAAQRDAQMIAARKANQKPDKLRYPALSGGLIA
NHGDMFPQGLVSTSSTTALFDEIAGTGWLVVADGPQVLSGIAEGDRTAFTEIGGKEVIFGLTSMFDGAPVSDTAGVYTRW
FAAHECVAAIVRPDGYVFGLARDAAELAGLAKELVAAVAPVPSRPPAPTA
>Q9I2Q1 3.4.19.11~~~tse1~~~Peptidoglycan amidase Tse1~~~
MDSLDQCIVNACKNSWDKSYLAGTPNKDNCSGFVQSVAAELGVPMPRGNANAMVDGLEQSWTKLASGAEAAQKAAQGFLV
IAGLKGRTYGHVAVVISGPLYRQKYPMCWCGSIAGAVGQSQGLKSVGQVWNRTDRDRLNYYVYSLASCSLPRAS
>Q9I0E0 ~~~tse2~~~Toxin Tse2~~~
MSYDYEKTSLTLYRAVFKANYDGDVGRYLHPDKELAEAAEVAPLLHPTFDSPNTPGVPARAPDIVAGRDGLYAPDTGGTS
VFDRAGVLRRADGDFVIPDGTDIPPDLKVKQDSYNKRLQATHYTIMPAKPMYREVLMGQLDNFVRNAIRRQWEKARGL
>Q9HYC5 3.2.1.17~~~tse3~~~Peptidoglycan muramidase Tse3~~~
MTATSDLIESLISYSWDDWQVTRQEARRVIAAIRNDNVPDATIAALDKSGSLIKLFQRVGPPELARSLIASIAGRTTMQR
YQARNALIRSLINNPLGTQTDNWIYFPTITFFDICADLADAAGRLGFAAAGATGVASQAIQGPFSGVGATGVNPTDLPSI
AFGDQLKLLNKDPATVTKYSNPLGDLGAYLSQLSPQDKLNQAQTLVGQPISTLFPDAYPGNPPSRAKVMSAAARKYDLTP
QLIGAIILAEQRDQTRDEDAKDYQAAVSIKSANTSIGLGQVVVSTAIKYELFTDLLGQPVRRGLSRKAVATLLASDEFNI
FATARYIRYVANLASQQDLRKLPKTRGAFPSIDLRAYAGNPRNWPRDNVRALASEYTSRPWDDNLSPGWPMFVDDAYATF
LDPGMRFP
>Q9I739 3.2.2.5~~~tse6~~~NAD(P)(+) glycohydrolase toxin Tse6~~~
MDAQAAARLGDEIAHGFGVAAMVAGAVAGALIGAAVVAATAATGGLAAVILAGSIAAGGLSMFQIVKGLTTIFELPEPTT
GVLIRGSFNVYVNSRNAMRAGDDVSATCSGLPLNHPLWPFPVLIAEGSATVYINGKPAARLQSKMVCGAHIKTGSQNTFI
GGPTERVAFVLDLEEWLHTGLEALGLAALAGGLLLAAMAGVAALVGVVAIGGLMMGGMALLGDLGDRLGPGYRDLFQGVA
GMALLGFGPKLAGRRPAAVTSETAQRRAYLNNKFGRSGNLDHDINYRGNRETAAKFFKSKDIDPADAESYMNGLDFNHPV
RVETLAPGKNLWQYQSPGAPQGNWYTLSPRVQPTELGINPMGTNRAANTIEPKVLNSYRTTQKVEVLRSTAAPTDDFWSV
KGQSYPAKGGAQQLFSNEKGSFGLLPREGS
>Q9I733 3.1.21.1~~~tse7~~~DNase toxin Tse7~~~
MANEVYANNMEISCKAANGKSIAAFPDVCFTPPQAPPTPLGVPIPYPNTGLSKDTTKGTRTIRITRKEVMLKNKSYYKTS
YGDEPGRAPKKGIVTSKIKGKVYFTSWSMNVKFESKNVVRHLDLTTHNHASFPGNTPVWPYLDQATVDAGGGPCSNEVKK
EKKDCADFKPHGSKDACAGLGAGKPSGKKTSNEADRLADKVAARKCLTARRCALQPYKPNSCCPQQTAHHLIEASALHDK
GRGGKGSVPLKGISNYSENKAPCVCAEGVNQNVGTHGLMHTFQSAAAAKSRSGTLQLSNGSSISAKKTTYGTAKRQSMAA
MGKVFPQSKCSKECLSAQLDNYHKQCGINARTPIKAVETGQTDVTAATQAIKTRNARLGATRSRVR
>Q9KS43 ~~~tseL~~~Toxin TseL~~~COG3675
MDSFNYCVQCNPEENWLELEFRSENDEPIDGLLVTITNQSAPSNTYTQTTSSGKVLFGKIAAGEWRASVSQASLLTEVEK
YASRKEGQESPVKKRAAAELDAADKDTKQYRFTTIGDFWDEAPKDEFLQKQHKGIDVNASAEKAGFRLSHNQTYVFEIKA
LRSYMPVIIDTDEFNLVNSYTFALLSKLAYATNDFNRDDGKTIDNQGAISTVISQLKRKERPTYSGDLQAKWLLEEIPYS
KALSAQYYAEDDVGSEGYIIFNDELAIIGVRGTEPYFQSKKPPVDNTKFKIIKAASGMAAVIADKIESATDSPGMKDLII
TDLDAAQIAPEEFGGTYVHRGFYQYTMALLSLMEKDLGLHKIKKFYCCGHSLGGAGALLISALIKDSYHPPVLRLYTYGM
PRVGTRSFVERYQNILHYRHVNNHDLVPQIPTVWMNTDVSEGFHVLDVFKSRVDLMRKMLTDDDDDNYQHHGHLSQLLTY
NSNNQVLLTPKQTQVTMLDLANLATNDSVAMVDGLSDASIVEHGMEQYIPNLFEQLTALSDESLMVHYQRAISALEQEIA
TLQQSYLTVKQAWIESIGNGTPTMNIGRLMSEMHSINKLIENRNKIRGELRQIVSDPQRMPATKFLISQQTLPDEIKVQI
R
>P60778 ~~~tsgA~~~Protein TsgA~~~COG0738
MTNSNRIKLTWISFLSYALTGALVIVTGMVMGNIADYFNLPVSSMSNTFTFLNAGILISIFLNAWLMEIVPLKTQLRFGF
LLMVLAVAGLMFSHSLALFSAAMFILGVVSGITMSIGTFLVTQMYEGRQRGSRLLFTDSFFSMAGMIFPMIAAFLLARSI
EWYWVYACIGLVYVAIFILTFGCEFPALGKHAPKTDAPVEKEKWGIGVLFLSVAALCYILGQLGFISWVPEYAKGLGMSL
NDAGTLVSNFWMSYMVGMWAFSFILRFFDLQRILTVLAGLAAILMYVFNTGTPAHMAWSILALGFFSSAIYTTIITLGSQ
QTKVPSPKLVNFVLTCGTIGTMLTFVVTGPIVEHSGPQAALLTANGLYAVVFVMCFLLGFVSRHRQHNTLTSH
>Q47692 3.4.21.-~~~tsh~~~Temperature-sensitive hemagglutinin tsh autotransporter~~~
MNRIYSLRYSAVARGFIAVSEFARKCVHKSVRRLCFPVLLLIPVLFSAGSLAGTVNNELGYQLFRDFAENKGMFRPGATN
IAIYNKQGEFVGTLDKAAMPDFSAVDSEIGVATLINPQYIASVKHNGGYTNVSFGDGENRYNIVDRNNAPSLDFHAPRLD
KLVTEVAPTAVTAQGAVAGAYLDKERYPVFYRLGSGTQYIKDSNGQLTQMGGAYSWLTGGTVGSLSSYQNGEMISTSSGL
VFDYKLNGAMPIYGEAGDSGSPLFAFDTVQNKWVLVGVLTAGNGAGGRGNNWAVIPLDFIGQKFNEDNDAPVTFRTSEGG
ALEWSFNSSTGAGALTQGTTTYAMHGQQGNDLNAGKNLIFQGQNGQINLKDSVSQGAGSLTFRDNYTVTTSNGSTWTGAG
IVVDNGVSVNWQVNGVKGDNLHKIGEGTLTVQGTGINEGGLKVGDGKVVLNQQADNKGQVQAFSSVNIASGRPTVVLTDE
RQVNPDTVSWGYRGGTLDVNGNSLTFHQLKAADYGAVLANNVDKRATITLDYALRADKVALNGWSESGKGTAGNLYKYNN
PYTNTTDYFILKQSTYGYFPTDQSSNATWEFVGHSQGDAQKLVADRFNTAGYLFHGQLKGNLNVDNRLPEGVTGALVMDG
AADISGTFTQENGRLTLQGHPVIHAYNTQSVADKLAASGDHSVLTQPTSFSQEDWENRSFTFDRLSLKNTDFGLGRNATL
NTTIQADNSSVTLGDSRVFIDKNDGQGTAFTLEEGTSVATKDADKSVFNGTVNLDNQSVLNINDIFNGGIQANNSTVNIS
SDSAVLGNSTLTSTALNLNKGANALASQSFVSDGPVNISDAALSLNSRPDEVSHTLLPVYDYAGSWNLKGDDARLNVGPY
SMLSGNINVQDKGTVTLGGEGELSPDLTLQNQMLYSLFNGYRNIWSGSLNAPDATVSMTDTQWSMNGNSTAGNMKLNRTI
VGFNGGTSPFTTLTTDNLDAVQSAFVMRTDLNKADKLVINKSATGHDNSIWVNFLKKPSNKDTLDIPLVSAPEATADNLF
RASTRVVGFSDVTPILSVRKEDGKKEWVLDGYQVARNDGQGKAAATFMHISYNNFITEVNNLNKRMGDLRDINGEAGTWV
RLLNGSGSADGGFTDHYTLLQMGADRKHELGSMDLFTGVMATYTDTDASADLYSGKTKSWGGGFYASGLFRSGAYFDVIA
KYIHNENKYDLNFAGAGKQNFRSHSLYAGAEVGYRYHLTDTTFVEPQAELVWGRLQGQTFNWNDSGMDVSMRRNSVNPLV
GRTGVVSGKTFSGKDWSLTARAGLHYEFDLTDSADVHLKDAAGEHQINGRKDSRMLYGVGLNARFGDNTRLGLEVERSAF
GKYNTDDAINANIRYSF
>Q9I2Q0 ~~~tsi1~~~Immune protein Tsi1~~~
MKLLAGSFAALFLSLSAQAADCTFTQLEIVPQFGSPNMFGGEDEHVRVMFSNEDPNDDNPDAFPEPPVYLADRDSGNDCR
IEDGGIWSRGGVFLSQDGRRVLMHEFSGSSAELVSYDSATCKVVHREDISGQRWAVDKDGLRLGQKCSGESVDSCAKIVK
RSLAPFCQTAKK
>Q9I0D9 ~~~tsi2~~~Immune protein Tsi2~~~
MNLKPQTLMVAIQCVAARTRELDAQLQNDDPQNAAELEQLLVGYDLAADDLKNAYEQALGQYSGLPPYDRLIEEPAS
>Q9HYC4 ~~~tsi3~~~Immune protein Tsi3~~~
MKTVALILASLALLACTAESGVDFDKTLTHPNGLVVERPVGFDARRSAEGFRFDEGGKLRNPRQLEVQRQDAPPPPDLAS
RRLGDGEARYKVEEDDGGSAGSEYRLWAAKPAGARWIVVSASEQSEDGEPTFALAWALLERARLQ
>Q9I740 ~~~tsi6~~~Immune protein Tsi6~~~
MTPIEYIDRALALVVDRLARYPGYEVLLSAEKQLQYIRSVLLDRSLDRSALHRLTLGSIAVKEFDETDPELSRALKDAYY
VGIRTGRGLKVDLP
>Q9I732 ~~~tsi7~~~Immune protein Tsi7~~~
MLKKLSPIFSNITGVVRYQDLAYVASVSDEIQEQNIAHSYVTEWDCGTWCVAGEDDDMLPWEIVSATVVHEPVEQALFLG
ARGQVFCMGSGDIHEEQLPDGDDAIGGRGNMRGVACIDGVAYACGMDRQVYRRFDENDWRAIDTGARPPAGSEAVVGFEA
IGGFGAREIYAVGWDGEIWQYDGKRWQPRESPTNLILTAICCAEDGSVYACGQAGTLLRGRNDHWEIIAQDDVDEDLWSL
AWFDGALYVSSATAVYTLVGGHLKEVDFGDEQPQRCFHLSAADGVLWSIAAKDIFSFDGQQWTRID
>Q9KN41 ~~~tsiV3~~~Antitoxin protein TsiV3~~~
MNNLLSAYVTMLLILLSISGGAIASENCNDTSGVHQKILVCIQNEIAKSETQIRNNISSKSIDYGFPDDFYSKQRLAIHE
KCMLYINVGGQRGELLMNQCELSMLQGLDIYIQQYIEDVDNS
>P18644 2.1.1.230~~~tsnR~~~23S rRNA (adenosine(1067)-2'-O)-methyltransferase~~~
MTELDTIANPSDPAVQRIIDVTKPSRSNIKTTLIEDVEPLMHSIAAGVEFIEVYGSDSSPFPSELLDLCGRQNIPVRLID
SSIVNQLFKGERKAKTFGIARVPRPARFGDIASRRGDVVVLDGVKIVGNIGAIVRTSLALGASGIILVDSDITSIADRRL
QRASRGYVFSLPVVLSGREEAIAFIRDSGMQLMTLKADGDISVKELGDNPDRLALLFGSEKGGPSDLFEEASSASVSIPM
MSQTESLNVSVSLGIALHERIDRNLAANR
>Q81BL7 ~~~tspO~~~Tryptophan-rich protein TspO~~~
MFMKKSSIIVFFLTYGLFYVSSVLFPIDRTWYDALEKPSWTPPGMTIGMIWAVLFGLIALSVAIIYNNYGFKPKTFWFLF
LLNYIFNQAFSYFQFSQKNLFLATVDCLLVAITTLLLIMFSSNLSKVSAWLLIPYFLWSAFATYLSWTIYSIN
>Q9RFC8 ~~~tspO~~~Tryptophan-rich sensory protein~~~
MNMDWALFLTFLAACGAPATTGALLKPDEWYDNLNKPWWNPPRWVFPLAWTSLYFLMSLAAMRVAQLEGSGQALAFYAAQ
LAFNTLWTPVFFGMKRMATALAVVMVMWLFVAATMWAFFQLDTWAGVLFVPYLIWATAATGLNFEAMRLNWNRPEARA
>Q8KBX2 ~~~crtK-2~~~Tryptophan-rich protein TspO~~~COG3476
MNKQILTLALCIGLCLAVGFAGSTFTPKPASWYYTTLVKPSWNPPDWLFPPVWTILFIMMGTALAKVLGTGWKKNEVNVG
VVLFAIQLMLNLGWSASFFGMQSPLAGLVDIVLLWIFIVLTMLAFARVSKPASLLLVPYLCWVSFASYLNFTILQLNP
>C0JRZ9 2.1.1.106~~~tsrM~~~Tryptophan 2-C-methyltransferase~~~
MLRKGTVALINPNQIHPPIAPYALDVLTTALEASGFEAHVLDLTFHLDDWRQTLRDYFRAERPLLVGVTCRNTDTVYALE
QRPFVDGYKAVIDEVRRLTAAPVVAGGVGFSTMPFALVDYFGIEYGVKGPGEKIICDLARALAEGRSADRIHIPGLLVNR
GPGNVTRVAPPALDPRAAPAPSSSPSPSPAPSSSSAPVPVPLSFAAVGHHESRAWQAETELPYTRRSGEPYKVDNLRYYR
EGGLGSILTKNGCVYKCSFCVEPDAKGTQFARRGITAVVDEMEALTAQGIHDLHTTDSEFNLSIAHSKNLLREIVRRRDH
DATSPLRDLRLWVYCQPSPFDEEFAELLAAAGCAGVNIGADHTRPEMLDGWKVTAKGTRYYDFADTERLVQLCHRNGMLT
MVEALFGMPGETLETMRDCVDRMMELDATVTGFSLGLRLLPYMGLAKSLAEQCDGVRTVRGLQSNNASGPIVLKQLHQCD
GPIEYERQFMFDESGDFRLVCYFSPDLPEAPGTADSPDGIWRASVDFLWDRIPKSEQYRVMLPTLSGSSENDNNYADNPF
LTSLNRKGYTGAFWAHWRDREAIMSGATLPLGELAEAVR
>Q9I750 ~~~tssA1~~~Type VI secretion system component TssA1~~~
MLDVPVLLAAVSPDSPCGDDLEYDAAFLELERIAQGQPERQMGDAVLPAEPPEWPRVRALASELFGRSKDLRVANLLLQS
NVALDGLDGLADGLLLVRELLGQYWDGVYPLLDADDDNDPTFRINALTGLVAEPLLQLVWAIPLVRSRAFGPVNLRAALN
AAGLQRFASETLSPEQIAGAFADADADALAATRRALDGAQEHALAIESGVAERVGSAQGLDLGPLRQLLRQALQVFDLYG
PQGAGESLAPGAEAVADEQVGAAPVAAVAAPAPRASGEIANREDVLRQLDRLLEYYVRHEPSSPVPVLLKRAKTLVTADF
AEIVRNLIPDGISQFETLRGPESE
>Q9I749 ~~~tssB1~~~Type VI secretion system sheath protein TssB1~~~
MGSTTSSQKFIARNRAPRVQIEYDVELYGAEKKVQLPFVMGVMADLAGKPAEPQAAVADRKFLEIDVDNFDARLKAMKPR
VAFNVPNVLTGEGNLSLDITFESMDDFSPAAVARKVDSLNKLLEARTQLANLLTYMDGKTGAEEMIMKAIKDPALLQALA
SAPKPKDDEPQA
>Q9I748 ~~~tssC1~~~Type VI secretion system sheath protein TssC1~~~
MAELSTENLAQGQTTTEQTSEFASLLLQEFKPKTERAREAVETAVRTLAEHALEQTSLISNDAIKSIESIIAALDAKLTA
QVNLIMHHADFQQLESAWRGLHYLVNNTETDEQLKIRVLNISKPELHKTLKKFKGTTWDQSPIFKKLYEEEYGQFGGEPY
GCLVGDYYFDQSPPDVELLGEMAKISAAMHAPFISAASPTVMGMGSWQELSNPRDLTKIFTTPEYAGWRSLRESEDSRYI
GLTMPRFLARLPYGAKTDPVEEFAFEEETDGADSSKYAWANSAYAMAVNINRSFKLYGWCSRIRGVESGGEVQGLPAHTF
PTDDGGVDMKCPTEIAISDRREAELAKNGFMPLLHKKNTDFAAFIGAQSLQKPAEYDDPDATANANLAARLPYLFATCRF
AHYLKCIVRDKIGSFKEKDEMQRWLQDWILNYVDGDPAHSTETTKAQHPLAAAEVVVEEVEGNPGYYNSKFFLRPHYQLE
GLTVSLRLVSKLPSAKEA
>Q9I745 ~~~tssE1~~~Type VI secretion system component TssE1~~~
MAELTLQERLQPSLLDRLTDDEPGNLKEAAERRVLTLNQLKASVLRDLAWLFNTTSLFDHGPAARMPAGNSVLNYGLPAL
AGHTASSVDVHAIEALLTETIATFEPRIIRSSLRVRAQLLPGEMDHNALSFEIEGDLWAEPAPLRLLLTTNLDLETGHVR
VAQGERRRT
>Q9I744 ~~~tssF1~~~Type VI secretion system component TssF1~~~
MNPRLLEYYNQELQHIRESAAEFAEEFPKIAGRLSLSGFECADPYVERLLEGFAYLTARVQLKLDAEYPTFTHNLLEIAY
PHYLAPTPSMTVVQLRPDPNEGALSSGFSIERGASLRGQLGPDDQTACEYRTAHPVTLWPLEVAQADYFGNPAAVLGRLA
ASEPRAKAGLRIRLRSGAGIPFDSLSLDALPLYLHGADEQPYRLYEQLLGNACAVFVRAPDNAWVERLPTSSLRARGFDD
EDALLPVVPRAFQGYRLLQEYFALPARFLFVEFSGLNRALRRCHGEELELVVLFGKHDQRLEGTVDAEQLVPFCTPAINL
FPRRCDRIHLSDRVNEHHVIVDRTRPLDFEVHSLQQVSGHGSGPEQPFQPFYAVRDPARYGREQAYFRVRREPRVLSSKQ
RRKGPRSTYVGSETFVALVDANQAPYRHDLRQLGIAALCTNRDLPLFMPIGAHKSDFTLEDSAPVMQVRCLAGPSRPRAS
RAHDASAWRLISQLSLNYLSLAERGQGAAALRELLRLYGDSGDPALQLQIEGLREVSSKPCTRRLPMPGPIVFGRGLEIT
LDFDENAFRGTGVFLLGAVFERFLARYVSINSFTETVLRTGERGEVMRWQAKPGSRPNL
>Q9I753 ~~~tssK1~~~Type VI secretion system baseplate component TssK1~~~
MSWNNRVVWSEGMFLRPQHFQQHDRYLETLVDGRCRSLLAGGWGFSELKLDDALLTQGKLAIVSARGVLPDGTPFNIPAD
DPAPAPLNVEESLRDGIVYLGLPLKRVGTRDTVEEGEALGGARYVSQVQEVRDDNAAFESRAPVALGSQAFRLLTERDGL
GEYAAVGVARVREKRADQALSLDEDYLPPVLDIAAAPPLASFAKELLGLLHQRGEALAGRVVASSAGGASEIADFLLLQL
VNRAEALTGHLSRVRPLHPQELYRELVALAGEFCTFTASQRRPEEYPVYNHDDLAASFAPVMLALRQALATVIDAKAIAI
PIVEKAYGVHVAMLSDRSLIDNASFVLVVRADVPGESLRGHFPQQAKVGSVEHIRDLVNLQLPGIGLLPMPVAPRQIPYH
AGSTYFELDRGSAHWKQLTHSGGFAFHIAGQFPGLNLAFWAIRG
>P06886 ~~~tst~~~Toxic shock syndrome toxin-1~~~
MNKKLLMNFFIVSPLLLATTATDFTPVPLSSNQIIKTAKASTNDNIKDLLDWYSSGSDTFTNSEVLDNSLGSMRIKNTDG
SISLIIFPSPYYSPAFTKGEKVDLNTKRTKKSQHTSEGTYIHFQISGVTNTEKLPTPIELPLKVKVHGKDSPLKYGPKFD
KKQLAISTLDFEIRHQLTQIHGLYRSSDKTGGYWKITMNDGSTYQSDLSKKFEYNTEKPPINIDEIKTIEAEIN
>P33015 ~~~tsuA~~~Thiosulfate transporter TsuA~~~COG2391
MFSMILSGLICGALLGFVMQRGRFCLTGGFRDMYIVKNNRMFYALLIAISVQSVGVFALIQAGLLTYEAGAFPWLGTVIG
GYIFGLGIVLAGGCATGTWYRAGEGLIGSWIALFTYMVMSAVMRSPHASGLNQTLQHYSTEHNSIAETFNLSVWPLVAVL
LVITLWVVMKELKKPKLKVATLPPRRTGIAHILFEKRWHPFVTAVLIGLIALLAWPLSEATGRMFGLGITSPTANILQFL
VAGDMKYINWGVFLVLGIFVGSFIAAKASREFRVRAADAQTTLRSGLGGVLMGFGASIAGGCSIGNGLVMTAMMTWQGWI
GLVFMILGVWTASWLVYVRPQRKARLATAAAN
>G0GAP6 ~~~tsuA~~~Thiosulfate transporter TsuA~~~
MIWTGLLVGFLFGIVLQRGRICFNSAFRDVLLFKDNYLFKLAVFTLALEMILFVLLSQVGLMQMNPKPLNLVGNIIGGFV
FGLGMVLAGGCASGVTYRVGEGLTTAWFAALFYGLGAYATKSGAFSWWLSWVGQFKSPLSVEESAYYVKGAGPTISSVLG
LNPWIPALVIAALFILWAFGTKTTSRETKFNWKIASVCLALVAGLGFITSTLSGRKYGLGITGGWINLFQGFLTNSPLNW
EGLEIVGIILGAGVAAAVAGEFKLRMPKNPVTYLQVGIGGLLMGIGAVTAGGCNIGHFLTGVPQLALSSWLASIFFILGN
WTMAWILFRR
>P33014 ~~~tsuB~~~Putative sulfur carrier protein TsuB~~~COG0425
MVIKKLDVVTQVCPFPLIEAKAALAEMVSGDELVIEFDCTQATEAIPQWAAEEGHAITDYQQIGDAAWSITVQKA
>I2N045 4.2.3.159~~~~~~Tsukubadiene synthase~~~
MIEVPPFWCPLPIAIHPAADQAEKDARAWAERYGVRLRIADQVQPGRLGAYWAPHGTYEGMLAVGCWNFWAFAFDDHLDE
PLPLDVPVTTSLVQQAVDIPSPPITDDPWAAGAQAVFNMFRDLATPTQVRYCADNHRRWLHGACWRHSNHVNRRLPPLAE
YIPLRMQDAAAQATCLIAVLIGSDISVPEQEMDSPRVRALLETASWTATIDSDLHSFQLEDTQRPVSQHIVSVLMHERGI
GVDEALRQSVALRDRFMTRFLHLQQECARTGSSELARFAHTLGYVISGYLQWAVDTSRYGQTEATFSFTDTPRDDTPEPP
PGIPSVEWLWTL
>P0A927 ~~~tsx~~~Nucleoside-specific channel-forming protein Tsx~~~COG3248
MKKTLLAAGAVLALSSSFTVNAAENDKPQYLSDWWHQSVNVVGSYHTRFGPQIRNDTYLEYEAFAKKDWFDFYGYADAPV
FFGGNSDAKGIWNHGSPLFMEIEPRFSIDKLTNTDLSFGPFKEWYFANNYIYDMGRNKDGRQSTWYMGLGTDIDTGLPMS
LSMNVYAKYQWQNYGAANENEWDGYRFKIKYFVPITDLWGGQLSYIGFTNFDWGSDLGDDSGNAINGIKTRTNNSIASSH
ILALNYDHWHYSVVARYWHDGGQWNDDAELNFGNGNFNVRSTGWGGYLVVGYNF
>P76055 2.8.1.-~~~ttcA~~~tRNA-cytidine(32) 2-sulfurtransferase~~~COG0037
MQENQQITKKEQYNLNKLQKRLRRNVGEAIADFNMIEEGDRIMVCLSGGKDSYTMLEILRNLQQSAPINFSLVAVNLDQK
QPGFPEHVLPEYLEKLGVEYKIVEENTYGIVKEKIPEGKTTCSLCSRLRRGILYRTATELGATKIALGHHRDDILQTLFL
NMFYGGKMKGMPPKLMSDDGKHIVIRPLAYCREKDIQRFADAKAFPIIPCNLCGSQPNLQRQVIADMLRDWDKRYPGRIE
TMFSAMQNVVPSHLCDTNLFDFKGITHGSEVVNGGDLAFDREEIPLQPACWQPEEDENQLDELRLNVVEVK
>Q9JZJ6 2.8.1.-~~~ttcA~~~tRNA-cytidine(32) 2-sulfurtransferase~~~
MSKKTKQELENNKLSKRLRHAVGDAINDFNMIEPDDKIMVCLSGGKDSYALLDILRQLQASAPIDFQLVAVNLDQKQPGF
PEEVLPTYLESIGVPYKIVEEDTYSTVKRVLDEGKTTCSLCSRLRRGILYRTAKELGCTKIALGHHRDDILATLFLNMFY
GGKLKAMPPKLVSDNGEHIVIRPLAYVKEKDLIKYAELKQFPIIPCNLCGSQPNLQRQVIGDMLRDWDKRFPGRIESMFS
ALQNVVPSHLADTELFDFVGLERGQSLKHGGDLAFDSEKMPERFSDGSEEDESEIKIEPQKAERKVINILANKPKTCGS
>Q8ZP88 2.8.1.-~~~ttcA~~~tRNA-cytidine(32) 2-sulfurtransferase~~~
MQEIQKNTKKEQYNLNKLQKRLRRNVGEAIADFNMIEEGDRIMVCLSGGKDSYTMLEILRNLQQSAPINFSLVAVNLDQK
QPGFPEHILPAYLEQLGVEYKIVEENTYGIVKEKIPEGKTTCSLCSRLRRGILYRTATELGATKIALGHHRDDILQTLFL
NMFYGGKMKGMPPKLMSDDGKHIVIRPLAYCREKDIIRFAEAKAFPIIPCNLCGSQPNLQRQVIADMLRDWDKRYPGRIE
TMFSAMQNVVPSHLCDTNLFDFKGITHGSEVVDGGDLAFDREEIPLQPAGWQPEEDDTALEALRLDVIEVK
>P05847 4.2.1.32~~~ttdA~~~L(+)-tartrate dehydratase subunit alpha~~~COG1951
MMSESNKQQAVNKLTEIVANFTAMISTRMPDDVVDKLKQLKDAETSSMGKIIYHTMFDNMQKAIDLNRPACQDTGEIMFF
VKVGSRFPLLGELQSILKQAVEEATVKAPLRHNAVEIFDEVNTGKNTGSGVPWVTWDIIPDNDDAEIEVYMAGGGCTLPG
RSKVLMPSEGYEGVVKFVFENISTLAVNACPPVLVGVGIATSVETAAVLSRKAILRPIGSRHPNPKAAELELRLEEGLNR
LGIGPQGLTGNSSVMGVHIESAARHPSTIGVAVSTGCWAHRRGTLLVHADLTFENLSHTRSAL
>P0AC35 4.2.1.32~~~ttdB~~~L(+)-tartrate dehydratase subunit beta~~~COG1838
MKKILTTPIKAEDLQDIRVGDVIYLTGTLVTCRDVCHRRLIELKRPIPYDLNGKAIFHAGPIVRKNGDKWEMVSVGPTTS
MRMESFEREFIEQTGVKLVVGKGGMGPLTEEGCQKFKALHVIFPAGCAVLAATQVEEIEEVHWTELGMPESLWVCRVKEF
GPLIVSIDTHGNNLIAENKKLFAERRDPIVEEICEHVHYIK
>P39414 ~~~ttdT~~~L-tartrate/succinate antiporter~~~COG0471
MKPSTEWWRYLAPLAVIAIIALLPVPAGLENHTWLYFAVFTGVIVGLILEPVPGAVVAMVGISIIAILSPWLLFSPEQLA
QPGFKFTAKSLSWAVSGFSNSVIWLIFAAFMFGTGYEKTGLGRRIALILVKKMGHRTLFLGYAVMFSELILAPVTPSNSA
RGAGIIYPIIRNLPPLYQSQPNDSSSRSIGSYIMWMGIVADCVTSAIFLTAMAPNLLLIGLMKSASHATLSWGDWFLGML
PLSILLVLLVPWLAYVLYPPVLKSGDQVPRWAETELQAMGPLCSREKRMLGLMVGALVLWIFGGDYIDAAMVGYSVVALM
LLLRIISWDDIVSNKAAWNVFFWLASLITLATGLNNTGFISWFGKLLAGSLSGYSPTMVMVALIVVFYLLRYFFASATAY
TSALAPMMIAAALAMPEIPLPVFCLMVGAAIGLGSILTPYATGPSPIYYGSGYLPTADYWRLGAIFGLIFLVLLVITGLL
WMPVVLL
>A0QQF4 ~~~ttfA~~~Trehalose monomycolate transport factor A~~~
MVPLWFTLSALCFVGAAVLLYVDIDRRRGLGRRRKSWAKSHGFDYEYESEDLLKRWKRGVMSTVGDVTAKNVVLGQIRGE
AVFIFDIEEVATVIALHRKVGTNVVVDLRLKGLKEPRENDIWLLGAIGPRMVYSTNLDAARRACDRRMVTFAHTAPDCAE
IMWNEQNWTLVAMPVTSNRAQWDEGLRTVRQFNDLLRVLPPVPQNGSQAALPRRGGSPSRPLAPTPAGRRELPPGRADVP
PARGDVSRFAPRPEAGRSDAFRRPPPARNGREASHFQR
>Q88N30 ~~~ttgA~~~Probable efflux pump periplasmic linker TtgA~~~COG0845
MQFKPAVTALVSAVALATLLSGCKKEEAAPAAQAPQVGVVTIQPQAFTLTSELPGRTSAYRVAEVRPQVNGIILKRLFKE
GSEVKEGQQLYQIDPAVYEATLANAKANLLATRSLAERYKQLIDEQAVSKQEYDDANAKRLQAEASLKSAQIDLRYTKVL
APISGRIGRSSFTEGALVSNGQTDAMATIQQLDPIYVDVTQSTAELLKLRRDLESGQLQKAGDNAASVQLVLEDGSLFKQ
EGRLEFSEVAVDETTGSVTLRALFPNPDHTLLPGMFVHARLKAGVNANAILAPQQGVTRDLKGAPTALVVNQENKVELRQ
LKASRTLGSDWLIEEGLNPGDRLITEGLQYVRPGVEVKVSDATNVKKPAGPDQANAAKADAKAE
>Q9WWZ9 ~~~ttgA~~~Toluene efflux pump periplasmic linker protein TtgA~~~
MQFKPAVTALVSAVALATLLSGCKKEEAAPAAQAPQVGVVTIQPQAFTLTSELPGRTSAYRVAEVRPQVNGIILKRLFKE
GSEVKEGQQLYQIDPAVYEATLANAKANLLATRSLAERYKQLIDEQAVSKQEYDDANAKRLQAEASLKSAQIDLRYTKVL
APISGRIGRSSFTEGALVSNGQTDAMATIQQLDPIYVDVTQSTAELLKLRRDLESGQLQKAGNNAASVQLVLEDGSLFKQ
EGRLEFSEVAVDETTGSVTLRALFPNPDHTLLPGMFVHARLKAGVNANAILAPQQGVTRDLKGAPTALVVNQENKVELRQ
LKASRTLGSDWLIEEGLNPGDRLITEGLQYVRPGVEVKVSDATNVKKPAGPDQANAAKADAKAE
>Q88N31 ~~~ttgB~~~Probable efflux pump membrane transporter TtgB~~~COG0841
MSKFFIDRPIFAWVIALVIMLVGALSILKLPINQYPSIAPPAIAIAVTYPGASAQTVQDTVVQVIEQQLNGIDNLRYVSS
ESNSDGSMTITATFEQGTNPDTAQVQVQNKLNLATPLLPQEVQQQGIRVTKAVKNFLLVIGLVSEDGSMTKDDLANYIVS
NMQDPISRTAGVGDFQVFGAQYAMRIWLDPAKLNKFQLTPVDVKTAVAAQNVQVSSGQLGGLPALPGTQLNATIIGKTRL
QTAEQFESILLKVNKDGSQVRLGDVAQVGLGGENYAVSAQFNGKPASGLAVKLATGANALDTAKALRETIKGLEPFFPPG
VKAVFPYDTTPVVTESISGVIHTLIEAVVLVFLVMYLFLQNFRATIITTMTVPVVLLGTFGILAAAGFSINTLTMFAMVL
AIGLLVDDAIVVVENVERVMSEEGLPPKEATKRSMEQIQGALVGIALVLSAVLLPMAFFGGSTGVIYRQFSITIVSAMGL
SVLVALIFTPALCATMLKPLKKGEHHTAKGGFFGWFNRNFDRSVNGYERSVGAILRNKVPFLLAYALIVVGMIWLFARIP
TAFLPEEDQGVLFAQVQTPAGSSAERTQVVVDQMREYLLKDEADTVSSVFTVNGFNFAGRGQSSGMAFIMLKPWDERSKE
NSVFALAQRAQQHFFTFRDAMVFAFAPPAVLELGNATGFDVFLQDRGGVGHEKLMEARNQFLAKAAQSKILSAVRPNGLN
DEPQYQLTIDDERASALGVTIADINNTLSIALGASYVNDFIDRGRVKKVYIQGEPSARMSPEDLQKWYVRNGAGEMVPFS
SFAKGEWTYGSPKLSRYNGVEAMEILGAPAPGYSTGEAMAEVERIAGELPSGIGFSWTGMSYEEKLSGSQMPALFALSVL
FVFLCLAALYESWSIPIAVVLVVPLGIIGALIATSLRGLSNDVYFLVGLLTTIGLAAKNAILIVEFAKELHEQGRSLYDA
AIEACRMRLRPIIMTSLAFILGVVPLTIASGAGAGSQHAIGTGVIGGMISATVLAIFWVPLFFVAVSSLFGSKEPEKDVT
PENPRYEAGQ
>O52248 ~~~ttgB~~~Toluene efflux pump membrane transporter TtgB~~~
MSKFFIDRPIFAWVIALVIMLVGALSILKLPINQYPSIAPPAIAIAVTYPGASAQTVQDTVVQVIEQQLNGIDNLRYVSS
ESNSDGSMTITATFEQGTNPDTAQVQVQNKLNLATPLLPQEVQQQGIRVTKAVKNFLLVIGLVSEDGSMTKDDLANYIVS
NMQDPISRTAGVGDFQVFGAQYAMRIWLDPAKLNKFQLTPVDVKTAVAAQNVQVSSGQLGGLPALPGTQLNATIIGKTRL
QTAEQFESILLKVNKDGSQVRLGDVAQVGLGGENYAVSAQFNGKPASGLAVKLATGANALDTAKALRETIKGLEPFFPPG
VKAVFPYDTTPVVTESISGVIHTLIEAVVLVFLVMYLFLQNFRATIITTMTVPVVLLGTFGILAAAGFSINTLTMFAMVL
AIGLLVDDAIVVVENVERVMSEEGLPPKEATKRSMEQIQGALVGIALVLSAVLLPMAFFGGSTGVIYRQFSITIVSAMGL
SVLVALIFTPALCATMLKPLKKGEHHTAKGGFFGWFNRNFDRSVNGYERSVGTILRNKVPFLLAYALIVVGMIWLFARIP
TAFLPEEDQGVLFAQVQTPAGSSAERTQVVVDQMREYLLKDEADTVSSVFTVNGFNFAGRGQSSGMAFIMLKPWDERSKE
NSVFALAQRAQQHFFTFRDAMVFAFAPPAVLELGNATGFDVFLQDRGGVGHAKLMEARNQFLAKAAQSKILSAVRPNGLN
DEPQYQLTIDDERASALGVTIADINNTLSIALGASYVNDFIDRGRVKKVYIQGEPSARMSPEDLQKWYVRNGAGEMVPFS
SFAKGEWTYGSPKLSRYNGVEAMEILGAPAPGYSTGEAMAEVERIAGELPSGIGFSWTGMSYEEKLSGSQMPALFALSVL
FVFLCLAALYESWSIPIAVVLVVPLGIIGALIATSLRGLSNDVYFLVGLLTTIGLAAKNAILIVEFAKELHEQGRSLYDA
AIEACRMRLRPIIMTSLAFILGVVPLTIASGAGAGSQHAIGTGVIGGMISATVLAIFWVPLFFVAVSSLFGSKEPEKDVT
PENPRYEAGQ
>Q88N32 ~~~ttgC~~~Probable efflux pump outer membrane protein TtgC~~~COG1538
MTKSLLSLAVTAFILGGCSLIPDYQTPEAPVAAQWPQGPAYSPTQSADVAAAEQGWRQFFHDPALQQLIQTSLVNNRDLR
VAALNLDAYRAQYRIQRADLFPAVSATGSGSRQRVPANMSQTGESGITSQYSATLGVSAYELDLFGRVRSLTEQALETYL
SSEQARRSTQIALVASVANAYYTWQADQALFKLTEETLKTYEESYNLTRRSNEVGVASALDVSQARTAVEGARVKYSQYQ
RLVAQDVNSLTVLLGTGIPADLAKPLELDADQLAEVPAGLPSDILQRRPDIQEAEHLLKAANANIGAARAAFFPSISLTA
NAGSLSPDMGHLFSGGQGTWLFQPQINLPIFNAGSLKASLDYSKIQKDINVAKYEKTIQTAFQEVSDGLAARKTFEEQLQ
AQRDLVQANQDYYRLAERRYRIGIDSNLTFLDAQRNLFSAQQALIGDRLSQLTSEVNLYKALGGGWYEQTGQANQQASVE
TPKG
>Q9WWZ8 ~~~ttgC~~~Toluene efflux pump outer membrane protein TtgC~~~
MTKSLLSLAVTAFILGGCSLIPDYQTPEAPVAAQWPQGPAYSPTQSADVAAAEQGWRQFFHDPALQQLIQTSLVNNRDLR
VAALNLDAYRAQYRIQRADLFPAVSATGSGSRQRVPANMSQTGESGITSQYSATLGVSAYELDLFGRVRSLTEQALETYL
SSEQARRSTQIALVASVANAYYTWQADQALFKLTEETLKTYEESYNLTRRSNEVGVASALDVSQARTAVEGARVKYSQYQ
RLVAQDVNSLTVLLGTGIPADLAKPLELDADQLAEVPAGLPSDILQRRPDIQEAEHLLKAANANIGAARAAFFPSISLTA
NAGSLSPDMGHLFAGGQGTWLFQPQINLPIFNAGSLKASLDYSKIQKDINVAKYEKTIQTAFQEVSDGLAARKTFEEQLQ
AQRDLVQANQDYYRLAERRYRIGIDSNLTFLDAQRNLFSAQQALIGDRLSQLTSEVNLYKALGGGWYEQTGQANQQASVE
TPKG
>Q9KWV5 ~~~ttgD~~~Toluene efflux pump periplasmic linker protein TtgD~~~
MRLERALRARQLIPLAAIWLLVGCGKQETVESTAVPPEVGVYTVKAQALTLTTDLPGRTSAYRVSEVRPQASGILQKRMF
VEGAEVKQGEQLYQIDPRTYEALLARAEASLLTAQNLARRYERLLDTNAISQQQYDDAMATWKQAQAEAQMARINMQYTK
VLAPITGRIGRSAVTEGALVTNGQAQELATVTQLDPIYVDVNQPITRLLGLKRALESGRLQRVGDNQAQVSLTLDDGTPY
PLKGVLKFSEVSVAPSTGSVTLRAEFPNPDHKLLPGMFVHALLNEGEQQAAILVPHQAVGRDARGVPTVWVVKPDNTVES
REVQTLQTVGNAWLLGAGINDGERVITEGVQLARSGITVKPVAAKNVKLMSEFGSQVQAQAH
>Q9KWV4 ~~~ttgE~~~Toluene efflux pump membrane transporter TtgE~~~
MSRFFIDRPIFAWVLAIIAMLAGALSLTKMPISQYPNIAAPAVSIQVVYPGASAKTVQDTVVQVIEQQLNGLDGFRYMAA
ESASDGSMNIIVTFEQGTNPDIAQVQVQNKLQLATPRLPEEVQRQGLRVVKYQMNFFMVVGLVDKTGKMTNFDLGNLIAS
QLQDPISRINGVGDFLLFGSPYAMRIWLDPGKLNSYQLTPGDVAQAIREQNVQVSSGQLGGLPTRSGVQLNATVVGKTRM
TTPAEFEEILVKVKADGSQVRVKDLGRVVLASENFAISAKYRGQDSAGLGLRLASGGNLLETVKAVKAELEKQKAYLPEG
VEVIYPYDTSPVVEASIDSVVHTILEAVVLVFLVMFLFLQSLRATIIPTLAVPVVLLAAFALLPYFGISINVLTMYAMVL
AIGLLVDDAIVVVENVERLMHDEGLSPLEATRKSMGQISGALVGIGMVLSAVFVPMAFFGGSAGIIYKQFAVTIVICMSL
SVLVALIFTPALCATILKAPENDAHHEKKGFFGWFNRSFDRNSARFERGVGGILKHRGRYLLIFALITAGTGYLFTQIPK
AFLPSEDQGLMMTEVRMPLNASAERTEVVLQEVKDYLLKEEGQLVDHVMTVNGFNFAGRGQNSGLVLVVLKDWAARQAAG
EDVLSVAERANARFARIKDATVMAFVPPAVLEMGNAMGFDLYLQDNLGLGHESLMAARNQFLELAAENPSLRAVRPNGKD
DEPQFQVKIDDEKARALQVSIASINDTMSAAWGSMYVNDFIDLGRVKRVYIQGVDSSRIAPEDFDKWYVRNALGEMVPFS
AFATGEWIHGSPKLERYGGISAVNILGEPAPGFSTGDAMIAIAQIMQQLPSGIGLSYNGLSYEEIRTGDQAPMLYALTVL
IVFLCLAALYESWSVPMSVILVVPLGIFGAVLATLWRGLEADVYFQVGLMTTVGLSAKNAILIIEFAKELYEKEGVPLVK
AAIEAARLRLRPIIMTSLAFTFGVLPMARATGAGAGSQHSIATGVVGGMITATVLAVFFVPLFYVVVVKVFERNKKPAAL
AEEELA
>Q9KWV3 ~~~ttgF~~~Toluene efflux pump outer membrane protein TtgF~~~
MKTHYLSIALSVALSGCSLIPDYQRPPAPIQAGWPQGEAYAKLKAGTHRPSQTRDAELNWQVFFRDPVMRELIATALNNN
RDLRQTALNVEAYRALHRIERSALLPRANTGVGATRQRLPADLSPTGEAGIQSQYDTTLSMSYELDMFGRLRSLERAALQ
EYLAAAETQRSMQIALIADVAIAYLSWRSDQAQLDLARSTLASYENSLNLIKSSREVGTASALDVRQARSLVETARVQQT
LYTRQVAQDMNALQLLLGTKLPADLPISDVLDQPLAALSTGLPADLLLHRPDIRAAEHRLLAANANIGAARAAFFPSITL
TAAAGTASHELDGLFEGGSGLWAFMPRINLPIFTAGRLRGNLDYRNVIKDINVAEYEKSIQTAFREVADGLAARGTFGEQ
LQAQRDLVDNNQAYYKLAYQRYDEGVDNYLAVLDAQRELFAAQQQFLSDRLNQLSSEVRLFKALGGGWDNISSQPLTAQN
>Q93PU5 ~~~ttgG~~~Toluene efflux pump periplasmic linker protein TtgG~~~
MRAERWSQTVRQIRSPRALRVIPLTALMLISGCGEKEQVSSATPPPDVGVYTVRAQALTLTTDLPGRTSAFRVAEVRPQV
SGILQKRSFVEGAEVKLGQQLYQIDPRTYEAQLRRAEANRTSAQNLARRYETLLKTKAVSKQQYDDALAAWKQAEADYQV
ARIDVQYTRVLSPISGRIGRSTVTEGALVTNGQAQSLATVTQLDPIYVDVTQPITKLLGLQKALESGRLQKTGENQAEVS
LTLDDGSAYPLPGTLKFSEVSVDPTTGSVTLRAEFPNPNRKLLPGMFVHALLKEGVQNAAILVPQQAISRDTRGVPSVWV
VKADNTVESREIQTLRTVGNAWLISNGVTEGERIITEGVQRVRSGIAVNAVEAKNVNLVDGFAATTEASAN
>Q93PU4 ~~~ttgH~~~Toluene efflux pump membrane transporter TtgH~~~
MSRFFIDRPIFAWVLAIVAMLAGALSLAKMPISQYPNIAAPAVSIQVSYPGASAQTVQDTVVQVIEQQLSGLDGFRYMSA
ESASDGSMTIIVTFEQGTDPDIAQVQVQNKLQLATPRLPEEVQRQGLRVVKYQMNFFLVMSLVDRSGKLDNFDLGNLIAS
QLQDPISRIPGVGDFQLFGSPYAMRIWLDPGKLNSYQLTPTDVASAIREQNVQVSSGQLGGLPTRSGVQLNATVLGKTRM
TTPSQFDEILVKVNPDGSQVRVKDVGRAELGADSFAISAQYKDSPTASLALRLSTGGNLLETVDAVKKLMEQQKAYLPDG
VEVIYPYDTTPVVEASIESVVHTIFEAVVLVFLVMYLFLQSFRATLIPTLAVPVVLLATFALLPYFGLNINVLTMYAMVL
AIGLLVDDAIVVVENVERLMHDEGLSPLEATRKSMDQISGALVGIGMVLSAVFVPMAFFGGSAGIIYQQFAITIVVCMGL
SILVALVFTPALCVTILKAPEGNSHHERKGFFGWFNRIFDRGTRRFERGVGAMLKGRGRYLLAFLLITGGTGYLFTQIPK
AFLPNEDQGLMMIEVRTPANASAERTEGVLQEVRDYLANDEGALVEHFMTVNGFNFAGRGQNSGLVLITFKDWKERHGAG
QDVFSIAQRANQHFAKIKDASVMAFVPPAILEMGNAMGFNLYLQDNLGLGHEALMAARNQFLQLASQNPKLQAVRPNGKD
DEPQFQVNIDDEKARALQVSIASINETMSAAWGSMYVNDFIDRGRVKRVYVQGEDISRISPEDFDKWYVRNSLGQMVPFS
AFATGEWVNGSPKLERYGGISSLNILGEPAPGYSTGDAMIAIAEIMQQLPAGIGLSYTGLSYEEIQTGDQAPLLYALTVL
IVFLCLAALYESWSVPVSVIMVVPLGILGAVLATLWRDLTADVYFQVGLMTTVGLSAKNAILIVEFAKELYEKEGYPIVK
AAIEAAKLRLRPILMTSLAFTFGVLPMAIASGAGAGSQHSIATGVVGGMITATVLAVFFVPLFYVVVVKLFEGLMKRKPN
AVKEVTHEV
>Q93PU3 ~~~ttgI~~~Toluene efflux pump outer membrane protein TtgI~~~
MKFKSLPMFALLMLGGCSLIPDYQQPAAPMQAQWPTGQAYGGQGDQRSIATALPKAKEFFKDPALVRLLDAALENNRDLR
IAAKNVESYRALYRIQRAERFPTLDGQASGNRTRLPDDLSPTGDSRIDSQYQVGLVTAYELDLFGRIRSLSNQALEKYLA
TEEAQRSVQIALIGDVATTYFLWRTDQALLELTEATLTSYVESLAMIESSAWAGTSSELDVRQARTLVNQAQAQQALYTR
RIAQDVNALELLLGSKIPTDLPKNSPLAMSALGKVPAGLPADLLLNRPDIRSAEHQLMAANANIGAARAAFFPRISLTAS
AGSASSDLDGLFNSGSDSWSFAPQISVPIFNAGKLRANLDYAELQKDVGVATYEKSIQTAFREVADGLAARGTYGKQLSA
QSELVDNYKAYFSLAQQRYDQGVDSYLTVLDAQRELFSSQQKLLNDQLDQINSEVQLYKALGGGWSVSQN
>Q9AIU0 ~~~ttgR~~~HTH-type transcriptional regulator TtgR~~~
MVRRTKEEAQETRAQIIEAAERAFYKRGVARTTLADIAELAGVTRGAIYWHFNNKAELVQALLDSLHETHDHLARASESE
DEVDPLGCMRKLLLQVFNELVLDARTRRINEILHHKCEFTDDMCEIRQQRQSAVLDCHKGITLALANAVRRGQLPGELDA
ERAAVAMFAYVDGLIRRWLLLPDSVDLLGDVEKWVDTGLDMLRLSPALRK
>Q93PU6 ~~~ttgV~~~HTH-type transcriptional regulator TtgV~~~
MNQSDENIGKAGGIQVIARAASIMRALGSHPHGLSLAAIAQLVGLPRSTVQRIINALEEEFLVEALGPAGGFRLGPALGQ
LINQAQTDILSLVKPYLRSLAEELDESVCLASLAGDKIYVLDRIVSERELRVVFPIGINVPAAATAAGKVLLAALPDETL
QAALGEQLPVLTSNTLGRKALVKQLSEVRQSGVASDLDEHIDGVCSFATLLDTYLGYYSLAIVMPSSRASKQSDLIKKAL
LQSKLNIERAIGRASKKAP
>Q93PU7 ~~~ttgW~~~Uncharacterized HTH-type transcriptional regulator TtgW~~~
MARKTAAEAKETRQRIIDAALEVFVAQGVPDATLDQIARKAGVTRGAVYWHFNGKLEVLQAVLASRQHPLELDFTPDLGI
ERSWEAVVVAMLDAVHSPQSKQFSEILIYQGLDESGLIHNRMVQASDRFLQYIR
>B7J3C9 3.12.1.-~~~tth~~~Tetrathionate hydrolase~~~COG1520
MPSIVRNHGPHNKILLSALLLALFGWVPLASAAVAVPMDSTGPYRTVSHPENAPSGVDAGVGPSEWTHAYANPAHNAAFP
VPDDAPEWIRNGVSWLFPEARAWPLANPPFGSKTYGAAEASVTQTQFYGNALGPSVVDGVVYAESDDMFAYAVNAKTGKL
IWRASPVGNNLMGNPLVIGNTVYLSAGSVAFNFANVLRYAHNPSASARGLNVSFNGIYALNRSNGKLLWYFATPGETMAT
PAYDNNTLFIADGAGNAFGINATTGKQVWKTHVGGMDNMSSVTAYRHNIYFAMAIKPYLYCLNESNGHIVWKGTIPGASN
TGIGDVSPAAADGVVVLDATTKPQANKKAMFSNVIRAFDAKTGAVLWTRNMGSGGKIPAFKGGVPMIHNNIVYVGNPVAS
TYQAYELKTGKLLWTWHVPTKVAAGAGRSAPTYYKGLLYITTGQYIFVVNPATGKELHQHHIGGQFGIESPVIVGGTVYL
TNSWDWIMAIPLKTISHGS
>Q9Z4S6 1.8.-.-~~~ttrA~~~Tetrathionate reductase subunit A~~~
MANLTRRQWLKVGLAVGGMVTFGLSYRDVAKRAIDGLLNGTSGKVTRDRIFGNALIPEAQAQTHWQQNPQQTIAMTQCFG
CWTQCGIRARVNADGKVIRIAGNPYHPLSQEHPIDSSVPFSEAMEQLAGESGLDARSTACARGATLLESLYSPLRLLEPM
KRVGKRGEGKWQRISFEQLIEEVVEGGDLFGEGHVDGLRAIHAPDTPIDAKHPSFGPKTNQLLVTNTSDEGRDAFLRRFA
LNSFGSKNFGAHGAYCGLAYRAGSGALMGDLDKNPHVKPDWENVEFALFMGTSPAQSGNPFKRQARQLASARLRENFQYV
VVAPALPLSTVLADPRGRWQPVMPGSDSALAMGMIRWIMDNQRYNADYLAIPGVQAMQQAGEQSWTNATHLVIADELPTL
AGQHLTLRHLTPDGEETPVVLNTDGELVDASTCRQARLFVTQYVTLADGQRVTVKSGLQRLKEAAEKLSLAQYSEQCGVP
EAQIIALAETFTSHGRKAAVISHGGMMAGNGFYNAWSVMMLNALIGNLSLSGGVFVGGGKFNGVSDGPRYNMNSFAGKVK
PSGLSIARSKTAYEASEEYRDKIAGGQSPYPAKAPWYPFVAGQLTELLTSALEGYPYPLKAWISNMSNPFYGVPGLRAVA
EEKLKDPRRLPLFIAIDAFMNETTALADYIVPDTHNFESWGFTAPWGGVASKATTARWPVVAPATHRTADGQPVSMEAFC
IAVAKRLHLPGFGDRAITDPQGNTFPLNRAEDFYLRVAANIAFMGKTPVALANQEDISLTGVSRILPAIQHTLKADEVGR
VAFIYSRGGRFAPEDSGYTEQRLGNAWKKPLQIWNADVAAHRHAITGERFSGCPVWYPARLSDGRAIDDQFPIGQWPLKL
ISFKSNTMSSSTAVIPRLHHVKPANLVALNPQDGERYGLQHGDRVRIITPGGQVVAQISLLNGVMPGVIAIEHGYGHREM
GATQHSLDGVPMPYDPQIRAGINLNDLGFADPTRTITNTWLDWVSGAAVRQGLPAKIERI
>Q7CQM9 ~~~ttrB~~~Tetrathionate reductase subunit B~~~
MWTGVNMDSSKRQFLQQLGVLTAGASLVPLAEAKFPFSPERHEGSPRHRYAMLIDLRRCIGCQSCTVSCTIENQTPQGAF
RTTVNQYQVQREGSQEVTNVLLPRLCNHCDNPPCVPVCPVQATFQREDGIVVVDNKRCVGCAYCVQACPYDARFINHETQ
TADKCTFCVHRLEAGLLPACVESCVGGARIIGDIKDPHSRIATMLHQHRDAIKVLKPENGTSPHVFYLGLDDAFVTPLMG
RAQPALWQEV
>Q9Z4S7 ~~~ttrC~~~Tetrathionate reductase subunit C~~~
MTHSLIIEEVLAHPQDISWLPWAVQYFFFIGIAACAALFACYLHWRKKDAATEENRALLIAITCAITAPLALTADLHQTA
RVWHFYAWPTPWSWMPWGALFLPLFTGFLALWFLAQQIKRLFNKSYNVTKWLALASALCAVGLLIYTGREVSVVLARPIW
FSYAFPVAMFLSALQAFFALMIVAARRDSVRLPKILWGQIWTLAALGLVVAMWVSGDTLSGTAIRQWITVALSAKYYAVG
WVALWVCTLLFCSLALRHPLSQLRRVLLVLSALALCWLMRWTLLIQVQTVPKFNAQFNPYSLPGGTDGWLAILGTFGLWI
ALLIIIRETLNGLTRRLQHG
>Q7CQM8 ~~~ttrR~~~Tetrathionate response regulatory protein TtrR~~~
MATIHLLDDDTAVTNACAFLLESLGYDVKCWTQGADFLAQASLYQAGVVLLDMRMPVLDGQGVHDALRQCGSTLAVVFLT
GHGDVPMAVEQMKRGAVDFLQKPVSVKPLQAALERALTVSSAAVARREIILCYQQLTPKERELASLVAKGFMNREIAEAM
NIAVRTVEVHRARVMEKMQAGSLAELIRRFEKMASPETRIRTTYEP
>Q8ZPP6 2.7.13.3~~~ttrS~~~Tetrathionate sensor histidine kinase TtrS~~~
MRGKTVRRLAVLAAVGLLCHGAWAGTWNIGILAMRGEASTRSHWQPLAKTLSQQLPGETFHIQPLDLHQMQEAVNQGTVQ
FVITNPAQFVQLNSHAPLRWLASLRSTRDGKAVSNVIGSVILTRRDSGITTAHDLIGKTVGAIDAQAFGGYLLGYKALSD
AGLRPERDFHLRFTGFPGDALVYMLREKAVQAAIVPVCLLENMDQEGLINKKDFIALLSRPTPLPCLTSTPLYPDWSFAA
LPAVSDALADRVTRALFNAPAAASFHWGAPASTSQVEALLRDVRQHPQQRRLWLDVKSWLIQHQLMVGGVILAFLLLTLN
YIWVMLLVRRRGKQLERNSVVLHQHERALETARQMSVLGEMTSGFAHELNQPLSAIRHYAQGCLIRLRAADEQHPLLPAL
EQIDQQAQRGADTLRNLRHWVSQAQGNPVLTEAWKAIAIREAIDHVWQLLRMAQQFPTVTLHTEVSAALRVTLPSVLLEQ
VLANIILNAAQAGATHLWIVAERTENGISIVLQDNAGGIDEALLRQAFQPFMTTRKEGMGLGLAICQRLVRYGRGDISIR
NQTAPDGLSGTVVTIHFLHENGGRDGDNSSTG
>P16966 2.3.1.-~~~ttr~~~Acetyltransferase~~~
MNHAQLRRVTAESFAHYRHGLAQLLFETVHGGASVGFMADLDMQQAYAWCDGLKADIAAGSLLLWVVAEDDNVLASAQLS
LCQKPNGLNRAEVQKLMVLPSARGRGLGRQLMDEVEQVAVKHKRGLLHLDTEAGSVAEAFYSALAYTRVGELPGYCATPD
GRLHPTAIYFKTLGQPT
>Q9WY40 2.8.1.-~~~ttuA~~~tRNA-5-methyluridine(54) 2-sulfurtransferase~~~COG0037
MKCTKCGKPASVKLRHYNIKLCKEHFNEFIEQRVEKAIKKFKMFGRNSKILIAVSGGKDSVSLWHMLKKLGYEVDALFIR
AGKSGMVQKAQEIVEKNAELLNTKLHIVDATEYFGGLSTQEISIMLRRPVCSICGVVRRYLMNKFAYENGYDVVVTGHNL
NDEASVLLGNILHWQEGYLERQWPLLPKTHEKLVPKAKPLVLNYEEDIKLYATLNEIPHLEMACPFSVGATSLVYKKILR
ELEEEQPGITLNFYLGFLKRKKEPKFEVEGLRECKECGYPTTAEVCSFCRLRKQVEKRKNKTPA
>Q72LF3 2.8.1.15~~~ttuA~~~tRNA-5-methyluridine(54) 2-sulfurtransferase~~~COG0037
MVCKVCGQKAQVEMRSRGLALCREHYLDWFVKETERAIRRHRMLLPGERVLVAVSGGKDSLALWDVLSRLGYQAVGLHIE
LGIGEYSKRSLEVTQAFARERGLELLVVDLKEAYGFGVPELARLSGRVACSACGLSKRYIINQVAVEEGFRVVATGHNLD
DEAAVLFGNLLNPQEETLSRQGPVLPEKPGLAARVKPFYRFSEREVLSYTLLRGIRYLHEECPNAKGAKSLLYKEALNLV
ERSMPGAKLRFLDGFLEKIRPRLDVGEEVALRECERCGYPTTGAVCAFCRMWDAVYRRAKKRKLLPEEVSFRPRVKPLRA
G
>Q72LF4 ~~~ttuB~~~Sulfur carrier protein TtuB~~~COG2104
MRVVLRLPERKEVEVKGNRPLREVLEELGLNPETVVAVRGEELLTLEDEVREEDTLEVLSAISGG
>Q44471 1.1.1.93~~~ttuC~~~Probable tartrate dehydrogenase/decarboxylase TtuC~~~
MREYKIAAIPADGIGPEVIAAGLQVLEALEQRSGDFKIHTETFDWGSDYYKKHGVMMPADGLDKLKKFDAIFFGAVGAPD
VPDHITLWGLRLPICQGFDQYANVRPTKILPGITPPLRNCGPGDLDWVIVRENSEGEYSGHGGRAHRGLPEEVGTEVAIF
TRVGVTRIMRYAFKLAQARPRKLLTVVTKSNAQRHGMVMWDEIAAEVATEFPDVTWDKMLVDAMTVRMTLKPETLDTIVA
TNLHADILSDLAGALAGSLGVAPTANIDPERRFPSMFEPIHGSAFDITGKGIANPIATFWTAAQMLEHLGERDAAARLMS
AVERVTEAGILTPDVGGTANTRQVTEAVCNAIAGSNILKMAAAE
>P70787 1.1.1.93~~~ttuC~~~Probable tartrate dehydrogenase/decarboxylase TtuC~~~
MREYKIAAIPADGIGPEVIAAGLQVLEALEKRSGDFSIHTETFDWGSDYYKKNGVMMPADGLEQLKKFDAIFFGAVGAPD
VPDHITLWGLRLPICQGFDQYANVRPTKVLPGITPPLRNCGPGDLDWVIVRENSEGEYSGHGGRAHKGLPEEVGTEVAIF
TRVGVTRIMRYAFKLAQARPRKLLTVVTKSNAQRHGMVMWDEIAAEVSKEFPDVTWDKMLVDAMTVRMTLKPQSLDTIVA
TNLHADILSDLAGALAGSLGVAPTANIDPERRFPSMFEPIHGSAFDITGKGIANPVATFWTAAQMLEHLGEKDAATRLMS
AVERVTEAGILTPDVGGTADTQQVTDAVCEAIAGSNILNMAAVG
>O34296 1.1.1.93~~~ttuC'~~~Probable tartrate dehydrogenase/decarboxylase TtuC'~~~
MREYKIAAIPADGIGPEVIAAGLQVLEALEQRSGDFKIHTETFDWGSDYYKKHGVMMPADGLDKLKKFDAIFFGAVGAPD
VPDHITLWGLRLPICQGFDQYANVRPTKILPGITPPLRNCGPGDLDWVIVRENSEGEYSGHGGRAHRGLPEEVGTEVAIF
TRVGVTRIMRYAFKLAQARPRKLLTVVTKSNAQRHGMVMWDEIAAEVATEFPDVTWDKMLVDAMTVRMTLKPETLDTIVA
TNLHADILSDLAGALAGSLGVAPTANIDPERRFPSMFEPIHGSAFDITGKGIANPIATFWTAAQMLEHLGERDAAARLMS
AVERVTEAGILTPDVGGTANTSQVTEAVCNAIAGSNII
>P70792 1.1.1.93~~~ttuC'~~~Probable tartrate dehydrogenase/decarboxylase TtuC'~~~
MREYKIAAIPADGIGPEVIAAGLQVLEALEKRSGDFSIHTETFDWGSDYYKKNGVMMPADGLEQLKKFDAIFFGAVGAPD
VPDHITLWGLRLPICQGFDQYANVRPTKVLPGITPPLRNCGPGDLDWVIVRENSEGEYSGHGGRAHKGLPEEVGTEVAIF
TRVGVTRIMRYAFKLAQARPRKLLTVVTKSNAQRHGMVMWDEIAAEVSKEFPDVTWDKMLVDAMTVRMTLKPQSLDTIVA
TNLHADILSDLAGALAGSLGVAPTANIDPERRFPSMFEPIHGSAFDITGKGIANPVATFWTAAQMLEHLGEKDAATRLMS
AVERVTEAGILTPDVGGTANTQQVTDAVCEAIAGSNIL
>O34295 1.1.1.93~~~ttuC'~~~Probable tartrate dehydrogenase/decarboxylase TtuC'~~~
MREYKIAAIPADGIGPEVIAAGLQVLEALEQRSGDFKIHTETFDWGSDYYKKHGVMMPADGLDKLKKFDAIFFGAVGAPD
VPDHITLWGLRLPICQGFDQYANVRPTKILPGITPPLRNCGPGDLDWVIVRENSEGEYSGHGGRAHRGLPEEVGTEVAIF
TRVGVTRIMRYAFKLAQARPRKLLTVVTKSNAQRHGMVMWDEIAAEVATEFPDVTWDKMLVDAMTVRMTLKPETLDTIVA
TNLHADILSDLAGALAGSLGVAPTANIDPERRFPSMFEPIHGSAFDITGKGIANPIATFWTAAQMLEHLGERDAAARLMG
AVERVTEAGILTPDVGGTANTSQVTEAVCNAIAGSNII
>Q51945 1.1.1.93~~~~~~Tartrate dehydrogenase/decarboxylase~~~
MPAHSFRIAAIPGDGIGLEVLPEGIRVLEAAALKHGLALEFDTFEWASCDYYLQHGKMMPDDWAEQLKQYDAIYFGAVDW
PDKVPDHISLWGSLLKFRREFDQYVNIRPVRLFPGVPCALANRKVGDIDFVVVRENTEGEYSSLGGIMFENTENEIVIQE
SIFTRRGVDRILKYAFDLAEKRERKHVTSATKSNGMAISMPYWDKRTEAMAAHYPHVSWDKQHIDILCARFVLQPERFDV
VVVASNLFGDILSDLGPACAGTIGIAPSANLNPERNFPSLFEPVHGSAPDIFGKNIANPIAMIWSGALMLEFLGQGDERY
QRAHDDMLNAIERVIADGSVTPDMGGTLSTQQVGAAISDTLARLD
>Q72J02 2.7.7.80~~~ttuC~~~Sulfur carrier protein adenylyltransferase~~~COG0476
MRWTKEELDRYHRQMILPQVGPEGQERLKRASVVVVGAGGLGVPVLQYLVAAGVGRVGVVEMDRVEVSNLHRQVLYTTED
VGEPKALVAQKRLQALNPLVRVEAYPVRLTSENALEILRPYDLVVDASDNFPTRYLVNDAAVLLGKPLVFGAIYQFDGQV
AVFHHPTLHGEMGPCYRCLFPKPPPPGAVPSCAEAGVFGVLPAVVGSLMAAEALKVLLGIGKPLAGHLLLYDALEASFRK
LTVRRNPRCPVCGDEPTQRELVDYEAFCGLR
>P70788 1.1.1.81~~~ttuD~~~Putative hydroxypyruvate reductase~~~
MGRCVVIGAGKASAAMAAALDTVWSDVDLSGVVVTRYGHAVPSGRIRIIEASHPVPDDMSVEAAVRIIGAVRDLGPDDLV
IALISGGGSSLLVSPAAGMTLADKKAVNKALLASGATISEMNAVRKQLSGIKGGRLAQLAHPARVVTLVISDVPGDDPSE
IASGPTVANDTTIDDAREIVSRYRLVLPPAAQDVLANGGARNCTGKVSSDVRMIASPSMALDAAAAVAAANGLTPVILGD
ALQGEARDVGTVFAGIAMSASTKGLPVRGPAVLLSGGETSVSLPADTKGRGGRNSEFLLSLAIGLDGAKGIWALSGDTDG
IDGIEDAAGAMIGPDSLARMRGSGIDPRSALSRHDSYTAFKAIDDLVITGPTLTNVNDIRAILIG
>Q44472 1.1.1.81~~~ttuD~~~Putative hydroxypyruvate reductase~~~
MQRNRRIEYLEDGRCRMTWNDVSARQVLRRIFDAAVASADPKIAVVNNLPERPRGRCVVVGAGKASAAMAAAVDAAWPDV
DLSGIVVTRYGHAVPAGRIEILEASHPVPDEMSIKAAEKIFAAVQGLGPDDLVVALISGGGSSLLVSPTGKMTLTDKRAV
NQALLASGATISEMNTVRKHLSAIKGGHLARAALPAKLVTLIISDVPGDDPSEIASGPTVADPTTLADAAAIIARYGIDL
PESARAVLVQGNETPKAGEVAGEIRLVAAPSIALEAAAAAALDAGLCPLILGDALEGEAREMGRVMAGIALSARDKGLPV
AAPAVILSGGESTVSLGAMTEGRGGRNTEFLLSLAVALKGASGIWAIAGDTDGIDGVEDAAGALVAPDSLIRMRDAGIDP
RATLSAHDSYTAFKAIGDLVVTGPTLTNVNDIRAILIG
>Q72JV2 ~~~ttuD~~~Sulfur carrier protein TtuD~~~COG2897
MGYAHPEVLVSTDWVQEHLEDPKVRVLEVDEDILLYDTGHIPGAQKIDWQRDFWDPVVRDFISEEEFAKLMERLGISNDT
TVVLYGDKNNWWAAYAFWFFKYNGHKDVRLMNGGRQKWVEEGRPLTTEVPSYPPGRYEVPYRDESIRAYRDDVLEHIIKV
KEGKGALVDVRSPQEYRGELTHMPDYPQEGALRAGHIPGAKNIPWAKAVNPDGTFKSAEELRALYEPLGITKDKDIVVYC
RIAERSSHSWFVLKYLLGYPHVKNYDGSWTEWGNLVGVPIAKGEE
>O32274 2.7.8.40~~~tuaA~~~Putative undecaprenyl-phosphate N-acetylgalactosaminyl 1-phosphate transferase~~~
MSAEKSMNVSREFSVQQIHSFTLSEKTARYLAIKRVMDIWFALIGLAIALPMIAVFSILICLETPGPAIYTQERVGKGGK
PFKLYKLRSMKIDAEKSGAVWAQKQDPRVTRIGAFIRRTRIDELPQLFNVLKGDMSMIGPRPERPVFTEKFQNEIPGFTQ
RLGSGERRLRYDAEGKADI
>O32273 ~~~tuaB~~~Teichuronic acid biosynthesis protein TuaB~~~COG2244
MPSITKQIMSGAKWTSISTMCITIIQIVQFALLGNMMTLTEFGLVGMITTVTVFAQIVLDMGFGAALIQRDDATERQLST
LYWLNIMTGVLLFVLLYVSSPVIAGFYQREELVFLVRILAIMFLIAPIGQQYQYMLQKQLHFNTLSKIEIFSNVLSFGYL
AIAVFMMDAILAYVISQVLLQSSKGILYWAVYRKKWHPAFVFDLRGMKDFFSFGAFQLSSRLVNRLGANIDMILIGRFIG
AEALGIYNLAYQIVTIPVLKINPIVTRVAFPIFAKNKYENSVIREGFLNMTKMLALVSFPLLIGLVSVSDAFITAVFGEK
WLAAVPILNVLAIVGILRVLMNPNGSVLLAKGRADLAFYWDSGVLLLYGLSLFAAVQTGSLLTVAWVYAIISVVNFLIGR
WLLAYVIKLNLSAYFQSIMKPFLITAAMGIIAFGVSLSTEHFSMQAEMRLAISVAAGALCYLFLLVKAYPQTKSKLLRKG
RLS
>O32272 2.4.-.-~~~tuaC~~~Putative teichuronic acid biosynthesis glycosyltransferase TuaC~~~COG0297
MKILWITSVYPSSMKPGEGVFHETQVQELQKLGLDITVICPRPFHSAPVRMLKKTYRKKDVRPEYEIRKGIPVYRPFYRA
VPGQLKWAQPHRRIASAVLKTMKQRDLYPDLIHAHFAMPSGGAAAVVSESAQIPYVLTLHGSDVNVYPHYSKGAFKAFKR
AVGSASVVLAVSHKLQEEAKKLSGFDSSVLPIGIQLSRFQGNEETKEEIRKRLGLPLDQRLAVYVGRLVREKGIFELSEA
IESLQDSPKAVFVGDGPAKSTLTQKGHIVTGQVPNHQVRDYLLAADLFVLPSYSEGMPTVVIEALALRVPVICTDVGGVS
SLFGKHQHLLIKPKSAQALAEAITRYEHEQIWKPEVADDLYETVQAQFDAGKNAKALHHQYQTVTKTSV
>O32271 1.1.1.22~~~tuaD~~~UDP-glucose 6-dehydrogenase TuaD~~~COG1004
MKKIAVIGTGYVGLVSGTCFAEIGNKVVCCDIDESKIRSLKNGVIPIYEPGLADLVEKNVLDQRLTFTNDIPSAIRASDI
IYIAVGTPMSKTGEADLTYVKAAAKTIGEHLNGYKVIVNKSTVPVGTGKLVQSIVQKASKGRYSFDVVSNPEFLREGSAI
HDTMNMERAVIGSTSHKAAAIIEELHQPFHAPVIKTNLESAEMIKYAANAFLATKISFINDIANICERVGADVSKVADGV
GLDSRIGRKFLKAGIGFGGSCFPKDTTALLQIAKSAGYPFKLIEAVIETNEKQRVHIVDKLLTVMGSVKGRTISVLGLAF
KPNTNDVRSAPALDIIPMLQQLGAHVKAYDPIAIPEASAILGEQVEYYTDVYAAMEDTDACLILTDWPEVKEMELVKVKT
LLKQPVIIDGRNLFSLEEMQAAGYIYHSIGRPAVRGTEPSDKYFPGLPLEELAKDLGSVNL
>O32270 ~~~tuaE~~~Teichuronic acid biosynthesis protein TuaE~~~COG3307
MSIKRSAVHTLALLAAAIFGVVLLLGAIHKDIGFMQMAAVLAVLAIGLFLLTLATAFTTKERLFMAVIYILIACTFLNNA
FFAIHLGFFSLFLYRLLLIAAGCLHIFGMVRNRTHIERWHGLQVKGILLFFAFWFIYGLVSLLWAKSVTVGLKYLALLAM
GIFFIYLIVMYVQKMERLMIVYAIWLVMTVFLMIIGFYNHITHHHLASSTLYSGPEYKQHYPTSVFFNQNDFATFLSISF
FFYITMMKNIKNGYIKAIGLVLSLCALYLIFATGSRASLLGIFAGIAVYIFIVLPPVLKRMAIWLSAAGIALFAVLFASK
IYSKFWELFLAPQTLHSFHDRLPSNVARANLLKNAWHFFLDSYGFGVGAGNVSYYLEHYAVYDTDNVAEVHNWLVEILAN
FGLFIMLGYLSVYAYLIWVLYKFYERKLENQSKLITEGLITAMVSFLVSSISPSSVSNLFFHWVFMALVIAAVNVLRRSR
QMPEPMYR
>O32269 ~~~tuaF~~~Teichuronic acid biosynthesis protein TuaF~~~COG3206
MNDILIRIARRIKKNIIWIIAVPIILGAAGYILPSQIADQKSYTAEDTLAVGSYDHPVYNSTEEIPLLLKSDSFLKEALP
DEKDEDVAEIKEKLTINTESKSLLTLSYSDEDKDRTESVLNAISSTFLKNDQKLYAEREAVIRSSIDALEGESVSEDSKV
DKERFLYELKNTQLNLKAASVTDSETVSETAGGGMSPKKKAVLGVMIGLTIAFMFVVIPEFFRESF
>O32268 2.4.-.-~~~tuaG~~~Putative teichuronic acid biosynthesis glycosyltransferase TuaG~~~COG1215
MTNWKPLVSVITPSYNARDYIEDTVHSVLDQSHPHWEMIIVDDCSTDGTRDILQQYEKIDERIHVVYLEENSGAAVARNK
ALERAQGRYVAFLDSDDKWKKDKLEKQLEFMMERSCAFSFTGYSLMAQDGTPLDKFIHAPESLTYDDALKNTIIGCLTVM
IDREQTGQIQMPNIRTRQDLATWLSLLKKGFTAYGMNECLAEYRLVNNSISSNKWKAAKKTWFVYREIERLHFMKATWCF
VQYAKNAVKKRL
>O32267 2.4.-.-~~~tuaH~~~Putative teichuronic acid biosynthesis glycosyltransferase TuaH~~~COG0438
METKEAIIHVIVATAEWGKDQLRYRRHRLAEFLAAQEETKEVIWVCPAPRAQGKEFQEVHSGIRQFAVKDLLQKKMFRFG
RYTDVFYRHKLSPLLDELTPASANGERCCLWYTFPGFPLLSSLYSWDQVIYDCSDLWAAPISGRSNLLSEFRRKVIKSAE
LRIIQRADSITCSSDYLHKEVDKKLTAGREKVHTVENGVEYELFSANKQAPDRSILQGREGIVLGFIGGIKPKLDFKMIK
EAALQKPDWTFLWVGPDATNGDVSFQELLRLPNVIWTGPADPKEVPHYMELIDIGIMPYKQSPYNQAVFPLKLFEFLAAG
KPVVGTNLPSTSKMQKPYVYEYVEGDHPIDFIAACEKVLGQNGDETYKEMRRNIARTQDWNCLFRQIMKYTGIQKHA
>P70786 ~~~ttuB~~~Putative tartrate transporter~~~
MQAQEVRLLYLFREDGVVTNDLEARVLRKITFRIVPFIMLLYFIAFLDRVNIGFAALTMNQDLGFSSTVFGIGAGIFFVG
YFLFEVPSNLILNKVGARIWIARVMITWGIVSGLMAFVQGTTSFYILRFLLGVAEAGFFPGIILYLSFWFPARRRAAVTA
LFMAAAPLSTVLGSPISGALMEMHGLMGLAGWQWMFLIEAAPALILGVVVLFFLTDRPEKAKWLTEEERNWLVKTMNAEQ
AGRGTASHSVMAGLADIRVIALALVYFGTSAGLYTLGIWAPQIIKQFGLSAIEVGFINAVPGIFAVVAMVLWARHSDRTG
ERTWHVVGACLLAAAGLAFAAGATSVFMVLIALTIVNVGISCSKPPLWSMPTMFLSGPAAAAGIATINSIGNLGGFVGPS
MIGWIKDTTGSFTGGLYFVAGLLLISAILTLILARSSPKAVETRTANQH
>Q44470 ~~~ttuB~~~Putative tartrate transporter~~~
MNSDLETRVLRKITLRIVPFIMLLYFVAFLDRVNIGFAALTMNEDLGFSSTVFGIGAGIFFVGYFLFEVPSNLILNKVGA
RIWIARVMITWGIVSGLMAFVQGTTSFYALRFLLGVAEAGFFPGIILYLSFWFPARRRAAVTAIFMAAAPLSTVLGSPIS
GALMEMHGFLGLAGWQWMFLIEAAPAVILGVVVLFYLTDRPEKAKWLSEDERNWLVKTMNAEQAAKGKASHSILAGLADI
RVIALALVYFGTSAGLYTLGIWAPQIIKEFGLSSLQVGFINAVPGIFAVAAMVLWARHSDKTGERTWHVVGACLLAAVGL
AFATGATSVFTVLIALTLVNIGISCSKPPLWSMPTLFLSGPAAAAGIATINSIGNLGGFVGPSMIGWIKDTTGSFAGGLY
FVAGLLVVSAIVTLVLSRTAPENGGKVAPVQHR
>P9WJ61 2.5.1.153~~~~~~Tuberculosinyl adenosine transferase~~~
MNLVSEKEFLDLPLVSVAEIVRCRGPKVSVFPFDGTRRWFHLECNPQYDDYQQAALRQSIRILKMLFEHGIETVISPIFS
DDLLDRGDRYIVQALEGMALLANDEEILSFYKEHEVHVLFYGDYKKRLPSTAQGAAVVKSFDDLTISTSSNTEHRLCFGV
FGNDAAESVAQFSISWNETHGKPPTRREIIEGYYGEYVDKADMFIGFGRFSTFDFPLLSSGKTSLYFTVAPSYYMTETTL
RRILYDHIYLRHFRPKPDYSAMSADQLNVLRNRYRAQPDRVFGVGCVHDGIWFAEG
>P02944 ~~~~~~Tuberculin-active protein~~~
RLLDDTPEVKVLGAVADAIETPKAEPCIDLDVAGEATFAREDDLPDYVLYAEVTFHEICRDGGSESEGKNGSQMRLIADV
GPESATVAK
>Q74P25 ~~~tubR~~~DNA-binding protein TubR~~~
MKLLSNISMSSSEIIDVLCENLNDGIWALRVLYAEGAMNKEKLWDYINQYHKDYQIENEKDYEGKKILPSRYALDIMTAR
LEGAGLISFKAIGRVRIYDVTDLGNVLIKELEKRVEKNN
>Q8KNP2 ~~~tubR~~~DNA-binding transcriptional repressor TubR~~~
MNRDHFYTLNIAEIAERIGNDDCAYQVLMAFINENGEAQMLNKTAVAEMIQLSKPTVFATVNSFYCAGYIDETRVGRSKI
YTLSDLGVEIVECFKQKAMEMRNL
>Q848W2 ~~~tubR~~~DNA-binding protein TubR~~~
MSDYFEEVMRKLTIEDVSILGWLFQNEANAVFKAIKKSSIADELEYSTANFRKTLNKLEAIHFIGTVTGGKEHKLYLTEY
GQQAVQQAIHHGEENEEVEEI
>Q9X315 3.6.5.-~~~tubZ~~~Tubulin-like protein TubZ~~~
MAGNFSEIESQGNISLKFGFLGLGMGGCAIAAECANKETQIKNNKYPYRAILVNTNSQDFNKIEIKNAGNVRKIQLEGYE
QGAARNPQVGEEAFVKHETKIFETVKQEFEDRDFIWITCGLGGGTGTGALLKAIEMLYEHDYNFGLLLTLPRDAEALKVL
ENATSRIRSIAMNQEAFGSIVLIDNAKLYRKFEEENPSALANEYTSYSNKYIADALHEINLVTSSFTPFSDTHFDASEFA
QVINTPGVLSLAKLELKSNQLDTENPLGYLTQLGNALEKGVLYDTEREELESAKKSALSIVTSPLRASRLYNFSFLNQME
NFLKDRTPYVDERPIAPYVNKHTAKKEEDIVKFYSVVAGLPLPKRVSDIIDEITRIKEEREQANSKKSNAVLNKLFAFDD
SVQEEKPKKKKLNFGAEPEAEVADDSQPTKKKLSF
>Q74P24 3.6.5.-~~~tubZ~~~Tubulin-like protein TubZ~~~
MAGNFSEIESQGNISLKFGFLGLGMGGCAIAAECANKETQIKNNKYPYRAILVNTNSQDFNKIEIKNTGNVRKIQLEGYE
QGAARNPQVGEEAFVKHETKIFEAVKQEFEDRDFIWITCGLGGGTGTGALLKAIEMLYEHDYNFGLLLTLPRDAEALKVL
ENATSRIRSIAMNQEAFGSIVLIDNAKLYRKFEEENPSALANEYTSYSNKYIADALHEINLVTSSFTPFSDTHFDASEFA
QVINTPGVLSLAKLELKSNQLDTENPLGYLTQLGNALEKGVLYDTEREELESAKKSALSIVTSPLRAGRLYNFSFLNQME
NFLKERTPYVDERPIAPYVNKHTTKKEEDIVKFYSVVAGLPLPKRVSDIIDEITRIKEEREQANSKKSNAVLNKLFAFDD
SVQEEKPKKKKLNFGAEPEVEVADDSQPAKKKLSF
>Q8KNP3 3.6.5.-~~~tubZ~~~Tubulin-like protein TubZ~~~
MLLNSNELEHIHSTNHSVNDISIRWGVIGAGQKGNKEADLFAGYKFSNGTTCYPTLAVNFAESDMMHLQNIIKEDRIHFD
GLKGAARTPSVVTDLFDPETNPNANGYLDKLAQELGRKFTNEEGEVIVDQFLICLGAGGGVGTGWGSLVLQLIREQFFPC
PVSMLISLPSGDPDEINNALVLLSEIDEFMREQDRLFGNSDIKPLANVIVNDNTQMQRIIESQKGTKDLKNRYVNWKEVA
NDNVVSTLHEINIIPENYGSDNVTYDPSDLIKLLSIPGRFLTIGKARIAKFDLHSLENSIKRSLDEGFFSAEHQFETATM
YGGFVLRPSNADFFKDVNTENRIRNTLGEYKRLDEIAGKFGDPIWDNEYAVCYTIFAGMTMPKRYISLAREGKELAEKQE
QLRAEAQRKQDEEKVDISFATNRVQKNTFNPYNKNQGFGGASRFSGGKNSAFKRQTSEATSTQNQQEEENIISTLKTSNP
FKKR
>Q93KD6 ~~~tupA~~~Tungstate-binding protein TupA~~~
MKRLLSIITAVMMLALALTGCAAKQSPEGEVEKTQAKGSIILATTTSTSDSGLLDYLLPEFTKDTGIEAKVVAVGTGQAL
QMGKDGEADVLLVHSKAAEEEFVAAGDGLERKDVMYNDFILVGPANDPLKLKQELPNDIVGALKKISEQKFKFISRGDDS
GTHKKELALWTEVGITPEGDYYVSAGRGMGDVLKMADEMQAYTIADRGTYLSMKADLGLDIIVEKDTNLFNQYGVIPVNP
DKNENINAEGAKAFEEWILSEKAQSLIGEYGKEKYGAPLFTPNAAK
>Q0P886 ~~~tupB~~~Tungstate uptake system permease protein TupB~~~COG4662
MLKLLILKRIFLDYIFDGFKQALFLLFNADESVISAIKTTLLSSSISIVLALLIGFPLGFILGFFEFKLKRFIKLIVDTS
LSFPTVAVGLILYALISSRGPLGEFGLLFTIKALILGQFILALPIVIALFSNLIENMNKKHFLLIKSFHLSPLKLVLTMI
YELRFALISVVALAYGRIVAEVGVAMIVGGNIKYDTRTITTAISLETNKGEFASGIALALVLILIAFCLNFITHKLKRT
>Q93KD5 ~~~tupB~~~Tungstate uptake system permease protein TupB~~~
MEYIAEGFRAAMELLVSFDPQVYTIIFLSVFVSSTATAIAAAVSIPLGIFAGISNFRLKRLFSKVLYSLMSVPSVIVGLV
VAIGLSRRGPLGFMQLLYTPTAMIIAQALLVFPLCLGLTYSLSKNRGSEIERIGKTLGAGKLQVIILIIRELKAELFINV
VTTFSRAISEVGAVMIVGGNIKGHTRVITTSIAMLNSMGDYPMAIALGLVLLMISFAINAVIYSLQEE
>Q0P887 7.3.2.6~~~tupC~~~Tungstate uptake system ATP-binding protein TupC~~~COG3839
MIEISNLFFNYQNKEVLKIKNLKLDTSKISILMGANGSGKSTFLRILKFLEGDFSKNISYFGNFKPNNKQKREIYLLFPE
PILLNRSVRANFLFTLKTYGIKEDIEERIKESLMFLNLDESLLSKHPNELSSGQSQKIAFAIALSVRAKYYLLDEPSAFL
DKNTTLLFKKTILKMHENFNTGFLIASHDKHFLDSLAQKKLYLHSGEILEFENTNVFELENQGVKFCNFIDFSNCKKYKD
FKKPPSKIAIDPYKISFFNSKNIPKNNYDFILEKCYIIALRSRKSDVFIRVSCMDKILEFALEKQEFLRFDLKLYEELSL
YFYEDAICFLN
>Q93KD4 7.3.2.6~~~tupC~~~Tungstate uptake system ATP-binding protein TupC~~~
MQITVSNLKKSYGGSTVLDVESLTFESGKITGIIGPNGAGKTTLLNIISGIDMDFEGDVEYSGSDYSEVKRDITMVFQKG
GLLKRSVFENIAYPLKLRGTDKNEIQQTVVELMRHLGIEELSSKKAHKLSGGETQRVALARALAIKPRALLLDEPTASID
PEYMETIEKCIVDYNRKSKATILIITHSMDQARRLCDNIVFLESGRVGEADGFF
>D3RPC0 ~~~tusA~~~Sulfur carrier protein TusA~~~COG0425
MADFDQELDASGLNCPLPILRAKKTLNAMSSGQVLHVIATDPGSVKDFDAFAKQTGNELMESKEEGGKFHFLIKKS
>P0A892 ~~~tusA~~~Sulfur carrier protein TusA~~~COG0425
MTDLFSSPDHTLDALGLRCPEPVMMVRKTVRNMQPGETLLIIADDPATTRDIPGFCTFMEHELVAKETDGLPYRYLIRKG
G
>P0A890 ~~~tusA~~~Sulfur carrier protein TusA~~~COG0425
MTDLFSSPDHTLDALGLRCPEPVMMVRKTVRNMQPGETLLIIADDPATTRDIPGFCTFMEHELVAKETDGLPYRYLIRKG
G
>P45530 ~~~tusB~~~Protein TusB~~~COG2168
MLHTLHRSPWLTDFAALLRLLSEGDELLLLQDGVTAAVDGNRYLESLRNAPIKVYALNEDLIARGLTGQISNDIILIDYT
DFVRLTVKHPSQMAW
>P45531 ~~~tusC~~~Protein TusC~~~COG2923
MKRIAFVFSTAPHGTAAGREGLDALLATSALTDDLAVFFIADGVFQLLPGQKPDAVLARDYIATFKLLGLYDIEQCWVCA
ASLRERGLDPQTPFVVEATPLEADALRRELANYDVILRF
>P45532 2.8.1.-~~~tusD~~~Sulfurtransferase TusD~~~COG1553
MRFAIVVTGPAYGTQQASSAFQFAQALIADGHELSSVFFYREGVYNANQLTSPASDEFDLVRAWQQLNAQHGVALNICVA
AALRRGVVDETEAGRLGLASSNLQQGFTLSGLGALAEASLTCDRVVQF
>P0AB18 2.8.1.-~~~tusE~~~Sulfurtransferase TusE~~~COG2920
MLIFEGKEIETDTEGYLKESSQWSEPLAVVIAENEGISLSPEHWEVVRFVRDFYLEFNTSPAIRMLVKAMANKFGEEKGN
SRYLYRLFPKGPAKQATKIAGLPKPVKCI
>P16525 ~~~tus~~~DNA replication terminus site-binding protein~~~
MARYDLVDRLNTTFRQMEQELAIFAAHLEQHKLLVARVFSLPEVKKEDEHNPLNRIEVKQHLGNDAQSLALRHFRHLFIQ
QQSENRSSKAAVRLPGVLCYQVDNLSQAALVSHIQHINKLKTTFEHIVTVESELPTAARFEWVHRHLPGLITLNAYRTLT
VLHDPATLRFGWANKHIIKNLHRDEVLAQLEKSLKSPRSVAPWTREEWQRKLEREYQDIAALPQNAKLKIKRPVKVQPIA
RVWYKGDQKQVQHACPTPLIALINRDNGAGVPDVGELLNYDADNVQHRYKPQAQPLRLIIPRLHLYVAD
>P54373 ~~~txpA~~~Toxin TxpA~~~
MSTYESLMVMIGFANLIGGIMTWVISLLTLLFMLRKKDTHPIYITVKEKCLHEDPPIKG
>P09095 ~~~tycA~~~Tyrocidine synthase 1~~~
MLANQANLIDNKRELEQHALVPYAQGKSIHQLFEEQAEAFPDRVAIVFENRRLSYQELNRKANQLARALLEKGVQTDSIV
GVMMEKSIENVIAILAVLKAGGAYVPIDIEYPRDRIQYILQDSQTKIVLTQKSVSQLVHDVGYSGEVVVLDEEQLDARET
ANLHQPSKPTDLAYVIYTSGTTGKPKGTMLEHKGIANLQSFFQNSFGVTEQDRIGLFASMSFDASVWEMFMALLSGASLY
ILSKQTIHDFAAFEHYLSENELTIITLPPTYLTHLTPERITSLRIMITAGSASSAPLVNKWKDKLRYINAYGPTETSICA
TIWEAPSNQLSVQSVPIGKPIQNTHIYIVNEDLQLLPTGSEGELCIGGVGLARGYWNRPDLTAEKFVDNPFVPGEKMYRT
GDLAKWLTDGTIEFLGRIDHQVKIRGHRIELGEIESVLLAHEHITEAVVIAREDQHAGQYLCAYYISQQEATPAQLRDYA
AQKLPAYMLPSYFVKLDKMPLTPNDKIDRKALPEPDLTANQSQAAYHPPRTETESILVSIWQNVLGIEKIGIRDNFYSLG
GDSIQAIQVVARLHSYQLKLETKDLLNYPTIEQVALFVKSTTRKSDQGIIAGNVPLTPIQKWFFGKNFTNTGHWNQSSVL
YRPEGFDPKVIQSVMDKIIEHHDALRMVYQHENGNVVQHNRGLGGQLYDFFSYNLTAQPDVQQAIEAETQRLHSSMNLQE
GPLVKVALFQTLHGDHLFLAIHHLVVDGISWRILFEDLATGYAQALAGQAISLPEKTDSFQSWSQWLQEYANEADLLSEI
PYWESLESQAKNVSLPKDYEVTDCKQKSVRNMRIRLHPEETEQLLKHANQAYQTEINDLLLAALGLAFAEWSKLAQIVIH
LEGHGREDIIEQANVARTVGWFTSQYPVLLDLKQTAPLSDYIKLTKENMRKIPRKGIGYDILKHVTLPENRGSLSFRVQP
EVTFNYLGQFDADMRTELFTRSPYSGGNTLGADGKNNLSPESEVYTALNITGLIEGGELVLTFSYSSEQYREESIQQLSQ
SYQKHLLAIIAHCTEKKEVERTPSDFSVKGLQMEEMDDIFELLANTLR
>O30408 ~~~tycB~~~Tyrocidine synthase 2~~~
MSVFSKEQVQDMYALTPMQEGMLFHALLDQEHNSHLVQMSISLQGDLDVGLFTDSLHVLVERYDVFRTLFLYEKLKQPLQ
VVLKQRPIPIEFYDLSACDESEKQLRYTQYKRADQERTFHLAKDPLMRVALFQMSQHDYQVIWSFHHILMDGWCFSIIFD
DLLAIYLSLQNKTALSLEPVQPYSRFINWLEKQNKQAALNYWSDYLEAYEQKTTLPKKEAAFAKAFQPTQYRFSLNRTLT
KQLGTIASQNQVTLSTVIQTIWGVLLQKYNAAHDVLFGSVVSGRPTDIVGIDKMVGLFINTIPFRVQAKAGQTFSELLQA
VHKRTLQSQPYEHVPLYDIQTQSVLKQELIDHLLVIENYPLVEALQKKALNQQIGFTITAVEMFEPTNYDLTVMVMPKEE
LAFRFDYNAALFDEQVVQKLAGHLQQIADCVANNSGVELCQIPLLTEAETSQLLAKRTETAADYPAATMHELFSRQAEKT
PEQVAVVFADQHLTYRELDEKSNQLARFLRKKGIGTGSLVGTLLDRSLDMIVGILGVLKAGGAFVPIDPELPAERIAYML
THSRVPLVVTQNHLRAKVTTPTETIDINTAVIGEESRAPIESLNQPHDLFYIIYTSGTTGQPKGVMLEHRNMANLMHFTF
DQTNIAFHEKVLQYTTCSFDVCYQEIFSTLLSGGQLYLITNELRRHVEKLFAFIQEKQISILSLPVSFLKFIFNEQDYAQ
SFPRCVKHIITAGEQLVVTHELQKYLRQHRVFLHNHYGPSETHVVTTCTMDPGQAIPELPPIGKPISNTGIYILDEGLQL
KPEGIVGELYISGANVGRGYLHQPELTAEKFLDNPYQPGERMYRTGDLALWLPDGQLEFLGRIDHQVKIRGHRIELGEIE
SRLLNHPAIKEAVVIDRADETGGKFLCAYVVLQKALSDEEMRAYLAQALPEYMIPSFFVTLERIPVTPNGKTDRRALPKP
EGSAKTKADYVAPTTELEQKLVAIWEQILGVSPIGIQDHFFTLGGHSLKAIQLISRIQKECQADVPLRVLFEQPTIQALA
AYVEGGEESAYLAIPQAEPQAYYPVSSAQKRMLILNQLDPHSTVYNLPVAMILEGTLDKARLEHAISNLVARHESLRTSF
HTINGEPVSRIHEQGHLPIVYLETAEEQVNEVILGFMQPFDLVTAPLCRVGLVKLAENRHVLIIDMHHIISDGVSSQLIL
NDFSRLYQNKALPEQRIHYKDFAVWEKAWTQTTDYQKQEKYWLDRFAGEIPVLNLPMDYPRPAVQSFEGERYLFRTEKQL
LESLQDVAQKTGTTLYMVLLAAYHVLLSKYSGQDDVMIGTVTAGRVHPDTESMTGMFVNTLAMRNQSAPTKTFRQFLLEV
KDNTLAAFEHGQYPFEELVEKLAIQRNRSRNPLFDTLFILQNMDADLIELDGLTVTPYVPEGEVAKFDLSLEASENQAGL
SFCFEFCTKLFARETIERMSLHYLQILQAVSANTEQELAQIEMLTAHEKQELLVHFNDTAALYPAESTLSQLFEDQAQKT
PEQTAVVFGDKRLTYRELNERANQLAHTLRAKGVQAEQSVGIMAQRSLEMAIGIIAILKAGGAYVPIDPDYPNERIAYML
EDCRRLVLTQQQLAEKMTANVECLYLDEEGSYSPQTENIEPIHTAADLAYIIYTSGTTGRPKGVMVEHRGIVNSVTWNRD
EFALSVRDSGTLSLSFAFDAFALTFFTLIVSGSTVVLMPDHEAKDPIALRNLIAAWECSYVVFVPSMFQAILECSTPADI
RSIQAVMLGGEKLSPKLVQLCKAMHPQMSVMNAYGPTESSVMATYLRDTQPDQPITIGRPIANTAIYIVDQHHQLLPVGV
VGEICIGGHGLARGYWKKPELTAEKFVANPAVPGERMYKTGDLGRWLHDGTIDFIGRVDDQIKVRGYRIEVGEIEAVLLA
YDQTNEAIVVAYQDDRGDSYLAAYVTGKTAIEESELRAHLLRELPAYMVPTYLIQLDAFPLTPNGKVDRKALPKPEGKPA
TGAAYVAPATEVEAKLVAIWENALGISGVGVLDHFFELGGHSLKAMTVVAQVHREFQIDLLLKQFFAAPTIRDLARLIEH
SEQAAGAAIQPAEPQAYYPVSSAQQRMYLLHQLEGAGISYNTPGIIMLEGKLDREQLANALQALVDRHDILRTSFEMVGD
ELVQKIHDRVAVNMEYVTAEEQQIDDLFHAFVRPFDLSVPPLLRMSLVKLADERHLLLYDMHHIAADAASITILFDELAE
LYQGRELPEMRIQYKDFAVWQKALHESDAFKQQEAYWLSTFAGNITAVDVPTDFPRPAVKSFAGGQVTLSMDQELLSALH
ELAAHTNTTLFMVLLAAYNVLLAKYAGQDDIIVGTPISGRSRAELAPVVGMFVHTLAIRNKPTAEKTFKQFLQEVKQNAL
DAFDHQDYPFESLVEKLGIPRDPGRNPLFDTMFILQNDELHAKTLDQLVYRPYESDSALDVAKFDLSFHLTERETDLFLR
LEYCTKLFKQQTVERMAHHFLQILRAVTANPENELQEIEMLTAAEKQMLLVAFNDTHREYRADQTIQQLFEELAEKMPEH
TALVFEEKRMSFRELNERANQLAAVLREKGVGPAQIVALLVERSAEMVIATLATLKAGGAFLPVDPDYPEERIRYMLEDS
QAKLVVTHAHLLHKVSSQSEVVDVDDPGSYATQTDNLPCANTPSDLAYIIYTSGTTGKPKGVMLEHKGVANLQAVFAHHL
GVTPQDRAGHFASISFDASVWDMFGPLLSGATLYVLSRDVINDFQRFAEYVRDNAITFLTLPPTYAIYLEPEQVPSLRTL
ITAGSASSVALVDKWKEKVTYVNGYGPTESTVCATLWKAKPDEPVETITIGKPIQNTKLYIVDDQLQLKAPGQMGELCIS
GLSLARGYWNRPELTAEKFVDNPFVPGTKMYRTGDLARWLPDGTIEYLGRIDHQVKIRGHRVELGEVESVLLRYDTVKEA
AAITHEDDRGQAYLCAYYVAEGEATPAQLRAYMENELPNYMVPAFFIQLEKMPLTPNDKIDRKALPKPNQEENRTEQYAA
PQTELEQLLAGIWADVLGIKQVGTQDNFFELGGDSIKAIQVSTRLNASGWTLAMKELFQYPTIEEAALRVIPNSRESEQG
VVEGEIALTPIQKWFFANNFTDRHHWNQAVMLFREDGFDEGLVRQAFQQIVEHHDALRMVYKQEDGAIKQINRGLTDERF
RFYSYDLKNHANSEARILELSDQIQSSIDLEHGPLVHVALFATKDGDHLLVAIHHLVVDGVSWRILFEDFSSAYSQALHQ
QEIVLPKKTDSFKDWAAQLQKYADSDELLREVAYWHNLETTTTTAALPTDFVTADRKQKHTRTLSFALTVPQTENLLRHV
HHAYHTEMNDLLLTALGLAVKDWAHTNGVVINLEGHGREDIQNEMNVTRTIGWFTSQYPVVLDMEKAEDLPYQIKQTKEN
LRRIPKKGIGYEILRTLTTSQLQPPLAFTLRPEISFNYLGQFESDGKTGGFTFSPLGTGQLFSPESERVFLLDISAMIED
GELRISVGYSRLQYEEKTIASLADSYRKHLLGIIEHCMAKEEGEYTPSDLGDEELSMEELENILEWI
>O30409 ~~~tycC~~~Tyrocidine synthase 3~~~
MKKQENIAKIYPLTPLQEGMLFHAVTDTGSSAYCLQMSATIEGDFHLPLFEKSLNKLVENYEVLRTAFVYQNMQRPRQVV
FKERKVTVPCENIAHLPSAEQDAYIQAYTKQHHAFDLTKDNLMKAAIFQTAENKYRLVWAFHHIIVDGWTLGVLLHKLLT
YYAALRKGEPIPREATKPYSEYIKWLDKQNKDEALAYWQNYLAGYDHQAAFPKKKLGTEASRYEHVEAMFTIAPEKTQQL
IQIANQNQATMSSVFQALWGILASTYKNADDVVFGSVVSGRPPQIQGIESMVGLFINTIPTRVQTNKQQTFSELLQTVQK
QALASATYDFAPLYEIQSTTVLKQELIDHLVTFENYPDHSMKHLEESLGFQFTVESGDEQTSYDLNVVVALAPSNELYVK
LSYNAAVYESSFVNRIEGHLRTVIDQVIGNPHVHLHEIGIITEEEKQQLLVAYNDTAAEYPRDKTIFELIAEQASRTPAK
AAVVCGEDTLTYQELMERSAQLANALREKGIASGSIVSIMAEHSLELIVAIMAVLRSGAAYLPIDPEYPQDRIQYLLDDS
QTTLLLTQSHLQPNIRFAGSVLYLDDRSLYEGGSTSFAPESKPDDLAYMIYTSGSTGNPKGAMITHQGLVNYIWWANKVY
VQGEAVDFPLYSSISFDLTVTSIFTPLLSGNTIHVYRGADKVQVILDIIKDNKVGIIKLTPTHLKLIEHIDGKASSIRRF
IVGGENLPTKLAKQIYDHFGENVQIFNEYGPTETVVGCMIYLYDPQTTTQESVPIGVPADNVQLYLLDASMQPVPVGSLG
EMYIAGDGVAKGYFNRPELTKEKFIDNPFRPGTKMYRTGDLAKWLPDGNMEYAGRMDYQVKIRGHRIEMGEIETRLTQHE
AVKEAVVIVEKDESGQNVLYAYLVSERELTVAELREFLGRTLPSYMIPSFFIRLAEIPLTANGKVERKKLPKPAGAVVTG
TAYAAPQNEIEAKLAEIWQQVLGISQVGIHDDFFDLGGHSLKAMTVVFQVSKALEVELPVKALFEHPTVAELARFLSRSE
KTEYTAIQPVAAQEFYPVSSAQKRMYILQQFEGNGISYNISGAILLEGKLDYARFASAVQQLAERHEALRTSFHRIDGEP
VQKVHEEVEVPLFMLEAPEDQAEKIMREFVRPFDLGVAPLMRTGLLKLGKDRHLFLLDMHHIISDGVSSQILLREFAELY
QGADLQPLSLQYKDFAAWQNELFQTEAYKKQEQHWLNTFADEIPLLNLPTDYPRPSVQSFAGDLVLFAAGKELLERLQQV
ASETGTTLYMILLAAYNVLLSKYTGQEDIIVGTPVAGRSHADVENIMGIFVNTLALRNQPASSKTFAQFLQEVKQNALAA
YDHQDYPFEELVEKLAIQRDISRNPLFDTLFSLENANQQSLAIAELTASPYELFNKISKFDLALNASESPADIQFQLTFA
TKLFKKETVERMARHYLEILRWISEQPTASLADIDMMTEAEKRTLLLNVNDTFVERTAATALHQLVEEQAARTPDEVAVV
YEEYALTYRELNARANQLARLLRSHGTGPDTLIGIMVDRSPGMVVGMLAVLKAGGAYTPIDPSYPPERIQYMLSDSQAPI
LLTQRHLQELAAYQGEIIDVDEEAIYTGADTNLDNVAGKDDLAYVIYTSGSTGNPKGVMISHQAICNHMLWMRETFPLTT
EDAVLQKTPFSFDASVWEFYLPLITGGQLVLAKPDGHRDIAYMTRLIRDEKITTLQMVPSLLDLVMTDPGWSACTSLQRV
FCGGEALTPALVSRFYETQQAQLINLYGPTETTIDATYWPCPRQQEYSAIPIGKPIDNVRLYVVNASNQLQPVGVAGELC
IAGDGLARGYWQREELTKASFVDNPFEPGGTMYRTGDMVRYLPDGHIEYLGRIDHQVKIRGHRIELGEIEATLLQHEAVK
AVVVMARQDGKGQNSLYAYVVAEQDIQTAELRTYLSATLPAYMVPSAFVFLEQLPLSANGKVDRKALPQPEDAAASAAVY
VAPRNEWEAKLAAIWESVLGVEPIGVHDHFFELGGHSLKAMHVISLLQRSFQVDVPLKVLFESPTIAGLAPLVAAARKGT
YTAIPPVEKQEYYPVSAAQKRMFILQQMEGAGISYNMPGFMYLDGKLDTERLQQALKSLVQRHESLRTSFHSVQGETVQR
VHDDVDLAISFGEATEAETRQIAEQFIQPFDLGTAPLLRAGLIKLAPERHLFMLDLHHIVVDGVSIGLLIEEFAQLYHGE
ELPALRIQYKDFAKWQQDWFQTEEFAEQEAYWLNTFTGEIPVLNLPTDYPRPSVKSFAGDRFVFGSGTALPKQLHQLAQE
TGTTLYMVLLAAYNVLLSKYSRQEDIIVGAPTAGRSHAETESIVGMFVNTLALRNEPAGGKTFRDFLAEVKINTLGAFEH
QDYPLDELVDKLDMQRDLSRNPLFDTVFILQNMEQKPFEMEQLTITPYSAEVKQAKFDLSLEAYEENAEIIFSLDYSTKL
FSRETIEKIATHFIQILRAVIAEPEMPLSEITMLTEAEKQRLLVDFNGAHKDFPQNKTLQALFEEQAEKSPQATAVEISG
QPLSYQELNERANQLAATLRERGVQPDQPVGIMANRSVEMVVGILAILKAGGAYVPIDPEYPEERVAYMLTDCQARLVLT
QKHLGAKLGSSVTAECLYLDDESNYGVHRSNLQPINTASDLAYIIYTSGTTGKPKGVMVEHRGIVNNVLWKKAEYQMKVG
DRSLLSLSFAFDAFVLSFFTPVLSGATVVLAEDEEAKDPVSLKKLIAASRCTLMTGVPSLFQAILECSTPADIRPLQTVT
LGGEKITAQLVEKCKQLNPDLVIVNEYGPTESSVVATWQRLAGPDAAITIGRPIANTSLYIVNQYHQLQPIGVVGEICIG
GRGLARGYWNKPALTEEKFVSHPFAAGERMYKTGDLGKWLPDGTIEYIGRIDEQVKVRGYRIEIGEIESALLAAEKLTAA
VVVVYEDQLGQSALAAYFTADEQLDVTKLWSHLSKRLPSYMIPAHFVQLDQLPLTPNGKVDKKALPKPEGKPVTEAQYVA
PTNAVESKLAEIWERVLGVSGIGILDNFFQIGGHSLKAMAVAAQVHREYQVELPLKVLFAQPTIKALAQYVATSGKETYV
PIEPAPLQEYYPVSSAQKRMYVLRQFADTGTVYNMPSALYIEGDLDRKRFEAAIHGLVERHESLRTSFHTVNGEPVQRVH
EHVELNVQYAEVTEAQVEPTVESFVQAFDLTKAPLLRVGLFKLAAKRHLFLLDMHHIISDGVSAGIIMEEFSKLYRGEEL
PALSVHYKDFAVWQSELFQSDVYTEHENYWLNAFSGDIPVLNLPADFSRPLTQSFEGDCVSFQADKALLDDLHKLAQESQ
STLFMVLLAAYNVLLAKYSGQEDIVVGTPIAGRSHADIENVLGMFVNTLALRNYPVETKHFQAFLEEVKQNTLQAYAHQD
YPFEALVEKLDIQRDLSRNPLFDTMFILQNLDQKAYELDGLKLEAYPAQAGNAKFDLTLEAHEDETGIHFALVYSTKLFQ
RESIERMAGHFLQVLRQVVADQATALREISLLSEEERRIVTVDFNNTFAAYPRDLTIQELFEQQAAKTPEHAAVVMDGQM
LTYRELNEKANQLAHVLRQNGVGKESIVGLLADRSLEMITGIMGILKAGGAYLGLDPEHPSERLAYMLEDGGVKVVLVQK
HLLPLVGEGLMPIVLEEESLRPEDCGNPAIVNGASDLAYVMYTSGSTGKPKGVMVEHRNVTRLVMHTNYVQVRESDRMIQ
TGAIGFDAMTFEIFGALLHGASLYLVSKDVLLDAEKLGDFLRTNQITTMWLTSPLFNQLSQDNPAMFDSLRALIVGGEAL
SPKHINRVKSALPDLEIWNGYGPTENTTFSTCYLIEQHFEEQIPIGKPIANSTAYIVDGNNQPQPIGVPGELCVGGDGVA
RGYVNKPELTAEKFVPNPFAPGETMYRTGDLARWLPDGTIEYLGRIDQQVKIRGYRIELGEIETVLSQQAQVKEAVVAVI
EEANGQKALCAYFVPEQAVDAAELREAMSKQLPGYMVPAYYVQMEKLPLTANGKVDRRALPQPSGERTTGSAFVAAQNDT
EAKLQQIWQEVLGIPAIGIHDNFFEIGGHSLKAMNVITQVHKTFQVELPLKALFATPTIHELAAHIAESAFEQFETIQPV
EPAAFYPVSFAQKRMYILHQFEGSGISYNVPSVLVLEGKLDYDRFAAAIQSLVKRHESLRTSFHSVNGEPLQRVHPDVEL
PVRLLEATEDQSESLIQELIQPFDLEIAPLFRVNLIKLGAERHLFFMDMHHIISDGVSLAVIVEEIASLYAGKQLSDLRI
QYKDFAVWQTKLAQSDRFQKQEDFWTRTFAGEIPLLNLPHDYPRPSVQSFDGDTVALGTGHHLLEQLRKLAAETGTTLFM
VLLAAYHVLLSKYAGQEEIVVGTPIAGRSHADVERIVGMFVNTLALKNTAAGSLSFRAFLEDVKQNALHAFEHQDYPFEH
LVEKLQVRRDLSRNPLFDTMFSLGLAESAEGEVADLKVSPYPVNGHIAKFDLSLDAMEKQDGLLVQFSYCTKLFAKETVD
RLAAHYVQLLQTITADPDIELARISVLSKAETEHMLHSFLATKTAYPTDKTFQKLFEEQVEKTPNEIAVLFGNEQLTYQE
LNAKANQLARVLRRKGVKPESTVGILVDRSLYMVIGMLAVLKAGGTFVPIDPDYPLERQAFMLEDSEAKLLLTLQKMNSQ
VAFPYETFYLDTETVDQEETGNLEHVAQPENVAYIIYTSGTTGKPKGVVIEHRSYANVAFAWKDEYHLDSFPVRLLQMAS
FAFDVSTGDFARALLTGGQLVICPNGVKMDPASLYETIRRHEITIFEATPALIMPLMHYVYENELDMSQMKLLILGADSC
PAEDFKTLLARFGQKMRIINSYGVTEACIDTSYYEETDVTAIRSGTVPIGKPLPNMTMYVVDAHLNLQPVGVVGELCIGG
AGVARGYLNRPELTEEKFVPNPFAPGERLYRTGDLAKWRADGNVEFLGRNDHQVKIRGVRIELGEIETQLRKLDGITEAV
VVAREDRGQEKELCAYVVADHKLDTAELRANLLKELPQAMIPAYFVTLDALPLTANGKVDRRSLPAPDVTMLRTTEYVAP
RSVWEARLAQVWEQVLNVPQVGALDDFFALGGHSLRAMRVLSSMHNEYQVDIPLRILFEKPTIQELAAFIEETAKGNVFS
IEPVQKQAYYPVSSAQKRMYILDQFEGVGISYNMPSTMLIEGKLERTRVEAAFQRLIARHESLRTSFAVVNGEPVQNIHE
DVPFALAYSEVTEQEARELVSSLVQPFDLEVAPLIRVSLLKIGEDRYVLFTDMHHSISDGVSSGILLAEWVQLYQGDVLP
ELRIQYKDFAVWQQEFSQSAAFHKQEAYWLQTFADDIPVLNLPTDFTRPSTQSFAGDQCTIGAGKALTEGLHQLAQATGT
TLYMVLLAAYNVLLAKYAGQEDIIVGTPITGRSHADLEPIVGMFVNTLAMRNKPQREKTFSEFLQEVKQNALDAYGHQDY
PFEELVEKLAIARDLSRNPLFDTVFTFQNSTEEVMTLPECTLAPFMTDETGQHAKFDLTFSATEEREEMTIGVEYSTSLF
TRETMERFSRHFLTIAASIVQNPHIRLGEIDMLLPEEKQQILAGFNDTAVSYALDKTLHQLFEEQVDKTPDQAALLFSEQ
SLTYSELNERANRLARVLRAKGVGPDRLVAIMAERSPEMVIGILGILKAGGAYVPVDPGYPQERIQYLLEDSNAALLLSQ
AHLLPLLAQVSSELPECLDLNAELDAGLSGSNLPAVNQPTDLAYVIYTSGTTGKPKGVMIPHQGIVNCLQWRRDEYGFGP
SDKALQVFSFAFDGFVASLFAPLLGGATCVLPQEAAAKDPVALKKLMAATEVTHYYGVPSLFQAILDCSTTTDFNQLRCV
TLGGEKLPVQLVQKTKEKHPAIEINNEYGPTENSVVTTISRSIEAGQAITIGRPLANVQVYIVDEQHHLQPIGVVGELCI
GGAGLARGYLNKPELTAEKFVANPFRPGERMYKTGDLVKWRTDGTIEYIGRADEQVKVRGYRIEIGEIESAVLAYQGIDQ
AVVVARDDDATAGSYLCAYFVAATAVSVSGLRSHLAKELPAYMIPSYFVELDQLPLSANGKVDRKALPKPQQSDATTREY
VAPRNATEQQLAAIWQEVLGVEPIGITDQFFELGGHSLKATLLIAKVYEYMQIELPLNLIFQYPTIEKVADFITHKRFES
RYGTAILLNQETARNVFCFTPIGAQSVYYQKLAAEIQGVSLYSFDFIQDDNRMEQYIAAITAIDPSGPYTLMGYSSGGNL
AFEVAKELEERGYGVTDIILFDSYWKDKAIERTVAETENDIAQLFAEIGENTEMFNMTQEDFQLYAANEFVKQSFVRKTV
SYVMFHNNLVNTGMTTAAIHLIQSELEADEEAPVAAKWNESAWANATQRLLTYSGHGIHSRMLAGDYASQNASILQNILQ
ELFILK
>P69968 ~~~tyeA~~~Protein TyeA~~~
MAYDLSEFMGDIVALVDKRWAGIHDIEHLANAFSLPTPEIKVRFYQDLKRMFRLFPLGVFSDEEQRQNLLQMCQNAIDMA
IESEEEELSELD
>Q9XC67 2.4.1.318~~~tylCV~~~Demethyllactenocin mycarosyltransferase~~~COG1819
MAGLRPGAGVPPGTPWPISPGKHCVRSAISRRARLPYHAPGTPLGRFRRAPDEGSCRMAHIAFFILPAAGHVNPTLGVAE
ELAARGHRVTYALPEDMADRAVRVGARAVTYPLDRERFRADMVPKEESDEYTDEGEFLKVLEWLLDTTADTLPLLESAFA
EDRPDVVANDPSTFWTGLLLAGKWDIPVIRSTPSYASNEHWALHPPFEPGAAQVDPALIELTARAEKLLKEHGTTSDPVA
FAATVQSGPGLFYMPRYFQYAGETFDDRHHFVGPCAPRASFHGTWQRPEDGRPLVMVSLGTIYNERPGIFRACVEAFRDR
PWNILLVLGGGLGAGDLGPLPENVLVRDFVPLGDVLPHTDLLVNHGGTSTAMEALAHGVPIVAMPEMPEPRATARRIAEL
DLGDWLLPGEVTAEKLSGIAQRVLTDDRIRKGLDRMRGEIRRAGGPAVAADVIEGLLSPAA
>Q9ZHQ4 2.1.1.102~~~tylE~~~Demethylmacrocin O-methyltransferase~~~COG3510
MAVQKEATLVRQIIRAAGGHAADVRELVAEHGPEAVTAVLVDEIVSRAPHPVNDVPVLVELAVRSGDALVPRRLAVAQGA
PVRRAAPDDDGFVAMRVEYELDELVRELFGPCRERAAGTRGTTLFPYATSGTGHIDTYFLAAQQATATVLAGCTSAKPDL
NELTSRYLTPKWGSLHWFTPHYDRHFREYRNEEVRVLEIGIGGYQHPEWGGGSLRMWKHFFHRGLIYGLDIEDKSHAEEQ
RITTVVGDQNDPGCLTELAARYGPFDIVIDDGSHINEHVRTSFHALFPHVRPGGLYVIEDLWTAYWPGFGGDSDPGKSDL
TSLGLVKSLVDSLQHQELPEDSGRSPGYADRHVVGLHVYHNLAFIEKGVNSEGGIPGWIPRDFDALVAASSGGAA
>Q9S4D5 2.1.1.101~~~tylF~~~Macrocin O-methyltransferase~~~COG4122
MAPSPDHARDLYIELLKKVVSNVIYEDPTHVAGMITDASFDRTSRESGEDYPTVAHTMIGLKRLDNLHRCLADVVEDGVP
GDFIETGVWRGGACIFARGLLNAYGQADRTVWVADSFQGFPELTGSDHPLDVEIDLHQYNEAVDLPTSEETVRENFARYG
LLDDNVRFLAGWFKDTMPAAPVKQLAVMRLDGDSYGATMDVLDSLYERLSPGGYVIVDDYCIPACREAVHDFRDRLGIRD
TIHRIDRQGAYWRHSG
>Q9ZHQ1 1.14.15.34~~~tylH1~~~20-oxo-5-O-mycaminosyltylactone 23-monooxygenase~~~COG2124
MSSSGDARPSQKGILLPAARANDTDEAAGRRSIAWPVARTCPFSPPEQYAALRAEEPIARAELWDGAPVWLISRQDHVRA
LLADPRVSIHPAKLPRLSPSDGEAEASRSLLTLDPPDHGALRGHFIPEFGLRRVRELRPSVEQIVTGLLDDLTARGDEAD
LLADFALPMATQVICRLLDIPYEDRDYFQERTEQATRPAAGEEALEALLELRDYLDRLISGKTGRESGDGMLGSMVAQAR
GGGLSHADVLDNAVLLLAAGHETTASMVTMSVLVLLQHPTAWRELTVNPGLLPGAVDELLRYLSIADGLRRSATADIEID
GHTIRAGDGLVFLLAAANRDEAVFSEPEAFDIHRSARRHVAFGYGPHQCLGQNLARMELEVALGAVLERLPALRPTTDVA
GLRLKSDSAVFGVYELPVAW
>P95748 2.1.1.235~~~tylM1~~~dTDP-3-amino-3,6-dideoxy-alpha-D-glucopyranose N,N-dimethyltransferase~~~COG2226
MAHSSATAGPQADYSGEIAELYDLVHQGKGKDYHREAADLAALVRRHSPKAASLLDVACGTGMHLRHLADSFGTVEGLEL
SADMLAIARRRNPDAVLHHGDMRDFSLGRRFSAVTCMFSSIGHLAGQAELDAALERFAAHVLPDGVVVVEPWWFPENFTP
GYVAAGTVEAGGTTVTRVSHSSREGEATRIEVHYLVAGPDRGITHHEESHRITLFTREQYERAFTAAGLSVEFMPGGPSG
RGLFTGLPGAKGETR
>P95747 2.4.1.316~~~tylMII~~~Tylactone mycaminosyltransferase~~~COG1819
MRRALDDRRRGPHGPEGKPPMRVLLTCIAHNTHYYNLVPVAWALRAAGHEVRVAAQPALTDTITASGLTAVPVGGNESVL
EFVTEIGGDPGPYQRGMDFAETCGEPLSYEHALGQQTAMSALCFAPFNCDSTIDDMVALARSWRPDLVLWEPFTYAGPIA
AHACGAAHARLLWGPDVILNARAQFRRLAAGQPEERREDPVAEWLGWTLERHGLTAERETVEELIGGQWTLDPTAESLRL
PAAGRVVPFRFVPYNGRSVLPDWLLRKPGRPRVCFTLGVSARETYGRDAVPFHELLAGLGDLDAEIVATLDPGQLSGAGE
VPRNVRAVDFVPMDALLPTCSAVVHHGGAGTCFTATLNGLPQIVVAALWDAPLKGAQLAEAGAGVSIAPEKLDAATLRAG
VVRALEDEDMRRSAGLLRAEMLAEPTPAGLVPQLERLTALHRNGRSRSAPER
>O70023 2.4.1.317~~~tylN~~~O-mycaminosyltylonolide 6-deoxyallosyltransferase~~~COG1819
MRIALLTMGSRGDVQPFVALGTGLRARGHEVVLGAPEALRPLVEQAGLEYRATPGDPDGFFTMPEVVETLRRGPAMRDLM
KALPPAPEEYDQEVLDRIERAGEGVDLVVHAPLTVTTALGEPSTPWLSVNWWPNTSTWTFPAVESGQRRMGPLTPLYNRL
THWRAEREDWGWRRAEVNEFRGRRGLPPFGKSSPLRRLGHPRPHLYPFSPSVLPKPRDWPGQCHVTGYWFWDQPGWRPSP
ELEDFLADGEPPVLLTLGSTWPVHRQEEMVEYAVAAARGARRRLLLVGGPEGALPGDALRVPSADYSWLMPRTAAVVHHG
GFGTTADAVRAGVPQVLVPVFADHPFWAARLRRMGTAARPVPLARMNREALAASVRTAVTDPAMAVRARRLGEAVAAERG
VENACVLIEEWAETRTTAHTPG
>P07650 2.4.2.4~~~deoA~~~Thymidine phosphorylase~~~COG0213
MFLAQEIIRKKRDGHALSDEEIRFFINGIRDNTISEGQIAALAMTIFFHDMTMPERVSLTMAMRDSGTVLDWKSLHLNGP
IVDKHSTGGVGDVTSLMLGPMVAACGGYIPMISGRGLGHTGGTLDKLESIPGFDIFPDDNRFREIIKDVGVAIIGQTSSL
APADKRFYATRDITATVDSIPLITASILAKKLAEGLDALVMDVKVGSGAFMPTYELSEALAEAIVGVANGAGVRTTALLT
DMNQVLASSAGNAVEVREAVQFLTGEYRNPRLFDVTMALCVEMLISGKLAKDDAEARAKLQAVLDNGKAAEVFGRMVAAQ
KGPTDFVENYAKYLPTAMLTKAVYADTEGFVSEMDTRALGMAVVAMGGGRRQASDTIDYSVGFTDMARLGDQVDGQRPLA
VIHAKDENNWQEAAKAVKAAIKLADKAPESTPTVYRRISE
>P9WFS1 2.4.2.4~~~deoA~~~Thymidine phosphorylase~~~COG0213
MTDFAFDAPTVIRTKRDGGRLSDAAIDWVVKAYTDGRVADEQMSALLMAIVWRGMDRGEIARWTAAMLASGARLDFTDLP
LATVDKHSTGGVGDKITLPLVPVVAACGGAVPQASGRGLGHTGGTLDKLESITGFTANLSNQRVREQLCDVGAAIFAAGQ
LAPADAKLYALRDITGTVESLPLIASSIMSKKLAEGAGALVLDVKVGSGAFMRSPVQARELAHTMVELGAAHGVPTRALL
TEMNCPLGRTVGNALEVAEALEVLAGGGPPDVVELTLRLAGEMLELAGIHGRDPAQTLRDGTAMDRFRRLVAAQGGDLSK
PLPIGSHSETVTAGASGTMGDIDAMAVGLAAWRLGAGRSRPGARVQHGAGVRIHRRPGEPVVVGEPLFTLYTNAPERFGA
ARAELAGGWSIRDSPPQVRPLIVDRIV
>Q7CP66 2.4.2.4~~~deoA~~~Thymidine phosphorylase~~~
MFLAQEIIRKKRDGHALSDEEIRFFINGIRDNTISEGQIAALAMTIFFHDMTMPERVSLTMAMRDSGTVLDWKSLNLNGP
IVDKHSTGGVGDVTSLMLGPMVAACGGYVPMISGRGLGHTGGTLDKLEAIPGFDIFPDDNRFREIIQDVGVAIIGQTSSL
APADKRFYATRDITATVDSIPLITGSILAKKLAEGLDALVMDVKVGSGAFMPTYELSEALAEAIVGVANGAGVRTTALLT
DMNQVLASSAGNAVEVREAVQFLTGEYRNPRLFDVTMALCVEMLISGQLAKDDAEARAKLQAVLDNGKAAEVFGRMVAAQ
KGPSDFVENYDKYLPTAMLSKAVYADTEGFISAMDTRALGMAVVSMGGGRRQASDTIDYSVGFTDMARLGDSIDGQRPLA
VIHAKDEASWQEAAKAVKAAIILDDKAPASTPSVYRRITE
>P07023 ~~~tyrA~~~T-protein~~~COG0287
MVAELTALRDQIDEVDKALLNLLAKRLELVAEVGEVKSRFGLPIYVPEREASMLASRRAEAEALGVPPDLIEDVLRRVMR
ESYSSENDKGFKTLCPSLRPVVIVGGGGQMGRLFEKMLTLSGYQVRILEQHDWDRAADIVADAGMVIVSVPIHVTEQVIG
KLPPLPKDCILVDLASVKNGPLQAMLVAHDGPVLGLHPMFGPDSGSLAKQVVVWCDGRKPEAYQWFLEQIQVWGARLHRI
SAVEHDQNMAFIQALRHFATFAYGLHLAEENVQLEQLLALSSPIYRLELAMVGRLFAQDPQLYADIIMSSERNLALIKRY
YKRFGEAIELLEQGDKQAFIDSFRKVEHWFGDYAQRFQSESRVLLRQANDNRQ
>P43902 ~~~tyrA~~~T-protein~~~COG0287
MSFMEALKDLRSEIDSLDRELIQLFAKRLELVSQVGKVKHQHGLPIYAPEREIAMLQARRLEAEKAGISADLIEDVLRRF
MRESYANENQFGFKTINSDIHKIVIVGGYGKLGGLFARYLRASGYPISILDREDWAVAESILANADVVIVSVPINLTLET
IERLKPYLTENMLLADLTSVKREPLAKMLEVHTGAVLGLHPMFGADIASMAKQVVVRCDGRFPERYEWLLEQIQIWGAKI
YQTNATEHDHNMTYIQALRHFSTFANGLHLSKQPINLANLLALSSPIYRLELAMIGRLFAQDAELYADIIMDKSENLAVI
ETLKQTYDEALTFFENNDRQGFIDAFHKVRDWFGDYSEQFLKESRQLLQQANDLKQG
>O69721 1.3.1.12~~~tyrA~~~Prephenate dehydrogenase~~~COG0287
MTRTVAAPPVCVLGLGLIGGSIMRAAAAAGREVFGYNRSVEGAHGARSDGFDAITDLNQTLTRAAATEALIVLAVPMPAL
PGMLAHIRKSAPGCPLTDVTSVKCAVLDEVTAAGLQARYVGGHPMTGTAHSGWTAGHGGLFNRAPWVVSVDDHVDPTVWS
MVMTLALDCGAMVVPAKSDEHDAAAAAVSHLPHLLAEALAVTAAEVPLAFALAAGSFRDATRVAATAPDLVRAMCEANTG
QLAPAADRIIDLLSRARDSLQSHGSIADLADAGHAARTRYDSFPRSDIVTVVIGADKWREQLAAAGRAGGVITSALPSLD
SPQ
>P04693 2.6.1.57~~~tyrB~~~Aromatic-amino-acid aminotransferase~~~COG1448
MFQKVDAYAGDPILTLMERFKEDPRSDKVNLSIGLYYNEDGIIPQLQAVAEAEARLNAQPHGASLYLPMEGLNCYRHAIA
PLLFGADHPVLKQQRVATIQTLGGSGALKVGADFLKRYFPESGVWVSDPTWENHVAIFAGAGFEVSTYPWYDEATNGVRF
NDLLATLKTLPARSIVLLHPCCHNPTGADLTNDQWDAVIEILKARELIPFLDIAYQGFGAGMEEDAYAIRAIASAGLPAL
VSNSFSKIFSLYGERVGGLSVMCEDAEAAGRVLGQLKATVRRNYSSPPNFGAQVVAAVLNDEALKASWLAEVEEMRTRIL
AMRQELVKVLSTEMPERNFDYLLNQRGMFSYTGLSAAQVDRLREEFGVYLIASGRMCVAGLNTANVQRVAKAFAAVM
>O85746 2.6.1.5~~~tyrB~~~Tyrosine aminotransferase~~~
MFQKVDAYAGDPILSLMERFKEDPRSDKVNLSIGLYYNDDGIIPQLQAVAEAEARLNAEPHGASLYLPMEGFSGYRQAIA
PLLFGAEHTALKQNRIASIQTVGGSGALKVGADFLKRYFPESHVWVSDPTWENHIAIFEGAGFEVSTYPWFDKATNGVRF
ENLLAMLQTLPARDIVLLHPCCHNPTGADLTPAQWDRVVEVLKARQLIPFLDIAYQGFGGGLEEDAYAIRAIASAGMPML
VSNSFSKIFSLYGERVGGLSVVCEDSETAGRVLGQLKATVRRNYSSPPSFGAQVVATVLNDAALKATWQAEVDAMRAHIL
TMRQALVDALQQVAPGSKVDYLLKQRGMFSYTGFSAAQVDRLRDEFGVYLIASGRMRVAGLNSRNVQQVAKAFVAVM
>P95468 2.6.1.57~~~tyrB~~~Aromatic-amino-acid aminotransferase~~~
MLGNLKPQAPDKILALMGEFRADPRQGKIDLGVGVYKDATGHTPIMRAVHAAEQRMLETETTKTYAGLSGEPEFQKAMGE
LILGDGLKSETTATLATVGGTGALRQALELARMANPDLRVFVSDPTWPNHVSIMNFMGLPVQTYRYFDAETRGVDFEGMK
ADLAAAKKGDMVLLHGCCHNPTGANLTLDQWAEIASILEKTGALPLIDLAYQGFGDGLEEDAAGTRLIASRIPEVLIAAS
CSKNFGIYRERTGCLLALCADAATRELAQGAMAFLNRQTYSFPPFHGAKIVSTVLTTPELRADWMAELEAVRSGMLRLRE
QLAGELRDLSGSDRFGFVAEHRGMFSRLGATPEQVKRIKEEFGIYMVGDSRINIAGLNDNTIPILARAIIEVGV
>Q04983 1.3.1.43~~~tyrC~~~Cyclohexadienyl dehydrogenase~~~COG0287
MTVFKHIAIIGLGLIGSSAARATKAYCPDVTVSLYDKSEFVRDRARALNLGDNVTDDIQDAVREADLVLLCVPVRAMGIV
AAAMAPALKKDVIICDTGSVKVSVIKTLQDNLPNHIIVPSHPLAGTENNGPDAGFAELFQDHPVILTPDAHTPAQAIAYI
ADYWEEIGGRINLMSAEHHDHVLALTSHLPHVIAYQLIGMVSGYEKKSRTPIMRYSAGSFRDATRVAASEPRLWQDIMLE
NAPALLPVLDHFIADLKKLRTAIASQDGDYLLEHFKESQKARLALKTDHDIRP
>P0DTQ4 4.1.1.25~~~tyrDC~~~L-tyrosine decarboxylase~~~
MKNEKLAKGEMNLNALFIGDKAENGQLYKDLLIDLVDEHLGWRQNYMPQDMPVISSQERTSESYEKTVNHMKDVLNEISS
RMRTHSVPWHTAGRYWGHMNSETLMPSLLAYNFAMLWNGNNVAYESSPATSQMEEEVGHEFAHLMSYKNGWGHIVADGSL
ANLEGLWYARNIKSLPFAMKEVKPELVAGKSDWELLNMPTKEIMDLLESAEDEIDEIKAHSARSGKHLQAIGKWLVPQTK
HYSWLKAADIIGIGLDQVIPVPVDHNYRMDINELEKIVRGLAEEQIPVLGVVGVVGSTEEGAVDSIDKIIALRDELMKDG
IYYYVHVDAAYGGYGRAIFLDEDNNFIPYEDLQDVHEEYGVFKEKKEHISREVYDAYKAIELAESVTIDPHKMGYIPYSA
GGIVIQDIRMRDVISYFATYVFEKGADIPALLGAYILEGSKAGATAASVWAAHHVLPLNVAGYGKLIGASIEGSHHFYNF
LNDLTFKVGDKEIEVHTLTHPDFNMVDYVFKEKGNDDLVAMNKLNHDVYDYASYVKGNIYNNEFITSHTDFAIPDYGNSP
LKFVNSLGFSDEEWNRAGKVTVLRAAVMTPYMNDKEEFDVYAPKIQAALQEKLEQIYDVK
>Q838D6 4.1.1.25~~~tdc~~~L-tyrosine decarboxylase~~~COG0076
MKNEKLAKGEMNLNALFIGDKAENGQLYKDLLIDLVDEHLGWRQNYMPQDMPVISSQERTSESYEKTVNHMKDVLNEISS
RMRTHSVPWHTAGRYWGHMNSETLMPSLLAYNFAMLWNGNNVAYESSPATSQMEEEVGHEFAHLMSYKNGWGHIVADGSL
ANLEGLWYARNIKSLPFAMKEVKPELVAGKSDWELLNMPTKEIMDLLESAEDEIDEIKAHSARSGKHLQAIGKWLVPQTK
HYSWLKAADIIGIGLDQVIPVPVDHNYRMDINELEKIVRGLAEEQIPVLGVVGVVGSTEEGAVDSIDKIIALRDELMKDG
IYYYVHVDAAYGGYGRAIFLDEDNNFIPYEDLQDVHEEYGVFKEKKEHISREVYDAYKAIELAESVTIDPHKMGYIPYSA
GGIVIQDIRMRDVISYFATYVFEKGADIPALLGAYILEGSKAGATAASVWAAHHVLPLNVAGYGKLIGASIEGSHHFYNF
LNDLTFKVGDKEIEVHTLTHPDFNMVDYVFKEKGNDDLVAMNKLNHDVYDYASYVKGNIYNNEFITSHTDFAIPDYGNSP
LKFVNSLGFSDEEWNRAGKVTVLRAAVMTPYMNDKEEFDVYAPKIQAALQEKLEQIYDVK
>A0A481NV25 4.1.1.25~~~tdc~~~L-tyrosine decarboxylase~~~
MKDMDIKAVFIGDKAENGPVYKMLLNKMVDEHLGWRENYIPSDMPAISEGDKLTPDYLATRDHMIEVLDEVSQRLRAGSI
PWHSAGRYWGQMNAETLMPALLAYNYAMLWNPNNVALESSMATSQMEAEVGQDFASLFNMADGWGHIAADGSIANLEGLW
YARCIKSIPLAVKEVLPEKVKNMSEWALLNLSVEEILEMTESFTDEEMDEVKAASSRSGKNIQKLGKWLVPQTKHYSWMK
ALDICGVGLDQMVAIPVQEDYRMDINALEKTIRELADQKIPILGVVAVVGTTEEGQVDSVDKIIQLREKLKDEGIYFYLH
VDAAYGGYARSLFLNEAGEFVPYASLAEFFEEHHVFHHYVTIDKEVYEGFRAISEADSVTIDPHKMGYVPYAAGGIVIKH
KNMRNIISYFAPYVFEKSVKAPDMLGAYILEGSKAGATAAAVWTAHRVLPLNVTGYGQLIGASIEAAQRFREFLEQLHFT
VKGKTIEVYPLNHPDFNMVNWVFKVQDCTDLNAINELNEKMFDRSSYMDGDVYGERFITSHTTFTQEDYGDSPIRFIERM
GLSKEEWQKEQQITLLRAAIMTPYLNDDRIFNFYTKEIAKAMEKKLNEIIK
>J7GQ11 4.1.1.25~~~tdc~~~L-tyrosine decarboxylase~~~
MEKSNRSLKDLDLNALFIGDKAENGQLYKDLLNKLVDEHLGWRKNYIPSDPNMIGPEDQNSPAFKKTVGHMKTVLDQLSE
RIRTESVPWHSAGRYWGHMNSETLMPALLAYNYAMLWNGNNVAYESSPATSQMEEEVGQEFARLMGYDYGWGHIVADGSL
ANLEGLWYARNIKSLPFAMKEVNPELVAGKSDWELLNMPTKEIMDLLENAGSQIDEVKKRSARSGKNLQRLGKWLVPQTK
HYSWMKAADIIGIGLDQVVPVPIDSNYRMDIQALESIIRKYAAEKTPILGVVGVAGSTEEGAVDGIDKIVALRQKLQKEG
IYFYLHVDAAYGGYARALFLDEDDQFIPYKNLQKVHAENHVFTEDKEYIKPEVYAAYKAFDQAESITIDPHKMGYVPYSA
GGIVIQDIRMRDTISYFATYVFEKGADIPALLGAYILEGSKAGATAASVWAAHHTLPLNVTGYGKLEGASIEGAHRYYDF
LKNLKFEVAGKRISVHPLISPDFNMVDYVLKEDGNDDLIEMNRLNHAFYEQASYVKGSLYGKEYIVSHTDFAIPDYGDSP
LAFVESLGFSEVEWRHAGKVTIIRASVMTPYMNQRENFDYFAPRIKKAIQADLEKVYASVNQKENV
>P06845 1.14.18.1~~~melC2~~~Tyrosinase~~~COG2304
MTVRKNQATLTADEKRRFVAAVLELKRSGRYDEFVTTHNAFIIGDTDAGERTGHRSPSFLPWHRRYLLEFERALQSVDAS
VALPYWDWSADRTARASLWAPDFLGGTGRSLDGRVMDGPFAASAGNWPINVRVDGRAYLRRSLGTAVRELPTRAEVESVL
GMATYDTAPWNSASDGFRNHLEGWRGVNLHNRVHVWVGGQMATGMSPNDPVFWLHHAYVDKLWAEWQRRHPGSGYLPAAG
TPDVVDLNDRMKPWNDTSPADLLDHTAHYTFDTD
>P0AAD4 ~~~tyrP~~~Tyrosine-specific transport system~~~COG0814
MKNRTLGSVFIVAGTTIGAGMLAMPLAAAGVGFSVTLILLIGLWALMCYTALLLLEVYQHVPADTGLGTLAKRYLGRYGQ
WLTGFSMMFLMYALTAAYISGAGELLASSISDWTGISMSATAGVLLFTFVAGGVVCVGTSLVDLFNRFLFSAKIIFLVVM
LVLLLPHIHKVNLLTLPLQQGLALSAIPVIFTSFGFHGSVPSIVSYMDGNIRKLRWVFIIGSAIPLVAYIFWQVATLGSI
DSTTFMGLLANHAGLNGLLQALREMVASPHVELAVHLFADLALATSFLGVALGLFDYLADLFQRSNTVGGRLQTGAITFL
PPLAFALFYPRGFVMALGYAGVALAVLALIIPSLLTWQSRKHNPQAGYRVKGGRPALVVVFLCGIAVIGVQFLIAAGLLP
EVG
>P07604 ~~~tyrR~~~HTH-type transcriptional regulatory protein TyrR~~~COG3283
MRLEVFCEDRLGLTRELLDLLVLRGIDLRGIEIDPIGRIYLNFAELEFESFSSLMAEIRRIAGVTDVRTVPWMPSEREHL
ALSALLEALPEPVLSVDMKSKVDMANPASCQLFGQKLDRLRNHTAAQLINGFNFLRWLESEPQDSHNEHVVINGQNFLME
ITPVYLQDENDQHVLTGAVVMLRSTIRMGRQLQNVAAQDVSAFSQIVAVSPKMKHVVEQAQKLAMLSAPLLITGDTGTGK
DLFAYACHQASPRAGKPYLALNCASIPEDAVESELFGHAPEGKKGFFEQANGGSVLLDEIGEMSPRMQAKLLRFLNDGTF
RRVGEDHEVHVDVRVICATQKNLVELVQKGMFREDLYYRLNVLTLNLPPLRDCPQDIMPLTELFVARFADEQGVPRPKLA
ADLNTVLTRYAWPGNVRQLKNAIYRALTQLDGYELRPQDILLPDYDAATVAVGEDAMEGSLDEITSRFERSVLTQLYRNY
PSTRKLAKRLGVSHTAIANKLREYGLSQKKNEE
>Q9ZIB7 ~~~tyrR~~~HTH-type transcriptional regulatory protein TyrR~~~
MRLEVFCQDRIGLARELLDLLVARSIDLRGIEVAASGRIYLNFSTLEFEQFSNLMAEIRRTPGVTDVRTVPYMPSEREHR
VLSALLVAMPEPVFSVDLRTKVELANPAAQNLFNLDENKIRNFTADHLINGFNFARWLESERVQAQAQHVVIEGRDFLME
AHPIYLSEDNDQADQLVGAMVMLKSTARMGRQLQNLVVTDETEFDHIVAVTPRMRQVVEQARKLAMHDAPLLIIGDTGTG
KDMLARACHLRSARGKMPFLALNCASLPDDVAESELFGHAAGAYPNALEGKKGFFEQANGGSVLLDEIGEMSPTMQTKLL
RFLNDGTFRRVGEEHEVHVNVRVICATQKNLFELVQRGEFREDLFYRLNVLTLNLPPLRERVQDIMPLTEIFVARFADEQ
GIPRPRLSSQLNAFLMRYNWPGNVRQLKNALYRALTQLEGHELRPQDIVLPEQALDVSLGEEAMEGTLDQITSRFERSIL
TRLYLSYPSTRKLAKRLGVSHTAIANKLREYGLGQKRGDNE
>P44694 ~~~tyrR~~~HTH-type transcriptional regulatory protein TyrR~~~COG3283
MTISKFNPQKPFECFIVQSEAMKSAVENAKRFAMFDAPLLIQGETGSGKDLLAKACHYQSLRRDKKFIAVNCAGLPDEDA
ESEMFGRKVGDSETIGFFEYANKGTVLLDGIAELSLSLQAKLLRFLTDGSFRRVGEEKEHYANVRVICTSQVPLHLLVEQ
GKVRADLFHRLNVLTINVPALRDRMADIEPLAQGFLQEISEELKIAKPTFDKDFLLYLQKYDWKGNVRELYNTLYRACSL
VQDNHLTIESLNLALPQSAVISLDEFENKTLDEIIGFYEAQVLKLFYAEYPSTRKLAQRLGVSHTAIANKLKQYGIGK
>P0CI79 2.1.1.45~~~thyA1~~~Thymidylate synthase 1~~~COG0207
MTQFDKQYNSIIKDIINNGISDEEFDVRTKWDSDGTPAHTLSVISKQMRFDNSEVPILTTKKVAWKTAIKELLWIWQLKS
NDVNDLNMMGVHIWDQWKQEDGTIGHAYGFQLGKKNRSLNGEKVDQVDYLLHQLKNNPSSRRHITMLWNPDELDAMALTP
CVYETQWYVKHGKLHLEVRARSNDMALGNPFNVFQYNVLQRMIAQVTGYELGEYIFNIGDCHVYTRHIDNLKIQMEREQF
EAPELWINPEVKDFYDFTIDDFKLINYKHGDKLLFEVAV
>P11044 2.1.1.45~~~thyA2~~~Thymidylate synthase 2~~~COG0207
MKQYKDFCRHVLEHGEKKGDRTGTGTISTFGYQMRFNLREGFPMLTTKKLHFKSIAHELLWFLKGDTNVRYLQENGVRIW
NEWADENGELGPVYGSQWRSWRGADGETIDQISRLIEDIKTNPNSRRLIVSAWNVGEIDKMALPPCHCLFQFYVSDGKLS
CQLYQRSADVFLGVPFNIASYALLTMIIAHVTGLEPGEFIHTFGDVHIYQNHIEQVNLQLERDVRPLPQLRFARKVDSIF
NFAFEDFIIEDYDPHPHIKGAVSV
>B7I4U0 2.1.1.45~~~thyA~~~Thymidylate synthase~~~
MRAYLDLLQHILDNGGDKGDRTGTGTRSVFGHQMRFDLSKGFPLLTTKKVHFRSIVIELLWFLKGDTNVKYLQDNKVTIW
DEWATAEQTARFGRPEHELGPVYGHQWRNFGATKNADGTYNQDGFDQIKWLINEIKTNPNSRRLIVSGWNPNEAGQVALP
PCHTLFQFFVQDNKLSCQLYQRSADVFLGVPFNIASYALLTHMIAQVCGLGVGDFVWTGGDTHLYANHFEQAKLQLTREP
LPLCQLKLNPEVKDIFDFKFEDIEIVGYESHPAIKAPVAV
>P67042 2.1.1.45~~~thyA~~~Thymidylate synthase~~~COG0207
MRTYLDLLQHVLDHGVDRDDRTGTGTRSVFGYQMRFDLEEGFPVLTTKKLHLRSIIHELLWFLKGDTNIAYLKENGVTIW
DEWADENGDLGPVYGYQWRSWPAPDGRHIDQIANLLKMLHTNPQSRRLIVSAWNPALVDEMALPPCHCLFQFYVANGRLS
CQLYQRSADIFLGVPFNIASYALLTMMIAQVTGLKPGEFIHTLGDAHIYSNHFEQARLQLTRTPKKLPVMHINPDVKDLF
AFRFEDFRLDGYEADPTIKAPIAV
>Q8NS38 2.1.1.45~~~thyA~~~Thymidylate synthase~~~COG0207
MTVPTPYEDLLRKIAEEGSHKDDRTGTGTTSLFGQQIRFDLNEGFPLLTTKKVHFHSVVGELLWFLQGDSNVKWLQDNNI
RIWNEWADEDGELGPVYGVQWRSWPTPDGRHIDQISGALETLRNNPDSRRNIVSAWNVSELENMALPPCHLLFQLYVADG
KLSCQLYQRSADMFLGVPFNIASYALLTHMFAQQAGLEVGEFIWTGGDCHIYDNHKEQVAEQLSREARPYPTLELNKAAS
MFEYSFDDITVSGYDPHPLIRGKVAV
>P0A886 2.1.1.45~~~thyA~~~Thymidylate synthase~~~COG0207
MKQYLELMQKVLDEGTQKNDRTGTGTLSIFGHQMRFNLQDGFPLVTTKRCHLRSIIHELLWFLQGDTNIAYLHENNVTIW
DEWADENGDLGPVYGKQWRAWPTPDGRHIDQITTVLNQLKNDPDSRRIIVSAWNVGELDKMALAPCHAFFQFYVADGKLS
CQLYQRSCDVFLGLPFNIASYALLVHMMAQQCDLEVGDFVWTGGDTHLYSNHMDQTHLQLSREPRPLPKLIIKRKPESIF
DYRFEDFEIEGYDPHPGIKAPVAI
>P0A884 2.1.1.45~~~thyA~~~Thymidylate synthase~~~COG0207
MKQYLELMQKVLDEGTQKNDRTGTGTLSIFGHQMRFNLQDGFPLVTTKRCHLRSIIHELLWFLQGDTNIAYLHENNVTIW
DEWADENGDLGPVYGKQWRAWPTPDGRHIDQITTVLNQLKNDPDSRRIIVSAWNVGELDKMALAPCHAFFQFYVADGKLS
CQLYQRSCDVFLGLPFNIASYALLVHMMAQQCDLEVGDFVWTGGDTHLYSNHMDQTHLQLSREPRPLPKLIIKRKPESIF
DYRFEDFEIEGYDPHPGIKAPVAI
>Q834R3 2.1.1.45~~~thyA~~~Thymidylate synthase~~~COG0207
MEEAYLALGKKILEEGHFKEDRTGTGTYSLFGYQMRFDLAKGFPLLTTKRVPFGLIKSELLWFLKGDTNIRYLLERNNHI
WDEWAFERYVKSADYQGPDMTDFGHRVLQDPAFAEQYKEEHQKFCDAILNDAEFAEKYGELGNIYGAQWRHWETKDGSFI
DQLANVIEMIKTNPDSRRLIVSAWNPEDVPSMALPPCHTMFQFYVNEGKLSCQLYQRSADVFLGVPFNIASYALLTHLIA
HETGLEVGEFVHTLGDAHLYQNHVEQMQEQLSREVRSFPTLVLNPDKASVFDFDMEDIKVEGYDPHPTIKAPIAV
>P00469 2.1.1.45~~~thyA~~~Thymidylate synthase~~~COG0207
MLEQPYLDLAKKVLDEGHFKPDRTHTGTYSIFGHQMRFDLSKGFPLLTTKKVPFGLIKSELLWFLHGDTNIRFLLQHRNH
IWDEWAFEKWVKSDEYHGPDMTDFGHRSQKDPEFAAVYHEEMAKFDDRVLHDDAFAAKYGDLGLVYGSQWRAWHTSKGDT
IDQLGDVIEQIKTHPYSRRLIVSAWNPEDVPTMALPPCHTLYQFYVNDGKLSLQLYQRSADIFLGVPFNIASYALLTHLV
AHECGLEVGEFIHTFGDAHLYVNHLDQIKEQLSRTPRPAPTLQLNPDKHDIFDFDMKDIKLLNYDPYPAIKAPVAV
>P9WFR9 2.1.1.45~~~thyA~~~Thymidylate synthase ThyA~~~COG0207
MSIVTPYEDLLRFVLETGTPKSDRTGTGTRSLFGQQMRYDLSAGFPLLTTKKVHFKSVAYELLWFLRGDSNIGWLHEHGV
TIWDEWASDTGELGPIYGVQWRSWPAPSGEHIDQISAALDLLRTDPDSRRIIVSAWNVGEIERMALPPCHAFFQFYVADG
RLSCQLYQRSADLFLGVPFNIASYALLTHMMAAQAGLSVGEFIWTGGDCHIYDNHVEQVRLQLSREPRPYPKLLLADRDS
IFEYTYEDIVVKNYDPHPAIKAPVAV
>P48464 2.1.1.45~~~thyA~~~Thymidylate synthase~~~
MKQYLELMQKVLDEGTQKNDRTGTGTLSIFGHQMRFNLQDGFPLVTTKRCHLRSIIHELLWFLQGDTNIAYLHENNVTIW
DEWADENGDLGPVYGKQWRAWPTPDGRHIDQITTVLNQLKNDPDSRRIIVSAWNVGELDKMALAPCHAFFQFYVADGKLS
CQLYQRSCDVFLGLPFNIASYALLVHMMAQQCDLEVGDFVWTGGDTHLYSNHMDQTHLQLSREPRPLPKLIIKRKPESIF
DYRFEDFEIEGYDPHPGVKAPVAI
>P67046 2.1.1.45~~~thyA~~~Thymidylate synthase~~~
MLNSFDAAYHSLCEEVLEIGNTRNDRTNTGTISKFGHQLRFDLSKGFPLLTTKKVSFKLVATELLWFIKGDTNIQYLLKY
NNNIWNEWAFENYIKSDEYNGPDMTDFGHRALSDPEFNEQYKEQMKQFKQRILEDDTFAKQFGDLGNVYGKQWRDWVDKD
GNHFDQLKTVIEQIKHNPDSRRHIVSAWNPTEIDTMALPPCHTMFQFYVQDGKLSCQLYQRSADIFLGVPFNIASYALLT
HLIAKECGLEVGEFVHTFGDAHIYSNHIDAIQTQLARESFNPPTLKINSDKSIFDINYEDLEIVDYESHPAIKAPIAV
>P67049 2.1.1.45~~~thyA~~~Thymidylate synthase~~~COG0207
MTKADTIFKENIERILKEGVFSEQARPKYKDGTVANSKYVTGAFSEYDLSKGEFPITTLRPIAIKSAIKEVLWIYQDQSN
SLEVLNDKYNVHYWNDWEVGDTGTIGERYGAVVKKHDIINKLLKQLETNPWNRRNIISLWDYQAFEETDGLLPCAFQTMF
DVRRVDGEIYLDATLTQRSNDMLVAHHINAMQYVALQMMIAKHFGWKVGKFFYFINNLHIYDNQFEQAQELLRREPSNCQ
PRLVLNVPDGTNFFDIKAEDFELVDYDPVKPQLKFDLAI
>Q46820 ~~~uacF~~~Putative oxidoreductase UacF~~~COG0493
MNKFIAAEAAECIGCHACEIACAVAHNQENWPLSHSDFRPRIHVVGKGQAANPVACHHCNNAPCVTACPVNALTFQSDSV
QLDEQKCIGCKRCAIACPFGVVEMVDTIAQKCDLCNQRSSGTQACIEVCPTQALRLMDDKGLQQIKVARQRKTAAGKASS
DAQPSRSAALLPVNSRKGADKISASERKTHFGEIYCGLDPQQATYESDRCVYCAEKANCNWHCPLHNAIPDYIRLVQEGK
IIEAAELCHQTSSLPEICGRVCPQDRLCEGACTLKDHSGAVSIGNLERYITDTALAMGWRPDVSKVVPRSEKVAVIGAGP
AGLGCADILARAGVQVDVFDRHPEIGGMLTFGIPPFKLDKTVLSQRREIFTAMGIDFHLNCEIGRDITFSDLTSEYDAVF
IGVGTYGMMRADLPHEDAPGVIQALPFLTAHTRQLMGLPESEEYPLTDVEGKRVVVLGGGDTTMDCLRTSIRLNAASVTC
AYRRDEVSMPGSRKEVVNAREEGVEFQFNVQPQYIACDEDGRLTAVGLIRTAMGEPGPDGRRRPRPVAGSEFELPADVLI
MAFGFQAHAMPWLQGSGIKLDKWGLIQTGDVGYLPTQTHLKKVFAGGDAVHGADLVVTAMAAGRQAARDMLTLFDTKAS
>Q46821 ~~~uacT~~~Uric acid transporter UacT~~~COG2233
MSAIDSQLPSSSGQDRPTDEVDRILSPGKLIILGLQHVLVMYAGAVAVPLMIGDRLGLSKEAIAMLISSDLFCCGIVTLL
QCIGIGRFMGIRLPVIMSVTFAAVTPMIAIGMNPDIGLLGIFGATIAAGFITTLLAPLIGRLMPLFPPLVTGVVITSIGL
SIIQVGIDWAAGGKGNPQYGNPVYLGISFAVLIFILLITRYAKGFMSNVAVLLGIVFGFLLSWMMNEVNLSGLHDASWFA
IVTPMSFGMPIFDPVSILTMTAVLIIVFIESMGMFLALGEIVGRKLSSHDIIRGLRVDGVGTMIGGTFNSFPHTSFSQNV
GLVSVTRVHSRWVCISSGIILILFGMVPKMAVLVASIPQFVLGGAGLVMFGMVLATGIRILSRCNYTTNRYNLYIVAISL
GVGMTPTLSHDFFSKLPAVLQPLLHSGIMLATLSAVVLNVFFNGYQHHADLVKESVSDKDLKVRTVRMWLLMRKLKKNEH
GE
>Q4A0V8 ~~~uafA~~~Uro-adherence factor A~~~COG4932
MNKNKKRHRFDFLPNRLNKYSIRKFTNGIASVLIGSTILLGAVIDKEADAAEQQPTSEVYGQTDNNYKSESNKSNSVQHD
EERTNIINDNDEANYSNHSEIPIHDKSSEYDQKPINEQDTSNHHETHLNQNATENHVKEESKEVSTEEESIEDRKTEEST
TEESKAVEEANKENTTEKNDEGSLDLEKEKDTYKDEKDNGKKKNELESHNHRIENKVEDNVYKNNKETNLESKNENVNKD
DKVNTSSSTSIEKPEDNATRSNLINSVNHSLKQLDNAKNNTEKQSLLENYYQTHTNATASDAKKAIEKLNIDFTKQNSDQ
LIALLLIELANQMDKDKVQANVPASKRAETNNESLSIETNTTNIEKTLAKPSTSKFRSANTRATNVVNYAANQSGRNVNH
LVFANTSYEILGGGKKYNQVFMTMDGKLKIKIDYTVDDSVVEGDYFTVDFGKYIHPGTSRKPYRVNNIHDANGRTIAIGS
YDSATNTAKYTFTNYVDIYNNVRGSFSLLSWPFKELVTTDKQSVPVGITVAGEDYTQNVIFNYGNRTVPVISDINYLTKD
FAEFTTYINQNRAFNTGSKVRLSGQGFKFTSPDEIEVYKVLNNSQFRDSFSPDYANLTQVRNPKIIINSDGSATVDLGDI
GTLGYIIRSKPNTLPDFSGIGVLKSEYTFTNNKNQRDTRAHASSIQFVRAELAGFGGFGGYVWFDKNNDGVQNDSNAAAA
GITVNLLDPTGIRLATTTTDITGHYNFDNLTNGNYLVEFVMPEGYIPTQANSTVDDKDSDVVFENGRYIAHVTIKDADNM
TIDAGLVSDTTSESLSLSESLSTSQSLSLSHSLSLSESESTSQSLSLSTSESLSQSLSLSASESLSESESLSESESLSES
ESLSASESLSESESLSESESLSASESLSTSESLSASESLSESESLSESESLSESESLSESESLSTSESLSASESLSESES
LSESESLSASESLSDSESLSASESLSASESLSASESLSASESLSASESLSESESLSDSESLSESESLSESESLSTSESLS
ASESISASESLSTSESLSTSESLSESESLSESESLSEYESLSASESLSASESLSTSESLSESASLSESESLSTSESLSAS
ESISASESLSASESLSESESLSTSESLSTSESLSESESLSESESLSSSESLSSSESLSESESLSTSESISESESLSTSES
LSASESLSESESLSASESISESESLSASESLSEYESLSTSESLSSSESLSESESLSASESLSESESLSASESISESESLS
ESESLSTSESLSASESLSESESLSESESLSESESLSESESLSASESLSESESLSESESLSARESLSQSEALSESESLSTS
ESLSESESLSASESISESEYLSESESLSASESLSESESISASESLSESESLSESESLSESESLSTSESLSASESLSESES
LSESESLSESESLSESESLSESESLSDSESLSESESLSESESLSESESLSDSESLSESESLSESESLSESESLSESESLS
ESELLSESESLSESESISESESISASESLSESESLSESESLSESESLSESESLSESESLSESESLSESESLSASESISAS
ESLSESESLSESESLSESESLSESESLSTSESLSESESLSTSESISESESLSESESLSESESLSESESLSKSESLSESES
LSESESLSTSESLSESESLSASESISESESLSESESLSESESLSTSESLRQSESLSQSLSLSASESLSASESISESESLS
ESESLSASESLSESESLSASESLSESESLSESESLSGSESLSASESLSESESLSESESLSESESLSESESLSGSESLSAS
ESISESESISASESLSESESISASESISESESLSTSESLSTSESLSESESLSASESLSESESLSESESLSESESLSESES
LSASESLSASESLSESESLSESESLSESESLSESESLSESESISESESLSESESLSESESLSESESISDSESISESESLS
ESESLSESESISESESLSESESLSESESLSESESISDSESLSESESLSESESLSESESLSESESLSESESLSASESLSAS
ESLSASESLSESESLSDSESLSESESLSESESLSASESLSASESLSESESLSESESISDSESLSTSESLSESESLSESES
LSGSESLSASESLSASESLSASESLSASESISESESLSESESLSESESLSDSESLSESESLSTSESLSASESLSESESLN
TSESMSESESLGASESISEYESLNRHLNNDGNYEKEKDKLPDTGNEDKHNGLIPLLTALGGIILLRRRRNNEIQDK
>Q9RV58 ~~~~~~Protein DR_1172~~~COG5412
MFERDEHHFPVKRLLLLGALVGAGAYYLSREQNRKALDAKLAELGLKDAAQDVGSSVTKGWEKTKDAAQNAGSVIADKAQ
DVAGEVKSAVAGATAEIKDAGKEVADTAKDAGQNVGQNVKREAADLADQAKDKAQDVKADVSKAADQAKDKAQDVAQNVQ
AGAQQAAANVKDKVQDVKADASKAADQAKDKAQDVAQNVKQGAQQAASDAKDKVQDVKADASRAADQAKDKAQDVAQNVK
QSAQDAKTDVDAKAKSWAFDLRTDAEAGKQGGQTGSTTNNAGTAGNTGMTGNTNTRKN
>A9Q0M7 ~~~ubaA~~~Bacteriocin ubericin-A~~~
MNTIEKFENIKLFSLKKIIGGKTVNYGNGLYCNQKKCWVNWSETATTIVNNSIMNGLTGGNAGWHSGGRA
>M1YW29 ~~~ubact~~~Prokaryotic ubiquitin-like protein UBact~~~
MEMTDPLRREEKKESSPDPKEESGPSRPDVSRPGRDSLLKRMKKVDPKQSEKYKQRTGQ
>B9XEI9 ~~~ubact~~~Prokaryotic ubiquitin-like protein UBact~~~
MPDQAQKTRPVGPGPSGGGEGPGSPKVEKPNTEELLKRMRKVDPDQAKRYRQRTGQ
>A0A0G0MF47 ~~~ubact~~~Prokaryotic ubiquitin-like protein UBact~~~
MPQDQQRKKQFDPNPNRDDSQRKTPVDKEIDDIIDEADEIVKRNKEQAKKKQPSRQ
>P0AGK1 2.5.1.39~~~ubiA~~~4-hydroxybenzoate octaprenyltransferase~~~COG0382
MEWSLTQNKLLAFHRLMRTDKPIGALLLLWPTLWALWVATPGVPQLWILAVFVAGVWLMRAAGCVVNDYADRKFDGHVKR
TANRPLPSGAVTEKEARALFVVLVLISFLLVLTLNTMTILLSIAALALAWVYPFMKRYTHLPQVVLGAAFGWSIPMAFAA
VSESVPLSCWLMFLANILWAVAYDTQYAMVDRDDDVKIGIKSTAILFGQYDKLIIGILQIGVLALMAIIGELNGLGWGYY
WSILVAGALFVYQQKLIANREREACFKAFMNNNYVGLVLFLGLAMSYWHF
>Q7CPB4 2.5.1.39~~~ubiA~~~4-hydroxybenzoate octaprenyltransferase~~~
MEWSLTQSKLLAFHRLMRTDKPIGALLLLWPTLWALWVATPGMPQLWILAVFVAGVWLMRAAGCVVNDYADRKFDGHVKR
TVNRPLPSGAVTEKEARNLFVVLVLLAFLLVLTLNAMTILLSVAALALAWVYPFMKRYTHLPQVVLGAAFGWSIPMAFAA
VSESLPLSCWLMFLANILWAVAYDTQYAMVDRDDDIKIGIKSTAILFGRYDTLIIGILQLGVMALMALIGWLNGLGWGYY
WAVLVAGALFVYQQKLIANREREACFKAFMNNNYVGLVLFLGLAMSYWHF
>P0A6A0 2.7.-.-~~~ubiB~~~Probable protein kinase UbiB~~~COG0661
MTPGEVRRLYFIIRTFLSYGLDELIPKMRITLPLRLWRYSLFWMPNRHKDKLLGERLRLALQELGPVWIKFGQMLSTRRD
LFPPHIADQLALLQDKVAPFDGKLAKQQIEAAMGGLPVEAWFDDFEIKPLASASIAQVHTARLKSNGKEVVIKVIRPDIL
PVIKADLKLIYRLARWVPRLLPDGRRLRPTEVVREYEKTLIDELNLLRESANAIQLRRNFEDSPMLYIPEVYPDYCSEGM
MVMERIYGIPVSDVAALEKNGTNMKLLAERGVQVFFTQVFRDSFFHADMHPGNIFVSYEHPENPKYIGIDCGIVGSLNKE
DKRYLAENFIAFFNRDYRKVAELHVDSGWVPPDTNVEEFEFAIRTVCEPIFEKPLAEISFGHVLLNLFNTARRFNMEVQP
QLVLLQKTLLYVEGVGRQLYPQLDLWKTAKPFLESWIKDQVGIPALVRAFKEKAPFWVEKMPELPELVYDSLRQGKYLQH
SVDKIARELQSNHVRQGQSRYFLGIGATLVLSGTFLLVSRPEWGLMPGWLMAGGLIAWFVGWRKTR
>O07443 2.7.-.-~~~ubiB~~~Probable protein kinase UbiB~~~
MTPGEIKRLYFIIRVFLSYGLDELIPKVKLTLPLRIGRFGFFWIKNQHKGKELGERLRLALQELGPVWIKFGQMLSTRRD
LFPLAIADQLSLLQDKVASFDGKLARRYIEESLGGPLEQWFDDFDEKALASASIAQVHTAKLKENGKEVVLKVIRPDILP
VIKADVKLMYRIANWVPLLPDGRRLRPKEVVREYEKTLIDELNLLRESANAIQLRRNFENSSMLYVPEVYADYCRENVMV
MERIYGIPVSDIAALKAQGTNMKILAERGVKVFFTQVFRDSFFHADMHPGNIFVSYEHPEDPLYIGIDCGIVGSLNKEDK
RYLAENFIAFFNRDYRKVAELHVDSGWVPADTNVEDFEFAIRTVCEPIFEKPLAEISFGHVLLNLFNTARRFNMEVQPQL
VLLQKTLLYIEGLGRQLYPQLDLWKTAKPFLEDWVHSQVGIPAITQALKEKAPYWAEKMPEIPDLIYGALRQHKFLQSNI
EQLSEQLKQQRNKQRKSQYLLGIGATLILCGSLFFISASNRMAIAFMSAGALSWIIGWYKSGKS
>P26602 4.1.3.40~~~ubiC~~~Chorismate pyruvate-lyase~~~COG3161
MSHPALTQLRALRYCKEIPALDPQLLDWLLLEDSMTKRFEQQGKTVSVTMIREGFVEQNEIPEELPLLPKESRYWLREIL
LCADGEPWLAGRTVVPVSTLSGPELALQKLGKTPLGRYLFTSSTLTRDFIEIGRDAGLWGRRSRLRLSGKPLLLTELFLP
ASPLY
>P0AAB4 4.1.1.98~~~ubiD~~~3-octaprenyl-4-hydroxybenzoate carboxy-lyase~~~COG0043
MDAMKYNDLRDFLTLLEQQGELKRITLPVDPHLEITEIADRTLRAGGPALLFENPKGYSMPVLCNLFGTPKRVAMGMGQE
DVSALREVGKLLAFLKEPEPPKGFRDLFDKLPQFKQVLNMPTKRLRGAPCQQKIVSGDDVDLNRIPIMTCWPEDAAPLIT
WGLTVTRGPHKERQNLGIYRQQLIGKNKLIMRWLSHRGGALDYQEWCAAHPGERFPVSVALGADPATILGAVTPVPDTLS
EYAFAGLLRGTKTEVVKCISNDLEVPASAEIVLEGYIEQGETAPEGPYGDHTGYYNEVDSFPVFTVTHITQREDAIYHST
YTGRPPDEPAVLGVALNEVFVPILQKQFPEIVDFYLPPEGCSYRLAVVTIKKQYAGHAKRVMMGVWSFLRQFMYTKFVIV
CDDDVNARDWNDVIWAITTRMDPARDTVLVENTPIDYLDFASPVSGLGSKMGLDATNKWPGETQREWGRPIKKDPDVVAH
IDAIWDELAIFNNGKSA
>P0A887 2.1.1.163~~~ubiE~~~Ubiquinone/menaquinone biosynthesis C-methyltransferase UbiE~~~COG2226
MVDKSQETTHFGFQTVAKEQKADMVAHVFHSVASKYDVMNDLMSFGIHRLWKRFTIDCSGVRRGQTVLDLAGGTGDLTAK
FSRLVGETGKVVLADINESMLKMGREKLRNIGVIGNVEYVQANAEALPFPDNTFDCITISFGLRNVTDKDKALRSMYRVL
KPGGRLLVLEFSKPIIEPLSKAYDAYSFHVLPRIGSLVANDADSYRYLAESIRMHPDQDTLKAMMQDAGFESVDYYNLTA
GVVALHRGYKF
>P75728 1.14.99.60~~~ubiF~~~3-demethoxyubiquinol 3-hydroxylase~~~COG0654
MTNQPTEIAIVGGGMVGGALALGLAQHGFAVTVIEHAEPAPFVADSQPDVRISAISAASVSLLKGLGVWDAVQAMRCHPY
RRLETWEWETAHVVFDAAELKLPLLGYMVENTVLQQALWQALEAHPKVTLRVPGSLIALHRHDDLQELELKGGEVIRAKL
VIGADGANSQVRQMAGIGVHAWQYAQSCMLISVQCENDPGDSTWQQFTPDGPRAFLPLFDNWASLVWYDSPARIRQLQNM
NMAQLQAEIAKHFPSRLGYVTPLAAGAFPLTRRHALQYVQPGLALVGDAAHTIHPLAGQGVNLGYRDVDALIDVLVNARS
YGEAWASYPVLKRYQMRRMADNFIMQSGMDLFYAGFSNNLPPLRFMRNLGLMAAERAGVLKRQALKYALGL
>P17993 2.1.1.222~~~ubiG~~~Ubiquinone biosynthesis O-methyltransferase~~~COG2227
MNAEKSPVNHNVDHEEIAKFEAVASRWWDLEGEFKPLHRINPLRLGYIAERAGGLFGKKVLDVGCGGGILAESMAREGAT
VTGLDMGFEPLQVAKLHALESGIQVDYVQETVEEHAAKHAGQYDVVTCMEMLEHVPDPQSVVRACAQLVKPGGDVFFSTL
NRNGKSWLMAVVGAEYILRMVPKGTHDVKKFIKPAELLGWVDQTSLKERHITGLHYNPITNTFKLGPGVDVNYMLHTQNK
>P25534 1.14.13.-~~~ubiH~~~2-octaprenyl-6-methoxyphenol hydroxylase~~~COG0654
MSVIIVGGGMAGATLALAISRLSHGALPVHLIEATAPESHAHPGFDGRAIALAAGTCQQLARIGVWQSLADCATAITTVH
VSDRGHAGFVTLAAEDYQLAALGQVVELHNVGQRLFALLRKAPGVTLHCPDRVANVARTQSHVEVTLESGETLTGRVLVA
ADGTHSALATACGVDWQQEPYEQLAVIANVATSVAHEGRAFERFTQHGPLAMLPMSDGRCSLVWCHPLERREEVLSWSDE
KFCRELQSAFGWRLGKITHAGKRSAYPLALTHAARSITHRTVLVGNAAQTLHPIAGQGFNLGMRDVMSLAETLTQAQERG
EDMGDYGVLCRYQQRRQSDREATIGVTDSLVHLFANRWAPLVVGRNIGLMTMELFTPARDVLAQRTLGWVAR
>P25535 1.14.13.240~~~ubiI~~~2-octaprenylphenol hydroxylase~~~COG0654
MQSVDVAIVGGGMVGLAVACGLQGSGLRVAVLEQRVQEPLAANAPPQLRVSAINAASEKLLTRLGVWQDILSRRASCYHG
MEVWDKDSFGHISFDDQSMGYSHLGHIVENSVIHYALWNKAHQSSDITLLAPAELQQVAWGENETFLTLKDGSMLTARLV
IGADGANSWLRNKADIPLTFWDYQHHALVATIRTEEPHDAVARQVFHGEGILAFLPLSDPHLCSIVWSLSPEEAQRMQQA
SEDEFNRALNIAFDNRLGLCKVESARQVFPLTGRYARQFASHRLALVGDAAHTIHPLAGQGVNLGFMDAAELIAELKRLH
RQGKDIGQYIYLRRYERSRKHSAALMLAGMQGFRDLFSGTNPAKKLLRDIGLKLADTLPGVKPQLIRQAMGLNDLPEWLR
>P0ADP7 ~~~ubiJ~~~Ubiquinone biosynthesis accessory factor UbiJ~~~COG3165
MPFKPLVTAGIESLLNTFLYRSPALKTARSRLLGKVLRVEVKGFSTSLILVFSERQVDVLGEWAGDADCTVIAYASVLPK
LRDRQQLTALIRSGELEVQGDIQVVQNFVALADLAEFDPAELLAPYTGDIAAEGISKAMRGGAKFLHHGIKRQQRYVAEA
ITEEWRMAPGPLEVAWFAEETAAVERAVDALTKRLEKLEAK
>Q46868 ~~~ubiK~~~Ubiquinone biosynthesis accessory factor UbiK~~~COG2960
MIDPKKIEQIARQVHESMPKGIREFGEDVEKKIRQTLQAQLTRLDLVSREEFDVQTQVLLRTREKLALLEQRISELENRS
TEIKKQPDPETLPPTL
>Q8ZLY9 ~~~ubiK~~~Ubiquinone biosynthesis accessory factor UbiK~~~
MASTYRTTIRANTYQFRETTMIDPKKIEQIARQVHESMPKGIREFGEDIEKKIRQTLQSQLTRLDLVSREEFDVQTQVLL
RTREKLALLEQRLSELEARDKPEEVKPAPAIPPVDPQQE
>Q2RMZ4 1.14.13.-~~~ubiL~~~Ubiquinone hydroxylase UbiL~~~COG0654
MSEPLLRGLAAGDPPSATGPVTGSADKVADVLIVGGGLVGGTLACALAEKGVSVVVIDGEDPEALLAAGYDGRCSAIALA
CQRLLDTIGLWDLLGGESQPILDIRVVDGGSPLFLHYAQAEAQGPMGYMVENRLLRQAILTRLGRLPAATLLAPARMTAL
RRDLDGVSATLSDGQTVRARLVVGADGRRSQVRESAGIGIRTLGYGQTAIVLTVEHERSHRGCAVEHFLPAGPFAILPMP
GNRSSLVWTERSDLVPGLLALPAEHFQAELERRFGDHLGWVRPVGPRFSYRLTLQAANRYVDHRLALVGDAAHGMHPVAG
QGMNYGLRDVAVLAERLVAAQRLGLDPGAPALLAEYEALRRPDNLLMLAITDALVRLFSNDIAPVALARRLGIGAVERMG
PLKRLFMRHAMGTLKLGPEPPRLMRGVPL
>A1KVW0 1.14.13.-~~~ubiM~~~Ubiquinone hydroxylase UbiM~~~
MRLFTYPTPDLIHIKLKVKIRIHPPLHISSTAGFDIIAYLLQIVQTAFKPLQMPSEIIGIRLCKGYFMSLHSDILVVGAG
PAGLSFAAELAGSGLKVTLIERSPLTVLQNPPYDGREIALTHFSREIMQRLGMWDKIPENEIYPLRDAKVLNGRSDYQLH
FPQPTEARGEPADCLGYLISNHNIRRAAYEVVSQLDNVSILTDTVVKEVKTSDNEAQVILENGKILTARLLLAADSRFSQ
TRRQLGISSDMHDYSRTMFVCRMKHTLSNQHTAYECFHYGRTIALLPLEEHLTNTVITVDTDKINSVQNLSPEELAASVK
EQLKGRLGDMELVSSIHHYPLVGMIAKRFYGKRSALIGDAAVGMHPVTAHGFNLGLSSADILAKLILEAEQRGQDIGASS
LLEKYSNKHMLHAHPLYHGTNMMLKLFTNETAPAKLLRGLVLRAGNNFPPLKKLITKQLTG
>P45527 ~~~ubiU~~~Ubiquinone biosynthesis protein UbiU~~~COG0826
MELLCPAGNLPALKAAIENGADAVYIGLKDDTNARHFAGLNFTEKKLQEAVSFVHQHRRKLHIAINTFAHPDGYARWQRA
VDMAAQLGADALILADLAMLEYAAERYPHIERHVSVQASATNEEAINFYHRHFDVARVVLPRVLSIHQVKQLARVTPVPL
EVFAFGSLCIMSEGRCYLSSYLTGESPNTIGACSPARFVRWQQTPQGLESRLNEVLIDRYQDGENAGYPTLCKGRYLVDG
ERYHALEEPTSLNTLELLPELMAANIASVKIEGRQRSPAYVSQVAKVWRQAIDRCKADPQNFVPQSAWMETLGSMSEGTQ
TTLGAYHRKWQ
>P45475 ~~~ubiV~~~Ubiquinone biosynthesis protein UbiV~~~COG0826
MKYSLGPVLWYWPKETLEEFYQQAATSSADVIYLGEAVCSKRRATKVGDWLEMAKSLAGSGKQIVLSTLALVQASSELGE
LKRYVENGEFLIEASDLGVVNMCAERKLPFVAGHALNCYNAVTLKILLKQGMMRWCMPVELSRDWLVNLLNQCDELGIRN
QFEVEVLSYGHLPLAYSARCFTARSEDRPKDECETCCIKYPNGRNVLSQENQQVFVLNGIQTMSGYVYNLGNELASMQGL
VDVVRLSPQGTDTFAMLDAFRANENGAAPLPLTANSDCNGYWRRLAGLELQA
>O66811 2.5.1.129~~~ubiX~~~Flavin prenyltransferase UbiX~~~COG0163
MQKIALCITGASGVIYGIKLLQVLEELDFSVDLVISRNAKVVLKEEHSLTFEEVLKGLKNVRIHEENDFTSPLASGSRLV
HYRGVYVVPCSTNTLSCIANGINKNLIHRVGEVALKERVPLVLLVREAPYNEIHLENMLKITRMGGVVVPASPAFYHKPQ
SIDDMINFVVGKLLDVLRIEHNLYKRWRG
>O84222 2.5.1.129~~~ubiX~~~Flavin prenyltransferase UbiX~~~
MKRYVVGISGASGIVLAVTLVSELARLGHHIDVIISPSAQKTLYYELDTKSFLSTIPQNFHNQIVLHHISSIESSVSSGS
NTIDATIIVPCSVATVAAISCGLADNLLRRVADVALKEKRPLILVPREAPLSAIHLENLLKLAQNGAVILPPMPIWYFKP
QTAEDIANDIVGKILAILQLDSPLIKRWENPR
>P0AG03 2.5.1.129~~~ubiX~~~Flavin prenyltransferase UbiX~~~COG0163
MKRLIVGISGASGAIYGVRLLQVLRDVTDIETHLVMSQAARQTLSLETDFSLREVQALADVTHDARDIAASISSGSFQTL
GMVILPCSIKTLSGIVHSYTDGLLTRAADVVLKERRPLVLCVRETPLHLGHLRLMTQAAEIGAVIMPPVPAFYHRPQSLD
DVINQTVNRVLDQFAITLPEDLFARWQGA
>Q9HX08 2.5.1.129~~~ubiX~~~Flavin prenyltransferase UbiX~~~
MSGPERITLAMTGASGAQYGLRLLDCLVQEEREVHFLISKAAQLVMATETDVALPAKPQAMQAFLTEYCGAAAGQIRVFG
QNDWMAPPASGSSAPNAMVICPCSTGTLSAVATGACNNLIERAADVALKERRPLVLVPREAPFSSIHLENMLKLSNLGAV
ILPAAPGFYHQPQSVEDLVDFVVARILNTLGIPQDMLPRWGEQHLVSDE
>A5H1G9 ~~~ublA~~~Bacteriocin uberolysin~~~
MDILLELAGYTGIASGTAKKVVDAIDKGAAAFVIISIISTVISAGALGAVSASADFIILTVKNYISRNLKAQAVIW
>Q8E372 3.2.1.180~~~~~~Unsaturated chondroitin disaccharide hydrolase~~~COG4225
MMKIKPVKVESIENPKRFLNSRLLTKIEVEEAIEKALKQLYINIDYFGEEYPTPATFNNIYKVMDNTEWTNGFWTGCLWL
AYEYNQDKKLKNIAHKNVLSFLNRINNRIALDHHDLGFLYTPSCTAEYRINGDVKALEATIKAADKLMERYQEKGGFIQA
WGELGYKEHYRLIIDCLLNIQLLFFAYEQTGDEKYRQVAVNHFYASANNVVRDDSSAFHTFYFDPETGEPLKGVTRQGYS
DESSWARGQAWGIYGIPLSYRKMKDYQQIILFKGMTNYFLNRLPEDKVSYWDLIFTDGSGQPRDTSATATAVCGIHEMLK
YLPEVDPDKETYKYAMHTMLRSLIEQYSNNELIAGRPLLLHGVYSWHSGKGVDEGNIWGDYYYLEALIRFYKDWELYW
>Q9A0T3 3.2.1.180~~~ugl~~~Unsaturated chondroitin disaccharide hydrolase~~~
MARPLKTIALEPIKQPERFTKEDFLSQEDITQALDLALKQVRLNMDYFKEDFPTPATKDNQYAIMDNTEWTNAFWTGCLW
LAYEYSGDDAIKALAQANDLSFLDRVTRDIELDHHDLGFLYTPSCMAEWKLLKTPESREAALKAADKLVQRYQDKGGFIQ
AWGELGKKEDYRLIIDCLLNIQLLFFASQETGDNRYRDMAINHFYASANHVIRDDASAYHTFYFDPETGDPVKGVTRQGY
SDDSAWARGQAWGIYGIPLTYRFLKEPELIQLFKGMTHYFLNRLPKDQVSYWDLIFGDGSEQSRDSSATAIAVCGIHEML
KTLPDHDPDKKTYEAAMHSMLRALIKDYANKDLKPGAPLLLHGVYSWHSGKGVDEGNIWGDYYYLEALLRFYKDWNPYW
>Q8DR77 3.2.1.180~~~ugl~~~Unsaturated chondroitin disaccharide hydrolase~~~COG4225
MIKKVTIEKIKSPERFLEVPLLTKEEVGQAIDKVIRQLELNLDYFKEDFPTPATFDNVYPIMDNTEWTNGFWTGELWLAY
EYSQQDAFKNIAHKNVLSFLDRVNKRVELDHHDLGFLYTPSCMAEYKINGDGEAREATLKAADKLIERYQEKGGFIQAWG
DLGKKEHYRLIIDCLLNIQLLFFAYQETGDQKYYDIAESHFYASANNVIRDDASSFHTFYFDPETGQPFKGVTRQGYSDD
SCWARGQSWGVYGIPLTYRHLKDESCFDLFKGVTNYFLNRLPKDHVSYWDLIFNDGSDQSRDSSATAIAVCGIHEMLKHL
PEVDADKDIYKHAMHAMLRSLIEHYANDQFTPGGTSLLHGVYSWHSGKGVDEGNIWGDYYYLEALIRFYKDWNLYW
>P37440 1.-.-.-~~~ucpA~~~Oxidoreductase UcpA~~~COG1028
MGKLTGKTALITGALQGIGEGIARTFARHGANLILLDISPEIEKLADELCGRGHRCTAVVADVRDPASVAAAIKRAKEKE
GRIDILVNNAGVCRLGSFLDMSDDDRDFHIDINIKGVWNVTKAVLPEMIARKDGRIVMMSSVTGDMVADPGETAYALTKA
AIVGLTKSLAVEYAQSGIRVNAICPGYVRTPMAESIARQSNPEDPESVLTEMAKAIPMRRLADPLEVGELAAFLASDESS
YLTGTQNVIDGGSTLPETVSVGI
>Q93SX0 7.1.1.6~~~petC1~~~Cytochrome b6-f complex iron-sulfur subunit 1~~~COG0723
MAQFSESVDVPDMGRRQFMNLLTFGTVTGVALGALYPVVNYFIPPAAGGAGGGTTAKDELGNDVSVSKFLESHNVGDRTL
VQGLKGDPTYIVVESKEAITDYGINAVCTHLGCVVPWNAAENKFKCPCHGSQYDATGKVVRGPAPKSLALSHAKTENDKI
VLTSWTETDFRTGEEPWWS
>P26290 7.1.1.6~~~petC2~~~Cytochrome b6-f complex iron-sulfur subunit 2~~~COG0723
MTQISGSPDVPDLGRRQFMNLLTFGTITGVAAGALYPAVKYLIPPSSGGSGGGVTAKDALGNDVKVTEFLASHNAGDRVL
AQGLKGDPTYIVVQGDDTIANYGINAVCTHLGCVVPWNASENKFMCPCHGSQYNAEGKVVRGPAPLSLALAHATVTDDDK
LVLSTWTETDFRTDEDPWWA
>P51130 7.1.1.8~~~petA~~~Ubiquinol-cytochrome c reductase iron-sulfur subunit~~~COG0723
MTTASSADHPTRRDFLFVATGAAAAVGGAAALWPFISQMNPDASTIAAGAPIEVDLSPIAEGQDIKVFWRGKPIYISHRT
KKQIDEARAVNVASLPDPQSDEARVKSGHEQWLVVIGICTHLGCIPIAHEGNYDGFFCPCHGSQYDSSGRIRQGPAPANL
PVPPYQFVSDTKIQIG
>Q02762 7.1.1.8~~~petA~~~Ubiquinol-cytochrome c reductase iron-sulfur subunit~~~
MSNAEDHAGTRRDFLYYATAGAGAVATGAAVWPLINQMNPSADVQALASIFVDVSSVEPGVQLTVKFLGKPIFIRRRTEA
DIELGRSVQLGQLVDTNARNANIDAGAEATDQNRTLDEAGEWLVMWGVCTHLGCVPIGGVSGDFGGWFCPCHGSHYDSAG
RIRKGPAPENLPIPLAKFIDETTIQLG
>Q46136 7.1.1.6~~~petC~~~Cytochrome b6-f complex iron-sulfur subunit~~~
MAQTGNFKSPARMSSLGQGAAPASSGAVTGGKPREGGLKGVDFERRGFLHKIVGGVGAVVAVSTLYPVVKYIIPPARKIK
NVDELTVGKASEVPDGKSKIFQFNEDKVIVVNKGGALTAVSAVCTHLGCLVNWVDADNQYFCPCHGAKYKLTGEIISGPQ
PLPLKQYKARIEGDSIIISKA
>P83794 7.1.1.6~~~petC~~~Cytochrome b6-f complex iron-sulfur subunit~~~
MAQFTESMDVPDMGRRQFMNLLAFGTVTGVALGALYPLVKYFIPPSGGAVGGGTTAKDKLGNNVKVSKFLESHNAGDRVL
VQGLKGDPTYIVVESKEAIRDYGINAVCTHLGCVVPWNAAENKFKCPCHGSQYDETGKVIRGPAPLSLALCHATVQDDNI
VLTPWTETDFRTGEKPWWV
>P05417 7.1.1.8~~~petA~~~Ubiquinol-cytochrome c reductase iron-sulfur subunit~~~
MSHADEHAGDHGATRRDFLYYATAGAGTVAAGAAAWTLVNQMNPSADVQALASIQVDVSGVETGTQLTVKWLGKPVFIRR
RTEDEIQAGREVDLGQLIDRSAQNSNKPDAPATDENRTMDEAGEWLVMIGVCTHLGCVPIGDGAGDFGGWFCPCHGSHYD
TSGRIRRGPAPQNLHIPVAEFLDDTTIKLG
>P0CY48 7.1.1.8~~~petA~~~Ubiquinol-cytochrome c reductase iron-sulfur subunit~~~
MSHAEDNAGTRRDFLYHATAATGVVVTGAAVWPLINQMNASADVKAMSSIFVDVSAVEVGTQLTVKWRGKPVFIRRRDEK
DIELARSVPLGALRDTSAENANKPGAEATDENRSLAAFDGTNTGEWLVMLGVCTHLGCVPMGDKSGDFGGWFCPCHGSHY
DSAGRIRKGPAPRNLDIPVAAFVDETTIKLG
>D5ANZ2 7.1.1.8~~~petA~~~Ubiquinol-cytochrome c reductase iron-sulfur subunit~~~COG0723
MSHAEDNAGTRRDFLYHATAATGVVVTGAAVWPLINQMNASADVKAMASIFVDVSAVEVGTQLTVKWRGKPVFIRRRDEK
DIELARSVPLGALRDTSAENANKPGAEATDENRTLPAFDGTNTGEWLVMLGVCTHLGCVPMGDKSGDFGGWFCPCHGSHY
DSAGRIRKGPAPRNLDIPVAAFVDETTIKLG
>P23136 7.1.1.8~~~petA~~~Ubiquinol-cytochrome c reductase iron-sulfur subunit~~~
MAEAEHTASTPGGESSRRDFLIYGTTAVGAVGVALAVWPFIDFMNPAADTLALASTEVDVSAIAEGQAITVTWRGKPVFV
RHRTQKEIVVARAVDPASLRDPQTDEARVQQAQWLVMVGVCTHLGCIPLGQKAGDPKGDFDGWFCPCHGSHYDSAGRIRK
GPAPLNLPVPPYAFTDDTTVLIG
>P0C8N8 7.1.1.6~~~petC~~~Cytochrome b6-f complex iron-sulfur subunit~~~COG0723
MAQVSGMSDVPDMGRRQFMNLLTFGTITGTALGALYPVVKYFIPPASGGTGGGAVAKDALGNDIKVSEYLAKHLPGDRSL
AQGIKGDPTYVIVTEDHQIANYGLNAVCTHLGCVVPWNVSENKFICPCHGSQYDSTGKVVRGPAPLSLALVKATVTEDDK
LVFTPWTEIDFRTGKEPWWT
>Q9WYY1 3.2.2.27~~~tmung~~~Type-4 uracil-DNA glycosylase~~~COG1573
MYTREELMEIVSERVKKCTACPLHLNRTNVVVGEGNLDTRIVFVGEGPGEEEDKTGRPFVGRAGMLLTELLRESGIRRED
VYICNVVKCRPPNNRTPTPEEQAACGHFLLAQIEIINPDVIVALGATALSFFVDGKKVSITKVRGNPIDWLGGKKVIPTF
HPSYLLRNRSNELRRIVLEDIEKAKSFIKKEG
>Q5SKC5 3.2.2.27~~~udg~~~Type-4 uracil-DNA glycosylase~~~COG1573
MTLELLQAQAQNCTACRLMEGRTRVVFGEGNPDAKLMIVGEGPGEEEDKTGRPFVGKAGQLLNRILEAAGIPREEVYITN
IVKCRPPQNRAPLPDEAKICTDKWLLKQIELIAPQIIVPLGAVAAEFFLGEKVSITKVRGKWYEWHGIKVFPMFHPAYLL
RNPSRAPGSPKHLTWLDIQEVKRALDALPPKERRPVKAVSQEPLF
>P9WM53 3.2.2.-~~~udgB~~~Type-5 uracil-DNA glycosylase~~~COG1573
MHPKTGRAFRSPVEPGSGWPGDPATPQTPVAADAAQVSALAGGAGSICELNALISVCRACPRLVSWREEVAVVKRRAFAD
QPYWGRPVPGWGSKRPRLLILGLAPAAHGANRTGRMFTGDRSGDQLYAALHRAGLVNSPVSVDAADGLRANRIRITAPVR
CAPPGNSPTPAERLTCSPWLNAEWRLVSDHIRAIVALGGFAWQVALRLAGASGTPKPRFGHGVVTELGAGVRLLGCYHPS
QQNMFTGRLTPTMLDDIFREAKKLAGIE
>Q5SJ65 3.2.2.-~~~udgb~~~Type-5 uracil-DNA glycosylase~~~COG1573
MDREAFVQTLTACRLCPRLVAWREEVVGRKRAFRGEPYWARPVPGFGDPEARILLFGLAPGAHGSNRTGRPFTGDASGAF
LYPLLHEAGLSSKPESLPGDDLRLYGVYLTAAVRCAPPKNKPTPEELRACARWTEVELGLLPEVRVYVALGRIALEALLA
HFGLRKSAHPFRHGAHYPLPGGRHLLASYHVSRQNTQTGRLTREMFLEVLMEAKRLAGL
>Q0P8H3 1.1.1.22~~~kfiD~~~UDP-glucose 6-dehydrogenase~~~COG1004
MKIVIVGIGYVGLANAILFSKNNENEVVLLDIDENKIQSINNHKSPIKDKLIEKFFVQNISKLHATSNIKEAYFNADFAV
IATPTDYDEQLNFFDTRSIENVLKDIKNINSKINVIIKSTVPIGYTKTIKQKFNMSNIVFSPEFLREGSALYDSLYPSRI
IIGDKSVLGKTIGDLFLKNIEKKNVDIFYMDSDEAESVKLFSNTYLAMRVGFFNEVDSYARKHNLNSADIIKGISADDRI
GKYYNNPSFGYGGYCLPKDTKQLLANFYNIPNSLIKAIVETNEIRKKFITQLILEKKPNILGIYRLIMKQNSDNFRNSVI
IDIIKYLQEYNSNIELIIYEPLVKEKKFLNIKVENDFNVFGAKVDLIIANRFDDKLKEIKDKVFSADVFYTDI
>P76373 1.1.1.22~~~ugd~~~UDP-glucose 6-dehydrogenase~~~COG1004
MKITISGTGYVGLSNGLLIAQNHEVVALDILPSRVAMLNDRISPIVDKEIQQFLQSDKIHFNATLDKNEAYRDADYVIIA
TPTDYDPKTNYFNTSSVESVIKDVVEINPYAVMVIKSTVPVGFTAAMHKKYRTENIIFSPEFLREGKALYDNLHPSRIVI
GERSERAERFAALLQEGAIKQNIPMLFTDSTEAEAIKLFANTYLAMRVAYFNELDSYAESLGLNSRQIIEGVCLDPRIGN
HYNNPSFGYGGYCLPKDTKQLLANYQSVPNNLISAIVDANRTRKDFIADAILSRKPQVVGIYRLIMKSGSDNFRASSIQG
IMKRIKAKGVEVIIYEPVMKEDSFFNSRLERDLATFKQQADVIISNRMAEELKDVADKVYTRDLFGSD
>O54068 1.1.1.22~~~rkpK~~~UDP-glucose 6-dehydrogenase~~~COG1004
MKITMIGAGYVGLVSGVCFADFGHDVVCVDKDEGKISALKKGQIPIFEPGLDHLVASNVASGRLNFTDDLKTAVAASDVV
FIAVGTPSRRGDGHADLSYVYAAAREIAANLQGFTVVVTKSTVPVGTGDEVERIIRETNPAADVTVVSNPEFLREGAAIE
DFKRPDRIVIGVDGSDGRAREVMTEVYRPLYLNQSPLVFTTRRTSELIKYAGNAFLAMKITFINEIADLCEKVGANVQDV
ARGIGLDGRIGSKFLHAGPGYGGSCFPKDTLALVKTAQDHDTPVRLVETTVAVNDNRKRAMGRKVIAAAGGDIRGSKIAV
LGLTFKPNTDDMRDSPAIAVVQALQDAGARVTGYDPEGMENARKLIEGLDCARDPYEAAAEADALVIITEWNEFRALDFD
RLKSTMKTPLLVDLRNIYRKDEVAKHGFRYASIGRPD
>P0C0F4 1.1.1.22~~~hasB~~~UDP-glucose 6-dehydrogenase~~~COG1004
MKIAVAGSGYVGLSLGVLLSLQNEVTIVDILPSKVDKINNGLSPIQDEYIEYYLKSKQLSIKATLDSKAAYKEAELVIIA
TPTNYNSRINYFDTQHVETVIKEVLSVNSHATLIIKSTIPIGFITEMRQKFQTDRIIFSPEFLRESKALYDNLYPSRIIV
SCEENDSPKVKADAEKFALLLKSAAKKNNVPVLIMGASEAEAVKLFANTYLALRVAYFNELDTYAESRKLNSHMIIQGIS
YDDRIGMHYNNPSFGYGGYCLPKDTKQLLANYNNIPQTLIEAIVSSNNVRKSYIAKQIINVLKEQESPVKVVGVYRLIMK
SNSDNFRESAIKDVIDILKSKDIKIIIYEPMLNKLESEDQSVLVNDLENFKKQANIIVTNRYDNELQDVKNKVYSRDIFG
RD
>P19638 2.7.1.66~~~dgkA~~~Undecaprenol kinase~~~COG0818
MDSKDHRNELNRFFKSFVHAGRGIWETARTERNFQFHAAAACAVLICGFLVELSIIEWMIIFLLIGGMFSLELLNTAIEH
TVDLITDKHHPLAKAAKDAAAGAVCVFAVISCIIGLLIFLPKL
>Q05888 2.7.1.66~~~dgkA~~~Undecaprenol kinase~~~COG0818
MPMDLRDNKQSQKKWKNRTLTSSLEFALTGIFTAFKEERNMKKHAVSALLAVIAGLVFKVSVIEWLFLLLSIFLVITFEI
VNSAIENVVDLASDYHFSMLAKNAKDMAAGAVLVISGFAALTGLIIFVPKIWFLLFH
>P12758 2.4.2.3~~~udp~~~Uridine phosphorylase~~~COG2820
MSKSDVFHLGLTKNDLQGATLAIVPGDPDRVEKIAALMDKPVKLASHREFTTWRAELDGKPVIVCSTGIGGPSTSIAVEE
LAQLGIRTFLRIGTTGAIQPHINVGDVLVTTASVRLDGASLHFAPLEFPAVADFECTTALVEAAKSIGATTHVGVTASSD
TFYPGQERYDTYSGRVVRHFKGSMEEWQAMGVMNYEMESATLLTMCASQGLRAGMVAGVIVNRTQQEIPNAETMKQTESH
AVKIVVEAARRLL
>P0A1F6 2.4.2.3~~~udp~~~Uridine phosphorylase~~~
MSKSDVFHLGLTKNDLQGAQLAIVPGDPERVEKIAALMDKPVKLASHREFTSWRAELDGKAVIVCSTGIGGPSTSIAVEE
LAQLGIRTFLRIGTTGAIQPHINVGDVLVTTASVRLDGASLHFAPMEFPAVADFACTTALVEAAKSIGATTHVGVTASSD
TFYPGQERYDTYSGRVVRRFKGSMEEWQAMGVMNYEMESATLLTMCASQGLRAGMVAGVIVNRTQQEIPNAETMKQTESH
AVKIVVEAARRLL
>Q9RUE8 ~~~~~~Probable ABC transporter-binding protein DR_1438~~~COG1653
MKKFAAVLGLTVAFAAASQAHAVTLTFACDSVGQGFDECKKGADAWAKKTGNTVKLVQVPKESDARLALYQQQLGAKASD
VDVYMIDVVWPGLIGQHLMDLSKSIPAAEVKAHFPAIVQNNTVGGKLIAMPWFTDAGVLYYRTDLLKKYGYNAPPKTWNE
LATMAQKIQAGERKSNPKFVGYVFQGKNYEGLTCDALEWISSFGGGSIVDPSGKITVNNPKAVQALQAIQGLIGTAAPAA
VTTYGEEEARNVWQAGNSAFMRNWPYAYAAGQKEGSPIAGKIGVAALPAGPGGKPAATLGGWQLAVNAYSKNPKEAADLV
RYLTGAQEQKRRAVQASYNPTIATLYKDKDVLKAVPFFGSLYDVFTNAVARPATVTGSKYNQVSDAFSSAVYSVLTKKSA
PGPALKTLEGQLARIKGRGW
>Q5LUA7 ~~~uehA~~~Ectoine/5-hydroxyectoine-binding periplasmic protein UehA~~~COG1638
MAQSITFTFGAVAAAGIALAAGTAAQADTWRYAFEEAMTDVQGVYAQKFKEEIEANSDHEIQLFPYGTLGESADIMEQTQ
DGILQFVDQSPGFTGSLIPEAQVFFVPYLLPTDQDHLARFFKESKAINDMFKPLYADQGLELLNMFPEGEVAMTTKTPVT
TCSDLDEVKFRVMTNPLLVESYKAFGATPTPLPWGEVYGGLQTNVIQGQENPTFFLYSTKIYEVTDYITYAGHNNFTTAV
MANKDFYDGLSAEDQQLVQNAALAAYDHTVVYQQQAADTELAKIMEAKPEMQVTVLTDEQRSCFKEAAAEVEAKFIEMTG
DSGAAILKQMKADLAATAN
>Q5LUA8 ~~~uehB~~~Ectoine/5-hydroxyectoine TRAP transporter small permease protein UehB~~~COG3090
MQQTHESALPGFLGMLDSAISRIESFLLAMGVLLMAANTVANVVGRFVLGNSIFFSEELNRILIILITFAGISYAARNAR
HIRMSAVYDLLPARLRKGMMVVISVVTAAFMFLLCYYAAKYIGSQASRGRVLPALQIPVWVILIWVPAGFFMTGAQYLLT
AVRNLTSSGIYLSTHVQEGYEDAEEIEI
>Q5LUA9 ~~~uehC~~~Ectoine/5-hydroxyectoine TRAP transporter large permease protein UehC~~~COG1593
MAATIFLTMIVLLLLGFPMMIPLIAGAFIGFLMLFGDLARTETMVQQMLAGIRPASLIAVPMFIFAADIMTRGQSAGRLI
NVVMAYVGHIRGGLAISTAAACTMFGAVSGSTQATVVAIGSPLRPRMLKAGYKDSFVLALIVNASDIAFLIPPSIGMIIY
GVVSSTSIAELFIAGIGPGLLILVLFSAYAYIYAVRNDVPTEPRASWAERARTMRQALWPMGFPVIIIGGIYGGVFSPTE
AAAACVLYALVLEVLVFRSMSLADVYDTAKSTGLITAIVFILVGAGAAFSWVISFAQVPQQILGAIGIAEMGPIGVLFVI
SIAFFIGCMFVDPIVVILVLVPVFAPVVKSVGLDPVLVGTIITLQVAIGSATPPFGCDIFTAIAVFKRPYAEVVRGTPPF
ILMLLGVSVALIFFPQIALFLRDLAFSK
>Q9RU24 ~~~~~~Probable ABC transporter-binding protein DR_1571~~~COG0747
MKKVMMLALALGASTSLAAPFVYPANWTSNKPGDVQTGGTFRSVNLQDFKTLNPFVSSESPNLPAVLSAGSLLGYNPVTG
NYAPYMAEKYTQSADKRTFTFDIRKGMKWSDGKPITVDDWITAYTIDSNKDVGSNTFDYWTINNQPIKVTKVDSNTLKVV
FPKADVTAIEFLSGIFLPQPTHVFMPVWKAKGAQGIKDMWTISTNPDNIVTSGPFMLDRYVRGERAILKKNPYFGEWNKD
SAGKSLPYLDGIQINIVADANAQLAQFLAGNLDTYSPDNRDKLAQVKSAMDGGKVKGTLIPNASARASSDFMVFNMDDSA
TFKTKLFSNVKFRQAMSMLMNRDAMVDLALGGLGEPTYTSVYPVYKDWIPSGMDKYKFNPTAAAKLLAELGFTKKGSDGI
LVDKAGNKLEFTLITNAENNRRQSYAKVIQDEAKKVGVKINVSAIAFNQMTTLLDAKDNFGRRNFDAIIIGLTGGGQVYP
VSGPSVVECKGLGDGGNLHMFNQSNKCRFPFETQAVNLFWKGRAEFDLAKRKAIAAQIQRNEMENQPYIQLAAQTVHFAW
TDRVQGEYNRPQINSLNASTLFGPRDIALTWIKR
>O53732 2.1.1.-~~~ufaA1~~~Tuberculostearic acid methyltransferase UfaA1~~~COG2230
MTVETSQTPSAAIDSDRWPAVAKVPRGPLAAASAAIANRLLRRTATHLPLRLVYSDGTATGAADPRAPSLFIHRPDALAR
RIGRHGLIGFGESYMAGEWSSKELTRVLTVLAGSVDELVPRSLHWLRPITPTFRPSWPDHSRDQARRNIAVHYDLSNDLF
AAFLDETMTYSCAMFTDLLAQPTPAWTELAAAQRRKIDRLLDVAGVQQGSHVLEIGTGWGELCIRAAARGAHIRSVTLSV
EQQRLARQRVAAAGFGHRVEIDLCDYRDVDGQYDSVVSVEMIEAVGYRSWPRYFAALEQLVRPGGPVAIQAITMPHHRML
ATRHTQTWIQKYIFPGGLLPSTQAIIDITGQHTGLRIVDAASLRPHYAETLRLWRERFMQRRDGLAHLGFDEVFARMWEL
YLAYSEAGFRSGYLDVYQWTLIREGPP
>B9J8R3 5.1.3.6~~~lpsL~~~UDP-glucuronate 4-epimerase~~~COG0451
MRYLITGTAGFIGFHLAKRLLDDGHFVVGFDGMTPYYDVKLKEKRTAILARSNGFKAVTGMLEDKAALDHAAELAEPDVI
VHLAAQAGVRYSLENPRSYVDSNLVGSFNVLELARSIQPKHLLLASTSSVYGANEKIPFAESDKADEQMTIYAATKKSME
LMAHSYAHLFHIPTTVFRFFTVYGPWGRPDMALFKFVEAIKHDRPIEIYGEGKMSRDFTYIDDLVEGIVKLIGVIPSEEN
RVVSDTISDTLSKNAPFRIVNIGGGQPVGLMAFVETIEAMLGKRAIRHMLPMQPGDVHNTYAVPDLLVALTGFKPQIEVD
AGVRRFVEWYQENY
>F8C4X8 5.1.3.6~~~~~~UDP-glucuronate 4-epimerase~~~COG0451
MGYYLVTGVAGFIGWRVGEFLLKEGKAVLGVDNLNDAYDVTLKYWRLNELKKSENFKFYQIDITNFQALKTIFETYSISA
VIHLAARAGVRASLENPWVYVDSNITGTLNLLELMKDFGVKKLVLASTSSIYAGQSPPFHEDLKVDTPLSPYAATKKGAE
LLSYTYHHLYGLDISVVRYFTVYGPAGRPDMSIFRFIKWIYEEKPIKIFGDGTQARDFTYIDDIARGTIASLKPLGYEII
NLGGGKNPISINQIIEILERLIGKKAKREYLNFHKADVKVTWADISKAKKLLNWEPEISIEEGLKRTVNWSKENIELIKS
IKV
>Q9RC92 3.2.1.179~~~ugl~~~Unsaturated glucuronyl hydrolase~~~
MWQQAIGDALGITARNLKKFGDRFPHVSDGSNKYVLNDNTDWTDGFWSGILWLCYEYTGDEQYREGAVRTVASFRERLDR
FENLDHHDIGFLYSLSAKAQWIVEKDESARKLALDAADVLMRRWRADAGIIQAWGPKGDPENGGRIIIDCLLNLPLLLWA
GEQTGDPEYRRVAEAHALKSRRFLVRGDDSSYHTFYFDPENGNAIRGGTHQGNTDGSTWTRGQAWGIYGFALNSRYLGNA
DLLETAKRMARHFLARVPEDGVVYWDFEVPQEPSSYRDSSASAITACGLLEIASQLDESDPERQRFIDAAKTTVTALRDG
YAERDDGEAEGFIRRGSYHVRGGISPDDYTIWGDYYYLEALLRLERGVTGYWYERGR
>Q39BA7 4.3.2.3~~~~~~Ureidoglycolate lyase~~~
MKLLRYGPSGQEKPGILDADGRIRDLSAHVPDLSGDVLSDAGLARLRAIDPATLPLVSGEPRIGACVGHVGKFIGIGLNY
ADHAAEAGMPVPKEPVVFGKWTSSICGPNDGIDIPKGSVKTDWEVELGVVIGATCKDVDEARALDYVAGYCVVNDVSERE
WQIERGGQWDKGKGFDTFGPIGPWLVTRDEVPDPQRLDLWLEIDGHRYQNGNTRTMVFTVAQLIAYLSSCMTLQPGDVIT
TGTPPGVGMGIKPSPVFLKAGQMVRLGVEGLGEQLQHTRDAR
>G3XD94 1.1.1.136~~~wbpA~~~UDP-N-acetyl-D-glucosamine 6-dehydrogenase~~~
MIDVNTVVEKFKSRQALIGIVGLGYVGLPLMLRYNAIGFDVLGIDIDDVKVDKLNAGQCYIEHIPQAKIAKARASGFEAT
TDFSRVSECDALILCVPTPLNKYREPDMSFVINTTDALKPYLRVGQVVSLESTTYPGTTEEELLPRVQEGGLVVGRDIYL
VYSPEREDPGNPNFETRTIPKVIGGHTPQCLEVGIALYEQAIDRVVPVSSTKAAEMTKLLENIHRAVNIGLVNEMKIVAD
RMGIDIFEVVDAAATKPFGFTPYYPGPGLGGHCIPIDPFYLTWKAREYGLHTRFIELSGEVNQAMPEYVLGKLMDGLNEA
GRALKGSRVLVLGIAYKKNVDDMRESPSVEIMELIEAKGGMVAYSDPHVPVFPKMREHHFELSSEPLTAENLARFDAVVL
ATDHDKFDYELIKAEAKLVVDSRGKYRSPAAHIIKA
>P10905 ~~~ugpA~~~sn-glycerol-3-phosphate transport system permease protein UgpA~~~COG1175
MSSSRPVFRSRWLPYLLVAPQLIITVIFFIWPAGEALWYSLQSVDPFGFSSQFVGLDNFVTLFHDSYYLDSFWTTIKFST
FVTVSGLLVSLFFAALVEYIVRGSRFYQTLMLLPYAVAPAVAAVLWIFLFNPGRGLITHFLAEFGYDWNHAQNSGQAMFL
VVFASVWKQISYNFLFFYAALQSIPRSLIEAAAIDGAGPIRRFFKIALPLIAPVSFFLLVVNLVYAFFDTFPVIDAATSG
GPVQATTTLIYKIYREGFTGLDLASSAAQSVVLMFLVIVLTVVQFRYVESKVRYQ
>P0AG80 ~~~ugpB~~~sn-glycerol-3-phosphate-binding periplasmic protein UgpB~~~COG1653
MKPLHYTASALALGLALMGNAQAVTTIPFWHSMEGELGKEVDSLAQRFNAENPDYKIVPTYKGNYEQNLSAGIAAFRTGN
APAILQVYEVGTATMMASKAIKPVYDVFKEAGIQFDESQFVPTVSGYYSDSKTGHLLSQPFNSSTPVLYYNKDAFKKAGL
DPEQPPKTWQDLADYAAKLKASGMKCGYASGWQGWIQLENFSAWNGLPFASKNNGFDGTDAVLEFNKPEQVKHIAMLEEM
NKKGDFSYVGRKDESTEKFYNGDCAMTTASSGSLANIREYAKFNYGVGMMPYDADAKDAPQNAIIGGASLWVMQGKDKET
YTGVAKFLDFLAKPENAAEWHQKTGYLPITKAAYDLTREQGFYEKNPGADTATRQMLNKPPLPFTKGLRLGNMPQIRVIV
DEELESVWTGKKTPQQALDTAVERGNQLLRRFEKSTKS
>P10907 7.6.2.10~~~ugpC~~~sn-glycerol-3-phosphate import ATP-binding protein UgpC~~~COG3842
MAGLKLQAVTKSWDGKTQVIKPLTLDVADGEFIVMVGPSGCGKSTLLRMVAGLERVTEGDIWINDQRVTEMEPKDRGIAM
VFQNYALYPHMSVEENMAWGLKIRGMGKQQIAERVKEAARILELDGLLKRRPRELSGGQRQRVAMGRAIVRDPAVFLFDE
PLSNLDAKLRVQMRLELQQLHRRLKTTSLYVTHDQVEAMTLAQRVMVMNGGVAEQIGTPVEVYEKPASLFVASFIGSPAM
NLLTGRVNNEGTHFELDGGIELPLNGGYRQYAGRKMTLGIRPEHIALSSQAEGGVPMVMDTLEILGADNLAHGRWGEQKL
VVRLAHQERPTAGSTLWLHLAENQLHLFDGETGQRV
>Q8ZLF4 7.6.2.10~~~ugpC~~~sn-glycerol-3-phosphate import ATP-binding protein UgpC~~~
MAGLKLQAVTKSWDGKTQVIQPLTLDVADGEFIVMVGPSGCGKSTLLRMVAGLERVTSGDIWIDRKRVTEMEPKDRGIAM
VFQNYALYPHMSVEENMAWGLKIRGMSKAHIEERVREAARILELDGLLKRRPRELSGGQRQRVAMGRAIVREPAVFLFDE
PLSNLDAKLRVQMRLELQHLHRRLRTTSLYVTHDQVEAMTLAQRVMVMNKGVAEQIGTPVEVYEKPASRFVASFIGSPAM
NLLDGVISASGDRFELPGGLALPIGADYRGHAGRNMTLGIRPEHIALSSQAEGGVPLTVDTLEILGADNLAHGRWGDQKL
VVRLAHQQRPAAGSTLWLHLPEHQRHLFDGETGQRV
>P10906 ~~~ugpE~~~sn-glycerol-3-phosphate transport system permease protein UgpE~~~COG0395
MIENRPWLTIFSHTMLILGIAVILFPLYVAFVAATLDKQAVYAAPMTLIPGTHLLENIHNIWVNGVGTNSAPFWRMLLNS
FVMAFSITLGKITVSMLSAFAIVWFRFPLRNLFFWMIFITLMLPVEVRIFPTVEVIANLQMLDSYAGLTLPLMASATATF
LFRQFFMTLPDELVEAARIDGASPMRFFCDIVFPLSKTNLAALFVITFIYGWNQYLWPLLIITDVDLGTTVAGIKGMIAT
GEGTTEWNSVMVAMLLTLIPPVVIVLVMQRAFVRGLVDSEK
>P10908 3.1.4.46~~~ugpQ~~~Glycerophosphodiester phosphodiesterase, cytoplasmic~~~COG0584
MSNWPYPRIVAHRGGGKLAPENTLASIDVGAKYGHKMIEFDAKLSKDGEIFLLHDDNLERTSNGWGVAGELNWQDLLRVD
AGSWYSKMFKGEPLPLLSQVAERCREHGMMANIEIKPTTGTGPLTGKMVALAARELWAGMTPPLLSSFEIDALEAAQQAA
PELPRGLLLDEWRDDWRELTARLGCVSIHLNHKLLNKARVMQLKDAGLRILVYTVNKPQRAAELLRWGVDCICTDAIDVI
GPNFTAQ
>P54166 2.4.1.315~~~ugtP~~~Processive diacylglycerol beta-glucosyltransferase~~~COG0707
MNTNKRVLILTANYGNGHVQVAKTLYEQCVRLGFQHVTVSNLYQESNPIVSEVTQYLYLKSFSIGKQFYRLFYYGVDKIY
NKRKFNIYFKMGNKRLGELVDEHQPDIIINTFPMIVVPEYRRRTGRVIPTFNVMTDFCLHKIWVHENVDKYYVATDYVKE
KLLEIGTHPSNVKITGIPIRPQFEESMPVGPIYKKYNLSPNKKVLLIMAGAHGVLKNVKELCENLVKDDQVQVVVVCGKN
TALKESLSALEAENGDKLKVLGYVERIDELFRITDCMITKPGGITLTEATAIGVPVILYKPVPGQEKENANFFEDRGAAI
VVNRHEEILESVTSLLADEDTLHRMKKNIKDLHLANSSEVILEDILKESEMMTAKQKAKVLS
>Q2FZP7 2.4.1.315~~~ugtP~~~Processive diacylglycerol beta-glucosyltransferase~~~COG0707
MVTQNKKILIITGSFGNGHMQVTQSIVNQLNDMNLDHLSVIEHDLFMEAHPILTSICKKWYINSFKYFRNMYKGFYYSRP
DKLDKCFYKYYGLNKLINLLIKEKPDLILLTFPTPVMSVLTEQFNINIPVATVMTDYRLHKNWITPYSTRYYVATKETKQ
DFIDVGIDPSTVKVTGIPIDNKFETPINQKQWLIDNNLDPDKQTILMSAGAFGVSKGFDTMITDILAKSANAQVVMICGK
SKELKRSLTAKFKSNENVLILGYTKHMNEWMASSQLMITKPGGITITEGFARCIPMIFLNPAPGQELENALYFEEKGFGK
IADTPEEAIKIVASLTNGNEQLTNMISTMEQDKIKYATQTICRDLLDLIGHSSQPQEIYGKVPLYARFFVK
>Q5HH69 2.4.1.315~~~ugtP~~~Processive diacylglycerol beta-glucosyltransferase~~~
MVTQNKKILIITGSFGNGHMQVTQSIVNQLNDMNLDHLSVIEHDLFMEAHPILTSICKKWYINSFKYFRNMYKGFYYSRP
DKLDKCFYKYYGLNKLINLLIKEKPDLILLTFPTPVMSVLTEQFNINIPVATVMTDYRLHKNWITPYSTRYYVATKETKQ
DFIDVGIDPSTVKVTGIPIDNKFETPINQKQWLIDNNLDPDKQTILMSAGAFGVSKGFDTMITDILAKSANAQVVMICGK
SKELKRSLTAKFKSNENVLILGYTKHMNEWMASSQLMITKPGGITITEGFARCIPMIFLNPAPGQELENALYFEEKGFGK
IADTPEEAIKIVASLTNGNEQLTNMISTMEQDKIKYATQTICRDLLDLIGHSSQPQEIYGKVPLYARFFVK
>P9WF04 3.2.1.-~~~~~~Unsaturated 3S-rhamnoglycuronyl hydrolase~~~
MNHTKLKLSAVALTLALGLSACSGESPEKQVQSAESEQMKAVDVDKSMPMQSIESTAKRIGESAANWQIAQFGNLDYIPE
SHRAKSENAKFWIQASFYIGLTRWIDATDDKQLESFVKQVAEKENYELILERPYHADDHAIAQTYLWLAERAGVQEAYMP
TKEVFDMILSKPPQVGLNMGDSESSSGKYHLEGNCQLRWCWADALFMAPRAWAQMTKVTSDPKYLEYGNKEFWAAADYLF
SDEYGLFFRDSRYFDAKSDNGEPVFWGRGNGWVFAAIPMIIEELPEGHPSKDRYIELYKKHAEGLMALQKEDGYWPASLM
DPDKVRTPEVSGTGFITFGLAWGVNNGILTDQRSKDVVEKGWSAITKAVTDDGRVNWVQHVGKSPDPVKESDSQLYGTGA
VLLAASEMLIWNK
>L7P9J4 3.2.1.-~~~~~~Unsaturated 3S-rhamnoglycuronyl hydrolase~~~
MNKSILLLVTLLSLYSCTDTEKTPLEEKDVFNEDYIKTSMIKALEWQEAHPIFAIHPTDWTNGAYYTGVARAHHTTKNMM
YMAALKNQAVANNWQPYTRLYHADDVAISYSYLYVAENEKRRNFSDLEPTKKFLDTHLYEDNAWKAGTNRSKEDKTILWW
WCDALFMAPPVINLYAKQSEQPEYLDEMHKYYMETYNRLYDKEEKLFARDSRFVWDGDDEDKKEPNGEKVFWSRGNGWVI
GGLALLLEDMPEDYKHRDFYVNLYKEMASRILEIQPEDGLWRTSLLSPESYDHGEVSGSAFHTFALAWGINKGLIDKKYT
PAVKKAWKAMANCQHDDGRVGWVQNIGAFPEPASKDSYQNFGTGAFLLAGSEILKMR
>P0AGA6 ~~~uhpA~~~Transcriptional regulatory protein UhpA~~~COG2197
MITVALIDDHLIVRSGFAQLLGLEPDLQVVAEFGSGREALAGLPGRGVQVCICDISMPDISGLELLSQLPKGMATIMLSV
HDSPALVEQALNAGARGFLSKRCSPDELIAAVHTVATGGCYLTPDIAIKLASGRQDPLTKRERQVAEKLAQGMAVKEIAA
ELGLSPKTVHVHRANLMEKLGVSNDVELARRMFDGW
>P09835 2.7.13.3~~~uhpB~~~Signal transduction histidine-protein kinase/phosphatase UhpB~~~COG3851
MKTLFSRLITVIACFFIFSAAWFCLWSISLHLVERPDMAVLLFPFGLRLGLMLQCPRGYWPVLLGAEWLLIYWLTQAVGL
THFPLLMIGSLLTLLPVALISRYRHQRDWRTLLLQGAALTAAALLQSLPWLWHGKESWNALLLTLTGGLTLAPICLVFWH
YLANNTWLPLGPSLVSQPINWRGRHLVWYLLLFVISLWLQLGLPDELSRFTPFCLALPIIALAWHYGWQGALIATLMNAI
ALIASQTWRDHPVDLLLSLLVQSLTGLLLGAGIQRLRELNQSLQKELARNQHLAERLLETEESVRRDVARELHDDIGQTI
TAIRTQAGIVQRLAADNASVKQSGQLIEQLSLGVYDAVRRLLGRLRPRQLDDLTLEQAIRSLMREMELEGRGIVSHLEWR
IDESALSENQRVTLFRVCQEGLNNIVKHADASAVTLQGWQQDERLMLVIEDDGSGLPPGSGQQGFGLTGMRERVTALGGT
LHISCLHGTRVSVSLPQRYV
>P09836 ~~~uhpC~~~Membrane sensor protein UhpC~~~COG2271
MLPFLKAPADAPLMTDKYEIDARYRYWRRHILLTIWLGYALFYFTRKSFNAAVPEILANGVLSRSDIGLLATLFYITYGV
SKFVSGIVSDRSNARYFMGIGLIATGIINILFGFSTSLWAFAVLWVLNAFFQGWGSPVCARLLTAWYSRTERGGWWALWN
TAHNVGGALIPIVMAAAALHYGWRAGMMIAGCMAIVVGIFLCWRLRDRPQALGLPAVGEWRHDALEIAQQQEGAGLTRKE
ILTKYVLLNPYIWLLSFCYVLVYVVRAAINDWGNLYMSETLGVDLVTANTAVTMFELGGFIGALVAGWGSDKLFNGNRGP
MNLIFAAGILLSVGSLWLMPFASYVMQATCFFTIGFFVFGPQMLIGMAAAECSHKEAAGAATGFVGLFAYLGASLAGWPL
AKVLDTWHWSGFFVVISIAAGISALLLLPFLNAQTPREA
>P0AGC0 ~~~uhpT~~~Hexose-6-phosphate:phosphate antiporter~~~COG2271
MLAFLNQVRKPTLDLPLEVRRKMWFKPFMQSYLVVFIGYLTMYLIRKNFNIAQNDMISTYGLSMTQLGMIGLGFSITYGV
GKTLVSYYADGKNTKQFLPFMLILSAICMLGFSASMGSGSVSLFLMIAFYALSGFFQSTGGSCSYSTITKWTPRRKRGTF
LGFWNISHNLGGAGAAGVALFGANYLFDGHVIGMFIFPSIIALIVGFIGLRYGSDSPESYGLGKAEELFGEEISEEDKET
ESTDMTKWQIFVEYVLKNKVIWLLCFANIFLYVVRIGIDQWSTVYAFQELKLSKAVAIQGFTLFEAGALVGTLLWGWLSD
LANGRRGLVACIALALIIATLGVYQHASNEYIYLASLFALGFLVFGPQLLIGVAAVGFVPKKAIGAADGIKGTFAYLIGD
SFAKLGLGMIADGTPVFGLTGWAGTFAALDIAAIGCICLMAIVAVMEERKIRREKKIQQLTVA
>P27670 ~~~uhpT~~~Hexose-6-phosphate:phosphate antiporter~~~
MLAFLNQVRKPTLDLPLDVRRKMWFKPFMQSYLVVFIGYLTMYLIRKNFNIAQNDMISTYGLSMTELGMIGLGFSITYGV
GKTLVSYYADGKNTKQFLPFMLILSAICMLGFSASMGAGSTSLFLMIAFYALSGFFQSTGGSCSYSTITKWTPRRKRGTF
LGFWNISHNLGGAGAAGVALFGANYLFDGHVIGMFIFPSIIALIVGFIGLRFGSDSPESYGLGKAEELFGEEISEEDKET
EENEMTKWQIFVEYVLKNKVIWLLCFSNIFLYVVRIGIDQWSTVYAFQELKLSKEVAIQGFTLFEVGALVGTLLWGWLSD
LANGRRALVACVALALIIATLGVYQHASNQYVYLASLFALGFLVFGPQLLIGVAAVGFVPKKAIGAADGIKGTFAYLIGD
SFAKLGLGMIADGTPVFGLTGWAGTFAALDAAAIGCICLMAMVAVMEERKIRREKKIQQVNIA
>P0CE45 ~~~uidB~~~Glucuronide carrier protein~~~
MNQQLSWRTIVGYSLGDVANNFAFAMGALFLLSYYTDVAGVGAAAAGTMLLLVRVFDAFADVFAGRVVDSVNTRWGKFRP
FLLFGTAPLMIFSVLVFWVPTDWSHGSKVVYAYLTYMGLGLCYSLVNIPYGSLATAMTQQPQSRARLGAARGIAASLTFV
CLAFLIGPSIKNSSPEEMVSVYHFWTIVLAIAGMVLYFICFKSTRENVVRIVAQPSLNISLQTLKRNRPLFMLCIGALCV
LISTFAVSASSLFYVRYVLNDTGLFTVLVLVQNLVGTVASAPLVPGMVARIGKKNTFLIGALLGTCGYLLFFWVSVWSLP
VALVALAIASIGQGVTMTVMWALEADTVEYGEYLTGVRIEGLTYSLFSFTRKCGQAIGGSIPAFILGLSGYIANQVQTPE
VIMGIRTSIALVPCGFMLLAFVIIWFYPLTDKKFKEIVVEIDNRKKVQQQLISDITN
>P0CE44 ~~~uidB~~~Glucuronide carrier protein homolog~~~COG2211
MNQQLSWRTIVGYSLGDVANNFAFAMGALFLLSYYTDVAGVGAAAAGTMLLLVRVFDAFADVFAGRVVDSVNTRWGKFRP
FLLFGTAPLMIFSVLVFWVLTDWSHGSKVVYAYLTYMGLGLCYSLVNIPYGSLATAMTQQPQSRARLGAARGIAASLTFV
CLAFLIGPSIKNSSPEEMVSVYHFWTIVLAIAGMVLYFICFKSTRENVVRIVAQPSLNISLQTLKRNRPLFMLCIGALCV
LISTFAVSASSLFYVRYVLNDTGLFTVLVLVQNLVGTVASAPLVPGMVARIGKKNTFLIGALLGTCGYLLFFWVSVWSLP
VALVALAIASIGQGVTMTVMWALEADTVEYGEYLTGVRIEGLTYSLFSFTRKCGQAIGGSIPAFILGLSGYIANQVQTPE
VIMGIRTSIALVPCGFMLLAFVIIWFYPLTDKKFKEIVVEIDNRKKVQQQLISDITN
>Q47706 ~~~uidC~~~Membrane-associated protein UidC~~~
MRKIVAMAVICLTAASGLTSAYAAQLADDEAGLRIRLKNELRRADKPSAGAGRDIYAWVQGGLLDFNSGYYSNIIGVEGG
AYYVYKLGARADMSTRWYLDGDKSFGFALGAVKIKPSENSLLKLGRFGTDYSYGSLPYRIPLMAGSSQRTLPTVSEGALG
YWALTPNIDLWGMWRSRVFLWTDSTTGIRDEGVYNSQTGKYDKHRARSFLAASWHDDTSRYSLGASVQKDVSNQIQSILE
KSIPLDPNYTLKGELLGFYAQLEGLSRNTSQPNETALVSGQLTWNAPWGSVFGSGGYLRHAMNGAVVDTDIGYPFSLSLD
RNREGMQSWQLGVNYRLTPQFTLTFAPIVTRGYESSKRDVRIEGTGILGGMNYRVSEGPLQGMNFFLAADKGREKRDGST
LGDRLNYWDVKMSIQYDFMLK
>P0ACT6 ~~~uidR~~~HTH-type transcriptional regulator UidR~~~COG1309
MMDNMQTEAQPTRTRILNAAREIFSENGFHSASMKAICKSCAISPGTLYHHFISKEALIQAIILQDQERALARFREPIEG
IHFVDYMVESIVSLTHEAFGQRALVVEIMAEGMRNPQVAAMLKNKHMTITEFVAQRMRDAQQKGEISPDINTAMTSRLLL
DLTYGVLADIEAEDLAREASFAQGLRAMIGGILTAS
>P9WF07 4.2.2.-~~~~~~Ulvan lyase, long isoform~~~
MKCLKTLLVSTTLLTAFSLNAEVTLEQQVKITEEGLHFDGRNLDFSNVGTPDTGEKYDYFFGPNISAHGDAVKTYRHYVF
MTWYKGGKNERNVMLSRYNTLNGELSTIEFPHKHTGFRGDPLVGESHNTIGLSVSPINGTIHMVFDMHAYDNNNHGGKFK
DDFFRYSFSVAGAAELPHSEFTLDKFVKDTSEVSQGDDDYKHLTMTGDLDDKGNFARLTYPKFFTTVDGTLLLYMRLGGN
NNGAYVFNRYDAETESWSTFTKFNENNQKLKGNQYNWGLYGNMKYVNGKLRVGFQQRSNDNSDKYKYQNGVYYAYSDHPD
GFGDWKNHKGEPMTWPLINSDEIKVFEPGDYISHTEANSVYIVGSFDWTVTEKGDIHIISKVRSTDRGRADYEEVYIHSY
KPAGAEEFIISTDFPGASEIYTSGDNVYIVGLEGGRPYVEKAQGGTNNFIRVYEASDGPVFDHGTLYIKDGKVYYYLMER
TSGTAMPLYLQIIDLDLESDANAPIVSFPSPSVTVNQGYEKLSLNISAESPVEGRSIQSVSLYIDDELVRTDDSLPFLFG
HGSKPHETGALGWLDRHEPNPSPLSAGRHVFKAVAVDSEGDSSTATMILNVNSNAPIVSFPQESLEVDEGFERLSLNISA
ESAVEGRTIESVSLYIDGGLVRTDTSLPYLFGHASKPHETGAMGWLDTHSPNPSPLAAGSYEFTAVATDNEGEETTASML
LVVKGEPEPPIVTWPNSTVTVYEGYEKLAITIDAETPVEGRDIQSVTLFRNGELVRVDTRPVWNFGHSFAPYEFGAMGWL
DRHEPNPSPLGVGTHTFTAVARDSAGLESETDMALIVLSLPGPSVMINEGDISLLTEYQNLAITAEASAADDDISLVSLA
LYIDEQLIREIYEPPFIWGSDAYSTELLSLTEGTHLVRVVATDSNNKQSESSIFINIDLLGDLNKDSIVDKADTRLFTSK
LRAGEIMDIRYDFNGDGVVNNRDTRGLVRRCTYSRCSSN
>A0A1Z4F647 4.2.2.-~~~ullA~~~Ulvan lyase, long isoform~~~
MKCLKTLLVSTTLLGAFSLNAEVTLEQQIKITDEGLHFDGRNLDFSNVGSPDTGEKYDYFFGPNISAHGDAVKTYKHYVF
MTWYKGGKNERNVMLSRYNTLSGELSTIEFPHRHTGFRGDPLVGESHNTIGLAVSPINGTIHMVFDMHAYDDNNHGGKFK
DDFFRYSYSVPGAAELPHSEFTLDKFVKDTSEVSQGDDDYKHLTMTGDLGDKGNFARLTYPKFFTTVDGTLLLYMRLGGN
NNGAYVFNRYDAEAEKWSTFTPFNENNQKSKGNPYNWGLYGNMKYINGKLRVGFQQRSSDNTDKYKYQNGVFYAYSDHPD
GFGDWKNHKGEPMTWPLINSDEIKVFEPGDYISHTEANSVYIVGSFDWTVTEKGDIHIISKVRSTDRNRPDYEEVYIHSY
KPAGADEFIISTDFTGASEIYTSGDNVYIVGLEGGRPYVEKAQGGTNNFVRVYKATDGPVFDHGTLYIKDGKVYYYLMER
TSGNAMPLYLQIIDLDLESDANAPLVSFPSPSLTVEQGFEKLSLNISAESPVEGRFIQSVSLYINDELVRTDDSMPYLFG
HGSKPHETGAMGWLDTHEPNPSPLPAGTHIFKAVAVDSEGDSAIATMVLNVNSNAPIVSFPQESLEVDEGFEKLSLNISA
ESAVEGRSIESVSLYINGELVRTDTSLPYLFGHASKPHETGAMGWLDTHSTNPSPLTAGTYEFTAVAIDSEGEESTASMQ
LVVKGEPQPPAVTWPNSTVTVYEGYEKLAITIDAESPVEGRDIQSVTLYRNGELVRVDTRPVWNFGHSFAPYEFGAMGWL
DRHEPNPSPLGVGTHTFTAVAKDSTGLEGESDMTLIVLSLPGPSITINESDVSLLTEYQNLAITADASTANDDITIVSLA
LYLNEQLVREIYEPPFEWGGENYSDELLDLPVGTHLAKVVATDSNNNQTEASMFVTIELLGDLNKDSVVDNKDIRLFTAA
LRNGEEMNIRYDFNDDGVVNNRDTRGLVHRCTYSRCGSN
>A0A2Z6UD27 4.2.2.-~~~ullA~~~Ulvan lyase, long isoform~~~
MTAQKSKYFNRIMTMNTLLFSLLTVGFSQAYADVVLEKQVKITDDGLHFDGKDLNHGNIDSADPGEKYDFFFGPNISAHG
DAVKTYKHYVFMTWYKGGKSERNVMLSRYNTQTQTIATIEFPHRHTGFRGNPLIGESHNTIGLAVSPNNGTIHMVYDLHA
YDDNNHDGKFKDDFFRYSFSVENAADLPDDQFTLDKFVKDTSSISQGPDDYKHISMTGDIADKSNFARLTYPKFFTTTDG
TLLLYMRLGGNNNGAYVFNRYDEETQTWSKFTKFNENNQKNFGNPYNWGLYGNMKYVNGKLRVGFQQRSSDNNDRYQYQN
GVYYAYSDHPEGFGDWKNHKDDEITYPLVNSDEIKVLEPGDYISHQEANSVYIVGSFDWTVTKKGDVHFISKVRSTNRSR
PDYEEEYLHSYKPAGADEFITTTDFTGASQIYTAGDNIYIVGLKNGRPYVERAKGGTNDFVRVYEATSGPVFDHGTLYIK
DGKVYYYLMEKTSGNAMPLYLQIIDLDLESEANAPQVAFPSTSLTVEQGYEQLSLGIDATSSIEGRTIESVTLYLNDELV
RTDTTVPYLFGHASKPHETGAMGWKDEHEPNPNPLGPGEHIFKAVAVDSEGDTGLATMRLTVQSNAPMVSFPTQLIEVDE
GYEKLSVSVDASSSVEGRTIESVTLFINGEEVRTDTTIPYLWGHGSKPHETGAMGWREDHAPNPNPFLAGEYVFTAIATD
SQGEQSETSMTLIVNGEATPPIVTWPNEVVTVTEGYKRLGITIEAEASSENATIESVTLYRNDELVRVDTKYKWNFGHSF
APYEFGAMGWLETHEPNPSPLLAGTHTFKVVAKDSTGLEGEAFMTLIVLPPAGPSISFDEPDIELMTGYESLSVSTYVAT
VNESVDIISVALFIDDVLVRENLEAPYVWGDANHPNELLSLEVGSYEFKAIARDTNDQVSEVSLLVSITLFGDFDGDNDV
DRTDVRAFSSAIRSGEALDQRYDFNEDGVVDRSDTRGLTKICSRPRCAS
>P9WF06 4.2.2.-~~~~~~Ulvan lyase, long isoform~~~
MNGLKMLLFSTTLLTAFTLHAQVTLKQQVKITDEGLHFDGRNLDFSNVGTPDTGEKYDFFFGPNISAHGDAVKTYKHYVF
MTWYKGGKSERNVMLSRYNTLSGELSTIEFPHRHTGFRGDPLVGESHNTIGLSVSPINGTIHMVFDMHAYDNNNHDGKFK
DDFFRYSYSIAGAAELPHSEFTLDKFVKDTSEVSQGENDYKHLTMTGDLSDKGNFARLTYPKFFTTVDGTLLLYMRLGGN
NNGAYVFNRYDAETETWSTFTKFNENNQKLKGNPYNWGLYGNMKYVNGKLRVGFQQRSNDNSDKYKYQNGVYYAYSDHPD
GFGDWKNHKGEPMTWPLINSDEIKVFEPGDYVSHTDANSVYIVGSFDWTVTEKGDIHIISKVRSTDRSRPDYEEVYIHSY
KPAGAEDFIISTDFTGASEIYTSGDNVYIVGLEGGRPYVEKAQGGTNNFVRVYEASDGPTFDHGTLYIKDGKVYYYLMER
TSGNAMPLYLQIIDLDLDLESDANAPIVSFPSPSLTVEQGFEKLSLNIAAESPVEVRTIQSVTLYINDELVRTDTSLPYL
FGHGSKPHETGAMGWLDTHEPNPSPLPAGRHIFKAVAVDSEGDSSVATMMLTVNSNAPIISFPQESLEVDEGFEKLSLNI
SAESAVEGRTIESVSLYIDGEFVRTDTSLPYLFGHASKPHETGAMGWLDTHSPNPSPLTSGTYEFTAVAIDSEGEESTAT
MQLVVKGEPEPPVVTWPNSTVTVYEGYEKLAITIDAESPVEGRDIQSVTLYRNGELVRVDTRPVWNFGHSHAPYEFGAMG
WLDRHDPNPAPLSVGTHTFTAVARDSAGLETESDMTLIVLSLPGPSVMINESDISLLTEYQNLSITADASTANDDTSLVS
LALYIDDQLVREIYEPPFEWGADGYSNELLELSEGSHLARVVATDSNNKQSESSIFINIDLLGDLNKDSVVDKGDTRLFT
AKLRAGETMDIRYDFNGDGVVNNRDTRGLIRRCTYSRCTSN
>A0A109PTH9 4.2.2.-~~~~~~Ulvan lyase, short isoform~~~
MKINLSMRELVSRLSTTLKTAIALSVLTACTANDSVSLTSNISNTSGVLLESQTKITDGALHFDGKKLNHNTFENPSKSQ
AYDYFFGRNISAHGDAVKPYKHFVFMTWYKGGKEERNVMLSRFNTKTGVVKTIQFPHRHTGFRGDPLVGESHNTIGLAVS
PLNGTIHMVYDMHAYVDDDETGRFKGRFVDDFFRYSFSVAGAADVPDDEFTLEQFVKDTSELSQGADDYKHLTMTGNLQD
KENFSALTYPKFYTSDDGELLHYMRWGGNNNGAYYFNKYDAKNQKWTRFTPFNHKDQKTHGNAYNWGLYGQMKYINGKLR
VGFQQRSANNDDRFKYQNGVYYAYSDHPDGLGNWKNVDGEDMTWPLVNSDEIKIFEPGDYIDHTAPNSVHIVTGFDWTVT
ENDDVHFITHVRSTDTKRSDYKEVSIHAFKPANAVDFTITTDFTGADSIYTSGDSIFIIGLKNGYPFVEKAKGGSNDFEV
VYQQASGVKFDHGTIHIENGKAYYYLMEKGAGNALPLHLQVIDLGVTE
>A0A0X9SHN5 4.2.2.-~~~~~~Ulvan lyase, short isoform~~~
MKLNLKASGVARQLTTLAKTVAALSVLTACASSNTATVSSAMSKANGVFLESQTKITNGALHFDGKKLNHNTFEKPSLGP
EYDYFFGKNISAHGDAVKPYKHYVFMTWYKGGKEQRNVMLSRFNTKTGVVKTIQFPHRHTGFRGNPLVGESHNTIGLAVS
PKNGTIHMVYDMHAYVDDDESGRFKGRFVDDFFRYSFSVPGAADVPDDEFTLEKFVKDTSEVSQGTDDFKHLTMTGNLED
KDNFSALTYPKFYKSKEGELLHYMRWGGNNNGAYYFNKYDAEKQVWTRFTPFNHKDQETHGNAYNWGLYGQMKYINGKLR
VGFQQRSANNNDRYKYQNGVYYAYSDHPDGLGDWKNVDGENMTWPLVDSDEIKIFEPGDYIDHQEPNSVHIVGGFDWTVT
ENEDLHFITHVRSTNTKRSDYKEVSIHAFKPANAKDFTVTTDFTGADSIYTSGDSIFIIGLKNGYPFVEKAKGGTNEFEV
VYQQTSGVKFDHGTIHIENGKAYYYLMEKGAGNSLPLHLQVIDLGVSR
>P9WF05 4.2.2.-~~~~~~Ulvan lyase~~~
MIRNDTMLKGQFVLKKTQIALSAALMGSVLLTGCVTQKSNTESNGPKDTQVSYVEYFADNAVGNPLAIVQHPAAIHKNGI
TYVSYQGPKEDPFVATYNHSTKEWSGPFKAGTSELGRRDGGKKFDNHGKPTMLIDDEGYVHIFYGGHGGHSSNGKNPLGN
THFGANKHAVSKKPYDITQWEDLDNITPFGTYNQVVKMDNGDIYLFFRHGAHRSDWVYQKSVDNGRTFSSPVSFLKHKRR
TDIEAVDSWYAWVGKGEGDNLIVSYDYHVCWDGGAGINGRGHTTERHDVYFMNFNTKTNQWSNVEGESLALPVTKEVADD
KTLAMKTGALWTFNGTSHLDNEGHPHIAINAGVDRGAKTGGPKQTRHVWWDGKKWLGGNNIIEGYQGVSRGDFRVTDPSD
IRYLVTYEKEGDAVLSWWDSDEDGNAFSEGSTVLRKNNATFAISALIENAHPEAQMLVAEKESDENIKIYLVGEDGPVPR
ALSNL
>A0A084JZA8 4.2.2.-~~~~~~Ulvan lyase~~~
MIIKQYLLKISLCVLLLGCDSAKKEISKNENSKTEVDYFADNGFGNAVALVQHPSGVYHNGITYVAYQGPLEDPYVASYN
HETKEWKGPFKAGISEMGKDPSRKKKIDNHGKPALLIDNAGYVHIAFGGHGGMRHHGENTLGNYSYGKNLHAVSKKPYDI
SEWETRDNVSLFGTYSQFIKMDNGDIYLFYRHGAHRSDWVYQKSMDNGVTFSEPVSFLKHKRRTNIEAEDSWYPWVSRGN
ADDIIVAFDYHICRDNVNAQDARGHIPERHNVYYMVFDTKNGQWKNVKNERLQMPLTKEMADEKTLVRSIPNDWTFQGIT
DVDPDGNPHVAVLVGPDINARRSGPKRLQHFRWDGQQWLKSNTANLPRGDGDLEVTSATEVSIYLENKTSNDVGEISRWD
SFNGGESFQKSKVFLQRENSGFVISSLIDNPHPDARIIVAEKEEGTDFRKMYLLGDNGPIKRSKKEAQVLND
>A0A1W2VMZ5 4.2.2.-~~~~~~Ulvan Lyase-PL25~~~
MNLNKTLRKNSPSGYKALLTFSIICGLMATGCAHQESLPNSTANSVDRQVGYFADNGVGNPLAIVQHPAGIHKNGITYVS
YQGPKEDPYIASYNHQTGQWQGPFRAGISELGRRDGGKKFDNHGKPTMLIDDEGYIHIFYGGHGGQASNGKNPLGNTHHG
ANKHAVSKRPYDISQWEDLNNITPFGTYNQAIKMDNGDIYLFFRHGAHRSDWVYQKSVDNGRTFASPVSFLKHKRRTDID
AVDSWYAWAGKGQGDNIIVSYDYHVCWDGGAGVNGRGHTTERHDVYFMSFNTKTGEWSNVEGEKLVLPVTREVADEKTMA
MRTGELWTFNGSTHLDAQGQPHIAINAGIDKGAKTGGPKQTRHVRWNGNEWVGGDKVIPQYERVSRGDFMVTDPENIRYL
TTYNQDNDAVLSWWQSHDGGEHFVEDKTVLRKDNASFAISAFIKDAIPDAQMLVAEKVSDEGIKMYLVGEEGAVTRSLVD
LKTAMPTSK
>G8G2V6 4.2.2.-~~~~~~Ulvan lyase NLR42~~~
MVFFKDLFIFKSLIKGSLYSGHMKKKLLNYLPLFALMLFTVSMMAQTAPDEDTNSSIACPSSGVFQNNTTRDVDIANPDN
VGTVDDRTCYADYYETSVYGETWGAYNITFNSNHWDAPNTLQPRIERSLSRSQETGVGSYARFTGTLRILEVGNTGTFGS
TGSYLMQAKGKHTGGGGSNDPAICLYLARPVYGPDANGNQVQVSFDIWREQINFRGGSGAAGRTEVFLRNVLKDEIIDIE
LEVGFRQDPNDPNLKIHYSDAIIGGQVFNWNIPEPERGRESGIRYGVYRVKGGRAQMRWANTTYQKVEVVDNSTIPAADI
YRIKNVETGEYLTSSGSSIIASTSGTGSDKEWEIISAGSGSSYVNIDSQVRGIIRFTGGSSNPGLVSTNFSPPNTDTDKV
WTVIDNNDGTVSFETRNLGRFLYHDTNNMITHSANIDDRSKWNLESTTLSVDSQQIASVGVYPNPTVDGFTISLDNISAE
KVQIFNLLGMLVYEQKTNESSIHIDNMDNFDSGMYIISVTANDNKVYQTKLIVN
>A0A084JZF2 4.2.2.-~~~~~~Ulvan lyase NLR48~~~
MRKLKYNTTRVILMIAFISLSACSSEDAMIEEEQVIPDPDPVAQTDEDTGPVVDCTNQGTNPTRDTDIPNPRNIGDIDDR
SCYANYSESSILGKFWGIYNITDGSNHMDAPNTLQPRIERSLSRSQATGAGSYARFRGVLRILEVGDTGTFSSSGSYFMQ
AKGKHTGGGGSPDPAICLYRAHPVYGDDGNGNQVQVSFDIWREQINFRGGSGSAGRTEVFLKNVLKNEQIDIELEVGFRD
DPNNPGQTLHYADAKIGGEEFNWNIPEPERGIESGIRYGAYRVKGGRAQFRWANTSYTKDEVN
>P39301 ~~~ulaA~~~Ascorbate-specific PTS system EIIC component~~~COG3037
MEILYNIFTVFFNQVMTNAPLLLGIVTCLGYILLRKSVSVIIKGTIKTIIGFMLLQAGSGILTSTFKPVVAKMSEVYGIN
GAISDTYASMMATIDRMGDAYSWVGYAVLLALALNICYVLLRRITGIRTIMLTGHIMFQQAGLIAVTLFIFGYSMWTTII
CTAILVSLYWGITSNMMYKPTQEVTDGCGFSIGHQQQFASWIAYKVAPFLGKKEESVEDLKLPGWLNIFHDNIVSTAIVM
TIFFGAILLSFGIDTVQAMAGKVHWTVYILQTGFSFAVAIFIITQGVRMFVAELSEAFNGISQRLIPGAVLAIDCAAIYS
FAPNAVVWGFMWGTIGQLIAVGILVACGSSILIIPGFIPMFFSNATIGVFANHFGGWRAALKICLVMGMIEIFGCVWAVK
LTGMSAWMGMADWSILAPPMMQGFFSIGIAFMAVIIVIALAYMFFAGRALRAEEDAEKQLAEQSA
>P69822 2.7.1.194~~~ulaB~~~Ascorbate-specific PTS system EIIB component~~~COG3414
MTVRILAVCGNGQGSSMIMKMKVDQFLTQSNIDHTVNSCAVGEYKSELSGADIIIASTHIAGEITVTGNKYVVGVRNMLS
PADFGPKLLEVIKEHFPQDVK
>Q9EXD8 2.7.1.194~~~ulaB~~~Ascorbate-specific PTS system EIIB component~~~
MENKNLHIIAACGNGMGTSMLIKIKVEKIMKELGYTAKVEALSMGQTKGMEHSADIIISSIHLTSEFNPNAKAKIVGVLN
LMDENEIKQALSKVL
>P69820 ~~~ulaC~~~Ascorbate-specific PTS system EIIA component~~~COG1762
MKLRDSLAENKSIRLQAEAETWQEAVKIGVDLLVAADVVEPRYYQAILDGVEQFGPYFVIAPGLAMPHGRPEEGVKKTGF
SLVTLKKPLEFNHDDNDPVDILITMAAVDANTHQEVGIMQIVNLFEDEENFDRLRACRTEQEVLDLIDRTNAAA
>P39304 4.1.1.85~~~ulaD~~~3-keto-L-gulonate-6-phosphate decarboxylase UlaD~~~COG0269
MSLPMLQVALDNQTMDSAYETTRLIAEEVDIIEVGTILCVGEGVRAVRDLKALYPHKIVLADAKIADAGKILSRMCFEAN
ADWVTVICCADINTAKGALDVAKEFNGDVQIELTGYWTWEQAQQWRDAGIGQVVYHRSRDAQAAGVAWGEADITAIKRLS
DMGFKVTVTGGLALEDLPLFKGIPIHVFIAGRSIRDAASPVEAARQFKRSIAELWG
>Q8XDI5 5.1.3.22~~~ulaE~~~L-ribulose-5-phosphate 3-epimerase UlaE~~~COG3623
MLSKQIPLGIYEKALPAGECWLERLQLAKTLGFDFVEMSVDETDERLSRLDWSREQRLALVNAIVETGVRVPSMCLSAHR
RFPLGSEDDAVRAQGLEIMRKAIQFAQDVGIRVIQLAGYDVYYQEANNETRRRFRDGLKESVEMASRAQVTLAMEIMDYP
LMNSISKALGYAHYLNNPWFQLYPDIGNLSAWDNDVQMELQAGIGHIVAVHVKDTKPGVFKNVPFGEGVVDFERCFETLK
QSGYCGPYLIEMWSETAEDPAAEVAKARDWVKARMAKAGMVEAA
>P39305 5.1.3.22~~~ulaE~~~L-ribulose-5-phosphate 3-epimerase UlaE~~~COG3623
MLSKQIPLGIYEKALPAGECWLERLQLAKTLGFDFVEMSVDETDDRLSRLNWSREQRLALVNAIVETGVRVPSMCLSAHR
RFPLGSEDDAVRAQGLEIMRKAIQFAQDVGIRVIQLAGYDVYYQEANNETRRRFRDGLKESVEMASRAQVTLAMEIMDYP
LMSSISKALGYAHYLNNPWFQLYPDIGNLSAWDNDVQMELQAGIGHIVAVHVKDTKPGVFKNVPFGEGVVDFERCFETLK
QSGYCGPYLIEMWSETAEDPAAEVAKARDWVKARMAKAGMVEAA
>P39306 5.1.3.4~~~ulaF~~~L-ribulose-5-phosphate 4-epimerase UlaF~~~COG0235
MQKLKQQVFEANMELPRYGLVTFTWGNVSAIDRERGLVVIKPSGVAYETMKAADMVVVDMSGKVVEGEYRPSSDTATHLE
LYRRYPSLGGIVHTHSTHATAWAQAGLAIPALGTTHADYFFGDIPCTRGLSEEEVQGEYELNTGKVIIETLGNAEPLHTP
GIVVYQHGPFAWGKDAHDAVHNAVVMEEVAKMAWIARGINPQLNHIDSFLMNKHFMRKHGPNAYYGQK
>P39300 3.1.1.-~~~ulaG~~~Probable L-ascorbate-6-phosphate lactonase UlaG~~~COG2220
MSKVKSITRESWILSTFPEWGSWLNEEIEQEQVAPGTFAMWWLGCTGIWLKSEGGTNVCVDFWCGTGKQSHGNPLMKQGH
QMQRMAGVKKLQPNLRTTPFVLDPFAIRQIDAVLATHDHNDHIDVNVAAAVMQNCADDVPFIGPKTCVDLWIGWGVPKER
CIVVKPGDVVKVKDIEIHALDAFDRTALITLPADQKAAGVLPDGMDDRAVNYLFKTPGGSLYHSGDSHYSNYYAKHGNEH
QIDVALGSYGENPRGITDKMTSADMLRMGEALNAKVVIPFHHDIWSNFQADPQEIRVLWEMKKDRLKYGFKPFIWQVGGK
FTWPLDKDNFEYHYPRGFDDCFTIEPDLPFKSFL
>P0A9W0 ~~~ulaR~~~HTH-type transcriptional regulator UlaR~~~COG1349
MTEAQRHQILLEMLAQLGFVTVEKVVERLGISPATARRDINKLDESGKLKKVRNGAEAITQQRPRWTPMNLHQAQNHDEK
VRIAKAASQLVNPGESVVINCGSTAFLLGREMCGKPVQIITNYLPLANYLIDQEHDSVIIMGGQYNKSQSITLSPQGSEN
SLYAGHWMFTSGKGLTAEGLYKTDMLTAMAEQKMLSVVGKLVVLVDSSKIGERAGMLFSRADQIDMLITGKNANPEILQQ
LEAQGVSILRV
>Q6MX39 2.1.1.-~~~umaA~~~S-adenosylmethionine-dependent methyltransferase UmaA~~~COG2230
MTELRPFYEESQSIYDVSDEFFSLFLDPTMAYTCAYFEREDMTLEEAQNAKFDLALDKLHLEPGMTLLDIGCGWGGGLQR
AIENYDVNVIGITLSRNQFEYSKAKLAKIPTERSVQVRLQGWDEFTDKVDRIVSIGAFEAFKMERYAAFFERSYDILPDD
GRMLLHTILTYTQKQMHEMGVKVTMSDVRFMKFIGEEIFPGGQLPAQEDIFKFAQAADFSVEKVQLLQQHYARTLNIWAA
NLEANKDRAIALQSEEIYNKYMHYLTGCEHFFRKGISNVGQFTLTK
>A0A1X9QDU4 ~~~umpA~~~Na(+), Li(+), K(+)/H(+) antiporter subunit A~~~
MNIVYSLANQLKDSLRDLLPVVLVMGFFQLVVIRQPLPDSLALLDLVLGLVFVVVGLTLFIKGLELGLFPLGENLARAFV
KKGSVAWLLIFAFALGFGSTVAEPALIAVAARAGEVVSESGLIADTSAAQASFAFTLRMVVALSVGTAVVVGVLRIFKGW
PLHLMIIAGYIIVLLLTLFAPNELIGIAYDSGGVTTSTITVPLVTALGVGLASTIKGRNPMLDGFGLIAFASVMPIIFVL
LFGIFGLSLVTGST
>A0A1X9QDU5 ~~~umpB~~~Na(+), Li(+), K(+)/H(+) antiporter subunit B~~~
MILLTIFWDTLLDILPIAAIIFGFQYIVIRKRIQRLPQVLAGFFMVWVGLSLFLVGLEQALFPMGELMASQLTNTDFLPA
VEQGVQRHWADYYWVYLFAFAIGASTTIAEPSLIAVSIKAGEISGGTINPFMLRIAVALGMAFGITLGTWRIVMGWPLQW
FVFAAYCLVIIQTLRSPKSIIPLAFDSGGVTTSTITVPIIAALGLGLAASIPGRSALMDGFGMIALACLFPIITVMGYAQ
IAQWKDKRKQTTPHLSYSKAPPPSKGDNNAL
>P04152 ~~~umuC~~~Protein UmuC~~~COG0389
MFALCDVNAFYASCETVFRPDLWGKPVVVLSNNDGCVIARNAEAKALGVKMGDPWFKQKDLFRRCGVVCFSSNYELYADM
SNRVMSTLEELSPRVEIYSIDEAFCDLTGVRNCRDLTDFGREIRATVLQRTHLTVGVGIAQTKTLAKLANHAAKKWQRQT
GGVVDLSNLERQRKLMSALPVDDVWGIGRRISKKLDAMGIKTVLDLADTDIRFIRKHFNVVLERTVRELRGEPCLQLEEF
APTKQEIICSRSFGERITDYPSMRQAICSYAARAAEKLRSEHQYCRFISTFIKTSPFALNEPYYGNSASVKLLTPTQDSR
DIINAATRSLDAIWQAGHRYQKAGVMLGDFFSQGVAQLNLFDDNAPRPGSEQLMTVMDTLNAKEGRGTLYFAGQGIQQQW
QMKRAMLSPRYTTRSSDLLRVK
>P0AG11 3.4.21.-~~~umuD~~~Protein UmuD~~~COG1974
MLFIKPADLREIVTFPLFSDLVQCGFPSPAADYVEQRIDLNQLLIQHPSATYFVKASGDSMIDGGISDGDLLIVDSAITA
SHGDIVIAAVDGEFTVKKLQLRPTVQLIPMNSAYSPITISSEDTLDVFGVVIHVVKAMR
>P22493 3.4.21.-~~~umuD~~~Protein UmuD~~~
MEFFRPTELREIIPLPFFSYLVPCGFPSPAADYIEQRIDLNELLVSHPSSTYFVKASGDSMIEAGISDGDLLVVDSSRNA
DHGDIVIAAIEGEFTVKRLQLRPTVQLIPMNGAYRPIPVGSEDTLDIFGVVTFIIKAVS
>P39615 3.2.2.27~~~ung~~~Uracil-DNA glycosylase~~~COG0692
MKQLLQDSWWNQLKEEFEKPYYQELREMLKREYAEQTIYPDSRDIFNALHYTSYDDVKVVILGQDPYHGPGQAQGLSFSV
KPGVKQPPSLKNIFLELQQDIGCSIPNHGSLVSWAKQGVLLLNTVLTVRRGQANSHKGKGWERLTDRIIDVLSERERPVI
FILWGRHAQMKKERIDTSKHFIIESTHPSPFSARNGFFGSRPFSRANAYLEKMGEAPIDWCIKDL
>Q83CW4 3.2.2.27~~~ung~~~Uracil-DNA glycosylase~~~COG0692
MTTMAETQTWQTVLGEEKQEPYFQEILDFVKKERKAGKIIYPPQKDIFNALKLTPYEAIKVVILGQDPYHGPNQAHGLAF
SVRPGVPAPPSLQNIFKELHADLGVSIPSHGFLEKWAKQGVLLLNAALTVEAGKPQSHANIGWHRFTDKVIESLNDHPEG
IVFLLWGSYAQKKSQLITNLRHRILKAPHPSPLSAARGFLGCRHFSKANQLLHEMGRGEIDWALDEKVS
>Q9RWH9 3.2.2.27~~~ung~~~Uracil-DNA glycosylase~~~COG0692
MTDQPDLFGLAPDAPRPIIPANLPEDWQEALLPEFSAPYFHELTDFLRQERKEYTIYPPAPDVFNALRYTPLGEVKVLIL
GQDPYHGPNQAHGLSFSVRPGVRVPPSLRNIYKELTEDIPGFVAPKHGYLRSWAEQGVLLLNAVLTVRAGQANSHQGKGW
EHFTDAVIKAVNAKEERVVFILWGSYARKKKKLITGKNHVVIESGHPSPLSEQYFFGTRPFSKTNEALEKAGRGPVEWQL
PATVTEE
>P12295 3.2.2.27~~~ung~~~Uracil-DNA glycosylase~~~COG0692
MANELTWHDVLAEEKQQPYFLNTLQTVASERQSGVTIYPPQKDVFNAFRFTELGDVKVVILGQDPYHGPGQAHGLAFSVR
PGIAIPPSLLNMYKELENTIPGFTRPNHGYLESWARQGVLLLNTVLTVRAGQAHSHASLGWETFTDKVISLINQHREGVV
FLLWGSHAQKKGAIIDKQRHHVLKAPHPSPLSAHRGFFGCNHFVLANQWLEQRGETPIDWMPVLPAESE
>P9WFQ9 3.2.2.27~~~ung~~~Uracil-DNA glycosylase~~~COG0692
MTARPLSELVERGWAAALEPVADQVAHMGQFLRAEIAAGRRYLPAGSNVLRAFTFPFDNVRVLIVGQDPYPTPGHAVGLS
FSVAPDVRPWPRSLANIFDEYTADLGYPLPSNGDLTPWAQRGVLLLNRVLTVRPSNPASHRGKGWEAVTECAIRALAARA
APLVAILWGRDASTLKPMLAAGNCVAIESPHPSPLSASRGFFGSRPFSRANELLVGMGAEPIDWRLP
>Q6GJ88 3.2.2.27~~~ung~~~Uracil-DNA glycosylase~~~
MEWSQIFHDITTKHDFKAMHDFLEKEYSTAIVYPDRENIYQAFDLTPFENIKVVILGQDPYHGPNQAHGLAFSVQPNAKF
PPSLRNMYKELADDIGCVRQTPHLQDWAREGVLLLNTVLTVRQGEANSHRDIGWETFTDEIIKAVSDYKEHVVFILWGKP
AQQKIKLIDTSKHCIIKSVHPSPLSAYRGFFGSKPYSKANTYLESVGKSPINWCESEA
>Q9KPK8 3.2.2.27~~~ung~~~Uracil-DNA glycosylase~~~COG0692
MSESLTWHDVIGNEKQQAYFQQTLQFVESQRQAGKVIYPPAKDVFNAFRFTEFGDVKVVILGQDPYHGPNQAHGLCFSVL
PGVKTPPSLVNIYKELAQDIPGFQIPPHGYLQSWAQQGVLLLNTVLTVEQGMAHSHANTGWETFTDRVIDALNQHRNGLI
FLLWGSHAQKKGQMIDRQRHHVLMAPHPSPLSAHRGFLGCRHFSKTNQLLQAQGIAPINWQPELES
>Q7A7I6 ~~~~~~UPF0355 protein SA0372~~~
MADITVVNDTGELYNVINQKKSEGYLESELTIISKSKLHLNDLHDSEISLISTSGTFSDRMTKLLTGEDGEHAVLSRYNL
APDELEKYKQLILDDKMLVVAVRDKSSHKEVQEHNSAYEEIDITHFAEASKGPKA
>A0A0H2VCA1 ~~~upaG~~~Autotransporter adhesin UpaG~~~COG5295
MNKIFKVIWNPATGSYTVASETAKSRGKKSGRSKLLISALVAGGLLSSFGASADNYTGQPTDYGDGSAGDGWVAIGKGAK
ANTFMNTSGASTALGYDAIAEGEYSSAIGSKTLATGGASMAFGVSAKAMGDRSVALGASSVANGDRSMAFGRYAKTNGFT
SLAIGDSSLADGEKTIALGNTAKAYEIMSIALGDNANASKEYAMALGASSKAGGADSLAFGRKSTANSTGSLAIGADSSS
SNDNAIAIGNKTQALGVNSMALGNASQASGESSIALGNTSEASEQNAIALGQGSIASKVNSIALGSNSLSSGENAIALGE
GSAAGGSNSLAFGSQSRANGNDSVAIGVGAAAATDNSVAIGAGSTTDASNTVSVGNSATKRKIVNMAAGAISNTSTDAIN
GSQLYTISDSVAKRLGGGATVGSDGTVTAVSYALRSGTYNNVGDALSGIDNNTLQWNKTAGAFSANHGANATNKITNVAK
GTVSATSTDVVNGSQLYDLQQDALLWNGTAFSAAHGTEATSKITNVTAGNLTAGSTDAVNGSQLKTTNDNVTTNTTNIAT
NTTNITNLTDAVNGLGDDSLLWNKAAGAFSAAHGTEATSKITNVTAGNLTAGSTDAVNGSQLKTTNDNVTTNTTNIATNT
TNITNLTDAVNGLGDDSLLWNKTAGAFSAAHGTDATSKITNVTAGNLTAGSTDAVNGSQLKTTNDNVTTNTTNIATNTTN
ITNLTDAVNGLGDDSLLWNKTAGAFSAAHGTDATSKITNVKAGDLTAGSTDAVNGSQLKTTNDNVSTNTTNITNLTDAVN
GLGDDSLLWNKTAGAFSAAHGTDATSKITNVKAGDLTAGSTDAVNGSQLKTTNDNVSTNTTNITNLTDSVGDLKDDSLLW
NKAAGAFSAAHGTEATSKITNLLAGKISSNSTDAINGSQLYGVADSFTSYLGGGADISDTGVLSGPTYTIGGTDYTNVGD
ALAAINTSFSTSLGDALLWDATAGKFSAKHGINNAPSVITDVANGAVSSTSSDAINGSQLYGVSDYIADALGGNAVVNTD
GSITTPTYAIAGGSYNNVGDALEAIDTTLDDALLWDTTANGGNGAFSAAHGKDKTASVITNVANGAVSATSNDAINGSQL
YSTNKYIADALGGDAEVNADGTITAPTYTIANTDYNNVGEALDALDNNALLWDEDAGAYNASHDGNASKITNVAAGDLST
TSTDAVNGSQLNATNILVTQNSQMINQLAGNTSETYIEENGAGINYVRTNDSGLAFNDASASGIGATAVGYNAVASHASS
VAIGQDSISEVDTGIALGSSSVSSRVIVKGTRNTSVSEEGVVIGYDTTDGELLGALSIGDDGKYRQIINVADGSEAHDAV
TVRQLQNAIGAVATTPTKYYHANSTAEDSLAVGEDSLAMGAKTIVNGNAGIGIGLNTLVLADAINGIAIGSNARANHADS
IAMGNGSQTTRGAQTNYTAYNMDAPQNSVGEFSVGSEDGQRQITNVAAGSADTDAVNVGQLKVTDAQVSQNTQSITNLNT
QVTNLDTRVTNIENGIGDIVTTGSTKYFKTNTDGADANAQGKDSVAIGSGSIAAADNSVALGTGSVADEENTISVGSSTN
QRRITNVAAGVNATDAVNVSQLKSSEAGGVRYDTKADGSIDYSNITLGGGNSGTTRISNVSAGVNNNDAVNYAQLKQSVQ
ETKQYTDQRMVEMDNKLSKTESKLSGGIASAMAMTGLPQAYTPGASMASIGGGTYNGESAVALGVSMVSANGRWVYKLQG
STNSQGEYSAALGAGIQW
>Q5WNX2 3.6.1.27~~~uppP~~~Undecaprenyl-diphosphatase~~~
MALDFIEILKVIFLGIVEGITEWLPISSTGHMLLVDEFITLNMSEAFKEMFFVVIQLGAILAVVVMFWNKMFPFQFKNKS
QSIIKKDTFSLWFKVAVACVPSAIMGILFDDYLDAHLHTPVVIAIMLILYGVLFIVIENRNKKRTATTSTLADISYKTAL
MIGVFQVLSLIPGTSRSGATIIGALLIGVSRVAAAEFTFFLAVPTMLGASAFKLLKFGFDFTSAELLTLVIGMAVAFAVS
VFVIKFLMSYIKKHDFKVFGWYRIVLGILVLLITAI
>P60932 3.6.1.27~~~uppP~~~Undecaprenyl-diphosphatase~~~COG1968
MSDMHSLLIAAILGVVEGLTEFLPVSSTGHMIIVGHLLGFEGDTAKTFEVVIQLGSILAVVVMFWRRLFGLIGIHFGRPL
QHEGESKGRLTLIHILLGMIPAVVLGLLFHDTIKSLFNPINVMYALVVGGLLLIAAECLKPKEPRAPGLDDMTYRQAFMI
GCFQCLALWPGFSRSGATISGGMLMGVSRYAASEFSFLLAVPMMMGATALDLYKSWGFLTSGDIPMFAVGFITAFVVALI
AIKTFLQLIKRISFIPFAIYRFIVAAAVYVVFF
>P9WFF9 3.6.1.27~~~uppP~~~Undecaprenyl-diphosphatase~~~COG1968
MTAAPAMSWWQVIVLAAAQGLTEFLPVSSSGHLAIVSRIFFSGDAGASFTAVSQLGTEAAVVIYFARDIVRILSAWLHGL
VVKAHRNTDYRLGWYVIIGTIPICILGLFFKDDIRSGVRNLWVVVTALVVFSGVIALAEYVGRQSRHIERLTWRDAVVVG
IAQTLALVPGVSRSGSTISAGLFLGLDRELAARFGFLLAIPAVFASGLFSLPDAFHPVTEGMSATGPQLLVATLIAFVLG
LTAVAWLLRFLVRHNMYWFVGYRVLVGTGMLVLLATGTVAAT
>P60473 2.5.1.31~~~uppS~~~Ditrans,polycis-undecaprenyl-diphosphate synthase ((2E,6E)-farnesyl-diphosphate specific)~~~COG0020
MMLSATQPLSEKLPAHGCRHVAIIMDGNGRWAKKQGKIRAFGHKAGAKSVRRAVSFAANNGIEALTLYAFSSENWNRPAQ
EVSALMELFVWALDSEVKSLHRHNVRLRIIGDTSRFNSRLQERIRKSEALTAGNTGLTLNIAANYGGRWDIVQGVRQLAE
KVQQGNLQPDQIDEEMLNQHVCMHELAPVDLVIRTGGEHRISNFLLWQIAYAELYFTDVLWPDFDEQDFEGALNAFANRE
RRFGGTEPGDETA
>P60472 2.5.1.31~~~ispU~~~Ditrans,polycis-undecaprenyl-diphosphate synthase ((2E,6E)-farnesyl-diphosphate specific)~~~COG0020
MMLSATQPLSEKLPAHGCRHVAIIMDGNGRWAKKQGKIRAFGHKAGAKSVRRAVSFAANNGIEALTLYAFSSENWNRPAQ
EVSALMELFVWALDSEVKSLHRHNVRLRIIGDTSRFNSRLQERIRKSEALTAGNTGLTLNIAANYGGRWDIVQGVRQLAE
KVQQGNLQPDQIDEEMLNQHVCMHELAPVDLVIRTGGEHRISNFLLWQIAYAELYFTDVLWPDFDEQDFEGALNAFANRE
RRFGGTEPGDETA
>Q88VJ8 2.5.1.31~~~uppS~~~Ditrans,polycis-undecaprenyl-diphosphate synthase ((2E,6E)-farnesyl-diphosphate specific)~~~COG0020
MFAFFNKNDPADNTDVQLDPERIPAHVAIIMDGNGRWAKARHLPRVAGHKEGMNTVKKITIAASDLGVKVLTLYAFSTEN
WKRPTDEVNYLMQLPVSFFDTFVPDLIKNNVRVQVMGYVDHLPEATQKAVQNAIADTKDCDGMVLNFALNYGSRAEIVTG
VQKIAQQVQDGQLAVGDIDDATIDAALMTAPLAPYNDPDLLIRTSGEERISNFLMWQIAYSELVFTDVKWPDFTAATLQA
CIADFQSRDRRFGGLSDHK
>O82827 2.5.1.31~~~uppS~~~Ditrans,polycis-undecaprenyl-diphosphate synthase ((2E,6E)-farnesyl-diphosphate specific)~~~
MFPIKKRKAIKNNNINAAQIPKHIAIIMDGNGRWAKQKKMPRIKGHYEGMQTVKKITRYASDLGVKYLTLYAFSTENWSR
PKDEVNYLMKLPGDFLNTFLPELIEKNVKVETIGFIDDLPDHTKKAVLEAKEKTKHNTGLTLVFALNYGGRKEIISAVQL
IAERYKSGEISLDEISETHFNEYLFTANMPDPELLIRTSGEERLSNFLIWQCSYSEFVFIDEFWPDFNEESLAQCISIYQ
NRHRRFGGL
>O67914 2.4.2.9~~~upp~~~Uracil phosphoribosyltransferase~~~COG0035
MIVELSHPLIKHKVNTARIQDTSAEKLRKTLKELGFMLVYEALKDILLEEKEVRTWIGNKRFNYLNEEEIVFVPILRAGL
SFLEGALQVVPNAKVGFLGIKRNEETLESHIYYSRLPELKGKIVVILDPMLATGGTLEVALREILKHSPLKVKSVHAIAA
PEGLKRIEEKFKEVEIFVGNVDERLNDKGYIIPGLGDIGDRLYAVSVY
>P70881 2.4.2.9~~~upp~~~Uracil phosphoribosyltransferase~~~
MGKVYVFDHPLIQHKLTYIRDKNTGTKEFRELVDEVATLMAFEITRDLPLEEVEIETPVSKARAKVIAGKKLGVIPILRA
GIGMVDGILKLIPAAKVGHIGLYRDPQTLKPVEYYVKLPSDVEERDFIIVDPMLATGGSAVAAIDALKKRGAKSIKFMCL
IAAPGRVKAVETAHPDVDIYIAALDERLNDHGYIVPGLGDAGDRLFGTK
>Q63VS8 2.4.2.9~~~upp~~~Uracil phosphoribosyltransferase~~~COG0035
MKQDSRFPNLFILDHPLIQHKLTHMRDKDTSTRTFRELLREITLLMGYEITRNLPITTKRVETPLVEIDAPVIAGKKLAI
VPVLRAGVGMSDGLLELIPSARVGHIGVYRADDHRPVEYLVRLPDLEDRIFILCDPMVATGYSAAHAIDVLKRRGVPGER
LMFLALVAAPEGVQVFQDAHPDVKLYVASLDSHLDDHAYIVPGLGDAGDRLFGTKN
>P0A8F0 2.4.2.9~~~upp~~~Uracil phosphoribosyltransferase~~~COG0035
MKIVEVKHPLVKHKLGLMREQDISTKRFRELASEVGSLLTYEATADLETEKVTIEGWNGPVEIDQIKGKKITVVPILRAG
LGMMDGVLENVPSARISVVGMYRNEETLEPVPYFQKLVSNIDERMALIVDPMLATGGSVIATIDLLKKAGCSSIKVLVLV
AAPEGIAALEKAHPDVELYTASIDQGLNEHGYIIPGLGDAGDKIFGTK
>A6TCB0 2.4.2.9~~~upp~~~Uracil phosphoribosyltransferase~~~
MKIVEVKHPLVKHKLGLMREHDISTKRFRELASEVGSLLTYEATADLETEKVTIEGWNGPVEVEQIKGKKITVVPILRAG
LGMMEGVLEHVPSARISVVGIYRNEETLEPVPYFQKLVSNIDERMALVVDPMLATGGSMIATIDLLKNAGCTSIKVLVLV
AAPEGIAALEKAHPDVELYTASVDKGLNEHGYIIPGLGDAGDKIFGTK
>P9WFF3 2.4.2.9~~~upp~~~Uracil phosphoribosyltransferase~~~COG0035
MQVHVVDHPLAAARLTTLRDERTDNAGFRAALRELTLLLIYEATRDAPCEPVPIRTPLAETVGSRLTKPPLLVPVLRAGL
GMVDEAHAALPEAHVGFVGVARDEQTHQPVPYLDSLPDDLTDVPVMVLDPMVATGGSMTHTLGLLISRGAADITVLCVVA
APEGIAALQKAAPNVRLFTAAIDEGLNEVAYIVPGLGDAGDRQFGPR
>P67396 2.4.2.9~~~upp~~~Uracil phosphoribosyltransferase~~~
MSKVHVFDHPLIQHKLSYIRDVNTGTKEFRELVDEVGMLMAYEVTRDLELQDVDIETPVTKMTAKRLAGKKLAIVPILRA
GLGMTDGILSLVPAARVGHIGLYRDPETLKAVEYFAKLPQDITERQIIVVDPMLATGASAIEAITSLKKRGAKNIRFMCL
IAAPEGVEKMHEAHPDVDIYIAALDEKLNDKAYITPGLGDAGDRLFGTK
>Q9WZI0 2.4.2.9~~~upp~~~Uracil phosphoribosyltransferase~~~COG0035
MKNLVVVDHPLIKHKLTIMRDKNTGPKEFRELLREITLLLAYEATRHLKCEEVEVETPITKTIGYRINDKDIVVVPILRA
GLVMADGILELLPNASVGHIGIYRDPETLQAVEYYAKLPPLNDDKEVFLLDPMLATGVSSIKAIEILKENGAKKITLVAL
IAAPEGVEAVEKKYEDVKIYVAALDERLNDHGYIIPGLGDAGDRLFRTK
>Q72J35 2.4.2.9~~~upp~~~Uracil phosphoribosyltransferase~~~COG0035
MRITLVDHPLVQHKLAHLRDKRTGPKDFRELAEEVAMLMAYEAMRDLELEETTVETPIAPARVKVLSGKKLALVAILRAG
LVMVEGILKLVPHARVGHIGLYRDPESLNPVQYYIKLPPDIAERRAFLLDPMLATGGSASLALSLLKERGATGVKLMAIL
AAPEGLERIAKDHPDTEVVVAAIDERLNDHGYIVPGLGDAGDRIYGTK
>O31823 ~~~uptA~~~Undecaprenyl phosphate transporter A~~~COG0586
MGSLISEILTWLTNMGYAGIAIGLMIEIIPSEIVLAYGGYMVSEGTIGFIGAIIAGVIGGTIAQIFIYWIGRYGGRPFLD
KYGKYLLIKKHHIDMSENWFQKYGAGVVFSARFIPVVRHAISIPAGIARMPFLKFVVLTVLAIIPWSILFVYLGIQLGSQ
WDDVENIAGTYTTPIMILAVVVIALYFVIKKRTAIFKR
>P0AGM8 ~~~uraA~~~Uracil permease~~~COG2233
MTRRAIGVSERPPLLQTIPLSLQHLFAMFGATVLVPVLFHINPATVLLFNGIGTLLYLFICKGKIPAYLGSSFAFISPVL
LLLPLGYEVALGGFIMCGVLFCLVSFIVKKAGTGWLDVLFPPAAMGAIVAVIGLELAGVAAGMAGLLPAEGQTPDSKTII
ISITTLAVTVLGSVLFRGFLAIIPILIGVLVGYALSFAMGIVDTTPIINAHWFALPTLYTPRFEWFAILTILPAALVVIA
EHVGHLVVTANIVKKDLLRDPGLHRSMFANGLSTVISGFFGSTPNTTYGENIGVMAITRVYSTWVIGGAAIFAILLSCVG
KLAAAIQMIPLPVMGGVSLLLYGVIGASGIRVLIESKVDYNKAQNLILTSVILIIGVSGAKVNIGAAELKGMALATIVGI
GLSLIFKLISVLRPEEVVLDAEDADITDK
>P0AGM7 ~~~uraA~~~Uracil permease~~~COG2233
MTRRAIGVSERPPLLQTIPLSLQHLFAMFGATVLVPVLFHINPATVLLFNGIGTLLYLFICKGKIPAYLGSSFAFISPVL
LLLPLGYEVALGGFIMCGVLFCLVSFIVKKAGTGWLDVLFPPAAMGAIVAVIGLELAGVAAGMAGLLPAEGQTPDSKTII
ISITTLAVTVLGSVLFRGFLAIIPILIGVLVGYALSFAMGIVDTTPIINAHWFALPTLYTPRFEWFAILTILPAALVVIA
EHVGHLVVTANIVKKDLLRDPGLHRSMFANGLSTVISGFFGSTPNTTYGENIGVMAITRVYSTWVIGGAAIFAILLSCVG
KLAAAIQMIPLPVMGGVSLLLYGVIGASGIRVLIESKVDYNKAQNLILTSVILIIGVSGAKVNIGAAELKGMALATIVGI
GLSLIFKLISVLRPEEVVLDAEDADITDK
>A0A2A5K1W4 ~~~~~~Uracil permease~~~
MKDRVIQVDERLPFLQSIPLSLQHLFAMFGSTVLVPMLLQINPAICLLMNGIGTLIYIFLCKGRIPAYLGSSFAFISPVL
IVISTRSYEAALSGFLVVGLVFCLIGLLVKAVGTGWIEIVFPPAAMGAIVAVIGLELAPTAANMAGFVASAGTEGWSPDP
KVIAVSLVTLLTAVVGNVMFRGFMKIIPILISIIVGYALAAFLGIVDFSIVREAKWFDLPTFYIMKWDWSSIAIIVPAAL
VVVAEHIGHLIVTSNIVGKDLSKDPGLDRSLLGNGVSTVISSFVGSTPNTTYGENIGVLALTRVYSIWIIGGAAVMAIVL
SFVGKLAALIQTIPVPVMGGVSILLFGVIAGSGVRMLVEAKVDYSNPKNLILTAVVLIIGISGAAFKWGNFEMKGMALAT
VIAILLGLFFNIIDKLKWSNE
>C8WLE3 1.3.99.33~~~urdA~~~Urocanate reductase~~~COG1053
MSNLSRRNFITGGAIAALGGTLAIAGCAPKGESSSTVAGAAGEGAQAWTGTANGKGGELTVEVITEGDSIARINPLKSRE
SYGVGTAGIDVLSDLIVKNQTLNVDMVTGATVSSMAFLTAVSDAVDASGMKSSEWKKREKAVPQAPEGLTTDVDVVVVGA
GGAGYAAALTAAEAGKNVVLLEKLGIVGGDTILSGGAMAVPNNWFQKRDGIEDSVEKMAEDMIVGGDHVGDPDLVNVICE
GAYGAMEWLIFNGGVAWQPYERFFGGHSVIRSLIPEGNEGSGIICKLDKRAEGLKNLKVCRNTKADELVQDASGAVVGLK
ATNTATGETYDFKAKAVILAAGGFGSNVEMRMKYNPEMDEKILSTDSVGATGDCHVMAEKIGANLIDMQYIQTYPTCDTQ
TGALLYVGNMRLENRAICINKEGDRFVEEMERRDVISNAIKEQTDGIGYMIFNQDGLDHTDIATVNAAEMDGLFGRGQLA
KGETIAEACEPFGIDAAELQKTVEKWNGYCKDGADPDFNYRAALNPIEGGPYYILAYKPSVHYTMGGLHINTDAQVLDSD
AAPIPGLFAAGEQAGHKMGTNRLGSCSITDVFVFGRVAGANAAALA
>F9UNH3 1.3.99.33~~~urdA~~~Urocanate reductase~~~COG0431
MKFVGIVGTNAQHSYNRMLLEFMQRHFATQAEIEILELTDVPMFDESNDQTDSTIIQNFATKIATADGVIIASPEHNHSV
PSALKSIIEWLSFKIHPLDGQAVMIVGASYSVQGSSRAQLHLRQILDAPGVNASVMPGSEFLLGRAQTAFDDQGNLKVQG
TVDFLDSCFAKFQKFATIVAEMRAPEALSFAPGTYQVTATGHNGELPMRVTLSADRIENIEIDTSSETQGIADVAFERIP
KEIIAGQTLAVDAISGASITSHGVIDGVARAVKEAGANPDDLKKRRATKQVAQPAVKEVTTDVVVVGAGGAGMTAAAKVL
QAGHQAVVLEKFPAVGGNTVRAGGPMNAADPDWQRQFAALPGEKQTLKDLSERDESTIAPEYRADFRKLKQQIDAYLTAN
TNQKGTLFDSTLLHRIQTYLGGQRTDLNGQEIHGQYDLVKELTDNALDSVKWLQSIGVKFDESQVTMPVGAIWRRGHKPM
GDLGFAYIKTLRAFVEQQGGTIMTETPVKELLVTDGQVRGVIATNAAHEKVIVHADAVILASGGFAANTKMLQKYNTYWT
AIDDDVKTTNSPAMTGDGIRLGTSVGAALVGMGFSQMMPVSDPETGELFSGLQVPPANFVMVNQQGKRFVNEYGSRDELT
QAAIDNGSLFYLIADDEIKKTAYNTTQAKIDQQVANGTLFRADTLTDLAQQIGMDPAALTKTIADYNRYVDAGEDPEFHK
TAFDLKVAVAPFYATPRKPATHHTMGGLKIDSDAHVLNTDGQVIDGLYAAGEVAGGIHAGNRLGGNSLSDIFTFGRIAAA
HAVAEHVDPVTA
>B2GCE0 1.3.99.33~~~urdA~~~Urocanate reductase~~~COG1053
MKAGTYKVKAKGHGSSFMPMEVTLSDDAIQRIQVDASGETSGIADEVFKRLPAKIVKGQTLNVDTVAGATISSRGVVGGV
AEAITLAGGDADEWKQRAKPEIATQAAQVEEYQTDVVVVGAGGAGLAAATRSLQHDKQVVILEKFPQLGGNTTRAGGPMN
AADPDWQRDFAALTGEKETLKRLANTPLEQIDPEYRADFERLREQIKEYIASGAQYLFDSNLLHEIQTYLGGKREDLAGH
EIHGRYQLVKTLVDNALDSVKWLADLGVKFDQTDVTMPVGALWRRGHKPVEPMGYAFIHVLGDWVTEHGATILTETRAEH
LLMENGRVVGVVAHKTDGTKVTVRAKSTFLTAGGFGANTPMVQKYNTYWEHIDDDIATTNSPAITGDGISLGQEAGAELT
GMGFIQLMPVSDPVTGELFTGLQTPPGNFIMVNQEGKRFVNEFAERDTLAAAAIAQGGLFYLIADDKIKETAYNTTQESI
DAQVEAGTLFKADTLAELAGKVGMDPATLEDTINKYNSYVDAGHDPEFGKSASHLKCEVAPFYATPRKPAIHHTMGGLAI
DKHGHVLDKAERVIAGLYSAGENAGGLHAGNRLGGNSLADIFTFGRLAADTAAQENG
>Q8CVD0 1.3.99.33~~~urdA~~~Urocanate reductase~~~COG1053
MHYKKSIIGIAVTATAIIAGCQVTHQIVKSQGTAQGKHGEVQVETTFKDGHIVAIDVLKQKENKVLAGAVFKDVKQAIID
NNSIEVDGIAGATVTSKALKEAVGKSIEAAGVTLVATASAKKSEALTPAEYTYDVVIIGSGGAGFSAGLEAIAAGRSAVI
IEKMPIIGGNSLISGAEMNVAGSWVQKNMGITDSKELFISDTLKGGDFKGDPEMVKTMVDNAVGAAEWLRDYVKVEFYPD
QLFQFGGHSVKRALIPKGHTGAEVISKFSIKADEVGLPIHTNTKAEKLIQDQTGRIVGVEAAHNGKTITYHAKRGVVIAT
GGFSSNMEMRKKYNPELDERYGSTGHAGGTGDGIVMAEKIHAAAKNMGYIQSYPICSPTSGAIALIADSRFFGAVLINQK
GERFVEELERRDVISHAILAQPGRYTYVLWNQDIENVAHTVEMHQGELKEFTKDGLMYKVDTLEEAAKVFNIPEDKLLST
IKDVNHYAATGKDEAFNHRSGLVDLSKGPYWILKATPSVHHTMGGLVVDTRTRVLDEQGKVIPGLFAAGEVTGLTHGTNR
LGGNAYTDIIVYGRIAGQEAAK
>F8DIF2 1.3.99.33~~~urdA~~~Urocanate reductase~~~
MKLVAIVGTNAKQSYNRSLLQFMQRHFATKADIEILEITDVPMFNETDDQTDTPVIQKFNQAISEADGVIISTPEHNHTI
PSSLNSLIEWLSFNIHPLDGKPTMIVGASYDVQGSSRAQLHLRQILDAPGVNATVMPGSEFLLGRAHRAFDENGDLIDER
TVDFLDSCFYRFLRFVSVANQLNLPEEIRFEPGTYHVTTEGHNGKLPMDVTVSEDRIEKIEIDSSGESSGIADVVFTRIP
AEIIEGQTLNVDAVSGASVTSNGVLDGVARAVKQAGANPDVLRKRSKAPSALDKEDKTYQADVVIVGGGGAGLAAAAAVL
QAGKKPIVVEKFPAIGGNTVRAGGPMNAPDPAWQGTFEAHPGEANTLQELIAIDESTIDPEYLEDFRALKVEVEQYLQDP
SYLFDSTLLYRIQTYIGGKRKDLQGHEIHGQYDLVSVLTERALESVRWLEDIGVEFVRSEVTMPVGALWRRGHKPVQPMG
YAFISVLQKYVLENGGKILTDSPVKELLVEEGTVKGVRAEGRNGQTILVHADAVVLASGGFGANTKMLQKYNTYWTEIAD
DIATSNTPAVTGDGILLGQSVGADLVGMGFSQMMPVSDPVTGALFSGLQVPPANFIMVNTEGKRFVDEYGSRDKLSQAAI
DNGGLFYLIADDRIKATAYNTSQEKIDAQVKAGTLYRADTIEELAVQIGMDPQVLADTIKKYNSYVDAGFDPEFNKGSFD
LKCEVAPFYATPRKPAVHHTMGGLKIDTSTHVLNEKGQIIPGLYAAGEVAGGLHAGNRLGGNSLTDIFTFGRIAGQTAVK
ENC
>Q8DW88 1.3.99.33~~~urdA~~~Urocanate reductase~~~COG0431
MKLIAIVGTNAKQSYNRILLQFMKRHFVQKADIDIMEIANVPMFNETEDQTDLPAIQNFNTKISQADGVIIATPEHNHTI
PSSLNSLLEWLSFKVHPLDGKPLMIVGASYDVQGSSRAQLHLRQILDAPGVNAAVMPGSEFLLGRAHQAFDEAGNLKSEA
TVDFLESCFFKFLRFVQVANQLNEPEEVSFEAGTYQVTTQGHNGKLPMTVTLSEEKIEKIDIDSSGESSGIADIVFTRIP
NEILEGQTLNVDAVSGASVTSNGVLDGVARAIKLAGGNPDVLRKRPKAPSALDKEDKTYSTDVVIVGGGGAGLAAAARVL
QAGKQVMVLEKFPALGGNTVRSGGLLNAADPEWQKTFPANPGEAHNLSELIQTDEDSIAAEYLADFKELKQQVTNYLKDP
SYLFDSNILHRIQTYIGGKRTDRNGCEVYGNYDLVKVLTDKDLDSVHWLADIGVDFDRSEVSMPVGALWRRSHKPKQPMG
YAFIEALDTYIRKNSGTILTDTAVTDFILENGLIKGVLAKGRNGQTITVHAQAVVLASGGFGANTKMLQQYNTYWSNIDD
NIQTTNSPAITGDGIRLGQSIGAALVGMGFSQMMPVSDPNTGAIFSGLQVPPANFVMVNQEGKRFVDEYGSRDTLSKAAI
DNGGLFYLIADENIKATAMNTSNEKIEEQVAAGTLYRADTLESLAEQIGVDPATLVETINNYNSYVEAGYDPEFDKGAFD
LKVEKAPFYATPRKPATHHTMGGLKIDTQAHVIKEDGNKIPSLYAAGEVTGGIHAGNRLGGNALADIFTFGRIAAETAVT
ECC
>Q2YPD5 3.5.1.5~~~ureC1~~~Urease subunit alpha 1~~~
MPARISRATYAQMFGPTVGDKVRLADTDLIIEVERDLTTYGEEVKFGGGKVIRDGMGQSQLSRAEGAMDTVITNALILDH
SGIYKADIGLLDGRIALIGKAGNPDTQPGISIIIGPGTEIIAGEGKIVTAGGIDTHVHFISPQQVDEALNAGITCMVGGG
TGPAHGTLATTCTPGPWHIARLIQSFDGLPMNIGVFGKGNASLPGALEEMVRAGACGLKLHEDWGCTPAAIDNCLSVADH
FDVQVAIHTDTLNEGGFVEDTLNAFKGRTIHSFHTEGAGGGHAPDIIRVCQYPNVLPASTNPTRPYTVNTIAEHLDMLMV
CHHLSPAIPEDIAFAESRIRKETIAAEDILHDMGAFSIISSDSQAMGRVGEMIIRCWQTADKMKKQRGSLPDDRPGNDNY
RARRYIAKYTINPAIAHGMAHEIGSVEVGKRADLVLWNPAFFGVKPDMVLLGGWIATAPMGDANGSIPTPQPMHTRPMFG
SFGKALTNTSITFVSQAAMDEGLREKIGVDKQLVAVVNTRGGIGKHSMILNNAMPQMEVDPETYEVRADGELLTCEPVDV
VPMAQRYFLF
>Q8G2P8 3.5.1.5~~~ureC1~~~Urease subunit alpha 1~~~
MPARISRATYAQMFGPTVGDKVRLADTDLIIEVERDLTTYGEEVKFGGGKVIRDGMGQSQLSRAEGAMDTVITNALILDH
SGIYKADIGLLDGRIALIGKAGNPDTQPGISIIIGPGTEIIAGEGKIVTAGGIDTHVHFISPQQVDEALNAGITCMVGGG
TGPAHGTLATTCTPGPWHIARLIQSFDGLPMNIGVFGKGNASLPGALEEMVRAGACGLKLHEDWGCTPAAIDNCLSVADH
FDVQVAIHTDTLNEGGFVEDTLNAFKGRTIHSFHTEGAGGGHAPDIIRVCQYPNVLPASTNPTRPYTVNTIAEHLDMLMV
CHHLSPAIPEDIAFAESRIRKETIAAEDILHDMGAFSIISSDSQAMGRVGEMIIRCWQTADKMKKQRGSLPDDRPGNDNY
RARRYIAKYTINPAIAHGMAHEIGSVEVGKRADLVLWNPAFFGVKPDMVLLGGWIATAPMGDANGSIPTPQPMHTRPMFG
SFGKARTNTSITFVSQAAMDEGLREKIGVDKQLVAVVNTRGGIGKHSMILNNAMPQMEVDPETYEVRADGELLTCEPVDV
VPMAQRYFLF
>Q07397 3.5.1.5~~~ureC~~~Urease subunit alpha~~~
MSFSMSRKQYADMFGPTVGDAIRLADSELFIEIEKDYTTYGDEVKFGGGKVIRDGMGQHPLATSDECVDLVLTNAIIVDY
TGIYKADIGIKDGMIASIGKAGNPLLMDGVDMVIGAATEVIAAEGMIVTAGGIDAHIHFICPQQIETALASGVTTMIGGG
TGPATGTNATTCTPGPWNIHRMLQAAEEFPINLGFLGKGNCSDEAPLKEQIEAGAVGLKLHEDWGSTAAAIDTCLKVADR
YDVQVAIHTDTLNEGGFVEDTLKAIDGRVIHTYHTEGAGGGHAPDIIKAAGFPNILPSSTNPTRPYTINTLEEHLDMLMV
CHHLDANIPEDIAFADSRIRKETIAAEDVLHDLGVFSMISSDSQAMGRVGEVIIRTWQTADKMKKQRGKLQEDNGVGDNF
RVKRYIAKYTINPAIAHGIADYVGSVEVGKLADLVVWNPAFFGVKPELVLKGGMIAYSTMGDPNASIPTPQPVLYRPMFA
AKGDAKYQTSITFVSKAAYEKGIHEQLGLKKKVKPVHGIRKLTKKDLILNDKTPKIDVDPQTYEVKVDGQLVTCEPAEIV
PMAQRYFLF
>P77837 3.5.1.5~~~ureC~~~Urease subunit alpha~~~COG0804
MKMSREEYAELFGPTTGDKIRLGDTDLWIEVEKDFTVYGEEMIFGGGKTIRDGMGQNGRITGKDGALDLVITNVVLLDYT
GIVKADVGVKDGRIVGVGKSGNPDIMDGVDPHMVIGAGTEVISGEGKILTAGGVDTHIHFICPQQMEVALSSGVTTLLGG
GTGPATGSKATTCTSGAWYMARMLEAAEEFPINVGFLGKGNASDKAPLIEQVEAGAIGLKLHEDWGTTPSAIKTCMEVVD
EADIQVAIHTDTINEAGFLENTLDAIGDRVIHTYHIEGAGGGHAPDIMKLASYANILPSSTTPTIPYTVNTMDEHLDMMM
VCHHLDAKVPEDVAFSHSRIRAATIAAEDILHDIGAISMTSSDSQAMGRVGEVIIRTWQVADKMKKQRGALAGENGNDNV
RAKRYIAKYTINPAITHGLSHEVGSVEKGKLADLVLWDPVFFGVKPELVLKGGMIARAQMGDPNASIPTPEPVFMRQMYA
SYGKANRSTSITFMSQASIERGVAESLGLEKRISPVKNIRKLSKLDMKLNSALPKIEIDPKTYQVFADGEELSCQPVDYV
PLGQRYFLF
>Q79VJ3 3.5.1.5~~~ureC~~~Urease subunit alpha~~~COG0804
MSFEISRKQYTDLYGPTVGDSVRLADTELFLCVEKDYAAIGEEVAFGGGKVIRDGMGQNGTLVRDVDIPDTVITNVIVLD
YTGVYKADVALRDGKIFRIGKAGNPNVMENVDIVIGVATDIIAGEGKILTAGGIDTHVHFLGTDQVNTALASGITTMIGG
GTGPSQASMATTVTPGQWNTYNMLSAFEGMPMNFGILGKGHGSSKSPLAEQVRAGAIGLKIHEDWGATPSSINTALEVAD
DMDIQVALHSDTLNEAGFVEDTIEAIAGRVIHTFHTEGAGGGHAPDLIRVAALPNVLPASTNPTLPYTRNTVEEHLDMVM
VAHHLNPDIPEDVAFADSRIRAETIAAEDVLHDMGIFSITSSDSQAMGRVGETITRTWQVADHMKRTRGSLTGDAPYNDN
NRLRRFIAKYTINPAIAHGVDYVVGSVEEGKFADLVLWDPKFFGVKPDLVIKGGLMVNSLMGDSNGSIPTPQPRTLRNTW
GAFGQAVSRSSITFLSQDAIDANVPDLLNLRKQIRGVRGVRNLTKRDMKLNAEMPDIRVDPETYQVFVNGELITSKPAET
VPMARRYFLF
>Q08716 3.5.1.5~~~ureB~~~Urease subunit beta~~~COG0804
MKKISRKEYVSMYGPTTGDRVRLGDTDLILEVEHDCTTYGEEIKFGGGKTIRDGMSQTNSPSSYELDLVLTNALIVDYTG
IYKADIGIKDGKIAGIGKAGNKDMQDGVDNNLCVGPATEALAAEGLIVTAGGIDTHIHFISPQQIPTAFASGVTTMIGGG
TGPADGTNATTITPGRANLKSMLRAAEEYAMNLGFLAKGNVSYEPSLRDQIEAGAIGFKIHEDWGSTPAAIHHCLNVADE
YDVQVAIHTDTLNEAGCVEDTLEAIAGRTIHTFHTEGAGGGHAPDVIKMAGEFNILPASTNPTIPFTKNTEAEHMDMLMV
CHHLDKSIKEDVQFADSRIRPQTIAAEDQLHDMGIFSITSSDSQAMGRVGEVITRTWQTADKNKKEFGRLKEEKGDNDNF
RIKRYISKYTINPAIAHGISDYVGSVEVGKYADLVLWSPAFFGIKPNMIIKGGFIALSQMGDANASIPTPQPVYYREMFG
HHGKNKFDTNITFVSQAAYKAGIKEELGLDRVVLPVKNCRNITKKDLKFNDVTAHIDVNPETYKVKVDGKEVTSKAADEL
SLAQLYNLF
>Q93PJ4 3.5.1.5~~~ureB~~~Urease subunit beta~~~COG0804
MIKISRKQYASMYGPTTGDKVRLGDTNLFAEIEKDYTLYGEEIKFGGGKTIRDGMAQSASTYTNELDAVITNAMIIDYTG
IYKADIGIKGGKIVGIGKAGNPDTQDSVNEAMVVGAATEVIAGEGQIITAGGIDTHIHFISPTQIPTALYSGVTTMIGGG
TGPAAGTNATTCTPGKWNMHQMLRAAESYAMNLGFFGKGNSSNEEGLEEQIKAGALGLKVHEDWGSTPAAINHALNVAQK
YDVQVAIHTDTLNEAGCVEDTMKAIDGRTIHTFHTEGAGGGHAPDIIKAAGEPNILPASTNPTIPFTKNTADEHLDMLMV
CHHLDKKIKEDVAFADSRIRPETIAAEDTLHDMGIFSITSSDSQAMGRVGEVITRTWQTADKCKNEFGALKEECGENDNF
RIKRYISKYTINPAIAHGISEYVGSVEVGKFADLVLWKPSMFGIKPEMILKNGMIVAAKIGDSNASIPTPEPVVYAPMFG
SYGKAKYNCAITFVSKIAYDCHIKEELGLERILLPVKNCRNITKKDMKFNDVITPIEVNPETYEVRVNNTKITSKPVEKV
SLGQLYCLF
>P69996 3.5.1.5~~~ureB~~~Urease subunit beta~~~COG0804
MKKISRKEYVSMYGPTTGDKVRLGDTDLIAEVEHDYTIYGEELKFGGGKTLREGMSQSNNPSKEELDLIITNALIVDYTG
IYKADIGIKDGKIAGIGKGGNKDMQDGVKNNLSVGPATEALAGEGLIVTAGGIDTHIHFISPQQIPTAFASGVTTMIGGG
TGPADGTNATTITPGRRNLKWMLRAAEEYSMNLGFLAKGNASNDASLADQIEAGAIGFKIHEDWGTTPSAINHALDVADK
YDVQVAIHTDTLNEAGCVEDTMAAIAGRTMHTFHTEGAGGGHAPDIIKVAGEHNILPASTNPTIPFTVNTEAEHMDMLMV
CHHLDKSIKEDVQFADSRIRPQTIAAEDTLHDMGIFSITSSDSQAMGRVGEVITRTWQTADKNKKEFGRLKEEKGDNDNF
RIKRYLSKYTINPAIAHGISEYVGSVEVGKVADLVLWSPAFFGVKPNMIIKGGFIALSQMGDANASIPTPQPVYYREMFA
HHGKAKYDANITFVSQAAYDKGIKEELGLERQVLPVKNCRNITKKDMQFNDTTAHIEVNPETYHVFVDGKEVTSKPANKV
SLAQLFSIF
>P18314 3.5.1.5~~~ureC~~~Urease subunit alpha~~~
MSNISRQAYADMFGPTVGDKVRLADTELWIEVEDDLTTYGEEVKFGGGKVIRDGMGQGQMLAADCVDLVLTNALIVDHWG
IVKADIGVKDGRIFAIGKAGNPDIQPNVTIPIGAATEVIAAEGKIVTAGGIDTHIHWICPQQAEEALVSGVTTMVGGGTG
PAAGTHATTCTPGPWYISRMLQAADSLPVNIGLLGKGNVSQPDALREQVAAGVIGLKIHEDWGATPAAIDCALTVADEMD
IQVALHSDTLNESGFVEDTLAAIGGRTIHTFHTEGAGGGHAPDIITACAHPNILPSSTNPTLPYTLNTIDEHLDMLMVCH
HLDPDIAEDVAFAESRIRRETIAAEDVLHDLGAFSLTSSDSQAMGRVGEVILRTWQVAHRMKVQRGALAEETGDNDNFRV
KRYIAKYTINPALTHGIAHEVGSIEVGKLADLVVWSPAFFGVKPATVIKGGMIAIAPMGDINASIPTPQPVHYRPMFGAL
GSARHHCRLTFLSQAAAANGVAERLNLRSAIAVVKGCRTVQKADMVHNSLQPNITVDAQTYEVRVDGELITSEPADVLPM
AQRYFLF
>P9WFF1 3.5.1.5~~~ureC~~~Urease subunit alpha~~~COG0804
MARLSRERYAQLYGPTTGDRIRLADTNLLVEVTEDRCGGPGLAGDEAVFGGGKVLRESMGQGRASRADGAPDTVITGAVI
IDYWGIIKADIGIRDGRIVGIGKAGNPDIMTGVHRDLVVGPSTEIISGNRRIVTAGTVDCHVHLICPQIIVEALAAGTTT
IIGGGTGPAEGTKATTVTPGEWHLARMLESLDGWPVNFALLGKGNTVNPDALWEQLRGGASGFKLHEDWGSTPAAIDTCL
AVADVAGVQVALHSDTLNETGFVEDTIGAIAGRSIHAYHTEGAGGGHAPDIITVAAQPNVLPSSTNPTRPHTVNTLDEHL
DMLMVCHHLNPRIPEDLAFAESRIRPSTIAAEDVLHDMGAISMIGSDSQAMGRVGEVVLRTWQTAHVMKARRGALEGDPS
GSQAADNNRVRRYIAKYTICPAIAHGMDHLIGSVEVGKLADLVLWEPAFFGVRPHVVLKGGAIAWAAMGDANASIPTPQP
VLPRPMFGAAAATAAATSVHFVAPQSIDARLADRLAVNRGLAPVADVRAVGKTDLPLNDALPSIEVDPDTFTVRIDGQVW
QPQPAAELPMTQRYFLF
>P17086 3.5.1.5~~~ureC~~~Urease subunit alpha~~~COG0804
MKTISRQAYADMFGPTTGDRLRLADTELFLEIEKDFTTYGEEVKFGGGKVIRDGMGQSQVVSAECVDVLITNAIILDYWG
IVKADIGIKDGRIVGIGKAGNPDVQPNVDIVIGPGTEVVAGEGKIVTAGGIDTHIHFICPQQAQEGLVSGVTTFIGGGTG
PVAGTNATTVTPGIWNMYRMLEAVDELPINVGLFGKGCVSQPEAIREQITAGAIGLKIHEDWGATPMAIHNCLNVADEMD
VQVAIHSDTLNEGGFYEETVKAIAGRVIHVFHTEGAGGGHAPDVIKSVGEPNILPASTNPTMPYTINTVDEHLDMLMVCH
HLDPSIPEDVAFAESRIRRETIAAEDILHDMGAISVMSSDSQAMGRVGEVILRTWQCAHKMKLQRGTLAGDSADNDNNRI
KRYIAKYTINPALAHGIAHTVGSIEKGKLADIVLWDPAFFGVKPALIIKGGMVAYAPMGDINAAIPTPQPVHYRPMYACL
GKAKYQTSMIFMSKAGIEAGVPEKLGLKSLIGRVEGCRHITKASMIHNNYVPHIELDPQTYIVKADGVPLVCEPATELPM
AQRYFLF
>Q9L644 3.5.1.5~~~ureC~~~Urease subunit alpha~~~
MSYKINRKTYAQTYGPTKGDRVRLADTELIIEVEKDFTTYGDEVKFGGGKVIRDGMGQSQVTREDGAVDTVITNALIVDW
WGIVKADVGLKDGKIYEIGKAGNPDIQDNINIIIGSSTEVIAGEGHILTAGSIDTHIHFICPQQIETALASGVTTMLGGG
TGPATGTNATTCTPGAFHISRMIQSAEAFPVNLGFFGKGNSSNETNLFEQVNAGACGLKLHEDWGTTPSTINSCLNVADT
LDVQVCIHTDTLNEAGFVEDTIAAIAGRTIHTFHTEGAGGGHAPDIIKICGENNVLPSSTNPTRPYTKNTLEEHLDMLMV
CHHLDSKIPEDIAFAESRIRRETIAAEDILHDIGAFSIIASDSQAMGRVGEVITRTFQTAHKMKVQRGPLPEDSDRNDNY
RVKRYISKVTINPAIAHGINRFVGSIEKGKIADLVLWKPSFFGVKPELVVKGGSIVWSQMGDANASIPTPGPVHGRPMFA
NYGQSLLKSSFTFLSKNAIELDIPNKLSLQKNCLAVENTRSINKLDLKLNNKLPNITVDPQTYEVFADGVLLSCEPLEEV
PMAQKYFLL
>P41020 3.5.1.5~~~ureC~~~Urease subunit alpha~~~
MKINRQQYAESYGPTVGDQVRLADTDLWIEVEKDTTYGDEAVNFGGGKVLREGMGENGTYTRTENVLDLLLTNALILDYT
GIYKADIGVKDGYIVGIGKGGNPDIMDGVTPNMIVGTATEVIAAEGKIVTAGGIDTHVHFINPDQVDVALANGITTLFGG
GTGPAEGSKATTVTPGPWNIEKMLKSTEGLPINVGILGKGHGSSIAPIMEQIDAGAAGLKIHEDWGATPASIDRSLTVAD
EADVQVAIHSDTLNEAGFLEDTLRAINGRVIHSFHVEGAGGGHAPDIMAMAGHPNVLPSSTNPTRPFTVNTIDEHLDMLM
VCHHLKQNIPEDVAFADSRIRPETIAAEDILHDLGIISMMSTDALAMGRAGEMVLRTWQTADKMKKQRGPLAEEKNGSDN
FRAKRYVSKYTINPAIAQGIAHEVGSIEEGKFADLVLWEPKFFGVKADRVIKGGIIAYAQIGDPSASIPTPQPVMGRRMY
GTVGDLIHDTNITFMSKSSIQQGVPAKLGLKRRIGTVKNCRNIGKKDMKWNDVTTDIDINPETYEVKVDGEVLTCEPVKE
LPMAQRYFLF
>P67404 3.5.1.5~~~ureC~~~Urease subunit alpha~~~
MSFKMTQNQYTSLYGPTVGDSIRLGDTNLFAQIEKDYAVYGEEATFGGGKSIRDGMAQNPRVTRDDVNVADLVISNAVII
DYDKVVKADIGIKNGYIFAIGNAGNPDIMDNVDIIIGSTTDIIAAEGKIVTAGGIDTHVHFINPEQAEVALESGITTHIG
GGTGASEGSKATTVTPGPWHIHRMLEAAEGLPINVGFTGKGQATNPTALIEQINAGAIGLKVHEDWGATPSALSHALDVA
DEFDVQIALHADTLNEAGFMEDTMAAVKDRVLHMYHTEGAGGGHAPDLIKSAAFSNILPSSTNPTLPYTHNTVDEHLDMV
MITHHLNAAIPEDIAFADSRIRKETIAAEDVLQDMGVFSMISSDSQAMGRVGEVITRTWQVAHRMKEQRGPLDGDFEHND
NNRIKRYIAKYTINPAITHGISEYVGSIEPGKLADIVLWDPIFFGVKPELVVKGGLINSAVNGDANGSIPTSEPMKYRKM
YGQYGGNLTSTSMTFVSKTAYENGINRALNLKRMVRPVKNIRQLSKADMKNNSATPKLDVDPQTYEVYVDGEKITSNAAT
ELPLTQRYFLF
>Q4A0J5 3.5.1.5~~~ureC~~~Urease subunit alpha~~~COG0804
MSFKMTQSQYTSLYGPTVGDSVRLGDTNLFARVERDYATYGDEAAFGGGKSIRDGMAQNPNVTRDDKQVADLVITNAMII
DYDKIVKADIGVKNGYIMKIGKAGNPDIMDNVDIIIGATTDIISAEGKIVTAGGIDTHVHFINPEQSQVALESGITTHIG
GGTGASEGTKATTVTPGPWHLHRMLLAAESLPLNIGFTGKGQAVNHTALVEQIHAGAIGLKVHEDWGATPSALDHALQVA
DDYDVQIALHADTLNEAGFMEETMAAVKDRVLHMYHTEGAGGGHAPDLIKSAAYANILPSSTNPTLPYTVNTIDEHLDMV
MITHHLNASIPEDIAFADSRIRKETIAAEDVLQDMGVFSMVSSDSQAMGRVGEVITRTWQVAHRMKEQRGLLDGDSEYND
NNRIKRYIAKYTINPAITHGISDYVGSIDEGKLADIILWEPAFFGVKPDVIVKGGLINAAINGDANGSIPTSEPLKYRKM
YGQLGGNLQSTSMTFVSTTAYENDIGKLLGLKRKLRPVHNIRKLSKKDMKNNNATPDLDVDPQTYEVFVDGEKITSEPAT
ELPLTQRYFLF
>P42873 3.5.1.5~~~ureC~~~Urease subunit alpha~~~COG0804
MSFKMTQSQYTSLYGPTVGDSVRLGDTNLFARVEKDYATYGDEAAFGGGKSIRDGMAQNPNVTRDDKQVADLVITNALIL
DYDKIVKADIGVKNGYIMKIGKAGNPDIMDNVDIIIGATTDIISAEGKIVTAGGIDTHVHFVNPEQSQVALESGITTHIG
GGTGASEGAKATTVTPGPWHLHRMLLAAESLPLNIGFTGKGQAVNHTALVEQIHAGAIGLKVHEDWGATPSALDHALQVA
DDYDVQIALHADTLNEAGFMEETMAAVKDRVLHMYHTEGAGGGHAPDLIKSAAYSNILPSSTNPTLPYTVNTIDEHLDMV
MITHHLNASIPEDIAFADSRIRKETIAAEDVLQDIGVFSMVSSDSQAMGRVGEVITRTWQVAHRMKEQRGSLDGDSEYND
NNRIKRYIAKYTINPAITHGISDYVGSIDEGKLADIIMWEPAFFAVKPDVIVKGGLINPAINGDANGSIPTSEPLKYRKM
YGQLGGNMQGTSMTFVSTTAYENDIGKLLGLKRKLRPVHNIRKLTKADMKNNSATPKIDVDPQTYEVFVDGEKITSEPAT
ELPLTQRYFLF
>P50047 3.5.1.5~~~ureC~~~Urease subunit alpha~~~COG0804
MSFKMDREEYAQHYGPTVGDSVRLGDTNLFAAIEKDFTVYGQESKFGGGKVLRDGMGVSATETRDNPSVVDTIITGATII
DYTGIIKADIGIRDGKIVAIGRGGNPDTMDNVDFVVGASTEAIAAEGLIVTAGGIDLHVHYISADLPEFGLDNGITTLFG
GGTGPADGSNATTCTPGKFHITRMLQAVDDMPANFGFLAKGVGSETEVVEEQIKAGAAGIKTHEDWGATYAGIDNSLKVA
DKYDVSFAVHTDSLNEGGFMENTLESFQGRTVHTFHTEGSGGGHAPDIMVFAGKENILPSSTNPTNPYTTNAIGELLDMV
MVCHHLDPKIPEDVSFAESRVRKQTVAAEDVLHDMGALSIMTSDAMAMGRVGEVAMRCWQLADKMKAQRGPLEGDSEFND
NNRIKRYVAKYTINPAITNGIADYIGSVEVGKFADLVIWEPAQFGAKPKLVLKGGMLTYGVMGDAGSSLPTPQPRIMRKL
YGAYGQAVHETNLTFVSQYAYDHGIKEEIGLNKIVLPVKNTRNLTKRDMKLNDYAPKTIRIDPQTFDVFIDDELVTCEPI
HTTSLSQRYFLF
>O87402 3.5.1.5~~~ureC~~~Urease subunit alpha~~~COG0804
MPYRISRQAYAETYGPTTGDRLRLADTELILEVEKDFTVYGDEVKFGGGKVIRDGMGQSQTPRAGGAVDTVITNALILDW
WGIVKADVGLKDGRIVGIGKAGNPDTQAGVTIVVGPGTEAIAGEGHILTAGGIDTHIHFICPQQIETALASGMTTLMGGG
TGPATGTNATTCTPGAFHIGRMLQAAEGLPVNLGFFGKGNASTPEALEEQVRAGACGLKLHEDWGTTPATIDACLSVADR
MDVQVCIHTDTLNEAGFVEDTIAAIKGRTIHTFHTEGAGGGHAPDIIKICGEANVLPSSTNPTRPYTRNTLEEHLDMLMV
CHHLDPRIPEDVAFAESRIRRETIAAEDILHDLGAFSIIASDSQAMGRVGEVITRTFQTAHKMKVQRGALPQDSSRNDNH
RLKRYIAKVTINPALAHGISSEVGSIETGKLADLVLWKPGFFGIRPEVVIKGGSIVWAQMGDANASIPTPGPVHGRPMFG
AFGKALAPSCLTFVSEAAMDSDIQRHLGLERTCMAVKDTRSVGKSALKLNSALPKVSVDPQTYEVFADGELLTCEPAEVL
PLAQRYLLL
>P0CB00 3.5.1.5~~~ureC~~~Urease subunit alpha~~~
MFKISRKNYSDLYGITTGDSVRLGDTNLWVKVEKDLTTYGEESVFGGGKTLREGMGMNSTMKLDDKLGNAEVMDLVITNA
LILDYTGIYKADIGIKNGKIASIGKSGNPHLTDGVDMVVGISTEVSAGEGKIYTAGGLDTHVHWLEPEIVPVALDGGITT
VIAGGTGMNDGTKATTVSPGKFWVKSALQAADGLPINAGFLAKGQGMEDPIFEQIVAGACGLKIHEDWGATGNAIDLALT
VAEKTDVAVAIHTDTLNEAGFVEHTIAAMKGRTIHAYHTEGAGGGHAPDILESVKYAHILPASTNPTIPYTVNTIAEHLD
MLMVCHHLNPKVPEDVAFADSRIRSQTIAAEDLLHDMGAISIMSSDTLAMGRIGEVVTRSWQMAHKMKAQFGALKGDSEF
NDNNRVKRYVAKYTINPAIAHGIDSYVGSIEVGKLADIVAWEPKFFGAKPYYVVKMGVIARCVAGDPNASIPTCEPVIMR
DQFGTYGRSLTSTSVSFVSKIGLENGIKEEYKLEKELLPVKNCRSINKKSMKWNSATPNLEVDPQTFDAAVDYNDLENWL
EQPAAELAKKLKKTANGKYVLDAEPLTEAPLAQRYFLF
>Q2YPD6 3.5.1.5~~~ureB1~~~Urease subunit beta 1~~~
MIPGEIITLEGDIELNQGQPTVTMRVANTGDRPIQVGSHFHFYEVNAALSFDREKARGQRLDIAAGTAVRFEPGQERDVT
LVPIRGHREIYGFRQMIMGKL
>Q8G2P9 3.5.1.5~~~ureB1~~~Urease subunit beta 1~~~
MIPGEIITLEGDIELNQGQPTVTMRVANTGDRPIQVGSHFHFYEVNAALSFDREKARGQRLDIAAGTAVRFEPGQERDVT
LVPIRGHREIYGFRQMIMGKL
>P14916 3.5.1.5~~~ureA~~~Urease subunit alpha~~~COG0831
MKLTPKELDKLMLHYAGELAKKRKEKGIKLNYVEAVALISAHIMEEARAGKKTAAELMQEGRTLLKPDDVMDGVASMIHE
VGIEAMFPDGTKLVTVHTPIEANGKLVPGELFLKNEDITINEGKKAVSVKVKNVGDRPVQIGSHFHFFEVNRCLDFDREK
TFGKRLDIASGTAVRFEPGEEKSVELIDIGGNRRIFGFNALVDRQADNESKKIALHRAKERGFHGAKSDDNYVKTIKE
>Q07398 3.5.1.5~~~ureB~~~Urease subunit beta~~~
MIPGEYVLKKEPILCNQNKQTIKIRVLNRGDRPVQVGSHFHFFEVNQSLQFHREKAFGMRLNIPAGTAVRFEPGDAKEVE
IIPFSGERKVYGLNNVTNGSVEMGKRK
>P71035 3.5.1.5~~~ureB~~~Urease subunit beta~~~COG0832
MKPGAFQIAEGTITINEGREIREVTVKNTGSRSIQVGSHFHFAEANGALLFDRELAIGMRLDVPSGTSVRFEPGEQKTVS
LVEIRGRKTIRGLNGMADTFIDERGKEKTLANLKQAGWMEGVIR
>Q79VJ4 3.5.1.5~~~ureB~~~Urease subunit beta~~~COG0832
MIPGEYILSSESLTGNVGREAKTIEIINTGDRPVQIGSHFHFAEVNPSISFDRSEGYGFRLDIPSGTAVRLEPGDARTVN
LVAIGGDRIVAGFRDLVDGPLEDLKVNVWEGREDGWRRSSAAGDAPQELPQVEAAERGRKLDDATDVDTNVGTEEGFEEG
RN
>Q03283 3.5.1.5~~~ureB~~~Urease subunit beta~~~
MIPGEIKVNHALGDIELNAGRETQTIQVANHGDRPIQIGSHYHFYEVNDALKFERENTLGFRLNIPAGMAVRFEPGQSRT
VELVAFSGKREIYGFHGKVMGKLESEN
>P18315 3.5.1.5~~~ureB~~~Urease subunit beta~~~
MIPGEYHVKPGQIALNTGRATCRVVVENHGDRPIQVGSHYHFAEVNPALKFDRQQAAGYRLNIPAGTAVRFEPGQKREVE
LVAFAGHRAVFGFRGEVMGPLEVNDE
>P9WFE9 3.5.1.5~~~ureB~~~Urease subunit beta~~~COG0832
MIPGEIFYGSGDIEMNAAALSRLQMRIINAGDRPVQVGSHVHLPQANRALSFDRATAHGYRLDIPAATAVRFEPGIPQIV
GLVPLGGRREVPGLTLNPPGRLDR
>P17087 3.5.1.5~~~ureB~~~Urease subunit beta~~~COG0832
MIPGEIRVNAALGDIELNAGRETKTIQVANHGDRPVQVGSHYHFYEVNEALRFARKETLGFRLNIPAGMAVRFEPGQSRT
VELVAFAGKREIYGFHGKVMGKLESEKK
>P41021 3.5.1.5~~~ureB~~~Urease subunit beta~~~
MSNNNYIVPGEYRVAEGEIEINAGREKTTIRVSNTGDRPIQVGSHIHFVEVNKELLFDRAEGIGRRLNIPSGTAARFEPG
EEMEVELTELGGNREVFGISDLTNGSVDNKELILQRAKELGYKGVE
>Q4A0J4 3.5.1.5~~~ureB~~~Urease subunit beta~~~COG0832
MKPGEIIVKRTEIEVNQGHNATILNVKNTGDRPIQVGSHYHFFEANPALQFDHEKAYGKRLDIPAGAAVRFEPGDEKEVQ
LVEYSGKRKIYGFHGDVNGSIDESRVYKLEDDSTATEVIAEQDKTSENANKGRG
>P42874 3.5.1.5~~~ureB~~~Urease subunit beta~~~COG0832
MKPGEIIVKRTEIEVNRGHNATILDVKNTGDRPIQVGSHYHFFEANPALQFEREKAYGKRLDIPAGAAVRFEPGDEKEVQ
LVEYSGKRRIFGFHGEVNGPIDEARVYKAEDDDSATEIIAEENKVSENANKESGYNR
>Q55054 3.5.1.5~~~ureB~~~Urease subunit beta~~~COG0832
MIPGEYHVASEPIDYNGGYEAISLEVKNVGDRAAQVGSHYHFYEANEAGLQFDREKARGKRLDIPAGTAIRFEPGETKTV
QLIDFGGKRRIFGFNNKVNGFLD
>O87401 3.5.1.5~~~ureB~~~Urease subunit beta~~~COG0832
MAPFIPGELLPEPGEIELNAGRPVTSLHVANSGDRPVQVGSHFHFAEANAALQFDRTAARGQRLDIPAGTAIRFEPGDSR
DVNLIPFAGDRRVIGFNGQINGPLDA
>P0CB01 3.5.1.5~~~ureB~~~Urease subunit beta~~~
MSGSSNQFTPGKLVPGAINFAEGEIVMNEGREAKVISIKNTGDRPIQVGSHFHLFETNSALVFFDEKGNEDKERKVAYGR
RFDIPSGTAIRFEPGDKKEVSVIDLVGTREVWGVNGLVNGKLKK
>P31495 3.5.1.5~~~ureB~~~Urease subunit beta~~~
MSTKTNSTKATSEKTDSLKTNRGTKSSAGYSDQNIPLGGCILADTPITFNENKPVTKVKVRNTGDRPIQVGSHFHFFEAN
RALEFDRAAAYGKRLNISSTTAIRFEPGDETEVPLIPFGGKQTLYGFNNLVDGWTGEGVVPNSERPDKLEAIRRAAERGF
KSSK
>Q2YPD7 3.5.1.5~~~ureA1~~~Urease subunit gamma 1~~~
MNLTPREKDKLLIAMAAMVARRRLERGVKLNHPEAIALVSDFVVEGARDGRTVAELMEAGAHVITREQVMDGVAEMIRDI
QVEATFPDGTKLVTVHEPIR
>Q8G2Q0 3.5.1.5~~~ureA1~~~Urease subunit gamma 1~~~
MNLTPREKDKLLIAMAAMVARRRLERGVKLNHPEAIALVSDFVVEGARDGRTVAELMEAGAYVITREQVMDGVAEMIRDI
QVEATFPDGTKLVTVHEPIR
>Q2YQE0 3.5.1.5~~~ureA2~~~Urease subunit gamma 2~~~
MHLTPREFDKLVIHMLSDVALKRKNKGLKLNHPEAVAVLSAYVLDGAREGKTVEEVMDGARSVLKADDVMDGVPDLLPLI
QVEAVFSDGSRLVSLHNPIT
>Q07399 3.5.1.5~~~ureA~~~Urease subunit gamma~~~
MKLTSREMEKLMIVVAADLARRRKERGLKLNYPEAVAMITYEVLEGARDGKTVAQLMQYGATILTKEDVMEGVAEMIPDI
QIEATFPDGTKLVTVHDPIR
>P75030 3.5.1.5~~~ureA~~~Urease subunit gamma~~~COG0831
MKLTPVEQEKLLIFAAGELAKQRKARGVLLNYPEAAAYITCFIMEGARDGKGVAELMEAGRHVLTEKDVMEGVPEMLDSI
QVEATFPDGVKLVTVHQPISAEVKS
>Q9RHM6 3.5.1.5~~~ureA~~~Urease subunit gamma~~~COG0831
MHITPREQEKLMIVVAADLARRRKDRGLKLNHPEAVALITYELIEGARDGRTVADLMSWGSTILTRDDVLEGIPEMIPDI
QVEATFDDGTKLVTVHNPIR
>Q03282 3.5.1.5~~~ureA~~~Urease subunit gamma~~~
MELTPREKDKLLLFTAGLVAERRLARGLKLNYPEAVALISCAIMEGARDGKTVAQLMSEGRTLLTAEQVMEGVPEMIKDI
QVECTFPDGTKLVSIHDPIV
>P18316 3.5.1.5~~~ureA~~~Urease subunit gamma~~~
MELTPREKDKLLLFTAALVAERRLARGLKLNYPESVALISAFIMEGARDGKSVASLMEEGRHVLTREQVMEGVPEMIPDI
QVEATFPDGSKLVTVHNPII
>P9WFE7 3.5.1.5~~~ureA~~~Urease subunit gamma~~~COG0831
MRLTPHEQERLLLSYAAELARRRRARGLRLNHPEAIAVIADHILEGARDGRTVAELMASGREVLGRDDVMEGVPEMLAEV
QVEATFPDGTKLVTVHQPIA
>P17088 3.5.1.5~~~ureA~~~Urease subunit gamma~~~COG0831
MELTPREKDKLLLFTAGLVAERRLAKGLKLNYPEAVALISCAIMEGAREGKTVAQLMSEGRTVLTAEQVMEGVPEMIKDV
QVECTFPDGTKLVSIHSPIV
>Q9L642 3.5.1.5~~~ureA~~~Urease subunit gamma~~~
MHLSPQEKDKLLIFSAAQLAERRLNRGLKLNYPETVAFLSFQVLEGARDGKSVSQLMSEGTTWLSKKQVMDGISEMVDEV
QVEAVFPDGTKLVTIHNPIN
>Q8RPY7 3.5.1.5~~~ureA~~~Urease subunit gamma~~~
MNLTPREKDKLLISMAAMVARRRLERGVKLNYPEAIALISDFVVEGARDGRPVAELMEAGAHVIGRSQVMEGVAEMIHDV
QVEATFPDGTKLVTVHEPIR
>P41022 3.5.1.5~~~ureA~~~Urease subunit gamma~~~
MHLNPAEKEKLQIFLASELLLRRKARGLKLNYPEAVAIITSFIMEGARDGKTVAMLMEEGKHVLTRDDVMEGVPEMIDDI
QAEATFPDGTKLVTVHNPIS
>Q4A0J3 3.5.1.5~~~ureA~~~Urease subunit gamma~~~COG0831
MHFTQREQDKLMLVIAADLARRRQQRGLKLNYPEAVAIISFELLEGARDGKTVAELMSYGKQILNEDDVMEGVADMLTEM
EIEATFPDGTKLITVHHPIV
>P42875 3.5.1.5~~~ureA~~~Urease subunit gamma~~~COG0831
MHFTQREQDKLMLVIAADLARRRQQRGLKLNYPEAVAIISFELLEGARDGKTVAELMSYGKQILGEDDVMEGVADMLTEM
EIEATFPDGTKLITVHHPIV
>Q55053 3.5.1.5~~~ureA~~~Urease subunit gamma~~~COG0831
MQLTMREQEKMMISLAAMIAQRRKDKGIKLNHPEAVALITDYVLEGAREGKTVAQLMDEARNLLTREDVMEGIAEMIPMI
QVEATFTDSTKLVTVHDPIQ
>O87400 3.5.1.5~~~ureA~~~Urease subunit gamma~~~COG0831
MHLSPQEKDKLLIVTAALLAERRLNRGLKLNHPEAVAWLSFLVLEGARDGKSVAELMQEGTTWLSRNQVMDGIPELVQEV
QIEAVFPDGTKLVTLHDPIR
>Q9FAS7 3.5.1.5~~~ureA~~~Urease subunit gamma~~~
MELTPREKDKLLLFTAGLVAERRRARGLKLNYPEAIALISCEIMEGARDGRTVAELMSYGRTILTAEDVMEGVPEMITDI
QVECTFPDGTKLVSIHDPIV
>Q79VJ0 ~~~ureD1~~~Urease accessory protein UreD~~~COG0829
MTQTQPVGTLRLTIDDQGPQGQSRAVEQFHQGALRVIRPHYLDDSGQVCYTIIAIGGGYLGGDVYEQQFTIKDNAKALIT
TQSATKIYRTPQGPATQHTEINVGENAVLEYLADQTIAYREATYHQFTKVALHPSATFVMSEQITPGWHPDGKHFAYDEM
RLHTEITDSTTGRLVLLDNLLLRPDSREGSFGWTEQYTHSGQMIVMGEGVDKQLVAELNEQLAAHPDVYGAVNFLSAPGT
LLRGFIARTLSNRTEELINLHEHIASLLRGRWRGQEPVNLRKY
>Q03285 ~~~ureD~~~Urease accessory protein UreD~~~
MSDFSGSGWLAEIFLRYELKRGVTRLTDKQHIGPLMVQRPFYPEQGIAHTYLLHPPGGVVGGDKLLINIDVQPHAHALLT
TPGATKFYRSAGGVARQVQTLTVAPNGFLEWLPQENIFFPEAQVRLETHVRIASSSKFISWEIQCLGRPVLNEQFDNGDI
RGRLQFYIDDKLTLAESIFIEGSQKQSAVMREFPMVGSLYIYPASDELKAELHESLAVFFSTEVRPLEYGLTDVDGILVL
RLLGSQTEPMMACFAHIWQATRQYWLGYCPEPPRIWAT
>Q09063 ~~~ureD~~~Urease accessory protein UreD~~~
MLPPLKKGWQATLDLRFHQAGGKTVLASAQHVGPLTVQRPFYPEEETCHLYLLHPPGGIVGGDELTISAHLAPGCHTLIT
MPGASKFYRSSGAQALVRQQLTLAPQATLEWLPQDAIFFPGANARLFTTFHLCASSRLLAWDLLCLGRPVIGETFSHGTL
SNRLEVWVDNEPLLVERLHLQEGELSSIAERPWVGTLLCYPATDALLDGVRDALAPLGLYAGASLTDRLLTVRFLSDDNL
ICQRVMRDVWQFLRPHLTGKSPVLPRIWLT
>P17089 ~~~ureD~~~Urease accessory protein UreD~~~COG0829
MPDFSEKGWLADIALRYELKRGKTCLTEKRHLGPLMVQRPFYPEQGVAHTYLLHPPGGVVGGDTLSININVQPYAHALLT
TPGATKFYRSAGGTASQTQTLTVAQEGFLEWLPQENIFFPDAQVCLTTHIHLASSAKFIGWEMQCFGRPVLNEWFETGKV
KGRLNFYVDERLILTESMRVEGLQKQAAAMREFPMFGSLYIYPATDALKEIIQHHLEKVNPLVEYGLTDVDGILVLRVLG
TQTEPMMACFAQVWQIVRQHWLGYCPEPPRIWAT
>P0C145 ~~~ureE1~~~Urease accessory protein UreE 1~~~
MFRAIAIIRAHEVIDAVPASHIVLERDERHLRRKAITLENGEKILADFAEPVVLEHGDRLVLDDGREIEIRAASEELYEI
RGRDPRHIAELAWHIGNRHLAAQIETDHIFILRDHVIRVMLEGLGATVTDVVAIFSPLRGAYSGGHQHHHGHDHDHGHHG
HDHDHHHPDHE
>Q8G2P7 ~~~ureE1~~~Urease accessory protein UreE 1~~~
MFRAIAIIRAHEVIDAVPASHIVLERDERHLRRKAITLENGEKILADFAEPVVLEHGDRLVLDDGREIEIRAASEELYEI
RGRDPRHIAELAWHIGNRHLAAQIETDRIFILRDHVIRVMLEGLGATVTDVVAIFSPLRGAYSGGHQHHHGHDHDHHHPD
HE
>Q9RHM3 ~~~ureE~~~Urease accessory protein UreE~~~COG2371
MIITAIDTNIYDQPEFVEGRDVIGVRFEDLVLDKRIQRVALPGGEELGLRLNHGHPILREGDVLKADDKTVFVVEIIPTD
VLVITPSDIHQMGFVAHSLGNRHLPAQFSKPGELTEKAAMIVQYDHTVVSFLDEHGIEYQRTELVPPIPFRHSGHTH
>Q09064 ~~~ureE~~~Urease accessory protein UreE~~~COG2371
MIIERLVGNLRDLNPLDFSVDHVDLEWFETRKKIARFKTRQGKDIAIRLKDAPKLGLSQGDILFKEEKEIIAVNILDSEV
IHIQAKSVAEVAKICYEIGNRHAALYYGESQFEFKTPFEKPTLALLEKLGVQNRVLSSKLDSKERLTVSMPHSEPNFKVS
LASDFKVVVK
>P18317 ~~~ureE~~~Urease accessory protein UreE~~~
MLYLTQRLEIPAAATASVTLPIDVRVKSRVKVTLNDGRDAGLLLPRGLLLRGGDVLSNEEGTEFVQVIAADEEVSVVRCD
DPFMLAKACYHLGNRHVPLQIMPGELRYHHDHVLDDMLRQFGLTVTFGQLPFEPEAGAYASESHGHHHAHHDHHAHSH
>P50049 ~~~ureE~~~Urease accessory protein UreE~~~
MLITKIVGHIDDYESSDKKVDWLEVEWEDLNKRILRKETENGTDIAIKLENSGTLRYGDVLYESDDTLIAIRTKLEKVYV
IKPQTMQEMGKMAFEIGNRHTMCIIEDDEILVRYDKTLEKLIDEVGVSYEQSERRFKEPFKYRGHQH
>Q7A429 ~~~ureE~~~Urease accessory protein UreE~~~
MIVEEIQGNIANLSNSEKQKHVEKVYLENSDLVKRIQRVVTDHGTEIGIRLKQPIDLQYGDILYADDHNMIIVDVNSEDL
LVIQPRTLQEMGDIAHQLGNRHLPAQFTETEMLVQYDYLVEDLLKSLGIPYVREDRKVNKAFRHIGHSHD
>Q79VJ2 ~~~ureF~~~Urease accessory protein UreF~~~COG0830
MDLDADFLLLHLSDSALPTGAFAHSFGFETYMDAERITNAEEFQDWLKVLLKVQLTSSDALAMRMFYATPTVSELKRLDE
RLFAGTPAREIREANARMGTRMAEIVAETYSVPLIVEYLELIQHRELSGHPALALALATHSAGIDVDRAIHAHLTATVSS
LIQNAVRGIPLGQMAGQRVMFAMREHIGAAVKRSANLDEIDFCSGDPGLDISQMVHETQRARLFMS
>Q09065 ~~~ureF~~~Urease accessory protein UreF~~~COG0830
MDKGKSVKSTEKSVGMPPKTPKTDNNAHVDNEFLILQVNDAVFPIGSYTHSFGLETYIQQKKVTNKESALEYLKANLSSQ
FLYTEMLSLKLTYESALQQDLKKILGVEEVIMLSTSPMELRLANQKLGNRFIKTLQAMNELDMGEFFNAYAQKTKDPTHA
TSYGVFAASLGIELKKALRHYLYAQTSNMVINCVKSVPLSQNDGQKILLSLQSPFNQLIEKTLELDESHLCTASVQNDIK
AMQHESLYSRLYMS
>P18318 ~~~ureF~~~Urease accessory protein UreF~~~
MSTAEQRLRLMQLASSNLPVGGYSWSQGLEWAVEAGWVLDVAAFERWQRRQMTEGFFTVDLPLFARLYRACEQGDIAAAQ
RWTAYLLACRETRELREEERNRGAAFARLLSDWQPDCPPPWRSLCQQSQLAGMAWLGVRWRIALPEMALSLGYSWIESAV
MAGVKLVPFGQQAAQQLILRLCDHYAAEMPRALAAPDGDIGSATPLAAIASARHETQYSRLFRS
>A6TE44 ~~~ureF~~~Urease accessory protein UreF~~~
MSTAEQRLRLMQLASSNLPVGGYSWSQGLEWAVEAGWVPDVAAFERWQRRQMTEGFFTVDLPLFARLYRACEQGDIAAAQ
RWTAYLLACRETRELREEERNRGAAFARLLSDWQPDCPPPWRSLCQQSQLAGMAWLGVRWRIALPEMALSLGYSWIESAV
MAGVKLVPFGQQAAQQLILRLCDHYAAEMPRALATPDGDIGSATPLAAIASARHETQYSRLFRS
>P17091 ~~~ureF~~~Urease accessory protein UreF~~~COG0830
MMLADLRLYQLVSPSLPVGAFTYSQGLEWAIEKGWVCSAETLSDWLSAQMTGTLATLELPILRQLQTSLAKGDSDTVKYW
CDFMVASRETKELRQEERQRGIAFARLLPQLGIELDDTLQQRVKQTQLMAFALAAVHWHIDSEKLCCAYVWGWLENTVMS
GVKLVPLGQSAGQKMLFALAEQIPAIVELSAHWPQEDIGSFTPAQVIASSRHETQYTRLFRS
>Q79VJ1 ~~~ureG~~~Urease accessory protein UreG~~~COG0378
MGPIRIGVGGPVGAGKTQLVERITRALIDEVSMAAITNDIYTIEDAKILAANGVLPEERIVGIETGGCPHTAIREDTSMN
DAAIKDLVERFPDLELIFVESGGDNLSATFSPELVDFSIYIIDVAQGEKIPRKAGQGMIKSDLFIINKTDLAPYVGANLD
VMVEDAKAFRKNKPFCLTNLRTDDGLDKVLEWIRHEVMMQDLQEA
>Q03287 ~~~ureG~~~Urease accessory protein UreG~~~
MQEYNQPLRIGVGGPVGSGKTALLEVLCKAMRDTYQIAVVTNDIYTQEDAKILTRAEALDADRIIGVETGGCPHTAIRED
ASMNLAAVEELAIRHKNLDIVFVESGGDNLSATFSPELADLTIYVIDVAEGEKIPRKGGPGITHSDLLVINKIDLAPYVG
ASLEVMEADTARMRPVKPYVFTNLKKKVGLETIIEFIIDKGMLGR
>Q09066 ~~~ureG~~~Urease accessory protein UreG~~~COG0378
MVKIGVCGPVGSGKTALIEALTRHMSKDYDMAVITNDIYTKEDAEFMCKNSVMPRERIIGVETGGCPHTAIREDASMNLE
AVEEMHGRFPNLELLLIESGGDNLSATFNPELADFTIFVIDVAEGDKIPRKGGPGITRSDLLVINKIDLAPYVGADLKVM
ERDSKKMRGEKPFIFTNIRAKEGLDDVIAWIKRNALLED
>P18319 ~~~ureG~~~Urease accessory protein UreG~~~
MNSYKHPLRVGVGGPVGSGKTALLEALCKAMRDTWQLAVVTNDIYTKEDQRILTEAGALAPERIVGVETGGCPHTAIRED
ASMNLAAVEALSEKFGNLDLIFVESGGDNLSATFSPELADLTIYVIDVAEGEKIPRKGGPGITKSDFLVINKTDLAPYVG
ASLEVMASDTQRMRGDRPWTFTNLKQGDGLSTIIAFLEDKGMLGK
>P9WFE3 ~~~ureG~~~Urease accessory protein UreG~~~COG0378
MATHSHPHSHTVPARPRRVRKPGEPLRIGVGGPVGSGKTALVAALCRQLRGELSLAVLTNDIYTTEDADFLRTHAVLPDD
RIAAVQTGGCPHTAIRDDITANLDAIDELMAAHDALDLILVESGGDNLTATFSSGLVDAQIFVIDVAGGDKVPRKGGPGV
TYSDLLVVNKTDLAALVGADLAVMARDADAVRDGRPTVLQSLTEDPAASDVVAWVRSQLAADGV
>Q06206 ~~~ureG~~~Urease accessory protein UreG~~~COG0378
MQEYNQPLRIGVGGPVGSGKTALLEVLCKAMRDSYQIAVVTNDIYTQEDAKILTRAQALDADRIIGVETGGCPHTAIRED
ASMNLAAVEELAMRHKNLDIVFVESGGDNLSATFSPELADLTIYVIDVAEGEKIPRKGGPGITHSDLLVINKIDLAPYVG
ASLEVMEADTAKMRPVKPYVFTNLKEKVGLETIIDFIIDKGMLRR
>Q9RP19 ~~~ureG~~~Urease accessory protein UreG~~~
MKTIHLGIGGPVGSGKTTLVKTLSEALKEEYSIAVITNDIYTREDANFLINENILEKDRIIGVETGGCPHTAIREDASMN
FEAIEELKNRFDDLEIILLESGGDNLSATFSPELVDAFIYVIDVSEGGDIPRKGGPGVTRSDFLMVNKTELAPYVGVDLD
TMKNDTIKARNGRPFTFANIKTKKGLDEIIAWIKSDLLLEGKTNESASESK
>Q7A427 ~~~ureG~~~Urease accessory protein UreG~~~
MANPIKIGIGGPVGAGKTQLIEKVVKRLSKEMSIGVITNDIYTKEDEKILVNSGVLPESRIIGVETGGCPHTAIREDASM
NFAAIDELLERHDDIELIFIESGGDNLAATFSPELVDFSIYIIDVAQGEKIPRKGGQGMIKSDFFVINKTDLAPYVGASL
EQMAEDTKVFRGKRPFTFTNLKTDEGLDEVIDWIERDTLLKGLS
>Q09067 ~~~ureH~~~Urease accessory protein UreH~~~COG0829
MNTYAQESKLRLKTKIGADGRCVIEDNFFTPPFKLMAPFYPKDDLAEIMLLAVSPGMMRGDAQDVQLNIGPNCKLRITSQ
SFEKIHNTEDGFASRDMHIVVGENAFLDFAPFPLIPFENAHFKGNTTISLRSSSQLLYSEIIVAGRVARNELFKFNRLHT
KISILQDEKPIYYDNTILDPKTTDLNNMCMFDGYTHYLNLVLVNCPIELSGVRECIEESEGVDGAVSETASSHLCVKALA
KGSEPLLHLREKIARLVTQTTTQKV
>P56874 ~~~ureI~~~Acid-activated urea channel~~~
MLGLVLLYVGIVLISNGICGLTKVDPKSTAVMNFFVGGLSIVCNVVVITYSALHPTAPVEGAEDIVQVSHHLTSFYGPAT
GLLFGFTYLYAAINHTFGLDWRPYSWYSLFVAINTVPAAILSHYSDMLDDHKVLGITEGDWWAIIWLAWGVLWLTAFIEN
ILKIPLGKFTPWLAIIEGILTAWIPAWLLFIQHWV
>Q09068 ~~~ureI~~~Acid-activated urea channel~~~
MLGLVLLYVGIVLISNGICGLTKVDPKSTAVMNFFVGGLSIICNIVVITYSALHPTAPVEGAEDIAQVSHHLTSFYGPAT
GLLFGFTYLYAAINHTFGLDWRPYSWYSLFVAINTIPAAILSHYSDMLDDHKVLGITEGDWWAIIWLAWGVLWLTAFIEN
ILKIPLGKFTPWLAIIEGILTAWIPAWLLFIQHWV
>Q02458 ~~~ureR~~~Urease operon transcriptional activator~~~COG2207
MEYKHILSSNQISLKTFYIENPMIAIVYGAKGEICINGQTITVTTNLTLIIPKYSQVSCDVTNFFPTKPIELHTLVLSET
ELQSVFSLLKPLIKSGAPITRHLPDYHLSTPEVVKTNFTLLQQCLPLEHGTPSQETLFMQQSLFFILLAVYHEGVDILNI
FRFNYDEPKNQAITHLITQDPQRKWHLEDVAKTLYTTPSTLRRHLSKEGVSFCQLLLDVRMPIALNYLTFSNYSVFQISH
RCGFGSNAYFCDAFKRKYGMTPSQFRTQSRQANDPNAIATMASQNDESIKKVF
>Q1AS69 2.1.3.16~~~~~~Ureidoglycine carbamoyltransferase~~~COG0540
MQKEAVRDAANLPSIRSLVRAVGFEGRSLHSINDLTNDQIYALFELARALEPFHRSSVDLLRGSVMVTLFFQPSTRTRMS
FETAMHRLGGAVVTEANPLVSSSAAKEESLADTMRTISKYANVIVLRHPDDVAAREGASYSESPVINGGWGDWEHPTQAL
LDLYTLWRTHGGVEGAKVVVATPDMVHARTGHSMAYGLARLGAEVTLASRSDYRAPEEVIEGLRRVEGAKVREVFDLDQD
GFNDLVSEMDLVYLPGCSAPKGEAAEEFKKMMDEYYVRLETLEKVRESEGRIIRVTHTLPRRPGEMDLRIDDTPHQQYFE
AIAYSVAIRMALVAAIVGA
>O31521 3.2.1.172~~~yesR~~~Unsaturated rhamnogalacturonyl hydrolase YesR~~~COG4225
MAQLIFDEEKVTSVIDRIVKRTFQMDFAWDWPGGVAFYGVAEAYEATENEEYINLLKTWVDEQLEDGLPPLSINGVSIGH
TLLFLHKVTGDDVYLETAAEMAEYVLHKAPRFGEGILQHTVNAAEYVFPEQAWADTLMMAGLFMLRIGRVMEREDYFEDG
LRQFHGHEDVLQDPVTNLYYHAWDNKAQNHLSGIYWGRANGWAALTMAKALPLIEVTHPSFMIIDGSLRDQLSALVRLQD
ESGLWHTILDDPDSYLEVSASAGIASALMSSGKLYTKYVQKSLAAILDAVEEDGRVSRVSAGTAVMKNAEGYKQVPYKRI
QGWGQGLALTFLADVLKTKKRLYQ
>O34559 3.2.1.172~~~yteR~~~Unsaturated rhamnogalacturonyl hydrolase YteR~~~COG4225
MGSMDQSIAVKSPLTYAEALANTIMNTYTVEELPPANRWHYHQGVFLCGVLRLWEATGEKRYFEYAKAYADLLIDDNGNL
LFRRDELDAIQAGLILFPLYEQTKDERYVKAAKRLRSLYGTLNRTSEGGFWHKDGYPYQMWLDGLYMGGPFALKYANLKQ
ETELFDQVVLQESLMRKHTKDAKTGLFYHAWDEAKKMPWANEETGCSPEFWARSIGWYVMSLADMIEELPKKHPNRHVWK
NTLQDMIKSICRYQDKETGLWYQIVDKGDRSDNWLESSGSCLYMYAIAKGINKGYLDRAYETTLLKAYQGLIQHKTETSE
DGAFLVKDICVGTSAGFYDYYVSRERSTNDLHGAGAFILAMTELEPLFRSAGK
>D0VWQ1 1.7.3.3~~~uox~~~Uricase~~~
MTATAETSTGTKVVLGQNQYGKAEVRLVKVTRNTARHEIQDLNVTSQLRGDFEAAHTAGDNAHVVATDTQKNTVYAFARD
GFATTEEFLLRLGKHFTEGFDWVTGGRWAAQQFFWDRINDHDHAFSRNKSEVRTAVLEISGSEQAIVAGIEGLTVLKSTG
SEFHGFPRDKYTTLQETTDRILATDVSARWRYNTVEVDFDAVYASVRGLLLKAFAETHSLALQQTMYEMGRAVIETHPEI
DEIKMSLPNKHHFLVDLQPFGQDNPNEVFYAADRPYGLIEATIQREGSRADHPIWSNIAGFC
>A2RJJ9 ~~~uriP~~~Uridine/deoxyuridine transporter~~~COG0477
MGENTVPQKSSDNVGSIVALMVALLVAIFAFQLNASMLSPALVTMQAQLHTTASSIALTQTIFFTAAALFALFLPRLADL
IGRKKVLIGMLTLTMIGCLISGFATNVGILMIGRILQGAAGPVVPLCLIILHVKVRDEKRYAKLMAILTSINGGIAGVDA
LAGGWLVSHGGFRSVFFVMGITAALAILLVSFGTQESTAKDTPKMDWTGVILLVVAMGALLSAVNALQGSFGNLGLPNWL
LASILALLGLICFVGFWQVEKRVNHPMVPIHYLKQRRTWGLLITTLLTMTGVFAIMNGIIPALGQDGKFGLGLGADMVSL
VTLTPYALAGLFFGPVSGFLAARFGFAKVLRVGLLTTIIGIVLAVAGVLQPSIWLLLLVSTFIGITYAGITNIMLNGLGI
VLSPEDNPGYLPGLNAGMFNLGAGLSFIILYAVPTVLHTSVGGSSSGYISGIVTGLILVIIAFFTSFLIPDSKDCKINLE
>Q59190 2.7.1.48~~~udk~~~Uridine kinase~~~
MAKIIGISGGSGSGKTTVVSKISEFIPEFVLISQDNYYKSVGDYEHEFSKVNFDHPDAFDNNLFYEHLKNLKKNSPIDMP
LYDFINHKRQLKTVLVVPTPVVIVEGIMIFVEERVRNLIDLKIYIDTPNDIRFIRRLRRDISKRGRTVESVIDQYLNTTR
WGYYRFIEPTKEYADLIIPEGGHNDKALYVLSTFLKSLSKEGLDFT
>P67411 2.7.1.48~~~udk~~~Uridine kinase~~~
MKATTIIGIAGGSGSGKTTVTNEIMKNLEGHSVALLAQDYYYKDQKHLTFDERLETNYDHPFAFDNDLLIENLKDLKNGK
AVEVPTYDYASHTRSDITIDFKPKDVIIVEGIFALENKVLRDMMDVKIYVDTDADLRILRRLTRDTKERGRSMDSVINQY
LSVVRPMHDQFIEPTKKYADIIIPEGGSNKVAIDIMTTKIQSLVSKQ
>Q5XBI8 2.7.1.48~~~udk~~~Uridine kinase~~~
MLKKPIIIGVTGGSGGGKTSVSRAILDSFPNARIAMIQHDSYYKDQSHMSFEERVKTNYDHPLAFDTDFMIQQLKELLAG
RPVDIPIYDYKKHTRSNTTFRQDPQDVIIVEGILVLEDERLRDLMDIKLFVDTDDDIRIIRRIKRDMMERGRSLESIIDQ
YTSVVKPMYHQFIEPSKRYADIVIPEGVSNVVAIDVINSKIASILGEV
>P67413 2.7.1.48~~~udk~~~Uridine kinase~~~COG0572
MQNRPIIIGVTGGSGGGKTSVSRAILSHFPDEKISMIEHDSYYKDQSHLTFEERVKTNYDHPFAFDTDLMIEQIKELLAG
RPVDIPTYDYTEHTRSSKTYRQEPQDVFIVEGILVLEDKRLRDLMDIKIFVDTDDDVRIIRRIKRDMEERGRSLDSVINQ
YLGVVKPMYHQFIESTKRYADIVIPEGVSNTVAIDLLTTKIAKILEEARNSK
>Q5SKR5 2.7.1.48~~~udk~~~Uridine kinase~~~COG0572
MSAPKPFVIGIAGGTASGKTTLAQALARTLGERVALLPMDHYYKDLGHLPLEERLRVNYDHPDAFDLALYLEHAQALLRG
LPVEMPVYDFRAYTRSPRRTPVRPAPVVILEGILVLYPKELRDLMDLKVFVDADADERFIRRLKRDVLERGRSLEGVVAQ
YLEQVKPMHLHFVEPTKRYADVIVPRGGQNPVALEMLAAKALARLARMGAA
>Q7CRQ0 1.1.1.203~~~udh~~~Uronate dehydrogenase~~~COG0451
MKRLLVTGAAGQLGRVMRERLAPMAEILRLADLSPLDPAGPNEECVQCDLADANAVNAMVAGCDGIVHLGGISVEKPFEQ
ILQGNIIGLYNLYEAARAHGQPRIVFASSNHTIGYYPQTERLGPDVPARPDGLYGVSKCFGENLARMYFDKFGQETALVR
IGSCTPEPNNYRMLSTWFSHDDFVSLIEAVFRAPVLGCPVVWGASANDAGWWDNSHLGFLGWKPKDNAEAFRRHITETTP
PPDPNDALVRFQGGTFVDNPIFKQS
>Q88NN6 1.1.1.203~~~udh~~~Uronate dehydrogenase~~~COG0451
MTTTPFNRLLLTGAAGGLGKVLRERLKGYAEVLRLSDISPMAPAAGPHEEVITCDLADKAAVHTLVEGVDAIIHFGGVST
EHAFEEILGPNICGVFHVYEAARKHGVKRIIFASSNHTIGFYRQDERIDAHAPRRPDSYYGLSKCYGEDVASFYFDRYGI
ETVSIRIGSSFPQPQNLRMLCTWLSYDDLVQLIERGLFTPGVGHTIVYGASDNRTVWWDNRHAAHLGYVPKDSSETFRAA
VEAQPAPAADDPSMVYQGGAFAVAGPFN
>Q888H1 1.1.1.203~~~udh~~~Uronate dehydrogenase~~~COG0451
MASAHTTQTPFNRLLLTGAAGGLGKVLRETLRPYSHILRLSDIAEMAPAVGDHEEVQVCDLADKDAVHRLVEGVDAILHF
GGVSVERPFEEILGANICGVFHIYEAARRHGVKRVIFASSNHVIGFYKQNETIDAHSPRRPDSYYGLSKSYGEDMASFYF
DRYGIETVSIRIGSSFPEPQNRRMMSTWLSFDDLTRLLERALYTPDVGHTVVYGVSDNKTVWWDNRFASKLDYAPKDSSE
VFRAKVDAQPMPADDDPAMVYQGGAFVASGPFGDK
>Q7A4A4 2.7.7.-~~~~~~Probable uridylyltransferase SA1974~~~
MLDKNQLAKYKQDHLCEYEKIMSNNEKEALEEKVASLDLDFIAKLYNDLYINKKTIDDVSAVSEVKYDIKSQMSDDEIKR
LEEQGLQAIKEGQFAVLLMAGGQGTRLGYKGPKGSFEIEGVSLFELQANQLKTLNHQSGHTIQWYIMTSDINHEETLAYF
EAHSYFGYDQEAIHFFKQDNIVALSEEGKLILNQQGRIMETPNGNGGVFKSLDKAGYLEEMSNNGVKYIFLNNIDNVLVK
VLDPLFAGFTVEHDYDITSKTIQPKPGESVGRLVNVDCKDTVLEYSELDPEVANQFNNANIGIHAFKLGFILNAVNRELP
YHLAIKNLKQLDENFGVIEQPTLKFELFYFDIFTYGTSFVTLQVPREEEFSPLKNKEGKDSVATATEDLRRMGLI
>P08390 ~~~usg~~~USG-1 protein~~~COG0136
MSEGWNIAVLGATGAVGEALLETLAERQFPVGEIYALARNESAGEQLRFGGKTITVQDAAEFDWTQAQLAFFVAGKEATA
AWVEEATNSGCLVIDSSGLFALEPDVPLVVPEVNPFVLTDYRNRNVIAVPDSLTSQLLAALKPLIDQGGLSRISVTSLIS
ASAQGKKAVDALAGQSAKLLNGIPIDEEDFFGRQLAFNMLPLLPDSEGSVREERRIVDEVRKILQDEGLMISASVVQAPV
FYGHAQMVNFEALRPLAAEEARDAFVQGEDIVLSEENEFPTQVGDASGTPHLSVGCVRNDYGMPEQVQFWSVADNVRFGG
ALMAVKIAEKLVQEYLY
>O87014 ~~~usg~~~USG-1 protein homolog~~~
MSQPLNVAVVGATGSVGEALVGLLDERDFPLHRLHLLASAESAGQRMGFAESSLRVGDVDSFDFSSVGLAFFAAAAEVSR
AHAERARAAGCSVIDLSGALEPSVAPPVMVSVNAERLASQAAPFLLSSPCAVAAELCEVLAPLLATLDCRQLNLTACLSV
SSLGREGVKELARQTAELLNARPLEPRLFDRQIAFNLLAQVGAVDAEGHSAIERRIFAEVQALLGERIGPLNVTCIQAPV
FFGDSLSVTLQCAEPVDLAAVTRVLDATKGIEWVGEGDYPTVVGDALGQDETYVGRVRAGQADPCQVNLWIVSDNVRKGA
ALNAVLLGELLIKHYL
>P07024 ~~~ushA~~~Protein UshA~~~COG0737
MKLLQRGVALALLTTFTLASETALAYEQDKTYKITVLHTNDHHGHFWRNEYGEYGLAAQKTLVDGIRKEVAAEGGSVLLL
SGGDINTGVPESDLQDAEPDFRGMNLVGYDAMAIGNHEFDNPLTVLRQQEKWAKFPLLSANIYQKSTGERLFKPWALFKR
QDLKIAVIGLTTDDTAKIGNPEYFTDIEFRKPADEAKLVIQELQQTEKPDIIIAATHMGHYDNGEHGSNAPGDVEMARAL
PAGSLAMIVGGHSQDPVCMAAENKKQVDYVPGTPCKPDQQNGIWIVQAHEWGKYVGRADFEFRNGEMKMVNYQLIPVNLK
KKVTWEDGKSERVLYTPEIAENQQMISLLSPFQNKGKAQLEVKIGETNGRLEGDRDKVRFVQTNMGRLILAAQMDRTGAD
FAVMSGGGIRDSIEAGDISYKNVLKVQPFGNVVVYADMTGKEVIDYLTAVAQMKPDSGAYPQFANVSFVAKDGKLNDLKI
KGEPVDPAKTYRMATLNFNATGGDGYPRLDNKPGYVNTGFIDAEVLKAYIQKSSPLDVSVYEPKGEVSWQ
>P22865 ~~~~~~Secreted 45 kDa protein~~~COG3883
MKKKIISAILMSTVILSAAAPLSGVYADTNSDIAKQDATISSAQSAKAQAQAQVDSLQSKVDSLQQKQTSTKAQIAKIES
EAKALNAQIATLNESIKERTKTLEAQARSAQVNSSATNYMDAVVNSKSLTDVIQKVTAIATVSSANKQMLEQQEKEQKEL
SQKSETVKKNYNQFVSLSQSLDSQAQELTSQQAELKVATLNYQATIATAQDKKQALLDEKAAAEKAAQEAAKKQAAYEAQ
QKEAAQAQAASTAATAKAVEAATSSASASSSQAPQVSTSTDNTTSNASASNSSNSSSNSSSSSSSSSSSSSSSSNSNAGG
NTNSGTSTGNTGGTTTGGSGINSSPIGNPYAGGGCTDYVWQYFAAQGIYIRNIMPGNGGQWASNGPAQGVLHVVGAAPGV
IASSFSADFVGYANSPYGHVAIVKSVNSDGTITIKEGGYGTTWWGHERTVSASGVTFLMPN
>P45680 ~~~uspA2~~~Universal stress protein A homolog 2~~~COG0589
MDNYKKILVALALDPNSDRPLVEKAKELSANRDAQLYLIHAVEHLSSYGAAYGVAAGVDVEDMLLEEAKKRMNEIASQLN
ISSDHQIVKVGPAKFLILEQAKNWGVDLIIVGSHGRHGIQLLLGSTSNAVLHGAKCDVLAVRIKGS
>P0AED0 ~~~uspA~~~Universal stress protein A~~~COG0589
MAYKHILIAVDLSPESKVLVEKAVSMARPYNAKVSLIHVDVNYSDLYTGLIDVNLGDMQKRISEETHHALTELSTNAGYP
ITETLSGSGDLGQVLVDAIKKYDMDLVVCGHHQDFWSKLMSSARQLINTVHVDMLIVPLRDEEE
>P44880 ~~~uspA~~~Universal stress protein A homolog~~~COG0589
MYKHILVAVDLSEESPILLKKAVGIAKRHDAKLSIIHVDVNFSDLYTGLIDVNMSSMQDRISTETQKALLDLAESVDYPI
SEKLSGSGDLGQVLSDAIEQYDVDLLVTGHHQDFWSKLMSSTRQVMNTIKIDMLVVPLRDE
>P0A8S5 ~~~uspB~~~Universal stress protein B~~~
MISTVALFWALCVVCIVNMARYFSSLRALLVVLRNCDPLLYQYVDGGGFFTSHGQPNKQVRLVWYIYAQRYRDHHDDEFI
RRCERVRRQFILTSALCGLVVVSLIALMIWH
>P46888 ~~~uspC~~~Universal stress protein C~~~COG0589
MSYSNILVAVAVTPESQQLLAKAVSIARPVKGHISLITLASDPEMYNQLAAPMLEDLRSVMHEETQSFLDKLIQDAGYPV
DKTFIAYGELSEHILEVCHKHHFDLVICGNHNHSFFSRASCSAKRVIASSEVDVLLVPLTGD
>P0AAB8 ~~~uspD~~~Universal stress protein D~~~COG0589
MAYKHIGVAISGNEEDALLVNKALELARHNDAHLTLIHIDDGLSELYPGIYFPATEDILQLLKNKSDNKLYKLTKNIQWP
KTKLRIERGEMPETLLEIMQKEQCDLLVCGHHHSFINRLMPAYRGMINKMSADLLIVPFIDK
>P0AAC0 ~~~uspE~~~Universal stress protein E~~~COG0589
MAMYQNMLVVIDPNQDDQPALRRAVYLHQRIGGKIKAFLPIYDFSYEMTTLLSPDERTAMRQGVISQRTAWIHEQAKYYL
NAGVPIEIKVVWHNRPFEAIIQEVISGGHDLVLKMAHQHDRLEAVIFTPTDWHLLRKCPSPVWMVKDQPWPEGGKALVAV
NLASEEPYHNALNEKLVKETIELAEQVNHTEVHLVGAYPVTPINIAIELPEFDPSVYNDAIRGQHLLAMKALRQKFGINE
NMTHVEKGLPEEVIPDLAEHLQAGIVVLGTVGRTGISAAFLGNTAEQVIDHLRCDLLVIKPDQYQTPVELDDEEDD
>P44195 ~~~uspE~~~Universal stress protein E homolog~~~COG0589
MKFKNILVVLNPSNEKQYALARAVRLVEEQKNETKVKITALLSVYDLSYEMSALLSSEERSEMHQQVIEKHRHAVQYYLD
KYANPEIELQSHIVWNSNEADAINEEVENNNYDLVVKYTKDEEKLTSLIFTPIDWQLLRKCPIPVLMVRDGDWKHPRRIL
VAVNVSGEQEYQDEFNQELVETGISLAENLNRGNVHLVAAYPSAPINMAIDLPEFNTSGYENGIRGQHLINMKALRQKFG
IDEDHTHVREGFPEEVIPEVAKEIEAELVILGTVGRTGLSAALLGNTAEHVISKLSCNLLGIKPSKKDD
>Q8ZP84 ~~~uspE~~~Universal stress protein E~~~
MAMYQNMLVVIDPNQDDQPALRRAVYLHQRIGGKIKAFLPIYDFSYEMTTLLSPDERTAMRQGVISQRTAWIREQAKYYL
EAGVPIEIKVVWHNRPFEAIIQEVIAGSHDLVLKMAHQHDRLEAVIFTPTDWHLLRKCPSPVWMVKDQPWPEGGKALVAV
NLASEEPYHNALNEKLVKETLQLAEQVNHTEVHLVGAYPVTPINIAIELPEFDPSVYNDAIRGQHLLAMKALRQKFSIDE
KVTHVEKGLPEEVIPDLAEHLQAGIVVLGTVGRTGLSAAFLGNTAEQVIDHLRCDLLVIKPDEYQTPVELDDEDD
>P37903 ~~~uspF~~~Universal stress protein F~~~COG0589
MNRTILVPIDISDSELTQRVISHVEEEAKIDDAEVHFLTVIPSLPYYASLGLAYSAELPAMDDLKAEAKSQLEEIIKKFK
LPTDRVHVHVEEGSPKDRILELAKKIPAHMIIIASHRPDITTYLLGSNAAAVVRHAECSVLVVR
>P67091 ~~~uspF~~~Universal stress protein F~~~
MNRTILVPIDISDSELTQRVISHVEAEAKIDDAKVHFLTVIPSLPYYASLGLAYSAELPAMDDLKAEAKSQLEAIIKKFN
LPADRVQAHVAEGSPKDKILEMAKKLPADMVIIASHRPDITTYLLGSNAAAVVRHAECSVLVVR
>P39177 ~~~uspG~~~Universal stress protein UP12~~~COG0589
MYKTIIMPVDVFEMELSDKAVRHAEFLAQDDGVIHLLHVLPGSASLSLHRFAADVRRFEEHLQHEAQERLQTMVSHFTID
PSRIKQHVRFGSVRDEVNELAEELGADVVVIGSRNPSISTHLLGSNASSVIRHANLPVLVVR
>P42597 3.6.1.-~~~ygjP~~~UTP pyrophosphatase~~~COG1451
MSNLTYLQGYPEQLLSQVRTLINEQRLGDVLAKRYPGTHDYATDKALWQYTQDLKNQFLRNAPPINKVMYDNKIHVLKNA
LGLHTAVSRVQGGKLKAKVEIRVATVFRNAPEPFLRMIVVHELAHLKEKEHNKAFYQLCCHMEPQYHQLEFDTRLWLTQL
SLGQNKI
>Q72CX3 ~~~~~~Urea transporter DVU1160~~~COG4413
MFGEQLLKNPLIEFCDSVCRGCGQVMFQNNTVTGLLFFAGIFYNSTTLGVCAVLGTAASTLTAQLLGVDKPLVRAGLFGF
NGTLAGIALPFFFNYEPAMLGYVALNGAFTTIIMASLLNFLGKWGVPALTAPFVLATWLLMFGVYKLSLFHPGALIAPAL
PSVAGLADMGTVTGRTFMEGLFKGVGEVMFQDNIVTGVIFVVAILVNSRISALFAVIGSLVGLCTALIMHSPETPVRLGL
YGFNSVLCGIAMGGIFFYLNIRTFLYALGCMVLGAIATGAFSVLLSPIGMPALTWPFIVVTWLFLFAGSMFRNIAQVPTE
KAGTPEDNLRSLAIGSR
>P43672 3.6.1.-~~~uup~~~ATP-binding protein Uup~~~COG0488
MSLISMHGAWLSFSDAPLLDNAELHIEDNERVCLVGRNGAGKSTLMKILNREQGLDDGRIIYEQDLIVARLQQDPPRNVE
GSVYDFVAEGIEEQAEYLKRYHDISRLVMNDPSEKNLNELAKVQEQLDHHNLWQLENRINEVLAQLGLDPNVALSSLSGG
WLRKAALGRALVSNPRVLLLDEPTNHLDIETIDWLEGFLKTFNGTIIFISHDRSFIRNMATRIVDLDRGKLVTYPGNYDQ
YLLEKEEALRVEELQNAEFDRKLAQEEVWIRQGIKARRTRNEGRVRALKAMRRERGERREVMGTAKMQVEEASRSGKIVF
EMEDVCYQVNGKQLVKDFSAQVLRGDKIALIGPNGCGKTTLLKLMLGQLQADSGRIHVGTKLEVAYFDQHRAELDPDKTV
MDNLAEGKQEVMVNGKPRHVLGYLQDFLFHPKRAMTPVRALSGGERNRLLLARLFLKPSNLLILDEPTNDLDVETLELLE
ELIDSYQGTVLLVSHDRQFVDNTVTECWIFEGGGKIGRYVGGYHDARGQQEQYVALKQPAVKKTEEAAAAKAETVKRSSS
KLSYKLQRELEQLPQLLEDLEAKLEALQTQVADASFFSQPHEQTQKVLADMAAAEQELEQAFERWEYLEALKNGG
>P46303 ~~~pdg~~~Ultraviolet N-glycosylase/AP lyase~~~COG0177
METESTGTPTGETRLALVRRARRIDRILAETYPYAVAELDFETPFELLVATVLSAQTTDVRVNAATPALFARFPDAHAMA
AATEPELQELVRSTGFYRNKASAILRLSQELVGRHDGEVPARLEDLVALPGVGRKTAFVVLGNAFGQPGITVDTHFGRLA
RRLGFTDETDPGKGRARRGRPVPPARDWTMLSHRLIFHGRRVCHARRPACGRCPIARWCPSYAAGETDPERARALLAYEL
KPGREELLELLRAGRTAGAAGPRPRAGGXAPGLPAQPFR
>P0A698 ~~~uvrA~~~UvrABC system protein A~~~COG0178
MDKIEVRGARTHNLKNINLVIPRDKLIVVTGLSGSGKSSLAFDTLYAEGQRRYVESLSAYARQFLSLMEKPDVDHIEGLS
PAISIEQKSTSHNPRSTVGTITEIHDYLRLLFARVGEPRCPDHDVPLAAQTVSQMVDNVLSQPEGKRLMLLAPIIKERKG
EHTKTLENLASQGYIRARIDGEVCDLSDPPKLELQKKHTIEVVVDRFKVRDDLTQRLAESFETALELSGGTAVVADMDDP
KAEELLFSANFACPICGYSMRELEPRLFSFNNPAGACPTCDGLGVQQYFDPDRVIQNPELSLAGGAIRGWDRRNFYYFQM
LKSLADHYKFDVEAPWGSLSANVHKVVLYGSGKENIEFKYMNDRGDTSIRRHPFEGVLHNMERRYKETESSAVREELAKF
ISNRPCASCEGTRLRREARHVYVENTPLPAISDMSIGHAMEFFNNLKLAGQRAKIAEKILKEIGDRLKFLVNVGLNYLTL
SRSAETLSGGEAQRIRLASQIGAGLVGVMYVLDEPSIGLHQRDNERLLGTLIHLRDLGNTVIVVEHDEDAIRAADHVIDI
GPGAGVHGGEVVAEGPLEAIMAVPESLTGQYMSGKRKIEVPKKRVPANPEKVLKLTGARGNNLKDVTLTLPVGLFTCITG
VSGSGKSTLINDTLFPIAQRQLNGATIAEPAPYRDIQGLEHFDKVIDIDQSPIGRTPRSNPATYTGVFTPVRELFAGVPE
SRARGYTPGRFSFNVRGGRCEACQGDGVIKVEMHFLPDIYVPCDQCKGKRYNRETLEIKYKGKTIHEVLDMTIEEAREFF
DAVPALARKLQTLMDVGLTYIRLGQSATTLSGGEAQRVKLARELSKRGTGQTLYILDEPTTGLHFADIQQLLDVLHKLRD
QGNTIVVIEHNLDVIKTADWIVDLGPEGGSGGGEILVSGTPETVAECEASHTARFLKPML
>P9WQK7 ~~~uvrA~~~UvrABC system protein A~~~COG0178
MADRLIVKGAREHNLRSVDLDLPRDALIVFTGLSGSGKSSLAFDTIFAEGQRRYVESLSAYARQFLGQMDKPDVDFIEGL
SPAVSIDQKSTNRNPRSTVGTITEVYDYLRLLYARAGTPHCPTCGERVARQTPQQIVDQVLAMPEGTRFLVLAPVVRTRK
GEFADLFDKLNAQGYSRVRVDGVVHPLTDPPKLKKQEKHDIEVVVDRLTVKAAAKRRLTDSVETALNLADGIVVLEFVDH
ELGAPHREQRFSEKLACPNGHALAVDDLEPRSFSFNSPYGACPECSGLGIRKEVDPELVVPDPDRTLAQGAVAPWSNGHT
AEYFTRMMAGLGEALGFDVDTPWRKLPAKARKAILEGADEQVHVRYRNRYGRTRSYYADFEGVLAFLQRKMSQTESEQMK
ERYEGFMRDVPCPVCAGTRLKPEILAVTLAGESKGEHGAKSIAEVCELSIADCADFLNALTLGPREQAIAGQVLKEIRSR
LGFLLDVGLEYLSLSRAAATLSGGEAQRIRLATQIGSGLVGVLYVLDEPSIGLHQRDNRRLIETLTRLRDLGNTLIVVEH
DEDTIEHADWIVDIGPGAGEHGGRIVHSGPYDELLRNKDSITGAYLSGRESIEIPAIRRSVDPRRQLTVVGAREHNLRGI
DVSFPLGVLTSVTGVSGSGKSTLVNDILAAVLANRLNGARQVPGRHTRVTGLDYLDKLVRVDQSPIGRTPRSNPATYTGV
FDKIRTLFAATTEAKVRGYQPGRFSFNVKGGRCEACTGDGTIKIEMNFLPDVYVPCEVCQGARYNRETLEVHYKGKTVSE
VLDMSIEEAAEFFEPIAGVHRYLRTLVDVGLGYVRLGQPAPTLSGGEAQRVKLASELQKRSTGRTVYILDEPTTGLHFDD
IRKLLNVINGLVDKGNTVIVIEHNLDVIKTSDWIIDLGPEGGAGGGTVVAQGTPEDVAAVPASYTGKFLAEVVGGGASAA
TSRSNRRRNVSA
>P63383 ~~~uvrA~~~UvrABC system protein A~~~
MKEPSIVVKGARAHNLKDIDIELPKNKLIVMTGLSGSGKSSLAFDTIYAEGQRRYVESLSAYARQFLGQMDKPDVDTIEG
LSPAISIDQKTTSKNPRSTVATVTEIYDYIRLLYARVGKPYCPNHNIEIESQTVQQMVDRIMELEARTKIQLLAPVIAHR
KGSHEKLIEDIGKKGYVRLRIDGEIVDVNDVPTLDKNKNHTIEVVVDRLVVKDGIETRLADSIETALELSEGQLTVDVID
GEDLKFSESHACPICGFSIGELEPRMFSFNSPFGACPTCDGLGQKLTVDVDLVVPDKDKTLNEGAIEPWIPTSSDFYPTL
LKRVCEVYKINMDKPFKKLTERQRDILLYGSGDKEIEFTFTQRQGGTRKRTMVFEGVVPNISRRFHESPSEYTREMMSKY
MTELPCETCHGKRLSREALSVYVGGLNIGEVVEYSISQALNYYKNIDLSEQDQAIANQILKEIISRLTFLNNVGLEYLTL
NRASGTLSGGEAQRIRLATQIGSRLTGVLYVLDEPSIGLHQRDNDRLINTLKEMRDLGNTLIVVEHDDDTMRAADYLVDI
GPGAGEHGGQIVSSGTPQKVMKDKKSLTGQYLSGKKRIDVPEYRRPASDRKISIRGARSNNLKGIDVDIPLSIMTVVTGV
SGSGKSSLVNEVLYKSLAQKINKSKVKPGLYDKIEGIDQLDKIIDIDQSPIGRTPRSNPATYTGVFDDIRDVFAQTNEAK
IRGYQKGRFSFNVKGGRCEACKGDGIIKIEMHFLPDVYVPCEVCDGKRYNRETLEVTYKGKNIADILEMTVEEATQFFEN
IPKIKRKLQTLVDVGLGYVTLGQQATTLSGGEAQRVKLASELHKRSTGKSIYILDELTTGLHVDDISRLLKVLNRLVENG
DTVVIIEHNLDVIKTADYIIDLGPEGGSGGGTIVATGTPEDIAQTKSSYTGKYLKEVLERDKQNTEDK
>Q9WYV0 ~~~uvrA~~~UvrABC system protein A~~~COG0178
MNEIVVKGARVHNLKNITVRIPKNRLVVITGVSGSGKSSLAMDTIYAEGQRRYLESLSTYARQFLGNLKKPDVDEIEGLS
PAIAIDQKTVSHNPRSTVGTVTEIYDYLRVLYARIGKAHCPECGRPLEKKSIDEILQDLFNSFKEGSRIYILAPVATEKK
GTFKKEIEEFISKGFARIEIDGEIYRLEEVPELDKNKRHTVKLVVDRLILETRNEHRILDSLELAMREGKGFVEIRNVDT
GESKIFSENLMCPVCGIGFPEITPKLFSFNSPYGACPNCHGLGFTFEVDPSLVIDEEKSVLEGAIIPYRWDRRLSRWVAR
EIEKRGVSPHLPFKDLPEDVKEFILYGDDRFEGVVPKVQRWHRETESPEMKEWLEKNFIVQRTCSVCGGRRLNREALSVK
INGLNIHEFTELSISEELEFLKNLNLTEREREIVGELLKEIEKRLEFLVDVGLEYLTLSRSATTLSGGESQRIRLATQIG
SGLTGVIYVLDEPTIGLHPRDTERLIKTLKKLRDLGNTVIVVEHDEEVIRNADHIIDIGPGGGTNGGRVVFQGTVDELLK
NPDSSLTGEYLSGKRKITVNKTRRLPYASLKIKGVRHNNLKNIDVEIPLGVFVCVTGVSGSGKSSLVMETLYPALMNLLH
KTKLPAGEFDSIEGHENIDKMIAIDQSPIGRTPRSNPATYTKVFDEIRSLFAMTPAAKARGYNKSRFSFNLKGGRCEACQ
GQGYVKIEMLFLPDVYVECDVCKGKRYNRETLEITYKGKNISDILDMTVDEALEFFKNIPSIKRTLQVLHDVGLGYVKLG
QPATTLSGGEAQRIKLASELRKRDTGRTLYILDEPTVGLHFEDVRKLVEVLHRLVDRGNTVIVIEHNLDVIKNADHIIDL
GPEGGKEGGYIVATGTPEEIAKNPHSYTGRFLKNVL
>P56981 ~~~uvrB~~~UvrABC system protein B~~~
MVEGRFQLVAPYEPQGDQPQAIAKLVDGLRRGVKHQTLLGATGTGKTFTISNVIAQVNKPTLVIAHNKTLAGQLYSELKE
FFPHNAVEYFVSYYDYYQPEAYVPQTDTYIEKDAKINDEIDKLRHSATSALFERRDVIIVASVSCIYGLGSPEEYRELVV
SLRVGMEIERNALLRRLVDIQYDRNDIDFRGTFRVRGDVVEIFPASRDEHCIRVEFFGDEIERIREVDALTGKVLGEREH
VAIFPASHFVTREEKMRLAIQNIEQELEERLAELRAQGKLLEAQRLEQRTRYDLEMMREMGFCSGIENYSRHLALRPPGS
TPYTLLDYFPDDFLIIVDESHVTLPQLRGMYNGDRARKQVLVDHGFRLPSALDNRPLTFEEFEQKINQIIYVSATPGPYE
LEHSPGVVEQIIRPTGLLDPTIDVRPTKGQIDDLIGEIRERVERNERTLVTTLTKKMAEDLTDYLKEAGIKVAYLHSEIK
TLERIEIIRDLRLGKYDVLVGINLLREGLDIPEVSLVAILDADKEGFLRSERSLIQTIGRAARNANGHVIMYADTITKSM
EIAIQETKRRRAIQEEYNRKHGIVPRTVKKEIRDVIRATYAAEETEMYEAKPAAAMTKQEREELIRTLEAEMKEAAKALD
FERAAQLRDIIFELKAEG
>P37954 ~~~uvrB~~~UvrABC system protein B~~~COG0556
MKDRFELVSKYQPQGDQPKAIEKLVKGIQEGKKHQTLLGATGTGKTFTVSNLIKEVNKPTLVIAHNKTLAGQLYSEFKEF
FPNNAVEYFVSYYDYYQPEAYVPQTDTFIEKDASINDEIDKLRHSATSALFERRDVIIIASVSCIYGLGSPEEYREMVVS
LRTEMEIERNELLRKLVDIQYARNDIDFQRGTFRVRGDVVEIFPASRDEHCVRVEFFGDEIERIREVDALTGEILGDRDH
VAIFPASHFVTRAEKMEKAIQNIEKELEEQLKVMHENGKLLEAQRLEQRTRYDLEMMREMGFCSGIENYSRHLTLRPPGS
TPYTLLDYFPDDFMIVVDESHVTIPQVRGMFNGDQARKQVLVDHGFRLPSALDNRPLRFEEFEKHMHNIVYVSATPGPYE
IEHTDEMVEQIIRPTGLLDPLIDVRPIEGQIDDLIGEIQARIERNERVLVTTLTKKMSEDLTDYLKEIGIKVNYLHSEIK
TLERIEIIRDLRLGKYDVLVGINLLREGLDIPEVSLVAILDADKEGFLRSERSLIQTIGRAARNAEGRVIMYADKITKSM
EIAINETKRRREQQERFNEEHGITPKTINKEIRDVIRATVAAEDKAEYKTKAAPKLSKMTKKERQKVVEQMEHEMKEAAK
ALDFERAAELRDLLLELKAEG
>P0A8F8 ~~~uvrB~~~UvrABC system protein B~~~COG0556
MSKPFKLNSAFKPSGDQPEAIRRLEEGLEDGLAHQTLLGVTGSGKTFTIANVIADLQRPTMVLAPNKTLAAQLYGEMKEF
FPENAVEYFVSYYDYYQPEAYVPSSDTFIEKDASVNEHIEQMRLSATKAMLERRDVVVVASVSAIYGLGDPDLYLKMMLH
LTVGMIIDQRAILRRLAELQYARNDQAFQRGTFRVRGEVIDIFPAESDDIALRVELFDEEVERLSLFDPLTGQIVSTIPR
FTIYPKTHYVTPRERIVQAMEEIKEELAARRKVLLENNKLLEEQRLTQRTQFDLEMMNELGYCSGIENYSRFLSGRGPGE
PPPTLFDYLPADGLLVVDESHVTIPQIGGMYRGDRARKETLVEYGFRLPSALDNRPLKFEEFEALAPQTIYVSATPGNYE
LEKSGGDVVDQVVRPTGLLDPIIEVRPVATQVDDLLSEIRQRAAINERVLVTTLTKRMAEDLTEYLEEHGERVRYLHSDI
DTVERMEIIRDLRLGEFDVLVGINLLREGLDMPEVSLVAILDADKEGFLRSERSLIQTIGRAARNVNGKAILYGDKITPS
MAKAIGETERRREKQQKYNEEHGITPQGLNKKVVDILALGQNIAKTKAKGRGKSRPIVEPDNVPMDMSPKALQQKIHELE
GLMMQHAQNLEFEEAAQIRDQLHQLRELFIAAS
>P9WFC7 ~~~uvrB~~~UvrABC system protein B~~~COG0556
MAFATEHPVVAHSEYRAVEEIVRAGGHFEVVSPHAPAGDQPAAIDELERRINAGERDVVLLGATGTGKSATTAWLIERLQ
RPTLVMAPNKTLAAQLANELREMLPHNAVEYFVSYYDYYQPEAYIAQTDTYIEKDSSINDDVERLRHSATSALLSRRDVV
VVASVSCIYGLGTPQSYLDRSVELKVGEEVPRDGLLRLLVDVQYTRNDMSFTRGSFRVRGDTVEIIPSYEELAVRIEFFG
DEIEALYYLHPLTGEVIRQVDSLRIFPATHYVAGPERMAHAVSAIEEELAERLAELESQGKLLEAQRLRMRTNYDIEMMR
QVGFCSGIENYSRHIDGRGPGTPPATLLDYFPEDFLLVIDESHVTVPQIGGMYEGDISRKRNLVEYGFRLPSACDNRPLT
WEEFADRIGQTVYLSATPGPYELSQTGGEFVEQVIRPTGLVDPKVVVKPTKGQIDDLIGEIRTRADADQRVLVTTLTKKM
AEDLTDYLLEMGIRVRYLHSEVDTLRRVELLRQLRLGDYDVLVGINLLREGLDLPEVSLVAILDADKEGFLRSSRSLIQT
IGRAARNVSGEVHMYADKITDSMREAIDETERRRAKQIAYNEANGIDPQPLRKKIADILDQVYREADDTAVVEVGGSGRN
ASRGRRAQGEPGRAVSAGVFEGRDTSAMPRAELADLIKDLTAQMMAAARDLQFELAARFRDEIADLKRELRGMDAAGLK
>P67425 ~~~uvrB~~~UvrABC system protein B~~~
MTMVEHYPFKIHSDFEPQGDQPQAIKEIVDGIKAGKRHQTLLGATGTGKTFTMSNVIKEVGKPTLIIAHNKTLAGQLYSE
FKEFFPENRVEYFVSYYDYYQPEAYVPSTDTFIEKDASINDEIDQLRHSATSALFERDDVIIIASVSCIYGLGNPEEYKD
LVVSVRVGMEMDRSELLRKLVDVQYTRNDIDFQRGTFRVRGDVVEIFPASKEELCIRVEFFGDEIDRIREVNYLTGEVLK
EREHFAIFPASHFVTREEKLKVAIERIEKELEERLKELRDENKLLEAQRLEQRTNYDLEMMREMGFCSGIENYSVHLTLR
PLGSTPYTLLDYFGDDWLVMIDESHVTLPQVRGMYNGDRARKQVLVDHGFRLPSALDNRPLKFEEFEEKTKQLVYVSATP
GPYEIEHTDKMVEQIIRPTGLLDPKIEVRPTENQIDDLLSEIQTRVERNERVLVTTLTKKMSEDLTTYMKEAGIKVNYLH
SEIKTLERIEIIRDLRMGTYDVIVGINLLREGIDIPEVSLVVILDADKEGFLRSNRSLIQTIGRAARNDKGEVIMYADKM
TDSMKYAIDETQRRREIQMKHNEKHGITPKTINKKIHDLISATVENDENNDKAQTVIPKKMTKKERQKTIDNIEKEMKQA
AKDLDFEKATELRDMLFELKAEG
>Q56243 ~~~uvrB~~~UvrABC system protein B~~~COG0556
MTFRYRGPSPKGDQPKAIAGLVEALRDGERFVTLLGATGTGKTVTMAKVIEALGRPALVLAPNKILAAQLAAEFRELFPE
NAVEYFISYYDYYQPEAYVPGKDLYIEKDASINPEIERLRHSTTRSLLTRRDVIVVASVSAIYGLGDPREYRARNLVVER
GKPYPREVLLERLLELGYQRNDIDLSPGRFRAKGEVLEIFPAYETEPIRVELFGDEVERISQVHPVTGERLRELPGFVLF
PATHYLSPEGLEEILKEIEKELWERVRYFEERGEVLYAQRLKERTLYDLEMLRVMGTCPGVENYARYFTGKAPGEPPYTL
LDYFPEDFLVFLDESHVTVPQLQGMYRGDYARKKTLVDYGFRLPSALDNRPLRFEEFLERVSQVVFVSATPGPFELAHSG
RVVEQIIRPTGLLDPLVRVKPTENQILDLMEGIRERAARGERTLVTVLTVRMAEELTSFLVEHGIRARYLHHELDAFERQ
ALIRDLRLGHYDCLVGINLLREGLDIPEVSLVAILDADKEGFLRSERSLIQTIGRAARNARGEVWLYADRVSEAMQRAIE
ETNRRRALQEAYNLEHGITPETVRKEVRAVIRPEGYEEAPLEADLSGEDLRERIAELELAMWQAAEALDFERAARLRDEI
RALEARLQGVRAPEPVPGGRKRKRR
>Q9RUN0 ~~~uvrC~~~UvrABC system protein C~~~COG0322
MHFDDLPVLPTTPGVYIFRKGGVPIYIGKANNLRSRVSQHFKAGGKSGKFTKLAESLEFISAANEVEALVLEANLIKQHR
PHYNVLLKDDKHYPFLKLTNEAYPMLVVTRRVLKDGANYYGPYPDASAVRRVKHLIDTMFPLRKNSGLPMQKKPRPCLNY
HMGRCLGPCIDAAQPDEYRQAVEDVKALLEGRAAPVIARLKEDMKVAAQGQDFEQAARLRDRVQAVEKLFGTEQHAFVSE
ETDLDFLGAAQAGEFAMVQLFRMRGGRVVGRDKRFLTGADETGLGEIIGAFVADYYTQATHVPPLILLPAEYEDAALWSE
FLSRQAGRRVEMRTPKRGDKTDLIEMAQRNAAVGLDSEMALLERRGDHPGLDALKDVLALPERPWRIEGYDNSNLFGTNI
VSGMVVFEGGRSRRGEHRRFKVRGLEHPDDYESMKQTIYRRFTGSLADKLPLPDLMLIDGGRGQVNAALDALKEAGVQVP
VVGLAKREERLILPGRYGAQWWLETGTEVGVDRELLLPHTHPALRMLIGVRDEVHNYAVSYHRKLRGEGMLRSVFDDLPG
IGQKRRDALLEHFTSLEDLAAAPVEHIAAVPGMTLRAAQSVKEFLQAREAQLPRAGG
>P0A8G0 ~~~uvrC~~~UvrABC system protein C~~~COG0322
MSDQFDAKAFLKTVTSQPGVYRMYDAGGTVIYVGKAKDLKKRLSSYFRSNLASRKTEALVAQIQQIDVTVTHTETEALLL
EHNYIKLYQPRYNVLLRDDKSYPFIFLSGDTHPRLAMHRGAKHAKGEYFGPFPNGYAVRETLALLQKIFPIRQCENSVYR
NRSRPCLQYQIGRCLGPCVEGLVSEEEYAQQVEYVRLFLSGKDDQVLTQLISRMETASQNLEFEEAARIRDQIQAVRRVT
EKQFVSNTGDDLDVIGVAFDAGMACVHVLFIRQGKVLGSRSYFPKVPGGTELSEVVETFVGQFYLQGSQMRTLPGEILLD
FNLSDKTLLADSLSELAGRKINVQTKPRGDRARYLKLARTNAATALTSKLSQQSTVHQRLTALASVLKLPEVKRMECFDI
SHTMGEQTVASCVVFDANGPLRAEYRRYNITGITPGDDYAAMNQVLRRRYGKAIDDSKIPDVILIDGGKGQLAQAKNVFA
ELDVSWDKNHPLLLGVAKGADRKAGLETLFFEPEGEGFSLPPDSPALHVIQHIRDESHDHAIGGHRKKRAKVKNTSSLET
IEGVGPKRRQMLLKYMGGLQGLRNASVEEIAKVPGISQGLAEKIFWSLKH
>Q5KWH6 ~~~uvrC~~~UvrABC system protein C~~~COG0322
MNERLKEKLAVLPEQPGCYLMKDKHGTVIYVGKAKSLKARVRSYFTGTHDGKTQRLVEEIADFEYIVTSSNAEALILEMN
LIKKHDPKYNVMLKDDKSYPFIKITAEKHPRLLITRKVKKDGGKYFGPYPNVQAANETKKLLDRLYPLRKCSTLPSRACL
YYHMGQCLAPCVHPVSDEQNKAMVEQIVRFLNGGYEDVKRELAEKMHEAAETLEFERAKEYRDQIAAIEMTMEKQKMMLN
DFIDRDVFGYAYDKGWMCVQVFFLRQGKLIERDVSIFPLYQDPDEEMLTFLGQFYAKAHHLKPKEVVLPSDIDGELAREL
LGVAVVQPKKGKKKELVELASKNAAIALKEKFYFIERDEERTIKAVERLGERLGIPAPRRIEAFDNSNIYGADPVSALVV
FLDGKPAKKEYRKYKVKTVAGPNDYETMREVVRRRYTRVLKEGLPLPDLIIIDGGKGHLSAVRDVLENELGLDVPLAGLA
KDEKHRTSELLAGDPPDVVPLDRQSQEFYLLQRIQDEVHRFAVMFHRKTRQKTMFHSVLDDIPGVGEKRKKALLNYFGSV
KKMKEATVEELQRANIPRAVAEKIYEKLHE
>P56428 ~~~uvrC~~~UvrABC system protein C~~~COG0322
MADLLSSLKNLPNSSGVYQYFDKNRQLLYIGKAKNLKKRIKSYFSIRNNEITPNHRASLRIQMMVKQIAFLETILVENEQ
DALILENSLIKQLKPKYNILLRDDKTYPYIYMDFSTDFPIPLITRKILKQPGVKYFGPFTSGAKDILDSLYELLPLVQKK
NCIKDKKACIFYQIERCKAPCENKITKEEYLKIAKECLEMIENKDRLIKELELKMERLSNNLRFEEALIYRDRIAKIQKI
APFTCMDLAKLYDLDIFAFYGASNKAVLVKMFMRGGKIISSAFEKIHSLNGFDTDEAMKQAIINHYQSHLPLMPEQILLN
ACSNETLKELQEFISHQYSKKIALSIPKKGDKLALIEIAMKNAQEIFSQEKTSNEDLILEEARSLFKLECMPYRVEIFDT
SHHSSSQCVGGMVVYENNAFQKNSYRRYHLKGSDEYTQMSELLTRRALDFAKEPPPNLWVIDGGRAQLNIALEILKSSGS
FVEVIAISKEKRDSKAYRSKGGAKDIIHTPSDTFKLLPSDKRLQWVQKLRDESHRYAINFHRSTKLKNMKQIALLKEKGI
GEASVKKLLDYFGSFEAIEKASEQEKNAVLKKRI
>P9WFC5 ~~~uvrC~~~UvrABC system protein C~~~COG0322
MPDPATYRPAPGSIPVEPGVYRFRDQHGRVIYVGKAKSLRSRLTSYFADVASLAPRTRQLVTTAAKVEWTVVGTEVEALQ
LEYTWIKEFDPRFNVRYRDDKSYPVLAVTLGEEFPRLMVYRGPRRKGVRYFGPYSHAWAIRETLDLLTRVFPARTCSAGV
FKRHRQIDRPCLLGYIDKCSAPCIGRVDAAQHRQIVADFCDFLSGKTDRFARALEQQMNAAAEQLDFERAARLRDDLSAL
KRAMEKQAVVLGDGTDADVVAFADDELEAAVQVFHVRGGRVRGQRGWIVEKPGEPGDSGIQLVEQFLTQFYGDQAALDDA
ADESANPVPREVLVPCLPSNAEELASWLSGLRGSRVVLRVPRRGDKRALAETVHRNAEDALQQHKLKRASDFNARSAALQ
SIQDSLGLADAPLRIECVDVSHVQGTDVVGSLVVFEDGLPRKSDYRHFGIREAAGQGRSDDVACIAEVTRRRFLRHLRDQ
SDPDLLSPERKSRRFAYPPNLYVVDGGAPQVNAASAVIDELGVTDVAVIGLAKRLEEVWVPSEPDPIIMPRNSEGLYLLQ
RVRDEAHRFAITYHRSKRSTRMTASALDSVPGLGEHRRKALVTHFGSIARLKEATVDEITAVPGIGVATATAVHDALRPD
SSGAAR
>Q9WYA3 ~~~uvrC~~~UvrABC system protein C~~~COG0322
MKEKIRKKILLAPEEPGVYIFKNKGVPIYIGKAKRLSNRLRSYLNPQTEKVFRIGEEADELETIVVMNEREAFILEANLI
KKYRPKYNVRLKDTDFYPYIRISDDEIPYVEIVKRKLWDGTYFGPYTSVQFVRNLLEILQKIMGFRTCKSDLKRIKRPCF
LYHLGRCIGPCIGNIESHEEAIRKLREFLSGNMEEVFDYLKEKMETHSKMLDFENAAKYRDLLLNLSNVLESQGVVFEEN
INCDVLVHAHDLFVVLRVRNGYLVGKISFEMEGGNVEDFIREYYISGRGDIPKTLILESDLDEMDYSSLGFEYVGPPRST
TEEDLLEKAKKNLENELKMRGLRKEALEELMKLLNMKDFPYRIEGIDISHLQGKYTVASLVVFEDGFPKKGDYRRYKIEQ
DHPDDYESIRTVVKRRYSKHPLPNLLFVDGGIGQVNAAIEALKEIGKDCPVVGLAKKEETVVFENREIHLPHDHPVLRLL
VQIRDETHRFAVSYHRKRREKESLRSVLDNVPGIGPIRKKKLIEHFGSLENIRSASLEEIARVIGSTEIARRVLDIL
>P9WMQ1 5.6.2.4~~~uvrD1~~~ATP-dependent DNA helicase UvrD1~~~COG0210
MSVHATDAKPPGPSPADQLLDGLNPQQRQAVVHEGSPLLIVAGAGSGKTAVLTRRIAYLMAARGVGVGQILAITFTNKAA
AEMRERVVGLVGEKARYMWVSTFHSTCVRILRNQAALIEGLNSNFSIYDADDSRRLLQMVGRDLGLDIKRYSPRLLANAI
SNLKNELIDPHQALAGLTEDSDDLARAVASVYDEYQRRLRAANALDFDDLIGETVAVLQAFPQIAQYYRRRFRHVLVDEY
QDTNHAQYVLVRELVGRDSNDGIPPGELCVVGDADQSIYAFRGATIRNIEDFERDYPDTRTILLEQNYRSTQNILSAANS
VIARNAGRREKRLWTDAGAGELIVGYVADNEHDEARFVAEEIDALAEGSEITYNDVAVFYRTNNSSRSLEEVLIRAGIPY
KVVGGVRFYERKEIRDIVAYLRVLDNPGDAVSLRRILNTPRRGIGDRAEACVAVYAENTGVGFGDALVAAAQGKVPMLNT
RAEKAIAGFVEMFDELRGRLDDDLGELVEAVLERTGYRRELEASTDPQELARLDNLNELVSVAHEFSTDRENAAALGPDD
EDVPDTGVLADFLERVSLVADADEIPEHGAGVVTLMTLHTAKGLEFPVVFVTGWEDGMFPHMRALDNPTELSEERRLAYV
GITRARQRLYVSRAIVRSSWGQPMLNPESRFLREIPQELIDWRRTAPKPSFSAPVSGAGRFGSARPSPTRSGASRRPLLV
LQVGDRVTHDKYGLGRVEEVSGVGESAMSLIDFGSSGRVKLMHNHAPVTKL
>P9WMP9 5.6.2.4~~~uvrD2~~~ATP-dependent DNA helicase UvrD2~~~COG0210
MSIASDPLIAGLDDQQREAVLAPRGPVCVLAGAGTGKTRTITHRIASLVASGHVAAGQVLAVTFTQRAAGEMRSRLRALD
AAARTGSGVGAVQALTFHAAAYRQLRYFWSRVIADTGWQLLDSKFAVVARAASRTRLHASTDDVRDLAGEIEWAKASLIG
PEEYVTAVAAARRDPPLDAAQIAAVYSEYEALKARGDGVTLLDFDDLLLHTAAAIENDAAVAEEFQDRYRCFVVDEYQDV
TPLQQRVLSAWLGDRDDLTVVGDANQTIYSFTGASPRFLLDFSRRFPDAAVVRLERDYRSTPQVVSLANRVIAAARGRVA
GSKLRLSGQREPGPVPSFHEHSDEPAEAATVAASIARLIASGTPPSEVAILYRVNAQSEVYEEALTQAGIAYQVRGGEGF
FNRQEIKQALLALQRVSERDTDAALSDVVRAVLAPLGLTAQPPVGTRARERWEALTALAELVDDELAQRPALQLPGLLAE
LRRRAEARHPPVVQGVTLASLHAAKGLEWDAVFLVGLADGTLPISHALAHGPNSEPVEEERRLLYVGITRARVHLALSWA
LSRSPGGRQSRKPSRFLNGIAPQTRADPVPGTSRRNRGAAARCRICNNELNTSAAVMLRRCETCAADVDEELLLQLKSWR
LSTAKEQNVPAYVVFTDNTLIAIAELLPTDDAALIAIPGIGARKLEQYGSDVLQLVRGRT
>P03018 5.6.2.4~~~uvrD~~~DNA helicase II~~~COG0210
MDVSYLLDSLNDKQREAVAAPRSNLLVLAGAGSGKTRVLVHRIAWLMSVENCSPYSIMAVTFTNKAAAEMRHRIGQLMGT
SQGGMWVGTFHGLAHRLLRAHHMDANLPQDFQILDSEDQLRLLKRLIKAMNLDEKQWPPRQAMWYINSQKDEGLRPHHIQ
SYGNPVEQTWQKVYQAYQEACDRAGLVDFAELLLRAHELWLNKPHILQHYRERFTNILVDEFQDTNNIQYAWIRLLAGDT
GKVMIVGDDDQSIYGWRGAQVENIQRFLNDFPGAETIRLEQNYRSTSNILSAANALIENNNGRLGKKLWTDGADGEPISL
YCAFNELDEARFVVNRIKTWQDNGGALAECAILYRSNAQSRVLEEALLQASMPYRIYGGMRFFERQEIKDALSYLRLIAN
RNDDAAFERVVNTPTRGIGDRTLDVVRQTSRDRQLTLWQACRELLQEKALAGRAASALQRFMELIDALAQETADMPLHVQ
TDRVIKDSGLRTMYEQEKGEKGQTRIENLEELVTATRQFSYNEEDEDLMPLQAFLSHAALEAGEGQADTWQDAVQLMTLH
SAKGLEFPQVFIVGMEEGMFPSQMSLDEGGRLEEERRLAYVGVTRAMQKLTLTYAETRRLYGKEVYHRPSRFIGELPEEC
VEEVRLRATVSRPVSHQRMGTPMVENDSGYKLGQRVRHAKFGEGTIVNMEGSGEHSRLQVAFQGQGIKWLVAAYARLESV
>Q02322 5.6.2.4~~~uvrD~~~DNA helicase II~~~COG0210
MMDISELLDGLNDKQRERVAAPLGNHLVLAGAGSGKTRVLTHRIAWLIAVENISEGSIMAVTFTNKAAAEMRHRIQSTLA
KHAQHQLFGMWIGTFHSIAHRLLRAHHLDVGLPQDFQILDSEDQLRLIKRLLKLHNFDEKAFPPKQACWYINNKKDEGLR
PNDIEDFNDRQEREWIKIYQIYQDTCDRAGLVDFAELLIRVYELFEKKPLILQRYQQRFQHILVDEFQDTNKIQYKWIKI
LAGKTGQVMIVGDDDQSIYGWRGAQIENIQKFLKDFKAETIRLEQNYRSTANILNSANELIANNSDRLGKNLWTEGEKGD
PVGIYSAFNELDEAKFVASQIQDWVEHGGKLDDCAVLYRSNSQSRVIEEALIRCQIPYRIYGGMRFFERQEIKDALAYLR
LINNRQDDAAFERVINTPTRGIGDRTLDILRNLTRERQITLWQAVQVATQENMLAGRASTALLRFQELINSLQLDTAEMP
LFAQTDFVIKHSGLYEMYQQEKGEKGEVRIENLEELVTATREFIKPDNAEEMTELTAFLTHASLEAGEEQASPHQSCVEM
MTLHSAKGLEFPRVFMVGVEEGLFPSFRSFEEPGRLEEERRLAYVGITRAKKKLTISYAESRRLYAKEERHLPSRFIAEL
PRECIQEIRLRGTVTRAMNLAKVGSLSNTSAVENEWKMGQKVKHEKFGFGTVINVEGSENNTRLQIAFQAQGIKWLIAHL
AKLEKVR
>P0AED5 ~~~uvrY~~~Response regulator UvrY~~~COG2197
MINVLLVDDHELVRAGIRRILEDIKGIKVVGEASCGEDAVKWCRTNAVDVVLMDMSMPGIGGLEATRKIARSTADVKIIM
LTVHTENPLPAKVMQAGAAGYLSKGAAPQEVVSAIRSVYSGQRYIASDIAQQMALSQIEPEKTESPFASLSERELQIMLM
ITKGQKVNEISEQLNLSPKTVNSYRYRMFSKLNIHGDVELTHLAIRHGLCNAETLSSQ
>Q9RTE6 3.-.-.-~~~uvsE~~~UV DNA damage endonuclease~~~COG4294
MTSACEAVPQLGLVCLTVGPEVRFRTVTLSRYRALSPAEREAKLLDLYSSNIKTLRGAADYCAAHDIRLYRLSSSLFPML
DLAGDDTGAAVLTHLAPQLLEAGHAFTDAGVRLLMHPEQFIVLNSDRPEVRESSVRAMSAHARVMDGLGLARTPWNLLLL
HGGKGGRGAELAALIPDLPDPVRLRLGLENDERAYSPAELLPICEATGTPLVFDAHHHVVHDKLPDQEDPSVREWVLRAR
ATWQPPEWQVVHLSNGIEGPQDRRHSHLIADFPSAYADVPWIEVEAKGKEEAIAALRLMAPFKKD
>O34673 4.2.1.7~~~uxaA~~~Altronate dehydratase~~~COG2721
MKSFIKIHKQDNVLLALRDIQKGERLHAYGVSIEVKDDIKRGHKIALQSIKENDSIVKYGFPIGHASQDISIGEHIHVHN
TKTNLSDIQLYSYTPRFDENPYSNENRTFKGFRRENGDAGVRNELWIVPTVGCVNGIAEKMLQRFVRETGDIAPFDNVLV
LKHQYGCSQLGDDHENTKQILLNAIRHPNAGGVLVLGLGCENNELARMKEALQDVNLKRVKFLESQSVTDEMEAGVALLK
EIHEAAKGDKREDIPLSELKIGLKCGGSDGFSGITANPLLGRFSDYLIAQGGSTVLTEVPEMFGAETILMQRAANEEVFH
KIVDLINDFKQYFIKHDQPVYENPSPGNKAGGISTLEDKSLGCTQKAGISPVTDVLKYGEVLKTKGLTLLSAPGNDLIAS
SALAAAGCQIVLFTTGRGTPFGTFVPTVKVATNTELYEAKPHWIDFNAGLLAEDDVHEEYVLREFIHYMIEVASGQLVNH
EKNDFKELAIFKSGVTL
>P42604 4.2.1.7~~~uxaA~~~Altronate dehydratase~~~COG2721
MQYIKIHALDNVAVALADLAEGTEVSVDNQTVTLRQDVARGHKFALTDIAKGANVIKYGLPIGYALADIAAGVHVHAHNT
RTNLSDLDQYRYQPDFQDLPAQAADREVQIYRRANGDVGVRNELWILPTVGCVNGIARQIQNRFLKETNNAEGTDGVFLF
SHTYGCSQLGDDHINTRTMLQNMVRHPNAGAVLVIGLGCENNQVAAFRETLGDIDPERVHFMICQQQDDEIEAGIEHLHQ
LYNVMRNDKREPGKLSELKFGLECGGSDGLSGITANPMLGRFSDYVIANGGTTVLTEVPEMFGAEQLLMDHCRDEATFEK
LVTMVNDFKQYFIAHDQPIYENPSPGNKAGGITTLEDKSLGCTQKAGSSVVVDVLRYGERLKTPGLNLLSAPGNDAVATS
ALAGAGCHMVLFSTGRGTPYGGFVPTVKIATNSELAAKKKHWIDFDAGQLIHGKAMPQLLEEFIDTIVEFANGKQTCNER
NDFRELAIFKSGVTL
>O34354 1.1.1.58~~~uxaB~~~Altronate oxidoreductase~~~COG0246
MQKLNKNVYDHYTQYPEKILQFGEGNFLRGFIDWQIDQLNQHTDFNGSVAVVQPRGSEKIKRLNEQDGLYTLFLQGMKDG
EAVNEHMIINSISRGIDLFSDYEAYKELASSERLRFIISNTTEAGIVCDEKDRLEDRPQKTFPGKLTAFLYFRYQAFKGD
QTKGCVLIPCELIENNGEKLRETVLHYAHLWKLEEGFTQWIHEANTFCNSLVDRIVPGFPVDSIDEITADLGYQDDLIVV
GEQYYLWVIEGPDWIGKELPFAAAGLHTKIVSDLTPYRTKKVRILNGAHTAMTPVALLYGLKTVRDAVEHPEVGRFIREL
IDDEILPVLKMEGLSQYADDVLNRFKNPYIKHYLESIALNAISKFKTRNLPTLKEYAEQKGQLPERLVFSFSALLYFYHD
NETLQDDPAVLQFFKEVWCQEDGDMLRIASRVLGEQRLWGADLNEIPKLTDRVAVYLNHIHELGMQRALEQYCIQGGEVR
>P0A6L7 1.1.1.58~~~uxaB~~~Altronate oxidoreductase~~~COG0246
MKTLNRRDFPGAQYPERIIQFGEGNFLRAFVDWQIDLLNEHTDLNSGVVVVRPIETSFPPSLSTQDGLYTTIIRGLNEKG
EAVSDARLIRSVNREISVYSEYDEFLKLAHNPEMRFVFSNTTEAGISYHAGDKFDDAPAVSYPAKLTRLLFERFSHFNGA
LDKGWIIIPCELIDYNGDALRELVLRYAQEWALPEAFIQWLDQANSFCSTLVDRIVTGYPRDEVAKLEEELGYHDGFLDT
AEHFYLFVIQGPKSLATELRLDKYPLNVLIVDDIKPYKERKVAILNGAHTALVPVAFQAGLDTVGEAMNDAEICAFVEKA
IYEEIIPVLDLPRDELESFASAVTGRFRNPYIKHQLLSIALNGMTKFRTRILPQLLAGQKANGTLPARLTFALAALIAFY
RGERNGETYPVQDDAHWLERYQQLWSQHRDRVIGTQELVAIVLAEKDHWEQDLTQVPGLVEQVANDLDAILEKGMREAVR
PLC
>O34808 5.3.1.12~~~uxaC~~~Uronate isomerase~~~COG1904
MEPFMGKNFLLKNETAVSLYHNYAKDMPIIDYHCHLSPKEIYENKTFQNITEAWLYGDHYKWRIMRANGIEETYITGDAP
DEEKFMAWAKTVPMAIGNPLYNWTHLELQRFFGIYEILNEKSGSAIWKQTNKLLKGEGFGARDLIVKSNVKVVCTTDDPV
DSLEYHLLLKEDKDFPVSVLPGFRPDKGLEINREGFPEWVQALEDAAAISITTYDEFLKALEKRVRFFHSAGGRVSDHAI
DTMVFAETTKEEAGRIFSDRLQGTEVSCEDEKKFKTYTLQFLCGLYAELDWAMQFHINALRNTNTKMMKRLGPDTGYDSM
NDEEIAKPLYKLLNSVEMKNQLPKTILYSLNPNDNYVIASMINSFQDGITPGKIQFGTAWWFNDTKDGMLDQMKALSNVG
LFSRFIGMLTDSRSFLSYTRHEYFRRIVCNLIGEWVENGEVPRDMELLGSIVQGICYDNAKHYFQFQEEKANV
>Q9A874 5.3.1.12~~~uxaC~~~Uronate isomerase~~~COG1904
MARPLSFHEDRLFPSDPATRSYARGLYALVKDLPIISPHGHTDPSWFATNAPFQDATDLLLAPDHYLFRMLYSQGVSLDA
LKVRSKAGVPDTDPREAWRVFASHFYLFRGTPSWVWLNHVFSQVFGFTEFLEASNADDYFDRITAALATDAFRPRALFDR
FNIETLATTEGPHESLQHHAAIRESGWGGHVITAYRPDAVIDFEDERSPRAFERFAETSGQDVYSWKSYLEAHRLRRQAF
IDAGATSSDHGHPTAATADLSDVEAEALFNSLVKGDVTPEKAELFRAQMLTEMAKMSLDDGLVMQIHPGSHRNHNVGLLN
SHGRDKGADIPMRTEYVDALKPLLTRLGNDPRLSIILFTLDETTYSRELAPLAGHYPVLKLGPSWWFHDSPEGMMRFREQ
VTETAGFYNTVGFNDDTRAFLSIPARHDVARRVDSAFLARMVAEHRMDLVEAEELIVDLTYNLPKKAYKLDQRPDWARPA
TLRAAAE
>P0A8G3 5.3.1.12~~~uxaC~~~Uronate isomerase~~~COG1904
MTPFMTEDFLLDTEFARRLYHDYAKDQPIFDYHCHLPPQQIAEDYRFKNLYDIWLKGDHYKWRAMRTNGVAERLCTGDAS
DREKFDAWAATVPHTIGNPLYHWTHLELRRPFGITGKLLSPSTADEIWNECNELLAQDNFSARGIMQQMNVKMVGTTDDP
IDSLEHHAEIAKDGSFTIKVLPSWRPDKAFNIEQATFNDYMAKLGEVSDTDIRRFADLQTALTKRLDHFAAHGCKVSDHA
LDVVMFAEANEAELDSILARRLAGETLSEHEVAQFKTAVLVFLGAEYARRGWVQQYHIGALRNNNLRQFKLLGPDVGFDS
INDRPMAEELSKLLSKQNEENLLPKTILYCLNPRDNEVLGTMIGNFQGEGMPGKMQFGSGWWFNDQKDGMERQMTQLAQL
GLLSRFVGMLTDSRSFLSYTRHEYFRRILCQMIGRWVEAGEAPADINLLGEMVKNICFNNARDYFAIELN
>Q8ZM23 5.3.1.12~~~uxaC~~~Uronate isomerase~~~
MATFMTEDFLLKNDIARTLYHKYAAPMPIYDFHCHLSPQEIADDRRFDNLGQIWLEGDHYKWRALRSAGVDESLITGKET
SDYEKYMAWANTVPKTLGNPLYHWTHLELRRPFGITGTLFGPDTAESIWTQCNEKLATPAFSARGIMQQMNVRMVGTTDD
PIDSLEYHRQIAADDSIDIEVAPSWRPDKVFKIELDGFVDYLRKLEAAADVSITRFDDLRQALTRRLDHFAACGCRASDH
GIETLRFAPVPDDAQLDAILGKRLAGETLSELEIAQFTTAVLVWLGRQYAARGWVMQLHIGAIRNNNTRMFRLLGPDTGF
DSIGDNNISWALSRLLDSMDVTNELPKTILYCLNPRDNEVLATMIGNFQGPGIAGKVQFGSGWWFNDQKDGMLRQLEQLS
QMGLLSQFVGMLTDSRSFLSYTRHEYFRRILCNLLGQWAQDGEIPDDEAMLSRMVQDICFNNAQRYFTIK
>Q9WXR9 5.3.1.12~~~uxaC~~~Uronate isomerase~~~COG1904
MFLGEDYLLTNRAAVRLFNEVKDLPIVDPHNHLDAKDIVENKPWNDIWEVEGATDHYVWELMRRCGVSEEYITGSRSNKE
KWLALAKVFPRFVGNPTYEWIHLDLWRRFNIKKVISEETAEEIWEETKKKLPEMTPQKLLRDMKVEILCTTDDPVSTLEH
HRKAKEAVEGVTILPTWRPDRAMNVDKEGWREYVEKMGERYGEDTSTLDGFLNALWKSHEHFKEHGCVASDHALLEPSVY
YVDENRARAVHEKAFSGEKLTQDEINDYKAFMMVQFGKMNQETNWVTQLHIGALRDYRDSLFKTLGPDSGGDISTNFLRI
AEGLRYFLNEFDGKLKIVLYVLDPTHLPTISTIARAFPNVYVGAPWWFNDSPFGMEMHLKYLASVDLLYNLAGMVTDSRK
LLSFGSRTEMFRRVLSNVVGEMVEKGQIPIKEARELVKHVSYDGPKALFFG
>Q9WYS1 5.1.2.7~~~uxaE~~~Tagaturonate/fructuronate epimerase~~~
MVLKVFKDHFGRGYEVYEKSYREKDSLSFFLTKEEEGKILVVAGEKAPEGLSFFKKQRAEGVSFFFCERNHENLEVLRKY
FPDLKPVRAGLRASFGTGDRLGITTPAHVRALKDSGLFPIFAQQSVRENERTGRTWRDVLDDATWGVFQEGYSEGFGADA
DHVKRPEDLVSAAREGFTMFTIDPSDHVRNLSKLTEKERNEKFEEILRKERIDRIYLGKKYSVLGEKIEFDEKNLRDAAL
VYYDAIAHVDMMYQILKDETPDFDFEVSVDETETPTSPLFHIFVVEELRRRGVEFTNLALRFIGEWEKGIDYKGDLAQFE
REIKMHAEIARMFEGYKISLHSGSDKFSVYPAFASATGGLFHVKTAGTSYLEAVKVISMVNPELFREIYRCTLDHFEEDR
KSYHISADLSKVPEVEKVKDEDLPGLFEDINVRQLIHVTYGSVLKDASLKERLFKTLEQNEELFYETVAKHIKRHVDLLE
G
>O34346 4.2.1.8~~~uxuA~~~Mannonate dehydratase~~~COG1312
MNMTFRWYGRGNDTVTLEYVKQIPGVKGIVWALHQKPVGDVWEKEEIRAETEYIQSYGFHAEVVESVNVHEAIKLGNEER
GRYIENYKQTIRNLAGFGVKVICYNFMPVFDWTRTDMFRPLEDGSTALFFEKAKVESLDPQELIRTVEEASDMTLPGWEP
EKLARIKELFAAYRTVDEEKLWDNLSFFLQEILPVAEAYGVQMAIHPDDPPWPIFGLPRIITGEASYKKLRAISDSPSNC
ITLCTGSMGANPANDMVEIAKTYAGIAPFSHIRNVKIYENGDFIETSHLTKDGSINIQGVMEELHKQDYEGYVRPDHGRH
LWGEQCRPGYGLYDRALGIMYLNGLWDAYEAMAKKEVGI
>P24215 4.2.1.8~~~uxuA~~~Mannonate dehydratase~~~COG1312
MEQTWRWYGPNDPVSLADVRQAGATGVVTALHHIPNGEVWSVEEILKRKAIIEDAGLVWSVVESVPIHEDIKTHTGNYEQ
WIANYQQTLRNLAQCGIRTVCYNFMPVLDWTRTDLEYVLPDGSKALRFDQIEFAAFEMHILKRPGAEADYTEEEIAQAAE
RFATMSDEDKARLTRNIIAGLPGAEEGYTLDQFRKHLELYKDIDKAKLRENFAVFLKAIIPVAEEVGVRMAVHPDDPPRP
ILGLPRIVSTIEDMQWMVDTVNSMANGFTMCTGSYGVRADNDLVDMIKQFGPRIYFTHLRSTMREDNPKTFHEAAHLNGD
VDMYEVVKAIVEEEHRRKAEGKEDLIPMRPDHGHQMLDDLKKKTNPGYSAIGRLKGLAEVRGVELAIQRAFFSR
>Q82ZC9 4.2.1.8~~~uxuA~~~Mannonate dehydratase~~~COG1312
MKWGFRWYGAAGDAIPLKHIRQIPGITGVVGTLLNKLPGDVWTVAEIQALKQSVEQEGLALLGIESVAIHDAIKAGTDQR
DHYIDNYRQTLRNLGKCGISLVCYSFKPIFGWAKTDLAYENEDGSLSLLFDQAVVENMQPEDMYQLIHSQSKGFRLPGWE
EERLQQFQELKAMYAGVTEEDLVENLRYFLERVIPVCEEENIKMGIHPDDPPWEIFGLPRITKNLADLKRILSLVDSPAN
GITFCTGSLGADPTNDLPTMIREIGHRINFVHFRNVKYLGEHRFEETAHPSVAGSLDMAELMQALVDVGYEGVIRPDHGR
AIWDEKAMPGYGLYDRAMGLTYIQGLYEATKAKQNRK
>A4VVI4 4.2.1.8~~~uxuA~~~Mannonate dehydratase~~~COG1312
MKMSFRWYGKKDPVTLEEIKAIPGMQGIVTAVYDVPVGQAWPLENILELKKMVEEAGLEITVIESIPVHEDIKQGKPNRD
ALIENYKTSIRNVGAAGIPVVCYNFMPVFDWTRSDLHHPLPDGSTSLAFLKSDLAGVDPVADDLNLPGWDSSYSKEEMKA
IIENYRQNISEEDLWANLEYFIKAILPTAEEAGVKMAIHPDDPPYGIFGLPRIITGQEAVERFLNLYDSEHNGITMCVGS
YASDPKNDVLAMTEYALKRNRINFMHTRNVTAGAWGFQETAHLSQAGDIDMNAVVKLLVDYDWQGSLRPDHGRRIWGDQT
KTPGYGLYDRALGATYFNGLYEANMRAAGKTPDFGIKAKTVGTKEG
>O34896 1.-.-.-~~~uxuB~~~Uncharacterized oxidoreductase UxuB~~~COG1028
MIPLHENLAGKTAVITGGSGVLCSAMARELARHGMKVAILNRTAEKGQAVVKEITAAGGTACAVAADVLDRMSLERAKED
ILGQFGAVDLLINGAGGNHPDAITDVETYEEAGEGQSFFDMDERGFLTVFSTNFTGAFLASQVFGKELLKADSPAIINLS
SMSAYSPMTKVPAYSAAKASINNFTMWMAVHFAETGLRVNAIAPGFFLTKQNHDLLINQDGTFTSRSHKIIAGTPMKRFG
KPEDLLGTLLWLADESYSGFVTGITVPVDGGFMAYSGV
>Q9WXS3 1.1.1.57~~~uxuB~~~D-mannonate oxidoreductase~~~COG0246
MRLNRETIKDRAAWEKIGVRPPYFDLDEVEKNTKEQPKWVHFGGGNIFRGFVAAVLQNLLEEGKEDTGINVIELFDYEVI
DKVYKPYDNLSIAVTIKPDGDFEKRIIASVMEALKGDPSHPDWERAKEIFRNPSLQLASLTITEKGYNIEDQAGNLFPQV
MEDMKNGPVSPQTSMGKVAALLYERFKAGRLPIALLSLDNFSRNGEKLYSSVKRISEEWVKSGLVEKDFIDYLEKDVAFP
WSMIDKIVPGPSEFIKEHLEKLGIEGMEIFVTSKRTHIAPFVNMEWAQYLVIEDSFPNGRPKLEGADRNVFLTDRETVEK
AERMKVTTCLNPLHTALAIFGCLLGYKKIADEMKDPLLKKLVEGVGEEGIKVVVDPGIINPREFLNEVINIRLPNPYLPD
TPQRIATDTSQKMPIRFGETIKAYHERPDLDPRNLKYIPLVIAGWCRYLMGIDDEGREMQLSPDPLLENLRSYVSKIKFG
DPESTDDHLKPILSSQQLFRVNLYEVGLGEKIEELFKKMITGPRAVRKTLEEVVGREDG
>Q48247 ~~~vacA~~~Vacuolating cytotoxin autotransporter~~~
MEIQQTHRKINRPLVSLALVGALVSITPQQSHAAFFTTVIIPAIVGGIATGTAVGTVSGLLSWGLKQAEEANKTPDKPDK
VWRIQAGKGFNEFPNKEYDLYRSLLSSKIDGGWDWGNAARHYWVKGGQQNKLEVDMKDAVGTYTLSGLRNFTGGDLDVNM
QKATLRLGQFNGNSFTSYKDSADRTTRVDFNAKNISIDNFVEINNRVGSGAGRKASSTVLTLQASEGITSDKNAEISLYD
GATLNLASSSVKLMGNVWMGRLQYVGAYLAPSYSTINTSKVTGEVNFNHLTVGDKNAAQAGIIANKKTNIGTLDLWQSAG
LNIIAPPEGGYKDKPNNTPSQSGAKNDKNESAKNDKQESSQNNSNTQVINPPNSAQKTEVQPTQVIDGPFAGGKDTVVNI
NRINTNADGTIRVGGFKASLTTNAAHLHIGKGGVNLSNQASGRSLIVENLTGNITVDGPLRVNNQVGGYALAGSSANFEF
KAGTDTKNGTATFNNDISLGRFVNLKVDAHTANFKGIDTGNGGFNTLDFSGVTDKVNINKLITASTNVAVKNFNINELIV
KTNGISVGEYTHFSEDIGSQSRINTVRLETGTRSLFSGGVKFKGGEKLVIDEFYYSPWNYFDARNIKNVEITNKLAFGPQ
GSPWGTSKLMFNNLTLGQNAVMDYSQFSNLTIQGDFINNQGTINYLVRGGKVATLSVGNAAAMMFNNDIDSATGFYKPLI
KINSAQDLIKNTEHVLLKAKIIGYGNVSTGTNGISNVNLEEQFKERLALYNNNNRMDTCVVRNTDDIKACGMAIGDQSMV
NNPDNYKYLIGKAWKNIGISKTANGSKISVYYLGNSTPTENGGNTTNLPTNTTSNARSANNALAQNAPFAQPSATPNLVA
INQHDFGTIESVFELANRSKDIDTLYANSGAQGRDLLQTLLIDSHDAGYARKMIDATSANEITKQLNTATTTLNNIASLE
HKTSGLQTLSLSNAMILNSRLVNLSRRHTNHIDSFAKRLQALKDQKFASLESAAEVLYQFAPKYEKPTNVWANAIGGTSL
NNGSNASLYGTSAGVDAYLNGQVEAIVGGFGSYGYSSFNNRANSLNSGANNTNFGVYSRIFANQHEFDFEAQGALGSDQS
SLNFKSALLQDLNQSYHYLAYSAATRASYGYDFAFFRNALVLKPSVGVSYNHLGSTNFKSNSTNQVALKNGSSSQHLFNA
SANVEARYYYGDTSYFYMNAGVLQEFAHVGSNNAASLNTFKVNAARNPLNTHARVMMGGELKLAKEVFLNLGVVYLHNLI
SNIGHFASNLGMRYSF
>Q48245 ~~~vacA~~~Vacuolating cytotoxin autotransporter~~~
MEIQQTHRKINRPLVSLALVGALVSITPQQSHAAFFTTVIIPAIVGGIATGTAVGTVSGLLGWGLKQAEEANKTPDKPDK
VWRIQAGKGFNEFPNKEYDLYKSLLSSKIDGGWDWGNAATHYWIKGGQWNKLEVDMKDAVGTYKLSGLRNFTGGDLDVNM
QKATLRLGQFNGNSFTSYKDSADRTTRVDFNAKNILIDNFLEINNRVGSGAGRKASSTVLTLQASEGITSSKNAEISLYD
GATLNLASNSVKLNGNVWMGRLQYVGAYLAPSYSTINTSKVTGEVNFNHLTVGDHNAAQAGIIASNKTHIGTLDLWQSAG
LNIIAPPEGGYKDKPNNTPSQSGAKNDKQESSQNNSNTQVINPPNSTQKTEVQPTQVIDGPFAGGKDTVVNIDRINTKAD
GTIKVGGFKASLTTNAAHLNIGKGGVNLSNQASGRTLLVENLTGNITVDGPLRVNNQVGGYALAGSSANFEFKAGVDTKN
GTATFNNDISLGRFVNLKVDAHTANFKGIDTGNGGFNTLDFSGVTNKVNINKLITASTNVAVKNFNINELIVKTNGVSVG
EYTHFSEDIGSQSRINTVRLETGTRSIFSGGVKFKSGEKLVIDEFYYSPWNYFDARNIKNVEITRKFASSTPENPWGTSK
LMFNNLTLGQNAVMDYSQFSNLTIQGDFINNQGTINYLVRGGKVATLNVGNAAAMMFNNDIDSATGFYKPLIKINSAQDL
IKNTEHVLLKAKIIGYGNVSTGTNGISNVNLEEQFKERLALYNNNNRMDTCVVRNTDDIKACGMAIGNQSMVNNPDNYKY
LIGKAWKNIGISKTANGSKISVYYLGNSTPTENGGNTTNLPTNTTNNARFASYALIKNAPFAHSATPNLVAINQHDFGTI
ESVFELANRSKDIDTLYANSGAQGRDLLQTLLIDSHDAGYARTMIDATSANEITKQLNTATTTLNNIASLEHKTSSLQTL
SLSNAMILNSRLVNLSRRHTNNIDSFAKRLQALKDQRFASLESAAEVLYQFAPKYEKPTNVWANAIGGASLNNGGNASLY
GTSAGVDAYLNGQVEAIVGGFGSYGYSSFNNQANSLNSGANNTNFGVYSRIFANQHEFDFEAQGALGSDQSSLNFKSALL
RDLNQSYNYLAYSAATRASYGYDFAFFRNALVLKPSVGVSYNHLGSTNFKSNSTNKVALSNGSSSQHLFNASANVEARYY
YGDTSYFYMNAGVLQEFANFGSSNAVSLNTFKVNATRNPLNTHARVMMGGELKLAKEVFLNLGVVYLHNLISNIGHFASN
LGMRYSF
>H2K887 4.2.3.152~~~valA~~~2-epi-5-epi-valiolone synthase~~~COG0337
MTMTKQSSLSPGSRLYDYTTQDGAAWRVSALKEVSYDVVVQPRLLDPANPALADALSSGTTPARRLIVIDATVRSLYGEQ
LAAYLAGHDVEFHLCVIDAHESAKVMETVFEVVDAMDAFGVPRRHAPVLAMGGGVLTDIVGLAASLYRRATPYVRIPTTL
IGMIDAGIGAKTGVNFREHKNRLGTYHPSSLTLIDPGFLATLDARHLRNGLAEILKVALVKDAELFDLLEGHGASLVEQR
MQPGEGGTGGAALTVLRRAVQGMLEELQPNLWEHQLRRLVDFGHSFSPSVEMAALPELLHGEAVCIDMALSSVLAHHRGL
LTEAELGRVLDVMRLLHLPVLHPVCTPDLMRAALADTVKHRDGWQHMPLPRGIGDAVFVNDVTQREIEAALLTLAERDRV
PRWRALHGAVDMGV
>H2K888 2.7.7.91~~~valB~~~Valienol-1-phosphate guanylyltransferase~~~COG0448
MDGVRAVLLAGGEGRRMGPLGRGRLKPLVPFGGTSRLIDFSIANVHRSGLRDVLLLSQYEERRLMDDLHLVWNGRHRGFR
IDFGPYDAVYRRSPGKLPEQLPERTWPLERGTADALLTKAEYVFRQGDAEASEILVLHADHVYRFDYGDMIREHRASKAA
LTVSYQRIERRYVHLFGMVEFDGDGLLTAFEEKPDDPTSDLVFAAFCLFDAATLRRYLEQLRGTDWQHDISRDVIPAMLA
GGELIRGYEVKSYWEDIGTVDRYHRAHRGLLRADPTLALSDMPLTVAPEVPRHLVPGGPGRRASVVAADVANEGEIVSSV
VYPGARIGVDAHVVDCVVLPGAQVPDGTHLASAIVLEDGSVQQCEAEREEVAL
>Q3T6E2 2.7.1.214~~~valC~~~C(7)-cyclitol 7-kinase~~~
MTALTLAPCDLVVADLGGTTLRVGRITAGTSEVHDVKRVPTNGLGRYGALAPQELQDRVMEQLTREIAAHLTRPGQAPAQ
AVAVSFAGPMTADGVVLAGPTLWGGPAAPLPVADVLTQRLGLPVVAANDVTAAAWRYAAAEPEPFCLTTVSSGIGNKVFR
HGEIVIDQLGYGGEIGHWLVDHAEDAAPCECGGRGHLGAIASGRGALFAVRAAAAADASAFARSALAGPSGGVPEAITNE
AFAAAARAGDTFARESLRRSLRPLASAVSLLFTAIGVRRYLFVGGFALALGDTFLTLLGDELVRVGCFGLDEYATRAMLA
LGEDDDDHCLIGIGQLAAARLGAPRAVEVTA
>H2K893 2.4.1.338~~~valG~~~Validoxylamine A glucosyltransferase~~~COG1216
MPGATSHVPLLSVVIPTYNRAALLDRTLGTLARQTTALEDFEVVVSDDGSTDTTRDVVRSYEDRLRIKYVFQEDLGYRVA
SARNGGARLASAPLLAFLDTGVLAGPQYVQSVLAAHAGPAPAKVVLGCCYGYDPRNPHPELHSLVEEFPPEEAVRRVGDA
PWFQDMRLPEFTAVDFDLSRMHMPWLWFWTLNVSLPAADFWRVGGFDEDFTGWGGEDIELGYRLHAHGIPMTVSRESWGI
EAPHERTHEANVSSLMLNCDRFVRKHPSLLPELFWAVTNRGIFGSVETERLRFEEWASQARGQQVLDEIAIGLDTLPPSQ
HTQRVAVFGSGTEGLPITPRQNVELFLCDYDEGVLARQESRDDAAVSTWHLSGLRTPWPDQHFDLVIITSRMDGPRQAWG
EAFTKEAHRIASSVVEPSLRGD
>H2K885 2.5.1.135~~~valL~~~Validamine 7-phosphate valienyltransferase~~~COG0380
MTGSEIFLASKRAAITYDTDPATGEPRAWLAPGGTGNVVAEQAGVLNISWIASADSEDDRRASALNPDGVTMELHSGREI
LVRLIRHDPAVFRNVQNFMTANLMWAANNYGWDRWTQPSFGSDAREGWADFGRFTRDFADAILKSSAQSADPVYLVHDYQ
LVGVPALLREQRPDAPILLFVHIPWPSADYWRILPKEIRTGILHGMLPATTIGFFADRWCRNFLESVADLLPDARIDREA
MTVEWRGHRTRLRTMPLGYSPLTLDGRNPQLPEGIEEWADGHRLVVHSGRTDPIKNAERAVRAFVLAARGGGLEKTRMLV
RMNPNRLYVPANADYVHRVETAVAEANAELGSDTVRIDNDNDVNHTIACFRRADLLIFNSTVDGQNLSTFEAPLVNERDA
DVILSETCGAAEVLGEYCRSVNPFDLVEQAEAISAALAAGPRQRAEAAARRRDAARPWTLEAWVQAQLDGLAADHAARTA
TAERFDTAPAVSTRADL
>P25051 6.1.2.1~~~vanA~~~Vancomycin/teicoplanin A-type resistance protein VanA~~~
MNRIKVAILFGGCSEEHDVSVKSAIEIAANINKEKYEPLYIGITKSGVWKMCEKPCAEWENDNCYSAVLSPDKKMHGLLV
KKNHEYEINHVDVAFSALHGKSGEDGSIQGLFELSGIPFVGCDIQSSAICMDKSLTYIVAKNAGIATPAFWVINKDDRPV
AATFTYPVFVKPARSGSSFGVKKVNSADELDYAIESARQYDSKILIEQAVSGCEVGCAVLGNSAALVVGEVDQIRLQYGI
FRIHQEVEPEKGSENAVITVPADLSAEERGRIQETAKKIYKALGCRGLARVDMFLQDNGRIVLNEVNTLPGFTSYSRYPR
MMAAAGIALPELIDRLIVLALKG
>Q06893 6.1.2.1~~~vanB~~~Vancomycin B-type resistance protein VanB~~~COG1181
MNKIKVAIIFGGCSEEHDVSVKSAIEIAANINTEKFDPHYIGITKNGVWKLCKKPCTEWEADSLPAIFSPDRKTHGLLVM
KEREYETRRIDVAFPVLHGKCGEDGAIQGLFELSGIPYVGCDIQSSAACMDKSLAYILTKNAGIAVPEFQMIEKGDKPEA
RTLTYPVFVKPARSGSSFGVTKVNSTEELNAAIEAAGQYDGKILIEQAISGCEVGCAVMGNEDDLIVGEVDQIRLSHGIF
RIHQENEPEKGSENAMIIVPADIPVEERNRVQETAKKVYRVLGCRGLARVDLFLQEDGGIVLNEVNTLPGFTSYSRYPRM
AAAAGITLPALIDSLITLAIER
>P29753 6.3.2.-~~~vanC~~~Vancomycin C-type resistance protein VanC~~~
MKKIAVLFGGNSPEYSVSLTSAASVIQAIDPLKYEVMTIGIAPTMDWYWYQGNLANVRNDTWLEDHKNCHQLTFSSQGFI
LGEKRIVPDVLFPVLHGKYGEDGCIQGLLELMNLPYVGCHVAASALCMNKWLLHQLADTMGIASAPTLLLSRYENDPATI
DRFIQDHGFPIFIKPNEAGSSKGITKVTDKTALQSALTTAFAYGSTVLIQKAIAGIEIGCGILGNEQLTIGACDAISLVD
GFFDFEEKYQLISATITVPAPLPLALESQIKEQAQLLYRNLGLTGLARIDFFVTNQGAIYLNEINTMPGFTGHSRYPAMM
AEVGLSYEILVEQLIALAEEDKR
>Q05709 1.1.1.-~~~vanH~~~D-specific alpha-keto acid dehydrogenase~~~
MNNIGITVYGCEQDEADAFHALSPRFGVMATIINANVSESNAKSAPFNQCISVGHKSEISASILLALKRAGVKYISTRSI
GCNHIDTTAAKRMGITVDNVAYSPDSVADYTMMLILMAVRNVKSIVRSVEKHDFRLDSDRGKVLSDMTVGVVGTGQIGKA
VIERLRGFGCKVLAYSRSRSIEVNYVPFDELLQNSDIVTLHVPLNTDTHYIISHEQIQRMKQGAFLINTGRGPLVDTYEL
VKALENGKLGGAALDVLEGEEEFFYSDCTQKPIDNQFLLKLQRMPNVIITPHTAYYTEQALRDTVEKTIKNCLDFERRQE
HE
>Q9FDA3 2.3.1.184~~~vanM~~~Acyl-homoserine-lactone synthase VanM~~~
MKIISSLGSRLTNSLPIEKKQRALVEFVINTYQPQQRADLFRALTEHRKNQLLNLFPEHHNKSFSILFELMDYRELIRRY
PNTFGEEIAHLEQAVSECYSHWLDFWCECEIAAIKTLFPIEADAPHRIELPLKDCAYRGFLIDQIEDSELWVTTPSHPQK
MPIKDAITLSNLELFIKGEKWYEMLPLLSLSQKGKHFVLLKHPNNEASPTLVASMLVQDWSVNQTWLSYAQQFSNEQWQF
CLPDHSYDVLMELQLFKPALSKCDSLPEFDHQFRRQLTGTQAVCEVLRLTVSGNAQQKLYFLYLAQKKLIQILNQMGYKI
GFTIIEQPFILNFYQTIEPKSYFRSGYNDLNNNGKQSYRGFWMIERMDKVFSDTNFRDYRHAVSQRRKYDLTKRKEKEYA
>Q06240 2.7.13.3~~~vanS~~~Sensor protein VanS~~~
MVIKLKNKKNDYSKLERKLYMYIVAIVVVAIVFVLYIRSMIRGKLGDWILSILENKYDLNHLDAMKLYQYSIRNNIDIFI
YVAIVISILILCRVMLSKFAKYFDEINTGIDVLIQNEDKQIELSAEMDVMEQKLNTLKRTLEKREQDAKLAEQRKNDVVM
YLAHDIKTPLTSIIGYLSLLDEAPDMPVDQKAKYVHITLDKAYRLEQLIDEFFEITRYNLQTITLTKTHIDLYYMLVQMT
DEFYPQLSAHGKQAVIHAPEDLTVSGDPDKLARVFNNILKNAAAYSEDNSIIDITAGLSGDVVSIEFKNTGSIPKDKLAA
IFEKFYRLDNARSSDTGGAGLGLAIAKEIIVQHGGQIYAESNDNYTTFRVELPAMPDLVDKRRS
>Q9X3P3 5.1.1.-~~~vanT~~~Serine/alanine racemase~~~
MKNKGIDQFRVIAAMMVVAIHCLPLHYLWPEGDILITLTIFRVAVPFFFMISGYYVFAELAVANSYPSRQRVFNFIKKQL
KVYLLATLMFLPLALYSQTIGFDLPVGTLVQVLLVNGILYHLWYFPALITGSLLLTSLLIHVSFKKVFWLAAGLYLIGLG
GDSWFGLIQQTPIEPFYTAVFHLLDGTRNGIFFTPLFLCLGVLVRKQSEKRSLSKTALFFLISLIGLLIESAYLHGFSIP
KHDSMYLFLPVVLFFLFPLILRWHPHRTWKHPGQLSLWLYLLHPYTIAGTHFLSQKISILQNNLINYLVVLILTIGFICL
FLRQKHSWFRHKQTTPVKRAVKEFSKTALLHNLQEIQRIISPKTKVMAVVKADAYGCGAKEVAPVLEQAGIDFFAVATID
EGIRLRKNAVKSPILVLGYTSPKRIKELRRYSLTQSIISEGHAVALSQRKVAIDCHLAIDTGMHRLGVTPTIDSILSIFD
LPFLTISGVYSHLGSADRLNPDSMIRTQKQIACFDQILLELDQRQISYGITHLQSSYGILNYPDLNYDYVRPGILLTGSL
SDTNEPTKQRVSLQPILTLKAQLITKRVVAKGEAIGYGQTAVANQETTVGVVSIGYCDGLPRSLSNQEFCLSYRGQSLPQ
IGLICMDMLLIDLSHCPTIPIESEIEILTDWSDTAEQVQTITNELICRIGPRVSARIK
>Q47747 ~~~vanW~~~Vancomycin B-type resistance protein VanW~~~COG2720
MNRKRLTQRFPFLLPMRQAQRKICFYAGMRFDGCCYAQTIGEKTLPYLLFETDCALYNHNTGFDMIYQENKVFNLKLAAK
TLNGLLIKPGETFSFWRLVRHADKDTPYKDGLTVANGKLTTMSGGGMCQMSNLLFWVFLHTPLTIIQRSGHVVKEFPEPN
SDEIKGVDATISEGWIDLKVRNDTDCTYQIWVTLDDEKIIGQVFADKQPQALYKIANGSIQYVRESGGIYEYAKVERMQV
ALGTGEIIDCKLLYTNKCKICYPLPESVDIQEANQ
>Q47749 3.4.13.22~~~vanXB~~~D-alanyl-D-alanine dipeptidase~~~COG2173
MENGFLFLDEMLHGVRWDAKYATWDNFTGKPVDGYEVNRIIGTKAVALALREAQIHAAALGYGLLLWDGYRPKSAVDCFL
RWAAQPEDNLTKEKYYPNIERAELITKGYVASQSSHSRGSTIDLTLYHLDTGELVSMGSNFDFMDERSHHTAKGIGNAEA
QNRRCLRKIMESSGFQSYRFEWWHYKLIDEPYPDTYFNFAVS
>Q06241 3.4.13.22~~~vanX~~~D-alanyl-D-alanine dipeptidase~~~
MEIGFTFLDEIVHGVRWDAKYATWDNFTGKPVDGYEVNRIVGTYELAESLLKAKELAATQGYGLLLWDGYRPKRAVNCFM
QWAAQPENNLTKESYYPNIDRTEMISKGYVASKSSHSRGSAIDLTLYRLDTGELVPMGSRFDFMDERSHHAANGISCNEA
QNRRRLRSIMENSGFEAYSLEWWHYVLRDEPYPNSYFDFPVK
>Q9XAK6 3.4.13.22~~~vanX~~~D-alanyl-D-alanine dipeptidase~~~COG2173
MTGDFAFVDELVSGIRWDAKYATWDNFTGKPVDGYLANRIVGTKALCAALGRAQERAEDLGFGLLLWDGYRPQRAVDCFL
RWSQQPEDGRTKARHYPNIGRAEMFDRGYVAARSGHSRGATVDLTLYHLTTGELAAMGGGHDLMDPISHHDARDVPRAEA
ANRRHLRSIMAACGFASYACEWWHYTLKEEPHPDTYFDFPIA
>Q47746 3.4.17.-~~~vanYB~~~D-alanyl-D-alanine carboxypeptidase~~~COG1876
MEKSNYHSNVNHHKRHMKQSGEKRAFLWAFIISFTVCTLFLGWRLVSVLEATQLPPIPATHTGSGTGVAENPEENTLATA
KEQGDEQEWSLILVNRQNPIPAQYDVELEQLSNGERIDIRISPYLQDLFDAARADGVYPIVASGYRTTEKQQEIMDEKVA
EYKAKGYTSAQAKAEAETWVAVPGTSEHQLGLAVDINADGIHSTGNEVYRWLDENSYRFGFIRRYPPDKTEITGVSNEPW
HYRYVGIEAATKIYHQGLCLEEYLNTEK
>P37711 3.4.17.-~~~vanY~~~D-alanyl-D-alanine carboxypeptidase~~~
MKKLFFLLLLLFLIYLGYDYVNEALFSQEKVEFQNYDQNPKEHLENSGTSENTQEKTITEEQVYQGNLLLINSKYPVRQE
SVKSDIVNLSKHDELINGYGLLDSNIYMSKEIAQKFSEMVNDAVKGGVSHFIINSGYRDFDEQSVLYQEMGAEYALPAGY
SEHNSGLSLDVGSSLTKMERAPEGKWIEENAWKYGFILRYPEDKTELTGIQYEPWHIRYVGLPHSAIMKEKNFVLEEYMD
YLKEEKTISVSVNGEKYEIFYYPVTKNTTIHVPTNLRYEISGNNIDGVIVTVFPGSTHTNSRR
>E4QWH3 ~~~vapB1~~~Antitoxin VapB1~~~
MLTKVFQSGNSQAVRIPMDFRFDVDTVEIFRKENGDVVLRPVSKKTDDFLALFEGFDETFIQALEARDDLPPQERENL
>Q4QNL8 ~~~vapB1~~~Antitoxin VapB1~~~
MLTKVFQSGNSQAVRIPMDFRFDVDTVEIFRKENGDVVLRPVSKKTDDFLALFEGFDETFIQALEARDDLPPQERENL
>Q57534 ~~~vapB1~~~Antitoxin VapB1~~~COG4456
MLTKVFQSGNSQAVRIPMDFRFDVDTVEIFRKENGDVVLRPVSKKTDDFLALFEGFDETFIQALEARDDLPPQERENL
>Q4QLV9 ~~~vapB2~~~Antitoxin VapB2~~~
MIEASVFMTNRSQAVRLPAEVRFSEEIKKLSVRVSGSDRILSPLNQSWDSFFLNDQAVSDDFMNEREIAFQPEREAL
>O07227 ~~~vapB2~~~Antitoxin VapB2~~~
MSDVLIRDIPDDVLASLDAIAARLGLSRTEYIRRRLAQDAQTARVTVTAADLRRLRGAVAGLGDPELMRQAWR
>Q4UNB3 ~~~vapB2~~~Antitoxin VapB2~~~COG4456
MNKAKIFMNGQSQAVRLPKEFRFSVKEVSVIPLGKGIVLQPLPNSWKDVFQEMAEISSDDIFPEGRKDLPPQKRKYFE
>P9WJ59 ~~~vapB3~~~Antitoxin VapB3~~~
MLSRRTKTIVVCTLVCMARLNVYVPDELAERARARGLNVSALTQAAISAELENSATDAWLEGLEPRSTGARHDDVLGAID
AARDEFEA
>P9WF21 ~~~vapB4~~~Antitoxin VapB4~~~COG4118
MSATIPARDLRNHTAEVLRRVAAGEEIEVLKDNRPVARIVPLKRRRQWLPAAEVIGELVRLGPDTTNLGEELRETLTQTT
DDVRW
>P9WF19 ~~~vapB5~~~Putative antitoxin VapB5~~~COG4118
MSEVASRELRNDTAGVLRRVRAGEDVTITVSGRPVAVLTPVRPRRRRWLSKTEFLSRLRGAQADPGLRNDLAVLAGDTTE
DLGPIR
>A0QRY5 ~~~vapB~~~Antitoxin VapB~~~COG4423
MALSIKHPEADRLARELAARTGETLTEAVVMALRERLARTVGRTQVVPLREELAAIRRRCAALPVLDDRTAESILGYDDR
GLPS
>Q7CPV2 ~~~vapB~~~Antitoxin VapB~~~
MHTTLFFSNRTQAVRLPKSISFPEDVKHVEIIAVGRSRIITPVGESWDSWFDGEGASTDFMSTREQPAVQEREGF
>O06663 ~~~vapB~~~Antitoxin VapB~~~
METTVFLSNRSQAVRLPKAVALPENVKRVEVIAVGRTRIITPAGETWDEWFDGHSVSTDFMDNREQPGMQERESF
>E4QWH2 3.1.-.-~~~vapC1~~~Ribonuclease VapC1~~~
MIYMLDTNIIIYLMKNRPKIIAERVSQLLPNDRLVMSFITYAELIKGAFGSQNYEQSIRAIELLTERVNVLYPNEQICLH
YGKWANTLKKQGRPIGNNDLWIACHALSLNAVLITHNVKEFQRITDLQWQDWTK
>Q4QNL7 3.1.-.-~~~vapC1~~~Ribonuclease VapC1~~~
MIYMLDTNIIIYLMKNRPKIIAERVSQLLPNDRLVMSFITYAELIKGAFGSQNYEQSIRAIELLTERVNVLYPNEQICLH
YGKWANTLKKQGRPIGNNDLWIACHALSLNAVLITHNVKEFQRITDLQWQDWTK
>Q57122 3.1.-.-~~~vapC1~~~Ribonuclease VapC1~~~COG1487
MIYMLDTNIIIYLMKNRPKIIAERVSQLLPNDRLVMSFITYAELIKGAFGSQNYEQSIRAIELLTERVNVLYPNEQICLH
YGKWANTLKKQGRPIGNNDLWFACHALSLNAVLITHNVKEFQRITDLQWQDWTK
>P9WFB9 3.1.-.-~~~vapC2~~~Ribonuclease VapC2~~~COG1487
MTDQRWLIDKSALVRLTDSPDMEIWSNRIERGLVHITGVTRLEVGFSAECGEIARREFREPPLSAMPVEYLTPRIEDRAL
EVQTLLADRGHHRGPSIPDLLIAATAELSGLTVLHVDKDFDAIAALTGQKTERLTHRPPSA
>Q4UNB2 3.1.-.-~~~vapC2~~~Ribonuclease VapC2~~~COG1487
MIYMLDTNICVYAINKHPDSYYNNLELLAKNNTIAISSIVLAELQYGVSKSKKKEQNQSKLDIFLSRLEIIDFSAKCTFY
YGELRTELEQKGLIIGNNDLLIASHAIAENATLVTNNIKEFKRIPNLILENWDK
>P9WFB7 3.1.-.-~~~vapC3~~~Ribonuclease VapC3~~~COG4113
MRASPTSPPEQVVVDASAMVDLLARTSDRCSAVRARLARTAMHAPAHFDAEVLSALGRMQRAGALTVAYVDAALEELRQV
PVTRHGLSSLLAGAWSRRDTLRLTDALYVELAETAGLVLLTTDERLARAWPSAHAIG
>O07783 3.1.-.-~~~vapC4~~~Ribonuclease VapC4~~~COG1487
MNVRRALADTSVFIGIEATRFDPDRFAGYEWGVSVVTLGELRLGVLQASGPEAAARRLSTYQLAQRFEPLGIDEAVSEAW
ALLVSKLRAAKLRVPINDSWIAATAVAHGIAILTQDNDYAAMPDVEVITI
>P96917 3.1.-.-~~~vapC5~~~Ribonuclease VapC5~~~COG1487
MSTTPAAGVLDTSVFIATESGRQLDEALIPDRVATTVVTLAELRVGVLAAATTDIRAQRLATLESVADMETLPVDDDAAR
MWARLRIHLAESGRRVRINDLWIAAVAASRALPVITQDDDFAALDGAASVEIIRV
>P9WFA9 3.1.-.-~~~vapC9~~~Ribonuclease VapC9~~~COG4113
MIVVDASAALAALLNDGQARQLIAAERLHVPHLVDSEIASGLRRLAQRDRLGAADGRRALQTWRRLAVTRYPVVGLFERI
WEIRANLSAYDASYVALAEALNCALVTADLRLSDTGQAQCPITVVPR
>A0QRY6 3.1.-.-~~~vapC~~~Ribonuclease VapC~~~COG3742
MVIDTSALVAILTDEPDAELLEGAVADDPVRTMSTASYLETAIVIESRFGEPGGRELDLWLHRASVALVAVDADQADAAR
LAYRRYGKGRHRAGLNYGDCFSYALAKVSGQPLLFKGEAFRLTDVAAVH
>Q8ZM86 3.1.-.-~~~vapC~~~tRNA(fMet)-specific endonuclease VapC~~~
MLKFMLDTNTCIFTIKNKPEHIRERFNLNTSRMCISSITLMELIYGAEKSLAPERNLAVVEGFISRLEVLDYDTQAAIHT
GQIRAELARKGTPVGPYDQMIAGHAGSRGLVVVTNNLREFERIPGIRIEDWC
>O06662 3.1.-.-~~~vapC~~~tRNA(fMet)-specific endonuclease VapC~~~
MLKFMLDTNICIFTIKNKPASVRERFNLNQGKMCISSVTLMELIYGAEKSQMPERNLAVIEGFVSRIDVLDYDAAAATHT
GQIRAELARQGRPVGPFDQMIAGHARSRGLIIVTNNTREFERVGGLRTEDWS
>P71351 3.1.-.-~~~vapD~~~Endoribonuclease VapD~~~COG3309
MYAIAFLVVKDTQDYHPKGVQQAYTDIGAVLAKFGFVRTQGSLYINMNEDMANLFQAMNALKQLAWISQSVRDIRAFRIE
QWSDFTDFIRN
>O05728 3.1.-.-~~~vapD~~~Endoribonuclease VapD~~~COG3309
MYALAFDLKIEILKKEYGEPYNKAYDDLRQELELLGFEWTQGSVYVNYSKENTLAQVYKAINKLSQIEWFKKSVRDIRAF
KVEDFSDFTEIVKS
>P26839 2.3.1.-~~~vat~~~Virginiamycin A acetyltransferase~~~
MNLNNDHGPDPENILPIKGNRNLQFIKPTITNENILVGEYSYYDSKRGESFEDQVLYHYEVIGDKLIIGRFCSIGPGTTF
IMNGANHRMDGSTYPFHLFRMGWEKYMPSLKDLPLKGDIEIGNDVWIGRDVTIMPGVKIGDGAIIAAEAVVTKNVAPYSI
VGGNPLKFIRKRFSDGVIEEWLALQWWNLDMKIINENLPFIINGDIEMLKRKRKLLDDT
>Q56403 7.1.2.2~~~atpA~~~V-type ATP synthase alpha chain~~~COG1155
MIQGVIQKIAGPAVIAKGMLGARMYDICKVGEEGLVGEIIRLDGDTAFVQVYEDTSGLKVGEPVVSTGLPLAVELGPGML
NGIYDGIQRPLERIREKTGIYITRGVVVHALDREKKWAWTPMVKPGDEVRGGMVLGTVPEFGFTHKILVPPDVRGRVKEV
KPAGEYTVEEPVVVLEDGTELKMYHTWPVRRARPVQRKLDPNTPFLTGMRILDVLFPVAMGGTAAIPGPFGSGKTVTQQS
LAKWSNADVVVYVGCGERGNEMTDVLVEFPELTDPKTGGPLMHRTVLIANTSNMPVAAREASIYVGVTIAEYFRDQGFSV
ALMADSTSRWAEALREISSRLEEMPAEEGYPPYLAARLAAFYERAGKVITLGGEEGAVTIVGAVSPPGGDMSEPVTQSTL
RIVGAFWRLDASLAFRRHFPAINWNGSYSLFTSALDPWYRENVAEDYPELRDAISELLQREAGLQEIVQLVGPDALQDAE
RLVIEVGRIIREDFLQQNAYHEVDAYCSMKKAYGIMKMILAFYKEAEAAIKRGVSIDEILQLPVLERIGRARYVSEEEFP
AYFEEAMKEIQGAFKALA
>Q56404 ~~~atpB~~~V-type ATP synthase beta chain~~~COG1156
MDLLKKEYTGITYISGPLLFVENAKDLAYGAIVDIKDGTGRVRGGQVIEVSEEYAVIQVFEETTGLDLATTSVSLVEDVA
RLGVSKEMLGRRFNGIGKPIDGLPPITPEKRLPITGLPLNPVARRKPEQFIQTGISTIDVMNTLVRGQKLPIFSGSGLPA
NEIAAQIARQATVRPDLSGEGEKEEPFAVVFAAMGITQRELSYFIQEFERTGALSRSVLFLNKADDPTIERILTPRMALT
VAEYLAFEHDYHVLVILTDMTNYCEALREIGAAREEIPGRRGYPGYMYTDLATIYERAGVVEGKKGSVTQIPILSMPDDD
RTHPIPDLTGYITEGQIQLSRELHRKGIYPPIDPLPSLSRLMNNGVGKGKTREDHKQVSDQLYSAYANGVDIRKLVAIIG
EDALTENDRRYLQFADAFERFFINQGQQNRSIEESLQIAWALLSMLPQGELKRISKDHIGKYYGQKLEEIWGAPQALD
>P74902 ~~~atpC~~~V-type ATP synthase subunit C~~~COG1527
MADDFAYLNARVRVRRGTLLKESFFQEALDLSFADFLRLLSETVYGGELAGQGLPDVDRAVLRTQAKLVGDLPRLVTGEA
REAVRLLLLRNDLHNLQALLRAKATGRPFEEVLLLPGTLREEVWRQAYEAQDPAGMAQVLAVPGHPLARALRAVLRETQD
LARVEALLAKRFFEDVAKAAKGLDQPALRDYLALEVDAENLRTAFKLQGSGLAPDAFFLKGGRFVDRVRFARLMEGDYAV
LDELSGTPFSGLSGVRDLKALERGLRCVLLKEAKKGVQDPLGVGLVLAYVKEREWEAVRLRLLARRAYFGLPRAQVEEEV
VCP
>P50870 2.3.1.-~~~vatD~~~Streptogramin A acetyltransferase~~~
MGPNPMKMYPIEGNKSVQFIKPILEKLENVEVGEYSYYDSKNGETFDKQILYHYPILNDKLKIGKFCSIGPGVTIIMNGA
NHRMDGSTYPFNLFGNGWEKHMPKLDQLPIKGDTIIGNDVWIGKDVVIMPGVKIGDGAIVAANSVVVKDIAPYMLAGGNP
ANEIKQRFDQDTINQLLDIKWWNWPIDIINENIDKILDNSIIREVIWKK
>O87880 ~~~atpD~~~V-type ATP synthase subunit D~~~COG1394
MSQVSPTRMNLLQRRGQLRLAQKGVDLLKKKRDALVAEFFGLVREAMEARKALDQAAKEAYAALLLAQAFDGPEVVAGAA
LGVPPLEGVEAEVENVWGSKVPRLKATFPDGALLSPVGTPAYTLEASRAFRRYAEALIRVANTETRLKKIGEEIKKTTRR
VNALEQVVIPGIRAQIRFIQQVLEQREREDTFRLKRIKGKIEAREAEEEGGRPNPQVEIGAGL
>Q845T3 ~~~vatD~~~Ferric aerobactin-binding protein VatD~~~
MLSAALAFNSYALDITHEMGTTSFETTPKKVVALDWVLTETVLSLGIELEGAANISGYQQWVAEPHLNADAIDVGSRREP
NLELLSNIKPDVILISKHLAAAYEPLSKIAPVLVYSVYSEDKQPLESAKRITRSLGKLFDKEQQAEQVIAQTDQRLAANG
AKITSAGKAEKPLLFARFINDKTLRIHSEGSLAQDTINAMGLKNDWQEPTNLWGFTTTGTEKLAEHQKANVMIFGPLSQE
ERQQLTQSPLWQAMEFSRTDSVYELPAIWTFGGLLAAQRLSDHITGRLTQPQ
>P74901 ~~~atpE~~~V-type ATP synthase subunit E~~~COG1390
MSKLEAILSQEVEAEIQALLQEAEAKAEAVKREAEEKAKALLQARERALEAQYRAALRRAESAGELLVATARTQARGEVL
EEVRRRVREALEALPQKPEWPEVVRKLALEALEALPGAKALVANPEDLPHLEALARERGVELQAEPALRLGVRAVGAEGK
TQVENSLLARLDRAWDALSSKVAQALWG
>P74903 ~~~atpF~~~V-type ATP synthase subunit F~~~COG1436
MAVIADPETAQGFRLAGLEGYGASSAEEAQSLLETLVERGGYALVAVDEALLPDPERAVERLMRGRDLPVLLPIAGLKEA
FQGHDVEGYMRELVRKTIGFDIKL
>E6Z0R4 ~~~~~~Antitoxin VbhA~~~
MLSEEEIEYRRRDARNALASQRLEGLEPDPQVVAQMERVVVGELETSDVIKDLMERIKREEI
>E6Z0R3 2.7.7.108~~~vbhT~~~Protein adenylyltransferase VbhT~~~
MRKYEGSNDPYTDPETGVMYNLLGIKDQARLERVESAFAYIRSFELGRTSISGKFDLDHMKKIHKKLFGDVYEWAGKTRL
VDIVKDNSKFAHYTQIESYAPQITQQLAREQHLRGLDANEFSQRAGYYMGELNALHPFREGNGRTLREFIWQLAREAGYH
IDWDRVERQEMTRASIESYYGNSDLMSALIRRNLTEFTVNRRVDVSQGINERVLSHIDIDKEWPQKGFNIAIQTTQQAPY
LSSYTDTSNLEEKAQNALRNEQSYVDTFKELNDHLKTIYKDPQAAALKIEQTILAGKGDKLPDILAKAPNKVGELRGSDR
LIDKLKSAGKERKAALYNVPLAISTIRRLQSFYKNSYEKHMDKLTREREQLKVEVPSLSQEAVAYMKNVEVGRNNYSKIP
ENINKEFVQLESALNRRFGKDVIYKRNFNLSKEIASKQTYDKKLVNELQTAIKFLQQRHIQKQNNLAITRTPSKGITR
>Q69GM4 1.21.99.-~~~vcrA~~~Chloroethene reductive dehalogenase~~~COG2768
MSKFHKTISRRDFMKGLGLAGAGIGAVAASAPVFHDIDELVSSEANSTKDQPWYVKHREHFDPTITVDWDIFDRYDGYQH
KGVYEGPPDAPFTSWGNRLQVRMSGEEQKKRILAAKKERFPGWDGGLHGRGDQRADALFYAVTQPFPGSGEEGHGLFQPY
PDQPGKFYARWGLYGPPHDSAPPDGSVPKWEGTPEDNFLMLRAAAKYFGAGGVGALNLADPKCKKLIYKKAQPMTLGKGT
YSEIGGPGMIDAKIYPKVPDHAVPINFKEADYSYYNDAEWVIPTKCESIFTFTLPQPQELNKRTGGIAGAGSYTVYKDFA
RVGTLVQMFIKYLGYHALYWPIGWGPGGCFTTFDGQGEQGRTGAAIHWKFGSSQRGSERVITDLPIAPTPPIDAGMFEFC
KTCYICRDVCVSGGVHQEDEPTWDSGNWWNVQGYLGYRTDWSGCHNQCGMCQSSCPFTYLGLENASLVHKIVKGVVANTT
VFNSFFTNMEKALGYGDLTMENSNWWKEEGPIYGFDPGT
>Q69GM3 ~~~vcrB~~~Probable chloroethene reductive dehalogenase membrane anchor protein~~~
MDAIYFFLTIALAVGLTMLFTWFKKNNITLKWNEWVLGILGLLLALFAIQHTYASATYEFEYTSAWIVGVIVLLLAVVPL
LFAARSVRRRVDK
>Q9KKZ4 2.7.7.65~~~vdcA~~~Diguanylate cyclase VdcA~~~COG3706
MMTTEDFKKSTANLKKVVPLMMKHHVAATPVNYALWYTYVDQAIPQLNAEMDSVLKNFGLCPPASGEHLYQQYIATKAET
NINQLRANVEVLLGEISSSMSDTLSDTSSFANVIDKSFKDLERVEQDNLSIEEVMTVIRRLVSDSKDIRHSTNFLNNQLN
AATLEISRLKEQLAKVQKDALFDSLSGLYNRRAFDGDMFTLIHAGQQVSLIMLDIDHFKALNDNYGHLFGDQIIRAIAKR
LQSLCRDGVTAYRYGGEEFALIAPHKSLRIARQFAESVRRSIEKLTVKDRRSGQSVGSITASFGVVEKIEGDSLESLIGR
ADGLLYEAKNLGRNRVMPL
>Q9X698 ~~~vdcD~~~Protein VdcD~~~
MNHLPVECPRCAFEDISLLATSPVPGVWDVVQCGRCLYTWRTIEPARRTRRDAYPDSFKLTAEDIENAIEVPAVPPLLK
>K9UV87 1.2.1.67~~~vdh~~~Vanillin dehydrogenase~~~
MSFLDDEKWTGRVFTGSWERAAGGDAAVIEPATGDELGRVGIASPQDLAASAAKAAEAQRAWAATSFQERAAVLRRAGDL
WQQHAAELKDWLIRESGSIPGKADFELHVAAQECYEAAALPSHPTGEVLPSEAPRLSMARRVPAGVVGVIAPFNAPLILS
IRSVAPALALGNSVVLKPDPRTAVCGGVALARVFEEAGLPAGVLHVLPGGPDVGAALVEDKHVRVISFTGSTAAGRAVGE
SAGRHLKRAHLELGGNSALIVLDDADLEQAMSAAAWGSFFHQGQICMTTGRHLVHASLYDEYVDRLADKASHLPVGNPFT
EQVALGPIIDAKQRDKIHGLVTSSVDAGAKVAAGGTYEDLFYRATVLAGAGPSVPAYDQEVFGPVAPVAKFTSLDEAAKL
ASESEYGLSLGIITADVAKGLALADRIPTGIAHINDQTVNDEALAPFGGVFDSGTGSRFGGPAANIEAFTETRWVTMRGD
VAGYPF
>Q8NMB0 1.2.1.67~~~vdh~~~Vanillin dehydrogenase~~~COG1012
MTATFAGIDATKHLIGGQWVEGNSDRISTNINPYDDSVIAESKQASIADVDAAYEAAKKAQAEWAATPAAERSAIIYRAA
ELLEEHREEIVEWLIKESGSTRSKANLEITLAGNITKESASFPGRVHGRISPSNTPGKENRVYRVAKGVVGVISPWNFPL
NLSIRSVAPALAVGNAVVIKPASDTPVTGGVIPARIFEEAGVPAGVISTVAGAGSEIGDHFVTHAVPKLISFTGSTPVGR
RVGELAINGGPMKTVALELGGNAPFVVLADADIDAAAQAAAVGAFLHQGQICMSINRVIVDAAVHDEFLEKFVEAVKNIP
TGDPSAEGTLVGPVINDSQLSGLKEKIELAKKEGATVQVEGPIEGRLVHPHVFSDVTSDMEIAREEIFGPLISVLKADDE
AHAAELANASDFGLSAAVWSKDIDRAAQFALQIDSGMVHINDLTVNDEPHVMFGGSKNSGLGRFNGDWAIEEFTTDRWIG
IKRS
>O05619 1.2.1.67~~~vdh~~~Vanillin dehydrogenase~~~
MFHVPLLIGGKPCSASDERTFERRSPLTGEVVSRVAAASLEDADAAVAAAQAAFPEWAALAPSERRARLLRAADLLEDRS
SEFTAAASETGAAGNWYGFNVYLAAGMLREAAAMTTQIQGDVIPSNVPGSFAMAVRQPCGVVLGIAPWNAPVILGVRAVA
MPLACGNTVVLKSSELSPFTHRLIGQVLHDAGLGDGVVNVISNAPQDAPAVVERLIANPAVRRVNFTGSTHVGRIIGELS
ARHLKPAVLELGGKAPFLVLDDADLDAAVEAAAFGAYFNQGQICMSTERLIVTAVADAFVEKLARKVATLRAGDPNDPQS
VLGSLIDANAGQRIQVLVDDALAKGARQVVGGGLDGSIMQPMLLDQVTEEMRLYREESFGPVAVVLRGDGDEELLRLAND
SEFGLSAAIFSRDVSRAMELAQRVDSGICHINGPTVHDEAQMPFGGVKSSGYGSFGSRASIEHFTQLRWLTIQNGPRHYP
I
>O69056 1.4.1.23~~~vdh~~~Valine dehydrogenase~~~
MTDVTGAPADVLHTLFHSDQGGHEQVVLCQDRASGLKAVIALHSTALGPALGGTRFYPYANEAEAVADALNLARGMSYKN
AMAGLEHGGGKAVIIGDPEQIKSEELLLAYGRFVASLGGRYVTACDVGTYVADMDVVARECRWTTGRSPENGGAGDSSVL
TSFGVYQGMRAAAQHLWGDPTLRDRTVGIAGVGKVGHHLVEHLLAEGAHVVVTDVRKDVVRSLTERHPSVVAVADTDALI
RVENLDIYAPCALGGALNDETVPVLTAKVVCGAANNQLAHPGVEKDLADRGILYAPDYVVNAGGVIQVADELHGFDFDRC
KAKAAKIYDTTLAIFARAKEDGIPPAAAADRIAEQRMAEARARR
>Q06539 1.4.1.23~~~vdh~~~Valine dehydrogenase~~~COG0334
MTDVNGAPADVLHTLFHSDQGGHEQVVLCQDRASGLKAVIALHSTALGPALGGTRFYPYASEAEAVADALNLARGMSYKN
AMAGLDHGGGKAVIIGDPEQIKSEELLLAYGRFVASLGGRYVTACDVGTYVADMDVVARECRWTTGRSPENGGAGDSSVL
TSFGVYQGMRAAAQHLWGDPTLRDRTVGIAGVGKVGHHLVEHLLAEGAHVVVTDVRKDVVRGITERHPSVVAVADTDALI
RVENLDIYAPCALGGALNDDTVPVLTAKVVCGAANNQLAHPGVEKDLADRGILYAPDYVVNAGGVIQVADELHGFDFDRC
KAKASKIYDTTLAIFARAKEDGIPPAAAADRIAEQRMAEARPRP
>P37466 ~~~veg~~~Protein Veg~~~COG4466
MAKTLSDIKRSLDGNLGKRLTLKANGGRRKTIERSGILAETYPSVFVIQLDQDENSFERVSYSYADILTETVELTFNDDA
ASSVAF
>P76214 ~~~ves~~~Protein Ves~~~COG3758
MEYFDMRKMSVNLWRNAAGETREICTFPPAKRDFYWRASIASIAANGEFSLFPGMERIVTLLEGGEMLLESADRFNHTLK
PFQPFAFAADQVVKAKLTAGQMSMDFNIMTRLDVCKAKVRIAERTFTTFGSRGGVVFVINGAWQLGDKLLTTDQGACWFD
GRHTLRLLQPQGKLLFSEINWLAGHSPDQVQ
>P55222 ~~~vfr~~~cAMP-activated global transcriptional regulator Vfr~~~
MVAITHTPKLKHLDKLLAHCHRRRYTAKSTIIYAGDRCETLFFIIKGSVTILIEDDDGREMIIGYLNSGDFFGELGLFEK
EGSEQERSAWVRAKVECEVAEISYAKFRELSQQDSEILYTLGSQMADRLRKTTRKVGDLAFLDVTGRVARTLLDLCQQPD
AMTHPDGMQIKITRQEIGRIVGCSREMVGRVLKSLEEQGLVHVKGKTMVVFGTR
>P44228 ~~~~~~Mu-like prophage FluMu protein gp35~~~
MDKTFCVVVQNRIKEGYRRAGFSFHLGDNSLAAVSESQLAQLKADPRLVVQITETGSQEGGEGLSKEPAGSDEQKQLRAD
PPSTDLNTFTVEQLKAQLTERGITFKQSATKAELIALFAPADGEKSEA
>Q7W0D3 4.2.99.-~~~vgb~~~Virginiamycin B lyase~~~COG4257
MNQVEMTEFPVGKPEEALYGVASTPDGALWFTLAKGNAIGRLSPDGEVSRFPLPHADGQPTTITCGPDGRPWFTLSSANA
VGRLSPDGALRMFELPRPASRPFGIAAGHDGCLWFAEMAGDRIGRITIDGDIEEYDLPVKGGYPSCMAAGRDGLMWFTLN
QAGAIGSISATAAPRIFPLGAADAAPVGIASDAQGALWIAQAGNGAIARFDAGGRITEFPLHSRAARPHALAADAAGNLW
FTEWGANRIGRISEAGDTAGYELAAPGSEPHGIAIDPHGCVWAALETGSLVRLQASPRD
>P17978 4.2.99.-~~~vgb~~~Virginiamycin B lyase~~~
MEFKLQELNLTNQDTGPYGITVSDKGKVWITQHKANMISCINLDGKITEYPLPTPDAKVMCLTISSDGEVWFTENAANKI
GRITKKGIIKEYTLPNPDSAPYGITEGPNGDIWFTEMNGNRIGRITDDGKIREYELPNKGSYPSFITLGSDNALWFTENQ
NNAIGRITESGDITEFKIPTPASGPVGITKGNDDALWFVEIIGNKIGRITPLGEITEFKIPTPNARPHAITAGAGIDLWF
TEWGANKIGRLTSNNIIEEYPIQIKSGEPHGICFDGETIWFAMECDKIGKLTLIKDNME
>Q9KZX7 4.2.99.-~~~vgb~~~Virginiamycin B lyase~~~COG4257
MNEINESYDTDSVREFTVSDADAGPYALAEGPDGALWFTLVHRGAVARRDPDDGRVTVHPVGDGPTVIAPGPDGALWFTE
YRAHRIGRITPEGHYASFAPLTPEGGPFGITAGPDGAMWFTLSSADRVGRVTMDGEVTEHPAPGAFPSALTAGPDGALWC
TLNQGNAIGRLTPDGHGTAYPLPTPGAAPVGIAAGPDGALWFTEIGAGRIGRITVTGDLTEYPLSDPAARPHAVTAGPNG
ALWFTEWGSGRVGRITVDGRVTSYPLSRTDCEPHGIAVHDGALWCALETGSLARIQVPA
>Q9I741 ~~~vgrG1a~~~Type VI secretion system spike protein VgrG1a~~~
MQLTRLVQVDCPLGPDVLLLQRMEGREELGRLFAYELHLVSENPNLPLEQLLGKPMSLSLELPGGSRRFFHGIVARCSQV
AGHGQFAGYQATLRPWPWLLTRTSDCRIFQNQSVPEIIKQVFRNLGFSDFEDALTRPYREWEYCVQYRETSFDFISRLME
QEGIYYWFRHEQKRHILVLSDAYGAHRSPGGYASVPYYPPTLGHRERDHFFDWQMAREVQPGSLTLNDYDFQRPGARLEV
RSNIARPHAAADYPLYDYPGEYVQSQDGEQYARNRIEAIQAQHERVRLRGVVRGIGAGHLFRLSGYPRDDQNREYLVVGA
EYRVVQELYETGSGGAGSQFESELDCIDASQSFRLLPQTPVPVVRGPQTAVVVGPKGEEIWTDQYGRVKVHFHWDRHDQS
NENSSCWIRVSQAWAGKNWGSMQIPRIGQEVIVSFLEGDPDRPIITGRVYNAEQTVPYELPANATQSGMKSRSSKGGTPA
NFNEIRMEDKKGAEQLYIHAERNQDNLVENDASLSVGHDRNKSIGHDELARIGNNRTRAVKLNDTLLVGGAKSDSVTGTY
LIEAGAQIRLVCGKSVVEFNADGTINISGSAFNLYASGNGNIDTGGRLDLNSGGASEVDAKGKGVQGTIDGQVQAMFPPP
AKG
>Q9I0F3 ~~~vgrG1c~~~Type VI secretion system spike protein VgrG1c~~~
MAIGQPFATALGCTIIWRAAGALFGPPAAGDYQGRFPMLFSQHTRLVHVDSPLGPEVLQLQRLEGREELGRLFSHELELV
SSNPALPLDALLGKPMSLALELPGGSRRYFHGIVARCSQGAGAGQFASYQVTLRPWLWLLTRTSDCRIFQNQKVPDIIKQ
VFRDLGFSDFEDALSRSYREWEYCVQYRETSFDFVSRLMEQEGIYYWFRHEKKRHILVLSDAYGAHHSPAGYTSVPYYPP
SLGHRERDHFFDWHMAREVQPGSLSLNDYDFQRPGTRLEVRSNVGRAHAAADYPLYDYPGEYVQSQDGEHYARTRIEAIQ
TQYERVRLRGCARGIGAGHLFHLSNYPRLDQNREYLVVGAEYRVVQELYETGNGGGGAQFESELDCIDAGQAFRPLPSTP
VPVVRGPQTAVVVGPKGEEIWTDQYGRVKVHFHWDRHDQSNENSSCWMRVSQAWAGKNWGSIQIPRIGQEVIVSFLEGDP
DRPIITGRVYNAEQTVPYELPANATQSGTKSRSSKGGTPANFNEIRMEDKKGAEQLFIHAERNQDIEVENDESHWVGHDR
TKTIDHDETVHVKHDRTETVDNNETITVHANRSKTVDRNETVRIGMNKTETILMASLQNVGMGRMENVGLGYSLNVGMMM
NTVVGLNQSTQVMKKKTLSVGDSYEVSVGGSDDGSKITLDGQSITLGSQRIELTADREILLRCGQSTIRLTPGEIEILSP
NVDINC
>Q9I6M7 3.4.24.-~~~vgrG2b~~~Type VI secretion system spike protein VgrG2b~~~
MRQRDLKFTFVVGEGKLAFDVVEFELEEALCEPFRLNLKLASDKNAIDFKQVLDQPGTFTLWQDGRPARYVHGIVSHFTQ
GSSGFRRTRYELLLEPQLARLELCCNWRIFQEKSVPEILQALLKEHRVLDYEQRIYHEHLPREYCVQAGDSDHYLHDRLA
FEEGLVYYFRFDEHRHTLVCSDRLYVQERIAGGPVLFSAQPEGDNPQPVLHSFRYSENVRTARQTQRDYSFKRPTYDQEH
HLAGEALEHQGSSYERYDYPGRYKRSGAGRPFTESRLRGHRRDARVASVSGDDPRLIPGHAFALEGHPRADFNAWWRPVR
VVHRGTQYAGQEEESADAPLGVSYDLRAELVPEDVEWRPAPLPRPRIDGPQIATVVGPAGEEIHCDEWGRVKVQFPWDRE
GRHDEFSTCWIRVAQNWAGADWGHMAIPRIGQEVIVDYLDGDCDQPIVTGRTYRATNRPPYALPDHKILSTIKSKEYKGS
RANELRIDDTTAQISAALMSDHGASALHLGYLTHPRPEGGKPRGEGFELRTDEHGAVRAAKGLLLSTEEQLRAGAGHLDR
GVVVQVLEAALELARELGDYAGEHQGVGHDAAPQQTLQEAVRDLGHGANDESGKSNGGKPAIALSGPAGIAAATPASLTL
AAGEHVDSVARQNQQVTAGQKVVINAGSDIGLFAQGGELRQITHQGPMLLQAQKNDIRLEAKQSVEVSASQQHVLVTAKE
HITLMCGGAYLTLKGGNIELGMPGNFVVKAAKHSHVGAASLEAELPQFEVGETQRRFVLKQLDGQTAMPNVPYTITMANG
EVIEGVTDAEGATQLLQKDAMNIAKVDMKHTKSPASAVAGIAAAVGAAVAVGKLLGGPDAEAGRALSEGEISLAKGVFGD
SIDYSTVRLRDEDYVPWQGKDYVMAPNGHIYFGEELRGVADWSLESLQRQGLFIHEMTHVWQHQHGVNVLLVGAYQQARQ
FLLGDQYAYRLEPGKTLKDYNIEQQGDIVRDYFLEKNEFGEASANSRFAGVLKNFPTGY
>A0KJB0 2.4.2.31~~~vgrG1~~~Type VI secretion system spike protein VgrG1~~~COG3501
MADSTGLQFTVKVGALPESTFVVAEFALDEALNRPFNLRLELASAQPDIDFGAVLDQPCELLVWYNGELQRRVCGVVSDF
AQGDSGFRRTRYQLMVQPALWRLSLRQNCRIFQAQKPDEILSILLQEHGITDYAFALKNEHAKREYCVQYRETDLDFVNR
LAAEEGMFYFHEFEAGKHRIVFADDAAALTAGPELFFNLGNRSLEQGPYVRQFHYREAVRPSDVELKDYSCKTPAYGLSH
KKQGSELEHQRDTYQHFDYPGRYKQDPSGKAFAQHRLDALRNDAVAGQVKSNCAALLPGQTFSLTEHPNGSLNTDWQIVR
IRHTGEQPQALEEEGGSGPTVYHNEFGVVKASTTWRARIGSPEAPHKPMVDGPQIAMVVGPDGEEIYCDEHGRVKLQFPW
DRYGSSNDQSSCWVRVSQGWAGGQYGMMAIPRIGHEVIVSFLEGDPDQPIVTGRTYHATNRPPYELPANKTRTVLRTETH
QGEGFNELRFEDQAGQEEIYIHGQKDLNVLIENDAAWHIKHDQHTDIDNERVTRVRKVPGEEGAPPSLGNDHLTVEGEKR
DHIKADYSLTVDTSMHQKLGQSWLTQAGQEVHVKAGAKVVLEAGSEITVKVGGCFIKVDGGGVTLVGPTIKMNSGGNAGS
GSGWAGKVPKSMEGMLDSPHTRWMKFYHLDSELMPLAGTPYKAVLSDGSVREGTLDGEGMALLEDVPAGTASVTYDLQDT
FADLPRESISALTGHLDSLSDEG
>K7WKL8 2.4.2.31~~~vgrG1~~~Type VI secretion system spike protein VgrG1~~~
MADSTGLQFTVKVGALPENTFVVAEFALDEALNRPFNLRLELASAQPDIDFGAVLDQPCELLVWYNGELQRRVCGVVSDF
AQGDSGFRRTRYQLRVLPALWRLSLRQNSRIFQAQKPDEILSILLQEHGITDYAFALKNEHAKREYCVQYRESDLDFVNR
LAAEEGMFYFHEFEAGKHRIVFADDAAALTQGPELFFNLGNRSLEQGPYVRQFHYREAVRPSDVELKDYSFKTPAYGLSH
KKVGAELTHQRDTYQHFDFPGRYKEDPSGKAFAQHRLDALRNDAVAGQAKSNCAALLPGQSFSLTEHPNGSLNTDWQIVR
IQHTGLQPQALEEEGGSGPTVYHNEFGVVKASTTWRARIGSPEAPHKPMVDGPQIAIVVGPDGEEIYCDEHGRVKLQFPW
DRYGSSNDQSSCWVRVSQGWAGGQYGMMAIPRIGHEVIVSFLEGDPDQPIVTGRTYHATNRPPYELPANKTRTVLRTETH
QGEGFNELRFEDQVGQEEIYIHGQKDLNVLIENDAAWHIKHDEHTDVDNERVTRIKANDHLTVEGEKRDQIKADYSLTVD
TSMHQKLGDSWLTQAGQEVHVKAGAKVVLEAGSEITVKVGGCFIKVDGGGVTLVGPTIKMNSGGSPSSGSGWGGKSPVDP
LGVSVPPKPKVPLTPAQLATMKSAAPFCEECEKCKEGGCEI
>A0A0H3AIG7 6.3.2.-~~~vgrG1~~~Actin cross-linking toxin VgrG1~~~COG3501
MATLAYSIEVEGLEDETLVVRGFHGQESLSNSVFLGQACYGFRYEVQLASRVSNLTAEQMVDKRAELKLYRNSQLVQRVH
GIVRAFSQGDIGHHHTFYQLTLVPALERLSLRHNSRIFQKQTVPEILSILLQEMGINDYAFALKRDGVQREFCVQYRESD
IDFLHRLAAEEGLVYSFVHEAGKHTLYFSDASDSLSKLPEPIPYNALVGGAIDTPYIHGLTYRTQAEVSEVQLKDYSFKK
PAYSFLQTVQGTELDYQQTRYQHFDAPGRYKDDVNGAAFSQIRLDYLRRHAHTATGQSNEPLLRAGYKFDLQEHLDPAMN
RDWVVVSINHQGEQPQALQEDGGSGATTYSNQFSLIPGHLHWRAEPQPKPQVDGPMIATVVGPEGEEIFCDEHGRVKIHF
PWDRYSNGNEQSSCWVRVSQGWAGSQYGFIAIPRIGHEVIVEFLNGDPDQPIITGRTYHATNTPPYTLPEHKTKTVLRTE
THQGEGFNELSFEDQAGKEQIYLHAQKDFDGLIENDQFTQIKHNQHLTVEWESREAVTGEQVLSIEGSLHVKTGKVRVNE
AGTEIHVKAGQKVVIEAGSEITVKAGGSFVKVDPAGVHLSGALVNLNSGGSAGSGSGFGGAMPALPGGLEPAVALAPPQT
ISYQALLQAEQANVPAVKVCPLAAQEATPAVNSITPPPPPPIAPPMAPPQPIMNPQPTANAQPNLGRSTKATPDFPTHFP
KSSIGIENELAGLVVAMPANSAQKFGYVKSAQGDALFMLTKDMNQGSYQRPPSLQDGKNYQNWQTHTVELVSYPCEMDDK
AAVETRKQAMLWLATHFTTHIDQSNHQPLAPIQSEDGRFVIEITNAKHVIAAGNGISAESQGQTITMTPSGQQATVGVAA
KGFGTSATPELRLLESAPWYQKSLKSQFASLTSAENLDDKELAANVFAYLTSIYLKTAELAKKFGIYINEWDPMSEQITP
NANGLTDPKVKNAWEILPRTKPSKIVEILSKSDAKAVMKHIKPQLQSRYSESLSKNVFQYFQDGGEVAGHGINNATVGDK
HSPELAILFEFRTVPNELQSYLPKTESTTKSEVKLLDQFDPMKRKTVIQQVESLV
>Q9KS45 6.3.2.-~~~vgrG1~~~Actin cross-linking toxin VgrG1~~~COG3501
MATLAYSIEVEGLEDETLVVRGFHGQESLSNSVFLGQACYGFRYEVQLASRVSNLTAEQMVDKRAELKLYRNSQLVQRVH
GIVRAFSQGDIGHHHTFYQLTLVPALERLSLRHNSRIFQKQTVPEILSILLQEMGINDYAFALKRDGVQREFCVQYRESD
IDFLHRLAAEEGLVYSFVHEAGKHTLYFSDASDSLSKLPEPIPYNALVGGAIDTPYIHGLTYRTQAEVSEVQLKDYSFKK
PAYSFLQTVQGTELDYQQTRYQHFDAPGRYKDDVNGAAFSQIRLDYLRRHAHTATGQSNEPLLRAGYKFDLQEHLDPAMN
RDWVVVSINHQGEQPQALQEDGGSGATTYSNQFSLIPGHLHWRAEPQPKPQVDGPMIATVVGPEGEEIFCDEHGRVKIHF
PWDRYSNGNEQSSCWVRVSQGWAGSQYGFIAIPRIGHEVIVEFLNGDPDQPIITGRTYHATNTPPYTLPEHKTKTVLRTE
THQGEGFNELSFEDQAGKEQIYLHAQKDFDGLIENDHTTVIRHDHHLTVENDQFTQIKHNQHLTVEWESREAVTGEQVLS
IEGSLHVKTGKVWVNEAGTEIHVKAGQKVVIEAGSEITVKAGGSFVKVDPAGVHLSGALVNLNSGGSAGSGSGFGGAMPA
LPGGLEPAVALAPPQTISYQALLQAEQANVPAVKVCPLAAQEATPAVNSITPPPPPPIAPPMAPPQPIMNPQPTANAQPN
LGRSTKATPDFPTHFPKSSIGIENELAGLVVAMPANSAQKFGYVKSAQGDALFMLTKDMNQGSYQRPPSLQDGKNYQNWQ
THTVELVSYPCEMDDKAAVETRKQAMLWLATHFTTHIDQSNHQPLAPIQSEDGRFVIEITNAKHVIAAGNGISAESQGQT
ITMTPSGQQATVGVAAKGFGTSATPELRLLESAPWYQKSLKSQFASLTSAENLDDKELAANVFAYLTSIYLKTAELAKKF
GIYINEWDPMSEQITPNANGLTDPKVKNAWEILPRTKPSKIVEILSKSDAKAVMKHIKPQLQSRYSESLSKNVFQYFQDG
GEVAGHGINNATVGDKHSPELAILFEFRTVPNELQSYLPKTESTTKSEVKLLDQFDPMKRKTVIQQVESLVQNSGDAFDK
WYQSYRDSMNQPPVKNAKKIASANQKAQWVKEHNPQEWQRIIA
>Q9KN42 3.2.1.-~~~vgrG3~~~Type VI secretion system spike protein VgrG3~~~COG3409
MARLQFQLKVDGLEDESLVVRGFEGQESLSDSVWRCEPCYGFRYQVDLASALSNLTAEQFVDQTAHLTILRDGQVVQQIN
GIVRQLSKGDTGHRHTFYSLTLVPALERLSLRSNSRIFQQQSVPEIISILLQEMGIEDYAFALKRECAQREFCVQYRETD
LQFLHRIAAEEGLVYSHLHEAQKHTLLFTDSSDSQPKLAKPVPYNALAGGEINLPYVVDLQFKTTAQVSHTELKDYSFKK
PAYGFTQRTQGKDIAYQQPNYEHFDAPGRYKDDANGKAFSQIRLEYLRRDALLADAKSDEPLLLAGVRFDLQDHLDHAMN
RDWLVVQANHQGTQPQALQEEGGSGATTYSNQLKLIPAHITWRARPCAKPQVDGPMIATVVGPQGEEIYCDNFGRVKVHF
PWDRYSSSNEKSSCWVRVAQEWAGSQYGSMAIPRVGHEVIVSFLNGDPDQPIITGRTYHATNTAPYALPDHKTKTVLRTE
THQGQGYNELSFEDQAGSEQILLHAQKDWDALIEHDHTEVIRHDQHLTVDNDRFTRIQRNQHLTVEGEVRSKIALDSSHE
VGASLQHKVGQRIAVEAGKEISLKSGAKIVVEAGAELTLKAGGSFVKVDAGGVHLVGPAINLNAGGSAGSGSAYGGQLAA
APRMLAQAKPVAELVQPDIAASMQSGAARVIDVASLPTMMPSSANNTANDEPVAEEKTPERILKSDLLKPSDELEKLAKR
QASAYRQGNHSDEVKLLQEALIKLGFDLGKAGADGDFGSKTKTAIEQFQKSYQPSHQTHPSYSIGAVDGIVGKGTLLALD
EALMDGWVYENNIYQIWPLGKTSEKYESAGRGPGVISTGNGDYGGASYGCYQMSSNLGVVQKYIQSSKFKEFFSGLNPAT
KEFNVVWQDIASRYPQEFREEQHQFIKRTHYDIQIGHLRGKGLLFEHNRAAVHDLIWSTSVQFGGRTNLIFNALNGQNME
SMTDKDIIILVQDYKLVNTERLFKSSPSWWSDLKKRAVSEKKALLELEIDGLEVDIK
>P0ADN0 ~~~viaA~~~Protein ViaA~~~COG2425
MLTLDTLNVMLAVSEEGLIEEMIIALLASPQLAVFFEKFPRLKAAITDDVPRWREALRSRLKDARVPPELTEEVMCYQQS
QLLSTPQFIVQLPQILDLLHRLNSPWAEQARQLVDANSTITSALHTLFLQRWRLSLIVQATTLNQQLLEEEREQLLSEVQ
ERMTLSGQLEPILADNNTAAGRLWDMSAGQLKRGDYQLIVKYGEFLNEQPELKRLAEQLGRSREAKSIPRNDAQMETFRT
MVREPATVPEQVDGLQQSDDILRLLPPELATLGITELEYEFYRRLVEKQLLTYRLHGESWREKVIERPVVHKDYDEQPRG
PFIVCVDTSGSMGGFNEQCAKAFCLALMRIALAENRRCYIMLFSTEIVRYELSGPQGIEQAIRFLSQQFRGGTDLASCFR
AIMERLQSREWFDADAVVISDFIAQRLPDDVTSKVKELQRVHQHRFHAVAMSAHGKPGIMRIFDHIWRFDTGMRSRLLRR
WRR
>P0C6D3 3.3.2.1~~~vibB~~~Vibriobactin-specific isochorismatase~~~COG1535
MAIPKIASYPLPVSLPTNKVDWRIDASRAVLLIHDMQEYFVHYFDSQAEPIPSLIKHIQQLKAHAKQAGIPVVYTAQPAN
QDPAERALLSDFWGPGLSEETAIIAPLAPESGDVQLTKWRYSAFKKSPLLDWLRETGRDQLIITGVYAHIGILSTALDAF
MFDIQPFVIGDGVADFSLSDHEFSLRYISGRTGAVKSTQQACLEIAAQHSKLTGLSLRTMQHDVAAALNLSVDEVDVQEN
LLFLGLDSIRAIQLLEKWKAQGADISFAQLMEHVTLQQWWQTIQANLHQPCSA
>Q9S3V1 1.4.3.23~~~vioA~~~Flavin-dependent L-tryptophan oxidase VioA~~~COG1231
MKHSSDICIVGAGISGLTCASHLLDSPACRGLSLRIFDMQQEAGGRIRSKMLDGKASIELGAGRYSPQLHPHFQSAMQHY
SQKSEVYPFTQLKFKSHVQQKLKRAMNELSPRLKEHGKESFLQFVSRYQGHDSAVGMIRSMGYDALFLPDISAEMAYDIV
GKHPEIQSVTDNDANQWFAAETGFAGLIQGIKAKVKAAGARFSLGYRLLSVRTDGDGYLLQLAGDDGWKLEHRTRHLILA
IPPSAMAGLNVDFPEAWSGARYGSLPLFKGFLTYGEPWWLDYKLDDQVLIVDNPLRKIYFKGDKYLFFYTDSEMANYWRG
CVAEGEDGYLEQIRTHLASALGIVRERIPQPLAHVHKYWAHGVEFCRDSDIDHPSALSHRDSGIIACSDAYTEHCGWMEG
GLLSAREASRLLLQRIAA
>Q9XCW4 2.6.1.33~~~vioA~~~dTDP-4-amino-4,6-dideoxy-D-glucose transaminase~~~
MNDKTIPVTQPSLPELAEFMPYLEKIWKNKWLTNNGPFHQELEEKLCEFLGVQHISLFNNATIALITALQALRITGEVIT
TPYSFVATSHAILWNGLTPVFVDIENDGYNIDYRKIEQAITPKTSAILPVHCYSTPCEVEEIQKIADNYGLKVIYDAAHA
FGVNFKGGKVYLTMVIYQFLVSMRRKSSINFEGGAIISPDAKTKLRIDRLKNFGIADELTVTAPGINGKMSEINAAFGLV
QLKHIEGSISKRKIIDSLYRNLLKGTPGITIFPGNINTNSNYSYFPILIDDGFHMSRDQAYELLKKNNILSRKYFYPLIS
NMPMYRGLISASVDNLPIANSVADKVLCLPIYTDLNEEIVVKITKLLLGKM
>Q6U1I3 2.6.1.33~~~vioA~~~dTDP-4-amino-4,6-dideoxy-D-glucose transaminase~~~
MEKPIFVTQPNLPPLEEFIPYLEIIWQNKQFTNNGPMHQKLEKKLCEFLGVEYISLFNNGTIALITAVQALGVKGEVITT
PYSFVATAHSLVLNGLKPVFVDIDPKTLNIDPRRIEEAITPETQAIMPVHCYGNPCDTQAIADIAQKYNLKVIYDAAHAF
GVEDDDGSVLRHGDLSVLSFHATKVFSTFEGGAIVCNSKEMKEKIDRLKNFGYIDETNINIIGSNGKMSEVNAAFGLLQL
EHMDTFLRGRMNADMFYRQKLKDITGISIVIPSGQKISNFSYFPILVESDFPLSRDELFNYLKNQNIFARRYFYPVIPDF
QAYLNVGEVCDVKNAREIASKVLCLPMHAELSSDILEYIVSTIREIK
>Q9S3V0 1.21.98.-~~~vioB~~~2-imino-3-(indol-3-yl)propanoate dimerase~~~COG1633
MSILDFPRIHFRGWARVNAPTANRDPHGHIDMASNTVAMAGEPFDLARHPTEFHRHLRSLGPRFGLDGRADPEGPFSLAE
GYNAAGNNHFSWESATVSHVQWDGGEADRGDGLVGARLALWGHYNDYLRTTFNRARWVDSDPTRRDAAQIYAGQFTISPA
GAGPGTPWLFTADIDDSHGARWTRGGHIAERGGHFLDEEFGLARLFQFSVPKDHPHFLFHPGPFDSEAWRRLQLALEDDD
VLGLTVQYALFNMSTPPQPNSPVFHDMVGVVGLWRRGELASYPAGRLLRPRQPGLGDLTLRVSGGRVALNLACAIPFSTR
AAQPSAPDRLTPDLGAKLPLGDLLLRDEDGALLARVPQALYQDYWTNHGIVDLPLLREPRGSLTLSSELAEWREQDWVTQ
SDASNLYLEAPDRRHGRFFPESIALRSYFRGEARARPDIPHRIEGMGLVGVESRQDGDAAEWRLTGLRPGPARIVLDDGA
EAIPLRVLPDDWALDDATVEEVDYAFLYRHVMAYYELVYPFMSDKVFSLADRCKCETYARLMWQMCDPQNRNKSYYMPST
RELSAPKARLFLKYLAHVEGQARLQAPPPAGPARIESKAQLAAELRKAVDLELSVMLQYLYAAYSIPNYAQGQQRVRDGA
WTAEQLQLACGSGDRRRDGGIRAALLEIAHEEMIHYLVVNNLLMALGEPFYAGVPLMGEAARQAFGLDTEFALEPFSEST
LARFVRLEWPHFIPAPGKSIADCYAAIRQAFLDLPDLFGGEAGKRGGEHHLFLNELTNRAHPGYQLEVFDRDSALFGIAF
VTDQGEGGALDSPHYEHSHFQRLREMSARIMAQSAPFEPALPALRNPVLDESPGCQRVADGRARALMALYQGVYELMFAM
MAQHFAVKPLGSLRRSRLMNAAIDLMTGLLRPLSCALMNLPSGIAGRTAGPPLPGPVDTRSYDDYALGCRMLARRCERLL
EQASMLEPGWLPDAQMELLDFYRRQMLDLACGKLSREA
>Q9XCW3 2.3.1.209~~~vioB~~~dTDP-4-amino-4,6-dideoxy-D-glucose acyltransferase~~~
MAYLDEIQLKEMGFKSVGENVKISDKASFYGCDNISIGNNVRIDDFCVFSAGEGGIDIHDYIHIAVYSSIIGKGKVTISD
YANISSRVSIYSSNEYYSGNYMSNPVVPSEYTNIHSGTVFIGKHVIIGCGSIVLPDVILHEGAAIGALSVVKEDCEAFTV
NVGIPAKPISERSKKLLELESVFKPSAIGDNL
>Q9S3U9 1.14.13.224~~~vioC~~~Violacein synthase~~~COG0654
MKRAIIVGGGLAGGLTAIYLAKRGYEVHVVEKRGDPLRDLSSYVDVVSSRAIGVSMTVRGIKSVLAAGIPRAELDACGEP
IVAMAFSVGGQYRMRELKPLEDFRPLSLNRAAFQKLLNKYANLAGVRYYFEHKCLDVDLDGKSVLIQGKDGQPQRLQGDM
IIGADGAHSAVRQAMQSGLRRFEFQQTFFRHGYKTLVLPDAQALGYRKDTLYFFGMDSGGLFAGRAATIPDGSVSIAVCL
PYSGSPSLTTTDEPTMRAFFDRYFGGLPRDARDEMLRQFLAKPSNDLINVRSSTFHYKGNVLLLGDAAHATAPFLGQGMN
MALEDARTFVELLDRHQGDQDKAFPEFTELRKVQADAMQDMARANYDVLSCSNPIFFMRARYTRYMHSKFPGLYPPDMAE
KLYFTSEPYDRLQQIQRKQNVWYKIGRVN
>Q9S3U8 1.14.13.217~~~vioD~~~Protodeoxyviolaceinate monooxygenase~~~COG0654
MKILVIGAGPAGLVFASQLKQARPLWAIDIVEKNDEQEVLGWGVVLPGRPGQHPANPLSYLDAPERLNPQFLEDFKLVHH
NEPSLMSTGVLLCGVERRGLVHALRDKCRSQGIAIRFESPLLEHGELPLADYDLVVLANGVNHKTAHFTEALVPQVDYGR
NKYIWYGTSQLFDQMNLVFRTHGKDIFIAHAYKYSDTMSTFIVECSEETYARARLGEMSEEASAEYVAKVFQAELGGHGL
VSQPGLGWRNFMTLSHDRCHDGKLVLLGDALQSGHFSIGHGTTMAVVVAQLLVKALCTEDGVPAALKRFEERALPLVQLF
RGHADNSRVWFETVEERMHLSSAEFVQSFDARRKSLPPMPEALAQNLRYALQR
>Q8ZM36 ~~~~~~Virulence protein STM3117~~~
MLFFNVASLKYKHHESIQMIIDRIDHLVLTVSDISTTIRFYEEVLGFSAVTFKQNRKALIFGAQKINLHQQEMEFEPKAS
RPTPGSADLCFITSTPINDVVSEILQAGISIVEGPVERTGATGEIMSIYIRDPDGNLIEISQYV
>P10799 2.7.13.3~~~virA~~~Wide host range VirA protein~~~
MNGRYSPTRQDFKTGAKPWSILALIVAAMIFAFMAVASWQDNATTQAILSQLRSINADSASLQRDVLRAHTGTVANYRPI
ISRLGALRKNLEDLKQLFRQSHIVSESNAAQLLRQLEVSLNSADAAVAAFGAQNVRLQDSLASFTRALSSLPGKASTDQT
LEKPTELASMMLQFLRQPSPAISFEISLELERLQKQRGLDEAPVRILAREGPIILSLLPQVKDLVNMIQTSDTAEIAEML
QRECLEVYSLKNVEERSARIFLGSASVGLCLYIITLVYRLRKKTDWLARRLDYEELIKEIGVCFEGEAATTSSAQAALRI
IQRFFDADTCALALVDHDRRWAVETFGAKHPKPVWDDSVLREIVSRTKADERATVFRIISSKKIVHLPLEIPGLSILLAH
KSTDKLIAVCSLGYQSYRPRPCQGEIQLLELATACLCHYIDVRRKQTQCDVLARRLEHAQRLEAVGTLAGGIAHEFNNIL
GSILGHAELAQNSVSRTSVTRRYIDYIISSGDRAMLIIDQILTLSRKQERMIKPFSVSELVTEIAPLLRMALPPNIELSF
RFDQMQSVIEGSPLELQQVLINICKNASQAMTANGQIDIIISQAFLPVKKILAHGVMPPGDYVLLSISDNGGGIPEAVLP
HIFEPFFTTRARNGGTGLGLASVHGHISAFAGYIDVSSTVGHGTRFDIYLPPSSKEPVNPDSFFGRNKAPRGNGEIVALV
EPDDLLREAYEDKIAALGYEPVGFRTFNEIRDWISKGNEADLVMVDQASLPEDQSPNSVDLVLKTASIIIGGNDLKMTLS
REDVTRDLYLPKPISSRTMAHAILTKIKT
>Q7BU69 3.4.22.-~~~virA~~~Cysteine protease-like VirA~~~
MQTSNITNHERNDSSWMSTVKSTTEVSWNKLSFCDILLKIITFGIYSPHETLAEKHSEKKLMDSFSPSLSQDKMDGEFAH
ANIDGISIRLCLNKGICSVFYLDGDKIQSTQLSSKEYNNLLSSLPPKQFNLGKVHTITAPVSGNFKTHKPAPEVIETAIN
CCTSIIPNDDYFHVKDTDFNSVWHDIYRDIRASDSNSTKIYFNNIEIPLKLIADLINELGINEFIDSKKELQMLSYNQVN
KIINSNFPQQDLCFQTEKLLFTSLFQDPAFISALTSAFWQSLHITSSSVEHIYAQIMSENIENRLNFMPEQRVINNCGHI
IKINAVVPKNDTAISASGGRAYEVSSSILPSHITCNGVGINKIETSYLVHAGTLPSSEGLRNAIPPESRQVSFAIISPDV
>Q9RPY4 ~~~virB1~~~Type IV secretion system protein virB1~~~
MVPFLVLAQQCAPTVAPQTMAAIVQVESGFNPYAIGVVGGRLVRQPVSLDEAITTAQSLEAKGWNFSLGIAQVNRYNLPK
YGSTYAQAFDPCKNLKMGSKILEDCYRRAIVKMPGQEQGALRAAFSCYYAGNFTGGFKTKPGSPSYVQKVVASADVTTKP
IAVVPMIRKTPDAAAAVAAPVKKRQPADRNSVLVDLHPSSQSMPATGAANAPVRLKTEQPATTDAPPGKDNTDGVVVF
>P17792 ~~~virB2~~~Protein virB2~~~
MRCFERYRVHLNRLSLSNAVMRMVSGYAPSVVGAMGWSIFSSGPAAAQSAGGGTDPATMVNNICTFILGPFGQSLAVLGI
VAIGISWMFGRASLGLVAGVVGGIVIMFGASFLGKTLTGGG
>Q9R3F2 ~~~virB2~~~Type IV secretion system protein virB2~~~COG3838
MTDTISRNIIFIIIMLLLTALVVSDPSYAAAATGSASGLGNVDNVLQSIVTMMTGTTAKLIATICVAAVGIGWMYGFIDL
RKAAYCLIGIGIVFGASALVSKLTSAS
>Q7CEG0 ~~~virB2~~~Type IV secretion system protein virB2~~~
MKTASPSKKSLSRILPHLLLALIVSIAAIEPNLAHANGGLDKVNTSMQKVLDLLSGVSITIVTIAIIWSGYKMAFRHARF
MDVVPVLGGALVVGAAAEIASYLLR
>P09776 ~~~virB2~~~Protein virB2~~~
MRCFERYRLHLNRLSLSNAMMRVISSCAPSLGGAMAWSISSCGPAAAQSAGGGTDPATMVNNICTFILGPFGQSLAVLGI
VAIGISWMFGRRSLGLVAGVVGGIVIMFGASFLGQTLTGGS
>Q9S3N1 ~~~virB3~~~Type IV secretion system protein virB3~~~COG3702
MNEDPLFLACTRPAMFAGVTMEAMAFNVMATSILFILTSGFTMIGLGIGLHFVLREITKHDHNQFRVLFAWLNTRGKQKN
LNRWGGGSTSPLRLIRTYEELNR
>Q7CEG1 ~~~virB3~~~Type IV secretion system protein virB3~~~
MTTAPQESNARSAGYRGDPIFKGCTRPAMLFGVPVIPLVIVGGSIVLLSVWISMFILPLIVPIVLVMRQITQTDDQMFRL
LGLKAQFRLIHFNRTGRFWRASAYSPIAFTKRKRES
>Q9R2W4 ~~~virB4~~~Type IV secretion system protein virB4~~~COG3451
MSMMKRESLPEDYIPYIRHINQHVIALNSRCLMTVMVVEGVNFDTADIDQLNSLHNQLNTLLKNIADERVALYSHIIRRR
ETIYPESQFFSSFAATLDEKYKKKMVSQELYRNDLFVSLLWNPASDKTEQLASFFQRLAKAKKTQSEPDQEAIRKIEELS
QDLIEGLESYGARLLSVYAHGGILFSEQSEFLHQLVGGRRERIPLTFGTIASTIYSDRVIFGKETIEIRHESNERFAGMF
GWKEYPSKTRPGMTDGLLTAPFEFILTQSFVFKSKAAASVIMGRKQNQMINAADRASSQIEALDEALDDLESNRFVLGEH
HLSLAVFANHPKALAEYLSKARAHLTNGGAVIAREDLGLEAAWWAQLPGNFSYRARSGAITSRNFAALSPFHSFPIGKLE
GNVWGTAVALLKTQAGSPYYFNFHYGDLGNTFVCGPSGSGKTVIVNFLLAQLQKHNPTMVFFDKDQGAEIFVRAGGGKYK
PLKNGQPTGIAPLKGMEYTEKNKVFLRNWVLKLVTAEGQTVTEEERQDIAKAIDALGNLPHAQRSLSALQLFFDNTSKEG
IAIRLQRWLKGNDLGWVFDNDQDDLNLDSQFIGYDMTDFLDNEEIRRPLMMYLFNRILDLIDGRRIIIVIDEFWKALEDD
SFKAFAQDRLKTIRKQNGMMLFATQSPKDALNSTIAHTIIEQCPTQIFFPNQKANYKDYVEDFKLTEREFELIQSELSRE
SRRFLIKQGQSSVVAELNLRGMNDEIAVLSGTTKNIELVNQIISEYGADPDIWLPIFHQRRENQ
>Q9RPY1 ~~~virB4~~~Type IV secretion system protein virB4~~~
MGAQSKYAQQLNNERSLAPFIPFRSQVGPTTVITRDGDFVRTWRIAGLAFETQDKEELLIRKDQLNTLFRAIASNNVALW
SHNVRRRTWDHLKSFFSNPFCDALDKKYYGSFSGYRMMSNELYLTVIYRPVPAKISRLFNVAVHRSHAEILQEQQLAIRK
LDEIGNQIETSLRRYGGDDGRGIEVLSTYEDKHGALCSQQLEFYNFLLSGEWQKVRVPSCPLDEYLGTGWVYAGTETIEI
RTANATRYARGIDFKDYASHTEPGILNGLMYSDYEYVITQSFSFMTKRDGKEFLTRQKQRLQNTEDGSASQIMEMDIAID
QLGRGDFVMGEYHYSLLVFAEDMETVRHNTSHAMNILQDNGFLATVIATATDAAFYAQLPCNWRYRPRVAGLTSLNFAGL
SCFHNFRAGKRDGNPWGQALTLLKTPSGQPAYLNFHYSKGDEDNFDKKLLGNTRIIGQSGAGKTVLMNFCLAQAQKYLHN
APMGMCNVFFDKDQGAKGTILAIGGKYLAIRNGEPTGFNPFQMEPTAGNILFLEKLVQVLVSRDGQHVTTTDESRISHAI
RTVMRMRPELRRLSTVLQNVTEGSDRQDRENSVAKRLAKWCFDDGTGKRGTFWWVLDCPQDQIDFNTHSNYGFDGTDFLD
NADVRTPISMYLLHRMELAIDGRRFIYWMDEAWKWVDDEAFSEFANNKQLTIRKQNGLGVFATQMPSSLLNSKVASALVQ
QVATEIYLPNPKADYHEYTDGFKVTNEEFDIIRSMSEESRMFLVKQGHHSMICRLELNGFDDELAILSGSSDNNELLDQV
IAEVGDDPSVWLPVFQERRKARIASSKSTGR
>P0A3W0 ~~~virB4~~~Protein virB4~~~
MLGASGTTERSGEIYLPYIGHLSDHIVLLEDGSIMSIARIDGVAFELEEIEMRNARCRAFNTLLRNIADDHVSIYAHLVR
HADVPSSAPRHFRSVFAASLNEAFEQRVLSGQLLRNDHFLTLIVYPQAALGKVKRRFTKLSGKRENDLAGQIRNMEDLWH
VVAGSLKAYGLHRLGIREKQGVLFTEIGEALRLIMTGRFTPVPVVSGSLGASIYTDRVICGKRGLEIRTPKDSYVGSIYS
FREYPAKTRPGMLNALLSLDFPLVLTQSFSFLTRPQAHAKLSLKSSQMLSSGDKAVTQIGKLSEAEDALASNEFVMGSHH
LSLCVYADDLNSLGDRGARARTRMADAGAVVVQEGIGMEAAYWSQLPGNFKWRTRPGAITSRNFAGFVSFENFPEGASSG
HWGTAIARFRTNGGTPFDYIPHEHDVGMTAIFGPIGRGKTTLMMFVLAMLEQSMVDRAGTVVFFDKDRGGELLVRATGGT
YLALRRGTPSGLAPLRGLENTAASHDFLREWIVALIESDGRGGISPEENRRLVRGIHRQLSFDPQMRSIAGLREFLLHGP
AEGAGARLQRWCRGHALGWAFDGEVDEVKLDPSITGFDMTHLLEYEEVCAPAAAYLLHRIGAMIDGRRFVMSCDEFRAYL
LNPKFSAVVDKFLLTVRKNNGMLILATQQPEHVLESPLGASLVAQCMTKIFYPSPTADRSAYIDGLKCTEKEFQAIREDM
TVGSRKFLLKRESGSVICEFDLRDMREYVAVLSGRANTVRFAARLREAQEGNSSGWLSEFMARHHEAED
>P17795 ~~~virB5~~~Protein virB5~~~
MKIMQLVAAAMAVSLLSVGPARAQFVVSDPATEAETLATALETAANLEQTITMVAMLTSAYGVTGLLTSLNQKNQYPSTR
DLDTEMFSPRMPMSTTARAITTDTDRAVVGGDAEADLLRSQITGSANSAGIAADNLETMDKRLTANAETSTQLSRSRNIM
QATVTNGLLLKQIHDAMIQNVQATSLLTMTTAQAGLHEAEEAAAQRKEHQKTAVIFGAVP
>Q9RPY0 ~~~virB5~~~Type IV secretion system protein virB5~~~
MKKIILSFAFALTVTSTAHAQLPVTDAGSIAQNLANHLEQMVKFAQQIEQLKQQFEEQKMQFDALTGYRGLGDILRDPTL
RSYLPHNWRDLYEAVMSGGYLAAAGETANLLRKSQVYDPCASISDKDQRIACEAKVVKPVQDKVMTSKAYDATDKRLQEI
ESLMQEINKTGDPKAIAELQGRIESENAMIQNEDTRLHLYQQMAEAQDKLLDERQHELDAKDNARRGYPQPKALEAAY
>Q9RND2 ~~~virB6~~~Type IV secretion system protein VirB6~~~COG3704
MSDFSFSPFESISGYILQPLNNVMNTTVSGLSSAISAPLNLASIIFIFLYGYNVMTGRVALSMNSLLNNVVKIVIVTTMA
TNADTFNTYVKNIFFGDLANAIGNALNSNPSSANVFDYILLKTSARYQEVLAAAWFLEKIMVGLLGSLMIMAVIVFCIGG
FIVQMFAQVALVMIIGLGPLFISLYLFNATRKFTDAWITTLVNFTILQVLVIMLGTIMCKIILYVLDGTYESIYFLFPPV
VVISIVGAILFRALPGIASALSSGGPYFNAGISSGGQIFTMLSSGAKTGRNAAKSAASTLSGAAGTATKAAKIGGNGRGR
F
>Q9RPX9 ~~~virB6~~~Type IV secretion system protein VirB6~~~
MVNPVIFEFIGTSIHNQLNNYVTMVASNTMNMIATTAVLAGGLYYTAMGILMSVGRIEGPFSQLVISCIKFMLIAAFALN
ISTYSEWVIDTVHNMESGFADAFAGNHGTPSSTIYQTLDNSLGKGWNIAAMLFEKGDNRGLTQIVQGFSELLLSFLVAGS
TLILAGPTGAMIVATNAVIAILLGIGPLFILALGWAPTRGFFDRWFGAIVTSILQVALLSAVLSISSAIFSRMVAAINLA
SATQSTLFSCLSLTAVTIVMPYMMYKVYEYGGILGSSISAATISLGSLAVNTATSGGGAMTSIFSGSSGGGGSGSAKAGG
ESSYSAGGNAMWSPAYRQHVLGQFNRD
>Q7CEG2 ~~~virB7~~~Type IV secretion system putative lipoprotein virB7~~~
MKKVILAFVATAFLAGCTTTGPAVVPVLDGKPRVPVNKSVPAKPPLAQPNPVDTYED
>P0A3W4 ~~~virB7~~~Outer membrane lipoprotein virB7~~~
MKYCLLCLVVALSGCQTNDTIASCKGPIFPLNVGRWQPTPSDLQLRNSGGRYDGA
>P17798 ~~~virB8~~~Protein virB8~~~
MKGSEYALLVARETLAEHYKEVEAFQTARAKSARRLSKVIAAVATIAVLGNVAQAFTIATMVPLIRLVPVYLWIRPDGTV
DSEVSVSRLPATQEEAVVNASLWEYVRLRESYDADTAQYAYDLVSNFSAPMVRQNYQQFFNYPNPTSPQVILGKHGRLEV
EHIASNDVTPGVQQIRYKRTLIVDGKMPMASTWTATVRYEKVTSLPGRLRLTNPGGLVVTSYQTSEDTVSNAGHSEP
>Q6G2B4 ~~~virB8~~~Type IV secretion system protein virB8~~~COG3736
MKHSLRTLWRLRVKINEFNEYIKEARSFDIDRMHGMRQRMRIAMALTVLFGLMTIALALAVAALTPLKTVEPFVIRVDNS
TGIIETVSALKETPNDYDEAITRYFASKYVRAREGFQLSEAEHNFRLVSLLSSPEEQSRFAKWYAGNNPESPQNIYQNMI
ATVTIKSISFLSKDLIQVRYYKTVRELNDKENISHWVSILNFSYINAQISTQDRLINPLGFQVSEYRSDPEVIQ
>Q6FYW3 ~~~virB8~~~Type IV secretion system protein virB8~~~COG3736
MKNSLIKIRKSLVKSDAFDEYVKEARSFDIDRMHSLQQRMRIAMTLTVLFGLMTIALALAVAALTPLKTVEPFVIRVDNS
TGIIETVSALKETPNNYDEAITRYFAGKYVRAREGFQLSEAEYNFRLISLLSSPEEQNRFAKWYSGNNPESPQNIYHNMT
AKVTIKSISFLSKDLIQVRYYKTIRELNGKENISHWVSILNFSYINAHISTEDRLINPLGFQVSEYRSDPEVIK
>Q7CEG3 ~~~virB8~~~Type IV secretion system protein virB8~~~
MFGRKQSPQKSVKNGQGNAPSVYDEALNWEAAHVRLVEKSERRAWKIAGAFGTITVLLGIGIAGMLPLKQHVPYLVRVNA
QTGAPDILTSLDEKSVSYDTVMDKYWLSQYVIARETYDWYTLQKDYETVGMLSSPSEGQSYASQFQGDKALDKQYGSNVR
TSVTIVSIVPNGKGIGTVRFAKTTKRTNETGDGETTHWIATIGYQYVNPSLMSESARLTNPLGFNVTSYRVDPEMGVVQ
>P17799 ~~~virB9~~~Protein virB9~~~
MTKKAFLTLACLLFAAIGARAEDTPTAGRLDPRMRYLAYNPDQVVRLSTAVGATLVVTFGANETVTAVAVSNSKDLAALP
RGNYLFFKASKVLPPQPVVVLTASDAGMRRYVFSISSKTLPHLDKEQADLYYSVQFAYPADDAAARQKAAQEKAVADRIR
AEAQYQQRAEGLLEQPATTVGAEDKNWHYVAQGDRSLLPLEVFDDGFTTVFHFPGNVRIPSIYTINPDGKEAVANYSVKG
SYVEISSVSRGWRLRDGHTVLCIWNTAYDPVGRRPETGTVRPDVKRVLKEVRG
>Q6G2B3 ~~~virB9~~~Type IV secretion system protein virB9~~~COG3504
MMRILKTLFLAFIAAISCYTTPSFAETAPVSARKDNRIRFVNYDPYNVTKIIGSIRSSVQLEFADDEEVTYVGIGNSVAW
QVAPAGHFVFLKPREVQPVTNLQIVTSRQDGTKRSYQFELQVREGDVSAGNDTYFLVKFRYPEDEALRKKLAEAAKAAQR
EENFVNDIFNTHEDFGPRNWAYEAQGSPLIEPASVYDNGKTTTFTFLGNTEIPAIYLVSLDGQEALVPKTIKGNKVIVHA
TAAQFTLRRGNDVLCIFNKRFVPAGVNPETGTTSPSVQRKVNIGNGYE
>Q9RPX6 ~~~virB9~~~Type IV secretion system protein virB9~~~
MKRFLLACILITLASPSWATKIPSGSKYDSRIQYVDYNSGDVVLVRALPGVGARIVFAPGENIEDVASGFTQGWEFKASH
NILYLKARSMTLSHSNQSIDMAPEPGKWDTNLMVTTDQRMYDFDLRLMPGRNNQRVAYRVQFRYPAAAAAAAVAAAQKRV
VQARMNARPSPVNWNYTMQVGTNSASIAPTLAYDDGRFTYLRFPNNRDFPAAFLVAEDKSESIVNSHIDPSAPDILVLHR
VAKQMVLRLGNKVIGIYNESFNPDGVPARDGTTVPGVKRVIKSPGENLQ
>P0A3W6 ~~~virB9~~~Protein virB9~~~
MTRKALFILACLFAAATGAEAEDTPMAGKLDPRMRYLAYNPDQVVRLSTAVGATLVVTFATNETVTSVAVSNSKDLAALP
RGNYLFFKASQVLTPQPVIVLTASDSGMRRYVFSISSKTLSHLDKEQPDLYYSVQFAYPADDAAARRREAQQRAVVDRLH
AEAQYQRKAEDLLDQPVTALGATDSNWHYVAQGDRSLLPLEVFDNGFTTVFHFPGNVRIPSIYTINPDGKEAVANYSVKG
SDVEISSVSRGWRLRDGHTVLCIWNAAYDPVGQRPQTGTVRPDVKRVLKGAKG
>P17800 ~~~~~~Protein virB10~~~
MNDDNQQSAHDVDASGSLVSDTHHRRLSGAQKLIVGGVVLALSLSLIWLGGREKKENGDAPPSTMIATNTKPFHPAPIDV
TLDPPAAQEAVQPTAPPPARSEPERHEPRPEETPIFAYTSGDQGTSKRVQQGETDRRREGNGEDSPLPKVEVSAENDLSI
RMKPTELQPTRATLLPHPDFMVTEGTIIPCILQTAIDTSLAGYVKCVLPWDVRGTTNNVVLLDRGTTVVGEIQRGLQQGD
ARVFVLWDRAETPDHAMISLASPSADELGRSGLPGTVDNHFWQRFSGAMLLSVVQGAFQAASTYAGSSGGGTSFNSVQNN
GEQTADTALKATINIPPTLKKNQGDTVSIFVARDLDFSGIYQLRMAGRAARGRDRRP
>Q6G2B2 ~~~~~~Type IV secretion system protein virB10~~~COG2948
MNDPMDENNLLNDRDMIKDGHGKKQRPNTSKAAALVILFGVCLYLAYSTLFTEKQQPVEVQKEGIIKQTELFRPAPPKPV
SLEPTIEKNNVLLPKVELPTPPKKTTNSDDSLLEAAQRAPVLAYANTQKGQGSTEKNKDISANQPEAKPDETAQRFNHLL
KPTTLEGIRAAKLGNRNYIIAMGASIPCILETAISSDQQGFASCIVSRDILSDNGRVVLLDKGTQIVGEYRAGLKKGQKR
LFVLWNRAKTPNGIIITLASPATDALGRSGMDGDIDNHWLERIGSALLVSIVKDATNYVKGRLPKDQDKNNSETISSGQN
IANIAVENYANIPPTLSKNQGEMVNVFVARDLDFSNVYKLKVIENKKQIVNRALSRNFYKNSAVICNEPKLAHIER
>Q9RPX5 ~~~~~~Type IV secretion system protein virB10~~~
MTQENIPVQPGTLDGERGLPTVNENGSGRTRKVLLFLFVVGFIVVLLLLLVFHMRGNAENNHHSDKTMVQTSTVPMRTFK
LPPPPPPPPPAPPEPPAPPPAPAMPIAEPAAAALSLPPLPDDTPAKDDVLDKSASALMVVTKSSGDTNAQTAGDTVVQTT
NARIQALLDSQKNTKQDAGSLGTLLHGTQTDARMASLLRNRDFLLAKGSIINCALQTRLDSTVPGMAACVVTRNMYSDNG
KVLLIERGSTISGEYDANVKQGMARIYVLWTRVKTPNGVVIDLDSPGADPLGGAGLPGYIDSHFWKRFGGALMLSTIETL
GRYATQKVGGGGSNQINLNTGGGESTSNLASTALKDTINIPPTLYKNQGEEIGIYIARDLDFSSVYDVKPK
>Q9RNC7 ~~~~~~Type IV secretion system protein VirB11~~~COG0630
MNQNLHTLSDETVAIVLTKLEPISTFLKDENLFEIVINRPYQVMTEGVEGWKTIETPALSFNELMGIAKVVASYSKQNIS
EKNPILSATLPGNERIQIVIPPAVEKNTISMTIRKPSSRSFSLEDLANKGLFSVCEQVSFTPLNNYLSHLSELKHIDHDL
VRAYAKKDFVFFLNQAVQCQKNILIAGKTGSGKTTLSKALIAKIPDDERIITIEDTPELVVPQPNYVSMIYSKDGQGLAS
VGPKELLESALRMRPDRILLQELRDGTAFYYIRNVNSGHPGSITTVHASTALAAFEQMTLLVKESEGGGDLERDDIRGLL
ISMIDIIIQCKRIEGKFKVTEIYYDPFKQRNIFGGN
>Q8FXK7 ~~~~~~Type IV secretion system protein VirB11~~~
MMSNRSDFIVPDEAAVKRAASVNFHLEPLRPWLDDPQITEVCVNRPGEVFCERASAWEYYAVPNLDYEHLISLGTATARF
VDQDISDSRPVLSAILPMGERIQIVRPPACEHGTISVTIRKPSFTRRTLEDYAQQGFFKHVRPMSKSLTPFEQELLALKE
AGDYMSFLRRAVQLERVIVVAGETGSGKTTLMKALMQEIPFDQRLITIEDVPELFLPDHPNHVHLFYPSEAKEEENAPVT
AATLLRSCLRMKPTRILLAELRGGEAYDFINVAASGHGGSITSCHAGSCELTFERLALMVLQNRQGRQLPYEIIRRLLYL
VVDVVVHVHNGVHDGTGRHISEVWYDPNTKRALSLQHSEKT
>P0A247 ~~~virB~~~Virulence regulon transcriptional activator VirB~~~
MVDLCNDLLSIKEGQKKEFTLHSGNKVSFIKAKIPHKRIQDLTFVNQKTNVRDQESLTEESLADIIKTIKLQQFFPVIGR
EIDGRIEILDGTRRRASAIYAGADLEVLYSKEYISTLDARKLANDIQTAKEHSIRELGIGLNFLKVSGMSYKDIAKKENL
SRAKVTRAFQAASVPQEIISLFPIASELNFNDYKILFNYYKGLEKANESLSSTLPILKEEIKDLDTNLPPDIYKKEILNI
IKKSKNRKQNPSLKVDSLFISKDKRTYIKRKENKTNRTLIFTLSKINKTVQREIDEAIRDIISRHLSSS
>P07165 ~~~virC1~~~Protein virC1~~~
MKLLTFCSFKGGAGKTTALMGLCAAFASDGKRLALFDADENRPLTRWKENALRSNTWGSFCEVYAAEEMALLEAAYEDAE
LQGFDYALADTHGGSSELNNTIIASSNLLLIPTMLTPLDIDEALSTYRYVIELLLSENLAIPTAVLRQRVPVGRLTTSQR
AMSDMLASLPVVQSPMHERDAFAAMKERGMLHLTLLNMRTDPTMRLLERNLRIAMEELVTISKLVSEALEG
>P07166 ~~~virC2~~~Protein virC2~~~
MGIRKPALSVGEARRLAAARPEIVHPSLPVATQNSTLPQPPENLDEEDRRPAPATAKRCHSSDQQSMLTVDALSSTTAPE
KIQVFLSARPPAPEVSKIYDNLILQYSPSKSLQMILRRALGDFENMLADGSFRAAPKSYPIPHTAFEKSIIVQTSRMFPV
SLIEAARNHFDPLGLETARAFGHKLATAALACFFAREKATNS
>P18591 3.1.-.-~~~virD1~~~T-DNA border endonuclease VirD1~~~
MSQGSRPTSSDIAVNQRECVKVEGFKVVSTRLRSAEYESFSHQARLLGLSDSMAIRVAVRRIGGFLEIDAETRHRMEAIL
QSIGTLSSNIAALLSAYAENPTMDLEALRAERIAFGKSFADLDGLLRSILSVSRRRIDGCSLLKDAL
>P18592 3.1.-.-~~~virD2~~~T-DNA border endonuclease VirD2~~~
MPDRAQVIIRIVPGGGTKTLQQIINQLEYLSRKGRLELQRSARHLDIPLPPDQIHELARSWVQETGTYDESQPDEERQQE
LTTHIIVSFPAGTSQVAAYAASREWAAEMFGSGAGGGRYNYLTAFHIDRDHPHLHVVVNRRELLGHGWLKISRRHPQLNY
DALRIKMAEISLRHGIALDASRRAERGITERPITYAQYRRLEREQARQIRFEDADLEQSSPQGDHPEFSQPFDTSPFEAS
AGGPEDMPRPNNRQNESQVHLQEPAGVSNEAGVLVRVALETERLAQPFVSETILADDIGSGSSRVAEGRVESANRTPDIP
RAATEAATHTTHDRQRRAKRPHDDDGGPSGAKRVTLEGIAVGPQANAGEQDGSSGPLVRQAGTSRPSPPTATTRASTATD
SLSATAHLQQRRGVLSKRPREDDDGEPSERKRERDERSKDGRGGNRR
>P06668 3.1.-.-~~~virD2~~~T-DNA border endonuclease VirD2~~~
MPDRAQVIIRIVPGGGTKTLQQIINQLEYLSRKGKLELQRSARHLDIPVPPDQIRELAQSWVTEAGIYDESQSDDDRQQD
LTTHIIVSFPAGTDQTAAYEASREWAAEMFGSGYGGGRYNYLTAYHVDRDHPHLHVVVNRRELLGHGWLKISRRHPQLNY
DGLRKKMAEISLRHGIVLDATSRAERGIAERPITYAEHRRLERMQAQKIQFEDTDFDETSPEEDRRDLSQSFDPFRSDPS
TGEPDRATRHDKQPLEQHARFQESAGSSIKADARIRVSLESERSAQPSASKIPVIGHFGIETSYVAEASVRKRSGIFGTS
RPVTDVAMHTVKRQQRSKRRNDEEAGPSGANRKGLKAAQVDSEANVGEQDTRDDSNKAADPVSASIGTEQPEASPKRPRD
RHDGELGGRKRARGNRRDDGRGGT
>P18594 ~~~virD4~~~Protein VirD4~~~
MNSSKTTPQRLAVSIVCSLAAGFCAASLYVTFRHGFNGEAMMTFSVFAFWYETPLYMGHATPVFYCGLAIVVSTSIVVLL
SQLIISFRNHEHHGTARWAGFGEMRHAGYLQRYNRIKGPIFGKTCGPRWFGSYLTNGEQPHSLVVAPTRAGKGVGVVIPT
LLTFKGSVIALDVKGELFELTSRARKAGGDAVFKFSPLDPERRTHCYNPVLDIAALPPERQFTETRRLAANLITAKGKGA
EGFIDGARDLFVAGILTCIERGTPTIGAVYDLFAQPGEKYKLFAHLAEESRNKEAQRIFDNMAGNDTKILTSYTSVLGDG
GLNLWADPLVKAATSRSDFSVYDLRRKRTCVYLCVSPNDLEVVAPLMRLLFQQVVSILQRSLPGKDERHEVLFLLDEFKH
LGKLEAIETAITTIAGYKGRFMFIIQSLSALTGIYDDAGKQNFLSNTGVQVFMATADDETPTYISKAIGDYTFKARSTSY
SQARMFDHNIQISDQGAPLLRPEQVRLLDDNNEIVLIKGHPPLKLRKVRYYSDRMLRRLFECQIGALPEPASLMLSEGVH
RDGQDLSQQAAVTEAQGLGDIDSIPNNMEAATPQNSEMDDEQDSLPTGIDVPQGLIESDEVKEDAGGVVPDFGVSAEMAP
AMIAQQQLLEQIIALQQRYGPASSHSVK
>P08063 ~~~virE1~~~Protein virE1~~~
MVIIKLNANKNMPVLAVEKPQEIHKEELSDHHQSNGFTSLDLEMIELENFVLHCPLPEENLAG
>P08062 ~~~virE2~~~Single-strand DNA-binding protein~~~
MDPKAEGNGENITETAAGNVETSDFVNLKRQKREGVNSTGMSEIDMTGSQETPEHNMHGSPTHTDDLGPRLDADMLDSQS
SHVSSSAQGNRSEVENELSNLFAKMALPGHDRRTDEYILVRQTGQDKFAGTTKCNLDHLPTKAEFNASCRLYRDGVGNYY
PPPLAFERIDIPEQLAAQLHNLEPREQSKQCFQYKLEVWNRAHAEMGITGTDIFYQTDKNIKLDRNYKLRPEDRYIQTEK
YGRREIQKRYEHQFQAGSLLPDILIKTPQNDIHFSYRFAGDAYANKRFEEFERAIKTKYGSDTEIKLKSKSGIMHDSKYL
ESWERGSADIRFAEFAGENRAHNKQFPAATVNMGRQPDGQGGMTRDRHVSVDYLLQNLPNSPWTQALKEGKLWDRVQVLA
RDGNRYMSPSRLEYSDPEHFTQLMDQVGLPVSMGRQSHANSVKFEQFDRQAAVIVADGPNLREVPDLSPEKLQQLSQKDV
LIADRNEKGQRTGTYTNVVEYERLMMKLPSDAAQLLAEPSDRYSRAFVRPEPALPPISDSRRTYESRPRGPTVNSL
>P0A3W8 ~~~virE2~~~Single-strand DNA-binding protein~~~
MDLSGNEKSRPWKKANVSSSTISDIQMTNGENLESGSPTRTEVLSPRLDDGSVDSSSSLYSGSEHGNQAEIQKELSALFS
NMSLPGNDRRPDEYILVRQTGQDAFTGIAKGNLDHMPTKAEFNACCRLYRDGAGNYYPPPLAFDKISVPAQLEETWGMME
AKERNKLRFQYKLDVWNHAHADMGITGTEIFYQTDKNIKLDRNYKLRPEDRYVQTERYGRREIQKRYQHELQAGSLLPDI
MIKTPKNDIHFVYRFAGDNYANKQFSEFEHTVKRRYGGETEIKLKSKSGIMHDSKYLESWERGSADIRFAEFVGENRAHN
RQFPTATVNMGQQPDGQGGLTRDRHVSVEFLMQSAPNSPWAQALKKGELWDRVQLLARDGNRYLSPHRLEYSDPEHFTEL
MNRVGLPASMGRQSHAASIKFEKFDAQAAVIVINGPELRDIHDLSPENLQNVSTKDVIVADRNENGQRTGTYTSVAEYER
LQLRLPADAAGVLGEAADKYSRDFVRPEPASRPISDSRRIYESRPRSQSVNSF
>P15597 ~~~virF~~~Virulence protein F~~~
MRNSSLRDASGSNDAQVPHKTELLNLPDHVLTEVAKRLATNNPVESAENIANFSKSHRFTRDAVRTEPLEKFSSRLKILS
RNAKLLSHAVRHAATLPDGEQLSEAQLSQMRSEVATRPVLGVAYTHQDGQPEERLSGNHLDHKINNIPNLVFNVAEPIMF
NEISALEVMAEVRPIARSIKTAHDDARAELMSADRPRSTRGL
>P0A2T1 ~~~virF~~~Virulence regulon transcriptional activator VirF~~~
MMDMGHKNKIDIKVRLHNYIILYAKRCSMTVSSGNETLTIDEGQIAFIERNIQINVSIKKSDSINPFEIISLDRNLLLSI
IRIMEPIYSFQHSYSEEKRGLNKKIFLLSEEEVSIDLFKSIKEMPFGKRKIYSLACLLSAVSDEEALYTSISIASSLSFS
DQIRKIVEKNIEKRWRLSDISNNLNLSEIAVRKRLESEKLTFQQILLDIRMHHAAKLLLNSQSYINDVSRLIGISSPSYF
IRKFNEYYGITPKKFYLYHKKF
>P0C2V5 ~~~virF~~~Virulence regulon transcriptional activator VirF~~~
MASLEIIKLEWATPIFKVVEHSQDGLYILLQGQISWQNSSQTYDLDEGNMLFLRRGSYAVRCGTKEPCQLLWIPLPGSFL
STFLHRFGSLLSEIRRDNATPKPLLIFNISPILSQSIQNLCAILERSDFPSVLTQLRIEELLLLLAFSSQGALFLSALRH
LGNRPEERLQKFMEENYLQGWKLSKFAREFGMGLTTFKELFGTVYGISPRAWISERRILYAHQLLLNGKMSIVDIAMEAG
FSSQSYFTQSYRRRFGCTPSQARLTKIATTG
>P9WMJ3 ~~~virS~~~HTH-type transcriptional regulator VirS~~~COG2207
MELGSLIRATNLWGYTDLMRELGADPLPFLRRFDIPPGIEHQEDAFMSLAGFVRMLEASAAELDCPDFGLRLARWQGLGI
LGPVAVIARNAATLFGGLEAIGRYLYVHSPALTLTVSSTTARSNVRFGYEVTEPGIPYPLQGYELSMANAARMIRLLGGP
QARARVFSFRHAQLGTDAAYREALGCTVRFGRTWCGFEVDHRLAGRPIDHADPETKRIATKYLESQYLPSDATLSERVVG
LARRLLPTGQCSAEAIADQLDMHPRTLQRRLAAEGLRCHDLIERERRAQAARYLAQPGLYLSQIAVLLGYSEQSALNRSC
RRWFGMTPRQYRAYGGVSGR
>A3UNN4 2.4.2.31~~~~~~Putative NAD(+)--arginine ADP-ribosyltransferase Vis~~~
MNTRFLLLLCCLSFTTFSQPFDAIKQPNRSEEEVTQLAEDFKDWSKASNGWRYSFITANEKEAVEDFSISGYQTANDYLR
ATDTSTWGVAGADARQYIRTVKSALNKLPKYKGTAYRGTWVKLSLLNKLEEGDVLVEPAFTSTSTLPEVAKRFSVVHPNS
PQRLKRVLFEVKINQGGHTIAGLSEYSKEAEVLFAPNAHFRITQIERTSNHTYIGVETVKASAVKNTQKYNLYSGEEVEA
SFWHSLVCT
>A5F661 ~~~viuA~~~Ferric vibriobactin receptor ViuA~~~COG1629
MAVLCPARVSVAENKKFKLHTLSAMMMGLFTGSFAYAETQNTSNQEQEMPVLVVIGEKTQRSIYETSASVEVFDQDTIER
TPGATEIDDLLQLIPNLVDSGQSNNMPTIRGIDGSGPSVGGLASFAGTSPRLNMSIDGRSLTYSEIAFGPRSLWDMQQVE
IYLGPQSYIQGRNTSAGAIVMKSNDPTHHFESAVKAGIGESDYSQTAGMISAPIIQDELAFRLSFDQQKRDSFVDLAAFE
PAGDPKKIEMNSVRGKLLYEPSALDGFKTTLTLSHMDSRGPQTENINVAGNEAFRPVYETASFTTAWDIIWHLNDLFTFE
NNLVYADFSYDRYTNPNSRGDFNTDGKEFHIEPLLRYIALDGSVNTLIGARYYQSSQDDMYIDAASAYPMDGRTKAKSVF
AEVTYALTPSINVNLAGRFEREQVKRNVSHPRYKLDYDETSSVFLPKLDVAYTPVQGQTYGIKAAKGYNASGAGLAFNSM
QFTGFRPYEFEQESIWNYEFYTRHRFSHSVEVLTNLFYNDFDSMQMTQTTSSGDVFIANLDEASTYGAEIGSRWYATSSL
ELFANLGLLKTEFKETTGNTKELPRAPKMSANVGLLYDFGQGFEFSSNAAYTGSYFSESGNSEKFAIDSYWVANAQLAYV
FEHGRATLYATNLLDSDKTTLYLSTNNTLDQLKQQPRMIGASVQLNF
>A5F660 1.16.1.7~~~viuB~~~Ferric vibriobactin reductase ViuB~~~COG2375
MSNEVERVYPRLLDFVRKKYVSKNLLRVTLTGEDLIGFPEDQNGSHIKVFFPNQASGILQLPIREGDKVIWPEHKPVPRA
YTVRQYRAQSNELDIDFVVHGEGTPGGGWALKAQTGSQLGLIGPGGPDPLIEPADWHIMAGDLSAVPAISAILEKMPSQA
KGYVFLEVDDIEDKHDISHPEQMVIKWLVRDPNQAQPVLAMAIEQLPVPQGAESLSAFVAGENESVIACRKILRNEYRIA
RDKIYAIPYWKRGKNEEAYHEERHVVMDEEF
>Q2YJ50 ~~~vjbR~~~HTH-type quorum sensing-dependent transcriptional regulator VjbR~~~
MSLDLVHFPNYKKTFFGSSFQSDTLALLTRIRDEIGCRYVTHTYRGRVGDCTKVNSADLTVLMTLPATWVARYSSKNYFA
IDPVFQEDAPYYRNDTSAIARDLKEDADICPAVAELLHDAEKHGLGNLFIAVSARNPKGVAGCTVFTFEVEDEDRTQFLA
RMRPRLLSLAGIIHGTVCGCKDANSVASLLTPREVDCLRWAANGKTDGEIAEILSIARWTVVTYLQNAKIKLNCSNRTSA
VATALSLGIIDMPEVQHLV
>Q8YAY5 ~~~vjbR~~~HTH-type quorum sensing-dependent transcriptional regulator VjbR~~~COG2771
MSLDLVHFPNYKKTFFGSSFQSDTLALLTRIRDEIGCRYVTHTYRGRVGDCTKVNSADLTVLMTLPATWVARYSSKNYFA
IDPVFQEDAPYYRNDTSAIARDLKEDADICPAVAELLHDAEKHGLGNLFIAVSARNPKGVAGCTVFTFEVEDEDRTQFLA
RMRPRLLSLAGIIHGTVCGCKDANSVASLLTPREVDCLRWAANGKTDGEIAEILSIARWTVVTYLQNAKIKLNCSNRTSA
VATALSLGIIDMPEVQHLV
>Q2JJF6 1.17.4.-~~~~~~Vitamin K epoxide reductase homolog~~~COG4243
MASYLKLKAQEETWLQRHSRLILAILAGLGSLLTAYLTYTKLTEQPAAFCTGDGGCDLVLSSRWAEFLGIPTAAVGLLGF
LGVLALAVLPDGLPLVKRWRWPALFGLVSAMTAFEMYMLYLMVAVLRQFCMYCTTAIILVAGLGLVTVLGHRWLDGGKLA
FSYILVAFLTLVTTIGVYANQVPPPSPLAVGLAAHLRQIGGTMYGAYWCPHCQDQKELFGAAFDQVPYVECSPNGPGTPQ
AQECTEAGITSYPTWIINGRTYTGVRSLEALAVASGYPLEEGR
>Q15JG4 2.7.7.91~~~vldB~~~Valienol-1-phosphate guanylyltransferase~~~
MDGVRAVLLAGGEGRRMGPLGRGRLKPLVPFGGTSRLIDFSIANVHRSGLRDVLLLSQYEERRLMDDLHLVWNGRHRGFR
IDFGPYDAVYRRSPGKLPEQLPERIWPLERGTADALLTKAEYVFRQGDAEASEILVLHADHVYRFDYGDMIREHRASKAA
LTVSYQRIERRYVHLFGMVEFDGDGLLTAFEEKPDDPTSDLVFAAFCLFDAATLRRYLEQLRGTDWQHDISRDVIPAMLA
GGELIRGYEVKSYWEDIGTVDRYHRAHRGLLRADPTLALSDMPLTVAPEVPRHLVPGGPGRRASVVAADVANEGEIVSSV
VYPGARIGVDAHVVDCVVLPGARVPDGTHLASAIVLEDGSVQQCEAEREEVAL
>Q15JG1 2.5.1.135~~~vldE~~~Validamine 7-phosphate valienyltransferase~~~
MTGSEIFLASKRAAITYDTDPATGEPRAWLAPGGTGNVVAEQAGVLNISWIASADSEDDRRASALNPDGVTMELHSGREI
LVRLIRHDPAVFRNVQNFMTANLMWAANNYGWDRWTQPSFGSDAREGWADFGRFTRDFADAILKSSAQSADPVYLVHDYQ
LVGVPALLREQRPDAPILLFVHIPWPSADYWRILPKEIRTGILHGMLPATTIGFFADRWCRNFLESVADLLPDARIDREA
MTVEWRGHRTRLRTMPLGYSPLTLDGRNPQLPEGIEEWADGHRLVVHSGRTDPIKNAERAVRAFVLAARGGGLEKTRMLV
RMNPNRLYVPANADYVHRVETAVAEANAELGSDTVRIDNDNDVNHTIACFRRADLLIFNSTVDGQNLSTFEAPLVNERDA
DVILSETCGAAEVLGEYCRSVNPFDLVEQAEAISAALAAGPRQRAEAAARRRDAARPWTLEAWVQAQLDGLAADHAARTA
TAERFDTAPAVSTRADL
>Q15JF8 3.1.3.101~~~vldH~~~Validoxylamine A 7'-phosphate phosphatase~~~
MYKVALFDLDGTLINSEHKNREAWARLFRRHGVPYDDSVLRSFTGRPAKEAMADHVASFAGHSVDELCAEVAAYAALPDM
PAAVTVDGAMELLHQLQQMRVPLGVVTSGPRDYAESALTTLGALQLLDVLITADDVSRGKPDPEGYSTACSALNVEPSQA
IVFEDAPAGILAAKRAGIFCVGLTTTHDAEALAEADVLLKDLTEVRWPHIGPS
>Q15JG7 1.14.11.52~~~vldW~~~Validamycin A dioxygenase~~~
MTGSVPIVDLEAWRAADEENRASLAEIIDGALHTVGTFLLAGHGVPAELTARMRTAGRSFFDLPWEKKEPHAVQRPHDNG
WRGLVKHRTDTIEGTGGAPDLHEAFHMGPTHRTGDDAFDALYYPANKWPAELPELRETALAYTAHMTRVAGAVMEMLAGV
LGLEPAFFTSRCEHATWTQSVNWYPSLDTVGQTAEGQMRVGPHTDFGTITLLDRQQGVSGLEVWSEEDGWFAPPFVEGTL
LVNLGDLMHQWTDGRWRSLRHRVLAPSASAPQEELVSLVYFFDADPEAELVPLAAPVGGGAGMPTVNVGETILKKNIQML
TDLKGHGLFQGELSLSRPGSADSPGSSPADDHPSRPGRHPAQGPQ
>P96072 1.14.14.30~~~vlmH~~~Isobutylamine N-hydroxylase~~~
MRSLDAARDTCERLHPGLIKALEELPLLEREAEGSPVLDIFRAHGGAGLLVPSAYGGHGADALDAVRVTRALGACSPSLA
AAATMHNFTAAMLFALTDRVIPPTDEQKKLLARVAPEGMLLASGWAEGRTQQDILNPSVKATPVDDGFILNGSKKPCSLS
RSMDILTASVILPDETGQQSLAVPLIMADSPGISVHPFWESPVLAGSQSNEVRLKDVHVPEKLIIRGTPDDPGRLDDLQT
ATFVWFELLITSAYVGAASALTELVMERDRGSVTDRAALGIQLESAVGLTEGVARAVRDGVFGEEAVAAALTARFAVQKT
LAAISDQAIELLGGIAFIKSPELAYLSSALHPLAFHPPGRTSSSPHLVEYFSGGPLEI
>O34138 1.5.1.38~~~vlmR~~~NADPH-flavin oxidoreductase~~~
MTPSAAATGHEAADEQRLRELRGLTRQLPTGVAVVTAQDGEVAHGATVSTVSVLSQQPLRIGVSLRRGSYLTGLIRQRRV
FALNVLSSRQSAVADWFANPERPRGWRQFDYVRWTAHPKAGMPVLEDALAQLHCRLTDLIPLGASDDLLVAEVLDGRGRN
GRPLVNFNGRLHDVEFRGVVRVSRDQPSAVTSLE
>P21875 ~~~~~~Variable large protein 21~~~
MRKRISAIINKLNISIMMMIVVLMIGCGQQPEAGKTGAAGGEKQGAGSLSEVLMEVGKSAENAFYSFLELVSDTLGFTAK
STTKKEDVGGYFNSLGGKLGEASNELEQVAKNSEAGIEKNDASKNPIRSAVNAAKKTLEALKGYLDSLGTVGDSNPVGWA
SNNAQGAAVDEAELKKAYKALKGIMDTAEGAGVARPEVGNIAVKVGNGTDNKDGAKILATDGAAAVGDAGKAAAILTTVS
GKEMLASIVNSTEDKAVKITGNVTVETTPLEFAVGGNGAHLSQNANSKAAAVAGGIALRSLVKGGKLAADNNDDDKASQG
VGITAANKLLVAVEDIIKKTVKNVLEKAKGEIDKARDPKPAGQQ
>P21876 ~~~vlp7~~~Variable large protein 7~~~
MRKRISAIINKLNISIIIMTVVLMIGCGQQPEAGKTGVSGGVNGNLGNSLMELGRSAENAFYAFIELVSDVLGFTAKSDT
TKQEVGGYFNSLGAKLGEASNDLEQVAVKAETGVDKSDSSKNPIREAVNEAKEVLGTLKGYVESLGTIGDSNPVGYANNA
AGSGTTAADDELRKAFKALQEIVKAATDAGVKALKIGATTLQANGGADNKEGAKILATSGGNPAAADVAKAAAILSSVSG
EEMLSSIVKSGENDAQLAAAADGNTSAISFAKGGSDAHLAGANTPKAAAVAGGIALRSLVKTGKLAAGAADNATGGGKEV
QGVGVAAANKLLRAVEDVIKKTVKNVLEKAKEKIDKARGSQEPVSESSK
>P39115 ~~~vmlR~~~Ribosome protection protein VmlR~~~COG0488
MKEIVTLTNVSYEVKDQTVFKHVNASVQQGDIIGIIGKNGAGKSTLLHLIHNDLAPAQGQILRKDIKLALVEQETAAYSF
ADQTPAEKKLLEKWHVPLRDFHQLSGGEKLKARLAKGLSEDADLLLLDEPTNHLDEKSLQFLIQQLKHYNGTVILVSHDR
YFLDEAATKIWSLEDQTLIEFKGNYSGYMKFREKKRLTQQREYEKQQKMVERIEAQMNGLASWSEKAHAQSTKKEGFKEY
HRVKAKRTDAQIKSKQKRLEKELEKAKAEPVTPEYTVRFSIDTTHKTGKRFLEVQNVTKAFGERTLFKNANFTIQHGEKV
AIIGPNGSGKTTLLNIILGQETAEGSVWVSPSANIGYLTQEVFDLPLEQTPEELFENETFKARGHVQNLMRHLGFTAAQW
TEPIKHMSMGERVKIKLMAYILEEKDVLILDEPTNHLDLPSREQLEETLSQYSGTLLAVSHDRYFLEKTTNSKLVISNNG
IEKQLNDVPSERNEREELRLKLETERQEVLGKLSFMTPNDKGYKELDQAFNELTKRIKELDHQDKKD
>P12627 ~~~vnfA~~~Nitrogen fixation protein VnfA~~~
MSSLPQYCECGLGECRTDVLPLLYEMSQIATESGDLSSIISILLRLMKRHMKVVRGMVTLYDRDSGSIVLHESFGLSPEE
AGKGVYLLGEGIIGRVVETGQSIVVPCIRDEPAFLNRTGSRDRDSDDANLSFICVPILRGRQVMGTISAERLYDNAELLK
LDVEVLSILATTTAQAVELYLVENVENVALEAENRRLRSALGERFKPANIIGNSKPMLEVYQLIERVVRTRTTVLILGES
GVGKELVAGAIHYNSPAAKGPFVKFNCASLPESVIESELFGHEKGSFTGAIGLRKGRFEEAAGGTIFLDEVGEMSLTTQA
KLLRVLQERSFERVGGNTTIHVDLRVIAATNRNLAEMVADGTFAEDLYYRLNVFPITIPPLRERGSDIITLADHFVSRFS
REMGIEVNRISTPRLNMLQSYQWPGNVRELENVIERAMLLSEDGVIHGYHLPPSLQAPVVGDSEAPPDGLEARLGAIEYE
LIVEALKLHHGNMTEAATHLGLTARVLGLRMGKYNLNYKDYR
>P15332 1.18.6.1~~~vnfD~~~Nitrogenase vanadium-iron protein alpha chain~~~
MPMVLLECDKDIPERQKHIYLKAPNEDTREFLPIANAATIPGTLSERGCLLRRKLVIGGVLKDTIQMIHGPLGCAYDTWH
TKRYPTDNGHFNMKYVWSTDMKESHVVFGGEKRLEQRMHEAFDEMPDIKRMIVYTTCPTALIGDDIKAVAKKVMKERPDV
DVFTVECPGFSGVSQSKGHHVLNIGWINEKVETMEKEITSEYTMNFIGDFNIQGDTQLLQTYWDRLGIQVVAHFTGNGTY
DDLRCMHQAQLNVVNCARSSGYIANELKKRYGIPRLDIDSWGFSYMAEGIRKICAFFGIEEKGERLIAEEYAKWKPKLDW
YKERLQGKKMAIWTGGPRLWHWTKSVEDDLGIQVVAMSSKFGHEEDFEKVIARGKEGTYYIDDGNELEFFEIIDLVKPDV
IFTGPRVGELVKKLHIPYVNGHGYHNGPYMGFEGFVNLARDTYNAVHNPLRHLAAVDIRDSSQTTPVIVRGAA
>P16855 1.18.6.1~~~vnfD~~~Nitrogenase vanadium-iron protein alpha chain~~~
MPMVLLECDKDIPERQKHIYLKAPNEDTREFLPIANAATIPGTLSERGCAFCGAKLVIGGVLKDTIQMIHGPLGCAYDTW
HTKRYPTDNGHFNMKYVWSTDMKESHVVFGGEKRLEKSMHEAFDEMPDIKRMIVYTTCPTALIGDDIKAVAKKVMKDRPD
VDVFTVECPGFSGVSQSKGHHVLNIGWINEKVETMEKEITSEYTMNFIGDFNIQGDTQLLQTYWDRLGIQVVAHFTGNGT
YDDLRCMHQAQLNVVNCARSSGYIANELKKRYGIPRLDIDSWGFNYMAEGIRKICAFFGIEEKGEELIAEEYAKWKPKLD
WYKERLQGKKMAIWTGGPRLWHWTKSVEDDLGVQVVAMSSKFGHEEDFEKVIARGKEGTYYIDDGNELEFFEIIDLVKPD
VIFTGPRVGELVKKLHIPYVNGHGYHNGPYMGFEGFVNLARDMYNAVHNPLRHLAAVDIRDKSQTTPVIVRGAA
>P15333 1.18.6.1~~~vnfG~~~Nitrogenase vanadium-iron protein delta chain~~~
MSQSHLDDLFDYTEERCLWQFFSRTWDREENIEGVLGQVARLLTGQEPLRGTPQERLFYADALAMANDVRERFPWASQIN
HEEIHFLIDGLKSRLVDTVIQSSTNRELNHHLY
>P16857 1.18.6.1~~~vnfG~~~Nitrogenase vanadium-iron protein delta chain~~~
MSQSHLDDLFAYVEERCLWQFFSRTWDREENIEGVLNQVGRLLTGQEPLRGTPQERLFYADALAMANDVRERFPWASQVN
KEEIEFLLDGLKSRLVDVTITRSTNRELNHHLY
>P15334 1.18.6.1~~~vnfK~~~Nitrogenase vanadium-iron protein beta chain~~~
MSNCELTVLKPAEVKLVKREREGIINPMYDCQPAGAQYAGIGVKDCIPLVHGGQGCTMFVRLLFAQHFKENFDVASTSLH
EESAVFGGAKRVEEGVLVLARRYPELRLIPIITTCSTEVIGDDIEGTINVCNRALAAEFPERKIYLAPVHTPSFKGSHVT
GYAECVKSMFKTITEVHGKGQPSGKLNVFPGWVNPGDVVLLKRYFKEMGVDATVFMDTEDFDSPMLPNKSIETHGRTTVE
DIADSANALATLALARYEGATTGEYLEKTFAVPNSLVNTPYGIKNTDDMLRKIAEITGKEIPESLVREPRIAWIALADLA
HMFFANKKVAIFGHPDLVLGLAQFCLEVELEPVLLLIGDDQGSKYKKDPRLQELKDAAHFDMEIVHNADLWELEKRINDG
LQLDLIMGHSKGRYVAIEANIPMVRVGFPTFDRAGLYRKPSIGYQGAMELGEMIANAMFAHMEYTRNKEWILNTW
>P16856 1.18.6.1~~~vnfK~~~Nitrogenase vanadium-iron protein beta chain~~~
MSNCELTVLKPAEVKLSPRDREGIINPMYDCQPAGAQYAGIGIKDCIPLVHGGQGCTMFVRLLFAQHFKENFDVASTSLH
EESAVFGGAKRVEEGVLVLARRYPNLRVIPIITTCSTEVIGDDIEGSIRVCNRALEAEFPDRKIYLAPVHTPSFKGSHVT
GYAECVKSVFKTITDAHGKGQPSGKLNVFPGWVNPGDVVLLKRYFKEMDVEANIYMDTEDFDSPMLPNKSIETHGRTTVE
DIADSANALATLSLARYEGNTTGELLQKTFAVPNALVNTPYGIKNTDDMLRKIAEVTGKEIPESLVRERGIALDALADLA
HMFFANKKVAIFGHPDLVLGLAQFCMEVELEPVLLLIGDDQGNKYKKDPRIEELKNTAHFDIEIVHNADLWELEKRINAG
LQLDLIMGHSKGRYVAIEANIPMVRVGFPTFDRAGLYRKPSIGYQGAMELGEMIANAMFAHMEYTRNKEWILNTW
>Q9KL83 ~~~volA~~~Lysophospholipase VolA~~~COG2267
MKQVIKLSLLCSALWLAGCGDETNSSGASTEVVYESYIQQALQRDTTIKFALSGKDANVPLPSFALMNAKDGTLEIPPGS
NTSGSNPLVAMGQVDGWPITMPLFLDFKGAGLADNIITSGIYLYELTDSMTGSPSIKALLTNGVDYTAVSSAASDKILIM
PTKALNASSEYILAVTSEVSDANGNPVGTSASYAALKSKNKIYSEGDIATLQKVTQGVEKIFQLSGVDETQIVYSTWFST
QSVSNTLFATRGATASAFASGSNQLETVWKQTGLGLDTAYTIQLGTPVDFAAALTADDNFSTYVGADKKTAILGTYTANT
VDVTKGTVRLPYYLETGSNWNTQPFESAMPSLAKIKAALADSKEQLTIGSQLLAAGIDTTKLATDASEQLKLMGLTLTKS
DGTALDPERYITRYSPVPKVKSVQDVPFLLFTPAGAAPTDIVIYQHGVTTAKENAYAFAKNLTAVGLAVIAIDLPLHGER
SLDSTRSANSDPLAYINLTYLAVARDNLRQSILDVLGLRAALTLSQPLFTGTRLSGINVGTGSKVRMLGHSLGGIVGTSA
IAESNKTLGSTAADAMYSFSGAAIQNSGGQISNLLLGSAFFGPKIKHNVALSASTEYKGFADAQCASLDDSACYNLFTSL
ATQEQLAQVTSGFQMFSYAAQTLLDTIDPYSVVSTKLNNGGLTTPLYFSEVDGDSVVPNKVSNPTGSLVYLSPQFAGTEP
LATLLGLTTVNAGQTAPNATKSFVQFNSTAKHSTFVAPQDAGYADLAHHTEMQTETADFLADDSLGAVSNSNSVLK
>Q87P32 2.7.7.108~~~vopS~~~Protein adenylyltransferase VopS~~~COG3177
MISFGNVSALQAAMPQARNEILNEGKLSIGGKEYTINAATQEFTRANPTSGAVARFFEATGKLFREGSTQSVAKAITKAV
FDNEQGQAQRLQTSSSVEHGQMLFKDANLKTPSDVLNAFAKLDSKMVKSHAAELSQLAERAMTEVMLETDSGKNLKALIG
DDAVKSLAVRVVKDYGGGVAAAQKNPEVRINQMQAVFDMEVMHLKAAQRHIEGLASTDLNQGVYAEGLPEDAFNKAGVTN
NVERAAAWIINASNSKGNDAENITSLLKEYATNGKDLLNMDNLKELHARLVPNVERDYRGPNISGGTLPSSIGGEGMLKQ
HIEGFLKENPVADKDLGKHLFAGVIGYHGFTDGNGRMGRMLYAIAELRNDSFNPLAMNAENSLHGIK
>P9WLZ1 ~~~~~~Putative antitoxin VapB10~~~
MKRTNIYLDEEQTASLDKLAAQEGVSRAELIRLLLNRALTTAGDDLASDLQAINDSFGTLRHLDPPVRRSGGREQHLAQV
WRATS
>P9WLU3 ~~~~~~Antitoxin VapB11~~~COG5450
MYRWCMSRTNIDIDDELAAEVMRRFGLTTKRAAVDLALRRLVGSPLSREFLLGLEGVGWEGDLDDLRSDRPD
>P9WJ53 ~~~~~~Putative antitoxin VapB12~~~
MSAMVQIRNVPDELLHELKARAAAQRMSLSDFLLARLAEIAEEPALDDVLDRLAALPRRDLGASAAELVDEARSE
>P9WLM7 ~~~~~~Antitoxin VapB15~~~COG5450
MYSGVVSRTNIEIDDELVAAAQRMYRLDSKRSAVDLALRRLVGEPLGRDEALALQGSGFDFSNDEIESFSDTDRKLADES
>P9WJ49 ~~~~~~Putative antitoxin VapB17~~~
MTVKRTTIELDEDLVRAAQAVTGETLRATVERALQQLVAAAAEQAAARRRRIVDHLAHAGTHVDADVLLSEQAWR
>P95006 ~~~~~~Putative antitoxin VapB19~~~
MRTQVTLGKEELELLDRAAKASGASRSELIRRAIHRAYGTGSKQERLAALDHSRGSWRGRDFTGTEYVDAIRGDLNERLA
RLGLA
>P9WJ45 ~~~~~~Antitoxin VapB20~~~
MKRLQIYIDEDVDRALAVEARRRRTSKAALIREYVAEHLRQPGPDPVDAFVGSFVGEADLSASVDDVVYGKHE
>P9WJ43 ~~~~~~Antitoxin VapB21~~~COG5450
MHRGYALVVCSPGVTRTMIDIDDDLLARAAKELGTTTKKDTVHAALRAALRASAARSLMNRMAENATGTQDEALVNAMWR
DGHPENTA
>P71622 ~~~~~~Antitoxin VapB22~~~COG4118
MTATEVKAKILSLLDEVAQGEEIEITKHGRTVARLVAATGPHALKGRFSGVAMAAADDDELFTTGVSWNVS
>P0CW33 ~~~~~~Antitoxin VapB25~~~
MRTTVSISDELLATAKRRARERGQSLGAVIEDALRRELAAARTGGARPTVPVFDAGTGPRPGIDLTSNTVLSEVLDEGLE
LNSRK
>O53778 ~~~~~~Antitoxin VapB26~~~
MDKTTVYLPDELKAAVKRAARQRGVSEAQVIRESIRAAVGGAKPPPRGGLYAGSEPIARRVDELLAGFGER
>O07779 ~~~~~~Antitoxin VapB27~~~COG2002
MKAVVDAAGRIVVPKPLREALGLQPGSTVEISRYGAGLHLIPTGRTARLEEENGVLVATGETTIDDEVVFGLIDSGRK
>P9WJ39 ~~~~~~Antitoxin VapB28~~~COG4423
MALNIKDPSVHQAVKQIAKITGESQARAVATAVNERLARLRSDDLAARLLAIGHKTASRMSPEAKRLDHDALLYDERGLP
A
>P9WJ37 ~~~~~~Putative antitoxin VapB29~~~
MRTTIDLPQDLHKQALAIARDTHRTLSETVADLMRRGLAANRPTALSSDPRTGLPLVSVGTVVTSEDVRSLEDEQ
>P9WJ35 ~~~~~~Antitoxin VapB30~~~COG4423
MALSIKHPEADRLARALAARTGETLTEAVVTALRERLARETGRARVVPLRDELAAIRHRCAALPVVDNRSAEAILGYDER
GLPA
>O53811 ~~~~~~Antitoxin VapB31~~~
MRTTVSISDEILAAAKRRARERGQSLGAVIEDALRREFAAAHVGGARPTVPVFDGGTGPRRGIDLTSNRALSEVLDEGLE
LNSRK
>P9WJ33 ~~~~~~Antitoxin VapB32~~~COG5450
MRTTVTVDDALLAKAAELTGVKEKSTLLREGLQTLVRVESARRLAALGGTDPQATAAPRRRTSPR
>O50456 ~~~~~~Antitoxin VapB33~~~
MRTTLTLDDDVVRLVEDAVHRERRPMKQVINDALRRALAPPVKRQEQYRLEPHESAVRSGLDLAGFNKLADELEDEALLD
ATRRAR
>P9WF17 ~~~~~~Antitoxin VapB35~~~COG4118
MNEVSIRTLNQETSKVLARVKRGEEINLTERGKVIARIIPASAGPLDSLISTGSVQPARVHGPAPRPTIPMRGGLDSGTL
LERMRAEERY
>P9WJ29 ~~~~~~Putative antitoxin VapB36~~~COG4423
MALNIKDPEVDRLAAELADRLHTSKTAAIRHALSAQLAFLESRAGDREAQLLDILRTEIWPLLADRSPITKLEREQILGY
DPATGV
>P9WJ25 ~~~~~~Putative antitoxin VapB38~~~
MRTTLDLDDDVIAAARELASSQRRSLGSVISELARRGLMPGRVEADDGLPVIRVPAGTPPITPEMVRRALDED
>P9WJ23 ~~~~~~Antitoxin VapB39~~~
MRTTLQIDDDVLEDARSIARSEGKSVGAVISELARRSLRPVGIVEVDGFPVFDVPPDAPTVTSEDVVRALEDDV
>P9WFC3 ~~~~~~Antitoxin VapB40~~~COG2002
MRTTIDVAGRLVIPKRIRERLGLRGNDQVEITERDGRIEIEPAPTGVELVREGSVLVARPERPLPPLTDEIVRETLDRTR
R
>P9WJ21 ~~~~~~Antitoxin VapB41~~~
MKTTLDLPDELMRAIKVRAAQQGRKMKDVVTELLRSGLSQTHSGAPIPTPRRVQLPLVHCGGAATREQEMTPERVAAALL
DQEAQWWSGHDDAAL
>P9WL41 ~~~~~~Antitoxin VapB43~~~COG3905
MRTTIRIDDELYREVKAKAARSGRTVAAVLEDAVRRGLNPPKPQAAGRYRVQPSGKGGLRPGVDLSSNAALAEAMNDGVS
VDAVR
>O53464 ~~~~~~Putative antitoxin VapB45~~~COG2442
MAGDQELELRFDVPLYTLAEASRYLVVPRATLATWADGYERRPANAPAVQGQPIITALPHPTGSHARLPFVGIAEAYVLN
AFRRAGVPMQRIRPSLDWLIKNVGPHALASQDLCTDGAEVLWRFAERSGEGSPDDLVVRGLIVPRSGQYVFKEIVEHYLQ
QISFADDNLASMIRLPQYGDANVVLDPRRGYGQPVFDGSGVRVADVLGPLRAGATFQAVADDYGVTPDQLRDALDAIAA
>P9WF13 ~~~~~~Antitoxin VapB46~~~COG4118
MTPTACATVSTMTSVGVRALRQRASELLRRVEAGETIEITDRGRPVALLSPLPQGGPYEQLLASGEIERATLDVVDLPEP
LDLDAGVELPSVTLARLREHER
>P9WF23 ~~~~~~Antitoxin VapB47~~~COG4118
MRATVGLVEAIGIRELRQHASRYLARVEAGEELGVTNKGRLVARLIPVQAAERSREALIESGVLIPARRPQNLLDVTAEP
ARGRKRTLSDVLNEMRDEQ
>P9WF15 ~~~~~~Putative antitoxin VapB49~~~COG4118
MQLGRKVTSHHDIDRFGVASTADESVYRPLPPRLRLAQVNLSRRRCRTQSDMYKSRFSECTVQSVDVSVTELRAHLSDWL
DRARAGGEVVITERGIPIARLAALDSTDTLERLTAEGVIGKATAQRPVAAGRPRPRPQRPVSDRVSDQRR
>I6WXS6 ~~~~~~Putative antitoxin VapB51~~~
MAKHLVDIDEQALNMARTELGTTTIKDTVNAALRQATSQRVQRVAAALDTLAAAPPEDRAEAWR
>P9WFA7 3.1.-.-~~~~~~Ribonuclease VapC10~~~COG1487
MILVDSDVLIAHLRGVVAARDWLVSARKDGPLAISVVSTAELIGGMRTAERREVWRLLASFRVQPATEVIARRAGDMMRR
YRRSHNRIGLGDYLIAATADVQDLQLATLNVWHFPMFEQLKPPFAVPGHRPRA
>P9WFA5 3.1.-.-~~~~~~Ribonuclease VapC11~~~COG1487
MILIDTSAWVEYFRATGSIAAVEVRRLLSEEAARIAMCEPIAMEILSGALDDNTHTTLERLVNGLPSLNVDDAIDFRAAA
GIYRAARRAGETVRSINDCLIAALAIRHGARIVHRDADFDVIARITNLQAASFR
>P9WFA1 3.1.-.-~~~~~~Ribonuclease VapC13~~~COG1848
MILVDSNIPMYLVGASHPHKLDAQRLLESALSGGERLVTDAEVLQEICHRYVAIKRREAIQPAFDAIIGVVDEVLPIERT
DVEHARDALLRYQTLSARDALHIAVMAHHDITRLMSFDRGFDSYPGIKRLA
>P9WF97 3.1.-.-~~~~~~Ribonuclease VapC15~~~COG1487
MIVDTSVWIAYLSTSESLASRWLADRIAADSTVIVPEVVMMELLIGKTDEDTAALRRRLLQRFAIEPLAPVRDAEDAAAI
HRRCRRGGDTVRSLIDCQVAAMALRIGVAVAHRDRDYEAIRTHCGLRTEPLF
>P9WF95 3.1.-.-~~~~~~Ribonuclease VapC17~~~COG1487
MTTWILDKSAHVRLVAGATPPAGIDLTDLAICDIGELEWLYSARSATDYDSQQTSLRAYQILRAPSDIFDRVRHLQRDLA
HHRGMWHRTPLPDLFIAETALHHRAGVLHHDRDYKRIAVVRPGFQACELSRGR
>P9WF93 3.1.-.-~~~~~~Ribonuclease VapC19~~~COG1487
MKLIDTTIAVDHLRGEPAAAVLLAELINNGEEIAASELVRFELLAGVRESELAALEAFFSAVVWTLVTEDIARIGGRLAR
RYRSSHRGIDDVDYLIAATAIVVDADLLTTNVRHFPMFPDLQPPY
>P95004 3.1.-.-~~~~~~23S rRNA-specific endonuclease VapC20~~~COG2402
MIFVDTSFWAALGNAGDARHGTAKRLWASKPPVVMTSNHVLGETWTLLNRRCGHRAAVAAAAIRLSTVVRVEHVTADLEE
QAWEWLVRHDEREYSFVDATSFAVMRKKGIQNAYAFDGDFSAAGFVEVRPE
>P9WF91 3.1.-.-~~~~~~Ribonuclease VapC21~~~COG1487
MTTRYLLDKSAAYRAHLPAVRHRLEPLMERGLLARCGITDLEFGVSARSREDHRTLGTYRRDALEYVNTPDTVWVRAWEI
QEALTDKGFHRSVKIPDLIIAAVAEHHGIPVMHYDQDFERIAAITRQPVEWVVAPGTA
>P71623 3.1.-.-~~~~~~Ribonuclease VapC22~~~COG3744
MTTVLLDSHVAYWWSAEPQRLSMAASQAIEHADELAVAAISWFELAWLAEQERIQLAIPVLSWLQQLAEHVRTVGITPSV
AATAVALPSSFPGDPADRLIYATAIEHGWRLVTKDRRLRSHRHPRPVTVW
>P9WF87 3.1.-.-~~~~~~Ribonuclease VapC24~~~COG1848
MLSIDTNILLYAQNRDCPEHDAAAAFLVECAGRADVAVCELVLMELYQLLRNPTVVTRPLEGPEAAEVCQTFRRNRRWAL
LENAPVMNEVWVLAATPRIARRRLFDARLALTLRHHGVDEFATRNINGFTDFGFSRVWDPITSDG
>P9WF85 3.1.-.-~~~~~~Ribonuclease VapC25~~~COG1848
MFLIDVNVLLAAHRGDHPNHRTVRPWFDRLLAADDPFTVPNLVWASFLRLTTNRRIFEIPSPRADAFAFVEAVNAQPHHL
PTSPGPRHLVLLRKLCDEADASGDLIPDAVLGAIAVEHHCAVVSLDRDFARFASVRHIRPPI
>O53779 3.1.-.-~~~~~~Ribonuclease VapC26~~~COG2402
MIIDTSALLAYFDAAEPDHAAVSECIDSSADALVVSPYVVAELDYLVATRVGVDAELAVLRELAGGAWELANCGAAEIEQ
AARIVTKYQDQRIGIADAANVVLADRYRTRTILTLDRRHFSALRPIGGGRFTVIP
>P9WF83 3.1.-.-~~~~~~Ribonuclease VapC27~~~COG1848
MKPPLAVDTSVAIPLLVRTHTAHAAVVAWWAHREAALCGHALAETYSVLTRLPRDLRLAPMDAARLLTERFAAPLLLSSR
TTEHLPRVLAQFEITGGAVYDALVALAAAEHRAELATRDARAKDTYEKIGVHVVVAA
>P9WF81 3.1.-.-~~~~~~Ribonuclease VapC28~~~COG3742
MIVDTSAIIAILRDEDDAAAYADALANADVRRLSAASYLECGIVLDSQRDPVISRALDELIEEAEFVVEPVTERQARLAR
AAYADFGRGSGHPAGLNFGDCLSYALAIDRREPLLWKGNDFGHTGVQRALDRR
>P9WF79 3.1.-.-~~~~~~Ribonuclease VapC29~~~COG1848
MTVLLDANVLIALVVAEHVHHDAAADWLMASDTGFATCPMTQGSLVRFLVRSGQSAAAARDVVSAVQCTSRHEFWPDALS
FAGVEVAGVVGHRQVTDAYLAQLARSHDGQLATLDSGLAHLHGDVAVLIPTTT
>P9WF77 3.1.-.-~~~~~~Ribonuclease VapC30~~~COG3742
MVIDTSALVAMLSDEPDAERFEAAVEADHIRLMSTASYLETALVIEARFGEPGGRELDLWLHRAAVDLVAVHADQADAAR
AAYRTYGKGRHRAGLNYGDCFSYGLAKISGQPLLFKGEDFQHTDIATVALP
>P9WF75 3.1.-.-~~~~~~Ribonuclease VapC31~~~COG1848
MFLLDANVLLAAHRGDHPNHRTVRPWFDRLLAADDPFTVPNLVWASFLRLATNRRIFEIPSPRAEAFAFVEAVTAQPHHL
PTNPGPRHLMLLRKLCDEADASGDLIPDAVLAAIAVGHHCAVVSLDRDFARFASVRHIRPPL
>P9WF73 3.1.-.-~~~~~~Ribonuclease VapC32~~~COG1487
MILVDTSVWIEHLRAADARLVELLGDDEAGCHPLVIEELALGSIKQRDVVLDLLANLYQFPVVTHDEVLRLVGRRRLWGR
GLGAVDANLLGSVALVGGARLWTRDKRLKAACAESGVALAEEVS
>P9WF69 3.1.-.-~~~~~~Ribonuclease VapC33~~~COG1848
MIIPDINLLLYAVITGFPQHRRAHAWWQDTVNGHTRIGLTYPALFGFLRIATSARVLAAPLPTADAIAYVREWLSQPNVD
LLTAGPRHLDIALGLLDKLGTASHLTTDVQLAAYGIEYDAEIHSSDTDFARFADLKWTDPLRE
>P9WF67 3.1.-.-~~~~~~Ribonuclease VapC35~~~COG1848
MIYLETSALVKLIRIEVESDALADWLDDRTELRWITSALTEVELSRAIRAVSPEGLPAVPSVLARLDRFEIDAVIRSTAA
AYPNPALRSLDAIHLATAQTAGSVAPLTALVTYDNRLKEAAEALSLAVVAPGQAR
>P9WF65 3.1.-.-~~~~~~Ribonuclease VapC36~~~COG3742
MIVDTSAVVALVQGERPHATLVAAALAGAHSPVMSAPTVAECLIVLTARHGPVARTIFERLRSEIGLSVSSFTAEHAAAT
QRAFLRYGKGRHRAALNFGDCMTYATAQLGHQPLLAVGNDFPQTDLEFRGVVGYWPGVA
>O53501 3.1.-.-~~~~~~Ribonuclease VapC37~~~COG1848
MKIVDANVLLYAVNTTSEHHKPSLRWLDGALSGADRVGFAWVPLLAFVRLATKVGLFPRPLPREAAITQVADWLAAPSAV
LVNPTVRHADILARMLTYVGTGANLVNDAHLAALAVEHRASIVSYDSDFGRFEGVRWDQPPALL
>O53219 3.1.-.-~~~~~~Ribonuclease VapC38~~~COG1848
MALLDVNALVALAWDSHIHHARIREWFTANATLGWATCPLTEAGFVRVSTNPKVLPSAIGIADARRVLVALRAVGGHRFL
ADDVSLVDDDVPLIVGYRQVTDAHLLTLARRRGVRLVTFDAGVFTLAQQRPKTPVELLTIL
>P9WF63 3.1.-.-~~~~~~Ribonuclease VapC39~~~COG1848
MTALLDVNVLIALGWPNHVHHAAAQRWFTQFSSNGWATTPITEAGYVRISSNRSVMQVSTTPAIAIAQLAAMTSLAGHTF
WPDDVPLIVGSAGDRDAVSNHRRVTDCHLIALAARYGGRLVTFDAALADSASAGLVEVL
>P9WF61 3.1.-.-~~~~~~Ribonuclease VapC40~~~COG1848
MIAPDTSVLVAGFATWHEGHEAAVRALNRGVHLIAHAAVETYSVLTRLPPPHRIAPVAVHAYLADITSSNYLALDACSYR
GLTDHLAEHDVTGGATYDALVGFTAKAAGAKLLTRDLRAVETYERLRVEVELVT
>P9WF59 3.1.-.-~~~~~~Ribonuclease VapC41~~~COG1848
MLLCDTNIWLALALSGHVHHRASRAWLDTINAPGVIHFCRATQQSLLRLLTNRTVLGAYGSPPLTNREAWAAYAAFLDDD
RIVLAGAEPDGLEAQWRAFAVRQSPAPKVWMDAYLAAFALTGGFELVTTDTAFTQYGGIELRLLAK
>P9WF57 3.1.-.-~~~~~~Ribonuclease VapC42~~~COG3742
MIVDTSAIVAIVSGESGAQVLKEALERSPNSRMSAPNYVELCAIMQRRDRPEISRLVDRLLDDYGIQVEAVDADQARVAA
QAYRDYGRGSGHPARLNLGDTYSYALAQVTGEPLLFRGDDFTHTDIRPACT
>P9WF55 3.1.-.-~~~~~~Ribonuclease VapC43~~~COG1848
MLCVDVNVLVYAHRADLREHADYRGLLERLANDDEPLGLPDSVLAGFIRVVTNRRVFTEPTSPQDAWQAVDALLAAPAAM
RLRPGERHWMAFRQLASDVDANGNDIADAHLAAYALENNATWLSADRGFARFRRLRWRHPLDGQTHL
>P9WF53 3.1.-.-~~~~~~Ribonuclease VapC44~~~COG1848
MRALLDVNVLLALLDRDHVDHERARAWITGQIERGWASCAITQNGFVRVISQPRYPSPISVAHAIDLLARATHTRYHEFW
SCTVSILDSKVIDRSRLHSPKQVTDAYLLALAVAHDGRFVTFDQSIALTAVPGATKQHLATL
>O53465 3.1.-.-~~~~~~Putative ribonuclease VapC45~~~
MQPDRNLLADLDHIFVDRSLGAVQVPQLLRDAGFRLTTMREHYGETQAQSVSDHKWIAMTAECGWIGFHKDANIRRNAVE
RRTVLDTGARLFCVPRADILAEQVAARYIASLAAIARAARFPGPFIYTVHPSKIVRVL
>O50411 3.1.-.-~~~~~~Ribonuclease VapC46~~~COG1848
MAAIYLDSSAIVKLAVREPESDALRRYLRTRHPRVSSALARAEVMRALLDKGESARKAGRRALAHLDLLRVDKRVLDLAG
GLLPFELRTLDAIHLATAQRLGVDLGRLCTYDDRMRDAAKTLGMAVIAPS
>P9WF49 3.1.-.-~~~~~~Ribonuclease VapC47~~~COG1848
MIYMDTSALTKLLISEPETTELRTWLTAQSGQGEDAATSTLGRVESMRVVARYGQPGQTERARYLLDGLDILPLTEPVIG
LAETIGPATLRSLDAIHLAAAAQIKRELTAFVTYDHRLLSGCREVGFVTASPGAVR
>P9WF47 3.1.-.-~~~~~~Ribonuclease VapC48~~~COG1848
MSETFDVDVLVHATHRASPFHDKAKTLVERFLAGPGLVYLLWPVALGYLRVVTHPTLLGAPLAPEVAVENIEQFTSRPHV
RQVGEANGFWPVYRRVADPVKPRGNLVPDAHLVALMRHHGIATIWSHDRDFRKFEGIRIRDPFSG
>P9WF51 3.1.-.-~~~~~~Ribonuclease VapC49~~~COG1848
MPLVYFDASAFVKLLTTETGSSLASALWDGCDAALSSRLAYPEVRAALAAAARNHDLTESELADAERDWEDFWAATRPVE
LTATVEQHAGHLARAHALRGADAVHLASALAVGDPGLVVAVWDRRLHTGAHAAGCRVAPAQLDP
>L0TGF0 3.1.-.-~~~~~~Putative ribonuclease VapC50~~~COG1569
MPCCGSLTRAPIGLCGRRTSWPRLGEPWSTASTSAPNGLTTAFAFGYNDLIAAMNNHYKDRHVLAAAVRERAEVIVTTNL
KHFPDDALKPYQIKALHPDDFLLDQLDLYEEATKAVILGMVDAYIDPPFTPHSLLDALGEQVPQFAAKARRLFPSGSPFG
LGVLLPFDQ
>L0T5V6 3.1.-.-~~~~~~Ribonuclease VapC51~~~COG1487
MALKYLLDTSVIKRLSRPAVRRAVEPLAEAGAVARTQITDLEVGYSARNETEWQRLMVALSAFDLIESTASHHRRALGIQ
RLLAARSQRGRKIPDLLIAAAGEEHGLVVLHYDADFDLIAAVTGQPCQWIVPAGTID
>F9USN6 1.1.1.-~~~vprA~~~Vinyl phenol reductase~~~COG1053
MTLAKHDSYDIVVVGTGAAGTAAALEAAQHGASVLLLEKGRHTGGSSNYTEGLFAVDSYLQKAQNINVSATDVLKEEVDY
SKYRADSRIWRRYLDDSANTVQWLKDQGVEYEGVQAMGAGEATWHIYKGMGQAVLHDALQPQAQKLGVELLTSTTAITLH
QATDGAITGVMIQSAATNETQVINTAAVILATGGYLNNPDMMQKLTHYDTRRLIPVSSGKGTGDGLRLAWQAGAQQYGTG
MAMLFGGYLKDPSEPSFKYMASQMETAAGQQPLLWLNEHGERFVDEAVVYNFSYAGNALYTQNQVFSILDQGVINKMAQD
GNFMGLGVYVRRGEKMTKLQAEIDAAVAANKPFIFKANTIEALATKMHLPVDQVTHSIQTYNQYCDNGQDDDFGKNPEYL
VKVSQGPFYGFELNVGAFCTMGGLKVTTNNEVLDTTGQPITGLYAAGNDAAGLTGDTYGPNMPGTCVGYAFYSGRNSGRH
AAQYTHQQSIVSH
>Q2FX09 ~~~vraR~~~Response regulator protein VraR~~~COG2197
MTIKVLFVDDHEMVRIGISSYLSTQSDIEVVGEGASGKEAIAKAHELKPDLILMDLLMDDMDGVEATTQIKKDLPQIKVL
MLTSFIEDKEVYRALDAGVDSYILKTTSAKDIADAVRKTSRGESVFEPEVLVKMRNRMKKRAELYEMLTEREMEILLLIA
KGYSNQEIASASHITIKTVKTHVSNILSKLEVQDRTQAVIYAFQHNLIQ
>Q7A2Q1 ~~~vraR~~~Response regulator protein VraR~~~
MTIKVLFVDDHEMVRIGISSYLSTQSDIEVVGEGASGKEAIAKAHELKPDLILMDLLMEDMDGVEATTQIKKDLPQIKVL
MLTSFIEDKEVYRALDAGVDSYILKTTSAKDIADAVRKTSRGESVFEPEVLVKMRNRMKKRAELYEMLTEREMEILLLIA
KGYSNQEIASASHITIKTVKTHVSNILSKLEVQDRTQAVIYAFQHNLIQ
>Q7A4R9 ~~~vraR~~~Response regulator protein VraR~~~
MTIKVLFVDDHEMVRIGISSYLSTQSDIEVVGEGASGKEAIAKAHELKPDLILMDLLMEDMDGVEATTQIKKDLPQIKVL
MLTSFIEDKEVYRALDAGVDSYILKTTSAKDIADAVRKTSRGESVFEPEVLVKMRNRMKKRAELYEMLTEREMEILLLIA
KGYSNQEIASASHITIKTVKTHVSNILSKLEVQDRTQAVIYAFQHNLIQ
>Q7A2Q0 2.7.13.3~~~vraS~~~Sensor protein VraS~~~
MNHYNRTIGSMLILVYSMLAAFLFIDKVFVNIIYFQGMFYTQIFGIPVFLFLNLIIILLCIIVGSVLAYKINQQNDWIKT
QIERSMEGETVGINDQNIEIYSETLDLYHTLVPLNQELHKLRLKTQNLTNENYNINDVKVKKIIEDERQRLARELHDSVS
QQLFAASMMLSAIKETKLEPPLDQQIPILEKMVQDSQLEMRALLLHLRPLGLKDKSLGEGIKDLVIDLQKKVPMKVVHEI
QDFKVPKGIEDHLFRITQEAISNTLRHSNGTKVTVELFNKDDYLLLRIQDNGKGFNVDEKLEQSYGLKNMRERALEIGAT
FHIVSLPDSGTRIEVKAPLNKEDSYDD
>Q99SZ7 2.7.13.3~~~vraS~~~Sensor protein VraS~~~
MNHYIRTIGSMLILVYSMLAAFLFIDKVFVNIIYFQGMFYTQIFGIPVFLFLNLIIILLCIIVGSVLAYKINQQNDWIKT
QIERSMEGETVGINDQNIEIYSETLDLYHTLVPLNQELHKLRLKTQNLTNENYNINDVKVKKIIEDERQRLARELHDSVS
QQLFAASMMLSAIKETKLEPPLDQQIPILEKMVQDSQLEMRALLLHLRPLGLKDKSLGEGIKDLVIDLQKKVPMKVVHEI
QDFKVPKGIEDHLFRITQEAISNTLRHSNGTKVTVELFNKDDYLLLRIQDNGKGFNVDEKLEQSYGLKNMRERALEIGAT
FHIVSLPDSGTRIEVKAPLNKEDSYDD
>P21455 ~~~mkaB~~~28.1 kDa virulence protein~~~
MNMNQTTSPALSQVETAIRVPAGIFAKYNYYSVFDIVRQTRKQFINANMSWPGSRGGKTWDLAMGQAQYIRCMFRENQLT
RRVRGTLQQTPDNGTNLSSSAVGGIQGQAERRPDLATLMVVNDAINQQIPTLLPTHFPHDQVELSLLNTDVSLEDIISES
SIDWPWFLSNSLTGDNSNYAMELASRLSPEQQTLPTEPDNSTATDLTSFYQTNLGLKTADYTPFEALNTFARQLAITVPP
GGTVDCGYSACQPAV
>P13041 ~~~mkaC~~~Virulence genes transcriptional activator~~~
MDFLINKKLKIFITLMETGSFSIATSVLYITRTPLSRVISDLERELKQRLFIRKNGTLIPTEFAQTIYRKVKSHYIFLHA
LEQEIGPTGKTKQLEIIFDEIYPGSLKNLIISALTISGQKTNIMGRAVNSQIIEELCQTNNCIVISARNYFHRESLVCRT
SVEGGVMLFIPKKFFLCGKPDINRLAGTPVLFHEGAKNFNLDTIYHFFEQTLGITNPAFSFDNVDLFSSLYRLQQGLAML
LIPVRVCRALGLSTDHALHIKGVALCTSLYYPTKKRETPDYRKAIKLIQQELKQSTF
>P0A2N2 ~~~vsdE~~~Virulence protein vsdE~~~
MRVSGSASSQDIISRINSKNINNNDSNEVKRIKDALCIESKERILYPQNLSRDNLKQMARYVNNTYVHYSGNCVLLSACL
HYNIHHRQDILSSKNTASPTVGLDSAIVDKIIFGHELNQSYCLNSIDEVEKEILNRYDIKRESSFIISAENYIAPIIGEC
RHDFNAVVICEYDKKPYVQFIDSWKTSNILPSLQEIKKHFSSSGEFYVRAYDEKHD
>Q9KVG9 ~~~vspR~~~Transcriptional regulator VspR~~~
MRRSMKISAEMYKLLIERTLDGFSVIELRDEFIVIKDSLIDPDEAYKKVYRQILRFIKKGWLNGEGSGRQKRYFQTDTFK
ALHAEPKSENVDIEIVLNQDYSVLVSERNQYKGELEIVLGEIDEYQSLNIRFPELEPKLITLLDEAKERSACLLGKVNGL
TNVLKVLSGQKIVHQKFKTKALLFERT
>P09184 3.1.-.-~~~vsr~~~DNA mismatch endonuclease Vsr~~~COG3727
MADVHDKATRSKNMRAIATRDTAIEKRLASLLTGQGLAFRVQDASLPGRPDFVVDEYRCVIFTHGCFWHHHHCYLFKVPA
TRTEFWLEKIGKNVERDRRDISRLQELGWRVLIVWECALRGREKLTDEALTERLEEWICGEGASAQIDTQGIHLLA
>Q9L9Z9 ~~~vuuA~~~Ferric vulnibactin receptor VuuA~~~
MAALRPARTSVAEKKTFKLHALSAVVMGLCASGQAYAQTESTNSNKKEEMPVVVVIGEKTERTIYDTSSSVQVFDQETID
NTPGATEIDDLLQLIPNMVDSGQGNSMPTVRGIDGSGPSIGGLASFAGTSPRLNMSIDGRSLTYSEIAFGPRSLWDMQQV
EVYLGPQSYIQGRNASAGAIVMKTNDPTHHFESAVKAGVGERNYSQTAAMISAPIIQDELAFRLSFDQQKRDSFVDLASY
EPAGDAKKIEMNSVRGKLLYEPSALAGFKTTLGVSHMDSRGPQSESTNVVGNEAFRPVYETKSLSTAWDISWQLNEVLTF
ENNLVYSKFAFDRYTNPLQKGDYTAEGKEFHVEPLLRYLSLGGRVNALVGARYYKSSQDDEYVDATSANPMSGSTKTQSA
FAELTYALTQSIDVTVAGRYEKERVKRKVSDPRFKLDHDDTLSVFLPKFDIAFKPDMAQTFGFKVAKGYNSGGAGLAFNP
ILGGGFSPYQFEEEYIWNYEFYTRHRLGNSVELMTNTFYNDFDSMQMTQTLSNGDVLIANLDNAKTYGAEIGTRWYATDS
LELFANLGLLKTEYKEVNGTSKELERAPNMTGNLGGQYSFFDGFELSANAAYTGDYFSDRSNTEIVKIDAYWVANAQLAY
VFENGRAALFATNLFDSDKTTLYARGSLNEPLKQQPRMIGASLQLNF
>V5XKC3 1.16.1.7~~~vuuB~~~Ferric vulnibactin reductase VuuB~~~
MSDSPERVYPMLLDFVRKETISKNLLRVTLTGEDLIGFPEDQNGSHIKVFFPNQASGILQLPVREGDNVIWPEHKPVPRA
YSVRQYRAAVNELDIDFVTHGEETPGGGWALKADIGSQIGLIGPAGPDPLIEPADWHIIAGDLSAVPAISAILEKLPSDA
KGYVFIEVDEIEDIHDLVHPEEMAINWLMRNPHDTEPALAKAIKQLPSPEKATSLSAFIAGENQSVINCRKILRNDYQIA
RDKLYAIPYWKRGKTEEAYHDERHDVMDAVY
>P19247 ~~~vvhA~~~Cytolysin~~~
MKKMTLFTLSLLATAVQVGAQEYVPIVEKPIYITSSKIKCVLHTSGDFNATRDWCNAGASIDVRVNVAQMRSVQSATSDG
FTPDAKIVRFTVDADKPGTGIHLVNELQQDHSWFQSWANRRTYIGPFASSYDLWVKPVSGYTPKKARDLPQNENKNYQHR
DTYGYSIGINGKVGAEVNKDGPKVGGEVSGSFTYNYSKTLVFDTKDYRINNRSSLSDFDISFEREFGECDELRRQELGCY
FTAAHWGSGWVFDKTKFNPISYSNFKPNYDVLYEAPVSETGVTDFEMGVKLNYRARFGTVLPSALFSVYGSAGSSTNSST
VKQRIRIDWNHPLFEAEAHVTLQSLSNNDLCLDVYGENGDKTVAGGSVNGWSCHGSWNQVWGLDKEERYRSRVASDRCLT
VNADKTLTVEQCGANLAQKWYWEGDKLISRYVDGNNTRYLLNIVGGRNVQVTPENEANQARWKPTLQQVKL
>Q9KM24 2.7.13.3~~~vxrA~~~Sensor histidine kinase VxrA~~~COG0642
MRYSFCMLEKTNIPLIRALNLTLVSLCFAMLPNPVHADSLPERIDLFVSLFDYNSATTSYDIRSIQTDFPTRLLTPDSML
PQTSEYPLKDIQLLYKLAQSCTGKLPLSPLITEPLVFTRSLCKGSSLSPRWFARSGLIHPGGGTYAFRYAEKYPAQFANL
LPYMHIQERPNAAEGTLLYHLQNMGEDAINALVSGASMFGSGSDLWLRKGDIYYLFNEETWLTNANKAGLSYSLLSADNT
CFIQRGNICWDVEDHSDLLRTSMIILVIANIFLVLGWSGYRWNSKRQEMRSRMLILQILTHELRTPIASLSLTVEGFRRE
FEHLPESLYDEFRRLCEDSRRLRQLAEASKDYLQSDSKPLASDWVPSVEEWLQYKVEEEFSGNVTLKLNQDIAAKLNVYW
LGTCVDNLLRNAVKYGVAPVTLEVITQTNLVTFKVTDQGSLTHRDWRHLRKPFVSKSGLGLGLTIVESMVGRMGGKMSLE
GPPTTFILEIPCETDTASR
>Q9KM23 ~~~vxrB~~~Transcriptional regulatory protein VxrB~~~COG0745
MSNQWWDEWAEKCHSKAHRQLLFWRYLVKQTLLLVEDDKNLADGLLVSLEQAGYDCLHAETIADVKQHWDKADLVILDRQ
LPDGDSVQHLMDWKKIKDIPVILLTALVTVKDKVTGLDAGANDYLTKPFAEAELFARIRAQLRSPDSGQDDSKVVTSNLT
IDKATREVFFNGESITLTRTEFDLLLFLASNLGRVFTRDELLDHVWGYNHFPTTRTVDTHVLQLRQKLPGLEIETLRGVG
YKMKA
>A0R006 ~~~~~~Cell wall synthesis protein Wag31~~~COG3599
MPLTPADVHNVAFSKPPIGKRGYNEDEVDAFLDLVENELTRLIEENADLRQRVAELDQELAAARSGAGASSQATSSIPLY
EPEPEPAPAPPQPVYEAPAQPAAPQSEDTAVRAARVLSLAQDTADRLTSTAKAEADKLLSDARAQAEAMVSDARQTAETT
VSEARQRADAMLADAQTRSEAQLRQAQEKADALQADAERKHSEIMGTINQQRTVLEGRLEQLRTFEREYRTRLKTYLESQ
LEELGQRGSAAPVDSSANSDASGFGQFNRGNN
>P9WMU1 ~~~~~~Cell wall synthesis protein Wag31~~~COG3599
MPLTPADVHNVAFSKPPIGKRGYNEDEVDAFLDLVENELTRLIEENSDLRQRINELDQELAAGGGAGVTPQATQAIPAYE
PEPGKPAPAAVSAGMNEEQALKAARVLSLAQDTADRLTNTAKAESDKMLADARANAEQILGEARHTADATVAEARQRADA
MLADAQSRSEAQLRQAQEKADALQADAERKHSEIMGTINQQRAVLEGRLEQLRTFEREYRTRLKTYLESQLEELGQRGSA
APVDSNADAGGFDQFNRGKN
>Q45614 2.7.13.3~~~walK~~~Sensor histidine kinase WalK~~~COG5002
MNKVGFFRSIQFKITLIYVLLIIIAMQIIGVYFVNQVEKSLISSYEQSLNQRIDNLSYYIEQEYKSDNDSTVIKDDVSRI
LNDFTKSDEVREISFVDKSYEVVGSSKPYGEEVAGKQTTDLIFKRIFSTKQSYLRKYYDPKSKIRVLISAKPVMTENQEV
VGAIYVVASMEDVFNQMKTINTILASGTGLALVLTALLGIFLARTITHPLSDMRKQAMELAKGNFSRKVKKYGHDEIGQL
ATTFNHLTRELEDAQAMTEGERRKLASVIAYMTDGVIATNRNGAIILLNSPALELLNVSRETALEMPITSLLGLQENYTF
EDLVEQQDSMLLEIERDDELTVLRVNFSVIQREHGKIDGLIAVIYDVTEQEKMDQERREFVANVSHELRTPLTTMRSYLE
ALAEGAWENKDIAPRFLMVTQNETERMIRLVNDLLQLSKFDSKDYQFNREWIQIVRFMSLIIDRFEMTKEQHVEFIRNLP
DRDLYVEIDQDKITQVLDNIISNALKYSPEGGHVTFSIDVNEEEELLYISVKDEGIGIPKKDVEKVFDRFYRVDKARTRK
LGGTGLGLAIAKEMVQAHGGDIWADSIEGKGTTITFTLPYKEEQEDDWDEA
>Q2G2U4 2.7.13.3~~~walK~~~Sensor protein kinase WalK~~~COG5002
MKWLKQLQSLHTKLVIVYVLLIIIGMQIIGLYFTNNLEKELLDNFKKNITQYAKQLEISIEKVYDEKGSVNAQKDIQNLL
SEYANRQEIGEIRFIDKDQIIIATTKQSNRSLINQKANDSSVQKALSLGQSNDHLILKDYGGGKDRVWVYNIPVKVDKKV
IGNIYIESKINDVYNQLNNINQIFIVGTAISLLITVILGFFIARTITKPITDMRNQTVEMSRGNYTQRVKIYGNDEIGEL
ALAFNNLSKRVQEAQANTESEKRRLDSVITHMSDGIIATDRRGRIRIVNDMALKMLGMAKEDIIGYYMLSVLSLEDEFKL
EEIQENNDSFLLDLNEEEGLIARVNFSTIVQETGFVTGYIAVLHDVTEQQQVERERREFVANVSHELRTPLTSMNSYIEA
LEEGAWKDEELAPQFLSVTREETERMIRLVNDLLQLSKMDNESDQINKEIIDFNMFINKIINRHEMSAKDTTFIRDIPKK
TIFTEFDPDKMTQVFDNVITNAMKYSRGDKRVEFHVKQNPLYNRMTIRIKDNGIGIPINKVDKIFDRFYRVDKARTRKMG
GTGLGLAISKEIVEAHNGRIWANSVEGQGTSIFITLPCEVIEDGDWDE
>Q7A8E0 2.7.13.3~~~walK~~~Sensor protein kinase WalK~~~
MKWLKQLQSLHTKLVIVYVLLIIIGMQIIGLYFTNNLEKELLDNFKKNITQYAKQLEISIEKVYDEKGSVNAQKDIQNLL
SEYANRQEIGEIRFIDKDQIIIATTKQSNRSLINQKANDSSVQKALSLGQSNDHLILKDYGGGKDRVWVYNIPVKVDKKV
IGNIYIESKINDVYNQLNNINQIFIVGTAISLLITVILGFFIARTITKPITDMRNQTVEMSRGNYTQRVKIYGNDEIGEL
ALAFNNLSKRVQEAQANTESEKRRLDSVITHMSDGIIATDRRGRIRIVNDMALKMLGMAKEDIIGYYMLSVLSLEDEFKL
EEIQENNDSFLLDLNEEEGLIARVNFSTIVQETGFVTGYIAVLHDVTEQQQVERERREFVANVSHELRTPLTSMNSYIEA
LEEGAWKDEELAPQFLSVTREETERMIRLVNDLLQLSKMDNESDQINKEIIDFNMFINKIINRHEMSAKDTTFIRDIPKK
TIFTEFDPDKMTQVFDNVITNAMKYSRGDKRVEFHVKQNPLYNRMTIRIKDNGIGIPINKVDKIFDRFYRVDKARTRKMG
GTGLGLAISKEIVEAHNGRIWANSVEGQGTSIFITLPCEVIEDGDWDE
>Q5HK19 2.7.13.3~~~walK~~~Sensor protein kinase WalK~~~COG5002
MKWLKQLQSLHTKLVIVYVLLIIIGMQIIGLYFTNSLEKELLDNFKKNITQYAKQLDVNIEKVYKDKDKGSVNAQKDIQD
LLNEYANRQEIGEIRFIDKDQIIMATTKQSNRGLINQKVNDGSVQKALSLGQTNDHMVLKDYGSGKERVWVYNIPVKVDK
QTIGDIYIESKINDVYNQLNNINQIFIVGTAISLFITVILGFFIARTITKPITDMRNQTVEMSKGNYTQRVKIYGNDEIG
ELALAFNNLSKRVQEAQANTESEKRRLDSVITHMSDGILATDRRGRVRIANDMALKMLGLAKEDVIGYYMLGVLNLENEF
SLEEIQENSDSFLLDINEEEGIIARVNFSTIVQETGFVTGYIAVLHDVTEQQQVERERREFVANVSHELRTPLTSMNSYI
EALEEGAWQDKELAPSFLSVTREETERMIRLVNDLLQLSKMDNESDQITKEIIDFNMFINKIINRHEMAAKDTTFVREIP
QQTIFAEIDPDKMTQVFDNVITNAMKYSRGEKRVEFHVKQNALYNRMTIRIKDNGIGIPINKVDKIFDRFYRVDKARTRK
MGGTGLGLAISKEIVEAHNGRIWANSVEGQGTSIFITLPCEIIEDGDWDE
>Q8CU87 2.7.13.3~~~walK~~~Sensor protein kinase WalK~~~COG5002
MKWLKQLQSLHTKLVIVYVLLIIIGMQIIGLYFTNSLEKELLDNFKKNITQYAKQLDVNIEKVYKDKDKGSVNAQKDIQD
LLNEYANRQEIGEIRFIDKDQIIMATTKQSNRGLINQKVNDGSVQKALSLGQTNDHMVLKDYGSGKERVWVYNIPVKVDK
QTIGDIYIESKINDVYNQLNNINQIFIVGTAISLFITVILGFFIARTITKPITDMRNQTVEMSKGNYTQRVKIYGNDEIG
ELALAFNNLSKRVQEAQANTESEKRRLDSVITHMSDGILATDRRGRVRIANDMALKMLGLAKEDVIGYYMLGVLNLENEF
SLEEIQENSDSFLLDINEEEGIIARVNFSTIVQETGFVTGYIAVLHDVTEQQQVERERREFVANVSHELRTPLTSMNSYI
EALEEGAWQDKELAPSFLSVTREETERMIRLVNDLLQLSKMDNESDQITKEIIDFNMFINKIINRHEMAAKDTTFVREIP
QQTIFAEIDPDKMTQVFDNVITNAMKYSRGEKRVEFHVKQNALYNRMTIRIKDNGIGIPINKVDKIFDRFYRVDKARTRK
MGGTGLGLAISKEIVEAHNGRIWANSVEGQGTSIFITLPCEIIEDGDWDE
>A0A0H2ZNH9 2.7.13.3~~~walK~~~Sensor histidine protein kinase/phosphatase WalK~~~COG5002
MLDLLKQTIFTRDFIFILILLGFILVVTLLLLENRRDNIQLKQINQKVKDLIAGDYSKVLDMQGGSEITNITNNLNDLSE
VIRLTQENLEQESKRLNSILFYMTDGVLATNRRGQIIMINDTAKKQLGLVKEDVLNRSILELLKIEENYELRDLITQSPE
LLLDSQDINGEYLNLRVRFALIRRESGFISGLVAVLHDTTEQEKEERERRLFVSNVSHELRTPLTSVKSYLEALDEGALC
ETVAPDFIKVSLDETNRMMRMVTDLLHLSRIDNATSHLDVELINFTAFITFILNRFDKMKGQEKEKKYELVRDYPINSIW
MEIDTDKMTQVVDNILNNAIKYSPDGGKITVRMKTTEDQMILSISDHGLGIPKQDLPRIFDRFYRVDRARSRAQGGTGLG
LSIAKEIIKQHKGFIWAKSEYGKGSTFTIVLPYDKDAVKEEVWEDEVED
>Q8DPL8 2.7.13.3~~~walK~~~Sensor histidine protein kinase/phosphatase WalK~~~COG5002
MLDLLKQTIFTRDFIFILILLGFILVVTLLLLENRRDNIQLKQINQKVKDLIAGDYSKVLDMQGGSEITNITNNLNDLSE
VIRLTQENLEQESKRLNSILFYMTDGVLATNRRGQIIMINDTAKKQLGLVKEDVLNRSILELLKIEENYELRDLITQSPE
LLLDSQDINGEYLNLRVRFALIRRESGFISGLVAVLHDTTEQEKEERERRLFVSNVSHELRTPLTSVKSYLEALDEGALC
ETVAPDFIKVSLDETNRMMRMVTDLLHLSRIDNATSHLDVELINFTAFITFILNRFDKMKGQEKEKKYELVRDYPINSIW
MEIDTDKMTQVVDNILNNAIKYSPDGGKITVRMKTTEDQMILSISDHGLGIPKQDLPRIFDRFYRVDRARSRAQGGTGLG
LSIAKEIIKQHKGFIWAKSEYGKGSTFTIVLPYDKDAVKEEVWEDEVED
>P37478 ~~~walR~~~Transcriptional regulatory protein WalR~~~COG0745
MDKKILVVDDEKPIADILEFNLRKEGYEVHCAHDGNEAVEMVEELQPDLILLDIMLPNKDGVEVCREVRKKYDMPIIMLT
AKDSEIDKVIGLEIGADDYVTKPFSTRELLARVKANLRRQLTTAPAEEEPSSNEIHIGSLVIFPDAYVVSKRDETIELTH
REFELLHYLAKHIGQVMTREHLLQTVWGYDYFGDVRTVDVTVRRLREKIEDNPSHPNWIVTRRGVGYYLRNPEQD
>Q2G2U6 ~~~walR~~~Transcriptional regulatory protein WalR~~~COG0745
MARKVVVVDDEKPIADILEFNLKKEGYDVYCAYDGNDAVDLIYEEEPDIVLLDIMLPGRDGMEVCREVRKKYEMPIIMLT
AKDSEIDKVLGLELGADDYVTKPFSTRELIARVKANLRRHYSQPAQDTGNVTNEITIKDIVIYPDAYSIKKRGEDIELTH
REFELFHYLSKHMGQVMTREHLLQTVWGYDYFGDVRTVDVTIRRLREKIEDDPSHPEYIVTRRGVGYFLQQHE
>Q7A8E1 ~~~walR~~~Transcriptional regulatory protein WalR~~~
MARKVVVVDDEKPIADILEFNLKKEGYDVYCAYDGNDAVDLIYEEEPDIVLLDIMLPGRDGMEVCREVRKKYEMPIIMLT
AKDSEIDKVLGLELGADDYVTKPFSTRELIARVKANLRRHYSQPAQDTGNVTNEITIKDIVIYPDAYSIKKRGEDIELTH
REFELFHYLSKHMGQVMTREHLLQTVWGYDYFGDVRTVDVTIRRLREKIEDDPSHPEYIVTRRGVGYFLQQHE
>Q9RDT5 ~~~walR~~~Transcriptional regulatory protein WalR~~~
MARKVVVVDDEKPIADILEFNLKKEGYDVYCAYDGNDAVDLIYEEEPDIVLLDIMLPGRDGMEVCREVRKKYEMPIIMLT
AKDSEIDKVLGLELGADDYVTKPFSTRELIARVKANLRRHYSQPAQDTGNVTNEITIKDIVIYPDAYSIKKRGEDIELTH
REFELFHYLSKHMGQVMTREHLLQTVWGYDYFGDVRTVDVTIRRLREKIEDDPSHPEYIVTRRGVGYFLQQHE
>D4G3R4 3.1.-.-~~~wapA~~~tRNA(Glu)-specific nuclease WapA~~~
MSEYYFYYKRREKMKKRKRRNFKRFIAAFLVLALIISLVPADVLAKTTEEENGNRIVADDPEETLQKEQTEEAVPFDPKD
INKEGEITSERTENTKLYYEGDGVYKQEVYLDPIHTKETPDADWEDISPELKESTSKQVETENAILNSDFQKQMKNGLYA
TFEHNDHKVTYSLVEAKAPNKTSLTPKDTSADYKTDSNEIVYPDVFPNIDLQTFTFNENIKEDLVLHQYDGYNTFTFQLK
TDLQAKEQEDGSIDFSDEKGKVVFSVPKPFMTDSKLDELSGEVERSDKVSYKLEKNEEGYLLHLTADENWLKDPERVYPV
SIDPSTSLSVSSDTFVMSAYPTTNYSASSQKWDANLKAYVLKTGYYDKTTGTNYAFMKFNNLKPIQNMTVTKATLKTYVA
HSYYGTKATGLWLDTVNSNYDNAKVTWNTKPASKNIGKADVHKGQWASYDVTAAVKSWNSGGANYGFKLHTNGNGKEYWK
KLISSANSANKPYIEVTYTIPKGNTPTIKAYHNGDSTGYFDISWKKVEGAKGYKVWIYNGKEYQAISAGNVTSWSTKGKK
IWPTSAEIASKRYKLHLDGKDGAELALDPSPVYKNSGGSYATSKNYWIGVSAIFDQGEGAMSAPAKPVIPNVGKAQAPSA
KGYNNGNATGYFDLSWKAVSGATGYKVQVFNGKGFETLDLGNQTSWTTKGKKIWPTSAEIKAGKYALHLKDGSGAELPIN
PGPTYKNAGGDGAKKNYSFKIIAYNKDGEAIASPAANPALPDIARPKNLTGYLYTNTKSSQTGYVNLIWEKVQNAKGYKV
NIYNGKEYQSFDVGDADHWTTQNKNIWPTSEEIKAGSYKLHTDGKGGELALDPSPVYNNANGNYKGKKNYSFTLVAYDAN
GETIPTAPFNPTFHEGAEFLGTEEYWSIIDIPSGQLNGATGNVIVNEEDLSIDGRGPGLGLSRTYNSLDSSDHLFGQGWY
ADAETSVISTDGGAMYIDEDATTHRFTKKADGTYQPPTGVYLELTETADQFILKTKDQTNAYFNKKGGKLQKVVDGHNNA
TVYTYNDKNQLTAITDASGRKLTFTYDENGHVTSITGPKNKKVTYSYENDLLKKVTDTDGTVTSYDYDGEGRLVKQYSAN
STEAKPVFTEYQYSGHRLEKAINAKKETYVYSYDADKKTLLMTQPNGRKVQYGYNEAGNPIQVIDDAEGLKITTNTKYEG
NNVVEDVDPNDVGTGKATESYQYDKDGNVTSVKDAYGTETYEYNKNNDVTKMKDTEGNVTDIAYDGLDAVSETDQSGKSS
SAAVYDKYGNQIQSSKDLSASTNILKDGSFEAQKSGWNLTASKDSGKISIITDKSGVLSGSKALEILSQSTSAGTDHGFS
SATQTVELEPNTTYTLSGKIKTDLAKTRAYFNIDLRDKDQKRIQWIHNEYSALAGKNDWTKRQITFTTPANAGKAVVYME
VDHNDKDGKGKAWFDEVQLEKGEVSSSYNPVQNSSFTSATENWNVSGASVDSEEGFNDDVSLKAARTSASQAGSVTKQTV
VLGQSANDKPVYLTLTGMSKASSVKFTDEKDYSLQANVTYADGSTGVYNAKFPSGTQEWNRAAVVIPKTKPINKVDISIL
FQKSATGTVWFDDIRLIEGSLLTKSTYDSNGNYVTKEEDELGYATSTDYDETGKKTAETDAKGEKTTYTYDQADQLTNMT
LSNGTSILHSYDKEGNEVSKTIRAGADQTYKYEYDVMGKLVKTTDPLGNVLASEYDANSNLTKTISPNGNEVSLSYDGTD
RVKSKSYNGTEKYNFTYDKNGNETSVVNKEQNTTKKRTFDNKNRLTELTDRGGSQTWTYPSDSDKLKTFSWTHGDQKGTN
QFTYNKLDQMIEMKDSTSSYSFDYDENGNVQTFITGNGGGTSFSYDERNLVSSLHIGDKNGGSILTESYEYDANGNRTTI
NSSASGKVKYEYGKLNQLVKETHEDGTVIEYTYDGFGNRKTVTTVKDGSSKTVNASFNIMNQLTKVNDESISYDKNGNRT
SDGKFTYTWDAEDNLTAVTKKGEDKPFATYKYDEKGNRIQKTVNGKVTNYFYDGDSLNVLYETDADNNVTKSYTYGDSGQ
LLSYTENGKKYFYHYNAHGDVIAISDSTGKTVAKYQYDAWGNPTKTEASDEVKDNRYRYAGYQYDEETGLYYLMARYYEP
RNGVFLSLDPDPGSDGDSLDQNGYTYGNNNPVMNVDPDGHWVWFVVNAGFAVYDGYKAYKSGKGWKGVAVAAASGFVGGG
KLKLTKKIGKWATSRHWYKGTFKTKRKSLDYHHNKHIVRNGKSYSKKRYTKVARAFYRSNKHLREKVILATGKKGYRIKN
GKRTGYYTRSGKVVTFVNNKWKKKKKR
>G4NYJ6 3.1.-.-~~~wapA~~~tRNA3(Ser)-specific nuclease WapA~~~
MKKRKRRTFKRFIAAFLVLSLMISLLPADVLAKTTEEEAGNRIVSDDPEETPRNEQTEEAVPFPSKDINKEGEITSERTE
NTKLYYEGDGVYKQEVYLDPIHTKETPNADWEDISPELKESTSKQVETENAILNSDFQKQMKNGLYATFEHNDHKVTYSL
VEAKGPNKTSLTPKDTSADYKTDSNEIVYPDVFPNIDLQTFTFNENIKEDLVLHQYDGYNTFTFQVKTDLQAKEQEDGSI
DFSDEKGKVVFSVPKPFMTDSKLDELSGEVERSDKVSYKLEKNEEGYLLHLTADENWLKDPERVYPVSIDPSTSLSVSSD
TFVMSAYPTTNYSASSQKWDANLKSYVLKTGYYDKTTGTNYAFMKFNNLKPIQNMAVTKATLKTYVAHSYYGTKATGLWL
DTVNSNYDNGKVTWNTKPASKNIGKAEVHKGQWASYDVTAAVKSWNSGGANYGFKLHTNGNGKEYWKKLISSANSANKPY
IEVTYTIPKGNTPSIKAYHNGDSTGYFDISWKKVEGAKGYKVWIYNGKEYQAISAGNVTSWSTKGKKIWPTSAEIASKRY
KLHVDGKDGAELALDPSPVYKNSGGSYATSKNYWIGVSAIFDQGEGAMSAPAKPVIPNVGKAQAPSTKGYNNGNATGYFD
LSWKAVSGATGYKVQVFNGKGFETLDLGNQTSWTTKGKKIWPTSAEIKAGKYALHLKDGNGAELPINPGPTYKNAGGDGA
KKNYSFKIIAYNKDGEAIASPAANPTLPDIAKPKNLTGYLYTNTKSSQTGYVNLIWEKVQNAKGYKVNIYNGKEYQSYDV
GDVDHWTTQNKNIWPTPEEIKAGSYKLHTDGKGRELALDPSPVYNNANGNYKGKKNYSFTLSAYNANGETIPTAPFNPTF
HEGAEFLGTEEYWSIIDIPSGQLNGATGNVIVNEEDLSIDGRGPGLGLSRTYNSLDTSDHLFGQGWYADAETSVISTDGG
AMYIDEDATTHRFTKKADGTYQPPTGVYLELTETADQFILKTKDQTNAYFNKKGGKLQKVVDGHNNATVYTYNDKNQLTA
ITDASGRKLTFTYDENGHVTSITGPKNKKVTYSYESDLLKKVTDTDGTVTSFDYDAEGRLVKQYSANSTEAKPVFTEYQY
SGHRLEKAINAKKETYVYSYDADKKTLLMTQPNGRKVQYGYNEAGNPIQVIDDAEGLKITTNTKYEGNNVVEDVDPNDVG
TGKATESYQYDKDGNVTSVKDAYGTETYEYNKNNDVTKMKDTEGNVTDIAYDGLDAVSETDQSGKSSSAAVYDKYGNQIQ
SSKDLSASTNILKDGSFEAQKSGWNLTASKDSGKISVIADKSGVLSGSKALEVLSQSTSAGTDHGYSSATQTVELEPNTT
YTLSGKIKTDLAKSRAYFNIDLRDKDQKRIQWIHNEYSALAGKNDWTKRQITFTTPANAGKAVVYMEVDHRDKDGKGKAW
FGEVQLEKGEVSSSYNPVQNSSFTAATENWSVSGASVDSEEGFNDDVSLKAARTSASQAGSVTKQTVVLGQNANDKPVYL
TLTGMSKASSVKFTDEKDYSLQANVTYADGSTGVYNAKFPSGTQEWNRAAVVIPKTKPINKVDISILFQKSATGTVWFDD
IRLIEGSLLTKSTYDSNGNYVTKEEDELGFSTSTDYDETGKKTAETDAKGEKTTYTYDQADQLTNMTLSNGTSILHSYDK
EGNEVSKTIRAGADQTYKYEYDVMGKLVKTTDPLGNVLASEYDANSNLTKTISPNGNEVSLSYDGTDRVKSKSYNGTEKY
NFTYDKNGNETSVVNKEQNTTKKRTFDNKNRLTELTDRGGSQTWTYPSDSDKLKTFSWSHGDQKGTNQFTYNKLDQMIEM
KDSTSSYSFDYDENGNVQTFITGNGGGTSFSYDERNLVSSLHIGDKNGGSILTESYEYDANGNRTTINSSASGKVQYEYG
KLNQLVKETHEDGTVIEYTYDGFGNRKTVTTVKDGSSKTVNASFNIMNQLTKVNDESISYDKNGNRTSDGKFTYTWDAED
NLTAVTKKGEDKPFATYKYDEKGNRIQKTVNGNVTNYFYDGDSLNVLYETDADNKVTKSYTYGDSGQLLSYTENGKKYFY
HYNAHGDVIAISDSSGKTLAKYQYDAWGNPTKTEASDEVKDNRYRYAGYQYDEETGLYYLMARYYEPRNGVFLSLDPDPG
SDGDSLDQNGYAYGNNNPVMNVDPDGHWVWLVVNAGFAAYDGYKAYKSGKGWKGAAWAAASNFGPGKIFKGAKRVYRFAK
SGKNFNWKHIKKDHGPKSKARMPNGQPKSKFRSAKTLRRTTKATARTRPAYVQKDGRTVHFKKFKKPIGRKTNGRHTYTV
KVVKSGRYVVTSYPY
>Q07833 3.1.-.-~~~wapA~~~tRNA nuclease WapA~~~COG3209
MKKRKRRNFKRFIAAFLVLALMISLVPADVLAKSTEEENGNRIAADDPEETLQKEQTEEAVPFDPKDINKEGEITSERTE
NTKLYYEGDGVYKQEVYLDPIHTKETPDADWEDISPELKESTSKQVETENAILNSDFQKQMKNGLYATFEHNDHKVTYSL
AEAKGPNKTSLTPKDTSADYKTDSNEIVYPDVFPNIDLQTFTFNENIKEDLVLHQYNGYNTFTFQLKTDLQAKEQEDGSI
DFSDEKGKVVFSVPKPFMTDSKLDELSGEVERSDKVSYKLEKNEEGYLLHLTADENWLKDPERVYPVSIDPSTSLSVSSD
TFVMSAYPTTNYSASSQKWDANLKAYVLKTGYYDKTTGTNYAFMKFNNLKPIQNMTVTKATLKTYVAHSYYGTKATGLWL
DTVNSNYDNAKVTWNTKPASKNIGKADVHKGQWASYDVTAAVKSWNSGGANYGFKLHTNGNGKEYWKKLISSANSANKPY
IEVTYTIPKGNTPTIKAYHNGDSTGYFDISWKKVEGAKGYKVWIYNGKEYQAISAGNVTSWSTKGKKIWPTSAEIASKRY
KLHLDGKDGAELALDPSPVYKNSGGSYATSKNYWIGVSAIFDQGEGAMSAPAKPVIPNVGKAQAPSAKGYNNGNATGYFD
LSWKAVSGATGYKVQVFNGKGFETLDLGNQTSWTTKGKKIWPTSAEIKAGKYALHLKDGSGAELPINPGPTYKNAGGDGA
KRNYSFKIIAYNKDGEAIASPAATPALPDIARPKNVTGYLYTNTKSSQTGYVNLIWEKVQNAKGYKVNIYNGKEYQSFDV
GDADHWTTQNKNIWPTSEEIKAGSYKLHTDGKGGELALDPSPVYNNANGNYKGKKNYSFTLVAYDANGETIPTAPFNPTF
HEGAEFLGTEEYWSIIDIPSGQLNGATGNVIVNEEDLSIDGRGPGLGLSRTYNSLDSSDHLFGQGWYADAETSVISTDGG
AMYIDEDATTHRFTKKADGTYQPPTGVYLELTETADQFILKTKDQTNAYFNKKGGKLQKVVDGHNNATVYTYNDKNQLTA
ITDASGRKLTFTYDENGHVTSITGPKNKKVTYSYENDLLKKVTDTDGTVTSYDYDSEGRLVKQYSANSTEAKPVFTEYQY
SGHRLEKAINAKKETYVYSYDADKKTLLMTQPNGRKVQYGYNEAGNPIQVIDDAEGLKITTNTKYEGNNVVEDVDPNDVG
TGKATESYQYDKDGNVTSVKDAYGTETYEYNKNNDVTKMKDTEGNVTDIAYDGLDAVSETDQSGKSSSAAVYDKYGNQIQ
SSKDLSASTNILKDGSFEAQKSGWNLTASKDSGKISVIADKSGVLSGSKALEVLSQSTSAGTDHGYSSATQTVELEPNTT
YTLSGKIKTDLAKSRAYFNIDLRDKDQKRIQWIHNEYSALAGKNDWTKRQITFTTPANAGKAVVYMEVDHKDKDGKGKAW
FDEVQLEKGEVSSSYNPVQNSSFTSATENWNVSGASVDSEEGFNDDVSLKAARTSASQAGSVTKQTVVLGQSANDKPVYL
TLTGMSKASSVKFTDEKDYSLQANVTYADGSTGIYNAKFPSGTQEWNRAAVVIPKTKPINKVDISILFQKSATGTVWFDD
IRLIEGSLLTKSTYDSNGNYVTKEEDELGYATSTDYDETGKKTSETDAKGEKTTYTYDQADQLTNMTLSNGTSILHSYDK
EGNEVSKTIRAGADQTYKFEYDVMGKLVKTTDPLGNVLASEYDANSNLTKTISPNGNEVSLSYDGTDRVKSKSYNGTEKY
IFTYDKNGNETSVVNKEQNTTKKRTFDNKNRLTELTDRGGSQTWTYPSDSDKLKTFSWIHGDQKGTNQFTYNKLDQMIEM
KDSTSSYSFDYDENGNVQTFITGNGGGTSFSYDERNLVSSLHIGDKNGGDILTESYEYDANGNRTTINSSASGKVQYEYG
KLNQLVKETHEDGTVIEYTYDGFGNRKTVTTIKDGSSKTVNASFNIMNQLTKVNDESISYDKNGNRTSDGKFTYTWDAED
NLTAVTKKGEDKPFATYKYDEKGNRIQKTVNGKVTNYFYDGDSLNVLYETDADNNVTKSYTYGDSGQLLSYTENGKKYFY
HYNAHGDIIAISDSTGKTVAKYQYDAWGNPTKTEASDEVKDNRYRYAGYQYDEETGLYYLMARYYEPRNGVFLSLDPDPG
SDGDSLDQNGYAYGNNNPVMNVDPDGHWVWLVVNAGFAAYDGYKAYKSGKGWKGAAWAAASNFGPGKIFKGASRAYKFTK
KAVKITGHTRHGLNQSIGRNGGRGVNLRAKLNAVRSPKKVIKQPNGATKYVGKKATVVLNKRGKVITAYGSSRAKGSKHV
FHTHGKGNKSKRRR
>D4G3R3 ~~~wapI~~~Immunity protein WapI~~~
MKFFKRYNIDQKTLDEFKKYYVLLHGPFPNDMYDFEEETNTSLDEFYEFFALITGSLNYIIEDKKIPRYQREMLKKTFYE
HYPHFRNYKSDILKYQELSECLEFHEKIRILINKLITGG
>G4NYJ5 ~~~wapI~~~Immunity protein WapI~~~
MILSSFLKIERMSSMHELKFRDLNVEVNEEDTVLLSMTESEIIVLTKKEIDYGAKYINHIVLYCNTDGRFLNSFSIQTNE
QVINVQKVDDSFLFLIDKEYEDSVRDVEPNIYLWNPIEGFHQSFYAGRYINSMIIDQNKNLWVGYDETGIFSCVDQEIST
RGINKFVLKNGKYELYFHGVSSYVIDQYFSTFVSEDAIYLYYRSMGEDYLQKLNLLGETLERVEVGIECSSCIKNGSSIY
LFSRDDDSYNIEKVFKTNDMQNYVEQKISNENNGESLCFTQVASYKDKVAGIDHNNKLFLLNNQSL
>Q07836 ~~~wapI~~~Immunity protein WapI~~~
MAKIKDDCIELELTPRRYQELDDDPFILSVFELLENKKAVVRDFSAVLLESEYKVLISGIETMIKGNQDSISLETIEPFL
FLSIDQENGNYRIKIKIIFDDYKESKSNSNLFEINCNEEKLESFVTALKLNLENTKNPSKLP
>Q03084 2.4.1.303~~~wbbD~~~UDP-Gal:alpha-D-GlcNAc-diphosphoundecaprenol beta-1,3-galactosyltransferase~~~
MSDDTPKFSVLMAIYIKDSPLFLSEALQSIYKNTVAPDEVIIIRDGKVTSELNSVIDSWRRYLNIKDFTLEKNMGLGAAL
NFGLNQCMHDLVIRADSDDINRTNRFECILDFMTKNGDVHILSSWVEEFEFNPGDKGIIKKVPSRNSILKYSKNRSPFNH
PAVAFKKCEIMRVGGYGNEYLYEDYALWLKSLANGCNGDNIQQVLVDMRFSKETAKRRGGIKYAISEIKAQYHFYRANYI
SYQDFIINIITRIFVRLLPTSFRGYIYKKVIRRFL
>P37749 2.4.1.-~~~wbbI~~~Beta-1,6-galactofuranosyltransferase WbbI~~~COG0438
MYFLNDLNFSRRDAGFKARKDALDIASDYENISVVNIPLWGGVVQRIISSVKLSTFLCGLENKDVLIFNFPMAKPFWHIL
SFFHRLLKFRIVPLIHDIDELRGGGGSDSVRLATCDMVISHNPQMTKYLSKYMSQDKIKDIKIFDYLVSSDVEHRDVTDK
QRGVIYAGNLSRHKCSFIYTEGCDFTLFGVNYENKDNPKYLGSFDAQSPEKINLPGMQFGLIWDGDSVETCSGAFGDYLK
FNNPHKTSLYLSMELPVFIWDKAALADFIVDNRIGYAVGSIKEMQEIVDSMTIETYKQISENTKIISQKIRTGSYFRDVL
EEVIDDLKTR
>P36667 2.4.1.-~~~wbbL~~~Rhamnosyltransferase WbbL~~~
MVYIIIVSHGHEDYIKKLLENLNADDEHYKIIVRDNKDSLLLKQICQHYAGLDYISGGVYGFGHNNNIAVAYVKEKYRPA
DDDYILFLNPDIIMKHDDLLTYIKYVESKRYAFSTLCLFRDEAKSLHDYSVRKFPVLSDFIVSFMLGINKTKIPKESIYS
DTVVDWCAGSFMLVRFSDFVRVNGFDQGYFMYCEDIDLCLRLSLAGVRLHYVPAFHAIHYAHHDNRSFFSKAFRWHLKST
FRYLARKRILSNRNFDRISSVFHP
>P9WMY3 2.4.1.289~~~wbbL~~~N-acetylglucosaminyl-diphospho-decaprenol L-rhamnosyltransferase~~~COG1216
MTDVLPVVAVTYSPGPHLERFLASLSLATERPVSVLLADNGSTDGTPQAAVQRYPNVRLLPTGANLGYGTAVNRTIAQLG
EMAGDAGEPWVDDWVIVANPDVQWGPGSIDALLDAASRWPRAGALGPLIRDPDGSVYPSARQMPSLIRGGMHAVLGPFWP
RNPWTTAYRQERLEPSERPVGWLSGSCLLVRRSAFGQVGGFDERYFMYMEDVDLGDRLGKAGWLSVYVPSAEVLHHKAHS
TGRDPASHLAAHHKSTYIFLADRHSGWWRAPLRWTLRGSLALRSHLMVRSSLRRSRRRKLKLVEGRH
>J7I4B7 ~~~wbdD~~~O-antigen chain terminator bifunctional methyltransferase/kinase WbdD~~~
MTKDLNTLVSELPEIYQTIFGHPEWDGDAARDCNQRLDLITEQYDNLSRALGRPLNVLDLGCAQGFFSLSLASKGATIVG
IDFQQENINVCRALAEENPDFAAEFRVGRIEEVIAALEEGEFDLAIGLSVFHHIVHLHGIDEVKRLLSRLADVTQAVILE
LAVKEEPFYWGVSQPDDPRELIEQCAFYRLIGEFDTHLSPVPRPMYLVSNHRVLINDFNQPFQHWQNQPYAGAGLAHKRS
RRYFFGEDYVCKFFYYDMPHGILTAEESQRNKYELHNEIKFLTQPPAGFDAPAVLAHGENAQSGWLVMEKLPGRLLSDML
AAGEEIDREKILGSLLRSLAALEKQGFWHDDVRPWNVMVDARQHARLIDFGSIVTTPQDCSWPTNLVQSFFVFVNELFAE
NKSWNGFWRSAPVHPFNLPQPWSNWLYAVWQEPVERWNFVLLLALFEKKAKLPSAEQQRGATEQWIIAQETVLLELQSRV
RNESAGSEALRGQIHTLEQQMAQLQSAQDAFVEKAQQQVEVSHELTWLGENMEQLAALLQTAQAHAQADVQPELPPETAE
LLQRLEAANREIHHLSNENQQLRQEIEKIHRSRSWRMTKGYRYLGLQIHLLRQYGFVQRCKHFIKRVLRFVFSFMRKHPQ
VKHTAVNGLHKLGLYQPAYRLYRRMNPLPHSQYQADAQILSQTELQVMHPELLPPEVYEIYLKLTKNK
>Q2SYH7 5.1.3.25~~~wbiB~~~dTDP-L-rhamnose 4-epimerase~~~
MSDVNASLVDGKKILVTGGAGFIGCAISERLAARASRYVVMDNLHPQIHANAVRPVALHEKAELVVADVTDAGAWDALLS
DFQPEIIIHLAAETGTGQSLTEASRHALVNVVGTTRLTDAIVKHGIAVEHILLTSSRAVYGEGAWQKADGTIVYPGQRGR
AQLEAAQWDFPGMTMLPSRADRTEPRPTSVYGATKLAQEHVLRAWSLATKTPLSILRLQNVYGPGQSLTNSYTGIVALFS
RLAREKKVIPLYEDGNVTRDFVSIDDVADAIVATLAREPEALSLFDIGSGQATSILDMARIIAAHYGAPEPQVNGAFRDG
DVRHAACDLSESLANLGWKPQWSLERGIGELQTWIAQELDRKN
>Q9XC60 1.1.1.367~~~wbjC~~~UDP-2-acetamido-2,6-beta-L-arabino-hexul-4-ose reductase~~~
MKVLVTGANGFVGRNLCAHLAERGGIEVVPFTRESSVGNLPELIRSVDFIFHLAGVNRPEKPEEFKIGNSELTYALCEAV
RSNGRAIPLLYTSSIQAEVDNEYGLSKRAAEEHLQVLGEDIGCPVYIFRLPNVFGKWSRPNYNSAVATFCHNIIRDIPIQ
INNSSAEITLVYIDDVVRTFMKVMDGKLSNAVSLQVEPQYQISVGELAEQLYEFRNSRKSLTTARVGSGLTRALYSTYLS
FLPEDSFSYDVPMHSDPRGTFVEMLKTADSGQFSFFTAHPGVTRGGHYHHSKTEKFLVIKGMARFKFRNILTGAFYEICT
NGEKAEIVETVPGWTHDITNVGTDDMVVMLWANEVFDRENPDTYACSVGEGA
>F8WJP6 2.1.2.14~~~wbkC~~~GDP-perosamine N-formyltransferase~~~COG0223
MAIAPNTRVLVAGYGLPAEFCVTTLIGMGVEIDKIAVATHREDNRNCGLHSMLRLRNIQFTTAAANSEEFYEFGANFDPD
MIISMHYRSLIPGRFLKLAKKGSVNLHPSLLPAYRGTNSVAWVIINGESETGFSYHRMDENFDTGAILLQERISVEETDT
AFSLFHRQIARAMLRLEEVILKLDQGDPGFAQLGEASYYARELPFGGVIDPRWSEVQIDRFIRAMFFPPFPPAVLKIDGK
VYYVPSIDIYRSLMRGIPS
>P0DMP6 2.4.1.306~~~wbnH~~~O-antigen biosynthesis glycosyltransferase WbnH~~~
MKNVGFIVTKSEIGGAQTWVNEISNLIKEECNIFLITSEEGWLTHKDVFAGVFVIPGIKKYFDFLTLFKLRKILKENNIS
TLIASSANAGVYARLVRLLVDFKCIYVSHGWSCLYNGGRLKSIFCIVEKYLSLLTDVIWCVSKSDEKKAIENIGIKEPKI
ITVSNSVPQMPRCNNKQLQYKVLFVGRLTHPKRPELLANVISKKPQYSLHIVGGGERLESLKKQFSECENIHFLGEVNNF
YNYHEYDLFSLISDSEGLPMSGLEAHTAAIPLLLSDVGGCFELIEGNGLLVENTEDDIGYKLDKIFDDYENYREQAIRAS
GKFVIENYASAYKSIILG
>Q5JBG6 2.4.1.309~~~wbnI~~~O-antigen biosynthesis glycosyltransferase WbnI~~~
MVINIFYICTGEYKRFFDKFYLSCEDKFIPEFGKKYYVFTDSDRIYFSKYLNVEVINVEKNCWPLNTLLRFSYFLKVIDK
LQTNSYTFFFNANAVIVKEIPFSTFMESDLIGVIHPGYKNRISILYPWERRKNATCYLGYLKKGIYYQGCFNGGKTASFK
RLIQICNMMTMADLKKNLIAKVHDESYLNYYYYYNKPLLLSELYSWPEKYGENKDAKIIMRDKERESWYGNIKK
>Q4KXC9 2.4.1.122~~~wbnJ~~~O-antigen biosynthesis glycosyltransferase WbnJ~~~
MSLRILDMISVIMAVHRYDKYVDISIDSILNQTYSDFELIIIANGGDCFEIAKQLKHYTELDNRVKIYTLEIGQLSFALN
YAVTKCKYSIIARMDSDDVSLPLRLEKQYMYMLQNDLEMVGTGIRLINENGEFIKELKYPNHNKINKILPFKNCFAHPTL
MFKKDVILKQRGYCGGFNSEDYDLWLRILNECPNIRWDNLSECLLNYRIHNKSTQKSALAYYECASYSLREFLKKRTITN
FLSCLYHFCKALIK
>Q58YV9 2.4.1.308~~~wbnK~~~O-antigen biosynthesis glycosyltransferase WbnK~~~
MYSCLSGGLGNQMFQYAAAYILQRKLKQRSLVLDDSYFLDCSNRDTRRRFELNQFNICYDRLTTSKEKKEISIIRHVNRY
RLPLFVTNSIFGVLLKKNYLPEAKFYEFLNNCKLQVKNGYCLFSYFQDATLIDSHRDMILPLFQINEDLLNLCNDLHIYK
KVICENANTTSLHIRRGDYITNPHASKFHGVLPMDYYEKAIRYIEDVQGEQVIIVFSDDVKWAENTFANQPNYYVVNNSE
CEYSAIDMFLMSKCKNNIIANSTYSWWGAWLNTFEDKIVVSPRKWFAGNNKSKLTMDSWINL
>G3XD23 1.1.1.335~~~wbpB~~~UDP-N-acetyl-2-amino-2-deoxy-D-glucuronate oxidase~~~
MKNFALIGAAGYIAPRHMRAIKDTGNCLVSAYDINDSVGIIDSISPQSEFFTEFEFFLDHASNLKRDSATALDYVSICSP
NYLHYPHIAAGLRLGCDVICEKPLVPTPEMLDQLAVIERETDKRLYNILQLRHHQAIIALKDKVAREKSPHKYEVDLTYI
TSRGNWYLKSWKGDPRKSFGVATNIGVHFYDMLHFIFGKLQRNVVHFTSEYKAAGYLEYEQARVRWFLSVDANDLPESVK
GKKPTYRSITVNGEEMEFSEGFTDLHTTSYEEILAGRGYGIDDARHCVETVNTIRSAVIVPASDNEGHPFVAALAR
>G3XD01 2.3.1.201~~~wbpD~~~UDP-2-acetamido-3-amino-2,3-dideoxy-D-glucuronate N-acetyltransferase~~~
MSYYQHPSAIVDDGAQIGSDSRVWHFVHICAGARIGAGVSLGQNVFVGNKVVIGDRCKIQNNVSVYDNVTLEEGVFCGPS
MVFTNVYNPRSLIERKDQYRNTLVKKGATLGANCTIVCGVTIGEYAFVGAGAVINKNVPSYALMVGVPARQIGWMSEFGE
QLQLNEQGEAVCSHSGARYVLNGKILSKVDV
>Q9HZ76 2.6.1.98~~~wbpE~~~UDP-2-acetamido-2-deoxy-3-oxo-D-glucuronate aminotransferase~~~
MIEFIDLKNQQARIKDKIDAGIQRVLRHGQYILGPEVTELEDRLADFVGAKYCISCANGTDALQIVQMALGVGPGDEVIT
PGFTYVATAETVALLGAKPVYVDIDPRTYNLDPQLLEAAITPRTKAIIPVSLYGQCADFDAINAIASKYGIPVIEDAAQS
FGASYKGKRSCNLSTVACTSFFPSKPLGCYGDGGAIFTNDDELATAIRQIARHGQDRRYHHIRVGVNSRLDTLQAAILLP
KLEIFEEEIALRQKVAAEYDLSLKQVGIGTPFIEVNNISVYAQYTVRMDNRESVQASLKAAGVPTAVHYPIPLNKQPAVA
DEKAKLPVGDKAATQVMSLPMHPYLDTASIKIICAALTN
>G3XD61 5.1.3.23~~~wbpI~~~UDP-2,3-diacetamido-2,3-dideoxy-D-glucuronate 2-epimerase~~~
MKILTIIGARPQFIKASVVSKAIIEQQTLSEIIVHTGQHFDANMSEIFFEQLGIPKPDYQLDIHGGTHGQMTGRMLMEIE
DVILKEKPHRVLVYGDTNSTLAGALAASKLHVPIAHIEAGLRSFNMRMPEEINRILTDQVSDILFCPTRVAIDNLKNEGF
ERKAAKIVNVGDVMQDSALFFAQRATSPIGLASQDGFILATLHRAENTDDPVRLTSIVEALNEIQINVAPVVLPLHPRTR
GVIERLGLKLEVQVIDPVGYLEMIWLLQRSGLVLTDSGGVQKEAFFFGKPCVTMRDQTEWVELVTCGANVLVGAARDMIV
ESARTSLGKTIQDDGQLYGGGQASSRIAEYLAKL
>Q9HTC0 2.4.1.-~~~wbpZ~~~D-rhamnosyltransferase WbpZ~~~
MRVLHFYKTYLSETVGGIEQVIFQLCESSGSWGIDNHVLTLSSDPHPPVVPFGGHVVHRARLDLQLASTGFSLSVFKQFR
ELAAEADVVNYHFPWPFMDLVHFLTGMNKPSVVTYHSDIIRQRVLLKLYRPLMSRFLHSVDRIVAASPNYFSTSDVLRQY
REKTRVITYGLDKDGYPKPATQRLEHWREKLGPRFFLFVGVMRYYKGLHILLDALQGTDYPVVIVGAGPLQAELYAQAAA
LGLRNVHFLGRVDDEDKVALLQLSYAMVFPSHLRSEAFGISLLEGAMYGKPMISSEIGTGTSYINIHGETGLVVPPSQPA
AFRQAMRWLWEHPQQAEEMGRNAEARYRQLFTAEEMGRRWSELYRELLEEKASSRYVKAAR
>P71241 2.7.8.31~~~wcaJ~~~UDP-glucose:undecaprenyl-phosphate glucose-1-phosphate transferase~~~COG2148
MTNLKKRERAKTNASLISMVQRFSDITIMFAGLWLVCEVSGLSFLYMHLLVALITLVVFQMLGGITDFYRSWRGVRAATE
FALLLQNWTLSVIFSAGLVAFNNDFDTQLKIWLAWYALTSIGLVVCRSCIRIGAGWLRNHGYNKRMVAVAGDLAAGQMLM
ESFRNQPWLGFEVVGVYHDPKPGGVSNDWAGNLQQLVEDAKAGKIHNVYIAMQMCDGARVKKLVHQLADTTCSVLLIPDV
FTFNILHSRLEEMNGVPVVPLYDTPLSGVNRLLKRAEDIVLATLILLLISPVLCCIALAVKLSSPGPVIFRQTRYGMDGK
PIKVWKFRSMKVMENDKVVTQATQNDPRVTKVGNFLRRTSLDELPQFINVLTGGMSIVGPRPHAVAHNEQYRQLIEGYML
RHKVKPGITGWAQINGWRGETDTLEKMEKRVEFDLEYIREWSVWFDIKIVFLTVFKGFVNKAAY
>B3FN88 2.7.8.40~~~wecA~~~UDP-N-acetylgalactosamine-undecaprenyl-phosphate N-acetylgalactosaminephosphotransferase~~~COG2148
MSYQRRHSRWYERVLFSPPSLFFLGAMLAVCLPALERWGWGFWEYFDAVRVNTLGGAFVAFLLTGIVLYRFLRYPGASPV
AYMIPTVTTLYGSLVGALFFLRLPYSRQVLFESYVVALLCCWVVYFIGRRYRTPKYALLPFGDYQPLMHHTCVEWRLLDK
PDLGAVRYDAVVADLRDDDLAGEWERFLARCALAHIPVYHIKQISETLTGRVKIDHLHENQLGSLLPSPIYAFIKRGMDI
LAAVIAIPLFSPLMLATAVLIKLESPGPVMFLQNRVGKGNRDFRIYKFRSMCQNSEQHGAQFAQDGDMRVTRVGKVIRKL
RIDELPQFFNVLKGDMSLIGPRPEQRTFVDQFDREIPFYMYRHIVRPGISGWAQVVHGYAADADDTRIKIEHDFYYIKNF
SLWLDVLIVFKTIRTILTGFGAR
>P0AC78 2.7.8.33~~~wecA~~~Undecaprenyl-phosphate alpha-N-acetylglucosaminyl 1-phosphate transferase~~~COG0472
MNLLTVSTDLISIFLFTTLFLFFARKVAKKVGLVDKPNFRKRHQGLIPLVGGISVYAGICFTFGIVDYYIPHASLYLACA
GVLVFIGALDDRFDISVKIRATIQAAVGIVMMVFGKLYLSSLGYIFGSWEMVLGPFGYFLTLFAVWAAINAFNMVDGIDG
LLGGLSCVSFAAIGMILWFDGQTSLAIWCFAMIAAILPYIMLNLGILGRRYKVFMGDAGSTLIGFTVIWILLETTQGKTH
PISPVTALWIIAIPLMDMVAIMYRRLRKGMSPFSPDRQHIHHLIMRAGFTSRQAFVLITLAAALLASIGVLAEYSHFVPE
WVMLVLFLLAFFLYGYCIKRAWKVARFIKRVKRRLRRNRGGSPNLTK
>A0R211 2.7.8.35~~~wecA~~~Decaprenyl-phosphate N-acetylglucosaminephosphotransferase~~~COG0472
MLQYGAPVITATRETGMDSQVVLALSDTGAGVPLRELALVGLTAAIITYFATGWVRVLAIRFGAVAYPRERDVHVQPTPR
MGGLAMYIGVASAVLLASQLPALTRGFVYSTGMPAVVVAGGLIMAIGLIDDRWGLDALTKFAGQITAASVLVTMGVAWSV
LYIPIGGVGTIVLDQVSSILLTLALTVSIINAMNFVDGLDGLAAGLGLITALAICVFSVGLLRDHGGDVLFYPPAVISVV
LAGACLGFLPHNFHRAKIFMGDSGSMLIGLMLGAASTTAAGPISQNAYGARDVFALLSPFLLVVAVMLVPALDTLLAIVR
RTRAGRSPLSPDKMHLHHRLLQIGHSHRRAVLLIYLWVGIIAFGAASTIFFDPGQTAMVMGVAIVVAIVVTLIPLLRRGP
DGAQEP
>P9WMW5 2.7.8.35~~~wecA~~~Decaprenyl-phosphate N-acetylglucosaminephosphotransferase~~~COG0472
MQYGLEVSSDVAGVAGGLLALSYRGAGVPLRELALVGLTAAIITYFATGPVRMLASRLGAVAYPRERDVHVTPTPRMGGL
AMFLGIVGAVFLASQLPALTRGFVYSTGMPAVLVAGAVIMGIGLIDDRWGLDALTKFAGQITAASVLVTMGVAWSVLYIP
VGGVGTIVLDQASSILLTLALTVSIVNAMNFVDGLDGLAAGLGLITALAICMFSVGLLRDHGGDVLYYPPAVISVVLAGA
CLGFLPHNFHRAKIFMGDSGSMLIGLMLAAASTTAAGPISQNAYGARDVFALLSPFLLVVAVMFVPMLDLLLAIVRRTRA
GRSAFSPDKMHLHHRLLQIGHSHRRVVLIIYLWVGIVAFGAASSIFFNPRDTAAVMLGAIVVAGVATLIPLLRRGDDYYD
PDLD
>Q9X1N5 2.7.8.33~~~wecA~~~Undecaprenyl-phosphate alpha-N-acetylglucosaminyl 1-phosphate transferase~~~COG0472
MWEAIISFFLTSVLSVFAKKTEFLDRPDSRKSHGRAVPPVGGVSIFLTLLIFERDNPFFLFSIPLFLLGLLDDLFDLSYR
IKLAVTALVAVWFSTAVTIEVSIFGARIHPVFFVIWFVGMVNAFNVVDGLDGLLSGISLFSSLMIGERSLAFSIIGFLPW
NLPDAKVFLGNSGSFLLGAYLSTASVVFFEGDLGYATLFLGFPFYEIVFSFVRRLVVKKNPFSPDEKHTHHVFSRKIGKW
KTLLILVSFSLMFNLLGLSQKFYFIFLYVVLCCVLLFTYCVLQRGNGNLKL
>P27828 5.1.3.14~~~wecB~~~UDP-N-acetylglucosamine 2-epimerase~~~COG0381
MKVLTVFGTRPEAIKMAPLVHALAKDPFFEAKVCVTAQHREMLDQVLKLFSIVPDYDLNIMQPGQGLTEITCRILEGLKP
ILAEFKPDVVLVHGDTTTTLATSLAAFYQRIPVGHVEAGLRTGDLYSPWPEEANRTLTGHLAMYHFSPTETSRQNLLREN
VADSRIFITGNTVIDALLWVRDQVMSSDKLRSELAANYPFIDPDKKMILVTGHRRESFGRGFEEICHALADIATTHQDIQ
IVYPVHLNPNVREPVNRILGHVKNVILIDPQEYLPFVWLMNHAWLILTDSGGIQEEAPSLGKPVLVMRDTTERPEAVTAG
TVRLVGTDKQRIVEEVTRLLKDENEYQAMSRAHNPYGDGQACSRILEALKNNRISL
>P27829 1.1.1.336~~~wecC~~~UDP-N-acetyl-D-mannosamine dehydrogenase~~~COG0677
MSFATISVIGLGYIGLPTAAAFASRQKQVIGVDINQHAVDTINRGEIHIVEPDLASVVKTAVEGGFLRASTTPVEADAWL
IAVPTPFKGDHEPDMTYVESAARSIAPVLKKGALVILESTSPVGSTEKMAEWLAEMRPDLTFPQQVGEQADVNIAYCPER
VLPGQVMVELIKNDRVIGGMTPVCSARASELYKIFLEGECVVTNSRTAEMCKLTENSFRDVNIAFANELSLICADQGINV
WELIRLANRHPRVNILQPGPGVGGHCIAVDPWFIVAQNPQQARLIRTAREVNDHKPFWVIDQVKAAVADCLAATDKRASE
LKIACFGLAFKPNIDDLRESPAMEIAELIAQWHSGETLVVEPNIHQLPKKLTGLCTLAQLDEALATADVLVMLVDHSQFK
VINGDNVHQQYVVDAKGVWR
>Q8FBQ3 2.3.1.210~~~wecD~~~dTDP-fucosamine acetyltransferase~~~COG0456
MPVRASIEPLTWENAFFGVNSAIVRITSEAPLLTPDALAPWSRVQAKIAASNTGELDALQQLGFSLVEGEVDLALPVNNV
SDSGAVVAQETDIPALRQLASAAFAQSRFRAPWYAPDASGRFYAQWIENAVRGTFDHQCLILRAASGDIRGYVSLRELNA
TDARIGLLAGRGAGAELMQTALNWAYARGKTTLRVATQMGNTAALKRYIQSGANVESTAYWLYR
>P27833 2.6.1.59~~~wecE~~~dTDP-4-amino-4,6-dideoxygalactose transaminase~~~COG0399
MIPFNAPPVVGTELDYMQSAMGSGKLCGDGGFTRRCQQWLEQRFGSAKVLLTPSCTASLEMAALLLDIQPGDEVIMPSYT
FVSTANAFVLRGAKIVFVDVRPDTMNIDETLIEAAITDKTRVIVPVHYAGVACEMDTIMALAKKHNLFVVEDAAQGVMST
YKGRALGTIGHIGCFSFHETKNYTAGGEGGATLINDKALIERAEIIREKGTNRSQFFRGQVDKYTWRDIGSSYLMSDLQA
AYLWAQLEAADRINQQRLALWQNYYDALAPLAKAGRIELPSIPDGCVQNAHMFYIKLRDIDDRSALINFLKEAEIMAVFH
YIPLHGCPAGEHFGEFHGEDRYTTKESERLLRLPLFYNLSPVNQRTVIATLLNYFS
>P56258 2.4.1.325~~~wecF~~~TDP-N-acetylfucosamine:lipid II N-acetylfucosaminyltransferase~~~COG0554
MTVLIHVLGSDIPHHNRTVLRFFNDALAATSEHAREFMVVGKDDGLSDSCPALSVQFFPGKKSLAEAVIAKAKANRQQRF
FFHGQFNPTLWLALLSGGIKPSQFFWHIWGADLYELSSGLRYKLFYPLRRLAQKRVGCVFATRGDLSFFAKTHPKVRGEL
LFFPTRMDPSLNTMANDRQREGKMTILVGNSGDRSNEHIAALRAVHQQFGDTVKVVVPMGYPPNNEAYIEEVRQAGLELF
SEENLQILSEKLEFDAYLALLRQCDLGYFIFARQQGIGTLCLLIQAGIPCVLNRENPFWQDMTEQHLPVLFTTDDLNEDI
VREAQRQLASVDKNTIAFFSPNYLQGWQRALAIAAREVA
>P27836 2.4.1.180~~~wecG~~~UDP-N-acetyl-D-mannosaminuronic acid transferase~~~COG1922
MNNNTTAPTYTLRGLQLIGWRDMQHALDYLFADGQLKQGTLVAINAEKMLTIEDNAEVRELINAAEFKYADGISVVRSVR
KKYPQAQVSRVAGADLWEELMARAGKEGTPVFLVGGKPEVLAQTEAKLRNQWNVNIVGSQDGYFKPEQRQALFERIHASG
AQIVTVAMGSPKQEIIMRDCRLVHPDALYMGVGGTYDVFTGHVKRAPKIWQTLGLEWLYRLLSQPSRIKRQLRLLRYLRW
HYTGNL
>P37669 2.3.1.-~~~wecH~~~O-acetyltransferase WecH~~~COG3274
MQPKIYWIDNLRGIACLMVVMIHTTTWYVTNAHSVSPVTWDIANVLNSASRVSVPLFFMISGYLFFGERSAQPRHFLRIG
LCLIFYSAIALLYIALFTSINMELALKNLLQKPVFYHLWFFFAIAVIYLVSPLIQVKNVGGKMLLVLMAVIGIIANPNTV
PQKIDGFEWLPINLYINGDTFYYILYGMLGRAIGMMDTQHKALSWVSAALFATGVFIISRGTLYELQWRGNFADTWYLYC
GPMVFICAIALLTLVKNTLDTRTIRGLGLISRHSLGIYGFHALIIHALRTRGIELKNWPILDIIWIFCATLAASLLLSML
VQRIDRNRLVS
>Q077R2 2.4.1.305~~~wfaP~~~UDP-Glc:alpha-D-GlcNAc-diphosphoundecaprenol beta-1,3-glucosyltransferase WfaP~~~
MELVSIIIAAYNCKDTIYATVESALSQTYKNIEIIICDDSSTDDTWDIINKIKDSRIICIKNNYCKGAAGARNCALKIAK
GRYIAFLDSDDYWVTTKISNQIHFMETEKVFFSYSNYYIEKDFVITGVFSSPPEINYGAMLKYCNIACSTVILDRTGVKN
ISFPYIDKEDYALWLNILSKGIKARNTNLVDTYYRVHAGSVSANKFKELIRQSNVLKSIGIKAHHRIICLFYYAINGLIK
HCFSYRDKRNA
>B5L3X1 2.4.1.304~~~wfeD~~~UDP-Gal:alpha-D-GlcNAc-diphosphoundecaprenol beta-1,4-galactosyltransferase~~~
MIDNLIKRTPEINRLLENKRVTGVVTFVNPYSYYKIKEYNKISQLDYIYIDGILLLKLFNFVNGTKIKRHSFDYSSIAKT
VFNYSIQNKMKIGLIGSKDYEIEQAVKNIRKKHPGIDISYFHSGYFSSLEEKSSVIDSVIKKSDIIICGLGTPAQEELAL
DIKIKSNEHLIFTCGGFFTQTASRADFYYPWIKRYNLMWLQRIVLYKHVRKRFFIDYPKFIVRFISENLMKIFTRSN
>B5L3F2 2.4.1.305~~~wfgD~~~UDP-Glc:alpha-D-GlcNAc-diphosphoundecaprenol beta-1,3-glucosyltransferase WfgD~~~
MDDYLVSIIMPSYNAEHTISASISSVLKQTYANWELLVCDDDSSDNTRFKVLEFSDSRIKLLTNEYAKGAAGARNTALKY
ASGRFIAFLDSDDIWIANKLEMQISMMLKNNISFMYGNYEIINNNSIVGKFVAPQKITYNKLLKNCGIGCLTVVLDRTLL
NPFSFPFVHKEDYYLWLSILKDNNISAINCGFICSKYRLSQSSISSNKFKELKRQWDVLGDFVENPLARIYYLLNYIVIG
IKKHAFDYKNGKK
>O53353 ~~~whiB2~~~Transcriptional regulator WhiB2~~~
MVPEAPAPFEEPLPPEATDQWQDRALCAQTDPEAFFPEKGGSTREAKKICMGCEVRHECLEYALAHDERFGIWGGLSERE
RRRLKRGII
>Q7D5T7 ~~~whiB2~~~Transcriptional regulator WhiB2~~~
MVPEAPAPFEEPLPPEATDQWQDRALCAQTDPEAFFPEKGGSTREAKKICMGCEVRHECLEYALAHDERFGIWGGLSERE
RRRLKRGII
>P71592 ~~~whiB5~~~Transcriptional regulator WhiB5~~~
MAHPCATDPELWFGYPDDDGSDGAAKARAYERSATQARIQCLRRCPLLQQRRCAQHAVEHRVEYGVWAGIKLPGGQYRKR
EQLAAAHDVLRRIAGGEINSRQLPDNAALLARNEGLEVTPVPGVVVHLPIAQVGPQPAA
>P0DKR7 ~~~whiB5~~~Transcriptional regulator WhiB5~~~
MAHPCATDPELWFGYPDDDGSDGAAKARAYERSATQARIQCLRRCPLLQQRRCAQHAVEHRVEYGVWAGIKLPGGQYRKR
EQLAAAHDVLRRIAGGEINSRQLPDNAALLARNEGLEVTPVPGVVVHLPIAQVGPQPAA
>Q6MX01 ~~~whiB7~~~Probable transcriptional regulator WhiB7~~~
MSVLTVPRQTPRQRLPVLPCHVGDPDLWFADTPAGLEVAKTLCVSCPIRRQCLAAALQRAEPWGVWGGEIFDQGSIVSHK
RPRGRPRKDAVA
>Q8VJ53 ~~~whiB7~~~Probable transcriptional regulator WhiB7~~~
MSVLTVPRQTPRQRLPVLPCHVGDPDLWFADTPAGLEVAKTLCVSCPIRRQCLAAALQRAEPWGVWGGEIFDQGSIVSHK
RPRGRPRKDAVA
>O06975 ~~~whiA~~~Probable cell division protein WhiA~~~COG1481
MSFASETKKELTNLEVKDCCINAELSALIRMNGALSFTNRHLVLDVQTENAAIARRIYTLLKKQYDVSVELLVRKKMRLK
KNNVYIVRFSENAKAILEDLKILGENFVFERSISKELVKKRCCKRSYMRGAFLAGGSVNNPETSSYHLEIFSLYKEHNDS
LCDLLNEFQLNSKTLERKKGYITYLKEAEKITEFLNVIGAHNSLLRFEDVRIVRDMRNSVNRLVNCETANLNKTIGASLR
QVENIKYIDERIGLEALPEKLREIAQLRIDYQEVTLKELGEMVASGKISKSGINHRLRKLDEIAEQLRTGQTVTLK
>A0QWV9 ~~~whiA~~~Probable cell division protein WhiA~~~COG1481
MAMTAEVKDELSRLVVNSVSARRAEVASLLRFAGGLHIVAGRVVVEAEVDLGIIARRLRKDIYDLYGYNAVVHVLSASGI
RKNTRYVVRVANDGEALARQTGLLDMRGRPVRGLPAQVVGGSVGDAEAAWRGAFLAHGSLTEPGRSSALEVSCPGPEAAL
ALVGAARRLGVSAKAREVRGSDRVVVRDGEAIGALLTRMGAQDTRLTWEERRMRREVRATANRLANFDDANLRRSARAAV
AAAARVERALEILGDSVPDHLAAAGKLRVEHRQASLEELGRLADPPMTKDAVAGRIRRLLSMADRKAKQEGIPDTESAVT
PDLLDDA
>P9WF45 ~~~whiA~~~Probable cell division protein WhiA~~~COG1481
MAMTTDVKDELSRLVVKSVSARRAEVTSLLRFAGGLHIVGGRVVVEAELDLGSIARRLRKEIFELYGYTAVVHVLSASGI
RKSTRYVLRVANDGEALARQTGLLDMRGRPVRGLPAQVVGGSIDDAEAAWRGAFLAHGSLTEPGRSSALEVSCPGPEAAL
ALVGAARRLGVGAKAREVRGADRVVVRDGEAIGALLTRMGAQDTRLVWEERRLRREVRATANRLANFDDANLRRSARAAV
AAAARVERALEILGDTVPEHLASAGKLRVEHRQASLEELGRLADPPMTKDAVAGRIRRLLSMADRKAKVDGIPDTESVVT
PDLLEDA
>Q9Z515 ~~~whiA~~~Probable cell division protein WhiA~~~COG1481
MTAAVKSEISQLPVTRTCCRKAEVSAVLRFAGGLHLVSGRIVIEAELDTGNAARRLKRDILEIFGHSSELIVMAPGGLRR
GSRFVVRVVAGGDQLARQTGLVDGRGRPIRGLPPQVVSGATCDAEAAWRGAFLAHGSLTEPGRSSSLEVTCPGPEAALAL
VGAARRLSIPAKAREVRGVDRVVVRDGDAIGALLTRLGAHDSVLAWEERRLRREVRATANRLANFDDANLRRSARAAVAA
GARVQRALEILADDVPEHLAAAGRLRMEHKQASLEELGALADPPLTKDAVAGRIRRLLAMADKRASDLGIAGTDANLGEE
ELADNLVG
>F2RGQ4 ~~~whiA~~~Probable cell division protein WhiA~~~COG1481
MTAAVKDEISRLPVTRTCCRKAEVSSILRFAGGLHLVSGRIVIEAELDTAMAARRLKRDILEIFGHSSELIVMAPGGLRR
GSRFVVRVVAGGDQLARQTGLVDGRGRPIRGLPPQVVSGATCDAEAAWRGAFLAHGSLTEPGRSSSLEVTCPGPEAALAL
VGAARRLSIAAKAREVRGVDRVVVRDGDAIGALLTRLGAHESVLAWEERRMRREVRATANRLANFDDANLRRSARAAVAA
GARVQRALEILGEEVPEHLAAAGRLRMEHKQASLEELGALADPPLTKDAVAGRIRRLLAMADKRAQDLGIPGTESNLTEE
LDDSLVG
>Q9X234 ~~~whiA~~~Probable cell division protein WhiA~~~
MVSLLRRTFSEEIKEELVNVPFGSREEVISELLGFIKARGDLDVKSRHIVFSLHSFAASRRLLNLMKYLSKPVSEIIVEK
SHNIKKRYIKITAEYSESFMVIEPFFDVALFVSFLRGLFLSGGSMTNPRYHYHLEINLFEEETLALTRKSLKDFFNINAG
IIELRNTRKLYIKSIKDILVFLEAIGVQRKLEEIDRIVTERKVIGDVNRTVNFIEANAIRTANSTARQIRAIELIKENMG
LENLPEDLRRVALVRLRNKELSLRELGKKLNLTKSQIYSKLKRIIKIAERFGDVK
>Q8G5J9 ~~~whiB1~~~Transcriptional regulator WhiB1~~~
MSSAFDWRAKAACRDKDPELFFPVGNTGAAYQQIEEAKAVCRTCKVIDACLKCALDTNQDYGVWGGLSEDERRALKRRAM
RARRSQAMQMQI
>P9WF42 ~~~whiB1~~~Transcriptional regulator WhiB1~~~
MDWRHKAVCRDEDPELFFPVGNSGPALAQIADAKLVCNRCPVTTECLSWALNTGQDSGVWGGMSEDERRALKRRNARTKA
RTGV
>P9WF43 ~~~whiB1~~~Transcriptional regulator WhiB1~~~
MDWRHKAVCRDEDPELFFPVGNSGPALAQIADAKLVCNRCPVTTECLSWALNTGQDSGVWGGMSEDERRALKRRNARTKA
RTGV
>Q8G5K1 ~~~whiB2~~~Transcriptional regulator WhiB2~~~
MWGVVDDSAEEPWHELWGLFRPDGDVSWQHKALCAQTDPEAFFPEKGGSTRDAKRVCAKCEVREQCLKWAIDHDERFGIW
GGMSERERRRYKREHRERA
>A0QTG3 ~~~whiB2~~~Transcriptional regulator WhiB2~~~
MSYESGDFDRVVRFDNRLLGSVSHAPHIDTGSTPTGAAGRPQLSLVPDSFDVAPEAEEDQWQERALCAQTDPEAFFPEKG
GSTREAKRICQGCEVRDACLEYALAHDERFGIWGGLSERERRRLKRGII
>Q9S426 ~~~whmD~~~Transcriptional regulator WhiB2~~~
MSYESGDFDRVVRFDNRLLGSVSHAPHIDTGSTPTGAAGRPQLSLVPDSFDVAPEAEEDQWQERALCAQTDPEAFFPEKG
GSTREAKRICQGCEVRDACLEYALAHDERFGIWGGLSERERRRLKRGII
>A0QST8 ~~~whiB3~~~Redox-responsive transcriptional regulator WhiB3~~~
MPQPQQLPGPNADIWDWQMRGLCRGVDSSMFFHPDGERGRARAQREMRAKEMCRSCPVIAQCRSHALAVGEPYGIWGGLS
ESERELLLKRGIRRSA
>P9WF40 ~~~whiB3~~~Redox- and pH-responsive transcriptional regulator WhiB3~~~
MPQPEQLPGPNADIWNWQLQGLCRGMDSSMFFHPDGERGRARTQREQRAKEMCRRCPVIEACRSHALEVGEPYGVWGGLS
ESERDLLLKGTMGRTRGIRRTA
>P9WF41 ~~~whiB3~~~Redox- and pH-responsive transcriptional regulator WhiB3~~~
MPQPEQLPGPNADIWNWQLQGLCRGMDSSMFFHPDGERGRARTQREQRAKEMCRRCPVIEACRSHALEVGEPYGVWGGLS
ESERDLLLKGTMGRTRGIRRTA
>P9WF38 ~~~whiB4~~~Transcriptional regulator WhiB4~~~
MSGTRPAARRTNLTAAQNVVRSVDAEERIAWVSKALCRTTDPDELFVRGAAQRKAAVICRHCPVMQECAADALDNKVEFG
VWGGMTERQRRALLKQHPEVVSWSDYLEKRKRRTGTAG
>P9WF39 ~~~whiB4~~~Transcriptional regulator WhiB4~~~
MSGTRPAARRTNLTAAQNVVRSVDAEERIAWVSKALCRTTDPDELFVRGAAQRKAAVICRHCPVMQECAADALDNKVEFG
VWGGMTERQRRALLKQHPEVVSWSDYLEKRKRRTGTAG
>P9WF36 ~~~whiB6~~~Probable transcriptional regulator WhiB6~~~
MRYAFAAEATTCNAFWRNVDMTVTALYEVPLGVCTQDPDRWTTTPDDEAKTLCRACPRRWLCARDAVESAGAEGLWAGVV
IPESGRARAFALGQLRSLAERNGYPVRDHRVSAQSA
>P9WF37 ~~~whiB6~~~Probable transcriptional regulator WhiB6~~~
MRYAFAAEATTCNAFWRNVDMTVTALYEVPLGVCTQDPDRWTTTPDDEAKTLCRACPRRWLCARDAVESAGAEGLWAGVV
IPESGRARAFALGQLRSLAERNGYPVRDHRVSAQSA
>A0QTT1 ~~~whiB7~~~Transcriptional regulator WhiB7~~~
MSIAMTAPTTGVAPMTCETRLPAVPCHVGDPDLWFAENPGDLERAKALCAGCPIRVQCLTAALERQEPWGVWGGEILDRG
SIVARKRPRGRPRKDSGGNPAAA
>Q7AKN0 ~~~whiB~~~Transcriptional regulator WhiB~~~
MTELVQQLLVDDADEELGWQERALCAQTDPESFFPEKGGSTREAKKVCLACEVRSECLEYALANDERFGIWGGLSERERR
RLKKAAV
>Q7AKI9 ~~~whiD~~~Transcriptional regulator WhiD~~~
MADFSRLPGPNADLWDWQLLAACRGVDSSLFFHPEGERGAARSARENSAKEVCMRCPVRAECAAHALAVREPYGVWGGLT
EDEREELMGRARNRLVAATASASAGGEAAGPH
>P54423 3.4.21.-~~~wprA~~~Cell wall-associated protease~~~COG1404
MKRRKFSSVVAAVLIFALIFSLFSPGTKAAAAGAIDQAAALENGKEQTGAMKEPEQVKWYKVTPGATDIQKNSHMALTVK
SDSVLNVSVYPSKEKALKDETFEMYRSFTAEDGKSEVIFPYAWSGPYYVKVEYLGEEEPEDGGTAEAAAEAKYTIGYKGT
KKQPSDLEEEEACPVEMSVDQKKSGKGILDKLRSIRDEQLSQTAEGKELTSLYYKAAPFIVAKLALNKTARNEIYQDLVT
LKPLFDDVSENGASSSYKVTEKDQKAINRLYDKALQSVPSFLKEEIKKQADRLNMKQLQGKTAGAILTENNIAAKSEVQT
TKVIFKVKDNKSLSSVHNEMKGFSASAQSKKDISNVKKAKKLFDNLYSFELPKDEKQNGAYTASAKRVKSAAATLSKMSN
VEFAEPVQEYKSLANDIQYPYQWPLKNNGENGGVKNADVKYEPANTLLSKRKLNDTLIAVVDTGVDSTLADLKGKVRTDL
GHNFVGRNNNAMDDQGHGTHVAGIIAAQSDNGYSMTGLNAKAKIIPVKVLDSAGSGDTEQIALGIKYAADKGAKVINLSL
GGGYSRVLEFALKYAADKNVLIAAASGNDGENALSYPASSKYVMSVGATNRMDMTADFSNYGKGLDISAPGSDIPSLVPN
GNVTYMSGTSMATPYAAAAAGLLFAQNPKLKRTEVEDMLKKTADDISFESVDGGEEELYDDYGDPIEIPKTPGVDWHSGY
GRLNVMKAVSAADLQLKVNKLESTQTAVRGSAKEGTLIEVMNGKKKLGSAKAGKDNAFKVNIATQKQDQVLYLKATKGDA
KTSYKVVVVKGKPSGTPKVNAVKTKDTAVKGKANSKAMIRVKNKSKKVIASAKADAKGTFSVKIKKQKAGTVLYVTAVDT
DKKESKEAKVVVEK
>Q8GGG1 2.3.1.20~~~wax-dgaT~~~O-acyltransferase WSD~~~COG1020
MRPLHPIDFIFLSLEKRQQPMHVGGLFLFQIPDNAPDTFIQDLVNDIRISKSIPVPPFNNKLNGLFWDEDEEFDLDHHFR
HIALPHPGRIRELLIYISQEHSTLLDRAKPLWTCNIIEGIEGNRFAMYFKIHHAMVDGVAGMRLIEKSLSHDVTEKSIVP
PWCVEGKRAKRLREPKTGKIKKIMSGIKSQLQATPTVIQELSQTVFKDIGRNPDHVSSFQAPCSILNQRVSSSRRFAAQS
FDLDRFRNIAKSLNVTINDVVLAVCSGALRAYLMSHNSLPSKPLIAMVPASIRNDDSDVSNRITMILANLATHKDDPLQR
LEIIRRSVQNSKQRFKRMTSDQILNYSAVVYGPAGLNIISGMMPKRQAFNLVISNVPGPREPLYWNGAKLDALYPASIVL
DGQALNITMTSYLDKLEVGLIACRNALPRMQNLLTHLEEEIQLFEGVIAKQEDIKTAN
>Q88MS8 2.1.1.-~~~wspC~~~Probable biofilm formation methyltransferase WspC~~~COG0457
MNEQRFFRFLRERIGLDVESVGAPMVERALRQRCVAAGAMDLDDYWLRLQQSADEQQALIEAVIVPETWFFRYPESFTAL
ASLAHKRLAQLAGARPLRLLSLPCSTGEEPYSLAMALFDAGMAPGAFLVDGMDISPSSVAKAGQAVYGRNAFRGSELGFR
ERYFDALDEGHRLHERVRQQVSLRVGNVLDPALASRDGLYDFVFCRNLLIYFDVPTQQRVFEVLKRLLHPQGVLFIGPAE
GSLLARMGMRPLGIAQSFAYVRHEGDSAPLAAAPAQTAKRAFTTLPAPVYPQPSVPLPRSRRVLPVAARPARAREHSHEG
ASELLAGIARLANAGASEQARSECQRYLSQYPPSAQVYYWLGLLSDTEGDAQQALSHYRKALYLEPQHPEALVHLAALLA
AQGDLAGARRLQERAARAGRESER
>P0A930 ~~~wza~~~Putative polysaccharide export protein Wza~~~COG1596
MMKSKMKLMPLLVSVTLISGCTVLPGSNMSTMGKDVIKQQDADFDLDKMVNVYPLTPRLIDQLRPRPNVARPNMTLESEI
ANYQYRVGPGDVLNVTVWDHPELTTPAGQYRSSSDTGNWVQPDGTMFYPYIGKVHVVGKTLAEIRSDITGRLATYIADPQ
VDVNIAAFRSQKAYISGQVNKSGQQAITNVPLTILDAINAAGGLTDTADWRNVVLTHNGREERISLQALMQNGDLNQNRL
LYPGDILYVPRNDDLKVFVMGEVKKQSTLKMDFSGMTLTEALGNAEGIDMTTSNASGIFVIRPLKGEGGRNGKIANIYQL
DMSDATSLVMATEFRLQPYDVVYVTTAPVSRWNRLINQLLPTISGVRYMTDTASDIHNW
>P0AAB2 3.1.3.48~~~wzb~~~Low molecular weight protein-tyrosine-phosphatase Wzb~~~COG0394
MFNNILVVCVGNICRSPTAERLLQRYHPELKVESAGLGALVGKGADPTAISVAAEHQLSLEGHCARQISRRLCRNYDLIL
TMEKRHIERLCEMAPEMRGKVMLFGHWDNECEIPDPYRKSRETFAAVYTLLERSARQWAQALNAEQV
>P76387 2.7.10.-~~~wzc~~~Tyrosine-protein kinase wzc~~~COG0489
MTEKVKQHAAPVTGSDEIDIGRLVGTVIEARWWVIGITTVFALCAVVYTFFATPIYSADALVQIEQNSGNSLVQDIGSAL
ANKPPASDAEIQLIRSRLVLGKTVDDLDLDIAVSKNTFPIFGAGWDRLMGRQNETVKVTTFNRPKEMADQVFTLNVLDNK
NYTLSSDGGFSARGQAGQMLKKEGVTLMVEAIHASPGSEFTVTKYSTLGMINQLQNSLTVTENGKDAGVLSLTYTGEDRE
QIRDILNSIARNYQEQNIERKSAEASKSLAFLAQQLPEVRSRLDVAENKLNAFRQDKDSVDLPLEAKAVLDSMVNIDAQL
NELTFKEAEISKLYTKVHPAYRTLLEKRQALEDEKAKLNGRVTAMPKTQQEIVRLTRDVESGQQVYMQLLNKEQELKITE
ASTVGDVRIVDPAITQPGVLKPKKGLIILGAIILGLMLSIVGVLLRSLFNRGIESPQVLEEHGISVYASIPLSEWQKARD
SVKTIKGIKRYKQSQLLAVGNPTDLAIEAIRSLRTSLHFAMMQAQNNVLMMTGVSPSIGKTFVCANLAAVISQTNKRVLL
IDCDMRKGYTHELLGTNNVNGLSEILIGQGDITTAAKPTSIAKFDLIPRGQVPPNPSELLMSERFAELVNWASKNYDLVL
IDTPPILAVTDAAIVGRHVGTTLMVARYAVNTLKEVETSLSRFEQNGIPVKGVILNSIFRRASAYQDYGYYEYEYKSDAK
>P77377 ~~~wzxC~~~Lipopolysaccharide biosynthesis protein WzxC~~~COG2244
MSLREKTISGAKWSAIATVIIIGLGLVQMTVLARIIDNHQFGLLTVSLVIIALADTLSDFGIANSIIQRKEISHLELTTL
YWLNVGLGIVVCVAVFLLSDLIGDVLNNPDLAPLIKTLSLAFVVIPHGQQFRALMQKELEFNKIGMIETSAVLAGFTCTV
VSAHFWPLAMTAILGYLVNSAVRTLLFGYFGRKIYRPGLHFSLASVAPNLRFGAWLTADSIINYLNTNLSTLVLARILGA
GVAGGYNLAYNVAVVPPMKLNPIITRVLFPAFAKIQDDTEKLRVNFYKLLSVVGIINFPALLGLMVVSNNFVPLVFGEKW
NSIIPVLQLLCVVGLLRSVGNPIGSLLMAKARVDISFKFNVFKTFLFIPAIVIGGQMAGAIGVTLGFLLVQIINTILSYF
VMIKPVLGSSYRQYILSLWLPFYLSLPTLVVSYALGIVLKGQLALGMLLAVQIATGVLAFVVMIVLSRHPLVVEVKRQFC
RSEKMKMLLRAG
>P0AAA7 ~~~wzxE~~~Lipid III flippase~~~COG2244
MSLAKASLWTAASTLVKIGAGLLVGKLLAVSFGPAGLGLAANFRQLITVLGVLAGAGIFNGVTKYVAQYHDNPQQLRRVV
GTSSAMVLGFSTLMALVFVLAAAPISQGLFGNTDYQGLVRLVALVQMGIAWGNLLLALMKGFRDAAGNALSLIVGSLIGV
LAYYVSYRLGGYEGALLGLALIPALVVIPAAIMLIKRGVIPLSYLKPSWDNGLAGQLSKFTLMALITSVTLPVAYIMMRK
LLAAQYSWDEVGIWQGVSSISDAYLQFITASFSVYLLPTLSRLTEKRDITREVVKSLKFVLPAVAAASFTVWLLRDFAIW
LLLSNKFTAMRDLFAWQLVGDVLKVGAYVFGYLVIAKASLRFYILAEVSQFTLLMVFAHWLIPAHGALGAAQAYMATYIV
YFSLCCGVFLLWRRRA
>P27835 ~~~wzyE~~~Probable ECA polymerase~~~
MSLLQFSGLFVVWLLCTLFIATLTWFEFRRVRFNFNVFFSLLFLLTFFFGFPLTSVLVFRFDVGVAPPEILLQALLSAGC
FYAVYYVTYKTRLRKRVADVPRRPLFTMNRVETNLTWVILMGIALVSVGIFFMHNGFLLFRLNSYSQIFSSEVSGVALKR
FFYFFIPAMLVVYFLRQDSKAWLFFLVSTVAFGLLTYMIVGGTRANIIIAFAIFLFIGIIRGWISLWMLAAAGVLGIVGM
FWLALKRYGMNVSGDEAFYTFLYLTRDTFSPWENLALLLQNYDNIDFQGLAPIVRDFYVFIPSWLWPGRPSMVLNSANYF
TWEVLNNHSGLAISPTLIGSLVVMGGALFIPLGAIVVGLIIKWFDWLYELGNREPNRYKAAILHSFCFGAIFNMIVLARE
GLDSFVSRVVFFIVVFGACLMIAKLLYWLFESAGLIHKRTKSSLRTQVEG
>P35272 ~~~wzzB~~~Chain length determinant protein~~~COG3765
MRVENNNVSGQNHDPEQIDLIDLLVQLWRGKMTIIISVIVAIALAIGYLAVAKEKWTSTAIITQPDVGQIAGYNNAMNVI
YGQAAPKVSDLQETLIGRFSSAFSALAETLDNQEEPEKLTIEPSVKNQQLPLTVSYVGQTAEGAQMKLAQYIQQVDDKVN
QELEKDLKDNIALGRKNLQDSLRTQEVVAQEQKDLRIRQIQEALQYANQAQVTKPQIQQTQDVTQDTMFLLGSEALESMI
KHEATRPLVFSSNYYQTRQNLLDIDNLDVDKLDIHAYRYVMKPTLPIRRDSPKKAITLILAVLLGGMVGAGIVLGRNALR
NYNAK
>P76372 ~~~wzzB~~~Chain length determinant protein~~~COG3765
MRVENNNVSGQNHDPEQIDLIDLLVQLWRGKMTIIISVIVAIALAIGYLAVAKEKWTSTAIITQPDVGQIAGYNNAMNVI
YGQAAPKVSDLQETLIGRFSSAFSALAETLDNQEEREKLTIEPSVKNQQLPLTVSYVGQTAEGAQMKLAQYIQQVDDKVN
QELEKDLKDNIALGRKNLQDSLRTQEVVAQEQKDLRIRQIQEALQYANQAQVTKPQIQQTGEDITQDTLFLLGSEALESM
IKHEATRPLVFSPNYYQTRQNLLDIESLKVDDLDIHAYRYVMKPMLPIRRDSPKKAITLILAVLLGGMVGAGIVLGRNAL
RNYNAK
>Q04866 ~~~wzzB~~~Chain length determinant protein~~~
MTVDSNTSSGRGNDPEQIDLIELLLQLWRGKMTIIVAVIIAILLAVGYLMIAKEKWTSTAIITQPDAAQVATYTNALNVL
YGGNAPKISEVQANFISRFSSAFSALSEVLDNQKEREKLTIEQSVKGQALPLSVSYVSTTAEGAQRRLAEYIQQVDEEVA
KELEVDLKDNITLQTKTLQESLETQEVVAQEQKDLRIKQIEEALRYADEAKITQPQIQQTQDVTQDTMFLLGSDALKSMI
QNEATRPLVFSPAYYQTKQTLLDIKNLKVTADTVHVYRYVMKPTLPVRRDSPKTAITLVLAVLLGGMIGAGIVLGRNALR
SYKPKAL
>P37792 ~~~wzzB~~~Chain length determinant protein~~~
MRVENNNVSGQNHDPEQIDLIDLLVQLWRGKMTIIISVIVAIALAIGYLAVAKEKWTSTAIITQPDVGQIAGYNNAMNVI
YGQAAPKVSDLQETLIGRFSSAFSALAETLDNQEEPEKLTIEPSVKNQQLPLTVSYVGQTAEGAQMKLAQYIQQVDDKVN
QELEKDLKDNIALGRKNLQDSLRTQEVVAQEQKDLRIRQIQEALQYANQAQVTKPQVQQTEDVTQDTLFLLGSEALESMI
KHEATRPLVFSPNYYQTRQNLLDIEKLKFDDLDIHAYRYVMKPTLPIRRDSPKKAITLILAVLLGGMVGAGIVLGRNALR
NYNAK
>P0AG01 ~~~wzzE~~~ECA polysaccharide chain length modulation protein~~~COG3765
MTQPMPGKPAEDAENELDIRGLFRTLWAGKLWIIGMGLAFALIALAYTFFARQEWSSTAITDRPTVNMLGGYYSQQQFLR
NLDVRSNMASADQPSVMDEAYKEFVMQLASWDTRREFWLQTDYYKQRMVGNSKADAALLDEMINNIQFIPGDFTRAVNDS
VKLIAETAPDANNLLRQYVAFASQRAASHLNDELKGAWAARTIQMKAQVKRQEEVAKAIYDRRMNSIEQALKIAEQHNIS
RSATDVPAEELPDSEMFLLGRPMLQARLENLQAVGPAFDLDYDQNRAMLNTLNVGPTLDPRFQTYRYLRTPEEPVKRDSP
RRAFLMIMWGIVGGLIGAGVALTRRCSK
>P0AG00 ~~~wzzE~~~ECA polysaccharide chain length modulation protein~~~COG3765
MTQPMPGKPAEDAENELDIRGLFRTLWAGKLWIIGMGLAFALIALAYTFFARQEWSSTAITDRPTVNMLGGYYSQQQFLR
NLDVRSNMASADQPSVMDEAYKEFVMQLASWDTRREFWLQTDYYKQRMVGNSKADAALLDEMINNIQFIPGDFTRAVNDS
VKLIAETAPDANNLLRQYVAFASQRAASHLNDELKGAWAARTIQMKAQVKRQEEVAKAIYDRRMNSIEQALKIAEQHNIS
RSATDVPAEELPDSEMFLLGRPMLQARLENLQAVGPAFDLDYDQNRAMLNTLNVGPTLDPRFQTYRYLRTPEEPVKRDSP
RRAFLMIMWGIVGGLIGAGVALTRRCSK
>O87082 1.14.13.69~~~xamoA~~~Alkene monooxygenase system, oxygenase component subunit alpha~~~COG3350
MALLNRDDWYDIARDVDWTLSYVDRAVAFPEEWKGEKDICGTAWDDWDEPFRVSFREYVMVQRDKEASVGAIREAMVRAK
AYEKLDDGHKATSHLHMGTITMVEHMAVTMQSRFVRFAPSARWRSLGAFGMLDETRHTQLDLRFSHDLLNDSPSFDWSQR
AFHTDEWAVLATRNLFDDIMLNADCVEAALATSLTLEHGFTNIQFVALASDAMEAGDVNFSNLLSSIQTDEARHAQLGFP
TLDVMMKHDPKRAQQILDVAFWRSYRIFQAVTGVSMDYYTPVAKRQMSFKEFMLEWIVKHHERILRDYGLQKPWYWDTFE
KTLDHGHHALHIGTWFWRPTLFWDPNGGVSREERRWLNQKYPNWEESWGVLWDEIISNINAGNIEKTLPETLPMLCNVTN
LPIGSHWDRFHLKPEQLVYKGRLYTFDSDVSKWIFELDPERYAGHTNVVDRFIGGQIQPMTIEGVLNWMGLTPEVMGKDV
FNYRWAGDYAENRIAAE
>Q9ZET6 1.14.13.69~~~xamoB~~~Alkene monooxygenase system, oxygenase component subunit gamma~~~
MSLFPIVGRFVGDFVPHLVAVDTSDTIDQIAEKVAVHTVGRRLPPDPTATGYEVLLDGETLDGGATLEAIMTKREMLPLQ
WFDVRFKK
>Q9ZET5 ~~~xamoC~~~Alkene monooxygenase system, ferredoxin component~~~COG2146
MNLHAPNAEQDDIEYVDVCAVDDLWDGEMDVFDVGEHEVLLVKHEGRFHAYDGICPHQSVSLVEGHLTEDGVLICKAHEW
QFSVEGGQGINPANVCLQSFPLKVEGGRVLIGTEPLPKEGEA
>Q9ZET4 ~~~xamoD~~~Alkene monooxygenase system, effector subunit~~~COG3445
MSNATVDDMDENLVGPVIRAGDLADAVIDAVIADNPGKEVHVIERGDYVRIHTDRDCRLTRASIEQALGRSFVLAAIEAE
MSSFKGRMSSSDSEMRWYYKS
>Q9ZET3 1.14.13.69~~~xamoE~~~Alkene monooxygenase system, oxygenase component subunit beta~~~
MTQQRPTRTRERKKTWTAFGNLGRKPTDYEVVTHNMNHTMRGTPLELSPTVHANVWLKKNRDEIALKVDSWDLFRDPDRT
TYDTYVKMQDDQETYVDNLLLSYTGEGRYDEELSSRSLDLLSAGLTPTRYLGHGLQMLAAYIQQLAPSAYVGNCAVFQTS
DALRRVQRVAYRTRQLADAHPARGFGSGDRAVWEKSPDWQPIRKAIEELLVTFEWDKALAGTNFVVKPILDELFLNHLAR
LLHVEGDELDSLVLRNLHGDAQRHARWTAALGRFAVEQNVNNRTVLRDAIAGWHETGEAVLAAGAGMLASRAPSADAAKI
ADEVRATLAQLHANAGLGHDA
>A7IPX7 1.18.1.3~~~xamoF~~~Alkene monooxygenase system, ferredoxin--NAD(+) reductase component~~~COG0543
MRLNDGRSFSCRSDQTVLHAALAAGIDMPYECASGSCGSCRCRLSHGSVSLLWPEAPGLSARDRQKGDRILACQSTPSSD
LEINVRAGDALLEPPPRRHAARVTVKETLCASVIRLVLNVGGPIHFLPGQFFILDLPGAGRRAYSVANLENAAGGIELLI
KRKIGGAGTAALFDQCAPGMGLVIEGPYGRAYLRADSARGIVAVAGGSGLAPMLSILRGALARGFGGPMDLYFGVNTAEE
LFCVPELSALQAAGARVHLALRDGGPGPAGLHRQAGLIGDALVAGEPDLKAKDLYVAGPAPMTDDILARTVRQEAIPADR
VFFDRFV
>Q9AQS0 4.2.2.12~~~xly~~~Xanthan lyase~~~
MLSGILIAALLMTLWGGWQPDIAHASDEFDALRIKWATLLTGGPALDPADSDIAARTDKLAQDANDYWEDMDLSSSRTYI
WYALRGNGTSDNVNAVYERLRTMALAATTVGSSLYGNADLKEDILDALDWLYVNSYNSTRSRSAYNWWHWQLGIPMSLND
IAVLLYDDISAARMATYMDTIDYFTPSIGLTGANRAWQAIVVGVRAVIVKDAVKLAAARNGLSGTGIFPYATGGDGFYAD
GSFVQHTTFAYTGGYGSSVLETTANLMYLLSGSTWSVSDPNQSNVWQWIYEAYRPLLYKGAMMDMVRGREISRSYAQDHA
VGHGIVASIVRLAQFAPAPHAAAFKQIAKRVIQEDTFSSFYGDVSTDTIRLAKAIVDDPSIAPAAAPNLYKQYAAMDRAV
LQRPGFALGLALYSTRISSYESINSENGRGWYTGAGATYLYNQDLAQYSEDYWPTVDAYRIPGTTVASGTPIASGTGTSS
WTGGVSLAGQYGASGMDLSYGAYNLSARKSWFMFDDEIVALGSGISSTAGIPIETVVDNRKLNGAGDNAWTANGAALSTG
LGVAQTLTGVNWVHLAGNTADGSDIGYYFPGGATLQTKREARTGTWKQINNRPATPSTAVTRNYETMWIDHGTNPSGASY
GYVLLPNKTSAQVGAYAADPAIEIVVNTSGVQSVKEKTLGLVGANFWTDTTQTADLITSNKKASVMTREIADERLEASVS
DPTQANNGTIAIELARSAEGYSADPGITVTQLAPTIKFTVNVNGAKGKSFHASFQLGEDTSGPVDPGEPELPSVIVDNAD
SAGVTRTGTWKTASTQTDRYGANYLHDDNAGKGTKSVTFTPNLPIAGSYEVYLMWPAHFNREDAVQVDVGHASGTTRTAV
DQRSGGGVWHSIGTYEFLAGSGGSVTIRNDALGSPDGYVVADAVKFVAVG
>P0AGM9 ~~~xanP~~~Xanthine permease XanP~~~COG2233
MSVSTLESENAQPVAQTQNSELIYRLEDRPPLPQTLFAACQHLLAMFVAVITPALLICQALGLPAQDTQHIISMSLFASG
VASIIQIKAWGPVGSGLLSIQGTSFNFVAPLIMGGTALKTGGADVPTMMAALFGTLMLASCTEMVISRVLHLARRIITPL
VSGVVVMIIGLSLIQVGLTSIGGGYAAMSDNTFGAPKNLLLAGVVLALIILLNRQRNPYLRVASLVIAMAAGYALAWFMG
MLPESNEPMTQELIMVPTPLYYGLGIEWSLLLPLMLVFMITSLETIGDITATSDVSEQPVSGPLYMKRLKGGVLANGLNS
FVSAVFNTFPNSCFGQNNGVIQLTGVASRYVGFVVALMLIVLGLFPAVSGFVQHIPEPVLGGATLVMFGTIAASGVRIVS
REPLNRRAILIIALSLAVGLGVSQQPLILQFAPEWLKNLLSSGIAAGGITAIVLNLIFPPEKQ
>Q60106 3.4.21.101~~~~~~Xanthomonalisin~~~
MKIEKTALTVAIALAMSSLSAHAEDAWVSTHTQAAMSPPASTQVLAASSTSATTTGNAYTLNMTGSPRIDGAAVTALEAD
HPLHVEVALKLRNPDALQTFLAGVTTPGSALFGKFLTPSQFTERFGPTQSQVDAVVAHLQQAGFTNIEVAPNRLLISADG
TAGAATNGFRTSIKRFSANGREFFANDAPALVPASLGDSVNAVLGLQNVSVKHTLHHVYHPEDVTVPGPNVGTQAAAAVA
AHHPQDFAAIYGGSSLPAATNTAVGIITWGSITQTVTDLNSFTSGAGLATVNSTITKVGSGTFANDPDSNGEWSLDSQDI
VGIAGGVKQLIFYTSANGDSSSSGITDAGITASYNRAVTDNIAKLINVSLGEDETAAQQSGTQAADDAIFQQAVAQGQTF
SIASGDAGVYQWSTDPTSGSPGYVANSAGTVKIDLTHYSVSEPASSPYVIQVGGTTLSTSGTTWSGETVWNEGLSAIAPS
QGDNNQRLWATGGGVSLYEAAPSWQSSVSSSTKRVGPDLAFDAASSSGALIVVNGSTEQVGGTSLASPLFVGAFARIESA
ANNAIGFPASKFYQAFPTQTSLLHDVTSGNNGYQSHGYTAATGFDEATGFGSFDIGKLNTYAQANWVTGGGGGSTNAPPV
ANFSVATTGLVATFTDSSTDSDGSIASHAWTFGDGSTSTATSPSHTYSAAGTYSVAETVTDNAGATSTKTSSVTVSSSGG
TGGGTVLQNGVAATGLSAAKNGQLKYTVAIPSGAKSLKIAISGGTGDADLYVKFGSAPTTSSYDCRPYVTGNTESCSFAS
PQTGTYYVLLNGYAAFSGVSLKATWTN
>P67444 ~~~xanQ~~~Xanthine permease XanQ~~~COG2233
MSDINHAGSDLIFELEDRPPFHQALVGAITHLLAIFVPMVTPALIVGAALQLSAETTAYLVSMAMIASGIGTWLQVNRYG
IVGSGLLSIQSVNFSFVTVMIALGSSMKSDGFHEELIMSSLLGVSFVGAFLVVGSSFILPYLRRVITPTVSGIVVLMIGL
SLIKVGIIDFGGGFAAKSSGTFGNYEHLGVGLLVLIVVIGFNCCRSPLLRMGGIAIGLCVGYIASLCLGMVDFSSMRNLP
LITIPHPFKYGFSFSFHQFLVVGTIYLLSVLEAVGDITATAMVSRRPIQGEEYQSRLKGGVLADGLVSVIASAVGSLPLT
TFAQNNGVIQMTGVASRYVGRTIAVMLVILGLFPMIGGFFTTIPSAVLGGAMTLMFSMIAIAGIRIIITNGLKRRETLIV
ATSLGLGLGVSYDPEIFKILPASIYVLVENPICAGGLTAILLNIILPGGYRQENVLPGITSAEEMD
>P45563 2.4.2.1~~~xapA~~~Purine nucleoside phosphorylase 2~~~COG0005
MSQVQFSHNPLFCIDIIKTYKPDFTPRVAFILGSGLGALADQIENAVAISYEKLPGFPVSTVHGHAGELVLGHLQGVPVV
CMKGRGHFYEGRGMTIMTDAIRTFKLLGCELLFCTNAAGSLRPEVGAGSLVALKDHINTMPGTPMVGLNDDRFGERFFSL
ANAYDAEYRALLQKVAKEEGFPLTEGVFVSYPGPNFETAAEIRMMQIIGGDVVGMSVVPEVISARHCDLKVVAVSAITNM
AEGLSDVKLSHAQTLAAAELSKQNFINLICGFLRKIA
>P45562 ~~~xapB~~~Xanthosine permease~~~COG2211
MSIAMRLKVMSFLQYFIWGSWLVTLGSYMINTLHFTGANVGMVYSSKGIAAIIMPGIMGIIADKWLRAERAYMLCHLVCA
GVLFYAASVTDPDMMFWVMLVNAMAFMPTIALSNSVSYSCLAQAGLDPVTAFPPIRVFGTVGFIVAMWAVSLLHLELSSL
QLYIASGASLLLSAYALTLPKIPVAEKKATTSLASKLGLDAFVLFKNPRMAIFFLFAMMLGAVLQITNVFGNPFLHDFAR
NPEFADSFVVKYPSILLSVSQMAEVGFILTIPFFLKRFGIKTVMLMSMVAWTLRFGFFAYGDPSTTGFILLLLSMIVYGC
AFDFFNISGSVFVEQEVDSSIRASAQGLFMTMVNGVGAWVGSILSGMAVDYFSVDGVKDWQTIWLVFAGYALFLAVIFFF
GFKYNHDPEKIKHRAVTH
>O32147 1.17.1.4~~~pucA~~~Probable xanthine dehydrogenase subunit A~~~COG1975
MGNFHTMLDALLEDQEEAVLATIVQVEGSAYRKAGASMLFKKKGRRIGLLSGGCVEEDVFQRISALGDQLTSTLIPYDMR
SEDDLSWGMGAGCNGIIHVHAERITQEKRRHYEKVRDCLHSGKAVTSVIKIESSHYLFLTENGHFGNWPDAPLQDIQRTV
STLHLPHFDQTTNMFIQRIEPKPRLILFGAGPDNVPLANLAADTGFSVIVTDWRPAYCTSSLFPKADQLITAFPEQMLSE
FQFFPHDAAVVATHHYQHDQTIINFLFSQNLHYIGLLGSANRTKRLLSGKHPPSHFYSPVGLKIGAEGPEEIAVSVVAEI
IQTRKRVAVV
>Q46799 1.17.1.4~~~xdhA~~~Putative xanthine dehydrogenase molybdenum-binding subunit XdhA~~~COG1529
MRVDAIAKVTGRARYTDDYVMAGMCYAKYVRSPIAHGYAVSINDEQARSLPGVLAIFTWEDVPDIPFATAGHAWTLDENK
RDTADRALLTRHVRHHGDAVAIVVARDELTAEKAAQLVSIEWQELPVITTPEAALAEDAAPIHNGGNLLKQSTMSTGNVQ
QTIDAADYQVQGHYQTPVIQHCHMESVTSLAWMEDDSRITIVSSTQIPHIVRRVVGQALDIPWSCVRVIKPFVGGGFGNK
QDVLEEPMAAFLTSKLGGIPVKVSLSREECFLATRTRHAFTIDGQMGVNRDGTLKGYSLDVLSNTGAYASHGHSIASAGG
NKVAYLYPRCAYAYSSKTCYTNLPSAGAMRGYGAPQVVFAVESMLDDAATALGIDPVEIRLRNAAREGDANPLTGKRIYS
AGLPECLEKGRKIFEWEKRRAECQNQQGNLRRGVGVACFSYTSNTWPVGVEIAGARLLMNQDGTINVQSGATEIGQGADT
VFSQMVAETVGVPVSDVRVISTQDTDVTPFDPGAFASRQSYVAAPALRSAALLLKEKIIAHAAVMLHQSAMNLTLIKGHI
VLVERPEEPLMSLKDLAMDAFYHPERGGQLSAESSIKTTTNPPAFGCTFVDLTVDIALCKVTINRILNVHDSGHILNPLL
AEGQVHGGMGMGIGWALFEEMIIDAKSGVVRNPNLLDYKMPTMPDLPQLESAFVEINEPQSAYGHKSLGEPPIIPVAAAI
RNAVKMATGVAINTLPLTPKRLYEEFHLAGLI
>O32145 1.17.1.4~~~pucC~~~Probable xanthine dehydrogenase subunit C~~~COG1319
MNGQVTKARMNIQLWRPAALDEAYSLLEKLAPDVCAASGSTLLQLQWDKGTLPKQHLVSLEGIDEMRGISTSDTHVSIGG
LTSLNECRKNPLIKRALSCFSDAASAVAAPGIRSRATIGGNIASKIGDFIPLLLVLGAELIVYQKELIRLPLGAWLSEED
FRTAIVTRVIIPRAEGERVFYHKLGRRQAFTGAAAVAAGRFLKDGSIRLAAGHADITPRRLLDSEAKWMAPGWDPHELYK
TLIHELPFSSDVFMSAAYRKKAAANVIMAELMAEGGE
>O32144 1.17.1.4~~~pucD~~~Probable xanthine dehydrogenase subunit D~~~COG1529
MIINKPSRVRPDGRGKVTGELKYMTDLSFPGMLYGKVLRSAYPHAEIVSVCTIKAEKMEGVQAVVTHKDVPGLNRFGIVI
PDQPVLCEDRVRYVGDAIAAVAAETEEIAEAALELIQVEYKELEVMDSPEKALRPNAQRLHEDGNILHRAFFSNGDVEEG
FQASDTVFEETYELPRQMHTYMETEGGVAVPEDDGGFTMYAGTQHGYKDRFQLARIFDIPEEKIRIVSSPMGGSFGGKDE
LNIQPYAALLALKSGRPVKIHQTRKESVRSGIKRHPMKITIKTGADHSGNLLAHDVKIVADTGAYATLGPAVLDFSVEHA
AGPYRIPNIRTEGISVFTNNGVAGEFRGFGGNQITFALETHLDRLSGMLGIDPLELRRKNIRKPHDLGPLEHRIAPTDGA
AQVLNAISKSPILKKTSRNCGYLQRGTGAAITMHGGGLGFGRMDAAGGRLSLSSEGKITASFGFEECGQGILAAIEQIVM
EELGCAAEDISIVIGDTAKVPKSGSSTASRGTSMVWHAIQRLKKPFLAQLKKRAAEWSGCSAENLIPGAAGLRDKNTKAL
VVTYKELAEKGPLAEETAFDFPTTPDPVVGGHFLYSFGAAAVEVEVDLLTGDVKLIDCEHAIAAGPVVSPQGYRGQIEGG
AAMALGYTLMEEAKMTDGRYAAENLDHYLIPGIKDVPDMKLIAIEDLMKGDVYGPRGVGEIGTIAITPAIVKAVHDAVGC
WINKLPISREELLEAIDRKGLKQWT
>O32143 1.17.1.4~~~pucE~~~Probable xanthine dehydrogenase subunit E~~~COG2080
MDIKEAGPFPVKKEQFRMTVNGQAWEVAAVPTTHLSDLLRKEFQLTGTKVSCGIGRCGACSILIDGKLANACMTMAYQAD
GHSITTIEGLQKEELDMCQTAFLEEGGFQCGYCTPGMIIALKALFRETPQPSDKDIEEGLAGNLCRCTGYGGIMRSACRI
RRELNGGRRESGF
>Q9A9Z0 1.1.1.175~~~xylB~~~D-xylose 1-dehydrogenase~~~COG1028
MSSAIYPSLKGKRVVITGGGSGIGAGLTAGFARQGAEVIFLDIADEDSRALEAELAGSPIPPVYKRCDLMNLEAIKAVFA
EIGDVDVLVNNAGNDDRHKLADVTGAYWDERINVNLRHMLFCTQAVAPGMKKRGGGAVINFGSISWHLGLEDLVLYETAK
AGIEGMTRALARELGPDDIRVTCVVPGNVKTKRQEKWYTPEGEAQIVAAQCLKGRIVPENVAALVLFLASDDASLCTGHE
YWIDAGWR
>B8H1Z0 1.1.1.175~~~xylB~~~D-xylose 1-dehydrogenase~~~
MSSAIYPSLKGKRVVITGGGSGIGAGLTAGFARQGAEVIFLDIADEDSRALEAELAGSPIPPVYKRCDLMNLEAIKAVFA
EIGDVDVLVNNAGNDDRHKLADVTGAYWDERINVNLRHMLFCTQAVAPGMKKRGGGAVINFGSISWHLGLEDLVLYETAK
AGIEGMTRALARELGPDDIRVTCVVPGNVKTKRQEKWYTPEGEAQIVAAQCLKGRIVPENVAALVLFLASDDASLCTGHE
YWIDAGWR
>Q8GAK6 1.1.1.179~~~xdh~~~D-xylose dehydrogenase~~~COG0673
MTKTAIVRVAMNGITGRMGYRQHLLRSILPIRDAGGFTLEDGTKVQIEPILVGRNEAKIRELAEKHKVAEWSTDLDSVVN
DPTVDIIFDASMTSLRAATLKKAMLAGKHIFTEKPTAETLEEAIELARIGKQAGVTAGVVHDKLYLPGLVKLRRLVDEGF
FGRILSIRGEFGYWVFEGDVQAAQRPSWNYRKEDGGGMTTDMFCHWNYVLEGIIGKVKSVNAKTATHIPTRWDEAGKEYK
ATADDASYGIFELETPGGDDVIGQINSSWAVRVYRDELVEFQVDGTHGSAVAGLNKCVAQQRAHTPKPVWNPDLPVTESF
RDQWQEVPANAELDNGFKLQWEEFLRDVVAGREHRFGLLSAARGVQLAELGLQSNDERRTIDIPEITL
>Q56837 4.4.1.23~~~xecA1~~~2-hydroxypropyl-CoM lyase~~~COG0620
MLIRGEDVTIPTSMVGNYPNPRWWDAQFARTWTGDQEPPDALIQESLEDAVAAIARDQERAGLDIISDGRVHGDNYAEQA
LYYYYRRLGYDLKGGYLGFPIYSRLHAGTLTGEVRRHGAIMVEQAKALKKATGKPTKVQYTGVQALTQATNDLHYKSSRD
RAMAIAKAINEDIREVDALGVDFIQIDEFTWPYFFEDWAIEAFNAAVDGVKNAKIIAHVCWGNWGGTPAYYPDETAASGE
IFDLTKRKAEATKATATGSIVPKAYEARLDVLNLESCGRRSDDLSGLHVMKNHPLPDNVSFWAGVIDVKSTITETADEVA
NRIRRLLEIVPADRLGVTTDCGLILLQRYIAQDKLHALVEGTKIVRAELAKAKQAA
>Q56839 1.8.1.5~~~xecC~~~2-oxopropyl-CoM reductase, carboxylating~~~COG1249
MKVWNARNDHLTINQWATRIDEILEAPDGGEVIYNVDENDPREYDAIFIGGGAAGRFGSAYLRAMGGRQLIVDRWPFLGG
SCPHNACVPHHLFSDCAAELMLARTFSGQYWFPDMTEKVVGIKEVVDLFRAGRNGPHGIMNFQSKEQLNLEYILNCPAKV
IDNHTVEAAGKVFKAKNLILAVGAGPGTLDVPGVNAKGVFDHATLVEELDYEPGSTVVVVGGSKTAVEYGCFFNATGRRT
VMLVRTEPLKLIKDNETRAYVLDRMKEQGMEIISGSNVTRIEEDANGRVQAVVAMTPNGEMRIETDFVFLGLGEQPRSAE
LAKILGLDLGPKGEVLVNEYLQTSVPNVYAVGDLIGGPMEMFKARKSGCYAARNVMGEKISYTPKNYPDFLHTHYEVSFL
GMGEEEARAAGHEIVTIKMPPDTENGLNVALPASDRTMLYAFGKGTAHMSGFQKIVIDAKTRKVLGAHHVGYGAKDAFQY
LNVLIKQGLTVDELGDMDELFLNPTHFIQLSRLRAGSKNLVSL
>B0TZW0 5.1.1.-~~~~~~L-amino acid-D/L-Glu epimerase~~~COG4948
MSKIIDIKTSIIKIPLKRTFITAVRSTNHIDSLAVELTLDNGVKGYGVAPATTAITGDTLQGMQYIIREIFAPVILGSDL
SDYKQTLELAFKKVMFNSAAKMAIDLAYHDLLAKEQDISVAKLLGAKANSIVTDVSISCGNVAETIQNIQNGVEANFTAI
KVKTGADFNRDIQLLKALDNEFSKNIKFRFDANQGWNLAQTKQFIEEINKYSLNVEIIEQPVKYYDIKAMAEITKFSNIP
VVADESVFDAKDAERVIDEQACNMINIKLAKTGGILEAQKIKKLADSAGISCMVGCMMESPAGILATASFALAEDITVAD
LDPLDWVAKDLYSDYITFNEPNIILKDNLKGFGFNL
>P39797 ~~~xepA~~~Phage-like element PBSX protein XepA~~~
MVKYQYEFPLDKAGKAGAVKPYRGGKNDFVTPVSNLSGVAEILTNAALKATEAYSQLGQDRLGAVLISKVKGWAYADREG
TLFIEESDNNNVWTTTAAVNVAAGVLTATDWVYLSKRYYRFRYVNGNLQQSEFVLYQSVGAGEMDVRVNEKTPLQIDFAE
NQTHDGRLKVEARKTFDFVFHENAESASEGAALPVDGAAHLLVEVYGTAEMSEVKFWGKSVSGQKLPIRGVKTDDATTAS
STLGKAEAWAFDIKGFKEIIMEIISITGGTLSVKGTAVS
>P39776 ~~~xerC~~~Tyrosine recombinase XerC~~~COG4974
MENVKNFVKLFVEYLQIEKNYSQYTIVNYVDSIEEFETFLRVQGINGFEEAAYQDTRIFLTEAYEKGLSRRTISKKISAL
RSFYKFLMREKLIEENPFQLVHLPKQEKRIPKFLYQKELEELFEVSDISQPAGMRDQALLELLYATGMRVSECCSITIND
VDLFMDTVLVHGKGKKQRYIPFGSYAREALKVYMNSGRQCLLMKAKEPHDLLFVNQRGGPLTARGIRHILSGLVQKASST
LHIHPHMLRHTFATHLLNEGADLRSVQELLGHSNLSSTQIYTHVSKEMLRNTYMSHHPRAFKKN
>P0A8P6 ~~~xerC~~~Tyrosine recombinase XerC~~~COG4973
MTDLHTDVERYLRYLSVERQLSPITLLNYQRQLEAIINFASENGLQSWQQCDVTMVRNFAVRSRRKGLGAASLALRLSAL
RSFFDWLVSQNELKANPAKGVSAPKAPRHLPKNIDVDDMNRLLDIDINDPLAVRDRAMLEVMYGAGLRLSELVGLDIKHL
DLESGEVWVMGKGSKERRLPIGRNAVAWIEHWLDLRDLFGSEDDALFLSKLGKRISARNVQKRFAEWGIKQGLNNHVHPH
KLRHSFATHMLESSGDLRGVQELLGHANLSTTQIYTHLDFQHLASVYDAAHPRAKRGK
>P44818 ~~~xerC~~~Tyrosine recombinase XerC~~~COG4973
MLTALNRYWDYLRIERQMSPHTITNYQHQLDATIKILAQQDIHSWTQVTPSVVRFILAESKKQGLKEKSLALRLSALRRF
LSFLVQQGELKVNPATGISAPKQGRHLPKNMDGEQVQQLLANDSKEPIDIRDRAILELMYSSGLRLSELQGLDLNSINTR
VREVRVIGKGNKERVVPFGRYASHAIQEWLKVRALFNPKDEALFVSQLGNRISHRAIQKRLETWGIRQGLNSHLNPHKLR
HSFATHMLEASSDLRAVQELLGHSNLSTTQIYTHLNFQHLAEVYDQAHPRAKRKK
>P9WF35 ~~~xerC~~~Tyrosine recombinase XerC~~~COG4974
MQAILDEFDEYLALQCGRSVHTRRAYLGDLRSLFAFLADRGSSLDALTLSVLRSWLAATAGAGAARTTLARRTSAVKAFT
AWAVRRGLLAGDPAARLQVPKARRTLPAVLRQDQALRAMAAAESGAEQGDPLALRDRLIVELLYATGIRVSELCGLDVDD
IDTGHRLVRVLGKGNKQRTVPFGQPAADALHAWLVDGRRALVTAESGHALLLGARGRRLDVRQARTAVHQTVAAVDGAPD
MGPHGLRHSAATHLLEGGADLRVVQELLGHSSLATTQLYTHVAVARLRAVHERAHPRA
>O31207 ~~~xerC~~~Tyrosine recombinase XerC~~~
MSQIIDVPETLSLAIDSFLSYIEVERRLSPVTVENYQRQLMTIAQMMVAIKINQWSLLESQHVRMLLAKSHRSGLQPASL
ALRFSALRSFLDWQVSQGMLAVNPAKGVRTPKSGRHLPKNMDVDEVSQLMNIDLKDPLSVRDRTMLEVMYGAGLRLSELT
NLNINDIDLQEGEVRVLGKGSKERKVPLGRKAVEWLQHWFAMRELYSPEDTAVFISTKSGKRLSVRSVQKRFELWGVKQG
LSSHVNPHKLRHSFATHLLESSGDLRAVQELLGHANLSTTQVYTHLDFQHLAKVYDAAHPRAKREKS
>Q9CG32 ~~~ynbA~~~Tyrosine recombinase XerD-like~~~COG4974
MKLPNEIDEYLASRNFSENTRSNYHYDLVSLQAFFEDKSLTTENLELYKIQISNLSPAAQRRKISSANQYLLFLYQRQKV
DQYFKIKQVVQKKSQTAQSYHPMIKEFPEFYGPLTCPGQFLALLILEFGLNFAEIQKLKWENFNWNFKYLTIEKAGIKRV
LPIREKFAIRVKAINNADELFAKSRQFLYTELKKFTNYSSKEIREQYILHQVKAGKSIYELATLLGLTTITTLEKYYR
>P0A4S9 ~~~~~~Tyrosine recombinase XerD-like~~~COG4974
MRDRISAFLEEKQGLSVNSKQSYKYDLEQFLDMVGERISETSLKIYQAQLANLKISAQKRKISACNQFLYFLYQKGEVDS
FYRLELAKQAEKKTEKPEILYLDSFWQESDHPEGRLLALLILEMGLLPSEILAIKVADINLDFQVLRISKASQQRIVTIP
TALLSELEPLMGQTYLFERGEKPYSRQWAFRQLESFVKEKGFPSLSAQVLREQFILRQIENKVDLYEIAKKLGLKTVLTL
EKYR
>Q2YR40 ~~~xerD~~~Tyrosine recombinase XerD~~~
MTMRASLAIENFLEMMSAERGAAQNTLESYRRDLEAAAEELAAKGVNLAEAETGHIRMTLDTMAAQGFAPTSQARRLSAL
RQFFRFLYSEGFRQDDPTGILYAPKKQKPLPKIMSVENVGKLLDRAALEANEAAEPGERIKALRLHALLETLYATGLRVS
ELVGLPVTVARTDHRFLLVRGKGSKDRMVPLSRKARDALQKFLTLRDSLPGSDDNPWLFPAFSESGHLARQVFARELKGL
AARAGLAASSASPHVLRHAFASHLLQNGADLRTVQQLLGHADISTTQIYTHVLEERLHKLVSEHHPLAD
>P0A8P8 ~~~xerD~~~Tyrosine recombinase XerD~~~COG4974
MKQDLARIEQFLDALWLEKNLAENTLNAYRRDLSMMVEWLHHRGLTLATAQSDDLQALLAERLEGGYKATSSARLLSAVR
RLFQYLYREKFREDDPSAHLASPKLPQRLPKDLSEAQVERLLQAPLIDQPLELRDKAMLEVLYATGLRVSELVGLTMSDI
SLRQGVVRVIGKGNKERLVPLGEEAVYWLETYLEHGRPWLLNGVSIDVLFPSQRAQQMTRQTFWHRIKHYAVLAGIDSEK
LSPHVLRHAFATHLLNHGADLRVVQMLLGHSDLSTTQIYTHVATERLRQLHQQHHPRA
>P9WF33 ~~~xerD~~~Tyrosine recombinase XerD~~~COG4974
MKTLALQLQGYLDHLTIERGVAANTLSSYRRDLRRYSKHLEERGITDLAKVGEHDVSEFLVALRRGDPDSGTAALSAVSA
ARALIAVRGLHRFAAAEGLAELDVARAVRPPTPSRRLPKSLTIDEVLSLLEGAGGDKPSDGPLTLRNRAVLELLYSTGAR
ISEAVGLDLDDIDTHARSVLLRGKGGKQRLVPVGRPAVHALDAYLVRGRPDLARRGRGTAAIFLNARGGRLSRQSAWQVL
QDAAERAGITAGVSPHMLRHSFATHLLEGGADVRVVQELLGHASVTTTQIYTLVTVHALREVWAGAHPRAR
>O31206 ~~~xerD~~~Tyrosine recombinase XerD~~~
MTQTKLSTQTTTDNNQEDNDVIIEQFLDSIWLEQGLSANTLSAYRLDLQALSQWLVTQKLNWLSVTTLDLHAFLATRLDE
GYKATSAARLLSTLRRFFQYLYREKLRQDDPSALLSTPKLPKRLPKDLSEQQVENLLSAPCIDEPIELRDKAMLEVLYAC
GLRVSELVGLSLSDISLRQGVLRVIGKGDKERLVPLGEEAIYWLEQYLQYGRPALMQGKTDDIVFPSLRGQKMTRQTFWH
RIKHYAVIAGIDSEKLSPHVLRHAFATHLLNHGADLRVVQMLLGHSDLSTTQIYTHVATERLKVLHQQHHPRG
>P0A2P6 ~~~xerD~~~Tyrosine recombinase XerD~~~
MEQDLARIEQFLDALWLERNLAENTLSAYRRDLSMVVAWLHHRGKTLATAQADDLQTLLAERVEGGYKATSSARLLSAMR
RFFQHLYREKYREDDPSAQLASPKLPQRLPKDLSEAQVERLLQAPLIDQPLELRDKAMLEVLYATGLRVSELVGLTMSDI
SLRQGVVRVIGKGNKERLVPLGEEAVYWLETYLEHGRPWLLNGVSIDVLFPSQRAQQMTRQTFWHRIKHYAVLAGIDSEK
LSPHVLRHAFATHLLNHGADLRVVQMLLGHSDLSTTQIYTHVATERLRQLHQQHHPRA
>P0A0P0 ~~~xerD~~~Tyrosine recombinase XerD~~~
METIIEEYLRFIQIEKGLSSNTIGAYRRDLKKYQDYMTEHHISHIDFIDRQLIQECLGHLIDQGQSAKSIARFISTIRSF
HQFAIREKYAAKDPTVLLDSPKYDKKLPDVLNVDEVLALLETPDLNKINGYRDRTMLELLYATGMRVSELIHLELENVNL
IMGFVRVFGKGDKERIVPLGDAVIEYLTTYIETIRPQLLKKTVTEVLFLNMHGKPLSRQAIWKMIKQNGVKANIKKTLTP
HTLRHSFATHLLENGADLRAVQEMLGHSDISTTQLYTHVSKSQIRKMYNQFHPRA
>Q0PA27 ~~~xerH~~~Tyrosine recombinase XerH~~~COG0582
MKYPLDCEENFEKSFLFWLAKYVKFKLNSLSNKELKNPQALAEVNFALTKGVKNIDELDALAKKARNAGLSGVNTYFNPL
KKVFEYLNFYKLYSLKQIDEELIVEVLASITGALSDASKKNYRIAVINFFDFLDKQNEEDEKAHIFDINLKNWAGIAGSK
GVKLPEFMSKEELKKFLDAIENADFKNNTIRNKLIIKIIIFTGIRVSEAINIKMGDISEENDLYIIRIRAKGNKYRVVMI
KKELIYDLLKNVSINYISKDALLFVNKKGTPLTQSYVSRIVEQLLFRAGIRKQKNGAHMLRHTFATLLYKKQKDLVLVQE
ALGHASLNTSRIYTHFDNDKLKLAAQVAKELSDS
>O25386 ~~~xerH~~~Tyrosine recombinase XerH~~~COG4974
MKHPLEELKDPTENLLLWIGRFLRYKCTSLSNSQVKDQNKVFECLNELNQACSSSQLEKVCKKARNAGLLGINTYALPLL
KFHEYFSKARLITERLAFNSLKNIDEVMLAEFLSVYTGGLSLATKKNYRIALLGLFSYIDKQNQDENEKSYIYNITLKNI
SGVNQSAGNKLPTHLNNEELEKFLESIDKIEMSAKVRARNRLLIKIIVFTGMRSNEALQLKIKDFTLENGCYTILIKGKG
DKYRAVMLKAFHIESLLKEWLIERELYPVKNDLLFCNQKGSALTQAYLYKQVERIINFAGLRREKNGAHMLRHSFATLLY
QKRHDLILVQEALGHASLNTSRIYTHFDKQRLEEAASIWEEN
>A2RKP9 ~~~xerS~~~Tyrosine recombinase XerS~~~COG4974
MKREQLIQNIEKLKHVMPPYVLEYYQSKLTIPYSLNTLYEYLKEYERFFSWLVDSGVADVDKITDVSLSVLENLTKRDLE
SFILYLRERPRLNTHSTRYGVSQTTINRTLSALSSLYKYLTEEVENEDGEPYFYRNVMKKVQTKKKSETLASRAENIKGK
LFLGDETQGFLDYIDSEYEKTLSNRARSSFFKNKERDLAIIALILASGIRLSEAVNVDLRDLNLNTMIVEVTRKGGKRDA
VPFAPFAKTYFERYLEVRSQRYKTTAKDTAFFVTLYRDIASRIDPSSVEKLVAKYSQAFKVRVTPHKLRHTLATRLYAQT
NSQVLVSNQLGHASTQVTDLYTHIINEEQKNALDSL
>Q97QP2 ~~~xerS~~~Tyrosine recombinase XerS~~~COG4974
MKREILLERIDKLKQLMPWYVLEYYQSKLAVPYSFTTLYEYLKEYDRFFSWVLESGISNADKISDIPLSVLENMSKKDME
SFILYLRERPLLNANTTKQGVSQTTINRTLSALSSLYKYLTEEVENDQGEPYFYRNVMKKVSTKKKKETLAARAENIKQK
LFLGDETEGFLTYIDQEHPQQLSNRALSSFNKNKERDLAIIALLLASGVRLSEAVNLDLRDLNLKMMVIDVTRKGCKRDS
VNVAAFAKPYLENYLAIRNQRYKTEKTDTALFLTLYRGVPNRIDASSVEKMVAKYSEDFKVRVTPHKLRHTLATRLYDAT
KSQVLVSHQLGHASTQVTDLYTHIVSDEQKNALDSL
>Q7ZAK7 ~~~xerS~~~Tyrosine recombinase XerS~~~COG4974
MKREILLERIDKLKQLMPWYVLEYYQSKLAVPYSFTTLYEYLKEYDRFFSWVLESGISNADKISDIPLSVLENMSKKDME
SFILYLRERPLLNANTTKQGVSQTTINRTLSALSSLYKYLTEEVENDQGEPYFYRNVMKKVSTKKKKETLAARAENIKQK
LFLGDETEGFLTYIDQEHPQQLSNRALSSFNKNKERDLAIIALLLASGVRLSEAVNLDLRDLNLKMMVIDVTRKGGKRDS
VNVAAFAKPYLENYLAIRNQRYKTEKTDTALFLTLYRGVPNRIDASSVEKMVAKYSEDFKVRVTPHKLRHTLATRLYDAT
KSQVLVSHQLGHASTQVTDLYTHIVNDEQKNALDSL
>Q9AEM9 4.1.2.22~~~xfp~~~Xylulose-5-phosphate/fructose-6-phosphate phosphoketolase~~~
MTNPVIGTPWQKLDRPVSEEAIEGMDKYWRVANYMSIGQIYLRSNPLMKEPFTRDDVKHRLVGHWGTTPGLNFLLAHINR
LIADHQQNTVFIMGPGHGGPAGTAQSYIDGTYTEYYPNITKDEAGLQKFFRQFSYPGGIPSHFAPETPGSIHEGGELGYA
LSHAYGAIMDNPSLFVPCIIGDGEAETGPLATGWQSNKLVNPRTDGIVLPILHLNGYKIANPTILARISDEELHDFFRGM
GYHPYEFVAGFDNEDHLSIHRRFAELFETIFDEICDIKAAAQTDDMTRPFYPMLIFRTPKGWTCPKFIDGKKTEGSWRAH
QVPLASARDTEAHFEVLKGWMESYKPEELFNADGSIKEDVTAFMPKGELRIGANPNANGGRIREDLKLPELDQYEITGVK
EYGHGWGQVEAPRSLGAYCRDIIKNNPDSFRVFGPDETASNRLNATYEVTKKQWDNGYLSALVDENMAVTGQVVEQLSEH
QCEGFLEAYLLTGRHGIWSSYESFVHVIDSMLNQHAKWLEATVREIPWRKPISSVNLLVSSHVWRQDHNGFSHQDPGVTS
VLLNKTFNNDHVTNIYFATDANMLLAIAEKCFKSTNKINAIFAGKQPAATWITLDEVRAELEAGAAEWKWASNAKSNDEV
QVVLAAAGDVPTQEIMAASDALNKMGIKFKVVNVVDLIKLQSSKENDEAMSDEDFADLFTADKPVLFAYHSYAQDVRGLI
YDRPNHDNFTVVGYKEQGSTTTPFDMVRVNDMDRYALQAKALELIDADKYADKINELNEFRKTAFQFAVDNGYDIPEFTD
WVYPDVKVDETSMLSATAATAGDNE
>Q70DK5 3.2.1.-~~~xghA~~~Xyloglucanase Xgh74A~~~
MVKKFTSKIKAAVFAAVVAATAIFGPAISSQAVTSVPYKWDNVVIGGGGGFMPGIVFNETEKDLIYARADIGGAYRWDPS
TETWIPLLDHFQMDEYSYYGVESIATDPVDPNRVYIVAGMYTNDWLPNMGAILRSTDRGETWEKTILPFKMGGNMPGRSM
GERLAIDPNDNRILYLGTRCGNGLWRSTDYGVTWSKVESFPNPGTYIYDPNFDYTKDIIGVVWVVFDKSSSTPGNPTKTI
YVGVADKNESIYRSTDGGVTWKAVPGQPKGLLPHHGVLASNGMLYITYGDTCGPYDGNGKGQVWKFNTRTGEWIDITPIP
YSSSDNRFCFAGLAVDRQNPDIIMVTSMNAWWPDEYIFRSTDGGATWKNIWEWGMYPERILHYEIDISAAPWLDWGTEKQ
LPEINPKLGWMIGDIEIDPFNSDRMMYVTGATIYGCDNLTDWDRGGKVKIEVKATGIEECAVLDLVSPPEGAPLVSAVGD
LVGFVHDDLKVGPKKMHVPSYSSGTGIDYAELVPNFMALVAKADLYDVKKISFSYDGGRNWFQPPNEAPNSVGGGSVAVA
ADAKSVIWTPENASPAVTTDNGNSWKVCTNLGMGAVVASDRVNGKKFYAFYNGKFYISTDGGLTFTDTKAPQLPKSVNKI
KAVPGKEGHVWLAAREGGLWRSTDGGYTFEKLSNVDTAHVVGFGKAAPGQDYMAIYITGKIDNVLGFFRSDDAGKTWVRI
NDDEHGYGAVDTAITGDPRVYGRVYIATNGRGIVYGEPASDEPVPTPPQVDKGLVGDLNGDNRINSTDLTLMKRYILKSI
EDLPVEDDLWAADINGDGKINSTDYTYLKKYLLQAIPELPKK
>Q3MUH7 3.2.1.-~~~~~~Xyloglucanase~~~
MKTFLGKKLWMASLAVALAAGSFAALPEMTSAAPSEPYTWKNVVTGAGGGFVPGIIFNESEKDLIYARTDIGGAYRWNPA
NESWIPLTDFVGWDDWNKNGVDALATDPVDPDRVYLAVGTYTNSWDKNNGAILRSTDRGDTWQTTTLPFKVGGNMPGRSM
GERLVVDPNDNRILYFGARSGNGLWRSSDYGATWSKVTSFPNPGTYVQDPANEYGSDIVGLAWITFDKSSGQVGQATQTI
YVGVADTAQSIYRSTDGGATWTAVPGQPTGYLPHHGVLDADGSLYITYSNGVGPYDGTKGDVWKLNTSTGAWTNISPIPS
SSADNYFGYGGLAVDAQEPGTLMVATLNSWWPDAILFRSKDGGTTWTRIWEFDGYPNRKFRYTQNISAAPWLTFGTTPAP
PEVSPKLGWMIGDLEIDPFDSDRMMYGTGATIYGTNNLTNWDNNEKIDISVMAKGVEEMAVLDLVSPPSGAHLVSGLGDV
NGFRHDDLDQPPAKMFSSPNYASTESLDFAELNPSTMVRVGKADYAADPNAKSIGLSSDGGTNWYKANAEPAGTAGGGTV
AISSDGSKLVWSTSDKGVHYSSTGGNSWTASTGIPAQAKVISDRVNPNKFYGFAAGKIYVSVNGGVSFSQTAAAGLPVDG
NADLDAVPGVEGELWFAGGNEDGGPYGLWHSTDSGASFAKLSNVEEADSIGFGKAAPGRNSAALYAVAQIDGTRGFFRSD
DGGASWVRINDDAHQYARVTTITGDPRIYGRVYLGTNGRGILYADPVGGNNGGETPPVSHSGISPQSTEFDLNADRQADI
PVALTLNGNTLASIRNGNHVLVQGSDYTMSGSQVFLSKTYLATLSKGVQSLVFRFSAGNDATLSITVKDTTQVPLPEGSI
RIEMYNGTTSATANSINPKFKLTNTGTAPLQLADVNIRYYYTIDGEKPLNFFCDWATAGSANVTGTFSALPAAVNGADHV
LEIGFTASAGTLAAGQSTEVQVRFSKTDWTNFTQTDDYSFAASSTAYENWSKVTGYVSGTLQWGIEP
>P0A9M5 2.4.2.-~~~gpt~~~Xanthine-guanine phosphoribosyltransferase~~~COG2236
MSEKYIVTWDMLQIHARKLASRLMPSEQWKGIIAVSRGGLVPGALLARELGIRHVDTVCISSYDHDNQRELKVLKRAEGD
GEGFIVIDDLVDTGGTAVAIREMYPKAHFVTIFAKPAGRPLVDDYVVDIPQDTWIEQPWDMGVVFVPPISGR
>Q8ZC05 2.4.2.-~~~gpt~~~Xanthine-guanine phosphoribosyltransferase~~~COG2236
MNEKYVVTWDMLQIHARKLAQRLLPAEQWKGIIAVSRGGLVPAGILARELGIRYVDTVCISSYDHDNQRDLKVLKRAEGD
GEGFIVIDDLVDTGGTATAIREMYPKAHFVTIFAKPAGRPLVDDYVVDIPQNTWIEQPWDMAVTFVAPLSGK
>O31490 ~~~xis~~~ICEBs1 excisionase~~~
MKGEFLTARDIQKILGVKQAKSYDIIRTLNAQMKEEGYMVIQGKVSRAKFEECYCYKGPKSQTG
>P54327 ~~~xkdG~~~Putative prophage capsid protein XkdG~~~COG4653
MRNQEIIRKAEMSLSALKSGGLMNPAQASAFIRMVQNTPTIFSESRVIQMENDSQKFEKIGFGQRILRAAQEGKALSNDE
LTVPTTSTVQLNTKEVIAEINITYDTLENNIEKDGLQQTIMQILAERAAVDIEELIVNGDTASADPYLAQLDGIRKQAVS
HIVDMNGEELSRATFKKGLKAVPPKYLRIPQEFRFYTSHGLEVEWKDRVADRQTNLGDQAVQGGLSTAFGVPVKGVSNIQ
PYTVGEGDAQYDASDIILTHPKNIILGFSRNIRIEVDKDIRSRKFIIVLTAKLDSKFEEEDACAKLINVKE
>P54328 ~~~xkdH~~~Phage-like element PBSX protein XkdH~~~
MSYRQMLIHRCDIYHEAAQAPSAGRFGIPADRLQPVISYPDTPDEQDVPCYFTEKTQQLIQEEPDQTVYHSFLVHFPLSA
DIRVNDKIIWENHKYILKLPKRIRHHHWEVVAVRDESL
>P54332 ~~~xkdM~~~Phage-like element PBSX protein XkdM~~~
MALKAQNTISGKEGRLFLDGEEMAHIKTFEANVEKNKSEVNIMGRRMTGHKTTGANGTGTATFYKVTSKFVLLMMDYVKK
GSDPYFTLQAVLDDQSSGRGTERVTLYDVNFDSAKIASLDVDSEALEEEVPFTFEDFDVPEKLSDTF
>P54342 ~~~xkdW~~~Phage-like element PBSX protein XkdW~~~
MILYDAIMYKYPNAVSRKDFELRNDGNGSYIEKWNLRAPLPTQAELETWWEELQKNPPYEPPDQVELLAQELSQEKLARK
QLEELNKTLGNELSDIKLSLLSLKGDYAE
>A9ZND1 3.2.1.72~~~xloA~~~Xylan 1,3-beta-xylosidase~~~
MTTTIQNPILKGFNPDPSIVRVGDDYYIATSTFEWFPGIQLHHSRDLINWRLVGHALTRTSQLNMMGMDNSEGVYAPALT
YSDGTFWLCFSNVHSCRGGNWMATPSYVVTADSIEGPWSEPVPIGNYGFDPSLFHDDDGKKYMLNMIWGGRAKTNFFGGI
IMQEFDADEGKLVGAPKTVFEGTELGCTEGPQLLKKDDYYYLITAEGGTERNHAVTVCRSKHIWGPYEVHPENPILTSRF
QEHAELSRAGHGFLVETQTGEWYMSHLCGRRIPNPEGYQFMPKYDNGFSILGRESALQKAHWQDDWPYIATGKTPVVEVE
APNLPLHPWPESPARDEFIDPTLSLISTLREPVSEKWLSLSERPGFLRLKGRHYLYSRYEQSMVARRFQAHNATVETKLE
FKPNTPYEMAGLCAYYARNGHYFLKMTANDLGERVLQVVGNINDVYGEYSNDVVIGDADTVYMRLELKTQWYQYSYSLDG
VDWYEIGPALNSTPLSDEGGPDIFRFTGSFAALFVADITGQKRHADFDYFEYLEH
>P39800 3.5.1.28~~~xlyA~~~N-acetylmuramoyl-L-alanine amidase XlyA~~~COG3409
MVNIIQDFIPVGANNRPGYAMTPLYITVHNTANTAVGADAAAHARYLKNPDTTTSWHFTVDDTEIYQHLPLNENGWHAGD
GNGSGNRASIGIEICENADGDFAKATANAQWLIKTLMAEHNISLANVVPHKYWSGKECPRKLLDTWDSFKAGIGGGGSQT
YVVKQGDTLTSIARAFGVTVAQLQEWNNIEDPNLIRVGQVLIVSAPSAAEKPELYPLPDGIIQLTTPYTSGEHVFQVQRA
LAALYFYPDKGAVNNGIDGVYGPKTADAVARFQSVNGLTADGIYGPATKEKIAAQLS
>P38506 3.1.-.-~~~ygdG~~~Flap endonuclease Xni~~~COG0258
MAVHLLIVDALNLIRRIHAVQGSPCVETCQHALDQLIMHSQPTHAVAVFDDENRSSGWRHQRLPDYKAGRPPMPEELHDE
MPALRAAFEQRGVPCWSTSGNEADDLAATLAVKVTQAGHQATIVSTDKGYCQLLSPTLRIRDYFQKRWLDAPFIDKEFGV
QPQQLPDYWGLAGISSSKVPGVAGIGPKSATQLLVEFQSLEGIYENLDAVAEKWRKKLETHKEMAFLCRDIARLQTDLHI
DGNLQQLRLVR
>Q4UWF4 2.7.7.-~~~xopAC~~~Uridine 5'-monophosphate transferase~~~
MDKNLNLWDMSTFIQQYGALTADHPTHTPEDSPQTVPSPRSSSAHSPEIQELRSLQETRPARLGARSQSRSSKHGLQQCS
SSPSDESFRLHAELAAWCERVETKPSLLAKLGCCAAPPVVGDHREQRREAMERIMRCLDAGQAGTQLTLRDLNLSQLPPG
LHRLAHLRDLDVADNVNLTRLPEDLSLCKHLERINADGCSIAALPSKIGALKNLSEISLAFNELRTLPDSIGQCSSLTTI
VVPGCKINKLPASLANLTQLKKLDVAANIELSELSPHMNLDDVAVHSTQTRLGLMHRIFKAPTFDPETRQRLSYQASALR
DRWAALSHHLSPQARARVDQMREGASTTLSSQDHKASTAWKTATEKVSSWAEEGAPITLDRIFKLNQLLLPEGDDDNDPI
GGQLRKVGIQAAPSNTWTECRYPPPETLKDEMAKFSGWLEHSEQQAHARDALGHIEFAAQLHQRLVSLHPFDDANGRTAR
LAMDWALQRHGLPPAPPVGEASRLPASFLGGKRVSPEKVVLETLEGIATVMNQVHQ
>A6WE36 3.6.4.12~~~XPB~~~DNA helicase XPB~~~COG1061
MTDGPLIVQSDKTLLLEVDHPRAGACRAAIAPFAELERAPEHVHTYRLTPLGLWNARAAGHDAEQVVDTLLEFSRYSVPH
ALLVDVAETMARYGRLQLVKDEEHGLVLRSLDPAVLEEVLRSRKSAPLLGTRIAPDAVLVHPSERGNLKQVLLKLGWPAE
DLAGYVDGEAHAIDLAEDGWALRPYQSEAVDNFWNGGSGVVVLPCGAGKTLVGAAAMAKARATTLILVTNTVSARQWRDE
LLKRTSLTEDEIGEYSGARKEIRPVTIATYQVVTTKRKGVYPHLELFDARDWGLILYDEVHLLPAPIFRMTADLQARRRL
GLTATLVREDGREGDVFSLIGPKRYDAPWKDIEAQGYIAPADCVEVRVTLPDAERLAYATAEDDEKYRLCSTSLSKSRVV
EKLVAQHAGEPTLVIGQYIDQLDDLAARLDAPVIKGETTVKERQRLFDAFRHGEITTLVVSKVANFSIDLPEAKVAIQVS
GSFGSRQEEAQRLGRVLRPKGDHGSARFYTVVSRDTKDQDYAAHRQRFLAEQGYAYRIVDADDIDGGVPDADGVLPG
>O53873 3.6.4.12~~~XPB~~~DNA helicase XPB~~~COG1061
MTDGPLIVQSDKTVLLEVDHELAGAARAAIAPFAELERAPEHVHTYRITPLALWNARAAGHDAEQVVDALVSYSRYAVPQ
PLLVDIVDTMARYGRLQLVKNPAHGLTLVSLDRAVLEEVLRNKKIAPMLGARIDDDTVVVHPSERGRVKQLLLKIGWPAE
DLAGYVDGEAHPISLHQEGWQLRDYQRLAADSFWAGGSGVVVLPCGAGKTLVGAAAMAKAGATTLILVTNIVAARQWKRE
LVARTSLTENEIGEFSGERKEIRPVTISTYQMITRRTKGEYRHLELFDSRDWGLIIYDEVHLLPAPVFRMTADLQSKRRL
GLTATLIREDGREGDVFSLIGPKRYDAPWKDIEAQGWIAPAECVEVRVTMTDSERMMYATAEPEERYRICSTVHTKIAVV
KSILAKHPDEQTLVIGAYLDQLDELGAELGAPVIQGSTRTSEREALFDAFRRGEVATLVVSKVANFSIDLPEAAVAVQVS
GTFGSRQEEAQRLGRILRPKADGGGAIFYSVVARDSLDAEYAAHRQRFLAEQGYGYIIRDADDLLGPAI
>Q937F6 4.1.2.9~~~xpkA~~~Xylulose-5-phosphate phosphoketolase~~~
MSTDYSSPAYLQKVDKYWRAANYLSVGQLYLKDNPLLQRPLKASDVKVHPIGHWGTIAGQNFIYAHLNRVINKYGLKMFY
VEGPGHGGQVMVSNSYLDGTYTDIYPEITQDVEGMQKLFKQFSFPGGVASHAAPETPGSIHEGGELGYSISHGVGAILDN
PDEIAAVVVGDGESETGPLATSWQSTKFINPINDGAVLPILNLNGFKISNPTIFGRTSDEKIKQYFESMNWEPIFVEGDD
PEKVHPALAKAMDEAVEKIKAIQKNARENDDATLPVWPMIVFRAPKGWTGPKSWDGDKIEGSFRAHQIPIPVDQTDMEHA
DALVDWLESYQPKELFNEDGSLKDDIKEIIPTGDARMAANPITNGGVDPKALNLPNFRDYAVDTSKHGANVKQDMIVWSD
YLRDVIKKNPDNFRLFGPDETMSNRLYGVFETTNRQWMEDIHPDSDQYEAPAGRVLDAQLSEHQAEGWLEGYVLTGRHGL
FASYEAFLRVVDSMLTQHFKWLRKANELDWRKKYPSLNIIAASTVFQQDHNGYTHQDPGALTHLAEKKPEYIREYLPADA
NSLLAVGDVIFRSQEKINYVVTSKHPRQQWFSIEEAKQLVDNGLGIIDWASTDQGSEPDIVFAAAGTEPTLETLAAIQLL
HDSFPDMKIRFVNVVDILKLRSPEKDPRGLSDAEFDHYFTKDKPVVFAFHGYEDLVRDIFFDRHNHNLHVHGYRENGDIT
TPFDVRVMNQMDRFDLAKSAIAAQPAMENTGAAFVQDMDNMLAKHNAYIRDAGTDLPEVNDWQWKGLK
>P29040 ~~~xpsN~~~General secretion pathway protein N~~~
MRLEMIGLRTWLLATVVGWALLVCVLAVAGLGKRVELLPDDPALVQRLPALPAPAPERLGPFEKYAEIAAHPAFAEDRLP
HPFFLSGNDGSGAASTVRLTGVLLTSTFKMATLTLDPADSVRVQLGGDAVKGYRLLALQPRSATIEGPGGTQTLELQVFN
GQGGQPPTAIGGRPQAPGAVPPLPPNVPPAPATPAPPPAEVPQQQPGGQAPPTVPPQRSDGAQEAPRPSDEQMRAIRERI
EARRRQLQQQRQGGSTPGQTQ
>P42085 2.4.2.22~~~xpt~~~Xanthine phosphoribosyltransferase~~~COG0503
MEALKRKIEEEGVVLSDQVLKVDSFLNHQIDPLLMQRIGDEFASRFAKDGITKIVTIESSGIAPAVMTGLKLGVPVVFAR
KHKSLTLTDNLLTASVYSFTKQTESQIAVSGTHLSDQDHVLIIDDFLANGQAAHGLVSIVKQAGASIAGIGIVIEKSFQP
GRDELVKLGYRVESLARIQSLEEGKVSFVQEVHS
>Q831Y0 2.4.2.22~~~xpt~~~Xanthine phosphoribosyltransferase~~~COG0503
MKELVERIKNDGRVLGEGVLKVDSFITHQVDPELMEAMGNRFAEVFAEAGITKVITIEASGIAPALYAAQKLGVPMIFAR
KAKSLTMDEELLTASVYSFTKQVTSQISISRKFLSDADKVLIIDDFLANGQAAKGLVELCQQAGAKVEGIGIVIEKSFQD
GRQLLEDMGLNVVSLARIASLSEGTVTFLEEDA
>Q7A7I5 2.4.2.22~~~xpt~~~Xanthine phosphoribosyltransferase~~~
MELLGQKVKEDGVVIDEKILKVDGFLNHQIDAKLMNEVGRTFYEQFKDKGITKILTIEASGIAPAIMAALHFDVPCLFAK
KAKPSTLTDGYYETSIHSFTKNKTSTVIVSKEFLSEEDTVLIIDDFLANGDASLGLYDIAQQANAKTAGIGIVVEKSFQN
GHQRLEEAGLTVSSLCKVASLEGNKVTLVGEE
>P23789 ~~~xre~~~HTH-type transcriptional regulator Xre~~~COG1396
MIGGRLKSLRGKRTQEEIASHIGVSRARYSHYENGRSEPDYDTLQKLADYFQVTTDYLLTGKDKKSDDDMFSDPDLQLAY
RDMQDFSPESKQQAIEFINYLKEKEKNRKPKNK
>Q7N4I0 ~~~xre~~~Antitoxin Xre~~~COG5642
MRVFTPSHKVQSNPLWRTVGFPSSSGPHLNAILNEGLPVTIVDKIQNWSTFGKGDILRIAGIQIRSYSRRCSGKGKFTAD
ESQRIARFVRVMDHAVDLFNGDKDKAAQWMKRPIRGLGYVTPESMLDTESGALDVMNLIGRIEHGIVS
>Q88K58 ~~~xre~~~Antitoxin Xre~~~COG5642
MSANAEKEHAMLAEVLRDNGYHEYRARLQALLDIPELASDFEIHTRITDGFAATWLVKLTERGVLTPVERDQIIPLRTLK
SRIERDQPLTVDESDRLFRSAHITAMAEAVFGEAGKAKRWLSKPKERFSGLTPMQMLTTQQGTTQVEEMLLQIAEGYGL
>A1JNG6 ~~~xre~~~Antitoxin Xre~~~COG5642
MRAYHPTPATKTRALWREIGLPASRGTVLVDSIKMGFSVDVIDSIHLWASIPKAEILRATGIPSRSLTRRRTHDGRFTPE
ESERIARFVRVMDAAVDLFGGDKGKAITWMSTPIKGLGHRSPDSLLETETGALEVCDLIGRLEHGVFS
>Q84H41 2.3.3.15~~~xsc~~~Sulfoacetaldehyde acetyltransferase~~~
MAATDNRKVVEGVHKMTPSEAFVETCVANGVSEMFGIMGSAFMDAMDIFAPAGIRLIPVVHEQGAAHMADGYARVSGRHG
VVIGQNGPGISNCVTGIAAAYWAHSPVVIVTPETGTMGMGLGGFQEANQLPMFQEFTKYQGHVCNPKRMAEFTGRVFDRA
MSEMGPTQLNIPRDYFYGEIECEIPKPMRVDRGHGGEASLQAAVELLKTAKFPVILAGGGVVMGDAVEEAKQLAERLGAP
VATGYLRNDAFPAKHPLWAGPLGYQGSKAAMKLIAQADVVIALGSRMGPFGTLPQHGMDYWPKAAKIIQIEADHTNLGLV
KKIAVGINGDAKAVAAELSRRLADVTLGCDATKAARADTIATEKAAWEKELDGWTHERDPYSLDMIEEAKGERTPTGGSY
LHPRQVLRELEKAMPARVMVSTDIGNINSVANSYLRFDEPRSFFAPMSFGNCGYALPTIIGAKCAAPDRPAIAYAGDGAW
GMSMMEIMTAVRHDIPVTAVVFHNRQWGAEKKNQVDFYNRRFVAGELESESFSDIAKAMGAEGIVVDHIEDVGPALQKAI
DMQMKEGKTCVIEIMCTRELGDPFRRDALSKPVRMLDKYKDYV
>Q84H44 2.3.3.15~~~xsc~~~Sulfoacetaldehyde acetyltransferase~~~
MANDTRQVVQGVQEMTPSEAFVETMVANGVTEIFGIMGSAFMDAMDIFAPAGIKLIPVVHEQGAAHMADGFARVSGRTGV
VIGQNGPGISNCVTAIAAAYWAHTPVVIVTPEAGTTGIGLGGFQEARQLPMFQEFTKYQGHVTHPARMAEYTARCFARAR
DEMGPAQLNIPRDYFYGKIKCEIPLPQPLDRGPGGAQSLDAAARLLAEAKFPVIISGGGVVMGDAVEECKALAERLGAPV
VNSYLHNDSFPASHPLWCGPLGYQGSKAAMKLLADADVVLALGTRLGPFGTLPQHGLDYWPKNARIIQVDADSKMLGLVK
KITVGVCGDAKASAAEISRRIDGMKLACDANKAERAARIQAEKDAWEQELTDWTHERDPFSLDMIEEQSKEEGNWLHPRQ
VLRELEKAMPEDVMVSTDIGNINSVANSYLRFEKPRSFFAAMSWGNCGYAFPTIIGAKVAAPHRPAVSYAGDGAWGMSMS
EIMTCVRHDIPVTAVVFHNRQWGAEKKNQVDFYNRRFVAGELESESFAGIARAMGAEGVVVDRIEDVGPALKKAIDAQMN
DRKTTVIEIMCTRELGDPFRRDALSKPVRLLEKYRDYT
>Q93PS3 2.3.3.15~~~xsc~~~Sulfoacetaldehyde acetyltransferase~~~
MAKVKMTPSEAMTEVLVNEGVTHVTGILGSAFMDMLDLWPTAGIEFIAVRHEQTAGHMQDAYCRITGKASVCIGQNGPGV
TNLVTCVAAANQAHTPMVVLGPSAGTPTVGWDGFQECDQVSIFRSITKQVLQVPHPSRAGDVLRTAFRIAYAERGPVYVD
IPRNYFYGEVYEEILRPDQYRAMNVRGAGDATELARATEILAAAKNPVIISGRGVVDADAFAEVKEIAHMLTAPVAMSYL
HNDTYPADDELWVGPIGYMGAKSAMYSLQDADVILAIGSRLSVFGTLPQYDINYFPENAKIIQIEVNPKQIGRRHPVTVP
IIGDAKLATAELIKLLKAKGDVKPNAERLAKIQERRNDWFKEIEEMAMMPGNPINPRRVLFEVAKLMPEDAILTTDIGNV
ASTANSYFKFTKPKKHIAALTFGNTGFAYQAGLGAQMAEPDSPVVAIVGDGAWGQSLHEISTAVQYKLPVIACVFRNMAW
CAEKKNQIDFYNNRFVGTEIPNPISFIPAAEAFGAKGIRVEKPEDIADAFKQGLAWRAEGHPVVLEFVVDGTILAPPFRK
DALALPTRYLPKYEHLDAKYFPKN
>D5AKX8 2.3.3.15~~~xsc~~~Sulfoacetaldehyde acetyltransferase~~~COG0028
MRMTTEEAFVKVLQRHGIDTAFGIIGSAFMPISDLFPRAGIRFFDCAHEGSGGMMADGFTRASGRMAMIIAQNGPGVTNF
VTAVKTAYWNHTPMLVVTPQAANRTIGQGGFQEVEQMALFRDMVCWQEELRDPARIAEVLDRVIRKARRASAPAQINLPR
DMFTKIIDIELPQGVDLPRPAPDAQALDRAAALLSSARFPVILNGAGVVLAEAIPDTVALAERLEAPVCTGYQHNDAFPG
SHPLFAGPLGYNGSKAAMQLMSQADVVLCLGTRLNPFSTLPGYGIDYWPKAAAVIQVDINPDRIGLTRPVTLGIAADAGA
VARGILARLGAQAGDQDRAERAARIATTKSRWAQELASMDHEEDDPGTSWNERARAAKPGWMSPRMAWRAITAALPPEAI
LSSDIGNNCAIGNAYPSFAAGRKYLAPGLFGPCGYGLPAIIGAKIACPETPVVGFAGDGAFGISVTELTAIGRADWPAIT
MVVFRNYQWGAEKRNSTLWYDDNFVGTELDLQVSYAGIAQACGLQGVVARTMEELTEALRKALADQAAGKTTLIEALINQ
ELGEPFRRDAMTKPVVVAGIDPADMRPQPR
>A3SR25 2.3.3.15~~~xsc~~~Sulfoacetaldehyde acetyltransferase~~~COG0028
MLFRASQPEDKPMKMTTEEAFVKTLQMHGIQHAFGIIGSAMMPISDIFGKAGITFWDCAHEGSGGMMADGYTRATGKMSM
MIAQNGPGITNFVTAVKTAYWNHTPLLLVTPQAANKTMGQGGFQEVEQMAAFKDMVCYQEEVRDPTRMAEVLNRVILNAK
RYSAPAQINVPRDYFTQVIDIELPKIVDFERPSGGEEALDEAAKLLSEAKFPVILNGAGVILAGAIPATAELAERLDAPV
CCGYQHNDAFPGSHPLHAGPLGYNGSKAGMELISKADVVLALGTRLNPFSTLPGYGIDYWPKDAKIIQVDVKPERIGLTK
PVAVGIVGDAKKVAKTILAKLSDTAGDADREERKATIAKTKSAWAQELSSMDHEQDDPGTTWNERARGAKPDWMSPRMAW
RAIQAALPKEAIISSDIGNNCAIGNAYPSFEEGRKYLAPGLFGPCGYGLPAVVGAKIGCPDTPVVGFSGDGAFGIAVNEL
TAIGRGEWPAVTHVVFRNYQWGAEKRNSTLWFDDNFVGTELDEQVSYAGIAKACGLKGVVARTMDELTDALDQAIKDQKA
GTTTLIEAMINQELGEPFRRDAMKKPVAVAGIDPADMREQQVD
>Q59675 3.2.1.8~~~~~~Endo-beta-1,4-xylanase Xyn10C~~~
MKKIQQLLMLSLISSTLIACGGGGGGGSTPTTSSSPQSSSPASTPSSASSSSIISSSSLSSSLSSSSLSSSSLSSSSASS
VSSSSVAASEGNVVIEVDMANGWRGNASGSTSHSGITYSADGVTFAALGDGVGAVFDIARPTTLEDAVIAMVVNVSAEFK
ASEANLQIFAQLKEDWSKGEWDCLAASSELTADTDLTLTCTIDEDDDKFNQTARDVQVGIQAKGTPAGTITIKSVTITLA
QEAYSANVDHLRDLAPSDFPIGVAVSNTDSATYNLLTNSREQAVVKKHFNHLTAGNIMKMSYMQPTEGNFNFTNADAFVD
WATENNMTVHGHALVWHSDYQVPNFMKNWAGSAEDFLAALDTHITTIVDHYEAKGNLVSWDVVNEAIDDNSPANFRTTDS
AFYVKSGNSSVYIERAFQTARAADPAVILYYNDYNIEQNNAKTTKMVDMVKDFQARSIPIDGVGFQMHVCMNYPSIANIS
AAMKKVVDLGLLVKITELDVAVNQPHCDAYPANKINPLTEAAQLAQKKRYCDVVKAYLDTVPVNQRGGISVWGTTDANTW
LDGLYREQFEDEKISWPLLFDNNYNDKPALRGFADALIGTQCTNTH
>Q59674 ~~~~~~Bifunctional xylanase/xylan deacetylase~~~
MKLPTLGKCVVRTLMGAVALGAISVNAQTLSSNSTGTNNGFYYTFWKDSGDASMTLLSGGRYQSSWGNSTNNWVGGKGWN
PGNNSRVISYSGSYGVDSSQNSYLALYGWTRSPLIEYYVIESYGSYNPASCSGGTDYGSFQSDGATYNVRRCQRVNQPSI
DGTQTFYQYFSVRNPKKGFGNISGTITFANHVNFWASKGLNLGNHNYQVLATEGYQSRGSSDITVSEGTSGGGTSSVGGA
SSSVNSSTGGGSSGGITVRARGANGSEHINLRVGGAVVANWTLGTSFQNYLYSGNASGDIQVQFDNDASGRDVVVDYIIV
NGETRQAEDMEHNSAVYANGRCGGGSYSENMHCNGEIGFGYTYDCFSGNCSGGNGGSNSSAGNSSSGNTGGGGSNCSGYV
GITFDDGPNSNTATLVNLLRQNNLTPVTWFNQGNNVASNAHLMSQQLSVGEVHNHSYTHPHMTSWTYQQVYDELNRTNQA
IQNAGAPKPTLFRPPYGELNSTIQQAAQALGLRVVTWDVDSQDWNGASAAAIANAANQLQNGQVILMHDGSYTNTNSAIA
QIATNLRAKGLCPGRIDPNTGRAVAPSSSGGSSSVALSSSSRSSSSAGGNTGGNCQCNWWGTFYPLCQTQTSGWGWENSR
SCISTSTCNSQGTGGGGVVCN
>P83513 ~~~~~~Bifunctional xylanase/deacetylase~~~
MSATLLVPSMTVKAADTIYNNKTGNQDGYDYELWKDTGNTSMTLNAGGTFDCSWSNINNALFRKGKKFDSTQTYQQIGNI
TFDYGCDYRPNGNSYLCVYGWTVDPLVEYYIVDSWGTWRPPGGTPKGQIQVDGGTYDVYETTRYNAPSIQGDTTFKQYFS
VRTSKRTSGTISVSEHFKAWERMGMRCGNFMKPALNIEGYQSSGSASVYKNNMTIGGSSSSSGNQGGNQGGNTGNENAGN
NLVTVADADKIQCETMTKSGQYTGNISSPFNGVALYANNDAVKYTQYFASGTHDFTLRGCSNNNKMARVDLKIGGQNKGT
FYYGDSYPAEYTIKNVSHGTGNQTIELVVTADDGQWDAYLDYFNNSVEPGCSLVPGAVVVLVALGSSSNTGNNSGTNTQN
QKLIALTFDDGPSSTTSQVLDMLEKYNVKATFFLIGQNVNSNTASIVQRQVKMGCELACHSYTHEDMTKMNASQIRNQID
WTASAIKNTAGVDVKFFRPPYISVNNTMYQNIDLPFIQGSMHNDWESSTSASQRVNSVLSSAKDGDIILLHDFQGNSQTV
SALPQIIEGLKNQGYTFVTVSELFEMKGVNPNVEYKIWSNVK
>Q8VP72 3.2.1.8~~~~~~Endo-1,4-beta-xylanase Xyn11B~~~
MKIFQNTKNVIVSIAWAAALCTSAVSAQTLTSNSTGTNNGFYYTFWKDSGDASMTLLSGGRYQSSWNSSTNNWVGGKGWN
PGSSSRVISYSGYYGVDSSQNSYLALYGWTRSPLIEYYVIESYGSYNPASCSGGTDYGSFQSDGATYNVRRCQRVNQPSI
DGNQTFYQYFSVRNPKKGFGNISGTITFANHANFWATKGLNLGNHNYQVLATEGYQSRGSSDITVSQGGSSGGGNSSSSS
SASGGGSKIIVVRARGTAGGESITLRVGNTNVATWTLTTTMTNYTATTSASGGSLVQYTNDSGNRDVQVDYISVNGSIRQ
SEDQTYNTGVYQNGSCGGGNGRSEWLHCNGAIGYGDI
>D5EY13 ~~~~~~Endo-1,4-beta-xylanase/feruloyl esterase~~~COG2382
MKKLLVALSLIAGSLTASAQWGRPVDYAAGPGLKDAYKDYFTVGVAVNKFNISDPAQTAIVKKQFNSVTAENAWKPGEIH
PKEGVWNFGLADSIANFCRENGIKMRGHCLCWHSQFADWMFTDKKGKPVKKEVFYQRLREHIHTVVNRYKDVVYAWDVVN
EAMADDGRPFEFVDGKMVPASPYRQSRHFKLCGDEFIAKAFEFAREADPTGVLMYNDYSCVDEGKRERIYNMVKKMKEAG
VPIDGIGMQGHYNIYFPDEEKLEKAINRFSEIVNTIHITELDLRTNTESGGQLMFSRGEAKPQPGYMQTLQEDQYARLFK
IFRKHKDVIKNVTFWNLSDKDSWLGVNNHPLPFDENFKAKRSLQIIRDFDAAMDNRKPKEDFVPNPMNQPGQEYPMVNSE
GYARFRVEAPDAKSVIVSLGLGGRGGTVLRKDKNGVWTGTTEGPMDPGFHYYHLTIDGGVFNDPGTHNYFGSCRWESGIE
IPAKDQDFYAYRKDINHGNIQQVTFWSESTGKMQTANVYLPYGYGKVVKGKQERYPVLYLQHGWGENETSWPVQGKAGLI
MDNLIADGKIKPFIVVMAYGLTNDFKFGSIGKFTAEEFEKVLIDELIPTIDKNFLTKADKWNRAMAGLSMGGMETKLITL
RRPEMFGYWGLLSGGTYMPEEIKDPKAVKYIFVGCGDKENPEGINKSVEALKAAGFKAEGLVSEGTAHEFLTWRRCLEKM
AQSLFK
>D5EY15 3.2.1.37~~~xyl3A~~~Xylan 1,4-beta-xylosidase~~~COG1472
MKYQLFLSLALCVGLGASAQTLPYQNPNLSAKERAVDLCSRLTLEEKAMLMLDESPAIPRLGIKKFFWWSEALHGAANMG
NVTNFPEPVGMAASFNPHLLFKVFDIASTEFRAQYNHRMYDLNGEDMKMRSLSVWTPNVNIFRDPRWGRGQETYGEDPYL
TSVMGVQVVKGLQGPEDARYRKLWACAKHYAVHSGPEYTRHTANLTDVSARDFWETYMPAFKTLVKDAKVREVMCAYQRL
DDDPCCGSTRLLQQILRDEWGFEYLVVSDCGAVSDFYENHKSSSDAVHGTSKAVLAGTDVECGFNYAYKSLPEAVRKGLL
SEKEVDKHVIRLLEGRFDLGEMDDPSLVEWSKIPYSAMSTKASANVALDMARQTIVLLQNKNNILPLKKNAEKIAIIGPN
AHNEPMMWGNYNGTPNHTVTILDGVKAKQKKLVYIPGCDLTNDKVMECHLATDCVTPDGKKGLKGTFWNNTEMAGKPFTT
EYYTKPVNVTTAGMHVFAPNLPIEDFSAKYETTFTAKEAGEYVVNVESTGHFELYVNGKQQFVNHIWRATPTRTVLKAEK
GQKFDIEVRFQTVKTWGASMKIDVARELNIDYQETIAQLKGINKVIFCGGIAPSLEGEEMPVNIEGFKGGDRTSIELPKV
QREFLKALKAAGKQVIYVNCSGSAIALQPETESCDAIVQAWYPGQEGGTAVADVLFGDYNPGGKLSVTFYKNDQQLPDYE
DYSMKGRTYRYFDDALFPFGYGLSYTTFEVGEAKVEAATDGALYNVQIPVTNTGTKNGSETIQLYIRNLQDPDGPLKSLR
GFERLDIKAGKTATANLKLTKESLEFWDAETNTMRTKPGKYEILYGTSSLDKDLKKLTITL
>P45702 3.2.1.37~~~xylA~~~Beta-xylosidase~~~
MPTNLFFNAHHSPVGAFASFTLGFPGKSGGLDLELARPPRQNVLIGVESLHESGLYHVLPFLETAEEDESKRYDIENPDP
NPQKPNILIPFAKEEIQREFHVATDTWKAGDLTFTIYSPVKAVPNPETADEEELKLALVPAVIVEMTIDNTNGTRARRAF
FGFEGTDPYTSMRRIDDTCPQLRGVGQGRILSIVSKDEGVRSALHFSMEDILTAQLEENWTFGLGKVGALIVDVPAGEKK
TYQFAVCFYRGGYVTAGMDASYFYTRFFQNIEEVGLYALEQAEVLKEQSFRSNKLIEKEWLSDDQTFMMAHAIRSYYGNT
QLLEHEGKPIWVVNEGEYRMMNTFDLTVDQLFFELKLNPWTVKNVLDLYVERYSYEDRVRFPGEETEYPSGISFTHDMGV
ANTFSRPHYSSYELYGISGCFSHMTHEQLVNWVLCAAVYIEQTKDWAWRDKRLAILEQCLESMVRRDHPDPEQRNGVMGL
DSTRTMGGAEITTYDSLDVSLGQARNNLYLAGKCWAAYVALEKLFRDVGKEELAALAGEQAEKCAATIVSHVTDDGYIPA
IMGEGNDSKIIPAIEGLVFPYFTNCHEALDENGRFGAYIQALRNHLQYVLREGICLFPDGGWKISSTSNNSWLSKIYLCQ
FIARHILGWEWDEQGKRADAAHVAWLTHPTLSIWSWSDQIIAGEITGSKYYPRGVTSILWLEEGE
>P12851 5.3.1.5~~~xylA~~~Xylose isomerase~~~COG2115
MSVQATREDKFSFGLWTVGWQARDAFGDATRTALDPVEAVHKLAEIGAYGITFHDDDLVPFGSDAQTRDGIIAGFKKALD
ETGLIVPMVTTNLFTHPVFKDGGFTSNDRSVRRYAIRKVLRQMDLGAELGAKTLVLWGGREGAEYDSAKDVSAALDRYRE
ALNLLAQYSEDRGYGLRFAIEPKPNEPRGDILLPTAGHAIAFVQELERPELFGINPETGHEQMSNLNFTQGIAQALWHKK
LFHIDLNGQHGPKFDQDLVFGHGDLLNAFSLVDLLENGPDGAPAYDGPRHFDYKPSRTEDYDGVWESAKANIRMYLLLKE
RAKAFRADPEVQEALAASKVAELKTPTLNPGEGYAELLADRSAFEDYDADAVGAKGFGFVKLNQLAIEHLLGAR
>P12070 5.3.1.5~~~xylA~~~Xylose isomerase~~~
MSVQPTPADHFTFGLWTVGWTGADPFGVATRKNLDPVEAVHKLAELGAYGITFHDNDLIPFDATEAEREKILGDFNQALK
DTGLKVPMVTTNLFSHPVFKDGGFTSNDRSIRRFALAKVLHNIDLAAEMGAETFVMWGGREGSEYDGSKDLAAALDRMRE
GVDTAAGYIKDKGYNLRIALEPKPNEPRGDIFLPTVGHGLAFIEQLEHGDIVGLNPETGHEQMAGLNFTHGIAQALWAEK
LFHIDLNGQRGIKYDQDLVFGHGDLTSAFFTVDLLENGFPNGGPKYTGPRHFDYKPSRTDGYDGVWDSAKANMSMYLLLK
ERALAFRADPEVQEAMKTSGVFELGETTLNAGESAADLMNDSASFAGFDAEAAAERNFAFIRLNQLAIEHLLGSR
>P54272 5.3.1.5~~~xylA~~~Xylose isomerase~~~
MPYFDNISTIAYEGPASKNPLAFKFYNPEEKVGDKTMEEHLRFSVAYWHTFTGDGSDPFGAGNMIRPWNKYSGMDLAKAR
VEAAFEFFEKLNIPFFCFHDVDIAPEGETLKETYKNLDIIVDMIEEYMKTSKTKLLWNTANLFTHPRFVHGAATSCNADV
FAYAAAKVKKGLEIAKRLGAENYVFWGGREGYETLLNTDMKLELDNLARFLHMAVDYAKEIGFDGQFLIEPKPKEPTKHQ
YDFDVATALAFLQTYGLKDYFKFNIEANHATLAGHTFEHELRVARIHGMLGSVDANQGDMLLGWDTDEFPTDLYSTTLAM
YEILKNGGLGRGGLNFDAKVRRGSFEPEDLFYAHIAGMDSFAVGLKVAHRLIEDRVFDEFIEERYKSYTEGIGREIVEGT
VDFHKLEAHALQLGEIQNQSGRQERLKTLLNQYLLEVCAAR
>Q8A9M2 5.3.1.5~~~xylA~~~Xylose isomerase~~~COG2115
MATKEFFPGIEKIKFEGKDSKNPMAFRYYDAEKVINGKKMKDWLRFAMAWWHTLCAEGGDQFGGGTKQFPWNGNADAIQA
AKDKMDAGFEFMQKMGIEYYCFHDVDLVSEGASVEEYEANLKEIVAYAKQKQAETGIKLLWGTANVFGHARYMNGAATNP
DFDVVARAAVQIKNAIDATIELGGENYVFWGGREGYMSLLNTDQKREKEHLAQMLTIARDYARARGFKGTFLIEPKPMEP
TKHQYDVDTETVIGFLKAHGLDKDFKVNIEVNHATLAGHTFEHELAVAVDNGMLGSIDANRGDYQNGWDTDQFPIDNYEL
TQAMMQIIRNGGLGTGGTNFDAKTRRNSTDLEDIFIAHIAGMDAMARALESAAALLDESPYKKMLADRYASFDGGKGKEF
EDGKLTLEDVVAYAKTKGEPKQTSGKQELYEAILNMYC
>P00944 5.3.1.5~~~xylA~~~Xylose isomerase~~~COG2115
MQAYFDQLDRVRYEGSKSSNPLAFRHYNPDELVLGKRMEEHLRFAACYWHTFCWNGADMFGVGAFNRPWQQPGEALALAK
RKADVAFEFFHKLHVPFYCFHDVDVSPEGASLKEYINNFAQMVDVLAGKQEESGVKLLWGTANCFTNPRYGAGAATNPDP
EVFSWAATQVVTAMEATHKLGGENYVLWGGREGYETLLNTDLRQEREQLGRFMQMVVEHKHKIGFQGTLLIEPKPQEPTK
HQYDYDAATVYGFLKQFGLEKEIKLNIEANHATLAGHSFHHEIATAIALGLFGSVDANRGDAQLGWDTDQFPNSVEENAL
VMYEILKAGGFTTGGLNFDAKVRRQSTDKYDLFYGHIGAMDTMALALKIAARMIEDGELDKRIAQRYSGWNSELGQQILK
GQMSLADLAKYAQEHHLSPVHQSGRQEQLENLVNHYLFDK
>P54273 5.3.1.5~~~xylA~~~Xylose isomerase~~~
MPYFDNISTIAYEGPASKNPLAFKFYNPEEKVGDKTMEEHLRFSVAYWHTFTGDGSDPFGAGNMIRPWNKYSGMDLAKAR
VEAAFEFFEKLNIPFFCFHDVDIAPEGETLKETYKNLDIIVDMIEEYMKTSKTKLLWNTANLFTHPRFVHGAATSCNADV
FAYAAAKVKKGLEIAKRLGAENYVFWGGREGYETLLNTDMKLELDNLARFLHMAVDYAKEIGFDGQFLIEPKPKEPTKHQ
YDFDVATALAFLQTYGLKDYFKFNIEANHATLAGHTFEHELRVARIHGMLGSVDANQGDMLLGWDTDEFPTDLYSTTLAM
YEILKNGGLGRGGLNFDAKVRRGSFEPEDLFYAHIAGMDSFAVGLKVAHRLIEDRVFDEFIEERYKSYTEGIGREIVEGT
ADFHKLEAHALQLGEIQNQSGRQERLKTLLNQYLLEVCAAR
>P29443 5.3.1.5~~~xylA~~~Xylose isomerase~~~
MTEEYWKGVDKIQYVGHQDKKSGLGFQYYNPEEEIMGKKMKDWLRFAVAYWHTFDQRLVDPFGDGTAQRPYDKYTDPMDL
ALAKVDAAFEFYQKLGVDYLCFHDRDLAPEGDTLRETNANLDKVVDKIVEYQKTSGMKVLWNTSNMFTNPRFVEGAATSP
YADVFAYSAAQLKHSLEIGKRVGSENYVFWGGREGYESLWNTNMKQEQEHAAKIFHMAKDYANEIGFDAQMLLEPKPKEP
TTHQYDFDAATTIAFMKEYDLDKDFKLNLEGNHANLAGHTYQHEIRVAREAGLLGSLDANQGDKLIGWDIDEYPSNLYET
TAAMYEVVENGSIGPRGGLNFDAKPRRSAFAPEDLFLGHIVGMDSFAAGLRVAAAMKQDGFLDNLKADRYSSYKSGVGAD
IESGKADLKSLEAYAIDKPQSELIAATHSDHLEEIKDTINHYIIDTLSK
>P21394 ~~~xylA~~~Xylene/toluene monooxygenase electron transfer component XylA~~~
MNEFFKKISGLFVPPPESTVSVRGQGFQFKVPRGQTILESALHQGIAFPHDCKVGSCGTCKYKLISGRVNELTSSAMGLS
GDLYQSGYRLGCQCIPKEDLEIELDTVLGQALVPIETSALISKQKRLAHDIVEMEVVPDKQIAFYPGQYADVECAECSAV
RSYSFSAPPQPDGSLSFHVRLVPGGVFSGWLFGGDRTGATLTLRAPYGQFGLHESNATMVCVAGGTGLAPIKCVLQSMTQ
AQRERDVLLFFGARQQRDLYCLDEIEALQLDWGGRFELIPVLSEESSTSSWKGKRGMVTEYFKEYLTGQPYEGYLCGPPP
MVDAAETELVRLGVARELVFADRFYNRPPC
>P24299 5.3.1.5~~~xylA~~~Xylose isomerase~~~
MNYQPTPEDRFTFGLWTVGWEGRDPFGDATRTALDPVESVRRLAELGAHGVTFHDDDLIPFGSSDSERYEHVKRFRQALD
DTGMKVPMATTNLFTHPVFKDGGFTANDRDVRRYALRKTIRNIDLAVELGAETYVAWGGREGAESGGAKDVRDALDRMKE
AFDLLGEYVTSQGYDIRFAIEPKPNEPRGDILLPTVGHALAFIERLERPELYGVNPEVGHEQMAGLNFPHGIAQALWAGK
LFHIDLNGQNGIKYDQDLRFGAGDLRAAFWLVDLLESAGYSGPRHFDFKPPRTEDFDGVWASAAGCMRNYLILKERAAAF
RADPEVQEALRASRLDELARPTAADGLQALLDDRSAFEEFDVDAAAARGMAFERLDQLAMDHLLGARGAAA
>P50910 5.3.1.5~~~xylA~~~Xylose isomerase~~~
MSYQPTPEDKFTFGLWTVGWQGRDPFGDATRGALDPAESVRRLAELGAHGVTFHDDDLIPFGATDSERAEHIKRFRQGLD
ETGMKVPMATTNLFTHPVFKDGGFTANDRDVRRYAVRKTIRNIDLAVELGAQTYVAWGGREGAESGAAKDVRVALDRMKE
AFDLLGEYVTSQGYDTPFAIEPKPNEPRGDILLPTIGHALAFIDGLERPELYGVNPEVGHEQMAGLNFPHGIAQALWAGK
LFHIDLNGQSGIKYDQDLRFGPGDLAAAFWLVDLLESAGYEGPRHFDFKPPRTEDFDGVWASAAGCMRNYLILKERAAAF
RADPEVQEALRAARLDELAQPTAGDGLQALLPDRSAFEDFDPDAAAARGMAFERLDQLAMDHLLGARG
>P37031 5.3.1.5~~~xylA~~~Xylose isomerase~~~
MSFQPTPEDRFTFGLWTVGWQGRDPFGDATRPALDPVETVQRLAELGAYGVTFHDDDLIPFGSSDTERESHIKRFRQALD
ATGMTVPMATTNLFTHPVFKDGGFTANDRDVRRYALRKTIGNIDLAAELGAKTYVAWGGREGAESGGAKDVRDALDRMKE
AFDLLGEYVTAQGYDLRFAIEPKPNEPRGDILLPTVGHALAFIERLERPELYGVNPEVGHEQMAGLNFPHGIAQALWAGK
LFHIDLNGQSGIKYDQDLRFGAGDLRAAFWLVDLLETAGYEGPRHFDFKPPRTEDFDGVWASAAGCMRNYLILKDRAAAF
RADPEVQEALRAARLDQLAQPTAADGLDALLADRAAFEDFDVDAAAARGMAFEHLDQLAMDHLLGARG
>P15587 5.3.1.5~~~xylA~~~Xylose isomerase~~~
MSYQPTPEDRFTFGLWTVGWQGRDPFGDATRPALDPVETVQRLAELGAHGVTFHDDDLIPFGSSDTERESHIKRFRQALD
ATGMTVPMATTNLFTHPVFKDGGFTANDRDVRRYALRKTIRNIDLAVELGAKTYVAWGGREGAESGAAKDVRVALDRMKE
AFDLLGEYVTSQGYDTRFAIEPKPNEPRGDILLPTVGHALAFIERLERPELYGVNPEVGHEQMAGLNFPHGIAQALWAGK
LFHIDLNGQSGIKYDQDLRFGAGDLRAAFWLVDLLESAGYEGPRHFDFKPPRTEDIDGVWASAAGCMRNYLILKERAAAF
RADPEVQEALRASRLDELAQPTAADGVQELLADRTAFEDFDVDAAAARGMAFERLDQLAMDHLLGAR
>P24300 5.3.1.5~~~xylA~~~Xylose isomerase~~~
MNYQPTPEDRFTFGLWTVGWQGRDPFGDATRRALDPVESVRRLAELGAHGVTFHDDDLIPFGSSDSEREEHVKRFRQALD
DTGMKVPMATTNLFTHPVFKDGGFTANDRDVRRYALRKTIRNIDLAVELGAETYVAWGGREGAESGGAKDVRDALDRMKE
AFDLLGEYVTSQGYDIRFAIEPKPNEPRGDILLPTVGHALAFIERLERPELYGVNPEVGHEQMAGLNFPHGIAQALWAGK
LFHIDLNGQNGIKYDQDLRFGAGDLRAAFWLVDLLESAGYSGPRHFDFKPPRTEDFDGVWASAAGCMRNYLILKERAAAF
RADPEVQEALRASRLDELARPTAADGLQALLDDRSAFEEFDVDAAAARGMAFERLDQLAMDHLLGARG
>P09033 5.3.1.5~~~xylA~~~Xylose isomerase~~~
MSFQPTPEDKFTFGLWTVGWQGRDPFGDATRPALDPVETVQRLAELGAYGVTFHDDDLIPFGSSDTERESHIKRFRQALD
ATGMTVPMATTNLFTHPVFKDGGFTANDRDVRRYALRKTIRNIDLAAELGAKTYVAWGGREGAESGGAKDVRDALDRMKE
AFDLLGEYVTAQGYDLRFAIEPKPNEPRGDILLPTVGHALAFIERLERPELYGVNPEVGHEQMAGLNFPHGIAQALWAGK
LFHIDLNGQSGIKYDQDLRFGAGDLRAAFWLVDLLESAGYEGPRHFDFKPPRTEDFDGVWASAEGCMRNYLILKERAAAF
RADPEVQEALRAARLDQLAQPTAADGLEALLADRTAFEDFDVEAAAARAAWPFERLDQLAMDHLLGARG
>P56681 5.3.1.5~~~xylA~~~Xylose isomerase~~~
MYEPKPEHRFTFGLWTVGNVGRDPFGDAVRERLDPVYVGHKLAELGVHGVNLHDEDLIPRGTPPAERDQIVRRFKRALDE
TGLKVPMVTGNLFSDPGFKDGGFTSRDPWVRAYAFRKSLETMDLGAELGAEIYVVWPGREGAEVEATGKARKVWDWVREP
LNFMAAYAEDQGYGYRFALEPKPNEPRGDIYFATVGSMLALIHTLERPERFGLNPEFAHETMAGLNFVHAVAQALDAGKL
LHIDLNGQRMNRFDQDLRFGSENLKAAFLLVDLLESSGYQGPRHFDAHALRTEDEEGVWAFARGCMRTYLILKERAEAFR
EDPEVKELLAAYYQEDPAALPLMDPYSHEKAEALKRAELPLEAKRHRGYALERLDQLAVEYLLGVRG
>P45687 5.3.1.5~~~xylA~~~Xylose isomerase~~~
MAEFFPEIPKVQFEGKESTNPLAFKFYDPEEIIDGKPLKDHLKFSVAFWHTFVNEGRDPFGDPTADRPWNRYTDPMDKAF
ARVDALFEFCEKLNIEYFCFHDRDIAPEGKTLRETNKILDKVVERIKERMKDSNVKLLWGTANLFSHPRYMHGAATTCSA
DVFAYAAAQVKKALEITKELGGEGYVFWGGREGYETLLNTDLGFELENLARFLRMAVDYAKRIGFTGQFLIEPKPKEPTK
HQYDFDVATAYAFLKSHGLDEYFKFNIEANHATLAGHTFQHELRMARILGKLGSIDANQGDLLLGWDTDQFPTNVYDTTL
AMYEVIKAGGFTKGGLNFDAKVRRASYKVEDLFIGHIAGMDTFALGFKVAYKLVKDGVLDKFIEEKYRSFREGIGRDIVE
GKVDFEKLEEYIIDKETIELPSGKQEYLESLINSYIVKTILELR
>P30435 5.3.1.5~~~xylA~~~Xylose isomerase~~~
MNKYFENVSKIKYEGPKSNNPYSFKFYNPEEVIDGKTMEEHLRFSIAYWHTFTADGTDQFGKATMQRPWNHYTDPMDIAK
RRVEAAFEFFDKINAPFFCFHDRDIAPEGDTLRETNKNLDTIVAMIKDYLKTSKTKVLWGTANLFSNPRFVHGASTSCNA
DVFAYSAAQVKKALEITKELGRENYVFWGGREGYETLLNTDMELELDNFARFLHMAVDYAKEIGFEGQFLIEPKPKEPTK
HQYDFDVANVLAFLRKYDLDKYFKVNIEANHATLAFHDFQHELRYARINGVLGSIDANTGDMLLGWDTDQFPTDIRMTTL
AMYEVIKMGGFDKGGLNFDAKVRRASFEPEDLFLGHIAGMDAFAKGFKVAYKLVKDGVFDKFIEERYASYKEGIGADIVS
GKADFKSLEKYALEHSQIVNKSGRQELLESILNQYLFAE
>P48790 ~~~xylA~~~Xylosidase/arabinosidase~~~
MRKQRFNPYLPSWEYIPDAEPYVFNGRVYIYGSHDRFNGHAFCLNDYVCWSAPVDDLSEWRYEGVIYRKTDDPLNPDGRM
CLYAPDVTLGPDGRYYLYYVLDKVPVVSVAVCDTPAGKYEFYGYVRYADGTRLGEREGDWPQFDPAVLTEGERTYLYTGF
CPKGDKSRKGAMATVLGPDMLTVVEEPVIIVPSEPYSRGSGFEGHEFFEAPSIRKKGDTYYFIYSSVVMHELCYATSKHP
TKGFKYGGVIVSNCDLHIDSYKPAEKPMYYGGNNHGSIVEINGEWYIFYHRHTNGTSFSRQGCMEKIKILEDGSIPQVEM
TSCGSADEPLPGRGEYPAYIACNLFCGEESVYTDLTGAWMNNQFPKITQDGKDGDEEPGYIANMKDSATAGFKYFDCKGI
KSVKIKVRGYCRGVFEVKTSWNGEVLGKIPVEFSNIWTEFSASIPIPDGIHALYFTYRGSGSASLKSFTLCTD
>P26997 5.3.1.5~~~xylA~~~Xylose isomerase~~~
MYEPKPEHRFTFGLWTVGNVGRDPFGDAVRERLDPVYVVHKLAELGAYGVNLHDEDLIPRGTPPQERDQIVRRFKKALDE
TGLKVPMVTANLFSDPAFKDGAFTSPDPWVRAYALRKSLETMDLGAELGAEIYVVWPGREGAEVEATGKARKVWDWVREA
LNFMAAYAEDQGYGYRFALEPKPNEPRGDIYFATVGSMLAFIHTLDRPERFGLNPEFAHETMAGLNFVHAVAQALDAGKL
FHIDLNDQRMSRFDQDLRFGSENLKAAFFLVDLLESSGYQGPRHFDAHALRTEDEEGVWAFARGCMRTYLILKERAEAFR
EDPEVKELLAAYYQEDPAALALLGPYSREKAEALKRAELPLEAKRRRGYALERLDQLAVEYLLGVRG
>P19148 5.3.1.5~~~xylA~~~Xylose isomerase~~~
MNKYFENVSKIKYEGPKSNNPYSFKFYNPEEVIDGKTMEEHLRFSIAYWHTFTADGTDQFGKATMQRPWNHYTDPMDIAK
ARVEAAFEFFDKINAPYFCFHDRDIAPEGDTLRETNKNLDTIVAMIKDYLKTSKTKVLWGTANLFSNPRFVHGASTSCNA
DVFAYSAAQVKKALEITKELGGENYVFWGGREGYETLLNTDMEFELDNFARFLHMAVDYAKEIGFEGQFLIEPKPKEPTK
HQYDFDVANVLAFLRKYDLDKYFKVNIEANHATLAFHDFQHELRYARINGVLGSIDANTGDMLLGWDTDQFPTDIRMTTL
AMYEVIKMGGFDKGGLNFDAKVRRASFEPEDLFLGHIAGMDAFAKGFKVAYKLVKDRVFDKFIEERYASYKDGIGADIVS
GKADFRSLEKYALERSQIVNKSGRQELLESILNQYLFAE
>P49943 ~~~xsa~~~Xylosidase/arabinosidase~~~
MKTEKRYLVPGDYMADPAVHVFDGKLYIYPSHDWESGIAENDNGDHFNMKDYHVYSMDDVMNGEIKDHGVVLSTEDIPWA
GRQLWDCDVVCKDGKYYMYFPLKDQNDIFRIGVAVSDKPYGPFIPEANPMKGSYSIDPAVWDDGDGNYYIYFGGLWGGQL
QRYRNNKALESAILPEGEEEAIPSRVARLSEDMMEFAEEPRAVVILDEDGKPLTAGDTERRFFEASWMHKYNGKYYFSYS
TGDTHLLCYATGDNPYGPFTYQGVILTPVVGWTTHHAIVEFKGKWYLFHHDCVPSEGKTWLRSLKVCELQYDADGRIITI
EGKDE
>P09099 2.7.1.17~~~xylB~~~Xylulose kinase~~~COG1070
MYIGIDLGTSGVKVILLNEQGEVVAAQTEKLTVSRPHPLWSEQDPEQWWQATDRAMKALGDQHSLQDVKALGIAGQMHGA
TLLDAQQRVLRPAILWNDGRCAQECTLLEARVPQSRVITGNLMMPGFTAPKLLWVQRHEPEIFRQIDKVLLPKDYLRLRM
TGEFASDMSDAAGTMWLDVAKRDWSDVMLQACDLSRDQMPALYEGSEITGALLPEVAKAWGMATVPVVAGGGDNAAGAVG
VGMVDANQAMLSLGTSGVYFAVSEGFLSKPESAVHSFCHALPQRWHLMSVMLSAASCLDWAAKLTGLSNVPALIAAAQQA
DESAEPVWFLPYLSGERTPHNNPQAKGVFFGLTHQHGPNELARAVLEGVGYALADGMDVVHACGIKPQSVTLIGGGARSE
YWRQMLADISGQQLDYRTGGDVGPALGAARLAQIAANPEKSLIELLPQLPLEQSHLPDAQRYAAYQPRRETFRRLYQQLL
PLMA
>P35850 2.7.1.17~~~xylB~~~Xylulose kinase~~~
MGKYVLGVDLGTSAVKVSALDHSGQIVAQESFDYDLIQKQPGYNEQNPEDWVSGTTVAIVRLILNDHLDASNIEGLSYSG
QMHGLVLLDENKKVLRPAILWNDTRSTPQREEIEAKLGDEFVHITRNQPLEGFTLTKLLWVKQNEPDIWAKAKYFVLPKD
YVRYRMTGNLAMDYSDATGTVLLDVAKGEWSQKICAALDIPMSMCPPLIKSIDLAGTVTPAYAEFSGLTTDTKVFGGAAD
NAAGAVGAGILHPNMVLSSIGTSGVVLKYEDNADVNYHGVLQFEDHAIPDKFYSMGVTLAAGYSFTWFKKTFAPAEDFTD
VVASAAKSTVGANGLLYTPYIVGERAPYADADIRGSFTGVDGTHQRYDFVRAVLEGIIFSFRDLFDIYEENGGDFDTVVS
IGGGAKSPLWLQIQADIFNRKVVSLTNEQGPGMGAAMIAATGLGWFDSLQDCAETFVHFGKAYEPNPDNVKKYEKMHAIY
KQVYQQTKTISEQLLDYRRAEL
>P39849 1.1.1.90~~~xylB~~~Aryl-alcohol dehydrogenase~~~
MEIKAAIVRQKNGPFLLEHVALNEPAEDQVLVRLVATGLCHTDLVCRDQHYPVPLPMVFGHEGAGVVERVGSAVKKVQPG
DHVVLTFYTCGSCDACLSGDPTSCANSFGPNFMGRSVTGECTIHDHQGAEVGASFFGQSSFATYALSYERNTVKVTKDVP
LELLGPLGCGIQTGAGSVLNALNPPAGSAIAIFGAGAVGLSAVMAAVVAGCTTIIAVDVKENRLELASELGATHIINPAA
NDPIEAIKEIFADGVPYVLETSGLPAVLTQAILSSAIGGEIGIVGAPPMGATVPVDINFLLFNRKLRGIVEGQSISDIFI
PRLVELYRQGKFPFDKLIKFYPFDEINRAAEDSEKGVTLKPVLRIG
>Q9A9Z1 3.1.1.110~~~xylC~~~D-xylonolactone lactonase~~~COG3386
MTAQVTCVWDLKATLGEGPIWHGDTLWFVDIKQRKIHNYHPATGERFSFDAPDQVTFLAPIVGATGFVVGLKTGIHRFHP
ATGFSLLLEVEDAALNNRPNDATVDAQGRLWFGTMHDGEENNSGSLYRMDLTGVARMDRDICITNGPCVSPDGKTFYHTD
TLEKTIYAFDLAEDGLLSNKRVFVQFALGDDVYPDGSVVDSEGYLWTALWGGFGAVRFSPQGDAVTRIELPAPNVTKPCF
GGPDLKTLYFTTARKGLSDETLAQYPLAGGVFAVPVDVAGQPQHEVRLV
>P43503 1.2.1.28~~~xylC~~~Benzaldehyde dehydrogenase [NAD(+)]~~~
MRETKEQPIWYGKVFSSNWVEARGGVANVVDPSNGDILGITGVANGEDVDAAVNAAKRAQKEWAAIPFSERAAIVRKAAE
KLKEREYEFADWNVRECGAIRPKGLWEAGIAYEQMHQAAGLASLPNGTLFPSAVPGRMNLCQRVPVGVVGVIAPWNFPLF
LAMRSVAPALALGNAVILKPDLQTAVTGGALIAEIFSDAGMPDGVLHVLPGGADVGESMVANSGINMISFTGSTQVGRLI
GEKCGRMLKKVALELGGNNVHIVLPDADLEGAVSCAAWGTFLHQGQVCMAAGRHLVHRDVAQQYAEKLALRAKNLVVGDP
NSDQVHLGPLINEKQVVRVHALVESAQRAGAQVLAGGTYQDRYYQATVIMDVKPEMEVFKSEIFGPVAPITVFDSIEEAI
ELANCSEYGLAASIHTRALATGLDIAKRLNTGMVHINDQPINCEPHVPFGGMGASGSGGRFGGPASIEEFTQSQWISMVE
KPANYPF
>Q9A9Z2 4.2.1.82~~~xylD~~~D-xylonate dehydratase~~~COG0129
MRSALSNRTPRRFRSRDWFDNPDHIDMTALYLERFMNYGITPEELRSGKPIIGIAQTGSDISPCNRIHLDLVQRVRDGIR
DAGGIPMEFPVHPIFENCRRPTAALDRNLSYLGLVETLHGYPIDAVVLTTGCDKTTPAGIMAATTVNIPAIVLSGGPMLD
GWHENELVGSGTVIWRSRRKLAAGEITEEEFIDRAASSAPSAGHCNTMGTASTMNAVAEALGLSLTGCAAIPAPYRERGQ
MAYKTGQRIVDLAYDDVKPLDILTKQAFENAIALVAAAGGSTNAQPHIVAMARHAGVEITADDWRAAYDIPLIVNMQPAG
KYLGERFHRAGGAPAVLWELLQQGRLHGDVLTVTGKTMSENLQGRETSDREVIFPYHEPLAEKAGFLVLKGNLFDFAIMK
SSVIGEEFRKRYLSQPGQEGVFEARAIVFDGSDDYHKRINDPALEIDERCILVIRGAGPIGWPGSAEVVNMQPPDHLLKK
GIMSLPTLGDGRQSGTADSPSILNASPESAIGGGLSWLRTGDTIRIDLNTGRCDALVDEATIAARKQDGIPAVPATMTPW
QEIYRAHASQLDTGGVLEFAVKYQDLAAKLPRHNH
>Q59545 1.1.1.9~~~~~~D-xylulose reductase~~~
MIMKALVLEKAGKIAIQDWQSNEVLGDDDVEIKIHTVGICGSDVHYYQHGRIGPFVVDEPMVLGHEASGVITAAGKNVKH
LKVGDRVCMEPGIPDLQSPQSRAGIYNLDPAVRFWATPPIDGCLRESVIHPAAFTFKLPDNVSFAQGAMVEPLAIGMQSA
TKAGIKPGDIGLVIGAGTIGIITQSALAGGCSDVIICDVFDEKLKVAEKYQGLHAVNSKDQQALADKVRELTGGEGVNVL
FECSGAKPVIASISDHIAPGGTAVLVGMPIDPAPLDIVAAQAKEVTFKTILRYANMYPRTIRLLSSGKLNVAPLLSATYK
FKDSVEAYERAAEPVRLM
>P06622 1.13.11.2~~~xylE~~~Metapyrocatechase~~~
MNKGVMRPGHVQLRVLDMSKALEHYVELLGLIEMDRDDQGRVYLKAWTEVDKFSLVLREADEPGMDFMGFKVVDEDALRQ
LERDLMAYGCAVEQLPAGELNSCGRRVRFQAPSGHHFELYADKEYTGKWGLNDVNPEAWPRDLKGMAAVRFDHALMYGDE
LPATYDLFTKVLGFYLAEQVLDENGTRVAQFLSLSTKAHDVAFIHHPEKGRLHHVSFHLETWEDLLRAADLISMTDTSID
IGPTRHGLTHGKTIYFFDPSGNRNEVFCGGDYNYPDHKPVTWTTDQLGKAIFYHDRILNERFMTVLT
>P0AGF4 ~~~xylE~~~D-xylose-proton symporter~~~COG0477
MNTQYNSSYIFSITLVATLGGLLFGYDTAVISGTVESLNTVFVAPQNLSESAANSLLGFCVASALIGCIIGGALGGYCSN
RFGRRDSLKIAAVLFFISGVGSAWPELGFTSINPDNTVPVYLAGYVPEFVIYRIIGGIGVGLASMLSPMYIAELAPAHIR
GKLVSFNQFAIIFGQLLVYCVNYFIARSGDASWLNTDGWRYMFASECIPALLFLMLLYTVPESPRWLMSRGKQEQAEGIL
RKIMGNTLATQAVQEIKHSLDHGRKTGGRLLMFGVGVIVIGVMLSIFQQFVGINVVLYYAPEVFKTLGASTDIALLQTII
VGVINLTFTVLAIMTVDKFGRKPLQIIGALGMAIGMFSLGTAFYTQAPGIVALLSMLFYVAAFAMSWGPVCWVLLSEIFP
NAIRGKALAIAVAAQWLANYFVSWTFPMMDKNSWLVAHFHNGFSYWIYGCMGVLAALFMWKFVPETKGKTLEELEALWEP
ETKKTQQTATL
>P37387 ~~~xylF~~~D-xylose-binding periplasmic protein~~~COG4213
MKIKNILLTLCTSLLLTNVAAHAKEVKIGMAIDDLRLERWQKDRDIFVKKAESLGAKVFVQSANGNEETQMSQIENMINR
GVDVLVIIPYNGQVLSNVVKEAKQEGIKVLAYDRMINDADIDFYISFDNEKVGELQAKALVDIVPQGNYFLMGGSPVDNN
AKLFRAGQMKVLKPYVDSGKIKVVGDQWVDGWLPENALKIMENALTANNNKIDAVVASNDATAGGAIQALSAQGLSGKVA
ISGQDADLAGIKRIAAGTQTMTVYKPITLLANTAAEIAVELGNGQEPKADTTLNNGLKDVPSRLLTPIDVNKNNIKDTVI
KDGFHKESEL
>P23106 3.7.1.9~~~xylF~~~2-hydroxymuconate semialdehyde hydrolase~~~
MNAPQQSPEIGREILAAGYRTNLHDQGEGFPALLIHGSGPASPPGPTGAGSFRSSQTRRVIAPDMLGFGYSERPADGKYS
QARWVEHAIGVLDALGIQQGDIVGNSFGGGLALALAIRHPERVRRLVLMGSVGVSFPITAGLETAWGYTPSLANMRRLLD
LFAHDRTLVNDELAELRYQASIRPGFQESFAAMFPPPRQNGVDDLASNETDIRALPNETLVIHGREDRIIPLQASLTLAQ
WIPNAQLHVFGQCGHWTQIEHAERFARLVENFLAEADALHS
>P37388 7.5.2.10~~~xylG~~~Xylose import ATP-binding protein XylG~~~COG1129
MPYLLEMKNITKTFGSVKAIDNVCLRLNAGEIVSLCGENGSGKSTLMKVLCGIYPHGSYEGEIIFAGEEIQASHIRDTER
KGIAIIHQELALVKELTVLENIFLGNEITHNGIMDYDLMTLRCQKLLAQVSLSISPDTRVGDLGLGQQQLVEIAKALNKQ
VRLLILDEPTASLTEQETSILLDIIRDLQQHGIACIYISHKLNEVKAISDTICVIRDGQHIGTRDAAGMSEDDIITMMVG
RELTALYPNEPHTTGDEILRIEHLTAWHPVNRHIKRVNDVSFSLKRGEILGIAGLVGAGRTETIQCLFGVWPGQWEGKIY
IDGKQVDIRNCQQAIAQGIAMVPEDRKRDGIVPVMAVGKNITLAALNKFTGGISQLDDAAEQKCILESIQQLKVKTSSPD
LAIGRLSGGNQQKAILARCLLLNPRILILDEPTRGIDIGAKYEIYKLINQLVQQGIAVIVISSELPEVLGLSDRVLVMHE
GKLKANLINHNLTQEQVMEAALRSEHHVEKQSV
>P23105 1.2.1.85~~~xylG~~~2-hydroxymuconic semialdehyde dehydrogenase~~~
MKEIKHFISGELVGSASGKLFDNVSPANGQVIGRVHEAGRAEVDAAVRAARAALKGPWGKMTVAERAEILHRVADGITAR
FGEFLEARMPGHRQAEVAGQPHRHSARRANFKVFADLLKNVANEAFEMATPDGAGALNYGVRRPKGVIGVISPWNLPLLL
MTWKVGPALACGNCVVVKPSEETPLTATLLGEVMQAAGVPAGVYNVVHGFGGDSAGAFLTEHPDVDAYTFTGETGTGETI
MRAAAKGVRQVSLELGGKNAGIVFADCDMDKAIEGTLRSAFANCGQVCLGTERVYVERPIFDAFVARLKAGAEALKIGEP
NDPEANFGPLISHKPREKVPSYYQQAVDDGATVVTGGGVPEMPAHLAGGAWVQPTIWTGLADDSAVVTEEIFGPCCHIRP
FDSEEEAIELANSLPYGLASAIWTENVRRAHRVAGQIEAGIVWVNSWFLRDLRTAFGGSKQSGIGREGGVHSLEFYTELK
NICVKL
>P0AGI4 ~~~xylH~~~Xylose transport system permease protein XylH~~~COG4214
MSKSNPSEVKLAVPTSGGFSGLKSLNLQVFVMIAAIIAIMLFFTWTTDGAYLSARNVSNLLRQTAITGILAVGMVFVIIS
AEIDLSVGSMMGLLGGVAAICDVWLGWPLPLTIIVTLVLGLLLGAWNGWWVAYRKVPSFIVTLAGMLAFRGILIGITNGT
TVSPTSAAMSQIGQSYLPASTGFIIGALGLMAFVGWQWRGRMRRQALGLQSPASTAVVGRQALTAIIVLGAIWLLNDYRG
VPTPVLLLTLLLLGGMFMATRTAFGRRIYAIGGNLEAARLSGINVERTKLAVFAINGLMVAIAGLILSSRLGAGSPSAGN
IAELDAIAACVIGGTSLAGGVGSVAGAVMGAFIMASLDNGMSMMDVPTFWQYIVKGAILLLAVWMDSATKRRS
>P23107 4.2.-.-~~~xylJ~~~2-hydroxypent-2,4-dienoate hydratase~~~
MDKTLINELGDELYQAMVQRETVTPLTSRGFDISVEDAYHISLRMLERRLAAGERVIGKKIGVTSKAVQNMLGVHQPDFG
YLTDAMVYNSGEAMPISEKLIQPRAEGEIAFILKKDLMGPGVTNADVLAATECVIPCFEVVDSRIQDWKIKIQDTVADNA
SCGLFVLGDQAVSPRQVDLVTCGMLVEKNGQLLSTGAGAAALGSPVNCVAWLANTLGHFGIA
>P21395 1.14.15.26~~~xylM~~~Xylene/toluene monooxygenase hydroxylase component XylM~~~
MDTLRYYLIPVVTACGLIGFYYGGYWVWLGAATFPALMVLDVILPKDFSARKVSPFFADLTQYLQLPLMIGLYGLLVFGV
ENGRIELSEPLQVAGCILSLAWLSGVPTLPVSHELMHRRHWLPRKMAQLLAMFYGDPNRDIAHVNTHHLYLDTPLDSDTP
YRGQTIYSFVISATVGSVKDAIKIEAETLRRKGQSPWNLSNKTYQYVALLLALPGLVSYLGGPALGLVTIASMIIAKGIV
EGFNYFQHYGLVRDLDQPILLHHAWNHMGTIVRPLGCEITNHINHHIDGYTRFYELRPEKEAPQMPSLFVCFLLGLIPPL
WFALIAKPKLRDWDQRYATPGERELAMAANKKAGWPLWCESELGRVASI
>P96792 ~~~xylP~~~Isoprimeverose transporter~~~
MSVSMQHHSSEASPATQPASVIHKNMVPWSERFSYSLSDFACNLSFSLVSTYLMFFYTDVFGISAAIVGTLFLVARIVDA
FDGPFWGIMIDHTHTRWGKSRPYWLWFAIPFAVFSVLCFTVPNMSTGMKVVWAYVTYIGVDVLYSAVNIPITSILPSLTS
NPQERVTLSTIRQFMGTLGATIISTIALPLVAYFGGGSTSSAHGWFMVALIMAVIAMVIFFIVFANTKERVQTVQSKKSI
PIKTSLKALKRNWPWVIVIFINFIYWLGMQTRSQVTVYFFKYNMHDATLASFILGLQLVALLAVVITPWTAKRIGKRNTM
LMGMLLAIVGQLILWGGSKALNVPTITVGTIVGYLGTGFVSGLIAVMLADSVDYGEWKNGVRAEGIVTSFSSFSAKFGMG
IGGAVTGLILSAGGYVANHAQSAQALNAIEMNYVWVPIVGFGLSAIALLFYKVDKIEPKMLADLEQKHAQENALADDQK
>P96793 3.2.1.177~~~xylQ~~~Alpha-xylosidase XylQ~~~
MKFTNGYWLNREEYDVNSPKETYDAQQNGKTITAFAPYTRIMSRGDQLNLGTTTITLTSPVENVIGVKLEHFDTNEHGPE
FKINNLDPEVAIEVNDQVASLQSGDLKVTLPLRTDFEMKFTANGQLVTQSETKPQATIWNHDTKVNYMREQLSMGIDEKI
YGLGERFTNFVKNGQVVDTWNQDGGTGSEQAYKNIPFYISSNGYGVFVDESQRVSFEIGSENVDRVQFSTEGQSLQYYVI
YGPTPKEVLHRYTQLTGAIKLPPAWSFGLWLTTSFTTDYSEETVLKFIDGMQEHHIPLDVFHFDCFWQKGFEWCTLEWDK
EQFPDPEGLLKKIHDRGIKVCVWLNPYIAQKSPLFKEAKDKGYLLTRENGDIWQWDLWQAGNGFVDFTNPAAVKWYQDKL
KVLLDMGVDSFKTDFGERIPAEDVKFFDGSNPQQEHNYYTLQYNRAVYEVIQQEKGADEAVLFARSQRLVHNPIQYTGAA
TISRSTAQCVIQLRGGLSFLLSGFGFWSHDIGGFEDGPGTPTADLYKRWSQFGLLSSHSRYHGSDVYRVPWNFDDEAVEN
TRKYVNKLSLMPYIYTEAAHAAAAYGNPLMRPMFLEFGDDDNVYDNATQYMFGSKILVAPIFNDQGKAHFYLPSGKWTSI
LDGKVYQAPRTGEWVNEVFDELDLPVLVRQNSIIVRNEKAVDAAYDYTKDVDIHLYQIQDGNVSSKVVDEHGQDTAEIKV
ERANGRIVINTVGLTGDSTVYVHENNDTIKLSLVDGKAEVTL
>P0ACI3 ~~~xylR~~~Xylose operon regulatory protein~~~COG1609
MFTKRHRITLLFNANKAYDRQVVEGVGEYLQASQSEWDIFIEEDFRARIDKIKDWLGDGVIADFDDKQIEQALADVDVPI
VGVGGSYHLAESYPPVHYIATDNYALVESAFLHLKEKGVNRFAFYGLPESSGKRWATEREYAFRQLVAEEKYRGVVYQGL
ETAPENWQHAQNRLADWLQTLPPQTGIIAVTDARARHILQVCEHLHIPVPEKLCVIGIDNEELTRYLSRVALSSVAQGAR
QMGYQAAKLLHRLLDKEEMPLQRILVPPVRVIERRSTDYRSLTDPAVIQAMHYIRNHACKGIKVDQVLDAVGISRSNLEK
RFKEEVGETIHAMIHAEKLEKARSLLISTTLSINEISQMCGYPSLQYFYSVFKKAYDTTPKEYRDVNSEVML
>P31434 3.2.1.177~~~yicI~~~Alpha-xylosidase~~~COG1501
MKISDGNWLIQPGLNLIHPLQVFEVEQQDNEMVVYAAPRDVRERTWQLDTPLFTLRFFSPQEGIVGVRIEHFQGALNNGP
HYPLNILQDVKVTIENTERYAEFKSGNLSARVSKGEFWSLDFLRNGERITGSQVKNNGYVQDTNNQRNYMFERLDLGVGE
TVYGLGERFTALVRNGQTVETWNRDGGTSTEQAYKNIPFYMTNRGYGVLVNHPQCVSFEVGSEKVSKVQFSVESEYLEYF
VIDGPTPKAVLDRYTRFTGRPALPPAWSFGLWLTTSFTTNYDEATVNSFIDGMAERNLPLHVFHFDCFWMKAFQWCDFEW
DPLTFPDPEGMIRRLKAKGLKICVWINPYIGQKSPVFKELQEKGYLLKRPDGSLWQWDKWQPGLAIYDFTNPDACKWYAD
KLKGLVAMGVDCFKTDFGERIPTDVQWFDGSDPQKMHNHYAYIYNELVWNVLKDTVGEEEAVLFARSASVGAQKFPVHWG
GDCYANYESMAESLRGGLSIGLSGFGFWSHDIGGFENTAPAHVYKRWCAFGLLSSHSRLHGSKSYRVPWAYDDESCDVVR
FFTQLKCRMMPYLYREAARANARGTPMMRAMMMEFPDDPACDYLDRQYMLGDNVMVAPVFTEAGDVQFYLPEGRWTHLWH
NDELDGSRWHKQQHGFLSLPVYVRDNTLLALGNNDQRPDYVWHEGTAFHLFNLQDGHEAVCEVPAADGSVIFTLKAARTG
NTITVTGAGEAKNWTLCLRNVVKVNGLQDGSQAESEQGLVVKPQGNALTITL
>O52733 ~~~xylT~~~D-xylose transporter~~~
MRKVSTGFVYFFGALGGLLFGYDTGVISGAILFIQKQMNLGSWQQGWVVSAVLLGAILGAAIIGPSSDRFGRRKLLLLSA
IIFFVGALGSAFSPEFWTLIISRIILGMAVGAASALIPTYLAELAPSDKRGTVSSLFQLMVMTGILLAYITNYSFSGFYT
GWRWMLGFAAIPAALLFLGGLILPESPRFLVKSGHLDEARHVLDTMNKHDQVAVNKEINDIQESAKIVSGGWSELFGKMV
RPSLIIGIGLAIFQQVMGCNTVLYYAPTIFTDVGFGVSAALLAHIGIGIFNVIVTAIAVAIMDKIDRKKIVNIGAVGMGI
SLFVMSIGMKFSGGSQTAAIISVIALTVYIAFFSATWGPVMWVMIGEVFPLNIRGLGNSFASVINWTANMIVSLTFPSLL
DFFGTGSLFIGYGILCFASIWFVQKKVFETRNRSLEDIEATLRAKTGEDAAELSTTK
>V9TXH2 3.2.1.8~~~~~~Endo-1,4-beta-xylanase Xyn11E~~~
MFKFGKKLMTVVLAASMSFGVFAATTGATDYWQNWTDGGGTVNAVNGSGGNYSVNWQNTGNFVVGKGWTYGTPNRVVNYN
AGVFSPSGNGYLTFYGWTRNALIEYYVVDNWGTYRPTGTYKGTVNSDGGTYDIYTTMRYNQPSIDGYSTFPQYWSVRQSK
RPIGVNSQITFQNHVNAWASKGMNLGSSWSYQVLATEGYQSSGSSNVTVW
>P40943 3.2.1.8~~~~~~Endo-1,4-beta-xylanase~~~
MRNVVRKPLTIGLALTLLLPMGMTATSAKNADSYAKKPHISALNAPQLDQRYKNEFTIGAAVEPYQLQNEKDVQMLKRHF
NSIVAENVMKPISIQPEEGKFNFEQADRIVKFAKANGMDIRFHTLVWHSQVPQWFFLDKEGKPMVNETDPVKREQNKQLL
LKRLETHIKTIVERYKDDIKYWDVVNEVVGDDGKLRNSPWYQIAGIDYIKVAFQAARKYGGDNIKLYMNDYNTEVEPKRT
ALYNLVKQLKEEGVPIDGIGHQSHIQIGWPSEAEIEKTINMFAALGLDNQITELDVSMYGWPPRAYPTYDAIPKQKFLDQ
AARYDRLFKLYEKLSDKISNVTFWGIADNHTWLDSRADVYYDANGNVVVDPNAPYAKVEKGKGKDAPFVFGPDYKVKPAY
WAIIDHK
>P45703 3.2.1.8~~~xynA~~~Endo-1,4-beta-xylanase~~~
MCSSIPSLREVFANDFRIGAAVNPVTLEAQQSLLIRHVNSLTAENHMKFEHLQPEEGRFTFDIAIKSSTSPFSSHGVRGH
TLVWHNQTPSWVFQDSQGHFVGRDVLLERMKSHISTVVQRYKGKVYCWDVINEAVADEGSEWLRSSTWRQIIGDDFIQQA
FLYAHEADPEALLFYNDYNECFPEKREKIYTLVKSLRDKGIPIHGIGMQAHWSLNRPTLDEIRAAIERYASLGVILHITE
LDISMFEFDDHRKDLAAPTNEMVERQAERYEQIFSLFKEYRDVIQNVTFWGIADDHTWLDHFPVQGRKNWPLLFDEQHNP
KPAFWRVVNI
>C6CRV0 3.2.1.8~~~xynA1~~~Endo-1,4-beta-xylanase A~~~COG3693
MSRSLKKFVSILLAAALLIPIGRLAPVAEAAENPTIVYHEDFAIDKGKAIQSGGASLTQVTGKVFDGNNDGSALYVSNRA
NTWDAADFKFADIGLQNGKTYTVTVKGYVDQDATVPSGAQAFLQAVDSNNYGFLASANFAAGTAFTLTKEFTVDTSVSTQ
LRVQSSEEGKAVPFYIGDILITANPTTTTNTVYHEDFATDKGKAVQSGGANLAQVADKVFDGNDDGKALYVSNRANTWDA
ADFKFADIGLQNGKTYTVTVKGYVDQDATVPSGAQAFLQAVDSNNYGFLASANFAARSAFTLTKEFTVDTSVSTQLRVQS
SEEGKAVPFYIGDILITETVNSGGGQEDPPRPPALPFNTITFEDQTAGGFTGRAGTETLTVTNESNHTADGSYSLKVEGR
TTSWHGPSLRVEKYVDKGYEYKVTAWVKLLSPETSTKLELASQVGDGGSANYPSLASKTITAADGWVQLQGNYRYNSVGG
EYLTIYVQSSNATASYYIDDISFESTGSGPVGIQKDLAPLKDVYKNDFLIGNAISAEDLEGTRLELLKMHHDVVTAGNAM
KPDALQPTKGNFTFTAADAMIDKVLAEGMKMHGHVLVWHQQSPAWLNTKKDDNNNTVPLGRDEALDNLRTHIQTVMKHFG
NKVISWDVVNEAMNDNPSNPADYKASLRQTPWYQAIGSDYVEQAFLAAREVLDENPSWNIKLYYNDYNEDNQNKATAIYN
MVKDINDRYAAAHNGKLLIDGVGMQGHYNINTNPDNVKLSLEKFISLGVEVSVSELDVTAGNNYTLPENLAVGQAYLYAQ
LFKLYKEHADHIARVTFWGMDDNTSWRAENNPLLFDKNLQAKPAYYGVIDPDKYMEEHAPESKDANQAEAQYGTPVIDGT
VDSIWSNAQAMPVNRYQMAWQGATGTAKALWDDQNLYVLIQVSDSQLNKANENAWEQDSVEVFLDQNNGKTTFYQNDDGQ
YRVNFDNETSFSPASIAAGFESQTKKTANSYTVELKIPLTAVTPANQKKLGFDVQINDATDGARTSVAAWNDTTGNGYQD
TSVYGELTLAGKGTGGTGTVGTTVPQTGNVVKNPDGSTTLKPEVKTTNGNAVGTVTGDDLKKALDQAAPAAGGKKQVIID
VPLQANAATYAVQLPTQSLKSQDGYQLTAKIANAFIQIPSNMLANTNVTTDQVSIRVAKASLDNVDAATRELIGNRPVID
LSLVAGGNVIAWNNPTAPVTVAVPYAPTAEELKHPEHILIWYIDGSGKATPVPNSRYDAALGAVVFQTTHFSTYAAVSVF
TTFGDLAKVPWAKEAIDAMASRGVIKGTGENTFSPAASIKRADFIALLVRALELHGTGTTDTAMFSDVPANAYYYNELAV
AKQLGIATGFEDNTFKPDSSISRQDMMVLTTRALAVLGKQLPAGGSLNAFSDAASVAGYAQDSVAALVKAGVVQGSGSKL
APNDQLTRAEAAVILYRIWKLQ
>Q8GJ44 3.2.1.8~~~xynA~~~Endo-1,4-beta-xylanase A~~~
MKRKVKKMAAMATSIIMAIMIILHSIPVLAGRIIYDNETGTHGGYDYELWKDYGNTIMELNDGGTFSCQWSNIGNALFRK
GRKFNSDKTYQELGDIVVEYGCDYNPNGNSYLCVYGWTRNPLVEYYIVESWGSWRPPGATPKGTITVDGGTYEIYETTRV
NQPSIDGTATFQQYWSVRTSKRTSGTISVTEHFKQWERMGMRMGKMYEVALTVEGYQSSGYANVYKNEIRIGANPTPAPS
QSPIRRDAFSIIEAEEYNSTNSSTLQVIGTPNNGRGIGYIENGNTVTYSNIDFGSGATGFSATVATEVNTSIQIRSDSPT
GTLLGTLYVSSTGSWNTYQTVSTNISKITGVHDIVLVFSGPVNVDNFIFSRSSPVPAPGDNTRDAYSIIQAEDYDSSYGP
NLQIFSLPGGGSAIGYIENGYSTTYNNVNFANGLSSITARVATQISTSIQVRAGGATGTLLGTIYVPSTNSWDSYQNVTA
NLSNITGVHDITLVFSGPVNVDYFVFTPANVNSGPTSPVGGTRSAFSNIQAEDYDSSYGPNLQIFSLPGGGSAIGYIENG
YSTTYKNIDFGDGATSVTARVATQNATTIQVRLGSPSGTLLGTIYVGSTGSFDTYRDVSATISNTAGVKDIVLVFSGPVN
VDWFVFSKSGT
>P33558 3.2.1.8~~~xynA~~~Endo-1,4-beta-xylanase A~~~
MKRKVKKMAAMATSIIMAIMIILHSIPVLAGRIIYDNETGTHGGYDYELWKDYGNTIMELNDGGTFSCQWSNIGNALFRK
GRKFNSDKTYQELGDIVVEYGCDYNPNGNSYLCVYGWTRNPLVEYYIVESWGSWRPPGATPKGTITQWMAGTYEIYETTR
VNQPSIDGTATFQQYWSVRTSKRTSGTISVTEHFKQWERMGMRMGKMYEVALTVEGYQSSGYANVYKNEIRIGANPTPAP
SQSPIRRDAFSIIEAEEYNSTNSSTLQVIGTPNNGRGIGYIENGNTVTYSNIDFGSGATGFSATVATEVNTSIQIRSDSP
TGTLLGTLYVSSTGSWNTYQTVSTNISKITGVHDIVLVFSGPVNVDNFIFSRSSPVPAPGDNTRDAYSIIQAEDYDSSYG
PNLQIFSLPGGGSAIGYIENGYSTTYKNIDFGDGATSVTARVATQNATTIQVRLGSPSGTLLGTIYVGSTGSFDTYRDVS
ATISNTAGVKDIVLVFSGPVNVDWFVFSKSGT
>P49942 3.2.1.8~~~xylI~~~Endo-1,4-beta-xylanase A~~~
MKLKRIILLLLTVMFSFSYGEVFAKDGSSLKKALKNKFLIGVSVNTHQSSGKDVAAVEIVKKNFNSIVAENCMKSSVIHP
KENKYNFAQADEFVSFGESNQMAIIGHCLIWHSQLAPWFCVDKDGNNVSPEVLKKRMKDHITTIVKRYKGRIKGWDVVNE
AIEDNGAYRKTKFYEILGEEYIPLAFQYAHEADPDAELYYNDYSMAQPGRREAVVKMVNDLKKRGIRIDAIGMQGHIGMD
YPKISEFEKSMLAFAGTGVKIMITELDLTVIPSPNPNVGAEVSASFEYKKEMNPYPDGLPEEVSKAWTERMNDFFRLFLK
HHNLITRVTLWGVADQNSWRNDWPMRGRTDYPLLFDRNYQPKPVVGLIIKEAEKTK
>P00694 3.2.1.8~~~xynA~~~Endo-1,4-beta-xylanase A~~~
MNLRKLRLLFVMCIGLTLILTAVPAHARTITNNEMGNHSGYDYELWKDYGNTSMTLNNGGAFSAGWNNIGNALFRKGKKF
DSTRTHHQLGNISINYNASFNPGGNSYLCVYGWTQSPLAEYYIVDSWGTYRPTGAYKGSFYADGGTYDIYETTRVNQPSI
IGIATFKQYWSVRQTKRTSGTVSVSAHFRKWESLGMPMGKMYETAFTVEGYQSSGSANVMTNQLFIGN
>P18429 3.2.1.8~~~xynA~~~Endo-1,4-beta-xylanase A~~~COG0726
MFKFKKNFLVGLSAALMSISLFSATASAASTDYWQNWTDGGGIVNAVNGSGGNYSVNWSNTGNFVVGKGWTTGSPFRTIN
YNAGVWAPNGNGYLTLYGWTRSPLIEYYVVDSWGTYRPTGTYKGTVKSDGGTYDIYTTTRYNAPSIDGDRTTFTQYWSVR
QSKRPTGSNATITFSNHVNAWKSHGMNLGSNWAYQVMATEGYQSSGSSNVTVW
>P23556 3.2.1.8~~~xynA~~~Endo-1,4-beta-xylanase A~~~
MRCLIVCENLEMLNLSLAKTYKDYFKIGAAVTAKDLEGVHRDILLKHFNSLTPENAMKFENIHPEEQRYNFEEVARIKEF
AIKNDMKLRGHTFVWHNQTPGWVFLDKNGEEASKELVIERLREHIKTLCERYKDVVYAWDVVNEAVEDKTEKLLRESNWR
KIIGDDYIKIAFEIAREYAGDAKLFYNDYNNEMPYKLEKTYKVLKELLERGTPIDGIGIQAHWNIWDKNLVSNLKKAIEV
YASLGLEIHITELDISVFEFEDKRTDLFEPTPEMLELQAKVYEDVFAVFREYKDVITSVTLWGISDRHTWKDNFPVKGRK
DWPLLFDVNGKPKEALYRILRF
>P14768 3.2.1.8~~~xynA~~~Endo-1,4-beta-xylanase A~~~COG3693
MRTAMAKSLGAAAFLGAALFAHTLAAQTATCSYNITNEWNTGYTGDITITNRGSSAINGWSVNWQYATNRLSSSWNANVS
GSNPYSASNLSWNGNIQPGQSVSFGFQVNKNGGSAERPSVGGSICSGSVASSSAPASSVPSSIASSSPSSVASSVISSMA
SSSPVSSSSVASSTPGSSSGNQQCNWYGTLYPLCVTTTNGWGWEDQRSCIARSTCAAQPAPFGIVGSGSSTPVSSSSSSL
SSSSVVSSIRSSSSSSSSSVATGNGLASLADFPIGVAVAASGGNADIFTSSARQNIVRAEFNQITAENIMKMSYMYSGSN
FSFTNSDRLVSWAAQNGQTVHGHALVWHPSYQLPNWASDSNANFRQDFARHIDTVAAHFAGQVKSWDVVNEALFDSADDP
DGRGSANGYRQSVFYRQFGGPEYIDEAFRRARAADPTAELYYNDFNTEENGAKTTALVNLVQRLLNNGVPIDGVGFQMHV
MNDYPSIANIRQAMQKIVALSPTLKIKITELDVRLNNPYDGNSSNDYTNRNDCAVSCAGLDRQKARYKEIVQAYLEVVPP
GRRGGITVWGIADPDSWLYTHQNLPDWPLLFNDNLQPKPAYQGVVEALSGR
>P07528 3.2.1.8~~~xynA~~~Endo-1,4-beta-xylanase A~~~COG3693
MITLFRKPFVAGLAISLLVGGGIGNVAAAQGGPPKSGVFGENEKRNDQPFAWQVASLSERYQEQFDIGAAVEPYQLEGRQ
AQILKHHYNSLVAENAMKPESLQPREGEWNWEGADKIVEFARKHNMELRFHTLVWHSQVPEWFFIDEDGNRMVDETDPDK
REANKQLLLERMENHIKTVVERYKDDVTSWDVVNEVIDDGGGLRESEWYQITGTDYIKVAFETARKYGGEEAKLYINDYN
TEVPSKRDDLYNLVKDLLEQGVPIDGVGHQSHIQIGWPSIEDTRASFEKFTSLGLDNQVTELDMSLYGWPPTGAYTSYDD
IPAELLQAQADRYDQLFELYEELAADISSVTFWGIADNHTWLDGRAREYNNGVGIDAPFVFDHNYRVKPAYWRIID
>P09850 3.2.1.8~~~xlnA~~~Endo-1,4-beta-xylanase~~~
MFKFKKNFLVGLSAALMSISLFSATASAASTDYWQNWTDGGGIVNAVNGSGGNYSVNWSNTGNFVVGKGWTTGSPFRTIN
YNAGVWAPNGNGYLTLYGWTRSPLIEYYVVDSWGTYRPTGTYKGTVKSDGGTYDIYTTTRYNAPSIDGDRTTFTQYWSVR
QSKRPTGSNATITFTNHVNAWKSHGMNLGSNWAYQVMATEGYQSSGSSNVTVW
>P26514 3.2.1.8~~~xlnA~~~Endo-1,4-beta-xylanase A~~~
MGSYALPRSGVRRSIRVLLLALVVGVLGTATALIAPPGAHAAESTLGAAAAQSGRYFGTAIASGRLSDSTYTSIAGREFN
MVTAENEMKIDATEPQRGQFNFSSADRVYNWAVQNGKQVRGHTLAWHSQQPGWMQSLSGSALRQAMIDHINGVMAHYKGK
IVQWDVVNEAFADGSSGARRDSNLQRSGNDWIEVAFRTARAADPSAKLCYNDYNVENWTWAKTQAMYNMVRDFKQRGVPI
DCVGFQSHFNSGSPYNSNFRTTLQNFAALGVDVAITELDIQGAPASTYANVTNDCLAVSRCLGITVWGVRDSDSWRSEQT
PLLFNNDGSKKAAYTAVLDALNGGDSSEPPADGGQIKGVGSGRCLDVPDASTSDGTQLQLWDCHSGTNQQWAATDAGELR
VYGDKCLDAAGTSNGSKVQIYSCWGGDNQKWRLNSDGSVVGVQSGLCLDAVGNGTANGTLIQLYTCSNGSNQRWTRT
>B4XVN1 3.2.1.8~~~xynAS9~~~Endo-1,4-beta-xylanase A~~~
MFRHHPTRGRRTAGLLAAALATLSAGLTAVAPAHPARADTATLGELAEAKGRYFGSATDNPELPDTQYTQILGSEFSQIT
VGNTMKWQYTEPSRGRFDYTAAEEIVDLAESNGQSVRGHTLVWHNQLPSWVDDVPAGELLGVMRDHITHEVDHFKGRLIH
WDVVNEAFEEDGSRRQSVFQQKIGDSYIAEAFKAARAADPDVKLYYNDYNIEGIGPKSDAVYEMVKSFKAQGIPIDGVGM
QAHLIAGQVPASLQENIRRFADLGVDVALTELDIRMTLPRTAAKDAQQATDYGAVVEACLVVSRCVGITVWDYTDKYSWV
PSVFPGQGAALPWDEDFAKKPAYHAIAAALNGGSPAPGGNCTATYRVTSQWQGGFTAEITVGNDHTAPITGWTVTWTLSS
GQSISHMWNGNLTVNGQDVTVRDVGYNGTLGGNGSTTFGFQGEGVADTPADVTCTPGRPSGTSA
>Q60037 3.2.1.8~~~xynA~~~Endo-1,4-beta-xylanase A~~~COG3693
MQVRKRRGLLDVSTAVLVGILAGFLGVVLAASGVLSFGKEASSKGDSSLETVLALSFEGTTEGVVPFGKDVVLTASQDVA
ADGEYSLKVENRTSPWDGVEIDLTGKVKSGADYLLSFQVYQSSDAPQLFNVVARTEDEKGERYDVILDKVVVSDHWKEIL
VPFSPTFEGTPAKYSLIIVASKNTNFNFYLDKVQVLAPKESGPKVIYETSFENGVGDWQPRGDVNIEASSEVAHSGKSSL
FISNRQKGWQGAQINLKGILKTGKTYAFEAWVYQNSGQDQTIIMTMQRKYSSDASTQYEWIKSATVPSGQWVQLSGTYTI
PAGVTVEDLTLYFESQNPTLEFYVDDVKIVDTTSAEIKIEMEPEKEIPALKEVLKDYFKVGVALPSKVFLNPKDIELITK
HFNSITAENEMKPESLLAGIENGKLKFRFETADKYIQFVEENGMVIRGHTLVWHNQTPDWFFKDENGNLLSKEAMTERLK
EYIHTVVGHFKGKVYAWDVVNEAVDPNQPDGLRRSTWYQIMGPDYIELAFKFAREADPDAKLFYNDYNTFEPRKRDIIYN
LVKDLKEKGLIDGIGMQCHISLATDIKQIEEAIKKFSTIPGIEIHITELDMSVYRDSSSNYPEAPRTALIEQAHKMMQLF
EIFKKYSNVITNVTFWGLKDDYSWRATRRNDWPLIFDKDHQAKLAYWAIVAPEVLPPLPKESRISEGEAVVVGMMDDSYL
MSKPIEILDEEGNVKATIRAVWKDSTIYIYGEVQDKTKKPAEDGVAIFINPNNERTPYLQPDDTYAVLWTNWKTEVNRED
VQVKKFVGPGFRRYSFEMSITIPGVEFKKDSYIGFDAAVIDDGKWYSWSDTTNSQKTNTMNYGTLKLEGIMVATAKYGTP
VIDGEIDEIWNTTEEIETKAVAMGSLDKNATAKVRVLWDENYLYVLAIVKDPVLNKDNSNPWEQDSVEIFIDENNHKTGY
YEDDDAQFRVNYMNEQTFGTGGSPARFKTAVKLIEGGYIVEAAIKWKTIKPTPNTVIGFNIQVNDANEKGQRVGIISWSD
PTNNSWRDPSKFGNLRLIK
>Q60042 3.2.1.8~~~xynA~~~Endo-1,4-beta-xylanase A~~~
MRKKRRGFLNASTAVLVGILAGFLGVVLAATGALGFAVRESLLLKQFLFLSFEGNTDGASPFGKDVVVTASQDVAADGEY
SLKVENRTSVWDGVEIDLTGKVNTGTDYLLSFHVYQTSDSPQLFSVLARTEDEKGERYKILADKVVVPNYWKEILVPFSP
TFEGTPAKFSLIITSPKKTDFVFYVDNVQVLTPKEAGPKVVYETSFEKGIGDWQPRGSDVKISISPKVAHSGKKSLFVSN
RQKGWHGAQISLKGILKTGKTYAFEAWVYQESGQDQTIIMTMQRKYSSDSSTKYEWIKAATVPSGQWVQLSGTYTIPAGV
TVEDLTLYFESQNPTLEFYVDDVKVVDTTSAEIKLEMNPEEEIPALKDVLKDYFRVGVALPSKVFINQKDIALISKHSNS
STAENEMKPDSLLAGIENGKLKFRFETADKYIEFAQQNGMVVRGHTLVWHNQTPEWFFKDENGNLLSKEEMTERLREYIH
TVVGHFKGKVYAWDVVNEAVDPNQPDGLRRSTWYQIMGPDYIELAFKFAREADPNAKLFYNDYNTFEPKKRDIIYNLVKS
LKEKGLIDGIGMQCHISLATDIRQIEEAIKKFSTIPGIEIHITELDISVYRDSTSNYSEAPRTALIEQAHKMAQLFKIFK
KYSNVITNVTFWGLKDDYSWRATRRNDWPLIFDKDYQAKLAYWAIVAPEVLPPLPKESKISEGEAVVVGMMDDSYMMSKP
IEIYDEEGNVKATIRAIWKDSTIYVYGEVQDATKKPAEDGVAIFINPNNERTPYLQPDDTYVVLWTNWKSEVNREDVEVK
KFVGPGFRRYSFEMSITIPGVEFKKDSYIGFDVAVIDDGKWYSWSDTTNSQKTNTMNYGTLKLEGVMVATAKYGTPVIDG
EIDDIWNTTEEIETKSVAMGSLEKNATAKVRVLWDEENLYVLAIVKDPVLNKDNSNPWEQDSVEIFIDENNHKTGYYEDD
DAQFRVNYMNEQSFGTGASAARFKTAVKLIEGGYIVEAAIKWKTIKPSPNTVIGFNVQVNDANEKGQRVGIISWSDPTNN
SWRDPSKFGNLRLIK
>P36917 3.2.1.8~~~xynA~~~Endo-1,4-beta-xylanase A~~~
MMKNNVDRIVSIVTALIMIFGASLFSPPIRVFADDTNINLVSNGDFESGTIDGWIKQGNPTLAVTTEQAIGQYSMKVTGR
TQTYEGPAYSFLGKMQKGESYSVSLKVRLVSGQNSSNPLITVTMFREDDNGKHYDTIVWQKQVSEDSWTTVSGTYTLDYI
GTLKTLYMYVESPDPTLEYYIDDVVVTTQNPIQVGNVIANETFENGNTSGWIGTGSSVVKAVYGVAHSGDYSLLTTGRTA
NWNGPSYDLTGKIVPGQQYNVDFWVKFVNGNDTEQIKATVKATSDKDNYIQVNDFANVNKGEWTEIKGSFTLPVADYSGI
SIYVESQNPTLEFYIDDFSVIGEISNNQITIQNDIPDLYSVFKDYFPIGVAVDPSRLNDADPHAQLTAKHFNMLVAENAM
KPESLQPTEGNFTFDNADKIVDYAIAHNMKMRGHTLLWHNQVPDWFFQDPSDPSKSASRDLLLQRLKTHITTVLDHFKTK
YGSQNPIIGWDVVNEVLDDNGNLRNSKWLQIIGPDYIEKAFEYAHEADPSMKLFINDYNIENNGVKTQAMYDLVKKLKSE
GVPIDGIGMQMHININSNIDNIKASIEKLASLGVEIQVTELDMNMNGNISNEALLKQARLYKQLFDLFKAEKQYITAVVF
WGVSDDVTWLSKPNAPLLFDSKLQAKPAFWAVVDPSKAIPDIQSAKALEGSPTIGANVDSSWKLVKPLYVNTYVEGTVGA
TATVKSMWDTKNLYLLVQVSDNTPSNNDGIEIFVDKNDDKSTSYETDDERYTIKRDGTGSSDITKYVTSNADGYVAQLAI
PIEDISPAVNDKIGFDIRINDDKGNGKIDAITVWNDYTNSQNTNTSYFGDIVLSKSAQIATAIYGTPVIDGKVDDIWNNV
EPISTNTWILGSNGATATQKMMWDDKYLYVLADVTDSNLNKSSINPYEQDSVEVFVDQNNDKTTYYENDDGQYRVNYDNE
QSFGGSTNSNGFKSATSLTQSGYIVEEAIPWTSITPSNGTIIGFDLQVNNADENGKRTGIVTWCDPSGNSWQDTSGFGNL
LLTGKPSGALKKGVTFDDIKNSWAKDAIEVLASRHIVEGMTDTQYEPNKTVTRAEFTAMILRLLNIKEEQYSGEFSDVNS
GDWYANAIEAAYKAGIIEGDGKNARPNDSITREEMTQ
>P07129 3.2.1.37~~~xynB~~~Beta-xylosidase~~~
MKITNPVLKGFNPDPSICRAGEDYYMAVSTFEWFPGVQIYHSKDLIHWRLAARPLQKTSQLDMKGNPDSGGVWAPCLSYA
DGQFWLIYSDIKVVDGPFKDGHNYLVTADAVDGEWSDPVRLNSSGFDPSLFHDPSGKKYVLNMLWDHREKHHSFAGIALQ
EYSVSEKKLVGERKVIFKGTPIKLTEAPHLYYINDVYYLLTAEGGTRYEHAATIARSSRIDGPYEVHPDNPILTAFHAPS
HPLQKCGHASIVQTHTNEWYLAHLTGRPIHSSKESIFQQRGWCPLGRETAIQKLEWKDGWPYVVGGKEGLLEVEAPAMSV
KEFSPTYHIVDEFKDSSLNRHFQTLRIPFTDQIGSVTENPHHLRLYGQESLTSKFTQAFVARRWQSFYFEAETAVSFFPK
NFQQAAGLVNYYNTENWTALQVTYDDALGRILELSVCENLAFSQPLIKKIIIPDEIPYVYLKVTVQRETYTYSYSFDQQE
WEKIDVPLESTHLSDDFIRGGGFFTGAFVGMQCQDTSGERLPADFKYFRYEETTE
>P94489 3.2.1.37~~~xynB~~~Beta-xylosidase~~~COG3507
MKITNPVLKGFNPDPSICRAGEDYYIAVSTFEWFPGVQIHHSKDLVNWHLVAHPLQRVSQLDMKGNPNSGGVWAPCLSYS
DGKFWLIYTDVKVVDGAWKDCHNYLVTCETINGDWSEPIKLNSSGFDASLFHDTDGKKYLLNMLWDHRIDRHSFGGIVIQ
EYSDKEQKLIGKPKVIFEGTDRKLTEAPHLYHIGNYYYLLTAEGGTRYEHAATIARSANIEGPYEVHPDNPILTSWHDPG
NPLQKCGHASIVQTHTDEWYLAHLTGRPIHPDDDSIFQQRGYCPLGRETAIQKLYWKDEWPYVVGGKEGSLEVDAPSIPE
TIFEATYPEVDEFEDSTLNINFQTLRIPFTNELGSLTQAPNHLRLFGHESLTSTFTQAFVARRWQSLHFEAETAVEFYPE
NFQQAAGLVNYYNTENWTALQVTHDEELGRILELTICDNFSFSQPLNNKIVIPREVKYVYLRVNIEKDKYYYFYSFNKED
WHKIDIALESKKLSDDYIRGGGFFTGAFVGMQCQDTSGNHIPADFRYFRYKEK
>P23030 3.2.1.8~~~xynB~~~Endo-1,4-beta-xylanase B~~~COG3693
MTISASDYRHPGNFLKRTTALLCVGTALTALAFNASAACTYTIDSEWSTGFTANITLKNDTGAAINNWNVNWQYSSNRMT
SGWNANFSGTNPYNATNMSWNGSIAPGQSISFGLQGEKNGSTAERPTVTGAACNSATTSSVASSSSTPTTSSSSASSVAS
ALLLQEAQAGFCRVDGTIDNNHTGFTGSGFANTNNAQGAAVVWAIDATSSGRRTLTIRYANGGTANRNGSLVINGGSNGN
YTVSLPTTGAWTTWQTATIDVDLVQGNNIVQLSATTAEGLPNIDSLSVVGGTVRAGNCGSVSSSSSVQSSSSSSSSSAAS
AKKFIGNITTSGAVRSDFTRYWNQITPENESKWGSVEGTRNVYNWAPLDRIYAYARQNNIPVKAHTFVWGAQSPSWLNNL
SGPEVAVEIEQWIRDYCARYPDTAMIDVVNEAVPGHQPAGYAQRAFGNNWIQRVFQLARQYCPNSILILNDYNNIRWQHN
EFIALAKAQGNYIDAVGLQAHELKGMTAAQVKTAIDNIWNQVGKPIYISEYDIGDTNDQVQLQNFQAHFPVFYNHPHVHG
ITLWGYVVGRTWIEGSGLIQDNGTPRPAMTWLINNYLNQ
>Q9ZFM2 3.2.1.37~~~xynB~~~Beta-xylosidase~~~
MKVVNVPSNGREKFKKNWKFCVGTGRLGLALQKEYLDHLKLVQEKIGFRYIRGHGLLSDDVGIYREVEIDGEMKPFYNFT
YIDRIVDSYLALNIRPFIEFGFMPKALASGDQTVFYWKGNVTPPKDYNKWRDLIVAVVSHFIERYGIEEVRTWLFEVWNE
PNLVNFWKDANKQEYFKLYEVTARAVKSVDPHLQVGGPAICGGSDEWITDFLHFCAERRVPVDFVSRHAYTSKAPHKKTF
EYYYQELELEPPEDMLEQFKTVRALIRQSPFPHLPLHITEYNTSYSPINPVHDTALNAAYIARILSEGGDYVDSFSYWTF
SDVFEEMDVPKALFHGGFGLVALHSIPKPTFHAFTFFNALGDELLYRDGEMIVTRRKDGSIAAVLWNLVMEKGEGLTKEV
QLVIPVSFSAVFIKRQIVNEQYGNAWRVWKQMGRPRFPSRQAVETLPSAQPHVMTEQRRATDGVIHLSIVLSKNEVTLIE
IEQVRDETSTYVGLDDGEITSYSS
>O69231 3.2.1.8~~~xynB~~~Endo-1,4-beta-xylanase B~~~
MSTEIPSLSASYANSFKIGAAVHTRMLQTEGEFIAKHYNSVTAENQMKFEEVHPREHEYTFEAADEIVDFAVARGIGVRG
HTLVWHNQTPAWMFEDASGGTASREMMLSRLKQHIDTVVGRYKDQIYAWDVVNEAIEDKTDLIMRDTKWLRLLGEDYLVQ
AFNMAHEADPNALLFYNDYNETDPVKREKIYNLVRSLLDQGAPVHGIGMQGHWNIHGPSMDEIRQAIERYASLDVQLHVT
ELDLSVFRHEDQRTDLTEPTAEMAELQQKRYEDIFGLFREYRSNITSVTFWGVADNYTWLDNFPVRGRKNWPFVFDTELQ
PKDSFWRIIGQD
>P26515 3.2.1.8~~~xlnB~~~Endo-1,4-beta-xylanase B~~~
MNLLVQPRRRRRGPVTLLVRSAWAVALAALAALMLPGTAQADTVVTTNQEGTNNGYYYSFWTDSQGTVSMNMGSGGQYST
SWRNTGNFVAGKGWANGGRRTVQYSGSFNPSGNAYLALYGWTSNPLVEYYIVDNWGTYRPTGEYKGTVTSDGGTYDIYKT
TRVNKPSVEGTRTFDQYWSVRQSKRTGGTITTGNHFDAWARAGMPLGNFSYYMIMATEGYQSSGSSSINVGGTGGGDSGG
GDNGGGGGGCTATVSAGQKWGDRYNLDVSVSGASDWTVTMNVPSPAKVLSNWNVNASYPSAQTLTARLNGSGNNWGATIQ
ANANWTWPSVSCSAG
>D7EZJ3 3.2.1.8~~~xynBS9~~~Endo-1,4-beta-xylanase B~~~
MHDAPAQRKRRRPGRIGPLPRSSRFARLKLLIASACAALLATLALPPGAAHAQTVTSNQTGNHNGYFYSFWTDAPGTVSA
TMGSGGNYSTSWRNTGNFVIGKGWSTGGRRTVTYSGSFNPSGNAYLTLYGWSRNPLVEYYIVDNWGTYRPTGTFKGTVTT
DGGTYDIYQTTRYNAPSIEGNKTFNQYWSVRQQKRTGGTITTGNHFDAWARAGMQLGSHDYMIMATEGYQSSGSSNITVG
GTSGGGGGGGGGGGCTATLSAGERWDDRYNLNVSVSGSSNWTVTMNVPSPATILSTWNITATWPSSQVLVARPNGSGNNF
GVTIKHNGNWTWPTVSCSTG
>P36906 3.2.1.37~~~xynB~~~Beta-xylosidase~~~
MIKVRVPDFSDKKFSDRWRYCVGTGRLGLALQKEYIETLKYVKENIDFKYIRGHGLLCDDVGIYREDVVGDEVKPFYNFT
YIDRIFDSFLEIGIRPFVEIGFMPKKLASGTQTVFYWEGNVTPPKDYEKWSDLVKAVLHHFISRYGIEEVLKWPFEIWNE
PNLKEFWKDADEKEYFKLYKVTAKAIKEVNENLKVGGPAICGGADYWIEDFLNFCYEENVPVDFVSRHATTSKQGEYTPH
LIYQEIMPSEYMLNEFKTVREIIKNSHFPNLPFHITEYNTSYSPQNPVHDTPFNAAYIARILSEGGDYVDSFSYWTFSDV
FEERDVPRSQFHGGFGLVALNMIPKPTFYTFKFFNAMGEEMLYRDEHMLVTRRDDGSVALIAWNEVMDKTENPDEDYEVE
IPVRFRDVFIKRQLIDEEHGNPWGTWIHMGRPRYPSKEQVNTLREVAKPEIMTSQPVANDGYLNLKFKLGKNAVVLYELT
ERIDESSTYIGLDDSKINGY
>Q45070 3.2.1.136~~~xynC~~~Glucuronoxylanase XynC~~~COG5520
MIPRIKKTICVLLVCFTMLSVMLGPGATEVLAASDVTVNVSAEKQVIRGFGGMNHPAWAGDLTAAQRETAFGNGQNQLGF
SILRIHVDENRNNWYKEVETAKSAVKHGAIVFASPWNPPSDMVETFNRNGDTSAKRLKYNKYAAYAQHLNDFVTFMKNNG
VNLYAISVQNEPDYAHEWTWWTPQEILRFMRENAGSINARVIAPESFQYLKNLSDPILNDPQALANMDILGTHLYGTQVS
QFPYPLFKQKGAGKDLWMTEVYYPNSDTNSADRWPEALDVSQHIHNAMVEGDFQAYVWWYIRRSYGPMKEDGTISKRGYN
MAHFSKFVRPGYVRIDATKNPNANVYVSAYKGDNKVVIVAINKSNTGVNQNFVLQNGSASNVSRWITSSSSNLQPGTNLT
VSGNHFWAHLPAQSVTTFVVNR
>P23031 3.2.1.55~~~xynC~~~Alpha-L-arabinofuranosidase C~~~COG3509
MINHNKTPNILAKVFKRTCGLVSTGAALAILSQAASAACTYTIDSEWSTGFTANITLKNDTGAAINNWNVNWQYSSNRMT
SGWNANFSGTNPYNATNMSWNGSIAPGQSISFGLQGEKNGSTAERPTVTGAACNSATTSSVASSSSTPTTSSSSASSVAS
ALLLQEAQAGFCRVDGTIDNNHTGFTGSGFANTNNAQGAAVVWAIDATSSGRRTLTIRYANGGTANRNGSLVINGGSNGN
YTVSLPTTGAWTTWQTATIDVDLVQGNNIVQLSATTAEGLPNIDSLSVVGGTVRAGNCGSVSSSSSVQSSSSSSSTPSQT
CELKAPLRWTSTGPLISPKNPGWISIKDPSIVKYNDTYHVYATYYDTAYRSMYTSFTDWNTAQQAPHISMNGSRVGNTVA
PQVFYFRPHNKWYLITQWAGAYATTDDIRNPNWSAKQKLLQGEPNGALDFWVICNDTHCYLYFSRDDGVLYVSKTTLANF
PNFSGYSIVMEDHRGNGNSYLFEAANVYKLDGQNRYLLMVEAYISGPRFFRSWTATSLDGPWTPLADTEANPFAGNNNVE
WSTGKWADGISHGELIRSGHDEKMTVDPCNLEFLYQGASGPGSTYNTIPYKLGLLRLKK
>P35811 3.2.1.8~~~xynC~~~Endo-1,4-beta-xylanase C~~~COG0726
MKTFSVTKSSVVFAMALGMASTAFAQDFCSNAQHSGQKVTITSNQTGKIGDIGYELWDENGHGGSATFYSDGSMDCNITG
AKDYLCRAGLSLGSNKTYKELGGDMIAEFKLVKSGAQNVGYSYIGIYGWMEGVSGTPSQLVEYYVIDNTLANDMPGSWIG
NERKGTITVDGGTYTVYRNTRTGPAIKNSGNVTFYQYFSVRTSPRDCGTINISEHMRQWEKMGLTMGKLYEAKVLGEAGN
VNGEVRGGHMDFPHAKVYVKNGSDPVSSSSVKSSSSTDAPKSSSSKGNGNVSGKIDACKDVMGHEGKETRTQGQNNSSVT
GNVGSSPYHYEIWYQGGNNSMTFYDNGTYKASWNGTNDFLARVGFKYDEKHTYEELGPIDAYYKWSKQGSAGGYNYIGIY
GWTVDPLVEYYIVDDWFNKPGANLLGQRKGEFTVDGDTYEIWQNTRVQQPSIKGTQTFPQYFSVRKSARSCGHIDITAHM
KKWEELGMKMGKMYEAKVLVEAGGGSGSFDVTYFKMTDKAHPLAQPEPESSSSEAKVESSSSTVALHAAPKMELKSGNFQ
VFDMQGRFLGTVKLDAGASVAQVLKANFKNAGIYMVKQGNFMQRVAVK
>O69230 3.2.1.8~~~xynC~~~Endo-1,4-beta-xylanase C~~~
MRGKWLRLCLAAVLIVSLLPGLGAGEWKASAAKAGDILLSHSFEEGTTQGWTARGGVKVDVTAEQAYQGKQSLQTTGRTE
AWNGPSLSLTDVVHKNEVVEISGYVKLVAGSAPPDLKFTVERRDRNGDTQYDQVNAAEQVTDQKWVKLQGQYSYEQGSSL
LLYLESTDAKAAYLLDEFQIRLVKAAPENPGEPGEAGQALFKAYFEDGNIGNWRARGTEKLEVVSGIGHNSNRSLKTSSR
SETYHGPLVEVLPYLQKGSTVHISFWAMYDEGPATQVINGSLEKEFNRDTANLEYAMFASTTLNKGQWKKIEADIIVPAE
STGISGLRMYAETPWKQSSEVTETDTIPFYVDDVQITATEAIAIEKNIPDLAKKLGSSYALGAAIDQTALDPKDPHSELL
TKHFNSITAGNFMKMDAMQPTEGKFVWSEADKLVNFAAANNMQVRGHTLLWHSQVPDWFFTDPNDPSKPATREQLMQRMK
THIQTIVSRYKGKVHTWDVVNEVISDGGGLRNQASGSKWRDIIGDVDGDGDDSDYIELAFRYAREADPDAVLVINDYGIE
GSVSKMNDMVKLVEKLLAKGTPIDAIGFQMHVSMYGPDIKQIREAFNRAAALGVHIQVTELDMSIYSGNSEQEKPVTDEM
MLEQAYRYRALFDLFKEFDDRGVMDSVTLWGLADDGTWLDDFPVKGRKDAPLLFDRKLKAKPAYWALVDPSTLPVYRNEW
TASQAKVSLPDRKGQEDIIWGAVRALPFSHVIEGAVGTTGEVKTLWDGKQLNLRIEVKDATRLKGDQVEVFVSPEDMTAG
KKNSTPKDGQYIFNRDGGKGKDQKLYQVKENKSGYVVYASLPLSSADLAAGKVLSLDFRITDKQPNGKTSIVVWNDVNNQ
QPQKTENRGKLKLGFDLKHAKVMYGTPTVDGKEDKLWKKAVTITTDVKVTGNSGAKAKAKLLWDEKYLYVLAEVKDPLLS
KKSANAHEQDSIELFIDLNKNQTNSYEEDDAQYRVNFDNETSFGGSPRKELFKSATRLTKEGYIVEAAIPLENVRTKESK
WIGFDLQVNDDGAGDGKRSSVFMWSDPSGNSYRDTSGFGSLLLMKK
>P26220 3.2.1.8~~~xlnC~~~Endo-1,4-beta-xylanase C~~~
MQQDGTQQDRIKQSPAPLNGMSRRGFLGGAGTLALATASGLLLPGTAHAATTITTNQTGTDGMYYSFWTDGGGSVSMTLN
GGGSYSTQWTNCGNFVAGKGWSTGDGNVRYNGYFNPVGNGYGCLYGWTSNPLVEYYIVDNWGSYRPTGTYKGTVSSDGGT
YDIYQTTRYNAPSVEGTKTFQQYWSVRQSKVTSGSGTITTGNHFDAWARAGMNMGQFRYYMIMATEGYQSSGSSNITVSG
>Q45071 3.2.1.55~~~xynD~~~Arabinoxylan arabinofuranohydrolase~~~COG3507
MRKKCSVCLWILVLLLSCLSGKSAYAATSTTIAKHIGNSNPLIDHHLGADPVALTYNGRVYIYMSSDDYEYNSNGTIKDN
SFANLNRVFVISSADMVNWTDHGAIPVAGANGANGGRGIAKWAGASWAPSIAVKKINGKDKFFLYFANSGGGIGVLTADS
PIGPWTDPIGKPLVTPSTPGMSGVVWLFDPAVFVDDDGTGYLYAGGGVPGVSNPTQGQWANPKTARVIKLGPDMTSVVGS
ASTIDAPFMFEDSGLHKYNGTYYYSYCINFGGTHPADKPPGEIGYMTSSSPMGPFTYRGHFLKNPGAFFGGGGNNHHAVF
NFKNEWYVVYHAQTVSSALFGAGKGYRSPHINKLVHNADGSIQEVAANYAGVTQISNLNPYNRVEAETFAWNGRILTEKS
TAPGGPVNNQHVTSIQNGDWIAVGNADFGAGGARSFKANVASTLGGKIEVRLDSADGKLVGTLNVPSTGGAQTWREIETA
VSGATGVHKVFFVFTGTGTGNLFNFDYWQFTQR
>P54865 ~~~xynD~~~Bifunctional xylanase/deacetylase~~~
MSDSFEATRTTRRRRPLQALTGLLAAGALVAGALAAASPAAAAVTSNTTGTHDGYFYSFWTDSPGSVSMDLNSGGGYTRW
SNTGNFVAGKGWSTGGRKTVSYSGQFNPSRNAYLTLYGWTQSPLVEYYIVDSWGTYRPTGTFMGTVTSDGGTYDIYRTQR
VNKPSIEGDSSTFYQYWSVRQQKRTGGTITSGNHFDAWASKGMNLGRHNYMIMATEGYQSSGSSSITVSEGSGGGGGGDT
GGGGGSTGCSVTATRAEEWSDRFNVTYSVSGSSAWTVNLALNGSQTIQASWNANVTGSGSTRTVTPNGSGNTFGVTVMKN
GSSTTPAATCAGSGGGTATPTPTPTPTPTPQSCSAGYVGLTFDDGPNTGTTNQILSTLTQYGATATVFPTGQNAQGNPSL
MQAYKNAGVQIGNHSWDHPHLVNMSQSDMQSQLTRTQQAIQQTAGVTPTLFRPPYGESNATLRQVESSLGLREIIWDVDS
QDWNNASASQIRQAASRLTNGQIILMHDWPAATVQALPGILQDLRSRNLCTGHISSSTGRAVAPSSAGGGGGGGGGTGSC
SVSAVRGEEWADRFNVTYSVSGSSSWVVTLGLNGGQSVQSSWNAALTGSSGTVTARPNGSGNSFGVTFYKNGSSATPGAT
CATG
>P45796 3.2.1.55~~~xynD~~~Arabinoxylan arabinofuranohydrolase~~~COG3507
MIRKCLVLFLSFALLLSVFPMLNVDAANRPLAKIPGNSNPLMDHKLGADPYSLVYDGRVYIFMSSDTYVYNKDGSIKEND
FSALDRIQVISSTDMVNWTDHGTIPVAGANNKNSGRGIAKWASNSWAPAVAHKKINGRDKFFLYFANGGAGIGVLTADTP
IGPWTDPLGKALVTHSTPGMAGVTWLFDPAVLVDDDGTGYLYSGGGIPNESDPASIANPKTARVIKLGADMTSVIGSATT
IDAPYLFEDSGIHKYNGKYYYSYCINFAGTHPQQYPAGEIGYMVSDNPMGPFTYKGHFLKNPYTFFGVGGNNHHAVFNFK
NEWYVVYHAQTVSKAQIGAGKGYRSPHINKLVHKEDGSISEVQGNMTGIAQLSNMNPYTRVEAETIAWQAGVTTEPTQAS
GGPISNLNVTNIHNGDWIAVGKADFGSAGAKTFKANVATNVGGNIEVRLDSETGPLVGSLKVPSTGGMQTWREVETTINN
ATGVHNIYLVFTGSGSGNLLNLDAWQFTPNTGGNTITKVEAENMKIGGTYAGKISAPFDGVALYANADYVSYSQYFANST
HNISVRGASSNAGTAKVDLVIGGVTVGSFNFTGKTPTVQTLSNITHATGDQEIKLALTSDDGTWDAYVDFIEFSL
>P77300 ~~~xynR~~~HTH-type transcriptional regulator XynR~~~COG1414
MPIIQSVERALQILDLFNEQATELKITDISKLMGLSKSTLHSLLKTLQLHGYIDQNPENGKYRLGMKLVERGHFVVGSID
IRQKAKGWLTELSRRTGQTTHLGILDGREGVYIEKIEGKLAAIAYSRIGRRLPVHATAIGKVLIAWLGEAELNALLEGYQ
YTTFTPATLASREALMSALAQTREQGYALDSEENEQGVRCVAVPVWNHESRVIAALSLSTLTSRVDDAELANFREQLQQA
GLALSRALGYPA
>P51584 3.2.1.8~~~xynY~~~Endo-1,4-beta-xylanase Y~~~
MKNKRVLAKITALVVLLGVFFVLPSNISQLYADYEVVHDTFEVNFDGWCNLGVDTYLTAVENEGNNGTRGMMVINRSSAS
DGAYSEKGFYLDGGVEYKYSVFVKHNGTGTETFKLSVSYLDSETEEENKEVIATKDVVAGEWTEISAKYKAPKTAVNITL
SITTDSTVDFIFDDVTITRKGMAEANTVYAANAVLKDMYANYFRVGSVLNSGTVNNSSIKALILREFNSITCENEMKPDA
TLVQSGSTNTNIRVSLNRAASILNFCAQNNIAVRGHTLVWHSQTPQWFFKDNFQDNGNWVSQSVMDQRLESYIKNMFAEI
QRQYPSLNLYAYDVVNEAVSDDANRTRYYGGAREPGYGNGRSPWVQIYGDNKFIEKAFTYARKYAPANCKLYYNDYNEYW
DHKRDCIASICANLYNKGLLDGVGMQSHINADMNGFSGIQNYKAALQKYINIGCDVQITELDISTENGKFSLQQQADKYK
AVFQAAVDINRTSSKGKVTAVCVWGPNDANTWLGSQNAPLLFNANNQPKPAYNAVASIIPQSEWGDGNNPAGGGGGGKPE
EPDANGYYYHDTFEGSVGQWTARGPAEVLLSGRTAYKGSESLLVRNRTAAWNGAQRALNPRTFVPGNTYCFSVVASFIEG
ASSTTFCMKLQYVDGSGTQRYDTIDMKTVGPNQWVHLYNPQYRIPSDATDMYVYVETADDTINFYIDEAIGAVAGTVIEG
PAPQPTQPPVLLGDVNGDGTINSTDLTMLKRSVLRAITLTDDAKARADVDKNGSINSTDVLLLSRYLLRVIDKFPVAENP
SSSFKYESAVQYRPAPDSYLNPCPQAGRIVKETYTGINGTKSLNVYLPYGYDPNKKYNIFYLMHGGGENENTIFSNDVKL
QNILDHAIMNGELEPLIVVTPTFNGGNCTAQNFYQEFRQNVIPFVESKYSTYAESTTPQGIAASRMHRGFGGFSMGGLTT
WYVMVNCLDYVAYFMPLSGDYWYGNSPQDKANSIAEAINRSGLSKREYFVFAATGSDHIAYANMNPQIEAMKALPHFDYT
SDFSKGNFYFLVAPGATHWWGYVRHYIYDALPYFFHE
>P10478 3.2.1.8~~~xynZ~~~Endo-1,4-beta-xylanase Z~~~COG2382
MSRKLFSVLLVGLMLMTSLLVTISSTSAASLPTMPPSGYDQVRNGVPRGQVVNISYFSTATNSTRPARVYLPPGYSKDKK
YSVLYLLHGIGGSENDWFEGGGRANVIADNLIAEGKIKPLIIVTPNTNAAGPGIADGYENFTKDLLNSLIPYIESNYSVY
TDREHRAIAGLSMGGGQSFNIGLTNLDKFAYIGPISAAPNTYPNERLFPDGGKAAREKLKLLFIACGTNDSLIGFGQRVH
EYCVANNINHVYWLIQGGGHDFNVWKPGLWNFLQMADEAGLTRDGNTPVPTPSPKPANTRIEAEDYDGINSSSIEIIGVP
PEGGRGIGYITSGDYLVYKSIDFGNGATSFKAKVANANTSNIELRLNGPNGTLIGTLSVKSTGDWNTYEEQTCSISKVTG
INDLYLVFKGPVNIDWFTFGVESSSTGLGDLNGDGNINSSDLQALKRHLLGISPLTGEALLRADVNRSGKVDSTDYSVLK
RYILRIITEFPGQGDVQTPNPSVTPTQTPIPTISGNALRDYAEARGIKIGTCVNYPFYNNSDPTYNSILQREFSMVVCEN
EMKFDALQPRQNVFDFSKGDQLLAFAERNGMQMRGHTLIWHNQNPSWLTNGNWNRDSLLAVMKNHITTVMTHYKGKIVEW
DVANECMDDSGNGLRSSIWRNVIGQDYLDYAFRYAREADPDALLFYNDYNIEDLGPKSNAVFNMIKSMKERGVPIDGVGF
QCHFINGMSPEYLASIDQNIKRYAEIGVIVSFTEIDIRIPQSENPATAFQVQANNYKELMKICLANPNCNTFVMWGFTDK
YTWIPGTFPGYGNPLIYDSNYNPKPAYNAIKEALMGY
>Q9ZBU1 1.1.3.41~~~xyoA~~~Alditol oxidase~~~COG0277
MSDITVTNWAGNITYTAKELLRPHSLDALRALVADSARVRVLGSGHSFNEIAEPGDGGVLLSLAGLPSVVDVDTAARTVR
VGGGVRYAELARVVHARGLALPNMASLPHISVAGSVATGTHGSGVGNGSLASVVREVELVTADGSTVVIARGDERFGGAV
TSLGALGVVTSLTLDLEPAYEMEQHVFTELPLAGLDPATFETVMAAAYSVSLFTDWRAPGFRQVWLKRRTDRPLDGFPYA
APAAEKMHPVPGMPAVNCTEQFGVPGPWHERLPHFRAEFTPSSGAELQSEYLMPREHALAALHAMDAIRETLAPVLQTCE
IRTVAADAQWLSPAYGRDTVAAHFTWVEDTAAVLPVVRRLEEALVPFAARPHWGKVFTVPAGELRALYPRLADFGALAGA
LDPAGKFTNAFVRGVLAG
>Q9KX73 1.1.3.41~~~xyoA~~~Alditol oxidase~~~
MSTAVTNWAGNITYTAKEVHRPATAEELADVVARSAWGACAGAAGHSFNEIADPGPDGVLLRLDALPAETDVDTTARTVR
VGGGVRYAELARVVHAHGLALPNMASLPHISVAGSVATGTHGSGVTNGPLAAPVREVELVTADGSQVRIAPGERRFGGAV
TSLGALGVVTALTLDLEPAFEVGQHLFTELPLRGLDFETVAAAGYSVSLFTDWREPGFRQVWLKRRTDQELPDFPWARPA
TVALHPVPGMPAENCTQQFGVPGPWHERLPHFRAEFTPSSGAELQSEYLLPRAHALDALDAVDRIRDTVAPVLQTCEVRT
VAPDEQWLGPSHGRDTVALHFTWVKDTEAVLPVVRRLEEALDAFDPRPHWGKVFTTSAAALRARYPRLADFRALARELDP
SGKFTNTFLRDLLDG
>A0QYB3 ~~~xypA~~~Xylitol-binding protein~~~COG1879
MNITSKIGAIAAAGAVGLGLTACGAGDTAANSDTKRIGVTVYDMSSFITEGKEGMDTYAKANNIELVWNSANNDVSTQAS
QVDSLINQGVDAIIVVPVQADSLGPQVASAKSKGIPLLAVNAALETPDLAGNVQPDDVAAGAQEMQMMADRLGGKGNIVI
LQGPLGGSGEINRGKGIDQVLAKYPDIKVLAKDTANWKRDEAVNKMKNWISSFGPQIDGVVAQNDDMGLGALQALKEAGR
TGVPIVGIDGIEDGLNAVKSGDFIGTSLQNGTVELSAGLAVADALVKGEDVKTDPVYVMPAITKDNVDVAIEHVVTERQK
FLDGLVELTQQNLKTGDIAYEGIPGQTQP
>P44447 3.1.3.-~~~~~~Putative phosphatase HI_0003~~~COG0561
MMYKAVFSDFNGTLLTSQHTISPRTVVVIKRLTANGIPFVPISARSPLGILPYWKQLETNNVLVAFSGALILNQNLEPIY
SVQIEPKDILEINTVLAEHPLLGVNYYTNNDCHARDVENKWVIYERSVTKIEIHPFDEVATRSPHKIQIIGEAEEIIEIE
VLLKEKFPHLSICRSHANFLEVMHKSATKGSAVRFLEDYFGVQTNEVIAFGDNFNDLDMLEHVGLGVAMGNAPNEIKQAA
NVVTATNNEDGLALILEEKFPE
>O83050 ~~~~~~Uncharacterized protein TP_0004~~~COG5512
MNNGVNKLSDLLVLTTEYIQASYETEAFDAHREWVCIVGNPVALHSTLVDIRNGKVVVKVTHPGWAQYLLLKKDEIVHAL
RRRYPSLGVTGMSTYVDSTSRTPSAKKDMQGLSVSEKQTRPVPELAEVFEQLRTLFQVKTEEPSH
>P9WMA7 ~~~~~~Uncharacterized protein Rv0007~~~COG3266
MTAPNEPGALSKGDGPNADGLVDRGGAHRAATGPGRIPDAGDPPPWQRAATRQSQAGHRQPPPVSHPEGRPTNPPAAADA
RLNRFISGASAPVTGPAAAVRTPQPDPDASLGCGDGSPAEAYASELPDLSGPTPRAPQRNPAPARPAEGGAGSRGDSAAG
SSGGRSITAESRDARVQLSARRSRGPVRASMQIRRIDPWSTLKVSLLLSVALFFVWMITVAFLYLVLGGMGVWAKLNSNV
GDLLNNASGSSAELVSSGTIFGGAFLIGLVNIVLMTALATIGAFVYNLITDLIGGIEVTLADRD
>P9WMA4 ~~~~~~Uncharacterized protein MT0087~~~
MSPGSRRASPQSAREVVELDRDEAMRLLASVDHGRVVFTRAALPAIRPVNHLVVDGRVIGRTRLTAKVSVAVRSSADAGV
VVAYEADDLDPRRRTGWSVVVTGLATEVSDPEQVARYQRLLHPWVNMAMDTVVAIEPEIVTGIRIVADSRTP
>P9WMA5 ~~~~~~Uncharacterized protein Rv0080~~~COG3467
MSPGSRRASPQSAREVVELDRDEAMRLLASVDHGRVVFTRAALPAIRPVNHLVVDGRVIGRTRLTAKVSVAVRSSADAGV
VVAYEADDLDPRRRTGWSVVVTGLATEVSDPEQVARYQRLLHPWVNMAMDTVVAIEPEIVTGIRIVADSRTP
>P9WMI6 ~~~~~~Uncharacterized HTH-type transcriptional regulator MT0088~~~
MESEPLYKLKAEFFKTLAHPARIRILELLVERDRSVGELLSSDVGLESSNLSQQLGVLRRAGVVAARRDGNAMIYSIAAP
DIAELLAVARKVLARVLSDRVAVLEDLRAGGSAT
>P9WMI7 ~~~~~~Uncharacterized HTH-type transcriptional regulator Rv0081~~~COG0640
MESEPLYKLKAEFFKTLAHPARIRILELLVERDRSVGELLSSDVGLESSNLSQQLGVLRRAGVVAARRDGNAMIYSIAAP
DIAELLAVARKVLARVLSDRVAVLEDLRAGGSAT
>P75103 ~~~~~~UPF0134 protein MPN_010~~~
MAYSPSLNDIKSILNKYTSKDYELKCENRYDGKLELWLKGVFEEIVKTPGTRYVTHKQLDEKLKNFVTKTEFKEFQTVVM
ESFAVQNQNIDAQGEQIKELQVEQKAQGKTLQLILEALQGINKRLDNLESK
>P96825 1.1.1.-~~~~~~Putative short-chain type dehydrogenase/reductase Rv0148~~~COG1028
MPGVQDRVIVVTGAGGGLGREYALTLAGEGASVVVNDLGGARDGTGAGSAMADEVVAEIRDKGGRAVANYDSVATEDGAA
NIIKTALDEFGAVHGVVSNAGILRDGTFHKMSFENWDAVLKVHLYGGYHVLRAAWPHFREQSYGRVVVATSTSGLFGNFG
QTNYGAAKLGLVGLINTLALEGAKYNIHANALAPIAATRMTQDILPPEVLEKLTPEFVAPVVAYLCTEECADNASVYVVG
GGKVQRVALFGNDGANFDKPPSVQDVAARWAEITDLSGAKIAGFKL
>Q9WXM9 ~~~~~~UPF0166 protein TM_0021~~~COG1993
MKLLKIYLGEKDKHSGKPLFEYLVKRAYELGMKGVTVYRGIMGFGHKRHMHRSDFFSLSPDLPIVLEIVDEEERINLFLK
EIDNIDFDGLVFTADVNVVKMG
>P9WMI3 ~~~~~~Uncharacterized HTH-type transcriptional regulator Rv0023~~~COG1396
MSRESAGAAIRALRESRDWSLADLAAATGVSTMGLSYLERGARKPHKSTVQKVENGLGLPPGTYSRLLVAADPDAELARL
IAAQPSNPTAVRRAGAVVVDRHSDTDVLEGYAEAQLDAIKSVIDRLPATTSNEYETYILSVIAQCVKAEMLAASSWRVAV
NAGADSTGRLMEHLRALEATRGALLERMPTSLSARFDRACAQSSLPEAVVAALIGVGADEMWDIRNRGVIPAGALPRVRA
FVDAIEASHDADEGQQ
>P9WMA1 ~~~~~~Uncharacterized protein Rv0025~~~
MSEQAGSSVAVIQERQALLARQHDAVAEADRELADVLASAHAAMRESVRRLDAIAAELDRAVPDQDQLAVDTPMGAREFQ
TFLVAKQREIVAVVAAAHELDRAKSAVLKRLRAQYTEPAR
>P47273 ~~~~~~Uncharacterized protein MG027~~~
MAITVKGLTNKLTRTQRRIAVVEFIFSLLFFLPKEAEVIQADFLEYDTKERQLNEWQKLIVKAFSENIFSFQKKIEEQQL
KNQLEIQTKYNKISGKKIDLLTTAVVLCALSEQKAHNTDKPLLISEALLIMDHYSQGAEKKQTHALLDKLL
>P0DMM2 ~~~~~~Uncharacterized protein Rv0028A~~~
MADSALQQQLDEVRALLTRARELFGPNPIEPPTDIAPDPDSTKTWLI
>P44465 ~~~~~~UPF0250 protein HI_0028~~~COG2921
MTIENDYAKLKELMEFPAKMTFKVAGINREGLAQDLIQVVQKYIKGDYIPKEKRSSKGTYNSVSIDIIAENFDQVETLYK
ELAKVEGVKMVI
>P9WM97 ~~~~~~Uncharacterized protein Rv0028~~~
MTDANPAFDTVHPSGHILVRSCRGGYMHSVSLSEAAMETDAETLAEAILLTADVSCLKALLEVRNEIVAAGHTPSAQVPT
TDDLNVAIEKLLAHQLRRRNR
>P9WM95 ~~~~~~Uncharacterized protein Rv0030~~~
MVSGSDSRSEPSQLSDRDLVESVLRDLSEAADKWEALVTQAETVTYSVDLGDVRAVANSDGRLLELTLHPGVMTGYAHGE
LADRVNLAITALRDEVEAENRARYGGRLQ
>P9WM93 ~~~~~~Uncharacterized protein Rv0034~~~COG3631
MTDDADLDLVRRTFAAFARGDLAELTQCFAPDVEQFVPGKHALAGVFRGVDNVVACLGDTAAAADGTMTVTLEDVLSNTD
GQVIAVYRLRASRAGKVLDQREAILVTVAGGRITRLSEFYADPAATESFWA
>O24876 ~~~~~~Nucleoid-associated protein HP_0035~~~COG0718
MDFSQLGGLLDGMKKEFSQLEEKNKDTIHTSKSGGGMVSVSFNGLGELVDLQIDDSLLEDKEAMQIYLMSALNDGYKAVE
ENRKNLAFNMLGNFAKL
>P9WM91 ~~~~~~Uncharacterized protein Rv0036c~~~
MADPGPFVADLRAESDDLDALVAHLPADRWADPTPAPGWTIAHQIGHLLWTDRVALTAVTDEAGFAELMTAAAANPAGFV
DDAATELAAVSPAELLTDWRVTRGRLHEELLAVPDGRKLAWFGPPMSAASMATARLMETWAHGLDVADALGVIRPATQRL
RSIAHLGVRTRDYAFIVNNLTPPAEPFLVELRGPSGDTWSWGPSDAAQRVTGSAEDFCFLVTQRRALSTLDVNAVGEDAQ
RWLTIAQAFAGPPGRGR
>P9WJY1 ~~~~~~Uncharacterized MFS-type transporter Rv0037c~~~COG2814
MPRVEVGLVIHSRMHARAPVDVWRSVRSLPDFWRLLQVRVASQFGDGLFQAGLAGALLFNPDRAADPMAIAGAFAVLFLP
YSLLGPFAGALMDRWDRRWVLVGANTGRLALIAGVGTILAVGAGDVPLLVGALVANGLARFVASGLSAALPHVVPREQVV
TMNSVAIASGAVSAFLGANFMLLPRWLLGSGDEGASAIVFLVAIPVSIALLWSLRFGPRVLGPDDTERAIHGSAVYAVVT
GWLHGARTVVQLPTVAAGLSGLAAHRMVVGINSLLILLLVRHVTARAVGGLGTALLFFAATGLGAFLANVLTPTAIRRWG
RYATANGALAAAATIQVAAAGLLVPVMVVCGFLLGVAGQVVKLCADSAMQMDVDDALRGHVFAVQDALFWVSYILSITVA
AALIPEHGHAPVFVLFGSAIYLAGLVVHTIVGRRGQPVIGR
>P9WFK5 ~~~~~~UPF0301 protein Rv0038~~~COG1678
MVAPHEDPEDHVAPAAQRVRAGTLLLANTDLLEPTFRRSVIYIVEHNDGGTLGVVLNRPSETAVYNVLPQWAKLAAKPKT
MFIGGPVKRDAALCLAVLRVGADPEGVPGLRHVAGRLVMVDLDADPEVLAAAVEGVRIYAGYSGWTIGQLEGEIERDDWI
VLSALPSDVLVGPRADLWGQVLRRQPLPLSLLATHPIDLSRN
>C0QY54 ~~~ssuA~~~Putative ABC transporter periplasmic binding protein BHWA1_00430~~~COG0715
MRKTVKIITSLVLIACLFLLASCANKGSSSSGDATTLKVAVMPFLNSVPIEYMINNKLDEKYGFKIETVYFPSGGPMNEA
LGAGLWEVGTLSAASVYSLANYNAHVVADIGHSEGGIEVLVNPDSDILTVKGVNKDFPEVYGDAATLKGKTIAVPTGTIS
HLNVIHWLRAINVDPNTVNIVHMEFPQAYQALKAKKIDAAALNPPTSFSAEADGMKIVSSLTTLNIPQYDSIIVSDEAFN
NKKDTIVLYIKAFLEACDALQADPNMAAQELLNWYTKNGSTSTLEACQSEVQTRPFVTTEEIKNIKTGDSVRITADFFAS
QSLITEDKLTVVDQNVDTELLGKALN
>P9WMG9 ~~~~~~Uncharacterized HTH-type transcriptional regulator Rv0043c~~~COG1802
MPKKYGVKEKDQVVAHILNLLLTGKLRSGDRVDRNEIAHGLGVSRVPIQEALVQLEHDGIVSTRYHRGAFIERFDVATIL
EHHELDGLLNGIASARAAANPTPRILGQLDAVMRSLRNSKESRAFAECVWEYRRTVNDEYAGPRLHATIRASQNLIPRVF
WMTYQNSRDDVLPFYEEENAAIHRREPEAARAACIGRSELMAQTMLAELFRRRVLVPPEGACPGPFGAPIPGFARSYQPS
SPVP
>P44481 1.-.-.-~~~~~~Uncharacterized oxidoreductase HI_0048~~~COG1028
MEFTMNIAANHNLENKLIIITGAGGVLCSFLAKQLAYTKANIALLDLNFEAADKVAKEINQSGGKAKAYKTNVLELENIK
EVRNQIETDFGTCDILINGAGGNNPKATTDNEFHQFDLNETTRTFFDLDKSGIEFVFNLNYLGSLLPTQVFAKDMLGKQG
ANIINISSMNAFTPLTKIPAYSGAKAAISNFTQWLAVYFSKVGIRCNAIAPGFLVSNQNLALLFDTEGKPTDRANKILTN
TPMGRFGESEELLGALLFLIDENYSAFVNGVVLPVDGGFSAYSGV
>Q9KGL3 ~~~~~~UPF0213 protein BH0048~~~COG2827
MNHYVYILECKDGSWYTGYTTDVDRRIKKHASGKGAKYTRGRGPFRLVATWAFPSKEEAMRWEYEVKHLSRRKKEQLVSL
KGGPYENTTKLSTT
>P9WM87 ~~~~~~Uncharacterized protein Rv0048c~~~
MAKWLGAPLARGVSTATRAKDSDRQDACRILDDALRDGELSMEEHRERVSAATKAVTLGDLQRLVADLQVESAPAQMPAL
KSRAKRTELGLLAAAFVASVLLGVGIGWGVYGNTRSPLDFTSDPGAKPDGIAPVVLTPPRQLHSLGGLTGLLEQTRKRFG
DTMGYRLVIYPEYASLDRVDPADDRRVLAYTYRGGWGDATSSAKSIADVSVVDLSKFDAKTAVGIMRGAPETLGLKQSDV
KSMYLIVEPVKDPTTPAALSLSLYVSSDYGGGYLVFAGDGTIKHVSYPS
>P9WM85 ~~~~~~Uncharacterized protein Rv0049~~~
MDYTLRRRSLLAEVYSGRTGVSEVCDANPYLLRAAKFHGKPSRVICPICRKEQLTLVSWVFGEHLGAVSGSARTAEELIL
LATRFSEFAVHVVEVCRTCSWNHLVKSYVLGAARPARPPRGSGGTRTARNGARTASE
>Q9I783 ~~~~~~Uncharacterized protein PA0049~~~
MRYARHASRYSLFTLAVSAALLPGAGWAANGDLAGARKPPSVACSWNREAALSYEERRLDTPLPFSGANVVTHDQTPLAE
RIVKGAGFDGFEPAFAKRLCAADGRTPVTSYAKALKLVTEEGRALWRAAVDRAQGRRAIPAGALPASDDRMLYWTRLYMT
RTLRQWAPSFHLGKAQAQALQWRFERASRGQLDIDLPRRYAADGSRYRRMIISGFDVFTLGTPGTANTGLRNGNPSGATA
LALDGREFRLADGSLLRIEAYLLPVSYDPFNRGMQEDTLGPWFRPGPRRVDASITISQGGANQFWLEAWNGRFHGSSAGN
DGIVYCPADSALPNYVLPLGSVTNPGTAPISLRGSGCNINPPRRWLGYDSASRWRQNLPAQFSKASLPVRQLLAADTWRG
IERPPGATSQAAEGFDVTWHTNYDFFPDCANPRTENVPTNGVMNAMPDPSLVLPPNRRICARNGGGGDYLSNESAYRNTV
LRDAFRLEIPAGHIHVPVMNNYYTGVPASGGGARNDNAISDARYEAYRSAIVAQTRALLVGVGNALAQGAQAD
>P71336 ~~~~~~Uncharacterized protein HI_0052~~~COG1638
MKSIKGLGKLLLASSILFSSSAFAKTIIKLGHYNSDIHPSHIALQEYFKKTIENETNHKYEIRLYPNNQLGGEDQIVNGL
RNGTIEAGITGLLLQNVDPIFGVWEWPYLFKDNQEAKKVLESPIANKIGQKMEKYGIKLLAYGMNGFRVISSNKKLEKFD
DFKGLRLRVPLNSLFVDWAKAMNINPQSMPLSEVFTALEQKVIDGQENPYMLIKDSGLYEVQKYIIQSNHIFSPGLLQIS
LKTWNKIPKEDQIIFEKAAKLYQEKEWELAIKTELEVKDYLAKHGNEIIVPSEAFKNDMVNASKVLYDSFYKKYDWAKDV
VQKINEAK
>Q2G1Q1 ~~~~~~Uncharacterized lipoprotein SAOUHSC_00052~~~
MKRLNKLVLGIIFLFLVISITAGCGIGKEAEVKKSFEKTLSMYPIKNLEDLYDKEGYRDDQFDKNDKGTWIINSEMVIQP
NNEDMVAKGMVLYMNRNTKTTNGYYYVDVTKDEDEGKPHDNEKRYPVKMVDNKIIPTKEIKDEKIKKEIENFKFFVQYGD
FKNLKNYKDGDISYNPEVPSYSAKYQLTNDDYNVKQLRKRYDIPTSKAPKLLLKGSGNLKGSSVGYKDIEFTFVEKKEEN
IYFSDSLDYKKSGDV
>Q2G1Q0 ~~~~~~Uncharacterized lipoprotein SAOUHSC_00053~~~
MIKRVNKLVLGISLLFLVISITAGCGMGKEAEIKKSFEKTLSMYPIKNLEDLYDKEGYRDDQFDKNDKGTWIVNSQMAIQ
NKGEALKIKGMLLKIDRNTRSAKGFYYTNEIKTEKYEVAQDNQKKYPVKMINNKFISTEEVKEENIKKEIENFKFFAQYS
NFKDLMNYKDGDISYNPEVPSYSAQYQLTNDDYNVKQLRKRYDIPTNKAPKLLLKGTGNLKGSSVGYKKIEFTFLENKNE
NIYFTDSLHLEPSEDK
>C0QYX7 ~~~~~~Uncharacterized protein BHWA1_00569~~~COG2385
MNSIYAFANQNIIRVQLTDVKAPYTINIKGPYKAYNYKYESEIISALTNETVMVVENRLGLKVNEVGVYKEGIVFETQDG
FTLNGIEYYGSLMFIPYNDTMIVVNELNIEDYVKGVLPHEMSPDWPMEALKAQAVAARTYAMYHILKNANKLPFDVDNTT
KYQVYNGKEKMNWSVEQAVDRTRYEIAVYKGKVIATYFSALCGGHTDSAENVFGVAVPYLGGVACPYCNAQIKPWTNALS
YNELNNDLANYSVHATEKSSIGISTDPKSGKATNIKIDNNDITSRDFRTTLSPRLVPSLNFTIKKVDNGIIITGKGSGHG
VGMCQWGAYGMAQVKKDYKEILKFYYNGVDIVDYNRVNKEFEPDVWGN
>P9WM82 ~~~~~~Uncharacterized protein MT0595~~~
MKAKVGDWLVIKGATIDQPDHRGLIIEVRSSDGSPPYVVRWLETDHVATVIPGPDAVVVTAEEQNAADERAQHRFGAVQS
AILHARGT
>P9WM83 ~~~~~~Uncharacterized protein Rv0569~~~COG2905
MKAKVGDWLVIKGATIDQPDHRGLIIEVRSSDGSPPYVVRWLETDHVATVIPGPDAVVVTAEEQNAADERAQHRFGAVQS
AILHARGT
>P9WHK0 ~~~~~~Putative phosphoribosyl transferase MT0597~~~
MKLFDDRGDAGRQLAQRLAQLSGKAVVVLGLPRGGVPVAFEVAKSLQAPLDVLVVRKLGVPFQPELAFGAIGEDGVRVLN
DDVVRGTHLDAAAMDAVERKQLIELQRRAERFRRGRDRIPLTGRIAVIVDDGIATGATAKAACQVARAHGADKVVLAVPI
GPDDIVARFAGYADEVVCLATPALFFAVGQGYRNFTQTSDDEVVAFLDRAHRDFAEAGAIDAAADPPLRDEEVQVVAGPV
PVAGHLTVPEKPRGIVVFAHGSGSSRHSIRNRYVAEVLTGAGFATLLFDLLTPEEERNRANVFDIELLASRLIDVTGWLA
TQPDTASLPVGYFGASTGAGAALVAAADPRVNVRAVVSRGGRPDLAGDSLGSVVAPTLLIVGGRDQVVLELNQRAQAVIP
GKCQLTVVPGATHLFEEPGTLEQVAKLACDWFIDHLCGPGPSG
>P9WHK1 ~~~~~~Putative phosphoribosyl transferase Rv0571c~~~COG1073
MKLFDDRGDAGRQLAQRLAQLSGKAVVVLGLPRGGVPVAFEVAKSLQAPLDVLVVRKLGVPFQPELAFGAIGEDGVRVLN
DDVVRGTHLDAAAMDAVERKQLIELQRRAERFRRGRDRIPLTGRIAVIVDDGIATGATAKAACQVARAHGADKVVLAVPI
GPDDIVARFAGYADEVVCLATPALFFAVGQGYRNFTQTSDDEVVAFLDRAHRDFAEAGAIDAAADPPLRDEEVQVVAGPV
PVAGHLTVPEKPRGIVVFAHGSGSSRHSIRNRYVAEVLTGAGFATLLFDLLTPEEERNRANVFDIELLASRLIDVTGWLA
TQPDTASLPVGYFGASTGAGAALVAAADPRVNVRAVVSRGGRPDLAGDSLGSVVAPTLLIVGGRDQVVLELNQRAQAVIP
GKCQLTVVPGATHLFEEPGTLEQVAKLACDWFIDHLCGPGPSG
>P9WM80 ~~~~~~Uncharacterized protein MT0599~~~
MGEHAIKRHMRQRKPTKHPLAQKRGARILVLTDDPRRSVLIVPGCHLDSMRREKNAYYFQDGNALVGMVVSGGTVEYDAD
DRTYVVQLTDGRHTTESSFEHSSPSRSPQSDDL
>P9WM81 ~~~~~~Uncharacterized protein Rv0572c~~~
MGEHAIKRHMRQRKPTKHPLAQKRGARILVFTDDPRRSVLIVPGCHLDSMRREKNAYYFQDGNALVGMVVSGGTVEYDAD
DRTYVVQLTDGRHTTESSFEHSSPSRSPQSDDL
>P9WM78 ~~~~~~Probable polyglutamine synthesis accessory protein MT0602~~~
MAGNPDVVTVLLGGDVMLGRGVDQILPHPGKPQLRERYMRDATGYVRLAERVNGRIPLPVDWRWPWGEALAVLENTATDV
CLINLETTITADGEFADRKPVCYRMHPDNVPALTALRPHVCALANNHILDFGYQGLTDTVAALAGAGIQSVGAGADLLAA
RRSALVTVGHERRVIVGSVAAESSGVPESWAARRDRPGVWLIRDPAQRDVADDVAAQVLADKRPGDIAIVSMHWGSNWGY
ATAPGDVAFAHRLIDAGIDMVHGHSSHHPRPIEIYRGKPILYGCGDVVDDYEGIGGHESFRSELRLLYLTVTDPASGNLI
SLQMLPLRVSRMRLQRASQTDTEWLRNTIERISRRFGIRVVTRPDNLLEVVPAANLTSKE
>P9WM79 ~~~~~~Probable polyglutamine synthesis accessory protein Rv0574c~~~COG2843
MAGNPDVVTVLLGGDVMLGRGVDQILPHPGKPQLRERYMRDATGYVRLAERVNGRIPLPVDWRWPWGEALAVLENTATDV
CLINLETTITADGEFADRKPVCYRMHPDNVPALTALRPHVCALANNHILDFGYQGLTDTVAALAGAGIQSVGAGADLLAA
RRSALVTVGHERRVIVGSVAAESSGVPESWAARRDRPGVWLIRDPAQRDVADDVAAQVLADKRPGDIAIVSMHWGSNWGY
ATAPGDVAFAHRLIDAGIDMVHGHSSHHPRPIEIYRGKPILYGCGDVVDDYEGIGGHESFRSELRLLYLTVTDPASGNLI
SLQMLPLRVSRMRLQRASQTDTEWLRNTIERISRRFGIRVVTRPDNLLEVVPAANLTSKE
>P9WGS7 1.-.-.-~~~~~~Uncharacterized NAD-dependent oxidoreductase Rv0687~~~COG1028
MSARGGSLHGRVAFVTGAARAQGRSHAVRLAREGADIVALDICAPVSGSVTYPPATSEDLGETVRAVEAEGRKVLAREVD
IRDDAELRRLVADGVEQFGRLDIVVANAGVLGWGRLWELTDEQWETVIGVNLTGTWRTLRATVPAMIDAGNGGSIVVVSS
SAGLKATPGNGHYAASKHALVALTNTLAIELGEFGIRVNSIHPYSVDTPMIEPEAMIQTFAKHPGYVHSFPPMPLQPKGF
MTPDEISDVVVWLAGDGSGALSGNQIPVDKGALKY
>P9WG17 ~~~~~~Uncharacterized ABC transporter permease Rv0072~~~COG0577
MLFAALRDMQWRKRRLVITIISTGLIFGMTLVLTGLANGFRVEARHTVDSMGVDVFVVRSGAAGPFLGSIPFPDVDLARV
AAEPGVMAAAPLGSVGTIMKEGTSTRNVTVFGAPEHGPGMPRVSEGRSPSKPDEVAASSTMGRHLGDTVEVGARRLRVVG
IVPNSTALAKIPNVFLTTEGLQKLAYNGQPNITSIGIIGMPRQLPEGYQTFDRVGAVNDLVRPLKVAVNSISIVAVLLWI
VAVLIVGSVVYLSALERLRDFAVFKAIGTPTRSIMAGLALQALVIALLAAVVGVVLAQVLAPLFPMIVAVPVGAYLALPV
AAIVIGLFASVAGLKRVVTVDPAQAFGGP
>P9WQK5 ~~~~~~Uncharacterized ABC transporter ATP-binding protein Rv0073~~~COG0664
MGDLSIQNLVVEYYSGGYALRPINGLNLDVAAGSLVMLLGPSGCGKTTLLSCLGGILRPKSGAIKFDEVDITTLQGAELA
NYRRNKVGIVFQAFNLVPSLTAVENVMVPLRSAGMSRRASRRRAEELLARVNLAERMNHRPGDLSGGQQQRVAVARAIAL
DPPLILADEPTAHLDFIQVEEVLRLIRELADGERVVVVATHDSRMLPMADRVVELTPDFAETNRPPETVHLQAGEVLFEQ
STMGDLIYVVSEGEFEIVHELADGGEELVKVAGPGDYFGEIGVLFHLPRSATVRARSDATAVGYTVQAFRERLGVGGLRD
LIEHRALAND
>O53803 ~~~~~~Protein Rv0740~~~
MTGPPRSYTGRRDLIAEKLEPYFQISAMLPKNTRPTSETAEEFWDNSLWCSWGDRETGYTRTVTVSICQVADGEREAEGV
RDMMRLECPAGLDLRTPNPEAYEITGQRPGEFVFVLGYLGHVRAIVGNCYIEIMPMGTRVELSKLADVALDIGRSVGCSA
YENDFTLPDIPTQWRNQPLGWYTQGLAPYLPGLSDPKDAAEG
>P71838 1.3.99.-~~~~~~KsdD-like steroid dehydrogenase Rv0785~~~COG3573
MALTCTDMSDAVAGSDAEGLTADAIVVGAGLAGLVAACELADRGLRVLILDQENRANVGGQAFWSFGGLFLVNSPEQRRL
GIRDSHELALQDWLGTAAFDRPEDYWPEQWAHAYVDFAAGEKRSWLRARGLKIFPLVGWAERGGYDAQGHGNSVPRFHIT
WGTGPALVDIFVRQLRDRPTVRFAHRHQVDKLIVEGNAVTGVRGTVLEPSDEPRGAPSSRKSVGKFEFRASAVIVASGGI
GGNHELVRKNWPRRMGRIPKQLLSGVPAHVDGRMIGIAQKAGAAVINPDRMWHYTEGITNYDPIWPRHGIRIIPGPSSLW
LDAAGKRLPVPLFPGFDTLGTLEYITKSGHDYTWFVLNAKIIEKEFALSGQEQNPDLTGRRLGQLLRSRAHAGPPGPVQA
FIDRGVDCVHANSLRELVAAMNELPDVVPLDYETVAAAVTARDREVVNKYSKDGQITAIRAARRYRGDRFGRVVAPHRLT
DPKAGPLIAVKLHILTRKTLGGIETDLDARVLKADGTPLAGLYAAGEVAGFGGGGVHGYRALEGTFLGGCIFSGRAAGRG
AAEDIR
>O86332 1.-.-.-~~~~~~Putative monooxygenase Rv0793~~~COG1359
MTSPVAVIARFMPRPDARSALRALLDAMITPTRAEDGCRSYDLYESADGGELVLFERYRSRIALDEHRGSPHYLNYRAQV
GELLTRPVAVTVLAPLDEASA
>Q92ZM6 1.5.1.-~~~~~~Probable flavin reductase~~~
MTAEVFDPRALRDAFGAFATGVTVVTASDAAGKPIGFTANSFTSVSLDPPLLLVCLAKSSRNYESMTSAGRFAINVLSET
QKDVSNTFARPVEDRFAAVDWRLGRDGCPIFSDVAAWFECSMQDIIEAGDHVIIIGRVTAFENSGLNGLGYARGGYFTPR
LAGKAVSAAVEGEIRLGAVLEQQGAVFLAGNETLSLPNCTVEGGDPARTLAAYLEQLTGLNVTIGFLYSVYEDKSDGRQN
IVYHALASDGAPRQGRFLRPAELAAAKFSSSATADIINRFVLESSIGNFGIYFGDETGGTVHPIANKDAHS
>Q9RY71 3.6.1.-~~~~~~Nudix hydrolase DR_0079~~~COG0494
MGGVSDERLDLVNERDEVVGQILRTDPALRWERVRVVNAFLRNSQGQLWIPRRSPSKSLFPNALDVSVGGAVQSGETYEE
AFRREAREELNVEIDALSWRPLASFSPFQTTLSSFMCVYELRSDATPIFNPNDISGGEWLTPEHLLARIAAGEAAKGDLA
ELVRRCYREEE
>O53871 2.3.1.-~~~fadA~~~Putative acyltransferase Rv0859~~~COG0183
MSEEAFIYEAIRTPRGKQKNGSLHEVKPLSLVVGLIDELRKRHPDLDENLISDVILGCVSPVGDQGGDIARAAVLASGMP
VTSGGVQLNRFCASGLEAVNTAAQKVRSGWDDLVLAGGVESMSRVPMGSDGGAMGLDPATNYDVMFVPQSIGADLIATIE
GFSREDVDAYALRSQQKAAEAWSGGYFAKSVVPVRDQNGLLILDHDEHMRPDTTKEGLAKLKPAFEGLAALGGFDDVALQ
KYHWVEKINHVHTGGNSSGIVDGAALVMIGSAAAGKLQGLTPRARIVATATSGADPVIMLTGPTPATRKVLDRAGLTVDD
IDLFELNEAFASVVLKFQKDLNIPDEKLNVNGGAIAMGHPLGATGAMILGTMVDELERRNARRALITLCIGGGMGVATII
ERV
>P9WM73 ~~~~~~Uncharacterized protein Rv0088~~~COG3832
MSVYKHAPSRVRLRQTRSTVVKGRSGSLSWRRVRTGDLGLAVWGGREEYRAVKPGTPGIQPKGDMMTVTVVDAGPGRVSR
SVEVAAPAAELFAIVADPRRHRELDGSGTVRGNIKVPAKLVVGSKFSTKMKLFGLPYRITSRVTALKPNELVEWSHPLGH
RWRWEFESLSPTLTRVTETFDYHAAGAIKNGLKFYEMTGFAKSNAAGIEATLAKLSDQYARGRA
>Q9K1M2 ~~~~~~Putative outer membrane protein NMB0088~~~
MTPSALKKTVLLLGTAFAAASVHASGYHFGTQSVNAQSTANAAAAEAADASTIFYNPAGLTKLDSSQISVNANIVLPSIH
YEADSATDFTGLPVQGSKSGKITKTTVAPHIYGAYKVNDNLTVGLGVYVPFGSATEYEKDSVLRHNINKLGLTSIAVEPV
AAWKLNDRHSFGAGIIAQHTSAELRKYADWGIKSKAEILTAKPPKPNGVAEAAKIQADGHADVKGSDWGFGYQLAWMWDI
NDRARVGVNYRSKVSHTLKGDAEWAADGAAAKAMWSTMLAANGYTANEKARVKIVTPESLSVHGMYKVSDKADLFGDVTW
TRHSRFDKAELVFEKEKTVVKGKSDRTTITPNWRNTYKVGFGGSYQISEPLQLRAGIAFDKSPVRNADYRMNSLPDGNRI
WFSAGMKYHIGKNHVVDAAYTHIHINDTSYRTAKASGNDVDSKGASSARFKNHADIIGLQYTYKFK
>P9WM71 ~~~~~~Uncharacterized protein Rv0090~~~COG3305
MAKNQNRIRNRWELITCGLGGHVTYAPDDAALAARLRASTGLGEVWRCLRCGDFALGGPQGRGAPEDAPLIMRGKALRQA
IIIRALGVERLVRALVLALAAWAVWEFRGARGAIQATLDRDLPVLRAAGFKVDQMTVIHALEKALAAKPSTLALITGMLA
AYAVLQAVEGVGLWLLKRWGEYFAVVATSIFLPLEVHDLAKGITTTRVVTFSINVAAVVYLLISKRLFGVRGGRKAYDVE
RRGEQLLDLERAAMLT
>Q9ABX9 ~~~~~~Uncharacterized signaling protein CC_0091~~~COG3300
MTLKERCVFKLLTCLTTQHDLRLVLVASAVCLAGCFTTFRLYSRMRGARGVVRAAWLLLTGLVAGSSVWATHFIAMVAFT
PGLKTGYSPTGTLLSLMIAALFMASGFAVASAQRSTTNDFAGGVLIGLGVAAMHYMGMSAFVTQGQLVWEHATVGMSAVL
GVGGATAALLLAGTARTIRRQAVGGGMLCLGIVMLHFTGMSAITIVPDASLTVPDQLLSGGMLTLAVGSITSMIILGGLG
AVAIESQTSRSALERIRRLANAAYEGLVVVQSGRINDANAAFCDLVGAPLAELVGRPLFGEILTFDEADPSREDVRREGR
LRPLVGGREIPVEVFSRLMDDGARVETSGLTVLAVRDLRERRAAEEKIRYLAEHDGLTGLLNRNSLQMRLAAAIDRVEAS
GESLAVICIDLDHFKEANDQHGHLAGDALLVETARRLQSAVQAPSFAARLGGDEFIVVQIAGGDQPAVAAELAGRLIEML
AAPVPFDGQELAMGSSLGVSLYPDDGRTAEALMANADMALYRAKESGRGVYRFFKREMDDTIRERRNLARDLRQGIADNE
LIVHYQPLARAADGEVCGFEALVRWKHPTRGMIPPLDFIPVAEENGLIEALGDWVLRRACADAAAWEKPLRIAVNLSPIQ
LHNPALPTLVHEVLITTGLSPSRLELEITESALFKDYQRALDNLRRLKALGVRIAMDDFGTGFSSLSTLQSFPFDKIKID
KSFVENIHRHDRATAIVRAVLGLGRSLEIPVVAEGVETEEQILFLRGEDCAELQGYAIGRPAPVDALTMWTTAGDPGAIA
PKSKTRRSA
>P9WM69 ~~~~~~Uncharacterized protein Rv0093c~~~COG5660
MLAQATTAGSFNHHASTVLQGCRGVPAAMWSEPAGAIRRHCATIDGMDCEVAREALSARLDGERAPVPSARVDEHLGECS
ACRAWFTQVASQAGDLRRLAESRPVVPPVGRLGIRRAPRRQHSPMTWRRWALLCVGIAQIALGTVQGFGLDVGLTHQHPT
GAGTHLLNESTSWSIALGVIMVGAALWPSAAAGLAGVLTAFVAILTGYVIVDALSGAVSTTRILTHLPVVIGAVLAIMVW
RSASGPRPRPDAVAAEPDIVLPDNASRGRRRGHLWPTDGSAA
>Q2G260 ~~~~~~Uncharacterized protein SAOUHSC_00094~~~
MKKLATVGSLIVTSTLVFSSMPFQNAHADTTSMNVSNKQSQNVQNHRPYGGVVPQGMTQAQYTELEKALPQLSAGSNMQD
YNMKLYDATQNIADKYNVIITTNVGVFKPHAVRDMNGHALPLTKDGNFYQTNVDANGVNHGGSEMVQNKTGHMSQQGHMN
QNTHEPTATHATRSYAIIKPSNDESKSKYAFIKSSNEPK
>Q57134 ~~~~~~Uncharacterized protein HI_1008~~~COG1555
MKTLFTSVVLCGALVVSSSFAEEKATXQTAQSVVTTQAEAQVAPAVVSDKLNINTATASEIQKSLTGIGAKKAEAIVQYR
EKHGNFXNAEQLLEVQGIGKATLEKNRDRIIF
>P54984 3.-.-.-~~~~~~Probable hydrolase sll0100~~~COG1473
MELKNLAQTLLPRLVEIRRHLHAHPELSGQEYQTAAYVAGVLSSCGLHVEEAIGKTGVVGQLSGKGDDPRLLAIRTDMDA
LPIEEMVSLPFASRHPGVMHACGHDIHTTLGLGTAMVLSQMGHRLPGDVRFLFQPAEEIAQGASWMIQDGAMKGVSHILG
VHVFPSIPAQQVGIRYGALTAAADDLEIFIQGESGHGARPHEAIDAIWIAAQVITALQQAISRTQNPLRPMVLSLGQISG
GRAPNVIADQVRMAGTVRSLHPETHAQLPQWIEGIVANVCQTYGAKYEVNYRRGVPSVQNDAQLNKLLENAVREAWGESA
LQIIPEPSLGAEDFALYLEHAPGAMFRLGTGFGDRQMNHPLHHPRFEADEAAILTGVVTLSYAAWQYWQNIAI
>Q81H14 ~~~~~~UPF0145 protein BC_1012~~~
MIVTTTSGIQGKEIIEYIDIVNGEAIMGANIVRDLFASVRDVVGGRAGSYESKLKEARDIAMDEMKELAKQKGANAIVGV
DVDYEVVRDGMLMVAVSGTAVRI
>O05598 6.2.1.-~~~~~~Putative ligase Rv1013~~~COG0318
MSRFTEKMFHNARTATTGMVTGEPHMPVRHTWGEVHERARCIAGGLAAAGVGLGDVVGVLAGFPVEIAPTAQALWMRGAS
LTMLHQPTPRTDLAVWAEDTMTVIGMIEAKAVIVSEPFLVAIPILEQKGMQVLTVADLLASDPIGPIEVGEDDLALMQLT
SGSTGSPKAVQITHRNIYSNAEAMFVGAQYDVDKDVMVSWLPCFHDMGMVGFLTIPMFFGAELVKVTPMDFLRDTLLWAK
LIDKYQGTMTAAPNFAYALLAKRLRRQAKPGDFDLSTLRFALSGAEPVEPADVEDLLDAGKPFGLRPSAILPAYGMAETT
LAVSFSECNAGLVVDEVDADLLAALRRAVPATKGNTRRLATLGPLLQDLEARIIDEQGDVMPARGVGVIELRGESLTPGY
LTMGGFIPAQDEHGWYDTGDLGYLTEEGHVVVCGRVKDVIIMAGRNIYPTDIERAAGRVDGVRPGCAVAVRLDAGHSRES
FAVAVESNAFEDPAEVRRIEHQVAHEVVAEVDVRPRNVVVLGPGTIPKTPSGKLRRANSVTLVT
>A4IM41 ~~~~~~Putative regulatory protein GTNG_1019~~~COG2052
MMMKFINIGYGNMVSAARIITIVSPDSAPIKRIIQDAREKGKLVDATHGRRTRAVIITDSDHVILSSVQPETVANRLYGS
DDFSEEG
>Q99UT4 2.3.1.-~~~~~~Uncharacterized N-acetyltransferase SA1019~~~
MSEIKRLEINYKTDELFENFRAFGNKDLYMVNELNGQMIDASSDSPFYGIFVGDQLGARMALLKKGDVEEIYFPDFEDYI
LLWKLEVLPKYQNRGYASELIDFAKSFNMPIKAIGRNDSKDFFLHHGFTDVEAKNIEGHDVLLWKP
>Q8DPT4 ~~~~~~DegV domain-containing protein spr1019~~~COG1307
MTKIKIVTDSSVTIEPELVKQLDITIVPLSVMIDNVVYSDADLKEEGKFLQLMQESKNLPKTSQPPVGVFAEIFEDLCKD
GGQILAIHMSHALSGTVEAARQGASLSTADVIVVDSSFTDQALKFQVVEAAKLAQEGKDMEAILSHVEEVKNHTELYIGV
STLENLVKGGRIGRVTGLLSSLLNIRVVMQMKDHELQPMVKGRGTKTFKKWLDELITSLSERAVAEIGISYSGSDDWAKE
MKESLQAYVEKPISVLETGSIIQTHTGENAWAILIRYHS
>Q9RVK2 3.6.1.61~~~~~~Nudix hydrolase DR_1025~~~COG1051
MEHDERTHVPVELRAAGVVLLNERGDILLVQEKGIPGHPEKAGLWHIPSGAVEDGENPQDAAVREACEETGLRVRPVKFL
GAYLGRFPDGVLILRHVWLAEPEPGQTLAPAFTDEIAEASFVSREDFAQLYAAGQIRMYQTKLFYADALREKGFPALPV
>P44992 ~~~~~~Uncharacterized protein HI_1028~~~COG1638
MKLFNFKKLSMLIAGFTLVTSPALAEISLRFGYEAPRSDSQHSAAKKFNDLLMKKTKGEIKLKLFPDSTLGNAQTMISSV
RGGTIDLEMSGSPNFTGLEPKLNVIDIPFIFKDREHVYKVLDGEVGQNLLKDLEKQGLKGLAFWDVGFRAFSNSKQTVTK
PEHIKGLKVRTNQNPMYIEAFKLLGSNPVPMPLAELYTALETRAVDAQEHPIGIFWSSKLYEVQKYLSLTNHGYTPLIVV
MNKAKFDSLLPALQTAIIEAAKEAGQFQRDLNVKNEQNIISKLRKQGVEVIEKINTEPFKTLIEEKVRKSFIEKHGDDLL
KKVDALSE
>Q97R12 2.1.1.-~~~~~~Uncharacterized RNA methyltransferase SP_1029~~~COG2265
MLKKNDIVEVEIVDLTHEGAGVAKVDGLVFFVENALPSEKILMRVLKVNKKIGFGKVEKYLVQSPHRNQDLDLAYLRSGI
ADLGHLSYPEQLKFKTKQVKDSLYKIAGIADVEVAETLGMEHPVKYRNKAQVPVRRVNGVLETGFFRKNSHNLMPLEDFF
IQDPVIDQVVVALRDLLRRFDLKPYDEKEQSGLIRNLVVRRGHYSGQIMVVLVTTRPKVFRVDQLIEQVIKQFPEIVSVM
QNINDQNTNAIFGKEWRTLYGQDYITDQMLGNDFQIAGPAFYQVNTEMAEKLYQTAIDFAELKKDDVIIDAYSGIGTIGL
SVAKHVKEVYGVELIPEAVENSQKNASLNKITNAHYVCDTAENAMKKWLKEGIQPTVILVDPPRKGLTESFIKASAQTGA
DRIAYISCNVATMARDIKLYQELGYELKKVQPVDLFPQTHHVETVALLSKLDVDKHISVEIELDEMDLTSAESKATYAQI
KEYVWNKFELKVSTLYIAQIKKKCGIELREHYNKSKKDKQIIPQCTPEKEEAIMDALRHFKMI
>P9WM63 ~~~~~~Uncharacterized protein Rv0102~~~COG3336
MGTHGATKSATSAVPTPRSNSMAMVRLAIGLLGVCAVVAAFGLVSGARRYAEAGNPYPGAFVSVAEPVGFFAASLAGALC
LGALIHVVMTAKPEPDGLIDAAAFRIHLLAERVSGLWLGLAATMVVIQAAHDTGVGPARLLASGALSDSVAASEMARGWI
VAAICALVVATALRLYTRWLGHVVLLVPTVLAVVATAVTGNPGQGPDHDYATSAAIVFAVAFATLTGLKIAAALAGTTPS
RAVLVTQVTCGALALAYGAMLLYLFIPGWAVDSDFARLGLLAGVILTSVWLFDCWRLLVRPPHAGRRRGGGSGAALAMMA
AMASIAAMAVMTAPRFLTHAFTAWDVFLGYELPQPPTIARVLTVWRFDSLIGAAGVVLAIGYAAGFAALRRRGNSWPVGR
LIAWLTGCAALVFTSGSGVRAYGSAMFSVHMAEHMTLNMFIPVLLVLGGPVTLALRVLPVTGDGRPPGAREWLTWLLHSR
VTTFLSHPITAFVLFVASPYIVYFTPLFDTFVRYHWGHEFMAIHFLVVGYLFYWAIIGIDPGPRRLPYPGRIGLLFAVMP
FHAFFGIALMTMSSTVGATFYRSVNLPWLSSIIADQHLGGGIAWSLTELPVIMVIVALVTQWARQDRRVASREDRHADSD
YADDELEAYNAMLRELSRMRR
>P44096 ~~~~~~UPF0234 protein HI_1034~~~COG1666
MPSFDIVSEITLHEVRNAVENANRVLSTRYDFRGVEAVIELNEKNETIKITTESDFQLEQLIEILIGSCIKRGIEHSSLD
IPAESEHHGKLYSKEIKLKQGIETEMAKKITKLVKDSKIKVQTQIQGEQVRVTGKSRDDLQAVIQLVKSAELGQPFQFNN
FRD
>P44515 ~~~~~~Uncharacterized protein HI_0103~~~COG1393
MITVYGIKNCDTVKKALKWLADHNIEHKLHDYRVDGLDLNFLTQAETQFGWDVLVNKRSTTWRNLDEQVKNSLDKTTALS
VLAENPTLIKRPIILQDGKALIGFNEKEYQAAFA
>Q97QZ6 2.7.1.-~~~~~~Putative lipid kinase SP_1045~~~COG1597
MKKAMVIINPTSGGEKALDYKEKLENKAKEYFEYVETKITEKALDATHFAEEASREQYDAVVVFGGDGTVNEVISGIDER
DYIPKLGIIPGGTGNLITKLLEINQDIDGAIDELDFDLTNKIDIGKANDNYFGYIFSIGSLPEAIHNVEIEDKTKFGILT
YAVNTMKSVMTDQVFNIKVETENGNYVGEASHVLVLLTNYFADKKIFEENKDGYANILILKDASIFSKLSVIPDLLKGDV
VANDNIEYIKARNIKISSDSELESDVDGDKSDNLPVEIKVLAQRVEVFSKPKED
>P44103 ~~~~~~Uncharacterized protein HI_1048~~~COG1305
MKKLIAVAVLSACGSLAHANTNIPNYNTDAHLYEFTQTYDLVVPKGSQGQTNLWVPLPFNGEYQQVKSIHFEGNYMNAYV
TENNKYGAKTLFATWNKDAQKRDLKVMMVIETKDREPMVKGALENYTPPKDIQYSVDVQEYLKATPHIKTDGIVKEFPDK
ILGKETNPLKKAELIHHWFVKNMERDNSVLGCGDGDVEKILTTGVLKGKCTDINSVFVALARAAGIPAREIFGIRLGAAE
KMGKYSKGAFGSANEQGIANVSGGQHCRAEFYLAGFGWVPVDSADVAKMRLAEKKSVEDKDTQAVAKYLFGNWEANWVGF
NHARDFDLYPQPELAPINNFGYPYAEVGGDPLNSFDPKEFKYDYVSKKL
>Q97QZ0 ~~~~~~Uncharacterized protein SP_1052~~~COG0613
MRGFNNKIKSVYQELTNSKEKFGSFHKTLIHLHTPVSYDYKLFSNWTATKYRKITEDELYDIFFENKKIKVDKTIFFSNF
DKVVFSSSKEYISFLMLAEAIIKNGIEIVVVTDHNTTKGIKKLQMAVSIIMKNYPIYDIHPHILHGVEISAADKLHIVCI
YDYEQESWVNQWLSENIISEKDGSYQHSLTIMKDFNNQKIVNYIAHFNSYDILKKGSHLSGAYKRKIFSKENTRFWSLIL
TRKNLRNNLIFSIKKLVY
>Q97QY9 ~~~~~~Uncharacterized protein SP_1053~~~COG1196
MGQKVVAMLDFLLAYSDYSKDFRPLIIDQPEDNLDNRYIYRHLVQQFRDVKAQRQIILATHNATIVTNSMTDQVVIMESD
GVNGWIESQGYVSEKYIKNHIINQLEGGKDSFKHKMSIYETALSE
>O53405 ~~~~~~Seven-bladed beta-propeller protein Rv1057~~~COG3391
MSVMNGREVARESRDAQVFEFGTAPGSAVVKIPVQGGPIGGIAISRDGSLLVVTNNGTDTVSVVGTDTCRVTQTVTSVNE
PFAIAMGNAEANRAYVSTVSSAYDAIAVIDVATNTVLGTHPLALSVSDLTLSPDDKYLYVSRNGTRGADVAVLDTTTGAL
IDVVDVSQAPGTTTQCVRMSPDGSVLYVGANGPSGGLLVVITTRAQSDGGRIGSRSRSRQKSSKPRGNQAAAGLRVVATI
DIGSSVRDVALSPDGAIAYVASCGSDFGAVVDVIDTRTHQITSSRAISEIGGLVTRVSVSGDADRAYLVSEDRVTVLCTR
THDVIGTIRTGQPSCVVESPDGKYLYIADYSGTITRTAVASTIVSGTEQLALQRRGSMQWFSPELQQYAPALA
>P9WIY9 ~~~~~~Uncharacterized NTE family protein Rv1063c~~~COG1752
MPAPAALRVRGSSSPRVALALGSGGARGYAHIGVIQALRERGYDIVGIAGSSMGAVVGGVHAAGRLDEFAHWAKSLTQRT
ILRLLDPSISAAGILRAEKILDAVRDIVGPVAIEQLPIPYTAVATDLLAGKSVWFQRGPLDAAIRASIAIPGVIAPHEVD
GRLLADGGILDPLPMAPIAGVNADLTIAVSLNGSEAGPARDAEPNVTAEWLNRMVRSTSALFDVSAARSLLDRPTARAVL
SRFGAAAAESDSWSQAPEIEQRPAGPPADREEAADTPGLPKMGSFEVMNRTIDIAQSALARHTLAGYPADLLIEVPRSTC
RSLEFHRAVEVIAVGRALATQALEAFEIDDDESAAATIEG
>Q7A5Z4 ~~~~~~Uncharacterized protein SA1069~~~
MISKINGKLFADMIIQGAQNLSNNADLVDSLNVYPVPDGDTGTNMNLTMTSGREEVENNLSKNIGELGKTFSKGLLMGAR
GNSGVILSQLFRGFCKNIESESEINSKLLAESFQAGVETAYKAVMKPVEGTILTVAKDAAQAAIEKANNTEDCIELMEYI
IVKANESLENTPNLLAVLKEVGVVDSGGKGLLCVYEGFLKALKGEKVEAKVAKIDKDEFVHDEHDFHGVINTEDIIYGYC
TEMMVRFGKNKKAFDEQEFRQDMSQFGDSLLVINDEEIVKVHVHTEYPGKVFNYGQQYGELIKLKVENMREQHREVIRKE
QHTAKPKMETVETAIITISMGEGISEIFKSMGATHIISGGQTMNPSTEDIVKVIEQSKCKRAIILPNNKNILMASEQAAS
IVDAEAVVIPTKSIPQGISALFQYDVDATLEENKAQMADSVNNVKSGSLTYAVRDTKIDGVEIKKDAFMGLIEDKIVSSQ
SDQLTTVTELLNEMLAEDSEILTVIIGQDAEQAVTDNMINWIEEQYPDVEVEVHEGGQPIYQYFFSVE
>P9WPI5 3.6.5.-~~~~~~Zinc chaperone Rv0106~~~COG0523
MRTPVILVAGQDHTDEVTGALLRRTGTVVVEHRFDGHVVRRMTATLSRGELITTEDALEFAHGCVSCTIRDDLLVLLRRL
HRRDNVGRIVVHLAPWLEPQPICWAIDHVRVCVGHGYPDGPAALDVRVAAVVTCVDCVRWLPQSLGEDELPDGRTVAQVT
VGQAEFADLLVLTHPEPVAVAVLRRLAPRARITGGVDRVELALAHLDDNSRRGRTDTPHTPLLAGLPPLAADGEVAIVEF
SARRPFHPQRLHAAVDLLLDGVVRTRGRLWLANRPDQVMWLESAGGGLRVASAGKWLAAMAASEVAYVDLERRLFADLMW
VYPFGDRHTAMTVLVCGADPTDIVNALNAALLSDDEMASPQRWQSYVDPFGDWHDDPCHEMPDAAGEFSAHRNSGESR
>Q9RVF9 3.4.21.-~~~~~~Uncharacterized peptidase DR_1070~~~COG3340
MRLLLTSFQHPSMAQFIGGKRVAYIPDAARSYADAPFVQKEREGLEKQGLELINLPLSHTDLAAVETTLNAVDGVYVAGG
ETFDLLQVLRSTGSDKVITRRVRQGLPYIGCSAGSVVAGPTIEAVSLMDSPDIAPDLKDYTGLGLTELAVIPHASGSISQ
FPIETIADTVRTYGERWPLCLLRDGQALWIEDGEVRLLN
>P9WP51 4.2.1.22~~~cbs~~~Probable cystathionine beta-synthase Rv1077~~~COG0031
MRIAQHISELIGGTPLVRLNSVVPDGAGTVAAKVEYLNPGGSSKDRIAVKMIEAAEASGQLKPGGTIVEPTSGNTGVGLA
LVAQRRGYKCVFVCPDKVSEDKRNVLIAYGAEVVVCPTAVPPHDPASYYSVSDRLVRDIDGAWKPDQYANPEGPASHYVT
TGPEIWADTEGKVTHFVAGIGTGGTITGAGRYLKEVSGGRVRIVGADPEGSVYSGGAGRPYLVEGVGEDFWPAAYDPSVP
DEIIAVSDSDSFDMTRRLAREEAMLVGGSCGMAVVAALKVAEEAGPDALIVVLLPDGGRGYMSKIFNDAWMSSYGFLRSR
LDGSTEQSTVGDVLRRKSGALPALVHTHPSETVRDAIGILREYGVSQMPVVGAEPPVMAGEVAGSVSERELLSAVFEGRA
KLADAVSAHMSPPLRMIGAGELVSAAGKALRDWDALMVVEEGKPVGVITRYDLLGFLSEGAGRR
>P0DV55 ~~~~~~Protein Rv1078A~~~
MSHLPLHHPAAVVTLRPLRPRAAVATRLRRHRLAGGPTRRLRRRPAVTRRRRPDRRFVRCRPSPTRRGLPGCWRHSSTGP
HT
>Q57017 ~~~~~~UPF0053 protein HI_0107~~~COG4536
MDSIPLSTLFIILIICLVLSAYFSGSETGLLSLNKYRLRFLSEQGNKGAKKAEKLLEKPDTLLSFILIFNNLVNISASAI
ATVIGMRLYGDAGVAIATGLLTFVMLVFSEIFPKTVAAMHAEKVSFFSSHILTSLLKIFYPLVWLMNIFTKSLMQIVGLK
LDMQKQVISSEELRSIVSEAGEATPNEQHPQMLLSILDMETVTVDDIMVPRNEIGGINIDDDWRAIMRQLNHAAHNRVVL
YKGSLDEQVLGILRVREAFRLLLEKNEFTKETLIRAADEVYFIPESTPLKTQLANFRTNKERIGLVVDEYGDIKGLVTLE
DILEEIVGDFTTSTAPSIDKEVIQQSDGSMIIDGSANLRDLNKMFNWELDTEDARTFNGLILEHLEEIPDEGTICEIDGL
LITILEVGDNMIKQAKVVKL
>P45024 ~~~~~~Probable amino-acid ABC transporter-binding protein HI_1080~~~COG0834
MKKLLFTTALLTGAIAFSTFSHAGEIADRVEKTKTLLVGTEGTYAPFTFHDKSGKLTGFDVEVIRKVAEKLGLKVEFKET
QWDAMYAGLNAKRFDVIANQTNPSPERLKKYSFTTPYNYSGGVIVTKSSDNSIKSFEDLKGRKSAQSATSNWGKDAKAAG
AQILVVDGLAQSLELIKQGRAEATINDKLAVLDYFKQHPNSGLKIAYDRGDKTPTAFAFLQGEDALITKFNQVLEALRQD
GTLKQISIEWFGYDITQ
>Q9X0H0 ~~~~~~Putative anti-sigma factor antagonist TM_1081~~~COG1366
MFPYKIVDDVVILMPNKELNIENAHLFKKWVFDEFLNKGYNKIFLVLSDVESIDSFSLGVIVNILKSISSSGGFFALVSP
NEKVERVLSLTNLDRIVKIYDTISEAMEEVRRK
>P45026 ~~~~~~Uncharacterized protein HI_1082~~~COG5007
MELQKIEQILKDTLNIAEVYAQGENAHFGVIVVSDEIAALSRVKQQQTIYAPLMPYFSTGEIHALTIKTYTVEKWKRDRA
LNQFN
>Q7DDI1 ~~~~~~UPF0339 protein NMB1088~~~
MYFEIYKDAKGEYRWRLKAANHEIIAQGEGYTSKQNCQHAVDLLKSTTAATPVKEV
>Q97QW0 ~~~~~~UPF0758 protein SP_1088~~~COG2003
MYSISFQEDSLLPRERLAKEGVEALSNQELLAILLRTGTRQASVFEIAQKVLNNLSSLTDLKKMTLQELQSLSGIGRVKA
IELQAMIELGHRIHKHETLEMESILSSQKLAKKMQQELGDKKQEHLVALYLNTQNQIIHQQTIFIGSVTRSIAEPREILH
YAIKHMATSLILVHNHPSGAVAPSQNDDHVTKLVKEACELMGIVLLDHLIVSHSNYFSYREKTDLI
>P9WFM3 ~~~~~~Putative transport protein Rv1101c~~~COG0628
MNTEFTLTQKRALAILTLIALLFGAYFLRNYFVLIVVAAVGAYLFTPLFKWFTKRFNTGLSAACTLLSALAAVVVPVGAL
VGLAIVQIARMVDSVADWVRTTDLSTLGDKILQFVNGLFDRVPFLHITVTADALRKAMISVAQNVGEWLLHFLRDAAGSL
AGVITSAIIFVYVFVALLVNREKLRTLIGQLNPLGEDVTDLYLQKMGSMVRGTVNGQFVIAACQGVAGAASIYIAGFHHG
FFIFAIVLTALSIIPLGGGIVTIPFGIGMIFYGNIAGGIFVLLWHLLVVTNIDNVLRPILVPRDARLNSALMLLSVFAGI
TMFGPWGIIIGPVLMILIVTTIDVYLAVYKGVELEQFEAPPVRRRWLPRRGPATSRNAPPPSTAE
>P67266 ~~~~~~Nucleoid-associated protein SP_1102~~~COG0718
MMNMQNMMRQAQKLQKQMEQSQAELAAMQFVGKSAQDLVQATLTGDKKVVSIDFNPAVVDPEDLETLSDMTVQAINSALE
QIDETTKKKLGAFAGKLPF
>P63333 3.4.24.-~~~~~~Putative zinc metalloprotease SA1105~~~
MSYLVTIIAFIIVFGVLVTVHEYGHMFFAKRAGIMCPEFAIGMGPKIFSFRKNETLYTIRLLPVGGYVRMAGDGLEEPPV
EPGMNVKIKLNEENEITHIILDDHHKFQQIEAIEVKKCDFKDDLFIEGITAYDNERHHFKIARKSFFVENGSLVQIAPRD
RQFAHKKPWPKFLTLFAGPLFNFILALVLFIGLAYYQGTPTSTVEQVADKYPAQQAGLQKGDKIVQIGKYKISEFDDVDK
ALDKVKDNKTTVKFERDGKTKSVELTPKKTERKLTKVSSETKYVLGFQPASEHTLFKPIVYGFESFLKGSTLIFTAVVGM
LASIFTGGFSFDMLNGPVGIYHNVDSVVKAGIISLIGYTALLSVNLGIMNLIPIPALDGGRILFVIYEAIFRKPVNKKAE
TTIIAIGAIFMVVIMILVTWNDIRRYFL
>P9WM59 ~~~~~~Uncharacterized protein Rv1109c~~~
MATAPYGVRLLVGAATVAVEETMKLPRTILMYPMTLASQAAHVVMRFQQGLAELVIKGDNTLETLFPPKDEKPEWATFDE
DLPDALEGTSIPLLGLSDASEAKNDDRRSDGRFALYSVSDTPETTTASRSADRSTNPKTAKHPKSAAKPTVPTPAVAAEL
DYPALTLAQLRARLHTLDVPELEALLAYEQATKARAPFQTLLANRITRATAK
>Q97QT7 ~~~~~~DegV domain-containing protein SP_1112~~~COG1307
MTKIKIVTDSSVTIEPELVKQLDITIVPLSVMIDNVVYSDADLKEEGKFLQLMQESKNLPKTSQPPVGVFAEIFEDLCKD
GGQILAIHMSHALSGTVEAARQGASLSTADVTVVDSSFTDQALKFQVVEAAKLAQEGKDMEAILSHVEEVKNHTELYIGV
STLENLVKGGRISRVTGLLSSLLNIRAVMQMKDHELQPMVKGRGTKTFKKWLDELITSLSERAVAEIGISYSGSDDWAKE
MKESLQAYVEKPISVLETGSIIQTHTGENAWAILIRYHS
>Q7DDE8 ~~~~~~Putative lipoprotein NMB1124/NMB1162~~~
MKPLILGLAAVLALSACQVQKAPDFDYTSFKESKPASILVVPPLNESPDVNGTWGVLASTAAPLSEAGYYVFPAAVVEET
FKQNGLTNAADIHAVRPEKLHQIFGNDAVLYITVTEYGTSYQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSN
SGLLGALVSAVVNQIANSLTDRGYQVSKTAAYNLLSPYSHNGILKGPRFVEEQPK
>Q7DDH4 ~~~~~~Putative lipoprotein NMB1126/NMB1164~~~
MKTVSTAVVLAAAAVSLTGCATESSRSLEVEKVASYNTQYHGVRTPISVGTFDNRSSFQKGIFSDGEDRLGSQAKTILVT
HLQQTNRFNVLNRTNLNALKQESGISGKAHNLKGADYVVTGDVTEFGRRDVGDHQLFGILGRGKSQIAYAKVALNIVNVN
TSEIVYSAQGAGEYALSNREIIGFGGTSGYDATLNGKVLDLAIREAVNSLVQAVDNGAWQPNR
>P9WGQ7 1.-.-.-~~~~~~Uncharacterized oxidoreductase Rv1144~~~COG1028
MKTKDAVAVVTGGASGLGLATTKRLLDAGAQVVVVDLRGDDVVGGLGDRARFAQADVTDEAAVSNALELADSLGPVRVVV
NCAGTGNAIRVLSRDGVFPLAAFRKIVDINLVGTFNVLRLGAERIAKTEPIGEERGVIINTASVAAFDGQIGQAAYSASK
GGVVGMTLPIARDLASKLIRVVTIAPGLFDTPLLASLPAEAKASLGQQVPHPSRLGNPDEYGALVLHIIENPMLNGEVIR
LDGAIRMAPR
>P9WM55 ~~~~~~Uncharacterized protein Rv1148c~~~COG1403
MRSDTREEISAALDAYHASLSRVLDLKCDALTTPELLACLQRLEVERRRQGAAEHALINQLAGQACEEELGGTLRTALAN
RLHITPGEASRRIAEAEDLGERRALTGEPLPAQLTATAAAQREGKIGREHIKEIQAFFKELSAAVDLGIREAAEAQLAEL
ATSRRPDHLHGLATQLMDWLHPDGNFSDQERARKRGITMGKQEFDGMSRISGLLTPELRATIEAVLAKLAAPGACNPDDQ
TPLVDDTPDADAVRRDTRSQAQRNHDAFLAALRGLLASGELGQHKGLPVTIVVSTTLKELEAATGKGVTGGGSRVPMSDL
IRMASHANHYLALFDGAKPLALYHTKRLASPAQRIMLYAKDRGCSRPGCDAPAYHSEVHHVTPWTTTHRTDINDLTLACG
PDNRLVEKGWKTRKNAHGDTEWLPPPHLDHGQPRINRYHHPAKILCEQDDDEPH
>P43786 ~~~~~~Uncharacterized protein HI_1159~~~COG3118
MIAAQFRIQALPTTYLFKEAQALDAFPSVLDKSSLIQRLSIILPKEEDLKFQQALDFLQVENYEAALPLLKDAWELSDKK
NSDVALLYAETYIAMKKTEPAQEILNQIPLQDRDSRWHGLQAQIELQIQAADTPEIQQLQADYAKNPTAEIAIKLAVQLH
QAGRNEEALTLLFGILKTDLGAQNGEVKQQFLSILSAMGNADPLTNKFRRLLYSLLY
>P45083 3.1.2.-~~~~~~Putative esterase HI_1161~~~COG2050
MLWKKTFTLENLNQLCSNSAVSHLGIEISAFGEDWIEATMPVDHRTMQPFGVLHGGVSVALAETIGSLAGSLCLEEGKTV
VGLDINANHLRPVRSGKVTARATPINLGRNIQVWQIDIRTEENKLCCVSRLTLSVINL
>P44116 ~~~~~~Uncharacterized protein HI_1162~~~COG2852
MITCLPLLAGEGARRKGDIGMRNKNKRLAQYATELRRNMTDAEYALWYHLRNKLFCGIRFNRQVIIGHYIVDFCSRKLKL
VIELDGIQHVEQEQYDLERTKFLTAQGYKVIRFWNDEVLKNIDNVLEAIYVEIEHLSPPHFGSSPHKWVEPRLEVLK
>Q9X0P5 ~~~~~~UPF0173 metal-dependent hydrolase TM_1162~~~COG2220
MKVTFLGHAVVLIEGKKNIIIDPFISGNPVCPVKLEGLPKIDYILVTHGHGDHLGDAVEIAKKNDATVISNYEICHYLGK
KGVKTHAMHIGGSYLFDFGRVKMTPAVHGSGILDGDSMIYGGNPSGFLITIEGKKIYHAGDTGLTREMELLAEENVDVAF
LPIGGNFVMDVEDAVRAAVMIKPKKVVPMHYGTWELIFADVELFKKKVEEKGVECVILEPGESLEL
>A0QRM0 ~~~~~~UPF0234 protein MSMEG_1165/MSMEI_1134~~~COG1666
MADSSFDVVSKVDRQEVDNALNQAAKELATRFDFRGTDTTIAWKGDEVIELTSSTEERVKAAVDVFKEKLVRRDISMKAF
DAEDPQPSGKTYKVTGTIKQGITSEQAKKITKLIRDEGPKGVKAQIQGDEIRVSSKKRDDLQAVIALLKGADLDVALQFV
NYR
>P44117 ~~~~~~UPF0265 protein HI_1168~~~COG2926
MEIVNKQSFQDVLEYVRMYRLKNRIKRDMEDNNRKIRDNQKRILLLDNLNQYIRDDMTIAEVRGIIESMRDDYESRVDDY
TIRNAELSKQRREASTKMKEQKKAHAELLKNAEK
>Q9ZJY0 ~~~~~~Protein jhp_1168~~~COG1466
MYRKDLDNYLKQRLPKAVFLYGEFDFFIHYYIQTISALFKGNNPDTETSLFYASDYEKSQIATLLEQDSLFGGSSLVILK
LDFALHKKFKENDINPFLKALERPSHNRLIIGLYNAKSDTTKYKYTSEIIVKFFQKSPLKDEAICVRFFTPKAWESLKFL
QERANFLHLDISGHLLNALFEINNEDLSVSFNDLDKLAVLNAPITLEDIQELSSNAGDMDLQKLILGLFLKKSVLDIYDY
LLKEGKKDADILRGLERYFYQLFLFFAHIKTTGLMDAKEVLGYAPPKEIVENYAKNALRLKEAGYKRVFEIFRLWHLQSM
QGQKELGFLYLTPIQKIINP
>A4QD57 ~~~~~~Uncharacterized peptidase cgR_1176~~~
MSSASFTTKALSVLAALTAASAPLVAASPAHALANARNVTGSSTTSDSIVRLHIGNTACTGTMITPTWAITARHCIPEDG
IAGAAIGSSTLSQFQQVSQAILHPTADLALVELPNQASSNTVDLYGAHVQPGENGQAAGWGGYSAFGQNVAQQADVQIQR
RVVNVPSPDRTAVLLEGTVSNGRLVPGDSGGPLYINGQLAGVLSMSTDVENDALDGTVGWYIPVAEHAEWIAYYTGKHIA
PIAGAPAELVDATANPTFIPAPQPFTGSSIGGWALGSS
>P60076 ~~~~~~UPF0291 protein SA1176~~~
MSNSDLNIERINELAKKKKEVGLTQEEAKEQTALRKAYLESFRKGFKQQIENTKVIDPEGNDVTPEKIKEIQQKRDNKN
>O50434 2.6.1.-~~~~~~Probable aminotransferase Rv1178~~~COG0436
MEPLHPDQRRFLRRAGIAGRCGQGWHDRERPASGQGSGAAERGRLSRLGAAPARGGVSASLPVFPWDTLADAKALAGAHP
DGIVDLSVGTPVDPVAPLIQEALAAASAAPGYPATAGTARLRESVVAALARRYGITRLTEAAVLPVIGTKELIAWLPTLL
GLGGADLVVVPELAYPTYDVGARLAGTRVLRADALTQLGPQSPALLYLNSPSNPTGRVLGVDHLRKVVEWARGRGVLVVS
DECYLGLGWDAEPVSVLHPSVCDGDHTGLLAVHSLSKSSSLAGYRAGFVVGDLEIVAELLAVRKHAGMMVPAPVQAAMVA
ALDDDAHERQQRERYAQRRAALLPALGSAGFAVDYSDAGLYLWATRGEPCRDSAAWLAQRGILVAPGDFYGPGGAQHVRV
ALTATDERVAAAVGRLTC
>P67291 ~~~~~~UPF0154 protein SA1178~~~
MATWLAIIFIVAALILGLIGGFLLARKYMMDYLKKNPPINEEMLRMMMMQMGQKPSQKKINQMMTMMNKNMDQNMKSAKK
>Q9RV46 3.6.1.-~~~~~~Nudix hydrolase DR_1184~~~COG0494
MTAPHDPLDDIQADPWALWLSGRTRTALELPHYRRAAVLVALTREADPRVLLTVRSSELPTHKGQIAFPGGSLDAGETPT
QAALREAQEEVALDPAAVTLLGELDDVFTPVGFHVTPVLGRIAPEALDTLRVTPEVAQIITPTLAELRAVPLVRERRTLP
DGTEVPLYRYPWRGLDIWGMTARVLHDLLEQGPG
>Q7A5S4 ~~~~~~Uncharacterized protein SA1186~~~
MQIELTDAAVTWFKNELELPENNKVLVFFVRYGGEFQLKQGFSPAFTVEPKEDVDIGYEQQYDDLNVVVAEKDLWYFEDD
HIIVNVVDHEDEISYSTK
>Q2FZ58 ~~~~~~Uncharacterized protein SAOUHSC_01193~~~COG1461
MISKINGKLFADMIIQGAQNLSNNADLVDSLNVYPVPDGDTGTNMNLTMTSGREEVENNLSKNIGELGKTFSKGLLMGAR
GNSGVILSQLFRGFCKNIESESEINSKLLAESFQAGVETAYKAVMKPVEGTILTVAKDAAQAAIEKANNTEDCIELMEYI
IVKANESLENTPNLLAVLKEVGVVDSGGKGLLCVYEGFLKALKGEKVEAKVAKIDKDEFVHDEHDFHGVINTEDIIYGYC
TEMMVRFGKNKKAFDEQEFRQDMSQFGDSLLVINDEEIVKVHVHTEYPGKVFNYGQQYGELIKLKVENMREQHREVIRKE
QHTAKPKMETVETAIITISMGEGISEIFKSMGATHIISGGQTMNPSTEDIVKVIEQSKCKRAIILPNNKNILMASEQAAS
IVDAEAVVIPTKSIPQGISALFQYDVDATLEENKAQMADSVNNVKSGSLTYAVRDTKIDGVEIKKDAFMGLIEDKIVSSQ
SDQLTTVTELLNEMLAEDSEILTVIIGQDAEQAVTDNMINWIEEQYPDVEVEVHEGGQPIYQYFFSVE
>P99132 5.3.2.-~~~~~~Probable tautomerase SA1195.1~~~
MPIVNVKLLEGRSDEQLKNLVSEVTDAVEKTTGANRQAIHVVIEEMKPNHYGVAGVRKSDQ
>P67253 ~~~~~~UPF0122 protein SPy_1201/M5005_Spy0916~~~
MNIMEIEKTNRMNALFEFYAALLTDKQMNYIELYYADDYSLAEIADEFGVSRQAVYDNIKRTEKILETYEMKLHMYSDYV
VRSEIFDDMIAHYPHDEYLQEKISILTSIDNRE
>Q836B3 ~~~~~~UPF0297 protein EF_1202~~~COG4472
MGFTDETVRFDFDDNRKKAISETLETVYRALEEKGYNPINQIVGYLLSGDPAYIPRYQDARNLIRRHERDEIMEELTKYY
LANHGIDIK
>P45114 ~~~~~~Probable TonB-dependent receptor HI_1217~~~COG1629
MKKAIKLNLITLGLINTIGMTITQAQAEETLGQIDVVEKVISNDKKPFTEAKAKSTRENVFKETQTIDQVIRSIPGAFTQ
QDKGSGVVSVNIRGENGLGRVNTMVDGVTQTFYSTALDSGQSGGSSQFGAAIDPNFIAGVDVNKSNFSGASGINALAGSA
NFRTLGVNDVITDDKPFGIILKGMTGSNATKSNFMTMAAGRKWLDNGGYVGVVYGYSQREVSQDYRIGGGERLASLGQDI
LAKEKEAYFRNAGYILNPEGQWTPDLSKKHWSCNKPDYQKNGDCSYYRIGSAAKTRREILQELLTNGKKPKDIEKLQKGN
DGIEETDKSFERNKDQYSVAPIEPGSLQSRSRSHLLKFEYGDDHQNLGAQLRTLDNKIGSRKIENRNYQVNYNFNNNSYL
DLNLMAAHNIGKTIYPKGGFFAGWQVADKLITKNVANIVDINNSHTFLLPKEIDLKTTLGFNYFTNEYSKNRFPEELSLF
YNDASHDQGLYSHSKRGRYSGTKSLLPQRSVILQPSGKQKFKTVYFDTALSKGIYHLNYSVNFTHYAFNGEYVGYENTAG
QQINEPILHKSGHKKAFNHSATLSAELSDYFMPFFTYSRTHRMPNIQEMFFSQVSNAGVNTALKPEQSDTYQLGFNTYKK
GLFTQDDVLGVKLVGYRSFIKNYIHNVYGVWWRDGMPTWAESNGFKYTIAHQNYKPIVKKSGVELEINYDMGRFFANVSY
AYQRTNQPTNYADASPRPNNASQEDILKQGYGLSRVSMLPKDYGRLELGTRWFDQKLTLGLAARYYGKSKRATIEEEYIN
GSRFKKNTLRRENYYAVKKTEDIKKQPIILDLHVSYEPIKDLIIKAEVQNLLDKRYVDPLDAGNDAASQRYYSSLNNSIE
CAQDSSACGGSDKTVLYNFARGRTYILSLNYKF
>P9WJZ7 2.1.1.-~~~~~~Probable O-methyltransferase Rv1220c~~~COG4122
MDGTPGHDDMPGQPAPSRGESLWAHAEGSISEDVILAGARERATDIGAGAVTPAVGALLCLLAKLSGGKAVAEVGTGAGV
SGLWLLSGMRDDGVLTTIDIEPEHLRLARQAFAEAGIGPSRTRLISGRAQEVLTRLADASYDLVFIDADPIDQPDYVAEG
VRLLRSGGVIVVHRAALGGRAGDPGARDAEVIAVREAARLIAEDERLTPALVPLGDGVLAAVRD
>P67248 ~~~~~~UPF0122 protein SAV1236~~~
MGQNDLVKTLRMNYLFDFYQSLLTNKQRNYLELFYLEDYSLSEIADTFNVSRQAVYDNIRRTGDLVEDYEKKLELYQKFE
QRREIYDEMKQHLSNPEQIQRYIQQLEDLE
>O25842 ~~~~~~Protein HP_1247~~~COG1466
MYRKDLDHYLKQRLPKAVFLYGEFDFFIHYYIQTISALFKCDNPDIETSLFYASDYEKSQIATLLEQDSLFGGSSLVVLK
LDFALHKKFKENDINLFLKALERPSHNRLIIGLYNAKSDTTKYKYTSDAIVKFFQKSPLKDEAICARFFIPKTWESLKFL
QERANFLHLDISGHLLNALFEINNEDLGVSFNDLDKLAVLNAPITLEDIQELSSNAGDMDLQKLILGLFLKKSALDIYDY
LLKEGKKDADILRGLERYFYQLFLFFAHIKTTGLMDAKEVLGYAPPKEIAENYAKNALRLKEAGYKRVFEIFRLWHIQSM
QGQKELGFLYLTSIQKIINP
>P9WMD5 ~~~~~~Uncharacterized HTH-type transcriptional regulator Rv1255c~~~COG1309
MAGTDWLSARRTELAADRILDAAERLFTQRDPASIGMNEIAKAAGCSRATLYRYFDSREALRTAYVHRETRRLGREIMVK
IADVVEPAERLLVSITTTLRMVRDNPALAAWFTTTRPPIGGEMAGRSEVIAALAAAFLNSLGPDDPTTVERRARWVVRML
TSLLMFPGRDEADERAMIAEFVVPIVTPASAAARKAGHPGPE
>P67371 ~~~~~~DegV domain-containing protein SA1258~~~
MTKQIIVTDSTSDLSKEYLEANNIHVIPLSLTIEGASYVDQVDITSEEFINHIENDEDVKTSQPAIGEFISAYEELGKDG
SEIISIHLSSGLSGTYNTAYQASQMVDANVTVIDSKSISFSLGYQIQHLVELVKEGVSTSEIVKKLNHLRENIKLFVVIG
QLNQLIKGGRISKTKGLIGNLMKIKPIGTLDDGRLELVHNARTQNSSIQYLKKEIAEFIGDHEIKSIGVAHANVIEYVDK
LKKVFNEAFHVNNYDINVTTPVISAHTGQGAIGLVVLKK
>P9WM51 ~~~~~~Uncharacterized protein Rv1260~~~COG0654
MKTVVVSGASVAGTAAAYWLGRHGYSVTMVERHPGLRPGGQAIDVRGPALDVLERMGLLAAAQEHKTRIRGASFVDRDGN
ELFRDTESTPTGGPVNSPDIELLRDDLVELLYGATQPSVEYLFDDSISTLQDDGDSVRVTFERAAAREFDLVIGADGLHS
NVRRLVFGPEEQFVKRLGTHAAIFTVPNFLELDYWQTWHYGDSTMAGVYSARNNTEARAALAFMDTELRIDYRDTEAQFA
ELQRRMAEDGWVRAQLLHYMRSAPDFYFDEMSQILMDRWSRGRVALVGDAGYCCSPLSGQGTSVALLGAYILAGELKAAG
DDYQLGFANYHAEFHGFVERNQWLVSDNIPGGAPIPQEEFERIVHSITIKDY
>Q7A5M6 ~~~~~~UPF0403 protein SA1261~~~
MNAYDAYMKEIAQQMRGELTQNGFTSLETSEAVSEYMNQVNADDTTFVVINSTCGCAAGLARPAAVAVATQNEHRPTNTV
TVFAGQDKEATATMREFIQQAPSSPSYALFKGQDLVYFMPREFIEGRDINDIAMDLKDAFDENCK
>P9WMU9 4.6.1.1~~~~~~pH-sensitive adenylate cyclase Rv1264~~~COG2114
MTDHVREADDANIDDLLGDLGGTARAERAKLVEWLLEQGITPDEIRATNPPLLLATRHLVGDDGTYVSAREISENYGVDL
ELLQRVQRAVGLARVDDPDAVVHMRADGEAAARAQRFVELGLNPDQVVLVVRVLAEGLSHAAEAMRYTALEAIMRPGATE
LDIAKGSQALVSQIVPLLGPMIQDMLFMQLRHMMETEAVNAGERAAGKPLPGARQVTVAFADLVGFTQLGEVVSAEELGH
LAGRLAGLARDLTAPPVWFIKTIGDAVMLVCPDPAPLLDTVLKLVEVVDTDNNFPRLRAGVASGMAVSRAGDWFGSPVNV
ASRVTGVARPGAVLVADSVREALGDAPEADGFQWSFAGPRRLRGIRGDVRLFRVRRGATRTGSGGAAQDDDLAGSSP
>P9WM49 ~~~~~~Uncharacterized protein Rv1265~~~
MVLARPDAVFAPARNRCHVSLPVNAMSLKMKVCNHVIMRHHHMHGRRYGRPGGWQQAQQPDASGAAEWFAGRLPEDWFDG
DPTVIVDREEITVIGKLPGLESPEEESAARASGRVSRFRDETRPERMTIADEAQNRYGRKVSWGVEVGGERILFTHIAVP
VMTRLKQPERQVLDTLVDAGVARSRSDALAWSVKLVGEHTEEWLAKLRTAMSAVDDLRAQGPDLPA
>P9WM45 ~~~~~~Protein Rv1269c~~~
MTTMITLRRRFAVAVAGVATAAATTVTLAPAPANAADVYGAIAYSGNGSWGRSWDYPTRAAAEATAVKSCGYSDCKVLTS
FTACGAVAANDRAYQGGVGPTLAAAMKDALTKLGGGYIDTWACN
>P75349 3.1.4.-~~~~~~Putative metallophosphoesterase MG207 homolog~~~
MTKVLVLSDTHGYNDRWLAVMKLHNPDVVIHAGDHLTTKKFMDQNATFWVAGNHDVVGEEIQMFELEGIQFVLMHGHQAP
RHDLKQWYKMLVDQAKSYLCDVLIVGHSHIEHYETIDGIQVINPGSLEIPRNPRKLPTYCNLNLSQGRISDLTFHFPRD
>P9WQJ1 ~~~~~~Uncharacterized ABC transporter ATP-binding protein Rv1273c~~~COG1132
MLLALLRQHIRPYRRLVAMLMMLQLVSTLASLYLPTVNAAIVDDGVAKGDTATIVRLGAVMLGVTGLQVLCAIGAVYLGS
RTGAGFGRDLRSAMFEHIITFSERETARFGAPTLLTRSTNDVRQILFLVQMTATVLVTAPIMCVGGIIMAIHQEAALTWL
LLVSVPILAVANYWIISHMLPLFRRMQSLIDGINRVMRDQLSGVRVVRAFTREGYERDKFAQANTALSNAALSAGNWQAL
MLPVTTLTINASSVALIWFGGLRIDSGQMQVGSLIAFLSYFAQILMAVLMATMTLAVLPRASVCAERITEVLSTPAALGN
PDNPKFPTDGVTGVVRLAGATFTYPGADCPVLQDISLTARPGTTTAIVGSTGSGKSTLVSLICRLYDVTAGAVLVDGIDV
REYHTERLWSAIGLVPQRSYLFSGTVADNLRYGGGPDQVVTEQEMWEALRVAAADGFVQTDGLQTRVAQGGVNFSGGQRQ
RLAIARAVIRRPAIYVFDDAFSALDVHTDAKVHASLRQVSGDATIIVVTQRISNAAQADQVIVVDNGKIVGTGTHETLLA
DCPTYAEFAASQSLSATVGGVG
>A0QRX8 ~~~~~~Uncharacterized protein MSMEG_1276/MSMEI_1238.1~~~
MAQLVRDRIPELIRASGRTPVIRTLSDDEYWAALDAKLDEEVAELRAADTRDAALEGGRHRHRGESASGNGIQHGVPPNV
ALIPSGSTLLTPARSGHV
>P9WGF9 3.1.3.-~~~~~~Uncharacterized protein Rv1276c~~~COG2062
MRHAKSAYPDGIADHDRPLAPRGIREAGLAGGWLRANLPAVDAVLCSTATRARQTLAHTGIDAPARYAERLYGAAPGTVI
EEINRVGDNVTTLLVVGHEPTTSALAIVLASISGTDAAVAERISEKFPTSGIAVLRVAGHWADVEPGCAALVGFHVPR
>Q57431 1.-.-.-~~~~~~Putative NAD(P)H nitroreductase~~~COG0778
MTQLTREQVLELFHQRSSTRYYDPTKKISDEDFECILECGRLSPSSVGSEPWKFLVIQNKTLREKMKPFSWGMINQLDNC
SHLVVILAKKNARYDSPFFVDVMARKGLNAEQQQAALTKYKALQEEDMKLLENDRTLFDWCSKQTYIALANMLTGASALG
IDSCPIEGFHYDKMNECLAEEGLFDPQEYAVSVAATFGYRSRDIAKKSRKGLDEVVKWVG
>P9WM41 ~~~~~~Uncharacterized protein Rv1278~~~COG0419
MKLHRLALTNYRGIAHRDVEFPDHGVVVVCGANEIGKSSMVEALDLLLEYKDRSTKKEVKQVKPTNADVGSEVIAEISSG
PYRFVYRKRFHKRCETELTVLAPRREQLTGDEAHERVRTMLAETVDTELWHAQRVLQAASTAAVDLSGCDALSRALDLAA
GDDAALSGTESLLIERIEAEYARYFTPTGRPTGEWSAAVSRLAAAEAAVADCAAAVAEVDDGVRRHTELTEQVAELSQQL
LAHQLRLEAARVAAEKIAAITDDAREAKLIATAAAATSGASTAAHAGRLGLLTEIDTRTAAVVAAEAKARQAADEQATAR
AEAEACDAALTEATQVLTAVRLRAESARRTLDQLADCEEADRLAARLARIDDIEGDRDRVCAELSAVTLTEELLSRIERA
AAAVDRGGAQLASISAAVEFTAAVDIELGVGDQRVSLSAGQSWSVTATGPTEVKVPGVLTARIVPGATALDFQAKYAAAQ
QELADALAAGEVADLAAARSADLCRRELLSRRDQLTATLAGLCGDEQVDQLRSRLEQLCAGQPAELDLVSTDTATARAEL
DAVEAARIAAEKDCETRRQIAAGAARRLAETSTRATVLQNAAAAESAELGAAMTRLACERASVGDDELAAKAEADLRVLQ
TAEQRVIDLADELAATAPDAVAAELAEAADAVELLRERHDEAIRALHEVGVELSVFGTQGRKGKLDAAETEREHAASHHA
RVGRRARAARLLRSVMARHRDTTRLRYVEPYRAELHRLGRPVFGPSFEVEVDTDLRIRSRTLDDRTVPYECLSGGAKEQL
GILARLAGAALVAKEDAVPVLIDDALGFTDPERLAKMGEVFDTIGADGQVIVLTCSPTRYGGVKGAHRIDLDAIQ
>A0QRY1 ~~~~~~Uncharacterized protein MSMEG_1279/MSMEI_1241~~~COG4942
MDYDAITALRDRHPAWRLLRAGNASLVLSFLGEFFVEANRGACPAGQVAEALDNHLYALSAGSGESRYPKEPRAYLEDWA
ATDAGYLRRFYPPGDDEVHYEVTPAFEKAYAWVVNLQSRSFVGTESRLHTVVALLRQIVHGTEVQPDVRLAELRRRRAEL
EAEIAAVEAGDIAVLDPTAVRDRYQQLSTTARELLSDFREVEENFRLLDRAARERIAAWEGSKGGLLEELVGSRSEITGS
DQGRSFQAFYDFLLSEQRQAELAELIAKVSALDVLEADPRIRRIHHDWSEAADRAQRTVRQISEQLRRFLDDQVWLENRR
VLDLVRAVESIALEVRDAPPTFGLEVDEPGIEIALPFERPLYQPPTEVAVESHVSAATEEVNADLLFAQTYIDQARLADT
IRAVLPEDSSALLSDVVAVHPIEQGAAEIVGYLALNEDDLAIDVDDTEETVLEYPDPADPDITKRARLPKVTVRRR
>P9WMV5 1.1.-.-~~~~~~Uncharacterized GMC-type oxidoreductase Rv1279~~~COG2303
MDTQSDYVVVGTGSAGAVVASRLSTDPATTVVALEAGPRDKNRFIGVPAAFSKLFRSEIDWDYLTEPQPELDGREIYWPR
GKVLGGSSSMNAMMWVRGFASDYDEWAARAGPRWSYADVLGYFRRIENVTAAWHFVSGDDSGVTGPLHISRQRSPRSVTA
AWLAAARECGFAAARPNSPRPEGFCETVVTQRRGARFSTADAYLKPAMRRKNLRVLTGATATRVVIDGDRAVGVEYQSDG
QTRIVYARREVVLCAGAVNSPQLLMLSGIGDRDHLAEHDIDTVYHAPEVGCNLLDHLVTVLGFDVEKDSLFAAEKPGQLI
SYLLRRRGMLTSNVGEAYGFVRSRPELKLPDLELIFAPAPFYDEALVPPAGHGVVFGPILVAPQSRGQITLRSADPHAKP
VIEPRYLSDLGGVDRAAMMAGLRICARIAQARPLRDLLGSIARPRNSTELDEATLELALATCSHTLYHPMGTCRMGSDEA
SVVDPQLRVRGVDGLRVADASVMPSTVRGHTHAPSVLIGEKAADLIRS
>P9WGU5 ~~~~~~Uncharacterized protein Rv1280c~~~COG0747
MADRGQRRGCAPGIASALRASFQGKSRPWTQTRYWAFALLTPLVVAMVLTGCSASGTQLELAPTADRRAAVGTTSDINQQ
DPATLQDGGNLRLSLTDFPPNFNILHIDGNNAEVAAMMKATLPRAFIIGPDGSTTVDTNYFTSIELTRTAPQVVTYTINP
EAVWSDGTPITWRDIASQIHAISGADKAFEIASSSGAERVASVTRGVDDRQAVVTFAKPYAEWRGMFAGNGMLLPASMTA
TPEAFNKGQLDGPGPSAGPFVVSALDRTAQRIVLTRNPRWWGARPRLDSITYLVLDDAARLPALQNNTIDATGVGTLDQL
TIAARTKGISIRRAPGPSWYHFTLNGAPGSILADKALRLAIAKGIDRYTIARVAQYGLTSDPVPLNNHVFVAGQDGYQDN
SGVVAYNPEQAKRELDALGWRRSGAFREKDGRQLVIRDLFYDAQSTRQFAQIAQHTLAQIGVKLELQAKSGSGFFSDYVN
VGAFDIAQFGWVGDAFPLSSLTQIYASDGESNFGKIGSPQIDAAIERTLAELDPGKARALANQVDELIWAEGFSLPLTQS
PGTVAVRSTLANFGATGLADLDYTAIGFMRR
>P9WQJ5 ~~~~~~Uncharacterized ABC transporter ATP-binding protein Rv1281c~~~COG4172
MSPLLEVTDLAVTFRTDGDPVTAVRGISYRVEPGEVVAMVGESGSGKSAAAMAVVGLLPEYAQVRGSVRLQGTELLGLAD
NAMSRFRGKAIGTVFQDPMSALTPVYTVGDQIAEAIEVHQPRVGKKAARRRAVELLDLVGISQPQRRSRAFPHELSGGER
QRVVIAIAIANDPDLLICDEPTTALDVTVQAQILDVLKAARDVTGAGVLIITHDLGVVAEFADRALVMYAGRVVESAGVN
DLYRDRRMPYTVGLLGSVPRLDAAQGTRLVPIPGAPPSLAGLAPGCPFAPRCPLVIDECLTAEPELLDVATDHRAACIRT
ELVTGRSAADIYRVKTEARPAALGDASVVVRVRHLVKTYRLAKGVVLRRAIGEVRAVDGISLELRQGRTLGIVGESGSGK
STTLHEILELAAPQSGSIEVLGTDVATLGTAERRSLRRDIQVVFQDPVASLDPRLPVFDLIAEPLQANGFGKNETHARVA
ELLDIVGLRHGDASRYPAEFSGGQKQRIGIARALALQPKILALDEPVSALDVSIQAGIINLLLDLQEQFGLSYLFVSHDL
SVVKHLAHQVAVMLAGTVVEQGDSEEVFGNPKHEYTRRLLGAVPQPDPARRG
>Q92G43 ~~~~~~Putative adhesin RC1281~~~
MKKLLLIAAASTALLTSGLSFADCDMNSSVDSSTNSSMSSSVENQWYLKLNAGGVIFNKTKPKGADFKLNNIKSNIKSNT
GFTGEIGAGYYIMDNLRTDLTIGTVASSHLKKSKTYPDGNSFSVKNKPTIVSVLLNGYVDFVDLSMFKVFAGAGVGAAFV
KEKIHSKDIKGGVTDTFNGTTKNKTNFAYQLSLGTSFEVAQGVKAELVYSWRDYGKTKNTTKTINGDKVKFGGTHYKGHN
LMAGLRFDM
>P9WFZ9 ~~~~~~Putative peptide transport permease protein Rv1282c~~~COG1173
MTEFASRRTLVVRRFLRNRAAVASLAALLLLFVSAYALPPLLPYSYDDLDFNALLQPPGTKHWLGTNALGQDLLAQTLRG
MQKSMLIGVCVAVISTGIAATVGAISGYFGGWRDRTLMWVVDLLLVVPSFILIAIVTPRTKNSANIMFLVLLLAGFGWMI
SSRMVRGMTMSLREREFIRAARYMGVSSRRIIVGHVVPNVASILIIDAALNVAAAILAETGLSFLGFGIQPPDVSLGTLI
ADGTASATAFPWVFLFPASILVLILVCANLTGDGLRDALDPASRSLRRGVR
>P9WFZ7 ~~~~~~Putative peptide transport permease protein Rv1283c~~~COG0601
MTRYLARRLLNYLVLLALASFLTYCLTSLAFSPLESLMQRSPRPPQAVIDAKAHDLGLDRPILARYANWVSHAVRGDFGT
TITGQPVGTELGRRIGVSLRLLVVGSVFGTVAGVVIGAWGAIRQYRLSDRVMTTLALLVLSTPTFVVANLLILGALRVNW
AVGIQLFDYTGETSPGVAGGVWDRLGDRLQHLILPSLTLALAAAAGFSRYQRNAMLDVLGQDFIRTARAKGLTRRRALLK
HGLRTALIPMATLFAYGVAGLVTGAVFVEKIFGWHGMGEWMVRGISTQDTNIVAAITVFSGAVVLLAGLLSDVIYAALDP
RVRVS
>P9WME3 ~~~~~~Putative HTH-type transcriptional regulator Rv1287~~~COG1959
MRMSAKAEYAVRAMVQLATAASGTVVKTDDLAAAQGIPPQFLVDILTNLRTDRLVRSHRGREGGYELARPGTEISIADVL
RCIDGPLASVRDIGLGDLPYSGPTTALTDVWRALRASMRSVLEETTLADVAGGALPEHVAQLADDYRAQESTRHGASRHG
D
>Q97QD1 ~~~~~~UPF0122 protein SP_1288~~~COG2739
MEIEKTNRMNALFEFYAALLTDKQMNYIELYYADDYSLAEIAEEFGVSRQAVYDNIKRTEKILEDYEMKLHMYSDYIVRS
QIFDQILERYPKDDFLQEQIEILTSIDNRE
>P9WM37 ~~~~~~Uncharacterized protein Rv1289~~~
MCVSVGESVAQSLQQWDRKLWDVAMLHACNAVDETGRKRYPTLGVGTRFRTALRDSLDIYGVMATPGVDLEKTRFPVGVR
SDLLPDKRPDIADVLYGIHRWLHGHADESSVEFEVSPYVNASAALRIANDGKIQLPKSAILGLLAVAVFAPENKGEVIPP
DYQLSWYDHVFFISVWWGWQDHFREIVNVDRASLVALDFGDLWNGWTPVG
>P9WM35 ~~~~~~Uncharacterized protein Rv1290c~~~COG4325
MLQRSLGVNGRKLAMSARSAKRERKNASTAASKCYVVPPSARGWVHAYSVTATSMLNRRKAILDYLQGAVWVLPTFGVAI
GLGSGAVLSMIPVKSGTLIDKLMFQGTPGDARGVLIVVSATMITTIGIVFSLTVLSLQIASSQFSVRLLRTFLRDVPNQV
VLAIFACTFAYSTGGLHTVGEHRDGGAFIPKVAVTGSLALAFVSIAALIYFLHHLMHSIQIDTIMDKVRLRTLGLVDQLY
PESDTADRQVETPPSPPADAVPLLAPHSGYLQTVDVDDIAELAAASRYTALLVTFVGDYVTAGGLLGWCWRRGTAPGAPG
SDFPQRCLRHVHIGFERTLQQDIRFGLRQMVDIALRALSPALNDPYTAIQVVHHLSAVESVLASRALPDDVRRDRAGELL
FWLPYPSFATYLHVGCAQIRRYGSREPLVLTALLQLLSAVAQNCVDPSRRVAVQTQIALVVRAAQREFADESDRAMVLGA
AARATEVVERPGTLAPPPSTFGQVAAAQAAASTIRSADRDG
>P9WM31 ~~~~~~Uncharacterized protein Rv1303~~~
MTTPAQDAPLVFPSVAFRPVRLFFINVGLAAVAMLVAGVFGHLTVGMFLGLGLLLGLLNALLVRRSAESITAKEHPLKRS
MALNSASRLAIITILGLIIAYIFRPAGLGVVFGLAFFQVLLVATTALPVLKKLRTATEEPVATYSSNGQTGGSEGRSASD
D
>Q8YAJ5 ~~~~~~Cell wall protein Lmo0130~~~COG0737
MKVNKFFKKTTHVLLVAGLTIGLTAPFTGTTAQAAADTVPIQILGINDFHGALETASKDASGSPIGGADYLATNLDNATN
SFLQANPGATTDNAIRVQAGDMVGASPAVSGLLQDEPTMKVLQKMNFEVGTLGNHEFDEGLPEYKRILDGVSTNKFGPIV
EAYPRVKSDMKIVAANVVNKGTNTVAEGFLPYYVKEIDGVKVGFIGIVTTEIPNLVLANHIKDYDFLDEAETIVKYSAEL
RGQGVNAIVVLSHVPALSTGNPNTGTKQDVAGEAANMMTKANELDPNNSVDLVLAGHNHQYTNGLVGKTRIVQSYNNGKA
FSDVTGELDKTTGDFVSPPDAKITYNTRSVTPNADITAVTEDAKSRIEGVINETIGLANKDVISRDTNPDNKAIDDKESE
LGNMITDAQRYMANKAGADVDFAMTNNGGIRSDLTTRLANGQNEITWGAAQAVQPFGNILQVVEMTGADILEALNQQYLS
NQTYFLQISGLKYTFTDTDDLDHAYKVASVTTEDGTPLKTDQKYKVVINDFLFGGGDGFSAFKKANLVTAIDPDTETFIN
YIKDQKAAGKVITAQKEGRKVYKSQAEIDKETKDAAIKAIKEATKINKLAEKDKTLTGTTLPGATVSVQKATANARMALA
AGPNATADANGKFSVDVTSLNLKKGDQITTTITDPNGYSTTFQATVQAAATTPPDNGNGGTDNGNGNGNNGGTDGNGGTN
NGNGSGTNGGTTTTEDPTTTTSNTSTTGTSSNTSLPTTGDTAGLATVFGVILTTTALYVLRKRS
>Q7A0W3 ~~~~~~UPF0346 protein MW1311~~~
MKNYSFYQFVMTVRGRHDDKGRLAEEIFDDLAFPKHDDDFNILSDYIETHGDFTLPMSVFDDLYEEYTEWLKF
>P9WM29 ~~~~~~Uncharacterized protein Rv1312~~~
MSAPMIGMVVLVVVLGLAVLALSYRLWKLRQGGTAGIMRDIPAVGGHGWRHGVIRYRGGEAAFYRLSSLRLWPDRRLSRR
GVEIISRRAPRGDEFDIMTDEIVVVELCDSTQDRRVGYEIALDRGALTAFLSWLESRPSPRARRRSM
>Q9RUR8 ~~~~~~Uncharacterized protein DR_1314~~~COG3861
MTQASLIRLSELNNDAQYNLNDTSMYNPVGAAAYGVNGDKIGTVRDALVEPETGRIRYFLVDVGGWFSSKEVLVPVGYGR
VDDSGVYFDSLTKDQVKDMSEYRADQAYSSEMMDTDERVLRGNQSQEEYHQRAYQTPDRLQLLEERLVVNKDRFKAGSVQ
IGKRIETRQETVSVPLQREEVVIERHAVTDGRAVEGAVLGEGHQTMSVDLEAERANISKQAYVTEEVSVGKRAVTETQQV
TETVGREVLDVNQTGDVRTTEGTALTDDTTKRNI
>Q8NWR0 ~~~~~~DegV domain-containing protein MW1315~~~
MTKQIIVTDSTSDLSKEYLEANNIHVIPLSLTIEGASYVDQVDITSEEFINHIENDEDVKTSQPAIGEFISAYEELGKDG
SEIISIHLSSGLSGTYNTAYQASQMVDANVTVIDSKSISFGLGYQIQHLVELVKEGVSTSEIVKKLNHLRENIKLFVVIG
QLNQLIKGGRISKTKGLIGNLMKIKPIGTLDDGRLELVHNARTQNSSIQYLKKEIAEFIGDHEIKSIGVAHANVIEYVDK
LKKVFNEAFHVNNYDINVTTPVISAHTGQGAIGLVVLKK
>P9WQ33 ~~~~~~Uncharacterized protein Rv1318c~~~COG2114
MSAKKSTAQRLGRVLETVTRQSGRLPETPAYGSWLLGRVSESQRRRRVRIQVMLTALVVTANLLGIGVALLLVTIAIPEP
SIVRDTPRWLTFGVVPGYVLLALALGSYALTRQTVQALRWAIEGRKPTREEERRTFLAPWRVAVGHLMFWGVGTALLTTL
YGLINNAFIPRFLFAVSFCGVLVATATYLHTEFALRPFAAQALEAGPPPRRLAPGILGRTMVVWLLGSGVPVVGIALMAM
FEMVLLNLTRMQFATGVLIISMVTLVFGFILMWILAWLTATPVRVVRAALRRVERGELRTNLVVFDGTELGELQRGFNAM
VAGLRERERVRDLFGRHVGREVAAAAERERSKLGGEERHVAVVFIDIVGSTQLVTSRPPADVVKLLNKFFAIVVDEVDRH
HGLVNKFEGDASLTIFGAPNRLPCPEDKALAAARAIADRLVNEMPECQAGIGVAAGQVIAGNVGARERFEYTVIGEPVNE
AARLCELAKSRPGKLLASAQAVDAASEEERARWSLGRHVKLRGHDQPVRLAKPVGLTKPRR
>P9WQ31 ~~~~~~Uncharacterized protein Rv1319c~~~COG2114
MPAKKTMAQRLGQALETMTRQCGQLPETPAYGSWLLGRVSESPSRRWVRIKRIVTVYIMTANLTGIVVALLVVTFAFPVP
SIYTDAPWWVTFGVAPAYATLALAIGTYWITTRIVRASIRWAIEERAPSQADGRNTLLLPFRVAAVHLILWDIGGALLAT
LYGLANRVFVTIILFSVTICGVLVATNCYLFTEFALRPVAAKALEAGRPPRRFAPGIMGRTMTVWSLGSGVPVTGIATTA
LYVLLVHNLTETQLASAVLILSITTLIFGFLVMWILAWLTAAPVRVVRAALKRVEQGDLRGDLVVFDGTELGELQRGFNA
MVNGLRERERVRDLFGRHVGREVAAAAERERPQLGGEDRHAAVVFVDIVGSTQLVDNQPAAHVVKLLNRFFAIVVNEVDR
HHGLINKFAGDAALAIFGAPNRLDRPEDAALAAARAIADRLANEMPEVQAGIGVAAGQIVAGNVGAKQRFEYTVVGKPVN
QAARLCELAKSHPARLLASSDTLHAASETERAHWSLGETVTLRGHEQPTRLAVPT
>P43951 ~~~~~~Uncharacterized protein HI_0131~~~COG1840
MKFNKISLSVSTALLAAGLAVSGSANAKGRLVVYCSATNILCETTTKAFGEKYDVKTSFIRNGSGSTFAKVEAEKNNPQA
DVWFGGTFDPQAQAAELGLIEPYKSKHIDEIVERFREPAKTKGHYVSSIYMGILGFGVNTERLAKLGIKEVPKCWKDLTD
PRLKGEVQIADPQSAGTAYTALATFVQLWGEKEAFDFLKELHPNVSQYTKSGITPSRNSARGEATIGVGFLHDYALEKRN
GAPLELVVPCEGTGYELGGVSILKGARNIDNAKLFVDWALSKEGQELAWKQGDSLQILTNTTAEQSPTAFDPNKLKLINY
DFEKYGATEQRKALIEKWVQEVKLAK
>P9WQ29 ~~~~~~Uncharacterized protein Rv1320c~~~COG2114
MPSEKATTRHLPGAVETLSPRTGRRPETPAYGSWLLGRVSESPRMRRVRIQGMLTVAILVTNVIGLIVGAMLLTVAFPKP
SVILDAPHWVSFGIVPGYCVLAFILGTYWLTRQTARALRWAIEERTPSHDEARSAFLVPLRVALAVLFLWGAAAALWTII
YGLANRLFIPRFLFSMGVIGVVAATSCYLLTEFALRPMAAQALEVGATPRSLVRGIVGRTMLVWLLCSGVPNVGVALTAI
FDDTFWELSNDQFMITVLILWAPLLIFGFILMWILAWLTATPVRVVREALNRVEQGDLSGDLVVFDGTELGELQRGFNRM
VEGLRERERVRDLFGRHVGREVAAAAERERPKLGGEERHVAVVFVDIVGSTQLVTSRPAAEVVMLLNRFFTVIVDEVNHH
RGLVNKFQGDASLAVFGAPNRLSHPEDAALATARAIADRLASEMPECQAGIGVAAGQVVAGNVGAHERFEYTVIGEPVNE
AARLCELAKSYPSRLLASSQTLRGASENECARWSLGETVTLRGHDQPIRLTSPVQQLQMPAQSADIVGGALGDHQTHTIY
RGAHPTD
>P9WM27 ~~~~~~Uncharacterized protein Rv1322~~~
MARRRKPLHRQRPEPPSWALRRVEAGPDGHEYEVRPVAAARAVKTYRCPGCDHEIRSGTAHVVVWPTDLPQAGVDDRRHW
HTPCWANRATRGPTRKWT
>P9WG61 ~~~~~~Uncharacterized protein Rv1324~~~COG3118
MTRPRPPLGPAMAGAVDLSGIKQRAQQNAAASTDADRALSTPSGVTEITEANFEDEVIVRSDEVPVVVLLWSPRSEVCVD
LLDTLSGLAAAAKGKWSLASVNVDVAPRVAQIFGVQAVPTVVALAAGQPISSFQGLQPADQLSRWVDSLLSATAGKLKGA
ASSEESTEVDPAVAQARQQLEDGDFVAARKSYQAILDANPGSVEAKAAIRQIEFLIRATAQRPDAVSVADSLSDDIDAAF
AAADVQVLNQDVSAAFERLIALVRRTSGEERTRVRTRLIELFELFDPADPEVVAGRRNLANALY
>Q9JZ25 ~~~~~~Uncharacterized protein NMB1327~~~
MNNPDLPYRQALECLSQKQYNFTEVRRLLTEAFSAGHPAAAFELAKHLMDADSPYQDREQGMEMLRIAAEQGHPYARYNL
AYIQELEGAPPETLIPLYRPLAEEGLPEAQVRLMYLLYASRHFEEALEWAKTSAKNNNPHGQYLLAQYCRYGTPPDFETA
HLLYRKSAAQGLPEAHWQLGLQYRFGQGTKVDTAQAVNHLRAAAQQGYIPAYTPLAELILPTAPDEAVHWFQQAAQENDP
DAHAALADIYLQGKHLERNHKLALHHAEAAAAERHPEGLRILGDICRYGLGIAPDTEKARHYYRQAAEAGSLSAYQKLIS
DSALNHPDQYGGIKDSAIRRQRAERLYQKAQALHYGLQCAPEYAAALKLYTEAAELGHSKAQTNLGSMYYFGQGMTADYN
EARKWFEKAAAKKDSMAFYNLACIHYSGHGVEPDKEKACRYLQEAINNGYGQKSVLQELLQQWQNAV
>Q87Q20 ~~~~~~Uncharacterized protein VP1330~~~COG3938
MRQGTFFCIDAHTCGNPVRLVAGGVPPLEGNTMSEKRQYFLEHYDWIRQALMFEPRGHSMMSGSVVLPPCSDNADASILF
IETSGCLPMCGHGTIGTVTTAIENRLITPKEEGRLILDVPAGQIEVHYQTKGDKVTSVKIFNVPAYLAHQDVTVEIEGLG
EITVDVAYGGNYYVIVDPQENYAGLEHYSPDEILMLSPKVRTAVSKAVECIHPNDPTVCGVSHVLWTGKPTQEGATARNA
VFYGDKALDRSPCGTGTSARMAQWHAKGKLKSGEDFVHESIIGSLFNGRIEGITEVNGQTAILPSIEGWAQVYGHNTIWV
DDEDPYAYGFEVK
>P71376 ~~~~~~RNA-binding protein HI_1333~~~COG1534
MTTLSTKQKQFLKGLAHHLNPVVMLGGNGLTEGVLAEIENALNHHELIKVKVAGADRETKQLIINAIVRETKAAQVQTIG
HILVLYRPSEEAKIQLPRK
>P9WM23 3.4.11.-~~~~~~Uncharacterized aminopeptidase Rv1333~~~COG3191
MNSITDVGGIRVGHYQRLDPDASLGAGWACGVTVVLPPPGTVGAVDCRGGAPGTRETDLLDPANSVRFVDALLLAGGSAY
GLAAADGVMRWLEEHRRGVAMDSGVVPIVPGAVIFDLPVGGWNCRPTADFGYSACAAAGVDVAVGTVGVGVGARAGALKG
GVGTASATLQSGVTVGVLAVVNAAGNVVDPATGLPWMADLVGEFALRAPPAEQIAALAQLSSPLGAFNTPFNTTIGVIAC
DAALSPAACRRIAIAAHDGLARTIRPAHTPLDGDTVFALATGAVAVPPEAGVPAALSPETQLVTAVGAAAADCLARAVLA
GVLNAQPVAGIPTYRDMFPGAFGS
>Q9JZ20 ~~~~~~Uncharacterized protein NMB1333~~~
MRYKPLLLALMLVFSTPAVAAHDAAHNRSAEVKKQTKNKKEQPEAAEGKKEKGKNGAVKDKKTGGKEAAKEGKESKKTAK
NRKEAEKEATSRQSARKGREGDKKSKAEHKKAHGKPVSGSKEKNAKTQPENKQGKKEAKGQGNPRKGGKAEKDTVSANKK
VRSDKNGKAVKQDKKYREEKNAKTDSDELKAAVAAATNDVENKKALLKQSEGMLLHVSNSLKQLQEERIRQERIRQARGN
LASVNRKQREAWDKFQKLNTELNRLKTEVAATKAQISRFVSGNYKNSQPNAVALFLKNAEPGQKNRFLRYTRYVNASNRE
VVKDLEKQQKALAVQEQKINNELARLKKIQANVQSLLKKQGVTDAAEQTESRRQNAKIAKDARKLLEQKGNEQQLNKLLS
NLEKKKAEHRIQDAEAKRKLAEARLAAAEKARKEAAQQKAEARRAEMSNLTAEDRNIQAPSVMGIGSADGFSRMQGRLKK
PVDGVPTGLFGQNRSGGDIWKGVFYSTAPATVESIAPGTVSYADELDGYGKVVVVDHGENYISIYAGLSEISVGKGYMVA
AGSKIGSSGSLPDGEEGLYLQIRYQGQVLNPSSWIR
>P71378 ~~~~~~Uncharacterized protein HI_1339/HI_1462.1~~~
MEKIMKKLTLALVLGSALVVTGCFDKQEAKQKVEDTKQTVASVASETKDAAANTMTEVKEKAQQLSTDVKNKVAEKVEDA
KEVIKSATEAASEKVGEMKEAASEKASEMKEAVSEKATQAVDAVKEATK
>P9WGC1 ~~~~~~Uncharacterized protein Rv1339~~~COG1234
MRRCIPHRCIGHGTVVSVRITVLGCSGSVVGPDSPASGYLLRAPHTPPLVIDFGGGVLGALQRHADPASVHVLLSHLHAD
HCLDLPGLFVWRRYHPSRPSGKALLYGPSDTWSRLGAASSPYGGEIDDCSDIFDVHHWADSEPVTLGALTIVPRLVAHPT
ESFGLRITDPSGASLAYSGDTGICDQLVELARGVDVFLCEASWTHSPKHPPDLHLSGTEAGMVAAQAGVRELLLTHIPPW
TSREDVISEAKAEFDGPVHAVVCDETFEVRRAG
>P75265 ~~~~~~Uncharacterized lipoprotein MG186 homolog~~~
MKGFSCSRPGYLTGLLLLAVAPILTACTRDYTTKNEFQLTTAQQAKLKPATIEYWRDGDTPEINYASEERRKEAEQKSKE
NAKKEDKKEEKKTEDSQDSSSASTQVRSSKHGLRIYGIDTPEKHVSSKGDSTGDEKIEAEKASNYAEKLIPKGSTVWVWS
LNTYSYDREVGALFFKSNPKQTFFQSFEVAMVEAGHAIPIAGTGLNLIADPELSADDPLSVIGLQLANAANKAYNAKINI
WSHDTDGYRSLTAVYKLRGADISWTRFLDEANGYSSASAGTGASLYQLWDQRQAKLAQKGS
>P9WM19 ~~~~~~Uncharacterized protein Rv1342c~~~
MTAPETPAAQHAEPAIAVERIRTALLGYRIMAWTTGLWLIALCYEIVVRYVVKVDNPPTWIGVVHGWVYFTYLLLTLNLA
VKVRWPLGKTAGVLLAGTIPLLGIVVEHFQTKEIKARFGL
>Q7A5G0 ~~~~~~UPF0403 protein SA1345~~~
MDMNFDLYMNGVVEQARNEIESAGYEQLTTAEDVDKVLKQDGTTLVMINSVCGCAGGIARPAASHALHYDVLPDRLVTVF
AGQDKEATQRAREYFEGYAPSSPSFALVKDGKITEMIERHQIEGHDVMNVINQLQTLFNKYCEER
>P45173 ~~~~~~Uncharacterized protein HI_1349~~~COG0783
MSKTSIGLDKVQSAELADKLNELLATYQVFYTNVRGYHWNIKGVNFFALHAKFEEIYTNLVARVDEVAERILTLGYTPNN
AYSQYLKISRIKEDIAVSEAQECLSGTLQGLKTLLDQQREILAFANNANDEGTASQMSDYIKEQEKLVWMFQAACQTCHN
>P75264 ~~~~~~Putative ABC transporter ATP-binding protein MG187 homolog~~~
MEKIQAEKSQSAIEFKNIVVDFGESIAIDNINLTVKKKELVTLLGPSGCGKTTSLSVIAGLIAPTSGQVLFNGYDVTKKP
PQQRKLGLVFQNYALYPHMSVFENIVFPLYSDTSWREAIFEKNTWAQHDINCLILKANGATSEELAELNRLMQQRIDEPK
RMAYQINDLMVSVFQKQSELEANLKLIPRKKQFAIISLSKETLSQIRDVETKAKAALETADSAEVEQTIKSELKQKLSEI
KANYHDEKANIKAYWWEMLANIKTELKTEKTAIKQTNDYAKLKELKWKIHFEPLNLKKQYRSYFKQLKAKYSLKDGNLTE
SELSQIEELQKRIVSLKDFINRTAKEVAEKLEITKILHKRPANISGGQQQRVAIARAIVRRPKVLLMDEPLSNLDAKLRV
QTRQWIRKFQQDLQITTVFVTHDQEEAMSISDTIVCMSTGKVQQIGSPSELYLKPANEFVATFLGSPEMNIVNATVKAGQ
LLWNENPLVKTKFDLPDGAIRVGFRYDEVTAPKNDGSPVFSGTLISVENLGKHMVGVVESNGVQLNVRLELSHQFEVGNA
VKFTIKPNGLHFFDPQTTQRVEVKHV
>P9WGR9 1.-.-.-~~~fabG2~~~Uncharacterized oxidoreductase Rv1350~~~COG1028
MASLLNARTAVITGGAQGLGLAIGQRFVAEGARVVLGDVNLEATEVAAKRLGGDDVALAVRCDVTQADDVDILIRTAVER
FGGLDVMVNNAGITRDATMRTMTEEQFDQVIAVHLKGTWNGTRLAAAIMRERKRGAIVNMSSVSGKVGMVGQTNYSAAKA
GIVGMTKAAAKELAHLGIRVNAIAPGLIRSAMTEAMPQRIWDQKLAEVPMGRAGEPSEVASVAVFLASDLSSYMTGTVLD
VTGGRFI
>P9WM15 ~~~~~~Uncharacterized protein Rv1352~~~
MARTLALRASAGLVAGMAMAAITLAPGARAETGEQFPGDGVFLVGTDIAPGTYRTEGPSNPLILVFGRVSELSTCSWSTH
SAPEVSNENIVDTNTSMGPMSVVIPPTVAAFQTHNCKLWMRIS
>P9WM03 ~~~~~~Uncharacterized protein Rv1360~~~COG2141
MGGARRLKLDGSIPNQLARAADAAVALERNGFDGGWTAEASHDPFLPLLLAAEHTSRLELGTNIAVAFARNPMIVANVGW
DLQTYSKGRLILGLGTQIRPHIEKRFSMPWGHPARRMREFVAALRAIWLAWQDGTKLCFEGEFYTHKIMTPMFTPEPQPY
PVPRVFIAAVGEAMTEMCGEVADGHLGHPMVSKRYLTEVSVPALLRGLARSGRDRSAFEVSCEVMVATGADDAELAAACT
ATRKQIAFYGSTPAYRKVLEQHGWGDLHPELHRLSKLGEWEAMGGLIDDEMLGAFAVVGPVDTIAGALRNRCEGVVDRVL
PIFMAASQECINAALQDFRR
>P9WM01 ~~~~~~Uncharacterized protein Rv1362c~~~COG0443
MTDDVRDVNTETTDATEVAEIDSAAGEAGDSATEAFDTDSATESTAQKGQRHRDLWRMQVTLKPVPVILILLMLISGGAT
GWLYLEQYRPDQQTDSGAARAAVAAASDGTIALLSYSPDTLDQDFATARSHLAGDFLSYYDQFTQQIVAPAAKQKSLKTT
AKVVRAAVSELHPDSAVVLVFVDQSTTSKDSPNPSMAASSVMVTLAKVDGNWLITKFTPV
>P9WLZ9 ~~~~~~Uncharacterized protein Rv1363c~~~
MAETTEPPSDAGTSQADAMALAAEAEAAEAEALAAAARARARAARLKREALAMAPAEDENVPEEYADWEDAEDYDDYDDY
EAADQEAARSASWRRRLRVRLPRLSTIAMAAAVVIICGFTGLSGYIVWQHHEATERQQRAAAFAAGAKQGVINMTSLDFN
KAKEDVARVIDSSTGEFRDDFQQRAADFTKVVEQSKVVTEGTVNATAVESMNEHSAVVLVAATSRVTNSAGAKDEPRAWR
LKVTVTEEGGQYKMSKVEFVP
>P9WLZ5 ~~~~~~Uncharacterized protein Rv1366~~~COG2357
MVVALVGSAIVDLHSRPPWSNNAVRRLGVALRDGVDPPVDCPSYAEVMLWHADLAAEVQDRIEGRSWSASELLVTSRAKS
QDTLLAKLRRRPYLQLNTIQDIAGVRIDADLLLGEQTRLAREIADHFGADQPAIHDLRDHPHAGYRAVHVWLRLPAGRVE
IQIRTILQSLWANFYELLADAYGRGIRYDERPEQLAAGVVPAQLQELVGVMQDASADLAMHEAEWQHCAEIEYPGQRAMA
LGEASKNKATVLATTKFRLERAINEAESAGGGG
>P9WLZ3 ~~~~~~Protein Rv1367c~~~COG1680
MNLDGNQASIREVCDAGLLSGAVTMVWQREKLLQVNEIGYRDIDAGVPMQRDTLFRIASMTKPVTVAAAMSLVDEGKLAL
RDPITRWAPELCKVAVLDDAAGPLDRTHPARRAILIEDLLTHTSGLAYGFSVSGPISRAYQRLPFGQGPDVWLAALATLP
LVHQPGDRVTYSHAIDVLGVIVSRIEDAPLYQIIDERVLGPAGMTDTGFYVSADAQRRAATMYRLDEQDRLRHDVMGPPH
VTPPSFCNAGGGLWSTADDYLRFVRMLLGDGTVDGVRVLSPESVRLMRTDRLTDEQKRHSFLGAPFWVGRGFGLNLSVVT
DPAKSRPLFGPGGLGTFSWPGAYGTWWQADPSADLILLYLIQHCPDLSVDAAAAVAGNPSLAKLRTAQPKFVRRTYRALG
L
>Q97Q59 ~~~~~~UPF0342 protein SP_1372~~~COG3679
MSNIYDSANELSRGLRGLPEYKAVKAAKDAIAADAEASKIFTDYLAFQEEIQKLAQTGQMPDASFQAKMEGFGKQIQGNS
LLSEFFTKQQQLAIYLSDIEKIVFEPVSELLK
>Q6GH41 5.3.2.-~~~~~~Probable tautomerase SAR1376~~~
MPIVNVKLLEGRSDEQLKNLVSEVTDAVEKTTGANRQAIHVVIEEMKPNHYGVAGVRKSDQ
>O86237 ~~~~~~Uncharacterized protein HI_1388.1~~~COG1942
MITVFGLKSKLAPRREKLAEVIYNSLHLGLDIPKGKHAIRFLCLEKEDFYYPFDRSDDYTVIEINLMAGRMEGTKKRLIK
MLFSELEYKLGIRAHDVEITIKEQPAHCWGFRGMTGDEARDLDYDIYV
>P9WMJ1 ~~~~~~Uncharacterized HTH-type transcriptional regulator Rv1395~~~COG2207
MGHLPPPAEVRHPVYATRVLCEVANERGVPTADVLAGTAIEPADLDDPDAVVGALDEITAVRRLLARLPDDAGIGIDVGS
RFALTHFGLFGFAVMSCGTLRELLTIAMRYFALTTMHVDITLFETADDCLVELDASHLPADVRGFFIERDIAGIIATTTS
FALPLAAKYADQVSAELAVDAELLRPLLELVPVHDVAFGRAHNRVHFPRAMFDEPLPQADRHTLEMCIAQCDVLMQRNER
RRGITALVRSKLFRDSGLFPTFTDVAGELDMHPRTLRRRLAEEGTSFRALLGEARSTVAVDLLRNVGLTVQQVSTRLGYT
EVSTFSHAFKRWYGVAPSEYSRRG
>P9WLY7 ~~~~~~Uncharacterized protein Rv1405c~~~COG2226
MTIDTPAREDQTLAATHRAMWALGDYALMAEEVMAPLGPILVAAAGIGPGVRVLDVAAGSGNISLPAAKTGATVISTDLT
PELLQRSQARAAQQGLTLQYQEANAQALPFADDEFDTVISAIGVMFAPDHQAAADELVRVCRPGGTIGVISWTCEGFFGR
MLATIRPYRPSVSADLPPSALWGREAYVTGLLGDGVTGLKTARGLLEVKRFDTAQAVHDYFKNNYGPTIEAYAHIGDNAV
LAAELDRQLVELAAQYLSDGVMEWEYLLLTAEKR
>P9WGX3 2.1.1.-~~~~~~Putative methyltransferase Rv1407~~~COG0144
MTPRSRGPRRRPLDPARRAAFETLRAVSARDAYANLVLPALLAQRGIGGRDAAFATELTYGTCRARGLLDAVIGAAAERS
PQAIDPVLLDLLRLGTYQLLRTRVDAHAAVSTTVEQAGIEFDSARAGFVNGVLRTIAGRDERSWVGELAPDAQNDPIGHA
AFVHAHPRWIAQAFADALGAAVGELEAVLASDDERPAVHLAARPGVLTAGELARAVRGTVGRYSPFAVYLPRGDPGRLAP
VRDGQALVQDEGSQLVARALTLAPVDGDTGRWLDLCAGPGGKTALLAGLGLQCAARVTAVEPSPHRADLVAQNTRGLPVE
LLRVDGRHTDLDPGFDRVLVDAPCTGLGALRRRPEARWRRQPADVAALAKLQRELLSAAIALTRPGGVVLYATCSPHLAE
TVGAVADALRRHPVHALDTRPLFEPVIAGLGEGPHVQLWPHRHGTDAMFAAALRRLT
>P73594 ~~~~~~WD repeat-containing protein slr1409~~~COG2319
MRIFPVFLLTFSLFLIKEEIVTAEVKVSAPVVNQSGIKLNLERQFTGSDVAINRIHFSPDGQFLLTAAADGVGTLWTKEG
EMLGQLQGQKPPMFNARLSPDRQILITTGYDGTIRLWNLQGELLEEQQPHRAAVADAIFSPDSQIIVTCSDDGQTKIFTR
QGQEIASVLKSGTARNLAYHPQGLLIASVSDSGSLHLINPNGKIEREISTGQGRINNVNFSPNGEQLLTSGINGSAKLWN
LAGELIHEYKVVPTGWVNSAQFYPKGEWLATASDDGTIRFWQKDGQLIYELPLVNARLTSLSFSPDGKQLAATSSQGQVW
VFNLSY
>P44184 ~~~~~~Uncharacterized protein HI_1410~~~COG1783
MFREIQKSISDSVIQMLADQIEMLSLQAFFDVQKTQIIGQNGSRFTFAGLKTNITSIKSMTGIDVVWVEEGENVSKESWD
ILIPTIREDGSQIIVSFNPKNILDDTYQRFVIHPPERCKSVLVNWQDNPYFPKELMEDMEQMRERDYELYRHVYEGEPVA
DSDLAIIKPVWIEYAVDAHLKLGFTAKGMKKVGFDVADEGADSNANAFVHGSVVLDIEVWKNGDVIDSANRTNQSAVKFK
ADLIIFDSIGVGAGVKAHFKRLPKSLQVEGFNAGGAVAYPEREYIKGKKNQDMFSNIKAQSWWALRDRFYKTYRAVKYGD
VYPDDELISLSSKIKELEYLKAELSRPRVDYDNNGRVKVESKKDMKKRGIPSPNMADALVMCYAPTKPKSLLDL
>Q9X1D0 ~~~~~~Uncharacterized protein TM_1410~~~COG2342
MSHLKNILFIIIVSLFFISSCSTVMSTEGWFMPFDNWLYQLQNADPVEISSSGFEIAVIDYSKDGSESGEYSPEEIKIMV
DAGVVPVAYVNIGQAEDYRFYWKESWYTNTPEWLGEEDPAWPGNYFVKYWYNEWKEIVFSYLDRVIDQGFKGIYLDRIDS
FEYWAQEGVISRRSAARKMINFVLEIAEYVRERKPDMLIIPQNGENILDFDDGQLASTVSGWAVENLFYLKTIPLEENET
KSRLEYLIRLNRKGKFILSVDYVDDGSDSFENISRILDYYEKAKRNGCIPYAARSDLELDEMNVIEGIQPPEALKDYESR
TYR
>Q8DP17 ~~~~~~DegV domain-containing protein spr1415~~~COG1307
MKLAVFTDSSAYLSAETLQREDLFVLDIPVNIDGEEYVEGINLSAEEFYQKMAQASELPKTSQPSIAKLDEILTSLKEQG
YTHALGLFLSSGISGFYQSIQYMVDDYEGLTIAFPDTLITSAPLGIMVESVFNWRDQGDDFASIQDKLAIQISRTSAFIM
VDDLDHLVKGGRLSNGAAILGNLLSIKPILYFNDQGVIEVYEKVRTEKKATKRLIEIIKETTASGQYRVIVIHGNAPEKA
EELRQHLLDFGLGSDVSLATFGSVIGTHLGAGSIALGYIPVI
>Q83BT6 ~~~~~~Uncharacterized HTH-type transcriptional regulator CBU_1416~~~COG1974
MASLSSNLKTLMTSVHINASELARRTGIAQPIIHRLSTGQNTNPKLATIKPIARYFMVNISQLIGEEPLPSDQSPQITGN
YRAWNRVPLISWKDATSWPEALPHYQTSDEVMYISTDANVSKLAYGLIIQGCAMEPLFPNGTTIIVEPERKPKDRDFVVV
RLQGEPEARLRQIITEGNDRYLKSLNPELEKLEVARLAQEDQFLGVMAQAKVDFLR
>P9WLY1 ~~~~~~Uncharacterized protein Rv1417~~~
MTAAPNDWDVVLRPHWTPLFAYAAAFLIAVAHVAGGLLLKVGSSGVVFQTADQVAMGALGLVLAGAVLLFARPRLRVGSA
GLSVRNLLGDRIVGWSEVIGVSFPGGSRWARIDLADDEYIPVMAIQAVDKDRAVAAMDTVRSLLARYRPDLCAR
>P9WLX9 ~~~~~~Uncharacterized protein Rv1419~~~
MGELRLVGGVLRVLVVVGAVFDVAVLNAGAASADGPVQLKSRLGDVCLDAPSGSWFSPLVINPCNGTDFQRWNLTDDRQV
ESVAFPGECVNIGNALWARLQPCVNWISQHWTVQPDGLVKSDLDACLTVLGGPDPGTWVSTRWCDPNAPDQQWDSVP
>Q9X1D7 ~~~~~~Protein TM_1420~~~COG1905
MIVRVCMGSSCHLKGSYEVVRRFQELQKKYNFKLYGSLCFGNCSQGVCVEIDGRLFSRVTPENAEEILKKVLQNG
>P9WFQ3 ~~~~~~Nucleotide-binding protein Rv1421~~~COG1660
MMNHARGVENRSEGGGIDVVLVTGLSGAGRGTAAKVLEDLGWYVADNLPPQLITRMVDFGLAAGSRITQLAVVMDVRSRG
FTGDLDSVRNELATRAITPRVVFMEASDDTLVRRYEQNRRSHPLQGEQTLAEGIAAERRMLAPVRATADLIIDTSTLSVG
GLRDSIERAFGGDGGATTSVTVESFGFKYGLPMDADMVMDVRFLPNPHWVDELRPLTGQHPAVRDYVLHRPGAAEFLESY
HRLLSLVVDGYRREGKRYMTIAIGCTGGKHRSVAIAEALMGLLRSDQQLSVRALHRDLGRE
>O25966 ~~~~~~Uncharacterized protein HP_1423~~~COG1188
MRIDKFLQSVGLVKRRVLATDMCNVGAVWLNGSCAKASKEVKAGDTISLHYLKGIEEYTILQIPALKNVPRKDTHLYIAP
KTKE
>P9WLX7 ~~~~~~Uncharacterized protein Rv1424c~~~
MTVVPGAPSRPASAVSRPSYRQCVQASAQTSARRYSFPSYRRPPAEKLVFPVLLGILTLLLSACQTASASGYNEPRGYDR
ATLKLVFSMDLGMCLNRFTYDSKLAPSRPQVVACDSREARIRNDGFHANAPSCMRIDYELITQNHRAYYCLKYLVRVGYC
YPAVTTPGKPPSVLLYAPSACDESLPSPRVATALVPGTRSANREFSRFVVTEIKSLGAGGRCDSASVSLQPPEEIEGPAI
PPASSQLVCVAPK
>P9WKC1 2.3.1.20~~~~~~Putative diacyglycerol O-acyltransferase Rv1425~~~COG1020
MKRLSSVDAAFWSAETAGWHMHVGALAICDPSDAPEYSFQRLRELIIERLPEIPQLRWRVTGAPLGLDRPWFVEDEELDI
DFHIRRIGVPAPGGRRELEELVGRLMSYKLDRSRPLWELWVIEGVEGGRIATLTKMHHAIVDGVSGAGLGEILLDITPEP
RPPQQETVGFVGFQIPGLERRAIGALINVGIMTPFRIVRLLEQTVRQQIAALGVAGKPARYFEAPKTRFNAPVSPHRRVT
GTRVELARAKAVKDAFGVKLNDVVLALVAGAARQYLQKRDELPAKPLIAQIPVSTRSEETKADVGNQVSSMTASLATHIE
DPAKRLAAIHESTLSAKEMAKAPSAHQIMGLTETTPPGLLQLAARAYTASGLSHNLAPINLVVSNVPGPPFPLYMAGARL
DSLVPLGPPVMDVALNITCFSYQDYLDFGLVTTPEVANDIDEMADAIEPALAELERAAE
>P44196 ~~~~~~Uncharacterized protein HI_1427~~~COG5266
MKKTLITSLLVLSGIAQAHEVWVQAPTKLASGSVLKAELAYGDYPYVEKIPEARLKIFAPMEIIHQNGEKQTLIQKGENY
QYQSEKALSDGSYWVTATYKPTFWSQNAEGWKMDNLKGLENPTYCEQTQMFGKSLVTVGKKPLNAEMAMTRVGLPLEIVP
LRDPSKAKSGEPFPVQIFYQDQPLAGETVIATADTIIVKDLEASTGHREPQGFSGKTDSQGRVNIIPLIDGIWKIKVIHK
TPFADQQICQQSASYSTLILPVGKGLAKLPPKPEHHHH
>Q5XAJ8 ~~~~~~Uncharacterized protein Spy1430~~~
MCLSYNRRNQRKKVVLMTIKKFSLLAIASLSLLSLAACDMDDKDDHMDNQPKTSQTSKKVKLSEDKAKSIALKDASVTEA
DAQMLSVTQDNEDGKAVYEIEFQNKDQEYSYTIDANSGDIVEKSSEPIND
>Q9JYT7 ~~~~~~Uncharacterized protein NMB1437~~~
MSARENILAKLKKADALPMEEPAVFDYYREMGVSWGSEVERLKHWAAAMRAVKTEIYWVTKSNWMQVFREAAEGKGLKNI
LLPLATEHGQIARAALADSNIEPIAFEREIDTWKTEFFTNIDAGFSGAQCGIARTGTLMLFSSPEEPRTLSLVPPVHFCL
FDTSKMYNEFHNAVEGEKLVENGMPTNVFLISGPSKTADIQLTLAYGAHGPRDLVILAILPDHISPADLEENA
>Q9X1F5 ~~~~~~Putative anti-sigma factor antagonist TM_1442~~~COG1366
MNNLKLDIVEQDDKAIVRVQGDIDAYNSSELKEQLRNFISTTSKKKIVLDLSSVSYMDSAGLGTLVVILKDAKINGKEFI
LSSLKESISRILKLTHLDKIFKITDTVEEA
>Q7A598 ~~~~~~UPF0473 protein SA1443~~~
MTEHNHDSQLEINNEEELLTLFDEEGNEVLYRKVLEFYHPEFKKEYVILAEEGAQSDEDDMIELVPMINEPDESGDGGKL
VPIETDEEWDMIEEVVNTEMEE
>P60359 ~~~~~~UPF0297 protein SA1445~~~
MENFDKTMKFDYEELPTQDVRDVLNNVYRTLDERGYNAVNQIVGYLLSGDPAYIPRQNEARNQIRHIDRDVIMEELVSYY
LKEQNK
>P44199 ~~~~~~UPF0263 protein HI_1450~~~COG3099
MTTEIKKLDPDTAIDIAYDIFLEMAGENLDPADILLFNLQFEERGGVEFVETADDWEEEIGVLIDPEEYAEVWVGLVNEQ
DEMDDVFAKFLISHREEDREFHVIWKK
>Q7A593 ~~~~~~UPF0337 protein SA1452~~~
MADESKFDQFKGNVKETVGNVTDNKELEKEGQQDKATGKAKEVVENAKNKITDAIDKLKK
>Q57127 ~~~~~~Uncharacterized protein HI_1453~~~COG0526
MKKLLSIFLMAFSLNAFAQTNLADVQLKDLNNQPVTLSQYKGKPVYVKMWASWCPICLAGLAEIDDLSAEKDRNFEVITI
VSPDHKGEKDTADFIEWYKGLEYKNITVLLDEKGEIIDKARVRGYPFNLFLDSDLNLKKTVPGHLGAEQIRVFAEK
>Q57201 ~~~~~~Uncharacterized protein HI_1457~~~COG3637
MKKLLIVTMLFTLALSAQAQWYVQGDLGASKIDITHVNSSNSPSFTQRISVGYAFDKNFRLAVDYTNYGKVTANYADVVD
VSLKGKSLGLTGFYDFDLADFKPYVGVRVSTNGADVTANARYYRIEAFATETRIGIGALAGVQYKLTDNVALNTNIEYNR
LASNVSDVGVKAGLRFSF
>O53150 2.4.1.-~~~~~~Alpha-(1->6)-mannopyranosyltransferase Rv1459c~~~
MAARHHTLSWSIASLHGDEQAVGAPLTTTELTALARTRLFGATGTVLMAIGALGAGARPVVQDPTFGVRLLNLPSRIQTV
SLTMTTTGAVMMALAWLMLGRFTLGRRRMSRGKLDRTLLLWMLPLLIAPPMYSKDVYSYLAQSEIGRDGLDPYRVGPASG
LGLGHVFTLSVPSLWRETPAPYGPLFLWIGRGISSLTGENIVAAVLCHRLVVLIGVTLIVWATPRLAQRCGVAEVSALWL
GAANPLLIMHLVAGIHNEALMLGLMLTGVEFALRGLDMANTPRPSPETWRLGPATIRASRRPELGASPRAGASRAVKPRP
EWGPLAMLLAGSILITLSSQVKLPSLLAMGFVTTVLAYRWGGNLRALLLAAAVMASLTLAIMAILGWASGLGFGWINTLG
TANVVRSWMSPPTLLALGTGHVGILLGLGDHTTAVLSLTRAIGVLIITVMVCWLLLAVLRGRLHPIGGLGVALAVTVLLF
PVVQPWYLLWAIIPLAAWATRPGFRVAAILATLIVGIFGPTANGDRFALFQIVDATAASAIIVILLIALTYTRLPWRPLA
AEQVVTAAESASKTPATRRPTAAPDAYADST
>P9WFP7 ~~~~~~UPF0051 protein Rv1461~~~COG0719
MTLTPEASKSVAQPPTQAPLTQEEAIASLGRYGYGWADSDVAGANAQRGLSEAVVRDISAKKNEPDWMLQSRLKALRIFD
RKPIPKWGSNLDGIDFDNIKYFVRSTEKQAASWDDLPEDIRNTYDRLGIPEAEKQRLVAGVAAQYESEVVYHQIREDLEA
QGVIFLDTDTGLREHPDIFKEYFGTVIPAGDNKFSALNTAVWSGGSFIYVPPGVHVDIPLQAYFRINTENMGQFERTLII
ADEGSYVHYVEGCLPAGELITTADGDLRPIESIRVGDFVTGHDGRPHRVTAVQVRDLDGELFTFTPMSPANAFSVTAEHP
LLAIPRDEVRVMRKERNGWKAEVNSTKLRSAEPRWIAAKDVAEGDFLIYPKPKPIPHRTVLPLEFARLAGYYLAEGHACL
TNGCESLIFSFHSDEFEYVEDVRQACKSLYEKSGSVLIEEHKHSARVTVYTKAGYAAMRDNVGIGSSNKKLSDLLMRQDE
TFLRELVDAYVNGDGNVTRRNGAVWKRVHTTSRLWAFQLQSILARLGHYATVELRRPGGPGVIMGRNVVRKDIYQVQWTE
GGRGPKQARDCGDYFAVPIKKRAVREAHEPVYNLDVENPDSYLAYGFAVHNCTAPIYKSDSLHSAVVEIIVKPHARVRYT
TIQNWSNNVYNLVTKRARAEAGATMEWIDGNIGSKVTMKYPAVWMTGEHAKGEVLSVAFAGEDQHQDTGAKMLHLAPNTS
SNIVSKSVARGGGRTSYRGLVQVNKGAHGSRSSVKCDALLVDTVSRSDTYPYVDIREDDVTMGHEATVSKVSENQLFYLM
SRGLTEDEAMAMVVRGFVEPIAKELPMEYALELNRLIELQMEGAVG
>P9WFP5 ~~~~~~UPF0051 protein Rv1462~~~COG0719
MTAPGLTAAVEGIAHNKGELFASFDVDAFEVPHGRDEIWRFTPLRRLRGLHDGSARATGSATITVSERPGVYTQTVRRGD
PRLGEGGVPTDRVAAQAFSSFNSATLVTVERDTQVVEPVGITVTGPGEGAVAYGHLQVRIEELGEAVVVIDHRGGGTYAD
NVEFVVDDAARLTAVWIADWADNTVHLSAHHARIGKDAVLRHVTVMLGGDVVRMSAGVRFCGAGGDAELLGLYFADDGQH
LESRLLVDHAHPDCKSNVLYKGALQGDPASSLPDAHTVWVGDVLIRAQATGTDTFEVNRNLVLTDGARADSVPNLEIETG
EIVGAGHASATGRFDDEQLFYLRSRGIPEAQARRLVVRGFFGEIIAKIAVPEVRERLTAAIEHELEITESTEKTTVS
>Q9X1H9 ~~~~~~Fatty acid-binding protein TM_1468~~~COG1307
MKVKILVDSTADVPFSWMEKYDIDSIPLYVVWEDGRSEPDEREPEEIMNFYKRIREAGSVPKTSQPSVEDFKKRYLKYKE
EDYDVVLVLTLSSKLSGTYNSAVLASKEVDIPVYVVDTLLASGAIPLPARVAREMLENGATIEEVLKKLDERMKNKDFKA
IFYVSNFDYLVKGGRVSKFQGFVGNLLKIRVCLHIENGELIPYRKVRGDKKAIEALIEKLREDTPEGSKLRVIGVHADNE
AGVVELLNTLRKSYEVVDEIISPMGKVITTHVGPGTVGFGIEVLERKR
>P9WFJ3 2.1.1.-~~~~~~Putative S-adenosyl-L-methionine-dependent methyltransferase Rv0146~~~COG3315
MRTHDDTWDIKTSVGATAVMVAAARAVETDRPDPLIRDPYARLLVTNAGAGAIWEAMLDPTLVAKAAAIDAETAAIVAYL
RSYQAVRTNFFDTYFASAVAAGIRQVVILASGLDSRAYRLDWPAGTIVYEIDQPKVLSYKSTTLAENGVTPSAGRREVPA
DLRQDWPAALRDAGFDPTARTAWLAEGLLMYLPAEAQDRLFTQVGAVSVAGSRIAAETAPVHGEERRAEMRARFKKVADV
LGIEQTIDVQELVYHDQDRASVADWLTDHGWRARSQRAPDEMRRVGRWVEGVPMADDPTAFAEFVTAERL
>Q97PW7 ~~~~~~UPF0291 protein SP_1473~~~COG4224
MDPKKIARINELAKKKKTEGLTPEEKVEQAKLREEYIEGYRRAVRHHIEGIKIVDEEGNDVTPEKLRQVQREKGLHGRSL
DDPNS
>P44207 ~~~~~~Uncharacterized HTH-type transcriptional regulator HI_1476~~~COG2932
MQYQNQDNFPERIEYLVDKLNGPSEFARKTGVTLSTITRWRKGEADPSRSNLVKIAEVTGVSIEWLATGKIKEEKTTEEK
PAGSLVSRAFERMQAMLEEGVSMIDSYSSINVSAGFGSFNEGITQPDGQEPYSDELLTSLGVKADNCAVFWANGNSMLPT
INNYDQMLVDLSRKEIQGDRIYLVQNGESVWVKRVKMEWDGISLISDNKEEYPPISITGENAQNLQIIGQVVHIGHSLI
>P44209 ~~~~~~Uncharacterized protein HI_1480~~~
MSETDLLLKMVRQPVKLYSVATLFHEFSEVITKLEHSVQKEPTSLLSEENWHKQFLKFAQALPAHGSASWLNLDDALQAV
VGNSRSAFLHQLIAKLKSRHLQVLELNKIGSEPLDLSNLPAPFYVLLPESFAARITLLVQDKALPYVRVSFEYWHA
>P9WLX5 ~~~~~~Uncharacterized protein Rv1480~~~COG1721
MTESKAPAVVHPPSMLRGDIDDPKLAAALRTLELTVKQKLDGVLHGDHLGLIPGPGSEPGESRLYQPGDDVRRMDWAVTA
RTTHPHVRQMIADRELETWLVVDMSASLDFGTACCEKRDLAVAAAAAITFLNSGGGNRLGALIANGAAMTRVPARTGRQH
QHTMLRTIATMPQAPAGVRGDLAVAIDALRRPERRRGMAVIISDFLGPINWMRPLRAIAARHEVLAIEVLDPRDVELPDV
GDVVLQDAESGVVREFSIDPALRDDFARAAAAHRADVARTIRGCGAPLLSLRTDRDWLADIVRFVASRRRGALAGHQ
>P9WFJ7 ~~~~~~UPF0353 protein Rv1481~~~COG2304
MTLPLLGPMTLSGFAHSWFFLFLFVVAGLVALYILMQLARQRRMLRFANMELLESVAPKRPSRWRHVPAILLVLSLLLFT
IAMAGPTHDVRIPRNRAVVMLVIDVSQSMRATDVEPSRMVAAQEAAKQFADELTPGINLGLIAYAGTATVLVSPTTNREA
TKNALDKLQFADRTATGEAIFTALQAIATVGAVIGGGDTPPPARIVLFSDGKETMPTNPDNPKGAYTAARTAKDQGVPIS
TISFGTPYGFVEINDQRQPVPVDDETMKKVAQLSGGNSYNAATLAELRAVYSSLQQQIGYETIKGDASVGWLRLGALALA
LAALAALLINRRLPT
>P44210 ~~~~~~Uncharacterized protein HI_1482~~~
MQKVYNKMAGEMMSPRNAVIHNQLAMLELATLECEALGIEVETVEWFDIGKPRLVVKDCSALRHLIKTGKAFNYGSEVKN
GIRIYLNQMMVKGVKFIWKSDVTKH
>P74615 ~~~~~~Protein sll1483~~~COG2335
MKTAARIVAFTALTGFALGMPTVAMAEMETTEKSAVVSQAATDSAMTIVEVAAGNETFSTLVAAVKAADLVEALSAEGPF
TVFAPTNDAFAALPAGTVESLLLPENKDKLVKILTYHVVPGKITAAQVQSGEVASLAGEALTFKVKDGKVKVNKATVISA
DVDASNGVIHVIDQVILPPM
>P9WLX3 ~~~~~~Uncharacterized protein Rv1486c~~~
MWCPSVSLSIWANAWLAGKAAPDDVLDALSLWAPTQSVAAYDAVAAGHTGLPWPDVHDAGTVSLLQTLRAAVGRRRLRGT
INVVLPVPGDVRGLAAGTQFEHDALAAGEAVIVANPEDPGSAVGLVPEFSYGDVDEAAQSEPLTPELCALSWMVYSLPGA
PVLEHYELGDAEYALRSAVRSAAEALSTIGLGSSDVAKPRGLVEQLLESSRQHRVPDHAPSRALRVLENAAHVDAIIAVS
AGLSRLPIGTQSLSDAQRATDALRPLTAVVRSARMSAVTAILHSAWPD
>P9WPR9 ~~~~~~Uncharacterized protein Rv1488~~~COG0330
MQGAVAGLVFLAVLVIFAIIVVAKSVALIPQAEAAVIERLGRYSRTVSGQLTLLVPFIDRVRARVDLRERVVSFPPQPVI
TEDNLTLNIDTVVYFQVTVPQAAVYEISNYIVGVEQLTTTTLRNVVGGMTLEQTLTSRDQINAQLRGVLDEATGRWGLRV
ARVELRSIDPPPSIQASMEKQMKADREKRAMILTAEGTREAAIKQAEGQKQAQILAAEGAKQAAILAAEADRQSRMLRAQ
GERAAAYLQAQGQAKAIEKTFAAIKAGRPTPEMLAYQYLQTLPEMARGDANKVWVVPSDFNAALQGFTRLLGKPGEDGVF
RFEPSPVEDQPKHAADGDDAEVAGWFSTDTDPSIARAVATAEAIARKPVEGSLGTPPRLTQ
>P67372 ~~~~~~DegV domain-containing protein SPy_1493/M5005_Spy1226~~~
MGTIKIVTDSSITIEPELIKALDITVVPLSVMIDSKLYSDNDLKEEGHFLSLMKASKSLPKTSQPPVGLFAETYENLVKK
GVTDIVAIHLSPALSGTIEASRQGAEIAEAPVTVLDSGFTDQAMKFQVVEAAKMAKAGASLNEILAAVQAIKSKTELYIG
VSTLENLVKGGRIGRVTGVLSSLLNVKVVMALKNDELKTLVKGRGNKTFTKWLDSYLAKNSHRPIAEIAISYAGEASLAL
TLKERIAAYYNHSISVLETGSIIQTHTGEGAFAVMVRYE
>O67466 3.1.-.-~~~~~~Putative esterase aq_1494~~~COG0824
MPFIYRRRVQFYETDAQGIVHHSNYFRYFEEARGEFLRSKGFPYSKMRDMGLEVVLLNAYCEYKKPLFYDDVFEVHLNLE
ELSRFTFTFSYIVFKEDIAVAKANTKHCMVKNGKIVSIPKEVLEVLKD
>P9WPZ1 3.6.-.-~~~~~~Probable GTPase Rv1496~~~COG1703
MAMMAASHDDDTVDGLATAVRGGDRAALPRAITLVESTRPDHREQAQQLLLRLLPDSGNAHRVGITGVPGVGKSTAIEAL
GMHLIERGHRVAVLAVDPSSTRTGGSILGDKTRMARLAVHPNAYIRPSPTSGTLGGVTRATRETVVLLEAAGFDVILIET
VGVGQSEVAVANMVDTFVLLTLARTGDQLQGIKKGVLELADIVVVNKADGEHHKEARLAARELSAAIRLIYPREALWRPP
VLTMSAVEGRGLAELWDTVERHRQVLTGAGEFDARRRDQQVDWTWQLVRDAVLDRVWSNPTVRKVRSELERRVRAGELTP
ALAAQQILEIANLTDR
>Q7DDB6 ~~~~~~Probable TonB-dependent receptor NMB1497~~~
MRSSFRLKPICFYLMGVTLYHYSYAEDAGRAGSEAQIQVLEDVHVKAKRVPKDKKVFTDARAVSTRQDIFKSSENLDNIV
RSIPGAFTQQDKSSGIVSLNIRGDSGFGRVNTMVDGITQTFYSTSTDAGRAGGSSQFGASVDSNFIAGLDVVKGSFSGSA
GINSLAGSANLRTLGVDDVVQGNNTYGLLLKGLTGTNSTKGNAMAAIGARKWLESGASVGVLYGHSRRSVAQNYRVGGGG
QHIGNFGAEYLERRKQRYFVQEGALKFNSDSGKWERDLQRQQWKYKPYKNYNNQELQKYIEEHDKSWRENLAPQYDITPI
DPSSLKQQSAGNLFKLEYDGVFNKYTAQFRDLNTKIGSRKIINRNYQFNYGLSLNPYTNLNLTAAYNSGRQKYPKGSKFT
GWGLLKDFETYNNAKILDLNNTATFRLPRETELQTTLGFNYFHNEYGKNRFPEELGLFFDGPDQDNGLYSYLGRFKGDKG
LLPQKSTIVQPAGSQYFNTFYFDAALKKDIYRLNYSTNTVGYRFGGEYTGYYGSDDEFKRAFGENSPTYKKHCNRSCGIY
EPVLKKYGKKRANNHSVSISADFGDYFMPFASYSRTHRMPNIQEMYFSQIGDSGVHTALKPERANTWQFGFNTYKKGLLK
QDDTLGLKLVGYRSRIDNYIHNVYGKWWDLNGDIPSWVSSTGLAYTIQHRNFKDKVHKHGFELELNYDYGRFFTNLSYAY
QKSTQPTNFSDASESPNNASKEDQLKQGYGLSRVSALPRDYGRLEVGTRWLGNKLTLGGAMRYFGKSIRATAEERYIDGT
NGGNTSNFRQLGKRSIKQTETLARQPLIFDFYAAYEPKKNLIFRAEVKNLFDRRYIDPLDAGNDAATQRYYSSFDPKDKD
EDVTCNADKTLCNGKYGGTSKSVLTNFARGRTFLMTMSYKF
>P44222 ~~~~~~Uncharacterized protein HI_1498~~~
MWLAHSHYTLACESIRSPLCKLPARLGGRTMISEFWEFVRSNFGVISTLIAIFIGAFWLKLDSKYAKKHDLSQLADIARS
HDNRLATLESKVENLPTAVDVERLKTLLTDVKGDTKATSRQVDAMSHQVGLLLEAKLKE
>P9WLW9 2.1.1.-~~~~~~Uncharacterised methyltransferase Rv1498c~~~COG2226
MRCIIKRLFQNILTRSKRGSADGGSAEALPPKSLRQFVGGAYKEVGAEFVGYLVDLCGLQPDEAVLDVGCGSGRMALPLT
GYLNSEGRYAGFDISQKAIAWCQEHITSAHPNFQFEVSDIYNSLYNPKGKYQSLDFRFPYPDASFDVVFLTSVFTHMFPP
DVEHYLDEISRVLKPGGRCLCTYFLLNDESLAHIAEGKSAHNFQHEGPGYRTIHKKRPEEAIGLPETFVRDVYGKFGLAV
HEPLHYGSWSGREPRLSFQDIVIATKTAS
>P9WI91 ~~~~~~Uncharacterized protein Rv1501~~~COG5285
MIPVKVENNTSLDQVQDALNCVGYAVVEDVLDEASLAATRDRMYRVQERILTEIGKERLARAGELGVLRLMMKYDPHFFT
FLEIPEVLSIVDRVLSETAILHLQNGFILPSFPPFSTPDVFQNAFHQDFPRVLSGYIASVNIMFAIDPFTRDTGATLVVP
GSHQRIEKPDHTYLARNAVPVQCAAGSLFVFDSTLWHAAGRNTSGKDRLAINHQFTRSFFKQQIDYVRALGDAVVLEQPA
RTQQLLGWYSRVVTNLDEYYQPPDKRLYRKGQG
>P9WLW7 ~~~~~~Uncharacterized protein Rv1502~~~COG1621
MAWRKLGRIFAPSGELDWSRSHAALPVPEWIEGDIFRIYFSGRDGQNRSSIGSVIVDLAVGGKILDIPAEPILRPGARGM
FDDCGVSIGSIVRAGDTRLLYYTGWNLAVTVPWKNTIGVAISEAGAPFERWSTFPVVALDERDPFSLSYPWVIQDGGTYR
MWYGSNLGWGEGTDEIPHVIRYAQSRDGVHWEKQDRVHIDTSGSDNSAACRPYVVRDAGVYRMWFCARGAKYRIYCATSE
DGLTWRQLGKDEGIDVSPDSWDSDMIEYPCVFDHRGQRFMLYSGDGYGRTGFGLAVLEN
>P60357 ~~~~~~UPF0297 protein lmo1503~~~COG4472
MDSKDQTMFYNFGDDSIEEDVKKLMKQVYVALEEKGYNPVNQIVGYLLSGDPAYIPRHKDARSMIRRLERDEIIEELVKA
YLKNNEIGEK
>P9WLW5 ~~~~~~Uncharacterized protein Rv1507c~~~COG0224
MKKVAIVQSNYIPWRGYFDLIAFVDEFIIYDDMQYTKRDWRNRNRIKTSQGLQWITVPVQVKGRFHQKIRETLIDGTDWA
KAHWRALEFNYSAAAHFAEIADWLAPIYLEEQHTNLSLLNRRLLNAICSYLGISTRLANSWDYELADGKTERLANLCQQA
AATEYVSGPSARSYVDERVFDELSIRVTWFDYDGYRDYKQLWGGFEPAVSILDLLFNVGAEAPDYLRYCRQ
>P9WLW1 ~~~~~~Uncharacterized protein Rv1510~~~COG2244
MYERRHERGMCDRAVEMTDVGATAAPTGPIARGSVARVGAATALAVACVYTVIYLAARDLPPACFSIFAVFWGALGIATG
ATHGLLQETTREVRWVRSTQIVAGHRTHPLRVAGMIGTVAAVVIAGSSPLWSRQLFVEGRWLSVGLLSVGVAGFCAQATL
LGALAGVDRWTQYGSLMVTDAVIRLAVAAAAVVIGWGLAGYLWAATAGAVAWLLMLMASPTARSAASLLTPGGIATFVRG
AAHSITAAGASAILVMGFPVLLKVTSDQLGAKGGAVILAVTLTRAPLLVPLSAMQGNLIAHFVDRRTQRLRALIAPALVV
GGIGAVGMLAAGLTGPWLLRVGFGPDYQTGGALLAWLTAAAVAIAMLTLTGAAAVAAALHRAYLLGWVSATVASTLLLLL
PMPLETRTVIALLFGPTVGIAIHVAALARRPD
>P73954 ~~~~~~Membrane-associated protein slr1513~~~COG0347
MAKPANKLVIVTEKILLKKIAKIIDESGAKGYTVMNTGGKGSRNVRSSGQPNTSDIEANIKFEILTETREMAEEIADRVA
VKYFNDYAGIIYICSAEVLYGHTFCGPEGC
>P9WMX9 2.4.-.-~~~~~~Uncharacterized glycosyltransferase Rv1514c~~~COG1216
MTSAPTVSVITISFNDLDGLQRTVKSVRAQRYRGRIEHIVIDGGSGDDVVAYLSGCEPGFAYWQSEPDGGRYDAMNQGIA
HASGDLLWFLHSADRFSGPDVVAQAVEALSGKGPVSELWGFGMDRLVGLDRVRGPIPFSLRKFLAGKQVVPHQASFFGSS
LVAKIGGYDLDFGIAADQEFILRAALVCEPVTIRCVLCEFDTTGVGSHREPSAVFGDLRRMGDLHRRYPFGGRRISHAYL
RGREFYAYNSRFWENVFTRMSK
>P9WLV5 ~~~~~~Uncharacterized protein Rv1520~~~COG0463
MSIVSISYNQEEYIREALDGFAAQRTEFPVEVIIADDASTDATPRIIGEYAARYPQLFRPILRQTNIGVHANFKDVLSAA
RGEYLALCEGDDYWTDPLKLSKQVKYLDRHPETTVCFHPVRVIYEDGAKDSEFPPLSWRRDLSVDALLARNFIQTNSVVY
RRQPSYDDIPANVMPIDWYLHVRHAVGGEIAMLPETMAVYRRHAHGIWHSAYTDRRKFWETRGHGMAATLEAMLDLVHGH
REREAIVGEVSAWVLREIGKTPGRQGRALLLKSIADHPRMTMLSLQHRWAQTPWRRFKRRLSTELSSLAALAYATRRRAL
EGRDGGYRETTSPPTGRGRNVRGSHA
>P9WN07 2.4.-.-~~~~~~Uncharacterized glycosyltransferase Rv1524~~~COG1819
MKFVVASYGTRGDIEPCAAVGLELQRRGHDVCLAVPPNLIGFVETAGLSAVAYGSRDSQEQLDEQFLHNAWKLQNPIKLL
REAMAPVTEGWAELSAMLTPVAAGADLLLTGQIYQEVVANVAEHHGIPLAALHFYPVRANGEIAFPARLPAPLVRSTITA
IDWLYWRMTKGVEDAQRRELGLPKASTPAPRRMAVRGSLEIQAYDALCFPGLAAEWGGRRPFVGALTMESATDADDEVAS
WIAADTPPIYFGFGSMPIGSLADRVAMISAACAELGERALICSGPSDATGIPQFDHVKVVRVVSHAAVFPTCRAVVHHGG
AGTTAAGLRAGIPTLILWVTSDQPIWAAQIKQLKVGRGRRFSSATKESLIADLRTILAPDYVTRAREIASRMTKPAASVT
ATADLLEDAARRAR
>P71391 ~~~~~~Putative binding protein HI_1525~~~COG0725
MKKLVAVTSMILTTFSVQAADLYLYAGAGLKEPVEKIIHQYEQETGNKVTVEYGGSGQILARYNTVKSGDLFLAGSEDYV
TKLQKTNDVNNIGTIVLHVPVMAIRKDKISGIDSFKALAESSLRLGIGDSKAMALGKGAEKMFELSGYQKQLNDKIVVKA
ATVKQLMLYLLNGDVDAAVVGRSGAWKVRDKVELLPSPKGTPEEKVTIGLLFSSKYPKEAQQLFDFFKSPQGVKYFTDEG
FLPAK
>P9WLV3 ~~~wbbL2~~~Uncharacterized protein Rv1525~~~COG1216
MYAPLVSLMITVPVFGQHEYTHALVADLEREGADYLIVDNRGDYPRIGTERVSTPGENLGWAGGSELGFRLAFAEGYSHA
MTLNNDTRVSKGFVAALLDSRLPADAGMVGPMFDVGFPFAVADEKPDAESYVPRARYRKVPAVEGTALVMSRDCWDAVGG
MDLSTFGRYGWGLDLDLALRARKSGYGLYTTEMAYINHFGRKTANTHFGGHRYHWGASAAMIRGLRRTHGWPAAMGILRE
MGMAHHRKWHKSFPLTCPASC
>P99149 ~~~~~~UPF0173 metal-dependent hydrolase SA1529~~~
MKLSFHGQSTIYLEGNNKKVIVDPFISNNPKCDLNIETVQVDYIVLTHGHFDHFGDVVELAKKTGATVIGSAEMADYLSS
YHGVENVHGMNIGGKANFDFGSVKFVQAFHSSSFTHENGIPVYLGMPMGIVFEVEGKTIYHTGDTGLFSDMSLIAKRHPV
DVCFVPIGDNFTMGIDDASYAINEFIKPKISVPIHYDTFPLIEQDPQQFKDAVNVGDVQILKPGESVQF
>Q6NDF6 ~~~~~~S-adenosyl-L-methionine-binding protein RPA0152~~~COG1720
MDATDDIRAGELASDWSGSPDAGVVFIGRIHTPWNRLKECPRHGRADGPVCRIEVFETWLPALAGIDDGTLLEVFYWLHR
SRRDLLLQCPRNDGDARGTFSIRSPLRPNPIGTSIARVDRRDGANLFIRGLDCLDGTPLVDLKPDRAEFMPLAPPKPGDF
QVGEPRR
>Q7A552 3.4.-.-~~~~~~Uncharacterized peptidase SA1530~~~
MTKISKIIDELNNQQADAAWITTPLNVYYFTGYRSEPHERLFALLIKKDGKQVLFCPKMEVEEVKASPFTGEIVGYLDTE
NPFSLYPQTINKLLIESEHLTVARQKQLISGFNVNSFGDVDLTIKQLRNIKSEDEISKIRKAAELADKCIEIGVSYLKEG
VTEREVVNHIEQTIKQYGVNEMSFDTMVLFGDHAASPHGTPGDRRLKSNEYVLFDLGVIYEHYCSDMTRTIKFGEPSKEA
QEIYNIVLEAETSAIQAIKPGIPLKDIDHIARNIISEKGYGEYFPHRLGHGLGLQEHEYQDVSSTNSNLLEAGMVITIEP
GIYVPGVAGVRIEDDILVTNEGYEVLTHYEK
>P64665 ~~~~~~Uncharacterized protein HP_1531~~~
MFEKIRKILADIEDSQNEIEMLLKLANLSLGDFIEIKRGSMDMPKGVNEAFFTQLSEEVERLKELINALNKIKKGLLVF
>Q7A551 ~~~~~~Putative universal stress protein SA1532~~~
MITYKNILIAVDGSHEAEWAFNRAVGVAKRNDAKLTIVNVIDSRTYSSYEVYDAQFTEKSKHFAEELLNGYKEVATNAGV
KDVETRLEFGSPKSIIPKKLAHEINADLIMSGTSGLNAVERFIVGSVSESIVRHAPCDVLVVRTEELPADFQPQVATTQL
REKYQN
>O06179 1.13.12.-~~~~~~Putative monooxygenase Rv1533~~~COG2070
MRTRVAELLGAEFPICAFSHCRDVVAAVSNAGGFGILGAVAHSPKRLESELTWIEEHTGGKPYGVDVLLPPKYIGAEQGG
IDAQQARELIPEGHRTFVDDLLVRYGIPAVTDRQRSSSAGGLHISPKGYQPLLDVAFAHDIRLIASALGPPPPDLVERAH
NHDVLVAALAGTAQHARRHAAAGVDLIVAQGTEAGGHTGEVATMVLVPEVVDAVSPTPVLAAGGIARGRQIAAALALGAE
GVWCGSVWLTTEEAETPPVVKDKFLAATSSDTVRSRSLTGKPARMLRTAWTDEWDRPDSPDPLGMPLQSALVSDPQLRIN
QAAGQPGAKARELATYFVGQVVGSLDRVRSARSVVLDMVEEFIDTVGQLQGLVQR
>Q5XA93 ~~~~~~Uncharacterized protein Spy1535~~~
MTTEYIGEIVISPRVLEVITGIATTQVEGVHSLHNKKMADSFNKASLGKGVYLQTEEDGSVTADIYVYLQYGVKVPTVSM
NIQKTVKSAVYDMAEVPISAVNIHVEGIVAEKTPKPDLKSLFDEDFLDD
>Q97PR7 ~~~~~~UPF0213 protein SP_1535~~~COG2827
MDHKAYMYVLECRDGSYYIGYTTDMRRRLAIHNSGKGAKYTRARLPVKLIYAQGFASKEEAMSAEALLKRKKRPQKEEFL
SENQDRNLLSYFEESWGVL
>P9WFJ1 2.1.1.-~~~~~~Putative S-adenosyl-L-methionine-dependent methyltransferase Rv0145~~~COG3315
MSSLPSSRRTAGDTWAITESVGATALGVAAARAVETAATNPLIRDEFAKVLVSSAGTAWARLADADLAWLDGDQLGRRVH
RVACDYQAVRTHFFDEYFGAAVDAGVRQVVILAAGLDARAYRLNWPAGTVVYEIDQPSVLEYKAGILQSHGAVPTARRHA
VAVDLRDDWPAALIAAGFDGTQPTAWLAEGLLPYLPGDAADRLFDMVTALSAPGSQVAVEAFTMNTKGNTQRWNRMRERL
GLDIDVQALTYHEPDRSDAAQWLATHGWQVHSVSNREEMARLGRAIPQDLVDETVRTTLLRGRLVTPAQPA
>P9WHQ3 5.4.99.-~~~~~~Uncharacterized RNA pseudouridine synthase Rv1540~~~COG0564
MADRSMPVPDGLAGMRVDTGLARLLGLSRTAAAALAEEGAVELNGVPAGKSDRLVSGALLQVRLPEAPAPLQNTPIDIEG
MTILYSDDDIVAVDKPAAVAAHASVGWTGPTVLGGLAAAGYRITTSGVHERQGIVHRLDVGTSGVMVVAISERAYTVLKR
AFKYRTVDKRYHALVQGHPDPSSGTIDAPIGRHRGHEWKFAITKNGRHSLTHYDTLEAFVAASLLDVHLETGRTHQIRVH
FAALHHPCCGDLVYGADPKLAKRLGLDRQWLHARSLAFAHPADGRRVEIVSPYPADLQHALKILRGEG
>Q81SV3 ~~~~~~UPF0302 protein BA_1542/GBAA_1542/BAS1430~~~COG5582
MNTPVSVNEKKDFVKWFLNNYQLKQRECVWILNYLMSHDQLMHKVHFVEHAKYCPRGLVMSANCVKDTPFHFFKQNVMTT
DAEKSFHDIRLNRDEDIYIQLNFKSSFQNANYVAVLEENPYLPKHIEVNEKDRLLAERFLEESVFSFRRERLLKQIDEAL
DKQDKEAFHRLTAELKML
>P9WGS1 1.-.-.-~~~~~~Uncharacterized oxidoreductase Rv1543~~~COG1028
MNLGDLTNFVEKPLAAVSNIVNTPNSAGRYRPFYLRNLLDAVQGRNLNDAVKGKVVLITGGSSGIGAAAAKKIAEAGGTV
VLVARTLENLENVANDIRAIRGNGGTAHVYPCDLSDMDAIAVMADQVLGDLGGVDILINNAGRSIRRSLELSYDRIHDYQ
RTMQLNYLGAVQLILKFIPGMRERHFGHIVNVSSVGVQTRAPRFGAYIASKAALDSLCDALQAETVHDNVRFTTVHMALV
RTPMISPTTIYDKFPTLTPDQAAGVITDAIVHRPRRASSPFGQFAAVADAVNPAVMDRVRNRAFNMFGDSSAAKGSESQT
DTSELDKRSETFVRATRGIHW
>P9WLU7 ~~~~~~Uncharacterized protein Rv1546~~~COG3427
MASVELSADVPISPQDTWDHVSELSELGEWLVIHEGWRSELPDQLGEGVQIVGVARAMGMRNRVTWRVTKWDPPHEVAMT
GSGKGGTKYGVTLTVRPTKGGSALGLRLELGGRALFGPLGSAAARAVKGDVEKSLKQFAELYG
>P44251 ~~~~~~Uncharacterized protein HI_1552~~~COG2830
MKTKFYDYQGEHLILYFAGWGTPPDAVNHLILPENHDLLICYDYQDLNLDFDLSAYRHIRLVAWSMGVWVAERVLQGIRL
KSATAVNGTGLPCDDSFGIPYAIFKGTLENLTENTRLKFERRICGDKASFERYQLFPARPFDEIHQELTALFAMIQQDKR
IDLIHWANAWVSSRDKIFTPANQHQYWALRCAVQEIEGEHYVFSRFTHWSALWDH
>P45250 1.-.-.-~~~~~~Putative 2-hydroxyacid dehydrogenase HI_1556~~~COG1052
MKIVFLDSTAIPKHISIPRPSFEHTWTEYEHTSAEQTIERVKDADIVITSKVIFDRETLQQLPKLKLIAITATGTNNVDL
VAAEEMGIAVRNVTGYSSTTVPEHVIGLIFSLKHSLAGWLRDQTEAKWAESKQFCYFDYPITDVRGSTLGVFGKGCLGTE
VGRLANAVGMKVLYAEHKDATVCREGYTPFDEVLKQADIVTLHCPLTETTKDLINAETLSKMKKGAFLINTGRGPLIDEL
ALVDALKTGHLGGAALDVMVKEPPEKDNPLILAAKTMPNLIITPHIAWASDSAVTTLVGKVMQNIEEFVQQLHQK
>P9WMD1 ~~~~~~Uncharacterized HTH-type transcriptional regulator Rv1556~~~COG1309
MVGAVTQIADRPTDPSPWSPRETELLAVTLRLLQEHGYDRLTVDAVAASARASKATVYRRWPSKAELVLAAFIEGIRQVA
VPPNTGNLRDDLLRLGELICREVGQHASTIRAVLVEVSRNPALNDVLQHQFVDHRKALIQYILQQAVDRGEISSAAISDE
LWDLLPGYLIFRSIIPNRPPTQDTVQALVDDVILPSLTRSTG
>A0A089QKZ7 ~~~~~~Uncharacterized protein Rv1155A~~~
MGESKSPQESSSEGETKRKFREALDRKMAQSSSGSDHKDGGGKQSRAHGPVASRREFRRKSG
>Q7A531 ~~~~~~UPF0478 protein SA1560~~~
MDWILPIAGIIAAIAFLILCIGIVAVLNSVKKNLDYVAKTLDGVEGQVQGITRETTDLLHKVNRLTEDIQGKVDRLNSVV
DAVKGIGDSVQTLNSSVDRVTNSITHNISQNEDKISQVVQWSNVAMEIADKWQNRHYRRGSANYKANNVATDANHSYTSR
VDK
>P44254 ~~~~~~Uncharacterized protein HI_1562~~~
MLSKDPKVLIKLGELEKDKSKAKKYFGDACDLRSQEGCDKYRELNQKQDTNK
>Q7A528 ~~~~~~UPF0354 protein SA1564~~~
MNTFQMRDKLKERLSHLDVDFKFNREEETLRIYRTDNNKGITIKLNAIVAKYEDKKEKIVDEIVYYVDEAIAQMADKTLE
SISSSQIMPVIRATSFDKKTKQGVPFIYDEHTAETAVYYAVDLGKSYRLIDESMLEDLKLTEQQIREMSLFNVRKLSNSY
TTDEVKGNIFYFINSNDGYDASRILNTAFLNEIEAQCQGEMLVAVPHQDVLIIADIRNKTGYDVMAHLTMEFFTKGLVPI
TSLSFGYKQGHLEPIFILGKNNKQKRDPNVIQRLEANRRKFNKDK
>P74596 ~~~~~~Uncharacterized protein slr1565~~~COG0316
MLQLTPSAAQEIKRLQHSRQLTRHHFRLAVRPGGCAGWLYHLDFVPEITADDLEYESGGVTVLVDSQSAGYLHNLKLDYA
EDLMGGGFRFTNPNAAQVCSCSLSFAPNLEKNL
>Q83BE4 ~~~~~~Probable transcriptional regulatory protein CBU_1566~~~COG0217
MAGHSKWANIKHAKARQDAKRGKVFTKLIREITVAARLGGEDIDSNPRLRAVVDKAFAANMPKDTITRAIKRGAGSGAGD
NLVEVRYEGYGPSGVAVMVDCLTDNKNRTVAEVRHAFSKCDGNLGTEGSVAYLFKQRGLITFPPNSDEEKIMEIALEVGA
EDVTTNDDGSIDVTTLPEDFEKIRNAMKAADLNPSHAEVTVLASTEVGLDKDSAEQMLRLTEMLEDLDDVQNVYSNADYP
EEVL
>Q9X1Q6 ~~~~~~Uncharacterized protein TM_1570~~~COG4752
MLEKVYVALIHYPIKGKDGSIISTAVTNLDVHDIARTARTYNLKGYYIVTNLRAQQDMVSKMLKFWREGFGSRYNPSRAE
SLKLVKLKSYLEDVLEDIESVEGERPLIFFTSAKKRENDISFEEGRRIIIETEKPVLILLGTGWGLPDEILEISDYVLEP
IRAQSDFNHLSVRAAAAIIIDRLIGENYARRD
>O67517 ~~~~~~Probable transcriptional regulatory protein aq_1575~~~COG0217
MAGHSHWAQIKHKKAKVDAQRGKLFSKLIREIIVATRLGGPNPEFNPRLRTAIEQAKKANMPWENIERAIKKGAGELEGE
QFEEVIYEGYAPGGVAVMVLATTDNRNRTTSEVRHVFTKHGGNLGASGCVSYLFERKGYIEVPAKEVSEEELLEKAIEVG
AEDVQPGEEVHIIYTVPEELYEVKENLEKLGVPIEKAQITWKPISTVQINDEETAQKVIKLLNALEELDDVQQVIANFEI
PEEILQKVG
>Q8YFD6 5.1.1.8~~~~~~Protein BMEI1586~~~COG3938
MRSTKVIHIVGCHAEGEVGDVIVGGVAPPPGETVWEQSRFIANDETLRNFVLNKPRGGVFRHVNLLVPPKDPRAQMGFII
MEPADTPPMSGSNSICVSTVLLDSGIIAMQEPVTHMVLEAPGGIIEVEAECRNGKAERISVRNVPSFADRLDAPLDVTGL
GTIMVDTAYGGDSFVIVDAAQIGMKIEPGQARELAEIGVKITKAANEQLGFRHPERDWRHISFCQITEPVTREGDVLTGV
NTVAIRPAKFDRSPTGTGCSARMAVLHAKGQMKAGERFIGKSVLGTEFHCRLDKVLELGGKPAISPIISGRAWVTGTSQL
MLDPSDPFPHGYRLSDTWPRDE
>P9WLT7 ~~~~~~Uncharacterized protein Rv1590~~~
MVEIVAGKQRAPVAAGVYNVYTGELADTATPTAARMGLEPPRFCAQCGRRMVVQVRPDGWWARCSRHGQVDSADLATQR
>P9WLT5 ~~~~~~Uncharacterized protein Rv1591~~~
MLGLSATGVLVGGLWAWIAPPIHAVVAITRAGERVHEYLGSESQNFFIAPFMLLGLLSVLAVVASALMWQWREHRGPQMV
AGLSIGLTTAAAIAAGVGALVVRLRYGALDFDTVPLSRGDHALTYVTQAPPVFFARRPLQIALTLMWPAGIASLVYALLA
AGTARDDLGGYPAVDPSSNARTEALETPQAPVS
>P9WK89 ~~~~~~Probable inactive lipase Rv1592c~~~COG1073
MVEPGNLAGATGAEWIGRPPHEELQRKVRPLLPSDDPFYFPPAGYQHAVPGTVLRSRDVELAFMGLIPQPVTATQLLYRT
TNMYGNPEATVTTVIVPAELAPGQTCPLLSYQCAIDAMSSRCFPSYALRRRAKALGSLTQMELLMISAALAEGWAVSVPD
HEGPKGLWGSPYEPGYRVLDGIRAALNSERVGLSPATPIGLWGYSGGGLASAWAAEACGEYAPDLDIVGAVLGSPVGDLG
HTFRRLNGTLLAGLPALVVAALQHSYPGLARVIKEHANDEGRQLLEQLTEMTTVDAVIRMAGRDMGDFLDEPLEDILSTP
EISHVFGDTKLGSAVPTPPVLIVQAVHDYLIDVSDIDALADSYTAGGANVTYHRDLFSEHVSLHPLSAPMTLRWLTDRFA
GKPLTDHRVRTTWPTIFNPMTYAGMARLAVIAAKVITGRKLSRRPL
>Q5XA33 ~~~~~~Uncharacterized protein Spy1595~~~
MVLFLIRIFSDSDKEENMGIEKTVSELADILGVSRQAVNNRVKSLPEEDLDKNEKGVTVVKRSGLVKLEEIYKKTIFDDE
PISEETKQRELLEILVDEKNTEITRLYEQLKAKDAQLASKDEQMRVKDVQIAEKDKQLDQQQQLTAKAMADKETLKLELE
EAKAEANQARLQVEEVQAEVGPKKGFLTRLFAK
>P44271 ~~~~~~UPF0111 protein HI_1603~~~COG1392
MAMNNILGLFAHSPLKPLQKHSEKVTECSDLLIPFFQTTFSKNWEQAEEKRLEISQCEREADSLKREIRLKLPRGLFLPI
DRTDLLELVTQQDKLANYAKDIAGRMIGRQFGIPEEMQEEFLHYVKRSLDAIHQAHRVIEEMDKLLETGFKGRELKLVND
MIQELDSIEDDTDQMQIKLRKMLYTIESRYNPIDVMFLYKIIEWVGVLADQAQRVGSRIELMLARS
>A0QSU4 1.-.-.-~~~~~~Uncharacterized oxidoreductase MSMEG_1603/MSMEI_1564~~~COG0516
MVEIGMGRTARRTYELEDVTIVPSRRTRSSKDVSTAWQLDAYRFEIPVIAHPTDALVSPEFAIEMGRLGGLGVLNGEGLI
GRHADVEEKIAQVVEVAAKEPEPSAAIRLLQQLHAAPLDPDLLGAAVARIREAGVTTAVRVSPQNAQALTPTLVAAGIDL
LVIQGTIISAERVASDGEPLNLKTFISELDVPVVAGGVLDHRTALHLMRTGAAGVIVGYGSTSGVTTSDEVLGISVPMAT
AIADAAAARREYLDETGGRYVHVLADGDIHSSGDLAKAIACGADAVVLGTPLATSAEALGNGWFWPAAAAHPSLPRGALL
QVALGERPSLEQVLTGPSDDPFGSLNLVGGLRRSMAKAGYCDLKEFQKVGLTVGS
>Q5XA14 ~~~~~~Uncharacterized protein Spy1614~~~
MTVKINTKDGLIELSDDVIATVVGGSATEIFGVVGMASKSAIKDNFQSLLRKENYAKGVVVKSTDLGISVDVYTVMSYGV
KISEVSKNIQERVKFNLESQLGLTADMVNVYVQNIKVVGEN
>Q9I3A4 3.1.2.-~~~~~~Putative esterase PA1618~~~
MSLWRQTPDLEQLNASQKNSIGDLLGIRFEAFDDESLTASMPVDSRTHQPFGLLHGGASVVLAESLGSMASYLCVDTSQY
YCVGLEVNANHLRGLRSGRVTAVARAIHLGRTTHVWDIRLSGDDGKPSCIARLTMAVVPLAGRAG
>P44276 ~~~~~~Uncharacterized protein HI_1624~~~COG5266
MELKKIAVGLTALLGMSVANAHNVWLEPASSQDEYVVKFGHEQTETYPESKLKSIQALNSQGKLTAVDYQFRNGEAYLMP
KSDLVFVHFDNGVWSKLPSGKYVEKTKREEPTAEFSTNPVKFGKAILKWDAESFKSHQQAYELIPQEKAQANKPLSILVL
HNGKPVQGIKVGVSEDAPFNLTNEKGIAQFTPTKGFNKVWAEFEEKVTNNADYDRRTVEYMLTFDAQ
>Q9I398 ~~~~~~Uncharacterized protein PA1624~~~
MRGFLLLSLGVFSFSALAADLPGSHDLDILPRFPRAEIVDFRQAPSEERIYPLGAISRISGRLRMEGEVRAEGELTALTY
RLPPEHSSQEAFAAARTALLKADATPLFWCERRDCGSSSLLANAVFGNAKLYGPDEQQAYLLVRLAAPQENSLVAVYSIT
RGNRRAYLQAEELKADAPLAELLPSPATLLRLLKANGELTLSHVPAEPAGSWLELLVRTLRLDTGVRVELSGKHAQEWRD
ALRGQGVLNSRMELGQSEVEGLHLNWLR
>O67549 ~~~~~~Uncharacterized protein aq_1627~~~
MPAIFTHEGKVEGVPGNYPLTAENLFRIGLALCTLWILDKEIEEPTLSIPETNFVTLALSVGFMNAGGSVNVGKGGDIKL
FLQKGEIYVLEFQPLSETDIKKLESILFGRAPIPKKTGEDIGSFKC
>O24970 ~~~~~~Probable transcriptional regulatory protein HP_0162~~~COG0217
MGRAFEYRRAAKEKRWDKMSKVFPKLAKAITLAAKDGGSEPDTNAKLRTAILNAKAQNMPKDNIDAAIKRASSKEGNLSE
ITYEGKANFGVLIIMECMTDNPTRTIANLKSYFNKTQGASIVPNGSLEFMFNRKSVFECLKNEVENLKLSLEDLEFALID
YGLEELEEVEDKIIIRGDYNSFKLLNEGFESLKLPILKASLQRIATTPIELNDEQMELTEKLLDRIEDDDDVVALYTNIE
>P9WJX3 ~~~~~~Probable multidrug-efflux transporter Rv1634~~~COG2814
MTETASETGSWRELLSRYLGTSIVLAGGVALYATNEFLTISLLPSTIADIGGSRLYAWVTTLYLVGSVVAATTVNTMLLR
VGARSSYLMGLAVFGLASLVCAAAPSMQILVAGRTLQGIAGGLLAGLGYALINSTLPKSLWTRGSALVSAMWGVATLIGP
ATGGLFAQLGLWRWAFGVMTLLTALMAMLVPVALGAGGVGPGGETPVGSTHKVPVWSLLLMGAAALAISVAALPNYLVQT
AGLLAAAALLVAVFVVVDWRIHAAVLPPSVFGSGPLKWIYLTMSVQMIAAMVDTYVPLFGQRLGHLTPVAAGFLGAALAV
GWTVGEVASASLNSARVIGHVVAAAPLVMASGLALGAVTQRADAPVGIIALWALALLIIGTGIGIAWPHLTVRAMDSVAD
PAESSAAAAAINVVQLISGAFGAGLAGVVVNTAKGGEVAAARGLYMAFTVLAAAGVIASYQATHRDRRLPR
>P9WFC9 ~~~~~~Universal stress protein Rv1636~~~COG0589
MSAYKTVVVGTDGSDSSMRAVDRAAQIAGADAKLIIASAYLPQHEDARAADILKDESYKVTGTAPIYEILHDAKERAHNA
GAKNVEERPIVGAPVDALVNLADEEKADLLVVGNVGLSTIAGRLLGSVPANVSRRAKVDVLIVHTT
>Q9JYD0 ~~~~~~Uncharacterized membrane protein NMB1645~~~
MLNPSRKLVELVRILDEGGFIFSGDPVQATEALRRVDGSTEEKIIRRAEMIDRNRMLRETLERVRAGSFWLWVVAATFAF
FTGFSVTYLLMDNQGLNFFLVLAGVLGMNTLMLAVWLAMLFLRVKVGRFFSSPATWFRGKDPVNQAVLRLYADEWRQPSV
RWKIGATSHSLWLCTLLGMLVSVLLLLLVRQYTFNWESTLLSNAASVRAVEMLAWLPSKLGFPVPDARAVIEGRLNGNIA
DARAWSGLLVGSIACYGILPRLLAWVVCKILLKTSENGLDLEKPYYQAVIRRWQNKITDADTRRETVSAVSPKIILNDAP
KWAVMLETEWQDGEWFEGRLAQEWLDKGVATNREQVAALETELKQKPAQLLIGVRAQTVPDRGVLRQIVRLSEAAQGGAV
VQLLAEQGLSDDLSEKLEHWRNALAECGAAWLEPDRAAQEGRLKDQ
>Q8NQ03 ~~~~~~Uncharacterized protein Cgl1651/cg1859~~~
MKLFSRTSLVALGTAAAITLSGVTAPAFADEDSNAAVSALKTAEDNTPEAPGASTPLKLEQPGTITGVPGKAITPVTVKV
VAGEAESFTSDNLPSGLLIDNTGKITGTPKKEFTGSAKIIAKNEAGVEAEVYVNFDFNEEPSSEEPSSGSSDTDNIENWI
KIITAVIGALTTILTFSTKLDSFLK
>Q8UEU8 ~~~~~~Uncharacterized protein Atu1656~~~COG0823
MRQSTLHTRLSTGPGGSMRSSIEIFNIRTRKMRVVWQTPELFEAPNWSPDGKYLLLNSEGLLYRLSLAGDPSPEKVDTGF
ATICNNDHGISPDGALYAISDKVEFGKSAIYLLPSTGGTPRLMTKNLPSYWHGWSPDGKSFTYCGIRDQVFDIYSMDIDS
GVETRLTHGEGRNDGPDYSPDGRWIYFNSSRTGQMQIWRVRVDGSSVERITDSAYGDWFPHPSPSGDKVVFVSYDADVFD
HPRDLDVRVQLMDMDGGNVETLFDLFGGQGTMNSPNWSPDGDEFAYVRYFPVE
>Q7A4V3 ~~~~~~UPF0342 protein SA1663~~~
MAVNLYDYANQLEQALRESEEYKAIKEAFANVKANEESKKLFDEFRETQINFQQKQMQGEEIAEEDLQKAQEQAQAIEKD
ENISALMNAEQKMSQVFQEINQIIVKPLDEIYAD
>Q7A4V2 ~~~~~~UPF0754 membrane protein SA1664~~~
MNALFIIIFMIVVGAIIGGITNVIAIRMLFHPFKPYYIFKFRVPFTPGLIPKRREEIATKIGQVIEEHLLTETLINEKLK
SEQSQQAIESMIQQQLQKLTKDQLSIKQITSQIDIDLEQVLQTNGNQYIESQLNNYYTKHQNQTIASLLPNQLVTFLDQH
VDNATDLLCDRARNYLSSAKGTQDINDMLDTFFHEKGKLIGMLQMFMTKESIADRIQQELIRLTSHPKARTIVTSLITNE
YQTFKDKPLNELLDASQFNEIAENLSVYVTTYASNQANKPVVTLMPQFVDYLEGQLSSKLANLIIEKLSIHLSTIMKKVD
LRGLIEEQINTFDLDYIEKLIIEIANKELKLIMSLGFILGGIIGFFQGLVAIFV
>P44288 ~~~~~~Uncharacterized protein HI_1672~~~COG3008
MTEKNNSSSIEEKYQERTANLRKTKRISPFWLLPFIALCIGAILFFQIVKERGTSITITFTNGSGIVADKTQIRYQGLQI
GIVKEVHFTDNLQKVEVVANINPEASSILRENTKFWLVQPNVSLAGISGLDSLVSGNYITLQPGDGDREDEFIAEEQGPI
AQVSAGDLLIHLISDDLGSISIGASVYFKKLPVGKIYDYRINKNNKVEIDVVIDKAYAKFVKKDSRFWNISGINANISPS
GLNLNVESLNAVVQGAVSFDSPADSPKADENSHFTLYTNLKAAKRGIEIKVTIPASSALIAGQTEVYSQDNAIGILAKLS
AVENNDEILEGSLLIDPNQASLFKANSKIVLRNKKIDLGNLAEPKKFFRGEYFDVIAGDGETKHQFNVIKENELLLNAPN
TLVLTLTAPENYGVSEGQNVFYNNMIIGQIVSQTIDVNGVQFKAAIASEYRNLIHENTQFVAATNFDISVGLDGLRFESA
TPEKWLQGGVRVLTKQGLGKAKDSYPLYQNISNAEHGITGNILTPTITLHTQTLPSIDKGSLVLYRQFEVGKILSIKPKT
NNFDVDIYIYPAYQHLLTDKSRFWVESAAKIDVSPKGISIQATPLARSLKGAISFDNGGSGNNRTLYANESYAKSIGFVI
TLITDDATNLSKGMNLRYLGLDVGQIDSIQLDAKAKRITAKALINPNYMNIIAKEGANFTIISPQISAGGIDNLDSLLQP
YIDIEIGNGNTKTQFNLAQTAPQRNKFSNGTPFILETRDAMNLSEGSPILYRGVEVGTVKKFELNSLGDRVLVHIAIMPK
YSHLVRQNTEFWIASGYDFSLGWKGAVFNTGSVQQLLKGGISFSTPAEKEIQPQAQPNKRFLLQINRPEEVQTWGSGALS
K
>P44290 ~~~~~~UPF0319 protein HI_1681~~~COG3110
MKLRAVVLGLATLCTSTATFAGMVSTSSNLEFLAIDGQKASKSLGKAKTFTVDDTQNHQVVVRLNEIVGSGSNQSLFESN
PVIVTFQGNAEDLVISAPVIRNLDSGDKFNQMPNITVKTKSGNAISAKVDVLKQEGLFPSGNVLNDLAEYNASGAAASVS
KFAATTVASSVAVAPAGNAKANKGKVVVQGENVAEQQLQYWFQQADKETQTRFLNWAKSHK
>Q7A4T3 7.6.2.-~~~~~~Putative multidrug export ATP-binding/permease protein SA1683~~~
MIKRYLQFVKPYKYRIFATIIVGIIKFGIPMLIPLLIKYAIDGVINNHALTTDEKVHHLTIAIGIALFIFVIVRPPIEFI
RQYLAQWTSNKILYDIRKKLYNHLQALSARFYANNQVGQVISRVINDVEQTKDFILTGLMNIWLDCITIIIALSIMFFLD
VKLTLAALFIFPFYILTVYVFFGRLRKLTRERSQALAEVQGFLHERVQGISVVKSFAIEDNEAKNFDKKNTNFLTRALKH
TRWNAYSFAAINTVTDIGPIIVIGVGAYLAISGSITVGTLAAFVGYLELLFGPLRRLVASFTTLTQSFASMDRVFQLIDE
DYDIKNGVGAQPIEIKQGRIDIDHVSFQYNDNEAPILKDINLSIEKGETVAFVGMSGGGKSTLINLIPRFYDVTSGQILI
DGHNIKDFLTGSLRNQIGLVQQDNILFSDTVKENILLGRPTATDEEVVEAAKMANAHDFIMNLPQGYDTEVGERGVKLSG
GQKQRLSIARIFLNNPPILILDEATSALDLESESIIQEALDVLSKDRTTLIVAHRLSTITHADKIVVIENGHIVETGTHR
ELIAKQGAYEHLYSIQNL
>Q7A4S2 ~~~~~~UPF0435 protein SA1696~~~
MAMTNEEKVLAIREKLNIVNQGLLDPEKYKNANEEELTDIYDFVQSRERLSPSEVTAIADALGQLRHD
>L0TAD5 2.1.1.-~~~~~~Probable O-methyltransferase Rv1703c~~~COG4122
MAAGIRNITTTGQIGDGREAAAVDYVLAHAGAGNIDDVLATIDKFAYEKSMLINVGDEKGTLLDAAVRRADPALALELGT
YLGYGALRIARAAPEARVYSVELAEANASNARRIWAHAGVDDRVVCVVGTIGDGGRTLDALTEHGFATGTLDFVFLDHDK
KAYLPDLQSILDRGWLHPGSIVVADNVRVPGAPKYRAYMRRQQGMSWNTIEHKTHLEYQTLVPDLVLESEYLG
>A8H392 4.2.1.77~~~~~~Protein Spea_1705~~~COG3938
MQSITITPDLSSNFKDFVTIDAHTEGEPLRVIISGYPEIKGSTILEKRQYVQQNLDTYRKLLMHEPRGHADMYGALITEA
VTEEADFGVLFLHNEGYSSMCGHGILALVKVMCQTDSIDLGLEPRTIKIDSPAGLITAKAYRDSQGKIQASFKNVDSWAD
ALNCSVNVEGFGEVNYDIGFGGAYYAYVDADEHGISCGQDNVAQLIDVGRRIKHAVMASHTLVHPLEEDLSFLYGTIFTS
KKVTNPEAHSRHVCIFADGEVDRSPTGTGVSARVALLYAKGEVALNTPIMIESIVDGRMIVSASAESEFHGKQGVIPEVS
GRSFITGKHQFFIDPDDVFQNGFMLR
>P9WLT1 ~~~~~~Uncharacterized protein Rv1708~~~COG1192
MPAGLPGQASVAVRLSCDVPPDARHHEPRPGMTDHPDTGNGIGLTGRPPRAIPDPAPRSSHGPAKVIAMCNQKGGVGKTT
STINLGAALGEYGRRVLLVDMDPQGALSAGLGVPHYELDKTIHNVLVEPRVSIDDVLIHSRVKNMDLVPSNIDLSAAEIQ
LVNEVGREQTLARALYPVLDRYDYVLIDCQPSLGLLTVNGLACTDGVIIPTECEFFSLRGLALLTDTVDKVRDRLNPKLD
ISGILITRYDPRTVNSREVMARVVERFGDLVFDTVITRTVRFPETSVAGEPITTWAPKSAGALAYRALARELIDRFGM
>P44293 ~~~~~~Uncharacterized protein HI_1709~~~COG3111
MKKFALATIFALATTSAFAGFNGNNSQGGFQQAAPAAISVKQALSAADNSMITLVGNITQQIDDDEFWFTDGTGQIKIEI
KKRVWNGLNVDSKDKVKIYGKLDNEVFEKAELDVLRIEKAE
>P9WHQ1 5.4.99.-~~~~~~Uncharacterized RNA pseudouridine synthase Rv1711~~~COG1187
MMAEPEESREPRGIRLQKVLSQAGIASRRAAEKMIVDGRVEVDGHVVTELGTRVDPQVAVVRVDGARVVLDDSLVYLALN
KPRGMHSTMSDDRGRPCIGDLIERKVRGTKKLFHVGRLDADTEGLMLLTNDGELAHRLMHPSHEVPKTYLATVTGSVPRG
LGRTLRAGIELDDGPAFVDDFAVVDAIPGKTLVRVTLHEGRNRIVRRLLAAAGFPVEALVRTDIGAVSLGKQRPGSVRAL
RSNEIGQLYQAVGL
>P71977 ~~~~~~HTH-type transcriptional regulator Rv1719~~~COG1414
MSAEEQDTRSGGIQVIARAAELLRVLQAHPGGLSQAEIGERVGMARSTVSRILNALEDEGLVASRGARGPYRLGPEITRM
ATTVRLGVVTEMHPFLTELSRELDETVDLSILDGDRADVVDQVVPPQRLRAVSAVGESFPLYCCANGKALLAALPPERQA
RALPSRLAPLTANTITDRAALRDELNRIRVDGVAYDREEQTEGICAVGAVLRGVSVELVAVSVPVPAQRFYGREAELAGA
LLAWVSKVDAWFNGTEDRK
>Q9I310 ~~~~~~Uncharacterized signaling protein PA1727~~~
MLISSYNQVLVAFSLIVAILASYTALDMAGRVTLAKGREALSWLIGGAFAMGFGIWSMHFVGMLAFSLPIPLGYDLGLTL
LSLLLAVGSSAFALWLVCQAELPWQRLALGALLMGSGIAAMHYTGMAALLMMPGIVYDPLWLGLSILIAVIASGAALWIA
FRLRHGSRRIVLVRAGAALVMGCAIVGMHYTGMAAAQFPLGSFCGAAGRGIDNGWLAVLVIVITLAVIAIALIVSVLDSR
LEARTSVLATSLARANRELIQLALHDNLTKLPNRMLLDDRLEQAIQQAIRDDRRFAVLFMDLDGFKAVNDAYGHHLGDLL
LIEVAERIRANVRAQDTIARLGGDEFVLLIEAREPADAATLAEKLVKRISQPYQISRHEVRISASIGIALYPGDGQTRHE
LMINADAAMYHAKDQGRNGYCFFESSMNANAQEQLQLLHDLRQALERRQLVLHYQPKVLAPNGPMIGVEALLRWEHPQHG
LITPGQFLPLAEKTGLIVQIGEWVLDEACRQMRLWLDGGHADWNIAVNLSALQFAHAGLVDSVRNALLRHSLEPSHLILE
VTESTAMRDADASLVILEQLSAMGVGISIDDFGTGYSSLLYLKRLPASELKIDRGFINELAHDSDDAAIVSAIVALGRTL
NLKIVAEGVETEAQQEFLTRLGCNSLQGFLLGRPMPAEQLLASVA
>P61544 ~~~~~~UPF0316 protein SA1727~~~
MSFVTENPWLMVLTIFIINVCYVTFLTMRTILTLKGYRYIAASVSFLEVLVYIVGLGLVMSNLDHIQNIIAYAFGFSIGI
IVGMKIEEKLALGYTVVNVTSAEYELDLPNELRNLGYGVTHYAAFGRDGSRMVMQILTPRKYERKLMDTIKNLDPKAFII
AYEPRNIHGGFWTKGIRRRKLKDYEPEELESVVEHEIQSK
>P9WLS8 ~~~~~~Probable membrane protein MT1774~~~
MIATTRDREGATMITFRLRLPCRTILRVFSRNSLVRGTDRLEAVVMLLAVTVSLLTIPFAAAAGTAVHDSRSHVYAHQAQ
TRHPATATVIDHEGVIDSNTTATSAPPRTKITVPARWVVNGIERSGEVNAKPGTKSGDRVGIWVDSAGQLVDEPAPPARA
IADAALAALGLWLSVAAVAGALLALTRAILIRVRNASWQHDIDSLFCTQR
>P9WLS9 ~~~~~~Probable membrane protein Rv1733c~~~
MIATTRDREGATMITFRLRLPCRTILRVFSRNPLVRGTDRLEAVVMLLAVTVSLLTIPFAAAAGTAVQDSRSHVYAHQAQ
TRHPATATVIDHEGVIDSNTTATSAPPRTKITVPARWVVNGIERSGEVNAKPGTKSGDRVGIWVDSAGQLVDEPAPPARA
IADAALAALGLWLSVAAVAGALLALTRAILIRVRNASWQHDIDSLFCTQR
>P9WLS6 ~~~~~~Uncharacterized protein MT1774.1~~~
MTNVGDQGVDAVFGVIYPPQVALVSFGKPAQRVCAVDGAIHVMTTVLATLPADHGCSDDHRGALFFLSINELTRCAAVTG
>P9WLS7 ~~~~~~Uncharacterized protein Rv1734c~~~COG0508
MTNVGDQGVDAVFGVIYPPQVALVSFGKPAQRVCAVDGAIHVMTTVLATLPADHGCSDDHRGALFFLSINELTRCAAVTG
>P9WLS4 ~~~~~~Uncharacterized membrane protein MT1776~~~
MFLYVAVGSLVVARLLLYPLRPADLTPPYWVAMGATAITVLAGAHIVEMADAPMAIVTSGLVAGASVVFWAFGPWLIPPL
VAASIWKHVVHRVPLRYEATLWSVVFPLGMYGVGAYRLGLAAHLPIVESIGEFEGWVALAVWTITFVAMLHHLAATIGRS
GRSSHAIGAADDTHAIICRPPRSFDHQVRAFRRNQPM
>P9WLS5 ~~~~~~Uncharacterized membrane protein Rv1735c~~~COG1275
MFLYVAVGSLVVARLLLYPLRPADLTPPYWVAMGATAITVLAGAHIVEMADAPMAIVTSGLVAGASVVFWAFGPWLIPPL
VAASIWKHVVHRVPLRYEATLWSVVFPLGMYGVGAYRLGLAAHLPIVESIGEFEGWVALAVWTITFVAMLHHLAATIGRS
GRSSHAIGAADDTHAIICRPPRSFDHQVRAFRRNQPM
>Q7A4P4 ~~~~~~Uncharacterized protein SA1737~~~
MTNGYIGSYTKKNGKGIYRFELNENQSRIDLLEIGFELEASTYLVRNNEVLYGINKEGEQCGVASLKIDDNGELHLLNKC
LSSKAGTGCYVSISEDKRYLFEAVYGAGIIRMYELNTHTGEIIRLIQELAHDFPTGTHERQDHPHAHYINQTPDGKYVAV
TDLGADRIVTYKFDDNGFEFYKESLFKDSDGTRHIEFHDNGKFAYVVHELSNTVSVAEYNDGKFEELERHLTIPENFDGD
TKLAAVRLSHDQQFLYVSNRGHDSIAIFKVLDNGQHLELVTITESGGQFPRDFNIASSDDLLVCAHEQGDSVVTVFERSK
ETGKITLCDNTRVASEGVCVIF
>P9WLS2 ~~~~~~Uncharacterized protein MT1780~~~
MCGDQSDHVLQHWTVDISIDEHEGLTRAKARLRWREKELVGVGLARLNPADRNVPEIGDELSVARALSDLGKRMLKVSTH
DIEAVTHQPARLLY
>P9WLS3 ~~~~~~Uncharacterized protein Rv1738~~~
MCGDQSDHVLQHWTVDISIDEHEGLTRAKARLRWREKELVGVGLARLNPADRNVPEIGDELSVARALSDLGKRMLKVSTH
DIEAVTHQPARLLY
>P9WGF7 ~~~~~~Probable sulfate transporter Rv1739c~~~COG0659
MIPTMTSAGWAPGVVQFREYQRRWLRGDVLAGLTVAAYLIPQAMAYATVAGLPPAAGLWASIAPLAIYALLGSSRQLSIG
PESATALMTAAVLAPMAAGDLRRYAVLAATLGLLVGLICLLAGTARLGFLASLRSRPVLVGYMAGIALVMISSQLGTITG
TSVEGNEFFSEVHSFATSVTRVHWPTFVLAMSVLALLTMLTRWAPRAPGPIIAVLAATMLVAVMSLDAKGIAIVGRIPSG
LPTPGVPPVSVEDLRALIIPAAGIAIVTFTDGVLTARAFAARRGQEVNANAELRAVGACNIAAGLTHGFPVSSSSSRTAL
ADVVGGRTQLYSLIALGLVVIVMVFASGLLAMFPIAALGALVVYAALRLIDLSEFRRLARFRRSELMLALATTAAVLGLG
VFYGVLAAVALSILELLRRVAHPHDSVLGFVPGIAGMHDIDDYPQAKRVPGLVVYRYDAPLCFANAEDFRRRALTVVDQD
PGQVEWFVLNAESNVEVDLTALDALDQLRTELLRRGIVFAMARVKQDLRESLRAASLLDKIGEDHIFMTLPTAVQAFRRR
>Q83AX3 ~~~~~~Uncharacterized protein CBU_1754~~~
MVDDEKREVSEEIEEALKKLHLDDVDWARALSPHEILYLLDRCPFLQIVSTNEIEAFSETKFITAQSGWTIHHYGEAMSS
SPGPLLFQGGDYRILGDDDEGDDGEGGTIVNPGKGTIVKQAFTTAAEMIALAQKSGWRGVRIIDGHPLMQWAAWMQATDD
AFHLEGYEPDEKARKKRERVKRSEVEDQLKINVKPTRR
>P56112 5.2.1.8~~~~~~Putative peptidyl-prolyl cis-trans isomerase HP_0175~~~COG0760
MKKNILNLALVGALSTSFLMAKPAHNANNATHNTKKTTDSSAGVLATVDGRPITKSDFDMIKQRNPNFDFDKLKEKEKEA
LIDQAIRTALVENEAKTEKLDSTPEFKAMMEAVKKQALVEFWAKKQAEEVKKVQIPEKEMQDFYNANKDQLFVKQEAHAR
HILVKTEDEAKRIISEIDKQPKAKKEAKFIELANRDTIDPNSKNAQNGGDLGKFQKNQMAPDFSKAAFALTPGDYTKTPV
KTEFGYHIIYLISKDSPVTYTYEQAKPTIKGMLQEKLFQERMNQRIEELRKHAKIVINK
>P9WKB9 2.3.1.20~~~~~~Putative diacyglycerol O-acyltransferase Rv1760~~~COG1020
MPRGCAGARFACNACLNFLAGLGISEPISPGWAAMERLSGLDAFFLYMETPSQPLNVCCVLELDTSTMPGGYTYGRFHAA
LEKYVKAAPEFRMKLADTELNLDHPVWVDDDNFQIRHHLRRVAMPAPGGRRELAEICGYIAGLPLDRDRPLWEMWVIEGG
ARSDTVAVMLKVHHAVVDGVAGANLLSHLCSLQPDAPAPQPVRGTGGGNVLQIAASGLEGFASRPVRLATVVPATVLTLV
RTLLRAREGRTMAAPFSAPPTPFNGPLGRLRNIAYTQLDMRDVKRVKDRFGVTINDVVVALCAGALRRFLLEHGVLPEAP
LVATVPVSVHDKSDRPGRNQATWMFCRVPSQISDPAQRIRTIAAGNTVAKDHAAAIGPTLLHDWIQFGGSTMFGAAMRIL
PHISITHSPAYNLILSNVPGPQAQLYFLGCRMDSMFPLGPLLGNAGLNITVMSLNGELGVGIVSCPDLLPDLWGVADGFP
EALKELLECSDDQPEGSNHQDS
>O06797 ~~~~~~Encapsulin nanocompartment protein Rv1762c~~~COG0393
MQSSSLDPVASERLSHAEKSFTSDLSINEFALLHGAGFEPIELVMGVSVYHVGFQFSGMRQQQELGVLTEATYRARWNAM
ARMQAEADALKADGIVGVRLNWRHHGEGGEHLEFMAVGTAVRYTAKPGAFRRPNGQAFSSHLSGQDMVTLLRSGFAPVAF
VMGNCVFHIAVQGFMQTLRQIGRNMEMPQWTQGNYQARELAMSRMQSEAERDGATGVVGVHFAISNYAWGVHTVEFYTAG
TAVRRTGSGETITPSFVLPMDS
>P73628 ~~~~~~Thylakoid protein sll1769~~~COG3937
MQNQVLQAFFLGRAFAEVLSEKVEDGVTNALSELGKFDAEQRENLRQFIAEVQSRAANDVTQEGAAIATVDGPVSADELQ
ETLDKLRAEIASLKSELKNYRDNQG
>P9WFH9 2.1.1.-~~~~~~Putative S-adenosyl-L-methionine-dependent methyltransferase Rv1729c~~~COG3315
MARTDDDNWDLTSSVGVTATIVAVGRALATKDPRGLINDPFAEPLVRAVGLDLFTKMMDGELDMSTIADVSPAVAQAMVY
GNAVRTKYFDDYLLNATAGGIRQVAILASGLDSRAYRLPWPTRTVVYEIDQPKVMEFKTTTLADLGAEPSAIRRAVPIDL
RADWPTALQAAGFDSAAPTAWLAEGLLIYLKPQTQDRLFDNITALSAPGSMVATEFVTGIADFSAERARTISNPFRCHGV
DVDLASLVYTGPRNHVLDYLAAKGWQPEGVSLAELFRRSGLDVRAADDDTIFISGCLTDHSSISPPTAAGWR
>Q4KFT4 ~~~~~~UPF0434 protein PFL_1779~~~COG2835
MDTKLLDILACPICKGPLKLSADKTELISKGAGLAYPIRDGIPVMLESEARTLTTEERLDK
>P61167 ~~~~~~UPF0311 protein RPA1785~~~
MTPTLETKYVFTITARIGDVTSAGEIGTGVRRIIPILGGEVKGEGISGQVLPFGADFQIIRPNELIELEAKYAFETDDGA
VVYVENVGIRFGPVELLRKLKRGEPVDPKVIYFRTRPRFETGHPNYQWLMQYLFVGSAARHADRVVIDVHQVL
>O66565 ~~~~~~Universal stress protein Aq_178~~~
MKVLLVLTDAYSDCEKAITYAVNFSEKLGAELDILAVLEDVYNLERANVTFGLPFPPEIKEESKKRIERRLREVWEKLTG
STEIPGVEYRIGPLSEEVKKFVEGKGYELVVWACYPSAYLCKVIDGLNLASLIVK
>Q7VKS7 ~~~~~~UPF0301 protein HD_1794~~~COG1678
MFGNLQGKFIIATPEMDDEYFDRTVIYICEHNDNGTIGVIINTPTDLSVLELLTRMDFQMAKPRIYTQDQMVLNGGPVNQ
DRGFIVHSKTDHEFTHSYKVTDDITLTTSGDVLDSFGTQTAPEKFIVCLGCSTWKPHQLEQEIAQNYWLLSEANNQTLFE
TSYLDRWVEANEMLGISGILAPAGRA
>Q5X9I2 ~~~~~~UPF0297 protein M6_Spy1796~~~
MGFTDETVRFKLDDGDKRQISETLTAVYHSLDEKGYNPINQIVGYVLSGDPAYVPRYNDARNQIRKYERDEIVEELVRYY
LQGNGIDVK
>Q2YU19 ~~~~~~UPF0374 protein SAB1800c~~~
MVRESIPKEGENIKIQSYKHDGKIHRVWSETTILKGTDHVVIGGNDHTLVTESDGRTWITREPAIVYFHSEYWFNVICMF
REDGIYYYCNLSSPFVCDEEALKYIDYDLDIKVYPNGKYHLLDEDEYEQHMNQMNYPHDIDIILRRNVDILQQWIEQKKG
PFAPDFIKVWKERYKKIRQY
>Q97P44 ~~~~~~Putative trans-acting regulator SP_1800~~~COG3711
MRDLLSKKSHRQLELLELLFEHKRWFHRSELAELLNCTERAVKDDLSHVKSAFPDLIFHSSTNGIRIINTDDSDIEMVYH
HFFKHSTHFSILEFIFFNEGCQAESICKEFYISSSSLYRIISQINKVIKRQFQFEVSLTPVQIIGNERDIRYFFAQYFSE
KYYFLEWPFENFSSEPLSQLLELVYKETSFPMNLSTHRMLKLLLVTNLYRIKFGHFMEVDKDSFNDQSLDFLMQAEGIEG
VAQSFESEYNISLDEEVVCQLFVSYFQKMFFIDESLFMKCVKKDSYVEKSYHLLSDFIDQISVKYQIEMENKDNLIWHLH
NTAHLYRQELFTEFILFDQKGNTIRNFQNIFPKFVSDIKKELSHYLETLEVCSSSMMVNHLSYTFITHTKHLVINLLQNQ
PKLKVLVMSNFDQYHAKFVAETLSYYCSNNFELEVWTELELSKESLEDSPYDIIISNFIIPPIENKRLIYSNNINTVSLI
YLLNAMMFIRLDE
>P9WJJ0 1.6.-.-~~~~~~NADH dehydrogenase-like protein MT1860~~~
MTRVVVIGSGFAGLWAALGAARRLDELAVPAGTVDVMVVSNKPFHDIRVRNYEADLSACRIPLGDVLGPAGVAHVTAEVT
AIDADGRRVTTSTGASYSYDRLVLASGSHVVKPALPGLAEFGFDVDTYDGAVRLQQHLQGLAGGPLTSAAATVVVVGAGL
TGIETACELPGRLHALFARGDGVTPRVVLIDHNPFVGSDMGLSARPVIEQALLDNGVETRTGVSVAAVSPGGVTLSSGER
LAAATVVWCAGMRASRLTEQLPVARDRLGRLQVDDYLRVIGVPAMFAAGDVAAARMDDEHLSVMSCQHGRPMGRYAGCNV
INDLFDQPLLALRIPWYVTVLDLGSAGAVYTEGWERKVVSQGAPAKTTKQSINTRRIYPPLNGSRADLLAAAAPRVQPRP
>P9WJJ1 1.6.-.-~~~~~~NADH dehydrogenase-like protein Rv1812c~~~COG1252
MTRVVVIGSGFAGLWAALGAARRLDELAVLAGTVDVMVVSNKPFHDIRVRNYEADLSACRIPLGDVLGPAGVAHVTAEVT
AIDADGRRVTTSTGASYSYDRLVLASGSHVVKPALPGLAEFGFDVDTYDGAVRLQQHLQGLAGGPLTSAAATVVVVGAGL
TGIETACELPGRLHALFARGDGVTPRVVLIDHNPFVGSDMGLSARPVIEQALLDNGVETRTGVSVAAVSPGGVTLSSGER
LAAATVVWCAGMRASRLTEQLPVARDRLGRLQVDDYLRVIGVPAMFAAGDVAAARMDDEHLSVMSCQHGRPMGRYAGCNV
INDLFDQPLLALRIPWYVTVLDLGSAGAVYTEGWERKVVSQGAPAKTTKQSINTRRIYPPLNGSRADLLAAAAPRVQPRP
>P9WLS0 ~~~~~~Uncharacterized protein MT1861~~~
MITNLRRRTAMAAAGLGAALGLGILLVPTVDAHLANGSMSEVMMSEIAGLPIPPIIHYGAIAYAPSGASGKAWHQRTPAR
AEQVALEKCGDKTCKVVSRFTRCGAVAYNGSKYQGGTGLTRRAAEDDAVNRLEGGRIVNWACN
>P9WLS1 ~~~~~~Uncharacterized protein Rv1813c~~~
MITNLRRRTAMAAAGLGAALGLGILLVPTVDAHLANGSMSEVMMSEIAGLPIPPIIHYGAIAYAPSGASGKAWHQRTPAR
AEQVALEKCGDKTCKVVSRFTRCGAVAYNGSKYQGGTGLTRRAAEDDAVNRLEGGRIVNWACN
>P9WLR9 ~~~~~~Uncharacterized protein Rv1815~~~
MVRLVPRAFAATVALLAAGFSPATASADPVLVFPGMEIRQDNHVCTLGYVDPALKIAFTAGHCRGGGAVTSRDYKVIGHL
RAIRDNTPSGSTVATHELIADYEAIVLADDVTASNILPSGRALESRPGVVLHPGQAVCHFGVSTGETCGTVESVNNGWFT
MSHGVLSEKGDSGGPVYLAPDGGPAQIVGIFNSVWGGFPAAVSWRSTSEQVHADLGVTPLA
>Q2FXM0 ~~~~~~UPF0173 metal-dependent hydrolase SAOUHSC_01815~~~COG2220
MKLSFHGQSTIYLEGNNKKVIVDPFISNNPKCDLNIETVQVDYIVLTHGHFDHFGDVVELAKKTGATVIGSAEMADYLSS
YHGVENVHGMNIGGKANFDFGSVKFVQAFHSSSFTHENGIPVYLGMPMGIVFEVEGKTIYHTGDTGLFSDMSLIAKRHPV
DVCFVPIGDNFTMGIDDASYAINEFIKPKISVPIHYDTFPLIEQDPQQFKDAVNVGDVQILKPGESVQF
>P9WMC9 ~~~~~~HTH-type transcriptional regulator Rv1816~~~COG1309
MCQTCRVGKRRDAREQIEAKIVELGRRQLLDHGAAGLSLRAIARNLGMVSSAVYRYVSSRDELLTLLLVDAYSDLADTVD
RARDDTVADSWSDDVIAIARAVRGWAVTNPARWALLYGSPVPGYHAPPDRTAGVATRVVGAFFDAIAAGIATGDIRLTDD
VAPQPMSSDFEKIRQEFGFPGDDRVVTKCFLLWAGVVGAISLEVFGQYGADMLTDPGVVFDAQTRLLVAVLAEH
>P9WI85 1.13.11.24~~~~~~Putative quercetin 2,3-dioxygenase Rv0181c~~~COG1741
MTATVEIRRAADRAVTTTSWLKSRHSFSFGDHYDPDNTHHGLLLVNNDDQMEPASGFDPHPHRDMEIVTWVLRGALRHQD
SAGNSGVIYPGLAQRMSAGTGILHSEMNDSATEPVHFVQMWVIPDATGITASYQQQEIDDELLRAGLVTIASGIPGQDAA
LTLHNSSASLHGARLRPGATVSLPCAPFLHLFVAYGRLTLEGGGELADGDAVRFTDADARGLTANEPSEVLIWEMHAKLG
DSAT
>P9WFG1 ~~~~~~UPF0749 protein Rv1823~~~COG3879
MAESDRLLGGYDPNAGYSAHAGAQPQRIPVPSLLRALLSEHLDAGYAAVAAERERAAAPRCWQARAVSWMWQALAATLVA
AVFAAAVAQARSVAPGVRAAQQLLVASVRSTQAAATTLAQRRSTLSAKVDDVRRIVLADDAEGQRLLARLDVLSLAAASA
PVVGPGLTVTVTDPGASPNLSDVSKQRVSGSQQIILDRDLQLVVNSLWESGAEAISIDGVRIGPNVTIRQAGGAILVDNN
PTSSPYTILAVGPPHAMQDVFDRSAGLYRLRLLETSYGVGVSVNVGDGLALPAGATRDVKFAKQIGP
>P9WFG3 ~~~~~~UPF0749 protein Rv1825~~~COG3879
MSENRPEPVAAETSAATTARHSQADAGAHDAVRRGRHELPADHPRSKVGPLRRTRLTEILRGGRSRLVFGTLAILLCLVL
GVAIVTQVRQTDSGDSLETARPADLLVLLDSLRQREATLNAEVIDLQNTLNALQASGNTDQAALESAQARLAALSILVGA
VGATGPGVMITIDDPGPGVAPEVMIDVINELRAAGAEAIQINDAHRSVRVGVDTWVVGVPGSLTVDTKVLSPPYSILAIG
DPPTLAAAMNIPGGAQDGVKRVGGRMVVQQADRVDVTALRQPKQHQYAQPVK
>P9WME7 ~~~~~~Uncharacterized HTH-type transcriptional regulator Rv1828~~~COG0789
MSAPDSPALAGMSIGAVLDLLRPDFPDVTISKIRFLEAEGLVTPRRASSGYRRFTAYDCARLRFILTAQRDHYLPLKVIR
AQLDAQPDGELPPFGSPYVLPRLVPVAGDSAGGVGSDTASVSLTGIRLSREDLLERSEVADELLTALLKAGVITTGPGGF
FDEHAVVILQCARALAEYGVEPRHLRAFRSAADRQSDLIAQIAGPLVKAGKAGARDRADDLAREVAALAITLHTSLIKSA
VRDVLHR
>P9WLR5 ~~~~~~Uncharacterized protein Rv1829~~~COG1259
MGEVRVVGIRVEQPQNQPVLLLREANGDRYLPIWIGQSEAAAIALEQQGVEPPRPLTHDLIRDLIAALGHSLKEVRIVDL
QEGTFYADLIFDRNIKVSARPSDSVAIALRVGVPIYVEEAVLAQAGLLIPDESDEEATTAVREDEVEKFKEFLDSVSPDD
FKAT
>P9WME5 ~~~~~~Uncharacterized HTH-type transcriptional regulator Rv1830~~~COG0789
MTQLVTRARSARGSTLGEQPRQDQLDFADHTGTAGDGNDGAAAASGPVQPGLFPDDSVPDELVGYRGPSACQIAGITYRQ
LDYWARTSLVVPSIRSAAGSGSQRLYSFKDILVLKIVKRLLDTGISLHNIRVAVDHLRQRGVQDLANITLFSDGTTVYEC
TSAEEVVDLLQGGQGVFGIAVSGAMRELTGVIADFHGERADGGESIAAPEDELASRRKHRDRKIG
>P9WIQ9 3.1.1.-~~~~~~Putative serine esterase Rv1835c~~~COG2936
MTRRGGSDAAWYSAPDQRSAYPRYRGMRYSSCYVTMRDGVRIAIDLYLPAGLTSAARLPAILHQTRYYRSLQLRWPLRML
LGGKPLQHIAADKRRRRRFVASGYAWVDVDVRGSGASFGARVCEWSSDEIRDGAEIVDWIVRQPWCNGTVAALGNSYDGT
SAELLLVNQHPAVRVIAPCFSLFDVYTDIAFPGGIHAAWFTDTWGRYNEALDRNALHEVVGWWAKLPVTGMQPVQEDRDR
SLRDGAIAAHRGNYDVHQIAGSLTFRDDVSASDPYRGQPDARLEPIGTPIESGSINLISPHNYWRDVQASGAAIYSYSGW
FDGGYAHAAIKRFLTVSTPGSHLILGPWNHTGGWRVDPLRGLSRPDFDHDGELLRFIDHHVKGADTGIGSEPPVHYFTMV
ENRWKSADTWPPPATTQSYYLSADRQLRPDAPDCDSGADEYVVDQTAGTGERSRWRSQVGIGGHVCYPDRKAQDAKLLTY
TSAPLDHPLEVTGHVVVTLFITSTSSDGTFFVYLEDVDPRGRVAYITEGQLRAIHRRLSDGPPPYRQVVPYRTFASGDAW
PLVPGEIARLTFDLLPTSYLFQPGHRIRIAIAGADASHFAILPGCAPTVRVYRSRMHASRIDLPVIQP
>P9WLQ9 ~~~~~~Uncharacterized protein Rv1836c~~~COG2304
MGRHSKPDPEDSVDDLSDGHAAEQQHWEDISGSYDYPGVDQPDDGPLSSEGHYSAVGGYSASGSEDYPDIPPRPDWEPTG
AEPIAAAPPPLFRFGHRGPGDWQAGHRSADGRRGVSIGVIVALVAVVVMVAGVILWRFFGDALSNRSHTAAARCVGGKDT
VAVIADPSIADQVKESADSYNASAGPVGDRCVAVAVTSAGSDAVINGFIGKWPTELGGQPGLWIPSSSISAARLTGAAGS
QAISDSRSLVISPVLLAVRPELQQALANQNWAALPGLQTNPNSLSGLDLPAWGSLRLAMPSSGNGDAAYLAGEAVAAASA
PAGAPATAGIGAVRTLMGARPKLADDSLTAAMDTLLKPGDVATAPVHAVVTTEQQLFQRGQSLSDAENTLGSWLPPGPAA
VADYPTVLLSGAWLSQEQTSAASAFARYLHKPEQLAKLARAGFRVSDVKPPSSPVTSFPALPSTLSVGDDSMRATLADTM
VTASAGVAATIMLDQSMPNDEGGNSRLSNVVAALENRIKAMPPSSVVGLWTFDGREGRTEVPAGPLADPVNGQPRPAALT
AALGKQYSSGGGAVSFTTLRLIYQEMLANYRVGQANSVLVITAGPHTDQTLDGPGLQDFIRKSADPAKPIAVNIIDFGAD
PDRATWEAVAQLSGGSYQNLETSASPDLATAVNIFLS
>O83213 ~~~~~~Uncharacterized protein TP_0183~~~
MKKGVRSSRLLILFVLFAHAVHAAPRVGVYRLEVSGVPAHTETTINDALFSFIRELRGYHVVDCREQAVPHRFPEKGNLD
YIFCGAMDLTPEGIRLAVALKGKDHNATRLLSKTYETAARILLDSRHLVRDVFDRSVPLTGNQTETSSMRHTRAGEESVS
SLDALAGSWHSTEEGERIVIVSEGRGIAVLRSGLSVPLKLKISDGVLVVSQKGAVNARQFSHFPPEIAQKLAQEARPLQW
RFPKISGNNRLSGVRTAPVVRGAGHTASVEYEEVPEEWVRN
>P9WLQ7 ~~~~~~Uncharacterized protein Rv1841c~~~COG1253
MDVLSAVLLALLLIGANAFFVGAEFALISARRDRLEALAEQGKATAVTVIRAGEQLPAMLTGAQLGVTVSSILLGRVGEP
AVVKLLQLSFGLSGVPPALLHTLSLAIVVALHVLLGEMVPKNIALAGPERTAMLLVPPYLVYVRLARPFIAFYNNCANAI
LRLVGVQPKDELDIAVSTAELSEMIAESLSEGLLDHEEHTRLTRALRIRTRLVADVAVPLVNIRAVQVSAVGSGPTIGGV
EQALAQTGYSRFPVVDRGGRFIGYLHIKDVLTLGDNPQTVIDLAVVRPLPRVPQSLPLADALSRMRRINSHLALVTADNG
SVVGMVALEDVVEDLVGTMRDGTHR
>P9WFP3 ~~~~~~UPF0053 protein Rv1842c~~~COG1253
MNLTDTVATILAILALTAGTGVFVAAEFSLTALDRSTVEANARGGTSRDRFIQRAHHRLSFQLSGAQLGISITTLATGYL
TEPLVAELPHPGLVAVGMSDRVADGLITFFALVIVTSLSMVFGELVPKYLAVARPLRTARSVVAGQVLFSLLLTPAIRLT
NGAANWIVRRLGIEPAEELRSARTPQELVSLVRSSARSGALDDATAWLMRRSLQFGALTAEELMTPRSKIVALQTDDTIA
DLVAAAAASGFSRFPVVEGDLDATVGIVHVKQVFEVPPGDRAHTLLTTVAEPVAVVPSTLDGDAVMAQVRASALQTAMVV
DEYGGTAGMVTLEDLIEEIVGDVRDEHDDATPDVVAAGNGWRVSGLLRIDEVASATGYRAPDGPYETIGGLVLRELGHIP
VAGETVELTALDQDGLPDDSMRWLATVIQMDGRRIDLLELIKMGGHADPGSGRGR
>P9WIM3 3.1.2.-~~~~~~Putative esterase Rv1847~~~COG2050
MQPSPDSPAPLNVTVPFDSELGLQFTELGPDGARAQLDVRPKLLQLTGVVHGGVYCAMIESIASMAAFAWLNSHGEGGSV
VGVNNNTDFVRSISSGMVYGTAEPLHRGRRQQLWLVTITDDTDRVVARGQVRLQNLEARP
>P9WGQ1 1.-.-.-~~~~~~Putative oxidoreductase Rv1856c~~~COG1028
MEVLVTGGDTDLGRTMAEGFRNDGHKVTLVGARRGDLEVAAKELDVDAVVCDTTDPTSLTEARGLFPRHLDTIVNVPAPS
WDAGDPRAYSVSDTANAWRNALDATVLSVVLTVQSVGDHLRSGGSIVSVVAENPPAGGAESAIKAALSNWIAGQAAVFGT
RGITINTVACGRSVQTGYEGLSRTPAPVAAEIARLALFLTTPAARHITGQTLHVSHGALAHFG
>O67709 ~~~~~~Protein aq_1857~~~COG0316
MQEQAQQFIFKVTDKAVEEIKKVAQENNIENPILRIRVVPGGCSGFQYAMGFDDTVEEGDHVFEYDGVKVVIDPFSMPYV
NGAELDYVVDFMGGGFTIRNPNATGSCGCGSSFSCG
>P64898 ~~~~~~Uncharacterized protein Mb1858~~~
MTDMNPDIEKDQTSDEVTVETTSVFRADFLSELDAPAQAGTESAVSGVEGLPPGSALLVVKRGPNAGSRFLLDQAITSAG
RHPDSDIFLDDVTVSRRHAEFRLENNEFNVVDVGSLNGTYVNREPVDSAVLANGDEVQIGKFRLVFLTGPKQGEDDGSTG
GP
>P95149 2.8.3.-~~~~~~Probable CoA-transferase Rv1866~~~COG1804
MLDLSDGCSAGGTDMVTRLLADLGADVLKVEPPGGSPGRHVRPTLAGTSIGFAMHNANKRSAVLNPLDESDRRRFLDLAA
SADIVVDCGLPGQAAAYGASCAELADRYRHLVALSITDFGAAGPRSSWRATDPVLYAMSGALSRSGPTAGTPVLPPDGIA
SATAAVQAAWAVLVAYFNRLRCGTGDYIDFSRFDAVVMALDPPFGAHGQVAAGIRSTGRWRGRPKNQDAYPIYPCRDGYV
RFCVMAPRQWRGLRRWLGEPEDFQDPKYDVIGARLAAWPQISVLVAKLCAEKTMKELVAAGQALGVPITAVLTPSRILAS
EHFQAVGAITDAELVPGVRTGVPTGYFVVDGKRAGFRTPAPAAGQDEPRWLADPAPVPPPSGRVGGYPFEGLRILDLGII
VAGGELSRLFGDLGAEVIKVESADHPDGLRQTRVGDAMSESFAWTHRNHLALGLDLRNSEGKAIFGRLVAESDAVFANFK
PGTLTSLGFSYDVLHAFNPRIVLAGSSAFGNRGPWSTRMGYGPLVRAATGVTRVWTSDEAQPDNSRHPFYDATTIFPDHV
VGRVGALLALAALIHRDRTGGGAHVHISQAEVVVNQLDTMFVAEAARATDVAEIHPDTSVHAVYPCAGDDEWCVISIRSD
DEWRRATSVFGQPELANDPRFGASRSRVANRSELVAAVSAWTSTRTPVQAAGALQAAGVAAGPMNRPSDILEDPQLIERN
LFRDMVHPLIARPLPAETGPAPFRHIPQAPQRPAPLPGQDSVQICRKLLGMTADETERLINERVMFGPAVTA
>Q99T13 7.6.2.-~~~~~~Putative multidrug export ATP-binding/permease protein SAV1866~~~
MIKRYLQFVKPYKYRIFATIIVGIIKFGIPMLIPLLIKYAIDGVINNHALTTDEKVHHLTIAIGIALFIFVIVRPPIEFI
RQYLAQWTSNKILYDIRKKLYNHLQALSARFYANNQVGQVISRVINDVEQTKDFILTGLMNIWLDCITIIIALSIMFFLD
VKLTLAALFIFPFYILTVYVFFGRLRKLTRERSQALAEVQGFLHERVQGISVVKSFAIEDNEAKNFDKKNTNFLTRALKH
TRWNAYSFAAINTVTDIGPIIVIGVGAYLAISGSITVGTLAAFVGYLELLFGPLRRLVASFTTLTQSFASMDRVFQLIDE
DYDIKNGVGAQPIEIKQGRIDIDHVSFQYNDNEAPILKDINLSIEKGETVAFVGMSGGGKSTLINLIPRFYDVTSGQILI
DGHNIKDFLTGSLRNQIGLVQQDNILFSDTVKENILLGRPTATDEEVVEAAKMANAHDFIMNLPQGYDTEVGERGVKLSG
GQKQRLSIARIFLNNPPILILDEATSALDLESESIIQEALDVLSKDRTTLIVAHRLSTITHADKIVVIENGHIVETGTHR
ELIAKQGAYEHLYSIQNL
>P44558 ~~~~~~Uncharacterized HTH-type transcriptional regulator HI_0186~~~COG0789
MTYTTAKAAEKIGISAYTLRFYDKEGLLPNVGRDEYGNRRFTDKDLQWLSLLQCLKNTGMSLKDIKRFAECTIIGDDTIE
ERLSLFENQTKNVKCQIAELKRYLDLLEYKLAFYQKAKALGSVKAVNLPQIPETS
>P0A0K0 ~~~~~~Uncharacterized protein SAV1875~~~
MTKKVAIILANEFEDIEYSSPKEALENAGFNTVVIGDTANSEVVGKHGEKVTVDVGIAEAKPEDYDALLIPGGFSPDHLR
GDTEGRYGTFAKYFTKNDVPTFAICHGPQILIDTDDLKGRTLTAVLNVRKDLSNAGAHVVDESVVVDNNIVTSRVPDDLD
DFNREIVKQLQ
>P73321 ~~~~~~Protein slr1894~~~COG0783
MATINIGIPEADRIKIAESLKKLLADTYTLYLQTHNFHWNVTGPQFRDLHLMFEEQYNELALAVDDIAERIRSLDVFAPG
TYKEFAKLSSVQEVDGIPTSKEMVDILTKGHETIVQSCRDVLKCSQPADDESTIALASDRMRVHEKTAWMLRAMNK
>O07737 1.1.1.1~~~~~~Probable zinc-binding alcohol dehydrogenase Rv1895~~~COG1063
MRAVVIDGAGSVRVNTQPDPALPGPDGVVVAVTAAGICGSDLHFYEGEYPFTEPVALGHEAVGTIVEAGPQVRTVGVGDL
VMVSSVAGCGVCPGCETHDPVMCFSGPMIFGAGVLGGAQADLLAVPAADFQVLKIPEGITTEQALLLTDNLATGWAAAQR
ADISFGSAVAVIGLGAVGLCALRSAFIHGAATVFAVDRVKGRLQRAATWGATPIPSPAAETILAATRGRGADSVIDAVGT
DASMSDALNAVRPGGTVSVVGVHDLQPFPVPALTCLLRSITLRMTMAPVQRTWPELIPLLQSGRLDVDGIFTTTLPLDEA
AKGYATARARSGEELRFCLRPDSRDVLGAHETVDLYVHVRRCQSVADLQLEGAADGVDGPSMLN
>P9WFQ1 ~~~~~~UPF0045 protein Rv1898~~~COG0011
MSVLVAFSVTPLGVGEGVGEIVTEAIRVVRDSGLPNQTDAMFTVIEGDTWAEVMAVVQRAVEAVAARAPRVSAVIKVDWR
PGVTDAMTQKVATVERYLLRPE
>P9WK29 ~~~~~~Uncharacterized protein Rv1899c~~~COG2110
MAAMRAHARRRHPHALMSRAAGLPRLSWFAGLTWFAGGSTGAGCAAHPALAGLTAGARCPAYAAISASTARPAATAGTTP
ATGASGSARPTDAAGMADLARPGVVATHAVRTLGTTGSRAIGLCPCQPLDCPRSPQATLNLGSMGRSLDGPQWRRARVRL
CGRWWRRSNTTRGASPRPPSTCRGDNVSMIELEVHQADVTKLELDAITNAANTRLRHAGGVAAAIARAGGPELQRESTEK
APIGLGEAVETTAGDMPARYVIHAATMELGGPTSGEIITAATAATLRKADELGCRSLALVAFGTGVGGFPLDDAARLMVG
AVRRHRPGSLQRVVFAVHGDAAERAFSAAIQAGEDTARR
>Q97NV6 ~~~~~~UPF0374 protein SP_1903~~~COG3557
MKLPKEGDFITIQSYKHDGSLHRTWRDTMVLKTTENAIIGVNDHTLVTESDGRRWVTREPAIVYFHKKYWFNIIAMIRDN
GTSYYCNMASPYYLDEEALKYIDYDLDVKIFTDGEKRLLDVEEYERHKRKMNYSDDLDYILKEHVKILVDWINNGRGPFS
EAYVNIWYKRYVELKNR
>P9WFN5 ~~~~~~UPF0098 protein Rv1910c~~~COG1881
MESTVAHAFHRFALAILGLALPVALVAYGGNGDSRKAAPLAPKAAALGRSMPETPTGDVLTISSPAFADGAPIPEQYTCK
GANIAPPLTWSAPFGGALVVDDPDAPREPYVHWIVIGIAPGAGSTADGETPGGGISLPNSSGQPAYTGPCPPAGTGTHHY
RFTLYHLPAVPPLAGLAGTQAARVIAQAATMQARLIGTYEG
>Q97SX1 ~~~~~~UPF0297 protein SP_0192~~~COG4472
MGFTEETVRFKLDDSNKKEISETLTDVYASLNDKGYNPINQIVGYVLSGDPAYVPRYNNARNQIRKYERDEIVEELVRYY
LKGQGVDL
>P95283 ~~~~~~HTH-type transcriptional regulator Rv1931c~~~COG4977
MVIVGFPGDPVDTVILPGGAGVDAARSEPALIDWVKAVSGTARRVVTVCTGAFLAAEAGLLGRTPSDDALGLCRTFRPRI
SGRSGRCRPDLHAQFAEGVDRGWSHRRHRPRAGTGRRRPRHRDCPDGCPLARPVSAPTRWADPVRGSGVDATRQTDLDPP
GAGGHRGRAGGAHRIGELAQRAAMSPRHFTRVFSDEVGEAPGRYVERIRTEAARRQLEETHDTVVAIAARCGFGTAETMR
RSFIRRVGISPDQYRKAFA
>Q81RU6 ~~~~~~Uncharacterized HTH-type transcriptional regulator BA_1941/GBAA_1941/BAS1801~~~COG1846
MRDNTIGSLIWLRLIRFTNQSNQMSNEFLKRFDLTTAQFDVLLQIRTYQPLTQMELAEKVTVTQGGISRMLTRLEKEGYI
VRKQDWKTKTISLTEQGEAALERALPEQLAFQSSFFDDVLNEEEQKILYELMTKVHKHSEKKELPKE
>P9WLQ5 ~~~~~~Uncharacterized protein Rv1945~~~COG1403
MRSDTREEISAALDAYHASLSRVLDLKCDALTTPELLACLQRLEVERRRQGAAEHALINQLAGQACEEELGGTLRTALAN
RLHITPGEASRRIAEAEDLGERRALTGEPLPAQLTATAAAQREGKIGREHIKEIQAFFKELSAAVDLGIREAAEAQLAEL
ATSRRPDHLHGLATQLMDWLHPDGNFSDQERARKRGITMGKQEFDGMSRISGLLTPELRATIEAVLAKLAAPGACNPDDQ
TPVVDDTPDADAVRRDTRSQAQRHHDGLLAGLRGLLASGELGQHRGLPVTVVVSTTLKELEAATGKGVTGGGSRVPMSDL
IRMASNAHHYLALFDGAKPLALYHTKRLASPAQRIMLYAKDRGCSRPGCDAPAYHSEVHHVTPWTTTHRTDINDLTLACG
PDNRLVEKGWKTRKNAKGDTEWLPPAHLDHGQPRINRYHHPEKILCEPDDDEPH
>P9WFH7 2.1.1.-~~~~~~Putative S-adenosyl-L-methionine-dependent methyltransferase Rv1896c~~~COG3315
MTTPEYGSLRSDDDHWDIVSNVGYTALLVAGWRALHTTGPKPLVQDEYAKHFITASADPYLEGLLANPRTSEDGTAFPRL
YGVQTRFFDDFFNCADEAGIRQAVIVAAGLDCRAYRLDWQPGTTVFEIDVPKVLEFKARVLSERGAVPKAHRVAVPADLR
TDWPTPLTAAGFDPQRPSAWSVEGLLPYLTGDAQYALFARIDELCAPGSRVALGALGSRLDHEQLAALETAHPGVNMSGD
VNFSALTYDDKTDPVEWLVEHGWAVDPVRSTLELQVGYGLTPPDVDVKIDSFMRSQYITAVRA
>Q97SX0 ~~~~~~UPF0473 protein SP_0194~~~COG3906
MSHDHNHDHEERELITLVDEQGNETLFEILLTIDGKEEFGKNYVLLVPVNAEEDEDGQVEIQAYSFIENEDGTEGELQPI
PEDSEDEWNMIEEVFNSFMEE
>P9WLQ3 ~~~~~~Uncharacterized protein Rv1954c~~~
MAAGSGGGTVGLVLPRVASLSGLDGAPTVPEGSDKALMHLGDPPRRCDTHPDGTSSAAAALVLRRIDVHPLLTGLGRGRQ
TVSLRNGHLVATANRAILSRRRSRLTRGRSFTSHLITSCPRLDDHQHRHPTRCRAEHAGCTVATCIPNAHDPAPGHQTPR
WGPFRLKPAYTRI
>A0QTT7 ~~~~~~UPF0182 protein MSMEG_1959/MSMEI_1915~~~COG1615
MGMRPTARMPKLTRRSRVLIAFALVAVLLLLLGPRLIDTYVDWLWFGELGYRSVFTTVLATRLIVFVVVALAIGAIVFAG
LALAYRTRPVFVPTAGPNDPVARYRTTVMARLRLFGIGVPVFIGLLAGIVAQSYWVKIQLFLHGGDFGITDPEFGKDLGF
YAFDLPFYRLVLTYLFVATFLAFVANLLGHYLFGGIRLTGRVGALSRAARIQLISLAGTLIVLKAFAYWLDRYELLSNDR
SAKPFTGAGYTDINAVLPAKLIMLAIAVICAVAVFSALVLRDLRIPAIGVALLLLSSLVVGAGWPLIVEQFSVKPNAAQK
EAEYISRSIEATRHAYGLTDETVTYRNYENTGQTTAAQVAADRATTSNIRLLDPTIVSPAFTQFQQGKNFYYFPDQLSID
RYIGPDGNLRDYVVAARELNPDRLIDNQRDWINRHTVYTHGNGFIASPANTVRGVANDPNQNGGYPEFLASVVGANGSVI
SPGPAPLDQPRIYFGPVISNTPADYAIVGKTGDTDREYDYETNTETKNYTYGGKGGVPIGNWLNRSVFAAKFAERNFLFS
NVIGENSKILFNRDPAERVEAVAPWLTTDTSVYPAIVNKRMVWIVDGYTTLDNYPYSELTTLSSATADSNEVAVNRLAPD
KKVSYIRNSVKATVDAYDGTVTLYAQDENDPVLKAWMDVFPGTVKPKADITPELQAHLRYPEDLFKVQRALLAKYHVDNP
VTFFSAQDFWDVPLDPNPTASSFQPPYYIVAKDLVKNDNSASFQLTSALNRFQRDFLAAYVSASSDPETYGKLTVLTIPG
QVNGPKLAFNAISTDTAVSQDLGVIGRDNQNRIRWGNLLTLPVADGGLLYVAPVYASPGSSDAASSYPRLIRVAMLYNDR
VGYGPTVSDALTELFGPGAGATATDVAPAEGRPAQSTPNGQQPAASPPPAANADGRPAQAPPPPSAATPTGPVQISQAKA
EALQDLESALTAAQEAQRSGDFAEYGQALQRLNDAMKKYDSAK
>Q57362 ~~~~~~Uncharacterized MscS family protein HI_0195.1~~~COG3096
MIRKLMKIPPFFTALFASAMFTLSVSQGVLAANSTNVLPTEQSLKADLANAQKMSEGEAKNRLLAELQTSIDLLQQIQAQ
QKINDALQTTLSHSESEIRKNNAEIQALKKQQETATSTDDNAQSQDYLQNSLTKLNDQLQDTQNALSTANAQLAGQSSIS
ERAQAALTENVVRTQQINQQLANNDIGSTLRKQYQIDLQLIDLKNSYNQNLLKNNDQLSLLYQSRYNLLNLRLQVQQQNI
IAIQEVINQKNLQQSQNQVEQAQQQQKTVQNDYIQKELDRNAQLGQYLLQQTEKANSLTQDELRMRNILDSLTQTQRTID
EQISALQGTLVLSRIIQQQKQKLPTNLNIQGLSKQIADLRVHIFDITQKRNELYDLDNYINKVESEDGKQFTEAERTQVK
TLLTERRKMTSDLIKSLNNQLNLAISLELTQLQITQISDQIQSKLEQQSFWVKSNNPINLDWVKMLPRALIEQFNGMLKK
LGFPTNYDNLPYLLMYFLGLFIVGGAIFKFKNRIKQQLNKINREIHRLDTDSQWSTPLALLLTAFLTLSSTLWFLAVCQM
IGFFFFKNPEEFWHWSFSMAGYWWFFTFWISLFRPNGIFVNHFESSKENAQRFRGVIQRIIVVVVLLLNTSVFSNVTDAG
LANDVLGQINTIAALIFCAAIIAPRFNRVLRSYEPETNKHHWLIRIVQIGFRLIPVGLIVLIVLGYYYTALNLIEHFIHS
YIAWCVWWLVRNTIYRGITVSSRRLAHRRLAEKRRQKALENNYENISSDDVVAVGEPEESLALNDVRSQLLRFVDLFIWT
ALLGIFYYVWSDLVTVVSYLREITLWQQTTTTDAGTVMESITLFNLLVALVIVGITYVLVRNISGILEVLIFSRVNLSQG
TPYTITTLLTYIFIAIGGAWAFATLGMSWSKLQWLFAALSVGLGFGMQEIFANFVSGIILLFERPIRVGDVVTINEVSGT
VAKIRIRAITLIDFDRKEVIVPNKSFVTGQVTNWALSNTMTRLVISVGVAYGSDLTLVRQLLLQAADEQPTILRDPKPSA
YFLTFGASTLDHELRVYVEQVGDRTSTTDALNRRINELFAEHNIDIAFNQLDVFIKNNDTGEEIPFVDVKK
>O67776 3.4.24.-~~~~~~Putative zinc metalloprotease aq_1964~~~COG0750
MGLIAFLILIGVLVWVHEFGHFLMAKLFRVKVEIFSIGFGPPIFRRQWGETVYQIAALPLGGYVKLYGEEENVHDPRAFS
TKKPWQKILIALGGPLFNFLFTILVFALVYTAGVEVPKYLKEPVVVGYVQRDSIAQKIGIKPGDKIIKINGYEVRTWEDL
RDALIRLSLDGVKETTLFLERNGEVLHLTIKVPNVQKGEELGIAPLVKPVVGGVKKGSPADQVGIKPGDLILEVNGKKIN
TWYELVEEVRKSQGKAIKLKILRNGKMIEKELIPAKDPKTGTYFIGLFPKTETVVEKKPFGEALASAVNRTWELTVLTLK
TIAGLITGKVSFQTLGGPIAIAQIAGQAAQSGFIPYLVMMAFISLQLGIFNLIPLPILDGGLILLFAIEWLRGRPLPEKF
KEYWQRVGLAIIITLTIFVFINDILRLLR
>P9WME1 ~~~~~~Uncharacterized HTH-type transcriptional regulator Rv0196~~~COG1309
MQGPRERMVVSAALLIRERGAHATAISDVLQHSGAPRGSAYHYFPGGRTQLLCEAVDYAGEHVAAMINEAEGGLELLDAL
IDKYRQQLLSTDFRAGCPIAAVSVEAGDEQDRERMAPVIARAAAVFDRWSDLTAQRFIADGIPPDRAHELAVLATSTLEG
AILLARVRRDLTPLDLVHRQLRNLLLAELPERSR
>O67784 ~~~~~~Uncharacterized protein aq_1974~~~
MLKKILSLFKKEEPKTEEKPTEVEEKKEEREEKEEKKVRELTPQELELFKRAMGITPHNYWQWASRTNNFKLLTDGEWVW
VEGYEEHIGKQLPLNQARAWSWEFIKNRLKELNL
>P9WQM5 ~~~~~~Uncharacterized transporter Rv1979c~~~COG0531
MVGPRTRGYAIHKLGFCSVVMLGINSIIGAGIFLTPGEVIGLAGPFAPMAYVLAGIFAGVVAIVFATAARYVRTNGASYA
YTTAAFGRRIGIYVGVTHAITASIAWGVLASFFVSTLLRVAFPDKAWADAEQLFSVKTLTFLGFIGVLLAINLFGNRAIK
WANGTSTVGKAFALSAFIVGGLWIITTQHVNNYATAWSAYSATPYSLLGVAEIGKGTFSSMALATIVALYAFTGFESIAN
AAEEMDAPDRNLPRAIPIAIFSVGAIYLLTLTVAMLLGSNKIAASDDTVKLAAAIGNATFRTIIVVGALISMFGINVAAS
FGAPRLWTALADSGVLPTRLSRKNQYDVPMVSFAITASLALAFPLALRFDNLHLTGLAVIARFVQFIIVPIALIALARSQ
AVEHAAVRRNAFTDKVLPLVAIVVSVGLAVSYDYRCIFLVRGGPNYFSIALIVITFIVVPAMAYLHYYRIIRRVGDRPST
R
>Q99S93 ~~~~~~UPF0457 protein SA1975.1~~~
MAMTVKKDNNEVRIQWRVADIKIPTSEIKNITQDQDIHAVPKLDSKDVSRIGSTFGKTNRVIIDTEDHEYIIYTQNDQKV
YNELTK
>P72925 ~~~~~~UPF0367 protein ssl1972~~~
MISIDLTLKYSPMPVSVQRKEKDGAEALYQTIVTAMQGDRPQVLELTCEKQTEKKVAIMSDQISAVIVSEKDGAASAGKV
PGFAALGQIVNQG
>Q9RSY2 ~~~~~~Uncharacterized protein DR_1987~~~
MKHILFPTVSAADAFIADLQSRGVVQPQVGTMNMSRRVQQAAGDTMSTGTTTGTVTTPAATTTTTNTTSYADGGYVDGGG
TAEDAGAGAVKGTVAGALTGAAAAVIGTAATVATGGLALPVILGMTALGSGVGAAVGAVGGAAGVDETGGTSYDSYSDSY
TTNYEADDAYYNRVNESVNAGGRAVAVDDNVPQDVLMDAVNKHGGEILNS
>P9WLQ1 ~~~~~~Uncharacterized protein Rv1987~~~COG3469
MAGLNIYVRRWRTALHATVSALIVAILGLAITPVASAATARATLSVTSTWQTGFIARFTITNSSTAPLTDWKLEFDLPAG
ESVLHTWNSTVARSGTHYVLSPANWNRIIAPGGSATGGLRGGLTGSYSPPSSCLLNGQYPCT
>P9WLP5 ~~~~~~Uncharacterized protein Rv1993c~~~
MVTHELLVKAAGAVLTGLVGVSAYETLRKALGTAPIRRASVTVMEWGLRGTRRAEAAAESARLTVADVVAEARGRIGEEA
PLPAGARVDE
>P9WLP3 ~~~~~~Uncharacterized protein Rv1995~~~COG3945
MVASGAATKGVTVMKQTPPAAVGRRHLLEISASAAGVIALSACSGSPPEPGKGRPDTTPEQEVPVTAPEDLMREHGVLKR
ILLIYREGIRRLQADDQSPAPALNESAQIIRRFIEDYHGQLEEQYVFPKLEQAGKLTDITSVLRTQHQRGRVLTDRVLAA
TTAAAAFDQPARDTLAQDMAAYIRMFEPHEAREDTVVFPALRDVMSAVEFRDMAETFEDEEHRRFGEAGFQSVVDKVADI
EKSLGIYDLSQFTPS
>P9WLP0 ~~~~~~Universal stress protein MT2052~~~
MSAQQTNLGIVVGVDGSPCSHTAVEWAARDAQMRNVALRVVQVVPPVITAPEGWAFEYSRFQEAQKREIVEHSYLVAQAH
QIVEQAHKVALEASSSGRAAQITGEVLHGQIVPTLTNISRQVAMVVLGYRGQGAVAGALLGSVSSSLVRHAHGPVAVIPE
EPRPARPPHAPVVVGIDGSPTSGLAAEIAFDEASRRGVDLVALHAWSDMGPLDFPRLNWAPIEWRNLEDEQEKMLARRLS
GWQDRYPDVVVHKVVVCDRPAPRLLELAQTAQLVVVGSHGRGGFPGMHLGSVSRAVVNSGQAPVIVARIPQDPAVPA
>P9WLP1 ~~~~~~Universal stress protein Rv1996~~~COG0589
MSAQQTNLGIVVGVDGSPCSHTAVEWAARDAQMRNVALRVVQVVPPVITAPEGWAFEYSRFQEAQKREIVEHSYLVAQAH
QIVEQAHKVALEASSSGRAAQITGEVLHGQIVPTLANISRQVAMVVLGYRGQGAVAGALLGSVSSSLVRHAHGPVAVIPE
EPRPARPPHAPVVVGIDGSPTSGLAAEIAFDEASRRGVDLVALHAWSDMGPLDFPRLNWAPIEWRNLEDEQEKMLARRLS
GWQDRYPDVVVHKVVVCDRPAPRLLELAQTAQLVVVGSHGRGGFPGMHLGSVSRAVVNSGQAPVIVARIPQDPAVPA
>P9WLN8 ~~~~~~Uncharacterized protein MT2054~~~
MSFHDLHHQGVPFVLPNAWDVPSALAYLAEGFTAIGTTSFGVSSSGGHPDGHRATRGANIALAAALAPLQCYVSVDIEDG
YSDEPDAIADYVAQLSTAGINIEDSSAEKLIDPALAAAKIVAIKQRNPEVFVNARVDTYWLRQHADTTSTIQRALRYVDA
GADGVFVPLANDPDELAELTRNIPCPVNTLPVPGLTIADLGELGVARVSTGSVPYSAGLYAAAHAARAVSDGEQLPRSVP
YAELQARLVDYENRTSTT
>P9WLN9 ~~~~~~Uncharacterized protein Rv1998c~~~COG2513
MSFHDLHHQGVPFVLPNAWDVPSALAYLAEGFTAIGTTSFGVSSSGGHPDGHRATRGANIALAAALAPLQCYVSVDIEDG
YSDEPDAIADYVAQLSTAGINIEDSSAEKLIDPALAAAKIVAIKQRNPEVFVNARVDTYWLRQHADTTSTIQRALRYVDA
GADGVFVPLANDPDELAELTRNIPCPVNTLPVPGLTIADLGELGVARVSTGSVPYSAGLYAAAHAARAVSDGEQLPRSVP
YAELQARLVDYENRTSTT
>P9WQM3 ~~~~~~Uncharacterized transporter Rv1999c~~~COG0531
MRRPLDPRDIPDELRRRLGLLDAVVIGLGSMIGAGIFAALAPAAYAAGSGLLLGLAVAAVVAYCNAISSARLAARYPASG
GTYVYGRMRLGDFWGYLAGWGFVVGKTASCAAMALTVGFYVWPAQAHAVAVAVVVALTAVNYAGIQKSAWLTRSIVAVVL
VVLTAVVVAAYGSGAADPARLDIGVDAHVWGMLQAAGLLFFAFAGYARIATLGEEVRDPARTIPRAIPLALGITLAVYAL
VAVAVIAVLGPQRLARAAAPLSEAMRVAGVNWLIPVVQIGAAVAALGSLLALILGVSRTTLAMARDRHLPRWLAAVHPRF
KVPFRAELVVGAVVAALAATADIRGAIGFSSFGVLVYYAIANASALTLGLDEGRPRRLIPLVGLIGCVVLAFALPLSSVA
AGAAVLGVGVAAYGVRRIITRRARQTDSGDTQRSGHPSAT
>Q9RXV7 ~~~~~~Nucleoid-associated protein DR_0199~~~COG0718
MDMKKLMKQMQQAQVAAGKIQDELAAQSVEGTASGLVTVQMNGHGKVTSLKIKPEAVDGDDVEALEDLILAAINDAAEKA
EGLQREATAGLGLPGF
>P9WLN7 ~~~~~~Uncharacterized protein Rv2000~~~COG0121
MRPGFVGLGFGQWPVYVVRWPKLHLTPRQRKRVLHRRRLLTDRPISLSQIPIRTGGPMNDPWPRPTQGPAKTIETDYLVI
GAGAMGMAFTDTLITESGARVVMIDRACQPGGHWTTAYPFVRLHQPSAYYGVNSRALGNNTIDLVGWNQGLNELAPVGEI
CAYFDAVLQQQLLPTGRVDYFPMSEYLGDGRFRTLAGTEYVVTVNRRIVDATYLRAVVPSMRPAPYSVAPGVDCVAPNEL
PKLGTRDRYVVVGAGKTGMDVCLWLLRNDVCPDKLTWIMPRDSWLIDRATLQPGPTFVRQFRESYGATLEAIGAATSTDD
LFDRLETAGTLLRIDPSVRPSMYRCATVSHLELEQLRRIRDIVRMGHVQRIEPTTIVLDGGSVPATPTALYIDCTADGAP
QRPAKPVFDADHLTLQAVRGCQQVFSAAFIAHVEFAYEDDAVKNELCTPIPHPDCDLDWMRLMHSDLGNFQRWLNDPDLT
DWLSSARLNLLADLLPPLSHKPRVRERVVSMFQKRLGTAGDQLAKLLDAATATTEQR
>P9WLN5 ~~~~~~Uncharacterized protein Rv2001~~~COG3884
MHHNRDVDLALVERPSSGYVYTTGWRLATTDIDEHQQLRLDGVARYIQEVGAEHLADAQLAEVHPHWIVLRTVIDVINPI
ELPSDITFHRWCAALSTRWCSMRVQLQGSAGGRIETEGFWICVNKDTLTPSRLTDDCIARFGSTTENHRLKWRPWLTGPN
IDGTETPFPLRRTDIDPFEHVNNTIYWHGVHEILCQIPTLTAPYRAVLEYRSPIKSGEPLTIRYEQHDDVVRMHFVVGDD
VRAAALLRRL
>P9WJZ4 ~~~~~~Uncharacterized protein MT2059~~~
MVKRSRATRLSPSIWSGWESPQCRSIRARLLLPRGRSRPPNADCCWNQLAVTPDTRMPASSAAGRDAAAYDAWYDSPTGR
PILATEVAALRPLIEVFAQPRLEIGVGTGRFADLLGVRFGLDPSRDALMFARRRGVLVANAVGEAVPFVSRHFGAVLMAF
TLCFVTDPAAIFRETRRLLADGGGLVIGFLPRGTPWADLYALRAARGQPGYRDARFYTAAELEQLLADSGFRVIARRCTL
HQPPGLARYDIEAAHDGIQAGAGFVAISAVDQAHEPKDDHPLESE
>P9WJZ5 ~~~~~~Uncharacterized protein Rv2003c~~~COG2226
MVKRSRATRLSPSIWSGWESPQCRSIRARLLLPRGRSRPPNADCCWNQLAVTPDTRMPASSAAGRDAAAYDAWYDSPTGR
PILATEVAALRPLIEVFAQPRLEIGVGTGRFADLLGVRFGLDPSRDALMFARRRGVLVANAVGEAVPFVSRHFGAVLMAF
TLCFVTDPAAIFRETRRLLADGGGLVIGFLPRGTPWADLYALRAARGQPGYRDARFYTAAELEQLLADSGFRVIARRCTL
HQPPGLARYDIEAAHDGIQAGAGFVAISAVDQAHEPKDDHPLESE
>P9WLN2 ~~~~~~Uncharacterized protein MT2060~~~
MDSPTNDGTCDAHPVTDEPFIDVRETHTAVVVLAGDRAFKAKKPVVTDFCDFRTAEQRERACIREFELNSRLAAQSYLGI
AHLSDPSGGHAEPVVVMRRYRDKQRLASMVTAGLPVEGALDAIAEVLARFHQRAQRNRCIDTQGEVGAVARRWHENLAEL
RHHADKVVSGDVIRRIEHMVDEFVSGREVLFAGRIKEGCIVDGHADLLADDIFLVDGEPALLDCLEFEDELRYLDRIDDA
AFLAMDLEFLGRKDLGDYFLAGYAVRSGDTAPASLRDFYIAYRAVVRAKVECVRFSQGKPEAAADAVRHLIIATQHLQHA
TVRLALVGGNPGTGKSTLARGVAELVGAQVISTDDVRRRLRDCGVITGEPGVLDSGLYSRANVVAVYQEALRKARLLLGS
GHSVILDGTWGDPQMRACARRLAADTHSAIVEFRCSATVDVMADRIVARAGGNSDATAEIAAALAARQADWDTGHRIDTA
GPRERSVGQAYHIWRSAI
>P9WLN3 ~~~~~~Uncharacterized protein Rv2004c~~~COG0645
MDSPTNDGTCDAHPVTDEPFIDVRETHTAVVVLAGDRAFKAKKPVVTDFCDFRTAEQRERACIREFELNSRLAAQSYLGI
AHLSDPSGGHAEPVVVMRRYRDKQRLASMVTAGLPVEGALDAIAEVLARFHQRAQRNRCIDTQGEVGAVARRWHENLAEL
RHHADKVVSGDVIRRIEHMVDEFVSGREVLFAGRIKEGCIVDGHADLLADDIFLVDGEPALLDCLEFEDELRYLDRIDDA
AFLAMDLEFLGRKDLGDYFLAGYAVRSGDTAPASLRDFYIAYRAVVRAKVECVRFSQGKPEAAADAVRHLIIATQHLQHA
TVRLALVGGNPGTGKSTLARGVAELVGAQVISTDDVRRRLRDCGVITGEPGVLDSGLYSRANVVAVYQEALRKARLLLGS
GHSVILDGTWGDPQMRACARRLAADTHSAIVEFRCSATVDVMADRIVARAGGNSDATAEIAAALAARQADWDTGHRIDTA
GPRERSVGQAYHIWRSAI
>Q2G2M8 ~~~~~~UPF0374 protein SAOUHSC_02004~~~COG3557
MVRESIPKEGENIKIQSYKHDGKIHRVWSETTILKGTDHVVIGGNDHTLVTESDGRTWITREPAIVYFHSEYWFNVICMF
REDGIYYYCNLSSPFVCDEEALKYIDYDLDIKVYPNGKYHLLDEDEYEQHMNQMNYPHDIDIILRRNVDILQQWIEQKKG
PFAPDFIKVWKERYKKIRQY
>P9WLN0 ~~~~~~Universal stress protein MT2061~~~
MSKPRKQHGVVVGVDGSLESDAAACWGATDAAMRNIPLTVVHVVNADVATWPPMPYPETWGVWQEDEGRQIVANAVKLAK
EAVGADRKLSVKSELVFSTPVPTMVEISNEAEMVVLGSSGRGALARGLLGSVSSSLVRRAGCPVAVIHSDDAVIPDPQHA
PVLVGIDGSPVSELATAVAFDEASRRGVELIAVHAWSDVEVVELPGLDFSAVQQEAELSLAERLAGWQERYPDVPVSRVV
VCDRPARKLVQKSASAQLVVVGSHGRGGLTGMLLGSVSNAVLHAARVPVIVARQS
>P9WLN1 ~~~~~~Universal stress protein Rv2005c~~~COG0589
MSKPRKQHGVVVGVDGSLESDAAACWGATDAAMRNIPLTVVHVVNADVATWPPMPYPETWGVWQEDEGRQIVANAVKLAK
EAVGADRKLSVKSELVFSTPVPTMVEISNEAEMVVLGSSGRGALARGLLGSVSSSLVRRAGCPVAVIHSDDAVIPDPQHA
PVLVGIDGSPVSELATAVAFDEASRRGVELIAVHAWSDVEVVELPGLDFSAVQQEAELSLAERLAGWQERYPDVPVSRVV
VCDRPARKLVQKSASAQLVVVGSHGRGGLTGMLLGSVSNAVLHAARVPVIVARQS
>Q7A484 3.-.-.-~~~~~~Uncharacterized hydrolase SA2005~~~
MSKRLLLFDFDETYFKHNTNEEDLSHLREMEKLLEKLTNNNEVITAVLTGSTFQSVMDKMDQVNMTFKPLHIFSDLSSKM
FTWNNGEYVESETYKKKVLSEPFLFEDIEDILRHISAQYNVEFIPQRAFEGNETHYNFYFHSTGNHNNDSRILEALVRYA
NDQNYTARFSRSNPLAGDPENAYDIDFTPSNAGKLYATQFLMKKYNIPVKSILGFGDSGNDEAYLSYLEHAYLMSNSRDE
ALKQKFRLTKYPYYQGITLHVKEFVEGKYDY
>P9WN14 3.2.1.-~~~~~~Uncharacterized glycosyl hydrolase MT2062~~~
MRCGIVVNVTGPPPTIDRRYHDAVIVGLDNVVDKATRVHAAAWTKFLDDYLTRRPQRTGEDHCPLTHDDYRRFLAGKPDG
VADFLAARGIRLPPGSPTDLTDDTVYGLQNLERQTFLQLLNTGVPEGKSIASFARRLQVAGVRVAAHTSHRNYGHTLDAT
GLAEVFAVFVDGAVTAELGLPAEPNPAGLIETAKRLGANPGRCVVIDSCQTGLRAGRNGGFALVIAVDAHGDAENLLSSG
ADAVVADLAAVTVGSGDAAISTIPDALQVYSQLKRLLTGRRPAVFLDFDGTLSDIVERPEAATLVDGAAEALRALAAQCP
VAVISGRDLADVRNRVKVDGLWLAGSHGFELVAPDGSHHQNAAATAAIDGLAEAAAQLADALREIAGAVVEHKRFAVAVH
YRNVADDSVDNLIAAVRRLGHAAGLRVTTGRKVVELRPDIAWDKGKALDWIGERLGPAEVGPDLRLPIYIGDDLTDEDAF
DAVRFTGVGIVVRHNEHGDRRSAATFRLECPYTVCQFLSQLACDLQEAVQHDDPWTLVFHGYDPGQERLREALCAVGNGY
LGSRGCAPESAESEAHYPGTYVAGVYNQLTDHIEGCTVDNESLVNLPNWLSLTFRIDGGAWFNVDTVELLSYRQTFDLRR
ATLTRSLRFRDAGGRVTTMTQERFASMNRPNLVALQTRIESENWSGTVDFRSLVDGGVHNTLVDRYRQLSSQHLTTAEIE
VLADSVLLRTQTSQSGIAIAVAARSTLWRDGQRVDAQYRVARDTNRGGHDIQVTLSAGQSVTLEKVATIFTSRDAATLTA
AISAQRCLGEAGRYAELCQQHVRAWARLWERCAIDLTGNTEELRLVRLHLLHLLQTISPHTAELDAGVPARGLNGEAYRG
HVFWDALFVAPVLSLRMPKVARSLLDYRYRRLPAARRAAHRAGHLGAMYPWQSGSDGSEVSQQLHLNPRSGRWTPDPSDR
AHHVGLAVAYNAWHYYQVTGDRQYLVDCGAELLVEIARFWVGLAKLDDSRGRYLIRGVIGPDEFHSGYPGNEYDGIDNNA
YTNVMAVWVILRAMEALDLLPLTDRRHLIEKLGLTTQERDQWDDVSRRMFVPFHDGVISQFEGYSELAELDWDHYRHRYG
NIQRLDRILEAEGDSVNNYQASKQADALMLLYLLSSDELIGLLARLGYRFAPTQIPGTVDYYLARTSDGSTLSAVVHAWV
LARANRSNAMEYFRQVLRSDIADVQGGTTQEGIHLAAMAGSIDLLQRCYSGLELRDDRLVLSPQWPEALGPLEFPFVYRR
HQLSLRISGRSATLTAESGDAEPIEVECRGHVQRLRCGHTIEVGCSR
>P9WN15 3.2.1.-~~~~~~Uncharacterized glycosyl hydrolase Rv2006~~~COG0561
MRCGIVVNVTGPPPTIDRRYHDAVIVGLDNVVDKATRVHAAAWTKFLDDYLTRRPQRTGEDHCPLTHDDYRRFLAGKPDG
VADFLAARGIRLPPGSPTDLTDDTVYGLQNLERQTFLQLLNTGVPEGKSIASFARRLQVAGVRVAAHTSHRNYGHTLDAT
GLAEVFAVFVDGAVTAELGLPAEPNPAGLIETAKRLGANPGRCVVIDSCQTGLRAGRNGGFALVIAVDAHGDAENLLSSG
ADAVVADLAAVTVGSGDAAISTIPDALQVYSQLKRLLTGRRPAVFLDFDGTLSDIVERPEAATLVDGAAEALRALAAQCP
VAVISGRDLADVRNRVKVDGLWLAGSHGFELVAPDGSHHQNAAATAAIDGLAEAAAQLADALREIAGAVVEHKRFAVAVH
YRNVADDSVDNLIAAVRRLGHAAGLRVTTGRKVVELRPDIAWDKGKALDWIGERLGPAEVGPDLRLPIYIGDDLTDEDAF
DAVRFTGVGIVVRHNEHGDRRSAATFRLECPYTVCQFLSQLACDLQEAVQHDDPWTLVFHGYDPGQERLREALCAVGNGY
LGSRGCAPESAESEAHYPGTYVAGVYNQLTDHIEGCTVDNESLVNLPNWLSLTFRIDGGAWFNVDTVELLSYRQTFDLRR
ATLTRSLRFRDAGGRVTTMTQERFASMNRPNLVALQTRIESENWSGTVDFRSLVDGGVHNTLVDRYRQLSSQHLTTAEIE
VLADSVLLRTQTSQSGIAIAVAARSTLWRDGQRVDAQYRVARDTNRGGHDIQVTLSAGQSVTLEKVATIFTSRDAATLTA
AISAQRCLGEAGRYAELCQQHVRAWARLWERCAIDLTGNTEELRLVRLHLLHLLQTISPHTAELDAGVPARGLNGEAYRG
HVFWDALFVAPVLSLRMPKVARSLLDYRYRRLPAARRAAHRAGHLGAMYPWQSGSDGSEVSQQLHLNPRSGRWTPDPSDR
AHHVGLAVAYNAWHYYQVTGDRQYLVDCGAELLVEIARFWVGLAKLDDSRGRYLIRGVIGPDEFHSGYPGNEYDGIDNNA
YTNVMAVWVILRAMEALDLLPLTDRRHLIEKLGLTTQERDQWDDVSRRMFVPFHDGVISQFEGYSELAELDWDHYRHRYG
NIQRLDRILEAEGDSVNNYQASKQADALMLLYLLSSDELIGLLARLGYRFAPTQIPGTVDYYLARTSDGSTLSAVVHAWV
LARANRSNAMEYFRQVLRSDIADVQGGTTQEGIHLAAMAGSIDLLQRCYSGLELRDDRLVLSPQWPEALGPLEFPFVYRR
HQLSLRISGRSATLTAESGDAEPIEVECRGHVQRLRCGHTIEVGCSR
>Q7WKU6 ~~~~~~UPF0434 protein BB2007~~~COG2835
MESRLLDILVCPVCKGRLEFQRAQAELVCNADRLAFPVRDGVPIMLEAEARSLDAEAPAQPS
>P9WLM9 ~~~~~~Uncharacterized protein Rv2008c~~~COG1373
MDEIESLIGLRPTPLTWPVVIAGDFLGVWDPPPSLPGAANHEISAPTARISCMLIERRDAAARLRRALHRAPVVLLTGPR
QAGKTTLSRLVGKSAPECTFDAENPVDATRLADPMLALSGLSGLITIDEAQRIPDLFPVLRVLVDRPVMPARFLILGSAS
PDLVGLASESLAGRVELVELSGLTVRDVGSSAADRLWLRGGLPPSFTARSNEDSAAWRDGYITTFLERDLAQLGVRIPAA
TMRRAWTMLAHYHGQLFSGAELARSLDVAQTTARRYLDALTDALVVRQLTPWFANIGKRQRRSPKIYIRDTGLLHRLLGI
DDRLALERNPKLGASWEGFVLEQLAALLAPNPLYYWRTQQDAELDLYVELSGRPYGFEIKRTSTPSISRSMRSALVDLQL
ARLAIVYPGEHRFPLSDTVVAVPADQILTTGSVDELLALLK
>Q8NP09 ~~~~~~Uncharacterized membrane protein Cgl2017/cg2211~~~
MAGSSHTIEPEIYRGVSTLDEPSAAWGWHGLKRNTIQLAGWISVLFMLGYNFGNHKGHVETIWLLVITALLVIGLLIHLF
EPKLSQVRTITSRNKPVGHVEPDWTYDQATLTGTWGNLTDSQLRSVNIEPSRVAHLRAADSAKELDN
>P9WGF5 ~~~~~~Probable cation efflux system protein Rv2025c~~~COG0053
MTHDHAHSRGVPAMIKEIFAPHSHDAADSVDDTLESTAAGIRTVKISLLVLGLTALIQIVIVVMSGSVALAADTIHNFAD
ALTAVPLWIAFALGAKPATRRYTYGFGRVEDLAGSFVVAMITMSAIIAGYEAIARLIHPQQIEHVGWVALAGLVGFIGNE
WVALYRIRVGHRIGSAALIADGLHARTDGFTSLAVLCSAGGVALGFPLADPIVGLLITAAILAVLRTAARDVFRRLLDGV
DPAMVDAAEQALAARPGVQAVRSVRMRWIGHRLHADAELDVDPALDLAQAHRIAHDAEHELTHTVPKLTTALIHAYPAEH
GSSIPDRGRTVE
>P9WFD1 ~~~~~~Universal stress protein Rv2026c~~~COG0589
MSAATAKYGILVGVDGSAQSNAAVAWAAREAVMRQLPITLLHIVAPVVVGWPVGQLYANMTEWQKDNAQQVIEQAREALT
NSLGESKPPQVHTELVFSNVVPTLIDASQQAWLMVVGSQGMGALGRLLLGSISTALLHHARCPVAIIHSGNGATPDSDAP
VLVGIDGSPASEAATALAFDEASRRRVDLVALHAWTDLGMFPVLGMDWREREKREAEVLAERLAGWQEQYPDVRVHRSLV
CDKPARWLLEHSEQAQLVVVGSHGRGGFSGMLLGSVSSAVAHSVRIPVIVVRPS
>P9WFD8 ~~~~~~Universal stress protein MT2087~~~
MNQSHKPPSIVVGIDGSKPAVQAALWAVDEAASRDIPLRLLYAIEPDDPGYAAHGAAARKLAAAENAVRYAFTAVEAADR
PVKVEVEITQERPVTSLIRASAAAALVCVGAIGVHHFRPERVGSTAAALALSAQCPVAIVRPHRVPIGRDAAWIVVEADG
SSDIGVLLGAVMAEARLRDSPVRVVTCRQSGVGDTGDDVRASLDRWLARWQPRYPDVRVQSAAVHGELLDYLAGLGRSVH
MVVLSASDQEHVEQLVGAPGNAVLQEAGCTLLVVGQQYL
>P9WFD9 ~~~~~~Universal stress protein Rv2028c~~~COG0589
MNQSHKPPSIVVGIDGSKPAVQAALWAVDEAASRDIPLRLLYAIEPDDPGYAAHGAAARKLAAAENAVRYAFTAVEAADR
PVKVEVEITQERPVTSLIRASAAAALVCVGAIGVHHFRPERVGSTAAALALSAQCPVAIVRPHRVPIGRDAAWIVVEADG
SSDIGVLLGAVMAEARLRDSPVRVVTCRQSGVGDTGDDVRASLDRWLARWQPRYPDVRVQSAAVHGELLDYLAGLGRSVH
MVVLSASDQEHVEQLVGAPGNAVLQEAGCTLLVVGQQYL
>P9WLM0 ~~~~~~Uncharacterized protein MT2089~~~
MLMTAAADVTRRSPRRVFRDRREAGRVLAELLAAYRDQPDVIVLGLARGGLPVAWEVAAALHAPLDAFVVRKLGAPGHDE
FAVGALASGGRVVVNDDVVRGLRITPQQLRDIAEREGRELLRRESAYRGERPPTDITGKTVIVVDDGLATGASMFAAVQA
LRDAQPAQIVIAVPAAPESTCREFAGLVDDVVCATMPTPFLAVGESFWDFRQVTDEEVRRLLATPTAGPSLRRPAASTAA
DVLRRVAIDAPGGVPTHEVLAELVGDARIVLIGESSHGTHEFYQARAAMTQWLIEEKGFGAVAAEADWPDAYRVNRYVRG
LGEDTNADEALSGFERFPAWMWRNTVVRDFVEWLRTRNQRYESGALRQAGFYGLDLYSLHRSIQEVISYLDKVDPRAAAR
ARARYACFDHACADDGQAYGFAAAFGAGPSCEREAVEQLVDVQRNALAYARQDGLLAEDELFYAQQNAQTVRDAEVYYRA
MFSGRVTSWNLRDQHMAQTLGSLLTHLDRHLDAPPARIVVWAHNSHVGDARATEVWADGQLTLGQIVRERYGDESRSIGF
STYTGTVTAASEWGGIAQRKAVRPALHGSVEELFHQTADSFLVSARLSRDAEAPLDVVRLGRAIGVVYLPATERQSHYLH
VRPADQFDAMIHIDQTRALEPLEVTSRWIAGENPETYPTGL
>P9WLM1 ~~~~~~Uncharacterized protein Rv2030c~~~COG1926
MLMTAAADVTRRSPRRVFRDRREAGRVLAELLAAYRDQPDVIVLGLARGGLPVAWEVAAALHAPLDAFVVRKLGAPGHDE
FAVGALASGGRVVVNDDVVRGLRITPQQLRDIAEREGRELLRRESAYRGERPPTDITGKTVIVVDDGLATGASMFAAVQA
LRDAQPAQIVIAVPAAPESTCREFAGLVDDVVCATMPTPFLAVGESFWDFRQVTDEEVRRLLATPTAGPSLRRPAASTAA
DVLRRVAIDAPGGVPTHEVLAELVGDARIVLIGESSHGTHEFYQARAAMTQWLIEEKGFGAVAAEADWPDAYRVNRYVRG
LGEDTNADEALSGFERFPAWMWRNTVVRDFVEWLRTRNQRYESGALRQAGFYGLDLYSLHRSIQEVISYLDKVDPRAAAR
ARARYACFDHACADDGQAYGFAAAFGAGPSCEREAVEQLVDVQRNALAYARQDGLLAEDELFYAQQNAQTVRDAEVYYRA
MFSGRVTSWNLRDQHMAQTLGSLLTHLDRHLDAPPARIVVWAHNSHVGDARATEVWADGQLTLGQIVRERYGDESRSIGF
STYTGTVTAASEWGGIAQRKAVRPALHGSVEELFHQTADSFLVSARLSRDAEAPLDVVRLGRAIGVVYLPATERQSHYLH
VRPADQFDAMIHIDQTRALEPLEVTSRWIAGENPETYPTGL
>P9WIH5 ~~~~~~Uncharacterized protein Rv2047c~~~COG0451
MRIAVTGASGVLGRGLTARLLSQGHEVVGIARHRPDSWPSSADFIAADIRDATAVESAMTGADVVAHCAWVRGRNDHINI
DGTANVLKAMAETGTGRIVFTSSGHQPRVEQMLADCGLEWVAVRCALIFGRNVDNWVQRLFALPVLPAGYADRVVQVVHS
DDAQRLLVRALLDTVIDSGPVNLAAPGELTFRRIAAALGRPMVPIGSPVLRRVTSFAELELLHSAPLMDVTLLRDRWGFQ
PAWNAEECLEDFTLAVRGRIGLGKRTFSLPWRLANIQDLPAVDSPADDGVAPRLAGPEGANGEFDTPIDPRFPTYLATNL
SEALPGPFSPSSASVTVRGLRAGGVGIAERLRPSGVIQREIAMRTVAVFAHRLYGAITSAHFMAATVPFAKPATIVSNSG
FFGPSMASLPIFGAQRPPSESSRARRWLRTLRNIGVFGVNLVGLSAGSPRDTDAYVADVDRLERLAFDNLATHDDRRLLS
LILLARDHVVHGWVLASGSFMLCAAFNVLLRGLCGRDTAPAAGPELVSARSVEAVQRLVAAARRDPVVIRLLAEPGERLD
KLAVEAPEFHSAVLAELTLIGHRGPAEVEMAATSYADNPELLVRMVAKTLRAVPAPQPPTPVIPLRAKPVALLAARQLRD
REVRRDRMVRAIWVLRALLREYGRRLTEAGVFDTPDDVFYLLVDEIDALPADVSGLVARRRAEQRRLAGIVPPTVFSGSW
EPSPSSAAALAAGDTLRGVGVCGGRVRGRVRIVRPETIDDLQPGEILVAEVTDVGYTAAFCYAAAVVTELGGPMSHAAVV
AREFGFPCVVDAQGATRFLPPGALVEVDGATGEIHVVELASEDGPALPGSDLSR
>P43963 ~~~~~~Uncharacterized protein HI_0205~~~
MKISFHSVLIGLASFIGVQQGVIANPSSHQSISTESAEALKQQFSMALAKQDKQQITNLQKKLTALFSLPPQFLDNQIQI
SEKILTRIFKTDKNLTPKFLDYLYFEPINTVDANLIQEMKKNLLVSFLANDQAKIYIRQTDNSEQFVQTLMERGAKADQI
ILLSLNAKGIFQKIIEQIRQDFPNQTIFSITENRISLITPSSEIKSRLALANMMFNRQFKGVEVDDFSYLDQPRENLQHN
NDAIRYKTFQAMLEGLN
>P9WFM5 ~~~~~~Putative transport protein Rv0205~~~COG0628
MSASLDDASVAPLVRKTAAWAWRFLVILAAMVALLWVLNKFEVIVVPVLLALMLSALLVPPVDWLDSRGLPHAVAVTLVL
LSGFAVLGGILTFVVSQFIAGLPHLVTEVERSIDSARRWLIEGPAHLRGEQIDNAGNAAIEALRNNQAKLTSGALSTAAT
ITELVTAAVLVLFTLIFFLYGGRSIWQYVTKAFPASVRDRVRAAGRAGYASLIGYARATFLVALTDAAGVGAGLAVMGVP
LALPLASLVFFGAFIPLIGAVVAGFLAVVVALLAKGIGYALITVGLLIAVNQLEAHLLQPLVMGRAVSIHPLAVVLAIAA
GGVLAGVVGALLAVPTVAFFNNAVQVLLGGNPFADVADVSSDHLTEV
>P9WLL9 ~~~~~~Uncharacterized protein Rv2067c~~~COG0500
MTDDHPRADIVSRQYHRWLYPHPIADLEAWTTANWEWFDPVHSHRILWPDREYRPDLDILIAGCGTNQAAIFAFTNRAAK
VVAIDISRPALDHQQYLKDKHGLANLELHLLPIEELATLGRDFDLVVSTGVLHHLADPRAGMKELAHCLRRDGVVAAMLY
GKYGRIGVELLGSVFRDLGLGQDDASIKLAKEAISLLPTYHPLRNYLTKARDLLSDSALVDTFLHGRQRSYTVEECVDLV
TSAGLVFQGWFHKAPYYPHDFFVPNSEFYAAVNTLPEVKAWSVMERLETLNATHLFMACRRDRPKEQYTIDFSTVAALDY
VPLMRTRCGVSGTDMFWPGWRMAPSPAQLAFLQQVDGRRTIREIAGCVARTGEPSGGSLADLEEFGRKLFQSLWRLDFVA
VALPASG
>A4QFQ3 3.4.-.-~~~~~~Probable endopeptidase cgR_2070~~~
MGKHRRNNSNATRKAVAASAVALGATAAIASPAQAAEVVVPGTGISVDIAGIETTPGLNNVPGIDQWIPSLSSQAAPTAY
AAVIDAPAAEAQAAPAASTGQAIVDAARTKIGSPYGWGATGPNAFDCSGLTSWAYSQVGKSIPRTSQAQAAQGTPVAYSD
LQAGDIVAFYSGATHVGIYSGHGTVIHALNSSTPLSEHSLDYMPFHSAVRF
>P9WGR3 1.-.-.-~~~~~~Uncharacterized oxidoreductase Rv2073c~~~COG0300
MDDTGAAPVVIFGGRSQIGGELARRLAAGATMVLAARNADQLADQAAALRAAGAIAVHTREFDADDLAAHGPLVASLVAE
HGPIGTAVLAFGILGDQARAETDAAHAVAIVHTDYVAQVSLLTHLAAAMRTAGRGSLVVFSSVAGIRVRRANYVYGSAKA
GLDGFASGLADALHGTGVRLLIARPGFVIGRMTEGMTPAPLSVTPERVAAATARALVNGKRVVWIPWALRPMFVALRLLP
RFVWRRMPR
>Q02QI1 ~~~~~~STAS-domain containing protein PA14_20770~~~
MAITALPSADGQELTIQIQGRFDFGAHQDFRDAYERVAITPRRYVVDLRNATYLDSSALGMLLLLRDHAGGENAQISLAN
CSPEVRKILAISNFEQLFKIS
>Q83A32 ~~~~~~Uncharacterized protein CBU_2079~~~
MSYIKRDHTALRDIAMKTFLKVVGLAASLSAASVAFSSYQLIIKNNYNQTVAISLFDDQGGTHDAGEVEANGQTKITAHL
DQASGFCLNVAGKREVVCSYKSGSHPNGTITIDSTGRYCIYNNTKKISGGHGCG
>Q10690 ~~~~~~Uncharacterized protein Rv2082~~~
MAGDLPPGRWSALLVGAWWPARPDAPMAGVTYWRKAAQLKRNEANDLRNERSLLAVNQGRTADDLLERYWRGEQRLATIA
HQCEVKSDQSEQVADAVNYLRDRLTEIAQSGNQQINQILAGKGPIEAKVAAVNAVIEQSNAMADHVGATAMSNIIDATQR
VFDETIGGDAHTWLRDHGVSLDTPARPRPVTAEDMTSMTANSPAGSPFGAAPSAPSHSTTTSGPPTAPTPTSPFGTAPMV
LSSSSTSSGPPTAPTPTSPFGTAPMPPGPPPPGTVSPPLPPSAPAVGVGGPSVPAAGMPPAAAAATAPLSPQSLGQSFTT
GMTTGTPAAAGAQALSAGALHAATEPLPPPAPPPTTPTVTTPTVATATTAGIPHIPDSAPTPSPAPIAPPTTDNASAMTP
IAPMVANGPPASPAPPAAAPAGPLPAYGADLRPPVTTPPATPPTPTGPISGAAVTPSSPAAGGSLMSPVVNKSTAPATTQ
AQPSNPTPPLASATAAATTGAAAGDTSRRAAEQQRLRRILDTVARQEPGLSWAAGLRDNGQTTLLVTDLASGWIPPHIRL
PAHITLLEPAPRRRHATVTDLLGTTTVAAAHHPHGYLSQPDPDTPALTGDRTARIAPTIDELGPTLVETVRRHDTLPPIA
QAVVVAATRNYGVPDNETDLLHHKTTEIHQAVLTTYPNHDIATVVDWMLLAAINALIAGDQSGANYHLAWAIAAISTRRS
R
>P9WLK1 ~~~~~~Uncharacterized protein Rv2084~~~
MPVSDDSSSAFDLICAEIERQLRGGELLMDAAAASELLLTVRYQLDTQPRPLVIVHGPLFQAVKAARAQVYGRLIQLRHA
RCEVLDERWQLRPTGQRDVRALLIDVLNVLLAAITAAGVERAYACAERRAMAAAVVAKNYRDALGVELQCNSVCRAAAEA
IHALAHRTGATEDADCLPPVDVIHADVTRRMHGEVATDVVAAGELVIAARHLLDPMPRGELSYGPLHEGGNAARKSVYRR
LVQLWQARRAVTDGDVDLRDARTLLTDLDSILREMRTAATIQQSGTAGDGGGGRRQDSRRRNGPRRPARRGTSRGRRCAP
RVAIGWHTPIGDPLAVEGVEEIGASLPGRESTPSDDGGSLHPSGRPRRVHRRRWCGLGLC
>P9WLJ5 ~~~~~~Uncharacterized protein Rv2091c~~~
MSGPQGSDPRQPWQPPGQGADHSSDPTVAAGYPWQQQPTQEATWQAPAYTPQYQQPADPAYPQQYPQPTPGYAQPEQFGA
QPTQLGVPGQYGQYQQPGQYGQPGQYGQPGQYAPPGQYPGQYGPYGQSGQGSKRSVAVIGGVIAVMAVLFIGAVLILGFW
APGFFVTTKLDVIKAQAGVQQVLTDETTGYGAKNVKDVKCNNGSDPTVKKGATFECTVSIDGTSKRVTVTFQDNKGTYEV
GRPQ
>Q7A417 1.1.1.-~~~~~~Putative 2-hydroxyacid dehydrogenase SA2098~~~
MEKVYVAGAIPEVGLKLLQEHFEVEMYEGKGLVDKDTLIKGVKNATALISLLSTNVDKDVIDAGKDLKIIANYGAGFNNI
DIEYAREKSIDVTNTPKASTNATADLTIGLVLAVARRIVEGDQLSRTTGFDGWAPLFFRGREVSGKTIGIIGLGEIGSAV
ARRARAFDMDVLYTGPNRKEEKEREIGAKYVDLDTLLKNADFITINAAYNPKMHHLIDTEQFKMMKSTVYLINASRGPIV
HEQALVQALKDNEIEGAALDVYEFEPDITDDLKSLNNVVLTPHIGNATFEARDMMSKIVANAAISAVQGEKPQFVVN
>O53500 ~~~~~~Uncharacterized protein Rv2102~~~COG4279
MSSTWYPPPSRPRPVEGGIKARSTRGAIAQTWWSERFIAVLEDIGLGNRLQRGRSYARKGQVISLQVDAGLVTALVQGSR
ARPYRIRIGIPAFGKSQWAHVERTLAENAWYAAKLLSGEMPEDIEDVFAGLGLSLFPGTARELSLDCSCPDYAVPCKHLA
ATFYLLAESFDEDPFAILAWRGREREDLLANLAAARADGAAPAADHAEQVAQPLTDCLDRYYARQADINVPSPPATPSTA
LLDQLPDTGLSARGRPLTELLRPAYHALTHHHNSAGG
>O66586 ~~~~~~Uncharacterized globin-like protein aq_211~~~COG1017
MLSEETIRVIKSTVPLLKEHGTEITARMYELLFSKYPKTKELFAGASEEQPKKLANAIIAYATYIDRLEELDNAISTIAR
SHVRRNVKPEHYPLVKECLLQAIEEVLNPGEEVLKAWEEAYDFLAKTLITLEKKLYSQP
>P40182 ~~~~~~Uncharacterized protein SCO2127~~~
MSEELPPSEAPRPDEADAVDETRATGETRGAAQEPGASDADAWATACAEDLEAEKARRRAAYGPPPGSAAEELRRLVDTV
ADKLSGLQSPLLGQVAGPAAQQVVRQVVQQAKAAVEPVIERNPDLFDHLAAAGGELLAAYRSAVQNQERRWTTGDTAPKD
PSDPRDLDERGTDGRDEGDGGTGPGQRIDLD
>Q87MV2 ~~~~~~UPF0352 protein VP2129~~~COG3082
MPITSKYTDEQVEKILAEVALVLEKHAASPELTLMIAGNIATNVLNQRVAASQRKLIAEKFAQALMSSLETPKTH
>O06242 ~~~~~~Protein Rv2133c~~~COG5032
MCCSGLAMTLRDDEHAVLADGELTVLGRIRSASNATFLCESTLGLRSLHCVYKPVSGERPLWDFPDGTLAGRELSAYLVS
TQLGWNLVPHTIIRDGPAGIGMLQLWVQQPGDAVDSDPLPGPDLVDLFPAHRPRPGYLPVLRAYDYAGDEVVLMHADDIR
LRRMAVFDVLINNADRKGGHILCGIDGQVYGVDHGLCLHVENKLRTVLWGWAGKPIDDQILQAVAGLADALGGPLAEALA
GRIAAAEIGALRRRAQSLLDQPVMPGPNGHRPIPWPAF
>P75556 ~~~~~~Uncharacterized protein MG075 homolog~~~
MKLSAIISLSVAGTVGTTAVVVPTTITLVNKTHQVEHESEQSDFQDIRFGLNSVKLPKAQPAAATRITVENGTDKLVNYK
SSPQQLFLAKNALKDKLQGEFDKFLSDAKAFPALTADLQEWVDQQLFNPNQSFFDLSAPRSNFTLSSDKKASLDFIFRFT
NFTESVQLLKLPEGVSVVVDSKQSFDYYVNASAQKLLVLPLSLPDYTLGLNYMFDHITLNGKVVNKFSFNPFKTNLNLAF
SNVYNGVDVFEAQKNLVGKGKYLNTHVKAEDVKKDVNANIKNQFDIAKIIAELMGKALKEFGNQQEGQPLSFLKVMDKVK
EDFEKLFNLVRPGLGKFVKDLIQSSSQAENKITVYKLIFDNKKTILNLLKELSIPELNSSLGLVDVLFDGITDSDGLYER
LQSFKDLIVPAVKTNEKTAALSPLIEELLTQKDTYVFDLIQKHKGILTNLLKNFLADFQKSTPFMADQVAIFTELFDNEG
AFDLFGEADFVDKIAELFLTKRTVKNGEKIETKDSLLVTSLKSLLGEKVAALGDLLDSYIFKNELLNRSVEVAKAEAKDT
KGATDYKKEQAKALKKLFKHIGENTLSKTNLDKITLKEVKNTENVELEETETTLKVKKLDVEYKVELGNFEIKNGLIKAM
LEFLPDTKDLETTLDKLLFKGESYKAMKDKYIKEGFPGYGWAKGVVPGAFESIENTFKSAIDKTKSIRDLFGDMLFGNDL
SSVKETDSFITLGGSFDIKYGGENLNVLPAYYSLINSEIGYQIIGVDTTIDATKVKVELKNKEYKGKSPAINGQVKLSQS
FFNVWTNMFDSITKQIFQKKYEFKDNIQVFARNEDNTSRLELDISDPEQRVIPFAFVDGFGIQLKAVDKNITKEAGNTEP
KSPVIQLYEALNKEKDQKQQSKQSPKQLDTKTQLGYLLKLGDNWSKDDYKSLIDDTIINNNYLEASFNSKITVDRLGIPI
DLWLFKIWPKFNLEIPMQGSLQLYSSSVIFPYGIYDTSVQDAAKIVKRLNFTDMGFKLNDPKPNFWFVGF
>Q02UQ8 ~~~~~~Uncharacterized protein PA14_02130~~~
MSDLHIPGTQSTPAIQGDWQAGRLSMQGDSYPENSYELFGQVIDWVERFLADGQRPLELDLRLLYLNTSSIKAMMDILDL
LEEAHQGGRPVSLRWHYDRRNERVAELAEEFREDCSFPFAIQAHDE
>P9WFN1 ~~~~~~UPF0098 protein Rv2140c~~~COG1881
MTTSPDPYAALPKLPSFSLTSTSITDGQPLATPQVSGIMGAGGADASPQLRWSGFPSETRSFAVTVYDPDAPTLSGFWHW
AVANLPANVTELPEGVGDGRELPGGALTLVNDAGMRRYVGAAPPPGHGVHRYYVAVHAVKVEKLDLPEDASPAYLGFNLF
QHAIARAVIFGTYEQR
>A3DHB8 ~~~~~~Nucleoid-associated protein Cthe_2143~~~COG0718
MAKGGFPGFGGNINNLVKQAQKMQRDMERVQEELKEKTVEASAGGGAVTVVATGRKDIKEITIKPEVVDPDDVEMLQDLI
LAAVNEALRKADEMVTAEISKITGGLGGIPGLF
>Q72L49 ~~~~~~UPF0340 protein TT_C0214~~~COG4475
MEGIRRAAQRAVEEFLQAFPMGPGSLFVLGGSTSEVLGERVGTRPSLEAAHAVLEGLLPPLLERGVHVAVQACEHLNRAL
VVERETARAFGLEEVAVFPHPKAGGALATAAFLRFQDPVVVESLKAQAHGGMDIGGVLIGMHLRPVAVPLRLSVRKIGEA
VLLAAKTRPKLVGGARAVYTREEMLKKLEEFLPKPP
>Q45222 ~~~~~~Uncharacterized protein blr2150~~~
MIQTERAVQQVLEWGRSLTGFADEHAVEAVRGGQYILQRIHPSLRGTSARTGRDPQDETLIVTFYRELALLFWLDDCNDL
GLISPEQLAAVEQALGQGVPCALPGFEGCAVLRASLATLAYDRRDYAQLLDDTRCYSAALRAGHAQAVAAERWSYAEYLH
NGIDSIAYANVFCCLSLLWGLDMATLRARPAFRQVLRLISAIGRLQNDLHGCDKDRSAGEADNAVILLLQRYPAMPVVEF
LNDELAGHTRMLHRVMAEERFPAPWGPLIEAMAAIRVQYYRTSTSRYRSDAVRGGQRAPA
>Q7A3W5 ~~~~~~Uncharacterized lipoprotein SA2158~~~
MKRLVTGLLALSLFLAACGQDSDQQKDSNKEKDDKAKTEQQDKKTNDSSKDKKDNKDDSKDVNKDNKDNSANDNQQQSNS
NATNNDQNQTNNNQSSNNQKSSYVAPYYGQNAAPVARQIYPFNGNKTQALQQLPNFQTALNAANNEANKFGSNNKVYNDY
SIEEHNGNYKYVFSFKDPNANGKYSIVTVDYTGQAMVTDPNYQQ
>O53509 ~~~~~~DNA-binding protein Rv2175c~~~
MPGRAPGSTLARVGSIPAGDDVLDPDEPTYDLPRVAELLGVPVSKVAQQLREGHLVAVRRAGGVVIPQVFFTNSGQVVKS
LPGLLTILHDGGYRDTEIMRWLFTPDPSLTITRDGSRDAVSNARPVDALHAHQAREVVRRAQAMAY
>Q8EF26 ~~~~~~UPF0352 protein SO_2176~~~COG3082
MAIQSKYSNTQVESLIAEILVVLEKHKAPTDLSLMALGNCVTHLLERKVPSESRQAVAEQFAKALAQSVKSN
>O53518 ~~~~~~Protein Rv2184c~~~COG0003
MSDSGTPAQARISLFVGKGGVGKSTLASATAVCDAGAGQRVLVVSTDQAHSLGDVLGIAVPPTGQGDPVRVLAYDPEAGG
GFLDALALDTLALLEGRWLHVVETLDRRFPGSELSSIAPEELCALPGIQEVLGLHAVGELAAARRWDRIVVDCASTADAL
RMLTLPATFGLYVERAWPRHRRLSIGADDGRSAVLAELLERIRASVERLSTLLTDGALVSAHLVLTPERVVAAEAVRTLG
SLALMGVRVEELLVNQLLVQDENYEYRSLPDHPAFHWYAERIGEQRAVLDDLDATIGDVALVLVPHLAGEPIGPKALGGL
LDSARRRQGSAPPGPLQPIVDLESGSGLASIYRLRLALPQLDPGTLTLGRADDDLIVSAGGMRRRVRLASVLRRCTVLDA
HLRGGELTVRFRPNPEVWPT
>Q9ZB78 ~~~~~~Uncharacterized protein MG218.1~~~COG1269
MVNNEYQQLNTLVESDDEADLVIANLVKQLNELKQILVSLDNQEASATAVTDKKEEEYNQNQSSFHNFSKETLQKQAKRG
FLLLERCSLVGLQQLELEYVNLLGRSFDSYQQKTELLNNLKELVDEHFSDTEKIINTLEKIFDVIGGSEYTPVLNSFFNK
LLSDPDPIQREIGLRQFIITLRQRFKKLSQKIDSSLKQIETEAKIATEQVQNSEVMFGPPDIANDHELNLNWPDSETDAI
LSSMENELEAALLAKHQEEPPLIVTPPSLIKPTVSQPEVEVVTPTNNTNFQPQVDLKPTDLKKQQKKKPLNFITRPVFKS
NLPPKLSKDDIVHYAHQLLEKNTHNE
>P9WLJ1 3.1.-.-~~~~~~Putative bifunctional exonuclease/endonuclease protein Rv2191~~~COG0322
MQGPNVAAMGATGGTQLSFADLAHAQGAAWTPADEMSLRETTFVVVDLETTGGRTTGNDATPPDAITEIGAVKVCGGAVL
GEFATLVNPQHSIPPQIVRLTGITTAMVGNAPTIDAVLPMFFEFAGDSVLVAHNAGFDIGFLRAAARRCDITWPQPQVLC
TMRLARRVLSRDEAPSVRLAALARLFAVASNPTHRALDDARATVDVLHALIERVGNQGVHTYAELRSYLPNVTQAQRCKR
VLAETLPHRPGVYLFRGPSGEVLYVGTAADLRRRVSQYFNGTDRRKRMTEMVMLASSIDHVECAHPLEAGVRELRMLSTH
APPYNRRSKFPYRWWWVALTDEAFPRLSVIRAPRHDRVVGPFRSRSKAAETAALLARCTGLRTCTTRLTRSARHGPACPE
LEVSACPAARDVTAAQYAEAVLRAAALIGGLDNAALAAAVQQVTELAERRRYESAARLRDHLATAIEALWHGQRLRALAA
LPELIAAKPDGPREGGYQLAVIRHGQLAAAGRAPRGVPPMPVVDAIRRGAQAILPTPAPLGGALVEEIALIARWLAEPGV
RIVGVSNDAAGLASPVRSAGPWAAWAATARSAQLAGEQLSRGWQSDLPTEPHPSREQLFGRTGVDCRTGPPQPLLPGRQP
FSTAG
>P9WLI9 ~~~~~~Uncharacterized protein Rv2197c~~~
MVSRYSAYRRGPDVISPDVIDRILVGACAAVWLVFTGVSVAAAVALMDLGRGFHEMAGNPHTTWVLYAVIVVSALVIVGA
IPVLLRARRMAEAEPATRPTGASVRGGRSIGSGHPAKRAVAESAPVQHADAFEVAAEWSSEAVDRIWLRGTVVLTSAIGI
ALIAVAAATYLMAVGHDGPSWISYGLAGVVTAGMPVIEWLYARQLRRVVAPQSS
>P9WLI7 ~~~~~~Uncharacterized protein Rv2203~~~
MPGPHSPNPGVGTNGPAPYPEPSSHEPQALDYPHDLGAAEPAFAPGPADDAALPPAAYPGVPPQVSYPKRRHKRLLIGIV
VALALVSAMTAAIIYGVRTNGANTAGTFSEGPAKTAIQGYLNALENRDVDTIVRNALCGIHDGVRDKRSDQALAKLSSDA
FRKQFSQVEVTSIDKIVYWSQYQAQVLFTMQVTPAAGGPPRGQVQGIAQLLFQRGQVLVCSYVLRTAGSY
>P9WMN5 ~~~~~~Protein Rv2204c~~~COG0316
MTVQNEPSAKTHGVILTEAAAAKAKSLLDQEGRDDLALRIAVQPGGCAGLRYNLFFDDRTLDGDQTAEFGGVRLIVDRMS
APYVEGASIDFVDTIEKQGFTIDNPNATGSCACGDSFN
>P9WMT7 ~~~~~~Uncharacterized protein Rv2205c~~~COG1929
MKGSQASDDATGSLGPGRLQLPAMRVLVAPDCYGDSLSAVEAAAAIATGWTRSRPGDSFIVAPQSDGGPGFVEVLGSRLG
ETRRLRVCGPLNTVVNAAWVFDPGSATAYLECAQACGLGLLGGPPTPETALAAHSKGVGQLIAAALRAGAARIVVGLGGS
ACTDGGKGMIAELGGLDAARRQLADVEVIAASDVEYPLLGPWGTARVFAPQKGADMATVAVLEGRLAAWAIELDAAAGRG
VSAEPGAGAAGGIGAGLLAVGGRYQSGAAIIAEHTHFADDLADAELIVTGEGRFDEQSLHGKVVGAIAAAARPLAIPVIV
LAGQVSLDKSALRSAGIMAALSIAEYAGSVRLALADAANQLMGLASQVAARLGNSGPSGYR
>P9WLI5 ~~~~~~Uncharacterized protein Rv2206~~~
MKLLGHRKSHGHQRADASPDAGSKDGCRPDSGRTSGSDTSRGSQTTGPKGRPTPKRNQSRRHTKKGPVAPAPMTAAQARA
RRKSLAGPKLSREERRAEKAANRARMTERRERMMAGEEAYLLPRDRGPVRRYVRDVVDSRRNLLGLFMPSALTLLFVMFA
VPQVQFYLSPAMLILLALMTIDAIILGRKVGRLVDTKFPSNTESRWRLGLYAAGRASQIRRLRAPRPQVERGGDVG
>P9WMU7 ~~~~~~Uncharacterized protein Rv2212~~~COG2114
MYDSLDFDALEAAGIANPRERAGLLTYLDELGFTVEEMVQAERRGRLFGLAGDVLLWSGPPIYTLATAADELGLSADDVA
RAWSLLGLTVAGPDVPTLSQADVDALATWVALKALVGEDGAFGLLRVLGTAMARLAEAESTMIRAGSPNIQMTHTHDELA
TARAYRAAAEFVPRIGALIDTVHRHHLASARTYFEGVIGDTSASVTCGIGFADLSSFTALTQALTPAQLQDLLTEFDAAV
TDVVHADGGRLVKFIGDAVMWVSSSPERLVRAAVDLVDHPGARAAELQVRAGLAYGTVLALNGDYFGNPVNLAARLVAAA
APGQILAAAQLRDMLPDWPALAHGPLTLKGFDAPVMAFELHDNPRARDADTPSPAASD
>P9WGP7 ~~~~~~Epimerase family protein Rv2216~~~COG1090
MANAVVAIAGSSGLIGSALTAALRAADHTVLRIVRRAPANSEELHWNPESGEFDPHALTDVDAVVNLCGVNIAQRRWSGA
FKQSLRDSRITPTEVLSAAVADAGVATLINASAVGYYGNTKDRVVDENDSAGTGFLAQLCVDWETATRPAQQSGARVVLA
RTGVVLSPAGGMLRRMRPLFSVGLGARLGSGRQYMSWISLEDEVRALQFAIAQPNLSGPVNLTGPAPVTNAEFTTAFGRA
VNRPTPLMLPSVAVRAAFGEFADEGLLIGQRAIPSALERAGFQFHHNTIGEALGYATTRPG
>Q01609 ~~~~~~Uncharacterized protein PA2218~~~
MENAMETKHSNRARSRKGALRGAVLAGALMALVGCQTSPAATTSSNTGGTNMQLQLTQEWDKTFPLSAKVEHRKVTFANR
YGITLAADLYLPKNRGGDRLPAIVIGGPFGAVKEQSSGLYAQTMAERGFVTLAFDPSYTGESGGQPRNVASPDINTEDFS
AAVDFISLLPEVNRERIGVIGICGWGGMALNAVAVDKRVKAVVTSTMYDMTRVMSKGYNDSVTLEQRTRTLEQLGQQRWK
DAESGTPAYQPPYNELKGGEAQFLVDYHDYYMTPRGYHPRAVNSGNAWTMTTPLSFMNMPILTYIKEISPRPILLIHGER
AHSRYFSETAYAAAAEPKELLIVPGASHVDLYDRLDRIPFDRIAGFFDEHL
>P9WLI1 ~~~~~~Uncharacterized protein Rv2219~~~
MAKPRNAAESKAAKAQANAARKAAARQRRAQLWQAFTLQRKEDKRLLPYMIGAFLLIVGASVGVGVWAGGFTMFTMIPLG
VLLGALVAFVIFGRRAQRTVYRKAEGQTGAAAWALDNLRGKWRVTPGVAATGNLDAVHRVIGRPGVIFVGEGSAARVKPL
LAQEKKRTARLVGDVPIYDIIVGNGDGEVPLAKLERHLTRLPANITVKQMDTVESRLAALGSRAGAGVMPKGPLPTTAKM
RSVQRTVRRK
>P9WKB7 2.3.1.20~~~~~~Putative diacyglycerol O-acyltransferase Rv0221~~~COG1020
MKRLSGWDAVLLYSETPNVHMHTLKVAVIELDSDRQEFGVDAFREVIAGRLHKLEPLGYQLVDVPLKFHHPMWREHCQVD
LNYHIRPWRLRAPGGRRELDEAVGEIASTPLNRDHPLWEMYFVEGLANHRIAVVAKIHHALADGVASANMMARGMDLLPG
PEVGRYVPDPAPTKRQLLSAAFIDHLRHLGRIPATIRYTTQGLGRVRRSSRKLSPALTMPFTPPPTFMNHRLTPERRFAT
ATLALIDVKATAKLLGATINDMVLAMSTGALRTLLLRYDGKAEPLLASVPVSYDFSPERISGNRFTGMLVALPADSDDPL
QRVRVCHENAVSAKESHQLLGPELISRWAAYWPPAGAEALFRWLSERDGQNKVLNLNISNVPGPRERGRVGAALVTEIYS
VGPLTAGSGLNITVWSYVDQLNISVLTDGSTVQDPHEVTAGMIADFIEIRRAAGLSVELTVVESAMAQA
>P9WLH9 ~~~~~~Uncharacterized protein Rv2226~~~COG3025
MPVEAPRPARHLEVERKFDVIESTVSPSFEGIAAVVRVEQSPTQQLDAVYFDTPSHDLARNQITLRRRTGGADAGWHLKL
PAGPDKRTEMRAPLSASGDAVPAELLDVVLAIVRDQPVQPVARISTHRESQILYGAGGDALAEFCNDDVTAWSAGAFHAA
GAADNGPAEQQWREWELELVTTDGTADTKLLDRLANRLLDAGAAPAGHGSKLARVLGATSPGELPNGPQPPADPVHRAVS
EQVEQLLLWDRAVRADAYDAVHQMRVTTRKIRSLLTDSQESFGLKESAWVIDELRELADVLGVARDAEVLGDRYQRELDA
LAPELVRGRVRERLVDGARRRYQTGLRRSLIALRSQRYFRLLDALDALVSERAHATSGEESAPVTIDAAYRRVRKAAKAA
KTAGDQAGDHHRDEALHLIRKRAKRLRYTAAATGADNVSQEAKVIQTLLGDHQDSVVSREHLIQQAIAANTAGEDTFTYG
LLYQQEADLAERCREQLEAALRKLDKAVRKARD
>P0A5B0 ~~~~~~Protein Mb2227c~~~
MTVQNEPSAKTHGVILTEAAAAKAKSLLDQEGRDDLALRIAVQPGGCAGLRYNLFFDDRTLDGDQTAEFGGVRLIVDRMS
APYVEGASIDFVDTIEKQGFTIDNPNATGSCACGDSFN
>P9WLH3 ~~~~~~Uncharacterized protein Rv2229c~~~COG1579
MKAGVAQQRSLLELAKLDAELTRIAHRATHLPQRAAYQQVQAEHNAANDRMAALRIAAEDLDGQVSRFESEIDAVRKRGD
RDRSLLTSGATDAKQLADLQHELDSLQRRQASLEDALLEVLERREELQAQQTAESRALQALRADLAAAQQALDEALAEID
QARHQHSSQRDMLTATLDPELAGLYERQRAGGGPGAGRLQGHRCGACRIEIGRGELAQISAAAEDEVVRCPECGAILLRL
EGFEE
>P9WQ89 2.6.1.-~~~~~~Uncharacterized aminotransferase Rv2231c~~~COG0079
MLWILGPHTGPLLFDAVASLDTSPLAAARYHGDQDVAPGVLDFAVNVRHDRPPEWLVRQLAALLPELARYPSTDDVHRAQ
DAVAERHGRTRDEVLPLVGAAEGFALLHNLSPVRAAIVVPAFTEPAIALSAAGITAHHVVLKPPFVLDTAHVPDDADLVV
VGNPTNPTSVLHLREQLLELRRPGRILVVDEAFADWVPGEPQSLADDSLPDVLVLRSLTKTWSLAGLRVGYALGSPDVLA
RLTVQRAHWPLGTLQLTAIAACCAPRAVAAAAADAVRLTALRAEMVAGLRSVGAEVVDGAAPFVLFNIADADGLRNYLQS
KGIAVRRGDTFVGLDARYLRAAVRPEWPVLVAAIAEWAKRGGRR
>P9WGA7 ~~~~~~Uncharacterized SURF1-like protein Rv2235~~~COG3346
MPRLAFLLRPGWLALALVVVAFTYLCFTVLAPWQLGKNAKTSRENQQIRYSLDTPPVPLKTLLPQQDSSAPDAQWRRVTA
TGQYLPDVQVLARLRVVEGDQAFEVLAPFVVDGGPTVLVDRGYVRPQVGSHVPPIPRLPVQTVTITARLRDSEPSVAGKD
PFVRDGFQQVYSINTGQVAALTGVQLAGSYLQLIEDQPGGLGVLGVPHLDPGPFLSYGIQWISFGILAPIGLGYFAYAEI
RARRREKAGSPPPDKPMTVEQKLADRYGRRR
>P9WLH1 ~~~~~~Uncharacterized protein Rv2237~~~COG3662
MLLPAANVIMQLAVPGVGYGVLESPVDSGNVYKHPFKRARTTGTYLAVATIGTESDRALIRGAVDVAHRQVRSTASSPVS
YNAFDPKLQLWVAACLYRYFVDQHEFLYGPLEDATADAVYQDAKRLGTTLQVPEGMWPPDRVAFDEYWKRSLDGLQIDAP
VREHLRGVASVAFLPWPLRAVAGPFNLFATTGFLAPEFRAMMQLEWSQAQQRRFEWLLSVLRLADRLIPHRAWIFVYQLY
LWDMRFRARHGRRIV
>P9WLG9 ~~~~~~Uncharacterized protein Rv2239c~~~
MPIATVCTWPAETEGGSTVVAADHASNYARKLGIQRDQLIQEWGWDEDTDDDIRAAIEEACGGELLDEDTDEVIDVVLLW
WRDGDGDLVDTLMDAIGPLAEDGVIWVVTPKTGQPGHVLPAEIAEAAPTAGLMPTSSVNLGNWSASRLVQPKSRAGKR
>P9WLG7 ~~~~~~Uncharacterized protein Rv2240c~~~
MLIGWRAVPRRHGGELPRRGALALGCIALLLMGIVGCTTVTDGTAMPDTNVAPAYRSSVSASVSASAATSSIRESQRQQS
LTTKAIRTSCDALAATSKDAIDKVNAYVAAFNQGRNTGPTEGPAIDALNNSASTVSGSLSAALSAQLGDALNAYVDAARA
VANAIGAHASTAEFNRRVDRLNDTKTKALTMCVAAF
>P9WPH5 ~~~~~~Uncharacterized protein Rv2242~~~COG3835
MNDNQLAPVARPRSPLELLDTVPDSLLRRLKQYSGRLATEAVSAMQERLPFFADLEASQRASVALVVQTAVVNFVEWMHD
PHSDVGYTAQAFELVPQDLTRRIALRQTVDMVRVTMEFFEEVVPLLARSEEQLTALTVGILKYSRDLAFTAATAYADAAE
ARGTWDSRMEASVVDAVVRGDTGPELLSRAAALNWDTTAPATVLVGTPAPGPNGSNSDGDSERASQDVRDTAARHGRAAL
TDVHGTWLVAIVSGQLSPTEKFLKDLLAAFADAPVVIGPTAPMLTAAHRSASEAISGMNAVAGWRGAPRPVLARELLPER
ALMGDASAIVALHTDVMRPLADAGPTLIETLDAYLDCGGAIEACARKLFVHPNTVRYRLKRITDFTGRDPTQPRDAYVLR
VAATVGQLNYPTPH
>P9WJZ9 2.1.1.-~~~~~~Uncharacterized methyltransferase Rv0224c~~~COG0500
MAVTDVFARRATLRRSLRLLADFRYEQRDPARFYRTLAADTAAMIGDLWLATHSEPPVGRTLLDVGGGPGYFATAFSDAG
VGYIGVEPDPDEMHAAGPAFTGRPGMFVRASGMALPFADDSVDICLSSNVAEHVPRPWQLGTEMLRVTKPGGLVVLSYTV
WLGPFGGHEMGLSHYLGGARAAARYVRKHGHPAKNNYGSSLFAVSAAEGLRWAAGTGAALAVFPRYHPRWAWWLTSVPVL
REFLVSNLVLVLTP
>P9WMC5 ~~~~~~Uncharacterized HTH-type transcriptional regulator Rv2250c~~~COG1309
MLSMSNDRADTGGRILRAAASCVVDYGVDRVTLAEIARRAGVSRPTVYRRWPDTRSIMASMLTSHIADVLREVPLDGDDR
EALVKQIVAVADRLRGDDLIMSVMHSELARVYITERLGTSQQVLIEGLAARLTVAQRSGSVRSGDARRLATMVLLIAQST
IQSADIVDSILDSAALATELTHALNGYLC
>O53532 2.1.1.-~~~~~~S-adenosylmethionine-dependent methyltransferase Rv2258c~~~COG2230
MSGALETTEEFGNRFVAAIDSAGLAILVSVGHQTGLLDTMAGLPPATSMEIAEAAGLEERYVREWLGGMTTGQIVEYDAG
SSTYSLPAHRAGMLTRAAGPDNLAVIAQFVSLLGEVEQKVIRCFREGGGVPYSEYPRFHKLMAEMSGMVFDAALIDVVLP
LVDGLPDRLRSGADVADFGCGSGRAVKLMAQAFGASRFTGIDFSDEAVAAGTEEAARLGLANATFERHDLAELDKVGAYD
VITVFDAIHDQAQPARVLQNIYRALRPGGVLLMVDIKASSQLEDNVGVPLSTYLYTTSLMHCMTVSLALDGAGLGTVWGR
QLATSMLADAGFTDVTVAEIESDVLNNYYIARK
>P62035 ~~~~~~Probable transcriptional regulatory protein DVU_2259~~~COG0217
MAGHSKWANIQHRKGRQDAKRGKMFTKAAKEIIIAAKAGGDPVGNSRLRAAIAAAKAINLPKDKIENAIKKGTGELAGGD
ILEMAYEGYGPGGVALIVEVATDNKNRTVAEVRHILSKHGGSMGESGCVAWMFDRKGVITLEKDKYTEEQLMEVALEAGA
EDVTDEGESWEVVTAAADFNAVREALEAAGVEMQSAEFTMVPQNEIEVDAETGRKLMRLVDALEDNDDVQNVHANFDLPD
ELLAELG
>Q7A3L9 1.-.-.-~~~~~~Uncharacterized oxidoreductase SA2266~~~
MTVLTDKIAVVTGAGSGIGEAIATLLHEEGAKVVLAGRNKDKLQNVANQLAQDSVKVVPTDVTNKEEVDELMKIAQQTFG
GLDIVINSAGQMLSSKITDYQVDEWDSMIDVNIKGTLYTAQAALPTMLEQSSGHLINIASISGFEVTKSSTIYSATKAAV
HTITQGLEKELAKTGVKVTSISPGMVDTAITAAYNPSDRKKLDPQDIAEAVLYALTQPKHVNVNEITVRPV
>P9WLF9 ~~~~~~Uncharacterized protein Rv2269c~~~
MANDARPLARLANCRVGDQSSATHAYTVGPVLGVPPTGGVDLRYGGRAGIGRSETVTDHGAVGRRYHQPCAGQIRLSELR
VTILLRCETLCETAQLLRCPPLPCDCSTPL
>Q7A3L7 ~~~~~~Uncharacterized protein SA2269~~~
MIHSKKLTLGICLVLLIILIVGYVIMTKTNGRNAQIKDTFNQTLKLYPTKNLDDFYDKEGFRDQEFKKGDKGTWIVNSEM
VIEPKGKDMETRGMVLYINRNTRTTKGYYFISEMTDDSNGRPKDDEKRYPVKMEHNKIIPTKPLPNDKLRKEIENFKFFV
QYGDFKDINDYKDGDISYNPNVPSYSAKYQLKNDDYNVKQLRKRYNIPTNKAPKLLIKGDGDLKGSSVGSKNLEFTFVEN
KEENIYFTDSVQYTSSEDTSYESN
>P9WLF7 ~~~~~~Uncharacterized protein Rv2271~~~
MTTPPDKARRRFLRDAYKNAERVARTALLTIDQDQLEQLLDYVDERLGEQPCDHTARHAQRWAQSHRIEWETLAEGLQEF
GGYCDCEIVMNVEPEAIFG
>P9WLF5 ~~~~~~Uncharacterized protein Rv2272~~~COG2149
MADDSNDTATDVEPDYRFTLANERTFLAWQRTALGLLAAAVALVQLVPELTIPGARQVLGVVLAILAILTSGMGLLRWQQ
ADRAMRRHLPLPRHPTPGYLAVGLCVVGVVALALVVAKAITG
>Q7A3L3 ~~~~~~Uncharacterized lipoprotein SA2273~~~
MIHSKKLTLGICLVLLIILIGGCIIMTKINSRNAQIKDTFNQTLNVYPTKNLDDFYDKEGFRDQEFDKRDKGTWIINSGM
YIQLKGGALKSRAMVLYINRNTRTAKGYFLISETTEDKKGYVHNKDKKYPVKMERNRIIPTKPITDEKLKKEIENFKFFV
QYGNFKDFKDYKDGDISYNPNVPSYSAKYQLNNDDYNVQQLRKRYDISTKRAPELKLRGSGDLKGSSVGSKELEFNFVRN
KEENVYFSDGINFKPTEEMNHEQN
>P9WLF1 ~~~~~~Uncharacterized protein Rv2277c~~~COG0584
MPGRFTVALVIALGGTCGVADALPLGQTDDPMIVAHRAGTRDFPENTVLAITNAVAAGVDGMWLTVQVSSDGVPVLYRPS
DLATLTDGAGPVNSKTVQQLQQLNAGWNFTTPGVEGHPYRQRATPIPTLEQAIGATPPDMTLFLDLKQTPPQPLVSAVAQ
VLTRTGAAGRSIVYSTNADITAAASRQEGLQVAESRDVTRQRLFNMALNHHCDPQPDPGKWAGFELHRDVTVTEEFTLGS
GISAVNAELWDEASVDCFRSQSGMKVMGFAVKTVDDYRLAHKIGLDAVLVDSPLAAQQWRH
>P9WIT1 1.-.-.-~~~~~~Uncharacterized FAD-linked oxidoreductase Rv2280~~~COG0277
MSEMTARFSEIVGNANLLTGDAIPEDYAHDEELTGPPQKPAYAAKPATPEEVAQLLKAASENGVPVTARGSGCGLSGAAR
PVEGGLLISFDRMNKVLEVDTANQVAVVQPGVALTDLDAATADTGLRYTVYPGELSSSVGGNVGTNAGGMRAVKYGVARH
NVLGLQAVLPTGEIIRTGGRMAKVSTGYDLTQLIIGSEGTLALVTEVIVKLHPRLDHNASVLAPFADFDQVMAAVPKILA
SGLAPDILEYIDNTSMAALISTQNLELGIPDQIRDSCEAYLLVALENRIADRLFEDIQTVGEMLMELGAVDAYVLEGGSA
RKLIEAREKAFWAAKALGADDIIDTVVPRASMPKFLSTARGLAAAADGAAVGCGHAGDGNVHMAIACKDPEKKKKLMTDI
FALAMELGGAISGEHGVGRAKTGYFLELEDPVKISLMRRIKQSFDPAGILNPGVVFGDT
>P9WMF3 ~~~~~~Uncharacterized HTH-type transcriptional regulator Rv2282c~~~COG0583
MPLSSRMPGLTCFEIFLAIAEAGSLGGAARELGLTQQAVSRRLASMEAQIGVRLAIRTTRGSQLTPAGIVVAEWAARLLE
VADEIDAGLGSLRTEGRQRIRVVASQTIAEQLMPHWMLSLRAADMRRGGTVPEVILTATNSEHAIAAVRDGIADLGFIEN
PCPPTGLGSVVVARDELVVVVPPGHKWARRSRVVSARELAQTPLVTREPNSGIRDSLTAALRDTLGEDMQQAPPVLELSS
AAAVRAAVLAGAGPAAMSRLAIADDLAFGRLLAVDIPALNLRRQLRAIWVGGRTPPAGAIRDLLSHITSRST
>P9WKB5 2.3.1.20~~~~~~Putative diacyglycerol O-acyltransferase Rv2285~~~COG1020
MKLLSPLDQMFARMEAPRTPMHIGAFAVFDLPKGAPRRFIRDLYEAISQLAFLPFPFDSVIAGGASMAYWRQVQPDPSYH
VRLSALPYPGTGRDLGALVERLHSTPLDMAKPLWELHLIEGLTGRQFAMYFKAHHCAVDGLGGVNLIKSWLTTDPEAPPG
SGKPEPFGDDYDLASVLAAATTKRAVEGVSAVSELAGRLSSMVLGANSSVRAALTTPRTPFNTRVNRHRRLAVQVLKLPR
LKAVAHATDCTVNDVILASVGGACRRYLQELGDLPTNTLTASVPVGFERDADTVNAASGFVAPLGTSIEDPVARLTTISA
STTRGKAELLAMSPNALQHYSVFGLLPIAVGQKTGALGVIPPLFNFTVSNVVLSKDPLYLSGAKLDVIVPMSFLCDGYGL
NVTLVGYTDKVVLGFLGCRDTLPHLQRLAQYTGAAFEELETAALP
>P9WLE7 ~~~~~~Uncharacterized protein Rv2286c~~~COG2761
MTTVDFHFDPLCPFAYQTSVWIRDVRAQLGITINWRFFSLEEINLVAGKKHPWERDWSYGWSLMRIGALLRRTNMSLLDR
WYAAIGHELHTLGGKPHDPAVARRLLCDVGVNAAILDAALDDPTTHDDVRADHQRVVAAGGYGVPTLFLDGQCLFGPVLV
DPPAGPAALNLWSVVTGMAGLPHVYELQRPKSPADVELIAQQLRPYLDGRDWVSINRGEIVDIDRLAGRS
>P9WJI3 ~~~~~~Uncharacterized Na(+)/H(+) exchanger Rv2287~~~COG0025
MNGRRTIGEDGLVFGLVVIVALVAAVVVGTVLGHRYRVGPPVLLILSGSLLGLIPRFGDVQIDGEVVLLLFLPAILYWES
MNTSFREIRWNLRVIVMFSIGLVIATAVAVSWTARALGMESHAAAVLGAVLSPTDAAAVAGLAKRLPRRALTVLRGESLI
NDGTALVLFAVTVAVAEGAAGIGPAALVGRFVVSYLGGIMAGLLVGGLVTLLRRRIDAPLEEGALSLLTPFAAFLLAQSL
KCSGVVAVLVSALVLTYVGPTVIRARSRLQAHAFWDIATFLINGSLWVFVGVQIPGAIDHIAGEDGGLPRATVLALAVTG
VVIATRIAWVQATTVLGHTVDRVLKKPTRHVGFRQRCVTSWAGFRGAVSLAAALAVPMTTNSGAPFPDRNLIIFVVSVVI
LVTVLVQGTSLPTVVRWARMPEDVAHANELQLARTRSAQAALDALPTVADELGVAPDLVKHLEKEYEERAVLVMADGADS
ATSDLAERNDLVRRVRLGVLQHQRQAVTTLRNQNLIDDIVLRELQAAMDLEEVQLLDPADAE
>P9WFL7 ~~~~~~UPF0167 protein Rv2295~~~COG3196
MPVEETSTPQKLPQFRYHPDPVGTGSIVADEVSCVSCEQRRPYTYTGPVYAEEELNEAICPWCIADGSAASRFDATFTDA
MWAVPDDVPEDVTEEVLCRTPGFTGWLQEEWLHHCGDAAAFLGPVGASEVADLPDALDALRNEYRGYDWPADKIEEFILT
LDRNGLATAYLFRCLSCGVHLAYADFA
>P9WLD9 ~~~~~~Uncharacterized protein Rv2297~~~
MAMEMAMMGLLGTVVGASAMGIGGIAKSIAEAYVPGVAAAKDRRQQMNVDLQARRYEAVRVWRSGLCSASNAYRQWEAGS
RDTHAPNVVGDEWFEGLRPHLPTTGEAAKFRTAYEVRCDNPTLMVLSLEIGRIEKEWMVEASGRTPKHRG
>P9WQA7 1.-.-.-~~~~~~Uncharacterized oxidoreductase Rv2298~~~COG0667
MKYLDVDGIGQVSRIGLGTWQFGSREWGYGDRYATGAARDIVKRARALGVTLFDTAEIYGLGKSERILGEALGDDRTEVV
VASKVFPVAPFPAVIKNRERASARRLQLNRIPLYQIHQPNPVVPDSVIMPGMRDLLDSGDIGAAGVSNYSLARWRKADAA
LGRPVVSNQVHFSLAHPDALEDLVPFAELENRIVIAYSPLAQGLLGGKYGLENRPGGVRALNPLFGTENLRRIEPLLATL
RAIAVDVDAKPAQVALAWLISLPGVVAIPGASSVEQLEFNVAAADIELSAQSRDALTDAARAFRPVSTGRFLTDMVREKV
SRR
>Q7U2J0 2.1.1.-~~~~~~Uncharacterized methyltransferase Mb0229c~~~
MAVTDVFARRATLRRSLRLLADFRYEQRDPARFYRTLAADTAAMIGDLWLATHSEPPVGRTLLDVGGGPGYFATAFSDAG
VGYIGVEPDPDEMHAAGPAFTGRPGMFVRASGMALPLADDSVDICLSSNVAEHVPRPWQLGTEMLRVTKPGGLVVLSYTV
WLGPFGGHEMGLSHYLGGARAAARYVRKHGHPAKNNYGSSLFAVSAAEGLRWAAGTGAALAVFPRYHPRWAWWLTSVPVL
REFLVSNLVLVLTP
>P9WLD7 3.-.-.-~~~~~~Probable metallo-hydrolase Rv2300c~~~COG0491
MVATRGRPCPTNFSRPQRPRVAGNGTKSQRCRGRLTTSMLGVAPEAKGPPVKVHHLNCGTMNAFGIALLCHVLLVETDDG
LVLVDTGFGIQDCLDPGRVGLFRHVLRPAFLQAETAARQIEQLGYRTSDVRHIVLTHFDFDHIGGIADFPEAHLHVTAAE
ARGAIHAPSLRERLRYRRGQWAHGPKLVEHGPDGEPWRGFASAKPLDSIGTGVVLVPMPGHTRGHAAVAVDAGHRWVLHC
GDAFYHRGTLDGRFRVPFVMRAEEKLLSYNRNQLRDNQARIVELHRRHDPDLLIVCAHDPDLYQLARDTA
>P9WLD5 ~~~~~~Uncharacterized protein Rv2302~~~COG2905
MHAKVGDYLVVKGTTTERHDQHAEIIEVRSADGSPPYVVRWLVNGHETTVYPGSDAVVVTATEHAEAEKRAAARAGHAAT
>Q3JRV4 3.-.-.-~~~~~~Probable metallo-hydrolase BURPS1710b_2304~~~
MTVEGFFDPATCTISYLLFDSGSGECALIDSVLDYDPKSGRTRTASADQLIARVAALGARVRWLLETHVHADHLSAAPYL
KTRVGGEIAIGRHVTRVQDVFGKLFNAGPAFAHDGSQFDRLLDDGDTLALGALSIRAMHTPGHTPACMTYVVTEAHAAHD
ARDAAAFVGDTLFMPDYGTARCDFPGGDARSLYRSIRKVLSLPPATRLYMCHDYQPNGRAIQYASTVADELRENVHIREG
VTEDDFVAMRTARDATLDMPVLMLPSVQVNMRAGRLPEPEDNGVRYLKIPLDAI
>P72255 ~~~~~~UPF0758 protein~~~
MLRERFLTGGAEAMPDYELLELILFRALPRQDVKPLARRLLDVFGSFGHVLAAPPARLTEVTGVGEAVVQELKIVEAAAQ
RLARSRVLQRPVLSSWQALLDYCHTAMAHRPTEQFRVLFLDRKNLLIADEEQARGTVDHVPVYPREVVKRALELDASALI
LVHNHPSGDPTPSQADIAMTDQIRHAAEALGLVLHDHLVIGKGRELSFRAEGLL
>P9WLD1 ~~~~~~Uncharacterized protein Rv2305~~~COG1020
MTQTLRLTALDEMFITDDIDIVPSVQIEARVSGRFDLDRLAAALRAAVAKHALARARLGRASLTARTLYWEVPDRADHLA
VEITDEPVGEVRSRFYARAPELHRSPVFAVAVVRETVGDRLLLNFHHAAFDGMGGLRLLLSLARAYAGEPDEVGGPPIEE
ARNLKGVAGSRDLFDVLIRARGLAKPAIDRKRTTRVAPDGGSPDGPRFVFAPLTIESDEMATAVARRPEGATVNDLAMAA
LALTILQWNRTHDVPAADSVSVNMPVNFRPTAWSTEVISNFASYLAIVLRVDEVTDLEKATAIVAGITGPLKQSGAAGWV
VDLLEGGKVLPAMLKRQLQLLLPLVEDRFVESVCLSNLGRVDVPAFGGEAGDTTEVWFSPTAAMSVMPIGVGLVGFGGTL
RAMFRGDGRTIGGEALGRFAALYRDTLLT
>P9WLC7 ~~~~~~Uncharacterized protein Rv2307c~~~COG1073
MSLKRCRALPVVAIVALVASGVIMFIWSQQRRLIYFPSAGPVPSASSVLPAGRDVVVETQDGMRLGGWYFPHTSGGSGPA
VLVCNGNAGDRSMRAELAVALHGLGLSVLLFDYRGYGGNPGRPSEQGLAADARAAQEWLSGQSDVDPARIAYFGESLGAA
VAVGLAVQRPPAALVLRSPFTSLAEVGAVHYPWLPLRRLLLDHYPSIERIASVHAPVLVIAGGSDDIVPATLSERLVAAA
AEPKRYVVVPGVGHNDPELLDGRVMLDAIRRFLTETAVLGQ
>P72699 ~~~~~~UPF0045 protein sll0230~~~COG0011
MCQNLGKFEIVVSHYRRVKAMNVIVDLCVVPLGVGVSVGQYVAACQKVLAEAGLKHTMHAYGTNIEGDWDEVFAAVKACH
EAVHALGAPRITSSMRFGTRTDRPQTMDEKVKSVETWLENS
>Q7A3H8 1.-.-.-~~~~~~Putative NAD(P)H nitroreductase SA2311~~~
MSNMNQTIMDAFHFRHATKQFDPQKKVSKEDFETILESGRLSPSSLGLEPWKFVVIQDQALRDELKAHSWGAAKQLDTAS
HFVLIFARKNVTSRSPYVQHMLRDIKKYEAQTIPAVEQKFDAFQADFHISDNDQALYDWSSKQTYIALGNMMTTAALLGI
DSCPMEGFSLDTVTDILANKGILDTEQFGLSVMVAFGYRQQDPPKNKTRQAYEDVIEWVGPKE
>P9WLB7 ~~~~~~Uncharacterized protein Rv2313c~~~COG2128
MPAPVSVRDDLCRLVALSPGDGRIAGLVRQVCARALSLPSLPCEVAVNEPESPAEAVVAEFAEQFSVDVSAITGEQRSLL
WTHLGEDAFGAVVAMYIADFVPRVRAGLEALGVGKEYLGWVTGPISWDHNTDLSAAVFNGFLPAVARMRALDPVTSELVR
LRGAAQHNCRVCKSLREVSALDAGGSETLYGEIERFDTSVLLDVRAKAALRYADALIWTPAHLAVDVAVEVRSRFSDDEA
VELTFDIMRNASNKVAVSLGADAPRVQQGTERYRIGLDGQTVFG
>P9WLB5 ~~~~~~Universal stress protein Rv2319c~~~COG0589
MTIVVGYLAGKVGPSALHLAVRVARMHKTSLTVATIVRRHWPTPSLARVDAEYELWSEQLAAASAREAQRYLRRLADGIE
VSYHHRAHRSVSAGLLDVVEELEAEVLVLGSFPSGRRARVLIGSTADRLLHSSPVPVAITPRRYRCYTDRLTRLSCGYSA
TSGSVDVVRRCGHLASRYGVPMRVITFAVRGRTMYPPEVGLHAEASVLEAWAAQARELLEKLRINGVVSEDVVLQVVTGN
GWAQALDAADWQDGEILALGTSPFGDVARVFLGSWSGKIIRYSPVPVLVLPG
>P9WPI7 ~~~~~~Uncharacterized protein Rv2325c~~~COG0619
MTTTSAPARNGTRRPSRPIVLLIPVPGSSVIHDLWAGTKLLVVFGISVLLTFYPGWVTIGMMAALVLAAARIAHIPRGAL
PSVPRWLWIVLAIGFLTAALAGGTPVVAVGGVQLGLGGALHFLRITALSVVLLALGAMVSWTTNVAEISPAVATLGRPFR
VLRIPVDEWAVALALALRAFPMLIDEFQVLYAARRLRPKRMPPSRKARRQRHARELIDLLAAAITVTLRRADEMGDAITA
RGGTGQLSAHPGRPKLADWVTLAITAMASGTAVAIESLILHS
>P9WQI7 ~~~~~~Uncharacterized ABC transporter ATP-binding protein Rv2326c~~~COG1122
MCCAVCGPEPGRIGEVTPLGPCPAQHRGGPLRPSELAQASVMAALCAVTAIISVVVPFAAGLALLGTVPTGLLAYRYRLR
VLAAATVAAGMIAFLIAGLGGFMGVVHSAYIGGLTGIVKRRGRGTPTVVVSSLIGGFVFGAAMVGMLAAMVRLRHLIFKV
MTANVDGIAATLARMHMQGAAADVKRYFAEGLQYWPWVLLGYFNIGIMIVSLIGWWALSRLLERMRGIPDVHKLDPPPGD
DVDALIGPVPVRLDKVRFRYPRAGQDALREVSLDVRAGEHLAIIGANGSGKTTLMLILAGRAPTSGTVDRPGTVGLGKLG
GTAVVLQHPESQVLGTRVADDVVWGLPLGTTADVGRLLSEVGLEALAERDTGSLSGGELQRLALAAALAREPAMLIADEV
TTMVDQQGRDALLAVLSGLTQRHRTALVHITHYDNEADSADRTLSLSDSPDNTDMVHTAAMPAPVIGVDQPQHAPALELV
GVGHEYASGTPWAKTALRDINFVVEQGDGVLIHGGNGSGKSTLAWIMAGLTIPTTGACLLDGRPTHEQVGAVALSFQAAR
LQLMRSRVDLEVASAAGFSASEQDRVAAALTVVGLDPALGARRIDQLSGGQMRRVVLAGLLARAPRALILDEPLAGLDAA
SQRGLLRLLEDLRRARGLTVVVVSHDFAGMEELCPRTLHLRDGVLESAAASEAGGMS
>Q8UIR1 ~~~~~~UPF0339 protein Atu0232~~~COG3422
MYKFEIYQDKAGEYRFRFKASNGETMFSSEGYKAKASAIHAIESIKRNSAGADTVDLTTMTA
>Q1QV19 5.1.1.8~~~~~~Protein Csal_2339~~~COG3938
MSEELTLQLIDLHAGGDVSRIVTGGIDPLPGNTVREKMEYLREDADGLRQLLLSEPYGIPEMSVDLLVPASDPEAEVGYI
IMEVMGYPIYSGSNTICTATAVLESGLVPKREGHQRFILESAAGLVHIEARVENGVVEAITCEGLPSYIDTYRASIHVPS
VGDVTYSVAYSGGFYAMVDAAELGFSLNRDEEARLAECAHAIVEAIQAERGFSHYTLGDVGPLPFLHFMGPVEQVADGFF
RSRSTTYVHPGVICRSTTGTGTSARLALMHHEGTLQPGDKLETVSLRGTGFIGEFTGSRQEGDHRVAENTITGKAHMLAR
SDIVINCNDPLVECGSLHHILTSGHRRTEALPVEEEVSLSD
>Q5XDZ5 ~~~~~~Uncharacterized protein M6_Spy0233~~~
MYQVIHTNKESKMVDNRMRFTIDRSMQFPLVEIDLEHGGSVYLQQGSMVYHTENVTLNTKLNGKGSGLGKLVGAIGRSMV
SGESMFITQAMSDGDGKLALAPNTPGQIVALELGEKQYRLNDGAFLALDGSAQYKMERQNIGKALFGGQGGLFVMTTEGL
GTLLANSFGSIKKITLDGGTMTIDNAHVVAWSRELDYDIHLENGFMQSIGTGEGVINTFRGHGEIYIQSLNLEQFAGTLK
RYLPTSSN
>P9WFJ5 ~~~~~~UPF0603 protein Rv2345~~~COG1512
MRLVRLLGMVLTILAAGLLLGPPAGAQPPFRLSNYVTDNAGVLTSSGRTAVTAAVDRLYADRRIRLWVVYVENFSGQSAL
NWAQRTTRTSELGNYDALLAVATTGREYAFLVPSAMPGVSEGQVDNVRRYQIEPALHDGDYSGAAVAAANGLNRSPSSSS
RVVLLVTVGIIVIVVAVLLVVMRHRNRRRRADELAAARRVDPTNVMALAAVPLQALDDLSRSMVVDVDNAVRTSTNELAL
AIEEFGERRTAPFTQAVNNAKAALSQAFTVRQQLDDNTPETPAQRRELLTRVIVSAAHADRELASQTEAFEKLRDLVINA
PARLDLLTQQYVELTTRIGPTQQRLAELHTEFDAAAMTSIAGNVTTATERLAFADRNISAARDLADQAVSGRQAGLVDAV
RAAESALGQARALLDAVDSAATDIRHAVASLPAVVADIQTGIKRANQHLQQAQQPQTGRTGDLIAARDAAARALDRARGA
ADPLTAFDQLTKVDADLDRLLATLAEEQATADRLNRSLEQALFTAESRVRAVSEYIDTRRGSIGPEARTRLAEAKRQLEA
AHDRKSSNPTEAIAYANAASTLAAHAQSLANADVQSAQRAYTRRGGNNAGAILGGIIIGDLLSGGTRGGLGGWIPTSFGG
SSNAPGSSPDGGFLGGGGRF
>Q6NEC9 ~~~~~~UPF0371 protein DIP2346~~~
MVNTIGFDREKYIEMQSQHIRERREALGGKLYLEMGGKLFDDMHASRVLPGFTPDNKIAMLDRIKDEVEILVCINAKDLE
RHKIRADLGISYEEDVLRLVDVFRDRGFLVEHVVLTQLENDNRLALAFIERLQRLGIKVSRHRVIPGYPTDMDRIVSDEG
FGLNEYAETTRDLVVVTAPGPGSGKLATCLSQVYHEHKRGVAAGYAKFETFPIWNLPLEHPVNLAYEAATVDLNDANVID
HFHLAAYGEQTVNYNRDVEAFPLLKTLLERLMGESPYQSPTDMGVNMAGNCISDDAACRHASEQEIIRRYFKALVEEART
GKDSTQSDRAAVVMAKAGIKASQRVVVEPARQVEERTSLPGCAIELVDGSIITGATSDLLGCSSSMLLNALKHLAGIDDA
IHLLSPESIEPIQTLKTVHLGSSNPRLHTDEVLIALSVSAATDSNAQKALDQLKNLRGCDVHTTTILGSVDEGIFRNLGV
LVTSDPKFQKNKLYQKR
>Q8NN60 ~~~~~~DegV domain-containing protein Cgl2349/cg2579~~~COG1307
MPVRVIVDSSACLPTHVAEDLDITVINLHVMNNGEERSTSGLSSLELAASYARQLERGGDDGVLALHISKELSSTWSAAV
TAAAVFDDDSVRVVDTSSLGMAVGAAAMAAARMAKDGASLQECYDIAVDTLKRSETWIYLHRIDEIWKSGRISTATAMVS
TALATRPIMRFNGGRMEIAAKTRTQSKAFAKLVELAQIRADGEPVFIAIGQNEAREAAKQLEELLRNALPEGSSFMSVDI
DPTLAVHSGPGAVSVSAVFANQAPELSTGKAGAK
>Q6GEG9 ~~~~~~Uncharacterized HTH-type transcriptional regulator SAR2349~~~
MLSQEFFNSFITIYRPYLKLTEPILEKHNIYYGQWLILRDIAKHQPTTLIEISHRRAIEKPTARKTLKALIENDLITVEN
SLEDKRQKFLTLTPKGHELYEIVCLDVQKLQQAVVAKTNISQDQMQETINVMNQIHEILLKEAHND
>A0QUV5 2.1.1.-~~~~~~Probable S-adenosylmethionine-dependent methyltransferase MSMEG_2350/MSMEI_2290~~~COG2227
MGDPDNALPSALPLTGERTIPGLAEENYWFRRHEVVYQRLAHRCAGRDVLEAGCGEGYGADLIADVARRVIGLDYDEATV
AHVRARYPRVDIRHGNLAELPLPDASVDVVVNFQVIEHLWDQAQFVSECFRVLRPGGVFLVSTPNRITFSPGRDTPLNPF
HTRELNAAELTELLETAGFEVEDTLGVFHGAGLAELDARHGGSIIEAQVQRAVADAPWDEQLLADVAAVRTDDFDLTPAA
ERDIDDSLDLVAIAVRP
>Q2FWE4 ~~~~~~UPF0340 protein SAOUHSC_02355~~~COG4475
MKDLTMLLDELKDMSFFNKGDICLIGCSTSEVIGEKIGTVGSMEVAETIFNALDVVSKETGVTFAFQGCEHINRAITIEK
SQYNPLTMEEVSVVPDVHAGGSLATYAFQHMKDPIVVEHITVPCGIDIGQTLIGMHIKHVCVPVRTSVKQVGQAIVTIAT
SRPKKIGGERAKYQ
>Q5HDJ2 ~~~~~~Uncharacterized HTH-type transcriptional regulator SACOL2360~~~
MMKLNLFINANETESYIDIHAPKMNDNVQSIINAVNDLDKSHTLVGYIDKEIHIINVSDVITFQVINKNVTAITSNQKFK
LKLRLYELEKQLPQHFIRISKSEIVNKYYIEKLLLEPNGLIRMYLKDAHYTYSSRRYLKSIKERLSI
>P9WFP1 ~~~~~~UPF0053 protein Rv2366c~~~COG1253
MTGYYQLLGSIVLIGLGGLFAAIDAAISTVSPARVDELVRDQRPGAGSLRKVMADRPRYVNLVVLLRTSCEITATALLVV
FIRYHFSMVWGLYLAAGIMVLASFVVVGVGPRTLGRQNAYSISLATALPLRLISWLLMPISRLLVLLGNALTPGRGFRNG
PFASEIELREVVDLAQQRGVVAADERRMIESVFELGDTPAREVMVPRTEMIWIESDKTAGQAMTLAVRSGHSRIPVIGEN
VDDIVGVVYLKDLVEQTFCSTNGGRETTVARVMRPAVFVPDSKPLDALLREMQRDRNHMALLVDEYGAIAGLVSIEDVLE
EIVGEIADEYDQAETAPVEDLGDKRFRVSARLPIEDVGELYGVEFDDDLDVDTVGGLLALELGRVPLPGAEVISHGLRLH
AEGGTDHRGRVRIGTVLLSPAEPDGADDEEADHPG
>Q7A3C4 3.-.-.-~~~~~~Uncharacterized hydrolase SA2367~~~
METLELQGAKLRYHQVGQGPVLIFIPGANGTGNIFLPLAEQLKDHFTVVAVDRRDYGESELTEPLPDSASNPDSDYRVKR
DAQDIAELAKSLSDEPVYILGSSSGSIVAMHVLKDYPEVVKKIAFHEPPINTFLPDSTYWKDKNDDIVHQILTEGLEKGM
KTFGETLNIAPIDAKMMSQPADTEEGRIEQYKRTMFWSEFEIRQYTHSDITLDDFTKYSDKITLLNGTDSRGSFPQDVNF
YINKETGIPIVDIPGGHLGYIQKPEGFADVLLNMWG
>P9WLB1 ~~~~~~Putative secreted protein Rv0236.1~~~
MNRIVAPAAASVVVGLLLGAAAIFGVTLMVQQDKKPPLPGGDPSSSVLNRVEYGNRS
>I6YD99 ~~~~~~Uncharacterized protein Rv2386A/RVBD_2386A~~~
MFVIRLADGEEVHGECDELTINPATGVLTVCRVDGFEETTTHYSPSAWRSVTHRKRGVGVRPSLVSTAQ
>P67382 ~~~~~~UPF0237 protein SP_0238~~~COG3830
MKAIITVVGKDKSGIVAGVSGKIAELGLNIDDISQTVLDEYFTMMAVVSSDEKQDFTYLRNEFEAFGQTLNVKINIQSAA
IFEAMYNI
>Q97ST4 ~~~~~~UPF0210 protein SP_0239~~~COG2848
MDIRQVTETIAMIEEQNFDIRTITMGISLLDCIDPDINRAAEKIYQKITTKAANLVAVGDEIAAELGIPIVNKRVSVTPI
SLIGAATDATDYVVLAKALDKAAKEIGVDFIGGFSALVQKGYQKGDEILINSIPRALAETDKVCSSVNIGSTKSGINMTA
VADMGRIIKETANLSDMGVAKLVVFANAVEDNPFMAGAFHGVGEADVIINVGVSGPGVVKRALEKVRGQSFDVVAETVKK
TAFKITRIGQLVGQMASERLGVEFGIVDLSLAPTPAVGDSVARVLEEMGLETVGTHGTTAALALLNDQVKKGGVMACNQV
GGLSGAFIPVSEDEGMIAAVQNGSLNLEKLEAMTAICSVGLDMIAIPEDTPAETIAAMIADEAAIGVINMKTTAVRIIPK
GKEGDMIEFGGLLGTAPVMKVNGASSVDFISRGGQIPAPIHSFKN
>A0QV09 1.1.1.-~~~~~~Aldo-keto reductase MSMEG_2407/MSMEI_2346~~~COG0656
MTASHGQAAAIPTVTLNDDNTLPVVGIGVGELSDSEAERSVSAALEAGYRLIDTAAAYGNEAAVGRAIAASGIPRDEIYV
TTKLATPDQGFTSSQAAARASLERLGLDYVDLYLIHWPGGDTSKYVDSWGGLMKVKEDGIARSIGVCNFGAEDLETIVSL
TYFTPAVNQIELHPLLNQAALREVNAGYNIVTEAYGPLGVGRLLDHPAVTAIAEAHGRTAAQVLLRWSIQLGNVVISRSA
NPERIASNLDVFGFELTADEMETLNGLDDGTRFRPDPATYTGS
>A0QV10 1.1.1.-~~~~~~Aldo-keto reductase MSMEG_2408/MSMEI_2347~~~COG0656
MSPRITLNDGNSIPQVGLGVWQTPAEDTERAVAAALQAGYRHIDTAAAYRNETETGRAIANSGVPREDIFLVTKLWNSDQ
GYDATLAAFDASVQRLGVDYLDLYLIHWPVPENNKFVDTFKAFAHLRDQGRIRSIGVSNFEPEHLTTLIEETGIVPAVNQ
IELHPLLPQQELRDVHAKLGIATEAWSPLGQGSLLADPVITGIAEQHGKTPAQVLIRWHIQLGNIVIPKSVNPERIASNF
DVFDFELSGQDITSIASLETGKRLGPDPRTFNFTG
>P9WLA9 ~~~~~~Uncharacterized protein Rv2411c~~~COG2308
MRRVSLPNQLNETRRRSPTRGERIFGGYNTSDVYAMAFDEMFDAQGIVRGPYKGIYAELAPSDASELKARADALGRAFID
QGITFSLSGQERPFPLDLVPRVISAPEWTRLERGITQRVKALECYLDDIYGDQEILRDGVIPRRLVTSCEHFHRQAVGIV
PPNGVRIHVAGIDLIRDHRGDFRVLEDNLRSPSGVSYVMENRRTMARVFPNLFATHRVRAVDDYASHLLRALRNSAATNE
ADPTVVVLTPGVYNSAYFEHSLLARQMGVELVEGRDLFCRDNQVYMRTTEGERQVDVIYRRIDDAFLDPLQFRADSVLGV
AGLVNAARAGNVVLSSAIGNGVGDDKLVYTYVPTMIEYYLHEKPLLANVETLRCWLDDEREEVLDRIRELVLKPVEGSGG
YGIVFGPEASQAELAAVSQKIRDDPRSWIAQPMMELSTVPTRIEGTLAPRYVDLRPFAVNDGNEVWVLPGGLTRVALVEG
SRVVNSSQGGGSKDTWVLAPRASAAARELGAAQIVRSLPQPLCDPTVDASGYEPHDQQPQQQQQQQQQAFH
>P9WP05 ~~~~~~DegV domain-containing protein Rv2417c~~~COG1307
MTVVVVTDTSCRLPADLREQWSIRQVPLHILLDGLDLRDGVDEIPDDIHKRHATTAGATPVELSAAYQRALADSGGDGVV
AVHISSALSGTFRAAELTAAELGPAVRVIDSRSAAMGVGFAALAAGRAAAAGDELDTVARAAAAAVSRIHAFVAVARLDN
LRRSGRISGAKAWLGTALALKPLLSVDDGKLVLVQRVRTVSNATAVMIDRVCQLVGDRPAALAVHHVADPAAANDVAAAL
AERLPACEPAMVTAMGPVLALHVGAGAVGVCVDVGASPPA
>P60876 ~~~~~~Uncharacterized protein SA2420.1~~~
MKKLAVILTLVGGLYYAFKKYQERVNQAPNIEY
>A7HD44 ~~~~~~Response regulator receiver protein Anae109_2439~~~COG0745
MRRYLIVDDNRDFAENLAEILRDGGDEVAIAENGQEALALARKTRFDALLTDMRMPLMGGAELVHELRRIDPGAAAMVIT
AHVADDALEAARREGLLAVLPKPVAVPRILDLLAAARRDGLVAVVEDDSRMSDNLCEALRGRGFAAVTAASVTETERLGP
VEPFCALVDLRVPGGADGDALRRLRERFPGLPVIVVTGTHEVPPVPHQGYFTKPFDTAELLSAVERLHRERGQTVVPE
>O53176 1.3.1.-~~~~~~Putative trans-acting enoyl reductase Rv2449c~~~COG3268
MTATPREFDIVLYGATGFVGKLTAEYLARAGGDARIALAGRSTQRVLAVREALGESAQTWPILTADASLPSTLQAMAARA
QVVVTTVGPYTRYGLPLVAACAAAGTDYADLTGEPMFMRNSIDLYHKQAADTGARIVHACGFDSVPSDLSVYALYHAARE
DGAGELTDTNCVVRSFKGGFSGGTIASMLEVLSTASNDPDARRQLSDPYMLSPDRGAEPELGPQPDLPSRRGRRLAPELA
GVWTAGFIMAPTNTRIVRRSNALLDWAYGRRFRYSETMSVGSTVLAPVVSVVGGGVGNAMFGLASRYIRLLPRGLVKRVV
PKPGTGPSAAARERGYYRIETYTTTTTGARYLARMAQDGDPGYKATSVLLGECGLALALDRDKLSDMRGVLTPAAAMGDA
LLERLPAAGVSLQTTRLAS
>P9WJX1 ~~~~~~Uncharacterized MFS-type transporter Rv2456c~~~COG2814
MSGTVVAVPPRVARALDLLNFSLADVRDGLGPYLSIYLLLIHDWDQASIGFVMAVGGIAAIVAQTPIGALVDRTTAKRAL
VVAGAVLVTAAAVAMPLFAGLYSISVLQAVTGIASSVFAPALAAITLGAVGPQFFARRIGRNEAFNHAGNASAAGATGAL
AYFFGPVVVFWVLAGMALISVLATLRIPPDAVDHDLARGMDHAPGEPHPQPSRFTVLAHNRELVIFGAAVVAFHFANAAM
LPLVGELLALHNRDEGTALMSSCIVAAQVVMVPVAYVVGTRADAWGRKPIFLVGFAVLTARGFLYTLSDNSYWLVGVQLL
DGIGAGIFGALFPLVVQDVTHGTGHFNISLGAVTTATGIGAALSNLVAGWIVVVAGYDAAFMSLGALAGAGFLLYLVAMP
ETVDSDVRVRSRPTLGGK
>Q831P3 ~~~~~~UPF0358 protein EF_2458~~~COG4838
MDEGISKKFAIQLLEDDAERIKMLIRNQKNSLCISQCKAFEEVVDTQMYGFSRQVTYATRLGILTNDEGHRLLSDLEREL
NQLYTDVYEETQEKNEIGKEG
>P9WLA7 ~~~~~~Uncharacterized protein Rv2468c~~~COG1512
MTHRSSRLEVGPVARGDVATIEHAELPPGWVLTTSGRISGVTEPGELSVHYPFPIADLVALDDALTYSSRACQVRFAIYL
GDLGRDTAARAREILGKVPTPDNAVLLAVSPNQCAIEVVYGSQVRGRGAESAAPLGVAAASSAFEQGELVDGLISAIRVL
SAGIAPG
>P43972 ~~~~~~Uncharacterized protein HI_0246~~~COG0641
MNLTKLLPAFAAAVVLSACAKDAPEMTKSSAQIAEMQTLPTITDKTVVYSCNKQTVTAVYQFENQEPVAAMVSVGDGIIA
KDFTRDKSQNDFTSFVSGDYVWNVDSGLTLDKFDSVVPVNLIQKGKSSDNIIVKNCDVNVKATKKANL
>Q7A339 ~~~~~~UPF0312 protein SA2479~~~
MTNFTFDGAHSSLEFQIKHLMVSKVKGSFDQFDVAVEGDINDFSTLKATATIIPSSINTKNEARDNHLKSGDFFGTDEFD
KITFETKSVTENKVVGDLTIKGITNEETFDVEFNGVSKNPMDGSQVTGVIVTGTINRENYGINFNQALETGGVMLGKDVK
FEASAEFSISE
>P9WKB3 2.3.1.20~~~~~~Putative diacyglycerol O-acyltransferase Rv2484c~~~COG1020
MAESGESPRLSDELGPVDYLMHRGEANPRTRSGIMALELLDGTPDWDRFRTRFENASRRVLRLRQKVVVPTLPTAAPRWV
VDPDFNLDFHVRRVRVSGPATLREVLDLAEVILQSPLDISRPLWTATLVEGMADGRAAMLLHVSHAVTDGVGGVEMFAQI
YDLERDPPPRSTPPQPIPEDLSPNDLMRRGINHLPIAVVGGVLDALSGAVSMAGRAVLEPVSTVSGILGYARSGIRVLNR
AAEPSPLLRRRSLTTRTEAIDIRLADLHKAAKAGGGSINDAYLAGLCGALRRYHEALGVPISTLPMAVPVNLRAEGDAAG
GNQFTGVNLAAPVGTIDPVARMKKIRAQMTQRRDEPAMNIIGSIAPVLSVLPTAVLEGITGSVIGSDVQASNVPVYPGDT
YLAGAKILRQYGIGPLPGVAMMVVLISRGGWCTVTVRYDRASVRNDELFAQCLQAGFDEILALAGGPAPRVLPASFDTQG
AGSVPRSVSGS
>Q44849 ~~~~~~Putative outer membrane protein BBA03~~~
MKKTIIVFIILAFMLNCKNKSNDAEPNNDLDEKSQAKSNLVDEDRIEFSKATPLEKLVSRLNLNNTEKETLTFLTNLLKE
KLVDPNIGLHFKNSGGDESKIEESVQKFLSELKEDEIKDLLAKIKENKDKKEKDPEELNTYKSILASGFDGIFNQADSKT
TLNKLKDTI
>O53672 ~~~~~~Uncharacterized protein Rv0250c~~~
MSTTAELAELHDLVGGLRRCVTALKARFGDNPATRRIVIDADRILTDIELLDTDVSELDLERAAVPQPSEKIAIPDTEYD
REFWRDVDDEGVGGHRY
>Q9EXC9 ~~~~~~Protein MG115 homolog~~~
MYARLIAEKLLNHKLTIATAESVTGGLLSSSLTDIAGASRFFKGAIVAYSNELKKSLLNVKQSTLINHGAVSRYCVREMA
LGLMQKLNVDIAVACSGVAGPDALENQAVGSLFFCVIVANKAYDFETKLPAGSRNELRQLFVQKILQTVEHILSEIS
>P9WFP9 ~~~~~~UPF0047 protein Rv2556c~~~COG0432
MDTDVLDVDTARRRIVDLTDAVRAFCTAHDDGLCNVFVPHATAGVAIIETGAGSDEDLVDTLVRLLPRDDRYRHAHGSYG
HGADHLLPAFVAPSVTVPVSGGQPLLGTWQSIVLVDLNQDNPRRSVRLSFVEG
>P9WLA5 ~~~~~~Uncharacterized protein Rv2557~~~COG1359
MTGGATGALPRTMKEGWIVYARSTTIQAQSECIDTGIAHVRDVVMPALQGMDGCIGVSLLVDRQSGRCIATSAWETAEAM
HASREQVTPIRDRCAEMFGGTPAVEEWEIAAMHRDHRSAEGACVRATWVKVPADQVDQGIEYYKSSVLPQIEGLDGFCSA
SLLVDRTSGRAVSSATFDSFDAMERNRDQSNALKATSLREAGGEELDECEFELALAHLRVPELV
>P9WLA3 ~~~~~~Uncharacterized protein Rv2558~~~COG1359
MPGSAGWRKVFGGTGGATGALPRHGRGSIVYARSTTIEAQPLSVDIGIAHVRDVVMPALQEIDGCVGVSLLVDRQSGRCI
ATSAWETLEAMRASVERVAPIRDRAALMFAGSARVEEWDIALLHRDHPSHEGACVRATWLKVVPDQLGRSLEFYRTSVLP
ELESLDGFCSASLMVDHPACRRAVSCSTFDSMDAMARNRDRASELRSRRVRELGAEVLDVAEFELAIAHLRVPELV
>P9WQN1 ~~~~~~Uncharacterized AAA domain-containing protein Rv2559c~~~COG2256
MPEAVSDGLFDVPGVPMTSGHDLGASAGAPLAVRMRPASLDEVVGQDHLLAPGSPLRRLVEGSGVASVILYGPPGSGKTT
LAALISQATGRRFEALSALSAGVKEVRAVIENSRKALLHGEQTVLFIDEVHRFSKTQQDALLSAVEHRVVLLVAATTENP
SFSVVAPLLSRSLILQLRPLTAEDTRAVVQRAIDDPRGLGRAVAVAPEAVDLLVQLAAGDARRALTALEVAAEAAQAAGE
LVSVQTIERSVDKAAVRYDRDGDQHYDVVSAFIKSVRGSDVDAALHYLARMLVAGEDPRFIARRLMILASEDIGMAGPSA
LQVAVAAAQTVALIGMPEAQLTLAHATIHLATAPKSNAVTTALAAAMNDIKAGKAGLVPAHLRDGHYSGAAALGNAQGYK
YSHDDPDGVVAQQYPPDELVDVDYYRPTGRGGEREIAGRLDRLRAIIRKKRG
>P9WLA1 ~~~~~~Uncharacterized protein Rv2560~~~COG5473
MSQPPEHPGNPADPQGGNQGAGSYPPPGYGAPPPPPGYGPPPGTYLPPGYNAPPPPPGYGPPPGPPPPGYPTHLQSSGFS
VGDAISWSWNRFTQNAVTLVVPVLAYAVALAAVIGATAGLVVALSDRATTAYTNTSGVSSESVDITMTPAAGIVMFLGYI
ALFALVLYMHAGILTGCLDIADGKPVTIATFFRPRNLGLVLVTGLLIVAVTFIGGLLCVIPGLIFGFVAQFAVAFAVDRS
TSPIDSVKASIETVGSNIGGSVLSWLAQLTAVLVGELLCFVGMLIGIPVAALIHVYTYRKLSGGQVVEAVRPAPPVGWPP
GPQLA
>P9WG15 ~~~~~~Uncharacterized ABC transporter permease Rv2563~~~COG0577
MLFAALRDVQWRKRRLVIAIVSTGLVFAMTLVLTGLVNGFRVEAERTVDSMGVDAFVVKAGAAGPFLGSTPFAQIDLPQV
ARAPGVLAAAPLATAPSTIRQGTSARNVTAFGAPEHGPGMPRVSDGRAPSTPDEVAVSSTLGRNLGDDLQVGARTLRIVG
IVPESTALAKIPNIFLTTEGLQQLAYNGQPTISSIGIDGMPRQLPDGYQTVNRADAVSDLMRPLKVAVDAITVVAVLLWI
VAALIVGSVVYLSALERLRDFAVFKAIGVPTRSILAGLALQAVVVALLAAVVGGILSLLLAPLFPMTVVVPLSAFVALPA
IATVIGLLASVAGLRRVVAIDPALAFGGP
>P9WQI5 ~~~~~~Uncharacterized ABC transporter ATP-binding protein Rv2564~~~COG0664
MGGLTISDLVVEYSSGGYAVRPIDGLSLDVAPGSLVILLGPSGCGKTTLLSCLGGILRPKSGSIKFDDVDITTLEGAALA
KYRRDKVGIVFQAFNLVSSLTALENVMVPLRAAGVSRAAARKRAEDLLIRVNLGERMKHRPGDMSGGQQQRVAVARAIAL
DPQLILADEPTAHLDFIQVEEVLRLIRSLAQGDRVVVVATHDSRMLPLADRVLELMPAQVSPNQPPETVHVKAGEVLFEQ
STMGDLIYVVSEGEFEIVRELADGGEELVKTAAPGDYFGEIGVLFHLPRSATVRARSDATAVGYTAQAFRERLGVTRVAD
LIEHRELASE
>P9WIY7 ~~~~~~Uncharacterized NTE family protein Rv2565~~~COG0664
MTTARRRPKRRGTDARTALRNVPILADIDDEQLERLATTVERRHVPANQWLFHAGEPADSIYIVDSGRFVAVAPEGHVFA
EMASGDSIGDLGVIAGAARSAGVRALRDGVVWRIAAETFTDMLEATPLLQSAMLRAMARMLRQSRPAKTARRPRVIGVVS
NGDTAAAPMVDAIATSLDSHGRTAVIAPPVETTSAVQEYDELVEAFSETLDRAERSNDWVLVVADRGAGDLWRHYVSAQS
DRLVVLVDQRYPPDAVDSLATQRPVHLITCLAEPDPSWWDRLAPVSHHPANSDGFGALARRIAGRSLGLVMAGGGARGLA
HFGVYQELTEAGVVIDRFGGTSSGAIASAAFALGMDAGDAIAAAREFIAGSDPLGDYTIPISALTRGGRVDRLVQGFFGN
TLIEHLPRGFFSVSADMITGDQIIHRRGSVSGAVRASISIPGLIPPVHNGEQLLVDGGLLNNLPANVMCADTDGEVICVD
LRRTFVPSKGFGLLPPIVTPPGLLRRLLTGTDNALPPLQETLLRAFDLAASTANLRELPRVAAIIEPDVSKIGVLNFKQI
DAALEAGRMAARAALQAQPDLVR
>P9WL97 ~~~~~~Uncharacterized protein Rv2567~~~COG2307
MAPSASAATNGYDVDRLLAGYRTARAQETLFDLRDGPGAGYDEFVDDDGNVRPTWTELADAVAERGKAGLDRLRSVVHSL
IDHDGITYTAIDAHRDALTGDHDLEPGPWRLDPLPLVISAADWEVLEAGLVQRSRLLDAILADLYGPRSMLTEGVLPPEM
LFAHPGYVRAANGIQMPGRHQLFMHACDLSRLPDGTFQVNADWTQAPSGSGYAMADRRVVAHAVPDLYEELAPRPTTPFA
QALRLALIDAAPDVAQDPVVVVLSPGIYSETAFDQAYLATLLGFPLVESADLVVRDGKLWMRSLGTLKRVDVVLRRVDAH
YADPLDLRADSRLGVVGLVEAQHRGTVTVVNTLGSGILENPGLLRFLPQLSERLLDESPLLHTAPVYWGGIASERSHLLA
NVSSLLIKSTVSGETLVGPTLSSAQLADLAVRIEAMPWQWVGQELPQFSSAPTNHAGVLSSAGVGMRLFTVAQRSGYAPM
IGGLGYVLAPGPAAYTLKTVAAKDIWVRPTERAHAEVITVPVLAPPAKTGAGTWAVSSPRVLSDLFWMGRYGERAENMAR
LLIVTRERYHVFRHQQDTDESECVPVLMAALGKITGYDTATGAGSAYDRADMIAVAPSTLWSLTVDPDRPGSLVQSVEGL
ALAAQAVRDQLSNDTWMVLANVERAVEHKSDPPQSLAEADAVLASAQAETLAGMLTLSGVAGESMVHDVGWTMMDIGKRI
ERGLWLTALLQATLSTVRHPAAEQAIIEATLVACESSVIYRRRTVGKFSVAAVTELMLFDAQNPRSLVYQLERLRADLKD
LPGSSGSSRPERMVDEMNTRLRRSHPEELEEVSADGLRAELAELLAGIHASLRDVADVLTATQLALPGGMQPLWGPDQRR
VMPA
>P9WL95 ~~~~~~Uncharacterized protein Rv2568c~~~COG4307
MRDFHCPNCGQRLAFENSACLSCGSALGFSLGRMALLVIADDADVQLCANLHLAQCNWLVPSDQLGGLCSSCVLTIERPS
DTNTAGLAEFARAEGAKRRLIAELHELKLPIVGRDQDPDHGLAFRLLSSAHENVTTGHQNGVITLDLAEGDDVHREQLRV
EMDEPYRTLLGHFRHEIGHYYFYRLIASSSDYLSRFNELFGDPDADYSQALDRHYRGGPPEGWQDSFVSSYATMHASEDW
AETFAHYLHIRDALDTAAWCGLAPASATFDRPALGPSAFNTIIDKWLPLSWSLNMVNRSMGHDDLYPFVLPAAVLEKMRF
IHTVVDEVAPDFEPAHSRRTV
>P9WL93 ~~~~~~Uncharacterized protein Rv2569c~~~COG1305
MSADSSLSLPLSGTHRYRVTHRTEYRYSDVVTSSYGRGFLTPRNSLRQRCVAHRLTIDPAPADRSTSRDGYGNISSYFHV
TEPHRTLTITSDSIVDVSPPPPGLYTSGPALQPWEAARPAGLPGSLATEFTLDLNPPEITDAVREYAAPSFLPKRPLVEV
LRDLASRIYTDFTYRSGSTTISTGVNEVLLAREGVCQDFARLAIACLRANGLAACYVSGYLATDPPPGKDRMIGIDATHA
WASVWTPQQPGRFEWLGLDPTNDQLVDQRYIVVGRGRDYADVPPLRGIIYTNSENSVIDVSVDVVPFEGDALHA
>P9WL87 ~~~~~~Uncharacterized protein Rv2574~~~COG3427
MYPCERVGLSFTETAPYLFRNTVDLAITPEQLFEVLADPQAWPRWATVITKVTWTSPEPFGAGTTRIVEMRGGIVGDEEF
ISWEPFTRMAFRFNECSTRAVGAFAEDYRVQAIPGGCRLTWTMAQKLAGPARPALFVFRPLLNLALRRFLRNLRRYTDAR
FAAAQQS
>P9WL85 ~~~~~~Uncharacterized protein Rv2575~~~COG2321
MTFNEGVQIDTSTTSTSGSGGGRRLAIGGGLGGLLVVVVAMLLGVDPGGVLSQQPLDTRDHVAPGFDLSQCRTGADANRF
VQCRVVATGNSVDAVWKPLLPGYTRPHMRLFSGQVGTGCGPASSEVGPFYCPVDKTAYFDTDFFQVLVTQFGSSGGPFAE
EYVVAHEYGHHVQNLLGVLGRAQQGAQGAAGSGVRTELQADCYAGVWAYYASTVKQESTGVPYLEPLSDKDIQDALAAAA
AVGDDRIQQQTTGRTNPETWTHGSAAQRQKWFTVGYQTGDPNICDTFSAADLG
>P9WL83 ~~~~~~Uncharacterized protein Rv2576c~~~
MPAGVGNASGSVLDMTSVRTVPSAVALVTFAGAALSGVIPAIARADPVGHQVTYTVTTTSDLMANIRYMSADPPSMAAFN
ADSSKYMITLHTPIAGGQPLVYTATLANPSQWAIVTASGGLRVNPEFHCEIVVDGQVVVSQDGGSGVQCSTRPW
>P9WMW3 3.-.-.-~~~~~~Uncharacterized protein Rv2581c~~~COG0491
MLITGFPAGLLACNCYVLAERPGTDAVIVDPGQGAMGTLRRILDKNRLTPAAVLLTHGHIDHIWSAQKVSDTFGCPTYVH
PADRFMLTDPIYGLGPRIAQLVAGAFFREPKQVVELDRDGDKIDLGGISVNIDHTPGHTRGSVVFRVLQATNNDKDIVFT
GDTLFERAIGRTDLAGGSGRDLLRSIVDKLLVLDDSTVVLPGHGNSTTIGAERRFNPFLEGLSR
>P9WL77 ~~~~~~Uncharacterized lipoprotein Rv2585c~~~COG0747
MAPRRRRHTRIAGLRVVGTATLVAATTLTACSGSAAAQIDYVVDGALVTYNTNTVIGAASAGAQAFARTLTGFGYHGPDG
QVVADRDFGTVSVVEGSPLILDYQISDDAVYSDGRPVTCDDLVLAWAAQSGRFPGFDAATQAGYVDIANIECTAGQKKAR
VSFIPDRSVVDHSQLFTATSLMPSHVIADQLHIDVTAALLSNNVSAVEQIARLWNSTWDLKPGRSHDEVRSRFPSSGPYK
IESVLDDGAVVLVANDRWWGTKAITKRITVWPQGADIQDRVNNRSVDVVDVAAGSSGSLVTPDSYQRTDYPSAGIEQLIF
APQGSLAQSRTRRALALCVPRDAIARDAGVPIANSRLSPATDDALTDADGAAEARQFGRVDPAAARDALGGTPLTVRIGY
GRPNARLAATIGTIADACAPAGITVSDVTVDTPGPQALRDGKIDVLLASTGGATGSGSSGSCAMDAYDLHSGNGNNLSGY
ANAQIDGIISALAVSADPAERARLLAEAAPVLWDEMPTLPLYRQQRTLLMSTKMYAVSRNPTRWGAGWNMDRWALAR
>P9WL73 ~~~~~~Uncharacterized protein Rv2597~~~
MGNLLVVIAVALFIAAIVVLVVAIRRPKTPATPGGRRDPLAFDAMPQFGPRQLGPGAIVSHGGIDYVVRGSVTFREGPFV
WWEHLLEGGDTPTWLSVQEDDGRLELAMWVKRTDLGLQPGGQHVIDGVTFQETERGHAGYTTEGTTGLPAGGEMDYVDCA
SAGQGADESMLLSFERWAPDMGWEIATGKSVLAGELTVYPAPPVSA
>P9WL69 ~~~~~~Uncharacterized protein Rv2599~~~
MSRNRLFLVAGSLAVAAAVSLISGITLLNRDVGSYIASHYRQESRDVNGTRYLCTGSPKQVATTLVKYQTPAARASHTDT
EYLRYRNNIVTVGPDGTYPCIIRVENLSAGYNHGAYVFLGPGFTPGSPSGGSGGSPGGPGGSK
>P9WGA5 ~~~~~~Probable transcriptional regulatory protein Rv2603c~~~COG0217
MSGHSKWATTKHKKAVVDARRGKMFARLIKNIEVAARVGGGDPAGNPTLYDAIQKAKKSSVPNENIERARKRGAGEEAGG
ADWQTIMYEGYAPNGVAVLIECLTDNRNRAASEVRVAMTRNGGTMADPGSVSYLFSRKGVVTLEKNGLTEDDVLAAVLEA
GAEDVNDLGDSFEVISEPAELVAVRSALQDAGIDYESAEASFQPSVSVPVDLDGARKVFKLVDALEDSDDVQNVWTNVDV
SDEVLAALDDE
>Q481E4 ~~~~~~UPF0352 protein CPS_2611~~~COG3082
MPIVSKYSNERVEKIIQDLLDVLVKEEVTPDLALMCLGNAVTNIIAQVPESKRVAVVDNFTKALKQSV
>P9WFD6 ~~~~~~Universal stress protein MT2698~~~
MSSGNSSLGIIVGIDDSPAAQVAVRWAARDAELRKIPLTLVHAVSPEVATWLEVPLPPGVLRWQQDHGRHLIDDALKVVE
QASLRAGPPTVHSEIVPAAAVPTLVDMSKDAVLMVVGCLGSGRWPGRLLGSVSSGLLRHAHCPVVIIHDEDSVMPHPQQA
PVLVGVDGSSASELATAIAFDEASRRNVDLVALHAWSDVDVSEWPGIDWPATQSMAEQVLAERLAGWQERYPNVAITRVV
VRDQPARQLVQRSEEAQLVVVGSRGRGGYAGMLVGSVGETVAQLARTPVIVARESLT
>P9WFD7 ~~~~~~Universal stress protein Rv2623~~~COG0589
MSSGNSSLGIIVGIDDSPAAQVAVRWAARDAELRKIPLTLVHAVSPEVATWLEVPLPPGVLRWQQDHGRHLIDDALKVVE
QASLRAGPPTVHSEIVPAAAVPTLVDMSKDAVLMVVGCLGSGRWPGRLLGSVSSGLLRHAHCPVVIIHDEDSVMPHPQQA
PVLVGVDGSSASELATAIAFDEASRRNVDLVALHAWSDVDVSEWPGIDWPATQSMAEQVLAERLAGWQERYPNVAITRVV
VRDQPARQLVQRSEEAQLVVVGSRGRGGYAGMLVGSVGETVAQLARTPVIVARESLT
>P9WFD4 ~~~~~~Universal stress protein MT2699~~~
MSGRGEPTMKTIIVGIDGSHAAITAALWGVDEAISRAVPLRLVSVIKPTHPSPDDYDRDLAHAERSLREAQSAVEAAGKL
VKIETDIPRGPAGPVLVEASRDAEMICVGSVGIGRYASSILGSTATELAEKAHCPVAVMRSKVDQPASDINWIVVRMTDA
PDNEAVLEYAAREAKLRQAPILALGGRPEELREIPDGEFERRVQDWHHRHPDVRVYPITTHTGIARFLADHDERVQLAVI
GGGEAGQLARLVGPSGHPVFRHAECSVLVVRR
>P9WFD5 ~~~~~~Universal stress protein Rv2624c~~~COG0589
MSGRGEPTMKTIIVGIDGSHAAITAALWGVDEAISRAVPLRLVSVIKPTHPSPDDYDRDLAHAERSLREAQSAVEAAGKL
VKIETDIPRGPAGPVLVEASRDAEMICVGSVGIGRYASSILGSTATELAEKAHCPVAVMRSKVDQPASDINWIVVRMTDA
PDNEAVLEYAAREAKLRQAPILALGGRPEELREIPDGEFERRVQDWHHRHPDVRVYPITTHTGIARFLADHDERVQLAVI
GGGEAGQLARLVGPSGHPVFRHAECSVLVVRR
>P9WL66 ~~~~~~Uncharacterized protein MT2702~~~
MASSASDGTHERSAFRLSPPVLSGAMGPFMHTGLYVAQSWRDYLGQQPDKLPIARPTIALAAQAFRDEIVLLGLKARRPV
SNHRVFERISQEVAAGLEFYGNRGWLEKPSGFFAQPPPLTEVAVRKVKDRRRSFYRIFFDSGFTPHPGEPGSQRWLSYTA
NNREYALLLRHPEPRPWLVCVHGTEMGRAPLDLAVFRAWKLHDELGLNIVMPVLPMHGPRGQGLPKGAVFPGEDVLDDVH
GTAQAVWDIRRLLSWIRSQEEESLIGLNGLSLGGYIASLVASLEEGLACAILGVPVADLIELLGRHCGLRHKDPRRHTVK
MAEPIGRMISPLSLTPLVPMPGRFIYAGIADRLVHPREQVTRLWEHWGKPEIVWYPGGHTGFFQSRPVRRFVQAALEQSG
LLDAPRTQRDRSA
>P9WL67 ~~~~~~Uncharacterized protein Rv2627c~~~COG1073
MASSASDGTHERSAFRLSPPVLSGAMGPFMHTGLYVAQSWRDYLGQQPDKLPIARPTIALAAQAFRDEIVLLGLKARRPV
SNHRVFERISQEVAAGLEFYGNRRWLEKPSGFFAQPPPLTEVAVRKVKDRRRSFYRIFFDSGFTPHPGEPGSQRWLSYTA
NNREYALLLRHPEPRPWLVCVHGTEMGRAPLDLAVFRAWKLHDELGLNIVMPVLPMHGPRGQGLPKGAVFPGEDVLDDVH
GTAQAVWDIRRLLSWIRSQEEESLIGLNGLSLGGYIASLVASLEEGLACAILGVPVADLIELLGRHCGLRHKDPRRHTVK
MAEPIGRMISPLSLTPLVPMPGRFIYAGIADRLVHPREQVTRLWEHWGKPEIVWYPGGHTGFFQSRPVRRFVQAALEQSG
LLDAPRTQRDRSA
>P9WL64 ~~~~~~Putative uncharacterized protein MT2703~~~
MSTQRPRHSGIRAVGPYAWAGRCGRIGRWGVHQEAMMNLAIWHPRKVQSATIYQVTDRSHDGRTARVPGDEITSTVSGWL
SELGTQSPLADELARAVRIGDWPAAYAIGEHLSVEIAVAV
>P9WL65 ~~~~~~Putative uncharacterized protein Rv2628~~~
MSTQRPRHSGIRAVGPYAWAGRCGRIGRWGVHQEAMMNLAIWHPRKVQSATIYQVTDRSHDGRTARVPGDEITSTVSGWL
SELGTQSPLADELARAVRIGDWPAAYAIGEHLSVEIAVAV
>P9WL62 ~~~~~~Uncharacterized protein MT2704~~~
MRSERLRWLVAAEGPFASVYFDDSHDTLDAVERREATWRDVRKHLESRDAKQELIDSLEEAVRDSRPAVGQRGRALIATG
EQVLVNEHLIGPPPATVIRLSDYPYVVPLIDLEMRRPTYVFAAVDHTGADVKLYQGATISSTKIDGVGYPVHKPVTAGWN
GYGDFQHTTEEAIRMNCRAVADHLTRLVDAADPEVVFVSGEVRSRTDLLSTLPQRVAVRVSQLHAGPRKSALDEEEIWDL
TSAEFTRRRYAEITNVAQQFEAEIGRGSGLAAQGLAEVCAALRDGDVDTLIVGELGEATVVTGKARTTVARDADMLSELG
EPVDRVARADEALPFAAIAVGAALVRDDNRIAPLDGVGALLRYAATNRLGSHRS
>P9WL63 ~~~~~~Uncharacterized protein Rv2629~~~COG1503
MRSERLRWLVAAEGPFASVYFDDSHDTLDAVERREATWRDVRKHLESRDAKQELIDSLEEAVRDSRPAVGQRGRALIATG
EQVLVNEHLIGPPPATVIRLSDYPYVVPLIDLEMRRPTYVFAAVDHTGADVKLYQGATISSTKIDGVGYPVHKPVTAGWN
GYGDFQHTTEEAIRMNCRAVADHLTRLVDAADPEVVFVSGEVRSRTDLLSTLPQRVAVRVSQLHAGPRKSALDEEEIWDL
TSAEFTRRRYAEITNVAQQFEAEIGRGSGLAAQGLAEVCAALRDGDVDTLIVGELGEATVVTGKARTTVARDADMLSELG
EPVDRVARADEALPFAAIAVGAALVRDDNRIAPLDGVGALLRYAATNRLGSHRS
>P9WL61 ~~~~~~Uncharacterized protein Rv2632c~~~
MTDSEHVGKTCQIDVLIEEHDERTRAKARLSWAGRQMVGVGLARLDPADEPVAQIGDELAIARALSDLANQLFALTSSDI
EASTHQPVTGLHH
>P9WL59 ~~~~~~Uncharacterized protein Rv2633c~~~COG5592
MNAYDVLKRHHTVLKGLGRKVGEAPVNSEERHVLFDEMLIELDIHFRIEDDLYYPALSAAGKPITGTHAEHRQVVDQLAT
LLRTPQRAPGYEEEWNVFRTVLEAHADVEERDMIPAPTPVHITDAELEELGDKMAARIEQLRGSPLYTLRTKGKADLLKA
I
>P9WL55 2.7.1.-~~~~~~Putative O-phosphotransferase Rv2636~~~COG3896
MINPTRARRMRYRLAAMAGMPEGKLILLNGGSSAGKTSLALAFQDLAAECWMHIGIDLFWFALPPEQLDLARVRPEYYTW
DSAVEADGLEWFTVHPGPILDLAMHSRYRAIRAYLDNGMNVIADDVIWTREWLVDALRVFEGCRVWMVGVHVSDEEGARR
ELERGDRHPGWNRGSARAAHADAEYDFELDTTATPVHELARELHESYQACPYPMAFNRLRKRFLS
>P9WP07 ~~~~~~Uncharacterized membrane protein Rv2637~~~COG0586
MDVEALLQSIPPLMVYLVVGAVVGIESLGIPLPGEIVLVSAAVLSSHPELAVNPIGVGGAAVIGAVVGDSIGYSIGRRFG
LPLFDRLGRRFPKHFGPGHVALAERLFNRWGVRAVFLGRFIALLRIFAGPLAGALKMPYPRFLAANVTGGICWAGGTTAL
VYFAGMAAQHWLERFSWIALVIAVIAGITAAILLRERTSRAIAELEAEHCRKAGTTAA
>Q9K9K7 ~~~~~~UPF0223 protein BH2638~~~COG4476
MKTTLPISLDWSTEEVIDVVHFFQAIEQAYDQGIAREDLLGKYRRFKEIVPSKSEEKQLFRAYEQENDVSCYQTIKKARE
EMEEHIQM
>I6X4W0 ~~~~~~Putative anti-anti-sigma factor Rv2638~~~COG1366
MGLITTEPRSSPHPLSPRLVHELGDPHSTLRATTDGSGAALLIHAGGEIDGRNEHLWRQLVTEAAAGVTAPGPLIVDVTG
LDFMGCCAFAALADEAQRCRCRGIDLRLVSHQPIVARIAEAGGLSRVLPIYPTVDTALGKGTAGPARC
>Q9CCZ4 2.1.1.-~~~~~~Putative S-adenosyl-L-methionine-dependent methyltransferase ML2640~~~COG3315
MRTHDDTWDIKTSVGTTAVMVAAARAAETDRPDALIRDPYAKLLVTNTGAGALWEAMLDPSMVAKVEAIDAEAAAMVEHM
RSYQAVRTNFFDTYFNNAVIDGIRQFVILASGLDSRAYRLDWPTGTTVYEIDQPKVLAYKSTTLAEHGVTPTADRREVPI
DLRQDWPPALRSAGFDPSARTAWLAEGLLMYLPATAQDGLFTEIGGLSAVGSRIAVETSPLHGDEWREQMQLRFRRVSDA
LGFEQAVDVQELIYHDENRAVVADWLNRHGWRATAQSAPDEMRRVGRWGDGVPMADDKDAFAEFVTAHRL
>P9WL51 ~~~~~~Uncharacterized protein Rv2645~~~
MTTTPRQPLFCAHADTNGDPGRCACGQQLADVGPATPPPPWCEPGTEPIWEQLTERYGGVTICQWTRYFPAGDPVAADVW
IAADDRVVDGRVLRTQPAIHYTEPPVLGIGPAAARRLAAELLNAADTLDDGRRQLDDLGEHRR
>A7LXT4 ~~~~~~IPT/TIG domain-containing protein BACOVA_02650~~~
MKSIYKYLDTRLFLIGLLVLPFLAVVSCQNDDDDAIPVIHYIRVTDPAKADSTFTDVNPGTMIVVVGEHLGGTQKVYIND
QEVSFNRNYVTSTSIILTVPNELELTGQNPELKGEIRIETEHGVAAYNMHVLSPAPYITRISATYPIKPGDQMTVIGGNF
YEVQAVYLSTEQPAKDGTRPVDVQEITNYEVNNKYSQITLTAPANLLEEGYLVVECYTSSAVTEFKKNGPKPVVTAVSST
MPVVGSTVTITGQNFIEVSRVNINGEFDIPVGDITTSNTFDEISFVLPQAPTQSGHISVTAIGGTVESAEIFYPLENVIL
NYDGIGSHVWGDCSFVVADGSSAPYVSNGTCLGITGTVSASNYWWKQSYSNAQWVNTSIIPGNIPIDDLKLQFECFVKEV
FTGPVFQIAMCENFDAALNGYVPVSSFTGKTETGKWMQCSVSLSSVVADATYQDFLNRNSTHIGVYATNPGSSQATIEVY
FDNFRIVRK
>P9WJ13 ~~~~~~Toxin Rv2653c~~~
MTHKRTKRQPAIAAGLNAPRRNRVGRQHGWPADVPSAEQRRAQRQRDLEAIRRAYAEMVATSHEIDDDTAELALLSMHLD
DEQRRLEAGMKLGWHPYHFPDEPDSKQ
>P9WJ11 ~~~~~~Antitoxin Rv2654c~~~
MSGHALAARTLLAAADELVGGPPVEASAAALAGDAAGAWRTAAVELARALVRAVAESHGVAAVLFAATAAAAAAVDRGDP
P
>P9WL47 ~~~~~~Uncharacterized protein Rv2658c~~~
MADAVKYVVMCNCDDEPGALIIAWIDDERPAGGHIQMRSNTRFTETQWGRHIEWKLECRACRKYAPISEMTAAAILDGFG
AKLHELRTSTIPDADDPSIAEARHVIPFSALCLRLSQLGG
>Q8NMB4 ~~~~~~Uncharacterized protein Cgl2664/cg2949~~~COG2847
MEDESVKSLNLAARRGALVTVAAASALALASCSAGQITQTSSQVAAVDGNQAGSANDPVLVRDVTVHLTTDGEAGVKFTA
INQDTSHTSHTLESVTVDGEEVELDDAEPIERNCSLVADIQSELDLIEEPEVGCIQHVATSLENPGFAYGGVVPVEFVFD
TGAITIDATVSAPVLESGVENREVGGDTAEASHH
>P9WPC7 ~~~~~~Uncharacterized protein Rv2667~~~COG0542
MPEPTPTAYPVRLDELINAIKRVHSDVLDQLSDAVLAAEHLGEIADHLIGHFVDQARRSGASWSDIGKSMGVTKQAAQKR
FVPRAEATTLDSNQGFRRFTPRARNAVVAAQNAAHGAASSEITPDHLLLGVLTDPAALATALLQQQEIDIATLRTAVTLP
PAVTEPPQPIPFSGPARKVLELTFREALRLGHNYIGTEHLLLALLELEDGDGPLHRSGVDKSRAEADLITTLASLTGANA
AGATDAGATDAG
>P9WQG5 2.3.1.-~~~~~~Uncharacterized N-acetyltransferase Rv2669~~~COG0456
MTDADELAAVAARTFPLACPPAVAPEHIASFVDANLSSARFAEYLTDPRRAILTARHDGRIVGYAMLIRGDDRDVELSKL
YLLPGYHGTGAAAALMHKVLATAADWGALRVWLGVNQKNQRAQRFYAKTGFKINGTRTFRLGAHHENDYVMVRELV
>P9WPD7 ~~~~~~Uncharacterized transporter Rv2685~~~COG1055
MSIIAITVFVAGYALIASDRVSKTRVALTCAAIMVGAGIVGSDDVFYSHEAGIDWDVIFLLLGMMIIVSVLRHTGVFEYV
AIWAVKRANAAPLRIMILLVLVTALGSALLDNVTTVLLIAPVTLLVCDRLGVNSTPFLVAEVFASNVGGAATLVGDPPNI
IIASRAGLTFNDFLIHMAPAVLVVMIALIGLLPWLLGSVTAEPDRVADVLSLNEREAIHDRGLLIKCGVVLVLVFAAFIA
HPVLHIQPSLVALLGAGVLVRFSGLERSDYLSSVEWDTLLFFAGLFVMVGALVKTGVVEQLARAATELTGGNELLTVGLI
LGISAPVSGIIDNIPYVATMTPIVTELVAAMPGHVHPDTFWWALALSADFGGNLTAVAASANVVMLGIARRSGTPISFWK
FTRKGAVVTAVSLVLSAVYLWLRYFVFG
>Q882E2 ~~~~~~UPF0502 protein PSPTO_2686~~~COG3132
MSIESSATPTTPNAEALQLNSTEVRILGCLIEKQATNPETYPLTLNALVIACNQKTSRDPVMNLTQGQVGQSLRALEGRG
LTRLVMGSRADRWEHKVDKGLELVPAQVILTGLLLLRGPQTVSELLTRSNRMHDFEDSEQVVHQLERLIARGLATLVPRQ
SGQREDRYMHLIGDPEDLQDLLAARQQAPERGNAASPAATQRLDELEARIAALEERLARLE
>Q830S9 ~~~~~~UPF0213 protein EF_2693~~~COG2827
MENKKSHYFYVLLCQDGSFYGGYTTEPERRLTEHNSGTGAKYTRLAKRRPVIMIHTEKFETRSEATKAEAAFKKLTRKQK
EQYLKTFH
>Q9RXP1 ~~~~~~Uncharacterized protein DR_0269~~~COG0513
MTNADEQNMGQQEGTDTATTAQDTNTQTVGTQSENTQNTQQASDAQTEQTPAELRDVLPQLTGEVDEDDVLDAQEEQAAG
QSARKVDEDGEEYEEEFIDADDLMGLLGEMKEMLEAQSKEIRGLRREMRELRESQGGGFRGGDRGGDRGGFRPREDRGGF
GGDRDRGGFRPREDRGERSFGGDRGGDRGGFRPREDRGGFGGDRDRGGFRPREDRGERSFGGDRGGDRGGQGGFRPREDR
GGFGDRDRGGFRPREDRGERNFGGDRGGDRGGQGGFRPREDRNFGDREFRPRTDDAQGNQEGGFRPRARADRGWANRRTD
EE
>Q8Y3W5 ~~~~~~Cell wall protein Lmo2714~~~
MKKFHGLILTALVTCLIIPFSLAASAETNENGNNSTPTEEKTTDTTENIITEEKTEPAEPTKDTTVTPPQKSKVQTTIDI
TDSQEVYSYEQNSKIDLKDFTATLTDYTDTKLTNYRLAANSKLSTAQTGIQEVTLLGDNEDGKTTAVKLNYLVKEKTAQL
TITDMNFDLETKIFTGKTKPFASVYMSSPADENFGEGTTTADKDGYFYAEWATTPKQITAYAFDESGNYSEEVTYQVPAK
KQNEAVVTAPDKVKKIETKNAVKSATTKPTDVTKVTKDDKTDLPSTGDKGTEWIFVVAGVIVILVAVLLLRKRKK
>P9WNH3 ~~~~~~Uncharacterized protein Rv2715~~~COG2267
MTERKRNLRPVRDVAPPTLQFRTVHGYRRAFRIAGSGPAILLIHGIGDNSTTWNGVHAKLAQRFTVIAPDLLGHGQSDKP
RADYSVAAYANGMRDLLSVLDIERVTIVGHSLGGGVAMQFAYQFPQLVDRLILVSAGGVTKDVNIVFRLASLPMGSEAMA
LLRLPLVLPAVQIAGRIVGKAIGTTSLGHDLPNVLRILDDLPEPTASAAFGRTLRAVVDWRGQMVTMLDRCYLTEAIPVQ
IIWGTKDVVLPVRHAHMAHAAMPGSQLEIFEGSGHFPFHDDPARFIDIVERFMDTTEPAEYDQAALRALLRRGGGEATVT
GSADTRVAVLNAIGSNERSAT
>P9WL43 ~~~~~~Uncharacterized protein Rv2716~~~COG0384
MAIEVSVLRVFTDSDGNFGNPLGVINASKVEHRDRQQLAAQSGYSETIFVDLPSPGSTTAHATIHTPRTEIPFAGHPTVG
ASWWLRERGTPINTLQVPAGIVQVSYHGDLTAISARSEWAPEFAIHDLDSLDALAAADPADFPDDIAHYLWTWTDRSAGS
LRARMFAANLGVTEDEATGAAAIRITDYLSRDLTITQGKGSLIHTTWSPEGWVRVAGRVVSDGVAQLD
>P9WG93 ~~~~~~Uncharacterized membrane protein Rv2723~~~COG0861
MGASGLVWTLTIVLIAGLMLVDYVLHVRKTHVPTLRQAVIQSATFVGIAILFGIAVVVFGGSELAVEYFACYLTDEALSV
DNLFVFLVIISSFGVPRLAQQKVLLFGIAFALVTRTGFIFVGAALIENFNSAFYLFGLVLLVMAGNLARPTGLESRDAET
LKRSVIIRLADRFLRTSQDYNGDRLFTVSNNKRMMTPLLLVMIAVGGTDILFAFDSIPALFGLTQNVYLVFAATAFSLLG
LRQLYFLIDGLLDRLVYLSYGLAVILGFIGVKLMLEALHDNKIPFINGGKPVPTVEVSTTQSLTVIIIVLLITTAASFWS
ARGRAQNAMARARRYATAYLDLHYETESAERDKIFTALLAAERQINTLPTKYRMQPGQDDDLMTLLCRAHAARDAHM
>A0QVX6 ~~~~~~Uncharacterized protein MSMEG_2731/MSMEI_2664~~~COG1196
MRGPPRGVRGADMTTSEPGGATPKPTPRPTPHPTPRPVPRPSRVAPVVAAAPSSDPHRFGRVDPDGTVWLITGSGERVIG
AWQAGDTESAFAHFGRRYDDLHTEVALLERRLATGTGDARKIKSAAAALAESLPTASVLGDVDALSARLSSILEQADEAA
QNERAQRDEYRAAQTARKEALAAEAEDIAANSTQWKAAGDRLREILDEWRTITGLERKADDALWKRYSAARETFNRRRGS
HFAELDRSRVSARQAKEELCERAEALADSTDWGATSAAFRDLLAEWKAAGRAAKDVDDALWQRFKAAQDTFFSARNAAHA
ERDAEFKANAAAKEALLAEAEKIDLSDNEAARAALRVIGEKWDAIGKVPRERAAELERRLRAIEKKIREAPTGGVDPEAK
ARADQFRARAEQFERQAEKAEAAGRTKDAEEARASAAQWRQWAEAAAESLEKRR
>Q5XDV5 ~~~~~~Probable ABC transporter ATP-binding protein M6_Spy0273~~~
MSILEINNLHVSIEGKEILKGVNLTLKTGEVAAIMGPNGTGKSTLSAAIMGNPNYEVTQGQILLDGVNILDLEVDERARL
GLFLAMQYPSEIPGITNAEFMRAAMNAGKADEDKISVRDFITKLDEKMALLGMKEEMAERYLNEGFSGGEKKRNEILQLL
MLEPKFALLDEIDSGLDIDALKVVSKGVNEMRGKDFGAMIITHYQRLLNYITPDLVHVMMDGRIVLSGDAALATRLEKEG
YAGIAQDLGIEYKEES
>P0DMQ8 ~~~~~~Uncharacterized protein Rv2742A~~~
MSDNAIRPRPNPWQYIRYCYGARLPDSMRDWVRNDLAGKGAAIRMMIRVAVPAVLVLAPFWLIPTSLDVHLSMTLPILIP
FVYFSHALNKVWRRHMLRVHNLDPELVDEHARQRDAHIHRAYIERYGPRPDPND
>I6YA50 ~~~~~~Putative envelope-preserving system protein Rv2743c~~~
MAVKAGQRRPWRSLLQRGVDTAGDLADLVAQKISVAIDPRARLLRRRRRALRWGLVFTAGCLLWGLVTALLAAWGWFTSL
LVITGTIAVTQAIPATLLLLRYRWLRSEPLPVRRPASVRRLPPPGSAARPAMSALGASERGFFSLLGVMERGAMLPADEI
RDLTAAANQTSAAMVATAAEVVSMERAVQCSAASRSYLVPTINAFTAQLSTGVRQYNEMVTAAAQLVSSANGAGGAGPGQ
QRYREELAGATDRLVAWAQAFDELGGLPRR
>O33285 ~~~~~~Putative envelope-preserving system protein Rv2742c~~~
MLVDELGVKIVHAQHVPAPYLVQRMREIHERDENRQRHAQVDVQRRRDQPERGQHQHRRNRDADHHPDGRTLAGQIVAHP
VSHRVRQPRPVAIADVLPRVGPRADCVVAHSLQGSPRRRERRRGQTAHQRLGRRSGNAIACPLYLENAAGPEPDTKRAEG
RRFGAFGGGDLRWMADRVPRQGSGRRGLGSRSGAGVPQGADARGWRHTADGVPRVGQPAIRRGVPGFWCWLDHVLTGFGG
RNAICAIEDGVEPRVAWWALCTDFDVPRSMGRRTPGG
>P9WGS5 1.-.-.-~~~~~~Uncharacterized oxidoreductase Rv2750~~~COG1028
MIDRPLEGKVAFITGAARGLGRAHAVRLAADGANIIAVDICEQIASVPYPLSTADDLAATVELVEDAGGGIVARQGDVRD
RASLSVALQAGLDEFGRLDIVVANAGIAMMQAGDDGWRDVIDVNLTGVFHTVQVAIPTLIEQGTGGSIVLISSAAGLVGI
GSSDPGSLGYAAAKHGVVGLMRAYANHLAPQNIRVNSVHPCGVDTPMINNEFFQQWLTTADMDAPHNLGNALPVELVQPT
DIANAVAWLASEEARYVTGVTLPVDAGFVNKR
>A5U683 ~~~~~~Uncharacterized protein MRA_2757~~~COG1196
MTADEPRSDDSSGSAPQPAATPVPRPGPRPGPRPVPRPTSYPVGAHPPSDPHRFGRIDDDGTVWLVSASGERIVGSWQAG
DPEAAFAHFGRRFDDLSTEIMLMDERLASGTGDARKIKAHAIALAETLPTACVLGDVDALADRLTSIRDRAEVIAAADRS
RREEHRAAQTARKEALAAEAEELAANATQWKVAGDRLRAILDEWKTISGVDRKVDDALWKRYSTARDTFNRRRGSHFAEL
DRERSGVRQSKERLCERAEELSESTDWTATSAEFRKLLADWKAAGRASKDVDDALWRRFKAAQDSFFTARNAATAEKEAE
LRANADAKEALLAEAERLDTTNHEAARAALRSIAEKWDAIGKVSRERAAELERRLRAVEKKVREAGEADWSDPQARARAE
QFRARAEQFEHQAEKAAAAGRTKEADEAKANAEQWRQWAEAAADALTRRP
>Q9L082 3.1.3.-~~~~~~Phosphatase SCO2771~~~
MPIPGTPSRAELAEHLVRTRIAGDVATPRENNLSHYRKLANGDRGFWLGLELGDRWSDEQDVLAVMAERVGVNDDPEHRY
GQDTIDPELTISALERMAGRLRKAADGGQRVLFATGHPGGLLDVHRATAAALRDAGCEIVVIPEGLTTEEGYVQQFADVS
VLEHGASLWHTHSGEPMKAILTGLEREGRPLPDLVVADHGWAGYAAQHGVDSVGYADCNDPALFLAESEGTLQVAVPLDD
HVVSPRYYDPMTAYLLTEAGLK
>O33317 2.3.1.-~~~~~~Probable N-acetyltransferase Rv2775~~~COG0456
MKPSNIRIRAAKPIDFPKVAAMHYPVWRQSWTGILDPYLLDMIGSPKLWVEESYPQSLKRGGWSMWIAESGGQPIGMTMF
GPDIAHPDRIQIDALYVAENSQRHGIGGRLLNRALHSHPSADMILWCAEKNSKARGFYEKKDFHIDGRTFTWKPLSGVNV
PHVGYRLYRSAPPG
>P9WHT5 3.4.24.-~~~~~~Uncharacterized zinc protease Rv2782c~~~COG0612
MPRRSPADPAAALAPRRTTLPGGLRVVTEFLPAVHSASVGVWVGVGSRDEGATVAGAAHFLEHLLFKSTPTRSAVDIAQA
MDAVGGELNAFTAKEHTCYYAHVLGSDLPLAVDLVADVVLNGRCAADDVEVERDVVLEEIAMRDDDPEDALADMFLAALF
GDHPVGRPVIGSAQSVSVMTRAQLQSFHLRRYTPERMVVAAAGNVDHDGLVALVREHFGSRLVRGRRPVAPRKGTGRVNG
SPRLTLVSRDAEQTHVSLGIRTPGRGWEHRWALSVLHTALGGGLSSRLFQEVRETRGLAYSVYSALDLFADSGALSVYAA
CLPERFADVMRVTADVLESVARDGITEAECGIAKGSLRGGLVLGLEDSSSRMSRLGRSELNYGKHRSIEHTLRQIEQVTV
EEVNAVARHLLSRRYGAAVLGPHGSKRSLPQQLRAMVG
>Q2FVD0 ~~~~~~Uncharacterized protein SAOUHSC_02783~~~
MIHSKKLTLGICLVLLIILIVGYVIMTKTNGRNAQIKDTFNQTLKLYPTKNLDDFYDKEGFRDQEFKKGDKGTWIVNSEM
VIEPKGKDMETRGMVLYINRNTRTTKGYYFISEMTDDSNGRPKDDEKRYPVKMEHNKIIPTKPLPNDKLKKEIENFKFFV
QYGNFKDINDYKDGDISYNPNVPSYSAKYQLNNDDYNVQQLRKRYDIPTKQAPKLLLKGDGDLKGSSVGSRSLEFTFVEN
KEENIYFTDSVQYTPSEDTRYESN
>P47523 ~~~~~~Uncharacterized protein MG281~~~
MQFKKHKNSVKFKRKLFWTIGVLGAGALTTFSAVMITNLVNQSGYALVASGRSGNLGFKLFSTQSPSAEVKLKSLSLNDG
SYQSEIDLSGGANFREKFRNFANELSEAITNSPKGLDRPVPKTEISGLIKTGDNFITPSFKAGYYDHVASDGSLLSYYQS
TEYFNNRVLMPILQTTNGTLMANNRGYDDVFRQVPSFSGWSNTKATTVSTSNNLTYDKWTYFAAKGSPLYDSYPNHFFED
VKTLAIDAKDISALKTTIDSEKPTYLIIRGLSGNGSQLNELQLPESVKKVSLYGDYTGVNVAKQIFANVVELEFYSTSKA
NSFGFNPLVLGSKTNVIYDLFASKPFTHIDLTQVTLQNSDNSAIDANKLKQAVGDIYNYRRFERQFQGYFAGGYIDKYLV
KNVNTNKDSDDDLVYRSLKELNLHLEEAYREGDNTYYRVNENYYPGASIYENERASRDSEFQNEILKRAEQNGVTFDENI
KRITASGKYSVQFQKLENDTDSSLERMTKAVEGLVTVIGEEKFETVDITGVSSDTNEVKSLAKELKTNALGVKLKL
>Q2FVA4 1.-.-.-~~~~~~Putative NAD(P)H nitroreductase SAOUHSC_02829~~~COG0778
MSNMNQTIMDAFHFRHATKQFDPQKKVSKEDFETILESGRLSPSSLGLEPWKFVVIQDQALRDELKAHSWGAAKQLDTAS
HFVLIFARKNVTSRSPYVQHMLRDIKKYEAQTIPAVEQKFDAFQADFHISDNDQALYDWSSKQTYIALGNMMTTAALLGI
DSCPMEGFSLDTVTDILANKGILDTEQFGLSVMVAFGYRQQEPPKNKTRQAYEDVIEWVGPKE
>Q9RYM9 ~~~~~~Uncharacterized protein DR_A0282~~~COG4932
MFMKSKAAGSEFDGAVAKDNVNTRLKIAQFMSADPNATADVPQLKLETPTAFKANGEVDTWGPLGSGTAFNDVVNVRAYS
VKNSDQPRVMRYFLFSLVNIDKDGTWSDVRPAAGLYEQDPGYVTPGVDPNNKGQGQDSGLVSLDATGLEGDVYLQVVGLD
FNYNRVAYLVPLKLNRTKAASEVVAPTNVRAIAYTLSTRIDYLYKTQDPVLDAPTSGTNLWVTTSWDAPVTLSGYRGFRV
LRSTKAEGPYSQVAFAGEAQCAKPADAKATTRRCTVSDNTASLITDQDYFYKVVAAGTNEATSDVAPTHTLPIFQPKLLS
PGKDVHDVDLTPNYTVKLNLFQTGATGAVMNLRVADFITGESYAYAAKRLTVRKELGETQILSNLQGTSNYYVFRDSYAT
DNDPKTNNDTVTYDAASDVLTVPHQFEVDYLGGNKVPLQANRRYSWYIDSGYAYRLADPSKPTTAANNYIAAYSVYSDPS
DTVRVVPGGVKQGGAEVNDFTTRQ
>Q81PH1 ~~~~~~Uncharacterized protein BA_2834/GBAA_2834/BAS2643~~~COG3938
MRTQKVFTTIDTHTGGNPTRTLISGLPKLLGETMAEKMLHMKKEYDWIRKLLMNEPRGHDVMSGALLTDPCHPDADIGVI
YIETGGYLPMCGHDTIGVCTALIESGLIPVVEPITSLKLDTPAGLVEVDIFVRDGKAKEVSFCNIPAFILKHITVDVENI
GTVEADIAYGGNFYAIIDAKSVGLELVPEHASTIIDKAIHIRNIINERFEIIHPEYSFIRGLTHVEFYTDPTHESAHVKN
TVVVPPGGIDRSPCGTGTSAKLAVLYANQKIEMNEEFVHESIVGSLFKGCVINTTNVANMEAVVTKITGSAWLMGMHRFF
YNEKDPLKEGFLLIPPMEHETEDVK
>P9WPR3 ~~~~~~Uncharacterized protein Rv2850c~~~COG1239
MKPYPFSAIVGHDRLRLALLLCAVRPEIGGALIRGEKGTAKSTAVRGLAALLSVATGSTETGLVELPLGATEDRVVGSLD
LQRVMRDGEHAFSPGLLARAHGGVLYVDEVNLLHDHLVDILLDAAAMGRVHVERDGISHSHEARFVLIGTMNPEEGELRP
QLLDRFGLTVDVQASRDIDVRVQVIRRRMAYEADPDAFVARYADADAELAHRIAAARATVDDVVLGDNELRRIAALCAAF
DVDGMRADLVVARTAAAHAAWRGVRTVEEQDIRAAAELALPHRRRRDPFDDHGIDRDQLDEALALASVDPEPEPDPPGGG
QSANEPASQPNSRSKSTEPGAPSSMGDDPPRPASPRLRSSPRPSAPPSKIFRTRALRVPGVGTGAPGRRSRARNASGSVV
AAAEVSDPDAHGLHLFATLLAAGERAFGAGPLRPWPDDVRRAIREGREGNLVIFVVDASGSMAARDRMAAVSGATLSLLR
DAYQRRDKVAVITFRQHEATLLLSPTSSAHIAGRRLARFSTGGKTPLAEGLLAARALIIREKVRDRARRPLVVVLTDGRA
TAGPDPLGRSRTAAAGLVAEGAAAVVVDCETSYVRLGLAAQLARQLGAPVVRLEQLHADYLVHAVRGVA
>P9WFQ5 ~~~~~~UPF0039 protein Rv2851c~~~COG2153
MTEALRRVWAKDLDARALYELLKLRVEVFVVEQACPYPELDGRDLLAETRHFWLETPDGEVTCTLRLMEEHAGGEKVFRI
GRLCTKRDARGQGHSNRLLCAALAEVGDYPCRIDAQAYLTAMYAQHGFVRDGDEFLDDGIPHVPMLRPGSGQVERP
>O33341 2.4.2.-~~~~~~Putative glutamine amidotransferase Rv2859c~~~COG2071
MDLSASRSDGGDPLRPASPRLRSPVSDGGDPLRPASPRLRSPVSDGGDPLRPASPRLRSPLGASRPVVGLTAYLEQVRTG
VWDIPAGYLPADYFEGITMAGGVAVLLPPQPVDPESVGCVLDSLHALVITGGYDLDPAAYGQEPHPATDHPRPGRDAWEF
ALLRGALQRGMPVLGICRGTQVLNVALGGTLHQHLPDILGHSGHRAGNGVFTRLPVHTASGTRLAELIGESADVPCYHHQ
AIDQVGEGLVVSAVDVDGVIEALELPGDTFVLAVQWHPEKSLDDLRLFKALVDAASGYAGRQSQAEPR
>P9WL35 ~~~~~~Uncharacterized protein Rv2886c~~~COG2452
MSRILTHVPGRTVNRSYALPALVGSAAGRLSGNHSHGREAYIALPQWACSRQPSTPPLQTPGRINALWSLRPVLPMPGRG
CQLLRLGGRWLSVVCCRNGSMNLVVWAEGNGVARVIAYRWLRVGRLPVPARRVGRVILVDEPAGQPGRWGRTAVCARLSS
ADQKVDLDRQVVGVTAWATAEQIPVGKVVTEVGSALYGRRRTFLTLLGDPTVRRIVMKRRDRLGRFGFECVQAVLAADGR
ELVVVDSADVDDDVVGDITEILTSICARLYGKRAAGNRAARAVAAAARAGGHEAR
>Q9WYC4 ~~~~~~Uncharacterized ABC transporter ATP-binding protein TM_0288~~~COG1132
MPEIRRRPHGPILEKPALKNPTATLRRLLGYLRPHTFTLIMVFVFVTVSSILGVLSPYLIGKTIDVVFVPRRFDLLPRYM
LILGTIYALTSLLFWLQGKIMLTLSQDVVFRLRKELFEKLQRVPVGFFDRTPHGDIISRVINDVDNINNVLGNSIIQFFS
GIVTLAGAVIMMFRVNVILSLVTLSIVPLTVLITQIVSSQTRKYFYENQRVLGQLNGIIEEDISGLTVIKLFTREEKEME
KFDRVNESLRKVGTKAQIFSGVLPPLMNMVNNLGFALISGFGGWLALKDIITVGTIATFIGYSRQFTRPLNELSNQFNMI
QMALASAERIFEILDLEEEKDDPDAVELREVRGEIEFKNVWFSYDKKKPVLKDITFHIKPGQKVALVGPTGSGKTTIVNL
LMRFYDVDRGQILVDGIDIRKIKRSSLRSSIGIVLQDTILFSTTVKENLKYGNPGATDEEIKEAAKLTHSDHFIKHLPEG
YETVLTDNGEDLSQGQRQLLAITRAFLANPKILILDEATSNVDTKTEKSIQAAMWKLMEGKTSIIIAHRLNTIKNADLII
VLRDGEIVEMGKHDELIQKRGFYYELFTSQYGLVVEKE
>P9WL31 ~~~~~~Uncharacterized protein Rv2895c~~~COG2375
MAGRPLHAFEVVATRHLAPHMVRVVLGGSGFDTFVPSDFTDSYIKLVFVDDDVDVGRLPRPLTLDSFADLPTAKRPPVRT
MTVRHVDAAAREIAVDIVLHGEHGVAGPWAAGAQRGQPIYLMGPGGAYAPDPAADWHLLAGDESAIPAIAAALEALPPDA
IGRAFIEVAGPDDEIGLTAPDAVEVNWVYRGGRADLVPEDRAGDHAPLIEAVTTTAWLPGQVHVFIHGEAQAVMHNLRPY
VRNERGVDAKWASSISGYWRRGRTEEMFRKWKKELAEAEAGTH
>P9WJP9 1.-.-.-~~~~~~Uncharacterized oxidoreductase Rv2900c~~~COG0243
MYVEAVRWQRSAASRDVLADYDEQAVTVAPRKREAAGVRAVMVSLQRGMQQMGALRTAAALARLNQRNGFDCPGCAWPEE
PGGRKLAEFCENGAKAVAEEATKRTVTAEFFARHSVAELSAKPEYWLSQQGRLAHPMVLRPGDDHYRPISWDAAYQLIAE
QLNGLDSPDRAVFYTSGRTSNEAAFCYQLLVRSFGTNNLPDCSNMCHESSGAALTDSIGIGKGSVTIGDVEHADLIVIAG
QNPGTNHPRMLSVLGKAKANGAKIIAVNPLPEAGLIRFKDPQKVNGVVGHGIPIADEFVQIRLGGDMALFAGLGRLLLEA
EERVPGSVVDRSFVDNHCAGFDGYRRRTLQVGLDTVMDATGIELAQLQRVAAMLMASQRTVICWAMGLTQHAHAVATIGE
VTNVLLLRGMIGKPGAGVCPVRGHSNVQGDRTMGIWEKMPEQFLAALDREFGITSPRAHGFDTVAAIRAMRDGRVSVFMG
MGGNFASATPDTAVTEAALRRCALTVQVSTKLNRSHLVHGATALILPTLGRTDRDTRNGRKQLVSVEDSMSMVHLSRGSL
HPPSDQVRSEVQIICQLARALFGPGHPVPWERFADDYDTIRDAIAAVVPGCDDYNHKVRVPDGFQLPHPPRDAREFRTST
GKANFAVNPLQWVPVPPGRLVLQTLRSHDQYNTTIYGLDDRYRGVKGGRRVVFINPADIETFGLTAGDRVDLVSEWTDGQ
GGLQERRAKDFLVVAYSTPVGNAAAYYPETNPLVPLDHTAAQSNTPVSKAIIVRLEPTA
>P9WL27 ~~~~~~Uncharacterized protein Rv2901c~~~
MSAEDLEKYETEMELSLYREYKDIVGQFSYVVETERRFYLANSVEMVPRNTDGEVYFELRLADAWVWDMYRPARFVKQVR
VVTFKDVNIEEVEKPELRLPE
>P9WL25 ~~~~~~Uncharacterized protein Rv2910c~~~COG5517
MCAVLDRSMLSVAEISDRLEIQQLLVDYSSAIDQRRFDDLDRVFTPDAYIDYRALGGIDGRYPKIKQWLSQVLGNFPVYA
HMLGNFSVRVDGDTASSRVICFNPMVFAGDRQQVLFCGLWYDDDFVRTPDGWRIIRRVETKCFQKMM
>P9WMC7 ~~~~~~Uncharacterized HTH-type transcriptional regulator Rv2912c~~~COG1309
MARTQQQRREETVARLLQASIDTIIEVGYARASAAVITKRAGVSVGALFRHFETMGDFMAATAYEVLRRQLETFTKQVAE
IPADRPALPAALTILRDITAGSTNAVLYELMVAARTDEKLKETLQNVLGQYSAKIHDAARALPGAESFPEETFPVIVALM
TNVFDGAAIVRGVLPQPELEEQRIPMLTALLTAGL
>P9WJH9 ~~~~~~Uncharacterized protein Rv2913c~~~COG3653
MLAWRQLNDLEETVTYDVIIRDGLWFDGTGNAPLTRTLGIRDGVVATVAAGALDETGCPEVVDAAGKWVVPGFIDVHTHY
DAEVLLDPGLRESVRHGVTTVLLGNCSLSTVYANSEDAADLFSRVEAVPREFVLGALRDNQTWSTPAEYIEAIDALPLGP
NVSSLLGHSDLRTAVLGLDRATDDTVRPTEAELAKMAKLLDEALEAGMLGMSGMDAAIDKLDGDRFRSRALPSTFATWRE
RRKLISVLRHRGRILQSAPDVDNPVSALLFFLASSRIFNRRKGVRMSMLVSADAKSMPLAVHVFGLGTRVLNKLLGSQVR
FQHLPVPFELYSDGIDLPVFEEFGAGTAALHLRDQLQRNELLADRSYRRSFRREFDRIKLGPSLWHRDFHDAVIVECPDK
SLIGKSFGAIADERGLHPLDAFLDVLVDNGERNVRWTTIVANHRPNQLNKLAAEPSVHMGFSDAGAHLRNMAFYNFGLRL
LKRARDADRAGQPFLSIERAVYRLTGELAEWFGIGAGTLRQGDRADFAVIDPTHLDESVDGYHEEAVPYYGGLRRMVNRN
DATVVATGVGGTVVFRGGQFGGQFRDGYGQNVKSGRYLRAGELGAALSRSA
>P9WL23 ~~~~~~Uncharacterized protein Rv2915c~~~COG1228
MKRVDTIRPRSRAVRLHVRGLGLPDETAIQLWIVDGRISTEPVAGADTVFDGGWILPGLVDAHCHVGLGKHGNVELDEAI
AQAETERDVGALLLRDCGSPTDTRGLDDHEDLPRIIRAGRHLARPKRYIAGFAVELEDESQLPAAVAEQARRGDGWVKLV
GDWIDRQIGDLAPLWSDDVLKAAIDTAHAQGARVTAHVFSEDALPGLINAGIDCIEHGTGLTDDTIALMLEHGTALVPTL
INLENFPGIADAAGRYPTYAAHMRDLYARGYGRVAAAREAGVPVYAGTDAGSTIEHGRIADEVAALQRIGMTAHEALGAA
CWDARRWLGRPGLDDRASADLLCYAQDPRQGPGVLQHPDLVILRGRTFGP
>P43979 ~~~~~~Uncharacterized protein HI_0291~~~COG2608
MKTITLNIKGIHCGCCVKNLTQVLTELDGVQSADVQLEGKANITFDENRVNVAQLIEVIEDAGFDATE
>P9WL19 ~~~~~~Uncharacterized protein Rv2923c~~~COG1765
MTQLWVERTGTRRYIGRSTRGAQVLVGSEDVDGVFTPGELLKIALAACSGMASDQPLARRLGDDYQAVVKVSGAADRDQE
RYPLIEETMELDLSGLTEDEKERLLVVINRAVELACTVGRTLKSGTTVNLEVVDVGA
>P9WL17 ~~~~~~Uncharacterized protein Rv2926c~~~COG1399
MDLGGVRRRISLMARQHGPTAQRHVASPMTVDIARLGRRPGAMFELHDTVHSPARIGLELIAIDQGALLDLDLRVESVSE
GVLVTGTVAAPTVGECARCLSPVRGRVQVALTELFAYPDSATDETTEEDEVGRVVDETIDLEQPIIDAVGLELPFSPVCR
PDCPGLCPQCGVPLASEPGHRHEQIDPRWAKLVEMLGPESDTLRGER
>P9WL15 ~~~~~~Uncharacterized protein Rv2927c~~~COG3599
MYRVFEALDELSAIVEEARGVPMTAGCVVPRGDVLELIDDIKDAIPGELDDAQDVLDARDSMLQDAKTHADSMVSSATTE
AESILNHARTEADRILSDAKAQADRMVSEARQHSERMVADAREEAIRIATAAKREYEASVSRAQAECDRLIENGNISYEK
AVQEGIKEQQRLVSQNEVVAAANAESTRLVDTAHAEADRLRGECDIYVDNKLAEFEEFLNGTLRSVGRGRHQLRTAAGTH
DYAVR
>P9WFI9 2.1.1.-~~~~~~Putative S-adenosyl-L-methionine-dependent methyltransferase Rv0281~~~COG3315
MRTEGDSWDITTSVGSTALFVATARALEAQKSDPLVVDPYAEAFCRAVGGSWADVLDGKLPDHKLKSTDFGEHFVNFQGA
RTKYFDEYFRRAAAAGARQVVILAAGLDSRAYRLPWPDGTTVFELDRPQVLDFKREVLASHGAQPRALRREIAVDLRDDW
PQALRDSGFDAAAPSAWIAEGLLIYLPATAQERLFTGIDALAGRRSHVAVEDGAPMGPDEYAAKVEEERAAIAEGAEEHP
FFQLVYNERCAPAAEWFGERGWTAVATLLNDYLEAVGRPVPGPESEAGPMFARNTLVSAARV
>A0QWH1 ~~~~~~Probable transcriptional regulatory protein MSMEG_2940/MSMEI_2866~~~COG0217
MSGHSKWATTKHKKAVIDAKRGKMFAKLIKNIEVAARVGGGDPGGNPTLYDAIQKAKKSSVPNDNIERARKRGAGEEAGG
ADWQNITYEGYGPNGVAVLVECLTDNRNRAAGEVRVAMTRNGGNMADPGSVAYLFSRKGVVTLEKNGLTEDDVLLAVLEA
GAEEVNDLGDSFEIISEPSDLVAVRTALQEAGIDYDSADASFQPSVTVPVDLEGARKVLKLVDALEDSDDVQDVYTNMDI
PDDVAAQLDEE
>I6YET7 ~~~~~~Putative permease Rv2963~~~COG0701
MTSTKVEDRVTAAVLGAIGHALALTASMTWEILWALILGFALSAVVQAVVRRSTIVTLLGDDRPRTLVIATGLGAASSSC
SYAAVALARSLFRKGANFTAAMAFEIGSTNLVVELGIILALLMGWQFTAAEFVGGPIMILVLAVLFRLFVGARLIDAARE
QAERGLAGSMEGHAAMDMSIKREGSFWRRLLSPPGFTSIAHVFVMEWLAILRDLILGLLIAGAIAAWVPESFWQSFFLAN
HPAWSAVWGPIIGPIVAIVSFVCSIGNVPLAAVLWNGGISFGGVIAFIFADLLILPILNIYRKYYGARMMLVLLGTFYAS
MVVAGYLIELLFGTTNLIPSQRSATVMTAEISWNYTTWLNVIFLVIAAALVVRFITSGGLPMLRMMGGSPDAPHDHHDRH
DDHLGH
>P9WQA5 1.1.1.-~~~~~~Aldo-keto reductase Rv2971~~~COG0656
MTGESGAAAAPSITLNDEHTMPVLGLGVAELSDDETERAVSAALEIGCRLIDTAYAYGNEAAVGRAIAASGVAREELFVT
TKLATPDQGFTRSQEACRASLDRLGLDYVDLYLIHWPAPPVGKYVDAWGGMIQSRGEGHARSIGVSNFTAENIENLIDLT
FVTPAVNQIELHPLLNQDELRKANAQHTVVTQSYCPLALGRLLDNPTVTSIASEYVKTPAQVLLRWNLQLGNAVVVRSAR
PERIASNFDVFDFELAAEHMDALGGLNDGTRVREDPLTYAGT
>Q2G222 ~~~~~~N-acetylmuramoyl-L-alanine amidase domain-containing protein SAOUHSC_02979~~~COG1705
MPKNKILIYLLSTTLVLPTLVSPTAYADTPQKDTTAKTTSHDSKKSNDDETSKDTTSKDIDKADKNNTSNQDNNDKKFKT
IDDSTSDSNNIIDFIYKNLPQTNINQLLTKNKYDDNYSLTTLIQNLFNLNSDISDYEQPRNGEKSTNDSNKNSDNSIKND
TDTQSSKQDKADNQKAPKSNNTKPSTSNKQPNSPKPTQPNQSNSQPASDDKANQKSSSKDNQSMSDSALDSILDQYSEDA
KKTQKDYASQSKKDKNEKSNTKNPQLPTQDELKHKSKPAQSFNNDVNQKDTRATSLFETDPSISNNDDSGQFNVVDSKDT
RQFVKSIAKDAHRIGQDNDIYASVMIAQAILESDSGRSALAKSPNHNLFGIKGAFEGNSVPFNTLEADGNQLYSINAGFR
KYPSTKESLKDYSDLIKNGIDGNRTIYKPTWKSEADSYKDATSHLSKTYATDPNYAKKLNSIIKHYQLTQFDDERMPDLD
KYERSIKDYDDSSDEFKPFREVSDSMPYPHGQCTWYVYNRMKQFGTSISGDLGDAHNWNNRAQYRDYQVSHTPKRHAAVV
FEAGQFGADQHYGHVAFVEKVNSDGSIVISESNVKGLGIISHRTINAAAAEELSYITGK
>P9WJ09 ~~~~~~Antitoxin Rv0298~~~
MTKEKISVTVDAAVLAAIDADARAAGLNRSEMIEQALRNEHLRVALRDYTAKTVPALDIDAYAQRVYQANRAAGS
>I6Y276 ~~~~~~Protein Rv2993c~~~COG0179
MRIGRIASPDGVAFASIDGELGEPSEMTAREIAEHPFGTPTFTGRSWPLADVRLLAPILASKVVCVGKNYADHIAEMGGR
PPADPVIFLKPNTAIIGPNTPIRLPANASPVHFEGELAIVIGRACKDVPAAQAVDNILGYTIGNDVSARDQQQSDGQWTR
AKGHDTFCPVGPWIVTDLAPFDPADLELRTVVNGDVKQHARTSLMIHDIGAIVEWISAIMTLLPGDLILTGTPAGVGPIE
DGDTVSITIEGIGTLTNPVVRKGKP
>P9WJW7 ~~~~~~Uncharacterized MFS-type transporter Rv2994~~~COG2271
MSRDPTGVGARWAIMIVSLGVTASSFLFINGVAFLIPRLENARGTPLSHAGLLASMPSWGLVVTMFAWGYLLDHVGERMV
MAVGSALTAAAAYAAASVHSLLWIGVFLFLGGMAAGGCNSAGGRLVSGWFPPQQRGLAMGIRQTAQPLGIASGALVIPEL
AERGVHAGLMFPAVVCTLAAVASVLGIVDPPRKSRTKASEQELASPYRGSSILWRIHAASALLMMPQTVTVTFMLVWLIN
HHGWSVAQAGVLVTISQLLGALGRVAVGRWSDHVGSRMRPVRLIAAAAAATLFLLAAVDNEGSRYDVLLMIAISVIAVLD
NGLEATAITEYAGPYWSGRALGIQNTTQRLMAAAGPPLFGSLITTAAYPTAWALCGVFPLAAVPLVPVRLLPPGLETRAR
RQSVRRHRWWQAVRCHAWPNGPRRPGPPGQPRRVRQGGTAITPPT
>Q7TXI6 1.1.1.-~~~~~~Aldo-keto reductase BQ2027_MB2996~~~
MTGESGAAAAPSITLNDEHTMPVLGLGVAELSDDETERAVSAALEIGCRLIDTAYAYGNEAAVGRAIAASGVAREELFVT
TKLATPDQGFTRSQEACRASLDRLGLDYVDLYLIHWPAPPVGKYVDAWGGMIQSRGEGHARSIGVSNFTAEHIENLIDLT
FVTPAVNQIELHPLLNQDELRKANAQHTVVTQSYCPLALGRLLDNPTVTSIASEYVKTPAQVLLRWNLQLGNAVVVRSAR
PERIASNFDVFDFELAAEHMDALGGLNDGTRVREDPLTYAGT
>O07226 ~~~~~~Toxin Rv0299~~~
MIAPGDIAPRRDSEHELYVAVLSNALHRAADTGRVITCPFIPGRVPEDLLAMVVAVEQPNGTLLPELVQWLHVAALGAPL
GNAGVAALREAASVVTALLC
>A0A0H3CC29 ~~~~~~UPF0276 protein CCNA_03000~~~
MTLQPFDGFGLGLRPPHYRAFLDSERPLVDFVEVISENFMVGGGRPLHVIDAVRERYPVALHGVSMSVGSADGVKLDYLR
RLKGLADRVDPMWVSDHLCWTGVEGFNSHDLLPVPYTEEAMAVVCANIALAQDVLERPLLLENPSSYVTFANDAMAEHQF
LAEMCARTGCYLLLDINNIYVSASNHGFDPYEYLAAVPVDRVLQIHLAGHSQGRELLIDTHDQPVPDSVWALYEAAAGRF
GPVAAMIERDDDIPPLDDLLAELDVARARWAAGRRGSLAA
>P9WJZ1 2.1.1.-~~~~~~Probable S-adenosylmethionine-dependent methyltransferase Rv3030~~~COG2227
MCAFVPHVPRHSRGDNPPSASTASPAVLTLTGERTIPDLDIENYWFRRHQVVYQRLAPRCTARDVLEAGCGEGYGADLIA
CVARQVIAVDYDETAVAHVRSRYPRVEVMQANLAELPLPDASVDVVVNFQVIEHLWDQARFVRECARVLRGSGLLMVSTP
NRITFSPGRDTPINPFHTRELNADELTSLLIDAGFVDVAMCGLFHGPRLRDMDARHGGSIIDAQIMRAVAGAPWPPELAA
DVAAVTTADFEMVAAGHDRDIDDSLDLIAIAVRP
>O53281 2.3.1.-~~~~~~Probable acetyltransferase Rv3034c~~~COG0110
MNVLSLGSSSGVVWGRVPITAPAGAATGVTSRADAHSQMRRYAQTGPTAKLSSAPMTTMWGAPLHRRWRGSRLRDPRQAK
FLTLASLKWVLANRAYTPWYLVRYWRLLRFKLANPHIITRGMVFLGKGVEIHATPELAQLEIGRWVHIGDKNTIRAHEGS
LRFGDKVVLGRDNVINTYLDIEIGDSVLMADWCYICDFDHRMDDITLPIKDQGIIKSPVRIGPDTWIGVKVSVLRGTTIG
RGCVLGSHAVVRGAIPDYSIAVGAPAKVVKNRQLSWEASAAQRAELAAALADIERKKAAR
>I6XFZ8 ~~~~~~Protein Rv3035~~~COG1520
MGGCGSADSWVEAAPAQGWPAQYGDAANSSYTTTNGATNLTLRWTRSVKGSLAAGPALSARGYLALNGQTPAGCSLMEWQ
NDNNGRQRWCVRLVQGGGFAGPLFDGFDNLYVGQPGAIISFPPTQWTRWRQPVIGMPSTPRFLGHGRLLVSTHLGQLLVF
DTRRGMVVGSPVDLVDGIDPTDATRGLADCAPARPGCPVAAAPAFSSVNGTVVVSVWQPGEPAAKLVGLKYHAEQLVREW
TSDAVSAGVLASPVLSADGSTVYVNGRDHRLWALNAADGKAKWSAPLGFLAQTPPALTPHGLIVSGGGPDTALAAFRDAG
DHAEGAWRRDDVTALSTASLAGTGVGYTVISGPNHDGTPGLSLLVFDPANGHTVNSYPLPGATGYPVGVSVGNDRRVVTA
TSDGQVYSFAP
>I6YB21 ~~~~~~Uncharacterized protein Rv3067~~~
MRDEPPTDTAAAPTTGAAPEIDTAREYEVTAEYQSWRVVWGSAAALLTVGVGIGAAILLGWFTLAHRHPDQPGAAATPPP
AGLTTRSAPTAAPPSTLQSPDLDSVFLGNLHDRGISFTNPDAAVYNGKMVCTNLGGGMTVQQVVEALQSSSPALGDRTTA
YVAVSIRTYCPKYDAVLPPGS
>A0A0S2DN66 3.6.4.-~~~~~~Probable ATPase FE772_23070~~~
MTPTVPELEFAQATAFESNVPPLFPGRAGPDDGADRLRLRLAVRGWARPLDLIAASNSPDDVMAILGALAAEVETAVQTR
PNQWYLSLAARKRELARRDDAQLRAAASASVPGDDDDPVLHAMRIALADAEPTLSALGTDVLAALGNACDWLGERWRHAV
GAQRIAGELASRALDADLRRMTRDPMVGRSHRDTLARLIGFATAATQTPLSVAYVYGGGGAGKTTLLSFLQRDLSQRAEP
VPVVRIDFDEPAIDPTRMVTLNIALVEQLARSVPAVCDRASDMLPALRDTALVQHDAGLSRGGPRKRIKSSRPESMLKAE
SVASQAASDEGSILYRLLAPDVVAGPILIVFDTAELVLAQSDHVASSLVSWLGFLHNEAGARDLRLVIAGRDPPDDPDLG
HAANSLLSRLKDTGARIETPIGLPELDPAEAQQLLRNCGVDDPTAAAEAAAAVPGNPLLLRITADALLQGEAELRESVRR
AHRDSRIDADSARNYLLRRVVAHVRDPIARPYVLAATYSPVVTAKLLEEVVIPAVDRSEREPGPGIPANKKAATAKAKRV
FDALASTYWLTRQTLRSETVPFNREIRAFALKLLAATPEGALLERDVRQSAAIHHLRRRSADDRALALYHLAVLGHPYTV
PRDLASVQTVLRDVIEELPADLRNRLAPPVGSVAPGVARGVARSSIDDMSDSDWQRYLEGDERSKRAGEGAQMVKADRAR
EALDLYLARPTRPPGLPPTFVIQAQADLGEWDEGIADIDAILEREADDWLARKSIGPEALSRIYWITRLALQAYGRLSTP
HAMLLRNASETATGPGLSTLPALIAVAETMSGEPIMAGRMRSQALKTDAAGRTLLCPIHGHLPETIELFDAGIAVVQSDW
RQRMIQLAPARLIRANEAHLRDLQSRLDALHAKPIAQVNQLFNKMRTGIAVEPTTMAGLNSAVLLLRGLTPEFYRPLREA
LLALCENGVASPMMRHAVDPLFERMSIRPAEMEPATFYRRLSNNPNAWCTAFIVYADRCRLLPGLCESLARYADSPKSRR
IASSFLAWDKALCRGTSSDWGQPAKTRK
>P9WKB1 2.3.1.20~~~~~~Putative diacyglycerol O-acyltransferase Rv3087~~~COG1020
MRRLNGVDALMLYLDGGSAYNHTLKISVLDPSTDPDGWSWPKARQMFEERAHLLPVFRLRYLPTPLGLHHPIWVEDPEFD
LDAHVRRVVCPAPGGMAEFCALVEQIYAHPLDRDRPLWQTWVVEGLDGGRVALVTLLHHAYSDGVGVLDMLAAFYNDTPD
EAPVVAPPWEPPPLPSTRQRLGWALRDLPSRLGKIAPTVRAVRDRVRIEREFAKDGDRRVPPTFDRSAPPGPFQRGLSRS
RRFSCESFPLAEVREVSKTLGVTINDVFLACVAGAVRRYLERCGSPPTDAMVATMPLAVTPAAERAHPGNYSSVDYVWLR
ADIADPLERLHATHLAAEATKQHFAQTKDADVGAVVELLPERLISGLARANARTKGRFDTFKNVVVSNVPGPREPRYLGR
WRVDQWFSTGQISHGATLNMTVWSYCDQFNLCVMADAVAVRNTWELLGGFRASHEELLAAARAQATPKEMAT
>O05769 3.4.-.-~~~~~~Protease Rv3090~~~COG0330
MTWQIVFVVICVIVAGVAALFWRLPSDDTTRSRAKTVTIAAVAAAAVFFFLGCFTIVGTRQFAIMTTFGRPTGVSLNNGF
HGKWPWQMTHPMDGAVQIDKYVKEGNTDQRITVRLGNQSTALADVSIRWQLKQAAAPELFQQYKTFDNVRVNLIERNLSV
ALNEVFAGFNPLDPRNLDVSPLPSLAKRAADILRQDVGGQVDIFDVNVPTIQYDQSTEDKINQLNQQRAQTSIALEAQRT
AEAQAKANEILSRSISDDPNVVVQNCITAAINKGISPLGCWPGSSALPTIAVPGR
>P9WMG3 ~~~~~~Uncharacterized HTH-type transcriptional regulator Rv3095~~~COG1733
MAVSDLSHRFEGESVGRALELVGERWTLLILREAFFGVRRFGQLARNLGIPRPTLSSRLRMLVEVGLFDRVPYSSDPERH
EYRLTEAGRDLFAAIVVLMQWGDEYLPRPEGPPIKLRHHTCGEHADPRLICTHCGEEITARNVTPEPGPGFKAKLASS
>Q8CZ70 ~~~~~~UPF0371 protein spr0309~~~COG4868
MKKQAFSSEQYLNLQRDHILERINQFDGKLYLEFGGKMLEDFHAARVLPGYEPDNKIKLLQELKEQVEVVIAINASNIEH
SKARGDLGISYDQEVLRLIDKFNELGIFVGSVVITQYAGQPAADAFRNQLEKNGIDSYLHYPIKGYPTDMDHIISPEGMG
KNDYIKTSRNLIVVTAPGPGSGKLATCMSNMYHDQINGIKSGYAKFETFPIWNLPLHHPVNLAYEAATADLDDVNMIDPF
HLQTYGETTVNYNRDIEIFPVLKRMLERILGKSPYASPTDMGVNMVGFAITDDEAAVEASKQEIIRRYYQTVLDFKAEKV
GEAAVKKIELLMNDLGITPADRKVAVVARQKAEETGGPALAFELPNGEIVTGKNSELFGPTAAALINAIKKSADIAKEVK
LIEPEVVKPIQGLKIDHLGSRNPRLHSNEILIALAITATENPDAARAMEELGNLKGSEAHSTIILTDEDKNVLRKLGINV
TFDPYYQYDRLYRK
>P0CG96 ~~~sseC1~~~Uncharacterized protein Rv3118~~~
MCSGPKQGLTLPASVDLEKETVITGRVVDGDGQAVGGAFVRLLDSSDEFTAEVVASATGDFRFFAAPGSWTLRALSAAGN
GDAVVQPSGAGIHEVDVKIT
>P9WL08 ~~~~~~Uncharacterized protein MT3211~~~
MVIRFDQIGSLVLSMKSLASLSFQRCLRENSSLVAALDRLDAAVDELSALSFDALTTPERDRARRDRDHHPWSRSRSQLS
PRMAHGAVHQCQWPKAVWAVIDNP
>P9WL09 ~~~~~~Uncharacterized protein Rv3126c~~~
MVIRFDQIGSLVLSMKSLASLSFQRCLRENSSLVAALDRLDAAVDELSALSFDALTTPERDRARRDRDHHPWSRSRSQLS
PRMAHGAVHQCQWPKAVWAVIDNP
>P9WL06 ~~~~~~Uncharacterized protein MT3212~~~
MLKNAVLLACRAPSVHNSQPWRWVAESGSEHTTVHLFVNRHRTVPATDHSGRQAIISCGAVLDHLRIAMTAAHWQANITR
FPQPNQPDQLATVEFSPIDHVTAGQRNRAQAILQRRTDRLPFDSPMYWHLFEPALRDAVDKDVAMLDVVSDDQRTRLVVA
SQLSEVLRRDDPYYHAELEWWTSPFVLAHGVPPDTLASDAERLRVDLGRDFPVRSYQNRRAELADDRSKVLVLSTPSDTR
ADALRCGEVLSTILLECTMAGMATCTLTHLIESSDSRDIVRGLTRQRGEPQALIRVGIAPPLAAVPAPTPRRPLDSVLQI
RQTPEKGRNASDRNARETGWFSPP
>P9WL07 ~~~~~~Uncharacterized protein Rv3127~~~COG0778
MLKNAVLLACRAPSVHNSQPWRWVAESGSEHTTVHLFVNRHRTVPATDHSGRQAIISCGAVLDHLRIAMTAAHWQANITR
FPQPNQPDQLATVEFSPIDHVTAGQRNRAQAILQRRTDRLPFDSPMYWHLFEPALRDAVDKDVAMLDVVSDDQRTRLVVA
SQLSEVLRRDDPYYHAELEWWTSPFVLAHGVPPDTLASDAERLRVDLGRDFPVRSYQNRRAELADDRSKVLVLSTPSDTR
ADALRCGEVLSTILLECTMAGMATCTLTHLIESSDSRDIVRGLTRQRGEPQALIRVGIAPPLAAVPAPTPRRPLDSVLQI
RQTPEKGRNASDRNARETGWFSPP
>P9WL04 ~~~~~~Uncharacterized protein MT3215~~~
MVQGRTVLFRTAEGAKLFSAVAKCAVAFEADDHNVAEGWSVIVKVRAQVLTTDAGVREAERAQLLPWTATLKRHCVRVIP
WEITGRHFRFGPEPDRSQTFACEASSHNQR
>P9WL05 ~~~~~~Uncharacterized protein Rv3129~~~COG3467
MVQGRTVLFRTAEGAKLFSAVAKCAVAFEADDHNVAEGWSVIVKVRAQVLTTDAGVREAERAQLLPWTATLKRHCVRVIP
WEITGRHFRFGPEPDRSQTFACEASSHNQR
>P9WIZ6 ~~~~~~Putative NAD(P)H nitroreductase MT3217~~~
MTAAVDGKGPAAMNTHFPDAETVRTVLTLAVRAPSIHNTQPWRWRVCPTSLELFSRPDMQLRSTDPDGRELILSCGVALH
HCVVALASLGWQAKVNRFPDPKDRCHLATIGVQPLVPDQADVALAAAIPRRRTDRRAYSCWPVPGGDIALMAARAARGGV
MLRQVSALDRMKAIVAQAVLDHVTDEEYLRELTIWSGRYGSVAGVPARNEPPSDPSAPIPGRLFAGPGLSQPSDVLPADD
GAAILALGTETDDRLARLRAGEAASIVLLTATAMGLACCPITEPLEIAKTRDAVRAEVFGAGGYPQMLLRVGWAPINADP
LPPTPRRELSQVVEWPEELLRQRC
>P9WIZ7 ~~~~~~Putative NAD(P)H nitroreductase Rv3131~~~COG0778
MTAAVDGKGPAAMNTHFPDAETVRTVLTLAVRAPSIHNTQPWRWRVCPTSLELFSRPDMQLRSTDPDGRELILSCGVALH
HCVVALASLGWQAKVNRFPDPKDRCHLATIGVQPLVPDQADVALAAAIPRRRTDRRAYSCWPVPGGDIALMAARAARGGV
MLRQVSALDRMKAIVAQAVLDHVTDEEYLRELTIWSGRYGSVAGVPARNEPPSDPSAPIPGRLFAGPGLSQPSDVLPADD
GAAILALGTETDDRLARLRAGEAASIVLLTATAMGLACCPITEPLEIAKTRDAVRAEVFGAGGYPQMLLRVGWAPINADP
LPPTPRRELSQVVEWPEELLRQRC
>P9WFD2 ~~~~~~Universal stress protein MT3220~~~
MSDPRPARAVVVGIDGSRAATHAALWAVDEAVNRDIPLRLVYVIDPSQLSAAGEGGGQSAARAALHDASRKVEATGQPVK
IETEVLCGRPLTKLMQESRSAAMLCVGSVGLDHVRGRRGSVAATLAGSALCPVAVIHPSPAEPATTSQVSAVVAEVDNGV
VLRHAFEEARLRGVPLRAVAVHAAETPDDVEQGSRLAHVHLSRRLAHWTRLYPEVRVDRAIAGGSACRHLAANAKPGQLF
VADSHSAHELCGAYQPGCAVLTVRSANL
>P9WFD3 ~~~~~~Universal stress protein Rv3134c~~~COG0589
MSDPRPARAVVVGIDGSRAATHAALWAVDEAVNRDIPLRLVYVIDPSQLSAAGEGGGQSAARAALHDASRKVEATGQPVK
IETEVLCGRPLTKLMQESRSAAMLCVGSVGLDHVRGRRGSVAATLAGSALCPVAVIHPSPAEPATTSQVSAVVAEVDNGV
VLRHAFEEARLRGVPLRAVAVHAAETPDDVEQGSRLAHVHLSRRLAHWTRLYPEVRVDRAIAGGSACRHLAANAKPGQLF
VADSHSAHELCGAYQPGCAVLTVRSANL
>P9WL03 ~~~~~~Uncharacterized protein Rv0313~~~
MGDYGPFGFDPDEFDRVIREGSEGLRDAFERIGRFLSSSGAGTGWSAIFEDLSRRSRPAPETAGEAGDGVWAIYTVDADG
GARVEQVYATELDALRANKDNTDPKRKVRFLPYGIAVSVLDDPVDEAQ
>P9WGL7 ~~~~~~Uncharacterized response regulatory protein Rv3143~~~COG0784
MPDSSTALRILVYSDNVQTRERVMRALGKRLHPDLPDLTYVEVATGPMVIRQMDRGGIDLAILDGEATPTGGMGIAKQLK
DELASCPPILVLTGRPDDTWLASWSRAEAAVPHPVDPIVLGRTVLSLLRAPAH
>P44634 ~~~~~~Probable transcriptional regulatory protein HI_0315~~~COG0217
MAGHSKWANIKHRKAAQDAQRGKIFTKLIRELVTAAKIGGGDVSANPRLRAAVDKALSNNMTRDTINRAIDRGVGGGDDT
NMETKIYEGYGPGGTAVMVECLSDNANRTISQVRPSFTKCGGNLGTEGSVGYLFSKKGLILIAEADEDALTEAAIEAGAD
DIQPQDDGSFEIYTAWEDLGSVRDGIEAAGFKVQEAEVTMIPSTTVDLDIETAPKLLRLIDMLEDCDDVQNVYHNGEICD
EVASQL
>P9WI99 2.7.1.-~~~~~~Putative aminoglycoside phosphotransferase~~~COG3173
MANEPAIGAIDRLQRSSRDVTTLPAVISRWLSSVLPGGAAPEVTVESGVDSTGMSSETIILTARWQQDGRSIQQKLVARV
APAAEDVPVFPTYRLDHQFEVIRLVGELTDVPVPRVRWIETTGDVLGTPFFLMDYVEGVVPPDVMPYTFGDNWFADAPAE
RQRQLQDATVAALATLHSIPNAQNTFSFLTQGRTSDTTLHRHFNWVRSWYDFAVEGIGRSPLLERTFEWLQSHWPDDAAA
REPVLLWGDARVGNVLYRDFQPVAVLDWEMVALGPRELDVAWMIFAHRVFQELAGLATLPGLPEVMREDDVRATYQALTG
VELGDLHWFYVYSGVMWACVFMRTGARRVHFGEIEKPDDVESLFYHAGLMKHLLGEEH
>O53334 ~~~~~~Probable mycobacterial cidal antitoxin Rv3188~~~
MAVTLDRAVEASEIVDALKPFGVTQVDVAAVIQVSDRAVRGWRTGDIRPERYDRLAQLRDLVLLLSDSLTPRGVGQWLHA
KNRLLDGQRPVDLLAKDRYEDVRSAAESFIDGAYV
>O53335 2.4.2.-~~~~~~Probable NAD(+) phosphorylase Rv3189~~~
MKLADAIATAPRRTLKGTYWHQGPTRHPVTSCADPARGPGRYHRTGEPGVWYASNKEQGAWAELFRHFVDDGVDPFEVRR
RVGRVAVTLQVLDLTDERTRSHLGVDETDLLSDDYTTTQAIAAARDANFDAVLAPAAALPGCQTLAVFVHALPNIEPERS
EVRQPPPRLANLLPLIRPHEHMPDSVRRLLATLTRAGAEAIRRRRR
>P9WFL3 ~~~~~~UPF0182 protein Rv3193c~~~COG1615
MGMRSAARMPKLTRRSRILIMIALGVIVLLLAGPRLIDAYVDWLWFGELGYRSVFTTMLATRIVVCLVAGVVVGGIVFGG
LALAYRTRPVFVPDADNDPVARYRAVVLARLRLVGIGIPAAIGLLAGIVAQSYWARIQLFLHGGDFGVRDPQFGRDLGFY
AFELPFYRLMLSYMLVSVFLAFVANLVAHYIFGGIRLSGRTGALSRSARVQLVSLVGVLVLLKAVAYWLDRYELLSHTRG
GKPFTGAGYTDINAVLPAKLILMAIALICAAAVFSAIALRDLRIPAIGLVLLLLSSLIVGAGWPLIVEQISVKPNAAQKE
SEYISRSITATRQAYGLTSDVVTYRNYSGDSPATAQQVAADRATTSNIRLLDPTIVSPAFTQFQQGKNFYYFPDQLSIDR
YLDRNGNLRDYVVAARELNPDRLIDNQRDWINRHTVYTHGNGFIASPANTVRGIANDPNQNGGYPEFLVNVVGANGTVVS
DGPAPLDQPRIYFGPVISNTSADYAIVGRNGDDREYDYETNIDTKRYTYTGSGGVPLGGWLARSVFAAKFAERNFLFSNV
IGSNSKILFNRDPAQRVEAVAPWLTTDSAVYPAIVNKRLVWIVDGYTTLDNYPYSELTSLSSATADSNEVAFNRLVPDKK
VSYIRNSVKATVDAYDGTVTLYQQDEKDPVLKAWMQVFPGTVKPKSDIAPELAEHLRYPEDLFKVQRMLLAKYHVNDPVT
FFSTSDFWDVPLDPNPTASSYQPPYYIVAKNIAKDDNSASYQLISAMNRFKRDYLAAYISASSDPATYGNLTVLTIPGQV
NGPKLANNAITTDPAVSQDLGVIGRDNQNRIRWGNLLTLPVARGGLLYVEPVYASPGASDAASSYPRLIRVAMMYNDKVG
YGPTVRDALTGLFGPGAGATATGIAPTEAAVPPSPAANPPPPASGPQPPPVTAAPPVPVGAVTLSPAKVAALQEIQAAIG
AARDAQKKGDFAAYGSALQRLDEAITKFNDAG
>P9WN17 1.-.-.-~~~~~~Putative glutaredoxin Rv3198.1~~~COG0695
MITAALTIYTTSWCGYCLRLKTALTANRIAYDEVDIEHNRAAAEFVGSVNGGNRTVPTVKFADGSTLTNPSADEVKAKLV
KIAG
>P0DMM3 ~~~~~~Uncharacterized protein Rv3202A~~~
MATMAAVVGGGPQDEIPEADAVEQGRAVDFDDEAGLDTAYLSGGAGDRDASEADVVDQAFVVPVADDEEIDR
>Q6NCZ4 ~~~~~~UPF0102 protein RPA0323~~~COG0792
MAKTDRSQPSVLARIAAFRTGLSAEASAADYLERQGYRILARRFKTRCGEIDLVAQRDALVAFVEVKARGNVDDAAYAVT
PRQQSRIVAAAEAWLSRHPEHAMSELRFDAILIAPNTAPRHLPGAFDATP
>O08446 ~~~~~~HTH-type transcriptional regulator Rv0324~~~COG0607
MAGQSDRKAALLDQVARVGKALANGRRLQILDLLAQGERAVEAIATATGMNLTTASANLQALKSGGLVEARREGTRQYYR
IAGEDVARLFALVQVVADEHLADVAVAAADVLGSPEDAITRAELLRRREAGEVTLVDVRPHEEYQAGHIPGAINIPIAEL
ADRLAELTGDRDIVAYCRGAYCVMAPDAVRIARDAGREVKRLDDGMLEWRLAGLPVDEGAPVGHGD
>O05897 ~~~~~~Protein Rv3254~~~COG0654
MTGRVGNPKDHAVVIGASIAGLCAARVLSDFYSTVTVFERDELPEAPANRATVPQDRHLHMLMARGAQEFDSLFPGLLHD
MVAAGVPMLENRPDCIYLGAAGHVLGTGHTLRKEFTAYVPSRPHLEWQLRRRVLQLSNVQIVRRLVTEPQFERRQQRVVG
VLLDSPGSGQDREREEFIAADLVVDAAGRGTRLPVWLTQWGYRRPAEDTVDIGISYASHQFRIPDGLIAEKVVVAGASHD
QSLGLGMLCYEDGTWVLTTFGVADAKPPPTFDEMRALADKLLPARFTAALAQAQPIGCPAFHAFPASRWRRYDKLERFPR
GIVPFGDAVASFNPTFGQGMTMTSLQAGHLRRALKARNSAMKGDLAAELNRATAKTTYPVWMMNAIGDISFHHATAEPLP
RWWRPAGSLFDQFLGAAETDPVLAEWFLRRFSLLDSLYMVPSVPIIGRAIAHNLRLWLKEQRERRQPVTTRRSP
>P96873 ~~~~~~TIGR03089 family protein~~~COG0318
MTTPTTLSGAILDPMLRADPVGPRITYYDDATGERIELSAVTLANWAAKTGNLLRDELAAGPASRVAILLPAHWQTAAVL
FGVWWIGAQAILDDSPADVALCTADRLAEADAVVNSAAVAGEVAVLSLDPFGRPATGLPVGVTDYATAVRVHGDQIVPEH
NPGPVLAGRSVEQILRDCAASAAARGLTAADRVLSTASWAGPDELVDGLLAILAAGASLVQVANPDPAMLQRRIATEKVT
RVL
>P9WGC3 ~~~~~~Uncharacterized SufE-like protein Rv3284~~~COG2166
MTAPASLPAPLAEVVSDFAEVQGQDKLRLLLEFANELPALPSHLAESAMEPVPECQSPLFLHVDASDPNRVRLHFSAPAE
APTTRGFASILAAGLDEQPAADILAVPEDFYTELGLAALISPLRLRGMSAMLARIKRRLREAD
>O66665 ~~~~~~Uncharacterized protein aq_328~~~
MQEKYNFGKVSSQHKNYSKIETMLRPKGFDKLDHYFRTELDIDLTDETIELLLNSVKAAFGKLFYGAEQRARWNGRDFIA
LADLNITKALEEHIKNFQKIEQDMGVDELLEYIAFIPPVEMNVGEDLKSEYRNIMGGLLLMHADVIKKATGERKPSREAM
EFVAQIVDKVF
>Q55535 3.1.3.48~~~~~~Putative low molecular weight protein-tyrosine-phosphatase slr0328~~~COG0394
MKLLFVCLGNICRSPAAENIMNAQIDQAGLGAKIVCDSAGTSSYHVGDSPDRRMTESLKKRGYRVQGRARQFFPEDFAEF
DLILAMDGDNYRNILAQDPAGQYHHKVKMICDYTEKFGDREVPDPYYGGQAGFEHVIDLLEDACGNLLTSLGKELVN
>P75455 ~~~~~~Uncharacterized protein MG237 homolog~~~
MINKPNQFVNHLSALKKHFASYKELREAFNDYHKHNGDELTTFFLHQFDKVMELVKQKDFKTAQSRCEEELAAPYLPKPL
VSFFQSLLQLVNHDLLEQQNAALASLPAAKIIELVLQDYPNKLNMIHYLLPKTKAFVKPHLLQRLQFVLTDSELLELKRF
SFFQALNQIPGFQGEQVEYFNSKLKQKFTLTLGEFEIAQQPDAKAYFEQLITQIQQLFLKEPVNAEFANEIIDAFLVSYF
PLHPPVPLAQLAAKIYEYVSQIVLNEAVNLKDELIKLIVHTLYEQLDRPVGDEN
>Q02MN4 ~~~~~~Uncharacterized protein PA14_33160~~~
MIVRTLLIAAALLGGTAQAAESTDLCGANLQKLDDILAVRGKTSVTGARVTEVKELQAKARQDQASGDTKGCITATTQAL
QILQNAGKK
>Q9A385 ~~~~~~UPF0335 protein CC_3319~~~COG3750
MADDAIPHTDVLNSTAQGQLKSIIERVERLEVEKAEIMEQIKEVYAEAKGNGFDVKVLKKVVRIRKQDRAKRQEEDAILD
LYLSAIGEI
>Q97DZ9 ~~~~~~UPF0311 protein CA_C3321~~~
MDITNIKEMNYEEVFSITITVDKPILIGQDDIVGRRQLIPIISGKVSGNNFNGKVLPGGIDSQIVRPDGKCELSARYAIR
LDDGAAIYIENNGIRTVPDEYIEAVKSGEFVDPNAYYFRTIPTFETYSPKYKWMMNHIFVCCASRLPENVLLKFYKIS
>Q9K7N6 ~~~~~~UPF0309 protein BH3325~~~COG4821
MTSSFTDYCKFFNRILSEVQETQEQAIIKGAHLVSEAVMNGGRFYVFGSGHSHMIAEEIYNRAGGLALVTAILPPELMLH
ERPNKSTYLERIEGLSKSYLKLHQVTNKDVIMIISNSGRNTVPVEMAIESRNIGAKVIAMTSMKHSQKVTSRHKSGKKLY
EYADVVLDNGAPVGDAGFQIANSEIYSGATSDSIGCFLAQALIVETLHLLVQQGFEPPVFKSSNVDGADLYNDKIFNEYV
KW
>O53379 2.6.1.-~~~~~~Probable aminotransferase Rv3329~~~COG0161
MPRIRKPPSRSLPEPSAALANTTTRNLWLHFARHGAGIQHPVIVRGDGVTIFDDRGKSYLDALSGLFVVQVGYGRAELAE
AAARQAGTLGYFPLWGYATPPAIELAERLARYAPGDLNRVFFTSGGTEAVETAWKVAKQYFKLTGKPGKQKVISRSIAYH
GTTQGALAITGLPLFKAPFEPLTPGGFRVPNTNFYRAPLHTDLKEFGRWAADRIAEAIEFEGPDTVAAVFLEPVQNAGGC
IPAPPGYFERVREICDRYDVLLVSDEVICAFGRIGSMFACEDLGYVPDMITCAKGLTSGYSPLGAMIASDRLFEPFNDGE
TMFAHGYTFGGHPVSAAVGLANLDIFEREGLSDHVKRNSPALRATLEKLYDLPIVGDIRGEGYFFGIELVKDQATKQTFT
DDERARLLGQVSAALFEAGLYCRTDDRGDPVVQVAPPLISGQPEFDTIETILRSVLTDTGRKYLHL
>Q9HYR3 ~~~~~~Uncharacterized PhzA/B-like protein PA3332~~~
MNAKEILVHSLRLLENGDARGWCDLFHPEGVLEFPYAPPGWKTRFEGRETIWAHMRLFPEHLTVRFTDVQFYETADPDLA
IGEFHGDGVATVSGGKLAQDYISVLRTRDGQILLYRDFWNPLRHLEALGGVEAAAKIVQGA
>P9WK01 2.1.1.-~~~~~~Uncharacterized methyltransferase Rv3342~~~COG2226
MTCSRRDMSLSFGSAVGAYERGRPSYPPEAIDWLLPAAARRVLDLGAGTGKLTTRLVERGLDVVAVDPIPEMLDVLRAAL
PQTVALLGTAEEIPLDDNSVDAVLVAQAWHWVDPARAIPEVARVLRPGGRLGLVWNTRDERLGWVRELGEIIGRDGDPVR
DRVTLPEPFTTVQRHQVEWTNYLTPQALIDLVASRSYCITSPAQVRTKTLDRVRQLLATHPALANSNGLALPYVTVCVRA
TLA
>Q7NSS5 ~~~~~~UPF0434 protein CV_3345~~~COG2835
MDAKFLEILVCPLCKGPLVFDKSKDELICKGDRLAFPIKDGIPMMLESEARELAPEEEVK
>Q8EBZ9 ~~~~~~UPF0301 protein SO_3346~~~COG1678
MESLQNHFLIAMPSLDDTFFERTVIYLCEHDEKGAMGLVINKPLGIEVNSLLEQMDLPTEQVSADLAMGSQVLMGGPVSQ
DRGFVLHTSQPYWANSTELGSGLMLTTSRDVLTAIGSKRSPDKFLVALGYAGWSKNQLEQELADNSWLTIPADHALLFDI
NHEDRWQQASRSLGFEAWQLSTQAGHA
>A0A0H3CEP9 ~~~~~~UPF0276 protein CCNA_03364~~~
MTPSAGLGLKSQHYGDAIACDAEGLWFEVHPENYMSAGGPRLAALEAVRARRPVSLHGVGLSLAADTDPDPEHLQALKRL
VDRFDPFVVSEHLAWSTHRGAHHPDLLPFPRTRAALDRICGNVARMQDALQRRVLIENPSLYLPLKGHALDEVDFLEALA
TRTGCGLLVDVNNVFVSAQNLGYAPETYLDALPAHAIGEIHLAGHAPDPGGSNLLIDTHGAPVAEVVWTLYARLIARIGP
RPTLIERDDDIPDFAALMAERNRAVAVLASGQTAREPAHV
>P9WKA9 2.3.1.20~~~~~~Putative diacyglycerol O-acyltransferase Rv3371~~~COG1020
MAQLTALDAGFLKSRDPERHPGLAIGAVAVVNGAAPSYDQLKTVLTERIKSIPRCTQVLATEWIDYPGFDLTQHVRRVAL
PRPGDEAELFRAIALALERPLDPDRPLWECWIIEGLNGNRWAILIKIHHCMAGAMSAAHLLARLCDDADGSAFANNVDIK
QIPPYGDARSWAETLWRMSVSIAGAVCTAAARAVSWPAVTSPAGPVTTRRRYQAVRVPRDAVDAVCHKFGVTANDVALAA
ITEGFRTVLLHRGQQPRADSLRTLEKTDGSSAMLPYLPVEYDDPVRRLRTVHNRSQQSGRRQPDSLSDYTPLMLCAKMIH
ALARLPQQGIVTLATSAPRPRHQLRLMGQKMDQVLPIPPTALQLSTGIAVLSYGDELVFGITADYDAASEMQQLVNGIEL
GVARLVALSDDSVLLFTKDRRKRSSRALPSAARRGRPSVPTARARH
>P9WMS5 3.1.-.-~~~~~~Phosphatase Rv3376~~~COG1011
MSISAVVFDRDGVLTSFDWTRAEEDVRRITGLPLEEIERRWGGWLNGLTIDDAFVETQPISEFLSSLARELELGSKARDE
LVRLDYMAFAQGYPDARPALEEARRRGLKVGVLTNNSLLVSARSLLQCAALHDLVDVVLSSQMIGAAKPDPRAYQAIAEA
LGVSTTSCLFFDDIADWVEGARCAGMRAYLVDRSGQTRDGVVRDLSSLGAILDGAGP
>Q97DU2 3.1.3.-~~~~~~Nucleotidase CA_C3379~~~COG5663
MEQLNICIDIDGTITDAYYWIDLCNSYFKTSITEKDATQYYIHKILNVPLEEYNEFYEKYKYKLHSEQKLRKDVKSVITK
LSQNNNIFFVTARERDLTILTYSYLRKKEIPYDSLFILGTHHKVPTARQLNCDLFIEDNYDNALELSKAGFKVLLIDTYY
NRKPLNQNIIRFYNWDEVYGIVDRLFEKSEAI
>P9WKZ9 ~~~~~~Uncharacterized protein Rv3395c~~~COG4544
MTAAFASDQRLENGAEQLESLRRQMALLSEKVSGGPSRSGDLVPAGPVSLPPGTVGVLSGARSLLLSMVASVTAAGGNAA
IVGQPDIGLLAAVEMGADLSRLAVIPDPGTDPVEVAAVLIDGMDLVVLGLGGRRVTRARARAVVARARQKGCTLLVTDGD
WQGVSTRLAARVCGYEITPALRGVPTPGLGRISGVRLQINGRGR
>P9WFH1 2.1.1.-~~~~~~Putative S-adenosyl-L-methionine-dependent methyltransferase Rv3399~~~COG3315
MARPMGKLPSNTRKCAQCAMAEALLEIAGQTINQKDLGRSGRMTRTDNDTWDLASSVGATATMIATARALASRAENPLIN
DPFAEPLVRAVGIDLFTRLASGELRLEDIGDHATGGRWMIDNIAIRTKFYDDFFGDATTAGIRQVVILAAGLDTRAYRLP
WPPGTVVYEIDQPAVIKFKTRALANLNAEPNAERHAVAVDLRNDWPTALKNAGFDPARPTAFSAEGLLSYLPPQGQDRLL
DAITALSAPDSRLATQSPLVLDLAEEDEKKMRMKSAAEAWRERGFDLDLTELIYFDQRNDVADYLAGSGWQVTTSTGKEL
FAAQGLPPFADDHITRFADRRYISAVLK
>P9WKZ7 ~~~~~~Uncharacterized protein Rv3400~~~COG0637
MANWYRPNYPEVRSRVLGLPEKVRACLFDLDGVLTDTASLHTKAWKAMFDAYLAERAERTGEKFVPFDPAADYHTYVDGK
KREDGVRSFLSSRAIEIPDGSPDDPGAAETVYGLGNRKNDMLHKLLRDDGAQVFDGSRRYLEAVTAAGLGVAVVSSSANT
RDVLATTGLDRFVQQRVDGVTLREEHIAGKPAPDSFLRAAELLGVTPDAAAVFEDALSGVAAGRAGNFAVVVGINRTGRA
AQAAQLRRHGADVVVTDLAELL
>P9WN13 3.2.1.-~~~~~~Uncharacterized glycosyl hydrolase Rv3401~~~COG1554
MITEDAFPVEPWQVRETKLNLNLLAQSESLFALSNGHIGLRGNLDEGEPFGLPGTYLNSFYEIRPLPYAEAGYGYPEAGQ
TVVDVTNGKIFRLLVGDEPFDVRYGELISHERILDLRAGTLTRRAHWRSPAGKQVKVTSTRLVSLAHRSVAAIEYVVEAI
EEFVRVTVQSELVTNEDVPETSADPRVSAILDRPLQAVEHERTERGALLMHRTRASALMMAAGMEHEVEVPGRVEITTDA
RPDLARTTVICGLRPGQKLRIVKYLAYGWSSLRSRPALRDQAAGALHGARYSGWQGLLDAQRAYLDDFWDSADVEVEGDP
ECQQAVRFGLFHLLQASARAERRAIPSKGLTGTGYDGHAFWDTEGFVLPVLTYTAPHAVADALRWRASTLDLAKERAAEL
GLEGAAFPWRTIRGQESSAYWPAGTAAWHINADIAMAFERYRIVTGDGSLEEECGLAVLIETARLWLSLGHHDRHGVWHL
DGVTGPDEYTAVVRDNVFTNLMAAHNLHTAADACLRHPEAAEAMGVTTEEMAAWRDAADAANIPYDEELGVHQQCEGFTT
LAEWDFEANTTYPLLLHEAYVRLYPAQVIKQADLVLAMQWQSHAFTPEQKARNVDYYERRMVRDSSLSACTQAVMCAEVG
HLELAHDYAYEAALIDLRDLHRNTRDGLHMASLAGAWTALVVGFGGLRDDEGILSIDPQLPDGISRLRFRLRWRGFRLIV
DANHTDVTFILGDGPGTQLTMRHAGQDLTLHTDTPSTIAVRTRKPLLPPPPQPPGREPVHRRALAR
>P9WGJ7 ~~~~~~Protein Rv3402c~~~COG0399
MKIRTLSGSVLEPPSAVRATPGTSMLKLEPGGSTIPKIPFIRPSFPGPAELAEDFVQIAQANWYTNFGPNERRFARALRD
YLGPHLHVATLANGTLALLAALHVSFGAGTRDRYLLMPSFTFVGVAQAALWTGYRPWFIDIDANTWQPCVHSARAVIERF
RDRIAGILLANVFGVGNPQISVWEELAAEWELPIVLDSAAGFGSTYADGERLGGRGACEIFSFHATKPFAVGEGGALVSR
DPRLVEHAYKFQNFGLVQTRESIQLGMNGKLSEISAAIGLRQLVGLDRRLASRRKVLECYRTGMADAGVRFQDNANVASL
CFASACCTSADHKAAVLGSLRRHAIEARDYYNPPQHRHPYFVTNAELVESTDLAVTADICSRIVSLPVHDHMAPDDVARV
VAAVQEAEVRGE
>P9WKZ5 ~~~~~~Uncharacterized protein Rv3403c~~~COG4529
MLAFPYLMTMITPPTFDVAFIGSGAACSMTLLEMADALLSSPSASPKLRIAVVERDEQFWCGIPYGQRSSIGSLAIQKLD
DFADEPEKAAYRIWLEQNKQRWLAFFQAEGGAAAARWICDNRDALDGNQWGELYLPRFLFGVFLSEQMIAAIAALGERDL
AEIVTIRAEAMSAHSADGHYRIGLRPSGNGPTAIAAGKVVVAIGSPPTKAILASDSEPAFTYINDFYSPGGESNVARLRD
SLDRVESWEKRNVLVVGSNATSLEALYLMRHDARIRARVRSITVISRSGVLPYMICNQPPEFDFPRLRTLLCTEAIAAAD
LMSAIRDDLATAEERSLNLADLYDAVAALFGQALHKMDLVQQEEFFCVHGMNFTKLVRRAGRDCRQASEELAADGTLSLL
AGEVLRVDACASGQPFATMTYRAAGAEHTHPVPFAAVVNCGGFEELDTCSSPFLVSAMQNGLCRPNRTNRGLLVNDDFEA
SPGFCVIGPLVGGNFTPKIRFWHVESAPRVRSLAKSLAASLLASLQPVALAPC
>P9WKI5 1.-.-.-~~~guaB3~~~Uncharacterized oxidoreductase Rv3410c~~~COG0516
MVEIGMGRTARRTYELSEISIVPSRRTRSSKDVSTAWQLDAYRFEIPVVAHPTDALVSPEFAIELGRLGGLGVLNGEGLI
GRHLDVEAKIAQLLEAAAADPEPSTAIRLLQELHAAPLNPDLLGAAVARIREAGVTTAVRVSPQNAQWLTPVLVAAGIDL
LVIQGTIVSAERVASDGEPLNLKTFISELDIPVVAGGVLDHRTALHLMRTGAAGVIVGYGSTQGVTTTDEVLGISVPMAT
AIADAAAARRDYLDETGGRYVHVLADGDIHTSGELAKAIACGADAVVLGTPLAESAEALGEGWFWPAAAAHPSLPRGALL
QIAVGERPPLARVLGGPSDDPFGGLNLVGGLRRSMAKAGYCDLKEFQKVGLTVGG
>P9WKY9 ~~~~~~Uncharacterized protein Rv3412~~~
MRDHLPPGLPPDPFADDPCDPSAALEAVEPGQPLDQQERMAVEADLADLAVYEALLAHKGIRGLVVCCDECQQDHYHDWD
MLRSNLLQLLIDGTVRPHEPAYDPEPDSYVTWDYCRGYADASLNEAAPDADRFRRR
>P44649 ~~~~~~Uncharacterized protein HI_0341~~~COG3171
MSKSYNQRQRKKLHLAEFQELGFLVNFQFAEGTAIETVDETVDRFINEVIQPNGLAYEGSGYLHWEGLVCLEKIGKCDES
HRETVKKWLETNGLQQIEVSELFDIWWEYPTKVE
>P9WKY7 ~~~~~~Uncharacterized protein Rv3421c~~~COG1214
MSRVQISTVLAIDTATPAVTAGIVRRHDLVVLGERVTVDARAHAERLTPNVLAALADAALTMADLDAVVVGCGPGPFTGL
RAGMASAAAYGHALGIPVYGVCSLDAIGGQTIGDTLVVTDARRREVYWARYCDGIRTVGPAVNAAADVDPGPALAVAGAP
EHAALFALPCVEPSRPSPAGLVAAVNWADKPAPLVPLYLRRPDAKPLAVCT
>B8H4R9 ~~~~~~UPF0335 protein CCNA_03428~~~
MADDAIPHTDVLNSTAQGQLKSIIERVERLEVEKAEIMEQIKEVYAEAKGNGFDVKVLKKVVRIRKQDRAKRQEEDAILD
LYLSAIGEI
>Q9HYH1 ~~~~~~Uncharacterized protein PA3435~~~
MKVAILSGSVYGTAEEVARHAQKLLSAAGLEASHLPRASLDELKAFAPEAFLVVTSTTGMGELPDNLQPLYYAIRDQLPA
WHGLPGGVIGLGDSSYGDTFCGGGEQVRELFGELGVREVLPMLRLDASETVTPETDAEPWLAEFAAALKG
>Q9HYE3 ~~~~~~UPF0270 protein PA3463~~~
MLIPHDLLEADTLNNLLEDFVTREGTDNGDETPLDVRVERARHALRRGEAVILFDPESQQCQLMLRSEVPAELLRD
>O06342 ~~~~~~Uncharacterized membrane protein Rv3479~~~COG1752
MFPAAVGVLWQSGLRDPTPPGGPHGIEGLSLAFEKPSPVTALTQELRFATTMTGGVSLAIWMAGVTREINLLAQASQWRR
LGGTFPTNSQLTNESAASLRLYAQLIDLLDMVVDVDILSGTSAGGINAALLASSRVTGSDLGGIRDLWLDLGALTELLRD
PRDKKTPSLLYGDERIFAALAKRLPKLATGPFPPTTFPEAARTPSTTLYITTTLLAGETSRFTDSFGTLVQDVDLRGLFT
FTETDLARPDTAPALALAARSSASFPLAFEPSFLPFTKGTAKKGEVPARPAMAPFTSLTRPHWVSDGGLLDNRPIGVLFK
RIFDRPARRPVRRVLLFVVPSSGPAPDPMHEPPPDNVDEPLGLIDGLLKGLAAVTTQSIAADLRAIRAHQDCMEARTDAK
LRLAELAATLRNGTRLLTPSLLTDYRTREATKQAQTLTSALLRRLSTCPPESGPATESLPKSWSAELTVGGDADKVCRQQ
ITATILLSWSQPTAQPLPQSPAELARFGQPAYDLAKGCALTVIRAAFQLARSDADIAALAEVTEAIHRAWRPTASSDLSV
LVRTMCSRPAIRQGSLENAADQLAADYLQQSTVPGDAWERLGAALVNAYPTLTQLAASASADSGAPTDSLLARDHVAAGQ
LETYLSYLGTYPGRADDSRDAPTMAWKLFDLATTQRAMLPADAEIEQGLELVQVSADTRSLLAPDWQTAQQKLTGMRLHH
FGAFYKRSWRANDWMWGRLDGAGWLVHVLLDPRRVRWIVGERADTNGPQSGAQWFLGKLKELGAPDFPSPGYPLPAVGGG
PAQHLTEDMLLDELGFLDDPAKPLPASIPWTALWLSQAWQQRVLEEELDGLANTVLDPQPGKLPDWSPTSSRTWATKVLA
AHPGDAKYALLNENPIAGETFASDKGSPLMAHTVAKAAATAAGAAGSVRQLPSVLKPPLITLRTLTLSGYRVVSLTKGIA
RSTIIAGALLLVLGVAAAIQSVTVFGVTGLIAAGTGGLLVVLGTWQVSGRLLFALLSFSVVGAVLALATPVVREWLFGTQ
QQPGWVGTHAYWLGAQWWHPLVVVGLIALVAIMIAAATPGRR
>O25114 5.4.99.-~~~~~~Uncharacterized RNA pseudouridine synthase HP_0347~~~COG0564
MPFVEEEFEILKPTKALFFVRDVLKCSLKEAQRHLDKQRLKQNQQAVRKSQIIQGVVRLIHFKPNEKTQALVFETKDFGV
FDKPHQVYTHPKGYFYHESLLDCIQSHFGKNAHPAHRLDYETSGLVLAGKTLQSVKDLKALFMQKKVKKTYLALAHGLVE
KSMKIDKPILTPQNIQKDLHIRSKISPLGKQSITLVEPLSYNPFLDISLLKITPLTGRTHQIRLHLSSVDHRIVGEGLYG
VADENAREYLQLKRENNAPLLMLHAASLEFEFKGAIYKIASPMPERFMPFLKDLSFFY
>P9WKA7 2.3.1.20~~~~~~Putative diacyglycerol O-acyltransferase Rv3480c~~~COG1020
MSQTARRLGPQDMFFLYSESSTTMMHVGALMPFTPPSGAPPDLLRQLVDESKASEVVEPWSLRLSHPELLYHPTQSWVVD
DNFDLDYHVRRSALASPGDERELGIPVSRLHSHALDLRRPPWEVHFIEGLEGGRFAIYIKMHHSLIDGYTGQKMLARSLS
TDPHDTTHPLFFNIPTPGRSPADTQDSVGGGLIAGAGNVLDGLGDVVRGLGGLVSGVGSVLGSVAGAGRSTFELTKALVN
AQLRSDHEYRNLVGSVQAPHCILNTRISRNRRFATQQYPLDRLKAIGAQYDATINDVALAIIGGGLRRFLDELGELPNKS
LIVVLPVNVRPKDDEGGGNAVATILATLGTDVADPVQRLAAVTASTRAAKAQLRSMDKDAILAYSAALMAPYGVQLASTL
SGVKPPWPYTFNLCVSNVPGPEDVLYLRGSRMEASYPVSLVAHSQALNVTLQSYAGTLNFGFIGCRDTLPHLQRLAVYTG
EALDQLAAADGAAGLGS
>O06349 ~~~~~~Uncharacterized membrane protein Rv3486~~~COG2259
MAITGSAAPSWPRLLHAEGPPSVICIRLLVGLVFLSEGIQKFMYPDQLGPGRFERIGIPAATFFADLDGVVEIVCGTLVL
LGLLTRVAAVPLLIDMVGAIVLTKLRALQPGGFLGVEGFWGMAHAARTDLSMLLGLIFLLWSGPGRWSLDRRLSKRATAC
GAR
>P75429 3.1.-.-~~~~~~Putative phosphatase/phosphodiesterase MPN_349~~~
MMNSIKFIFLGDVYGKAGRNIIKNNLAQLKSKYQADLVIVNAENTTHGKGLSLKHYEFLKEAGVNYITMGNHTWFQKLDL
AVVINKKDLVRPLNLDTSFAFHNLGQGSLVFEFNKAKIRITNLLGTSVPLPFKTTNPFKVLKELILKRDCDLHIVDFHAE
TTSEKNAFCMAFDGYVTTIFGTHTHVPSADLRITPKGSAYITDVGMCGPGFGSVIGANPEQSIRLFCAGSREHFEVSKCG
AQLNGVFFEVDVNTKKVIKTEAIRIVEDDPRYLKQDYFNLI
>O53565 1.-.-.-~~~~~~Putative coenzyme F420-dependent oxidoreductase Rv3520c~~~COG2141
MEAGMKLGLQLGYWGAQPPQNHAELVAAAEDAGFDTVFTAEAWGSDAYTPLAWWGSSTQRVRLGTSVIQLSARTPTACAM
AALTLDHLSGGRHILGLGVSGPQVVEGWYGQRFPKPLARTREYIDIVRQVWARESPVTSAGPHYRLPLTGEGTTGLGKAL
KPITHPLRADIPIMLGAEGPKNVALAAEICDGWLPIFYSPRMAGMYNEWLDEGFARPGARRSREDFEICATAQVVITDDR
AAAFAGIKPFLALYMGGMGAEETNFHADVYRRMGYTQVVDEVTKLFRSGRKDEAAEIIPDELVDDAVIVGDIDHVRKQMA
VWEAAGVTMMVVTAGSAEQVRDLAALV
>Q6FF54 ~~~~~~UPF0301 protein ACIAD0353~~~COG1678
MTKQYLTHRCLIAPPEMADDFFANTVIYLARHDDEGAQGLIINRPSGIQVRELLNDLDIEADHVQPHEVLQGGPLRPEAG
FVLHTGQPVWHSSIAVGENLCITTSKDILDAIAHNEGVGRYQIALGYASWTKNQLEGEISRGDWLICDADMDLIFNLPYD
ERWDAAYKKLGVDRIWLSSEIGHA
>P47596 ~~~~~~Uncharacterized protein MG354~~~
MEQNNIKEQLISFFNQACSTHQERLDFICSTRESDTFSSVDVPLEPIKNIIEITKDENQQIEITKIAVNNIKTLSSVGAT
GQYMASFFSTNSEPAIIFCVIYFLYHFGFLKDNNKKQIIKKAYETIADNIADYLNEN
>P9WFY5 2.1.1.-~~~~~~Uncharacterized tRNA/rRNA methyltransferase Rv3579c~~~COG0566
MPGNSRRRGAVRKSGTKKGAGVGSGGQRRRGLEGRGPTPPAHLRPHHPAAKRARAQPRRPVKRADETETVLGRNPVLECL
RAGVPATALYVALGTEADERLTECVARAADSGIAIVELLRADLDRMTANHLHQGIALQVPPYNYAHPDDLLAAALDQPPA
LLVALDNLSDPRNLGAIVRSVAAFGGHGVLIPQRRSASVTAVAWRTSAGAAARIPVARATNLTRTLKGWADRGVRVIGLD
AGGGTALDDVDGTDSLVVVVGSEGKGLSRLVRQNCDEVVSIPMAAQAESLNASVAAGVVLAEIARQRRRPREPREQTQNR
MI
>P44658 ~~~~~~Putative thiamine biosynthesis protein HI_0357~~~COG0715
MKTIIRYFSFVMGLMLTLPSFAKEKISIMLDWYVNPDHAAIIVAQQKGFFEKNNLEVEIIEPADPALPPKLAAAEKVDLA
VSYQPQLYQQVAEGLPLVRVGSLISNPLNSVVVLKKSNLKSLADLKGKKVGYSVSGFEDGLLDTMLHSIGLSNKDVELVN
VNWSLSPSLLTGQVDAVIGAFRNFELNQLALEKQEGIAFFPEQYGVPAYDELILVANKNSVTDKKTSAFLTALEQATSYL
QAHPNEAWQAFVSYKPNELNTPLNQLAWKDTLPFLANKPRQLDAKRYQQMAEFMQQKGLIPKALALKEYAVEIE
>Q02M20 ~~~~~~Uncharacterized protein PA14_35840~~~
MTLLETTNEGAAEPVRLTTPLGQAIGNLFARALPLLDGNPPGSLKVFVFGGCAVHLLTHARGSADIDAEIEAARVLRKDE
IMAVYTPPEGYEDGDGRDLQVYLDQNYTNALGPLHEDYRERAIPMEGFEGEQPLHIFVAAGVDLAISKLGRFTENDQYDI
EQLIECGRVDVGQFVTLATEAIDYAVGNRSAMLGCLKLVIAKYLEDGRDATSGS
>Q57449 ~~~~~~Putative metal-binding protein HI_0362~~~COG0803
MRNSFKIMTALALGLFAMQANAKFKVVTTFTVIQDIAQNVAGNAATVESITKPGAEIHEYEPTPKDIVKAQSADLILWNG
LNLERWFERFFQNVKDKPAVVVTEGIQPLSIYEGPYKDAPNPHAWMSPSNALIYIENIKNALVKYDPQNAAVYEKNAADY
AQKIKQLDEPLRAKLAQIPEAQRWLVTSEGAFSYLAKDYNLKEGYLWPINAEQQGTPQQVRKVIDLVRKNNIPVVFSEST
ISAKPAQQVAKESGAKYGGVLYVDSLSAKNGPVPTYIDLLNVTVSTIVKGFGK
>P9WI89 ~~~~~~Uncharacterized protein Rv3633~~~COG5285
MTQSSSVERLVGEIDEFGYTVVEDVLDADSVAAYLADTRRLERELPTVIANSTTVVKGLARPGHVPVDRVDHDWVRIDNL
LLHGTRYEALPVHPKLLPVIEGVLGRDCLLSWCMTSNQLPGAVAQRLHCDDEMYPLPRPHQPLLCNALIALCDFTADNGA
TQVVPGSHRWPERPSPPYPEGKPVEINAGDALIWNGSLWHTAAANRTDAPRPALTINFCVGFVRQQVNQQLSIPRELVRC
FEPRLQELIGYGLYAGKMGRIDWRPPADYLDADRHPFLDAVADRLQTSVRL
>A3D8P6 1.14.11.-~~~~~~PKHD-type hydroxylase Sbal_3634~~~
MLIEIPNVFSKQEVSHLREQLDARRWIDGNQTSGAMATTRKRNQQLDKDDPVAVALGQQIMDRLLAHPQFVSAALPLQFY
PPLFNRYQGGETFGYHIDNAIRSTPDGMIRTDLSATLFLSEPENYQGGELVIQDTYGQQSIKLSAGSLVLYPSSSLHQVT
PVLSGERTAAFMWLQSMVRDEGQRRLLFQLDQSIQSLTAQTAAEQELFNLSGVYHNLLRRWSEL
>Q57F22 ~~~~~~Uncharacterized protein BruAb1_0363~~~
MRSTKVIHIVGCHAEGEVGDVIVGGVAPPPGETVWEQSRFIANDETLRNFVLNEPRGGVFRHVNLLVPPKDPRAQMGFII
MEPADTPPMSGSNSICVSTVLLDSGIIAMQEPVTHMVLEAPGGIIEVEAECRNGKAERISVRNVPSFADRLDAPLDVTGL
GTIMVDTAYGGDSFVIVDAAQIGMKIEPGQARELAEIGVKITKAANEQLGFRHPERDWRHISFCQITEPVTREGDVLTGV
NTVAIRPAKLDRSPTGTGCSARMAVLHAKGQMKAGERFIGKSVLGTEFHCRLDKVLELGGKPAISPIISGRAWVTGTSQL
MLDPSDPFPHGYRLSDTWPRDE
>P58495 3.4.21.-~~~~~~Uncharacterized peptidase Lmo0363~~~COG3340
MKNLFLTSSFKDVVPLFTEFESNLQGKTVTFIPTASTVEEVTFYVEAGKKALESLGLLVEELDIATESLGEITTKLRKND
FIYVTGGNTFFLLQELKRTGADKLILEEIAAGKLYIGESAGAVITSPNIAYIQTMDSTKKAVNLTNYDALNLVDFSTLPH
YNNTPFKEITQKIVTEYAGKSQIYPISNHEAIFIRGKEVITKRLS
>Q9HXY3 3.4.24.-~~~~~~Putative zinc metalloprotease PA3649~~~
MSALYMIVGTLVALGVLVTFHEFGHFWVARRCGVKVLRFSVGFGTPLVRWHDRHGTEFVVAAIPLGGYVKMLDEREAEVP
AHLLEQSFNRKTVRQRIAIVAAGPIANFLLAILFFWVVALLGSQQVRPVIGSVAPESLAAQAGLEAGQELLAVDGEPVTG
WNGVNLQLVRRLGESGTLEVRVQEKGSNVDSTHQVRLDGWLKGEDNPDPIASLGIRPWRPALPPVLAELDPKGPAQAAGL
KLGDRLQSIDGIAVDDWQQVVDSVRARPGQRVQLKVLRDGEVLDVALELAVRGEGKARSGYMGAGVAGTEWPAEMLREVS
YGPLEAVGQALSRTWTMSLLTLDSIKKMLLGELSVKNLSGPITIAKVAGASAQSGVGDFLNFLAYLSISLGVLNLLPIPV
LDGGHLLFYLVEWVRGRPLSERVQAWGMQIGISLVVGVMLLALVNDLSRL
>P9WP09 ~~~~~~Uncharacterized membrane protein Rv0364~~~COG0586
MSTAVTAMPDILDPMYWLGANGVFGSAVLPGILIIVFIETGLLFPLLPGESLLFTGGLLSASPAPPVTIGVLAPCVALVA
VLGDQTAYFIGRRIGPALFKKEDSRFFKKHYVTESHAFFEKYGKWTIILARFVPIARTFVPVIAGVSYMRYPVFLGFDIV
GGVAWGAGVTLAGYFLGSVPFVHMNFQLIILAIVFVSLLPALVSAARVYRARRNAPQSDPDPLVLPE
>P9WMT3 ~~~~~~Putative conjugal transfer protein Rv3659c~~~COG4962
MLGDTEVLANLRVLQTELTGAGILEPLLSADGTTDVLVTAPDSVWVDDGNGLRRSQIRFADESAVRRLAQRLALAAGRRL
DDAQPWVDGQLTGIGVGGFAVRLHAVLPPVATQGTCLSLRVLRPATQDLAALAAAGAIDPAAAALVADIVTARLAFLVCG
GTGAGKTTLLAAMLGAVSPDERIVCVEDAAELAPRHPHLVKLVARRANVEGIGEVTVRQLVRQALRMRPDRIVVGEVRGA
EVVDLLAALNTGHEGGAGTVHANNPGEVPARMEALGALGGLDRAALHSQLAAAVQVLLHVARDRAGRRRLAEIAVLRQAE
GRVQAVTVWHADRGMSDDAAALHDLLRSRASA
>P9WKX7 ~~~~~~Uncharacterized protein Rv3660c~~~COG0455
MLTDPGLRDELDRVAAAVGVRVVHLGGRHPVSRKTWSAAAAVVLDHAAADRCGRLALPRRTHVSVLTGTEAATATWAAAI
TVGAQHVLRMPEQEGELVRELAEAAESARDDGICGAVVAVIGGRGGAGASLFAVALAQAAADALLVDLDPWAGGIDLLVG
GETAPGLRWPDLALQGGRLNWSAVRAALPRPRGISVLSGTRRGYELDAGPVDAVIDAGRRGGVTVVCDLPRRLTDATQAA
LDAADLVVLVSPCDVRACAAAATMAPVLTAINPNLGLVVRGPSPGGLRAAEVADVAGVPLLASMRAQPRLAEQLEHGGLR
LRRRSVLASAARRVLGVLPRAGSGRHGRAA
>P9WGJ1 3.1.3.-~~~~~~Probable phosphatase Rv3661~~~COG0560
MTVSDSPAQRQTPPQTPGGTAPRARTAAFFDLDKTIIAKSSTLAFSKPFFAQGLLNRRAVLKSSYAQFIFLLSGADHDQM
DRMRTHLTNMCAGWDVAQVRSIVNETLHDIVTPLVFAEAADLIAAHKLCGRDVVVVSASGEEIVGPIARALGATHAMATR
MIVEDGKYTGEVAFYCYGEGKAQAIRELAASEGYPLEHCYAYSDSITDLPMLEAVGHASVVNPDRGLRKEASVRGWPVLS
FSRPVSLRDRIPAPSAAAIATTAAVGISALAAGAVTYALLRRFAFQP
>Q2YPK6 ~~~~~~Uncharacterized protein BAB1_0366~~~
MRSTKVIHIVGCHAEGEVGDVIVGGVAPPPGETVWEQSRFIANDETLRNFVLNEPRGGVFRHVNLLVPPKDPRAQMGFII
MEPADTPPMSGSNSICVSTVLLDSGIIAMQEPVTHMVLEAPGGIIEVEAECRNGKAERISVRNVPSFADRLDAPLDVTGL
GTIMVDTAYGGDSFVIVDAAQIGMKIEPGQARELAEIGVKITKAANEQLGFRHPERDWRHISFCQITEPVTREGDVLTGV
NTVAIRPAKLDRSPTGTGCSARMAVLHAKGQMKAGERFIGKSVLGTEFHCRLDKVLELGGKPAISPIISGRAWVTGTSQL
MLDPSDPFPHGYRLSDTWPRDE
>P9WHR9 3.4.21.-~~~~~~Serine protease Rv3671c~~~COG0265
MTPSQWLDIAVLAVAFIAAISGWRAGALGSMLSFGGVLLGATAGVLLAPHIVSQISAPRAKLFAALFLILALVVVGEVAG
VVLGRAVRGAIRNRPIRLIDSVIGVGVQLVVVLTAAWLLAMPLTQSKEQPELAAAVKGSRVLARVNEAAPTWLKTVPKRL
SALLNTSGLPAVLEPFSRTPVIPVASPDPALVNNPVVAATEPSVVKIRSLAPRCQKVLEGTGFVISPDRVMTNAHVVAGS
NNVTVYAGDKPFEATVVSYDPSVDVAILAVPHLPPPPLVFAAEPAKTGADVVVLGYPGGGNFTATPARIREAIRLSGPDI
YGDPEPVTRDVYTIRADVEQGDSGGPLIDLNGQVLGVVFGAAIDDAETGFVLTAGEVAGQLAKIGATQPVGTGACVS
>P9WKX5 ~~~~~~Putative ATPase Rv3679~~~COG0003
MVATTSSGGSSVGWPSRLSGVRLHLVTGKGGTGKSTIAAALALTLAAGGRKVLLVEVEGRQGIAQLFDVPPLPYQELKIA
TAERGGQVNALAIDIEAAFLEYLDMFYNLGIAGRAMRRIGAVEFATTIAPGLRDVLLTGKIKETVVRLDKNKLPVYDAIV
VDAPPTGRIARFLDVTKAVSDLAKGGPVHAQSEGVVKLLHSNQTAIHLVTLLEALPVQETLEAIEELAQMELPIGSVIVN
RNIPAHLEPQDLAKAAEGEVDADSVRAGLLTAGVKLPDADFAGLLTETIQHATRITARAEIAQQLDALQVPRLELPTVSD
GVDLGSLYELSESLAQQGVR
>Q92EU1 ~~~~~~Putative glycerol transporter Lin0367~~~
MSIGIALATIAGGFLFPFAIRMMWGKMVDEWGAIGGWMAAAFIVGTVWTINHGIPKSMIYQSGTVWVDMAVAAGIGVFTA
SLLTGGKFSKSVVNLAAAVVGGVLGGFLLSLFL
>Q92EU0 ~~~~~~Putative glycerol transporter Lin0368~~~
MKFFRGMIGFCIAGMIVMSVWTPLAENYGIFGGYLAAFIIIGPMWFMNHHVGLIENDEDAAFVDMAVGIGICGIMRDVFM
RGGGELVSSLPTIGLVAIGAVLAGIVAAAIEKDMARKHEAKQEKTEPGMNIKEEERLNENQLV
>O69659 ~~~~~~Uncharacterized membrane protein Rv3691~~~
MGAGPVIPTRLATVRRRRPWRGVLLTLAAVAVVASIGTYLTAPRPGGAMAPASTSSTGGHALATLLGNHGVEVVVADSIA
DVEAAARPDSLLLVAQTQYLVDNALLDRLAKAPGDLLLVAPTSRTRTALTPQLRIAAASPFNSQPNCTLREANRAGSVQW
GPSDTYQATGDLVLTSCYGGALVRFRAEGRTITVVGSSNFMTNGGLLPAGNAALAMNLAGNRPRLVWYAPDHIEGEMSSP
SSLSDLIPENVHWTIWQLWLVVLLVALWKGRRIGPLVAEELPVVIRASETVEGRGRLYRSRRARDRAADALRTATLQRLR
PRLGVGAGAPAPAVVTTIAQRSKADPPFVAYHLFGPAPATDNDLLQLARALDDIERQVTHS
>Q4UQD0 ~~~~~~UPF0234 protein XC_3703~~~
MPSFDVISEVDKHELTNAVDQANRELDTRFDFKGVEAKFELEDGKVINQSAPSDFQVKQMTDILRARLLARGIDVRCLEF
GDVETNLAGARQKVTVKQGIEQKQAKQLVAKLKEAKLKVEAQINGDKLRVTGKKRDDLQDAIAVLKKADFELPLQFDNFR
D
>P60855 ~~~~~~Uncharacterized protein SA0370~~~
MLTKEFAQRVELSEKQVRKIVQHLEERGYQLSKTEYRGREATDFKEEDIELFKDIADKVKQTNSYDLAFDELEKEKDFLQ
VIVKNDDKNLPTNQNVAQLVEDLRLEIQKMREERHLLGQMMNQVHQQQQELKELQNQLTSKIDSNSESLKAIQTSQEAIQ
EAQASQAKVLAESTNKVEKNAVTEDKADSKDSKVAGVNTSTDAKTDTKAENAGDGTATKVDKEDQISATEAIEKASVEQS
KNENAAETSNKEATVDADAQHDAEQQVAEAHAEASKQATSNDSLEAKAENDSTASQSEMSEPKPQEEKKGFFARLFNL
>P9WNR9 ~~~~~~Nucleoid-associated protein Rv3716c~~~COG0718
MQPGGDMSALLAQAQQMQQKLLEAQQQLANSEVHGQAGGGLVKVVVKGSGEVIGVTIDPKVVDPDDIETLQDLIVGAMRD
ASQQVTKMAQERLGALAGAMRPPAPPAAPPGAPGMPGMPGMPGAPGAPPVPGI
>P75410 ~~~~~~Uncharacterized protein MPN_371~~~
MRIEAANLAGSLWICSVVNHGVQGVGVPSWVPDPELEGAVPKSSALSWTCWLLLEPRLIGALARLLVSSSIWPLSSESDF
FFTATCNALTLVSPDEPHVGWIGQIQMWLKNQWPQRPGVFHCSSRCPPRRSSPSSQTLPRWWKYFDHSRFAAVVSPTPFA
TAHSTPRCAARVKRQTGRDWRGLAPPRRGPCFWPRFTVGVQPHSNQSRQRG
>Q97SJ0 ~~~~~~UPF0398 protein SP_0371~~~COG4474
MATALVLGYSAFDLGLFSDKDPRLKLIKKAIRKDLEAMAADGVSWLVFTGSLGFEYWVLEVAQEMKTEYGFQLATIFAFE
THGENWNEGNQMKLSRFKQVDFVKYAYPRYEHKGQLRDYQQFLLENTTSSYLFYDEENETKLAYFYQKMKNQEDYFIKRL
TFDELNELAENFSEN
>P75408 ~~~~~~Uncharacterized protein MPN_373~~~
MVSDGGGQTDNNAEGGNLRIALTKNAFNPNQSTTVDIPYKIENRSVGNNKEQKTLVFDFSGLNPYEYNMIVGALFTDSSF
INDAYAPIQSTFQRQLKEFLQVKYENQVGANGSFDLFKPRSLSSQQLVQGERSLDGFTVELNANGGSFNFLTHVDPLVAG
LTVAAIASVVVAGAVTYLVVRRYRKRNEFVDKIFASNIRAKQWR
>P9WKA5 2.3.1.20~~~~~~Putative diacyglycerol O-acyltransferase Rv3740c~~~COG1020
MSPIDALFLSAESREHPLHVGALQLFEPPAGAGRGFVRETYQAMLQCREIAPLFRKRPTSLHGALINLGWSTDADVDLGY
HARRSALPAPGRVRELLELTSRLHSNLLDRHRPLWETHVIEGLRDGRFAIYSKMHHALVDGVSGLTLMRQPMTTDPIEGK
LRTAWSPATQHTAIKRRRGRLQQLGGMLGSVAGLAPSTLRLARSALIEQQLTLPFGAPHTMLNVAVGGARRCAAQSWPLD
RVKAVKDAAGVSLNDVVLAMCAGALREYLDDNDALPDTPLVAMVPVSLRTDRDSVGGNMVGAVLCNLATHLDDPADRLNA
IHASMRGNKNVLSQLPRAQALAVSLLLLSPAALNTLPGLAKATPPPFNVCISNVPGAREPLYFNGARMVGNYPMSLVLDG
QALNITLTSTADSLDFGVVGCRRSVPHVQRVLSHLETSLKELERAVGL
>O69720 ~~~~~~Protein Rv3753c~~~
MGAQRASMQRPAADTPDGFGVAVVREEGRWRCSPMGPKALTSLRAAETELRELRSAGAVFGLLDVDDEFFVIVRPAPSGT
RLLLSDATAALDYDIAAEVLDNLDAEIDPEDLEDADPFEEGDLGLLSDIGLPEAVLGVILDETDLYADEQLGRIAREMGF
ADQLSAVIDRLGR
>O69726 ~~~~~~Uncharacterized membrane protein Rv3760~~~COG5416
MTSNPSSSADQPLSGTTVPGSVPGKAPEEPPVKFTRAAAVWSALIVGFLILILLLIFIAQNTASAQFAFFGWRWSLPLGV
AILLAAVGGGLITVFAGTARILQLRRAAKKTHAAALR
>O69731 ~~~~~~Uncharacterized protein Rv3766~~~COG0739
MTDTLFADVSEYQVPVNNSYPYRVLSIRVCDGTYRDRNFAHNYRWMRSAFDSGRLTFGIVYTYARPNWWANANTVRSMID
AAGGLHPRVALMLDVESGGNPPGDGSSWINRLYWNLADYAGSPVRIIGYANAYDFFNMWRVRPAGLRVIGAGYGSNPNLP
GQVAHQYTDGSGYSPNLPQGAPPFGRCDMNSANGLTPQQFAAACGVTTTGGPLMALTDEEQTELLTKVREIWDQLRGPNG
AGWPQLGQNEQGQDLTPVDAIAVIKNDVAAMLAE
>P9WQ67 ~~~~~~Uncharacterized protein Rv3778c~~~COG0520
MAYDVARVRGLHPSLGDGWVHFDAPAGMLIPDSVATTVSTAFRRSGASTVGAHPSARRSAAVLDAAREAVADLVNADPGG
VVLGADRAVLLSLLAEASSSRAGLGYEVIVSRLDDEANIAPWLRAAHRYGAKVKWAEVDIETGELPTWQWESLISKSTRL
VAVNSASGTLGGVTDLRAMTKLVHDVGALVVVDHSAAAPYRLLDIRETDADVVTVNAHAWGGPPIGAMVFRDPSVMNSFG
SVSTNPYATGPARLEIGVHQFGLLAGVVASIEYLAALDESARGSRRERLAVSMQSADAYLNRVFDYLMVSLRSLPLVMLI
GRPEAQIPVVSFAVHKVPADRVVQRLADNGILAIANTGSRVLDVLGVNDVGGAVTVGLAHYSTMAEVDQLVRALASLG
>Q9EXD3 ~~~~~~Uncharacterized protein MPN_377~~~
MSKDKKNKVEQLEPVDLFERTKLEDTQVLNDVELDDIKKLEELKKELENTFEPRTRIEIKREIKELERKLRRNR
>P9WMF7 ~~~~~~Uncharacterized HTH-type transcriptional regulator Rv0377~~~COG0583
MTPAQLRAYSAVVRLGSVRAAAAELGLSDAGVSMHVAALRKELDDPLFTRTGAGLAFTPGGLRLASRAVEILGLQQQTAI
EVTEAAHGRRLLRIAASSAFAEHAAPGLIELFSSRADDLSVELSVHPTSRFRELICSRAVDIAIGPASESSIGSDGSIFL
RPFLKYQIITVVAPNSPLAAGIPMPALLRHQQWMLGPSAGSVDGEIATMLRGLAIPESQQRIFQSDAAALEEVMRVGGAT
LAIGFAVAKDLAAGRLVHVTGPGLDRAGEWCVATLAPSARQPAVSELVGFISTPRCIQAMIPGSGVGVTRFRPKVHVTLW
S
>P9WKW9 ~~~~~~Uncharacterized protein Rv3786c~~~COG0739
MRILAMTRAHNAGRTLAATLDSLAVFSDDIYVIDDRSTDDTAEILANHPAVTNVVRARPDLPPTPWLIPESAGLELLYRM
ADFCRPDWVMMVDADWLVETDIDLRAVLARTPDDIVALMCPMVSRWDDPEYPDLIPVMGTAEALRGPLWRWYPGLRAGGK
LMHNPHWPANITDHGRIGQLPGVRLVHSGWSTLAERILRVEHYLRLDPDYRFNFGVAYDRSLLFGYALDEVDLLKADYRR
RIRGDFDPLEPGGRLPIDREPRAIGRGYGPHAGGFHPGVDFATDPGTPVYAVASGAVSAIDEVDGLVSLTIARCELDVVY
VFRPGDEGRLVLGDRIAAGAQLGTIGAQGESADGYLHFEVRTQDGHVNPVRYLANMGLRPWPPPGRLRAVSGSYPPATPC
TITAEDR
>P9WKW7 ~~~~~~Uncharacterized protein Rv3788~~~COG0782
MSEKVESKGLADAARDHLAAELARLRQRRDRLEVEVKNDRGMIGDHGDAAEAIQRADELAILGDRINELDRRLRTGPTPW
SGSETLPGGTEVTLRFPDGEVVTMHVISVVEETPVGREAETLTARSPLGQALAGHQPGDTVTYSTPQGPNQVQLLAVKLP
S
>P44676 2.1.1.-~~~~~~Uncharacterized tRNA/rRNA methyltransferase HI_0380~~~COG0565
MLENIRIVLIETSHSGNIGSAARAMKTMGLTQLCLVSPKSVDEQSYALSAGAENIVKNARVVDSFDEAVDDCSLVIGTSA
RLRHLQNTLIEPRECAEKVVAYKGKIAIVFGRERIGLTNEELLKCHYHLNIPANPDYSSLNLAMAVQLVSYELRMAFLVQ
NNKKNSLSLIEKNYPTTDQLAYFFDYTERIYQSLGFIQNQGVMRKLKRLYYRAKLEKNELNILNGMLSAVEKRIDLTKED
N
>P9WH21 1.-.-.-~~~~~~Putative Rieske 2Fe-2S iron-sulfur protein Rv3818~~~COG2146
MQVTSVGHAGFLIQTQAGSILCDPWVNPAYFASWFPFPDNSGLDWGALGECDYLYVSHLHKDHFDAENLRAHVNKDAVVL
LPDFPVPDLRNELQKLGFHRFFETTDSVKHRLRGPNGDLDVMIIALRAPADGPIGDSALVVADGETTAFNMNDARPVDLD
VLASEFGHIDVHMLQYSGAIWYPMVYDMPARAKDAFGAQKRQRQMDRARQYIAQVGATWVVPSAGPPCFLAPELRHLNDD
GSDPANIFPDQMVFLDQMRAHGQDGGLLMIPGSTADFTGTTLNSLRHPLPAEQVEAIFTTDKAAYIADYADRMAPVLAAQ
KAGWAAAAGEPLLQPLRTLFEPIMLQSNEICDGIGYPVELAIGPETIVLDFPKRAVREPIPDERFRYGFAIAPELVRTVL
RDNEPDWVNTIFLSTRFRAWRVGGYNEYLYTFFKCLTDERIAYADGWFAEAHDDSSSITLNGWEIQRRCPHLKADLSKFG
VVEGNTLTCNLHGWQWRLDDGRCLTARGHQLRSSRP
>P9WKW5 ~~~~~~Uncharacterized membrane protein Rv3835~~~
MLDAPEQDPVDPGDPASPPHGEAEQPLPGPRWPRALRASATRRALLLTALGGLLIAGLVTAIPAVGRAPERLAGYIASNP
VPSTGAKINASFNRVASGDCLMWPDGTPESAAIVSCADEHRFEVAESIDMRTFPGMEYGQNAAPPSPARIQQISEEQCEA
AVRRYLGTKFDPNSKFTISMLWPGDRAWRQAGERRMLCGLQSPGPNNQQLAFKGKVADIDQSKVWPAGTCLGIDATTNQP
IDVPVDCAAPHAMEVSGTVNLAERFPDALPSEPEQDGFIKDACTRMTDAYLAPLKLRTTTLTLIYPTLTLPSWSAGSRVV
ACSIGATLGNGGWATLVNSAKGALLINGQPPVPPPDIPEERLNLPPIPLQLPTPRPAPPAQQLPSTPPGTQHLPAQQPVV
TPTRPPESHAPASAAPAETQPPPPDAGAPPATQSPEATPPGPAEPAPAG
>P9WFH5 2.1.1.-~~~~~~Putative S-adenosyl-L-methionine-dependent methyltransferase Rv3767c~~~COG3315
MPRTDNDSWAITESVGATALGVAAARAAETESDNPLINDPFARIFVDAAGDGIWSMYTNRTLLAGATDLDPDLRAPIQQM
IDFMAARTAFFDEYFLATADAGVRQVVILASGLDSRAWRLPWPDGTVVYELDQPKVLEFKSATLRQHGAQPASQLVNVPI
DLRQDWPKALQKAGFDPSKPCAWLAEGLVRYLPARAQDLLFERIDALSRPGSWLASNVPGAGFLDPERMRRQRADMRRMR
AAAAKLVETEISDVDDLWYAEQRTAVAEWLRERGWDVSTATLPELLARYGRSIPHSGEDSIPPNLFVSAQRATS
>Q8EAL4 ~~~~~~UPF0339 protein SO_3888~~~COG3422
MSGWYELSKSSNDQFKFVLKAGNGEVILTSELYTGKSGAMNGIESVQTNSPIEARYAKEVAKNDKPYFNLKAANHQIIGT
SQMYSSTAARDNGIKSVMENGKTTTIKDLT
>P9WFH3 2.1.1.-~~~~~~Putative S-adenosyl-L-methionine-dependent methyltransferase Rv3787c~~~COG3315
MARTDDDSWDLATGVGATATLVAAGRARAARAAQPLIDDPFAEPLVRAVGVEFLTRWATGELDAADVDDPDAAWGLQRMT
TELVVRTRYFDQFFLDAAAAGVRQAVILASGLDARGYRLPWPADTTVFEVDQPRVLEFKAQTLAGLGAQPTADLRMVPAD
LRHDWPDALRRGGFDAAEPAAWIAEGLFGYLPPDAQNRLLDHVTDLSAPGSRLALEAFLGSADRDSARVEEMIRTATRGW
REHGFHLDIWALNYAGPRHEVSGYLDNHGWRSVGTTTAQLLAAHDLPAAPALPAGLADRPNYWTCVLG
>Q8U919 4.2.1.-~~~~~~Putative hydro-lyase Atu3911~~~COG4336
MTIPTSYLNHTDAEAARKARATYRDGLVAPTSGIAPGFTQANMIVLPRDWAFDFLLYAQRNPKPCPVLDVSDPGSPTTLL
APGADLRTDLPLYRIWRDGKLAEETADATSAWAERDDLVAFLIGCSFTFETPMVEAGIEIRHMTDKSNVPMYLTNRPCRP
AGRLKGNMVVSMRPIPASRVADAATISGRFPAVHGAPVHVGAPEQIGISDLSKPDFGDAVRIEPGEVPVFWACGVTPQAA
VMASGVPFAITHAPGHMFITDIPDTAYHA
>Q9HX91 ~~~~~~Uncharacterized protein PA3922~~~
MKTTKILLHTGVLALSLLATQVMAAVSADEAAKLGTSLTPLGAEKAGNADGSIPAWDGGLATNAGSVDSRGFLANPYASE
QPLFTITAQNVDQYKDKLTPGQLAMFKRYPDTYKIPVYKTHRSATVPAAVQEAAKRNATTTKLVEGGNGLENFDTANPFP
IPQNGLEVIWNHITRYRGGSVRRLVTQATPQVNGSYQLVYFQDAFTFRTNLKDYNPNKPSNVLFYFKQRVTAPSRLAGNV
LLVHETLNQVKEPRLAWLYNAGQRRVRRAPQVSYDGPGTAADGLRTSDNFDMYNGAPDRYDWKLEGKKEIYIPYNSYKLD
DPKIKYSEIVKAGHINQDLTRYELHRVWHVVATLKPGERHIYAKRDFYIDEDTWQAAEIDHYDGRGTLWRVAEAHAEQYY
DKQVPWYAVETLYDLLSGRYLALGMKNEEKQAYDFNYSASESDYTPAALRQEGVR
>A0QZA1 ~~~~~~Universal stress protein MSMEG_3950/MSMEI_3859~~~COG0589
MVQSATEYGILVGVDSSAESDAAVRWAAREASLHDAPITLMHVIAPVVVSWPAGPYMATVLECQEENARHAIEQAQKVVA
DCLGETHGLTVQTEIRKESVARTLIDASKSAQMVVVGNRGMGALGRVLLGSTSTSLLHYASGPVVVVHGDDQAAHDSRLP
VLLGIDGSPASEVATSHAFDEASRRGVDLVALHVWIDVGDIPPIGPTWEEQEETGRALLAERLAGWQERYPDVKVHRRVE
RAQPAYWLLEEAKQAQLVVVGSHGRGGFTGMLLGSVSSRVAQSATTPVMVVRPR
>P43994 ~~~~~~UPF0125 protein HI_0395~~~COG2914
MNQINIEIAYAFPERYYLKSFQVDEGITVQTAITQSGILSQFPEIDLSTNKIGIFSRPIKLTDVLKEGDRIEIYRPLLAD
PKEIRRKRAAEQAAAKDKEKGA
>P44683 1.14.11.-~~~~~~Probable ribosomal oxygenase HI_0396~~~COG2850
MTALSSVDFCLPEHITPEIFLRDYWQKKPLVIRNGLPEIVGQFEPQDIIELAQNEDVTARLVKTFSDDDWKVFFSPLSEK
DFQKLPEKWSVLVQNLEQWSPELGQLWNKFGFIPQWQRDDIMVSYAPKGGSVGKHYDEYDVFLVQGYGHRRWQVGKWCDA
STEFKPNQSIRIFDDMGELVIDEVMNPGDILYIPARMAHYGVAEDDCLTFSFGLRYPNLSNLIDGISKGFCHQDPDLNLS
EFDLPLRLSQSEQRTGKLADENIQAMKQLLLDKLAHSEAFDTLFKQAVASAVSSRRYELLVSDEMCDPDEVRSILEEDGA
FLSQDNNCKLLYTENPLRIYANGEWLDELNIIESEVLKRLSDGESLDWAFLSDLANKTEDPETSMDLLLDSICNWVDDGW
ALIE
>Q7A7G5 ~~~lpl2~~~Uncharacterized lipoprotein SA0397~~~
MGYLKKLALFISVIILGIFIIGCDSSSDTAEKAKEDSKEEQIKKSFAKTLDMYPIKNLEDLYDKEGYRDGEFKKGDKGTW
TLLTSFSKSNKPGEIDDEGMVLYLNRNTKKATGYYFVNKIYDDISKNQNEKKYRVELKNNKIILLDNVEDEKLKQKIENF
KFFSQYADFKDLKNYQDGSITTNENVPRYEAEYKLNNSDTNVKKLRDIYPITTKKAPILKLHIDGDIKGSSVGYKKIEYK
FSKVKDQETTLRDYLNFGPSDEDS
>A0QZE3 3.5.-.-~~~~~~Putative hydrolase MSMEG_3995/MSMEI_3903~~~COG0624
MTVPVNATNLRIPLDTGRDREFLDSWAELEAIGATPAGGVERQAGTAEDGQMRDWLSRWLRTRGFSVEVDPIGNLFGLLE
FNPGAPYVLVGSHLDSQPRGGRFDGAYGVLAGAVAADRTRRYVTRSGFTPRYNVAVVDWFNEEGSRFKPSMMGSAVFTGT
LDLEEALNTTDDDGVSVRDALAAINGIGDREVFSSTGPRQLAAYAEIHIEQGRELEKNNVTIGLVDRTWAANKYELNVVG
IQGHTGATAIEDRQDALLGAALIVVALRDIADEFGEELHTSCGQLTVLPNSPVVVPREVHMHLDLRSDNDELLAAADAAL
RRRIAEAEIRAGVKVEHRKAHVWPGHHYQPQGVELARDAANDLGISSMLVQTRAGHDSTNMKEIVPSVMLFVPSVEGISH
AEAEYTSDEDLCSGVDLLTEVVARMLDGSLDAAGAGHP
>Q9ADP8 ~~~~~~Probable transporter SCO4007~~~COG2814
MPSSPSSTTPAPTSTPAARREPSGKGPSGAAARLFLPLIALCTAVTAANIYLAAPLLPLIAHDMGSTPSAVAWLASVAQL
GYAAGLLFFAPLGDSVNRRRLVAALSLVATAALLTAAASAGTGALAGAVLVASAATVVPQLLVPLVAERAPADRRARHVA
AVIAGLFTGVVAARVLGGLAGQAFGWRAVFVGAAVLTAVLGLATAYILPVERRQRRGPLFAGLVAIPGLVRRSPDLWRAC
VRQAGMYGAWSALWTSLALLLTEGEGYGMTTAAAGLFGLFGLAASVVAPLAGGLVDRFGAAKVVRSAYALAALSVPLFWL
GGQVMAALCAAAVLVHAALVASHVANQTLALTTTSAPATANTAYVVAGFAGGALASALAGPAFGHWGWGGVCAVAGAWLV
LGWTATAVRPARSARSARSARSVRSVRSAR
>P0DH73 ~~~prgT~~~Putative regulatory protein PrgT~~~
MTKKEQSIWRKEMLALMNEDADWYRNEDTERFKRIQELAKKIETASTRQFSSHISKERFEAYQRMGLQFKEIAEEFHITT
TALQQWHKDNGYPIYNKNNRK
>Q6HDF0 2.1.1.-~~~~~~Uncharacterized methyltransferase BT9727_4108~~~
MGTEFNGLFDEWAHTYDSFVQGEDIQYKEVFAHYEDILEDVVNKSFGNVLEFGVGTGNLTNKLLLAGRTVYGIEPSREMR
MIAKEKLPKEFSITEGDFLSFEVPTSIDTIVSTYAFHHLTDDEKNVAIAKYSQLLNKGGKIVFADTIFADQDAYDKTVEA
AKQRGFHQLANDLQTEYYTRIPVMQTIFENNGFHVTFTRLNHFVWVMEATKQ
>Q73SC8 1.-.-.-~~~~~~Uncharacterized NAD-dependent oxidoreductase MAP_4146~~~COG1028
MAGQAGSLQGRVAFITGAARGQGRSHAVRLAAEGADIIACDICAPVSASVTYAPASPEDLDETARLVEDQGRKALTRVLD
VRDDAALRELVADGMEQFGRLDVVVANAGVLSWGRVWELTDEQWDTVIGVNLTGTWRTLRATVPAMIEAGNGGSIVVVSS
SAGLKATPGNGHYSASKHGLTALTNTLAIELGEYGIRVNSIHPYSVETPMIEPEAMMEIFARHPSFVHSFPPMPVQPNGF
MTADEVADVVAWLAGDGSGTLTGTQIPVDKGALKY
>A0QZZ6 ~~~~~~Universal stress protein MSMEG_4207~~~COG0589
MIVVGYSADPFGRAAVEHGIEEAKRRDTGLLVINATAGDAYVDARFARSGEVHDVEAHLQDSGVPFEIRQPVGVDATEEL
LTAMDSPDAELLVIGIRHRNPVGKLLLGSVAQRLLLECPKPVLAVKPHGF
>P75364 ~~~~~~Uncharacterized protein MG296 homolog~~~
MKPQLLALKQFVQTEFEKVDFETFRQNFNRCLEREQSTLLIYEDDDYDDQSFFLKPMLSDAFFISSEVVKQLDLLAVLVD
NPKGDVKSCCQSFYEALTLFISALAITKGVDVGRYHQQLGKRFGVLTVY
>Q9I690 ~~~~~~UPF0312 protein PA0423~~~
MLKKTLAALALGSALFTAGQAMAADYKIDKEGQHAFIEFRIKHLGYSWLYGRFNDFDGSFTFDEKNPSADKVKVTINTNS
VDTNHAERDKHLRSGDFLNVSKNPTATFESTEVKANGDSADITGNLTLNGVTKPVTIKAKLIGQGDDPWGGYRAGFEGSA
TLKLKDFGIKMDLGPASQEVELLLSVEGIRQ
>P44709 ~~~~~~Uncharacterized protein HI_0431~~~COG3068
MRNPIHKRLENLESWQHLTFMAALCERMAPNFKLFCQMNELSAETKTYQNILNLVWEYLTVKDVKINFENQLEKLETIIP
DVNDYDSFGVVPALDACQALAEILHAIIAGETLEKAVEISLISLGTIRVLLETETGRDWSESKLKENEDIQTELDVQWQV
YRLLKECEKRDIELILALKNEIRTEGISNIGIEFHQ
>P99126 ~~~~~~Nucleoid-associated protein SA0437~~~
MRGGGNMQQMMKQMQKMQKKMAQEQEKLKEERIVGTAGGGMVAVTVTGHKEVVDVEIKEEAVDPDDIEMLQDLVLAATNE
AMNKADELTQERLGKHTQGLNIPGM
>P9WKW3 ~~~~~~Uncharacterized protein Rv0441c~~~
MGAKKVDLKRLAAALPDYPFAYLITVDDGHRVHTVAVEPVLRELPDGPDGPRAVVDVGLIGGRTRQNLAHRSEVTLLWPP
SDPSGYSLIVDGRAQASDAGPDDDTARCGVVPIRALLHRDAAPDSPTAAKGCLHDCVVFSVP
>P44711 ~~~~~~Nucleoid-associated protein HI_0442~~~COG0718
MFGKGGLGGLMKQAQQMQEKMQKMQEEIAQLEVTGESGAGLVKITINGAHNCRRIDIDPSLMEDDKEMLEDLIAAAFNDA
VRRAEELQKEKMASVTAGMPLPPGMKFPF
>Q87WS9 ~~~~~~UPF0307 protein PSPTO_4464~~~COG3028
MVDSYDDSLDGEKSKTQVKRELHALVDLGERLTTLKADVLAKLPLTDALRKALAEAPKHTANIARKRHILFIGKLMRDQD
QEAILVLLDQLDASTRQYNERFHNLERWRDRLIAGDDADLEKFVIEYPDADRQQLRSLIRQAQHEVARNKPPATSRKIFK
YIRELDELQRGLR
>Q9HVT1 ~~~~~~Uncharacterized protein PA4490~~~
MRRLTAFGLALLLLASGVARGEPAVTLDPQQSQVFRAWFVRIAQEQLRQGPSPRWHQQDCAGLVRFAANEALKVHDGKWL
RANGLSNRYLPPELALSPEQRRLAQNWQQGGGQVGPYVNAIKLVQFNSRLVGRDLNQARPGDLMFYDQGDDQHLMIWMGR
SIAYHTGSSTPTDNGMRSVSLQQLMTWKDTRWIPDESNPNFIGIYRLAFLSQ
>Q8YNL5 ~~~~~~Uncharacterized protein alr4550~~~COG3659
MSNLLWKSLVVSPAVLGATLLVSSAAIAATNATTELSVTETVVPTELAQQPEIVAQAAPITEDTKVIDQVNRYSNEGKGN
AQSQVTSVSQFSDVQPTDWAFQALQSLVERYGCIAGYPNGTYRGNRALTRYEFAAGLNACLDRVNELIATATADLVTKQD
LATLQRLQEEFSAELATLRGRVDALEARTAELEANQFSTTTKLVGEAIFAVTDAFGENTGDANNTVFQNRVRLGLQTSFT
GRDVLTTRLAAGNATGFDFRDNNNNSIGASGQGLQTFQVGSTGNNNVEIDRLTYEAPFGPAQVYLAASGGRHSHYAAVNN
PYFFDKTDGGNGALSTFSSENPIYRIGGGAGIAFNVPFGQGGSILRPSSFTVGYLASDANNPGPNQGLFNGDYAALGQLN
FSVGDRLALAATYVHGYHGASGSALFDSGANGAIVGTSLANNNSFLNASSSNSYGLSAAFRPSDKLSVSGFVSYSDVTGF
GANDDREVWSYGIGVALPDFGKRGNVLGIFAGAQPYARGVQAGANEVPYQVEGFYKYRVSDNISITPGVIWVTNPGQNSN
ADDAIIGTLRTTFTF
>P75327 ~~~~~~Uncharacterized lipoprotein MG321 homolog~~~
MKFQRKYWGLLSTLGVSSAVALSACAAQARDVYVTSSASDLLKNNSVPMSMFNVSPTSSFFGSKYAGLTTYIATGSNKDD
GVNVATQTQEKLVLELATSVKGYKKKDKATSSQKTSTNSSCTTTSSGTSTSGEDDWECIGEIKRQSSSNGQNNQQSKSIT
EEEKFQEISQKATRYEFAIDTGIKWVDNNGKPVKDASGNDVKLSSKDFERGFEAYILSSELRFNRNGYFIDLMGLDVKKT
VGMTKKNGAQVQMKVASSDEKDGEEKTVKITDDAYNPEDYQSTDDSKFNVYLTSPFPFLLSMMSKEFFFPIPHTHPKVKA
IKVGKDSPLVYNEKNGSKILDQTKTNFDGIYGGGVNAWRDTWSVGPYYVESFNQSQIVFKRNSEYDTHITPNLPKTREDN
EKPIPTMINYFQPGATPEVFYSNYIAGGLSSAEVPYSQQEDARSRFAGTGDLRWVKVQKTAQSAQITYSSRPYVVEGETV
KTNSNITETEAKFLYNSESEEALTIRAGINGLINWQNLAIILLPNSGDLNYSIVPFGIFKEKGKNGASVQQKAVSTTEGS
DLMNDYYYKIEKEQRLGLIPQREGNYEKNKNVLESATVKINYYSSKATSGQAGAAASAAFAKNNNTSDNTQQNQTSSVEA
KSVNVTKHSFVQALKKVGFSGSNPLHFNMKLGNSSLSANGVDYYNAVKQALTELGTADNGEKLIVPEIILGDAQGPTRNE
WYIGLSSVLGFSSWSPDYDGVGTWLDAATQLNDQGGGDVITYSSGAHIVRTLLLAASQKDVHSKFTQKIDQQNTASTTSD
VTVKKADSSQDSSKSNTEEEKWDDVTSADLFKDDPYVLKNFGDAKAQAAQRSTGSTTSGNGTQASLEFTKKALSLLKFLV
DNGILDKEKVKEAIKDPNKYLGKRDKIENGTNKPSKNEDFIGYELKDIYKKAAQLNRFNSIWAEKDTDNAKFLITVVDSY
FPVLPVPAAGLNETSPTLLKPWFQFRSAPSGNGTIRDYGFIPENK
>Q9K0V1 ~~~~~~Uncharacterized protein NMB0459~~~
MSNWKPNIPYNDLPPLPPKQDIESKTILKRCIAARASLARLKQAAELIPNQAMLINTLPVMEARASSEIENIVTTTDKLF
QSLQMDTERQDPATKEALQYRTALFAGYESLTSRPLCTQTAIMVCNAIKHPYEMAIRKTGGTALKGGNSGNVVYTPPEGE
ETIRGKLANWERFIHESGDLDPLIIMAAAHYQFEAIHPFTDGNGRTGRILNSLLLIEKGLLDLPILYLSRYIIENRADYY
RLLLGVTERQDWESWIIYILDGVADTADWTVSKIDAIRRLFEQTRQHIRTHAQGIYTHELVNLLFEQPYTRIANLEAAGI
AKRQTASKYLKELSDIGVLQEIVIGRDKLFIHPRLMELLRGEGNSFTSF
>Q9F2U7 ~~~~~~UPF0234 protein SCO4614~~~COG1666
MADSSFDIVSKVERQEVDNALNQAAKEISQRYDFKGVGASISWSGEKILMEANSEDRVTAVLDVFQSKLIKRGISLKALD
AGEPQLSGKEYKIFASIEEGISQENAKKVAKLIRDEGPKGVKAQVQGEELRVSSKSRDDLQTVISLLKGQDFDFALQFVN
YR
>Q57144 ~~~~~~Uncharacterized protein HI_0461~~~COG2990
MSTKNHFIFPTYVQMYPYSKDRPFLKQVREKLRYYGYKWLYQKQCSQLVDFLNTETQWQSLFTQDYYRTNTILTTFCDKR
FSASERLTAITENLRLAEEKMGRSLCQQLLDQQHIVLTQLTEDLRLSLSINHIDPFEGYFSINIRNQNNERVYDSSFTFL
SPNKLLIASIQGPSSDNAQELVKQATKALHGMRPMFMLVNGFKMLAEKWQCELVGIPHKAQGKYRLSARSKILFNYDEFW
QENQGEYRHNYWQLPLHIERKQLEDIASKKRSMYRKRYEMLDQMALDIQQL
>I6X961 ~~~~~~Protein Rv0461~~~
MVCWLRSRWRPVADNDYRSAPGTEPFVPDFDTGAHSQRFLSLAGQQDRAGKSWPGSTPKPQEDPVGVAPSASVEVLGSEP
AATLAHSVTVPGRYTYLKWWKFVLVVLGVWIGAGEVGLSLFYWWYHTLDKTAAVFVVLVYVVACTVGGLILALVPGRPLI
TALSLGVMSGPFASVAAAAPLYGYYYCERMSHCLVGVIPY
>P42810 ~~~~~~TPR repeat-containing protein PA4667~~~
MMAPPSQVQGRLSSSMNKSLALLTVTLLLGGCQSLIHKTPDGTPPVEDTAVETKAKPEKYGSFSEDSLYSLLVAELAGQR
NRFDIALSNYVVQAQKTRDPGVSERAFRIAEYLGADQEALDTSLLWARSAPDNLDAQRAAAIQLARAGRYEESMVYMEKV
LNGQGDTHFDFLALSAAETDPDTRAGLLQSFDHLLKKYPNNGQLLFGKALLLQQDGRPDEALTLLEDNSASRHEVAPLLL
RSRLLQSMKRSDEALPLLKAGIKEHPDDKRVRLAYARLLVEQNRLDDAKAEFAGLVQQFPDDDDLRFSLALVCLEAQAWD
EARIYLEELVERDSHVDAAHFNLGRLAEEQKDTARALDEYAQVGPGNDFLPAQLRQTDVLLKAGRVDEAAQRLDKARSEQ
PDYAIQLYLIEAEALSNNDQQEKAWQAIQEGLKQYPEDLNLLYTRSMLAEKRNDLAQMEKDLRFVIAREPDNAMALNALG
YTLADRTTRYGEARELILKAHKLNPDDPAILDSMGWINYRQGKLADAERYLRQALQRYPDHEVAAHLGEVLWAQGRQGDA
RAIWREYLDKQPDSDVLRRTIKRLTGAETP
>Q9KUP8 ~~~~~~UPF0301 protein VC_0467~~~COG1678
MNLTNHFLVAMPSMKDPYFKRSVIYICEHNQDGAMGLMINAPIDITVGGMLKQVDIEPAYPQSHQENLKKPVFNGGPVSE
DRGFILHRPRDHYESSMKMTDDIAVTTSKDILTVLGTEAEPEGYIVALGYSGWSAGQLEVELTENSWLTIEADPELIFNT
PVHEKWQKAIQKLGISPAQLSSDAGHA
>A0R1B5 ~~~~~~Uncharacterized protein MSMEG_4692/MSMEI_4575~~~COG1512
MASGDIATVANAELDLPYGSALTSSGRISAVTEPGELSVHYPFPTMDLVVLDDALKYGSRAAKARFAVYIGPLGADTAAT
AREILANVPTPENAVLLAVSPDQRAIEVVYGADVKGRGIESAAPLGVSAAAASFKEGNLIDGLISAVRVMSAGVSPA
>P9WMD9 ~~~~~~Uncharacterized HTH-type transcriptional regulator Rv0472c~~~COG1309
MAERIPAVTVKTDGRKRRWHQHKVERRNELVDGTIEAIRRHGRFLSMDEIAAEIGVSKTVLYRYFVDKNDLTTAVMMRFT
QTTLIPNMIAALSADMDGFELTREIIRVYVETVAAQPEPYRFVMANSSASKSKVIADSERIIARMLAVMLRRRMQEAGMD
TGGVEPWAYLIVGGVQLATHSWMSDPRMSSDELIDYLTMLSWSALCGIVEAGGSLEKFREQPHPSPIVPAWGQV
>Q9HV61 ~~~~~~UPF0337 protein PA4738~~~
MNSDVIKGKWKQLTGKIKERWGDLTDDDLQAADGHAEYLVGKLQERYGWSKERAEQEVRDFSDRL
>P9WMH9 ~~~~~~Uncharacterized HTH-type transcriptional regulator Rv0474~~~COG1396
MSSEEKLAAKVSTKASDVASDIGSFIRSQRETAHVSMRQLAERSGVSNPYLSQVERGLRKPSADVLSQIAKALRVSAEVL
YVRAGILEPSETSQVRDAIITDTAITERQKQILLDIYASFTHQNEATREECPSDPTPTDD
>P9WKV9 ~~~~~~Uncharacterized protein Rv0477~~~
MKALVAVSAVAVVALLGVSSAQADPEADPGAGEANYGGPPSSPRLVDHTEWAQWGSLPSLRVYPSQVGRTASRRLGMAAA
DAAWAEVLALSPEADTAGMRAQFICHWQYAEIRQPGKPSWNLEPWRPVVDDSEMLASGCNPGSPEESF
>P9WKV7 ~~~~~~Uncharacterized protein Rv0479c~~~
MTNPQGPPNDPSPWARPGDQGPLARPPASSEASTGRLRPGEPAGHIQEPVSPPTQPEQQPQTEHLAASHAHTRRSGRQAA
HQAWDPTGLLAAQEEEPAAVKTKRRARRDPLTVFLVLIIVFSLVLAGLIGGELYARHVANSKVAQAVACVVKDQATASFG
VAPLLLWQVATRHFTNISVETAGNQIRDAKGMQIKLTIQNVRLKNTPNSRGTIGALDATITWSSEGIKESVQNAIPILGA
FVTSSVVTHPADGTVELKGLLNNITAKPIVAGKGLELQIINFNTLGFSLPKETVQSTLNEFTSSLTKNYPLGIHADSVQV
TSTGVVSRFSTRDAAIPTGIQNPCFSHI
>P9WJ01 3.5.-.-~~~~~~Hydrolase Rv0480c~~~COG0388
MRIALAQIRSGTDPAANLQLVGKYAGEAATAGAQLVVFPEATMCRLGVPLRQVAEPVDGPWANGVRRIATEAGITVIAGM
FTPTGDGRVTNTLIAAGPGTPNQPDAHYHKIHLYDAFGFTESRTVAPGREPVVVVVDGVRVGLTVCYDIRFPALYTELAR
RGAQLIAVCASWGSGPGKLEQWTLLARARALDSMSYVAAAGQADPGDARTGVGASSAAPTGVGGSLVASPLGEVVVSAGT
QPQLLVADIDVDNVAAARDRIAVLRNQTDFVQIDKAQSRG
>P9WKV5 ~~~~~~Uncharacterized protein Rv0481c~~~COG3832
MPRSFDMSADYEGSVEEVHRAFYEADYWKARLAETPVDVATLESIRVGGDSGDDGTIEVVTLQMVRSHNLPGLVTQLHRG
DLSVRREETWGPVKEGIATASIAGSIVDAPVNLWGTAVLSPIPESGGSRMTLQVTIQVRIPFIGGKLERLIGTQLSQLVT
IEQRFTTLWITNNV
>Q97SA4 ~~~~~~UPF0397 protein SP_0482~~~COG4720
MEIKFTIKQVVAVGIGAALFVVIGMINIPTPVPNTSIQLQYAVQALLSIIFGPIIGLLVGLIGHAIKDSLVGYGLWWTWI
IASGLFGLVVGLFRKYVRVINGVFDWKDILIFNLIQLLANALVWGVLAPLGDVVIYQEAAEKVFAQGIVAGIANGVSVAI
AGTLLLLAYAGTQTRAGSLKKD
>P9WGR5 1.-.-.-~~~~~~Uncharacterized oxidoreductase Rv0484c~~~COG4221
MTTIGTRKRVAVVTGASSGIGEATARTLAAQGFHVVAVARRADRITALANQIGGTAIVADVTDDAAVEALARALSRVDVL
VNNAGGAKGLQFVADADLEHWRWMWDTNVLGTLRVTRALLPKLIDSGDGLIVTVTSIAAIEVYDGGAGYTAAKHAQGALH
RTLRGELLGKPVRLTEIAPGAVETEFSLVRFDGDQQRADAVYAGMTPLVAADVAEVIGFVATRPSHVNLDQIVIRPRDQA
SASRRATHPVR
>P9WKV1 ~~~~~~Transcriptional regulator Rv0485~~~COG1940
MYSTNRTSQSLSRKPGRKHQLRSHRYVMPPSLHLSDSAAASVFRAVRLRGPVGRDVIAGSTSLSIATVNRQVIALLEAGL
LRERADLAVSGAIGRPRVPVEVNHEPFVTLGIHIGARTTSIVATDLFGRTLDTVETPTPRNAAGAALTSLADSADRYLQR
WRRRRALWVGVTLGGAVDSATGHVDHPRLGWRQAPVGPVLADALGLPVSVASHVDAMAGAELMLGMRRFAPSSSTSLYVY
ARETVGYALMIGGRVHCPASGPGTIAPLPVHSEMLGGTGQLESTVSDEAVLAAARRLRIIPGIASRTRTGGSATAITDLL
RVARAGNQQAKELLAERARVLGGAVALLRDLLNPDEVVVGGQAFTEYPEAMEQVEAAFTAGSVLAPRDIRVTVFGNRVQE
AGAGIVSLSGLYADPLGALRRSGALDARLQDTAPEALA
>P9WKU9 ~~~~~~Uncharacterized protein Rv0487~~~
MTSSLPTVQRVIQNALEVSQLKYSQHPRPGGAPPALIVELPGERKLKINTILSVGEHSVRVEAFVCRKPDENREDVYRFL
LRRNRRLYGVAYTLDNVGDIYLVGQMALSAVDADEVDRVLGQVLEVVDSDFNALLELGFRSSIQREWQWRLSRGESLQNL
QAFAHLRPTTMQSAQRDEKELGG
>P9WMV7 1.1.-.-~~~~~~Uncharacterized GMC-type oxidoreductase Rv0492c~~~COG2303
MSRLADRAKSYPLASFGAALLPPELGGPLPAQFVQRVDRYVTRLPATSRFAVRAGLASLAAASYLTTGRSLPRLHPDERA
RVLHRIAALSPEVAAAVEGLKAIVLLANGADTYAHELLARAQEHDAARPDAELTVILSADSPSVTRADAVVVGSGAGGAM
VARTLARAGLDVVVLEEGRRWTVEEFRSTHPVDRYAGLYRGAGATVALGRPAVVLPMGRAVGGTTVVNSGTCFRPSLAVQ
RRWRDEFGLGLADPDQLGRRLDDAEQTLRVAPVPLEIMGRNGRLLLQAAKSLGWRAAPIPRNAPGCRGCCQCAIGCPSNA
KFGVHLNALPQACAAGARIISWARVERILHRAGRAYGVRARRPDGTTLDVLADAVVVAAGATETPGLLRRSGLGGHPRLG
HNLALHPATMLAGLFDDDVFAWRGVLQSAAVHEFHESDGVLIEATSTPPGMGSMVFPGYGAELLRWLDRAPQIATFGAMV
ADRGVGTVRSVRGETVVRYDIAPGEIAKLRVALQAIGRLLFAAGAVEVLTGIPGAPPMRSLPELQDVLRRANPRSLHLAA
FHPTGTAAAGADEQLCPVDATGRLRGVEGVWVADASILPSCPEVNPQLSIMAMALAVADQTVAKVVGVR
>P9WKU7 ~~~~~~Uncharacterized protein Rv0493c~~~
MGESTTQPAGGAAVDDETRSAALPRWRGAAGRLEVWYATLSDPLTRTGLWVHCETVAPTTGGPYAHGWVTWFPPDAPPGT
ERFGPQPAQPAAGPAWFDIAGVRMAPAELTGRTRSLAWELSWKDTAAPLWTFPRVAWERELLPGAQVVIAPTAVFAGSLA
VGETTHRVDSWRGSVAHIYGHGNAKRWGWIHADLGDGDVLEVVTAVSHKPGLRRLAPLAFVRFRIDGKDWPASPLPSLRM
RTTLGVRHWQLEGRIGGREALIRVDQPPERCVSLGYTDPDGAKAVCTNTEQADIHIELGGRHWSVLGTGHAEVGLRGTAA
PAIKEGTPA
>P9WMG7 ~~~~~~Uncharacterized HTH-type transcriptional regulator Rv0494~~~COG2186
MVEPMNQSSVFQPPDRQRVDERIATTIADAILDGVFPPGSTLPPERDLAERLGVNRTSLRQGLARLQQMGLIEVRHGSGS
VVRDPEGLTHPAVVEALVRKLGPDFLVELLEIRAALGPLIGRLAAARSTPEDAEALCAALEVVQQADTAAARQAADLAYF
RVLIHSTRNRALGLLYRWVEHAFGGREHALTGAYDDADPVLTDLRAINGAVLAGDPAAAAATVEAYLNASALRMVKSYRD
RA
>O25237 ~~~~~~Uncharacterized protein HP_0495~~~
MPSDSKKPTIIYPCLWDYRVIMTTKDTSTLKELLETYQRPFKLEFKNTSKNAKFYSFNVSMEVSNESERNEIFQKISQLD
KVVQTL
>P9WKU5 ~~~~~~Uncharacterized protein Rv0495c~~~
MWRPAQGARWHVPAVLGYGGIPRRASWSNVESVANSRRRPVHPGQEVELDFAREWVEFYDPDNPEHLIAADLTWLLSRWA
CVFGTPACQGTVAGRPNDGCCSHGAFLSDDDDRTRLADAVHKLTDDDWQFRAKGLRRKGYLELDEHDGQPQHRTRKHKGA
CIFLNRPGFAGGAGCALHSKALKLGVPPLTMKPDVCWQLPIRRSQEWVTRPDGTEILKTTLTEYDRRGWGSGGADLHWYC
TGDPAAHVGTKQVWQSLADELTELLGEKAYGELAAMCKRRSQLGLIAVHPATRAAQ
>P9WKU3 ~~~~~~Uncharacterized protein Rv0497~~~
MTGPHPETESSGNRQISVAELLARQGVTGAPARRRRRRRGDSDAITVAELTGEIPIIRDDHHHAGPDAHASQSPAANGRV
QVGEAAPQSPAEPVAEQVAEEPTRTVYWSQPEPRWPKSPPQDRRESGPELSEYPRPLRHTHSDRAPAGPPSGAEHMSPDP
VEHYPDLWVDVLDTEVGEAEAETEVREAQPGRGERHAAAAAAGTDVEGDGAAEARVARRALDVVPTLWRGALVVLQSILA
VAFGAGLFIAFDQLWRWNSIVALVLSVMVILGLVVSVRAVRKTEDIASTLIAVAVGALITLGPLALLQSG
>P9WKU1 ~~~~~~Uncharacterized protein Rv0498~~~COG1082
MRPAIKVGLSTASVYPLRAEAAFEYADRLGYDGVELMVWGESVSQDIDAVRKLSRRYRVPVLSVHAPCLLISQRVWGANP
ILKLDRSVRAAEQLGAQTVVVHPPFRWQRRYAEGFSDQVAALEAASTVMVAVENMFPFRADRFFGAGQSRERMRKRGGGP
GPAISAFAPSYDPLDGNHAHYTLDLSHTATAGTDSLDMARRMGPGLVHLHLCDGSGLPADEHLVPGRGTQPTAEVCQMLA
GSGFVGHVVLEVSTSSARSANERESMLAESLQFARTHLLR
>Q9HUH4 1.-.-.-~~~~~~Probable FAD-dependent oxidoreductase PA4991~~~
MPQALSTDILIVGGGIAGLWLNARLRRAGYATVLVESASLGGGQSVKSQGIIHGGAKYALHGALTGASEAIADMPRRWRA
CLGSDGELDLRGVRLLSEAHYLWSPGGLAGSLTSFFASKAVRSRVEQAKGEDLPPALRDKGFKGKAYRLTEIVFDVPDLI
RRLAELAGDSLLAGERIEPLREGRELAGLCVDGREIRAQRVVLSAGAGNEALLRELGLEQPAMQRRPLHMVMVKAATLKP
LYAHCLGAGPKPRITVTTHPTRDGQSVWYLGGDIAETDGVARDEAAQIAEARRELAKLLPWIDLGQAQWATLRVDRAEPA
QSNLLRPDNAFLAEQGRLLVGWPTKLALAPDFADRVCARLEEDGIRPSEHAALPQLPRPPLAEPAWEVAFA
>P9WKT9 ~~~~~~Uncharacterized protein Rv0499~~~COG2050
MNALFTTAMALRPLDSDPGNPACRVFEGELNEHWTIGPKVHGGAMVALCANAARTAYGAAGQQPMRQPVAVSASFLWAPD
PGTMRLVTSIRKRGRRISVADVELTQGGRTAVHAVVTLGEPEHFLPGVDGSGGASGTAPLLSANPVVELMAPEPPEGVVP
IGPGHQLAGLVHLGEGCDVRPVLSTLRSATDGRPPVIQLWARPRGVAPDALFALLCGDLSAPVTFAVDRTGWAPTVALTA
YLRALPADGWLRVLCTCVEIGQDWFDEDHIVVDRLGRIVVQTRQLAMVPAQ
>P9WKT7 ~~~~~~Putative DNA-binding protein Rv0500A~~~COG3311
MTSTNGPSARDTGFVEGQQAKTQLLTVAEVAALMRVSKMTVYRLVHNGELPAVRVGRSFRVHAKAVHDMLETSYFDAG
>P9WKT3 ~~~~~~Uncharacterized protein Rv0501~~~COG0451
MSSSNGRGGAGGVGGSSEHPQYPKVVLVTGACRFLGGYLTARLAQNPLINRVIAVDAIAPSKDMLRRMGRAEFVRADIRN
PFIAKVIRNGEVDTVVHAAAASYAPRSGGSAALKELNVMGAMQLFAACQKAPSVRRVVLKSTSEVYGSSPHDPVMFTEDS
SSRRPFSQGFPKDSLDIEGYVRALGRRRPDIAVTILRLANMIGPAMDTTLSRYLAGPLVPTIFGRDARLQLLHEQDALGA
LERAAMAGKAGTFNIGADGILMLSQAIRRAGRIPVPVPGFGVWALDSLRRANHYTELNREQFAYLSYGRVMDTTRMRVEL
GYQPKWTTVEAFDDYFRGRGLTPIIDPHRVRSWEGRAVGLAQRWGSRNPIPWSGLR
>Q55487 2.4.-.-~~~~~~Uncharacterized glycosyltransferase sll0501~~~COG0463
MTIELSIVIPMYNEEDNLEHLFARLLEVLTPLKITYEIICVNDGSKDKTLKQLIDCYQSNRQIKIVNLSRNFGKEIALSA
GIDYAQGNAVIPIDADLQDPPELIHELVDKWREGYDIVYATRRSRQGETWVKQFTAKMFYKVIGRMTEIKIPPNTGDFRL
MDRKVVNAIKQLPERTRFMKGLFAWVGYRQTFVLFDREPRFQGQTKWNYWKLWNFALDGIFSFSLLPLKVWTYLGSIISL
LSLAYASFLILKTITLGVDVPGYASLMVAILFLGGVQLISLGVIGEYLGRVYEEVKARPLYLVSDLWGLEYLPLEKLN
>P9WKT1 ~~~~~~Uncharacterized protein Rv0502~~~COG0204
MGNVAGETRANVIPLHTNRSRVAARRRAGQRAESRQHPSLLSDPNDRASAEQIAAVVREIDEHRRAAGATTSSTEATPND
LAQLVAAVAGFLRQRLTGDYSVDEFGFDPHFNSAIVRPLLRFFFKSWFRVEVSGVENIPRDGAALVVANHAGVLPFDGLM
LSVAVHDEHPAHRDLRLLAADMVFDLPVIGEAARKAGHTMACTTDAHRLLASGELTAVFPEGYKGLGKRFEDRYRLQRFG
RGGFVSAALRTKAPIVPCSIIGSEEIYPMLTDVKLLARLFGLPYFPITPLFPLAGPVGLVPLPSKWRIAFGEPICTADYA
STDADDPMVTFELTDQVRETIQQTLYRLLAGRRNIFFG
>P9WFK3 ~~~~~~UPF0336 protein Rv0504c~~~COG2030
MTVPEEAQTLIGKHYRAPDHFLVGREKIREFAVAVKDDHPTHYSEPDAAAAGYPALVAPLTFLAIAGRRVQLEIFTKFNI
PINIARVFHRDQKFRFHRPILANDKLYFDTYLDSVIESHGTVLAEIRSEVTDAEGKPVVTSVVTMLGEAAHHEADADATV
AAIASI
>A0R2D5 2.1.1.-~~~~~~Putative O-methyltransferase MSMEG_5073/MSMEI_4947~~~COG4122
MASSAEAIVTHAERSISEDAIVAAARERADDIGAGAVTPAVGALLSVLARLTGGRAVVEVGTGAGVSGLWLLSGMRDDGV
LTTIDVEPEHQRIAKQGFSEAGVGPGRTRLISGRAQEVLTRLADESYDLVFIDADPVDQPQFVVEGVRLLRSGGAIVVHR
AALGGRAGDADARDAEVTAVREAARLIAEDERLTPVLIPLGDGLLAAVRD
>P9WKS9 ~~~~~~Uncharacterized protein Rv0508~~~COG0695
MSRPQVELLTRAGCAICVRVAEQLAELSSELGFDMMTIDVDVAASTGNPGLRAEFGDRLPVVLLDGREHSYWEVDEHRLR
ADIARSTFGSPPDKRLP
>Q5XD68 ~~~~~~Uncharacterized protein M6_Spy0510~~~
MEKKEKSMNKSFKNLVIGAVSGVAAAYFLSTEKGKALKNRAEKAYQAYKESPDDYHQFAKEKGSEYSHLARDTFYDVKDK
LASGDLTKEDMLDLLKDKTTAFVQKTKETFAEVEAKEKQDDVIIDLNEDDIIIDYTEQDEPVSDTLDKH
>Q7A788 ~~~~~~Uncharacterized epimerase/dehydratase SA0511~~~
MKKIMITGALGQIGTELVVKCREIYGTDNVLATDIREPEADSPVQNGPFEILDVTDRDRMFELVRDFEADSLMHMAALLS
ATAEKNPILAWDLNMGGLMNALEAARTYNLHFFTPSSIGAFGDSTPKVNTPQVTIQQPTTMYGVNKVAGELLCQYYFKRF
GVDTRSVRFPGLISHVKEPGGGTTDYAVEIYFKAVREGHYTSFIDKGTYMDMMYMDDAIEAIIKLMEADDAKLETRNGYN
LSAMSFDPEMVKEAIQEYYPNFTLDYDVDPIRQGIANSWPDSIDTSCSRGEWGFDPKYDLASMTKLMLEAIEQKDTVKNN
N
>P44744 ~~~~~~Uncharacterized protein HI_0521~~~COG1328
MLASLQDILDTVKANNLTYHQKLMTLGNIAERLFDPRDLLGYTDEEWGFLQNQMICDLCEGYAIYRPRYILPDYNVYIQK
GCEFLELPPPKDLDEALDGLLILYSHVPSITTYPVYIGRLDVLLEPFITDEEKDYIKIKRFLNHIDKTVPDSFCHANIGP
YDTKAGRLILRAVIDLEAPTPNMTIRYDKSKTSREFAELAAKACLLVSKPSFANDAYYISDLGEEYGVASCYNALPECGG
AYTLTRLRLGTIARTCKSADEMLNELLPRVAKCALSTMDKRHKFVVEESNFFNSSFLEKEGFIKRTNFTGMFAIVGLADA
TNHLLQCEGLNETFGKSVRGDEIATAIMDKLKEITDAHEGVYAERTGNRYLLHAQVGASNHEEDKRNAPAHRIRVGEEPT
LLAHLKQSAPFHKYFPSGTGDLFAFDQTYVDHCDAVVDIIDGAFSLGYRYITTYLKNTDLIRVTGYLVKKSEVEKYRKGE
VALRDTTWYGSGTDECANVFDRQLRDEKDVIAEK
>O06391 ~~~~~~Uncharacterized protein Rv0525~~~COG0406
MPEETQVHVVRHGEVHNPTGILYGRLPGFHLSATGAAQAAAVADALADRDIVAVIASPLQRAQETAAPIAARHDLAVETD
PDLIESANFFEGRRVGPGDGAWRDPRVWWQLRNPFTPSWGEPYVDIAARMTTAVDKARVRGAGHEVVCVSHQLPVWTLRL
YLTGKRLWHDPRRRDCALASVTSLIYDGDRLVDVVYSQPAAL
>P44746 ~~~~~~Uncharacterized ferredoxin-like protein HI_0527~~~COG1145
MALLITSKCTNCDMCLPECPNEAISIGDEIYVIDPILCTECVGHYDTPTCQKVCPITNCIKPDPEHQETEEQLWERFVMI
HHSDKL
>A0R316 ~~~~~~Seven-bladed beta-propeller protein MSMEG_5308~~~COG3391
MTMAKNILRAITAIVRRDAKTPDTDDSAGSVTFDGALDDLDVAGLVEVGRGPIADIAIDADRETIVVTNSAADCLTVINP
YTLAPVGSVRLNGEPFAVAAADDRAYVSVVTAGHDAIKVVDTITGSVLAEYPLAMTVTALAMSPDGKRVFVGRSGHDRID
VAVIDTAAERVGTIDLASGAGAGVDALRVDASGKRLYVATTDPRGSRMVTVNIETAQIESTVWIGAPIRDLALGADGKAL
VLTSDRQRRGVVHIVDLSTAAVVGAIQIGGAPTQLVLSPDATRAYVVDYDRVIVLCTLTNEILGSIDVGMQPAAVAVRRD
GARVYVADYSGQVNAFDVAAELPALYSRLVASEPRQDATTVLPSLQTA
>Q5XD45 3.1.3.-~~~~~~Putative phosphatase M6_Spy0533~~~
MSIKLVAVDIDGTLLTDDRRITDDVFQAVQEAKAQGVHVVIATGRPIAGVISLLEQLELNHKGNHVITFNGGLVQDAETG
EEIVKELMTYDDYLETEFLSRKLGVHMHAITKEGIYTANRNIGKYTVHESTLVNMPIFYRTPEEMTNKEIIKMMMIDEPD
LLDAAIKQIPQHFFDKYTIVKSTPFYLEFMPKTVSKGNAIKHLAKKLGLDMSQTMAIGDAENDRAMLEVVANPVVMENGV
PELKKIAKYITKSNNDSGVAHAIRKWVLN
>Q87UC6 4.2.1.-~~~~~~Putative hydro-lyase PSPTO_5379~~~COG4336
MNAFDRARQSAIAAAREARGTYRNGLVTPTAGVAPGMTQANLIALPRDWAYDFLLYAQRNPKACPILDVSDAGSPTTLLA
EGSDLRTDIPMYRIWRDGKLAEEVSDATQAWAEHDDMVAFLIGCSFTFETPLQEAGIEVRHITDGCNVPMYRTNRACRPA
GRLHGEMVVSMRPIPADRVAEASAISGRYPSVHGAPVHIGEPGRLGINDLSRPDFGDAVSIKPGEVPVFWACGVTPQAAV
MASGVPFAITHSPGYMFITDVPDSTYHV
>P9WMY1 2.4.-.-~~~~~~Uncharacterized glycosyltransferase Rv0539~~~COG1215
MAGDAVTVVLPCLNEEESLPAVLAAIPAGYRALVVDNNSTDDTATVAARHGAQVVVEPRPGYGSAVHAGVLAATTPIVAV
IDADGSMDAGDLPKLVAELDKGADLVTGRRRPVAGLHWPWVARVGTVVMSWRLRTRHRLPVHDIAPMRVARREALLDLGV
VDRRSGYPLELLVRAAAAGWRVVELDVSYGPRTGGKSKVSGSLRGSIIAILDFWKVIS
>Q5SKV3 3.1.-.-~~~~~~Uncharacterized PIN and TRAM-domain containing protein TTHA0540~~~COG4956
MKTRHLVYLAFALLGLGLAGLLEDWGLLPQSPSLLSLNRLYLALAGLLTGLLLGPRLEGALEARLKRLRSLPPEVVVATT
LGSTIGLLLAVLLTTLLAQVPGFSPVHSLLLALGLVALFVYLALGYRAYFRLPEPKPAPRGGKVLDTSVLVDGRVAEVAA
VGFLEGPLWVPHFVLKELQHFADSQDPLRRAKGRRGLETLERLREAAPLEVLETTPKGESVDEKLLFLARDLEAALVTND
HALLQMARIYGVKALSIQALAQALRPQLQVGDTLKLLILKEGKEPHQGVGYLEDGSMVVVDGGSRYRGQEIEVVVTQAIQ
TQVGRLFFARPAQGAQ
>A0R3D6 6.2.1.-~~~~~~Putative ligase MSMEG_5435/MSMEI_5285~~~COG0318
MSRFTEKMYRNARTVTTGMVTGEPHEPVRHTWGEVHERARRIAGGLAAAGIGHGDAVGVLAGFPVEIAPTAQGVWMRGAS
LTMLHQPTPRTDLAVWAEDTMNVIGMIELKAVIISEPFLVATPVLEEKGILVLKVADLLAADPIDPVETGEDDLALMQLT
SGSTGSPKAVQITHRNIHSNAEAMFVGAKYDVEKDVMVSWLPCFHDMGMVGFLTIPMYFGAELVKVTPMDFLRDTLLWAK
LIDKYKGTMTAAPNFAYALLAKRLRRQAKPGDFDLSTLRFALSGAEPVEPADVEDFLDAGKPFGLRPEAILPAYGMAETT
LAVSFSPCGEGLVVDEVDADLLAALRRAVPATKGNTKRLASLGPLLNDLEARVVDENGEVMPARGVGVIELRGEPVTPGY
ITMGGFVPAQDEYGWYDTGDLGYITEEGNIVVCGRVKDVIIMAGRNIYPTDIERAAGRVEGVRPGCAVAVRLDAGHSRET
FAVAVESNAWQDPAEVRRIEHQVAHEVVSEVDVRPRNVVVLGPGSIPKTPSGKLRRANSVSLVTQ
>P75224 ~~~~~~Uncharacterized protein MG376 homolog~~~
MLNRVFLEGEIESSCWSVKKTGFLVTIKQMRFFGERLFTDYYVIYANGQLAYELEKHTKKYKTISIEGILRTYLERKSEI
WKTTIEIVKIFNPKNEIVIDYKEI
>Q5XD24 3.-.-.-~~~~~~Probable metallo-hydrolase M6_Spy0554~~~
MPFIFRYSFFNKALIFWYTILMKIYKTINHIAGENTYYLVNDQAVILIDPGSNGQEIISKIKSFEKPLVAILLTHTHYDH
IFSLDLVRDAFDHPPVYVSEKEAAWLSSPDDNLSGLGRHDDIINVIARPAENFFKLKQPYQLNGFEFTVLPTPGHSWGGV
SFVFHSDELVVTGDALFRETIGRTDLPTSNFEDLITGIRQELFTLPNHYRVYPGHGPSTTICHEKNANPFFH
>P75223 ~~~~~~Uncharacterized protein MG377 homolog~~~
MATNLKSTAKLVKPIQYDEVIEVERIFADPAFIEQHRQRILASFKDAKESALYHELTHIVIKDNLFSCAMNAIVGYFEFN
IDEAELKNVMEGLKRDVIQGAEDNTVQAIAEKIIKKALVFNHLQKEWKVEITDEVVKNVISLYYEKTNQSVREYLDDKQK
FEGVRTALLEERMVLETINHFKFHFNLTGQLPN
>P9WKL3 ~~~~~~Uncharacterized protein Rv0559c~~~
MKGTKLAVVVGMTVAAVSLAAPAQADDYDAPFNNTIHRFGIYGPQDYNAWLAKISCERLSRGVDGDAYKSATFLQRNLPR
GTTQGQAFQFLGAAIDHYCPEHVGVLQRAGTR
>P9WFK9 ~~~~~~UPF0234 protein Rv0566c~~~COG1666
MADSSFDIVSKVDRQEVDNALNQAAKELATRFDFRGTDTKIAWKGDEAVELTSSTEERVKAAVDVFKEKLIRRDISLKAF
EAGEPQASGKTYKVTGALKQGISSENAKKITKLIRDAGPKNVKTQIQGDEVRVTSKKRDDLQAVIAMLKKADLDVALQFV
NYR
>P0DMM4 ~~~~~~Uncharacterized protein Rv0572A~~~
MAADPQCTRCKQTIEPGWLYITAHRRGQAGIVDDGAVLIHVPGECPHPGEHVPRS
>A0R4D0 ~~~~~~Uncharacterized protein MSMEG_5790/MSMEI_5637~~~
MPAGVDLEKETVITGRVVDGSGQAVGGAFVRLLDGSDEFTAEVVASATGDFRFFAAPGTWTVRALSSAGNGNVTVAPTGA
GIHEVDVKVA
>A0R4H4 1.3.99.-~~~~~~KsdD-like steroid dehydrogenase MSMEG_5835~~~COG3573
MADADVIVVGAGLAGLVAACELVERGHSVIIVDQENAANIGGQAFWSFGGLFFVNSPEQRRLGIRDSQELALQDWLGTAG
FDRPEDHWPREWAHAYVDFAAGEKRSWLRARGLQTFPLVGWAERGGYDALGHGNSVPRFHITWGTGPALVEIFARRIRDS
VRVRFAHRHRVDELIVNAGLVAGVRGSILEPSNAPRGVASSRKVVGDFEFRASAVIVASGGIGGNLELVRKNWPARLGRV
PDQLISGVPAHVDGRMIGIAESAGAHVINNDRMWHYTEGITNYDPVWPNHGIRILPGPSSLWLDANGDRLPVPLYPGYDT
LGTLEHICRSGQDYTWFILNARIIAKEFALSGQEQNPDLTSRNVRDLLSRVKPGAPAPVQAFVDHGVDFVSATSLRDLVA
GMNDLPDVVPLDYAKVAAEVTARDREVANRFTKDGQITAIRAARNYLGDRFTRVVAPHRLTDPKAGPLIAVKLHILTRKT
LGGLETDLDSRVLKEDGTTFGGLYAAGEAAGFGGGGVHGYRSLEGTFLGGCIFSGRAAGRGAAADIA
>P68591 ~~~~~~UPF0340 protein TTHA0583~~~COG4475
MEGIRRAAQRAAEEFLQAFPMAPGSLFVLGGSTSEVLGERVGTRPSLEAAHAVLEGLLPPLLERGVHVAVQACEHLNRAL
VVERETARAFGLEEVAVFPHPKAGGALATAAFLRFRDPVMVESLKAQAHGGMDIGGVLIGMHLRPVAVPLRLSVRKIGEA
VLLAAKTRPKLVGGARAVYTREEMLKKLEEFLPKPP
>Q57051 3.-.-.-~~~~~~Uncharacterized hydrolase HI_0588~~~COG0624
MSINLNRVQNLIEKLAFISSVPNELTRLAFTEEDEKAHNMIIELCKEYDLSIRRDSIGNLFIRKAGKEDFLPAVAFGSHI
DTVVNAGKFDGPLGSVAGLEILLQLCEQNIQTRYPLELIIFTCEESSRFNFATLGSKVMCGIVNQEKLSSLRDKQGKGLS
EAMAEVGMNFNLVNQAKRDAKEFKCFFELHIEQGPRLENEGKTIGVVTGIAAPIRAIVKIKGQADHSGATAMHYRHDALL
GGSELSLAIERAAIQAGHSTVATVGNITAKPGVMNVVPGYCELLVDIRGTHVQARDSVFELLQEEISKVSEKRGLLIELQ
LISKDNPIILPENMVNQIAETAHSLGYSYEIMPSGAGHDAMHMATLCPTGMIFIPSHLGISHNPLEFTDWKDIEAGIKVL
QKVILEQAEVC
>A0R518 1.1.1.-~~~~~~Putative short-chain type dehydrogenase/reductase MSMEG_6031/MSMEI_5872~~~COG1028
MAGRVEGKVAFITGAARGQGRSHAVRLAEEGADIIAVDVCRRISSNEDIPASTPEDLAETVELVKGLNRRIVAEEVDVRD
YDALKAVVDSGVEQLGGLDIVVANAGIGNGGATLDKTSEADWDDMIGVNLSGVWKTVKAAVPHLISGGNGGSIILTSSVG
GLKAYPHTGHYIAAKHGVVGLMRTFAVELGQHSIRVNSVHPTNVNTPLFMNEGTMKLFRPDLENPGPDDMAVVAQMMHVL
PVGWVEPRDISNAVLFLASDEARYVTGLPMTVDAGSMLK
>P44773 ~~~~~~Uncharacterized protein HI_0603~~~COG2959
MAKEQPNDLTEQLTDTPKTAVEQAETMQSVPQTIVKKTGTALSLLAILVALGIGGAGYYFGQQQMAKIQQKLTALENQTG
ANLSSNNTNNNKRLTQLEQSLKTAQENIAQLEQLIVSKTGEITSLQTQMKQVSQLAIAQQPSDWLFSEADFLLNNALRKL
VLDNDVDTAVSLLKLADETLVKVNNSQANEIRSAINQDLKQLLSLSSVDQNAIMQKLSQLANTVDELQRL
>P0DN33 ~~~~~~Uncharacterized protein Rv0609B~~~
MTDEKCVRCGGDQLVEGAVVWNAPLRFKREGAGHFNRGTQVNAVACETCGHIDLYLESRARGSTK
>Q8KES1 ~~~~~~UPF0758 protein CT0611~~~COG2003
MRIHDIDPDNRPRERFLRSGKESLSPAELLALILRSGTAGLNIIDTCNKLISEHGLERLADLSIQELQKTPGIGEAKAMQ
IAAIFELQRRLHFARNMNLKVKGARDVFEYMKGRIPDETKEHLFVLFLSTKNQILRHETITIGTLTASLIHPREIFKAAI
RESAHSIILVHNHPSGDVQPSNADKQVTSILKKAGDLLQIELLDHVIVGNNDWFSFRDHALL
>Q55707 ~~~~~~Protein sll0617~~~COG1842
MGLFDRLGRVVRANLNDLVSKAEDPEKVLEQAVIDMQEDLVQLRQAVARTIAEEKRTEQRLNQDTQEAKKWEDRAKLALT
NGEENLAREALARKKSLTDTAAAYQTQLAQQRTMSENLRRNLAALEAKISEAKTKKNMLQARAKAAKANAELQQTLGGLG
TSSATSAFERMENKVLDMEATSQAAGELAGFGIENQFAQLEASSGVEDELAALKASMAGGALPGTSAATPQLEAAPVDSS
VPANNASQDDAVIDQELDDLRRRLNNL
>P67182 ~~~~~~Probable transcriptional regulatory protein SA0624~~~
MGRKWNNIKEKKAQKDKNTSRIYAKFGKEIYVAAKSGEPNPESNQALRLVLERAKTYSVPNHIIEKAIDKAKGAGDENFD
HLRYEGFGPSGSMLIVDALTNNVNRTASDVRAAFGKNGGNMGVSGSVAYMFDHVATFGIEGKSVDEILETLMEQDVDVND
VIDDNGLTIVYAEPDQFAVVQDALRAAGVEEFKVAEFEMLPQTDIELSEADQVTFEKLIDALEDLEDVQNVFHNVDLK
>P9WFS5 ~~~~~~TVP38/TMEM64 family membrane protein Rv0625c~~~COG0398
MSTHNDSAPTSRRRHIVRLVVFAGFLVGMFYLVAATDVIDVAAVRGAVSATGPAAPLTYVVVSAVLGALFVPGPILAASS
GLLFGPLVGVFVTLGATVGTAVVASLVGRRAGRASARALLGGERADRTDALIERCGLWAVVGQRFVPGISDAFASYAFGT
FGVPLWQMAVGAFIGSAPRAFAYTALGAAIGDRSPLLASCAIAVWCVTAIIGAFAARHGYRQWRAHARGDGADGGVEDPD
REVGAR
>A0R5R7 ~~~~~~Putative aminotransferase MSMEG_6286/MSMEI_6121~~~COG1167
MSFQSLGRDDLLAQHELQQRNYAELQAKQLKLDLTRGKPSPEQLDLSNGLLSLPGDGADAYRDGHGTDTRNYGGVQGLPE
LRAIFAELLGLPVENLIAGNNASLEMMHDSVVFSLLHGGLDSPRPWSAEPTVKFLCPAPGYDRHFAITESFGIENVPVPI
REDGPDVDVIEQLVASDPTIKGIWCVPVYSNPTGATYSTDVIRRLVQMPTAAKDFRLMWDNAYAVHTLTDEFVEPVDVLG
LAAAAGNPNRPLVFASTSKITFAGAGVSFLGASADNIAWYLKHAGKKSIGPDKVNQLRHLRFFGDADGVRRQMQRHRELI
APKFALVAEILEDRLGESKIASWTDPKGGYFVSLDVWPGTAKRTVALAKDAGIAVTEAGSAFPYRKDPEDKNIRIAPTFP
SLPDVRDAIDGLATCALLAATEALLGDK
>P9WKS7 ~~~~~~Uncharacterized protein Rv0628c~~~COG4398
MRIGVGVSTAPDVRRAAAEAAAHAREELAGGTPALAVLLGSRSHTDQAVDLLAAVQASVEPAALIGCVAQGIVAGRHELE
NEPAVAVWLASGPPAETFHLDFVRTGSGALITGYRFDRTAHDLHLLLPDPYSFPSNLLIEHLNTDLPGTTVVGGVVSGGR
RRGDTRLFRDRDVLTSGLVGVRLPGAHSVSVVSQGCRPIGEPYIVTGADGAVITELGGRPPLHRLREIVLGMAPDEQELV
SRGLQIGIVVDEHLAVPGQGDFLIRGLLGADPTTGAIGIGEVVEVGATVQFQVRDAAAADKDLRLAVERAAAELPGPPVG
GLLFTCNGRGRRMFGVTDHDASTIEDLLGGIPLAGFFAAGEIGPVAGHNALHGFTASMALFVD
>P9WKS5 ~~~~~~Uncharacterized protein Rv0634A~~~COG5450
MGSDCGCGGYLWSMLKRVEIEVDDDLIQKVIRRYRVKGAREAVNLALRTLLGEADTAEHGHDDEYDEFSDPNAWVPRRSR
DTG
>P9WFK1 ~~~~~~UPF0336 protein Rv0635~~~COG2030
MALSADIVGMHYRYPDHYEVEREKIREYAVAVQNDDAWYFEEDGAAELGYKGLLAPLTFICVFGYKAQAAFFKHANIATA
EAQIVQVDQVLKFEKPIVAGDKLYCDVYVDSVREAHGTQIIVTKNIVTNEEGDLVQETYTTLAGRAGEDGEGFSDGAA
>P9WFJ9 ~~~~~~UPF0336 protein Rv0637~~~COG2030
MALKTDIRGMIWRYPDYFIVGREQCREFARAVKCDHPAFFSEEAAADLGYDALVAPLTFVTILAKYVQLDFFRHVDVGME
TMQIVQVDQRFVFHKPVLAGDKLWARMDIHSVDERFGADIVVTRNLCTNDDGELVMEAYTTLMGQQGDGSARLKWDKESG
QVIRTA
>Q5L2A5 ~~~~~~UPF0342 protein GK0640~~~COG3679
MSEPLHALAKQLEQAIRASEPFQQLKRAYEDVRRDETAYRMFANVRDIQLQLHEKQMRGAAILPDEIEQAQKAMALAQQN
EKLARLMALEQQMSMTIAEVQQIAMKPLEELHRSFMEGR
>A0R635 1.-.-.-~~~~~~Putative Rieske 2Fe-2S iron-sulfur protein MSMEG_6410/MSMEI_6242~~~COG2146
MQVTSVGHAGFLIESRAGSILCDPWVNPAYFASWFPFPDNSQLDWDALGDVDYLYVSHLHKDHFDPEHLRRYVNKDAVVL
LPDYPVPDLRRELEKLGFHNFFETTDSVKHTVSGPKGDLDVMIIALRAPADGPIGDSGLVVSDRVTTVFNMNDARPVDLD
VLHTDFGQIDVHMLQYSGAIWYPMVYDMPARAKEAFGIQKRQRQMDRCRQYIAQVGATWVVPSAGPPCFLDPELRDLNDD
HGDPANIFPDQMVFLEQLRIHGHDGGLLMIPGSTADFTGSTLNSLTHPVDDPESIFTTGKAAYIEDYAQRMAPVLAAEKA
RWAPSAGEPMLEALRALFEPIMTQTDQICDGIGYPVELRLTGRDHNETVVLDFPKRVVREPIPDEKFRYGFEIPAALVRT
VLRDEEPDWVNTIFLSTRFRAWRVGGYNEYLYTFFKCLTDERIAYADGWFAEAHDDSSSITLDGFQIQRRCPHLKADLSK
FGVVEGNTLTCNLHGWQWNLENGRCLTTKGHELRCQKL
>Q5XCT5 ~~~~~~UPF0342 protein M6_Spy0643~~~
MSQEIYDYANQLERAVRALPEYQKVLEVKEAIQADASASQLFDEFVAMQEKIQGMMQSGQMPTAEEQTSIQELSQKIEAN
DQLKAYFEAQQALSVYMSDIERIVFAPLKDLVK
>P9WQI1 ~~~~~~Uncharacterized protein Rv0647c~~~COG0661
MRAEIGPDFRPHYTFGDAYPASERAHVNWELSAPVWHTAQMGSTTHREVAKLDRVPLPVEAARVAATGWQVTRTAVRFIG
RLPRKGPWQQKVIKELPQTFADLGPTYVKFGQIIASSPGAFGESLSREFRGLLDRVPPAKTDEVHKLFVEELGDEPARLF
ASFEEEPFASASIAQVHYATLRSGEEVVVKIQRPGIRRRVAADLQILKRFAQTVELAKLGRRLSAQDVVADFADNLAEEL
DFRLEAQSMEAWVSHLHASPLGKNIRVPQVHWDFTTERVLTMERVHGIRIDNAAAIRKAGFDGVELVKALLFSVFEGGLR
HGLFHGDLHAGNLYVDEAGRIVFFDFGIMGRIDPRTRWLLRELVYALLVKKDHAAAGKIVVLMGAVGTMKPETQAAKDLE
RFATPLTMQSLGDMSYADIGRQLSALADAYDVKLPRELVLIGKQFLYVERYMKLLAPRWQMMSDPQLTGYFANFMVEVSR
EHQSDIEV
>A0R6E3 ~~~~~~Uncharacterized protein MSMEG_6518/MSMEI_6344~~~COG1716
MYTAAIDALPPVGDAEFPERAAVVLSGLRKLQGSLAEAASRSRATPSVIVALSGVRTRYDELMTTAAEGPGATLGQRLYV
ARLRAKLTTAEAANGIGVRKDLIEAVEAEEPATEAETAQIKDLIAALGG
>Q8DQI6 ~~~~~~DegV domain-containing protein spr0652~~~COG1307
MTWKIIADSGCDYRQLPTPAINTTFVSVPLTIQVADQVFVDDASLDIDQMMETMYATAEASKSACPSPDDYLRAFEGAKN
IFLVTITGTLSGSHNSAQLAKNIYLEDHPDTKIHVIDSLSAGGEVDLLVEKLNDLIDQGLSFEEVVEAITAYQEKTKLLF
VLAKVDNLVKNGRLSKLIGTVVGLLNIRMVGKASETGTLELLQKARGSKKSVQAAYDELVKAGYAGGRIVMAQRNNEKCC
QQLSERIRETFPQADIKILPTSGLCSFYAEEGGLLMGYEID
>Q9RMX2 ~~~~~~Uncharacterized protein pXO2-61/BXB0075/GBAA_pXO2_0075~~~
MEEIKCLLCRYLKERQEKFISDWKKKVIIRERDPYKEEIIKNGEHLLSAFIMYLKEEISLQEIEITSKKIARERIDAKVN
IAEFIHNTNVAKIEIMNILTLLNPDLQQYQALVKKINQFFDHLIYYTVHSYYEQKA
>Q83DN9 ~~~~~~Uncharacterized protein CBU_0658~~~COG0718
MSAILPTEADRRLLSDIKESITDMQQQMQATYSNLADLKLVGESHDKTVRITMTATYNFEDIEFDERALQGGVKEFKWRI
REAWKNLCETIQKTTQSKTIELLQSMRIPEDIRNLSVEEEGGEGGEGGQGTRGMIGNPIASGG
>A0R6Q0 ~~~~~~Uncharacterized protein MSMEG_6630~~~COG0346
MSDHEVKMVVLSTENLDESIKFYETLGFSLKFRDGAHFAALDGGAVTLALATPVDHPIPGKVVVGIKTADVDAAAKEIEA
TGGAIIKGPYDDAHERRAVVYDNTGNGLVFYSPLKR
>Q57538 ~~~~~~Probable ABC transporter ATP-binding/permease protein HI_0664~~~COG1132
MRKNGFVVMGHLLKLVTPLAHIMAFTITMGTLGFLAAIFIMVLGATGLVNLLNFDTHLSFSGILTALIVLAVARGALRYL
EQMSGHYIAFKLLALLRDKVFSSLRRLAFVKLQDKQAGQLVSLVTNDIELLEVFYAHTIAPIMIAFFTSAILLLVFAQLS
SWFVLVALAAYLTVGVILPIITTKLAREDGRRYRELVGEMNDFFLDSVRGMKEIQLFGYAKQRLDEIQQRSQKIDTAFER
IKDQEAKVRVYTEVAVSVFNIIMLFTGLILFSLDKIDFAAFLIGVILLMSSYGPVIALSNLSSNLLQTLASGERVLSLLA
EEPELKDVESAVDLKDVSRIDVENVNFAYGEEQILSDVSLSVKKGEILGIHGRSGSGKSTLLKLLMRFYDPKSGSIKING
ETLPNINTCSLRDNMAYITQQTYIFNETIEENIRLARRDATLEEIMEAAKKASIHDFILSLPQGYQTKMTELGGNLSDGE
KQRIGIARAFLHNAPIILLDEPTSNLDSLNEAMILKSLLNVKAEKLIILVSHRQSTMAICDQVIGIENGRMS
>O84673 ~~~~~~Uncharacterized protein CT_666~~~
MASGSCSAFNFNQMLDGVCKYVQGVQQYLTELETSTQGTVDLGTMFNLQFRMQILSQYMESVSNILTAVNTEMITMARAV
KGS
>P44034 ~~~~~~Uncharacterized protein HI_0666~~~COG3550
MRDLVRSGKVFLYGEFIGLLREDHRGFHFSYNPDYQGIPLSLSFPIEQSPFHSDTLFPYFASLVPEGWLKHKYALHQRID
ESDMFRFLLNNGENMLGAVQIQEEKQ
>O84678 ~~~~~~Uncharacterized protein CT_671~~~
MELNKTSESLFSAKIDHNHPRTEAHEPRDQREVRVFSLEGRSSTRQEKADRMPGRTSSRQESSKGSEEGAVHESTAGVSS
KEEEESKGDGFFTGGNPTSGMALVETPMAVVSEAMVETSTMTVSQVDLQWVEQLVTSTVESLLVADIDGKQLVEIVLDNS
NTVPAAFCGANLTLVQTGEEISVSFSNFVDQAQLTEATQLVQQNPKQLVSLVESLKARQLNLTELVVGNVAVSLPTIEKI
ETPLHMIAATIRHHDQEGDQEGEGRQDQHQGQHQEKKVEEAHI
>O53789 ~~~~~~Uncharacterized HTH-type transcriptional regulator Rv0681~~~COG1309
MAAQPQAPSAGGRPRAGKAVKSVARPAKLSRESIVEGALTFLDREGWDSLTINALATQLGTKGPSLYNHVDSLEDLRRAV
RIRVIDDIITMLNRVGAGRARDDAVLVMAGAYRSYAHHHPGRYSAFTRMPLGGDDPEYTAATRGAAAPVIAVLSSYGLDG
EQAFYAALEFWSALHGFVLLEMTGVMDDIDTDAVFTDMVLRLAAGMERRTTHGGTAST
>Q7A6T6 2.7.1.-~~~~~~Putative lipid kinase SA0681~~~
MENKYTHGVLFYHEHSGLKNINQGIGEVTTALSSICKHLSIQLSENEGDIIKYCQEIKTKNYAKDVDILFILGGDGTVNE
LINGVMSHDLQLPIGILPGGTFNDFTKTLNIAPNHKQASEQMISAQVGTYDVIKINNQYALNFVGLGLIVQNAENVQDGS
KDIFGKLSYIGSTVKTLLNPTQFNYQLSIDDKTYSGETTMILTANGPFIGGSRIPLTDLSPQDGELNTFIFNEQSFSILN
DIFKKRDSMNWNEITQGIEHIPGKKISLTTDPAMKVDIDGEISLETPIDIEVIPNAIQLLTVNDL
>P75109 ~~~~~~Uncharacterized ABC transporter permease MG468 homolog~~~
MFSFFKQIFKSLKKFFFLLFGIIFVLFSIIFLETSILQLSNNLVNTYTALVQKTNSSDIVAPAIFKESSPVYKTELKEEK
RHFSKIKLTEKKINFIWPYQESDFGSDSEKKDSTTKSSDNNPRKGDVNDKDKLFLARKRGILKAYGEANIAEKRIYKGLA
VSFQNTYSFTGTDQESNNLNQNTVSDPQNLIYDKEGNLLGYFVDGLILDGIPLRAGIARFPGDKGKGEDKKTTKKKSEIK
QASSATTVLQPLAAQAKMTDAKETTNNEEPKKDSNVEEQYTTNNKDKVWFKSDETQAGSSSGESETSKLSTSYLFTGGQE
AANWFPNLYANVPIVLPISPGSQFWLETNPFKEIIEVFQKEKEEKEKQSFSLTFTLDTSKLSHLDKEEFDWLEKQAETIS
SGSSFGDYNLKKKINSLKSFELNINKDWLKNKVKAEKETILDSLPGFSNSDKNTIFSTQSGDAKSGTQSNPSSLIALRSS
VSFKPQLQQTNVALAQQQQDKQESSADDGVKDPTFSDVQTEFDKIGTENHTPQKNLNNVYAALLHQWKSIFQEDLVKKVT
ALLEKYRDAFLKAKALKELEFSRQNLAIATNVSSEESASFLVSNKDSQKYNDLSIIEGINFKSWLAKEKSNPLDMVYGGK
SNSEGFLEKVEYEFKPTQTDEKKKAAAKTTQGTTDSLTQLADASSSSSSSSTGDTKSTSTKFQIYPKLANILAQAQLPEA
SSIPDTLTNAIKQWSTLDKKGFEALDDTGKSKAANNYVALLSYFTPEFQDPNELVVTNRQKLDIPIIFKNGVNPLTLPTD
QQSLVVQTPEAHGAVVSQQWLFKHDKEVLPLEGEYSWKEALENPKNLPNWLNDLPDKYKFSINGLTFAILGVGESVETGY
PVLSTNSPLPNSQDEGLIFLNEQGYRSVLFAVPAASEENYYAFKSDDIKAKFPGQDPIQVVASKLKGYLNVPDSDLAFNV
KDISKFQYLTTARNYFPDLVQNYLAIASVVIAAFLSILALYLTILLIKSFIKKNQTEFSIIRAGGFSTAKFIAGMSVFAG
IVALASSFFGVLFAFLLERQVKGIISRYWFIALPANSFNWISFFGSMLLIFVIFQFISWIAFKQLFSKPVNVLIDQGNET
KFSVFLHLLKRKSYTMTPLGKFRVSLIISRLSRLFTYVGLSSIALLLIGIAGTIPQKFGAAQSNTVLNRNFNYRLNLQTP
TEQSGWYAIQPYSRFGQTDDSLGIKALYKDKGDQIQQQQQQQQQQGNDKEHPYNLKELKISDRGGNPIKHNGKEIELGNL
LLPSFGGAQQLNTDENFFRHASLSKWLIDFPIRVGGANINPWEIVEKSIPKQITQLLSASSDQFLIAVLTDDYFNNLNNN
GFLTRNPRTNFIQLDAARVLTQINVFNPGGVKFNEQFLKFLTKVYGDPELSYQDSKLTYGIVPVDPQIEETYTYVQGPFG
FKETELNPDSPYTLTGISPDSKFVNLTDSGGNSLRSLISSDSEMNVIVNAGFQYANNTKIGDFIFIQPKNTATRYSEKFL
KSPPKTPTVKFRVVGVSTDAFGQELYINQNIANRLLKLNGFDGRGVIKDVVKDGQSTDDSGGTSSGGGSCGGGSTSSTTK
DKYKIEYVKPTGYVPFNGVFSKELNPSLVSKALVLNSNIGVWGNFTDFGNNFTNLVKGKENKIITSILPSDPDILKQLAK
EKGENGVDSMTYENLRKKVIEKYTSEWSSTQSLASGARGIFGDNIMVPALKLDAAGASAQIIRNNAEVLFNTVNQVDGFL
LGTIIPFIFITCVVLGISMLEEMKRIFISLKSIGYKDSQNLVSLLCFFIPAFVLSLLISIAILAGLLVGVQALVFGVAQV
FLTNVFEFLPYMVGIVLFGATIFVIGSYFWIKLRSAELKEGF
>Q50315 ~~~~~~Uncharacterized protein MPN_687~~~
MAISKKKRFFFDLAQDEDDAETVQEVKKVEQQLKLEPVVQPQHDLTNQTKANQSSQDRKFFSKDMPQFDFGPLLKFGDEF
VKSFNQFPKQEPQTSTQPVNVQPQSEPTNFNNQVPTQPVHQTAEVHLNEFQQPTTTNFNQQPVATSNIQVEATQPIVEPV
PQPEPQPAVEQPQVKQTTRPSNKLQEEENLPPPKAKVPGIIPLERQERLTTGVHFYTSTRVWNKVKRYAKAVNIPISRIL
TMILDQVIEE
>O51632 ~~~~~~Uncharacterized protein BB_0689~~~
MKKLIIIFTLFLSQACNLSTMHKIDTKEDMKILYSEIAELRKKLNLNHLEIDDTLEKVAKEYAIKLGENRTITHTLFGTT
PMQRIHKYDQSFNLTREILASGIELNRVVNAWLNSPSHKEALINTDTDKIGGYRLKTTDNIDIFVVLFGKRKYKN
>P0A0N1 ~~~~~~DegV domain-containing protein SA0704~~~
MKIAVMTDSTSYLSQDLIDKYNIQIAPLSVTFDDGKNFTESNEIAIEEFYNKMASSQTIPTTSQPAIGEWITKYEMLRDQ
GYTDIIVICLSSGISGSYQSSYQAGEMVEGVNVHAFDSKLAAMIEGCYVLRAIEMVEEGYEPQQIIDDLTNMREHTGAYL
IVDDLKNLQKSGRITGAQAWVGTLLKMKPVLKFEDGKIIPEEKVRTKKRAIQTLEKKVLDIVKDFEEVTLFVINGDHFED
GQALYKKLQEDCPSGYQVAYSEFGPVVAAHLGSGGLGLGYVGRKIRLT
>P44839 ~~~~~~RutC family protein HI_0719~~~COG0251
MMTQIIHTEKAPAAIGPYVQAVDLGNLVLTSGQIPVNPATGEVPADIVAQARQSLENVKAIIEKAGLTAADIVKTTVFVK
DLNDFAAVNAEYERFFKENNHPNFPARSCVEVARLPKDVGLEIEAIAVRK
>P67109 ~~~~~~Nucleotide-binding protein SA0720~~~
MDNNEKEKSKSELLVVTGLSGAGKSLVIQCLEDMGYFCVDNLPPVLLPKFVELMEQGNPSLRKVAIAIDLRGKELFNSLV
AVVDKVKSESDVIIDVMFLEASTEKLISRYKETRRAHPLMEQGKRSLINAINDEREHLSQIRSIANFVIDTTKLSPKELK
ERIRRYYEDEEFETFTINVTSFGFKHGIQMDADLVFDVRFLPNPYYVVDLRPLTGLDKDVYNYVMKWKETEIFFEKLTDL
LDFMIPGYKKEGKSQLVIAIGCTGGQHRSVALAERLGNYLNEVFEYNVYVHHRDAHIESGEKK
>Q7A6Q5 ~~~~~~Epimerase family protein SA0724~~~
MKQYLITGGTGMVGSQLVNEIKKSDSHITILTRHDQISNDKKISYVNWAKSGWEHKVPQNIDVVINLAGATLNKRWTPEY
KQTLMLSRIQSTQALYELFKSRNKAPKVLFNASATGYYPPDLFMSYTEVYKTLPFDFLSDIVYQWERFAQQFEQLGTRVV
IGRFGMILSNEGGALQTMKLPYKYYIGGKLGSGQQWYSWIHINDLIQAILFLINNESASGPFNLTAPIPERQNLFGYTLA
RAMHKPHETWAPSLAMRLILGQMSTVVLDTQKVLPNKIQALGFQFKYSNLKMALEDLIKE
>Q1CW44 ~~~~~~Uncharacterized protein MXAN_7266~~~
MSLRKSKYEVLERLEVGLRIVSLALVVAKTLVELVDTTLI
>P44045 ~~~~~~Uncharacterized protein HI_0732~~~
MLFSGDELANNTVLELRAQGQLSAFNKQPNLTFETDAPAILQQAVAQTRE
>O84741 ~~~~~~UPF0098 protein CT_736~~~
MQLTSQAFSYGRPIPKKYSCQGVGISPPLSFSDVPREAKSLVLIVEDPDVPPSVREDGLWIHWIVYNLSPVVSNLAEGAQ
IFAVQGLNTAGEIGYCPPCPPDAKHRYYFYAYALDVVLSDEEGVTKEQLLEAMDGHIIATAELMGTYEKD
>P9WKS3 ~~~~~~Uncharacterized protein Rv0738~~~COG1576
MDPLMAHQRAQDAFAALLANVRADQLGGPTPCSEWTINDLIEHVVGGNEQVGRWAASPIEPPARPDGLVAAHQAAAAVAH
EIFAAPGGMSATFKLPLGEVPGQVFIGLRTTDVLTHAWDLAAATGQSTDLDPELAVERLAAARALVGPQFRGPGKPFADE
KPCPRERPPADQLAAFLGRTVR
>Q57290 5.4.2.8~~~~~~Probable phosphomannomutase~~~COG1109
MGMNRVLVSQAAGGLAEYLKGYDKEPSIVIGYDGRKNSDVFARDTAEIMAGAGVKAYLLPRKLPTPVLAYAIQYFDTTAG
VMVTASHNPPEDNGYKVYLGKANGGGQIVSPADKDIAALIDKVAAGNIQDLPRSDNYVVLNDEVVDAYITKTASLAKEPA
CDINYVYTAMHGVGYEVLSKTLAKAGLPQPHVVADQVWPDGTFPTVNFPNPEEKGALDLAIKVAKEKNAEFIIANDPDAD
RLAVAVPDAQGNWKSLHGNVVGCFLGWYLAKQYQGKQGTLACSLVSSPALAEIAKKYSFQSEETLTGFKYIGKVSGLLFG
FEEALGYLVDPDKVRDKDGISAAIVFLDLVRNLKKQGKTLADYADEFTKEFGAYVSGQISIRVSDLSEIGKLMTALRNNP
PAEIAGVKVAQFIDHIKTDRQSDILVFNLENGGRLIARPSGTEPKIKFYLDARGKDPKDADRVLAEFDEGVRHILRQDAY
GKQDC
>P9WFI7 2.1.1.-~~~~~~Putative S-adenosyl-L-methionine-dependent methyltransferase Rv0726c~~~COG3315
MTYTGSIRCEGDTWDLASSVGATATMVAAARAMATRAANPLINDQFAEPLVRAVGVDVLTRLASGELTASDIDDPERPNA
SMVRMAEHHAVRTKFFDEFFMDATRAGIRQVVILASGLDSRAYRLAWPAQTVVYEIDQPQVMEFKTRTLAELGATPTADR
RVVTADLRADWPTALGAAGFDPTQPTAWSAEGLLRYLPPEAQDRLLDNVTALSVPDSRFATESIRNFKPHHEERMRERMT
ILANRWRAYGFDLDMNELVYFGDRNEPASYLSDNGWLLTEIKSQDLLTANGFQPFEDEEVPLPDFFYVSARLQRKHRQYP
AHRKPAPSWRHTACPVNELSKSAAYTMTRSDAHQASTTAPPPPGLTG
>O83732 ~~~~~~Uncharacterized protein TP_0751~~~
MNRPLLSVAGSLFVAAWALYIFSCFQHGHVPPRRIPPHDTFGALPTAALPSNARDTAAHPSDTADNTSGSSTTTDPRSHG
NAPPAPVGGAAQTHTQPPVQTAMRIALWNRATHGEQGALQHLLAGLWIQTEISPNSGDIHPLLFFDREHAEITFSRASVQ
EIFLVDSAHTHRKTVSFLTRNTAISSIRRRLEVTFESHEVIHVRAVEDVARLKIGSTSMWDGQYTRYHAGPASAPSP
>P44863 ~~~~~~Uncharacterized protein HI_0755~~~COG2861
MNILIKSAVKNFIVFSTALYTSFSFAQSKLAIVIDDVGYHLKEDAAIFAMPREISVAIIPAAPYARARNQEAKSQGRDIL
IHMPMQPVSAVKIEDGGLHLGMSAAQVNDRVNTAKNIVRDAIGMNNHMGSAATADPQLMTYLMTALQEKHLFFLDSRTIG
KSVAGKIAKEQGVRSLDRHIFLDDSNEFADVQRQFKAAIHYARKHGSAIAIGHPRPNTIAVLQAGLRNLPEDIQLVGMGN
LWRNEKVIPPKPFILLFSEVPAPTSIEPFEPVGLLRGIPK
>P9WFI5 2.1.1.-~~~~~~Putative S-adenosyl-L-methionine-dependent methyltransferase Rv0731c~~~COG3315
MTQTGSARFEGDSWDLASSVGLTATMVAAARAVAGRAPGALVNDQFAEPLVRAVGVDFFVRMASGELDPDELAEDEANGL
RRFADAMAIRTHYFDNFFLDATRAGIRQAVILASGLDSRAYRLRWPAGTIVFEVDQPQVIDFKTTTLAGLGAAPTTDRRT
VAVDLRDDWPTALQKAGFDNAQRTAWIAEGLLGYLSAEAQDRLLDQITAQSVPGSQFATEVLRDINRLNEEELRGRMRRL
AERFRRHGLDLDMSGLVYFGDRTDARTYLADHGWRTASASTTDLLAEHGLPPIDGDDAPFGEVIYVSAELKQKHQDTR
>Q9PFB4 ~~~~~~Probable membrane transporter protein XF_0764~~~COG0730
MTIQSLIVTIGSGGLVGFALGLLGGGGSILATPLLLYVVGVTNPHIAIGTSAVAVSANAYANLIAHAWKGHVWWRSAVIF
ALVGTLGAFLGSSIGMLIDGQRLLLLFGLLMAMVGLLMLRGRATAPHAEQHQTVLRMCMKTSAVAILTGAASGFFGIGGG
FLIVPALIFATRMPTINAIGSSLLAVGTFGLITTLNYARHDLVDWTIAMEFIVGGITGGGLGTLLATRFSASKHLLNRVF
GLIVIAVAIYVIWRSWASLVA
>Q9PFB3 ~~~~~~Probable transporter XF_0765~~~COG2391
MSLHVTLRFTVALAAGLLFGFGLALSEMINPIRVLSFLNVASGHWNPSLLFVLGSALAVAFPGMALQRRLKRPLLDECFH
LPSKKVIDRRIVFGSAIFGTGWGLTGLCPGPAIASLSTGLGPVLLFVAAMAAGMIIHDRIVVRCLS
>Q9PFB2 ~~~~~~Probable transporter XF_0766~~~COG2391
MSEYWYPILGGILLGLSTVMLLLLNGRIAGISGIVGRLLQGGNPAQDIPFVVGLVLGPLVFSVIFDRFPSVTVAATWPTI
IVAGLLVGLGTRMSAGCTSGHGIAGIARHSPRSIVATAIFLISGMATATFMGVYQ
>P9WMD7 ~~~~~~Uncharacterized HTH-type transcriptional regulator Rv0767c~~~COG1309
MSSDVLVTTPAQRQTEPHAEAVSRNRRQQATFRKVLAAAMATLREKSYADLTVRLVAARAKVAPATAYTYFSSKNHLIAE
VYLDLVRQVPCVTDVNVPMPIRVTSSLRHLALVVADEPEIGAACTAALLDGGADPAVRAVRDRIGAEIHRRITSAIGPGA
DPGTVFALEMAFFGALVQAGSGTFTYHEIADRLGYVVGLILAGANEPSTGGSE
>P9WGQ9 1.-.-.-~~~~~~Uncharacterized oxidoreductase Rv0769~~~COG1028
MFDSKVAIVTGAAQGIGQAYAQALAREGASVVVADINADGAAAVAKQIVADGGTAIHVPVDVSDEDSAKAMVDRAVGAFG
GIDYLVNNAAIYGGMKLDLLLTVPLDYYKKFMSVNHDGVLVCTRAVYKHMAKRGGGAIVNQSSTAAWLYSNFYGLAKVGV
NGLTQQLARELGGMKIRINAIAPGPIDTEATRTVTPAELVKNMVQTIPLSRMGTPEDLVGMCLFLLSDSASWITGQIFNV
DGGQIIRS
>P9WNY3 1.1.-.-~~~~~~Uncharacterized oxidoreductase Rv0770~~~COG2084
MTAHPETPRLGYIGLGNQGAPMAKRLLDWPGGLTVFDVRVEAMAPFVEGGATAAASVSDVAEADIISITVFDDAQVSSVI
TADNGLATHAKPGTIVAIHSTIADTTAVDLAEKLKPQGIHIVDAPVSGGAAAAAKGELAVMVGADDEAFQRIKEPFSRWA
SLLIHAGEPGAGTRMKLARNMLTFVSYAAAAEAQRLAEACGLDLVALGKVVRHSDSFTGGAGAIMFRNTTAPMEPADPLR
PLLEHTRGLGEKDLSLALALGEVVSVDLPLAQLALQRLAAGLGVPHPDTEPAKET
>Q7A6L9 ~~~~~~UPF0337 protein SA0772~~~
MADESKFEQAKGNVKETVGNVTDNKNLENEGKEDKASGKAKEFVENAKEKATDFIDKVKGNKGE
>Q7A6L4 ~~~~~~UPF0051 protein SA0778~~~
MAKKAPDVGDYKYGFHDDDVSIFRSERGLTENIVREISNMKNEPEWMLDFRLKSLKLFYKMPMPQWGGDLSELNFDDITY
YVKPSEQAERSWDEVPEEIKRTFDKLGIPEAEQKYLAGVSAQYESEVVYHNMEKELEEKGIIFKDTDSALQENEELFKKY
FASVVPAADNKFAALNSAVWSGGSFIYVPKNIKLDTPLQAYFRINSENMGQFERTLIIADEGASVHYVEGCTAPVYTTSS
LHSAVVEIIVHKDAHVRYTTIQNWANNVYNLVTKRTFVYENGNMEWVDGNLGSKLTMKYPNCVLLGEGAKGSTLSIAFAG
KGQVQDAGAKMIHKAPNTSSTIVSKSISKNGGKVIYRGIVHFGRKAKGARSNIECDTLILDNESTSDTIPYNEVFNDQIS
LEHEAKVSKVSEEQLFYLMSRGISEEEATEMIVMGFIEPFTKELPMEYAVEMNRLIKFEMEGSIG
>Q6GIR5 2.7.1.-~~~~~~Putative lipid kinase SAR0780~~~
MENKYTHGVLFYHEHSGLKNINQGIGEVTTALSSICKHLSIQLSENEGDIIKYCQEIKAKDYAKDVDILFILGGDGTVNE
LINGVMTHDLQLPIGILPGGTFNDFTKTLNIAPNHKQASEQMISAQVGTYDVIKINNQYALNFVGLGLIVQNAENVQDGS
KDIFGKLSYIGSTVKTLLNPTQFNYQLSIDDKTYSGETTMILSANGPFIGGSRIPLTDLSPQDGELNTFIFNEQSFSILN
DIFKKRDSMNWNEITQGIEHIPGKKISLTTDPTMKVDIDGEISLETPIDIEVIPNAIQLLTVNDL
>P71839 ~~~~~~Protein Rv0786c~~~COG2220
MQLTHFGHSCLLAEFGQTRLLFDPGTFSHGFEGITGLSAILITHQHPDHIDVTRLPTLLEDNPAAELYADPQTAAQLGEP
WRAVHVGDELPLAELTVRAVGGCHAVIHPEIPVIENISYLVGDSKHRARLMHPGDALFVPGEQVDVLATPAAAPWMKISE
AVDYLRAVAPARAVPIHQAIVAPDARGIYYGRLTEMTTTDFQVLPEESAVTF
>O05979 ~~~~~~Uncharacterized protein RP789~~~COG2230
MSLKSTTSSLTTNNHDKTINSVQSLVNGTGTVADHNPYDEVPYESYPYAITNPYHLSTLATLFGINAPEVENSKILELGC
AAGGNLIPHAVLYPNAHFVGVDLSKVQIDEANKNVRALGLKNIEFHHCSITDIDDSFGKFDYIICHGVISWVPKIVRDKI
FKVCNRNLSTNGIAYISYNTLPGWNMVRTIRDMMLYHSSSFTNIRDRIAQSRLLLEFVKDSLEHSKTPYAEVLKTEAGLL
AKQTDHYLRHDHLEEENAQFYFHEFMNEARKHNLQYLADCNISTMYLGNMPPKVVEQLKAVNDIVRTEQYMDFITNRRFR
TTLLCHNDLKINRNINNDDIKKFNIIFNVIPEKPLKEVDLNNATENLQFFLNGNKESNLSTTSPYMKAILYTFSENLNNP
LSFKQVTSEANTKLNNTKLNEIKNELLNNAMKLVLQGYISITNQKHRSKPVLDKPKTTQMVIYQAKYTPSMWVTNLKHEP
IGVNFFEKFALRYMDGRNDKKAIIEAILGHVEKGELTLSREGQKIENKEEIRKELESLFTPMIEKFCSNALLV
>O86331 ~~~~~~HTH-type transcriptional regulator Rv0792c~~~COG2188
MTSVKLDLDAADLRISRGSVPASTQLAEALKAQIIQQRLPRGGRLPSERELIDRSGLSRVTVRAAVGMLQRQGWLVRRQG
LGTFVADPVEQELSCGVRTITEVLLSCGVTPQVDVLSHQTGPAPQRISETLGLVEVLCIRRRIRTGDQPLALVTAYLPPG
VGPAVEPLLSGSADTETTYAMWERRLGVRIAQATHEIHAAGASPDVADALGLAVGSPVLVVDRTSYTNDGKPLEVVVFHH
RPERYQFSVTLPRTLPGSGAGIIEKRDFA
>Q5XCC7 2.7.1.-~~~~~~Probable phosphotransferase enzyme IIB component M6_Spy0801~~~
MITQIRVDDRLVHGQVAVVWTKELNAPLLVVANDEAAKNEITQMTLKMAVPNGMKLLIRSVEDSIKLFNDPRATDKRIFV
IVNSVKDACAIAKEVPDLEAVNVANVGRFDKSDPASKVKVTPSLLLNPEEMAAAKELVSLPELDVFNQVLPSNTKVHLSQ
LVN
>P9WQG7 2.8.3.-~~~~~~Putative succinyl-CoA transferase Rv0802c~~~COG1670
MSRHWPLFDLRITTPRLQLQLPTEELCDQLIDTILEGVHDPDRMPFSVPWTRASREDLPFNTLSHLWQQLAGFKRDDWSL
PLAVLVDGRAVGVQALSSKDFPITRQVDSGSWLGLRYQGHGYGTEMRAAVLYFAFAELEAQVATSRSFVDNPASIAVSRR
NGYRDNGLDRVAREGAMAEALLFRLTRDDWQRHRTVEVRVDGFDRCRPLFGPLEPPRY
>Q9Z7A3 ~~~~~~Protein CPn_0803/CP_1068/CPj0803/CpB0832~~~
MAAKTKTLELEDNVFLLLEGNLKRIFATPIGYTTFREFQNVVFNCANGQQEIANFFFEMLINGKLTQELAPQQKQAAHSL
IAEFMMPIRVAKDIHERGEFINFITSDMLTQQERCIFLNRLARVDGQEFLLMTDVQNTCHLIRHLLARLLEAQKNPVGEK
NLQEIQEEITSLKNHFDELTKALQ
>Q02TE1 ~~~gpFI~~~Putative prophage major tail sheath protein~~~
MSFFHGVTVTNVDIGARTIALPASSVIGLCDVFTPGAQASAKPNVPVLLTSKKDAAAAFGIGSSIYLACEAIYNRAQAVI
VAVGVEAAETPEAQASAVIGGVSAAGERTGLQALLDGKSRFNAQPRLLVAPGHSAQQAVATAMDGLAEKLRAIAILDGPN
STDEAAVAYAKNFGSKRLFMVDPGVQVWDSATNAARKAPASAYAAGLFAWTDAEYGFWSSPSNKEIKGITGTSRPVEFLD
GDETCRANLLNNANIATIIRDDGYRLWGNRTLSSDSKWAFVTRVRTMDLVMDAILAGHKWAVDRGITKTYVKDVTEGLRA
FMRDLKNQGAVINFEVYADPDLNSASQLAQGKVYWNIRFTDVPPAENPNFRVEVTDQWLTEVLDVA
>Q5XCB9 1.-.-.-~~~~~~Putative NAD(P)H nitroreductase Spy0809~~~
MKFLELNKKRHAIKTFNDQPVDYEDLRTAIEIATLAPSANNIQPWKFVVVQEKKAELAKGLPLANKVQVEQAQYVVALFS
DTDLALRSRKIARIGVKSLPDDLIGYYMETLPPRFAAFNEVQTGEYLAINAGIVAMNLVLSLTDQKIASNIILGFDKSTT
NGILDIDPRFRPELLITVGYSDEKPEPSYRLPVDEVIERR
>Q9ZCE4 ~~~~~~Uncharacterized protein RP812~~~COG0271
MAISAEELEKILKKSFPSSVIKITDLVGDQDHYALEISDAQFNGLSLINQHKLVKNALSEILNKKLHSISIKTISIP
>P0CG95 ~~~sseC2~~~Uncharacterized protein Rv0814c~~~
MCSGPKQGLTLPASVDLEKETVITGRVVDGDGQAVGGAFVRLLDSSDEFTAEVVASATGDFRFFAAPGSWTLRALSAAGN
GDAVVQPSGAGIHEVDVKIT
>P44882 ~~~~~~UPF0149 protein HI_0817~~~COG3079
MLISHSDLNQQLKSAGIGFNATELHGFLSGLLCGGLKDQSWLPLLYQFSNDNHAYPTGLVQPVTELYEQISQTLSDVEGF
TFELGLTEDENVFTQADSLSDWANQFLLGIGLAQPELAKEKGEIGEAVDDLQDICQLGYDEDDNEEELAEALEEIIEYVR
TIAMLFYSHFNEGEIESKPVLH
>P44886 3.1.2.-~~~~~~Uncharacterized acyl-CoA thioester hydrolase HI_0827~~~COG1607
MSANFTDKNGRQSKGVLLLRTLAMPSDTNANGDIFGGWIMSQMDMGGAILAKEIAHGRVVTVAVESMNFIKPISVGDVVC
CYGQCLKVGRSSIKIKVEVWVKKVASEPIGERYCVTDAVFTFVAVDNNGRSRTIPRENNQELEKALALISEQPL
>P44887 ~~~~~~Uncharacterized protein HI_0828~~~COG2350
MYYVIFAQDIPNTLEKRLAVREQHLARLKQLQAENRLLTAGPNPAIDDENPSEAGFTGSTVIAQFENLQAAKDWAAQDPY
VEAGVYADVIVKPFKKVF
>Q9ZCC9 ~~~~~~Putative adhesin RP828~~~COG3637
MKKLLLIATASATILSSSVSFAECIDNEWYLRADAGVAMFNKEQDKATGVKLKSNKAIPIDLGIGYYISENVRADLTLGT
TIGGKLKKYGAATNTHFTGTNVSVSHKPTVTRLLINGYVDLTSFDMFDVFVGGGVGPALVKEKISGVSGLASNTKNKTNV
SYKLIFGTSAQIADGVKVELAYSWINDGKTKTHNVMYKGASVQTGGMRYQSHNLTVGVRFGI
>Q7A6H3 ~~~~~~Uncharacterized protein SA0829~~~
MKFLSFKYNDKTSYGVKVKREDAVWDLTQVFADFAEGDFHPKTLLAGLQQNHTLDFQEQVRKAVVAAEDSGKAEDYKISF
NDIEFLPPVTPPNNVIAFGRNYKDHANELNHEVEKLYVFTKAASSLTGDNATIPNHKDITDQLDYEGELGIVIGKSGEKI
PKALALDYVYGYTIINDITDRKAQSEQDQAFLSKSLTGGCPMGPYIVTKDELPLPENVNIVTKVNNEIRQDGNTGEMILK
IDELIEEISKYVALHPGDIIATGTPAGVGAGMQPPKFLQPGDEVKVTIDNIGTLTTYIAK
>P44897 ~~~~~~UPF0352 protein HI_0840~~~COG3082
MAQHSKYSDAQLSAIVNDMIAVLEKHKAPVDLSLIALGNMASNLLTTSVPQTQCEALAQAFSNSLINAVKTR
>P0DG80 ~~~~~~UPF0122 protein SpyM3_0842~~~
MNIMEIEKTNRMNALFEFYAALLTDKQMNYIELYYADDYSLAEIADEFGVSRQAVYDNIKRTEKILETYEMKLHMYSDYV
VRSEIFDDMIAHYPHDEYLQEKISILTSIDNRE
>P31811 ~~~~~~UPF0438 protein HI_0847~~~COG3085
MAASFSVTRRFFDDKNYPRGFSRHGDYTIKESQVLEQYGQAFKALDLGEREPATKEEKDFVAFCRGERAAETFFEKTWNK
YRTRINTKKRVYTLSSDVSEAASGGEDYSGE
>P9WFI3 2.1.1.-~~~~~~Putative S-adenosyl-L-methionine-dependent methyltransferase Rv0830~~~COG3315
MVRADRDRWDLATSVGATATMVAAQRALAADPRYALIDDPYAAPLVRAVGMDVYTRLVDWQIPVEGDSEFDPQRMATGMA
CRTRFFDQFFLDATHSGIGQFVILASGLDARAYRLAWPVGSIVYEVDMPEVIEFKTATLSDLGAEPATERRTVAVDLRDD
WATALQTAGFDPKVPAAWSAEGLLVYLPVEAQDALFDNITALSAPGSRLAFEFVPDTAIFADERWRNYHNRMSELGFDID
LNELVYHGQRGHVLDYLTRDGWQTSALTVTQLYEANGFAYPDDELATAFADLTYSSATLMR
>P44061 1.14.99.-~~~~~~Probable heme oxygenase HI_0854~~~COG0748
MDFNRIITHMNDHHQDDMAVLCKKFGGEKEITDVTLVNVDFAGLDFKYNGGKTLRVEFPQQADAGSIKQVIINLCVENKP
VANYDRIKAKIDEFRQEFKSCVLSTLDKDGLPMASYAPIIFFDGKYYIYISAIAEHYENLKRNPNQVEVMFLEDENKAKS
IIVRTRLRYKASARFIPREDPIVEKVLDKLAETMNDVGGIKTIREFTDFDLVELTFGTGRFVRGFGQAYLIDANGEISHI
GVKGNPHEKESDK
>O84866 ~~~~~~Protein CT_858~~~
MKMNRIWLLLLTFSSAIHSPVQGESLVCKNALQDLSFLEHLLQVKYAPKTWKEQYLGWDLVQSSVSAQQKLRTQENPSTS
FCQQVLADFIGGLNDFHAGVTFFAIESAYLPYTVQKSSDGRFYFVDIMTFSSEIRVGDELLEVDGAPVQDVLATLYGSNH
KGTAAEESAALRTLFSRMASLGHKVPSGRTTLKIRRPFGTTREVRVKWRYVPEGVGDLATIAPSIRAPQLQKSMRSFFPK
KDDAFHRSSSLFYSPMVPHFWAELRNHYATSGLKSGYNIGSTDGFLPVIGPVIWESEGLFRAYISSVTDGDGKSHKVGFL
RIPTYSWQDMEDFDPSGPPPWEEFAKIIQVFSSNTEALIIDQTNNPGGSVLYLYALLSMLTDRPLELPKHRMILTQDEVV
DALDWLTLLENVDTNVESRLALGDNMEGYTVDLQVAEYLKSFGRQVLNCWSKGDIELSTPIPLFGFEKIHPHPRVQYSKP
ICVLINEQDFSCADFFPVVLKDNDRALIVGTRTAGAGGFVFNVQFPNRTGIKTCSLTGSLAVREHGAFIENIGVEPHIDL
PFTANDIRYKGYSEYLDKVKKLVCQLINNDGTIILAEDGSF
>P9WQF7 1.3.-.-~~~~~~Probable acyl-CoA dehydrogenase FadE10~~~COG1960
MAQQTQVTEEQARALAEESRESGWDKPSFAKELFLGRFPLGLIHPFPKPSDAEEARTEAFLVKLREFLDTVDGSVIERAA
QIPDEYVKGLAELGCFGLKIPSEYGGLNMSQVAYNRVLMMVTTVHSSLGALLSAHQSIGVPEPLKLAGTAEQKRRFLPRC
AAGAISAFLLTEPDVGSDPARMASTATPIDDGQAYELEGVKLWTTNGVVADLLVVMARVPRSEGHRGGISAFVVEADSPG
ITVERRNKFMGLRGIENGVTRLHRVRVPKDNLIGREGDGLKIALTTLNAGRLSLPAIATGVAKQALKIAREWSVERVQWG
KPVGQHEAVASKISFIAATNYALDAVVELSSQMADEGRNDIRIEAALAKLWSSEMACLVGDELLQIRGGRGYETAESLAA
RGERAVPVEQMVRDLRINRIFEGSSEIMRLLIAREAVDAHLTAAGDLANPKADLRQKAAAAAGASGFYAKWLPKLVFGEG
QLPTTYREFGALATHLRFVERSSRKLARNTFYGMARWQASLEKKQGFLGRIVDIGAELFAISAACVRAEAQRTADPVEGE
QAYELAEAFCQQATLRVEALFDALWSNTDSIDVRLANDVLEGRYTWLEQGILDQSEGTGPWIASWEPGPSTEANLARRFL
TVSPSSEAKL
>Q7A6D4 3.1.-.-~~~~~~Putative phosphoesterase SA0873~~~
MILGLALIPSKSFQEAVDSYRKRYDKQYSRIKPHVTIKAPFEIEDGDLDSVIEQVRARINGIPAVEVHATKASSFKPTNN
VIYFKVAKTDDLEELFNRFNGEDFYGEAEHVFVPHFTIAQGLSSQEFEDIFGQVALAGVDHKEIIDELTLLRFDDDEDKW
KVIETFKLA
>A1IQS5 ~~~~~~UPF0434 protein NMA0874~~~
MEKKFLDILVCPVTKGRLEYHQDKQELWSRQAKLAYPIKDGIPYMLENEARALSEEELKA
>P9WKR7 ~~~~~~Uncharacterized protein Rv0875c~~~
MKRGVATLPVILVILLSVAAGAGAWLLVRGHGPQQPEISAYSHGHLTRVGPYLYCNVVDLDDCQTPQAQGELPVSERYPV
QLSVPEVISRAPWRLLQVYQDPANTTSTLFRPDTRLAVTIPTVDPQRGRLTGIVVQLLTLVVDHSGELRDVPHAEWSVRL
IF
>P9WKR5 ~~~~~~Uncharacterized protein Rv0876c~~~COG2814
MSGRRGDHPGRMAPTPGRRTRNGSVNGHPGMANYPPDDANYRRSRRPPPMPSANRYLPPLGEQPEPERSRVPPRTTRAGE
RITVTRAAAMRSREMGSRMYLLVHRAATADGADKSGLTALTWPVMANFAVDSAMAVALANTLFFAAASGESKSRVALYLL
ITIAPFAVIAPLIGPALDRLQHGRRVALALSFGLRTALAVVLIMNYDGATGSFPSWVLYPCALAMMVFSKSFSVLRSAVT
PRVMPPTIDLVRVNSRLTVFGLLGGTIAGGAIAAGVEFVCTHLFQLPGALFVVVAITIAGASLSMRIPRWVEVTSGEVPA
TLSYHRDRGRLRRRWPEEVKNLGGTLRQPLGRNIITSLWGNCTIKVMVGFLFLYPAFVAKAHEANGWVQLGMLGLIGAAA
AVGNFAGNFTSARLQLGRPAVLVVRCTVLVTVLAIAAAVAGSLAATAIATLITAGSSAIAKASLDASLQHDLPEESRASG
FGRSESTLQLAWVLGGAVGVLVYTELWVGFTAVSALLILGLAQTIVSFRGDSLIPGLGGNRPVMAEQETTRRGAAVAPQ
>P9WKR3 ~~~~~~Uncharacterized protein Rv0877~~~
MTGPTEESAVATVADWPEGLAAVLRGAADQARAAVVEFSGPEAVGDYLGVSYEDGNAATHRFIAHLPGYQGWQWAVVVAS
YSGADHATISEVVLVPGPTALLAPDWVPWEQRVRPGDLSPGDLLAPAKDDPRLVPGYTASGDAQVDETAAEIGLGRRWVM
SAWGRAQSAQRWHDGDYGPGSAMARSTKRVCRDCGFFLPLAGSLGAMFGVCGNELSADGHVVDRQYGCGAHSDTTAPAGG
STPIYEPYDDGVLDIIEKPAES
>Q8Y8L7 ~~~~~~Cell wall protein Lmo0880~~~COG1388
MKKRWLVFAIICLIITGFLSPKAEAATDYGSSFFTNVSLQNQNGEQATNFKENSKVRVAYDFVITQPVASGETMTLTIPD
QLKLINYGGFPLMDSQGNTIANATIDQVTGTITLTFTDYVNTHTDLSGSLFYNATFNSKNIQTDQVNPIAFPVKNTTQTV
TPYISKVNSGGGTGSPTIVFKQGRMDDKDLSILHWTVTLNNALTPIDNAVYTDTLGSGQNLLGSATIKYRDANKKVIATN
IQPIALDADRNFELSIGALNNQSVVITYDTKITTKQKSYTNKATLSGDNLDAVSRNATVNDYGSGGQGTGTPPAPPVKEE
PPFIPAEKQPIEKTVETDFGPLEIVKDSEQNGKIKVIYKVKDGDTLPGVANKFDVSVAEIKDWNNLTSDTLQAGQKLQLT
IEKTLLSKITVPPVQKVTSTTRVDGVVKATGVLPHTGDSNPFIPFVTGLSLIALGFTFGRKS
>P9WMF1 ~~~~~~HTH-type transcriptional regulator Rv0880~~~COG1846
MLDSDARLASDLSLAVMRLSRQLRFRNPSSPVSLSQLSALTTLANEGAMTPGALAIRERVRPPSMTRVIASLADMGFVDR
APHPIDGRQVLVSVSESGAELVKAARRARQEWLAERLATLNRSERDILRSAADLMLALVDESP
>P9WKQ7 ~~~~~~Uncharacterized protein Rv0883c~~~
MRELKVVGLDADGKNIICQGAIPSEQFKLPVDDRLRAALRDDSVQPEQAQLDIEVTNVLSPKEIQARIRAGASVEQVAAA
SGSDIARIRRFAHPVLLERSRAAELATAAHPVLADGPAVLTMQETVAAALVARGLNPDSLTWDAWRNEDSRWTVQLAWKA
GRSDNLAHFRFTPGAHGGTATAIDDTAHELINPTFNRPLRPLAPVAHLDFDEPEPAQPTLTVPSAQPVSNRRGKPAIPAW
EDVLLGVRSGGRR
>P9WKQ5 ~~~~~~Uncharacterized protein Rv0885~~~COG3396
MDRTRIVRRWRRNMDVADDAEYVEMLATLSEGSVRRNFNPYTDIDWESPEFAVTDNDPRWILPATDPLGRHPWYQAQSRE
RQIEIGMWRQANVAKVGLHFESILIRGLMNYTFWMPNGSPEYRYCLHESVEECNHTMMFQEMVNRVGADVPGLPRRLRWV
SPLVPLVAGPLPVAFFIGVLAGEEPIDHTQKNVLREGKSLHPIMERVMSIHVAEEARHISFAHEYLRKRLPRLTRMQRFW
ISLYFPLTMRSLCNAIVVPPKAFWEEFDIPREVKKELFFGSPESRKWLCDMFADARMLAHDTGLMNPIARLVWRLCKIDG
KPSRYRSEPQRQHLAAAPAA
>P9WKQ3 ~~~~~~Uncharacterized protein Rv0887c~~~COG2764
MSLSGPRIGRAHQQQGDTMAINVEPALSPHLVVDDAASAIDFYVKAFDAVELGRVPGPDGKLIHAALRINGFTVMLNDDV
PQMCGGKSMTPTSLGGTPVTIHLTVTDVDAKFQRALNAGATVVTALEDQLWGDRYGVVADPFGHHWSLGQPVREVNMDEI
QAAMSSQGDG
>Q9RVY3 ~~~~~~Uncharacterized protein DR_0888~~~COG1652
MWPFGKSTADRVKDAFKANPVLAPLGLEVQESRGTVKVTGEVARQSQIGLINAVAGGINGVKNIDVSGVTVLQQASAPAA
QTAPTTPAQTSPSVQDSPSTPVQMPDIVQQGAGDVEIEDTSRIAKAVLSAIRGNGELANNPIDVLQSGNSVILRGAVDSD
HELRLAEQLARGVQGVSGVDISGLRVAQGAKELAKDKDEDTGDTVYTVKPGDSLSKIAEHYYGDQMEYKKIAHYNNISNP
DLIQPGQKLRIPG
>P9WMG1 ~~~~~~Putative HTH-type transcriptional regulator Rv0890c~~~COG2197
MRALLAQNRLVTLCGTGGVGKTRLAIQIASASELRDGLCFVDLAPITESGIVAATAARAVGLPDQPGRSTMDSLRRFIGN
RRMLMVLDNCEHLLDACAALVVELLGACPELTILATSREPIGMAGEITWRVPSMSITDEAVELFADRASRVQPGFTIANH
NAAAVGEICRRLDGIPLAIEFAAARVRSMSPLEIADGLDDCFRLLAGGVRGAVQRQQTLRASIDWSHALLTETEQILFRR
LAPFVGGFDLAAVRAVAAGSDLDPFSVLDQLTLLVDKSLVVADDCQGRTRYRLLETVRRYALEKLGDSGEADVHARHRDY
YTALAASLNTPADNDHQRLVARAETEIDNLRAAFAWSRENGHITEALQLASSLQPIWFGRAHLREGLSWFNSILEDQRFH
RLAVSTAVRARALADKAMLSTWLATSPVGATDIIAPAQQALAMAREVGDPAALVRALTACGCSSGYNAEAAAPYFAEATD
LARAIDDKWTLCQILYWRGVGTCISGDPNALRAAAEECRDLADTIGDRFVSRHCSLWLSLAQMWAGNLTEALELSREITA
EAEASNDVPTKVLGLYTQAQVLAYCGASAAHAIAGACIAAATELGGVYQGIGYAAMTYAALAAGDVTAALEASDAARPIL
RAQPDQVTMHQVLMAQLALAGGDAIAARQFANDAVDATNGWHRMVALTIRARVATARGEPELARDDAHAALACGAELHIY
QGMPDAMELLAGLAGEVGSHSEGVRLLGAAAALRQQTRQVRFKIWDAGYQASVTALREAMGDEDFDRAWAEGAALSTDEA
IAYAQRGRGERKRPARGWGSLTPTERDVVRLVSEGLSNKDIAKRLFVSPRTVQTHLTHVYAKLGLPSRVQLVDEAARRGS
PS
>P9WNG1 1.14.13.-~~~~~~Uncharacterized monooxygenase Rv0892~~~COG2072
MTGRCPTVAVVGAGMSGMCVAITLLSAGITDVCIYEKADDVGGTWRDNTYPGLTCDVPSRLYQYSFAKNPNWTQMFSRGG
EIQDYLRGIAERYGLRHRIRFGATVVSARFDDGRWVLRTDSGTESTVDFLISATGVLHHPRIPPIAGLDDFRGTVFHSAR
WDHTVPLLGRRIAVIGTGSTGVQLVCGLAGVAGKVTMFQRTAQWVLPWPNPRYSKLARVFHRAFPCLGSLAYKAYSLSFE
TFAVALSNPGLHRKLVGAVCRASLRRVRDPRLRRALTPDYEPMCKRLVMSGGFYRAIQRDDVELVTAGIDHVEHRGIVTD
DGVLHEVDVIVLATGFDSHAFFRPMQLTGRDGIRIDDVWQDGPHAHQTVAIPGFPNFFMMLGPHSPVGNFPLTAVAESQA
EHIVQWIKRWRHGEFDTMEPKSAATEAYNTVLRAAMPNTVWTTGCDSWYLNKDGIPEVWPFAPAKHRAMLANLHPEEYDL
RRYAAVRATSRPQSA
>P44923 ~~~~~~Uncharacterized HTH-type transcriptional regulator HI_0893~~~COG1309
MRQAKTDLAEQIFSATDRLMAREGLNQLSMLKLAKEANVAAGTIYLYFKNKDELLEQFAHRVFSMFMATLEKDFDETKPF
FEQYRQMWKNIWYFLQENPTILSNLKQYESLPNFKDICKNIKNCRWDLFCHQAQKAGLLAELSEDILFLLSLKTAINLAS
DAKFIDFDLKPEILESVIERSWRAIQK
>P9WFI1 2.1.1.-~~~~~~Putative S-adenosyl-L-methionine-dependent methyltransferase Rv0893c~~~COG3315
MRTEDDSWDVTTSVGSTGLLVAAARALETQKADPLAIDPYAEVFCRAAGGEWADVLDGKLPDHYLTTGDFGEHFVNFQGA
RTRYFDEYFSRATAAGMKQVVILAAGLDSRAFRLQWPIGTTIFELDRPQVLDFKNAVLADYHIRPRAQRRSVAVDLRDEW
QIALCNNGFDANRPSAWIAEGLLVYLSAEAQQRLFIGIDTLASPGSHVAVEEATPLDPCEFAAKLERERAANAQGDPRRF
FQMVYNERWARATEWFDERGWRATATPLAEYLRRVGRAVPEADTEAAPMVTAITFVSAVRTGLVADPARTSPSSTSIGFK
RFEAD
>P9WKA3 2.3.1.20~~~~~~Putative diacyglycerol O-acyltransferase Rv0895~~~COG1020
MRQQQEADVVALGRKPGLLCVPERFRAMDLPMAAADALFLWAETPTRPLHVGALAVLSQPDNGTGRYLRKVFSAAVARQQ
VAPWWRRRPHRSLTSLGQWSWRTETEVDLDYHVRLSALPPRAGTAELWALVSELHAGMLDRSRPLWQVDLIEGLPGGRCA
VYVKVHHALADGVSVMRLLQRIVTADPHQRQMPTLWEVPAQASVAKHTAPRGSSRPLTLAKGVLGQARGVPGMVRVVADT
TWRAAQCRSGPLTLAAPHTPLNEPIAGARSVAGCSFPIERLRQVAEHADATINDVVLAMCGGALRAYLISRGALPGAPLI
AMVPVSLRDTAVIDVFGQGPGNKIGTLMCSLATHLASPVERLSAIRASMRDGKAAIAGRSRNQALAMSALGAAPLALAMA
LGRVPAPLRPPNVTISNVPGPQGALYWNGARLDALYLLSAPVDGAALNITCSGTNEQITFGLTGCRRAVPALSILTDQLA
HELELLVGVSEAGPGTRLRRIAGRR
>P9WKP7 ~~~~~~Uncharacterized protein Rv0897c~~~COG1233
MSDHDRDFDVVVVGGGHNGLVAAAYLARAGLRVRLLERLAQTGGAAVSIQAFDGVEVALSRYSYLVSLLPSRIVADLGAP
VRLARRPFSSYTPAPATAGRSGLLIGPTGEPRAAHLAAIGAAPDAHGFAAFYRRCRLVTARLWPTLIEPLRTREQARRDI
VEYGGHEAAAAWQAMVDEPIGHAIAGAVANDLLRGVIATDALIGTFARMHEPSLMQNICFLYHLVGGGTGVWHVPIGGMG
SVTSALATAAARHGAEIVTGADVFALDPDGTVRYHSDGSDGAEHLVRGRFVLVGVTPAVLASLLGEPVAALAPGAQVKVN
MVVRRLPRLRDDSVTPQQAFAGTFHVNETWSQLDAAYSQAASGRLPDPLPCEAYCHSLTDPSILSARLRDAGAQTLTVFG
LHTPHSVFGDTEGLAERLTAAVLASLNSVLAEPIQDVLWTDAQSKPCIETTTTLDLQRTLGMTGGNIFHGALSWPFADND
DPLDTPARQWGVATDHERIMLCGSGARRGGAVSGIGGHNAAMAVLACLASRRKSP
>P9WKP5 ~~~~~~Uncharacterized protein Rv0898c~~~
MGKGRKPTDSETLAHIRDLVAEEKALRAQLRHGGISESEEQQQLRRIEIELDQCWDLLRQRRALRQTGGDPREAVVRPAD
QVEGYTG
>Q81UH1 ~~~~~~Uncharacterized protein BA_0901/GBAA_0901/BAS0853~~~COG3938
MKVSKVYTTIDAHVAGEPLRIITGGVPEIKGETQLERRWYCMEHLDYLREVLMYEPRGHHGMYGCIITPPASAHADFGVL
FMHNEGWSTMCGHGIIAVITVGIETGMFETKQKFIIDSPAGEVIAYAKYNGSEVESVSFENVPSFVYKKDVPIKIDNYEF
QVDIAFGGAFYAVVDSKEFGLKVDFKDLSAIQQWGGKIKHYIESKMEVKHPLEEGLKGIYGVIFSDDPKGEGATLRNVTI
FADGQVDRSPCGTGTSARIATLFEKGILQKGEIFIHECITDGEFEGEVLSVTAVHTYEAVVPKVTGNAFITGFHQFVVDP
RDDLNRGFLLG
>Q63F05 ~~~~~~UPF0145 protein BCE33L0904~~~
MIVTTTSGIQGKEIIEYIDIVNGEAIMGANIVRDLFASVRDVVGGRAGSYESKLKEARDIAMDEMKELAKQKGANAIVGV
DVDYEVVRDGMLMVAVSGTAVRI
>P9WKP3 ~~~~~~Uncharacterized protein Rv0906~~~COG2220
MVRRALRLAAGTASLAAGTWLLRALHGTPAALGADAASIRAVSEQSPNYRDGAFVNLDPASMFTLDREELRLIVWELVAR
HSASRPAAPIPLASPNIYRGDASRLAVSWFGHSTALLEIDGYRVLTDPVWSDRCSPSDVVGPQRLHPPPVQLAALPAVDA
VVISHDHYDHLDIDTVVALVGMQRAPFLVPLGVGAHLRSWGVPQDRIVELDWNQSAQVDELTVVCVPARHFSGRFLSRNT
TLWASWAFVGPNHRAYFGGDTGYTKSFTQIGADHGPFDLTLLPIGAYNTAWPDIHMNPEEAVRAHLDVTDSGSGMLVPVH
WGTFRLAPHPWGEPVERLLAAAEPEHVTVAVPLPGQRVDPTGPMRLHPWWRL
>P9WJ07 ~~~~~~Antitoxin Rv0909~~~
MGILDKVKNLLSQNADKVETVINKAGEFVDEQTQGNYSDAIHKLHDAASNVVGMSDQQS
>P9WJ05 ~~~~~~Toxin Rv0910~~~COG3427
MAKLSGSIDVPLPPEEAWMHASDLTRYREWLTIHKVWRSKLPEVLEKGTVVESYVEVKGMPNRIKWTIVRYKPPEGMTLN
GDGVGGVKVKLIAKVAPKEHGSVVSFDVHLGGPALLGPIGMIVAAALRADIRESLQNFVTVFAG
>O25581 5.3.2.-~~~~~~Probable tautomerase HP_0924~~~COG1942
MPFINIKLVPENGGPTNEQKQQLIEGVSDLMVKVLNKNKASIVVIIDEVDSNNYGLGGESVHHLRQKN
>P9WGQ5 1.-.-.-~~~~~~Uncharacterized oxidoreductase Rv0927c~~~COG1028
MILDMFRLDDKVAVITGGGRGLGAAIALAFAQAGADVLIASRTSSELDAVAEQIRAAGRRAHTVAADLAHPEVTAQLAGQ
AVGAFGKLDIVVNNVGGTMPNTLLSTSTKDLADAFAFNVGTAHALTVAAVPLMLEHSGGGSVINISSTMGRLAARGFAAY
GTAKAALAHYTRLAALDLCPRVRVNAIAPGSILTSALEVVAANDELRAPMEQATPLRRLGDPVDIAAAAVYLASPAGSFL
TGKTLEVDGGLTFPNLDLPIPDL
>Q9JZR5 ~~~~~~Uncharacterized protein NMB0928~~~
MPSEPFGRHNATNTLISITQDDTMTHIKPVIAALALIGLAACSGSKTEQPKLDYQSRSHRLIKLEVPPDLNNPDQGNLYR
LPAGSGAVRASDLEKRRTPAVQQPADAEVLKSVKGVRLERDGSQRWLVVDGKSPAEIWPLLKAFWQENGFDIKSEEPAIG
QMETEWAENRAKIPQDSLRRLFDKVGLGGIYSTGERDKFIVRIEQGKNGVSDIFFAHKAMKEVYGGKDKDTTVWQPSPSD
PNLEAAFLTRFMQYLGVDGQQAENASAKKPTLPAANEMARIEGKSLIVFGDYGRNWRRTVLALDRIGLTVVGQNTERHAF
LVQKAPNESNAVTEQKPGLFKRLLGKGKAEKPAEQPELIVYAEPVANGSRIVLLNKDGSAYAGKDASALLGKLHSELR
>P44941 ~~~~~~Uncharacterized protein HI_0933~~~COG2081
MSQYSENIIIGAGAAGLFCAAQLAKLGKSVTVFDNGKKIGRKILMSGGGFCNFTNLEVTPAHYLSQNPHFVKSALARYTN
WDFISLVAEQGITYHEKELGQLFCDEGAEQIVEMLKSECDKYGAKILLRSEVSQVERIQNDEKVRFVLQVNSTQWQCKNL
IVATGGLSMPGLGATPFGYQIAEQFGIPVIPPRASLVPFTYRETDKFLTALSGISLPVTITALCGKSFYNQLLFTHRGIS
GPAVLQISNYWQPTESVEIDLLPNHNVEEEINQAKQSSPKQMLKTILVRLLPKKLVELWIEQGIVQDEVIANISKVRVKN
LVDFIHHWEFTPNGTEGYRTAEVTMGGVDTKVISSKTMESNQVSGLYFIGEVLDVTGWLGGYNFQWAWSSAYACALSISR
Q
>P9WKP1 ~~~~~~Uncharacterized protein Rv0940c~~~COG2141
MRFSYAEAMTDFTFYIPLAKAAEAAGYSSMTIPDSIAYPFESDSKYPYTPDGNREFMDGKPFIETFVLTAALGAVTTRLR
FNFFVLKLPIRPPALVAKQAGSLAALIGNRVGLGVGTSPWPEDYELMGVPFAKRGKRIDECIEIVRGLTTGDYFEFHGEF
YDIPKTKMTPAPTQPIPILVGGHADAALRRAARADGWMHGGGDPDELDRLIARVKRLREEAGKTSPFEIHVISLDGFTVD
GVKRLEDKGVTDVIVGFRVPYTMGPDTEPLQTKIRNLEMFAENVIAKV
>P9WGR7 1.-.-.-~~~~~~Uncharacterized oxidoreductase Rv0945~~~COG4221
MLTGVTRQKILITGASSGLGAGMARSFAAQGRDLALCARRTDRLTELKAELSQRYPDIKIAVAELDVNDHERVPKVFAEL
SDEIGGIDRVIVNAGIGKGARLGSGKLWANKATIETNLVAALVQIETALDMFNQRGSGHLVLISSVLGVKGVPGVKAAYA
ASKAGVRSLGESLRAEYAQRPIRVTVLEPGYIESEMTAKSASTMLMVDNATGVKALVAAIEREPGRAAVPWWPWAPLVRL
MWVLPPRLTRRFA
>Q97R80 ~~~~~~UPF0346 protein SP_0947~~~COG4479
MRKSFYTWLMTERNPKSNSPKAILADLAFEESAFPKHTDDFDEVSRFLEEHASFSFNLGDFDSIWQEYLEH
>Q45979 ~~~~~~Uncharacterized protein CC_0952~~~COG3190
MDLIESIRALAALAFTLGLIGLAAWALRKYGPDSIGRAIAARQDRRLKVIESLALDPTRRLVVVSLDGEERLVLLGDGRL
LDWTPKGPPPASALSPSPVAEPEPVV
>Q83CZ8 ~~~~~~Uncharacterized protein CBU_0952~~~
MKKLTVTFLTFISIFFAATAAFAENRPILNTINYQQQVEKWVTTDSADVMVSVNVTTKEKKFDALQHQVMKKLEELSDGR
QWHIDSFSMSQDQSGLEVLSWEVRSRMPLALVNSLRQKIDSLSQAGQQYKIQNVDFEPSLVEKEKAFAELRQRVYDQVKV
ELDNLNKSFPNGHYFLHSIDFVSPPLYAANQKELTLMRSAPSEKTAVTLGRNLMLIANVKVATFLNK
>P9WKN5 ~~~~~~Uncharacterized protein Rv0953c~~~COG2141
MHYGLVLFTSDRGITPAAAARLAESHGFRTFYVPEHTHIPVKRQAAHPTTGDASLPDDRYMRTLDPWVSLGAASAVTSRI
RLATAVALPVEHDPITLAKSIATLDHLSHGRVSVGVGFGWNTDELVDHGVPPGRRRTMLREYLEAMRALWTQEEACYDGE
FVKFGPSWAWPKPVQPHIPVLVGAAGTEKNFKWIARSADGWITTPRDVDIDEPVKLLQDIWAAAGRDGLPQIVALDVKPV
PDKLARWAELGVTEVLFGMPDRSADDAAAYVERLAAKLACCV
>P0CV86 ~~~~~~Uncharacterized protein Rv1954A~~~
MARGRVVCIGDAGCDCTPGVFRATAGGMPVLVVIESGTGGDQMARKATSPGKPAPTSGQYRPVGGGNEVTVPKGHRLPPS
PKPGQKWVNVDPTKNKSGRG
>P9WKN3 ~~~~~~Uncharacterized protein Rv0955~~~
MNRVSASADDRAAGARPARDLVRVAFGPGVVALGIIAAVTLLQLLIANSDMTGAWGAIASMWLGVHLVPISIGGRALGVM
PLLPVLLMVWATARSTARATSPQSSGLVVRWVVASALGGPLLMAAIALAVIHDASSVVTELQTPSALRAFTSVLVVHSVG
AATGVWSRVGRRALAATALPDWLHDSMRAAAAGVLALLGLSGVVTAGSLVVHWATMQELYGITDSIFGQFSLTVLSVLYA
PNVIVGTSAIAVGSSAHIGFATFSSFAVLGGDIPALPILAAAPTPPLGPAWVALLIVGASSGVAVGQQCARRALPFVAAM
AKLLVAAVAGALVMAVLGYGGGGRLGNFGDVGVDEGALVLGVLFWFTFVGWVTVVIAGGISRRPKRLRPAPPVELDADES
SPPVDMFDGAASEQPPASVAEDVPPSHDDIANGLKAPTADDEALPLSDEPPPRAD
>O83922 ~~~~~~Uncharacterized protein TP_0956~~~
MKHPSVRVCCFAFASCLLCAGCSLKRLAFSSLSHTLAPFPEGELDAHLSDADFTRVFTEEDDLDLVAQSLPLVLKVYEAL
HLQNPAHRGLSLAVGRLYIMYANAFVQTPAQYLPEDEFEAQNEAYSRARKLYLRGARYALSSLETAYPGFTREVFSGDEQ
RLHKVLSRCTRVDVGTLYWVGTGYVAAFALTPLGSALPDTVHAAVMMLERACDLWPSYQEGAVWNVLTKFYAAAPESFGG
GMEKAHTAFEHLTRYCSAHDPDHHITYADALCIPLNNRAGFDEALDRALAIDPESVPHNKLLVILSQKRARWLKAHVQDF
FLD
>Q99UZ6 ~~~~~~UPF0637 protein SA0957~~~
MTKYTFKPKDFKAFNVEGLDARMEALNEYIRPQLHELGEYFSDFFTSQTGETFYPHVAKHARRSVNAPKDTWVAFATSKR
GYKMLPHFQIGMFEDQLFVMFGIMHEAKDKATRAKVFERKFKAIQQLPDDYRVCLDHMKPDKPFIKDLTDDDLKEAIQRA
INVKKGEFFIARAITPQDKRLKSDKAFIAFLEETFDQFLPFYSA
>P9WKN1 ~~~~~~Uncharacterized protein Rv0959~~~COG4867
MAKSDGDDPLRPASPRLRSSRRHSLRYSAYTGGPDPLAPPVDLRDALEQIGQDVMAGASPRRALSELLRRGTRNLTGADR
LAAEVNRRRRELLRRNNLDGTLQEIKKLLDEAVLAERKELARALDDDARFAELQLDALPASPAKAVQELAEYRWRSGQAR
EKYEQIKDLLGRELLDQRFAGMKQALAGATDDDRRRVTEMLDDLNDLLDKHARGEDTQRDFDEFMTKHGEFFPENPRNVE
ELLDSLAKRAAAAQRFRNSLSQEQRDELDALAQQAFGSPALMRALDRLDAHLQAARPGEDWTGSQQFSGDNPFGMGEGTQ
ALADIAELEQLAEQLSQSYPGASMDDVDLDALARQLGDQAAVDARTLAELERALVNQGFLDRGSDGQWRLSPKAMRRLGE
TALRDVAQQLSGRHGERDHRRAGAAGELTGATRPWQFGDTEPWHVARTLTNAVLRQAAAVHDRIRITVEDVEVAETETRT
QAAVALLVDTSFSMVMENRWLPMKRTALALHHLVCTRFRSDALQIIAFGRYARTVTAAELTGLAGVYEQGTNLHHALALA
GRHLRRHAGAQPVVLVVTDGEPTAHLEDFDGDGTSVFFDYPPHPRTIAHTVRGFDDMARLGAQVTIFRLGSDPGLARFID
QVARRVQGRVVVPDLDGLGAAVVGDYLRFRRR
>Q9JZN9 ~~~~~~Probable TonB-dependent receptor NMB0964~~~
MAQTTLKPIVLSILLINTPLLAQAHETEQSVDLETVSVVGKSRPRATSGLLHTSTASDKIISGDTLRQKAVNLGDALDGV
PGIHASQYGGGASAPVIRGQTGRRIKVLNHHGETGDMADFSPDHAIMVDTALSQQVEILRGPVTLLYSSGNVAGLVDVAD
GKIPEKMPENGVSGELGLRLSSGNLEKLTSGGINIGLGKNFVLHTEGLYRKSGDYAVPRYRNLKRLPDSHADSQTGSIGL
SWVGEKGFIGVAYSDRRDQYGLPAHSHEYDDCHADIIWQKSLINKRYLQLYPHLLTEEDIDYDNPGLSCGFHDDDNAHAH
THSGRPWIDLRNKRYELRAEWKQPFPGFEALRVHLNRNDYRHDEKAGDAVENFFNNQTQNARIELRHQPIGRLKGSWGVQ
YLQQKSSALSAISEAVKQPMLLDNKVQHYSFFGVEQANWDNFTLEGGVRVEKQKASIQYDKALIDRENYYNHPLPDLGAH
RQTARSFALSGNWYFTPQHKLSLTASHQERLPSTQELYAHGKHVATNTFEVGNKHLNKERSNNIELALGYEGDRWQYNLA
LYRNRFGNYIYAQTLNDGRGPKSIEDDSEMKLVRYNQSGADFYGAEGEIYFKPTPRYRIGVSGDYVRGRLKNLPSLPGRE
DAYGNRPFIAQDDQNAPRVPAARLGFHLKASLTDRIDANLDYYRVFAQNKLARYETRTPGHHMLNLGANYRRNTRYGEWN
WYVKADNLLNQSVYAHSSFLSDTPQMGRSFTGGVNVKF
>P9WKM1 ~~~~~~Uncharacterized protein Rv0966c~~~COG4758
MSNSAQRDARNSRDESARASDTDRIQIAQLLAYAAEQGRLQLTDYEDRLARAYAATTYQELDRLRADLPGAAIGPRRGGE
CNPAPSTLLLALLGGFERRGRWNVPKKLTTFTLWGSGVLDLRYADFTSTEVDIRAYSIMGAQTILLPPEVNVEIHGHRVM
GGFDRKVVGEGTRGVPTVRIRGFSLWGDVGIKRKPRKPRK
>P9WKL9 ~~~~~~Uncharacterized protein Rv0968~~~
MVWHGFLAKAVPTVVTGAVGVAAYEALRKMVVKAPLRAATVSVAAWGIRLAREAERKAGESAEQARLMFADVLAEASERA
GEEVPPLAVAGSDDGHDH
>P9WKL7 ~~~~~~Uncharacterized protein Rv0970~~~
MIHDLMLRWVVTGLFVLTAAECGLAIIAKRRPWTLIVNHGLHFAMAVAMAVMAWPWGARVPTTGPAVFFLLAAVWFGATA
VVAVRGTATRGLYGYHGLMMLATAWMYAAMNPRLLPVRSCTEYATEPDGSMPAMDMTAMNMPPNSGSPIWFSAVNWIGTV
GFAVAAVFWACRFVMERRQEATQSRLPGSIGQAMMAAGMAMLFFAMLFPV
>Q9X078 ~~~~~~Putative sulfur carrier protein TM_0983~~~COG0425
MAKYQVTKTLDVRGEVCPVPDVETKRALQNMKPGEILEVWIDYPMSKERIPETVKKLGHEVLEIEEVGPSEWKIYIKVK
>P9WQK1 ~~~~~~Uncharacterized ABC transporter ATP-binding protein Rv0986~~~COG1136
MNRQPIVQLSNLSWTFREGETRRQVLDHITFDFEPGEFVALLGQSGSGKSTLLNLISGIEKPTTGDVTINGFAITQKTER
DRTLFRRDQIGIVFQFFNLIPTLTVLENITLPQELAGVSQRKAAVVARDLLEKVGMADRERTFPDKLSGGEQQRVAISRA
LAHNPMLVLADEPTGNLDSDTGDKVLDVLLDLTRQAGKTLIMATHSPSMTQHADRVVNLQGGRLIPAVNRENQTDQPAST
ILLPTSYE
>O53900 ~~~~~~Uncharacterized ABC transporter permease protein Rv0987~~~COG0577
MNDQAPVAYAPLWRTAWRRLRQRPFQYILLVLGIALGVAMIVAIDVSSNSAQRAFDLSAAAITGKSTHRLVSGPAGVDQQ
LYVDLRRHGYDFSAPVIEGYVLARGLGNRAMQFMGTDPFAESAFRSPLWSNQNIAELGGFLTRPNGVVLSRQVAQKYGLA
VGDRIALQVKGAPTTVTLVGLLTPADEVSNQKLSDLIIADISTAQELFHMPGRLSHIDLIIKDEATATRIQQRLPAGVRM
ETSDTQRDTVKQMTDAFTVNLTALSLIALLVGIFLIYNTVTFNVVQRRPFFAILRCLGVTREQLFWLIMTESLVAGLIGT
GLGLLIGIWLGEGLIGLVTQTINDFYFVINVRNVSVSAESLLKGLIIGIFAAMLATLPPAIEAMRTVPASTLRRSSLESK
ITKLMPWLWVAWFGLGSFGVLMLWLPGNNLVVAFVGLFSVLIALALIAPPLTRFVMLRLAPGLGRLLGPIGRMAPRNIVR
SLSRTSIAIAALMMAVSLMVGVSISVGSFRQTLANWLEVTLKSDVYVSPPTLTSGRPSGNLPVDAVRNISKWPGVRDAVM
ARYSSVFAPDWGREVELMAVSGDISDGKRPYRWIDGNKDTLWPRFLAGKGVMLSEPMVSRQHLQMPPRPITLMTDSGPQT
FPVLAVFSDYTSDQGVILMDRASYRAHWQDDDVTTMFLFLASGANSGALIDQLQAAFAGREDIVIQSTHSVREASMFIFD
RSFTITIALQLVATVVAFIGVLSALMSLELDRAHELGVFRAIGMTTRQLWKLMFIETGLMGGMAGLMALPTGCILAWILV
RIINVRSFGWTLQMHFESAHFLRALLVAVVAALAAGMYPAWRLGRMTIRTAIREE
>Q7A161 ~~~~~~UPF0358 protein MW0995~~~
MAKQATMKNAALKQLTKDADEILHLIKVQLDNLTLPSCPLYEEVLDTQMFGLQKEVDFAVKLGLVDREDGKQIMLRLEKE
LSKLHEAFTLV
>P0A8I3 ~~~yaaA~~~Peroxide stress resistance protein YaaA~~~COG3022
MLILISPAKTLDYQSPLTTTRYTLPELLDNSQQLIHEARKLTPPQISTLMRISDKLAGINAARFHDWQPDFTPANARQAI
LAFKGDVYTGLQAETFSEDDFDFAQQHLRMLSGLYGVLRPLDLMQPYRLEMGIRLENARGKDLYQFWGDIITNKLNEALA
AQGDNVVINLASDEYFKSVKPKKLNAEIIKPVFLDEKNGKFKIISFYAKKARGLMSRFIIENRLTKPEQLTGFNSEGYFF
DEDSSSNGELVFKRYEQR
>P37542 ~~~yabA~~~Initiation-control protein YabA~~~COG4467
MDKKELFDTVINLEEQIGSLYRQLGDLKQHIGEMIEENHHLQLENKHLRKRLDDTTQQIEKFKADKKESKTQKTEQTDIG
EGYDNLARLYQEGFHICNVHYGSVRKEDCLFCLSFLNKK
>Q04KY8 ~~~~~~Initiation-control protein YabA~~~COG4467
MDKKELFDALDDFSQQLLVTLADVEAIKKNLKSLVEENTALRLENSKLRERLGEVEADAPVKAKHVRESVRRIYRDGFHV
CNDFYGQRREQDEECMFCDELLYRE
>P37546 ~~~yabE~~~Putative cell wall shaping protein YabE~~~COG3583
MKKLFSVKLSKSKVILVAACLLLAGSGTAYAAHELTKQSVSVSINGKKKHIRTHANTVGDLLETLDIKTRDEDKITPAKQ
TKITADMDVVYEAAKPVKLTINGEEKTLWSTAKTVGALLDEQDVDVKEQDQIDPAIDTDISKDMKINIEPAFQVTVNDAG
KQKKIWTTSTTVADFLKQQKMNIKDEDKIKPALDAKLTKGKADITITRIEKVTDVVEEKIAFDVKKQEDASLEKGKEKVV
QKGKEGKLKKHFEVVKENGKEVSRELVKEETAEQSKDKVIAVGTKQSSPKFETVSASGDSKTVVSRSNESTGKVMTVSST
AYTASCSGCSGHTATGVNLKNNPNAKVIAVDPNVIPLGSKVHVEGYGYAIAADTGSAIKGNKIDVFFPEKSSAYRWGNKT
VKIKILN
>P37548 3.4.-.-~~~yabG~~~Sporulation-specific protease YabG~~~
MQFQIGDMVARKSYQMDVLFRIIGIEQTSKGNSIAILHGDEVRLIADSDFSDLVAVKKDEQMMRKKKDESRMNESLELLR
QDYKLLREKQEYYATSQYQHQEHYFHMPGKVLHLDGDEAYLKKCLNVYKKIGVPVYGIHCHEKKMSASIEVLLDKYRPDI
LVITGHDAYSKQKGGIDDLNAYRHSKHFVETVQTARKKIPHLDQLVIFAGACQSHFESLIRAGANFASSPSRVNIHALDP
VYIVAKISFTPFMERINVWEVLRNTLTREKGLGGIETRGVLRIGMPYKSN
>P30149 ~~~yabI~~~Inner membrane protein YabI~~~COG0586
MQALLEHFITQSTVYSLMAVVLVAFLESLALVGLILPGTVLMAGLGALIGSGELSFWHAWLAGIIGCLMGDWISFWLGWR
FKKPLHRWSFLKKNKALLDKTEHALHQHSMFTILVGRFVGPTRPLVPMVAGMLDLPVAKFITPNIIGCLLWPPFYFLPGI
LAGAAIDIPAGMQSGEFKWLLLATAVFLWVGGWLCWRLWRSGKATDRLSHYLSRGRLLWLTPLISAIGVVALVVLIRHPL
MPVYIDILRKVVGV
>P37557 ~~~yabO~~~Uncharacterized protein YabO~~~COG1188
MRLDKFLKVSRLIKRRTLAKEVADQGRISINGNQAKASSDVKPGDELTVRFGQKLVTVQVNELKDTTKKEEAANMYTILK
EEKLGE
>P37558 ~~~yabP~~~Spore protein YabP~~~
MNSYYDQKGSSSVPEQHDVTMKGRKHLDISGVKHVESFDNEEFLLETVMGMLSVRGQNLQMKNLDVEKGIVSIKGRVFDL
VYLDEQQGDKAKGFFSKLFK
>P37559 ~~~yabQ~~~Spore protein YabQ~~~
MTLTTQFYTMLAMSGMGLWLGASLDTYRLFVIRAKTARWLLFIHDILFWIMQGLLFFYVLLHVNEGEFRIYIFLAVLLGV
ATYQSLCKRIYIKILKFVIYLVVSVYQFFKKLIQHVLFRPIVWTCGAIIWLAAFLFKKTYSLIGFLLLCLYKIVMVLCFP
IRFIAKQCLKLLPVKMRLTFRRYFEKGAGFLKKKKKLLITIRTTITRFLKR
>P0A8H8 ~~~yacG~~~DNA gyrase inhibitor YacG~~~COG3024
MSETITVNCPTCGKTVVWGEISPFRPFCSKRCQLIDLGEWAAEEKRIPSSGDLSESDDWSEEPKQ
>Q06754 3.1.-.-~~~yacL~~~Uncharacterized PIN and TRAM-domain containing protein YacL~~~COG4956
MLKRIVQAFFIIFGGVVGIFLIPELFVLLNIQDIPLITNAYTSAAIGAIIFFLISIWGTEYVVNWVKWIEDSLLKAPVPD
LLFGSLGLVFGLIIAYLIVNVIPLDNIPYRIFSTIIPVFLAFFLGYLGFQVGFKKKDELISLFSISARMQKKKGTADEEH
EVQDKKLKILDTSVIIDGRIADICQTGFLEGVIVIPQFVLEELQHIADSSDVLKRNRGRRGLDILNRIQKELDIEVEIYE
GDFEDIQEVDSKLVKLAKLTSGVVVTNDFNLNKVCELQKVAVLNINDLANAVKPVVLPGEEMNVQVIKDGKEHNQGVAYL
DDGTMIVVEEGRNYIGKHIDVLVTSVLQTAAGRMIFAKPKLLEKAL
>P0A8E5 ~~~yacL~~~UPF0231 protein YacL~~~COG3112
MDYEFLRDITGVVKVRMSMGHEVVGHWFNEEVKENLALLDEVEQAAHALKGSERSWQRAGHEYTLWMDGEEVMVRANQLE
FAGDEMEEGMNYYDEESLSLCGVEDFLQVVAAYRNFVQQK
>P37574 ~~~yacP~~~Uncharacterized protein YacP~~~COG3688
MDILLVDGYNMIGAWPQLKDLKANSFEEARDVLIQKMAEYQSYTGNRVIVVFDAHLVKGLEKKQTNHRVEVIFTKENETA
DERIEKLAQALNNIATQIHVATSDYTEQWAIFGQGALRKSARELLREVETIERRIERRVRKITSEKPAGKIALSEEVLKT
FEKWRRGDLD
>P31489 ~~~yadA~~~Adhesin YadA~~~
MTKDFKISVSAALISALFSSPYAFADDYDGIPNLTAVQISPNADPALGLEYPVRPPVPGAGGLNASAKGIHSIAIGATAE
AAKGAAVAVGAGSIATGVNSVAIGPLSKALGDSAVTYGAASTAQKDGVAIGARASTSDTGVAVGFNSKADAKNSVAIGHS
SHVAANHGYSIAIGDRSKTDRENSVSIGHESLNRQLTHLAAGTKDTDAVNVAQLKKEIEKTQENTNKRSAELLANANAYA
DNKSSSVLGIANNYTDSKSAETLENARKEAFAQSKDVLNMAKAHSNSVARTTLETAEEHANSVARTTLETAEEHANKKSA
EALASANVYADSKSSHTLKTANSYTDVTVSNSTKKAIRESNQYTDHKFRQLDNRLDKLDTRVDKGLASSAALNSLFQPYG
VGKVNFTAGVGGYRSSQALAIGSGYRVNENVALKAGVAYAGSSDVMYNASFNIEW
>P0C2W0 ~~~yadA~~~Adhesin YadA~~~
MTKDFKISVSAALISALFSSPYAFANNDEVHFTAVQISPNSDPDSHVMIFQPEVRAPGGTNALAKGTHSIAVGASAEAAE
RAAVAVGAGSIATGVNSVAIGPLSKALGDSAVTYGAGSTAQKDGVAIGARASTSDTGVAVGFNSKVDAKNSVSIGHSSHV
AVDHDYSIAIGDRSKTDRKNSVSIGHESLNRQLTHLAAGTKDTDAVNVAQLKKEIEKTQENANKKSAEVLGIANNYTDSK
SAETLENARKEAFDLSNDALDMAKKHSNSVARTTLETAEEHTNKKSAETLASANVYADSKSSHTLKTANSYTDVTVSNST
KKAIRESNQYTDHKFHQLDNRLDKLDTRVDKGLASSAALNSLFQPYGVGKVNFTAGVGGYRSSQALAIGSGYRVNESVAL
KAGVAYAGSSDVMYNASFNIEW
>A1JUB7 ~~~yadA~~~Adhesin YadA~~~COG1293
MTKDFKISVSAALISALFSSPYAFANNDEVHFTAVQISPNADPDSHVVIFQPAAEALGGTNALAKSIHSIAVGASAEAAK
QAAVAVGAGSIATGVNSVAIGPLSKALGDSAVTYGAASTAQKDGVAIGARAFTSDTGVAVGFNSKVDAKNSVAIGHSSHV
AVDHDYSIAIGDRSKTDRKNSVSIGHESLNRQLTHLAAGTKDTDAVNVAQLKKEIEKTQVNANKKSAEVLGIANNYTDSK
SAETLENARKEAFDLSNDALDMAKKHSNSVARTTLETAEEHTNKKSAETLARANVYADSKSSHTLQTANSYTDVTVSNST
KKAIRESNQYTDHKFRQLDNRLDKLDTRVDKGLASSAALNSLFQPYGVGKVNFTAGVGGYRSSQALAIGSGYRVNESVAL
KAGVAYAGSSDVMYNASFNIEW
>P10858 ~~~yadA~~~Adhesin YadA~~~
MTKDFKISVSAALISALFSSPYAFAEEPEDGNDGIPRLSAVQISPNVDPKLGVGLYPAKPILRQENPKLPPRGPQGPEKK
RARLAEAIQPQVLGGLDARAKGIHSIAIGATAEAAKPAAVAVGAGSIATGVNSVAIGPLSKALGDSAVTYGASSTAQKDG
VAIGARASASDTGVAVGFNSKVDAQNSVAIGHSSHVAADHGYSIAIGDLSKTDRENSVSIGHESLNRQLTHLAAGTKDND
AVNVAQLKKEMAETLENARKETLAQSNDVLDAAKKHSNSVARTTLETAEEHANKKSAEALVSAKVYADSNSSHTLKTANS
YTDVTVSSSTKKAISESNQYTDHKFSQLDNRLDKLDKRVDKGLASSAALNSLFQPYGVGKVNFTAGVGGYRSSQALAIGS
GYRVNESVALKAGVAYAGSSNVMYNASFNIEW
>P31058 ~~~yadC~~~Uncharacterized fimbrial-like protein YadC~~~COG3539
MKTIFRYILFLALYSCCNTVSAYTSFIVGNNAGVDNYRGPSTAAQMTFNYTSTASNLVFYKPTQLGPTGVKMYWSYLDTG
TGGGILYCNTSGRANPGPITIENAMVYSGKDYGGHKLFNTSVPGLYYTMLISRVWSAYDTITDIQSPGIYIGDPSNQEFF
FSVTDSDLQTKGCNKADDYDKFWAIGGIVHNITVEFYTDTNFDPTLNQQVQLSSSSNYLYSFKAYSPGTKVVDHSNHIYV
NFTLNNVKLTLPTCFTSILTGPSVNGSTVRMGEYSSGTIKNGASPVPFDISLQNCIRVRNIETKLVTGKVGTQNTQLLGN
TLTGSTAAKGVGVLIEGLATSKNPLMTLKPNDTNSVYIDYETEDDTSDGVYPNQGNGTSQPLHFQATLKQDGNIAIEPGE
FKATSTFQVTYP
>P36879 ~~~yadG~~~Uncharacterized ABC transporter ATP-binding protein YadG~~~COG1131
MTIALELQQLKKTYPGGVQALRGIDLQVEAGDFYALLGPNGAGKSTTIGIISSLVNKTSGRVSVFGYDLEKDVVNAKRQL
GLVPQEFNFNPFETVQQIVVNQAGYYGVERKEAYIRSEKYLKQLDLWGKRNERARMLSGGMKRRLMIARALMHEPKLLIL
DEPTAGVDIELRRSMWGFLKDLNDKGTTIILTTHYLEEAEMLCRNIGIIQHGELVENTSMKALLAKLKSETFILDLAPKS
PLPKLDGYQYRLVDTATLEVEVLREQGINSVFTQLSEQGIQVLSMRNKANRLEELFVSLVNEKQGDRA
>P0AFN6 ~~~yadH~~~Inner membrane transport permease YadH~~~COG0842
MMHLYWVALKSIWAKEIHRFMRIWVQTLVPPVITMTLYFIIFGNLIGSRIGDMHGFSYMQFIVPGLIMMSVITNAYANVA
SSFFGAKFQRNIEELLVAPVPTHVIIAGYVGGGVARGLFVGILVTAISLFFVPFQVHSWVFVALTLVLTAVLFSLAGLLN
GVFAKTFDDISLVPTFVLTPLTYLGGVFYSLTLLPPFWQGLSHLNPIVYMISGFRYGFLGINDVPLVTTFGVLVVFIVAF
YLICWSLIQRGRGLRS
>P37016 ~~~yadK~~~Uncharacterized fimbrial-like protein YadK~~~COG3539
MHPTQRKLMKRIILFLSLLFCIACPAIAGQDIDLVANVKNSTCKSGISNQGNIDLGVVGVGYFSGNVTPESYQPGGKEFT
ITVSDCALQGTGDVLNQLHIDFRALSGVMAAGSRQIFANEISSGASNVGVVIFSTQDSANTFNVLNASGGSRSVYPVMSD
DMNGSSWKFSTRMQKIDPALSVTSGQLMSHVLVDIYYE
>P37017 ~~~yadL~~~Uncharacterized fimbrial-like protein YadL~~~COG3539
MMTFKNLRYGLSSSVVLAASLFSVLSYAATDSIGLTVITTVEMGTCTATLVNDSDQDISVVDFGDVYISEINAKTKVKTF
KLKFKDCAGIPNKKAQIKLTKRATCEGTANDGAGFANGSTAADKASAVAVEVWSTVTPATGSATQFSCVTPASQEVTIST
AANAVVYYPMSARLVVEKNKTVNNVTAGKFSAPATFTVTYN
>P37018 ~~~yadM~~~Uncharacterized fimbrial-like protein YadM~~~COG3539
MIKTTPHKIVILMGILLSPSVFATDINVEFTATVKATTCNITLTGNNVTNDGNNNYTLRIPKMGLDKIANKTTESQADFK
LVASGCSSGISWIDTTLTGNASSSSPKLIIPQSGDSSSTTSNIGMGFKKRTTDDATFLKPNSAEKIRWSTDEMQPDKGLE
MTVALRETDAGQGVPGNFRALATFNFIYQ
>P37050 ~~~yadN~~~Uncharacterized fimbrial-like protein YadN~~~COG3539
MSKKLGFALSGLMLAMVAGTASADMDGGQLNISGLVVDNTCETRVDGGNKDGLILLQTATVGEIDAGVLNDTVGAKAKPF
SITVDCSKANPNPGSTAKMTFGSVFFGNSKGTLNNDMSINNPSDGVNIALHNIDGSTIKQVQINNPGDVYTKALDATTKS
AVYDFKASYVRAVADQTATAGYVKTNTAYTITYQ
>P0AFP0 ~~~yadS~~~UPF0126 inner membrane protein YadS~~~COG2860
MLVYWLDIVGTAVFAISGVLLAGKLRMDPFGVLVLGVVTAVGGGTIRDMALDHGPVFWVKDPTDLVVAMVTSMLTIVLVR
QPRRLPKWMLPVLDAVGLAVFVGIGVNKAFNAEAGPLIAVCMGVITGVGGGIIRDVLAREIPMILRTEIYATACIIGGIV
HATAYYTFSVPLETASMMGMVVTLLIRLAAIRWHLKLPTFALDENGR
>P33128 ~~~yadV~~~Probable fimbrial chaperone YadV~~~COG3121
MFFNTKHTTALCFVTCMAFSSSSIADIVISGTRVIYKSDQKSVNVRLENKGNNPLLVQSWLDTGDDNAEPGSITVPFTAT
PPVSRIDAKRGQTIKLMYTASTSLPKDRESVFWFNVLEVPPKPDAEKVANQSLLQLAFRTRIKLFYRPDGLKGNPSEAPL
ALKWFWSGSEGKASLRVTNPTPYYVSFSSGDLEASGKRYPIDVKMIAPFSDEVMKVNGLNGKANSAKVHFYAINDFGGAI
EGNARL
>P0DPM7 ~~~yadW~~~Protein YadW~~~
MAIIIGLEFAQLPMSFGAKYE
>P0A8K7 ~~~yaeP~~~UPF0253 protein YaeP~~~
MEKYCELIRKRYAEIASGDLGYVPDALGCVLKVLNEMAADDALSEAVREKAAYAAANLLVSDYVNE
>P0AA97 ~~~yaeQ~~~Uncharacterized protein YaeQ~~~COG4681
MALKATIYKATVNVADLDRNQFLDASLTLARHPSETQERMMLRLLAWLKYADERLQFTRGLCADDEPEAWLRNDHLGIDL
WIELGLPDERRIKKACTQAAEVALFTYNSRAAQIWWQQNQSKCVQFANLSVWYLDDEQLAKVSAFADRTMTLQATIQDGV
IWLSDDKNNLEVNLTAWQQPS
>Q47147 2.4.2.-~~~yafJ~~~Putative glutamine amidotransferase YafJ~~~COG0121
MCELLGMSANVPTDICFSFTGLVQRGGGTGPHKDGWGITFYEGKGCRTFKDPQPSFNSPIAKLVQDYPIKSCSVVAHIRQ
ANRGEVALENTHPFTRELWGRNWTYAHNGQLTGYKSLETGNFRPVGETDSEKAFCWLLHKLTQRYPRTPGNMAAVFKYIA
SLADELRQKGVFNMLLSDGRYVMAYCSTNLHWITRRAPFGVATLLDQDVEIDFSSQTTPNDVVTVIATQPLTGNETWQKI
MPGEWRLFCLGERVV
>Q47156 ~~~yafN~~~Antitoxin YafN~~~COG2161
MHRILAEKSVNITELRKNPAKYFIDQPVAVLSNNRPAGYLLSASAFEALMDMLAEQEEKKPIKARFRPSAARLEEITRRA
EQYLNDMTDDDFNDFKE
>Q47157 3.1.-.-~~~yafO~~~mRNA interferase toxin YafO~~~
MRVFKTKLIRLQLTAEELDALTADFISYKRDGVLPDIFGRDALYDDSFTWPLIKFERVAHIHLANENNPFPPQLRQFSRT
NDEAHLVYCQGAFDEQAWLLIAILKPEPHKLARDNNQMHKIGKMAEAFRMRF
>Q47158 2.3.1.-~~~yafP~~~Uncharacterized N-acetyltransferase YafP~~~COG0454
MNNIQIRNYQPGDFQQLCAIFIRAVTMTASQHYSPQQISAWAQIDESRWKEKLAKSQVWVAIINAQPVGFISRIEHYIDM
LFVDPEYTRRGVASALLKPLIKSESELTVDASITAKPFFERYGFQTVKQQRVECRGAWFTNFYMRYKPQH
>A0A140NAP5 3.1.-.-~~~yafQ~~~mRNA interferase toxin YafQ~~~COG3041
MIQRDIEYSGQFSKDVKLAQKRHKDMNKLKYLMTLLINNTLPLPAVYKDHPLQSSWKGYRDAHVEPDWILIYKLTDKLLR
FERTGTHAALFG
>Q47149 3.1.-.-~~~yafQ~~~mRNA interferase toxin YafQ~~~COG3041
MIQRDIEYSGQYSKDVKLAQKRHKDMNKLKYLMTLLINNTLPLPAVYKDHPLQGSWKGYRDAHVEPDWILIYKLTDKLLR
FERTGTHAALFG
>A0A140NDS5 3.5.1.3~~~yafV~~~Omega-amidase YafV~~~COG0388
MPGLKITLLQQPLVWMDGPANLRHFDRQLEGITGRDVIVLPEMFTSGFAMEAAASSLAQDDVVNWMTAKAQQCNALIAGS
VALQTESGSVNRFLLVEPGGTVHFYDKRHLFRMVDEHLHYKAGNARVIVEWRGWRILPLVCYDLRFPVWSRNLNDYDLAL
YVANWPAPRSLHWQALLTARAIENQAYVAGCNRVGSDGNGCHYRGDSRVINPQGEIIATADAHQATRIDAELSMAALREY
REKFPAWQDADEFRLW
>P0DP65 3.5.1.3~~~yafV~~~Omega-amidase YafV~~~
MKVQIYQLPIIFGDSSKNETQITQWFEKNMNAEVDVVVLPEMWNNGYDLEHLNEKADNNLGQSFSFIKHLAEKYKVDIVA
GSVSNIRNYQIFNTAFSVNKSGQLINEYDKVHLVPMLREHEFLTAGENVAEPFQLSDGTYVTQLICYDLRFPELLRYPAR
SGAKIAFYVAQWPMSRLQHWHSLLKARAIENNMFVIGTNSTGFDGNTEYAGHSIVINPNGDLVGELNESADILTVDLNLN
EVEQQRENIPVFKSIKLDLYK
>Q47684 ~~~yafW~~~Antitoxin YafW~~~
MSNPTRGLQREITLRLGARLVQEGNRLHYLADRASITGKFSDIECRKLDETFPHFILQMESMLTTGELSPHHAHCVTLYH
NDLTCEADTLGSCGYVYIAIYPTQR
>P77365 ~~~yafY~~~Lipoprotein YafY~~~COG2378
MKRKTLPLLALVATTLFLIACDDRSDDLKAISKFKDLTPPRFSDVVSHQDDVSEEWSQVDYLSGPTLQVLRTRQSPDGCE
DGSYYYLVDMQEKTVQPLMNALCIADNIKLEYQEVTDPYTKEKYFEYAHDGKLMGQLLIPSNPDNQE
>P37007 ~~~yagA~~~Uncharacterized protein YagA~~~COG2801
MESLMPWDARDTMSLRTEFVLFASQDGANIRSLCRRFGISPATGYKWLQRWAQEGAAGLQDRPRIPHHSPNRSSDDITAL
LRMAHDRHERWGARKIKRWLEDQGHTMPAFSTVHNLMARHGLLPGASPGIPATGRFEHDAPNRLWQMDFKGHFPFGGGRC
HPLTLLDDHSRFSLCLAHCTDERRETVQQQLVSVFERYGLPDRMTMDNGSPWGDTTGTWTALELWLMRLGIRVGHSRPYH
PQTQGKLERFHRSLKAEVLQGKWFADSGELQRAFDHWRTVYNLERPHEALDMAVPGSRYQPSARQYSGNTTPPEYDEGVM
VRKVDISGKLSVKGVSLSAGKAFRGERVGLKEMQEDGSYEVWWYSTKVGVIDLKKKSITMGKGC
>P75682 4.1.2.51~~~yagE~~~Putative 2-dehydro-3-deoxy-D-gluconate aldolase YagE~~~COG0329
MPQSALFTGIIPPVSTIFTADGQLDKPGTAALIDDLIKAGVDGLFFLGSGGEFSQLGAEERKAIARFAIDHVDRRVPVLI
GTGGTNARETIELSQHAQQAGADGIVVINPYYWKVSEANLIRYFEQVADSVTLPVMLYNFPALTGQDLTPALVKTLADSR
SNIIGIKDTIDSVAHLRSMIHTVKGAHPHFTVLCGYDDHLFNTLLLGGDGAISASGNFAPQVSVNLLKAWRDGDVAKAAG
YHQTLLQIPQMYQLDTPFVNVIKEAIVLCGRPVSTHVLPPASPLDEPRKAQLKTLLQQLKLC
>P77596 4.2.1.82~~~yagF~~~D-xylonate dehydratase YagF~~~COG0129
MTIEKIFTPQDDAFYAVITHAAGPQGALPLTPQMLMESPSGNLFGMTQNAGMGWDANKLTGKEVLIIGTQGGIRAGDGRP
IALGYHTGHWEIGMQMQAAAKEITRNGGIPFAAFVSDPCDGRSQGTHGMFDSLPYRNDAAIVFRRLIRSLPTRRAVIGVA
TCDKGLPATMIALAAMHDLPTILVPGGATLPPTVGEDAGKVQTIGARFANHELSLQEAAELGCRACASPGGGCQFLGTAG
TSQVVAEALGLALPHSALAPSGQAVWLEIARQSARAVSELDSRGITTRDILSDKAIENAMVIHAAFGGSTNLLLHIPAIA
HAAGCTIPDVEHWTRINRKVPRLVSVLPNGPDYHPTVRAFLAGGVPEVMLHLRDLGLLHLDAMTVTGQTVGENLEWWQAS
ERRARFRQCLREQDGVEPDDVILPPEKAKAKGLTSTVCFPTGNIAPEGSVIKATAIDPSVVGEDGVYHHTGRVRVFVSEA
QAIKAIKREEIVQGDIMVVIGGGPSGTGMEETYQLTSALKHISWGKTVSLITDARFSGVSTGACFGHVSPEALAGGPIGK
LRDNDIIEIAVDRLTLTGSVNFIGTADNPLTPEEGARELARRQTHPDLHAHDFLPDDTRLWAALQSVSGGTWKGCIYDTD
KIIEVINAGKKALGI
>P75683 ~~~yagG~~~Putative glycoside/cation symporter YagG~~~COG2211
MTQLTMKDKIGYGLGDTACGFVWQATMFLLAYFYTDVFGLSAGIMGTLFLVSRVLDAVTDPLMGLLVDRTRTRHGQFRPF
LLWGAIPFGIVCVLTFYTPDFSAQGKIIYACVTYILLTLVYTFVNVPYCAMPGVITADPKERHALQSWRFFLAAAGSLAI
SGIALPLVSIIGKGDEQVGYFGAMCVLGLSGVVLLYVCFFTTKERYTFEVQPGSSVAKDLKLLLGNSQWRIMCAFKMMAT
CSNVVRGGATLYFVKYVMDHPELATQFLLYGSLATMFGSLCSSRLLGRFDRVTAFKWIIVAYSLISLLIFVTPAEHIALI
FALNILFLFVFNTTTPLQWLMASDVVDYEESRSGRRLDGLVFSTYLFSLKIGLAIGGAVVGWILAYVNYSASSSVQPVEV
LTTIKILFCVVPVVLYAGMFIMLSLYKLTDARVEAISRQLIKHRAAQGEAVPDAATAASH
>P0AAA1 ~~~yagU~~~Inner membrane protein YagU~~~COG3477
MNIFEQTPPNRRRYGLAAFIGLIAGVVSAFVKWGAEVPLPPRSPVDMFNAACGPESLIRAAGQIDCSRNFLNPPYIFLRD
WLGLTDPNAAVYTFAGHVFNWVGVTHIIFSIVFAVGYCVVAEVFPKIKLWQGLLAGALAQLFVHMISFPLMGLTPPLFDL
PWYENVSEIFGHLVWFWSIEIIRRDLRNRITHEPDPEIPLGSNR
>P77700 ~~~yahB~~~Uncharacterized HTH-type transcriptional regulator YahB~~~COG0583
MNSIFTEENLLAFTTAARFGSFSKAAEELGLTTSAISYTIKRMETGLDVVLFTRSTRSIELTESGRYFFRKATDLLNDFY
AIKRRIDTISQGIEARVRICINQLLYTPKHTARLLQVLKKQFPTCQITVTTEVYNGVWDAIINNQANIAIGAPDTLLDGG
GIDYTEIGAIRWAFAIAPDHPLAFVPEPIAESQLRLYPNIMVEDTAHTINKKVGWLLHGQESILVPDFNTKCQCQILGEG
IGFLPDYMVREAMTQSLLVTRQIHNPRQDSRMLLATQHSATGQVTQWIKKQFAPNGILTGIYQDLLHREN
>P75691 1.1.1.2~~~yahK~~~Aldehyde reductase YahK~~~COG1064
MKIKAVGAYSAKQPLEPMDITRREPGPNDVKIEIAYCGVCHSDLHQVRSEWAGTVYPCVPGHEIVGRVVAVGDQVEKYAP
GDLVGVGCIVDSCKHCEECEDGLENYCDHMTGTYNSPTPDEPGHTLGGYSQQIVVHERYVLRIRHPQEQLAAVAPLLCAG
ITTYSPLRHWQAGPGKKVGVVGIGGLGHMGIKLAHAMGAHVVAFTTSEAKREAAKALGADEVVNSRNADEMAAHLKSFDF
ILNTVAAPHNLDDFTTLLKRDGTMTLVGAPATPHKSPEVFNLIMKRRAIAGSMIGGIPETQEMLDFCAEHGIVADIEMIR
ADQINEAYERMLRGDVKYRFVIDNRTLTD
>P0DPN0 ~~~yahV~~~Protein YahV~~~
MCDILLNVLNIVFIGIAIILVIIC
>P0AAN5 ~~~yaiA~~~Uncharacterized protein YaiA~~~
MPTKPPYPREAYIVTIEKGKPGQTVTWYQLRADHPKPDSLISEHPTAQEAMDAKKRYEDPDKE
>P0A8D3 ~~~yaiI~~~UPF0178 protein YaiI~~~COG1671
MTIWVDADACPNVIKEILYRAAERMQMPLVLVANQSLRVPPSRFIRTLRVAAGFDVADNEIVRQCEAGDLVITADIPLAA
EAIEKGAAALNPRGERYTPATIRERLTMRDFMDTLRASGIQTGGPDSLSQRDRQAFAAELEKWWLEVQRSRG
>P0AAP7 ~~~yaiY~~~Inner membrane protein YaiY~~~
MADFTLSKSLFSGKYRNASSTPGNIAYALFVLFCFWAGAQLLNLLVHAPGVYERLMQVQETGRPRVEIGLGVGTIFGLIP
FLVGCLIFAVVALWLHWRHRRQ
>P0ADZ7 ~~~yajC~~~Sec translocon accessory complex subunit YajC~~~COG1862
MSFFISDAVAATGAPAQGSPMSLILMLVVFGLIFYFMILRPQQKRTKEHKKLMDSIAKGDEVLTNGGLVGRVTKVAENGY
IAIALNDTTEVVIKRDFVAAVLPKGTMKAL
>P9WL75 ~~~yajC~~~Sec translocon accessory complex subunit YajC~~~COG1862
MESFVLFLPFLLIMGGFMYFASRRQRRAMQATIDLHDSLQPGERVHTTSGLEATIVAIADDTIDLEIAPGVVTTWMKLAI
RDRILPDDDIDEELNEDLDKDVDDVAGERRVTNDS
>P46122 ~~~yajI~~~Uncharacterized lipoprotein YajI~~~COG4238
MNTNVFRLLLLGSLFSLSACVQQSEVRQMKHSVSTLNQEMTQLNQETVKITQQNRLNAKSSSGVYLLPGAKTPARLESQI
GTLRMSLVNITPDADGTTLTLRIQGESNDPLPAFSGTVEYGQIQGTIDNFQEINVQNQLINAPASVLAPSDVDIPLQLKG
ISVDQLGFVRIHDIQPVMQ
>Q46948 3.1.2.-~~~yajL~~~Protein/nucleic acid deglycase 3~~~COG0693
MSASALVCLAPGSEETEAVTTIDLLVRGGIKVTTASVASDGNLAITCSRGVKLLADAPLVEVADGEYDVIVLPGGIKGAE
CFRDSTLLVETVKQFHRSGRIVAAICAAPATVLVPHDIFPIGNMTGFPTLKDKIPAEQWLDKRVVWDARVKLLTSQGPGT
AIDFGLKIIDLLVGREKAHEVASQLVMAAGIYNYYE
>P77735 1.1.-.-~~~yajO~~~1-deoxyxylulose-5-phosphate synthase YajO~~~COG0667
MQYNPLGKTDLRVSRLCLGCMTFGEPDRGNHAWTLPEESSRPIIKRALEGGINFFDTANSYSDGSSEEIVGRALRDFARR
EDVVVATKVFHRVGDLPEGLSRAQILRSIDDSLRRLGMDYVDILQIHRWDYNTPIEETLEALNDVVKAGKARYIGASSMH
ASQFAQALELQKQHGWAQFVSMQDHYNLIYREEEREMLPLCYQEGVAVIPWSPLARGRLTRPWGETTARLVSDEVGKNLY
KESDENDAQIAERLTGVSEELGATRAQVALAWLLSKPGIAAPIIGTSREEQLDELLNAVDITLKPEQIAELETPYKPHPV
VGFK
>P0A8E7 ~~~yajQ~~~UPF0234 protein YajQ~~~COG1666
MPSFDIVSEVDLQEARNAVDNASREVESRFDFRNVEASFELNDASKTIKVLSESDFQVNQLLDILRAKLLKRGIEGSSLD
VPENIVHSGKTWFVEAKLKQGIESATQKKIVKMIKDSKLKVQAQIQGDEIRVTGKSRDDLQAVMAMVRGGDLGQPFQFKN
FRD
>P77726 ~~~yajR~~~Inner membrane transport protein YajR~~~COG2814
MNDYKMTPGERRATWGLGTVFSLRMLGMFMVLPVLTTYGMALQGASEALIGIAIGIYGLTQAVFQIPFGLLSDRIGRKPL
IVGGLAVFAAGSVIAALSDSIWGIILGRALQGSGAIAAAVMALLSDLTREQNRTKAMAFIGVSFGITFAIAMVLGPIITH
KLGLHALFWMIAILATTGIALTIWVVPNSSTHVLNRESGMVKGSFSKVLAEPRLLKLNFGIMCLHILLMSTFVALPGQLA
DAGFPAAEHWKVYLATMLIAFGSVVPFIIYAEVKRKMKQVFVFCVGLIVVAEIVLWNAQTQFWQLVVGVQLFFVAFNLME
ALLPSLISKESPAGYKGTAMGVYSTSQFLGVAIGGSLGGWINGMFDGQGVFLAGAMLAAVWLTVASTMKEPPYVSSLRIE
IPANIAANEALKVRLLETEGIKEVLIAEEEHSAYVKIDSKVTNRFEIEQAIRQA
>P05449 ~~~~~~ATP synthase subunits region ORF 6~~~
MKPVPTYVQDKDESTLMFSVCSLVRDQAKYDRLLESFERFGFTPDKAEFLAADNREGNQFHGFSWHKQMLPRCKGRYVIF
CHEDVELVDRGYDDLVAAIEALEEADPKWLVAGVAGSPWRPLNHSVTAQALHISDVFGNDRRRGNVPCRVESLDECFLLM
RRLKPVLNSYDMQGFHYYGADLCLQAEFLGGRAYAIDFHLHHYGRAIADENFHRLRQEMAQKYRRWFPGRILHCVTGRVA
LGGGWYEAR
>P0AAQ9 ~~~ybaA~~~Uncharacterized protein YbaA~~~
MKYVDGFVVAVPADKKDAYREMAAKAAPLFKEFGALRIVECWASDVPDGKVTDFRMAVKAEENEEVVFSWIEYPSKEVRD
AANQKMMSDPRMKEFGESMPFDGKRMIYGGFESIIDE
>P0A8B5 ~~~ybaB~~~Nucleoid-associated protein YbaB~~~COG0718
MFGKGGLGNLMKQAQQMQEKMQKMQEEIAQLEVTGESGAGLVKVTINGAHNCRRVEIDPSLLEDDKEMLEDLVAAAFNDA
ARRIEETQKEKMASVSSGMQLPPGFKMPF
>P0AAR3 4.2.-.-~~~ybaK~~~Cys-tRNA(Pro)/Cys-tRNA(Cys) deacylase YbaK~~~COG2606
MTPAVKLLEKNKISFQIHTYEHDPAETNFGDEVVKKLGLNPDQVYKTLLVAVNGDMKHLAVAVTPVAGQLDLKKVAKALG
AKKVEMADPMVAQRSTGYLVGGISPLGQKKRLPTIIDAPAQEFATIYVSGGKRGLDIELAAGDLAKILDAKFADIARRD
>P45202 4.2.-.-~~~ybaK~~~Cys-tRNA(Pro)/Cys-tRNA(Cys) deacylase YbaK~~~COG2606
MTPAIDLLKKQKIPFILHTYDHDPNNQHFGDEAAEKLGIDPNRSFKTLLVAENGDQKKLACFVLATANMLNLKKAAKSIG
VKKVEMADKDAAQKSTGYLVGGISPLGQKKRVKTVINSTALEFETIYVSGGKRGLSVEIAPQDLAKVLGAEFTDIVDE
>P39830 ~~~ybaL~~~Putative cation/proton antiporter YbaL~~~COG1226
MHHATPLITTIVGGLVLAFILGMLANKLRISPLVGYLLAGVLAGPFTPGFVADTKLAPELAELGVILLMFGVGLHFSLKD
LMAVKAIAIPGAIAQIAVATLLGMALSAVLGWSLMTGIVFGLCLSTASTVVLLRALEERQLIDSQRGQIAIGWLIVEDLV
MVLTLVLLPAVAGMMEQGDVGFATLAVDMGITIGKVIAFIAIMMLVGRRLVPWIMARSAATGSRELFTLSVLALALGVAF
GAVELFDVSFALGAFFAGMVLNESELSHRAAHDTLPLRDAFAVLFFVSVGMLFDPLILIQQPLAVLATLAIILFGKSLAA
FFLVRLFGHSQRTALTIAASLAQIGEFAFILAGLGMALNLLPQAGQNLVLAGAILSIMLNPVLFALLEKYLAKTETLEEQ
TLEEAIEEEKQIPVDICNHALLVGYGRVGSLLGEKLLASDIPLVVIETSRTRVDELRERGVRAVLGNAANEEIMQLAHLE
CAKWLILTIPNGYEAGEIVASARAKNPDIEIIARAHYDDEVAYITERGANQVVMGEREIARTMLELLETPPAGEVVTG
>P0AAR5 ~~~ybaN~~~Inner membrane protein YbaN~~~COG2832
MQRIILIIIGWLAVVLGTLGVVLPVLPTTPFILLAAWCFARSSPRFHAWLLYRSWFGSYLRFWQKHHAMPRGVKPRAILL
ILLTFAISLWFVQMPWVRIMLLVILACLLFYMWRIPVIDEKQEKH
>P0A9T6 ~~~ybaQ~~~Uncharacterized HTH-type transcriptional regulator YbaQ~~~COG3093
MKQATRKPTTPGDILLYEYLEPLDLKINELAELLHVHRNSVSALINNNRKLTTEMAFRLAKVFDTTVDFWLNLQAAVDLW
EVENNMRTQEELGRIETVAEYLARREERAKKVA
>P77400 ~~~ybaT~~~Inner membrane transport protein YbaT~~~COG0531
MMNTEGNNGNKPLGLWNVVSIGIGAMVGAGIFALLGQAALLMEASTWVAFAFGGIVAMFSGYAYARLGASYPSNGGIIDF
FRRGLGNGVFSLALSLLYLLTLAVSIAMVARAFGAYAVQFLHEGSQEEHLILLYALGIIAVMTLFNSLSNHAVGRLEVIL
VGIKMMILLLLIIAGVWSLQPAHISVSAPPSSGAFFSCIGITFLAYAGFGMMANAADKVKDPQVIMPRAFLVAIGVTTLL
YISLALVLLSDVSALELEKYADTAVAQAASPLLGHVGYVIVVIGALLATASAINANLFAVFNIMDNMGSERELPKLMNKS
LWRQSTWGNIIVVVLIMLMTAALNLGSLASVASATFLICYLAVFVVAIRLRHDIHASLPILIVGTLVMLLVIVGFIYSLW
SQGSRALIWIIGSLLLSLIVAMVMKRNKTV
>P77717 ~~~ybaY~~~Uncharacterized lipoprotein YbaY~~~COG3126
MKLVHMASGLAVAIALAACADKSADIQTPAPAANTSISATQQPAIQQPNVSGTVWIRQKVALPPDAVLTVTLSDASLADA
PSKVLAQKAVRTEGKQSPFSFVLSFNPADVQPNARILLSAAITVNDKLVFITDTVQPVINQGGTKADLTLVPVQQTAVPV
QASGGATTTVPSTSPTQVNPSSAVPAPTQY
>P55192 ~~~ybbA~~~Uncharacterized protein YbbA~~~COG2819
MKGSLSEHKAGNRRFTLYLPPSYSTDSGGFPAVYVQDGSSLFQNQIELLESAFQQQRLPELVLIGIEPENRLDEYTPWPA
ASLSDRFTDFGGMGYHYLSDITNQFIPLIEENWNVTREPQSRGMIGASLGGLISMFAILKYPSMFGKIGSISGSYWYENA
AETIHISSLKPGTARVFMSIGSEEGREKQSIQRHMLKKTKQVHQSLKEKGFTEDQLCLSIEKGAVHHHKYFCKQFINALE
WLYGKNRSTL
>P33668 ~~~ybbC~~~Uncharacterized protein YbbC~~~
MKYSSIFSMLSFFILFACNETAVYGSDENIIFMRYVEKLHLDKYSVKNTVKTETMAIQLAEIYVRYRYGERIAEEEKPYL
ITELPDSWVVEGAKLPYEVAGGVFIIEINKKNGCVLNFLHSK
>Q45581 ~~~ybbH~~~Uncharacterized HTH-type transcriptional regulator YbbH~~~COG1737
MATGGLAIIQSMKHKLPPSERKLADYILAHPHKAIESTVNEISALANSSDAAVIRLCKSLGLKGFQDLKMRVAGDLAKPT
FQGYRDIVPHEPLPSISEKTAGNAIQAIQDTSDLMDYKELERAVSLLLKAHTVHFIGLGASGIVAKDAQQKWLRIHKQAT
AFTDTHLVASLIANADKDDIVFAISFSGETQEIVELFAMAKEKGITTISLTQFSQTSVSALADVPLYTAHSNEALIRSAA
TSSRLAQLFIIDVLFLGMAAEQYETTTGYIDKTRAAIQSMRIK
>P0AAS3 ~~~ybbJ~~~Inner membrane protein YbbJ~~~COG1585
MMELMVVHPHIFWLSLGGLLLAAEMLGGNGYLLWSGVAAVITGLVVWLVPLGWEWQGVMFAILTLLAAWLWWKWLSRRVR
EQKHSDSHLNQRGQQLIGRRFVLESPLVNGRGHMRVGDSSWPVSASEDLGAGTHVEVIAIEGITLHIRAVSS
>P77328 ~~~ybbY~~~Putative purine permease YbbY~~~COG2233
MFNFAVSRESLLSGFQWFFFIFCNTVVVPPTLLSAFQLPQSSLLTLTQYAFLATALACFAQAFCGHRRAIMEGPGGLWWG
TILTITLGEASRGTPINDIATSLAVGIALSGVLTMLIGFSGLGHRLARLFTPSVMVLFMLMLGAQLTTIFFKGMLGLPFG
IADPNFKIQLPPFALSVAVMCLVLAMIIFLPQRFARYGLLVGTITGWLLWYFCFPSSHSLSGELHWQWFPLGSGGALSPG
IILTAVITGLVNISNTYGAIRGTDVFYPQQGAGNTRYRRSFVATGFMTLITVPLAVIPFSPFVSSIGLLTQTGDYTRRSF
IYGSVICLLVALVPALTRLFCSIPLPVSSAVMLVSYLPLLFSALVFSQQITFTARNIYRLALPLFVGIFLMALPPVYLQD
LPLTLRPLLSNGLLVGILLAVLMDNLIPWERIE
>P45570 ~~~ybcI~~~Inner membrane protein YbcI~~~COG1988
MPTVITHAAVPLCIGLGLGSKVIPPRLLFAGIILAMLPDADVLSFKFGVAYGNVFGHRGFTHSLVFAFVVPLLCVFIGRR
WFRAGLIRCWLFLTVSLLSHSLLDSVTTGGKGVGWLWPWSDERFFAPWQVIKVAPFALSRYTTPYGHQVIISELMWVWLP
GMLLMGMLWWRRR
>P0AAS7 ~~~ybcJ~~~Putative RNA-binding protein YbcJ~~~COG2501
MATFSLGKHPHVELCDLLKLEGWSESGAQAKIAIAEGQVKVDGAVETRKRCKIVAGQTVSFAGHSVQVVA
>P77368 ~~~ybcL~~~UPF0098 protein YbcL~~~COG1881
MKTLIVSTVLAFITFSAQAAAFQVTSNEIKTGEQLTTSHVFSGFGCEGGNTSPSLTWSGVPEGTKSFAVTVYDPDAPTGS
GWWHWTVVNIPATVTYLPVDAGRRDGTKLPTGAVQGRNDFGYAGFGGACPPKGDKPHHYQFKVWALKTEKIPVDSNSSGA
LVGYMLNANKIATAEITPVYEIK
>P68661 3.1.-.-~~~ybcO~~~Putative nuclease YbcO~~~
MADLRKAARGRECQVRIPGVCNGNPETSVLAHIRLTGLCGTGTKPPDLIATIACSACHDEIDRRTHFVDAGYAKECALEG
MARTQVIWLKEGVIKA
>P0AAT4 ~~~ybdG~~~Miniconductance mechanosensitive channel YbdG~~~COG0668
MQDLISQVEDLAGIEIDHTTSMVMIFGIIFLTAVVVHIILHWVVLRTFEKRAIASSRLWLQIITQNKLFHRLAFTLQGII
VNIQAVFWLQKGTEAADILTTCAQLWIMMYALLSVFSLLDVILNLAQKFPAASQLPLKGIFQGIKLIGAILVGILMISLL
IGQSPAILISGLGAMAAVLMLVFKDPILGLVAGIQLSANDMLKLGDWLEMPKYGADGAVIDIGLTTVKVRNWDNTITTIP
TWSLVSDSFKNWSGMSASGGRRIKRSISIDVTSIRFLDEDEMQRLNKAHLLKPYLTSRHQEINEWNRQQGSTESVLNLRR
MTNIGTFRAYLNEYLRNHPRIRKDMTLMVRQLAPGDNGLPLEIYAFTNTVVWLEYESIQADIFDHIFAIVEEFGLRLHQS
PTGNDIRSLAGAFKQ
>P77806 2.6.1.88~~~ybdL~~~Methionine aminotransferase~~~COG0436
MTNNPLIPQSKLPQLGTTIFTQMSALAQQHQAINLSQGFPDFDGPRYLQERLAHHVAQGANQYAPMTGVQALREAIAQKT
ERLYGYQPDADSDITVTAGATEALYAAITALVRNGDEVICFDPSYDSYAPAIALSGGIVKRMALQPPHFRVDWQEFAALL
SERTRLVILNTPHNPSATVWQQADFAALWQAIAGHEIFVISDEVYEHINFSQQGHASVLAHPQLRERAVAVSSFGKTYHM
TGWKVGYCVAPAPISAEIRKVHQYLTFSVNTPAQLALADMLRAEPEHYLALPDFYRQKRDILVNALNESRLEILPCEGTY
FLLVDYSAVSTLDDVEFCQWLTQEHGVAAIPLSVFCADPFPHKLIRLCFAKKESTLLAAAERLRQL
>O31435 2.7.11.1~~~ybdM~~~Probable serine/threonine-protein kinase YbdM~~~COG0515
MALKLLKKLLFDRPLKNGVILNHQYKIEECLGMGGYGLVYLCTDILAQTPYVLKQLRPTKAKKEKEKVRFQQEIKLLKNI
HHPQIPGFIDEFIIDGQAYYVMQFIEGENIEELLFFRKQPFTELMALQLISQLLEIIEYLHDRLIFHSDIRTPNIIINDG
RLCLIDFGLAKQLTPEEMEEIKVRKQDDFFDLGETLLFLLYSQYKGKKKKNGTWLEELTLTKEVTLLLKRLLGIEEEYQH
TASIREDLNRAIQSVT
>P77174 ~~~ybdM~~~Uncharacterized protein YbdM~~~COG1475
MGDTMQQRLTQDLTQFLASLPEDDRIKAINEIRMAIHQVSPFREEPVDCVLWVKNSQLMPNDYNPNNVAPPEKKLLQKSI
EIDGFTQPIVVTHTDKNAMEIVDGFHRHEIGKGSSSLKLRLKGYLPVTCLEGTRNQRIAATIRHNRARGRHQITAMSEIV
RELSQLGWDDNKIGKELGMDSDEVLRLKQINGLQELFADRQYSRAWTVK
>P77746 ~~~ybdO~~~Uncharacterized HTH-type transcriptional regulator YbdO~~~COG0583
MANLYDLKKFDLNLLVIFECIYQHLSISKAAESLYITPSAVSQSLQRLRAQFNDPLFIRSGKGIAPTTTGLNLHHHLEKN
LRGLEQTINIVNKSELKKNFIIYGPQLISCSNNSMLIRCLRQDSSVEIECHDILMSAENAEELLVHRKADLVITQMPVIS
RSVICMPLHTIRNTLICSNRHPRITDNSTYEQIMAEEFTQLISKSAGVDDIQMEIDERFMNRKISFRGSSLLTIINSIAV
TDLLGIVPYELYNSYRDFLNLKEIKLEHPLPSIKLYISYNKSSLNNLVFSRFIDRLNESF
>P18393 ~~~ybdZ~~~Enterobactin biosynthesis protein YbdZ~~~COG3251
MAFSNPFDDPQGAFYILRNAQGQFSLWPQQCVLPAGWDIVCQPQSQASCQQWLEAHWRTLTPTNFTQLQEAQ
>P0A8J4 ~~~ybeD~~~UPF0250 protein YbeD~~~COG2921
MKTKLNELLEFPTPFTYKVMGQALPELVDQVVEVVQRHAPGDYTPTVKPSSKGNYHSVSITINATHIEQVETLYEELGKI
DIVRMVL
>A0A140NCB4 3.5.1.128~~~ybeM~~~Deaminated glutathione amidase~~~COG0388
MLVAAGQFAVTSVWEKNAEICASLMAQAAENDVSLFVLPEALLARDDHDADLSVKSAQLLEGEFLGRLRRESKRNMMTTI
LTIHVPSTPGRAWNMLVALQAGNIVARYAKLHLYDAFAIQESRRVDAGNEIAPLLEVEGMKVGLMTCYDLRFPELALAQA
LQGAEILVLPAAWVRGPLKEHHWSTLLAARALDTTCYMVAAGECGNKNIGQSRIIDPFGVTIAAASEMPALIMAEVTPER
VRQVRAQLPVLNNRRFAPPQLL
>P77296 ~~~ybeT~~~Sel1-repeat-containing protein YbeT~~~COG0790
MNKKLMYIFAIFIVAAITCISQPKKTTLRDKAMVNYAFDYLSSPGSLPFTTAATELSAIHGHSTSQYRLGEFYLHGSDGK
PLDYTQARYWYEQSAEQENPRAQSKLGWIYLKGLGVKPDTRKAILWYKEAAEQGYAHAQYTLGLIYRNGSGINVNHYESQ
KWLKLTAKQHYKNAERLLAGLPAH
>O67367 3.1.-.-~~~ybeY~~~Endoribonuclease YbeY~~~COG0319
MSSTKRQKNRVLVKLKKRKVRKDKIEKWAELALSALGLNNVELSVYITDDQEIRELNKTYRKKDKPTDVLSFPMGEEFGG
YKILGDVVISQDTAERQARELGHSLEEEVKRLIVHGIVHLLGYDHEKGGEEEKKFRELENYVLSKLSKAL
>P0A898 3.1.-.-~~~ybeY~~~Endoribonuclease YbeY~~~COG0319
MSQVILDLQLACEDNSGLPEESQFQTWLNAVIPQFQEESEVTIRVVDTAESHSLNLTYRGKDKPTNVLSFPFEVPPGMEM
SLLGDLVICRQVVEKEAQEQGKPLEAHWAHMVVHGSLHLLGYDHIEDDEAEEMEALETEIMLALGYEDPYIAEKE
>P71335 3.1.-.-~~~ybeY~~~Endoribonuclease YbeY~~~COG0319
MGSVLVDLQIATENIEGLPTEEQIVQWATGAVQPEGNEVEMTVRIVDEAESHELNLTYRGKDRPTNVLSFPFECPDEVEL
PLLGDLVICRQVVEREASEQEKPLMAHWAHMVVHGSLHLLGYDHIEDDEAEEMESLETQIMQGLGFDDPYLAEK
>A0R0T0 3.1.-.-~~~ybeY~~~Endoribonuclease YbeY~~~COG0319
MSIEVSNESGYDVSEPELISVARFVIEKMDVHPAAELSMVLLDSAAMADLHMRWMDLPGPTDVMSFPMDELEPGGRPDAP
EPGPAMLGDIVLCPEFAEQQAAKAGHSLGHELALLTVHGVLHLLGYDHAEPDEEKEMFALQRQLLEEWVADQVEAYHADR
QSEKDRRLLDKSRYFDEP
>P9WGX9 3.1.-.-~~~ybeY~~~Endoribonuclease YbeY~~~COG0319
MSIEVANESGIDVSEAELVSVARFVIAKMDVNPCAELSMLLLDTAAMADLHMRWMDLPGPTDVMSFPMDELEPGGRPDAP
EPGPSMLGDIVLCPEFAAEQAAAAGHSLGHELALLTIHGVLHLLGYDHAEPDEEKEMFALQDRLLEEWVADQVEAYQHDR
QDEKDRRLLDKSRYFDL
>P67136 3.1.-.-~~~ybeY~~~Endoribonuclease YbeY~~~
MFTIDFSDHTGLVKDAWYKQIEDLLEFAKKEEHIEDDAELSVTFVDKQEIQEINRTYRDKDKVTDVISFALEEDEPEIDF
SGLDIPRVLGDIIICTDVAQEQANNYGHSFERELGFLALHGFLHLLGYDHMTEADEKEMFGRQDTILNAYGLTRD
>Q9X1J7 3.1.-.-~~~ybeY~~~Endoribonuclease YbeY~~~COG0319
MIRILGEGKGSKLLENLKEKLEEIVKKEIGDVHVNVILVSEDEIKELNQQFRGQDRPTDVLTFPLMEEDVYGEIYVCPLI
VEENAREFNNTFEKELLEVVIHGILHLAGYDHEFEDKNSKEMFEKQKKYVEEVWGEWRSNPSEDSDPGKR
>P0AAU7 ~~~ybfE~~~Uncharacterized protein YbfE~~~
MAKEQTDRTTLDLFAHERRPGRPKTNPLSRDEQLRINKRNQLKRDKVRGLKRVELKLNAEAVEALNELAESRNMSRSELI
EEMLMQQLAALRSQGIV
>P75736 3.1.-.-~~~ybfF~~~Esterase YbfF~~~COG0596
MKLNIRAQTAQNQHNNSPIVLVHGLFGSLDNLGVLARDLVNDHNIIQVDMRNHGLSPRDPVMNYPAMAQDLVDTLDAQQI
DKATFIGHSMGGKAVMALTALASDRIDKLVAIDIAPVDYHVRRHDEIFAAINAVSESDAQTRQQAAAIMRQHLNEEGVIQ
FLLKSFVDGEWRFNVPVLWDQYPHIVGWEKIPAWDHPALFIPGGNSPYVSEQYRDDLLAQFPQARAHVIAGAGHWVHAEK
PDAVLRAIRRYLND
>O31452 3.1.1.1~~~ybfK~~~Carboxylesterase YbfK~~~COG0596
MIQDSMQFAAVESGLRFYQAYDQSLSLWPIESEAFYVSTRFGKTHIIASGPKDAPSLILLHGGLFSSAMWYPNIAAWSSQ
FRTYAVDIIGDKNKSIPSAAMETRADFAEWMKDVFDSLGLETAHLAGLSLGGSHIVNFLLRAPERVERAVVISPAEAFIS
FHPDVYKYAAELTGARGAESYIKWITGDSYDLHPLLQRQIVAGVEWQDEQRSLKPTENGFPYVFTDQELKSIQVPVLLMF
GEHEAMYHQQMAFERASVLVPGIQAEIVKNAGHLLSLEQPEYVNQRVLSFLCGGIK
>O31460 ~~~ybgB~~~Uncharacterized membrane protein YbgB~~~
MFLFTNGKVLWGAVIAAFILSIVFYPFLPTQMPIHYDVANSPDLTVNKLAGTVMLPVLMVVFAWARKINWQFVFAVYILL
ICHIVVLCLAL
>P0A8Z3 3.1.2.-~~~ybgC~~~Acyl-CoA thioester hydrolase YbgC~~~COG0824
MNTTLFRWPVRVYYEDTDAGGVVYHASYVAFYERARTEMLRHHHFSQQALMAERVAFVVRKMTVEYYAPARLDDMLEIQT
EITSMRGTSLVFTQRIVNAENTLLNEAEVLVVCVDPLKMKPRALPKSIVAEFKQ
>P44679 3.1.2.-~~~ybgC~~~Acyl-CoA thioesterase YbgC~~~COG0824
MLDNGFSFPVRVYYEDTDAGGVVYHARYLHFFERARTEYLRTLNFTQQTLLEEQQLAFVVKTLAIDYCVAAKLDDLLMVE
TEVSEVKGATILFEQRLMRNTLMLSKATVKVACVDLGKMKPVAFPKEVKAAFHHLK
>P94842 3.1.2.-~~~ybgC~~~Acyl-CoA thioesterase YbgC~~~COG0824
MRCRVYYEDTDSEGVVYHANYLKYCERARSEFFFKQNVLPENEEGVFVIRSIKADFFTPASLGQVLEIRTQIKELRKVFV
VLFQEIYCIQNASLEPMKPFKVFASEIKFGFVNRSTYSPIAIPKLFKELLNAI
>P37909 ~~~ybgD~~~Uncharacterized fimbrial-like protein YbgD~~~COG3539
MFKGQKTLAALAVSLLFTAPVYAADEGSGEIHFKGEVIEAPCEIHPEDIDKNIDLGQVTTTHINREHHSNKVAVDIRLIN
CDLPASDNGSGMPVSKVGVTFDSTAKTTGATPLLSNTSAGEATGVGVRLMDKNDGNIVLGSAAPDLDLDASSSEQTLNFF
AWMEQIDNAVDVTAGEVTANATYVLDYK
>P0DPN5 ~~~ybgU~~~Protein YbgU~~~
MRKSYEVGISPKINLCNSVEVLTNSFGTVISGRQV
>P21829 3.1.3.74~~~ybhA~~~Pyridoxal phosphate phosphatase YbhA~~~COG0561
MTTRVIALDLDGTLLTPKKTLLPSSIEALARAREAGYQLIIVTGRHHVAIHPFYQALALDTPAICCNGTYLYDYHAKTVL
EADPMPVIKALQLIEMLNEHHIHGLMYVDDAMVYEHPTGHVIRTSNWAQTLPPEQRPTFTQVASLAETAQQVNAVWKFAL
THDDLPQLQHFGKHVEHELGLECEWSWHDQVDIARGGNSKGKRLTKWVEAQGWSMENVVAFGDNFNDISMLEAAGTGVAM
GNADDAVKARANIVIGDNTTDSIAQFIYSHLI
>P12994 ~~~ybhB~~~UPF0098 protein YbhB~~~COG1881
MKLISNDLRDGDKLPHRHVFNGMGYDGDNISPHLAWDDVPAGTKSFVVTCYDPDAPTGSGWWHWVVVNLPADTRVLPQGF
GSGLVAMPDGVLQTRTDFGKTGYDGAAPPKGETHRYIFTVHALDIERIDVDEGASGAMVGFNVHFHSLASASITAMFS
>P46130 3.1.2.-~~~ybhC~~~Putative acyl-CoA thioester hydrolase YbhC~~~COG4677
MNTFSVSRLALALAFGVTLTACSSTPPDQRPSDQTAPGTSSRPILSAKEAQNFDAQHYFASLTPGAAAWNPSPITLPAQP
DFVVGPAGTQGVTHTTIQAAVDAAIIKRTNKRQYIAVMPGEYQGTVYVPAAPGGITLYGTGEKPIDVKIGLSLDGGMSPA
DWRHDVNPRGKYMPGKPAWYMYDSCQSKRSDSIGVLCSAVFWSQNNGLQLQNLTIENTLGDSVDAGNHPAVALRTDGDQV
QINNVNILGRQNTFFVTNSGVQNRLETNRQPRTLVTNSYIEGDVDIVSGRGAVVFDNTEFRVVNSRTQQEAYVFAPATLS
NIYYGFLAVNSRFNAFGDGVAQLGRSLDVDANTNGQVVIRDSAINEGFNTAKPWADAVISNRPFAGNTGSVDDNDEIQRN
LNDTNYNRMWEYNNRGVGSKVVAEAKK
>P0A9U1 ~~~ybhF~~~Probable multidrug ABC transporter ATP-binding protein YbhF~~~COG1131
MNDAVITLNGLEKRFPGMDKPAVAPLDCTIHAGYVTGLVGPDGAGKTTLMRMLAGLLKPDSGSATVIGFDPIKNDGALHA
VLGYMPQKFGLYEDLTVMENLNLYADLRSVTGEARKQTFARLLEFTSLGPFTGRLAGKLSGGMKQKLGLACTLVGEPKVL
LLDEPGVGVDPISRRELWQMVHELAGEGMLILWSTSYLDEAEQCRDVLLMNEGELLYQGEPKALTQTMAGRSFLMTSPHE
GNRKLLQRALKLPQVSDGMIQGKSVRLILKKEATPDDIRHADGMPEININETTPRFEDAFIDLLGGAGTSESPLGAILHT
VEGTPGETVIEAKELTKKFGDFAATDHVNFAVKRGEIFGLLGPNGAGKSTTFKMMCGLLVPTSGQALVLGMDLKESSGKA
RQHLGYMAQKFSLYGNLTVEQNLRFFSGVYGLRGRAQNEKISRMSEAFGLKSIASHATDELPLGFKQRLALACSLMHEPD
ILFLDEPTSGVDPLTRREFWLHINSMVEKGVTVMVTTHFMDEAEYCDRIGLVYRGKLIASGTPDDLKAQSANDEQPDPTM
EQAFIQLIHDWDKEHSNE
>P75777 ~~~ybhG~~~UPF0194 membrane protein YbhG~~~COG0845
MMKKPVVIGLAVVVLAAVVAGGYWWYQSRQDNGLTLYGNVDIRTVNLSFRVGGRVESLAVDEGDAIKAGQVLGELDHKPY
EIALMQAKAGVSVAQAQYDLMLAGYRNEEIAQAAAAVKQAQAAYDYAQNFYNRQQGLWKSRTISANDLENARSSRDQAQA
TLKSAQDKLRQYRSGNREQDIAQAKASLEQAQAQLAQAELNLQDSTLIAPSDGTLLTRAVEPGTVLNEGGTVFTVSLTRP
VWVRAYVDERNLDQAQPGRKVLLYTDGRPDKPYHGQIGFVSPTAEFTPKTVETPDLRTDLVYRLRIVVTDADDALRQGMP
VTVQFGDEAGHE
>P0AAV8 5.-.-.-~~~ybhH~~~Putative isomerase YbhH~~~COG2828
MKKIPCVMMRGGTSRGAFLLAEHLPEDQTQRDKILMAIMGSGNDLEIDGIGGGNPLTSKVAIISRSSDPRADVDYLFAQV
IVHEQRVDTTPNCGNMLSGVGAFAIENGLIAATSPVTRVRIRNVNTGTFIEADVQTPNGVVEYEGSARIDGVPGTAAPVA
LTFLNAAGTKTGKVFPTDNQIDYFDDVPVTCIDMAMPVVIIPAEYLGKTGYELPAELDADKALLARIESIRLQAGKAMGL
GDVSNMVIPKPVLISPAQKGGAINVRYFMPHSCHRALAITGAIAISSSCALEGTVTRQIVPSVGYGNINIEHPSGALDVH
LSNEGQDATTLRASVIRTTRKIFSGEVYLP
>P75763 ~~~ybhI~~~Inner membrane protein YbhI~~~COG0471
MNKKSLWKLILILAIPCIIGFMPAPAGLSELAWVLFGIYLAAIVGLVIKPFPEPVVLLIAVAASMVVVGNLSDGAFKTTA
VLSGYSSGTTWLVFSAFTLSAAFVTTGLGKRIAYLLIGKIGNTTLGLGYVTVFLDLVLAPATPSNTARAGGIVLPIINSV
AVALGSEPEKSPRRVGHYLMMSIYMVTKTTSYMFFTAMAGNILALKMINDILHLQISWGGWALAAGLPGIIMLLVTPLVI
YTMYPPEIKKVDNKTIAKAGLAELGPMKIREKMLLGVFVLALLGWIFSKSLGVDESTVAIVVMATMLLLGIVTWEDVVKN
KGGWNTLIWYGGIIGLSSLLSKVKFFEWLAEVFKNNLAFDGHGNVAFFVIIFLSIIVRYFFASGSAYIVAMLPVFAMLAN
VSGAPLMLTALALLFSNSYGGMVTHYGGAAGPVIFGVGYNDIKSWWLVGAVLTILTFLVHITLGVWWWNMLIGWNML
>P0AAC4 ~~~ybhL~~~Inner membrane protein YbhL~~~COG0670
MDRFPRSDSIVQPRAGLQTYMAQVYGWMTVGLLLTAFVAWYAANSAAVMELLFTNRVFLIGLIIAQLALVIVLSAMIQKL
SAGVTTMLFMLYSALTGLTLSSIFIVYTAASIASTFVVTAGMFGAMSLYGYTTKRDLSGFGNMLFMALIGIVLASLVNFW
LKSEALMWAVTYIGVIVFVGLTAYDTQKLKNMGEQIDTRDTSNLRKYSILGALTLYLDFINLFLMLLRIFGNRR
>P75770 ~~~ybhN~~~Inner membrane protein YbhN~~~COG0392
MSKSHPRWRLAKKILTWLFFIAVIVLLVVYAKKVDWEEVWKVIRDYNRVALLSAVGLVVVSYLIYGCYDLLARFYCGHKL
AKRQVMLVSFICYAFNLTLSTWVGGIGMRYRLYSRLGLPGSTITRIFSLSITTNWLGYILLAGIIFTAGVVELPDHWYVD
QTTLRILGIGLLMIIAVYLWFCAFAKHRHMTIKGQKLVLPSWKFALAQMLISSVNWMVMGAIIWLLLGQSVNYFFVLGVL
LVSSIAGVIVHIPAGIGVLEAVFIALLAGEHTSKGTIIAALLAYRVLYYFIPLLLALICYLLLESQAKKLRAKNEAAM
>P0AAW5 ~~~ybhQ~~~Inner membrane protein YbhQ~~~
MKWQQRVRVATGLSCWQIMLHLLVVALLVVGWMSKTLVHVGVGLCALYCVTVVMMLVFQRHPEQRWREVADVLEELTTTW
YFGAALIVLWLLSRVLENNFLLAIAGLAILAGPAVVSLLAKDKKLHHLTSKHRVRR
>P0AFP9 ~~~ybhR~~~Probable multidrug ABC transporter permease YbhR~~~COG0842
MFHRLWTLIRKELQSLLREPQTRAILILPVLIQVILFPFAATLEVTNATIAIYDEDNGEHSVELTQRFARASAFTHVLLL
KSPQEIRPTIDTQKALLLVRFPADFSRKLDTFQTAPLQLILDGRNSNSAQIAANYLQQIVKNYQQELLEGKPKPNNSELV
VRNWYNPNLDYKWFVVPSLIAMITTIGVMIVTSLSVAREREQGTLDQLLVSPLTTWQIFIGKAVPALIVATFQATIVLAI
GIWAYQIPFAGSLALFYFTMVIYGLSLVGFGLLISSLCSTQQQAFIGVFVFMMPAILLSGYVSPVENMPVWLQNLTWINP
IRHFTDITKQIYLKDASLDIVWNSLWPLLVITATTGSAAYAMFRRKVM
>P0AFQ2 ~~~ybhS~~~Probable multidrug ABC transporter permease YbhS~~~COG0842
MSNPILSWRRVRALCVKETRQIVRDPSSWLIAVVIPLLLLFIFGYGINLDSSKLRVGILLEQRSEAALDFTHTMTGSPYI
DATISDNRQELIAKMQAGKIRGLVVIPVDFAEQMERANATAPIQVITDGSEPNTANFVQGYVEGIWQIWQMQRAEDNGQT
FEPLIDVQTRYWFNPAAISQHFIIPGAVTIIMTVIGAILTSLVVAREWERGTMEALLSTEITRTELLLCKLIPYYFLGML
AMLLCMLVSVFILGVPYRGSLLILFFISSLFLLSTLGMGLLISTITRNQFNAAQVALNAAFLPSIMLSGFIFQIDSMPAV
IRAVTYIIPARYFVSTLQSLFLAGNIPVVLVVNVLFLIASAVMFIGLTWLKTKRRLD
>P30177 ~~~ybiB~~~Uncharacterized protein YbiB~~~COG0547
MDYRKIIKEIGRGKNHARDLDRDTARGLYAHMLNGEVPDLELGGVLIALRIKGEGEAEMLGFYEAMQNHTIKLTPPAGKP
MPIVIPSYNGARKQANLTPLLAILLHKLGFPVVVHGVSEDPTRVLTETIFELMGITPTLHGGQAQAKLDEHQPVFMPVGA
FCPPLEKQLAMRWRMGVRNSAHTLAKLATPFAEGEALRLSSVSHPEYIGRVAKFFSDIGGRALLMHGTEGEVYANPQRCP
QINLIDREGMRVLYEKQDTAGSELLPQAKDPETTAQWIERCLAGSEPIPESLKIQMACCLVATGEAATISDGLARVNQAF
>P41039 ~~~ybiI~~~Uncharacterized protein YbiI~~~COG1734
MASGWANDDAVNEQINSTIEDAIARARGEIPRGESLDECEECGAPIPQARREAIPGVRLCIHCQQEKDLQKPAYTGYNRR
GSKDSQLR
>P75783 ~~~ybiO~~~Moderate conductance mechanosensitive channel YbiO~~~COG0668
MRWILFILFCLLGAPAHAVSIPGVTTTTTTDSTTEPAPEPDIEQKKAAYGALADVLDNDTSRKELIDQLRTVAATPPAEP
VPKIVPPTLVEEQTVLQKVTEVSRHYGEALSARFGQLYRNITGSPHKPFNPQTFSNALTHFSMLAVLVFGFYWLIRLCAL
PLYRKMGQWARQKNRERSNWLQLPAMIIGAFIIDLLLLALTLFVGQVLSDNLNAGSRTIAFQQSLFLNAFALIEFFKAVL
RLIFCPNVAELRPFTIQDESARYWSRRLSWLSSLIGYGLIVAVPIISNQVNVQIGALANVIIMLCMTVWALYLIFRNKKE
ITQHLLNFAEHSLAFFSLFIRAFALVWHWLASAYFIVLFFFSLFDPGNSLKFMMGATVRSLAIIGIAAFVSGMFSRWLAK
TITLSPHTQRNYPELQKRLNGWLSAALKTARILTVCVAVMLLLSAWGLFDFWNWLQNGAGQKTVDILIRIALILFFSAVG
WTVLASLIENRLASDIHGRPLPSARTRTLLTLFRNALAVIISTITIMIVLSEIGVNIAPLLAGAGALGLAISFGSQTLVK
DIITGVFIQFENGMNTGDLVTIGPLTGTVERMSIRSVGVRQDTGAYHIIPWSSITTFANFVRGIGSVVANYDVDRHEDAD
KANQALKDAVAELMENEEIRGLIIGEPNFAGIVGLSNTAFTLRVSFTTLPLKQWTVRFALDSQVKKHFDLAGVRAPVQTY
QVLPAPGATPAEPLPPGEPTL
>P75788 ~~~ybiR~~~Inner membrane protein YbiR~~~COG1055
MSLPFLRTLQGDRFFQLLILVGIGLSFFVPFAPKSWPAAIDWHTIITLSGLMLLTKGVELSGYFDVLGRKMVRRFATERR
LAMFMVLAAALLSTFLTNDVALFIVVPLTITLKRLCEIPVNRLIIFEALAVNAGSLLTPIGNPQNILIWGRSGLSFAGFI
AQMAPLAGAMMLTLLLLCWCCFPGKAMQYHTGVQTPEWKPRLVWSCLGLYIVFLTALEFKQELWGLVIVAAGFALLARRV
VLSVDWTLLLVFMAMFIDVHLLTQLPALQGVLGNVSHLSEPGLWLTAIGLSQVISNVPSTILLLNYVPPSLLLVWAVNVG
GFGLLPGSLANLIALRMANDRRIWWRFHLYSIPMLLWAALVGYVLLVILPAN
>P0AAX8 2.-.-.-~~~ybiS~~~Probable L,D-transpeptidase YbiS~~~COG1376
MNMKLKTLFAAAFAVVGFCSTASAVTYPLPTDGSRLVGQNQVITIPEGNTQPLEYFAAEYQMGLSNMMEANPGVDTFLPK
GGTVLNIPQQLILPDTVHEGIVINSAEMRLYYYPKGTNTVIVLPIGIGQLGKDTPINWTTKVERKKAGPTWTPTAKMHAE
YRAAGEPLPAVVPAGPDNPMGLYALYIGRLYAIHGTNANFGIGLRVSHGCVRLRNEDIKFLFEKVPVGTRVQFIDEPVKA
TTEPDGSRYIEVHNPLSTTEAQFEGQEIVPITLTKSVQTVTGQPDVDQVVLDEAIKNRSGMPVRLN
>P0A9U4 ~~~ybiT~~~Probable ATP-binding protein YbiT~~~COG0488
MLVSSNVTMQFGSKPLFENISVKFGGGNRYGLIGANGSGKSTFMKILGGDLEPTLGNVSLDPNERIGKLRQDQFAFEEFT
VLDTVIMGHKELWEVKQERDRIYALPEMSEEDGYKVADLEVKYGEMDGYSAEARAGELLLGVGIPVEQHYGPMSEVAPGW
KLRVLLAQALFADPDILLLDEPTNNLDIDTIRWLEQVLNERDSTMIIISHDRHFLNMVCTHMADLDYGELRVYPGNYDEY
MTAATQARERLLADNAKKKAQIAELQSFVSRFSANASKSRQATSRARQIDKIKLEEVKASSRQNPFIRFEQDKKLFRNAL
EVEGLTKGFDNGPLFKNLNLLLEVGEKLAVLGTNGVGKSTLLKTLVGDLQPDSGTVKWSENARIGYYAQDHEYEFENDLT
VFEWMSQWKQEGDDEQAVRSILGRLLFSQDDIKKPAKVLSGGEKGRMLFGKLMMQKPNILIMDEPTNHLDMESIESLNMA
LELYQGTLIFVSHDREFVSSLATRILEITPERVIDFSGNYEDYLRSKGIE
>P0A9U3 ~~~ybiT~~~Probable ATP-binding protein YbiT~~~COG0488
MLVSSNVTMQFGSKPLFENISVKFGGGNRYGLIGANGSGKSTFMKILGGDLEPTLGNVSLDPNERIGKLRQDQFAFEEFT
VLDTVIMGHKELWEVKQERDRIYALPEMSEEDGYKVADLEVKYGEMDGYSAEARAGELLLGVGIPVEQHYGPMSEVAPGW
KLRVLLAQALFADPDILLLDEPTNNLDIDTIRWLEQVLNERDSTMIIISHDRHFLNMVCTHMADLDYGELRVYPGNYDEY
MTAATQARERLLADNAKKKAQIAELQSFVSRFSANASKSRQATSRARQIDKIKLEEVKASSRQNPFIRFEQDKKLFRNAL
EVEGLTKGFDNGPLFKNLNLLLEVGEKLAVLGTNGVGKSTLLKTLVGDLQPDSGTVKWSENARIGYYAQDHEYEFENDLT
VFEWMSQWKQEGDDEQAVRSILGRLLFSQDDIKKPAKVLSGGEKGRMLFGKLMMQKPNILIMDEPTNHLDMESIESLNMA
LELYQGTLIFVSHDREFVSSLATRILEITPERVIDFSGNYEDYLRSKGIE
>P75791 ~~~ybiU~~~Uncharacterized protein YbiU~~~
MASTFTSDTLPADHKAAIRQMKHALRAQLGDVQQIFNQLSDDIATRVAEINALKAQGDAVWPVLSYADIKAGHVTAEQRE
QIKRRGCAVIKGHFPREQALGWDQSMLDYLDRNRFDEVYKGPGDNFFGTLSASRPEIYPIYWSQAQMQARQSEEMANAQS
FLNRLWTFESDGKQWFNPDVSVIYPDRIRRRPPGTTSKGLGAHTDSGALERWLLPAYQRVFANVFNGNLAQYDPWHAAHR
TEVEEYTVDNTTKCSVFRTFQGWTALSDMLPGQGLLHVVPIPEAMAYVLLRPLLDDVPEDELCGVAPGRVLPVSEQWHPL
LIEALTSIPKLEAGDSVWWHCDVIHSVAPVENQQGWGNVMYIPAAPMCEKNLAYAHKVKAALEKGASPGDFPREDYETNW
EGRFTLADLNIHGKRALGMDV
>P75806 3.6.1.27~~~ybjG~~~Putative undecaprenyl-diphosphatase YbjG~~~COG0671
MLENLNLSLFSLINATPDSAPWMISLAIFIAKDLITVVPLLAVVLWLWGLTAQRQLVIKIAIALAVSLFVSWTMGHLFPH
DRPFVENIGYNFLHHAADDSFPSDHGTVIFTFALAFLCWHRLWSGSLLMVLAVVIAWSRVYLGVHWPLDMLGGLLAGMIG
CLSAQIIWQAMGHKLYQRLQSWYRVCFALPIRKGWVRD
>P75809 3.1.3.104~~~ybjI~~~5-amino-6-(5-phospho-D-ribitylamino)uracil phosphatase YbjI~~~COG0561
MSIKLIAVDMDGTFLSDQKTYNRERFMAQYQQMKAQGIRFVVASGNQYYQLISFFPEIANEIAFVAENGGWVVSEGKDVF
NGELSKDAFATVVEHLLTRPEVEIIACGKNSAYTLKKYDDAMKTVAEMYYHRLEYVDNFDNLEDIFFKFGLNLSDELIPQ
VQKALHEAIGDIMVSVHTGNGSIDLIIPGVHKANGLRQLQKLWGIDDSEVVVFGDGGNDIEMLRQAGFSFAMENAGSAVV
AAAKYRAGSNNREGVLDVIDKVLKHEAPFDQ
>P75810 ~~~ybjJ~~~Inner membrane protein YbjJ~~~COG0738
MTVNSSRNALKRRTWALFMFFFLPGLLMASWATRTPAIRDILSVSIAEMGGVLFGLSIGSMSGILCSAWLVKRFGTRNVI
LVTMSCALIGMMILSLALWLTSPLLFAVGLGVFGASFGSAEVAINVEGAAVEREMNKTVLPMMHGFYSLGTLAGAGVGMA
LTAFGVPATVHILLAALVGIAPIYIAIQAIPDGTGKNAADGTQHGEKGVPFYRDIQLLLIGVVVLAMAFAEGSANDWLPL
LMVDGHGFSPTSGSLIYAGFTLGMTVGRFTGGWFIDRYSRVAVVRASALMGALGIGLIIFVDSAWVAGVSVVLWGLGASL
GFPLTISAASDTGPDAPTRVSVVATTGYLAFLVGPPLLGYLGEHYGLRSAMLVVLALVILAAIVAKAVAKPDTKTQTAME
NS
>P64439 ~~~ybjM~~~Inner membrane protein YbjM~~~
MKHKQRWAGAICCFVLFIVVCLFLATHMKGAFRAAGHPEIGLLFFILPGAVASFFSQRREVLKPLFGAMLAAPCSMLIMR
LFFSPTRSFWQELAWLLSAVFWCALGALCFLFISSLFKPQHRKNQ
>P0AAY6 ~~~ybjN~~~Uncharacterized protein YbjN~~~
MTSLVVPGLDTLRQWLDDLGMSFFECDNCQALHLPHMQNFDGVFDAKIDLIDNTILFSAMAEVRPSAVLPLAADLSAINA
SSLTVKAFLDMQDDNLPKLVVCQSLSVMQGVTYEQFAWFVRQSEEQISMVILEANAHQLLLPTDDEGQNNVTENYFLH
>P0AAZ0 ~~~ybjO~~~Inner membrane protein YbjO~~~
MEDETLGFFKKTSSSHARLNVPALVQVAALAIIMIRGLDVLMIFNTLGVRGIGEFIHRSVQTWSLTLVFLSSLVLVFIEI
WCAFSLVKGRRWARWLYLLTQITAASYLWAASLGYGYPELFSIPGESKREIFHSLMLQKLPDMLILMLLFVPSTSRRFFQ
LQ
>P75818 ~~~ybjP~~~Uncharacterized lipoprotein YbjP~~~
MRYSKLTMLIPCALLLSACTTVTPAYKDNGTRSGPCVEGGPDNVAQQFYDYRILHRSNDITALRPYLSDKLATLLSDASR
DNNHRELLTNDPFSSRTTLPDSAHVASASTIPNRDARNIPLRVDLKQGDQGWQDEVLMIQEGQCWVIDDVRYLGGSVHAT
AGTLRQSIENR
>P0A8C1 ~~~ybjQ~~~UPF0145 protein YbjQ~~~COG0393
MQFSTTPTLEGQTIVEYCGVVTGEAILGANIFRDFFAGIRDIVGGRSGAYEKELRKAREIAFEELGSQARALGADAVVGI
DIDYETVGQNGSMLMVSVSGTAVKTRR
>Q83LS2 ~~~ybjQ~~~UPF0145 protein YbjQ~~~
MQFSTTPTLEGLTIVEYCGVVTGEAILGANIFRDFFAGIRDIVGGRSGAYEKELRKAREIAFEELGSQARALGADAVVGI
DIDYETVGQNGSMLMVSVSGTAVKTRR
>P54427 3.5.2.6~~~ybxI~~~Probable beta-lactamase YbxI~~~COG2602
MKKWIYVVLVLSIAGIGGFSVHAASSAHEKHLNVSKMNVDDEFKDTDGTFILHDLQKDQTFVYNRKRANQRQTPQSTFKV
VNALIGLQVKAVRDEYDVKRWDGVKREFESWNRDHTLGSAMRESAIWYYQALARDIGEERMKTWLHTLSYGNEDISGGID
QFWLQSSLTISPLEQETFLEKLAKEELPFDKPVMKIVKRMMIQEEGDHYTLYGKTGTRLTDMGLGWFVGFIKTEHGSYVF
VTNVDDSGTKAKNITVDILKKYGLITS
>Q48452 2.7.10.-~~~~~~Putative tyrosine-protein kinase in cps region~~~
MTSISKKKQPETVDDLDFGRMVGELIDHRKIIIALTSFATLIALLYAFFATPIYKADALIQVEQKQANAILSNLSQMLPD
SQPQSAPEIALIQSRMILGKTVDDLNLQAVVSPKYFPIFGRGWARLSGEHQGNIQLSRLYVSSSIGDEENPPEFTLKVKD
SNRYVIEFGGEEINGKVGELIEKDGITLKIDEINAKPGAEFTIKYVSKLKAIADLQENLSVADQGKDTGILILSYLGDDP
LKIKNIVDSISENYLAQNISRQAAQDEKSLEFLNKQLPMVRSDLDSAEDKLNDFRKRNDSVDLSLEAKSVLDQIVNVDNQ
LNELTFRESEISQLYTKEHPTYKALMEKRKTLQDERGKLNKRVATMPETQQEILRLSRDVESGRAVYMQLLNRQQELNIA
KSSAIGNVRIIDSAVTQHKPVKPKKIIVVLAGLFIGLVISVSLVLVRILLRKGIETPEQLEELGINVYASIPVSESNPKN
VIAKRLNKRDDSRPKVLLATENPADLAIEAIRGLRTSLHFAMLEARNNLLMISGASPNAGKTFVSSNLSSVISQTGKKVI
FIDADLRKGYTHKLFNIKNTNGLSDYLSGRVALDKIINNLQTEGFDYISRGSVPPNPAELLMHNRLAELLEWANKSYDIV
ILDTPPILAVADAAIIGNYVGTTLLVARFEENTPKEIDISVKRFQNSGVNIKGCILNGVVKKATNKYGYGYNYYDYSYSD
KK
>P72583 ~~~~~~Ycf53-like protein~~~COG0515
MSDNLTELSQQLHDASEKKQLTAIAALAEMGEGGQGILLDYLAKNVPLEKPVLAVGNVYQTLRNLEQETITTQLQRNYPT
GIFPLQSAQGIDYLPLQEALGSQDFETADEITRDKLCELAGPGASQRQWLYFTEVEKFPALDLHTINALWWLHSNGNFGF
SVQRRLWLASGKEFTKLWPKIGWKSGNVWTRWPKGFTWDLSAPQGHLPLLNQLRGVRVAESLYRHPVWSQYGW
>P72777 ~~~~~~Ycf54-like protein~~~
MESWALTTPPIDIVNQYLFFIKRKTNYMATYYYALASQKFLLEEEPFEEVLKERRRDYGEKNKEIDFWQVIQPAFLNAPE
LAEAKAKAPEKNVAIVSTNKSFIVWVKLRLEYVLTGEFEAPSDAIPDPLASLD
>P21367 4.-.-.-~~~ycaC~~~Probable hydrolase YcaC~~~COG1335
MTKPYVRLDKNDAAVLLVDHQAGLLSLVRDIEPDKFKNNVLALGDLAKYFNLPTILTTSFETGPNGPLVPELKAQFPDTP
YIARPGNINAWDNEDFVKAVKATGKKQLIIAGVVTEVCVAFPALSAIEEGFDVFVVTDASGTFNEITRHSAWDRLSQAGA
QLMTWFGVACELHRDWRNDIEGLATLFSNHIPDYRNLMTSYDTLTKQK
>P21503 ~~~ycaD~~~Uncharacterized MFS-type transporter YcaD~~~COG0477
MSTYTQPVMLLLSGLLLLTLAIAVLNTLVPLWLAQEHMSTWQVGVVSSSYFTGNLVGTLLTGYVIKRIGFNRSYYLASFI
FAAGCAGLGLMIGFWSWLAWRFVAGVGCAMIWVVVESALMCSGTSRNRGRLLAAYMMVYYVGTFLGQLLVSKVSTELMSV
LPWVTGLTLAGILPLLFTRVLNQQAENHDSTSITSMLKLRQARLGVNGCIISGIVLGSLYGLMPLYLNHKGVSNASIGFW
MAVLVSAGILGQWPIGRLADKFGRLLVLRVQVFVVILGSIAMLSQAAMAPALFILGAAGFTLYPVAMAWACEKVEHHQLV
AMNQALLLSYTVGSLLGPSFTAMLMQNFSDNLLFIMIASVSFIYLLMLLRNAGHTPKPVAHV
>P43674 3.4.-.-~~~ycaL~~~Metalloprotease YcaL~~~COG0501
MKNTKLLLAIATSAALLTGCQNTHGIDTNMAISSGLNAYKAATLSDADAKAIANQGCAEMDSGNQVASKSSKYGKRLAKI
AKALGNNINGTPVNYKVYMTSDVNAWAMANGCVRVYSGLMDMMNDNEIEGVLGHELGHVALGHSLAEMKASYAIVAARDA
ISATSGVASQLSRSQLGDIAEGAINAKYSRDKESEADDFSFDLLKKRGISTQGLVGSFETLASLDGGRTQSMFDSHPPST
ERAQHIRDRIASGK
>P75835 ~~~ycaM~~~Inner membrane transporter YcaM~~~COG0531
MAGNVQEKQLRWYNIALMSFITVWGFGNVVNNYANQGLVVVFSWVFIFALYFTPYALIVGQLGSTFKDGKGGVSTWIKHT
MGPGLAYLAAWTYWVVHIPYLAQKPQAILIALGWAMKGDGSLIKEYSVVALQGLTLVLFIFFMWVASRGMKSLKIVGSVA
GIAMFVMSLLYVAMAVTAPAITEVHIATTNITWETFIPHIDFTYITTISMLVFAVGGAEKISPYVNQTRNPGKEFPKGML
CLAVMVAVCAILGSLAMGMMFDSRNIPDDLMTNGQYYAFQKLGEYYNMGNTLMVIYAIANTLGQVAALVFSIDAPLKVLL
GDADSKYIPASLCRTNASGTPVNGYFLTLVLVAILIMLPTLGIGDMNNLYKWLLNLNSVVMPLRYLWVFVAFIAVVRLAQ
KYKPEYVFIRNKPLAMTVGIWCFAFTAFACLTGIFPKMEAFTAEWTFQLALNVATPFVLVGLGLIFPLLARKANSK
>P75838 ~~~ycaO~~~Ribosomal protein S12 methylthiotransferase accessory factor YcaO~~~COG1944
MTQTFIPGKDAALEDSIARFQQKLSDLGFQIEEASWLNPVPNVWSVHIRDKECALCFTNGKGATKKAALASALGEYFERL
STNYFFADFWLGETIANGPFVHYPNEKWFPLTENDDVPEGLLDDRLRAFYDPENELTGSMLIDLQSGNEDRGICGLPFTR
QSDNQTVYIPMNIIGNLYVSNGMSAGNTRNEARVQGLSEVFERYVKNRIIAESISLPEIPADVLARYPAVVEAIETLEAE
GFPIFAYDGSLGGQYPVICVVLFNPANGTCFASFGAHPDFGVALERTVTELLQGRGLKDLDVFTPPTFDDEEVAEHTNLE
THFIDSSGLISWDLFKQDADYPFVDWNFSGTTEEEFATLMAIFNKEDKEVYIADYEHLGVYACRIIVPGMSDIYPAEDLW
LANNSMGSHLRETILSLPGSEWEKEDYLNLIEQLDEEGFDDFTRVRELLGLATGSDNGWYTLRIGELKAMLALAGGDLEQ
ALVWTEWTMEFNSSVFSPERANYYRCLQTLLLLAQEEDRQPLQYLNAFVRMYGADAVEAASAAMSGEAAFYGLQPVDSDL
HAFAAHQSLLKAYEKLQRAKAAFWAK
>P0AAZ7 ~~~ycaR~~~UPF0434 protein YcaR~~~COG2835
MDHRLLEIIACPVCNGKLWYNQEKQELICKLDNLAFPLRDGIPVLLETEARVLTADESKS
>P22525 2.-.-.-~~~ycbB~~~Probable L,D-transpeptidase YcbB~~~COG2989
MLLNMMCGRQLSAISLCLAVTFAPLFNAQADEPEVIPGDSPVAVSEQGEALPQAQATAIMAGIQPLPEGAAEKARTQIES
QLPAGYKPVYLNQLQLLYAARDMQPMWENRDAVKAFQQQLAEVAIAGFQPQFNKWVELLTDPGVNGMARDVVLSDAMMGY
LHFIANIPVKGTRWLYSSKPYALATPPLSVINQWQLALDKGQLPTFVAGLAPQHPQYAAMHESLLALLCDTKPWPQLTGK
ATLRPGQWSNDVPALREILQRTGMLDGGPKITLPGDDTPTDAVVSPSAVTVETAETKPMDKQTTSRSKPAPAVRAAYDNE
LVEAVKRFQAWQGLGADGAIGPATRDWLNVTPAQRAGVLALNIQRLRLLPTELSTGIMVNIPAYSLVYYQNGNQVLDSRV
IVGRPDRKTPMMSSALNNVVVNPPWNVPPTLARKDILPKVRNDPGYLESHGYTVMRGWNSREAIDPWQVDWSTITASNLP
FRFQQAPGPRNSLGRYKFNMPSSEAIYLHDTPNHNLFKRDTRALSSGCVRVNKASDLANMLLQDAGWNDKRISDALKQGD
TRYVNIRQSIPVNLYYLTAFVGADGRTQYRTDIYNYDLPARSSSQIVSKAEQLIR
>P40876 ~~~ycbF~~~Uncharacterized fimbrial chaperone YcbF~~~COG3121
MTNTWNRLALLIFAVLSLLVAGELQAGVVVGGTRFIFPADRESISILLTNTSQESWLINSKINRPTRWAGGEASTVPAPL
LAAPPLILLKPGTTGTLRLLRTESDILPVDRETLFELSIASVPSGKVENQSVKVAMRSVFKLFWRPEGLPGDPLEAYQQL
RWTRNSQGVQLTNPTPYYINLIQVSVNGKALSNVGVVPPKSQRQTSWCQAIAPCHVAWRAINDYGGLSAKKEQNLP
>P75859 ~~~ycbU~~~Uncharacterized fimbrial-like protein YcbU~~~COG3539
MKKKTIFQCVILFFSILNIHVGMAGPEQVSMHIYGNVVDQGCDVATKSALQNIHIGDFNISDFQAANTVSTAADLNIDIT
GCAAGITGADVLFSGEADTLAPTLLKLTDTGGSGGMATGIAVQILDAQSQQEIPLNQVQPLTPLKAGDNTLKYQLRYKST
KAGATGGNATAVLYFDLVYQ
>P75860 ~~~ycbV~~~Uncharacterized fimbrial-like protein YcbV~~~COG3539
MLKRIIWILFLLGLTWGCELFAHDGTVNISGSFRRNTCVLAQDSKQINVQLGDVSLTRFSHGNYGPEKSFIINLQDCGTD
VSTVDVTFSGTPDGVQSEMLSIESGTDAASGLAIAILDDAKILIPLNQASKDYSLHSGKVPLTFYAQLRPVNSDVQSGKV
NASATFVLHYD
>P75863 ~~~ycbX~~~Uncharacterized protein YcbX~~~COG0633
MATLIRLFIHPVKSMRGIGLTHALADVSGLAFDRIFMITEPDGTFITARQFPQMVRFTPSPVHDGLHLTAPDGSSAYVRF
ADFATQDAPTEVWGTHFTARIAPDAINKWLSGFFSREVQLRWVGPQMTRRVKRHNTVPLSFADGYPYLLANEASLRDLQQ
RCPASVKMEQFRPNLVVSGASAWEEDRWKVIRIGDVVFDVVKPCSRCIFTTVSPEKGQKHPAGEPLKTLQSFRTAQDNGD
VDFGQNLIARNSGVIRVGDEVEILATAPAKIYGAAAADDTANITQQPDANVDIDWQGQAFRGNNQQVLLEQLENQGIRIP
YSCRAGICGSCRVQLLEGEVTPLKKSAMGDDGTILCCSCVPKTALKLAR
>P0AAC6 ~~~yccA~~~Modulator of FtsH protease YccA~~~COG0670
MDRIVSSSHDRTSLLSTHKVLRNTYFLLSLTLAFSAITATASTVLMLPSPGLILTLVGMYGLMFLTYKTANKPTGIISAF
AFTGFLGYILGPILNTYLSAGMGDVIAMALGGTALVFFCCSAYVLTTRKDMSFLGGMLMAGIVVVLIGMVANIFLQLPAL
HLAISAVFILISSGAILFETSNIIHGGETNYIRATVSLYVSLYNIFVSLLSILGFASRD
>P0AB12 ~~~yccF~~~Inner membrane protein YccF~~~COG3304
MRTVLNILNFVLGGFATTLGWLLATLVSIVLIFTLPLTRSCWEITKLSLVPYGNEAIHVDELNPAGKNVLLNTGGTVLNI
FWLIFFGWWLCLMHIATGIAQCISIIGIPVGIANFKIAAIALWPVGRRVVSVETAQAAREANARRRFE
>P0AB14 ~~~yccJ~~~Uncharacterized protein YccJ~~~
MPTQEAKAHHVGEWASLRNTSPEIAEAIFEVAGYDEKMAEKIWEEGSDEVLVKAFAKTDKDSLFWGEQTIERKNV
>P52636 ~~~yccM~~~Putative electron transport protein YccM~~~COG0348
MAENKRTRWQRRPGTTGGKLPWNDWRNATTWRKATQLLLLAMNIYIAITFWYWVRYYETASSTTFVARPGGIEGWLPIAG
LMNLKYSLVTGQLPSVHAAAMLLLVAFIVISLLLKKAFCSWLCPVGTLSELIGDLGNKLFGRQCVLPRWLDIPLRGVKYL
LLSFFIYIALLMPAQAIHYFMLSPYSVVMDVKMLDFFRHMGTATLISVTVLLIASLFIRHAWCRYLCPYGALMGVVSLLS
PFKIRRNAESCIDCGKCAKNCPSRIPVDKLIQVRTVECTGCMTCVESCPVASTLTFSLQKPAANKKAFALSGWLMTLLVL
GIMFAVIGYAMYAGVWQSPVPEELYRRLIPQAPMIGH
>P75870 ~~~yccS~~~Inner membrane protein YccS~~~COG1289
MLSPLLKRYTWNSAWLYYARIFIALCGTTAFPWWLGDVKLTIPLTLGMVAAALTDLDDRLAGRLRNLIITLFCFFIASAS
VELLFPWPWLFAIGLTLSTSGFILLGGLGQRYATIAFGALLIAIYTMLGTSLYEHWYQQPMYLLAGAVWYNVLTLIGHLL
FPVRPLQDNLARCYEQLARYLELKSRMFDPDIEDQSQAPLYDLALANGLLMATLNQTKLSLLTRLRGDRGQRGTRRTLHY
YFVAQDIHERASSSHIQYQTLREHFRHSDVLFRFQRLMSMQGQACQQLSRCILLRQPYQHDPHFERAFTHIDAALERMRD
NGAPADLLKTLGFLLNNLRAIDAQLATIESEQAQALPHNNDENELADDSPHGLSDIWLRLSRHFTPESALFRHAVRMSLV
LCFGYAIIQITGMHHGYWILLTSLFVCQPNYNATRHRLKLRIIGTLVGIAIGIPVLWFVPSLEGQLVLLVITGVLFFAFR
NVQYAHATMFITLLVLLCFNLLGEGFEVALPRVIDTLIGCAIAWAAVSYIWPDWQFRNLPRMLERATEANCRYLDAILEQ
YHQGRDNRLAYRIARRDAHNRDAELASVVSNMSSEPNVTPQIREAAFRLLCLNHTFTSYISALGAHREQLTNPEILAFLD
DAVCYVDDALHHQPADEERVNEALASLKQRMQQLEPRADSKEPLVVQQVGLLIALLPEIGRLQRQITQVPQETPVSA
>P75874 ~~~yccU~~~Uncharacterized protein YccU~~~COG1832
MKETDIAGILTSTHTIALVGASDKPDRPSYRVMKYLLDQGYHVIPVSPKVAGKTLLGQKGYGTLADVPEKVDMVDVFRNS
EAAWGVAQEAIAIGAKTLWMQLGVINEQAAVLARDAGLNVVMDRCPAIEIPRLGLAK
>O34538 ~~~ycdA~~~Uncharacterized lipoprotein YcdA~~~
MFQKKTYAVFLILLLMMFTAACSGSKTSAEKKESETEKSSDIAQVKIKDVSYTLPSKYDKSTSDDQLVLKVNVAVKNTGK
DPLNVDSMDFTLYQGDTKMSDTDPEDYSEKLQGSTINADKSVEGNLFFVVDKGKQYELNYTPESYGDKKPKSVTFKIDGK
DKKILATADKLQDSAKALSAYVDVLLFGKDNADFEKITGANKNEIVNDFNESAKDGYLSASGLSSTYADSKALDNIVNGI
KEGLSKNSSIQAKTTSISKDEAIVEATVKPVDASSLSDRIEDKVKDYYSKNSSASYEEAVKYALQVYPEEFKKLGPASSE
KTVEVKMKKNDIDQWQLDMDDYRAAELVEAFIKE
>P75914 3.1.3.-~~~ycdX~~~Probable phosphatase YcdX~~~COG1387
MYPVDLHMHTVASTHAYSTLSDYIAQAKQKGIKLFAITDHGPDMEDAPHHWHFINMRIWPRVVDGVGILRGIEANIKNVD
GEIDCSGKMFDSLDLIIAGFHEPVFAPHDKATNTQAMIATIASGNVHIISHPGNPKYEIDVKAVAEAAAKHQVALEINNS
SFLHSRKGSEDNCREVAAAVRDAGGWVALGSDSHTAFTMGEFEECLKILDAVDFPPERILNVSPRRLLNFLESRGMAPIA
EFADL
>P75915 ~~~ycdY~~~Chaperone protein YcdY~~~COG3381
MNEFSILCRVLGSLYYRQPQDPLLVPLFTLIREGKLAANWPLEQDELLTRLQKSCDMTQVSADYNALFIGDECAVPPYRS
AWVEGATEAEVRAFLSERGMPLADTPADHIGTLLLAASWLEDQSTEDESEALETLFSEYLLPWCGAFLGKVEAHATTPFW
RTMAPLTRDAISAMWDELEEDSEE
>P75916 ~~~ycdZ~~~Inner membrane protein YcdZ~~~
MNILLSIAITTGILSGIWGWVAVSLGLLSWAGFLGCTAYFACPQGGLKGLAISAATLLSGVVWAMVIIYGSALAPHLEIL
GYVITGIVAFLMCIQAKQLLLSFVPGTFIGACATFAGQGDWKLVLPSLALGLIFGYAMKNSGLWLAARSAKTAHREQEIK
NKA
>P0AB26 ~~~yceB~~~Uncharacterized lipoprotein YceB~~~
MNKFLFAAALIVSGLLVGCNQLTQYTITEQEINQSLAKHNNFSKDIGLPGVADAHIVLTNLTSQIGREEPNKVTLTGDAN
LDMNSLFGSQKATMKLKLKALPVFDKEKGAIFLKEMEVVDATVQPEKMQTVMQTLLPYLNQALRNYFNQQPAYVLREDGS
QGEAMAKKLAKGIEVKPGEIVIPFTD
>P0AB28 ~~~yceD~~~Large ribosomal RNA subunit accumulation protein YceD~~~COG1399
MQKVKLPLTLDPVRTAQKRLDYQGIYTPDQVERVAESVVSVDSDVECSMSFAIDNQRLAVLNGDAKVTVTLECQRCGKPF
THQVYTTYCFSPVRSDEQAEALPEAYEPIEVNEFGEIDLLAMVEDEIILALPVVPVHDSEHCEVSEADMVFGELPEEAQK
PNPFAVLASLKRK
>P0A1T2 ~~~yceD~~~Large ribosomal RNA subunit accumulation protein YceD~~~
MQKVKLPLTLDPVRTAQKRLDYQGIYTPDQVERVAESVVSVDSDVECSMSFAIDNQRLAVLTGDAVVTVSLECQRCGKPF
THQVHTTYCFSPVRSDEQAEALPEAYEPIEVNEFGEIDLLATVEDEIILALPVVPVHDSEHCEVSEADMVFGELPDEAQK
PNPFAVLASLKRK
>P29217 ~~~yceH~~~UPF0502 protein YceH~~~COG3132
MKYQLTALEARVIGCLLEKQVTTPEQYPLSVNGVVTACNQKTNREPVMNLSESEVQEQLDNLVKRHYLRTVSGFGNRVTK
YEQRFCNSEFGDLKLSAAEVALITTLLLRGAQTPGELRSRAARMYEFSDMAEVESTLEQLANREDGPFVVRLAREPGKRE
NRYMHLFSGEVEDQPAVTDMSNAVDGDLQARVEALEIEVAELKQRLDSLLAHLGD
>P0A8X2 ~~~yceI~~~Protein YceI~~~COG2353
MKKSLLGLTFASLMFSAGSAVAADYKIDKEGQHAFVNFRIQHLGYSWLYGTFKDFDGTFTFDEKNPAADKVNVTINTTSV
DTNHAERDKHLRSADFLNTAKYPQATFTSTSVKKDGDELDITGDLTLNGVTKPVTLEAKLIGQGDDPWGGKRAGFEAEGK
IKLKDFNIKTDLGPASQEVDLIISVEGVQQK
>P75931 1.-.-.-~~~yceM~~~Putative oxidoreductase YceM~~~COG0673
MKKLRIGVVGLGGIAQKAWLPVLAAASDWTLQGAWSPTRAKALPICESWRIPYADSLSSLAASCDAVFVHSSTASHFDVV
STLLNAGVHVCVDKPLAENLRDAERLVELAARKKLTLMVGFNRRFAPLYGELKTQLATAASLRMDKHRSNSVGPHDLYFT
LLDDYLHVVDTALWLSGGKASLDGGTLLTNDAGEMLFAEHHFSAGPLQITTCMHRRAGSQRETVQAVTDGALIDITDMRE
WREERGQGVVHKPIPGWQSTLEQRGFVGCARHFIECVQNQTVPQTAGEQAVLAQRIVDKIWRDAMSE
>P37168 1.-.-.-~~~yceM~~~Putative oxidoreductase YceM~~~
MRTLRIGIVGLGGIAQKAWLPVLTNTAGWTLQGAWSPSRDKALRICESWRIPYVDSLANLASSCDAVFVHSSTASHYAVV
SELLNAGVHVCVDKPLAENLRDAERLVALAAQKKLTLMVGFNRRFAPLYRELKTRLGTAASLRMDKHRTDSVGPHDLRFT
LLDDYLHVVDTALWLAGGEARLASGTLLTSESGEMCYAEHHFSADKLQITTSMHRRAGSQRESVQAVTDGGLYDVTDMRE
WREERGQGILIKPIPSWQTTLEQRGFVGCARHFIDCVQNQTVPETAGEQAILAQRVVEALWRDAISE
>P64442 ~~~yceO~~~Uncharacterized protein YceO~~~
MRPFLQEYLMRRLLHYLINNIREHLMLYLFLWGLLAIMDLIYVFYF
>Q55438 ~~~~~~Photosystem II reaction center protein Ycf12~~~
MELLAALNLEPIFQLTFLGLIVLAGPAVVFVLAFRGGDL
>Q8DJI1 ~~~~~~Photosystem II reaction center protein Ycf12~~~
MGIFNGIIEFLSNINFEVIAQLTMIAMIGIAGPMIIFLLAVRRGNL
>P73069 ~~~~~~Photosystem II assembly lipoprotein Ycf48~~~COG4447
MPVKFPSLKFEQLKQLVLVAAIAVFCVSCSHVPDLAFNPWQEIALETDSTFADIAFTEDPNHGWLVGTKETIFETTDGGD
TWEQKLIDLGEEKASFSAVSFSGNEGWITGKPSILLHTTDGGQTWARIPLSEKLPGAPYSIIALGPQTAEMITDLGAIYK
TTNGGKNWKALVEGAVGVARTIQRSTDGRYVAVSARGNFYSTWAPGQTEWTPHNRNSSRRLQTMGYGKDGQLWLLARGGQ
LQFSTDPDAEEWSDVIAPQDKGSWGLLDLSFRTPEEVWVAGASGNLLMSQDGGQTWAKDTGVEDIPANLYRVVFLSPEKG
FVLGQDGILLKYNPSTEVAMVP
>Q8DI95 ~~~~~~Photosystem II assembly protein Ycf48~~~COG4447
MFAKQIDIHWQKMKGIKFLHWLLGTVLLWVSLSTPALAIPALDYNPWEAIQLPTTATILDMSFIDRHHGWLVGVNATLME
TRDGGQTWEPRTLVLDHSDYRFNSVSFQGNEGWIVGEPPIMLHTTDGGQSWSQIPLDPKLPGSPRLIKALGNGSAEMITN
VGAIYRTKDSGKNWQALVQEAIGVMRNLNRSPSGEYVAVSSRGSFYSTWEPGQTAWEPHNRTTSRRLHNMGFTPDGRLWM
IVNGGKIAFSDPDNSENWGELLSPLRRNSVGFLDLAYRTPNEVWLAGGAGALLCSQDGGQTWQQDVDVKKVPSNFYKILF
FSPDQGFILGQKGILLRYVTDLTAAPA
>P0AFQ7 3.1.-.-~~~ycfH~~~Uncharacterized metal-dependent hydrolase YcfH~~~COG0084
MFLVDSHCHLDGLDYESLHKDVDDVLAKAAARDVKFCLAVATTLPGYLHMRDLVGERDNVVFSCGVHPLNQNDPYDVEDL
RRLAAEEGVVALGETGLDYYYTPETKVRQQESFIHHIQIGRELNKPVIVHTRDARADTLAILREEKVTDCGGVLHCFTED
RETAGKLLDLGFYISFSGIVTFRNAEQLRDAARYVPLDRLLVETDSPYLAPVPHRGKENQPAMVRDVAEYMAVLKGVAVE
ELAQVTTDNFARLFHIDASRLQSIR
>P0A8E1 ~~~ycfP~~~UPF0227 protein YcfP~~~COG3150
MIIYLHGFDSNSPGNHEKVLQLQFIDPDVRLISYSTRHPKHDMQHLLKEVDKMLQLNVDERPLICGVGLGGYWAERIGFL
CDIRQVIFNPNLFPYENMEGKIDRPEEYADIATKCVTNFREKNRDRCLVILSRNDEALNSQRTSEELHHYYEIVWDEEQT
HKFKNISPHLQRIKAFKTLG
>P75954 2.-.-.-~~~ycfS~~~Probable L,D-transpeptidase YcfS~~~COG1376
MMIKTRFSRWLTFFTFAAAVALALPAKANTWPLPPAGSRLVGENKFHVVENDGGSLEAIAKKYNVGFLALLQANPGVDPY
VPRAGSVLTIPLQTLLPDAPREGIVINIAELRLYYYPPGKNSVTVYPIGIGQLGGDTLTPTMVTTVSDKRANPTWTPTAN
IRARYKAQGIELPAVVPAGLDNPMGHHAIRLAAYGGVYLLHGTNADFGIGMRVSSGCIRLRDDDIKTLFSQVTPGTKVNI
INTPIKVSAEPNGARLVEVHQPLSEKIDDDPQLLPITLNSAMQSFKDAAQTDAEVMQHVMDVRSGMPVDVRRHQVSPQTL
>P75955 ~~~ycfT~~~Inner membrane protein YcfT~~~COG4763
MKQKELWINQIKGLCICLVVIYHSVITFYPHLTTFQHPLSEVLSKCWIYFNLYLAPFRMPVFFFISGYLIRRYIDSVPWG
NCLDKRIWNIFWVLALWGVVQWLALSALNQWLAPERDLSNASNAAYADSTGEFLHGMITASTSLWYLYALIVYFVVCKIF
SRLALPLFALFVLLSVAVNFVPTPWWGMNSVIRNLPYYSLGAWFGATIMTCVKEVPLRRHLLMASLLTVLAVGAWLFTIS
LLLSLVSIVVIMKLFYQYEQRFGMRSTSLLNVIGSNTIAIYTTHRILVEIFSLTLLAQMNAARWSPQVELTLLLVYPFVS
LFICTVAGLLVRKLSQRAFSDLLFSPPSLPAAVSYSR
>P75961 ~~~ycfZ~~~Inner membrane protein YcfZ~~~COG1512
MKKFIILLSLLILLPLTAASKPLIPIMKTLFTDVTGTVPDAEEIAHKAELFRQQTGIAPFIVVLPDINNEASLRQNGKAM
LAHASSSLSDVKGSVLLLFTTREPRLIMITNGQVESGLDDKHLGLLIENHTLAYLNADLWYQGINNALAVLQAQILKQST
PPLTYYPHPGQQHENAPPGSTNTLGFIAWAATFILFSRIFYYTTRFIYALKFAVAMTIANMGYQALCLYIDNSFAITRIS
PLWAGLIGVCTFIAALLLTSKR
>O31474 2.1.1.-~~~ycgJ~~~Uncharacterized methyltransferase YcgJ~~~COG2226
MTNETPFSKNAEMYRDEKVFAEGEDLGLMIKTAECRAEHRVLDIGAGAGHTALAFSPYVQECIGVDATKEMVEVASSFAQ
EKGVENVRFQQGTAESLPFPDDSFDIITCRYAAHHFSDVRKAVREVARVLKQDGRFLLVDHYAPEDPVLDEFVNHLNRLR
DPSHVRESSLSEWQAMFSANQLAYQDIQKWNLPIQYDSWIKRGGTPADREKQIITHLNHASDEARDTFCITLNQNGQPIS
FCLKAILIQGIKR
>P0AB44 ~~~ycgL~~~Protein YcgL~~~COG3100
MPKPGILKSKSMFCVIYRSSKRDQTYLYVEKKDDFSRVPEELMKGFGQPQLAMILPLDGRKKLVNADIEKVKQALTEQGY
YLQLPPPPEDLLKQHLSVMGQKTDDTNK
>P0AB43 ~~~ycgL~~~Protein YcgL~~~COG3100
MPKPGILKSKSMFCVIYRSSKRDQTYLYVEKKDDFSRVPEELMKGFGQPQLAMILPLDGRKKLVNADIEKVKQALTEQGY
YLQLPPPPEDLLKQHLSVMGQKTDDTNK
>P76004 ~~~ycgM~~~Uncharacterized protein YcgM~~~COG0179
MYQHHNWQGALLDYPVSKVVCVGSNYAKHIKEMGSAVPEEPVLFIKPETALCDLRQPLAIPSDFGSVHHEVELAVLIGAT
LRQATEEHVRKAIAGYGVALDLTLRDVQGKMKKAGQPWEKAKAFDNSCPLSGFIPAAEFTGDPQNTTLSLSVNGEQRQQG
TTADMIHKIVPLIAYMSKFFTLKAGDVVLTGTPDGVGPLQSGDELTVTFDGHSLTTRVL
>Q9KNC3 ~~~~~~Cyclic di-GMP binding protein VCA0042~~~
MNSRPAEKIDNNDGQTETPRSKTVSTINSTDALAMVEHSSELTLSITTPVGTKFVCRTPFIGTHTDKFLLVEMPKISADD
LQYFFQEGFWMNIRAISPRGEGALIHFRSQLMHILQEPVPMAFLSIPNTMQVSQLRKEPRFELNLAGKVLFDEHRGDCEL
RDLSRSGCRFITPPLGKTYQVGDLVALEIFSDLRGTKTFPPLTGKICNLQRSLHHARYGLEFNEEGRNNAKNLLAQLKFN
GTKLTLNAEKKA
>P76010 ~~~ycgR~~~Flagellar brake protein YcgR~~~COG5581
MSHYHEQFLKQNPLAVLGVLRDLHKAAIPLRLSWNGGQLISKLLAITPDKLVLDFGSQAEDNIAVLKAQHITITAETQGA
KVEFTVEQLQQSEYLQLPAFITVPPPTLWFVQRRRYFRISAPLHPPYFCQTKLADNSTLRFRLYDLSLGGMGALLETAKP
AELQEGMRFAQIEVNMGQWGVFHFDAQLISISERKVIDGKNETITTPRLSFRFLNVSPTVERQLQRIIFSLEREAREKAD
KVRD
>Q88EQ6 ~~~ycgR~~~Flagellar brake protein YcgR~~~COG5581
MFNESDAPQPPKVLSTPLEIAANLRQLQESHDPLIITFHDRSHRFQSYVVHVDRESNTLALDEMIPRDGEKFIENGEHFR
VEGFHDGVRIAWECDHALKISEVDGHRCYSGPLPQEVTYHQRRNAFRAALKLSQLVDIILDGAHLKGNGAMRGKLLDISA
TGCKLRFEGNVEDRLQLGQVYERFKAGNPLGLVDTMVELRHLHYEERINTTFAGVRFHNLSGQAQRKIESFVYQLQREAR
RFDKDDY
>Q8ZP19 ~~~ycgR~~~Flagellar brake protein YcgR~~~
MSGYNEQFLKKNPLAILGVLRDLNKNQVPLRISWAHGQFISKILAVDPEKLIVDYGSQEYENSAVLRAGQVAIIAETQGA
KVEFTLPQLVTGEYQRLPAFITPLPSSLWFVQRREYFRIGAPLYPPYYGVTTLPDTRTLRFRLFDLSLGGMGALLESAIP
DGLIEGARFSQVELNMGQWGIFHVDAQLISISERKVIDGKNETITTPRLSFRFLNVSPAVERELQRIIFSLEREARERAN
KVRE
>P75991 ~~~ycgZ~~~Probable two-component-system connector protein YcgZ~~~
MHQNSVTLDSAGAITRYFAKANLHTQQETLGEIVTEILKDGRNLSRKSLCAKLLCRLEHATGEEEQKHYNALIGLLFE
>P37518 ~~~ychF~~~Ribosome-binding ATPase YchF~~~COG0012
MALTAGIVGLPNVGKSTLFNAITQAGAESANYPFCTIDPNVGIVEVPDDRLQKLTELVNPKKTVPTAFEFTDIAGIVKGA
SKGEGLGNKFLSHIRQVDAICHVVRAFSDDNITHVSGKVDPIDDIETINLELILADMETVEKRITRVSKLAKQKDKDAVF
EFEILSKLKEAFESEKPARSVEFTEEQQKLVKQLHLLTSKPILYVANVSEDEVADPSGNENVAKIREYAAGENAEVIVVC
AKIESEIAELEGEEKQMFLEELGIQESGLDQLIKASYSLLGLATYFTAGEQEVRAWTFKKGMKAPECAGIIHSDFERGFI
RAETVAYEDLLAGGGMAGAKEAGKVRLEGKEYVVQDGDVIHFRFNV
>P0ABU2 ~~~ychF~~~Ribosome-binding ATPase YchF~~~COG0012
MGFKCGIVGLPNVGKSTLFNALTKAGIEAANFPFCTIEPNTGVVPMPDPRLDQLAEIVKPQRTLPTTMEFVDIAGLVKGA
SKGEGLGNQFLTNIRETEAIGHVVRCFENDNIIHVSGKVNPADDIEVINTELALADLDTCERAIHRVQKKAKGGDKDAKA
ELAVLEKCLPQLENAGMLRALDLSAEEKAAIRYLSFLTLKPTMYIANVNEDGFENNPYLDQVREIAAKEGSVVVPVCAAV
EADIAELDDEERDEFMQELGLEEPGLNRVIRAGYKLLNLQTYFTAGVKEVRAWTIPVGATAPQAAGKIHTDFEKGFIRAQ
TISFEDFITYKGEQGAKEAGKMRAEGKDYIVKDGDVMNFLFNV
>P44681 ~~~ychF~~~Ribosome-binding ATPase YchF~~~COG0012
MGFKCGIVGLPNVGKSTLFNALTKAGIEAANYPFCTIEPNTGVVPMPDPRLDALAEIVKPERILPTTMEFVDIAGLVAGA
SKGEGLGNKFLANIRETDAIGHVVRCFENDDIVHVAGKIDPLDDIDTINTELALADLDSCERAIQRLQKRAKGGDKEAKF
ELSVMEKILPVLENAGMIRSVGLDKEELQAIKSYNFLTLKPTMYIANVNEDGFENNPYLDRVREIAAKEGAVVVPVCAAI
ESEIAELDDEEKVEFLQDLGIEEPGLNRVIRAGYALLNLQTYFTAGVKEVRAWTVSVGATAPKAAAVIHTDFEKGFIRAE
VIAYEDFIQFNGENGAKEAGKWRLEGKDYIVQDGDVMHFRFNV
>P37052 ~~~ychJ~~~UPF0225 protein YchJ~~~COG3012
MSQLCPCGSAVEYSLCCHPYVSGEKVAPDPEHLMRSRYCAFVMQDADYLIKTWHPSCGAAALRAELMAGFAHTEWLGLTV
FEHCWQDADNIGFVSFVARFTEGGKTGAIIERSRFLKENGQWYYIDGTRPQFGRNDPCPCGSGKKFKKCCGQ
>P0AB52 ~~~ychN~~~Protein YchN~~~COG1553
MQKIVIVANGAPYGSESLFNSLRLAIALREQESNLDLRLFLMSDAVTAGLRGQKPGEGYNIQQMLEILTAQNVPVKLCKT
CTDGRGISTLPLIDGVEIGTLVELAQWTLSADKVLTF
>C0SP99 2.-.-.-~~~yciB~~~Putative L,D-transpeptidase YciB~~~COG1376
MKLSLFIIAVLMPVILLSACSDHAEEHASINTKKTVENITDVRKTAKTSIDWTKPSGGEYPDIKQKHVWIDVNVKEQKAY
IKEGSNTIYTMMISSGLDQTKDDATPKGTFYVEPERGEWFFSEGYQEGAEYWVSWKNHGEFLFHSVPMTKDQKVIKTEAE
KLGTKASHGCIRLTIPDAKWVYENIPEHTKVVIS
>P0A710 ~~~yciB~~~Inner membrane-spanning protein YciB~~~COG2917
MKQFLDFLPLVVFFAFYKIYDIYAATAALIVATAIVLIYSWVRFRKVEKMALITFVLVVVFGGLTLFFHNDEFIKWKVTV
IYALFAGALLVSQWVMKKPLIQRMLGKELTLPQPVWSKLNLAWAVFFILCGLANIYIAFWLPQNIWVNFKVFGLTALTLI
FTLLSGIYIYRHMPQEDKS
>P95745 ~~~yciB~~~Inner membrane-spanning protein YciB~~~
MKQFLDFLPLVVFFAFYKIYDIYAATAALIVATAIVLIYSWVRFRKVEKMALITFVLVVVFGGLTLFFHNDEFIKWKVTV
IYALFAGALLVSQWVMKKPLIQRMLSKELTLPQPVWSKLNLAWAVFFILCGLANIYIAFWLPQNIWVNFKVFGLTALTLI
FTLLSGIYIYRHMPQEDKS
>P94400 3.6.5.-~~~yciC~~~Zinc chaperone YciC~~~COG0523
MKKIPVTVLSGYLGAGKTTLLNSILQNREGLKIAVIVNDMSEVNIDAGLVKQEGGLSRTDEKLVEMSNGCICCTLREDLL
IEVEKLAKDGRFDYIVIESTGISEPIPVAQTFSYIDEEMGIDLTKFCQLDTMVTVVDANRFWHDYQSGESLLDRKEALGE
KDEREIADLLIDQIEFCDVLILNKCDLVSEQELEQLENVLRKLQPRARFIRSVKGNVKPQEILHTGLFNFEEASGSAGWI
QELTAGHAEHTPETEEYGISSFVYKRRLPFHSTRFYRWLDQMPKNVVRAKGIVWCASHNNLALLMSQAGPSVTIEPVSYW
VAALPKLEQEQVKQQEPEILEEWDPEFGDRLTQLVFIGTDLDEETITKELDQCLLTEYEFDSDWSLFEDPFKWKLNQ
>P21365 ~~~yciC~~~UPF0259 membrane protein YciC~~~
MSITAQSVYRDTGNFFRNQFMTILLVSLLCAFITVVLGHVFSPSDAQLAQLNDGVPVSGSSGLFDLVQNMSPEQQQILLQ
ASAASTFSGLIGNAILAGGVILIIQLVSAGQRVSALRAIGASAPILPKLFILIFLTTLLVQIGIMLVVVPGIIMAILLAL
APVMLVQDKMGVFASMRSSMRLTWANMRLVAPAVLSWLLAKTLLLLFASSFAALTPEIGAVLANTLSNLISAILLIYLFR
LYMLIRQ
>P21363 ~~~yciE~~~Protein YciE~~~COG3685
MNRIEHYHDWLRDAHAMEKQAESMLESMASRIDNYPELRARIEQHLSETKNQIVQLETILDRNDISRSVIKDSMSKMAAL
GQSIGGIFPSDEIVKGSISGYVFEQFEIACYTSLLAAAKNAGDTASIPTIEAILNEEKQMADWLIQNIPQTTEKFLIRSE
TDGVEAKK
>P21362 ~~~yciF~~~Protein YciF~~~COG3685
MNMKTIEDVFIHLLSDTYSAEKQLTRALAKLARATSNEKLSQAFHAHLEETHGQIERIDQVVESESNLKIKRMKCVAMEG
LIEEANEVIESTEKNEVRDAALIAAAQKVEHYEIASYGTLATLAEQLGYRKAAKLLKETLEEEKATDIKLTDLAINNVNK
KAENKA
>P08245 ~~~yciH~~~Uncharacterized protein YciH~~~COG0023
MSDSNSRLVYSTETGRIDEPKAAPVRPKGDGVVRIQRQTSGRKGKGVCLITGVDLDDAELTKLAAELKKKCGCGGAVKDG
VIEIQGDKRDLLKSLLEAKGMKVKLAGG
>P0AB55 ~~~yciI~~~Protein YciI~~~COG2350
MLYVIYAQDKADSLEKRLSVRPAHLARLQLLHDEGRLLTAGPMPAVDSNDPGAAGFTGSTVIAEFESLEAAQAWADADPY
VAAGVYEHVSVKPFKKVF
>P31808 1.-.-.-~~~yciK~~~Uncharacterized oxidoreductase YciK~~~COG1028
MHYQPKQDLLNDRIILVTGASDGIGREAAMTYARYGATVILLGRNEEKLRQVASHINEETGRQPQWFILDLLTCTSENCQ
QLAQRIAVNYPRLDGVLHNAGLLGDVCPMSEQNPQVWQDVMQVNVNATFMLTQALLPLLLKSDAGSLVFTSSSVGRQGRA
NWGAYAASKFATEGMMQVLADEYQQRLRVNCINPGGTRTAMRASAFPTEDPQKLKTPADIMPLYLWLMGDDSRRKTGMTF
DAQPGRKPGISQ
>P0AB61 ~~~yciN~~~Protein YciN~~~
MNKETQPIDRETLLKEANKIIREHEDTLAGIEATGVTQRNGVLVFTGDYFLDEQGLPTAKSTAVFNMFKHLAHVLSEKYH
LVD
>P0AB64 ~~~yciN~~~Protein YciN~~~
MNKETQPIDRETLLKEANKIIREHEDTLAGIEATGVTQRNGVLVFTGDYFLDEQGLPTAKSTAVFNMFKHLAHVLSEKYH
LVD
>P0AFR4 ~~~yciO~~~Uncharacterized protein YciO~~~COG0009
MSQFFYIHPDNPQQRLINQAVEIVRKGGVIVYPTDSGYALGCKIEDKNAMERICRIRQLPDGHNFTLMCRDLSELSTYSF
VDNVAFRLMKNNTPGNYTFILKGTKEVPRRLLQEKRKTIGMRVPSNPIAQALLEALGEPMLSTSLMLPGSEFTESDPEEI
KDRLEKQVDLIIHGGYLGQKPTTVIDLTDDTPVVVREGVGDVKPFL
>P0A8L7 ~~~yciU~~~UPF0263 protein YciU~~~COG3099
MDMDLNNRLTEDETLEQAYDIFLELAADNLDPADVLLFNLQFEERGGAELFDPAEDWQEHVDFDLNPDFFAEVVIGLADS
EDGEINDVFARILLCREKDHKLCHIIWRE
>A5A614 ~~~yciZ~~~UPF0509 protein YciZ~~~
MSEFDAQRVAERIDIVLDILVAGDYHSAIHNLEILKAELLRQVAESTPDIPKAPWEI
>P0A8R7 ~~~ycjF~~~UPF0283 membrane protein YcjF~~~COG3768
MTEPLKPRIDFDGPLEVDQNPKFRAQQTFDENQAQNFAPATLDEAQEEEGQVEAVMDAALRPKRSLWRKMVMGGLALFGA
SVVGQGVQWTMNAWQTQDWVALGGCAAGALIIGAGVGSVVTEWRRLWRLRQRAHERDEARDLLHSHGTGKGRAFCEKLAQ
QAGIDQSHPALQRWYASIHETQNDREVVSLYAHLVQPVLDAQARREISRSAAESTLMIAVSPLALVDMAFIAWRNLRLIN
RIATLYGIELGYYSRLRLFKLVLLNIAFAGASELVREVGMDWMSQDLAARLSTRAAQGIGAGLLTARLGIKAMELCRPLP
WIDDDKPRLGDFRRQLIGQVKETLQKGKTPSEK
>P0AFR7 ~~~ycjO~~~Inner membrane ABC transporter permease protein YcjO~~~COG1175
MNRLFSGRSDMPFALLLLAPSLLLLGGLVAWPMVSNIEISFLRLPLNPNIESTFVGVSNYVRILSDPGFWHSLWMTVWYT
ALVVAGSTVLGLAVAMFFNREFRLRKTARSLVILSYVTPSISLVFAWKYMFNNGYGIVNYLGVDLLHLYEQAPLWFDNPG
SSFVLVVLFAIWRYFPYAFISFLAILQTIDKSLYEAAEMDGANAWQRFRIVTLPAIMPVLATVVTLRTIWMFYMFADVYL
LTTKVDILGVYLYKTAFAFNDLGKAAAISVVLFIIIFAVILLTRKRVNLNGNK
>P77716 ~~~ycjP~~~Inner membrane ABC transporter permease protein YcjP~~~COG0395
MATNKRTLSRIGFYCGLALFLIITLFPFFVMLMTSFKGAKEAISLHPTLLPQQWTLEHYVDIFNPMIFPFVDYFRNSLVV
SVVSSVVAVFLGILGAYALSRLRFKGRMTINASFYTVYMFSGILLVVPLFKIITALGIYDTEMALIITMVTQTLPTAVFM
LKSYFDTIPDEIEEAAMMDGLNRLQIIFRITVPLAMSGLISVFVYCFMVAWNDYLFASIFLSSASNFTLPVGLNALFSTP
DYIWGRMMAASLVTALPVVIMYALSERFIKSGLTAGGVKG
>P76043 1.1.1.-~~~ycjQ~~~D-guloside 3-dehydrogenase~~~COG1063
MKKLVATAPRVAALVEYEDRAILANEVKIRVRFGAPKHGTEVVDFRAASPFIDEDFNGEWQMFTPRPADAPRGIEFGKFQ
LGNMVVGDIIECGSDVTDYAVGDSVCGYGPLSETVIINAVNNYKLRKMPQGSSWKNAVCYDPAQFAMSGVRDANVRVGDF
VVVVGLGAIGQIAIQLAKRAGASVVIGVDPIAHRCDIARRHGADFCLNPIGTDVGKEIKTLTGKQGADVIIETSGYADAL
QSALRGLAYGGTISYVAFAKPFAEGFNLGREAHFNNAKIVFSRACSEPNPDYPRWSRKRIEETCWELLMNGYLNCEDLID
PVVTFANSPESYMQYVDQHPEQSIKMGVTF
>P76044 5.1.3.-~~~ycjR~~~3-dehydro-D-guloside 4-epimerase~~~COG1082
MKIGTQNQAFFPENILEKFRYIKEMGFDGFEIDGKLLVNNIEEVKAAIKETGLPVTTACGGYDGWIGDFIEERRLNGLKQ
IERILEALAEVGGKGIVVPAAWGMFTFRLPPMTSPRSLDGDRKMVSDSLRVLEQVAARTGTVVYLEPLNRYQDHMINTLA
DARRYIVENDLKHVQIIGDFYHMNIEEDNLAQALHDNRDLLGHVHIADNHRYQPGSGTLDFHALFEQLRADNYQGYVVYE
GRIRAEDPAQAYRDSLAWLRTC
>P77503 1.1.1.-~~~ycjS~~~D-glucoside 3-dehydrogenase~~~COG0673
MKSAMTSSPLRVAIIGAGQVADKVHASYYCTRNDLELVAVCDSRLSQAQALAEKYGNASVWDDPQAMLLAVKPDVVSVCS
PNRFHYEHTLMALEAGCHVMCEKPPAMTPEQAREMCDTARKLGKVLAYDFHHRFALDTQQLREQVTNGVLGEIYVTTARA
LRRCGVPGWGVFTNKELQGGGPLIDIGIHMLDAAMYVLGFPAVKSVNAHSFQKIGTQKSCGQFGEWDPATYSVEDSLFGT
IEFHNGGILWLETSFALNIREQSIMNVSFCGDKAGATLFPAHIYTDNNGELMTLMQREIADDNRHLRSMEAFINHVQGKP
VMIADAEQGYIIQQLVAALYQSAETGTRVEL
>P76046 ~~~ycjX~~~Uncharacterized protein YcjX~~~COG3106
MKRLKNELNALVNRGVDRHLRLAVTGLSRSGKTAFITAMVNQLLNIHAGARLPLLSAVREERLLGVKRIPQRDFGIPRFT
YDEGLAQLYGDPPAWPTPTRGVSEIRLALRFKSNDSLLRHFKDTSTLYLEIVDYPGEWLLDLPMLAQDYLSWSRQMTGLL
NGQRGEWSAKWRMMSEGLDPLAPADENRLADIAAAWTDYLHHCKEQGLHFIQPGRFVLPGDMAGAPALQFFPWPDVDTWG
ESKLAQADKHTNAGMLRERFNYYCEKVVKGFYKNHFLRFDRQIVLVDCLQPLNSGPQAFNDMRLALTQLMQSFHYGQRTL
FRRLFSPVIDKLLFAATKADHVTIDQHANMVSLLQQLIQDAWQNAAFEGISMDCLGLASVQATTSGIIDVNGEKIPALRG
NRLSDGAPLTVYPGEVPARLPGQAFWDKQGFQFEAFRPQVMDVDKPLPHIRLDAALEFLIGDKLR
>P76049 ~~~ycjY~~~Uncharacterized protein YcjY~~~COG1073
MMNNKVSFTNSNNPTISLSAVIYFPPKFDETRQYQAIVLSHPGGGVKEQTAGTYAKKLAEKGFVTIAYDASYQGESGGEP
RQLENPYIRTEDISAVIDYLTTLSYVDNTRIGAMGICAGAGYTANAAIQDRRIKAIGTVSAVNIGSIFRNGWENNVKSID
ALPYVEAGSNARTSDISSGEYAIMPLAPMKESDAPNEELRQAWEYYHTPRAQYPTAPGYATLRSLNQIITYDAYHMAEVY
LTQPTQIVAGSQAGSKWMSDDLYDRASSQDKRYHIVEGANHMDLYDGKAYVAEAISVLAPFFEETL
>P42400 ~~~yckB~~~Probable ABC transporter extracellular-binding protein YckB~~~COG0834
MKSFMHSKAVIFSFTMAFFLILAACSGKNEADSKDTGWEQIKDKGKIVVATSGTLYPTSYHDTDSGSDKLTGYEVEVVRE
AAKRLGLKVEFKEMGIDGMLTAVNSGQVDAAANDIDVTKDREEKFAFSTPYKYSYGTAIVRKDDLSGIKTLKDLKGKKAA
GAATTVYMEVARKYGAKEVIYDNATNEQYLKDVANGRTDVILNDYYLQTLALAAFPDLNITIHPDIKYMPNKQALVMKKS
NAALQKKMNEALKEMSKDGSLTKLSKQFFNKADVSKKIDADVQDVDL
>P94405 4.1.1.61~~~bsdC~~~Phenolic acid decarboxylase~~~COG0043
MAYQDFREFLAALEKEGQLLTVNEEVKPEPDLGASARAASNLGDKSPALLFNNIYGYHNARIAMNVIGSWPNHAMMLGMP
KDTPVKEQFFEFAKRYDQFPMPVKREETAPFHENEITEDINLFDILPLFRINQGDGGYYLDKACVISRDLEDPDNFGKQN
VGIYRMQVKGKDRLGIQPVPQHDIAIHLRQAEERGINLPVTIALGCEPVITTAASTPLLYDQSEYEMAGAIQGEPYRIVK
SKLSDLDVPWGAEVVLEGEIIAGEREYEGPFGEFTGHYSGGRSMPIIKIKRVYHRNNPIFEHLYLGMPWTECDYMIGINT
CVPLYQQLKEAYPNEIVAVNAMYTHGLIAIVSTKTRYGGFAKAVGMRALTTPHGLGYCKMVIVVDEDVDPFNLPQVMWAL
STKMHPKHDAVIIPDLSVLPLDPGSNPSGITHKMILDATTPVAPETRGHYSQPLDSPLTTKEWEQKLMDLMNK
>Q7DBA7 4.1.1.61~~~edcC~~~Phenolic acid decarboxylase~~~COG0043
MAFDDLRSFLQALDDHGQLLKISEEVNAEPDLAAAANATGRIGDGAPALWFDNIRGFTDARVAMNTIGSWQNHAISLGLP
PNTPVKKQIDEFIRRWDNFPIAPERRANPAWAQNTVDGDEINLFDILPLFRLNDGDGGFYLDKACVVSRDPLDPDNFGKQ
NVGIYRMEVKGKRKLGLQPVPMHDIALHLHKAEERGEDLPIAITLGNDPIITLMGATPLKYDQSEYEMAGALRESPYPIA
TAPLTGFDVPWGSEVILEGVIESRKREIEGPFGEFTGHYSGGRNMTVVRIDKVSYRTRPIFESLYLGMPWTEIDYLMGPA
TCVPLYQQLKAEFPEVQAVNAMYTHGLLAIISTKKRYGGFARAVGLRAMTTPHGLGYVKMVIMVDEDVDPFNLPQVMWAL
SSKVNPAGDLVQLPNMSVLELDPGSSPAGITDKLIIDATTPVAPDNRGHYSQPVVDLPETKAWAEKLTAMLAARK
>B3EWP1 ~~~~~~UPF0065 protein in clcB-clcD intergenic region~~~
MVSRVLAQRVSQTLGQPVTVENRPGSGGIIGSAEVARSPADGYTLLVNSTVLAVDKWFYPNVAYDARKAFAPVALLATIP
SVLVVPADSPYKDARSLLAYAKANPGKVSFASAGMGTSIHLAAALLAAQAGVDLLHVPYKGSTPAAADLIAGRVSMMVDS
ITAQQSFIKSGRVRALGVTSLQPAPSLPGIPPLAQAADLPKFEVLTWFGLFVPSRTSPDIVKVLNTAMNEALKTPEVQKA
LADIGASAQGGTSQALGVLWDNEIDRWGQLIVHHRLNTNEL
>Q9S4M7 4.1.1.63~~~shdC~~~Phenolic acid decarboxylase~~~
MAKVYKDLREFLEVLEQEGQLIRVKEEVNPEPDIAAAGRAAANLGKNQPAVFFEKIKGYKYSVVTNVHGSWQNHALMLGL
DKNTSTKDQFYELNRRWDKFPVPPNVVKREAAPCKENVIDKDINLFEILPLYRINEQDGGFYISKASVVTADPEYPDDFN
KLNVGTYRIQVKDRDRVGIQALAMHDIAVQLEKAEAENKPLPIAITIGNNPLVTFMASTPVGYNQNEYEFVGALQDGVPM
DIVKSDLYDHLYVPAGSEVVLEGHIIPRVRTVEGPFGEFPGSYSGARLQCEVKIDRITHRTNPIFENLYLGIPWTEIDYL
MALNTSVPLYKQLKETMPEVVAVNAMYTHGIGVIISTKVRYGGYAKGVAFRLLSTPHGMPYSKIVIVVDEFVDPFNLEQV
MWALTTRVHPGKDVSIIENCPGMPLDPSTNPPGMHTKMIIDATTPVPPEPNPRETQLLDPPDGTEEWEEKLKELLKNQNR
>Q9X697 4.1.1.-~~~vdcC~~~Phenolic acid decarboxylase~~~
MAYDDLRSFLDTLEKEGQLLRITDEVLPEPDLAAAANATGRIGENAPALHFDNVKGFTDARIAMNVHGSWANHALALGLP
KNTPVKEQVEEFARRWDAFPVAPERREEAPWRENTQEGEDVDLFSVLPLFRLNDGDGGFYLDKAAVVSRDPEDRDDFGKQ
NVGTYRIQVIGTNRLAFHPAMHDVAQHLRKAEEKGEDLPIAITLGNDPVMAIVAGMPMAYDQSEYEMAGALRGAPAPIAT
APLTGFDVPWGSEVVIEGVIESRKRRIEGPFGEFTGHYSGGRRMPVIRVERVSYRHEPVFESLYLGMPWNECDYLVGPNT
CVPLLKQLRAEFPEVQAVNAMYTHGLMVIISTAKRYGGFAKAVGMRAMTTPHGLGYVAQVILVDEDVDPFNLPQVMWAMS
AKVNPKDDVVVIPNLSVLELAPAAQPAGISSKMIIDATTPVAPDVRGNFSTPAKDLPETAEWAARLQRLIAARV
>P94418 ~~~yclN~~~Petrobactin import system permease protein YclN~~~COG4606
MKLRYLFILLIILAVTSVFIGVEDLSPLDLFDLSKQEASTLFASRLPRLISIVIAGLSMSICGLIMQQISRNKFVSPTTA
GTMDWARLGILISLLLFTSASPLIKMLVAFVFALAGNFLFMKILERIKFNDTIFIPLVGLMLGNIVSSIATFIAYKYDLI
QNVSSWLQGDFSLVVKGRYELLYLSIPLVIIAYVYADKFTLAGMGESFSVNLGLKYKRVVNIGLIIVSLITSLVILTVGM
LPFLGLIIPNIVSIYRGDNLKSSLPHTVLLGAVFVLFCDILGRIIIFPYEISIGLMVGIIGSGIFLFMLLRRKAYA
>P94419 ~~~yclO~~~Petrobactin import system permease protein YclO~~~COG4605
MRNQMKIALLVGLAIVCIGLFLFYDLGNWDYTLPRRIKKVAAIVLTGGAIAFSTMIFQTITNNRILTPSILGLDSLYMLI
QTGIIFLFGSANMVIMNKNINFIISVLLMILFSLVLYQIMFKGEGRNIFFLLLIGIVFGTLFSSLSSFMQMLIDPNEFQV
VQDKMFASFNNINTDLLWLAFIIFLLTGVYVWRFTKFFDVLSLGREHAVNLGIDYDKVVKQMLIVVAILVSVSTALVGPI
MFLGLLVVNLAREFLKTYKHSYLIAGSVFISIIALVGGQFVVEKVFTFSTTLSVIINFAGGIYFIYLLLKENKSW
>P94420 7.2.2.-~~~yclP~~~Petrobactin import ATP-binding protein YclP~~~COG4604
MVEVRNVSKQYGGKVVLEETSVTIQKGKITSFIGPNGAGKSTLLSIMSRLIKKDSGEIYIDGQEIGACDSKELAKKMSIL
KQANQINIRLTIKDLVSFGRFPYSQGRLTEEDWVHINQALSYMKLEDIQDKYLDQLSGGQCQRAFIAMVIAQDTDYIFLD
EPLNNLDMKHSVEIMKLLKRLVEELGKTIVIVIHDINFASVYSDYIVALKNGRIVKEGPPEEMIETSVLEEIYDMTIPIQ
TIDNQRIGVYFS
>P94421 ~~~yclQ~~~Petrobactin-binding protein YclQ~~~COG4607
MKKFALLFIALVTAVVISACGNQSTSSKGSDTKKEQITVKHQLDKNGTKVPKNPKKVVVFDFGSLDTLDKLGLDDIVAGL
PKQVLPKYLSKFKDDKYADVGSLKEPDFDKVAELDPDLIIISARQSESYKEFSKIAPTIYLGVDTAKYMESFKSDAETIG
KIFDKEDKVKDELANIDHSIADVKKTAEKLNKNGLVIMANDGKISAFGPKSRYGLIHDVFGVAPADQNIKASTHGQSVSY
EYISKTNPDYLFVIDRGTAIGETSSTKQVVENDYVKNVNAVKNGHVIYLDSATWYLSGGGLESMTQMIKEVKDGLEK
>P94425 1.-.-.-~~~ycnE~~~Putative monooxygenase YcnE~~~COG1359
MIVLQAYIKVKPEKREEFLSEAQSLVQHSRAEEGNAQYDLFEKVGEENTFVMLEKWKDEAAMKFHNETAHFQGFVAKGKE
LLSAPLDVVRTELSE
>P94431 ~~~ycnI~~~Uncharacterized protein YcnI~~~COG4549
MLKKIALTLCPAIVGSLLFFTAPASAHVSVKPAESAAGSWETYTMKVPSEKNLPTTKVVLKMPKDVEFQQYEPIPGWKVS
TQKHDDKSVSVTWEATDGGIQEGQFQQFTFVAKNPDKAEEAAWDAYQYYKDGSIVEWTGDEDADTPHSITNITSAKQVTD
EHGATKTEDDSENSGSSALDITAMVLSAAAIILSVAALVKKKRA
>C0SP95 ~~~ycnJ~~~Copper transport protein YcnJ~~~COG1276
MKRNRWWIILLLFLVFLPKTSFAHAYIVKSSPGENSELKSAPAQVEIEFNEPVEEGFHYIKVYNSNGDRVDTDKTEIKKD
NHHIMTVKLKKNLPKDVYRAEWNAVSADGHPVSGVIPFSIGKADGGFSSQKAADSALNPGTAADRAILYTALSLFIGTVF
FHLFWYKGKSEQLVKRTRRILTGSIAALGLALLLQLPIQTKANAGGGWGSAFQPGYIRETLFETAGGSIWIIQAALFVLL
ALSVIPAIRKNRFSSFGYWTAPLIFFFGLLLAKAFTGHAAVVEEKTVGILMDFLHLTSASIWVGGIAALVLLLSKEWRQP
DKTLAWETVRRFSPWALTAVGVILFSGLLNGFFIIRSMDSLFHTAYGQALLVKSGLFVFMLVLGAIHFLLTRKQRRTGIS
RTLKAEWAIGIAVLITAAVFTSLPSPPEPAPEPFYQTKAIENGQSVSLSISPNQPGKNVFELRVTDHNGDPVKNIQQITL
TVYKTGLSGSENKSTFTLKEKTKGVFQDQNLSINEKGNWKIKVHGLTGDFNEINIMFTKTN
>P94433 ~~~ycnK~~~HTH-type transcriptional repressor YcnK~~~COG1349
MLPINRQQHILKWLKEEGSLRISDISARFGVSEMTVYRDVNQLVQSNQVIKTAGGITLPVRTPQTDHMCSYCLKPVNQAH
SVQLITVNQDIEQLCCAHCAFLRYADKTEEVSHLICRDFLLQTTVSAGSAYFVVNAELNLHCCQPQAIPFATLDHAERFQ
KGFGGAVCTFDQALEDMLQDRKKRCTCTKK
>P94434 ~~~ycnL~~~Uncharacterized protein YcnL~~~
MKETPCPNCGKPLTGDMVRSSNVPCQFRCGHCRERLYEYKVSAPIMLVSLAAIVLLIYLLMLLRNAAGSVLPAVQHVPMA
VFALVCAYPVFIVSERMIAKYVIQNGNIIYRGKRKGS
>P29300 ~~~~~~Uncharacterized phycocyanin operon protein Z~~~
MSIETFFQQLKHPNPNVRNQGMWGIADNYDAEVINRLMALLDEEDTTYRRAAVKTLGAIGHASVTPLVAALLNSDNMTVR
SSAAKALAQVVICHPDEPLSEEGVQGLKAALQDPNPVVNIASVMAMGEIGAPVVHLLIEALQTTENPALAVSLVNAIAST
GDSRGIDVLQAIINDEAADSYVRETATSAISRLEMVAGFKRN
>P21732 ~~~~~~Uncharacterized 20 kDa protein in cryB1 5'region~~~
MKVEGGESMHESEEGRDVPNGITKHKHHIPFQCIVSLPSGFQIEKPNDLKLVYDVSHLSMTKDTCKKRIEIDECGQVEID
LQVLKIKGVLSFIGNFSIEPIVCENMYTTVDRNPSISLSFQDTVYVDHILKYSVQQLPHYVIDGDHIQVRELQIKLMKEN
PQSAQISGLFCFVYE
>P21733 ~~~~~~Uncharacterized 29.1 kDa protein in cryB1 5'region~~~
MLKYHFPNVCEDELINIYSYGDFKGQGKYICLFKIENQSFLFWRNDKGNKIYTNLESISVEIINTNNTYNQSQNVCPQDL
VDTYNQSQNVCPQDLVDTYNQSQNVCPQDLVDTYNQSQNVCPQDLVDTYNQSQNVCPQDLVDTYNQSQNVCPQDLVDTYN
QSQNVYTQDLIDTYNQSQNVCPQDLVDTYNQSQNVCPQDLVDTYNQSQNVCPQDLVDTYNQSQNVCPQDLNVYTQDLIDT
YNQSQNCDCGCK
>P42962 3.1.3.104~~~ycsE~~~5-amino-6-(5-phospho-D-ribitylamino)uracil phosphatase YcsE~~~COG0561
MSVQREDVDIKLIAIDMDGTLLNDEQLISDENRKAIREAEDKGVYVVISTGRTLMTCRELAESLKLSSFLITANGSEIWD
SNFNLVERKLLHTDHIQMMWDLRNKHNTNFWASTVNKVWRGEFPENITDHEWLKFGFDIEDDDIRNEVLEELRKNKELEI
TNSSPTNIEVNALGINKAAALAKVTEKLGFTMENVMAMGDSLNDIAMIKEAGLGVAMGNAQDIVKETADYITDTNIEDGV
AKAIRHWVL
>P96579 2.3.1.-~~~ydaF~~~Putative ribosomal N-acetyltransferase YdaF~~~COG1670
MFTCKVNEHITIRLLEPKDAERLAELIIQNQQRLGKWLFFAENPSSADTYRETIIPDWRRQYADLNGIEAGLLYDGSLCG
MISLHNLDQVNRKAEIGYWIAKEFEGKGIITAACRKLITYAFEELELNRVAICAAVGNEKSRAVPERIGFLEEGKARDGL
YVNGMHHDLVYYSLLKREWEGEK
>P76061 ~~~ydaG~~~Uncharacterized protein YdaG~~~
MVHYEVVQYLMDCCGITYNQAVQALRSNDWDLWQAEVAIRSNKM
>P96591 ~~~ydaP~~~Putative thiamine pyrophosphate-containing protein YdaP~~~COG0028
MAHKTAGQAMTELLEQWGVDHVYGIPGDSINEFIEELRHERNQLKFIQTRHEEVAALAAAAEAKLTGKIGVCLSIAGPGA
VHLLNGLYDAKADGAPVLAIAGQVSSGEVGRDYFQEIKLEQMFEDVAVFNREVHSAESLPDLLNQAIRTAYSKKGVAVLS
VSDDLFAEKIKREPVYTSPVYIEGNLEPKKEQLVTCAQYINNAKKPIILAGQGMKKAKRELLEFADKAAAPIVVTLPAKG
VVPDKHPHFLGNLGQIGTKPAYEAMEECDLLIMLGTSFPYRDYLPDDTPAIQLDSDPAKIGKRYPVTAGLVCDSALGLRE
LTEYIERKEDRRFLNACTEHMQHWWNEIEKDETEATTPLKPQQVVARLQEAAADDAVLSVDVGTVTVWMARHFKMNANQD
FIVSSWLATMGCGLPGAIAASLSEPERQAIAVCGDGGFSMVMQDLPTAVKYKLPITVVILNNENLGMIEYEQQVKGNIDY
VTKLQNVDYAAFAESCGAKGIKVTKAEELAPAFHEALHSDQPVVVDVMIGNEPPLPGKITYGQAKGFSKYMLKNFFENQK
FEMPSLKKSLKRLF
>P76066 ~~~ydaW~~~Protein YdaW~~~COG1846
METVFDALKAMGKATSIELAARLDISREEVLNELWELKKAGFVDKSAYTWRVADNNVQQEQPAQAELPEEITTATVAKIS
ECDLTATIEQRGPQTADELATLFGTTSRKVASTLAMAISKGRLIRVNQGGKFRYCIPGDNLPAEPKAASVSPLWLSASSS
ACHGVLIITVITPSPTKNSATKMPEN
>P76069 ~~~ydaY~~~Protein YdaY~~~
MSRSSDNDQYRSRNALIRRHIEKMDASLHVGTKEFDISKVSEVDSVDDLLIDNAARYLLKDWKGVGELVNGVEVALEYTA
ERGIALLKQNPELYWQILAEAASIAQGKEQQKQDTIKKP
>P96608 1.3.99.-~~~ydbM~~~Putative acyl-CoA dehydrogenase YdbM~~~COG1960
MSLFIQNDQQRQWMEKIGRIADEFQQTAAEDDEQGRFPAEKIQKLRDAGYTALTLPASHGGGGISVYDMLLFQERLARGD
APTALSIGWHLSVIGELGEGNSWDEDVFAFVAKEVQNGAVINRAATEAKTGSPTRGGRPGTHAVKKDGKWAVNGRKTFTT
MSQALDYFLVTAWIEDKQTTGVFLIHKDDPGLSIEETWDMMAMRATGSHDLVLNEVMLDENKLVELLQGPRGAKPNGWLL
HIPAIYLGVAQAARDYAVQFASEYSPNSLNGPIKNVPAVQQRTGEMELELLNARHFLFHIAQLYDDPVRRPHLTSELGAA
KHIVTNAALSVVDKAMRIVGAKSLERTNPLQRYYRDVRAGLHNPPMDDAVIHKLAAEAFES
>P96611 ~~~ydbP~~~Thioredoxin-like protein YdbP~~~COG0526
MKKITTNEQFNELIQSDKEIIVKFYADWCPDCTRMNMFIGDILEEYNQNDWYELNKDELPDLAEKYQVMGIPSLLIFKNG
EKTAHLHSANAKTPEEVTEFLSEHIS
>P96619 ~~~ydcC~~~Sporulation protein YdcC~~~COG2834
MKKVRKSFVLLLTGLLAVLILSACGQKTQQDIVAGLDEKAKEYTSYKAKAKMTIETGSEPQVYNVEIWHKKPSLYRVYLE
NPKKDQNQVILRNENGVFVLTPSLNKSFRFQSDWPNNSSQVYLFESLVKDVQNDSDAVFTAKEKKYVFETKTNYQHNKML
PTQEITFNKKDMSPSSVKVMDTDRKVMVKVEFSSFEFNKQFDKESFDEKKNMTLSQMDVATSAKPSDTFAVKTPLELPLG
VKLLEEKDISTEDGKRIIMTYGGEKSFTLIQEKAQIAKASSSVTLNGEPVNLGYTIGALSDASLSWTYDGVDYLLSSKDL
SKEEMVTVAKSMQGQSSK
>P34209 ~~~ydcF~~~Protein YdcF~~~COG1434
MNITPFPTLSPATIDAINVIGQWLAQDDFSGEVPYQADCVILAGNAVMPTIDAACKIARDQQIPLLISGGIGHSTTFLYS
AIAQHPHYNTIRTTGRAEATILADIAHQFWHIPHEKIWIEDQSTNCGENARFSIALLNQAVERVHTAIVVQDPTMQRRTM
ATFRRMTGDNPDAPRWLSYPGFVPQLGNNADSVIFINQLQGLWPVERYLSLLTGELPRLRDDSDGYGPRGRDFIVHVDFP
AEVIHAWQTLKHDAVLIEAMESRSLR
>P76103 ~~~ydcO~~~Inner membrane protein YdcO~~~COG3135
MRLFSIPPPTLLAGFLAVLIGYASSAAIIWQAAIVAGATTAQISGWMTALGLAMGVSTLTLTLWYRVPVLTAWSTPGAAL
LVTGLQGLTLNEAIGVFIVTNALIVLCGITGLFARLMRIIPHSLAAAMLAGILLRFGLQAFASLDGQFTLCGSMLLVWLA
TKAVAPRYAVIAAMIIGIVIVIAQGDVVTTDVVFKPVLPTYITPDFSFAHSLSVALPLFLVTMASQNAPGIAAMKAAGYS
APVSPLIVFTGLLALVFSPFGVYSVGIAAITAAICQSPEAHPDKDQRWLAAAVAGIFYLLAGLFGSAITGMMAALPVSWI
QMLAGLALLSTIGGSLYQALHNERERDAAVVAFLVTASGLTLVGIGSAFWGLIAGGVCYVVLNLIADRNRY
>P76108 2.3.1.-~~~ydcS~~~Bifunctional polyhydroxybutyrate synthase / ABC transporter periplasmic binding protein~~~COG0687
MSKTFARSSLCALSMTIMTAHAAEPPTNLDKPEGRLDIIAWPGYIERGQTDKQYDWVTQFEKETGCAVNVKTAATSDEMV
SLMTKGGYDLVTASGDASLRLIMGKRVQPINTALIPNWKTLDPRVVKGDWFNVGGKVYGTPYQWGPNLLMYNTKTFPTPP
DSWQVVFVEQNLPDGKSNKGRVQAYDGPIYIADAALFVKATQPQLGISDPYQLTEEQYQAVLKVLRAQHSLIHRYWHDTT
VQMSDFKNEGVVASSAWPYQANALKAEGQPVATVFPKEGVTGWADTTMLHSEAKHPVCAYKWMNWSLTPKVQGDVAAWFG
SLPVVPEGCKASPLLGEKGCETNGFNYFDKIAFWKTPIAEGGKFVPYSRWTQDYIAIMGGR
>P77156 ~~~ydcU~~~Inner membrane ABC transporter permease protein YdcU~~~COG1176
MAMNVLQSPSRPGLGKVSGFFWHNPGLGLFLLLLGPLMWFGIVYFGSLLTLLWQGFYTFDDFTMSVTPELTLANIRALFN
PANYDIILRTLTMAVAVTIASAILAFPMAWYMARYTSGKMKAFFYIAVMLPMWASYIVKAYAWTLLLAKDGVAQWFLQHL
GLEPLLTAFLTLPAVGGNTLSTSGLGRFLVFLYIWLPFMILPVQAALERLPPSLLQASADLGARPRQTFRYVVLPLAIPG
IAAGSIFTFSLTLGDFIVPQLVGPPGYFIGNMVYSQQGAIGNMPMAAAFTLVPIILIALYLAFVKRLGAFDAL
>P0AFR9 ~~~ydcV~~~Inner membrane ABC transporter permease protein YdcV~~~COG1177
MHSERAPFFLKLAAWGGVVFLHFPILIIAAYAFNTEDAAFSFPPQGLTLRWFSVAAQRSDILDAVTLSLKVAALATLIAL
VLGTLAAAALWRRDFFGKNAISLLLLLPIALPGIVTGLALLTAFKTINLEPGFFTIVVGHATFCVVVVFNNVIARFRRTS
WSLVEASMDLGANGWQTFRYVVLPNLSSALLAGGMLAFALSFDEIIVTTFTAGHERTLPLWLLNQLGRPRDVPVTNVVAL
LVMLVTTLPILGAWWLTREGDNGQ
>P64455 ~~~ydcY~~~Uncharacterized protein YdcY~~~
MSHLDEVIARVDAAIEESVIAHMNELLIALSDDAELSREDRYTQQQRLRTAIAHHGRKHKEDMEARHEQLTKGGTIL
>P76111 ~~~ydcZ~~~Inner membrane protein YdcZ~~~COG3238
MNQSLTLAFLIAAGIGLVVQNTLMVRITQTSSTILIAMLLNSLVGIVLFVSILWFKQGMAGFGELVSSVRWWTLIPGLLG
SFFVFASISGYQNVGAATTIAVLVASQLIGGLMLDIFRSHGVPLRALFGPICGAILLVVGAWLVARRSF
>P31826 ~~~yddA~~~Inner membrane ABC transporter ATP-binding protein YddA~~~COG4178
MITIPITLRMLIAKYLCLLKPFWLRKNNKTSVLLIIIILAMILGVVKIQVWLNDWNNDFFNALSQKETDKLWQLVLWFPA
LLGIFVLISVNKTWLIKLLTIRWREWLTDYYLNRWFADKNYYFTQIYGEHKNTDNPDQRIAEDILLLISKTLSLSFGFIQ
SLSMLITFTVILWESAGTLSFTVGGTEWNIQGYMVYTVVLIVIGGTLFTHKVGKRIRPLNVEKQRSEATFRTNLVQHNKQ
AELIALSNAESLQRQELSDNFHTIKENWHRLMNRQRWLDYWQNIYSRSLSVLPYFLLLPQFISGQINLGGLMKSRQAFML
VSNNLSWFIYKYDELAELAAVIDRLYEFHQLTEQRPTNKPKNCQHAVQVADASIRTPDNKIILENLNFHVSPGKWLLLKG
YSGAGKTTLLKTLSHCWPWFKGDISSPADSWYVSQTPLIKTGLLKEIICKALPLPVDDKSLSEVLHQVGLGKLAARIHDH
DRWGDILSSGEKQRIALARLILRRPKWIFLDETTSHLEEQEAIRLLRLVREKLPTSGVIMVTHQPGVWNLADDICDISAV
L
>P31827 ~~~yddB~~~Uncharacterized protein YddB~~~COG4771
MKRVLIPGVILCGADVAQAVDDKNMYMHFFEEMTVYAPVPVPVNGNTHYTSESIERLPTGNGNISDLLRTNPAVRMDSTQ
STSLNQGDIRPEKISIHGASPYQNAYLIDGISATNNLNPANESDASSATNISGMSQGYYLDVSLLDNVTLYDSFVPVEFG
RFNGGVIDAKIKRFNADDSKVKLGYRTTRSDWLTSHIDENNKSAFNQGSSGSTYYSPDFKKNFYTLSFNQELADNFGVTA
GLSRRQSDITRADYVSNDGIVAGRAQYKNVIDTALSKFTWFASDRFTHDLTLKYTGSSRDYNTSTFPQSDREMGNKSYGL
AWDMDTQLAWAKLRTTVGWDHISDYTRHDHDIWYTELSCTYGDITGRCTRGGLGHISQAVDNYTFKTRLDWQKFAVGNVS
HQPYFGAEYIYSDAWTERHNQSESYVINAAGKKTNHTIYHKGKGRLGIDNYTLYMADRISWRNVSLMPGVRYDYDNYLSN
HNISPRFMTEWDIFANQTSMITAGYNRYYGGNILDMGLRDIRNSWTESVSGNKTLTRYQDLKTPYNDELAMGLQQKIGKN
VIARANYVYREAHDQISKSSRTDSATKTTITEYNNDGKTKTHSFSLSFELAEPLHIRQVDINPQIVFSYIKSKGNLSLNN
GYEESNTGDNQVVYNGNLVSYDSVPVADFNNPLKISLNMDFTHQPSGLVWANTLAWQEARKARIILGKTNAQYISEYSDY
KQYVDEKLDSSLTWDTRLSWTPQFLQQQNLTISADILNVLDSKTAVDTTNTGVATYASGRTFWLDVSMKF
>P37757 5.1.-.-~~~yddE~~~Uncharacterized isomerase YddE~~~COG0384
MKPQVYHVDAFTSQPFRGNSAGVVFPADNLSEAQMQLIARELGHSETAFLLHSDDSDVRIRYFTPTVEVPICGHATVAAH
YVRAKVLGLGNCTIWQTSLAGKHRVTIEKHNDDYRISLEQGTPGFEPPLEGETRAAIINALHLTEDDILPGLPIQVATTG
HSKVMIPLKPEVDIDALSPDLNALTAISKKIGCNGFFPFQIRPGKNETDGRMFSPAIGIVEDPVTGNANGPMGAWLVHHN
VLPHDGNVLRVKGHQGRALGRDGMIEVTVTIRDNQPEKVTISGTAVILFHAEWAIEL
>P46136 ~~~yddG~~~Aromatic amino acid exporter YddG~~~COG0697
MTRQKATLIGLIAIVLWSTMVGLIRGVSEGLGPVGGAAAIYSLSGLLLIFTVGFPRIRQIPKGYLLAGSLLFVSYEICLA
LSLGYAATHHQAIEVGMVNYLWPSLTILFAILFNGQKTNWLIVPGLLLALVGVCWVLGGDNGLHYDEIINNITTSPLSYF
LAFIGAFIWAAYCTVTNKYARGFNGITVFVLLTGASLWVYYFLTPQPEMIFSTPVMIKLISAAFTLGFAYAAWNVGILHG
NVTIMAVGSYFTPVLSSALAAVLLSAPLSFSFWQGALMVCGGSLLCWLATRRG
>D0ZXP9 ~~~yddG~~~Aromatic amino acid exporter YddG~~~
MTSQKATLIGLVAIVLWSTMVGLIRGVSEGLGPVGGAAMIYSLSGLLLIFTVGLPDIRRFPGRYLIAGSVLFVSYEICLA
LSLGYAATRHQAIEVGMVNYLWPSLTILFAILFNGQKTNWLIVPGLLIALTGVCWVLGGENGLNPGEIISNVATSPLSYL
LAFLGAFIWATYCTVTNKYARGFNGITVFVLLTAVALWLHYFLTPQPAMIFSLPVIAKLFTAALTLGFAYAAWNVGILHG
NVTIMAVGSYFTPVMSSALAALLLSSPLSFSFWQGAVMVCVGSLLCWLATRRR
>D7A5Q8 ~~~yddG~~~Aromatic amino acid exporter YddG~~~COG0697
MSRSSATLIGFTAILLWSTLALATSSTGAVPPFLLTALTFTIGGAVGIAAGLARGVGLSVLRQPWPVWVHGIGGLFGYHF
FYFSALKLAPPAEAGLVAYLWPLLIVLFSAFLPGERLRPAHVAGALMGLAGTVVLLGARAGGFGFAPEYVPGYLAAAACA
VIWSVYSVASRRFARVPTEVVAGFCLATAALSALCHILFEPSVWPVGSEWLAVVALGIGPVGIAFYTWDIGMKRGDVRLL
GVLSYAAPVLSTLLLVVAGFAAPSGALAIACALIVGGAAVATLLARR
>P67700 ~~~yddM~~~Uncharacterized HTH-type transcriptional regulator YddM~~~COG3093
MKMANHPRPGDIIQESLDELNVSLREFARAMEIAPSTASRLLTGKAALTPEMAIKLSVVIGSSPQMWLNLQNAWSLAEAE
KTVDVSRLRRLVTQ
>P96658 3.2.-.-~~~ydeA~~~Uncharacterized protease YdeA~~~COG0693
MKKALFLILDQYADWEGVYLASALNQREDWSVHTVSLDPIVSSIGGFKTSVDYIIGLEPANFNLLVMIGGDSWSNDNKKL
LHFVKTAFQKNIPIAAICGAVDFLAKNGLLNNHSHTGNFVYLWKDYKQYKPISSFVEKQAVRDKNLVTANGTAPIEFTNL
ILEMIDFDTPENIEKMMYMNRYGFYHFCDKYGNPFVD
>P96671 3.-.-.-~~~ydeN~~~Putative hydrolase YdeN~~~COG3545
MTKQVYIIHGYRASSTNHWFPWLKKRLLADGVQADILNMPNPLQPRLEDWLDTLSLYQHTLHENTYLVAHSLGCPAILRF
LEHLQLRKQLGGIILVSGFAKSLPTLQMLDEFTQGSFDHQKIIESAKHRAVIASKDDQIVPFSFSKDLAQQIDAALYEVQ
HGGHFLEDEGFTSLPIVYDVLTSYFSKETR
>P76135 ~~~ydeO~~~HTH-type transcriptional regulator YdeO~~~COG2207
MSLVCSVIFIHHAFNANILDKDYAFSDGEILMVDNAVRTHFEPYERHFKEIGFTENTIKKYLQCTNIQTVTVPVPAKFLR
ASNVPTGLLNEMIAYLNSEERNHHNFSELLLFSCLSIFAACKGFITLLTNGVLSVSGKVRNIVNMKPAHPWKLKDICDCL
YISESLLKKKLKQEQTTFSQILLDARMQHAKNLIRVEGSVNKIAEQCGYASTSYFIYAFRKHFGNSPKRVSKEYRCQSHT
GMNTGNTMNALAI
>P77561 ~~~ydeP~~~Protein YdeP~~~COG0243
MKKKIESYQGAAGGWGAVKSVANAVRKQMDIRQDVIAMFDMNKPEGFDCPGCAWPDPKHSASFDICENGAKAIAWEVTDK
QVNASFFAENTVQSLLTWGDHELEAAGRLTQPLKYDAVSDCYKPLSWQQAFDEIGARLQSYSDPNQVEFYTSGRTSNEAA
FLYQLFAREYGSNNFPDCSNMCHEPTSVGLAASIGVGKGTVLLEDFEKCDLVICIGHNPGTNHPRMLTSLRALVKRGAKM
IAINPLQERGLERFTAPQNPFEMLTNSETQLASAYYNVRIGGDMALLKGMMRLLIERDDAASAAGRPSLLDDEFIQTHTV
GFDELRRDVLNSEWKDIERISGLSQTQIAELADAYAAAERTIICYGMGITQHEHGTQNVQQLVNLLLMKGNIGKPGAGIC
PLRGHSNVQGDRTVGITEKPSAEFLARLGERYGFTPPHAPGHAAIASMQAICTGQARALICMGGNFALAMPDREASAVPL
TQLDLAVHVATKLNRSHLLTARHSYILPVLGRSEIDMQKNGAQAVTVEDSMSMIHASRGVLKPAGVMLKSECAVVAGIAQ
AALPQSVVAWEYLVEDYDRIRNDIEAVLPEFADYNQRIRHPGGFHLINAAAERRWMTPSGKANFITSKGLLEDPSSAFNS
KLVMATVRSHDQYNTTIYGMDDRYRGVFGQRDVVFMSAKQAKICRVKNGERVNLIALTPDGKRSSRRMDRLKVVIYPMAD
RSLVTYFPESNHMLTLDNHDPLSGIPGYKSIPVELEPSN
>P77588 ~~~ydeQ~~~Uncharacterized fimbrial-like protein YdeQ~~~COG3539
MGKTISIKVLFGIYLLLMAGKVFAFSCNVDGGSSIGAGTTSVYVNLDPVIQPGQNLVVDLSQHISCWNDYGGWYDTDHIN
LVQGSAFAGSLQSYKGSLYWNNVTYPFPLTTNTNVLDIGDKTPMPLPLKLYITPVGAAGGVVIKAGEVIARIHMYKIATL
GSGNPRNFTWNIISNNNVVMPTGGCTVDSRNVTVDLPDFPGSAEIPLGVYCSSEQKLSFYLSGATTDSSRQVFANTAPDA
TKASGVGVTLMRNGKILATGENVSLGTVNKSKVPLGLSATYGQTGNKVSAGTVQSVIGVTFIYE
>P29009 ~~~ydfB~~~Uncharacterized protein YdfB~~~
MDFDTIMEKAYEEYFEGLAEGEEALSFSEFKQALSSSAKSNG
>P39831 1.1.1.381~~~ydfG~~~NADP-dependent 3-hydroxy acid dehydrogenase YdfG~~~COG4221
MIVLVTGATAGFGECITRRFIQQGHKVIATGRRQERLQELKDELGDNLYIAQLDVRNRAAIEEMLASLPAEWCNIDILVN
NAGLALGMEPAHKASVEDWETMIDTNNKGLVYMTRAVLPGMVERNHGHIINIGSTAGSWPYAGGNVYGATKAFVRQFSLN
LRTDLHGTAVRVTDIEPGLVGGTEFSNVRFKGDDGKAEKTYQNTVALTPEDVSEAVWWVSTLPAHVNINTLEMMPVTQSY
AGLNVHRQ
>P96687 ~~~ydfJ~~~Membrane protein YdfJ~~~COG2409
MSKMLYTLGGWVARNRIKAICAWIVVLVAAIGLAVTLKPSFSEDMSIPDTPSEKAMDVIQKEFPHGPDKGSIRVIFGAGD
GEKLTGKPAKKAIEDTFKEISKDDSVDSIASPFVTGTIAKDGTVAYADIQYKSSADDIKDYSIKHLKDSLKMADDEGLQT
ELSGDVPGAEMEIGGVSEIVGIILAFVVLAITFGSLLIAGLPILTALIGLGVSIGLVLIGTQVFDIASVSLSLAGMIGLA
VGIDYALFIFTKHRQFLGEGIQKNESIARAVGTAGSAVVFAGLTVIVALCGLTVVNIPFMSAMGLTAGLSVLMAVLASIT
LVPAVLSIAGKRMIPKSNKKIEKQSTETNVWGRFVTKNPIMLSVCSILILIVISIPSMHLELGLPDAGMKAKDNPDRRAY
DLLAEGFGEGFNGQLTIVADATNATENKAEAFADAVKEIKGLDHVASVTPAMPNKEGNFAIITVVPETGPNDVTTKDLVH
DVRSLSDKNGVDLLVTGSTAVNIDISDRLNDAIPVFAVLIVGFAFVLLTIVFRSLLVPLVAVAGFMLTMTATLGICVFVL
QDGNLIDFFKIPEKGPILAFLPILSIGILFGLAMDYQVFLVSRMREEYVKTNNPVQAIQAGLKHSGPVVTAAGLIMIFVF
AGFIFAGEASIKANGLALSFGVLFDAFIVRMTLIPSVMKLMGNAAWYLPKWLDKIIPNVDIEGHQLTKEIQPEIDHEQKK
QISV
>P77228 ~~~ydfJ~~~Putative transporter YdfJ~~~COG0477
MDFQLYSLGAALVFHEIFFPESSTAMALILAMGTYGAGYVARIVGAFIFGKMGDRIGRKKVLFITITMMGICTTLIGVLP
TYAQIGVFAPILLVTLRIIQGLGAGAEISGAGTMLAEYAPKGKRGIISSFVAMGTNCGTLSATAIWAFMFFILSKEELLA
WGWRIPFLASVVVMVFAIWLRMNLKESPVFEKVNDSNQPTAKPAPAGSMFQSKSFWLATGLRFGQAGNSGLIQTFLAGYL
VQTLLFNKAIPTDALMISSILGFMTIPFLGWLSDKIGRRIPYIIMNTSAIVLAWPMLSIIVDKSYAPSTIMVALIVIHNC
AVLGLFALENITMAEMFGCKNRFTRMAISKEIGGLIASGFGPILAGIFCTMTESWYPIAIMIMAYSVIGLISALKMPEVK
DRDLSALEDAAEDQPRVVRAAQPSRSL
>P96689 ~~~ydfK~~~Uncharacterized membrane protein YdfK~~~COG1811
MFGTIFNTVMIIAGSIIGGIFKKGIKDEYQDILMQAMGFAAVALGINAITQHLPDSKYPILFIVSLAIGGLLGQIINLEL
RFNKLVNKFSKSNLAEGLSTAVLLFCIGSLSILGPVEAALHGDYTYLLTNGMLDGITSIVLASTFGFGIAAAALVLFSWQ
GSIYLFAQVMESAINTDLINEITIVGGILILSSGLSILGIKKFKTLNLLPSLLIPPVVIFVIHAFGLRF
>P76154 ~~~ydfK~~~Cold shock protein YdfK~~~
MKSKDTLKWFPAQLPEVRIILGDAVVEVAKQGRPINTRTLLDYIEGNIKKKSWLDNKELLQTAISVLKDNQNLNGKM
>P76156 ~~~ydfO~~~Uncharacterized protein YdfO~~~COG5562
MDQVVIFKQIFDKVRNDLNYQWFYSELKRHNVSHYIYYLATENVHIVLKNDNTVLLKGLKNIVSVKFSKDRHLIETTSNK
LKSREITFQEYRRNLAKAGVFRWVTNIHEQKRYYYTFDNSLLFTESIQKTTQILPR
>P76160 ~~~ydfR~~~Uncharacterized protein YdfR~~~
MTQDYELVVKGVRNFENKVTVTVALQDKERFDGEIFDLDVAMDRVEGAALEFYEAAARRSVRQVFLEVAEKLSEKVESYL
QHQYSFKIENPANKHERPHHKYL
>P64463 ~~~ydfZ~~~Putative selenoprotein YdfZ~~~
MTTYDRNRNAITTGSRVMVSGTGHTGKILSIDTEGLTAEQIRRGKTVVVEGCEEKLAPLDLIRLGMN
>P77804 ~~~ydgA~~~Protein YdgA~~~COG5339
MNKSLVAVGVIVALGVVWTGGAWYTGKKIETHLEDMVAQANAQLKLTAPESNLEVSYQNYHRGVFSSQLQLLVKPIAGKE
NPWIKSGQSVIFNESVDHGPFPLAQLKKLNLIPSMASIQTTLVNNEVSKPLFDMAKGETPFEINSRIGYSGDSSSDISLK
PLNYEQKDEKVAFSGGEFQLNADRDGKAISLSGEAQSGRIDAVNEYNQKVQLTFNNLKTDGSSTLASFGERVGNQKLSLE
KMTISVEGKELALLEGMEISGKSDLVNDGKTINSQLDYSLNSLKVQNQDLGSGKLTLKVGQIDGEAWHQFSQQYNAQTQA
LLAQPEIANNPELYQEKVTEAFFSALPLMLKGDPVITIAPLSWKNSQGESALNLSLFLKDPATTKEAPQTLAQEVDRSVK
SLDAKLTIPVDMATEFMTQVAKLEGYQEDQAKKLAKQQVEGASAMGQMFRLTTLQDNTITTSLQYANGQITLNGQKMSLE
DFVGMFAMPALNVPAVPAIPQQ
>P0ACX0 ~~~ydgC~~~Inner membrane protein YdgC~~~COG3136
MGLVIKAALGALVVLLIGVLAKTKNYYIAGLIPLFPTFALIAHYIVASERGIEALRATIIFSMWSIIPYFVYLVSLWYFT
GMMRLPAAFVGSVACWGISAWVLIICWIKLH
>P76176 3.4.21.-~~~ydgD~~~Uncharacterized serine protease YdgD~~~COG3591
MRTTIAVVLGAISLTSAFVFADKPDVARSANDEVSTLFFGHDDRVPVNDTTQSPWDAVGQLETASGNLCTATLIAPNLAL
TAGHCLLTPPKGKADKAVALRFVSNKGLWRYEIHDIEGRVDPTLGKRLKADGDGWIVPPAAAPWDFGLIVLRNPPSGITP
LPLFEGDKAALTAALKAAGRKVTQAGYPEDHLDTLYSHQNCEVTGWAQTSVMSHQCDTLPGDSGSPLMLHTDDGWQLIGV
QSSAPAAKDRWRADNRAISVTGFRDKLDQLSQK
>P96706 ~~~ydgH~~~Putative membrane protein YdgH~~~COG1033
MRAIIKFKWAIAAIVLALTVVLSLFSPNLTELANQKGQAQLPADAVSERANAILKQAGEDNNSISVVFTLDNAIKKETEN
QLRIIIDKIKKIDGVEEVTSPLSAEKEVKDQLMSKDKKTVLMPVTITGSDKKAEKIADEIYQIVPDDLTAYITGASLINQ
DFAHSSEEGLKKTEVITVCLIIGLLLIVFRSVVTPFIPIVVVGFSYLISQSILGILVYNVDFPISTFTQTFLVAILFGIG
TDYCILLLTRFREELANGHDKKEAALIAYRTGGKTLFISGFAVLIGFSALGFAKFAIFQSAVGVAVGVGILMIILYTLLP
LFMVTLGEKLFWPSKKVLSHSDNKLWAFLGRHSVARPFLFIVITVVITLPFILTYDDQISFDSTAEISSDYKSIKALEAI
KDGFGEGKAFPINVVVKGDKDLTTADTIPYLGNISKAIEKVDHVDSVMTITQPTGKKIKDLYIDNQLGSVSDGLDKTVKG
IADVQSGLTDIENGLNQMAGQTGSASNGGSGGSLGDAADGLGKINQQLQLVSKQISQTGNTAQTVQQLTAISGQLGQIQT
GLEQANQQLSGQQAQAGTLTESLKKLSEGVKSANEGLTKVSDGITASSDILEDMSKSPTVRDTGIFIPDQVMKDKDFKKS
IDQYSFADGKGVQLSVVLDSNPYSEQAITTINQIKKAVANEVDGTPLENAQIVYGGVTSMNADLKELSTTDFSRTMVIMI
IGLFIVLTILFRSMIMPIYMIASLLLTYYTSISITELIFVNGLGNAGVSWAVPFFSFVILIALGVDYSIFLLDRFKEEVH
LGIEQGVVRSMSKMGSVIITAAIILAGTFAAMMPSGVNTLMQVASVIIIGLLLYGLVILPLFIPAIIATFGEGNWWPFGR
KKGKE
>P76177 ~~~ydgH~~~Protein YdgH~~~
MKLKNTLLASALLSAMAFSVNAATELTPEQAAAVKPFDRVVVTGRFNAIGEAVKAVSRRADKEGAASFYVVDTSDFGNSG
NWRVVADLYKADAEKAEETSNRVINGVVELPKDQAVLIEPFDTVTVQGFYRSQPEVNDAITKAAKAKGAYSFYIVRQIDA
NQGGNQRITAFIYKKDAKKRIVQSPDVIPADSEAGRAALAAGGEAAKKVEIPGVATTASPSSEVGRFFETQSSKGGRYTV
TLPDGTKVEELNKATAAMMVPFDSIKFSGNYGNMTEVSYQVAKRAAKKGAKYYHITRQWQERGNNLTVSADLYK
>P77376 1.-.-.-~~~ydgJ~~~Uncharacterized oxidoreductase YdgJ~~~COG0673
MSDNIRVGLIGYGYASKTFHAPLIAGTPGQELAVISSSDETKVKADWPTVTVVSEPKHLFNDPNIDLIVIPTPNDTHFPL
AKAALEAGKHVVVDKPFTVTLSQARELDALAKSLGRVLSVFHNRRWDSDFLTLKGLLAEGVLGEVAYFESHFDRFRPQVR
DRWREQGGPGSGIWYDLAPHLLDQAITLFGLPVSMTVDLAQLRPGAQSTDYFHAILSYPQRRVILHGTMLAAAESARYIV
HGSRGSYVKYGLDPQEERLKNGERLPQEDWGYDMRDGVLTRVEGEERVEETLLTVPGNYPAYYAAIRDALNGDGENPVPA
SQAIQVMELIELGIESAKHRATLCLA
>P76180 ~~~ydgK~~~Inner membrane protein YdgK~~~
MTTTTPQRIGGWLLGPLAWLLVALLSTTLALLLYTAALSSPQTFQTLGGQALTTQILWGVSFITAIALWYYTLWLTIAFF
KRRRCVPKHYIIWLLISVLLAVKAFAFSPVEDGIAVRQLLFTLLATALIVPYFKRSSRVKATFVNP
>Q7CQK5 ~~~ydgT~~~Transcription modulator YdgT~~~
MTVQDYLLKFRKISSLESLEKLFDHLNYTLTDDMDIVNMYRAADHRRAELVSGGRLFDVGQVPQSVWRYVQ
>A5A617 ~~~ydgU~~~Uncharacterized protein YdgU~~~
MVGRYRFEFILIILILCALITARFYLS
>P0DSF5 ~~~ydgV~~~Protein YdgV~~~
MDNLFRTLFSTFTHLRTSSTILLVGEQHWRNAL
>P37597 ~~~ydhC~~~Inner membrane transport protein YdhC~~~COG2814
MQPGKRFLVWLAGLSVLGFLATDMYLPAFAAIQADLQTPASAVSASLSLFLAGFAAAQLLWGPLSDRYGRKPVLLIGLTI
FALGSLGMLWVENAATLLVLRFVQAVGVCAAAVIWQALVTDYYPSQKVNRIFAAIMPLVGLSPALAPLLGSWLLVHFSWQ
AIFATLFAITVVLILPIFWLKPTTKARNNSQDGLTFTDLLRSKTYRGNVLIYAACSASFFAWLTGSPFILSEMGYSPAVI
GLSYVPQTIAFLIGGYGCRAALQKWQGKQLLPWLLVLFAVSVIATWAAGFISHVSLVEILIPFCVMAIANGAIYPIVVAQ
ALRPFPHATGRAAALQNTLQLGLCFLASLVVSWLISISTPLLTTTSVMLSTVVLVALGYMMQRCEEVGCQNHGNAEVAHS
ESH
>O05495 3.2.-.-~~~ydhD~~~Putative sporulation-specific glycosylase YdhD~~~COG1388
MFIHIVGPGDSLFSIGRRYGASVDQIRGVNGLDETNIVPGQALLIPLYVYTVQPRDTLTAIAAKAFVPLERLRAANPGIS
PNALQAGAKITIPSISNYIAGTLSFYVLRNPDLDRELINDYAPYSSSISIFEYHIAPNGDIANQLNDAAAIETTWQRRVT
PLATITNLTSGGFSTEIVHQVLNNPTARTNLVNNIYDLVSTRGYGGVTIDFEQVSAADRDLFTGFLRQLRDRLQAGGYVL
TIAVPAKTSDNIPWLRGYDYGGIGAVVNYMFIMAYDWHHAGSEPGPVAPITEIRRTIEFTIAQVPSRKIIIGVPLYGYDW
IIPYQPGTVASAISNQNAIERAMRYQAPIQYSAEYQSPFFRYSDQQGRTHEVWFEDVRSMSRKMQIVREYRLQAIGAWQL
TLGFTPGPWLLRKFFTIRKV
>P76187 1.-.-.-~~~ydhF~~~Oxidoreductase YdhF~~~COG4989
MVQRITIAPQGPEFSRFVMGYWRLMDWNMSARQLVSFIEEHLDLGVTTVDHADIYGGYQCEAAFGEALKLAPHLRERMEI
VSKCGIATTAREENVIGHYITDRDHIIKSAEQSLINLATDHLDLLLIHRPDPLMDADEVADAFKHLHQSGKVRHFGVSNF
TPAQFALLQSRLPFTLATNQVEISPVHQPLLLDGTLDQLQQLRVRPMAWSCLGGGRLFNDDYFQPLRDELAVVAEELNAG
SIEQVVYAWVLRLPSQPLPIIGSGKIERVRAAVEAETLKMTRQQWFRIRKAALGYDVP
>O05503 ~~~ydhK~~~Uncharacterized protein YdhK~~~COG1388
MSAGKSYRKKMKQRRMNMKISKYALGILMLSLVFVLSACGNNNSTKESTHDNHSDSSTHEEMDHSGSADVPEGLQESKNP
KYKVGSQVIINTSHMKGMKGAEATVTGAYDTTAYVVSYTPTNGGQRVDHHKWVIQEEIKDAGDKTLQPGDQVILEASHMK
GMKGATAEIDSAEKTTVYMVDYTSTTSGEKVKNHKWVTEDELSAK
>P77389 ~~~ydhP~~~Inner membrane transport protein YdhP~~~COG2814
MKINYPLLALAIGAFGIGTTEFSPMGLLPVIARGVDVSIPAAGMLISAYAVGVMVGAPLMTLLLSHRARRSALIFLMAIF
TLGNVLSAIAPDYMTLMLSRILTSLNHGAFFGLGSVVAASVVPKHKQASAVATMFMGLTLANIGGVPAATWLGETIGWRM
SFLATAGLGVISMVSLFFSLPKGGAGARPEVKKELAVLMRPQVLSALLTTVLGAGAMFTLYTYISPVLQSITHATPVFVT
AMLVLIGVGFSIGNYLGGKLADRSVNGTLKGFLLLLMVIMLAIPFLARNEFGAAISMVVWGAATFAVVPPLQMRVMRVAS
EAPGLSSSVNIGAFNLGNALGAAAGGAVISAGLGYSFVPVMGAIVAGLALLLVFMSARKQPETVCVANS
>P0ACX3 1.-.-.-~~~ydhR~~~Putative monooxygenase YdhR~~~
MATLLQLHFAFNGPFGDAMAEQLKPLAESINQEPGFLWKVWTESEKNHEAGGIYLFTDEKSALAYLEKHTARLKNLGVEE
VVAKVFDVNEPLSQINQAKLA
>P77147 ~~~ydhT~~~Uncharacterized protein YdhT~~~
MIITQADLREWRIGAVMYRWFLRHFPRGGSYADIHHALIEEGYTDWAESLVEYAWKKWLADENFAHQEVSSMQKLATDPG
ERPFCSQFARSDDHARIGCCEDNARIATAGYAAQIASMGYSVRIGSVGFNSHIGSSGERARVAVTGNSSRISSAGDSSRI
ANTGMRVRVCTLGERCHVASNGDLVQIASFGANARIANSGDNVHIIASGENSTVVSTGVVDSIILGPGGSAALAYHDGER
VRFAVAIEGENNIRAGVRYRLNEQHQFVEC
>P77409 ~~~ydhU~~~Putative cytochrome YdhU~~~COG4117
MNPSQHAEQFQSQLANYVPQFTPEFWPVWLIIAGVLLVGMWLVLGLHALLRARGVKKSATDHGEKIYLYSKAVRLWHWSN
ALLFVLLLASGLINHFAMVGATAVKSLVAVHEVCGFLLLACWLGFVLINAVGDNGHHYRIRRQGWLERAAKQTRFYLFGI
MQGEEHPFPATTQSKFNPLQQVAYVGVMYGLLPLLLLTGLLCLYPQAVGDVFPGVRYWLLQTHFALAFISLFFIFGHLYL
CTTGRTPHETFKSMVDGYHRH
>P76192 1.-.-.-~~~ydhV~~~Uncharacterized oxidoreductase YdhV~~~COG2414
MANGWTGNILRVNLTTGNITLEDSSKFKSFVGGMGFGYKIMYDEVPPGTKPFDEANKLVFATGPLTGSGAPCSSRVNITS
LSTFTKGNLVVDAHMGGFFAAQMKFAGYDVIIIEGKAKSPVWLKIKDDKVSLEKADFLWGKGTRATTEEICRLTSPETCV
AAIGQAGENLVPLSGMLNSRNHSGGAGTGAIMGSKNLKAIAVEGTKGVNIADRQEMKRLNDYMMTELIGANNNHVVPSTP
QSWAEYSDPKSRWTARKGLFWGAAEGGPIETGEIPPGNQNTVGFRTYKSVFDLGPAAEKYTVKMSGCHSCPIRCMTQMNI
PRVKEFGVPSTGGNTCVANFVHTTIFPNGPKDFEDKDDGRVIGNLVGLNLFDDYGLWCNYGQLHRDFTYCYSKGVFKRVL
PAEEYAEIRWDQLEAGDVNFIKDFYYRLAHRVGELSHLADGSYAIAERWNLGEEYWGYAKNKLWSPFGYPVHHANEASAQ
VGSIVNCMFNRDCMTHTHINFIGSGLPLKLQREVAKELFGSEDAYDETKNYTPINDAKIKYAKWSLLRVCLHNAVTLCNW
VWPMTVSPLKSRNYRGDLALEAKFFKAITGEEMTQEKLDLAAERIFTLHRAYTVKLMQTKDMRNEHDLICSWVFDKDPQI
PVFTEGTDKMDRDDMHASLTMFYKEMGWDPQLGCPTRETLQRLGLEDIAADLAAHNLLPA
>P77564 ~~~ydhW~~~Uncharacterized protein YdhW~~~
MGKMNHQDELPLAKVSEVDEAKRQWLQGMRHPVDTVTEPEPAEILAEFIRQHSAAGQLVARAVFLSPPYLVAEEELSVLL
ESIKQNGDYADIACLTGSKDDYYYSTQAMSENYAAMSLQVVEQDICRAIAHAVRFECQTYPRPYKVAMLMQAPYYFQEAQ
IEAAIAAMDVAPEYADIRQVESSTAVLYLFSERFMTYGKAYGLCEWFEVEQFQNP
>P77375 ~~~ydhX~~~Uncharacterized ferredoxin-like protein YdhX~~~COG0437
MSFTRRKFVLGMGTVIFFTGSASSLLANTRQEKEVRYAMIHDESRCNGCNICARACRKTNHVPAQGSRLSIAHIPVTDND
NETQYHFFRQSCQHCEDAPCIDVCPTGASWRDEQGIVRVEKSQCIGCSYCIGACPYQVRYLNPVTKVADKCDFCAESRLA
KGFPPICVSACPEHALIFGREDSPEIQAWLQQNKYYQYQLPGAGKPHLYRRFGQHLIKKENV
>P0AAL6 ~~~ydhY~~~Uncharacterized ferredoxin-like protein YdhY~~~COG0437
MNPVDRPLLDIGLTRLEFLRISGKGLAGLTIAPALLSLLGCKQEDIDSGTVGLINTPKGVLVTQRARCTGCHRCEISCTN
FNDGSVGTFFSRIKIHRNYFFGDNGVGSGGGLYGDLNYTADTCRQCKEPQCMNVCPIGAITWQQKEGCITVDHKRCIGCS
ACTTACPWMMATVNTESKKSSKCVLCGECANACPTGALKIIEWKDITV
>P0A6D5 1.1.1.282~~~ydiB~~~Quinate/shikimate dehydrogenase~~~COG0169
MDVTAKYELIGLMAYPIRHSLSPEMQNKALEKAGLPFTYMAFEVDNDSFPGAIEGLKALKMRGTGVSMPNKQLACEYVDE
LTPAAKLVGAINTIVNDDGYLRGYNTDGTGHIRAIKESGFDIKGKTMVLLGAGGASTAIGAQGAIEGLKEIKLFNRRDEF
FDKALAFAQRVNENTDCVVTVTDLADQQAFAEALASADILTNGTKVGMKPLENESLVNDISLLHPGLLVTECVYNPHMTK
LLQQAQQAGCKTIDGYGMLLWQGAEQFTLWTGKDFPLEYVKQVMGFGA
>Q8ZPR4 1.1.1.282~~~ydiB~~~Quinate/shikimate dehydrogenase~~~
MDVTAKYELIGLMAYPIRHSLSPEMQNKALEKAGLPYTYMAFEVDNTTFASAIEGLKALKMRGTGVSMPNKQLACEYVDE
LTPAAKLVGAINTIVNDDGYLRGYNTDGTGHIRAIKESGFDMRGKTMVLLGAGGAATAIGAQAAIEGIKEIKLFNRKDDF
FEKAVAFAKRVNENTDCVVTVTDLADQHAFTEALASADILTNGTKVGMKPLENESLIGDVSLLRPELLVTECVYNPHMTK
LLQQAQQAGCKTIDGYGMLLWQGAEQFELWTGKAFPLDYVKQVMGFTA
>P0ACX9 ~~~ydiE~~~Uncharacterized protein YdiE~~~COG4256
MRYTDSRKLTPETDANHKTASPQPIRRISSQTLLGPDGKLIIDHDGQEYLLRKTQAGKLLLTK
>Q8X5X6 2.8.3.8~~~ydiF~~~Acetate CoA-transferase YdiF~~~COG4670
MKPVKPPRINGRVPVLSAQEAVNYIPDEATLCVLGAGGGILEATTLITALADKYKQTQTPRNLSIISPTGLGDRADRGIS
PLAQEGLVKWALCGHWGQSPRISDLAEQNKIIAYNYPQGVLTQTLRAAAAHQPGIISDIGIGTFVDPRQQGGKLNEVTKE
DLIKLVEFDNKEYLYYKAIAPDIAFIRATTCDSEGYATFEDEVMYLDALVIAQAVHNNGGIVMMQVQKMVKKATLHPKSV
RIPGYLVDIVVVDPDQSQLYGGAPVNRFISGDFTLDDSTKLSLPLNQRKLVARRALFEMRKGAVGNVGVGIADGIGLVAR
EEGCADDFILTVETGPIGGITSQGIAFGANVNTRAILDMTSQFDFYHGGGLDVCYLSFAEVDQHGNVGVHKFNGKIMGTG
GFIDISATSKKIIFCGTLTAGSLKTEIADGKLNIVQEGRVKKFIRELPEITFSGKIALERGLDVRYITERAVFTLKEDGL
HLIEIAPGVDLQKDILDKMDFTPVISPELKLMDERLFIDAAMGFVLPEAAH
>P0AFS7 ~~~ydiK~~~Putative transport protein YdiK~~~COG0628
MVNVRQPRDVAQILLSVLFLAIMIVACLWIVQPFILGFAWAGTVVIATWPVLLRLQKIMFGRRSLAVLVMTLLLVMVFII
PIALLVNSIVDGSGPLIKAISSGDMTLPDLAWLNTIPVIGAKLYAGWHNLLDMGGTAIMAKVRPYIGTTTTWFVGQAAHI
GRFMVHCALMLLFSALLYWRGEQVAQGIRHFATRLAGVRGDAAVLLAAQAIRAVALGVVVTALVQAVLGGIGLAVSGVPY
ATLLTVLMILSCLVQLGPLPVLIPAIIWLYWTGDTTWGTVLLVWSGVVGTLDNVIRPMLIRMGADLPLILILSGVIGGLI
AFGMIGLFIGPVLLAVSWRLFAAWVEEVPPPTDQPEEILEELGEIEKPNK
>O34672 ~~~ydiM~~~Uncharacterized protein YdiM~~~COG3378
MLSDNNFVSETLENVQYLLPGAKVIKLRGYSRAHKVYTIAKSPVEKWKVAAGLSGSEIAILIRKGHWIGASIPAGGIVID
IDDSKQGELVKGLLDAQNFHCHCIRTLMGGSLFLRITNMGKRKLNK
>P76197 ~~~ydiM~~~Inner membrane transport protein YdiM~~~COG0738
MKNPYFPTALGLYFNYLVHGMGVLLMSLNMASLETLWQTNAAGVSIVISSLGIGRLSVLLFAGLLSDRFGRRPFIMLGMC
CYMAFFFGILQTNNIIIAYVFGFLAGMANSFLDAGTYPSLMEAFPRSPGTANILIKAFVSSGQFLLPLIISLLVWAELWF
GWSFMIAAGIMFINALFLYRCTFPPHPGRRLPVIKKTTSSTEHRCSIIDLASYTLYGYISMATFYLVSQWLAQYGQFVAG
MSYTMSIKLLSIYTVGSLLCVFITAPLIRNTVRPTTLLMLYTFISFIALFTVCLHPTFYVVIIFAFVIGFTSAGGVVQIG
LTLMAERFPYAKGKATGIYYSAGSIATFTIPLITAHLSQRSIADIMWFDTAIAAIGFLLALFIGLRSRKKTRHHSLKENV
APGG
>O34608 ~~~ydiN~~~Uncharacterized protein YdiN~~~
MIKFSVILGMIRCSLTHITTKNTVNALKRMIYPKQKPSFFHEFKVLYKLLKKFCIKGIMIKNIRSCMGYFL
>P76198 ~~~ydiN~~~Inner membrane transport protein YdiN~~~COG2271
MSQNKAFSTPFILAVLCIYFSYFLHGISVITLAQNMSSLAEKFSTDNAGIAYLISGIGLGRLISILFFGVISDKFGRRAV
ILMAVIMYLLFFFGIPACPNLTLAYGLAVCVGIANSALDTGGYPALMECFPKASGSAVILVKAMVSFGQMFYPMLVSYML
LNNIWYGYGLIIPGILFVLITLMLLKSKFPSQLVDASVTNELPQMNSKPLVWLEGVSSVLFGVAAFSTFYVIVVWMPKYA
MAFAGMSEAEALKTISYYSMGSLVCVFIFAALLKKMVRPIWANVFNSALATITAAIIYLYPSPLVCNAGAFVIGFSAAGG
ILQLGVSVMSEFFPKSKAKVTSIYMMMGGLANFVIPLITGYLSNIGLQYIIVLDFTFALLALITAIIVFIRYYRVFIIPE
NDVRFGERKFCTRLNTIKHRG
>O34939 2.1.1.37~~~ydiO~~~Type II methyltransferase M1.BsuMI~~~COG0270
MTNFILNENKQLSLAIEDENIENFYIDGTDLVRKIIRRSGSGVTSRVPVLSTQDLENKNLHELYDESWLRMKNRPNTELT
TESINIADLFSGCGGLSLGVWEACRALGINPRFSFACDLNEAALSVYEKNFSPDFSLNESIEKHINGELGAPLTVEEQRI
KDKVKKIDFILAGPPCQGHSDLNNHTRRKDPRNALLMRVSRVIELFQPSSVLVENVPGIIHDKSGSFKEFKNHLKTQGYY
FDEIVLNAEKLGVSQARRRYFIFASKTPVSSLNQINEFYSTNSRPISWAISDLVENVGDDIFNTASEHSLENKRRIEYLF
ENNLFELPNSERPDCHRLKPHSYKSVYGRMYWDRPAPTITRGFGSTGQGRFVHSLLKRTITPHEAARIQFFPDFFNFGDL
RRRQYQDVIGNAVPSKLSYLLALHQLR
>O34680 2.1.1.37~~~ydiP~~~Type II methyltransferase M2.BsuMI~~~COG0270
MKVVSLFSGIGGIELGLHQSGHTTEIFCEVDPLAKAVLSKNFPGVKIEDDINEIRELPSCDLVAAGFPCQDLSQAGGKEG
IDGSRSGLVKKLFELIEKKEHANRPPWILIENVPYMLRLNRGKAMSYLTSVLSELGYTWAYRTVDARCFGLPQRRHRVIL
LASLFEDPKDVIFSQDHSEPDLDGKPSVVDHSNYYGFYWTEGLRGVGWAREAVPPIKCGSSVGIASPPAVWSPYEDIVGT
INIRDAERLQGFPEDWTNITTETGKDIKEGARWRLVGNAVSVRVSKWIGENLSQPKGSISDFEGELVTKTWPSAAWGYGD
KKYKVPVSKWVANTEQIAISEFLNHPLKPLSARALNGFLGRAARCTNVNYSDEFINSLERCKDRQLQKV
>O35025 3.1.21.4~~~ydiR~~~Type II restriction enzyme BsuMI component YdiR~~~
MTFIKRLEDAYETLLGNYPAGVSSTSTSKYNEIRKIVSEAFLIGENEVYVTGTSRRISNLDTRFAQGNQRNKHTRMAVAF
ISIPSVDDSELDELIIRTRNSAITTSSKFCNGEERGTIFDGILLFLVFEGETKVYPLAFLVFENDFELKEKAEELIPGIE
LKEYPRANQSPAQENNKSAKNEDEESAKSYVVFLDIEEDGSIVEFVEDKDKTYRIGDMIWTASHTNGSSAITRRLEVIEV
VENLVVCKIKHKYNEPVDKNSLLKFVNIEQDLISFLDLHPNVQNGSEGFVSGDIVDENATTSSDDLPEDFENN
>O34885 3.1.21.4~~~ydiS~~~Type II restriction enzyme BsuMI component YdiS~~~COG1401
MEISKQTSDLLLSLEKKKGTLPKFSVLRSIPRNRIIYGAPGTGKSNYLEREVGKIFGDNPYVFTRVTFFPGYTYGQFIGA
YKPVPIYKKLSGEEEIFSSNFRDKMENFEPMIDYQFVPGPFIDVLIKALKNRYTNFILIIEEINRANAASVFGDIFQLLD
RNKNGESDYPVTFGPDIMNYLARNGIKDEMIKLPSNFFIWATMNNADQGVLPLDTAFKRRWSFEYLELEKYRKAVDSWKL
SLRYKGHNKVIMWNDFRDIINKRLKGKVPEDKLLGPFFLKESELWNQNVFKNKLLYYLKEDVFKHNPTIDFLNASTFSEL
IEKYDGSDNIFTFDIDDSSFVSD
>P76204 ~~~ydiV~~~Putative anti-FlhC(2)FlhD(4) factor YdiV~~~COG2200
MKIFLENLYHSDCYFLPIRDNQQVLVGVELITHFSSEDGTVRIPTSRVIAQLTEEQHWQLFSEQLELLKSCQHFFIQHKL
FAWLNLTPQVATLLLERDNYAGELLKYPFIELLINENYPHLNEGKDNRGLLSLSQVYPLVLGNLGAGNSTMKAVFDGLFT
RVMLDKSFIQQQITHRSFEPFIRAIQAQISPCCNCIIAGGIDTAEILAQITPFDFHALQGCLWPAVPINQITTLVQR
>D0ZW85 ~~~ydiV~~~Anti-FlhC(2)FlhD(4) factor YdiV~~~
MIASLDELYHSELFFLPVMDENARLVGLEIIATFAAEDGAVRMPTELVAPRLSVEEQYCLFVEKLALLETCQHFFIQHKL
IAWLNLPPAISDLLLLDSELFSRAARFPFLELAINENYPGLNQGKNNETLANLAMHFPLMLANFGAGEASTKAIFDGLFK
RVMLDKNFIQQRAEMISFEPFMHAIVAQISSSCESLMIAGIDTEAMFARAAPLGFSAFQGGLWPPVPVSQLIKLVQR
>Q8ZPS6 ~~~ydiV~~~Anti-FlhC(2)FlhD(4) factor YdiV~~~
MIASLDELYHSELFFLPVMDENARLVGLEIIATFAAEDGAVRMPTELVAPRLSVEEQYCLFVEKLALLETCQHFFIQHKL
IAWLNLPPAISDLLLLDSELFSRAARFPFLELAINENYPGLNQGKNNETLANLAMHFPLMLANFGAGEASTKAIFDGLFK
RVMLDKNFIQQRAEMISFEPFMHAIVAQISSSCESLMIAGIDTEAMFARAAPLGFSAFQGGLWPPVPVSQLIKLVQR
>O34303 3.1.21.4~~~ydjA~~~Type II restriction enzyme BsuMI component YdjA~~~
MDKSSKFFFEDQKYNKERIVRVLGGNLALLKSKGILYEDSSGDLIFNYVGVISNGRNVIFILPKYCNRHLDEHSKRTLFN
KLLKIFKKYSGLNKSRESDYFVSELDSDEVSDFMIADYLLNDFSLNGYYQKKFTEYEIDGEGIIDWSKTVNEITPVFSKG
VPYYFSTYNEVVQKDEYHLIVKIHKWALSKYFNDFGVILGFTGLEFDKSCDGMKILDYADFFGSVINKEIVNTYVDRDVK
LLKALKTAIDREENQFSKRPTLSLYGTKYFHRVWEEVCKTVFSHVNEYVKKISRPNWINFTDIEVNKEKKTLEPDIIKAF
EYRSKEYFLILDAKYYNINFDGKKLEGNPGVEDITKQLLYDKALEKLSRGKTKHNAFLFPSSNSTNTFKVFGSVDFDFLD
IAAVTLVYISAEQVYNLYLENKTFSTDDLFKFVSEINKSKKRHSVITSTLYGNMFLFTKRLSDKN
>P0ACY1 1.-.-.-~~~ydjA~~~Putative NAD(P)H nitroreductase YdjA~~~COG0778
MDALELLINRRSASRLAEPAPTGEQLQNILRAGMRAPDHKSMQPWHFFVIEGEGRERFSAVLEQGAIAAGSDDKAIDKAR
NAPFRAPLIITVVAKCEENHKVPRWEQEMSAGCAVMAMQMAAVAQGFGGIWRSGALTESPVVREAFGCREQDKIVGFLYL
GTPQLKASTSINVPDPTPFVTYF
>P59745 3.5.1.-~~~~~~Carbohydrate deacetylase~~~COG3394
MSNKKLIINADDFGYTPAVTQGIIEAHKRGVVTSTTALPTSPYFLEAMESARISAPTLAIGVHLTLTLNQAKPILPREMV
PSLVDEAGYFWHQSIFEEKVNLEEVYNEWDAQIISFMKSGRRPDHIDSHHNVHGKNKKLLGVALALARKYQLPLRNASRS
IETKDYLELYQDVRTPDEMLYQFYDKAISTETILQLLDMVVCSEGEVFEINCHPAFIDTILQNQSGYCMPRIREVEILTS
QEVKEAIEERGILLANYESLAM
>Q53WD3 3.5.1.-~~~~~~Carbohydrate deacetylase~~~
MDLLERLGLGGRRVLILHHDDLGLTHAQNGAYQALGLPTGSVMVPGAWASGVKGEDLGVHLVLTSEWPAPRMRPLTEGES
LRDEAGYFPESLEALWRKARAEEVERELKAQIQAAAKLFSPTHLDAHQGAVLRPDLAEVYLRLAEAYRLVPLVPESLEGL
GVPPPFLPELERLLYETPFPQVRFLDPYGLPPEERLGFYLDLAHLPPGLYYLVHHSALPTPEGRALPDWPTREADYFALS
HPEVRRVLAEFHPLTWRAVREALF
>P38055 ~~~ydjE~~~Inner membrane metabolite transport protein YdjE~~~COG2814
MEQYDQIGARLDRLPLARFHYRIFGIISFSLLLTGFLSYSGNVVLAKLVSNGWSNNFLNAAFTSALMFGYFIGSLTGGFI
GDYFGRRRAFRINLLIVGIAATGAAFVPDMYWLIFFRFLMGTGMGALIMVGYASFTEFIPATVRGKWSARLSFVGNWSPM
LSAAIGVVVIAFFSWRIMFLLGGIGILLAWFLSGKYFIESPRWLAGKGQIAGAECQLREVEQQIEREKSIRLPPLTSYQS
NSKVKVIKGTFWLLFKGEMLRRTLVAITVLIAMNISLYTITVWIPTIFVNSGIDVDKSILMTAVIMIGAPVGIFIAALII
DHFPRRLFGSTLLIIIAVLGYIYSIQTTEWAILIYGLVMIFFLYMYVCFASAVYIPELWPTHLRLRGSGFVNAVGRIVAV
FTPYGVAALLTHYGSITVFMVLGVMLLLCALVLSIFGIETRKVSLEEISEVN
>P77721 ~~~ydjF~~~Uncharacterized HTH-type transcriptional regulator YdjF~~~COG1349
MAAKDRIQAIKQMVANDKKVTVSNLSGIFQVTEETIRRDLEKLEDEGFLTRTYGGAVLNTAMLTENIHFYKRASSFYEEK
QLIARKALPFIDNKTTMAADSSSTVMELLKLLQDRSGLTLLTNSAEAIHVLAQSEIKVVSTGGELNKNTLSLQGRITKEI
IRRYHVDIMVMSCKGLDINSGALDSNEAEAEIKKTMIRQATEVALLVDHSKFDRKAFVQLADFSHINYIITDKSPGAEWI
AFCKDNNIQLVW
>P77493 2.7.1.-~~~ydjH~~~Uncharacterized sugar kinase YdjH~~~COG0524
MDNLDVICIGAAIVDIPLQPVSKNIFDVDSYPLERIAMTTGGDAINEATIISRLGHRTALMSRIGKDAAGQFILDHCRKE
NIDIQSLKQDVSIDTSINVGLVTEDGERTFVTNRNGSLWKLNIDDVDFARFSQAKLLSLASIFNSPLLDGKALTEIFTQA
KARQMIICADMIKPRLNETLDDICEALSYVDYLFPNFAEAKLLTGKETLDEIADCFLACGVKTVVIKTGKDGCFIKRGDM
TMKVPAVAGITAIDTIGAGDNFASGFIAALLEGKNLRECARFANATAAISVLSVGATTGVKNRKLVEQLLEEYEG
>P77704 ~~~ydjI~~~Uncharacterized protein YdjI~~~COG0191
MLADIRYWENDATNKHYAIAHFNVWNAEMLMGVIDAAEEAKSPVIISFGTGFVGNTSFEDFSHMMVSMAQKATVPVITHW
DHGRSMEIIHNAWTHGMNSLMRDASAFDFEENIRLTKEAVDFFHPLGIPVEAELGHVGNETVYEEALAGYHYTDPDQAAE
FVERTGCDSLAVAIGNQHGVYTSEPQLNFEVVKRVRDAVSVPLVLHGASGISDADIKTAISLGIAKINIHTELCQAAMVA
VKENQDQPFLHLEREVRKAVKERALEKIKLFGSDGKAE
>P77280 1.-.-.-~~~ydjJ~~~Uncharacterized zinc-type alcohol dehydrogenase-like protein YdjJ~~~COG1063
MKNSKAILQVPGTMKIISAEIPVPKEDEVLIKVEYVGICGSDVHGFESGPFIPPKDPNQEIGLGHECAGTVVAVGSRVRK
FKPGDRVNIEPGVPCGHCRYCLEGKYNICPDVDFMATQPNYRGALTHYLCHPESFTYKLPDNMDTMEGALVEPAAVGMHA
AMLADVKPGKKIIILGAGCIGLMTLQACKCLGATEIAVVDVLEKRLAMAEQLGATVVINGAKEDTIARCQQFTEDMGADI
VFETAGSAVTVKQAPYLVMRGGKIMIVGTVPGDSAINFLKINREVTIQTVFRYANRYPVTIEAISSGRFDVKSMVTHIYD
YRDVQQAFEESVNNKRDIIKGVIKISD
>P40775 ~~~ydjM~~~Uncharacterized protein YdjM~~~COG0797
MLKKVILAAFILVGSTLGAFSFSSDASAKHVNGNITWYNGVGKKGSSGKKLGHWDCATKIGFDVPRNGTKIRAYAKAKPK
KVITVYKNDVGRMPNAVLDVSPKAFKALGYPLSKGKVAGHYSY
>P64481 ~~~ydjM~~~Inner membrane protein YdjM~~~COG1988
MTAEGHLLFSIACAVFAKNAELTPVLAQGDWWHIVPSAILTCLLPDIDHPKSFLGQRLKWISKPIARAFGHRGFTHSLLA
VFALLATFYLKVPEGWFIPADALQGMVLGYLSHILADMLTPAGVPLLWPCRWRFRLPILVPQKGNQLERFICMALFVWSV
WMPHSLPENSAVRWSSQMINTLQIQFHRLIKHQVEY
>O34759 ~~~ydjO~~~Uncharacterized protein YdjO~~~
MSYYNKRNQEPLPKEDVSTWECTKEDCNGWTRKNFASSDTPLCPLCGSKMVDGIRSLVNLQNNSQTKTS
>O34592 3.-.-.-~~~ydjP~~~AB hydrolase superfamily protein YdjP~~~COG2267
MPYIILEDQTRLYYETHGSGTPILFIHGVLMSGQFFHKQFSVLSANYQCIRLDLRGHGESDKVLHGHTISQYARDIREFL
NAMELDHVVLAGWSMGAFVVWDYLNQFGNDNIQAAVIIDQSASDYQWEGWEHGPFDFDGLKTAMHAIQTDPLPFYESFIQ
NMFAEPPAETETEWMLAEILKQPAAISSTILFNQTAADYRGTLQNINVPALLCFGEDKKFFSTAAGEHLRSNIPNATLVT
FPKSSHCPFLEEPDAFNSTLLSFLDGVIGKS
>P76221 ~~~ydjZ~~~TVP38/TMEM64 family inner membrane protein YdjZ~~~COG0398
MMMMQSRKIWYYRITLIILLFAMLLAWALLPGVHEFINRSVAAFAAVDQQGIERFIQSYGALAAVVSFLLMILQAIAAPL
PAFLITFANASLFGAFWGGLLSWTSSMAGAALCFFIARVMGREVVEKLTGKTVLDSMDGFFTRYGKHTILVCRLLPFVPF
DPISYAAGLTSIRFRSFFIATGLGQLPATIVYSWAGSMLTGGTFWFVTGLFILFALTVVIFMAKKIWLERQKRNA
>P39173 5.1.3.15~~~yeaD~~~Putative glucose-6-phosphate 1-epimerase~~~COG0676
MIKKIFALPVIEQISPVLSRRKLDELDLIVVDHPQVKASFALQGAHLLSWKPAGEEEVLWLSNNTPFKNGVAIRGGVPVC
WPWFGPAAQQGLPAHGFARNLPWTLKSHHEDADGVALTFELTQSEETKKFWPHDFTLLAHFRVGKTCEIDLESHGEFETT
SALHTYFNVGDIAKVSVSGLGDRFIDKVNDAKENVLTDGIQTFPDRTDRVYLNPQDCSVINDEALNRIIAVGHQHHLNVV
GWNPGPALSISMGDMPDDGYKTFVCVETAYASETQKVTKEKPAHLAQSIRVAKR
>Q8ZPV9 5.1.3.15~~~yeaD~~~Putative glucose-6-phosphate 1-epimerase~~~
MINKIFALPVIEQLTPVLSRRQLDDLDLIVVDHPQVKASFALQGAHLLSWKPVGEEEVLWLSNNTPFKTGVALRGGVPIC
WPWFGPAAQQGLPSHGFARNLPWALKAHNEDDNGVMLTFELQSSEATRKYWPHDFTLLARFKVGKTCEIELEAHGEFATT
SALHSYFNVGDIANVKVSGLGDRFIDKVNDAKEGVLTDGIQTFPDRTDRVYLNPEACSVIHDATLNRTIDVVHHHHLNVV
GWNPGPALSVSMGDMPDDGYKTFVCVETVYATAPQQATEEKPSRLAQTICVAKR
>P64488 ~~~yeaR~~~Uncharacterized protein YeaR~~~COG3615
MLQIPQNYIHTRSTPFWNKQTAPAGIFERHLDKGTRPGVYPRLSVMHGAVKYLGYADEHSAEPDQVILIEAGQFAVFPPE
KWHNIEAMTDDTYFNIDFFVAPEVLMEGAQQRKVIHNGK
>P0A8A0 ~~~yebC~~~Probable transcriptional regulatory protein YebC~~~COG0217
MAGHSKWANTRHRKAAQDAKRGKIFTKIIRELVTAAKLGGGDPDANPRLRAAVDKALSNNMTRDTLNRAIARGVGGDDDA
NMETIIYEGYGPGGTAIMIECLSDNRNRTVAEVRHAFSKCGGNLGTDGSVAYLFSKKGVISFEKGDEDTIMEAALEAGAE
DVVTYDDGAIDVYTAWEEMGKVRDALEAAGLKADSAEVSMIPSTKADMDAETAPKLMRLIDMLEDCDDVQEVYHNGEISD
EVAATL
>P33218 ~~~yebE~~~Inner membrane protein YebE~~~COG2979
MANWLNQLQSLLGQSSSSTSSSADQGLVKLLVPGALGGLAGLLVANKSARKLLTKYGTNALLVGGGAVAGTVLWNKYKDK
IRAAHQDEPQFGAQSTPLDERTARLILALVFAAKSDGHIDAKERAAIDQQLRGAGVEEQGRVLIEQAIEQPLDPQRLATG
VRNEEEALEIYFLSCAAIDIDHFMERSYLNALGDALKIPQDVRDGIERDLEQQKRTLAE
>P33219 ~~~yebF~~~Protein YebF~~~
MKKRGAFLGLLLVSACASVFAANNETSKSVTFPKCEDLDAAGIAASVKRDYQQNRVARWADDQKIVGQADPVAWVSLQDI
QGKDDKWSVPLTVRGKSADIHYQVSVDCKAGMAEYQRR
>P0AD03 ~~~yebS~~~Intermembrane transport protein YebS~~~COG2995
MALNTPQITPTKKITVRAIGEELPRGDYQRCPQCDMLFSLPEINSHQSAYCPRCQAKIRDGRDWSLTRLAAMAFTMLLLM
PFAWGEPLLHIWLLGIRIDANVMQGIWQMTKQGDAITGSMVFFCVIGAPLILVTSIAYLWFGNRLGMNLRPVLLMLERLK
EWVMLDIYLVGIGVASIKVQDYAHIQAGVGLFSFVALVILTTVTLSHLNVEELWERFYPQRPATRRDEKLRVCLGCHFTG
YPDQRGRCPRCHIPLRLRRRHSLQKCWAALLASIVLLLPANLLPISIIYLNGGRQEDTILSGIMSLASSNIAVAGIVFIA
SILVPFTKVIVMFTLLLSIHFKCQQGLRTRILLLRMVTWIGRWSMLDLFVISLTMSLINRDQILAFTMGPAAFYFGAAVI
LTILAVEWLDSRLLWDAHESGNARFDD
>P76272 ~~~yebT~~~Intermembrane transport protein YebT~~~COG3008
MSQETPASTTEAQIKNKRRISPFWLLPFIALMIASWLIWDSYQDRGNTVTIDFMSADGIVPGRTPVRYQGVEVGTVQDIS
LSDDLRKIEVKVSIKSDMKDALREETQFWLVTPKASLAGVSGLDALVGGNYIGMMPGKGKEQDHFVALDTQPKYRLDNGD
LMIHLQAPDLGSLNSGSLVYFRKIPVGKVYDYAINPNKQGVVIDVLIERRFTDLVKKGSRFWNVSGVDANVSISGAKVKL
ESLAALVNGAIAFDSPEESKPAEAEDTFGLYEDLAHSQRGVIIKLELPSGAGLTADSTPLMYQGLEVGQLTKLDLNPGGK
VTGEMTVDPSVVTLLRENTRIELRNPKLSLSDANLSALLTGKTFELVPGDGEPRKEFVVVPGEKALLHEPDVLTLTLTAP
ESYGIDAGQPLILHGVQVGQVIDRKLTSKGVTFTVAIEPQHRELVKGDSKFVVNSRVDVKVGLDGVEFLGASASEWINGG
IRILPGDKGEMKASYPLYANLEKALENSLSDLPTTTVSLSAETLPDVQAGSVVLYRKFEVGEVITVRPRANAFDIDLHIK
PEYRNLLTSNSVFWAEGGAKVQLNGSGLTVQASPLSRALKGAISFDNLSGASASQRKGDKRILYASETAARAVGGQITLH
AFDAGKLAVGMPIRYLGIDIGQIQTLDLITARNEVQAKAVLYPEYVQTFARGGTRFSVVTPQISAAGVEHLDTILQPYIN
VEPGRGNPRRDFELQEATITDSRYLDGLSIIVEAPEAGSLGIGTPVLFRGLEVGTVTGMTLGTLSDRVMIAMRISKRYQH
LVRNNSVFWLASGYSLDFGLTGGVVKTGTFNQFIRGGIAFATPPGTPLAPKAQEGKHFLLQESEPKEWREWGTALPK
>P64503 ~~~yebV~~~Uncharacterized protein YebV~~~
MKTSVRIGAFEIDDGELHGESPGDRTLTIPCKSDPDLCMQLDAWDAETSIPALLNGEHSVLYRTRYDQQSDAWIMRLA
>P64506 ~~~yebY~~~Uncharacterized protein YebY~~~
MMKKSILAFLLLTSSAAALAAPQVITVSRFEVGKDKWAFNREEVMLTCRPGNALYVINPSTLVQYPLNDIAQKEVASGKT
NAQPISVIQIDDPNNPGEKMSLAPFIERAEKLC
>P76278 ~~~yebZ~~~Inner membrane protein YebZ~~~COG1276
MLAFTWIALRFIHFTSLMLVFGFAMYGAWLAPLTIRRLLAKRFLRLQQHAAVWSLISATAMLAVQGGLMGTGWTDVFSPN
IWQAVLQTQFGGIWLWQIVLALVTLIVALMQPRNMPRLLFMLTTAQFILLAGVGHATLNEGVTAKIHQTNHAIHLICAAA
WFGGLLPVLWCMQLIKGRWRHQAIQALMRFSWCGHFAVIGVLASGVLNALLITGFPPTLTTYWGQLLLLKAILVMIMVVI
ALANRYVLVPRMRQDEDRAAPWFVWMTKLEWAIGAVVLVIISLLATLEPF
>P0ADI7 3.-.-.-~~~yecD~~~Isochorismatase family protein YecD~~~COG1335
MLELNAKTTALVVIDLQEGILPFAGGPHTADEVVNRAGKLAAKFRASGQPVFLVRVGWSADYAEALKQPVDAPSPAKVLP
ENWWQHPAALGATDSDIEIIKRQWGAFYGTDLELQLRRRGIDTIVLCGISTNIGVESTARNAWELGFNLVIAEDACSAAS
AEQHNNSINHIYPRIARVRSVEEILNAL
>P52007 ~~~yecM~~~Protein YecM~~~COG3102
MANWQSIDELQDIASDLPRFIHALDELSRRLGLNITPLTADHISLRCHQNATAERWRRGFEQCGELLSENMINGRPICLF
KLHEPVQVAHWQFSIVELPWPGEKRYPHEGWEHIEIVLPGDPETLNARALALLSDEGLSLPGISVKTSSPKGEHERLPNP
TLAVTDGKTTIKFHPWSIEEIVASEQSA
>P64515 ~~~yecN~~~Inner membrane protein YecN~~~COG3788
MVSALYAVLSALLLMKFSFDVVRLRMQYRVAYGDGGFSELQSAIRIHGNAVEYIPIAIVLMLFMEMNGAETWMVHICGIV
LLAGRLMHYYGFHHRLFRWRRSGMSATWCALLLMVLANLWYMPWELVFSLR
>P0DPP3 ~~~yecU~~~Protein YecU~~~
MIKIFIGHYINVFYSTADITLKKQPLLFLAKLMVYSAALTFFTANFHCNMTRKINEYA
>P0AA70 ~~~yedA~~~Uncharacterized inner membrane transporter YedA~~~COG0697
MRFRQLLPLFGALFALYIIWGSTYFVIRIGVESWPPLMMAGVRFLAAGILLLAFLLLRGHKLPPLRPLLNAALIGLLLLA
VGNGMVTVAEHQNVPSGIAAVVVATVPLFTLCFSRLFGIKTRKLEWVGIAIGLAGIIMLNSGGNLSGNPWGAILILIGSI
SWAFGSVYGSRITLPVGMMAGAIEMLAAGVVLMIASMIAGEKLTALPSLSGFLAVGYLALFGSIIAINAYMYLIRNVSPA
LATSYAYVNPVVAVLLGTGLGGETLSKIEWLALGVIVFAVVLVTLGKYLFPAKPVVAPVIQDASSE
>P31064 ~~~yedE~~~Probable transporter YedE~~~COG2391
MSWQQFKHAWLIKFWAPIPAVIAAGILSTYYFGITGTFWAVTGEFTRWGGQLLQLFGVHAEEWGYFKIIHLEGSPLTRID
GMMILGMFGGCFAAALWANNVKLRMPRSRIRIMQAIIGGIIAGFGARLAMGCNLAAFFTGIPQFSLHAWFFAIATAIGSW
FGARFTLLPIFRIPVKMQKVSAASPLTQKPDQARRRFRLGMLVFFGMLGWALLTAMNQPKLGLAMLFGVGFGLLIERAQI
CFTSAFRDMWITGRTHMAKAIIIGMAVSAIGIFSYVQLGVEPKIMWAGPNAVIGGLLFGFGIVLAGGCETGWMYRAVEGQ
VHYWWVGLGNVIGSTILAYYWDDFAPALATDWDKINLLKTFGPMGGLLVTYLLLFAALMLIIGWEKRFFRRAAPQTAKEI
A
>P0AA31 ~~~yedF~~~Putative sulfur carrier protein YedF~~~COG0425
MKNIVPDYRLDMVGEPCPYPAVATLEAMPQLKKGEILEVVSDCPQSINNIPLDARNHGYTVLDIQQDGPTIRYLIQK
>Q8GAX4 3.-.-.-~~~~~~Uncharacterized hydrolase in edin-B 3'region~~~
MSKRLLLFDFDETYFKHNTNEEDLSHLREMEKLLEKLTNNNEVMTAVLTGSTFQSVMDKMDQVNMTFKPLHIFSDLSSKM
FTWNNGEYIESETYKKKVFSEPFLFEDIEDILRHISAQYNVEFIPQRAFEGNETHYNFYFHSTGNHSNDRRILEALVRYA
NDQNYTARFSRSNPLAGDPENAYDIDFTPSNAGKLYATQFLMKKYNIPVKSILGFGDSGNDEAYLSYLEHAYLMSNSRDE
ALKQKFRLTKYPYYQGITLHVKEFVEGKYDY
>P46125 ~~~yedI~~~Inner membrane protein YedI~~~COG2354
MLLAGSSLLTLLDDIATLLDDISVMGKLAAKKTAGVLGDDLSLNAQQVSGVRANRELPVVWGVAKGSLINKVILVPLALI
ISAFIPWAITPLLMIGGAFLCFEGVEKVLHMLEARKHKEDPAQSQQRLEKLAAQDPLKFEKDKIKGAIRTDFILSAEIVA
ITLGIVAEAPLLNQVLVLSGIALVVTVGVYGLVGVIVKIDDLGYWLAEKSSALMQALGKGLLIIAPWLMKALSIVGTLAM
FLVGGGIVVHGIAPLHHAIEHFAGQQSAVVAMILPTVLNLILGFIIGGIVVLGVKAVEKMRGQAH
>P46144 ~~~yedJ~~~Uncharacterized protein YedJ~~~COG1418
MDLQHWQAQFENWLKNHHQHQDAAHDVCHFRRVWATAQKLAADDDVDMLVILTACYFHDIVSLAKNHPQRQRSSILAAEE
TRRLLREEFEQFPAEKIEAVCHAIAAHSFSAQIAPLTTEAKIVQDADRLEALGAIGLARVFAVSGALGVALFDGEDPFAQ
HRPLDDKRYALDHFQTKLLKLPQTMQTARGKQLAQHNAHFLVEFMAKLSAELAGENEGVDHKVIDAFSSAG
>P76318 3.4.-.-~~~yedK~~~Abasic site processing protein YedK~~~COG2135
MCGRFAQSQTREDYLALLAEDIERDIPYDPEPIGRYNVAPGTKVLLLSERDEHLHLDPVFWGYAPGWWDKPPLINARVET
AATSRMFKPLWQHGRAICFADGWFEWKKEGDKKQPFFIYRADGQPIFMAAIGSTPFERGDEAEGFLIVTAAADQGLVDIH
DRRPLVLSPEAAREWMRQEISGKEASEIAASGCVPANQFSWHPVSRAVGNVKNQGAELIQPV
>J9RX10 ~~~yedS~~~Outer membrane protein YedS~~~COG3203
MKRKVLAMLVPALLVAGAANAAEIYNKDGNKVDFYGKMVGERIWSNTDDNNSENEDTSYARFGVKGESQITSELTGFGQF
EYNLDASKPEGSNQEKTRLTFAGLKYNELGSFDYGRNYGVAYDAAAYTDMLVEWGGDSWASADNFMNGRTNGVATYRNSD
FFGLVDGLNFAVQYQGKNSNRGVTKQNGDGYALSVDYNIEGFGFVGAYSKSDRTNEQAGDGYGDNAEVWSLAAKYDANNI
YAAMMYGETRNMTVLANDHFANKTQNFEAVVQYQFDFGLRPSLGYVYSKGKDLYARDGHKGVDADRVNYIEVGTWYYFNK
NMNVYTAYKFNLLDKDDAAITDAATDDQFAVGIVYQF
>G8QM63 ~~~yedY1~~~Putative protein-methionine-sulfoxide reductase subunit YedZ1~~~COG2041
MFPKKNRPMIANGDRVLKDAFKDVGKYIELPARRAFLQRSMTLGGLSLLTGCAITDDESVESALLAMSRFNNCVQGWLFD
PNQLAPIYPESMITRPFPFNAYYGDDEVREVDAASFRLEVSGLVSDKHAWTLDELHALPQVDQVTRHICVEGWSAIGKWS
GVPFTTFLRLVGADLDAKYVGFKCADDYYTSIDMATALHPQTQLTLSYDGQTLPARYGFPMKLRMPTKLGYKNPKYIQAL
FVTNSYPGGYWEDQGYNWFGGS
>G8QM62 ~~~yedZ1~~~Putative protein-methionine-sulfoxide reductase subunit YedZ1~~~COG4117
MSKEIYAIHPPWLRVTHWLYVVAVVILVMSGWRIYDASPFFPFFIPKEITLGGWLGGALQWHFAAMWLLAVNGLIYLFFN
IFSGRLWHKFFPLSLRAIVADLLDALKGKLSHADLSHYNALQRAAYLFAIADIIVIVLSGLVLWKSVQFPILRELLGGYE
AARRVHFIGMSALVAFVGVHVAMVALVPRTLIAMIRGH
>P33011 ~~~yeeA~~~Inner membrane protein YeeA~~~COG1289
MRADKSLSPFEIRVYRHYRIVHGTRVALAFLLTFLIIRLFTIPESTWPLVTMVVIMGPISFWGNVVPRAFERIGGTVLGS
ILGLIALQLELISLPLMLVWCAAAMFLCGWLALGKKPYQGLLIGVTLAIVVGSPTGEIDTALWRSGDVILGSLLAMLFTG
IWPQRAFIHWRIQLAKSLTEYNRVYQSAFSPNLLERPRLESHLQKLLTDAVKMRGLIAPASKETRIPKSIYEGIQTINRN
LVCMLELQINAYWATRPSHFVLLNAQKLRDTQHMMQQILLSLVHALYEGNPQPVFANTEKLNDAVEELRQLLNNHHDLKV
VETPIYGYVWLNMETAHQLELLSNLICRALRK
>E0TU96 ~~~yeeF~~~Toxin YeeF~~~
MKVLEVKTLLSEATDRAKEYKELRTQMVNLRKALKSVADLGDSEFSGKGASNIKAFYHDHVGVTDQWIDYIDMKIAFFNS
IAGAAEDKGLSDAYIEESFLEHELANAHKKSKSIMSEQKKAMKDILNDIDNILPLDLFSTETFNDELADANDKRKKTLEK
LDALDEDLKTEYALSEPNEQFIKSDFQKLQEATGKGKNATPIHYNAKAYRESDIHKKKGDIEKRTEAYLKIKKEEAKERE
IEKLKERLKNYDYADADEFYEMAKTIGYENLTAEQQRYFTQIENTRELEAGFKGVAVGLYDSGKDAVVGLWDMVTDPGGT
VEAITGAVAHPIKTYEAISAAIEESYQKDMVNGDTYSRARWVSYAVGTVVTSIVGTKGVGAVSKTGTATKVTTKVKTAAS
KSATAQKAITVSKQTIDHIKQKVNKGIEVSKKHVKTKLNQIGDLTLADILPYHPRHDLVPAGVPYNAVNGVTLKEGLQKF
AKVILPKPYGTSSSGRRTPAPVVPPVTVKYGEHYARWSRKKVLKPNIIYKTKEGYTYTTDNYGRITSVKADLQLGEAKRN
QYAQSHAGKPQDRKPDDDGGHLIATQFKGSGQFDNIVPMNSQINRSGGRWYEMEQEWAKALKEEPPQKVNVNIKAIYKGD
SLRPDKFIVKFRIGDADFEKVTIKNQSGG
>O31506 ~~~yeeF~~~Toxin YeeF~~~COG5444
MKVFEAKTLLSEATDRAKEYKELRTQMVNLRKALKGVADLSDSEFSGKGASNIKAFYHDHVGVADQWIDYIDMKIAFFNS
IAGAAEDKGLSDAYIEESFLEHELANANKKSKSIMSEQKKAMKDILNDIDDILPLDLFSTETFKDELADANDKRKKTLEK
LDALDEDLKTEYALSEPNEQFIKSDFQKLQEATGKGKNATPIHYNAKAYRESDIHKKKGDIEKRTEAYLKIKKEEAKERE
IEKLKERLKNYDYADADEFYEMAKTIGYENLTAEQQRYFTQIENTRELEAGFKGVAVGLYDSGKDAVVGLWDMVTDPGGT
VEAITGAMAHPIKTYEAISAAIEESYQKDMVNGDTYSRARWVSYAVGTVVTSIVGTKGVGAVSKTGTAAKVTTKVKTAAS
KSATAQKAITVSKQTVDHIKQKVNTGIEVSKKHVKTKLNQIGDLTLADILPYHPRHDLVPAGVPYNAVNGVTLKEGLQKF
AKVILPKPYGTSSSGRRTPAPHVPPVTVKYGEHFARWSRKKVLKPNIIYKTKEGYTYTTDNYGRITSVKADLQLGEAKRN
QYAQTNAGKPQDRKPDDDGGHLIATQFKGSGQFDNIVPMNSQINRSGGKWYEMEQEWAKALSKKPPKKVAVQIEPVYSGD
SLRPSYFDVTYKIGSRKEISVSIKNQPGG
>O31510 ~~~yeeK~~~Spore coat protein YeeK~~~
MTDTRHMYGGPGFGHYQGFGIGHPGYGMQSTGYPGYGMYGGHPGYGMQGYPDHGIHGGVGGYPGYGGYGGYPSGGYGGSP
GTGSYPSMHHENDGHHHYYHHHHDGKDNLHHHHHHVGKDNHHHHHDGHYGHHHHHMGHWGKDGYK
>P0A8A2 ~~~yeeN~~~Probable transcriptional regulatory protein YeeN~~~COG0217
MGRKWANIVAKKTAKDGATSKIYAKFGVEIYAAAKQGEPDPELNTSLKFVIERAKQAQVPKHVIDKAIDKAKGGGDETFV
QGRYEGFGPNGSMIIAETLTSNVNRTIANVRTIFNKKGGNIGAAGSVSYMFDNTGVIVFKGTDPDHIFEILLEAEVDVRD
VTEEEGNIVIYTEPTDLHKGIAALKAAGITEFSTTELEMIAQSEVELSPEDLEIFEGLVDALEDDDDVQKVYHNVANL
>P76352 ~~~yeeO~~~Probable FMN/FAD exporter YeeO~~~COG0534
MLRHILTAKNLLSNPIFKFPNCLPFLSTVCCICRQFVGENLCSFADSPSLFEMWFHFLQLRSALNISSALRQVVHGTRWH
AKRKSYKVLFWREITPLAVPIFMENACVLLMGVLSTFLVSWLGKDAMAGVGLADSFNMVIMAFFAAIDLGTTVVVAFSLG
KRDRRRARVATRQSLVIMTLFAVLLATLIHHFGEQIIDFVAGDATTEVKALALTYLELTVLSYPAAAITLIGSGALRGAG
NTKIPLLINGSLNILNIIISGILIYGLFSWPGLGFVGAGLGLTISRYIGAVAILWVLAIGFNPALRISLKSYFKPLNFSI
IWEVMGIGIPASVESVLFTSGRLLTQMFVAGMGTSVIAGNFIAFSIAALINLPGSALGSASTIITGRRLGVGQIAQAEIQ
LRHVFWLSTLGLTAIAWLTAPFAGVMASFYTQDPQVKHVVVILIWLNALFMPIWSASWVLPAGFKGARDARYAMWVSMLS
MWGCRVVVGYVLGIMLGWGVVGVWMGMFADWAVRAVLFYWRMVTGRWLWKYPRPEPQKCEKKPVVSE
>P76361 ~~~yeeR~~~Inner membrane protein YeeR~~~COG2020
MLQIVGALILLIAGFAILRLLFRALISTASALAGLILLCLFGPALLAGYITERITRLFHIRWLAGVFLTIAGMIISFMWG
LDGKHIALEAHTFDSVKFILTTALAGGLLAVPLQIKNIQQNGITPEDISKEINGYYCCFYTAFFLMACSACAPLIALQYD
ISPSLMWWGGLLYWLAALVTLLWAASQIQALKKLTCAISQTLEEQPVLNSKSWLTSLQNDYSLPDSLTERIWLTLISQRI
SRGELREFELADGNWLLNNAWYERNMAGFNEQLKENLSFTPDELKTLFRNRLNLSPEANDDFLDRCLDGGDWYPFSEGRR
FVSFHHVDELRICASCGLTEVHHAPENHKPDPEWYCSSLCRETETLCQEIYERPYNSFISDATANGLILMKLPETWSTNE
KMFASGGQGHGFAAERGNHIVDRVRLKNARILGDNNARNGADRLVSGTEIQTKYCSTAARSVGAAFDGQNGQYRYMGNNG
PMQLEVPRDQYAGAVETMRNKIREGKVEER
>P64526 ~~~yeeW~~~Protein YeeW~~~
MMTLEADSVNVQALDMGHIVVDIDGVNITELINKAAENGYSLRVVDDRDSTETPATYASPHQLL
>P0A8M6 ~~~yeeX~~~UPF0265 protein YeeX~~~COG2926
METTKPSFQDVLEFVRLFRRKNKLQREIQDVEKKIRDNQKRVLLLDNLSDYIKPGMSVEAIQGIIASMKGDYEDRVDDYI
IKNAELSKERRDISKKLKAMGEMKNGEAK
>P0AD12 ~~~yeeZ~~~Protein YeeZ~~~COG0451
MKKVAIVGLGWLGMPLAMSLSARGWQVTGSKTTQDGVEAARMSGIDSYLLRMEPELVCDSDDLDALMDADALVITLPARR
SGPGDEFYLQAVQELVDSALAHRIPRIIFTSSTSVYGDAQGTVKETTPRNPVTNSGRVLEELEDWLHNLPGTSVDILRLA
GLVGPGRHPGRFFAGKTAPDGEHGVNLVHLEDVIGAITLLLQAPKGGHIYNICAPAHPARNVFYPQMARLLGLEPPQFRN
SLDSGKGKIIDGSRICNELGFEYQYPDPLVMPLE
>P69346 ~~~yefM~~~Antitoxin YefM~~~COG2161
MRTISYSEARQNLSATMMKAVEDHAPILITRQNGEACVLMSLEEYNSLEETAYLLRSPANARRLMDSIDSLKSGKGTEKD
IIE
>P76393 2.7.-.-~~~yegI~~~Protein kinase YegI~~~COG4248
MKTNIKVFTSTGELTTLGRELGKGGEGAVYDIEEFVDSVAKIYHTPPPALKQDKLAFMAATADAQLLNYVAWPQATLHGG
RGGKVIGFMMPKVSGKEPIHMIYSPAHRRQSYPHCAWDFLLYVARNIASSFATVHEHGHVVGDVNQNSFMVGRDSKVVLI
DSDSFQINANGTLHLCEVGVSHFTPPELQTLPSFVGFERTENHDNFGLALLIFHVLFGGRHPYSGVPLISDAGNALETDI
THFRYAYASDNQRRGLKPPPRSIPLSMLPSDVEAMFQQAFTESGVATGRPTAKAWVAALDSLRQQLKKCIVSAMHVYPAH
LTDCPWCALDNQGVIYFIDLGEEVITTGGNFVLAKVWAMVMASVAPPALQLPLPDHFQPTGRPLPLGLLRREYIILLEIA
LSALSLLLCGLQAEPRYIILVPVLAAIWIIGSLTSKAYKAEVQQRREAFNRAKMDYDHLVRQIQQVGGLEGFIAKRTMLE
KMKDEILGLPEEEKRALAALHDTARERQKQKFLEGFFIDVASIPGVGPARKAALRSFGIETAADVTRRGVKQVKGFGDHL
TQAVIDWKASCERRFVFRPNEAITPADRQAVMAKMTAKRHRLESALTVGATELQRFRLHAPARTMPLMEPLRQAAEKLAQ
AQADLSRC
>P76402 ~~~yegP~~~UPF0339 protein YegP~~~COG3422
MAGWFELSKSSDNQFRFVLKAGNGETILTSELYTSKTSAEKGIASVRSNSPQEERYEKKTASNGKFYFNLKAANHQIIGS
SQMYATAQSRETGIASVKANGTSQTVKDNT
>P76407 2.7.1.-~~~yegS~~~Lipid kinase YegS~~~COG1597
MAEFPASLLILNGKSTDNLPLREAIMLLREEGMTIHVRVTWEKGDAARYVEEARKFGVATVIAGGGDGTINEVSTALIQC
EGDDIPALGILPLGTANDFATSVGIPEALDKALKLAIAGDAIAIDMAQVNKQTCFINMATGGFGTRITTETPEKLKAALG
SVSYIIHGLMRMDTLQPDRCEIRGENFHWQGDALVIGIGNGRQAGGGQQLCPNALINDGLLQLRIFTGDEILPALVSTLK
SDEDNPNIIEGASSWFDIQAPHDITFNLDGEPLSGQNFHIEILPAALRCRLPPDCPLLR
>Q8ZNP1 2.7.1.-~~~yegS~~~Probable lipid kinase YegS~~~
MANFPASLLILNGKSADNQPLREAITLLRDEGIQIHVRVTWEKGDAQRYVDEARRLGVETVIAGGGDGTINEVSTALIQI
RDGVAPALGLLPLGTANDFATSAGIPEALDKALKLAIAGNAMEIDMAMVNDKTCFINMATGGFGTRITTETPEKLKAALG
GVSYLIHGLMRMDTLTPDRCEIRGENFHWQGDALVIGIGNGRQAGGGQQLCPTALINDGLLQLRIFTGEELLPALFSTLT
QSDDNPNIIDGASAWFDIHAPHEITFNLDGEPLSGQEFHIEVLPGALRCRLPPDCPLLR
>P76417 ~~~yegT~~~Putative nucleoside transporter YegT~~~COG2211
MKTTAKLSFMMFVEWFIWGAWFVPLWLWLSKSGFSAGEIGWSYACTAIAAILSPILVGSITDRFFSAQKVLAVLMFAGAL
LMYFAAQQTTFAGFFPLLLAYSLTYMPTIALTNSIAFANVPDVERDFPRIRVMGTIGWIASGLACGFLPQILGYADISPT
NIPLLITAGSSALLGVFAFFLPDTPPKSTGKMDIKVMLGLDALILLRDKNFLVFFFCSFLFAMPLAFYYIFANGYLTEVG
MKNATGWMTLGQFSEIFFMLALPFFTKRFGIKKVLLLGLVTAAIRYGFFIYGSADEYFTYALLFLGILLHGVSYDFYYVT
AYIYVDKKAPVHMRTAAQGLITLCCQGFGSLLGYRLGGVMMEKMFAYQEPVNGLTFNWSGMWTFGAVMIAIIAVLFMIFF
RESDNEITAIKVDDRDIALTQGEVK
>P33340 ~~~yehA~~~Uncharacterized fimbrial-like protein YehA~~~COG3539
MEIRIMLFILMMMVMPVSYAACYSELSVQHNLVVQGDFALTQTQMATYEHNFNDSSCVSTNTITPMSPSDIIVGLYNDTI
KLNLHFEWTNKNNITLSNNQTSFTSGYSVTVTPAASNAKVNVSAGGGGSVMINGVATLSSASSSTRGSAAVQFLLCLLGG
KSWDACVNSYRNALAQNAGVYSFNLTLSYNPITTTCKPDDLLITLDSIPVSQLPATGNKATINSKQGDIILRCKNLLGQQ
NQTSRKMQVYLSSSDLLTNSNTILKGAEDNGVGFILESNGSPVTLLNITNSSKGYTNLKEVAAKSKLTDTTVSIPITASY
YVYDTNKVKSGALEATALINVKYD
>P33341 ~~~yehB~~~Outer membrane usher protein YehB~~~COG3188
MLRMTPLASAIVALLLGIEAYAAEETFDTHFMIGGMKDQQVANIRLDDNQPLPGQYDIDIYVNKQWRGKYEIIVKDNPQE
TCLSREVIKRLGINSDNFASGKQCLTFEQLVQGGSYTWDIGVFRLDFSVPQAWVEELESGYVPPENWERGINAFYTSYYL
SQYYSDYKASGNNKSTYVRFNSGLNLLGWQLHSDASFSKTNNNPGVWKSNTLYLERGFAQLLGTLRVGDMYTSSDIFDSV
RFRGVRLFRDMQMLPNSKQNFTPRVQGIAQSNALVTIEQNGFVVYQKEVPPGPFAITDLQLAGGGADLDVSVKEADGSVT
TYLVPYAAVPNMLQPGVSKYDLAAGRSHIEGASKQSDFVQAGYQYGFNNLLTLYGGSMVANNYYAFTLGAGWNTRIGAIS
VDATKSHSKQDNGDVFDGQSYQIAYNKFVSQTSTRFGLAAWRYSSRDYRTFNDHVWANNKDNYRRDENDVYDIADYYQND
FGRKNSFSANMSQSLPEGWGSVSLSTLWRDYWGRSGSSKDYQLSYSNNLRRISYTLAASQAYDENHHEEKRFNIFISIPF
DWGDDVSTPRRQIYMSNSTTFDDQGFASNNTGLSGTVGSRDQFNYGVNLSHQHQGNETTAGANLTWNAPVATVNGSYSQS
STYRQAGASVSGGIVAWSGGVNLANRLSETFAVMNAPGIKDAYVNGQKYRTTNRNGVVIYDGMTPYRENHLMLDVSQSDS
EAELRGNRKIAAPYRGAVVLVNFDTDQRKPWFIKALRADGQSLTFGYEVNDIHGHNIGVVGQGSQLFIRTNEVPPSVNVA
IDKQQGLSCTITFGKEIDESRNYICQ
>P33342 ~~~yehC~~~Probable fimbrial chaperone YehC~~~COG3121
MAAIPWRPFNLRGIKMKGLLSLLIFSMVLPAHAGIVIYGTRIIYPAENKEVMVQLMNQGNRSSLLQAWIDDGDTSLPPEK
IQVPFMLTPPVAKIGANSGQQVKIKIMPNKLPTNKESIFYLNVLDIPPNSPEQEGKNALKFAMQNRIKLFYRPAGIAPVN
KATFKKLLVNRSGNGLVIKNDSANWVTISDVKANNVKVNYETIMIAPLESQSVNVKSNNANNWHLTIIDDHGNYISDKI
>P33343 ~~~yehD~~~Uncharacterized fimbrial-like protein YehD~~~COG3539
MKRSIIAAAVFSSFFMSAGVFAADVDTGTLTIKGNIAESPCKFEAGGDSVSINMPTVPTSVFEGKAKYSTYDDAVGVTSS
MLKISCPKEVAGVKLSLITNDKITGNDKAIASSNDTVGYYLYLGDNSDVLDVSAPFNIESYKTAEGQYAIPFKAKYLKLT
DNSVQSGDVLSSLVMRVAQD
>P33345 ~~~yehF~~~Protein YehF~~~
MRHFIYQDEKSHKFRAVEQQGNELHISWGKVGTKGQSQIKSFSDAAAAAKAELKLIAEKVKKGYVEQAKDNSLQPSQTVT
GSLKVADLSTIIQEQPSFVAETRAPDKNTDAVLPWLAKDIAVVFPPEVVHTTLSHRRFPGVPVQQADKLPQLRRLACSVS
QRDNKTATFDFSACSLEWQNTVAQAISQIDGLKTTQLPSPVMAVLTALEMKCTRYKVREDVMDQIVQEGGLEYATDVIIH
LQQIDIEWDYANNVIIILPSGIAPSYLEQYSRFE
>P33348 ~~~yehL~~~Uncharacterized protein YehL~~~COG0714
MSPQNNHLQRPPAAVLYADELAKLKQNDNAPCPPGWQLSLPAARAFILGDEAQNISRKVVISPSAVERMLVTLATGRGLM
LVGEPGTAKSLLSELLATAISGDAGLTIQGGASTTEDQIKYGWNYALLINHGPSTEALVPAPLYQGMRDGKIVRFEEITR
TPLEVQDCLLGMLSDRVMTGPELTGEASQLYAREGFNIIATANTRDRGVNEMSAALKRRFDFETVFPIMDFAQELELVAS
ASARLLAHSGIPHKVPDAVLELLVRTFRDLRANGEKKTSMDTLTAIMSTAEAVNVAHAVGVRAWFLANRAGEPADLVECI
AGTIVKDNEEDRARLRRYFEQRVATHKEAHWQAYYQARHRLP
>P33354 ~~~yehR~~~Uncharacterized lipoprotein YehR~~~COG4808
MKAFNKLFSLVVASVLVFSLAGCGDKEESKKFSANLNGTEIAITYVYKGDKVLKQSSETKIQFASIGATTKEDAAKTLEP
LSAKYKNIAGVEEKLTYTDTYAQENVTIDMEKVDFKALQGISGINVSAEDAKKGITMAQMELVMKAAGFKEVK
>P33359 ~~~yehW~~~Glycine betaine uptake system permease protein YehW~~~COG1174
MKMLRDPLFWLIALFVALIFWLPYSQPLFAALFPQLPRPVYQQESFAALALAHFWLVGISSLFAVIIGTGAGIAVTRPWG
AEFRPLVETIAAVGQTFPPVAVLAIAVPVIGFGLQPAIIALILYGVLPVLQATLAGLGAIDASVTEVAKGMGMSRGQRVR
KVELPLAAPVILAGVRTSVIINIGTATIASTVGASTLGTPIIIGLSGFNTAYVIQGALLVALAAIIADRLFERLVQALSQ
HAK
>P33360 7.4.2.-~~~yehX~~~Glycine betaine uptake system ATP-binding protein YehX~~~COG1125
MIEFSHVSKLFGAQKAVNDLNLNFQEGSFSVLIGTSGSGKSTTLKMINRLVEHDSGEIRFAGEEIRSLPVLELRRRMGYA
IQSIGLFPHWSVAQNIATVPQLQKWSRARIDDRIDELMALLGLESNLRERYPHQLSGGQQQRVGVARALAADPQVLLMDE
PFGALDPVTRGALQQEMTRIHRLLGRTIVLVTHDIDEALRLAEHLVLMDHGEVVQQGNPLTMLTRPANDFVRQFFGRSEL
GVRLLSLRSVADYVRREERADGEALAEEMTLRDALSLFVARGCEVLPVVNMQGQPCGTLHFQDLLVEA
>P33361 ~~~yehY~~~Glycine betaine uptake system permease protein YehY~~~COG1174
MTYFRINPVLALLLLLTAIAAALPFISYAPNRLVSGEGRHLWQLWPQTIWMLVGVGCAWLTACFIPGKKGSICALILAQF
VFVLLVWGAGKAATQLAQNGSALARTSLGSGFWLAAALALLACSDAIRRISTHPLWRWLLHMQIAIIPLWLLYSGTLNDL
SLMKEYANRQDVFDDALAQHLTLLFGAVLPALVIGVPLGIWCYFSTARQGAIFSLLNVIQTVPSVALFGLLIAPLAALVT
AFPWLGTLGIAGTGMTPALIALVLYALLPLVRGVVVGLNQIPRDVLESARAMGMSGAQRFLHVQLPLALPVFLRSLRVVM
VQTVGMAVIAALIGAGGFGALVFQGLLSSAIDLVLLGVIPVIVLAVLTDALFDLLIALLKVKRND
>P33362 ~~~yehZ~~~Glycine betaine-binding protein YehZ~~~COG1732
MPLLKLWAGSLVMLAAVSLPLQAASPVKVGSKIDTEGALLGNIILQVLESHGVPTVNKVQLGTTPVVRGAITSGELDIYP
EYTGNGAFFFKDENDAAWKNAQQGYEKVKKLDSEHNKLIWLTPAPANNTWTIAVRQDVAEKNKLTSLADLSRYLQEGGTF
KLAASAEFIERADALPAFEKAYGFKLGQDQLLSLAGGDTAVTIKAAAQQTSGVNAAMAYGTDGPVAALGLQTLSDPQGVQ
PIYAPAPVVRESVLREYPQMAQWLQPVFASLDAKTLQQLNASIAVEGLDAKKVAADYLKQKGWTK
>P62723 ~~~yeiH~~~UPF0324 inner membrane protein YeiH~~~COG2855
MTNITLQKQHRTLWHFIPGLALSAVITGVALWGGSIPAVAGAGFSALTLAILLGMVLGNTIYPHIWKSCDGGVLFAKQYL
LRLGIILYGFRLTFSQIADVGISGIIIDVLTLSSTFLLACFLGQKVFGLDKHTSWLIGAGSSICGAAAVLATEPVVKAEA
SKVTVAVATVVIFGTVAIFLYPAIYPLMSQWFSPETFGIYIGSTVHEVAQVVAAGHAISPDAENAAVISKMLRVMMLAPF
LILLAARVKQLSGANSGEKSKITIPWFAILFIVVAIFNSFHLLPQSVVNMLVTLDTFLLAMAMAALGLTTHVSALKKAGA
KPLLMALVLFAWLIVGGGAINYVIQSVIA
>P0A9E9 ~~~yeiL~~~Regulatory protein YeiL~~~COG0664
MSESAFKDCFLTDVSADTRLFHFLARDYIVQEGQQPSWLFYLTRGRARLYATLANGRVSLIDFFAAPCFIGEIELIDKDH
EPRAVQAIEECWCLALPMKHYRPLLLNDTLFLRKLCVTLSHKNYRNIVSLTQNQSFPLVNRLAAFILLSQEGDLYHEKHT
QAAEYLGVSYRHLLYVLAQFIHDGLLIKSKKGYLIKNRKQLSGLALEMDPENKFSGMMQ
>P33030 3.6.5.-~~~yeiR~~~Zinc chaperone YeiR~~~COG0523
MTRTNLITGFLGSGKTTSILHLLAHKDPNEKWAVLVNEFGEVGIDGALLADSGALLKEIPGGCMCCVNGLPMQVGLNTLL
RQGKPDRLLIEPTGLGHPKQILDLLTAPVYEPWIDLRATLCILDPRLLLDEKSASNENFRDQLAAADIIVANKSDRTTPE
SEQALQRWWQQNGGDRQLIHSEHGKVDGHLLDLPRRNLAELPASAAHSHQHVVKKGLAALSLPEHQRWRRSLNSGQGYQA
CGWIFDADTVFDTIGILEWARLAPVERVKGVLRIPEGLVRINRQGDDLHIETQNVAPPDSRIELISSSEADWNALQSALL
KLRLATTA
>P0AFT8 ~~~yeiW~~~UPF0153 protein YeiW~~~
MECRPGCGACCTAPSISSPIPGMPDGKPANTPCIQLDEQQRCKIFTSPLRPKVCAGLQASAEMCGNSRQQAMTWLIDLEM
LTAP
>P33913 ~~~yejA~~~Uncharacterized protein YejA~~~COG4166
MIVRILLLFIALFTFGVQAQAIKESYAFAVLGEPRYAFNFNHFDYVNPAAPKGGQITLSALGTFDNFNRYALRGNPGART
EQLYDTLFTTSDDEPGSYYPLIAESARYADDYSWVEVAINPRARFHDGSPITARDVEFTFQKFMTEGVPQFRLVYKGTTV
KAIAPLTVRIELAKPGKEDMLSLFSLPVFPEKYWKDHKLSDPLATPPLASGPYRVTSWKMGQNIVYSRVKDYWAANLPVN
RGRWNFDTIRYDYYLDDNVAFEAFKAGAFDLRMENDAKNWATRYTGKNFDKKYIIKDEQKNESAQDTRWLAFNIQRPVFS
DRRVREAITLAFDFEWMNKALFYNAWSRTNSYFQNTEYAARNYPDAAELVLLAPMKKDLPSEVFTQIYQPPVSKGDGYDR
DNLLKADKLLNEAGWVLKGQQRVNATTGQPLSFELLLPASSNSQWVLPFQHSLQRLGINMDIRKVDNSQITNRMRSRDYD
MMPRVWRAMPWPSSDLQISWSSEYINSTYNAPGVQSPVIDSLINQIIAAQGNKEKLLPLGRALDRVLTWNYYMLPMWYMA
EDRLAWWDKFSQPAVRPIYSLGIDTWWYDVNKAAKLPSASKQGE
>P0AFU0 ~~~yejB~~~Inner membrane ABC transporter permease protein YejB~~~COG4174
MGAYLIRRLLLVIPTLWAIITINFFIVQIAPGGPVDQAIAAIEFGNAGVLPGAGGEGVRASHAQTGVGNISDSNYRGGRG
LDPEVIAEITHRYGFDKPIHERYFKMLWDYIRFDFGDSLFRSASVLTLIKDSLPVSITLGLWSTLIIYLVSIPLGIRKAV
YNGSRFDVWSSAFIIIGYAIPAFLFAILLIVFFAGGSYFDLFPLRGLVSANFDSLPWYQKITDYLWHITLPVLATVIGGF
AALTMLTKNSFLDEVRKQYVVTARAKGVSEKNILWKHVFRNAMLLVIAGFPATFISMFFTGSLLIEVMFSLNGLGLLGYE
ATVSRDYPVMFGTLYIFTLIGLLLNIVSDISYTLVDPRIDFEGR
>P33915 ~~~yejE~~~Inner membrane ABC transporter permease protein YejE~~~COG4239
MSRLSPVNQARWARFRHNRRGYWSLWIFLVLFGLSLCSELIANDKPLLVRYDGSWYFPLLKNYSESDFGGPLASQADYQD
PWLKQRLENNGWVLWAPIRFGATSINFATNKPFPSPPSRQNWLGTDANGGDVLARILYGTRISVLFGLMLTLCSSVMGVL
AGALQGYYGGKVDLWGQRFIEVWSGMPTLFLIILLSSVVQPNFWWLLAITVLFGWMSLVGVVRAEFLRTRNFDYIRAAQA
LGVSDRSIILRHMLPNAMVATLTFLPFILCSSITTLTSLDFLGFGLPLGSPSLGELLLQGKNNLQAPWLGITAFLSVAIL
LSLLIFIGEAVRDAFDPNKAV
>P33916 ~~~yejF~~~Uncharacterized ABC transporter ATP-binding protein YejF~~~COG4172
MTQTLLAIENLSVGFRHQQTVRTVVNDVSLQIEAGETLALVGESGSGKSVTALSILRLLPSPPVEYLSGDIRFHGESLLH
ASDQTLRGVRGNKIAMIFQEPMVSLNPLHTLEKQLYEVLSLHRGMRREAARGEILNCLDRVGIRQAAKRLTDYPHQLSGG
ERQRVMIAMALLTRPELLIADEPTTALDVSVQAQILQLLRELQGELNMGMLFITHNLSIVRKLAHRVAVMQNGRCVEQNY
AATLFASPTHPYTQKLLNSEPSGDPVPLPEPASTLLDVEQLQVAFPIRKGILKRIVDHNVVVKNISFTLRAGETLGLVGE
SGSGKSTTGLALLRLINSQGSIIFDGQPLQNLNRRQLLPIRHRIQVVFQDPNSSLNPRLNVLQIIEEGLRVHQPTLSAAQ
REQQVIAVMHEVGLDPETRHRYPAEFSGGQRQRIAIARALILKPSLIILDEPTSSLDKTVQAQILTLLKSLQQKHQLAYL
FISHDLHVVRALCHQVIILRQGEVVEQGPCARVFATPQQEYTRQLLALS
>P0AD21 ~~~yejG~~~Uncharacterized protein YejG~~~
MTSLQLSIVHRLPQNYRWSAGFAGSKVEPIPQNGPCGDNSLVALKLLSPDGDNAWSVMYKLSQALSDIEVPCSVLECEGE
PCLFVNRQDEFAATCRLKNFGVAIAEPFSNYNPF
>P0AD24 ~~~yejL~~~UPF0352 protein YejL~~~COG3082
MPQISRYSDEQVEQLLAELLNVLEKHKAPTDLSLMVLGNMVTNLINTSIAPAQRQAIANSFARALQSSINEDKAH
>P0AD27 ~~~yejM~~~Inner membrane protein YejM~~~COG3083
MVTHRQRYREKVSQMVSWGHWFALFNILLSLVIGSRYLFIADWPTTLAGRIYSYVSIIGHFSFLVFATYLLILFPLTFIV
GSQRLMRFLSVILATAGMTLLLIDSEVFTRFHLHLNPIVWQLVINPDENEMARDWQLMFISVPVILLLELVFATWSWQKL
RSLTRRRRFARPLAAFLFIAFIASHVVYIWADANFYRPITMQRANLPLSYPMTARRFLEKHGLLDAQEYQRRLIEQGNPD
AVSVQYPLSELRYRDMGTGQNVLLITVDGLNYSRFEKQMPALAGFAEQNISFTRHMSSGNTTDNGIFGLFYGISPSYMDG
ILSTRTPAALITALNQQGYQLGLFSSDGFTSPLYRQALLSDFSMPSVRTQSDEQTATQWINWLGRYAQEDNRWFSWVSFN
GTNIDDSNQQAFARKYSRAAGNVDDQINRVLNALRDSGKLDNTVVIITAGRGIPLSEEEETFDWSHGHLQVPLVIHWPGT
PAQRINALTDHTDLMTTLMQRLLHVSTPASEYSQGQDLFNPQRRHYWVTAADNDTLAITTPKKTLVLNNNGKYRTYNLRG
ERVKDEKPQLSLLLQVLTDEKRFIAN
>P40709 ~~~yejM~~~Inner membrane protein YejM~~~
MVTHRQRYREKVSQMVSWGHWFALFNILLATLLGSRYLFVADWPTTLAGRIYSYLSIVGHFSFLVFATYLLILFPLTFIV
MSQRLMRFLSAILATAGMTLLLIDSEVFTRFHLHLNPIVWELVINPDQNEMARDWQLMFISVPVILLIEMLFATWSWQKL
RSLTRRRHFARPLAAFFFVSFIASHLIYIWADANFYRPITMQRANLPLSYPMTARRFLEKHGLLDAQEYQRRLVEQGNPE
AVSVQYPLSNLHYRDMGTGQNVLLITVDGLNYSRFEKQMPELATFAEQNIDFTRHMSSGNTTDNGIFGLFYGISPGYMDG
VLSTRTPAALITALNQQGYQLGLFSSDGFASPLYRQALLSDFSMPAAQTQSDAQTASQWIDWLGRYAQEDNRWFSWISFN
GTNIDDSNQKNFVKRYASAASDVDAQINRVLNALREAGKFDNTVVIITAGRGIPLTPEENRFDWSQGHLQVPLVIHWPGT
PAQRINVLTDHTDVMTTLMQRLLHVSTPANEYSQGQDIFTVPRRHNWVTAADGSTLAITTPQMTLVLNNNGHYQTYDLHG
EKIKDQKPQLSLLLQVLTEEKRFIAN
>B6A877 ~~~yenA1~~~Toxin subunit YenA1~~~
MDKYNNYSNVIKNKSSISPLLAAAAKIEPEITVLSSASKSNRSQYSQSLADTLLGLGYRSIFDIAKVSRQRFIKRHDESL
LGNGAVIFDKAVSMANQVLQKYRKNRLEKSNSPLVPQTSSSTDASSESQTNKLPEYNQLFPEPWDNFCRPGAIEALDSPA
SYLLDLYKFIQSVELDGSNQARKLETRRADIPKLSLDNDALYKEVTALSIVNDVLSGSAREYIDQSGQADKAVNQILGDT
HFPFTLPYSLPTQQINKGLGASNIELGTVIQRVDPQFSWNTTQEKYNQVLLAYTQLSSEQIALLSLPDVFTQNFLTQTEL
SAGYLSASTTEILAEKDLSRHGYIVKAADNIKGPTQLVEHSDASYDVIELTCTNQAKETITVKLRGENIITYQRTKARMV
PFDNSSPFSRQLKLTFVAEDNPSLGNLDKGPYFANMDIYAAEWVRENVSSETMVSRPFLTMTYRIAIAKAGASLEELQPE
ADAFFINNFGLSAEDSSQLVKLVAFGDQTGSKAEEIESLLSCGENLPIVSPNVIFANPIFGSYFNDEPFPAPYHFGGVYI
NAHQRNAMTIIRAEGGREIQSLSNFRLERLNRFIRLQRWLDLPSHQLDLLLTSVMQADADNSQQEITEPVLKSLGLFRHL
NLQYKITPEIFSSWLYQLTPFAVSGEIAFFDRIFNREQLFDQPFILDGGSFTYLDAKGSDAKSVKQLCAGLNISAVTFQF
IAPLVQSALGLEAGTLVRSFEVVSSLYRLVSIPQTFGLSTEDGLILMNILTDEMGYLAKQPAFDDKQTQDKDFLSIILKM
EALSAWLTKNNLTPASLALLLGVTRLAVVPTNNMVTFFKGIANGLSENVCLTTDDFQRQELEGADWWTLLSTNQVIDDMG
LVLDIHPVWGKSDEEMLMEKIQSIGVSNDNNTLSIIVQILIQAKNAQENLLSQTISAEYGVERSVVPLQLRWLGSNVYSV
LNQVLNNTPTDISSIVPKLSELTYSLLIYTQLINSLKLNKEFIFLRLTQPNWLGLTQPKLSTQLSLPEIYLITCYQDWVV
NANKNEDSIHEYLEFANIKKTEAEKTLVDNSEKCAELLAEILAWDAGEILKAASLLGLNPPQATNVFEIDWIRRLQTLSE
KTMISTEYLWQMGDLTENSEFSLKEGVGEAVMAALKAQGDSDNV
>B6A878 ~~~yenA2~~~Toxin subunit YenA2~~~
MSNSIEAKLQEDLRDALVDYYLGQIVPNSKDFTNLRSTIKNVDDLYDHLLLDTQVSAKVITSRLSLVTQSVQQYINRIAL
NLEPGLSINQQEATDWEEFANRYGYWAANQQLRMFPEIYVDPTLRLTKTEFFFQLESALNQGKLTDDVAQKAVLGYLNNF
EEVSNLEIIAGYQDGIDIENDKTYFVARTRMQPYRYFWRSLDASQRNANSQELYPTAWSEWKAISVPLENVANGIVRPIM
MDNRLYISWFEVAEEKETDSDGNIIVSGRYRTKIRLAHLGFDGVWSSGTTLREEVLADQMEEMIAVVDRMEDEPRLALVA
FKEMSESWDVVFSYICDSMLIESSNLPTTTHPPKPGDGDKGLSDLDDYGANLVWFYLHETANGGKAEYKQLILYPVIINR
DWPIELDKTHQGDFGTVDDFTLNSNYTGDELSLYLQSSSTYKYDFSKSKNIIYGIWKEDANNNRCWLNYKLLTPEDYEPQ
INATLVMCDKGDVNIITGFSLPNGGVDAGGKIKVTLRVGKKLRDKFQIKQFSQTQYLQFPEASSADVWYIGKQIRLNTLF
AKELIGKASRSLDLVLSWETQNSRLEEAILGGAAELIDLDGANGIYFWELFFHMPFMVSWRFNVEQRYEDANRWVKYLFN
PFECEDEPALLLGKPPYWNSRPLVDEPFKGYSLTQPSDPDAIAASDPIHYRKAVFNFLTKNIIDQGDMEYRKLQPSARTL
ARLSYSTASSLLGRRPDVQLTSFWQPLTLEDASYKTDSEIRAIEMQSQPLTFEPVVHDQTMSAVDNDIFMYPMNNELRGL
WDRIENRIYNLRHNLTLDGKEINMDLYDSSISPRGLMKQRYQRVVTARNASKMNFKVPNYRFEPMLNRSKSGVETLIQFG
STLLSLLERKDSLSFDAYQMIQSGDLYRFSIDLQQQDIDINKASLEALQVSKQSAQDRYDHFKELYDENISSTEQKVIEL
QSQAANSLLMAQGMRTAAAALDVIPNIYGLAVGGSHWGAPLNAAAEIIMIKYQADSSKSESLSVSESYRRRRQEWELQYK
QAEWEVNSVEQQINLQNMQIKAANKRLEQVEAQQQQAMALLDYFSERFTNESLYTWLISQLSSLYLQAYDAVLSLCLSAE
ASLLYELNLGEQSFVGGGGWNDLYQGLMAGETLKLALMRMERVYVEQNSRRQEITKTISLKALLGESWPAELNKLKQKTP
INFNLEEQIFVEDYQELYQRRIKSVSVSLPMLVGPYEDVCAQLTQTSSSYSTRADLKTVENMLTKRTFADTPHLVRSIQP
NQQISLSTGVNDSGLFMLNFDDERFLPFEGSGVDSSWRLQFTNLKQNLDSLNDVILHVKYTAAIGSSTFSQGVRKILANI
NNDE
>B6A880 ~~~yenB~~~Toxin subunit YenB~~~
MQNSQEMAITTLSLPKGGGAINGMGESVGQAGPDGMVTFSIPLPFSAGRGVAPALSLSYSSGAGNGPFGMGWQCSAMSIS
RRTQKGVPQYNEDDEFLSPSGEVMAIALNDSGFEDVRTANRLQGIPLPFSYKVTRYQPRLIQDFIKIEYWQPVKQTDGTP
FWIIYSPDGQTHILGKNSHSRVANAENPSQIASWLLEETVTPTGEHIYYQYSGENQVNCTDAEIALHPQDSAQRYLARID
YGNISPQASLFVLDEELPNLTQWLFHLVFDYGERDISINKIPTFEGGTTGWLARPDMFSRYDFGIEIRNRRLCHQVLGFH
RLEALNDRDVTDEIPVLVNRLTLDYDLNNSVSTLVAVRQVAYETDGSPITQPPLEFDYQRFDTGSIPGWQEMPQLEAFNG
YQPYQMIDLYGEGTPGILYQETPGAWWYKSPQRQIGGDSNAVTYGAMKALPKIPRLQEGATLMDINGDGRLDWVITSAGV
RGFHSIHSTGEWTHFTPLNTLPTEYFHPKAQLADLVGAGLSDLVLIGPKSVRLYANQQGNWAPAQDVTQAENVSLPVIGI
DSRQLVAFADMLGSGQQHLVEITADSVKCWPNMGHGRFGQPLTLEGFSQPQTSFNPDRVFLADIDGSGTNDIIYAHSECL
EIYLNESGNRFSKPISLLLPDGVNFDNTCQLQAADIQGLGIASLVMTVPHMSPTHWRCDLALNKPWLLNVMNNNRGAETC
LFYRSSAQFWLDEKQLVEAAGQQPECHLPFPMHLHWRSEIFDEITGNRLTQEQEYAHGSWDGQEREFRGFGRLIQRDTDG
FAQGTVDIPTHPSRTVSWFATGIPEIDTTLSAEFWRGDDQAFSPFSPRFTRWENDSEAGSDVAFIPSEHDAFWLNRAMKG
QLLRSELYGDDGTPEAEIPYSVTEMRHQVRALPTTDATVPSAWCSTIETRSYQYQRVAADPQCSQQVVIKADRYGSPLLS
VAINYPRRKKPEKSPYPDDLPETLFDSSYDTQQQQLHLTKQQQNYFHLTNDDNWLLGLPKEQRNDGYQYDQERAPANGFT
LETLIASNSLIGSNQPFTYLGQSRVAYQGGVDEQPSLQALVAYGETAILDEKTLQAFVGVLDSKTRDELLFSAGYQLAPR
LFRVESEPDVWVARQGYSEFGDYSQFWRPLSQRSTLLTGKTTLKWDKHYCVVIETQDAAQLVTQARYDYRFLTPYSLTDA
NDNQHYVVLNPFGEVIASRFWGTEAGKDAGYSTPQAKPFVVPATIEAALALSPGIPVAHCAIFEPESWMQKLTQHDVSER
MADNGTLWNALLQARFVTEDGYVCALGRRRWMARHGLSVLMLTLLAEIPRTPPHSLTITTDRYDSDDQQQLRQRILFSDG
FGRLLQSAQRVEAGESWQRSEDSSLVVNVSGTPALVVTDNRWAVSGRTEYDGKGQGIRVYQPYFLDDWRYLSDDSARTDL
FADTHIYDPLGREYQVITAKGYRRERQYTPWFVVNQDENDTAANLAI
>B6A881 ~~~yenC1~~~Toxin subunit YenC1~~~
MNQFDSALHQGTPGVSVLDNRGHVIRELRYYRHPDTPQEIAERIAFHQYDSYGFISQSIDPRLAERRKQDSSVKPNLSYF
TALSGEVLRTDGVDAGTIFSLNDIAARPAISISATGVSHTWQYEGENRPGRVLSRSEREKDREERIIERFYWAGSDASQK
ANNLAGQCLRHYNSAGLNQTLSIALTGTPISACFQPLLESAEPEWQGTNESAWLELLTPEIFTTYNRADANGETLVQTDA
MGNIQRLAYDVAGFLKSSWLSLKGGQEQIIVKSLTYSAAGQKLQEEHGNGVLTTYSYEAETQRLIGIRTERPAGHLSGAR
VFQDLRYTYDPVGNVLRITNDAEATRFWRNQKVVPENTYIYDTLYQLVSANGREMANIPQQSSQLPTLSPIDNNAYTNYI
RNYHYDSAGNLMQIRHTSAAANNSYTTNITVSKYSNRAVLSSLTDDVDKVEAFFDAAGRQNQLLPGQTLSWNARGELAKV
TPVARDGQESDSETYRYDANSQRVSKMAIQQSNNNTQTRRVLYLAGLERRTIHQGNTLFETLLVVKIGEAGRAQVQAMHW
ELGQPTEVANDELRYSYDNLIGSSGLEVDGTGQLISQEEYYPYGGTAVWMARSQREASDKAYGYSGKERDATGLYYYGFR
YYQPWAGRWLSADPAGTIDGLNLFRMVRNNPIVLHDPDGLAPGFFERISSFRKKDTLTISSLKGTGPFYTRSESEIDIDF
LFSRQDRDKDFPPQNHKELSAEDRREVLEVSSGENITSANKSSKWYAGTHWETKPLKNNTDLVVLHNGVQGAAGININLN
DIKPGRSVLVTAGTLTGCTMITGVKGNNFYALHAGTGTPSENWVTGEHGVTDNFRMLNKLIPDAGIDLNPEAVNDSLLTI
LDYFDNGTIAYNGKKGSEIHRDADNILNYRTTGYENTVGVSFSLLTKDKNGEVSASTLLELGELKPHKKHRTRGQFGMTE
LKYEARKNTVVKLR
>B6A882 ~~~yenC2~~~Toxin subunit YenC2~~~
MDIQLFSKTPSVTVFDNRGLSVRDIAYRRHPDTPKVTEECITYHQFDFRGFLAQSLDPRLNHKEVTNFSYLTDLNGNIIY
TQSVDAGNTLVLNDTEGRSVIAMTNISRGENGKDDLSLAVTRTFQYENAPLPGRPLSVTEQVNGENARITEHFVYAGNTP
QEKNLNLAGQCVSYYDAAGLIQTDSVSLTGKPLSVSRKLLKNLDDTNILADWQGNDTSAWNSLLATEIYTTVTRTDAAGA
VLTTIDAVGNQQRVAFDIAGQLSASWLTLKGGQEQVIIKVLTYSAAGQKLREEGGNGVVTTYTYEAETQRLIGIKTERPN
GHAAGAKVLQDLRYEYDPVGNVLSITNDAEETRFWRNQKVVPENAYRYDSLYQLVSASGREVAGAGQQGSDLPSPLVPLP
SDSSVYTNYTRTYTYDSAGNLMRIRHSAPATNNNYTLNITVSERSNRGVMSSLTENPADVDALFTASGSQKCLQQGQSLI
WTPRGELRTVLLVARGETADDSESYRYDGSSQRILKISSQQTNHSARVQRALYLPGLEWRTMTGGVAEAENLQVICIGEA
GRAQVRVLHWESGKPDGIINDQIRWSYDNLTCSSGLEVDGDGLVISMEEYYPYGGTAVWAARSHIETAYKTVRYSGKERD
ATGLYYYGFRYYQPWAGRWLSADPAGTVDGLNLYRMVRNNPLRLTDPDGMAPLDWLDLDTTNASRDIVKAIYQLNQIDGP
HRGVRDTYQRMTESTGMILQETLNNEAVLKGIKQKDKEKKSRGMKFTNSKLKTYAAHAGVLNTLQPDPVYKDGFLNLPGS
LGNKNTFPGVELIEDKVKPSLSQYHPDKLGKSQRWKPESSLGYYRVADTEAFITGIRSQYKSSGTDLHAVVEGRIRDHLL
ANNNVLPKMAGIAGLHAEVQALNYIISNPDIEGGNAERLNGSYIFTQRLVGDVNQDFPACYNCSGIISGLENVMTGRVNN
DVRLKRRKSF
>O34909 3.5.4.2~~~yerA~~~Putative adenine deaminase YerA~~~COG1001
MSERTFNWKNKDIRAQVDVVDSKLLPTLLLRNALVLNPYVKQWLKKNIWIYQDRIVYVGHELPNRAEEIHTIDCEGKYIV
PGYIEPHAHPFQIYNPQTLAEYVSQYGTTTFVNDNLFLLLQSGKKKALTILNELKKQPVQYFWWSRYDLQTEVLNEDHVL
PFDVRKQWIEHPDVIQGGEMTGWPRLVDGDDLMLHCMQATKKQRKRIEGHFPGASDKTLTKMKLFGADCDHEAMTGDEVM
RRLELGYYVSLRNSSIRPDVRKILQELHEKGFRYYDHFFYTTDGATPNFYKGGMTNELIRIALEEGVPAIDAYNMASFNI
AKYYQMDDYLGVVGPGRLASLNILEDPLNPNPVTVLSKGTILRENGCDLKAFTKTDWHKGGLVPLELSYDMTMDDLQFSM
PMGVKMRNAVIMEPYMIEIDNSMEQLSFDHDESYLTMLDRHGKWRVNTMIKGFASSVQGFVSSFTTTGDIVAIGKNKADM
LLAFARMKEIGGGIVLAENGNILHEIPLALCGCASSEAYEDVLEKEQKLRDLLTERGYEFCDPIYTLLFLQSTHLPYIRI
TPRGIFDVMKKTVLFPSIMR
>P31490 ~~~yerA~~~YopE regulator~~~
MYSFEQAITQLFQQLSLSIPDTIEPVIGVKVGEFACHITEHPVGQILMFTLPSLDNNNEKETLLSHNIFSQDILKPILSW
DEVGGHPVLWNRQPLNNLDNNSLYTQLEMLVQGAERLQTSSLISPPRSFS
>P31491 ~~~yerA~~~YopE regulator~~~
MYSFEQAITQLFQQLSLSIPDTIEPVIGVKVGEFACHITEHPVGQILMFTLPSLDNNDEKETLLSHNIFSQDILKPILSW
DEVGGHPVLWNRQPLNSLDNNSLYTQLEMLVQGAERLQTSSLISPPRSFS
>O34968 ~~~yerB~~~Putative lipoprotein YerB~~~COG1470
MKKWMTVCALCFVFFLLVSCQQKDAVPDTAKKLKAPLTGLKTEQKVTERRPVAVVVNNHPKARPQSGLSKADIVIEALAE
GQITRFLAIFQSQMPETVGPVRSAREYFVTLSNGFDSIFVHHGWSPGAKKQLESGAADYMNGLDFDGSLFWRADFSKPPH
NSYTSYDYIKKAAEQKGYKLKQETNPLLFQTSDAKPANESYNVRVDYGTNNVTNLVEYNYDKKAEFYTRSSDGVITTDRE
TGKPVAMQNIFIVEASHHIIDQDGRRDIDLESGGKGLLFQHGNVIETDWKQVNGRIVPVKDGKWLPFVPGKTWINIVPDL
DAASISKGEGV
>O34629 ~~~yerH~~~Uncharacterized lipoprotein YerH~~~COG4851
MKKTLALAATAAVLMLSACSSGFGGEKEEEITQKTAKSSEKAIVPKYNISDSYYKMVLPFKAGKARGLTTEQLNTRLDID
EFETGLMRLAQDSFSTDDYLFQEGQYLDEDTVLSWLARKKTGSDLKKAEKEDKNFKNEGLNPALPSSGSTEEKNESSPIY
LASMLEHDYLVRKDKNSIQLGGVMIGLALNSVYYYREKTGDPQKEVEIKDSTLRQQGEKIAQEVINRLRKKDNLKNVPIT
VALYKQASKTSIVPGNFIAKTEVKAGSTDISNWDDINEKYVFYPADTTTAEKYPDDTEVFKRFKNSIEEYFPNYTGVVGT
ALYENDEMKKMKIDIPMQFYGKSEVVAFTQFLTGEVMDYYSKSSVDVEVNITSSDGQEAVIIRNAGDKEPTVHIYD
>O31511 ~~~yesE~~~Uncharacterized protein YesE~~~COG3631
MLMNEFEKACETLRKFMAYMLEKDMKSWTELWDENAVFEFPYAPEGSPKRIEGKAAIYDYIKDYPKQIHLSSFTAPTVYR
SADSNTVIAEFQCDGHVIETGLPYRQSYISVIETRDGRIVRYRDYWNPLVVKEAFGGSFLQTEESGK
>O31518 ~~~yesO~~~Putative ABC transporter substrate-binding protein YesO~~~COG1653
MKKICYVLLSLVCVFLFSGCSAGEEASGKKEDVTLRIAWWGGQPRHDYTTKVIELYEKKNPHVHIEAEFANWDDYWKKLA
PMSAAGQLPDVIQMDTAYLAQYGKKNQLEDLTPYTKDGTIDVSSIDENMLSGGKIDNKLYGFTLGVNVLSVIANEDLLKK
AGVSINQENWTWEDYEKLAYDLQEKAGVYGSNGMHPPDIFFPYYLRTKGERFYKEDGTGLAYQDDQLFVDYFERQLRLVK
AKTSPTPDESAQIKGMEDDFIVKGKSAITWNYSNQYLGFARLTDSPLSLYLPPEQMQEKALTLKPSMLFSIPKSSEHKKE
AAKFINFFVNNEEANQLIKGERGVPVSDKVADAIKPKLNEEETNIVEYVETASKNISKADPPEPVGSAEVIKLLKDTSDQ
ILYQKVSPEKAAKTFRKKANEILERNN
>O31522 ~~~yesS~~~HTH-type transcriptional regulator YesS~~~COG2207
MKKQPQQKERVSRWKGTYFKRNFVFILLIACIPGLLTGGAIYFLSIDKVEKELQRSHETQVAREVSRMDEKLGVLELALT
QMAYDSSLMNGLAERDLEKDFTFSYQLTKKLFLLRDQQPLIEQASIFLNSPRPLVLSPEYSALTEQEALRKYRSLLASDH
SIYWKRSGNQAMLIQLIPGAAEKPFGAIMLAVDPKEMESILQSLSPYPDGSALLLDQNREVLFHEGEKDFEKTLLHEIKK
QPAENGHFQMEWKGKVYSVSFGEMNRMHQQWTFVSAAPLSAITAPMVFLSKVIIVMLVICIGLAVCMTWYASKKIYRPIQ
HLLGLFTGGEKKTWQASGQDEFKWIEKRWQDLSLESRKLQQQLLRQTPHMKKSFLQHLLQSDFYYDNEESLRFRMEEAGW
NIGGNVFHVLDLQVTGLRCEKSIFREGDEQLAVFTLTNIAEELAAKRVFQLSILDIGRLSVTVLVMKTNSSDLKAYITEL
ARQFGDVTGLCLTSTLSSKTERVTEIPSLFQDVKCGKSRRKFANRDQVIDLQADFPRDEQFAPYYPFELEKQIVQTIRLE
RKQEAKELIDGCMKELSEKAAIDRHVHSALIQLFSRIQEDILHSGLNPSELFQHRNPFLDISELREPNEAAAWLMDVVVT
PYLSKLEGRKNRQQKQLAERVIAMIHEQYMADISLESCADALGMNSYTLSKAFKQVTGINFIDYVTQIRIEKAKELLVNT
NKKIHDVSEEVGYRHNYFNRIFKKQVGMPPGVFRQMYQETP
>O31524 ~~~yesU~~~Uncharacterized protein YesU~~~
MYKEGACLYRNPLRSKSDVKDWRMEGGGQISFDDHSLHLSHVQDEAHFVFWCPETFPDGIIVTWDFSPIEQPGLCMLFFA
AAGIRGEDLFDPSLRKRTGTYPEYHSGDINALHLSYFRRKYAEERAFRTCNLRKSRGFHLAAMGADPLPSPDDADSPYRM
KLIKDKGYVHFSINGLPILEWMDDGSTYGPVLTKGKIGFRQMAPMKAVYRDFAVHQAVRR
>O31525 ~~~yesV~~~Uncharacterized protein YesV~~~COG5578
MKTTVTDALYAGCEAVVKIAWLNGLWLLFTLLGGVLFGWAPSTAAMCAVIRKWLMGQKDVPIFSLFLDTYKKEFLKVNAI
GLAFSALLLILSANYHYFSASTNWLSFAVTSCTLLAGLLYIIALMYVFPLYVHYQLPLRKYIPQALLFGAMRPLTTGCML
IGCGFVLYLLYTLPGLIPFYGPCLFGLVLMFFALRGFQKTEAQHHQAG
>O31526 4.2.2.23~~~yesW~~~Rhamnogalacturonan endolyase YesW~~~COG3401
MRRSCLMIRRRKRMFTAVTLLVLLVMGTSVCPVKAEGAARQMEALNRGLVAVKTDGGIFVSWRFLGTENASVLFNVYRDG
QKLNAAPVKTTNYVDKNGSAGSTYTVRAVVNGTEQPASEKASVWAQPYHSVPLDKPAGGTTPKGESYTYSANDASVGDVD
GDGQYELILKWDPSNSKDNSQDGYTGDVLIDAYKLDGTKLWRINLGKNIRAGAHYTQFMVYDLDGDGKAEVAMKTADGTK
DGTGKVIGNANADYRNEQGRVLSGPEYLTVFQGSTGKELVTANFEPARGNVSDWGDSYGNRVDRFLAGIAYLDGQRPSLI
MTRGYYAKTMLVAYNFRDGKLSKLWTLDSSKSGNEAFAGQGNHNLSIADVDGDGKDEIIFGSMAVDHDGKGMYSTGLGHG
DALHTGDLDPGRPGLEVFQVHEDKNAKYGLSFRDAATGKILWGVYAGKDVGRGMAADIDPRYPGQEVWANGSLYSAKGVK
IGSGVPSSTNFGIWWDGDLLREQLDSNRIDKWDYQNGVSKNMLTASGAAANNGTKATPTLQADLLGDWREEVVWRTEDSS
ALRIYTTTIPTEHRLYTLMHDPVYRLGIAWQNIAYNQPPHTSFFLGDGMAEQPKPNMYTP
>O31527 4.2.2.24~~~yesX~~~Rhamnogalacturonan exolyase YesX~~~COG3401
MKPKKRQMEYLTRGLIAVQTEQGVFVSWRFLGTDHETTAFHLYRDGKRITRDPIAESTNFLDQNGTADSVYQVAAVNKGR
EEKLSKKARVWQENVLEVPLAKPEGGVTPDGKPYTYSANDASVGDIDGDGEYEMILKWDPSNSKDNAHDGYTGEVLIDAY
KLDGTFLWRINLGRNIRAGAHYTQFMVYDLDGDGKAEIAMKTADGTTDGKGHIIGDEQADFRNEQGRILSGPEYLTVFKG
ETGEALTTVEYEPPRGKLEDWGDGYGNRMDRFLAGTAYLDGERPSLVMARGYYTRTVLVAYDFRNGRLKKRWVFDSNQPG
HEAYAGQGNHSLSVADVDGDGKDEIIYGAMAVDHDGTGLYSTGLGHGDAMHVGDLDPSRKGLEVFQVHEDATKPYGLSLR
DAGTGEILWGVHAGTDVGRGMAAHIDPSYKGSLVWGIDPPGNDGMSYGLFTSKGEKISDKAPSSANFAIWWDGDLVRELL
DHDWDGTIGRPKIEKWDAENGCLKTIFQPAGVLSNNGTKGNPVLQANLFGDWREEVIWRTEDSSALRIYTTTHLTRHCFY
TLMHDPVYRLGIAWQNTAYNQPPHTSFYLGTGMKKPPKPALYIAGSKAEAPL
>O31533 ~~~yetF~~~UPF0702 transmembrane protein YetF~~~COG2323
MGNYLSVAVELVCGLGILFIILKLLGKTQFSQITPFDFISALILGELVGNAVYDHEIKIKEIIFASLLWGVLIYIIEFIT
QKMKSSRKFLEGEPNIVIRKGELQYKVMKKNKIDINQLQSLLRQAGSFSIQEVEYAILETNGMVSVLPKSDFDKPTNKDL
QIPSKSVSLPITLIIDGEIVRDNLKEAGVDEQWLKQELKKKNIDKTEDVLFAEWHKNKPLYTVTYEQSRST
>O31539 ~~~yetJ~~~Uncharacterized protein YetJ~~~COG0670
MQATVHESKQSIMQRILTVFVFTLLIATVGLFIGQFVPVALMLPLSILEVAMIILAFWMRRRKAVGYAFVYTFAFVSGIT
LFPIVSHYASIAGAYVVLEAFGSTFVIFAVLGTIGAKMKKDLSFLWSFLLVAVLALAVVGIFNIFSPLNSAAMMAYSVIG
TIVFSLYILYDLNQIKHRHITEDLIPVMALSLYLDFINLFINLLRFFGILSSDD
>O31541 ~~~yetL~~~HTH-type transcriptional repressor YetL~~~COG1846
MELKHLPKYKHITEHAETYANIDAGSLELFLSLFDISKKMNHVMEHYFAGRGLSEGKFKILMLLFDAKDHRLSPTELAKR
SNVTKATITGLLDGLARDGFVSRRHHTEDKRKISIELTTEGKARLEQFLPGHFSKISAVMENYSDEEKDMFVKMLGDLFE
RLSVFKD
>O06489 1.14.-.-~~~yetM~~~Putative FAD-dependent monooxygenase YetM~~~COG0654
MKHMLIAGGGIGGLSAAISLRKAGFSVTLCEAASENRKTGAGILQPQNALAVLKELGVFEDCCKHGFQTEWFKTFDEQGN
LLFQVSESFLDDSLPGRNNILRKTLNDILMKHAEAVGVDIKWGKKVVAYEETAESVTALCEDGEKMQADILAGFDGIHSV
VRDKMLQKETEKEHLGMGAWRFYIELPDYTFEDATFMYRSGDTQIGVVPLAQHAGYVFVLQPCTSDYWDEEDTRFDRVKE
ILSGFRGLDFVTKHMSKQHPVIFNKLEQVAVQEPWHKGRVIIGGDAAHAGAPTLAQGAAMAIEDAIVLAEELQNHADHET
ALQAYYKRRAPRALKVQNLSSEIVRRRLKGEPGAEELIGECYAVLREGY
>O31537 ~~~yezB~~~Uncharacterized protein YezB~~~
MLETVPVRCVERKITSLVVDLSGVPIVDTMVAQQLYNLSKTLFLLGVKAVFSGIRPDVAQTSIQLGLDFSEYETYGTLKQ
ALENMGVRCIVEELEENK
>E0TU95 ~~~yezG~~~Immunity protein YezG~~~
MDTPKMGDLYQRIANQINEMIPSEWENVYLYAEILDDSSEVYFYFNIPGKNEFLYSHNIPEHFNVSEDIYDDLLIELQES
FEELREEYEKNNPETWTNLTLKLDRTGQFSIDYNYEDVIASELNGSQRKAVWVYKNLGLMPKRKTVRDFLEDYIKTNEGK
I
>C0H3X4 ~~~yezG~~~Immunity protein YezG~~~
MQDLYQLIGEKLNDIIPGEWTKIYLYAEVLDDSTMVLFHFRTPENNQIIYSQDIPSHYNVSKDIFKTLLRELRELFEELR
TEHRNNNDDVWTNLTLTLDRSGEFQLDYNYDDILASELDGYERIAIWEYKNLGILPEDEDDKEFVISYLGL
>P45508 ~~~yfaL~~~Probable autotransporter YfaL~~~COG3468
MRIIFLRKEYLSLLPSMIASLFSANGVAAVTDSCQGYDVKASCQASRQSLSGITQDWSIADGQWLVFSDMTNNASGGAVF
LQQGAEFSLLPENETGMTLFANNTVTGEYNNGGAIFAKENSTLNLTDVIFSGNVAGGYGGAIYSSGTNDTGAVDLRVTNA
MFRNNIANDGKGGAIYTINNDVYLSDVIFDNNQAYTSTSYSDGDGGAIDVTDNNSDSKHPSGYTIVNNTAFTNNTAEGYG
GAIYTNSVTAPYLIDISVDDSYSQNGGVLVDENNSAAGYGDGPSSAAGGFMYLGLSEVTFDIADGKTLVIGNTENDGAVD
SIAGTGLITKTGSGDLVLNADNNDFTGEMQIENGEVTLGRSNSLMNVGDTHCQDDPQDCYGLTIGSIDQYQNQAELNVGS
TQQTFVHALTGFQNGTLNIDAGGNVTVNQGSFAGIIEGAGQLTIAQNGSYVLAGAQSMALTGDIVVDDGAVLSLEGDAAD
LTALQDDPQSIVLNGGVLDLSDFSTWQSGTSYNDGLEVSGSSGTVIGSQDVVDLAGGDNLHIGGDGKDGVYVVVDASDGQ
VSLANNNSYLGTTQIASGTLMVSDNSQLGDTHYNRQVIFTDKQQESVMEITSDVDTRSDAAGHGRDIEMRADGEVAVDAG
VDTQWGALMADSSGQHQDEGSTLTKTGAGTLELTASGTTQSAVRVEEGTLKGDVADILPYASSLWVGDGATFVTGADQDI
QSIDAISSGTIDISDGTVLRLTGQDTSVALNASLFNGDGTLVNATDGVTLTGELNTNLETDSLTYLSNVTVNGNLTNTSG
AVSLQNGVAGDTLTVNGDYTGGGTLLLDSELNGDDSVSDQLVMNGNTAGNTTVVVNSITGIGEPTSTGIKVVDFAADPTQ
FQNNAQFSLAGSGYVNMGAYDYTLVEDNNDWYLRSQEVTPPSPPDPDPTPDPDPTPDPDPTPDPEPTPAYQPVLNAKVGG
YLNNLRAANQAFMMERRDHAGGDGQTLNLRVIGGDYHYTAAGQLAQHEDTSTVQLSGDLFSGRWGTDGEWMLGIVGGYSD
NQGDSRSNMTGTRADNQNHGYAVGLTSSWFQHGNQKQGAWLDSWLQYAWFSNDVSEQEDGTDHYHSSGIIASLEAGYQWL
PGRGVVIEPQAQVIYQGVQQDDFTAANRARVSQSQGDDIQTRLGLHSEWRTAVHVIPTLDLNYYHDPHSTEIEEDGSTIS
DDAVKQRGEIKVGVTGNISQRVSLRGSVAWQKGSDDFAQTAGFLSMTVKW
>P76483 ~~~yfbM~~~Protein YfbM~~~
MGMIGYFAEIDSEKINQLLESTEKPLMDNIHDTLSGLRRLDIDKRWDFLHFGLTGTSAFDPAKNDPLSRAVLGEHSLEDG
IDGFLGLTWNQELAATIDRLESLDRNELRKQFSIKRLNEMEIYPGVTFSEELEGQLFASIMLDMEKLISAYRRMLRQGNH
ALTVIVG
>P0A8W8 ~~~yfbU~~~UPF0304 protein YfbU~~~COG3013
MEMTNAQRLILSNQYKMMTMLDPANAERYRRLQTIIERGYGLQMRELDREFGELKEETCRTIIDIMEMYHALHVSWSNLQ
DQQSIDERRVTFLGFDAATEARYLGYVRFMVNVEGRYTHFDAGTHGFNAQTPMWEKYQRMLNVWHACPRQYHLSANEINQ
IINA
>P0A8D9 ~~~yfbV~~~UPF0208 membrane protein YfbV~~~COG3092
MSTPDNRSVNFFSLFRRGQHYSKTWPLEKRLAPVFVENRVIKMTRYAIRFMPPIAVFTLCWQIALGGQLGPAVATALFAL
SLPMQGLWWLGKRSVTPLPPAILNWFYEVRGKLQESGQVLAPVEGKPDYQALADTLKRAFKQLDKTFLDDL
>P0AD30 ~~~yfcA~~~Probable membrane transporter protein YfcA~~~COG0730
METFNSLFMVSPLLLGVLFFVAMLAGFIDSIAGGGGLLTIPALMAAGMSPANALATNKLQACGGSISATIYFIRRKVVSL
SDQKLNIAMTFVGSMSGALLVQYVQADVLRQILPILVICIGLYFLLMPKLGEEDRQRRMYGLPFALIAGGCVGFYDGFFG
PAAGSFYALAFVTLCGFNLAKATAHAKLLNATSNIGGLLLFILGGKVIWATGFVMLVGQFLGARMGSRLVLSKGQKLIRP
MIVIVSAVMSAKLLYDSHGQEILHWLGMN
>P65556 3.6.-.-~~~yfcD~~~Uncharacterized Nudix hydrolase YfcD~~~COG1443
MEQRRLASTEWVDIVNEENEVIAQASREQMRAQCLRHRATYIVVHDGMGKILVQRRTETKDFLPGMLDATAGGVVQADEQ
LLESARREAEEELGIAGVPFAEHGQFYFEDKNCRVWGALFSCVSHGPFALQEDEVSEVCWLTPEEITARCDEFTPDSLKA
LALWMKRNAKNEAVETETAE
>P67095 3.1.4.-~~~yfcE~~~Phosphodiesterase YfcE~~~COG0622
MMKLMFASDIHGSLPATERVLELFAQSGAQWLVILGDVLNHGPRNALPEGYAPAKVAERLNEVAHKVIAVRGNCDSEVDQ
MLLHFPITAPWQQVLLEKQRLFLTHGHLFGPENLPALNQNDVLVYGHTHLPVAEQRGEIFHFNPGSVSIPKGGNPASYGM
LDNDVLSVIALNDQSIIAQVAINP
>P77544 2.5.1.18~~~yfcF~~~Glutathione S-transferase YfcF~~~COG0625
MSKPAITLWSDAHFFSPYVLSAWVALQEKGLSFHIKTIDLDSGEHLQPTWQGYGQTRRVPLLQIDDFELSESSAIAEYLE
DRFAPPTWERIYPLDLENRARARQIQAWLRSDLMPIREERPTDVVFAGAKKAPLTAEGKASAEKLFAMAEHLLVLGQPNL
FGEWCIADTDLALMINRLVLHGDEVPERLVDYATFQWQRASVQRFIALSAKQSG
>P77526 1.8.4.-~~~yfcG~~~Disulfide-bond oxidoreductase YfcG~~~COG0625
MIDLYFAPTPNGHKITLFLEEAELDYRLIKVDLGKGGQFRPEFLRISPNNKIPAIVDHSPADGGEPLSLFESGAILLYLA
EKTGLFLSHETRERAATLQWLFWQVGGLGPMLGQNHHFNHAAPQTIPYAIERYQVETQRLYHVLNKRLENSPWLGGENYS
IADIACWPWVNAWTRQRIDLAMYPAVKNWHERIRSRPATGQALLKAQLGDERSDS
>P77549 ~~~yfcJ~~~Uncharacterized MFS-type transporter YfcJ~~~COG2814
MTAVSQTETRSSANFSLFRIAFAVFLTYMTVGLPLPVIPLFVHHELGYGNTMVGIAVGIQFLATVLTRGYAGRLADQYGA
KRSALQGMLACGLAGGALLLAAILPVSAPFKFALLVVGRLILGFGESQLLTGALTWGLGIVGPKHSGKVMSWNGMAIYGA
LAVGAPLGLLIHSHYGFAALAITTMVLPVLAWACNGTVRKVPALAGERPSLWSVVGLIWKPGLGLALQGVGFAVIGTFVS
LYFASKGWAMAGFTLTAFGGAFVVMRVMFGWMPDRFGGVKVAIVSLLVETVGLLLLWQAPGAWVALAGAALTGAGCSLIF
PALGVEVVKRVPSQVRGTALGGYAAFQDIALGVSGPLAGMLATTFGYSSVFLAGAISAVLGIIVTILSFRRG
>P76498 ~~~yfcO~~~Uncharacterized protein YfcO~~~
MKILRWLFALVMLIATTEAMAAGHSVDVYYGYNGDSRNIATFNLKIMMPSAVYVGEYKSSQWLMTGEILQNVSWSGPPPA
PSVKLIGYHQNINKASCPGLPSGWNCGYYTFEVIVSAEIESYFSCPWLVIMNDSEASPGGVTYQGPDSHDTICPSVSVQP
YDVSWNENYVSKSKLLTLQSTGGVVEKTLSTYLMKDGKLCDSTQMNETGGYCRWVAQMITFTASGCDKAEVSVTPNRHPI
TDKQLHDMVVRVDTSSMQPIDSTCRFQYILNEL
>P76499 ~~~yfcP~~~Uncharacterized fimbrial-like protein YfcP~~~COG3539
MNKSMIQSGGYVLLAGLILAMSSTLFAADNNLHFSGNLLSKSCALVVDGQYLAEVRFPTVSRQDLNVAGQSARVPVVFKL
KDCKGPAGYNVKVTLTGVEDSEQPGFLALDTSSTAQGVGIGMEKTDGMQVAINNTNGATFALTNGNNDINFRAWLQAKSG
RDVTIGEFTASLTATFEYI
>P76500 ~~~yfcQ~~~Uncharacterized fimbrial-like protein YfcQ~~~COG3539
MRKTFLTLLCVSSAIAHAADEDITFHGTLLSPPTCSISGGKTIEVEFRDLIIDDINGNYGRKEVPYELTCDSTTRHPDWE
MTLTWTGTQTSFNDAAIETDVPGFGIELQHDGQRFKLNTPLAINATDFTQKPKLEAVPVKASDAVLSDTNFSAYATLRVD
YQ
>P76501 ~~~yfcR~~~Uncharacterized fimbrial-like protein YfcR~~~COG3539
MTGGVMSQKFVVGAGLLVCSVCSLSAMAGSKPVDLILRVLVDAPPPCSIKGSQVEFGNMIADNVDGTNYRQDAKYTLNCT
NSLANDLRMQLKGNTSTINGETVLSTNITGLGIRIENSADNSLFAVGENSWTPFNINNQPQLKAVPVKASGAQLAAGEFN
ASLTMVVDYQ
>P77599 ~~~yfcS~~~Probable fimbrial chaperone YfcS~~~COG3121
MSDLLCSAKLGAMTLALLLSATSLSALASVTPDRTRLIFNESDKSISVTLRNNDPKLPYLAQSWIEDEKGNKITSPLTVL
PPVQRIDSMMNGQVKVQGMPDINKLPADRESMFYFNVREIPPKSNKPNTLQIALQTRIKLFWRPKALEKVSMKSPWQHKV
TLTRSGQAFTVNNPTPYYVIISNASAQKNGNPAAGFSPLVIEPKTTVPLNVKMDSVPVLTYVNDFGARMPLFFQCNGNSC
QVDEEQSRKG
>P77288 ~~~yfcV~~~Uncharacterized fimbrial-like protein YfcV~~~COG3539
MSKFVKTAIAAAMVMGAFTSTATIAAGNNGTARFYGTIEDSVCSIVPDDHKLEVDMGDIGAEKLKNNGTTTPKNFQIRLQ
DCVFDTQETMTTTFTGTVSSANSGNYYTIFNTDTGAAFNNVSLAIGDSLGTSYKSGMGIDQKIVKDTATNKGKAKQTLNF
KAWLVGAADAPDLGNFEANTTFQITYL
>P37327 ~~~yfdC~~~Inner membrane protein YfdC~~~COG2116
MDNDKIDQHSDEIEVESEEKERGKKIEIDEDRLPSRAMAIHEHIRQDGEKELERDAMALLWSAIAAGLSMGASLLAKGIF
QVELEGVPGSFLLENLGYTFGFIIVIMARQQLFTENTVTAVLPVMQKPTMSNVGLLIRLWGVVLLGNILGTGIAAWAFEY
MPIFNEETRDAFVKIGMDVMKNTPSEMFANAIISGWLIATMVWMFPAAGAAKIVVIILMTWLIALGDTTHIVVGSVEILY
LVFNGTLHWSDFIWPFALPTLAGNICGGTFIFALMSHAQIRNDMSNKRKAEARQKAERAENIKKNYKNPA
>P76520 ~~~yfdX~~~Protein YfdX~~~
MKRLIMATMVTAILASSTVWAADNAPVAAQQQTQQVQQTQKTAAAAERISEQGLYAMRDVQVARLALFHGDPEKAKELTN
EASALLSDDSTEWAKFAKPGKKTNLNDDQYIVINASVGISESYVATPEKEAAIKIANEKMAKGDKKGAMEELRLAGVGVM
ENQYLMPLKQTRNALADAQKLLDKKQYYEANLALKGAEDGIIVDSEALFVN
>Q56952 ~~~yfeA~~~Iron-binding protein YfeA~~~COG0803
MIERLNSPFLRAAALFTIVAFSSLISTAALAENNPSDTAKKFKVVTTFTIIQDIAQNIAGDVAVVESITKPGAEIHDYQP
TPRDIVKAQSADLILWNGMNLERWFEKFFESIKDVPSAVVTAGITPLPIREGPYSGIANPHAWMSPSNALIYIENIRKAL
VEHDPAHAETYNRNAQAYAEKIKALDAPLRERLSRIPAEQRWLVTSEGAFSYLAKDYGFKEVYLWPINAEQQGIPQQVRH
VIDIIRENKIPVVFSESTISDKPAKQVSKETGAQYGGVLYVDSLSGEKGPVPTYISLINMTVDTIAKGFGQ
>P77619 3.4.16.4~~~yfeW~~~Putative D-alanyl-D-alanine carboxypeptidase~~~COG1680
MKRTMLYLSLLAVSCSVSAAKYPVLTESSPEKAGFNVERLNQMDRWISQQVDVGYPSVNLLIIKDNQIVYRKAWGAAKKY
DGSVLMEQPVKATTGTLYDLASNTKMYATNFALQKLMSEGKLHPDDRIAKYIPGFADSPNDTIKGKNTLRISDLLHHSGG
FPADPQYPNKAVAGALYSQDKGQTLEMIKRTPLEYQPGSKHIYSDVDYMLLGFIVESVTGQPLDRYVEESIYRPLGLTHT
VFNPLLKGFKPQQIAATELNGNTRDGVIHFPNIRTSTLWGQVHDEKAFYSMGGVSGHAGLFSNTGDIAVLMQTMLNGGGY
GDVQLFNAETVKMFTTSSKEDATFGLGWRVNGNATMTPTFGTLASPQTYGHTGWTGTVTVIDPVNHMTIVMLSNKPHSPV
ADPQKNPNMFESGQLPIATYGWVVDQVYAALKQK
>Q8XBI9 1.11.1.-~~~yfeX~~~Dye-decolorizing peroxidase YfeX~~~COG2837
MSQVQSGILPEHCRAAIWIEANVKGEVDALRAASKTFADKLATFEAKFPDAHLGAVVAFGNNIWRALSGGVGAEELKDFP
GYGKGLAPTTQFDVLIHILSLRHDVNFSVAQAAMEAFGDCIEVKEEIHGFRWVEERDLSGFVDGTENPAGEETRREVAVI
KDGVDAGGSYVFVQRWEHNLKQLNRMSVHDQEMMIGRTKEANEEIDGDERPETSHLTRVDLKEDGKGLKIVRQSLPYGTA
SGTHGLYFCAYCARLHNIEQQLLSMFGDTDGKRDAMLRFTKPVTGGYYFAPSLDKLMAL
>P76536 1.11.1.-~~~yfeX~~~Dye-decolorizing peroxidase YfeX~~~COG2837
MSQVQSGILPEHCRAAIWIEANVKGEVDALRAASKTFADKLATFEAKFPDAHLGAVVAFGNNTWRALSGGVGAEELKDFP
GYGKGLAPTTQFDVLIHILSLRHDVNFSVAQAAMEAFGDCIEVKEEIHGFRWVEERDLSGFVDGTENPAGEETRREVAVI
KDGVDAGGSYVFVQRWEHNLKQLNRMSVHDQEMVIGRTKEANEEIDGDERPETSHLTRVDLKEDGKGLKIVRQSLPYGTA
SGTHGLYFCAYCARLHNIEQQLLSMFGDTDGKRDAMLRFTKPVTGGYYFAPSLDKLMAL
>P76537 ~~~yfeY~~~Uncharacterized protein YfeY~~~
MKSLRLMLCAMPLMLTGCSTMSSVNWSAANPWNWFGSSTKVSEQGVGELTASTPLQEQAIADALDGDYRLRSGMKTANGN
VVRFFEVMKGDNVAMVINGDQGTISRIDVLDSDIPADTGVKIGTPFSDLYSKAFGNCQKADGDDNRAVECKAEGSQHISY
QFSGEWRGPEGLMPSDDTLKNWKVSKIIWRR
>P76538 ~~~yfeZ~~~Inner membrane protein YfeZ~~~
MKSTEFHPVHYDAHGRLRLPLLFWLVLLLQARTWVLFVIAGASREQGTALLNLFYPDHDNFWLGLIPGIPAVLAFLLSGR
RATFPRTWRVLYFLLLLAQVVLLCWQPWLWLNGESVSGIGLALVVADIVALIWLLTNRRLRACFYEVKE
>P24178 ~~~yffB~~~Protein YffB~~~COG1393
MVTLYGIKNCDTIKKARRWLEANNIDYRFHDYRVDGLDSELLNDFINELGWEALLNTRGTTWRKLDETTRNKITDAASAA
ALMTEMPAIIKRPLLCVPGKPMLLGFSDSSYQQFFHEV
>P76569 ~~~yfgD~~~Uncharacterized protein YfgD~~~COG1393
MTKQVKIYHNPRCSKSRETLNLLKENGVEPEVVLYLETPADAATLRDLLKILGMNSARELMRQKEDLYKELNLADSSLSE
EALIQAMVDNPKLMERPIVVANGKARIGRPPEQVLEIVG
>P64545 ~~~yfgG~~~Protein YfgG~~~
MSQATSMRKRHRFNSRMTRIVLLISFIFFFGRFIYSSVGAWQHHQSKKEAQQSTLSVESPVQR
>P76575 ~~~yfgJ~~~Uncharacterized protein YfgJ~~~COG1645
MELHCPQCQHVLDQDNGHARCRSCGEFIEMKALCPDCHQPLQVLKACGAVDYFCQHGHGLISKKRVEFVLA
>P76576 ~~~yfgM~~~Ancillary SecYEG translocon subunit~~~COG2976
MEIYENENDQVEAVKRFFAENGKALAVGVILGVGALIGWRYWNSHQVDSARSASLAYQNAVTAVSEGKPDSIPAAEKFAA
ENKNTYGALASLELAQQFVDKNELEKAAAQLQQGLADTSDENLKAVINLRLARVQVQLKQADAALKTLDTIKGEGWAAIV
ADLRGEALLSKGDKQGARSAWEAGVKSDVTPALSEMMQMKINNLSI
>P43989 ~~~~~~Ancillary SecYEG translocon subunit~~~COG2976
MAYSIEEEQEINQLKDWWKENGKTIIVAFILGVGGMFGWRYWQTHQAEQIAQASAQYDTLINSVQQDEQAKKANIEQFVQ
ANSKTAYAVFALLDEAKKATEKQDFSAAEANLNQALTQSQDEVLTSIVALRLSAVQFQLGQLDNALSTLNQVKGESFNAR
KAILTGDIQVAKGDKVAAKNSFEQAQQSGSQLEQQMAKMKLNNL
>O31569 ~~~yfhA~~~Probable siderophore transport system permease protein YfhA~~~COG0609
MKLRFGVTAAEKKAWIVFLVLLGLTAAVLIISAGLGQRFIPPWDVAKTFFGAGSKLDELMIMSFRMPRILTALCAGVCLA
AAGAILQGLVRNPLASPDIIGITGGAAVAVVLLMMFFSDRSSSLTISLSWLPAAAFIGASAVGLIVYLLAYKNGASTFRL
VLIGIGFSMSAQAMTTLLMIKGPIYRASQANVYITGSVYGSNWQHVKIAIILSVILLFICFVALKNMNIQVLGEDIAAGA
GSAVQRNRFFLLLLSTALTGCAVSVAGTIGFVGLMAPHIARRLVGSSYGALLPASALIGALLVLTADIVGRTLFAPVEVP
AGVFTAAIGAPYFIYLLYKTRNS
>O31576 ~~~yfhH~~~Uncharacterized protein YfhH~~~
MEKRYSQMTPHELNTEIALLSEKARKAEQHGIINELAVLERKITMAKAYLLNPEDYSPGETYRVENTEDEFTISYLNGVF
AWGYRTSSPQQEEALPISVLQEKE
>P52102 ~~~yfhL~~~Ferredoxin YfhL~~~COG1145
MALLITKKCINCDMCEPECPNEAISMGDHIYEINSDKCTECVGHYETPTCQKVCPIPNTIVKDPAHVETEEQLWDKFVLM
HHADKI
>O31583 ~~~yfhP~~~Uncharacterized protein YfhP~~~COG1988
MDTGTHVVMGIALGGIATLDPVVGSDPAMAHAVMIATLAGSQAPDIDTVLKLKNNAVYIRNHRGFTHSIPAVLFWSVIIP
AILYLFYPQADFLHLWLWTLLAVVLHVFVDIFNAYGTQAIRPFSKKWVALGLINTFDPFIFISHLAAIAIWYAGGSPGIT
FLSLYIILVGYYLVRLIMQLRIKRKLHEMIHDEIESIIISPTMKFRQWRIAVTTAHAFYVGRSMEGHVVILDTFNRVPVP
ETDVMHAAKQDDNIAAFLSFSPVYRWEVDTFKDHYEVRFIDLRYRSKGHYPFVAIVHIGHDLTIRSSYTGWIFSEEKLQK
KLKLGSI
>O31585 ~~~yfhS~~~Uncharacterized protein YfhS~~~
MYVGRDMSELNMVSKKDWKNSELAYFHHAFQQIMPYLNEEGQSKYRELTQEIEARGGMKRNEADYSHGTRVSYD
>P0AD49 ~~~raiA~~~Ribosome-associated inhibitor A~~~COG1544
MTMNITSKQMEITPAIRQHVADRLAKLEKWQTHLINPHIILSKEPQGFVADATINTPNGVLVASGKHEDMYTAINELINK
LERQLNKLQHKGEARRAATSVKDANFVEEVEEE
>P71346 ~~~yfiA~~~Ribosome-associated factor Y~~~COG1544
MTLNITSKQMDITPAIREHLEERLAKLGKWQTQLISPHFVLNKVPNGFSVEASIGTPLGNLLASATSDDMYKAINEVEEK
LERQLNKLQHKSESRRADERLKDSFEN
>Q9I4L6 ~~~yfiB~~~Outer-membrane lipoprotein YfiB~~~
MLPQRLHPSRLLALALFSLVLGLAGCQTKPPQTGLSAEQIAVLQEQGFELRDEGWEFGMSSKVLFGNNLDRLNPDSRNTL
TKIARALLAVDIDKVRLEGHTDNYGDEGYNQKLSERRAESVAAVFREAGMPAANIEVRGLGMSKPVADNKTRAGRSENRR
VAIIVPAE
>P0AGJ5 2.1.1.-~~~yfiF~~~Uncharacterized tRNA/rRNA methyltransferase YfiF~~~COG0566
MNDEMKGKSGKVKVMYVRSDDDSDKRTHNPRTGKGGGRPGKSRADGGRRPARDDKQSQPRDRKWEDSPWRTVSRAPGDET
PEKADHGGISGKSFIDPEVLRRQRAEETRVYGENACQALFQSRPEAIVRAWFIQSVTPRFKEALRWMAANRKAYHVVDEA
ELTKASGTEHHGGVCFLIKKRNGTTVQQWVSQAGAQDCVLALENESNPHNLGGMMRSCAHFGVKGVVVQDAALLESGAAI
RTAEGGAEHVQPITGDNIVNVLDDFRQAGYTVVTTSSEQGKPLFKTSLPAKMVLVLGQEYEGLPDAARDPNDLRVKIDGT
GNVAGLNISVATGVLLGEWWRQNKA
>O31560 ~~~yfiR~~~Uncharacterized HTH-type transcriptional regulator YfiR~~~COG1309
MSPKVTKEHKDKRQAEILEAAKTVFKRKGFELTTMKDVVEESGFSRGGVYLYFSSTEEMFRRIIETGLDEGLRKLDKSAE
HQSVWASISSYLDELTEGLRDVADTLAPVQFEYLVTAWRNEERRQYLEKRYDLFVERFSRLLQKGIDQGEFQPVQPLATI
AKFFLNMNDGIIQNALYFDEEKADVSGLAESAKLYLKTVLQADEK
>Q9I4L4 ~~~yfiR~~~Negative regulator YfiR~~~
MPSLPTLQPLDLYRRTLACLVLAVSCLGGGGLWADDARTSIEQRSNAVSQVLLGIFSYVRWPKEPAVLQLCVVGPTEYAD
GLLRGMVQANGRRVHAERRAVDNPDLGTLCNVIYLGVVDERERQQVFRSLAGHPVLSISERGTECSVGSMFCLNVGGPRI
TFEANLDSIARSGVRVHPSVLKLARRQATP
>P0DSG1 ~~~yfiS~~~Protein YfiS~~~
MEVGKLGKPYPLLNLAYVGV
>O31562 3.-.-.-~~~yfiT~~~Putative metal-dependent hydrolase YfiT~~~COG2318
MTSVNLSYPIGEYKPRESISKEQKDKWIQVLEEVPAKLKQAVEVMTDSQLDTPYRDGGWTVRQVVHHLADSHMNSYIRFK
LSLTEETPAIRPYDEKAWSELKDSKTADPSGSLALLQELHGRWTALLRTLTDQQFKRGFYHPDTKEIITLENALGLYVWH
SHHHIAHITELSRRMGWS
>O31567 ~~~yfiY~~~Probable siderophore-binding lipoprotein YfiY~~~COG0614
MKKHISMLFVFLMAVMVLSACNSSESSSNSEVSSSKTRTVKHAMGTSDNIPANPKRIVVLTNEGTEALLALGIKPVGAVK
SWKGDPWYDYLKDDMKGVKNVGLETEPNVEAIAELKPDLIIGNKVRQEKIYDQLNAIAPTVFAESLAGNWKDNLTLYANA
VNKADKGKEVIADFDKRVSDLKNKLGDQTNKTVSVVRFLSGESRIYYTDSFPGIILDQLGFKRPEKQVELFKKQKDQFTF
STDSKESIPDMDADVLFYFTYKADNAKENEKWANQWTSSSLWKNLKAVKSGNAHEVDDVVWTTAGGIKAANYLLDDIETY
FLKTK
>O31568 ~~~yfiZ~~~Probable siderophore transport system permease protein YfiZ~~~COG0609
MICKKASSKWIVLVCLIFILLTAVCASVVYGYTGTSWRQVYQAFTSFNGTNEHVIIKDVRLPRALVATVVGASLAAAGAL
MQALTKNPLASPGIFGINAGAGFFIVAGSFFLHIQSPQALVWSSFLGAAFTAAIVYAAGSLGREGLTPIKLTLAGAAMAA
MFSSLTQGLLSVNELELAQVLFWLTGSVQGRSLDLLMTMFPYAAAALVICFFLGQKINLLVMGEDVAKGLGQKTGLLKFV
MALCVVMLAGSAVAIAGPISFIGIIIPHFARFVVGNDYRWVLPFSAVLGAILLVCADIGARYIIMPQEVPVGVMTAIIGM
PVFVYIARRGAKL
>P37908 ~~~yfjD~~~UPF0053 inner membrane protein YfjD~~~COG4536
MEHISTTTLIIILIIMVVISAYFSGSETGMMTLNRYRLRHMAKQGNRSAKRVEKLLRKPDRLISLVLIGNNLVNILASAL
GTIVGMRLYGDAGVAIATGVLTFVVLVFAEVLPKTIAALYPEKVAYPSSFLLAPLQILMMPLVWLLNAITRMLMRMMGIK
TDIVVSGSLSKEELRTIVHESRSQISRRNQDMLLSVLDLEKMTVDDIMVPRSEIIGIDINDDWKSILRQLSHSPHGRIVL
YRDSLDDAISMLRVREAWRLMSEKKEFTKETMLRAADEIYFVPEGTPLSTQLVKFQRNKKKVGLVVNEYGDIQGLVTVED
ILEEIVGDFTTSMSPTLAEEVTPQNDGSVIIDGTANVREINKAFNWHLPEDDARTVNGVILEALEEIPVAGTRVRIGEYD
IDILDVQDNMIKQVKVFPVKPLRESVAE
>O52982 ~~~yfjS~~~Lipoprotein YfjS~~~
MKRKTLPLLALVATSLFLSACDDRSDDLKAISKFKDLTPPRFSDVVSRQDDVSEEWSQVGFSSGLTLQVLRTRESPDGCE
GGSYYYLVDMEEKTVQPLMNALCIADNIKLEYHEVTDPYTKEKYFEYSHDGKLMGRLLIPSNPDNRE
>P52141 ~~~yfjZ~~~Antitoxin YfjZ~~~
MSNTTWGLQRDITPRLGARLVQEGNQLHYLADRASITGKFSDAECPKLDVVFPHFISQIESMLTTGELNPRHAQCVTLYH
NGFTCEADTLGSCGYVYIAVYPTQR
>O35043 ~~~yfkC~~~Uncharacterized MscS family protein YfkC~~~COG0668
MRMKETLTEIFQNKIVDILLVAVILWIGVFIINRLVQLFFKRTDFIEEKKEKTIESLVRSVTQYTATIGFIFYVISLFVH
DFGKILAGAGVAGIVIGFGAQSLIKDVLAGVFLIYERQLHKGDYVTVNNLFNGTVEEIGLRSLQIREWSGKLLTISNGEV
RQIENYNIDFMRITESFLISFKEDPDRVYSVLEEACDMLNEELRDSLKRDEFGNPTEPFQIHGITALNKINRGVEFTVKG
MVKDDDYFSASLAVRRVLVRQLYQNNVQMLEEAVRIERTQ
>O34579 ~~~yfkD~~~Uncharacterized protein YfkD~~~
MMKKLFHSTLIVLLFFSFFGVQPIHAKKQFKVPNSVASISKENTYPNASQDQPMLQPSKLAKELLDHSEVKIENPHLIKM
LNESNISGTPLAVGYRATIFLGKWALGYESNETVANWEYKKINTNRADNRGGKETAEMHYAQEQQYRVKGGLTAKVPNAE
DVKSMMMQKAMKKTNLPLAFETVIGAGTKRDQIYKVAPKKIGYLHAYAPAVNEKGKVTYGEVYLVLKGNKRKLVVKNVTS
QGIGAWIPVQDHVTFGFQLSSLPR
>O35016 3.1.3.48~~~yfkJ~~~Low molecular weight protein-tyrosine-phosphatase YfkJ~~~COG0394
MISVLFVCLGNICRSPMAEAIFRDLAAKKGLEGKIKADSAGIGGWHIGNPPHEGTQEILRREGISFDGMLARQVSEQDLD
DFDYIIAMDAENIGSLRSMAGFKNTSHIKRLLDYVEDSDLADVPDPYYTGNFEEVCQLIKTGCEQLLASIQKEKQL
>O35019 ~~~yfkK~~~UPF0435 protein YfkK~~~COG4840
MSSPNTETLTQMIEEISQKLNMLNVGVIKAEDFSDEKIEDLTYLHRMVMKKESFSPSEMQAIAQELASLRK
>O34475 1.-.-.-~~~yfkO~~~Putative NAD(P)H nitroreductase YfkO~~~COG0778
MADLKTQILDAYNFRHATKEFDPNKKVSDSDFEFILETGRLSPSSLGLEPWKFVVVQNPEFREKLREYTWGAQKQLPTAS
HFVLILARTAKDIKYNADYIKRHLKEVKQMPQDVYEGYLSKTEEFQKNDLHLLESDRTLFDWASKQTYIALGNMMTAAAQ
IGVDSCPIEGFQYDHIHRILEEEGLLENGSFDISVMVAFGYRVRDPRPKTRSAVEDVVKWV
>O34306 ~~~yflH~~~Uncharacterized protein YflH~~~
MNRDQEKIQIENEMNAMHGTIKEDILKDFEEFKGYLKKQVNRGKKLGLDDGKLVKSAAILGDYLAKHEEPQNGEEMLLQE
LWSVADEDEKEHLAQLLVKLVDKQ
>O34409 3.-.-.-~~~yflN~~~Probable metallo-hydrolase YflN~~~COG0491
MSDPYMPLTSVRSGAGFEAAKGVHGLTVQIANVYFIQLPSEPHSFVLIDAGMPQSAGVIVNEAKQRFGEGFQLKAIILTH
GHFDHIGAIEEILEHWDVPVYIHSREMPYVTGKEDYPPARPDSKSGLVAKLSPLFPRHSIDISSHVQALPEDGSVPFLDE
WMWIATPGHTPGHISLFRDDGRVLVAGDAVITVEQEKMADVLIQKQELHGPPAYFTPDTETAAESILKLAGLEPEALLTG
HGIPMTGKNFRSDLTELANRLSSI
>O34726 ~~~yflS~~~Putative malate transporter YflS~~~COG0471
MASEKDAGKQSAVKLVPLLITVAVGLIIWFIPAPSGLEPKAWHLFAIFVATIIGFISKPLPMGAIAIFALAVTALTGTLS
IEDTLSGFGNKTIWLIVIAFFISRGFIKTGLGARISYVFVQKFGKKTLGLSYSLLFSDLILSPAIPSNTARAGGIIFPII
RSLSETFGSSPANGTERKIGAFLLKTGFQGNLITSAMFLTAMAANPLIAKLAHDVAGVDLTWTSWAIAAIVPGLVSLIIT
PLVIYKLYPPEIKETPDAAKIATEKLKEMGPFKKSELSMVIVFLLVLVLWIFGGSFNIDATTTALIGLAVLLLSQVLTWD
DIKKEQGAWDTLTWFAALVMLANFLNELGMVSWFSNAMKSSVSGFSWIVAFIILIVVYYYSHYFFASATAHISAMYSAFL
AVVVAAGAPPLLAALSLAFISNLFGSTTHYGSGAAPVFFGAGYIPQGKWWSIGFILSIVHIIVWLVIGGLWWKVLGIW
>O34626 ~~~yfmB~~~Uncharacterized protein YfmB~~~
MQYFSPEQQYNAWIVSDLVKQIFHKRAGCSPGIHELAVFAEEHFHIDIDFVFSIIMNIGDIEFALTDEIEKKLSGYLSTL
LPYVTADMFETSKANAHAFLSRRHGNAAYHLFVSDDAFMRKQ
>O34348 ~~~yfmC~~~Fe(3+)-citrate-binding protein YfmC~~~COG4594
MRTYSNKLIAIMSVLLLACLIVSGCSSSQNNNGSGKSESKDSRVIHDEEGKTTVSGTPKRVVVLELSFLDAVHNLGITPV
GIADDNKKDMIKKLVGSSIDYTSVGTRSEPNLEVISSLKPDLIIADAERHKNIYKQLKKIAPTIELKSREATYDETIDSF
TTIAKALNKEDEGKEKLAEHKKVINDLKAELPKDENRNIVLGVARADSFQLHTSSSYDGEIFKMLGFTHAVKSDNAYQEV
SLEQLSKIDPDILFISANEGKTIVDEWKTNPLWKNLKAVKNGQVYDADRDTWTRFRGIKSSETSAKDVLKKVYNK
>O34812 1.-.-.-~~~yfmJ~~~Putative NADP-dependent oxidoreductase YfmJ~~~COG2130
MTASQQQIQLARRPQGIPVHEDFRFETIPVPEPKQGEVLVKTLYVSVDPYMRGRMQDTKSYVEPFALDKALSGGVIAEVV
SDGNHLKKGDIVIGNLSWQEFSAVSESALRKIDTSLAPASAYLGILGMTGLTAYFGLLDIGRPKEGETVVVSGAAGAVGS
TVGQIAKIKGARVVGIAGSDEKIDYLKQELQFDEAINYKTADDIQKALQNACPDGVDVYFDNVGGPISDAVMNLLNEFAR
IPVCGAISSYNAESEADDMGPRVQSKLIKTKSLMQGFIVSDYSDRFSEGAKQLAEWLKAGKLHYEETITEGFENIPDAFL
GLFKGENKGKQLIKVSDPS
>O34750 3.6.4.13~~~yfmL~~~Probable ATP-dependent RNA helicase YfmL~~~COG0513
MTQTWPFLHNAQSFIQENWNASGFQKPTPVQEQAAQLIMDGKDVIAESPTGTGKTLAYALPVLERIKPEQKHPQAVILAP
SRELVMQIFQVIQDWKAGSELRAASLIGGANVKKQVEKLKKHPHIIVGTPGRVFELIKAKKLKMHEVKTIVLDETDQLVL
PEHRETMKQIIKTTLRDRQLLCFSATLKKETEDVLRELAQEPEVLKVQRSKAEAGKVKHQYLICDQRDKVKLLQKLSRLE
GMQALVFVRDIGNLSVYAEKLAYHHVELGVLHSEAKKMERAKIIATFEDGEFPLLLATDIAARGLDIENLPYVIHADIPD
EDGYVHRSGRTGRAGKEGNVLSLVTKLEESKLKKMAKKLGVELSEAVYAGGKLKTK
>O06473 ~~~yfmO~~~Multidrug efflux protein YfmO~~~COG2814
MDKTTQVNQKTGLLSQPKAVWAVAFACVISFMGIGLVDPILPAIAAQLHASPSEVSLLFTSYLLVTGFMMFFSGAISSRI
GAKWTLILGLIFIIVFAGLGGSSSSIAQLVGYRGGWGLGNALFISTALAVIVGVSVGGSAQAIILYEAALGLGISVGPLA
GGELGSISWRAPFFGVSVLMFIALIAISFMLPKLPKPAKRVGVFDAMKALKYKGLLTMAVSAFLYNFGFFILLAYSPFVL
DLDEHGLGYVFFGWGLLLAITSVFTAPLVHKALGTVRSLVVLFIAFAAILVIMGIWTDHQTLIITCIVVAGAVLGMVNTI
MTTAVMGSAPVERSIASSAYSSVRFIGGALAPWIAGMLSEHFTASTPYTVGGIVVFVGMLVLLMGRKHLAGIKAGH
>O06474 ~~~yfmP~~~HTH-type transcriptional regulator YfmP~~~COG0789
MEWMKIDQVAKRSGLTKRTIRFYEEIGLIPAPKRTDGGVRLYSEDDMEELEKVISTKEVLGFSLQELQHFMETSRQLELN
KEGYLLSLDPKERKEKLEEIQETLNHQLDLIDEKIRTFQSFKERLQGMKGKAERAIQSIE
>O06477 ~~~yfmS~~~Putative sensory transducer protein YfmS~~~COG0840
MELTINTEKETADILDAFIKVAPYLNSLVQDDITIGIYDTEKLLVNIPAKTFSLNVKAGDPLQEGDIITDAIRSNQKKTS
MVPKELFGFPLIARAIPLHDENGRVIGGVGLGTSLEESSKLHDVAESLSAVVEQTAAAISDISESINGFSTQMSGISSQA
KKVSESAGEIADISVTVKGISDQSNLLGLNAAIEAARAGESGKGFSVVADEIRKLATHSKENVGQIDQITKKIHSLLKGL
EESIESINQHTDGQAAAVEQISATMQEISGSAQHLAKMAEKALEEE
>O06480 3.-.-.-~~~yfnB~~~Putative HAD-hydrolase YfnB~~~COG1011
MKRYRTLLFDVDDTILDFQAAEALALRLLFEDQNIPLTNDMKAQYKTINQGLWRAFEEGKMTRDEVVNTRFSALLKEYGY
EADGALLEQKYRRFLEEGHQLIDGAFDLISNLQQQFDLYIVTNGVSHTQYKRLRDSGLFPFFKDIFVSEDTGFQKPMKEY
FNYVFERIPQFSAEHTLIIGDSLTADIKGGQLAGLDTCWMNPDMKPNVPEIIPTYEIRKLEELYHILNIENTVSC
>P0ADQ7 ~~~ygaM~~~Uncharacterized protein YgaM~~~COG4575
MFNRPNRNDVDDGVQDIQNDVNQLADSLESVLKSWGSDAKGEAEAARSKAQALLKETRARMHGRTRVQQAARDAVGCADS
FVRERPWCSVGTAAAVGIFIGALLSMRKS
>P55734 ~~~ygaP~~~Inner membrane protein YgaP~~~COG0607
MALTTISPHDAQELIARGAKLIDIRDADEYLREHIPEADLAPLSVLEQSGLPAKLRHEQIIFHCQAGKRTSNNADKLAAI
AAPAEIFLLEDGIDGWKKAGLPVAVNKSQPLPLMRQVQIAAGGLILIGVVLGYTVNSGFFLLSGFVGAGLLFAGISGFCG
MARLLDKMPWNQRA
>P77295 ~~~ygaV~~~Probable HTH-type transcriptional regulator YgaV~~~COG0640
MTELAQLQASAEQAAALLKAMSHPKRLLILCMLSGSPGTSAGELTRITGLSASATSQHLARMRDEGLIDSQRDAQRILYS
IKNEAVNAIIATLKNVYCP
>P76630 ~~~ygaZ~~~Inner membrane protein YgaZ~~~COG1296
MESPTPQPAPGSATFMEGCKDSLPIVISYIPVAFAFGLNATRLGFSPLESVFFSCIIYAGASQFVITAMLAAGSSLWIAA
LTVMAMDVRHVLYGPSLRSRIIQRLQKSKTALWAFGLTDEVFAAATAKLVRNNRRWSENWMIGIAFSSWSSWVFGTVIGA
FSGSGLLQGYPAVEAALGFMLPALFMSFLLASFQRKQSLCVTAALVGALAGVTLFSIPVAILAGIVCGCLTALIQAFWQG
APDEL
>P46141 ~~~ygbE~~~Inner membrane protein YgbE~~~
MRNSHNITLTNNDSLTEDEETTWSLPGAVVGFISWLFALAMPMLIYGSNTLFFFIYTWPFFLALMPVAVVVGIALHSLMD
GKLRYSIVFTLVTVGIMFGALFMWLLG
>Q46892 ~~~ygbN~~~Inner membrane permease YgbN~~~COG2610
MSTITLLCIALAGVIMLLLLVIKAKVQPFVALLLVSLLVALAAGIPAGEVGKVMIAGMGGVLGSVTIIIGLGAMLGRMIE
HSGGAESLANYFSRKLGDKRTIAALTLAAFFLGIPVFFDVGFIILAPIIYGFAKVAKISPLKFGLPVAGIMLTVHVAVPP
HPGPVAAAGLLHADIGWLTIIGIAISIPVGVVGYFAAKIINKRQYAMSVEVLEQMQLAPASEEGATKLSDKINPPGVALV
TSLIVIPIAIIMAGTVSATLMPPSHPLLGTLQLIGSPMVALMIALVLAFWLLALRRGWSLQHTSDIMGSALPTAAVVILV
TGAGGVFGKVLVESGVGKALANMLQMIDLPLLPAAFIISLALRASQGSATVAILTTGGLLSEAVMGLNPIQCVLVTLAAC
FGGLGASHINDSGFWIVTKYLGLSVADGLKTWTVLTTILGFTGFLITWCVWAVI
>Q46906 ~~~ygcP~~~Uncharacterized protein YgcP~~~COG1954
MPLLHLLRQNPVIAAVKDNASLQLAIDSECQFISVLYGNICTISNIVKKIKNAGKYAFIHVDLLEGASNKEVVIQFLKLV
TEADGIISTKASMLKAARAEGFFCIHRLFIVDSISFHNIDKQVAQSNPDCIEILPGCMPKVLGWVTEKIRQPLIAGGLVC
DEEDARNAINAGVVALSTTNTGVWTLAKKLL
>Q46909 ~~~ygcS~~~Inner membrane metabolite transport protein YgcS~~~COG0477
MNTSPVRMDDLPLNRFHCRIAALTFGAHLTDGYVLGVIGYAIIQLTPAMQLTPFMAGMIGGSALLGLFLGSLVLGWISDH
IGRQKIFTFSFLLITLASFLQFFATTPEHLIGLRILIGIGLGGDYSVGHTLLAEFSPRRHRGILLGAFSVVWTVGYVLAS
IAGHHFISENPEAWRWLLASAALPALLITLLRWGTPESPRWLLRQGRFAEAHAIVHRYFGPHVLLGDEVVTATHKHIKTL
FSSRYWRRTAFNSVFFVCLVIPWFVIYTWLPTIAQTIGLEDALTASLMLNALLIVGALLGLVLTHLLAHRKFLLGSFLLL
AATLVVMACLPSGSSLTLLLFVLFSTTISAVSNLVGILPAESFPTDIRSLGVGFATAMSRLGAAVSTGLLPWVLAQWGMQ
VTLLLLATVLLVGFVVTWLWAPETKALPLVAAGNVGGANEHSVSV
>P0ADR2 ~~~ygdD~~~UPF0382 inner membrane protein YgdD~~~COG2363
MTSRFMLIFAAISGFIFVALGAFGAHVLSKTMGAVEMGWIQTGLEYQAFHTLAILGLAVAMQRRISIWFYWSSVFLALGT
VLFSGSLYCLALSHLRLWAFVTPVGGVSFLAGWALMLVGAIRLKRKGVSHE
>P67127 ~~~ygdQ~~~UPF0053 inner membrane protein YgdQ~~~COG0861
MLFAWITDPNAWLALGTLTLLEIVLGIDNIIFLSLVVAKLPTAQRAHARRLGLAGAMVMRLALLASIAWVTRLTNPLFTI
FSQEISARDLILLLGGLFLIWKASKEIHESIEGEEEGLKTRVSSFLGAIVQIMLLDIIFSLDSVITAVGLSDHLFIMMAA
VVIAVGVMMFAARSIGDFVERHPSVKMLALSFLILVGFTLILESFDIHVPKGYIYFAMFFSIAVESLNLIRNKKNPL
>P65294 ~~~ygdR~~~Uncharacterized lipoprotein YgdR~~~
MKKWAVIISAVGLAFAVSGCSSDYVMATKDGRMILTDGKPEIDDDTGLVSYHDQQGNAMQINRDDVSQIIER
>A0A0H3JGH6 5.1.1.13~~~ygeA~~~L-aspartate/glutamate-specific racemase~~~COG1794
MKTIGLLGGMSWESTIPYYRLINEGIKQRLGGLHSAQVLLHSVDFHEIEECQRRGEWDKTGDILAEAALGLQRAGAEGIV
LCTNTMHKVADAIESRCTLPFLHIADATGRAITGAGMTRVALLGTRYTMEQDFYRGRLTEQFSINCLIPEADERAKINQI
IFEELCLGQFTEASRAYYAQVIARLAEQGAQGVIFGCTEIGLLVPEERSVLPVFDTAAIHAEDAVAFMLS
>A0A140N890 5.1.1.13~~~ygeA~~~L-aspartate/glutamate-specific racemase~~~COG1794
MKTIGLLGGMSWESTIPYYRLINEGIKQRLGGLHSAQVLLHSVDFHEIEECQRRGEWDKTGDILAEAALGLQRAGAEGIV
LCTNTMHKVADAIESRCTLPFLHIADATGRAITGAGMTRVALLGTRYTMEQDFYRGRLTEQFSINCLIPEADERAKINQI
IFEELCLGQFTEASRAYYAQVIARLAEQGAQGVIFGCTEIGLLVPEERSVLPVFDTAAIHAEDAVAFMLS
>P03813 5.1.1.10~~~ygeA~~~Broad specificity amino-acid racemase YgeA~~~COG1794
MKTIGLLGGMSWESTIPYYRLINEGIKQRLGGLHSAQVLLHSVDFHEIEECQRRGEWDKTGDILAEAALGLQRAGAEGIV
LCTNTMHKVADAIESRCTLPFLHIADATGRAITGAGMTRVALLGTRYTMEQDFYRGRLTEQFSINCLIPEADERAKINQI
IFEELCLGQFTEASRAYCAQVIARLAEQGAQGVIFGCTEIGLLVPEERSVLPVFDTAAIHAEDAVAFMLS
>Q46791 ~~~ygeK~~~Uncharacterized response regulatory protein YgeK~~~COG2197
MGAELVKWVKSHKIDAHIITFVAKMPYIDSIKLLEAGAKGCVWKTSHPAKLNRAIDSISNGYTYFDSVHMDCEKISSRYS
SDNQLTNRESEILQLIADGKTNKEIANFLQLSRKTVETHRLNIMKKLDVHSGIELIKTALRMGVCTI
>Q46803 2.1.3.-~~~ygeW~~~Putative carbamoyltransferase YgeW~~~COG0078
MMKTVNELIKDINSLTSHLHEKDFLLTWEQTPDELKQVLDVAAALKALRAENISTKVFNSGLGISVFRDNSTRTRFSYAS
ALNLLGLAQQDLDEGKSQIAHGETVRETANMISFCADAIGIRDDMYLGAGNAYMREVGAALDDGYKQGVLPQRPALVNLQ
CDIDHPTQSMADLAWLREHFGSLENLKGKKIAMTWAYSPSYGKPLSVPQGIIGLMTRFGMDVTLAHPEGYDLIPDVVEVA
KNNAKASGGSFRQVTSMEEAFKDADIVYPKSWAPYKVMEERTELLRANDHEGLKALEKQCLAQNAQHKDWHCTEEMMELT
RDGEALYMHCLPADISGVSCKEGEVTEGVFEKYRIATYKEASWKPYIIAAMILSRKYAKPGALLEQLLKEAQERVK
>P52037 1.-.-.-~~~ygfF~~~Uncharacterized oxidoreductase YgfF~~~COG1028
MAIALVTGGSRGIGRATALLLAQEGYTVAVNYQQNLHAAQEVMNLITQAGGKAFVLQADISDENQVVAMFTAIDQHDEPL
AALVNNAGILFTQCTVENLTAERINRVLSTNVTGYFLCCREAVKRMALKNGGSGGAIVNVSSVASRLGSPGEYVDYAASK
GAIDTLTTGLSLEVAAQGIRVNCVRPGFIYTEMHASGGEPGRVDRVKSNIPMQRGGQAEEVAQAIVWLLSDKASYVTGSF
IDLAGGK
>Q46811 ~~~ygfK~~~Putative oxidoreductase YgfK~~~COG0493
MGDIMRPIPFEELLTRIFDEYQQQRSIFGIPEQQFYSPVKGKTVSVFGETCATPVGPAAGPHTQLAQNIVTSWLTGGRFI
ELKTVQILDRLELEKPCIDAEDECFNTEWSTEFTLLKAWDEYLKAWFALHLLEAMFQPSDSGKSFIFNMSVGYNLEGIKQ
PPMQQFIDNMMDASDHPKFAQYRDTLNKLLQDDAFLARHGLQEKRESLQALPARIPTSMVHGVTLSTMHGCPPHEIEAIC
RYMLEEKGLNTFVKLNPTLLGYARVREILDVCGFGYIGLKEESFDHDLKLTQALEMLERLMALAKEKSLGFGVKLTNTLG
TINNKGALPGEEMYMSGRALFPLSINVAAVLSRAFDGKLPISYSGGASQLTIRDIFDTGIRPITMATDLLKPGGYLRLSA
CMRELEGSDAWGLDHVDVERLNRLAADALTMEYTQKHWKPEERIEVAEDLPLTDCYVAPCVTACAIKQDIPEYIRLLGEH
RYADALELIYQRNALPAITGHICDHQCQYNCTRLDYDSALNIRELKKVALEKGWDEYKQRWHKPAGSGSRHPVAVIGAGP
AGLAAGYFLARAGHPVTLFEREANAGGVVKNIIPQFRIPAELIQHDIDFVAAHGVKFEYGCSPDLTIEQLKNQGFHYVLI
ATGTDKNSGVKLAGDNQNVWKSLPFLREYNKGTALKLGKHVVVVGAGNTAMDCARAALRVPGVEKATIVYRRSLQEMPAW
REEYEEALHDGVEFRFLNNPERFDADGTLTLRVMSLGEPDEKGRRRPVETNETVTLLVDSLITAIGEQQDTEALNAMGVP
LDKNGWPDVDHNGETRLTDVFMIGDVQRGPSSIVAAVGTARRATDAILSRENIRSHQNDKYWNNVNPAEIYQRKGDISIT
LVNSDDRDAFVAQEAARCLECNYVCSKCVDVCPNRANVSIAVPGFQNRFQTLHLDAYCNECGNCAQFCPWNGKPYKDKIT
VFSLAQDFDNSSNPGFLVEDCRVRVRLNNQSWVLNIDSKGQFNNVPPELNDMCRIISHVHQHHHYLLGRVEV
>Q46824 ~~~ygfX~~~Inner membrane protein YgfX~~~
MVLWQSDLRVSWRAQWLSLLIHGLVAAVILLMPWPLSYTPLWMVLLSLVVFDCVRSQRRINARQGEIRLLMDGRLRWQGQ
EWSIVKAPWMIKSGMMLRLRSDGGKRQHLWLAADSMDEAEWRDLRRILLQQETQR
>G4V4G3 ~~~ygfX~~~Inner membrane protein YgfX~~~
MVQWQCNLRVSWRMQLFSLLAHGLLVLLILLAPWPDGYMSVWLGLVTLVMFGFIRSQRNIKSRHGEIVLYNENHLRWQHR
EWQITKRPWVLKNGILLSLRTTEGKGPRRRSLWLASDSMRNEEWRHLCHLLLQHKNWQSDRLM
>P0ADE8 ~~~ygfZ~~~tRNA-modifying protein YgfZ~~~COG0354
MAFTPFPPRQPTASARLPLTLMTLDDWALATITGADSEKYMQGQVTADVSQMAEDQHLLAAHCDAKGKMWSNLRLFRDGD
GFAWIERRSVREPQLTELKKYAVFSKVTIAPDDERVLLGVAGFQARAALANLFSELPSKEKQVVKEGATTLLWFEHPAER
FLIVTDEATANMLTDKLRGEAELNNSQQWLALNIEAGFPVIDAANSGQFIPQATNLQALGGISFKKGCYTGQEMVARAKF
RGANKRALWLLAGSASRLPEAGEDLELKMGENWRRTGTVLAAVKLEDGQVVVQVVMNNDMEPDSIFRVRDDANTLHIEPL
PYSLEE
>P52052 ~~~yggR~~~Uncharacterized protein YggR~~~COG2805
MNMEEIVALSVKHNVSDLHLCSAWPARWRIRGRMEAAPFDTPDVEELLREWLDDDQRAILLENGQLDFAVSLAENQRLRG
SAFAQRHGISLALRLLPSHCPQLEQLGAPTVLPELLKSENGLILVTGATGSGKSTTLAAMVGYLNQHADAHILTLEDPVE
YLYASQRCLIQQREIGLHCMTFASGLRAALREDPDVILLGELRDSETIRLALTAAETGHLVLATLHTRGAAQAVERLVDS
FPAQEKDPVRNQLAGSLRAVLSQKLEVDKQEGRVALFELLINTPAVGNLIREGKTHQLPHVIQTGQQVGMITFQQSYQHR
VGEGRL
>Q8XCU6 ~~~yggU~~~UPF0235 protein YggU~~~COG1872
MSAVTVNDDGLVLRLYIQPKASRDSIVGLHGDEVKVAITAPPVDGQANSHLVKFLGKQFRVAKSQVVIEKGELGRHKQIK
IINPQQIPPEVAALIN
>P0AG84 1.-.-.-~~~yghA~~~Uncharacterized oxidoreductase YghA~~~COG1028
MSHLKDPTTQYYTGEYPKQKQPTPGIQAKMTPVPDCGEKTYVGSGRLKDRKALVTGGDSGIGRAAAIAYAREGADVAISY
LPVEEEDAQDVKKIIEECGRKAVLLPGDLSDEKFARSLVHEAHKALGGLDIMALVAGKQVAIPDIADLTSEQFQKTFAIN
VFALFWLTQEAIPLLPKGASIITTSSIQAYQPSPHLLDYAATKAAILNYSRGLAKQVAEKGIRVNIVAPGPIWTALQISG
GQTQDKIPQFGQQTPMKRAGQPAELAPVYVYLASQESSYVTAEVHGVCGGEHLG
>P0AA60 ~~~yghB~~~Inner membrane protein YghB~~~COG0586
MAVIQDIIAALWQHDFAALADPHIVSVVYFVMFATLFLENGLLPASFLPGDSLLILAGALIAQGVMDFLPTIAILTAAAS
LGCWLSYIQGRWLGNTKTVKGWLAQLPAKYHQRATCMFDRHGLLALLAGRFLAFVRTLLPTMAGISGLPNRRFQFFNWLS
GLLWVSVVTSFGYALSMIPFVKRHEDQVMTFLMILPIALLTAGLLGTLFVVIKKKYCNA
>Q46833 ~~~yghE~~~Putative type II secretion system L-type protein YghE~~~
MIHQQHMRNIAQWLQENGITRATVAPDWMSIPCGFMACDAQRVICRIDECRGWSAGLALAPVMFRAQLNEQDLPLSLTVV
GIAPEKLSAWAGADAERLTVTALPAITTYGEPEGNLLTGPWQPRVSYRKQWARWRVMILPILLILVALAVERGVTLWSVS
EQVAQSRTQAEEQFLTLFPEQKRIVNLRSQVTMALKKYRPQADDTRLLAELSAIASTLKSASLSDIEMRGFTFDQKRQIL
HLQLRAANFASFDKLRSVLATDYVVQQDALQKEGDAVSGGVTLRRK
>Q46835 ~~~yghG~~~Lipoprotein YghG~~~
MSIKQMPGRVLISLLLSVTGLLSGCASHNENASLLAKKQAQNISQNLPIKSAGYTLVLAQSSGTTVKMTIISEAGTQTTQ
TPDAFLTSYQRQMCADPTVKLMITEGINYSITINDTRTGNQYQRKLDRTTCGIVKA
>Q46840 ~~~yghO~~~Protein YghO~~~COG0456
MMNPVLWLQMTNMISYQGLVRTFLNKNDLKAFIAFPSSLYPDDPNWIPPLFIERNEHLSAKNPGTDHIIWQAWVAKKAGQ
IVGRITAQIDTLHRERYGKDTGHFGMIDAIDDPQVFAALFGAAEAWLKSQGASKISGPFSLNINQESGLLIEGFDTPPCA
MMPHGKPWYAAHIEQLGYHKGIDLLAWWMQRTDLTFSPALKKLMDQVRKKVTIRCINRQRFAEEMQILREIFNSGWQHNW
GFVPFTEHEFATMGDQLKYLVPDDMIYIAEIDSAPCAFIVGLPNINEAIADLNGSLFPFGWAKLLWRLKVSGVRTARVPL
MGVRDEYQFSRIGPVIALLLIEALRDPFARRKIDALEMSWILETNTGMNNMLERIGAEPYKRYRLYEKQI
>Q46841 ~~~yghQ~~~Inner membrane protein YghQ~~~COG2244
MAGFNIKHWFADGAFRTIIRNSAWLGSSNVVSALLGLLALSCAGKGMTPAMFGVLVIVQSYAKSISDFIKFQTWQLVVQY
GTPALTNNNPQQFRNVVSFSFSLDIVSGAVAIVGGIALLPFLSHSLGLDDQSFWLAALYCTLIPSMASSTPTGILRAVDR
FDLIAVQQATKPFLRAAGSVVAWYFDFGFAGFVIAWYVSNLVGGTMYWWFAARELRRRNIHNAFKLNLFESARYIKGAWS
FVWSTNIAHSIWSARNSCSTVLVGIVLGPAAAGLFKIAMTFFDAAGTPAGLLGKSFYPEVMRLDPRTTRPWLLGVKSGLL
AGGIGILVALAVLIVGKPLISLVFGVKYLEAYDLI
>Q46845 1.8.4.-~~~yghU~~~Disulfide-bond oxidoreductase YghU~~~COG0625
MTDNTYQPAKVWTWDKSAGGAFANINRPVSGPTHEKTLPVGKHPLQLYSLGTPNGQKVTIMLEELLALGVTGAEYDAWLI
RIGDGDQFSSGFVEVNPNSKIPALRDHTHNPPIRVFESGSILLYLAEKFGYFLPQDLAKRTETMNWLFWLQGAAPFLGGG
FGHFYHYAPVKIEYAINRFTMEAKRLLDVLDKQLAQHKFVAGDEYTIADMAIWPWFGNVVLGGVYDAAEFLDAGSYKHVQ
RWAKEVGERPAVKRGRIVNRTNGPLNEQLHERHDASDFETNTEDKRQG
>P0ADT5 6.3.1.-~~~ygiC~~~Putative acid--amine ligase YgiC~~~COG0754
MERVSITERPDWREKAHEYGFNFHTMYGEPYWCEDAYYKLTLAQVEKLEEVTAELHQMCLKVVEKVIASDELMTKFRIPK
HTWSFVRQSWLTHQPSLYSRLDLAWDGTGEPKLLENNADTPTSLYEAAFFQWIWLEDQLNAGNLPEGSDQFNSLQEKLID
RFVELREQYGFQLLHLTCCRDTVEDRGTIQYLQDCATEAEIATEFLYIDDIGLGEKGQFTDLQDQVISNLFKLYPWEFML
REMFSTKLEDAGVRWLEPAWKSIISNKALLPLLWEMFPNHPNLLPAYFAEDDHPQMEKYVVKPIFSREGANVSIIENGKT
IEAAEGPYGEEGMIVQQFHPLPKFGDSYMLIGSWLVNDQPAGIGIREDRALITQDMSRFYPHIFVE
>P24197 1.13.11.29~~~ygiD~~~4,5-DOPA dioxygenase extradiol~~~COG3384
MTPLVKDIIMSSTRMPALFLGHGSPMNVLEDNLYTRSWQKLGMTLPRPQAIVVVSAHWFTRGTGVTAMETPPTIHDFGGF
PQALYDTHYPAPGSPALAQRLVELLAPIPVTLDKEAWGFDHGSWGVLIKMYPDADIPMVQLSIDSSKPAAWHFEMGRKLA
ALRDEGIMLVASGNVVHNLRTVKWHGDSSPYPWATSFNEYVKANLTWQGPVEQHPLVNYLDHEGGTLSNPTPEHYLPLLY
VLGAWDGQEPITIPVEGIEMGSLSMLSVQIG
>P0ADU2 1.-.-.-~~~ygiN~~~Probable quinol monooxygenase YgiN~~~COG1359
MLTVIAEIRTRPGQHHRQAVLDQFAKIVPTVLKEEGCHGYAPMVDCAAGVSFQSMAPDSIVMIEQWESIAHLEAHLQTPH
MKAYSEAVKGDVLEMNIRILQPGI
>Q46863 ~~~ygiS~~~Probable deoxycholate-binding periplasmic protein YgiS~~~COG4166
MYTRNLLWLVSLVSAAPLYAADVPANTPLAPQQVFRYNNHSDPGTLDPQKVEENTAAQIVLDLFEGLVWMDGEGQVQPAQ
AERWEILDGGKRYIFHLRSGLQWSDGQPLTAEDFVLGWQRAVDPKTASPFAGYLAQAHINNAAAIVAGKADVTSLGVKAT
DDRTLEVTLEQPVPWFTTMLAWPTLFPVPHHVIAKHGDSWSKPENMVYNGAFVLDQWVVNEKITARKNPKYRDAQHTVLQ
QVEYLALDNSVTGYNRYRAGEVDLTWVPAQQIPAIEKSLPGELRIIPRLNSEYYNFNLEKPPFNDVRVRRALYLTVDRQL
IAQKVLGLRTPATTLTPPEVKGFSATTFDELQKPMSERVAMAKALLKQAGYDASHPLRFELFYNKYDLHEKTAIALSSEW
KKWLGAQVTLRTMEWKTYLDARRAGDFMLSRQSWDATYNDASSFLNTLKSDSEENVGHWKNAQYDALLNQATQITDATKR
NALYQQAEVIINQQAPLIPIYYQPLIKLLKPYVGGFPLHNPQDYVYSKELYIKAH
>P0ADU5 ~~~ygiW~~~Protein YgiW~~~COG3111
MKKFAAVIAVMALCSAPVMAAEQGGFSGPSATQSQAGGFQGPNGSVTTVESAKSLRDDTWVTLRGNIVERISDDLYVFKD
ASGTINVDIDHKRWNGVTVTPKDTVEIQGEVDKDWNSVEIDVKQIRKVNP
>Q46867 ~~~ygiZ~~~Inner membrane protein YgiZ~~~
MLKQKIKTIFEALLYIMLTYWLIDSFFAFNKYDWMLESGGNICSIPSVSGEDRILQAMIAAFFLLTPLIILILRKLFMRE
MFEFWVYVFSLGICLVCGWWLFWGRFIFCY
>P42589 ~~~ygjH~~~tRNA-binding protein YgjH~~~COG0073
METVAYADFARLEMRVGKIVEVKRHENADKLYIVQVDVGQKTLQTVTSLVPYYSEEELMGKTVVVLCNLQKAKMRGETSE
CMLLCAETDDGSESVLLTPERMMPAGVRVV
>P42590 ~~~ygjI~~~Inner membrane transporter YgjI~~~COG0531
MSDTKRNTIGKFGLLSLTFAAVYSFNNVINNNIELGLASAPMFFLATIFYFIPFCLIIAEFVSLNKNSEAGVYAWVKSSL
GGRWAFITAYTYWFVNLFFFTSLLPRVIAYASYAFLGYEYIMTPVATTIISMVLFAFSTWVSTNGAKMLGPITSVTSTLM
LLLTLSYILLAGTALVGGVQPADAITVDAMIPNFNWAFLGVTTWIFMAAGGAESVAVYVNDVKGGSKSFVKVIILAGIFI
GVLYSVSSVLINVFVSSKELKFTGGSVQVFHGMAAYFGLPEALMNRFVGLVSFTAMFGSLLMWTATPVKIFFSEIPEGIF
GKKTVELNENGVPARAAWIQFLIVIPLMIIPMLGSNTVQDLMNTIINMTAAASMLPPLFIMLAYLNLRAKLDHLPRDFRM
GSRRTGIIVVSMLIAIFAVGFVASTFPTGANILTIIFYNVGGIVIFLGFAWWKYSKYIKGLTAEERHIEATPASNVD
>P42592 3.2.1.-~~~ygjK~~~Glucosidase YgjK~~~COG1626
MKIKTILTPVTCALLISFSAHAANADNYKNVINRTGAPQYMKDYDYDDHQRFNPFFDLGAWHGHLLPDGPNTMGGFPGVA
LLTEEYINFMASNFDRLTVWQDGKKVDFTLEAYSIPGALVQKLTAKDVQVEMTLRFATPRTSLLETKITSNKPLDLVWDG
ELLEKLEAKEGKPLSDKTIAGEYPDYQRKISATRDGLKVTFGKVRATWDLLTSGESEYQVHKSLPVQTEINGNRFTSKAH
INGSTTLYTTYSHLLTAQEVSKEQMQIRDILARPAFYLTASQQRWEEYLKKGLTNPDATPEQTRVAVKAIETLNGNWRSP
GGAVKFNTVTPSVTGRWFSGNQTWPWDTWKQAFAMAHFNPDIAKENIRAVFSWQIQPGDSVRPQDVGFVPDLIAWNLSPE
RGGDGGNWNERNTKPSLAAWSVMEVYNVTQDKTWVAEMYPKLVAYHDWWLRNRDHNGNGVPEYGATRDKAHNTESGEMLF
TVKKGDKEETQSGLNNYARVVEKGQYDSLEIPAQVAASWESGRDDAAVFGFIDKEQLDKYVANGGKRSDWTVKFAENRSQ
DGTLLGYSLLQESVDQASYMYSDNHYLAEMATILGKPEEAKRYRQLAQQLADYINTCMFDPTTQFYYDVRIEDKPLANGC
AGKPIVERGKGPEGWSPLFNGAATQANADAVVKVMLDPKEFNTFVPLGTAALTNPAFGADIYWRGRVWVDQFWFGLKGME
RYGYRDDALKLADTFFRHAKGLTADGPIQENYNPLTGAQQGAPNFSWSAAHLYMLYNDFFRKQ
>P42603 ~~~ygjV~~~Inner membrane protein YgjV~~~
MTAYWLAQGVGVIAFLIGITTFFNRDERRFKKQLSVYSAVIGVHFFLLGTYPAGASAILNAIRTLITLRTRSLWVMAIFI
VLTGGIGLAKFHHPVELLPVIGTIVSTWALFCCKGLTMRCVMWFSTCCWVIHNFWAGSIGGTMIEGSFLLMNGLNIIRFW
RMQKRGIDPFKVEKTPSAVDERG
>P64590 ~~~yhaH~~~Inner membrane protein YhaH~~~COG3152
MDWYLKVLKNYVGFRGRARRKEYWMFILVNIIFTFVLGLLDKMLGWQRAGGEGILTTIYGILVFLPWWAVQFRRLHDTDR
SAWWALLFLIPFIGWLIIIVFNCQAGTPGENRFGPDPKLEP
>O07517 ~~~yhaI~~~Uncharacterized protein YhaI~~~
MDSMDHRIERLEYYIQLLVKTVDMDRYPFYALLIDKGLSKEEGEAVMRICDELSEELATQKAQGFVTFDKLLALFAGQLN
EKLDVHETIFALYEQGLYQELMEVFIDIMKHFD
>P64592 ~~~yhaI~~~Inner membrane protein YhaI~~~COG3152
MQWYLSVLKNYVGFSGRARRKEYWMFTLINAIVGAIINVIQLILGLELPYLSMLYLLATFLPVLALAIRRLHDTDRSGAW
ALLFFVPFIGWLVLLVFFCTEGTSGSNRYGNDPKFGSN
>P67661 ~~~yhaJ~~~HTH-type transcriptional regulator YhaJ~~~COG0583
MAKERALTLEALRVMDAIDRRGSFAAAADELGRVPSALSYTMQKLEEELDVVLFDRSGHRTKFTNVGRMLLERGRVLLEA
ADKLTTDAEALARGWETHLTIVTEALVPTPAFFPLIDKLAAKANTQLAIITEVLAGAWERLEQGRADIVIAPDMHFRSSS
EINSRKLYTLMNVYVAAPDHPIHQEPEPLSEVTRVKYRGIAVADTARERPVLTVQLLDKQPRLTVSTIEDKRQALLAGLG
VATMPYPMVEKDIAEGRLRVVSPESTSEIDIIMAWRRDSMGEAKSWCLREIPKLFNGK
>P58115 ~~~yhaK~~~Pirin-like protein YhaK~~~COG1741
MITTRTARQCGQADYGWLQARYTFSFGHYFDPKLLGYASLRVLNQEVLAPGAAFQPRTYPKVDILNVILDGEAEYRDSEG
NHVQASAGEALLLSTQPGVSYSEHNLSKDKPLTRMQLWLDACPQRENPLIQKLALNMGKQHLIASPEGAMGSLQLRQQVW
LHHIVLDKGESANFQLHGPRTYLQSIHGKFHALTHHEEKAALTCGDGAFIRDEANITLVADSPLRALLIDLPV
>P42624 ~~~yhaK~~~Pirin-like protein YhaK~~~COG1741
MITTRTARQCGQADYGWLQARYTFSFGHYFDPKLLGYASLRVLNQEVLAPGAAFQPRTYPKVDILNVILDGEAEYRDSEG
NHVQASAGEALLLSTQPGVSYSEHNLSKDKPLTRMQLWLDACPQRENPLIQKLALNMGKQQLIASPEGAMGSLQLRQQVW
LHHIVLDKGESANFQLHGPRAYLQSIHGKFHALTHHEEKAALTCGDGAFIRDEANITLVADSPLRALLIDLPV
>O07520 ~~~yhaL~~~Sporulation protein YhaL~~~
MLFFPWWVYLCIVGIIFSAYKLVAAAKEEEKVDQAFIEKEGQIYMERMEKERERRSSQQHEEENQNHSIA
>O07521 3.1.-.-~~~yhaM~~~3'-5' exoribonuclease YhaM~~~COG3481
MAKGIMLHEVGEQVDQYLLIKSSTKGIASNGKPFLTLMLQDQSGDIEAKLWDAKQSDEVTYAPQTIVKVVGDVHHYRGRT
QLKLRNIRPVSEQENVNIDDFLETAPIPKNEMMDTITQYIFEMKNPNIQRITRFLVKKHEAEFMDYPAATKNHHEFVSGL
AYHVVSMLNLAKAIADLYPSLDRDLLYAGVILHDLGKVKELSGPVSTSYTVEGNLLGHISIMVTELSKAAEELQIDSEEV
LILQHLILSHHGKAEWGSPKPPMVKEAEILHYIDNLDAKMNMMDRALERVKPGEYTERVFALENRSFYKPTFHK
>Q8XAF6 ~~~yhaM~~~UPF0597 protein YhaM~~~COG3681
MFDSTLNPLWQRYILAVQEEVKPALGCTEPISLALAAAVAAAELEGPVERVEAWVSPNLMKNGLGVTVPGTGMVGLPIAA
ALGALGGNANAGLEVLKDATAQAIADAKALLAAGKVSVKIQEPCNEILFSRAKVWNGEKWACVTIVGGHTNIVHIETHDG
VVFTQQACVAEGEQESPLSVLSRTTLAEILKFVNEVPFAAIRFILDSAKLNCALSQEGLSGKWGLHIGATLEKQCERGLL
AKDLSSSIVIRTSAASDARMGGATLPAMSNSGSGNQGITATMPVVVVAEHFGADDERLARALMLSHLSAIYIHNQLPRLS
ALCAATTAAMGAAAGMAWLVDGRYETISMAISSMIGDVSGMICDGASNSCAMKVSTSASAAWKAVLMALDDTAVTGNEGI
VAHDVEQSIANLCALASHSMQQTDRQIIEIMASKAR
>P42626 ~~~yhaM~~~UPF0597 protein YhaM~~~COG3681
MFDSTLNPLWQRYILAVQEEVKPALGCTEPISLALAAAVAAAELEGPVERVEAWVSPNLMKNGLGVTVPGTGMVGLPIAA
ALGALGGNANAGLEVLKDATAQAIADAKALLAAGKVSVKIQEPCDEILFSRAKVWNGEKWACVTIVGGHTNIVHIETHDG
VVFTQQACVAEGEQESPLTVLSRTTLAEILKFVNEVPFAAIRFILDSAKLNCALSQEGLSGKWGLHIGATLEKQCERGLL
AKDLSSSIVIRTSAASDARMGGATLPAMSNSGSGNQGITATMPVVVVAEHFGADDERLARALMLSHLSAIYIHNQLPRLS
ALCAATTAAMGAAAGMAWLVDGRYETISMAISSMIGDVSGMICDGASNSCAMKVSTSASAAWKAVLMALDDTAVTGNEGI
VAHDVEQSIANLCALASHSMQQTDRQIIEIMASKAR
>O08455 ~~~yhaN~~~Uncharacterized protein YhaN~~~COG4717
MTALRIISLHIYQYGKFSNRTFDFSASPVQVIYGLNEAGKTTMMSFIESMLFGFPKTKKYEPKTGGVYGGVLEAEHPEYG
VLKIERTKGTAEKLSVYTEKGEVKQGDVLKQLFQGTDRSLYKAIYSFDVFGLQEIHAFNRDKIGEFLLFSSLFGAEAVSK
LDSRLTKESERLYKPNGRNPQLNQELETLKQLAVKLKQAEAEEAGYHQLLEEKRTLEARLAAAETELKETAGHIRMIEGA
IERKPLLNEKATLEQVIAEFPEHAGQFPADGLHQLEKYESHLHPKSAQLEALRVKMAELDKQRQKLIPDKELLAKETLIQ
ELSAAFHMYQSWGEQLAAIQAQLRQTSAQTAAGLEQLNKTDENELLNMNTSYDYEWQLQQAVQQYVQARDRKRQLDETFE
LARQELEDAEKAVRAASSAILENSQRKDKEAALRAYDETQGQHQEQAKLREQLTFFERQQAKQKKTVIAAGMLFIVLFSL
LQQWIPAISFGAALIVYWLVSGKSSSRNSRETRQPMTDISPAEAEMLREALWEDDRNKQHLLTQRAALQQKEAAYERVIQ
QFEQWEAEMAPSFTQVERFMNELGFKEDPSFLLDAYSLMKDVKKEVKKKHELTIEAGRLKKHRRTFEERVSMLLPVNQSQ
DISISDALHTLRKNIEREKEIEKQKKEIETDIHYTKEQMLELEQEIQYFHAQIEQLFAAAAAKDRDAFFAIAAISRQLKD
TENKLHHVNAQLQGGYPEELELADSNTLSELKDKQFVENERKERLTEEIEQLRSQIALLSVKQEQLEASGMVSDLKLQTE
MQKERVKETAKKWASIQMVKQVIRNKLERHKKIELPRLLETAGEFFRPLTDGNYQTIYFSETDDSIMVMHRDGTVYHAEE
LSQGTCEQLYTAIRFALAVTRQGESKLPFQLDDSFVHFDQERLKRVLHVLYDLSEGGRQILYFTCHEHVKDAFHSSQIIH
LVS
>O07522 ~~~yhaO~~~Uncharacterized metallophosphoesterase YhaO~~~COG0420
MLTDLTFIHAADLHLDSPFYGISHLPEPIFARIKESTFASVRHMIDAAVREHVDFILLAGDLFDEANRSLKAQLFLKKQF
ERLRECGISVYVIFGNHDHLGGEWTPIEWPENVHIFSSAVPEEKSFFKEGRRIASIYGFSYQARALMENQAARYRRSTDA
PFHIGMLHGTLSGSEGHDPYCPFTHDDLVKSGMDYWALGHIHKRQVLSAEHPAVIYPGNTQARHIKETGDKGYYLVHVTN
GDISYEFQRAHDVLWEKAAVDVTEAKNMTALFQMVEDTFSKLRKKGSPVCVRLVLQGTAPEWLLEAPKGTLDEFLEALQE
QEAEEERFVWPLRLDDETENEANLTNLDPFFGGLFEDIDRSDLSDVLEGLERHPVYRRHADRFSQEEVKEIKEQAQIILK
RQLKVLDT
>O07523 ~~~yhaP~~~Uncharacterized protein YhaP~~~COG1668
MNKFWIMLSHTYKNKIMAKSFIISTVITVLLVLVVTNLESIISLFQGDDAKEKIAVVDETNELYPVFSKQLKAVDTDGDL
DVKLSKQSEDEVTKQVKDESLDGMLIIKRDEKGTISGTYKALTISDESTYQTLQQALTQTKTAVGTAELGVSQETISSLY
APVTVGQKALKEGAKSEEELGQTVGLVYIMLFVIYFSVIMYASMIAMEVATEKSSRVMEILISSMPPIQQMFAKLLGIGL
VGITQLAIIIGAGSLSLKLNQKSETASSVGGFLNLTDVSATTVIYAVIFFLLGYFLYATLAAFLGSVVSRIEDVQQTITP
MTLLVVAGFMIAMFGLNAPDAGFITVTSFIPFFTPMIMFLRVGMLDIPFWQAAVGIGITLLTIVILAVIGARIYKGGVLI
YGNSSAFKAIKQALRLAKN
>P64594 3.1.-.-~~~yhaV~~~Ribonuclease toxin YhaV~~~
MDFPQRVNGWALYAHPCFQETYDALVAEVETLKGKDPENYQRKAATKLLAVVHKVIEEHITVNPSSPAFRHGKSLGSGKN
KDWSRVKFGAGRYRLFFRYSEKEKVIILGWMNDENTLRTYGKKTDAYTVFSKMLKRGHPPADWETLTRETEETH
>O07539 ~~~yhaX~~~Stress response protein YhaX~~~COG0561
MSKQLLALNIDGALLRSNGKIHQATKDAIEYVKKKGIYVTLVTNRHFRSAQKIAKSLKLDAKLITHSGAYIAEKIDAPFF
EKRISDDHTFNIVQVLESYQCNIRLLHEKYSIGNKKKVNSNLLGKALIHPSDPIFYPVQFVESLSDLLMDEPVSAPVIEV
YTEHDIQHDITETITKAFPAVDVIRVNDEKLNIVPKGVSKEAGLALVASELGLSMDDVVAIGHQYDDLPMIELAGLGVAM
GNAVPEIKRKADWVTRSNDEQGVAYMMKEYFRMQQRKGFLDKFHMKRV
>O07541 ~~~yhaZ~~~Uncharacterized protein YhaZ~~~COG4335
MADLKEIYNEELISQLIHHVRSSYPDFNKNRFLDTLRLEDWPELTLKERMRRVTVSLYETLPKQYVEALTILRDTAPHFK
GLSGILFPDYVEQYGLAHWEESIKALESFTQYSTSEFAVRPFLLLDQEKMIAQLLAWSEHKNEHVRRLASEGSRPRLPWG
KSIPALKSDPSPVLPILEKLMQDESLYVRKSVANNLNDISKTHPHLLRKVADQWYGTHPHTDWIIKHAYRTLLKKGDKQA
LALFGYENADSIQLHDLTCQPKRIVIGESLEFSFYIHSDRDQKVRIEYAIDFVKARGQRHQKVFKITETNIRKNETKSYT
RIQSFKDLTTRKHYKGIHTLSVIINGEVKDSLDFQVC
>P0AA73 ~~~yhbE~~~Uncharacterized inner membrane transporter YhbE~~~COG0697
MKQQAGIGILLALTTAICWGALPIAMKQVLEVMEPPTIVFYRFLMASIGLGAILAVKKRLPPLRVFRKPRWLILLAVATA
GLFGNFILFSSSLQYLSPTASQVIGQLSPVGMMVASVFILKEKMRSTQVVGALMLLSGLVMFFNTSLVEIFTKLTDYTWG
VIFGVGAATVWVSYGVAQKVLLRRLASPQILFLLYTLCTIALFPLAKPGVIAQLSHWQLACLIFCGLNTLVGYGALAEAM
ARWQAAQVSAIITLTPLFTLFFSDLLSLAWPDFFARPMLNLLGYLGAFVVVAGAMYSAIGHRIWGGLRKHTTVVSQPRAG
E
>P45742 ~~~yhbH~~~Stress response UPF0229 protein YhbH~~~COG2718
MSQNDSGHFLISEENWSLHRKGFDDQQRHQKKVQEAIKNNLPDLVTEESIIMSNGKDVVKIPIRSLDEYKIRYNYDKNKH
VGQGDGESQVGDVVARDGSDKKQGPGKGQGAGDQAGEDYYEAEVSLMDLEEALFKELELPNLQQKERDNIIHTDIEFNDI
RKTGLTGNIDKKRTMMSAFKRNAMSGKPSFYPIYPEDLKYKTWNDITKPESKAVVLAMMDTSGSMGVWEKYMARSFFFWM
TRFLRTKYETVEIEFIAHHTEARVVSEEDFFSKGESGGTICSSVYRKSLELIDEKYNPARYNIYPFHFSDGDNLTSDNAR
CVKLVNDIMKKANLFCYGEVNQYNRHSTLMSAYKNVKDEKFKYYILKQKSDVFQALKNFFRNEESGVSHQFS
>P45470 3.1.2.-~~~yhbO~~~Protein/nucleic acid deglycase 2~~~COG0693
MSKKIAVLITDEFEDSEFTSPADEFRKAGHEVITIEKQAGKTVKGKKGEASVTIDKSIDEVTPAEFDALLLPGGHSPDYL
RGDNRFVTFTRDFVNSGKPVFAICHGPQLLISADVIRGRKLTAVKPIIIDVKNAGAEFYDQEVVVDKDQLVTSRTPDDLP
AFNREALRLLGA
>P42640 2.-.-.-~~~yhbX~~~Putative transferase YhbX~~~COG2194
MTVFNKFARTFKSHWLLYLCVIVFGITNLVASSGAHMVQRLLFFVLTILVVKRISSLPLRLLVAAPFVLLTAADMSISLY
SWCTFGTTFNDGFAISVLQSDPDEVVKMLGMYIPYLCAFAFLSLLFLAVIIKYDVSLPTKKVTGILLLIVISGSLFSACQ
FAYKDAKNKKAFSPYILASRFATYTPFFNLNYFALAAKEHQRLLSIANTVPYFQLSVRDTGIDTYVLIVGESVRVDNMSL
YGYTRSTTPQVEAQRKQIKLFNQAISGAPYTALSVPLSLTADSVLSHDIHNYPDNIINMANQAGFQTFWLSSQSAFRQNG
TAVTSIAMRAMETVYVRGFDELLLPHLSQALQQNTQQKKLIVLHLNGSHEPACSAYPQSSAVFQPQDDQDACYDNSIHYT
DSLLGQVFELLKDRRASVMYFADHGLERDPTKKNVYFHGGREASQQAYHVPMFIWYSPVLGDGVDRTTENNIFSTAYNNY
LINAWMGVTKPEQPQTLEEVIAHYKGDSRVVDANHDVFDYVMLRKEFTEDKQGNPTPEGQG
>P0AGK4 ~~~yhbY~~~RNA-binding protein YhbY~~~COG1534
MNLSTKQKQHLKGLAHPLKPVVLLGSNGLTEGVLAEIEQALEHHELIKVKIATEDRETKTLIVEAIVRETGACNVQVIGK
TLVLYRPTKERKISLPR
>P45423 3.1.-.-~~~yhcG~~~Putative nuclease YhcG~~~COG4804
MESLSEGTTAGYQQIHDGIIHLVDSARTETVRSVNALMTATYQEIGRRIVEFEQGGEARAAYGAQLIKRLSKDLCLRYKR
GFSAKNLRQMRLFYLFFQHVEIHQTMSGELTPLGIPQTPSAEFPSAKIWQTLSAKSFPLPRSTYVRLLSVKNADARSFYE
KETLRCGWSVRQLERQIATQFYERTLLSHDKSAMLQQHAPAETHILPQQAIRDPFVLEFLELKDEYSESDFEEALINHLM
DFMLELGDDFAFVGRQRRLRIDDNWFRVDLLFFHRRLRCLLIVDLKVGKFSYSDAGQMNMYLNYAKEHWTLPDENPPIGL
VLCAEKGAGEAHYALAGLPNTVLASEYKMQLPDEKRLADELVRTQAVLEEGYRRR
>P54598 ~~~yhcN~~~Probable spore germination lipoprotein YhcN~~~
MFGKKQVLASVLLIPLLMTGCGVADQGEGRRDNNDVRNVNYRNPANDDMRNVNNRDNVDNNVNDNVNNNRVNDDNNNDRK
LEVADEAADKVTDLKEVKHADIIVAGNQAYVAVVLTNGNKGAVENNLKKKIAKKVRSTDKNIDNVYVSANPDFVERMQGY
GKRIQNGDPIAGLFDEFTQTVQRVFPNAE
>P64618 ~~~yhcO~~~Uncharacterized protein YhcO~~~COG2732
MNIYTFDFDEIESQEDFYRDFSQTFGLAKDKVRDLDSLWDVLMNDVLPLPLEIEFVHLGEKTRRRFGALILLFDEAEEEL
EGHLRFNVRH
>P54602 3.1.31.-~~~yhcR~~~Endonuclease YhcR~~~COG0737
MLSVEMISRQNRCHYVYKGGNMMRRILHIVLITALMFLNVMYTFEAVKAAEPQQPISIEKAIQQKEGQALVEGYAVGQAV
SPQHYKLTSPFSNDYNVALADRKNKTSPEHILPVQIPSAFRSQFGLQTNPLLLGKKITVQGKLENYFNTTGLKNVQSMNV
TDDTKTPPAEQQVTINEARGRLNEEVTIKGIITADQNAIGGGKLSTFLQDETGGINIYSPSPEQFPELKEGMDVTVTGKI
TSYQGLKEIVPNSSGIKINQSNQSLPAPKHLTINELINGSLGDQYEGRLVKLTAFVSSIPSSPAGGGYNVTMIDDDHHAM
TLRVMNETGVINELDEGKWYEFTGVLSRYQTFQLLPRKSADLKLLEEQPAPPSAEGEYEGIVDRVVDGDTIHLKSPVLGT
TKIRFVNVDAPETYHTPKNDADENQLRFGKKASDYLKTVLSPGDKITVKVGSEAKDSYGRLLGQVITESGSNVNLELVKN
GYAPTYFIWPVDNEEDYQQFQAAVAAAKKDQKGIWNENDPLMEMPFEFRAREQGKGLTRYVGDSSNKTYVQPADWKKIAV
ENRIFFASASEAESAGYKKRQTAPQEHVPLRILSMNDLHGKIDQQYELDLDGNGTVDGTFGRMDYAAAYLKEKKAEKKNS
LIVHAGDMIGGSSPVSSLLQDEPTVELMEDIGFDVGTVGNHEFDEGTDELLRILNGGDHPKGTSGYDGQNFPLVCANCKM
KSTGEPFLPAYDIINVEGVPVAFIGVVTQSAAGMVMPEGIKNIEFTDEATAVNKAAEELKKKGVKAIAVLAHMSAEQNGN
AITGESADLANKTDSEIDVIFAAHNHQVVNGEVNGKLIVQAFEYGKAIGVVDVEIDKTTKDIVKKSAEIVYVDQSKIEPD
VSASAILKKYETIAEPIISEVVGEAAVDMEGGYSNDGDTPLGNLIADGMRAAMKTDFALMNGGGIREALKKGPITWGDLY
NIQPFGNVLTKLEIKGKDLREIINAQISPVFGPDYSISGFTYTWDKETGKAVDMKMADGTEIQPDATYTLTVNNFMATAT
GAKYQPIGLLGKNPVTGPEDLEATVEYVKSFDEPIAYTKEGRIKLAEASDIEDPVTEDPITEEPGDDPGTEDPIKEDPRP
GEDLPDIKETPGTAPVHQLPPSAISRFNEIPINNTKTADTANSISTLPLQTETAESGSDHQLPDTSAGYYNFMVIGAAVT
LSGTYLYVRRKRSASRT
>P28638 2.1.1.72~~~yhdJ~~~DNA adenine methyltransferase YhdJ~~~COG2189
MRTGCEPTRFGNEAKTIIHGDALAELKKIPAESVDLIFADPPYNIGKNFDGLIEAWKEDLFIDWLFEVIAECHRVLKKQG
SMYIMNSTENMPFIDLQCRKLFTIKSRIVWSYDSSGVQAKKHYGSMYEPILMMVKDAKNYTFNGDAILVEAKTGSQRALI
DYRKNPPQPYNHQKVPGNVWDFPRVRYLMDEYENHPTQKPEALLKRIILASSNPGDIVLDPFAGSFTTGAVAIASGRKFI
GIEINSEYIKMGLRRLDVASHYSAEELAKVKKRKTGNLSKRSRLSEVDPDLITK
>O07580 ~~~yhdK~~~Probable anti-sigma-M factor YhdK~~~
MELVRIFKEHNVFGWISVGTAVLSLLLLNLAIISNVTFYSYQMLPFAMAAVPFGVVELFIKRGRTGPGLLGVILNLFVII
CVYTIVSVDTNLQFGF
>O07581 ~~~yhdL~~~Probable anti-sigma-M factor YhdL~~~
MMNEEFKKRFDQYKNGEMSDQEMTAFEEELEKLEVYQELIDSELEDDNDWDLSISPEKQKAILAYGKRKSYLRISVLAVI
STLMILPLCTLGSYLYYGMGGKHSTGNEFMETAAVTVALTMPNVLVDTSGLKSQVKLFGMNTEFPLQKQIGTKTAAVGNE
RVEMFYNKVKAPAVNYYDLEVNKTGHYFTHPSNKSEQTTAKAEKTLSTLPEGTVSEVYLSYDRAYPTKDVYNKFKGYDVS
FLWNAIETEKNTNKTASTEPLGYPGKDSKFLAALNTKGKSNGDQFINALKFMSKHEKWAQVISKRKDLNVDNRLDYVEKN
GVNVYGSVVTGPTKEIQRMLKNKSVKSANVGEVELWNW
>P45566 ~~~yhdT~~~Uncharacterized membrane protein YhdT~~~COG3924
MDTRFVQAHKEARWALGLTLLYLAVWLVAAYLSGVAPGFTGFPRWFEMACILTPLLFIGLCWAMVKFIYRDIPLEDDDAA
>P45766 ~~~yhdW~~~Putative amino-acid ABC transporter-binding protein YhdW~~~COG0834
MKKMMIATLAAASVLLAVANQAXAGATLDAVQKKGFVQCGISDGLPGFSYADADGKFSGIDVDICRGVAAAVFGDDTKVK
YTPLTAKERFTALQSGEVDLLSRNTTWTSSRDAGMGMAFTGVTYYDGIGFLTHDKAGLKSAKELDGATVCIQAGTDTELN
VADYFKANNMKYTPVTFDRSDESAKALESGRCDTLASDQSQLYALRIKLSNPAEWIVLPEVISKEPLGPVVRRGDDEWFS
IVRWTLFAMLNAEEMGINSQNVDEKAANPATPDMAHLLGKEGDYGKDLKLDNKWAYNIIKQVGNYSEIFERNVGSESPLK
IKRGQNNLWNNGGIQYAPPVR
>P45768 ~~~yhdY~~~Inner membrane amino-acid ABC transporter permease protein YhdY~~~COG0765
MTKVLLSHPPRPASHNSSRAMVWVRKNLFSSWSNSLLTIGCIWLMWELIPPLLNWAFLQANWVGSTRADCTKAGACWVFI
HERFGQFMYGLYPHDQRWRINLALLIGLVSIAPMFWKILPHRGRYIAAWAVIYPLIVWWLMYGGFFALERVETRQWGGLT
LTLIIASVGIAGALPWGILLALGRRSHMPIVRILSVIFIEFWRGVPLITVLFMSSVMLPLFMAEGTSIDKLIRALVGVIL
FQSAYVAEVVRGGLQALPKGQYEAAESLALGYWKTQGLVILPQALKLVIPGLVNTIIALFKDTSLVIIIGLFDLFSSVQQ
ATVDPAWLGMSTEGYVFAALIYWIFCFSMSRYSQYLEKRFNTGRTPH
>O07542 ~~~yheA~~~UPF0342 protein YheA~~~COG3679
MAVNFYDVAYDLENALRGSEEFTRLKNLYDEVNADESAKRMFENFRDVQLRLQQKQMAGEEITQEEVTQAQKTVALVQQH
EKISQLMEAEQRMSMLIGELNKIIMKPLEELYGSVEG
>O07545 ~~~yheD~~~Endospore coat-associated protein YheD~~~COG0189
MNPKRFLIGIDKTSENTLFLPSSLKQDGLLHAAFGTKVVRCHVAYRRHLEQTVLLSENLFHELLLPHRSRADILIHDHTV
HIGPLVGIFTAGFTVSLERPFKDRSLFFSKLVTLHEQAGGYCFVFGAHQINWEEGTIEGLLYRENGWEKKIVPLPNVVYD
RLPNRKIEDSLLLQHTKKRLIDEYQIPWFNKTFFNKWNVHQLLEKDPRTAPFLPRSELTPSVELIDELCGAYKKVYIKPA
NGALGTGIYQLTRTDGGLTVKHTNDAKTFTSIDYSDAASFLAEFQKHHNPSDFLIQQGVDLIEFQGKPADFRVHTNKNRK
GKWTVTAIAVKISGKNSITTHLSNGGTVKTLAEVYDDPAERVEVIKKLSAAALTASHVLHDHIEGFIGEIGFDFGIDQNG
KVWMFEANSRPGRSIFSHPNLHHVDSLTKRRSFEYASYLSEKAITSPEALWPS
>O07549 7.6.2.-~~~yheH~~~Probable multidrug resistance ABC transporter ATP-binding/permease protein YheH~~~COG1132
MKIGKTLWRYALLYRKLLITAVLLLTVAVGAELTGPFIGKKMIDDHILGIEKTWYEAAEKDKNAVQFHGVSYVREDRLQE
PVSKAKEAHIYQVGMAFYFVDQAVSFDGNRTVSDGKLTITNGDKSRAYAAEKLTKQELFQFYQPEIKGMVLLICLYGGLL
VFSVFFQYGQHYLLQMSANRIIQKMRQDVFSHIQKMPIRYFDNLPAGKVVARITNDTEAIRDLYVTVLSTFVTSGIYMFG
IFTALFLLDVKLAFVCLAIVPIIWLWSVIYRRYASYYNQKIRSINSDINAKMNESIQGMTIIQAFRHQKETMREFEELNE
SHFYFQNRMLNLNSLMSHNLVNVIRNLAFVCLIWHFGGASLNAAGIVSIGVLYAFVDYLNRLFQPITGIVNQFSKLELAR
VSAGRVFELLEEKNTEEAGEPAKERALGRVEFRDVSFAYQEGEEVLKHISFTAQKGETVALVGHTGSGKSSILNLLFRFY
DAQKGDVLIDGKSIYNMSRQELRSHMGIVLQDPYLFSGTIGSNVSLDDERMTEEEIKNALRQVGAEPLLKKLPKGINEPV
IEKGSTLSSGERQLISFARALAFDPAILILDEATAHIDTETEAVIQKALDVVKQGRTTFVIAHRLSTIRNADQILVLDKG
EIVERGNHEELMALEGQYYQMYELQKGQKHSIA
>O07550 7.6.2.-~~~yheI~~~Probable multidrug resistance ABC transporter ATP-binding/permease protein YheI~~~COG1132
MFSVLKKLGWFFKAYWLRYTIAIVLLLAVNVIEMFPPKLLGNAIDDMKAGAFTAEGLLFYIGIFFVLTAAVYIMSYFWMH
QLFGGANLMEKILRTKLMGHLLTMSPPFYEKNRTGDLMARGTNDLQAVSLTTGFGILTLVDSTMFMMTIFLTMGFLISWK
LTFAAIIPLPVMAIAISLYGSKIHERFTEAQNAFGALNDRVLESVSGVRVIRAYVQETNDVRRFNEMTADVYQKNMKVAF
IDSLFEPTVKLLVGASYLIGLGYGAFLVFRNELTLGELVSFNVYLGMMIWPMFAIGELINVMQRGNASLDRVNETLSYET
DVTDPKQPADLKEPGDIVFSHVSFTYPSSTSDNLQDISFTVRKGQTVGIAGKTGSGKTTIIKQLLRQYPPGEGSITFSGV
PIQQIPLDRLRGWIGYVPQDHLLFSRTVKENILYGKQDATDKEVQQAIAEAHFEKDLHMLPSGLETMVGEKGVALSGGQK
QRISIARALMANPEILILDDSLSAVDAKTEAAIIKNIRENRKGKTTFILTHRLSAVEHADLILVMDGGVIAERGTHQELL
ANNGWYREQYERQQLFTAEEGGAGA
>A0A0H2VBH0 ~~~yheS~~~Probable ATP-binding protein YheS~~~COG0488
MIVFSSLQIRRGVRVLLDNATATINPGQKVGLVGKNGCGKSTLLALLKNEISADGGSYTFPGSWQLAWVNQETPALPQAA
LEYVIDGDREYRQLEAQLHDANERNDGHAIATIHGKLDAIDAWSIRSRAASLLHGLGFSNEQLERPVSDFSGGWRMRLNL
AQALICRSDLLLLDEPTNHLDLDAVIWLEKWLKSYQGTLILISHDRDFLDPIVDKIIHIEQQSMFEYTGNYSSFEVQRAT
RLAQQQAMYESQQERVAHLQSYIDRFRAKATKAKQAQSRIKMLERMELIAPAHVDNPFRFSFRAPESLPNPLLKMEKVSA
GYGDRIILDSIKLNLVPGSRIGLLGRNGAGKSTLIKLLAGELAPVSGEIGLAKGIKLGYFAQHQLEYLRADESPLQHLAR
LAPQELEQKLRDYLGGFGFQGDKVTEETRRFSGGEKARLVLALIVWQRPNLLLLDEPTNHLDLDMRQALTEALIEFEGAL
VVVSHDRHLLRSTTDDLYLVHDRKVEPFDGDLEDYQQWLSDVQKQENQTDEAPKENANSAQARKDQKRREAELRAQTQPL
RKEIARLEKEMEKLNAQLAQAEEKLGDSELYDQNRKAELTACLQQQASAKSGLEECEMAWLEAQEQLEQMLLEGQSN
>P63389 ~~~yheS~~~Probable ATP-binding protein YheS~~~COG0488
MIVFSSLQIRRGVRVLLDNATATINPGQKVGLVGKNGCGKSTLLALLKNEISADGGSYTFPGSWQLAWVNQETPALPQAA
LEYVIDGDREYRQLEAQLHDANERNDGHAIATIHGKLDAIDAWSIRSRAASLLHGLGFSNEQLERPVSDFSGGWRMRLNL
AQALICRSDLLLLDEPTNHLDLDAVIWLEKWLKSYQGTLILISHDRDFLDPIVDKIIHIEQQSMFEYTGNYSSFEVQRAT
RLAQQQAMYESQQERVAHLQSYIDRFRAKATKAKQAQSRIKMLERMELIAPAHVDNPFRFSFRAPESLPNPLLKMEKVSA
GYGDRIILDSIKLNLVPGSRIGLLGRNGAGKSTLIKLLAGELAPVSGEIGLAKGIKLGYFAQHQLEYLRADESPIQHLAR
LAPQELEQKLRDYLGGFGFQGDKVTEETRRFSGGEKARLVLALIVWQRPNLLLLDEPTNHLDLDMRQALTEALIEFEGAL
VVVSHDRHLLRSTTDDLYLVHDRKVEPFDGDLEDYQQWLSDVQKQENQTDEAPKENANSAQARKDQKRREAELRAQTQPL
RKEIARLEKEMEKLNAQLAQAEEKLGDSELYDQSRKAELTACLQQQASAKSGLEECEMAWLEAQEQLEQMLLEGQSN
>P44808 ~~~yheS~~~Probable ATP-binding protein YheS~~~COG0488
MIIFSNLSLKRGQTELLENASATINPKQKVGLVGKNGCGKSSLFALLKKELMPEGGEVNYPANWRVSWVNQETPALDISA
IDYVIQGDREYCRLQQKLERANERNDGNAIARIHGQLETLDAWTIQSRAASLLHGLGFSQEETIQPVKAFSGGWRMRLNL
AQALLCPSDLLLLDEPTNHLDLDAVIWLERWLVQYQGTLVLISHDRDFLDPIVTKILHIENQKLNEYTGDYSSFEVQRAT
KLAQQTAMYRQQQQKISHLQKYIDRFKAKATKAKQAQSRMKALERMELIAPAYVDNPFTFEFRPPQSLPNPLVMIEQASA
GYGIGESAVEILSKIKLNLVPGSRIGLLGKNGAGKSTLIKLLAGELTALSGTVQLAKGVQLGYFAQHQLDTLRADESALW
HMQKLAPEQTEQQVRDYLGSFAFHGDKVNQAVKSFSGGEKARLVLALIVWQRPNLLLLDEPTNHLDLDMRQALTEALVDY
EGSLVVVSHDRHLLRNTVEEFYLVHDKKVEEFKGDLEDYQKWLSEQNSTSENKVSEKVGDNENSVQNRKEQKRREAELRQ
QTAPLRKKITQLEEKMNKFSSELANIENQLADTELYNAENKEKLTALLAQQVDVKKALDDVETEWMTAQEELEEMLQA
>P0ADX1 ~~~yhfA~~~Protein YhfA~~~COG1765
MQARVKWVEGLTFLGESASGHQILMDGNSGDKAPSPMEMVLMAAGGCSAIDVVSILQKGRQDVVDCEVKLTSERREEAPR
LFTHINLHFIVTGRDLKDAAVARAVDLSAEKYCSVALMLEKAVNITHSYEVVAA
>P0ADX5 ~~~yhfG~~~Uncharacterized protein YhfG~~~
MKKLTDKQKSRLWELQRNRNFQASRRLEGVEMPLVTLTAAEALARLEELRSHYER
>O07609 4.-.-.-~~~yhfK~~~Uncharacterized sugar epimerase YhfK~~~COG0702
MKVFLIGANGQIGQRLVSLFQDNPDHSIRAMVRKEEQKASLEAAGAEAVLANLEGSPEEIAAAAKGCDAIIFTAGSGGST
GYDKTLLVDLDGAAKAIEAAAIAGIKRFIMVSALQAHNRENWNEALKPYYVAKHYADKILEASGLTYTIIRPGGLRNEPG
TGTVSAAKDLERGFISRDDVAKTVIASLDEKNTENRAFDLTEGDTPIAEALKKL
>O07615 1.6.5.-~~~yhfP~~~Putative quinone oxidoreductase YhfP~~~COG0604
MSTLFQALQAEKNADDVSVHVKTISTEDLPKDGVLIKVAYSGINYKDGLAGKAGGNIVREYPLILGIDAAGTVVSSNDPR
FAEGDEVIATSYELGVSRDGGLSEYASVPGDWLVPLPQNLSLKEAMVYGTAGFTAALSVHRLEQNGLSPEKGSVLVTGAT
GGVGGIAVSMLNKRGYDVVASTGNREAADYLKQLGASEVISREDVYDGTLKALSKQQWQGAVDPVGGKQLASLLSKIQYG
GSVAVSGLTGGGEVPATVYPFILRGVSLLGIDSVYCPMDVRAAVWERMSSDLKPDQLLTIVDREVSLEETPGALKDILQN
RIQGRVIVKL
>C0SP94 ~~~yhfQ~~~Putative ABC transporter substrate-binding lipoprotein YhfQ~~~COG4594
MKKTLIILTVLLLSVLTAACSSSSGNQNSKEHKVAVTHDLGKTNVPEHPKRVVVLELGFIDTLLDLGITPVGVADDNKAK
QLINKDVLKKIDGYTSVGTRSQPSMEKIASLKPDLIIADTTRHKKVYDQLKKIAPTIALNNLNADYQDTIDASLTIAKAV
GKEKEMEKKLTAHEEKLSETKQKISANSQSVLLIGNTNDTIMARDENFFTSRLLTQVGYRYAISTSGNSDSSNGGDSVNM
KMTLEQLLKTDPDVIILMTGKTDDLDADGKRPIEKNVLWKKLKAVKNGHVYHVDRAVWSLRRSVDGANAILDELQKEMPA
AKK
>O07618 2.3.1.-~~~yhfS~~~Putative acetyl-CoA C-acetyltransferase YhfS~~~COG0183
MNAVIVDAKRTIFGNQNGLLKPFLPEDLAAPIIRCLSRKLEDQVDEVILGNATGRGGNLARLSALQAGLPLSVPGMTIDR
QCGSGLEAVRYACSLIQAGAGTMYIAGGSESSSQSPFSERARFSPDAIGDPDMGIAAEYTAARYSISRSMQDEYALLSHQ
RSRNAHDEGFYREEVVALGELETDEAFLKTRPIEAIIPRAKPVFDTSSGTVTAANSSGIADGAAALLVMEEEKAAALGLK
PVLRFIGSAVSGIHPNFPPAAPVVAIRQLLHTHDVTPDDIDLFEINEAFAVKICVCSQELGIPFSKINVRGGALALGHPY
GASGAALVTRLFYEAKRRPDCQYAVAAIGSGGGIGLALLFEVLA
>P45545 ~~~yhfS~~~Uncharacterized protein YhfS~~~COG0626
MKTFPLQSLTIIEAQQKQFALVDSICRHFPGSEFLTGGDLGLTPGLNQPRVTQRVEQVLADAFHAQAAALVQGAGTGAIR
AGLAALLKPGQRLLVHDAPVYPTTRVIIEQMGLTLITVDFNDLSALKQVVDEQQPDAALVQHTRQQPQDSYVLADVLATL
RAAGVPALTDDNYAVMKVARIGCECGANVSTFSCFKLFGPEGVGAVVGDADVINRIRATLYSGGSQIQGAQALEVLRGLV
FAPVMHAVQAGVSERLLALLNGGAVPEVKSAVIANAQSKVLIVEFHQPIAARVLEEAQKRGALPYPVGAESKYEIPPLFY
RLSGTFRQANPQSEHCAIRINPNRSGEETVLRILRESIASI
>O07619 6.2.1.-~~~yhfT~~~Uncharacterized acyl--CoA ligase YhfT~~~COG0318
MTITHTYSSTAETSPGRVAIQTESEQITYHDWDRLVSQTANWLRSQPSMPNRVAILLPNSLAFLQLFAGAAAAGCTAIPI
DTRWSPAECKERLSISNADLVVTLAFFKNKLTDSQTPVVLLDNCMADISEAAADPLPTIDPEHPFYMGFTSGSTGKPKAF
TRSHRSWMESFTCTETDFSISSDDKVLIPGALMSSHFLYGAVSTLFLGGTVCLLKKFSPAKAKEWLCRESISVLYTVPTM
TDALARIEGFPDSPVKIISSGADWPAESKKKLAAAWPHLKLYDFYGTSELSFVTFSSPEDSKRKPHSAGRPFHNVRIEIR
NAGGERCQPGEIGKIFVKSPMRFSGYVNGSTPDEWMTVDDMGYVDEEGFLYISGRENGMIVYGGLNIFPEEIERVLLACP
EVESAAVVGIPDEYWGEIAVAVILGNANARTLKAWCKQKLASYKIPKKWVFADSLPETSSGKIARSRVKKWLEESVQYK
>P45550 ~~~yhfX~~~Uncharacterized protein YhfX~~~COG3457
MFVEALKRQNPALISAALSLWQQGKIAPDSWVIDVDQILENGKRLIETARLYGIELYLMTKQFGRNPWLAEKLLALGYSG
IVAVDYKEARVMRRAGLPVAHQGHLVQIPCHQVADAVEQGTDVITVFTLDKAREVSAAAVKAGRIQSVLLKVYSDDDFLY
PGQESGFALKVLPEIVAEIQNLPGLHLAGLTHFPCLLWDEAVGKVLPTPNLHTLIQARDQLAKSGIALEQLNAPSATSCT
SLPLLAQYGVTHAEPGHALTGTIPANQQGDQPERIAMLWLSEISHHFRGDSYCYGGGYYRRGHAQHALVFTPENQKITET
NLKTVDDSSIDYTLPLAGEFPVSSAVVLCFRTQIFVTRSDVVLVSGIHRGEPEIVGRYDSLGNSLGA
>P46837 ~~~yhgF~~~Protein YhgF~~~COG2183
MMNDSFCRIIAGEIQARPEQVDAAVRLLDEGNTVPFIARYRKEITGGLDDTQLRNLETRLSYLRELEERRQAILKSISEQ
GKLTDDLAKAINATLSKTELEDLYLPYKPKRRTRGQIAIEAGLEPLADLLWSDPSHTPEVAAAQYVYADKGVADTKAALD
GARYILMERFAEDAALLAKVRDYLWKNAHLVSTVVSGKEEEGAKFRDYFDHHEPLSTVPSHRALAMFRGRNEGVLQLSLN
ADPQFDEPPKESYCEQIIMDHLGLRLNNAPADSWRKGVVSWTWRIKVLMHLETELMGTVRERAEDEAINVFARNLHDLLM
AAPAGLRATMGLDPGLRTGVKVAVVDATGKLVATDTIYPHTGQAAKAAMTVAALCEKHNVELVAIGNGTASRETERFYLD
VQKQFPKVTAQKVIVSEAGASVYSASELAAQEFPDLDVSLRGAVSIARRLQDPLAELVKIDPKSIGVGQYQHDVSQTQLA
RKLDAVVEDCVNAVGVDLNTASVPLLTRVAGLTRMMAQNIVAWRDENGQFQNRQQLLKVSRLGPKAFEQCAGFLRINHGD
NPLDASTVHPEAYPVVERILAATQQALKDLMGNSSELRNLKASDFTDEKFGVPTVTDIIKELEKPGRDPRPEFKTAQFAD
GVETMNDLQPGMILEGAVTNVTNFGAFVDIGVHQDGLVHISSLSNKFVEDPHTVVKAGDIVKVKVLEVDLQRKRIALTMR
LDEQPGETNARRGGGNERPQNNRPAAKPRGREAQPAGNSAMMDALAAAMGKKR
>P67143 ~~~yhgN~~~UPF0056 inner membrane protein YhgN~~~COG2095
MNEIISAAVLLILIMDPLGNLPIFMSVLKHTEPKRRRAIMVRELLIALLVMLVFLFAGEKILAFLSLRAETVSISGGIIL
FLIAIKMIFPSASGNSSGLPAGEEPFIVPLAIPLVAGPTILATLMLLSHQYPNQMGHLVIALLLAWGGTFVILLQSSLFL
RLLGEKGVNALERLMGLILVMMATQMFLDGIRMWMKG
>P0ADX7 ~~~yhhA~~~Uncharacterized protein YhhA~~~
MKRLLILTALLPFVGFAQPINTLNNPNQPGYQIPSQQRMQTQMQTQQIQQKGMLNQQLKTQTQLQQQHLENQINNNSQRV
LQSQPGERNPARQQMLPNTNGGMLNSNRNPDSSLNQQHMLPERRNGDMLNQPSTPQPDIPLKTIGP
>P0AGH1 ~~~yhhJ~~~Inner membrane transport permease YhhJ~~~COG0842
MRHLRNIFNLGIKELRSLLGDKAMLTLIVFSFTVSVYSSATVTPGSLNLAPIAIADMDQSQLSNRIVNSFYRPWFLPPEM
ITADEMDAGLDAGRYTFAINIPPNFQRDVLAGRQPDIQVNVDATRMSQAFTGNGYIQNIINGEVNSFVARYRDNSEPLVS
LETRMRFNPNLDPAWFGGVMAIINNITMLAIVLTGSALIREREHGTVEHLLVMPITPFEIMMAKIWSMGLVVLVVSGLSL
VLMVKGVLGVPIEGSIPLFMLGVALSLFATTSIGIFMGTIARSMPQLGLLVILVLLPLQMLSGGSTPRESMPQMVQDIML
TMPTTHFVSLAQAILYRGAGFEIVWPQFLTLMAIGGAFFTIALLRFRKTIGTMA
>P37621 ~~~yhhS~~~Uncharacterized MFS-type transporter YhhS~~~COG0477
MPEPVAEPALNGLRLNLRIVSIVMFNFASYLTIGLPLAVLPGYVHDVMGFSAFWAGLVISLQYFATLLSRPHAGRYADSL
GPKKIVVFGLCGCFLSGLGYLTAGLTASLPVISLLLLCLGRVILGIGQSFAGTGSTLWGVGVVGSLHIGRVISWNGIVTY
GAMAMGAPLGVVFYHWGGLQALALIIMGVALVAILLAIPRPTVKASKGKPLPFRAVLGRVWLYGMALALASAGFGVIATF
ITLFYDAKGWDGAAFALTLFSCAFVGTRLLFPNGINRIGGLNVAMICFSVEIIGLLLVGVATMPWMAKIGVLLAGAGFSL
VFPALGVVAVKAVPQQNQGAALATYTVFMDLSLGVTGPLAGLVMSWAGVPVIYLAAAGLVAIALLLTWRLKKRPPEHVPE
AASSS
>P0AGM0 ~~~yhhT~~~Putative transport protein YhhT~~~COG0628
METPQPDKTGMHILLKLASLVVILAGIHAAADIIVQLLLALFFAIVLNPLVTWFIRRGVQRPVAITIVVVVMLIALTALV
GVLAASFNEFISMLPKFNKELTRKLFKLQEMLPFLNLHMSPERMLQRMDSEKVVTFTTALMTGLSGAMASVLLLVMTVVF
MLFEVRHVPYKMRFALNNPQIHIAGLHRALKGVSHYLALKTLLSLWTGVIVWLGLELMGVQFALMWAVLAFLLNYVPNIG
AVISAVPPMIQVLLFNGVYECILVGALFLVVHMVIGNILEPRMMGHRLGMSTMVVFLSLLIWGWLLGPVGMLLSVPLTSV
CKIWMETTKGGSKLAILLGPGRPKSRLPG
>P46852 1.13.11.24~~~yhhW~~~Quercetin 2,3-dioxygenase~~~COG1741
MIYLRKANERGHANHGWLDSWHTFSFANYYDPNFMGFSALRVINDDVIEAGQGFGTHPHKDMEILTYVLEGTVEHQDSMG
NKEQVPAGEFQIMSAGTGIRHSEYNPSSTERLHLYQIWIMPEENGITPRYEQRRFDAVQGKQLVLSPDARDGSLKVHQDM
ELYRWALLKDEQSVHQIAAERRVWIQVVKGNVTINGVKASTSDGLAIWDEQAISIHADSDSEVLLFDLPPV
>P46853 1.-.-.-~~~yhhX~~~Uncharacterized oxidoreductase YhhX~~~COG0673
MVINCAFIGFGKSTTRYHLPYVLNRKDSWHVAHIFRRHAKPEEQAPIYSHIHFTSDLDEVLNDPDVKLVVVCTHADSHFE
YAKRALEAGKNVLVEKPFTPTLAQAKELFALAKSKGLTVTPYQNRRFDSCFLTAKKAIESGKLGEIVEVESHFDYYRPVA
ETKPGLPQDGAFYGLGVHTMDQIISLFGRPDHVAYDIRSLRNKANPDDTFEAQLFYGDLKAIVKTSHLVKIDYPKFIVHG
KKGSFIKYGIDQQETSLKANIMPGEPGFAADDSVGVLEYVNDEGVTVREEMKPEMGDYGRVYDALYQTITHGAPNYVKES
EVLTNLEILERGFEQASPSTVTLAK
>P9WML3 ~~~~~~Uncharacterized HIT-like protein Rv0759c~~~COG0537
MSIFTKIINRELPGRFVYEDDDVVAFLTIEPMTQGHTLVVPRAEIDHWQNVDPALFGRVMSVSQLIGKAVCRAFSTQRAG
MIIAGLEVPHLHIHVFPTRSLSDFGFANVDRNPSPGSLDEAQAKIRAALAQLA
>P37630 ~~~yhiM~~~Inner membrane protein YhiM~~~
MNIYIGWLFKLIPLIMGLICIALGGFVLESSGQSEYFVAGHVLISLAAICLALFTTAFIIISQLTRGVNTFYNTLFPIIG
YAGSIITMIWGWALLAGNDVMADEFVAGHVIFGVGMIAACVSTVAASSGHFLLIPKNAAGSKSDGTPVQAYSSLIGNCLI
AVPVLLTLLGFIWSITLLRSADITPHYVAGHVLLGLTAICACLIGLVATIVHQTRNTFSTKEHWLWCYWVIFLGSITVLQ
GIYVLVSSDASARLAPGIILICLGMICYSIFSKVWLLALVWRRTCSLANRIPMIPVFTCLFCLFLASFLAEMAQTDMGYF
IPSRVLVGLGAVCFTLFSIVSILEAGSAKK
>P64382 ~~~~~~Uncharacterized HIT-like protein HP_0404~~~COG0537
MNVFEKIIQGEIPCSKILENERFLSFYDINPKAKVHALVIPKQSIQDFNGITPELMAQMTSFIFEVVEKLGIKEKGYKLL
TNVGKNAGQEVMHLHFHILSGDKH
>P0DSG8 ~~~yhiY~~~Protein YhiY~~~
MMTTLLPVFTKPSPLALNALRAGRICRFLLIPDGRIR
>P37640 ~~~yhjB~~~Putative HTH-type transcriptional regulator YhjB~~~COG2197
MQIVMFDRQSIFIHGMKISLQQRIPGVSIQGASQADELWQKLESYPEALVMLDGDQDGEFCYWLLQKTVVQFPEVKVLIT
ATDCNKRWLQEVIHFNVLAIVPRDSTVETFALAVNSAAMGMMFLPGDWRTTPEKDIKDLKSLSARQREILTMLAAGESNK
EIGRALNISTGTVKAHLESLYRRLEVKNRTQAAMMLNISS
>P37642 ~~~yhjD~~~Inner membrane protein YhjD~~~COG1295
MTQENEIKRPIQDLEHEPIKPLDNSEKGSKVSQALETVTTTAEKVQRQPVIAHLIRATERFNDRLGNQFGAAITYFSFLS
MIPILMVSFAAGGFVLASHPMLLQDIFDKILQNISDPTLAATLKNTINTAVQQRTTVGLVGLAVALYSGINWMGNLREAI
RAQSRDVWERSPQDQEKFWVKYLRDFISLIGLLIALIVTLSITSVAGSAQQMIISALHLNSIEWLKPTWRLIGLAISIFA
NYLLFFWIFWRLPRHRPRKKALIRGTFLAAIGFEVIKIVMTYTLPSLMKSPSGAAFGSVLGLMAFFYFFARLTLFCAAWI
ATAEYKDDPRMPGKTQP
>P37643 ~~~yhjE~~~Inner membrane metabolite transport protein YhjE~~~COG0477
MQATATTLDHEQEYTPINSRNKVLVASLIGTAIEFFDFYIYATAAVIVFPHIFFPQGDPTAATLQSLATFAIAFVARPIG
SAVFGHFGDRVGRKATLVASLLTMGISTVVIGLLPGYATIGIFAPLLLALARFGQGLGLGGEWGGAALLATENAPPRKRA
LYGSFPQLGAPIGFFFANGTFLLLSWLLTDEQFMSWGWRVPFIFSAVLVIIGLYVRVSLHESPVFEKVAKAKKQVKIPLG
TLLTKHVRVTVLGTFIMLATYTLFYIMTVYSMTFSTAAAPVGLGLPRNEVLWMLMMAVIGFGVMVPVAGLLADAFGRRKS
MVIITTLIILFALFAFNPLLGSGNPILVFAFLLLGLSLMGLTFGPMGALLPELFPTEVRYTGASFSYNVASILGASVAPY
IAAWLQTNYGLGAVGLYLAAMAGLTLIALLLTHETRHQSL
>P37648 ~~~yhjJ~~~Protein YhjJ~~~COG0612
MQGTKIRLLAGGLLMMATAGYVQADALQPDPAWQQGTLSNGLQWQVLTTPQRPSDRVEIRLLVNTGSLAESTQQSGYSHA
IPRIALTQSGGLDAAQARSLWQQGIDPKRPMPPVIVSYDTTLFNLSLPNNRNDLLKEALSYLANATGKLTITPETINHAL
QSQDMVATWPADTKEGWWRYRLKGSTLLGHDPADPLKQPVEAEKIKDFYQKWYTPDAMTLLVVGNVDARSVVDQINKTFG
ELKGKRETPAPVPTLSPLRAEAVSIMTDAVRQDRLSIMWDTPWQPIRESAALLRYWRADLAREALFWHVQQALSASNSKD
IGLGFDCRVLYLRAQCAINIESPNDKLNSNLNLVARELAKVRDKGLPEEEFNALVAQKKLELQKLFAAYARADTDILMGQ
RMRSLQNQVVDIAPEQYQKLRQDFLNSLTVEMLNQDLRQQLSNDMALILLQPKGEPEFNMKALQAVWDQIMAPSTAAATT
SVATDDVHPEVTDIPPAQ
>O07571 ~~~yhjQ~~~Uncharacterized cysteine-rich protein YhjQ~~~
MEQYSEACIEACIDCMKACNHCFTKCLEESVQHHLSGCIRLDRECADICALAVKAMQTDSPFMKEICALCADICEACGTE
CGKHDHDHCQACAKACFTCAEQCRSMAA
>P0ADJ3 ~~~yhjR~~~Protein YhjR~~~
MNNNEPDTLPDPAIGYIFQNDIVALKQAFSLPDIDYADISQREQLAAALKRWPLLAEFAQQK
>P37660 ~~~yhjV~~~Inner membrane transport protein YhjV~~~COG0814
MQHNTLSKHNQKLPFTRYDFGWVLLCIGMAIGAGTVLMPVQIGLKGIWVFITAAIIAYPATWVVQDIYLKTLSESDSCND
YTDIISHYLGKNWGIFLGVIYFLMIIHGIFIYSLSVVFDSASYLKTFGLTDADLSQSLLYKVAIFAVLVAIASGGERLLF
KISGPMVVVKVGIIVVFGFAMIPHWNFANITAFPQASVFFRDVLLTIPFCFFSAVFIQVLNPMNIAYRKREADKVLATRL
ALRTHRISYITLIAVILFFAFSFTFSISHEEAVSAFEQNISALALAAQVIPGHIIHITSTVLNIFAVLTAFFGIYLGFHE
AIKGIILNLLSRIIDTKKINSRVLTLAICAFIVITLTIWVSFRVSVLVFFQLGSPLYGIVSCLIPFFLIYKVAQLEKLRG
FKAWLILLYGILLCLSPLLKLIE
>P37662 ~~~yhjX~~~Uncharacterized MFS-type transporter YhjX~~~COG2223
MTPSNYQRTRWLTLIGTIITQFALGSVYTWSLFNGALSAKLDAPVSQVAFSFGLLSLGLAISSSVAGKLQERFGVKRVTM
ASGILLGLGFFLTAHSDNLMMLWLSAGVLVGLADGAGYLLTLSNCVKWFPERKGLISAFAIGSYGLGSLGFKFIDTQLLE
TVGLEKTFVIWGAIALLMIVFGATLMKDAPKQEVKTSNGVVEKDYTLAESMRKPQYWMLAVMFLTACMSGLYVIGVAKDI
AQSLAHLDVVSAANAVTVISIANLSGRLVLGILSDKIARIRVITIGQVISLVGMAALLFAPLNAVTFFAAIACVAFNFGG
TITVFPSLVSEFFGLNNLAKNYGVIYLGFGIGSICGSIIASLFGGFYVTFYVIFALLILSLALSTTIRQPEQKMLREAHG
SL
>P9WKH5 ~~~~~~Insertion element IS6110 uncharacterized 12.0 kDa protein~~~COG2963
MSGGSSRRYPPELRERAVRMVAEIRGQHDSEWAAISEVARLLGVGCAETVRKWVRQAQVDAGARPGTTTEESAELKRLRR
DNAELRRANAILKTASAFFAAELDRPAR
>P0ADJ8 ~~~yiaA~~~Inner membrane protein YiaA~~~COG4682
MDNKISTYSPAFSIVSWIALVGGIVTYLLGLWNAEMQLNEKGYYFAVLVLGLFSAASYQKTVRDKYEGIPTTSIYYMTCL
TVFIISVALLMVGLWNATLLLSEKGFYGLAFFLSLFGAVAVQKNIRDAGINPPKETQVTQEEYSE
>P11286 ~~~yiaB~~~Inner membrane protein YiaB~~~COG4682
MKTSKTVAKLLFVVGALVYLVGLWISCPLLSGKGYFLGVLMTATFGNYAYLRAEKLGQLDDFFTHICQLVALITIGLLFI
GVLNAPINTYEMVIYPIAFFVCLFGQMRLFRSA
>P37664 2.3.1.-~~~yiaC~~~Peptidyl-lysine N-acetyltransferase YiaC~~~COG0456
MIREAQRSELPAILELWLESTTWGHPFIKANYWRDCIPLVRDAYLANAQNWVWEEDGKLLGFVSIMEGRFLAAMFVAPKA
VRRGIGKALMQYVQQRHPHLMLEVYQKNQPAINFYQAQGFHIVDCAWQDETQLPTWIMSWPVVQTL
>P37665 ~~~yiaD~~~Probable lipoprotein YiaD~~~COG2885
MKKRVYLIAAVVSGALAVSGCTTNPYTGEREAGKSAIGAGLGSLVGAGIGALSSSKKDRGKGALIGAAAGAALGGGVGYY
MDVQEAKLRDKMRGTGVSVTRSGDNIILNMPNNVTFDSSSATLKPAGANTLTGVAMVLKEYPKTAVNVIGYTDSTGGHDL
NMRLSQQRADSVASALITQGVDASRIRTQGLGPANPIASNSTAEGKAQNRRVEITLSPL
>P37671 ~~~yiaJ~~~DNA-binding transcriptional repressor YiaJ~~~COG1414
MGKEVMGKKENEMAQEKERPAGSQSLFRGLMLIEILSNYPNGCPLAHLSELAGLNKSTVHRLLQGLQSCGYVTTAPAAGS
YRLTTKFIAVGQKALSSLNIIHIAAPHLEALNIATGETINFSSREDDHAILIYKLEPTTGMLRTRAYIGQHMPLYCSAMG
KIYMAFGHPDYVKSYWESHQHEIQPLTRNTITELPAMFDELAHIRESGAAMDREENELGVSCIAVPVFDIHGRVPYAVSI
SLSTSRLKQVGEKNLLKPLRETAQAISNELGFTVRDDLGAIT
>P37674 ~~~yiaM~~~2,3-diketo-L-gulonate TRAP transporter small permease protein YiaM~~~COG3090
MKKILEAILAINLAVLSCIVFINIILRYGFQTSILSVDELSRYLFVWLTFIGAIVAFMDNAHVQVTFLVEKLSPAWQRRV
ALVTHSLILFICGALAWGATLKTIQDWSDYSPILGLPIGLMYAACLPTSLVIAFFELRHLYQLITRSNSLTSPPQGA
>P37676 ~~~yiaO~~~2,3-diketo-L-gulonate-binding periplasmic protein YiaO~~~COG1638
MKLRSVTYALFIAGLAAFSTSSLAAQSLRFGYETSQTDSQHIAAKKFNDLLQERTKGELKLKLFPDSTLGNAQAMISGVR
GGTIDMEMSGSNNFAGLSPVMNLLDVPFLFRDTAHAHKTLDGKVGDDLKASLEGKGLKVLAYWENGWRDVTNSRAPVKTP
ADLKGLKIRTNNSPMNIAAFKVFGANPIPMPFAEVYTGLETRTIDAQEHPINVVWSAKFFEVQKFLSLTHHAYSPLLVVI
NKAKFDGLSPEFQQALVSSAQEAGNYQRKLVAEDQQKIIDGMKEAGVEVITDLDRKAFSDALGNQVRDMFVKDVPQGADL
LKAVDEVQ
>P37683 ~~~yiaV~~~Inner membrane protein YiaV~~~COG1566
MDLLIILTYVAFAWAMFKIFKIPVNKWTIPTAALGGIFIVSGLILLMNYNHPYTFKAQKAVISIPVVPQVTGVVIEVTDK
KNTLIKKGEVLFRLDPTRYQARVDRLMADIVTAEHKQRALGAELDEMAANTQQAKATRDKFAKEYQRYARGSQAKVNPFS
ERDIDVARQNYLAQEASVKSSAAEQKQIQSQLDSLVLGEHSQIASLKAQLAEAKYNLEQTIVRAPSDGYVTQVLIRPGTY
AASLPLRPVMVFIPDQKRQIVAQFRQNSLLRLAPGDDAEVVFNALPGKVFSGKLAAISPAVPGGAYQSTGTLQTLNTAPG
SDGVIATIELDEHTDLSALPDGIYAQVAVYSDHFSHVSVMRKVLLRMTSWVHYLYLDH
>P0ADK4 ~~~yiaW~~~Inner membrane protein YiaW~~~
MFLDYFALGVLIFVFLVIFYGIIILHDIPYLIAKKRNHPHADAIHVAGWVSLFTLHVIWPFLWIWATLYRPERGWGMQSH
DSSVMQLQQRIAGLEKQLADIKSSSAE
>A1XPK1 ~~~yiaX1~~~Uncharacterized protein YiaX1~~~
MKNNTGYIIGAYPCAPSFHQKSEDEEKAFWRQLADTPDIRGLEQPCLEHLHPLGDEWLMRHTPADWQIVVTAIMETMRRR
GGNGGFGLASSDEEQRKACVAYYRHLYQKINAINAANAGKIVALELHAAPCASNPNVAQATDAFARSLKEVASWDWSCSL
VLEHCDAMTGPAPRKGFLPLENVLETLAGYEISVAINWARSAIEGQDTTLPLTHTRQASQAGKLGALMFSGTTLNGEYGE
WQDLHAPFSPFCAQSLMTHTHVRELLACAGSDALQFLGFKLLEINPDADVNHRIAILRDGIAALNKAQQ
>P0ADK6 ~~~yibA~~~Protein YibA~~~COG1413
MSNTYQKRKASKEYGLYNQCKKLNDDELFRLLDDHNSLKRISSARVLQLRGGQDAVRLAIEFCSDKNYIRRDIGAFILGQ
IKICKKCEDNVFNILNNMALNDKSACVRATAIESTAQRCKKNPIYSPKIVEQSQITAFDKSTNVRRATAFAISVINDKAT
IPLLINLLKDPNGDVRNWAAFAININKYDNSDIRDCFVEMLQDKNEEVRIEAIIGLSYRKDKRVLSVLCDELKKNTVYDD
IIEAAGELGDKTLLPVLDTMLYKFDDNEIITSAIDKLKRS
>P0ACA1 ~~~yibF~~~Uncharacterized GST-like protein YibF~~~COG0625
MKLVGSYTSPFVRKLSILLLEKGITFEFINELPYNADNGVAQFNPLGKVPVLVTEEGECWFDSPIIAEYIELMNVAPAML
PRDPLESLRVRKIEALADGIMDAGLVSVREQARPAAQQSEDELLRQREKINRSLDVLEGYLVDGTLKTDTVNLATIAIAC
AVGYLNFRRVAPGWCVDRPHLVKLVENLFSRESFARTEPPKA
>P0AFV0 ~~~yibH~~~Inner membrane protein YibH~~~COG1566
MDLLIVLTYVALAWAVFKIFRIPVNQWTLATAALGGVFLVSGLILLMNYNHPYTFTAQKAVIAIPITPQVTGIVTEVTDK
NNQLIQKGEVLFKLDPVRYQARVDRLQADLMTATHNIKTLRAQLTEAQANTTQVSAERDRLFKNYQRYLKGSQAAVNPFS
ERDIDDARQNFLAQDALVKGSVAEQAQIQSQLDSMVNGEQSQIVSLRAQLTEAKYNLEQTVIRAPSNGYVTQVLIRPGTY
AAALPLRPVMVFIPEQKRQIVAQFRQNSLLRLKPGDDAEVVFNALPGQVFHGKLTSILPVVPGGSYQAQGVLQSLTVVPG
TDGVLGTIELDPNDDIDALPDGIYAQVAVYSDHFSHVSVMRKVLLRMTSWMHYLYLDH
>P0ADK8 ~~~yibL~~~Uncharacterized protein YibL~~~
MKEVEKNEIKRLSDRLDAIRHQQADLSLVEAADKYAELEKEKATLEAEIARLREVHSQKLSKEAQKLMKMPFQRAITKKE
QADMGKLKKSVRGLVVVHPMTALGREMGLQEMTGFSKTAF
>P0AG27 ~~~yibN~~~Uncharacterized protein YibN~~~COG0607
MQEIMQFVGRHPILSIAWIALLVAVLVTTFKSLTSKVKVITRGEATRLINKEDAVVVDLRQRDDFRKGHIAGSINLLPSE
IKANNVGELEKHKDKPVIVVDGSGMQCQEPANALTKAGFAQVFVLKEGVAGWAGENLPLVRGK
>P0DSH3 ~~~yibX~~~Protein YibX~~~
MEKIPIRPFSDPASIISLCRCTLENSNAPLSLLCSATNKFILSARDSADLNEKGDFMNIFKPISYIASLAPREVTLLALV
>P0DSH4 ~~~yibY~~~Protein YibY~~~
MIIGMLRAHMITSLSPIPTMPSVNINKPKARFLYAI
>Q182S9 3.1.26.-~~~~~~Probable endoribonuclease YicC~~~COG1561
MAISMTGFGRGEYKDDNYYFLVECKTINHKYSDINIRLPRKISFLEDKVRNLVKNYVKRGRVDLYIKFDLLGKEDVNLNF
DEGLASQYIDILKEIKNKFDIIDDISVMNVAKFPDIVKIEEKEEDEDLLWSMLNQAVEDALIKLREMRSEEGKKLAEDIA
MRCDLLKNHIEEIEKYSSSVVEDYREKLNLRISELLDDPSIIDENRLAQEVAIYADKSSITEEIVRFKSHIGQLKNTIFK
DDSIGRKIDFLIQEMNRETNTIGSKSSDINITNLVVEVKSELEKIREQIQNIE
>P23839 3.1.26.-~~~yicC~~~Endoribonuclease YicC~~~COG1561
MIRSMTAYARREIKGEWGSATWEMRSVNQRYLETYFRLPEQFRSLEPVVRERIRSRLTRGKVECTLRYEPDVSAQGELIL
NEKLAKQLVTAANWVKMQSDEGEINPVDILRWPGVMAAQEQDLDAIAAEILAALDGTLDDFIVARETEGQALKALIEQRL
EGVTAEVVKVRSHMPEILQWQRERLVAKLEDAQVQLENNRLEQELVLLAQRIDVAEELDRLEAHVKETYNILKKKEAVGR
RLDFMMQEFNRESNTLASKSINAEVTNSAIELKVLIEQMREQIQNIE
>P44726 3.1.26.-~~~~~~Probable endoribonuclease YicC~~~COG1561
MIYSMTAFARLEVKKDWGDAVWEIRSVNQRYLENFFRLPEQFRGLENTLREKLRQNLTRGKIECSLRIETKKQANAELNL
NKELANQVIQSLQWIKAQAGEGEINLTDVLRYPGVVEAQEQDLDAISQDLLTAFDDLLTDFIAMRGREGEKLNDIIQQRL
DSIAVETDKVRSQMPAVLQWQRERLLQRFEDAQLNLDPQRVEQEMILLAQRIDVAEELDRLQMHVKETTNILKKGGAVGR
KLDFMMQELNRESNTLASKSINADITASAVELKVLIEQMREQIQNLE
>P0AGM2 ~~~yicG~~~UPF0126 inner membrane protein YicG~~~COG2860
MLLHILYLVGITAEAMTGALAAGRRRMDTFGVIIIATATAIGGGSVRDILLGHYPLGWVKHPEYVIIVATAAVLTTIVAP
VMPYLRKVFLVLDALGLVVFSIIGAQVALDMGHGPIIAVVAAVTTGVFGGVLRDMFCKRIPLVFQKELYAGVSFASAVLY
IALQHYVSNHDVVIISTLVFGFFARLLALRLKLGLPVFYYSHEGH
>P31435 ~~~yicJ~~~Inner membrane symporter YicJ~~~COG2211
MKSEVLSVKEKIGYGMGDAASHIIFDNVMLYMMFFYTDIFGIPAGFVGTMFLVARALDAISDPCMGLLADRTRSRWGKFR
PWVLFGALPFGIVCVLAYSTPDLSMNGKMIYAAITYTLLTLLYTVVNIPYCALGGVITNDPTQRISLQSWRFVLATAGGM
LSTVLMMPLVNLIGGDNKPLGFQGGIAVLSVVAFMMLAFCFFTTKERVEAPPTTTSMREDLRDIWQNDQWRIVGLLTIFN
ILAVCVRGGAMMYYVTWILGTPEVFVAFLTTYCVGNLIGSALAKPLTDWKCKVTIFWWTNALLAVISLAMFFVPMQASIT
MFVFIFVIGVLHQLVTPIQWVMMSDTVDYGEWCNGKRLTGISFAGTLFVLKLGLAFGGALIGWMLAYGGYDAAEKAQNSA
TISIIIALFTIVPAICYLLSAIIAKRYYSLTTHNLKTVMEQLAQGKRRCQQQFTSQEVQN
>P31437 ~~~yicL~~~Uncharacterized inner membrane transporter YicL~~~COG0697
MGSTRKGMLNVLIAAVLWGSSGVCAQYIMEQSQMSSQFLTMTRLIFAGLILLTLSFVHGDKIFSIINNHKDAISLLIFSV
VGALTVQLTFLLTIEKSNAATATVLQFLSPTIIVAWFSLVRKSRPGILVFCAILTSLVGTFLLVTHGNPTSLSISPAALF
WGIASAFAAAFYTTYPSTLIARYGTLPVVGWSMLIGGLILLPFYARQGTNFVVNGSLILAFFYLVVIGTSLTFSLYLKGA
QLIGGPKASILSCAEPLSSALLSLLLLGITFTLPDWLGTLLILSSVILISMDSRRRARKINRPARHK
>P0A8Y5 3.1.3.23~~~yidA~~~Sugar phosphatase YidA~~~COG0561
MAIKLIAIDMDGTLLLPDHTISPAVKNAIAAARARGVNVVLTTGRPYAGVHNYLKELHMEQPGDYCITYNGALVQKAADG
STVAQTALSYDDYRFLEKLSREVGSHFHALDRTTLYTANRDISYYTVHESFVATIPLVFCEAEKMDPNTQFLKVMMIDEP
AILDQAIARIPQEVKEKYTVLKSAPYFLEILDKRVNKGTGVKSLADVLGIKPEEIMAIGDQENDIAMIEYAGVGVAMDNA
IPSVKEVANFVTKSNLEDGVAFAIEKYVLN
>Q9KDP2 ~~~yidC2~~~Membrane protein insertase YidC 2~~~COG0706
MNYMKRRLLLFAGILLLVALAGCSTTDPITSESEGIWNHFFVYPMSWLITTVANLLNGSYGLSIIIVTILIRLALLPLTL
KQQKSMRAMQVIRPEMEAIQKKYKEKGSKDPKVQQEMQKELLGLYQKHGVNPMAGCLPLFIQLPILMAFYFAIMRTEEIR
YHTFLWFDLGQPDYILPFVAGITTYFQFKMTMSHQQQMQKTNPSDSDNPMANMMQMQMKVMLYVMPVMIIIAGLSLPSAL
SLYWVIGNIFMIIQTYFIVVKAPPLEVEQTKQKSSKPNKA
>Q8DN93 ~~~yidC2~~~Membrane protein insertase YidC 2~~~COG0706
MGVKKKLKLTSLLGLSLLIMTACATNGVTSDITAESADFWSKLVYFFAEIIRFLSFDISIGVGIILFTVLIRTVLLPVFQ
VQMVASRKMQEAQPRIKALREQYPGRDMESRTKLEQEMRKVFKEMGVRQSDSLWPILIQMPVILALFQALSRVDFLKTGH
FLWINLGSVDTTLVLPILAAVFTFLSTWLSNKALSERNGATTAMMYGIPVLIFIFAVYAPGGVALYWTVSNAYQVLQTYF
LNNPFKIIAEREAVVQAQKDLENRKRKAKKKAQKTK
>P25714 ~~~yidC~~~Membrane protein insertase YidC~~~COG0706
MDSQRNLLVIALLFVSFMIWQAWEQDKNPQPQAQQTTQTTTTAAGSAADQGVPASGQGKLISVKTDVLDLTINTRGGDVE
QALLPAYPKELNSTQPFQLLETSPQFIYQAQSGLTGRDGPDNPANGPRPLYNVEKDAYVLAEGQNELQVPMTYTDAAGNT
FTKTFVLKRGDYAVNVNYNVQNAGEKPLEISSFGQLKQSITLPPHLDTGSSNFALHTFRGAAYSTPDEKYEKYKFDTIAD
NENLNISSKGGWVAMLQQYFATAWIPHNDGTNNFYTANLGNGIAAIGYKSQPVLVQPGQTGAMNSTLWVGPEIQDKMAAV
APHLDLTVDYGWLWFISQPLFKLLKWIHSFVGNWGFSIIIITFIVRGIMYPLTKAQYTSMAKMRMLQPKIQAMRERLGDD
KQRISQEMMALYKAEKVNPLGGCFPLLIQMPIFLALYYMLMGSVELRQAPFALWIHDLSAQDPYYILPILMGVTMFFIQK
MSPTTVTDPMQQKIMTFMPVIFTVFFLWFPSGLVLYYIVSNLVTIIQQQLIYRGLEKRGLHSREKKKS
>Q1R4M9 ~~~yidC~~~Membrane protein insertase YidC~~~
MDSQRNLLVIALLFVSFMIWQAWEQDKNPQPQAQQTTQTTTTAAGSAADQGVPASGQGKLISVKTDVLDLTINTRGGDVE
QALLPAYPKELNSTQPFQLLETSPQFIYQAQSGLTGRDGPDNPANGPRPLYNVEKDAYVLAEGQNELQVPMTYTDAAGNT
FTKTFVLKRGDYAVNVNYNVQNAGEKPLEISTFGQLKQSITLPPHLDTGSSNFALHTFRGAAYSTPDEKYEKYKFDTIAD
NENLNISSKGGWVAMLQQYFATAWIPHNDGTNNFYTANLGNGIAAIGYKSQPVLVQPGQTGAMNSTLWVGPEIQDKMAAV
APHLDLTVDYGWLWFISQPLFKLLKWIHSFVGNWGFSIIIITFIVRGIMYPLTKAQYTSMAKMRMLQPKIQAMRERLGDD
KQRISQEMMALYKAEKVNPLGGCFPLLIQMPIFLALYYMLMGSVELRQAPFALWIHDLSAQDPYYILPILMGVTMFFIQK
MSPTTVTDPMQQKIMTFMPVIFTVFFLWFPSGLVLYYIVSNLVTIIQQQLIYRGLEKRGLHSREKKKS
>P9WIT5 ~~~yidC~~~Membrane protein insertase YidC~~~COG0706
MSLLFDFFSLDFIYYPVSWIMWVWYRLFAFVLGPSNFFAWALSVMFLVFTLRALLYKPFVRQIRTTRQMQELQPQIKALQ
KKYGKDRQRMALEMQKLQREHGFNPILGCLPMLAQIPVFLGLYHVLRSFNRTTGGFGQPHLSVIENRLTGNYVFSPVDVG
HFLDANLFGAPIGAYMTQRSGLDAFVDFSRPALIAVGVPVMILAGIATYFNSRASIARQSAEAAANPQTAMMNKLALYVF
PLGVVVGGPFLPLAIILYWFSNNIWTFGQQHYVFGMIEKEEEAKKQEAVRRRAANAPAPGAKPKRSPKTAPATNAAAPTE
AGDTDDGAESDASTERPADTSNPARRNSGPSARTPRPGVRPKKRKR
>P65629 ~~~yidC~~~Membrane protein insertase YidC~~~
MKKKALLPLFLGIMVFLAGCDYSKPEKRSGFFYNTFVDPMKNVLDWLGNNLLNDNYGLAIIILVLVIRIILLPFMLSNYK
NSHMMRQKMKVAKPEVEKIQEKVKRARTQEEKMAANQELMQVYKKYDMNPIKSMLGCLPMLIQLPIIMGLYFVLKDQLVD
GLFKYPHFLWFDLGRPDIWITIIAGVLYFIQAYVSSKTMPDEQRQMGYMMMVISPIMIIWISLSSASALGLYWSVSAAFL
VVQTHFANIYYEKVAKKEVQPFIEAYEREHNGGSNKKGKNTQVVSKKKKK
>P74155 ~~~yidC~~~Membrane protein insertase YidC~~~COG0706
MDFGIGFISTNIMLPILDFFFGIVHSYGFAIIALTLVIRLGLYPLSAGQIRNMRKMRITQPLMKERQEEIQKRYKDDPAK
QQEEMAKVMKEFGNPLAGCLPLLLQMPILFALFATLRGSPFSDINYTVDLQILPQEQVERIVPQTFSTKPQNIYVDEALH
YPIAVFLPGGKMLGVGEKTQLEIQSTEGKAFNQVIPEKNSQILTPTYSVTKGEDRISVNPDGTIEALVPGDATVQVTIPG
IAARTGFLFIKALGQVGVTGENGEINWDILGMIVFFGFSIYLNQELSGASGGGAPNAQAQQQQTINKITPILFSGMFLFF
PLPAGVLMYIVMANVFQTIQTLILMREPLPENLQKLLDEQQKATQGRESLPFEKKSSKKKEKTS
>Q9X1H2 ~~~yidC~~~Membrane protein insertase YidC~~~COG0706
MVLRKVVAILLAILPIFLFAVEPIKVVRSEKEIVVLTRFEEYHFDLEKGILKDFYTLVDGRKHVFTYGNDGFDVLDEGTP
LTVIEEPIVTGVGKVSEGFSDEVSIVYNYGYVKKIFTIKNNENYTFFVDIESSKPVDVTVPRVSVDTSTDRYLENYFASF
NPKTRTLVLLKHDEGLLFEGTLKVNGQKRFIVFMGPNKRTLIKKAFPEDYDVLIKALVNIPGFNKWYDSVFYGLVWFFWW
LKDLTKNFGWAIMLFTLIVRLILYPLYHAQTKSLINMRKLQPQIEAIKKKYKDPTKQQEALLKLYREAGVNPASGCLMLL
IQLPIFMLLWSVIRYYVEEFAYSGSFLIWKDLSAGGFSNNWLFLVITIVASYYTTLLTSQDARTAWQGIIMSVIFPFLFV
GLPSGLFLYYATNTLIQLAVTYYTYKRYKIKGLTTRELLGLPKKA
>P0A8C8 ~~~yidD~~~Putative membrane protein insertion efficiency factor~~~COG0759
MAPPLSPGSRVLIALIRVYQRLISPLLGPHCRFTPTCSSYGIEALRRFGVIKGSWLTVKRVLKCHPLHPGGDDPVPPGPF
DTREH
>Q97NX8 ~~~~~~Putative membrane protein insertion efficiency factor~~~COG0759
MKRILIAPVRFYQRFISPVFPPSCRFELTCSNYMIQAIEKHGFKGVLMGLARILRCHPWSKTGKDPVPDRFSLKRNQEGE
>P0ADL6 ~~~yidG~~~Inner membrane protein YidG~~~
MPDSRKARRIADPGLQPERTSLAWFRTMLGYGALMALAIKHNWHQAGMLFWISIGILAIVALILWHYTRNRNLMDVTNSD
FSQFHVVRDKFLISLAVLSLAILFAVTHIHQLIVFIERVA
>P0ADM0 ~~~yidH~~~Inner membrane protein YidH~~~COG2149
MKISRLGEAPDYRFSLANERTFLAWIRTALGFLAAGVGLDQLAPDFATPVIRELLALLLCLFSGGLAMYGYLRWLRNEKA
MRLKEDLPYTNSLLIISLILMVVAVIVMGLVLYAG
>P31446 ~~~yidI~~~Inner membrane protein YidI~~~
MGIIAQNKISSLGMLFGAIALMMGIIHFSFGPFSAPPPTFESIVADKTAEIKRGLLAGIKGEKITTVEKKEDVDVDKILN
QSGIALAIAALLCAFIGGMRKENRWGIRGALVFGGGTLAFHTLLFGIGIVCSILLIFLIFSFLTGGSLV
>P31448 ~~~yidK~~~Uncharacterized symporter YidK~~~COG4146
MNSLQILSFVGFTLLVAVITWWKVRKTDTGSQQGYFLAGRSLKAPVIAASLMLTNLSTEQLVGLSGQAYKSGMSVMGWEV
TSAVTLIFLALIFLPRYLKRGIATIPDFLEERYDKTTRIIIDFCFLIATGVCFLPIVLYSGALALNSLFHVGESLQISHG
AAIWLLVILLGLAGILYAVIGGLRAMAVADSINGIGLVIGGLMVPVFGLIAMGKGSFMQGIEQLTTVHAEKLNSIGGPTD
PLPIGAAFTGLILVNTFYWCTNQGIVQRTLASKSLAEGQKGALLTAVLKMLDPLVLVLPGLIAFHLYQDLPKADMAYPTL
VNNVLPVPMVGFFGAVLFGAVISTFNGFLNSASTLFSMGIYRRIINQNAEPQQLVTVGRKFGFFIAIVSVLVAPWIANAP
QGLYSWMKQLNGIYNVPLVTIIIMGFFFPRIPALAAKVAMGIGIISYITINYLVKFDFHFLYVLACTFCINVVVMLVIGF
IKPRATPFTFKDAFAVDMKPWKNVKIASIGILFAMIGVYAGLAEFGGYGTRWLAMISYFIAAVVIVYLIFDSWRHRHDPA
VTFTPDGKDSL
>P31463 ~~~yidZ~~~HTH-type transcriptional regulator YidZ~~~COG0583
MKKSITTLDLNLLLCLQLLMQERSVTKAAKRINVTPSAVSKSLAKLRAWFDDPLFVNSPLGLSPTPLMVSMEQNLAEWMQ
MSNLLLDKPHHQTPRGLKFELAAESPLMMIMLNALSKQIYQRYPQATIKLRNWDYDSLDAITRGEVDIGFSGRESHPRSR
ELLSSLPLAIDYEVLFSDVPCVWLRKDHPALHQTWNLDTFLRYPHISICWEQSDTWALDNVLQELGRERTIAMSLPEFEQ
SLFMAAQPDNLLLATAPRYCQYYNQLHQLPLVALPLPFDESQQKKLEVPFTLLWHKRNSHNPKIVWLRETIKNLYASMA
>P31467 3.1.3.-~~~yieH~~~6-phosphogluconate phosphatase~~~COG0637
MSRIEAVFFDCDGTLVDSEVICSRAYVTMFQEFGITLDPEEVFKRFKGVKLYEIIDIVSLEHGVTLAKTEAEHVYRAEVA
RLFDSELEAIEGAGALLSAITAPMCVVSNGPNNKMQHSMGKLNMLHYFPDKLFSGYDIQRWKPDPALMFHAAKAMNVNVE
NCILVDDSVAGAQSGIDAGMEVFYFCADPHNKPIVHPKVTTFTHLSQLPELWKARGWDITA
>P0ADN2 ~~~yifE~~~UPF0438 protein YifE~~~COG3085
MAESFTTTNRYFDNKHYPRGFSRHGDFTIKEAQLLERHGYAFNELDLGKREPVTEEEKLFVAVCRGEREPVTEAERVWSK
YMTRIKRPKRFHTLSGGKPQVEGAEDYTDSDD
>P27837 ~~~yifK~~~Probable transport protein YifK~~~COG1113
MADNKPELQRGLEARHIELIALGGTIGVGLFMGAASTLKWAGPSVLLAYIIAGLFVFFIMRSMGEMLFLEPVTGSFAVYA
HRYMSPFFGYLTAWSYWFMWMAVGISEITAIGVYVQFWFPEMAQWIPALIAVALVALANLAAVRLYGEIEFWFAMIKVTT
IIVMIVIGLGVIFFGFGNGGQSIGFSNLTEHGGFFAGGWKGFLTALCIVVASYQGVELIGITAGEAKNPQVTLRSAVGKV
LWRILIFYVGAIFVIVTIFPWNEIGSNGSPFVLTFAKIGITAAAGIINFVVLTAALSGCNSGMYSCGRMLYALAKNRQLP
AAMAKVSRHGVPVAGVAVSIAILLIGSCLNYIIPNPQRVFVYVYSASVLPGMVPWFVILISQLRFRRAHKAAIASHPFRS
ILFPWANYVTMAFLICVLIGMYFNEDTRMSLFVGIIFMLAVTAIYKVFGLNRHGKAHKLEE
>P0ADP0 3.1.3.104~~~yigB~~~5-amino-6-(5-phospho-D-ribitylamino)uracil phosphatase YigB~~~COG1011
MRFYRPLGRISALTFDLDDTLYDNRPVILRTEREALTFVQNYHPALRSFQNEDLQRLRQAVREAEPEIYHDVTRWRFRSI
EQAMLDAGLSAEEASAGAHAAMINFAKWRSRIDVPQQTHDTLKQLAKKWPLVAITNGNAQPELFGLGDYFEFVLRAGPHG
RSKPFSDMYFLAAEKLNVPIGEILHVGDDLTTDVGGAIRSGMQACWIRPENGDLMQTWDSRLLPHLEISRLASLTSLI
>P27843 ~~~yigG~~~Inner membrane protein YigG~~~
MLRIFIPTSNGKISRRRYIFSFILINFIFAFLIIFFNDGEAGFLVIVSTIVLHYLVINMNCQRLRDSGFIYIKTYVFGTL
AVYIISIITMIAEDFACSGNGSMIFLICYFSTFSMLMLAPTDSSKQ
>P0ADP2 3.1.2.20~~~yigI~~~Medium/long-chain acyl-CoA thioesterase YigI~~~COG2050
MSAVLTAEQALKLVGEMFVYHMPFNRALGMELERYEKEFAQLAFKNQPMMVGNWAQSILHGGVIASALDVAAGLVCVGST
LTRHETISEDELRQRLSRMGTIDLRVDYLRPGRGERFTATSSLLRAGNKVAVARVELHNEEQLYIASATATYMVG
>P27848 3.1.3.74~~~yigL~~~Pyridoxal phosphate phosphatase YigL~~~COG0561
MYQVVASDLDGTLLSPDHTLSPYAKETLKLLTARGINFVFATGRHHVDVGQIRDNLEIKSYMITSNGARVHDLDGNLIFA
HNLDRDIASDLFGVVNDNPDIITNVYRDDEWFMNRHRPEEMRFFKEAVFQYALYEPGLLEPEGVSKVFFTCDSHEQLLPL
EQAINARWGDRVNVSFSTLTCLEVMAGGVSKGHALEAVAKKLGYSLKDCIAFGDGMNDAEMLSMAGKGCIMGSAHQRLKD
LHPELEVIGTNADDAVPHYLRKLYLS
>P27862 ~~~yigZ~~~IMPACT family member YigZ~~~COG1739
MESWLIPAAPVTVVEEIKKSRFITMLAHTDGVEAAKAFVESVRAEHPDARHHCVAWVAGAPDDSQQLGFSDDGEPAGTAG
KPMLAQLMGSGVGEITAVVVRYYGGILLGTGGLVKAYGGGVNQALRQLTTQRKTPLTEYTLQCEYHQLTGIEALLGQCDG
KIINSDYQAFVLLRVALPAAKVAEFSAKLADFSRGSLQLLAIEE
>P0ADP9 ~~~yihD~~~Protein YihD~~~COG3084
MKCKRLNEVIELLQPAWQKEPDLNLLQFLQKLAKESGFDGELADLTDDILIYHLKMRDSAKDAVIPGLQKDYEEDFKTAL
LRARGVIKE
>P32129 2.3.-.-~~~yihG~~~Probable acyltransferase YihG~~~COG0204
MANLLNKFIMTRILAAITLLLSIVLTILVTIFCSVPIIIAGIVKLLLPVPVIWRKVSRFCDFMMYCWCEGLAVLLHLNPH
LQWEVHGLEGLSKKNWYLLICNHRSWADIVVLCVLFRKHIPMNKYFLKQQLAWVPFLGLACWSLDMPFMKRYSRAYLLRH
PERRGKDVETTRRSCEKFRLHPTTIVNFVEGSRFTQEKHQQTHSTFQNLLPPKAAGIAMALNVLGKQFDKLLNVTLCYPD
NNRQPFFDMLSGKLTRIVVHVDLQPIADELHGDYINDKSFKRHFQQWLNSLWQEKDRLLTSLMSSQRQNK
>P0A8H6 ~~~yihI~~~Der GTPase-activating protein YihI~~~COG3078
MKPSSSNSRSKGHAKARRKTREELDQEARDRKRQKKRRGHAPGSRAAGGNTTSGSKGQNAPKDPRIGSKTPIPLGVTEKV
TKQHKPKSEKPMLSPQAELELLETDERLDALLERLEAGETLSAEEQSWVDAKLDRIDELMQKLGLSYDDDEEEEEDEKQE
DMMRLLRGN
>P32135 ~~~yihN~~~Inner membrane protein YihN~~~COG2271
MLTKKKWALFSLLTLCGGTIYKLPSLKDAFYIPMQEYFHLTNGQIGNAMSVNSFVTTVGFFLSIYFADKLPRRYTMSFSL
IATGLLGVYLTTMPGYWGILFVWALFGVTCDMMNWPVLLKSVSRLGNSEQQGRLFGFFETGRGIVDTVVAFSALAVFTWF
GSGLLGFKAGIWFYSLIVIAVGIIIFFVLNDKEEAPSVEVKKEDGASKNTSMTSVLKDKTIWLIAFNVFFVYAVYCGLTF
FIPFLKNIYLLPVALVGAYGIINQYCLKMIGGPIGGMISDKILKSPSKYLCYTFIISTAALVLLIMLPHESMPVYLGMAC
TLGFGAIVFTQRAVFFAPIGEAKIAENKTGAAMALGSFIGYAPAMFCFSLYGYILDLNPGIIGYKIVFGIMACFAFSGAV
VSVMLVKRISQRKKEMLAAEA
>P32137 ~~~yihP~~~Putative 2,3-dihydroxypropane-1-sulfonate exporter~~~COG2211
MSHITTEDPATLRLPFKEKLSYGIGDLASNILLDIGTLYLLKFYTDVLGLPGTYGGIIFLISKFFTAFTDMGTGIMLDSR
RKIGPKGKFRPFILYASFPVTLLAIANFVGTPFDVTGKTVMATILFMLYGLFFSMMNCSYGAMVPAITKNPNERASLAAW
RQGGATLGLLLCTVGFVPVMNLIEGNQQLGYIFAATLFSLFGLLFMWICYSGVKERYVETQPANPAQKPGLLQSFRAIAG
NRPLFILCIANLCTLGAFNVKLAIQVYYTQYVLNDPILLSYMGFFSMGCIFIGVFLMPASVRRFGKKKVYIGGLLIWVLG
DLLNYFFGGGSVSFVAFSCLAFFGSAFVNSLNWALVSDTVEYGEWRTGVRSEGTVYTGFTFFRKVSQALAGFFPGWMLTQ
IGYVPNVAQADHTIEGLRQLIFIYPSALAVVTIVAMGCFYSLNEKMYVRIVEEIEARKRTA
>P32139 ~~~yihR~~~Uncharacterized protein YihR~~~COG2017
MSLIKVPCMQITNMHCSGQTVSLAAGDYHATIVTVGAGLAELTFQGCHLVIPHKPEEMPLAHLGKVLIPWPNRIANGCYR
YQGQEYQLPINEHSSKAAIHGLLAWRDWQISELTATSVTLTAFLPPSYGYPFMLASQVVYSLNAHTGLSVEIASQNIGTV
AAPYGVGIHPYLTCNLTSVDEYLFQLPANQVYAVDEHANPTTLHHVDELDLNFTQAKKIAATKIDHTFKTANDLWEMTIT
HPQQALSVSLCSDQLWVQVYSGEKLQRQGLAVEPMSCPPNAFNSGIDLLLLESGKPHRLFFNIYGQRK
>P0A8Y3 3.1.3.10~~~yihX~~~Alpha-D-glucose 1-phosphate phosphatase YihX~~~COG1011
MLYIFDLGNVIVDIDFNRVLGAWSDLTRIPLASLKKSFHMGEAFHQHERGEISDEAFAEALCHEMALPLSYEQFSHGWQA
VFVALRPEVIAIMHKLREQGHRVVVLSNTNRLHTTFWPEEYPEIRDAADHIYLSQDLGMRKPEARIYQHVLQAEGFSPSD
TVFFDDNADNIEGANQLGITSILVKDKTTIPDYFAKVLC
>P0A8K8 ~~~yihY~~~UPF0761 membrane protein YihY~~~COG1295
MLKTIQDKARHRTRPLWAWLKLLWQRIDEDNMTTLAGNLAYVSLLSLVPLVAVVFALFAAFPMFSDVSIQLRHFIFANFL
PATGDVIQRYIEQFVANSNKMTAVGACGLIVTALLLMYSIDSALNTIWRSKRARPKIYSFAVYWMILTLGPLLAGASLAI
SSYLLSLRWASDLNTVIDNVLRIFPLLLSWISFWLLYSIVPTIRVPNRDAIVGAFVAALLFEAGKKGFALYITMFPSYQL
IYGVLAVIPILFVWVYWTWCIVLLGAEITVTLGEYRKLKQAAEQEEDDEP
>P32157 ~~~yiiM~~~Protein YiiM~~~COG2258
MRYPVDVYTGKIQAYPEGKPSAIAKIQVDGELMLTELGLEGDEQAEKKVHGGPDRALCHYPREHYLYWAREFPEQAELFV
APAFGENLSTDGLTESNVYMGDIFRWGEALIQVSQPRSPCYKLNYHFDISDIAQLMQNTGKVGWLYSVIAPGKVSADAPL
ELVSRVSDVTVQEAAAIAWHMPFDDDQYHRLLSAAGLSKSWTRTMQKRRLSGKIEDFSRRLWGK
>P32162 ~~~yiiS~~~UPF0381 protein YiiS~~~COG3691
MKDVVDKCSTKGCAIDIGTVIDNDNCTSKFSRFFATREEAESFMTKLKELAAATSSADEGASVAYKIKDLEGQVELDAAF
TFSCQAEMIIFELSLRSLA
>P0AF40 ~~~yijD~~~Inner membrane protein YijD~~~
MKQANQDRGTLLLALVAGLSINGTFAALFSSIVPFSVFPIISLVLTVYCLHQRYLNRTMPVGLPGLAAACFILGVLLYST
VVRAEYPDIGSNFFPAVLSVIMVFWIGAKMRNRKQEVAE
>P0ABT8 ~~~yijE~~~Probable cystine transporter YijE~~~COG0697
MSAAGKSNPLAISGLVVLTLIWSYSWIFMKQVTSYIGAFDFTALRCIFGALVLFIVLLLRGRGMRPTPFKYTLAIALLQT
CGMVGLAQWALVSGGAGKVAILSYTMPFWVVIFAALFLGERLRRGQYFAILIAAFGLFLVLQPWQLDFSSMKSAMLAILS
GVSWGASAIVAKRLYARHPRVDLLSLTSWQMLYAALVMSVVALLVPQREIDWQPTVFWALAYSAILATALAWSLWLFVLK
NLPASIASLSTLAVPVCGVLFSWWLLGENPGAVEGSGIVLIVLALALVSRKKKEAVSVKRI
>O06722 3.1.3.-~~~yisI~~~Aspartyl-phosphate phosphatase YisI~~~
MNSKIEEMRITLIETAQKYGMNSKETIQCSQELDILLNTRIKEEMIFGRYLENSRM
>O06724 ~~~yisK~~~Uncharacterized protein YisK~~~COG0179
MKFATGELYNRMFVGLIIDDEKIMDLQKAEKKLFELETIPGSLIECIAEGDKFVAHARQLAEWAKKPNDELGSFMYSLSE
VKLHAPIPKPSKNIICIGKNYRDHAIEMGSEADIPEHPMVFTKSPVTVTGHGDIVKSHEEVTSQLDYEGELAVVIGKSGT
RISKEDAYDHVFGYTIVNDITARDLQKRHKQFFIGKSLDTTCPMGPVLVHKSSIQEPERLKVETRVNGELRQSGSASDMI
FSIPELIETLSKGMTLEAGDIIATGTPSGVGKGFTPPKFLRSGDKIDITIDPIGTLSNQIG
>O06728 3.1.7.6~~~yisP~~~Farnesyl diphosphate phosphatase YisP~~~COG1562
MKEIKEAYQQCGQIVGEYAPACFKALSYLPLKQRQASWAVLSFCHTAASADEKVLPAFEAKADHVYQRTNNGKQHLWKAF
DHAYRTFTLESEPFREFIAAQKEDAKPYDDLDELLMYAYRTGGAAGLMLLPILTRRKQDQLKQAAVSLGLAIQLVRFLSD
LGTDQQKNRIPRQVMQQFGYTEADLQKGTVNKAFTMTWEYIAFEAEAYLEECQDALPLFPQYSQKTVKAALHLHRAVLEK
IRAKQHDVFQYHFALTETEVKQILSDI
>O06741 5.-.-.-~~~yitF~~~Putative isomerase YitF~~~COG4948
MKIVRIETFPLFHRLEKPYGDANGFKRYRTCYLIRIITESGIDGWGECVDWLPALHVGFTKRIIPFLLGKQAGSRLSLVR
TIQKWHQRAASAVSMALTEIAAKAADCSVCELWGGRYREEIPVYASFQSYSDSPQWISRSVSNVEAQLKKGFEQIKVKIG
GTSFKEDVRHINALQHTAGSSITMILDANQSYDAAAAFKWERYFSEWTNIGWLEEPLPFDQPQDYAMLRSRLSVPVAGGE
NMKGPAQYVPLLSQRCLDIIQPDVMHVNGIDEFRDCLQLARYFGVRASAHAYDGSLSRLYALFAQACLPPWSKMKNDHIE
PIEWDVMENPFTDLVSLQPSKGMVHIPKGKGIGTEINMEIVNRYKWDGSAY
>O06745 ~~~yitJ~~~Bifunctional homocysteine S-methyltransferase/5,10-methylenetetrahydrofolate reductase~~~COG0646
MGLLEDLQRQVLIGDGAMGTLLYSYGIDRCFEELNISKPEEIQRIHKAYVEAGANIIQTNTYGANYIKLSRHGLEDDIKK
MNQEAVKIARASAGDAYVLGTMGGIRTFNKNAYSLDEIKRSFREQLYLLLHEEPDGLLLETYYDLEEAREVLKIARKETD
LPIMLNVSMHEQGVLQDGTPLSDALRSIADLGADIVGINCRLGPYHMIEALSEVPIFDDVFLSVYPNSSLPSLEEGRLVY
ETDDTYFQNSASEFRKQGARIIGGCCGTTPNHIRAMAEAVGGLAPITEKEVKTRAKEFISVHHERTEPGLDEIAAKKRSI
IVELDPPKKLSFDKFLSAAAELKEAGIDALTLADNSLATPRISNVACGALVKQQLDMRSLVHITCRDRNIIGLQSHLMGL
DTLGLNDVLAITGDPSKIGDFPGATSVYDLTSFDLIRLIKQFNEGLSLSGKPLGKKTNFSVAAAFNPNVRHLDKAVKRLE
KKIDCGADYFVSQPVYSEQQLVDIHNETKHLKTPIYIGIMPLTSSRNAEFIHNEIPGIKLSDTIREKMAHAGEDKEKQKA
EGLAIARSLLDTACELFNGIYLITPFLRSDLTAELTSYIQQKDEQRQNIFLH
>P70947 3.1.3.104~~~yitU~~~5-amino-6-(5-phospho-D-ribitylamino)uracil phosphatase YitU~~~COG0561
METKPYLIALDLDGTLLKDDKTISENTLHTIQRLKDDGHYVCISTGRPYRSSSMYYQQMELTTPIVNFNGAFVHHPQDDS
WGRYHTSLPLDVVKQLVDISESYNVHNVLAEVIDDVYFHYHDEHLIDAFNMNTTNVTVGDLRENLGEDVTSVLIHAKEED
VPAIRSYLSDVHAEVIDHRRWAAPWHVIEIIKSGMNKAVGLQKISDYYGVPRERIIAFGDEDNDLEMLEFAGCGVAMGNG
IDAVKQIANRTTATNEEDGVARFLKEYFSL
>Q7WY73 ~~~yizA~~~Uncharacterized protein YizA~~~COG2318
MMKFFEYNWQVRDQWFTWCHQLTTEELLKNRLGGVENILYTLFHIIDVEYSWIRAIQGKEDIAVQFADYQTLNKVKSLSN
TFRTEIIDVLQTHSDQIKDELVSVPWETGVLYTRDEILHHIIAHEIHHIGQLSVWARELKLSPVSASFIGRTLKPIHSY
>P09163 2.3.1.-~~~yjaB~~~Peptidyl-lysine N-acetyltransferase YjaB~~~COG0456
MVISIRRSRHEEGEELVAIWCRSVDATHDFLSAEYRTELEDLVRSFLPEAPLWVAVNERDQPVGFMLLSGQHMDALFIDP
DVRGCGVGRVLVEHALSMAPELTTNVNEQNEQAVGFYKKVGFKVTGRSEVDDLGKPYPLLNLAYVGA
>P27375 ~~~yjaZ~~~Protein YjaZ~~~
MKQEVEKWRPFGHPDGDIRDLSFLDAHQAVYVQHHEGKEPLEYRFWVTYSLHCFTKDYEHQTNEEKQSLMYHAPKESRPF
CQHRYNLARTHLKRTILALPESNVIHAGYGSYAVIEVDLDGGDKAFYFVAFRAFREKKKLRLHVTSAYPISEKQKGKSVK
FFTIAYNLLRNKQLPQPSK
>O31601 2.3.1.-~~~yjbC~~~Putative acetyltransferase YjbC~~~COG0456
MNWYEKLSEYFPIEEMKSKAHMEALLKERSDIYHKDEGKHHILMFAEFDSFIFVDYLYVSKDARGQGLGGKLIAKLKKKN
KPILLEVEPVDEDDTDTEKRLRFYQREHFKHAQSIGYRRRSLATNEVNKMEILYWSPKTESEEEILEAMKQTYENIHTYK
DEKWYGESYEKTDEVLEIIDEEKQKNIFDQLS
>P68206 ~~~yjbJ~~~UPF0337 protein YjbJ~~~COG3237
MNKDEAGGNWKQFKGKVKEQWGKLTDDDMTIIEGKRDQLVGKIQERYGYQKDQAEKEVVDWETRNEYRW
>O31609 3.6.1.-~~~yjbK~~~Putative triphosphatase YjbK~~~COG4116
MSQEIEIEFKNMLTKQEFKNIASALQLTEKDFTDQKNHYFDTDSFALKQKHAALRIRRKNGKYVLTLKEPADVGLLETHQ
QLSEVSDLAGFSVPEGPVKDQLHKLQIDTDAIQYFGSLATNRAEKETEKGLIVLDHSRYLNKEDYEIEFEAADWHEGRQA
FEKLLQQFSIPQRETKNKILRFYEEKRKSI
>O31611 2.7.6.5~~~yjbM~~~GTP pyrophosphokinase YjbM~~~COG2357
MDDKQWERFLVPYRQAVEELKVKLKGIRTLYEYEDDHSPIEFVTGRVKPVASILEKARRKSIPLHEIETMQDIAGLRIMC
QFVDDIQIVKEMLFARKDFTVVDQRDYIAEHKESGYRSYHLVVLYPLQTVSGEKHVLVEIQIRTLAMNFWATIEHSLNYK
YSGNIPEKVKLRLQRASEAASRLDEEMSEIRGEVQEAQAAFSRKKKGSEQQ
>P0AF50 ~~~yjbR~~~Uncharacterized protein YjbR~~~COG2315
MTISELLQYCMAKPGAEQSVHNDWKATQIKVEDVLFAMVKEVENRPAVSLKTSPELAELLRQQHSDVRPSRHLNKAHWST
VYLDGSLPDSQIYYLVDASYQQAVNLLPEEKRKLLVQL
>O31623 ~~~yjcA~~~Sporulation protein YjcA~~~
MVINGLTIVLLSLAVFRLARLLVFDTIMAPLRSLFHEEKEEKDADGNIETYIVIKGTGVRAFIGELLSCYWCTGVWCAGF
LILCQAVIPQAAQWLILLLAIAGLAGIIETLVSKWLQE
>O31626 5.6.2.4~~~yjcD~~~Putative ATP-dependent DNA helicase YjcD~~~COG0210
MKCARLNDRIIHLHTYSREHYQFLFEEGIKGHLFCSHCGKPVLLRLNIADPPEFIHRQPGDFPACEEACEPKPSKEGKKE
DDQESGVIRLPKGKAIAADPSPAVTEWHRPRSIKPGTPFVPKTIEPDTSLFPSVGLNTDQLKAVTETEGPLLVLAGAGSG
KTRVLTARAAHMIEHLGIPPENMLLVTFTTKAVAEMKERMANQYGLQPAKVRRIVTGTFHSLFYKILYHSNSAKWNGEHL
LKMEWQREQYIKKALYEEGIDEKESPVDQALQQIGFWKNTYVPNERIPLKDEWEKQVYRLYEHYERQKKEHSQFDFDDMA
SACYELFIERPDLLEQYQSRFTYILIDEFQDINPVQYKIMQMLASPEQNLCCVGDDDQSIYAFRGSNPSFILDFQKDYPG
AKTIYLTANYRSTHPIVSSADIVVKKNKNRYAKTLEAARDDIQVPVLFYPYDEEEEATMVVSDIKEKIQNGASPEDFAVL
YRTNSGGRAIYERLHQSSIPYTADRGVQSFYSRRIVRQILAYLYASQNEDDTEAIKHLLPALFLKQSALNTLKALSITED
CTMIKALAKLPDLKPFQLDKIKKIVPFFASLRTMKPVEAITFAEGKMGFSEYLKKRGNEGNKLEKGSDDLRDIKVVAKKF
KTIPDFLAHVDHMRAAEKNRTDEHGVQLMTIHRSKGLEFKTVYVLGTVDGSIPHDFSLETARKGDEAALEEERRLLYVAM
TRAKQHLYLSCPANRRGKTANRSRFLYPLLQKARQPLHH
>O31628 2.3.1.-~~~yjcF~~~Uncharacterized N-acetyltransferase YjcF~~~COG2153
MKAVIAKNEEQLKDAFYVREEVFVKEQNVPAEEEIDELENESEHIVVYDGEKPVGAGRWRMKDGYGKLERICVLKSHRSA
GVGGIIMKALEKAAADGGASGFILNAQTQAVPFYKKHGYRVLSEKEFLDAGIPHLQMMKD
>O31629 3.1.-.-~~~yjcG~~~Putative phosphoesterase YjcG~~~COG1514
MKYGIVLFPSKKLQDLANSYRKRYDPSYSLIPPHLTLRASFECAEEKADQLVSHLRNIAKESHPLVLKMTKYSSFAPVNN
VIYIKAEPTEELKTLNEKLYTGVLAGEQEYNFVPHVTVGQNLSDDEHSDVLGQLKMQEVSHEEIVDRFHLLYQLENGSWT
VYETFLLGRGE
>P0AF54 ~~~yjcH~~~Inner membrane protein YjcH~~~COG3162
MNGTIYQRIEDNAHFRELVEKRQRFATILSIIMLAVYIGFILLIAFAPGWLGTPLNPNTSVTRGIPIGVGVIVISFVLTG
IYIWRANGEFDRLNNEVLHEVQAS
>O31639 ~~~yjcQ~~~Uncharacterized protein YjcQ~~~
MNKDKLRYAILKEIFEGNTPLSENDIGVTEDQFDDAVNFLKREGYIIGVHYSDDRPHLYKLGPELTEKGENYLKENGTWS
KAYKTIKEIKDWIK
>O31641 ~~~yjcS~~~Uncharacterized protein YjcS~~~COG1359
MKSHKMMGGGISMHYITACLKIISDKDLNEIMKEFKKLEEETNKEEGCITFHAYPLEPSERKIMLWEIWENEEAVKIHFT
KKHTIDVQKQELTEVEWLMKSNVND
>C0H3Y8 ~~~yjcZ~~~Sporulation protein YjcZ~~~
MGFGYGFGGGYGGGCYGGYAGGYGGGYGSTFVLLVVLFILLIIVGASFF
>O31643 ~~~yjdB~~~Uncharacterized protein YjdB~~~
MNFKKTVVSALSISALALSVSGVASAHEINSTPTEVKNISISPTHVIKIQDYNLPLKVGETYSVKNNSATRYWTDNQKVA
EVDQNGLVTAKSKGKATITLFKGTAVFGKVYVTVY
>O31647 ~~~yjdF~~~Uncharacterized protein YjdF~~~
MSVTFPSSGCPSFFNIFYEEGIVMKLTIYYDGQFWVGVVEVVDNGKLRAFRHLFGKEPRDSEVLEFVHNQLLNMMAQAEQ
EGVRLQGRRQKKINPKRLQRQVSKELKNAGVTSKAQEAIKLELEARKQKKKQIMKEQREHVKEQRYMLKKQKAKKKHRGK
>P39270 ~~~yjdF~~~Inner membrane protein YjdF~~~COG3647
MTRTLKPLILNTSALTLTLILIYTGISAHDKLTWLMEVTPVIIVVQLLLATARRYPLTPLLYTLIFLHAIILMVGGQYTY
AKVPVGFEVQEWLGLSRNPYDKLGHFFQGLVPALVAREILVRGMYVRGRKMVAFLVCCVALAISAMYELIEWWAALAMGQ
GADDFLGTQGDQWDTQSDMFCALLGALTTVIFLARFHCRQLRRFGLITG
>P39274 ~~~yjdJ~~~Uncharacterized protein YjdJ~~~COG2388
MEIREGHNKFYINDKQGKQIAEIVFVPTGENLAIIEHTDVDESLKGQGIGKQLVAKVVEKMRREKRKIIPLCPFAKHEFD
KTREYDDIRS
>P39277 ~~~yjeH~~~L-methionine/branched-chain amino acid exporter YjeH~~~COG0531
MSGLKQELGLAQGIGLLSTSLLGTGVFAVPALAALVAGNNSLWAWPVLIILVFPIAIVFAILGRHYPSAGGVAHFVGMAF
GSRLERVTGWLFLSVIPVGLPAALQIAAGFGQAMFGWHSWQLLLAELGTLALVWYIGTRGASSSANLQTVIAGLIVALIV
AIWWAGDIKPANIPFPAPGNIELTGLFAALSVMFWCFVGLEAFAHLASEFKNPERDFPRALMIGLLLAGLVYWGCTVVVL
HFDAYGEKMAAAASLPKIVVQLFGVGALWIACVIGYLACFASLNIYIQSFARLVWSQAQHNPDHYLARLSSRHIPNNALN
AVLGCCVVSTLVIHALEINLDALIIYANGIFIMIYLLCMLAGCKLLQGRYRLLAVVGGLLCVLLLAMVGWKSLYALIMLA
GLWLLLPKRKTPENGITT
>P39282 ~~~yjeM~~~Inner membrane transporter YjeM~~~COG0531
MPHTIKKMSLIGLILMIFTSVFGFANSPSAYYLMGYSAIPFYIFSALLFFIPFALMMAEMGAAYRKEEGGIYSWMNNSVG
PRFAFIGTFMWFSSYIIWMVSTSAKVWVPFSTFLYGSDMTQHWRIAGLEPTQVVGLLAVAWMILVTVVASKGINKIARIT
AVGGIAVMCLNLVLLLVSITILLLNGGHFAQDINFLASPNPGYQSGLAMLSFVVFAIFAYGGIEAVGGLVDKTENPEKNF
AKGIVFAAIVISIGYSLAIFLWGVSTNWQQVLSNGSVNLGNITYVLMKSLGMTLGNALHLSPEASLSLGVWFARITGLSM
FLAYTGAFFTLCYSPLKAIIQGTPKALWPEPMTRLNAMGMPSIAMWMQCGLVTVFILLVSFGGGTASAFFNKLTLMANVS
MTLPYLFLALAFPFFKARQDLDRPFVIFKTHLSAMIATVVVVLVVTFANVFTIIQPVVEAGDWDSTLWMIGGPVFFSLLA
MAIYQNYCSRVAKNPQWAVE
>P39284 ~~~yjeO~~~Inner membrane protein YjeO~~~
MSARMFVLCCIWFIVAFLWITITSALDKEWMIDGRGINNVCDVLMYLEEDDTRDVGVIMTLPLFFPFLWFALWRKKRGWF
MYATALAIFGYWLWQFFLRYQFCL
>O34438 ~~~yjfB~~~Uncharacterized protein YjfB~~~
MDIPALSVAMHQASLAQNVNIALTKKRLDTAQQNADQTLKMIQHPTLGQTIDVKA
>P33222 6.3.1.-~~~yjfC~~~Putative acid--amine ligase YjfC~~~COG0754
MLRHNVPVRRDLDQIAADNGFDFHIIDNEIYWDESRAYRFTLRQIEEQIEKPTAELHQMCLEVVDRAVKDEEILTQLAIP
PLYWDVIAESWRARDPSLYGRMDFAWCGNAPVKLLEYNADTPTSLYESAYFQWLWLEDARRSGIIPRDADQYNAIQERLI
SRFSELYSREPFYFCCCQDTDEDRSTVLYLQDCAQQAGQESRFIYIEDLGLGVGGVLTDLDDNVIQRAFKLYPLEWMMRD
DNGPLLRKRREQWVEPLWKSILSNKGLMPLLWRFFPGHPNLLASWFDGEKPQIAAGESYVRKPIYSREGGNVTIFDGKNN
VVDHADGDYADEPMIYQAFQPLPRFGDSYTLIGSWIVDDEACGMGIREDNTLITKDTSRFVPHYIAG
>P37772 ~~~yjfF~~~Inner membrane ABC transporter permease protein YjfF~~~COG1172
MIKRNLPLMITIGVFVLGYLYCLTQFPGFASTRVICNILTDNAFLGIIAVGMTFVILSGGIDLSVGSVIAFTGVFLAKVI
GDFGLSPLLAFPLVLVMGCAFGAFMGLLIDALKIPAFIITLAGMFFLRGVSYLVSEESIPINHPIYDTLSSLAWKIPGGG
RLSAMGLLMLAVVVIGIFLAHRTRFGNQVYAIGGNATSANLMGISTRSTTIRIYMLSTGLATLAGIVFSIYTQAGYALAG
VGVELDAIASVVIGGTLLSGGVGTVLGTLFGVAIQGLIQTYINFDGTLSSWWTKIAIGILLFIFIALQRGLTVLWENRQS
SPVTRVNIAQQ
>P0AF80 ~~~yjfL~~~UPF0719 inner membrane protein YjfL~~~COG3766
MHILDSLLAFSAYFFIGVAMVIIFLFIYSKITPHNEWQLIKNNNTAASLAFSGTLLGYVIPLSSAAINAVSIPDYFAWGG
IALVIQLLVFAGVRLYMPALSEKIINHNTAAGMFMGTAALAGGIFNAACMTW
>P0A8X0 ~~~yjgA~~~UPF0307 protein YjgA~~~COG3028
MTKQPEDWLDDVPGDDIEDEDDEIIWVSKSEIKRDAEELKRLGAEIVDLGKNALDKIPLDADLRAAIELAQRIKMEGRRR
QLQLIGKMLRQRDVEPIRQALDKLKNRHNQQVVLFHKLENLRDRLIDQGDDAIAEVLNLWPDADRQQLRTLIRNAKKEKE
GNKPPKSARQIFQYLRELAENEG
>O34960 ~~~yjgB~~~Uncharacterized protein YjgB~~~
MKKTMSAITAAAAVTSCFTGFGAASFSAPAKAAAQTNTLSENTNQSAAELVKNLYNTAYKGEMPQQAQGLTINKSTKGDV
HAAFGEPERPVGGDNRFDLYHWNMGQPGYGFSYHKDMTISEIRYFGTGVERQLNLGGVTPEVLQKQLGPVNRVLTVPFTD
EIDYVYDTGRYELHFVIGTDQTADHVNLKAK
>P39332 ~~~yjgH~~~RutC family protein YjgH~~~COG0251
MVERTAVFPAGRHSLYAEHRYSAAIRSGDLLFVSGQVGSREDGTPEPDFQQQVRLAFDNLHATLAAAGCTFDDIIDVTSF
HTDPENQFEDIMTVKNEIFSAPPYPNWTAVGVTWLAGFDFEIKVIARIPEQ
>P39336 ~~~yjgL~~~Uncharacterized protein YjgL~~~
MSKISDLNYSQHITLADNFKQKSEVLNTWRVGMNDFARIAGGQDNRRNILSPGAFLEFLAKIFTLGYVDFSKRSNEAGRN
MMAHIKSSSYSKDTNGNEKMKFYMNNPVGERADSPKVIIEISLSTITTMGTRQGHTAIIFPQPDGSTNRYEGKSFERKDE
SSLHLITNKVLACYQSEANKKIARLLNNNQELNNLQKLNNLQKLNNLLKLNNIQGLNNPQELNNPQNLNDSQELNNSQEL
NSPQELNDPQELNNSQDLNNSKVSCTVSVDSTITGLLKEPLNNALLAIRNEHLLLMPHVCDESISYLLGEKGILEEIDKL
YALNDHGIDNDKVGNNEINDIKVNLSHILIDSLDDAKVNLTPVIDSILETFSKSPYINDVRILDWCFNKSMQYFDDTKKI
KHACSVINHINLRSDQSKIAETLFFNLDKEPYKNSPELQGLIWNKLVVYVNEFNLSNREKTNLIQRLFDNVESIFNEVPV
SILVNDIFMNDFFMKNPEMINWYFPQLLKSYEGEKIYFDNLKYDLNDNDKESNKEILKNQPDNVIKEKLNNEYKLRFRMM
QTILQSRVNVLPYINEQRLNKLNPPENLRIAIEHFGWKNRPITA
>P39338 ~~~yjgN~~~Inner membrane protein YjgN~~~COG4269
MAQVINEMDVPSHSFVFHGTGERYFLICVVNVLLTIITLGIYLPWALMKCKRYLYANMEVNGQRFSYGITGGNVFVSCLF
FVFFYFAILMTVSADMPLVGCVLTLLLLVLLIFMAAKGLRHQALMTSLNGVRFSFNCSMKGFWWVTFFLPILMAIGMGTV
FFISTKMLPANSSSSVIISMVLMAIVGIVSIGIFNGTLYSLVMSFLWSNTSFGIHRFKVKLDTTYCIKYAILAFLALLPF
LAVAGYIIFDQILNAYDSSVYANDDIENLQQFMEMQRKMIIAQLIYYFGIAVSTSYLTVSLRNHFMSNLSLNDGRIRFRL
TLTYHGMLYRMCALVVISGITGGLAYPLLKIWMIDWQAKNTYLLGDLDDLPLINKEEQPDKGFLASISRGVMPSLPFL
>O34725 ~~~yjhA~~~Uncharacterized lipoprotein YjhA~~~
MKKVLLLLFVLTIGLALSACSQSSDASEKEKPKEKKSQEELEKELDKELKKGGEPKTKKDDQIHKIGETFKAGHTNFTVN
KVDRVQKGEYMNVGGAVNEETKTIKDDEERLIIEVTMENIGEDSISYNFIGFDLRDKNDQSVRPVFSIEEKGRILMGGTL
VSGKKVTGVLSYVIPKGEQKHYTLVYNPFLADTNSSNTEERVKDDIDYLVKLD
>P39352 ~~~yjhB~~~Putative metabolite transport protein YjhB~~~COG2814
MATAWYKQVNPPQRKALFSAWLGYVFDGFDFMMIFYILHIIKADLGITDIQATLIGTVAFIARPIGGGFFGAMADKYGRK
PMMMWAIFIYSVGTGLSGIATNLYMLAVCRFIVGLGMSGEYACASTYAVESWPKNLQSKASAFLVSGFSVGNIIAAQIIP
QFAEVYGWRNSFFIGLLPVLLVLWIRKSAPESQEWIEDKYKDKSTFLSVFRKPHLSISMIVFLVCFCLFGANWPINGLLP
SYLADNGVNTVVISTLMTIAGLGTLTGTIFFGFVGDKIGVKKAFVVGLITSFIFLCPLFFISVKNSSLIGLCLFGLMFTN
LGIAGLVPKFIYDYFPTKLRGLGTGLIYNLGATGGMAAPVLATYISGYYGLGVSLFIVTVAFSALLILLVGFDIPGKIYK
LSVAK
>P39353 1.-.-.-~~~yjhC~~~Uncharacterized oxidoreductase YjhC~~~COG0673
MINYGVVGVGYFGAELARFMNMHDNAKITCVYDPENGENIARELQCINMSSLDALVSSKLVDCVIVATPNYLHKEPVIKA
AKNKKHVFCEKPIALSYEDCVDMVKACKEAGVTFMAGHIMNFFNGVQYARKLIKEGVIGEILSCHTKRNGWENKQERLSW
KKMKEQSGGHLYHHIHELDCVQHLLGEIPETVTMIGGNLAHSGPGFGNEDDMLFMTLEFPSGKLATLEWGSAFNWPEHYV
IINGTKGSIKIDMQETAGSLRIGGQTKHFLVHETQEEDDDRRKGNMTSEMDGAIAYGHPGKKTPLWLASLIRKETLFLHN
ILCGAKPEEDYIDLLNGEAAMSAIATADAATLSRSQDRKVKISEIIKHTSVM
>P39358 4.2.1.82~~~yjhG~~~D-xylonate dehydratase YjhG~~~COG0129
MSVRNIFADESHDIYTVRTHADGPDGELPLTAEMLINRPSGDLFGMTMNAGMGWSPDELDRDGILLLSTLGGLRGADGKP
VALALHQGHYELDIQMKAAAEVIKANHALPYAVYVSDPCDGRTQGTTGMFDSLPYRNDASMVMRRLIRSLPDAKAVIGVA
SCDKGLPATMMALAAQHNIATVLVPGGATLPAKDGEDNGKVQTIGARFANGELSLQDARRAGCKACASSGGGCQFLGTAG
TSQVVAEGLGLAIPHSALAPSGEPVWREIARASARAALNLSQKGITTREILTDKAIENAMTVHAAFGGSTNLLLHIPAIA
HQAGCHIPTVDDWIRINKRVPRLVSVLPNGPVYHPTVNAFMAGGVPEVMLHLRSLGLLHEDVMTVTGSTLKENLDWWEHS
ERRQRFKQLLLDQEQINADEVIMSPQQAKARGLTSTITFPVGNIAPEGSVIKSTAIDPSMIDEQGIYYHKGVAKVYLSEK
SAIYDIKHDKIKAGDILVIIGVGPSGTGMEETYQVTSALKHLSYGKHVSLITDARFSGVSTGACIGHVGPEALAGGPIGK
LRTGDLIEIKIDCRELHGEVNFLGTRSDEQLPSQEEATAILNARPSHQDLLPDPELPDDTRLWAMLQAVSGGTWTGCIYD
VNKIGAALRDFMNKN
>P39359 4.1.2.28~~~yjhH~~~Probable 2-dehydro-3-deoxy-D-pentonate aldolase YjhH~~~COG0329
MKKFSGIIPPVSSTFHRDGTLDKKAMREVADFLINKGVDGLFYLGTGGEFSQMNTAQRMALAEEAVTIVDGRVPVLIGVG
SPSTDEAVKLAQHAQAYGADGIVAINPYYWKVAPRNLDDYYQQIARSVTLPVILYNFPDLTGQDLTPETVTRLALQNENI
VGIKDTIDSVGHLRTMINTVKSVRPSFSVFCGYDDHLLNTMLLGGDGAITASANFAPELSVGIYRAWREGDLATAATLNK
KLLQLPAIYALETPFVSLIKYSMQCVGLPVETYCLPPILEASEEAKDKVHVLLTAQGILPV
>P39367 ~~~yjhP~~~Uncharacterized protein YjhP~~~COG2519
MDIPRIFTISESEHRIHNPFTEEKYATLGRVLRMKPGTRILDLGSGSGEMLCTWARDHGITGTGIDMSSLFTAQAKRRAE
ELGVSERVHFIHNDAAGYVANEKCDVAACVGATWIAGGFAGAEELLAQSLKPGGIMLIGEPYWRQLPATEEIAQACGVSS
TSDFLTLPGLVGAFDDLGYDVVEMVLADQEGWDRYEAAKWLTMRRWLEANPDDDFAAEVRAELNIAPKRYVTYARECFGW
GVFALIAR
>P24203 3.6.5.-~~~yjiA~~~Zinc chaperone YjiA~~~COG0523
MNPIAVTLLTGFLGAGKTTLLRHILNEQHGYKIAVIENEFGEVSVDDQLIGDRATQIKTLTNGCICCSRSNELEDALLDL
LDNLDKGNIQFDRLVIECTGMADPGPIIQTFFSHEVLCQRYLLDGVIALVDAVHADEQMNQFTIAQSQVGYADRILLTKT
DVAGEAEKLHERLARINARAPVYTVTHGDIDLGLLFNTNGFMLEENVVSTKPRFHFIADKQNDISSIVVELDYPVDISEV
SRVMENLLLESADKLLRYKGMLWIDGEPNRLLFQGVQRLYSADWDRPWGDEKPHSTMVFIGIQLPEEEIRAAFAGLRK
>O34374 1.14.-.-~~~yjiB~~~Putative cytochrome P450 YjiB~~~COG2124
MNVLNRRQALQRALLNGKNKQDAYHPFPWYESMRKDAPVSFDEENQVWSVFLYDDVKKVVGDKELFSSCMPQQTSSIGNS
IINMDPPKHTKIRSVVNKAFTPRVMKQWEPRIQEITDELIQKFQGRSEFDLVHDFSYPLPVIVISELLGVPSAHMEQFKA
WSDLLVSTPKDKSEEAEKAFLEERDKCEEELAAFFAGIIEEKRNKPEQDIISILVEAEETGEKLSGEELIPFCTLLLVAG
NETTTNLISNAMYSILETPGVYEELRSHPELMPQAVEEALRFRAPAPVLRRIAKRDTEIGGHLIKEGDMVLAFVASANRD
EAKFDRPHMFDIRRHPNPHIAFGHGIHFCLGAPLARLEANIALTSLISAFPHMECVSITPIENSVIYGLKSFRVKM
>P39376 ~~~yjiE~~~HTH-type transcriptional regulator YjiE~~~COG0583
MDDCGAILHNIETKWLYDFLTLEKCRNFSQAAVSRNVSQPAFSRRIRALEQAIGVELFNRQVTPLQLSEQGKIFHSQIRH
LLQQLESNLAELRGGSDYAQRKIKIAAAHSLSLGLLPSIISQMPPLFTWAIEAIDVDEAVDKLREGQSDCIFSFHDEDLL
EAPFDHIRLFESQLFPVCASDEHGEALFNLAQPHFPLLNYSRNSYMGRLINRTLTRHSELSFSTFFVSSMSELLKQVALD
GCGIAWLPEYAIQQEIRSGKLVVLNRDELVIPIQAYAYRMNTRMNPVAERFWRELRELEIVLS
>P0AEH8 ~~~yjiG~~~Inner membrane protein YjiG~~~COG0700
MTTQVRKNVMDMFIDGARRGFTIATTNLLPNVVMAFVIIQALKITGLLDWVGHICEPVMALWGLPGEAATVLLAALMSMG
GAVGVAASLATAGALTGHDVTVLLPAMYLMGNPVQNVGRCLGTAEVNAKYYPHIITVCVINALLSIWVMQLIV
>Q8FA95 ~~~yjiK~~~Uncharacterized protein YjiK~~~COG3204
MTKSISLSKRIFVIVILFVIVAVCTFFVQSCARKSNHAASFQNYHATIDGKEIAGITNNISSLTWSAQSNTLFSTINKPA
AIVEMTTNGDLIRTIPLDFVKDLETIEYIGDNQFVISDERDYAIYVISLTPNSEVKILKKIKIPLQESPTNCGFEGLAYS
RQDHTFWFFKEKNPIEVYKVNGLLSSNELHISKDKALQRQFTLDDVSGAEFNQQKNTLLVLSHESRALQEVTLVGEVIGE
MSLTKGSRGLSHNIKQAEGVAMDASGNIYIVSEPNRFYRFTPQSSH
>P0ADD2 ~~~yjjB~~~Probable succinate transporter subunit YjjB~~~COG3610
MGVIEFLLALAQDMILAAIPAVGFAMVFNVPVRALRWCALLGSIGHGSRMILMTSGLNIEWSTFMASMLVGTIGIQWSRW
YLAHPKVFTVAAVIPMFPGISAYTAMISAVKISQLGYSEPLMITLLTNFLTASSIVGALSIGLSIPGLWLYRKRPRV
>A0A0H3FQN0 ~~~yjjB~~~Probable succinate transporter subunit YjjB~~~COG3610
MGIISYLFDLAQDMALAAIPAVGFAMVFNVPQRALRWCALLGAIGHGSRMVMMSAGFNIEWATFLAALLVGSIGIQWSRW
YLAHPKIFTVAAVIPMFPGISAYTAMISAVKISHFGYSEEMMIMLLSNFLKASSIVGALSIGLSIPGLWLYRKRPRV
>P0A8Y1 3.1.3.5~~~yjjG~~~Pyrimidine 5'-nucleotidase YjjG~~~COG1011
MKWDWIFFDADETLFTFDSFTGLQRMFLDYSVTFTAEDFQDYQAVNKPLWVDYQNGAITSLQLQHGRFESWAERLNVEPG
KLNEAFINAMAEICTPLPGAVSLLNAIRGNAKIGIITNGFSALQQVRLERTGLRDYFDLLVISEEVGVAKPNKKIFDYAL
EQAGNPDRSRVLMVGDTAESDILGGINAGLATCWLNAHHREQPEGIAPTWTVSSLHELEQLLCKH
>P37342 ~~~yjjI~~~Uncharacterized protein YjjI~~~COG1328
MPTSHENALQQRCQQIVTSPVLSPEQKRHFLALEAENNLPYPQLPAEARRALDEGVICDMFEGHAPYKPRYVLPDYARFL
ANGSEWLELEGAKDLDDALSLLTILYHHVPSVTSMPVYLGQLDALLQPYVRILTQDEIDVRIKRFWRYLDRTLPDAFMHA
NIGPSDSPITRAILRADAELKQVSPNLTFIYDPEITPDDLLLEVAKNICECSKPHIANGPVHDKIFTKGGYGIVSCYNSL
PLAGGGSTLVRLNLKAIAERSESLDDFFTRTLPHYCQQQIAIIDARCEFLYQQSHFFENSFLVKEGLINPERFVPMFGMY
GLAEAVNLLCEKEGIAARYGKEAAANEVGYRISAQLAEFVANTPVKYGWQKRAMLHAQSGISSDIGTTPGARLPYGDEPD
PITHLQTVAPHHAYYYSGISDILTLDETIKRNPQALVQLCLGAFKAGMREFTANVSGNDLVRVTGYMVRLSDLEKYRAEG
SRTNTTWLGEEAARNTRILERQPRVISHEQQMRFSQ
>P39410 2.-.-.-~~~yjjJ~~~Toxin YjjJ~~~COG3550
MSELTDLLLQGPRSAPELRQRLAISQATFSRLVAREDRVIRFGKARATRYALLRPYRGIERIPVWRVDDTGKAHKFADIR
LCWPQGSCLVTGADGDEQWFDGLPWYLTDLRPQGFLGRAWGRKLAAQLNLTDDIRLWQEEDVLYALTVFNGEYTGGWLVG
EGNYQRWITAQHPAEIPLDQKLTHYEQLASDALAGEIVGSSAGGEQPKFTYYAQTPSGNKHVLVKFTVPQQTAVSQRWGD
LLIAESIAAQILRDGGIHAIESTVLVTSNRQVFLEAERFDCKGNDGRLPIVSLEAVQSEFISSPGSWPQAMRRLCEQQLV
THQSVAQTEVIWAFGRLIANSDMHAGNLSFYLSEPPFALTPVYDMLPMVYAPNSAGMLRDAAIEVKFDLNVSKSAWLTAI
PLAQQFWQTVARDPRISEAFRHIAQEMPEKIRQIEEKVARMGG
>P0ADD5 ~~~yjjP~~~Probable succinate transporter subunit YjjP~~~COG2966
MQTEQQRAVTRLCIQCGLFLLQHGAESALVDELSSRLGRALGMDSVESSISSNAIVLTTIKDGQCLTSTRKNHDRGINMH
VVTEVQHIVILAEHHLLDYKGVEKRFSQIQPLRYPRWLVALMVGLSCACFCKLNNGGWDGAVITFFASTTAMYIRQLLAQ
RHLHPQINFCLTAFAATTISGLLLQLPTFSNTPTIAMAASVLLLVPGFPLINAVADMFKGHINTGLARWAIASLLTLATC
VGVVMALTIWGLRGWV
>A0A0H3FRB6 ~~~yjjP~~~Probable succinate transporter subunit YjjP~~~COG2966
MQADKSEQRAVTRLCIQCALYLLQHGAESALVEELSTRLGRALGMDSVESAISSNAIVLTTIKDGECLTSTRKNSDRGIN
MHVVTEVQHIVIMAEHKLLDYKDVEKRFSQIKPLRYPRWLLVLMVGLSCACFCKLNNGGWDGAVVTFFASTVAMYIRQLL
THRSMHPQINFCITAFVATTISGLMLRLPAFSETPTIAMAASVLLLVPGFPLINAVADMFKGHINTGLARWAIASLLTLA
TCIGVVMAMTMWGLRGWA
>P0ADD7 ~~~yjjQ~~~Putative transcription factor YjjQ~~~COG2197
MLPGCCKNGIVISKIPVMQAGLKEVMRTHFPEYEIISSASAEDLTLLQLRRSGLVIADLAGESEDPRSVCEHYYSLISQY
REIHWVFMVSRSWYSQAVELLMCPTATLLSDVEPIENLVKTVRSGNTHAERISAMLTSPAMTETHDFSYRSVILTLSERK
VLRLLGKGWGINQIASLLKKSNKTISAQKNSAMRRLAIHSNAEMYAWINSAQGARELNLPSVYGDAAEWNTAELRREMSH
S
>P39408 3.1.-.-~~~yjjV~~~Uncharacterized metal-dependent hydrolase YjjV~~~COG0084
MICRFIDTHCHFDFPPFSGDEEASLQRAAQAGVGKIIVPATEAENFARVLALAENYQPLYAALGLHPGMLEKHSDVSLEQ
LQQALERRPAKVVAVGEIGLDLFGDDPQFERQQWLLDEQLKLAKRYDLPVILHSRRTHDKLAMHLKRHDLPRTGVVHGFS
GSLQQAERFVQLGYKIGVGGTITYPRASKTRDVIAKLPLASLLLETDAPDMPLNGFQGQPNRPEQAARVFAVLCELRREP
ADEIAQALLNNTYTLFNVP
>P39409 1.97.1.-~~~yjjW~~~Putative glycyl-radical enzyme activating enzyme YjjW~~~COG1180
MNSRCALVSKIIPFSCVDGPGSRLALFLQGCNLRCKNCHNPWTMGRCNDCGECVPQCPHQALQIVDGKVVWNAVVCEQCD
TCLKRCPQHATPMAQSMSVDEVLSHVRKAVLFIEGITVSGGEATTQLPFVVALFTAIKNDPQLRHLTCLVDSNGMLSETG
WEKLLPVCDGAMLDLKAWGSECHQQLTGRDNQQIKRSIYLLAERGKLAELRLLVIPGQVDYLQHIEELAAFIKGLGDVPV
RLNAFHAHGVYGEAQSWASATPEDVEPLADALKVRGVSRLIFPALYL
>O34633 ~~~yjlC~~~Uncharacterized protein YjlC~~~
MPETIDQTNASVSQSQQDLIDQLLKPEVQESLTVLVDQLPKLTELVNILTKSYDFAQSVATDEVLKSDTVGAITEILEPV
KETAKEVAATAIEAKDRAEASNETIGLFGLLRMLKDPQAQKLFRFANSYLEVMNERENQK
>P80861 1.6.99.-~~~yjlD~~~NADH dehydrogenase-like protein YjlD~~~COG1252
MSKHIVILGAGYGGVLSALTVRKHYTKEQARVTVVNKYPTHQIITELHRLAAGNVSEKAVAMPLEKLFKGKDIDLKIAEV
SSFSVDKKEVALADGSTLTYDALVVGLGSVTAYFGIPGLEENSMVLKSAADANKVFQHVEDRVREYSKTKNEADATILIG
GGGLTGVELVGELADIMPNLAKKYGVDHKEIKLKLVEAGPKILPVLPDDLIERATASLEKRGVEFLTGLPVTNVEGNVID
LKDGSKVVANTFVWTGGVQGNPLVGESGLEVNRGRATVNDFLQSTSHEDVFVAGDSAVYFGPDGRPYPPTAQIAWQMGEL
IGYNLFAYLEGKTLETFKPVNSGTLASLGRKDAVAIIGANSTPLKGLPASLMKEASNVRYLTHIKGLFSLAY
>O34961 ~~~yjmB~~~Uncharacterized symporter YjmB~~~COG2211
MRTELANKVVSVETEKRLSLKEKMSYGFGDFGNGFMFDLGQIYLLKYFTDVAGIPAAMAGGIFLVSKLFAAITDPIVGSS
IDYRKNIGKRGKFRPYLLIGSIVLAVLTVLIFLSPNVSTTGKLIYAYASYMIWGIGYSFVNIPYGSLGAAMTQNSEDRTS
ISTFRQIGSLGALFITSVAVMPLLVKFDNPKVGYPVVMGLFAALGVFWFYICYRNCKERIIISEAPKEKLTLSSVVKTFI
TNKPLLTLVLMTIFSISAYNIKSAMLVYFAQYNLGNVELMAYMNFIIIGSSFLGVVFLPKLVKMFGKKRTAMIGFGISVA
ADLINFMLPSNVYVFTILASIAFIGISIPNGITWALVSDIIDYGEWKSGERKEATTYSLFNFSRKLAQSLSGFLSGIGLG
IIGYVPNAVQTAQALIGIKALLLLYPAIALALAMFIIGFLYKLTDQQHAQIVQDLHQKS
>O34736 1.1.1.-~~~yjmC~~~Uncharacterized oxidoreductase YjmC~~~COG2055
MKTITIAAEEAKELVWQKLDGAGLNERDAEKVADVLVHADLRNVHSHGVLHTEHYVNRLLAGGINPGAQPVFKETGPVTG
VLDGDDGFGHVNCDMAMDHAIDMAKKKGVGMVTAVNSSHCGALSYFVQKAADEKLIGMAMTHTDSIVVPFGGRTPILGTN
PIAYGVPAKHKKPFILDMATSKVAFGKILQAREEGKEIPEGWGVDENGEAVTDPDKVVSLSTFGGPKGYGLSIVVDVFSG
LLAGAAFGPHIAKMYNGLDQKRKLGHYVCAINPSFFTDWDTFLEQMDAMIDELQQSPPAVGFERVYVPGEIEQLHEERNK
KNGISIARSVYEFLKSR
>O35045 1.-.-.-~~~yjmD~~~Uncharacterized zinc-type alcohol dehydrogenase-like protein YjmD~~~COG1063
MKAVQVRKAYDLVTAEVKKPVLSKDDEVLVKVKRVGICGSDMHIYHGTNPLATLPRVIGHEVTGQVEAVGANVQSLKPGD
HVVIEPISYCGSCYACRKGRPNVCAKLSVFGVHEDGGMREYIVLPERQLHAVSKDLPWEEAVMAEPYTIGAQAVWRGQVE
KGDTVLIQGAGPIGICVLKMAKLAGAAVMMTDLNNERLAFAKENGADAVVNVQAEHVAERVLEWTGNEGANVVIDAVCLP
ETFALSIEAVSPAGHVVVLGFDERAAQISQLPITKKEVTITGSRLQTNQFPKVVELLNGGRLMHNGLVTHTFSVDDVHHA
FQFIKEHPDQVRKAVITFD
>O34334 ~~~yjoA~~~Uncharacterized protein YjoA~~~COG2318
MCQSNQIVSHFLSHRNVTNELAEKISKDHYSYKPAETSMSAEELVKHILTSFHLFANVIKEGNASPFQNKQEETETDLNV
LAKTYTEKTVAILEQLTEEQLDREIDLTSAFGRKVTGRALLQLAMEHEIHHKGNLFVYVREMGHTELPFYQQRM
>O34703 3.-.-.-~~~yjoB~~~Uncharacterized ATPase YjoB~~~COG0465
MTNIPFIYQYEEKENERAAAGYGTFGYLITRIEETLYDQYGVFYELYASDDPNTEYWELLVEDVRSGSLEPEHVAYIFEK
LEKKTFAYDEDEKEPDYTVHKSIRNSVYAYPEKGVAFARIPYFQDGSIMSFDCLFAVNDEKMRAFLEGVRPRLWEKSKRK
VTVFTDGDGGTSREQEAIVREVQRSQVIMNPLLKKEIYRSIDQFFHSDKSFYQTYDIPYKRGILLYGPPGNGKTTLVKSI
AGSIDAPVAYWQITEFTSSETIEEVFQAARRLAPAVLVIEDIDSMPEDVRSFFLNTLDGATSKEGLFLIGTTNYPEEIDP
GLMNRAGRFDRAYEIGLPDEELRLEYMKMRGFGIFLSEGEIKNAAKLTEGFSFAQLGELYVSSALQWHQEGNHHIETMVK
DMTGEQRKSQRGSWMERNKVGFH
>Q736M3 3.4.14.13~~~ykfC~~~Gamma-D-glutamyl-L-lysine dipeptidyl-peptidase~~~
MKKVGTAFLTTLFIFSSFTSAHAEEKKDSKAFIDVSAATLWTAPDSLRPIDVPSATNPVDLWKWTKSMTLDEKLWLTNAN
KLETQALLGQEVTVVDKKGDWVKVLVHGQPTPRNEEGYPGWMPEKQLTYNQEFADKTNEPFVLVTKPTAILYINPSEKHK
SLEVSYNTRLPLLSEDTISYRVLLPNGQKAWLRKNDGTFYRSQNDIPTPAADDLINTGKMFLGLPYIWAGTSGFGFDCSG
FTHTIYKSHGITIPRDSGPQSRNGVAVDKEHLQKGDLIFFAHDQGKGSVHHVAMYIGDGNMIHSPRAERSVEIIPLNTPG
YIEEYAGARRYLP
>O35010 3.4.14.13~~~ykfC~~~Gamma-D-glutamyl-L-lysine dipeptidyl-peptidase~~~COG0791
MMHTVISAVANIWTAPDSPRPSDQFMLQPTVMIRDWLERMTYDERLGLCTDNVIQTQVLFGEKVLVTAEQGEWVSVIVPS
QPSRKDPRGYPGWMKKYQLEKTKPIHTQHDVMISKPAAFLYRSNGEKEIELSFLTVLPLIAKENGYFKVSTVFGERFVRQ
SDAVPVSQQKGTAEDIIQTGAFFLGLPYLWGGISGFGFDCSGFMYSIFKANGYSIPRDAGDQAKAGKGVPLDDMKAGDLL
FFAYEEGKGAIHHVGLYVGGGKMLHSPKTGKSIEILTLTETIYEKELCAVRRCFSE
>P75677 ~~~ykfF~~~UPF0401 protein YkfF~~~
MTQSVLLPPGPFTRRQAQAVTTTYSNITLEDDQGSHFRLVVRDTEGRMVWRAWNFEPDAGEGLNRYIRTSGIRTDTATR
>P77692 ~~~ykfI~~~Toxin YkfI~~~
MKTLPAITQRAVKPCLSPVAVWQMLLTRLLEQHYGLTINDTPFCNEAVIKEHIDAGITLADAVNFLVEKYELVRIDRKGF
SWQEQSPYLRAADILRARQATGLLRQSRNNVVR
>C1P5Z8 ~~~ykgR~~~Uncharacterized membrane protein YkgR~~~
MKENKVQQISHKLINIVVFVAIVEYAYLFLHFY
>P0DPM8 ~~~ykgS~~~Protein YkgS~~~
MRMIGLLYDFKDYASKMAENMARLAALLHYFSGDGGDISVTG
>P0DPM9 ~~~ykgV~~~Protein YkgV~~~
MSAFKLPDTSQSQLISTAELAKIISYKSQTIRKWLCQDKLPEGLPRPKQINGRHYWLRKDVLDFIDTFSVRESL
>P0DPN1 ~~~ykiC~~~Protein YkiC~~~
MNYKAFTQIAIDLLSAKLCNCTQAIMTHIIASFLAFMFF
>P0DPN2 ~~~ykiD~~~Protein YkiD~~~
MTQRPWSKLQRKTHNIAALKIIARRSE
>P0DSH9 ~~~ykiE~~~Protein YkiE~~~
MGWSLCFWHVSVRMTHDVVCYY
>O31709 ~~~yknW~~~Membrane protein YknW~~~
METNVEKNSGTATEKPSLFGVITSPSVQFERIRERPAVWGPLLIVAAIIIVGAVLQSLGTDYSELLKSQDTQGLSAEQME
TVATITKFGGMAGAIIGGIAALFIAPLIYWLCVKVSGGVTTYKKMLSLSLFVSLISSLGLLVNGIVAFTTDVNPLYSTTS
LAGIIPSDGALASVLNTFEIFSIWSFVLLAIGLHKTGGISKKAGWISAIILFGILVVFSLFSGLINSVAGA
>O31710 ~~~yknX~~~Putative efflux system component YknX~~~COG0845
MKKVWIGIGIAVIVALFVGINIYRSAAPTSGSAGKEVQTGSVEENEISSTVMVPGTLKFSNEQYVFYEADKGTLEDIKVK
EGDKVKKGTALVTYTNEQLSLEKEQNQLTSESNRLQIDQIQEKLKALDSKERELEKQVGKKEAEKQIESERTELQMQKKT
AEIELKQTELQRQSLANRVSDLEVKSEIEGTVISVNQEAASKKSDIQEPVIHIGNPKDLVVSGKLSEYDTLKVKKGQKVT
LTSDVIQGKTWKGTVSAVGLVPDQQESAAAQGTEQAVQYPLQVKIKGNLPEGKPGFKFIMNIETDKRKANTLPSKAVKKE
DDQYYVYTVKDGKAKRVDVKIGEVTDDLTEIKEGLTQDDQVILNPSDQVTDGMEVKS
>O31711 7.6.2.-~~~yknY~~~Uncharacterized ABC transporter ATP-binding protein YknY~~~COG1136
MIQLSNVRKSYQIGKETFDVLHSIDLDIHQGEYVSIMGPSGSGKSTIMNIIGCLDRPTSGTYQLDGEDISSYKDKELAAV
RNRSIGFVFQQFQLLPRLNAKKNVELPMIYSGIGKKERQERAERALEKVGLADRMLHMPNELSGGQKQRVAIARAIVNEP
KLILADEPTGALDTKTSEAIMDQFTALNAEGTTIVLVTHEPEVADCTNRIVMVRDGNIVPASSGQRSVGE
>O31712 ~~~yknZ~~~Uncharacterized ABC transporter permease YknZ~~~COG0577
MSLLENIRMALSSVLAHKMRSILTMLGIIIGVGSVIVVVAVGQGGEQMLKQSISGPGNTVELYYMPSDEELASNPNAAAE
STFTENDIKGLKGIEGIKQVVASTSESMKARYHEEETDATVNGINDGYMNVNSLKIESGRTFTDNDFLAGNRVGIISQKM
AKELFDKTSPLGEVVWINGQPVEIIGVLKKVTGLLSFDLSEMYVPFNMMKSSFGTSDFSNVSLQVESADDIKSAGKEAAQ
LVNDNHGTEDSYQVMNMEEIAAGIGKVTAIMTTIIGSIAGISLLVGGIGVMNIMLVSVTERTREIGIRKSLGATRGQILT
QFLIESVVLTLIGGLVGIGIGYGGAALVSAIAGWPSLISWQVVCGGVLFSMLIGVIFGMLPANKAAKLDPIEALRYE
>O34572 ~~~ykoC~~~Putative HMP/thiamine permease protein YkoC~~~COG0619
MKQPLHLINPTVKAAAVFCCVVMLSFIYNPYTPACFYIIIVAGVLLAAGIPLKKWLLFTIPFLILAFGCVWTAAVFGKVP
TTPDNFLFQAGPISINSDNVSVGISLGFRILCFSALSMMFVFTTDPILFMLSLVQQCRLSPKLAYGVIAGFRFLPLLKDE
VQLIQQAHKIRGGAAESGIINKISALKRYTIPLLASAIRKAERTALAMESKGFTGSRNRTYYRTLSVNRRDWVFFCLVLL
LFAGSFLVSLCFAS
>O34362 7.6.2.-~~~ykoD~~~Putative HMP/thiamine import ATP-binding protein YkoD~~~COG1122
MQAFDELLTVEQLSFSYEEDEKPVFQDISFELQKGECVLLLGPSGCGKSSLALCLNGLYPEACDGIQSGHVFLFQKPVTD
AETSETITQHAGVVFQDPDQQFCMLTVEDEIAFGLENLQIPKEEMTEKINAVLEKLRITHLKEKMISTLSGGQKQKVALA
CILAMEPELIILDEPTSLLDPFSAREFVHLMKDLQREKGFSLLVIEHQLDEWAPWIERTIVLDKSGKKALDGLTKNLFQH
EAETLKKLGIAIPKVCHLQEKLSMPFTLSKEMLFKEPIPAGHVKKKKAPSGESVLEVSSLSFARGQQAIFKDISFSLREG
SLTALVGPNGTGKSTLLSVLASLMKPQSGKILLYDQPLQKYKEKELRKRMGFVFQNPEHQFVTDTVYDELLFGQKANAET
EKKAQHLLQRFGLAHLADHHPFAISQGQKRRLSVATMLMHDVKVLLLDEPTFGQDARTAAECMEMIQRIKAEGTAVLMIT
HDMELVSSYADSVLVLHDTGLAFDGSPAQLFSQETGLVQKAKLTLPLLYEWMAFQEEVRDEATVTSH
>O34738 ~~~ykoE~~~Putative HMP/thiamine permease protein YkoE~~~COG4721
MKSWKVKEIVIMSVISIVFAVVYLLFTHFGNVLAGMFGPIAYEPIYGIWFIVSVIAAYMIRKPGAALVSEIIAALVECLL
GNPSGPMVIVIGIVQGLGAEAVFLATRWKAYSLPVLMLAGMGSSVASFIYDLFVSGYAAYSPGYLLIMLVIRLISGALLA
GLLGKAVSDSLAYTGVLNGMALGKELKKKRKRASEHASL
>O34911 ~~~ykoF~~~Putative HMP/thiamine-binding protein YkoF~~~
MEHICGTSRIAGFRFSLYPMTDDFISVIKSALKKTDTSKVWTKTDHISTVLRGSIDHVFDAAKAIYLHAANSEQHIVMNG
TFSIGCPGDTQGDTYLSKGDKRVNEDAVRGLKAEAPCQFALYPMNEPDYMGLIMEAVDIAKAQGTFVQGVHYASELDGDA
HDVFSTLEAVFRMAEQQTNHITMTVNLSANSPSRKNRKQG
>O35012 ~~~ykoJ~~~Uncharacterized protein YkoJ~~~COG3212
MLKKKWMVGLLAGCLAAGGFSYNAFATENNENRQASSKTDALTEQEAEAIAKTVVDGTVEDIDRDLYNGKEVYEVEIEKE
GEDYDVYVDIHTKQALNDPLKEKAEQVAITKEEAEEIALKQTGGTVTESKLDEDDGAYIYEMEIQTKQGTETEFEISAKD
GRIIKQEIDD
>O34763 ~~~ykoL~~~Stress response protein YkoL~~~
MSNLLKSALEKERRHYSEKLYQIGVYNKEVMNKMTISELRKEYAYFFRSITNHKNYPYTR
>O34755 2.4.-.-~~~ykoT~~~Uncharacterized glycosyltransferase YkoT~~~COG0463
MKQSQPVLTIVVPCFNEEEVFQETSHQLTEVVDDLIEEKLIAEDSKILFVDDGSKDRTWALIAMESIRNKKVTGLKLACN
VGHQKALLAGLHKAKNRSDCVISIDADLQDDISVIRDFMLKYHEGCEIVYGVRRSRKTDTFFKRTTALGFYRLMNKLGIK
LIYNHADFRLMNKRSLEELERYPEANLFLRGIVPMIGFKSAEVLYDRKERFAGKTKYPLKKMLSFAFNGITSFSVAPIRF
FTLLGFVLFFLSAVAGIGAFIQKLLGHTNAGWASLIISIWFLGGLQLMGIGIIGEYIGTIFSEVKRRPKYAIDIDLYNEQ
LSPLQRQEKERLEKKYS
>P39759 2.3.2.-~~~ykqA~~~Putative gamma-glutamylcyclotransferase YkqA~~~COG2105
MNSLLFVYGTLRKHEKNHHLLAQSACINEQARTKGSLFAAKEGPTVVFNDEDEGYIYGEVYEADELCIHKLDQFFQGYHK
QTVFVETDVGIKIALIYFMNKDGCAGFTKISSGDWKEHQMISKSKNPIYYFAYGSCMDNARFQKAGVDHYFQDPVGRAVL
KGYTTRFTLKREDGSRADMLEDGGTTEGVLYRIPYSALSYLYKREGVESLTYRPAFVDVEAGGRHYKDCLTFLVLQKEAE
IAPPQHYQIEIERGAELYLSPEFTEKLKRHMNSLPKG
>Q45498 ~~~yktB~~~UPF0637 protein YktB~~~COG4493
MTQMRFTEEDFNTFTIEGLDARMEVLKETVRPKLTALGEHFAPTLSALTGDEMFPHVAKHARRSVNPPADSWVAFANSKR
GYKKLPHFQIGLWESHVFVWFAIIYESPIKEEYGKLLEVNQETITKNIPDSFVWSADHTKPGVHKQSEMDKEQLKTLFER
LQTVKKAELLCGIQLQKEEVLNMNNQEFLQRIDDAFKQLAFLYRLTQKVTQA
>O34816 2.-.-.-~~~ykuD~~~Putative L,D-transpeptidase YkuD~~~COG1376
MLTYQVKQGDTLNSIAADFRISTAALLQANPSLQAGLTAGQSIVIPGLPDPYTIPYHIAVSIGAKTLTLSLNNRVMKTYP
IAVGKILTQTPTGEFYIINRQRNPGGPFGAYWLSLSKQHYGIHGTNNPASIGKAVSKGCIRMHNKDVIELASIVPNGTRV
TINR
>O35014 ~~~ykuI~~~Uncharacterized EAL-domain containing protein YkuI~~~COG2200
MLDPLDILTNIDDVLPYYQAIFSAEEQKVVGYEVLGRILADSEIQSLGPFFLDAGIPEEYKLEVDNRIIRQALDRFLEAD
SDLLIFMNQDANLLMLDHGESFLELLKEYEAKGIELHRFVLEITEHNFEGDIEQLYHMLAYYRTYGIKIAVDNIGKESSN
LDRIALLSPDLLKIDLQALKVSQPSPSYEHVLYSISLLARKIGAALLYEDIEANFQLQYAWRNGGRYFQGYYLVSPSETF
LERDVLKQRLKTEFHQFITHEKKKLETVYEHSEQFYKRVHQAVTSLRKNNLSSDDDFIKKLAEELTDCSFRIYMCDEEGD
QLTGNVFKQDGEWIYQPEYAEKNWSWRPYFLENIMRMRNLRKGFFSDLYSDLETGEMIRTFSYPMDDQMYLFIDLPYSYL
YEQDGLI
>O34588 ~~~ykuJ~~~Uncharacterized protein YkuJ~~~COG4703
MSQLMGIITRLQSLQETAEAANEPMQRYFEVNGEKICSVKYFEKNQTFELTVFQKGEKPNTYPFDNIDMVSIEIFELLQ
>O34897 ~~~ykuT~~~Uncharacterized MscS family protein YkuT~~~COG0668
MDFIKQYDWAGLITNAGVLLIKLAIMILLYFIVRSLGMKIIKHLFAKFEEQNSLSIGRAHTLRSLTLNIFAYTLIFIFFV
MVLDLFHYDPSALLAGAGIVGLAVGFGAQGLVSDIVTGFFILLEKQLDVGDYITVSTFDGIVEQVGLRTTQIRSFDGTLH
YIPNRNITNVSNHSRGTMQALVDIKVPAERNIDEMIHILQQVCDETAAALPQIKEGPNVIGIQELGTSEIVIRVIAKTEN
MEQWRVERVLRKEIKNAFDRAFPKETE
>O31699 1.8.-.-~~~ykuV~~~Thiol-disulfide oxidoreductase YkuV~~~COG0526
MKLRQPMPELTGEKAWLNGEVTREQLIGEKPTLIHFWSISCHLCKEAMPQVNEFRDKYQDQLNVVAVHMPRSEDDLDPGK
IKETAAEHDITQPIFVDSDHALTDAFENEYVPAYYVFDKTGQLRHFQAGGSGMKMLEKRVNRVLAETE
>O31681 ~~~ykvP~~~Spore protein YkvP~~~COG1388
MKILFLESHPMWLYGLPNGFREAGHTVMISGPVSRENITEMIDEFKPDLIVSMGWTPEHSREKQAWIRNATKKVNIPLVY
WATEDPLHTQNFTIPLIKKMKPDFVFTVTPSLCKTYESMGIKAAHLDFAFHESIHHQIKPLAKYSCDIAVVANAYPNFLE
EHPEVFRSSSLKTLIRPLIRKGIRIDFWGRNWEKMSKYIGREIPRNWIHGYLDYTEAYKVYSSAKIIIGLQNCESQLTQR
TYEILGSGGFLLTSDTPAVRGKFKPGRDLIVSSSPKETLEKVKYYLNHDSERKKIQINGKKAVKNDSYRHRAERMLEVLK
SRGIIRNLGETIHYVDVLKEKYVIHHVTPGETLSIIASKYNVSLQQLMELNHFKSDQIYAGQIIKIREKLFRNINNKLF
>O31683 ~~~ykvR~~~Uncharacterized protein YkvR~~~
MKTLRLNNVTLEMAAYQEESEPKRKIAFTLNVTSETYHDIAVLLYEKTFNVEVPERDLAFRGEMTNYSTSLTNLYEPGAV
SEFYIEITEIDKNADS
>O31685 ~~~ykvT~~~Uncharacterized protein YkvT~~~COG3773
MTTKFTALAVFLLCFMPAAKIEHTQASLLSAKKTKHEEAKWLTHIDRNTNESFPSLSADKDKKIKPIKLSAHTEKKEKDK
PDKTNDEKETYTQSEKELLSRLVHAEAKGESYKGKVAVASVVLNRTEKKGFPDTIRGVIYQKNAFEPVANGSINQKPDKE
SIEAAEEALSSKNRETDAIFFYNPKTASDNWIRSRKIVEKIGRHVFAV
>O31686 ~~~ykvU~~~Sporulation protein YkvU~~~COG2244
MNRFVKGIILLSIAAFFAECLEFVVNMILARELGEHGMGLYMSILPTIFLIIVIASLELPISISKFIAESNPKLHESMLR
HAFRMTAIFTAFSTAAASIALPFIPVFDTYHPFIKGIVIGLIPVVAFTSIARGYFMGVQKMGKIAIANVLKKIIQLLCLF
LFFQWYSFELDMAVLISLFVLVVSDVVVLVYLYSQFIMARRALSGQQHIHLRGKDVRKRLLAVSIPTTGLRIFHAVVNAI
EPFLVKGALLAAGVAGTAAIDQYGMLAGVAVTIGSFPAFIAHSLMVVMIPSISEAYALSQYDIVLKRLKQSIFITLGYGI
PAVWVMFQFAGPLTHLFFHSPEAQYYLQLLWPYFLFHLFVMPLQACLIGMGFVKEAFYHNVWSHIVALSMMYVLGSMENL
QMLGIILGMNTGMILLTSLHYATICKALKVSVFLTGGTRTPRIEG
>O34948 1.1.-.-~~~ykwC~~~Uncharacterized oxidoreductase YkwC~~~COG2084
MKKTIGFIGLGVMGKSMASHILNDGHPVLVYTRTKEKAESILQKGAIWKDTVKDLSKEADVIITMVGYPSDVEEVYFGSN
GIIENAKEGAYLIDMTTSKPSLAKKIAEAAKEKALFALDAPVSGGDIGAQNGTLAIMVGGEKEAFEACMPIFSLMGENIQ
YQGPAGSGQHTKMCNQIAIAAGMIGVAEAMAYAQKSGLEPENVLKSITTGAAGSWSLSNLAPRMLQGNFEPGFYVKHFIK
DMGIALEEAELMGEEMPGLSLAKSLYDKLAAQGEENSGTQSIYKLWVK
>O31697 ~~~ykzF~~~Uncharacterized protein YkzF~~~
MRMSLIGERFTEEEQKLLLNILINHEYAIELLSSEINDIETGTKNVDGTTYKKLVTLYDRFRFEN
>O07627 ~~~ylaC~~~RNA polymerase sigma factor YlaC~~~COG1595
MKHRDSIEDLYRQYYQEILNYLFRRTHHLETAKDLAQDTFVKALNGLASFRGHSSIRTWLYTIAHHTFINWYRRDVKYQF
TEISKNEGLTQTTYDQPEQYLSRTVKSETLRQELLKLKDQHQSVLILREFQELSYEEIAEILGWSLSKVNTTLHRARLEL
KKNMTKSREEERI
>P0AAS0 ~~~ylaC~~~Inner membrane protein YlaC~~~
MTEIQRLLTETIESLNTREKRDNKPRFSISFIRKHPGLFIGMYVAFFATLAVMLQSETLSGSVWLLVVLFILLNGFFFFD
VYPRYRYEDIDVLDFRVCYNGEWYNTRFVPAALVEAILNSPRVADVHKEQLQKMIVRKGELSFYDIFTLARAESTS
>O07638 ~~~ylaN~~~UPF0358 protein YlaN~~~COG4838
MASEMIVDHRQKAFELLKVDAEKILKLIRVQMDNLTMPQCPLYEEVLDTQMFGLSREIDFAVRLGLVDEKDGKDLLYTLE
RELSALHDAFTAK
>O34412 ~~~ylbF~~~Regulatory protein YlbF~~~COG3679
MYATMESVRLQSEAQQLAEMILQSETAENYRNCYKRLQEDEEAGRIIRSFIKIKEQYEDVQRFGKYHPDYREISRKMREI
KRELDLNDKVADFKRAENELQSILDEVSVEIGTAVSEHVKVPTGNPYFDGLSSCGGGCGSGGSCGCKVS
>O34765 ~~~ylbJ~~~Sporulation integral membrane protein YlbJ~~~COG3314
MNLSKINTLLIASFFLFLTATVISHPQASFEASKTGLSMWWEVVFPSLLPFFILSELLIGFGIVRFVGVLLEPFMRPIFR
VPGVGGFVLAMGMASGNPAGAKLTARLRQENQISRVEAERLASFTNSSNPLFIFGAVAVGFFQNASLGILLASAHYLGNL
AVGLTMRSYGRKEEQHLRRKKTIPFPSVKDALHALHTARLAEKRPLGKILGDAVTSSVQTLLMVGGFIILFSVFNRLLSV
VQLTDVLSVFTKGALTLFQLPTQLDIPLLSGMFEITLGSKLVSETDVSLLQKAIIVSFILGFSGFSVQAQVAGILSETDI
RFKPFFIARLLQGVYAAVFVMLLWKPLYTAWHDPYQYVFKSLNSSDMSTAFINGWNLLVQIGPAVTLCALFTYIIIFSYR
LTNGTKKG
>O34470 ~~~ylbL~~~Uncharacterized protein YlbL~~~COG3480
MLRKKHFSWMLVILILIAVLSFIKLPYYITKPGEATELASLIKVEGGYPEKGSLSLMTVKVGPANPFTYVWAKMHPYYEI
VPDESIKEEGESDKEYMKRQLQMMKSSQENAVIAAYQKAGKKVSYSFNGIYASSVVENMPAKGKIEVGDKIISADGKNYQ
SAEKLIDYISSKKAGDKVTLKIEREEKEKRVTLTLKQFPDEPDRAGIGVSLYTDRNVKVEPDIDFEIENIGGPSAGLMMS
LEIYNQLTKPDETKGYDIAGTGTIDVDGKVGPIGGIDQKVVAADKAGKDIFFAPNQNGASNSDYKNAVKTAKDIDSNMKI
VPVDTMQDAIDYLNKLKAKST
>O34468 2.3.1.-~~~ylbP~~~Uncharacterized N-acetyltransferase YlbP~~~COG0454
MTKVERLLINYKTLEEFKKFKEYGIQELSMLEELQDNIIENDSTSPFYGIYFGDKLVARMSLYQVNGKSNPYFDNRQDYL
ELWKLEVLPGYQNRGYGRALVEFAKSFKMPIRTNPRMKSAEFWNKMNFKTVKYDMARDKGENPLIWHPDMDREMTPGESA
>P21211 ~~~~~~Uncharacterized 12.2 kDa protein in lcrE 5'region~~~
MLSLDQIPHHIRHGIVGSRLIQIRGRVTQVTGTLLKAVVPGVRIGELCYLRNPDNSLSLQAEVIGFAQHQALLIPLGEMY
GISSNTEVSPTGQCIRLGWVNICWGRCWMV
>Q47272 ~~~ylcG~~~Uncharacterized protein YlcG~~~
MMFEFNMAELLRHRWGRLRLYRFPGSVLTDYRILKNYAKTLTGAGV
>P0DPN3 ~~~ylcJ~~~Protein YlcJ~~~
MSLVLCFLLMSLFFMYSFVLSRLWRKKIAIRLLLYIQDNVTLIVFLNKK
>P0DPN4 ~~~yldA~~~Protein YldA~~~
MAEAFYILIGFLIMAAIIVMAVLYLENHS
>P75804 1.1.5.-~~~yliI~~~Aldose sugar dehydrogenase YliI~~~COG2133
MHRQSFFLVPLICLSSALWAAPATVNVEVLQDKLDHPWALAFLPDNHGMLITLRGGELRHWQAGKGLSAPLSGVPDVWAH
GQGGLLDVVLAPDFAQSRRIWLSYSEVGDDGKAGTAVGYGRLSDDLSKVTDFRTVFRQMPKLSTGNHFGGRLVFDGKGYL
FIALGENNQRPTAQDLDKLQGKLVRLTDQGEIPDDNPFIKESGARAEIWSYGIRNPQGMAMNPWSNALWLNEHGPRGGDE
INIPQKGKNYGWPLATWGINYSGFKIPEAKGEIVAGTEQPVFYWKDSPAVSGMAFYNSDKFPQWQQKLFIGALKDKDVIV
MSVNGDKVTEDGRILTDRGQRIRDVRTGPDGYLYVLTDESSGELLKVSPRN
>P0DPN6 ~~~yliM~~~Protein YliM~~~
METFCYMKWPVRHHKSRRVSH
>P0DSE8 ~~~yljB~~~Protein YljB~~~
MNQKFEAVNAIDRNVTDVADANDR
>O31723 7.-.-.-~~~ylmA~~~Uncharacterized ABC transporter ATP-binding protein YlmA~~~COG1119
MILQLDNVSLKRNGKWILKDIHWKVEEKENWVLYGLNGAGKTALLNMLCSYYFPTSGEMQVLGHEFGKTELGEKLRRKIG
LVSAALQQKLYPADSAFEIALSGAYASIGLYETPSKETREKAIGLLEDLGAIEYADRRYETLSQGEKQRALIARALMADP
ELLILDEPVTGLDFIAREKLLDTITYIANKENAPSILYVTHHAEEILPVFDKALLLKQGEVFGSGEIKEMLTDQILSAFF
DTPIHVLWNQDRPFLTRAEPITNA
>O31725 ~~~ylmC~~~Probable sporulation protein YlmC~~~COG1873
MISISEFQVKDVVNVSNGKKLGSIGDIDINVTTGKIQAIILGGNGKVLGFFGKEEELVIPWRNIVKIGEDVILVRLSEPH
A
>O34441 3.1.26.-~~~yloC~~~Endoribonuclease YloC~~~COG1561
MIRSMTGFGSASKTQDDLSVSVELKSVNYRFKEVNARLPRPLLYFEDKLKQTILQHIQRGRIELFVSVDSDMLVEKRLEI
DWPLLDEFVAAARDMKKRYQLSAEPDVMDFFKLDHVVQVHEEQTQNDKLEALIIDAAEEAVKGLCEMREKEGLLLAKDCL
MHIDQLEELVRETELLAADVVSRYRERLYARIKEWTEDVLDESRLVTECAIFADRSDITEEITRLKSHFAQFRDILASGG
AVGRKLDFLVQELNREANTIGSKANDHQITKLVVEMKSSIEKIKEQVQNIE
>P27461 ~~~ylpA~~~Lipoprotein YlpA~~~
MKKNMKLIAITAVLSSVLVLSGCGAMSTAIKKRNLEVKTQMSETIWLEPSSQKTVYLQIKNTSDKNMLGLAPKITKAVQD
KGYTVTSSPEDAHYWIQANVLKADKMDLREAEGFLSQGYQGAALGAALGAGITGYNSNSAGASLGVGLAAGLVGMVADAM
VEDINYTMVTDVQISEKTDTPLQTDNVAALKQGTSGYKVQTSTQTGNKHQYQTRVVSSANKVNLKFEEAQPVLEDQLAKS
IANIL
>O31737 ~~~ylqB~~~Uncharacterized protein YlqB~~~
MKKIGLLFMLCLAALFTIGFPAQQADAAEAPYKASITNISTDGGVYGKINYGQGQYWRVKYNITVSGKLLDQNGQPVPNA
PVRFEADTKVGNTTQTASGTTDANGTFEVPMYLGPAAGYYTYYTSVSVHYYDIIPFRVFSGESRLVSTDNSLYHFAYQVR
R
>P40742 ~~~ylxH~~~Flagellum site-determining protein YlxH~~~COG0455
MQMNRYDQAATLRAKMEKRERVLPMVYSQKAKTLAVISGKGGVGKSNITLNMALALQDKGKKVLLIDLDIGMGNIDILIG
NSSSATIIDVLTDRKPLLQSLSVGPKGLRYISGGTGLDVMFQLDQRKWTFFANELSHALSQFDYVLFDMGAGLSKDQLPF
ILSAEDILIITTPEPTAIMDAYSAVKHLVLTENKLSMKVAVNRCRDQKEGLDAFARLSRTIHMFLDVQVQFAGSVSDDVI
VSKAVVEQVPFFIKSPQAKASRSVRILADALFEREETRHKEDKQTFIEKLSSFLMRRA
>P32729 ~~~rulQ~~~RNA-binding protein YlxQ~~~COG1358
MSGMEWFPLLGLANRARKVVSGEDLVIKEIRNARAKLVLLTEDASSNTAKKVTDKCNYYKVPYKKVESRAVLGRSIGKEA
RVVVAVTDQGFANKLISLLD
>C0SPA3 ~~~ylxW~~~UPF0749 protein YlxW~~~COG3879
MRGKSAVLLSLIMLIAGFLISFSFQMTKENNKSAAETEEWKKEYALRDELLKQEKENKKFEKELYQKQNKVRQAENKLKK
EKSEYYNVLEDTEKYRMYIGEVGVQGEGVEVTLEDASYIPEGENVNSYIVHESHIFQVVNELYISGAAAVAVNGQRLTHD
SYIKCNGPVVTVDGVQHPAPFTVSAIGDPDVLLPSLNIAGGLIDQLSMDHISVSAEKEKNVQMKPILKTKE
>P50619 ~~~ymaB~~~Protein YmaB~~~COG4112
MGKMDEMILVAPRDDVFKKESLTFQGVYSEDSRVAEIMAQIEAAYREMRRGDAEEDPRFKQPIPYVVIKREDEVFLYERL
AGGGESRLHNKLSLGFGGHMNAIEGAASFAEVLKLNTDRELEEELQINEEDKQAIVTLGLINDDENSVGKVHIGILSALQ
LKPGAQVEVKEKEQIAGKWMKVSELKQDDIYNRLETWSQFVVDILE
>O31789 ~~~ymaC~~~UPF0714 protein YmaC~~~COG4195
MRRFLLNVILVLAIVLFLRYVHYSLEPEPSNQPDTYSNFSSLAENESPADYDISYNEKKGSKVLIMSPHGGRIEGGVSEL
VRYFNNEYSTYLFEGLKSHDNQTLHITSTNFDEPLAKKKIKEHQYVVAFHGYKGENKNTLVGGTDRKRAKMIVRALERRG
FSAELASSKSGLAGLNAENINNQGETGLSIQLEISREQREAFFDDFYYKNRKYTKNSEFYAYVSAIKGVLEKEYS
>O31779 ~~~ymcA~~~Uncharacterized protein YmcA~~~COG4550
MTLYSKKDIVQQARNLAKMISETEEVDFFKRAEAQINENDKVSTIVNQIKALQKQAVNLKHYEKHEALKQVEAKIDALQE
ELEEIPVIQEFRDSQMEVNDLLQLVAHTISNQVTNEIITSTGGDLLKGETGSKVKHSNNSCSL
>P0AAA5 ~~~ymcE~~~Uncharacterized protein YmcE~~~
MRRWISQNNIRLPRGAFFISALFFFNAVCIVSDNLLIIESFGEMAYNISYLTRVPGTNTLLACCCLLRPEEVNSEY
>P0DPC8 ~~~ymcF~~~Protein YmcF~~~
MTQHLHFRCPCCHGSQYRTSAFDVTERNPLGAKCIFCKSTMITFDNVALQIRTDHAPLDFTK
>P75917 ~~~ymdA~~~Uncharacterized protein YmdA~~~
MFRPFLNSLMLGSLFFPFIAIAGSTVQGGVIHFYGQIVEPACDVSTQSSPVEMNCPQNGSIPGKTYSSKALMSGNVKNAQ
IASVKVQYLDKQKKLAVMNIEYN
>O31775 3.1.4.16~~~ymdB~~~2',3'-cyclic-nucleotide 2'-phosphodiesterase~~~COG1692
MRILFIGDVVGSPGRDMVKEYVPKLKTKYKPHFTIINGENAAHGKGLTEKIYHSLIQSGADAITMGNHTWDKKEIFDFID
DVPNLVRPANFPEGTPGKGITYVKANGKELAVINLQGRTFLPPLDDPFLKADELIAEAAKRTPYIFIDFHAEATSEKLAL
GWYTDGRASAVVGTHTHVQTADNRILPKGTAYITDVGMTGPYDGILGMDRETIIKRFKTNLPVRFTVAEGKTTLSGVVID
IDDQTKKAVKIERILINDDHMFFE
>P0A8D6 3.1.1.106~~~ymdB~~~O-acetyl-ADP-ribose deacetylase~~~COG2110
MKTRIHVVQGDITKLAVDVIVNAANPSLMGGGGVDGAIHRAAGPALLDACLKVRQQQGDCPTGHAVITLAGDLPAKAVVH
TVGPVWRGGEQNEDQLLQDAYLNSLRLVAANSYTSVAFPAISTGVYGYPRAAAAEIAVKTVSEFITRHALPEQVYFVCYD
EENAHLYERLLTQQGDE
>P75962 ~~~ymfA~~~Inner membrane protein YmfA~~~
MSQDSKVFFRIFLGIGLVLILISVVVFYNQFTYSKDAIHTEGVIVDTVWHSSHSHRTGKDGSWYPVVAFRPTPDYTLIFN
SSIGSDFYEDSEGDKVNVYYSPGHPEKAEINNPWVNFFKWGFIGIMGVIFIAVGLLISMPSSKKSRRKRKSRP
>P75992 ~~~ymgA~~~Probable two-component-system connector protein YmgA~~~
MKTSDNERIKYEITGQAVLQILRMKINFSLQTLIKQLLVMKSAEEDAFRRDLIDSIIRDFSNSDSGGPNRRTATADNKSM
FNGKKINRIH
>P75994 ~~~ymgC~~~Uncharacterized protein YmgC~~~
MNNSIPERFIFQCALFKNLEREVFMTHGYVDSHIIDQALRLRLKDETSVILSDLYLQILQYIEMHKTTLTDIIINDRESV
LS
>P0AB46 ~~~ymgD~~~Uncharacterized protein YmgD~~~
MKKFALLAGLFVFAPMTWAQDYNIKNGLPSETYITCAEANEMAKTDSAQVAEIVAVMGNASVASRDLKIEQSPELSAKVV
EKLNQVCAKDPQMLLITAIDDTMRAIGKK
>P58034 ~~~ymgF~~~Inner membrane protein YmgF~~~
MNNSNNLDYFTLYIIFSIAFMLITLLVILIAKPSTGLGEVLVTINLLNALVWLAINLVNRLRERLVNHRDQQ
>Q7DFV3 ~~~ymgG~~~UPF0757 protein YmgG~~~
MKKKILAFGLISALFCSTPAMADMNRTTKGALLGAGVGLLTGNGVNGVLKGAAVGAGVGAVTEKGRDGKNARKGAKVGAA
VGAVTGVLTGNGLEGAIKGAVIGGTGGAILGKMK
>P0DPN9 ~~~ymgL~~~Protein YmgL~~~
MEIKVQRLSLWMINTVFLLSPINNHQTNTINLIFEM
>P0DPO0 ~~~ymgM~~~Protein YmgM~~~
MDDKQLQAQAAFSKASQPAIDASLNLRFSFLFSHPYANLQHFIIFFLGHRPDHPGKLYLVTDNRCRA
>P0CB62 ~~~ymiA~~~Protein YmiA~~~
MRLAMPSGNQEPRRDPELKRKAWLAVFLGSALFWVVVALLIWKVWG
>P0DPO1 ~~~ymiC~~~Protein YmiC~~~
MINTNMKYWSWMGAFSLSMLFWAELLWIITH
>P0DSF0 ~~~ymiD~~~Protein YmiD~~~
MQKLPLKEKCLTATANYHPGIRYIMTGYSAKYIYSSTYARFR
>P0CD93 ~~~ymjD~~~Protein YmjD~~~
MKHIQIRNSDMDWHIAANNLG
>P0DPO2 ~~~ymjE~~~Protein YmjE~~~
MPMIKSPHGEGGCVCAPPATDWTPPPLLPLLNRFDFRSTRPQTLLRRGGSNYGY
>P0A3X1 ~~~ymoA~~~Modulating protein YmoA~~~
MTKTDYLMRLRKCTTIDTLERVIEKNKYELSDDELELFYSAADHRLAELTMNKLYDKIPPTVWQHVK
>P0A3X0 ~~~ymoA~~~Modulating protein YmoA~~~
MTKTDYLMRLRKCTTIDTLERVIEKNKYELSDDELELFYSAADHRLAELTMNKLYDKIPPTVWQHVK
>O31797 ~~~ymzC~~~Uncharacterized protein YmzC~~~
MFESEAELRRIRIALVWIAVFLLFGACGNQDTIIETDNGNSDYETPQPTSFPLEHNHFGVMEDGYIKIYEYNESRNEVKL
KKEYADDELE
>P76073 ~~~ynaE~~~Uncharacterized protein YnaE~~~
MKSKDTLKWFPAQLPEVRIILGDAVVEVAKQGRPINTRTLLDYIEGNIKKTSWLDNKELLQTAISVLKDNQNLNGKM
>P0AEB5 ~~~ynaI~~~Low conductance mechanosensitive channel YnaI~~~COG0668
MIAELFTNNALNLVIIFGSCAALILMSFWFRRGNRKRKGFLFHAVQFLIYTIIISAVGSIINYVIENYKLKFITPGVIDF
ICTSLIAVILTIKLFLLINQFEKQQIKKGRDITSARIMSRIIKITIIVVLVLLYGEHFGMSLSGLLTFGGIGGLAVGMAG
KDILSNFFSGIMLYFDRPFSIGDWIRSPDRNIEGTVAEIGWRITKITTFDNRPLYVPNSLFSSISVENPGRMTNRRITTT
IGLRYEDAAKVGVIVEAVREMLKNHPAIDQRQTLLVYFNQFADSSLNIMVYCFTKTTVWAEWLAAQQDVYLKIIDIVQSH
GADFAFPSQTLYMDNITPPEQGR
>P0DPO3 ~~~ynaL~~~Protein YnaL~~~
MTTLIYLQIPVPEPIPGDPVPVPDPIPRPQPMPDPPPDEEPIKLSHRERRSARIRAC
>P0DPO4 ~~~ynaM~~~Protein YnaM~~~
MNSILIITSLLIIFSIFSHALIKLGIGISNNPDKTDV
>P76090 ~~~ynbA~~~Inner membrane protein YnbA~~~COG0558
MTLYQIKPLFQSLLRPTMFWLYKHHVTANHITLAALALSLLTGLLLMLAAQPILFLLLPIVLFIRMALNALDGMLARECN
QQTRLGAILNETGDVISDIALYLPFLFLPESNASLVILMLFCTILTEFCGLLAQTINGVRSYAGPFGKSDRALIFGLWGL
AVAIYPQWMQWNNLLWSIASILLLWTAINRCRSVLLMSAEI
>P76092 ~~~ynbC~~~Uncharacterized protein YnbC~~~COG2267
MENSRIPGEHFFTTSDNTALFYRHWPALQPGAKKVIVLFHRGHEHSGRLQHLVDELAMPDTAFYAWDARGHGKSSGPRGY
SPSLARSVRDVDEFVRFAASDSQVGLEEVVVIAQSVGAVLVATWIHDYAPAIRGLVLASPAFKVKLYVPLARPALALWHR
LRGLFFINSYVKGRYLTHDRQRGASFNNDPLITRAIAVNILLDLYKTSERIIRDAAAITLPTQLLISGDDYVVHRQPQID
FYQRLRSPLKELHLLPGFYHDTLGEENRALAFEKMQSFISRLYANKSQKFDYQHEDCTGPSADRWRLLSGGPVPLSPVDL
AYRFMRKAMKLFGTHSSGLHLGMSTGFDSGSSLDYVYQNQPQGSNAFGRLVDKIYLNSVGWRGIRQRKTHLQILIKQAVA
DLHAKGLAVRVVDIAAGHGRYVLDALANEPAVSDILLRDYSELNVAQGQEMIAQRGMSGRVRFEQGDAFNPEELSALTPR
PTLAIVSGLYELFPENEQVKNSLAGLANAIEPGGILIYTGQPWHPQLEMIAGVLTSHKDGKPWVMRVRSQGEMDSLVRDA
GFDKCTQRIDEWGIFTVSMAVRRDN
>P76093 ~~~ynbD~~~Uncharacterized protein YnbD~~~COG0671
MLQGAGWLLLLAPFFFFTYGSLNQFTAVQDLNSHDIPSQVFGWETAIPFLPWTIVPYWSLDLLYGFSLFVCSTTFEQRRL
VHRLILATVMACCGFLLYPLKFSFIRPEVSGVTGWLFSQLELFDLPYNQSPSLHIILCWLLWRHFRQHLAERWRKVCGGW
FLLIAISTLTTWQHHFIDVITGLAVGMLIDWMVPVDRRWNYQKPDQRRIKIALPYVVGAGSCIVLMELMMMIQLWWSVWL
CWPVLSLLIIGRGYGGLGAITTGKDSQGKLPPAVYWLTLPCRIGMWLSMRWFCRRLEPVSKMTAGVYLGAFPRHIPAQNA
VLDVTFEFPRGRATKDRLYFCVPMLDLVVPEEGELRQAVAMLETLREEQGSVLVHCALGLSRSALVVAAWLLCYGHCKTV
NEAISYIRARRPQIVLTDEHKAMLRLWENR
>C1P600 ~~~ynbG~~~Uncharacterized protein YnbG~~~
MKYINCVYNINYKLKPHSHYK
>P94492 3.1.-.-~~~yncB~~~Endonuclease YncB~~~COG1525
MKKILISMIAIVLSITLAACGSNHAAKNHSDSNGTEQVSQDTHSNEYNQTEQKAGTPHSKNQKKLVNVTLDRAIDGDTIK
VIYNGKKDTVRYLLVDTPETKKPNSCVQPYGEDASKRNKELVNSGKLQLEFDKGDRRDKYGRLLAYVYVDGKSVQETLLK
EGLARVAYVYEPNTKYIDQFRLDEQEAKSDKLSIWSKSGYVTNRGFNGCVK
>P76116 ~~~yncE~~~Uncharacterized protein YncE~~~COG3391
MHLRHLFSSRLRGSLLLGSLLVVSSFSTQAAEEMLRKAVGKGAYEMAYSQQENALWLATSQSRKLDKGGVVYRLDPVTLE
VTQAIHNDLKPFGATINNTTQTLWFGNTVNSAVTAIDAKTGEVKGRLVLDDRKRTEEVRPLQPRELVADDATNTVYISGI
GKESVIWVVDGGNIKLKTAIQNTGKMSTGLALDSEGKRLYTTNADGELITIDTADNKILSRKKLLDDGKEHFFINISLDT
ARQRAFITDSKAAEVLVVDTRNGNILAKVAAPESLAVLFNPARNEAYVTHRQAGKVSVIDAKSYKVVKTFDTPTHPNSLA
LSADGKTLYVSVKQKSTKQQEATQPDDVIRIAL
>O31801 3.6.1.23~~~yncF~~~Deoxyuridine 5'-triphosphate nucleotidohydrolase YncF~~~COG0756
MTMQIKIKYLDETQTRISKIEQGDWIDLRAAEDVTIKKDEFKLVPLGVAMELPEGYEAHVVPRSSTYKNFGVIQTNSMGV
IDESYKGDNDFWFFPAYALRDTEIKKGDRICQFRIMKKMPAVELVEVEHLGNEDRGGLGSTGTK
>A5A615 ~~~yncL~~~Uncharacterized protein YncL~~~
MNVSSRTVVLINFFAAVGLFTLISMRFGWFI
>O31803 ~~~yncM~~~Uncharacterized protein YncM~~~
MAKPLSKGGILVKKVLIAGAVGTAVLFGTLSSGIPGLPAADAQVAKAASELPNGIGGRVYLNSTGAVFTAKIVLPETVKN
NDSVSTPYIYSGFRATSGTEADIGLQYSKQYNVWKPLMKVGSKNEETYIEGKDKFTYNKGFRPGSTVQMTIYKNLSGNTR
MTLWGTNNDGYTGRIITEIQGTNIGTISKWKTLATAAVSYESQRDAIKATFSTSFNNITIDNKAVTPVVDTQDFAKVSVA
GNNVTISVNK
>P0DPO5 ~~~yncO~~~Protein YncO~~~
MIYITIFMILPCPVPCSHVFLYVFYIFLFLVLFIMTIYQSSQKLHFSNCYHNNQHHNSLHN
>P0DSF2 ~~~yncP~~~Protein YncP~~~
MNLLVKCAGKIPALALTWTCRP
>O31806 ~~~yndB~~~Uncharacterized protein YndB~~~COG3832
MAQNNENALPDITKSITLEAPIQKVWETVSTSEGIAKWFMPNDFQLKEGQEFHLQSPFGPSPCKVLAVQAPTELSFEWDT
EGWVVTFQLEDLGEKTGFTLIHSGWKEPNEVIGKANEKSSVVRGKMDGGWTGIVNERLRKAVEE
>O31809 ~~~yndE~~~Spore germination protein YndE~~~COG0531
MFSPTSKITTAQATIIIINYMLAAGVLTLPRTVTEQTQSPDGWISVLLGGVLAVIAGMIIAKLSQQYPKETFYEYSRHIV
GKWLGHLISIVFITYFLALGAFEVRVMSEIVDFFLLEGTPSWAIIMTVLWIGLYSITQGLDPIARLFEMIFPITVIIFLT
IALMSLGIFEINNLRPVLGDGIMPVLRGVKTTNLSFTCSEIMFILVAFMKKPKNAVKAVVIGTGVVTSFYMITMIMVIGA
LSVEGVVTRTWPGLDLMRSFEIPGLIFERFESFLLVIWIMQLFATFIITFYAASLGVSQVFKKKPLSCMFGLLPVIYILS
CMPKNENDVFILGDTVSHIALYIFGALPILLLVISKWRKRGEK
>O31815 ~~~yndL~~~UPF0714 protein YndL~~~COG4195
MKPAKVSLLRRLLHSLKHVDCNIAKRFPSTIKIVKLLMIFMVFTPISSIYAEDVYQNFEELKNNEDPSDYGVVTKETGSP
VLVLAIHGGGIEGGTSEVARELSKEYSMYLFEGLKSAGNSVLHITSTHFDEPRALKMTGNHEYVISLHGYAEEDQQIEVG
GTDRVRAADLVEKLQHAGFPAVLLNMDHPHAGVSPNNIANKSKTGLSIQIEMSTGFRKSLFGIFSLKSRAVTQNERFYEF
TEVMFRFLKNSY
>Q45056 ~~~yneA~~~Cell division suppressor protein YneA~~~COG1388
MIMSKESIIFVGLFTVILSAVILMLSYTSSGQELNQYVKIEVQQGDTLWSIADQVADTKKINKNDFIEWVADKNQLQTSD
IQPGDELVIPLKKKHQDAYELATVR
>Q45057 ~~~yneB~~~Resolvase homolog YneB~~~COG1961
MKALIYARVSTNKEQQETSLKRQEEELTAIAAENGMEVVKVISEKASGYEMDRDGVFELLDEIKNADIDVILVQDETRLG
RGNAKIALLHCIYREGVKVYTTAHRGELELSEADSMVLEIVSIVEEYQRKIHNMKIRRGMKRAVKNGFKPQKNLKNQHGN
SGKEKIEVPISEIVRLRANKLTFAEIAATLRGFGYDVSKATVHRRFQEYIENEETAE
>P45711 ~~~yneK~~~Membrane protein YneK~~~
MLEGWFLWFILFWVIMMVVLLSIGGFFMFRKFLKRLPKEDGRSELDWQDYYIENSRHLWNDENKQFLDELTAPVPELFRD
AAKAKIAGKIGELALKEKVAKIDQQLMIKGYILATPKRDHTFLKRHLRDKKIDLEPYQTLLK
>P0DPO7 ~~~yneP~~~Protein YneP~~~
MTKHPTGIYVGCLVKVIRRRLRMELKESVINYSPFVLQHP
>P76169 ~~~ynfA~~~UPF0060 membrane protein YnfA~~~COG1742
MIKTTLLFFATALCEIIGCFLPWLWLKRNASIWLLLPAGISLALFVWLLTLHPAASGRVYAAYGGVYVCTALMWLRVVDG
VKLTLYDWTGALIALCGMLIIVAGWGRT
>P76170 ~~~ynfB~~~UPF0482 protein YnfB~~~
MKITLSKRIGLLAILLPCALALSTTVHAETNKLVIESGDSAQSRQHAAMEKEQWNDTRNLRQKVNKRTEKEWDKADAAFD
NRDKCEQSANINAYWEPNTLRCLDRRTGRVITP
>P76172 ~~~ynfD~~~Uncharacterized protein YnfD~~~
MKLSTCCAALLLALASPAVLAAPGSCERIQSDISQRIINNGVPESSFTLSIVPNDQVDQPDSQVVGHCANDTHKILYTRT
TSGNVSAPAQSSQDGAPAEPQ
>Q45069 ~~~ynfE~~~Uncharacterized protein YnfE~~~
MDEILKQYMVLYKKMSNMINGPDYPGKEKDIQHQKDQIEVYEKQLQQGFSTDYDYDVFADSVIKCAYGDMTLEDLEAVYY
GLTTPFF
>P77374 1.8.99.-~~~ynfE~~~Putative dimethyl sulfoxide reductase chain YnfE~~~COG0243
MSKNERMVGISRRTLVKSTAIGSLALAAGGFSLPFTLRNAAAAVQQAREKVVWGACSVNCGSRCALRLHVKDNEVTWVET
DNTGSDEYGNHQVRACLRGRSIRRRINHPDRLNYPMKRVGKRGEGKFERISWDEALDTIASSLKKTVEQYGNEAVYIQYS
SGIVGGNMTRSSPSASAVKRLMNCYGGSLNQYGSYSTAQISCAMPYTYGSNDGNSTTDIENSKLVVMFGNNPAETRMSGG
GITYLLEKAREKSNAKMIVIDPRYTDTAAGREDEWLPIRPGTDAALVAGIAWVLINENLVDQPFLDKYCVGYDEKTLPAD
APKNGHYKAYILGEGDDKTAKTPQWASQITGIPEDRIIKLAREIGTAKPAYICQGWGPQRQANGELTARAIAMLPILTGN
VGISGGNSGARESTYTITIERLPVLDNPVKTSISCFSWTDAIDHGPQMTAIRDGVRGKDKLDVPIKFIWNYAGNTLVNQH
SDINKTHEILQDESKCEMIVVIENFMTSSAKYADILLPDLMTVEQEDIIPNDYAGNMGYLIFLQPVTSEKFERKPIYWIL
SEVAKRLGPDVYQKFTEGRTQEQWLQHLYAKMLAKDPALPSYDELKKMGIYKRKDPNGHFVAYKAFRDDPEANPLKTPSG
KIEIYSSRLAEIARTWELEKDEVISPLPVYASTFEGWNSPERRTFPLQLFGFHYKSRTHSTYGNIDLLKAACRQEVWINP
IDAQKRGIANGDMVRVFNHRGEVRLPAKVTPRILPGVSAMGQGAWHEANMSGDKIDHGGCVNTLTTLRPSPLAKGNPQHT
NLVEIEKI
>P77783 1.8.99.-~~~ynfF~~~Probable dimethyl sulfoxide reductase chain YnfF~~~COG0243
MKIHTTEALMKAEISRRSLMKTSALGSLALASSAFTLPFSQMVRAAEAPVEEKAVWSSCTVNCGSRCLLRLHVKDDTVYW
VESDTTGDDVYGNHQVRACLRGRSIRRRMNHPDRLKYPMKRVGKRGEGKFERISWDEALDTISDNLRRILKDYGNEAVHV
LYGTGVDGGNITNSNVPYRLMNSCGGFLSRYGSYSTAQISAAMSYMFGANDGNSPDDIANTKLVVMFGNNPAETRMSGGG
VTYYVEQARERSNARMIVIDPRYNDTAAGREDEWLPIRPGTDGALACAIAWVLITENMVDQPFLDKYCVGYDEKTLPANA
PRNAHYKAYILGEGPDGIAKTPEWAAKITSIPAEKIIQLAREIGSAKPAYICQGWGPQRHSNGEQTSRAIAMLSVLTGNV
GINGGNSGVREGSWDLGVEWFPMLENPVKTQISVFTWTDAIDHGTEMTATRDGVRGKEKLDVPIKFLWCYASNTLINQHG
DINHTHEVLQDDSKCEMIVGIDHFMTASAKYCDILLPDLMPTEQEDLISHESAGNMGYVILAQPATSAKFERKPIYWMLS
EVAKRLGPDVYQTFTEGRSQHEWIKYLHAKTKERNPEMPDYEEMKTTGIFKKKCPEEHYVAFRAFREDPQANPLKTPSGK
IEIYSERLAKIADTWELKKDEIIHPLPAYTPGFDGWDDPLRKTYPLQLTGFHYKARTHSSYGNIDVLQQACPQEVWINPI
DAQARGIRHGDTVRVFNNNGEMLIAAKVTPRILPGVTAIGQGAWLKADMFGDRVDHGGSINILTSHRPSPLAKGNPSHSN
LVQIEKV
>P43531 ~~~ynfM~~~Inner membrane transport protein YnfM~~~COG2814
MSRTTTVDGAPASDTDKQSISQPNQFIKRGTPQFMRVTLALFSAGLATFALLYCVQPILPVLSQEFGLTPANSSISLSIS
TAMLAIGLLFTGPLSDAIGRKPVMVTALLLASICTLLSTMMTSWHGILIMRALIGLSLSGVAAVGMTYLSEEIHPSFVAF
SMGLYISGNSIGGMSGRLISGVFTDFFNWRIALAAIGCFALASALMFWKILPESRHFRPTSLRPKTLFINFRLHWRDRGL
PLLFAEGFLLMGSFVTLFNYIGYRLMLSPWHVSQAVVGLLSLAYLTGTWSSPKAGTMTTRYGRGPVMLFSTGVMLFGLLM
TLFSSLWLIFAGMLLFSAGFFAAHSVASSWIGPRAKRAKGQASSLYLFSYYLGSSIAGTLGGVFWHNYGWNGVGAFIALM
LVIALLVGTRLHRRLHA
>P76157 ~~~ynfN~~~Uncharacterized protein YnfN~~~
MREYPNGEKTHLTVMAAGFPSLTGDHKVIYVAADRHVTSEEILEAAIRLLS
>P0DPP1 ~~~ynfP~~~Protein YnfP~~~
MTIEKHERSTKDLVKAAVSGWLGTALEFMDFKSHAC
>P0DPC9 ~~~ynfQ~~~Protein YnfQ~~~
MTNHIHFRCPCCHGSQYRTSSFDVSDMNPFGAKCIFCKSMMITFDNISQYLNASRLSLDLKK
>P0DPO9 ~~~ynfR~~~Protein YnfR~~~
MKAPSGAFLLGVYSMDTHILR
>P0DPP0 ~~~ynfS~~~Protein YnfS~~~
MNNPVCLDDWLIGFKSLCCTLAVIALLII
>P0DPO8 ~~~ynfT~~~Protein YnfT~~~
MNSILIITSLLIIFSIFSHALIKLGIGISNNPDKTDV
>P0DSF4 ~~~ynfU~~~Putative zinc-binding protein YnfU~~~
MSERKNSKSRRNYLVKCSCPNCTQESEHSFSRVQKGALLICPHCNKVFQTNLKAVA
>O31822 2.7.7.9~~~yngB~~~UTP--glucose-1-phosphate uridylyltransferase YngB~~~COG1210
MRKKVRKAVIPAAGLGTRFLPATKAQPKEMLPIVDKPAIQYIVEEAAESGIEDILIITGRNKRSIEDHFDRSAELEFNLR
EKGKTETLKEMQQIADLANIHYIRQKEPLGLGHAVLCAEHFIGDEPFAVLLGDDIMVSETPALRQLMDVYDVYGTEVVGV
QSVLPEDVSKYGIINTSGSQGHVYEVNDLVEKPSPEEAPSEIAVMGRYVLNSSIFSVLKTIGRGAGNEIQLTDALREVCR
KEPIHARLLEGNRYDIGDKLGCFKASTEIGLMRPEMRSQLLAYLEDVIKRETKEMLR
>A5A618 ~~~ynhF~~~Uncharacterized protein YnhF~~~
MSTDLKFSLVTTIIVLGLIVAVGLTAALH
>P76193 2.-.-.-~~~ynhG~~~Probable L,D-transpeptidase YnhG~~~COG1376
MKRASLLTLTLIGAFSAIQAAWAVDYPLPPTGSRLVGQNQTYTVQEGDKNLQAIARRFDTAAMLILEANNTIAPVPKPGT
TITIPSQLLLPDAPRQGIIVNLAELRLYYYPPGENIVQVYPIGIGLQGLETPVMETRVGQKIPNPTWTPTAGIRQRSLER
GIKLPPVVPAGPNNPLGRYALRLAHGNGEYLIHGTSAPDSVGLRVSSGCIRMNAPDIKALFSSVRTGTPVKVINEPVKYS
VEPNGMRYVEVHRPLSAEEQQNVQTMPYTLPAGFTQFKDNKAVDQKLVDKALYRRAGYPVSVSSGATPAASNAPSVESAQ
NGEPEQGNMLRVTQ
>P0DUW2 ~~~ynhH~~~Protein YnhH~~~
MDCRTQRLKNRHSRWAETQAHIPLHAFSMSPILRARHHHFRNTGFAFRQCAQKASFSHPSQLKD
>Q2EES1 ~~~yniD~~~Uncharacterized protein YniD~~~
MPTKRFDKKHWKMVVVLLAICGAMLLLRWAAMIWG
>P76223 ~~~ynjB~~~Protein YnjB~~~COG4134
MRHCGWLLGLLSLFSLATHASDWQEIKNEAKGQTVWFNAWGGDTAINRYLDWVSGEMKTHYAINLKIVRLADAADAVKRI
QTEAAAGRKTGGSVDLLWVNGENFRTLKEANLLQTGWAETLPNWRYVDTQLPVREDFSVPTQGAESPWGGAQLTFIARRD
VTPQPPQTPQALLEFAKANPGTVTYPRPPDFTGTAFLEQLLIMLTPDPAALKEAPDDATFARVTAPLWQYLDVLHPYLWR
EGKDFPPSPARMDALLKAGTLRLSLTFNPAHAQQKIASGDLPASSYSFGFREGMIGNVHFVTIPANANASAAAKVVANFL
LSPDAQLRKADPAVWGDPSVLDPQKLPDGQRESLQSRMPQDLPPVLAEPHAGWVNALEQEWLHRYGTH
>P76224 ~~~ynjC~~~Inner membrane ABC transporter permease protein YnjC~~~COG4135
MATPLRYALIFLLWAMVAVIYAPLIPAALTLISPALSLTHWQALFADPQLPQALLATLVSTTIAAVGALLIALLVIVALW
PGPKWQRMCARLPWLLAIPHVAFATSALLLFADGGLLYDYFPYFTPPMDRFGIGLGLTLAVKESAFLLWILAAVLSEKWL
LQQVIVLDSLGYSRWQCLNWLLLPSVAPALAMAMLAIVAWSLSVVDVAIILGPGNPPTLAVISWQWLTQGDIDQQTKGAL
ASLLLMLLLAAYVLLSYLLWRSWRRTIPRVDGVRKPATPLLPGNTLAIFLPLTGVLCVVLLAILADQSTINSEALINSLT
MGLVATFIALLLLLLWLEWGPQRRQLWLWLPILLPALPLVAGQYTLALWLKLDGSWTAVVWGHLLWVMPWMLFILQPAWQ
RIDSRLILIAQTLGWSRAKIFFYVKCPLMLRPVLIAFAVGFAVGIAQYMPTLWLGAGRFPTLTTEAVALSSGGSNGILAA
QALWQLLLPLIIFALTALVAKWVGYVRQGLR
>P78067 2.8.1.1~~~ynjE~~~Thiosulfate sulfurtransferase YnjE~~~COG2897
MKRVSQMTALAMALGLACASSWAAELAKPLTLDQLQQQNGKAIDTRPSAFYNGWPQTLNGPSGHELAALNLSASWLDKMS
TEQLNAWIKQHNLKTDAPVALYGNDKDVDAVKTRLQKAGLTHISILSDALSEPSRLQKLPHFEQLVYPQWLHDLQQGKEV
TAKPAGDWKVIEAAWGAPKLYLISHIPGADYIDTNEVESEPLWNKVSDEQLKAMLAKHGIRHDTTVILYGRDVYAAARVA
QIMLYAGVKDVRLLDGGWQTWSDAGLPVERGTPPKVKAEPDFGVKIPAQPQLMLDMEQARGLLHRQDASLVSIRSWPEFI
GTTSGYSYIKPKGEIAGARWGHAGSDSTHMEDFHNPDGTMRSADDITAMWKAWNIKPEQQVSFYCGTGWRASETFMYARA
MGWKNVSVYDGGWYEWSSDPKNPVATGERGPDSSK
>P76226 ~~~ynjF~~~Inner membrane protein YnjF~~~COG0558
MLDRHLHPRIKPLLHQCVRVLDKPGITPDGLTLVGFAIGVLALPFLALGWYLAALVVILLNRLLDGLDGALARRRELTDA
GGFLDISLDFLFYALVPFGFILAAPEQNALAGGWLLFAFIGTGSSFLAFAALAAKHQIDNPGYAHKSFYYLGGLTEGTET
ILLFVLGCLFPAWFAWFAWIFGALCWMTTFTRVWSGYLTLKSLQRQ
>P76228 ~~~ynjI~~~Inner membrane protein YnjI~~~
MKKVLLQNHPGSEKYSFNGWEIFNSNFERMIKENKAMLLCKWGFYLTCVVAVMFVFAAITSNGLNERGLITAGCSFLYLL
IMMGLIVRAGFKAKKEQLHYYQAKGIEPLSIEKLQALQLIAPYRFYHKQWSETLEFWPRKPEPGKDTFQYHVLPFDSIDI
ISKRRESLEDQWGIEDSESYCALMEHFLSGDHGANTFKANMEEAPEQVIALLNKFAVFPSDYISDCANHSSGKSSAKLIW
AAELSWMISISSTAFQNGTIEEELAWHYIMLASRKAHELFESEEDYQKNSQMGFLYWHICCYRRKLTDAELEACYRYDKQ
FWEHYSKKCRWPIRNVPWGASSVKYS
>O31818 ~~~ynzC~~~UPF0291 protein YnzC~~~COG4224
MISNAKIARINELAAKAKAGVITEEEKAEQQKLRQEYLKGFRSSMKNTLKSVKIIDPEGNDVTPEKLKREQRNNKLH
>O31819 3.1.3.-~~~ynzD~~~Aspartyl-phosphate phosphatase YnzD~~~
MIREHLLKEIEKKRAELLQIVMANGMTSHITIELSQELDHLLIQYQKQRLRAVAGDE
>O31802 ~~~ynzH~~~Uncharacterized protein YnzH~~~
MGYYKKYKEEYYTWKKTYYKKYYDNDKKHYDCDKYYDHDKKHYDYDKKYDDHDKKYYDDHDYHYEKKYYDDDDHYYDFVE
SYKKHH
>P76257 3.6.4.12~~~yoaA~~~Probable ATP-dependent DNA helicase YoaA~~~COG1199
MTDDFAPDGQLAKAIPGFKPREPQRQMAVAVTQAIEKGQPLVVEAGTGTGKTYAYLAPALRAKKKVIISTGSKALQDQLY
SRDLPTVSKALKYTGNVALLKGRSNYLCLERLEQQALAGGDLPVQILSDVILLRSWSNQTVDGDISTCVSVAEDSQAWPL
VTSTNDNCLGSDCPMYKDCFVVKARKKAMDADVVVVNHHLFLADMVVKESGFGELIPEADVMIFDEAHQLPDIASQYFGQ
SLSSRQLLDLAKDITIAYRTELKDTQQLQKCADRLAQSAQDFRLQLGEPGYRGNLRELLANPQIQRAFLLLDDTLELCYD
VAKLSLGRSALLDAAFERATLYRTRLKRLKEINQPGYSYWYECTSRHFTLALTPLSVADKFKELMAQKPGSWIFTSATLS
VNDDLHHFTSRLGIEQAESLLLPSPFDYSRQALLCVLRNLPQTNQPGSARQLAAMLRPIIEANNGRCFMLCTSHAMMRDL
AEQFRATMTLPVLLQGETSKGQLLQQFVSAGNALLVATSSFWEGVDVRGDTLSLVIIDKLPFTSPDDPLLKARMEDCRLR
GGDPFDEVQLPDAVITLKQGVGRLIRDADDRGVLVICDNRLVMRPYGATFLASLPPAPRTRDIARAVRFLAIPSSR
>O34864 ~~~yoaB~~~Putative transporter YoaB~~~COG2814
MLDKIGIPKRLAWGFLGVVLFMMGDGLEQGWLSPFLIENGLTVQQSASIFSIYGIALAIASWFSGVCLEAFGAKRTMFMG
LLFYVIGTAAFIVFGFEQLNLPVMYVTYFVKGLGYPLFAYSFLTWVIYRTPQSKLSTAVGWFWIAYCLGMFVFGAWYSSY
AIKAFGYLNTLWSSIFWVCLGAFFALFINKDRFEKKKRKRSETAEELLKGVTILFTNPRVLTGGIIRIINSIGTYGFPVF
LPMHMAQHGISTNVWLQIWGTIFLGNIVFNLIFGIVGDKFGWKNTVIWFGGVGCGIFTVLLYYAPVFSGGSLAVVSVIGF
IWGGLLAGYVPIGAIVPTVAGKDKGAAMSVLNLAAGLSAFVGPALAWLFIGLVGAQGVVWIFAALYLASAVLTKCIHIPE
EKAVKEETSPQYAS
>O34861 2.7.1.-~~~yoaC~~~Putative sugar kinase YoaC~~~COG1070
MKKQKGYLVFDIGTGNARVAVVSVTGSVLTVEREDIEYSTETLYPDSRYFSPQVLWKQVMNLAKRALSRSCDIDIIGLTS
TSQRQGIVLIDQNGNPFLGLPNIDNRGREWEAGIPDWEEIYSSTGRLPTALFSALKLYGLKQRQPSLWEKTASFTSISDW
VTYQLSGILTYEPSQATETLLFDVKQNTWSEEMCDIFGFSPSILPPLVRAGTAIGTIANEYASELGLSINAKVIAGGGDT
QLAVKSTGAGLEDIVIVSGTTTPITKITEDHGDTKHKAWLNCHTDQGHWLVETNPGITGLNYQKLKQIFYPNETYEVMEE
EISALAKEDHACVAALGSYLSAEKNALTRGGFLFDAPLSAHLKRAHFVRAALEEIAFSIKWNFDILTEVTPFERDYVWVC
GGGFQSKALTQYIADLLQKKVYVQEGYHQASVVGAAVICNETFQLTEEMSANVRVIEPKDCQIELALYEEWKQTQRFFSG
SESKVLI
>O34815 1.1.1.-~~~yoaD~~~Putative 2-hydroxyacid dehydrogenase YoaD~~~COG0111
MKNTMKRMFCSMTVLVTAPYNEEGRKELENLFGSVAYQSWKEQGRAYREDELIQLLKATNATGLITELDQVTDSVFASVP
ELSFVGVCRGMPSNVDVAAASKRGIPVFYTPGRNAQAVAEMFIGNVISFLRHTSASNQWLKDGEWDSDYLQAYVKFKGNE
LTGKTVGMIGFGAVGQRIAKLLTAFDCKIKYYDPYIQDDHPLYEKASLKTVFSDSDIVSVHLPRTEETLGLIDRQYFDLM
KESAIFVNTSRAVVVNREDLLFVLKEHKISGAILDVFYHEPPEESDYELISLPNVLATPHLAGATFEVEDHHVTILNKAL
KKWKGEKTLNIQTMYNKDALKTGG
>P0AEC0 ~~~yoaE~~~UPF0053 inner membrane protein YoaE~~~COG1253
MEFLMDPSIWAGLLTLVVLEIVLGIDNLVFIAILADKLPPKQRDKARLLGLSLALIMRLGLLSLISWMVTLTKPLFTVMD
FSFSGRDLIMLFGGIFLLFKATTELHERLENRDHDSGHGKGYASFWVVVTQIVILDAVFSLDAVITAVGMVNHLPVMMAA
VVIAMAVMLLASKPLTRFVNQHPTVVVLCLSFLLMIGLSLVAEGFGFHIPKGYLYAAIGFSIIIEVFNQIARRNFIRHQS
TLPLRARTADAILRLMGGKRQANVQHDADNPMPMPIPEGAFAEEERYMINGVLTLASRSLRGIMTPRGEISWVDANLGVD
EIREQLLSSPHSLFPVCRGELDEIIGIVRAKELLVALEEGVDVAAIASASPAIIVPETLDPINLLGVLRRARGSFVIVTN
EFGVVQGLVTPLDVLEAIAGEFPDADETPEIITDGDGWLVKGGTDLHALQQALDVEHLADDDDIATVAGLVISANGHIPR
VGDVIDVGPLHITIIEANDYRVDLVRIVKEQPAHDEDE
>P64496 ~~~yoaG~~~Protein YoaG~~~
MGKATYTVTVTNNSNGVSVDYETETPMTLLVPEVAAEVIKDLVNTVRSYDTENEHDVCGW
>C0SP89 ~~~yoaH~~~Putative methyl-accepting chemotaxis protein YoaH~~~COG0840
MKVKTKLLGIISILVVSIIGIGGSSVFMISSTVKKNEELKDKMEFQKEIKHIQYELTGLSNDERGFLITRDKEYDEGMKG
KADDVLKSLDRVNDLIDEEKYQSNIEDIKTSFTQYRALNQQVVTAYSSNPKKAETIHFGEERTIRKEGVAPAVNKLSDRL
DQEVEDLKDEIQGNGKMSQSLIIIVTGISVILGIVLSIMLLKSIMVPLRSINKQLEEIAHGEADLTKKVIVKNKDEFGQL
AQSFNSFTHSLTQIVKQISSSSEQVAASSEELSASAEESKSTSEHISRAMQMAADSNVKQSSMTEKSAESITELLDSISS
VASNTGNIADLSSSMRDKAEIGSKSVNKMLDQMKFIDKSVDSAGNGLQTLVASTAEISDISSLITTISEQTNLLALNAAI
EAARAGEQGKGFAVVAEEVRKLADETNKSANHIQSVVATIQNESIETVNNIKVVQENVSSGIVLSQETTGNFNEILNLVE
QVTSQIQEVAAATQQLTSGVEVIQHTVHTLAAGTKETSANTEAVANSSQEQLHSMGEISYAAESLSQLAEELQTVINRFK
Y
>O34918 ~~~yoaJ~~~Expansin-YoaJ~~~COG4305
MKKIMSAFVGMVLLTIFCFSPQASAAYDDLHEGYATYTGSGYSGGAFLLDPIPSDMEITAINPADLNYGGVKAALAGSYL
EVEGPKGKTTVYVTDLYPEGARGALDLSPNAFRKIGNMKDGKINIKWRVVKAPITGNFTYRIKEGSSRWWAAIQVRNHKY
PVMKMEYEKDGKWINMEKMDYNHFVSTNLGTGSLKVRMTDIRGKVVKDTIPKLPESGTSKAYTVPGHVQFPE
>C1P603 ~~~yoaJ~~~Uncharacterized protein YoaJ~~~
MKKTTIIMMGVAIIVVLGTELGWW
>C1P602 ~~~yoaK~~~Uncharacterized membrane protein YoaK~~~
MRIGIIFPVVIFITAVVFLAWFFIGGYAAPGA
>P0DPP2 ~~~yoaL~~~Protein YoaL~~~
MDRHRRHFSIRPFNACLSGTLCRTFRLHFVVTPALFLASNSYSLSRSLSWNS
>O31835 ~~~yobA~~~Uncharacterized protein YobA~~~
MPKIGVSLIVLIMLIIFLAGCNKNEQNGDETKMQSLVGYVVLKDNERAILITDTKAPGKEDYNLSEGQLMNKFKNNIVIV
GLSEIDNTDDLKRGEKIKVWFHTRKESNPPSATIQKYELL
>P0AA57 ~~~yobA~~~Protein YobA~~~COG2372
MASTARSLRYALAILTTSLVTPSVWAHAHLTHQYPAANAQVTAAPQAITLNFSEGVETGFSGAKITGPKNENIKTLPAKR
NEQDQKQLIVPLADSLKPGTYTVDWHVVSVDGHKTKGHYTFSVK
>P67601 ~~~yobD~~~UPF0266 membrane protein YobD~~~COG4811
MTITDLVLILFIAALLAFAIYDQFIMPRRNGPTLLAIPLLRRGRIDSVIFVGLIVILIYNNVTNHGALITTWLLSALALM
GFYIFWIRVPKIIFKQKGFFFANVWIEYSRIKAMNLSEDGVLVMQLEQRRLLIRVRNIDDLEKIYKLLVSTQ
>P64508 ~~~yobF~~~Protein YobF~~~
MCGIFSKEVLSKHVDVEYRFSAEPYIGASCSNVSVLSMLCLRAKKTI
>C1P604 ~~~yobI~~~Uncharacterized protein YobI~~~
MYIFITHFFTEYVILKYLLPI
>O34596 ~~~yobK~~~Immunity protein YobK~~~
MIYSKVENFINENKQNAIFTEGASHENIGRIEENLQCDLPNSYKWFLEKYGAGGLFGVLVLGYNFDHASVVNRTNEYKEH
YGLTDGLVVIEDVDYFAYCLDTNKMKDGECPVVEWDRVIGYQDTVADSFIEFFYNKIQEAKDDWDEDEDWDD
>O34330 ~~~yobL~~~Toxin YobL~~~COG5444
MKVFEADSLLFEADKRTKEYKELRSQMVKLKKAFKEVANLDDSEFSGKGADNIKAFYHGHVGVTDQWIDLIDMKIAFLSS
MSATLEDAKMSDAYIEESFLEHELANAYAKSKSIMSEQKKAMKDILNNINDILPLEIFSTEDFKDKLSSADDKREKTIDK
LNKLDEDLKTEYAETEPNEQFIQQDFKKLQESTGKGKNATPIHYNAKAYRESDIHKKKGDIEKHSEAYLSVKKEEAKERE
IKELKKKLNDGVSDPDEYLEIAKKVGYENLEPTQVQLAVQIEQAKQLEGAGEITWDIVKGVGVGLYDVGKDTVTGIWDFI
TDPGETLSALGNAAMHPVKTYDAISAAIEESYQKDMVNGDAYSRSRWVTYAIGSVAVAVVGTKGAGAINKADAAGKVINK
ASQAGKKIKDVKIPDLLPYNPKYKLALADNVPYNVVDSQNLKNELLTNAKKIPDGTRKPFTGQKKSPPWLNKEKYDAYEI
EGKVKAKGKVKDVSRRVYTMKDIDINQKTEFGVTNLQLMKNGNAPYAKDGTQINLHHLIQEEPGPMLEIPNSLHTKYSDV
IHQLKSDGESFRNDKVLKAQYESFRKRYWKWRAKQFENEN
>O34669 ~~~yocH~~~Cell wall-binding protein YocH~~~COG1388
MKKTIMSFVAVAALSTTAFGAHASAKEITVQKGDTLWGISQKNGVNLKDLKEWNKLTSDKIIAGEKLTISSEETTTTGQY
TIKAGDTLSKIAQKFGTTVNNLKVWNNLSSDMIYAGSTLSVKGQATAANTATENAQTNAPQAAPKQEAVQKEQPKQEAVQ
QQPKQETKAEAETSVNTEEKAVQSNTNNQEASKELTVTATAYTANDGGISGVTATGIDLNKNPNAKVIAVDPNVIPLGSK
VYVEGYGEATAADTGGAIKGNKIDVFVPSKSDASNWGVKTVSVKVLN
>O34844 ~~~yodB~~~HTH-type transcriptional regulator YodB~~~COG1733
MGNTMCPKMESAFSLLGKRWNGLIIHVLMDGPKRFKEITETIPMISQKMLAERLKELEQNEIVERQVLPETPVKVIYTLT
EKGTALQAVFQEMQAWADQFCEPGDTVCEEEK
>P81102 1.-.-.-~~~yodC~~~Putative NAD(P)H nitroreductase YodC~~~COG0778
MTNTLDVLKARASVKEYDTNAPISKEELTELLDLATKAPSAWNLQHWHFTVFHSDESKAELLPVAYNQKQIVESSAVVAI
LGDLKANENGEEVYAELASQGYITDEIKQTLLGQINGAYQSEQFARDSAFLNASLAAMQLMIAAKAKGYDTCAIGGFNKE
QFQKQFDISERYVPVMLISIGKAVKPAHQSNRLPLSKVSTWL
>P0DSF8 ~~~yodE~~~Protein YodE~~~
MGNDAFKLSSADRGDITINNESGHLIVNTAILSGDIVTLRGGEIRLVL
>O34745 ~~~yodF~~~Uncharacterized symporter YodF~~~COG0591
MQGNLTALLITAIIVLTVVCIGFLAGRDKSSRTSVEEWSVGGRRFGGLLVWFLVGADLYTAYTFLGLTSTAFTGGSVAFF
AIPYSVLAYFIAYFFLPKLWKVAKIHKLTTLADYARERFNSKLLASLVAIVGVLMLIPYICLQLSGIQDTLQVAGTGYIN
VKFVVIISFILVALYTFFSGIKGPTYTAIIKDILVWVIMLFMVVSLPLIHFNGWTPMIDTLVKEAPQMLTIPSEGPKGIP
WFITASIVSALALFMWAHAATGVFTAKSADAVRKNSMFLPLYNIVLILVIFLGFIAFLVLPEDTNPRLALLHLIQTSYGG
VAQGFAYATIALASLIPCSIMAIGASNLFANNLYRDLIHPNVSQSKLTLVTRSMVFVVIGLALLFGMLFPTALVTLQLLG
VSGMVQIFPAIAVSLFWKNQTKEATVIGLLAGLAVTFIVYITQSAHGIYEGFWGLAANMIAVVILNPLFVKNAGSNPVIE
GLFGKKQDANPNQKGA
>O34866 3.4.-.-~~~yodJ~~~Putative carboxypeptidase YodJ~~~COG1876
MKKSGKWFSLAAALSVTAIVGAGCSMSNGDAQKDTKTTAETKQTEQKTADSKKSNTQNSEFSLESQYFNDIKKVDGLETI
QNPENILALVNKQYALPGNYEPSDLVIPDVEFSFEEKIQKRYIRKEAADALKTMFDAAKKEGYELAAVSGYRSYDRQKVI
FDNEVSLKGERKAKEAVAYPGESEHQTGLAMDISSRSNGFELNEAFGSTADGKWVQDNAYKYGFIIRYPKNKEDITKYEY
EPWHLRYVGKKAAKVIQDNDLTLEEYFEKVKKI
>O34895 2.3.1.-~~~yodP~~~N-acetyltransferase YodP~~~COG0456
MLKSIKSSGVTAVLDHDGFNKRIRVVRYDGAIEKALPDIVAAAKEENAEKIIVYAKQHDEPILAKQLFAPEGYLKGYYLG
HSACVMVRYLSESRRQTDSYTEEQEIIEAIYRTAPRLRNDSTPVFTMRKAETNDMYQLSMLYKKVFRTYPTPVFDPAYIE
KTMNANTVYYIMLDHDRLISAASAEINPELGHAEITDCAVLPEYRGHSLTSFLIEALEKEMAGEDIVHVFSLARASSFGM
NAVLYHSGYQYGGRLINNCFIAEGLENMNIWCKQL
>O34841 ~~~yoeB~~~Uncharacterized protein YoeB~~~
MKKCLLFLTTIALILSLSTNAFAKNTSGDLSQKQALQLALSAREHFWNTMSGHNPKVKKAVCPSGTFEYQNLQYVYMCSD
LGTKAKAVNYLTPIFTKTAIEKGFKDYHFTVSKGKLAVPIGDGDNLLNWKKSTAKLISKKGSTITYEFTVPTLDGSPSAK
RKVTFVKENKKWKVNQFDAVI
>P69348 3.1.-.-~~~yoeB~~~Toxin YoeB~~~COG4115
MKLIWSEESWDDYLYWQETDKRIVKKINELIKDTRRTPFEGKGKPEPLKHNLSGFWSRRITEEHRLVYAVTDDSLLIAAC
RYHY
>C1P606 ~~~yoeI~~~Uncharacterized protein YoeI~~~
MGQFFAYATVITVKENDHVA
>O34685 ~~~yofA~~~HTH-type transcriptional regulator YofA~~~COG0583
MESGDLKIFQAVAREGSITKAAQMLNYVQSNVTARVHNLEEDLNIRLFHRTNRGMKLTAAGENLLQYADQVLSLLDQAEK
STRMSRQPKGPLRIGSLETMAVTHLPEHAASFLRRFPEVDLSVNTADTHHLIQQVLDHKVDGAFVYGPVEHAAVRQLHVS
HDELVLISSREGTAEDMLQQPMLFFGAGCSHRDRVKRLLEEAGIHNQKIIEFGTLEAIIKGVSAGMGTALLPKSAVDGSE
HRTNVWIHQLPPSYQDLEIVFIYRKDFFITSAFQTFLDEINEMKR
>P0AD17 ~~~yohC~~~Inner membrane protein YohC~~~
MSHVWGLFSHPDREMQVINRENETISHHYTHHVLLMAAIPVICAFIGTTQIGWNFGDGTILKLSWFTGLALAVLFYGVML
AGVAVMGRVIWWMARNYPQRPSLAHCMVFAGYVATPLFLSGLVALYPLVWLCALVGTVALFYTGYLLYLGIPSFLNINKE
EGLSFSSSTLAIGVLVLEVLLALTVILWGYGYRLF
>P33366 ~~~yohD~~~Inner membrane protein YohD~~~COG0586
MDLNTLISQYGYAALVIGSLAEGETVTLLGGVAAHQGLLKFPLVVLSVALGGMIGDQVLYLCGRRFGGKLLRRFSKHQDK
IERAQKLIQRHPYLFVIGTRFMYGFRVIGPTLIGASQLPPKIFLPLNILGAFAWALIFTTIGYAGGQVIAPWLHNLDQHL
KHWVWLILVVVLVVGVRWWLKRRGKKKPDHQA
>P33368 1.-.-.-~~~yohF~~~Uncharacterized oxidoreductase YohF~~~COG1028
MAQVAIITASDSGIGKECALLLAQQGFDIGITWHSDEEGAKDTAREVVSHGVRAEIVQLDLGNLPEGALALEKLIQRLGR
IDVLVNNAGAMTKAPFLDMAFDEWRKIFTVDVDGAFLCSQIAARQMVKQGQGGRIINITSVHEHTPLPDASAYTAAKHAL
GGLTKAMALELVRHKILVNAVAPGAIATPMNGMDDSDVKPDAEPSIPLRRFGATHEIASLVVWLCSEGANYTTGQSLIVD
GGFMLANPQFNPE
>P60632 ~~~yohJ~~~UPF0299 membrane protein YohJ~~~COG1380
MSKTLNIIWQYLRAFVLIYACLYAGIFIASLLPVTIPGSIIGMLILFVLLALQILPAKWVNPGCYVLIRYMALLFVPIGV
GVMQYFDLLRAQFGPVVVSCAVSTLVVFLVVSWSSQLVHGERKVVGQKGSEE
>P0AD19 ~~~yohK~~~Inner membrane protein YohK~~~COG1346
MMANIWWSLPLTLIVFFAARKLAARYKFPLLNPLLVAMVVIIPFLMLTGISYDSYFKGSEVLNDLLQPAVVALAYPLYEQ
LHQIRARWKSIITICFIGSVVAMVTGTSVALLMGASPEIAASILPKSVTTPIAMAVGGSIGGIPAISAVCVIFVGILGAV
FGHTLLNAMRIRTKAARGLAMGTASHALGTARCAELDYQEGAFSSLALVLCGIITSLIAPFLFPIILAVMG
>Q2EES6 ~~~yohO~~~UPF0387 membrane protein YohO~~~
MRIAKIGVIALFLFMALGGIGGVMLAGYTFILRAG
>C1P609 ~~~yohP~~~Uncharacterized membrane protein YohP~~~
MKIILWAVLIIFLIGLLVVTGVFKMIF
>O31858 ~~~yojF~~~Uncharacterized protein YojF~~~COG2120
MKAIIKEDVQASLERYADRPVYIHLETTTGSYSAHLNEKNMTVVAYIRNAKVTYHQAKIKGNGPYRVGLKTEEGWIYAEG
LTEYTVDEENRLLMAGHLPGGKLAISLQISEKPFTV
>P33941 ~~~yojI~~~ABC transporter ATP-binding/permease protein YojI~~~COG4615
MELLVLVWRQYRWPFISVMALSLASAALGIGLIAFINQRLIETADTSLLVLPEFLGLLLLLMAVTLGSQLALTTLGHHFV
YRLRSEFIKRILDTHVERIEQLGSASLLAGLTSDVRNITIAFVRLPELVQGIILTIGSAAYLWMLSGKMLLVTAIWMAIT
IWGGFVLVARVYKHMATLRETEDKLYTDFQTVLEGRKELTLNRERAEYVFNNLYIPDAQEYRHHIIRADTFHLSAVNWSN
IMMLGAIGLVFWMANSLGWADTNVAATYSLTLLFLRTPLLSAVGALPTLLTAQVAFNKLNKFALAPFKAEFPRPQAFPNW
QTLELRNVTFAYQDNAFSVGPINLTIKRGELLFLIGGNGSGKSTLAMLLTGLYQPQSGEILLDGKPVSGEQPEDYRKLFS
AVFTDVWLFDQLLGPEGKPANPQLVEKWLAQLKMAHKLELSNGRIVNLKLSKGQKKRVALLLALAEERDIILLDEWAADQ
DPHFRREFYQVLLPLMQEMGKTIFAISHDDHYFIHADRLLEMRNGQLSELTGEERDAASRDAVARTA
>O31851 ~~~yojM~~~Superoxide dismutase-like protein YojM~~~COG2032
MHRLLLLMMLTALGVAGCGQKKPPDPPNRVPEKKVVETSAFGHHVQLVNREGKAVGFIEIKESDDEGLDIHISANSLRPG
ASLGFHIYEKGSCVRPDFESAGGPFNPLNKEHGFNNPMGHHAGDLPNLEVGADGKVDVIMNAPDTSLKKGSKLNILDEDG
SAFIIHEQADDYLTNPSGNSGARIVCGALLGNNEKQ
>O32003 2.3.1.-~~~yokD~~~SPbeta prophage-derived aminoglycoside N(3')-acetyltransferase-like protein YokD~~~COG2746
MKKIVESTTFPRTKQSITEDLKALGLKKGMTVLVHSSLSSIGWVNGGAVAVIQALIDVVTEEGTIVMPSQSVELSDPKEW
GNPPVPEEWWDIIRESMPAYNSNYTPTTRGMGQIVELFRSYPEVKRSNHPNYSFVAWGKHKNKILNQHPLEFGLGEQSPL
GKLYIRESYVLLLGADFDSSTCFHLAEYRIPYQKIINRGAPIIVEGKRVWKEYKELEFREELFQEVGQAFEAEHNMKVGK
VGSANCRLFSLTEAVDFAEKWFINNDSKNIKK
>O32001 3.1.-.-~~~yokF~~~SPbeta prophage-derived endonuclease YokF~~~COG1525
MKKVLLGFAAFTLSLSLAACSSNDSEKVSTEKETPQASTDVEKKTEQKESTKEKTADKSKEKDKKELVDVTLDRAVDGDT
IKVTYNGNVDTVRYLLIDTPETKKPNSCVQPYGEDASKRNKELVNSGKLQLEFDKGDRRDKYGRLLAYVYVDGKSVQETL
LKEGLARVAYVYEPNTKYIDQFKKDEQEAKSEKLSIWSKNGYVTDRGFNGCVKEKTTAVKKATTSKPAATQPTTPKASSE
TSTTTEKEASSETTGGTETFKNCTELRKKYPNGVPSSHPAYQSKMDRDHDNYACER
>O31998 ~~~yokI~~~Toxin YokI~~~COG5444
MKVFEADSLLSEADKRTKEYKELRSQMVKLKKAFKAVADLDDSKFSGKGADNIKAFYHDHVGVTDQWIDLIDMKIVFLSS
ISAKLEDAKMSDAYIEESFLEHELVNAYTKSKSIMSEQKKAMKDILNDINDILPLEIFSTEDFKDKLSSADDKREKTIDK
INKLDEDLKTEYAETEQNEQFIQQDFKKLQESTGKGKNATPIHYSAKAYRESDIHKKKGDIEQHSEAYLTVKKEEAKERE
IKELKKKLNDGVSDPDEYLEIAKKVGYENLEPAQVQLAVQIEQAKQLEGAGEITWDIVKGVGVGLYDVGKDTVTGLWDFI
TDPGETLSALGNAVIHPVKTYDAISAAIEESYQKDMVNGDAYSRSRWVTYAIGSVAAAVIGTKGAGAINKADAAGKVINK
ASQAGKKIKDVKIPDLLPYNPKYDLAMAGDVPYNVVDGENLKNQLMSFAKGSDKEVKPFDVVDYRPSNSPLENHHGVMDV
WAKHNVPNYVSRGSNTPTVALTKEQHNATKKVYREWLFEKTGKKVGGKVNWKEVSPREIQELTEKMFDAANVPKEARQQY
YNAFNQYNFRK
>O31997 ~~~yokJ~~~Immunity protein YokJ~~~
MSIDMLIKKIASTSDCRLFEADGLPVIDEKHQLPKDISEFYEQCGGAVLYENADYPIYIVRPAEFELANPIIVGELCEED
ISSEWYIVCTDGKGEYLTIDLNDQRKGKCYDSFFDRHGIVGETQVIASSFTDLIQRLLENKGKHWYWLRDDYVSLGDAYD
GIEIE
>O31994 ~~~yolA~~~SPbeta prophage-derived uncharacterized protein YolA~~~
MKKRITYSLLALLAVVAFAFTDSSKAKAAEALPLYYLQITGITSDGNDFAWDNLTSSQTKAPNVLKGNKLYVKARFMGYT
KLTVITGKDGKNLLYNGTAKMFKSDAILGQNKVVIGWDKYFEIPMDALQDNSIQIKALSSGTTFVYSQKIDFERE
>O31965 ~~~yomS~~~SPbeta prophage-derived uncharacterized protein YomS~~~
MTETTENVVITIPDKTSFTFHEAATSPSEGEEFVVGHFRELTVKISGSSTSREIKFYAVDENGEKTALSGTNKTDFQLGS
STLNTNEYWDFDIAGLFKVMFEVVSVTGDVTVKGIVVS
>O31947 ~~~yonK~~~SPbeta prophage-derived uncharacterized protein YonK~~~
MASKKVHQINVKGFFDMDVMEVTEQTKEAEYTYDFKEILSEFNGKNVSITVKEENELPVKGVE
>O31945 2.7.7.6~~~yonO~~~DNA-directed RNA polymerase YonO~~~
MKGKKDGLNKQVHIYSIDTSAFYNDQENKLHNKILKSYRYRDHLRKLEHVDKKHKKYITQRIISLKEKLYNAFNDHNQIR
TLRTDSLKDNNVISLFDSVLTRTLGIKENSLSEEIMVVQTYHFQILRDIIDKGFIHNNEKYVYFTSSAGQIRTKKSCFIK
QSTLDKYQNALTCGLSVEHINAQGGSSINKWNSYMALSNSASSSWEIDIDKAIVVNDLETNVSSLVDYIDRDTYEITRKI
MDIPIEHTDGCGMMLPSLSQKSFMVRLPWVKGLLVPFDFRKFAEKHSSFIVKDVYGKEWDIIKDDIQIIFTKSQFKMWKY
YDSWDDYRYKFKKYGCLGAKLNEEDPSVEGKLTYQMLQTLTDITDEELKQISSKTVSEITQLGTDKETMMKVLGATEKNK
HKTSLQEALLIYPELLNDDHTKEIIKNKKKSMIKDAKSGKLLVSDARYTYLCPDLYAFCERLFLGIESPKGLLSGSDVHC
SLYDEGYIDILRSPHLFREHGVRWNKKNEEYEKWFITPGVYTSIHDPISKLLQFDNDGDKALIISDELIVNIAKRNMADI
VPLYYEMSVAQKQEINSRNIYEALTLAYGINIGEYSNNITKIWNSDNINLDVIKWLCMENNFTIDFAKTLFMPTRPDHVD
EKIKDYIKNKVPHFFINAKDKEEHSVESINESTVNKLDSIIPSDRINFAAVAGKFDYRFLLKNKEIKLNEAVINEYKRLD
RNKKWLMNDEEAKPGQKLYVYKIIKQKLLEIHNDDGFITDVLVKHLYKKKSKYKSTLWECFGDIVLENIKHNLKTFKGCC
ICGKAFKPTSNKAKYCQSCGKKKERDKYKKYNKKRINHR
>P31492 ~~~yopE~~~Outer membrane virulence protein YopE~~~
MKISSFISTSLPLPASVSGSSSVGEMSGRSVSQQKSDQYANNLAGRTESPQGSSLASRIIERLSSMAHSVIGFIQRMFSE
GSHKPVVTPALTPAQMPSPTSFSDSIKQLAAETLPKYMQQLSSLDAETLQKNHDQFATGSGPLRGSITQCQGLMQFCGGE
LQAEASAILNTPVCGIPFSQWGTVGGAASAYVASGVDLTQAANEIKGLGQQMQQLLSLM
>P31493 ~~~yopE~~~Outer membrane virulence protein YopE~~~COG5599
MKISSFISTSLPLPTSVSGSSSVGEMSGRSVSQQTSDQYANNLAGRTESPQGSSLASRIIERLSSVAHSVIGFIQRMFSE
GSHKPVVTPAPTPAQMPSPTSFSDSIKQLAAETLPKYMQQLNSLDAEMLQKNHDQFATGSGPLRGSITQCQGLMQFCGGE
LQAEASAILNTPVCGIPFSQWGTIGGAASAYVASGVDLTQAANEIKGLAQQMQKLLSLM
>P08008 ~~~yopE~~~Outer membrane virulence protein YopE~~~
MKISSFISTSLPLPTSVSGSSSVGEMSGRSVSQQTSDQYANNLAGRTESPQGSSLASRIIERLSSVAHSVIGFIQRMFSE
GSHKPVVTPAPTPAQMPSPTSFSDSIKQLAAETLPKYMQQLNSLDAEMLQKNHDQFATGSGPLRGSITQCQGLMQFCGGE
LQAEASAILNTPVCGIPFSQWGTIGGAASAYVASGVDLTQAANEIKGLAQQMQKLLSLM
>P15273 3.1.3.48~~~yopH~~~Tyrosine-protein phosphatase YopH~~~
MNLSLSDLHRQVSRLVQQESGDCTGKLRGNVAANKETTFQGLTIASGARESEKVFAQTVLSHVANIVLTQEDTAKLLQST
VKHNLNNYELRSVGNGNSVLVSLRSDQMTLQDAKVLLEAALRQESGARGHVSSHSHSVLHAPGTPVREGLRSHLDPRTPP
LPPRERPHTSGHHGAGEARATAPSTVSPYGPEARAELSSRLTTLRNTLAPATNDPRYLQACGGEKLNRFRDIQCCRQTAV
RADLNANYIQVGNTRTIACQYPLQSQLESHFRMLAENRTPVLAVLASSSEIANQRFGMPDYFRQSGTYGSITVESKMTQQ
VGLGDGIMADMYTLTIREAGQKTISVPVVHVGNWPDQTAVSSEVTKALASLVDQTAETKRNMYESKGSSAVADDSKLRPV
IHCRAGVGRTAQLIGAMCMNDSRNSQLSVEDMVSQMRVQRNGIMVQKDEQLDVLIKLAEGQGRPLLNS
>P08538 3.1.3.48~~~yopH~~~Tyrosine-protein phosphatase YopH~~~
MNLSLSDLHRQVSRLVQQESGDCTGKLRGNVAANKETTFQGLTIASGARESEKVFAQTVLSHVANVVLTQEDTAKLLQST
VKHNLNNYDLRSVGNGNSVLVSLRSDQMTLQDAKVLLEAALRQESGARGHVSSHSHSALHAPGTPVREGLRSHLDPRTPP
LPPRERPHTSGHHGAGEARATAPSTVSPYGPEARAELSSRLTTLRNTLAPATNDPRYLQACGGEKLNRFRDIQCCRQTAV
RADLNANYIQVGNTRTIACQYPLQSQLESHFRMLAENRTPVLAVLASSSEIANQRFGMPDYFRQSGTYGSITVESKMTQQ
VGLGDGIMADMYTLTIREAGQKTISVPVVHVGNWPDQTAVSSEVTKALASLVDQTAETKRNMYESKGSSAVGDDSKLRPV
IHCRAGVGRTAQLIGAMCMNDSRNSQLSVEDMVSQMRVQRNGIMVQKDEQLDVLIKLAEGQGRPLLNS
>O68718 2.3.1.-~~~yopJ~~~Serine/threonine-protein acetyltransferase YopJ~~~
MIGPISQINISGGLSEKETSSLISNEELKNIITQLETDISDGSWFHKNYSRMDVEVMPALVIQANNKYPEMNLNLVTSPL
DLSIEIKNVIENGVRSSRFIINMGEGGIHFSVIDYKHINGKTSLILFEPANFNSMGPAMLAIRTKTAIERYQLPDCHFSM
VEMDIQRSSSECGIFSLALAKKLYIERDSLLKIHEDNIKGILSDGENPLPHDKLDPYLPVTFYKHTQGKKRLNEYLNTNP
QGVGTVVNKKNETIVNRFDNNKSIVDGKELSVSVHKKRIAEYKTLLKV
>A0A0N9NCU6 2.3.1.-~~~yopJ~~~Serine/threonine-protein acetyltransferase YopJ~~~
MIGPISQINISGGLSEKETSSLISNEELKNIITQLETDISDGSWFHKNYSRMDVEVMPALVIQANNKYPEMNLNLVTSPL
DLSIEIKNVIENGVRSSRFIINMGEGGIHFSVIDYKHINGKTSLILFEPANFNSMGPAMLAIRTKTAIERYQLPDCHFSM
VEMDIQRSSSECGIFSFALAKKLYIERDSLLKIHEDNIKGILSDGENPLPHDKLDPYLPVTFYKHTQGKKRLNEYLNTNP
QGVGTVVNKKNETIVNRFDNNKSIVDGKELSVSVHKKRIAEYKTLLKV
>P0DUD0 2.3.1.-~~~yopJ~~~Serine/threonine-protein acetyltransferase YopJ~~~
MIGPISQINISGGLSEKETSSLISNEELKNIITQLETDISDGSWFHKNYSRMDVEVMPALVIQANNKYPEMNLNLVTSPL
DLSIEIKNVIENGVRSSRFIINMGEGGIHFSVIDYKHINGKTSLILFEPANFNSMGPAMLAIRTKTAIERYQLPDCHFSM
VEMDIQRSSSECGIFSFALAKKLYIERDSLLKIHEDNIKGILSDGENPLPHDKLDPYLPVTFYKHTQGKKRLNEYLNTNP
QGVGTVVNKKNETIVNRFDNNKSIVDGKELSVSVHKKRIAEYKTLLKV
>O31927 ~~~yopK~~~SPbeta prophage-derived uncharacterized protein YopK~~~
MELIRIAMKKDLENDNSLMNKWATVAGLKNPNPLYDFLNHDGKTFNEFSSIVNIVKSQYPDREYELMKDYCLNLDVKTKA
ARSALEYADANMFFEIEDVLIDSMISCSNMKSKEYGKVYKIHRELSNSVITEFEAVKRLGKLNIKTPEMNSFSRLLLLYH
YLSTGNFSPMAQLIKQIDLSEISENMYIRNTYQTRVHVLMSNIKLNENSLEECREYSKKALESTNILRFQVFSYLTIGNS
LLFSNYELAQENFLKGLSISVQNENYNMIFQQALCFLNNVWRKENKWINFESDSIMDLQEQAHCFINFNENSKAKEVLDK
LDLLVHNDNELAMHYYLKGRLEQNKACFYSSIEYFKKSNDKFLIRLPLLELQKMGENQKLLELLLL
>P17778 ~~~yopM~~~Outer membrane protein YopM~~~COG4886
MFINPRNVSNTFLQEPLRHSSNLTEMPVEAENVKSKTEYYNAWSEWERNAPPGNGEQREMAVSRLRDCLDRQAHELELNN
LGLSSLPELPPHLESLVASCNSLTELPELPQSLKSLLVDNNNLKALSDLPPLLEYLGVSNNQLEKLPELQNSSFLKIIDV
DNNSLKKLPDLPPSLEFIAAGNNQLEELPELQNLPFLTAIYADNNSLKKLPDLPLSLESIVAGNNILEELPELQNLPFLT
TIYADNNLLKTLPDLPPSLEALNVRDNYLTDLPELPQSLTFLDVSENIFSGLSELPPNLYYLNASSNEIRSLCDLPPSLE
ELNVSNNKLIELPALPPRLERLIASFNHLAEVPELPQNLKQLHVEYNPLREFPDIPESVEDLRMNSERVVDPYEFAHETT
DKLEDDVFE
>P16945 ~~~yopN~~~Outer membrane protein YopN~~~
MTTLHNLSYGNTPLRNEHPEIASSQIVNQTLGQFRGESVQIVSGTLQSIADMAEEVTFVFSERKELSLDKRKLSDSQARV
SDVEEQVNQYLSKVPELKQKQNVSELLSLLSNSPNISLSQLKAYLEGKSEEPSEQFKMLCGLRDALKGRPELAHLLHLVE
QALVSMVEEQEEAIVLGARITPEAYRESQSGVNPLQPLRDTYRDAVMGYQGINAIWSDLQKRFPNGDIDSVILFLQKALS
ADLQSQQSGSEREKLEIVISDLQKLKEFRSVSDQVKGFWQLFSEGITNGLRPF
>P68640 ~~~yopN~~~Outer membrane protein YopN~~~
MTTLHNLSYGNTPLHNERPEIASSQIVNQTLGQFRGESVQIVSGTLQSIADMAEEVTFVFSERKELSLDKRKLSDSQARV
SDVEEQVNQYLSKVPELEQKQNVSELLSLLSNSPNISLSQLKAYLEGKSEEPSEQFKMLCGLRDALKGRPELAHLSHLVE
QALVSMAEEQGETIVLGARITPEAYRESQSGVNPLQPLRDTYRDAVMGYQGIYAIWSDLQKRFPNGDIDSVILFLQKALS
ADLQSQQSGSGREKLGIVISDLQKLKEFGSVSDQVKGFWQFFSEGKTNGVRPF
>Q663K1 ~~~yopN~~~Outer membrane protein YopN~~~
MTTLHNLSYGNTPLHNERPEIASSQIVNQTLGQFRGESVQIVSGTLQSIADMAEEVTFVFSERKELSLDKRKLSDSQARV
SDVEEQVNQYLSKVPELEQKQNVSELLSLLSNSPNISLSQLKAYLEGKSEEPSEQFKMLCGLRDALKGRPELAHLSHLVE
QALVSMAEEQGETIVLGARITPEAYRESQSGVNPLQPLRDTYRDAVMGYQGIYAIWSDLQKRFPNGDIDSVILFLQKALS
ADLQSQQSGSGREKLGIVINDLQKLKEFGSVSDQVKGFWQFFSEGKTNGVRPF
>Q01249 ~~~yscH~~~Type 3 secretion system regulator YopR~~~
MTVTLNRGSITSLMSSSQAVSTLQPAASELKTQLEHKLKSESAEKTREVLWQQYYASNPPDHAVLEVLATPVREALLARF
GQHQGPVVPAIDLPELRSVLQQFDSFGKRREAILLQVLEGIKPNESQVGLPYLSELINKELMILLPYNSIVDSLLHNSHQ
IDMET
>P68590 ~~~yscH~~~Type 3 secretion system regulator YopR~~~
MTVTLNRGSITSLMSSSQAVSTLQPVASELKTQLENKLKSESAEKTREVLWQQYYASNPPDHAVLEVLATPVREALLARF
GQHQGSVVPAIDLPELRSVLQQFDSFGKRWEAILLQVLEGIKPNESQVGLPYLSELINKELMILLPSNSIVDSLLHNSHQ
IDMDT
>Q663I2 ~~~yscH~~~Type 3 secretion system regulator YopR~~~
MTVTLNRGSITSLMSSSQAVSTLQPVASELKTQLENKLKSESAEKTREVLWQQYYASNPPDHAVLEVLAMPVREALLARF
GQHQGSVVPAIDLPELRSVLQQFDSFGKRWEAILLQVLEGIKPNESQVGLPYLSELINKELMILLPSNSIVDSLLHNSHQ
IDMDT
>O34498 ~~~yopT~~~SPbeta prophage-derived uncharacterized protein YopT~~~
MAGYLNNIALNLEIVLKNKADSPEVSETLVTRICENLLLSKEVSFLKADGSVENFKLSDMEYEITNTEELPE
>O68703 3.4.22.-~~~yopT~~~Cysteine protease YopT~~~COG3177
MNSIHGHYHIQLSNYSAGENLQSATLTEGVIGAHRVKVETALSHSNLQKKLSATIKHNQSGRSMLDRKLTSDGKANQRSS
FTFSMIMYRMIHFVLSTRVPAVRESVANYGGNINFKFAQTKGAFLHKIIKHSDTASGVCEALCAHWIRSHAQGQSLFDQL
YVGGRKGKFQIDTLYSIKQLQIDGCKADVDQDEVTLDWFKKNGISERMIERHCLLRPVDVTGTTESEGLDQLLNAILDTH
GIGYGYKKIHLSGQMSAHAIAAYVNEKSGVTFFDPNFGEFHFSDKEKFRKWFTNSFWGNSMYHYPLGVGQRFRVLTFDSK
EV
>Q93RN4 3.4.22.-~~~yopT~~~Cysteine protease YopT~~~
MNSIHGHYHIQLSNYSAGENLQSATLTEGVIGAHRVKVETALSHSNLQKKLSATIKHNQSGRSMLDRKLTSDGKANQRSS
FTFSMIMYRMIHFVLSTRVPAVRESVANYGGNINFKFAQTKGAFLHKIIKHSDTASGVCEALCAHWIRSHAQGQSLFDQL
YVGGRKGKFQIDTLYSIKQLQIDGCKADVDQDEVTLDWFKKNGISERMIERHCLLRPVDVTGTTESEGLDQLLNAILDTH
GIGYGYKKIHLSGQMSAHAIAAYVNEKSGVTFFDPNFGEFHFSDKEKFRKWFTNSFWDNSMYHYPLGVGQRFRVLTFDSK
EV
>O34401 ~~~yopX~~~SPbeta prophage-derived uncharacterized protein YopX~~~
MNTAYRVWDGEQMHYWDDEGLSLIIKSNGDWTLKRLYTDVLVPVVDSTNRNAALMWGAKVRGKFIYDRSIVKITSDDKES
SDVCEVKFSDGVFQVDVSKISADYDVTAVGWVEYATIEVIGDVYQNPELLEGVK
>O31903 3.1.-.-~~~yorK~~~Putative SPbeta prophage-derived single-strand DNA-specific exonuclease YorK~~~COG0608
MEYRLIGDNDYNFDPLATILKNRGIEDPKLFVNVDQSSVIHYSKLNNIDKAADCLIRHLNNKNKLFVQVDSDVDGYTSSS
IIINYIKKICPKANIHYRIQDGKEHGIFIDTIPDDVDLVIIPDAGSSQFEEHEALNKRGTEIIVIDHHECERVSEHAIVV
NNQLSPNYSNKTLTGAGMAYKFCQAIDEKLNKNEAEQFLDLVSIGNIADSADSRNLETRYFMNEGLKKIKHPLLKKLFKK
QEFSTKGDKSIQNTQFFINPLINAAIRVGSSEEKDQMMRAFLLSKEKVPYKKRGQSETELVSIHEDTVRILGNLKAKQKR
IVDAAGVEIKNRIEEKSLTANKVLIVYIEGILDKSLTGLVANQLAEEYKKPVLLARNDPEKGKDILSGSIRGYDKGFIKD
FKKELIDTGLFEFVEGHPNAAGFAIKRQNLILVNKVLNEKFKDINIEEDIQNVDFEIPAKRLRKEFILQLDGYKDYWGYK
VEEPLIAITDLEIEVEQIEHLGKKNKTTVKFKHGDIEYIRFKSDENYFNQLTASNGTLVINVIGKAKANEYKGKKTPQIE
IYELEVVRTKQKELVF
>O31898 ~~~yorP~~~SPbeta prophage-derived uncharacterized protein YorP~~~
MPKYWSYPVGLAVEINNNARYGCPHHVGRKGKIIEHLHSATYDYAVSDETGDITYFKEHELTPLKGGLAYV
>O31896 ~~~yorR~~~SPbeta prophage-derived uncharacterized protein YorR~~~COG0125
MTLIILEGPDCCFKSTVAAKLSKELKYPIIKGSSFELAKSGNEKLFEHFNKLADEDNVIIDRFVYSNLVYAKKFKDYSIL
TERQLRFIEDKIKAKAKVVYLHADPSVIKKRLRVRGDEYIEGKDIDSILELYREVMSNAGLHTYSWDTGQWSSDEIAKDI
IFLVE
>O34919 3.6.1.23~~~yosS~~~SPbeta prophage-derived deoxyuridine 5'-triphosphate nucleotidohydrolase YosS~~~COG0756
MQIKIKYLDETQTRINKMEQGDWIDLRAAEDVAIKKDEFKLVPLGVAMELPEGYEAHVVPRSSTYKNFGVIQTNSMGVID
ESYKGDNDFWFFPAYALRDTKIKKGDRICQFRIMKKMPAVDLIEVDRLGNGDRGGHGSTGTK
>O31864 ~~~yozE~~~UPF0346 protein YozE~~~COG4479
MKSFYHYLLKYRHPKPKDSISEFANQAYEDHSFPKTSTDYHEISSYLELNADYLHTMATFDEAWDQYESEVHGR
>P14503 ~~~~~~Uncharacterized 27.7 kDa protein~~~
MNNEKNKQDRENLNRQDERKSSEIKSERKSGLDLIEVRKQLDRIESVANEKNENESEKLENLKNRIEELEKSEQDRNERT
NILMKRLEASTSNFNNKLDVFEEKTKHARLNFETTAKHYIKRLDEDNLKLDFQQAIQDELSDTKDEIREVTKQAREETKE
YKEILESKIKDHNKVVDKSNTALKVMTKGVTNIFFVLIIFVLVMLVTGPIVISLVLNIYIALLMVLLMTTKVHGDI
>P50733 ~~~ypbG~~~Uncharacterized protein YpbG~~~COG1408
MKLSVKIAGVLTVAAAAMTAKMYATAKGNHLKTHTFPLSKMKGKPPLTIFFISDIHKRLIDQDLLEKARSHAPHLVIIGG
DLAEGGVPSARIEENIKRLVHFGVPIVFVWGNNDYEVRQHKLYSIFKAHGVITLRNESVPFSYNGHTIAIAGVDDIRMEM
DHYEEAIKELDESQLNILVCHNPEIHEQINEDDGIDVILSGHTHGGQIRFGKFGPYELGKTGIVKNAAYLISNGYGTTKV
PLRLGAEPETHIVTLCGPE
>P0AA93 2.7.13.3~~~ypdA~~~Sensor histidine kinase YpdA~~~COG3275
MHEIFNMLLAVFDRAALMLICLFFLIRIRLFRELLHKSAHSPKELLAVTAIFSLFALFSTWSGVPVEGSLVNVRIIAVMS
GGILFGPWVGIITGVIAGIHRYLIDIGGVTAIPCFITSILAGCISGWINLKIPKAQRWRVGILGGMLCETLTMILVIVWA
PTTALGIDIVSKIGIPMILGSVCIGFIVLLVQSVEGEKEASAARQAKLALDIANKTLPLFRHVNSESLRKVCEIIRDDIH
ADAVAITNTDHVLAYVGVGEHNYQNGDDFISPTTRQAMNYGKIIIKNNDEAHRTPEIHSMLVIPLWEKGVVTGTLKIYYC
HAHQITSSLQEMAVGLSQIISTQLEVSRAEQLREMANKAELRALQSKINPHFLFNALNAISSSIRLNPDTARQLIFNLSR
YLRYNIELKDDEQIDIKKELYQIKDYIAIEQARFGDKLTVIYDIDEEVNCCIPSLLIQPLVENAIVHGIQPCKGKGVVTI
SVAECGNRVRIAVRDTGHGIDPKVIERVEANEMPGNKIGLLNVHHRVKLLYGEGLHIRRLEPGTEIAFYIPNQRTPVASQ
ATLLL
>P0AE39 ~~~ypdB~~~Transcriptional regulatory protein YpdB~~~COG3279
MKVIIVEDEFLAQQELSWLIKEHSQMEIVGTFDDGLDVLKFLQHNRVDAIFLDINIPSLDGVLLAQNISQFAHKPFIVFI
TAWKEHAVEAFELEAFDYILKPYQESRITGMLQKLEAAWQQQQTSSTPAATVTRENDTINLVKDERIIVTPINDIYYAEA
HEKMTFVYTRRESYVMPMNITEFCSKLPPSHFFRCHRSFCVNLNKIREIEPWFNNTYILRLKDLDFEVPVSRSKVKEFRQ
LMHL
>P77585 3.4.11.-~~~ypdE~~~Aminopeptidase YpdE~~~COG1363
MDLSLLKALSEADAIASSEQEVRQILLEEADRLQKEVRFDGLGSVLIRLNESTGPKVMICAHMDEVGFMVRSISREGAID
VLPVGNVRMAARQLQPVRITTREECKIPGLLDGDRQGNDVSAMRVDIGARSYDEVMQAGIRPGDRVTFDTTFQVLPHQRV
MGKAFDDRLGCYLLVTLLRELHDAELPAEVWLVASSSEEVGLRGGQTATRAVSPDVAIVLDTACWAKNFDYGAANHRQIG
NGPMLVLSDKSLIAPPKLTAWVETVAAEIGVPLQADMFSNGGTDGGAVHLTGTGVPTVVMGPATRHGHCAASIADCRDIL
QMQQLLSALIQRLTRETVVQLTDFR
>P76524 3.4.11.-~~~ypdF~~~Aminopeptidase YpdF~~~COG0006
MTLLASLRDWLKAQQLDAVLLSSRQNKQPHLGISTGSGYVVISRESAHILVDSRYYVEVEARAQGYQLHLLDATNTLTTI
VNQIIADEQLQTLGFEGQQVSWETAHRWQSELNAKLVSATPDVLRQIKTPEEVEKIRLACGIADRGAEHIRRFIQAGMSE
REIAAELEWFMRQQGAEKASFDTIVASGWRGALPHGKASDKIVAAGEFVTLDFGALYQGYCSDMTRTLLVNGEGVSAESH
LLFNVYQIVLQAQLAAISAIRPGVRCQQVDDAARRVITEAGYGDYFGHNTGHAIGIEVHEDPRFSPRDTTTLQPGMLLTV
EPGIYLPGQGGVRIEDVVLVTPQGAEVLYAMPKTVLLTGEA
>C1P610 ~~~ypdK~~~Uncharacterized membrane protein YpdK~~~
MKYFFMGISFMVIVWAGTFALMI
>P76539 2.3.1.-~~~ypeA~~~Acetyltransferase YpeA~~~COG0456
MEIRVFRQEDFEEVITLWERCDLLRPWNDPEMDIERKMNHDVSLFLVAEVNGDVVGTVMGGYDGHRGSAYYLGVHPEFRG
RGIANALLNRLEKKLIARGCPKIQINVPEDNDMVLGMYERLGYEHADVLSLGKRLIEDEEY
>P63422 2.3.1.-~~~ypeA~~~Acetyltransferase YpeA~~~
MEIRVFRQEDFEEVITLWERCDLLRPWNDPEMDIERKMNHDVSLFLVAEVNGEVVGTVMGGYDGHRGSAYYLGVHPEFRG
RGIANALLNRLEKKLIARGCPKIQINVPEDNDMVLGMYERLGYEHADVLSLGKRLIEDEEY
>P38490 ~~~ypeB~~~Sporulation protein YpeB~~~COG2959
MIRGILIAVLGIAIVGTGYWGYKEHQEKDAVLLHAENNYQRAFHELTYQVDQLHDKIGTTLAMNSQKSLSPALIDVWRIT
SEAHNSVSQLPLTLMPFNKTEELLSKIGDFSYKTSVRDLDQKPLDKNEYTSLNKLYQQSEDIQNELRHVQHLVMSKNLRW
MDVEMALASDEKQSDNTIINSFKTVEKNVGAFSTGTDLGPSFTSTKKEEKGFSHLKGKQISEQEAKQIAERFAPDDNYSI
KVVKSGKKTNRDVYSISMKDPDHKAVIYMDITKKGGHPVYLIQNREVKDQKISLNDGSNRALAFLKKNGFETDDLEIDES
AQYDKIGVFSYVPVENKVRMYPEAIRMKVALDDGEVVGFSARDFLTSHRKRTIPKPAITEAEAKSKLNKNVQVRETRLAL
ITNELGQEVLCYEMLGTIENDTFRMYINAKDGSEEKVEKLKNAEPIYKDL
>P0DUW4 ~~~ypeD~~~Protein YpeD~~~
MNLIIILVWHSMKHSKGYVLEAQDDELSR
>P38491 ~~~ypfA~~~Uncharacterized protein YpfA~~~COG5581
MIEIGENVLLEYIEENELKKAKSKAVSIENNELLIAYPVDVVTGRTVILHNDMEVTVEFVGKDEVPYRFISRIKGKVKDK
LQMICLEMPPREKMKRIQRRQYVRTDAVLDVQIQPGNEEEIRTLSYNISAGGIAVVLADGLSFQSGESLRLIIRLPEEEH
TRQIETEAVVRRIFNDPKSEKRKMTLEYSEIAAGDQQALLQYCIRRQLNKRRKARME
>P0AD47 ~~~yphA~~~Inner membrane protein YphA~~~COG2259
MNTLRYFDFGAARPVLLLIARIAVVLIFIIFGFPKMMGFDGTVQYMASLGAPMPMLAAIIAVVMEVPAAILIVLGFFTRP
LAVLFIFYTLGTAVIGHHYWDMTGDAVGPNMINFWKNVSIAGAFLLLAITGPGAISLDRR
>P76584 ~~~yphB~~~Uncharacterized protein YphB~~~COG2017
MTIYTLSHGSLKLDVSDQGGVIEGFWRDTTPLLRPGKKSGVATDASCFPLVPFANRVSGNRFVWQGREYQLQPNVEWDAH
YLHGDGWLGEWQCVSHSDDSLCLVYEHRSGVYHYRVSQAFHLTADTLTVTLSVTNQGAETLPFGTGWHPYFPLSPQTRIQ
AQASGYWLEREQWLAGEFCEQLPQELDFNQPAPLPRQWVNNGFAGWNGQARIEQPQEGYAIIMETTPPAPCYFIFVSDPA
FDKGYAFDFFCLEPMSHAPDDHHRPEGGDLIALAPGESTTSEMSLRVEWL
>P77269 ~~~yphF~~~ABC transporter periplasmic-binding protein YphF~~~COG1879
MPTKMRTTRNLLLMATLLGSALFARAAEKEMTIGAIYLDTQGYYAGVRQGVQDAAKDSSVQVQLIETNAQGDISKESTFV
DTLVARNVDAIILSAVSENGSSRTVRRASEAGIPVICYNTCINQKGVDKYVSAYLVGDPLEFGKKLGNAAADYFIANKID
QPKIAVINCEAFEVCVQRRKGFEEVLKSRVPGAQIVANQEGTVLDKAISVGEKLIISTPDLNAIMGESGGATLGAVKAVR
NQNQAGKIAVFGSDMTTEIAQELENNQVLKAVVDISGKKMGNAVFAQTLKVINKQADGEKVIQVPIDLYTKTEDGKQWLA
THVDGLP
>P54389 ~~~ypiA~~~TPR repeat-containing protein YpiA~~~COG0457
MNTLIQEAIKLVEAGETEKGLNTLSKAEKQLHDEDKAIAAQLYYEWGDVEKAISLISDLHDLYPNETELTNFYAELLIDI
DEEEKALAVLETIPETDPSYPESLLLMADLYQMQGLFEVSEQKLFQAKSILDNEPVIDFALGELYFAQGAYAKAVQYFKT
TAEEQSEIGGVNVHQRLAESLSASGEFEDAIPWYEKAVDENPDPNTIFGYGFTALQAGLVKTAIKQLSDLKELDPSYTSL
YMPLSKSYEAEGMYEEALKTAKEGIRYDEYNKELFLYAAKMALKIGKSEEGKKLLQEALALDPGFVEALHTLLAVYHKEE
DYDQIIDLIQEVRSYGEEDPKYNWYLASAYTELEQYEEAKQSFEAAYLHYREDRDFLYEYASFLLEEGLQKEALPLLKKV
LEMDGANEELEETILRIEDEFSR
>P42979 ~~~ypjD~~~Uncharacterized protein YpjD~~~COG1694
MSDKTMKDIQAEVDRYIGQFKEGYFSPLAMMARLTEELGELAREVNHRYGEKPKKATEDDKSMEEEIGDVLFVLVCLANS
LDISLEEAHDRVMHKFNTRDKDRWTRKEEGK
>P64432 ~~~ypjD~~~Inner membrane protein YpjD~~~COG4137
MPVFALLALVAYSVSLALIVPGLLQKNGGWRRMAIISAVIALVCHAIALEARILPDGDSGQNLSLLNVGSLVSLMICTVM
TIVASRNRGWLLLPIVYAFALINLALATFMPNEYITHLEATPGMLVHIGLSLFSYATLIIAALYALQLAWIDYQLKNKKL
AFNQEMPPLMSIERKMFHITQIGVVLLTLTLCTGLFYMHNLFSMENIDKAVLSIVAWFVYIVLLWGHYHEGWRGRRVVWF
NVAGAVILTLAYFGSRIVQQLIS
>Q46953 ~~~ypjF~~~Toxin YpjF~~~
MNTLPATISQAAKPCLSPVAVWQMLLTRLLEQHYGLTLNDTPFSDETVIKEHIDAGITLADAVNFLVEKYELVRIDHRGF
SWQQQSPYISVVDILRARRSTGLLKTNVK
>P54173 ~~~ypjQ~~~Uncharacterized protein YpjQ~~~COG1267
MKKYTMNEMVDITKDMLNKRGVMIEDIARIVQKLQEKYNPNLPLSVCMENVEKVLNKREIIHAVLTGLALDQLAEQKLLP
EPLQHLVETDEPLYGIDEIIPLSIVNVYGSIGLTNFGYLDKEKIGIIKELDESPDGIHTFLDDIVAALAAAAASRIAHTH
QDLQDEEKEQDEKPVVS
>Q9RI12 2.7.11.1~~~ypkA~~~Protein kinase YpkA~~~COG0515
MKSVKIMGTMPPSISLAKAHERISQHWQNPVGELNIGGKRYRIIDNQVLRLNPHSGFSLFREGVGKIFSGKMFNFSIARN
LTDTLHAAQKTTSQELRSDIPNALSNLFGAKPQTELPLGWKGEPLSGAPDLEGMRVAETDKFAEGESHISIIETKDKQRL
VAKIERSIAEGHLFAELEAYKHIYKTAGKHPNLANVHGMAVVPYGNRKEEALLMDEVDGWRCSDTLRTLADSWKQGKINS
EAYWGTIKFIAHRLLDVTNHLAKAGVVHNDIKPGNVVFDRASGEPVVIDLGLHSRSGEQPKGFTESFKAPELGVGNLGAS
EKSDVFLVVSTLLHCIEGFEKNPEIKPNQGLRFITSEPAHVMDENGYPIHRPGIAGVETAYTRFITDILGVSADSRPDSN
EARLHEFLSDGTIDEESAKQILKDTLTGEMSPLSTDVRRITPKKLRELSDLLRTHLSSAATKQLDMGGVLSDLDTMLVAL
DKAEREGGVDKDQLKSFNSLILKTYRVIEDYVKGREGDTKNSSTEVSPYHRSNFMLSIVEPSLQRIQKHLDQTHSFSDIG
SLVRAHKHLETLLEVLVTLSQQGQPVSSETYGFLNRLTEAKITLSQQLNTLQQQQESAKAQLSILINRSGSWADVARQSL
QRFDSTRPVVKFGTEQYTAIHRQMMAAHAAITLQEVSEFTDDMRNFTVDSIPLLIQLGRSSLMDEHLVEQREKLRELTTI
AERLNRLEREWM
>Q05608 2.7.11.1~~~ypkA~~~Protein kinase YpkA~~~
MKSVKIMGTMPPSISLAKAHERISQHWQNPVGELNIGGKRYRIIDNQVLRLNPHSGFSLFREGVGKIFSGKMFNFSIARN
LTDTLHAAQKTTSQELRSDIPNALSNLFGAKPQTELPLGWKGEPLSGAPDLEGMRVAETDKFAEGESHISIIETKDKQRL
VAKIERSIAEGHLFAELEAYKHIYKTAGKHPNLANVHGMAVVPYGNRKEEALLMDEVDGWRCSDTLRTLADSWKQGKINS
EAYWGTIKFIAHRLLDVTNHLAKAGVVHNDIKPGNVVFDRASGEPVVIDLGLHSRSGEQPKGFTESFKAPELGVGNLGAS
EKSDVFLVVSTLLHCIEGFEKNPEIKPNQGLRFITSEPAHVMDENGYPIHRPGIAGVETAYTRFITDILGVSADSRPDSN
EARLHEFLSDGTIDEESAKQILKDTLTGEMSPLSTDVRRITPKKLRELSDLLRTHLSSAATKQLDMGGVLSDLDTMLVAL
DKAEREGGVDKDQLKSFNSLILKTYRVIEDYVKGREGDTKNSSTEVSPYHRSNFMLSIVEPSLQRIQKHLDQTHSFSDIG
SLVRAHKHLETLLEVLVTLSQQGQPVSSETYGFLNRLAEAKITLSQQLNTLQQQQESAKAQLSILINRSGSWADVARQSL
QRFDSTRPVVKFGTEQYTAIHRQMMAAHAAITLQEVSEFTDDMRNFTVDSIPLLIQLGRSSLMDEHLVEQREKLRELTTI
AERLNRLEREWM
>P54156 ~~~yplP~~~Putative sigma L-dependent transcriptional regulator YplP~~~COG1221
MNSAPKLNTFQHLIGEHQTFLEAKRIAKQFSLSELPVLITGKIGTGKNHFAHAIHLESSRSNEPFISVNCSTHSEETLIH
ELFGPNGNTGVFQKAVRGTLFLDDVWRMPASVQAQLLKALDSDTEKPRMICASADRSVEHTFRQDLFYRLNILTLTLPEL
SERKSDIPLLTQHFLSNSGQQLLIDPSVFPVLEKHAFEGNVRELKNAADYMAAVSSGGTIQPYDLPPYIRGTIDGKTSKK
KAKLLTLMEKAEFLFILETIKVLNEKGEPASRRIISEHSKNTQTSLTPQQVRSRLDYLEKKDYVTKSRGRAGTKITFEGL
SFIETLKNQMI
>P54396 ~~~ypmB~~~Uncharacterized protein YpmB~~~COG5353
MRKKALIFTVIFGIIFLAVLLVSASIYKSAMAQKEEGHEAAAAEAKKETDLAHVDQVETFVGKEKYYVVKGTDKKGTALY
VWVPADKKAKILSKEAKEGISEDKAAKIIKDEGLVSKQKEVHLAREGNVLLWEVTYLDKEGQYSLSYVDFTTGKILKNIT
P
>P39789 ~~~ypoC~~~Uncharacterized protein YpoC~~~
MTQAKEVLASYEQYLRSLGQKSSSDMKKTLQTNPVYFDFCTELQGDLPWEDSGKYVPLLFEVWDDIKASLLPVFQTRKSR
CDQNEMLKGIVCLLASLHWTAGEPVKSLDWQELREKSYPAKPINWAERVEFILLKPTQYHCFIQLDELITEMKKHFYKYH
AMNR
>P50833 ~~~yppE~~~Uncharacterized protein YppE~~~
MLSQTLLEMTEQMIEVAEKGADRYQEGKNSNHSYDFFETIKPAVEENDELAARWAEGALELIKVRRPKYVHKEQIEAVKD
NFLELVLQSYVHHIHKKRFKDITESVLYTLHAVKDEIAREDSR
>P50830 3.6.4.-~~~yprA~~~Uncharacterized ATP-dependent helicase YprA~~~COG1111
MKKKSLTELISDLKGNENVVNWHEIEPREAKTRPMPESIDERIKAALSKRGIDELYTHQYSAFQYVQKGESIVTVTPTAS
GKTLCYNLPVLQSIAQDETNRALYLFPTKALAQDQKSELNEIIDEMGIDIKSFTYDGDTSPAIRQKVRKAGHIVITNPDM
LHSAILPHHTKWVSLFENLKYIVIDELHTYRGVFGSHVANVIRRLKRICRFYGSDPVFICTSATIANPKELGEQLTGKPM
RLVDDNGAPSGRKHFVFYNPPIVNKPLNIRRSATAEVNELAKEFLKNKVQTIVFARSRVRVEIILSHIQELVKKEIGTKS
IRGYRGGYLPKERREIERGLREGDILGVVSTNALELGVDIGQLQVCVMTGYPGSVASAWQQAGRAGRRHGESLIIMVANS
TPIDQYIVRHPEYFFNRSPESARINPENLIILVDHLKCAAYELPFRADEEFGAMEVSDILEYLQEEAVLHRNGERYHWAS
ESFPASNISLRSASQENVVIVDQSDIANVRIIGEMDRFSAMTLLHDEAIYLHEGVQYQVEKLDWDHKKAYVRKVDVEYYT
DANLAVQLKVLEIDKTKEKSRTSLHYGDVTVNALPTIFKKIKMTTFENIGSGPIHLPEEELHTSAAWLEIKTADEDIGEK
TLEQLLLGISNVLQHIVPVYIMCDRNDVHVVSQIKAAHTGLPTIFLYDHYPGGIGLAEEVFKRFSDINEAAKQLITHCPC
HDGCPSCIGTEIEGIKAKERILQLLDQMS
>P50838 ~~~ypsA~~~UPF0398 protein YpsA~~~COG4474
MKVLAITGYKPFELGIFKQDDKALYYIKKAIKNRLIAFLDEGLEWILISGQLGVELWAAEAAYDLQEEYPDLKVAVITPF
YEQEKNWKEPNKEQYEAVLAQADYEASLTHRPYESPLQFKQKNQFFIDKSDGLLLLYDPEKEGSPKYMLGTAEKRREQDG
YPIYFITMDDLRVTVEEDSY
>P50840 2.1.1.-~~~ypsC~~~Putative RNA methyltransferase YpsC~~~COG0116
MKKYTLIATAPMGIEAVVAKEVRDLGYECKVDNGKVIFEGDALAICRANLWLRTADRIKVQVASFKAKTFDELFEKTKAI
NWRSFIPENGKFPVIGKSVKSTLASVPDCQRIVKKAIVEKLKLQSGKANDWIEETGAEYKVEISLLKDQALITLDSSGTG
LHKRGYRVDQGGAPIKETLAAALVQLTNWTPDRPFVDPFCGSGTIAIEAALIGQNIAPGFNRDFVSEDWEWIGKDLWNKA
RLEVEEKANYDQPLTIFASDIDHRMVQIAKENAEEAGLGDLIQFKQMQVKDFTTNLEFGVIVGNPPYGERLGEKKAVEQM
YKEMGQAFEPLDTWSVYMLTSNENFEEAYGRKATKKRKLFNGFIKTDYYQYWSKVRPQRKKTENA
>P31847 ~~~ypuA~~~Uncharacterized protein YpuA~~~COG4086
MKKIWIGMLAAAVLLLMVPKVSLADAAVGDVIVTLGADLSESDKQKVLDEMNVPDNATTVTVTNKEEHEYLGKYISNAQI
GSRAISSSSITIAKKGSGLNVETHNISGITDEMYLNALMTAGVKDAKVYVTAPFEVSGTAALTGLIKAYEVSSDEAISED
VKQVANQELVTTSELGDKIGNENAAALIAKIKEEFAKNGVPDNKADIEKQVDDAASDLNVTLTDSQKNQLVSLFNKMKNA
DIDWGQVSDQLDKAKDKITKFIESDEGKNFIQKVIDFFVSIWNAIVSIFK
>P0ADR0 ~~~yqaA~~~Inner membrane protein YqaA~~~COG1238
MSEALSLFSLFASSFLSATLLPGNSEVVLVAMLLSGISHPWVLVLTATMGNSLGGLTNVILGRFFPLRKTSRWQEKATGW
LKRYGAVTLLLSWMPVVGDLLCLLAGWMRISWGPVIFFLCLGKALRYVAVAAATVQGMMWWH
>P77475 3.1.3.-~~~yqaB~~~Fructose-1-phosphate phosphatase YqaB~~~COG0637
MYERYAGLIFDMDGTILDTEPTHRKAWREVLGHYGLQYDIQAMIALNGSPTWRIAQAIIELNQADLDPHALAREKTEAVR
SMLLDSVEPLPLVDVVKSWHGRRPMAVGTGSESAIAEALLAHLGLRHYFDAVVAADHVKHHKPAPDTFLLCAQRMGVQPT
QCVVFEDADFGIQAARAAGMDAVDVRLL
>P45906 ~~~yqaI~~~Uncharacterized protein YqaI~~~
MVENPMVINNWHDKLTETDVQIDFYGDEVTPVDDYVIDGGEIILRENLERYLREQLGFEFKNAQ
>P45920 ~~~yqbD~~~Uncharacterized protein YqbD~~~COG0338
MPRELVNAKITHVSYVDKAANQKQFFIVKSEKQPDFQKEVRILAKEADEQKLVYGIVYEPDTVDAHGDFMTAAEIEKAAH
GFLKDARQIDKQHDFQGGVGEVVESYVAPADFEMNGETIKKGSWVLVTKASEEVWEQIKKGEITGYSMAGTAETIEKQEK
PVSQEKTDEKGLFNLLKNFFVGKQQQSYEEPVAKAGRKFSASNLQEIKNAHAALGNLLSQVETKEGEEEMTSEEVTKSIQ
EALEPIKKRLETLEKEEELNKKDKEKEEETEKEGEKLKKAISEAVQPLADRIEAIEKSRGTSKQTEESGSEQVQKSIWSG
LF
>P45922 ~~~yqbF~~~Uncharacterized protein YqbF~~~
MFTAKLIKGKTYNVMGITFRAGVSQTVPKKLYEYLNENPYFILTQELNNQKDDPINYTESELKGMNKAEHESIISNLGRN
PSDFKNADERIAYILKQIDNKGE
>P45923 ~~~yqbG~~~Uncharacterized protein YqbG~~~
MLLITPDELKSYSVFESVKTRPDELLKQDILEATADIILKVGHDFSDAEYIPLPETVRLALLKLSQFYALINGDESIIKG
YTTEKIGDYSYTLGDGSSLQKPDVYALIKDYVKPADPDLEGIEAKVRMRSI
>P45930 ~~~yqbN~~~Uncharacterized protein YqbN~~~
MSEKQNEKVYDLSFFMPGQTIEAEEVEVPISKRFVDKEGNVVPFIFKAITTERIDELEKENTTYKNVKGRGRVKELDSQR
FYARIAVETTVYPNFKAKELREAYKTEDPVEVAKRVLSVGGEYANWLNKAIEINGFDDDLEDLEEAAKN
>P45931 ~~~yqbO~~~Uncharacterized protein YqbO~~~COG3953
MAKLTATFELHDKISRKLRMIQGNAERLKRAANGPLIFEAEDRTERVMRQIDRSANRLTARARLLEMGLDDRVSNGLHSI
RQQAEDLTEGSHEVTVSVNDQATPRFRLIRGGLTDLNSSHAEPTVSVRDHASNQLDEIRRHVTDVDSEHAEPTVSIKDRA
SAALDAIEAKIDSLKGATITLAVAGGFSAGSIMGSGKSTMSQDAYVSATSNVNKKDVAKMTDQIYFNNKAGSSREEVSLS
LRNLSQQTGASKKALAELTESSSKIAQLMNADQAEVDRAFSSMYNNLKLSGKQSGDLIAYVYRNAGDQADDLLDTMNEYS
STFKDLKLTGGQIANAMIKGTKGGARNFDNLADSMREFNIRRTEMSDSQVDAFKTLFGAKETKKMFKGFKDGSISGEESL
FRVAKALSKVKDKTKRAAIATELIGTQYEDLKQPILDMAEGIGTSAKTSGELERSFTKLRDNNPMTPVNDAMRDFESISK
DMGTSLLTGLGPAFDKISSFINSKEGQEKLKEIKKDIADLGEEIGDKLNVAIEWSVNHWDDLKTAIKVVIPSLIGLIGYL
KILRPLLKGIGTVGSDAAGVIRKLIPKRTPKAGTNTQSERRNRNSNRNASTRGRESKTATGPTSLPRSGSLTYCCCSDGG
KNDRIRRRRGKRVLGRRGNPNRMNPSDSSIAVSSERLERRRSGRTVGTNPTRDSRSAIITTRSELYSAGRAAGGTSKFGK
VLSPLKSVGKFAKGVPLLGTALAATDLIGMNKDNVGEKIGSAGGGLAGAATGAAIGSVIPGVGTAIGGLVGGIAGTMGGS
SLGKAFDGSEVKKKLNSTLFDQKWWSEKWSGIKSNAKTSINGLSDTWSNVKEKVKSTLFNSEWWSEKWSGVKSWAQNKWN
SASSVWESVKGKIKSTLFSEKWWSGKWEGVKSWAQSKWDSASSVWQSVKGKLKSTLFSEKWWSGKWESVKSWSKNKWDNA
KSIWKSVKSSISETLFSKKWWSEKWQSVKELGSSILGGVKEVGGKVASSAKKTAGKAWGYVKSGVNYLFGSGKEKPKKHA
TGGYITKPTISWIGEAGKEFVIPVENNKGRGKMLLSQAASKLGMSVVDDIASASSAGGEPATSPLVRSAAVTASVSPIID
TSSLDEQATSFGQQFTKSFDQGIRDNVVSMEAWKQKNVGQPMNNLISYSPNYGKQVVNGYAKGQNSTSTGTDGFLQTKVK
MPFQNTVNKSSSWGSGTIKGFASGQNSSQTGTDQYVSTHINKPFIRSKESSNGWGSGMIGNFVSGMTSKASEVNEAAKEL
AKKVEKAFREELDIHSPSRVMMSLGRFASIGIVKGLDSVDVKKFAEKQAGSLAAAYSGMGAVSGNVKQWLMAAIMATKTP
MSWLPGLMTIAQHESGGNPKAINLWDSNAKAGHPSQGLMQTIPSTFNAHKLPGMNNILNPIHNAAAAIGYIKSRYGSINN
VPGIRSMRHGGPYVGYANGGLITKEQIARVGEGNKREWIIPEERGIRGRYLLAQAAKALGMEVTDPSEKGQTELSSGQVT
AATTGRNQTTFKAAGGKEVIIQFNGDQHFHNDQDMNSLVAKIKQALVDELEQDINIGTKGVVAFD
>P65367 ~~~yqcA~~~Flavodoxin YqcA~~~COG0716
MAEIGIFVGTMYGNSLLVAEEAEAILTAQGHKATVFEDPELSDWLPYQDKYVLVVTSTTGQGDLPDSIVPLFQGIKDSLG
FQPNLRYGVIALGDSSYVNFCNGGKQFDALLQEQSAQRVGEMLLIDASENPEPETESNPWVEQWGTLLS
>Q46919 ~~~yqcC~~~Uncharacterized protein YqcC~~~COG3098
MTTHDRVRLQLQALEALLREHQHWRNDEPQPHQFNSTQPFFMDTMEPLEWLQWVLIPRMHDLLDNKQPLPGAFAVAPYYE
MALATDHPQRALILAELEKLDALFADDAS
>P77031 ~~~yqcE~~~Inner membrane protein YqcE~~~COG2223
MQHNSYRRWITLAIISFSGGVSFDLAYLRYIYQIPMAKFMGFSNTEIGLIMSTFGIAAIILYAPSGVIADKFSHRKMITS
AMIITGLLGLLMATYPPLWVMLCIQIAFAITTILMLWSVSIKAASLLGDHSEQGKIMGWMEGLRGVGVMSLAVFTMWVFS
RFAPDDSTSLKTVIIIYSVVYILLGILCWFFVSDNNNLRSANNEEKQSFQLSDILAVLRISTTWYCSMVIFGVFTIYAIL
SYSTNYLTEMYGMSLVAASYMGIVINKIFRALCGPLGGIITTYSKVKSPTRVIQILSVLGLLTLTALLVTNSNPQSVAMG
IGLILLLGFTCYASRGLYWACPGEARTPSYIMGTTVGICSVIGFLPDVFVYPIIGHWQDTLPAAEAYRNMWLMGMAALGM
VIVFTFLLFQKIRTADSAPAMASSK
>P45941 ~~~yqcF~~~Immunity protein YqcF~~~
MGVTQENKVIARTVLGAFGGKPKVTKYWDDNKNSSIDILSVSDQPQEGITSYSTLGLSDHSINYEVNGTPLRIEIVAAME
SASDIYANVLSTCAFNIINSNFTCAPGVIFKNVISMYDQETDMKHIMFVPPFLWEEDLELLEFSNKNVTWLMALPISEGE
LQVAEKHGSDYLQDLLESKQIDIFDIKRESVV
>P45942 ~~~yqcG~~~Toxin YqcG~~~COG5444
MKVFEAKTLLTEAEKRAQEYKDLKSKMVKLKKAFKAVADLDDSEFSGKGANNIKSFYEDQAGIADQWIDLIEMKISFLTS
IPGFLEDANLSDAYIEETFLAHELANAYTKSKSIMSEQKKAMKDILNDINDILPLDLFSTETFKNELSSAEKKRKEAIEK
MDEVDQNLTSEYGLSEANEQMIQADYQALMNATAKGKSASPIHYNAKAYRDSEIHKMTEDVKKQSTDYISFKDQQAEQRR
IAKEQEELANRPWYEKSWDAVCNFTGEVSGYYDYKRAADGVDPVTGEKLTAGQRVAAGAMAAAGYIPIVGWAGKLAKGGK
AVYSTSKALYRADKALDVYKTPKTFHALQNSSKGLYGLASANGFSEAITGRDMFGNKVSKERQEQSLSGAMAMLVPFGAR
GINKKLNAKSSSRVSEASTNTSKKPKVPKTYKRPTYFRKGVRDKVWENAKDSTGSVKDPLTKQVMKKDEPWDMGHKPGYE
FRKHQQSAMERNISRKQFLDEHNNPDHYQPELPSSNRSHKGEDMTDDYFGD
>C1P612 ~~~yqcG~~~Uncharacterized protein YqcG~~~
MSEENKENGFNHVKTFTKIIFIFSVLVFNDNEYKITDAAVNLFIQI
>P63340 ~~~yqeG~~~Inner membrane transport protein YqeG~~~COG0814
MSNIWSKEETLWSFALYGTAVGAGTLFLPIQLGSAGAVVLFITALVAWPLTYWPHKALCQFILSSKTSAGEGITGAVTHY
YGKKIGNLITTLYFIAFFVVVLIYAVAITNSLTEQLAKHMVIDLRIRMLVSLGVVLILNLIFLMGRHATIRVMGFLVFPL
IAYFLFLSIYLVGSWQPDLLTTQVEFNQNTLHQIWISIPVMVFAFSHTPIISTFAIDRREKYGEHAMDKCKKIMKVAYLI
ICISVLFFVFSCLLSIPPSYIEAAKEEGVTILSALSMLPNAPAWLSISGIIVAVVAMSKSFLGTYFGVIEGATEVVKTTL
QQVGVKKSRAFNRALSIMLVSLITFIVCCINPNAISMIYAISGPLIAMILFIMPTLSTYLIPALKPWRSIGNLITLIVGI
LCVSVMFFS
>P54453 ~~~yqeH~~~Uncharacterized protein YqeH~~~COG1161
MEKVVCIGCGVTIQTEDKTGLGYAPPASLTKENVICQRCFRLKNYNEIQDVSLTDDDFLNILHGIGETDSLVVKIVDIFD
FNGSWINGLQRLVGGNPILLVGNKADILPKSLKRERLIQWMKREAKELGLKPVDVFLVSAGRGQGIREVIDAIEHYRNGK
DVYVVGCTNVGKSTFINRIIKEVSGEEDIITTSQFPGTTLDAIEIPLDDGSSLYDTPGIINNHQMAHYVNKKDLKILSPK
KELKPRTFQLNDQQTLYFGGLARFDYVSGERSPFICYMPNELMIHRTKLENADALYEKHAGELLTPPGKDEMDEFPELVA
HTFTIKDKKTDIVFSGLGWVTVHDADKKVTAYAPKGVHVFVRRSLI
>C1P613 ~~~yqeL~~~Uncharacterized protein YqeL~~~
MKDVDQIFDALDCHILREYLILLFYD
>P54459 ~~~yqeN~~~Uncharacterized protein YqeN~~~COG1466
MVFDVWKSLKKGEVHPVYCLYGKETYLLQETVSRIRQTVVDQETKDFNLSVFDLEEDPLDQAIADAETFPFMGERRLVIV
KNPYFLTGEKKKEKIEHNVSALESYIQSPAPYTVFVLLAPYEKLDERKKLTKALKKHAFMMEAKELNAKETTDFTVNLAK
TEQKTIGTEAAEHLVLLVNGHLSSIFQEIQKLCTFIGDREEITLDDVKMLVARSLEQNIFELINKIVNRKRTESLQIFYD
LLKQNEEPIKIMALISNQFRLILQTKYFAEQGYGQKQIASNLKVHPFRVKLAMDQARLFSEEELRLIIEQLAVMDYEMKT
GKKDKQLLLELFLLQLLKRNEKNDPHY
>P54464 ~~~yqeY~~~Uncharacterized protein YqeY~~~COG1610
MSLLERLNQDMKLYMKNREKDKLTVVRMVKASLQNEAIKLKKDSLTEDEELTVLSRELKQRKDSLQEFSNANRLDLVDKV
QKELDILEVYLPEQLSEEELRTIVNETIAEVGASSKADMGKVMGAIMPKVKGKADGSLINKLVSSQLS
>P67153 ~~~yqfA~~~UPF0073 inner membrane protein YqfA~~~COG1272
MVQKPLIKQGYSLAEEIANSVSHGIGLVFGIVGLVLLLVQAVDLNASATAITSYSLYGGSMILLFLASTLYHAIPHQRAK
MWLKKFDHCAIYLLIAGTYTPFLLVGLDSPLARGLMIVIWSLALLGILFKLTIAHRFKILSLVTYLAMGWLSLVVIYEMA
VKLAAGSVTLLAVGGVVYSLGVIFYVCKRIPYNHAIWHGFVLGGSVCHFLAIYLYIGQA
>C1P614 ~~~yqfG~~~Uncharacterized protein YqfG~~~
MNFLMRAIFSLLLLFTLSIPVISDCVAMAIESRFKYMMLLF
>P0DPP4 ~~~yqfH~~~Protein YqfH~~~
MINQVSVYRQPPVLSGCRQVKTI
>P0DPP5 ~~~yqfI~~~Protein YqfI~~~
MSKNTKSKNNGIRKYNAKTEVKLVYFK
>P54484 ~~~yqgA~~~Cell wall-binding protein YqgA~~~
MKQGKFSVFLILLLMLTLVVAPKEKAEAASSGWQPVSGISGCKIRVITDAYTYTKSATSIDAYAETNGKCGKLNYKSFGV
SIVEGGDIGPQYSGYFSSRTPTKKFYFSKLPKPTGTPWAVGLSVYKGKAKGAAFVYINPQKR
>P64567 ~~~yqgB~~~Uncharacterized protein YqgB~~~
MKKKPVAQLERQHSLLENPCAYGLLSQFQAAIVVNCFTLNKII
>P64570 ~~~yqgC~~~Protein YqgC~~~
MGITSAGMQSRDAECGERIFTRTVRQVKQQTTVHYFVSPPRPPVKTNPQAKTLISTRLEVATRKKRRVLFI
>P0A8W5 ~~~yqgE~~~UPF0301 protein YqgE~~~COG1678
MNLQHHFLIAMPALQDPIFRRSVVYICEHNTNGAMGIIVNKPLENLKIEGILEKLKITPEPRDESIRLDKPVMLGGPLAE
DRGFILHTPPSNFASSIRISDNTVMTTSRDVLETLGTDKQPSDVLVALGYASWEKGQLEQEILDNAWLTAPADLNILFKT
PIADRWREAAKLIGVDILTMPGVAGHA
>O34634 3.1.-.-~~~yrrK~~~Putative pre-16S rRNA nuclease~~~COG0816
MRILGLDLGTKTLGVALSDEMGWTAQGIETIKINEAEGDYGLSRLSELIKDYTIDKIVLGFPKNMNGTVGPRGEASQTFA
KVLETTYNVPVVLWDERLTTMAAEKMLIAADVSRQKRKKVIDKMAAVMILQGYLDSLN
>Q9RRI2 3.1.-.-~~~yqgF~~~Ribonuclease YqgF~~~COG0816
MLARMSGPDPAPAALPTVLALDVSKSRIGFAVSAGRLAFGRGSVDRKRLPLDLKAVRLKVEETGAERLVLGLPLRTDGKP
SPTADRVRAFGRVLMDKGYTVEYQDERFTTQRARALGAADEDEAAAVQILELWLMR
>P0A8I1 3.1.-.-~~~yqgF~~~Putative pre-16S rRNA nuclease~~~COG0816
MSGTLLAFDFGTKSIGVAVGQRITGTARPLPAIKAQDGTPDWNIIERLLKEWQPDEIIVGLPLNMDGTEQPLTARARKFA
NRIHGRFGVEVKLHDERLSTVEARSGLFEQGGYRALNKGKVDSASAVIILESYFEQGY
>P9WGV7 3.1.-.-~~~~~~Putative pre-16S rRNA nuclease~~~COG0816
MVPAQHRPPDRPGDPAHDPGRGRRLGIDVGAARIGVACSDPDAILATPVETVRRDRSGKHLRRLAALAAELEAVEVIVGL
PRTLADRIGRSAQDAIELAEALARRVSPTPVRLADERLTTVSAQRSLRQAGVRASEQRAVIDQAAAVAILQSWLDERLAA
MAGTQEGSDA
>Q5SHA1 3.1.-.-~~~~~~Putative pre-16S rRNA nuclease~~~COG0816
MRVGALDVGEARIGLAVGEEGVPLASGRGYLVRKTLEEDVEALLDFVRREGLGKLVVGLPLRTDLKESAQAGKVLPLVEA
LRARGVEVELWDERFTTKLAQERLKHAPKRLRRDKGKLDELAAVVLLEDYLARGI
>P0DSG2 ~~~yqgG~~~Protein YqgG~~~
MNRCLLLNLSHRSGEDSFPALCISALHTCRCYTHLGASQDSRAGYSY
>P0DSG3 ~~~yqgH~~~Protein YqgH~~~
MHAISLHSLAYRRFARNVSLSMLLLMR
>P54491 ~~~yqgN~~~Uncharacterized protein YqgN~~~COG0212
MKSQLRKKTLEALSALSNEDILQKTERMYKYLFSLPEWQNAGTIAVTISRGLEIPTRPVIEQAWEEGKQVCIPKCHPDTK
KMQFRTYQTDDQLETVYAGLLEPVIEKTKEVNPSQIDLMIVPGVCFDVNGFRVGFGGGYYDRYLSEYEGKTVSLLLECQL
FAHVPRLPHDIPVHKLITEDRIISCFS
>P54494 ~~~yqgQ~~~Uncharacterized protein YqgQ~~~COG4483
MNTFYDVQQLLKTFGHIVYFGDRELEIEFMLDELKELYMNHMIEKEQWARAAAVLRKELEQTKNGRDFYKG
>Q46856 1.1.1.2~~~yqhD~~~Alcohol dehydrogenase YqhD~~~COG1979
MNNFNLHTPTRILFGKGAIAGLREQIPHDARVLITYGGGSVKKTGVLDQVLDALKGMDVLEFGGIEPNPAYETLMNAVKL
VREQKVTFLLAVGGGSVLDGTKFIAAAANYPENIDPWHILQTGGKEIKSAIPMGCVLTLPATGSESNAGAVISRKTTGDK
QAFHSAHVQPVFAVLDPVYTYTLPPRQVANGVVDAFVHTVEQYVTKPVDAKIQDRFAEGILLTLIEDGPKALKEPENYDV
RANVMWAATQALNGLIGAGVPQDWATHMLGHELTAMHGLDHAQTLAIVLPALWNEKRDTKRAKLLQYAERVWNITEGSDD
ERIDAAIAATRNFFEQLGVPTHLSDYGLDGSSIPALLKKLEEHGMTQLGENHDITLDVSRRIYEAAR
>P0DPP6 ~~~yqhI~~~Protein YqhI~~~
MPRLTAKDFPQELLDYYDYYAHGKISKREFLNLAAKCGRRDDGISVV
>P0DPP7 ~~~yqiD~~~Protein YqiD~~~
MFIAWYWIVLIALVVVGYFLHLKRYCRAFRQDRDALLEARNKYLNSTREETAEKVE
>P76657 ~~~yqiJ~~~Inner membrane protein YqiJ~~~COG1585
MILFADYNTPYLFAISFVLLIGLLEIFALICGHMLSGALDAHLDHYDSITTGHISQALHYLNIGRLPALVVLCLLAGFFG
LIGILLQHACIMVWQSPLSNLFVVPVSLLFTIIAVHYTGKIVAPWIPRDHSSAITEEEYIGSMALITGHQATSGNPCEGK
LTDQFGQIHYLLLEPEEGKIFTKGDKVLIICRLSATRYLAENNPWPQIL
>P77306 ~~~yqiK~~~Flotillin family inner membrane protein YqiK~~~COG2268
MDDIVNSVPSWMFTAIIAVCILFIIGIIFARLYRRASAEQAFVRTGLGGQKVVMSGGAIVMPIFHEIIPINMNTLKLEVS
RSTIDSLITKDRMRVDVVVAFFVRVKPSVEGIATAAQTLGQRTLSPEDLRMLVEDKFVDALRATAAQMTMHELQDTRENF
VQGVQNTVAEDLSKNGLELESVSLTNFNQTSKEHFNPNNAFDAEGLTKLTQETERRRRERNEVEQDVEVAVREKNRDALS
RKLEIEQQEAFMTLEQEQQVKTRTAEQNARIAAFEAERRREAEQTRILAERQIQETEIDREQAVRSRKVEAEREVRIKEI
EQQQVTEIANQTKSIAIAAKSEQQSQAEARANLALAEAVSAQQNVETTRQTAEADRAKQVALIAAAQDAETKAVELTVRA
KAEKEAAEMQAAAIVELAEATRKKGLAEAEAQRALNDAINVLSDEQTSLKFKLALLQALPAVIEKSVEPMKSIDGIKIIQ
VDGLNRGGAAGDANTGNVGGGNLAEQALSAALSYRTQAPLIDSLLNEIGVSGGSLAALTSPLTSTTPVEEKAE
>P0DSG5 ~~~yqiM~~~Protein YqiM~~~
MLMYQTRRTYQNSNNIAVVHLLKPAWR
>P0AA63 ~~~yqjA~~~Inner membrane protein YqjA~~~COG0586
MELLTQLLQALWAQDFETLANPSMIGMLYFVLFVILFLENGLLPAAFLPGDSLLVLVGVLIAKGAMGYPQTILLLTVAAS
LGCWVSYIQGRWLGNTRTVQNWLSHLPAHYHQRAHHLFHKHGLSALLIGRFIAFVRTLLPTIAGLSGLNNARFQFFNWMS
GLLWVLILTTLGYMLGKTPVFLKYEDQLMSCLMLLPVVLLVFGLAGSLVVLWKKKYGNRG
>P42616 ~~~yqjC~~~Protein YqjC~~~COG1422
MKYRIALAVSLFALSAGSYATTLCQEKEQNILKEISYAEKHQNQNRIDGLNKALSEVRANCSDSQLRADHQKKIAKQKDE
VAERQQDLAEAKQKGDADKIAKRERKLAEAQEELKKLEARDY
>P64581 ~~~yqjD~~~Uncharacterized protein YqjD~~~COG4575
MSKEHTTEHLRAELKSLSDTLEEVLSSSGEKSKEELSKIRSKAEQALKQSRYRLGETGDAIAKQTRVAAARADEYVRENP
WTGVGIGAAIGVVLGVLLSRR
>P64585 ~~~yqjE~~~Inner membrane protein YqjE~~~COG5393
MADTHHAQGPGKSVLGIGQRIVSIMVEMVETRLRLAVVELEEEKANLFQLLLMLGLTMLFAAFGLMSLMVLIIWAVDPQY
RLNAMIATTVVLLLLALIGGIWTLRKSRKSTLLRHTRHELANDRQLLEEESREQ
>P42619 ~~~yqjF~~~Inner membrane protein YqjF~~~COG2259
MKKLEDVGVLVARILMPILFITAGWGKITGYAGTQQYMEAMGVPGFMLPLVILLEFGGGLAILFGFLTRTTALFTAGFTL
LTAFLFHSNFAEGVNSLMFMKNLTISGGFLLLAITGPGAYSIDRLLNKKW
>P42620 1.8.5.7~~~yqjG~~~Glutathionyl-hydroquinone reductase YqjG~~~COG0435
MGQLIDGVWHDTWYDTKSTGGKFQRSASAFRNWLTADGAPGPTGTGGFIAEKDRYHLYVSLACPWAHRTLIMRKLKGLEP
FISVSVVNPLMLENGWTFDDSFPGATGDTLYQNEFLYQLYLHADPHYSGRVTVPVLWDKKNHTIVSNESAEIIRMFNTAF
DALGAKAGDYYPPALQTKIDELNGWIYDTVNNGVYKAGFATSQEAYDEAVAKVFESLARLEQILGQHRYLTGNQLTEADI
RLWTTLVRFDPVYVTHFKCDKHRISDYLNLYGFLRDIYQMPGIAETVNFDHIRNHYFRSHKTINPTGIISIGPWQDLDEP
HGRDVRFG
>Q46871 1.16.1.9~~~yqjH~~~NADPH-dependent ferric-chelate reductase~~~COG2375
MNNTPRYPQRVRNDLRFRELTVLRVERISAGFQRIVLGGEALDGFTSRGFDDHSKLFFPQPDAHFVPPTVTEEGIVWPEG
PRPPSRDYTPLYDELRHELAIDFFIHDGGVASGWAMQAQPGDKLTVAGPRGSLVVPEDYAYQLYVCDESGMPALRRRLET
LSKLAVKPQVSALVSVRDNACQDYLAHLDGFNIEWLAHDEQAVDARLAQMQIPADDYFIWITGEGKVVKNLSRRFEAEQY
DPQRVRAAAYWHAK
>P64588 ~~~yqjI~~~Transcriptional regulator YqjI~~~COG1695
MSHHHEGCCKHEGQPRHEGCCKGEKSEHEHCGHGHQHEHGQCCGGRHGRGGGRRQRFFGHGELRLVILDILSRDDSHGYE
LIKAIENLTQGNYTPSPGVIYPTLDFLQEQSLITIREEEGGKKQIALTEQGAQWLEENREQVEMIEERIKARCVGAALRQ
NPQMKRALDNFKAVLDLRVNQSDISDAQIKKIIAVIDRAAFDITQLD
>P54562 ~~~yqjY~~~Uncharacterized protein YqjY~~~COG0456
MDIRTITSSDYEMVTSVLNEWWGGRQLKEKLPRLFFEHFQDTSFITSEHNSMTGFLIGFQSQSDPETAYIHFSGVHPDFR
KMQIGKQLYDVFIETVKQRGCTRVKCVTSPVNKVSIAYHTKLGFDIEKGTKTVNGISVFANYDGPGQDRVLFVKNI
>P54563 ~~~yqjZ~~~Uncharacterized protein YqjZ~~~COG2329
MMDFLSKTPEPPYYAVIFSSVKSENDTGYGETAERMVSLAADQPGFLGVESVREADGRGITVSYWDSMDAINHWRHHTEH
QAAKEKGRSVWYESYAVRVAKVDRQRLFQENTND
>P54573 ~~~yqkK~~~Uncharacterized protein YqkK~~~
MAKSQAKKKRGHRLRNGGRDVLLSRGSTPSFSTHGRMTKSKKEILNKRKHKNPYDHTAVDDKDFFVPQKAA
>O32019 ~~~yqzG~~~Uncharacterized protein YqzG~~~
MMIKQCVICLSLLVFGTTAAHAEETPLVTARHMSKWEEIAVKEAKKRYPLAQVLFKQKVWDRKRKDEAVKQYHLTLREGS
KEFGVFVTISFDPYSQKVNKIAILEEYQ
>O06006 3.2.-.-~~~yraA~~~Putative cysteine protease YraA~~~COG0693
MSKKIAVLVTDQFEDIEYTSPVKAYEEAGYSVVAIDLEAGKEVTGKHGEKVKIDKAISDVDASDFDALLIPGGFSPDLLR
ADDRPGEFAKAFVENKKPVFAICHGPQVLIDTDLLKGKDITGYRSIRKDLINAGANYKDAEVVVSHNIVTSRTPDDLEAF
NRESLNLLK
>P42913 ~~~yraH~~~Uncharacterized fimbrial-like protein YraH~~~COG3539
MNKVTKTAIAGLLALFAGNAAATDGEIVFDGEILKSACEINDSDKKIEVALGHYNAEQFRNIGERSPKIPFTIPLVNCPM
TGWEHDNGNVEASFRLWLETRDNGTVPNFPNLAKVGSFAGIAATGVGIRIDDAESGNIMPLNAMGNDNTVYQIPAESNGI
VNVDLIAYYVSTVVPSEITPGEADAIVNVTLDYR
>P42914 ~~~yraI~~~Probable fimbrial chaperone YraI~~~COG3121
MSKRTFAVILTLLCSFCIGQALAGGIVLQRTRVIYDASRKEAALPVANKGAETPYLLQSWVDNIDGKSRAPFIITPPLFR
LEAGDDSSLRIIKTADNLPENKESLFYINVRAIPAKKKSDDVNANELTLVFKTRIKMFYRPAHLKGRVNDAWKSLEFKRS
DHSLNIYNPTEYYVVFAGLAVDKTDLTSKIEYIAPGEHKQLPLPASGGKNVKWAAINDYGGSSGTETRPLQ
>P42915 ~~~yraJ~~~Outer membrane usher protein YraJ~~~COG3188
MPQRHHQGHKRTPKQLALIIKRCLPMVLTGSGMLCTTANAEEYYFDPIMLETTKSGMQTTDLSRFSKKYAQLPGTYQVDI
WLNKKKVSQKKITFTANAEQLLQPQFTVEQLRELGIKVDEIPALAEKDDDSVINSLEQIIPGTAAEFDFNHQQLNLSIPQ
IALYRDARGYVSPSRWDDGIPTLFTNYSFTGSDNRYRQGNRSQRQYLNMQNGANFGPWRLRNYSTWTRNDQTSSWNTISS
YLQRDIKALKSQLLLGESATSGSIFSSYTFTGVQLASDDNMLPNSQRGFAPTVRGIANSSAIVTIRQNGYVIYQSNVSAG
AFEINDLYPSSNSGDLEVTIEESDGTQRRFIQPYSSLPMMQRPGHLKYSATAGRYRADANSDSKEPEFAEATAIYGLNNT
FTLYGGLLGSEDYYALGIGIGGTLGALGALSMDINRADTQFDNQHSFHGYQWRTQYIKDIPETNTNIAVSYYRYTNDGYF
SFNEANTRNWDYNSRQKSEIQFNISQTIFDGVSLYASGSQQDYWGNNDKNRNISVGVSGQQWGVGYSLNYQYSRYTDQNN
DRALSLNLSIPLERWLPRSRVSYQMTSQKDRPTQHEMRLDGSLLDDGRLSYSLEQSLDDDNNHNSSLNASYRSPYGTFSA
GYSYGNDSSQYNYGVTGGVVIHPHGVTLSQYLGNAFALIDANGASGVRIQNYPGIATDPFGYAVVPYLTTYQENRLSVDT
TQLPDNVDLEQTTQFVVPNRGAMVAARFNANIGYRVLVTVSDRNGKPLPFGALASNDDTGQQSIVDEGGILYLSGISSKS
QSWTVRWGNQADQQCQFAFSTPDSEPTTSVLQGTAQCH
>P43319 ~~~yraK~~~Uncharacterized fimbrial-like protein YraK~~~COG3539
MKRAPLITGLLLISTSCAYASSGGCGADSTSGATNYSSVVDDVTVNQTDNVTGREFTSATLSSTNWQYACSCSAGKAVKL
VYMVSPVLTTTGHQTGYYKLNDSLDIKTTLQANDIPGLTTDQVVSVNTRFTQIKNNTVYSAATQTGVCQGDTSRYGPVNI
GANTTFTLYVTKPFLGSMTIPKTDIAVIKGAWVDGMGSPSTGDFHDLVKLSIQGNLTAPQSCKINQGDVIKVNFGFINGQ
KFTTRNAMPDGFTPVDFDITYDCGDTSKIKNSLQMRIDGTTGVVDQYNLVARRRSSDNVPDVGIRIENLGGGVANIPFQN
GILPVDPSGHGTVNMRAWPVNLVGGELETGKFQGTATITVIVR
>P45394 ~~~yrbG~~~Inner membrane protein YrbG~~~COG0530
MLLATALLIVGLLLVVYSADRLVFAASILCRTFGIPPLIIGMTVVSIGTSLPEVIVSLAASLHEQRDLAVGTALGSNIIN
ILLILGLAALVRPFTVHSDVLRRELPLMLLVSVVAGSVLYDGQLSRSDGIFLLFLAVLWLLFIVKLARQAERQGTDSLTR
EQLAELPRDGGLPVAFLWLGIALIIMPVATRMVVDNATVLANYFAISELTMGLTAIAIGTSLPELATAIAGVRKGENDIA
VGNIIGANIFNIVIVLGLPALITPGEIDPLAYSRDYSVMLLVSIIFALLCWRRSPQPGRGVGVLLTGGFIVWLAMLYWLS
PILVE
>P64610 ~~~yrbL~~~Uncharacterized protein YrbL~~~COG0515
MIRLSEQSPLGTGRHRKCYAHPEDAQRCIKIVYHRGDGGDKEIRRELKYYAHLGRRLKDWSGIPRYHGTVETDCGTGYVY
DVIADFDGKPSITLTEFAEQCRYEEDIAQLRQLLKQLKRYLQDNRIVTMSLKPQNILCHRISESEVIPVVCDNIGESTLI
PLATWSKWCCLRKQERLWKRFIAQPALAIALQKDLQPRESKTLALTSREA
>C1P618 ~~~yrbN~~~Uncharacterized protein YrbN~~~
MKIADQFHDELCRLAAINFEAHVLHG
>P0A9W9 ~~~yrdA~~~Protein YrdA~~~COG0663
MSDVLRPYRDLFPQIGQRVMIDDSSVVIGDVRLADDVGIWPLVVIRGDVHYVQIGARTNIQDGSMLHVTHKSSYNPDGNP
LTIGEDVTVGHKVMLHGCTIGNRVLVGMGSILLDGAIVEDDVMIGAGSLVPQNKRLESGYLYLGSPVKQIRPLSDEEKAG
LRYSANNYVKWKDEYLDQGNQTQP
>P64636 3.1.3.5~~~yrfG~~~GMP/IMP nucleotidase YrfG~~~COG1011
MHINIAWQDVDTVLLDMDGTLLDLAFDNYFWQKLVPETWGAKNGVTPQEAMEYMRQQYHDVQHTLNWYCLDYWSEQLGLD
ICAMTTEMGPRAVLREDTIPFLEALKASGKQRILLTNAHPHNLAVKLEHTGLDAHLDLLLSTHTFGYPKEDQRLWHAVAE
ATGLKAERTLFIDDSEAILDAAAQFGIRYCLGVTNPDSGIAEKQYQRHPSLNDYRRLIPSLM
>O05400 2.1.1.-~~~yrhH~~~Putative methyltransferase YrhH~~~COG2226
MLENRLYKRSDFWKLFSRKYKLTKTIEHMMIDSIDIQENDRILEIGIGNGTVFKSITKKLKKGSLKSIDPSKRKVRQISR
ANRKNMGNGEVFHGYPEDIPFDDRTFNKVFSLHTVQSCTDIRLALREIYRVLQIDGRFYISIDTNTGEKEKTYIQLLKDQ
HFRDLSVIRRASCLCIVAVK
>P0DSG9 ~~~yriA~~~Protein YriA~~~
MLYWLGILIAYADRRPDKVFTLIR
>O34452 ~~~yrrB~~~TPR repeat-containing protein YrrB~~~COG0457
MQEGDYEKAAEAFTKAIEENKEDAIPYINFANLLSSVNELERALAFYDKALELDSSAATAYYGAGNVYVVKEMYKEAKDM
FEKALRAGMENGDLFYMLGTVLVKLEQPKLALPYLQRAVELNENDTEARFQFGMCLANEGMLDEALSQFAAVTEQDPGHA
DAFYNAGVTYAYKENREKALEMLDKAIDIQPDHMLALHAKKLIDPS
>O32029 2.1.1.-~~~yrrT~~~Uncharacterized methyltransferase YrrT~~~COG2226
MGREFIPLFEDWAATYDQTVQGLDIQYKEAFRGYDHILDAIVRKSGTHVLEFGPGTGNLTAKLLDAGKTVFGIEPSPAMR
KLASDKLSGRTEIVDGDFLTFPEPPFQADTIVSSYAFHHLTDEEKRAAIKQYGKYLHLHDKIVFADTVFENAQAYQQAID
KARSQGFYQLANDLETEHYPTLDALKEMFTAEGFAVRFTQQNDFVWIMEAIKR
>P0DSH1 ~~~ysaE~~~Protein YsaE~~~
MRNAVSKAGIISRRRLLLFQFAG
>P0C2M7 ~~~yscA~~~Yop proteins translocation protein A~~~
MSQITTKHITVLFRRWMAIICCLIIKIAYLAY
>P0C2M8 ~~~yscB~~~Chaperone protein YscB~~~
MQNLLKNLATSLGRKPFVADKQGVYRLTIDKHLVMLTPHGSELVLRTPIDAPMLREGNNVNVTLLRSLMQQALAWAKRYP
QTLVLDDCGQLVLEARLRLQELDTHGLQEVINKQLALLEHLIPQLTPFSVASRVGWN
>Q56973 ~~~yscB~~~Chaperone protein YscB~~~
MQNLLKNLAASLGRKPFVADKQGVYRLTIDKHLVMLAPHGSELVLRTPIDAPMLREGNNVNVTLLRSLMQQALAWAKRYP
QTLVLDDCGQLVLEARLRLQELDTHGLQEVINKQLALLEHLIPQLTPFSVASRVGWN
>Q01245 ~~~yscD~~~Yop proteins translocation protein D~~~
MSWVCRFYQGKHRGVEVELPHGRCVFGSDPLQSDIVLSDSEIAPVHLVLMVDEEGIRLTDSAEPLLQEGLPVPLGTLLRA
GTCLEVGFLLWTFVAVGQPLPETLQVPTQRKEPTDRLPRSRLGVGLGVLSLLLLLTFLGMLGHGLWREYNQDGQLVEQEV
RRLLATAAYKDVVLTSPKEGEPWLLTGYIQDNHARLSLQNFLESHGIPFRLELRSMEELRQGAEFILQRLGYHGIEVSLA
PQAGWLQLNGEVSEEIQKQKIDSLLQAEVPGLLGVENKVRIAGNQRKRLDALLEQFGLDSDFTVNVKGELIELRGQVNDE
KLSSFNQLQQTFRQEFGNRPKLELVNVGGQPQHDELNFEVQAISLGKVPYVVLDNHQRYPEGAILNNGVRILAIRRDAVI
VSKGKREFVIQLNGGKPR
>Q01246 ~~~yscE~~~Type 3 secretion system chaperone YscE~~~
MTQLEEQLHNVETVRSITMQLEMALAKLKKDMMRGGDAKQYQVWQSESKAIESAIAIIHYVAGGLK
>O68692 ~~~yscE~~~Type 3 secretion system chaperone YscE~~~
MTQLEEQLHNVETVRSITMQLEMALTKLKKDMMRGGDAKQYQVWQRESKALESAIAIIHYVAGDLK
>Q01248 ~~~yscG~~~Type 3 secretion system chaperone YscG~~~
MKYKLNVLLAEIALIGTGNHCHEEANCIAEWLHLKGEEEAVQLIQLSSLMNRGDYASALQQGNKSTYPDLEPWLALCEYR
LGLGNALESRLNRLATSQDPRIQTFVNGMKEQLKT
>O68690 ~~~yscG~~~Type 3 secretion system chaperone YscG~~~
MKYKLNVLLAEIALIGTGNHYHEEANCIAEWLHLKGEEEAVQLIRLSSLMNRGDYASALQQGNKLAYPDLEPWLALCEYR
LGLGSALESRLNRLARSQDPRIQTFVNGMREQLKT
>Q01250 ~~~yscI~~~Yop proteins translocation protein I~~~
MPNIEIAQADEVIITTLEELGPVEPTTEQIMRFDAAMSEDTQGLGHSLLKEVSDIQKTFKTAKSDLHTKLAVSVDNPNDL
MLMQWSLIRITIQEELIAKTAGRMSQNVETLSKGG
>P69971 ~~~yscI~~~Yop proteins translocation protein I~~~
MPNIEIAQADEVIITTLEELGPAEPTTDQIMRFDAAMSEDTQGLGHSLLKEVSDIQKSFKTVKSDLHTKLAVSVDNPNDL
MLMQWSLIRITIQEELIAKTAGRMSQNVETLSKGG
>Q01251 ~~~yscJ~~~Yop proteins translocation lipoprotein J~~~
MKVKTSLSTLILILFLTGCKVDLYTGISQKEGNEMLALLRQEGLSADKEPDKDGKIKLLVEESDVAQAIDILKRKGYPHE
SFSTLQDVFPKDGLISSPIEELARLNYAKAQEISRTLSEIDGVLVARVHVVLPEEQNNKGKKGVAASASVFIKHAADIQF
DTYIPQIKQLVNNSIEGLAYDRISVILVPSVDVRQSSHLPRNTSILSIQVSEESKGRLIGLLSLLILLLPVTNLAQYFWL
QRKK
>P69972 ~~~yscJ~~~Yop proteins translocation lipoprotein J~~~COG4669
MKVKTSLSTLILILFLTGCKVDLYTGISQKEGNEMLALLRQEGLSADKEPDKDGKIKLLVEESDVAQAIDILKRKGYPHE
SFSTLQDVFPKDGLISSPIEELARLNYAKAQEISRTLSEIDGVLVARVHVVLPEEQNNKGKKGVAASASVFIKHAADIQF
DTYIPQIKQLVNNSIEGLAYDRISVILVPSVDVRQSSHLPRNTSILSIQVSEESKGHLIGLLSLLILLLPVTNLAQYFWL
QRKK
>P69973 ~~~yscJ~~~Yop proteins translocation lipoprotein J~~~
MKVKTSLSTLILILFLTGCKVDLYTGISQKEGNEMLALLRQEGLSADKEPDKDGKIKLLVEESDVAQAIDILKRKGYPHE
SFSTLQDVFPKDGLISSPIEELARLNYAKAQEISRTLSEIDGVLVARVHVVLPEEQNNKGKKGVAASASVFIKHAADIQF
DTYIPQIKQLVNNSIEGLAYDRISVILVPSVDVRQSSHLPRNTSILSIQVSEESKGHLIGLLSLLILLLPVTNLAQYFWL
QRKK
>Q01252 ~~~yscK~~~Yop proteins translocation protein K~~~
MMENYITSFQLRFCPAAYLHLEQLPSLWRSILPYLPQWRDSAHLNAALLDEFSLDTDYEEPHGLGALPLQPQSQLELLLC
RLGLVLHGEAIRRCVLASPLQQLLTLVNQETLRQIIVQHELLIGPWPTNWQRPLPTEIESRTMIQSGLAFWLAAMEPQPQ
AWCKRLSLRLPLATPSEPWLVAESQRPLAQTLCHKLVKQVMPTCSHLFK
>P69974 ~~~yscK~~~Yop proteins translocation protein K~~~
MMENYITSFQLRFCPAAYLHLEQLPSLWRSILPYLPQWRDSAHLNAALLDEFSLDTDYEEPHGLGALPLQPQSQLELLLC
RLGLVLHGEAIRRCVLASPLQQLLTLVNQETLRQIIVQHELLIGPWPTHWQRPLPTEIESRTMIQSGLAFWLAAMEPQPQ
AWCKRLSLRLPLATPSEPWLVAESQRPLAQTLCHKLVKQVTPTCSHLFK
>P69975 ~~~yscK~~~Yop proteins translocation protein K~~~
MMENYITSFQLRFCPAAYLHLEQLPSLWRSILPYLPQWRDSAHLNAALLDEFSLDTDYEEPHGLGALPLQPQSQLELLLC
RLGLVLHGEAIRRCVLASPLQQLLTLVNQETLRQIIVQHELLIGPWPTHWQRPLPTEIESRTMIQSGLAFWLAAMEPQPQ
AWCKRLSLRLPLATPSEPWLVAESQRPLAQTLCHKLVKQVTPTCSHLFK
>Q01254 ~~~yscM~~~Yop proteins translocation protein M~~~
MKINTLQSLINQQITQVGHGGQAGRLTETNPLTENSHQISTAEKAFASEVLEHVKNTALSRHDIACLLPRVSNLELKQGK
AGEVIVTGLRTEQLSLSDAKLLLEAAMRQDTAADG
>P69979 ~~~yscM~~~Yop proteins translocation protein M~~~
MKINTLQSLINQQITQVGHGGQAGRLTETNPLTENSHQISTAEKAFANEVLEHVKNTALSRHDIACLLPRVSNLELKQGK
AGEVIVTGLRTEQLSLSDAKLLLEAAMRQDTAADG
>P40294 ~~~yscO~~~Yop proteins translocation protein O~~~
MIRRLHRVKVLRVERAEKAIKTQQACLQAAHRRHQEAVQTSQDYHLWRIDEEQRLFDQRKNTTLNCKDLEKWQRQIASLR
EKEANYELECAKLLERLANERERLTLCQKMLQQARHKENKFLELVRREDEDELNQQHYQEEQEQEEFLQHHRNA
>P40296 ~~~yscQ~~~Yop proteins translocation protein Q~~~
MSLLTLPQAKLSELSLRQRLSHYQQNYLWEEGKLELTVSEPPSSLNCILQLQWKGTHFTLYCFGNDLANWLTADLLGAPF
FTLPKELQLALLERQTVFLPKLVCNDIATASLSVTQPLLSLRLSRDNAHISFWLTSAEALFALLPARPNSERIPLPILIS
LRWHKVYLTLDEVDSLRLGDVLLAPEGSGPNSPVLAYVGENPWGYFQLQSNKLEFIGMSHESDELNPEPLTDLNQLPVQV
SFEVGRQILDWHTLTSLEPGSLIDLTTPVDGEVRLLANGRLLGHGRLVEIQGRLGVRIERLTEVTIS
>P69986 ~~~yscU~~~Yop proteins translocation protein U~~~COG4792
MSGEKTEQPTPKKIRDARKKGQVAKSKEVVSTALIVALSAMLMGLSDYYFEHFSKLMLIPAEQSYLPFSQALSYVVDNVL
LEFFYLCFPLLTVAALMAIASHVVQYGFLISGEAIKPDIKKINPIEGAKRIFSIKSLVEFLKSILKVVLLSILIWIIIKG
NLVTLLQLPTCGIECITPLLGQILRQLMVICTVGFVVISIADYAFEYYQYIKELKMSKDEIKREYKEMEGSPEIKSKRRQ
FHQEIQSRNMRENVKRSSVVVANPTHIAIGILYKRGETPLPLVTFKYTDAQVQTVRKIAEEEGVPILQRIPLARALYWDA
LVDHYIPAEQIEATAEVLRWLERQNIEKQHSEML
>P69987 ~~~yscU~~~Yop proteins translocation protein U~~~
MSGEKTEQPTPKKIRDARKKGQVAKSKEVVSTALIVALSAMLMGLSDYYFEHFSKLMLIPAEQSYLPFSQALSYVVDNVL
LEFFYLCFPLLTVAALMAIASHVVQYGFLISGEAIKPDIKKINPIEGAKRIFSIKSLVEFLKSILKVVLLSILIWIIIKG
NLVTLLQLPTCGIECITPLLGQILRQLMVICTVGFVVISIADYAFEYYQYIKELKMSKDEIKREYKEMEGSPEIKSKRRQ
FHQEIQSRNMRENVKRSSVVVANPTHIAIGILYKRGETPLPLVTFKYTDAQVQTVRKIAEEEGVPILQRIPLARALYWDA
LVDHYIPAEQIEATAEVLRWLERQNIEKQHSEML
>P0C2N4 ~~~yscX~~~Yop proteins translocation protein X~~~
MSRIITAPHIGIEKLSAISLEELSCGLPDRYALPPDGHPVEPHLERLYPTAQSKRSLWDFASPGYTFHGLHRAQDYRREL
DTLQSLLTTSQSSELQAAAALLKCQQDDDRLLQIILNLLHKV
>P61416 ~~~yscX~~~Yop proteins translocation protein X~~~
MSRIITAPHIGIEKLSAISLEELSCGLPERYALPPDGHPVEPHLERLYPTAQSKRSLWDFASPGYTFHGLHRAQDYRREL
DTLQSLLTTSQSSELQAAAALLKCQQDDDRLLQIILNLLHKV
>P0C2N2 ~~~yscY~~~Chaperone protein YscY~~~
MNITLTKRQQEFLLLNGWLQLQCGHAERACILLDALLTLNPEHLAGRRCRLVALLNNNQGERAEKEAQWLISHDPLQAGN
WLCLSRAQQLNGDLDKARHAYQHYLELKDHNESP
>P61417 ~~~yscY~~~Chaperone protein YscY~~~COG0457
MNITLTKRQQEFLLLNGWLQLQCGHAERACILLDALLTLNPEHLAGRRCRLVALLNNNQGERAEKEAQWLISHDPLQAGN
WLCLSRAQQLNGDLDKARHAYQHYLELKDHNESP
>P94520 ~~~ysdB~~~Sigma-w pathway protein YsdB~~~
MFVMVLRIILLALFAYCIYAVVKYVANPKRRLKLAQSKEHFYIIDEQNNTRKNFQLTYKGVLFEGEKHIPSKDHPLFIHT
IFVWTESPEKLKHFSAKDFENIEEKVLERYPNCKIDWDQPIKLAKKAEER
>P94521 3.4.11.-~~~ysdC~~~Putative aminopeptidase YsdC~~~COG1363
MAKLDETLTMLKDLTDAKGIPGNEREVRQVMKSYIEPFADEVTTDRLGSLIAKKTGAENGPKIMIAGHLDEVGFMVTQIT
DKGFIRFQTVGGWWAQVMLAQRVTIVTKKGEITGVIGSKPPHILSPEARKKSVEIKDMFIDIGASSREEALEWGVLPGDM
IVPHFEFTVMNNEKFLLAKAWDNRIGCAIAIDVLRNLQNTDHPNIVYGVGTVQEEVGLRGAKTAAHTIQPDIAFGVDVGI
AGDTPGISEKEAQSKMGKGPQIIVYDASMVSHKGLRDAVVATAEEAGIPYQFDAIAGGGTDSGAIHLTANGVPALSITIA
TRYIHTHAAMLHRDDYENAVKLITEVIKKLDRKTVDEITYQ
>P0DPP8 ~~~ysdD~~~Protein YsdD~~~
MTIDKNWLNRSNKDPGRSLRFTHQPV
>P0DSH6 ~~~ysdE~~~Protein YsdE~~~
MNVSQIYARNGELFSGRICKQKRQ
>C1P620 ~~~yshB~~~Uncharacterized protein YshB~~~
MLESIINLVSSGAVDSHTPQTAVAAVLCAAMIGLFS
>P42955 ~~~yslB~~~Uncharacterized protein YslB~~~COG1719
MKSKFEASIDNLKEIEMNAYAYELIREIVLPDMLGQDYSSMMYWAGKHLARKFPLESWEEFPAFFEEAGWGTLTNVSAKK
QELEFELEGPIISNRLKHQKEPCFQLEAGFIAEQIQLMNDQIAESYEQVKKRADKVVLTVKWDMKDPV
>P94562 2.3.1.-~~~ysnE~~~Uncharacterized N-acetyltransferase YsnE~~~COG0454
MHIKIDDLTGRQVVSLVNEHLHSMTLMSPPESIHALGLEKLRGPEITFWSAWEGDELAGCGALKELDTRHGEIKSMRTSA
SHLRKGVAKQVLQHIIEEAEKRGYERLSLETGSMASFEPARKLYESFGFQYCEPFADYGEDPNSVFMTKKL
>P94560 ~~~ysnF~~~Stress response protein YsnF~~~COG3861
MKSIVGVYETPQETIAAIEGLLTKGYDSDDISVVTSRRDTDYLESRTGTEVNQAIDAHQDESESFFDKLKDYFTMDDTAT
HSKALSDLDIKTDEIDKYQEDLDDGKLLVAVDTDADVIAPIDNGNALSGGFSSTNELDYTTKEEKTMPLREEQLKVDKED
VQTGEVEIGKEVKTEKRDMDIPVRHDEIYVERRPVDENKTDAAPVNDSEEIRVPIVEEKLEVTKKPVVTDEVVVGKRTVE
ENEHISETVKKEEPRLNKEGKVDGLDDDPLNNK
>Q02170 ~~~ysxA~~~UPF0758 protein YsxA~~~COG2003
MVIHDLPLKLKDFPMKEKPRERLLKVGAENLANHELLAILLRTGTKHESVLDLSNRLLRSFDGLRLLKEASVEELSSIPG
IGMVKAIQILAAVELGSRIHKLANEEHFVIRSPEDGANLVMEDMRFLTQEHFVCLYLNTKNQVIHKRTVFIGSLNSSIVH
PREVFKEAFKRSAASFICVHNHPSGDPTPSREDIEVTRRLFECGNLIGIELLDHLVIGDKKFVSLKEKGYL
>C0SP79 ~~~ytaF~~~Probable sporulation protein YtaF~~~COG1971
MQMVSILLLALAVSLDSFSVGFTYGLRKMKIPFKAILVIACCSGAVMFISMLIGSFLTKFFPVYVTEKLGGLILVGIGAW
VLYQFFKPAKDKEYLLHEKTLLNLEVRSLGIVIHILRKPMSADIDKSGVINGIEAVLLGFALSIDAFGAGIGAAILGFSP
IVMSIAVAIMSSLFVSIGINAGHFLSKWKWIDKMAFLPGLLLITIGLWKL
>O34678 1.-.-.-~~~ytbE~~~Uncharacterized oxidoreductase YtbE~~~COG0656
MTTHLQAKATLHNGVEMPWFGLGVFQVEEGSELVNAVKTAIVHGYRSIDTAAIYGNEAGVGEGIREGIEEAGISREDLFI
TSKVWNADLGYEETLAAFETSLSKLGLDYLDLYLIHWPVEGKYKEAWRALETLYKEGRIKAIGVSNFQIHHLEDLMTAAE
IKPMINQVEFHPRLTQKELIRYCQNQGIQMEAWSPLMQGQLLDHPVLADIAQTYNKSVAQIILRWDLQHGIITIPKSTKE
HRIKENASVFDFELTQDDMNRIDALNENLRVGPDPDNFDF
>O34533 ~~~ytcD~~~Uncharacterized HTH-type transcriptional regulator YtcD~~~COG1733
MEKKKYNISVEATLEVIGGKWKCVILCHLTHGKKRTSELKRLMPNITQKMLTQQLRELEADGVINRIVYNQVPPKVEYEL
SEYGRSLEGILDMLCAWGANHINRVYGDTFSVLEESVLNDKLKQES
>P53561 ~~~ytcP~~~Polygalacturonan/rhamnogalacturonan transport system permease protein YtcP~~~COG0395
MKNRLFDMLIYGFLLMFALICVLPFIHVIAASFATVEEVVSKKFILIPTTFSLDAYRYIFSTDIIYKSLLVSVFVTVIGT
AVSMFLSSLMAYGLSRRDLIGRQPLMFLVVFTMLFSGGMIPTFLVVKSLGLLDSYWALILPTAINAFNLIILKNFFQNIP
SSLEESAKIDGCNDLGIFFKIVLPLSLPAIATISLFYAVTYWNTYMTAILYLNDSAKWPIQVLLRQIVIVSSGMQGDMSE
MGSGSPPPEQTIKMAVIVVATIPVLLVYPFIQKHFAKGALLGSVKG
>Q795R2 ~~~ytcQ~~~Polygalacturonan/rhamnogalacturonan-binding protein YtcQ~~~COG1653
MGNKWRVLLIVLVLALGGVLAGCKGTDQSSAEGKAGPDSKVKLSWMAILYHQQPPKDRAIKEIEKLTNTELDITWVPDAV
KEDRLNAALAAGNLPQIVTIQDIKNSSVMNAFRSGMFWEIGDYIKDYPNLNKMNKLINKNVTIDGKLYGIYRERPLSRQG
IVIRKDWLDNLNLKTPKTLDELYEVAKAFTEDDPDKDGKDDTFGLADRNDLIYGAFKTIGSYEGMPTDWKESGGKFTPDF
MTQEYKDTMNYMKKLRDNGYMNKDFPVTSKTQQQELFSQGKAGIYIGNMVDAVNLRDHASDKSMKLEIINRIKGPDGKER
VWASGGHNGVFAFPKTSVKTEAELKRILAFFDRIAEEDVYSLMTYGIDGVHYNKGEDKTFTRKESQVKDWQTDIQPLSAL
IAIDKAYLKNTGDPLRTAYEELTEDNEKIIVSNPAESLYSASESERGDELKKIIDDATYKYMIGDITESQFDKEVEKWES
SGGKQIIQEYEEAFKQAK
>C0SPB3 ~~~yteP~~~Polygalacturonan/rhamnogalacturonan transport system permease protein YteP~~~COG4209
MKTAEAQAPAVDAVIFKKEKRKRLLIKLIQQKYLYLMILPGCIYFLLFKYVPMWGIVIAFQDYQPFLGILGSEWVGLKHF
IRLFTEPTFFLLLKNTLVLFALNLAIFFPVPILLALLLNEVRIALFKKFVQTLIYIPHFMSWVIVVSLSFVLLTVDGGLI
NELIVFFGGEKINFLLNEEWFRPLYILQVIWREAGWSTIIYLAAITAVDPQLYEAAKMDGAGRLRQMWHITLPAIKSVIV
VLLILKIGDTLELGFEHVYLLLNATNREVAEIFDTYVYTAGLKQGQFSYSTAVGVFKAAVGLILVMLANRLAKKFGEEGI
Y
>C0SP80 ~~~yteS~~~Putative lipoprotein YteS~~~COG1653
MTKRIRTALCVIVSVLFLASCSSRPDGMHVILFSDMQAGVQEKIKKAAEQNAGKVDIFPAFQEKLLTEITAHEGDVFIVP
EDMFQAYDDPENFQPLNGLPPEKTSPYTTVNKKTGEKTIYAVQIEKGKKQLNGYSFQLNRDMAAFIPVYAEKTEEALQLI
SQLTEAR
>O34371 1.-.-.-~~~yteT~~~Putative oxidoreductase YteT~~~COG0673
MKNIVFCGLSSRAFSMFIKPLMERFSTHYEITGLLDADPKRFAVCKKKFPELAHVPEFSEDAFDEMMRVSKPDIVIVAGR
DDTHVAYIVKSLQWNTDVITEKPMVTTVQDANRVLEAEAKSEGKVTVAFNYRYSPFHRKIKEMILDGKIGRVTSVDLNWY
IDTYHGASYFKRWNRSRQFSGGLSVHKSTHHFDLVNWWLGQNPEEVFAYGALNYYGPDSEWNPLPEEDGRFCGTCRVKEK
CHYYSRWHPRSSKASIKDDHLEAGDQSSLYTAYRPDACIFDEEIDIEDTYVAAVKYDGGALLSYSIIFSAPYEGYRLTIN
GTKGRIESNEFHEPSRIPFAFPEQTIEYYPLFESKQTIQVVKNEGGHGGGDPLLLEDLFLGKDPLRRYDILAGAEAGAYS
IAVGEGMWRSVAEKKPIGMKELFQMQNV
>P69506 ~~~ytfE~~~Iron-sulfur cluster repair protein YtfE~~~COG2846
MAYRDQPLGELALSIPRASALFRKYDMDYCCGGKQTLARAAARKELDVEVIEAELAKLAEQPIEKDWRSAPLAEIIDHII
VRYHDRHREQLPELILQATKVERVHADKPSVPKGLTKYLTMLHEELSSHMMKEEQILFPMIKQGMGSQAMGPISVMESEH
DEAGELLEVIKHTTNNVTPPPEACTTWKAMYNGINELIDDLMDHISLENNVLFPRALAGE
>P39314 ~~~ytfF~~~Inner membrane protein YtfF~~~COG0697
MISGVLYALLAGLMWGLIFVGPLIVPEYPAMLQSMGRYLALGLIALPIAWLGRVRLRQLARRDWLTALMLTMMGNLIYYF
CLASAIQRTGAPVSTMIIGTLPVVIPVFANLLYSQRDGKLAWGKLAPALICIGIGLACVNIAELNHGLPDFDWARYTSGI
VLALVSVVCWAWYALRNARWLRENPDKHPMMWATAQALVTLPVSLIGYLVACYWLNTQTPDFSLPFGPRPLVFISLMVAI
AVLCSWVGALCWNVASQLLPTVILGPLIVFETLAGLLYTFLLRQQMPPLMTLSGIALLVIGVVIAVRAKPEKPLTESVSE
S
>O34806 ~~~ytfJ~~~Uncharacterized spore protein YtfJ~~~COG3874
MADHPIQGLMKTAMENLKEMIDVNTIIGDPVETPDGSVILTVSKVGFGFAAGGSEFGGKPAEKKSEDDETREQKLPFGGG
SGGGVSITPIAFLIVGSTGIRMLHLDENTHLIEKILDAAPQTLERIQQMFKKNNKNQSQGQNQNQMNNMNY
>P39187 ~~~ytfJ~~~Uncharacterized protein YtfJ~~~COG3054
MTLRKILALTCLLLPMMASAHQFETGQRVPPIGITDRGELVLDKDQFSYKTWNSAQLVGKVRVLQHIAGRTSAKEKNATL
IEAIKSAKLPHDRYQTTTIVNTDDAIPGSGMFVRSSLESNKKLYPWSQFIVDSNGVALGAWQLDEESSAVVVLDKDGRVQ
WAKDGALTPEEVQQVMDLLQKLLK
>D2TN58 ~~~ytfP~~~Gamma-glutamylcyclotransferase family protein ytfP~~~COG2105
MRIFVYGSLRTKQGNSHWMTNALLLGEYSIDNYQLYSLGHYPGAVPGNGTVHGEVYRIDNATLAELDALRTRGGEYARQL
IQTPYGSAWMYVYQRPVEGLTLIESGNWLDRDQY
>P0AE48 ~~~ytfP~~~Gamma-glutamylcyclotransferase family protein YtfP~~~COG2105
MRIFVYGSLRHKQGNSHWMTNAQLLGDFSIDNYQLYSLGHYPGAVPGNGTVHGEVYRIDNATLAELDALRTRGGEYARQL
IQTPYGSAWMYVYQRPVDGLKLIESGDWLDRDK
>P39325 ~~~ytfQ~~~Galactofuranose-binding protein YtfQ~~~COG1879
MWKRLLIVSAVSAAMSSMALAAPLTVGFSQVGSESGWRAAETNVAKSEAEKRGITLKIADGQQKQENQIKAVRSFVAQGV
DAIFIAPVVATGWEPVLKEAKDAEIPVFLLDRSIDVKDKSLYMTTVTADNILEGKLIGDWLVKEVNGKPCNVVELQGTVG
ASVAIDRKKGFAEAIKNAPNIKIIRSQSGDFTRSKGKEVMESFIKAENNGKNICMVYAHNDDMVIGAIQAIKEAGLKPGK
DILTGSIDGVPDIYKAMMDGEANASVELTPNMAGPAFDALEKYKKDGTMPEKLTLTKSTLYLPDTAKEELEKKKNMGY
>Q6BEX0 7.5.2.9~~~ytfR~~~Galactofuranose transporter ATP-binding protein YtfR~~~COG1129
MTTDQHQEILRTEGLSKFFPGVKALDNVDFSLRRGEIMALLGENGAGKSTLIKALTGVYHADRGTIWLEGQAISPKNTAH
AQQLGIGTVYQEVNLLPNMSVADNLFIGREPKRFGLLRRKEMEKRATELMASYGFSLDVREPLNRFSVAMQQIVAICRAI
DLSAKVLILDEPTASLDTQEVELLFDLMRQLRDRGVSLIFVTHFLDQVYQVSDRITVLRNGSFVGCRETCELPQIELVKM
MLGRELDTHALQRAGRTLLSDKPVAAFKNYGKKGTIAPFDLEVRPGEIVGLAGLLGSGRTETAEVIFGIKPADSGTALIK
GKPQNLRSPHQASVLGIGFCPEDRKTDGIIAAASVRENIILALQAQRGWLRPISRKEQQEIAERFIRQLGIRTPSTEQPI
EFLSGGNQQKVLLSRWLLTRPQFLILDEPTRGIDVGAHAEIIRLIETLCADGLALLVISSELEELVGYADRVIIMRDRKQ
VAEIPLAELSVPAIMNAIAA
>P39328 ~~~ytfT~~~Galactofuranose transporter permease protein YtfT~~~COG1172
MMPQSLPDTTTPKRRFRWPTGMPQLVALLLVLLVDSLVAPHFWQVVLQDGRLFGSPIDILNRAAPVALLAIGMTLVIATG
GIDLSVGAVMAIAGATTAAMTVAGFSLPIVLLSALGTGILAGLWNGILVAILKIQPFVATLILMVAGRGVAQLITAGQIV
TFNSPDLSWFGSGSLLFLPTPVIIAVLTLILFWLLTRKTALGMFIEAVGINIRAAKNAGVNTRIIVMLTYVLSGLCAAIA
GIIVAADIRGADANNAGLWLELDAILAVVIGGGSLMGGRFNLLLSVVGALIIQGMNTGILLSGFPPEMNQVVKAVVVLCV
LIVQSQRFISLIKGVRSRDKT
>Q9S529 ~~~ytgA~~~Metal-binding protein YtgA~~~
MSFFHTRKYKLILRGLLCLAGCFLMNSCSSSRGNQPADESIYVLSMNRMICDCVSRITGDRVKNIVLIDGAIDPHSYEMV
KGDEDRMAMSQLIFCNGLGLEHSASLRKHLEGNPKVVDLGQRLLNKNCFDLLSEEGFPDPHIWTDMRVWGAAVKEMAAAL
IQQFPQYEEDFQKNADQILSEMEELDRWAARSLSTIPEKNRYLVTGHNAFSYFTRRYLSSDAERVSGEWRSRCISPEGLS
PEAQISIRDIMRVVEYISANDVEVVFLEDTLNQDALRKIVSCSKSGQKIRLAKSPLYSDNVCDNYFSTFQHNVRTITEEL
GGTVLE
>C0SP90 1.10.3.-~~~ythA~~~Putative cytochrome bd menaquinol oxidase subunit I~~~COG1271
MDDLVLARSLFGTTMGFHIIFATLGVGLPLMILVAELIYQKTKDDHYAIMAKRWTKAQAVLLGVAIPTGTIAGTQLALLW
PGFMEVIGRVMSLPFQIEIYAFFVEALFMSIYVYAADRLSPAMRIVAVFFVLVGAAASAVLITNVHAFEGTPAGFKILNG
KITDVDPWAAFFNPSFFITAGHVVLSAFMTGAFIVASVAAYKMIRTRKKERVYRFHRKALLLALTIGGIFSLLTALNGHE
SAQMLYEYQPEKLAGAEGLFETRSHAPLAIGGFTDPNEEKVKWAIEIPWALSFLAANRFDTVVKGLNAFPRDEWPPLFIH
TLFNAMVGVGMLLILYSIIGVVWRKVLKKDRFPTWLLIIFMTAGPFSLIGIEFGWIFACTGRQPWVIYHLLKTSDVVTTT
GSIGVLFLFFTFVYAVLGAAVVYVLLYYFRKHPVDEDLNTAES
>A8DYQ1 ~~~ythA~~~Uncharacterized protein YthA~~~
MIKNFIFDNLIILAVPFMIKTSLKTNLIFFFLCVFVPHMAS
>O34505 ~~~ythB~~~Putative cytochrome bd menaquinol oxidase subunit II~~~COG1294
MEISTDALIAISIIWGFVFIYAVMATMDFGAGFWSMIYLNKEHMKATDIANRFLSPTWEVTNVFIVAIVVALFSFFPGAT
FVLGTVLLIPGSMILLLLAIRSGFLVFSNTAKERKTLRYISGISGFIIPAILILVLPVTHGGFIEKTDGIYNLNMSKIFS
SPNAYSFIGFAILSTLFLSSLLLADFSNVAEEQDAYRAYRKSALITGPISLLFAVCIMVTMRNEANWLYSGMMNDFSWII
ASFITFVIAGIALFLPNKSFGQNIGKPRLALVAIGIQYFLASYAYGRAHLPYMIYPDVTVMSGFTEPATFRALFATYIVA
FIILFPGFFFFWKMFMRDKRYIRQEE
>P0DSI0 ~~~ythB~~~Protein YthB~~~
MNKLPAHLSRQNCKIASTNLSEIIPRRAAVLK
>P0DPC4 ~~~ytiC~~~Protein YtiC~~~
MPVNGIFDVFDMLSIYIIYKLIVSNNTWLIMRK
>P0DPC5 ~~~ytiD~~~Protein YtiD~~~
MADYAEINNFPPELSSSGDKYFHLRNYSEYSEYTSGFFLSLMIFIKS
>O35013 3.6.1.55~~~ytkD~~~Putative 8-oxo-dGTP diphosphatase YtkD~~~COG0494
MYEFKDYYQNTVQLSFDDQPFSDSPKHVWVICRFGGKWLLTEHEDRGYEFPGGKVEPMECAEEAALREVKEETGARVKSL
KYLGQYKVLGKEKVIVKNIYFADIEKLEKQADYFETKGPVLFHELPENLSRNKKFSFIMKDSVLPISLKKLKESGWIE
>O34799 2.7.1.-~~~ytlR~~~Putative lipid kinase YtlR~~~COG1597
MSHWFFIINPTAGHRNGLRVWKSIQKELIKRKVEHRSFLTEHPGHAEVLARQISTIQEYKLKRLIVIGGDGTMHEVVNGL
KDVDDIELSFVPAGAYNDFSRGFSIKKIDLIQEIKKVKRPLTRTFHLGSVNFLQDKSQILYFMNHIGIGFDAYVNKKAME
FPLRRVFLFLRLRFLVYPLSHLHASATFKPFTLACTTEDETREFHDVWFAVVSNHPFYGGGMKAAPLANPREKTFDIVIV
ENQPFLKKYWLLCLMAFGKHTKMDGVTMFKAKDITFYTKDKIPFHADGEIMGTTPFRLASSPSPLRIKT
>O34365 ~~~ytmB~~~Uncharacterized protein YtmB~~~
MGMPVEFNTLIVTKGKEVRIDENIFTLEKDGYRVYPMEIPMDVRKTKFGEKSGTAEVQKLQWEEGRTIITYKLTSLHSVN
>O34760 3.1.1.-~~~ytnP~~~Probable quorum-quenching lactonase YtnP~~~COG0491
METMKIGNITLTWLDGGVTHMDGGAMFGVVPKPLWSKKYPVNEKNQIELRTDPILIQKDGLNIIIDAGIGYGKLTDKQKR
NYGVTQESNVKPSLAALGLTVADIDVIAMTHLHFDHACGLTEYEGERLVSVFPNAVIYTSAVEWDEMRHPNIRSKNTYWK
ENWEAVAGQVKTFEDTLTITEGITMHHTGGHSDGHSVLICEDAGETAVHMADLMPTHAHRNPLWVLAYDDYPMTSIPQKQ
KWQAFAAEKDAWFIFYHDAEYRALQWEEDGSIKKSVKRMKR
>O34707 4.2.3.130~~~ytpB~~~Tetraprenyl-beta-curcumene synthase~~~
MTVPEHPFGLMAKVYRDIFPLVHQELDIWKQKSESIHNSELKAQATASIRDKTFHCEGGGILALLSGSQKQKCVEFIIAY
QTISDYLDNLCDRSTSLDPQDFRMLHASMQDALTVGAELQNYYQFREEQDDSGYLHELVKTCQRVLGSIEHYDMIKPYLL
ELCGYYCDLQVHKHVIEHERVPRLEKWFTQYESELPEMEWYEFSACAGSTLGIFCLVAYSFQPDFTESTAKKIRDSYFPY
IQGLHILLDYLIDQEEDLLEGDLNFCSYYQSHEEMMDRLEHFIHKADEHLQGIPHENFHRLINRGLLGVYLSDDKVAGQK
EMGRLAKKLIKASGKTSLFFYINGRAYRKFQKMSWMKNSKKKAQIIC
>O34357 ~~~ytpP~~~Thioredoxin-like protein YtpP~~~COG0526
MKKIESTQELEKAVKDDWSVFMFSADWCPDCRFVEPFLPELEANFPEFTYYYVDRDKFIDTCAEWEIYGIPSFVVFNEGK
EVNRFVSKDRKTKEEIEQFLTDSLAKA
>O34712 ~~~ytrA~~~HTH-type transcriptional repressor YtrA~~~COG1725
MIQIDPRSSTPIYEQIIQQMKELCLKGIMKPGDKLPSVRELATIIIANPNTVSKAYKELEREGIIETLRGRGTYISENAK
TTLVEGKMTMIKEQLKQLIIDAHYAGVELEKLHEWIKEISADVKGGKKND
>O34641 ~~~ytrB~~~ABC transporter ATP-binding protein YtrB~~~COG1131
MIELRQLSKAIDGNQVLKDVSLTIEKGEIFGLLGRNGSGKTTMLRLIQQIIFADSGTILFDGVEIKKHPKVKQNIIYMPV
QNPFYDKYTYKQLVDILRRIYPKFDVTYANELMNRYEIPETKKYRELSTGLKKQLSLVLSFAARPALILLDEPTDGIDAV
TRHDVLQLMVDEVAERDTSILITSHRLEDIERMCNRIGFLEDNSLTNVMDLDELKEEYIKIQMAFDTDVNLAIREQNIPM
LDQAGVFYTVLIPKSDEEKKSFLRELKPKVWNELPVNLEEVFIAKFGGKRRW
>O34898 ~~~ytrC~~~Probable ABC transporter permease YtrC~~~
MVDRGLLYREWKQNQVVILLSIAFLVLANPLSIVNTYLSYQGCLDRQDPQYCDFIVNYSISNLIDINWVPGVILAVCFLG
MERSKGTMDFILSLPYNRSQIFQTKFWLGGFVIVLSQLIGFLLAWLLILVYNPEHVYFFEHSSIGVIVISFMAFSLVMAA
GALTGNAFAQLLTAFSAAILPYLIIALPVGNLEVVFGANIWEIFPSPESYFSLASNLSNLVPISYVVNEWLINSKYLLLI
PAVMSMLFYLIGFISFKKLPSERNGHFFLWNRLDRPVQILVMAFGILGFGLFGYYTGHSIIGYILGMIIGAVAGFFVSYF
SIYKKTKV
>O34953 ~~~ytrD~~~Probable ABC transporter permease YtrD~~~
MPDSGLLYKEWRQNKVALVITILVFILGNPLSILNMYLIYQGCVTGKENWVGPCVFSVDYLNSSFISLFWIWGVVLAVSQ
LGIERSKSFFDFTLSLPYTRGQIFHAKFLTGGMVIVVPQLIGYVLSVLLIMLLKPDQAVYFHNYSLGMIIVSMLAYSLVM
AGGALTGNSFAQLLVSFTVAISPFLLISLPVINLEILFGGSIDFIHGPVPKWVQYFIPIIYVDSKWAENSPYYLVIPAIM
TIIFYIIGYISFVKMSNERNGYFFLWKPLNRPVQIIVIIIGIMGFGYFGFTASESFAGYLIGMGTGAVIGFLISYFAIYK
KMKLL
>O34392 ~~~ytrE~~~ABC transporter ATP-binding protein YtrE~~~COG1136
MIDVQHIDHSFTIGKKGRENEVPVLKDVSLSVAKGEIACIVGRSGSGKSTLLNLISGYISPTKGRIVINGTDVTGFNEKE
WAQFRLDHFGFIFQSFQLIPGLTTYENVEMPLALKGIKPSERKQKVQDMLKRVGLENHAAHYPNELSGGQQQRVSIARAL
ILNPSIILADEPTGSLDSETEHEVLELIQQLNRERGITFVIITHDDEVASIGHSKFQLHDGVLKGGITVEV
>O35005 ~~~ytrF~~~ABC transporter permease YtrF~~~COG0577
MRFKDQVHFIRRNMKKNRLRVFMTILATTMACAFLVVLSSVGFGIQKTITDMTMSQQIVTKVSVMGKEGDKPIKKADLEK
YDHVRSVVERTQVYEPNKATLGNRTNESSNLIFTNMNDELKANMELEKGRVAKSENEIVVGYDFAKRLLTKKESEEYNKK
IEEAKGNPEDIKEPKGYTKDILNKTIELSVSKTDSKTGDVTKTKTYDFKIVGITKKPSQDWMEDSNIFISDQFKKDFSEF
LDFKGGNVETNIGVFADKFENVEQLTNDLTDDGYYVTSVTTELEGANTFFMVFKIGLIFVGCIAVIISAIGIFNTMTMAV
TERTQEIGIMKAIGASPSIIRRMFLMESAYIGILGCVIGIIISYGVSYLVNLAVPMILAATSGGDAGDLNYTFSYIPASL
VIIAVVICGGVAVISGMNPARKATKTNVLTALRREL
>C0H3P8 ~~~ytrH~~~Sporulation membrane protein YtrH~~~
MDQEAGFMVNFINSYFIALGVLIGGALIGGLGAYLAGEPPLTAITKLANRLKIWALVAAIGGTFDAVYSFERGILEGNTR
DIFKQLLLIISAMGGAQSGWLIISWLTQEHLSS
>O34460 ~~~ytrI~~~Sporulation membrane protein YtrI~~~
MRVPQHYKKPGWQRFFAGMMCGAVISWFFFLFTYGTFQEEQVSLIEKQKEHVKDLNNQISIYQEDLHKLNEDNKRKLLIQ
SVSVKLLNGDKYKISQPDKTKFEEHVKDDISEVITKDIESVYQTKDLLKRTIENKVYMINEKKYEATVRELIIYTRLTVE
LEISFAT
>Q795Q5 ~~~yttA~~~Uncharacterized membrane protein YttA~~~COG2433
MEMVLAFLGFLACLIALGYGLYHLVRYVLKKEKRFSKRLFWPLFIGGLVLLFTGAALAEPDTAAANAEKKYSALNAEYKN
LTKEHEELEKEYKSVSSEAKKLKDNKEDQDKLEKLKNENSDLKKTQKSLKAEIKELQENQKQLKEDAKTAKAENETLRQD
KTKLENQLKETESQTASSHEDTGSSSNNTSKSDETKTADKAEGCNIKGSRNGIYHTPGSTYYDRTTDPAEMFCSVEEAEA
AGYRAPKR
>O34970 ~~~yttP~~~Probable HTH-type transcriptional regulator YttP~~~COG1309
MKVSTKDKIIESAVMLFNQKGFSGTSVREIAKSADVNVAHISYYFKGKGGLMEHLVSEFYEGYSKTLETAASNISTQSTQ
EQLLQLVFDILSYQHNHRQLTRFVYREVTIDSTLIREIMSTYLMKEKYIFQLIIEEGEKQREYLTLPLPHFILQLKSLLM
MPYLQPQYISEVLYMQPHEPYFYKMYFEEIKIWIRSVFRTGDVALTN
>O32067 ~~~ytzE~~~Uncharacterized HTH-type transcriptional regulator YtzE~~~COG1349
MKPSTNRMLTRIKSVYMFIQEKGLVTTQELVDEFGITPRTIQRDLNVLAYNDLVHSPSRGKWETTRKKVKITS
>P0A0N2 ~~~~~~DegV domain-containing protein~~~
MKIAVMTDSTSYLSQDLIDKYNIQIAPLSVTFDDGKNFTESNEIAIEEFYNKMASSQTIPTTSQPAIGEWITKYEMLRDQ
GYTDIIVICLSSGISGSYQSSYQAGEMVEGVNVHAFDSKLAAMIEGCYVLRAIEMVEEGYEPQQIIDDLTNMREHTGAYL
IVDDLKNLQKSGRITGAQAWVGTLLKMKPVLKFEDGKIIPEEKVRTKKRAIQTLEKKVLDIVKDFEEVTLFVINGDHFED
GQALYKKLQEDCPSGYQVAYSEFGPVVAAHLGSGGLGLGYVGRKIRLT
>Q9RPX3 ~~~~~~Type IV secretion system putative outer membrane lipoprotein BRA0058/BS1330_II0058~~~
MRTLVMVACAVSLAACSSPPKPPTVSGRHRIPINSPAAQEELRLQVFPQEPTAQATMWPARPPKQTVSVYFPQDVTVFRP
TSAQINQLHTLLWPVPKHINVRGLTDNNCPPPGDTQVARVRALAIYNWLINQGVSASRITISYAPVKDYASNAPLSPGRV
LNRRVDIEILRK
>O32079 ~~~yuaD~~~Putative metal-sulfur cluster biosynthesis proteins YuaD~~~
MWKRMTAKAEGLYIADTKSFVTKQMDKLDFDYGGIPGDLHFGLTKKAGAREPMFSRGTEIFNRRQISIVSIEECNEIALK
MGVPRILPEWLGANVAVSGMPDLTSLKEGSRIIFPSGAALLCEGENDPCIQPGEVIQSYYPDQPKLASAFVRHALGIRGI
VCIVERPGAVYTGDEIEVHSYQRKVKRKAERV
>O32101 ~~~yueB~~~ESX secretion system protein YueB~~~COG1511
MTEQRKSLIKLISAVIIILLLPVLFFRFIGDDPTKKAVNSTRQIAVVNEDTGVLSDEVKSDEEDKSAQFGKEVAAVLGER
PDYSWTVVNRSAAETGLASKKYDAIVYIPSDFSKNILSYDKDHPQKATLEFSIQDNLNAVNKEKVQRELEDAQKTMNKKM
SALYWNFVSQKVDNIRGEFDKIVNKESEFQNVMYNFYKPSSNDLAGEIKQQKDLIDELKKSMNEAQGTTKEKASTAEEAK
NTLKEFIDTVERYKEYQENQKKLLLAAQDSTQQQIRTGLDAIQAQQKANQFSERMSGLATGIGQAKTQIGLTNLALNNAE
KLRQNQVPLQEMGMKKIENDMFNAFLSRYKSQYEAIKYQNLNQLQENIGKNRLSLLKPKESDEKEDGEDTSDNKDDTDKE
DIEDIKLDLEKQRDELKNVATEIKDISEGLKEPEQEKPTTPDAEEPSTDDSPNTEEPSNDIPTSDDQPTNEDTGSSEEGT
QDNGSQNDVQTNIETGQKHQESSKNVPEQDTNTENTGTSKTDFSLIELADENDGSNQSDGLQGDGADGETDISGAKKRLN
EAAIKLEEIENALQEKQEEHNNKLEKHIDELNQEIKELNKTVSKLNDQIGDLTKKLVDFDNNVNDAYRLIYNLEDEIIQT
LQSRGYIDQKEKLSSIFSSRIETDNISNLMKYYNSLNLYKSTLNDNLDLGSLTIIKGEVIQEQDGNVQSVLALTPEESAS
WEALKNNTMQTDEDINSFIDGMTKFADDYSGYIRDSQAGVLDELTKISESAAKASEQLVTGATQESATFSNDGLSGTMAL
SVQDTVGQEVLQMSDMMGSLSDRQSGIIDYTTNMQQSVNDVQAKADTLNNNWGKNVASTKLVRNDVYGILGNTLVDGQNN
GYVYDYLANPLKISGEVPEEKIQTVPPVVILVIVLISSLLIGYFSSYYQNAPLLVKGALFGILNILVGLMISLFGLNIYS
LPDDQTIKWSVFTILLLVASSAFIRTAFRFGSIPGWVASAAMILFYVAPLIDLIMPNFTFEDPVSKVYIDIQYGTGHLFT
MGITVLLIITVIAVALPLIIRLMAEHTAESDETYEA
>O32092 ~~~yueI~~~Uncharacterized protein YueI~~~COG5506
MSEDKMDLYLQQGMYGPLETKPDERHLFLGSLRERVVLALTKGQVLRSKPYKEAEHELKNSHNVTLLINGELQYQSYSSY
IQMASRYGVPFKIVSDLQFHTPLGIVIAADIAVNRELIYIQDDIYNRSVLKS
>O32108 ~~~yuiC~~~Uncharacterized protein YuiC~~~COG3584
MMLNMIRRLLMTCLFLLAFGTTFLSVSGIEAKDLSKWVQEHQEKHLKHAGLRLKALQQKQTQTTSAAEDKTKPLEEAFDW
DEYPVQRVTATGYTAGAESTGKNPGDPLYGLTYSGVKVKRDLYSTVAADPSVFPIGTILFIPNYGLGVVADTGSAIKGNR
LDLYFETVKDVYNEWGKKTLDVYVIKKGTGKITEDELEKLNETKSLQVFRNQYKTVKE
>P71070 ~~~yukC~~~ESX secretion system protein YukC~~~COG4499
MSGEQKSYLENQLEAVAEKTDAGYTFTFQREKIKLLDGLEANVIKDINPFFHKEIDVTDDEVIITIQPPSSYKAFRFMKA
KDKKSKWQFAYQLVQAVQQHNLSRLNLIVAPENIVFDKGLTPYFLHYGVKESIPPYERDEERVWQELKAAAALAVDGAFA
FEDYLKFNETLTFSAEAKAILDAESYDDLLELIQTHIDELEAKAKTYIHIPRKKWNIQRYIGLGLIVLLVPALIYSMYAL
FFAQPKHQAIVDSNRAFLNKQYSEVISTLSKYDAESLPESVQYQLATSYVEVENLGSAKTKNIENNLVTLQSDPQHFLYW
IDYGRGEYKEAISIGRKLEYNDYIYFALAKYKQQLLSEDTNDEDIQKELDSVNSELEKAQKERQENKQSNSETSLVDTSE
EQTQTDEEKQAEEKAAEEKAAAEEKAKKEEQKEKEDEKKETEKKDEKKDDK
>P71071 ~~~yukD~~~ESX secretion system protein YukD~~~COG5417
MYIDITIDLKHYNGSVFDLRLSDYHPVKKVIDIAWQAQSVSMPPREGHWIRVVNKDKVFSGECKLSDCGITNGDRLEIL
>C0SP85 ~~~yukE~~~Protein YukE~~~COG4842
MAGLIRVTPEELRAMAKQYGVESQEVLNQVDRLNRMISDLKSMWEGASSEAFADQYEQLKPSFIKMSDLLQDVNQQLDQT
ANTLESTDQDIANQIRG
>O32131 ~~~yunB~~~Sporulation protein YunB~~~
MPRYRGPFRKRGPLPFRYVMLLSVVFFILSTTVSLWMINGSIKPVLMDIGEMETKRIATEVIQDSIEDYMSDSENMKDMF
QMNSDENGNLTTIDFNTQVVNSVKTKVTKQLQAHLKEMETHTGHSGASENIMINIPLGQVTGNSLLGNLGPKIPVRFNLI
GDAFTDVKTKIKPYGINNALIDISIFVEIKVKVIIPFASKTAVVTNNVPVSIKAVQGEVPQFYNGSGGSGVTPSVQLPSS
KENGADSKKEKSSK
>O32152 ~~~yurK~~~Uncharacterized HTH-type transcriptional regulator YurK~~~COG2188
MLNNGSSTPLYIQLKQIITDDIKKGVYSPTAKLPTENELCTKYNVSRITVRKAILDLVEEGYLIRQQGKGTFVKSPKLKR
ELIAVNGYSEFMESTGKKPKHHVLSHDIIPASKPIAEKLQIQPESPVVELKRILYNDDQPLTFEVTHYPLDLFPGIDTFI
ADGVSMHDILKQQYKVVPTHNTKLLNVVYAQQEESKYLDCDIGDALFEIDKTAFTSNDQPIYCSLFLMHTNRVTFTINSP
YT
>O32181 ~~~yusO~~~Uncharacterized HTH-type transcriptional regulator YusO~~~COG1846
MKSADQLMSDIQLSLQALFQKIQPEMLESMEKQGVTPAQLFVLASLKKHGSLKVSEIAERMEVKPSAVTLMADRLEQKNL
IARTHNTKDRRVIDLSLTDEGDIKFEEVLAGRKAIMARYLSFLTEEEMLQAAHITAKLAQAAETDEKQNMKRGNG
>O32188 ~~~yusV~~~Probable siderophore transport system ATP-binding protein YusV~~~COG1120
MGMSAISTETLSLGYGDAVIIDELNLTIPKGEITVFIGSNGCGKSTLLRSLARLMKPRGGSVLLEGRAIAKLPTKEVAKE
LAILPQGPSAPEGLTVHQLVKQGRYPYQNWLKQWSKEDEEAVERALKATKLEDMADRAVDSLSGGQRQRAWIAMTLAQET
DIILLDEPTTYLDMTHQIEILDLLFELNEKEDRTIVMVLHDLNLACRYAHHLVAIKDKRIYAEGRPEEVITCDLVQNVFS
MNCQVTQDPLFGTPLCIPHGRGRCIVQEAAFTSHG
>O32127 ~~~yutD~~~Putative antitoxin YutD~~~COG4470
MILIQNAEFELVHNFKDGFNEEAFKARYSDILNKYDYIVGDWGYGQLRLKGFFDDQNQKATFETKISTLDEYIYEYCNFG
CAYFVLKRIRK
>O32126 3.1.-.-~~~yutE~~~Putative RNase YutE~~~COG2445
MYFVDRSKIEKTLGFFEHQLALFDSQTDWQSEIGELALQRIGHLLIECILDTGNDMIDGFIMRDPGSYDDIMDILVDEKV
VTEKEGDELKKLIAYRKTLVQQYLLADSGELYRLIKAHQTALQDFPKRIRSYLETELGPVSAFK
>O32125 3.1.3.5~~~yutF~~~5'-nucleotidase YutF~~~COG0647
MKTYKGYLIDLDGTMYNGTEKIEEACEFVRTLKDRGVPYLFVTNNSSRTPKQVADKLVSFDIPATEEQVFTTSMATAQHI
AQQKKDASVYVIGEEGIRQAIEENGLTFGGENADFVVVGIDRSITYEKFAVGCLAIRNGARFISTNGDIAIPTERGLLPG
NGSLTSVLTVSTGVQPVFIGKPESIIMEQAMRVLGTDVSETLMVGDNYATDIMAGINAGMDTLLVHTGVTKREHMTDDME
KPTHAIDSLTEWIPYI
>O32234 3.-.-.-~~~yvaM~~~AB hydrolase superfamily protein YvaM~~~COG0596
MPLISIDSRKHLFYEEYGQGIPIIFIHPPGMGRKVFYYQRLLSKHFRVIFPDLSGHGDSDHIDQPASISYYANEIAQFMD
ALHIDKAVLFGYSAGGLIAQHIGFTRPDKVSHLILSGAYPAVHNVIGQKLHKLGMYLLEKNPGLLMKILAGSHTKDRQLR
SILTDHMKKADQAHWHQYYLDSLGYNCIEQLPRLEMPMLFMYGGLRDWTFTNAGYYRRSCRHAEFFRLEYQGHQLPTKQW
KTCNELVTGFVLTHHS
>O32248 2.3.1.-~~~yvbK~~~Uncharacterized N-acetyltransferase YvbK~~~COG0456
MNMQKLRIELGEETNDELYDLLLLADPSKDIVDEYLERGECYTAWAGDELAGVYVLLKTRPQTVEIVNIAVKESLQKKGF
GKQLVLDAIEKAKKLGADTIEIGTGNSSIHQLSLYQKCGFRIQAIDHDFFLRHYDEDIFENGIQCRDMVRLYLDL
>O32257 ~~~yvbW~~~Uncharacterized amino acid permease YvbW~~~COG1113
MKNDNQTLKRTMTSRHIMMMALGGAIGAGLFKGSSSAIDVAGPSVIIAYLLGGIILLFIMQGLAEMAVRNRNARTFRDLV
QQVLGNYAAYFLDWIYWKMWVLNIAAEAVVAAIFIQYWLPGCPIWVLALGISLIVTIVNLLSVKIFAETEYWLAMIKITV
IIIFIILGLLLLFVSFGDHTASGFSNLTDHGGFFPHGGTGLITAMLVVIYSYGGTEIIGVTLAETKNPEKVVPKAVRSTL
TRIVAFYLLPFFIIVSLIPWNQVNSVPESPFVMVFKMVGIPGADHIMNAVILLAIISSMNSGLYGSSRILYTQASDGRLP
KVFSKLSSKNVPMFAILMCTSSLYIGVLISLFAGSQTFNYLMGSLGYTVLFIWLIIGFAHLKSRKQQTETPAYYVKWFPY
TTWFAIVALLAILIGVIMTTSIVITGITAAIYLLITVAYLVKGRKHQ
>O06965 ~~~yvcA~~~Putative lipoprotein YvcA~~~
MKKIIFICFSLLLALTGGCSMNDNDKNSTNDNKTEAVKPKDMDPKDLPQVPAFQDEKTREYMVSTKEEEPGYYLLESKLK
GFRMLFPEDGKYLSRRSSLTGKNKESIGFNSYDKDTNVMFDGHVTYYKEESFANEPKTMLDIVSGKNDYKGEYKKSSKKK
TDIYTAKKKDIFDDIDRKYNYSYSYFGYVKSTEEDNLGVEYAFTLGCKNENQPCSLDEEKAKNKVEKLINSITFLIDKKE
K
>O06973 ~~~yvcJ~~~Nucleotide-binding protein YvcJ~~~COG1660
MSVSESHDIQLVIITGMSGAGKTVAIQSFEDLGYFCVDNLPPSLLPKFLELMKESNSKMSKVALVMDLRGREFFDRLIEA
LDEMAENPWITPRILFLDAKDSILVTRYKETRRSHPLAATGLPLEGIALERELLEELKGRSQIIYDTSDMKPRDLREKIV
KHFATNQGETFTVNVMSFGFKYGIPIDADLVFDVRFLPNPYYIESMRPLTGKDKEVSSYVMKWNETQKFNEKLIDLLSFM
LPSYKREGKSQVVIAIGCTGGQHRSVTLAENLADYFKKDYYTHVTHRDIEKRSRK
>O06986 ~~~yvdD~~~LOG family protein YvdD~~~COG1611
MKTICVFAGSNPGGNEAYKRKAAELGVYMAEQGIGLVYGGSRVGLMGTIADAIMENGGTAIGVMPSGLFSGEVVHQNLTE
LIEVNGMHERKAKMSELADGFISMPGGFGTYEELFEVLCWAQIGIHQKPIGLYNVNGYFEPMMKMVKYSIQEGFSNESHL
KLIHSSSRPDELIEQMQNYSYPILEKKWTEI
>O06997 1.21.-.-~~~yvdP~~~Uncharacterized FAD-linked oxidoreductase YvdP~~~COG0277
MGSTQLTGRVIFKGDPGYTEAIKNWNPYVDVYPLVFVFAQNSYDVSNAIKWARENKVPLRVRSGRHALDKNLSVVSGGIV
IDVSDMNKVFLDEENAIATVQTGIPVGPLVKGLARDGFMAPFGDSPTVGIGGITMGGGFGVLSRSIGLISDNLLALKTVD
AKGRIIHADQSHNEDLLWASRGGGGGNFGYNTQYTFKVHRAPKTATVFNIIWPWEQLETVFKAWQKWAPFVDERLGCYLE
IYSKINGLCHAEGIFLGSKTELIRLLKPLLHAGTPTEADIKTLYYPDAIDFLDPDEPIPGRNDQSVKFSSAWGHDFWSDE
PISIMRKFLEDATGTEANFFFINWGGAISRVPKDETAFFWRHPLFYTEWTASWKNKSQEDSNLASVERVRQLMQPYVAGS
YVNVPDQNIENFGKEYYGANFARLREIKAKYDPENVFRFPQSIPPSR
>O07001 ~~~yvdT~~~Uncharacterized HTH-type transcriptional regulator YvdT~~~COG1309
MPKQTSGKYEKILQAAIEVISEKGLDKASISDIVKKAGTAQGTFYLYFSSKNALIPAIAENLLTHTLDQIKGRLHGDEDF
WTVLDILIDETFLITERHKDIIVLCYSGLAIDHSMEKWETIYQPYYSWLEKIINKAIANHEVTEGINSKWTARTIINLVE
NTAERFYIGFEQDENVEVYKKEIFTFLKRSLGTA
>P71051 2.7.10.2~~~yveL~~~Putative tyrosine-protein kinase YveL~~~COG0489
MIFRKKKARRGLAQISVLHNKSVVAEQYRTIRTNIEFSSVQTNLRSILVTSSVPGEGKSFSAANLAAVFAQQQEKKVLLV
DADLRKPTINQTFQVDNVTGLTNVLVGNASLSETVQKTPIDNLYVLTSGPTPPNPAELLSSKAMGDLISEIYEQFSLVIF
DSPPLLAVADAQILANQTDGSVLVVLSGKTKTDTVLKAKDALEQSNAKLLGALLNKKKMKKSEHYSY
>P71066 ~~~yvfG~~~Uncharacterized protein YvfG~~~
MSELFSVPYFIENLKQHIEMNQSEDKIHAMNSYYRSVVSTLVQDQLTKNAVVLKRIQHLDEAYNKVKRGESK
>O32211 ~~~yvgO~~~Stress response protein YvgO~~~
MKRIRIPMTLALGAALTIAPLSFASAEENPAPKMSQTTTAGTTAADVGLNVNLDVLGIANQIADAIKSAQNRDGFVKNLM
ESSFYASGQKYNVMVFNLSQEYEDHLNGVQFYGSAVYDGITYGIWVFEDGTFTNKGDGGWINWAFRGWFDRDGSTVAFHR
P
>P96502 ~~~yviE~~~Uncharacterized protein YviE~~~
MQIPRLIMHSVQGKIGLTTTPASLKMEQPQADLEIEQPSAEMEISVTPGKLTIDQTQAWEELDRKHVFKRIEEAAQQGHE
DVMEGIARTAEEGDELMKIENKGNPIASQARRNSEMHQIQLGENYAPSLSRVKIQYTPSQLDVQITPRKPVIQAEPHKPI
VEYTPGNVKVDMLQYPDLNIDVEYPKESPEK
>C0H3R5 ~~~yvrJ~~~Uncharacterized protein YvrJ~~~
MTVKDFDSPSNRLPTKLNEGKQKKVNYYFPPSVFQNTLQQFTFYRFSSLTYMTAEKEGRYFPMDQVFIEEVVKQIGNLGF
PALIAMYLLTRFEKKFDQLIELMTELKDHAKK
>O34686 ~~~yvrL~~~Membrane-bound negative regulator YvrL~~~
MKHQNPSKRLLRLSIKYLLAAAAVVLTYFAVIYILFSLAGTSYRSAAHVLLFAVVFLVLGLCFEPFERLMIHSFTFFKTG
KRLFILLAGIVQLLFLWMTAHTTDQLISDIWLSTTEEMIVAAVFLILDKCNSALPS
>P39737 ~~~yvyC~~~Uncharacterized protein YvyC~~~COG1334
MNIERLTTLQPVWDRYDTQIHNQKDNDNEVPVHQVSYTNLAEMVGEMNKLLEPSQVHLKFELHDKLNEYYVKVIEDSTNE
VIREIPPKRWLDFYAAMTEFLGLFVDEKK
>P39808 ~~~yvyG~~~Uncharacterized protein YvyG~~~COG3418
MSAKAIIEQLKRLCVLHEHLLTLSEEKTEALKAGKTKELSNILTKEQKYIQAITQTEDDRIKTTSAFLGYSENNTISACI
AKTSGSEKEELEQLYESLSQVLGRLKKVNEMNRQLTRDALQFISISYDMLVPKENNFNYSKSIKAELPKSSKMKLFDSKA
>O34366 ~~~yvzB~~~Putative flagellin YvzB~~~COG1344
MDALIEEVDGISNRTEFNGKKLLDGTETDGFTFQIGANAGQQLTVNIDSMSSTALGVNALNVTDFANTPFDTQLESIDTA
INNVSNQRAKLGAVQNRLEHTINNLGASSENLTAAESRIRDVDMAKEMSEFTKNNIPSQASQAMLAQANQQPQNVLQLLR
>P39583 2.7.6.5~~~ywaC~~~GTP pyrophosphokinase YwaC~~~COG2357
MDLSVTHMDDLKTVMEDWKNELLVYKFALDALDTKFSIISQEYNLIHGHNPIEHTKSRVKSFESIVNKLMRKGCEITTKE
MKEHIHDIAGVRIICSFISDIYNVVNVLKQHEDLRIVKVKDYIQTPKPNGYRSLHLIIEMPVNLTNRVEYVKAEIQIRTI
AMDFWASLEHKIYYKLNNDVPKQLTDELKEAAEIAHYLDEKMLGIKKEVD
>P39603 ~~~ywcE~~~Spore morphogenesis and germination protein YwcE~~~
MMDMFFAYLLVASATPLFIWLDNKKVALSAIPPIILMWVFFFFYATESLSPLGHTLMIILFAVNVIVAHIAAFIIYGLPY
LRRKRSS
>P39617 ~~~ywdI~~~Uncharacterized protein YwdI~~~
MNIHISALIQKMEEELKKAKTAERDEELKRYVAVVRSLCDVVLDQPENASAPRIQPSVTPSPAAPPSTDQLMMEKMMGSA
GLNKYRKQEKEKQEEDGNGESLFDF
>P39618 ~~~ywdJ~~~Putative purine permease YwdJ~~~COG2233
MKLVLGALQWTAFIIAAAIVVPVAVAQSFHLDHSDSARLIQSTFFVLGIAAVIQCLKGHRLPINESPAGLWWGVYTIYAG
LTGTVFATYGDTLRGLQGALLVSAVCFFLLSVFKVIDRLAKLFTPVVTGVYLLLLVMQLSQPIIKGILGIGYRQDGVDGL
VFGLALVVIAAAFIMTNSNIMFFKQYSILLALFGGWVLFAAAGAAKPIEMPDRLFQLPSLFPFGTPLFNSGLIITSIFIT
ILLIVNMLASMKVVDIAMKKFSKQPDGKHHERHAGFAASFSHLLSGLTGAIAPVPISGAAGFIETTKMPSKKPFMLGSIL
VIVISVIPFFMNTFASLPSPVGFAVNFVVFSAMGGLAFAEFDSYEKEESKRVRSIIGISLLTGVGIMFVPETALKGLHPV
FISLLSNGLVLGTLAAIAADQFQLWRRRKSDNLVSTENKH
>P39619 ~~~ywdK~~~UPF0382 membrane protein YwdK~~~COG2363
MKVFIILGAINALLAVGLGAFGAHGLEGKIPDKYLQVWHTGVQYHMYHALGLFVVAFLADKLSGIGSVTTAGWLMFAGIV
LFSGSLYILSVTQISILGAITPLGGVAFIISWIMIVVAAVKYL
>P71003 ~~~ywhK~~~Uncharacterized protein YwhK~~~COG3391
MRKNKLSFKEKAGAEQEECLCFCEGEASREAMFQAPIELPEGFVVDPAEAVANVTWNADSLSCVSDRCLIQTGPEPDEVG
VQFAVRLQGTVTLLVSVSPVRNQYGQGDGAVSVIHNAEIDQVVYYSDESDDCPDISDITIENLVVVPPFYGSPLSVTGTI
VLVPEPEPVYAFTANPNDQSVSVIDTNTDTVVTTIALPYNPAGIEITPDKSAVFVLHPNNNVISVIDYDTLTVTATILLD
QPPRLIRFIPNHEFAYVFTGTAVYVIGIDTLTVDRSIPVEGYDVAIDPNGLFAYVLNFGIVQKVDLTTGEVTGTIERELI
VSTIETNWPERYAYVLEQEFFFNYLTVIDLNTFTISSTQELEYEGEYRMFTSGAEVYLYDGFTGNLYSVSPNGAGVIGNV
PQSATDYAFTPNGDFLYATRFIEQSIIVYNTDDYSEETVISLGVSPGAITI
>O07624 ~~~ywiB~~~Uncharacterized beta-barrel protein YwiB~~~COG4506
MKQETPITLHVKSVIEDDGNQEVIEFRTTGFYYVKQNKVYLSYYEEHDLGKVKTIVKVSEGEVLVMRSGAVKMNQRFVTG
ASTIAKYKMSFGELELKTSTKSIQSDLDEEKGRISIAYDMHVGDEQEHLHNMTITYEGGTHA
>P45861 ~~~ywjA~~~Uncharacterized ABC transporter ATP-binding protein YwjA~~~COG1132
MLRQFFSYYKPYKTLFFLDFFSAIAGGLMELSFPLIVNYFIDTLLPGRDWGLIIATSIGLFAVYALSSALQYIVTYWGHM
LGINIETDMRKSLFDHLQKLSFKFYDNNKTGTLMSKLTNDLMYIGEVAHHGPEDLFIAVMTILGAFGVMLFINWQLALLT
FIIMPIVIWLALYFNKKMTKAFTTLNKDIGDFSARVENNIGGIRLVQAFGNEAFEKERFAVNNQRFRVTKLSSYKIMAKN
GSISYMLTRFVTLFVLLCGTWFVIRGSLSYGEFVAFVLLTNVLFRPIDKINAIIEMYPRGIAGFKSYMELMETEPDIQDS
PDSKDVSGLKGNIRYKHVSFGYDDHHNVLNDINLSIQAGETVAFVGPSGAGKSTLCSLLPRFYEASEGDITIDGISIKDM
TLSSLRGQIGVVQQDVFLFSGTLRENIAYGRLGASEEDIWQAVKQAHLEELVHNMPDGLDTMIGERGVKLSGGQKQRLSI
ARMFLKNPSILILDEATSALDTETEAAIQKALQELSEGRTTLVIAHRLATIKDADRIVVVTNNGIEEQGRHQDLIEAGGL
YSRLHQAQFGQMVHR
>P39156 5.3.1.-~~~ywlF~~~Putative sugar phosphate isomerase YwlF~~~COG0698
MKVAIASDHGGVHIRNEIKELMDELQIEYIDMGCDCGSGSVDYPDYAFPVAEKVVSGEVDRGILICGTGIGMSISANKVK
GIRCALAHDTFSAKATREHNDTNILAMGERVIGPGLAREIAKIWLTTEFTGGRHQTRIGKISDYEEKNL
>P39157 ~~~ywlG~~~UPF0340 protein YwlG~~~COG4475
MNELKQTWKTMLSEFQDQAELKQDQLFVLGCSTSEVAGSRIGTSGSVDIAESIYSGLAELREKTGIHLAFQCCEHLNRAL
VVEAETAKLFRLPTVSAVPVPKAGGAMASYAFKQMKSPVLVETIQADAGIDIGDTFIGMHLKPVAVPVRVSQNSLGSAHV
TLARTRPKLIGGVRAVYECE
>O32277 ~~~ywmB~~~Uncharacterized protein YwmB~~~
MKKKQVSHAIIISVMLSFVIAVFHTIHASELTPLAQMAEGMERQDVSIDKWTLHAKQNLSLTEKEFYQKVQRLKQEYRQY
DWVIAREDKMIKAIGTYTDKKNRTSFRLQLVTTLKKHNPTSYLLYEQMSLETPDSWNDTYEQFERETLGIFQEKVVIFTC
LNGHLDDNMNIVLQKKANQLLNEFQARSVEHVVEPNFVSISAFTDEWEEYIMTSKHKMNLQIALRSAGMGGKHTVTVGTP
IVTTEY
>P71036 ~~~ywnA~~~Putative HTH-type transcriptional regulator YwnA~~~COG1959
MINSRLAVAIHILSLISMDEKTSSEIIADSVNTNPVVVRRMISLLKKADILTSRAGVPGASLKKDPADISLLEVYRAVQK
QEELFAVHENPNPKCPVGKKIQNALDETFESVQRAMENELASKSLKDVMNHLF
>P71043 2.3.1.183~~~ywnH~~~Putative phosphinothricin acetyltransferase YwnH~~~COG1247
MTLRLAEHRDLEAVVAIYNSTIASRMVTADTEPVTPEDRMEWFSGHTESRPLYVAEDENGNVAAWISFETFYGRPAYNKT
AEVSIYIDEACRGKGVGSYLLQEALRIAPNLGIRSLMAFIFGHNKPSLKLFEKHGFAEWGLFPGIAEMDGKRYDLKILGR
ELS
>P71045 ~~~ywnJ~~~Uncharacterized membrane protein YwnJ~~~
MNRLLLAGWIFFILLSVCTESFSGMVVSQTVAFHFQPHPDLSQFLVMDFTELTVPEAFIQKIGHAFSFFVLTYLLWKQRG
SIRSAAAGSFAFAFFTEVLQLFFSRNGCIRDVLIDAVGIGLFYGLYVLAKRRKQEMYEKY
>P94587 3.4.22.-~~~ywpE~~~Putative sortase YwpE~~~COG3764
MRRDQKMGEGNYPLAGHHLKQKNLLFGPLENIKTGAQIVITDFKKDYIYSVTSKDIISEMDADVVEETNKKEITLITCDK
AVKTEGRLVVKGELVDSFGHTN
>P94589 ~~~ywpG~~~Protein YwpG~~~
MNQFRLKEIYIDGVPSESHLIQKEVTYMLSRKEVFEVHLNKKGRISFLYETQDGMEQYKIKLSMPEKKLSFQWFAWDGSS
YVRMNTQNWLTKQIFFRFLKSTYFFKGKKQKMFLAEGKMKTKDKNRG
>P94592 3.1.3.-~~~ywpJ~~~Phosphatase YwpJ~~~COG0561
MKLIAIDLDGTLLNSKHQVSLENENALRQAQRDGIEVVVSTGRAHFDVMSIFEPLGIKTWVISANGAVIHDPEGRLYHHE
TIDKKRAYDILSWLESENYYYEVFTGSAIYTPQNGRELLDVELDRFRSANPEADLSVLKQAAEVQYSQSGFAYINSFQEL
FEADEPIDFYNILGFSFFKEKLEAGWKRYEHAEDLTLVSSAEHNFELSSRKASKGQALKRLAKQLNIPLEETAAVGDSLN
DKSMLEAAGKGVAMGNAREDIKSIADAVTLTNDEHGVAHMMKHLL
>P94593 3.6.4.-~~~ywqA~~~Uncharacterized ATP-dependent helicase YwqA~~~COG0553
MASLKEILIHVEQMEDGSFTLSAFDENEQPLPYSHMKKHLFQWHESSFYGTFLEDVSFIGTTAVLLSPWMTVELLGKNSF
NSFSSVQLTEETEPLIEAASTIYEFIADGDFMPDYDAWTNGVFRWKDRDNILEGFTAEWFSAAVQDYIQYDDDLREKWEH
IKEKSPAVTTFRGHFLDEEDFLEGIGWIDDQSPFTVGLRLNEPDFDGDEWKIEMFLRDKKSGAVEFFDGLKSLKKSWQAY
SDKIAREQDRFHRTVPWLSFDSGTTLISEEEAWIFLSEASETLVDMGVEILLPSWWQIVRDSNMMLKAKVSSSPRGESFV
GMNALLDFNWRFATNGIELTEAEFNELVASNRRLVNIRGQWVKIDPQFIKQMKRLMEKAESEGLHMSDILARELMDQQDG
GLEDSDLIDTSAFAGIQFDLSKQLRSLIRKLTAAENLPEHKVSPSFKGTLRPYQKYGMNWLLFLRESGFGACLADDMGLG
KTIQMIAYFLHVKESGRQKTPHLIIAPTSVLGNWQRELQTFAPDLSVALHYGPRRPKGDDFAAHYENADVVLTSYGLSHA
DTEELSSVTWNTICLDEAQNIKNAHTKQSRAIRKLKGLHHIALSGTPMENRLTELWSIFDFMNKGYLGSLTGFHKRYVLP
IEKDRDEKRIGQLQQLIRPFLLRRTKRDEEVALNLPEKLEEKEFIPLSAEQASLYEQLVKDTFDHMTSLTGMQRKALILS
MLGRLKQICDHPALYLKEEQTELLAGRSVKLEKLLELMTAIRAQNESCLIFTQYIQMGNMMKRLLEKTFGEPVQFLNGSL
SKQERDTLVEKFQRKEYPTLILSLKAGGTGLNLTAANHVIHYDRWWNPAVENQATDRAYRIGQERFVHVHKMITTGTIEE
KIDVMLESKQTLNDQIIQSENWITELSTQELEELFTLSATAQ
>P96715 ~~~ywqC~~~Probable capsular polysaccharide biosynthesis protein YwqC~~~COG3944
MGESTSLKEILSTLTKRILLIMIVTAAATAAGGLISFFALTPIYENSTQILVNQSKNERKEVQFNDVQTNLQLINTYNVI
IKSPAILDEVIKEMGLSMTSQELNDKITVSSEQDSQVVNISVRDENAETAAHIANTIASVFQDKITSIMNVDNVSILSKA
EVSEHPSPVSPKPLLNIAIAFAAGLAGSIGLAFLLEHLDNTIKSEEQLESLLDIPVLGTVSTIANEQKTAKTLQGFQSEK
TGSGHFGA
>P96716 2.7.10.2~~~ywqD~~~Tyrosine-protein kinase YwqD~~~COG0489
MALRKNRGSRMQRNVIAMTEPKSLNSEQYRTIRTNIEFASVDRQMKSVMITSACPGEGKSTTAANLAVVFAQQGKKVLLI
DADLRKPTVHTAFFLENTVGLTSVLLKKSSMEQAVQASNEKHLDVLTSGPIPPNPAELLSSKWMKELAYEACAAYDMVIF
DTPPILAVADAQILGNVADGSVLVISSGKTEKEQAAKAKEALATCKSKLLGAIMNGKKLSKHSEYGYYGTKDNFMQK
>P96717 3.1.3.48~~~ywqE~~~Tyrosine-protein phosphatase YwqE~~~COG4464
MIDIHCHILPAMDDGAGDSADSIEMARAAVRQGIRTIIATPHHNNGVYKNEPAAVREAADQLNKRLIKEDIPLHVLPGQE
IRIYGEVEQDLAKRQLLSLNDTKYILIEFPFDHVPRYAEQLFYDLQLKGYIPVIAHPERNREIRENPSLLYHLVEKGAAS
QITSGSLAGIFGKQLKAFSLRLVEANLIHFVASDAHNVKTRNFHTQEALYVLEKEFGSELPYMLTENAELLLRNQTIFRQ
PPQPVKRRKLFGFF
>P96718 1.1.1.22~~~ywqF~~~UDP-glucose 6-dehydrogenase YwqF~~~COG1004
MNITVIGTGYVGLVTGVSLSEIGHHVTCIDIDAHKIDEMRKGISPIFEPGLEELMRKNTADGRLNFETSYEKGLAQADII
FIAVGTPQKSDGHANLEQITDAAKRIAKHVKRDTVVVTKSTVPVGTNDLINGLITEHLAEPVSISVASNPEFLREGSAIY
DTFHGDRIVIGTADEKTANTLEELFRPFQIPIYQTDIRSAEMIKYASNAFLATKISFINEISNICEKVGADIEAVAYGMG
QDKRIGSQFLKAGIGYGGSCFPKDTNALVQIAGNVEHDFELLKSVIKVNNNQQAMLVDKALNRLGGVTGKTIALLGLSFK
PNTDDMREAPSIVIADRLAALDARIRAYDPIAVSHAKHVLPQAVEYKETIEEAVKGSDAVMILTDWADIKQFPLAAYQDL
METPLIFDGRNCYTLDEALAAGVEYYSVGRKAVVPSGAIQ
>P96719 ~~~ywqG~~~Uncharacterized protein YwqG~~~COG3878
MNHLPEKMRPYRDLLEKSAKEYVKLNVRKGKTGRYDSKIAGDPYFPKHETYPTDENGQPMKLLAQINFSHIPQLDGYPSS
GILQFYISVHDDVYGLNFDDRCEQKNFRVIYFENIVENDDELVSDFSFIGTGECDFPILSEAAVEPVKSSEWVLPTDFQF
EQYTGMETMEFFGQFGEDEEDIYNELAENGFGHKIGGYASFTQHDPREYAYKEHTIMLLQIDSDDDIDSMWGDVGIANFF
ITPEDLRKKDFSNVLYNWDCS
>P96722 ~~~ywqJ~~~Toxin YwqJ~~~COG5444
MSKVFESKSLIEEAKSRKKQYETLEEQLNTLKKAFQGVADLGDNFKGNGADNIKDFFQGQAEIVDSWLTLVSAQIAFLNG
ISGDIKDQELNDSYVETSFLDHELPNGDLKASEIVSAHKEEIDSILSGISDIIDLDMYTLDDYADKMGDAQKIRRDTITA
VDKLDESLTTEYQNLESLDNAVLTKYSVLMQATSNGKSASPMYYDKKAFHSNEVYKSVIEVENQGTTYIDAKTQQAEARR
LQEKAEEEANKPWYEKTWDGVCNFTGEVTGYYDYKRATEGVDPVTGEKLSTAERVTAGAMAAAGFIPVVGWAGRAFKGGK
AIYKTGKAAIAAEHALDAYKTGKSLDILKMTEMGAYGLVASNGFSEAVTGRDMFGNKVSEEKRKQGALEAITIIGGAGLA
HYFDRLYQKNAPYVNKVSNESLISNIAKTTEEKQTRLQYLRNKHGVLSKEDLHHRINLRAEVLNELSRIKSSGLTKKQRG
PAVAGVLDKKTGNYYFGINNIDGKPPKVLHPLIHDRIVNMPTELKEGYIKTSGAGSHAEVNALNEALLQRPDADLKDLMV
YVVSARKINKKMPEGVPMPRCPHCEYITQNTNYIPEALKYGK
>P96723 ~~~ywqK~~~Immunity protein YwqK~~~COG2849
MENEYDMKSIKEKGVDFEDLWFSSVSDEILDNPEDENGQPFTGLAYELYPNGQIIYFTKYKNGLAHGLTCEFYENGNKKS
EKEYRYGQLHGISIIWFENGRKKSEQQYEHSILISEKNWDEEGNLLNKYEIDTDSPHFEILESRRETHINLGRE
>P96726 1.-.-.-~~~ywqN~~~Putative NAD(P)H-dependent FMN-containing oxidoreductase YwqN~~~COG0655
MKIAVINGGTRSGGNTDVLAEKAVQGFDAEHIYLQKYPIQPIEDLRHAQGGFRPVQDDYDSIIERILQCHILIFATPIYW
FGMSGTLKLFIDRWSQTLRDPRFPDFKQQMSVKQAYVIAVGGDNPKIKGLPLIQQFEHIFHFMGMSFKGYVLGEGNRPGD
ILRDHQALSAASRLLKRSDAI
>O05218 2.3.2.2~~~ywrD~~~Glutathione hydrolase-like YwrD proenzyme~~~COG0405
MNKSVIGTKQMVVSPHYLASQAGNRILDKGGNAFDAAVAVSACLAVVYPHMTGLGGDSFWLTFHQETKAVKVYNGSGRSG
KNVTRDVYKGKSAIPLRGIDSAITVPGMVDSWDAVLKEYGRLSLADVLEPARDYAQNGFPVSADQCRHTEKNIELLASTP
YTADIFTRRGKAPVPGERFVQKELADSLNLIAEKGRSAFYEGDLAQRIVSHLQNNGSYMTIDDFKAHRGEWAAPVSSDYR
GYSVYQAPPNSQGFTGLLTLNILENYDFTQIEHGSFEYYHVLVEALKKSFLDRDAVLTDPAFADIPLERLLDKRYAKQLA
EEIGYLAIPAESRPVGSDTAYAAVIDADGNAVSFIQSLYFEFGSAVTAGDTGILLQNRGSFFSLDENHVNTLEPRKRTFH
TLMPAMVCKGGKPKILYGTQGGEGQPQTQTAIITRMLDYGMHPQQAISEPRWVWGRTWGEEYEGLRVEGRFTDKTIQKLK
DSGHLVEVVGDYDPLMGQAAAIKVDEEGFLQGGADPRGDGAAVGI
>P96729 ~~~ywsB~~~Cell wall-binding protein YwsB~~~COG3103
MNKPTKLFSTLALAAGMTAAAAGGAGTIHAQQPETTVSIDDLYSYPIDSYLVSAEALNVRTKPSASSQKADTLHLGDSLK
LISFSNADWAKVKYKNGKTGFVSTHYIVKAATTVKTKTKTKVYTSADGKSIKTLPADTSVSFLGWSKTNKGGFDFDWVFV
DYGGTTGYMKTKDLHMTK
>P96741 3.1.3.104~~~ywtE~~~5-amino-6-(5-phospho-D-ribitylamino)uracil phosphatase YwtE~~~COG0561
MKCIAIDLDGTLLNKESVISAENREAIKRAVDAGILVTICTGRATFDVKALLDDLDIPIIAANGGTIHDTGYRLISRTLM
DQEAGKAIADYLLSKNIYFEVYTDDHLLSPFDGEAKLHAELDILKSANPNEQTDDLWQGAMTQFKQFGIKPIPHIESVFD
GGENIYKLLCFSFDMDKLKQAKEELKHHKKLAQTSSGKHIIEILPASSGKGRALTKLADIYGIETQDIYAIGDSPNDLSM
FEVAGHRIAMENAIDELKEKSTFVTKSNDENGVAYFIDQLLSGQYA
>P42105 ~~~yxaF~~~Uncharacterized HTH-type transcriptional regulator YxaF~~~COG1309
MTSRGDSREKILHTASRLFQLQGYHATGLNQIVKESGAPKGSLYHFFPNGKEELAIEAVTYTGKIVEHLIQQSMDESSDP
VEAIQLFIKKTASQFDNTESIKGIPVGLLASETALISEPLRTVCMKVFKSWEAVFARKLMENGFAEEEANQLGTLINSMI
EGGIMLSLTNKDKTPLLLIAEQIPVLVRKKG
>P42111 ~~~yxaL~~~Uncharacterized protein YxaL~~~COG1520
MVKSFRMKALIAGAAVAAAVSAGAVSDVPAAKVLQPTAAYAAETVFSQNNGASGFLPGRYDVQAMAPAMFNWSRESRFAG
NTDGTLKWQNDIRTTPQNGAGAVIDGDGTVYLHSRDGEMKAFNPDGSVKWVTGNLGKTYTQSPVLGTNGVIYLASYDKKI
YFIDKETGEILTTVPLSGGPSSETVIGSDGTLYFSTLDNYVHAIKPTSKSTWTERWKLKTNGVVSSVPVLAKNGTVYVGT
YNYVFYAINSGTGQVKWSRTTSNAFKGYPVIDKDGNIYAGNQDGQLYAYTSTGSLKWTFPLNGFSSSSPAIDHNGNIYIG
SGSGELFSISKNGDMNWSFYTDGPVRTAPLIDAKGTVYFGSDDMKVYAADANGNELWSYQTDSNVVSSPQLAEDGTLYIG
SYTKLMAFGK
>P46327 ~~~yxbC~~~Uncharacterized protein YxbC~~~COG2850
MSAVTESVLESIISPVTMSEFLEEYWPVKPLVARGEVERFTSIPGFEKVRTLENVLAIYNNPVMVVGDAVIEESEGITDR
FLVSPAEALEWYEKGAALEFDFTDLFIPQVRRWIEKLKAELRLPAGTSSKAIVYAAKNGGGFKAHFDAYTNLIFQIQGEK
TWKLAKNENVSNPMQHYDLSEAPYYPDDLQSYWKGDPPKEDLPDAEIVNLTPGTMLYLPRGLWHSTKSDQATLALNITFG
QPAWLDLMLAALRKKLISDNRFRELAVNHQSLHESSKSELNGYLESLIQTLSENAETLTPEQIFQSQDSDFDPYQSTQLV
FRQLLTSYKF
>P42423 ~~~yxdL~~~ABC transporter ATP-binding protein YxdL~~~COG1136
MANMLEVKHINKTYKGQVSYQALKQISFSIEEGEFTAVMGPSGSGKTTLLNIISTIDRPDSGDILINGENPHRLKRTKLA
HFRRKQLGFVFQDFNLLDTLTIGENIMLPLTLEKEAPSVMEEKLHGIAAKLGIENLLNKRTFEVSGGQRQRAAIARAVIH
KPSLILADEPTGNLDSKATKDVMETLQSLNRDDHVTALMVTHDPVSASYCRRVIFIKDGELFNEIYRGENRQVFYEQILD
VLSMLGGNANDLSSVRL
>P42424 ~~~yxdM~~~ABC transporter permease protein YxdM~~~COG0577
MTFLQFAYKNVTRNKRAYLAFFLSSAFSVLIFFTFAMFLFHPALKEGYLNNIAKKGLTAAEWMIFVFSFLFVLYSVNAFL
KSRNKEFGILLMQGITPGQLRKLITAENMIIGVMSIAAGIIGGFIFSKTFFTVGAYILEMDALPLYMPWKALGITACGFL
LLFFFLSQFTILFVRSNTVIKLIKGTGKVKPEPKPSVLLSLFGIACLCGGYGMVLKGNVHGAEPFIILLLTVIGTYFFFS
QSSIWILRALKKWKTFYLRGKNIIWVSDLVYRLKDNARLFFIVSIISAVAFTATGVLAMYKSTVGAEESAYEMEYLSYSN
NPKEQTHLKDIDHELKTHGFTYTKDKIDVSYVRYQEGETVPPVYMISESDAAKYFHVKVNGLKEDEAVYFPGTYDRNFKN
EAPDQLKLLNQKGELSDQKLSVKEVKKPLISLNAIIAVNDQTFDQLKSLGDKASLYGYSYDHWKDSLEISQSLQNEIYGN
YIDVHSDFASKAGTYYDTVQLPSLSLFIGLFIAIVFFVAAASFLYFRLFTDLDEDRERYRSLAKIGLSEREMAQSVTIQL
AILFFFPFVIAVMHTLFALRTLAVEGYSDVAGPLSLTIGGFFIFQLLFFLAVRSSYLKKMNK
>P54940 ~~~yxeA~~~Uncharacterized protein YxeA~~~COG5294
MKKAMAILAVLAAAAVICGLLFFHNDVTDRFNPFIHQQDVYVQIDRDGRHLSPGGTEYTLDGYNASGKKEEVTFFAGKEL
RKNAYLKVKAKGKYVETWEEVKFEDMPDSVQSKLK
>P54941 ~~~yxeB~~~Iron(3+)-hydroxamate-binding protein YxeB~~~COG0614
MKKNILLVGMLVLLLMFVSACSGTASKGSSSDSASEKTEMRTYKSPKGNVNIPAHPKRIVTDFYAGELLSVGANVVGSGS
WSFDNPFLKSKLKNVKDVGDPISVEKVMELQPDLIVVMNEENVDKLKKIAPTVVIPYNTAKNVEDTVSMFGDIAGAKDQA
KSFMADFNKKAEAAKKKIAGVIDKDATFGIYENTDKGEFWVFNDNGGRGGQAVYNALGLKAPEKIEQDVIKKGEMKQLSQ
EVIPEYAADYMFITDYNPKGESKTLDKLENSSIWKNLDAVKHNRVFINDFDSFYPYDPISVSKQVDIITDMLIKRAEENK
K
>P54944 ~~~yxeE~~~Uncharacterized protein YxeE~~~
MNPYQYYSPQLPQQEPYYSHYEYNPYPQQDVYDPYQMDRQPALERRIAALERQNEQQSRELTRLTNEDRRQNREIARIAE
QVNQLSQAVERHTRRLNRLNQRLRTVENRLNIPFTAGEGGF
>P54945 ~~~yxeF~~~Uncharacterized protein YxeF~~~
MVIPLRNKYGILFLIAVCIMVSGCQQQKEETPFYYGTWDEGRAPGPTDGVKSATVTFTEDEVVETEVMEGRGEVQLPFMA
YKVISQSTDGSIEIQYLGPYYPLKSTLKRGENGTLIWEQNGQRKTMTRIESKTGREEKDEKSKS
>P54948 ~~~yxeI~~~Uncharacterized protein YxeI~~~COG3049
MCTSLTLETADRKHVLARTMDFAFQLGTEVILYPRRYSWNSEADGRAHQTQYAFIGMGRKLGNILFADGINESGLSCAAL
YFPGYAEYEKTIREDTVHIVPHEFVTWVLSVCQSLEDVKEKIRSLTIVEKKLDLLDTVLPLHWILSDRTGRNLTIEPRAD
GLKVYDNQPGVMTNSPDFIWHVTNLQQYTGIRPKQLESKEMGGLALSAFGQGLGTVGLPGDYTPPSRFVRAVYLKEHLEP
AADETKGVTAAFQILANMTIPKGAVITEEDEIHYTQYTSVMCNETGNYYFHHYDNRQIQKVNLFHEDLDCLEPKVFSAKA
EESIHELN
>P54952 ~~~yxeM~~~Probable amino-acid-binding protein YxeM~~~COG0834
MKMKKWTVLVVAALLAVLSACGNGNSSSKEDDNVLHVGATGQSYPFAYKENGKLTGFDVEVMEAVAKKIDMKLDWKLLEF
SGLMGELQTGKLDTISNQVAVTDERKETYNFTKPYAYAGTQIVVKKDNTDIKSVDDLKGKTVAAVLGSNHAKNLESKDPD
KKINIKTYETQEGTLKDVAYGRVDAYVNSRTVLIAQIKKTGLPLKLAGDPIVYEQVAFPFAKDDAHDKLRKKVNKALDEL
RKDGTLKKLSEKYFNEDITVEQKH
>P54953 ~~~yxeN~~~Probable amino-acid permease protein YxeN~~~COG0765
MNTIDWEFMISAFPTLIQALPITLFMAIAAMIFAIIGGLILALITKNKIPVLHQLSKLYISFFRGVPTLVQLFLIYYGLP
QLFPEMSKMTALTAAIIGLSLKNAAYLAEIFRAALNSVDDGQLEACLSVGMTKFQAYRRIILPQAIRNAIPATGNTFIGL
LKETSLAFTLGVMEMFAQGKMYASGNLKYFETYLAVAIVYWVLTIIYSILQDLFERAMSKPYRT
>P54954 7.4.2.-~~~yxeO~~~Probable amino-acid import ATP-binding protein YxeO~~~COG1126
MITVKNIRKAFKDLVVLDGIDLEVKRGEVVAIIGPSGSGKSTLLRCLNLLERPDQGLIEIGEAKLNAEKFTRKEAHRLRQ
QTAMVFQNYNLFKNKTALQNITEALIVAQHKPRDEAKRIGMEILKQVGLEHKADSYPITMSGGQQQRIGIARALAVNPHA
ILLDEPTSALDPELVTGVLQVIKSIAEKQTTMIIVTHEMAFAKEVADQVIFMADGHIIEQGTPEELFDHPKNERTKRFIK
QVGEPAELI
>P54956 ~~~yxeQ~~~Uncharacterized protein YxeQ~~~COG2079
MSKQGLTAGLAEAVRTSQPEHSVDAIHEAKKGLLDFTAASFAGREDKGIQKLLRLIEDEGGRPLVPVIGQGKKAAPLQSA
MLNGFIAHALDFDDVHSDVRGHPSAVIVPALIASAARGHDERLLGAYVVGVEVMARLGESIGSRHYEKGWHNTGTLGAIA
AACAVGYAEELTQEELEKAIGFAATQSAGMRVQFGTEMKPLHAGLAAQAGLLAVKLAQSEFGGSRTALDGETGFFSLYGD
VEKAQHTLLNDWGAPWRIVQPGLWFKIYPFCSAAHHAADAVRQLISEETISAANTERIEVIFPPGGDAALTERSPKAGEE
GRFSVEYVIALALHGHGLTVEHFSSQPIPNGVQTTMGRIQRVYDNAIQPAPHAVPKGRFTIVRAYLSDGRICEARVDCPK
GAPGNELSEEDIIEKLTLTVPQEKARRIITAVEKADIKEFLAHIE
>P42296 ~~~yxiD~~~Toxin YxiD~~~COG5444
MKTLDVHALHEGIQHTIEKLDKQKQQLEKLEKSVEHLAGMKDALKGKGGDAIRTFYEECHKPFLLFFGIFIDEYKKVLKQ
TQHAISSVESNSHGMIAEAFLSHDARHGVKHAREVTEQLTDAVNRQTSAIDHIVSLPTVNDSFFRMETEQAERLISDTLN
KLFQFDGQQTQALEAAKSDFQTMKKYIDQLETMYTGPKIEITGYKSGSILKSQEEENINQIFGAINPQMKQADDSPMEMM
LKKLAENEKSKVDSVVKTGDSKKVSKNIIVINGKVYNTSEHREHIKTDFSNAEVKQVVYNDTLYNVYISGNDMKLEPVVS
LSDIKVDENGYVKILETAVELTGVYDLFKAATGRDPVSGEKVTGKDRVVASINSVPFAKIAKLEKLIDINKLINNGKKAK
KASEVKNVAKDKGKIANDVSGSANKINSDLIKKYARDIEQRTGRELPKNQIDKLKEALRNKEYKKMSPIETAKHRTKFDK
VKNKVIKEWEENTGQKWPVYKENVVSEKTGKIIRKKGDKYDAHHIIENTFGGEHEWWNMHPAKFPNEHQAGIHGTGSPAN
ELFKGGKKK
>P42304 3.1.-.-~~~yxiM~~~Uncharacterized esterase YxiM~~~COG2755
MKKWMAAVFVMMLMLCFGGIENVKAAEPKVYQFDFGSGSMEPGYIGVRASDRYDRSKGYGFQTPENMRDVAASGAGVKSD
AVQFLAYGTKSNNTFNVDLPNGLYEVKVTLGNTARASVAAEGVFQVINMTGDGAEDTFQIPVTDGQLNLLVTEGKAGTAF
TLSALKIKKLSDQPVTNRTIYVGGDSTVCNYYPLNSSKQAGWGQMLPHYIDKHTFQVRNMASGGQIARGFRNDGQLEAIL
KYIKPGDYFMLQLGINDTNPKHKESEAEFKEVMRDMIRQVKAKGADVILSTPQGRATDFTSEGIHSSVNRWYRASILALA
EEEKTYLIDLNVLSSAYFTSIGPERTLGLYMDGDTLHPNRAGADALARLAVQELKRQGIAGF
>P94356 ~~~yxkC~~~Uncharacterized protein YxkC~~~
MRHKIITFILAVVVIIIIGNMIGGGGGSEATSKTSSSSKAETEKTYNIGDTVKTEKTEVTVTKVEDRDTVGTQYVEKKAS
EGGTIVAVQYTIKNVSKKPISSFSIPTVKLVDADGTSYDSDIDASVNYATETKVDNSKILSDLNPNIKVTGVKAFEVDKE
AYAKGTWKLKFSNDVIVKIK
>P94371 ~~~yxlC~~~Uncharacterized protein YxlC~~~
MNKEKLSDHLKSEWKKIDQTANPSIPNQKELLHQLSQMKAEYRKKLLQEIILFVFCALMVVSAAILAFTQAPAVFIVLQV
CVLAVLPILIAAEKKRHLGECEVKRG
>P94372 ~~~yxlD~~~Negative regulatory protein YxlD~~~
MTQTEIIITVAACLIVLAQGIFLFIDAKKRNHMAWVWGIVGLIQAPMPLICYYFFVIRPDRKKRGIKQ
>P94373 ~~~yxlE~~~Negative regulatory protein YxlE~~~
MNISWEMILPLIVLQLALAVFALISCIKEERTNGPKWMWAAIIVCINIIGPILFFTVGRKQR
>P94374 7.-.-.-~~~yxlF~~~Uncharacterized ABC transporter ATP-binding protein YxlF~~~COG1131
MLSIESLCKSYRHHEAVKNVSFHVNENECVALLGPNGAGKTTTLQMLAGLLSPTSGTIKLLGEKKLDRRLIGYLPQYPAF
YSWMTANEFLTFAGRLSGLSKRKCQEKIGEMLEFVGLHEAAHKRIGGYSGGMKQRLGLAQALLHKPKFLILDEPVSALDP
TGRFEVLDMMRELKKHMAVLFSTHVLHDAEQVCDQVVIMKNGEISWKGELQELKQQQQTNVFTLSVKEKLEGWLEEKPYV
SAIVYKNPSQAVFELPDIHAGRSLLSDCIRKGLTVTRFEQKTESLEDVYLKVVHA
>P94375 ~~~yxlG~~~Uncharacterized transmembrane protein YxlG~~~COG1277
MKVMMALLQKEWLEGWKSGKLIWLPIAMMIVGLTQPLTIYYMPEIIAHGGNLPDGMKISFTMPSGSEVMVSTLSQFNTLG
MALVIFSVMGSVANERNQGVTALIMSRPVTAAHYIVSKWLIQSVIGIMSFAAGYGLAYYYVRLLFEDASFSRFAASLGLY
ALWVIFIVTAGLAGSTIFRSVGAAAACGIGLTAAVSFAVSLFPDGAKWLPAEICKQAEHILLHGERADFFGWSLTFSILC
IMLLAVFSVWRFRRYESY
>P40737 ~~~yxxD~~~Immunity protein YxxD~~~
MSYNFIKSNKENDFYPVNQSEIEEVEKNLNLKLPSELVNFYLEVGYGFIKGSEFNTNRILDPYSVRDFRLRINDFEFYPD
IEIYDEFENDKLIFFEGSESALMSIELNDNNKNPIYYYDIQIATSLTEFLRKIEENDQYYLDLLDDE
>P37508 ~~~yyaP~~~Uncharacterized protein YyaP~~~COG0262
MTNNLKQRRIILDLAVTLDGFIEGKNGEVDWCIMDPDMGFTDFLNQIDTILYGRKSFDLWGQYIPKNEDPDTEKELWKLV
HSKKKYVFSRTQNEIDNQAIFINDNILEEVNKLKKNPGKDIWLYGGASLITTFINLGLVDEFRLSIHPVVLGEGKPLFID
VKQRINLKMVNTRTFSSGVVQIVYHWNG
>P37496 ~~~yybH~~~Uncharacterized protein YybH~~~COG4319
MEQQLKDIISACDLAIQNEDFDTLMNYYSEDAVLVVKPGMIARGKEEIKKAFITIANYFNHHIVPTQGKMILLEAGDTVL
VLSQTLLDSDKKDSEYAMERRATYVFKKNAQGEWLCVIDNSYGTDLIGV
>P37486 ~~~yybR~~~Uncharacterized HTH-type transcriptional regulator YybR~~~COG1733
MSEKKNIYPNKEGCPVEFTLDVIGGKWKGILFYHMIDGKKRFNEFRRICPSITQRMLTLQLRELEADGIVHREVYHQVPP
KVEYSLTEFGRTLEPIVLQMKEWGESNRDVLESYRSNGLVKDQQK
>P37482 ~~~yycB~~~Uncharacterized transporter YycB~~~COG2807
MPHPHNKKIQSFWLITGIIFIAFNLRPAITSVGPVISSIRAELHMSNGAAGFLTALPLLSFAVLSPLAPKLGQRLGNERT
LWLGLVILLIGVLTRSTGYTAALFFGTALIGVGIAIGNVLLPSLIKHKYPEKPGIMISLYTTSMNIFAALASGVSVPLAT
QMNGGWKQAFLLWGGLALLALLIWIPQLRHRDTANQTMKLQSSSIWASKMAWYVTIFMGLQSFLFYSSIAWFPEILRSHG
IDTATAGWMVSLMQFASLPSTFLTPVLADRVKQQRGIVAALASVYLIGLCGLLAGGSHTLLAIWMIIIGIGQGSSISLAL
TLIGLRSENAQQAAALSGMSQSFGYLLAAVGPIFVGYLFDQTHSWTMPIVLLIAALIVMGAAGQGAGRDRYIFQSEKQRN
SA
>P37481 ~~~yycC~~~Uncharacterized protein YycC~~~COG3093
MRPLQISAETAQKLAESLNLPLEQIMHMPQHILLAKMAELQKEDKS
>P37479 ~~~yycE~~~Uncharacterized protein YycE~~~COG0346
MGKRFSSFQAAQIRIARPTGQLDEIIRFYEEGLCLKRIGEFSQHNGYDGVMFGLPHADYHLEFTQYEGGSTAPVPHPDSL
LVFYVPNAVELAAITSKLKHMGYQEVESENPYWSNGGVTIEDPDGWRIVFMNSKGISGK
>Q794W0 ~~~yycH~~~Two-component system WalR/WalK regulatory protein YycH~~~COG4863
MKRENIKTILLTVLVVISLVFTWGIWTFQPNFSEGSSSTESTVRVKHKIEKTTQKLSETVRPRDMFIHDDGAHYKVDDNA
LYEEIWSDLPHWDVKGIKDISDQYDKAGFKSWFYGIGGSEAKLDLQFSDTIPIDIFQTLFKWSNQSFEYSSFDHILIPFN
ETKANKKIYLVSYSKQLILEVTVESANYRNIMNDLKNRQSNMPAFSLFSIGSKKEFLLPNKPLTMDKKEFVTESIKTNTF
KQALFSDPSIVREDSNYNNRNVLTDGISRLDVNLSQRQVQFQQRNLVQSTSYQTGELIKKSQKYLEDTGSWTDHYQFFNI
NDSQQLSFYIFMDQIPVINSTAKPFGATSAITVQWANDDILSYKRPNYSLGTNPIKTSETELMGGSEVKMLLSKQTAYDT
DKIDQIFLAYQLVSTSTNDDPLVELEPVWAMKVNGKIVPITKDLLRKEGANSGVE
>Q45612 ~~~yycI~~~Two-component system WalR/WalK regulatory protein YycI~~~COG4853
MEWNKTKSIFIVAFLILDIFLGYQFFQKWQATGKEYEVIKNDVEHDMKADHITYEGLNKEATEGYRITANQKSFSKEEIE
ALKDQKPLMDMPSDDHKVTSLKMKFANPIALSKKDIEDDAQALVSSKIQDGEKYKLWKVDKSKKEIIFFQTYEGHYIYQK
TDNPSNMIGQVVLHLNGKNEVVSYDQTTLETFKQIQKESLITEMDAVELLYYQNQLKEYSTVKSCKFGYVAQYPLTSTQV
LAPVWRITVEYEKKVNGEKKTVQEYFTVNALESTILDTDQ
>C0SP91 3.-.-.-~~~yycJ~~~Putative metallo-hydrolase YycJ~~~COG1235
MSLQFSVLASGSTGNAFYLETEDHAFLVDAGLSGKAMDGLMAQIGRKLDDVDGIFVTHEHSDHIKGLGVVARKYKLPIYA
NEKTWKAMENQIGKIDTDQKFVFPMETVKSFGGLDVESFGVSHDAAEPMFYVFHYSGRKLALMTDTGYVSDRMKGIIRSA
NVFVFESNHDVGMLQMGRYPWSIKRRILSDVGHVSNEDAALAMTDVIGDETSRIYLAHLSQDNNMKELARMSVQQTLASK
GFVTGETFDLYDTDPKKATPLCAV
>O32293 2.3.1.-~~~yycN~~~Uncharacterized N-acetyltransferase YycN~~~COG0456
MTIMLTPMQTEEFRSYLTYTTKHYAEEKVKAGTWLPEDAQLLSKQVFTDLLPRGLETPHHHLWSLKLNEKDIVGWLWIHA
EPEHPQQEAFIYDFGLYEPYRGKGYAKQALAALDQAARSMGIRKLSLHVFAHNQTARKLYEQTGFQETDVVMSKKL
>Q45596 ~~~yydF~~~Putative exported peptide YydF~~~
MKKEITNNETVKNLEFKGLLDESQKLAKVNDLWYFVKSKENRWILGSGH
>Q45594 ~~~yydH~~~Putative peptide zinc metalloprotease protein YydH~~~COG1994
MKILKYEDEKYEVLVQNNVFIKDKKSGEYYKNSLNSLSDKQLLRFKMYKEKVSPKFFYLFLSFTALMFILNYIHLIKLQN
GLSSVFYGWKMWIIIVIYFIMNIVLHELGHIYSLKFFGKNFDKVGFKLNFYVFPAFYVQLNETYMLSRNEKIIVHLFGLF
INYLLINTLELINQFTFSSEALTMAFMLFSSTLLWNLIPILNSDGYKILLAFLSLDEYSRFKTNHWLVLTIQIIGIGLAV
NSVVHWILYIVN
>Q45593 ~~~yydI~~~Probable peptide export ATP-binding protein YydI~~~COG1131
MNIANYTLKVKGKTLLQDTDLHFSSGKINHVVGKNGVGKSQLAKDFLLNNSKRIGRDIRQNVSLISSSSNIPNDVSKDFL
LHFLSKKFDAKMIDKIAYLLNLDNIDGKVLIKNLSDGQKQKLKLLSFLLEDKNIIVLDEITNSLDKKTVIEIHGFLNKYI
QENPEKIIINITHDLSDLKAIEGDYYIFNHQEIQQYHSVDKLIEVYINE
>Q45592 ~~~yydJ~~~Probable peptide export permease protein YydJ~~~
MKLEFKKSISNKVIIILGAMFVFLFLLGYFLLVGIDKVSNVTPEMFFFSSYTVATQFGLMLFSFVIAFFINREYSNKNIL
FYKLIGENIYTFFYKKIAVLFLECFAFITLGLLIISLMYHDFSHFALLLFLFSAVILQYILIIGTISVLCPNILISIGVS
IVYWMTSVILVAISNKTFGFIAPFEAGNTMYPRIERVLQSDNMTLGSNDVLFIILYLVSIIIINAIVLRFSKTRWIKMGL
>Q45591 ~~~yydK~~~Uncharacterized HTH-type transcriptional regulator YydK~~~COG2188
MLKYQQIATEIETYIEEHQLQQGDKLPVLETLMAQFEVSKSTITKSLELLEQKGAIFQVRGSGIFVRKHKRKGYISLLSN
QGFKKDLEDFNVTSKVIELDVRKPTPEAAENLNIGMDEDIYYVKRVRYINGQTLCYEESYYTKSIVTYLNNEIVSHSIFH
YIREGLGLKIGFSDLFLHVGQLNEEEAEYLGLEAGLPKLYIESIFHLTNGQPFDYSKISYNYEQSQFVVQANSFLL
>P94542 ~~~zapA~~~Cell division protein ZapA~~~COG3027
MSDGKKTKTTVDIYGQHFTIVGEESRAHMRYVAGIVDDKMREINEKNPYLDINKLAVLTAVNVVHDYVKLQEKCEKLERQ
LKEKD
>P0ADS2 ~~~zapA~~~Cell division protein ZapA~~~COG3027
MSAQPVDIQIFGRSLRVNCPPDQRDALNQAADDLNQRLQDLKERTRVTNTEQLVFIAALNISYELAQEKAKTRDYAASME
QRIRMLQQTIEQALLEQGRITEKTNQNFE
>Q9HTW3 ~~~zapA~~~Cell division protein ZapA~~~
MSQSNTLTVQILDKEYCINCPDDERANLESAARYLDGKMREIRSSGKVIGADRVAVMAALNITHDLLHRKERLDQESSST
RERVRELLDRVDRALANPADAGEA
>P0AF36 ~~~zapB~~~Cell division protein ZapB~~~COG3074
MTMSLEVFEKLEAKVQQAIDTITLLQMEIEELKEKNNSLSQEVQNAQHQREELERENNHLKEQQNGWQERLQALLGRMEE
V
>P44812 ~~~zapB~~~Cell division protein ZapB~~~COG3074
MSLEILDQLEEKIKQAVETIQLLQLEVEELKEKNAESQRNIENLQTENEQLKNEHRNWQEHIRSLLGKFDNV
>P75862 ~~~zapC~~~Cell division protein ZapC~~~
MRIKPDDNWRWYYDEEHDRMMLDLANGMLFRSRFARKMLTPDAFSPAGFCVDDAALYFSFEEKCRDFNLSKEQKAELVLN
ALVAIRYLKPQMPKSWHFVSHGEMWVPMPGDAACVWLSDTHEQVNLLVVESGENAALCLLAQPCVVIAGRAMQLGDAIKI
MNDRLKPQVNVDSFSLEQAV
>P36680 ~~~zapD~~~Cell division protein ZapD~~~COG4582
MQTQVLFEHPLNEKMRTWLRIEFLIQQLTVNLPIVDHAGALHFFRNVSELLDVFERGEVRTELLKELDRQQRKLQTWIGV
PGVDQSRIEALIQQLKAAGSVLISAPRIGQFLREDRLIALVRQRLSIPGGCCSFDLPTLHIWLHLPQAQRDSQVETWIAS
LNPLTQALTMVLDLIRQSAPFRKQTSLNGFYQDNGGDADLLRLNLSLDSQLYPQISGHKSRFAIRFMPLDTENGQVPERL
DFELACC
>P67693 ~~~zapD~~~Cell division protein ZapD~~~
MHTQVLFEHPLNEKMRTWLRIEFLIQQLSINLPIADHAGALHFFRNISDLLDVFERGEVRTELLKELERQQRKLQAWVEV
PGVDQDRIEALRQQLKSAGSVLISAPRIGQQLREDRLIALVRQRLSIPGGCCSFDLPTLHIWLHLQQAQRDAQIESWLAS
LNPLTQALTLVLDLIRNSAPFRKQTSLNGFYQDNGDDADLLRLMLTLDSQLYPQISGHKSRFAIRFMPLDSENGLVPERL
DFELACC
>Q87LT3 ~~~zapD~~~Cell division protein ZapD~~~COG4582
MTTHKFEHPLNEKTRIYLRVESLLRQAHLASGFADNHQYQLFFRALFDMVEIFEQIQLKSELAKDLEKQRLSYRHWLNVE
GVDQEALNSLLNEIDVVHSQLMGAERFGQALKEDRFLSSIRQRFNLPGGSCCFDLPALHYWLHLPIERKKHDANQWQKSL
KPLSDALTLWLKLARETGHFKAQIARAGFFQSDADEANILRLHIPMKYGVYPMISGHKNRFAIKFMAFENGQACSQDVEF
ELAVCS
>P64612 ~~~zapE~~~Cell division protein ZapE~~~COG1485
MQSVTPTSQYLKALNEGSHQPDDVQKEAVSRLEIIYQELINSTPPAPRTSGLMARVGKLWGKREDTKHTPVRGLYMWGGV
GRGKTWLMDLFYQSLPGERKQRLHFHRFMLRVHEELTALQGQTDPLEIIADRFKAETDVLCFDEFFVSDITDAMLLGGLM
KALFARGITLVATSNIPPDELYRNGLQRARFLPAIDAIKQHCDVMNVDAGVDYRLRTLTQAHLWLSPLHDETRAQMDKLW
LALAGGKRENSPTLEINHRPLATMGVENQTLAVSFTTLCVDARSQHDYIALSRLFHTVMLFDVPVMTRLMESEARRFIAL
VDEFYERHVKLVVSAEVPLYEIYQGDRLKFEFQRCLSRLQEMQSEEYLKREHLAG
>P0ADW3 ~~~zapG~~~Z-ring associated protein G~~~COG3105
MTWEYALIGLVVGIIIGAVAMRFGNRKLRQQQALQYELEKNKAELDEYREELVSHFARSAELLDTMAHDYRQLYQHMAKS
SSSLLPELSAEANPFRNRLAESEASNDQAPVQMPRDYSEGASGLLRTGAKRD
>Q7VLF5 ~~~zapG~~~Z-ring associated protein G~~~COG3105
MEQWTTEIWFSISIAFLIGTLCGVLVMRFFKGNIQQQIQLKSELASAEAKIEEQKQQLERHFEQSANLLENLAEDYKKLY
THFAQNSEQLLPESNQVEFFKRLKNHANGDEDNQPRDYSDGSSGLLKS
>P99173 ~~~~~~Zinc-type alcohol dehydrogenase-like protein SA1988~~~
MKMIGFEKPFKLEEGNLFKVYEQRKPTPENDDILVKVNSISVNPVDTKQRQMKVTQAPRVLGFDAIGTVEAIGPDVTLFS
PGDVVFYAGSPNRQGSNATYQLVSEAIVAKAPHNISANEAVSLPLTGITAYETFFDTFKISHNPSENVGKSVLIINGAGG
VGSIATQIAKRYGLTVITTASRQETTEWCEKMGADIVLNHKEDLVRQFKEKEIPLVDYIFCTYNTDLYYNTMIELIKPLG
HITTIVAFNEDQDLNALKLKSITFTHEFMFARPIHRTPDMIKQHEYLEDITKNIELGHYQPTTTQVFEGLSPENLYQAHQ
LLEKQSMIGKLVINI
>A0R2W4 2.5.1.68~~~uppS~~~(2Z,6E)-farnesyl diphosphate synthase~~~COG0020
MDIIPPRLKEPAYRIYEMRLRHELVRSKAQLPRHIAVLCDGNRRWARDAGYDDVSIGYRKGAAKIAEMLRWCQAAGIEMA
TIYLLSTENLQRDPDELTALIEIITDVVEEICAPYNKWSVRTVGDLELLGDEPARRLREAVESTTTKGANFHVNVAVAYG
GRQEIVDAVRSLLSKELANGATAEQLIEAVTVDGISENLYTSGQPDPDLVIRTSGEQRLSGFLLWQSAYSEMWFTEAYWP
AFRRVDFLRALRDYTARHRRFGK
>P9WFF5 2.5.1.68~~~~~~(2Z,6E)-farnesyl diphosphate synthase~~~COG0020
MEIIPPRLKEPLYRLYELRLRQGLAASKSDLPRHIAVLCDGNRRWARSAGYDDVSYGYRMGAAKIAEMLRWCHEAGIELA
TVYLLSTENLQRDPDELAALIEIITDVVEEICAPANHWSVRTVGDLGLIGEEPARRLRGAVESTPEVASFHVNVAVGYGG
RREIVDAVRALLSKELANGATAEELVDAVTVEGISENLYTSGQPDPDLVIRTSGEQRLSGFLLWQSAYSEMWFTEAHWPA
FRHVDFLRALRDYSARHRSYGR
>Q47SS3 2.5.1.68~~~~~~(2Z,6E)-farnesyl diphosphate synthase~~~COG0020
MGLLRIPLYRLYERRLERALSNAPKPRHVGVILDGNRRWARLSGLSSPKEGHRAGAEKIFELLDWCDEVGVQVVTLWLLS
TDNLARPPEELEPLFEIIENTVRRLCNEGRRVNPMGALDLLPASTAQVMKEAGTTTERNPGLLVNVAVGYGGRREIADAV
RSLLLEEAAKGTTLEELAERLDLDDIAKHLYTRGQPDPDLLIRTSGEQRLSGFLLWQSAHSEFYFCEVFWPAFRKIDFLR
ALRSYSVRQRRFGC
>Q55940 ~~~ziaR~~~Transcriptional repressor SmtB homolog~~~COG0640
MSKSSLSKSQSCQNEEMPLCDQPLVHLEQVRQVQPEVMSLDQAQQMAEFFSALADPSRLRLMSALARQELCVCDLAAAMK
VSESAVSHQLRILRSQRLVKYRRVGRNVYYSLADNHVMNLYREVADHLQESD
>P76344 ~~~zinT~~~Metal-binding protein ZinT~~~COG3443
MAIRLYKLAVALGVFIVSAPAFSHGHHSHGKPLTEVEQKAANGVFDDANVQNRTLSDWDGVWQSVYPLLQSGKLDPVFQK
KADADKTKTFAEIKDYYHKGYATDIEMIGIEDGIVEFHRNNETTSCKYDYDGYKILTYKSGKKGVRYLFECKDPESKAPK
YIQFSDHIIAPRKSSHFHIFMGNDSQQSLLNEMENWPTYYPYQLSSEEVVEEMMSH
>P77173 ~~~zipA~~~Cell division protein ZipA~~~COG3115
MMQDLRLILIIVGAIAIIALLVHGFWTSRKERSSMFRDRPLKRMKSKRDDDSYDEDVEDDEGVGEVRVHRVNHAPANAQE
HEAARPSPQHQYQPPYASAQPRQPVQQPPEAQVPPQHAPHPAQPVQQPAYQPQPEQPLQQPVSPQVAPAPQPVHSAPQPA
QQAFQPAEPVAAPQPEPVAEPAPVMDKPKRKEAVIIMNVAAHHGSELNGELLLNSIQQAGFIFGDMNIYHRHLSPDGSGP
ALFSLANMVKPGTFDPEMKDFTTPGVTIFMQVPSYGDELQNFKLMLQSAQHIADEVGGVVLDDQRRMMTPQKLREYQDII
REVKDANA
>A0A0H3LM39 ~~~~~~Zinc transporter ZIPB~~~COG0428
MNQPSSLAADLRGAWHAQAQSHPLITLGLAASAAGVVLLLVAGIVNALTGENRVHVGYAVLGGAAGFAATALGALMALGL
RAISARTQDAMLGFAAGMMLAASAFSLILPGLDAAGTIVGPGPAAAAVVALGLGLGVLLMLGLDYFTPHEHERTGHQGPE
AARVNRVWLFVLTIILHNLPEGMAIGVSFATGDLRIGLPLTSAIAIQDVPEGLAVALALRAVGLPIGRAVLVAVASGLME
PLGALVGVGISSGFALAYPISMGLAAGAMIFVVSHEVIPETHRNGHETTATVGLMAGFALMMFLDTALG
>P75757 ~~~zitB~~~Zinc transporter ZitB~~~COG1230
MAHSHSHTSSHLPEDNNARRLLYAFGVTAGFMLVEVVGGFLSGSLALLADAGHMLTDTAALLFALLAVQFSRRPPTIRHT
FGWLRLTTLAAFVNAIALVVITILIVWEAIERFRTPRPVEGGMMMAIAVAGLLANILSFWLLHHGSEEKNLNVRAAALHV
LGDLLGSVGAIIAALIIIWTGWTPADPILSILVSLLVLRSAWRLLKDSVNELLEGAPVSLDIAELKRRMCREIPEVRNVH
HVHVWMVGEKPVMTLHVQVIPPHDHDALLDQIQHYLMDHYQIEHATIQMEYQPCHGPDCHLNEGVSGHSHHHH
>A2RNS2 ~~~zitR~~~Transcriptional regulator ZitR~~~COG1846
MSLANQIDQFLGAIMQFAENKHEILLGECESNVKLTSTQEHILMILAAEVSTNARIAEQLKISPAAVTKALKKLQEQELI
KSSRATNDERVVLWSLTEKAIPVAKEHAAHHEKTLSTYQELGDKFTDEEQKVISQFLSVLTEEFR
>Q7BQ71 6.2.1.72~~~zmaJ~~~Zwittermicin A synthase ZmaJ~~~
MSTCIQKLFEEQVVRTPDEVAVIFKKETLTYKELNEKSNQLARLLREGGVGPDTVVGIMVERSIEMVVGIFGILKAGGAY
LPLSPNHPSSRLQFIIEDSGAKLILTQKQILHRFQDSLKADMLALDSISYEGKGENLECINKPSDLVYVIYTSGSTGKPK
GVMIEHSALINRIEWMQEAYPISSKDTILQKTPYTFDVSVWEMFWWAIVGAKVCILAPGMEKFPQAIIETTESNDVTIMH
FVPSMLSAFLHYLDVTGETNRIKSLKQVFVSGEALLSQHINRFNKLLNFSNGTLLTNLYGPTEATIDVTAYDCPTHEITE
GSVPIGRPIKNIEMFVVDKYGNKLPEGHIGELCISGIGLARGYVNRPQLTAEKFVQYSLDTRIYKTGDLALIRSDGNIEF
HGRIDFQVKVNGLRIELGEIESCLMSCEGVLQCAVIVRQESEMVVKLIAFYESENDIELERLKKYLRLFLPDYMIPNSFV
RVNEMPLTDSGKIDRKVLALLGSDKYSHHTTLVGGSVNEESSKDS
>P37617 7.2.2.-~~~zntA~~~Zinc/cadmium/lead-transporting P-type ATPase~~~COG2217
MSTPDNHGKKAPQFAAFKPLTTVQNANDCCCDGACSSTPTLSENVSGTRYSWKVSGMDCAACARKVENAVRQLAGVNQVQ
VLFATEKLVVDADNDIRAQVESALQKAGYSLRDEQAAEEPQASRLKENLPLITLIVMMAISWGLEQFNHPFGQLAFIATT
LVGLYPIARQALRLIKSGSYFAIETLMSVAAIGALFIGATAEAAMVLLLFLIGERLEGWAASRARQGVSALMALKPETAT
RLRKGEREEVAINSLRPGDVIEVAAGGRLPADGKLLSPFASFDESALTGESIPVERATGDKVPAGATSVDRLVTLEVLSE
PGASAIDRILKLIEEAEERRAPIERFIDRFSRIYTPAIMAVALLVTLVPPLLFAASWQEWIYKGLTLLLIGCPCALVIST
PAAITSGLAAAARRGALIKGGAALEQLGRVTQVAFDKTGTLTVGKPRVTAIHPATGISESELLTLAAAVEQGATHPLAQA
IVREAQVAELAIPTAESQRALVGSGIEAQVNGERVLICAAGKHPADAFTGLINELESAGQTVVLVVRNDDVLGVIALQDT
LRADAATAISELNALGVKGVILTGDNPRAAAAIAGELGLEFKAGLLPEDKVKAVTELNQHAPLAMVGDGINDAPAMKAAA
IGIAMGSGTDVALETADAALTHNHLRGLVQMIELARATHANIRQNITIALGLKGIFLVTTLLGMTGLWLAVLADTGATVL
VTANALRLLRRR
>Q3YW59 7.2.2.-~~~zntA~~~Zinc/cadmium/lead-transporting P-type ATPase~~~
MSTPDNHGKKAPQFAAFKPLTTVQNANDCCCDGACSSSPTLSENVSGTRYSWKVSGMDCAACARKVENAVRQLAGVNQVQ
VLFATEKLVVDADNDIRAQVESAVQKAGYSLRDEQAADEPQASRLKENLPLITLIVMMAISWGLEQFNHPFGQLAFIATT
LVGLYPIARQALRLIKSGSYFAIETLMSVAAIGALFIGATAEAAMVLLLFLIGERLEGWAASRARQGVSALMALKPETAT
RLRNGEREEVAINSLRPGDVIEVAAGGRLPADGKLLSPFASFDESALTGESIPVERATGDKVPAGATSVDRLVTLEVLSE
PGASAIDRILKLIEEAEERRAPIERFIDRFSRIYTPAIMAVALLVTLVPPLLFAASWQEWIYKGLTLLLIGCPCALVIST
PAAITSGLAAAARRGALIKGGAALEQLGRVTQVAFDKTGTLTVGKPRVTAIHPATGISESELLTLAAAVEQGATHPLAQA
IVREAQVAELAIPTAESQRALVGSGIEAQVNGERVLICAAGKHPADAFAGLINELESAGQTVVLVVRNDDVLGIIALQDT
LRADAATAISELNALGVKGVILTGDNPRAAAAIAGELGLEFKAGLLPEDKVKAVTKLNQHAPLAMVGDGINDAPAMKAAA
IGIAMGSGTDVALETADAALTHNHLRGLVQMIELARATHANIRQNITIALGLKGIFLVTTLLGMTGLWLAVLADTGATVL
VTANALRLLRRR
>P64423 ~~~zntB~~~Zinc transport protein ZntB~~~COG0598
MEAIKGSDVNVPDAVFAWMLDGRGGVKPLENTDVIDEAHPCWLHLNYVHHDSAQWLATTPLLPNNVRDALAGESTRPRVS
RLGEGTLITLRCINGSTDERPDQLVAMRVYMDGRLIVSTRQRKVLALDDVVSDLEEGTGPTDCGGWLVDVCDALTDHSSE
FIEQLHDKIIDLEDNLLDQQIPPRGFLALLRKQLIVMRRYMAPQRDVYARLASERLPWMSDDQRRRMQDIADRLGRGLDE
IDACIARTGVMADEIAQVMQENLARRTYTMSLMAMVFLPSTFLTGLFGVNLGGIPGGGWQFGFSIFCILLVVLIGGVALW
LHRSKWL
>Q9EYX5 ~~~zntB~~~Zinc transport protein ZntB~~~
MEAIKGSDVNVPDAVFAWLLDGRGGVKPLEDNDVIDSQHPCWLHLNYTHPDSARWLASTPLLPNNVRDALAGESSRPRVS
RMGEGTLITLRCINGSTDERPDQLVAMRLYMDERFIVSTRQRKVLALDDVVSDLQEGTGPVDCGGWLVDVCDALTDHASE
FIEELHDKIIDLEDNLLDQQIPPRGFLALLRKQLIVMRRYMAPQRDVYARLASERLPWMSDDHRRRMQDIADRLGRGLDE
IDACIARTGIMADEIAQVMQESLARRTYTMSLMAMVFLPSTFLTGLFGVNLGGIPGGGWRFGFSLFCILLVVLIGGVTLW
LHRSKWL
>Q87M69 ~~~zntB~~~Zinc transport protein ZntB~~~COG0598
MGFMIEHWDFSTPMATQETTTAEHIQPNHWYHCERLHPDIRSWLEDNHVPRATVDHLLADESRPSFHPLDDDNFMLILRG
INMNENASPEDMLSIRILYFQGALISTRKIPSRAIMEIRQALAEHKGPKSLASLLNQIIEGLNGKIDLYLDTIEETLNEF
DVNDESTYNHIAAQKALISIKRFIRPQQYAIRDLIESESELVTSRPHQYRFAHNNITRINETIEFYLGEVALFQDEIKHN
RDEKTNKNSYLFTLVATIFLPTSFLTGLLGINIGGMPGVESSMAFTWFCIALIVIFGLEWLLFKRLGFTNKTDDE
>P0ACS5 ~~~zntR~~~HTH-type transcriptional regulator ZntR~~~COG0789
MYRIGELAKMAEVTPDTIRYYEKQQMMEHEVRTEGGFRLYTESDLQRLKFIRHARQLGFSLESIRELLSIRIDPEHHTCQ
ESKGIVQERLQEVEARIAELQSMQRSLQRLNDACCGTAHSSVYCSILEALEQGASGVKSGC
>O34966 ~~~znuA~~~High-affinity zinc uptake system protein ZnuA~~~COG0803
MFKKWSGLFVIAACFLLVAACGNSSTKGSADSKGDKLHVVTTFYPMYEFTKQIVKDKGDVDLLIPSSVEPHDWEPTPKDI
ANIQDADLFVYNSEYMETWVPSAEKSMGQGHAVFVNASKGIDLMEGSEEEHEEHDHGEHEHSHAMDPHVWLSPVLAQKEV
KNITAQIVKQDPDNKEYYEKNSKEYIAKLQDLDKLYRTTAKKAEKKEFITQHTAFGYLAKEYGLKQVPIAGLSPDQEPSA
ASLAKLKTYAKEHNVKVIYFEEIASSKVADTLASEIGAKTEVLNTLEGLSKEEQDKGLGYIDIMKQNLDALKDSLLVKS
>P39172 ~~~znuA~~~High-affinity zinc uptake system protein ZnuA~~~COG4531
MLHKKTLLFAALSAALWGGATQAADAAVVASLKPVGFIASAIADGVTETEVLLPDGASEHDYSLRPSDVKRLQNADLVVW
VGPEMEAFMQKPVSKLPGAKQVTIAQLEDVKPLLMKSIHGDDDDHDHAEKSDEDHHHGDFNMHLWLSPEIARATAVAIHG
KLVELMPQSRAKLDANLKDFEAQLASTETQVGNELAPLKGKGYFVFHDAYGYFEKQFGLTPLGHFTVNPEIQPGAQRLHE
IRTQLVEQKATCVFAEPQFRPAVVESVARGTSVRMGTLDPLGTNIKLGKTSYSEFLSQLANQYASCLKGD
>P44526 ~~~znuA~~~High-affinity zinc uptake system protein ZnuA~~~COG4531
MKKLLKISAISAALLSAPMMANADVLASVKPLGFIVSSIADGVTGTQVLVPAGASPHDYNLKLSDIQKVKSADLVVWIGE
DIDSFLDKPISQIERKKVITIADLADVKPLLSKAHHEHFHEDGDHDHDHKHEHKHDHKHDHDHDHDHKHEHKHDHEHHDH
DHHEGLTTNWHVWYSPAISKIVAQKVADKLTAQFPDKKALIAQNLSDFNRTLAEQSEKITAQLANVKDKGFYVFHDAYGY
FNDAYGLKQTGYFTINPLVAPGAKTLAHIKEEIDEHKVNCLFAEPQFTPKVIESLAKNTKVNVGQLDPIGDKVTLGKNSY
ATFLQSTADSYMECLAK
>A1B9L0 ~~~znuA~~~High-affinity zinc uptake system protein ZnuA~~~COG4531
MIRPSSLVLAAALGTAALPARAEVPRVVTDIPVVGAMVQQVMGDLGQPEILLQAGGDPHSYQLRPSQARSLQDADLLIWV
GPELTPWLERAATSLSAQSEMLALLDLPATHRRDYAGGEHEHEHEHEHEHEHEHEHDGHGHAEEQAHHDHDHSGTDPHAW
LDPANGQAWLAGIAETLSRHDPDNAGVYAANAAKAADEIAALDEELKSALTPTQGKRFVVFHDAYGYFTEHYGLEPAIAV
SLGDASTPSAARLRAIRDEIAEEGAVCAFPEANHDPKLIAAVIEGSEIRQGAALDPEGTGATPGAGLYAELLRGMGQALA
DCLGAD
>O34610 ~~~znuB~~~High-affinity zinc uptake system membrane protein ZnuB~~~COG1108
MEMFDLEFMRRAFLAGGMIAVMAPILGVYLVLRRQALMADTLSHISLSGVAIGFFLSTNITAASIVVVTIGAIGIEYMRR
AYRTYSEVSIAILMAAGLSFAMFLISLSKGTANMSIDQYLFGSLVTVNQQQVYIISIITLLILLYFIVLRRPLYLLTFDE
ATAKTSGINTNVLSLSFSIVTGLAISVIIPIIGVLLVSALLVLPAAFAIRIAKGFNMVFITAILISLFSVFTGLTSSYQL
GTPPGPSITLLLIVLLLIGFAVQGVWTFIKKEAQRKKRSR
>P39832 ~~~znuB~~~High-affinity zinc uptake system membrane protein ZnuB~~~COG1108
MIELLFPGWLAGIMLACAAGPLGSFVVWRRMSYFGDTLAHASLLGVAFGLLLDVNPFYAVIAVTLLLAGGLVWLEKRPQL
AIDTLLGIMAHSALSLGLVVVSLMSNIRVDLMAYLFGDLLAVTPEDLISIAIGVVIVVAILFWQWRNLLSMTISPDLAFV
DGVKLQRVKLLLMLVTALTIGVAMKFVGALIITSLLIIPAATARRFARTPEQMAGVAVLVGMVAVTGGLTFSAVYDTPAG
PSVVLCAALLFILSMMKKQAS
>O34946 7.2.2.20~~~znuC~~~High-affinity zinc uptake system ATP-binding protein ZnuC~~~COG1121
MNLVSLKDIVFGYSHTPVLDKVSLDIESGEFVGITGPNGASKSTLIKVMLGMLKPWEGTVTISKRNTEGKRLTIGYVPQQ
ISSFNAGFPSTVLELVQSGRYTKGKWFKRLNEEDHLEVEKALKMVEMWDLRHRKIGDLSGGQKQKICIARMLASNPDLLM
LDEPTTAVDYDSRKGFYEFMHHLVKNHNRTVVMVTHEQNEVQQFLDKVIRLERGEKGGWKCLTWNSCDELF
>P0A9X1 7.2.2.20~~~znuC~~~Zinc import ATP-binding protein ZnuC~~~COG1121
MTSLVSLENVSVSFGQRRVLSDVSLELKPGKILTLLGPNGAGKSTLVRVVLGLVTPDEGVIKRNGKLRIGYVPQKLYLDT
TLPLTVNRFLRLRPGTHKEDILPALKRVQAGHLINAPMQKLSGGETQRVLLARALLNRPQLLVLDEPTQGVDVNGQVALY
DLIDQLRRELDCGVLMVSHDLHLVMAKTDEVLCLNHHICCSGTPEVVSLHPEFISMFGPRGAEQLGIYRHHHNHRHDLQG
RIVLRRGNDRS
>Q8ZNV7 7.2.2.20~~~znuC~~~Zinc import ATP-binding protein ZnuC~~~
MTSLVSLENVSVSFGQRRVLSDVSLELSPGKILTLLGPNGAGKSTLVRVVLGLVAPDEGVIKHNGQLRIGYVPQKLYLDT
TLPLTVNRFLRLRPGTQKTDILPALKRVQAGHLIDAPMQKLSGGETQRVLLARALLNRPQLLVLDEPTQGVDVNGQVALY
DLIDQLRRELDCAVLMVSHDLHLVMAKTDEVLCLNHHICCSGAPEVVSMHPEFISMFGPRGAEQLGIYRHHHNHRHDLQG
RIVLRRGNGHS
>A7ZI10 ~~~zorA~~~Zorya protein ZorA~~~
MSWLNSVLLALTSVQPYMVPATVIGLVSFAFLCFIFFYFFRAVKIINGLKKYTQSINGIENNEPGNQLQHLQSLFVQPEL
KHAWNEFEESLHSQYELEDGEEKIVRIRATAPSASFFSEQQLVDIPLNTEFFKHLPGILTGVGIIGTFYGLMIGLNHFDP
STPEQVSSSVNNLLRDVLYAFLGSAFAITFSILITWLEKFCLAKCYKYLEKFTAALDALYDSGVGEEYLASLVKSSNESA
TQARHLKESLVTDLRDMLLHLANSQKVENERLATTLSTTYRETGQQFAEQVSGAIENSLKSPLDKIAGAVQTASGDQSGM
VQNMLQDVLTAFMAKLDTTFGQQFTNLNEMMGQTVGAIQTMQTGFGALLQDMRQVSDDSRQGSAQLIEQLLSEMKSGQQA
MQAGMNDMLTSLQTSVAKIGAEGEGAGERMARQLEKMFADSEAREKAQAEHMTAFIEAIQNSVQQGQSATMEKMAASVES
LGEQLGSLFGQIDKGQQQISANQQANQQSLHEQTQRVMSEVDDQIKQLVETVASQHQGTTETLRLLAEQTNRQIQDMHTG
ADKMRLAAERFEHAGDRVSEANHLTADVLNKAQSAGSSLSLATSELTSVVADYRNNREAVSKSIAMLELLAANTQSEQTT
RTQFIADLKQHGERLQSYNREAQAFMENVSDVLGKGFEDFSEGVSRSLDKTLGKLDVEMAKASTLLAGSVEQIGESVSEL
DDVLSRVRA
>A7ZI09 ~~~zorB~~~Zorya protein ZorB~~~
MFGNAFGVKKRRSDEAEKPFWISYADLMTAMMVLFLVVMVASLSSVTQRIQRAEQGEKTRGQDISRLCERLELHARNVNK
TIVVDCHDNRISFGEAGRFDHNQFFLNAEGQKALQDVVPLVLEASNSEEGKKWFKQIVIEGFTDTDGSYLYNLHLSLQRS
EWVMCSLLDSRSPLQKNISAEQQLQIRKLFLAGGVSFNNAKESKEASRRVELRMQFFGLKDKRDKADEVDFPPVVNKEVC
QLVMPL
>A7ZI08 ~~~zorC~~~Zorya protein ZorC~~~
MTPALNSLSQRIAARLSSSQRDDHYLHNDFHALASAALDMEKRFDKAEKIPLPPQKMRLAALRRLRLAQELTEREWRMVF
YGLADNDPSYPDQPVLLEDDTFFPEVNNAIKKRLETKTLKRRDWAALCSSYFAYQNPSPETNPHWCVLRGHIAQGYMVVK
AAIRREKSWMKTIEFYHDIFTPQAGGVISRQLLAGESNSLSSLEKIAQIPDSSWLWKRIFTVLLAQLDTLDDPQFLDKIS
WLLGLAAQWVRFRDDIMTATLTRYYHSIYRDQAHSALKQAALEYWDNPQLKSQQNKWHQYVSEPVAAMVRGWLAKQDLMH
FFELLRGNGDVDQARLHYWLRFANQMGFTRIVMGTDAWQDRGSDFVKFREENKGRLSYLRGGRNFDNAMIMQINDYLFVE
FSGTGNAMYAYRIGHAPFNPESRTLDINIHLKDKGRCVLRLPHTPRAEGYNKVRITGWMLKYDDELRQLGIRWMAEEAIK
FVDKKASSPASMSDIKIINPLRDTAIQHLVEGSSCIVSDNRQKGGVLSVQLNTPDDTIERELLRLGFAPVAKEPHRYWIK
>A7ZI07 ~~~zorD~~~Zorya protein ZorD~~~
MLKRLLSKLTGNRQQIEHHLKNQYQVEENGLSFPLSLVDDSQLWALASWLEQLAEEDYLISLTDRWLLSWDALYRLLEDE
EHASSLPLIGVPDVLPLRASLSSRGALSDSDFRVWIAEWATLPARKPIRFSRTGAILTHENQQYLLSRENWALLQATEQL
SAQKNQTPGETTNQLGWAAIRKCAKQAAAKFDDYLEKTHVVKPTSLSLRLRKATVADTAVIEIEPHFEDQPANWLGSFDK
NSQVHDSYRIPGENGELSHVIIPPEVKEVLNSIHSIPSRRVAGSEALSFVRNPYTFLGEDAASVIAPEEHEQALFDARIF
FHHFRLIPQLNAENKIAEVTLVLEPVSPVPQPEITFGFSAPRELDKFIQQLGISVAAQMPAGSWQGYELELSQFTEQQWH
DCQALLTRWQQEIEGKEFSDVLDIAKYGDRVIGIGEFEKISSPWLTKAQSENWLPDDIDFSAFSVETLSGWQPENLHHFD
ELQERITQAEAVGETHITAPWNDSQLPLDAAKTFSKNWEKQQSTANESQGNVADKTARAVLKIEQNIEETAYIKQRRNSL
LNARHAEPEIPLSLKEHIRLKDHQREGVAWLQQLFLRSPEETAGCLLADDMGLGKTLQILSFLVWFIEKFPQEPPSLIVA
PVSLLDNWERELDNFFYTAGIPVLKLYGETIKAVKYPKQAIPAHLQSQGIKNLLKPGWQGEAKIILTTYETLRDQEFSLA
RQPWSIMVCDEAQKIKNPAALITHAANAVQARFKVACTGTPVENTLVDLWSLFDFAQPGLLGALNEFGKHYVRPIENEDG
RDTERLESLRALIEPQTLRRTKEEVARDLPQKIEVESCKQLTLSGVQKQLYLSSVANWQQQQALSEGMQQAGTGMLGLLH
RLKLICAHPAVVNPEPRFRDNSPKLNWLLKILAELKHTTKDKVIIFTELRDLQRELQHAIHQKFGFRPVIINGDTSTKSQ
SQNSRQRLIDDFQAQPGFGVIILSTVAVGFGVNVQKANHVIHFTRCWNPAKEDQATDRAYRIGQTKDVYVYYPTVKDTEI
TTFEETLDDLLQRRRALARDMLCATPDLSGADFEAILKGA
>Q8X3T7 ~~~zorO~~~Small toxic protein ZorO~~~
MDTLTQKLTVLIAVLELLVALLRLIDLLK
>Q8X3T6 ~~~zorP~~~Small toxic protein ZorP~~~
MDSLTQKLTVLIAVLELLVALLRLIDLLK
>O31688 7.2.2.20~~~zosA~~~Zinc-transporting ATPase~~~COG2217
MNEQVIVQRDPHEPLKTDKREKNWAQHAELIAALVSGALILAGWLLSGYQVLSIILFLLAFVIGGFAKAKEGIEETLESK
TLNVELLMIFAAIGSALIGYWAEGAILIFIFSLSGALETYTMNKSSRDLTSLMQLEPEEATLMVNGETKRVPVSDLQAGD
MIVIKPGERVAADGIIESGSTSLDESALTGESMPVEKNTGDTVFTGTVNRNGSLTVRVTKANEDSLFRKIIKLVESAQNS
VSPAQAFIERFENAYVKGVLIAVALLLFVPHFALGWSWSETFYRAMVFMVVASPCALVASIMPAALSLISNGARNGMLVK
GSVFLEQLGSVQMIAFDKTGTVTKGQPAVETIRIAEGFSEAEVLEAVYAIETQSSHPLAQAITAYAESRGVNQSGYISIE
ETSGFGVMAEVSGAKWKVGKAGFIGEEMAAQFMKQTASDVIQSGHTIVFVKKDDQIAGCIALKDQIRPEAKEVMEELNRL
GIKTAMLTGDHEDTAQAIAKEAGMTTVVAECLPDQKVNEIKRLKEEFGTIAMVGDGINDAPALKAADVGIAMGGGTDVAL
ETADMVLMKNDLKKLVNMCRLSRKMNRIIKQNIVFSLAVICLLICANFLQAMELPFGVIGHEGSTILVILNGLRLLK
>P38442 ~~~zot~~~Zona occludens toxin~~~COG4128
MSIFIHHGAPGSYKTSGALWLRLLPAIKSGRHIITNVRGLNLERMAKYLKMDVSDISIEFIDTDHPDGRLTMARFWHWAR
KDAFLFIDECGRIWPPRLTVTNLKALDTPPDLVAEDRPESFEVAFDMHRHHGWDICLTTPNIAKVHNMIREAAEIGYRHF
NRATVGLGAKFTLTTHDAANSGQMDSHALTRQVKKIPSPIFKMYASTTTGKARDTMAGTALWKDRKILFLFGMVFLMFSY
SFYGLHDNPIFTGGNDATIESEQSEPQSKATVGNAVGSKAVAPASFGFCIGRLCVQDGFVTVGDERYRLVDNLDIPYRGL
WATGHHIYKDTLTVFFETESGSVPTELFASSYRYKVLPLPDFNHFVVFDTFAAQALWVEVKRGLPIKTENDKKGLNSIF
>P0AAA9 ~~~zraP~~~Zinc resistance-associated protein~~~COG3678
MKRNTKIALVMMALSAMAMGSTSAFAHGGHGMWQQNAAPLTSEQQTAWQKIHNDFYAQSSALQQQLVTKRYEYNALLAAN
PPDSSKINAVAKEMENLRQSLDELRVKRDIAMAEAGIPRGAGMGMGYGGCGGGGHMGMGHW
>Q9L9I0 ~~~zraP~~~Zinc resistance-associated protein~~~
MKRNNKSAIALIALSLLALSSGAAFAGHHWGNNDGMWQQGGSPLTTEQQATAQKIYDDYYTQTSALRQQLISKRYEYNAL
LTASSPDTAKINAVAKEMESLGQKLDEQRVKRDVAMAQAGIPRGAGMGYGGCGGYGGGYHRGGGHMGMGNW
>P14375 ~~~zraR~~~Transcriptional regulatory protein ZraR~~~COG2204
MTHDNIDILVVDDDISHCTILQALLRGWGYNVALANSGRQALEQVREQVFDLVLCDVRMAEMDGIATLKEIKALNPAIPV
LIMTAYSSVETAVEALKTGALDYLIKPLDFDNLQATLEKALAHTHSIDAETPAVTASQFGMVGKSPAMQHLLSEIALVAP
SEATVLIHGDSGTGKELVARAIHASSARSEKPLVTLNCAALNESLLESELFGHEKGAFTGADKRREGRFVEADGGTLFLD
EIGDISPMMQVRLLRAIQEREVQRVGSNQIISVDVRLIAATHRDLAAEVNAGRFRQDLYYRLNVVAIEVPSLRQRREDIP
LLAGHFLQRFAERNRKAVKGFTPQAMDLLIHYDWPGNIRELENAVERAVVLLTGEYISERELPLAIASTPIPLGQSQDIQ
PLVEVEKEVILAALEKTGGNKTEAARQLGITRKTLLAKLSR
>P25852 ~~~zraR~~~Transcriptional regulatory protein ZraR~~~
MIRGKIDILVVDDDVSHCTILQALLRGWGYNVALAYSGHDALAQVREKVFDLVLCDVRMAEMDGIATLKEIKALNPAIPI
LIMTAFSSVETAVEALKAGALDYLIKPLDFDRLQETLEKALAHTRETGAELPSASAAQFGMIGSSPAMQHLLNEIAMVAP
SDATVLIHGDSGTGKELVARALHACSARSDRPLVTLNCAALNESLLESELFGHEKGAFTGADKRREGRFVEADGGTLFLD
EIGDISPLMQVRLLRAIQEREVQRVGSNQTISVDVRLIAATHRDLAEEVSAGRFRQDLYYRLNVVAIEMPSLRQRREDIP
LLADHFLRRFAERNRKVVKGFTPQAMDLLIHYDWPGNIRELENAIERAVVLLTGEYISERELPLAIAATPIKTEYSGEIQ
PLVDVEKEVILAALEKTGGNKTEAARQLGITRKTLLAKLSR
>P14377 2.7.13.3~~~zraS~~~Sensor protein ZraS~~~COG4191
MRFMQRSKDSLAKWLSAILPVVIVGLVGLFAVTVIRDYGRASEADRQALLEKGNVLIRALESGSRVGMGMRMHHVQQQAL
LEEMAGQPGVLWFAVTDAQGIIILHSDPDKVGRALYSPDEMQKLKPEENSRWRLLGKTETTPALEVYRLFQPMSAPWRHG
MHNMPRCNGKAVPQVDAQQAIFIAVDASDLVATQSGEKRNTLIILFALATVLLASVLSFFWYRRYLRSRQLLQDEMKRKE
KLVALGHLAAGVAHEIRNPLSSIKGLAKYFAERAPAGGEAHQLAQVMAKEADRLNRVVSELLELVKPTHLALQAVDLNTL
INHSLQLVSQDANSREIQLRFTANDTLPEIQADPDRLTQVLLNLYLNAIQAIGQHGVISVTASESGAGVKISVTDSGKGI
AADQLDAIFTPYFTTKAEGTGLGLAVVHNIVEQHGGTIQVASQEGKGSTFTLWLPVNITRKDPQG
>P37461 2.7.13.3~~~zraS~~~Sensor protein ZraS~~~
MSFIRLHKDAAATWLSRLLPAAIFILVGLFSIMVIRDYGRESAAARQTLLEKGNVLIRALESGTRVGMGMRMHHAQQQTL
LEEMAGQPGVLWFAVTDAQGVIITHSNPGMVGKSLYSPSEMHQLNPGPQERWRRVDVAANGETVPALEIYRQFQPLFGMR
GHGMRGHGMARSANDDEPAKQTIFIAFDASELAATQAREWRNTLIVLSALAAVLLATLLAFFWHQRYQRSHRELLDAMKR
KEKLVAMGHLAAGVAHEIRNPLSSIKGLAKYFAERTPAGGESHELAQVMAKEADRLNRVVSELLELVKPAHLTLQTVNLN
DIITHSLNLVSQDAQSREIQLRFTANETLKRIQADPDRLTQVLLNLYLNAIHAIGRQGTISVEAKESGTDRVIITVTDSG
KGIAPDQLEAIFTPYFTTKADGTGLGLAVVQNIIEQHGGAIKVKSIEGKGAVFTIWLPVIARQQD
>Q54944 2.7.1.176~~~~~~Toxin zeta~~~
MANIVNFTDKQFENRLNDNLEELIQGKKAVESPTAFLLGGQPGSGKTSLRSAIFEETQGNVIVIDNDTFKQQHPNFDELV
KLYEKDVVKHVTPYSNRMTEAIISRLSDQGYNLVIEGTGRTTDVPIQTATMLQAKGYETKMYVMAVPKINSYLGTIERYE
TMYADDPMTARATPKQAHDIVVKNLPTNLETLHKTGLFSDIRLYNREGVKLYSSLETPSISPKETLEKELNRKVSGKEIQ
PTLERIEQKMVLNKHQETPEFKAIQQKLESLQPPTPPIPKTPKLPGI
>P0A8H3 ~~~zupT~~~Zinc transporter ZupT~~~COG0428
MSVPLILTILAGAATFIGAFLGVLGQKPSNRLLAFSLGFAAGIMLLISLMEMLPAALAAEGMSPVLGYGMFIFGLLGYFG
LDRMLPHAHPQDLMQKSVQPLPKSIKRTAILLTLGISLHNFPEGIATFVTASSNLELGFGIALAVALHNIPEGLAVAGPV
YAATGSKRTAILWAGISGLAEILGGVLAWLILGSMISPVVMAAIMAAVAGIMVALSVDELMPLAKEIDPNNNPSYGVLCG
MSVMGFSLVLLQTAGIG
>P67470 ~~~zupT~~~Zinc transporter ZupT~~~
MSVPLILTLLAGAATFIGAFLGVLGQKPSNRVLAFSLGFAAGIMLLISLMEMLPAALDTEGMSPVLGYGMFIIGLLGYFG
LDRLLPHAHPQDLVQKRQQPLPGSIKRTAILLTLGISLHNFPEGIATFVTASSNLELGFGIALAVALHNIPEGLAVAGPV
YAATGSKRTAIFWAGISGMAEILGGVLAWLILGSLVSPIVMAAIMAAVAGIMVALSVDELMPLAKEIDPNNNPSYGVLCG
MSIMGLSLVILQTIGIG
>P54479 ~~~zur~~~Zinc-specific metallo-regulatory protein~~~COG0735
MNVQEALNLLKENGYKYTNKREDMLQLFADSDRYLTAKNVLSALNDDYPGLSFDTIYRNLSLYEELGILETTELSGEKLF
RFKCSFTHHHHHFICLACGKTKEIESCPMDKLCDDLDGYQVSGHKFEIYGTCPDCTAENQENTTA
>P0AC51 ~~~zur~~~Zinc uptake regulation protein~~~COG0735
MEKTTTQELLAQAEKICAQRNVRLTPQRLEVLRLMSLQDGAISAYDLLDLLREAEPQAKPPTVYRALDFLLEQGFVHKVE
STNSYVLCHLFDQPTHTSAMFICDRCGAVKEECAEGVEDIMHTLAAKMGFALRHNVIEAHGLCAACVEVEACRHPEQCQH
DHSVQVKKKPR
>P9WN85 ~~~zur~~~Zinc uptake regulation protein~~~COG0735
MSAAGVRSTRQRAAISTLLETLDDFRSAQELHDELRRRGENIGLTTVYRTLQSMASSGLVDTLHTDTGESVYRRCSEHHH
HHLVCRSCGSTIEVGDHEVEAWAAEVATKHGFSDVSHTIEIFGTCSDCRS
